DATASETS

Defense Gene                              Non-Defense Gene
PR10          Non-PR10      Total         Pea           Non-Pea       Total
No.a  Sizeb   No.   Size    No.   Size    No.   Size    No.   Size    No.   Size
11    8 kb    38    43 kb   49    51 kb   29    26 kb   137   116 kb  166   142 kb

Note: "a" is the number of genes in the dataset; "b" is the size of the dataset in kilobases (kb).
 
 

SEQUENCE LANDSCAPES

[Clift et al., (1986) NAR 14:141-158; Levy et al., (1998) Bioinformatics 14:74-80]

As a simple example, take the target sequence AGCT and the source sequence (dataset) AGCTAGAG. XLAND generates a result like this:
 
possible pattern  A  G  C  T  AG  GC  CT  AGC  GCT  AGCT
frequency         3  3  1  1  3   1   1   1    1    1

These frequencies are used to plot the sequence landscape. A peak in the landscape means a conserved DNA sequence in the dataset. The higher the peak, the longer the conserved sequence.
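
The frequencies in the AGCT / AGCTAGAG example can be reproduced with a short sketch. This is not XLAND's own code. It is a minimal illustration that assumes every contiguous substring of the target is counted in the source and that overlapping occurrences count separately. The function name landscape_counts is hypothetical.

```python
def landscape_counts(target, source):
    """Count occurrences in `source` of every contiguous substring of `target`.

    Illustrative sketch only (not XLAND itself); overlapping matches are
    counted separately.
    """
    counts = {}
    for length in range(1, len(target) + 1):
        for start in range(len(target) - length + 1):
            word = target[start:start + length]
            # Slide a window of the same length along the source sequence.
            counts[word] = sum(
                source[i:i + length] == word
                for i in range(len(source) - length + 1)
            )
    return counts


counts = landscape_counts("AGCT", "AGCTAGAG")
for word, n in counts.items():
    print(word, n)
```

For the toy sequences this gives 3 for A, G and AG and 1 for every other pattern, in agreement with the frequencies listed in the example.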

Step 1: Use each of the three DNA binding domains (PDA1, PDA2 and PDC1) as a target sequence to scan each of the 49 defense genes for shared motifs.

Example: short segment from the sequence landscapes of PDA2 against potato PR10 (STH-21) and Arabidopsis hsr203J.