ID HELICOPY1 repbase; DNA; CEL; 2595 BP. XX AC . XX DT 30-MAY-2000 (Rel. 4.1, Created) DT 30-MAY-2000 (Rel. 4.1, Last updated, Version 1) XX DE HELICOPY1 is a nonautonomous DNA transposon - a consensus. XX KW nonautonomous DNA transposon; HELICOP superfamily; RCA1; RCC9; KW RCD1; RC35; HELICOPY1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2595 RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and La Volpe A.; RT "Molecular and genomic organization of clusters of repqetitive RT DNA sequences in Caenorhabditis elegans."; RL J. Mol. Biol 226(1), 159-168 (1992). XX RN [2] RP 1-2595 RA V V. and Jurka J.; RT "HELICOPY1."; RL Direct Submission to Repbase Update (MAY-2000). XX DR [2] (Consensus) XX CC There are about 50 copies of HELICOPY1 in the C.elegans genome. CC They are ~97% identical to the consensus sequence. CC Several portions of HELICOPY1 have been reported previosly CC as the RCA1, RCD1, RC35 and RCC9 minisatellite like repeats [1]. CC The authors [1] have noticed a coclustering of these repeats but CC did not find a right explanation to it. XX SQ Sequence 2595 BP; 933 A; 349 C; 534 G; 777 T; 2 other; atcattamyt atatataaaa aaatacagtt cgtctgtcca tagtttgtag tctatgtagt 60 ctttgtagtc tgtgacgtca cacccaaatg tcagtgagaa ttgtgggcgg ggcctgtcac 120 ctacccttcg tggtgagacc catcgtggtg agacccttcg tggtgagacc catcgtggtg 180 agacccatcg tggtgagacc cttcgtggtg agacccatcg tggtgagacc cttcgtggtg 240 agacccacat ttttggcggg aaattcaaat tttcagtgaa aaaattttgg cgggaaattc 300 aaattttcag tgaaaaaatt ttggcgggaa attcaaattt tcagtgaaaa aaattttggc 360 gggaaattca aattttcagt gaaaaaaatt ttggcgggaa attcaaattt tcagtgaaaa 420 aattttggcg ggaaattcaa attttcagtg aaaaaatttt ggcgggaaat tcaaattttc 480 agtgaaaaaa ttttggcggg aaattcaaat tttcagtgaa aaaaattttg gcgggaaatt 540 caaattttca gtgaaaaaaa ttttggcggg aaattcaaat tttcagtgaa aaaaattttg 600 gcgggaaatt caaattttca gtgaaaaaaa ttttggcggg aaattcaaat tttcagtgaa 660 aaaaattttg gcgggaaatt caaattttca gtgaaaaaaa ttttggcggg aaattcaaat 720 tttcagtgaa aaaaattttg gcgggaaatt caaattttca gtgaaaaaaa ttttggcggg 780 aaattcaaat tttcagtgaa aaaaattttg gcgggaaatt caaattttca gtgaaaaaaa 840 ttttggcggg aaattcaaat tttcagtgaa aaaaattttg gcgggaaatt caaattttca 900 gtgaaaaaaa ttttggcggg aaattcaaat tttcagtgaa aaaaattttg gcgggaaatt 960 caaattttca gtgaaaaaaa ttttggcggg aaattcaaat tttcagtgaa aaaaattttg 1020 gcgggaaatt caaattttca gtgaaaaaaa ttttggcggg aaattcaaat tttcagtgaa 1080 aaaaattttg gcgggaaatt caaattttca gtgaaaaaaa ttttggcggg aaattcaaat 1140 tttcagtgaa aaaaattttg gcgggaaatt caaattttca gtgaaaaaaa ttttggcggg 1200 aaattcaaat tttcagtgaa aaaaattttg gcgggaaatt caaattttca gtgaaaaaaa 1260 ttttggcggg aaattcaaat tttcagtgaa aaaaattttg gcgggaaatt caaattttca 1320 gtgaaaaaaa ttttggcggg aaattcaaat tttcagtgaa aaacaatttt ggcggaaaat 1380 tccaatttct gagaaaaatc gagaaatgtc tgcaatgttc cagaagtttc tagaaaattc 1440 gagaaaattc cggaatggtc cagaattttc tagaaaattc gggaaaactc tggaatattc 1500 cagaactttc tagaaaattc gagaaaattc tggaatgttc cagaattttc tagaaaattc 1560 gagaaaactc tggaatgttc cagaactttc tagaaaattc tagaaagttc tggaatggtc 1620 cagaattttc tagaaaattc gggaaaactc tggaatattc cagaactttc tagaaaattc 1680 gagaaaattc tgcaatgttc cagaattttc tagaaaattc gggaaaattc tggaatattc 1740 cagaactttc tagaaaaatc gggaaaagtc ggcaatgttc tagaactttc tagaaaattc 1800 gagaaaattc cggaatggtc cagaattttc tagaaaattc tagaaagttc tggaatggtc 1860 cagaagtttc tagaaaattc gagaaaattc cggaatggtc cagaattttc tagaaaatcc 1920 gagaaagttc tggaatggtc cagaattttc tagaaaattc gggaaaattc tggaatattc 1980 cagaactttc tagaaaaatc gggaaaagtc tgcaatgttc cagaactctc taaaaaattc 2040 gataaaattc tgaaatgttc aatttcgtgg aaaaattcaa gaaacttctg caaagttcta 2100 cacggggttc tggctcgacc ccgcgctcca ttggacactg aaatggcggg aaaaaacttt 2160 gaaatcgcaa gaggaatttt cacgcagcgc gttgaaaaaa gtgtatgcat ttgcgcgtga 2220 cggtgtttct tcaagttttg atactcctag aatattctga agtttcaata attttaaaaa 2280 aatatcaaaa attgttataa ttttcagaaa cgctcgccaa aaactttcgg aaaggaccgg 2340 aaaatgtttc atatgatttg aagaacgtag agaaaattat acgttcaaga aatttagaaa 2400 tagttccagg aaacttgagg ttttaagttg taattgctca gaacttagtt atataatatt 2460 tatgttgctc gaaaatttcc ggtaaccagg gaaaaaacgt tccatctgca aagaaggctt 2520 agaaaattta gaatttgatt tcaattttga gtacgccagt cggagcacgc gcttcagcgc 2580 gtgcgaacgg ctggt 2595 // ID HELICOPY4 repbase; DNA; CEL; 1857 BP. XX AC . XX DT 22-DEC-2000 (Rel. 4.2, Created) DT 22-DEC-2000 (Rel. 4.2, Last updated, Version 1) XX DE HELICOPY4 is a non-autonomous DNA transposon - a consensus. XX KW nonautonomous DNA transposon; HELITRON superfamily; CERP1; KW HELICOPY4. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1686-1465 RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and Volpe A.; RT "Molecular and genomic organization of clusters of repetitive DNA RT sequences in Caenorhabditis elegans."; RL J. Mol. Biol 226, 159-168 (1992). XX RN [2] RP 1-1857 RA V V. and Jurka J.; RT "HELICOPY4."; RL Direct Submission to Repbase Update (AUG-2000). XX DR [2] (Consensus) XX CC There are about 100 copies of HELICOPY4 in the C.elegans genome. CC They are ~97% identical to the consensus sequence. CC This family is a non-autonomous HELICOP/HELITRON-like DNA CC transposon. CC 210-bp 5'-termini of HELICOPY4 and HELICOP2 are 92% identical. CC 40-bp 3'-termini of these elements are 90% identical. CC CERP1, a satellite-like repeat identified previosly [1] is a CC portion of HELICOPY4 (position 1686-1465). XX SQ Sequence 1857 BP; 599 A; 266 C; 430 G; 562 T; 0 other; atcattactt atatataaaa aaataattcc gtttgttcca tagtttgtag tctatgtagt 60 ctttgtagtc tgtgacgtca cacccaaagt cactgagaat tgtgggcggg gtctgtcatc 120 cttcgtggtg agacccatcg tggcgagacc catcgtggtg agacccatcg tggcgagacc 180 catcgtggtg agacccatcg tggcgagacc cgtcgtggcg agacccatcg tggtgagacc 240 catcgtggtg agacccatcg tggcgagacc catcgtggtg agacccatcg tggtgagacc 300 catcgtggtg agacccaccg tggtgagacc cgtcgtggta agacccaaaa ttttggcggg 360 aaatttaaat tttctgtgaa aaatattttg gcgggaaatt taaattttct gtgaaaaata 420 ttttggcggg aaatttaaat tttctgtgaa aaatattttg gcgggaaatt taaattttct 480 gagaaaaata ttttggcggg aaatttaaat tttctgtgaa aaatattttg gcgggaaatt 540 taaattttct gtgaaaaata ttttggcggg aaatttaaat tttctgagaa aaatattttg 600 gcgggaaatt taaattttct gtgaaaaata tttttgcggg aaatttaaat tttctgaaaa 660 ttctaaaatt ctggaaatct agaatcttct ggaaatttcg aaaaaactag aaaaattggg 720 aaaagtctgg aatgttccag aactttctag aaaaatcgag aaaattctgg aatgttccag 780 aactttctag aaaaatcgag aaaattctgg aatgttccag aactttctag aaaaatcgag 840 aaaattctgg aatgttccag aactttctag aaaaattggg aaaattctgg aatgttccag 900 aactttctag aaaaatcggg aaaattctgg aatgttccag aactttctag aaaaattggg 960 aaatttctgg aatgttccag aactttctag aaaaatcgag aaaattctgg aatgttccag 1020 aactttctag aaaaatcggg aaaattctgg aatgttccag aactttctag aaaaatcggg 1080 aaaattctgg aatgttccag aactttctag aaaaattggg aaaattctgg aatgttccag 1140 aactttctag aaaaattggg aaaagtctgg aatgttccag aactttctag aaaaatcggg 1200 aaaagtctgg aatgttccag aactttctag aaaaatcgag aaaattctgg aatgttccag 1260 aactttctag aaaaatcgag aaaattctgg aatgttccag aactttctag aaaaatcgag 1320 aaaattctgg aatgttccag aactttctag aaaaatcgag aaaattctgg aatgttccag 1380 aactttctag aaaaatcgag aaaattctgg aatgttccga aaaattgagc ttagagcttt 1440 agaagaggta gttatttggg agttgatggg ggatcaagtc aaggtactgt agtggtacta 1500 taggggtact gtaggtatac ggtagggtta ctgtagtttt ggaaaaattg gcatttcgtc 1560 ctttgaagag gtattgggtt aggagtcggt ggaggataat gtcaaggtac tgtagtggta 1620 ttgtaaggtt actgtcttgg tcaaaaagta acagaaagtt ttcatactgt ctgtgaattt 1680 ttgaaacatg catgtcgtag aaaaatacat acatgtaaca gtgccagtaa accgcgtttt 1740 aagtgtttca agtttttttc atgatgagaa attttttgag tgattgaagg tatgtgtgtc 1800 aaatactttt aatggtgcca gtcgttgccc gcgccgtagg cgcggtcagc ggctggt 1857 // ID LINE2A_CE repbase; DNA; CEL; 4082 BP. XX AC . XX DT 31-AUG-1999 (Rel. 4.07, Created) DT 31-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE LINE2A_CE is a non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; ORF1; KW reverse transcriptase; ORF2; LINE; CR1 superfamily; LINE2A_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-4082 RA Kapitonov V.V. and Jurka J.; RT "LINE2A_CE."; RL Direct Submission to Repbase Update (31-JUL-1999). XX DR [1] (Consensus) XX CC LINE2A_CE is a non-LTR retrotransposon related to the CR1 CC superfamily. Two open reading frames, ORF1 (position 71-883) and CC ORF2 (position 953-3955) encode two proteins. The second CC protein is a CR1-like reverse transcriptase. Protein encoded by CC ORF1 is C.elegans specific. There are about 10 copies of CC LINE2A_CE CC in the C.elegans genome; they are ~98% identical to the consensus CC sequence. Therefore, this family was active very recently. CC LINE2A_CE does not generate target-site duplications upon its CC integration in the genome; its 3'-tail ends by (TATA)n. XX SQ Sequence 4082 BP; 1207 A; 1074 C; 720 G; 1081 T; 0 other; cactcgtcct cccgttctcc ccagcggtcg gtcgcatatt gttcttgctc tcgtatcgat 60 ttttgccgaa atgcccccac ccgatgacga catgatagtt gattcttcac caactgctac 120 aattccgcct acttgtgaca atattctgcg aaatcaaaac ttgccatcaa caccagcttc 180 cggacagcca agtatcaagg aattgatcga acgaatcacc attttggaga aaactgtcaa 240 ggaacaatcg aagaaaattg ccgagttgga agctactaaa ggtttcccat taatcaccaa 300 tgatgctagc aaaagtaaga gtaaacttta ctccgctgta gtccaaaacg accctcaatc 360 tgtgaaaatc atcgaaaagg ctcacttcgc tgctgatctc cgaaagctcg gggaaaactc 420 gatttacgcg attattgaga atgtaccgga ttgcaagaaa gaagaacaaa cgacaattga 480 tgcctcactc atggaaaact tagccaagct ggatactctt ccgaaacctg aacaattctt 540 ccgaatcaaa tgcaaaaaac cggatgttcc ttctcgccct ctcaaggtta agtttgctac 600 tgaatatcaa cgggatactt tcattcgaca attttccaag gcacttcaca atcttcctga 660 acgcccagta tcgtctcgaa ctatcagatg ccggagagac atgtcgcccg aagaacttat 720 actgctcaaa cagaggcgag ctacagccta tgaggaaaac cgtaaggctg gagtaatcaa 780 atattacgtc cgtgatcttg atatttgtga actctccaca cctcgtccgc tgacagcaca 840 gataacacca acatcggcgc caggtctctc gagctccaca taggaattct gccaccggac 900 atattcaacg aaacttgcaa acagttcttc atcatcgtca cccactacat ctatgataga 960 gatttctatt actcgtcaat gtcctttggt ggccaaaatt acacctgttc actcaactgc 1020 caaatcatca aatgccaagt caactgtatc cactgccgcc atgctataca ctaaaaagtc 1080 aacaacccta tcaaattcga ctcaaaaagc tgctacgaag cgacaagcaa ccgctccaac 1140 acctgctaat catcaatcga catcctctgt ggactctctt agtaagctca attgtgcttc 1200 tgccaacatc agatcaattg cttcggcgga gcgtttgaaa tacatccagg attatatacg 1260 ggaagaaaat atcgatattc ttttcttgac cgaaacattt ctgtcaccgg atacgccatc 1320 ttcgctatgc tctactatgg acctgatgtg cattcgttcc gaccgcctac aatctcaccc 1380 gaaatccaga ggaggaggag ttgccatttt ccacaaaccc tcacttatta tgtcgcaaat 1440 tgattcgctc cagaacggat actacgacag acacttctgc gacatcttag ctgttgatca 1500 caaaccatcg aaggccaggt tcatacttgt gtatcgtccc ccagatacat ctatccaaca 1560 aactgctgcc ctctacagca acctgtccga gctcatcagc agtcctgcta actaccactt 1620 catactggga gacttcaatc tgccaaatct catatgggac aaactggatc aactaccaat 1680 aaacattcac caggacctgt ctgatcttat gagctctcac aatcttgcgc aaattataaa 1740 gaagcccact cgtaccgcca tgtctggaaa gcaaaacttc cttgacctcc tgttcacgga 1800 ttctccttca ctgatctcaa atgtcagtat cgactcacca ataatgctct ctgatcactc 1860 cacaataaga ttcaacttgg tcctgaatta tgcgagaact gcacgtcgat taaacaggcg 1920 aactactatt cttcagttcc ggaagtgtaa ctttgaagct ctgaacaacc atcttctgat 1980 cttcaactgg gctcgacaat tttcctactt ctcccgatgc gaaacgaaac tgatccattt 2040 cctaaaaata ttcaacgagc tgatacgtga attcacacct gctgccaaac tgactaatac 2100 aatatctccc aacttcaaga agaacttgcg aaagagagtc aaacagcgac ggctcaggag 2160 tccaccttcc gatcaaaaga agtatatcaa agctcgtcta cgctctatca agaagatgct 2220 tgcaaaagag gaaaatcgaa ttgttgagtc taaaaatccc agacaactgt tatcaatggt 2280 gaagaaacga acgtctactc cgtctcatgt gacttgtctc gtggtcaaag gacaattatc 2340 tacaaactca gtggctatcg cggatgaatt tctcaactct tttgccaaat ccttcacacc 2400 accctctgat ccgttcccag ccctgcctgc ccaaaagccc atcgcaattg atccggactt 2460 tacgccaata aatatatgca ggatcatcca gaagttacga cccaaaatcg gattctcgca 2520 ggataatatc aatttcttcg tcatcaagaa gtgtgtgcac tcgctctctg tacctctatc 2580 actcattttc tctgagtcgt atgcctctgg ccaatttccg gaaatttgga aatcttctat 2640 cattgtgcct gtgcataaaa aaggttgccg cacggatgct aacaattatc gaccgatctc 2700 tttgacacac ccactatcga gagtgttcga gaagttcata gttgagaagt taagaaagga 2760 atgcagctcc aaaatatcta aatcgcaatt tggcttcatg aattctcgct catgcaccct 2820 tgccctgctc aatgcctgct caaaaatcct cgactccttg acgatccgat caaagtacgt 2880 tgatgcgata tacttggact tcaaaaaagc cttcgatagc gtgccacaca acctactgct 2940 ctgcaagttg gaactattcg gcctggatgt caaaatgtgc aattggtttc gctctttcct 3000 cagcaatcgg acctcatcca taaaagtatg cgaccacgtt tcaaagaaca aacttgaggt 3060 gctgtctgga gtgcctcaag gctccgtctg tggaccgttc ctgtttttga tatatatcaa 3120 tgacttgctc ggtatgctcc ctcctgatgt tcaaatatca gcatttgctg acgacataaa 3180 gatatatggt gacaacagca attcaatcca aaagtctatc gatattgtca cggattggtg 3240 cagaaaatgg agtctcaact tggcagaaaa caagtctgta gtcgttcatt atggaaaaaa 3300 taatccgaag tttgtctaca ctgcgaatgg tatcatcatc gctaagaaga aatcagtgaa 3360 ggatctcggc atattcgttg acgacaaact aaatttccac ggccacatca cttatgtcac 3420 aaatgcagct ctactcaaat gccggcaact cctcaaggct tttcgctcaa caaatgccag 3480 cctatacttc aagctgtaca atatttatgt tcaaccaata ctcgattatg gatgcgaaat 3540 ctatagtccc acctcggggg ctctaatcaa acaactggaa aagcctctcc gtttcttcac 3600 caggctcgtc ttccaacgct gcaacataaa gtattcttca tacgaggatc gtttggccca 3660 agccaatttg aagtcagtgc aacataggcg ggtcttgcag atccttcgca cctaccacaa 3720 catcataact ggaaacttcc actacccaaa tgtgtcatcg tcggtgaaga aagctgtaac 3780 tccaagatac ccctacatgc tcagatctgt tggcgaaaca aacaaaggat tcctcagagt 3840 caacctcgcc acctggaacc gcctagcaaa gcaaatcccg gaaaaattaa atcgctctat 3900 gtttgcttcc cggttaaatt ctttccccct taatattctc attcccccaa cttgatctct 3960 caaactagta atggtttatg ggatcatttt atcatccacc aattcttccc tgtgattatt 4020 tgtatacgat ctccccttat atgttttgcc ttgaagacgt gaatgaatga tatatatata 4080 ta 4082 // ID HAT2_CE repbase; DNA; CEL; 564 BP. XX AC . XX DT 21-OCT-1999 (Rel. 4, Created) DT 21-OCT-1999 (Rel. 4, Last updated, Version 1) XX DE HAT2_CE nonautonomous DNA transposon - a consensus. XX KW nonautonomous DNA transposon; HAT superfamily; TIRs; HAT2_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-564 RA Kapitonov V.V. and Jurka J.; RT "HAT2_CE."; RL Direct Submission to Repbase Update (OCT-1999). XX DR [1] (Consensus) XX CC HAT2_CE is a putative non-autonomous DNA transposon. CC It generates duplication of 8 bp-long target site upon CC integration in the genome. HAT2_CE has 17 bp-long CC perfect terminal inverted repeats. CC The length of the target site duplications indicates CC that transposase encoded by the autonomous HAT2_CE-like CC element is related to the HAT superfamily. HAT2_CE is about CC 64% identical to the HAT1_CE consensus sequence and they CC share identical 13 bp-long termini. CC HAT2_CE individual copies are, on average, 98% identical CC to the consensus sequence. CC There are about 100 copies of HAT2_CE per haploid genome. XX SQ Sequence 564 BP; 165 A; 113 C; 119 G; 167 T; 0 other; caggggtgtg cggataaccg attttttcgg ctaacggata aatcggctaa tgccgatttt 60 ttgagaaccg gctaacggct aattcggcta acggctaatt tcaaaatttt cggctaacgg 120 ctaattcggc taatctcagt cattcaaatc ggctaatttt cggctaattc ggctaattcg 180 gaaaatattc atttggttaa gctttttttg tccattctgt taattcaggt tttgggttaa 240 ttttttcact gttattgagc aaattcaggg atgaacgact gttcaaatag gaaaaaatca 300 tacaaattca ccacattttg ttttccaaaa aaatatgtta gttgaattag ccgaaatagc 360 cgatcggcga attcggctaa catcggccaa aatcggctaa cggataacta cgtattagcc 420 gaactgccaa aaagtcggct aacggctaat tcggctaatc ggtaaaaagg tcggctaacg 480 gataattcgg ctaatattag ccgagttagc cgaattagcc gatcggcgaa ttcggctaaa 540 tcggctatta tccgcacacc cctg 564 // ID HELITRON2_CE repbase; DNA; CEL; 5514 BP. XX AC AC006730; XX DT 25-MAY-2001 (Rel. 6.04, Created) DT 25-MAY-2001 (Rel. 6.04, Last updated, Version 1) XX DE HELITRON2_CE is a Helitron DNA transposon. XX KW Helitron; DNA transposon; Transposable Element; HELICOP2; KW HELITRON class; HELITRON2_CE; helicase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-5514 RA Kapitonov V.V. and Jurka J.; RT "HELITRON2_CE."; RL Direct Submission to Repbase Update (MAY-2000). XX RN [2] RP 1-5514 RA Kapitonov V.V. and Jurka J.; RT "Direct submission."; RL Direct Submission to Repbase Update (MAY-2001). XX RN [3] RA Kapitonov V.V. and Jurka J.; RT "Rolling-circle transposons in eukaryotes."; RL Proc Natl Acad Sci USA 98(15), 8714-8719 (2001). XX DR GenBank; AC006730; Positions 23891 18378. XX CC HELITRON2_CE is a HELITRON-like DNA transposon. CC 904-aa protein which is putatively involved in proliferation CC of HELITRON2_CE copies. There are about 100 nonautonomous CC elements CC in the genome, and they are 96% identical to HELITRON2_CE. CC These elements are ~500-1500 bp long. In many cases, CC HELITRON2_CE forms 7-9 kb minisatellite-like clusters composed CC from a ~200bp portion of HELITRON2_CE. CC HELITRON2_CE encodes a 904-aa protein, CEHEL2p, composed of CC the rolling-circle initiator and DNA helicase domains. CC CEHEL2p is encoded by 7 putative exons. CC CDS: 1834-1922, 1973-2235, 2431-2988, 3112-3296, 3657-3970, CC 4021-4294, 4343-5374. XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="CEHEL2p" FT /translation="MNEFSQKTDFPSTSDSQEPVSKRTRSRFKSDSSGAER FT KRKQRGKESSEEREARHQYDQDRKRKKKASESKDELKVLLLKDRERKARER FT REAKGISDGPNYLGPMSKTCDHCKAVFFESKNFQSNIRQFNSAFSMASMGA FT HLDIPKGHGPYCYRIHGQVYHLAGPLHPEPGGRPSYGQIYILDTAQAIEER FT LGNPANSKCDPQLMAELTKLISTRNPYAKAYKMMAEVEEKENTDALKEGRV FT AEEVRLIFDISTTKDRRCYNVPVSNEVAVVFVGEDQDIPASRSLAIHPRGG FT GLTAIRDIDKISVKYLFKYVYKGHDRAALNIIRSRLKEGIVDEIKAHLDAR FT YVCAPEAIHHLFGFKIERKSDTDIRTVIDDDGVPVVYPTFVEAAKARGLLK FT DDSEYLKSLKEWIGCSVPAALRSLFSSILLFNEVHEPQVLWDTVKRDLSED FT YRHAGARDDEAEALAYFDISSRLHRGGKAFNDYVPEPSMHPTDIYLNTSDP FT SQCAAEGVRLFQTLNSDQKYVFERIVGALHDPTLPRLFFLDGPGGSGKTYL FT YIALYNMIIGENKMAACTAWTGIAAHLLPGGRTMVSLLKLNIHDRCESSLH FT RRQLKDARELANTDVFICDEASMIPKTALNTFDSLLRDITQVDQPFGGKIV FT ILGGDFRQILPIVEKGSRADQVDACIKSSPLWNQFETLRLHSNMRVTTNDT FT EWMNFLLGVGDGSLNDSESKVQIPFSLCDGNIVEKVFGDVIDPSTSDPCEN FT VILTPKNVDVTLLNEDVHSRGIGEERIYLSRDEIVTDDPADHMHYPTEFLN FT KMTPSTLPPHVLKLKKGSVVMLLRNLDVSGGLCNGSRFVVESLGRRTIGCR FT FACGERKGKFVIIPRIECYHDKNLSFKLKRTQFPVRAPDGRITNIVFTEIL FT " XX SQ Sequence 5514 BP; 1653 A; 939 C; 1195 G; 1727 T; 0 other; tcatttacta tatataaagc gcgtgtcctt ccgtccgttt gtagtttgta gtttgatctt 60 tgatctttca actctctttt ctgtagttgg aaaaggtatt gggttaagag ttgccggggg 120 ataatgtcaa ggtactgtag tagtactgta ggagtactgt aggtatacgg tagggttact 180 gtagattagg aaaaattgag tttttgtcta cagaagaggt attgggttgg gagttggtga 240 gggataatgt caaggtactg tagtagtact gtaggagtac tgtaggtata cggtagggtt 300 actgtagttt aggaaaagtt gagttttcgt ctacagaaga ggtattgggt tgggagttgg 360 tgagggataa tttcaaggta ctgtagtggt actgtaggag tactgtaggt atacggtagg 420 gttactgtag ttaggaaaag ttgagttttc gtctacagaa gaggtattgg gttgggagtt 480 ggtgagggat aatttcaagg tactgtagtg gtactgtagg agtactgtag gtatacggta 540 gggttactgt agttaggaaa aatttgaatt ttaaccattt gaagagatat tgggttggga 600 gttggtggag gataatgtca aggtactgta atggtattgt tggagtactg taggagtacg 660 gtagggttac tgtagttatg aaaaatgtga attttaacca tttgaagaga tattgggttg 720 ggagttggtg gaggataatg tcaaggtact gtaatggtat tgttggagta ctgtaggagt 780 acggtagggt tactgtagtt atgaaaaatg tgaattttaa ccatttgaag agatattggg 840 ttgggagttg gtggaggata atgtcaaagt actgtaatgg tactgtagga gtactgtagg 900 attatggtag ggttactgta gtcaggaaaa aatttgaatt ttaactcaag aagggtccca 960 ttggccccgc ccccttggtt ttaaattttt ttgaaaacat tttggtggga attcaaattt 1020 cacttttata aaaacatttt gacgggaatt caaattttaa ttttttgaaa acattttggc 1080 ggtaattcaa attttaattc tttgaaaaca ttttggagga aaattgatgg aaattttctt 1140 aacttctaaa agttaagaaa ttgtttaaag ttgagaaacg tctgaaacct tctgaaatcc 1200 actgaatgca atgtcctaga aaattttgtt aagttcatga gacttcaaat taaaaaaaaa 1260 ttcgaacgac aaaatgcctc attgggtgga gtaattttgt cgactccagc ctctctctgc 1320 agcatttcag gatgttctat atttatttta gtaaattaga ttgttttgtt gaccggaatg 1380 tcaaaaactt ttctaaatac ataatttgac tgtatttttt taacacactc tacgaaattt 1440 ataatttgtt tttctgtttt cgttatttac tgtacattaa ttacgaaatt caagatttgt 1500 ttttatgttt ttgttgttta ctgtattagt gaactgttgt ttacagtagt gtttacagta 1560 ttttttgttt actgtacatt aattacaaaa tttaaaattt gtttctctgt tttaggttgt 1620 gttttttgaa aaaaaatggt attatacttt tttgccgaac tacagttttc aattatctat 1680 agtacgattt tggttattac aaaaaaaaat cttcccctgt tttatctgtt ctatcgaaaa 1740 aaaatttctt ttctcttcat tttttcgttt ttttttaaat tattgtttgt gatagtcaat 1800 attttacaga ggaatccgaa gaattattaa ctatgaacga attctcacaa aaaactgatt 1860 tcccttcaac atctgattcc caagagcctg tttcaaaaag gacacgttct agatttaaaa 1920 ggtaagtttt tattttctaa gtttatttgt tagtaaaata aatgttttta gtgatagttc 1980 tggagctgaa cgcaaaagga aacagcgtgg aaaagagtca tcagaagaaa gagaagcacg 2040 ccatcaatac gatcaggata ggaaacgtaa aaaaaaagct tctgagtcca aagacgaact 2100 gaaagtgttg ctactgaaag atagagaacg taaagcaaga gaaagaagag aagcaaaagg 2160 tatatccgat ggacccaact atcttggccc gatgagcaag acatgtgatc actgtaaagc 2220 cgtctttttc gaaagtaagt ttttaaataa tatttttaca aatgaacgtt gaattttacc 2280 atgcagcaaa tttaatttca gatgaagtaa gcggctccac tgatcatatc aacatgtgtt 2340 gcaattttgg actgacaacc tccatcgatg ataagttcgc cgactatccc gatcgattga 2400 aggatcttct tctcggagat tcgattgagg caaaaacttc caatcaaata ttcggcaatt 2460 caactctgct ttttccatgg cttcaatggg tgcccatctc gatattccaa aaggacatgg 2520 tccatattgc taccgcatcc acggccaagt gtatcactta gcagggcctc ttcatccaga 2580 acctggagga agaccatcat acggccaaat ttatattctg gacacagctc aagctatcga 2640 agaaagatta ggaaatcctg caaattcaaa atgtgatccg cagttgatgg cagagctcac 2700 gaaactcatt tcgactagga acccatacgc taaagcttat aaaatgatgg cagaggtgga 2760 agagaaagaa aacaccgatg ctctgaaaga aggaagagtt gccgaggagg ttcgactaat 2820 cttcgacatt tcaactacta aagatcgtcg ttgttacaat gtccccgtat caaatgaagt 2880 ggcagttgta tttgtcggag aagatcagga tattccagct tcaagatctc tagctattca 2940 tccaagagga ggaggtctca ctgcaattcg tgatattgat aagatatgtg atccattgac 3000 atatccataa ggtcataaga tggataacag attcatcgtt ccgtataact cttactttct 3060 gcttcggtac aatgctcaca tcaacgtcga aatttgtgct ttcattgaag ctgtgaagta 3120 cttatttaag tacgtgtaca agggacatga tcgtgctgct ctaaacatca ttcggagtag 3180 actgaaagaa ggaatcgttg atgagattaa agcccacctt gatgctcgat acgtgtgtgc 3240 tccggaggcc atccaccatc tcttcggatt taagattgag agaaagtctg acacggtata 3300 tcgactagct gtccatcttc cgggttttca agttgtggta tttccttcaa atgtaacaaa 3360 ggatcagttg cagtctacgc ctgaaaaaga tacaactctg acagcttggt tcaaaattaa 3420 caaaattagc gaagaaacag tgcagagtgg ttccatgtcc gattcatttg tcgacgaatt 3480 gcattacatt gacatgccat cttaattcac attcgatcaa aaagagcgct gctggaaaac 3540 ccgaaagcgt ggaatccgac aaattggcag aatgtacaca gttgctccca gtgaaactga 3600 acgttatgtt cttcgcattc ttctactaaa cacaaaagga gctacttctt ttgaggatat 3660 tcgaactgtc atagatgatg atggagtgcc tgtagtttac cccactttcg ttgaagcagc 3720 taaggcacgt ggactgctga aggatgatag tgaatattta aaatcgctga aagagtggat 3780 tggatgctcg gttcctgctg ccctacgatc tttgttcagt tctattctac tcttcaatga 3840 ggttcacgag cctcaggtat tgtgggatac cgtaaaacgc gatctcagtg aggattatcg 3900 tcacgcagga gccagagatg atgaagctga agctttggcc tattttgata tttccagtcg 3960 acttcacagg tagctatcat ctctaaactt atatttatag tggaaatata tgttttcaga 4020 ggaggcaaag ctttcaatga ttacgttcct gagccttcaa tgcatccgac tgacatctat 4080 ctaaatacaa gcgatccatc ccaatgtgca gcagagggcg ttcgactatt tcaaaccctc 4140 aactcggacc agaaatacgt cttcgagcgt atcgtgggag cgctacacga ccctacttta 4200 ccccgtctct tctttttgga tggtcccgga ggaagtggta aaacatatct gtacattgca 4260 ttgtacaata tgataattgg agaaaataaa atggtaacta tcaactaatt catttttatt 4320 cctaacatgt ttttattcca ggcggcttgc acagcatgga ctggaatcgc ggcacattta 4380 cttccaggag gtcgaactat ggtgtctctt ctcaaattga acattcatga tcgatgtgag 4440 tcgtcgcttc accgaaggca attgaaagat gccagagaac ttgctaatac cgatgtgttc 4500 atttgtgacg aggcttcgat gattccaaaa acagctctca acacgtttga ttctcttctt 4560 cgggatatca ctcaagtcga tcaaccattt ggaggaaaaa tagtcatttt gggcggagat 4620 ttccgacaaa tccttcccat tgtcgaaaaa gggtcgagag ctgatcaagt cgatgcctgt 4680 atcaagagtt cgcctctgtg gaatcagttt gagactctac gcctccactc aaacatgaga 4740 gttaccacaa acgacacgga atggatgaac tttcttctcg gtgttggcga tggttcccta 4800 aatgattctg aatcgaaagt tcaaatccct ttttctttat gtgatggaaa tattgtcgag 4860 aaagtttttg gagatgtaat cgacccatca acttctgatc cctgcgaaaa tgtgattctt 4920 actccaaaaa acgttgatgt cacccttctc aacgaagatg ttcattccag agggatagga 4980 gaagaaagaa tctatttatc aagagatgaa attgtaacag atgatccagc agatcacatg 5040 cactacccaa cagaatttct gaacaagatg accccttcga ctctcccccc acatgtgctg 5100 aaactgaaaa agggatcggt ggttatgctg ctcagaaacc ttgatgtatc gggaggtctc 5160 tgcaatggat ctcgctttgt ggtggaatcc ttaggacgac gaacaattgg atgccgattt 5220 gcatgtggcg aaagaaaagg aaaatttgta atcatccctc gaatcgaatg ttatcatgat 5280 aagaatttgt ctttcaaact gaaacgaact caattccctg ttcgagcccc tgacggccga 5340 atcacaaata ttgtttttac cgaaattctt taaacgaatg ttttagaaac cttgatttga 5400 tttaataaaa attatttaaa aaaaattttt ttcaaaattt ttcacttcct tatttttaaa 5460 ataaaatttc aaacgaccag tcgcggccgc gccggaggcg cggtcagcgg ctgg 5514 // ID RCC9 repbase; DNA; CEL; 366 BP. XX AC X61257; XX DT 28-SEP-1995 (Rel. 1, Created) DT 28-SEP-1995 (Rel. 1, Last updated, Version 1) XX DE C.elegans RcC9 repetitive DNA. XX KW Repetitive DNA; RCC9. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-366 RA La Volpe A.; RT "Direct Submission."; RL Direct Submission to Repbase Update (16-AUG-1991)A. La Volpe, CNR RL International Institute of Genetics and Biophysic, Via Marconi RL 10, 80125 Naples, ITALY. XX RN [2] RP 1-366 RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and La Volpe A.; RT "Molecular evolotion of clusters of satellite-like DNA sequence RT in Caenorhabditis elegans."; RL Unpublished. XX RN [3] RP 1-366 RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and Volpe A.; RT "Molecular and genomic organization of clusters of repetitive DNA RT sequences in Caenorhabditis elegans."; RL J. Mol. Biol 226, 159-168 (1992). XX DR GenBank; X61257; Positions 1 366. XX SQ Sequence 366 BP; 100 A; 72 C; 64 G; 130 T; 0 other; gtaaatctct gtgattcaag ttttccgcat tcagtgtcca atggagcgcg ggtcgagcca 60 gaacctcgtg tagaactttg cagaagtttc ttgaattttc cacgaaattg aacatttcag 120 aattttatcg aattttttag agagttctgg aacattgcag acttttcccg atttttctag 180 aaagttctgg aatattccag aattttcccg aattttctag aaaattctgg accattccag 240 aactttctcg gattttctag aaaattctgg accattccgg aattttctcg aattttctag 300 aaacttctgg accattccag aactttctag aattttctag aaaattctgg accattccgg 360 aatttt 366 // ID NDNAX1_CE repbase; DNA; CEL; 2087 BP. XX AC . XX DT 12-NOV-1999 (Rel. 4, Created) DT 12-NOV-1999 (Rel. 4, Last updated, Version 1) XX DE NDNAX1_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW nonautonomous DNA transposon; DNAX family; NDNAX1_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2087 RA Kapitonov V.V. and Jurka J.; RT "NDNAX1_CE."; RL Direct Submission to Repbase Update (NOV-1999). XX DR [1] (Consensus) XX CC NDNAX1_CE is a putative non-autonomous DNA transposon; CC its copies are 98% identical to the consensus sequence. CC It shares common termini with NDNAX2_CE element. CC These elements do not have terminal inverted repeats; CC It is possible that NDNAX elements share a common CC mechanism of their transposition with the ATREP1-ATREP10 CC elements found in the A.thaliana genome. XX SQ Sequence 2087 BP; 600 A; 402 C; 376 G; 706 T; 3 other; tctgccttct tcctatgtta aatatggaat gtgcaagggt cgtcactttt tgcgcacaaa 60 gtgtcacaaa aaattaaaaa agttgggtgt gcactttcgt gtctaacttt gctgtacctg 120 cacatttttc aattttcttt tttgtaggaa atagagaatc gctttttcta tcttyctatc 180 taaatggtta tttgatttga ttttatcttt taagagaaat tcttgatgaa acacgaaaaa 240 atcacttttt tcgcgtgtcg tgtcgctttt tttgcacacg tttagaggtc atagtataaa 300 atgtttttga taaggtaact gtatatctgg gattgttaaa aatttttagt ttttgcaatt 360 tcatcaattt tgaactgagt tatgatgatt tgaaaatcgg actcgaaaaa tcccggaatt 420 tcgaaaattg cgttgaaaac atgtttaacg ttcgtatttt tgttttattt tacattatca 480 aatgtaaatt ttaacgtaga ttccgttttt ctaaatagtt ttctcattat ccaaccggaa 540 agttacgaaa aaagcataaa ttgacgattt ttcaatgaaa ttgtgcgcat tcccccttgt 600 gtttttggtg tgcaaattag tgatttttga gattctttcg cctataagtc ggtcagtttg 660 gcactttttg gggtaaattt ttcgaaaaac gtatttttca ggaaatttac tcttcaaaaa 720 tataattttt tttacaaaaa agtcaaaaat aacttcagaa atcgcaaatt tcatgatttt 780 ctcaaaawaa tgacgttttt gctggaaaaa taaaaaattt tccactaccg aatgggattt 840 tatgttaaat tcgtaattag tacataaata ccgaatgcaa gctgaaagca cagctctcta 900 cgatcatttt caacgatttt agtttattta atttttgacc tcgaaatttc aatttccctc 960 ttttcctgga ccagtttttc actgaaatgt gaatgttatg tcctaaatct tattattttt 1020 gctatgagaa gcgagacaaa ccaaaaaatt ttagtcttga gttcctcgag ccgcgttcct 1080 gaagttgtgg gcttcaagcc ttgctcagga agctcaaaag ttcattcttg accaattttt 1140 aaattttcaa atgctataat attctacttt taacgctgat ttcaaatata tacttttctt 1200 tttccaattc ctattatgag ctgagttacg gcgctttcaa gtttcagcaa tgcccgcccg 1260 cccccctctt tcctcccctt tctcttgcct cctccctctc atgcataggg tggctgtaat 1320 atggtctcta attactgtta caggatcaga tatgtatatt ttacgacata tctcttaata 1380 gaagaacaat agacatcgcc acatttgttt ttgtaagagg tccccaatgg gctgtaaaac 1440 agtctatcca cctttcttcc tgcaagaaag gaggtaatgt atttgagact gttcatttat 1500 tgtaattgag gggttgaaaa ttattatttt aattttcaaa taaaaataca ttaaaatttt 1560 tttttaattt taaaataaaa aaatcctttt taattttaaa ataaaacatt ttttaatttt 1620 gaggtctgct ttcttcgcgg ggacagagcg ccgtaggcgc tcttaatcta aattataggc 1680 agagcaggcc gaaggcctgc tatgccagta ggctttcact gccgggtcag agcgccgcag 1740 gcgctctgaa tctaaattac aggcagagca ggccgtaggc ctgctatgcc agcctgcttt 1800 tactgccggg tcagagcgcc gcaggcgctc taaatctaaa ttgtgtactg cagggagaca 1860 ataaatattt ttaaaacagt aaaaacaaac acacaggttc gaggcgaagc cgagaacctg 1920 cgccagctac ctaatctggt agaggaagca acgcaggccg caggcctgcg gtgcgagacc 1980 maaaacattg ccaatcgcac gttcgaggcg aagccgagaa cccgcgccag ctagcggact 2040 cgccccaacc ccacctctac tagcgggccc catagggccc gctagtc 2087 // ID TIR23T5B_CE repbase; DNA; CEL; 1658 BP. XX AC . XX DT 14-SEP-1999 (Rel. 3.8, Created) DT 14-SEP-1999 (Rel. 3.8, Last updated, Version 1) XX DE TIR23T5B_CE is a putative non-autonomous DNA transposon - a DE consensus. XX KW Non-autonomous DNA transposon; TIR23T5B_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1658 RA V V. and Jurka J.; RT "TIR23T5B_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC TIR23T5B_CE is flanked by 23 bp-long terminal inverted repeat. CC There are about ~25 copies of TIR21T5B_CE in the genome; they are CC ~98% identical to the consensus sequence. CC TIR23T5B_CE generates 5 bp long duplication of target-sites upon CC its integration in the genome. XX SQ Sequence 1658 BP; 569 A; 266 C; 276 G; 546 T; 1 other; cacagcgcct caaatcggga gagttaagaa aatcgagaat tggaaagtcc aacggagcgc 60 gcttgcacac ttttacggta ttaaatgagt ttcagacgaa atttcgcgat tattcgagtt 120 ttcaagatga aatttggaaa aaataattaa atattgggat tattattgct ttaaaaaagt 180 tttaaaaaac attttattga acaaattcaa gatttggctg ctttttcatt gattttcgaa 240 ccgactagaa caagttaact ctatgacaaa atttctacgc gaaacaaatt ttttcaagca 300 ttaatatgtt tgattccaat ttttaaacgt cagcttgata gaaataacac ggtttcacgt 360 cttgaaattg ttcaaaaatg cccacatcaa cggaaactgt ttagttggca gctgatcttc 420 actaaaccaa cactaaatgt tcagatcaaa tcacaacttg tttgatcatt tgaaaaaaca 480 tttttgaatc attgtttccc acaaaaagca gatgataagt agtatagcat gatgagaatc 540 catatgattc ttcatatgcg catagctata gatactgtat cttcgacaac tcctatctcc 600 tcagagakgc aagatatcaa aaagtgatta actaagaaat tgtagaacat atcaaaaact 660 atactaatta attaaaatca tgtgtttatc tcttaagatg agggagttag agcagccagt 720 agaataacaa aaagtatgca atacaaatgt acgtctaact cccccatctt gaaagataaa 780 aacgtaggtt aaattgaata tcatagtttt tgatgtgctc tacaagttct cagttgatca 840 ctttttggta tctcacctct ccgagtagat aggagttgtc gaagttgagg tatctagagc 900 cgtagagggt aggaagaacc tgatggttcc ttattaagga aaagaaaata taacacgaaa 960 cttgtaaaaa atgttaattt gatcatttat gtgagtattt gagccatttt ttaaggaaaa 1020 ttggagagca acacttacat aagaaatttg gcgtggcata gatccattgg tagattggaa 1080 cattgaaatt tgatcagaac tatttgtatt tttagtgatt ttgattgatt tgtagcttta 1140 ttctagtgaa atctttgatt tcataaaagt tttttaacca attttgcaag atttcccttg 1200 ataaatgttg ctttactttt ttatcgattt cgttgtaaag aaaaatcatt tttctgtgtt 1260 atcagtcaaa ttaatgacta aaactcttta aaaagattca aactattagt aaaaattcta 1320 aaattgaata tttatgcaat aaattgctct aaaatcaatt tttatttgca aaattctgtt 1380 tttggtcaaa aaaaattttc agattcgccg aaattcagag ctcatttcga gcttatttgg 1440 ggaaaaattt atattaaaat gaagtaaatc gcatcgcgaa cagtctctgg gcggtcccgc 1500 atgttgaaaa ttgctttaaa atccctaaaa tccagcaaaa acgcgatatt ttactgtttt 1560 gctacacatt aattccaatt cgtgcgacgt ctgcagcaaa cgcgctcccg cgccaaaacc 1620 acttctttag ctcttctctc ccgatctgag gcgctgtg 1658 // ID CELE14B repbase; DNA; CEL; 187 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE CELE14B nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE14; KW CELE14B; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-187 RA V V. and Jurka J.; RT "CELE14B."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC CELE14B is a second major subfamily of CELE14 repeats. CC CELE14B and CELE14A are 89% identical to each other. CC CELE14B is flanked by 30 bp-long imperfect terminal inverted CC repeats and 6 bp-long duplications of non-specific target sites. CC CELE14B copies are ~91% identical to the consensus sequence. XX SQ Sequence 187 BP; 50 A; 46 C; 38 G; 53 T; 0 other; cacgtggtgc caggctgtcc cattacggtt tgatctacaa aaaatgcggg aatttttttc 60 ccaaaaaatt gtgacgtcag cacgttctta accatgcgaa atcagttgag aactctgcgt 120 ctcttctccc gcattttttg tagatctacg tagatcaagc cgaaatgaga cactctgaca 180 ccacgtg 187 // ID HELICOPY3 repbase; DNA; CEL; 195 BP. XX AC . XX DT 09-AUG-2000 (Rel. 4.1, Created) DT 09-AUG-2000 (Rel. 4.1, Last updated, Version 1) XX DE HELICOPY3 is a nonautonomous DNA transposon - a consensus. XX KW nonautonomous DNA transposon; HELICOP superfamily; HELICOPY3. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-195 RA V V. and Jurka J.; RT "HELICOPY3."; RL Direct Submission to Repbase Update (AUG-2000). XX DR [1] (Consensus) XX CC There are about 50-100 copies of HELICOPY3 in the C.elegans CC genome. CC They are ~99% identical to the consensus sequence. CC This family is a nonautonomous HELICOP-like DNA transposon. CC Several copies are identical, therefore, it is likely that CC HELICOP-like transposons are curently active in the C.elegans CC genome. HELICOPY3 and HELICOPY2 share identical 22-bp 5'- CC and 38-bp (one mismatch) 3'-termini. Their internal portions CC are different. CC A 49-bp 5'-termini of the autonomous HELICOP2 transposons is CC 94% identical to the HELICOPY3 5'-termini. These transposons CC share 84% identical 38-bp 3'-termini. XX SQ Sequence 195 BP; 49 A; 44 C; 48 G; 54 T; 0 other; atcatttact atatataaag cgcgtgtcct tctgtcccta tgtagtttga tctctgatca 60 gagcaacgaa attttgggaa accttatgct aaatgcgcag cagacatcga tcgaggtccg 120 cgatagacac cgtatatcta atgtgttttc tcaaaaaagt cgggggccgc gccgtaggtg 180 cggtccacgg ctggt 195 // ID CELE32A repbase; DNA; CEL; 364 BP. XX AC . XX DT 23-AUG-1999 (Rel. 3.6, Created) DT 23-AUG-1999 (Rel. 3.6, Last updated, Version 1) XX DE C. elegans CELE32A repetitive element (a consensus). XX KW MITE; CELE32A. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-364 RA Belknap R.W.; RT "CELE32A."; RL Direct Submission to Repbase Update (AUG-1999). XX RN [2] RP 1-364 RA Jurka J.; RT "CELE32A."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [2] (Consensus) XX SQ Sequence 364 BP; 108 A; 67 C; 65 G; 114 T; 10 other; actgtagttt tcgctacgag atattttcgc gtcaaatatg ttgcttaata cgcattctca 60 gaattttktt tttttttgaa cacggcccgg caagtggtac atccatgcaa atgcgctcta 120 ctgataattt gagtktagac caggtttggg cgcgtkataa cgaaaaaagc tttggtccaa 180 aaaatttaga atttattttc ggacattttt tatatgcatc acaaaaaaan tggaccaacc 240 gtttttgaga aaaacgcgcc caaacgtcca ggtatacggt agacaaattg cgtacaggta 300 ccacttctcg ggccgtgtga aatttaaaat cataatthtt ttccnnkama anaatcgatt 360 tttc 364 // ID CELE14A repbase; DNA; CEL; 177 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE CELE14A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE14; KW CELE14A; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-177 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification of putative nonautonomous transposable elements RT associated with several transposon families in Caenorhabditis RT elegans."; RL J. Mol. Evol 43(1), 11-18 (1996). XX RN [2] RP 1-177 RA V V. and Jurka J.; RT "CELE14A."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [2] (Consensus) XX CC Original classification of CELE14A as a Mariner-related CC nonautonomous CC DNA transposon [1] is wrong since CELE14A generates 6 bp-long CC duplications of non-specific target sites. CELE14A has 51 bp-long CC terminal inverted repeats; its copies are ~92% identical to the CC consensus sequence. XX SQ Sequence 177 BP; 45 A; 40 C; 41 G; 51 T; 0 other; cacgtggtgt caggctgtct cattgcggtt tgatctacga aaaatgcggg aatatttttc 60 cagaaaaatt gtgacgtcag cacgctctta accatgcgaa atcagatgag atgtctgcgt 120 ctcttctccc gcatttttcg aagatcaaag cgaaatggga ctttctgact ccacgtg 177 // ID CER2-I_CE repbase; DNA; CEL; 8558 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER2-I_CE is an internal portion of the CER2_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER2_CE; CER2-LTR_CE; KW CER2-I_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER2_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (31-MAR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualif;iers CC gene 4096..4449 CC /note="Cer 2 Rnase region" CC gene 2349..2666 CC /note="Cer 2 Protease region" CC gene 4798..5337 CC /note="Cer 2 Integrase region" CC gene 3292..3807 CC /note="Cer 2 Reverse Transcriptase region" CC gene 1929..2153 CC /note="Cer 2 GAG region". XX SQ Sequence 8558 BP; 2390 A; 1714 C; 2177 G; 2277 T; 0 other; ggtagtcagc gagtgtcact gatttttgct aacgcgtatc agtgataatt acgatccacc 60 ggttagagaa tgcgattatt ttaaaaatta attttaattc ccttccgagt gtcttttcac 120 ttgtttccat tttctcctcg tttggatgtt ccggcagtgc gattatctgc agtagttgtg 180 ctgtatcaac gggagatcag gagccggcgg aatggattcg cgcacttgtg aatttttcgg 240 aggacactgg caagagagca gagcggatag gtcaaggtca ggtcagttta gtgaaaggtc 300 ggtggaagct gggtttggta cgctgcatcg tactggtaaa attgcccact aattttacat 360 tgaaagtgtg aatccacttt agaggttcag agcactccgt ggagttgctc gcgagcttgg 420 ttccatcctc ctttggaaac agctcagttg ggacaggcgg gtctgtcctg gagcttcccc 480 gaggaatgct ggtgcaggta ttccgggtca attttagcat atttttcaat ctttttcacc 540 ttactcgcga atcgcttaaa tttttcgtgc cagtttttgt aggaacctaa aaggagagtc 600 aattaatttc atcattacag caaacaaaaa acttcgcagc gatcaacttg acatgtctgg 660 acaacgtccg aatcaggatt tcgagcttca aacggaagtg gtcttcaatc attggcgttt 720 gggcactccg ctggctagga accatcttct cgagctgaag ggtgcggttg tggcagatct 780 gcacatcatt cgagctcaat tcgaacatga aaaggagcat tggttggccg agcgggctca 840 gatggccgag aaagtggaag ctactaagct gttgatggag agggcgacgg aactcgtcga 900 ctcactgaag gctcggatcg agattttgga aagtcaggta cgtcaaccgg aggcttcggc 960 cgttgatgga aatgggagag aggattcctc cgggacctct cttcgcggac tccgttccgt 1020 acaagttaat actggtggta gtggtgatgg agaaaatggg gctaccaggg ttgccaacct 1080 cgctgaggag gttgcagaaa atctcggaaa tgaagaactt ctcgtcgagt cgaacgcggc 1140 ttcggcatcg ccgccgagta acacgtctag tggggcgggt caaacattgc aggaggatgt 1200 tagaccagat gttactcaga gagttgtcct aagatcggaa agtaacaatt ccgatactat 1260 ggatgttccg ttctccgaaa cggtaaataa tgcaagtgca agttccaatg ggaatgtgag 1320 aagaccaagg atcaatatgc tgcggctgag gccggcagtg cagttgacgc catacagagg 1380 tactatggct gaggatttct cgtcgtttat tagatctttc aacgatcaat gtgaagcgag 1440 tgagaggatg gatgacgata ttcgcctgaa attcttcttg acatgtctcg tggatcggcc 1500 cgaaacgtgg ctgaagacga gatgaagctg aatccgacga ttaagctgaa agagttggtc 1560 gagaaaatga agtctacttt tgagaatcca gtgctgataa aacataagga gaaccagctc 1620 cgactttgca agcagaagaa ggacgagtcg gtcgaggcct tccatctcag aatcgaggag 1680 ttggagagag ctgcgagcac aaccgaatca gcggtttgga cgaagagaat cgtgttggac 1740 gcgtttctcc tggattgaac aagcacattc aggagaaagt gcgcttggaa tggcctgata 1800 cgatggtgaa ggctttggaa gtggccttga gagtcgaggc attggatgcg gagctgcaca 1860 agccgagtcg ggaagcagct gaggcattgg cgatctacaa gccgattcca aagaggaatg 1920 aacgattcaa aggtgagtgt ttcttttgta ataaagttgg ccacaaggcc gcagattgtt 1980 tcaagaaaaa cggtaagaag gataaaaagg gtaagttcgg tgagaagaag ccccaagatc 2040 ctgtcaatca tcacaagatc gagaagccgc aagggatggt gatttcgacg atggtggatg 2100 attcggagcc ctcatatgag gtactgaaga gtcaactcga agagatggct cggagacacg 2160 aattgcttca aagtggaatt cgtttcgagg atgagaagaa tggtgtacga agcctggcat 2220 ggccaaagat gactcaggtt agtgaaattg cggactccgt tccgtgccaa tattctcaaa 2280 gtctcgggaa ctcatttatt actgcccaca ttccaattcg tgcaaatgga attccatatg 2340 ccgctctcgt tgatactgga gcaaatatca cagttgcgtc tagaaatatc ttgaaagggc 2400 taggaattgt tagattgcaa gctgcagata atgataatgc agttggtttc ggaggaaacg 2460 aggtgagtat gatcggttca gcgatcgtac gttttcaaat cggtagccaa gtggtgaaac 2520 agagagttca tttcactaat ggtcactgta tgccggatat tgatggtagt taccaattta 2580 ttttcggaaa cgacttattg tctcgtttgc cgatttttat gttcgactat cagcagaaga 2640 tgtttcacgt tggagatgac gcgattccat tcggaagagc gccaggtcag aatgtgaagc 2700 ctagaaatta tcaggtgaag gtgagtgaag atacgatcat tccagcaggt actgaaaagt 2760 acgtcaagtg ttctatcgat gcatcattcg ggagcgaaaa ggacgtggtc ctgatgatcg 2820 attcgaaaat tggagaaaat gatttgttcg tttctccagc ggttgtcatg gcttcaaatg 2880 ctatgattct ggtgtcgaat ccaacggagg aggataagac tatttcgacg gatgttcatg 2940 ctgccgtcgc caatcggatt tcaactgatg gtaatgtttt atattgtttt tgatagtgag 3000 aatactcctt gtgttcaaac tgtcatcacg gactctgatc cggcctttaa tataaactta 3060 tctaaagtta aaatttccga ttctgaaaga gctaaattaa aagttcttat tgagaaatat 3120 catgatgttt tctcgaagaa tgcttatgat ttgggtagtt ccaagactga tccagttcat 3180 atttatacga ctacagaggt tccggtgaga ggacgagctt atcgtgttcc ggttaagtgt 3240 caagctgagc ttgagaaaca tattaattcg ctaattcaat cggagcgaat agtggagagt 3300 aatactccat ggacgagtcc aatagttctg gtgaagaaga agaatggctc gttaagagtg 3360 tgtttggatt ttcgaaagct gaatgaggta actattccag ataattaccc tttgcccagg 3420 atcgatgcta tcatcgagaa agtgggcggt tcgaggtatt tttccgtgtt ggatatggct 3480 aacggatatc tgcaattgag attagatgcc gagtcgtcat acaagtgtgg atttatcact 3540 gaaaataagg tttacgcgta cacgcattta ccttttggac tgaagtcagc agcttcatat 3600 tttcaacggg cgctgaggca ggttcttgat ggacttgagg atgtgatggt gtacatcgac 3660 gatgttctta tctacagtaa gacattcgaa gatcatatca ggacgctaga gatggttttg 3720 gaacgattcc ggaagttcaa tttgaaggca tcgccgaata aatgtgagtt cttcaaggaa 3780 tccattgtat tcttaggtca tgagatatcg agagacaatt attcgccgaa tcgtgtaaac 3840 gtcgagacga tcagatcgat gccgacgccg acgaacgtga atgaagttag aagattcgtc 3900 gggatgtctg ggttctttag aaagttcatt ccgaatttct cagagagggc tgagccgctg 3960 acaaggctta cgaagaagaa ccagaaattc gtatgggccg aggagcagca gaatgccgtc 4020 gacgagctct ctgaagctct tactaacaaa cctattctca cgtttccaga ttatgacaag 4080 ccgtttcata tttttactga tgcgagtgcc gttgctcagg gagcagcgct aatgcagaca 4140 gttggagaag acgaaaagga ttttgcagca atcgctttta ttagcagaac gttggcggac 4200 acggagacga gatggccagc cgtacatacc gagttaggag caattatatt cgcgctccgt 4260 caattccgtc cgtatgtttg tatgtcaaag ataattcttc attcagatca caagccgttg 4320 aggtatattt tggctaagag caaaatcaat gatcagattg ggagatggct cgttgagttg 4380 cagcagtacg atatttcgat cgtgcacatc gatggaaaga agaacatggt tgccggttgt 4440 ctatcgagag ctaaggatga gatcgcgccg ttgtcaggta ttgagatgga agatattatt 4500 gattttccag tctgcatgcc gattcgaaag aagaagagag cgagcgtggt gttcgtgtta 4560 caggcaaata agaatttgcg tctggatttg gtggaagaac aggataagga tccgattctt 4620 cgagctatca agaagttttt ggtcgagcca aaagcttcga tcgattgtgt tccgaaatct 4680 tggtgcgatg ttcttgaaca tgtcgagatt tcggagaaag gcgcgctctc cgtagcgtat 4740 cataactcat tccctaaaac tgttattcca gaacatttga ggaggtttat gtttgaagcc 4800 tttcattcga gtaagttaca gggaggtcat cataattgga agaagacgtt tcgaaaagcc 4860 agtgtaaggt atttttggcc ggatatgaaa tctgacattc tgagatggtg catggaatgt 4920 attccttgcc aacagaggtc gaagcctcac ccgtccactc gagagccgca gcagatcgtt 4980 gtcacttcga aacttttcga gaaagttgga gtcgatttgt gcggaccgtt gagatcaacg 5040 gctggaggac acaaatatta catgaatttg atttgttggt tcagtaagtt cgttgtttcg 5100 gttccactta ctgacgcgtc tacggataca gtggtgcgag ctatcctcac ggaggtagtg 5160 ttgaaatttg gaacgcccag cgaattggta tcggacagag catcgacatt tacgtctgaa 5220 gcttttagac agttctgtaa gttggtgagt attcagcagc atttggcgat tccttatcat 5280 agcaaaggca acggagcaac agagagaacg tttagaacgt ttcataacat gacgtcgaaa 5340 tacgtcaatg cagcccactc tgactgggat atactactac cttatctcac ttttagctat 5400 aacactgttg ttcatagtac tacaggagaa acgccgtttt tcctcgtgtt tgggagagac 5460 ccggttttcg cagtcgacag gattttggac ccgtcaccac cgaaagaagc tgggaagtca 5520 gatgtgaaga tttggaagga gcatctcgtc gagattcttc gtgaagcttg gaagaacaca 5580 gccgagattg cacttaaagc tcaactagca taccagaagc aggccaatca aggagcgaaa 5640 ggttctgaaa ttagaccagg agatagggtg atgtttaaaa attttaagtc aaaaattaat 5700 ctgtctagaa agctagtgaa accatggatt ggagattatc gagtcttaga ggttaatcat 5760 ccaaaagcgt taattttaga tttagatcat ccaggaaaag agccgagaga agttcatctc 5820 gatcagatta aaaagtttta tctttccgag aacgataacg acgaagagga cgccgcagtg 5880 gatgaagaac aagttccaac cgacgtcatc agccagccga tcgcagaggt gacgcaggca 5940 gtcgaagaaa gttcgaagaa agctgaaaaa cacgaagttg aagctgaaaa gaaggacaaa 6000 cccgtggagg tggcgaaaaa cagaagaaat ccgccacgtg caaagaagat tccagtgagg 6060 tttgccgaaa aataatcgat ttttcctaga aaatcggttt gccgtctctg tacggtgtct 6120 aacgttcatt tatttgtaat tttcagcgaa aatgagttta atttcgagtc agaaagtgat 6180 ttttaaaata atcgctacag gacccgtgga ttaagttaag aggctgtgcc tttaaacgga 6240 ttacggtacc atttgggtct cgcctcgaac catttgggtc tcgcttggaa cgcgggcccg 6300 catttttggg acacgtaaac aaggttgttt acgccacatt ttgggcgaat ttttcgattt 6360 ttcggcctaa atttttgcgt ttttgacgaa tttttactga tttttcgccg atttcgtgca 6420 aatttcttaa attctttcaa atttcacact ttttcttttg gttttagtgt ttcttttctc 6480 gagttttcaa tttgcactta aaaggcctaa aataattaaa ctttcgcagt tttacgtgat 6540 ttttgttttg ggtttacgtg tttaaaattt tgcaatcgtg gcaagatttc tttgttaaca 6600 gttaatcagt ttggtttttc agaagctaca aagcagcaaa tgtcaagcag tcccgtcacc 6660 atggcggatg agaactcgaa cccgagttcg aaaatcgatg gggcagcccc caggagccgg 6720 tcgccgcacc ggcgacaacc acaccactct cggagcagca gcaacaacat cagcagcagc 6780 ctccacagtc ggttgaggag aagaaggaga agccgaagcc agaggaagca gctccagtgg 6840 cagcagcagc aactccgatt gcggaaggaa cagcagaacc agtcgagaag aaggacgagc 6900 tcatgccaga agctccagct ccaacgcagc aagctgaagg atttggaatc aattcccttt 6960 ttgacggtct ccgtaccggt tctctctccc ccccttccac attcgtccaa ttcagagtat 7020 gaattcgcgt acgatcctag tttgaatcca agtgattcac atcctatggg aagtaatacg 7080 gtctctgaac cggcaaatgt tccattccaa aatcttttgc aaccaggttt agagggaatg 7140 gaagttgtcc agcaagaaga ggatatggat attgatgaag agttacttct cggaccagtc 7200 gcagaggagg aaccgagagg ttacgttcca cctgttcgtg aacgtcatcg gttttggcag 7260 caaggcactt gagaggacaa ggtatgttaa atgaagattt aattaaccct acgctctccg 7320 cagcggtcaa acttccttat acttctattc caaaagagat gactattctc cgagttttca 7380 tgaacattca tgatccacag ttagttattg gaactgcggc tgcaatcaag ttggctttta 7440 gtcaagcacc tttcttggag atcaaagatc gctttgatat tactttcatt gatgaggcct 7500 cacagttagc tctctacgta ttgggatcgc tagctacgat gcttccgaag tctcgcatga 7560 ttcttgtcgg agatatgcat caacttccgc catacatgga ggaagcgctt ccagctgagc 7620 tgaagcgagc ggcgattgga gagccactta cacttgcagt taagggaaga agatggccgt 7680 caatgcatct tactcgcgtc catcgatgcc cgaagatgat tactgaggtc ctcggagatc 7740 tgttttatgg aaatactctg acttcttcga agccaggcgt caccgacatt ccagtgctga 7800 aggcaatggg actcccgagc cgacatccaa tggtcttcgt caactacacg tcaccgcaga 7860 cagctgtcgg aaagagtttc tcgaacgaag gagaagctag atatgcgctc caactggtcg 7920 aagcactcac tcgatacgcg agcacggcga ataagaagat caccgccgcc attctaaact 7980 tctatggagc tcaatactcc tacgtgtact ccatggccga agacgaagtg accgtcaaca 8040 ccatcgacgg atgtcaagga caggaatatg acgtcacaat cgtgcttctc acaagatccg 8100 atccttatga gcgatcaaaa ttcctggtca atgccaacag gatcaacgtc gcgctgagcc 8160 gcccgaagat cgccacagtc atcattggtc aacgccacct gacaggaaat caaccggacc 8220 caagacagcc gaaacgcgga agacatcagc gagtttgcaa ctgggcccgt ctcatcgaga 8280 agcttccaaa ggaatgtttc gtcgacgcca aagatcaagt catcgctcag gtagagtgaa 8340 ggtgaagcag gttaaacctt agggtaaagt ttttattttc attttcttta ctctaggtaa 8400 caggttttct atagccgctt ttaaaactct ctactcaaaa acttttcctt ttattttttt 8460 ttaggaaaaa ctaacaggtt ccacaatacc tttgagaatc aaaagatttt caaagtcttg 8520 agctagaaca gggacgttct ttaaaaaaag gggggagg 8558 // ID HELICOPY2 repbase; DNA; CEL; 249 BP. XX AC . XX DT 30-MAY-2000 (Rel. 4.1, Created) DT 30-MAY-2000 (Rel. 4.1, Last updated, Version 1) XX DE HELICOPY2 is a nonautonomous DNA transposon - a consensus. XX KW nonautonomous DNA transposon; HELICOP superfamily; CEREP2A; KW HELICOPY2. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 249-1 RA V V. and Jurka J.; RT "HELICOPY2."; RL Direct Submission to Repbase Update (SEP-1999). XX RN [2] RP 1-249 RA V V. and Jurka J.; RT "HELICOPY2."; RL Direct Submission to Repbase Update (MAY-2000). XX DR [2] (Consensus) XX CC There are about 200 copies of HELICOPY2 in the C.elegans genome. CC They are ~95% identical to the consensus sequence. The original CC name, CEREP2A [1], and orientation have been changed because CC this repeat is a nonautonomous HELICOP-like DNA transposon. XX SQ Sequence 249 BP; 54 A; 35 C; 79 G; 79 T; 2 other; atcatttact atatataaag cgcttattcy gtktgtccat agtttgtagt ctatgtagtc 60 tttgtagtct gtgaagtttt ggcttctgga gggatagtga gttggggtta gtgtagggat 120 atagtcgggg tactgtagtg gtacaatagt ggtacggtag gaatactgta gggttacggt 180 agtttcagaa aaattagttt tcagccccag aagtcggggg ccgcgccgga ggtgcggtcc 240 acggctggt 249 // ID NDNAX2_CE repbase; DNA; CEL; 2846 BP. XX AC . XX DT 12-NOV-1999 (Rel. 4, Created) DT 12-NOV-1999 (Rel. 4, Last updated, Version 1) XX DE NDNAX2_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW nonautonomous DNA transposon; DNAX family; NDNAX2_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2846 RA Kapitonov V.V. and Jurka J.; RT "NDNAX2_CE."; RL Direct Submission to Repbase Update (NOV-1999). XX DR [1] (Consensus) XX CC NDNAX2_CE is a putative non-autonomous DNA transposon; CC its copies are 98% identical to the consensus sequence. CC It shares common termini with NDNAX1_CE element. CC These elements do not have terminal inverted repeats. XX SQ Sequence 2846 BP; 797 A; 633 C; 524 G; 878 T; 14 other; tctgcctcct tcctaaaacc cccatcttgc cacacacacg cgttttcggt gttcaaaaaa 60 tgttcaaaaa atgtccgaaa aaattgtaat acaactttga acgtttatat ctcggctctc 120 actgaaccga tttttgaaat tcttttttct ctacattcac aaaatatagc gacgtgtgtt 180 catctaaaag ggggttgatt ttggagcatc ctgtcggctc cacaagcgga aaaccacacc 240 cckcttgttt ttgcgcacac cgaagtactg gccgaaaacc actttaaatt gaaggaaaat 300 tcgagtttat gttcaaaacg tttgagactc gaaacttaag ttctcctttt caatgcctat 360 ccattgtatc taagttcatt gtgatatctc tattcgttat tttaaaaacg ataattaaag 420 ttttgccaaa atttctgcac ctcccccctt ttctcacaca cacacaccca cctccccttt 480 tcatatttgg tgtgttttcg gccacatttt caccattgtt tcggtcaaat ctccgttagt 540 cttgctcttt ttccaaattt ttttttggaa aaatgttcat cacatcgaga ggaagcggaa 600 aatataattt tttgtttcaa aaaaggtaaa taatagtctc aaaaatagaa aaaatgaaca 660 cttctttcaa acaatttaca tttttccata aaattcacaa actttccact atcgttccgg 720 tttttatgtt aaatttagaa ttaatagaat gatttaaaat ggaatctcaa atcacagctc 780 gatccgatca tttttcgcgg agataggtta tttaattttt gcacatggaa atccagtttt 840 cctctgttcc tggacaagtt ttcaactatc cttttgattt tatgttttaa ttttgataat 900 tttcactaaa gcaaggtaga caaactaaat ttggtcggtc ttgacaactt tcagcacatt 960 trkggaagtt ttgagcttcc aggccttccc agaaggtgat atgtttcaat ttttgttatt 1020 tttaaaaata ataaccgccg tattattcta ctttttacgc caatttcaaa aattatttwa 1080 tttttctcaa actcctacta tgagcaaagt tatgacggat tgaagtttta gtgcataact 1140 cctcacaggc cgcctctcct cacaagaggc tgcatgcgtg cgggtggctg gaaaatgttt 1200 tctttccaaa aaaaaacata gaaaggcact ttaactctaa gaakattcag ttcactttga 1260 acatatactt caggcattgc aggccggagg cctgctgtgc gattacaact gtttcaagcc 1320 cagattgtct ggattgagtg ccgcaggcac tcttactctt agaagacgca gtcaacctta 1380 aatcattata ccaaggcagg ccggaggcct gatatgcgat tactcttaga taagtttgat 1440 ttagaaagct tttattattt tcagttttct aaaacaatcc ccattcccga agggctgggg 1500 gcccacaggc gttcacgatt ttccaaaaaa gtgaaatgtt tattttttgg ttatttttca 1560 aatttgaaaa caccatacga ttttggttta aatactaaat tgaaatagaa atttcgctct 1620 cttacgatct acgccctacc cactgctaag cctggcaagt cgcagatttg ctatcaacac 1680 tattggaaac tgtgggctgg ctttgatcct attccaagca gatctgttat tcactcaagc 1740 ctaataacca ggtttgaggg ccgcaggccc gaatactctt graaactgtg ggctggcttt 1800 ratcctattc taagcagagc aggctgtagg cctgcgacgc ggttgttatt yactcaaacc 1860 taacaaccag gttcgagggc cgcaggcccg aatactcttg gaaactatgg gctggctttg 1920 atcctattct aagcagatat gttattcact caagcctaat aaccaggttt gagggccgca 1980 ggcccgaana ctcttgraaa ctgtgggctg gctttgatcc tattctaagc agatctgtta 2040 ttcactcaag cctaataacc aggtttgagg gccgcaggcc cgaacactct tggaaactat 2100 gggctggctt tgatcctatt ctaagcagat atgttattca ctcaagccta ataaccaggt 2160 ttgagggccg caggcccgaa tactcttgaa aactgtgggc tggctttaat cctattctaa 2220 gcagagcagg ctgtaggcct gcgacgcggt tgttatttac tcaaacctaa caaccaggtt 2280 cgagggccgc aggcccgaac actcttggaa actatgggct ggctttgatc ctattctaag 2340 cagagcgcgt atgttattca ctcaagccta ataaccaggt ttgagggccg caggcccgaa 2400 tactcttgaa aactgtgggc tggctttaat cctattccaa gcagagcagg ccgtattcat 2460 gagttttcct tttccttcac cgttttttgt agttcacagt gatgattgaa racacgctta 2520 ttttattgat aatacattca aattattctt gaacaaaatc tcagcctaag ttattcgtac 2580 agtgtcggtc ctggagcaat gaatcaaagt ttaaagagat ttcactttac acagtaagag 2640 cttcgaggcg aagcctcaaa cctgaatagt caagttttct aaacatatac cccactcctg 2700 aaataatttc aargagatct cgaataaccc swtaacaggt tcgaggcgaa gccgagaacc 2760 tgcgccagct accttgctaa taacttagag cgaattaacc ctatccccat tcccgaaggg 2820 ctggcgggcc cataggcccg ctagtc 2846 // ID CERP5 repbase; DNA; CEL; 792 BP. XX AC . XX DT 28-JUL-1999 (Rel. 3.5, Created) DT 28-JUL-1999 (Rel. 3.5, Last updated, Version 1) XX DE C. elegans repetitive DNA (a consensus). XX KW Repetitive DNA; CERP5 repeat; CERP5. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-792 RA Jurka J.; RT "CERP5."; RL Direct Submission to Repbase Update (JUL-1999). XX DR [1] (Consensus) XX CC Nearly perfect palindrom. XX SQ Sequence 792 BP; 253 A; 143 C; 144 G; 249 T; 3 other; cactgcaact ttttcctcac gagggacgag gaaaagtggt ttctaggcca tggccgaggg 60 gccgacaagt ttcagcggcc atttatcttg ctttgttttc cgcctgtttt cttttgtttt 120 tcacagattt ttcccgtttt ttcttattaa aactgataaa taaatatttt ttgcagatgc 180 taaaacaatt tccaagtaaa aaaattatgt attcagtsgg caagcagcgg tgaaagtggt 240 caatgtaata tgatggatta cgggaataca aaacctaaac tttttctgaa acatgataca 300 tatgctgctt agatgctgaa actacctgat tttcataacg agaccgctga aaaagttttg 360 agattttcaa aattcaactt tttgtgtgaa aaagtcgaga ttttcacaya aaaagttgaa 420 ttttgaaaat ctcaaaactt tttcagcggt ctcgttatga aaatcnggta gtttcagcat 480 ctaagcagca tatgtatcat gtttcagaaa aagtttaggt tttgtattcc cgtaatccat 540 catattacat tgaccacttt caccgctact tgccgactga atacacaatt ttttcacttg 600 gaaattgttt tagcatctgc aaaaaatatt tatttatcag ttttaataag aaaaaacgga 660 aaaaatctgt gaaaaacgaa agaaaacagg cggaaaacaa agcaagataa atggccgctg 720 aaacttgtcg gcccctcggc catggcctag aaaccacttt tcctcgtccc tcgtgaggaa 780 aaagttgcag tg 792 // ID HELITRON1_CE repbase; DNA; CEL; 8484 BP. XX AC . XX DT 25-MAY-2001 (Rel. 6.04, Created) DT 25-MAY-2001 (Rel. 6.04, Last updated, Version 1) XX DE HELITRON1_CE is a Helitron DNA transposon, a consensus sequence. XX KW Helitron; DNA transposon; Transposable Element; HELICOP1; KW HELITRON class; HELITRON1_CE; helicase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-8484 RA Kapitonov V.V. and Jurka J.; RT "HELITRON1_CE."; RL Direct Submission to Repbase Update (MAY-2000). XX RN [2] RP 1-8484 RA Kapitonov V.V. and Jurka J.; RT "Direct submission."; RL Direct Submission to Repbase Update (MAY-2001). XX RN [3] RA Kapitonov V.V. and Jurka J.; RT "Rolling-circle transposons in eukaryotes."; RL Proc Natl Acad Sci USA 98(15), 8714-8719 (2001). XX DR [2] (Consensus) XX CC HELITRON1_CE is a HELITRON-like DNA transposon. It encodes a CC 1466-aa protein, CEHEL1p, composed of the rolling-circle CC initiator CC and DNA helicase domains. CC There are several copies of HELITRON1_CE in the genome, they are CC ~99% identical to the consensus sequence. CC Approximately 2% of the C. elegans genome belongs to molecular CC fossils of nonautonomous elements transposed recently (<10 Myrs) CC by autonomous HELITRON1_CE-like elements. HELITRONY1_CE and CC HELITRONY2_CE are examples of these non-autonomous elements. CC Minisatellite-like repeats RCA1, RCD1, RC35, RC123, RCC9, CC reported CC previously (Naclerio,G. et al., 1992, J. Mol. Biol. 226, CC 159-168), CC are portions of HELITRON-like transposons. CC The CEHEL1p protein is encoded by 9 putative exons. CC CDS: 1930-2087, 2135-2412, 2533-3428, 3476-3799, 3850-4520, CC 4581-5207, 5254-5521, 5574-5762, 5814-6803. XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="CEHEL1p" FT /translation="MDNELTSRPDSIMDTEPTTLPDSAMDTKPTNLPDFPS FT TSQEPVSRRTRSRTLSDNSNAVWKRERRSRETSQESESRLRMDWERKKEKR FT ASMSEEEKAEMKYKKRVWMKKKRNEVAKTHDTSSVANPNYLGSMRCVCKNC FT NARFFQGEVSSSNDHINMCCYFGLTTTLQDRFSNYPDELRHLLLDDSSHAR FT NFQKNIRQFNSSFAMASMGAQLDVPRGRGPYCFKIHGQVYHFAGPLHPDPG FT QRPAFGQIYILDTDQATDERLGNPANADCDPAVMAELSSLLLRTNPYAQAY FT KMMAEVEEKENSEAAKEGRHPGCVRLIFDISTTKDPRRYNLPQANEVAVVF FT VGEDDDVPTTRSLAVHPRGGGLKTIRDIDRICDPLTYPVLFPNGTDGWHPD FT LEKRPSEKKQGRITQKMYYSYLLMERSGVFNPLHHGRALFQQFAVDSWVKI FT EQNRLNYHRTHQVDLKAANYNAVQDYIAGDTDVPGRRIVLPSSFPGSPRAM FT VQNFQDAMAIVSKFGKPDIFLTFTCNPAWTEISENLGPRQSASDRPDLIAR FT VFKLKVDALFDDLLNRDILGHVAAYISVFEWQKRGLPHVHMLLTMAENSKP FT RTSEDIDKIVQAEIPNPDNEPELHRIVTTAMMHRPCGAQNPHSPCMVDGHC FT SKRYPKDFHPSTTLNVDGYPGYRRRDDGRYVEYGTQHLDNRRVVPYNKWLL FT LRYNAHMNVEICGFIEAVKYLFKYVYKGHDRAALNIIQNVRGDGNVVDEIR FT EHLDARYVCAPEAIHHILGFKLEKKASVTTQQLESSSQTDTTLTAWFKINQ FT KSKDIAESGNIPSTFVDSRQFFYMDMPTHFTFVKKDGWKVRGRGTRQIGRM FT YTVPPYETERYALRILLLNIKGATSFEDLRTVLDENNVPVVYATYVEAAKA FT QGLLNDDSEYLKSLKEWAGCSVPAALRSMFVAIILFNEVHDLNALWDAVKW FT DLSEDFRHAGAGKEEAEALAYFDIESRLQRVGKRFDVTKPSINPPPIDLDT FT VNPAQCASEGNRLLATLNDQQKRAADQILAALDDASLPRLFYLDGPGGSGK FT TYLYITLYNICVGRGLKVACTAWTGIAANLLPLGRTSASLFKLDIRNQCKS FT SLHQRQLKEAQELAENDVFIWDEASMVPKTALDTVDVLLRDLTKIDQPFGG FT KILILGGDFRQILPVVERSSRADQVDACIKRSPLWTEFQILHLISNMRVTS FT GDSDWIQFLLNVGDGSANDSDSKVTLPLSVMCDHNIVEEVFGAVIDPTTSD FT PCDNVILTPKNVDVAQLNDDVHNRMVGEERIYLSRDEVIVEHQADTMHYPT FT EFLNKMSPSSLPPHILKLKKGSVIILLRNLDVSAGLCNGSRFIVETLASHS FT LGCRFATGERKGHFTIIPRIDCYDDKNISFQLRRTQFPVRLSFALSINKAQ FT GQSFSKIGLWIPTDVFTHGQLYVALSRVRTKEGLIVKSSSNIVTNIVFNEV FT L" XX SQ Sequence 8484 BP; 2476 A; 1919 C; 1648 G; 2441 T; 0 other; tcattactta tatataaaaa aatagtgtgg gtctgtccat agtttgtagt ctatgtagtc 60 tttgtagtct gtgacgtcac ccctaaagtc actgagagtt gtgggcgggg ccagtcaccc 120 ttcgtggcga gacccatcgt tgcgagaccc ttcgtggcga gacccatcgt ggcgagaccc 180 atcgtggcga gacccatcgt ggcgagaccc atacattttg gcgggaattt aaattttctt 240 tgaaattttt cggcgggaat tcaaattttc tgtgaaaatt tttggcggta gctcaaattt 300 tctgaaacat ttttggcggg aattcaaaat ttctctgaaa atttttggcg ggaatttaaa 360 ttttctttaa aattttttgg cgggaattca aattttctgt gaaaaatttt ggcgggaatt 420 caaattttct gaaaaatttt tggcgggaat tcaaattttc tctgaaaaat tttggcggga 480 atttaaattt ccttcaaaat tttttggcgg gaattcaaat tttctgaaaa atttttggcg 540 ggaattcaaa ttttctctga aaaattttgg cgggaattta aatttccttc aaaatttttt 600 ggcgggaatt caaattttct ttgaaaattt ttggcgtgaa ttcaaatttt ctgaaacatt 660 tttggcggga attcaaattt tctctgaaaa tttttggcgg gaattcaaat tttctttaaa 720 attttttggc gggaattcaa attttctgtg aaaaattttg gcgggaattc aaattttctg 780 aaaaattttt ggcgggaatt caaattttct ctgaaaattt ttggcgggaa tttaaatttt 840 ctttaaaatt ttttggcggg aattcaaatt ttctgtgaaa atttttggcg ggaattcaaa 900 ttttctgaaa aatttttggc gggaattcaa attttctctg aaaatttttg gcgggaattt 960 aaattttctt tacaattttt tggcgggaat tcaaattttc tgaaaaattt ttggcgggaa 1020 ttcaaatttt cagtgaaaaa ttatggcggg aaattcaaat ttttagagaa attttgtcgg 1080 agtagtcaaa tttaactttg tttgacagaa aaaaaaagta tgaaggggaa gatagtttga 1140 accacagaag gttcacacca aattcccgca cacaaccaat gcgacatacc ggcacatttg 1200 tttagaattt aataaaccta tagaagagga aaacttcatg caataaactt cgagaatctt 1260 ctagaaattt cgaaaaactt ctagatatct cggattacgt aacacttgac gacgtcacac 1320 atgatgagga cattaagtaa acgtcacagg ctgaacccaa tataattttc agaaaattct 1380 ggaaccttct agaattttgg agaaacttcc agaaaattct gatgacgtca caatcactgg 1440 cgtaacaaaa aataagtcac agattatgac gtcacatatg atgacaccta tatagtctca 1500 cattcggaaa aaaataggtg atgacgccaa atatatcagc gtcacagaat gtgacgtcat 1560 aggctccgcc cattacagta aaagaaactt ctggaatctt ccagaagttg cagaaaaaga 1620 agagaatgtt ccaataacgt cacaaaagat gacgtcactg ataaagacat cacaggaact 1680 acccattaat ataatcaaga agattctaca accatctaga aacttctgga tttttcaatg 1740 tagtgtgcaa aatttaaatt ttgactattt cattgtgcaa tttttaattt atattattac 1800 aagtttaaac tatattaaat ctgtccttct aaaacccaat tcgtttcaag acccaaaact 1860 tctagaatct tccaaaaaaa ttctatataa ggagaagaaa attatcattt tctcattatt 1920 aatcaacaat ggataacgag ctaacaagtc gacccgattc aataatggat acggaaccaa 1980 caactctacc cgattcagca atggatacca aaccaacaaa cctgcctgat ttcccttcta 2040 catcccaaga acccgtctct aggaggacac gttccagaac attaaggtaa ggatcttgaa 2100 tgctcgtgga attttaccaa taagcagttt tagtgacaac tcaaatgctg tatggaagag 2160 agaacgtcgt tcaagagaaa cttcgcaaga atcagaatca agactcagga tggattggga 2220 aagaaaaaag gaaaaacgag cttcaatgtc agaagaggaa aaggctgaga tgaaatataa 2280 aaaaagagta tggatgaaga aaaaaagaaa tgaagtagca aaaacacacg atacgtcatc 2340 tgttgccaat cccaactacc ttggaagcat gagatgtgtg tgcaagaact gcaatgccag 2400 attctttcaa ggtatacatt ttttcaaagt ttagtgaact tctacttctt atttccattc 2460 tgagtttcag attggttttt tttcataact aataacatat actagcatca tacagaaatt 2520 taatgtttca ggagaggtgt cctcatccaa cgatcatatc aatatgtgtt gctacttcgg 2580 attaacaacc acacttcaag atcgattctc caactaccca gatgaactac gacatcttct 2640 cctcgatgat tcttctcacg cccgtaactt ccagaagaac atccgacagt ttaattcatc 2700 gttcgcaatg gcatcgatgg gtgcacaact tgatgttccg agaggccgcg gcccatactg 2760 cttcaaaatc cacggtcaag tttaccattt tgcgggacca cttcacccgg atccaggcca 2820 gagaccagca tttggacaga tctacatcct ggacactgat caagcaacag acgagaggct 2880 tggaaatcct gccaatgccg attgtgatcc tgctgttatg gctgaattga gcagtcttct 2940 gttgagaacc aatccatatg cacaggcata caagatgatg gctgaagttg aagaaaaaga 3000 aaattctgaa gctgcaaagg aaggacgaca tccgggatgc gttcgactga tcttcgatat 3060 ttcgacaact aaggatccac gacgttataa cttgccacaa gcgaatgaag tcgcagttgt 3120 ttttgtgggc gaagacgatg acgtgccaac cactcgctct ttggctgttc atccaagagg 3180 aggaggactt aagacgattc gtgatatcga taggatctgc gatcctctca catatcctgt 3240 attgtttcca aatggaaccg acggatggca tcctgacttg gaaaagagac catcggagaa 3300 gaaacaagga aggattactc agaagatgta ctacagttac ctgctcatgg agagatcagg 3360 agttttcaat ccactgcatc acgggcgtgc cctgtttcaa cagtttgcag tagactcatg 3420 ggtaaaggta agaatacatg atctcatgtt cacaaaataa tcatgttttt gcagatcgag 3480 caaaacaggc taaactatca tcgaactcat caagttgatt tgaaagccgc caactacaat 3540 gcggttcagg attatattgc tggagatact gatgtcccag gaagacgcat cgttcttcca 3600 tcatcttttc cagggagtcc ccgagcaatg gtccaaaact ttcaagacgc gatggctatt 3660 gtctcgaagt tcggaaagcc tgatatattt ttaactttca cgtgtaatcc tgcatggaca 3720 gagatttctg aaaacctcgg tccgcgacaa tctgcatctg atcgtccaga ccttattgcc 3780 cgtgtcttca agctaaaagt aagttgaaga tctatattta actcatcact ttcttcattg 3840 tgttccaggt agatgcgctg tttgacgatc ttctaaatcg ggatattctc ggacacgttg 3900 ccgcatacat ttccgtgttc gaatggcaaa aacgaggtct tcctcatgta cacatgctgt 3960 taactatggc agagaactca aagccacgaa cctcggagga catcgacaag attgtgcaag 4020 ctgagattcc aaatcctgac aacgagccag aacttcatcg cattgtgacc accgccatga 4080 tgcatcgtcc atgtggtgct caaaatcctc actcgccatg catggttgat gggcactgct 4140 cgaaaaggta tccgaaggat tttcatccta gcaccactct caacgttgat ggatatcccg 4200 ggtatcgtcg aagggatgat ggacgatatg tggaatatgg cacccaacat ctcgataaca 4260 ggagagttgt tccatacaac aaatggcttc tgctccgtta caatgctcac atgaatgtcg 4320 agatttgtgg attcatcgaa gctgtaaaat atctcttcaa atatgtctac aaaggacatg 4380 accgtgctgc actcaacatc atccaaaatg tacgtggaga tggaaatgtt gttgatgaga 4440 tccgagagca tctagatgct cgatacgtct gtgctcccga ggcaatccac cacatcttgg 4500 gattcaaact ggaaaagaag tccgatacag tttattgact cgctgtgcat ctcgaaggat 4560 tccaaacgat ctactttaga gccagtgtca ccacacaaca actcgaatct tcatcacaaa 4620 ccgacacgac tctcacagct tggtttaaga tcaaccaaaa gagcaaggat attgcggaga 4680 gcgggaacat cccttcgaca tttgttgatt cacgccaatt cttctacatg gatatgccaa 4740 cgcacttcac ctttgtgaaa aaggatggct ggaaggttcg aggaagaggc acaaggcaga 4800 taggccgaat gtacactgtc ccaccatacg aaaccgagag atacgcactt cgaattcttc 4860 ttttgaacat caaaggagcg acgtcattcg aagatcttcg tactgttttg gacgagaaca 4920 atgttccagt ggtatacgca acttatgtgg aagcagcaaa agctcaagga cttctgaatg 4980 atgacagcga atatttgaag tctttgaaag agtgggctgg atgctcagtg cccgcagctc 5040 ttcgctcaat gtttgtcgct atcattcttt ttaacgaagt tcacgatctg aacgcacttt 5100 gggatgccgt gaagtgggat ctttctgagg attttcgcca tgctggagca gggaaagaag 5160 aggctgaggc acttgcatat tttgatatcg aaagtcgcct tcaaaggtat cttttgagct 5220 tttttcaaaa gtagattata atatttttcc agggttggaa aaagattcga tgtcaccaaa 5280 ccatctatca atccgccacc aatcgatctc gacacagtca accctgctca atgtgcatca 5340 gaaggcaata ggctccttgc gacactcaat gatcaacaga aacgtgcagc tgaccagatc 5400 cttgctgcat tagatgatgc ctcgctaccc cgtcttttct atttggatgg gcctggaggg 5460 agcggaaaga cttatctcta catcactctg tacaacatct gtgttggaag aggactgaag 5520 gtaaactttc aaccaaacat actaaagtta tttcaatact gcatattttc aggtagcgtg 5580 tacggcgtgg acaggaattg cagccaattt actcccactc ggtcgaacat ccgcttccct 5640 tttcaagctt gatattagaa atcaatgcaa gtcttctctt catcagaggc aattgaaaga 5700 agcacaagag ctagcggaga acgacgtttt catttgggat gaagcatcaa tggttcccaa 5760 ggtattagag tgattctgat cgtactaatc aaatcattca ttcaactttc agacagccct 5820 cgacacggtc gatgtactgc ttagggatct cacaaagatc gatcagccat ttggtgggaa 5880 gatcctgatc ctcggaggag actttcgcca aatcctacct gtggtggaga gaagctctag 5940 agctgatcaa gtagacgcat gcatcaaaag atcaccacta tggacagagt ttcaaatttt 6000 gcatctcatc tcaaatatga gagtaacatc tggagactcg gattggatcc aattcctcct 6060 caatgttggc gatggatctg caaatgactc tgattcaaag gttacactcc cactctctgt 6120 catgtgcgat cacaacatcg tcgaggaagt gttcggagca gttatcgatc caaccacgtc 6180 tgatccatgt gacaacgtca tccttacacc taagaacgtc gacgttgctc aactgaatga 6240 cgatgttcac aatcgaatgg ttggcgagga aagaatctat ctgtcacgag atgaagtgat 6300 cgtggaacat caagcggata ctatgcacta tcccaccgag tttctcaaca agatgtcccc 6360 atcatctctt cctccccata ttctcaagct gaagaaaggc tcagtgatca ttctgctcag 6420 gaatctcgat gtttccgctg gcctttgcaa tggttcccgt ttcattgtcg aaacattggc 6480 cagtcactca cttggatgtc gctttgcaac tggcgagagg aaaggacatt tcacaatcat 6540 tccaagaatc gattgctatg atgataagaa catctctttc caactcaggc gcacacagtt 6600 cccagttcgc ttatcatttg cattatcgat caacaaggcc caaggacaat ccttttctaa 6660 aatcggtctc tggattccca ccgatgtttt cactcatgga caactatatg ttgctctatc 6720 aagagttcgt accaaagaag gtcttattgt aaaatccagt tctaatattg tcaccaatat 6780 cgtttttaat gaagttcttt gaataaagct tattattctt ctctgctatc tcgctagaca 6840 actcagctct cttctgaggt ctcactcgac aactcatctc ttctctgcta tctcgctaga 6900 caactcagct ctcttctgag gtctcactcg acaactcatc tcttctctgc tatctcgcta 6960 gacaactcag ctctcttctg aggtctcgct cgacaactca tctcttctct gctatctcgc 7020 tagacaactc agctctcttc tgaggtctcg ctcgacaact cagctcttct ctgctatctc 7080 gctagacaac tcagctctct tctgaggtct cactcgacaa ctcatctctt ctctgctatc 7140 tcgctagaca actcagctct cttctgaggt ctcgcccgac aactcatctc ttctctgcta 7200 tctcgctaga caactcagct ctcttctgag gtctcgctcg acaactcatc tcttctctgc 7260 tatctcgcta gacaactcag ctctcttctg aggtctcgct cgacaactca gctcttctct 7320 gctatctcgc tagacaactc agctctcttc tgaggtctcg ctcgacaact catctcttct 7380 ctgctatctc gctagacaac tcagctctct tctgaggtct cgctcgacaa ctcatctctt 7440 ctctgctatc tcgctagaca actcagctct cttctgaggt ctcactcgac aactcatctc 7500 ttctctgcta tctcgctaga caactcagct ctcttctgag gtctcgctcg acaactcatc 7560 tcttctctgc tatctcgcta gacaactcag ctctcttctg aggtctcact cgacaactca 7620 tctcttctct gctatctcgc tagacaactc agctcttttc tgctatctcg ctagacaact 7680 cagctctctt ctgaggtctc actcgacaac tcatctcttc tctgctatct cgctagacaa 7740 ctcagctctc ttctgaggtc tcgctcgaca actcatctct tctctgctat ctcgctagac 7800 aactcagctc tcttctgagg tctcgctcga caactccagc cttctcactc gcccccatat 7860 gtaggagtat ggggggaggg agtattatta taaccacaga aacacaacaa tgtgccacat 7920 aagagacagg agatagtgct cgcactaccc ctgaatcacc aaccctctca gaagcacctc 7980 tgctcttatc tgggtctcgc tagacatctc cagtagtttg accattctca gcaacacttt 8040 tgctatttta gaattttttt gccctcgcgg ggatcgaacc aaaaggtccg cgcgccaatg 8100 acgcgcgcgc taccagctgt gccatgctgg aagcgattga acttcgacca ataacaatat 8160 acaacatttc tttatttgtc aaagtgaaat agttttgaac aaacactatt caaaaaagta 8220 attttaaatt gaaattatat caaattcacg atttgttccc aaatatttaa aactgaaact 8280 caataagaga agatctcata ttctcactga taatccgaaa catgacattt gtttcaatgc 8340 tgttttcaac ctcaagttgt aaaattcaaa aaccaagctg aaaatatgtt ttaaaaatct 8400 agaaactaca ataattttta tgtgaatttt tggaaaaatc tcaagccaga cggagcacgc 8460 gccttggcgc gtgcgaacgg ctgg 8484 // ID MIRAGE1 repbase; DNA; CEL; 5625 BP. XX AC . XX DT 18-NOV-1999 (Rel. 4.1, Created) DT 18-NOV-1999 (Rel. 4.1, Last updated, Version 1) XX DE MIRAGE1 autonomous DNA transposon - a consensus. XX KW Mirage; DNA transposon; Transposable Element; transposase; KW Autonomous DNA transposon; TIRs; Mirage superfamily; MIRAGE1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-5625 RA Kapitonov V.V. and Jurka J.; RT "MIRAGE1."; RL Direct Submission to Repbase Update (31-OCT-1999). XX DR [1] (Consensus) XX CC MIRAGE1 is a very young autonomous DNA transposon. CC There are more than 10 copies of the element fixed CC in the genome. Some of these copies are almost CC identical. CC MIRAGE1 generates 2-bp duplication of a target site upon CC integration in the genome. MIRAGE1 has 53-bp TIRs and CC encodes transposase (several exons). CC PALNN1_CE is a nonautonomous form of MIRAGE1-like CC transposon since it shares common TIRs with MIRAGE1. XX SQ Sequence 5625 BP; 1762 A; 1092 C; 1077 G; 1694 T; 0 other; ctcagataaa ctggagtcac ggcagagcga cagaaaatca cttctaatca gcaaaagcta 60 gtaaagccat agtttatcag cagagaacca ctgctaatta gcaggtcatt gctaattagc 120 taaaaagcac tgctaattag cagaaaccca ctgctaacta gcagaacatt gattattgat 180 tttattttgt tttttcttcg tcgtatttca catatttcat ttttttgacc ttttttattg 240 tattttttgt gtgatatgtt atcaatatgt gttaattgtt aattccagtg tgtttttcaa 300 agatttgtat cattcccggc tagaaacgga ggattatggc cagaaatcca agaacaacag 360 ttcgaagaat tttgaattca gcaccccgga gtcttctgat tcaaagcgtg acaatagttg 420 ctctaggtaa attttttaaa tgattttatt gggactaaat aactactttc agacccgacg 480 gacaataatc atggttctca ccagccggga agcaatggtc catcaaattc aagcacccag 540 gccgataact tcgagacgga aacaactgca gatgaagaac gacaaaagga agatggtgta 600 gaagatgaac tcgaagacga agaaacaaca ggattcattg atgaagacgg ggaaattcgt 660 tatccggtcg atgaagtggg aaattacgag atcgatcaag aagctgaaac cgatgagtca 720 ttttatttgg acactcaacg cgattcttct ctatgcgacg agcgaagtca actgtgtcgt 780 tctctgagcg atcaaaatct gagaacaact tttttcctca tcactactac caactcgaaa 840 gcagccattt tgagatatct acattttcac aatgtttcac aagatagcat tctcagaaaa 900 ttgctactga atgactctca tctaacactt gtagaagtat gtaatttatg tgcgcagaat 960 atcgaactgt aggtttatta taaaactaaa tttctgaaat atatcattac agatgcaaat 1020 gtgaagatat gcaagagaaa tgccaactaa ttttcggaga tctcaagaaa tggctcgaag 1080 ttatattgaa tagttttgcg aatatcatga tgaagactaa aacaactatt ttggaaggat 1140 ataatagctc cagtccaata aataagggta tctttaaaaa aaagctactc agcgaaatgg 1200 cagataatga aatgtggatg cacttcgaga gtagcttcga cggaatcaaa gtacataaaa 1260 atggaaagta agtatcttag tttgatggct ctaaaaattt taattggttt tcagtcagaa 1320 catttggcca tatagtttac tcaatttgga tctcgaagac gttcatcgag cttctcccca 1380 ggccctcatt ttagcagcgt tattcatcgg gttcaaaaat ccgacaacga aaattcatga 1440 tagacttaca caatggattc tgctccaaat ggacgaacat gtcttcttct cagagggagt 1500 ggcctggaaa gcggatctca cgtgtgccaa tcatgacgat ccagtgagat gaaaattcag 1560 aattttgaaa acaagtgttt ggtttcaggc acgacgaatc gtatacaatc agtgtggact 1620 ccgatcatca ggttcttgca acttttgctt aaaccaggaa actgagtgca aaatcaatga 1680 tgagtataca acacgtaagc atcatgaaaa ctattattaa aaaaataatc acctctttat 1740 agtacgcgaa aatctcaccg gatcgtgccc ttcgactatg aacgatggcc ttcgtcaaag 1800 aaaccagcat tctcatttcc acaagcatgt tttgtatatg catctatgcc cggttgactt 1860 gtttcattgt tttgaagaag gaattttaag caatatcaat acaggtcact agtaatgccc 1920 gaatgctccc gaaacgtgtt catttttcag cattattctc caaagggaaa tggaatttgt 1980 tcccgtcaaa atcggtggat ttttcatcaa caccttcgct tccatctcga tttcgttcgt 2040 tgtccggcaa agcttcacag tgtactggtt ctgagaaagc actggtaaaa agattcgttc 2100 agctaatttc tgcaaagtga acacacattt agattttcga gtcgatagtt gtagcagctg 2160 cattttccgg tgaacttgga ggtgtaccat cggctatcat tctcagcatt catggacttt 2220 atcgtctttg catagaaccc caatcaatca ctgacgagac tcttcatttg aaagttagtt 2280 tttattgttt ttgcttgaaa aaagtagacc attctttaca gattgaacaa atcagcaaat 2340 ccatcgaagt tttaattgtg aaacgagcac cggaaatgtt gaatgggatc aaagtacatg 2400 tatggaaatt gtcgtcaact aacacccatt actattttgc agcaagtgtt gtaccacctt 2460 gcaagaatga ctgagctgta cggtagctta tttccattgt caactcaatg gttcgagtac 2520 ttctatcaca ctattcaaag aactttagtt ccggaaattt acaatggact gggcatgagc 2580 atcatgagaa agtgagaaaa ttagagcaca atagggagca agaatgtttt tacagaatgt 2640 cggctcttca agagatcaaa accgaagctg cattccgtct acacacaaat caccagtttc 2700 gaacagaagc gaacttgaaa atgtgctatg aacttggatt attaaggcga ctcaaacgaa 2760 tatcggttgt tccgacccct ccgcagtacc aacagttttg cgagaccggg gatgttttcc 2820 ttaatgttgt ataccacgac ggcatcagat attcgaatta ttgcaacaaa aagacagatg 2880 actcaaatgt tacgtttatc gaatatgctc gcgtacgatt tgcaaagatt attgggatgc 2940 tgttgaagga aaacaacact gatcttcgtt ttattttgaa gaaatactgc aacacaaatg 3000 atcatttgat tgacatatcg aacaagttat atgaagagga tgtaccgcaa atgcatattc 3060 aaaactgggt tgattcagtt tcaaaatcga gtttcggagg tcgtgtcacg ttaacaaaca 3120 cagttgtcgt ggtttccgcc gattccattg ttggacatgc agtggttgtg aagcgaaata 3180 acaactgtgt ggctcttcca ttttctcgcc gtatcagtct atcttgactt tgataaccac 3240 atatagcttt gaatttgttt gtcttttgtc tgtccccccc cccccccgct cgaaaaaatg 3300 tctgcacctc ccccttcttt gacaaatgtt tgtctctacc ccttgctcga aaaatgttct 3360 cattatgttc ttgaggaatt ctttcagatt tttaaaagtt gtttgtttta atattttcat 3420 gaatgcttat ccgctttcaa ttaatttgtg tttatttgat gtgataatct gttctgttca 3480 accctgctta tttaatctta ttttattaac agatctaaaa atcaatttca gaaaaagtca 3540 tgtcattcct cgagaaacta gtggcaaaat acgagcccat taaagaagaa aaaccaaaat 3600 tagttccaaa gccgacaaga aaacggcaac cggaaactac ggaaatcgtt gtgaagactc 3660 gccgtattac tcgccgaaat gctaaggtta gtggaaggag ccacataaca ttaaatgtga 3720 acccaaagct aacctattcc tatgagtttt aatatttctg tcattttcag gcacctgtaa 3780 aagaagagga gattggacat caaaaagctc aaatcacgat cgtagatgac gaagagttta 3840 tggagcaacc tgtacaggag ccgattgaga ttgaagagga gcaacaagaa gaatctcatc 3900 tggaaattgc tcttccggag caaccaaatc atgagccaat gcaatcaagt ccatcgttgc 3960 ctagaaaatc gttcaaaccg gatatgcatc caattgcgaa ggtgtttact tctgctaatt 4020 tcgagttgct cggaaataaa ttaatttcag aaaggcggaa aaatctgtat tgtagatcta 4080 ggacttacat ctcacacact ccccaccgaa caaactccag gattcttggt aagttttttt 4140 ttccaaaaat ctggcatgat caaaattgtt gacttttaga aaagactcgg agagaacgtt 4200 ggaggatcaa ttgactatag ttatgaatcg attgacgagt tgatccgcac tgctccggat 4260 gcttggtcta aagcttcaat gggtgcaatt cgaggtgtta tgctcatggt cagttttagt 4320 tactatcttg aatttgcaat taatattgga gaaaatttca gcaggaactg gttactgagc 4380 gatcaatgca cccgaaaagt aaaccaatgg agaagaatcg tttacttgta tgtttataag 4440 aatggaacga ttttgagcat ttgaaaatgt tatttacagg gtgaagaaat ggcaaaattg 4500 aaccctaacg attttctgaa ccacaaactg ctggaagttt cgagtttgga tcaaactggt 4560 ataccaagaa cataaataaa aaccgccccc aagaacattc tatttcagtg tttatcaacg 4620 tcaatctcgt gttttcggcg ctgagcattc aacatggctc ggttagatca attgtcaatg 4680 ctatgtcaaa tcactggatt gactacgtga gcgatcaaat attacgttct atttttgact 4740 atttaacatt tcagctgatt ccgagagagc aacaaggatc cttcacgaca agcaaaaaga 4800 cggagcctct tactttcttc ccaggagact tgacaaattg gccagccggt tagtttatga 4860 tttacatggg cgagccacat aaaaatcgtt ttcagctctt ctcgctgcgg caatcaagtt 4920 ggacaacgac caggtggacg ttactcgcct tccaagtgtg gtcaatatgc tgttcaagca 4980 tcatatcaaa gtcagattgt aggtttcttg ttatttactt taaaaagcca gcgaatagaa 5040 tttttcttta acttgaaaaa ctatgttttc acaaatgtct ttttgcagcg atcatgcccg 5100 tggtttgtat cgaacaaaac acgacggcca gaatccacca aaaattgtgc tgccaaagcg 5160 tcccatcatc gattgcacac aattctacaa gagcagcgag gatatcgatg aggagcaagt 5220 tccatcaact tcgaactttc aatcatcttc gaactctcta tcaacttcga aatctgctgg 5280 aagggaaaat atcccagact ccccataaat tcactagttc tcctgttttc tgatactttc 5340 gaaacttgtt tttttgtttt tttttccatt tttcgttaaa tgtttttttt ctgttgtttt 5400 ttcattttgt ttcattttca taataaatcg tgttattttt tatggtttta taaaagttta 5460 atccaatatg aaagcaaaac tagctagcaa gaaagcataa ctacgtagca aaacatcacc 5520 gctaatcagc ggaaaagcac aattactgcg gttgactaag ccgtgacata aatgctaatt 5580 agcagtgatt ttctatcgct ctgccgtgac tccagtctat ctgag 5625 // ID RCD1 repbase; DNA; CEL; 359 BP. XX AC X61258; XX DT 28-SEP-1995 (Rel. 1, Created) DT 28-SEP-1995 (Rel. 1, Last updated, Version 1) XX DE C.elegans RcD1 repetitive DNA. XX KW Repetitive DNA; RCD1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-359 RA La Volpe A.; RT "Direct Submission."; RL Direct Submission to Repbase Update (16-AUG-1991)A. La Volpe, CNR RL International Institute of Genetics and Biophysic, Via Marconi RL 10, 80125 Naples, ITALY. XX RN [2] RP 1-359 RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and Volpe A.; RT "Molecular evolotion of clusters of satellite-like DNA sequence RT in Caenorhabditis elegans."; RL Unpublished. XX RN [3] RP 1-359 RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and Volpe A.; RT "Molecular and genomic organization of clusters of repetitive DNA RT sequences in Caenorhabditis elegans."; RL J. Mol. Biol 226, 159-168 (1992). XX DR GenBank; X61258; Positions 1 359. XX SQ Sequence 359 BP; 90 A; 120 C; 89 G; 60 T; 0 other; gggtctcacc acgaagggtc tcaccacgaa gggtctcacc acgattggtc tcaccacgac 60 gggtctcacc acgaaaggtc tcaccacgat tggtctcacc acgaagggtc tcaccacgaa 120 gggtctcacc acgatgggtc tcaccacgaa gggtctcacc acgaagggta tcaccacgat 180 gggtctcacc acgaagggtc tcaccacgat tggtctcacc acgaagggtc tcaccacgaa 240 aggtctcacc acgattggtc tcaccacgaa gggtctcacc acgaagggtc tcaccacgat 300 gggtctcacc acgaagggtc tcaccacgaa gggtctcacc acgatgggtc tcaccacaa 359 // ID NPAL5A_CE repbase; DNA; CEL; 237 BP. XX AC . XX DT 31-AUG-1999 (Rel. 4.07, Created) DT 31-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE NPAL5A_CE is a putative non-autonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE32A; KW MITE; NPAL5A_CE; Non-autonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Belknap R.W.; RT "NPAL5A_CE."; RL Direct Submission to Repbase Update (AUG-1999). XX RN [2] RA Jurka J.; RT "NPAL5A_CE."; RL Direct Submission to Repbase Update (AUG-1999). XX RN [3] RP 1-237 RA V V. and Jurka J.; RT "NPAL5A_CE."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [3] (Consensus) XX CC The final version of the consensus sequence [3] have been CC determined CC based on identification of 5 bp-long target-site duplications. CC NPAL5A_CE has perfect 9 bp-long terminal inverted repeats and it CC forms non-perfect palindrome. XX SQ Sequence 237 BP; 72 A; 52 C; 53 G; 60 T; 0 other; cacggcccgg caagtggtac atccatgcaa atgcgctcta ctgataattt gagtgtagac 60 caggtttggg cgcgtgataa cgaaaaaagc tttggtccaa aaaatttaga atttaatttc 120 ggacattttt tatatgcatc acaaaaaaac tggaccaacc gtttttgaga tacacgcgcc 180 caaacgtcca ggtatacggt agacaaattg cgtacaggta ccacttctcg ggccgtg 237 // ID CELE45 repbase; DNA; CEL; 398 BP. XX AC . XX DT 23-AUG-1999 (Rel. 3.6, Created) DT 23-AUG-1999 (Rel. 3.6, Last updated, Version 1) XX DE C. elegans Cele45 repeat - a consensus. XX KW MITE; CELE45. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-398 RA Belknap R.W. and Surzycki A.S.; RT "CELE45."; RL Direct Submission to Repbase Update (AUG-1999). XX RN [2] RP 1-398 RA Jurka J.; RT "CELE45."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [1] (Consensus) XX SQ Sequence 398 BP; 106 A; 99 C; 84 G; 109 T; 0 other; actacttatt ttattgaatt tattaaatgt ttttcattta aaaaaaatta aagattaaag 60 gacgatccgt tcttcaagtg ctatgcactg cggatctggg gttcaggtac actgcctggt 120 ggtgatcacc tctgggctgt aatttaagcc acgtcctagc cggggactgt ggccgataat 180 ccagtcgtgg attgctccac ttcccaatag aggctgggtg aacctagggg ggtgagacca 240 gacttgaact cgtgacctcc agactgctag cagccaccac taccgactga gctatctgcc 300 cccccccccc cccttgaaaa aagagacatt attcgtcatt ttttgtcagc ttatcaatca 360 ctacgagtga ataaacaccg aaattgattt acagccca 398 // ID CELE1 repbase; DNA; CEL; 307 BP. XX AC . XX DT 23-AUG-1999 (Rel. 3.6, Created) DT 23-AUG-1999 (Rel. 3.6, Last updated, Version 1) XX DE C. elegans Cele1 repeat - a consensus. XX KW MITE; CELE1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-307 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification and characterization of putative transposable DNA RT elements in solanaceous plants and Caenorhabditis elegans."; RL Proc. Natl. Acad. Sci. U.S.A 92(19), 8886-8890 (1995). XX RN [2] RP 1-307 RA Jurka J.; RT "CELE1."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [2] (Consensus) XX SQ Sequence 307 BP; 122 A; 43 C; 43 G; 95 T; 4 other; aatgtagttt cgaaattgaa aaaaatcatt aaaaatgaat aaaaattyta taaartttct 60 aacaattaaa tttgttatta cgggaacaca aaattctgag aatgcgtatt gcacaacata 120 tttgacgcgc aaaatatctc gtagcgaaaa ctacagtaat tctttaaatg actactgtag 180 cgcttgtgtc gatttacggg ctcgattttt gaaatgaatt waaattattt atcrataaaa 240 taatatgaaa ttcaatttca aaaatcgagc ccgtaaatcg acacaagcgc tacagtagtc 300 atttaaa 307 // ID CELE42 repbase; DNA; CEL; 220 BP. XX AC . XX DT 23-AUG-1999 (Rel. 3.6, Created) DT 23-AUG-1999 (Rel. 3.6, Last updated, Version 1) XX DE C. elegans CELE42 repetitive element (a consensus). XX KW MITE; CELE42. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-220 RA Belknap R.W.; RT "CELE42."; RL Direct Submission to Repbase Update (AUG-1999). XX RN [2] RP 1-220 RA Jurka J.; RT "CELE42."; RL Direct Submission to Repbase Update (AUG-1999);. XX DR [2] (Consensus) XX SQ Sequence 220 BP; 78 A; 51 C; 35 G; 54 T; 2 other; tcgtggtact ccacacggac aaatacattt agttttacaa ctaaaaacga gccgcgacgc 60 gacacgcaac gcgccgtaaa tctaccccag atatggccga sccaaaatgg cctagttcgg 120 caaactcttc catttcaatt tatgagggaa gccagaaatc cgtgaaatat ttagcaaaaa 180 attcaaattt tttcagatat ttacatttaa atayccaaaa 220 // ID HELICOP2 repbase; DNA; CEL; 5516 BP. XX AC AC006730; XX DT 23-MAY-2000 (Rel. 4, Created) DT 23-MAY-2000 (Rel. 4, Last updated, Version 1) XX DE HELICOP2 is a DNA transposon. XX KW DNA transposon; helicase; HELICOP superfamily; HELICOP2. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-5516 RA Kapitonov V.V. and Jurka J.; RT "HELICOP2."; RL Direct Submission to Repbase Update (MAY-2000). XX DR GenBank; AC006730; Positions 23892 18377. XX CC HELICOP2 is a DNA transposon; it encodes a DNA helicase-like CC 904-aa protein which is putatively involved in proliferation CC of HELICOP2 copies. There are about 100 nonautonomous elements CC in the genome, and they are 96% identical to HELICOP2. CC On average; these elements are 500-1500 bp long. In several CC cases, HELICOP2 forms 7-9kb satellite-like clusters composed CC from a ~200bp portion of HELICOP2. CC The HELICOP1 protein is encoded by 7 putative exons and its CC CDS is defined as: CC 1834-1922; 1973-2235; 2431-2988, 3112-3296, 3657-3970, 4021-4294; CC 4343-5374 . CC The HELICOP2 protein sequence: CC MNEFSQKTDFPSTSDSQEPVSKRTRSRFKSDSSGAERKRKQRGKESSEEREARHQYDQDR CC KRKKKASESKDELKVLLLKDRERKARERREAKGISDGPNYLGPMSKTCDHCKAVFFESKN CC FQSNIRQFNSAFSMASMGAHLDIPKGHGPYCYRIHGQVYHLAGPLHPEPGGRPSYGQIYI CC LDTAQAIEERLGNPANSKCDPQLMAELTKLISTRNPYAKAYKMMAEVEEKENTDALKEGR CC VAEEVRLIFDISTTKDRRCYNVPVSNEVAVVFVGEDQDIPASRSLAIHPRGGGLTAIRDI CC DKISVKYLFKYVYKGHDRAALNIIRSRLKEGIVDEIKAHLDARYVCAPEAIHHLFGFKIE CC RKSDTDIRTVIDDDGVPVVYPTFVEAAKARGLLKDDSEYLKSLKEWIGCSVPAALRSLFS CC SILLFNEVHEPQVLWDTVKRDLSEDYRHAGARDDEAEALAYFDISSRLHRGGKAFNDYVP CC EPSMHPTDIYLNTSDPSQCAAEGVRLFQTLNSDQKYVFERIVGALHDPTLPRLFFLDGPG CC GSGKTYLYIALYNMIIGENKMAACTAWTGIAAHLLPGGRTMVSLLKLNIHDRCESSLHRR CC QLKDARELANTDVFICDEASMIPKTALNTFDSLLRDITQVDQPFGGKIVILGGDFRQILP CC IVEKGSRADQVDACIKSSPLWNQFETLRLHSNMRVTTNDTEWMNFLLGVGDGSLNDSESK CC VQIPFSLCDGNIVEKVFGDVIDPSTSDPCENVILTPKNVDVTLLNEDVHSRGIGEERIYL CC SRDEIVTDDPADHMHYPTEFLNKMTPSTLPPHVLKLKKGSVVMLLRNLDVSGGLCNGSRF CC VVESLGRRTIGCRFACGERKGKFVIIPRIECYHDKNLSFKLKRTQFPVRAPDGRITNIVF CC TEIL. XX SQ Sequence 5516 BP; 1654 A; 939 C; 1195 G; 1728 T; 0 other; atcatttact atatataaag cgcgtgtcct tccgtccgtt tgtagtttgt agtttgatct 60 ttgatctttc aactctcttt tctgtagttg gaaaaggtat tgggttaaga gttgccgggg 120 gataatgtca aggtactgta gtagtactgt aggagtactg taggtatacg gtagggttac 180 tgtagattag gaaaaattga gtttttgtct acagaagagg tattgggttg ggagttggtg 240 agggataatg tcaaggtact gtagtagtac tgtaggagta ctgtaggtat acggtagggt 300 tactgtagtt taggaaaagt tgagttttcg tctacagaag aggtattggg ttgggagttg 360 gtgagggata atttcaaggt actgtagtgg tactgtagga gtactgtagg tatacggtag 420 ggttactgta gttaggaaaa gttgagtttt cgtctacaga agaggtattg ggttgggagt 480 tggtgaggga taatttcaag gtactgtagt ggtactgtag gagtactgta ggtatacggt 540 agggttactg tagttaggaa aaatttgaat tttaaccatt tgaagagata ttgggttggg 600 agttggtgga ggataatgtc aaggtactgt aatggtattg ttggagtact gtaggagtac 660 ggtagggtta ctgtagttat gaaaaatgtg aattttaacc atttgaagag atattgggtt 720 gggagttggt ggaggataat gtcaaggtac tgtaatggta ttgttggagt actgtaggag 780 tacggtaggg ttactgtagt tatgaaaaat gtgaatttta accatttgaa gagatattgg 840 gttgggagtt ggtggaggat aatgtcaaag tactgtaatg gtactgtagg agtactgtag 900 gattatggta gggttactgt agtcaggaaa aaatttgaat tttaactcaa gaagggtccc 960 attggccccg cccccttggt tttaaatttt tttgaaaaca ttttggtggg aattcaaatt 1020 tcacttttat aaaaacattt tgacgggaat tcaaatttta attttttgaa aacattttgg 1080 cggtaattca aattttaatt ctttgaaaac attttggagg aaaattgatg gaaattttct 1140 taacttctaa aagttaagaa attgtttaaa gttgagaaac gtctgaaacc ttctgaaatc 1200 cactgaatgc aatgtcctag aaaattttgt taagttcatg agacttcaaa ttaaaaaaaa 1260 attcgaacga caaaatgcct cattgggtgg agtaattttg tcgactccag cctctctctg 1320 cagcatttca ggatgttcta tatttatttt agtaaattag attgttttgt tgaccggaat 1380 gtcaaaaact tttctaaata cataatttga ctgtattttt ttaacacact ctacgaaatt 1440 tataatttgt ttttctgttt tcgttattta ctgtacatta attacgaaat tcaagatttg 1500 tttttatgtt tttgttgttt actgtattag tgaactgttg tttacagtag tgtttacagt 1560 attttttgtt tactgtacat taattacaaa atttaaaatt tgtttctctg ttttaggttg 1620 tgttttttga aaaaaaatgg tattatactt ttttgccgaa ctacagtttt caattatcta 1680 tagtacgatt ttggttatta caaaaaaaaa tcttcccctg ttttatctgt tctatcgaaa 1740 aaaaatttct tttctcttca ttttttcgtt tttttttaaa ttattgtttg tgatagtcaa 1800 tattttacag aggaatccga agaattatta actatgaacg aattctcaca aaaaactgat 1860 ttcccttcaa catctgattc ccaagagcct gtttcaaaaa ggacacgttc tagatttaaa 1920 aggtaagttt ttattttcta agtttatttg ttagtaaaat aaatgttttt agtgatagtt 1980 ctggagctga acgcaaaagg aaacagcgtg gaaaagagtc atcagaagaa agagaagcac 2040 gccatcaata cgatcaggat aggaaacgta aaaaaaaagc ttctgagtcc aaagacgaac 2100 tgaaagtgtt gctactgaaa gatagagaac gtaaagcaag agaaagaaga gaagcaaaag 2160 gtatatccga tggacccaac tatcttggcc cgatgagcaa gacatgtgat cactgtaaag 2220 ccgtcttttt cgaaagtaag tttttaaata atatttttac aaatgaacgt tgaattttac 2280 catgcagcaa atttaatttc agatgaagta agcggctcca ctgatcatat caacatgtgt 2340 tgcaattttg gactgacaac ctccatcgat gataagttcg ccgactatcc cgatcgattg 2400 aaggatcttc ttctcggaga ttcgattgag gcaaaaactt ccaatcaaat attcggcaat 2460 tcaactctgc tttttccatg gcttcaatgg gtgcccatct cgatattcca aaaggacatg 2520 gtccatattg ctaccgcatc cacggccaag tgtatcactt agcagggcct cttcatccag 2580 aacctggagg aagaccatca tacggccaaa tttatattct ggacacagct caagctatcg 2640 aagaaagatt aggaaatcct gcaaattcaa aatgtgatcc gcagttgatg gcagagctca 2700 cgaaactcat ttcgactagg aacccatacg ctaaagctta taaaatgatg gcagaggtgg 2760 aagagaaaga aaacaccgat gctctgaaag aaggaagagt tgccgaggag gttcgactaa 2820 tcttcgacat ttcaactact aaagatcgtc gttgttacaa tgtccccgta tcaaatgaag 2880 tggcagttgt atttgtcgga gaagatcagg atattccagc ttcaagatct ctagctattc 2940 atccaagagg aggaggtctc actgcaattc gtgatattga taagatatgt gatccattga 3000 catatccata aggtcataag atggataaca gattcatcgt tccgtataac tcttactttc 3060 tgcttcggta caatgctcac atcaacgtcg aaatttgtgc tttcattgaa gctgtgaagt 3120 acttatttaa gtacgtgtac aagggacatg atcgtgctgc tctaaacatc attcggagta 3180 gactgaaaga aggaatcgtt gatgagatta aagcccacct tgatgctcga tacgtgtgtg 3240 ctccggaggc catccaccat ctcttcggat ttaagattga gagaaagtct gacacggtat 3300 atcgactagc tgtccatctt ccgggttttc aagttgtggt atttccttca aatgtaacaa 3360 aggatcagtt gcagtctacg cctgaaaaag atacaactct gacagcttgg ttcaaaatta 3420 acaaaattag cgaagaaaca gtgcagagtg gttccatgtc cgattcattt gtcgacgaat 3480 tgcattacat tgacatgcca tcttaattca cattcgatca aaaagagcgc tgctggaaaa 3540 cccgaaagcg tggaatccga caaattggca gaatgtacac agttgctccc agtgaaactg 3600 aacgttatgt tcttcgcatt cttctactaa acacaaaagg agctacttct tttgaggata 3660 ttcgaactgt catagatgat gatggagtgc ctgtagttta ccccactttc gttgaagcag 3720 ctaaggcacg tggactgctg aaggatgata gtgaatattt aaaatcgctg aaagagtgga 3780 ttggatgctc ggttcctgct gccctacgat ctttgttcag ttctattcta ctcttcaatg 3840 aggttcacga gcctcaggta ttgtgggata ccgtaaaacg cgatctcagt gaggattatc 3900 gtcacgcagg agccagagat gatgaagctg aagctttggc ctattttgat atttccagtc 3960 gacttcacag gtagctatca tctctaaact tatatttata gtggaaatat atgttttcag 4020 aggaggcaaa gctttcaatg attacgttcc tgagccttca atgcatccga ctgacatcta 4080 tctaaataca agcgatccat cccaatgtgc agcagagggc gttcgactat ttcaaaccct 4140 caactcggac cagaaatacg tcttcgagcg tatcgtggga gcgctacacg accctacttt 4200 accccgtctc ttctttttgg atggtcccgg aggaagtggt aaaacatatc tgtacattgc 4260 attgtacaat atgataattg gagaaaataa aatggtaact atcaactaat tcatttttat 4320 tcctaacatg tttttattcc aggcggcttg cacagcatgg actggaatcg cggcacattt 4380 acttccagga ggtcgaacta tggtgtctct tctcaaattg aacattcatg atcgatgtga 4440 gtcgtcgctt caccgaaggc aattgaaaga tgccagagaa cttgctaata ccgatgtgtt 4500 catttgtgac gaggcttcga tgattccaaa aacagctctc aacacgtttg attctcttct 4560 tcgggatatc actcaagtcg atcaaccatt tggaggaaaa atagtcattt tgggcggaga 4620 tttccgacaa atccttccca ttgtcgaaaa agggtcgaga gctgatcaag tcgatgcctg 4680 tatcaagagt tcgcctctgt ggaatcagtt tgagactcta cgcctccact caaacatgag 4740 agttaccaca aacgacacgg aatggatgaa ctttcttctc ggtgttggcg atggttccct 4800 aaatgattct gaatcgaaag ttcaaatccc tttttcttta tgtgatggaa atattgtcga 4860 gaaagttttt ggagatgtaa tcgacccatc aacttctgat ccctgcgaaa atgtgattct 4920 tactccaaaa aacgttgatg tcacccttct caacgaagat gttcattcca gagggatagg 4980 agaagaaaga atctatttat caagagatga aattgtaaca gatgatccag cagatcacat 5040 gcactaccca acagaatttc tgaacaagat gaccccttcg actctccccc cacatgtgct 5100 gaaactgaaa aagggatcgg tggttatgct gctcagaaac cttgatgtat cgggaggtct 5160 ctgcaatgga tctcgctttg tggtggaatc cttaggacga cgaacaattg gatgccgatt 5220 tgcatgtggc gaaagaaaag gaaaatttgt aatcatccct cgaatcgaat gttatcatga 5280 taagaatttg tctttcaaac tgaaacgaac tcaattccct gttcgagccc ctgacggccg 5340 aatcacaaat attgttttta ccgaaattct ttaaacgaat gttttagaaa ccttgatttg 5400 atttaataaa aattatttaa aaaaaatttt tttcaaaatt tttcacttcc ttatttttaa 5460 aataaaattt caaacgacca gtcgcggccg cgccggaggc gcggtcagcg gctggt 5516 // ID CERP1A repbase; DNA; CEL; 454 BP. XX AC M37219; XX DT 28-SEP-1995 (Rel. 1, Created) DT 25-MAR-1997 (Rel. 3, Last updated, Version 2) XX DE Nematode CeRep1.1 repeat DNA. XX KW CeRep1 repetitive sequence; CERP1A. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-454 RA Felsenstein M.K. and Emmons W.S.; RT "Structure and evolution of a family of interspersed repetitive RT DNA sequences in Caenorhabditis elegans."; RL J. Mol. Evol 25, 230-240 (1987). XX DR GenBank; M37219; Positions 1412 959. XX SQ Sequence 454 BP; 142 A; 93 C; 79 G; 140 T; 0 other; cttccgattt taatccatta taacaaaatg ctgtttctca ggtggcgaaa aattttaatt 60 ccgagggggt cattttcata ggggattttt tcgtaggaaa acgctttcgg tgggacaaaa 120 tatacgacac gccttttcac agacccacca catgatcctt cacaatgcac cacacgactt 180 tttgcaatac accgcacgct tcttcgtatt tatttattcc tttttccaac gtaccgaact 240 tcgacacgaa ttcaatacaa aacgggtttt taatggaaat taattagttt tattcccgag 300 ctaaatacgg atattttcag aattttccga tgacaaaaac gggtttttca atagattcga 360 acgatccaaa atttaattat taacgaaaat gagcgttttc tttccggaaa aagaagaact 420 gatgccgaga gtgatccaag tggagccaaa tgtc 454 // ID RTE1 repbase; DNA; CEL; 3291 BP. XX AC AF054983; XX DT 04-FEB-1999 (Rel. 4.01, Created) DT 04-FEB-1999 (Rel. 4.01, Last updated, Version 1) XX DE RTE-1 is a non-LTR retrotransposon, complete sequence. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; LINE; SHINE; KW retrotransposons; RTE1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3291 RA Youngman S., van Luenen G.H. and Plasterk H.R.; RT "Rte-1, a retrotransposon-like element in Caenorhabditis RT elegans."; RL FEBS Lett 380(1-2), 1-7 (1996). XX RN [2] RP 1-3291 RA Malik S.H. and Eickbush H.T.; RT "The RTE class of non-LTR retrotransposons is widely distributed RT in animals and is the origin of many SINEs."; RL Mol. Biol. Evol 15(9), 1123-1134 (1998). XX RN [3] RP 1-3291 RA Malik S.H. and Eickbush H.T.; RT "RTE1."; RL Direct Submission to Genbank (21-MAR-1998)Biology, University of RL Rochester, Huchison Hall 334, Rochester, NY 14627, USA. XX DR GenBank; AF054983; Positions 1 3291. XX CC CDS join(65..193,187..3258) CC /note="apurinic-apyrimidic endonuclease; ORF2 CC (translational frameshift from ORF1); similar to second CC ORF of many other non-LTR retrotransposons" CC CDS 65..196 CC /note="putative ORF1" CC RTE1 has a direct terminal 200 bp-long direct repeat [1]. CC Southern blots indicate that there are approximately 10-15 CC additional Rte-1 elements in the C. elegans Bristol N2 CC genome and a similar number is found in the genomes of CC two other geographically distinct strains. The insertion CC pattern of Rte-1 is polymorphic between these strains [1]. CC Database searches and phylogenetic analysis indicate that CC representatives of the RTE clade of non-LTR retrotransposons CC are found in the bovine and sheep genomes of mammals and in CC the silkmoth and mosquito genomes of insects [2]. In addition, CC the previously identified SINEs, Art2 and Pst, from ruminate CC and viper genomes are shown to be truncated RTE-like CC retrotransposable elements. RTE-derived SINE elements are also CC found in mollusc and flatworm genomes. Members of the RTE clade CC are characterized by unusually short 3'untranslated regions that CC are predominantly composed of AT-rich trimer, tetramer, and/or CC pentamer repeats [2]. XX SQ Sequence 3291 BP; 984 A; 840 C; 732 G; 735 T; 0 other; ctctttctgt actaagcctt ttcgtggcga aagggatcaa cagataacca cttcaacaaa 60 ctcgatgatt tcgtcacatc gagagtggct tggtttggaa ccaagtgcaa attatctggc 120 caagaagtac accgtctcac gcaggccaga ggtagtgtcg aatactctct gtccgggcgg 180 ctctagattg gcttagcagg tcattataga gtatctagag acacaacttt tgctctaaac 240 agcaacccat cctgcctcag tcctccgcat ccggcctctg caaagggaaa gacagttcct 300 ggagcacgtc ggaagaggcg atctaatctc ggagtccaaa ccatatgtca atcggacttg 360 gaaattaaga agccgcggga caaaacaaga agaactgact atcgcgtgtg caccttcaac 420 tgcaggtcct tgtcttcgga tgatcgactt gctgagttgc tagaagagac gaggcgaatc 480 caattcgacg tcatcggatt gtgtgaaacc aaacgagcag ccgaggcaca cttgatacat 540 cacgacggta ccggcgtctt cttaggcaaa cggaatgaaa gttctgtatc cggaggggtc 600 ggcttcatag ttcgaagcac cctccttccc aaaatcgtag aagtacgatt tcttagtcac 660 cgcatcggct tactcacctt caaagtgagc cgaaaattca actgcacggt gattcaagtc 720 tatgctccaa ctgcggactc agaccttgag gaaatctgcg acttctacga cagtgttgaa 780 gacgtcttca gagagtgccg aagcaagtat aaacttgtta tcggcgactt caacgctcga 840 atgggatgca gacaaaacaa cgaaaggtac attggccccc atgccatgga accaagaaat 900 gatactggag agcttctcgc aacattttgt gaaaccaacc gtctgtggca cacgaactct 960 atgttcaaaa agcctatgca caaacggtgg actttcgtca gtcccgacgg gaatcacaga 1020 cacgaaattg accacattct ggccaatgga aagtttgtca cagataccac tgtactcccc 1080 tctttcacaa atggtagcga ccataggtta ctccgctgta acctccactt caacaattgt 1140 ctagcaaagt tggagcaggt cagacggaga aaacctccga aaagagtgtt ggacccggcg 1200 attgcttatg caatctcagc gacagtcacg gttcagagtg atcctgattt agataccgac 1260 tatgataatc tgattcagtc actaaaagaa cttcaagacc aagctatcgt tcgaccggcc 1320 aaccactcat ccaacagact ttcggaggaa acccgaaaac tgctgaacaa aagacgtttt 1380 atggatagaa atgatcccca attcaaatct atttcagata aatgtcgtga agcagttcag 1440 aaagaccatg aagcttttgc cagtactcgc cttctatctg ctgcgaacca gaaaaagagt 1500 ttgaaaagag ttgctaggga catcaacgaa tataagtcag ttatcccatg cctcaaatca 1560 acctctactg gtgaaagaat cacttccagg gtgaaaatgg agcaggagat tgagaagttc 1620 tacacggagc tcttcaaaag cgctgtgagc aactctcaaa catcttcaat acccgccacg 1680 gcaactccac cgccgtttct cccggaagaa attcgtcatg ttctccgttc cttcccgaat 1740 ggtaaagctg caggccaaga caaaatcagt gcagatttct tgaaatcttg ccacgataac 1800 gtcattgacc tgataacgga tcgattcaac aggtacctcc acagcagaaa tgtaccgaaa 1860 ccctggaaaa cctccaaaac cactctcatc ttcaagaaag gtgaccgtga gaatttggaa 1920 aactataggc ccatctgcct actacccgta ctctacaaag tattcaccaa gtgtttgctg 1980 aatagaatgc gaagatccct tgatgaggct caacctgtcg agcaggccgg attccgacgg 2040 tctttctcta cgatcgatca catccactcg ctccaaagac ttcttgaagt cggcagggaa 2100 taccagatcc cactgacact tgtcttcata gatttcaaga aggcatttga cagtgttgaa 2160 caccaggcaa tctggaaaag tctcgacgag caaggtgcag atggagccta tattgatcta 2220 ctgaaagagt gttataaaaa ttgtaccaca aattttaccc cattccacag gccagtcgca 2280 gtacctgtga ccaaaggagt tcgacaagga gatcccatct ctccgaatct cttctccgct 2340 tgcctcgaac acgttttccg aaagctttcc tggattgaat tgaaaggaga agctgaggat 2400 tacgatacga tccctggaat gagagtgaat ggcagaaatc taacgaacct cagatttgct 2460 gacgatattg tgctcatcgc caatcatccg aatattgcca gcaaaatgct ccaagaactc 2520 gtacaaaaat gctctgaagt aggtctcgag atcaatactg ggaagacgaa agtcttgcga 2580 aaccgattcg ctgaccccag taaagtctac ttcggtagcc cttcccccac cacccagctc 2640 gacgacgtcg acgagtacat ctacctcggt cgtcaaatca acgcccaaaa caacttgatg 2700 ccggaaatcc accgaagacg tcgagcagcc tgggctgcat tcaatggaat caagaacacc 2760 accgactcca tcaccgacaa gaagattcgt gcgaatctgt tcgactcaat tgtccttcca 2820 gcgctcacct acggttcaga agcctggaca ttcaccaaag ctctatccga acgagtacga 2880 atcacacatg cctccctaga aagacggctt gtgggaatca cactcactca acaacgagaa 2940 cgagatctcc atcgagaaga cattcgtacg atgtctctag tcagggatcc gctcaatttc 3000 gtgaaaaaga gaaagctagg atgggctgga cacgttgcga gaaggaaaga cggaagatgg 3060 accacgttga tgacagaatg gcgcccatat ggatggaaaa ggcctgttgg aaggccgccg 3120 atgcgatgga ctgattcgct gcgaaaggag atcaccactc gtgacgcaga cggagaagtc 3180 atcaccccct ggtccactat agccaaggac cgaaaacaat ggcttgctgt gatccgcagg 3240 aataccacga attcctgaag aacggatcgt ctaagtatct aagtatctaa g 3291 // ID CEREP3 repbase; DNA; CEL; 302 BP. XX AC . XX DT 01-DEC-1999 (Rel. 4.11, Created) DT 01-DEC-1999 (Rel. 4.11, Last updated, Version 1) XX DE CEREP3 dispersed repetitive element - a consensus. XX KW CEREP3; dispersed clusters. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-302 RA V V. and Jurka J.; RT "CEREP3."; RL Direct Submission to Repbase Update (NOV-1999). XX DR [1] (Consensus) XX CC This element does not have any noticeable hallmarks of known CC retroelements or DNA transposons. There are about 100 copies CC of CEREP3 in the C.elegans genome. They are ~73% identical CC to the consensus sequence. CC There are multiple clusters composed of tandemly repeated CC copies of CEREP3 in the genome. CC The most conservative portion of the repeat is a 70-bp CC 5' terminus. XX SQ Sequence 302 BP; 122 A; 41 C; 34 G; 105 T; 0 other; tttggccaca gtggggactc gaaccccggt cttgtgattg gtagtcggcg aaaatcccac 60 tgcgccattt ggacaaaatt tttgaaaatt ctgaaaattg atgaaaatta taaaaaatca 120 cagttttaag ttaaatttta aatttttata tttaaaaaaa tcaaatttta aactttttta 180 aaattaaaat tttgaaacta aattttcgac attttctaga aatttttgag tttctaaaac 240 tttaaaattc gaattctaca tctttaaatt acccaaaaac attaaaaaac caaaaaaaaa 300 at 302 // ID Tc3 repbase; DNA; CEL; 2337 BP. XX AC . XX DT 03-APR-2003 (Rel. 8.03, Created) DT 03-APR-2003 (Rel. 8.03, Last updated, Version 1) XX DE An active DNA transposon - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Tc3; KW mariner/Tc1 superfamily; transposase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2337 RA Collins J., Forbes E. and Anderson P.; RT "The Tc3 family of transposable genetic elements in RT Caenorhabditis elegans."; RL Genetics 121(1), 47-55 (1989). XX RN [2] RP 1-2337 RA Kapitonov V.V. and Jurka J.; RL Direct Submission to Repbase Update (MAR-2003). XX DR [2] (Consensus) XX CC Tc3 is an active mariner/Tc1-like DNA transposon [1]. There are CC 17 copies of Tc3 in the sequenced C. elegans genome. They are CC ~99.84% identical to the consensus sequence. Tc3 copies are CC flanked CC by the TA traget site duplications. The preferential target site CC is aTAt [2]. Tc3 has 467-bp terminal inverted repeats. CC Tc3 encodes a 329-aa Tc3p transposase (2 exons, positions CC 720-1136 and 1185-1757). CC Tc3: CC MPRGSALSDTERAQLDVMKLLNVSLHEMSRKISRSRHCIRVYLKDPVSYGTSKRAPRRKA CC LSVRDERNVIRAASNSCKTARDIRNELQLSASKRTILNVIKRSGVIVRQKLRPAPLLSAD CC HKLKRLEFAKNNMGTNWSKVVFSDEKKFNLDGPDGCRYYWRDLRKEPMVFSRRNFGGGTV CC MVWGAFTEKKKLEIQFVSSKMNSTDYQNVLELELSKYLRHYSRKDFRFQQDNATIHVSNS CC TRDYFKLKKINLLDWPARSPDLNPIENLWGILVRIVYAQNKTYPTVASLKQGILDAWKSI CC PDNQLKSLVRSMEDRLFEIIRTQGNPINY. XX SQ Sequence 2337 BP; 700 A; 445 C; 490 G; 702 T; 0 other; cagtgtggga aagttctata ggaccccccc taatttgaag gtttgaggaa cttccgaaaa 60 ttttttcgaa aaactgctaa tgccgttcgt ttttaaattg aaaaaaacct atatacattt 120 ttttccagaa gtttatctca aaaactgagg tcgcgctgga aaaaacgtca aaatccagtg 180 tgaaacttct ataggacccc ccgttttttt tcacgatttt tactaaaatc aacagatttt 240 ggaatttttg acaaagctca aatcaagttt gagttagaaa tgagttcaga taagcagttt 300 tgactttaaa aaattaatac gaaatgttct cgtgggatct ccagactggt tctgattctt 360 ccgatctttg atgttcaagt ctgtttcaag cttcctggtg ctctcggtaa tgccaaaact 420 tgataaactc tctttaacaa gttcctacta aaattcctag cacgcacttt agatgtttcg 480 actgtgtagt caagctgatt tggcaaaata tgcagcagga aacaatggaa ggcttatcag 540 gaatcaaatc gtttttcttt gattacaagg ttccatggga ccaatatttc aagttaaatt 600 gtccctcaca gatgttattt actatttttt gcgtgaatta ttaaatgtgg aattgtggca 660 tgtgttgtgg cacacatata gaggctggaa agcttacttc gaaagcagtc taacttgcaa 720 tgcctcgagg atctgccctt tcggacactg aacgcgctca gctggatgtt atgaaattgc 780 tcaatgtgtc cctgcatgaa atgagtagga aaatttcccg ttctcgacac tgtattcgcg 840 tgtatctgaa ggatccggtg agctacggta catctaaaag agctcctcgt cgcaaagctc 900 tctccgtgcg tgacgaacga aatgtgattc gtgctgcctc caactcctgt aagacggcaa 960 gagatattcg caatgagctt caattgtctg cttcaaaaag gaccatcctc aatgtcatca 1020 aacgatctgg tgtaatcgtt cgtcagaaac ttcgccctgc tccgttactc tctgcagacc 1080 ataaactcaa gcgattggaa tttgctaaga acaatatggg aacgaattgg agtaaagtga 1140 gaatttaaaa aagcaagagt gaataattag gatcattgtt ttaggttgtc ttctccgatg 1200 aaaagaaatt caatctcgat gggcctgacg gttgccgcta ctattggcgc gatttgcgca 1260 aggaaccaat ggttttttcg agacgtaatt ttggaggagg aacggtgatg gtttggggag 1320 cgttcacgga gaagaagaag cttgagatac agttcgtcag tagcaagatg aacagcactg 1380 actatcagaa cgtcttggaa ctggagctct ccaaatatct tcgtcactac tccagaaaag 1440 actttagatt tcagcaggat aatgcgacaa tccatgtgag caactcaacc cgcgactatt 1500 tcaagctcaa gaagatcaac cttcttgatt ggccagctcg aagtcctgat ctcaatccaa 1560 tcgaaaattt gtgggggatt cttgtccgta tcgtgtatgc tcagaacaag acttacccaa 1620 cagttgcatc gttgaagcaa ggaattctcg acgcttggaa gtctattccg gacaaccagc 1680 tgaaaagttt ggtcagatca atggaggaca gactgtttga gatcatccgc acacaaggaa 1740 acccgattaa ctattgatcc tttcttgatt ttagtatatg aatgttctgt tgttgatcaa 1800 aaataactgc aacttgttaa tacgctgttt ctgactggtt tcttggggat ggcgtaaaaa 1860 tgtttatggt gtgtgtgcta ggaattttag taggaacttg ttaaagagag tttatcaagt 1920 tttggcatta ccgagagcac caggaagctt gaaacagact tgaacatcaa agatcggaag 1980 aatcagaacc agtctggaga tcccacgaga acatttcgta ttaatttttt aaagtcaaaa 2040 ctgcttatct gaactcattt ctaactcaaa cttgatttga gctttgtcaa aaattccaaa 2100 atctgttgat tttagtaaaa atcgtgaaaa aaaacggggg gtcctataga agtttcacac 2160 tggattttga cgttttttcc agcgcgacct cagtttttga gataaacttc tggaaaaaaa 2220 tgtatatagg tttttttcaa tttaaaaacg aatggcatta gcagtttttc aaaaaaattt 2280 tcggaagttc ctcaaacctt caaattaggg ggggtcctat agaactttcc cacactg 2337 // ID MIRAGE1 repbase; DNA; CEL; 5625 BP. XX AC . XX DT 18-NOV-1999 (Rel. 4.1, Created) DT 21-MAY-2011 (Rel. 4.1, Last updated, Version 2) XX DE MIRAGE1 autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; transposase; KW Autonomous DNA transposon; TIRs; Mirage superfamily; MIRAGE1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-5625 RA Kapitonov V.V. and Jurka J.; RT "MIRAGE1."; RL Direct Submission to Repbase Update (31-OCT-1999). XX DR [1] (Consensus) XX CC MIRAGE1 is a very young autonomous DNA transposon. CC There are more than 10 copies of the element fixed CC in the genome. Some of these copies are almost CC identical. CC MIRAGE1 generates 2-bp duplication of a target site upon CC integration in the genome. MIRAGE1 has 53-bp TIRs and CC encodes transposase (several exons). CC PALNN1_CE is a nonautonomous form of MIRAGE1-like CC transposon since it shares common TIRs with MIRAGE1. XX SQ Sequence 5625 BP; 1762 A; 1092 C; 1077 G; 1694 T; 0 other; ctcagataaa ctggagtcac ggcagagcga cagaaaatca cttctaatca gcaaaagcta 60 gtaaagccat agtttatcag cagagaacca ctgctaatta gcaggtcatt gctaattagc 120 taaaaagcac tgctaattag cagaaaccca ctgctaacta gcagaacatt gattattgat 180 tttattttgt tttttcttcg tcgtatttca catatttcat ttttttgacc ttttttattg 240 tattttttgt gtgatatgtt atcaatatgt gttaattgtt aattccagtg tgtttttcaa 300 agatttgtat cattcccggc tagaaacgga ggattatggc cagaaatcca agaacaacag 360 ttcgaagaat tttgaattca gcaccccgga gtcttctgat tcaaagcgtg acaatagttg 420 ctctaggtaa attttttaaa tgattttatt gggactaaat aactactttc agacccgacg 480 gacaataatc atggttctca ccagccggga agcaatggtc catcaaattc aagcacccag 540 gccgataact tcgagacgga aacaactgca gatgaagaac gacaaaagga agatggtgta 600 gaagatgaac tcgaagacga agaaacaaca ggattcattg atgaagacgg ggaaattcgt 660 tatccggtcg atgaagtggg aaattacgag atcgatcaag aagctgaaac cgatgagtca 720 ttttatttgg acactcaacg cgattcttct ctatgcgacg agcgaagtca actgtgtcgt 780 tctctgagcg atcaaaatct gagaacaact tttttcctca tcactactac caactcgaaa 840 gcagccattt tgagatatct acattttcac aatgtttcac aagatagcat tctcagaaaa 900 ttgctactga atgactctca tctaacactt gtagaagtat gtaatttatg tgcgcagaat 960 atcgaactgt aggtttatta taaaactaaa tttctgaaat atatcattac agatgcaaat 1020 gtgaagatat gcaagagaaa tgccaactaa ttttcggaga tctcaagaaa tggctcgaag 1080 ttatattgaa tagttttgcg aatatcatga tgaagactaa aacaactatt ttggaaggat 1140 ataatagctc cagtccaata aataagggta tctttaaaaa aaagctactc agcgaaatgg 1200 cagataatga aatgtggatg cacttcgaga gtagcttcga cggaatcaaa gtacataaaa 1260 atggaaagta agtatcttag tttgatggct ctaaaaattt taattggttt tcagtcagaa 1320 catttggcca tatagtttac tcaatttgga tctcgaagac gttcatcgag cttctcccca 1380 ggccctcatt ttagcagcgt tattcatcgg gttcaaaaat ccgacaacga aaattcatga 1440 tagacttaca caatggattc tgctccaaat ggacgaacat gtcttcttct cagagggagt 1500 ggcctggaaa gcggatctca cgtgtgccaa tcatgacgat ccagtgagat gaaaattcag 1560 aattttgaaa acaagtgttt ggtttcaggc acgacgaatc gtatacaatc agtgtggact 1620 ccgatcatca ggttcttgca acttttgctt aaaccaggaa actgagtgca aaatcaatga 1680 tgagtataca acacgtaagc atcatgaaaa ctattattaa aaaaataatc acctctttat 1740 agtacgcgaa aatctcaccg gatcgtgccc ttcgactatg aacgatggcc ttcgtcaaag 1800 aaaccagcat tctcatttcc acaagcatgt tttgtatatg catctatgcc cggttgactt 1860 gtttcattgt tttgaagaag gaattttaag caatatcaat acaggtcact agtaatgccc 1920 gaatgctccc gaaacgtgtt catttttcag cattattctc caaagggaaa tggaatttgt 1980 tcccgtcaaa atcggtggat ttttcatcaa caccttcgct tccatctcga tttcgttcgt 2040 tgtccggcaa agcttcacag tgtactggtt ctgagaaagc actggtaaaa agattcgttc 2100 agctaatttc tgcaaagtga acacacattt agattttcga gtcgatagtt gtagcagctg 2160 cattttccgg tgaacttgga ggtgtaccat cggctatcat tctcagcatt catggacttt 2220 atcgtctttg catagaaccc caatcaatca ctgacgagac tcttcatttg aaagttagtt 2280 tttattgttt ttgcttgaaa aaagtagacc attctttaca gattgaacaa atcagcaaat 2340 ccatcgaagt tttaattgtg aaacgagcac cggaaatgtt gaatgggatc aaagtacatg 2400 tatggaaatt gtcgtcaact aacacccatt actattttgc agcaagtgtt gtaccacctt 2460 gcaagaatga ctgagctgta cggtagctta tttccattgt caactcaatg gttcgagtac 2520 ttctatcaca ctattcaaag aactttagtt ccggaaattt acaatggact gggcatgagc 2580 atcatgagaa agtgagaaaa ttagagcaca atagggagca agaatgtttt tacagaatgt 2640 cggctcttca agagatcaaa accgaagctg cattccgtct acacacaaat caccagtttc 2700 gaacagaagc gaacttgaaa atgtgctatg aacttggatt attaaggcga ctcaaacgaa 2760 tatcggttgt tccgacccct ccgcagtacc aacagttttg cgagaccggg gatgttttcc 2820 ttaatgttgt ataccacgac ggcatcagat attcgaatta ttgcaacaaa aagacagatg 2880 actcaaatgt tacgtttatc gaatatgctc gcgtacgatt tgcaaagatt attgggatgc 2940 tgttgaagga aaacaacact gatcttcgtt ttattttgaa gaaatactgc aacacaaatg 3000 atcatttgat tgacatatcg aacaagttat atgaagagga tgtaccgcaa atgcatattc 3060 aaaactgggt tgattcagtt tcaaaatcga gtttcggagg tcgtgtcacg ttaacaaaca 3120 cagttgtcgt ggtttccgcc gattccattg ttggacatgc agtggttgtg aagcgaaata 3180 acaactgtgt ggctcttcca ttttctcgcc gtatcagtct atcttgactt tgataaccac 3240 atatagcttt gaatttgttt gtcttttgtc tgtccccccc cccccccgct cgaaaaaatg 3300 tctgcacctc ccccttcttt gacaaatgtt tgtctctacc ccttgctcga aaaatgttct 3360 cattatgttc ttgaggaatt ctttcagatt tttaaaagtt gtttgtttta atattttcat 3420 gaatgcttat ccgctttcaa ttaatttgtg tttatttgat gtgataatct gttctgttca 3480 accctgctta tttaatctta ttttattaac agatctaaaa atcaatttca gaaaaagtca 3540 tgtcattcct cgagaaacta gtggcaaaat acgagcccat taaagaagaa aaaccaaaat 3600 tagttccaaa gccgacaaga aaacggcaac cggaaactac ggaaatcgtt gtgaagactc 3660 gccgtattac tcgccgaaat gctaaggtta gtggaaggag ccacataaca ttaaatgtga 3720 acccaaagct aacctattcc tatgagtttt aatatttctg tcattttcag gcacctgtaa 3780 aagaagagga gattggacat caaaaagctc aaatcacgat cgtagatgac gaagagttta 3840 tggagcaacc tgtacaggag ccgattgaga ttgaagagga gcaacaagaa gaatctcatc 3900 tggaaattgc tcttccggag caaccaaatc atgagccaat gcaatcaagt ccatcgttgc 3960 ctagaaaatc gttcaaaccg gatatgcatc caattgcgaa ggtgtttact tctgctaatt 4020 tcgagttgct cggaaataaa ttaatttcag aaaggcggaa aaatctgtat tgtagatcta 4080 ggacttacat ctcacacact ccccaccgaa caaactccag gattcttggt aagttttttt 4140 ttccaaaaat ctggcatgat caaaattgtt gacttttaga aaagactcgg agagaacgtt 4200 ggaggatcaa ttgactatag ttatgaatcg attgacgagt tgatccgcac tgctccggat 4260 gcttggtcta aagcttcaat gggtgcaatt cgaggtgtta tgctcatggt cagttttagt 4320 tactatcttg aatttgcaat taatattgga gaaaatttca gcaggaactg gttactgagc 4380 gatcaatgca cccgaaaagt aaaccaatgg agaagaatcg tttacttgta tgtttataag 4440 aatggaacga ttttgagcat ttgaaaatgt tatttacagg gtgaagaaat ggcaaaattg 4500 aaccctaacg attttctgaa ccacaaactg ctggaagttt cgagtttgga tcaaactggt 4560 ataccaagaa cataaataaa aaccgccccc aagaacattc tatttcagtg tttatcaacg 4620 tcaatctcgt gttttcggcg ctgagcattc aacatggctc ggttagatca attgtcaatg 4680 ctatgtcaaa tcactggatt gactacgtga gcgatcaaat attacgttct atttttgact 4740 atttaacatt tcagctgatt ccgagagagc aacaaggatc cttcacgaca agcaaaaaga 4800 cggagcctct tactttcttc ccaggagact tgacaaattg gccagccggt tagtttatga 4860 tttacatggg cgagccacat aaaaatcgtt ttcagctctt ctcgctgcgg caatcaagtt 4920 ggacaacgac caggtggacg ttactcgcct tccaagtgtg gtcaatatgc tgttcaagca 4980 tcatatcaaa gtcagattgt aggtttcttg ttatttactt taaaaagcca gcgaatagaa 5040 tttttcttta acttgaaaaa ctatgttttc acaaatgtct ttttgcagcg atcatgcccg 5100 tggtttgtat cgaacaaaac acgacggcca gaatccacca aaaattgtgc tgccaaagcg 5160 tcccatcatc gattgcacac aattctacaa gagcagcgag gatatcgatg aggagcaagt 5220 tccatcaact tcgaactttc aatcatcttc gaactctcta tcaacttcga aatctgctgg 5280 aagggaaaat atcccagact ccccataaat tcactagttc tcctgttttc tgatactttc 5340 gaaacttgtt tttttgtttt tttttccatt tttcgttaaa tgtttttttt ctgttgtttt 5400 ttcattttgt ttcattttca taataaatcg tgttattttt tatggtttta taaaagttta 5460 atccaatatg aaagcaaaac tagctagcaa gaaagcataa ctacgtagca aaacatcacc 5520 gctaatcagc ggaaaagcac aattactgcg gttgactaag ccgtgacata aatgctaatt 5580 agcagtgatt ttctatcgct ctgccgtgac tccagtctat ctgag 5625 // ID PAL5A_CE repbase; DNA; CEL; 792 BP. XX AC . XX DT 31-AUG-1999 (Rel. 3.7, Created) DT 31-AUG-1999 (Rel. 3.7, Last updated, Version 1) XX DE PAL5A_CE non-autonomous DNA transposon - a consensus. XX KW TIR; Non-autonomous DNA transposon; palindrome; IR-2; PAL5A_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-792 RA Devine E.S., Chissoe L.S., Eby Y., Wilson K.R. and Boeke D.J.; RT "PAL5A_CE."; RL Direct Submission to Genbank (MAR-1997). XX RN [2] RP 1-792 RA Kapitonov V.V. and Jurka J.; RT "PAL5A_CE."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [2] (Consensus) XX CC PAL5A_CE is a putative non-autonomous DNA transposon; it CC forms almost perfect 396 bp-long palindrome and generates CC duplication of 5 bp-long target site upon its integration CC in the genome. CC PAL5A_CE can be still mobile since some of its individual CC copies are more than 98% identical with the consensus CC sequence. XX SQ Sequence 792 BP; 249 A; 146 C; 149 G; 247 T; 1 other; cactgcaact ttttcctcac gagggacgag gaaaagtggt ttctaggcca tggccgaggg 60 gccgacaagt ttcagcggcc atttatcttg ctttgttttc cgcctgtttt ctttcgtttt 120 tcaccgattt ttcccgtttt ttcttattaa aactgataaa taaatatttt ttgcagatgc 180 taaaacaatt tccaagtaaa aaaattatgt attcagtggg caagcagcgg tgaaagtggg 240 caatgtaata tgatggatta cgggaataca aaacctaaac tttttctgaa acatgataca 300 tatgctgctt agatgctgaa actacctgat tttcataacg agaccgctga aaaagttttg 360 aggttttcaa aattcaactt ttttggtgaa aaagtcgaga ttttcgcaca aaaagttgaa 420 ttttgaaaac ctcaaaactt tttcagcggt ctcgttatga aaatcaggta gtttcagcat 480 ctaagcagca tatgtatcat gtttcagaaa aagtttaggt tttgtattcc cgtaatccat 540 catattacat tgaccacttt caccgctgct tgcccactga atacataatt tttttacttg 600 gaaattgttt tagcatctgc aaaaaatatt tatttatcag ttttaataag aaaaaacggg 660 aaaaakctgt gaaaaacgaa agaaaacagg cggaaaacaa agcaagataa atggccgctg 720 aaacttgtcg gcccctcggc catggcctag aaaccacttt tcctcgtccc tcgtgaggaa 780 aaagttgcag tg 792 // ID NDNAX3_CE repbase; DNA; CEL; 1593 BP. XX AC . XX DT 06-APR-1999 (Rel. 4, Created) DT 06-APR-1999 (Rel. 4, Last updated, Version 1) XX DE NDNAX3_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW nonautonomous DNA transposon; DNAX family; NDNAX3_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1593 RA Kapitonov V.V. and Jurka J.; RT "NDNAX3_CE."; RL Direct Submission to Repbase Update (APR-2000). XX DR [1] (Consensus) XX CC NDNAX3_CE is a putative non-autonomous DNA transposon. CC There are about 100 copies of NDNAX3_CE in the genome; CC they are ~95% identical to the consensus sequence. CC NDNAX3_CE shares common termini with NDNAX1_CE and NDNAX2_CE CC elements. CC These elements do not have terminal inverted repeats. XX SQ Sequence 1593 BP; 475 A; 314 C; 282 G; 522 T; 0 other; tctgccaagc agcctctaat aaatgccgag tgtacagact ccgccccttt ttcgcgtttt 60 ttcgcgtttt ttttggcttg cgaaaaaatg tcacacttcg ttttcattaa taactccaga 120 gccgtgtgac ttcatgctct aatatttgaa atcctactag aaatattttg ccccacggtg 180 gagcaaaaaa cggcactaag ttcggagcaa aattgagccc agggcggctt tccccagttt 240 gaaaaaattt tttgttgctc tttttacccc caaaaaagca tgttttaatt gaattaaaat 300 tcgggttttg ctcggttttg ttccaaattt atgtgcaaga tactactcag agagactgat 360 tttttgaaaa aaagtttgag ttcataagtg caaaaggaaa aaagttgtga taaaacaaaa 420 ggccaaaaaa tgacattttg ccaaaaaaaa atttttttcc gaaaaagtag tttttcgtct 480 ttatctcaag ttctacttca tcttttttga tatttttttt gtttacccca cgtaaaaaag 540 tacgctgaac acgattttta actcagaatt gagaaaagtt ctatgagtcg gccgagcaag 600 acgaataagt gccaaatttt gcacactttc ccattttcgc gaatctactt tttcaatcat 660 aactcggtca gttttcaagt tttcttagtt ttccaaaaat tgacgtgtag gtctcatcaa 720 gacgcatcga gacatataaa atttgtaaaa agttcagtgg gaaaattttt caagaaaaaa 780 ataattcaaa aatttagtac tggggggagt ggtttcctgg gtctcctgaa cattttttcc 840 actaaagttt tgcggaatgt atttttgatt catagttttt gattttattt aatggaagat 900 gaagtttcgt cgcattagaa atactctgcc aagagatatt atttttatta attttcaggc 960 caaaacttgt tttttttcta tacatctgtc tgtgtacaag atatagccct caactgtacg 1020 tctccttttc gggaaatcaa taatgagtac aatcctgtaa taaaattttc aaaaattcgt 1080 tttcccgctg acatatcagt aataaacggt tttcgaacgt cttttctcaa attgtaatta 1140 ttgaagaaag cggaatcgaa agacgactcc ctccagaaat ttttcttccc tacatcccag 1200 tttgaggtgt ctcccgcatg gccgagtggt tagtgcgtat gactgcagaa attttggtag 1260 ttggttcaat tctaccaagt gacaaatttt tttgtttttt tgtgagtaat cgggattcga 1320 atagaaatac tcgattttat acaaaatatg ggtacgtctc ccctaatata aagcctcctc 1380 aaacttcaat caatcaaaac tcagctttaa ctgattcatc gggtgcatct gtttttggtc 1440 aaaccgaact caaaaatcaa gaagactccc attgacctac tgaaattttc acgaaccctc 1500 taacaggttc gaggcgaagc cgagaaccta cgcccggcaa ggggactcgc cttacacccc 1560 ctacaactgg cgggcccgca gggcccgcta gtc 1593 // ID RCS5 repbase; DNA; CEL; 738 BP. XX AC X68553; XX DT 28-SEP-1995 (Rel. 1, Created) DT 28-SEP-1995 (Rel. 1, Last updated, Version 1) XX DE C.elegans RcS5 repetitive DNA sequence. XX KW Repetitive element; RcS5 element; RCS5. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-738 RA La Volpe A.; RT "Direct Submission."; RL Direct Submission to Repbase Update (02-OCT-1992)A. La Volpe, RL Intern Inst of Genetics & Biophysics, Via Marconi 10, 80125 RL Napoli, ITALY. XX RN [2] RP 1-738 RA Cangiano G. and La Volpe A.; RT "Repetitive DNA sequences located in the terminal portion of the RT Caenorhabditis elegans chromosomes."; RL Nucleic Acids Res 21, 1133-1139 (1993). XX DR GenBank; X68553; Positions 1332 2069. XX SQ Sequence 738 BP; 260 A; 112 C; 122 G; 244 T; 0 other; gaggcggttg taaactattc gttttttagc aaaaattaaa aaaaaaactt tatttaaatt 60 taaaaataaa tatcatatgt tatcatacct tagaatatca cgccttaatt tagtaatcat 120 tttagttcat aagtgtgaga gaaagaaaaa tggattaatc atcggaatta aatatcatca 180 gaatgattat attcatcaaa ggtcccactt tacttgaggg ttaatttgat cgatataaaa 240 tgtttatctc atgtgaagaa aggttcgaat tctgtaccta tgaatgctgc cgatatttga 300 attcctatct catggaaaga aggcctttct tgtgggaaga acggtgttat aaaaaaatat 360 acgcaggtac agaatttgaa cttttcttcc catgagagag gaatttgaat attggcattc 420 tataacttca ctaagaaaaa aaatctccaa aggttgatcg cgaatatgta gagattcgtc 480 gcatcccttc aaatattacg aaatttccaa gtttcttagt gataatcttt attcagatta 540 ctgaaaaata tacacagatt acttttgagt tgcctgcgaa gttattgtaa aagccaaggc 600 gagctgttgg tacagaattt gaagttttct tcccatgagt taggaatttg aataatggca 660 ttctataact ttactaagaa aaaatatccc caaaggttgc tcgcttccct ccaaatatta 720 cgaaatctcc aagtctta 738 // ID CERP6 repbase; DNA; CEL; 1940 BP. XX AC . XX DT 23-AUG-1999 (Rel. 3.6, Created) DT 23-AUG-1999 (Rel. 3.6, Last updated, Version 1) XX DE C. elegans transposable element - a consensus. XX KW Repetitive element; CERP6. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1940 RA Jurka J., Surzycki S. and Belknap W.; RT "CERP6."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [1] (Consensus) XX CC A palindromic sequence, distantly similar to TC5 DNA transposon CC at positions 228-310 and 1643-1725. XX SQ Sequence 1940 BP; 665 A; 319 C; 328 G; 628 T; 0 other; ttaaaagcaa agggtgtgcg cctttaaagt gtactgtagt ctgaaagtat tgtttctgtg 60 gcaaaatgtt ttaatcgaaa atctattaaa aatcgataaa tattccgcgg gacatggtac 120 cgtattttta aaagaaaaat tgtaaagggt aatagttacc gtaataccct aacaagggaa 180 gtctttgagg gggtccgtag atttggggtt ctcatgctaa aattcctaca gaagagtgtt 240 agttatgatc tctccaaaaa atttagctgc ccaggtcaag tttcagcaaa gttatgacgt 300 tttgaaattt cagttaaaaa caccattgaa atccactgtc ttaccatgca atccacgcaa 360 atctcagctt gcgtgaccac cgaaaatgtg acacccacca cattgagttg aaaaatgtcc 420 tcggtggccg agttgggagt gcgcgcgtct gataagattt aagctttggt tcgattcctt 480 ctatttttga aatatttttg taagttgaat aaagttgtaa aacaactcat tcaaacattt 540 ttgcgcattt ttaaagtgat tttattctta ttcgggaacc tagaatcatt gtccgcactt 600 tttggaaatt tttatttttt tcatttttgc tcaaaatttc ttgatcaact ccaagcaaaa 660 aattcaaaaa atttcatttt tctaaacaat tatgaaattg ctatgttgtt gttcagaaat 720 gtatgaaacg tacattacac aagttttaac tctctattcg caagtaaacc gtcgaaatga 780 tctacatctc acgaactttg tgcaaaatat ttaaccaact ttgaagttgc ataacttcgt 840 tgagataaat tattttgaaa aatgatcaac taacaaaatg tttgttgaat aatagtgaac 900 aaagttttaa ttataaactt tttgatacct tcagccacca aaaagttgtg accgttttcc 960 aaaacacaat gttaatagaa aaactagcca aatgccaacc ttgttttctt ctttgtagct 1020 ggaggtatca aaaagtttat aactaaaact ttgttcacta ttattcaaca aacattttgt 1080 tagttgatca tttttcaaaa taatttatct caacgaagtt atgcaacttc aaagttggtt 1140 aaatattttg cacaaagttc gtgagatgta gatcatttcg acggtttact tgcgaataga 1200 gagttaaaac ttgtgtaatg tacgtttcat acatttctga acaacaacat agcaatttca 1260 taattgttta gaaaaatgaa attttttgaa ttttttgctt ggagttgatc aagaaatttt 1320 gagcaaaaat gaaaaaaata aaaatttcca aaaagtgcgg acaatgattc taggttcccg 1380 aataagaata aaatcacttt aaaaatgcgc aaaaatgttt gaatgagttg ttttacaact 1440 ttattcaact tacaaaaata tttcaaaaat agaaggaatc gaaccaaagc ttaaatctta 1500 tcagacgcgc gcactcccaa ctcggccacc gaggacattt ttcaactcaa tgtggtgggt 1560 gtcacatttt cggtggtcac gcaagctgag atttgcgtgg attgcatggt aagacagtgg 1620 atttcaatgg tgtttttaac tgaaatttca aaacgtcata actttgctga aacttgacct 1680 gggcagctaa attttttgga gagatcataa ctaacactct tctgtaggaa ttttagcatg 1740 agaaccccaa atctacggac ccccctcaaa gacttccctt gtaaattcaa ttacctagtt 1800 tttagaaaaa taaaaataga aaagcaattt aatttttttc aggaaataat tgccggaatg 1860 cttgatctac caccgaaaaa gtggagaaaa cgagaaaccg atgaaatgtc gaagcagaag 1920 tctcgtgctg aaaatttcaa 1940 // ID CEREP2A repbase; DNA; CEL; 249 BP. XX AC . XX DT 07-SEP-1999 (Rel. 3.7, Created) DT 07-SEP-1999 (Rel. 3.7, Last updated, Version 1) XX DE CEREP2A dispersed repetitive element - a consensus. XX KW dispersed repetitive element; CEREP2A. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-249 RA V V. and Jurka J.; RT "CEREP2A."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC This element does not have any noticible hallmarks of known CC retroelement or DNA transposons. There are about 200 copies CC of CEREP2A in the C.elegans genome. They are ~95% identical CC to the consensus sequence. CC CEREP2A encodes 83 aa-long protein that has distant similarity CC to one of the conservative domains in mariner/Tc-like CC transposases. XX SQ Sequence 249 BP; 80 A; 80 C; 35 G; 54 T; 0 other; accagccgtg gaccgcacct ccggcgcggc ccccgacttc tggggctgaa aactaatttt 60 tctgaaacta ccgtaaccct acagtattcc taccgtacca ctattgtacc actacagtac 120 cccgactata tccctacact aaccccaact cactatccct ccagaagcca aaacttcaca 180 gactacaaag actacataga ctacaaacta tggacacaca gaataagcgc tttatatata 240 gtaaatgat 249 // ID RC14 repbase; DNA; CEL; 169 BP. XX AC L00589; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Caenorhabditis Rc14 repetitive element. XX KW RC14; Repetitive element. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-169 RA Ruvolo R.V., La Volpe A., Naclerio G., Coulson A. RA and Levitt M.A.; RT "Tandem repeats and SINE-like elements associated with insertion RT sites of the Tc2 transposon of C. elegans."; RL Unpublished (1992). XX RN [2] RP 1-169 RA Naclerio G., Cangiano G., Coulson A., Levitt M.A., Ruvolo R.V. RA and LaVolpe A.; RT "Molecular evolution of clusters of satellite-like DNA sequences RT in Caenorhabditis elegans."; RL J. Mol. Biol0-0 (1992)In press. XX DR GenBank; L00589; Positions 104 272. XX SQ Sequence 169 BP; 38 A; 55 C; 19 G; 57 T; 0 other; ctcccatgct tacatttccc atgcctacaa ttcccatgct tacacatgtt cccgtacata 60 caaatgttcc catgcttaca ctcagtagta acagtgttcc catgcgttca gctcattttt 120 tctcgttccc atgctacatt tcccatgctt acacgtattc ccgtacata 169 // ID CeRep57 repbase; DNA; CEL; 40 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE C. elegans repeat 57. XX KW MSAT; Satellite; Simple Repeat; Nonautonomous; minisatellite; KW repeat; C. elegans; CeRep57. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-40 RA Jekosch K.; RT "C. elegans repeat 57."; RL Direct Submission to Repbase Update (25-APR-2002). XX DR [1] (Consensus) XX CC Putative novel minisatellite repeat, found 248 times exclusively CC on CC chromosome X in the C.elegans genome. For further analysis CC compare http://www.sanger.ac.uk/Projects/C_elegans/repeats. XX SQ Sequence 40 BP; 7 A; 11 C; 10 G; 12 T; 0 other; tagagcgatt ccttaccctt aaatgggcgc cggtttcgct 40 // ID RTE1 repbase; DNA; CEL; 3291 BP. XX AC AF054983; XX DT 04-FEB-1999 (Rel. 4.01, Created) DT 20-APR-2006 (Rel. 4.01, Last updated, Version 2) XX DE RTE-1 is a non-LTR retrotransposon, complete sequence. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; LINE; SHINE; KW retrotransposons; RTE1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3291 RA Youngman S., van Luenen G.H. and Plasterk H.R.; RT "Rte-1, a retrotransposon-like element in Caenorhabditis RT elegans."; RL FEBS Lett 380(1-2), 1-7 (1996). XX RN [2] RP 1-3291 RA Malik S.H. and Eickbush H.T.; RT "The RTE class of non-LTR retrotransposons is widely distributed RT in animals and is the origin of many SINEs."; RL Mol. Biol. Evol 15(9), 1123-1134 (1998). XX RN [3] RP 1-3291 RA Malik S.H. and Eickbush H.T.; RT "RTE1."; RL Direct Submission to Genbank (21-MAR-1998)Biology, University of RL Rochester, Huchison Hall 334, Rochester, NY 14627, USA. XX DR GenBank; AF054983; Positions 1 3291. XX CC CDS join(65..193,187..3258) /note="apurinic-apyrimidinic CC endonuclease; ORF2 (translational frameshift from ORF1); similar CC to second ORF of many other non-LTR retrotransposons" CDS 65..196 CC /note="putative ORF1" RTE1 has a direct terminal 200 bp-long CC direct repeat [1]. Southern blots indicate that there are CC approximately 10-15 additional Rte-1 elements in the C. elegans CC Bristol N2 genome and a similar number is found in the genomes of CC two other geographically distinct strains. The insertion pattern CC of Rte-1 is polymorphic between these strains [1]. Database CC searches and phylogenetic analysis indicate that representatives CC of the RTE clade of non-LTR retrotransposons are found in the CC bovine and sheep genomes of mammals and in the silkmoth and CC mosquito genomes of insects [2]. In addition, the previously CC identified SINEs, Art2 and Pst, from ruminate and viper genomes CC are shown to be truncated RTE-like retrotransposable elements. CC RTE-derived SINE elements are also found in mollusc and flatworm CC genomes. Members of the RTE clade are characterized by unusually CC short 3'untranslated regions that are predominantly composed of CC AT-rich trimer, tetramer, and/or pentamer repeats [2]. Analogous CC consensus from Tammar wallaby is 95% identical to this consensus CC over the entire length. XX SQ Sequence 3291 BP; 984 A; 840 C; 732 G; 735 T; 0 other; ctctttctgt actaagcctt ttcgtggcga aagggatcaa cagataacca cttcaacaaa 60 ctcgatgatt tcgtcacatc gagagtggct tggtttggaa ccaagtgcaa attatctggc 120 caagaagtac accgtctcac gcaggccaga ggtagtgtcg aatactctct gtccgggcgg 180 ctctagattg gcttagcagg tcattataga gtatctagag acacaacttt tgctctaaac 240 agcaacccat cctgcctcag tcctccgcat ccggcctctg caaagggaaa gacagttcct 300 ggagcacgtc ggaagaggcg atctaatctc ggagtccaaa ccatatgtca atcggacttg 360 gaaattaaga agccgcggga caaaacaaga agaactgact atcgcgtgtg caccttcaac 420 tgcaggtcct tgtcttcgga tgatcgactt gctgagttgc tagaagagac gaggcgaatc 480 caattcgacg tcatcggatt gtgtgaaacc aaacgagcag ccgaggcaca cttgatacat 540 cacgacggta ccggcgtctt cttaggcaaa cggaatgaaa gttctgtatc cggaggggtc 600 ggcttcatag ttcgaagcac cctccttccc aaaatcgtag aagtacgatt tcttagtcac 660 cgcatcggct tactcacctt caaagtgagc cgaaaattca actgcacggt gattcaagtc 720 tatgctccaa ctgcggactc agaccttgag gaaatctgcg acttctacga cagtgttgaa 780 gacgtcttca gagagtgccg aagcaagtat aaacttgtta tcggcgactt caacgctcga 840 atgggatgca gacaaaacaa cgaaaggtac attggccccc atgccatgga accaagaaat 900 gatactggag agcttctcgc aacattttgt gaaaccaacc gtctgtggca cacgaactct 960 atgttcaaaa agcctatgca caaacggtgg actttcgtca gtcccgacgg gaatcacaga 1020 cacgaaattg accacattct ggccaatgga aagtttgtca cagataccac tgtactcccc 1080 tctttcacaa atggtagcga ccataggtta ctccgctgta acctccactt caacaattgt 1140 ctagcaaagt tggagcaggt cagacggaga aaacctccga aaagagtgtt ggacccggcg 1200 attgcttatg caatctcagc gacagtcacg gttcagagtg atcctgattt agataccgac 1260 tatgataatc tgattcagtc actaaaagaa cttcaagacc aagctatcgt tcgaccggcc 1320 aaccactcat ccaacagact ttcggaggaa acccgaaaac tgctgaacaa aagacgtttt 1380 atggatagaa atgatcccca attcaaatct atttcagata aatgtcgtga agcagttcag 1440 aaagaccatg aagcttttgc cagtactcgc cttctatctg ctgcgaacca gaaaaagagt 1500 ttgaaaagag ttgctaggga catcaacgaa tataagtcag ttatcccatg cctcaaatca 1560 acctctactg gtgaaagaat cacttccagg gtgaaaatgg agcaggagat tgagaagttc 1620 tacacggagc tcttcaaaag cgctgtgagc aactctcaaa catcttcaat acccgccacg 1680 gcaactccac cgccgtttct cccggaagaa attcgtcatg ttctccgttc cttcccgaat 1740 ggtaaagctg caggccaaga caaaatcagt gcagatttct tgaaatcttg ccacgataac 1800 gtcattgacc tgataacgga tcgattcaac aggtacctcc acagcagaaa tgtaccgaaa 1860 ccctggaaaa cctccaaaac cactctcatc ttcaagaaag gtgaccgtga gaatttggaa 1920 aactataggc ccatctgcct actacccgta ctctacaaag tattcaccaa gtgtttgctg 1980 aatagaatgc gaagatccct tgatgaggct caacctgtcg agcaggccgg attccgacgg 2040 tctttctcta cgatcgatca catccactcg ctccaaagac ttcttgaagt cggcagggaa 2100 taccagatcc cactgacact tgtcttcata gatttcaaga aggcatttga cagtgttgaa 2160 caccaggcaa tctggaaaag tctcgacgag caaggtgcag atggagccta tattgatcta 2220 ctgaaagagt gttataaaaa ttgtaccaca aattttaccc cattccacag gccagtcgca 2280 gtacctgtga ccaaaggagt tcgacaagga gatcccatct ctccgaatct cttctccgct 2340 tgcctcgaac acgttttccg aaagctttcc tggattgaat tgaaaggaga agctgaggat 2400 tacgatacga tccctggaat gagagtgaat ggcagaaatc taacgaacct cagatttgct 2460 gacgatattg tgctcatcgc caatcatccg aatattgcca gcaaaatgct ccaagaactc 2520 gtacaaaaat gctctgaagt aggtctcgag atcaatactg ggaagacgaa agtcttgcga 2580 aaccgattcg ctgaccccag taaagtctac ttcggtagcc cttcccccac cacccagctc 2640 gacgacgtcg acgagtacat ctacctcggt cgtcaaatca acgcccaaaa caacttgatg 2700 ccggaaatcc accgaagacg tcgagcagcc tgggctgcat tcaatggaat caagaacacc 2760 accgactcca tcaccgacaa gaagattcgt gcgaatctgt tcgactcaat tgtccttcca 2820 gcgctcacct acggttcaga agcctggaca ttcaccaaag ctctatccga acgagtacga 2880 atcacacatg cctccctaga aagacggctt gtgggaatca cactcactca acaacgagaa 2940 cgagatctcc atcgagaaga cattcgtacg atgtctctag tcagggatcc gctcaatttc 3000 gtgaaaaaga gaaagctagg atgggctgga cacgttgcga gaaggaaaga cggaagatgg 3060 accacgttga tgacagaatg gcgcccatat ggatggaaaa ggcctgttgg aaggccgccg 3120 atgcgatgga ctgattcgct gcgaaaggag atcaccactc gtgacgcaga cggagaagtc 3180 atcaccccct ggtccactat agccaaggac cgaaaacaat ggcttgctgt gatccgcagg 3240 aataccacga attcctgaag aacggatcgt ctaagtatct aagtatctaa g 3291 // ID RCA1 repbase; DNA; CEL; 84 BP. XX AC X61258; XX DT 28-SEP-1995 (Rel. 1, Created) DT 28-SEP-1995 (Rel. 1, Last updated, Version 1) XX DE C.elegans repetitive DNA. XX KW Repetitive DNA; RCA1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-84 RA La Volpe A.; RT "Direct Submission."; RL Direct Submission to Repbase Update (16-AUG-1991)A. La Volpe, CNR RL International Institute of Genetics and Biophysic, Via Marconi RL 10, 80125 Naples, ITALY. XX RN [2] RP 1-84 RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and La Volpe A.; RT "Molecular evolotion of clusters of satellite-like DNA sequence RT in Caenorhabditis elegans."; RL Unpublished. XX RN [3] RP 1-84 RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and Volpe A.; RT "Molecular and genomic organization of clusters of repetitive DNA RT sequences in Caenorhabditis elegans."; RL J. Mol. Biol 226, 159-168 (1992). XX DR GenBank; X61258; Positions 398 481. XX SQ Sequence 84 BP; 36 A; 13 C; 12 G; 23 T; 0 other; gacgtcacag actacaaaga ctacatagac tacaaactat ggacagacga actgtatttt 60 tttatataaa atataagtaa tgat 84 // ID HELITRONY1A_CE repbase; DNA; CEL; 3084 BP. XX AC . XX DT 26-FEB-2001 (Rel. 4.2, Created) DT 26-FEB-2001 (Rel. 4.2, Last updated, Version 1) XX DE HELITRONY1A_CE is a non-autonomous DNA transposon - a consensus. XX KW Non-autonomous DNA transposon; HELITRON superfamily; RCA1; RCC9; KW RCD1; RC35; HELICOPY1; HELITRONY1_CE; HELITRONY1A_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3084 RA V V. and Jurka J.; RT "HELITRONY1A_CE."; RL Direct Submission to Repbase Update (FEB-2001). XX DR [1] (Consensus) XX CC There are about 50 copies of HELITRONY1A_CE in the C.elegans CC genome. CC They are ~95% identical to the consensus sequence. There are CC several CC minor subfamilies of HELITRONY1A_CE in the genome. HELITRONY1A_CE CC copies harbor one half of a PALTA3_CE-like DNA transposon CC (positions CC 2890-2715) inserted into an ancestor of HELITRONY1A_CE. About 20 CC ancestral copies, free of the PALTA3_CE-like insertion, are CC present CC in the genome. XX SQ Sequence 3084 BP; 1063 A; 388 C; 655 G; 978 T; 0 other; tcattactta tatataaaaa aatacagttc gtctgtccat agtttgtagt ctatgtagtc 60 tttgtagtct gtgacgtcac gcccaaagtc agtgagagtt gtgggcgggg cacccttcgt 120 ggtgagaccc atcgtggtga gacccttcgt ggtgagaccc atcgtggtga gacccttcgt 180 ggtgagaccc atcgtggtga gacccatcgt ggtgagaccc ttcgtggtga gacccttcgt 240 ggtgagaccc aaaattttgg cgggaaattc aaattttctg tgaaaaacat tttggcggga 300 aattcaaatt ttctgtgaaa aattttttgg cgggaattca aattttctgt gaaaattttt 360 tggcgggaaa ttcaaatttt ctgtgaaaaa aattttggcg ggaaattcaa attttctgtg 420 aaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa cattttggcg ggaaattcaa 480 attttcagtg aaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa tttttggcgg 540 gaaattcaaa ttttcagtga aaaaaatttt ggcgggaaat tcaaattttc tgtgaaaaaa 600 attttggcgg gaaattcaaa ttttctgtga aaaaattttt ggcgggaaat tcaaattttc 660 agtgaaaaaa attttggcgg gaaattcaaa ttttcagtga aaaaattttt ggcgggaaat 720 tcaaattttc tgcgaaaaaa atttttggcg ggaaattcaa attttctgtg aaaaaaattt 780 tggcgggaaa ttcaaatttt cagtgaaaaa attttggcgg gaaattcaaa ttttcagtga 840 aaaaaatttt ggcgggaaat tcaaattttc agtgaaaaaa ttttggcggg aaattcaaat 900 tttcagtgaa aaaattttgg cgggaaattc aaattttcag tggaaaaaat tttggcggga 960 aattcaaatt ttcagtgaaa aaaaattttg gcgggaaatt caaattttca gtgaaaaaaa 1020 ttttggcggg aaattcaaat tttcagtgaa aaaattttgg cgggaaattc aaattttcag 1080 tgaaaaaatt ttggcgggaa attcaaattt tcagtgaaaa aaattttggc gggaaattca 1140 aattttcagt gaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa attttggcgg 1200 gaaattcaaa ttttcagtga aaaaaatttt ggcgggaaat tcaaattttc agtgaaaaaa 1260 atttttggcg ggaaattcaa attttcagtg aaaaaaattt tggcgggaaa ttcaaatttt 1320 cagtgaaaaa aattttggcg ggaaattcaa attttcagtg aaaaaaattt tggcgggaaa 1380 ttcaaatttt cagtgaaaaa aattttggcg ggaaattcaa attttcagtg aaaaaaattt 1440 tggcgggaaa ttcaaatttt cagtgaaaaa aattttggcg ggaaattcaa attttcagtg 1500 aaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa aattttggcg ggaaattcaa 1560 attttcagtg aaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa aattttggcg 1620 ggaaattcaa attttcagtg aaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa 1680 aattttggcg ggaaattcaa attttcagtg aaaaaaattt tggcgggaaa ttcaaatttt 1740 cagtgaaaaa attttggcgg gaaattcaaa ttttcagtga aaaaaatttt ggcgggaaat 1800 tcaaattttc agtgaaaaaa attttggcgg gaaattcaaa ttttcagtga aaaaaatttt 1860 ggcgggaaat tcaaattttc agtgaaaaaa attttggcgg gaaattcaaa ttttctgtga 1920 aaaaaatttt tggagggaaa ttcaaatttt ctgagaaaaa gattttggcg ggaatttcaa 1980 ttttggtttt ggaatcatct ggaaaattcc agaaaattct ataatgttct agaactttct 2040 ggaaaattcg agaaaattct ggaatgttcc agaactttct ggaaagttcc agaaccttct 2100 ggaaagttcc agaaccttct ggaaaattcg aaaaaaattc tggaatattc gtggtgagac 2160 ccataaattt tggccgaaaa ctcaaaattt ctgagaaaaa atttttggcg ggaaattcaa 2220 attttctgag aaactttttg ttagcttaag taccacctgg aactggcgag acccatcgtg 2280 gtgggaccct taaaaatatg ggcgggaact tcaaaattta tgagaaaaga aattttggcg 2340 ggaattcaaa tttccaagaa aaattttggc gggaaattca aattttctga gaaaaagatt 2400 ttgacgggaa attcaaattt tctgaggaaa acatttggcg ggaaattcaa attttggttc 2460 tggaatcaat tttttgtttc agcaccttct ggaagtttca agaaaattct agaatgttct 2520 agaatcatct ggaaaattca aaaaagttct gaaatgctct acaacctcct ataactgtag 2580 gagtttttat ttgtttcgtc acatctatat tcaaagaact tactattagt agttaattat 2640 tcagacttct ccagttaagg aaagttatta acttattgaa aaaactgtaa ctctgtggaa 2700 ttttttcggg ggagggggac agctcgccac gccaacagtg agaaaaatag tgtggtgcgc 2760 ggacctcgaa tgatgtcggc ggctgatatt acctggggta ccgctttttg ccttttcccc 2820 gtgagtgaac gagaaaaagg cggggactat tttggcgtac acgcaacgca gttttataat 2880 tttgagtgta gaaaaatcag aacattctaa ctgtctcttt gaattttctg aatcgaatac 2940 catggaagaa gaaaatgcga tttaatcgat gaagtttcaa cttttttaat gatacatttt 3000 tgaatgatta aaaggacatc tatgtgttat acattttaaa ctccgccagt cgttggccgc 3060 gccggaggcg cggtcatcgg ctgg 3084 // ID HELICOP1 repbase; DNA; CEL; 8486 BP. XX AC . XX DT 30-MAY-2000 (Rel. 4.1, Created) DT 30-MAY-2000 (Rel. 4.1, Last updated, Version 1) XX DE HELICOP1 is a DNA transposon, a consensus sequence. XX KW DNA transposon; helicase; HELICOP superfamily; HELICOP1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-8486 RA Kapitonov V.V. and Jurka J.; RT "HELICOP1."; RL Direct Submission to Repbase Update (MAY-2000). XX DR [1] (Consensus) XX CC HELICOP1 is a DNA transposon; it encodes a DNA helicase-like CC 1466-aa protein which is putatively involved in proliferation CC of HELICOP1 copies. There are several copies of HELICOP1 in the CC genome; they are ~99% identical to the consensus sequence. CC Approximately 2% of the C. elegans genome belongs to molecular CC fossils of nonautonomous elements transposed recently (<10 Myrs) CC by autonomous HELICOP1-like elements. HELICOPY1 and HELICOPY2 CC are examples of these nonautonomous elements. Minisatellite- CC like repeats RCA1, RCD1, RC35, RC123, RCC9, reported previously CC in the literature (Naclerio,G. et al., 1992, J. Mol. Biol. 226, CC 159-168) are portions of HELICOP-like transposons CC (Kapitonov,V.V). CC The HELICOP1 protein is encoded by 9 putative exons and its CC CDS can be defined as: CC 1930-2087; 2135-2412; 2533-3428, 3476-3799, 3850-4520, 4581-5207; CC 5254-5521; 5574-5762; 5814-6803. CC The HELICOP1 protein: CC MDNELTSRPDSIMDTEPTTLPDSAMDTKPTNLPDFPSTSQEPVSRRTRSRTLSDNSNAVW CC KRERRSRETSQESESRLRMDWERKKEKRASMSEEEKAEMKYKKRVWMKKKRNEVAKTHDT CC SSVANPNYLGSMRCVCKNCNARFFQGEVSSSNDHINMCCYFGLTTTLQDRFSNYPDELRH CC LLLDDSSHARNFQKNIRQFNSSFAMASMGAQLDVPRGRGPYCFKIHGQVYHFAGPLHPDP CC GQRPAFGQIYILDTDQATDERLGNPANADCDPAVMAELSSLLLRTNPYAQAYKMMAEVEE CC KENSEAAKEGRHPGCVRLIFDISTTKDPRRYNLPQANEVAVVFVGEDDDVPTTRSLAVHP CC RGGGLKTIRDIDRICDPLTYPVLFPNGTDGWHPDLEKRPSEKKQGRITQKMYYSYLLMER CC SGVFNPLHHGRALFQQFAVDSWVKIEQNRLNYHRTHQVDLKAANYNAVQDYIAGDTDVPG CC RRIVLPSSFPGSPRAMVQNFQDAMAIVSKFGKPDIFLTFTCNPAWTEISENLGPRQSASD CC RPDLIARVFKLKVDALFDDLLNRDILGHVAAYISVFEWQKRGLPHVHMLLTMAENSKPRT CC SEDIDKIVQAEIPNPDNEPELHRIVTTAMMHRPCGAQNPHSPCMVDGHCSKRYPKDFHPS CC TTLNVDGYPGYRRRDDGRYVEYGTQHLDNRRVVPYNKWLLLRYNAHMNVEICGFIEAVKY CC LFKYVYKGHDRAALNIIQNVRGDGNVVDEIREHLDARYVCAPEAIHHILGFKLEKKASVT CC TQQLESSSQTDTTLTAWFKINQKSKDIAESGNIPSTFVDSRQFFYMDMPTHFTFVKKDGW CC KVRGRGTRQIGRMYTVPPYETERYALRILLLNIKGATSFEDLRTVLDENNVPVVYATYVE CC AAKAQGLLNDDSEYLKSLKEWAGCSVPAALRSMFVAIILFNEVHDLNALWDAVKWDLSED CC FRHAGAGKEEAEALAYFDIESRLQRVGKRFDVTKPSINPPPIDLDTVNPAQCASEGNRLL CC ATLNDQQKRAADQILAALDDASLPRLFYLDGPGGSGKTYLYITLYNICVGRGLKVACTAW CC TGIAANLLPLGRTSASLFKLDIRNQCKSSLHQRQLKEAQELAENDVFIWDEASMVPKTAL CC DTVDVLLRDLTKIDQPFGGKILILGGDFRQILPVVERSSRADQVDACIKRSPLWTEFQIL CC HLISNMRVTSGDSDWIQFLLNVGDGSANDSDSKVTLPLSVMCDHNIVEEVFGAVIDPTTS CC DPCDNVILTPKNVDVAQLNDDVHNRMVGEERIYLSRDEVIVEHQADTMHYPTEFLNKMSP CC SSLPPHILKLKKGSVIILLRNLDVSAGLCNGSRFIVETLASHSLGCRFATGERKGHFTII CC PRIDCYDDKNISFQLRRTQFPVRLSFALSINKAQGQSFSKIGLWIPTDVFTHGQLYVALS CC RVRTKEGLIVKSSSNIVTNIVFNEVL. XX SQ Sequence 8486 BP; 2477 A; 1919 C; 1648 G; 2442 T; 0 other; atcattactt atatataaaa aaatagtgtg ggtctgtcca tagtttgtag tctatgtagt 60 ctttgtagtc tgtgacgtca cccctaaagt cactgagagt tgtgggcggg gccagtcacc 120 cttcgtggcg agacccatcg ttgcgagacc cttcgtggcg agacccatcg tggcgagacc 180 catcgtggcg agacccatcg tggcgagacc catacatttt ggcgggaatt taaattttct 240 ttgaaatttt tcggcgggaa ttcaaatttt ctgtgaaaat ttttggcggt agctcaaatt 300 ttctgaaaca tttttggcgg gaattcaaaa tttctctgaa aatttttggc gggaatttaa 360 attttcttta aaattttttg gcgggaattc aaattttctg tgaaaaattt tggcgggaat 420 tcaaattttc tgaaaaattt ttggcgggaa ttcaaatttt ctctgaaaaa ttttggcggg 480 aatttaaatt tccttcaaaa ttttttggcg ggaattcaaa ttttctgaaa aatttttggc 540 gggaattcaa attttctctg aaaaattttg gcgggaattt aaatttcctt caaaattttt 600 tggcgggaat tcaaattttc tttgaaaatt tttggcgtga attcaaattt tctgaaacat 660 ttttggcggg aattcaaatt ttctctgaaa atttttggcg ggaattcaaa ttttctttaa 720 aattttttgg cgggaattca aattttctgt gaaaaatttt ggcgggaatt caaattttct 780 gaaaaatttt tggcgggaat tcaaattttc tctgaaaatt tttggcggga atttaaattt 840 tctttaaaat tttttggcgg gaattcaaat tttctgtgaa aatttttggc gggaattcaa 900 attttctgaa aaatttttgg cgggaattca aattttctct gaaaattttt ggcgggaatt 960 taaattttct ttacaatttt ttggcgggaa ttcaaatttt ctgaaaaatt tttggcggga 1020 attcaaattt tcagtgaaaa attatggcgg gaaattcaaa tttttagaga aattttgtcg 1080 gagtagtcaa atttaacttt gtttgacaga aaaaaaaagt atgaagggga agatagtttg 1140 aaccacagaa ggttcacacc aaattcccgc acacaaccaa tgcgacatac cggcacattt 1200 gtttagaatt taataaacct atagaagagg aaaacttcat gcaataaact tcgagaatct 1260 tctagaaatt tcgaaaaact tctagatatc tcggattacg taacacttga cgacgtcaca 1320 catgatgagg acattaagta aacgtcacag gctgaaccca atataatttt cagaaaattc 1380 tggaaccttc tagaattttg gagaaacttc cagaaaattc tgatgacgtc acaatcactg 1440 gcgtaacaaa aaataagtca cagattatga cgtcacatat gatgacacct atatagtctc 1500 acattcggaa aaaaataggt gatgacgcca aatatatcag cgtcacagaa tgtgacgtca 1560 taggctccgc ccattacagt aaaagaaact tctggaatct tccagaagtt gcagaaaaag 1620 aagagaatgt tccaataacg tcacaaaaga tgacgtcact gataaagaca tcacaggaac 1680 tacccattaa tataatcaag aagattctac aaccatctag aaacttctgg atttttcaat 1740 gtagtgtgca aaatttaaat tttgactatt tcattgtgca atttttaatt tatattatta 1800 caagtttaaa ctatattaaa tctgtccttc taaaacccaa ttcgtttcaa gacccaaaac 1860 ttctagaatc ttccaaaaaa attctatata aggagaagaa aattatcatt ttctcattat 1920 taatcaacaa tggataacga gctaacaagt cgacccgatt caataatgga tacggaacca 1980 acaactctac ccgattcagc aatggatacc aaaccaacaa acctgcctga tttcccttct 2040 acatcccaag aacccgtctc taggaggaca cgttccagaa cattaaggta aggatcttga 2100 atgctcgtgg aattttacca ataagcagtt ttagtgacaa ctcaaatgct gtatggaaga 2160 gagaacgtcg ttcaagagaa acttcgcaag aatcagaatc aagactcagg atggattggg 2220 aaagaaaaaa ggaaaaacga gcttcaatgt cagaagagga aaaggctgag atgaaatata 2280 aaaaaagagt atggatgaag aaaaaaagaa atgaagtagc aaaaacacac gatacgtcat 2340 ctgttgccaa tcccaactac cttggaagca tgagatgtgt gtgcaagaac tgcaatgcca 2400 gattctttca aggtatacat tttttcaaag tttagtgaac ttctacttct tatttccatt 2460 ctgagtttca gattggtttt ttttcataac taataacata tactagcatc atacagaaat 2520 ttaatgtttc aggagaggtg tcctcatcca acgatcatat caatatgtgt tgctacttcg 2580 gattaacaac cacacttcaa gatcgattct ccaactaccc agatgaacta cgacatcttc 2640 tcctcgatga ttcttctcac gcccgtaact tccagaagaa catccgacag tttaattcat 2700 cgttcgcaat ggcatcgatg ggtgcacaac ttgatgttcc gagaggccgc ggcccatact 2760 gcttcaaaat ccacggtcaa gtttaccatt ttgcgggacc acttcacccg gatccaggcc 2820 agagaccagc atttggacag atctacatcc tggacactga tcaagcaaca gacgagaggc 2880 ttggaaatcc tgccaatgcc gattgtgatc ctgctgttat ggctgaattg agcagtcttc 2940 tgttgagaac caatccatat gcacaggcat acaagatgat ggctgaagtt gaagaaaaag 3000 aaaattctga agctgcaaag gaaggacgac atccgggatg cgttcgactg atcttcgata 3060 tttcgacaac taaggatcca cgacgttata acttgccaca agcgaatgaa gtcgcagttg 3120 tttttgtggg cgaagacgat gacgtgccaa ccactcgctc tttggctgtt catccaagag 3180 gaggaggact taagacgatt cgtgatatcg ataggatctg cgatcctctc acatatcctg 3240 tattgtttcc aaatggaacc gacggatggc atcctgactt ggaaaagaga ccatcggaga 3300 agaaacaagg aaggattact cagaagatgt actacagtta cctgctcatg gagagatcag 3360 gagttttcaa tccactgcat cacgggcgtg ccctgtttca acagtttgca gtagactcat 3420 gggtaaaggt aagaatacat gatctcatgt tcacaaaata atcatgtttt tgcagatcga 3480 gcaaaacagg ctaaactatc atcgaactca tcaagttgat ttgaaagccg ccaactacaa 3540 tgcggttcag gattatattg ctggagatac tgatgtccca ggaagacgca tcgttcttcc 3600 atcatctttt ccagggagtc cccgagcaat ggtccaaaac tttcaagacg cgatggctat 3660 tgtctcgaag ttcggaaagc ctgatatatt tttaactttc acgtgtaatc ctgcatggac 3720 agagatttct gaaaacctcg gtccgcgaca atctgcatct gatcgtccag accttattgc 3780 ccgtgtcttc aagctaaaag taagttgaag atctatattt aactcatcac tttcttcatt 3840 gtgttccagg tagatgcgct gtttgacgat cttctaaatc gggatattct cggacacgtt 3900 gccgcataca tttccgtgtt cgaatggcaa aaacgaggtc ttcctcatgt acacatgctg 3960 ttaactatgg cagagaactc aaagccacga acctcggagg acatcgacaa gattgtgcaa 4020 gctgagattc caaatcctga caacgagcca gaacttcatc gcattgtgac caccgccatg 4080 atgcatcgtc catgtggtgc tcaaaatcct cactcgccat gcatggttga tgggcactgc 4140 tcgaaaaggt atccgaagga ttttcatcct agcaccactc tcaacgttga tggatatccc 4200 gggtatcgtc gaagggatga tggacgatat gtggaatatg gcacccaaca tctcgataac 4260 aggagagttg ttccatacaa caaatggctt ctgctccgtt acaatgctca catgaatgtc 4320 gagatttgtg gattcatcga agctgtaaaa tatctcttca aatatgtcta caaaggacat 4380 gaccgtgctg cactcaacat catccaaaat gtacgtggag atggaaatgt tgttgatgag 4440 atccgagagc atctagatgc tcgatacgtc tgtgctcccg aggcaatcca ccacatcttg 4500 ggattcaaac tggaaaagaa gtccgataca gtttattgac tcgctgtgca tctcgaagga 4560 ttccaaacga tctactttag agccagtgtc accacacaac aactcgaatc ttcatcacaa 4620 accgacacga ctctcacagc ttggtttaag atcaaccaaa agagcaagga tattgcggag 4680 agcgggaaca tcccttcgac atttgttgat tcacgccaat tcttctacat ggatatgcca 4740 acgcacttca cctttgtgaa aaaggatggc tggaaggttc gaggaagagg cacaaggcag 4800 ataggccgaa tgtacactgt cccaccatac gaaaccgaga gatacgcact tcgaattctt 4860 cttttgaaca tcaaaggagc gacgtcattc gaagatcttc gtactgtttt ggacgagaac 4920 aatgttccag tggtatacgc aacttatgtg gaagcagcaa aagctcaagg acttctgaat 4980 gatgacagcg aatatttgaa gtctttgaaa gagtgggctg gatgctcagt gcccgcagct 5040 cttcgctcaa tgtttgtcgc tatcattctt tttaacgaag ttcacgatct gaacgcactt 5100 tgggatgccg tgaagtggga tctttctgag gattttcgcc atgctggagc agggaaagaa 5160 gaggctgagg cacttgcata ttttgatatc gaaagtcgcc ttcaaaggta tcttttgagc 5220 ttttttcaaa agtagattat aatatttttc cagggttgga aaaagattcg atgtcaccaa 5280 accatctatc aatccgccac caatcgatct cgacacagtc aaccctgctc aatgtgcatc 5340 agaaggcaat aggctccttg cgacactcaa tgatcaacag aaacgtgcag ctgaccagat 5400 ccttgctgca ttagatgatg cctcgctacc ccgtcttttc tatttggatg ggcctggagg 5460 gagcggaaag acttatctct acatcactct gtacaacatc tgtgttggaa gaggactgaa 5520 ggtaaacttt caaccaaaca tactaaagtt atttcaatac tgcatatttt caggtagcgt 5580 gtacggcgtg gacaggaatt gcagccaatt tactcccact cggtcgaaca tccgcttccc 5640 ttttcaagct tgatattaga aatcaatgca agtcttctct tcatcagagg caattgaaag 5700 aagcacaaga gctagcggag aacgacgttt tcatttggga tgaagcatca atggttccca 5760 aggtattaga gtgattctga tcgtactaat caaatcattc attcaacttt cagacagccc 5820 tcgacacggt cgatgtactg cttagggatc tcacaaagat cgatcagcca tttggtggga 5880 agatcctgat cctcggagga gactttcgcc aaatcctacc tgtggtggag agaagctcta 5940 gagctgatca agtagacgca tgcatcaaaa gatcaccact atggacagag tttcaaattt 6000 tgcatctcat ctcaaatatg agagtaacat ctggagactc ggattggatc caattcctcc 6060 tcaatgttgg cgatggatct gcaaatgact ctgattcaaa ggttacactc ccactctctg 6120 tcatgtgcga tcacaacatc gtcgaggaag tgttcggagc agttatcgat ccaaccacgt 6180 ctgatccatg tgacaacgtc atccttacac ctaagaacgt cgacgttgct caactgaatg 6240 acgatgttca caatcgaatg gttggcgagg aaagaatcta tctgtcacga gatgaagtga 6300 tcgtggaaca tcaagcggat actatgcact atcccaccga gtttctcaac aagatgtccc 6360 catcatctct tcctccccat attctcaagc tgaagaaagg ctcagtgatc attctgctca 6420 ggaatctcga tgtttccgct ggcctttgca atggttcccg tttcattgtc gaaacattgg 6480 ccagtcactc acttggatgt cgctttgcaa ctggcgagag gaaaggacat ttcacaatca 6540 ttccaagaat cgattgctat gatgataaga acatctcttt ccaactcagg cgcacacagt 6600 tcccagttcg cttatcattt gcattatcga tcaacaaggc ccaaggacaa tccttttcta 6660 aaatcggtct ctggattccc accgatgttt tcactcatgg acaactatat gttgctctat 6720 caagagttcg taccaaagaa ggtcttattg taaaatccag ttctaatatt gtcaccaata 6780 tcgtttttaa tgaagttctt tgaataaagc ttattattct tctctgctat ctcgctagac 6840 aactcagctc tcttctgagg tctcactcga caactcatct cttctctgct atctcgctag 6900 acaactcagc tctcttctga ggtctcactc gacaactcat ctcttctctg ctatctcgct 6960 agacaactca gctctcttct gaggtctcgc tcgacaactc atctcttctc tgctatctcg 7020 ctagacaact cagctctctt ctgaggtctc gctcgacaac tcagctcttc tctgctatct 7080 cgctagacaa ctcagctctc ttctgaggtc tcactcgaca actcatctct tctctgctat 7140 ctcgctagac aactcagctc tcttctgagg tctcgcccga caactcatct cttctctgct 7200 atctcgctag acaactcagc tctcttctga ggtctcgctc gacaactcat ctcttctctg 7260 ctatctcgct agacaactca gctctcttct gaggtctcgc tcgacaactc agctcttctc 7320 tgctatctcg ctagacaact cagctctctt ctgaggtctc gctcgacaac tcatctcttc 7380 tctgctatct cgctagacaa ctcagctctc ttctgaggtc tcgctcgaca actcatctct 7440 tctctgctat ctcgctagac aactcagctc tcttctgagg tctcactcga caactcatct 7500 cttctctgct atctcgctag acaactcagc tctcttctga ggtctcgctc gacaactcat 7560 ctcttctctg ctatctcgct agacaactca gctctcttct gaggtctcac tcgacaactc 7620 atctcttctc tgctatctcg ctagacaact cagctctttt ctgctatctc gctagacaac 7680 tcagctctct tctgaggtct cactcgacaa ctcatctctt ctctgctatc tcgctagaca 7740 actcagctct cttctgaggt ctcgctcgac aactcatctc ttctctgcta tctcgctaga 7800 caactcagct ctcttctgag gtctcgctcg acaactccag ccttctcact cgcccccata 7860 tgtaggagta tggggggagg gagtattatt ataaccacag aaacacaaca atgtgccaca 7920 taagagacag gagatagtgc tcgcactacc cctgaatcac caaccctctc agaagcacct 7980 ctgctcttat ctgggtctcg ctagacatct ccagtagttt gaccattctc agcaacactt 8040 ttgctatttt agaatttttt tgccctcgcg gggatcgaac caaaaggtcc gcgcgccaat 8100 gacgcgcgcg ctaccagctg tgccatgctg gaagcgattg aacttcgacc aataacaata 8160 tacaacattt ctttatttgt caaagtgaaa tagttttgaa caaacactat tcaaaaaagt 8220 aattttaaat tgaaattata tcaaattcac gatttgttcc caaatattta aaactgaaac 8280 tcaataagag aagatctcat attctcactg ataatccgaa acatgacatt tgtttcaatg 8340 ctgttttcaa cctcaagttg taaaattcaa aaaccaagct gaaaatatgt tttaaaaatc 8400 tagaaactac aataattttt atgtgaattt ttggaaaaat ctcaagccag acggagcacg 8460 cgccttggcg cgtgcgaacg gctggt 8486 // ID CERP1A repbase; DNA; CEL; 1341 BP. XX AC . XX DT 28-SEP-1995 (Rel. 1, Created) DT 09-JUN-1999 (Rel. 3.3, Last updated, Version 3) XX DE Nematode CeRep1.1 repeat DNA - a consensus. XX KW CeRep1 repetitive sequence; CERP1A. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1341 RA Felsenstein M.K. and Emmons W.S.; RT "Structure and evolution of a family of interspersed repetitive RT DNA sequences in Caenorhabditis elegans."; RL J. Mol. Evol 25, 230-240 (1987). XX RN [2] RP 1-1341 RA Jurka J.; RT "CERP1A."; RL Direct Submission to Repbase Update (JUN-1999). XX DR [2] (Consensus) XX CC This element has a 384 bp perfect palindromic structure. XX SQ Sequence 1341 BP; 401 A; 264 C; 246 G; 430 T; 0 other; cgagggggta cgttttcgta aggggatttt ttcgtaagga acgctttcag tagggcaatg 60 tgcggcgcgc cttttcgcaa gcccgccgca cagtccttcg caatgcgccg cacggctttt 120 tgcaatgcgc cgcacgcttc ttcgtattta tttatttatt tttccaacag gaaatttcga 180 acggaattca attaaaaacg gatttttaat ggaaattaat tagttttatt cgagctaaat 240 acggatattt tcagaatttt cgatgacaaa aacgggtttc aatcgatttc gaatcgatca 300 aaattaatat taatcgaaaa tgagcgtttc cttcccggaa aaagaagaac ttgatgctga 360 gagtgatcca agcggagcca aacgcttttt tagaggcaag tattgatgtc taaagattaa 420 ttgaagagtg aaaagtattt ttcagagttc acaaacaacc cgagtgagca gcaattttcc 480 aagctgaagg caagtttctt tttgtagttt tttaagttct caaataggta tataacatcg 540 atttgttttg ttttccagag aatgtatcat cacaacgact atcaaaacct gcgaggagtg 600 attcaaaaga ccattcaact atcaaaatca ttgatcctag acttgaagag acgatccgga 660 aatcgaacaa ccgctgtgcc attatacaaa aaccccgatt tctgtacttc agtgtttttg 720 aattataccg gtttcagtgc ttttaatgag aaacttttcc cttttatgga aattccgaaa 780 tgctatcact gatctctttg tttacctctg tgccattaga ttatagaaac aattttccat 840 actcaggtta tccaaatttc gccatgattt ttcttactta atttaatcga attaactttt 900 gtttgcagtt ttattttcca tttcgactga ataaaaaatc acttctatca ttttaatcgt 960 ttggctccgc ctggatcact ctcagcatca agttcttctt tttccgggaa ggaaacgctc 1020 attttcgatt aatattaatt ttgatcgatt cgaaatcgat tgaaacccgt ttttgtcatc 1080 gaaaattctg aaaatatccg tatttagctc gaataaaact aattaatttc cattaaaaat 1140 ccgtttttaa ttgaattccg ttcgaaattt cctgttggaa aaataaataa ataaatacga 1200 agaagcgtgc ggcgcattgc aaaaagccgt gcggcgcatt gcgaaggact gtgcggcggg 1260 cttgcgaaaa ggcgcgccgc acattgccct actgaaagcg ttccttacga aaaaatcccc 1320 ttacgaaaac gtaccccctc g 1341 //