ID CeRep55 repbase; DNA; CEL; 25 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE C. elegans repeat 55. XX KW MSAT; Satellite; Simple Repeat; C. elegans; CeRep55; KW minisatellite; repeat. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-25 RA Jekosch K.; RT "C. elegans repeat 55."; RL Direct Submission to Repbase Update (25-APR-2002). XX DR [1] (Consensus) XX CC Putative novel minisatellite repeat, found 1040 times in the CC C.elegans genome. For further analysis compare CC http://www.sanger.ac.uk/Projects/C_elegans/repeats. XX SQ Sequence 25 BP; 4 A; 8 C; 2 G; 11 T; 0 other; ctggcttccc actatatttt actct 25 // ID TC5A repbase; DNA; CEL; 1608 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE TC5B nonautonomous DNA transposon - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW CERP6; TC5 family; TC5A; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification of putative nonautonomous transposable elements RT associated with several transposon families in Caenorhabditis RT elegans."; RL J. Mol. Evol 43(1), 11-18 (1996). XX RN [2] RP 1-1608 RA Kapitonov V.V. and Jurka J.; RT "TC5A."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [2] (Consensus) XX CC TC5A is a nonautonomous DNA transposon derived from TC5-like CC autonomous transposon. CC There are about 50 copies of TC5A in the genome; they are CC ~99% identical to the consensus sequence. CC TC5A elements are flanked by 3 bp-long target-site CC duplications (duplications of TNA in CTNAG targets sites) and CC 773 bp-long terminal inverted repeats. CC TC5A is 63% identical to TC5. They share identical 9 bp-long CC termini and 73% identical subterminal repeats (position CC 56-138). XX SQ Sequence 1608 BP; 534 A; 276 C; 269 G; 529 T; 0 other; caagggaagt ctttgagggg gtccgtagat ttggggttct catgctaaaa ttcctacaga 60 agagtgttag ttatgatctc tccaaaaaat ttagctgccc cggtcaagtt tcagcaaagt 120 tatgacgttt tgaaatttca gttaaaaaca ccattgaaat ccactgtctt accatgcaat 180 ccacgcaaat ctcagcttgc gtgaccaccg aaaatgtgac acccaccaca ttgagttgaa 240 aaatgtcctc ggtggccgag ttgggagtgc gcgcgtctga taagatttaa gctttggttc 300 gattccttct atttttgaaa tatttttgta agttgaataa agttgtaaaa caactcattc 360 aaacattttt gcgcattttt aaagtgattt tattcttatt cgggaaccta gaatcattgt 420 ccgcactttt tggaaatttt tatttttttc atttttgctc aaaatttctt gatcaactcc 480 aagcaaaaaa atcaaaaaat ttcatttttc taaacaatta tgaaattgct atgttgttgt 540 tcagaaatgt atgaaacgta cattacacaa gttttaactc tctattcgca agtaaaccgt 600 cgaaatgatc tacatctcac gaactttgtg caaaatattt aaccaacttt gaagttgcat 660 aacttcgttg agataaatta ttttgaaaaa tgatcaacta acaaaatgtt tgttgaataa 720 cagtgaacaa agttttagtt ataaactttt tgataccttc agccaccaaa aagttgtgac 780 cgttttccaa aacacaatgt taatagaaaa actagccaaa tgccaacctt gttttcttct 840 ttgtagctgg aggtatcaaa aagtttataa ctaaaacttt gttcactgtt attcaacaaa 900 cattttgtta gttgatcatt tttcaaaata atttatctca acgaagttat gcaacttcaa 960 agttggttaa atattttgca caaagttcgt gagatgtaga tcatttcgac ggtttacttg 1020 cgaatagaga gttaaaactt gtgtaatgta cgtttcatac atttctgaac aacaacatag 1080 caatttcata attgtttaga aaaatgaaat tttttgaatt ttttgcttgg agttgatcaa 1140 gaaattttga gcaaaaatga aaaaaataaa aatttccaaa aagtgcggac aatgattcta 1200 ggttcccgaa taagaataaa atcactttaa aaatgcgcaa aaatgtttga atgagttgtt 1260 ttacaacttt attcaactta caaaaatatt tcaaaaatag aaggaatcga accaaagctt 1320 aaatcttatc agacgcgcgc actcccaact cggccaccga ggacattttt caactcaatg 1380 tggtgggtgt cacattttcg gtggtcacgc aagctgagat ttgcgtggat tgcatggtaa 1440 gacagtggat ttcaatggtg tttttaactg aaatttcaaa acgtcataac tttgctgaaa 1500 cttgaccggg gcagctaaat tttttggaga gatcataact aacactcttc tgtaggaatt 1560 ttagcatgag aaccccaaat ctacggaccc cctcaaagac ttcccttg 1608 // ID CER12-LTR_CE repbase; DNA; CEL; 537 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER12-LTR_CE is a long terminal repeat of the CER12_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER12-I_CE; KW CER12-LTR_CE; CER12_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER12_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 537 BP; 120 A; 152 C; 88 G; 177 T; 0 other; tgttgtgact tcgcctccgt agaggtcctc cgccccaact ttgcggatta taattatcgg 60 caagaatcgc ccacaatcca gcgataatca tccgctgatt tcccactcac acatttccgt 120 taaattgttc gttctaataa tctttccgcc atttaaatta attaaattaa ataattataa 180 ttcaaatttg ccgccaagga tacaagcgcg cgccactgta caccggagtg tcgtagcgcg 240 cgctgtttat tcccttgctc acattaatta ttggcctaaa atgttgccat tctcttcatt 300 ctggttctcc cgacgacatc gctgcccgcc tgccgatctt cgtcccaata ttcctaaata 360 ttctcgccgt tttttccctg tttattccag ctctatgcta ccctgctaat aaacttggat 420 tttcgtgaac tgtgtactgt tatgaactcg cattgctcat cctacatttc gcattccttc 480 gtcgtccaca atttgtcact ctgaggtcct ggcttcctct cagacggagt gtcaaca 537 // ID PALNN1_CE repbase; DNA; CEL; 210 BP. XX AC . XX DT 08-SEP-1999 (Rel. 4.08, Created) DT 08-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE nonautonomous DNA transposon, consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; PALNN1_CE; TIR; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-210 RA Kapitonov V.V. and Jurka J.; RT "PALNN1_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC PALNN1_CE elements generate 2 bp-long target-site duplications; CC PALNN1_CE forms almost perfect palindrome. CC About 20 copies of PALNN1_CE are dispersed in the genome. CC This family of transposable elements is very young (it may CC be still active in the C.elegans genome) since there is only CC 0.005% average divergence of PALNN1_CE copies from the CC consensus sequence. XX SQ Sequence 210 BP; 66 A; 38 C; 38 G; 68 T; 0 other; ctcagtcaag agcgagtcac ggcaactcgg tccaaaacca tttctaatta gtaaactctc 60 aaaaaccaca actaaatagc ttaaaaccat tgtaaattag cttatttttg ctaattagca 120 atgattttaa gctaattagt tgtggttttt gagagtttac taattagaaa tggttttgga 180 ccgagttgcc gtgactcgct cttgactgag 210 // ID CEREP3 repbase; DNA; CEL; 302 BP. XX AC . XX DT 01-DEC-1999 (Rel. 4.11, Created) DT 03-OCT-2005 (Rel. 10.11, Last updated, Version 2) XX DE CEREP3 dispersed repetitive element - a consensus. XX KW CEREP3; dispersed clusters. XX NM CEREP3. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-302 RA Kapitonov V.V. and Jurka J.; RT "CEREP3."; RL Direct Submission to Repbase Update (NOV-1999). XX DR [1] (Consensus) XX CC This element does not have any noticeable hallmarks of known CC retroelements or DNA transposons. There are about 100 copies CC of CEREP3 in the C.elegans genome. They are ~73% identical CC to the consensus sequence. CC There are multiple clusters composed of tandemly repeated CC copies of CEREP3 in the genome. CC The most conservative portion of the repeat is a 70-bp CC 5' terminus. XX SQ Sequence 302 BP; 122 A; 41 C; 34 G; 105 T; 0 other; tttggccaca gtggggactc gaaccccggt cttgtgattg gtagtcggcg aaaatcccac 60 tgcgccattt ggacaaaatt tttgaaaatt ctgaaaattg atgaaaatta taaaaaatca 120 cagttttaag ttaaatttta aatttttata tttaaaaaaa tcaaatttta aactttttta 180 aaattaaaat tttgaaacta aattttcgac attttctaga aatttttgag tttctaaaac 240 tttaaaattc gaattctaca tctttaaatt acccaaaaac attaaaaaac caaaaaaaaa 300 at 302 // ID CER16-I_CE repbase; DNA; CEL; 6499 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER16-I_CE is an internal portion of the CER16_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER16-I_CE; KW CER16-LTR_CE; CER16_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER16_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 4607..4915 CC /note="predicted Protease region (full)" CC gene 4295..4417 CC /note="predicted duplicate Protease region CC (fragment)" CC gene 3652..3864 CC /note="predicted Gag region" CC gene 5135..5821 CC /note="predicted Reverse Transcriptase region" CC gene 5912..6301 CC /note="predicted RNase region" CC primer_bind 2..19 CC /note="predicted Primer Binding Site (Gly-GCC)". XX SQ Sequence 6499 BP; 1706 A; 1839 C; 1374 G; 1580 T; 0 other; ttggtgcatc gaccgaaccg gctggtaaat gactcaattt ccagccggct gaacgaaccc 60 agagtcatgg aggcacgcga cgcaggagga tgagccttga ggttagtgta atttctcaaa 120 ttcactattt tcagtcgcat tcaagtgatt caagctcctt tattgttgag ccagccgtca 180 gtttccccgt acggaccgtt tccgtctgaa actccaggaa tttgctctga aaatttcctc 240 aacggcggat tttccgctct cctctgcgga aaatcggatt tttctctgaa aaaattgttt 300 tctagtgctt tctcttgtta tttcggctcg aatttctcaa attcagccgc cagtttcccc 360 gtacggtcca tttccgtccg aaactgatgc aatttcctct gaaaattgca tcatcgcctt 420 ttagcattga gagcactaga ttggcgtgac cagccgtcca ggttttgtgt ccgccgctca 480 aatcttcact ttttgtgatt ttgcgacttt gacaaccttt gtcgaaagtc gctacccaac 540 ccggcttatc atctggtaca gccacatatg gactcacctc ttgcccaagc ttcgtgaaac 600 ccatcagaat gcttgccact gccatttgga agatgacctt caccaggtcg aagaaaacta 660 gcctacttga gctcttcttc tttcttttca ttcgttttct aacgaataaa gacttccatt 720 gtaaatcctg acttaaaaat ctaaataaaa tccctataaa gctatgttag ttcgaatctt 780 ccagttctgt tctaaaagac ttcgcgtcct aggagactgg acagagtcac ccggaggtgc 840 tccgtcaaga gttgattaat tctattcgca actgaccggt catctagatc cggataagtt 900 ggtcagctcc gccagtttct gaaaagactg cgaataggag acttcttggt ccgcatatgc 960 gacacatcaa cagacgttca gtgggttcaa actggtcgtt tggtcgcata tgtacacatg 1020 catcttagtg gtggacctag aaggtttctc gaagagtcct agacttgacg agattcaata 1080 aagacgacgt gagatagtat tgctccgcaa cgcatgctct cttcatcctt ccgtataaca 1140 tcgctgatcc ctgttatcgc ataaacaagg agctagggct aggattctca gatgcatggg 1200 aatccgaacc agcacccgca tgaagtacgg aaagatgcat ccggagttcc gcacaacacc 1260 ggtagtggtc ttccccaaag tccactagaa ataccaccac gcatcccgtg gaatggctaa 1320 cgcatgccat tttccacaaa accccagggc gagttatgca acttgtccta tggactcgca 1380 actccatgaa acccttgaaa acggcgcagt tttcccatac gctactccgc ccactcggcg 1440 agcgtaactc caacagatta ttgaactcct cgatttcatt aatcacctgc tctattttca 1500 gttaactggt ttttattgtc ctcgacaatc accctgaaaa tgagctccca acaactaccg 1560 gaagtccttc ccaatgatac tattcctaat ggtgccattc cggaggcatc tgcggctatc 1620 caacctggaa accgcgcggt caccgcttct ggcgacccaa ctggcatgtc gtctcgagga 1680 cgtcctcgac attctcactc gtccaaaacg agaagccctg ccgctcccat caagggttat 1740 attacccgca tcatcaacca atctcggctc tgccacaacg aagccagccg actgatgctt 1800 gagcactttg gtgtcaatcc tgacccttct tcgccgatcg aagaagaagt gctccgcgct 1860 gctaaccaag cggtcttgaa aacccgccgt tatgctgaag cgctagggaa tctttgtcaa 1920 tacgtagatg acaagttcca agaaccagct ctgcgagact ctcctctgag agacgcatat 1980 cacagagagg tccaacgcca tctcgcagag ctcgatcctc agtcaatctt gaccgagatc 2040 aaccgcgata ttgtcatgct cgaccgcgag ctgacagacc acggctttcc cagctcgtcg 2100 aaatttgcaa cgcgacatgt tctcactccg tacgaatacg agtcgagcga catcgagtct 2160 tcgtctgatc ttcgcagcat cgacgacgcg ttcgacgggc tcaagcgtta tatgggaacg 2220 aaccacgttc tccatacggg cactgcgcca gttgctccag ccgaatgtcg cgcaacgatt 2280 catagaggta cccctcctcc ttatgattcg tcaaacttaa attaccgtac agtttctcag 2340 ttcaccgcga gaactcccac tcgtagttct gatgacgatg tttggtctat tcagcagacc 2400 ctagcagacg agctagcaga agaacggcgt caacgccgcc tggaacaagc cagaagtcgg 2460 gctcttgaag agagactgat ggaacgtcaa agaactgaga acgtacaaag ggctgaggaa 2520 acaccgatcc ttgatcaacc tcgccctcca gtcaccgccg gctacccaac gcagccactc 2580 ccagtggatg ctgtcctttg gcagacagtc ctggacatgg gtcaattgct cacatctcta 2640 cgagaagagc aagcaacagt ccgccaacat ctagcgtcat tcatgcatga tgtcacggaa 2700 cgcttacgcg cagccgatga gggcaacgca tgccctcagg agtccgttgt agatgaaagt 2760 gaagctgagg aggaacggag tggtccgcca cctacacctc aagcagacac aaacatccac 2820 tcctcatacg gagtaatcaa ctttgagact aacactaaga acttgccaaa gttcgatggc 2880 accggcaact tccgtgcatt caagaatggg tttaatacag tcgttcttgg cagcagactg 2940 cccgcggtga ccaaatataa tcttttacaa aaccacctga ttggcaaagc taaccagtgc 3000 gtctccacgc atgacgatcc atccgtggcc tacaaaacca caatggatat gctcgaatct 3060 gtttatggca gaggagacac tcagcgaggt ctcctcgagc gactcaggaa gctgaaattc 3120 agccagtcct tccctgaaca gatgagattg gatctcactt cacaccacct cctggtacaa 3180 agactggtcg ctaccggtct ctccgtctcc gacgatcgcc tcatcatggg gatcatcgga 3240 aagttacctc tatctttcag agataaggtc acggagttct atacggagct cggcgaacac 3300 gctacagcca cctccctgta ccagagaatt cgcaagcaca ttgacagctt cgaaaatggg 3360 ctcatcgcag ctagtatgag cccaccccag gtggtccctg tgaatgagat tccggcacag 3420 tacgctgaag catccgcttt tcacgtgcaa gagcaaaaac ccactggtta cagcttgaat 3480 ggaggcttta caccaaagac ctccaagcca atcttcaaca taaatgagca ctccggctcc 3540 tatgttgacc ctatcagcgg cgtcaccttg aaaggttatt tcaagccagg gacgagaggt 3600 gtgaatttga atcgcattca cagcaccttt cccttgagtg acgggacagc cagcacacct 3660 tgtgcggcct gcaacggcag ccacagtccg gtgagatgtt atctccccag ccacgctttc 3720 cgcaaagcgt tagagcaaaa aggactgtgc ccaaactgct gtaaatcgca tcccatcgaa 3780 aagtgcactt cactgtactg atgtgtgtat tgcgacggaa ttcatcattc cggcggctgt 3840 cccttgaagg aacactttcg agacatggcg aactacccac cgggcgctag accggctatc 3900 cagttatttc gtgagatccc tcttcgcagc gagaggggaa cttgcaacac ctcccactaa 3960 atcattgtgg gcgggcacgg ggccgaccgc acagcgcccc ctgtcaaacg ctgattcgca 4020 atcagcatgt tgctatgatg ttctctttca ctgtaactct tccagttctg tcatcgtcgt 4080 tccttcaatg tcgcttctaa aaattaaaaa ttcaaattta aatctaaaca taaccgatac 4140 atgtcgccca gaggagcaac tcgtctccga gtatgcgtct ttcccgtcct gtaaagacat 4200 tccgcatcta gtgaccacta tccgagaccc tggcgatgat aaagtcaccc tcatgcgctt 4260 ttacacagac gctaaccaac cattgttggc tttagtggac tccggcgcgt ctctgtcact 4320 tatccagtac cgaaaagcta tacgaatcag tctgaagtcc tctggctcag caactctatt 4380 gctgcaaggc ttctacgatg cctcgacctc tgtatctcca aatattaaaa ctctaaatat 4440 tatcgttaca agtccacttg ctgagaagaa acgcatctcc aagtttgcgg ctttcctgtc 4500 ctggacagat gtgccacatc gagaggtcga taaaagagac actggcagtg ataaactata 4560 tttcacttgc ctctccacag actcgaacca cacatcgttg gctctagtag gctcaggagc 4620 gtccttcccc ctcgtccagc accgaaacac taaacaaatc ggtctggcct caactggatc 4680 gatcactctc tcagagcaat gttaccgcaa tgcttggacc tctgtagctc aaataaaatt 4740 caaaattaaa attcgctcga ttagcgtcag cctgtcgatc gctggctttg gtaatttacc 4800 tagccccacc ttccgtgatc tcagtttctc tgagcaagac acagtctacc tggactactt 4860 gaaaattgat cacgcagctc caagcgtctc caacggaagg tgtatggacc tgatcatggg 4920 agaaagtgct acatcaacct cgcatgatga gcacgcccag catcgccagc taccgtcgac 4980 actgtcagtt gacaatcgca ttggcttaat ctacgcagag aataatggca aatgggtctt 5040 cgacaccttc gtcgcactca tatattctat actgctagcc aacacggtgc ttcatggtcc 5100 aacagcgaat tccgagctga aattccagtc atctctcaaa cagatgtgga aactcgagac 5160 tctcggtgtc gacccaattc tattggtcga cggagcaaag aactctactt cagattcttt 5220 aaccgagact gagcaaacca ccacacgcac ttttggtcaa ctcgagttcg ctttaccctt 5280 gaacgggatc gagtcgaaac ccaaaattca atatgccatc gcgtacacaa gcatgtgtag 5340 tctagctcag acgttcgcta acggccaaaa tttaatccaa aaattccacc aggtgatcaa 5400 agaccaagag tcggcgaaca ttatcgagcg ggtcactcct gacatggcgc aagagccctc 5460 gcagccgatt accgacgaca acctctgttg ttatcggttc acgagaatcc cattcggggt 5520 tgcaaacagc ccatttcttt tggctgtggc tatccatcat tttctagccc gcaaccctca 5580 tgagctcaac gacgagatca cagaaaatct ctatgtcgat aattgtcttc tcaaacttct 5640 gggcttagaa tggaattcta acgacgattc aatcgacacc cggtcggtga aaatcgcgct 5700 cttggatatt gatcacccca ccaagagaga agtagcttcg aacatggctc aaacctttga 5760 cccacttgaa ctgctcacaa cgagccaagt gttgatcaat cgactcatcc aatatacttg 5820 gattagaggc gtaatctgga tagaccgtat cttaattgag ctgctcccac gttggagagc 5880 attcaaaaag tcttttttag actcgtcaag ttacgtccat aggcaattga cgcacgcaac 5940 agatgcgtac actcacttca aatacgaaga ctgggttcca tccacccgtc tgatctccag 6000 cacatcaggt gccgccccga actacacaag tccaaagttg gaacttatag ggactctccc 6060 gttccacttt gttgattctc tctctaacat taaaaatatt aaaatccgaa tgcctacaat 6120 tacgtgtaca ccggctcctc acggtcttgt tcagacgagc atacattacg ctctcgaaca 6180 taaaatcgag acaatgcctt gcattgagaa ccatcgggaa cacccaacgt atcgggcact 6240 ccctcacacc tctattggcg agcaagagtc ggaacttgag gacgtcgact acgacattcc 6300 caacccggag ttttcggata gcatccccca atacgccgat cctgacacca caacgcctga 6360 tcaactgacc gatcacgccg aggctaacct acctgtaggt cgctccaggg attatatccc 6420 acggaaagcc aaagccccgt acattaacta tgtacacact ctggttgtca cgtctccgtc 6480 gtcctcctcc cccccggag 6499 // ID CER8-I_CE repbase; DNA; CEL; 17003 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER8-I_CE is an internal portion of the CER8_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER8-I_CE; CER8-LTR_CE; KW CER8_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER8_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 12000..12212 CC /note="predicted Gag region" CC gene 15491..16072 CC /note="predicted Integrase region" CC gene 13494..13865 CC /note="predicted Reverse Transcriptase region" CC gene 14451..14852 CC /note="predicted RNase region" CC gene 12624..12950 CC /note="predicted Protease region". XX SQ Sequence 17003 BP; 5060 A; 4019 C; 3724 G; 4200 T; 0 other; tgatgcatcg accggtttcg aagctttgct tcgaaattgg aaacgaatga acctgttggg 60 aagagagaac tggttgccac gagcgaagtt agagaaggca ggtggataaa ccaccttttg 120 ttgacccgaa ctaccttagc tcggagatta ctccgtcttt ccctggctag acacccccgg 180 agctcactga cagccccgga gcaccatttc ggttctccgg aggacggtcg gaaacccccg 240 gagtgggaca gtcgtcagga gaaggatgga gcaacccaga aaacatctaa attacgtaga 300 gacctcaatt tctctactcg tcttctacca tcgatctatg gaggattttc gggacgtcac 360 tggcagtgtt tacatcatct gcgtcaggaa gcttctgctg aaggactacg accgtcgggc 420 tgccgctgat gtttggattc ctattgtcct caacttcgcg gaaatcgatt ttatcgagcc 480 actgtgccca aagaacgttt tctgcgaaaa ggacaaagga gtcacgccgt cactgccata 540 actggtcgtg tctacatcat ctacgtcagg aagcttctgc tgaaggacta cgaccgtcgg 600 gctgccgctg atgtttggat tcctcttgtc ctaaacttct tggaaatcga ttttatcgga 660 ccactgtgcc cgaagaacgt tttctgtgaa taggacagag gagtcacgct gtcactgcca 720 tcactggtcg tgtctacatc atctgcgtca ggaagctact gctgaaggac cgcgaccgtc 780 ggactgccgc ttggactgct gcaaatgctg aaagaagctg atgtttggat ccctcttgcc 840 ctaaatcgtc atggaaatcg ggttttatcg ggccactgtg cccgaggaac gttttctgtg 900 agaaggacaa aggagtcacg ccgtcactgc cgccgacacg aatctccatc cgaagatcga 960 ggagaatgtg gaaaaattgt cgttagcgag gaaattgcac tgtcaatttc ccctaatgat 1020 caatacaaat tcgtttctgg aatttgtggc gccaaaggtg actcaacgac aacatccgcg 1080 tcgaaaacgc acttttagag acgttacaat caacaaatgt cggttgagcg tccagaagtg 1140 gcgatcgata cggatgttac gagaaaacca gacagcgacc tcaaattacc agtaaacgag 1200 tagaacacca cctaaaaatt gatcgaaact gagaaaattg ctctagcatt ttccgttcga 1260 cgatttcccc actaaaaaac tctaaattac agactgtgtt ggatccgtgg agaagaagaa 1320 gcgtagagca gtcaaaatct ggaatttaaa tacgcgtggc tgaaggagat tgactgtatc 1380 ggctctgatt cccatccaac ggagccaacc agctctcatc gacatcgcag gagttgcacg 1440 aatctcattc gcacgttgga gaatgctcta cgaaggattt catctcgaga aaatccgata 1500 attcgaattt gcgacgaatt tgctgaaaat ctgcgttttt cccgaagaaa tttgggtgat 1560 ttcggcaaat ttcgatcaaa aatgaagatt gtcgattatt ccgacctgaa aaaattcaaa 1620 aaatccggtt ttccttcgaa acatgctgaa ttctatccaa atccactgaa ccaccactaa 1680 aattcgttga aattgaattt gcgcgtgggt tacggtagca gcgcggtgaa cctcacagat 1740 tgtgcaaggt gaacattaga catggcactt cccctaaaat tattcgaatt tttgggattg 1800 gcctacgttt ttgttcattt tctgacacct tttgatgatt ttccttcgtt tcttttcgta 1860 atttttcgta aaaaattcaa ttttcaatag aattattagt tttattcagc atggtaccga 1920 acaaagcagc cggacaaccg aaaacgagca ctccaccacc cgcgtctgag attccgatgc 1980 caacatccgc gattcccacc aatgtaagca ttttttattg aaaaattatt aaatatttcg 2040 atttaaaaag tgactttcac ctaaaaatat gacgcataat tgacagccgg tgccacagtt 2100 gcccgtgtta gtcaattacc ttaaggaaag ttgaaacgag cacgcagtct agcgattcac 2160 agcaatcgcg ggatgtgatg cttagtttca atatttccaa ccgccttcta taattaatct 2220 gccgaaatct ggttctcgga tcgcgatttt atcgccttcc tgccaaatga gaacctgcgt 2280 aaagagattt cgagcatttt aatttatagc catgacttaa atcgagaaaa agttgaaacg 2340 ggaacacagc ctagcgattc acacgcaatc gcggggcgtg attcttagtt caacatcttt 2400 ccaccacttc cctttcaata aaatgccgaa gtttggttct cggattgtgt gctggcactt 2460 tcctgccaaa tgagtccctg cgtaaagaaa cttcgagcat taccttgata tcaccctttt 2520 gaacgagaat ttaaattcac tttacttatt ttccggattg aactgtaccg aatattgaaa 2580 ctacttatga ctatcaagag ctaaagggaa gatgaaggtg gaagtcacga caaatactta 2640 cttttaacgg ttgaaacatt ccagcctcca aaattctaat aaaaaatttc tctaaatatt 2700 tatacagacg agtgttgttt caggcagtgg gcgaggatac accgactctc caggacgggg 2760 cgcaggcaaa gcacacagct ttgccgccaa ccctgccgga tgatgccgag actatcgtgc 2820 aatcactgcc aacatcatcc gcgaccaccg ccaacgagga aactctagga ggtgattaaa 2880 tattcgaaga aacgagctgg aaagtgcgat aatgtgctga atttggatca atatttttga 2940 aaggagtcta tagtgtgcga caatgaggaa gagtcacgac tcgagaaatc gaggacgatg 3000 gcactatccc tttgccgacc acgtggactc cggacaaaaa tagaagattt agaattcaca 3060 taattatttc attatttcag ctctattgga agcctggagc gctatgcagg tgtcagtcgg 3120 ggagagcacg cagaaggtag agagctgcct cgctacgatg accacccact tgggtgagtc 3180 acgagacgag tcagctgtca ccaacgcggc ccttgacaac atccgcggaa tggttagccg 3240 actggaaggt atagtattgg tttgtcatcg aacttccacc ctaattgtga attcttttag 3300 aaaagattgc tgggagccac gccacgctga tccaggactt ggaccatctg aagagagccg 3360 gagaggatca tcgtgtgacc atcggcaaca tcaacacccg cgtttcgcgc aactcgtgcc 3420 tcgacgtctc tgcacatgcc tactcgatct ttggaaacaa tggtctggtg aacgccaaga 3480 tcattgttga agccacgacc gcgtacggcc gcgagctctc gatcttcacg atcgacacag 3540 ccggctacta ctcgcgtacg ggtttcacaa agacgagtag ggaggcgaga acggagagcc 3600 cgaggatgag caaaatgagg gaagatgttg ttagatgggc gagagggcta ggtacgaaga 3660 gatgctacga gtaccgggca agttggatgg attatttttc catctccaat tcacacaccc 3720 catagtccag aaccagacaa tgatcaaggt atagttgata gtcaaaaaag cctcgaagcc 3780 aatttagttc aataatcttc ctttatagcc gatccaacat ccgcgtctga gagaaacaac 3840 gagttggtga tgtcgacatt ggagggcatc cgagagtcag ttgagtatct tcgctcactt 3900 gagatggttc tcgtagaccg gaaagagaaa gatatggcaa cgagacactg gagggcggaa 3960 cgcgagaaat tcgaggagat gtagtggaag gatcagatcg gcacgcgtgc ttcgactgtg 4020 aagcacgaat gtgagctctc cgcgatcaat gagcaggcga aggccgtcgc gagcgcaagt 4080 tcgcgacttg agactcactt gctctatatg aatcgtgaga aagccgcatc agtggagact 4140 atccgaccaa cacccgcgaa ttcgatcgtc aaagcaccac cagtgcctct caaaaaggaa 4200 gagaaaccgg ttcaatcatc cgcgaaagtc ccgacaatgg gcagagtaag tgcaatttaa 4260 ttccgattca taaaccccaa acatccaact gatttaacca ccttctttcc agttttgcat 4320 cttctgcaaa accaccaccc acgaatccat cgaatgcaat tctcacaagg aatcgactca 4380 acgcgtgagg tgtgcaagaa gaaatggata ttgcgatttg tgcttgttgt cgtacgtgat 4440 ttcggaggat ggacaccaca agggctgcac gggactcaat gtcacctgca agtactgcct 4500 ggcacgcccc gatttcgcct acgatgccga gcacaatgaa gtcttctgcg cagtgcgttt 4560 caagaacacg caaaaaggaa agaaggatgg agcaaacagc acccgcggtt cagctgaacc 4620 cgcagccccg gagcaaccgc acaagaagag gagaaagaga aaggcgacaa gggatgacaa 4680 ggtgggttta agttttattc cggggaaaaa agaacatgtt gagagagaga gtgaggaaaa 4740 agagaacgag cttcataaat tagacgttcc tttacaggag gagagtgctc aatccacctc 4800 cggaacaccc agcaataaga ttgactgggc agtatcagga gatggaaaga gtgagaaggc 4860 ggagcaatag ctctgacctt caatccacta ctaccacccg cgagttgaat gatcttcttt 4920 aatgttgttg ttacttgtta ttaatttgtt attttttaat ttgttaactt aatttgttct 4980 acttatttcc ctttcctacc caatggttcc atcacctgcg tttatctctt aaaaccctca 5040 ccaatcaccc gcgtctttca ctctttctcc ccggtctaat tacctctgtc atagcaggag 5100 ggtctgtaaa atcgctataa actttccccc caataaaacc gttacaaata gcctttctct 5160 ccattcatcg tttattattc acgagttctc taactcaatc attatcttta tttaaaaaat 5220 taaattatta ttatccattt aaaaagtctg agggagaaag gatccagcct atttcgttcc 5280 gagttaccac cactaccaca gagctttccc gtaagactgc tgacgataaa tttgatcaac 5340 aagtacttta gaggaaagcc agatggaaga acctaaaaat acttaaatat taaaaaatta 5400 aaacaaaaat ctccagatcc aaatttcgtt ctcatgcatt ctttggactg aagtgcatcc 5460 taaatcaaca taaaaataaa aaataaggtt gtagaggttg tgaaggttgt gacaaggaca 5520 ctgtgccgtt acgatctttg cacctcttgc tccttttgat cagtttaata gtccatcgac 5580 gattgaatcg gaactgaccg ggatgattcg cgaggagatg tgaaccaaaa aattatttgt 5640 tttgcatgac acagctgaac tcgagcgctt taagttacgg ttaggatttg aaactagttg 5700 gttataccac gtgatcgatt gtctagagtc cagtggcagc atgactccga ctattaaaca 5760 agacatccgt gctgactgtc acattagaaa acggccgttc tagacagtac agaacataat 5820 acaactagtg aacaattctt tccgtttcct tcccttacca cacattgtag gtatatgtga 5880 cccgagtgag agaaggaaca gcagttgaaa tacagataaa ttttggcaaa cagccttgca 5940 gaaccccagt cattgatccc ttttttcttc agaacaaagt gtgaaactga gtgaggactt 6000 ttatgcgctt gtgtgtgaga gattatctac ttagagttgg gtcacgtaat tagaatttgt 6060 ttttaaaaaa aaattattac tcaaaaatta tttttccgtt ttatttttta attttctcgt 6120 aaccttttta aattcttatc aaaaaatttc atcttgccac gtttttcagt ccatataagt 6180 tctaatctct ggcattaaat caatacttaa aaatatcttc ggctctttcg gttgtgtttt 6240 cttcttctcc agttgctccg caaacagtaa gtgatatagc tacaggtttc cttccaactc 6300 tctaaattgt aattaagttt atcattatag aagagtatgg aagacaaccc tgactataga 6360 ccacatgaag ttgagagata tccggttgat ttgccacgac aacatccgcg tccaggcatt 6420 ctagacattc gttgcgaatt caccagcatt aaggagtttt tcggggaaga gccacctttc 6480 aatgagcgga ctgtactacg gttcgcggaa tatgtcaggg gttactatcc tgatcatcaa 6540 cccacacatg tcatggatcg aggcggtcta gccctgtaca tttggataga tcgtatgcca 6600 acacccgcga cccatctcca acaaaccttc ttacgcttgc catttctgga ggaggtggaa 6660 aacctttcga gattcgtgct tcgacatgcg agagcacaaa acgagcatct ggagggtgaa 6720 cgtacaaatc gctgttcagt ttgtggaagt cacttattgg gcatgccttt ggcggatcac 6780 agctacattc ggtgccccct agatgcgttg gaaggagaac gtcgactgga gttcttagct 6840 ataaaccatg cagcattttg tagcaagtgc aacagccgtt ttgctaaaca cacaggttgc 6900 accggaccag tttgccactc ttgtgatcaa agaggtcaca cagctgctac tagactgtgc 6960 gattcacgac tcggaaggct cacaccggaa agccaacaag aattactcac cgtggtcact 7020 cagcatcgaa gaaactatct acttcacatc cgaaacctga ctaatgatat taacaagcca 7080 ctgcgctacc gttcatacac tgatcaggct tacatctttc ttgaaagacg ggtggaagat 7140 agatggttga tcgatggatg gggattctac catgatgacg ccggtgaatt tccacaacca 7200 gctatgcaaa gatactcgaa tcatccgcat cgttctgttc gctacaaagg actggtccca 7260 cctgagtacc attctcaggc ggtgaacccg atcccaacat ttccgccaca tgaagttgag 7320 tatctgcaag ctgtgcatga ggttatcatt gagtttcatc gctctggaga ggtaataggc 7380 aaatcagtag attgacatta cggaaatttt tagcttcaca tggagcgaat tcttccgaat 7440 gccgcaatcg cagacgaacc agttgccatc ccagttgagg ttgatccacc acccgcggtt 7500 attgttgaag agggaattgc catccatccc gttgagaatg tagatcctct tgaaatgcct 7560 gtaaatgaga ttcacgagag aaatgttgtg agtcttacga catcatagca ataatccttc 7620 aataccataa cagtcattcc ttcagctcgg agccaatcca aacccaataa ttacaaaaat 7680 ttcttttctt ttcctttaat aattatttta aaaaaaactt aaatacttta aaaattcaca 7740 tttcaggctg attctgaaga agacggtgac gatgatctga ttctacagga tccggtgaat 7800 cccacacggg agtagctcga cctgcaagaa ctgatgatag aaacccgaat caacggagaa 7860 gcctacatac attgtcagga gccgttgacg gttaataacg atacagccag accagcgttc 7920 cagcacaaag cctttgaaac actcctacaa catccgctcc ccgaaggcaa aaacgcttta 7980 actctccgtg tgcgggcgct acaagttata ctatgtggca gatccggtac ggagtgggaa 8040 gtaagcccag aggtgtcagc ggctctcctt cgacgttacg ccaaggactt gatcctttac 8100 ggacaagcct tgaagaacga agaaggtctg atagttcaaa tggaagagtt tctcagtgca 8160 actgtactgc gcaaggagcg ttatggatcg gatatcgtat atataccgtc aatgagcctc 8220 tgggcaaaag cagaagttcg tcaatcgctt actcgcctct ttgatccaca tccatttggt 8280 gatgagacaa cacggcgtct tgatcgggtt gggcacaacc atcgttattg gtttggactc 8340 ccaagtaatc aaggagctct gcatgaagac ttacaagtcg cagaggagga aaggtgaccc 8400 cttcccacgt gtccatgtaa tgaagattat ttatctttat tacaggctca gtgcgcagct 8460 tgtaccaacc aaagctcgta tctgggaatc tccactacat gacattggac agttgagaaa 8520 tgccctcgaa atcccagttc ccacggataa ccaggctgtg cgtcttcgta ttggttggat 8580 gattcatctg ctaacttgtc aactccctcc tgaaacggtc aaccagggtt cgtacgttga 8640 agtactggac tacttgaaga tcttacgggc ggcgacaaaa ctcgttctca gcatccgcga 8700 agaagcctct accgtaagga ttgagctctc gacctgtatg agacatatgg ccacatcatc 8760 cgcgagaggt cacaatttgg tactacccac ttttggactt ttcgtgaagt atgcgatggg 8820 gaaatgggag tcgtggctga atggttgttg ggatcacgtc tacgatgaga tcacgctcag 8880 acacgttgag tcatgcccat gccacgtgga tgaagcaatg ttccgttacg aatattaatt 8940 tgaacctttc caacaaccgc gtatctgatg acaacatctg cgtttccatc aataaaaatt 9000 aatttattaa aaatacgttg aaatacgatc acccactcat aagttagaat aaatcctcac 9060 gcgcgcgaaa aaagcttcat ctcaacatcc gcactcacct taacttttca ataaaacaaa 9120 aaaatcgcct cgctccagcc caccaacacc aaatgcattc cggaatgcgt aggagcccga 9180 ggccttataa tcctgataga gcttgactcc aaagagaagc ctgctctgtc ggattccacc 9240 acctagaaat tccaagagaa tttcaaccgc ctagaaattc caagagaatt tcaaccacct 9300 ggaaattcca agagaatttc aaccacctgg aaattccaag agaatttcaa ccacctggaa 9360 atcccaagag aatttcaacc acctgcaaat tccaagagaa tttcaaccgc ctagaaattc 9420 caagagaatt tcaaccacct ggaaatccca agagaatttc aaccacctgc aaattccaag 9480 agaatttcaa ccgcctagaa attccaagag aatttcaacc acctgcaaat tccaagagag 9540 tttcaaccac caagaaattc caagaggatt tcaccaaaaa caaaaaaaaa aaaaaccaaa 9600 atcaaatcaa aaaattccct caaaaattcc ctcaaattcc gactttagtt ttccctggtc 9660 cttctttatt tttctctttt ttcaattcat attgtctaat agttaataat taaggttact 9720 ggattattat tgtcctcggg caattaccct cgatttaaat tatggcgtcc ttgaacaatg 9780 gcccaatata cccggaccgt agcgaccacc cgtcacccgg gcaggacgca cggatcccaa 9840 catcaaccgc gactgaaaga gtgaccgcag gtcggacgaa aaagagaact acttctctcg 9900 cgatccgtct tgtaggacct ctgaaacgtc gcatcaacca aaaaattgag atagctaagc 9960 gtgtcttagc tgaatcggaa gcgaagatga aaatcttccc gaccatgccg cctcaccatc 10020 ttatcagcga tgaggacgca acttacctcg atgcactact aattcggctt cagaccgtat 10080 tgggggcact cgagggaatg cgcgacttca ttgacgataa gtttggcgat ccggaaatgg 10140 cggaaaaccc gaatcagcaa gcatactatc aggaagttat cgagctactc gagaagtcca 10200 aagcatcttc ctttgcgaat caactgagac gcgaaatcga agtgctgcgg atggaaatga 10260 gagtgagaaa catcccgatc tcatcgtttg acgtctctac actggaagcc acggacgcgg 10320 aagcaggagc tgtgactgac gaagatgaag ccgatgcgga gatggagatg gtcgaatcga 10380 cggtggaaga tcaagctcac aggagctcag gtctacatcg tacggtaacc aacctccgta 10440 ctctgcacgg agcctcaaac tcgccagcag ggactccact ttcgaatcgt tcggggcttt 10500 catgggatga tggcgcgatg actcttcacg acgagttacg atactcgtac atgatgggcc 10560 gcgtcgagtc tcaagtagat cagtcaacac ccgcgtcaaa tcgaggaaat ggtactcgtg 10620 catcgactga gcaacaacaa ctacatggta catcatcaac gctccagggg gtggtccaag 10680 ggaaacgtga gcagatcgaa cagggcccgc gtgacacccc cattctcagg aatgatgtcc 10740 ccacgactaa gggaccaggt acgcttacaa cgaccgagta tgctgggcca gtcttgccga 10800 cacgtagaga cagagatctc acgatgatcc atgcaagtgc tggaactcat acagagtcaa 10860 tcttccaccc aactccagga gaagcacctc ttcaggcgca ggtcggagat cctctgtcga 10920 gacagatggt gtcggcactt gaaaaaatca ttcaaggtca gaaccatttg cagtcagaag 10980 ttcattcctg tcgcgacttg atgaatgggc tcactgaacg agtgggagaa ctggaagcac 11040 caaaaattgt ctctaaaata aaaaacaacc aaaaaatagt cgacgatagt caatctgacc 11100 aaagttccga tgaggacgag ggactgccac caaagactcg gagaaatatt cgatatgtcg 11160 aggtgaagaa gtcaagttct ttggaaaccg aaggcatttt gaaatatcta gacaaattcg 11220 acgggtctac aaatctgaaa acatatctca ctgattttaa cgagtcggtg atgaaaaaca 11280 gaaatctgaa acccaccaat aaatttatga tcttaaaaac gcacctgttg ggcaaagcac 11340 gcgactgtat ctcgaacagc agtgttccgg ctgaggcaat tgagaaaacc atcggatcac 11400 tgaattcgat ttacgggaaa gatgaaaaca aaactagtct tctagctaag atccacgtcg 11460 tcggtttctc acagtcggat gtaaaggaga tgagaagagc agtggtacga cattccaatc 11520 tggccgatca gttgacactc accggtattg cagagaatga tgagaggacg tttacgcctc 11580 tcacgtctcg actccctcaa gcaatccgta ctcgagtcac tcaatactgg ggtcagaaag 11640 gagaaaacgc cacgttccac gagatctttg actttgtcaa caaatgcatt gatgacatag 11700 caagagaatc ggtgctagct ctcaaacatc tcccctacac cggccaaaat gaaactagcg 11760 gacatggaat cgagctcccg tatgaggtta ctctcaatgt agccgaagcc taccaggatc 11820 taggaaactt gaagaagtcc aagtacttgg cggatagccc tgtgtacaaa ccagcggacc 11880 atccagacaa gtattatgat acttcttcag gtgagttgtt ggaaggatgg tatgcaccgg 11940 gaaaaggtcc aatcatgaac cttctcccga agacatttcc aattcaaggc gaaatcccta 12000 agaacccatg tcgagcttgc ggagaacgac actccagtat aagatgtacg ctctccagca 12060 aggcattcag acaggcgcta cgggatgcca agctatgtcc aatttgcacg ggtacacacg 12120 gtattgtaaa atgcaaatgc aagctggtat gtgttctctg ccaaggtatg catcacacgg 12180 gtggatgcac aaaaagagag tattatcgag acatgaacaa ctacccagat gacgcgcctc 12240 ccgttcagca gtttttcgag ctagccggga aggccggcaa tcgggaacca accagcaata 12300 ggcaggcctg ggcaggcacg gggacaacac catcgtaccc cgttccccag gtcgcattgc 12360 aaccgccctc tcaaatatca tgtccgattt ctcttccaac accacccgcg cattcacccg 12420 acagtaacac ttttctcact ttccttccaa caacagatcc gcaacaagtc ccaacaaaac 12480 cagatgtaga acaattcgca gccgccattt cccgagcaaa accagcatac cagctggcgg 12540 cgatcgcaga aaagtcggta cgtgaccacg gactttcctt tttatgtcta cgcactaagg 12600 agaacgattg catcccgact cttgtagatt ctggagcatc gatatccctt cttgctcacg 12660 actgcgctga aaagctaaaa atgaaaatcc tggcaacgac gaaggtgtat ctatctggtt 12720 ttaacggtgc caccactcaa atgtctcacg tgttcaaact gcagctaagg aaaagagctt 12780 cacagtcaaa gctcacctta atgctagcag gaattgagca catgccaaaa agcccttaca 12840 aagcgccagt tttctccgag gaagatcgaa catacatgga gaataacaac atcaatcatt 12900 ctgagatcac ttcaaccgct aaacttgacg gtagaaggat cgatatggtt cttggtaatg 12960 acctgctagc atggcttaac gctaaccagg atactaagaa acatattctt ccatctggga 13020 ggctcgttga aattacggat ctaggataca ttgttcatcc agttccgaat tacacggttt 13080 accaaaacca gcttacctat atggatatca gaaatgatgg aactctaatg catgttacga 13140 ctttactcga tcatcaagca cccgaagatt ggaattcagc gttgactctt caagtagaac 13200 aacagtggag attagagaac atcggaattg aagaaccatc ttcacccgat tcttccaaaa 13260 aatcatccgc gaaagatcta caggcctcat tcgaaaatac cctcagatac aactctgatg 13320 gcattctaga agttgccttc ccactgaacg ggaatgaagt gcgcctgaaa gacaattacg 13380 gggtagcctt gagaagactg cacgctactc taaacgctct aaattcatca aaagatccca 13440 ctttattagc tcaatatcat caaattttca aaactcaaga agagacaggc atcatcgagc 13500 cagtgactca gaccatgaaa gatatgagca agtattcctt tatgttaccc catagaccag 13560 tgatcaaaga gtcttctaat accaccaaag tgcgggtcgt ctacgacgca tccagtcacg 13620 cggcaggtca actgagtctc aacgacgtgg tacatgcggg agctaatatg gtaactccgc 13680 ttttcggtat tcttctacgt tcacgtttca taagattcat gatcgtaggg gatttggaaa 13740 aggcctttca tcagatccac gtccaaccgg aattcaggaa cttgactcaa tttctctggc 13800 tgaaggacat tgacaagccg gttacgatgg acaacatctg cgttaaacga tttgtgcggc 13860 taccttttgg catgagctgt agtccaaatc tattgggctc ggcaattgta cattttctag 13920 cgcaaaatcc ggatgagttg aacgctgata ttctagataa cttatacgtc gacaatattc 13980 tgatcggagt agacgatcca gcgctgatta tgaacagaat tactcgtttg aaagaaatct 14040 tcaataaaat gaagatgaat attcgagagt tggaggctat cgacccaaaa gacagagcac 14100 ccgacacgaa aattaagttt ctgggatata agtgggattc ttgtccaacg acagatacgt 14160 atatcatcaa aatcgccgat gttgacctgt ttcaccccac taaaagagat gttgcatcta 14220 agatggctga aacctttgac ccattaggac aagtatcccc gatacaagtc ccgatgaaaa 14280 ggcttatcaa aaagctatgg agcaacgatg taaactggaa agatctaatc ccggagcacc 14340 tgataggaga ctggagagca attcaagccg cgtttaacag tccgaccgtg acagtgcctc 14400 gaaggctgac aacggatttc aattacaaag agattcagtt gttggttttc tcggacgctt 14460 cacaggatat ttatgctgcg gctgtctatg ctcacttctc ttacgaagga ctaccgccag 14520 ttactacatt gcttacaagc aaaaataaga tcaagccatc aaaagatgac aaatggacta 14580 ttccgaaatt agaactgtta ggaattgaga ttggttcaaa tctggcctcg actataatgt 14640 cggagctccg ttgtaaagtt tcaaaaatcc gcctatttac tgatagctcg tgcgccttgt 14700 actggatttt gtctaaaaag aacacacgtg tttgggttgc aaatcgaatt gaccaaattc 14760 atctcaacga gacccgtatg agagaatgcg gcatcgatac cacgatccat cactgtccaa 14820 cttctgacaa tcccgcagac cttgcgacaa ggggaatgag cacgtctgat ctccaggcca 14880 gtaaattgtg gttcaacgga ccggagttcc ttcagaagga cccggaagat tggccttgca 14940 aaatcgaaga cacggtgact tgtccagttg aattcgcaga cttggtttat tcagaaatga 15000 tcgatccgtc gacgaaaaaa ctgaagaagc cactgatgga aaaagctaaa aagccatccg 15060 cgaccgctac ggttttgcat attctcaaat taccttccag ttttgaatca atagttccat 15120 ttaagtacac caactctcta cgcaaattaa tgttaattgt ttatcggatt ctcacgttca 15180 tcagcaaaac tatcccagtc agagcatcag acaccggagt gttacggaga ttcaatgcag 15240 cgcccaatct tctagagaag agaagggtag cgagacattt catcatccgc gaacactacc 15300 aagagtgcga agagctcggc ttgatctttc cgccaagctt gaaatattac gtagcatcag 15360 acggcctata tcgtgttgtg aaacaggcaa ggtcgccagc tctgcctgca gaagcgaacg 15420 aaccgatcct aattcacccg aaacatccac tcgctgagct aatcatgcga gaaactcacg 15480 aaatcaatga acagaacaat acgctagggc agctctgcga accaaatact ggattccgaa 15540 ttactattcg gtagccaaga ctgtaatcag taaatgtgtt ccgtgcaaga aggtccacgg 15600 attgccgtat ccttacccgc actcgaaaac attaccaaaa agcagaacag agccgtccag 15660 tccgttccag agagttggtc ttgattactt gggtccgctt gaatatctca aagatgatgg 15720 agtttctctc ggtaaagcct atgtgctcgt atattcttgt tttgtcacca gagcagttgt 15780 actaagagtg gtgccggacg gcagtacgga aaaattcatt atggcgctaa agactatttt 15840 tcatcaagtt ggcgtgccac aaatggttta ttctgataat gctccgacat tcattctagg 15900 tggaactgta ctcaacactg atatctcttc gtgggaagat cgtaatgagt ccctcacttc 15960 tttcatggtc gccagatcaa tccacttttt tcgcattact acgttgtctc catggcaagg 16020 aggaatgtac gaaagaattg taggattggt aaaacaccag atgcacaaag tatgtgaggc 16080 gaacaaattc gactactaca cgttgagtta cgtcgtttct ggggcacagg ctatggtcaa 16140 caaccgcccc cttactcagc acgcccgaag cccagatgac atgattgcca ttagaccatg 16200 tgacttcctg aaccctggag taatgattga gacgccacct actgagttca caccatccgc 16260 gcaaaacgga ggtcctgaac aaagagtccg cgctcatctt gctcgtctcg aagaatccct 16320 tgaaatactg tggaagtact ggtcactagg ctatttgatc aatctgagag ctagctatca 16380 caaaaatgta aagtgttcca ctcttacacc aagagtagga caggtagtgc tggtgaacac 16440 aaatcttgtg cgacgacata aatggcctct aggtgtaatt gtgcaagtgc acgagtcaaa 16500 gagagaggac gaaattcgta cggcagttgt taggtgcaaa gggaagttgt acaaaaggtc 16560 tgtgtgccaa ctcattcctc tcgaagtcgc gtcctcggat gacgacagtt tccctgaaaa 16620 cactaaccag ggagatgata aagagagagt gacggatgca agaaagatcg ttcaacatcc 16680 gtcgactacc tcactgccaa agcctgcagc tgccctcttt gatcctcgta atgcacatta 16740 ctctcctgag ctgtttcctc gatcgacact ccctaacctt gctgaaacgt ctgagaatac 16800 cgatgaccta aatacccaaa atttaaatat ccccacaata ccaaatccaa attcaattcc 16860 gaagaagcga ctggagggac tcgagactat cggcgaggaa acgattttct cgaaaacatc 16920 gacttagact taccgaacga agagtatgat gaagcagtag tcagtcagga agcagaagct 16980 gctgactact ttcaaccccg gag 17003 // ID PALTTAA1_CE repbase; DNA; CEL; 596 BP. XX AC . XX DT 07-SEP-1999 (Rel. 4.08, Created) DT 07-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE PALTTAA1_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; PALTTAA1_CE; KW TIRs; TTAA target site; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-596 RA Kapitonov V.V. and Jurka J.; RT "PALTTAA1_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC PALTTAA1_CE is a putative non-autonomous DNA transposon; it CC forms almost perfect 298 bp-long palindrome and generates, CC putatively, duplications of TTAA target site upon its integration CC in the genome. CC The C.elegans genome contains about 100 copies of PALTTAA1_CE. CC PALTTAA1_CE copies are ~90% identical to the consensus CC sequence. There are several minor subfamilies related to CC PALTTAA1_CE CC in the C.elegans genome. XX SQ Sequence 596 BP; 206 A; 91 C; 90 G; 209 T; 0 other; ttaaaggtgg tgtagtcgaa ttttttttat tgctttatta gactcaaaat tgtctgaaaa 60 caccgaattt cataatgaaa cttcttgaaa acttctcaaa aaaaaagtta tgacggctca 120 aaaaatggcc taaaattagt taaaatttga aatttgaccg acttgtcaat gtcgcagcgg 180 ctggaaacta ttttttttga aatcaccgtc aaattttgag tatacaagtt aattatcttg 240 cgttttcaac ttgatttagg tattttaaag tcgatggacg gcgagatttt tttaattttt 300 taaaaccaaa tctcgccgtc catcgacttt aaaataccta aatcaagttg aaaacgcaag 360 ataattaaat tgtatactca aaatttgacg gtgatttcaa aaaaaaattg tttccagccg 420 ctgcgacatt gacaagtcgg tcaaatttca aattttaact aattttaggc cattttttga 480 gccgtcataa cttttttttg agaagttttc aagaagtttc attatgaaat tcggtgtttt 540 cagacaattt tgagtctaat aaagcaataa aaaaaattcg actacaccac ctttaa 596 // ID NDNAX3_CE repbase; DNA; CEL; 1591 BP. XX AC . XX DT 06-APR-1999 (Rel. 4.03, Created) DT 25-MAY-2001 (Rel. 6.04, Last updated, Version 2) XX DE NDNAX3_CE is a family of non-autonomous Helitron DNA transposons DE - a consensus. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW HELITRON class; NDNAX3_CE; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1591 RA Kapitonov V.V. and Jurka J.; RT "NDNAX3_CE."; RL Direct Submission to Repbase Update (APR-2000). XX RN [2] RP 1-1591 RA Kapitonov V.V. and Jurka J.; RT "Direct submission."; RL Direct Submission to Repbase Update (MAY-2001). XX RN [3] RA Kapitonov V.V. and Jurka J.; RT "Rolling-circle transposons in eukaryotes."; RL Proc Natl Acad Sci USA 98(15), 8714-8719 (2001). XX DR [2] (Consensus) XX CC NDNAX3_CE is a non-autonomous DNA transposon. CC There are about 100 copies of NDNAX3_CE in the genome; CC they are ~95% identical to the consensus sequence. XX SQ Sequence 1591 BP; 475 A; 313 C; 282 G; 521 T; 0 other; tctgccaagc agcctctaat aaatgccgag tgtacagact ccgccccttt ttcgcgtttt 60 ttcgcgtttt ttttggcttg cgaaaaaatg tcacacttcg ttttcattaa taactccaga 120 gccgtgtgac ttcatgctct aatatttgaa atcctactag aaatattttg ccccacggtg 180 gagcaaaaaa cggcactaag ttcggagcaa aattgagccc agggcggctt tccccagttt 240 gaaaaaattt tttgttgctc tttttacccc caaaaaagca tgttttaatt gaattaaaat 300 tcgggttttg ctcggttttg ttccaaattt atgtgcaaga tactactcag agagactgat 360 tttttgaaaa aaagtttgag ttcataagtg caaaaggaaa aaagttgtga taaaacaaaa 420 ggccaaaaaa tgacattttg ccaaaaaaaa atttttttcc gaaaaagtag tttttcgtct 480 ttatctcaag ttctacttca tcttttttga tatttttttt gtttacccca cgtaaaaaag 540 tacgctgaac acgattttta actcagaatt gagaaaagtt ctatgagtcg gccgagcaag 600 acgaataagt gccaaatttt gcacactttc ccattttcgc gaatctactt tttcaatcat 660 aactcggtca gttttcaagt tttcttagtt ttccaaaaat tgacgtgtag gtctcatcaa 720 gacgcatcga gacatataaa atttgtaaaa agttcagtgg gaaaattttt caagaaaaaa 780 ataattcaaa aatttagtac tggggggagt ggtttcctgg gtctcctgaa cattttttcc 840 actaaagttt tgcggaatgt atttttgatt catagttttt gattttattt aatggaagat 900 gaagtttcgt cgcattagaa atactctgcc aagagatatt atttttatta attttcaggc 960 caaaacttgt tttttttcta tacatctgtc tgtgtacaag atatagccct caactgtacg 1020 tctccttttc gggaaatcaa taatgagtac aatcctgtaa taaaattttc aaaaattcgt 1080 tttcccgctg acatatcagt aataaacggt tttcgaacgt cttttctcaa attgtaatta 1140 ttgaagaaag cggaatcgaa agacgactcc ctccagaaat ttttcttccc tacatcccag 1200 tttgaggtgt ctcccgcatg gccgagtggt tagtgcgtat gactgcagaa attttggtag 1260 ttggttcaat tctaccaagt gacaaatttt tttgtttttt tgtgagtaat cgggattcga 1320 atagaaatac tcgattttat acaaaatatg ggtacgtctc ccctaatata aagcctcctc 1380 aaacttcaat caatcaaaac tcagctttaa ctgattcatc gggtgcatct gtttttggtc 1440 aaaccgaact caaaaatcaa gaagactccc attgacctac tgaaattttc acgaaccctc 1500 taacaggttc gaggcgaagc cgagaaccta cgcccggcaa ggggactcgc cttacacccc 1560 ctacaactgg cgggcccgca gggcccgcta g 1591 // ID PAL8C_5 repbase; DNA; CEL; 201 BP. XX AC . XX DT 09-JUN-2000 (Rel. 5.05, Created) DT 09-JUN-2000 (Rel. 5.05, Last updated, Version 1) XX DE PAL8C_5 is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; PAL8C_5; KW TIRs; nonautonomous DNA transposon; putative HAT superfamily. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-201 RA Kapitonov V.V. and Jurka J.; RT "PAL8C_5."; RL Direct Submission to Repbase Update (JUN-2000). XX DR [1] (Consensus) XX CC PAL8C_5 is a putative non-autonomous DNA transposon. CC It generates 8-bp duplications of target sites upon CC integration in the genome. PAL8C_5 perfect 35-bp CC terminal inverted repeats. CC There are ~10 copies of PAL8C_5 in the genome; CC they are 95% identical to the consensus sequence. XX SQ Sequence 201 BP; 71 A; 34 C; 27 G; 69 T; 0 other; ggtacttttc cgatttctgc cccccaaaat gttttttatt caatatttca atttaaattc 60 tcacgtgaat gtttatttca atactatttc aatttttagg cttagaaaac aacaatacta 120 agcctgaaat tacaaaaaag ttcacgtttc attaatgaaa atatcgaaaa cattttgggg 180 ggcagaaatc ggaaaagtac c 201 // ID CELE12B repbase; DNA; CEL; 171 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE CELE12B nonautonomous Tc2-related DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE12B; TC2; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-171 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification of putative nonautonomous transposable elements RT associated with several transposon families in Caenorhabditis RT elegans."; RL J. Mol. Evol 43(1), 11-18 (1996). XX DR [1] (Consensus) XX SQ Sequence 171 BP; 55 A; 29 C; 42 G; 45 T; 0 other; taccgtattt cctctattag tcttgaatgc aagactaatt ttcaatcgac ccgtaggggt 60 gcaagactaa tagagactgc aagactaatt ttcgattggt ccgtaggggt gcaatactaa 120 taggggtgca agactaatag agactgcgag actaatagag gaaatacggt a 171 // ID HELITRONY4_CE repbase; DNA; CEL; 1855 BP. XX AC . XX DT 25-MAY-2001 (Rel. 6.04, Created) DT 25-MAY-2001 (Rel. 6.04, Last updated, Version 1) XX DE HELITRONY4_CE is a family of non-autonomous Helitron DNA DE transposons - a consensus. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW CERP1; HELICOPY4; HELITRON class; HELITRONY4_CE; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1686-1465 RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and Volpe A.; RT "Molecular and genomic organization of clusters of repetitive DNA RT sequences in Caenorhabditis elegans."; RL J. Mol. Biol 226(1), 159-168 (1992). XX RN [2] RP 1-1855 RA Kapitonov V.V. and Jurka J.; RT "HELITRONY4_CE."; RL Direct Submission to Repbase Update (AUG-2000). XX RN [3] RP 1-1855 RA Kapitonov V.V. and Jurka J.; RT "HELITRONY4_CE."; RL Direct Submission to Repbase Update (MAY-2001). XX RN [4] RA Kapitonov V.V. and Jurka J.; RT "Rolling-circle transposons in eukaryotes."; RL Proc Natl Acad Sci USA 98(15), 8714-8719 (2001). XX DR [3] (Consensus) XX CC There are about 100 copies of HELITRONY4_CE in the C.elegans CC genome. CC They are ~97% identical to the consensus sequence. CC This family is a non-autonomous HELITRON/HELITRON-like DNA CC transposon. CC 210-bp 5'-termini of HELITRONY4_CE and HELITRON2_CE are 92% CC identical. CC 40-bp 3'-termini of these elements are 90% identical. CC CERP1, a satellite-like repeat identified previously [1] is a CC portion of HELITRONY4_CE (position 1686-1465). XX SQ Sequence 1855 BP; 598 A; 266 C; 430 G; 561 T; 0 other; tcattactta tatataaaaa aataattccg tttgttccat agtttgtagt ctatgtagtc 60 tttgtagtct gtgacgtcac acccaaagtc actgagaatt gtgggcgggg tctgtcatcc 120 ttcgtggtga gacccatcgt ggcgagaccc atcgtggtga gacccatcgt ggcgagaccc 180 atcgtggtga gacccatcgt ggcgagaccc gtcgtggcga gacccatcgt ggtgagaccc 240 atcgtggtga gacccatcgt ggcgagaccc atcgtggtga gacccatcgt ggtgagaccc 300 atcgtggtga gacccaccgt ggtgagaccc gtcgtggtaa gacccaaaat tttggcggga 360 aatttaaatt ttctgtgaaa aatattttgg cgggaaattt aaattttctg tgaaaaatat 420 tttggcggga aatttaaatt ttctgtgaaa aatattttgg cgggaaattt aaattttctg 480 agaaaaatat tttggcggga aatttaaatt ttctgtgaaa aatattttgg cgggaaattt 540 aaattttctg tgaaaaatat tttggcggga aatttaaatt ttctgagaaa aatattttgg 600 cgggaaattt aaattttctg tgaaaaatat ttttgcggga aatttaaatt ttctgaaaat 660 tctaaaattc tggaaatcta gaatcttctg gaaatttcga aaaaactaga aaaattggga 720 aaagtctgga atgttccaga actttctaga aaaatcgaga aaattctgga atgttccaga 780 actttctaga aaaatcgaga aaattctgga atgttccaga actttctaga aaaatcgaga 840 aaattctgga atgttccaga actttctaga aaaattggga aaattctgga atgttccaga 900 actttctaga aaaatcggga aaattctgga atgttccaga actttctaga aaaattggga 960 aatttctgga atgttccaga actttctaga aaaatcgaga aaattctgga atgttccaga 1020 actttctaga aaaatcggga aaattctgga atgttccaga actttctaga aaaatcggga 1080 aaattctgga atgttccaga actttctaga aaaattggga aaattctgga atgttccaga 1140 actttctaga aaaattggga aaagtctgga atgttccaga actttctaga aaaatcggga 1200 aaagtctgga atgttccaga actttctaga aaaatcgaga aaattctgga atgttccaga 1260 actttctaga aaaatcgaga aaattctgga atgttccaga actttctaga aaaatcgaga 1320 aaattctgga atgttccaga actttctaga aaaatcgaga aaattctgga atgttccaga 1380 actttctaga aaaatcgaga aaattctgga atgttccgaa aaattgagct tagagcttta 1440 gaagaggtag ttatttggga gttgatgggg gatcaagtca aggtactgta gtggtactat 1500 aggggtactg taggtatacg gtagggttac tgtagttttg gaaaaattgg catttcgtcc 1560 tttgaagagg tattgggtta ggagtcggtg gaggataatg tcaaggtact gtagtggtat 1620 tgtaaggtta ctgtcttggt caaaaagtaa cagaaagttt tcatactgtc tgtgaatttt 1680 tgaaacatgc atgtcgtaga aaaatacata catgtaacag tgccagtaaa ccgcgtttta 1740 agtgtttcaa gtttttttca tgatgagaaa ttttttgagt gattgaaggt atgtgtgtca 1800 aatactttta atggtgccag tcgttgcccg cgccgtaggc gcggtcagcg gctgg 1855 // ID CER4-LTR_CE repbase; DNA; CEL; 162 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER4-LTR_CE is a long terminal repeat of the CER4_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER4-I_CE; CER4-LTR_CE; KW CER4_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER4_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 162 BP; 47 A; 27 C; 35 G; 53 T; 0 other; tgttagggac gtgaagtatg gtcagcctcg ttggaaaccg tgaactatta gaagctacgt 60 ggaaactact ataaatagaa ctctgtgtag tattattgtt tattcgctac tttaccaatc 120 tttacaataa agctctggtt ctttggttgg ctaaaggaca ca 162 // ID CeRep57 repbase; DNA; CEL; 120 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 02-SEP-2008 (Rel. 13.1, Last updated, Version 2) XX DE C. elegans repeat 57. XX KW MSAT; Satellite; Simple Repeat; minisatellite; repeat; C. elegans; KW CeRep57. XX NM CeRep57. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-120 RA Jekosch K.; RT "C. elegans repeat 57."; RL Direct Submission to Repbase Update (25-APR-2002). XX RN [2] RP 1-120 RA Smit A.; RT "Consensus."; RL Direct Submission to Repbase Update (02-SEP-2008). XX DR [2] (Consensus) XX CC Putative novel minisatellite repeat, found 248 times exclusively CC on CC chromosome X in the C.elegans genome. For further analysis CC compare http://www.sanger.ac.uk/Projects/C_elegans/repeats. XX SQ Sequence 120 BP; 21 A; 33 C; 30 G; 36 T; 0 other; tagagcgatt ccttaccctt aaatgggcgc cggtttcgct tagagcgatt ccttaccctt 60 aaatgggcgc cggtttcgct tagagcgatt ccttaccctt aaatgggcgc cggtttcgct 120 // ID CER7-I_CE repbase; DNA; CEL; 9400 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER7-I_CE is an internal portion of the CER7_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER7-I_CE; CER7-LTR_CE; KW CER7_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER7_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 6245..8057 CC /note="predicted Envelope region" CC gene 1759..2076 CC /note="predicted Protease region" CC gene 4483..5102 CC /note="predicted Integrase region" CC gene 1308..1527 CC /note="predicted Gag region (frameshift)" CC primer_bind 1..17 CC /note="predicted Primer Binding Site (Gly-GCC)" CC gene 2566..3224 CC /note="predicted Reverse Transcriptase region". XX SQ Sequence 9400 BP; 3532 A; 2041 C; 1870 G; 1957 T; 0 other; tggtgcatcg accgggaatc gaacccgggc cgcacgcgtg gcaggcgagc attctaccac 60 tgaaccaccg atgctgagac tcaataaagt aactggcgag caatatccgt caggctgtta 120 aaacccactg ccctcaccgt ttatggtgca tcgactaaac gactaaaaca acaaccgcgc 180 ggcaaatcaa aaaagaaaaa aaaggcgagc aagtaagtaa gttcatttga aatttttctt 240 ccaaaagtca aaagtttcca aaaaagaaac ttcaaaaatt tcaaaacaag agaaccatga 300 gtctccaata taagctagct acaaccacaa aagcagtcaa tgccctcatg acagtaaaag 360 cctattgggc cggcaagacc aaggatcttg ccggggcaca attgaccaaa gaccagttag 420 gcgaaatcat agagggaatt gatcaagttg aaagcaaagt ggctaccaca gagagagcca 480 tggaagctct cacagaagca gcagatctgc tagacaatta tgttacggaa acacaaaagt 540 gtaccgagaa aattgatgaa attgttgaga taattgtcgc agcacaagac atcagagatg 600 tcctacatgc gaccctcaaa cgggctcagc aacagagttc agtcacaatg acacgaacaa 660 cagaccaacc ggttcaaatc ccaattgtca catacaattc agggtcacta ccggacataa 720 aaatcccaaa gttcaaaggt gaaagttggg agtttcaaaa cttctgggtt ctctttgaag 780 agctagtcca caaaacggac atgccagaca tggtgaaatt tatcagatta ctgggagctc 840 ttgaaggtga acccaaaacc ttggcaacca agtatcagat cacatctgag aactacgcca 900 aagcagttga ggcacttaaa aacaagtaca ccaactcaga aagcaccatc aatagcttga 960 atgaaaagct gctgaaggaa aaatcgaaaa atgattcgat tcaagctcaa aaagagctat 1020 acgaaaactt ggtcatcata attgatcagt tacgccagtt aggcgaacat ctcgaagaca 1080 gaattttcaa agatcttcta ttgaagaagt tctcagagaa gatcagaaga aaagtctacg 1140 aacaaaaact caaagaccag ggaacatgga ccacagatcg tctaatcaag gacattgaaa 1200 aaatcctgac gatcgaagag gacctgcaaa gtctcatgaa gacgtcccaa aatgagtcgc 1260 aaccaaatcc agcagcaaag tcgaaagaca aatctgcaaa aacgtttgac cgtcagtgtc 1320 tttactgtaa gtcaacaaaa catgctctcc acaagtgtga caaatttgtt acatcaaaca 1380 ttgattaaag ataatcgttg tttaaactgc ctcagcaaag gtcatactgt aggcgagtgt 1440 aaaagcactc gtaaacattg tgcaagctgt gaaggaggca agcatcactt ttctatctgc 1500 aacaaaagag atagaggaga gaaagagaaa gaagtcaaga caaagactac tgactcagca 1560 aagtcggaaa agccaaaagg aacgccgaca agagggcaac agaacgttgc gtgcacaagg 1620 tcaccagctg gcgaggtgac accagacaga tacgaaacag aaaaggtatc aaatgaccaa 1680 gaagtcataa gtctgccaac catctcagca agagcgttca accctgaaaa aagggactgg 1740 accgatctaa caatcatgtt agatacaggg gctgaccaat catatataaa ggcaaacatt 1800 gcaaaagatt ggaagctacc aaaagttgat gaaaaaacac tccgtcttca gacattcgga 1860 agtggaaaag ctcaaccaga gaaaatcttt gaaagatcaa aaattgaact tgaaactgat 1920 ggacaaaaaa tcagcatgat agttctacat tcagagtatc tcacagggaa aatctaaaaa 1980 gctcaaatga caaatgaaga tttgaagttc gctttgacaa agcaactgaa aatcaatcaa 2040 gaatgttttg aaaccagagt tgaaccagac atcattatcg gttgcgatca cattgcggaa 2100 atatggagag gtgacttcat acagttacca tccggaattc tattgatcaa aactgtgttt 2160 ggttacacaa caatggggaa aaacacaaaa tggtccaaac cattggagat ggattcaacc 2220 atctcagtca tgaacacagt cgtgaaagtg caaaacgacg acatcgagtt ccttcagcaa 2280 caggaaacga taatgcacgc tcctaacgag tatacaggag cagtcgtaga tgaaaaactg 2340 gagatggaaa agaaaacaac tcagtttttc aataatacga ttgagaaaag agaaaacggc 2400 taccacgtac gattgccttt caaggaggaa gtcattgata aacttccatc aaacttcgcc 2460 attgcgaaga aaagactaca aagcggtctc aaagcaaacc cacaagtcaa aaaacttgtg 2520 aacgatgtat tcgaagatca aatctcgaaa aacatcttgg aagaagtaga tgtatccaaa 2580 gatacagaag ggatgaggat tcattataat ccacattctc ccgtactgac accgcaaaag 2640 actacgacca aatgtcgcgt agttatcgac ggctcggcac atttcaaaaa cgagccgtca 2700 ctaaacgacg ccatctatca aggccctaca atactgcctg atagtgtgga caccagattc 2760 agatcaggca aaacggttct cttggccgat gtagagaaag cttttctcca ggtccatctg 2820 aacgagagcg acagagacgt gacaagagtc ctatgggtaa aaaaccctga cctgccacca 2880 acaagagaaa acttaagagt attaagattt acaagagtac tctttggact caacgttagt 2940 ccatttctct tgggagcgac cattcttttt catttggaca gaatggaaga taagaagctg 3000 gcaaacacga tagcaagaaa tctttatgtg gataatttga ttatcgcaac agatgatgac 3060 tctgaagcaa tgttcaaact ttacaacaaa gtaaagacag tattcaatgg actctccatg 3120 aatattagag aatttcagag taatgatcag tcattcactg acctactaga tgagtgtgac 3180 aaaacgtcag aaagcgaagt taaagtcctg ggggtaaaat ggtcaaccaa gacagatcaa 3240 ataacttcgt caactactga tattgacata ctcgaaaatt ctagaagaac agtatctagt 3300 gcaatcgcgt caatatatga cccgatgggt ctcctagtcc cgctgcttct tccacttaaa 3360 ttattccaaa gaaaactttg gatcgaagag gagaagtatg gatgggacac accgctaaaa 3420 gagaagcacg aaaaagagta caaatccatc acggataata tccgaggatt ctcaaaagat 3480 ttcaaccgtc acatcataga taaatctggc gagaatgaaa tcatagcatt cgcagacgcc 3540 agtcaggaag caattgcagc ttgtatctat gtaaaaaaca agtatggtgt aaacctcttg 3600 tttggaaagt caaatgtcaa gtctttgaaa gaaaaatgga ccattccaaa actggaagtc 3660 caagcactca aagtagccac tgacagagct ctttcaacac tgaccgctct ccaagatgga 3720 gacatcaaag tcacaaaggt catcctcttt tctgacagtg agattacctt agcatggctc 3780 aggtcagagc cagggaaaaa agaagttggc atactgataa aaaacaggat tgaatcaatc 3840 cgcaaaacca atgagacaat gctccaaaaa ggagtacaag tgttcttcgg atatgtcaac 3900 actcttgaaa acccagccga tctacgcaca agagggcttc caaaagaaga cttccaaaac 3960 agcatatggt ggaaaggacc agagtatgca aagagtgatt cctcaacttg ggaaacacga 4020 caaaaaatgt tcaaaataaa cagaacagaa gcagaaatgg caaacatcaa catttgcaaa 4080 gcaatcgaat cagaaaaatt cgaagcagtg ttcgacatta atcgaacaaa ctctcttcca 4140 aagatgagaa gagtagcagc atatgctttc cgagccataa aaaagatggc cacaaagcta 4200 ccaagaccga gacaacaaaa aatggaaaaa gcaattccag agctgacagt agcagccaca 4260 ggaaatcctc ccattacagc agccgaacac caatgggacg aaaaacggtt aatccgtgaa 4320 caccaagcaa atctcgaaag caaagatttg cgaaaatgga gcaaactcaa tatccaactg 4380 gtggaaggtc tagctcgctg taaaggaaga ctctttcata tgtcaccaga tattgacatc 4440 agtcaaccgg tctttatcat gccacaaagc caattggcaa aacttctcat ctgggaagca 4500 catggaaagt atcataccaa tgaacaacaa accatggaag ctacaagaga gaaggcatgg 4560 ataccttgtc taagacggca ggtgaagaaa atcattggaa agtgcgtcaa atgtcaaaga 4620 tacaaccgag cgccaatgaa atatccaaac atggcagaca tgccgtcatt ccgtgtacgt 4680 cggtccagac ccttcgaaaa cacagggctg gattacttcg gtccaatgac attcaggaaa 4740 gaagatggtt ccacagaatc atgttgggga tgcgtattat catgcgctac cacgagacta 4800 actcacatag agttagtcca acaatgctca acaaaagcat tcatcaatgc aatccgaaga 4860 ttcgtatctg aaagaggaat tcccgacaga attgtcagtg ataacgctcc tcaattctgg 4920 gacaacaaat cctcaacgaa gtatcagcaa gggcttccaa agaaaatgct cttgacaaag 4980 atattctaga atatcttgga aatacatcca ttgaatggag tttcatcact ccgtacgctc 5040 cttggcaagg aggcatgtat gagagaatga tgcgatccat aaaacaagcg atatataagt 5100 caatcggcaa aaacattctc acactcgacg acctcgaaac ggtaatgaaa gaagtctcag 5160 gaataataaa ttccagacct ctcacatacg tcacagaagg tgcaggaaca acaatttgcc 5220 cgaaagattt cataaatcca gaaatgagaa tgacaatccc tcttcaggtc acaaaagaca 5280 tgatggatga atacatccca ccagaggaga taaatctcac aaaacaagag accatagaag 5340 ctcttcaatc ctcagtggca atcattgaag atatctggaa gatatggaac aaaacctatc 5400 tttcagaact gagagaaaca cacaaactca gaatggacaa taagcgtgaa ataaacgatt 5460 ccccgtttgt tggccaggtc gtactcatat gcgatccaaa ccttccccgt aacaactgga 5520 aaatgggaaa aatcacggaa acaaaaccgt ccacagatgg aattctgaga gaagcccatt 5580 tattaacagc aacagggagg acaatcagaa gacccataaa tttgattact ccattagaga 5640 tcgaagagat tcaacatgaa acagacgctc aagacgcgga aaatgtcatg gaggataaca 5700 tggaaaatat tccagaaaag atccaagaag tagtcgcaaa tgggcaaagc gagatcaaca 5760 ctgaaattga tcagccaaat cagaacagaa ggtacaatct tcgaaagaag aaaaccatca 5820 actacgagga gaaagatgat gacaataatc aggcacaaac agtctcaaga tctcaaacat 5880 gggatattgg gagaacaatg ccaaaattat tgttcctctc gctttttgca ttgttgataa 5940 gtccttcgat ggcaacagct ccaatagagt gtacacaact ggcattcgaa tcaacgaaaa 6000 attcgatacc tttgaaactt gcagtgcaaa ttactgtact caaagaaaat gaatagactg 6060 gtcctcaggc aacaccgaga tctggctgcc gccagctatc aaatcggtag accacacagt 6120 cacaacaaaa attctgaaag acaacaaagt ctcaatacat gagacaagtt gtccagcaca 6180 gtccacatgt gacacaatcg attgtgtcat atgttggtcc aatatctcaa atccagagtg 6240 tcaccccaac tatgccatct tgggaatagg aatatttttg tacatcatca tgatgatgat 6300 acattgtcta ttccgggtac cagtgccaat cggagcacca atatcaatgg catggagcgt 6360 gataaagaga acgatctcca ccatttggaa tttgggaaaa gcatgcctaa gaacaagacg 6420 agcacaacga agacacagtt ggtcaacaat aataccaatt ttgtgcatca tcgttacagt 6480 cagctcagga tgtcaggaaa ttgactctct cacacagtca cagaaaatat gttccacgga 6540 tgacggaaca agttattgca aacttgtgac cagacaattc atgaacatta acccatacag 6600 gaaagaatct tgtatcagga tcacaaaaag tgggaaagtg acaaaagaag tgagagttca 6660 attcaaggcc gttcaacttg aatgcatgaa aaccacggtg acgttcacaa aagacgtcca 6720 aaccaaggtt tggagctcca aacgatgccc acacatgggc tcttgcggtg aagacaaatg 6780 taacaacatt tcgttatcat cactactacc agaactcgaa ccaacaaacg gttacgtgga 6840 aaaaacaagg tgtagtgaga gctgcggggg gcctggctgc ggatgtttct atccgtccag 6900 cgggtgcctc ttttaccgca tatacgcagt tcccaccacc aatgactcct tcgagatcat 6960 gaaatgcaac gaatggattg aaaaagtcca cctcaaagtt tcaataacgg agtgggacgg 7020 aaaaaatcca agaatggaga gtcaagaaat caaaattcca ttgaatttcg tgacaacaat 7080 caaagacatc acaatgtcag tgaactccat caacaagcca gaatctccaa tcctggcgac 7140 ttggttcata caaagagcca acaatacagc aatatgggag ccgtataaac accccatgtt 7200 caactgctca accaaagaca gcgaacaaaa taactgcaca ttcacagaga actgtgtatg 7260 cacacccgca gaagatagaa taaactgtgt gtgccaaaca gaaagtcttc agaactatat 7320 gaacagttta gacaaaagac tcccaatcca ggaaaaccac tggaaattac agctaagcga 7380 tgacaaaatt accgcaacca ttgctaatga agtaaccact ggagttacca tgacattaaa 7440 agataatgtt acaatgtccg catggataag caacgaaaag tgctacgtga aagctagtca 7500 tgcttcagga tgctacaaat gtgcaactgg aagctcagca acgctacaat gcttttcgca 7560 aaaaaccgac acgatcggaa acatagtatg cgaagaagat atgttctcta taccttgtac 7620 accagaaggt cacaaacaga acatcacttt cttcacagaa aaggctcagt tcagaaaact 7680 gcgccaaata agctgtggaa aacaaaccca gacctttgag ataacaggca ttctcaaatt 7740 tactggatcc atatggacat cgatatacag aattatcgaa ggaaagtcca ccataacaaa 7800 tgaaatctca tggccagatc agggacactt gtggaccagc tacctttcat tcatgaaaac 7860 attaatgatt acgataatca ttacattagc cgttttcgca gtaactggaa caatagtagc 7920 ttcattcatg ctactcggga tccggaaaac ggtcagattg acagtcttca taatagggct 7980 accaataaaa agcttgtaca aaatttattt tttttgtatc agccggtgtg gaaaaaatcg 8040 gtacaaatcc gaaacaaaaa ccgcttaaga cggtccacat cggcaaaatt caaaaaacaa 8100 aaattcaaaa tgtgcattaa aacaaaaaat aaacatcaat tgttttcaga aactcgtaac 8160 aagctcaaca tgaatccagc agctcaagca gttgaccaac atgatgccat ggacatcgaa 8220 attcttgaag aacttccaca acaagctcca actcgtcaaa acgaagggga caaacaagat 8280 ctccaagaaa aagatcattc tcggagcgat caagctcata aagatcttca gaagcaagga 8340 caaaataaaa ggaacgttgg aaaagataat tttccacaat ggaatcgcca agcaacatat 8400 cctcagaatg aaaaatattc tagaggaaca ggtggaaaca atcaacaagt tcccaccaaa 8460 agaaacgaag gaaattcccg aacaatacat gctgctagaa caagtgaaag aaaaagtcat 8520 gcagcacaag ctcgctgtca cagaagcgag cgatatgatg atattccatc atcatcaatt 8580 aaatgaatgg gcacaaaaag acccaagagc tcgctccaga ctcgagaagc tccaaaagga 8640 gagacaattc aatgtccacc aagcattgaa cacattaagt gccaataaag atgggacatc 8700 atccaaaatg tcactcatct cagataagat gaaagaagca tggacacaaa tccaggctca 8760 catgacacaa agcccggagc ccaatctgcg attggaacac cagattcaaa ttctccaaaa 8820 ggatcaacag gagatgcgat cttcaattcg ccttatcctg gaaactctct ccaaaagaga 8880 ccttcacaat ccgcaaaagc agaacgaagc gatctccatg gaagtgatca atcacgaaga 8940 atcagaacgt cttccaagaa ctgatctctg tcagttcaat cttcggaatt ataaaattcc 9000 gaagatcaaa gagcccgaaa ctcttcaaaa aagtggatca caaccactca aagccaaggg 9060 agaaaatctg tcaaaacaga ttccaaagat tcccaaacaa gggaacaaaa tgctgttcaa 9120 agcaaagttc cacagaagct ttacggttgc agacgaaccg aaaagcccat catgtgctta 9180 ttgcaccaga aaacacctgt ctgacagatg tgggatcatc aaagatccga aagcccgaaa 9240 agctcacctg gagaacaaca atcgatgtct gctctgcctc aagaagaaag aagaactcca 9300 cgagttccag aagaagagat gtcactactg tcacaggaca aatcaccact catcttggtg 9360 tcttcaagct ccaatctcag gagtccacta attgccgggg 9400 // ID CER15-LTR_CE repbase; DNA; CEL; 447 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER15-LTR_CE is a long terminal repeat of the CER15_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER15-I_CE; KW CER15-LTR_CE; CER15_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER15_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 447 BP; 94 A; 119 C; 90 G; 144 T; 0 other; tggcgaacgc ggagaaccaa gaataattct ccttgtcgcc actttgttac ttaaatttat 60 ttccattgat ttacacaatt atatccatta attatcgtct tcttcttcgt tttcacacac 120 caacggactt ggaacgtttg aaccgccgcc aactacagta ctccacctgc acatcagacc 180 ccccacgtcg tgctcgcctg cacatctaac cgccatcgac gcacgccgcc cgggagaatt 240 cttgagtgtg gtcctaaatt ccgtcgccgt ttagtcgttc ttttttattg ctccgtcccc 300 tttattttgg cccgtttctg tgtgatttgt gtgctggttg actaaccagc acacggagat 360 aaacgattat ttagcacata cctgtactgt tgttatttgg gaattacggg ttttgggtgg 420 gccgttttta ggagctatcg ctccgca 447 // ID LINE2H_CE repbase; DNA; CEL; 2978 BP. XX AC . XX DT 16-MAY-2006 (Rel. 11.05, Created) DT 16-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE CR1-class LINE element from C. elegans - consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; LINE2H_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2978 RA Jurka J.; RT "LINE2H_CE: CR1-type LINE element from worm."; RL Repbase Reports 6(5), 252-252 (2006). XX DR [1] (Consensus) XX CC There are several copies in the genome >98% identical to CC consensus. The 5' end is not exactly defined. XX FH Key Location/Qualifiers FT CDS 517..2754 FT /product="LINE2H_CE_1p" FT /translation="MKFHDLAQMVNFPTRFSKLNSGTFLDLIFTNNSGVLQ FT NLKPGTPFINSDHISIFFSLSLPQPTPAHSAPPKSFMNFRKCDFFNLNRYM FT VSYNWPKQLSFFATPDSKLNHFLKIFNELISDFTPQYPTLRVQSPKVTGRK FT LYRKLRKHHPDTESSVLKKRVKSRLKFLKKKNMAFENKLIHSKNSKSLFSY FT IRNRIKPSSSLTHFSVDNNNVDKPIEMAEVLVKEFSKSFVTHDPPFPTFPS FT CESTPSPVDTSPLTIASTILKLKPKIGYSLDNINFFILKKCCDSISIPLSL FT IFKEFITTNSFPSCWKTSIVIPVHKKGNSSDPKNYRPIALTHPLSRLFERI FT VLNPIRKTFSHKISKFQFGFLNNRSCITAHLNASSNYHKILSVPNQFVDII FT LFDFAKAFDSVPHELLLCKLRNFGFDDNICLWFKAFISDRVSKVKIGDSLS FT NSSFKNSSGVLQGTVTGPFLFLVYINDLLDLFPPDVHVIAFADDLKLFSNN FT CQSLRSSIKIIENWCKIWQLKLAENKTKVLHVGKKNPKYKYYVNGQKIECC FT SKACDLGIWVDDQLTYEKHILVKVNAAMFKCRQILKNFRSLNMMFYFKLFN FT TYIRPILEYGCEIYHPKSASLTKKLEQPLRYFSRHVFKKCKTQYDSYEHRL FT NQSNQQSLKHRRVFLILKTFHNIITGKYHFPNLALYLKNSKSPRFPYQMTA FT IGKTNNNSFLHTHLTIWNLVTKHFPVPVSGYTFSKRISCLPLESYV" XX SQ Sequence 2978 BP; 921 A; 660 C; 405 G; 992 T; 0 other; agctggttaa ttgcatgtag gctaatgctc gttccattgc atcgcttgaa aaacttgaat 60 ttgtccataa gttcgtagcc tctaatagtt tagacatctt gtttgttacc gaaacatttc 120 taagtaatgc ttttactgat ttctcatgct cttcacctca ctatacctgc ataagatctg 180 atcggcttga gtgccatcca aaaaatgcgg cggaggtact gctgtgtttt ttcataaaaa 240 atgggctctt acgcctgtct caattgcacc ttccttttat ctctcgcatt tttgtcaaat 300 tctggcatta gaccataagc aatccaaaac tagatttata ctgatatacc gcccccatcc 360 actaccgcaa gtagaaccgc tgagctcttt aaaaatctaa accagcttct cgactgcccc 420 tcctatactt atgtcctcct cggtgatttc aattttgcaa acaccaagtg gaattcaaat 480 tttcttccca cgggctctga gtatttctat aattggatga aatttcatga tctggcacaa 540 atggtaaatt ttccgactcg gttttccaaa ttaaacagtg gcacgtttct tgatctcata 600 tttacaaata attccggtgt tttacaaaat ttaaaaccgg gtactccatt tataaactca 660 gatcatattt ccattttctt ctccttatct ctcccccagc ccactcctgc tcattctgcc 720 cctcccaagt cttttatgaa tttcagaaaa tgtgattttt tcaacctcaa tagatacatg 780 gtgtcataca attggcctaa acagctatca ttctttgcta cacctgattc taagcttaat 840 cattttctca aaatttttaa tgagcttatc tctgatttca ctccccaata tccaacatta 900 agagttcaaa gtccaaaagt tacaggtagg aaattataca gaaaacttcg taagcatcac 960 ccagacactg aatcctcagt tttaaaaaag agagttaaat caagattaaa attcctcaaa 1020 aagaaaaata tggcttttga aaataaatta attcactcta agaactctaa aagtctattc 1080 tcttacatta gaaatcgtat aaagccttcc agttctctca ctcatttctc tgtagacaat 1140 aataatgttg ataaaccgat tgaaatggcg gaagtattag ttaaagagtt ttcaaaaagt 1200 tttgttactc atgatccacc cttcccaact tttccttcct gtgagagcac accgtcccca 1260 gtagatacat cgccattaac tatcgcgagt accattctta aactaaaacc caaaataggt 1320 tattcccttg acaacataaa tttttttatt ttaaaaaagt gctgcgattc catttctatt 1380 cctttatctc taatttttaa agaattcata actacgaatt cattcccatc ctgctggaaa 1440 acctcaattg tgatacctgt acacaaaaag ggcaacagta gtgaccccaa aaactatagg 1500 cccatagcct tgactcatcc attgtccaga ttattcgaaa gaatagtatt aaatcctatc 1560 aggaaaacat tctcccataa gatatccaaa tttcagtttg gattcttaaa caatagatcg 1620 tgtataactg cccacctaaa tgcttcatct aattaccata aaattctttc ggtacccaat 1680 cagtttgttg atattatcct tttcgatttc gctaaggcgt tcgattcggt gccacacgag 1740 cttcttttat gcaaattgag aaacttcggt ttcgatgata acatttgctt gtggtttaaa 1800 gcctttatat cagatcgcgt ctcaaaagtg aaaattggtg attccttgtc aaattcttcc 1860 ttcaagaact cttctggtgt cctacaaggc actgtcactg gaccctttct ttttttggtc 1920 tatataaatg accttcttga tctattcccc cctgatgttc acgttattgc atttgccgac 1980 gacctcaaac ttttcagcaa taactgtcaa agcttgcggt catctataaa aataattgaa 2040 aactggtgta aaatttggca gctcaagcta gccgaaaata aaacaaaggt acttcatgtt 2100 ggtaagaaaa acccaaaata taaatattat gtaaacggac aaaagattga atgttgttct 2160 aaagcctgtg atcttggcat atgggtcgat gatcaattaa cttatgaaaa acacattcta 2220 gtcaaagtaa atgcagcaat gtttaaatgc cgccaaatac tcaaaaactt ccgctctctt 2280 aatatgatgt tctacttcaa attatttaat acttatatta gaccaattct ggaatacgga 2340 tgtgaaattt atcatcctaa atctgcttcc ctgaccaaaa aattagagca gcccctacgt 2400 tacttttcta ggcatgtttt caagaaatgt aaaactcaat acgattcata tgagcatcgc 2460 ctcaatcagt caaaccagca gtccttgaag cacagacgag tcttccttat attgaaaaca 2520 ttccacaaca taataaccgg caaatatcat tttccaaatc tagccctcta tcttaaaaat 2580 tccaaatctc cccggtttcc atatcaaatg actgcaatag gcaaaacaaa taacaacagt 2640 tttctccaca cacatcttac aatatggaat ttagtaacaa aacatttccc ggtgccagtg 2700 tcaggctata cgttctccaa gcgcatttct tgcctccctc ttgaatctta tgtatgatct 2760 catttcctca ctcatttgta catgatgtca tattttttta tttatttttt tattattttc 2820 tcatcaactt ttcttggaaa tcatttgtat catctgacct cggacatgtt ccaatcgtgt 2880 caccttatca ttcacatttc ttacctcatc tcatttttat tcctcatttt gctcttaatt 2940 ttcaaggcaa gaaatagtga ataaataaat taataaat 2978 // ID CELETC2 repbase; DNA; CEL; 446 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE CELETC2 nonautonomous Tc2-related DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; CELETC2; TC2; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification and characterization of putative transposable DNA RT elements in solanaceous plants and Caenorhabditis elegans."; RL Proc. Natl. Acad. Sci. U.S.A 92(19), 8886-8890 (1995). XX DR [1] (Consensus) XX SQ Sequence 446 BP; 139 A; 85 C; 62 G; 160 T; 0 other; taccgtattt cttctattag tatggcctcc ctattagact tgcactccct attagtattg 60 caccttgaac acccttcgaa aattagtatt gcactcccta ttagtgttgc actccctaat 120 agtcttgcgt caactttttt tggtaaaaac ccacatcact aattttgaga ttttaacaac 180 attttttgcc tactttgcaa caatttttgg taaaatttct atgaaattga atttttgtac 240 ctttttgtgg caattacaat caaattttct gaattttcac ataaggttcg agttttttca 300 gaaaaaaata gtaagctcaa agtcagaaaa atgttctgaa atttagtatt gcactcccta 360 atagtcttgc actccctatt agtattgcaa gcgggaagac catcgaaaaa tagtattgca 420 gccatattaa tagaagaaat acgata 446 // ID LR9A repbase; DNA; CEL; 205 BP. XX AC . XX DT 23-AUG-1999 (Rel. 4.07, Created) DT 23-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE C. elegans LR9A repeats (a consensus). XX KW LR9A; a linear repeat. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Belknap R.W.; RT "LR9A."; RL Direct Submission to Repbase Update (AUG-1999). XX RN [2] RP 1-205 RA Jurka J.; RT "LR9A."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [2] (Consensus) XX SQ Sequence 205 BP; 65 A; 36 C; 42 G; 60 T; 2 other; artcgyacta ctcaggggtg ggcggcaatt gccgttttgc cggaaatttt gattttcggc 60 aaattgccga tttgccggat atcaatttgc ctcaaaatag atgtaggaac attcatagga 120 tgcgtacaat tttgccgatt aaaattgaaa ttctgaaatt tccaaaaaaa aatgtgcaaa 180 accacaattt gccgaaaatt ttcgg 205 // ID TC2 repbase; DNA; CEL; 2074 BP. XX AC S88451; X59156; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE C.elegans Tc2 transposon DNA. XX KW Mariner/Tc1; DNA transposon; Transposable Element; TC2; KW Transposon Tc2. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2074 RA Levitt M.A.; RT "Direct submission."; RL Direct Submission to EMBL/GenBank/DDBJ (24-APR-1991). A.M. RL Levitt, NYU Medical School, Dept of Medical and Molecular, RL Parasitology, 341 East 25 Street, New York NY 10010, USA. XX RN [2] RP 1-2074 RA Ruvolo R.V., Hill E.J. and Levitt M.A.; RT "The Tc2 transposon of C.elegans has the structure of a RT self-regulated element."; RL Unpublished. XX RN [3] RP 1-2074 RA Ruvolo V., Hill E.J. and Levitt A.; RT "The Tc2 transposon of Caenorhabditis elegans has the structure RT of a self-regulated element."; RL DNA Cell Biol 11(2), 111-122 (1992). XX DR GenBank; X59156; Positions 1 2074. XX SQ Sequence 2074 BP; 640 A; 402 C; 435 G; 597 T; 0 other; ccgtatattc tctattagtg ctgcgtatct aatagaacgg cgtatctaat agagctgcac 60 ccaacaggca tttccgaaaa ttacagcggc agtctccaat agagcggcag tctctaatag 120 agcggcaccc ttttcgctgc gagacccgcc gctgcttttc agcttatttt ttgaattttt 180 ttcagtttat tttctttttt ttcaacattt ttcaattgtt ttacgtggtt tttttgtttg 240 aaatccttgt attaaaactt tacatgtaat tttgaaggaa aatgccaagc aattcgacaa 300 ctggtgctca aaaatgctcg aggaaagttc acagtctgaa actcaagcta gaagcaattg 360 aatatgcttc gaaattcagc aaaaccaaag cagcaagaga tctgaaggtc agcagacagt 420 gtattcaaga ctgggtggca caaaaaacca aaattgagtc tcagttgtaa gtttgaaaat 480 gctgatttgt gaatgcttag ttatatattt gaattatagg aaatctacaa ataaaacagc 540 tccaaaacgt ttagaaggtg gtggccgacc agtccaacgc gaaaccgtgg acgaaagtgt 600 gcttcgctgg tttcgggatc aacgagagaa aaaattccgt gtgagcagaa atttaatcag 660 acaaaaagct aaagaaattt gtggagagga agacttcaag gtagcaaact gcgtactttg 720 ttaaagaaat tcaactctca ttcaggcgtc gaccggctgg ctccaaaagt tcatgcgtag 780 gcaccacctc agtacacgac gagctactac aacttgttaa aaagagcctg atgaatttgc 840 caagaaaatc gttgattacg ttctttttgt ggaaaaaatg cgactccaga acaagtatac 900 agatactcac atttttgcag cggacgagac tgctgtttat ttggactttt cgaactcgtt 960 gaccttgaac gaaaaaggtt ctcgagaggt gggaaaaact aatactcttt ttttctattt 1020 cagtcaccct atgatgaaac gttgtctaat actctttcag gtgccagtta agacgacagg 1080 tcatgaaaag atgcatgtta cagtgcttct aactgctcga gccgatggat ataagtgccg 1140 tccatatgtg ctccttcctc gtaccagacc tgacaaagaa attgccgcaa gattcaagaa 1200 cagtctgcaa ctgagctggt gtaatcgaac gtttttcaac gattaagtga cagaagatta 1260 cttgaacact atcatcgggc ctcctttttt tggaaaaaga cttttggctt gggacgcatt 1320 cagatgccat atcagtgatg caacaaaacg agtgctcaaa aatttgaaag tggatacagc 1380 agttgttccc ggtggatgca caaagtttgt acaggtatga tcatcatctg aaacggtaca 1440 taaattgttc gcttaggctc ccgacgttta ctggaattct cctttcaaag ctagcattcg 1500 tcgaagttat gaggattgga tggtgcatgg acagaaaagc ttcacaaaaa gcaacaacat 1560 gcgtccgccg tcaatgctcc aatacttgga atggattgaa aacgcatgga atgatcttcc 1620 aaaggatttg atattgaaat cctttaaagg atgcgccttg acaatcgcta cagatggaag 1680 tgaagataat cagattcact gttttgcacc aaacggatcg attccgtcgg gtctggatct 1740 tctcaacagc acgcgtatag aaaattctct cgtcgacctc gtgcaggata ttggaatcga 1800 tgaaagtcaa gaagaaggca acgagtctga caattctatc atttccgtag ataatttttg 1860 ttttttttct gttgtttgtt tttctttgat ttttttttca tttatgaaaa attgttattt 1920 ctactccacc attaattgaa actcctgaaa acaaaaaaaa ctcgttcgaa aattagagcg 1980 gcagtctcca gtagaacggc agtctctaat agagctgcaa tcgagaaggc tctgaaaatt 2040 agagcggcat gcagcactaa tagagaatat acgg 2074 // ID HELITRONY1A_CE repbase; DNA; CEL; 3084 BP. XX AC . XX DT 26-FEB-2001 (Rel. 6.01, Created) DT 25-MAY-2001 (Rel. 6.04, Last updated, Version 2) XX DE HELITRONY1A_CE is a family of non-autonomous Helitron DNA DE transposons - a consensus. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW HELICOPY1; HELITRON class; HELITRONY1A_CE; HELITRONY1_CE; RC35; KW RCA1; RCC9; RCD1; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3084 RA Kapitonov V.V. and Jurka J.; RT "HELITRONY1A_CE."; RL Direct Submission to Repbase Update (FEB-2001). XX RN [2] RP 1-3084 RA Kapitonov V.V. and Jurka J.; RT "Direct submission."; RL Direct Submission to Repbase Update (MAY-2001). XX RN [3] RA Kapitonov V.V. and Jurka J.; RT "Rolling-circle transposons in eukaryotes."; RL Proc Natl Acad Sci USA 98(15), 8714-8719 (2001). XX DR [1] (Consensus) XX CC There are about 50 copies of HELITRONY1A_CE in the C.elegans CC genome. CC They are ~95% identical to the consensus sequence. There are CC several CC minor subfamilies of HELITRONY1A_CE in the genome. HELITRONY1A_CE CC copies harbor one half of a PALTA3_CE-like DNA transposon CC (positions CC 2890-2715) inserted into an ancestor of HELITRONY1A_CE. About 20 CC ancestral copies, free of the PALTA3_CE-like insertion, are CC present CC in the genome. A 35-bp minisatellite, CC TGGCGGGAAATTCAAATTTTCAGTGAAAAAAATTT, is harbored and propagated CC by CC HELITRONY1A_CE. XX SQ Sequence 3084 BP; 1063 A; 388 C; 655 G; 978 T; 0 other; tcattactta tatataaaaa aatacagttc gtctgtccat agtttgtagt ctatgtagtc 60 tttgtagtct gtgacgtcac gcccaaagtc agtgagagtt gtgggcgggg cacccttcgt 120 ggtgagaccc atcgtggtga gacccttcgt ggtgagaccc atcgtggtga gacccttcgt 180 ggtgagaccc atcgtggtga gacccatcgt ggtgagaccc ttcgtggtga gacccttcgt 240 ggtgagaccc aaaattttgg cgggaaattc aaattttctg tgaaaaacat tttggcggga 300 aattcaaatt ttctgtgaaa aattttttgg cgggaattca aattttctgt gaaaattttt 360 tggcgggaaa ttcaaatttt ctgtgaaaaa aattttggcg ggaaattcaa attttctgtg 420 aaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa cattttggcg ggaaattcaa 480 attttcagtg aaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa tttttggcgg 540 gaaattcaaa ttttcagtga aaaaaatttt ggcgggaaat tcaaattttc tgtgaaaaaa 600 attttggcgg gaaattcaaa ttttctgtga aaaaattttt ggcgggaaat tcaaattttc 660 agtgaaaaaa attttggcgg gaaattcaaa ttttcagtga aaaaattttt ggcgggaaat 720 tcaaattttc tgcgaaaaaa atttttggcg ggaaattcaa attttctgtg aaaaaaattt 780 tggcgggaaa ttcaaatttt cagtgaaaaa attttggcgg gaaattcaaa ttttcagtga 840 aaaaaatttt ggcgggaaat tcaaattttc agtgaaaaaa ttttggcggg aaattcaaat 900 tttcagtgaa aaaattttgg cgggaaattc aaattttcag tggaaaaaat tttggcggga 960 aattcaaatt ttcagtgaaa aaaaattttg gcgggaaatt caaattttca gtgaaaaaaa 1020 ttttggcggg aaattcaaat tttcagtgaa aaaattttgg cgggaaattc aaattttcag 1080 tgaaaaaatt ttggcgggaa attcaaattt tcagtgaaaa aaattttggc gggaaattca 1140 aattttcagt gaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa attttggcgg 1200 gaaattcaaa ttttcagtga aaaaaatttt ggcgggaaat tcaaattttc agtgaaaaaa 1260 atttttggcg ggaaattcaa attttcagtg aaaaaaattt tggcgggaaa ttcaaatttt 1320 cagtgaaaaa aattttggcg ggaaattcaa attttcagtg aaaaaaattt tggcgggaaa 1380 ttcaaatttt cagtgaaaaa aattttggcg ggaaattcaa attttcagtg aaaaaaattt 1440 tggcgggaaa ttcaaatttt cagtgaaaaa aattttggcg ggaaattcaa attttcagtg 1500 aaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa aattttggcg ggaaattcaa 1560 attttcagtg aaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa aattttggcg 1620 ggaaattcaa attttcagtg aaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa 1680 aattttggcg ggaaattcaa attttcagtg aaaaaaattt tggcgggaaa ttcaaatttt 1740 cagtgaaaaa attttggcgg gaaattcaaa ttttcagtga aaaaaatttt ggcgggaaat 1800 tcaaattttc agtgaaaaaa attttggcgg gaaattcaaa ttttcagtga aaaaaatttt 1860 ggcgggaaat tcaaattttc agtgaaaaaa attttggcgg gaaattcaaa ttttctgtga 1920 aaaaaatttt tggagggaaa ttcaaatttt ctgagaaaaa gattttggcg ggaatttcaa 1980 ttttggtttt ggaatcatct ggaaaattcc agaaaattct ataatgttct agaactttct 2040 ggaaaattcg agaaaattct ggaatgttcc agaactttct ggaaagttcc agaaccttct 2100 ggaaagttcc agaaccttct ggaaaattcg aaaaaaattc tggaatattc gtggtgagac 2160 ccataaattt tggccgaaaa ctcaaaattt ctgagaaaaa atttttggcg ggaaattcaa 2220 attttctgag aaactttttg ttagcttaag taccacctgg aactggcgag acccatcgtg 2280 gtgggaccct taaaaatatg ggcgggaact tcaaaattta tgagaaaaga aattttggcg 2340 ggaattcaaa tttccaagaa aaattttggc gggaaattca aattttctga gaaaaagatt 2400 ttgacgggaa attcaaattt tctgaggaaa acatttggcg ggaaattcaa attttggttc 2460 tggaatcaat tttttgtttc agcaccttct ggaagtttca agaaaattct agaatgttct 2520 agaatcatct ggaaaattca aaaaagttct gaaatgctct acaacctcct ataactgtag 2580 gagtttttat ttgtttcgtc acatctatat tcaaagaact tactattagt agttaattat 2640 tcagacttct ccagttaagg aaagttatta acttattgaa aaaactgtaa ctctgtggaa 2700 ttttttcggg ggagggggac agctcgccac gccaacagtg agaaaaatag tgtggtgcgc 2760 ggacctcgaa tgatgtcggc ggctgatatt acctggggta ccgctttttg ccttttcccc 2820 gtgagtgaac gagaaaaagg cggggactat tttggcgtac acgcaacgca gttttataat 2880 tttgagtgta gaaaaatcag aacattctaa ctgtctcttt gaattttctg aatcgaatac 2940 catggaagaa gaaaatgcga tttaatcgat gaagtttcaa cttttttaat gatacatttt 3000 tgaatgatta aaaggacatc tatgtgttat acattttaaa ctccgccagt cgttggccgc 3060 gccggaggcg cggtcatcgg ctgg 3084 // ID Merlin1m_CE repbase; DNA; CEL; 388 BP. XX AC AF003130; XX DT 16-JUN-2003 (Rel. 8.05, Created) DT 16-JUN-2003 (Rel. 8.05, Last updated, Version 1) XX DE Nonautonomous DNA transposon Merlin1m_CE. XX KW Merlin; DNA transposon; Transposable Element; Nonautonomous; KW 8-bp TSD; Merlin/IS1016 superfamily; Merlin1m_CE; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-388 RA Feschotte C. and Wessler R.S.; RT "Merlin1m_CE, a nonautonomous family of Merlin/IS1016-like DNA RT transposons from the the nematode C. elegans."; RL Repbase Reports 3(5), 95-95 (2003). XX DR Genbank; AF003130; Positions 40499 40112. XX CC Merlin1m_CE contains a short stretch of coding sequences CC (41 aa, VEIDESLFSKRKNNSGRILPQLWIFGGICRETGEFFLTEVD) CC with 46% identity (60% similarity) to the Merlin1_CB transposase CC (from nematode C. briggsae). CC Merlin1m_CE TIRs are very similar to those of elements from the CC PAL8C families. Presumably, PAL8C_1-PAL8C_5 belong also to CC the Merlin group of DNA transposons. XX SQ Sequence 388 BP; 110 A; 62 C; 79 G; 137 T; 0 other; ggtactagtc ctaaatcacc ccgccacttt ttttttgttc cttttttttt ttggtttttg 60 ttttcctttt tttttcgaaa aagtggaaga gttttcattt cttggaacat gtacatgcga 120 gaggtggtgg ccgcggacct cattaaaaga aacatgattg tagagattga tgaatcctta 180 ttctctaaga gaaagaataa ttctggaaga attttgcctc aactttggat ctttggaggt 240 atctgtagag aaaccggaga attcttcctg acggaggttg atcttattct tcctttcttt 300 tctaatctga acgagttatt attccacgtg ttcaaaaaaa aacgaaaaaa aaaagaaaaa 360 agtggcgggg tgatttagga ctagtacc 388 // ID TR5 repbase; DNA; CEL; 1654 BP. XX AC U86951; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Caenorhabditis elegans TR-5 repetitive element, consensus DE sequence. XX KW CEU86951; TR5. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1654 RA Devine E.S., Chissoe L.S., Eby Y., Wilson K.R. and Boeke D.J.; RT "A transposon-based strategy for sequencing repetitive DNA in RT eukaryotic genomes."; RL Unpublished. XX RN [2] RP 1-1654 RA Devine E.S.; RT "TR5."; RL Direct Submission to Genbank (21-JAN-1997)Molecular Biology and RL Genetics, Johns Hopkins University School of Medicine, 725 N. RL Wolfe Street, Baltimore, MD 21205, USA. XX DR GenBank; U86951; Positions 1 1654. XX SQ Sequence 1654 BP; 481 A; 334 C; 348 G; 491 T; 0 other; atctacttct tctttccacc ggcggctcca gccattccac tgattgacga gaaaagtcct 60 ccgattcctc cgaagattcc accaagatct gaatctccgg cctccacaac ggccacggcg 120 aatagggcga caaagacaag gacgagaagg aagaaattca ttttttaaac aatgatgttg 180 atgatcggct tggaggattg ttttatacta tggattgggt ctcagctgga atggggtact 240 attcccttga gtcgtaaagc gtgagaattt ttagtcacgc ttttctcgca gtcaactgtg 300 aagcgtgaga gtttcagctt gtataaaacc cctttccaga acactgtacc cgttatcata 360 ctcaacaaag aacttcctaa ctctccttct tgtctttgtt gctctcttgg ctgtaactgt 420 ggttgaggcc ggagatccag atcttggtgg aatcttcagt ggaatcggag gtatgctcgg 480 atcgatcggt ggattggctg gagccggtgg tggaaagaag aagtaaatgt ttttgtgtcg 540 tattaacttt tatccgaata aaaatttgat ttcatctgaa acatgattta tccccacata 600 aaaacctaat gttgtgctct tataactacg cgtatctaac aaaaaactat acgttggatg 660 tcctaaaaca tgatatcgat tccgagatgt tttcgttttc gtctgtctta ctatgtagta 720 gaaatatatc acaatgctac tgaactgctt agagactatt ataactcatt ttagtttgaa 780 atatgaacga tcaaatgaat ggaaagggta catacattgt atgcaacgta tgatggttat 840 ttatggagtc tgaaatttga gacagaaaac tgacataaca aaactggttc aattaataag 900 tacggttttc cattcttcca agtttgttct cacaagtctg cattgtaacc aatcctggac 960 aacattcagt cttaaagaat atcacctaaa atataccctg aaataatctg caccgattat 1020 cggctaatgt taattgacaa aatatgaaat gcagaatact ttattcatac caaattcaag 1080 acatgaagca agaattgaag aaggaagaat ttacttcttc tttccgccac cggctccagc 1140 cattccactg atcgacgaaa aaagtccccc gattccactg aagattccac caagatctgg 1200 acctccggcc tccacaacgg ccacggcgaa tagcgcgaca aaaacaagaa tgagaaggaa 1260 gaagttcatt tttgatgatt atagtcgact tggagtgctg ttttatacaa tagaattggt 1320 ctcagctgga atggggtacg tactgttcca ttgatgtgca gtcgtaaagc gtgagaactt 1380 tcagtcacgc ttttctcgca atcaactgtg aagcgtgaga gtttcagctt gtataaaacc 1440 cctttccaga acactgtacc cattatcata atcaacaatg aacttcctaa ctctccttct 1500 tgtctttgtt gctctcttgg ctgtaactgt ggttgaggcc ggagatccag atcttggtgg 1560 aatcttcagt ggaatcggag gtatgctcgg atcgatcggt ggattggctg gagccggtgg 1620 tggaaaaaag aagtaaatgt ttttgtgtag tatt 1654 // ID RCD1 repbase; DNA; CEL; 151 BP. XX AC . XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 2) XX DE C.elegans satellite-like repetitive DNA (a consensus). XX KW SAT; Satellite; Simple Repeat; RCD1; Repetitive DNA. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and La Volpe A.; RT "Molecular and genomic organization of clusters of repetitive DNA RT sequences in Caenorhabditis elegans."; RL J. Mol. Biol 226(1), 159-168 (1992). XX RN [2] RA La Volpe A.; RT "A repetitive DNA family, conserved throughout the evolution of RT free-living nematodes."; RL J. Mol. Evol 39(5), 473-477 (1994). XX RN [3] RP 1-151 RA Jurka J.; RT "RCD1."; RL Direct Submission to Repbase Update (JUL-1999). XX DR [3] (Consensus) XX SQ Sequence 151 BP; 33 A; 50 C; 41 G; 27 T; 0 other; tgggtctcac cgcgaagggt ctcaccacga tgggtctcac cacgaagggt ctcaccacga 60 tgggtctcac cacgatgggt ctcaccacga agggtctcac cacgatgggt ctcaccacga 120 tgggtctcac cacgatgggt ctcaccacga a 151 // ID RCC9 repbase; DNA; CEL; 240 BP. XX AC . XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 2) XX DE C.elegans satellite-like repetitive DNA (a consensus). XX KW SAT; Satellite; Simple Repeat; RCC9; Repetitive DNA. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and La Volpe A.; RT "Molecular and genomic organization of clusters of repetitive DNA RT sequences in Caenorhabditis elegans."; RL J. Mol. Biol 226(1), 159-168 (1992). XX RN [2] RA La Volpe A.; RT "A repetitive DNA family, conserved throughout the evolution of RT free-living nematodes."; RL J. Mol. Evol 39(5), 473-477 (1994). XX RN [3] RP 1-240 RA Jurka J.; RT "RCC9."; RL Direct Submission to Repbase Update (JUL-1999). XX DR [3] (Consensus) XX SQ Sequence 240 BP; 68 A; 42 C; 30 G; 86 T; 14 other; ggaatattcc aganttttcc cgaattttga tcaaaattct ggaacattcc agaactttct 60 cgaattttct agaaaattct ggancattct agaattttcy cganttttcn ngaaanttct 120 ggaatattcc aganttttcc cgaattttga tcaaaattct ggaacattcc agaactttct 180 cgaattttct agaaaattct ggancattct agaattttcy cganttttcn ngaaanttct 240 // ID CER10-I_CE repbase; DNA; CEL; 11155 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER10-I_CE is an internal portion of the CER10_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER10-I_CE; KW CER10-LTR_CE; CER10_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER10_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (MAY-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 7230..7556 CC /note="predicted Protease region" CC gene 9087..9497 CC /note="predicted RNase region" CC gene 9604..10133 CC /note="predicted Integrase region (frameshift)" CC gene 8103..8768 CC /note="predicted Reverse Transcriptase region" CC gene 6535..6729 CC /note="predicted Gag region" CC primer_bind 1..18 CC /note="predicted Primer Binding Site (Gly-TCC type)". XX SQ Sequence 11155 BP; 3422 A; 2928 C; 2443 G; 2362 T; 0 other; tggtgcatcg accgggaata gcctagcggc tagggtgcgt gagcgcgccg caccagtccg 60 gggttcaacc ccggctgtgg cggcgatctt tctaatcctt ctgcgaagct gtttctggtc 120 aaaaccctga gtccggggcg ttggacaccc atgtgagtta tgagtttctc tcgtgggagc 180 cattgagttc acttagctgg agcgaacgag gatgggatga gccggtcaat gcaccaaatt 240 aaggcaattt taccccacat atatcctcgt gagcactgtc tggtcaatcg atcttctaag 300 catggtaaaa gccccctcag aacttgagta gatggtgagt gcaatcgatt gatcatcgtt 360 ccagcagagt taactcagtg acaaccacgg tgggagcgaa tgctcgcatt ggggtccgac 420 ccgcggaaac agggttttca aaataaggcg actggccaac cctgtaccaa gtggccaatt 480 ggcacggggc gatccaagaa cctaattttg aaaaccctgt ttccgcggga caacgctcgt 540 ggtcagaaca accctcaaga aggaccggag tcgacacgtt gcccccgttg cggaagtcat 600 ctattgggaa tggcgctaaa cgatcataag ctacatgact gcgtattcaa gaacatcact 660 ccaatggaga cgctcgaatt catggcagta aaccactgga catactgtgg aggatgcaat 720 acgcgatccc acatgcatcc gaggggagaa tgtcaaagag tcggatgcta ccaatgcggt 780 tccatgggcc acgcgacagc tcaagcccta tgcgatctac ccatcggcgg tgtcactcct 840 gaagctcaaa cgatgcagca agagacagcc tggggattcc gcagagaatg cttacagcgc 900 acgcgagctc tgattctcaa caccactcac ccactgaaat atcgtttatt cacggatcac 960 cgagcaatct tcatcgaagc tcgggaacct cacgatcagg cgggttggag agtctatcga 1020 gataccgaca acgaattccc ttttgcggac atccagcgat actcttatac gcacgaaaac 1080 ccaccacgga gagcatactt cacagtattg cagcctccag agtacgctct caaaactgtg 1140 aacaagattc cagcattttc tccggacgaa tccgcctatt tagagcggtg cggacattgg 1200 atcactcagt accgtcaagg caatgaggta cgtactatta aaagttggca cttaaagtaa 1260 aacagtcccc ttcttacagc taccaatccc aaccttactc gacgagtcag attctagcag 1320 tccaccaagc tctgatagcg aggatgagtc tgatgaagac aatgctcctc aacgtcaacc 1380 agaggaagtt ccagcacctc tcaacatcat tcctcccatc attgtcccac caccggcacc 1440 actaccacaa gttgttcatg gcatcgccag aatcgtgagt ccaaaagcac tagttaaatc 1500 aaagtaaaag gatttatagg tagcgcaaga ccacagacta gctcgcaacg gcaacttgat 1560 tgactacgtc aacgggtggg ctcaccggcc actggactgg gcgcaagaaa tggaacggga 1620 tgagagagca aaccaaggtc agcgagatag agtggctcgg gaaatcagac tgttcgaagc 1680 ccgattcgca cgagtaccgg ttgaaccagt tgaggaaatt gttcaagagg taacagtctc 1740 aaaaaaaaaa tgattaaaaa attgatttca actacttcca gagagaagaa gaagatcagc 1800 ctgacggaca tgccatggaa aaatgcacac gactatgctc tcgccccatc atctggatgg 1860 acgagacggt tcgcccgtgc ttccaaaaag aggccttcgc aaaactcgaa gcagcttctc 1920 ctccagacac aaaacgatcg atgtgccaca gagctcaagc ctacctccta accctcactg 1980 gtcaagcaga tcaccgctac ggagcttatg acaacatcag tcacttccaa atggctcagt 2040 acatcgaaga tctcaagtcg tacggaaaga gactacgagg cgaagaatgg actctctgca 2100 aattggaata caattccatc gagaacatca gaagcccgct acgtccggaa ggatccagat 2160 tcatctatct tccaacggtc actctttggt gtatcgcctc agtgcaagga gcaatgaagg 2220 ccgtattggc aaaacgcgac ccagctcaac atgtcaatcc tatcgacaat gtagttggac 2280 tcgactcaaa gccatggttc aacattttgc taccatcatt tgccgatgac gaaaccaacc 2340 cagcagcaca gtttgctcgg gaaactgagt aagtagtcaa aagtcaattc ccaaccaatt 2400 taaaccgcat gaattcaggc ttcgcaaaga actcgcaaac ccgatgaatg tccggtacaa 2460 caacaacact cgagcacatt tcgaaagcca cctggacagg tacttggatc ttaaaacccc 2520 aagagacctc actacgtgcg agaaatgagt ggaccgtact gatgctcacc ggtgaactac 2580 caaatgagtc tttcaccaca ggttcggaag aagaccattt ggaatattcc gggttcctca 2640 tcgcgattgt tcactggctg aaacgtctac attcttacgg gtacaactgc acagttcgtt 2700 cggaccactg tattgacgtg ctggcctctt cctcagctac caaccatgta ctgtacattc 2760 ccacgcttgg caaattcatg gatgcccacg tcggaaaaat gcacaactgg tacgatgttt 2820 tgctcgagaa ttggtacaca acggcactca gaaacagaca cgagtcatgc ggatgcctca 2880 tgcgcagata atactctttt acgtataact tcactccgcc aatcacacta tttgtcagta 2940 tttcttctga ataaatattt attgtaatgc gttgcgacag gagttacaga aataaaatgc 3000 ataaaataca gagatgaagc gcaacggctg attgaattgg taaaggcaag aatcataaga 3060 gttttttatc ccacaaaacc acacttcacg cttaaaggct tcccaaagag atatactatg 3120 gcaaccttca gtttggagtc ataaaaccat tacctgctca agggaaatcc gaaagtaggt 3180 ttgattcgga ttcatcctgg aatggtcgga gattccgtgg tgatcctttc ctgctacgag 3240 gggagcggct gggcgcacta gaaggcacag agcggagcac aagatccctt agtcccgaac 3300 tagtctcctt tctttgatcc aaaagcgaaa gctcgtagta ggcttgtgag ccaggagctc 3360 ttcaatgatt gcaagcattg aggaaggaag cttctgacga agcaaggact agtccttcta 3420 gagggacagt caccgaacac cacggaagcc tgatgtaagc taggtacgtc agatgtgcac 3480 tgacctccca tgcaaacata cgggcaacat accgatggtc aagacttctg ggaaggttca 3540 gtcccaatat aacctctacg agtcagataa aaaattgcgg ccacagcctg gattagagag 3600 cactaaaccg ggagtgcagg gacaccgctc tgcagagtag actttccaga aggacagtgc 3660 ttcgaactgc aacatgggca gcatgattgt acagcatgaa ctgagccgca gcgcactgtc 3720 acttgaccaa aggttcaaac ccaatggacg gcacttagtg catggggaag gcaacacagt 3780 tgcgcctatt gtggtggaac agacacgtaa tccatgaact ggaggaaccg tgccacccga 3840 gggaacaaaa ccacccacat caaatgaaat atgtcgaggg gtacacccat gcactaatct 3900 ctgtgggtcg cagcaaccac agtgaacgaa tgccgacaat actcaaaagc gtagctgaca 3960 aactttgctc accaaccgat tccatctacg gttcccaatc aacacaaaag atcgaagaca 4020 gcacttcatc tggcgtaacc gagccattac tcacacaaat ataaggtccc acttacggtt 4080 cccactctaa acccaagctt accaactgac aaggactata ttatcgactg gctaccaaca 4140 tcactcatgt tcggttacag ctcgaattct aactatttgt cttttaggta aactggttta 4200 actttgtccg tggacaacta cccccctact gcaatggcat caacgtcgaa caatcaacat 4260 caatccgagg cgcctccgac tgagactccg acccttacga cgagtcccac tacgagccct 4320 caacgccttc cccaacggaa cgccagatca ttcgccaagg caaccagcct tggacagaga 4380 ctggtcgggc cacttaagag ctacattacc agctccgtca accacggaaa tcggtgccta 4440 ggaaacgtca ctcaacatct gacgaaatac ttccacgccg attccgtgtc tccaattacc 4500 gagactagtc tcaacaatgc cgaatacata cttcgtaaag gagaagctta catcgataga 4560 ctcgaaaagc tgaaccaata cactcaagaa aagtacaagg acgttgaatt ccaagaatct 4620 cctgaaaaag agagattcca aagagaagtc gcccagcttc ttgtagacat caagccggaa 4680 gtggtacttg ttaacctaaa aatggaagtc cacaaacttc gcacggctct cgagcaagca 4740 aaccgggtgc tctctccaaa aaatactagc ggctacgtct tctccgactt taacacgggt 4800 cccttcagct ctgattccga ggcggaacaa accgtcagaa atcaacataa aagcccaacg 4860 ccaccggcca ctgccccagg cgcaactgct aacacctcgg ctacttcaac tccagacctc 4920 tccactatta gctcggaagc caacgctcac cttaattcag agtacccaca cacggactcg 4980 ttagcatcag agctagtgcg cacgactgag cagttcaaag cactggagag aagaaacgaa 5040 gagctgaggt gcattgcgga cctggacaaa cagaccattg ctcacctcca agatcgttgc 5100 acagctcaac aacaagagaa agaggaaagc attcgccaag gcgagagaga gcgggaggaa 5160 taccgtaaat atatcaaaga gagaacaaag gacgaacgca tcgcagagat gaaaagagaa 5220 tcagcggcga cggctattcc aaacatggtc acagaaccgc aacatcaacc accaatcatg 5280 gtaacgcagc cacatccatg ctcaaattat gaagcaaatc aatggagagc tcagagacca 5340 gaggaagttg agagtgatgc ggccatgatt gcaagaatgt ttgcgtcaag ccaagatcgc 5400 atgacagctg acatctctaa agtactcggc gaacaaacag ctactctaaa agccatcgcg 5460 ctccaagtca aacataactc tgatcgcctt gacgacaagg agctgaatgg cgacgaagca 5520 gaatacgatg cagcctgcgc tctcgaacaa cgcgctcaag aagcgacaaa ccaaaagccc 5580 aaagaggcag aagtggaaaa gctcttggaa cagcgttcga cacatagaat gaattacaag 5640 atggcgatgg aatacctaat ttcattcaac ggctctcaaa attccaactt cagtcaattc 5700 tacagtatgt tcaacagcac tgtgatgaac aataaagaca ttggagacga aaccaaacta 5760 gccatcttga tttcaaaact ggaaggagac gctagaaagt gcatcgcaga gatcagcgaa 5820 ccttctttgg cgattgagac aacactcgag tgcttagaga cggtatacgg aaaaagcgat 5880 acgaaacgaa gcttactcca aaagtttaaa gctcttccgt tccacccgtc cgatcccaag 5940 caaatgaaac tcgacctcat gagccacaag aatctagcat acaagctatg catccaaggt 6000 ttgggaaaag atgacgacag gatcaccatc gaaattcttg ataaattgcc acacattatg 6060 agaaccagag tctcggagca ctactcacgt ctatcggcga agggtaatgt aacagcagtt 6120 cttatgtatc aatacatctc tgaattaatc gatggctggc aagtcgaaat cgaaaactcc 6180 actcaccacg gagcgtacag aaacaataac accatgcacg agttgagaac cccacgagcc 6240 gcaatcgccc ttcacggcaa tgcctatgcg ggaccaactc cacaaaatgg tcaatacaat 6300 cagaatctgg gagggaatag taacaactac actccggccc catacaacag ccagaagcag 6360 ccggggggcc aagggtccag aatctcacta cccaaggtgt atgatgcaca gcttttcaaa 6420 gagcagttca aggacccaat tactggatct atgctcccag gttattacag gccaggcgaa 6480 ggagtccagt tggacataat ccacagaacc ttccccctga atgacgagac gtccaagttc 6540 ccttgcacgc tctgcaaagg ctcgcacaac ccattgcgct gtggtctgac cagctgggaa 6600 ttcagaaacg cgatagagat gactcgacgt tgtccaatct gcacattcaa acatagaatc 6660 gagaattgta cttcccaatt caaatgtgcg tactgcagtg actccatcac actggaggat 6720 gcccgcgaaa agagcattac agagatgtga aaaactaccc ggaagagccc gtcctatcct 6780 gcagtttttt tgtgacttgt ttaaaccaaa agggtcccaa tccaaaacca actccaccaa 6840 ctctcaataa ggcatttccg gcaggcacgg agacgaatgt accgagctcc gtgtcacaag 6900 ctgacttgcc aacagcactc caatactgta ccattagttc ctctcaatcc gtttcaatct 6960 ctagtaaaaa tcgttccaat gttgttactt ctaaccctat tgccacaggc aacccagttt 7020 ctattgttcc aaactcacta aataaaccta acaaggttac attcttgtcc tgctaccctg 7080 agccacccag ccctgaagct attcgcgagc acacagcgct cgtggccaga gataaacccc 7140 ataacttcat tcatatcgaa caccgtccag acgaacggct aacatttatg tgcctgacgc 7200 tggaacatgg cgaaccgctc atcgcactag tagattcagg ggctacaatc tcactcatcg 7260 gtcacaaacg cgcaagcgaa cttcagctgg aaatagtcat ctcagttacg atgactatga 7320 acggtttcaa tggcagggca acttcaatct caaatattta cccaatttct gtaaaactca 7380 gaaattcctc aaattgtgca agattcttta tcacgggtct cgataacatg ccaacaactt 7440 cattcactgc accgaaattt tcggcagatg atcaagcatt cctgttggaa aacggtatta 7500 acccatacga gatccaagca caaaaaagat taaacgggct cccgatggag gctgttcttg 7560 gtagtgacgt catcgcgtgg ctagaaagtg acgcccacac caagcgatac ctattaccat 7620 aaaaacgcgt tctacaagac acgataatag gttacatagt ccatcctcca ccggtcctgg 7680 taatctatac caacgagccg agtctttaag accgagctga cgaaaattgt gcaaatctca 7740 tggttgctaa cacgcttcta acaggcgagg aaactgaaga tttgctcacc aagctggtca 7800 atcagattga gcagtcctgg aagctagaaa atctgggaat cgaagagatc tcagttgcag 7860 aaaattccaa aaagacggct ctcgatttgc aagccgagtt cgatagaaca ataaggtaca 7920 acagcgacgg cgaactcgaa gttgcgctac cgtacaacgg caatcaaaac cgcttgagaa 7980 gcaacttcaa aattgcccac aaaggattgg aaatcttgta catcaacctg accaagagtc 8040 agaaatccaa catcatggat cagtacgatg cagtgatcaa agatcaacta tctgcgggca 8100 ttgtcgagcg agtctggcct aacatgatcg gagctgacga tgtgaaatac tatatgcctc 8160 acagagcagt tatcaaagag acttccaaca ccactaagct tcgcattgtg ctcgacgcat 8220 caagtccacc tggaccggaa ccatctctaa atgactgcct gtatgcaggg gataatttgg 8280 taactccact ctatggaatc ctcctcagag gacgtatcta cccgtatgtg gtcgtagccg 8340 atattgaaaa ggccttccac caggtaagac ttcaagaaga gttcaggaac gccactttat 8400 tcctttggct caaagacatc accaagcccc caactccaga caatatctgt acgtatcggt 8460 tcactcggat cccattcgga gtggcaagta gtccatggtt gttggcagcc tcaattctcc 8520 acttcctaga ccgcaacccc aatccattaa accaacgcgt tcgtgaaaat ctctatgttg 8580 acaattgtct cattggtacc ttcgataaat ctgaaatcac agagatcatt aaaaactcga 8640 aagacatatt catgaaaatg aaaatgaatc tgagagagta cgtgactaac tcagaagagc 8700 acatggctgc catcgaagtg gaagatcagg caaaacacag agttatcaag ctcctcggtt 8760 gcgaatggga ttcccatgag gatatcgata cgctatcagt agctctcgca gtacttgaca 8820 ttgatcaccc aactaagaga caagtcgcgt caaagatggc ggagacattc gatcctcttg 8880 gcctgataac accactcatc gtgtcgttca aacgattaat gcaaaacttg tggcgcaatc 8940 ccatgaaatg gaacgatcgt attccaaaag agctactagc agactggaaa actattcaag 9000 ccgcattcat ggataccaag atcaccgtcc cgagaagaat aagcattttt gatgatcccc 9060 cagaggagat tcaactacta atgttctcgg acgcatccac ggacatctat gcaacggctg 9120 catacgttca gtactcttac aagtctcgga aaccagtaac tcgcctactc actagcaaaa 9180 gcaaaatcag atcggcaaaa aacgagatgt ggactatccc taaactcgag cttcttggga 9240 tcgaatgctc ctccaatcta gccagaacaa tcattgctga actaggcaca ttggccacga 9300 agatcaagaa agttcgacta ttcacagata gcgcatgcgc tctgtactgg attctctcca 9360 acaagaccac aagagtctgg gtagcaaaca gggtgtcaca gattcaagca aatcaaaaga 9420 ttcttgaaga atgtggaatt gaaacatcac tccatcactg tccaaccaaa gagaatcccg 9480 cagaccttgc cactcgtgga atgacttcta ccgagctcca agcgagtacc ttctggttaa 9540 acggcccagc gtttctggaa cgaccacagt cagaatggcc gtgcaaaatt gaaggacagg 9600 taaaatgccg agttgaattc gaacaattgg tctactctga gatcttggat ccgaagaatc 9660 caaagaccaa gaaagataaa aaagagccaa ttaaatcaat cattccaaga gctgaactct 9720 tgttgcgtga aatgcaaaaa ggtgttcgga atgccatatc cctacccgaa taccaaacag 9780 cttccaggat gccgaacgga gccgtcgaaa ccattcgcaa aagttggtct ggattacctt 9840 ggtccagtcg agtatctccg agatgatcaa gagtcactcg gtaaggcata tgtcctgctt 9900 tatacgtgcc tagtcactcg tgccgcagtc ttaagagttg tgcccgacgc tacaacagaa 9960 tcctacctca tggctcttcg aacgattttc catcaagtcg gcgtgccctc tgaggtacac 10020 tctgataacg cggccatctt caagctcggc gcaaaaatga cgaacgacga tcgaggcagg 10080 gaggggttta tgaaagaatc gttggactac tgaaacatca aattcacaag atttgcggtg 10140 atcaaaaact cgatttcttc agcttacaat acgtcgtatc gagtgctcag gctatgatca 10200 acaatagacc tttggttgct cacgcacgtt ctccaaacga catgatcgcc ttacgtccaa 10260 tggacttcat gatcccagga gtcatgatcg aaactccaag aacgccggcc gacagtccca 10320 ccactagcac gactgaaatc cgtacccgag cccatctgga aaaattcgaa tcagccctgg 10380 agcgtctatg gaccatctgg acatttggag tcatgctcat tctgcgtgaa gtttcgcaca 10440 agcacaaaag atgctgcgat ccgaaaccgg aagttggaga cgtcgtgatc atcaatacga 10500 actacgtcag tcgccacaga tggcccctag ccttagtagt gcaagtaaat caatctaaac 10560 gagacggtga aatcagaacg gcagtcgtgc ggtgcaaagg aaaactgtac aaaaggtcag 10620 tctgtcagct catcccactg gagacgagcc gccaggacat tcgacacggt acaggacctg 10680 acaacgacac tcccgccaac gacaccaaca acgacacgga caaggacacc gctggcagcg 10740 accagtgcag accctgcccc acactgccca caccagcact cttagatttc gaaaactcac 10800 actttgcgcg gcgttcccag cccaaattct cccgaacatc ggtgaagaac ctagagatat 10860 cactctggat aggtggaaat cacacgaagc tattgggtct gagatagatg attttgaagg 10920 acccgactac gatactaaca atccactctt ccacgaagat ggggaggctg aagacagacc 10980 agttgagtac gtggacccca tcacggcgat ccccgagata gcttacgaca acgcagagac 11040 cagactgcca caaggtcgta ctagagaata tctcggtcga aaagctaagg ccccgtacat 11100 caactacaat cacgccgaga ttactcgcgt cttgtcggat ccttcacccc cggaa 11155 // ID TIR43YW1_CE repbase; DNA; CEL; 1740 BP. XX AC . XX DT 14-SEP-1999 (Rel. 4.08, Created) DT 14-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE TIR43YW1_CE is a putative non-autonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TIR43YW1_CE; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1740 RA Kapitonov V.V. and Jurka J.; RT "TIR43YW1_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC TIR43YW1_CE is flanked by 43 bp-long terminal inverted repeat. CC There are about ~20 copies of TIR43YW1_CE in the genome; they are CC ~98% identical to the consensus sequence. CC TIR43YW1_CE generates 2 bp long duplication of target-sites upon CC its integration in the genome (it prefers to be inserted in CC YW target-sites. XX SQ Sequence 1740 BP; 595 A; 300 C; 260 G; 585 T; 0 other; ctcagacaaa cgcttagtag cttaattaga tttcaaccat attaaacgag atttcaacca 60 tatttgttta gaaaatccat attggattat atttgctcca tattttatta gaaatccatg 120 ttagattaga tttcctccat atttcattag aaaacaaatc aaatccaata tggcacgatc 180 tcatccatat tggattagat ttgtccatgt tagattagat ttgttccgta ttggattaga 240 aatccatgtt ggattagatt tcctccatat ttcattaaaa aacaaatcta atccaatatg 300 gtcgaaatct aatccaatat ggcacgctct cagccatatt tgattagatt tcgcccatgt 360 tagattagaa atctcacgag aaatgggcgt attaaaataa ataaataaat aagaagaaaa 420 cggaaagtgc aaacgcgtag ggcttaaaat tttttaaatt aaaattgaag acctttaaaa 480 agttttaaaa taaaattatg atttgaaaat aagaattatc tcctgcaaat aataatctaa 540 agatgatgtc cgctgactga ctttgacgaa tctccgaaaa gtctccaact ggtaatgttt 600 ttttcagaaa cacttttttt ttcaaaatcc ggcaagaaat ttcacggcta caactgtgac 660 aacttcgatg gttacaaaca acgaaataac atattctatt aatttaaatt aactcctaca 720 aaccattact tttttcggta taattcgtgg aaaaattata tcatttcaca caaatgatac 780 tagagctctt gctgcgaaaa gcggaaacag tcaataccgt acttttaaag ttttcatgaa 840 ttttgtattt tacgtctgtt ttgtgccaac ccgcagacca tagttaaata aaattacttt 900 tcctagcatt tacattttga tacaaaatac tttaattcac acttccagca accaaaaagt 960 atgatttaac gatttttttt tagaaaaaac tcttttccaa attttctgag actaaaaaaa 1020 tctttgttct cttctattgg ttagtacgat tggaggctaa gattaacgaa agcggttaac 1080 aaactttttc aaatgaaaaa ccttgatatc aagccgttca agacttcaaa atacattccc 1140 tctactcagt tcatctaatt ttccttaact tatcgaagaa acaacgcgcg ccaccaccac 1200 cagctattta accgaacagt tcatcgaaag ttcatgtgtg cagattggcc gagtcggtaa 1260 gaacacagtc tttagaatat aacatcctgg atcgattccc cacagtggca actccagaga 1320 agcgcagtag gggagtttat agagcggcta cagttatcag gcgagacagc cagtgtgtag 1380 tgtaagatat atgcgagaca tgagatgttt tctcattttc tttatttttt ccacattatt 1440 ttcgggtaaa acattcttcg aaaaaaaatt attcagaaaa tccgagccaa atttgattag 1500 attttgtcca tatttgatta gatttgttcc atattggatt agatttcaac catattggat 1560 tagaattgtt ccatatttga ttagatttgt tccatattgg attagatttc aaccatattt 1620 gattagaaaa tccatattgg attagaatag gaagcctatt aattatgtcc ccattatatt 1680 tgctcagttt ctaatcaaat atggttgaaa tctaattaag ctactaagcg tttgtctgag 1740 // ID LINE2F_CE repbase; DNA; CEL; 957 BP. XX AC . XX DT 21-OCT-1999 (Rel. 4.09, Created) DT 21-OCT-1999 (Rel. 4.09, Last updated, Version 1) XX DE LINE2F_CE is a non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW CR1 superfamily; LINE; LINE2F_CE; ORF2; reverse transcriptase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-957 RA Kapitonov V.V. and Jurka J.; RT "LINE2F_CE."; RL Direct Submission to Repbase Update (OCT-1999). XX DR [1] (Consensus) XX CC LINE2F_CE is a non-LTR retrotransposon related to the CR1 CC superfamily. Its consensus sequence is not complete at the 5' CC end. CC There are about 50 copies of LINE2F_CE in the genome; they are CC ~97% identical to the consensus sequence. XX SQ Sequence 957 BP; 269 A; 228 C; 115 G; 343 T; 2 other; gatcacgttg ttttacaaaa aggcatcgac actgttgttg aatggtctat atctaactcc 60 cttcctcttg ctcacgccaa aactgctctt ctaaggctgg gatcaaaaaa cccagttcat 120 ccctacttaa ttcaaaataa acttatacat gaaacttcta ccgttcgtga ccttggtctc 180 attacggact cctctctgaa ctttaaagcc catatcaata gaactgtagc tcttgctctc 240 ctgagatcaa aacaactact caaaagcttc aaatctagtt ccccctcatt ctacgcttat 300 ttatttaatt cttatgtact ccctataatc gaatattgct cagtcgtcta ctctccctct 360 cctaactcta aactctctgc tcatcttgaa aaacctctta gatcattctc tcgacaagta 420 ttacaaagat gcaatatctc atttacttct tatgaaaacc gtcttctaat tcttaatatc 480 tactcgattc gtcatcgaag gcttaaggca aagcttctac ttctttataa attcataagc 540 ggtacctctt acttcccaaa tctaaacgat ctgatcagaa ttacagagtc taaaagaagg 600 cctatgatta ttatatgtac aaaaccaaaa attactgatt ttttcacctc aactgtcccc 660 gtctggaatg ctatttttca taatactccc catttccttt ctccccgtga atttgttaaa 720 gttattgatt cctccataac acggttttga ataattttgt ttgtcggtta taattaattc 780 tatgatctct aatttcccaa aaagtctttm yttttccctt tcttctctca atgccggtct 840 atttttttcc tctctttgtc ggactcgtga gggtcccgac tttttttatt gttttgatta 900 actttgatta aacaataaat tcacttgtgg gtcatcctca aggcaattaa aaaaaaa 957 // ID PALTTTAAA3 repbase; DNA; CEL; 1503 BP. XX AC . XX DT 12-JUN-2001 (Rel. 6.05, Created) DT 12-JUN-2001 (Rel. 6.05, Last updated, Version 1) XX DE PALTTTAAA2 is a non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 6-bp TSD; KW PALTTTAAA3; TIRs; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1503 RA Kapitonov V.V. and Jurka J.; RL Direct Submission to Repbase Update (JUN-2001). XX DR [1] (Consensus) XX CC PALTTTAAA3 is a non-autonomous DNA transposon; it CC forms almost perfect palindrome. CC There are 10 copies of PALTTTAAA2 in the haploid genome; they are CC ~98% identical to the consensus sequence. CC PALTTTAAA4 belongs to the same family of DNA transposons together CC with PALTTTAAA1, PALTTTAAA2 and CELE46B. Their transposase is not CC identified. XX SQ Sequence 1503 BP; 524 A; 238 C; 241 G; 500 T; 0 other; tttaaagggg gagtagagtt tgtggggaaa tatatgtttc tgactctaat tttgcccctg 60 ataccgaata tcgatgtgaa aaaatttaaa aaaatttccc tgattttata ttaattttta 120 aaatccgaaa atccattgga tgcctatatg tgagttttca aacgcaaaat tttcccgcca 180 gagacgcccc gcccacgaaa ccgtgccgca cgtgtgggtt tacgagctga atattttcct 240 tctattttta tttgatttta taccgatttt cgtcgatttt tctcattttt tctcattttt 300 ttggtgtttt ttattgaaaa ttttgtgatt ttcgtaaatt tattcctatt tattaataaa 360 aacaaaaaca attccattaa atatcccatt ttcagcgcaa aatcgactgg agacgaggaa 420 aatcgtctgg agatagaacg gatcaacaag attattatta tatcattaat aatatttatc 480 aattttcttc tgagagtctc attgagactc ttatttacgc caagaaataa atttaacatt 540 aaaattgttc atttttgaaa aaaaaataat taaaaaaaca cattttttgg aaaaaaaaat 600 aataaaaaaa attgtcctcg aggatcctcc ggagcgcgtc gaatcaatgt ttccggaact 660 ctgaaaatta aatgtttgta ttattgtaga accctttcgt tattgagatt tgataacttt 720 taagtaataa aattttcgca gtaagacatt aaaacatttc acaattaagc tggttctgaa 780 ctgtgtgaag tatattgaaa aaaactaact gatacaaaaa tataatttta tgatagtttt 840 ctggatgtcc caatataaac gatgtcaatt ctgcgacatg ctacagtcat ccacgaaagt 900 aacccgaata ccgacaaaag aagaggaacg ccaactttgg atagacgctc taggggctga 960 ttttggtcgg aaaacgctta aattttcagc gaacgaacta tttttttata ttttgattgt 1020 ttttaaataa tatttgccat aagaaattct cacttttcca ggaaacgtcg tttcgccgcg 1080 attttcctcg tctccagtcg attttgagct gaaaatggga tatttaatgg aattgttttt 1140 gtttttatta ataaatagga ataaatttac gaaaatcaca aaattttcaa taaaaaacac 1200 caaaaaaatg agaaaaaatg agaaaaatcg acgaaaatcg gtataaaatc aaataaaaat 1260 agaaggaaaa tattcagctc gtaaacccac acgtgcggca cggtttcgtg ggcggggcgt 1320 ctctggcggg aaaattttgc gtttgaaaac tcacatatag gcatccaatg gattttcgga 1380 ttttaaaaat taatataaaa tcagggaaat ttttttaaat tttttcacat cgatattcgg 1440 tatcaggggc aaaattagag tcagaaacat atatttcccc acaaactcta ctcccccttt 1500 aaa 1503 // ID PALTTTAAA2 repbase; DNA; CEL; 680 BP. XX AC . XX DT 09-DEC-1999 (Rel. 4.11, Created) DT 09-DEC-1999 (Rel. 4.11, Last updated, Version 1) XX DE PALTTTAAA2 is a putative nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; PALTTTAAA2; KW TIRs; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-680 RA Kapitonov V.V. and Jurka J.; RT "PALTTTAAA2."; RL Direct Submission to Repbase Update (DEC-1999). XX DR [1] (Consensus) XX CC PALTTTAAA2 is a putative non-autonomous DNA transposon; it CC forms almost perfect palindrome. CC There are about 50 PALTTTAAA2 copies ~85% identical to the CC consensus sequence. CC PALTTTAAA1,PALTTTAAA2 and CELE46B share similar termini including CC duplication of putative TTTAAA target sites. XX SQ Sequence 680 BP; 246 A; 84 C; 91 G; 253 T; 6 other; tttaaaggtg gagtagagtc raaatttttt ttttgcttta aatgacagaa aatggaccct 60 aataaccgaa tataactgta aaaaaatttt aaaaaaattt ctaaattttt tatgattttt 120 tcaattttka gaaaatcaaa gaaacggctg actaaatttg aattcccgcg caaatgagtg 180 acgtcatttt tgatattttc gcgttttctg gctaaattcg tcggtttttc tcgatttttt 240 cttctatatt tgatttgaaa cactgtttgt caattattta aaacacttat tttgaaaaaa 300 aattaataat aaatattaaa ttgcggattt taggtcaatt ttgacattwa attggcratt 360 tccttagggt tatcaagttt tttttacgga awaagtgttt taaataattg acaaacaatg 420 tttcaaatca aatatagaag aaaaaatcga gaaaaaccga cgaatttagc cagaaaacgc 480 gaaaatatca aaaatgacgt cactcatttg cgcggtaatt caaatttagt cagccgtttc 540 tttgattttt gaaaaattga aaaaatcata aaaaatttag aaattttttt aaaatttttt 600 tacagttata ttcggttatt agggtcyatt ttctgtcatt taaagcaaca aaaaaatttt 660 gactctactc cacctttaaa 680 // ID TURMOIL1 repbase; DNA; CEL; 5024 BP. XX AC AF040643; XX DT 15-NOV-1999 (Rel. 4.1, Created) DT 15-NOV-1999 (Rel. 4.1, Last updated, Version 1) XX DE DNA transposon TURMOIL1. XX KW Harbinger; DNA transposon; Transposable Element; KW Harbinger superfamily; TIR; TURMOIL1; palindrome; transposase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-5024 RA Kapitonov V.V. and Jurka J.; RT "TURMOIL1."; RL Direct Submission to Repbase Update (NOV-1999). XX DR GenBank; AF040643; Positions 29989 24582. XX CC TURMOIL1 encodes HARBINGER-like transposase. The most CC conservative CC domain of the transposase is encoded by the region 3379-4097. CC The TURMOIL1 transposase is most similar to the transposase CC encoded by the TURMOIL2 transposon. The two transposons do not CC share any similar DNA fragments but they have one remarkable CC hallmark being flanked by long terminal inverted repeats. CC TURMOIL1 is flanked by almost identical 777-bp TIRs. CC A copy of the CELE1 transposon has been removed from the TURMOIL CC sequence. XX SQ Sequence 5024 BP; 1573 A; 955 C; 986 G; 1510 T; 0 other; taggttttta ctgccaaaat tagcgaaaat gtctgaaata cttcatttta caattttacc 60 cacttgattc ttgtttcatt ctgttttatt ttgaagaaaa acttatttta attcaaaaaa 120 cttttcagta atcgtcccaa tcaacaagct cactcttcaa cagtccgaga tgctcaacgc 180 gtgtgctcaa tatgctcaaa cgggatcaca gaagctcact cgagaagcta actacgttat 240 cacggaggtt aggttgtttt aaagtttcta aactcctgtt tagttaacat ttaaatgttt 300 cagataatct ctctctcggg cactcaacat ggcttgtgtc tacatcggtc gtttcccaaa 360 tagagcatcc gatcatgatg ccgagcactt cttctgcgga tatgaaaagc tgcctgatgc 420 cataatgaag aacgtatttg gtgttggtta taagataact attctcatca gaaaccctct 480 agaatgttat aatttcagga ttttcaagat cagagcgatg ctgacgacat catttgtgat 540 gagcagacaa gaattttgct tgtggagaac gcgaaggaat acccaacaat taatcacgat 600 tagctagctt gctagctcac agatatgggt aaaaacgatt tcgtacaata aattgaatca 660 aatatattaa ttttcagaag ctcaatggca ggttggctcc gtgatcggct ggtcccttcg 720 atgtctgatt gcagttcatt gcttcagaga gcagcggcgg agttttatca aaataagatg 780 gttttatagt tttcaaattt gttcgaaaaa tattaaatta tttccagagc gatccgcttt 840 gcaattggga tcaattgaat ccggagcatg tctcaatggt ggccgctcga atcgccaagt 900 tttccgaaga gatgtcgtcc aaagtgaaat ggtcactgct tgttgagccg gagatgttct 960 gttgccatct cacagagttc gtacaggagt tctaccggct cgatagaatg tttttacaca 1020 tttatctgtg cacaagcttc attgaaaaac actcaagaag atttctggcg tatggttttc 1080 caagagaagt cggatctcta taattctagt tttgcattta aaaaatttaa gatttttcca 1140 cttgttatta taagaaaaaa atgaatcaat taattaatta ttcttcgttt tcttttttca 1200 ggttcagtca gtttcttcaa tatatttaga gcgttcacaa attacgcagt caatcttcta 1260 tttctaaata ttttctatct ccgatgcagt caaaattcta aaaactgcta attctttttg 1320 atttattacc cgatggatgt agttttacag ctcggaaatg agttccgact cagattctga 1380 ctttggcaaa gaagcttcaa aagcacccaa acgaaggatc actgattgcc tgtttgatag 1440 cgacgacgaa gttcttgagt caccgggtag ggaagaactc agaaagcgat tgctttctga 1500 ttatgaggat gtaggatcag ttacgataga tacgccaaag ctaaaaaatc ggaatgttcg 1560 ccttcgtgga gcaggtcttg aagtgaaagc aggtcggcca cagaaaggtg gcgatgatta 1620 taacgatgta agaattcttc gtactttttc tttaaaattt gaaaaattca gtatggaaga 1680 agcgaaatgg aacaatttct tcgacttgtc tcggacgagc cttttctgtg gcacacaaaa 1740 gtttcggatt ttcacagaaa aaaagagaaa gggatgcctg cgtgggccga aatcgagagg 1800 acttgcaata cgtttatggg aaaaggaaga actccaggta aaggcattat aagttagcaa 1860 tttatggact tcgttttcag gaaaaactgc tgctgctctt ttcaaaaaaa tggctacaac 1920 ctaaaaatct catttgacac gaatccaaaa ggctccatca ggatccgccg tgacaccaaa 1980 gaaagatctg ttctgttatg ccgagttgat gtcctttatg gatgaagata tcactcacag 2040 agcaccaaca tggttcgtgg aaaagacatt ataatattat tttcaacttc aattttacag 2100 ttcctttatc gtcggagccg gacctgatgt tgatgttcat ccaactgaga cgagctcttc 2160 aaaaaaaaag atcagaaaaa agagcacaga aaatctaaaa gtggctattc gagggaagaa 2220 gcaatggaag atgctcaatt tttaatgaag aaatttatga agcaagatga tgatgatgtc 2280 aaatgtggaa gccgagtgtg ccaagctttt gagaaaagtg tagatggtat atctcacagt 2340 gatgccagtt tgatggagta ccaagttctg aaatttattg aaaacctcaa ggcaaaagag 2400 aatcaaaacg ctgacagctt ccgcccaact agctcctcct tcatttaatt cgcagccaat 2460 gagttattat tctgttccac acccacaaca ctataatact tatccacaat gtcctcagta 2520 tcatcaagac ccaccgtttc catatccaaa tataccttcg tcgtctcagt gtaatgcttc 2580 ttcatcatct tcatctcaga attctgatct tcatgatttg agccattttt cttacagtag 2640 agattcaaac tctttgtatc agttaaaaaa agagtagaat cgtatatttt gcatattatt 2700 tgcttttgtt atcactttca ttttaaaagt gttatttgtt ttgttttctg ttgtggcata 2760 aattgtaatt ttgaaatttg ctttctatat caatcaatat tatgggaaag gtactgtaga 2820 aaacttgttt tcaattggaa taatatcgga ggacagaaat tatatcacac tgcctttgag 2880 ataggaataa agctgatttt cagatggcac cttccagtcg agaaatcacg tcccgtaaga 2940 agaaaaagct gggaatcatt gtatggagaa aacaataacc tgaaaaatgt tgaaccgtct 3000 aattttcaga ttcccatcat tattcgcaga aattttgcgc ttctgaacaa taatgaagca 3060 aagaaagaaa aactctgcat caatcgtact acaaagaaaa ttgttgatgt tcgaaatttc 3120 agaactttcc gtgacaatgt agatgactat ctgagtgacg atggaatgat gaaactccat 3180 ctaggcatca ccatttcagt tttcaaaaaa ttgttaactc tcatccgtca cgcgcaaagg 3240 catttgaaca ttcataaaaa tttctcacag gtaacaccca gaacgagata tccaaaatgc 3300 atggtctttc ccagccgaca atctcaagaa ttgtatgtgg agttatcgat gatctagttc 3360 gggtttcttc ggaatatatc aagttcccaa caactagcga tgagattgaa acgatgacga 3420 aaaagtttta tgaaaaggaa gattcgaatg gggaggagag acgaatgccg tgctacggta 3480 ttgtcgacgg gaagcattgg agatgcgagc accctccaaa gagcggagcc ctgaattaca 3540 actataaagg gttcttctct ttcaattcat tgattgtgtc cgactcggat tacagaatcc 3600 tttttgtgca gatgtgtaag aacggattga attctgacgc acaactgtac caaaatggtc 3660 ctcttcccag gctcttgacc aaggcaattg aaaatgtcgg ttatcgtact cttcctgatc 3720 ccaacgtcct gatgccacca ttcatcctcg cggataatgg tttcggtctc cacaagtcga 3780 tgatgcaacc atataggcca acccaaattg gtctgaatcc agaagagaat atcagtttca 3840 atttcaagct gagcggtaca cgagttaaag tcgaaaacgt tttcggagta ttgacttcga 3900 aattccacat cttccagcaa aatctgaggc tcgagccgaa aacgtcgcag gcactcatca 3960 ttgctgtctg tgtggtccac aacatctctg tcggacctct cgaggtcatt cccgacgacc 4020 caaatcgtcc aaaatttaga gatccgtaca gaactgcaga gcagcagaga acggctttga 4080 agaaatacct tcttgatctt tgatttgaaa tctggaatga atacattgtt aaaagaaatg 4140 ctaatttttt ttatttcaaa attgaagtga ttttatttca aaaaacaatt ttttattatt 4200 tcgggaagat tccatggctt tgagcggttc caagaagagt gaaaaatcgc cgctgctctc 4260 tgaagcaatg aactgcaatc agacatcgaa gggaccagcc gatcacggag ccaacctgcc 4320 attgagcttc tgaaaattaa tatatttgat tcaatttatt gtacgaaatc gtttttaccc 4380 atatctgtga gctagcaagc tagctaatcg tgattaattg ttgggtattc cttcgcgttc 4440 tccacaagca aaattcttgt ctgctcatca caaatgatgt cgtcagcatc gctctgatct 4500 tgaaaatcct gaaattataa cattctagag ggtttctgat gagaatagtt atcttataac 4560 caacaccaaa tacgttcttc attatggcat caggcagctt ttcatatccg cagaagaagt 4620 gctcggcatc atgatcggat gctctatttg ggaaacgacc gatgtagaca caagccatgt 4680 tgagtgcccg agagagagat tatctgaaac atttaaatgt taactaaaca ggagtttaga 4740 aactttaaaa caacctaacc tccgtgataa cgtagttagc ttctcgagtg agcttctgtg 4800 atcccgtttg agcatattga gcacacgcgt tgagcatctc ggactgttga agagtgagct 4860 tgttgattgg gacgattact gaaaagtttt ttgaattaaa ataagttttt cttcaaaata 4920 aaacagaatg aaacaagaat caagtgggta aaattgtaaa atgaagtatt agacattttc 4980 gctaaataac aggcattttc gctaattttg gcagtaaaaa ccta 5024 // ID DNA-3-1_CE repbase; DNA; CEL; 711 BP. XX AC . XX DT 11-APR-2008 (Rel. 13.04, Created) DT 11-APR-2008 (Rel. 13.04, Last updated, Version 1) XX DE Non-autonomous DNA transposon: consensus. XX KW DNA transposon; Transposable Element; DNA-3-1_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-711 RA Jurka J.; RT "Non-autonomous DNA-transposons from C. elegans."; RL Repbase Reports 8(4), 506-506 (2008). XX DR [1] (Consensus) XX CC 3bp TSD. CC This sequence was derived from sequence data generated by The C. CC elegans Sequencing Consortium. XX SQ Sequence 711 BP; 230 A; 137 C; 135 G; 208 T; 1 other; actagggaac cgtcagaagg tgcccatcgg acttgcatat gagacctatg ccattcgata 60 gtgcatgtct taaaacggtt actcgtgaat ttttagcggc aaaactccag aaccaagctc 120 acggcgggct ctcaaagatc ttaaaatagc actgtaagga agaactttaa cgatctaacg 180 aagcaatttt ccatattcac tttggtagct catatctccg cggataaatt ttttacagaa 240 aagtcatcaa ctaaaaagtt gttgatattg ttgtaaagaa caagtttgta gttgaaagtt 300 ttttaccaaa aaatttttgt ttgagagaaa akcattagaa acggaatagc agcataaaaa 360 taacaacagc agttgccgca cttcacgcgc ttttatctca aacaaaaatt ttttggtaaa 420 aaactttcaa ctacaaactt gttctttaca acaatatcaa caacttatca gttgatgact 480 tttctgtaaa aaatttatcc acggagatat gagctaccaa agtgaaattg gaaaattggc 540 cctccaatgc ttcgttaaag tgctatttta ggatctttga gagcccgccg tgagcttggt 600 tctggagttt tgccgctaaa aattcacgag taaccgtttt aagacatggg ctatcgaatg 660 gcataggtct catatgcaag tccgatgggc accttctgac ggttccctag t 711 // ID CER2-1-LTR_CE repbase; DNA; CEL; 620 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER2-1-LTR_CE is a long terminal repeat of the CER2-1_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER2-1-LTR_CE; KW CER2-1_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER2-1_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (MAY-2003). XX DR [3] (Consensus) XX SQ Sequence 620 BP; 141 A; 165 C; 124 G; 190 T; 0 other; tgtaaggagt ctaaccgtct ctacacttct cccacttccc ttttccccag tgatagaagg 60 ctaagtgtgt atagggatta atgcttttat ttgcaggatc accggtcaga aagtcagcca 120 cgtcatggat caaaccttcg ctcttctccg aatacagctc tacaattgat ccatccgtgc 180 acagtaccaa acgctcctca cgcgttcgat ccctgagata attgcaatat tcccacacac 240 tcgattcatt ccatgcctct aaacttccgg gctaccgtaa ccctgtgtgt gtgtgcgcac 300 acatgtgtgc gcgcgcgcgc gtggggagag cgcaccttgt ttacgttttc tggacctttc 360 ggcggaggaa tccagggctc cgccctgcca ccgcagaggg gtatataaga cgtggattct 420 atcactccat ttcttctttt acttttcaaa atccccttta tacctgtccc gttgcctctc 480 tttcttttgt acccattcaa tggggttaat tatacttaat aaacggattc tttagctcat 540 atcgtgttct gttgtagtat gggatgcaac aactgcattt gtcttaaaga taaactgtgg 600 ggaacactta aacctttact 620 // ID PAL8C_1 repbase; DNA; CEL; 212 BP. XX AC . XX DT 09-JUN-2000 (Rel. 5.05, Created) DT 09-JUN-2000 (Rel. 5.05, Last updated, Version 1) XX DE PAL8C_1 is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; PAL8C_1; KW TIRs; nonautonomous DNA transposon; putative HAT superfamily. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-212 RA Kapitonov V.V. and Jurka J.; RT "PAL8C_1."; RL Direct Submission to Repbase Update (JUN-2000). XX DR [1] (Consensus) XX CC PAL8C_1 is a putative non-autonomous DNA transposon. CC It generates 8-bp duplications of target sites upon CC integration in the genome. PAL8C_1 has perfect 26-bp CC terminal inverted repeats. CC The length of the target site duplications hints CC that transposase involved in proliferation of PAL8C_1 CC belongs to the HAT superfamily. CC There are only several copies of PAL8C_1 in the genome; CC they are ~99% identical to the consensus sequence. XX SQ Sequence 212 BP; 71 A; 39 C; 32 G; 70 T; 0 other; ggtacttatg ggtttcgttc cccccaaaat gttcattcaa ttatttaata ctgaattttt 60 aattttaatc cacacgtgaa agtttatttt aatactgttt tcattttcag gcttaggaaa 120 ccatcttcct aagcctgaga atgaaaaaaa agttcacgtc tctattaaaa actatggaaa 180 acatactggg gggaacgaaa cccataagta cc 212 // ID CER13-LTR_CE repbase; DNA; CEL; 243 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER13-LTR_CE is a long terminal repeat of the CER13_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER13-I_CE; KW CER13-LTR_CE; CER13_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER13_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (MAY-2003). XX DR [3] (Consensus) XX SQ Sequence 243 BP; 63 A; 30 C; 53 G; 97 T; 0 other; tgtcaaatcg gagatgtttc aaacatcaac gatttgatgg ttaataaaac tgttgtgcgg 60 aggtgaatat ttatgacttt tgcggggtag cttttattgt tttattgagg ggtttagggg 120 tatataagat aactttttgt attgtagggt tatatttttc tacattttga ataaatcttt 180 cttattttgc tatttgtccc ttggactact tggatacgaa aactgttggc aaccgtgcca 240 aca 243 // ID CERP2 repbase; DNA; CEL; 328 BP. XX AC Z30215; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Caenorhabditis elegans cosmid F40F12. XX KW CERP2; Repetitive element; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-328 RA Kershaw J.; RT "Direct submission."; RL Direct Submission to EMBL/GenBank/DDBJ (22-FEB-1994). Nematode RL Sequencing Project, Sanger Centre, Hinxton, Cambridge CB10 Louis, RL MO 63110, USA. E-mail: jes@sanger.ac.uk or rw@nematode.wustl.edu. XX RN [2] RP 1-328 RA Sulston J., Du Z., Thomas K., Wilson R., Hillier L., Staden R., RA Halloran N., Green P., Thierry-Mieg J. et al.; RT "The C. elegans genome sequencing project: a beginning [see RT comments]."; RL Nature 356(6364), 37-41 (1992). XX DR GenBank; Z30215; Positions 30068 30395. XX SQ Sequence 328 BP; 112 A; 56 C; 60 G; 100 T; 0 other; tatcatactt tcactagatt ttgaagtacc ttattaaagg cgcacagatt tacgcaggtg 60 ggtctcggcg cgcaaaaaag tttatggtag tttctttgaa atttgaattt ttaaacctga 120 tttttcgcga tttttaggga gtaatgaaaa ggaaacacca aagtttgctg aaacctttgc 180 gacttcattg aaaaacgcga ggaaaataaa cgaaaccgct taaaaataac aaataactat 240 cataaatttt tttttgcgcg ccgaaaccca cctgcgtaaa tctgtgtgcc tttaataagg 300 cacttaagaa tcttgtgaaa atacgata 328 // ID NTC2A repbase; DNA; CEL; 470 BP. XX AC . XX DT 24-OCT-2000 (Rel. 5.09, Created) DT 24-OCT-2000 (Rel. 5.09, Last updated, Version 1) XX DE NTC2A is a nonautonomous Tc2-related DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CELETC2; KW CEREP12; NTC2A; TC2; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-470 RA Kapitonov V.V. and Jurka J.; RT "NTC2A."; RL Direct Submission to Repbase Update (OCT-2000). XX DR [1] (Consensus) XX CC NTC2A shares similar 25-bp terminal inverted repeats with CC Tc2-related transposons (Tc2, CELETC2, CEREP12A and CEREP12B). CC There are ~50 copies of NTC2A, they are 96% identical with the CC consensus sequences, and flanked by 2-bp duplications of the TA CC target site. Autonomous transposon directly related to NTC2A is CC expected to encode Tc2-like transposase. XX SQ Sequence 470 BP; 170 A; 69 C; 85 G; 146 T; 0 other; ccgtatttcc tctattagta aggcatgcaa aactaatttt cggacaccta atttgatgca 60 aaactaatgg aggtgcaaaa ctaatagagg gtgcaaaact aattttcgaa caggtttttt 120 ctcatgtttt ccattaagtt atggcataat atcatcaatt tcaataacaa cttatgaacc 180 aaaatggacg aattttacga ctgatacgca aaaattgtcc gagttgtact catattttgc 240 caattttgac ttgttatacc aagtctgtaa gagttttcct aatttttaga acgattttat 300 aatgcaaatt ttgaattcct aaaaataggg aacaaatgaa ggggtgcaaa actattagag 360 gtgcaaaact aatagagggt gcaaaactaa ttttcgatta gtgattttag atgcaaaact 420 aatggaggtg caaaactaat agaggtgcct aactaataga ggaaatacgg 470 // ID CELE42 repbase; DNA; CEL; 238 BP. XX AC . XX DT 23-AUG-1999 (Rel. 4.07, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 2) XX DE CELE42 nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE42; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Surzycki A.S. and Belknap R.W.; RT "CELE42."; RL Direct Submission to Repbase Update (AUG-1999). XX RN [2] RP 1-238 RA Kapitonov V.V. and Jurka J.; RT "CELE42."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [2] (Consensus) XX CC Several hundred copies of CELE42 are present in the C.elegans CC genome [1]. They are ~92% identical to the consensus sequence CC and generate 6 bp-long duplications of nonspecific CC target sites upon integration in the genome [2]. CELE42 is CC flanked CC by 22 bp-long imperfect terminal inverted repeats. XX SQ Sequence 238 BP; 64 A; 67 C; 53 G; 54 T; 0 other; cacggggttc tggccttcct cattgaattt ttcgcgctcc attgacaatc gcctgccgga 60 caacgcgtgg gaaagtcgtg tactccacac ggacaaatac atttagtttt acaactaaaa 120 tcgagccgcg acgcgacacg caacgcgccg taaatctacc ccagatatgg ccgagccaaa 180 atggcctagt tcggcaaact cttccatttc aatttatgag ggaagccaga aatccgtg 238 // ID CELE46B repbase; DNA; CEL; 705 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE CELE46B nonautonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE46B; KW MITE; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-705 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification of putative nonautonomous transposable elements RT associated with several transposon families in Caenorhabditis RT elegans."; RL J. Mol. Evol 43(1), 11-18 (1996). XX DR [1] (Consensus) XX CC CELE46B is a putative nonautonomous DNA transposon flanked by CC ~300 bp-long terminal inverted repeats. XX SQ Sequence 705 BP; 270 A; 97 C; 92 G; 246 T; 0 other; ttaaaggtgt agtagcgcca gtggggaaat tgctttaaaa catgcctatg gtaccacaat 60 aaccaaatat catagtaaaa aaattttaaa aatttttcta aattttatat gattttttga 120 aaattgaaaa aatctcgaat tgcatcaaat tcctttttga attaccgcca atagttcgat 180 gttcgatgga gcgcgcttgc attattttaa catttattta tcaatttttc atgttttttt 240 gagtaatttt actggaaatt taatgaaaaa ttcaatataa atgtagattg tttattaaaa 300 agaattgaaa acagagaaaa aacgtttttc aacgacgttg agcctgaaat aatctatttc 360 aaagatttag actttagcgc ccgcttttta ataaaaaaat tacatttatc ttgaattttt 420 cattcaattt ccagtaaaat tactccaaaa aacatgaaaa attaatcaaa aataagaaaa 480 attgataaat aaaagttaaa ataatgcaag cgcgctccat cgaacatcga actattagcg 540 gaaattcaaa taggaatttg atgcaatacg agattttttc aattttcaaa aaatcatata 600 aaatttagaa aattttttaa aattttttta ctatgatatt tggtcattgt ggtaccatag 660 gcatgattta aagcaacttc cccactggcg ctactccacc tttaa 705 // ID MINISAT1_CE repbase; DNA; CEL; 238 BP. XX AC . XX DT 07-SEP-1999 (Rel. 4.08, Created) DT 07-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE MINISAT1_CE repetitive element - a consensus. XX KW MSAT; Satellite; Simple Repeat; hAT superfamily; MINISAT1_CE; KW nonperfect palindrome; KW putative minisatellite formed by DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-238 RA Kapitonov V.V. and Jurka J.; RT "MINISAT1_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC MINISAT1_CE is represented by more than 150 copies which form CC dispersed clusters composed from several MINISAT1_CE CC units. It's possible that HAT-like transposase is involved CC in proliferation of the repeat since there is significant CC similarity of MINISAT1_CE to CELHAT1 transposon. CC MINISAT1_CE copies are ~85% identical to the consensus CC sequence. CC MINISAT1_CE is composed of six tandem units; these units are CC 45/38/45/45/38/45 bp-long. XX SQ Sequence 238 BP; 52 A; 48 C; 60 G; 78 T; 0 other; gttttgaaaa tttccggcaa tcggcaattt cggcaattgc cggttttgga aattttcggc 60 aatttcggca attgccggtt ttggaaattt ccggcaatcg gcaatttcgg caattgccgg 120 ttttggaaat ttccggcaat cggcaatttc ggcaattgcc ggttttggaa attttcggca 180 atttcggcaa ttgccggttt tgaaaatttc cggcaatcgg caatttcggc aattgccg 238 // ID CER5-I_CE repbase; DNA; CEL; 4660 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER5-I_CE is an internal portion of the CER5_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER5-I_CE; CER5-LTR_CE; KW CER5_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER5_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (MAY-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 3111..3632 CC /note="predicted Integrase region" CC gene 1137..1460 CC /note="predicted Protease region" CC gene 1617..2153 CC /note="predicted Reverse Transcriptase region" CC gene 2442..2804 CC /note="predicted RNase region" CC gene 933..1124 CC /note="predicted Gag region" CC primer_bind 2..18 CC /note="predicted Primer Binding Site (Ser-GCT type)". XX SQ Sequence 4660 BP; 1406 A; 1037 C; 1095 G; 1122 T; 0 other; ttggcgatca ggacgttcga gagcttacaa tggccgaagg taagcctact gctttatcag 60 ctttagaagc gcttactgcg caagttgccg ctcaagagca gcgctctcaa caacagtttg 120 aggcgcaagc agactgacag agcaacttat taatagattg cttgcccagc cagaactgcc 180 gcgccagcat gccggtgggg acgtggttgc ggcaccacaa agtaatgccc gtcttatgaa 240 tgatttgaag cgttccatcc aaaagttcca gttcaacgtc gccgagcctg aggctttcaa 300 gagataagct cacattcgtg gaagatggca aaaacttgtc agaacgcgag cgcacacggc 360 tgctgctagg ttgcttagaa gaatctacgt ttcatcgtta tgaagactct caacgagaaa 420 tatcagacat ctacagtatc tctttcgacg ataccgtaac tgcgttaacc aagattttcg 480 gttcaacgaa aagtttaatg atgcgacgtc aacagtgcct gcaaatctgc agagcaaacg 540 ggctgtccca ggattacctg gattatacca attccatcag tgacgccgtc cttgactcaa 600 aactatcgtc catgacgtcc gacgaatggt caattttctt atttctccgt ggactcaatt 660 cgccaggaga cgagaaagcg aaactgtatt taatgcagta cgtggaagct tccgaaaaga 720 agaacgagaa gctgaaactt tcggacgttc acgatgagtg gatgaagttc attcaaatgc 780 atcagcaatc gaagatcgtg tctgtcaaac cttctaagag cagccagcaa gttgatgtca 840 acaaggttga cacaaaccga tcaaagaaga agaagaaacc cattcctagg aaacccgaga 900 agtcgtctca agattcgaag aagaaaggag aaattccaac gtgtttctac tgcaacaaga 960 aagggcatta tgccacaaat tgccgcagca atccgaagac aggcaatcag gggggaaata 1020 aggggaaaag caaaggttgc gacagtgtcc acgtagatgg actagacgtc aaaaccgagc 1080 atcaagctaa gcatagaatg agcgtagagg tttgcggtaa ggatgtagct ttccagctag 1140 atactggtag catgatcact ttaatcagct ttaagtgttg ggaaaaagta ggtagtccgc 1200 cgcttgagaa ggttccgcat agattttcgt gcgcgaatgg tacgcctatg gccgtcaagg 1260 gtcgatgtct tgtcaaattc aaattgaaag gcatagaata cactgaatac gtgtatgttt 1320 gtgatcgaca aacaaatttg cttggaacaa gttggttaaa tttatgtcca caaatgagaa 1380 gtgctctggc acaaattgtc aatcaagtct ccacgtctga aactgaagca tctcgtctgg 1440 aagttatgct caaaaatgac ttcccagaag tcttcaaaga tggattaggt ctatgcacca 1500 aagagaaagc tgaatttagg acagaagaaa atgcagttcc ggttttcaag agagcacgcc 1560 ctgtgcctta cggatctctg gaagcagtcg aaaaggagtt ggatcgtctt caagaaatgg 1620 gtgtcattgt gccgatcaca tatgccaaat gggcagcacc aattgtagtc atcaagaaga 1680 agggcactgg aaaaatccgt gtttgcgcag acttcaagtg ttctggactg aatgcagcac 1740 tcaaagacga gttccatccc ttgccgacaa gcgaagacat tttcagtaga ttaaagggta 1800 ccgtgtactc ccaaatcgat ctcaaagatg catacctgca agtggagctg gatgaggaag 1860 cacagaaact ggccgttatc aacacacacc gtggcatctt caaatatctc aggatgacct 1920 tcggactaaa gccagcacct gcctcgttcc aaaagatcat ggacaaaatg gttagcggtt 1980 taacaggagt tgccgtctat ttggatgata tcatcatctc tgcgtcctca atcgaagaac 2040 atgaaaagat tcttcgtgaa ctgtttgaac gtatcaagga atacggtttt cgagtgagtg 2100 ctgaaaagtg cgcctttgca cagaagcaag tgaccttctt ggggttcatc gtggatgaac 2160 acggtcgccg acctgaccca aaaaagacgg aagtcatccg ttcaatgaag gctccaaccg 2220 atcaaaaaca gttagcatca tttcttggag cagtcagctt ctacagtcga ttcgtaccaa 2280 aaatgagcaa gttgagagga ccactggatt cgcttatgaa gagagacgtc aaatggaatt 2340 ggacagatat tcatcaagaa gctttcaata ctttgaaaaa tgccgtcgcg gacagcacta 2400 tgctctcaca cttcaacgaa tcttggccaa ttatcattgc ggcagatgct agccaatatg 2460 gtattggtgg agtgctgtta catcagactc cggagggaca cgaagttcct gtagctcact 2520 tcgcaagagc tctgactgac actgagaaac ggtacagtca aattgagaaa gaagctgtgg 2580 cccttgtcta cactgtaaag aaggctcata agttcatctt cggaagaaag ttcttacttc 2640 aaacagatca caagccactt ctagcaattt ttggaccgaa gaaggatctt ccggttcatt 2700 cacaaaatcg tctcgtccgc tgggcaacaa ctttgatgac ctataacttt gagttgagtt 2760 acgtatccac atcgaaaatt ttcaaggcag attggctgtc gcgaatgatg caggatcacc 2820 aacaaaacgc ggatgacgtc gtgattgcgg aaatatacga cgacgacgac gatgaggatg 2880 attccatcat ccaaaagttg aatccagtaa ctgaaacaga cattcggttt gagtctcaaa 2940 aagaccacga agtttcatcg gtgatcaaat tggttcgcaa cgactcttgg aagcctaaac 3000 catccaccga aattgagaag cattggattc gctatagaga tcgactgaag ttaattcatg 3060 attgtctctt gcttgatgac cgcgtcatcg tttccaagtc gcttcagcaa atcgtactca 3120 aacaactgca tgaaggacat ccgggcattg tacagatgaa acaaaaagcg agagcatttg 3180 tattctggag aggattggat tccgacattg aaaagatggt acgtcattgc aacaattgtc 3240 aagaaaactc caagatgcca cgtgtcgttc cgttgaatcc atggcctgta cccgaaactc 3300 catggaaacg gattcacatt gattttgctg gtcctttgaa tggatattac ctgctagtag 3360 tagtcgacgc aaagacaaag tatgcggaag tcaagttgac gagaagcatc tctgctgtta 3420 ctaccattga cttattggaa gaaatttttt ctattcatgg ttatccggaa acaatcatta 3480 gcgacaatgg gacccagctc acaagtcatc tctttgctca aatgtgccaa tcacatggga 3540 tcgagcacaa aacatcagct gtctactacc caagatctaa cggagctgct gaaaggttcg 3600 tggatacact caagagaggt atcgcaaaga tcaagggaga agggtcagtc aatcagcaga 3660 tcctgaacaa gtttctgatc agctatcgga acacaccaca ttctgcttta aatggttcta 3720 caccagcgga gtgtcacttc ggaagaaaaa tcaggacgac aatgtcgctt ctgatgccta 3780 cggaccgcgt tttaaaagtt ccaaagctga cgcaatatca gcaaaacatg aagcatcatt 3840 atgatctgcg aaatggggcg cgcgcaaaag cttttcaggt aaatcaaaaa gtttatgtcc 3900 aagtacacca cggcaacaag tctcaatgga agcatggtgt tattcgaaga aagtttggtg 3960 gtgttctcta cgaagttcaa gtcggagaca gaatccaaaa gtctcacgtt aatcagattc 4020 ggactcgtta tggtgactac agtcgtgaag acatttttga ggaaactatc tatcctctgt 4080 tcttctcgac tccaggaaag tttactagcg gttcagacaa taagtcaagt tccggggcca 4140 caggggggaa actaatcttt ttgattcttc aggctcgtct gttaatccta gattgtcttc 4200 cgattctttt ggttttgtgg gggagagtgg cggtgatgag atgtcccgca atgatgatcc 4260 agttgttact tcgaacggtt ccagtactga tgtcaatcgt ggaagccgaa tcacaagtga 4320 actaaagaag aaagaaagca atgcagtcag aggacagcca cgtggatcct gctctccaac 4380 caacaacgac gtttcggcgc ccggatacgc ctcacgttct cacccgacca ccgacccaag 4440 ccattccgta agacgaagtt cgcgaattcg acgagtaccg gacagatatg gttcatctgt 4500 ggagcaccct tcaacctcca ccggaactcc gcgaggatcg acatccacac agctcggtca 4560 agcctccact cgcaacggaa gtcgatatac agcttctggt cgaaacccct cttcccaagg 4620 gaatcgctat tcgtcaataa ggggggaagg tgttacggcg 4660 // ID MAR_CE repbase; DNA; CEL; 1239 BP. XX AC . XX DT 10-JUN-1999 (Rel. 4.05, Created) DT 10-JUN-1999 (Rel. 4.05, Last updated, Version 1) XX DE Mariner DNA transposon from C. elegans - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MAR_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1239 RA Jurka J.; RT "MAR_CE."; RL Direct Submission to Repbase Update (JUN-1999). XX DR [1] (Consensus) XX CC A very young family of mariners. XX SQ Sequence 1239 BP; 332 A; 312 C; 259 G; 313 T; 23 other; ttaggttgga ccggaagtct ttrtaactgg gtttgctttt ccaccgctcg ccgcagttgc 60 actttggtgt tccctcgttg ctagkgwata acaacgagwa aaggaacttc attgtaarma 120 acaccccatt cccttccctc tggcctgaat ttagttaacc aacgatgtat gtacctagtg 180 csatccacat ccgacatggt ttgatgtacc tcttcctctc tcataccacg atgaaggatt 240 cggaaagtct cctcaagaat gtttatgatt cacatgtccc ctactataat atcatacaca 300 attggttcca wcgcttcgaa aaggacgact tttctctcga agactctgag accccataaa 360 ttggatttgg atacgttgca gaaagccgtg gcaacggatc ctttccaaac cacccgtgag 420 ctgtcaacca cacttgggtc ctgtcaaacc aacatygtac gaggctkgga agctctygga 480 ataaagaaaa ttatgggtcg attcatacct cacaccttga cacaagccaa ccttgatttt 540 tgagtggatg attccttttc cctcctcatt ctccacgggg ccgatcgatg gttgggccga 600 ctcattactg gaaacgagaa gtgggtcctc tacgataaca accacargcg tgcycagtgg 660 attggagaag gagagacccc acaggatgcc gccaaacccg accttcaccc caaaaaaggc 720 atgctctcag tgtggtgggg agtgtacggc cccatctact gggaactgct acctgagggc 780 aaaataatta ctggggacat ctacgttatc taaccacgga acctgaagaa agtcwtcgat 840 cgttcacccc tgawggacaa gcaactctat tttcaacaaa acaacgctag tccccacgtc 900 tcgaaacaag tatwgyagga actggccags catggatgga tggttctcct tcacccaccg 960 tattctccag acattgctcc atctgaatac tggtcgtttt cagatatgac tcgtgtcctc 1020 gggggragaa ccttcaatac ccatgaaagt gtcaaagtgg cactcaagca ttatttcgac 1080 tcgcgcccgg aagggttctr taagcagggc attcataagt tgcccacacg ttggcagtat 1140 attgttgata atgatagcaa ataaaattta tcctgtcgtt tctcttgctt cgagaaataa 1200 acgattgaaa aagtyayaaa gacttccggt ycaacctaa 1239 // ID PAL3A_CE repbase; DNA; CEL; 150 BP. XX AC . XX DT 09-JUN-2000 (Rel. 5.05, Created) DT 09-JUN-2000 (Rel. 5.05, Last updated, Version 1) XX DE PAL3A_CE is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; PAL3A_CE; KW TIRs; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-150 RA Kapitonov V.V. and Jurka J.; RT "PAL3A_CE."; RL Direct Submission to Repbase Update (JUN-2000). XX DR [1] (Consensus) XX CC PAL3A_CE is a putative non-autonomous DNA transposon. CC It generates 3-bp duplications of target sites (TTA and CC TAA) upon integration in the genome. PAL3A_CE has perfect CC 16-bp terminal inverted repeats. CC There are ~100 copies of PAL3A_CE in the genome; CC they are ~98% identical to the consensus sequence. XX SQ Sequence 150 BP; 60 A; 20 C; 21 G; 49 T; 0 other; tggggttatt caagtagtgt cggaaaatta aaaagtgtag aaaaattacg tcacaactgt 60 attaaaatac ataaaaacat gtattttaat acatttgtga cgtcacaaat gtatttaaat 120 acattttgct acattacttg aataacccca 150 // ID CER2-I_CE repbase; DNA; CEL; 8558 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 14-JAN-2009 (Rel. 14.02, Last updated, Version 2) XX DE CER2-I_CE is an internal portion of the CER2_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER2_CE; CER2-LTR_CE; KW CER2-I_CE. XX NM CER2-I_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER2_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (31-MAR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 4096..4449 CC /note="Cer 2 Rnase region" CC gene 2349..2666 CC /note="Cer 2 Protease region" CC gene 4798..5337 CC /note="Cer 2 Integrase region" CC gene 3292..3807 CC /note="Cer 2 Reverse Transcriptase region" CC gene 1929..2153 CC /note="Cer 2 GAG region". XX SQ Sequence 8558 BP; 2390 A; 1714 C; 2177 G; 2277 T; 0 other; ggtagtcagc gagtgtcact gatttttgct aacgcgtatc agtgataatt acgatccacc 60 ggttagagaa tgcgattatt ttaaaaatta attttaattc ccttccgagt gtcttttcac 120 ttgtttccat tttctcctcg tttggatgtt ccggcagtgc gattatctgc agtagttgtg 180 ctgtatcaac gggagatcag gagccggcgg aatggattcg cgcacttgtg aatttttcgg 240 aggacactgg caagagagca gagcggatag gtcaaggtca ggtcagttta gtgaaaggtc 300 ggtggaagct gggtttggta cgctgcatcg tactggtaaa attgcccact aattttacat 360 tgaaagtgtg aatccacttt agaggttcag agcactccgt ggagttgctc gcgagcttgg 420 ttccatcctc ctttggaaac agctcagttg ggacaggcgg gtctgtcctg gagcttcccc 480 gaggaatgct ggtgcaggta ttccgggtca attttagcat atttttcaat ctttttcacc 540 ttactcgcga atcgcttaaa tttttcgtgc cagtttttgt aggaacctaa aaggagagtc 600 aattaatttc atcattacag caaacaaaaa acttcgcagc gatcaacttg acatgtctgg 660 acaacgtccg aatcaggatt tcgagcttca aacggaagtg gtcttcaatc attggcgttt 720 gggcactccg ctggctagga accatcttct cgagctgaag ggtgcggttg tggcagatct 780 gcacatcatt cgagctcaat tcgaacatga aaaggagcat tggttggccg agcgggctca 840 gatggccgag aaagtggaag ctactaagct gttgatggag agggcgacgg aactcgtcga 900 ctcactgaag gctcggatcg agattttgga aagtcaggta cgtcaaccgg aggcttcggc 960 cgttgatgga aatgggagag aggattcctc cgggacctct cttcgcggac tccgttccgt 1020 acaagttaat actggtggta gtggtgatgg agaaaatggg gctaccaggg ttgccaacct 1080 cgctgaggag gttgcagaaa atctcggaaa tgaagaactt ctcgtcgagt cgaacgcggc 1140 ttcggcatcg ccgccgagta acacgtctag tggggcgggt caaacattgc aggaggatgt 1200 tagaccagat gttactcaga gagttgtcct aagatcggaa agtaacaatt ccgatactat 1260 ggatgttccg ttctccgaaa cggtaaataa tgcaagtgca agttccaatg ggaatgtgag 1320 aagaccaagg atcaatatgc tgcggctgag gccggcagtg cagttgacgc catacagagg 1380 tactatggct gaggatttct cgtcgtttat tagatctttc aacgatcaat gtgaagcgag 1440 tgagaggatg gatgacgata ttcgcctgaa attcttcttg acatgtctcg tggatcggcc 1500 cgaaacgtgg ctgaagacga gatgaagctg aatccgacga ttaagctgaa agagttggtc 1560 gagaaaatga agtctacttt tgagaatcca gtgctgataa aacataagga gaaccagctc 1620 cgactttgca agcagaagaa ggacgagtcg gtcgaggcct tccatctcag aatcgaggag 1680 ttggagagag ctgcgagcac aaccgaatca gcggtttgga cgaagagaat cgtgttggac 1740 gcgtttctcc tggattgaac aagcacattc aggagaaagt gcgcttggaa tggcctgata 1800 cgatggtgaa ggctttggaa gtggccttga gagtcgaggc attggatgcg gagctgcaca 1860 agccgagtcg ggaagcagct gaggcattgg cgatctacaa gccgattcca aagaggaatg 1920 aacgattcaa aggtgagtgt ttcttttgta ataaagttgg ccacaaggcc gcagattgtt 1980 tcaagaaaaa cggtaagaag gataaaaagg gtaagttcgg tgagaagaag ccccaagatc 2040 ctgtcaatca tcacaagatc gagaagccgc aagggatggt gatttcgacg atggtggatg 2100 attcggagcc ctcatatgag gtactgaaga gtcaactcga agagatggct cggagacacg 2160 aattgcttca aagtggaatt cgtttcgagg atgagaagaa tggtgtacga agcctggcat 2220 ggccaaagat gactcaggtt agtgaaattg cggactccgt tccgtgccaa tattctcaaa 2280 gtctcgggaa ctcatttatt actgcccaca ttccaattcg tgcaaatgga attccatatg 2340 ccgctctcgt tgatactgga gcaaatatca cagttgcgtc tagaaatatc ttgaaagggc 2400 taggaattgt tagattgcaa gctgcagata atgataatgc agttggtttc ggaggaaacg 2460 aggtgagtat gatcggttca gcgatcgtac gttttcaaat cggtagccaa gtggtgaaac 2520 agagagttca tttcactaat ggtcactgta tgccggatat tgatggtagt taccaattta 2580 ttttcggaaa cgacttattg tctcgtttgc cgatttttat gttcgactat cagcagaaga 2640 tgtttcacgt tggagatgac gcgattccat tcggaagagc gccaggtcag aatgtgaagc 2700 ctagaaatta tcaggtgaag gtgagtgaag atacgatcat tccagcaggt actgaaaagt 2760 acgtcaagtg ttctatcgat gcatcattcg ggagcgaaaa ggacgtggtc ctgatgatcg 2820 attcgaaaat tggagaaaat gatttgttcg tttctccagc ggttgtcatg gcttcaaatg 2880 ctatgattct ggtgtcgaat ccaacggagg aggataagac tatttcgacg gatgttcatg 2940 ctgccgtcgc caatcggatt tcaactgatg gtaatgtttt atattgtttt tgatagtgag 3000 aatactcctt gtgttcaaac tgtcatcacg gactctgatc cggcctttaa tataaactta 3060 tctaaagtta aaatttccga ttctgaaaga gctaaattaa aagttcttat tgagaaatat 3120 catgatgttt tctcgaagaa tgcttatgat ttgggtagtt ccaagactga tccagttcat 3180 atttatacga ctacagaggt tccggtgaga ggacgagctt atcgtgttcc ggttaagtgt 3240 caagctgagc ttgagaaaca tattaattcg ctaattcaat cggagcgaat agtggagagt 3300 aatactccat ggacgagtcc aatagttctg gtgaagaaga agaatggctc gttaagagtg 3360 tgtttggatt ttcgaaagct gaatgaggta actattccag ataattaccc tttgcccagg 3420 atcgatgcta tcatcgagaa agtgggcggt tcgaggtatt tttccgtgtt ggatatggct 3480 aacggatatc tgcaattgag attagatgcc gagtcgtcat acaagtgtgg atttatcact 3540 gaaaataagg tttacgcgta cacgcattta ccttttggac tgaagtcagc agcttcatat 3600 tttcaacggg cgctgaggca ggttcttgat ggacttgagg atgtgatggt gtacatcgac 3660 gatgttctta tctacagtaa gacattcgaa gatcatatca ggacgctaga gatggttttg 3720 gaacgattcc ggaagttcaa tttgaaggca tcgccgaata aatgtgagtt cttcaaggaa 3780 tccattgtat tcttaggtca tgagatatcg agagacaatt attcgccgaa tcgtgtaaac 3840 gtcgagacga tcagatcgat gccgacgccg acgaacgtga atgaagttag aagattcgtc 3900 gggatgtctg ggttctttag aaagttcatt ccgaatttct cagagagggc tgagccgctg 3960 acaaggctta cgaagaagaa ccagaaattc gtatgggccg aggagcagca gaatgccgtc 4020 gacgagctct ctgaagctct tactaacaaa cctattctca cgtttccaga ttatgacaag 4080 ccgtttcata tttttactga tgcgagtgcc gttgctcagg gagcagcgct aatgcagaca 4140 gttggagaag acgaaaagga ttttgcagca atcgctttta ttagcagaac gttggcggac 4200 acggagacga gatggccagc cgtacatacc gagttaggag caattatatt cgcgctccgt 4260 caattccgtc cgtatgtttg tatgtcaaag ataattcttc attcagatca caagccgttg 4320 aggtatattt tggctaagag caaaatcaat gatcagattg ggagatggct cgttgagttg 4380 cagcagtacg atatttcgat cgtgcacatc gatggaaaga agaacatggt tgccggttgt 4440 ctatcgagag ctaaggatga gatcgcgccg ttgtcaggta ttgagatgga agatattatt 4500 gattttccag tctgcatgcc gattcgaaag aagaagagag cgagcgtggt gttcgtgtta 4560 caggcaaata agaatttgcg tctggatttg gtggaagaac aggataagga tccgattctt 4620 cgagctatca agaagttttt ggtcgagcca aaagcttcga tcgattgtgt tccgaaatct 4680 tggtgcgatg ttcttgaaca tgtcgagatt tcggagaaag gcgcgctctc cgtagcgtat 4740 cataactcat tccctaaaac tgttattcca gaacatttga ggaggtttat gtttgaagcc 4800 tttcattcga gtaagttaca gggaggtcat cataattgga agaagacgtt tcgaaaagcc 4860 agtgtaaggt atttttggcc ggatatgaaa tctgacattc tgagatggtg catggaatgt 4920 attccttgcc aacagaggtc gaagcctcac ccgtccactc gagagccgca gcagatcgtt 4980 gtcacttcga aacttttcga gaaagttgga gtcgatttgt gcggaccgtt gagatcaacg 5040 gctggaggac acaaatatta catgaatttg atttgttggt tcagtaagtt cgttgtttcg 5100 gttccactta ctgacgcgtc tacggataca gtggtgcgag ctatcctcac ggaggtagtg 5160 ttgaaatttg gaacgcccag cgaattggta tcggacagag catcgacatt tacgtctgaa 5220 gcttttagac agttctgtaa gttggtgagt attcagcagc atttggcgat tccttatcat 5280 agcaaaggca acggagcaac agagagaacg tttagaacgt ttcataacat gacgtcgaaa 5340 tacgtcaatg cagcccactc tgactgggat atactactac cttatctcac ttttagctat 5400 aacactgttg ttcatagtac tacaggagaa acgccgtttt tcctcgtgtt tgggagagac 5460 ccggttttcg cagtcgacag gattttggac ccgtcaccac cgaaagaagc tgggaagtca 5520 gatgtgaaga tttggaagga gcatctcgtc gagattcttc gtgaagcttg gaagaacaca 5580 gccgagattg cacttaaagc tcaactagca taccagaagc aggccaatca aggagcgaaa 5640 ggttctgaaa ttagaccagg agatagggtg atgtttaaaa attttaagtc aaaaattaat 5700 ctgtctagaa agctagtgaa accatggatt ggagattatc gagtcttaga ggttaatcat 5760 ccaaaagcgt taattttaga tttagatcat ccaggaaaag agccgagaga agttcatctc 5820 gatcagatta aaaagtttta tctttccgag aacgataacg acgaagagga cgccgcagtg 5880 gatgaagaac aagttccaac cgacgtcatc agccagccga tcgcagaggt gacgcaggca 5940 gtcgaagaaa gttcgaagaa agctgaaaaa cacgaagttg aagctgaaaa gaaggacaaa 6000 cccgtggagg tggcgaaaaa cagaagaaat ccgccacgtg caaagaagat tccagtgagg 6060 tttgccgaaa aataatcgat ttttcctaga aaatcggttt gccgtctctg tacggtgtct 6120 aacgttcatt tatttgtaat tttcagcgaa aatgagttta atttcgagtc agaaagtgat 6180 ttttaaaata atcgctacag gacccgtgga ttaagttaag aggctgtgcc tttaaacgga 6240 ttacggtacc atttgggtct cgcctcgaac catttgggtc tcgcttggaa cgcgggcccg 6300 catttttggg acacgtaaac aaggttgttt acgccacatt ttgggcgaat ttttcgattt 6360 ttcggcctaa atttttgcgt ttttgacgaa tttttactga tttttcgccg atttcgtgca 6420 aatttcttaa attctttcaa atttcacact ttttcttttg gttttagtgt ttcttttctc 6480 gagttttcaa tttgcactta aaaggcctaa aataattaaa ctttcgcagt tttacgtgat 6540 ttttgttttg ggtttacgtg tttaaaattt tgcaatcgtg gcaagatttc tttgttaaca 6600 gttaatcagt ttggtttttc agaagctaca aagcagcaaa tgtcaagcag tcccgtcacc 6660 atggcggatg agaactcgaa cccgagttcg aaaatcgatg gggcagcccc caggagccgg 6720 tcgccgcacc ggcgacaacc acaccactct cggagcagca gcaacaacat cagcagcagc 6780 ctccacagtc ggttgaggag aagaaggaga agccgaagcc agaggaagca gctccagtgg 6840 cagcagcagc aactccgatt gcggaaggaa cagcagaacc agtcgagaag aaggacgagc 6900 tcatgccaga agctccagct ccaacgcagc aagctgaagg atttggaatc aattcccttt 6960 ttgacggtct ccgtaccggt tctctctccc ccccttccac attcgtccaa ttcagagtat 7020 gaattcgcgt acgatcctag tttgaatcca agtgattcac atcctatggg aagtaatacg 7080 gtctctgaac cggcaaatgt tccattccaa aatcttttgc aaccaggttt agagggaatg 7140 gaagttgtcc agcaagaaga ggatatggat attgatgaag agttacttct cggaccagtc 7200 gcagaggagg aaccgagagg ttacgttcca cctgttcgtg aacgtcatcg gttttggcag 7260 caaggcactt gagaggacaa ggtatgttaa atgaagattt aattaaccct acgctctccg 7320 cagcggtcaa acttccttat acttctattc caaaagagat gactattctc cgagttttca 7380 tgaacattca tgatccacag ttagttattg gaactgcggc tgcaatcaag ttggctttta 7440 gtcaagcacc tttcttggag atcaaagatc gctttgatat tactttcatt gatgaggcct 7500 cacagttagc tctctacgta ttgggatcgc tagctacgat gcttccgaag tctcgcatga 7560 ttcttgtcgg agatatgcat caacttccgc catacatgga ggaagcgctt ccagctgagc 7620 tgaagcgagc ggcgattgga gagccactta cacttgcagt taagggaaga agatggccgt 7680 caatgcatct tactcgcgtc catcgatgcc cgaagatgat tactgaggtc ctcggagatc 7740 tgttttatgg aaatactctg acttcttcga agccaggcgt caccgacatt ccagtgctga 7800 aggcaatggg actcccgagc cgacatccaa tggtcttcgt caactacacg tcaccgcaga 7860 cagctgtcgg aaagagtttc tcgaacgaag gagaagctag atatgcgctc caactggtcg 7920 aagcactcac tcgatacgcg agcacggcga ataagaagat caccgccgcc attctaaact 7980 tctatggagc tcaatactcc tacgtgtact ccatggccga agacgaagtg accgtcaaca 8040 ccatcgacgg atgtcaagga caggaatatg acgtcacaat cgtgcttctc acaagatccg 8100 atccttatga gcgatcaaaa ttcctggtca atgccaacag gatcaacgtc gcgctgagcc 8160 gcccgaagat cgccacagtc atcattggtc aacgccacct gacaggaaat caaccggacc 8220 caagacagcc gaaacgcgga agacatcagc gagtttgcaa ctgggcccgt ctcatcgaga 8280 agcttccaaa ggaatgtttc gtcgacgcca aagatcaagt catcgctcag gtagagtgaa 8340 ggtgaagcag gttaaacctt agggtaaagt ttttattttc attttcttta ctctaggtaa 8400 caggttttct atagccgctt ttaaaactct ctactcaaaa acttttcctt ttattttttt 8460 ttaggaaaaa ctaacaggtt ccacaatacc tttgagaatc aaaagatttt caaagtcttg 8520 agctagaaca gggacgttct ttaaaaaaag gggggagg 8558 // ID TIR54TA1_CE repbase; DNA; CEL; 196 BP. XX AC . XX DT 25-AUG-1999 (Rel. 4.07, Created) DT 25-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE TIR54TA1_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW TA target site; TIR54TA1_CE; nonautonomous DNA transposon; KW palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-196 RA Kapitonov V.V. and Jurka J.; RT "TIR54TA1_CE."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [1] (Consensus) XX CC TIR54TA1_CE is a putative non-autonomous DNA transposon; it CC has almost perfect 54 bp-long terminal inverted repeats and CC generates duplication of TA target site upon its integration CC in the genome. Presumably, multiple transpositions of TIR54TA1_CE CC were induced by Mariner/Tc-like transposase. CC TIR54TA1_CE copies are ~90% identical with the consensus CC sequence. There are ~100 copies of TIR54TA1_CE in the CC C.elegans genome and they form several minor subfamilies. XX SQ Sequence 196 BP; 67 A; 39 C; 33 G; 57 T; 0 other; ttatcgaaaa aatacgatcg cacaattttc tcataattta tttttgatct accttgttga 60 ctaggctcct ccccttcctt cgacaagaca gccacaacaa aataacaacg gcaacaagat 120 tgggggcgga gtcaagttca acaggtagat caaaaataaa ttatgagaaa tttgtgcgat 180 cgtatttttt cgataa 196 // ID MSAT1_CE repbase; DNA; CEL; 226 BP. XX AC . XX DT 21-OCT-1999 (Rel. 4.09, Created) DT 21-OCT-1999 (Rel. 4.09, Last updated, Version 1) XX DE MSAT1_CE repetitive element - a consensus. XX KW SAT; Satellite; Simple Repeat; MSAT1_CE; satellite-like repeat. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-226 RA Kapitonov V.V. and Jurka J.; RT "MSAT1_CE."; RL Direct Submission to Repbase Update (OCT-1999). XX DR [1] (Consensus) XX CC MSAT1_CE is represented several hundred copies that form CC dispersed clusters composed from multiple, tandemly repeated, CC copies of MSAT1_CE. Only few copies of MSAT1_CE are dispersed CC in the genome as a single units. CC MSAT1_CE copies are ~95% identical to the consensus CC sequence. There is a second minor family of repeats related CC to MSAT1_CE (~ 70% identity) which is ~2 times older than CC MSAT1_CE. XX SQ Sequence 226 BP; 79 A; 36 C; 28 G; 83 T; 0 other; actttttttg aaatgtcttt cagaaacttt gtagtaaatt ttaagctctt ttagaattta 60 ccaacaatat tccagtaggt acaagaagct tcacgtagtt acagaaatag tacattttca 120 gccctacctt ttagtgcgta ttttattatt caatgaaaac taccatttat aggcaaaaat 180 agatggattt tccaaacttt gaaaattcat aaatctcttc aaagta 226 // ID HELITRONY2_CE repbase; DNA; CEL; 247 BP. XX AC . XX DT 25-MAY-2001 (Rel. 6.04, Created) DT 25-MAY-2001 (Rel. 6.04, Last updated, Version 1) XX DE HELITRONY2_CE is a family of non-autonomous Helitron DNA DE transposons - a consensus. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW CEREP2A; HELICOPY2; HELITRON class; HELITRONY2_CE; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1 RA Kapitonov V.V. and Jurka J.; RT "HELITRONY2_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX RN [2] RP 1-247 RA Kapitonov V.V. and Jurka J.; RT "HELITRONY2_CE."; RL Direct Submission to Repbase Update (MAY-2000). XX RN [3] RP 1-247 RA Kapitonov V.V. and Jurka J.; RT "Direct submission."; RL Direct Submission to Repbase Update (MAY-2001). XX RN [4] RA Kapitonov V.V. and Jurka J.; RT "Rolling-circle transposons in eukaryotes."; RL Proc Natl Acad Sci USA 98(15), 8714-8719 (2001). XX DR [3] (Consensus) XX CC There are about 200 copies of HELITRONY2_CE in the C.elegans CC genome. CC They are ~95% identical to the consensus sequence. The original CC name, CEREP2A [1], and orientation have been changed because of a CC classification of this repeat as a non-autonomous HELITRON-like CC DNA CC transposon [2]. XX SQ Sequence 247 BP; 53 A; 35 C; 79 G; 78 T; 2 other; tcatttacta tatataaagc gcttattcyg tktgtccata gtttgtagtc tatgtagtct 60 ttgtagtctg tgaagttttg gcttctggag ggatagtgag ttggggttag tgtagggata 120 tagtcggggt actgtagtgg tacaatagtg gtacggtagg aatactgtag ggttacggta 180 gtttcagaaa aattagtttt cagccccaga agtcgggggc cgcgccggag gtgcggtcca 240 cggctgg 247 // ID TC5 repbase; DNA; CEL; 3171 BP. XX AC U12433; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE Caenorhabditis elegans mut-2 mutator strain transposon Tc5 DE putative transposase gene, complete cds. XX KW Mariner/Tc1; DNA transposon; Transposable Element; CEU12433; TC5. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3171 RA Olsen P., Andrews S. and Collins J.J.; RT "The Caenorhabditis elegans transposon Tc5 encodes a protein RT similar to the Tc4v-encoded protein."; RL Unpublished. XX RN [2] RP 1-3171 RA Collins J.J.; RT "TC5."; RL Direct Submission to Genbank (19-JUL-1994)John J. Collins, RL University of New Hampshire, Biochemistry and Molecular Biology, RL Spaulding Life Sciences Bldg., Durham, NH 03824, USA. XX DR GenBank; U12433; Positions 1 3171. XX SQ Sequence 3171 BP; 966 A; 610 C; 621 G; 974 T; 0 other; caagggaagg ttctgaactc gttatcggac ttcgttacgc cactatatac attcgataga 60 ggatagttac agatgatccc ttcaaaaaat ttagctgctt cagagcaggt ttggccaagt 120 tgtgacgtct tgaattttgg tgctgaaatt cctcatatca agtgatattt caatgactac 180 cacgctgcag aaacaccagt gaactcacca ctctcaatta gcgttagcaa acatggcttg 240 gtggccgagt ggtagtggcg tgagtttcga ggtgtggtat tcgtggttcg gttccccgtc 300 aacataaact ttttttttta atttttaaag tcaatccatt tccaattaga acacatctat 360 aaactttttc aagtgggaaa atgtgcagat attatcccta tgaatcaaat gcgtcaattc 420 tccaaatttt tccgattttt tttttcaata tgtgttatag ttaaaagcac aataaaacag 480 atgtttaaag tacatacatt aaacattaaa ttttcattaa attttcaaat aatatcatcg 540 tggttaaaaa tgtaggccac aagaagagct gttaggtccc accacgcttc acacctttct 600 tgtagttttt tttttgttat tttctgttga ctcgtcttcc gttgtctata ttttaactga 660 aaatgccctt ccgcccacaa gtaatcatcg gagaacttat gaaaacgttt ggaactaata 720 caaacgcgtt gccaatgagt cgagaagaaa cgaaaactgt cgagaaattt acaaggattc 780 tcaaagatgc tgaaacggac gatcttctta ttcaaagtga cgacgaagaa gaagtattcg 840 gaggaattgt tgatgaagag gactggaaac ctgatgacga tgatccatcc gcttgcgtag 900 tacccgataa agtgaacttc tcttctggag ctgccattga tgttgcaatg gtaagtgttt 960 agaatttatc aattactcaa cattcgctga taaataattg gctaataaat aaaatttgaa 1020 atacattact ttttacaggt gcatagtgcc gttgaattta tgactgatgt cagaacaaag 1080 aaactacgat cttttgcttc aatgcagcgt aggtatcgtt ttattaaaac gcaacatgac 1140 atgcagaaac ttcgcgtttt tgctaaaaat agtgagtatt aacagcttca tattcggtat 1200 aaaactgggt ttttaagacg aaattcaatg ctcacgtgtt tcacaatttt cgacactttc 1260 tggacttctt cgtacaaaag tttttgaggc aatcgatgac agtgagtatt cctattattg 1320 aaaaactact gtgtttgcac gacaagtagt gcattctttg tcagacttaa aacacatctt 1380 gaaggaattt cgattgacaa gttcacgcta cgccgtcttg cagtgcaatt gaatgatgag 1440 cacgtccata ttgaaggatt tcaagcaagc gatggctggc tgaagaagtg gaaaaagaca 1500 aacggtctcg tttctcgcca cgtaactact ttcatcactc gtgccaatta cgtcaataaa 1560 gagctcacag aacaagctgc caaaaagttc gtggaggaag ttaaagcaga attggcaact 1620 ttggatcctg atgtcgttta taactgtgac caaagtgggt tcacgaaaga acaatattgc 1680 aaacggtaaa ttctaaaccg agtttttcaa agattattaa aatttttagg acgctcgcac 1740 caaaaggtgt taaacgtgtt gaaagactgg tacagtccaa agatgccctc acgcactctt 1800 acacaatcct tcccatgtta agcgcttccg gaaagttagc cccaatgttg tacgtggttc 1860 tgcaggtatg tttgacaata tgcacaacat tgccacacag tcttgtgact atcgttttac 1920 attatgcaac tttattaaat tgtaggagaa aggtggaaaa tttcccaaaa aagggcactt 1980 ctcaccagac aatctgatca tccgagctaa tacgtcccac attatgaata aacaactaat 2040 ggtcgactgg gttgaatccg ctgtttgtga tccttcgatg ccaaccgagg ttgtcctgct 2100 tctagacgct tggcctgctt ggaaaaacga aggggatgtt caagctgcag cattatccgg 2160 aaatacagta catgtgagat ctattccacc aggagctaca tcatttattc aaccttgcga 2220 tctttacttt ttctgtccgt tgaagaattt tgtcaaaaag gtgaacgcgt acatcatcta 2280 ctccggtatc accttcaaga cgtcagagcg tgacaacctg cttcgcgtga tatctgcagt 2340 gtaccgtgtc tttcgtgcac caattttcca atcatgctgg aagtacggct ggatccaagg 2400 aggatacata gatgaccaac atgtcaaagt ggaaactcca tccaaatttt gtttcaaagt 2460 ttctggatac tgttcgcaaa agaaaacgag agatacgatg tgtcaagata cggcttttct 2520 tctttgccca tactgtaaga aggttttatg ctttaaccac tgggttggat gcggcttccc 2580 agctcataag tgtaagtgtt aaaagccatt gttgagtata ttatatgttg cttttgtttt 2640 tttttttaat attggcatcg ttcgtttgtt ttttacataa actttaaaca tctgttttat 2700 tgtgctttta actataacac atattgaaaa aaaaaatcgg aaaaatttgg agaattgacg 2760 catttgattc atagggataa tatctgcaca ttttcccact tgaaaaagtt tatagatgtg 2820 ttctaattgg aaatggattg actttaaaaa ttaaaaaaaa aagtttatgt tgacggggaa 2880 ccgaaccacg aataccacac ctcgaaactc acgccactac cactcggcca ccaagccatg 2940 tttgctaacg ctaattgaga gtggtgagtt cactggtgtt tctgcagcgt ggtagtcatt 3000 gaaatatcac ttgatatgag gaatttcagc accaaaattc aagacgtcac aacttggcca 3060 aacctgctct gaagcagcta aattttttga agggatcatc tgtaactatc ctctatcgaa 3120 tgtatatagt ggcgtaacga agtccgataa cgagttcaga accttccctt g 3171 // ID LTR2_CE repbase; DNA; CEL; 434 BP. XX AC . XX DT 10-AUG-2000 (Rel. 5.07, Created) DT 10-AUG-2000 (Rel. 5.07, Last updated, Version 1) XX DE LTR2_CE is a putative long terminal repeat from an unknown DE retrovirus - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR2_CE; KW Long terminal repeat; retroelement; Retrovirus. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-434 RA Kapitonov V.V. and Jurka J.; RT "LTR2_CE."; RL Direct Submission to Repbase Update (AUG-2000). XX DR [1] (Consensus) XX CC LTR2_CE is a long terminal repeat from an unknown CC retrovirus. There are only a few copies of the repeat in the CC C. elegans genome; they are ~95% identical to the consensus CC sequence; 5-bp target site duplications. CC LTR2_CE is 67% identical with LTR1_CE. Therefore, both LTRs CC are fossils of related retroviruses. XX SQ Sequence 434 BP; 116 A; 80 C; 104 G; 134 T; 0 other; tgtttcggat cggtaaaggt tcccagtggt agaattatat aaatatttat tactaaaaac 60 cttttcataa ccacattaaa gtgctgaccg acagcatttc ggtgagagca gaagtagcac 120 gaggacgaag tccattcctt gcagccaatg cgcgtgccgc ccaggtggag gagattacgg 180 tagcagtcgt ttcacgtgtg tcgtagtgtc ggcgtggcga gaccaattgg cggtagttca 240 aatagggatt tccattgttg ctctttcgcg gggttgtcgt cgttgggttg catagataaa 300 gttgagcatt gactagcggt tatttcgtaa taataatata taatatactt gcatttcttt 360 gtagttcact aactctttac atttaagaat aagttgcgag ctgacgaata ttcatattct 420 cagcattcgc aaca 434 // ID CELE7 repbase; DNA; CEL; 363 BP. XX AC . XX DT 17-SEP-1999 (Rel. 4.08, Created) DT 17-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE CELE7 nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE7; MITE; KW nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 42-320 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification and characterization of putative transposable DNA RT elements in solanaceous plants and Caenorhabditis elegans."; RL Proc. Natl. Acad. Sci. U.S.A 92(19), 8886-8890 (1995). XX RN [2] RP 1-363 RA Kapitonov V.V. and Jurka J.; RT "CELE7."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [2] (Consensus) XX CC CELE7 is present by several hundred copies dispersed in the CC C.elegans genome [1]. CC CELE7 is flanked by 170 bp-long terminal inverted repeats and CC it generates 3 bp-long target-site duplications [2]. CELE7 copies CC are ~80% identical to the consensus sequence. XX SQ Sequence 363 BP; 126 A; 52 C; 54 G; 131 T; 0 other; tttcggaaat ctacaaaatg acctattgca ccatgttgtt ccaacatttt tctattaaaa 60 attagtgtca aaatatagaa aatcacttga tttatctgaa aataacattt atttcagctg 120 aaacatttga gaaagtgctc aaaaatggcc tgaaagttgc caattgccga ttatcgaatt 180 atcgattttt ttcggcaatt ggcaactttc aggccaattt ttgagcactt tttcaaattt 240 ttcagctgaa ataaatgtta ttttcagata aatcaagtga ttttctatat tttgacacta 300 atttttaatg aaaaatgttg gaaaaacatg gtgcaatagg tcattttgta gaattgccga 360 aaa 363 // ID Tc3 repbase; DNA; CEL; 2337 BP. XX AC . XX DT 03-APR-2003 (Rel. 8.03, Created) DT 28-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE An active DNA transposon - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Tc3; KW mariner/Tc1 superfamily; transposase. XX NM Tc3. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2337 RA Collins J., Forbes E. and Anderson P.; RT "The Tc3 family of transposable genetic elements in RT Caenorhabditis elegans."; RL Genetics 121(1), 47-55 (1989). XX RN [2] RP 1-2337 RA Kapitonov V.V. and Jurka J.; RL Direct Submission to Repbase Update (MAR-2003). XX DR [2] (Consensus) XX CC Tc3 is an active mariner/Tc1-like DNA transposon [1]. There are CC 17 copies of Tc3 in the sequenced C. elegans genome. They are CC ~99.84% identical to the consensus sequence. Tc3 copies are CC flanked CC by the TA target site duplications. The preferential target site CC is aTAt [2]. Tc3 has 467-bp terminal inverted repeats. CC Tc3 encodes a 329-aa Tc3p transposase (2 exons, positions CC 720-1136 and 1185-1757). XX FH Key Location/Qualifiers FT CDS join(720..1136,1185..1754) FT /product="Tc3_1p" FT /note="transposase" FT /translation="MPRGSALSDTERAQLDVMKLLNVSLHEMSRKISRSRH FT CIRVYLKDPVSYGTSKRAPRRKALSVRDERNVIRAASNSCKTARDIRNELQ FT LSASKRTILNVIKRSGVIVRQKLRPAPLLSADHKLKRLEFAKNNMGTNWSK FT VVFSDEKKFNLDGPDGCRYYWRDLRKEPMVFSRRNFGGGTVMVWGAFTEKK FT KLEIQFVSSKMNSTDYQNVLELELSKYLRHYSRKDFRFQQDNATIHVSNST FT RDYFKLKKINLLDWPARSPDLNPIENLWGILVRIVYAQNKTYPTVASLKQG FT ILDAWKSIPDNQLKSLVRSMEDRLFEIIRTQGNPINY" XX SQ Sequence 2337 BP; 700 A; 445 C; 490 G; 702 T; 0 other; cagtgtggga aagttctata ggaccccccc taatttgaag gtttgaggaa cttccgaaaa 60 ttttttcgaa aaactgctaa tgccgttcgt ttttaaattg aaaaaaacct atatacattt 120 ttttccagaa gtttatctca aaaactgagg tcgcgctgga aaaaacgtca aaatccagtg 180 tgaaacttct ataggacccc ccgttttttt tcacgatttt tactaaaatc aacagatttt 240 ggaatttttg acaaagctca aatcaagttt gagttagaaa tgagttcaga taagcagttt 300 tgactttaaa aaattaatac gaaatgttct cgtgggatct ccagactggt tctgattctt 360 ccgatctttg atgttcaagt ctgtttcaag cttcctggtg ctctcggtaa tgccaaaact 420 tgataaactc tctttaacaa gttcctacta aaattcctag cacgcacttt agatgtttcg 480 actgtgtagt caagctgatt tggcaaaata tgcagcagga aacaatggaa ggcttatcag 540 gaatcaaatc gtttttcttt gattacaagg ttccatggga ccaatatttc aagttaaatt 600 gtccctcaca gatgttattt actatttttt gcgtgaatta ttaaatgtgg aattgtggca 660 tgtgttgtgg cacacatata gaggctggaa agcttacttc gaaagcagtc taacttgcaa 720 tgcctcgagg atctgccctt tcggacactg aacgcgctca gctggatgtt atgaaattgc 780 tcaatgtgtc cctgcatgaa atgagtagga aaatttcccg ttctcgacac tgtattcgcg 840 tgtatctgaa ggatccggtg agctacggta catctaaaag agctcctcgt cgcaaagctc 900 tctccgtgcg tgacgaacga aatgtgattc gtgctgcctc caactcctgt aagacggcaa 960 gagatattcg caatgagctt caattgtctg cttcaaaaag gaccatcctc aatgtcatca 1020 aacgatctgg tgtaatcgtt cgtcagaaac ttcgccctgc tccgttactc tctgcagacc 1080 ataaactcaa gcgattggaa tttgctaaga acaatatggg aacgaattgg agtaaagtga 1140 gaatttaaaa aagcaagagt gaataattag gatcattgtt ttaggttgtc ttctccgatg 1200 aaaagaaatt caatctcgat gggcctgacg gttgccgcta ctattggcgc gatttgcgca 1260 aggaaccaat ggttttttcg agacgtaatt ttggaggagg aacggtgatg gtttggggag 1320 cgttcacgga gaagaagaag cttgagatac agttcgtcag tagcaagatg aacagcactg 1380 actatcagaa cgtcttggaa ctggagctct ccaaatatct tcgtcactac tccagaaaag 1440 actttagatt tcagcaggat aatgcgacaa tccatgtgag caactcaacc cgcgactatt 1500 tcaagctcaa gaagatcaac cttcttgatt ggccagctcg aagtcctgat ctcaatccaa 1560 tcgaaaattt gtgggggatt cttgtccgta tcgtgtatgc tcagaacaag acttacccaa 1620 cagttgcatc gttgaagcaa ggaattctcg acgcttggaa gtctattccg gacaaccagc 1680 tgaaaagttt ggtcagatca atggaggaca gactgtttga gatcatccgc acacaaggaa 1740 acccgattaa ctattgatcc tttcttgatt ttagtatatg aatgttctgt tgttgatcaa 1800 aaataactgc aacttgttaa tacgctgttt ctgactggtt tcttggggat ggcgtaaaaa 1860 tgtttatggt gtgtgtgcta ggaattttag taggaacttg ttaaagagag tttatcaagt 1920 tttggcatta ccgagagcac caggaagctt gaaacagact tgaacatcaa agatcggaag 1980 aatcagaacc agtctggaga tcccacgaga acatttcgta ttaatttttt aaagtcaaaa 2040 ctgcttatct gaactcattt ctaactcaaa cttgatttga gctttgtcaa aaattccaaa 2100 atctgttgat tttagtaaaa atcgtgaaaa aaaacggggg gtcctataga agtttcacac 2160 tggattttga cgttttttcc agcgcgacct cagtttttga gataaacttc tggaaaaaaa 2220 tgtatatagg tttttttcaa tttaaaaacg aatggcatta gcagtttttc aaaaaaattt 2280 tcggaagttc ctcaaacctt caaattaggg ggggtcctat agaactttcc cacactg 2337 // ID CER10-LTR_CE repbase; DNA; CEL; 648 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER10-LTR_CE is a long terminal repeat of the CER10_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER10-I_CE; KW CER10-LTR_CE; CER10_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER10_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 648 BP; 160 A; 167 C; 108 G; 213 T; 0 other; tgttgtcgat tccccgtaat ccctcaggaa tctctcaacc taaaggactt ttaggactga 60 gagcttggct ccctcactca ccagctccca gtcgtccttt atttcccaat ttagctatgt 120 gtgttttccc ccaaatccat tattcaaaaa gccgccattc tgtccagatg caaacgcgcc 180 ccactaaacg ccgtgcagcg ttatggcgga gtatcgtttg cgcggtacag atgattcggt 240 ttttatagta aagcgataca gattaactct tccttctact ttttccgaaa tatatatttt 300 ttaatgagtt tttccatatt ttctcattcc ataaattcgt gtggacacag atccacacga 360 tccggtcctt ttttgcccgt ttgcacgtct tcaagtccaa ttccaacgtc aatccctaaa 420 tttgggccta aaatataatt tattttcata ctttctttat tatattcaat ccacccaaaa 480 caacgggaat ggataaataa acttgaattc gtgtaataat tgtgttctgt tattgaatcc 540 tcacaacatg cttatccgcg cgtccttcga cattatgctc cgagtccatc cgcgcgaatt 600 ggccgtcagt gggtttccga ctcccttcga ttagctgacg tgtcaaca 648 // ID CER11-LTR_CE repbase; DNA; CEL; 559 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER11-LTR_CE is a long terminal repeat of the CER11_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER11-I_CE; KW CER11-LTR_CE; CER11_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-559 RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RP 1-559 RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RP 1-559 RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER11_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 559 BP; 196 A; 85 C; 74 G; 204 T; 0 other; tgtttgtcac tcggtagggt gactgactcc cctctactag tcccgagtgt caacaataac 60 taatatcgaa gactttttga agagatagag gtaattgttt tatgatattt tcaaaacttc 120 atgttccatc atcccgccag tacaaaattt catattgaaa ccatatagta ctaacaagaa 180 tttttatatt tattcaaatt taatattata gacgtaaaaa ctaattttgg aaaaaaaact 240 agaaaaaatg atgttcattc ttgtggaata cattttaaga acttcacaat tactttccag 300 tttgcttgaa atattttaaa atagaaaagt tcagaatagc ttttgttctg aacatatcga 360 gacattactt tcaaaagtca caagttggtt tcaaaatgtt tttcgagacc actagcctag 420 acatttagta tattaatttt ttatgtcgct tggtataact ttaaattaat tcactttgag 480 cattcataat gaaatacgca tattagttag ttgacacaat attttataag tttcatccat 540 tatttattct agcaaaaca 559 // ID PAL8D_CE repbase; DNA; CEL; 410 BP. XX AC . XX DT 12-JUN-2001 (Rel. 6.05, Created) DT 12-JUN-2001 (Rel. 6.05, Last updated, Version 1) XX DE PAL8D_CE is a non-autonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8-bp TSD; PAL8D_CE; TIRs; nonautonomous DNA transposon; KW putative HAT superfamily. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-410 RA Kapitonov V.V.; RL Direct Submission to Repbase Update (JUN-2001). XX DR [1] (Consensus) XX CC PAL8D_CE is a non-autonomous DNA transposon. CC It has generated 8-bp duplications of target sites upon CC integration in the genome. PAL8D_CE has nearly perfect 150-bp CC terminal inverted repeats. CC The length of the target site duplications indicates that CC a transposase involved in proliferation of PAL8D_CE CC might belong to the HAT superfamily. CC There are ~50 copies of PAL8D_CE in the genome; CC they are ~98% identical to the consensus sequence. XX SQ Sequence 410 BP; 141 A; 62 C; 63 G; 144 T; 0 other; cagaggtggg cggatatttt tttcggatat tttctaataa tgaatttatc aataagtatt 60 gatgtagtta ttctggaaga aatgtatccg aacaaaagag taactattaa gcaacatttt 120 aatatgataa aacctagaaa aattctacaa gttttctgta taaacgatat gcaagtccat 180 tatccgatat ccgtaatatc cgtaatatcc gatatccgat gttgctcggc taataagttt 240 taaggtctcg gtaaggaaaa cttgtagaat ttttctaggt tttatcatat taaaatgttg 300 cttaatagtt actctttggt tcggatacat ttcttccaga ataactacat caatacttat 360 tgataaattc attattagaa aatatccgaa aaaatatccg cccacctctg 410 // ID TIR9TA1A_CE repbase; DNA; CEL; 337 BP. XX AC . XX DT 09-JUN-2000 (Rel. 5.05, Created) DT 09-JUN-2000 (Rel. 5.05, Last updated, Version 1) XX DE TIR8TA1A_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW TA target site; TIR9TA1A_CE; TIR9TA1_CE; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-337 RA Kapitonov V.V. and Jurka J.; RT "TIR9TA1A_CE."; RL Direct Submission to Repbase Update (JUN-2000). XX DR [1] (Consensus) XX CC TIR9TA1A_CE is a subfamily of TIR9TA1_CE. CC TIR9TA1A_CE is a putative non-autonomous DNA transposon; it CC has perfect 9-bp terminal inverted repeats and putatively CC generates duplications of TA target sites upon its integration CC in the genome. CC There are ~10 copies of TIR9TA1A_CE in the CC C.elegans genome; they are ~97% identical to the TIR9TA1A_CE CC consensus sequence. TIR9TA1A_CE is probably the youngest CC subfamily of the TIR9TA1_CE family. XX SQ Sequence 337 BP; 62 A; 126 C; 92 G; 55 T; 2 other; tagggctccc atgaggtcgc cgcaaaggcg cctccgccgg cctcagctgt cgccgcgccg 60 gcctcacggc gtcatccacg ccgcatgtat agtgaggygc gtaaccgcga acgtgtcggc 120 cgcttccgaa caaccacctc tycacactac gttgcacaca caccaagcta ctcatttcac 180 gctaagctgc ggaaccccga acgtgtcggc cgcttcaaat aactacctct cgcacttcat 240 ttcacacaca cagcggcgcg cggcgtcggc gcgaggccgg ccggcgtgag gccggcgtga 300 ggcccgcatt tcgcgcctca ctcagctggg agcccta 337 // ID PAL8B_CE repbase; DNA; CEL; 386 BP. XX AC . XX DT 08-SEP-1999 (Rel. 4.08, Created) DT 08-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE PAL8B_CE is a putative non-autonomous DNA transposon - a DE consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp-long target site duplication; hAT superfamily; PAL8B_CE; KW TIRs; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-386 RA Kapitonov V.V. and Jurka J.; RT "PAL8B_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC PAL8B_CE is a putative non-autonomous DNA transposon; it CC has perfect 49 bp-long terminal inverted repeat and CC generates duplications of 8 bp-long target sites upon its CC integration in the genome. The PAL8B_CE nucleotide sequence CC forms a palindrome-like structure with a 140 bp-long stem. CC Presumably, multiple transpositions of PAL8B_CE were induced CC by HAT-like transposase. There are about 10-20 copies of CC PAL8B_CE in the genome and they are ~90% identical to the CC consensus sequence. PAL8B_CE and PAL8A_CE share common 13 CC bp-long termini (they start from the third nucleotide). CC Presumably, PAL8B_CE is an ancestor of young PAL8A_CE. XX SQ Sequence 386 BP; 118 A; 77 C; 71 G; 117 T; 3 other; ctgggctgtg cggctctttt ctcggcgcat ctgaaaattc cattttttca gtgttttagc 60 aagaaagagt ttgcgaatga ttatactagt tgaatagctg gagcaataag cttatcatta 120 aaaaaataaa ayttgaaaaa aagtgtacac aacattccat accattcgca atgcgccgag 180 agccgaatga ttcggctctc ggctctcggc tyattggttc gaggcgtata tctctacgct 240 ataaccactt tttttcaaat ttttttttaa tgataagctt attgctccag ctattyaact 300 agtataatca ttcgaaaact ctttcttgct aaaacacgaa aaaatggaat tttcagatgc 360 gccgagaaaa gagccgcaca gcccag 386 // ID CERP1 repbase; DNA; CEL; 222 BP. XX AC M14334; M37219; X61244; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE C.elegans repetitive DNA. XX KW CERP1; Repetitive DNA. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-222 RA La Volpe A.; RT "CERP1."; RL Direct Submission to Genbank (16-AUG-1991)A. La Volpe, CNR RL International Institute of Genetics and Biophysic, Via Marconi RL 10, 80125 Naples, ITALY. XX RN [2] RP 1-222 RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and La Volpe A.; RT "Molecular evolution of clusters of satellite-like DNA sequence RT in Caenorhabditis elegans."; RL Unpublished. XX RN [3] RP 1-222 RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and Volpe A.; RT "Molecular and genomic organization of clusters of repetitive DNA RT sequences in Caenorhabditis elegans."; RL J. Mol. Biol 226(1), 159-168 (1992). XX DR GenBank; X61244; Positions 233 454. XX SQ Sequence 222 BP; 76 A; 67 C; 19 G; 60 T; 0 other; tttcaaaaat tcacagacag tatgaaaact ttctgttact ttttgaccaa gacagtaacc 60 ttacaatacc actacagtac cttgacatta tcctccaccg actcctaacc caatatctct 120 tcaaaggaca aaaactcaaa ttttccaaaa ctacagtaac cctaccgtat acctacagta 180 cccctatagt accactacag taccttgact tgatccccca tc 222 // ID TURMOIL2 repbase; DNA; CEL; 6999 BP. XX AC Z81554; XX DT 15-NOV-1999 (Rel. 4.1, Created) DT 15-NOV-1999 (Rel. 4.1, Last updated, Version 1) XX DE DNA transposon TURMOIL2. XX KW Harbinger; DNA transposon; Transposable Element; KW Harbinger superfamily; TIR; TURMOIL2; palindrome; transposase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-6999 RA Kapitonov V.V. and Jurka J.; RT "TURMOIL2."; RL Direct Submission to Repbase Update (NOV-1999). XX DR GenBank; Z81554; Positions 26409 19411. XX CC TURMOIL2 encodes HARBINGER-like transposase. The most CC conservative CC domain of the transposase is encoded by regions 3922-4137 and CC 4199-4651. CC The TURMOIL2 transposase is most similar to the transposase CC encoded by the TURMOIL1 transposon. Both transposons CC do not share similar DNA fragments but they have one remarkable CC hallmark being flanked by long terminal inverted repeats. CC TURMOIL2 is flanked by almost identical 1900-bp TIRs. XX SQ Sequence 6999 BP; 2307 A; 1234 C; 1244 G; 2214 T; 0 other; agggaaacaa gaaaatccgg aaaaaaacga gatgaagatg aatctagcga tggaattgac 60 gatatttcag agtattttat tgtcgatcca aagaaaaagc ttgtcacaca tgagcagaaa 120 cgaactatcg attttgaagt gagttttata taatttttat ttgaaaatat cattttttgc 180 atcaaaataa ttcgataaat cccatttctt gaaaaaaaaa tttatttcag gcttcgacac 240 cttcttctca atttaaaatg gccggcggat atgaaaatta ttccaaaagt caggaaagct 300 ccggaaattt gagtagaaag tagttgtcgt ctcacatcaa tgatgcgatc acaagttctt 360 cggattccga cgaagattat gtgagcagaa gaattcgcaa agctcgtcaa aaatctcatt 420 attttaactc caatcaatat tctgcggaga atctgagcag ttcagctagg aagtctgcaa 480 aacttccgaa ttctcgttca atgcaagatt atatgtgccg tgcggattcc gtaagtgtga 540 aaagtaattc aaataatact atttgttatt gcaggaatcc atatcgacat ccagctcttc 600 agctcttcaa acaaacaaat ttccggaaga ttcttcacat cgaaaagatt attcacttcc 660 aaaaaatcta tataatcaag atgattctct actgcacgat tttgaatacc cggaaaggca 720 taagaaggtt acttcaaccc gccgaattga ttctgaaaat tcccgacaag gctctccgat 780 tatttttgaa gaagatatcc aggagtgccc tttaacgaaa cgaactccga agattcattt 840 gtctaaagga gacagctaca aaaaagtgac ataattaatt ttttctctca ttcgcaagct 900 tctttcagat atcgggaccg caggttgagc acattattcg taagatcaag gcgcagccgt 960 tagtttggaa tcgaaatttt acatgtgaac ttccaggaga agaaacagga cgaatttggt 1020 ggaaaataga aaaagaaatt aaatttatac gtacatcaag agatgtctag tgaattagtt 1080 gctgatagtg aagggacagt tttcaggtta caatgttcga cttgtttgga aaacgttaat 1140 atcggaatat aaagttcatc gaaaacactt ttctgaaggc tcttcaacga ttgcacaaat 1200 aaaagaaaaa tttcaatttt atgacctaat gtgttctctg tttgatgaca atcaaatatt 1260 cgaagaaatc ccaacaaatt gtgagaagaa aaaaattcga cacggtgata cttattcaaa 1320 ggtacattta catttaaatg gaatagtgaa gtagtgtgct tttcagctta aaaatgcgga 1380 aattttactg tttctccaag aagtcgagca gtatccatgt ttgtacgatc tcactgacaa 1440 acattataag gatcgagaaa agcacgattt ttggatcaaa attgaaaagg atctaggatt 1500 tttggcagaa gatagaggat ctaatgcaat caaaatttgg aattcattaa aatcggaata 1560 tttgagagct cgcaaaaaaa agtgtcccag tggaagtggt ctcgacgata tagagaacag 1620 cttcgatttc tatcaggata tgagtttttt tgaagcatct ttatcaaaaa gattctcaac 1680 accgtgagtg aagcaggtgt tcaaaaagtg tgctcatccg tattttacag gatttcttcg 1740 tcttcacttg atacaactac acgtcttgcg aattacgaat cgaacaatgt tctcactcca 1800 aaaagaaatt ataagaaagt gaaggaagag tccacctatg gtaggaattg ccgtactatt 1860 catctaaatt gtatttcttt gcagaaaaat ataagaaggc ggagaatctg agcagttcag 1920 ctaggaagtc tgcaaaactt ccgaattctc gttcaatgca agattatatg tgccgtgcgg 1980 attccgtaag tgtgaaaagt aattcaaata atactatttg ttattgcagg aatccatatc 2040 gacatccagc tcttcagctc ttcaaacaaa caaatttccg gaagattctt cacatcgaaa 2100 agattattca cttccaaaaa atctatataa tcaagatgat tctctactgc acgattttga 2160 atacccggaa aggcataaga aggttacttc aacccgccga attgattctg aaaattcccg 2220 acaaggctct cctattattt ttgaagaaga ggaacctgat gaaaccactc gaaaatttct 2280 atccaaagga atatcaaagc gccagaaaaa atttgtttcc aaatgaaaga ttctctaaaa 2340 atggtggggt aatgtagttt tgcatttttg aaactattct tttgcagctt cctggagatt 2400 tagtcgttaa catcatcaat attatcaaaa aggaagaagt tctttggaat gtttctaaaa 2460 cagtgaaagt gaagaatgaa gatctccacc agttgtggcg aaatctggaa aaaggagttt 2520 cgggtttgaa atcagataaa ggtgtgttga gatatgtgta ttgttgcaat gtaattacgt 2580 catagatgat ttggaacaat taaacattga aaaatatatt gttcaggcta caacttgaag 2640 cgcgtatgga atgcgatcac tggtgaatat cgatatcaaa agggtctcct cgatgaagaa 2700 ggtgccacaa ttgaggaagt aatacaagat tttccgtact tttatagtct caaatttttt 2760 gatcagacta ttcttgttga gaaacagcca aaaccgtaag ccaatccaaa aaaaaaatca 2820 acacaaacac aattttagga aaactgtgtc tagagatgtt acgacgcgat gcttcgagaa 2880 aaaaactgtc gaaaaacaga aaatccgagg ttcagttttc gcagctaatc agatatataa 2940 aaaaaatttg tagataaaat caatgaggaa tcgcaaatac tggacgacca actttcattt 3000 ttgaagaggc aatttgtgga gttgaaaaat gacagttcgc tcacatccac atccactatc 3060 catgctgatg ttattgcaac tattactact acactagatg aactgccact tctggaacaa 3120 ctcaagctaa aaggagaaat cattgaatcc ggtgattcat tcaagaaaaa gaagaaaact 3180 tagtaatctt tcatttttca atttttaatg ctctcaacaa taataaaaag cgtattcatg 3240 gtcaaaaatt tacagtcaca ccagtcacac ctctgtgaaa tacttttcat ttcgacaata 3300 agtcaatcat ggtaatggaa aattatagat tttcgatcat aatgtctcag acacgacgaa 3360 gtaagcccac aaaatattag gtagaatgtg aaagcttata atttttcaga gttgcttatc 3420 ttgacgatga tggctgcaat tatgaaaaaa aagcaccgat tgatagagct gaaaaacgat 3480 ttggagtgcg acatggaatt cgaagaggac cgactttaca tcaaatctaa taattttgat 3540 agatttgtcg aataatgctc agacttggag gaaaagaagt ctcaaattca ggtatagcaa 3600 atgaaactta cctgagattt tattaataaa tgtttcttta ggtaatactg aaacttcccg 3660 taacagtggt atgtgacatt tgcacagcgg tcaacagtct caacatttcg acgaacagaa 3720 aactgtcttc ttttcaagaa gttgtaattt tccttcaatt ctgcaacatc ggtatacaga 3780 attgtcgctt tctgcattta gtaatataca atttcaggtg aatcgttttc ttcgatgtcc 3840 cgaagatggg gattagcacc gactacagta gccaatgcag tgtatcatgt tacaaatgcc 3900 attaattcga gaatgaagtg catcgaaaca cccaaatcag cagaggaatg gagaaaagtc 3960 gaaagaacat ttgccaaaaa gcacatatta cggtgtctcg gctctctcga cggaaaacat 4020 attcgaatca aagctccacc acattccggt tctctgtttt tcaattacaa acacttcttc 4080 tcgtttgttt tgctcgttgt agtagacgct gacggccgtg tcatttgggt cgatgttggt 4140 acgtttaaca ataacaatac attattaata tttcataaat catatttcag gatcaccggg 4200 atcgaacaat gatgcttcaa ttttttctga ttcaaaattg aaaacaattt tagatgaaga 4260 agcaaacctt ccaccgccaa ctttctggag tcgagatttt gttgtcaagc cttttgtaat 4320 tgccgatgga attttcaaaa ttactccaag gatgatgaca ctgtacggag gaaatggatt 4380 aaatatttct caaaaagttt taaacaaaaa attgtctcga gctagagtca aagcggaaca 4440 cgctttcgga agattaagtg caaaatttcg aattttgaga agagaaatgg aatgtccata 4500 cgaaaaatct gtggatatca ttctggcctt gtgtcatctt cacaattact taatcggaaa 4560 ggcggaagat gttgttaacg ttcccatcga gacaaacatg attccttata aagatggcaa 4620 agagcaacgc gattatctgt gtagatactt gaataactaa atatgtcagt ttaatatttt 4680 atttgaattt atattatctt tatgcatttt gctgtactcc aaaaatttcc tcttttattg 4740 tccttttgaa atttcataat attgattttt tattttaata aacaatactc cttttcgaaa 4800 atttgatgat acttggtaca tattttcaaa atgtcatgtt taagaagtgg aattaaattt 4860 gaatcaataa cttccaatgt ttctacgaca tatccgacga cgtcctcgta ttttttcttc 4920 acttcgcttc gtttctcgcc ttctttcagt tcttttaaca atgagacaca ggtcagcata 4980 atttcatcgc actcagccgt atccttctta tatttttctg caaagaaata caatttagat 5040 gaatagtacg gcaattccta ccataggtgg actcttcctt cactttctta taatttcttt 5100 ttggagtgag aacattgttc gattcgtaat tcgcaagacg tgtagttgta tcaagtgaag 5160 acgaagaaat cctgtaaaat acggatgagc acactttttg aacacctgct tcactcacgg 5220 tgttgagaat ctttttgata aagatgcttc aaaaaaactc atatcctgat agaaatcgaa 5280 gctgttctct atatcgtcga gaccacttcc actgggacac ttttttttgc gagctctcaa 5340 atattccgat tttaatgaat tccaaatttt gattgcatta gatcctctat cttctgccaa 5400 aaatcctaga tccttttcaa ttttgatcca aaaatcgtgc ttttctcgat ccttataatg 5460 tttgtcagtg agatcgtaca aacatggata ctgctcgact tcttggagaa acagtaaaat 5520 ttccgcattt ttaagctgaa aagcacacta cttcactatt ccatttaaat gtaaatgtac 5580 ctttgaataa gtatcaccgt gtcgaatttt tttcttctca caatttgttg ggatttcttc 5640 gaatatttga ttgtcatcaa acagagaaca cattaggtca taaaattgaa atttttcttt 5700 tatttgtgca atcgttgaag agccttcaga aaagtgtttt cgatgaactt tatattccga 5760 tattaacgtt ttccaaacaa gtcgaacatt gtaacctgaa aactgtccct tcactatcag 5820 caactaattc actagacatc tcttgatgta cgtataaatt taatttcttt ttctattttc 5880 caccaaattc gtcctgtttc ttctcctgga agttcacatg taaaatttcg attccaaact 5940 aacggctgcg ccttgatctt acgaataatg tgctcaacct gcggtcccga tatctgaaag 6000 aagcttgcga atgagagaaa aaattaatta tgtcactttt ttgtagctgt ctcctttaga 6060 caaatgaatc ttcggagttc gtttcgttaa agggcactcc tggatatctt cttcaaaaat 6120 aatcggagag ccttgtcggg aattttcaga atcaattcgg cgggttgaag taaccttctt 6180 atgcctttcc gggtattcaa aatcgtgcag tagagaatca tcttgattat atagattttt 6240 tggaagtgaa taatcttttc gatgtgaaga atcttccgga aatttgtttg tttgaagagc 6300 tgaagagctg gatgtcgata tggattcctg caataacaaa tagtattatt tgaattactt 6360 ttcacactta cggaatccgc acggcacata taatcttgca ttgaacgaga attcggaagt 6420 tttgcagact tcctagctga actgctcaga ttctccgcag aatattgatt ggagttaaaa 6480 taatgagatt tttgacgagc tttgcgaatt cttctgctca cataatcttc gtcggaatcc 6540 gaagaacttg tgatcgcatc attgatgtga gacgacaatg gaatctgaaa ataatcgatt 6600 ttttgattta acaaatgaca gaactcaccg tagaatattc cgtttcagga agaaaattcc 6660 cggtttcctc gtgtggtcgt tttctactca aatttccgga gctttcctga cttttggaat 6720 aattttcata tccgccggcc attttaaatt gagaagaagg tgtcgaagcc tgaaataatt 6780 ttttttttca agaaatggga tttatcgaat tattttgatg caaaaaatga tattttcaaa 6840 taaaaattat ataaaactca cttcaaaatc tatagttcgt ttctgctcat gtgtgacaag 6900 ctttttcttt ggatcgacaa taaaatactc tgaaatatcg tcaattccat cgctagattc 6960 atcttcatct cgtttttttc cggattttct tgtttccct 6999 // ID LINE2E_CE repbase; DNA; CEL; 3933 BP. XX AC AF077538; XX DT 10-SEP-1999 (Rel. 4.08, Created) DT 10-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE LINE2E_CE is a non-LTR retrotransposon. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW CR1 superfamily; LINE; LINE2E_CE; ORF1; ORF2; KW reverse transcriptase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3933 RA Kapitonov V.V. and Jurka J.; RT "LINE2E_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR GenBank; AF077538; Positions 11500 7568. XX CC LINE2E_CE is a non-LTR retrotransposon related to the CR1 CC superfamily. CC There is only 65% identity between the LINE2E_CE and LINE2A_CE CC nucleotide sequences. CC Their ORF1s encode 50% identical (68% similar) proteins. Reverse CC transcriptase-related proteins encoded by ORF2s are even more CC conservative: 59% identity and 75% similarity. Presumably, CC LINE2E_CE has been fixed in the genome recently (up to several CC million years ago) since there is only one false stop-codon CC accumulated in ORF2. XX SQ Sequence 3933 BP; 1199 A; 934 C; 719 G; 1081 T; 0 other; acgagacggt cgcatctctc tcctcctctg gtattgattg gttttcgcca aaatgccgcc 60 gcctgacgat aattctgcta atattaccga tgattcgccg gcaaatcgaa cttctatcaa 120 atgtgatcca ccgaaaaaag tacatggact cccgtctacc ccaacatcca gtcagccaag 180 catcaaggat attttagagc gtctcacata cttggagaac cttgtcgctg aacaggcaaa 240 aaccatcgag gaactgaaag cggctcaaaa ctccccatta ccagcaatta gccagtcgaa 300 taagaatata tgcccactgt actcggatat tgttaaaagt gatcctcaat ttggccaagt 360 cacggcactc actcatttgt catcggactt gcgcaaactg aactccaaat ccgaatatac 420 gattattgag aatgttactg atgacaaagc gaaaggaaaa tctcccaacg atatggaatt 480 tatgaagaag ttgtcgagta ttgaagatct tccgaaacca acggaatttt atcgtttgaa 540 atgcaagaac cctgacatca ctagtcgtcc actcaaattg aaattctcct ctgaagcgca 600 acgggactct ttcattcgtg ggttttctac tgcattgcat agcatgccag accgtccgat 660 tacccccagg ccaatccgtt gccgtagaga tatgactcct gaagaattgc tgatgctcaa 720 gaaaatgcga tccactgcct acaatgagaa cctcaaagcg ggtgagattc gttattatgt 780 tcaagacttg aaggttgcac agctatcaaa accgagacca ttgccaactc aaaaggagac 840 accggttcct actccagtgc tgatggattc agcctaggaa ctcggccacc ggaccgctaa 900 tcatcacctc aaccatagta attcatctac attatcgtgt ccagtggtgg cttctaatat 960 attcactaat ttcactaatc tatctcaagt acatgcagca actaaggatc tatcaatttc 1020 taataccggc gccaggcaaa cgtgcaatta cccagaagta tttcccgttg gttgtgtcct 1080 agcgaatgtg cggtcagtct ccactgtgga acgcctgagc tttatacata gtcacattcg 1140 atcgatgaat attgatgcac tctttgtaac tgaaacattt ctaacgccgt ctactctgtg 1200 ctgtgcgacg gatatattat gtcatcgatc tgatcgctcg gatgcgcatc caaagtcaaa 1260 aggaggtggt gttgcaatgt tccatcgatc ttcactactg ttttcaaaat gcgatgcact 1320 gatcgaaaaa tactacgacc aacaccactg cgatatacta gcaattgatc acaatccatc 1380 gaagacaaga tttatcatcg tgtacagacc tccggatacg tctctatcac aaactgctgc 1440 actttaccat aacttatcgg aactactaaa cagttcaaac cgtcgcaact acatacttgg 1500 tgatttcaat ttgcccaacg ttaaatggac cgacttggca cggctaccgt gtaacattca 1560 tcgcgatctt gcggcattta tgatctccaa caatttgtat caggttgtga acaagcccac 1620 acgagttgct ctatctggaa tggaaaatat tttggatctg attttcacag acagtccatc 1680 tctaatatca agcctcaaat cggaatcgcc gctactactg tcagatcatc tttccattaa 1740 atttgatctg gtgctaagtc aaccaaaagt agcaccatct ctgaagaagc atcgtgcctt 1800 attattccga aaatgcgact ttagatcgct caatgcacat cttcaatcgt tcaactggcc 1860 gcgtcaatta tcctatttct caaactgtga aaccaaactt atacattttc tgaagatctt 1920 caatgagttg gttagagaat atacaccaat tgctgtggag cgtgccaaaa tttctacgaa 1980 gtatagaagg aatctacaca gaagaatgaa gcaacggagg tctaggaatg ttcacattga 2040 tacaaaaagg attatcaagc tcgactactt gcggtttgaa aacgattggc aaaacaagaa 2100 actcggatcg tggagtcaaa aaacagcaat cagttattta acttggtcaa aaagcgaata 2160 aaacattcaa atgctatcac ttgcctagag gtcaacggac gaacaacaat ggacccgtca 2220 gaaatcacgg atgagttcat caaaacattt tccgaatcct ttacactccc ctcggagccc 2280 ttttctgcac tgccggctaa aaaaccttcc agtattacgc cggatttttc gccagtcaat 2340 atacaccgaa ttatacagaa attacgtccg aagattgggt attctcgaga tgacattaac 2400 ttctacgtta taaaaaattg tgtccacgcg ttaacaattc cactttccca catcttctcc 2460 gaatcatttg actctggaca tttcccggag atatggaagc agtcagttgt gataccaata 2520 cataaaaaag gaaagctctg tgatgcgaac aacttcagac cgatttcatt aacccatcca 2580 ctgtctagac tgttcgagaa gattgtaatg gataagttga gaagcctgac cagtgacaaa 2640 ctctcacgat ttcaattcgg gttcatgaac cgtcgatcct gcaccatcgc cctactcaac 2700 tcttgctcca aggcatacga actactatcg cgccgctcca cctacgtcga tctcgtatac 2760 ttcgatttca agaaggcatt cgactgcgtg ccgtacaatt tgctactctg caaactacaa 2820 ttatttggct tggatatgaa attatgcaac tggttccagt catttctatg tggacgcaaa 2880 tcagtgatca aagttgatga ctacgtatcg gacaattcat ttgacgtcat atctggagtg 2940 ccgcaagggt cagtgagtgg tccatatctt ttcttgatct acctgaatga tcttctggaa 3000 atgttcccat ctgatgtgta cgttacggct ttcgccgacg atatcaaaat tctgggagac 3060 aacagcactg ctattcaaag agctattgac attgtgacgg aatggtgtac aaaatggaaa 3120 ctgaacctgg ccgaaaacaa aacgaatgtg gttcatcttg ggaacaaaaa ccctaaaacg 3180 aaatactttg cgaatggtat ccagatttcc gaaaaagatt cagtgaaaga tcttggcgtc 3240 tttgtggata acaagctcaa cttcaaaaag catatcaatt atatttctaa ctgtgctctc 3300 ctggaatgca agcagcttct caaatcgttt cgttctacaa atgcaaacct ctacttcaag 3360 ctattcaacg tctacgttca accattattg aactatggaa gtgaagtctt cagtccaact 3420 gcaaaaagcc tgataaagca acttgaaatg ccactcaaat tctactctag aagggtattt 3480 caacgctgca atatgcaata cacgtcatat gaagaccgtc ttgcccagat gaatcagaag 3540 tcagtccaac atctacgaat tcttcagata cttcggactt tccacaacat tgtatctgga 3600 gaataccact tcccaaatgt ctcatccctg aagaaagcac gatctccaag atttcctctc 3660 atgatgaaat cagttagacc cgccgacaaa agcttcctac tggcaaacct ggaaatatgg 3720 aaccgcgttg cccaactgat ccctgagaaa actagccgac aaatgtttgc atctcgatta 3780 aattttattc ccttgcattt tctcctccct taaaaattgt gatctcttaa ctattaccgg 3840 ttagcaagat cactttaaca ttcaatcttt tatccggtga ttatcatgac ctgtctgtta 3900 tcaaattttt gattgaataa atatatatat ata 3933 // ID CEREP5 repbase; DNA; CEL; 303 BP. XX AC . XX DT 09-DEC-1999 (Rel. 4.11, Created) DT 09-DEC-1999 (Rel. 4.11, Last updated, Version 1) XX DE CEREP5 dispersed repetitive element - a consensus. XX KW CEREP5; dispersed clusters. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-303 RA Kapitonov V.V. and Jurka J.; RT "CEREP5."; RL Direct Submission to Repbase Update (DEC-1999). XX DR [1] (Consensus) XX CC This element does not have any noticeable hallmarks of known CC retroelement or DNA transposons. There are about 100-200 copies CC of CEREP5 in the C.elegans genome. They are ~80% identical CC to the consensus sequence. CC There are multiple clusters composed of tandemly repeated CC copies of CEREP5 in the genome. XX SQ Sequence 303 BP; 113 A; 51 C; 49 G; 82 T; 8 other; tgagatatcr aaaattgtca aaaattcgag aaatcyaaca aaatgctcaa awttttgaaa 60 ttttcatatc aaattcgaat tccccgcaaa atttcagtct gaaactcgaa aactagagst 120 ttctagctca attttgaaat ttcgagggtt acggtagcgc caaaagtacg caaacaccga 180 ctacagtacc cccaaaacac csgatttyac gcgattttcg aattttgagc tgaaaattta 240 ggratagact aagaaaatrg cggggaattc gaatttgata atgaaaattt gcgaaaaata 300 ttt 303 // ID CEREP1A repbase; DNA; CEL; 3401 BP. XX AC . XX DT 24-AUG-1999 (Rel. 4.07, Created) DT 24-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE Nematode CeRep1.1 repeat DNA - a consensus. XX KW DNA transposon; Transposable Element; CEREP1A; CERP1A; KW target site; terminal inverted repeat. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Felsenstein M.K. and Emmons W.S.; RT "Structure and evolution of a family of interspersed repetitive RT DNA sequences in Caenorhabditis elegans."; RL J. Mol. Evol 25, 230-240 (1987). XX RN [2] RA Jurka J.; RT "CEREP1A."; RL Direct Submission to Repbase Update (JUN-1999). XX RN [3] RP 1-3401 RA Kapitonov V.V. and Jurka J.; RT "CEREP1A."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [3] (Consensus) XX CC CEREP1A is a putative nonautonomous DNA transposon. CC This element is flanked by identical 383 bp-long terminal CC inverted CC repeats and it generates 8 bp-long duplications of a target site CC upon its integration in the genome. Presumably, HAT-related CC transposase is involved in propagation of CEREP1A [3]. XX SQ Sequence 3401 BP; 1055 A; 643 C; 653 G; 1050 T; 0 other; gagggggtac gttttcgtaa ggggattttt tcgtaaggaa cgctttcagt agggcaatgt 60 gcggcgcgcc ttttcgcaag cccgccgcac agtccttcgc aatgcgccgc acggcttttt 120 gcaatgcgcc gcacgcttct tcgtatttat ttatttattt ttccaacagg aaatttcgaa 180 cggaattcaa ttaaaaacgg atttttaatg gaaattaatt agttttattc gagctaaata 240 cggatatttt cagaattttc gatgacaaaa acgggtttca atcgatttcg aatcgatcaa 300 aattaatatt aatcgaaaat gagcgtttcc ttcccggaaa aagaagaact tgatgctgag 360 agtgatccaa gcggagccaa acgctttttt agaggcaagt attgatgtct aaagattaat 420 tgaagagtga aaagtatttt tcagcgttcg acagggatct aaagaagaaa agaaatccga 480 aagcgtctac tacaaatgta tgtgaatcga ctcctgcaac aatcgatctg gttactccac 540 ctgcatcacc acgtcggtct gattcgtata tgcagtttcc ttcaaattct cgagagctcg 600 atgagccgag atccactgac gatatcccgg aatgcgattc aatagccaat tctactcagg 660 aaacaagctt ggatacaaca gattccatgg caaatactga aagatttttt gattttgatc 720 cagctgtatt tcccgaagaa atggatcacg atctacagga aaccagtgaa ttcaatcttc 780 tgtgtaggtc ttaactaata attaaataga aataacagac gttgtatttt taggtcacaa 840 ttcatatcaa tctccgtttg aagacgaaga gactccatac catccggatt ggtttttgac 900 aaataggttg ttattaatca ataactctat caaaatataa aattccagca agtcacacgt 960 gttacgttct ctgatgcact atagaccgat tgccaatagc ccaagagggg tgcttgaaat 1020 ggcaccggaa tatatcagcg agaatagttc agatttttgg tatgccttct aacttctaaa 1080 acaattaaac atagaaacta tgcaatttca gtatttcgac gaaacgagga gctccattgg 1140 tcattttcaa caatcaaagc tatcgtccga gagaaacact gaaaaatggg gacattgtgg 1200 ctaggtgaat tatcgctagg aaaaaattgt cacacatttc attaatttat ttcagatgta 1260 acgttacact ggaaaactcg agaaagtgcc ccggatccgt gcgtatcaag actgagacag 1320 gccatgtgaa agtcagtcgt gattgtgatc acgaatcgag gagaagcgaa attgcagtat 1380 actgaaattt acgtttttca aaactaaatt tatattttag gtgcaattaa ttatgagaaa 1440 aatcgcagtt tcatcaatgt cttcttcacc atcaaaagaa cgcctttatg agcttgctgt 1500 tacactttca aaaagatatg aagttgttga acttctcaag agcaacgtaa ataataaatc 1560 gtggaatcaa taaaacatat actttttcca gaaagaaatc agtcgcatac tggacggtat 1620 gctgagtagg aaatcgacga atagttttcc accagccctc gattctgcct attttaaagg 1680 aggaaaatgt tgtttagaca gttctggaac gtttgagtct aataaggatt tgctagttgt 1740 gtggcaagca gaggaagctg aatctttgta tggtgcacac agcttgcatt tggatgggaa 1800 attctcagaa attccgaaaa attatcgaca gctttacatc gcaacagtca cggtgcgtat 1860 taattgaaag gtattttgat gtcaaccaat gacattgcag gatctcgata cgaacgtggc 1920 agttccattg gtttttgctt tgtgtggctc tgccaaagaa gatttgtaca gacgtttgtt 1980 cctatctttc ctacgtagtg gaatccaacc aaaatacgta ttttcaggta aacgttggtt 2040 atatttatta aatttatcaa cctatccaga ctttgaagtg gccagtgtta atgcgagcag 2100 agctgtgtgg ccgaatgcgg acattttcgg atgttggcgt cacttcaaag tgaatctttt 2160 taaaaatctg cgaaaaggta aacctaaata tttaatgctg ctaacctaat tcaaaatttc 2220 agctgttgga gacgatatca ataagccgga attcgtgaat cttttaaaat cgatttatac 2280 tctcagtttt gctccacacg acaaagtact tctttggctc cgtcttctga aatgtcgaac 2340 agttaatatt catccagcat tgaaaggttt gaaaaatctt gtcatgaaat caattattct 2400 cgtttttcag atttcttcag ctacctcgaa aaattctatt ttggagaaga tgctcgatat 2460 cccgttgatt tttggaactg tagagagagg gttcgaaacg gattagagtt cacaaacaac 2520 ccgagtgagc agcaattttc caagctgaag gcaagtttct ttttgtagtt ttttaagttc 2580 tcaaataggt atataatatc gatttgtttt gttttccaga gaatgtatca tcacaacgac 2640 tatcaaaacc tgcgaggagt gattcaaaag accattcaac tatcaaaatc attgatccta 2700 gacttgaaga gacgatccgg aaatcgaaca accgctgtgc cattatacaa aaaccccgat 2760 ttctgtactt cagtgttttt gaattatacc ggtttcagtg cttttaatga gaaacttttc 2820 ccttttatgg aaattccgaa atgctatcac tgatctcttt gtttacctct gtgccattag 2880 attatagaaa caattttcca tactcaggtt atccaaattt cgccatgatt tttcttactt 2940 aatttaatcg aattaacttt tgtttgcagt tttattttcc atttcgactg aataaaaaat 3000 cacttctatc attttaatcg tttggctccg cctggatcac tctcagcatc aagttcttct 3060 ttttccggga aggaaacgct cattttcgat taatattaat tttgatcgat tcgaaatcga 3120 ttgaaacccg tttttgtcat cgaaaattct gaaaatatcc gtatttagct cgaataaaac 3180 taattaattt ccattaaaaa tccgttttta attgaattcc gttcgaaatt tcctgttgga 3240 aaaataaata aataaatacg aagaagcgtg cggcgcattg caaaaagccg tgcggcgcat 3300 tgcgaaggac tgtgcggcgg gcttgcgaaa aggcgcgccg cacattgccc tactgaaagc 3360 gttccttacg aaaaaatccc cttacgaaaa cgtaccccct c 3401 // ID PALTA3_CE repbase; DNA; CEL; 360 BP. XX AC . XX DT 07-SEP-1999 (Rel. 4.08, Created) DT 07-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE PALTA3_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; PALTA3_CE; KW TA target site; TIRs; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-360 RA Kapitonov V.V. and Jurka J.; RT "PALTA3_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC PALTA3_CE is a putative non-autonomous DNA transposon; it CC forms almost perfect 180 bp-long palindrome and generates, CC putatively, duplications of TA target site upon its integration CC in the genome. Presumably, multiple transpositions of PALTA3_CE CC were induced by Mariner/Tc-like transposase. CC The C.elegans genome contains about 100 copies of PALTA3_CE. CC PALTA3_CE copies are ~93% identical to the consensus CC sequence. There are several minor subfamilies related to CC PALTA3_CE CC in the C.elegans genome. XX SQ Sequence 360 BP; 85 A; 92 C; 91 G; 92 T; 0 other; tacactcaaa ataataaaac tgcgtggcgt gtactgcaga aaacctaata tttaggcccc 60 gcctttttct cgtccactca cggagaaaag gcaaaaattt ggaccccaac caatatcagg 120 ccgccgacat cttacgggtt ccgcgcgccg ctatgtttaa ctcgctgtgg gcgtggcgag 180 ctgtctccgc ccgctgcgag ttaaacatag cggcgcgcgg aacccgtgag atgtcggcgg 240 cctgatattg gttggtcccc aaatttttgc cttttctccg tgagtggacg agaaaaaggc 300 ggggcctaaa tattaggttt tctgcagtac acgccacgca gttttattat tttgagtgta 360 // ID CER7-LTR_CE repbase; DNA; CEL; 327 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER7-LTR_CE is a long terminal repeat of the CER7_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER7-I_CE; CER7-LTR_CE; KW CER7_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER7_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 327 BP; 80 A; 105 C; 46 G; 96 T; 0 other; agtgtacgca acgtaccttc tcctttttgc caatataata ttgcacacat tccttcctcc 60 cccggccact ttacaatctc accctcaaac tccgcccccc tggggcaggg gtataaaaga 120 agcccgtccg caaggacggg cacagttatc gtcaacccca actctccaca cacacacact 180 ttcaatccat ccgtttctta tattatactt ttccactgtt tcttatattc ttccactgtt 240 tctgttgtct actatttctg agactcaata aagtaactgg cgagcaatat ccgtcaggct 300 gttaaaaccc actgccctca ccgttta 327 // ID CeRep56 repbase; DNA; CEL; 25 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE C. elegans repeat 56. XX KW MSAT; Satellite; Simple Repeat; C. elegans; CeRep56; KW minisatellite; repeat. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-25 RA Jekosch K.; RT "C. elegans repeat 56."; RL Direct Submission to Repbase Update (25-APR-2002). XX DR [1] (Consensus) XX CC Putative novel minisatellite repeat, found 436 times in the CC C.elegans genome. For further analysis compare CC http://www.sanger.ac.uk/Projects/C_elegans/repeats. XX SQ Sequence 25 BP; 6 A; 5 C; 2 G; 12 T; 0 other; taatgttctg ccaatttatt tactc 25 // ID PAL8C_4 repbase; DNA; CEL; 201 BP. XX AC . XX DT 09-JUN-2000 (Rel. 5.05, Created) DT 09-JUN-2000 (Rel. 5.05, Last updated, Version 1) XX DE PAL8C_4 is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; PAL8C_4; KW TIRs; nonautonomous DNA transposon; putative HAT superfamily. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-201 RA Kapitonov V.V. and Jurka J.; RT "PAL8C_4."; RL Direct Submission to Repbase Update (JUN-2000). XX DR [1] (Consensus) XX CC PAL8C_4 is a putative non-autonomous DNA transposon. CC It generates 8-bp duplications of target sites upon CC integration in the genome. PAL8C_4 has 35-bp CC terminal inverted repeats. CC There are 3 copies of PAL8C_4 in the genome; CC they are 99% identical to the consensus sequence. CC PAL8C_4 is most close to the PAL8C_3 family (89% CC identity between the consensus sequence). XX SQ Sequence 201 BP; 63 A; 42 C; 33 G; 63 T; 0 other; ggtactagtc ctaaatcccc ccgccacttt tccttttcta ccccgcattc aaatttaact 60 ctcacgtgaa tgtttatttt aatactattt tcatttttag gcttagaaaa caacattact 120 aagcctaaaa tacggaaaaa ggccacgttt ttatcgaaga aaatgtaagg aaaagtggcg 180 gggtgattta ggactagtac c 201 // ID CELE11 repbase; DNA; CEL; 218 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE CELE11 nonautonomous Tc2-related DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE11; TC2; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-218 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification of putative nonautonomous transposable elements RT associated with several transposon families in Caenorhabditis RT elegans."; RL J. Mol. Evol 43(1), 11-18 (1996). XX DR [1] (Consensus) XX SQ Sequence 218 BP; 71 A; 31 C; 40 G; 76 T; 0 other; taccgtattt cctctattag ttttgcaccc ctatgctcaa ctttgacagc atctcagtag 60 tggttcgtac tatagaaaag ttgtcaactg acaaaatttt tgtaagatgt ttttctatag 120 tttgtagttt gttaattatt tatatgttca aaacaaaata agatatgagc tgtcaaagtt 180 gagcaaaggg gtgcaaaact aatagaggaa atacggta 218 // ID Chapaev-2_CE repbase; DNA; CEL; 5022 BP. XX AC . XX DT 30-SEP-2007 (Rel. 12.09, Created) DT 30-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE Autonomous DNA transposon - a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-2_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-5022 RA Kapitonov V.V. and Jurka J.; RT "Chapaev - a novel superfamily of DNA transposons."; RL Repbase Reports 7(9), 779-779 (2007). XX DR [1] (Consensus) XX CC Chapaev-2_CE is a young family of DNA transposons. The genome CC contains four copies of Chapaev-2_CE that are <2% divergent from CC the consensus. Chapaev-2_CE belongs to the Chapaev superfamily. CC Hallmarks of the Chapaev transposons are 4-bp target-site CC duplications, terminal inverted repeats with the conserved '5-CAC CC and GTG-3' termini, and the Chapaev transposase. The Chapaev CC transposase is characterized by the conserved CC D-x(60-80)-D-x(220-290)-E catalytic triad. Chapaev transposons CC populate genomes of different animals, including sea urchin CC Strongylocentrotus purpuratus, amphioxus Branchiostoma floridae, CC starlet sea anemone Nematostella vectensis, sea hare mollusc CC Aplysia californica, mosquitoes Aedes aegypti and Culex pipiens, CC and nematode Caenorhabditis elegans. The N-terminal portion of CC Chapaev transposase in Chapaev-1_ACa, Chapaev-2_ACa, CC Chapaev-3_ACa, Chapaev-1_BF, Chapaev-2_BF, Chapaev-1_NV, CC Chapaev-2_NV, Chapaev-3_NV, and Chapaev-1_SP is similar to the CC N-terminal portion of RAG1 (100-370 aa in the human RAG1). It CC includes a novel type of zinc finger, called Chapa: CC H-X7-C-R-X-C-G-X35-D-X4-H-X4-C-X2-C-W-Xn-C-X2-C-X8-G. In the CC amphioxus and anemone Chapaevs, the N-terminal portion contains CC also the RING finger motif. Some Chapaev transposases (e.g. CC Chapaev-2_ACa) show low similarity to the RAG1 core. XX FH Key Location/Qualifiers FT CDS join(1108..1254,1330..1521,1568..3736,3791..3991, FT 4041..4148,4201..4308,4356..4493) FT /product="Chapaev-2_CEp" FT /translation="MTKVYFKQKRKEKSCAGLLLANAQRARNSANGVSKDQ FT TEEEELNNSKVETCSSSPQANQEISSEDRNSHHPDIASSSSATTCRSSNCF FT NMDEGILTRDVEMDFSDEIPTFSHNRFSNPVFPPTPSRNSFSTPRVRIISN FT VEQIPQIKMLRLTIERLKNEKNEISEEIQKRNDEFAVFSEEIQIVRHENIT FT IRASNSSLQMEVCNIRTIVSGIEGRLVRSCADVKKSGKAMVLSLQRHAEEV FT ENLKKSAFKVKFSNLRNKSCIKNRFRKALHLLQHEICLEDDVGQFIRKFTK FT FLNSEENQSYKNKLSNFEAVDFLSKCGLSLSQMEKIKNYLTNAIGYDLLPS FT VKNTRDLSIQLSMISSFKVSTSFDNKGKVITIVQCMKIAEVLAYRIELLCN FT SNQFVDDGYTKGVIKIGVVGDAGGGSTKLALVIGNVSRPNSSRHVMVIAVY FT DGSDNYSSLKKFIPDVLEQLGKLTKIRYLDKGVQKTAKIVQIATGDCKFQS FT DILGHQGHSSHNFCFKCFAQNPRGAEERMKIKDMNVDETFHPRTIDLYKAC FT CENGDLGVRKGCTPLLPHVAIMFYLIPLLHIIMGIFDKYIFNPLWKYSVTL FT DNTTCFPILKTRKATLKNAVNLIKSAEEKYQAATGKMKLESHAELKALQSE FT KLLLDTIVNGTPGGTLEKMEQCWAKFGADKQAWFQSFCGNHLKLLLTPAVV FT EETFNIFGPNLCPMLLGLKSAMGKLSTIMSLSGNKFLNDSDVLILQNSIRG FT FVEDLKVAVPEETIILKLHLLVYHAPQMAKDVRNIGRITEQGVESVHAIFN FT ALERRFCNYRDKKRRYIHVLRELMCRNIMNDMTMNTSIPGLSIVPKDATVL FT PISSQNDLRDPKAVIRDVTAAQKRKNLAKKKINFRSKKRLPQSTKSRNAVP FT DFSQNVQTSIAFGGNSTKIGFRGLPSNALSRPNPMAITQTCSTSQPITRSH FT PLVAPPVSYQLPKCSNSLAEKIRASCGQARRSAAESSSTESHVDTNSKKKV FT YRLKFVQKSREIPKK" XX SQ Sequence 5022 BP; 1690 A; 907 C; 928 G; 1497 T; 0 other; cacggcacgg cgaacgggac atggcctaga aagttgcgaa aactagtccc gcggctgcag 60 tgctttattt tttttcatga ggttgcggga gtttttcgac gtttttttcg tgttttttga 120 tgttttctcg tcgttttttc ttgttttctt cgttaaaaat tgaaataaaa ataatatttt 180 cagcgaaaga ccacaaatcg ccgcaaaaat cgaccgcatc atccgcgaaa aggcgaaaaa 240 cgtcaaaaag tgagttgttt ttttgacata tctcaaaaac tgttggtcca gatttttttt 300 aaagagccta tggtatgaaa aaatacatgg cggttctgaa ttttaatgat ttaggtgtat 360 agcatgggcc caacttgatt atttttttga atttagtgaa aaatcagcac atttttgtat 420 aaaaaattac tttcctcaag aactgttggt ccagattttt tttaaacgtc ttagaatata 480 gaaaaaatca tgctggtttt taatttgaaa gattcaggct tttagcatgg gcccaagttt 540 actgtttttg gcatttcgtg aaaaatcgat aattcgaacg aaaaattatt tttctcaaca 600 aaaattggtc tcagaatttt tgaaaccttt tttaattata cactgaaata tgctgatttc 660 aaattgataa gttttgaatg ttcagccgac ggattttgca attatttttt tcgatttagt 720 gaatattcac atttctgaac aaaaattgaa ttttttcgaa aactgttggt ccagattttt 780 taaaaagatc ctaaaatatg aataaaagca tgctgattct aaattcaaaa cattcagctc 840 aatatcatgc gcctaaatta ttgatttttt caaatttcgt gaaaatcagt ggaaaatcag 900 tacatttctg aacgaaaagt attttttttc aaaaactgtt ggtccagatt ttttaaaaag 960 atcctaaaat atgaataaaa gcatgctgat tctaaattcg aaacattcag ctcaatatca 1020 tgcgcccaaa ttattgattt tttcaaattt cgtgaaaatc gatatttaaa aattgtttcc 1080 cttataatat tcatgcaata tttccagatg acaaaagtat acttcaaaca aaaaagaaaa 1140 gaaaaatctt gtgcaggcct tcttttagca aatgcccaga gagcaaggaa cagtgctaat 1200 ggagtttcaa aagatcaaac tgaagaggaa gaactgaaca attcgaaagt ggaggttcgt 1260 tcagctcaat tttgtctact ttaattaagt tggagtagtt tgttctaact tcaaaaacgt 1320 gattttcaga cgtgcagctc gtcgccacag gctaatcagg aaatcagttc agaagatcga 1380 aactcccacc atcccgacat tgcttccagt agctccgcaa ctacatgcag gtcgtcaaat 1440 tgtttcaata tggatgaagg aattctgacg agagacgtgg aaatggattt tagtgacgaa 1500 attccgactt tttctcacaa tgtaattttt ttcaatattt caaaactaat attatatgat 1560 tttgcagaga ttctccaatc ccgtatttcc accgacacct tcacgaaact cgttcagtac 1620 accgagggtt cgaataatct ccaatgttga gcaaattccg cagatcaaaa tgctccgact 1680 aacgatcgaa cgactgaaaa atgaaaaaaa tgaaatatca gaagaaatac aaaagcggaa 1740 tgatgagttt gcggtgtttt cggaagaaat acaaatagtc cgccatgaaa atatcactat 1800 aagagcatct aattcttcgc tgcaaatgga agtatgcaat atacgaacga tcgtcagtgg 1860 tatagaaggt cgattggtga gaagctgtgc agatgtgaag aaaagtggca aagctatggt 1920 tctgagtcta caaagacacg ccgaagaagt tgaaaacctt aaaaaatcag cttttaaagt 1980 taaattttcg aatctaagaa acaaaagttg tatcaaaaat cgatttcgca aagctcttca 2040 cttgcttcaa cacgaaatct gcctcgagga tgacgtcgga caatttattc gaaaattcac 2100 aaaattcctg aattcagaag agaatcaaag ctacaaaaac aagctttcaa actttgaggc 2160 cgttgacttt ttatcaaaat gcggattatc tctaagtcag atggaaaaaa tcaaaaacta 2220 tttgactaat gccattggct atgatttgct tccatcggta aagaacacaa gagatttgtc 2280 aatccagtta tcaatgatct ccagtttcaa agtttccact tcgttcgaca ataaaggaaa 2340 agtgattacc attgtgcagt gcatgaaaat tgcagaagtt ttagcctatc gaatagagct 2400 tctatgcaat tcaaatcaat ttgtggacga cggatacacg aaaggagtaa tcaagattgg 2460 agtcgttggt gacgctggag gtggaagcac aaagttggca ttggtaatcg gaaacgtatc 2520 tcgaccgaat tcttcgagac acgttatggt aattgcagta tatgacggat ccgacaatta 2580 ctcgtctctg aaaaaattca ttccggacgt tcttgagcag cttggcaagc taacgaaaat 2640 tcgatatttg gacaaaggag ttcaaaaaac cgcaaaaatt gtgcaaattg caactggaga 2700 ttgcaagttt caatctgata ttcttggaca ccaaggacat tcatcccaca atttctgctt 2760 caaatgtttt gcacagaatc cacggggagc tgaagagagg atgaaaatca aggatatgaa 2820 tgttgatgaa acattccacc cacgaacaat cgatctttac aaagcttgtt gtgaaaatgg 2880 cgacttgggt gttcgcaaag gatgtactcc acttctgcca cacgttgcga tcatgttcta 2940 tttgattcct cttctgcaca ttatcatggg tatttttgac aaatatatct tcaatcccct 3000 ctggaaatat tctgtcactt tggataacac tacttgtttc cctatcctga aaacacgaaa 3060 agcaactcta aaaaacgctg ttaacctaat caaaagtgcg gaagaaaagt atcaagcagc 3120 caccggaaag atgaaattgg agtcgcatgc tgaactgaaa gcacttcaat cggaaaaatt 3180 gttgctcgac acaattgtga atggtactcc cggaggaaca cttgaaaaaa tggaacagtg 3240 ttgggctaag tttggagcgg acaagcaggc ttggttccaa tcattttgcg gaaaccattt 3300 gaaattgctg ctaacgccgg ctgttgtaga ggagactttc aacatttttg gcccaaattt 3360 atgcccaatg ttgctcggat tgaaatcagc aatgggaaag ctttcaacta ttatgtcact 3420 ttcgggaaac aagtttttaa atgattccga cgtcttaata cttcaaaatt cgattcgcgg 3480 atttgtggag gatttgaaag tagcggttcc cgaagaaaca attattttaa agctgcactt 3540 gttggtctac catgcaccac aaatggcaaa agatgtgaga aacattggaa ggatcacaga 3600 acaaggagtt gaatctgtcc acgcgatttt caatgcactt gagaggagat tttgcaatta 3660 ccgggacaag aaaaggcgtt acatccatgt tctgagagag ctcatgtgtc gtaacatcat 3720 gaatgacatg actatggtaa ttttggtaca gaaaaacatt tctgcattct taaaaacaca 3780 actatttcag aacacgtcca ttcccggtct ttccatcgta ccaaaagatg caacagttct 3840 gccaatttct tcgcaaaatg atcttcggga cccaaaagca gttattcggg acgtcacggc 3900 tgcccaaaaa agaaaaaatt tagcgaaaaa aaaaatcaat tttcgctcca agaaaaggct 3960 tccacaatcg acaaaatctc gaaatgctgt ggtgagattt ttatttttca cgaaaagttt 4020 caaattttga aaattttcag cccgacttct cacaaaacgt acaaacttct attgcatttg 4080 gaggcaattc aacgaaaatt ggattccgtg gtctaccgag caatgctctt tctcgtccga 4140 acccgatggt aggttcactg ctttaaatca actcaaaatt gaaatcaaaa attatttcag 4200 gcaatcacac aaacttgctc gacaagccag ccaatcacta gaagccatcc acttgttgct 4260 ccaccagtat cttatcaact tccgaaatgc agtaatagtt tagctgaagt gagtgtaaag 4320 gtgaaatgac aatttcaata agttaaaaat tttagaaaat tcgagcaagt tgtgggcaag 4380 caagaaggag tgctgcagaa tcttcttcga ctgaaagcca cgtggacaca aattcgaaga 4440 agaaagttta cagactaaaa ttcgtccaaa aaagtcgtga aattccgaag aaatagtaat 4500 atatcgattt tctgtttttt ttattatgat ttctaatatt tttattttaa tgcttaattt 4560 aaaatgaccg aaaaataaag aattgattct caaaaagtaa tctcagagca atatatgttg 4620 tttctttata taatttatcc attttatcaa caaatcgcat ttttatcgct caaaaatcga 4680 taaaaacagc tcatcagaaa ctttcatcga gtaaaatgag acatcattcg tctgtgaaaa 4740 gcgaaatatc gattttcctt gaatatagtg aagaaaagat cttcatactt acgatttttc 4800 acaaattcga tgaaaatttg atagtttttt ttcaatttca gctctataat tccgaaaaaa 4860 atcttctgtt aattttcgcg aaatttcgtc aattaaaatg cgcgctgtgc agtacgcaac 4920 ggtgtacgaa actttaaact gagtattcaa cgttgagcac gcagtcagcc aaattggaat 4980 acggtagaga attctcgtcc atgtaccccc cgccgggccg tg 5022 // ID CER4-I_CE repbase; DNA; CEL; 4754 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER4-I_CE is an internal portion of the CER4_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER4-I_CE; CER4-LTR_CE; KW CER4_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER4_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 3105..3626 CC /note="predicted Integrase region" CC gene 1128..1454 CC /note="predicted Protease region" CC gene 2436..2795 CC /note="predicted RNase region" CC gene 1611..2147 CC /note="predicted Reverse Transcriptase region" CC gene 930..1163 CC /note="predicted Gag region" CC primer_bind 5..21 CC /note="predicted Primer Binding Site (Gln-TTG type)". XX SQ Sequence 4754 BP; 1357 A; 943 C; 1309 G; 1145 T; 0 other; atagtggcga tcaggatcaa agttggaact cacaatagaa gaagacaaga cagataagtc 60 ccaagcagcg aaggaaatag tccaaacgct ggcgctaatg cagcagcagc tagcagagca 120 gcaggtgtta aataggcaac tagttgagca tatgagcggt agggaaacgg aagcgcgagc 180 ggtggctggg ggcgagggga gaacacgtcg aaataatgtc cacttgataa acgatctaaa 240 aaaggcagtt acgaaatacc agttttcgtt ggaagagcct gattcgtttc gtcgttggtg 300 ggaccgacat catctaattt ttagtgagga cgcagcggaa ttgtcagagc gtgaacgcac 360 gcggctgttg ctgtcgtgcc tcgaggaggg tacgttccgg aggtttgtgg atacacagcg 420 gaatatttgt gaaagatata tacgaggttc cattcgtgac caccgtggaa gccctggaga 480 aggccttagg gtcgcataga agtttgatga ttcggcggca gacgtgcctg cacctaagcc 540 gcactaattg tacgtcccta gatccattga aatggacaaa tcagattagt gaagccgttc 600 tcgatgcaga gttggcaaag atgtcaacga gtgattgggg cattttcctg ttcctgaagg 660 gcttggatgc accaggtgat gcccaagcga aggcatattt aatgcagttc tacgagacca 720 gcgaccggaa aggtgagaaa ttgacgctgt cgaatatgca cgatgaatgg gttcgttttc 780 tacaaaccaa gtcccaaacg aaaatcgtct cgcattcgtc accgaagacg tctgcagatg 840 tgaacaagat cagtgtcaag tatccgccga aaaagccagc ctacattgca gccgatgcta 900 agtcagaccc tcccaagact tcaaaaaatg tgatcacctg ttatcgatgt ggagcggcag 960 gtcacatgca atatgactgt ccggagaaac gtccacagaa gaagaactac aaagcaccta 1020 cgagagaagt caaatgcatt tcagtggaag gagtcgcggt ggagtcaggc ggtagtaaaa 1080 agcgcagctt gagcgttaag gtgggaggtc ggtcggtagc gtgccaatta gatacagtta 1140 gcgatataac attattatca aagggtagtt gggttaagct gggtagtcca aaactggaac 1200 cagtcccgca caaaattgta tgtgcaaatg gttcagcgtt agatgtcttg ggaagaacga 1260 aggtttcgtt cgaattaaag ggcgttgagt atgtggatta cgcatacgtc agcaaccggg 1320 aaaacaactt gttgggaatg agttgggtta gccactctcc agaaatgaga gatgcgctgg 1380 atgtgctcgt caccgaggtg gtcgaggtaa atcaagaaga agcttctaga gtggaattgg 1440 ctctgaagca ggcctatccg gaagtcttca ctgaaggatt agggttatgt tgcaaggaag 1500 acgcaaagtt cagggttaac gaaaccgctg ttccggtctt caaacgggct aggccagtgc 1560 cgtatggatc attgcaggca gtggaacaag agttaaatag attgttggat ctagaggtgc 1620 tggagcctat cacacattcg gattgggcgg ccccaatcgt ggtcattcga aagaaggata 1680 ctggaaaggt tagagtctgt gcggatttca aatgttcggg gctgaataac tcattgattg 1740 aggagatcca tccactaccg acgagtgacg acttgttcgg gacattgcaa ggatgtattt 1800 tctcaaaaat tgatttaaag gacgcctact tacaaattgc attagattca gaatctcaga 1860 agctggctgt gatcaacaca cataagggct tgttcaagta cagaagaatg acgtttggat 1920 tgaagcctgc gcctgcgaag tttcagaaaa tcatcgataa gatgattgca ggtctgccgg 1980 gagtagccgc atatttggat gatgtgatag tatcagcgaa ttccttggaa gaacatgaga 2040 aggttctgca tgaattgttg aagcgaatca aggattacgg attccgtatt agcccagaaa 2100 agtgcaattt tgcacagtct gaaatcacat ttcttggatt tattattgac aagagaggac 2160 gtcgaccaga cccgaagaaa acaagtgtta ttcgttcgat gaaagctcca acagaccaga 2220 aacaactgat gtcgttcctg ggtgccattt gcttttatgg acggtttgtg ccgaagatga 2280 gcgagttacg aggtcctctg gacaaactac tgaagaagga tgcggattgg atctggacgg 2340 acgtggaaca gaaagcattc gaggaactgc gaaaggctgt ggcggacagt actatgttgt 2400 cacatttcaa ctcgagttgg ccgatcatcg tagcagctga tgccagcaag tacgggatag 2460 gcggagtgat tcttcacgtg aatccagatg gagttgaggt accgattgct catttcgcaa 2520 gatcgttgac ggagactgag aaaagataca gtcaaataga aaaagaagca ttagctttga 2580 tctatactgt caagaagtcg cataaatttg tgtttggtcg aagattcaag ttgcaaacgg 2640 atcatagacc gttgctggct ctgtttggag acaacaggga tctccctgtg cactcgcaaa 2700 acagaattgt acgttgggcg acgacgttga tgtcgtacga tttcgagctg tcgtatgtgg 2760 ccacggaaaa attcgcaaaa gctgactggc tctccagaat gattcaagat tacccacgaa 2820 atgaagatga tgtggttatc gctgagattc tggaggattt tgaaaatgaa gagttaggtg 2880 atgattcgag attgagccca gttttggaga aagatgtccg cgaaaaatca gcactggatc 2940 gagaacttgc cgaggtgaca cagtatgtag aaacagatgg atggagggca aggccactga 3000 ctgatgtgga aaagtactgg ggcagactga gacaaagact gaaagtgctc aggggatgtc 3060 tccttttgga cgacagagtg gttgtgccaa aaagtttaca gatgaaggtt ttacgtcaac 3120 tgcatgaagg ccatccaggt gtggtgcgca tgaagcagaa agcttgctca tttgtgttct 3180 ggacaggaat tgataaggat gtcgaaaagc tggttagagg ttgtgaaaat tgtcaggaat 3240 ccgccaaaat gccacgcgtt gcaccgttgc gtccgtggcc ggagcctcaa aaagcatggt 3300 ctcgtgtaca tatagacttt gcgggtccag ttaatggaca ctggtttttg gtgatcgttg 3360 acgcaaaaag caaatatgct gaagtgaaga tgacgaaaac catttcggcc agcgcaacag 3420 tatcgttgct agaagaagtg tttgcgacac acggctatcc agagctgctt gtttcggata 3480 atggaacgca gttcacaagt aatcagttta aactgatgtg ccaggaatat ggaatggagc 3540 acaaaacgtc agccgtctac tatcctcgtt caaatggagc cgcggagcgg tttgtggata 3600 gtctcaagag agggctagcc aagatcacca gatcaggcgt agtcactcaa caagctctaa 3660 acaaattttt aatttgttac cgtaatacgc ctcattctgc attggcaggt gctacaccag 3720 ccgaatgtca ctttggacgg aaaattcgta ctaaaatgtc attgctagta ccaagccgag 3780 agatcgtcga gggtcctttg tcggattgtc aaaaccggat gaagttccaa tacgactctc 3840 gaaacgcggc gcgtgcaaaa acatttgagc tgggtcagta cgtttatgtc agggtacaaa 3900 aggggaacgt gtggagttgg gagcatggag aggttattaa aagactaggg gaggtgcttt 3960 atgaggtcca agtggggtct cgagttcaac gaactcacgt taatcagctg cgaaaaaagt 4020 ttgtaggtga ttcggaggat gtattgttgg atactgtgta tccgtgtttt ttcgattgcg 4080 ggcttaccgg gcgaagtcgt attgcttcta tcaccagagg gcgctacctt cctcacatat 4140 cgaacatttc cctgttgatc atcccttctc cactggctcg gatgtttcca ctctcgattc 4200 cacatcgatt ctcagttgtt accggggggt cgtagggggg aagctaatga tttggaaatt 4260 gcagattcag cttttcgtag tttgtcagat tcagccggtc caggaacaca gaatgtggtt 4320 agctcttctc ggttttcgtc tggtggtttg agtgataaac aattgtatga tggaactgat 4380 actcgtggaa gattagatct aagtgaactt gaaccagaac ctgagagctc aactacgcca 4440 gagtcgccga aagcttcaac tatgggaaat cagatggtca atgccgaaaa cgggaaaggt 4500 catatcaacc aaccgagcat tgtcccaagc cagtctccgc gacgaagtca ccgcatacga 4560 tacgtgccaa accggtatga tccctgctcc gacaacacct tcacaacggg aaccggtcat 4620 gtgcgaggat ctagacatat tccactagat tcagcagcca cacacaacgg gagtcgatat 4680 aacgcttctg caggcaatcg tcttgtctct ggtccaggca acagggtgac aactcgtcat 4740 caaggagggg gacg 4754 // ID CEMUDR2 repbase; DNA; CEL; 5505 BP. XX AC . XX DT 31-OCT-2000 (Rel. 5.09, Created) DT 31-OCT-2000 (Rel. 5.09, Last updated, Version 1) XX DE CEMUDR2 is an autonomous DNA transposon - a consensus. XX KW MuDR; DNA transposon; Transposable Element; 9-bp TSD; CEMUDR2; KW LONGPAL2; MUDR superfamily; Autonomous DNA transposon; KW transposase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-5505 RA Kapitonov V.V. and Jurka J.; RT "CEMUDR2."; RL Direct Submission to Repbase Update (OCT-2000). XX DR [1] (Consensus) XX CC CEMUDR2 is an autonomous DNA transposon; there are two CC copies of CEMUDR2 in the genome; they are ~98% identical CC with each other. One of them is flanked by a 9-bp target CC site duplication. CEMUDR2 has ~450-bp nearly identical CC terminal inverted repeats. CC CEMUDR2 encodes CEMUDR2p, a 742-aa MUDR-like transposase. CC CEMUDR2p is encoded by 15 exons (based on prediction by Gene CC Finder, http://dot.imgen.bcm.tmc.edu:9331/gene-finder/gf.html). CC CEMUDR2p: CC "MLHKWLTSVDWEEIIAGKRFTEADSVDPAVLHNLLNQNKLIVRIPLEENGGFPFDGHVWS CC KCDPKNRIKHTARYEKVTVFGVTVNSRQCIDFDNRNFSRHVLPFNDFVFIYYCIQDNSYQ CC LPVDNGKKKRLTAKECEILRTMVIPHRNIHGAMEAAKALGINVTVRQLQNLSRGVPDAII CC GNSGSNAATTLETIRELERLYGTRMNHAVDSYGVLDVTFFQSFPEAIKIFLCSCPDMNVV CC QDWRRNVENICEMDEEFAKVGGVPNGKMIPCLLMDGESSLGEYGKTLDVMTIRCDYHIFS CC LMSHKYGKPVATKSKPYVFGTMENEKWKTGLLGVFTEDEYSERFEVMEKIVDTNVAQWFH CC RKRKMFFECASAMAKLLGGHLRQFSTNNISENFNGYVKCKLHKQMTIDKLILKLNEHCED CC VIVQCFLSTIGQSARVSLKEDLSELSVEQRLQFLKNIGVTGELLSALNTHPDPAVIRHAS CC VEYDQGLLKCSKCNTVKLEYTCRHFLHCLLKLRDTDRVVQWHKMLSKIKKQRIDSVKQPL CC IDPKSGHKLSDRIGCGQSSNNRKKKVAELSTLTVYKNNTNSDDDTGTANSRENLDQSLPD CC ASMSDNFSPIFSSTMINGSRRSSRSRNSTKVMDPSPSDTNESDETSGNCDVFVNRTRLLF CC DDDGDDSTVTLFNNTEKTLQFKCRIDSDEYSVKPTRGNIKPNMDKTMRVRVGDTAPGTTM CC HIFYTVKDEEEKSFKILLKYDP" CC CEMUDR2 and LONGPAL2 share 92% identical ~500-bp termini. CC There is a low 62% nucleotide identity between CEMUDR2 and CC CEMUDR1. CC Transposases CEMUDR2p and CEMUDR1p are 30% identical to each CC other. These transposases form the same superfamily together CC with MuDR-like transposases in plants and ISRm3/IS1201-like CC transposases in bacteria. XX SQ Sequence 5505 BP; 1802 A; 969 C; 998 G; 1736 T; 0 other; ggcaaatgct cttttactac ctcatctttt actacgttgt tcttttacta cggtttactc 60 ttttactacg gccgtatttt ttactacgtt ttccaatttc gatcttttac tacaatttgt 120 atcttttact acggccgtat ttttactacg ttttccaatt tcgatctttt gctacaattt 180 gtatctttta ctacggccgt attttttact acgttttcca atttcgatct tttgctacag 240 tttgtatctt ttactacggc cgtatttttt actacgtttt ccaatttcga tcttttgcta 300 cagtttgtat cttttactac gattaacgat tttttacggc ctaatgaaaa aaaaagaata 360 tcaaaagttg gaatttgaaa gttaccttgt agcagatttt aggtataatt acaatttcat 420 tctgaaatca acatgaaagt ttcatagctg tgttcttttc caacttttga tatatactac 480 tgtctgagcg gtttttaact gaatccgacg agatcataag gggaaattaa actttatgaa 540 aaaaaaacaa aaaatttgtt cgaaattttg tttaattttc tgaaaaaaaa gtatggtccc 600 tacaaccaag aaactaatta ctctacatta gtctagaata tatcgtcaat cagagcacgc 660 gctctgggtg cgaacagctg gttaaaaaat aaaaagtcgt attgggtttt tgtgagttga 720 ctactacact actcgtttat ccactgcgcc acactggccc agcttgctga accggagtga 780 ggtggggtga ctttgagaca cgccaatgac atcagaaatg gtagattcga agatgcttat 840 tctggcatca actttagaac cgcgtaagaa tactgaaaca tgttgtattc ctacttacct 900 aaatctattc ataaatacaa ttcactatat tgccgatgaa aatttcaaac agtctttaat 960 tactgaagaa cccacttatt tcatcaacac atcaaatgat tttcaatttt cgtaaagaat 1020 attgatattt tctcatccat gtaccattgc ttcaaacata gataactttt ttcctcatca 1080 agttttgcaa ttcttattat cactcgatct tttactacgc ggatctttta ctgcggcgat 1140 cttttactac gattcgctgc gagacccgtg ccttgtggag tacactcgga ggcaatttct 1200 tccttttatt ttcgaatttc ttcatttatt tccagaattt tttatttttt ttgcctaaaa 1260 taaatctctg ataagggaat gttacacaaa tggctcacta gtgtcgactg ggaggagatt 1320 atcgccggaa aaaggtcagt tgttgtataa ttgtgctgaa aaaatttagg atttgaatct 1380 tttaaaatta atttttattt gaattacaga ttcactgaag ctgacagtgt cgatccagct 1440 gttcttcaca acttgttgaa tcaaaacaag ctgattgtgc gaattccatt agaagaaaat 1500 ggaggatttc catttgatgg ccatgtctgg tcaaagtgtg atccgaagaa tcgaatcaag 1560 catactgctc gttacgaaaa agttaccgtt ttcggtgtca ctgttaacag cagacaatgt 1620 attgactttg ataatcgaaa cttctctaga cacgtaagga agtttaaact gactacataa 1680 cctaagtttt catgctttca ggtacttcct tttaacgatt tcgtgtttat ttattattgc 1740 attcaagaca acagctacca gttgcctgtt gacaatggaa agaaaaagag actaacaggt 1800 atgaataaaa aggaaaaaga aaaataatca ctaaaatgtg cttcagctaa ggaatgtgag 1860 atacttcgaa caatggtgat ccctcatcgt aatatccacg gagcaatgga ggcagccaaa 1920 gcacttggaa tcaatgtcac agtgagacag gttagattac tttcatgaaa aataaaaata 1980 aatgtatttt ccagttgcaa aacttaagcc gtggcgttcc ggatgcaatc atcggaaatt 2040 ctggctccaa cgccgccact acattggaaa caattagaga actggaacgt ctgtatggaa 2100 ctagaatgaa tcatgcagtt ggtgagtgga attcgtataa caattttgaa aaaaaaaaac 2160 aatttccaga ttcatacgga gttctcgatg ttacattctt ccaatcattt ccagaggcaa 2220 ttaaaatctt tttatgtagt tgtccagata tgaatgttgt acaagactgg aggagaaacg 2280 tcgaaaatat atgcgaaatg gacggtgaaa tgagaaagcg acaactaaag gatcttctta 2340 aaacaacccc agacggagtg tttttctatt cccgccttca tattgatacg acgtataacc 2400 ttggtgacgc atacgtcact gttattcttg gagaaactat gaactttgtg tgggttttat 2460 aatatttaaa gcgtgtctat atttttgctg taggactacc agctccgaca aatgccgagt 2520 actgccattg ggatacatgg ttcacacaaa ccgattagct gcaactcacg aaaagtttgc 2580 tgattttctt tcagaagaat ttgccaaagt tggcggcgtt ccgaatggta aaatgattcc 2640 atgtttgctg atggatggtg aatcaagttt gggagagtat ggaaaagtga gttgacttga 2700 ttttgattga aaaaacaata cgcattataa atttcagacg ctcgacgtga tgaccattcg 2760 atgtgactat cacatctttt ctttgatgag ccataaatat ggaaaacctg tcgccaccaa 2820 atcaaaacct tatgtttttg gaactatggt aatttttcca tggagttttc atataaaagt 2880 atcgtaattt caggaaaacg aaaagtggaa gactggactt ctaggtgtat tcactgaaga 2940 tgagtattct gaacgcttcg aagtgatgga aaaaattgtg gacacgaatg ttgctcaatg 3000 gtaatacaat tttctggaaa gtttatgaaa catactaatt agtgaggcaa tacgtttttg 3060 tttttagaga tacgtttata gataaaaaca tggcatagtg atttgtccgc tttatcatgt 3120 tttaccggaa tgtttgtgtt ttaggtttca tcggaaaagg aagatgttct ttgaatgtgc 3180 atcggcaatg gccaagcttc ttggtggaca cctaaggcaa ttttctacca ataacatttc 3240 ggagaatttc aatggatacg ttaagtgcaa actacacaaa caaatgacca ttgacaagtt 3300 aatcctgaaa ctcaacgaac actgtgaagg taggaaaaat aggaaaaagt acaaataaag 3360 ttaatatttt tcagatgtta ttgttcaatg cttcttaagc acgatcggtc agtctgcacg 3420 ggtctcattg aaagaggatc tcagcgagtt atcggttgag cagagacttc aattcttgaa 3480 aaatattgga gtcactggag aacttctttc cgctttgaac acacatcccg atccagctgt 3540 tatcaggtct tacgaattaa aaaaaatcca atcttaacta atgcccattt atagaaattg 3600 aagccgaaaa aagtcgccac gaagctgatg cagccgctga tgtggtaatc taccaaagaa 3660 acgatgaaat cttcaacttg agagatgact ctataaactg ttcatccagg tacccaaaaa 3720 attagacaaa atgtttctca ggactagtaa aatacattct tttcaggcac gcttctgtcg 3780 aatatgacca gggactccta aaatgctcca aatgcaatac ggtcaaattg gaatacactt 3840 gcaggcattt gtaagtaacg ccaccaggaa atagctcgtt tcaaaaaaaa cgtttcagtt 3900 tgcactgcct tttaaaactg cgtgacacag atcgggttgt ccaatggcat aagatgttgt 3960 ctaagatcaa gaaacagcga atcgattcag tgaaacagcc tctcatcgat ccgaaaagtg 4020 gtcacaaatt atctgataga atcggatgtg gccaatcatc aaataatcgt aaaaaaaaag 4080 ttgctgaatt atcaacgtaa gtttgccaat aatcaaataa ccagtaaaca ttaccattgt 4140 agccttacgg tctacaagaa caataccaac tccgatgacg acacaggaac cgcaaatagt 4200 agagaaaatc tagatcaatc acttccggat gcatcgatga gtgacaattt ttcgcctatc 4260 ttctcgtcta cgatgatcaa tggaagcaga cgatcaagca gatcacggta cattgacgaa 4320 acagtgttca agttaatatc attttttcag taacagcacc aaagtcatgg atccatcacc 4380 atctgataca aacgagtctg acgaaaccag tggaaattgt gatgtgttcg ttaatcgcac 4440 acgtcttctg tttgacgatg atggggacga ttcaacggtc actctcttca acaacacaga 4500 gaaaactctt caattcaaat gtagaattga ttctgatgag tactctgtga agccgactcg 4560 cggaaatatt aagccaaata tggataagac gatgagagtt cgagtaggtg atactgctcc 4620 tggcaccact atgcacattt tctacacagt gaaagatgag gaagaaaaat cattcaaaat 4680 tcttctgaaa tacgatcctt gaaaacctca ttttcattga atcttcaact gtaatatcct 4740 aattttttct tcttctttgt tgttctgttg atttttttaa atgttgtatc cctttttagt 4800 ataatcttga aagttccatg aagttgttgt tgttattgtt acatcgatat tgattctaca 4860 ttattattta aaagccattc cgaacagttt tcggaaaatt ctttgattag tgaaaaaaaa 4920 tcagtaataa aaaaatactt cactaaattg ggcgaattat cttactacat actgtttcag 4980 tgattgttca cttctaaaat tctagtaaaa ttcacattaa atggtgaaag aaaattgaat 5040 aagttttgga aagttttcaa aacaaaattt cgaacaaatt ttttgttttt tttcataaag 5100 tttaatttcc ccttatgatc tcgtcggatt cagttaaaaa ccgctcagac agtagtatat 5160 atgaaaagtt ggaaaagaac accgctatga aactttcatg ttgacttcag aatgaaattg 5220 taattatacc ttaaatctgc tacaaggtaa ctttcaaatt ccaacttttg atattctttt 5280 ttttcattag gccgtaaaaa atcgttaatc gtagtaaaag atacaaattg tagcaaaaga 5340 tcgaaattgg aaaacgtagt aaaaaatacg gccgtagtaa aagatacaaa ttgtagtaaa 5400 agatcgaaat tggaaaacgt agtaaaaaat acggccgtag taaaagagta aaccgtagta 5460 aaagagcgac gtagtaaaag atgaggtagt aaaagagcat ttgcc 5505 // ID CeRep52 repbase; DNA; CEL; 224 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE C. elegans repeat 52. XX KW MSAT; Satellite; Simple Repeat; C. elegans; CeRep52; KW minisatellite; repeat. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-224 RA Jekosch K.; RT "C. elegans repeat 52."; RL Direct Submission to Repbase Update (25-APR-2002). XX DR [1] (Consensus) XX CC Putative novel minisatellite repeat, consists of repeated 8mers, CC found 395 times in the C. elegans genome. Located preferably CC at chromosome ends, just one occurrence on chromosome X. CC For further analysis compare CC http://www.sanger.ac.uk/Projects/C_elegans/repeats. XX SQ Sequence 224 BP; 56 A; 55 C; 84 G; 29 T; 0 other; tgtaggcacg taggcacgta ggcacgtagg cacgtaggca cgtaggcacg taggcacgta 60 ggcacgtagg cacgtaggca cgtaggcacg taggcacgta ggcacgtagg cacgtaggca 120 cgtaggcacg taggcacgta ggcacgtagg cacgtaggca cgtaggcacg taggcacgta 180 ggcacgtagg cacgtaggca cgtaggcacg taggcacgta ggca 224 // ID RTE-1 repbase; DNA; CEL; 3291 BP. XX AC AF054983; XX DT 04-FEB-1999 (Rel. 4.01, Created) DT 03-JUN-2009 (Rel. 14.07, Last updated, Version 3) XX DE RTE-1 is a non-LTR retrotransposon, complete sequence. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; LINE; SHINE; KW retrotransposons; RTE1; RTE-1. XX NM RTE1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3291 RA Youngman S., van Luenen G.H. and Plasterk H.R.; RT "Rte-1, a retrotransposon-like element in Caenorhabditis RT elegans."; RL FEBS Lett 380(1-2), 1-7 (1996). XX RN [2] RP 1-3291 RA Malik S.H. and Eickbush H.T.; RT "The RTE class of non-LTR retrotransposons is widely distributed RT in animals and is the origin of many SINEs."; RL Mol. Biol. Evol 15(9), 1123-1134 (1998). XX RN [3] RP 1-3291 RA Malik S.H. and Eickbush H.T.; RT "RTE1."; RL Direct Submission to Genbank (21-MAR-1998)Biology, University of RL Rochester, Huchison Hall 334, Rochester, NY 14627, USA. XX DR GenBank; AF054983; Positions 1 3291. XX CC CDS join(65..193,187..3258) /note="apurinic-apyrimidinic CC endonuclease; ORF2 (translational frameshift from ORF1); similar CC to second ORF of many other non-LTR retrotransposons" CDS 65..196 CC /note="putative ORF1" RTE1 has a direct terminal 200 bp-long CC direct repeat [1]. Southern blots indicate that there are CC approximately 10-15 additional Rte-1 elements in the C. elegans CC Bristol N2 genome and a similar number is found in the genomes of CC two other geographically distinct strains. The insertion pattern CC of Rte-1 is polymorphic between these strains [1]. Database CC searches and phylogenetic analysis indicate that representatives CC of the RTE clade of non-LTR retrotransposons are found in the CC bovine and sheep genomes of mammals and in the silkmoth and CC mosquito genomes of insects [2]. In addition, the previously CC identified SINEs, Art2 and Pst, from ruminate and viper genomes CC are shown to be truncated RTE-like retrotransposable elements. CC RTE-derived SINE elements are also found in mollusc and flatworm CC genomes. Members of the RTE clade are characterized by unusually CC short 3'untranslated regions that are predominantly composed of CC AT-rich trimer, tetramer, and/or pentamer repeats [2]. Analogous CC consensus from Tammar wallaby is 95% identical to this consensus CC over the entire length. XX SQ Sequence 3291 BP; 984 A; 840 C; 732 G; 735 T; 0 other; ctctttctgt actaagcctt ttcgtggcga aagggatcaa cagataacca cttcaacaaa 60 ctcgatgatt tcgtcacatc gagagtggct tggtttggaa ccaagtgcaa attatctggc 120 caagaagtac accgtctcac gcaggccaga ggtagtgtcg aatactctct gtccgggcgg 180 ctctagattg gcttagcagg tcattataga gtatctagag acacaacttt tgctctaaac 240 agcaacccat cctgcctcag tcctccgcat ccggcctctg caaagggaaa gacagttcct 300 ggagcacgtc ggaagaggcg atctaatctc ggagtccaaa ccatatgtca atcggacttg 360 gaaattaaga agccgcggga caaaacaaga agaactgact atcgcgtgtg caccttcaac 420 tgcaggtcct tgtcttcgga tgatcgactt gctgagttgc tagaagagac gaggcgaatc 480 caattcgacg tcatcggatt gtgtgaaacc aaacgagcag ccgaggcaca cttgatacat 540 cacgacggta ccggcgtctt cttaggcaaa cggaatgaaa gttctgtatc cggaggggtc 600 ggcttcatag ttcgaagcac cctccttccc aaaatcgtag aagtacgatt tcttagtcac 660 cgcatcggct tactcacctt caaagtgagc cgaaaattca actgcacggt gattcaagtc 720 tatgctccaa ctgcggactc agaccttgag gaaatctgcg acttctacga cagtgttgaa 780 gacgtcttca gagagtgccg aagcaagtat aaacttgtta tcggcgactt caacgctcga 840 atgggatgca gacaaaacaa cgaaaggtac attggccccc atgccatgga accaagaaat 900 gatactggag agcttctcgc aacattttgt gaaaccaacc gtctgtggca cacgaactct 960 atgttcaaaa agcctatgca caaacggtgg actttcgtca gtcccgacgg gaatcacaga 1020 cacgaaattg accacattct ggccaatgga aagtttgtca cagataccac tgtactcccc 1080 tctttcacaa atggtagcga ccataggtta ctccgctgta acctccactt caacaattgt 1140 ctagcaaagt tggagcaggt cagacggaga aaacctccga aaagagtgtt ggacccggcg 1200 attgcttatg caatctcagc gacagtcacg gttcagagtg atcctgattt agataccgac 1260 tatgataatc tgattcagtc actaaaagaa cttcaagacc aagctatcgt tcgaccggcc 1320 aaccactcat ccaacagact ttcggaggaa acccgaaaac tgctgaacaa aagacgtttt 1380 atggatagaa atgatcccca attcaaatct atttcagata aatgtcgtga agcagttcag 1440 aaagaccatg aagcttttgc cagtactcgc cttctatctg ctgcgaacca gaaaaagagt 1500 ttgaaaagag ttgctaggga catcaacgaa tataagtcag ttatcccatg cctcaaatca 1560 acctctactg gtgaaagaat cacttccagg gtgaaaatgg agcaggagat tgagaagttc 1620 tacacggagc tcttcaaaag cgctgtgagc aactctcaaa catcttcaat acccgccacg 1680 gcaactccac cgccgtttct cccggaagaa attcgtcatg ttctccgttc cttcccgaat 1740 ggtaaagctg caggccaaga caaaatcagt gcagatttct tgaaatcttg ccacgataac 1800 gtcattgacc tgataacgga tcgattcaac aggtacctcc acagcagaaa tgtaccgaaa 1860 ccctggaaaa cctccaaaac cactctcatc ttcaagaaag gtgaccgtga gaatttggaa 1920 aactataggc ccatctgcct actacccgta ctctacaaag tattcaccaa gtgtttgctg 1980 aatagaatgc gaagatccct tgatgaggct caacctgtcg agcaggccgg attccgacgg 2040 tctttctcta cgatcgatca catccactcg ctccaaagac ttcttgaagt cggcagggaa 2100 taccagatcc cactgacact tgtcttcata gatttcaaga aggcatttga cagtgttgaa 2160 caccaggcaa tctggaaaag tctcgacgag caaggtgcag atggagccta tattgatcta 2220 ctgaaagagt gttataaaaa ttgtaccaca aattttaccc cattccacag gccagtcgca 2280 gtacctgtga ccaaaggagt tcgacaagga gatcccatct ctccgaatct cttctccgct 2340 tgcctcgaac acgttttccg aaagctttcc tggattgaat tgaaaggaga agctgaggat 2400 tacgatacga tccctggaat gagagtgaat ggcagaaatc taacgaacct cagatttgct 2460 gacgatattg tgctcatcgc caatcatccg aatattgcca gcaaaatgct ccaagaactc 2520 gtacaaaaat gctctgaagt aggtctcgag atcaatactg ggaagacgaa agtcttgcga 2580 aaccgattcg ctgaccccag taaagtctac ttcggtagcc cttcccccac cacccagctc 2640 gacgacgtcg acgagtacat ctacctcggt cgtcaaatca acgcccaaaa caacttgatg 2700 ccggaaatcc accgaagacg tcgagcagcc tgggctgcat tcaatggaat caagaacacc 2760 accgactcca tcaccgacaa gaagattcgt gcgaatctgt tcgactcaat tgtccttcca 2820 gcgctcacct acggttcaga agcctggaca ttcaccaaag ctctatccga acgagtacga 2880 atcacacatg cctccctaga aagacggctt gtgggaatca cactcactca acaacgagaa 2940 cgagatctcc atcgagaaga cattcgtacg atgtctctag tcagggatcc gctcaatttc 3000 gtgaaaaaga gaaagctagg atgggctgga cacgttgcga gaaggaaaga cggaagatgg 3060 accacgttga tgacagaatg gcgcccatat ggatggaaaa ggcctgttgg aaggccgccg 3120 atgcgatgga ctgattcgct gcgaaaggag atcaccactc gtgacgcaga cggagaagtc 3180 atcaccccct ggtccactat agccaaggac cgaaaacaat ggcttgctgt gatccgcagg 3240 aataccacga attcctgaag aacggatcgt ctaagtatct aagtatctaa g 3291 // ID NPALTA1_CE repbase; DNA; CEL; 177 BP. XX AC . XX DT 21-OCT-1999 (Rel. 4.09, Created) DT 21-OCT-1999 (Rel. 4.09, Last updated, Version 1) XX DE NPALTA1_CE is a putative non-autonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; MITE; KW NPALTA1_CE; TA target site; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-177 RA Kapitonov V.V. and Jurka J.; RT "NPALTA1_CE."; RL Direct Submission to Repbase Update (OCT-1999). XX DR [1] (Consensus) XX CC NPALTA1_CE is a putative non-autonomous DNA transposon. CC It generates TA target-site duplications. Presumably, CC Mariner/Tc1-like transposase has been involved in CC transpositions of the NAPLTA1_CE sequences. CC NPALTA1_CE is flanked by 49 bp-long terminal inverted repeats CC (one mismatch only) and it forms non-perfect palindrome with CC a 18 bp-long central loop. CC There are about 50-100 copies of NPALTA1_CE fixed in the CC C.elegans CC genome few millions of years ago since they are ~96% identical CC to the consensus sequence. XX SQ Sequence 177 BP; 49 A; 42 C; 41 G; 45 T; 0 other; tacagtgcat tttttttcca cttctacgac tttaaaggcg cgcgcattta tacaaaatgg 60 tcccgtcatt ggtctcgcca cgcgctcaac aaatcaatgg gatgcgcgtg gcgagaccat 120 tgcgcgaaaa tgcgcgcgcc tttaaagtcg tagaagttgg aaaaaaaatg cactgta 177 // ID TREP_CE repbase; DNA; CEL; 232 BP. XX AC X97533; XX DT 09-JUL-2004 (Rel. 9.06, Created) DT 09-JUL-2004 (Rel. 9.06, Last updated, Version 1) XX DE C.elegans DNA for telomeric repeat. XX KW TREP_CE; Telomeric repeat. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Wicky C., Villeneuve M.A., Lauper N., Codourey L., Tobler H. RA and Muller F.; RT "Telomeric repeats (TTAGGC)n are sufficient for chromosome RT capping function in Caenorhabditis elegans."; RL Proc. Natl. Acad. Sci. U.S.A 93(17), 8983-8988 (1996). XX DR Genbank; X97533; Positions 1 232. XX SQ Sequence 232 BP; 43 A; 38 C; 71 G; 80 T; 0 other; tataaatcct tgaaatttca taggatttcg cgcttttagg cttaggctta ggcttaggct 60 taggcttagg cttaggctta ggcttaggct taggcttagg cttaggctta ggcttaggct 120 taggcttagg cttaggctta ggcttaggct taggcttagg cttaggctta ggcttaggct 180 taggcttagg cttaggctta ggcttaggct taggcttagg cttaggctta gg 232 // ID RCS5 repbase; DNA; CEL; 1428 BP. XX AC . XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 23-AUG-1999 (Rel. 4.07, Last updated, Version 2) XX DE C.elegans repetitive DNA sequence - a consensus. XX KW RCS5; RcS5 element; Repetitive element. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Cangiano G. and La Volpe A.; RT "Repetitive DNA sequences located in the terminal portion of the RT Caenorhabditis elegans chromosomes."; RL Nucleic Acids Res 21(5), 1133-1139 (1993). XX RN [2] RP 1-1428 RA Jurka J., Surzycki S. and Belknap W.; RT "RCS5."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [2] (Consensus) XX CC A palindromic sequence. Putative non-autonomous DNA transposon. XX SQ Sequence 1428 BP; 453 A; 223 C; 254 G; 451 T; 47 other; ttngcaggga tagaggaaga caaaaaaaac aatatggatt aaaaatcagg gcttcgcccc 60 gattttaata aaatatatga aaccctctta aaaagttacn agaaggtttt tccttgcgct 120 tggagcgcaa aagaaaagaa aaagagctat ttagacttan ggtgcccaac tggaataaaa 180 cattggaaat cccaaaacca gcntaaggnc ncgaaanaca tagtaggacg cccaacttga 240 atgaaatatt ggaaatcctt atgacacact tnagggcccg aaaaaaacta ggatgaccaa 300 ctggaataaa atattggaaa tccttntgac acaccggcgg aatggcgcng cttaagccta 360 aatagccact tttatcaaaa tacatttgag cgnggcggnt gtaaactatt cgttctttag 420 caaaaatnan aaaaaacttt atntaaattn aaaaataaat atcatatgtt atcacacntt 480 agnatatcac gccttaattt agtaatcatt ttagttcata agtgtgagag aaagaaaaat 540 ggattaatca tcggaattan atatcatcag agtgattata ttcatcaaan gtcncactnt 600 antggagggt taatttggtt gatataaaat tcctatttca tgtaaanaaa ggttcgaatt 660 ctgtacctat gaatgctgnc gatatttnaa ttcctntctn atgggaagaa ggcctttctt 720 gtnggaanaa cggagttata aaaaaatata cgcaggtaca gaatttgaac ttttcttccn 780 atgagatngg aatttgaata ttggcattcn ataacttcac taagaaaaaa aatctccaaa 840 ggttnatcgc aaatatgtan agattgctnn catcccttca aatattacga aatttccaag 900 tttcttagtg ataatcttta ttatatatat attttttatt atatntattt tttacttttg 960 agttgcctga gatgttattt taaaacgcga gccgagctca aatgtatttt gataaaagtg 1020 gntatttagg cttaagncgc gccataccgc cggtgtgtca taaggatttc caatatttta 1080 ttccagttgg gcatcctagt tttnttcggg cccttangct taagtgtgtc ataaggattt 1140 ccaatatttc attcaagntg ggcatcctac tatgtttttc gggcccttag gctggttttg 1200 ggatttgcaa tgttttattc cagttgggca ncctaagtct aaatagctct ttttcttttc 1260 ttttgcgntc caagcgcnng gaaaaacctt cttgtaactt tttaagaggg ttagtcttag 1320 atttgggctt aagcttagtc tttgatttgg gcttaagttt aggcttagga ttaggatcaa 1380 gcttagggct acttttctcg aactaaatta taaagcacaa acttctat 1428 // ID HELITRON1_CE repbase; DNA; CEL; 8484 BP. XX AC . XX DT 25-MAY-2001 (Rel. 6.04, Created) DT 19-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE HELITRON1_CE is a Helitron DNA transposon, a consensus sequence. XX KW Helitron; DNA transposon; Transposable Element; HELICOP1; KW HELITRON class; HELITRON1_CE; helicase. XX NM HELITRON1_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-8484 RA Kapitonov V.V. and Jurka J.; RT "HELITRON1_CE."; RL Direct Submission to Repbase Update (MAY-2000). XX RN [2] RP 1-8484 RA Kapitonov V.V. and Jurka J.; RT "Direct submission."; RL Direct Submission to Repbase Update (MAY-2001). XX RN [3] RA Kapitonov V.V. and Jurka J.; RT "Rolling-circle transposons in eukaryotes."; RL Proc Natl Acad Sci USA 98(15), 8714-8719 (2001). XX DR [2] (Consensus) XX CC HELITRON1_CE is a HELITRON-like DNA transposon. It encodes a CC 1466-aa protein, CEHEL1p, composed of the rolling-circle CC initiator CC and DNA helicase domains. CC There are several copies of HELITRON1_CE in the genome, they are CC ~99% identical to the consensus sequence. CC Approximately 2% of the C. elegans genome belongs to molecular CC fossils of nonautonomous elements transposed recently (<10 Myrs) CC by autonomous HELITRON1_CE-like elements. HELITRONY1_CE and CC HELITRONY2_CE are examples of these non-autonomous elements. CC Minisatellite-like repeats RCA1, RCD1, RC35, RC123, RCC9, CC reported CC previously (Naclerio,G. et al., 1992, J. Mol. Biol. 226, CC 159-168), CC are portions of HELITRON-like transposons. XX FH Key Location/Qualifiers FT CDS join(1929..2084,2132..2413,2531..3427,3475..3801, FT 3849..4520,4581..5207,5251..5523,5573..5761, FT 5813..6799) FT /product="CEHEL1p" FT /note="rolling-circle initiator and DNA helicase FT domains" FT /translation="MDNELTSRPDSIMDTEPTTLPDSAMDTKPTNLPDFPS FT TSQEPVSRRTRSRTLSDNSNAVWKRERRSRETSQESESRLRMDWERKKEKR FT ASMSEEEKAEMKYKKRVWMKKKRNEVAKTHDTSSVANPNYLGSMRCVCKNC FT NARFFQGGEVSSSNDHINMCCYFGLTTTLQDRFSNYPDELRHLLLDDSSHA FT RNFQKNIRQFNSSFAMASMGAQLDVPRGRGPYCFKIHGQVYHFAGPLHPDP FT GQRPAFGQIYILDTDQATDERLGNPANADCDPAVMAELSSLLLRTNPYAQA FT YKMMAEVEEKENSEAAKEGRHPGCVRLIFDISTTKDPRRYNLPQANEVAVV FT FVGEDDDVPTTRSLAVHPRGGGLKTIRDIDRICDPLTYPVLFPNGTDGWHP FT DLEKRPSEKKQGRITQKMYYSYLLMERSGVFNPLHHGRALFQQFAVDSWVK FT IEQNRLNYHRTHQVDLKAANYNAVQDYIAGDTDVPGRRIVLPSSFPGSPRA FT MVQNFQDAMAIVSKFGKPDIFLTFTCNPAWTEISENLGPRQSASDRPDLIA FT RVFKLKVVDALFDDLLNRDILGHVAAYISVFEWQKRGLPHVHMLLTMAENS FT KPRTSEDIDKIVQAEIPNPDNEPELHRIVTTAMMHRPCGAQNPHSPCMVDG FT HCSKRYPKDFHPSTTLNVDGYPGYRRRDDGRYVEYGTQHLDNRRVVPYNKW FT LLLRYNAHMNVEICGFIEAVKYLFKYVYKGHDRAALNIIQNVRGDGNVVDE FT IREHLDARYVCAPEAIHHILGFKLEKKASVTTQQLESSSQTDTTLTAWFKI FT NQKSKDIAESGNIPSTFVDSRQFFYMDMPTHFTFVKKDGWKVRGRGTRQIG FT RMYTVPPYETERYALRILLLNIKGATSFEDLRTVLDENNVPVVYATYVEAA FT KAQGLLNDDSEYLKSLKEWAGCSVPAALRSMFVAIILFNEVHDLNALWDAV FT KWDLSEDFRHAGAGKEEAEALAYFDIESRLQRRVGKRFDVTKPSINPPPID FT LDTVNPAQCASEGNRLLATLNDQQKRAADQILAALDDASLPRLFYLDGPGG FT SGKTYLYITLYNICVGRGLKVVACTAWTGIAANLLPLGRTSASLFKLDIRN FT QCKSSLHQRQLKEAQELAENDVFIWDEASMVPKTALDTVDVLLRDLTKIDQ FT PFGGKILILGGDFRQILPVVERSSRADQVDACIKRSPLWTEFQILHLISNM FT RVTSGDSDWIQFLLNVGDGSANDSDSKVTLPLSVMCDHNIVEEVFGAVIDP FT TTSDPCDNVILTPKNVDVAQLNDDVHNRMVGEERIYLSRDEVIVEHQADTM FT HYPTEFLNKMSPSSLPPHILKLKKGSVIILLRNLDVSAGLCNGSRFIVETL FT ASHSLGCRFATGERKGHFTIIPRIDCYDDKNISFQLRRTQFPVRLSFALSI FT NKAQGQSFSKIGLWIPTDVFTHGQLYVALSRVRTKEGLIVKSSSNIVTNIV FT FNEVL" XX SQ Sequence 8484 BP; 2476 A; 1919 C; 1648 G; 2441 T; 0 other; tcattactta tatataaaaa aatagtgtgg gtctgtccat agtttgtagt ctatgtagtc 60 tttgtagtct gtgacgtcac ccctaaagtc actgagagtt gtgggcgggg ccagtcaccc 120 ttcgtggcga gacccatcgt tgcgagaccc ttcgtggcga gacccatcgt ggcgagaccc 180 atcgtggcga gacccatcgt ggcgagaccc atacattttg gcgggaattt aaattttctt 240 tgaaattttt cggcgggaat tcaaattttc tgtgaaaatt tttggcggta gctcaaattt 300 tctgaaacat ttttggcggg aattcaaaat ttctctgaaa atttttggcg ggaatttaaa 360 ttttctttaa aattttttgg cgggaattca aattttctgt gaaaaatttt ggcgggaatt 420 caaattttct gaaaaatttt tggcgggaat tcaaattttc tctgaaaaat tttggcggga 480 atttaaattt ccttcaaaat tttttggcgg gaattcaaat tttctgaaaa atttttggcg 540 ggaattcaaa ttttctctga aaaattttgg cgggaattta aatttccttc aaaatttttt 600 ggcgggaatt caaattttct ttgaaaattt ttggcgtgaa ttcaaatttt ctgaaacatt 660 tttggcggga attcaaattt tctctgaaaa tttttggcgg gaattcaaat tttctttaaa 720 attttttggc gggaattcaa attttctgtg aaaaattttg gcgggaattc aaattttctg 780 aaaaattttt ggcgggaatt caaattttct ctgaaaattt ttggcgggaa tttaaatttt 840 ctttaaaatt ttttggcggg aattcaaatt ttctgtgaaa atttttggcg ggaattcaaa 900 ttttctgaaa aatttttggc gggaattcaa attttctctg aaaatttttg gcgggaattt 960 aaattttctt tacaattttt tggcgggaat tcaaattttc tgaaaaattt ttggcgggaa 1020 ttcaaatttt cagtgaaaaa ttatggcggg aaattcaaat ttttagagaa attttgtcgg 1080 agtagtcaaa tttaactttg tttgacagaa aaaaaaagta tgaaggggaa gatagtttga 1140 accacagaag gttcacacca aattcccgca cacaaccaat gcgacatacc ggcacatttg 1200 tttagaattt aataaaccta tagaagagga aaacttcatg caataaactt cgagaatctt 1260 ctagaaattt cgaaaaactt ctagatatct cggattacgt aacacttgac gacgtcacac 1320 atgatgagga cattaagtaa acgtcacagg ctgaacccaa tataattttc agaaaattct 1380 ggaaccttct agaattttgg agaaacttcc agaaaattct gatgacgtca caatcactgg 1440 cgtaacaaaa aataagtcac agattatgac gtcacatatg atgacaccta tatagtctca 1500 cattcggaaa aaaataggtg atgacgccaa atatatcagc gtcacagaat gtgacgtcat 1560 aggctccgcc cattacagta aaagaaactt ctggaatctt ccagaagttg cagaaaaaga 1620 agagaatgtt ccaataacgt cacaaaagat gacgtcactg ataaagacat cacaggaact 1680 acccattaat ataatcaaga agattctaca accatctaga aacttctgga tttttcaatg 1740 tagtgtgcaa aatttaaatt ttgactattt cattgtgcaa tttttaattt atattattac 1800 aagtttaaac tatattaaat ctgtccttct aaaacccaat tcgtttcaag acccaaaact 1860 tctagaatct tccaaaaaaa ttctatataa ggagaagaaa attatcattt tctcattatt 1920 aatcaacaat ggataacgag ctaacaagtc gacccgattc aataatggat acggaaccaa 1980 caactctacc cgattcagca atggatacca aaccaacaaa cctgcctgat ttcccttcta 2040 catcccaaga acccgtctct aggaggacac gttccagaac attaaggtaa ggatcttgaa 2100 tgctcgtgga attttaccaa taagcagttt tagtgacaac tcaaatgctg tatggaagag 2160 agaacgtcgt tcaagagaaa cttcgcaaga atcagaatca agactcagga tggattggga 2220 aagaaaaaag gaaaaacgag cttcaatgtc agaagaggaa aaggctgaga tgaaatataa 2280 aaaaagagta tggatgaaga aaaaaagaaa tgaagtagca aaaacacacg atacgtcatc 2340 tgttgccaat cccaactacc ttggaagcat gagatgtgtg tgcaagaact gcaatgccag 2400 attctttcaa ggtatacatt ttttcaaagt ttagtgaact tctacttctt atttccattc 2460 tgagtttcag attggttttt tttcataact aataacatat actagcatca tacagaaatt 2520 taatgtttca ggagaggtgt cctcatccaa cgatcatatc aatatgtgtt gctacttcgg 2580 attaacaacc acacttcaag atcgattctc caactaccca gatgaactac gacatcttct 2640 cctcgatgat tcttctcacg cccgtaactt ccagaagaac atccgacagt ttaattcatc 2700 gttcgcaatg gcatcgatgg gtgcacaact tgatgttccg agaggccgcg gcccatactg 2760 cttcaaaatc cacggtcaag tttaccattt tgcgggacca cttcacccgg atccaggcca 2820 gagaccagca tttggacaga tctacatcct ggacactgat caagcaacag acgagaggct 2880 tggaaatcct gccaatgccg attgtgatcc tgctgttatg gctgaattga gcagtcttct 2940 gttgagaacc aatccatatg cacaggcata caagatgatg gctgaagttg aagaaaaaga 3000 aaattctgaa gctgcaaagg aaggacgaca tccgggatgc gttcgactga tcttcgatat 3060 ttcgacaact aaggatccac gacgttataa cttgccacaa gcgaatgaag tcgcagttgt 3120 ttttgtgggc gaagacgatg acgtgccaac cactcgctct ttggctgttc atccaagagg 3180 aggaggactt aagacgattc gtgatatcga taggatctgc gatcctctca catatcctgt 3240 attgtttcca aatggaaccg acggatggca tcctgacttg gaaaagagac catcggagaa 3300 gaaacaagga aggattactc agaagatgta ctacagttac ctgctcatgg agagatcagg 3360 agttttcaat ccactgcatc acgggcgtgc cctgtttcaa cagtttgcag tagactcatg 3420 ggtaaaggta agaatacatg atctcatgtt cacaaaataa tcatgttttt gcagatcgag 3480 caaaacaggc taaactatca tcgaactcat caagttgatt tgaaagccgc caactacaat 3540 gcggttcagg attatattgc tggagatact gatgtcccag gaagacgcat cgttcttcca 3600 tcatcttttc cagggagtcc ccgagcaatg gtccaaaact ttcaagacgc gatggctatt 3660 gtctcgaagt tcggaaagcc tgatatattt ttaactttca cgtgtaatcc tgcatggaca 3720 gagatttctg aaaacctcgg tccgcgacaa tctgcatctg atcgtccaga ccttattgcc 3780 cgtgtcttca agctaaaagt aagttgaaga tctatattta actcatcact ttcttcattg 3840 tgttccaggt agatgcgctg tttgacgatc ttctaaatcg ggatattctc ggacacgttg 3900 ccgcatacat ttccgtgttc gaatggcaaa aacgaggtct tcctcatgta cacatgctgt 3960 taactatggc agagaactca aagccacgaa cctcggagga catcgacaag attgtgcaag 4020 ctgagattcc aaatcctgac aacgagccag aacttcatcg cattgtgacc accgccatga 4080 tgcatcgtcc atgtggtgct caaaatcctc actcgccatg catggttgat gggcactgct 4140 cgaaaaggta tccgaaggat tttcatccta gcaccactct caacgttgat ggatatcccg 4200 ggtatcgtcg aagggatgat ggacgatatg tggaatatgg cacccaacat ctcgataaca 4260 ggagagttgt tccatacaac aaatggcttc tgctccgtta caatgctcac atgaatgtcg 4320 agatttgtgg attcatcgaa gctgtaaaat atctcttcaa atatgtctac aaaggacatg 4380 accgtgctgc actcaacatc atccaaaatg tacgtggaga tggaaatgtt gttgatgaga 4440 tccgagagca tctagatgct cgatacgtct gtgctcccga ggcaatccac cacatcttgg 4500 gattcaaact ggaaaagaag tccgatacag tttattgact cgctgtgcat ctcgaaggat 4560 tccaaacgat ctactttaga gccagtgtca ccacacaaca actcgaatct tcatcacaaa 4620 ccgacacgac tctcacagct tggtttaaga tcaaccaaaa gagcaaggat attgcggaga 4680 gcgggaacat cccttcgaca tttgttgatt cacgccaatt cttctacatg gatatgccaa 4740 cgcacttcac ctttgtgaaa aaggatggct ggaaggttcg aggaagaggc acaaggcaga 4800 taggccgaat gtacactgtc ccaccatacg aaaccgagag atacgcactt cgaattcttc 4860 ttttgaacat caaaggagcg acgtcattcg aagatcttcg tactgttttg gacgagaaca 4920 atgttccagt ggtatacgca acttatgtgg aagcagcaaa agctcaagga cttctgaatg 4980 atgacagcga atatttgaag tctttgaaag agtgggctgg atgctcagtg cccgcagctc 5040 ttcgctcaat gtttgtcgct atcattcttt ttaacgaagt tcacgatctg aacgcacttt 5100 gggatgccgt gaagtgggat ctttctgagg attttcgcca tgctggagca gggaaagaag 5160 aggctgaggc acttgcatat tttgatatcg aaagtcgcct tcaaaggtat cttttgagct 5220 tttttcaaaa gtagattata atatttttcc agggttggaa aaagattcga tgtcaccaaa 5280 ccatctatca atccgccacc aatcgatctc gacacagtca accctgctca atgtgcatca 5340 gaaggcaata ggctccttgc gacactcaat gatcaacaga aacgtgcagc tgaccagatc 5400 cttgctgcat tagatgatgc ctcgctaccc cgtcttttct atttggatgg gcctggaggg 5460 agcggaaaga cttatctcta catcactctg tacaacatct gtgttggaag aggactgaag 5520 gtaaactttc aaccaaacat actaaagtta tttcaatact gcatattttc aggtagcgtg 5580 tacggcgtgg acaggaattg cagccaattt actcccactc ggtcgaacat ccgcttccct 5640 tttcaagctt gatattagaa atcaatgcaa gtcttctctt catcagaggc aattgaaaga 5700 agcacaagag ctagcggaga acgacgtttt catttgggat gaagcatcaa tggttcccaa 5760 ggtattagag tgattctgat cgtactaatc aaatcattca ttcaactttc agacagccct 5820 cgacacggtc gatgtactgc ttagggatct cacaaagatc gatcagccat ttggtgggaa 5880 gatcctgatc ctcggaggag actttcgcca aatcctacct gtggtggaga gaagctctag 5940 agctgatcaa gtagacgcat gcatcaaaag atcaccacta tggacagagt ttcaaatttt 6000 gcatctcatc tcaaatatga gagtaacatc tggagactcg gattggatcc aattcctcct 6060 caatgttggc gatggatctg caaatgactc tgattcaaag gttacactcc cactctctgt 6120 catgtgcgat cacaacatcg tcgaggaagt gttcggagca gttatcgatc caaccacgtc 6180 tgatccatgt gacaacgtca tccttacacc taagaacgtc gacgttgctc aactgaatga 6240 cgatgttcac aatcgaatgg ttggcgagga aagaatctat ctgtcacgag atgaagtgat 6300 cgtggaacat caagcggata ctatgcacta tcccaccgag tttctcaaca agatgtcccc 6360 atcatctctt cctccccata ttctcaagct gaagaaaggc tcagtgatca ttctgctcag 6420 gaatctcgat gtttccgctg gcctttgcaa tggttcccgt ttcattgtcg aaacattggc 6480 cagtcactca cttggatgtc gctttgcaac tggcgagagg aaaggacatt tcacaatcat 6540 tccaagaatc gattgctatg atgataagaa catctctttc caactcaggc gcacacagtt 6600 cccagttcgc ttatcatttg cattatcgat caacaaggcc caaggacaat ccttttctaa 6660 aatcggtctc tggattccca ccgatgtttt cactcatgga caactatatg ttgctctatc 6720 aagagttcgt accaaagaag gtcttattgt aaaatccagt tctaatattg tcaccaatat 6780 cgtttttaat gaagttcttt gaataaagct tattattctt ctctgctatc tcgctagaca 6840 actcagctct cttctgaggt ctcactcgac aactcatctc ttctctgcta tctcgctaga 6900 caactcagct ctcttctgag gtctcactcg acaactcatc tcttctctgc tatctcgcta 6960 gacaactcag ctctcttctg aggtctcgct cgacaactca tctcttctct gctatctcgc 7020 tagacaactc agctctcttc tgaggtctcg ctcgacaact cagctcttct ctgctatctc 7080 gctagacaac tcagctctct tctgaggtct cactcgacaa ctcatctctt ctctgctatc 7140 tcgctagaca actcagctct cttctgaggt ctcgcccgac aactcatctc ttctctgcta 7200 tctcgctaga caactcagct ctcttctgag gtctcgctcg acaactcatc tcttctctgc 7260 tatctcgcta gacaactcag ctctcttctg aggtctcgct cgacaactca gctcttctct 7320 gctatctcgc tagacaactc agctctcttc tgaggtctcg ctcgacaact catctcttct 7380 ctgctatctc gctagacaac tcagctctct tctgaggtct cgctcgacaa ctcatctctt 7440 ctctgctatc tcgctagaca actcagctct cttctgaggt ctcactcgac aactcatctc 7500 ttctctgcta tctcgctaga caactcagct ctcttctgag gtctcgctcg acaactcatc 7560 tcttctctgc tatctcgcta gacaactcag ctctcttctg aggtctcact cgacaactca 7620 tctcttctct gctatctcgc tagacaactc agctcttttc tgctatctcg ctagacaact 7680 cagctctctt ctgaggtctc actcgacaac tcatctcttc tctgctatct cgctagacaa 7740 ctcagctctc ttctgaggtc tcgctcgaca actcatctct tctctgctat ctcgctagac 7800 aactcagctc tcttctgagg tctcgctcga caactccagc cttctcactc gcccccatat 7860 gtaggagtat ggggggaggg agtattatta taaccacaga aacacaacaa tgtgccacat 7920 aagagacagg agatagtgct cgcactaccc ctgaatcacc aaccctctca gaagcacctc 7980 tgctcttatc tgggtctcgc tagacatctc cagtagtttg accattctca gcaacacttt 8040 tgctatttta gaattttttt gccctcgcgg ggatcgaacc aaaaggtccg cgcgccaatg 8100 acgcgcgcgc taccagctgt gccatgctgg aagcgattga acttcgacca ataacaatat 8160 acaacatttc tttatttgtc aaagtgaaat agttttgaac aaacactatt caaaaaagta 8220 attttaaatt gaaattatat caaattcacg atttgttccc aaatatttaa aactgaaact 8280 caataagaga agatctcata ttctcactga taatccgaaa catgacattt gtttcaatgc 8340 tgttttcaac ctcaagttgt aaaattcaaa aaccaagctg aaaatatgtt ttaaaaatct 8400 agaaactaca ataattttta tgtgaatttt tggaaaaatc tcaagccaga cggagcacgc 8460 gccttggcgc gtgcgaacgg ctgg 8484 // ID TC4 repbase; DNA; CEL; 3483 BP. XX AC L00665; XX DT 05-FEB-1999 (Rel. 4.01, Created) DT 05-FEB-1999 (Rel. 4.01, Last updated, Version 1) XX DE Transposable element Tc4. XX KW DNA transposon; Transposable Element; TC4; TIR. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3483 RA Li W. and Shaw E.J.; RT "A variant Tc4 transposable element in the nematode C. elegans RT could encode a novel protein."; RL Nucleic Acids Res 21(1), 59-67 (1993). XX DR GenBank; L00665; Positions 1 3483. XX CC TC4 is a DNA transposon with 139 bp-long terminal inverted CC repeat. CC Putative protein: CDS join(1175..1387,1435..1720,1764..1902, CC 1961..2582,2626..2979) CC /translation="MSFDIGNVMFALHRHFPNPLPSNPSISRHEQKVADHLASIIKES CC YYGDTSVEESEYLVQEEYGDWEPEVFEPYDCVLPDQDDVLFGEKAVSREELDKAITFY CC RSGKIGSRPITTMHHSYRWIRTDAHMNKLRKYEKDKKAFQESVRVRLAQLTQRLYEEV CC KEKLDNGVNLHDSDLMAMALEINTREMKLQKQNEMKLQKASQSWITRWKQSHRIVSRR CC VTKFVTRKCLINKDAIKKNADDFVKNARTEISNYHPSMVFNCDQTGIQKELYPARSLA CC FMGEKTVERLAQSKSSLTHSFTFLPMIFLDGSMGPKAFMVIAEPKGQFPPSRPIPNCP CC NLEVRAGYKTHIMTKQLMCDFFESCVFIPSVPKKLYIMLDSWPAFKDHTTIKNLVPNG CC HDVVIRNIPEHTTGMIQPLDVYWNAPWKSLIKKFTAYALRTQTDYVIAQRNNAICMVS CC VLYHQISAEHFRPFLQHCWKKAGYVGAANTSSTPFLTPAQYCIDHGDTVICYHTGCNH CC LGFIRCARCKMFVCFNHFVVSKQHLCSSP". XX SQ Sequence 3483 BP; 1065 A; 620 C; 685 G; 1113 T; 0 other; ctagggaatg accagaataa atggagcgat attcaaaaaa aatatattgt atcggaaagc 60 tggcattctc tactataaga atatgactga aatttttgcc cgttcgggct ggaaatctga 120 aatttttacg tctgaaattc tacactgaaa tcagtgcatt tcctatggtt aacagtggat 180 ttttgtctct ggcgccaaca gaagtctcac cacaatggtg gaagggcgaa aacatcggtt 240 cggtggtcga gtggtgaacg cgttcgcctc ttgagcagaa gtttgtgggt tcggttccca 300 tacatggttt aacttttggc cttttttata caaaattttc agaacgggaa acaagtattt 360 agaacatttt tttgagggtt ttacataatt tttttgcttt ttaattgaac cataattacc 420 ctggaaactt ttcagaaatt ttaatttttt tcgaaaattg tcactttttt ctccaccaaa 480 cccatgagaa aatttgatcg aaaaattttt ttttgaaatt tttttaaaaa tgcatgaaat 540 attttagagt gtcacaaata acctattttt cattattttc aatgaccgaa tcattgattc 600 tgatgcctta tcaagacgtt ttaccaaatc gatattggca aaacatcttg tttttgaggc 660 tccatatctc tgcaggaaaa aatcgcacta aaaagtgatc aactagaaac ttgttaaaca 720 caatgtgatc taaaacattt cagttgaaca cttttttgta aaaaatttcg ttgccgagtt 780 ggaaccaatt gatttgagct tcattatttt tgaatattct aaatagttaa agatctatat 840 cttggcaacg aaatttttta caaaaaagtg ttcaactgaa aagtttcaga ttacattgtg 900 tttaacaagt tttcagttaa tcacttttta gtgcgatttt ttcctgcaga gatatggagc 960 ctcaaaaaca agatgttttg ccaatatcga tttggtaaaa ctttatttta attttttttt 1020 ttcatattat tgtttttata ttgggtcaaa acgtatttgc tctgctaaat agttagctgg 1080 cgctaatttt taattttatt attttcgtat tttgataaat ttcactgaaa ttgtcacttt 1140 aatagtttac attctattgg ttttctttta ttcaatgtct tttgatatcg gaaacgtgat 1200 gtttgccttg catcgtcatt ttcccaatcc cctcccatca aatccctcga tttcgagaca 1260 tgagcagaag gtggcagatc atttggcaag cattataaag gaatcttatt atggagatac 1320 cagtgtggaa gaatccgaat atttggttca agaagaatat ggcgattggg agccagaagt 1380 ttttgaggta tgttggtttc gacactatta attgtcattt aaaaaacctt tcagccgtat 1440 gactgtgtgt tgccggatca agatgatgta ttgttcggtg agaaagctgt atcaagggaa 1500 gagttggata aagcgatcac cttctaccgt agcggaaaaa ttggaagtag gccgataacg 1560 actatgcacc atagttatcg ctggatacga accgatgctc atatgaacaa actgcgcaag 1620 tacgagaaag ataaaaaagc gttccaagaa agcgtgcggg ttcgtttggc acagcttaca 1680 caacggcttt atgaggaggt caaggagaaa cttgataatg gtaacgaaca ttaatataac 1740 aagttttaat ataaatattt caggagtcaa tttgcatgat tctgatttga tggctatggc 1800 actggaaatc aacacccgcg aaatgaagtt gcaaaaacaa aacgaaatga agttgcaaaa 1860 agcgtcacaa tcctggataa cccgatggaa gcaaagccac aggtatgtaa taaattgcgg 1920 agtatgttag atatatatat atatatatat ataatttcag aatcgtgagt cggagagtca 1980 ccaagtttgt cacacggaag tgcctgatca ataaagacgc tatcaaaaaa aacgcggatg 2040 attttgtcaa gaatgccaga acagagatct ccaactatca cccgtcgatg gtcttcaatt 2100 gtgaccaaac cggaattcaa aaggagctgt atccagcccg gtctttagct tttatgggcg 2160 aaaaaacagt cgagaggttg gcgcaatcga aatcgtcgct gacccactcg tttacgtttc 2220 tcccgatgat tttcctcgat ggctcaatgg gacccaaggc gtttatggta atcgctgaac 2280 caaaaggcca gtttcctccg tctcgtccaa ttccaaactg cccaaatttg gaagtgcggg 2340 ctggatacaa gacacacatc atgacgaagc aattgatgtg cgattttttc gaaagttgtg 2400 tcttcattcc gtctgtaccg aaaaaactgt acatcatgct ggacagttgg ccagcgttca 2460 aggaccatac aacgatcaag aacttggttc ccaatggtca tgatgtcgtc attcgcaaca 2520 ttccagagca cacaactgga atgatccaac cgttggatgt ctattggaat gcgccatgga 2580 aggtatgaat ttaaatcttt ataacttttt gatagtattt tccagagctt gatcaagaag 2640 ttcacagcat atgcccttcg aacccagacg gattacgtca tcgcacagag gaacaacgca 2700 atttgcatgg tatctgtgtt gtatcaccag atctcggcag agcacttccg accgttttta 2760 cagcattgtt ggaagaaagc tggatacgtg ggtgctgcga atacttcatc aacaccattt 2820 ttgactccag ctcaatattg cattgatcat ggtgacacag tgatttgcta tcacactgga 2880 tgtaaccatc tcggattcat ccgatgcgca agatgcaaga tgtttgtttg ttttaatcat 2940 tttgttgtgt caaaacaaca tctttgttca tctccttgaa taattaataa attcatgatt 3000 gcattacatt caagtttcgc tatgaataaa attgtttata tgaggggatc catggttgta 3060 gtggtccatg gttaagttta aattttaggg gaaaagatgt tgaatgaatc attatggtgt 3120 tggtcgacac attgatcatg ctgatatcga tcagaaaaat taaaatttgg gcttttatat 3180 agtttcgaaa aaacacgttt aaatttttga aaaactttac gttagggtgg aattgaaccc 3240 cagagtccca ccatgcttgt catccactct atccactcgg ccatctcgct gttgcaggca 3300 gtactaattg gggatggtga taagtaaagc acgtcaaaac gaataatttc agacgtaaaa 3360 atttcagatt tccagcccga acgggcaaaa atttcagtca tattcttata gtagagaatg 3420 tcagctttcc gatacaatat tttttttttg aatatcgctc catttattct ggtcattccc 3480 tag 3483 // ID MARINCE1 repbase; DNA; CEL; 1286 BP. XX AC . XX DT 09-SEP-2005 (Rel. 10.09, Created) DT 10-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE Mariner-type DNA transposon from C. elegans - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MARINCE1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1286 RA Jurka J.; RT "MARINCE1: Mariner-type autonomous DNA transposon from C. RT elegans."; RL Repbase Reports 5(9), 263-263 (2005). XX DR [1] (Consensus) XX CC There are multiple recent copies of this element in the genome, CC >99% identical to their consensus sequence. XX FH Key Location/Qualifiers FT CDS 156..1163 FT /product="MARINCE1_1p" FT /translation="MRASPMREPIVRFHRNGVAAKSIARRLKVSEKLVSTT FT IARFKELGNFSDRSGRGRPPTVTTPAMIKKVRGRFRHNSGRSVRAMARELK FT ISQSSLCRMVKNNLKLKAYKKSTCQFLSEAAKIKRKDRAMNLLRRFRNGAH FT RKVLFTDEKIFCIEQSFNTQNDRVYAKTQPNSRVQRTGYPKGIMVFAGITA FT NGKTPLIFVPQGIKVNGNNYLDMLKTELMPWVKKHFKKTKWTFQQDGAPAH FT KHKNVQAWCESNFPDFIAFNQWPPSSPDLNPMDYSVWSVLEAKACSKPHRN FT IDSLKDSLKKAWDELDINYLRATVDSFPRRLEACVAANGDIFEL" XX SQ Sequence 1286 BP; 370 A; 273 C; 289 G; 354 T; 0 other; tacagggtga gtcaaaatta tggtaagtcg aggcgcgcgc gcgaaagttc tgtggttcgg 60 gtctccccgc caaaattaat tgattatgtt atttcgtgtt ttgttcattt atttttccat 120 cgtttttcct aaaaaataat ttttttttcg caggtatgag agcgtcaccc atgcgtgaac 180 ccatcgttcg ttttcatagg aatggtgtcg cggcgaagag cattgctcga cggctcaaag 240 tgtctgaaaa acttgtttct accaccattg cacggttcaa ggagctcggc aacttttccg 300 atagatccgg acgcggaaga ccccccacag tcactacacc agcgatgata aaaaaagttc 360 gtggacgctt ccgtcacaac tccgggcggt cggtccgtgc gatggcccga gaattgaaaa 420 tttcgcaaag ttctctgtgc cgaatggtaa aaaacaactt gaaattgaag gcttacaaaa 480 aatctacttg tcaatttctg tctgaagctg ctaaaatcaa gagaaaagac cgtgccatga 540 atctactccg aagattcaga aatggagctc acaggaaggt acttttcacc gacgaaaaga 600 tattttgcat tgagcagtct tttaatacgc aaaatgatcg agtttatgca aaaactcagc 660 caaactcccg cgttcagcgt actggttacc cgaaaggaat catggtcttt gccggaatca 720 ctgctaatgg aaagactcca ctcatttttg ttccacaagg gataaaagtt aacggaaaca 780 actatttaga tatgttgaag acggaactca tgccctgggt caagaagcat ttcaaaaaaa 840 caaaatggac ttttcagcag gatggggcgc ctgctcataa gcacaagaac gttcaggcat 900 ggtgtgaaag caacttcccg gacttcatcg ctttcaatca atggccaccg tcatccccgg 960 atctcaatcc gatggattac tcagtatgga gtgtcctcga agcaaaagct tgttcaaaac 1020 cgcataggaa tatcgattcg ttgaaagatt cactgaagaa ggcatgggac gagttggata 1080 tcaactactt gcgcgccaca gtcgactcgt tccctaggcg tctggaggcc tgtgttgctg 1140 ctaatggaga tattttcgaa ctttaaaatg tttttctata gtgtttttta cggtgaataa 1200 aatgtgtata gttagaagtt gtttgtgttt ctggaaaaat gtatggagca cgggaactac 1260 ttaccataat tttgactcac cctgta 1286 // ID MARINER5_CE repbase; DNA; CEL; 1247 BP. XX AC . XX DT 21-OCT-1999 (Rel. 4.09, Created) DT 21-OCT-1999 (Rel. 4.09, Last updated, Version 1) XX DE Autonomous DNA transposon from C. elegans - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW Autonomous DNA transposon; mariner superfamily; MARINER5_CE; KW TA target site; TIR; transposase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1247 RA Kapitonov V.V. and Jurka J.; RT "MARINER5_CE."; RL Direct Submission to Repbase Update (OCT-1999). XX DR [1] (Consensus) XX CC MARINER5_CE is a prototype of one of the oldest Mariner-like CC DNA transposons in the C.elegans that were active 15-30 million CC years ago. There are about 100 copies of the element per the CC haploid genome, they are ~90% identical to the consensus CC sequence. CC MARINER5_CE is flanked by TA target site duplications and CC it has perfect 27 bp-long terminal inverted repeats. XX SQ Sequence 1247 BP; 348 A; 283 C; 261 G; 355 T; 0 other; ttaggttggc gggaaagtcc tgtcctaatt ttttgttttg tccttgtggg ctgaatatca 60 attttggtgt ttattggctc ctagtgtttg ataagcagaa aaaagtcttt gcagtaatta 120 accccccatc cctctccctc tctctcggtt caagttcact tgcgatgagt gtacctacct 180 caattcttgt tcggcatgtt ctgttataca tgctcctggc gcacaccaca atgaaggatt 240 caaaaactct tctgatgaaa gtttatgatg gacctgctcc ctcgtataaa actatacaga 300 agtggttcga tcgtttcgaa aaaagcgact tttcttagat tgaaaaggta tgctcaaaag 360 accacatgag ttggatttga atgcactaaa agccagtatt gcatcggatc catttcaaat 420 tacccgtgag cttgcaagca cgtttggggt ctctcataac tccataccag ataacatgaa 480 attgctcggc atgaggaaaa ttcttcgtcg attcctacct cattatttga cccgttccaa 540 cctttttcac caagttgatg tttcataaac tcttctcact ctccagccgg cgaccgttgg 600 ttgggctggc ttattattga aaacgagaag tgggtgttgt acgctaatca ccacagacaa 660 gctcagtgga ttggagaagg acaaaccgca caggatgttc cgaaactcgg ccttcacccg 720 aaaaaaagta tgctctctgt ttggtcggga gtggatggcc cctctactgg gagctgttac 780 ctgagggtga aactataacc gtagagaccc actctagaca actacggaat ttgagggaaa 840 cagtcgatcg ttcagcctta caggataaga aagtatattt ccaacataat aacgctcaac 900 cacatgttgc aaaacatgtg aaggaggaac tggcaaaatg tggatggccg attctccctc 960 aaccaccgta ttctcctgac atttctccaa gtgattattg gcttctctcc aatctgactt 1020 gtgccctccg aggtagaacc ttcacaacac gagaaaatct ccaaactgat atcgagctat 1080 actttgaatc cctgccagct ggattctaca ggcagggaat ccacaagtta actgaatgtt 1140 ggcaggacat tgttgaacat gatggagcat ataactgatt tctagtttga gtttgcttcg 1200 agaaataaat gattgaaaaa taggacagga ctttcccgcc aacctaa 1247 // ID CERP4 repbase; DNA; CEL; 911 BP. XX AC . XX DT 28-JUL-1999 (Rel. 4.06, Created) DT 28-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE C. elegans repetitive DNA (a consensus). XX KW CERP4; CERP4 repeat; Repetitive DNA. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-911 RA Jurka J.; RT "CERP4."; RL Direct Submission to Repbase Update (JUL-1999). XX DR [1] (Consensus) XX CC A palindromic sequence. TA-target site duplication. CC Putative non-autonomous DNA transposon. 81% similar to CC inverted repeat annotated in Genbank Accession No. U04712. XX SQ Sequence 911 BP; 315 A; 135 C; 139 G; 319 T; 3 other; tattaggtct ccaaataagt tccgggtcaa aaatcataac tttgttcgct gcgtatcgat 60 ttttatgaaa ctgtgggaat ttatgttatc aaccatgatc tttcatttga caatagtcac 120 aaaatttttt gaccacccga agtgccctaa ctcggagcca attttttcag gcatttttct 180 gatctcgctt cttttcagct ttgaattgag gtttgtgtgc ggattttgct ttgtttagaa 240 tacattrtwa gaaaacaaca aaagtttgga aaaaaatccg tccaaaaaat tttttttggt 300 cggtcgtcaa aaaatcttca aaaaaatttt ttgtcgaaaa ttcttgattt tttatacaaa 360 aatgatgtaa ccaagtgtaa actattttta cacatacaaa acatttcaat ttagttcgat 420 acactaaaat gataatagaa aatataattt tttcggaaaa tttttgagtt ttttgagtat 480 ttctcgagat tcaaatttta aattcaaatg ttttgtatgt gtaaaaatag tttacacttg 540 gttacatcat ttttgtataa aaaatcaaga attttcgaca aaaaattttt ttgaagattt 600 tttgacgacc gaccaaaaaa aattttttgg acggattttt ttccaaactt ttgttgtttt 660 cttacaatgt attctaaaca aagcaaaatc cgcacacaaa cctcaattca aagctgaaaa 720 gaagcgagat cagaaaaatg cctgaaaaaa ttggctccga gttaggrcac ttggggtggt 780 caaaaaattt tgtgactatt gtcaaatgaa agatcatggt tgataacata aattcccaca 840 gtttcaaaaa aatcgatacg cagcgaaaaa agttatgatt tttgacccgg aacttatttg 900 gagacctaat a 911 // ID CELE1 repbase; DNA; CEL; 329 BP. XX AC . XX DT 23-AUG-1999 (Rel. 4.07, Created) DT 20-SEP-1999 (Rel. 4.08, Last updated, Version 2) XX DE CELE1 nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE1; MITE; KW nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-329 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification and characterization of putative transposable DNA RT elements in solanaceous plants and Caenorhabditis elegans."; RL Proc. Natl. Acad. Sci. U.S.A 92(19), 8886-8890 (1995). XX RN [2] RP 1-329 RA Kapitonov V.V. and Jurka J.; RT "CELE1."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [2] (Consensus) XX CC Several hundred CELE1 copies are dispersed in the C.elegans CC genome. They are ~88% identical to the consensus sequence. CC CELE1 is flanked by 145 bp-long terminal inverted repeats and CC TA target-site duplications. XX SQ Sequence 329 BP; 108 A; 56 C; 56 G; 105 T; 4 other; tattacggga acacaaaatt ctgagaatgc gtattgcaca acatatttga cgcgcaaaat 60 atctcgtagc gaaaactaca gtaattcttt aaatgactac tgtagcgctt gtgtcgattt 120 acgggctatc gattttcgaa atgaattaaa atttaatraa tattgtawtc rmtattcgaa 180 aataaattca tttcgaaaat cgagcccgta aatcgacaca agcgctacag tagtcattta 240 aagaattact gtagttttcg ctacgagata ttttgcgcgt caaatatgtt gtgcaatacg 300 cattctcaga attttgtgtt cccgtaata 329 // ID PAL9A_CE repbase; DNA; CEL; 1947 BP. XX AC . XX DT 21-OCT-1999 (Rel. 4.09, Created) DT 21-OCT-1999 (Rel. 4.09, Last updated, Version 1) XX DE PAL9A_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW MuDR; DNA transposon; Transposable Element; Nonautonomous; KW 9 bp target site duplication; MUDR superfamily; NPALTA1_CE; KW PAL 98A_CE; PAL9A_CE; RCC14; TIRs; nonautonomous DNA transposon; KW palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1947 RA Kapitonov V.V. and Jurka J.; RT "PAL9A_CE."; RL Direct Submission to Repbase Update (OCT-1999). XX DR [1] (Consensus) XX CC PAL9A_CE is a putative non-autonomous DNA transposon; it CC forms almost perfect palindrome and generates duplications CC of 9 bp-long target sites upon its integration in the genome. CC Presumably, multiple transpositions of PAL9A_CE were induced CC by MUDR-like transposase. There are about 50-100 copies of CC PAL9A_CE in the genome and they are ~99% identical to the CC consensus sequence (two minor subfamilies). CC PAL9A_CE is a composite transposon since it carries another CC nonautonomous transposon NPALTA1_CE (position 888-1058). CC RC14, an unclassified satellite-like repeat, is a portion of CC PAL9A_CE terminus. XX SQ Sequence 1947 BP; 612 A; 361 C; 359 G; 612 T; 3 other; ggcaatctat cccatgctca catttcccat gcacacattt cccatgctga cactttctcc 60 cgtgcataca aaattcccat gcttacgccc agtagttcga gtactcccat gcctacaatt 120 aatttttctc ctttttttcc gttcccatgc tgacattccc atacatacat tttctcccgt 180 gcatacaaaa tattctgatg cttacatcca gtagtcgaag tactcctatg cctacaactc 240 atttttccct tttttttcgt tcccatgctg acattcccat acatacattt tctcccgtgc 300 atacaaaatt cccatgctta catccagtag tcgaacgttc ccatgccgac attcccacac 360 atacatttgt tcccatactt acaaaaatgt attcccatgc cgacatggar cakttttgtg 420 ctccagaaga gcacaaagtc aatttttttg ctcattttca ttgtattagg cttacttgac 480 agctttcagt tacaactgat cgcttttttt ttcaaaatgc attagttctc gattatgaac 540 aaggaatcga agaaaaatgg ggatgactgc ctatagagta aaattttttt ttgctcaaaa 600 aaaatttttt ttttgaaaat tttttttctc tctattttgt aggatataat taattcggtg 660 aattccacaa cttcaaatgt tcatattgct gtttccgtgc agttatgagt ttcgcagtgt 720 cttaggaaca ttccattttt tgttgcattt tgtaggaatt cggagctatc ttataatttc 780 tacaaattag aaaatccatt tcgcattatt tcatgtaact tgaacctatt gcttcaaaaa 840 tagcacagaa cgaatgttga ttctgtttct ttcattaaaa aatgtgctac tgtgcatttt 900 ttcccacttc tacgacttta aaggcgcgcg catttataca aaatggtccc gtcattggtc 960 tcgccagcgc tcaacaaatc aatgggatgc gcgtggcgag attattgcgc gaaaattcgc 1020 gcgcctttaa agtcgtagaa gtgggaaaaa atgcacagta gcacattttt taatgaaaga 1080 aacagaatca acattcgttc tgtgctattt ttgaagcaat aggttcaagt tacatgaaat 1140 aatgcgaaat ggattttcta atttgtagaa attataagat agctccgaat tcctacaaaa 1200 tgcaacaaaa aatggaatgt tcctaagaca ctgcgaaact cataactgca cggaaacagc 1260 aatatgaaca tttgaagttg tggaattcac cgaattaatt atatcctaca aaatagagag 1320 aaaaaaattt ttcaaaaaaa aaattttttt tgagcaaaaa aaaattttac tctataggca 1380 gtcatcccca tttttcttcg attccttgtt cataatcgag aactaatgca ttttgaaaaa 1440 aaaagcgatc agttgtaact gaaagctgtc aagtaagcct aatacaatga aaatgagcaa 1500 aaaaattgac tttgtgctct tctggagcac aaaactgttc catgtcggca tgggaataca 1560 tttttgtaag tatgggaaca aatgtatgtg tgggaatgtc ggcatgggaa cgttcgacta 1620 ctggatgtaa gcatgggaat tttgtatgca cgggagaaaa tgtatgtatg ggaatgtcag 1680 catrggaacg aaaaaaaatg gaaaaaatga ggtgtatgca taggagtact tcgactactg 1740 gatgtaagca tgggaatttt gtatgcaccg gaaaaaatgt atgtatgaga atgtcagcat 1800 gggaacggaa aaaagggaaa aaatgagttg taggcatggg agtactcgaa ctactgggcg 1860 taagcatggg aatattttgt atgcacggga gaaaatgtca gcatgggaaa tgtgtgcatg 1920 ggaaatgtga gcatgggata gattgcc 1947 // ID TC6 repbase; DNA; CEL; 1603 BP. XX AC X55356; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE C.elegans transposable element Tc6.1. XX KW Transposable Element; Repetitive DNA element; TC6; KW Tc6.1 sequence. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1603 RA Dreyfus H.D.; RT "TC6."; RL Direct Submission to Genbank (29-AUG-1990)Dreyfus D.H., Albert RL Einstein College of Medicine, of Yeshiva University, 1300 Morris RL Park Avenue, Bronx NY. XX RN [2] RP 1-1603 RA Dreyfus H.D. and Emmons W.S.; RT "A transposon-related palindromic repetitive sequence from C. RT elegans."; RL Nucleic Acids Res 19(8), 1871-1877 (1991). XX DR GenBank; X55356; Positions 1 1603. XX SQ Sequence 1603 BP; 492 A; 313 C; 303 G; 495 T; 0 other; cagtgctcca cataatgata cggccacccc caaattttgg tataactcaa aactgggttg 60 agatagcaaa acatagtttc ttgtgaaaat gttcgctgta ctggctaact ttcagataag 120 tattggaaat atacctgaac cgttcgtaaa aaaagataaa ccattttttc atgaaaaacc 180 atataaaaaa atccacaaaa tgatacggcc acccttggtt tttgttttct tttttcgttt 240 tttttgcaat tttttttgct aaacgttagg tttcatgttc gtttgtgttt ttacagctat 300 gggccgtgga ataactttaa ctgacaacga aaaaggacaa attgtgcaaa attatctcaa 360 ggcttctcgg atcgtcagat ttttcgtgat ttgaaacgtt tgagggatat gatcactcga 420 tatgcttcaa atcctgccgc ttattgcacc aaaaagtctt ctggtcgccc accactcctt 480 tctggtagag acaagcgaaa aatcgttcgt cgagcattaa attgaacagt gacttgctcg 540 aaaagtagga gcgagatgaa cctgccagtg tctgttgaga ccgtacgacg tgtccttcga 600 agtcccagtt tatcaaaaga cgaaaattaa taaaggctaa tttcattacc gaaaaacact 660 gccaaaatcg tattcagttt gctaaaatca gccagagaac taacggagac aagtgaggat 720 tatggtataa tcattcaagc ccagtttttg gtttcagatc atcttttctt ttctcaaatc 780 gtgccagtaa tcacggtagc catcaggacc atcacagtta aactttttct cgccactgaa 840 gatgaactga aaccaaaaac tgggcttgaa tgattatacc ataatcctca cttgtctccg 900 ttagttctct ggctgatttt agcaaactga atacgatttt ggcagtgttt ttcggtaatg 960 aaattagcct ttattaattt tcgtcttttg ataaactggg acttcgaagg acacgtcgta 1020 cggtctcaac agacactggc aggttcatct cgctcctact tttcgagcaa gtcactgttc 1080 aatttaatgc tcgacgaacg atttttcgct tgtctctacc agaaaggagt ggtgggcgac 1140 cagaagactt tttggtgcaa taagcggcag gatttgaagc atatcgagtg atcatatccc 1200 tcaaacgttt caaatcacga aaaatctgac gatccgagaa gccttgagat aattttgcac 1260 aatttgtcct ttttcgttgt cagttaaagt tattccacgg cccatagctg taaaaacaca 1320 aacgaacatg aaacctaacg tttagcaaaa aaaattgcaa aaaaaacgaa aaaagaaaac 1380 aaaaaccaag ggtggccgta tcattttgtg gattttttta tatggttttt catgaaaaaa 1440 tggtttatct ttttttacga acggttcagg tatatttcca atacttatct gaaagttagc 1500 cagtacagcg aacattttca caagaaacta tgttttgcta tctcaaccca gttttgagtt 1560 ataccaaaat ttgggggtgg ccgtatcatt atgtggagca ctg 1603 // ID CEREP4 repbase; DNA; CEL; 190 BP. XX AC . XX DT 01-DEC-1999 (Rel. 4.11, Created) DT 01-DEC-1999 (Rel. 4.11, Last updated, Version 1) XX DE CEREP4 dispersed repetitive element - a consensus. XX KW CEREP4; dispersed clusters. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-190 RA Kapitonov V.V. and Jurka J.; RT "CEREP4."; RL Direct Submission to Repbase Update (NOV-1999). XX DR [1] (Consensus) XX CC This element does not have any noticeable hallmarks of known CC retroelements or DNA transposons. There are about 100 copies CC of CEREP4 in the C.elegans genome. They are ~73% identical CC to the consensus sequence. CC There are multiple clusters composed of tandemly repeated CC copies of CEREP4 in the genome. The most conservative portion CC of the repeat is a 40-bp 5' terminus. XX SQ Sequence 190 BP; 75 A; 15 C; 19 G; 79 T; 2 other; ttttgggtcc cgccacgaaa actcctgggg tactgtagtt ttgattattt ttgaatttaa 60 atattatttt tmtaaaaatt aaatattttt tacacatttt aaaaaaaaaa ttagctaaaa 120 atcttaaaaa ttgacaaagt tatgaatttt tgaaaatttt ttgaaattta aaaaaaattt 180 tttttsaaaa 190 // ID CERP3 repbase; DNA; CEL; 1365 BP. XX AC M20500; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE C.elegans CeRep3 repeat DNA. XX KW CERP3; CeRep3 repeat; Repetitive DNA. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1365 RA Felsenstein M.K. and Emmons W.S.; RT "Nematode repetitive DNA with ARS and segregation function in RT Saccharomyces cerevisiae."; RL Mol. Cell. Biol 8(2), 875-883 (1988). XX DR GenBank; M20500; Positions 1 1365. XX SQ Sequence 1365 BP; 468 A; 271 C; 297 G; 329 T; 0 other; tctatataaa ctcttgaaaa aagtgttaca gctaaaggtg ggggaatcca ggcctttaaa 60 gctaaagccc gtcaattggc taacgaaatc tgggagattc aaaaaaaggc ctgattctca 120 gaaactcctt tgtgtggaat cggggactaa aagctgaaaa tcttcaacta caatcatgtc 180 agctggttaa atttaaaaaa aagtaacaca gagagctggg gcggccaaaa tcgaattttt 240 aagaccgaat cgggaagcta ggaaatgggg attcttttta aaaagcaaag cggccgcaga 300 catgtttcta tcgagatcca gaatatgagg tgaggatggt ggctgcactt gaatcctagg 360 atttataatt taccacatta tttgaagagt tgtgactatc acattataag ctgaaaaata 420 ccgtatgacg tagaatctga tggcgaggaa aaatctgaaa aaaatcggca aaaccaggcg 480 aattgtcgga attgaaaatt tccaggcaaa tcagacaaaa acaagcaaat tgctggaatc 540 ggaattgaaa atttccaggc aaatcagaca aaaacaagca aattgctgga atcgaaattc 600 aggcaaatgc ccaaaccgtt ctgaatggtc gagggaaggt ctgagttgag agatcggcaa 660 accggcgaat tgtcggaatt gaaaatttcc ggcaaatcga caaaacagca aattgctgga 720 attgaaaatt ttccggcaaa tcggggaaaa cggcaaaatg tcgaatttga gcttgatttg 780 ccggaaaagt taaaatgatt taccgaattt gccgaaaaac gacaattacc gaaataaaaa 840 ttttcgacaa aattgacaat gcgggaacta aacaattccg gcaaatcgac acaccggcaa 900 tttgccgatt tgtcgaacgg caatttgtcg aacggcggca atttgtcaag agtggaagct 960 catgattttt gatttgccaa aatgccaaaa cccaacaaat taattttcag aatcaacacg 1020 ggctctttct cggatgtccg aaacttccga caactcggct actgtcctcg gttcgacgct 1080 ctcaacatga agctgagtac tagggagaac ttgaagttct acgcgaggat tcgaggaatt 1140 gtaccggctc aaatcatcga tagtcggtaa aggtcttttt aaaaattcga aataaataaa 1200 aaaacaaatt tttccagata atcgatcgtc tgctgattgc tctctacctt ccgtccctac 1260 gcaaacaccc agaacgtcaa gcctctccgg tggaaatcga cgaaaactct ctgtcgcagt 1320 ggcgctggtc agtcagccat ctctgatctt cctcgacgaa ctatc 1365 // ID CER9-LTR_CE repbase; DNA; CEL; 517 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER9-LTR_CE is a long terminal repeat of the CER9_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER9-I_CE; CER9-LTR_CE; KW CER9_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER9_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 517 BP; 98 A; 176 C; 87 G; 156 T; 0 other; tgttgtgaga tgatcccatc tcccaattat ctggtactct gagacactcg cgaatactcc 60 tatcgtgttc gccgtaccag aaaacctata aaccgctcct ttccagtcgc tctatctccc 120 cctttctcca atctaatgcg cgccctcatt tgaatctccc gccaagctca atggtctcgc 180 agcgagtgtg cgccgcgggt ctcaccgcgc gcgccctcct tgctcgtcct aattcgctgg 240 cctaaattag ttttacctca actcccttca ctcgtctcgt cgccgcacca gcggcacgtt 300 ccttctttta atttaattcc tttctattta aacacttccc tttctatcat tccttagcta 360 ataaatgggg tttcatggaa ataacatgtg ttactcgttc acaccgctta tcctcctctt 420 cgctcggcac tctgcaaacg ccagttctct cttagactct tcttccggct tgaccccagt 480 tgcggctccc caacccgctt acactgggcg gtcaaca 517 // ID CER16-1-LTR_CE repbase; DNA; CEL; 562 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER16-1-LTR_CE is a long terminal repeat of the CER16-1_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER16-1-LTR_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER16-1_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (MAY-2003). XX DR [3] (Consensus) XX SQ Sequence 562 BP; 117 A; 174 C; 95 G; 176 T; 0 other; tgttgtgact tgggcctaac gcttcccctg cgtcccaact ttgcagatat atggccggtc 60 aggctcaccc actcgccacc actcattatc cgcgaataca ttcactctcc gtcgtccttt 120 tccgtttatt atctattaat aataattccc ttttgaatta ccgccaattg cgtatttccc 180 gatgcaagcg cgctcctccg agacctcccg ttactccgga gcgtcgtagc tccgtaaatc 240 cacacaataa ttctatccta cttttaggcc aatcgttcat tctgttctac tttctccagg 300 cgagtcagcg gctacccatc cgcagacact gtgattcagg acctttcttc accatttcca 360 tcctgttaat cactttaatt cctatttata tcctctgaat aaactggaat tcgtctgaaa 420 ctgtgtactg ttattctctc cgcaccgctt atcaccaatg ccgctcgaaa attgctccgt 480 gtctctccgc gcgtatctga aatacgctgt ttgtcactcg gtagggtgac tgactcccct 540 ctactggtcc cgagtgtcaa ca 562 // ID CER6-LTR_CE repbase; DNA; CEL; 218 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER6-LTR_CE is a long terminal repeat of the CER6_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER6-I_CE; CER6-LTR_CE; KW CER6_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER6_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 218 BP; 62 A; 50 C; 31 G; 75 T; 0 other; tgttgggaag tcacgtgaac gtcctgtgtg gcactaacgt gttcgcccgg acgtaaagat 60 cttatcacaa cattccacgt attggttatt tattatgatg acaattcagt tcttactttt 120 cacgagtatc taatatactc tctatcctgc acacctgcta actacaatat tgcttatttc 180 ttgctcaata aactcttcta caatatcaat acataaca 218 // ID Vingi-2_CE repbase; DNA; CEL; 1460 BP. XX AC . XX DT 17-AUG-2010 (Rel. 15.08, Created) DT 17-AUG-2010 (Rel. 15.08, Last updated, Version 3) XX DE A family of Vingi non-LTR retrotransposons - consensus sequence. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; Vingi-2_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1460 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 2..1420 FT /product="Vingi-2_CE_1p" FT /note="includes a part of reverse transcriptase FT domain and a CCHC zinc-finger motif." FT /translation="NMKQPAKRTVMTCLDQRAAFDKVPIKKLCVDLHESLN FT EAKISKWLTNYLSKRIIKANFGNHSSKWCTLLGGVPQGSVLSPNLFSFYMR FT NMPIHPKTEITVYADDTTILAQDTNPFKAGELIQEHLDLLYDFFNERKMSI FT SAEKSTTTLFTCDTKEFKTKIPVNWNGAEIPTTNQVRLLGVHLNTMCGSKD FT HVAETRKKMMVSTNILKALSGTKWGCSKELLISTAKALVKPIAIYGAPAWS FT QLLSETNLNKLEAAYRNTLRTCCGLTSDTPIEHIYAEAQMMPLKEEFELAK FT QQTYIATTKSGTHPNRNLESRRNTERRSNRPREPPLNLQEATKTHLSRLKG FT TKAEAQKANHTRFVREFLNSAPPNRILGATPPPLNSSEESSLPKATRTELA FT RLRCGHSLLIPRYKARIEGTPIGTCGCGDNEGGIDHYLSCGRSHPISKEKL FT WTSPMEAARALGLAYKPYDPGGGATTSV" XX SQ Sequence 1460 BP; 445 A; 388 C; 347 G; 280 T; 0 other; caacatgaag cagccggcaa agcgcacagt gatgacctgt ctcgatcagc gcgcggcctt 60 tgataaggta cccattaaga agctctgtgt cgatcttcac gaatctctaa atgaagccaa 120 gatctcgaaa tggctcacga actatctgag caaacggatt atcaaagcta actttgggaa 180 ccactcgtca aaatggtgta cactgctggg aggagtgcct cagggctcgg tacttagccc 240 gaatctattt tccttttata tgagaaatat gccaattcac ccaaaaactg agatcaccgt 300 gtatgcggac gacactacaa tccttgccca agacacgaat ccatttaaag ccggggagtt 360 aattcaagag cacctggatc tcctctacga tttcttcaac gagaggaaaa tgtcgatctc 420 ggctgagaag tccacgacga cactcttcac atgtgatacc aaggagttca aaacgaagat 480 tccggtgaat tggaatggag cggagatccc gacgacaaac caagttcgtc ttttaggagt 540 acacctgaat acaatgtgtg gctcaaagga tcacgtcgcg gaaaccagga agaagatgat 600 ggtttcgacc aacatcctca aagcactctc agggaccaaa tggggctgct ccaaagagct 660 actgatctcc acggcaaaag cactggtgaa gccgattgcg atttacggtg cccctgcctg 720 gtcacaactt ctatcggaaa cgaacctgaa caaactggaa gccgcgtacc gcaacacctt 780 gaggacttgc tgtggtctga ctagtgacac tcccattgag catatctacg ctgaagccca 840 aatgatgcct ctgaaggaag aattcgaatt ggctaaacaa caaacttaca ttgccacgac 900 aaaatctggt acgcatccga acaggaatct ggagtcgaga aggaacaccg aaagaaggtc 960 taacagaccc agggagccac ctctaaacct acaggaggcg accaaaaccc acttgagcag 1020 actcaaaggt acgaaggcag aagcgcaaaa ggccaatcac acaagattcg ttcgcgaatt 1080 cttgaactcg gccccaccca acaggatcct tggagccacg ccgccaccat tgaatagttc 1140 tgaagaatct agcctcccaa aagccaccag aacggaactg gctcgacttc gctgtggaca 1200 ctccttactg atccccaggt acaaagcgag aattgaagga acccccatcg ggacatgcgg 1260 atgtggagac aacgagggag ggatcgatca ctacttatcc tgtggaagat cgcatcccat 1320 atcgaaagaa aaactatgga cgtccccaat ggaggccgcc cgagccctcg ggctggccta 1380 caaaccatat gacccgggtg gaggagcaac gacgagcgtc tgaaaacggc gctgcgccgc 1440 ctgttgcaca acaacaacaa 1460 // ID PALTA4_CE repbase; DNA; CEL; 198 BP. XX AC U86950; XX DT 07-SEP-1999 (Rel. 4.08, Created) DT 07-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE PALTA4_CE is a putative nonautonomous DNA transposon, a consensus DE sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW IR-5; mariner/Tc1 superfamily; PALTA4_CE; TA target site; TIR; KW nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-198 RA Devine E.S., Chissoe L.S., Eby Y., Wilson K.R. and Boeke D.J.; RT "A transposon-based strategy for sequencing repetitive DNA in RT eukaryotic genomes."; RL Unpublished. XX RN [2] RP 1-198 RA Devine E.S.; RT "PALTA4_CE."; RL Direct Submission to Genbank (21-JAN-1997)Molecular Biology and RL Genetics, Johns Hopkins University School of Medicine, 725 N. RL Wolfe Street, Baltimore, MD 21205, USA. XX DR GenBank; U86950; Positions 1 198. XX CC PALTA4_CE has 99 bp-long terminal inverted repeats; it forms CC palindrome and is flanked by the TA target-site duplications. CC There are ~20 copies of PALTA4_CE dispersed in the C.elegans CC genome. XX SQ Sequence 198 BP; 61 A; 34 C; 36 G; 67 T; 0 other; tattacggga acaataaatc atgagaatgc gtacttggtg cattttattt gacgcgcgat 60 atctcttagc gaaaactaca gtaagagctt aaatatctat tgttatttga gttcttactg 120 tagttttcgc taagagatat cgcgcgtcaa ataaaatgca ccaagtacgc attctcatga 180 tttattgttc ccgtaata 198 // ID TC1A repbase; DNA; CEL; 855 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE C. elegans TC1A nonautonomous DNA transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW TC1 family; TC1A; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-855 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification of putative nonautonomous transposable elements RT associated with several transposon families in Caenorhabditis RT elegans."; RL J. Mol. Evol 43(1), 11-18 (1996). XX DR [1] (Consensus) XX CC TC1A is a non-autonomous DNA transposon flanked by 345 bp-long CC terminal inverted repeats and related to the TC1 transposon. XX SQ Sequence 855 BP; 288 A; 138 C; 133 G; 296 T; 0 other; tacagtgctg gccaaaaaga tatccacttt cagttttttg acgatttcga tattttttcc 60 aatgggcata acttcaaaac taggaaaggt accaaaaaat tttcaactgg taaaatgtag 120 ctcgtgatca ggcctatgta tttttacatg ttgcaattat tcatccatca catggcaagt 180 aataaagcgg cgggcatctc gtgagtccgt ttttgacgat gattactaaa acgactgtaa 240 ctcaagaaac atatttttaa tgaaaggttt gagaaagtaa caaaatgttt atttaatttt 300 tcattgtttg aacatatcaa ctttgtccta aaacctccat ttaaaaaaat gtatgcgctg 360 aaactagtgt ctcattagac actgtttaga ggctttgttc aaaaatcagg tttcttggat 420 tgaaaatctt tttccgacaa tttttgtgaa gaactgatgt tgttattata tttgaactac 480 atattaacca ataaacagaa ttcagaaaaa aaatctacta tttttgagac atgagctcaa 540 gattactcta cacttctctt ctttaaactc tcttgtcttt ttcaatggag gtttcaggac 600 gaagttagta tgttcaaaca ataaaaaatt atataaacat cgtgttactt tctaaaaacc 660 ttccattaaa aatatgtttc tcgagttaca gtcgttttag ttaatcatgt aaaaatacat 720 aggcctgatc acgagctaca ttttaccagt tgaaactttt ttgatagctt tcctagtttt 780 ggagttatgc tcaatggaaa aaatatcgaa atcatcaaaa aacagaaagt ggatatcttt 840 ttggccagca ctgta 855 // ID CELE6 repbase; DNA; CEL; 158 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE CELE6 nonautonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE6; MITE; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-158 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification and characterization of putative transposable DNA RT elements in solanaceous plants and Caenorhabditis elegans."; RL Proc. Natl. Acad. Sci. U.S.A 92(19), 8886-8890 (1995). XX DR [1] (Consensus) XX CC About 50 copies of CELE6 are present in the genome; they are CC flanked CC by 50 bp-long imperfect terminal inverted repeats and TA target- CC site duplications. XX SQ Sequence 158 BP; 48 A; 36 C; 36 G; 38 T; 0 other; tattacgaaa ccatcagttc atgagaatgc ctaccttctg gcgcgaaaat agcggcaaca 60 gagagaacaa gcgacgacga gagatttagg tgccttcgct acgagatatt cccgcgtcaa 120 aacggtaggc attctcatga tctcatggtt tcttaata 158 // ID LTR1_CE repbase; DNA; CEL; 499 BP. XX AC . XX DT 07-SEP-1999 (Rel. 4.08, Created) DT 07-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE LTR1_CE is a putative long terminal repeat from an unknown DE retrovirus - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; LTR1_CE; KW Long terminal repeat; retroelement; Retrovirus. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-499 RA Kapitonov V.V. and Jurka J.; RT "LTR1_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC LTR1_CE is a putative long terminal repeat from an unknown CC retrovirus. There are about 20 copies of the repeat in the CC C. elegans genome; they are ~93% identical to the consensus CC sequence. Solo LTR1_CEs are flanked by 5 bp-long target-site CC duplications. XX SQ Sequence 499 BP; 129 A; 94 C; 123 G; 152 T; 1 other; tgtttcggag tggtttccgg cccatgaggt actgatagga actaacaaaa gactcttatt 60 taattaactc atttattacg agtgattcaa ggggaaagcc ttgaatcact aacactcaaa 120 cattcattta aacgtgccac cgtcgggcac gtgctgcggt ggtgaagacc gatgaagact 180 gccgacgact gacgatatag gccaatgtgc gctgcgccat gatgacaaga gtacggtcgc 240 agtcgttgtt gtatgtgtcg tggcgagacc tcatgcgcct tgaatcctcc ttttggcggt 300 agtttgaatt ggggtttctg acgccgtcct tttgcatggt tgttgccgtt gattgaagag 360 atgatgttga gtgttgaaga gcaagtgtta cgtaataaca atgaatcaat ctttacattt 420 aattcatttg ttaataaatc yttagattta ttaataagtc gtgagcgagt gaatctttgt 480 attcaccgac ttcgcgaca 499 // ID TIR9TA1_CE repbase; DNA; CEL; 322 BP. XX AC . XX DT 09-JUN-2000 (Rel. 5.05, Created) DT 09-JUN-2000 (Rel. 5.05, Last updated, Version 1) XX DE TIR8TA1_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW TA target site; TIR9TA1_CE; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-322 RA Kapitonov V.V. and Jurka J.; RT "TIR9TA1_CE."; RL Direct Submission to Repbase Update (JUN-2000). XX DR [1] (Consensus) XX CC TIR9TA1_CE is a putative non-autonomous DNA transposon; it CC has perfect 9-bp terminal inverted repeats and putatively CC generates duplications of TA target sites upon its integration CC in the genome. CC There are ~200 copies of TIR9TA1_CE in the CC C.elegans genome and they form multiple subfamilies and ~85% CC identical to the TIR9TA1_CE consensus sequence. XX SQ Sequence 322 BP; 59 A; 121 C; 84 G; 58 T; 0 other; tagggctccc atgaggtcgc cgcaacggcg cctccgccgg cctcagcggt cgccgcgccg 60 gcctcacggc gtcactcacg ccgcatgtat ccgaatgtat agtgcggcgc ggaaccccga 120 acgtgtcggc cgcttccaaa taactacctt ttcgcactac gttgcgcaca caccaagcta 180 ctcgtttcac gccaagctgc ggaaccccga acgtgtcggc cgcttccaaa taactacctt 240 tcgcacttca ttgcacacac acagcggcgc gcggcgtcgg cgtgaggccc gcatttcgcg 300 cctcactcag ctgggagccc ta 322 // ID PALTTAA3_CE repbase; DNA; CEL; 602 BP. XX AC . XX DT 12-JUN-2001 (Rel. 6.05, Created) DT 12-JUN-2001 (Rel. 6.05, Last updated, Version 1) XX DE PALTTAA3_CE is a non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; PALTTAA3_CE; KW TIRs; TTAA TSD; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-602 RA Kapitonov V.V. and Jurka J.; RL Direct Submission to Repbase Update (JUN-2001). XX DR [1] (Consensus) XX CC PALTTAA3_CE is a non-autonomous DNA transposon; it CC forms almost perfect palindrome and has generated CC duplications of the TTAA target sites upon its integration CC in the genome. CC The C. elegans genome contains ~50 copies of PALTTAA3_CE. CC PALTTAA3_CE copies are ~94% identical to the consensus CC sequence. PALTTAA3_CE is 66% and 64% identical with PALTTAA1_CE CC and IR3_CE. A family of transposases, involved in propagation of CC these elements, remains to be identified. It's possible that this CC family belongs to the PiggyBac superfamily of transposases CC because CC of the the TTAA target site duplications. XX SQ Sequence 602 BP; 202 A; 95 C; 95 G; 206 T; 4 other; ttaaaggagg actaacggtt cggacgattt tgaacgtata gacccaaaat gagctcaaat 60 gaacgaattt cgtaatgaaa ctgctcaaaa aattttcaaa aattttttat ggcggttcaa 120 aattttgaaa aaattacact gattttrgct aaaatcacga atttttccca tttttccgtg 180 tcacatctgt ccgaagttgg cttttttgga attatcgtcc tttattacat atattggtar 240 ttwatctcat ttaatttcgt cgattaaagt acatttaaag ccgataggta accaatttcg 300 atgatttttg gttacctatc ggctttaaat gtactttaat cgacgaaatt aaatgagata 360 aactaccaat atatgtaata aaggacgata attccaaaaa agccaacttc ggacagatgt 420 gacacggaaa aatgggaaaa attcgtgatt ttagcyaaaa tcagtgtaat tttttcaaaa 480 ttttgaaccg ccataaaaaa tttttgaaaa ttttttgagc agtttcatta cgaaattcgt 540 tcatttgagc tcattttggg tctatacgtt caaaatcgtc cgaaccgtta gtcctccttt 600 aa 602 // ID PAL8A_CE repbase; DNA; CEL; 389 BP. XX AC . XX DT 25-AUG-1999 (Rel. 4.07, Created) DT 25-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE PAL8A_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp-long target site duplication; hAT superfamily; PAL8A_CE; KW TIRs; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-389 RA Kapitonov V.V. and Jurka J.; RT "PAL8A_CE."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [1] (Consensus) XX CC PAL8A_CE is a putative non-autonomous DNA transposon; it CC has almost perfect 34 bp-long terminal inverted repeat and CC generates duplications of 8 bp-long target sites upon its CC integration in the genome. The PAL8A_CE nucleotide sequence CC forms a palindrome-like structure (its unpaired central loop CC is only several nucleotides long). CC Presumably, multiple transpositions of PAL8A_CE were induced CC by HAT-like transposase. There are about 50-100 copies of CC PAL8A_CE in the genome and they are ~98% identical to the CC consensus sequence. CC There are several subfamilies related to PALTA1_CE CC in the C.elegans genome; some of them share only 10-20 bp-long CC termini. XX SQ Sequence 389 BP; 110 A; 74 C; 85 G; 120 T; 0 other; tagggctgtg cggctgatga ctcggctgat ggagacaatc aagtttttca acgtgttagc 60 actgatcaat taaagaatat ttgtatttgc agtattggct gagtcatgag cttcttattc 120 tagaaaaaag ttttttttaa agtgcacaca ccttgacaaa acagtcggct ttttgaatca 180 gccgagtaac tcggctgtcg gctgtcggct gattggaaaa acgagttttt ctggaagggg 240 tgtgcacttt tttaaaaact tttttctaga attagaagct caagactcag ccgatatggc 300 aaatacaagt attccctaat tgatcattgc taatacgttg aaaaacgttg atttgctcca 360 tcagccgagt catcagccgc acagcccta 389 // ID CER6-I_CE repbase; DNA; CEL; 4675 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER6-I_CE is an internal portion of the CER6_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER6-I_CE; CER6-LTR_CE; KW CER6_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER6_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 1006..1191 CC /note="predicted Gag region" CC gene 3167..3688 CC /note="predicted Integrase region" CC gene 1204..1526 CC /note="predicted Protease region (frameshift)" CC gene 2508..2866 CC /note="predicted RNase region" CC gene 1683..2219 CC /note="predicted Reverse Transcriptase region". XX SQ Sequence 4675 BP; 1396 A; 974 C; 1074 G; 1231 T; 0 other; attggcgatc aggacttcga acaaatgccg ggaggtaaga aagataataa agcagagggg 60 gatgctaatg cagctctcct gcaaactctt gccgctatga gagcccaaat ggaggctcaa 120 gggagagcgt cgcgtttagc tcttgaaaca cagactaaaa acgctgagcg tcagctcgcg 180 cttaccgaaa agttgatgga ggagaatcgt tttctccgtg aggctagtgg ttctggggaa 240 atgcctagac aggctgctgg gggtgaatcc cgcgttcaaa ataatgcgcg tataatgtct 300 gatttaatca agcgcgttcc aaagtttcaa tattcagtta cggagcccga caccttcaaa 360 agatggtggt cgcgctacca gctaaccttc acggaggatg gcaaagactt aactgaacgc 420 gaacgaaaca gaatgctttt agaatgtctt gaagaatcgg catatcatcg ctacgtggat 480 tcacaaagag atgtgacaga ttttctcgga attccattca cggaaactct ccaaaatctc 540 accaaagcgt ttggttctca tcgctcgatg atgatgcgga gacaacaatg tcttcaaatt 600 tcgcgcgcta atggagaatc actagaccca ctggaatact caaaccgagt tggagacgca 660 gtcctagaag caaaacttgc atccatgtct actgacgagt ggtctgtatt catattcttg 720 cgtggattag atgcgcctgg agacaatccg gctaaagctt atcttatgca gttttgtgaa 780 acagctgaaa agaagggcga aaagctcaag ctttcagata tccacgacga atggatccgt 840 ttcatccaaa tgaaacaaca aactaaggtc gtctcagctc ttccagtcaa gcctactcct 900 caacaagcag tcgaggtgaa caaagttgat cctcgcactc ccaaaaagaa gaagtttaag 960 cagaagcaga aacattccaa gtctggaaat gagaaaaagg aattcattcc aacatgtttt 1020 acctgtaaaa aatccggcca tactgctccg aattgtccaa ataattcaaa caacaattct 1080 aatgctgttc caaagaatgg taaaagatgt caaatcatcg aagttggcgg attagaagtt 1140 gaatctgaac catcgaagca cagcctgact gtgactgttg aagataaaga tttagcgttc 1200 cttctagata caggcagtca aatcacgttg attaacgtcg atagttggaa agaaattggt 1260 tgtttttgca aaaagttact catagcattt gctgcgcgaa tgggacacca atgacggtta 1320 agggtcgagt tcgtgtcaat ttcaaactca aagaggtcga ttacacagac tatgtgtacg 1380 tgtgcgaccg agacacaaac ttgatcggga cgagttggtt gaaccactct ccagaaatgc 1440 gtgctgccat tgctattatg gttaatcagg ttgttgtcaa cactgatagc gaagcaaacc 1500 gtcttgccga gactctcaaa agggaatatc ctgacgtgtt ttctgacgaa ttgggtcttt 1560 gtattaaaga aaaagctcaa ttacgtactg aagaaggtat aaaaccggta ttcaaaaaag 1620 ctcgtccagt accttttgga tctcgcgaag ctgtggagaa agagttattt cgactgaaag 1680 atatgggtgt catcacacca atcacctatt ccaaatgggc tgctccaatt gtagtcataa 1740 ggaaaaagga gtcgaataag attagagtct gtgcagattt caaatgttca ggtcttaatt 1800 cagctctaaa agatgaaatt catccgcttc ccacatgcga agatttattc gccaagttga 1860 aaggttcagt cttttcaaag attgactttc gtgacgccta tcttcaaatt gagttagatg 1920 acgaatctca agagttagca gtcatcaata ctcacctagg tctattcaaa tacgtgagaa 1980 tggcttttgg gttgaagcct gcaccggcag tgtttcagaa gatagtggat aagctgattt 2040 cgggcattcc tggagtgtct gcttatctgg atgatttgat tattgctgct agttcttacg 2100 acgaacatga agccatcctg cgtcaactct ttgatcgaat ccgtgaatat ggatttcgag 2160 tcagtcctga gaagtgcatg tttgcagtct cggagatctc gtttctggga ttcatcgtgg 2220 atgaaaaagg tcgtcgacca gacccaaaca aagcgtcaaa aatccgttcg atgaaggcgc 2280 caggagatca aaaacagcta tcatcatttt tgggtgctgt gtgtttctac agtcggttcg 2340 ttccgaaaat gtcggaattg agagggccac tcgacagact catgaaaaag gatgtcgaat 2400 ggtcgtggac aaaaactgag caaatcgctt ttgagaagct caagaataca gttgctgatg 2460 cgactatgtt gtcccacttt cacgaagatt ggcctataat tgttgccgct gatgcaagca 2520 actacggaat aggaggtgtt ctttgtcacg tcaatccaga cggtgtcgaa gttcccattg 2580 ctcactatgc tcgatcgctt acagctactg aacaaaagta cagtcaaatc gagaaggaag 2640 gattggcgtt gatctttaca atcaagaaag ctcacaaatt tattttcgga agaaaattta 2700 agcttcagac agatcaccaa ccgcttctcg caatcttcgg ttcgaagaaa gacttaccag 2760 ttcattcgca aaatcgactt gtgcgctggg ccacaaccct catgtcatat gacttcgatt 2820 tgagttgtcc acttccaaat ttgccaaagc cgactggttg tctcgtatga ttcaggaatt 2880 tccacgtaat caagatgatg tggttatcgc cgaaatctgg gacgatgaca acgatgattc 2940 cttgagttcc aacttcactc cagtcatgca agaagacatt caagtagctt ctgagaaaga 3000 ctatgaagtc tccgcggtga tgaaattgtt atcggataat tcctggaaat caaaaccgga 3060 taccgatatc gagaagtact ggagtcgact caaggatcgt ttgaagatca tacaaggttg 3120 tttacttgtt gatgatcggg taattgtgcc aaaacagcta caatcaatgg ttctaaaaca 3180 gttacatgac ggtcatccag gagtagtgca catgaaacaa aaagctcgtt cttttgtctt 3240 ctggagaggt ctcgattcgg agattgagaa acttgtacgt caatgcaaca actgtcaaga 3300 aaacgccaaa atgccaagag ttgttccatt aaaactttgg ccggaaccag aaaaaccttg 3360 gacacgaatt catgttgatt tctgtggacc actaaatgga cagtggttat tggttgtcgt 3420 tgatgctaag tccaagtacg cggaggtcaa attgacaaga agcatttctg ccatgtcaac 3480 ggtggatctg atggaagaga ttttttcaat tcatggttac ccggaagttc ttgttagtga 3540 taatggtaca caatttacca gtcacttctt caaaaagatg tgtgagtctc atggaattat 3600 tcacaagaca tctgctactt attatccaag atctaacgga gctgcagagc gcttcgtgga 3660 tactctcaag agaggtatcg ccaagatcaa aggagaaggt tcagtcaatc aacagattct 3720 gaacaaattt cttattagtt accgcaatac gccgcattcc gctttgtctg gtgctacacc 3780 agctgaatgt cactttggta gatcaatcag aacaactatg tcgttgttga tgccgaagcc 3840 agacgcaaat catcaagctg atctttctga ataccagaaa aagatgaaac agcagtatga 3900 ttcacgcaac ggtacgcgcg ctaaacattt ccaggttggt caacaagttt atgtcaaggt 3960 tcaacatgga aataagtcgg aatgggatta tggagttgtt tcgcggaaga ttggttcggt 4020 gttgtacgaa gtgcaagttg gctcgaggct ccaacgttct catgttaacc agttacggtt 4080 gagatacgga gacaagagca gtgtggagaa atttgaggat acaatttatc cgatgttctt 4140 cacttttgcc aatcattcgg ataagtttaa caattctcct ggttttaatc acgtttcgag 4200 tccgagggtc cacggggggg aaactaatat tttttattct gcaggtttgt ccgcagatag 4260 tttgctccca gcttctcaac gtgttggttt gtcgagggag gacggtggcg atgctgacaa 4320 caattcattg gttttaccta atggggattc cgatgctaac atggctcgtg gaagacctaa 4380 agaactaaag aagagtaata gcaaagtgcc aagtcatcaa tcaacgtcca agcctacaat 4440 gcgatcaatg aagtctacag tgctcacgaa cgtctcaagt gatctaccag ccaccgacct 4500 aagtcattct ttacgacgaa gccaacgact ccgcaacgct tcggcacgat atacgggtca 4560 acggaacatc cacaactggt caatgctcca ctcgcaacag cagcatccac aaccctgctg 4620 gtcgtacgat taactctcgc ccagttaatg gcaactcgtc acttaaggag ggagg 4675 // ID PAL5A_CE repbase; DNA; CEL; 792 BP. XX AC . XX DT 31-AUG-1999 (Rel. 4.07, Created) DT 14-JUN-2000 (Rel. 5.05, Last updated, Version 2) XX DE PAL5A_CE non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CERP5; IR-2; KW PAL5A_CE; TIR; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Devine E.S., Chissoe L.S., Eby Y., Wilson K.R. and Boeke D.J.; RT "Direct submission."; RL Direct Submission to Genbank (MAR-1997). XX RN [2] RP 1-792 RA Kapitonov V.V. and Jurka J.; RT "PAL5A_CE."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [2] (Consensus) XX CC PAL5A_CE is a putative non-autonomous DNA transposon; it CC forms almost perfect 396 bp-long palindrome and generates CC duplication of 5 bp-long target site upon its integration CC in the genome. CC PAL5A_CE can be still mobile since some of its individual CC copies are more than 98% identical with the consensus CC sequence. XX SQ Sequence 792 BP; 249 A; 146 C; 149 G; 247 T; 1 other; cactgcaact ttttcctcac gagggacgag gaaaagtggt ttctaggcca tggccgaggg 60 gccgacaagt ttcagcggcc atttatcttg ctttgttttc cgcctgtttt ctttcgtttt 120 tcaccgattt ttcccgtttt ttcttattaa aactgataaa taaatatttt ttgcagatgc 180 taaaacaatt tccaagtaaa aaaattatgt attcagtggg caagcagcgg tgaaagtggg 240 caatgtaata tgatggatta cgggaataca aaacctaaac tttttctgaa acatgataca 300 tatgctgctt agatgctgaa actacctgat tttcataacg agaccgctga aaaagttttg 360 aggttttcaa aattcaactt ttttggtgaa aaagtcgaga ttttcgcaca aaaagttgaa 420 ttttgaaaac ctcaaaactt tttcagcggt ctcgttatga aaatcaggta gtttcagcat 480 ctaagcagca tatgtatcat gtttcagaaa aagtttaggt tttgtattcc cgtaatccat 540 catattacat tgaccacttt caccgctgct tgcccactga atacataatt tttttacttg 600 gaaattgttt tagcatctgc aaaaaatatt tatttatcag ttttaataag aaaaaacggg 660 aaaaakctgt gaaaaacgaa agaaaacagg cggaaaacaa agcaagataa atggccgctg 720 aaacttgtcg gcccctcggc catggcctag aaaccacttt tcctcgtccc tcgtgaggaa 780 aaagttgcag tg 792 // ID CER15-1-I_CE repbase; DNA; CEL; 6211 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER15-1-I_CE is an internal portion of the CER15-1_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER15-1-I_CE; KW CER15-1_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER15-1_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC primer_bind 1..15 CC /note="predicted Primer binding Site (Gly-TCC type)". XX SQ Sequence 6211 BP; 2258 A; 929 C; 1413 G; 1611 T; 0 other; tggtgcatcg aaccgaatct aactgattac ccatttaaat aactcgaaat agtcattatt 60 tatctcaaaa ataaaagacc tcttctggat gatgctccga agaggcgaaa tcacctggaa 120 aaaagacctt gggcggctac gagccaaaca aggcaacttg gatccttctg aatggaaaaa 180 ttcatttgga aacgctcatg gagcgaaaaa attactggaa acctcaaaag ggacggggtt 240 atcttcggaa aatactttcg acctgtaaaa tgacattttt acaggcggaa atcggaaaaa 300 acagtgccgt gatgtaaatt atagggtaat ctatgcccga attttcattt cgtccaggaa 360 aatgctcaaa attccgttaa aatagcggtt ttgagatcca aattcttatt aaaatctgga 420 aaacacgaga gattccattt tttagaagag acacaggaag agaatgtgcg ctccaccgca 480 aaaaagtggt gagagagcac acaaaagtgg gagacgcaga gaatatgcgg tcttttccat 540 cagccggcaa gaatgaattt aaattggcat ttttgaagaa atctttgtca atttcagatg 600 ttttctttca atttaagttg aattgatgcg tttttgatgt gattgaatcg ataattcctt 660 cacaaatcgt aatttccgtt gaaaattgtc gtttttccgc cgattaacct tgaaatcaag 720 tgttttcagc caaagagcga cgaaaaattc gagaaacgga ccggccaagc tgcggaacta 780 caggaatcgc tggagaaatc aataaacaca gaaatgttcc aatggaagcc tcaaagaaga 840 acaaaggatt ttccggcgaa aatctcggct gatattggat gaacgacgga aaaagggaaa 900 ttccacggat ttggcgacga aattctccat aaaagcagag aaaaaggacg aaaaggacca 960 gatgctggcc tgaaaagaca tcgtatggaa aggatcagga aatcggagaa ccagagaagg 1020 tccaaccgac ggaagcaaag aattccgagg tattaattta aaaaaagttt taaaaaattc 1080 ggaaaattat gaaaataaaa agaaatcttg tctttaaatt taaaagacaa ttctcacttc 1140 gacagagaga aattttgacg gaaaataaac aatttttaat gaaaaattgc ggatataatc 1200 ctgaaaatta atggaatatt atcattattg aatcgataat tttcagacgg tgtggtccag 1260 caacgagtca gaacaggcaa gaagacttca caaatcggaa agaaattgga gcacagtcgg 1320 atcgggaaaa ttcgccagcg aatcacaaga ctccgggata aacgaagaat ttaaatattt 1380 ctaaataaaa aatgtgcctt caataaaaat taatttattt aaataaatta atttatgaat 1440 cgagtgtttt gtattgagaa atacatgaaa tggaatgact tcctcagaag taggaagcat 1500 aatatgggat caaatgtttg attggaagga aactttgaaa taaattttca atgattattg 1560 gaacaaaatt gttgctcagt gtgtctggat cttgaactta gaatgctttc agaaagtaag 1620 atggaatgtc aagcatggga aatagcagga aaatggaaat tgggagtgtt attctaaaaa 1680 agaatcacta gaaattggat attttggaaa ttttttgaga ggaattgtca ctggatttcg 1740 gaacaaaaat aatttttcga atatgaaatt gtgaagtgtc cctaggaaaa ggaatacaat 1800 tatcaatgaa tttgaaacta tggcttgaca tgttgagtgt tggaatacgg ccataaatgg 1860 aatatttgag tcaaaaattg aaatatggag ccaaattcta gaatacgaaa atcgaattct 1920 agatcgggaa aacgttgaaa ttcgacaaat tgagatcaga tgtcgaagga agaacttatt 1980 ctgaaattgg aatgaaacct agaaaaaaaa tagttaggta gtgtgttcat atgctacagt 2040 acatcgggcc attctcccga aatttactgg gagtcatact gaatatgagg tataattcct 2100 atgagtgatt tgaaaaaatt ctcccgatac ttaatgggag tcttatggaa aactggaaac 2160 tggtactgga actggaactg ggaactggga aatctgaaaa gctaggaaaa ctggaactgg 2220 gaaagaggat ctgtggaact ggattagtgg aaatctggaa atctggaaat tggaactgga 2280 aaatggaaaa ctagagacct gaaaactggc aagctggaat ctggaattgg aaaatttgga 2340 aagctagaaa tctggaactg aaaaactggt tggctggaaa ttggaaaatt ggaaacctgg 2400 aacactgaaa cctgtaaaaa ctgggaaact ggaaaaccag ataactggaa actggaactg 2460 gaaaactggg aacttagaaa ctggaaacct ggaaaatctg gaaatcggaa tctaggaaac 2520 ctaggaaact ggaatctgga actgaaaaaa aaatgacaaa ctggaaatct ggaacagaaa 2580 atttggaaaa ccagaaaact ggaaactgaa aagctggaga tccaaaactg gaaaactgga 2640 aattggaaaa cttggaaaac tgggaaaact ggaactggaa atctgggaat tgaaaaactg 2700 gagaaactgg aaaaactaga attggaaacc tgggaattga aaaactggaa aaactggaaa 2760 aactggaaaa ttgggaaaat gaagctgagt acaagcataa gcttatgaaa ttctttcgat 2820 atttactgaa agtcttatgg taatggaaca tagatggtat aaagtaacat ggaaactctt 2880 tcgatattta ctgaaagttt tatggtaaat gaaaaagagg atacagagga acatggacaa 2940 attctcccga tacttacagg gagtcttatt gaagttggaa cttactccaa agagtaattg 3000 aggaaattct tccaaagtta actggaagtc ttactggaaa ttaagcttgc attcttgtgt 3060 agctgtgaca aattctctcg atacttactg agagtcttat tgaaattgga atgggcattc 3120 aaaatgaatg tgtaatattc tctcgatacc tactgggagt cttattgaaa attggaattg 3180 aattctttgg aacattctct cgatacttat agagagtttt actgaaattg gaacgggcat 3240 tcaaagtgaa tgtggaattc tctcgatact tactgagagt cttattgaaa ttggaaaaga 3300 aattcaaagt gaatgtagag cattctctcg atacttactg agagtcttat tggaactgga 3360 aatcgatttt gttggaaaca ttggaaattc tctcgatatt tactgagagt cttattggaa 3420 aattaactag gagctgtatt cagatgaaat gagctgacct tagttgtata gtgtatgttt 3480 atggaacgca catggaatgg tgattggaaa ttcaaagaaa tttggataac caagtctcat 3540 ttgaagtttt ttttttttgt tgtttgtgca aaaaacaaaa agggcggata tgagatttca 3600 gagaaagtag taaatgaagg aaaggaacaa aatagaaggt ggaactacgt tcaatggtta 3660 ccatgggacc cataattggg atgtggaata gctagattaa gaggatcccc ctttctgcgg 3720 attagggctt aatcatagca cctgggaaat tctgagacgt ttctgaacgg gaactcagca 3780 aacccgatga ggaacttgga actggaaatg tgaaacaatc cagaagaacg gaaagagtct 3840 aactcaaaag ttagtgtggg tgaaatggaa actggaatat ctgaaatggt ccagcaaatc 3900 agataagaaa ccaaagccgt caagatggat gatattctat tgagacaaga ccaggtcgtt 3960 tctgaatcgg agttaagagc aaatgagaaa ggattgaatt gaattgaaag ttgcacgtac 4020 tcacagtagg acctgttggc tgtagtatcc ggccagatct acgcttatca gaccaactgg 4080 aatatcatga tggttttcga gagactgcag taatcttgga attcattaga gttttcgcat 4140 attttatttt caagttattt tctttgcaat tttgtttctt ggtttatcgt gataattttt 4200 tgcccaaggt ggttcaagat aattatcatt ttgtatttca gaactatcaa tcgactcgaa 4260 cggtattatg gcgaaggtga atgcggcaat ggtgccgcta aaacagcaaa ttgatcagtg 4320 gggaaacact gcaaatgaag ttttgaaggt ctcccaggag atactgattg acgttaaagt 4380 tggaattatc caaccggaac tgaaacgaga cgagattctc aagaatcacg tcgacgttgt 4440 tgggatcttg aagattctag agaatcttga tgattggatc acaaagatga tccggaaaaa 4500 cgctgaaata tcggcaaatc tggagttggt gactcggaac actaagagtt tgcaagaaca 4560 cgcagctcaa aaagagtttg ttaaacttgc caaaagactt ggttacatga gaactgagct 4620 agaaggtatc attgattcat tgccagaatc tccgattgaa agcaagaagg aaggttctca 4680 agcaaagatt cttgtgaatc caaatgaaca gaacatgaaa ctgattttgg atagcttaaa 4740 agctcttgaa acaagattgg acgtttttga agccagcaac aagaagccat gttatggtaa 4800 taagaaaaag agatacgctg aaacaaagta cagtagtgat gactcagatg cgatggaaag 4860 ctacagcgat ggtagcgttg ttgtattgaa agatagtgtt agaagtgata catccagttc 4920 ggaactattg aaaacgtcga caactgccca acaggagcgt acacaaaagc gttatctgaa 4980 gatgaagctc caagaaccaa agaaattcga cggtgtaaac aagcttgagt atgaggaatg 5040 gaagatcatt ttcatggagg gatacggaaa aaatccagaa ctttcaaaga agtcgaagtt 5100 gatccaattg aaaagctgtg tcacgggaag agctcagtcg ctattgagtg gactgcaaat 5160 caccagtggc aactacaagg ttgcttggaa aattctggat cataactttt tgtcgaatgc 5220 tcgaccattg gatgaaattg aaaagagatt tagaaacgcc aaaattgacc aggaaaacta 5280 tgatcagatg aaacttgaca ttggaattat tacatctctg gtgtatgaca tgaaaaatcg 5340 tggactgaac gttgatactc catcgattta tcaatcgatt attgggaaat tgccatcctc 5400 tattgcagaa gacgtgctca tcaagactca gagctccaag ttcaatggtg aatttcacaa 5460 agtggaaaga tggattgata gaaagatcaa tgccaaaatg gccatcaatg agtatagaga 5520 tgaaaagaaa ggtctaaaga attttgatct ggaagacgag gcaaacgaag ttgaggagtg 5580 gacggatggt gatgatacct cagaaaatgg gtttcggaac gagggtgacc tcccggaaga 5640 atttggtcat ggaaactctg gatacgatag tgagtcgagc tcgtctcatt agagacttgt 5700 tcgttaaaaa aattcgtgct ccacggggag caaatcagcc agcagtgacg cctgttggca 5760 aggattccga aaaagttgga attttaatca gacacaatga tgtcaggaaa ccaaaaatgt 5820 ctaatgatat aaaccatcaa gatggttgta tacttgaaac gaaaggaaaa tacattccta 5880 agttaaatcc gcttgcagaa gcagcatcaa aaactgaacc accaaaagaa aatatttcgg 5940 gaacttcaga agaattgaaa tggagcacat cagccagaaa agattatgct ctatcaaaaa 6000 ttggaaccta tggtactgaa acagttggaa atagtgttca cggcgagaga ctgccaaaaa 6060 atgtcaaaat tggagttgct tatcacaacg ttgatattgt ctccagaaat aaatttgcta 6120 ttgagataat tgtgaccaag aaagatacta tcccagataa ggctaccgtg aaccgagatg 6180 tccctacgtg ccatcttttt ggggccggga g 6211 // ID Chapaev-1_CE repbase; DNA; CEL; 4753 BP. XX AC . XX DT 30-SEP-2007 (Rel. 12.09, Created) DT 30-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE Autonomous DNA transposon - a consensus sequence. XX KW Chapaev; DNA transposon; Transposable Element; Chapaev-1_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-4753 RA Kapitonov V.V. and Jurka J.; RT "Chapaev - a novel superfamily of DNA transposons."; RL Repbase Reports 7(9), 776-776 (2007). XX DR [1] (Consensus) XX CC Chapaev-1_CE is a very young family of DNA transposons. The CC genome contains two identical copies of Chapaev-1_CE and two CC copies that are 0.5% divergent from the consensus. Chapaev-1_CE CC belongs to the Chapaev superfamily. Hallmarks of the Chapaev CC transposons are 4-bp target-site duplications, terminal inverted CC repeats with the conserved '5-CAC and GTG-3' termini, and the CC Chapaev transposase. The Chapaev transposase is characterized by CC the conserved D-x(60-80)-D-x(220-290)-E catalytic triad. Chapaev CC transposons populate genomes of different animals, including sea CC urchin Strongylocentrotus purpuratus, amphioxus Branchiostoma CC floridae, starlet sea anemone Nematostella vectensis, sea hare CC mollusc Aplysia californica, mosquitoes Aedes aegypti and Culex CC pipiens, and nematode Caenorhabditis elegans. The N-terminal CC portion of Chapaev transposase in Chapaev-1_ACa, Chapaev-2_ACa, CC Chapaev-3_ACa, Chapaev-1_BF, Chapaev-2_BF, Chapaev-1_NV, CC Chapaev-2_NV, Chapaev-3_NV, and Chapaev-1_SP is similar to the CC N-terminal portion of RAG1 (100-370 aa in the human RAG1). It CC includes a novel type of zinc finger, called Chapa: CC H-X7-C-R-X-C-G-X35-D-X4-H-X4-C-X2-C-W-Xn-C-X2-C-X8-G. In the CC amphioxus and anemone Chapaevs, the N-terminal portion contains CC also the RING finger motif. Some Chapaev transposases (e.g. CC Chapaev-2_ACa) show low similarity to the RAG1 core. XX FH Key Location/Qualifiers FT CDS join(1077..1367,1439..1663,1713..3732,3997..4127) FT /product="Chapaev-1_CEp" FT /note="transposase." FT /translation="MPPPSNQSAWSSNLLNARKRRWKKDEEDENSNTERNA FT SLDLNLSNVELTEALDRNGDAQITPERRRVSPFVDFDDNLDVESKRNIDLN FT LTITPETDIPFDTQPSTSSIAESSSLPPRTPHSFERSSSPLYSSSDSVGVF FT SNETPYVKGIREDLSRERMRLLEVDSILDSISENLDAILSSSNHSVQLNPR FT TDIDEETLLNRLMERDDEFAVFSDEFFMQRKEIISIRKTSKRKTNENLILR FT KQIAQITPRAKKYRIIWISELRRRNRSQSRNVVTLPKYTIEWSKLENKSSG FT KSRIDLALDLLQRISREEDIRNFMIDFQKYLQQNDSQSFQVQLTHWQTIIF FT QEKCRFSNNQLRRVKQYYKMFTDLEIMPTINLTMQLKTRMSTIDNYKATTY FT ISKGEKVTVVKIIDVEKSVIPKLERFSVSKQLRHDSYTDGKIVIGIGGDSG FT GGTTKLCLLIGNCDHANSPHRIVLLAVFDDSDSRELIMAYLSDLIVKINNF FT TSITYMEDGRVVTRGVVQKVVGDFKFTCDLLSHKKQCATYFCPFCFETNPR FT GGLMQKLKDLNLQKIYFLRTMNSYKLNSKYGSFGVRCGSGPILQNVKLEHY FT LPAMLHLIVGLFTKYIFEPIWMAVVSLDNKTSFEIRRNKEETKRVADLKIQ FT DANKKFEASPLKRKREMKAQFTALKEEKVLLDETLDGLAGGYLKQFENDLE FT EVGATRRAWFQMYTGNHTKLILSEKGVTAAFKNLKNHMTPMLLNVKNAMSR FT LSKIMSLSANRLLSDDDISELDESMKEFVEFLQAAHPEESITQKLHVLVAH FT VVEVAKTERSWEGFRSKESNRFMPFSIASKDASTRLETQGKDTCKNIDKQI FT LGANLMDFSSIFCRGFHKRRCSTWFLSTWKIVDVTP" XX SQ Sequence 4753 BP; 1590 A; 833 C; 876 G; 1454 T; 0 other; cacggtggtt caatcgctag atggaggcaa acacgctcta ttgcgaacaa ttgtttaaat 60 ttcaagcctt tcactcggca tcgattctct ccgttttttg ccgattttaa ccgtttttcc 120 gtatcttttt tgttattttt ttttgtaaaa tcgaaaaagt gtgtcttgtt tcaggtagta 180 agcgaaaaaa ttcgaggaaa acaaaggagc gtcggcgtgg gcagcaagga aaacgaatta 240 agtgagtttt ttgacgagtt ctcttaaaaa atcacaaaat attttatttt tttgattgaa 300 aaagagagag ctaaaaattc tgaacaagat acgggccgaa ttttttagaa atcgtacctt 360 cttcattaat tttttttcga aactaaaatt ttgcactgaa aatggaggaa aaatgatttt 420 tttcgaaaaa aatcacaaaa tgttatattt ttattgccga attagaatca gctaaaaaat 480 ctctacaaga taaaggcaaa catatacacg aatcgtacat ttttcactta aaattttcca 540 aaagtaaatt tttttcactg aaaatagttt ttaaaaacgt tccctttttc aagaaaatca 600 aaaactcttt acgatttata gttgttttat gatcaaaatt taattgtgta taagatggaa 660 ctcaattcat agagaaatct taatttttca ttaaaatttt ttcgaaaaaa tttttttttg 720 taataagaat ttataaaata attttttttg ccaaaaaaat cacaaaatgt gttagttttt 780 tgcttcaaag agaaagaact gttaatttta agcaagatac agagagaatt ttttagaaat 840 cgtacattct tcaagaaaat tttttcgaaa ctaaaatttt gcactgaaaa tggaggaaaa 900 atgatttttt tcgaaaaaaa tcacaaaatg ttttattttt attgccgaat tagaatcagc 960 taaaaattct ctacaagata aaggcaaaca tatacataaa tcgtacattt ttcacttaaa 1020 attttccaaa agtaaatttt tttcactgaa aattgaggaa aaaatttttt tttcagatgc 1080 ctcctccatc aaatcaatca gcatggtctt caaatcttct aaatgctcgg aaacgacgat 1140 ggaaaaaaga tgaagaagac gaaaattcaa atactgaaag aaatgcttcc ctggacctaa 1200 acctgtcgaa tgtagaattg acagaagctc tcgacagaaa cggagatgcg cagatcactc 1260 ctgagagaag acgagtgtct ccatttgtgg attttgatga taatctggat gtggagtcga 1320 aacgcaatat cgatttgaat ctgacaataa cacctgaaac tgatattgta agtgaattaa 1380 tccatgctac aactcggatc ctaagacgat tataatattt taaaaatgat ttttccagcc 1440 gttcgacaca caaccatcaa cttcttctat tgcggaatct tcttcattac ctccaagaac 1500 accacactct tttgaaagat cgtcaagtcc actctattca tcgagtgaca gcgtcggcgt 1560 attttcgaat gagacaccat atgtgaaggg aattagagaa gacttgagta gagaacgaat 1620 gagattgcta gaagttgatt ccattttgga tagcatatca gaggtagatt tatttctttg 1680 ttgttgattg tttattcaat tttccatttc agaaccttga cgccatcctg tccagctcaa 1740 accattcggt ccagctcaat ccaagaacag acattgacga agagacactt ctaaatcggt 1800 tgatggagag agacgacgag tttgccgttt tttctgatga atttttcatg caaaggaaag 1860 agataatttc aataagaaaa acatcaaaac ggaaaaccaa cgaaaatctg atattaagaa 1920 aacaaatcgc tcaaattacg ccaagggcga aaaagtacag aataatatgg atatcagagt 1980 tgcgaagaag gaatcgttca caaagccgaa acgttgtcac gcttcccaag tataccatcg 2040 aatggagcaa gctcgaaaat aagagctctg gaaaatctag gatagatttg gcactggatt 2100 tgcttcaacg aatcagtcga gaagaggata ttcgtaattt catgattgat ttccagaaat 2160 atctccaaca aaatgattcg cagtccttcc aagtccaatt gacgcattgg caaacgatta 2220 tttttcaaga aaaatgtaga ttctctaaca atcaattgcg gcgggtgaag caatactata 2280 agatgttcac cgatcttgaa atcatgccaa ctattaattt gacgatgcaa ctgaaaacga 2340 gaatgtcaac tatcgacaat tataaggcca caacatatat atcaaaagga gaaaaggtga 2400 ccgttgtcaa aattatcgat gtcgagaaaa gtgtcattcc gaagctggag cgtttttctg 2460 tatcaaaaca attacgccat gattcctata ccgatggcaa aattgttatc ggaattggtg 2520 gagattcggg aggaggaacg acgaaacttt gtcttctgat cggaaattgt gatcatgcga 2580 actcgccgca ccgaatcgtc cttctcgctg tttttgatga ctctgattcg cgggaactta 2640 tcatggccta cctgtctgat cttatcgtga agatcaacaa cttcaccagc atcacctata 2700 tggaagatgg aagggtggtg acacgtggcg ttgtccaaaa agtagtcggt gattttaagt 2760 ttacatgtga tttactgtca cataaaaaac aatgtgccac ctatttttgc ccgttttgtt 2820 tcgaaaccaa tccaagagga ggactgatgc aaaagcttaa agatctgaac ttgcaaaaga 2880 tatatttctt gagaacaatg aattcttaca agctcaattc taaatacggt agctttggtg 2940 tcagatgcgg aagtggacca attcttcaaa atgtcaagtt ggaacactac ttgcctgcaa 3000 tgctacactt gattgtcgga ctgttcacga aatacatctt tgagccgatt tggatggccg 3060 ttgtgagttt agacaataaa acatcatttg aaattcgaag aaataaagaa gagacaaaaa 3120 gagttgccga tttgaagatt caagacgcga acaaaaaatt cgaagcttca ccacttaaga 3180 gaaaaagaga gatgaaagca cagtttactg ctctgaaaga agaaaaagtg ttattggacg 3240 agacgctgga tggacttgct ggtggatatc ttaaacaatt tgagaatgat ctggaagaag 3300 ttggtgcaac acgccgagcg tggtttcaaa tgtacactgg aaatcacacg aagctgattt 3360 tgtctgagaa aggtgtcaca gctgcattca aaaatctgaa aaaccacatg actccgatgc 3420 ttctgaatgt caagaacgcg atgagcaggc tgtccaaaat catgtcattg tcggccaatc 3480 gattgctctc tgacgatgac atcagcgaat tggatgagtc gatgaaggaa ttcgttgaat 3540 tcctacaagc tgcccaccca gaagaatcaa tcactcaaaa attgcatgtc ctggttgctc 3600 acgtagtaga agtcgcaaaa acggaaagga gctgggaagg ctttcggagc aaggaatcga 3660 atcgcttcat gccgttttca atcgcctcga aagacgcttc cactcggtta gaaacacagg 3720 gaaaagatac ctgtacattg ctaaggagct gtcatgtagc aatctaattt ctgatatgga 3780 ggaagtaagt ggaacctttc aaaaaaacaa atcgtaacat tttctttgca ggacgcctcg 3840 tcttcccaat aaaccaccca gcgctctccc tcacacaatt ctgaaccttt tcgcaattaa 3900 aatttcggga tttcctcaat atcataaaat tcgggatttt ctatggttag ttgtgcttaa 3960 aattattaaa aaattattaa tttcattaaa tttcaggtaa aaacattgac aagcagatac 4020 tcggtgcaaa tttgatggat ttctcatcaa tattttgtcg aggatttcat aagcggagat 4080 gctccacatg gtttctttca acttggaaaa ttgtcgatgt gacaccttag gattctatcg 4140 aaaagctaaa attcaggatt tccttcggtt tgttgggatt aaaattattc gaaaataatt 4200 gattttatga aatttcagat tgaaaccttg tcaatgagct actcagcaaa atttcgatgg 4260 atttatctca aaaagttcac caggagtcga ggcaattaag aaatttcacc gggtttcctt 4320 attttcgaaa gttttaccta gaaaaatttc aacttgcatc cgtaaagcga agaaggaggc 4380 tgattccaca tattaaattc gttttttgaa tcgatttgta tgtatttttc aattttcaat 4440 cttccgtttc gattatttct ttgttttact gttcaaaatt aatttttctg tgtttcaata 4500 attcaattct caattgttag ctactaaatt gaataataaa atctattttt tatttttttt 4560 tcaccttgaa actattaatt ctgccttttt tctgctaaaa tttgcttcaa aaatctattt 4620 gccgctgcaa tgttttgcgg actacggtac gcaagtacgc aaacaccgcg atgacacatt 4680 gcggaccatt tcgctgcgta cctgcgagat ctttctcaaa ttttacgaga gatctagttt 4740 ttgtgatacc gtg 4753 // ID CER3-LTR_CE repbase; DNA; CEL; 424 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER3-LTR_CE is a long terminal repeat of the CER3_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER3-I_CE; CER3-LTR_CE; KW CER3_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER3_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 424 BP; 124 A; 102 C; 76 G; 122 T; 0 other; tgtaaggatt atcaccgaac ccacacattt tcgatgaaat ataaacatta tatttttact 60 gtaaattctc tgagataatt tacagagaaa gaacgaacga aatcggtgca gtacaacgat 120 ttctacggca actacaatcc acgaggagga gaaactccag ttcaaaccgc cgccaaacta 180 cagaaggtcg cattgtcgcg ccgagacccg tcgtcgttaa aggctgcgcc tttaacccgg 240 agacgtctat ataaggagag gagaacgctc attggtcatc atttcttgat tttcgtcgcg 300 tgttttgctc tcacttttgt ctcccgtttc acttgtcaga cgatgctatc gtctaacctc 360 ttttactcta ataaactctt ataacttcaa acggtttaat tcacggcaac caacaatctt 420 tact 424 // ID NeSL-1 repbase; DNA; CEL; 7028 BP. XX AC Z82058; XX DT 03-AUG-2009 (Rel. 14.07, Created) DT 03-AUG-2009 (Rel. 14.07, Last updated, Version 1) XX DE NeSL-1 is a non-LTR retrotransposon, complete sequence. XX KW NeSL; Non-LTR Retrotransposon; Transposable Element; NeSL-1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-7028 RA Malik H.S. and Eickbush T.H.; RT "NeSL-1, an ancient lineage of site-specific non-LTR RT retrotransposons from Caenorhabditis elegans."; RL Genetics 154(1), 193-203 (2000). XX DR GenBank; Z82058; Positions 33197 26170. XX FH Key Location/Qualifiers FT CDS 395..5455 FT /product="NeSL-1_1p" FT /note="Ulp1 cysteine protease, reverse FT transcriptase, restriction enzyme-like nuclease." FT /translation="MLRRKGRHRMVMVNSVKWQPSAHAEAIGTGKSWAPQR FT SQASEHGWQSNAMFDPPNRILFARDSWSLNQSTHLQNQRSGSGLGIRPGQV FT RNNMVGGGPHRAGDPKRRVELVSIQGSEVTVRTIYPSDEIFSCYSKSCDIK FT TKAGYGPEDLKHLTRHIKNEHGLKARWAYQCGLCNEKSDPSVSEGHKWMEA FT HMVAVHQSSAEKRIKSYQKCTGARVAEQLQAAAPSLTVPGKHKSGSRDAAK FT DSMTPTKDDDPKTRIYQTRSVVKKSTQKTAEPTDEGSRGPKYASIFQKSVK FT ARKSLALLCELSSPKPMNPLPTNELTLKEGNSRELAKEEAPSEGIDDIVII FT DLDESEESPPRRKRFNTWCLDHESSREAWLDDTAIFWYISYLCRGSTKYSA FT LDPCLWSMYKVKGSRYILDRLESSITYFFPICEEDHWTLLVLKDNSYYYAN FT SLHQEPRGPVRDFINDSKRARKEFKVQVPLQRDSFNCGVHICLMTNSIMAG FT GKWHSEEDVRNFRKRLKKTLQEEGYELYSVNSLGIPFQAPTTEQMDYKETR FT CKRSYASVLTQISPPAKRPDCKPDNNIFVPTKDCAAEGNPQEKGRNESPEE FT INTEHIVVAGKPANNISPRCRSTSEMLFEMVKATTSSGRSSLGTMTQDEFI FT RTSTIAEAVPLMSIKLPPMELPRKILPPIPPRKPTQTNGGQKGKQQRVPTG FT KPDTLNAKVRNWFNNQLESYAMEGRSFQRLEWLTEVLTASIQKAAAGDEGI FT VDIICKRNPPLEVAKGEMCTQTENKRKTTNNAARIADPIQSSKGAGDVKAS FT YWKERARTYNRIIGSKEELCKIPIDQLEDFFKKSTSRTNVQESIMKEKSSK FT IPALKIGNWMEKKFIGKEVAFALRKTKDTAQGADGLRYHHLQWFDPSGELL FT AKVYNECQRHRKIPKHWKEAETILLFKNGDQSKPENWRPISLMPVIYKLYS FT SLWNRRIRAVPNVLSKCQRGFQEREGCNESLAILRTAIDVAKGKRRNLAVA FT WLDLTNAFGSIPHELIEYALTAYGFPQMVVDVVKDMYQGASMRVKNATEKS FT DRIPIMSGVKQGDPISPTLFNICLETVIRRHLESANGHQCLKTRIKVLAFA FT DDMAILTDSPDQLQRELSKLDNDCTPLNLIFKPAKCASLVIQKGVVRSASI FT KLKGNAIRCLDENTTYKYLGVQTGSAARISAMDLLEKVTKELECVVKSDLT FT PPQKLDCLKTFTLSKLTYMYGNSIPLITEIKMFANIVIRGVKVMHRIPVRG FT SPLEYIHLPVKDGGLGVACPKTTCMITFLVSTLKKLWSDDEYIKTLFTSLA FT EEVVKKESKKSTVTMDDIADYLNVEERINRSEFGYNSITRLRDVMRNLAIT FT GDSPLYRLKMVVKNGKIALLVQATSESMERIYTEEDAKKLQRSLKDQVNKA FT LKHRFNTTKVVKSKVVRVVQQHPASNRFVTKGGNLSLACHRFVHKARLNLL FT ACNYNNYDKSKSKVCRRCGKDLETQWHILQNCPFGFSKKITERHDAVLHKV FT KTLIESGGKKNWTMKIDEELPGFSRLRPDICLKSPDEKQIILADVACPYEH FT GVEAMERSWQAKIDKYETGFAHLRKSGTKLTVLPIIIGSLGSWWKPTGDSL FT KELGIKGSVINSAIPELCATVLEHSKNTYWNHIFGEAYIPNPMRNGHAKPA FT GNGWKKERLQKAPVRPTN" XX SQ Sequence 7028 BP; 2106 A; 1667 C; 1731 G; 1524 T; 0 other; gctcactttc tatcgtgtta accgtacgtt tacactccca gtgagtgtaa taaaggttat 60 tcgatagagg gtgtctccct ctttcttggg taattcttcg gcggtccggg gtctctccct 120 cgtctttttt ttaaactttt ctttctcatc cactcttttg ctccttttta ctaactcttg 180 tactctatag tcttttctca tcccccatcc gccgttgggc aaagtttatt tactttgtta 240 aatccatatt ttatctctct cacccgtaca gaaagcgtct ccttctcaaa cgcttttctg 300 tactttttct tatattttca ttaacatatt tttcctgttt atactaacct aacctccatt 360 gtcaattact aactaacttg tacaacggat ttcgatgttg cgccgaaaag gacgtcaccg 420 aatggttatg gtcaattctg tcaaatggca acccagtgca catgctgaag caattggaac 480 aggaaagtcc tgggcaccac agcggtccca ggcatccgaa cacggctggc aatcaaatgc 540 aatgtttgat ccccccaaca ggattctctt cgccagagac tcatggtcgc tcaaccaatc 600 aacgcatctt caaaatcaaa ggagcggatc aggattgggt ataagacctg gtcaggtaag 660 gaacaatatg gtggggggtg ggcctcacag agcaggggac ccaaagcgtc gtgtcgagct 720 ggtcagcata caaggaagcg aagtgaccgt cagaacaatc tacccgtcgg atgaaatatt 780 cagttgttac tccaaatcat gtgatatcaa aacaaaagct ggctatggcc ctgaggacct 840 aaagcacctg actcgtcata tcaagaacga gcatggtctc aaagctcgct gggcatatca 900 atgtggattg tgcaatgaga agtcggaccc aagtgtatcg gaaggccaca aatggatgga 960 ggcacacatg gtcgccgttc accaaagctc tgcggaaaaa aggataaagt cctatcagaa 1020 atgcacgggt gcaagagttg cagaacagct acaagctgct gctccatcgc ttactgtgcc 1080 ggggaagcac aaatcaggct ctagagacgc tgccaaagat tcgatgacac caacaaagga 1140 tgatgacccg aaaaccagga tctatcagac acgaagcgta gttaaaaagt cgactcagaa 1200 aacagcagag ccaacagatg aagggtctag aggcccaaag tacgcatcca tttttcagaa 1260 atccgtcaaa gcaaggaaga gcttggcgct tctctgtgaa ttaagcagcc ctaagcctat 1320 gaaccccctt cctacaaatg agctaactct gaaagaaggg aattcaagag agctcgccaa 1380 agaggaagca ccatctgaag gtatagacga catcgtcatc atcgatctgg acgaatcgga 1440 ggagtcgcca cccagaagga aacgattcaa cacctggtgt ctggatcatg agtcaagccg 1500 tgaagcatgg ctggatgaca cagcaatctt ctggtacatc tcctatctct gcagaggaag 1560 tacaaagtac tcagctttgg acccatgcct ctggagtatg tacaaagtca aaggctcaag 1620 atacattctt gaccgcttgg aaagctccat cacatatttt ttcccgatat gcgaggagga 1680 ccattggaca ctgttggtat tgaaagacaa ttcatactat tatgcaaaca gtctgcacca 1740 agagccacgt ggcccggtca gggacttcat caacgactca aaacgggctc ggaaggagtt 1800 taaggtgcaa gtacctcttc aaagagactc ctttaactgt ggagtgcaca tctgtctaat 1860 gaccaactcg attatggcag gaggcaaatg gcactctgaa gaagacgtca gaaacttcag 1920 aaaaagactg aagaagacac tccaggaaga aggctatgag ctttactcgg tcaatagtct 1980 gggtatacca ttccaagccc caacgactga gcaaatggac tacaaagaaa caagatgcaa 2040 aagaagctat gccagtgttc ttactcaaat aagcccgccg gccaaaaggc cggactgcaa 2100 acctgacaac aacatattcg taccaaccaa ggattgtgct gccgaaggta acccgcagga 2160 aaaaggccga aatgaatctc ctgaagagat caatacggaa catatcgtcg tcgcaggaaa 2220 acctgcaaac aacatcagtc caaggtgtcg gagcacctcg gaaatgctgt ttgagatggt 2280 gaaagccaca accagcagtg gaagaagcag cttgggcacc atgacgcagg atgagttcat 2340 ccgaaccagc acaatcgccg aggcagttcc cctaatgagc ataaaactcc caccaatgga 2400 gttgccaagg aaaattctgc caccaattcc ccccagaaaa ccaacccaaa ccaatggagg 2460 tcaaaaggga aagcaacaga gggtgcctac aggaaaacca gacaccctaa atgctaaagt 2520 ccggaactgg ttcaacaacc aacttgagtc gtatgcgatg gagggtcgca gcttccaacg 2580 actggaatgg ctgacggaag tactcactgc gtcgatacaa aaagcagcag caggtgatga 2640 aggaatagtt gatattattt gcaaacggaa cccgccactt gaagttgcga agggtgaaat 2700 gtgcacccag accgaaaaca aaaggaaaac gaccaacaat gcagcaagaa ttgcggaccc 2760 aatccagagc agcaagggag ctggtgatgt gaaggcatcg tactggaaag aaagggctcg 2820 cacttacaac aggattattg gtagcaagga ggaactctgc aaaattccca tcgatcaact 2880 ggaggatttc ttcaagaaat ccacgtcccg caccaacgtg caggagtcga tcatgaagga 2940 gaaaagctcc aaaattcctg ctctcaagat aggtaactgg atggagaaga agtttatcgg 3000 aaaggaggtg gcgttcgctc tgcggaaaac aaaagacacc gcgcagggtg cagacggact 3060 gcgataccac caccttcaat ggtttgatcc cagtggtgag ttattggcga aggtatataa 3120 cgagtgccaa cgacacagga agatcccaaa acactggaag gaggccgaga ccatcttgct 3180 gttcaaaaat ggagatcagt caaaaccaga aaactggcgc ccaattagcc tgatgcctgt 3240 gatctacaaa ctttactcca gtctgtggaa ccggagaatt agagctgtac caaatgtgtt 3300 gagcaaatgt cagcgagggt tccaggagcg cgaaggttgc aatgagagtc tagcaatact 3360 cagaacagca atcgacgtgg ccaaaggaaa acgaagaaac ctggcggtgg catggctgga 3420 tctgacgaac gcgtttggat ccatcccgca cgaattgatt gagtacgcgc tgacagcgta 3480 tggatttccg caaatggtcg tcgatgtggt caaagatatg taccagggag catcaatgag 3540 ggttaagaac gcgacggaaa aaagcgatcg aatcccaata atgtctgggg tgaaacaagg 3600 cgatcccatt tcaccaacac ttttcaatat atgcctggaa actgtgatta gaagacacct 3660 ggagtctgca aatggtcacc agtgcctcaa aacaagaatt aaggtactgg cgttcgccga 3720 cgacatggcg attttaacgg attcccccga ccagctccag cgagaactgt caaagctaga 3780 caatgattgc acgcccctga atcttatttt caagccagca aaatgtgcat cacttgtgat 3840 ccaaaaagga gttgtgcgga gcgcatcaat taagcttaaa ggaaacgcca ttcgatgcct 3900 tgacgagaac accacttaca aatatttggg agttcagacg ggttcggcag caagaatttc 3960 agcaatggat ctactggaga aagtcacgaa ggaacttgaa tgcgtggtca aaagtgacct 4020 gacgccgccg caaaagctgg actgtcttaa aacattcacg ctgtccaaac tgacatacat 4080 gtatggaaat tccataccac tgatcacgga gataaaaatg tttgcaaata tcgtcattcg 4140 aggagtcaaa gtgatgcata gaatcccagt ccgagggtca ccactggagt acatccatct 4200 tccagtgaag gatggagggc ttggtgtagc atgtcccaag acaacctgca tgattacgtt 4260 ccttgtctct actcttaaaa aactctggtc agatgatgaa tacatcaaaa cattattcac 4320 atcactggcg gaagaagtag taaagaaaga gtcaaagaag agcacagtca ctatggatga 4380 tatagccgac tatctcaacg ttgaggagag gatcaatagg agcgaatttg ggtacaattc 4440 cattacgaga ctgcgggatg tgatgaggaa cttggccatc actggcgact ccccacttta 4500 caggctgaaa atggtagtaa agaacgggaa aatcgctttg ctcgtccaag ccacaagcga 4560 aagcatggaa aggatctaca cggaagaaga tgcgaaaaag ctgcagcgct cactgaagga 4620 tcaagtgaac aaagcactca aacatcgatt caacaccacc aaagtagtga aaagcaaagt 4680 cgtccgagtc gtgcaacagc acccagcaag caacaggttt gtcacaaaag gtggcaacct 4740 gagccttgca tgtcaccgct ttgtgcataa agcacgtctg aatctactgg cctgcaacta 4800 caacaactac gacaaatcca aatcaaaagt ctgtaggcgt tgtgggaagg atctggagac 4860 gcagtggcac atactgcaaa actgtccgtt tggtttctca aagaagatca ctgagaggca 4920 tgatgccgtc ttgcacaagg tcaaaactct cattgaaagc ggtggaaaaa agaattggac 4980 aatgaagatt gatgaagaac ttccaggatt cagcagactc cgtccagata tctgcctcaa 5040 aagccctgat gaaaaacaaa tcatcttggc agatgtcgca tgcccatatg agcatggagt 5100 agaagcgatg gaaaggagct ggcaggcaaa aatcgacaaa tacgagacgg gattcgccca 5160 cctgcggaaa tcgggaacca agctgaccgt ccttccgatt ataatcgggt cacttggatc 5220 atggtggaaa ccgacaggtg acagtctcaa ggaattggga atcaagggaa gcgtgatcaa 5280 cagtgccatt ccagaactct gtgctactgt tctcgaacac agtaagaata cgtactggaa 5340 tcacatcttc ggtgaagcgt acataccaaa tccaatgcga aacggacacg caaaacctgc 5400 tggaaatgga tggaaaaagg aaagattgca gaaggcccct gtgaggccta ccaactagcc 5460 tccagggcac gccgcacgcc aaaagtcctg gcataactct gcaaataaca tcaaacgtca 5520 atcaactcca caaactctcc actctcttca agtcttctcg gtgcttccaa caccacaatg 5580 gtgaaagctc cttcaccttt tccctccaaa attcttccca tgtggggaag tcctgttctt 5640 gtaagctctc cggaggctgc aagagcagaa gaaattcttc tttctgacaa ggtcagaagg 5700 aagtcctgtt cttgaggcgt ccatcccggg cgtcatagga gagatcagat gcaccttcta 5760 gcaggagcta gaagggctgc cctgtcttga gatccccacg ggggtcaata gacgggaggg 5820 gctgctggct ttctcttttt aagaggaagc accaatccgg agatccttag gggtcaaagg 5880 attaaaaggc agcaggtcca attctcctca ctgacttcgg tcagagagga gtcccgcctt 5940 ggagacctcc ccggggaggt tgctgaagag gcggaagctc cttctagcaa gagctagagg 6000 gagttcccag tcctgaaacc cttgcggttg atgatggaat ggaagagtac ttcggtactg 6060 ctcgttgctc tctctgcgtt ttactgccga gggccggatt tgctcgaatc gcgaaaggtc 6120 tcaatcgacc attcaagatg acggcttatc taaggtccga aagcagttgg gagagtaacg 6180 tgttctccta cctttcaagt tgaatggtcg ttttactgtt tgggatagct gacttgatgc 6240 tagtacgctt catctgtgga tgacgctccc caagcagtca agtagacttg aaaggtgccc 6300 tcgccctagt tagctcttag accttatggg tcgccatggt tgtggacggg tatgcttgcc 6360 ggagccgagt cgtgtttctt agaaccaacc tcgacgaggc gaaagcttgc acaagttagc 6420 acaattgtgg tagggccgac tagaaaatga gtcccttagg gggttacgcc ttggcgaaag 6480 tgaggacaat tggcattgac gggtgcttcg gcactaggca aaggcgccac cacactgtcc 6540 aatctctaaa aagttcacat tcatcgaaga actaccggaa ccaaccacac atgtgttgaa 6600 acctacacgg tggaagggaa aggaaagctt cgctggaacg aaaagaacgg ataggttccc 6660 cttcttgatg gctgtgaggc ttaggatgga cgggaaggcc gtgaggcctc aggcgggtaa 6720 ctcggccaga cgctagttga tcttcggatc acgacagccc tggctaagag gaaccctgga 6780 tggagtgtga aggatgggcg ggtagggggt taagcctgtt gacagaccac cgactgcagt 6840 cacaaaatca gtgattatgc gggtggacca atctgttggc gggtgtttcc ctctacctga 6900 ccccgcaata tggtatgtac gatcctcgga tctaaaattc ataatggccc accacaacca 6960 taaacctccc tagcagctgg tggtcccgat aattcgggtt cttgccacta ctgcgaccca 7020 ggctcgcc 7028 // ID HAT2_CE repbase; DNA; CEL; 564 BP. XX AC . XX DT 21-OCT-1999 (Rel. 4.09, Created) DT 14-JUN-2000 (Rel. 5.05, Last updated, Version 2) XX DE HAT2_CE nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT superfamily; HAT2_CE; TIRs; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-564 RA Kapitonov V.V. and Jurka J.; RT "HAT2_CE."; RL Direct Submission to Repbase Update (OCT-1999). XX DR [1] (Consensus) XX CC HAT2_CE is a putative non-autonomous DNA transposon. CC It generates duplication of 8 bp-long target site upon CC integration in the genome. HAT2_CE has 17 bp-long CC perfect terminal inverted repeats. CC The length of the target site duplications indicates CC that transposase encoded by the autonomous HAT2_CE-like CC element is related to the HAT superfamily. HAT2_CE is about CC 64% identical to the HAT1_CE consensus sequence and they CC share identical 13 bp-long termini. CC HAT2_CE individual copies are, on average, 98% identical CC to the consensus sequence. CC There are about 100 copies of HAT2_CE per haploid genome. XX SQ Sequence 564 BP; 165 A; 113 C; 119 G; 167 T; 0 other; caggggtgtg cggataaccg attttttcgg ctaacggata aatcggctaa tgccgatttt 60 ttgagaaccg gctaacggct aattcggcta acggctaatt tcaaaatttt cggctaacgg 120 ctaattcggc taatctcagt cattcaaatc ggctaatttt cggctaattc ggctaattcg 180 gaaaatattc atttggttaa gctttttttg tccattctgt taattcaggt tttgggttaa 240 ttttttcact gttattgagc aaattcaggg atgaacgact gttcaaatag gaaaaaatca 300 tacaaattca ccacattttg ttttccaaaa aaatatgtta gttgaattag ccgaaatagc 360 cgatcggcga attcggctaa catcggccaa aatcggctaa cggataacta cgtattagcc 420 gaactgccaa aaagtcggct aacggctaat tcggctaatc ggtaaaaagg tcggctaacg 480 gataattcgg ctaatattag ccgagttagc cgaattagcc gatcggcgaa ttcggctaaa 540 tcggctatta tccgcacacc cctg 564 // ID HELITRONY1_CE repbase; DNA; CEL; 2593 BP. XX AC . XX DT 25-MAY-2001 (Rel. 6.04, Created) DT 25-MAY-2001 (Rel. 6.04, Last updated, Version 1) XX DE HELITRONY1_CE is a nonautonomous DNA transposon - a consensus. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW HELICOPY1; HELITRON class; HELITRONY1_CE; RC35; RCA1; RCC9; RCD1; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-79 RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and La Volpe A.; RT "Molecular and genomic organization of clusters of repetitive DNA RT sequences in Caenorhabditis elegans."; RL J. Mol. Biol 226(1), 159-168 (1992). XX RN [2] RP 1-2593 RA Kapitonov V.V. and Jurka J.; RT "HELITRONY1_CE."; RL Direct Submission to Repbase Update (MAY-2000). XX RN [3] RP 1-2593 RA Kapitonov V.V. and Jurka J.; RT "Direct submission."; RL Direct Submission to Repbase Update (MAY-2001). XX RN [4] RA Kapitonov V.V. and Jurka J.; RT "Rolling-circle transposons in eukaryotes."; RL Proc Natl Acad Sci USA 98(15), 8714-8719 (2001). XX DR [3] (Consensus) XX CC There are about 50 copies of HELITRONY1_CE in the C.elegans CC genome. CC They are ~97% identical to the consensus sequence. CC Several portions of HELITRONY1_CE have been reported previously CC as the RCA1, RCD1, RC35 and RCC9 minisatellite like repeats [1]. CC The authors [1] have noticed a co-clustering of these repeats but CC did not find a right explanation to it. XX SQ Sequence 2593 BP; 932 A; 349 C; 534 G; 776 T; 2 other; tcattamyta tatataaaaa aatacagttc gtctgtccat agtttgtagt ctatgtagtc 60 tttgtagtct gtgacgtcac acccaaatgt cagtgagaat tgtgggcggg gcctgtcacc 120 tacccttcgt ggtgagaccc atcgtggtga gacccttcgt ggtgagaccc atcgtggtga 180 gacccatcgt ggtgagaccc ttcgtggtga gacccatcgt ggtgagaccc ttcgtggtga 240 gacccacatt tttggcggga aattcaaatt ttcagtgaaa aaattttggc gggaaattca 300 aattttcagt gaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa aattttggcg 360 ggaaattcaa attttcagtg aaaaaaattt tggcgggaaa ttcaaatttt cagtgaaaaa 420 attttggcgg gaaattcaaa ttttcagtga aaaaattttg gcgggaaatt caaattttca 480 gtgaaaaaat tttggcggga aattcaaatt ttcagtgaaa aaaattttgg cgggaaattc 540 aaattttcag tgaaaaaaat tttggcggga aattcaaatt ttcagtgaaa aaaattttgg 600 cgggaaattc aaattttcag tgaaaaaaat tttggcggga aattcaaatt ttcagtgaaa 660 aaaattttgg cgggaaattc aaattttcag tgaaaaaaat tttggcggga aattcaaatt 720 ttcagtgaaa aaaattttgg cgggaaattc aaattttcag tgaaaaaaat tttggcggga 780 aattcaaatt ttcagtgaaa aaaattttgg cgggaaattc aaattttcag tgaaaaaaat 840 tttggcggga aattcaaatt ttcagtgaaa aaaattttgg cgggaaattc aaattttcag 900 tgaaaaaaat tttggcggga aattcaaatt ttcagtgaaa aaaattttgg cgggaaattc 960 aaattttcag tgaaaaaaat tttggcggga aattcaaatt ttcagtgaaa aaaattttgg 1020 cgggaaattc aaattttcag tgaaaaaaat tttggcggga aattcaaatt ttcagtgaaa 1080 aaaattttgg cgggaaattc aaattttcag tgaaaaaaat tttggcggga aattcaaatt 1140 ttcagtgaaa aaaattttgg cgggaaattc aaattttcag tgaaaaaaat tttggcggga 1200 aattcaaatt ttcagtgaaa aaaattttgg cgggaaattc aaattttcag tgaaaaaaat 1260 tttggcggga aattcaaatt ttcagtgaaa aaaattttgg cgggaaattc aaattttcag 1320 tgaaaaaaat tttggcggga aattcaaatt ttcagtgaaa aacaattttg gcggaaaatt 1380 ccaatttctg agaaaaatcg agaaatgtct gcaatgttcc agaagtttct agaaaattcg 1440 agaaaattcc ggaatggtcc agaattttct agaaaattcg ggaaaactct ggaatattcc 1500 agaactttct agaaaattcg agaaaattct ggaatgttcc agaattttct agaaaattcg 1560 agaaaactct ggaatgttcc agaactttct agaaaattct agaaagttct ggaatggtcc 1620 agaattttct agaaaattcg ggaaaactct ggaatattcc agaactttct agaaaattcg 1680 agaaaattct gcaatgttcc agaattttct agaaaattcg ggaaaattct ggaatattcc 1740 agaactttct agaaaaatcg ggaaaagtcg gcaatgttct agaactttct agaaaattcg 1800 agaaaattcc ggaatggtcc agaattttct agaaaattct agaaagttct ggaatggtcc 1860 agaagtttct agaaaattcg agaaaattcc ggaatggtcc agaattttct agaaaatccg 1920 agaaagttct ggaatggtcc agaattttct agaaaattcg ggaaaattct ggaatattcc 1980 agaactttct agaaaaatcg ggaaaagtct gcaatgttcc agaactctct aaaaaattcg 2040 ataaaattct gaaatgttca atttcgtgga aaaattcaag aaacttctgc aaagttctac 2100 acggggttct ggctcgaccc cgcgctccat tggacactga aatggcggga aaaaactttg 2160 aaatcgcaag aggaattttc acgcagcgcg ttgaaaaaag tgtatgcatt tgcgcgtgac 2220 ggtgtttctt caagttttga tactcctaga atattctgaa gtttcaataa ttttaaaaaa 2280 atatcaaaaa ttgttataat tttcagaaac gctcgccaaa aactttcgga aaggaccgga 2340 aaatgtttca tatgatttga agaacgtaga gaaaattata cgttcaagaa atttagaaat 2400 agttccagga aacttgaggt tttaagttgt aattgctcag aacttagtta tataatattt 2460 atgttgctcg aaaatttccg gtaaccaggg aaaaaacgtt ccatctgcaa agaaggctta 2520 gaaaatttag aatttgattt caattttgag tacgccagtc ggagcacgcg cttcagcgcg 2580 tgcgaacggc tgg 2593 // ID TIR9TA1C_CE repbase; DNA; CEL; 255 BP. XX AC . XX DT 15-JUN-2000 (Rel. 5.05, Created) DT 15-JUN-2000 (Rel. 5.05, Last updated, Version 1) XX DE TIR8TA1C_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TIR9TA1C_CE; KW nonautonomous DNA transposon; putative TA target site. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-255 RA Kapitonov V.V. and Jurka J.; RT "TIR9TA1C_CE."; RL Direct Submission to Repbase Update (JUN-2000). XX DR [1] (Consensus) XX CC TIR9TA1C_CE is a putative non-autonomous DNA transposon; it CC has perfect 9-bp terminal inverted repeats and putatively CC generates duplications of TA target sites upon its integration CC in the genome. CC There are ~100 copies of TIR8TA1C_CE in the genome; they are CC ~87% identical to the TIR9TA1C_CE consensus sequence. CC TIR9TA1C_CE and TIR9TA1B_CE share common ~70-bp terminal CC portions. XX SQ Sequence 255 BP; 70 A; 64 C; 54 G; 67 T; 0 other; tagggcttcc atgtaggcgt taaaacgcct gcctgcctga ccttaaggcg acctccgcct 60 gcctctcgcc tcaatccgcg ccttatgcta aaacatacgt gaactttttt ttcaatttct 120 taatttgatt ttcatcaaat tgatagaaat gagaaaatta gaaatttgta aattcacgaa 180 aaaaaggcgt aaggcgggca ggtttcaggc aggcgtcagg ccctgaaacc gcgcctgcct 240 accatggaag cccta 255 // ID RC35 repbase; DNA; CEL; 216 BP. XX AC . XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE C.elegans satellite-like repetitive DNA (a consensus). XX KW SAT; Satellite; Simple Repeat; RC35; Repetitive DNA. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and La Volpe A.; RT "Molecular and genomic organization of clusters of repetitive DNA RT sequences in Caenorhabditis elegans."; RL J. Mol. Biol 226(1), 159-168 (1992). XX RN [2] RA La Volpe A.; RT "A repetitive DNA family, conserved throughout the evolution of RT free-living nematodes."; RL J. Mol. Evol 39(5), 473-477 (1994). XX RN [3] RP 1-216 RA Jurka J.; RT "RC35."; RL Direct Submission to Repbase Update (JUL-1999). XX DR [3] (Consensus) XX SQ Sequence 216 BP; 67 A; 43 C; 17 G; 88 T; 1 other; ttttttcact tgaaaatttg aatttcccgc caaaattttt ttcacttaaa atttgaattt 60 cccgccaaaa atttttttca ctgaaaattt gaatttcccg ccaaaatttt tttcactgaa 120 aatttgaatt tcccgccaaa anttttttca ctgaaaattt gaatttcccg ccaaaaattt 180 ttttcactga aaatttgaat ttcccgccaa catttt 216 // ID LINE2C_CE repbase; DNA; CEL; 3328 BP. XX AC . XX DT 10-SEP-1999 (Rel. 4.08, Created) DT 10-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE LINE2C_CE is a non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW CR1 superfamily; LINE; LINE2C_CE; ORF1; ORF2; KW reverse transcriptase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3328 RA Kapitonov V.V. and Jurka J.; RT "LINE2C_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC LINE2C_CE is a non-LTR retrotransposon related to the CR1 CC superfamily. CC LINE2C_CE does not generate target-site duplications upon its CC integration in the genome. XX SQ Sequence 3328 BP; 926 A; 943 C; 435 G; 1024 T; 0 other; tgccttgctt caattactcc aaaacctcgt ctccgtcggg atctcatgcc cgaagaactt 60 tctcaacttc gtgagtccag gaagttctgt tatgaagaaa acatgaaggc cggcaagtca 120 gtttacatta tgaaagatat cgggtacacg aaaaacccca aacctaggga tttttatgca 180 gctcaaccta ctgctctcta aattctcatc cattttctcg ctctaccacc caaaccaata 240 gtccaatctt gaattgttct tattcttctt catcttcccc tctctctacc tcattaccgg 300 tatgttcgac tgacaagatt gttttacata ctttaaaatt ctctctggca aatgttagag 360 gtgtgtctgt tttgcctaaa ataatcatgc ttaaagaatt cttctcctct tctgacatag 420 acatccttct acttacggaa acatttttaa acagctcagt tccaactagc ctcttctcta 480 ccgcaaacat atccttcatt agattagatc gtaacactga gcttcattct aaagtgtctg 540 gaggcgggat tgtcataatg cacaagtcct ccattaacgt atctcttcga gatattaatc 600 tcgaaaagca ctatcccaaa catcgctgcg aaatccttgc agtcgatatt ttatcttcag 660 atccctctca tcccccatta actatagtcc ttatttatag acctccttgt tgttcggccg 720 ctgaaaacat tgcactttgc acacatcttg aagcctatat ccccctctcc aatattattc 780 tcactgggga tctcaatttt gctttaatcg aatggtcaaa taacactaca tcaacgcatc 840 ccttttccct ttttctaagc tctaacaacc tttcacaaaa agttaggttt cccacgcgta 900 tctctagaac ttccagtaac atccttgacc tagttacttg ctctcccgac gtatccgttc 960 ttgacctctc cccacgtcct gccgttttaa actctgacca tctttcggta gagttcaaaa 1020 tcccacttca tccatcctat cataaaccaa taataggagt ttcttcccaa cccgccaccc 1080 ccactaaaat ttttgactac aaaaaatgca attttcctct actcaactct gacctttctc 1140 gcgttaattg ggattgggaa ttcagcgctt tcaaaacggt ttccgaaaaa tatgagcatc 1200 ttcttactat cctagaaaat ctcttgacat tacactgccc cacaaaatct cccaatttag 1260 gcacttcttc agcaccagct tccaatttac ttcgcaaatt aaaacgaccg agaaaaaaga 1320 ccaacaagct ccttctctct agaaattccc cttttgaaac tataaaaaat gcccaagtca 1380 aatatcgaaa actctaccgc caatacagaa aatcagtaag gtgttttgaa aatgatataa 1440 ttaacaactc aaacttttcc aaagtaaggc gtctcatcaa tagccgcctc aaaatacaag 1500 aaaaagttcc agctattctc gttgacaata aaccgattgt caatgattcc gatataagcg 1560 aagtatttgc caaaatcttt gcctctcaat tctcgcctga gggcaacctg tctcccaatt 1620 cccccccccc ccccccacat aatcaaccca attcgccccc cgcgcccaac aagtcgccca 1680 ataactgtac acaagcagat tccttcctac cgtatataat tgaaagtgtt ctcagcaaac 1740 ttcctcccaa gtgtggtttc tccccccatg ctgccaacta tctagtgatt aaaaagtgtg 1800 ctacacctct cgcattgccg ctctctataa tcttcaggca gtctttttct gactccgcta 1860 ttcccaatcg atggaaacac gccgtaatta ttccaatacc caagaaagga aacccctcct 1920 ctcccagtaa ttatagaccg atctctttga cagacccatt tgctcggatt atggaacgca 1980 tcatttgttc tcgtatccgc tctgaatact ctcatctgct ctctccccat cagcatgggt 2040 tcctcaactt cagaagttgc ccttcctctc tagtgcgctc catctcactt tatcactcga 2100 tcctcaaaaa tgaaaaatct ctcgacattc tcttttttga ctttgcgaaa gcatttgata 2160 aagtatctca cccaatcctc ctaaaaaaac ttgccctgtt tggactagac aaacttacat 2220 gctcttggtt caaggaattc ctgcacctca gaacgttttc agttaaaatc aataaatttg 2280 tctcaagtaa tgcttacccc atttcctcag gcgtccccca gggctccgtg tctggccccc 2340 ttctctttat actatttata aatgacttgt tgatcgacct cgaacctaat atccacgtct 2400 catgctttgc ggacgatata aaaattttcc atcataatcc ctctacgctc caaaattcga 2460 ttgacactat agtcaaatgg tcaaagaaaa acaaacttcc tcttgcccct gccaaatctt 2520 ctgtcctctc tcttggatcc cgtaactcca accacacgta tagagtggat aatgtcccca 2580 ttcttccatc ctccacagtc cgcgatttag gtcttatcac agattttaaa ttaaactttg 2640 aacctcatat tataaaaatt agctgcctag ccatgctaag ggctaagcaa attctcaaag 2700 cgttctcctc taactcccac aagttttata gccacttatt taaaacatac gttgccccca 2760 tcataaacta ttgctctgag atctactccc cttcccccaa ctcctcactc tctgccatac 2820 ttgaaaaacc gcttaggacc ttcaccaaac gcgtcctgca aaggtgcaac gttaaattca 2880 cctcttacga aaacagactg tgcatcatgg aactattctc caccagacac actagaatta 2940 aagctcaaat gaaactatta tatagacttc ttaccggttc tactcatttc tctaaattaa 3000 ctcagtttgt caagttctct aattccaata ggagacccat gattctggtt aggaaagaca 3060 catgcacctc tcatttcttt gccaaatcca ttcctatttg gaacaatctt gtcaaaaata 3120 taccggtatt tctctccccc tatcagtttt ccaactttct cgatctcaat attccccggt 3180 attaactttt atccgctggt tctctttgac ttctaattga tctcttctac cacattaccc 3240 tctctctcct ttcccgtccc aaactcgaga cgtactcttt ctgggttagt ctcgttttac 3300 cgttttattt aacactcaaa attcaatt 3328 // ID CeRep53 repbase; DNA; CEL; 69 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE C. elegans repeat 53. XX KW MSAT; Satellite; Simple Repeat; C. elegans; CeRep53; KW minisatellite repeat. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-69 RA Jekosch K.; RT "C. elegans repeat 53."; RL Direct Submission to Repbase Update (25-APR-2002). XX DR [1] (Consensus) XX CC Found 150 times in the C. elegans genome. For further analysis CC compare http://www.sanger.ac.uk/Projects/C_elegans/repeats. XX SQ Sequence 69 BP; 12 A; 16 C; 15 G; 26 T; 0 other; gtcctgacag aataccggca gtattctggg gtctcattag gttttagttt cttctgtgaa 60 tctcctctc 69 // ID CER16-LTR_CE repbase; DNA; CEL; 544 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER16-LTR_CE is a long terminal repeat of the CER16_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER16-I_CE; KW CER16-LTR_CE; CER16_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER16_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 544 BP; 109 A; 160 C; 101 G; 174 T; 0 other; tgttgtgact tgggcccctc agagcccatt cgtcccaact ttgctgatgt atgattggtc 60 tggctcaccc actcgccacc aatcatgatc agcccattca ctcaataatt ctatttcgtt 120 cattaacgat tgctcttagt aattcccatt tgaattaccg ccaatatgtg tgattacgag 180 tgcaagcgcg ctcctttgta cacttgcgtt actccggagc atcgttgccc gtaaatccac 240 acaataattg cctccacttt ttaggccagt tcattcattc gcttcgactt cgccaagacg 300 catcgccggc tagccaaccg acgatggatt gacccgggat ctttacctaa ttctatcctg 360 tgtgtctcat tcctttttat ttcctcaata aattgggatt cgtctaaact gtgtactgtt 420 attcctcatc gcaccgctca tcttcactcg ccgctcgact attgctccgt gtcctccgcg 480 cgtattggaa tacgtcgttt tcacccgatg ggttgactct cccattcggt ctcgggtgtc 540 aaca 544 // ID LINE2D_CE repbase; DNA; CEL; 2651 BP. XX AC . XX DT 10-SEP-1999 (Rel. 4.08, Created) DT 10-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE LINE2D_CE is a non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW CR1 superfamily; LINE; LINE2D_CE; ORF2; reverse transcriptase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2651 RA Kapitonov V.V. and Jurka J.; RT "LINE2D_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC LINE2D_CE is a non-LTR retrotransposon related to the CR1 CC superfamily. Its 3-tail is composed of (AATTC)n. XX SQ Sequence 2651 BP; 712 A; 704 C; 343 G; 890 T; 2 other; gacctccctg ctgctctgtc tctgaaaacg ctgctcttat agctcatctt gagagttata 60 ttcctcttag tcgcacgata cttgctgggg atctcaattt cccacaaatt aattggtcct 120 ccgccaattc atccttacat ccatttgctt catttctcag ctccagcaac ctttcccaaa 180 aagttcactt tcctaccaga atatctgcct cctccagcaa cattcttgac ttagtggcct 240 gctcagctga tatctctatc ctccatctct cccctcagcc agctcttctg aattccgacc 300 acctttcggt ggaattcaag attcctatcc tctcccctat agactctccg tgtcctactt 360 cccaagactc caatagttct cccttgtttg actataaaaa atgcaacttc cccctcctca 420 acagagatct tgcgcgtgtt aactgggact tcgaattctg ctctcaaaaa tccgtttcgg 480 ataaatttga tcgcctactt tcgattctta acaacctcct tactcaacac tgtcccacta 540 agtcaattag tgtctctaag tctccttcac cttatcacaa cattcatcga aaactgaagc 600 gcctaaggaa taaaatcagc aagcttattc tttcacgtaa atgtacattt gaaattatta 660 aaaccgcgca atctaaatac cgaaaacttt atcggaacta tagaaaatac gtgcgccgtt 720 ttgaaaacga tataattgcc aactcaaatt ttaccaaaat aaggcgtctt attaatagtc 780 gcctcaaatc tcagaacctc gttccagcca tcatccttga caataaatcc attgtcaatg 840 attcagagaa aagcgaaatt tttgctaaaa tcttcgcctc tcactactct gacgattcca 900 gccagtctca aaattctcta catagtcctt cacccacatc ccattcctct cttcagaatc 960 aaacatccac atctcttcat gtagactcat ttcagcctta tatgattgaa agcgtattgt 1020 cgaagcttcc ccctaaatgt ggtttttccc cacactgcgc caattatcta gtgctcaaga 1080 aatgtgctac tcctcttgct ttccctctat ccattatata caaacaatca tttgccgatt 1140 ctaaaatccc tgactgctgg aaaaaggcaa taatcatacc aatacccaaa aaaggtaatc 1200 cttcttcccc cagcaactac cgcccgataa gcctcactga cccctttgtt cgtgttatgg 1260 agcgtattat atgctctcga attcgcgctg attttggcca tcttctgtcc ccccatcaac 1320 atgggttcct ccactacaga agttgtcctt catcattagt tcgatctgtt tctctctacc 1380 attcccttct aaagaaggaa aaatctctcg atatcatctt tttcgacttt gctaaggcat 1440 ttgataaagt ttcacatagc attctcttag gaaaacttgc taagtttggt ctcgactctt 1500 taacttgttc ctggtacaag gaatttcttg aaaacagaac tttctccgtc aaaatcaaca 1560 aatttgtttc gaaaaactcc tatcccatct cttcaggtgt cccacaaggc tctgtttcag 1620 ggcctttact tttcattctt tttattaatg atttattgat tgacctagct cccaccataa 1680 atatttcctg ttttgctgat gacgttaaaa tttttcacac tgatcctact ataatacaaa 1740 actccattga tattatagtt agttggtcca agttaaatga acttccttta gctccaacta 1800 aatctgcgtt gctagctctt ggcactcgta acaaaaacca atcatattct gttgatggtg 1860 tccccattcc cccttcatcc actgttcgtg acttaggtct cattactgat tgcaaactta 1920 agtttgatcc ccacattgct aaagttagct gcctagccat gctcagggca aaacaaattc 1980 tcaaggcatt ctcctccaat acacctaagt tttatgccca cctctttaag acttacgtgg 2040 cycctattct caattactgc tctgaaatct atgcccctcc acctaaactc attgctttca 2100 gcaaaacttg aacaacctct aaggcacttt accaagcgcg ttctgcaaag atgcaatacc 2160 aaattctcct gttatgaaaa tcgactcagt attatggagc tccactccac cagacacgat 2220 agaattaaag ctcaattgaa attactctat agacttttaa caggtacttc tcatttttct 2280 aacctcaatc aatttgtcac tttttctagt tcaaatagac atccaatgat tttagttagg 2340 aaggacaaat gctccacsca tttctttgct cttatcatcc cagtctggaa caatttattc 2400 aaaaatgtta cggtttttat gtccccatat cagttctgtg aatttattga tctcaacatc 2460 ccccggttat aactacactt ttacctccat tctcgtcttt ctccccgttt cacctgttcc 2520 ctattatctt tttttttttt ttttttgtaa cttttactcg agttgaactc ttttagggtt 2580 cgtctcgttt tatttgttta tttctcttct tttagttaaa taaactcaaa ttcaattcaa 2640 ttcaattctg a 2651 // ID MIRAGE1 repbase; DNA; CEL; 5625 BP. XX AC . XX DT 18-NOV-1999 (Rel. 4.1, Created) DT 27-MAY-2011 (Rel. 16.05, Last updated, Version 3) XX DE MIRAGE1 autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; transposase; KW Autonomous DNA transposon; TIRs; Mirage group; Mirage superfamily; KW MIRAGE1. XX NM MIRAGE1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-5625 RA Kapitonov V.V. and Jurka J.; RT "MIRAGE1."; RL Direct Submission to Repbase Update (31-OCT-1999). XX DR [1] (Consensus) XX CC MIRAGE1 is a very young autonomous DNA transposon. There are more CC than 10 copies of the element fixed in the genome. Some of these CC copies are almost identical. MIRAGE1 generates 2-bp duplication CC of a target site upon integration in the genome. MIRAGE1 has CC 53-bp TIRs and encodes transposase (several exons). PALNN1_CE is CC a nonautonomous form of MIRAGE1-like transposon since it shares CC common TIRs with MIRAGE1. CC Since its originall discovery in 1999, Mirage transposase was not CC significantly similar to transposases from other superfamilies. CC However, a massive explossion of the number of protein sequences CC annotated in GenBank in 2008-2009 resulted in the PSI-BLAST CC convergence of Mirage transposase sequences with transposases CC from the En/Spm superfamily. Therefore, Mirage transposons should CC be viewed as a distant (nematode-specific) group of En/Spm CC transposons. XX SQ Sequence 5625 BP; 1762 A; 1092 C; 1077 G; 1694 T; 0 other; ctcagataaa ctggagtcac ggcagagcga cagaaaatca cttctaatca gcaaaagcta 60 gtaaagccat agtttatcag cagagaacca ctgctaatta gcaggtcatt gctaattagc 120 taaaaagcac tgctaattag cagaaaccca ctgctaacta gcagaacatt gattattgat 180 tttattttgt tttttcttcg tcgtatttca catatttcat ttttttgacc ttttttattg 240 tattttttgt gtgatatgtt atcaatatgt gttaattgtt aattccagtg tgtttttcaa 300 agatttgtat cattcccggc tagaaacgga ggattatggc cagaaatcca agaacaacag 360 ttcgaagaat tttgaattca gcaccccgga gtcttctgat tcaaagcgtg acaatagttg 420 ctctaggtaa attttttaaa tgattttatt gggactaaat aactactttc agacccgacg 480 gacaataatc atggttctca ccagccggga agcaatggtc catcaaattc aagcacccag 540 gccgataact tcgagacgga aacaactgca gatgaagaac gacaaaagga agatggtgta 600 gaagatgaac tcgaagacga agaaacaaca ggattcattg atgaagacgg ggaaattcgt 660 tatccggtcg atgaagtggg aaattacgag atcgatcaag aagctgaaac cgatgagtca 720 ttttatttgg acactcaacg cgattcttct ctatgcgacg agcgaagtca actgtgtcgt 780 tctctgagcg atcaaaatct gagaacaact tttttcctca tcactactac caactcgaaa 840 gcagccattt tgagatatct acattttcac aatgtttcac aagatagcat tctcagaaaa 900 ttgctactga atgactctca tctaacactt gtagaagtat gtaatttatg tgcgcagaat 960 atcgaactgt aggtttatta taaaactaaa tttctgaaat atatcattac agatgcaaat 1020 gtgaagatat gcaagagaaa tgccaactaa ttttcggaga tctcaagaaa tggctcgaag 1080 ttatattgaa tagttttgcg aatatcatga tgaagactaa aacaactatt ttggaaggat 1140 ataatagctc cagtccaata aataagggta tctttaaaaa aaagctactc agcgaaatgg 1200 cagataatga aatgtggatg cacttcgaga gtagcttcga cggaatcaaa gtacataaaa 1260 atggaaagta agtatcttag tttgatggct ctaaaaattt taattggttt tcagtcagaa 1320 catttggcca tatagtttac tcaatttgga tctcgaagac gttcatcgag cttctcccca 1380 ggccctcatt ttagcagcgt tattcatcgg gttcaaaaat ccgacaacga aaattcatga 1440 tagacttaca caatggattc tgctccaaat ggacgaacat gtcttcttct cagagggagt 1500 ggcctggaaa gcggatctca cgtgtgccaa tcatgacgat ccagtgagat gaaaattcag 1560 aattttgaaa acaagtgttt ggtttcaggc acgacgaatc gtatacaatc agtgtggact 1620 ccgatcatca ggttcttgca acttttgctt aaaccaggaa actgagtgca aaatcaatga 1680 tgagtataca acacgtaagc atcatgaaaa ctattattaa aaaaataatc acctctttat 1740 agtacgcgaa aatctcaccg gatcgtgccc ttcgactatg aacgatggcc ttcgtcaaag 1800 aaaccagcat tctcatttcc acaagcatgt tttgtatatg catctatgcc cggttgactt 1860 gtttcattgt tttgaagaag gaattttaag caatatcaat acaggtcact agtaatgccc 1920 gaatgctccc gaaacgtgtt catttttcag cattattctc caaagggaaa tggaatttgt 1980 tcccgtcaaa atcggtggat ttttcatcaa caccttcgct tccatctcga tttcgttcgt 2040 tgtccggcaa agcttcacag tgtactggtt ctgagaaagc actggtaaaa agattcgttc 2100 agctaatttc tgcaaagtga acacacattt agattttcga gtcgatagtt gtagcagctg 2160 cattttccgg tgaacttgga ggtgtaccat cggctatcat tctcagcatt catggacttt 2220 atcgtctttg catagaaccc caatcaatca ctgacgagac tcttcatttg aaagttagtt 2280 tttattgttt ttgcttgaaa aaagtagacc attctttaca gattgaacaa atcagcaaat 2340 ccatcgaagt tttaattgtg aaacgagcac cggaaatgtt gaatgggatc aaagtacatg 2400 tatggaaatt gtcgtcaact aacacccatt actattttgc agcaagtgtt gtaccacctt 2460 gcaagaatga ctgagctgta cggtagctta tttccattgt caactcaatg gttcgagtac 2520 ttctatcaca ctattcaaag aactttagtt ccggaaattt acaatggact gggcatgagc 2580 atcatgagaa agtgagaaaa ttagagcaca atagggagca agaatgtttt tacagaatgt 2640 cggctcttca agagatcaaa accgaagctg cattccgtct acacacaaat caccagtttc 2700 gaacagaagc gaacttgaaa atgtgctatg aacttggatt attaaggcga ctcaaacgaa 2760 tatcggttgt tccgacccct ccgcagtacc aacagttttg cgagaccggg gatgttttcc 2820 ttaatgttgt ataccacgac ggcatcagat attcgaatta ttgcaacaaa aagacagatg 2880 actcaaatgt tacgtttatc gaatatgctc gcgtacgatt tgcaaagatt attgggatgc 2940 tgttgaagga aaacaacact gatcttcgtt ttattttgaa gaaatactgc aacacaaatg 3000 atcatttgat tgacatatcg aacaagttat atgaagagga tgtaccgcaa atgcatattc 3060 aaaactgggt tgattcagtt tcaaaatcga gtttcggagg tcgtgtcacg ttaacaaaca 3120 cagttgtcgt ggtttccgcc gattccattg ttggacatgc agtggttgtg aagcgaaata 3180 acaactgtgt ggctcttcca ttttctcgcc gtatcagtct atcttgactt tgataaccac 3240 atatagcttt gaatttgttt gtcttttgtc tgtccccccc cccccccgct cgaaaaaatg 3300 tctgcacctc ccccttcttt gacaaatgtt tgtctctacc ccttgctcga aaaatgttct 3360 cattatgttc ttgaggaatt ctttcagatt tttaaaagtt gtttgtttta atattttcat 3420 gaatgcttat ccgctttcaa ttaatttgtg tttatttgat gtgataatct gttctgttca 3480 accctgctta tttaatctta ttttattaac agatctaaaa atcaatttca gaaaaagtca 3540 tgtcattcct cgagaaacta gtggcaaaat acgagcccat taaagaagaa aaaccaaaat 3600 tagttccaaa gccgacaaga aaacggcaac cggaaactac ggaaatcgtt gtgaagactc 3660 gccgtattac tcgccgaaat gctaaggtta gtggaaggag ccacataaca ttaaatgtga 3720 acccaaagct aacctattcc tatgagtttt aatatttctg tcattttcag gcacctgtaa 3780 aagaagagga gattggacat caaaaagctc aaatcacgat cgtagatgac gaagagttta 3840 tggagcaacc tgtacaggag ccgattgaga ttgaagagga gcaacaagaa gaatctcatc 3900 tggaaattgc tcttccggag caaccaaatc atgagccaat gcaatcaagt ccatcgttgc 3960 ctagaaaatc gttcaaaccg gatatgcatc caattgcgaa ggtgtttact tctgctaatt 4020 tcgagttgct cggaaataaa ttaatttcag aaaggcggaa aaatctgtat tgtagatcta 4080 ggacttacat ctcacacact ccccaccgaa caaactccag gattcttggt aagttttttt 4140 ttccaaaaat ctggcatgat caaaattgtt gacttttaga aaagactcgg agagaacgtt 4200 ggaggatcaa ttgactatag ttatgaatcg attgacgagt tgatccgcac tgctccggat 4260 gcttggtcta aagcttcaat gggtgcaatt cgaggtgtta tgctcatggt cagttttagt 4320 tactatcttg aatttgcaat taatattgga gaaaatttca gcaggaactg gttactgagc 4380 gatcaatgca cccgaaaagt aaaccaatgg agaagaatcg tttacttgta tgtttataag 4440 aatggaacga ttttgagcat ttgaaaatgt tatttacagg gtgaagaaat ggcaaaattg 4500 aaccctaacg attttctgaa ccacaaactg ctggaagttt cgagtttgga tcaaactggt 4560 ataccaagaa cataaataaa aaccgccccc aagaacattc tatttcagtg tttatcaacg 4620 tcaatctcgt gttttcggcg ctgagcattc aacatggctc ggttagatca attgtcaatg 4680 ctatgtcaaa tcactggatt gactacgtga gcgatcaaat attacgttct atttttgact 4740 atttaacatt tcagctgatt ccgagagagc aacaaggatc cttcacgaca agcaaaaaga 4800 cggagcctct tactttcttc ccaggagact tgacaaattg gccagccggt tagtttatga 4860 tttacatggg cgagccacat aaaaatcgtt ttcagctctt ctcgctgcgg caatcaagtt 4920 ggacaacgac caggtggacg ttactcgcct tccaagtgtg gtcaatatgc tgttcaagca 4980 tcatatcaaa gtcagattgt aggtttcttg ttatttactt taaaaagcca gcgaatagaa 5040 tttttcttta acttgaaaaa ctatgttttc acaaatgtct ttttgcagcg atcatgcccg 5100 tggtttgtat cgaacaaaac acgacggcca gaatccacca aaaattgtgc tgccaaagcg 5160 tcccatcatc gattgcacac aattctacaa gagcagcgag gatatcgatg aggagcaagt 5220 tccatcaact tcgaactttc aatcatcttc gaactctcta tcaacttcga aatctgctgg 5280 aagggaaaat atcccagact ccccataaat tcactagttc tcctgttttc tgatactttc 5340 gaaacttgtt tttttgtttt tttttccatt tttcgttaaa tgtttttttt ctgttgtttt 5400 ttcattttgt ttcattttca taataaatcg tgttattttt tatggtttta taaaagttta 5460 atccaatatg aaagcaaaac tagctagcaa gaaagcataa ctacgtagca aaacatcacc 5520 gctaatcagc ggaaaagcac aattactgcg gttgactaag ccgtgacata aatgctaatt 5580 agcagtgatt ttctatcgct ctgccgtgac tccagtctat ctgag 5625 // ID HATN1_CE repbase; DNA; CEL; 265 BP. XX AC . XX DT 06-NOV-2003 (Rel. 8.1, Created) DT 06-NOV-2003 (Rel. 8.1, Last updated, Version 1) XX DE HATN1_CE nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8-bp TSD; hAT superfamily; HATN1_CE; nonautonomous DNA transposon; KW palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-265 RA Kapitonov V.V. and Jurka J.; RT "HATN1_CE, a family of nonautonomous HAT transposons."; RL Repbase Reports 3(10), 185-185 (2003). XX DR [1] (Consensus) XX CC HATN1_CE is an old family of nonautonomous DNA transposons. CC HATN1_CE elements are flanked by 8-bp target site duplications. CC HATN1_CE copies are ~90% identical to the consensus sequence. CC The consensus sequence forms a nearly perfect palindrome. XX SQ Sequence 265 BP; 56 A; 79 C; 75 G; 55 T; 0 other; tagggctccc atgcggcgtc gctcaaacgc cgctccacgc cgctctcacg ccgcgatcca 60 aatctttaaa gcgatgcgcg gaacccaaaa agtgtcggac gggagccaag tttgcaccaa 120 atagtgacat accctatcgc actatttggt gcaaacttgg ctcccgtccg acactttttg 180 ggttccgcgc atcgctttaa agatttggat cgcggcgtga gagcggcgtg gagcggcgtt 240 tgagcgacgc cgcatgggag cccta 265 // ID CER12-I_CE repbase; DNA; CEL; 10993 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER12-I_CE is an internal portion of the CER12_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER12-I_CE; KW CER12-LTR_CE; CER12_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER12_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 9391..10011 CC /note="predicted Integrase region" CC gene 6531..6857 CC /note="predicted Protease region" CC gene 8385..8784 CC /note="predicted RNase region (frameshift)" CC gene 7404..8064 CC /note="predicted Reverse Transcriptase region" CC primer_bind 1..18 CC /note="predicted Primer Binding Site". XX SQ Sequence 10993 BP; 3266 A; 3125 C; 2197 G; 2405 T; 0 other; tggtgcatcg accggatcgg atcttatccg aaaaacgagc ctgccaagcc agctcgcgac 60 acgaaaggtt tgcagaaatc gagccttgct cgacaaaatt gcctgccggg caaaaaggac 120 tcccagatct ctctggtcga gaacaaaaaa acatagcaaa ttgcctgccg ggcaaagtgg 180 actcccacaa cagctggtag agaacacgtc tgaaatctct attaaatttc ctgcgttgca 240 attggactcc ccttcataca aggggagaac attttcgcct gccgggcgac agcttctcac 300 ttcacaagtc gagaacattt gagatcctca atcactgaga tttcaaatta attgtaatgc 360 caggcaaatc ataatacgag attttcagac aatcacgagc cccaggctac acgttaagag 420 atcagaggtg cactaaggat gttgcgcagc acaggttttc tacaaaccga gaactgagcc 480 gctcaaagcg ctccgaaact atcgctcaaa cgcaggtttc gtagaatcaa gcagatagag 540 ccgcccatca tcgggaaagc gctccggagt cgacattttc ctccgcaaat tcatcattaa 600 tctccttaaa aaattctacc actgaacacg atctaataat cacaattgtc tcggctgccc 660 aggagttggc acatattttc ctgcaaagat gaagccgatc gacgtctcct aatgaaagtt 720 ctcaactttc aacctgaacg ctttatcaca tattgcgaaa tcacaaccag gttccccccg 780 gaagaaaccc ggggaacact aagctcttca aaagagcatc tcctgaaagt ttacatcaaa 840 ttcggacggc cttctccgaa cccaattcca ccaaggaaat taccgtttca tctccgccgc 900 aatacgactt tcttgctggc tacggtacac aacggctcaa tgcggacata gcatcccatt 960 tgacgcgcaa tcacggactt ccacaaattg gaatcgaatc ctaaaattct tttcgaaatt 1020 tttcgaaatt tttcgaaacg tttcgaaatt tttcgtataa taaccgaacg gaaaactctc 1080 ctcggtcctc gctgattcat cataataatc agcaaaattt taataataat aatcattatt 1140 atttcaaatc aattcacatc aagctccagc agcttataac gcctgtcgtg cgaagtggac 1200 tccctaaaat acaggtcgag aactaattca atagcaaaag atctaacaat gagtattacg 1260 tgtcatatca cagctgacct ccagtcacat atctgtcgaa acatttccag atattaaacc 1320 tctctttttt cctctttatt attttgtgtt acacgcacac tgaccgtcaa tttccttatg 1380 aatttctgat atctcggagc gcttatcacg cgaataattg atttaatttc cagacaatgg 1440 agaatccgat cgataacatc acagatattc cattgaccgc gtccatcaaa gatcatgagg 1500 aggacaacac ctagattgtg gctgtccacg aggaacacgt cccagacgct gtgattgacg 1560 aggaagtgat ccaagatcca ctcgaaatgc tgtatgagcc ggctgcccgc gcgagcaacg 1620 agctagaaaa gagccaggaa aaaggtatga ctactctcag tggttaaaag tacaaccagt 1680 tgttgttaca gctctcgcct acgcgatcga ggaagtgatg gctccgatca tcggagggct 1740 caagactgac gtcacaaaaa ttctggacaa tgtctacaac gccgtcaccg gcttgacaaa 1800 ccaatgggcc gacgtccagc agtccatgac tatcaacaac accactgaga gccaagaaac 1860 atcccaccag gctgcccagg caggcgctca atcgagcagt aacgtaagac acgatcataa 1920 aacactcata aattgcatac actgcagatc tgtcagctat gtgccgctaa ccacgacatc 1980 agcaactgta ccagctacaa taactggatc actcgacgcg aacgagcaga agcgctatcg 2040 atgtgcaagc actgtctcac cgtcggtgtc actgatgacg actggcatca cgaaggctgc 2100 ccaaaagcca aggacacttg ctccgtgtgc gccaacaaca ccaatcgtgt tggctgcaaa 2160 taccaccatg aagctctctg cacttcagtc aagaccactc gcaacagacg agcttgtaag 2220 gtgacgcata ttacactaat ataacacggt ttaaacaata actattgtag aactcacgca 2280 aagcaactga ccgttcgaga aattggagag gctcattctc acaacaacac gccggaacga 2340 cttacttccg tcgttaggaa caccaacgtt cctcttcact attgttttac gctgcccagc 2400 agcaccgttg ttatattttg ccgttaaaga atttcattct tgttatacca cctaccatct 2460 tcatgtcttg attgctcgag ttcattatct cgcttattat cttaaaactt tcttttttga 2520 tgatacgacg gatgcaagaa gtattgtgat accagagtcg agacacctct cctcaaaccc 2580 aagtcgagcc agtataaaaa taacatacaa atgagtcgag ccagcctcct caagaattaa 2640 atcatgaaat agagtcgagc cagtctcctc acaaactcaa tcatcaaacc gagtcgagcc 2700 agcctcctca ttatacgaga ataaccgagt ctcaagcact cgtaaacaaa aaagattcca 2760 aaagccgtga ttgtaaagcc tgctccgccg cagtcgagcc atggacgagc gctcattcct 2820 cgtccaacca ctgacgagcg tccactcctc gtctgacgag tgctcactcc tcgtccatcc 2880 actgacgagt gctcattcct cgtccaccca ctgacgagcg ctcattcctc gtccacccac 2940 tgacgagcgc tcattcctcg tccacccact gacgagcgct cattcctcgt ccatccactg 3000 acgagcgccc attcctcgtc tagaatgcgg acctactcaa agtcaccaac agcgtggctc 3060 agttcgcact cgaacgcaca acagcaagta gccaagcata cttgaacagg catagggagc 3120 tgatgcaaat ccccctatgg cttattgctc ctaatcagga cagccgttat tcacacacaa 3180 tcaataacga gcatcgtgga agcgcatatc acgccaagtg aaatacgtga gaggacttct 3240 acagtcccgg ctcagacgcc aatgatcatg gtgaacaggg agctagagcc ctgttagtcg 3300 gaagttgttc aaaatctctg gatctgatcc agaatgcgca taggtggtag cgtgtccact 3360 tctagatcaa aagtacgccc atgcggaaaa gtccaacatc gacgggtggc ggacataccc 3420 gcacatcaga ggcggaagac aactaacccg tcttgtaacg acgacctact taacggtctc 3480 ttacacatcg aaggaggttg aaaagtcggc tctcttgagc cccgctccga atttcttcga 3540 gccctccgtg agcgcccacg agccgccggg atagtcgtgc cattccgacg ctgtcctgta 3600 ataaaccgaa gacttatcgg tgtcagccag aagaacgtgc tcaacgcact acatatatag 3660 accttacccg ttcgaagcgt ctcatgatcg gccatcatta aaacccagac ctgttcggaa 3720 gcttctggtg agcatcgcaa agcaccttca gtcaggaggc ttttgtgctc ttagcaccgc 3780 ctattctacg aagccacgat agattctttc ttagcttctt aagcggtgca agctcatgca 3840 tcaagtgcag gaacacaaaa cttccttcaa acagaccatc actccagcag tggaaggcaa 3900 tcaaagctac tatcataccc accagatgga aaatttgaac gacaataatc actgtcgtaa 3960 cggcgagccc tcccgaagaa tcttacgaca aaacccaaga tgagccgcaa tcccctcgtc 4020 ttggtagtta ccgatccacg aagacggtaa cgcccccacg tgggctttat tttcaatgca 4080 tgaaatactg ccgacacggg tttcaaaaga gagagtagtc gacctatcct acctctttgc 4140 atccagcgta ccattcagac cggctctaat ggtacaaatt tcagttaact ggattcttat 4200 ttacccctga caatcgccca acaatgaact caaagttcga tgatcaatcc caagtcctcc 4260 cagaggcctc caacactact ggagaactcc caacggctgt ccccaaaagc caaccggctg 4320 caagcaccca aaaaattaga ggtcgtggcc gccctggcaa acttcgaagc ctagcggcac 4380 ccatcaaagg gtacctcact cgcgttacca acaatgcgaa gttttaccac ttgcaagccg 4440 gcgacatcat gcttgaccat ctcggtacca ctcctgatgc gtctaatcca gtcggagaca 4500 atgtgctccg cgctgctgat ctggttatcg taaaaaccag acaatacctg gaagcgatgg 4560 aaaacatccg cacgtatgtc gacgacaaat tccaggaaga cgctctacgc gattccccgc 4620 atcgggatgc ctactaccga gaagttcagg ctcacattga gaccatcaat ccgcaaaagc 4680 tcatcagcga cgctaaacgc gatatcaaca tgctcgagca agagctcaac gctcgcggtt 4740 ttccggtgac tcctaagatc accgaagaca ataacatcac tccatacgag tatgaatcct 4800 cagacctgtc cgaaactgat aacgaggatt tggaagaggt tgataacgca ttcgctgccc 4860 tcaaagagca tatgggttcg gagcatatca ctatcccgaa ccccactcgt cacgacgagc 4920 cacgagccac gattcctcca tcaatcgtct caccgtctaa tcataattcg tcgacgcgac 4980 cacaccgctc ggcgtcccaa ttcagtgcca ccgcacgctt ccgtgattcc gacgacgacg 5040 catggtctcg gaaaccatct cttgcggaag aattggaaga ggaacgtcac caaagacgcc 5100 ttgctcaaaa gaaagtaaga gccctccaag ataccgtaca agggtatcag caagctgccg 5160 aacaaagcca agaactggag caacagcaag tacaggagct acgcgaaatg cgagccgctc 5220 gtcagcgtag aatggatgca ttgcacagag agcgtcaatc ctctcctatt cccactctgt 5280 acttgacccc acggccaaga actcgcgatg ctccatcgcc tcaaaatttc gagcctcacg 5340 ctagcgaaac cgctcagatc acaggctgcc cagttcaagc caccactcgg cttaccggca 5400 ataacaccac tgaaattctc ggtctaaacg agatgcttgc gttcatgcca gaaatgagaa 5460 gctctatctc ccagctgaga gccgaccagc aagagctagc aagacaaaac gcttcagcca 5520 tcaatgagct gagaaatgat cggaatggcc tcgccacaga tcaaacaagc gagcaagacc 5580 aagtcgatga aactccagtg gaatcgagcc acacgcctcc tccaccacaa attaatcccg 5640 acgctcgctc gcttttcgga gtcatcaact tcgaaactaa cgccaaaaac ctaccgaatt 5700 ttgacggtag cggaaacttc aaggcattcc gaaatggctt tgaaactgtc gttctcaacg 5760 acgcacgtct ccctgaagta acaaaaaaca atttacttca aaaccatctc attggagacg 5820 ctgcccagtg catctcatat gatgacgatc caaccatggc ttatcaaatg accatgaaaa 5880 tgcttgaatc agtgtacggc aaaggcgaca cacagtccgg cctcctgcaa aaattcaaat 5940 gcctgagatt tcatcaaaca aacactgagc agatgaaact cgacctcaca gcacatcggc 6000 tgcttgctca aaggcttaaa actacaggtc tcagtcaatt cgacgcccgc atcacgatgg 6060 gtatcattgg caaactccca cccactttca tggggaaggc tgccctctga aggaccatta 6120 cagagatatg tctaactatc ccaaagacgc ctcaaccgtc atccagttat ttcgagactt 6180 ccctctccac agagagaggg gagaccgcaa attcattacc tcccaataaa tacctttggg 6240 caggtacggg gtcgaaagta cagaaccccc taccaacagc tgccttgcca acagccccta 6300 tcaacagtgt ctctatcaca aattcacatt atcctcgggg atcagacact cgtctagctc 6360 tctcatacga gacaaacaca ccaaactcca cgtcacacgt gtcagagttc gcagctttcc 6420 tgtcccggac agacgtggtt caccaagtga ccaacataaa aaacgctggt aacgataagc 6480 tcacattcat ttgtctacac acggacaaaa atcagccact actagcacta gtagattctg 6540 gagcatcact ctccttaatt gttcatgaca cagccatcca aattggccta acgtcagtgg 6600 gatcagccag cctatccctc caaggtttcc acgatgaatc cacttcagtg tcgcatatct 6660 ttccactgcg gttaaaaata cataactcgg tgatgtcgat gggtttctta attgctggcc 6720 acgcccaact tccgtcgacc cagtttctgg cgccagttct atctgacaac gataagaact 6780 gcctcgatct tctcaagatc gatcaccgct caatcactgc ccaaaaaacc cacaacggga 6840 aaccgatcga tatgattctc ggtaacgacg tcctggcgag tctttacgcc gatagtcaca 6900 caaaaaggca ccagctgccc tctcgacgag ttatagattc aacacgaatc ggatgcatcg 6960 tacacccagt gccatcaatg gtcattcata ccgaggagaa ccatacctgg atccttgacg 7020 agactatcaa accagcgcat tctatcatgc acgctgatat ggtgctaagc ggccagtccg 7080 cagaagaccc tgatcttcaa ctccactggc aaattgaaca aatgtggaag ctcgagaata 7140 tcggaattga gccaattccc ttggttgact ccagtaaaat gtccacaaaa gacctgcttg 7200 ccgagtttga aaacacagtc caatattcca acggacaact cgaagtagcg ctgcccttca 7260 acgggaatga atcaaagctt aaaaataatt acgccatcgc gtaccaacgt ctaaagagtc 7320 taatacaaac tctcactaaa ggcaccagcc tcatcgaaaa atatcacaag gtgatcatgg 7380 atcaaaagct gtccaacatc attgaacggg tcacacccga aatggctcaa gactctccaa 7440 tagagtactt catgcctcac cgagccgtaa tcaaagagtc ttcaaacact acgaaactca 7500 gagtggtgct cgacgcatcc agcccaatag gtcgagaatt ctccctgaac gactgccttc 7560 acgcaggcac gaatctggtg actccgctat tcggaatcct cataagagct cgttgctatc 7620 ggtacatcat cgtatcagac attgaaaaag ccttccatca ggtacgcctc cagaaagcgt 7680 ttcgcaacgt aacacaattt ctgtggatcc aagacccgtc aaaaccgacg gtcgaagaca 7740 acctctgtcg ctaccgattt acccgaatcc cattcggcgt agcaagcagc ccattcctct 7800 tggccgctgc cattctccac ttcctggctc gcaacccgca caggctaaac tcgcagatca 7860 gagaaaactt atatgtggat aattgtctcc tcggcaccaa cgagtttaat gagatcatga 7920 aaaccgcaat ggctgccaaa gctatcttca aaaagatgga gatgaacctc cgcgagttcg 7980 ttgtgaacga gtccatcatc atggatagaa tgaaagcaga ggacaaagcc gagtccagag 8040 agattaaact gctaggctac acttggaatt ccaacgacaa agttgattcc ttgtccgtaa 8100 aaatcgctgt cctcgacatt gagcatccaa ctaaacgaca agttgcatcg aaaatggctg 8160 aaaccttcga tcctctcggg ctcgtatccc cgctgcaggt tttatttaaa cgcttaatcc 8220 aacaaatctg gatcagcggc gtcaaatgga aagataaaat accggtcgag ctactcccga 8280 tgtggaaaga acttcaacaa gctttcatag acaaatcagt ccacgtaccc agaagactga 8340 cattcgtaaa cgacgagtgc gaagagatcc atctcctcgt ctttacagac gcttcacaag 8400 acgtctatgc ggccacagtt tacgctcacc atatctataa aaaatgggag ccagtcacca 8460 ctcattgcca gcaaaaataa gatcaaggaa gtctctgcca cgaacttcac aatccccaag 8520 ctcgaacttc tcgggattct atgtggctcc aacctcgcgg ttgccctgac aaaggagctc 8580 cgcattccga tcgcatcgat caagatcttc acagatagct catgcgctct gtactggatc 8640 ctatccagca aaaacaaaag gccttgggtt cacaaccgag tatcaaaata ctacgaaaac 8700 cgtgatatca tgcttcaatg cgggattcca acaacaatcc atcaagtacc tacgaaggta 8760 aacccggccg atcttgccac ccggggaatg actaccacag agcttcaaaa cagccatctc 8820 tggtataaag gccccaaatt ccttcgaaaa cccccggaag agtggcctca aaaaattgaa 8880 ggcgaagtta gctgccctgc taagttccga gacttagtat actctgagat cgtggattcc 8940 tcttccaaga agaagcgtaa acctctcatc gagaaacctc tacaagatac agagatcgag 9000 accgttctac acgtctccac ttcccttcag gaattcgaac cgctcattcc gttccactac 9060 acaaattctc tcacaaagct gaccagcatc gtgtactgga gactaactat gctgtgcaaa 9120 ttacttccca aaaaagtctg ggaaagcgac accatgctcg cattcagcaa ttgcacgtct 9180 tatctcatga gacgtaagct tgcgaggaaa atcatcatcc tccagcacta caaagagtcc 9240 gaagcgcttg gtctaaaact gccaccggat ctcgaatgct acacagatag ccatggattc 9300 ttgagagttc tgaaacaaat cacctcaaat gccctgccac aggaagcaaa cgaacctgta 9360 gtcatctaca aagaccaccc actagctgcc ctaattatgc gtgaaactca cgtcatcaac 9420 ggccatagcc ccgaactata cacagtctcg gctattaaaa cactattctg gatcccacat 9480 gtgaaggcgc tggctaaaac agtagtctcg tcctgtgtcg actgcaaaaa ggtccatggg 9540 ctgcccttca aatacccaca ctctaaaact ctacctgata ctcgcaccga gccgtcgaaa 9600 ccattcgcaa aatcaggtct agattacatg ggtcccgtcg aatacatcaa agacgacggc 9660 atcacaaaag gcaaagccta cgtcctggtc tacacctgcc tcatcacgag aggtgcagtc 9720 ttacgagtac taccagacgc aaccacagaa acctatctca tgggattacg gtcaatcttt 9780 catactgttg gcgtcccaac ggatatctat tccgacaatg ccgccatctt caaactaggt 9840 gcgtcgatga taaaccagga catcatctat ggtgacgaga tgagcgagtc gctgaccagt 9900 tacctggctg cccaaaaaat caatttcttc aacacaacac cactttcccc ctggcaggga 9960 ggagtgtatg aaagggtggt tggcctcgct aaacaccaaa tatacaaagt cagcaaatta 10020 gaacatatag acatgttctc actccaatac ctcgtatctg gagctcaagc catgattaac 10080 agtcgaccgc tgacaccaca cgcacggtct ccaaacgaca tgattgcact caggcccatt 10140 gacttccaaa tccctggagt catgctggac gtaccactcg tacaaccaga agcaaatggc 10200 cgaggagccg aacaacgagc tcgctcccac ctcgcacgtc tcgaggctgc cctgaacaga 10260 ctctggcaga tctggactct tggatatcta ttccatctca gaaaagcgaa acatcgcaac 10320 aaaaagtgca ctgccctgag accatccgct ggtcaagtcg tgttaatcaa cacaaataac 10380 gtcaatcgac ataaatggcc cctcggagta atcattcaag tacacgagtc gaatgaggtg 10440 cgcacggcca cagtgaaagc tcacggcaag ctctacttac gctcggtatg tcaactgatc 10500 cctctcgagg tacaagcatc agatgacttc attccttctg tcgacactcc agaagaagca 10560 gactctgagc ctgaagagca cgaatgcgga gatttctttc aaccacccat tcccggtctt 10620 gccctcttcg actcgccgaa cacacaatac gctcccgagg cattcccgcc agatacaatg 10680 ccaagcattg aaaatcaatt aaatcctaaa cctaatagca ccggctcttt cacagaccgc 10740 gtcgcaaatc atgacacaat tggtgatgtg aacttgagca ctgaagggat cgactacaat 10800 ctcccgaacc cagagtttga cgacgtacta gagtacgagg accccaacaa tgccactact 10860 ccggaaacgg atcatgctga agctaaaagc aggcagaacc agggattaca tccctcggaa 10920 ggctaaagct ccttatgtca actacgtaca ctctctggca gttcgggttg agtcgacccc 10980 ctcccccccg gaa 10993 // ID CER16-2-I_CE repbase; DNA; CEL; 8234 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER16-2-I_CE is an internal portion of the CER16-2_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER16-2-I_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER16-2_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 8234 BP; 2408 A; 2190 C; 1489 G; 2147 T; 0 other; ttggtgcatc gacctaaccg gctcgtgaat cactaagttc cgagccggaa acggatccag 60 agtcagggag gcacgtggcg cggattccag cggtcagagg taagtcgatt tctttcaatt 120 atcgtctttc ctaggttcat tttcgactat cccattcgaa aatggatctt tacagtcata 180 aaaagctaaa tttgactcta tcatgctcaa aatctggttt tgtttgtcaa ttttcgaaaa 240 aattcgaccc gcccgctcga atttcgaata tttttcgtct ctgaattcat tttcgttttc 300 acacacgaat ctggattttc tacgtgtatt ttgatcctat ttcgattatc acactcgcaa 360 ctggtctcat tagccttttt atcgctctca atcacgagtt ttccgcgtta aaatagctaa 420 aattggtcaa aaaaccttac gtaatcttgc gtaaactcga ttctgaaaag aatctacccc 480 cccccccccc ccccatcctc gtcgatttat ctctcttcag cagagatttc ctccaaaatc 540 ttcgtttttg cacaaaatgt cgataaaaat gatctgaaaa ttcaattttc cctcgatatc 600 tccgcaaatt tatctttttt ccaacgattc catcacaaaa tccactgaat cacctctcaa 660 aaccttcaac agttggaatt aagattcttc gaggagtacg gtacgcaacg gtgtatgcac 720 agattctgtg tacaccgcga cgcgcaaccg cacactcgca atttcgaagc cgaatcccaa 780 aatctttcgt catgattttc gtcgattttc gaattttttt tgcatttctt ttttgcattt 840 taatgatttt ttattaaatc taaacatttt ccgtcaatta tttacgtctc cccctcctga 900 gactcgtaat taattcgttt tttctttcag aaaatcactc tctcggcgga aaatgtcttc 960 aagacccccc ttcggcattg tcaagccgtc caaaccaaca ggcaaccctt caagcggctc 1020 ctccgagttg gtaagaactt aattatttat ttcgttattg tcaaaaaaac gtaaagaaaa 1080 accttggtcg caggggaacc cctggccaat ttttgtccgg aataattaat ttcttttaca 1140 ggattcgacc gcccagccac cacaagagcg cggaacggat cttcaaagag catcgtcggg 1200 ctctgacgtc catggccacc cttctcccgg atcacagagc catgtaagga ttttattttc 1260 aaatctcgta tctcaaaaac acgtaaaaga aaaccttggt cgcagggaaa cctctcgcca 1320 attttttcct gaaaataata aaactccttt gcagaactta acttctgctc agtccggagg 1380 ccgcaatgct cataaggatc ttcacgagcg ccatgtccat ccttcacaag gatatcaacg 1440 tcatggatcc aaggctccag ctcatggcca cccttcaagc caaggagcac gaaggtccca 1500 actcaagttg tcaaacttcc cgaaacttct ggccaacctt atgcaggata caaaagccac 1560 gcaagatttt aattcaaaat accacagcga tttcaaaaaa tcaaaaaatt cgctgatcac 1620 aggggaatct atcttcaaat tttcccatgg aataattaat ttcttcttcg ggaggtttca 1680 gcgtaaaaca aatcctctcc aacagagcct ttaattgcag gacatctgcc agttgtgcat 1740 ccaatcagga cacgctacaa ccgactgctt ccactatgtc ggacggagag cccgaagagg 1800 acaaggtgaa gctctgtgga aaccgcctgg aagtcggagt caccgacaat ctggccggca 1860 ctcagcgccg caaggtgaga tatagcgaac atcctaatga aattccaccc tcatttcttt 1920 aggaacaaca acagctgcaa ggcgctcaag ccaagaagca gcaggcccca cagcgcgttg 1980 tcaaccgcaa ggccaaccat caaaaattat ccaggaagcg tcgctatgct ccgtacaagg 2040 atgatgtcat tggagcgcat ggattcttca ataatatgcg gtgatcacct ccactcatat 2100 ctgttgtcaa attgtaacca tttattgttg ttgttattcg gccatcctta acccctttta 2160 agtgatgaac ttgttgttat ctcccctcag gccatcctta actgtatgtt atcgagttcc 2220 catcctttcg tttctacttg tctcactcaa ctttcctttc attgccctca attgccctca 2280 aaacctgttg gattgtacac tcactcccac cctgggtgca gttgtatcaa aaataaaagt 2340 gtttacactt tattaccgtg acaatgttgt taactcaacg ctaatcataa gctcccttct 2400 ctgagcttat caagaacaac tggatctatt catacctaga tataacagtt ggcatcatag 2460 gcttaatagt ctctcggctt caccatcaaa actagagact cctaggtcat caaaatcata 2520 ttgctatttc gaactttcaa ctcgaaatcg tatctcatcg caacattcac tagcgacagt 2580 catagagatt ccaaactgtt ctaatgatct ctaagctcgc ggtcaccctt cagaacaacc 2640 gtgggaaatt cattagtctt aggtttcacc acctacatca aatattatga cttcacacga 2700 tctacatatc atgaggttcc cacccaaagg gcttctaact acgtagtaag tcacgtaaat 2760 ttcaaagaag gttcgtctcc caaagaaata atcttcccaa aaaacgattg tgatttgtca 2820 aacgaatcac tcaaacgatc gtttacgatt gcgattctgt cggagcttcc ccaaaagcgt 2880 ccgacagatt cgaagacgtt cttgttgctc cgaatcagaa cagaagcaat tggaaactct 2940 tgctttttca tagtgaagta ctataccaca cgaaagtact aagcctatcg agcagttagg 3000 ggttcttctt aattgatgaa cgtgatgaaa gaataggtct tgaattcacc cctagtattt 3060 agtcatgaag aatgtactaa agttctgatg cacaggccgt ttcaagggtg ttcaaggtgg 3120 cagcttcccc attcttgcca cgacttgtac tagattctcc gaagtcgacg ggcctaggag 3180 cttctagcaa tcggatgtct ggcagacgtt cagagtgtct tgtcgaattt aatcctcgga 3240 ttgactccac acattgccgg agattcagag gtgtctacta gccgtaagat ggcaaagtgc 3300 tttcacaagt cctcgatagt aattccacat gtaactggaa ataagtcaac taatacgaat 3360 agatcaatgt aaggtgctgc atcactcagc aattcagacc ttagcattga taatagacta 3420 cgagaatccc ctctcgcaaa gtccaattaa aaaatcaaaa caagcttcgc cacgtgtcac 3480 aaagtgggtg aagcctctgt aatatttcat aaaagtcaaa acaagcttga caacgtgtca 3540 caaagcggat aaagcctctg tattcattca caaaatcaaa caaagttcaa aacgtcgtct 3600 cacgttcttg accttttcaa aacacgtctc ctacgtgtcg cctacctgat ttcaaaacac 3660 aaggcttcaa atcaaatatc tctgcggctc aactacccaa ttgaacctac agtaaaacag 3720 ctcaataacc ttaaaagcat cattcagctt ccaaagcttc acagcttaac tagcttattg 3780 ctcatttgtc tgaaacaagt tcacacccct tccttttcag acatgtggag agcaccacgc 3840 catcccaata cacgtggtag tgctccgaga tttcctctcc ctgtgcgtca acgtagagaa 3900 ccacctacat ttgtagacct ggccaccctt ttctacatac ctgtcagttc agctggttct 3960 tcccccatga ccctaagatc tgaaaaagat cggacaattg aaacccgcac gtggctgcca 4020 tggatttgaa acagtacaaa atccagaagt gagttattct caagaagttc tttccccaac 4080 ttcttatgat tcatagccag ttccaccact caaccaggaa caatgggagc ttgagcaaat 4140 tacaaaaaga ctgagacact gttaattact cgatccacac atatcgtatt gtaattttcg 4200 tttgcttttg ggtaatttcc cgtcaaataa agttctcctg atgaactaaa ctgtctcagt 4260 ccgtaaatat caaacatcaa atataaaatt tcaatccaaa aattctatca cacttggtgc 4320 acaacattgg caaaaatatg tcaaaattat tcaaaaccac ataaatcaaa agttattcgt 4380 tcagttgacg tccaggtctg ttctgataaa cacaatgtca cttgatatta gacagagttt 4440 aatattactc tcacgagcaa taaaaactcg gttcagaaaa gtaaactatg ttcgaaattg 4500 accggtcatc cagatccgaa cttggtttta cggctccgct actctctttg agacattgtt 4560 aggagacttc tcgagccaca tgtcacattt ccactcaaaa aaaaaagcca acaggtcggt 4620 tcaaccagtt ggtaggtgcc atggtcagat tgcatctctg aatggttcga gaagggattt 4680 tcattaagtc attcccacga agaaatccaa caaaccacaa aaactctagc gacgtaatgt 4740 cattcgccgc agagtataca ttatccaaaa aacataatgt caaacgacgt agagtttaca 4800 ttattcggaa agcataatgt ccacttgaac catgcgtccc tggatccact cattcccatc 4860 tgatcccacc gtgatcatac cggcgggagc tacagcactc ggttcagatg caaggaatca 4920 ttttgttaga tgatcaaaga atgagggggg aaaggttttc aaaaatacac acatcatctc 4980 cacctcggag attgaagtgt acactccaac agtattgagt attcaaaaca actcaatcaa 5040 aattcactca tttcagttaa ctggatttat tgtctttgac aatcaccctg gaaatgagtg 5100 cattctccaa atcttcaaaa ggtctcccgg aagtcgacac cccgcgtgtt gataacccgg 5160 aggcacccga ggccaccctt caaggtgact tatcggctac cgctcaaggt aacccctcag 5220 gcaggcaatc aaaccgacgc tcgagacgtg gtcgcccaca ccacttgaga tcgccagccg 5280 ccactatcaa gggtaccatc acccgcatcg tcaatatgac gcgacagttc catcaggaag 5340 cgtctagaat gatgctggaa catcttggtg gcaacccgaa cgtttctcct cccatcgaga 5400 acgctatcct ccgcaatgcg aatcaagttc tcaacaaact tcggcattac gcggtttagt 5460 tggagaacct acgccaattc gtgaacgaca agttccaaga accggcactg cgagattccc 5520 ccaataggga cacctatcac aaagaggtgg aagctcatct tgcaggactc caacctcaag 5580 ccctcctcaa cgaggcaaaa cgtgacatcg tcatgttgga aagagagttg tcatctctcg 5640 gctttccagt gaccccccat gttcgtcagg acaacgtact cactccgtac gtgacatccg 5700 acgccgactc atcacaatcc gagtcggacg atctgagcag catcaatcac gcgttcgaag 5760 atctcaatac aaacatgggc tcagatcaca atgtcatatc agggaacgag cacatcaaag 5820 ttcctgagcc cagagtgacg atcacatcag cgatcctctc acatccaaca cgggattccc 5880 cgattaacat gtatcggaca gcgtcgccat tcagcacaaa tccgagattc tgatgtcgac 5940 aggtggtctc atcaatccac cttagccgac gaacttgctg aagaacagca ccaacgcaag 6000 ttagaccaaa tcagaatcag atccttagag cggaaagtga gatcgagctc gatcagggtg 6060 atacttcacc tcctcttcca gtctctagcc gagtccttgg agtcgtcaac ttcgaaagca 6120 acgccaaaca cttgcaaaag tttgacggca ctggcaactt ccgtgccttc aagaacggat 6180 tcaatgcagt cgttcttgcc aatcctcgac ttccagcagt gacaaagttc aacatgttac 6240 gcagtcacct gctcggaaaa gccttgccat gcatctcaaa tcacgatgat tccatggtag 6300 cttacaaaac aactttggac atgttagagt ctgtatttgg taaaggagac actcaaagag 6360 gtttactcga gcaattcaga acgctcaagt tcaatcattc caatccggaa caaatgaaac 6420 tagatctcat ctcccgccag ctcttggtgc aaagactgac cgcatcgggt ctctcagcct 6480 ccgacgagcg gatcaccttg ggtctcatcg gaaaactccc tctctccctc agagataaag 6540 ccacggagtt ttacactgat ttggaaaccc cttccgcaaa tattctttat gagtgaattc 6600 gcaagcacat taacagcttc gaaaatggcc tgatcgcggc gtccatgcac accacaccct 6660 ggtcatcagt ctccgaacaa gcttctgatt caccaataca cgaggatccg gctagtcggt 6720 tatttcgtga tctccctctc cacagagaga ggggaactcg caacacctcc cactaattca 6780 ttgtgggcgg gcacggggcc aaccgcacag agccccctgt caaaagctga ttcgcaatca 6840 gcaagttgcg tcaacgttcc ctctctccaa atatttaatt tcatggtctc tcataacgac 6900 cttcatggcc aacctttccc aacactacgg aatgtgtcgc cattagatac aagtcatttc 6960 actgtctcaa aacgcgttac ggttgttttt tttttgtccc taatggactc cttttttcaa 7020 catgtcatac ccaaagcgat cggtctcttc ccgaccagct ttttcaagcc agtgtcacaa 7080 ggcttccacg atgcgtcgac cgctgtatct caaattccat cactaaattt aaaatctcaa 7140 aactatcata aggtgatcat ttaccaagaa tcgataatcc actgtcgtta ctggtccacc 7200 agaatcccgc ttaaagtgcc aagctcaaca ctaccattaa gcgctgtcat cagagacaat 7260 ctctacattg acgaacatct gctcgacgca aatgattttc acaaaatcag gcctacagtc 7320 atgccagcca agttctcttt caaagacaag aacatgaagc tgagcgagtt tgttacaaat 7380 ataagctcta tcatggctca cctccttcgc gaagatagag cagagtccag aaatatcaaa 7440 ctcttgggct acgtctggga ttctagcgac atcatctata cgctgtctac agataattcc 7500 gtcttggaca ttgatcacca acccaagaga gaaatagcct tcacacactc tctcttaaat 7560 atatcaaaaa tcgtaaacag aatgatacca atcaggtgtt caatgtttcc gggcaagtct 7620 tggaaggacc ccaataacct caagaggcct tccattgcga aagttcatag aaccaatcac 7680 gcgaacgaac ttcgcaccgc taggttcaaa acaaacgaaa agtgctacct actctccgta 7740 tgccagctta tctcacttaa aaatcctact ttccaaggat tatcccactc tctcaactca 7800 ccaggagaca aaaaagtggc cgcagaaaag cacacctgcg aagatcgcac acgacggcca 7860 acattcccgc tcgatgcgtt ctgcagtacc ccgaacgtac aatacgagac aatgctactc 7920 atcgcaaaat cacgtactct atcagcccgt atgaacatca agatgctcca cttatccctt 7980 ggacttcttg ctttggaatt tgacggcatc gactccgaca tgcctaatcc agacttccat 8040 gacaatgtac ttggatacag agatcctcac acgacgactc ctgattcatc atcagatcac 8100 gccgcggcca accttccacc aggtagaacc cgagattaca ttcctcgaaa ggccaaggcg 8160 ccctatatca attatgcaca cagtctgtca gtcatagtcg agcctctgcc ctcccccccc 8220 cccccccccc ggag 8234 // ID NDNAX1_CE repbase; DNA; CEL; 2085 BP. XX AC . XX DT 12-NOV-1999 (Rel. 4.1, Created) DT 25-MAY-2001 (Rel. 6.04, Last updated, Version 2) XX DE NDNAX1_CE is a family of non-autonomous Helitron DNA transposons DE - a consensus. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW HELITRON class; NDNAX1_CE; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2085 RA Kapitonov V.V. and Jurka J.; RT "NDNAX1_CE."; RL Direct Submission to Repbase Update (NOV-1999). XX RN [2] RP 1-2085 RA Kapitonov V.V. and Jurka J.; RT "Direct submission."; RL Direct Submission to Repbase Update (MAY-2001). XX RN [3] RA Kapitonov V.V. and Jurka J.; RT "Rolling-circle transposons in eukaryotes."; RL Proc Natl Acad Sci USA 98(15), 8714-8719 (2001). XX DR [2] (Consensus) XX CC NDNAX1_CE is a non-autonomous DNA transposon, CC its copies are 98% identical to the consensus sequence. CC There are about 100 copies of NDNAX1_CE in the genome. XX SQ Sequence 2085 BP; 600 A; 401 C; 376 G; 705 T; 3 other; tctgccttct tcctatgtta aatatggaat gtgcaagggt cgtcactttt tgcgcacaaa 60 gtgtcacaaa aaattaaaaa agttgggtgt gcactttcgt gtctaacttt gctgtacctg 120 cacatttttc aattttcttt tttgtaggaa atagagaatc gctttttcta tcttyctatc 180 taaatggtta tttgatttga ttttatcttt taagagaaat tcttgatgaa acacgaaaaa 240 atcacttttt tcgcgtgtcg tgtcgctttt tttgcacacg tttagaggtc atagtataaa 300 atgtttttga taaggtaact gtatatctgg gattgttaaa aatttttagt ttttgcaatt 360 tcatcaattt tgaactgagt tatgatgatt tgaaaatcgg actcgaaaaa tcccggaatt 420 tcgaaaattg cgttgaaaac atgtttaacg ttcgtatttt tgttttattt tacattatca 480 aatgtaaatt ttaacgtaga ttccgttttt ctaaatagtt ttctcattat ccaaccggaa 540 agttacgaaa aaagcataaa ttgacgattt ttcaatgaaa ttgtgcgcat tcccccttgt 600 gtttttggtg tgcaaattag tgatttttga gattctttcg cctataagtc ggtcagtttg 660 gcactttttg gggtaaattt ttcgaaaaac gtatttttca ggaaatttac tcttcaaaaa 720 tataattttt tttacaaaaa agtcaaaaat aacttcagaa atcgcaaatt tcatgatttt 780 ctcaaaawaa tgacgttttt gctggaaaaa taaaaaattt tccactaccg aatgggattt 840 tatgttaaat tcgtaattag tacataaata ccgaatgcaa gctgaaagca cagctctcta 900 cgatcatttt caacgatttt agtttattta atttttgacc tcgaaatttc aatttccctc 960 ttttcctgga ccagtttttc actgaaatgt gaatgttatg tcctaaatct tattattttt 1020 gctatgagaa gcgagacaaa ccaaaaaatt ttagtcttga gttcctcgag ccgcgttcct 1080 gaagttgtgg gcttcaagcc ttgctcagga agctcaaaag ttcattcttg accaattttt 1140 aaattttcaa atgctataat attctacttt taacgctgat ttcaaatata tacttttctt 1200 tttccaattc ctattatgag ctgagttacg gcgctttcaa gtttcagcaa tgcccgcccg 1260 cccccctctt tcctcccctt tctcttgcct cctccctctc atgcataggg tggctgtaat 1320 atggtctcta attactgtta caggatcaga tatgtatatt ttacgacata tctcttaata 1380 gaagaacaat agacatcgcc acatttgttt ttgtaagagg tccccaatgg gctgtaaaac 1440 agtctatcca cctttcttcc tgcaagaaag gaggtaatgt atttgagact gttcatttat 1500 tgtaattgag gggttgaaaa ttattatttt aattttcaaa taaaaataca ttaaaatttt 1560 tttttaattt taaaataaaa aaatcctttt taattttaaa ataaaacatt ttttaatttt 1620 gaggtctgct ttcttcgcgg ggacagagcg ccgtaggcgc tcttaatcta aattataggc 1680 agagcaggcc gaaggcctgc tatgccagta ggctttcact gccgggtcag agcgccgcag 1740 gcgctctgaa tctaaattac aggcagagca ggccgtaggc ctgctatgcc agcctgcttt 1800 tactgccggg tcagagcgcc gcaggcgctc taaatctaaa ttgtgtactg cagggagaca 1860 ataaatattt ttaaaacagt aaaaacaaac acacaggttc gaggcgaagc cgagaacctg 1920 cgccagctac ctaatctggt agaggaagca acgcaggccg caggcctgcg gtgcgagacc 1980 maaaacattg ccaatcgcac gttcgaggcg aagccgagaa cccgcgccag ctagcggact 2040 cgccccaacc ccacctctac tagcgggccc catagggccc gctag 2085 // ID CER9-I_CE repbase; DNA; CEL; 12554 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER9-I_CE is an internal portion of the CER9_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER9-I_CE; CER9-LTR_CE; KW CER9_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER9_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 7835..8047 CC /note="predicted Gag region" CC gene 10896..11516 CC /note="predicted Integrase region" CC gene 8052..8366 CC /note="predicted Protease region" CC gene 9885..10289 CC /note="predicted RNase region" CC gene 8901..9566 CC /note="predicted Reverse Transcriptase region". XX SQ Sequence 12554 BP; 3800 A; 3226 C; 2450 G; 3078 T; 0 other; tggtgcatcg accgaaccac accagagctc ctagtgactc tggagtggaa tcgagccgtt 60 gggaaagaga gagctgtgtg cccagaacta gagcaggaat aagtaggacc tgaacgaact 120 cacgtgattt gccaggacct ctaatttatt agcgtccgct tcacgttact gcataatctg 180 gatcccccgg aggagaggag atcttggctc atgacgacga caggacgcta gccaagttcg 240 agatattcgc cttcgagaga ttctccttcg gctgctgctg cttcttggct catgacgacg 300 acaggacgct agccaagttc gagatattct ccttcgagag attctccttc ggctgctgct 360 gcttcttggc tcatgacgac gacaggacgc tagccaagtt cgagatattc tccttcgaga 420 gattctcctt cggctgctgc tgcttcttgg ctcaggacga cgacgggtca cttgctggcc 480 aacttcggga cttcttctct tcggagaacc gctgcccgct gttggcccac cgggaagacg 540 ggacgctggc caatttcgtg atcttctcct tcatccgctt ttggctcttg gctcatgaca 600 acgatcggga cactgaccag ccgctgcaca ctctccgtca tccgctggtc ttcaatcggc 660 ttcgattcgg atctccccgc gattaaggta catttcttgg ccaatttttc gaccgcacga 720 ttttcctcta cccccagcca aagatttact tggcgggcga gagaaacgca tcgaaaaata 780 gattccaaat aaagaaatga cacacatatc gatcgggata cccagaacat gaagcagatc 840 acccccttcg ttgatcatcg gaactcctct ctctctcttc gacacgactt tgaatctgaa 900 catcttgacg tttttttcta gggggtttcg ttcagcatct gcctgtaaat acaccgggtg 960 ccatgctatc tcaaaattca tcctaaaaat tctctactcc caatagattc taaatcatgg 1020 gcgagagaat tacatgaaat tgtagagatg gcgaacgcac ctttaaacga tcctgatgat 1080 caagtttcaa agccgacgtc tttcgatttg accagcactc aaacctgcaa tagcagtagt 1140 tattggctga cgaatcacct ttcatccctc cttattcggc tacgcgtcat ttggatctgc 1200 aaaatgctac tggaatttcg gggttcaaac atccgctacc caacaatcga tcactttgag 1260 ctatgacatc atcatttaat tcgaaaactt atcagctaaa tcaccagtca agttacaaaa 1320 gacgattacg tagaaaggtc agtgaattca caagaaaaac atccgctaag ctttacatat 1380 ccaaaatggc agcacaagta attcagattt tgatcattat taggaaaaca gctcagatgc 1440 gttttgtact taatgacgaa tctatcatat tccctttttt tatttttcag gcgctgctac 1500 aggttcactg agctcaggaa gtgaccaacc gtgtagattc cggaatttaa atacgcgtgg 1560 ctgatggaat ctacatgaat tgacgcagaa gtacgcagag tgaacattag tctgaggtcc 1620 caccgaaatt cacacgaatt ttgggattcc tacggatttt tgattcattt tcgctcattt 1680 tttgtgtttt ctcctctgtt ttttcgaaac ttttcgtaat attttttgtt tcttcaattg 1740 atctcttgaa tagtcatcgt caattacagt aaacagccgg atcccaccga gaatgagaag 1800 ctcacaaagc aatacaggac tccatcctct tccaacaacc gctcagctcc cacttgtaag 1860 attttttcgt tgaaaatatt attatcattc aattagaaaa ggttatctca cactgagaga 1920 cagttaccct aatctctcaa ccacgaattt atctccaata gcccggcaac cccggagcac 1980 aggggatgaa ttcaaacaac aagtaaattg gttctcgatc tgtgctggct ggcacagacc 2040 tgccaaaaga gatttgaaat taacaatttc actcgttacc taaaaattaa taacttttaa 2100 atactactga aaaagccctt acctttgccg gcaaaagcct actcaggctc agtaagtaac 2160 ttattagtat attaatttca tcatcacaag aagattatct catccacgtc cttttccagt 2220 attgcatctt ttgccgcaga acctcacaca gatccatcga ctgcagaaca acgtcattct 2280 acatccatcg gaagaatcgt gcccgctcca atggatactg taccaagtgc atccaaccgc 2340 tgattcaatc tgacgcaggt caccacttga cctgcacgga gccgaaagtc tcgtgcgccc 2400 actgccacga gcactcggag aatcaagcca aatgtggtca caacgaggta ttctgcccgc 2460 ttcaatttgg caaaaaaccg gaaaaggcca ccggggccgg gaacaactcc aacaaacgcg 2520 ctaggactgc ttcaccaacc gctgtcaagg cagacgctaa gccagaggac tcggaggtta 2580 gtttatttta ttttttcggg aaaaatgaaa caaaaaatcg tttattgcgg gactttacga 2640 gacgtcagct tacacgggct ttccttacag gttgagacca aacgttccaa aacccaaccg 2700 ctcgccactg ttccggcggt acagggtacg gaaatcacaa tagattgagc ccgtcgcctt 2760 tctatttcat catctcacca cacaaggact gtaccccatt gttgatttaa tttattttaa 2820 ttgttgtctc cccgttaccc aattatccaa aaatttgttc atcacccgct aaaaatctca 2880 taatactcgt ctcaccccct ttgttcttaa cagtctctgt gatagccaga ggcggctgtt 2940 ccatcgctat ctttcacccc aataaaccct ttacgagtag ccttatcatt ccttcaattc 3000 gttcaaaaat gagtttaaat actcaataag tatttttatc aattctcagt tgtaaattta 3060 aaaaagggag tgtaaggcac cagcctactt gcaacgaaac ataaatatta atttaatttc 3120 ctgaagcatc tcaaacaatt cggagccgtg ttaatcacct ccagaaactt aaaagatgcc 3180 ctcactatgg aaattataac ataaacaaca aataaattat tatcttttgg aaaagaaaaa 3240 agtagcacat ccgctctttt ttcaaacaaa ataataatca caacagtttc ttgccttttg 3300 aagaagtgag ccaatacagc cgcttcataa aaagaaacaa atatgagcaa cttttccaat 3360 acttctaaag actgttttaa ttaacacgta cacacttaga ctgataggaa aaatgccata 3420 ataaagagac tacaataaaa tctaaaaaag attcttcaaa aatttcttgt gcagtagtga 3480 catgtttctt gtgcagcagt gacatctgac catcttgacg ttttctgcta gggggtttcg 3540 ttcagcatct gcctgtaaat acaccgggtg ccatgctacc tcaaaattca tcctaaaaat 3600 tctttactcc caacagattc agaatcatgg gcgaaagaat tacatgaaat tttgggatgg 3660 cgaacgcacc tttaaacaat cctgatgacc aagttttaaa gctgacgtct atctgctgct 3720 ggcaatacat acacttctaa cacaaacacg aaatcacaaa agaatctact caaataaaaa 3780 gtttcttcga cataagagca gataccatcc gctgctatac aaaaagttta ttcccgaaaa 3840 tgactcaaat aagactgaag aatcgttcga aacatctgaa gacttaaaaa gacattacga 3900 attaactcaa acaagaactt tttatagcta aacccccgga ggacaatcac ctttaattgg 3960 aataagtgct cacatgacac ttacattcaa taaggaattg atccaaagga aaatcatcta 4020 aaatttaaaa atcacttaaa catttttaaa cctaaacccc cggagagaac tcacctaatc 4080 ggaataagtg ctcacatgac acttatatta gttaaaaagt aatcacaagg gaaatcacat 4140 aaaattttaa attatataaa tcctttaaaa ataatttttt tatcacgaag aggttaccgc 4200 ttcaactaca agtttcttat ggtttactca ccacgaataa ttgagatcag aatccaaaga 4260 gcattccaaa gaaatcacaa aatttaaaaa agggttgcga cagcaacctt tgctggttaa 4320 taaaaaactt agtttagttt aaacaaagtt actaggtatt ctggggctag tctacgtttt 4380 cttaactctc gatcattcaa tgataccact gagctgaatc tagaaaaagc tgaaattcga 4440 attcaaagaa tgtttattat accacgtgat cacttagcgc acagtcagag tggcagaatg 4500 acaccgacta tcaagtagac atccgtgctg actacgactt taagacgtcg acatcctatg 4560 accgttagtt tagaacataa acaaacatct ttcaaattca atttcttcct catatcatca 4620 tccgcttcga cacagacgta cgtcaagtca aagtaagaca gtggtaatca ggaacagtag 4680 ttgagaagac agagtaggaa gacaatttga caaaactcca ggatatccaa aaacttcaaa 4740 aaattaaata gtggataagt ccaagagtat agtactagtt agcgtaacaa attctagaac 4800 aaatactttt tcagcggtaa ctccaaagaa aaggttttga ctctttgaaa tttatgcgca 4860 tagcttccac tactactcac aaatccacta attaatcaac atttaaaaaa aaaatccaat 4920 tacctagcct tgatcaagct gggcctcccg tttcgccttg agcgattcac ccttcaaagg 4980 gcctcccgtt tcgccttgag cgattcaccc ttcaatgggc ctcccgtttc gccttgagag 5040 atccaccctt taatgggccc cgtttcgcct tgagcaatcc acccttaaat gggccgccgt 5100 ttcgccttga gcgatccacc cttaaatggg ccgccgtttc gccttgagcg atccaccctt 5160 aaatgggccg ccgtttcgcc ttgagcgatc cacccttgaa tgggccgccg tctcgccttg 5220 agcgattcac taaataagta gcactggtag ccttgtacac ccttccgttc tactttcctt 5280 ttttacttct tcattatcac cttaacctcc tttaaattct aatttctttt aggttactgg 5340 tttaattatt gtcttcggac aatcaccctc atctctccta actatcatga ctctcggatc 5400 ttcccgatct ccatcaccgg gacggcgtcg ccacgcctca gaaggcggca cagcggtccc 5460 tccaactcca gccaattgcg ggaaacccac gaaaaaacgt acacgaggac aggtgtcctt 5520 ggcgacacgt atcgtgggcc ccctcaaaag gcgtatcaac cacaaggttg acgcagccaa 5580 gcgtatcttg gccgagactg aggcaaaaat ggaaattctc ctgaacatgc ctcaggatca 5640 actcgtcagc agcgaagact cgacttacct cgacgcacta ctgattcgtc ttcagacgat 5700 cttggtcgca ctcgaaggta tgcgtgacct catctctgac aaattccgtg acacggaaat 5760 gatggtcgat cccaatcgcc atcaacatca ccaggaggtg ctcgactacc tcgaaaagtc 5820 atcaaccgct agattcgtag atcacctgac tcacgacatc caacagcttg agactgaaat 5880 gagaagccga aacatcccca tcacccactt cgatccatct ctgctagcaa ccaccgacgt 5940 agaaacaggg gccaccaccg aggacgacgc caacgacgag gaaagaagag acattgaggc 6000 aacgatcgag gaccacgctc agaacaatgg gccgtccgat catcgtgtca tctctgatct 6060 tcgaactcct cacggtagca cgccgtctac aggaacccct cgtctatctt caccaggcat 6120 ggtttgggac aacgagggac tgagcctgca cgatgaatta caaatcgcga acttgctcga 6180 ccccgcgaac ccacagagat ctccaatggc tcctgctgct ccaacatcat ctgctgctgc 6240 tcccactcaa tctgccgctg ctccgccgtc atctgccact gctccgatgt catccgctgc 6300 cgcttaccat ctacatggtc aacatggtca acaactcggt gctccagctc ataccaatgg 6360 tggtactact caccgccagc agccgcttga acaaaggatc cagacaacta caaaaggggt 6420 actccaaggg aaacgtgagc agctcaaaca gggcccgacg gagaccccac tcctcgtcca 6480 acacgcccct acaactacgg gcccaggtac gcgaattacg ccacaaagag ttctgaatgc 6540 gcctgccttg caaacacatc aggtcataaa cagccaggtg ggttatccaa gtgcaggagc 6600 tcatgaatac tcggcttttc atccgctcct ccaatatgca ccacctcgcg gtgaagacac 6660 actcacgcac agactcctgg cggctatcga agctattgcc acaagtcaga gtcagatgca 6720 atcagagttg atttcacaag gccgctcagt acacgtgctc accgacagaa tggaagctac 6780 agaaaagtta attgtggagc ccaaaatttt aaatactacc gtcgccgaag agaccagtag 6840 accagctatg ccacagccaa cccatgaaac ggctcagcaa cagcaaacca ctggctccga 6900 gtacgaatat gagtttgatt ccgacgacga caacaaccag caaccgcttc ctcctcaacc 6960 acgtaccgaa atccgatacg tcgaggtgaa gaataatggt agtcacgata ctcaaaactt 7020 attaaagtac ctaggcaaat atgatggtaa ctcaaatatt gactcattcc ttaccgactt 7080 caaagaatcg gtaatggaaa acgaaaattt aaatcaagca aacaaattta tgatcttaaa 7140 aactcaccta ctaggcaaag cgcgtgactg catctctaga gaccacgtaa cggctaaggc 7200 gcttgagaag accatcacgt ctctcaaatc tgtgtttggg aaggatgaga ataagaccag 7260 ccttctagcc caaatccacg ctatcggttt ccctcaatca gacgtgcgag aaatgcgaag 7320 ggcgatcgca aaacattcca ttcttgtcga acaactagtg aactccggtt tagcagcgaa 7380 tgatgagaga acattcacac ctctcacatc tcgcctacca cccgctatcc gcacaagagt 7440 cactcagttc tggggctcga aaggagaaaa tgcgacattc caagaaatct ttgactatgt 7500 cacgacctgc gttgacgaca tggccagaga atcaatcttg gcccttagac atttaccaac 7560 cgctgaaagc gaaaccgaag taggaccact tgggatccca tactcaggtc aaatcaatca 7620 cgccaacgcc accactcaga atcagggaaa ccctaatgga aaaaagacct cgatctcctt 7680 ggccgacaaa ccggtataca aacgagaaga tcacccgaag acctactacg acagtaacac 7740 gggggaatct ttaccagggt acaacgctcc ggggaagcaa ggaccagtgc tccgcttact 7800 cccgagaacg ttccccctat acgaaggtac caccaaaaag acttgcaaag catgcaaagg 7860 gagtcatcac acattgagat gtacgctctc tagcaaagac ttcaggcaag ccttaaacgc 7920 atctcgtcta tgcccaatct gcacgggcta ccacagcgta gagcagtgca gatgcttgat 7980 gaaatgcatt ctctgtcaag ggttgcatca cacgggaggg tgtacgctga aagagtatta 8040 tagggacttg aaaaactatc cgccaacggc tccgccgccc cagcagtttt ttcgagctag 8100 ccgggaaggc cggcatttgg gacaaaacca gcaataagcg gttttgggca ggcaggggga 8160 caaatgtacc gaggcccctg tcaacagctg ttttgccgcc agcctcaaat gtttccgatt 8220 cttttcatac ccacagtact ctttctccaa catccgctat ccccactatc aacccgctcc 8280 taacctttct tccaacgttt tccgattctg cacactttga gattacttct accaatattt 8340 acaatggtag aagaatcgat atgattcttg gtaacgatct actggcatgg cttaacgcta 8400 acccagagac caagaaacat attcttccgt ctgggaggct ggtagaaatc acggacttcg 8460 gacacattgt acatccagtc cctgataaaa caatctacca aaatcacacc cagatcgaag 8520 tcactagtga aacattcatg catgcatccg ctctcatcaa cggccccaac ccagaagacc 8580 cgaatctagc cctaacactt caggtagaac aacagtggaa attagaaaac attggcatcg 8640 aggcacagcc cttgaatgac catactacaa catccgctaa agatctacag gcatcattcg 8700 agaacaccct aaggtataca ccagaaggga ttctcgaagt tgcctttcca ctaaatggaa 8760 atgaagtacg cttgaaagac aactatgagg tggccgtgaa aagattacac gctacggtta 8820 acgctctcaa aaattccaaa aatccaaatt tacttaaaca atacgacgaa atttttaaaa 8880 cccaggaagc aagtgggatc atcgaatctg ttaccccaaa catgaaatta gagactaagt 8940 acaactataa catgcctcac agagccgtga tcaaagagtc ctcaaacacc accaaagtgc 9000 gggtcgtcta cgacgcatcc agccacgcag taggtcaatt gagtctcaat gacgtggtcc 9060 atgcgggagc aaacatggtt attcctcttt tcggcatcct catccgctca cggttcatta 9120 aactgatgat cgtaggagat cttgaaaaag cctttcatca ggttcaagta cagcccgaat 9180 ttaggaactt aactctattt ctatggctaa cagatttaaa taagccgatt acgagggata 9240 atatttgcac caaacggttt gtgcgcctac catttggcat gagttgtagt cccaatttat 9300 tggcttcaac gattgtacat ttcttggtac acaatccaga cgagctgaac aacgacatcc 9360 ttgacaactt gtacgtcgac aatattctga ttgggacaaa tgatctagcc ttaatcatga 9420 atcgcatcac ccgcttgaaa caaatcttct cacatatgaa gatgaacatt cgggaatttg 9480 tggtaaacca tgatgagagt atggagaaaa tagacccaaa agacagagta tccgctagaa 9540 caatcaagtt actaggaatg aagtggaatt caagcccaga cgcagatacg tacaccatca 9600 aaatcgccga tgtccaaacc atcatgcatc ccaccaaaag agacgttgct tctaaaatgg 9660 ctgaaacctt cgacccattg ggactaatct caccgattca agtctcaatg aaaaggctta 9720 ttcaaaagct gtggtctcac gaagtaaact ggaaagatcc tatccctaaa cacctgctag 9780 atgactggca agcaatccaa gcttccttca tagatcgtac aataacggta ccgagaagac 9840 tgacaactga cttcgaatac aaagacatac agttactgat ctcctcggac gcttctcaag 9900 acatctatgc agcagccgcg tacgtctact tctcgtatgg tgacgataga cctccagtga 9960 tctccttgat cacaagcaag aataagatta agccatctag ggagacgaat tggacgatcc 10020 caaaattaga attgttagga atcgagattg gctcaaatct ggcttcttcc atcgtgaagg 10080 agcttcgttg taaagtaacc aacatccgct tattcactga tagctcttgc gctctgtact 10140 ggattttgtc gaaaaagaac acacgtgtct gggtggcaaa tcgaattgac caaattcatc 10200 tcaatcaaac acgaatgagt gaatgcggca tcgatacttc cattcaccac tgtccgacaa 10260 aagacaatcc cgcagacatc gcgactagag gcatgagcac atcagaactc cagaatagtg 10320 acttgtggtt caacggaccg gaatttctta aacagaaacc ggaagattgg ccatgtaaga 10380 ttgaaggaac attcacgtgc ccagcggaat tccaggcagt ggtattcgca gagattcttg 10440 acccaaaaac caagaagacg aagaaacctc tcatggaaaa agctgaaaaa ccacccgctt 10500 ccgaaacagt cttacatatt cttgaattac cttccaaatt cgaatcgatc atctcctttc 10560 ggtacaccaa ctctctacga aaattaatgt tagtcaccta taggaccctc ttagctatca 10620 gtaaaatgcg taagggaaaa gttccgactt cttggattct ggagaagttt atgatggcac 10680 ccaatcttct cgagaagaga cgggtcgcac gacattatat cttccttcag cattacaaag 10740 aatgcgcgga gcaagggctc acatttccat catcattgag gtactatgtt gcacctgacg 10800 ggctatatcg tgtgctgaag caagctaagt caccgactct gcccgctgaa gctaacgagc 10860 caattctggt tcacccgaaa catccgcttg ctaatcttct aatgctcgaa actcacgaga 10920 tcaacggtca tttaccggaa cagtacacta gggcagctct gcgcacaaga tactggctac 10980 cgaacgacag ctcagtagct cgatctgtga tcagcaaatg cattcagtgc aaaaaggttc 11040 atggattacc ctttccatac ccgcactcta tgactttgcc tgaaagcaga acaacgcctt 11100 cgaccccgtt ccaaaatgcc ggtcttgatt acatggggcc tgtagaatat agtaaagacg 11160 acggagtttc aaccggaaaa gcctacgtgc taatctatac gtgtttcacc actagggcga 11220 ccatcttaag agtagtgtct gacggtagta cggagagatt cattatggct ctaaaaacca 11280 tctttcatca ggtgggagtc ccaaaaatgg tctattctga caacgctcca gcatttatct 11340 taggtggatc aattctaaac gatgatatct caacgtggga acatcattcc gaccctctga 11400 catcctttat ggccacgcaa tccatccact ttttccgcat cacccccgtg tctccatggc 11460 aaggcgggat gtatgagaga attgtaggtt tggtgaagca tcagatcctt aaagtatgcg 11520 gagcagacag gttcgactac ttcacgttgt catacattgt ttctagtgct caagctatgg 11580 taaataaccg cccgctcatg caacactcac gccaaccgga tgatatgatt gccatcagac 11640 catgcgattt cctgaacccg ggagtcatga tcgagacgcc acccaccgag ttcacaccat 11700 ccgctcccag cggtgtcccc gaacaaagag tacgcgctca tcttgcaagc ctggaagaaa 11760 ccatcgagct tctctggaaa tactggtcgc taggttacat aatcaatctc agacagaacc 11820 atcacagaaa tgtgaggtgc gctgatctga aaccaacggt cggacaagta gtgctggtaa 11880 acaccaacct agtcaagcga cagaattggc ctctaggagt aatcgtacaa gtcaataggt 11940 cagaaaggac ggatgaaatc cgtacagcag tggtaaaatg caaggggaag ttgtacaaaa 12000 ggtccgtatg ccaactcatt ccactcgaag tccaatcctc ggatatggac agtcttcctg 12060 atacggaaaa tcgggaagat ggtcaagagt gtttgatgga tgcaggaatg accgttcaac 12120 atccgtcggc tccccctttg actattccat ccgctgccct ttttgactca cccgacgaac 12180 attactctcc ggagcttttt cctcgtgaga cttgcccaaa cgtaactgaa gctactgaga 12240 acccttcacc aaaaattcaa aataatacca tgatcccctt agtccccaat acttctacta 12300 ttcaaaacgc ccgcttagat ttacacgaaa gggtaggaga agtagataat ttcgaaaacc 12360 ccgacttaga ccaagtccac gttgacagta aggacgaagt tgagtaccag gacccgagca 12420 caaccgagga attgcccaca gcaattcctg gtcgctcaag acctattcta ccacgaaggg 12480 tcaagaagcc tgtgtactac aactacttcc ttcacacaac gacggccgtc acgtcgacgt 12540 tttcaacccc ggag 12554 // ID IR3_CE repbase; DNA; CEL; 578 BP. XX AC U86948; XX DT 07-SEP-1999 (Rel. 4.08, Created) DT 07-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE Putative nonautonomous DNA transposon, consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; IR-3; IR3_CE; KW nonautonomous DNA transposon; TIR; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-578 RA Devine E.S., Chissoe L.S., Eby Y., Wilson K.R. and Boeke D.J.; RT "A transposon-based strategy for sequencing repetitive DNA in RT eukaryotic genomes."; RL Unpublished. XX RN [2] RP 1-578 RA Devine E.S.; RT "IR3_CE."; RL Direct Submission to Genbank (21-JAN-1997)Molecular Biology and RL Genetics, Johns Hopkins University School of Medicine, 725 N. RL Wolfe Street, Baltimore, MD 21205, USA. XX DR GenBank; U86948; Positions 1 578. XX CC IR3_CE copies are flanked by 2 bp-long target-site duplications; CC IR3_CE has 275 bp-long terminal inverted repeats and forms CC palindrome. CC About 140 copies of IR3_CE are dispersed in the genome. XX SQ Sequence 578 BP; 199 A; 88 C; 93 G; 198 T; 0 other; aaggtggtgt agtcaaattt ttttattgct ttattagact cagaattgtc tgaaaacacc 60 gaatttcata atgaatcttc ttgaaaactt ttcaaaaaaa agttatggcg gctcgaaaaa 120 tggcctaaaa ttagtaaaat tttgaaattt ggccgactta tcattgtcgc agcggctgga 180 aacaattttt ttgaaattgt cgtcaaattt taattataga tgtcgattat cttgcgtttt 240 cagctttatt aaggtattta aaagtcgatg gacgaagaga attggtaaaa aaaattgaca 300 aatctcttcg tccatcgact tttaaatacc ttaataaagc tgaaaacgca atataattga 360 cctgtatacc caaaattaga cgacaatttt aaaaaaattg tttccggtcg ccgcgacaat 420 gataagttgg tcaaatttca cattttcact aatttagcgc catttttcaa gaggccatat 480 attttttttg aaaagttttc aagaagtttc atcatgaaat tcggtgtttt cagacaattt 540 tgagtctaat aaagaaaaaa attcgactgc accacctt 578 // ID CEMUDR1 repbase; DNA; CEL; 7227 BP. XX AC . XX DT 31-OCT-2000 (Rel. 5.09, Created) DT 31-OCT-2000 (Rel. 5.09, Last updated, Version 1) XX DE CEMUDR1 is an autonomous DNA transposon - a consensus. XX KW MuDR; DNA transposon; Transposable Element; 9-bp TSD; CEMUDR1; KW LONGPAL; MUDR superfamily; PAL9A_CE; Autonomous DNA transposon; KW transposase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-7227 RA Kapitonov V.V. and Jurka J.; RT "CEMUDR1."; RL Direct Submission to Repbase Update (OCT-2000). XX DR [1] (Consensus) XX CC CEMUDR1 is an autonomous DNA transposon; there are several CC copies of CEMUDR1 in the genome; they are ~99% identical CC with the consensus sequence and are flanked by duplications of CC 9-bp target sites. CEMUDR1 has 250-bp identical terminal CC inverted repeats. It caries copies of two different CEREP46B-like CC nonautonomous transposons at positions 2115-2575 and 4277-4659. CC CEMUDR1 encodes CEMUDR1p, a 656-aa MUDR-like transposase. CC CEMUDR1p is encoded by 9 exons (based on prediction by Gene CC Finder, http://dot.imgen.bcm.tmc.edu:9331/gene-finder/gf.html). CC CDS join(1477..1803,2602..2731,2825..3056,3326..3455, CC 3782..4043,4120..4257,4684..4879,5177..5461, CC 5579-5849) CC CEMUDR1p: CC "MTGKRSSTERGKSFSEFSRLYSVVSNFSYVAADEITPDILQKLEHQSRLTVRVPLKGNDA CC SPFDGHIWFKSATQETTKHAPRYENAKIFGEIIKSRRCTDFENPFFTRHLHNLHHNANGD CC VVMCKTGPKAITTQDMIEDAKRTHGDRLSYTVAIKIFLKSCPEESRVTEWRDEIEDICQM CC APQARKVAIEELLAKTDDGVFFPSRLHVDTTYNLGDAYVTTLLGETNTFISTVLVQALDV CC RVIRCDLHIYSLMIHKFKRPKADKAKPFVFGVQEKCTKTYGSGKFTTSSLFKIVTLLFRY CC DNKKSMFFECASAKAKLMAGHLRQFSTNNISENFNRAIKVTIGKQLTLDKLIVKLNERCE CC VGHSEYVIVSVDLQTMSMDEKLQWFADAGITGGMLPALDVPYQLLSEVNLERSRQEAEPS CC GEIDIYQRDTDIFNLRDDSGTGPKYASVELTQGMLICNKCNSEKLDFICRHVLTVYKDDP CC KSNGQDDSANSRTTLTESENEVNFDNETTSLFSPQSTFTPIFSSTRLEERQLVERELKDN CC HAEEIQLDADKADESQLNDSRRSARNRLLRFDEIIDLWDIELYTNTTESLTFRATLGTEN CC NYSVRPDTGTVLKSRILKIRLLNDDAPNDVLTVFYKTKGALREKSFNIELRNEVEL" CC There are several CEMUDR1-like families in the genome; CC PAL9A_CE, LONGPAL1 and LONGPAL2 are nonautonomous elements CC transposed by CEMUDR1p-like transposases. XX SQ Sequence 7227 BP; 2260 A; 1415 C; 1365 G; 2187 T; 0 other; ggcaaacact ttttgactac tggatttttg actacattgc tttttgacta cgagtgtctt 60 tttcgctact gaccgaaaaa atagctacat tcgcactttt gtctttttga ctactgaatg 120 tctttttcgc tactgactga aaaatagcta cattcgcact tttgtctttt tgactactga 180 atgtcttttt cgctactgac tgaaaaatag ctacattcgc acttttgact ttttgactac 240 cacataaaaa ccaaggcgcg ctccgacgcg ccttgactgg cgttttccaa aactcaaaga 300 tagtttgtag aagttattca tcattacacc cagttttttc gaaagtttta tcagatattc 360 aacaacttgg ctcttaaaat ctaaaattga agcaaacgac aataatacaa atgattattt 420 tgaagaaatt tatatcaaaa atgttgtttt catgctttat tcaatagagt cgagaaacga 480 tatctttctg cacaaagtcc cactttgcga gctcggctct tgttgcaaca tttcacaaat 540 ctgaattata tttttgatag tggagattct tgggacaaca cctcttaaag tctgctgatt 600 tttttacact tatatttcga aacattttgt ttttgtctac attttccatt ctttttctgg 660 ttttttattg aaattggttc agatcgaggc ccattccata gctttttttc gttaatttaa 720 acaaaatttt gagaaaacga gcgacttcca gaaatttgtg catgaaatcc cagaaaatct 780 gagtcttcat gactttcatc gaaacttttc gaaaatacac tccgtgtttg gctccagaac 840 tgttcaaaac ggaaagtgaa tcaatatttt aaattattac tgcaaatagt tcaaatttcc 900 agacttccgc taatgaatga gcctcgtttc cgtacaattt caataaaaat ccaaattttt 960 tgttgaattc tccacctgaa aaacagttat tataatttaa tggtcagaaa catatttaaa 1020 cttcatcatt ttaccctttt ccaataaaaa tatcgttttc cgcgtgaaaa aataacagat 1080 atctgcgcag ataaaagttt tttgtctagc aaaaattaaa tttccaagaa atgcgccagc 1140 acgcgaagtg cgcatcgctg aaacgcgcaa gcaatctccc tcccccttcg tttccacccg 1200 aggagaccgt gcgtggcgat aaatcacttt tttctcaatg atggattttt cgctacatgt 1260 ctttttcgct actcttccct gcgagacccg tccatcgtgt agtcctcttg aaggaaatca 1320 ttatgtttct tctgcaattt ttccaaagcg ctccgcattt tcgcaatttc cgctgtcatt 1380 ttttaaaatt attctaataa ttttttttaa aaattattct aataattttt ttagcggaat 1440 aaatcagtat ggatccgcca aaatggctct gcagcaatga ctgggaaaag atcatcaacc 1500 gaaagaggta agagtttttc ggagttttca cgattgtatt cagtagtttc aaatttcagc 1560 tacgtcgctg cggatgaaat cacacccgac attctccaga aacttgagca tcagagcagg 1620 ctgacagtcc gcgttccatt gaaagggaat gatgcttcgc cattcgacgg tcatatctgg 1680 tttaagagcg ccacccaaga aactactaaa cacgcaccgc gttatgagaa cgccaaaatc 1740 tttggtgaaa tcatcaagag taggagatgc acggactttg agaatccgtt tttcaccaga 1800 cacgtaagtt tgtttgaaaa cttcacggcg aagacttatt gattcttagg ttctcccaca 1860 caaggaacac gtttatatct attactctgt gatggaagga cgatacaaac gtccatctga 1920 tccaaaccgg aaaagacgat tgttaggtga gaaagaacaa aagtaggata caactgtaat 1980 aactattttc agctgaagac cggaaaactc ttctcactac agtcatgcca caccgcacta 2040 tccgtggagc taaagtagca gcaaaggatt taggtatcga agtctcatca aagcaggttt 2100 gaaaactgtt tcggttaaag gtggagtagc gtcgttgggg atttttcttt aaaatactga 2160 gaatggcaca aaatatcgaa ttatcacaaa aatgaacgaa gaaaattgag taagtatgaa 2220 aaaattcgat cggaaaatag cgattttcaa aactctcgct aatctatttt ttgaaatacc 2280 gcgctttttc catacggtaa acaacaaacc tcacccattt tggtgcaata acaacttttc 2340 ggaatgtgtg tttgcacgaa aatgggcggg gtttgttgtt tacgttgttt accgtatgga 2400 aaaggcgcgg tatttcaaaa aatagattag cgagagtttt gaaaatcact attttccgat 2460 cgaatttttt cattcttact caattttctt cgttcctttt ttgtgataat tcgatatttt 2520 gtgccattct cagtatttta aagcaaaatc ctcaacgacg ctactccacc tttaaatgca 2580 aataatttta ttctgtttca gttgcataac cttcatcata atgcgaacgg cgacgtcgtc 2640 atgtgtaaaa cagggcctaa ggcaatcacg acacaggata tgatcgaaga cgcaaagcgc 2700 acacacggag acaggctgag ctacaccgtc ggtgtgtgaa aattaccttg catatgttga 2760 gaaatatgat tttcagctga tggaggagta ctcgacgtta ctttctataa atctttccct 2820 gaagcgatca agattttctt gaaaagttgc cctgaagaat ctcgcgttac cgagtggcgt 2880 gatgaaatcg aagatatttg ccaaatggct ccacaagcac gaaaggtagc aatcgaagaa 2940 ttacttgcaa aaacggacga tggagttttt ttcccttcga ggttgcacgt ggacaccacg 3000 tacaacctcg gggatgccta cgtgacgaca cttctcggag aaacaaacac atttatgtag 3060 gtttcaaaag aataaaggca acaataaacc cattatttta ggacaacgag atcaaggaag 3120 aatagagtct taccgttgtg ttatatgatt cacacttctc gggctactga agtccaccag 3180 ggatttgcct cttttattgt tgaagaattt agaaaggcag gaggagttcc aagtgggaaa 3240 atcatcccat gccttctaat ggatggcgaa ttcagtttaa ctgaatacag caaagtgagt 3300 cttccttttc caatttaata aaaagaagta ccgtacttgt tcaggcgttg gacgtgagag 3360 ttatacgatg tgatctgcac atctattcgt tgatgattca caagttcaaa agaccaaagg 3420 cggacaaggc gaagcctttt gtattcggag ttcaggtaaa attatatgtt ttttctccga 3480 gagggctaca cgtgcagaaa gtcctccgaa taagcttagc aaaagggtca cgtggcgata 3540 aatcaccttt ttctcaatga tggatttttc gctacatgtc tttttcgcta ctcttccctg 3600 cgagacccgt ccatcgtgta gtcctcttga aggaaatcat tatgtttcga gaaagcgaga 3660 agaagagttc ttcttttaat tatttaaaca ttgttaattc agcgagatgg aaagtggaaa 3720 acaggtcttc tcggagtctt cacgatagat gcgtacaatg agcgtctttc cattatgaaa 3780 ggagaagtgc acgaagacgt atggaagtgg taaattcacg acttcatccc tcttcaaaat 3840 cgtaacttta ttattcaggt acgacaataa aaaatcgatg ttcttcgaat gtgcgtcggc 3900 gaaagccaaa ctgatggctg gccatttgag acaattctca actaataaca tctcggagaa 3960 tttcaatcga gccattaaag tcacaatcgg caagcaactg acgctggaca aactgattgt 4020 aaagctcaac gagcgatgcg aaggtgagaa ataccaaagt gagaaaaatt tgagattttt 4080 ccatttcaga atcactcaac gagtgtttct ggagcacagt gggacactcc gagtacgtga 4140 tcgtgtcagt tgatctacag actatgtcca tggacgaaaa acttcagtgg tttgccgatg 4200 caggaattac tggaggaatg cttccagcgt tagatgttcc ataccaacta ctgtcaggta 4260 acacacaatt acagttttaa aggtggagta gcgtcgatgg ggaatttgtt ttaaaatacg 4320 cagaaggtgc caaaagtata agttataatg aagaaagaac gaaaaaattt cattaagttt 4380 gaaaaaaatc ggactgaata ttgctaattc tcgctaccaa atttgaatgg ccgccaaaat 4440 gaacttagaa ttttctgttt ttctggaaag cacgaaaaac aacaaattca aaattcattt 4500 tggcggcaat tcaaaattgg tagcgagaat tagcaatatt cagtccgatt ttttcaaact 4560 taatgaaatt ttttcgttct tttttcatta taacttatcc ttttggcacc ttctgcgtat 4620 tttaaaacaa attccccatc gacgctactc cacctttaag tcaaaatagg ttcccgtttg 4680 tagaagtcaa cttggaacga agccgtcaag aagcggagcc cagtggagaa atcgacattt 4740 atcaacgcga cacagacatt ttcaatctcc gagatgactc aggtaccggc ccaaaatatg 4800 catcggtgga gctcacgcag ggaatgctga tctgcaacaa atgtaactct gaaaagctgg 4860 attttatttg ccgccatgtg taagtgtagt cttcaagtct ccaagaacat aaacaatttt 4920 agtttgcgct gtctgaagga tctttctgac acggatagaa tgattcaatg gcataaaatg 4980 cttactttta tcaaaaagca gaagaaagag gaagctcttc aatcacaagg acccagaatc 5040 gacccaaaaa gtggaatgaa gctgtccgac agaattggaa agaagaattc cgccaacaac 5100 cggaaaagaa aaatggttga agtttccacg tacgtttctc ccagtcattc gtgaattttt 5160 aatacgcgaa ttacaggttg acagtataca aggatgaccc gaaatcaaac ggccaagacg 5220 attccgctaa ctcaagaact acactcaccg agagcgaaaa cgaagtgaat tttgacaatg 5280 aaaccacttc gctgttctcg cctcaaagca cattcacgcc tattttttcg tcaacgcgcc 5340 tcgaagaaag gcaacttgtg gaaagggagc tcaaggacaa ccatgccgaa gagattcaac 5400 tcgacgctga caaggccgac gagagccagc tcaacgacag tcgaaggtcg gctagaaatc 5460 ggtaagtcga tcacacattg ttttcttctt taactcttac gtttagtacc agttcaacaa 5520 ttttcgaccc gtcgccaact caaccaaacg atgatttcga aatgtctccg agccatagac 5580 tattgcgttt tgacgaaatc attgatctgt gggatatcga actgtacacc aatacgactg 5640 agagcctcac cttccgggcc acactgggaa ctgagaacaa ttattctgtc agacctgata 5700 ccggaactgt gttgaaaagc cgcatattga aaattcggct cctcaacgat gatgctccaa 5760 atgatgtttt gacagtcttc tacaagacga aaggtgcatt gcgtgagaaa tcattcaaca 5820 ttgagcttcg aaacgaagtc gagctttaat catatttctg acgaacttac cgtcattttt 5880 tttgttgttg ttttgaccct atttttaatt aatctgtttt ttttcttttt gttaagcatg 5940 ttgtagattt ctcaagttgt taagttgtta accgagtttt agttccatta attatattca 6000 ttcccagttt ttttttcgag acggttcaaa aagcctttag cattataccg gtatttcaca 6060 acgccatctc cccattcctt tgacattcaa ttttaaaatt ccccactcaa tctttcattc 6120 gaaaattaag aaatgttcca actagtttgt ttttttttgt ttttttaaag gcagttcttt 6180 gtttttttat tgttctagat gttgttagtt ttttagattt tttgtatttt ttgatgttca 6240 gaaagacaat ctgacttgtt ttgtttccca cccataacca tctctttaat tttttggttt 6300 atcatatttg ttgatctcca cattccctaa tttaccggtt tcacttcact cattcccctc 6360 gtctttatgt ttcctagttt ttatccccat ttattgtaag ggtttttcta gatgaatcgg 6420 ataaaaaaaa actcatcttt atagtaaata aagccaaaag agctcgaaat ttggtattat 6480 gctctttttg gagcacgaaa tccgcagaaa aaaattttgc cagcataaga atatttttca 6540 aaacgggcca attttcttgt attcttatgc tgacttcttt gtcagcataa gaagatttca 6600 atgtactggg cgccagcata agaagaaacg acagcataag aatagacaaa aaaccattga 6660 cagcataaga agacgtcagc ataagaatat gacagcatag gaatattggc ccactttatg 6720 tttacagaca aatgaatggg cacaattcaa aagatcaggg ctgaatttta tgtggtagtc 6780 aaaaagtcaa aagtgcgaat gtagctattt ttcagtcagt agcgaaaaag acattcagta 6840 gtcaaaaaga caaaagtgcg aatgtagcta ttttttcagt cagtagcgaa aaagacattc 6900 agtagtcaaa aagacaaaag tgcgaatgta gctatttttt cggtcagtag cgaaaaagac 6960 actcgtagtc aaaaagcaat gtagtcaaaa atccagtagt caaaaagtgt ttgccgctat 7020 ttttcagtca gtagcgaaaa agacattcag tagtcaaaaa gacaaaagtg cgaatgtagc 7080 tattttttca gtcagtagcg aaaaagacat tcagtagtca aaaagacaaa agtgcgaatg 7140 tagctatttt ttcggtcagt agcgaaaaag acactcgtag tcaaaaagca atgtagtcaa 7200 aaatccagta gtcaaaaagt gtttgcc 7227 // ID CeRep59 repbase; DNA; CEL; 93 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE C. elegans repeat 59. XX KW SAT; Satellite; Simple Repeat; C. elegans; CeRep59; KW satellite repeat. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-93 RA Jekosch K.; RT "C. elegans repeat 59."; RL Direct Submission to Repbase Update (25-APR-2002). XX DR [1] (Consensus) XX CC Consensus, found 671 times in the C.elegans genome. CC For further analysis compare CC http://www.sanger.ac.uk/Projects/C_elegans/repeats. XX SQ Sequence 93 BP; 31 A; 17 C; 17 G; 28 T; 0 other; tagggcatcg aaaagcaccc aatatttaga gaacagaaga ttttgagaat tactgcctcc 60 agaaattgat gattttccca ttgatttgtc tac 93 // ID CER15-1-LTR_CE repbase; DNA; CEL; 453 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER15-1-LTR_CE is a long terminal repeat of the CER15-1_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER15-1-I_CE; KW CER15-1-LTR_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER15-1_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (MAY-2003). XX DR [3] (Consensus) XX SQ Sequence 453 BP; 96 A; 108 C; 94 G; 155 T; 0 other; tgtcgcgaac tcagtgaata ataaatattc ccttcttcac gactttcaca tttgtatctt 60 ttcccgttta attacacaat tatctctgtt aaaagtcgtt atcttcatcg ccttcacacc 120 acgagacgcc ccgggaccgt ttgaattgcc gccatctacc gtactccacc tgcacatctt 180 accacccacg aaggaaacgc cgccggcggt cggacgcggt cgacacggag ttctgatggt 240 tgtcgctttg gagttctttt aattgttgtt ttggttttgg ttaattgatt catcccgtcc 300 cttgataatt ggcctttgtg tgatttgtgt tgctgattga ctaatcagca acacggagat 360 aaatgatgac ttagcacata cctgttctgt tatttatttg ggttttcagg gtgtttattt 420 tgggccgttt tctggagcta ccgctccgca aaa 453 // ID TIR21T5A_CE repbase; DNA; CEL; 166 BP. XX AC . XX DT 08-SEP-1999 (Rel. 4.08, Created) DT 08-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE TIR21T5A_CE is a putative non-autonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; MITE; KW TIR21T5A_CE; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-166 RA Kapitonov V.V. and Jurka J.; RT "TIR21T5A_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC TIR21T5A_CE is flanked by 21 bp-long terminal inverted repeat. CC There are about ~100 copies of TIR21T5A_CE in the genome; they CC are CC ~98% identical to the consensus sequence. CC TIR21T5A_CE generates 5-6 bp long duplication of target-sites CC upon CC its integration in the genome. XX SQ Sequence 166 BP; 53 A; 37 C; 39 G; 37 T; 0 other; cacgtgtatt cagctcgacc aacgcctcga aaattttcaa aaaaggcggg aaaaaatatt 60 tgaattcgcc aagaggaatt tcaccgcagc gcgtgcaaaa atgtctgcat ttgcgcgtga 120 cggtgtttgc acaaattaca ccgaatggtc gagctgaaaa cacgtg 166 // ID LONGPAL3 repbase; DNA; CEL; 1299 BP. XX AC . XX DT 27-OCT-2000 (Rel. 5.09, Created) DT 27-OCT-2000 (Rel. 5.09, Last updated, Version 1) XX DE LONGPAL3 is a nonautonomous DNA transposon - a consensus. XX KW MuDR; DNA transposon; Transposable Element; Nonautonomous; KW 9-bp TSD; LONGPAL1; LONGPAL2; LONGPAL3; MUDR superfamily; KW PAL9A_CE; TIRs; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1299 RA Kapitonov V.V. and Jurka J.; RT "LONGPAL3."; RL Direct Submission to Repbase Update (OCT-2000). XX DR [1] (Consensus) XX CC LONGPAL3 is a non-autonomous DNA transposon; there are several CC copies of LONGPAL3 in the genome; they are 99% identical CC with the consensus sequence and are flanked by duplications of CC 9-bp target sites. LONGPAL3 forms almost perfect palindrome. CC Transpositions of LONGPAL3-like elements were mediated by CC MUDR-like transposases. XX SQ Sequence 1299 BP; 447 A; 212 C; 211 G; 427 T; 2 other; ggcaaatact gttttactac ttgatctttt actacgcatc tcttttacta caggatctct 60 tttactacgg aatctctttt actacgaatt ccacatctag atcttttact acaatcggct 120 cttttactac ggaatctctt ttactacggt ttaccaacat agctctttta ctacagtacg 180 aaaattttca atttaagaat taaattttca acaacattat gccggcaaac actcattgac 240 taccttctca ttgactacct ttgctcattg actacctttt gctcattgac tacctttgct 300 nattgactac cttcaaatgc tctttgacta cctctatcta ataaaatttt ccaaaatttt 360 ttttgaaaac ttttcaaaat ttgatcagaa aaatgcaatg tttgtagttt ttctagtttg 420 gatgttctgc acacgattgc agacgaaaac ctgaggttta gtagaacacc taagctcaaa 480 aaacgcactc acttcggatg ctgatatctc cgtggaaaaa ttttttatga caaagtgatc 540 aactacaaag ttgtttatct taatctaaag tacaactttg tagttaatcg ttttttatca 600 aaaaatttgt tggttgagat atgaacagaa aaagaagaag aaaatacaag catgaaggtg 660 ccgaacttta cacaacttta tctcgaccaa caaatttttt gataaaaaac aatcaactac 720 aaagttgtac tttagattaa gataaacaac tttgtagttg atcactttgt cataaaaaat 780 ttttccacgg agatatcagc atccgaagtg agtgcgtttt ttgagcttag gtgttctact 840 aaacctcagg ttttcgtctg caatcgtgtg cagaacatcc aaactagaaa aactacaaac 900 attgcatttt tctgatcaaa ttttgaaaag ttttcaaaaa aaattttgga aaattttatt 960 agatagaggt agtcaaagag catttgaagg tagtcaatka gcaaaggtag tcaatgagca 1020 aaaggtagtc aatgagcaaa ggtagtcaat gagaaggtag tcaatgagtg tttgccggca 1080 taatgttgtt gaaaatttaa ttcttaaatt gaaaattttc gtactgtagt aaaagagcta 1140 tgttggtaaa ccgtagtaaa agagattccg tagtaaaaga tccaattgta gtaaaagatc 1200 tagatgtgga attcgtagta aaagagattc cgtagtaaaa gagatcctgt agtaaaagag 1260 atgcgtagta aaagatcaag tagtaaaaca gtatttgcc 1299 // ID IR4 repbase; DNA; CEL; 227 BP. XX AC U86949; XX DT 09-AUG-1999 (Rel. 4.07, Created) DT 09-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE Caenorhabditis elegans IR-4 repetitive element, consensus DE sequence. XX KW IR4; Repetitive element. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-227 RA Devine E.S., Chissoe L.S., Eby Y., Wilson K.R. and Boeke D.J.; RT "A transposon-based strategy for sequencing repetitive DNA in RT eukaryotic genomes."; RL Unpublished. XX RN [2] RP 1-227 RA Devine E.S.; RT "IR4."; RL Direct Submission to Genbank (21-JAN-1997)Molecular Biology and RL Genetics, Johns Hopkins University School of Medicine, 725 N. RL Wolfe Street, Baltimore, MD 21205, USA. XX DR GenBank; U86949; Positions 1 227. XX SQ Sequence 227 BP; 59 A; 56 C; 58 G; 54 T; 0 other; tattaccggt aacagagagt gtagatagtt agagagtgcc agacatccgg gacccaatgg 60 ggcggggcgc gcggaagaga cgattttgtg tcgatttacg aaatgatgac aacgaggaaa 120 atttctttcg taaatcgaca caaaatcgtc tcttccgcgc gccccgcccc attgggtccc 180 ggatgtctgg cactctctaa ctatctacac tctctgtacc ggtaata 227 // ID CER15-I_CE repbase; DNA; CEL; 10798 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER15-I_CE is an internal portion of the CER15_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER15-I_CE; KW CER15-LTR_CE; CER15_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-10798 RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RP 1-10798 RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RP 1-10798 RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER15_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 1386..1610 CC /note="predicted Gag region" CC gene 9318..9931 CC /note="predicted Integrase region" CC gene 8288..8698 CC /note="predicted RNase region" CC gene 7310..7975 CC /note="predicted Reverse Transcriptase region" CC primer_bind 4..18 CC /note="predicted Primer binding Site (Gly-TCC type)". XX SQ Sequence 10798 BP; 3951 A; 1873 C; 2297 G; 2677 T; 0 other; aaatggtgca tcgaaccaaa tcaaactgtt tttccctttt atcaacttaa aaatatatcg 60 tttatctcaa aaaagaatcc aaagaccttt ccggatgacg ttccggcgag gcaattcact 120 gaaaaattgt attgggcggc acaaagccac gcaagacaat attcaagtcg actgaatgat 180 aaaattcatt tggaacacac tcacggaacc acagattgtt gaaaacccca aaacgggccg 240 gggttctctt caaaaaatca aaattttcgt taaaataaca attttattga aaaaatcgat 300 acatttcgct tgaatattgc cgtgaaattc cgtcaattcc ttttaaactt taaaaatagt 360 taatatcaac aatatttcta tttccagaat gtccggagat ccaatcgatc aagcatggtc 420 agatttcgag aaggaaactg acaaagaagc ggaaactgtg gaagaagtcg gagacgaagg 480 agaaatttcg ccgttaaaga ctcaggagaa cgatgagctc atcgaggagg acgatgatga 540 cgatatcatc atcacgaaag tgactggaga aaaagctgat aaaggcaatt tggagggact 600 attggctcag attgctcagc aaaactcgtc gtcgaaggac atcgtgcagc tctgcaacga 660 gctcagtgag tgaaaatcaa taaaattatt agttatcaaa ttttgctcca aaaatgagaa 720 ataatttcgc gaaattcatc aaaaactgat agaaatagat tttatcaaat aattatcgaa 780 tttcgattta ataaaattat attatttcca gagaagctgc catcggcgcc attcggagtc 840 ccgttattgg atagtggctc agccactgct ggaactcaac tggtcatgtt tggatctctg 900 gtgaaactcc ttgagagctc aaagcaaacg accagcgaca acaatactgc cgtcggtcta 960 ataaaatcga tcaacggatc actcaagtcg attgtggacg gagagaaagc agcgaacaat 1020 gcttggcgag agtctactat tgtgatgctg aaaaaagagt tgagaaagct ggaattggct 1080 cggctcgagg ataaggagac ccagaaggac aattggtccc gcttccacga gtgggctggc 1140 aaattgaaga agaacgatga cgtggatact gggaaaactg gttcaacttc ggatgttcca 1200 ggactgttga agcgtggaag ctctgaaact gtcgacgcag ccaaattcgg atcgaaaagc 1260 gctggaaaga ctcctaaaga gaagatctta gctgctcaag cagccaacga ggcaaagaag 1320 aaggtaaata attctgaaaa aagtaaaata cttataaaaa cagaaattaa ttccagaagc 1380 cagtgatctt caaatgtgtt ctatgcgtga aggatgcaca cgagacggag gaatgcacac 1440 actacccgac caaggagagc cgcatcgcac gcatggaagc aatgaacctc tgccagaaat 1500 gcgtgaaata tcacggagga agttgccact cgaaggtcca gctcaactgc tccatttgtc 1560 gcggagccca tcaaacattc ttgtgcaaca tcttcgaaga agatagatga gagaatccta 1620 aggaaatctg ttcaaaaaca gatcatttaa atcccgtgaa acaattatct gattcctctt 1680 gctacccgtt aattaattgt tattgtttaa aattgggtta tccatttaat aattacccgt 1740 tgttgttgcc aaaaataaaa attctgagag aacaagaaca gttttggtct tcttaatatt 1800 cagagtacat ccaaagagtt gaaatcgtcg tcgtgaatct gaaagtctgg gagaaacagt 1860 cggtcgcatc aggcgaacaa ggttttctag agaaaaattc cgtttgaatc aaagccgaga 1920 gattcgagaa tgtagaagag acgtagaaag agaatgtgcg ccccaccgca aaattgagga 1980 gagagcacac aaaattgaga gacgcagaga atgcggttct tctccgatag ccggcaaaaa 2040 taaattcgta tcgccacttt tgaagaaatc cttgtaaatt tctatgtttt cgcttcaatt 2100 taattaaact gatgtattca tgtgatcgaa tcgtcgtctc tttcacaaat cgcaccttcc 2160 gtcaaaaaaa tcgtcgttct tccgctcatt aatcccgaaa tcaagtattt tcagccaaag 2220 agcgacaaaa aaaagattcg agaaacggat cggccaagct gcggaactac tggaaacgct 2280 ggagaaatca ataagtgagt aatttatcta tgaataaatt caaaaaggaa aatccagaaa 2340 ttggtaaaaa gcaggggagc cgctgcccaa attttcattt gaaaaaggga aaatttttgg 2400 aattcagttg aaaatctgtc attaaaatgt taatttcaga cacggaaagg tgccaatgga 2460 ggagtcaaag gagaatgacg ggatgtcaga cggaaatctc tgctgatgtt ggatgaacga 2520 cgaaaaaagg gaaaatccac ggattctggt gacgaaattc tccataaaag gaaagaaaaa 2580 ggacgaaaag gaccgatgtt ggctgaaagg atatctcgtg aacaggagca tggaaatcgg 2640 agaacctgag aagttccaat cgacgaagga aaataatttc gaggtattaa ttcaaaaaaa 2700 gtttaaaaaa tttcggaaaa gtatggaaat tagaagaaat cttgtctttt aaatttaaaa 2760 gataaatctc atttcgacaa agaaaatttt tgacggaaaa taaacaattt tgaatgaaaa 2820 attgcggata taattcggaa aattgctgca atattataat tatcgaatcg ataattttca 2880 gattgtgtga tctaacgacg agagcagcta caaaggcaac acaaaacgat cgggacacag 2940 tcggaatgtg gaaaatcgcc aaagaagagc acaagacacg ggacgagaaa atttcaatta 3000 attctaaatg aagaattgtg ttttcaatga aaatttattt atttaaataa ataaaaataa 3060 attaattaat ttggagatcg agcgttattt atgaaggaat acttggaatg aaatgaattc 3120 tcagaattag gaaatgtttt aacagaatga gaataatctt cagattcaga ttgttcggtg 3180 gaaaggaata ttttcgtaaa aaagggtcga attctgaaaa aggaaaatca atttctagtt 3240 ccggaaaaca atagaaatct gccaaattgg aatcggatgt cgaaggagca atttattctg 3300 aaattggaat aaaaccctga aaaaaaaaaa aaaaattttt tttttttaag aaataaggtc 3360 gtacactaga agttagtagg ctattcttcc gaaatttact ggaagtctaa ttgaatatga 3420 agcatcattc atatgagtga tttggaaaat tctcccgata cttactggga gtcttatggg 3480 aaattagaaa ttgggaattg gaacattgaa aattgaaatt ggaacattgg aaattgaaaa 3540 tttggagatt tggaaacctg gaaacctgga aaactggaaa acagggaata tggaaaaccg 3600 agaacttgaa aaatggataa ctgaagctga atacaagcat gagttttttt ttggaaaatt 3660 ctttcgatat ttactgaaag tcttatggaa tggaacatag atggattaaa ggaacatgga 3720 aactctttcg atacctactg aaagttgtta tgttacaggg agtcttatgg aaaataggaa 3780 ttagctagat tgagaggatc cccctttctg cgggaaaggg cttaatcata gcaactggga 3840 aattcggaaa cgtttctggt cgggaactta acaacacccg atgaagaact tggaactgga 3900 gataggacaa tccaaaatgg aattgatcta aataaattga attaatctag atatcaaagg 3960 ttctagtaaa atcaatatgg aaaaatctga gttaatcggt tatcagagaa attggaacaa 4020 gatcatatcc tatcaaatta aaaccaggtc gtgtctgaat aagaatagaa gagcaaaaga 4080 gatggaatac attgagagaa aagttgcacg tactcacagt ggaacctgtt gaagcaatga 4140 tatctgatca atatctatgc caatcagacc aactggaaca tcatgatgat tctaagagac 4200 tgcagtaatc ttaagattta ttagagtttt cgccttttgt tttcaagtgt ctctcattct 4260 tttctttttt tttcttactt attctaattt tgccggccat atcgtgataa ttaaattgcc 4320 cagcaagcgt cagaataatt atcatttaat tttcagagct atcactcaac tcgaacgact 4380 atggcgattt tgaacgcgac tttgaagtcg cagaaacagc gtattggaag aggggcaact 4440 attgccaaga aaactttgaa agtctcccaa gaactcttgg ttgacttgaa agctggaatt 4500 ttccaaccgg aactgaagcg tgacgagatc ctgaaaactc gtgccgacgt ggtggggatt 4560 ttggaatttc tagagaatat tgaaaattgg atcgcggagg agatccaaca ggacgctgaa 4620 acatcagcaa atgtggagtt ggtggctcag tacactaaat ccatgaaaga gcatgcgact 4680 ctgaaagagt atgaagtgct agcgaaaaga ttgaagttga tgaaaacggg actggaagat 4740 gtcattaaaa accaagaaga ttggcaatca ttgccagcga ctccgattgg aaacaatgac 4800 ggagatgatc aagacaatga tccaatgtcc gacgaagctc caactgtgac agggttgaat 4860 caggaaagtg atcaagtggg acccaatcaa gttactctca atatggagtt aattctagct 4920 agtatcaaat ctcttgaaga tagatttaaa actatggaat caattcaggg aactgcaaag 4980 gcgaccaaat gtggaaaagt tgggtctacg accaaaagta attctgccaa cacgaaagga 5040 aaaggatcgg gtagccgaaa gaaggattac actcagtcag aagacagtag tgaggattct 5100 gattcgagtc aaaattccag caaatatttg acggatgcgt ttgaagtcag tgaaagaagt 5160 gatacgtctg attctgaaga agaatcaaag gcgagaactt ctagaaacac gcgccctcaa 5220 aaacgctatc tgaagatgaa gctccaagag cctaaaaaat ttgatggagt caacaagctc 5280 gaatacgaag agtggaaagt aattttcatg gagggatacg gagaaaatcc tgatctcacg 5340 aaaaaatcga aattgatcca attgaaagga tgcgtgactg gaagagccca atcattgttg 5400 aacggtcttc aaatcaccga caaaaattac aaagtggcct ggaaaattct ggatcataac 5460 tttttgtcga atgctcgacc attggatgaa attgaaaaga gattcagaaa cgccaagatt 5520 gatcagaaaa actatgatca gatgaaactg gacgtaggta ctatcacatc attggtatat 5580 gatatgaagc atcgtggaat gaatgtggat agtccatgga tttatcagtc aataattggg 5640 aagctacctg ctcacatcgc ggaggacatt cttatcaaaa ctcaaagctc aaaattcaat 5700 ggcgaattcc ataaagttga gagatggatc gatagaaaaa tcaacgcaaa aatggccatc 5760 aatgagtatc gaaatgaaaa gaaaggtcca aaagattttg aattggaaaa tgatgtgaac 5820 cagttggaag caaaaaatga gaagggtaac cccaaagaaa atggtcaaca ggggtttcgg 5880 aaagagggag gccacccggg acaaaaaggc cacaataaat ctacatgcag attccacaac 5940 aaaggagaag acaacggaca tccattctgg aaatgctcca aatctggtga agaaattctt 6000 gacattgccg tcaaacaagg actttgcttc atgtgcttca gcaataagca catcagcgac 6060 gaatgtcgga gagacaagct caagtgtcaa aagtgtaaac gtggacacaa taccatcatc 6120 cacggaccag tgggaaaatg gttcagagca aaccagaagc caagacagga acctcagcag 6180 tggaactaca agccaggaac acacggatac cttcagcaac aggcaggcgc aaacggcgct 6240 ggcccgagcg agtctcatta gagacccgct cgaacaaaac cttatcgctc cgaaaggagc 6300 gcaacagtca gcagcttatg ctgctgacaa agagcctgac aacaccaggg ctctatgtga 6360 ggggggtgac gacatcgaag agcacgaaat ttcagctctt gatgcagaat tttccagaaa 6420 gggagaaatg gcttatttac ttttgaaaac cacaaatgga aatgagttaa ttggtttaat 6480 tgatacagga gcaaattgtt gtctaatttc tgaagaatcg gcaaaaagaa ataacctagt 6540 agtggaaaaa actgttagaa tgagattgaa aggagttggt agtgatgacg tatttgtttc 6600 caacatttac aaagtgccat tcaataataa aatattgtac gccaaaagtt ataacaaact 6660 tacatcaaaa gtcagaactt ccaaacttga aaaatttgaa attcaaattc ttgaacaaat 6720 gggaaaagat acgagtgcac tggaaaagtt gcataacagc aatgggaaaa taattgacat 6780 gattttaggt actgatataa tttggcaact ttttgaagaa gctgaattca aaagagtgag 6840 caatgagaaa acccttgtgt gtacaagctt gggtgaattc attattccga ctagtttcag 6900 aacattgaga ggagaagaaa tgggtgaaga agtgtacacg agcgaagcat ggatagagga 6960 agaatttgac tcgtgcaatc agataacaga gatggaagaa gatgaagctg atttatcttc 7020 aacagctctt gaaaagatgt ggaggttaaa cgtcttagga ctcgagactc cagaattggc 7080 aaaagaagaa cagtctcaca tggaagacgt tttggcagaa ttccaaaaaa cagtaacatg 7140 ggaaaacgga aatcttttgg tgaaactgcc catgaatgga agtgaaattc atttagagaa 7200 caacctaccg attgcaatgc aacgattgaa aaatcagttc tataaactta catgatctcc 7260 agaagttatc aaagaatatt atgacattat ccaaaaacaa ttgaagtcag gttttgttga 7320 agtttgtagt gacattccac caaaaaacag atgtttctat attccacaca tgactgtagt 7380 gaaacaagat tcaaatacga caaagttaag aatggtgttg gatgcaagtt ctcatatgaa 7440 aaacaagttg tctcttaatc agtgtatcaa aagttcacca tcaataataa agctaataac 7500 tggaattttg atgcgatgca gagctggaaa attttatatg acgagtgata ttgaaaaagc 7560 ttttcatcaa attcggttgc aaaaagatga cagagacttc acaagattcg tttgggtaaa 7620 ggatccgtcc aaaggacctg ttacagacaa tttcatcaca tacagattca caaggattcc 7680 atttggatta tcatgttctc cgtatttatt agcggcatct atcctaacgt acatggaaaa 7740 atatccagct gaaatcaata agcatttgga acaaaactgt tatgttgaca atatgatgtt 7800 cacaacgaac aatgagtctg agttaataaa tatgtacaag gagtctaagg aagcagcaaa 7860 aaagtgggga atgaacctcc gagagtatca aaccaattcg gaaagtacat ctagtgctat 7920 tccagagatg gacagagctc cacaaaagcc caacaaaatt ttgagtcaca tatttgatgc 7980 tcaaaaagac acgatgactc tgaaaattcc ggttcctccc aaggatcaaa gaccaacaaa 8040 aagagaatta caatcgtttc tagcaaagtt ttatgatcca atgggaattc ttacaccact 8100 gactgtaaga ttgaaattat ttctgcaatc tctctggcct acaaagatca attggaaaca 8160 aaagattcca gaaggtacag aaaagatttg ggaaacaatt cggaaacagt tccaacatca 8220 acagttcacg ttcgcaagaa cattgacaca aaattttgat ttcatcaggt caaagctcat 8280 aattttcagc gatgcatcca aaaatagtta cggactagca gcatacatca gattcgaata 8340 tcccgacaaa actgttaaaa catccttgat tatgggaaaa tccagagtga aaccgttaaa 8400 caaagatgtg cggtatacaa ttccaagatt ggaattactc gccttggcga ctggcacaaa 8460 ttgtgcaatt tacattcaaa aagagcttca tacaaatatt ccatttgaat cagtagagtt 8520 cttctcggat tcaatgattg ctcttgcctg gacaaccact cagaaaccac tgaaaatctt 8580 tgtcagccat caagttcagc tgataaaaga gaattgcaaa atgttgggtg atctagagat 8640 caatcataaa ctacattatg ttcgcacaga agataatcca gcggacatga caactcgagg 8700 aaaaagttct aatgaattat tcgagtcgaa attgtggttt cagggcccag aattcttgaa 8760 taaatcagaa acagaatggc cacaacagtg gacagacaat tctactatcc caaaatgtat 8820 tgccaatgag atagatgagg aaacaagtgg tgaaaaaaag aaaagaagta ttggaaaagc 8880 tgaagaatac atgatcaatt tgtcaacagc taccgaagaa aacgaagaaa ctcttccatt 8940 cacttccaca gtaccgtatg aaaaaacgaa ttcattattc aaacttatca gaagtaccaa 9000 ggctgtcatg aaagcagcat cacttttcag aaagaacgct ggtaagcagt gtacaggaaa 9060 attactgaga aatttttcag cagctaaaga tggagtggaa ggaaacgaag acagacgaaa 9120 gtctgcactg tattacttaa ttattgatca ttacaaagat gcagactcaa aatcagaatt 9180 caaaattgca gcagatctca aaccaatgat ggataatttt ggcttgataa gacataatac 9240 gagattagac aagtctaatt taccacgaga tacaaaattt ccaattattc tgttgcgaga 9300 tcacaagctg acagaaatga taatgagaga tattcattgg agaaataagc atatcggcac 9360 ggagcaactc ttcgctgagt gtcggaagac ttactggatt ccaaaaggca ggcaactcgc 9420 aaggaaaatt gtgagctcgt gcacagtttg tcggagagca gcagggagaa aatacaaata 9480 tccagatatt ccacctttgc cggaatgcag agtacagaaa tctagaccgt ttgaaagaat 9540 tggactagac tattttggac ccatctacta caaggcgaat ggagttgaac gaaaatgctg 9600 gacattaatt tgtacatgtc tggtaacaag aaatatccat ctggaactgg tttcagacaa 9660 tggaacagtc gagtttattt tggctatgag aagattcttt gccagaagag gaacaccaag 9720 aaaagttttg cttgataatg ccagaacttt caaactggga gagaaaattt tcaatggaga 9780 tattcgaaga cttggctcgg aatgtgaagc attcacgacc ttcctggatc agaatccaat 9840 gaattgggat tttatcactc ccttgagtcc ttggaaaggc gtgatctacg agagactgat 9900 tgcaattgtc aaaagactac tgtactcgtc cggagacact gccaaactaa acttcgttca 9960 attattcaca acaatgacag agattgaagc catcgttaat tcacgaccaa ttactcgcaa 10020 ttcagaaaaa ttgacagatt caccagtaat tagaccagcc gactttctga atccaggagt 10080 cactctcgga atattacaag atgatgtgat agttgatgaa gattctgttg gaaaagtaga 10140 gcaattggcc agacttcact tggcaaaatt gaataatcac cttgaaggat tatggaaaga 10200 ttggggagaa ttgtacctca cacagctcaa agaagcgcat aaacaaaaca ggcaatatac 10260 cagagttgaa ccaaaaattg ggcacattgt tatagtagaa gaaaaactgc taccaaggca 10320 caagtggcta ctaggcctga ttagtgcgtt acacacagca ccgaatggag atgttcgtac 10380 cgtaacaatt cgatacaaca acaaagatct agtcagatca gtgaaccaac tggttcctat 10440 tgaaatgaat gaaggtcttc aagacattga aatcaatccg caagcagaag cagcatccaa 10500 aactgaacca tcgaatccaa agaattcaga gaattcagaa caaactgagc acaaatcttc 10560 aaatgacgaa tgtgcacagc caaaaattga agaagttgat actgaatcga ctgatgagtg 10620 tgatctcaga gagagattac caagaaaagc gaaactcgga gttacttatt acaactgtga 10680 tctggtctcc actaatattg cttcggatat aattggaaat aaggattcta tgataccggt 10740 tgaaaaccca cttgaccaag atgtcccttc aggccatctt tttggggccg ggagtgtc 10798 // ID TC5B repbase; DNA; CEL; 641 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE TC5B nonautonomous DNA transposon - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW TC5 family; TC5B; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-641 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification of putative nonautonomous transposable elements RT associated with several transposon families in Caenorhabditis RT elegans."; RL J. Mol. Evol 43(1), 11-18 (1996). XX RN [2] RP 1-641 RA Kapitonov V.V. and Jurka J.; RT "TC5B."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [2] (Consensus) XX CC TC5B is a nonautonomous DNA transposon derived from TC5. CC There are about 20 copies of TC5B in the genome; they are CC ~96% identical to the consensus sequence. CC TC5B elements are flanked by 3 bp-long target-site CC duplications (TNA) and 140 bp-long terminal inverted CC repeats ~95% identical to the TIRs flanking the TC5 autonomous CC transposon. XX SQ Sequence 641 BP; 215 A; 102 C; 103 G; 220 T; 1 other; caagggaagg ctctgaactc gttatcggac ttcgttacgc cactatatac attcgataga 60 gaatagttac agatgatcac tccaaaaaat ttagctgctt cagagcaggt tcgaccaagt 120 tacgacattt tgaaagtgcc aaaatacccc tttttaaaaa ttccaaaaaa ttcaaaaaaa 180 aatttttttt ctaaaagttg atataaaatg ttttcaaata gttggaaatg atggaaaaat 240 cataccatac aagtttcagt tcataattct caaaaaaacc cttgaattga tctacatgaa 300 catctacttt ttggaatttt tctcaagata tgtagattaa tttaagggtt tttttgagag 360 tgatgggcta aaacttgtat aatatatgtt ttttactatt ttaaatcgat tacaactatt 420 tttatcaact ttcagaaaaa aaatttttca gtgaaaattt tagatttttt tcaaaggggg 480 tgcatcaakg attttttttt cggcaacttc aaagtgtcgt aacttggtcg aacctgctct 540 gaagcagcta aattttttgg agtgatcatc tgtaaccatt ctctatcgaa tgtatacagt 600 ggcgtaacga agtccgataa cgagttcaga gccttccctt g 641 // ID NPAL5A_CE repbase; DNA; CEL; 237 BP. XX AC . XX DT 31-AUG-1999 (Rel. 4.07, Created) DT 03-OCT-2005 (Rel. 10.11, Last updated, Version 2) XX DE NPAL5A_CE is a putative non-autonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE32A; KW MITE; NPAL5A_CE; nonautonomous DNA transposon. XX NM NPAL5A_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Belknap R.W.; RT "NPAL5A_CE."; RL Direct Submission to Repbase Update (AUG-1999). XX RN [2] RA Jurka J.; RT "NPAL5A_CE."; RL Direct Submission to Repbase Update (AUG-1999). XX RN [3] RP 1-237 RA Kapitonov V.V. and Jurka J.; RT "NPAL5A_CE."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [3] (Consensus) XX CC The final version of the consensus sequence [3] have been CC determined CC based on identification of 5 bp-long target-site duplications. CC NPAL5A_CE has perfect 9 bp-long terminal inverted repeats and it CC forms non-perfect palindrome. XX SQ Sequence 237 BP; 72 A; 52 C; 53 G; 60 T; 0 other; cacggcccgg caagtggtac atccatgcaa atgcgctcta ctgataattt gagtgtagac 60 caggtttggg cgcgtgataa cgaaaaaagc tttggtccaa aaaatttaga atttaatttc 120 ggacattttt tatatgcatc acaaaaaaac tggaccaacc gtttttgaga tacacgcgcc 180 caaacgtcca ggtatacggt agacaaattg cgtacaggta ccacttctcg ggccgtg 237 // ID MARINER4_CE repbase; DNA; CEL; 1250 BP. XX AC . XX DT 21-OCT-1999 (Rel. 4.09, Created) DT 21-OCT-1999 (Rel. 4.09, Last updated, Version 1) XX DE Autonomous DNA transposon from C. elegans - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW Autonomous DNA transposon; mariner superfamily; MARINER4_CE; KW TA target site; TIR; transposase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1250 RA Kapitonov V.V. and Jurka J.; RT "MARINER4_CE."; RL Direct Submission to Repbase Update (OCT-1999). XX DR [1] (Consensus) XX CC MARINER4_CE is a prototype of autonomous DNA transposon that was CC active in C.elegans genome more than 10 million years ago. CC MARINER4_CE is flanked by TA target site duplications and CC it has 31 bp-long terminal inverted repeats.There are only CC several copies of this element fixed in the genome, they are 96% CC identical to the consensus sequence. XX SQ Sequence 1250 BP; 317 A; 316 C; 295 G; 322 T; 0 other; ttaggttgtg aaaaaagttc tctccgaaat tattgaaagt tttgccccgg aaagttctgc 60 agctttctgg acgattctat aatagaatgt tgtagaacat cctagtagca tctcccctaa 120 gcctatataa agcgcgcgcg cttgctgatc gaccaactac gatcgaactt tctcaacctc 180 aacttcgact gctcatcttc ttctagtgga agagtggcgc tcgtgcggct gagacggctg 240 ctaaaatatg caacgtgttt ggcagggatg ccgtcaccga tcacaccgtt cgctattggt 300 tcgctcgctt cgcttctgga tgtacgagcc tggatgacaa cgaccgctca tgaggcattc 360 tgtcatgatc gatgatcagt tgtgagccac cgtgagggag aagccggatc aaactacacg 420 tgaattggca acaaagctcc aatgtagtaa ccacgctatc agcaaccact cttactggct 480 cggctacaga agagtgctgg cgagatgggt gcctcatgct ttttccgttt cagatgcaga 540 cgcgcgagac gatatgtcag tctctcctcc tcactcaaca gattaaagat tttttggcga 600 acttagttac cggagacgag agctggctac tctataataa cgccacccat tgttctgtct 660 ggatgtacat gacggcaagc cgcctgtgca gccaaagctg gagctccatg agaagaagtg 720 tttgctgtgc tgcttctggg acgcgaaggg tatgctgttt ttcgagttgc ttccgcaaga 780 cagaccaatc acgccttcca cctactctga tcagctggat tcccttgctc tcgcagtcca 840 agaaaatcac cctagacact ctgcggtcca ccttcttcac aacaacgcgc gccctcatgt 900 ggcaaaggac acccaggcca agttgcagga gttcggctga atcacagttt tccatccatt 960 gtacgcgccg gacattgcac cttcggatta tcatctttcc gcccgctcaa gatgttcctt 1020 agagatcaga agttcggaaa gttcgaggac ctcaaaaaag cagtgtccga cttcttcgac 1080 tcgcgctcgc ccgcgttctg gaagaagagg gtggacgatc ttcccaatag gtggctcaca 1140 gttgtgaata ataatgatga tcatatggtt gattgatagt gactaaaaat aaagttactg 1200 agaaaatgaa ctggagtcaa atttcggaga aaactttttt cacagcctaa 1250 // ID PALTA5_CE repbase; DNA; CEL; 352 BP. XX AC . XX DT 09-DEC-1999 (Rel. 4.11, Created) DT 09-DEC-1999 (Rel. 4.11, Last updated, Version 1) XX DE PALTA5_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; PALTA5_CE; KW TA target site; TIRs; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-352 RA Kapitonov V.V. and Jurka J.; RT "PALTA5_CE."; RL Direct Submission to Repbase Update (DEC-1999). XX DR [1] (Consensus) XX CC PALTA5_CE is a putative non-autonomous DNA transposon; it CC forms almost perfect palindrome and is flanked by putative CC duplications of the TA target site. Presumably, multiple CC transpositions of PALTA5_CE were induced by Mariner/Tc-like CC transposase. The C.elegans genome contains about 100 copies CC of PALTA5_CE. PALTA5_CE copies are ~77% identical to the CC consensus sequence. XX SQ Sequence 352 BP; 103 A; 71 C; 73 G; 104 T; 1 other; tatctcaaaa actagaagag ctatcaaaaa gttgtcaact aacaaaatat agcccttaaa 60 attttctaca ttttgttagt tgaaaacttt ttgatagctc ttctccccgc ggagatacac 120 gcgtttaaag ttttgagagg cgagagagtg tgggggaggg gagacgcagt gtgcgcgcgc 180 cgcctgcgtc tccccccccc ccacactctc tcgccyctca aactttaaac gcgtgtatct 240 ctgcggggag aagagctata aaaaagtttt caactaacaa aatgtagaaa attttaaggg 300 ctataatttg ttagttgaca attttttgat agcttttcta gtttttgaga ta 352 // ID LINE2C1_CE repbase; DNA; CEL; 3207 BP. XX AC U13643; XX DT 14-SEP-2005 (Rel. 10.09, Created) DT 15-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE CR1-type non-LTR retrotransposon from C. elegans. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; LINE2C1_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3207 RG WormBase Consortium; RT "Genome sequence of the nematode C. elegans: a platform for RT investigating biology."; RL Science 282(5396), 2012-2018 (1998). XX RN [2] RP 1-3207 RA Jurka J.; RT "LINE2C1_CE: A recent copy of CR1-like non-LTR retrotransposon RT from C. elegans."; RL Repbase Reports 5(9), 261-261 (2005). XX DR EMBL/GenBank/DDBJ; U13643; Positions 19242 22448. XX CC The similarity of ORF to reverse transcriptase was identified in CC ref. 1. This sequence is relatively recent and its closest copy CC differs by 10 base substitutions, of which some are within the CC ORF. XX FH Key Location/Qualifiers FT CDS 180..3080 FT /product="LINE2C1_CE_1p" FT /translation="MLHLILVSAHWWLCLPLCLIPSPSSPFLLLITLEIRP FT LPPPTCQKNAPFLNCHLANTRSVACAERLVALHEYVRANKIDILCITESNL FT STQIPLALCSSHDILCLRCDRGPPHPKSRGGGVAVFYSKSITLIQADSLLD FT QFYPLHSCEIVTFDHKLSKSRFIIVYRPPLTSVTETNYLFENISLLLTSQA FT KNHYILGDFNIPSARWSRDIPVPPGIPQALEDLIISHNLTQMVNFSTRVAF FT SGIGNFLDLIFSDNPNTFLSCSPSLPLICSDHSTVSFTLAIDCNSKPLPPP FT KTKSLDFRKCDFLSLNQHLMSYNWPRQLSYFSNTDSKLAHFSRIFNELIVL FT YTPIAKPPRPKSTCHKSIYRLARRACSNTISVRRKMRKELKSIKRKFAHEE FT DRAIHSTDPRSVFQLVKKRISPRPQPINLDINNETVSDPVRIADEFIHNFA FT LSFTAPCPPFPALPAPKPSKLSPDFSPFNIHRHIKALKAKIGFSTDNINFF FT IIKKCSDSLTVPLSIIFSESFSSSFFPTPWKTSIIVPLHKKGSTTNPGNYR FT PITLTHPLSRLFERVVLDALKQKLSPLLSSKQFGFLPARSCTLAVLESTSK FT IQKVLLDKSKYVDTIYFDFKKAFDSVPHNLLLLKLLNVGLDSGCCEWISSF FT LSNRSSKIKIDHFISDNSFNVMSGVPQGSVTGPFLFLLYINDLLDLFPPDV FT HVTAFADDLKLLGSDPTSIQTSINIVADWCKKWRLNLAEHKTAVLHFGKQN FT PRHKYFVNSVEIKPRDSIRDLGIIVDTKLSFVNHINQTSNSALLKCRQILR FT SFRSTSPEFYFKLFNVYIRPVLEYGCELFPPNSTKLAKKLESPLRFFSKCV FT FQRCNLSYSSYSSRLSQFNLLSTHHRRVLQILRTFHKIISSEYHFPSLSSF FT VRASRSTRHPYLLVVCGVPSKCFLHVNLSLWNRIAKRFPKLLTPNAFASRL FT GSIPFDTLFPPT" XX SQ Sequence 3207 BP; 804 A; 907 C; 487 G; 1009 T; 0 other; ttattatgtg cgcgacctta agattgtgac tctctcgtca ccgcgtccac ttcctatctc 60 tgttgattcg gctgctcccg ttgctaacgc ataggtaaac ggccaccggg ctttattatc 120 gtcttgtccg tccattgctg cttcatgtcc gatacaatca tcttctctaa atatattcaa 180 tgttgcatct aattctcgtc tctgcccatt ggtggctgtg tctccctcta tgtcttattc 240 cgtctcccag cagtccgttc ctgctgctca taacactcga aatcagacct cttccccccc 300 cgacctgtca aaaaaatgcc ccctttctga attgtcacct tgccaatact cgttcagtcg 360 cttgtgccga acgcttagtc gcgctgcacg aatatgtacg tgccaataaa atcgatatcc 420 tctgcataac tgagtccaat ctttctaccc aaattcctct tgcattgtgc tcttcccatg 480 atattctctg cctacgttgc gacagagggc ctcctcatcc aaaatctaga ggtggtggtg 540 tagctgtttt ctactcaaaa agtattacat taattcaagc cgactcctta ctcgatcagt 600 tttatcccct ccattcttgt gaaatagtaa cctttgatca taaattatcc aagtcacgtt 660 ttatcattgt atatagaccc cctctaacct ctgttaccga aaccaactat ttatttgaaa 720 atatctcgct attgcttacc tcccaagcca aaaatcacta cattctcggt gatttcaata 780 ttccctcagc tcgttggtcc cgtgatatcc ctgtgccgcc tgggatcccc caagctcttg 840 aagaccttat aatttcccac aatttaaccc aaatggttaa cttctcaacg agagtcgctt 900 tttctggaat aggtaacttc ctcgatctga tcttctctga caatcccaac acgttccttt 960 cgtgctcccc tagcctacct ttgatttgct cagaccactc aaccgtctcc ttcacacttg 1020 ccattgattg taacagtaaa ccacttcccc caccaaaaac taagtccctc gacttcagga 1080 aatgtgattt tttatcctta aatcagcatc tcatgtcata taactggcca cggcaactct 1140 cgtatttctc caatactgat tctaaacttg ctcatttctc ccgcattttc aacgaattaa 1200 ttgttcttta cacccctatt gccaaaccac ctcgtccgaa atctacctgt cacaaatcca 1260 tttataggct tgcgcgtcgc gcgtgctcga ataccatatc agtcagaaga aaaatgcgta 1320 aagagctcaa atctatcaaa cggaaatttg ctcatgagga agaccgagcc attcactcca 1380 ctgacccccg cagtgttttt cagcttgtaa aaaaaagaat ctccccccga cctcaaccta 1440 tcaatcttga catcaataat gaaacggtgt ctgacccagt gcgcattgca gacgagttta 1500 ttcataactt tgctctgtca ttcacagcgc cttgcccacc cttcccagct cttccggcac 1560 ccaagccctc taaattatct cccgatttct cgccgttcaa catccacagg catatcaaag 1620 ctcttaaagc caaaattggg ttctctaccg ataatatcaa tttctttatc ataaaaaagt 1680 gtagtgattc cctaaccgtc cccttgtcca taatcttttc cgagtccttc tccagctcct 1740 tcttcccgac cccctggaag acttccatca ttgtcccatt acacaaaaaa gggtccacaa 1800 ctaatccggg taactatcgc ccgattacac ttactcatcc attgtcgaga ctatttgaac 1860 gagttgtact tgatgctctc aagcagaaat tatctccact cctctcttcc aaacaattcg 1920 gtttcctgcc tgcgcgctca tgcactctgg cagttctcga atcgacgtcc aaaattcaga 1980 aagtcctcct tgacaagagc aagtatgtgg atactatata tttcgatttc aaaaaagcat 2040 tcgatagtgt tccccacaac ttgcttcttc tcaaacttct gaatgttggg ctcgattcag 2100 gatgctgtga gtggatcagc tcgttcctca gtaaccgttc atcgaaaatc aaaattgatc 2160 attttatctc ggacaactct ttcaatgtca tgtcaggtgt cccccaaggt agtgtaactg 2220 gaccattcct ctttcttttg tatattaatg acttattaga tttgttcccc cctgatgtcc 2280 atgttacagc ctttgccgac gatctaaaat tactcggatc ggaccccacc tctatccaaa 2340 caagcatcaa tattgtcgct gattggtgta aaaaatggcg tctaaatctt gctgagcaca 2400 aaactgccgt attacatttt ggaaaacaga acccccgtca taaatacttt gtaaatagtg 2460 ttgaaattaa accgcgcgat tctatcagag atctcggaat cattgtcgat accaaattat 2520 catttgtaaa tcatattaac cagacttcta atagtgccct ccttaagtgt cgccaaatcc 2580 ttcgctcatt ccgctccaca agccccgaat tctattttaa actgttcaat gtttacatcc 2640 gccccgtact cgaatatggc tgcgagctct tcccgcccaa ctccaccaaa cttgccaaaa 2700 aactagaatc tcccctccgg tttttctcaa agtgtgtctt tcaaaggtgc aatttatcgt 2760 actcgtcata cagtagtagg ctttcccaat ttaatttatt gtctacccac catagacggg 2820 ttcttcaaat cctccgaacc ttccataaaa ttatttcaag cgagtatcac ttcccgtcct 2880 tatcatcgtt cgttcgagct tcacgctcta ctcgtcaccc ctatctactt gttgtctgtg 2940 gcgtcccgtc taaatgtttt ctacatgtta acttgtcact gtggaatcga atagcaaaac 3000 gtttccctaa actgttaact ccaaatgcat ttgcgtcgcg cctaggctct atcccatttg 3060 acaccctctt ccccccaact tgacctcccg ggctatagct ggtccaattg gtcattttac 3120 ttatctgtcc acacatcttt tttattttac tttttttttc tttgtcttcc ggtgtagcta 3180 tttcttgtgg caaataaaca attatta 3207 // ID HAT3_CE repbase; DNA; CEL; 391 BP. XX AC . XX DT 01-DEC-1999 (Rel. 4.11, Created) DT 01-DEC-1999 (Rel. 4.11, Last updated, Version 1) XX DE HAT3_CE nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT superfamily; HAT3_CE; TIRs; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-391 RA Kapitonov V.V. and Jurka J.; RT "HAT3_CE."; RL Direct Submission to Repbase Update (NOV-1999). XX DR [1] (Consensus) XX CC HAT3_CE is a putative non-autonomous DNA transposon. CC It generates 8-bp duplications of target sites upon CC integration in the genome. HAT3_CE has perfect 11-bp CC terminal inverted repeats. CC The length of the target site duplications indicates CC that transposase encoded by the autonomous HAT3_CE-like CC element is related to the HAT superfamily. CC HAT3_CE individual copies are, on average, 92% identical CC to the consensus sequence. CC There are about 50 copies of HAT3_CE per haploid genome. XX SQ Sequence 391 BP; 183 A; 57 C; 48 G; 101 T; 2 other; cagggctggg accaaaaaaa aattttggac caaaaaacaa aaaacaaaaa attgaagttt 60 tygaaaaacc aaaaaaacca aaaaaaacca aaaaaaaaaa ccaaaaaatt tttgatgctt 120 aagttgattt ttaatggggt tattcaagta atgttgcaaa atgtattaaa atacatttat 180 gacgtcacaa ctgtgttaaa atacatgttt taatgtattt taatacagaa tagtctcgag 240 tcgagactag acacggyaaa catttttttt tggatttttt ggtttttttg gtcccaaaaa 300 accaaaaaaa aaccaaaaaa tcgatttttc gtcaaaatac caaaaaaacc aaaaaacaaa 360 aaacaaaaaa accaaaaaat tcccagccct g 391 // ID LTRCER1 repbase; DNA; CEL; 492 BP. XX AC U15406; XX DT 06-FEB-1997 (Rel. 2.01, Created) DT 06-FEB-1997 (Rel. 2.01, Last updated, Version 1) XX DE LTR of retrotransposon CER1, gypsy/Ty3 group. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; LTRCER1; KW env; gag; gypsy/Ty3 group; pol; retrotransposon CER1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-492 RA Sulston J., Du Z., Thomas K., Wilson R., Hillier L., Staden R., RA Halloran N., Green P., Thierry-Mieg J. et al.; RT "The C. elegans genome sequencing project: a beginning [see RT comments]."; RL Nature 356(6364), 37-41 (1992). XX RN [2] RP 1-492 RA Britten J.R.; RT "Active gypsy/Ty3 retrotransposons or retroviruses in RT Caenorhabditis elegans."; RL Proc Natl Acad Sci USA 92(2), 599-601 (1995). XX DR GenBank; U15406; Positions 1 492. XX CC LTRs of CER1 retrotransposon. XX SQ Sequence 492 BP; 88 A; 97 C; 95 G; 212 T; 0 other; atgtcgggtt taacccgttt ttctcttaat tctctccgat tagctcaaag ggtaattcca 60 gacctcaaat ctttatttaa aaatatcttt tgcgcaagca aagatgactg acgcactctc 120 acgctggcgc acttctcgat tctggcgcga aatcagtttt tgtcatcttt tttcgaggat 180 tcttcctcat tcttggattt tgctcggaca cagcgtgttg actcgttgtt agagctccag 240 tttcagtggt tttggcctgt tgttggctta tttttatagt cttttattgt ttttcttgga 300 tttctggcgc tctgcctgtt tctttgaagg tttctggttt agtgttgatt cttctccagt 360 tccttggttt ttgtatactt ttcctctttc tctcaataaa tttatttgag agttcttaaa 420 aatcgtgttt ttctctttct tttggtctcg tttactggat tggattcaag ttctgtggac 480 aggaacttga ct 492 // ID MARINER2_CE repbase; DNA; CEL; 1242 BP. XX AC . XX DT 23-AUG-1999 (Rel. 4.07, Created) DT 23-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE Autonomous DNA transposon from C. elegans - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW Autonomous DNA transposon; mariner superfamily; MARINER2; KW MARINER2_CE; TA target site; TIR; transposase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1242 RA Kapitonov V.V. and Jurka J.; RT "MARINER2_CE."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [1] (Consensus) XX CC MARINER2_CA is a putative active autonomous DNA transposon; there CC are about 50 identical copies of MARINER2_CA fixed in the CC C.elegans CC genome. MARINER2_CA is flanked by TA target site duplications and CC it has perfect 27 bp-long terminal inverted repeats. CC There is 60% partial identity between MARINER2_CE and MAR_CE. CC Transposase is encoded by a single ORF (position 150-1151). CC MARINER2 transposase is most close to the one encoded by the CC human CC MARINER element (42% similarity and 60% identity between amino CC acid sequences). XX SQ Sequence 1242 BP; 355 A; 259 C; 266 G; 362 T; 0 other; tcaggttgtc ccataagttt ttgtactttt tttcaaatat ttttccaaaa cttctagaaa 60 gttttaaaat tttttcatcg taggtcgtgt caaggtcggg tcgtcccctt tcagaaaaga 120 ttcatttcat ccatttctac tttgccacga tgacaatcat caaacttgaa cgtcgagacg 180 ttagattgct tcttctttat gaatttcgtc ttggtcattc agcaatggaa gcggaacgaa 240 acatatgcgg tgcgatgggt gagggagcac tctcttataa tacagcaaag agttggtttc 300 aaaagttcaa gaacggcgac ttcagtctcg aagaaataga acgttctggg cgaccggtag 360 agttaaatga agaagaccta gtgaagctgg tggaggaaga gcctcgtctt agtcttcgtg 420 aaatggaaga gaagcttgag tgttgtcata gcacaattgc acgtcacttg ggtcgccttg 480 gttttacttc aaaacttggc acttgggtgc ctcatgaact ttcggcatca cagaagctca 540 ctcgggtcaa cgtttgtact caacttctaa cttttcgtcg aaagttcgat tggctgaaca 600 atctggttac tggagatgag aagtgggtgc tctatgttaa ccattccaga aaacgtcaat 660 ggcttccgat cggtgagaaa ggaataccga cgccaaagcc tgatcttcac ccaaaaaaga 720 ttatgatctg tgtctggtgg ggtgttcaag gacccgtgca ctgggaattg ttgccaacta 780 ataaaactat cactgctgat tactattgtg cccaattgga ccgagttgca gaaaagatca 840 acggaaaata tgaaaaacta tattttcttc acgataatgc tatgcctcat gtcgccaaga 900 agactttcca aaagctgcaa gatcttggtt ggactgtttt accgcatcca ccatattctc 960 cagatcttgc accaaccgac taccatttgt tcttgtctct cagtgactac atgcgcgaca 1020 agcaattcga cgacgaagag catctcaaaa ctgaactctc cactttcttc tcatcgcgtt 1080 cgccggattt cttctcccgt ggcatcatga tgttacctag taaatggcaa caagtggtgg 1140 acactaatgg tgaatacttg tgtgaatagt actacttgtc gcttgagaga aataaatttt 1200 tttcaaaaaa aaaatagtac aaaaacttat gggacaacct ga 1242 // ID HELITRON2_CE repbase; DNA; CEL; 5514 BP. XX AC AC006730; XX DT 25-MAY-2001 (Rel. 6.04, Created) DT 19-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE HELITRON2_CE is a Helitron DNA transposon. XX KW Helitron; DNA transposon; Transposable Element; HELICOP2; KW HELITRON class; HELITRON2_CE; helicase. XX NM HELITRON2_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-5514 RA Kapitonov V.V. and Jurka J.; RT "HELITRON2_CE."; RL Direct Submission to Repbase Update (MAY-2000). XX RN [2] RP 1-5514 RA Kapitonov V.V. and Jurka J.; RT "Direct submission."; RL Direct Submission to Repbase Update (MAY-2001). XX RN [3] RA Kapitonov V.V. and Jurka J.; RT "Rolling-circle transposons in eukaryotes."; RL Proc Natl Acad Sci USA 98(15), 8714-8719 (2001). XX DR GenBank; AC006730; Positions 23891 18378. XX CC HELITRON2_CE is a HELITRON-like DNA transposon. CC 904-aa protein which is putatively involved in proliferation CC of HELITRON2_CE copies. There are about 100 nonautonomous CC elements CC in the genome, and they are 96% identical to HELITRON2_CE. CC These elements are ~500-1500 bp long. In many cases, CC HELITRON2_CE forms 7-9 kb minisatellite-like clusters composed CC from a ~200bp portion of HELITRON2_CE. CC HELITRON2_CE encodes a 904-aa protein, CEHEL2p, composed of CC the rolling-circle initiator and DNA helicase domains. XX FH Key Location/Qualifiers FT CDS join(1833..1919,1970..2239,2432..2986,3113..3295, FT 3656..3970,4018..4293,4342..5370) FT /product="CEHEL2p" FT /note="rolling-circle initiator and DNA helicase FT domains" FT /translation="MNEFSQKTDFPSTSDSQEPVSKRTRSRFKSDSSGAER FT KRKQRGKESSEEREARHQYDQDRKRKKKASESKDELKVLLLKDRERKARER FT REAKGISDGPNYLGPMSKTCDHCKAVFFESKKNFQSNIRQFNSAFSMASMG FT AHLDIPKGHGPYCYRIHGQVYHLAGPLHPEPGGRPSYGQIYILDTAQAIEE FT RLGNPANSKCDPQLMAELTKLISTRNPYAKAYKMMAEVEEKENTDALKEGR FT VAEEVRLIFDISTTKDRRCYNVPVSNEVAVVFVGEDQDIPASRSLAIHPRG FT GGLTAIRDIDKIVKYLFKYVYKGHDRAALNIIRSRLKEGIVDEIKAHLDAR FT YVCAPEAIHHLFGFKIERKSDTDIRTVIDDDGVPVVYPTFVEAAKARGLLK FT DDSEYLKSLKEWIGCSVPAALRSLFSSILLFNEVHEPQVLWDTVKRDLSED FT YRHAGARDDEAEALAYFDISSRLHRRGGKAFNDYVPEPSMHPTDIYLNTSD FT PSQCAAEGVRLFQTLNSDQKYVFERIVGALHDPTLPRLFFLDGPGGSGKTY FT LYIALYNMIIGENKMAACTAWTGIAAHLLPGGRTMVSLLKLNIHDRCESSL FT HRRQLKDARELANTDVFICDEASMIPKTALNTFDSLLRDITQVDQPFGGKI FT VILGGDFRQILPIVEKGSRADQVDACIKSSPLWNQFETLRLHSNMRVTTND FT TEWMNFLLGVGDGSLNDSESKVQIPFSLCDGNIVEKVFGDVIDPSTSDPCE FT NVILTPKNVDVTLLNEDVHSRGIGEERIYLSRDEIVTDDPADHMHYPTEFL FT NKMTPSTLPPHVLKLKKGSVVMLLRNLDVSGGLCNGSRFVVESLGRRTIGC FT RFACGERKGKFVIIPRIECYHDKNLSFKLKRTQFPVRAPDGRITNIVFTEI FT L" XX SQ Sequence 5514 BP; 1653 A; 939 C; 1195 G; 1727 T; 0 other; tcatttacta tatataaagc gcgtgtcctt ccgtccgttt gtagtttgta gtttgatctt 60 tgatctttca actctctttt ctgtagttgg aaaaggtatt gggttaagag ttgccggggg 120 ataatgtcaa ggtactgtag tagtactgta ggagtactgt aggtatacgg tagggttact 180 gtagattagg aaaaattgag tttttgtcta cagaagaggt attgggttgg gagttggtga 240 gggataatgt caaggtactg tagtagtact gtaggagtac tgtaggtata cggtagggtt 300 actgtagttt aggaaaagtt gagttttcgt ctacagaaga ggtattgggt tgggagttgg 360 tgagggataa tttcaaggta ctgtagtggt actgtaggag tactgtaggt atacggtagg 420 gttactgtag ttaggaaaag ttgagttttc gtctacagaa gaggtattgg gttgggagtt 480 ggtgagggat aatttcaagg tactgtagtg gtactgtagg agtactgtag gtatacggta 540 gggttactgt agttaggaaa aatttgaatt ttaaccattt gaagagatat tgggttggga 600 gttggtggag gataatgtca aggtactgta atggtattgt tggagtactg taggagtacg 660 gtagggttac tgtagttatg aaaaatgtga attttaacca tttgaagaga tattgggttg 720 ggagttggtg gaggataatg tcaaggtact gtaatggtat tgttggagta ctgtaggagt 780 acggtagggt tactgtagtt atgaaaaatg tgaattttaa ccatttgaag agatattggg 840 ttgggagttg gtggaggata atgtcaaagt actgtaatgg tactgtagga gtactgtagg 900 attatggtag ggttactgta gtcaggaaaa aatttgaatt ttaactcaag aagggtccca 960 ttggccccgc ccccttggtt ttaaattttt ttgaaaacat tttggtggga attcaaattt 1020 cacttttata aaaacatttt gacgggaatt caaattttaa ttttttgaaa acattttggc 1080 ggtaattcaa attttaattc tttgaaaaca ttttggagga aaattgatgg aaattttctt 1140 aacttctaaa agttaagaaa ttgtttaaag ttgagaaacg tctgaaacct tctgaaatcc 1200 actgaatgca atgtcctaga aaattttgtt aagttcatga gacttcaaat taaaaaaaaa 1260 ttcgaacgac aaaatgcctc attgggtgga gtaattttgt cgactccagc ctctctctgc 1320 agcatttcag gatgttctat atttatttta gtaaattaga ttgttttgtt gaccggaatg 1380 tcaaaaactt ttctaaatac ataatttgac tgtatttttt taacacactc tacgaaattt 1440 ataatttgtt tttctgtttt cgttatttac tgtacattaa ttacgaaatt caagatttgt 1500 ttttatgttt ttgttgttta ctgtattagt gaactgttgt ttacagtagt gtttacagta 1560 ttttttgttt actgtacatt aattacaaaa tttaaaattt gtttctctgt tttaggttgt 1620 gttttttgaa aaaaaatggt attatacttt tttgccgaac tacagttttc aattatctat 1680 agtacgattt tggttattac aaaaaaaaat cttcccctgt tttatctgtt ctatcgaaaa 1740 aaaatttctt ttctcttcat tttttcgttt ttttttaaat tattgtttgt gatagtcaat 1800 attttacaga ggaatccgaa gaattattaa ctatgaacga attctcacaa aaaactgatt 1860 tcccttcaac atctgattcc caagagcctg tttcaaaaag gacacgttct agatttaaaa 1920 ggtaagtttt tattttctaa gtttatttgt tagtaaaata aatgttttta gtgatagttc 1980 tggagctgaa cgcaaaagga aacagcgtgg aaaagagtca tcagaagaaa gagaagcacg 2040 ccatcaatac gatcaggata ggaaacgtaa aaaaaaagct tctgagtcca aagacgaact 2100 gaaagtgttg ctactgaaag atagagaacg taaagcaaga gaaagaagag aagcaaaagg 2160 tatatccgat ggacccaact atcttggccc gatgagcaag acatgtgatc actgtaaagc 2220 cgtctttttc gaaagtaagt ttttaaataa tatttttaca aatgaacgtt gaattttacc 2280 atgcagcaaa tttaatttca gatgaagtaa gcggctccac tgatcatatc aacatgtgtt 2340 gcaattttgg actgacaacc tccatcgatg ataagttcgc cgactatccc gatcgattga 2400 aggatcttct tctcggagat tcgattgagg caaaaacttc caatcaaata ttcggcaatt 2460 caactctgct ttttccatgg cttcaatggg tgcccatctc gatattccaa aaggacatgg 2520 tccatattgc taccgcatcc acggccaagt gtatcactta gcagggcctc ttcatccaga 2580 acctggagga agaccatcat acggccaaat ttatattctg gacacagctc aagctatcga 2640 agaaagatta ggaaatcctg caaattcaaa atgtgatccg cagttgatgg cagagctcac 2700 gaaactcatt tcgactagga acccatacgc taaagcttat aaaatgatgg cagaggtgga 2760 agagaaagaa aacaccgatg ctctgaaaga aggaagagtt gccgaggagg ttcgactaat 2820 cttcgacatt tcaactacta aagatcgtcg ttgttacaat gtccccgtat caaatgaagt 2880 ggcagttgta tttgtcggag aagatcagga tattccagct tcaagatctc tagctattca 2940 tccaagagga ggaggtctca ctgcaattcg tgatattgat aagatatgtg atccattgac 3000 atatccataa ggtcataaga tggataacag attcatcgtt ccgtataact cttactttct 3060 gcttcggtac aatgctcaca tcaacgtcga aatttgtgct ttcattgaag ctgtgaagta 3120 cttatttaag tacgtgtaca agggacatga tcgtgctgct ctaaacatca ttcggagtag 3180 actgaaagaa ggaatcgttg atgagattaa agcccacctt gatgctcgat acgtgtgtgc 3240 tccggaggcc atccaccatc tcttcggatt taagattgag agaaagtctg acacggtata 3300 tcgactagct gtccatcttc cgggttttca agttgtggta tttccttcaa atgtaacaaa 3360 ggatcagttg cagtctacgc ctgaaaaaga tacaactctg acagcttggt tcaaaattaa 3420 caaaattagc gaagaaacag tgcagagtgg ttccatgtcc gattcatttg tcgacgaatt 3480 gcattacatt gacatgccat cttaattcac attcgatcaa aaagagcgct gctggaaaac 3540 ccgaaagcgt ggaatccgac aaattggcag aatgtacaca gttgctccca gtgaaactga 3600 acgttatgtt cttcgcattc ttctactaaa cacaaaagga gctacttctt ttgaggatat 3660 tcgaactgtc atagatgatg atggagtgcc tgtagtttac cccactttcg ttgaagcagc 3720 taaggcacgt ggactgctga aggatgatag tgaatattta aaatcgctga aagagtggat 3780 tggatgctcg gttcctgctg ccctacgatc tttgttcagt tctattctac tcttcaatga 3840 ggttcacgag cctcaggtat tgtgggatac cgtaaaacgc gatctcagtg aggattatcg 3900 tcacgcagga gccagagatg atgaagctga agctttggcc tattttgata tttccagtcg 3960 acttcacagg tagctatcat ctctaaactt atatttatag tggaaatata tgttttcaga 4020 ggaggcaaag ctttcaatga ttacgttcct gagccttcaa tgcatccgac tgacatctat 4080 ctaaatacaa gcgatccatc ccaatgtgca gcagagggcg ttcgactatt tcaaaccctc 4140 aactcggacc agaaatacgt cttcgagcgt atcgtgggag cgctacacga ccctacttta 4200 ccccgtctct tctttttgga tggtcccgga ggaagtggta aaacatatct gtacattgca 4260 ttgtacaata tgataattgg agaaaataaa atggtaacta tcaactaatt catttttatt 4320 cctaacatgt ttttattcca ggcggcttgc acagcatgga ctggaatcgc ggcacattta 4380 cttccaggag gtcgaactat ggtgtctctt ctcaaattga acattcatga tcgatgtgag 4440 tcgtcgcttc accgaaggca attgaaagat gccagagaac ttgctaatac cgatgtgttc 4500 atttgtgacg aggcttcgat gattccaaaa acagctctca acacgtttga ttctcttctt 4560 cgggatatca ctcaagtcga tcaaccattt ggaggaaaaa tagtcatttt gggcggagat 4620 ttccgacaaa tccttcccat tgtcgaaaaa gggtcgagag ctgatcaagt cgatgcctgt 4680 atcaagagtt cgcctctgtg gaatcagttt gagactctac gcctccactc aaacatgaga 4740 gttaccacaa acgacacgga atggatgaac tttcttctcg gtgttggcga tggttcccta 4800 aatgattctg aatcgaaagt tcaaatccct ttttctttat gtgatggaaa tattgtcgag 4860 aaagtttttg gagatgtaat cgacccatca acttctgatc cctgcgaaaa tgtgattctt 4920 actccaaaaa acgttgatgt cacccttctc aacgaagatg ttcattccag agggatagga 4980 gaagaaagaa tctatttatc aagagatgaa attgtaacag atgatccagc agatcacatg 5040 cactacccaa cagaatttct gaacaagatg accccttcga ctctcccccc acatgtgctg 5100 aaactgaaaa agggatcggt ggttatgctg ctcagaaacc ttgatgtatc gggaggtctc 5160 tgcaatggat ctcgctttgt ggtggaatcc ttaggacgac gaacaattgg atgccgattt 5220 gcatgtggcg aaagaaaagg aaaatttgta atcatccctc gaatcgaatg ttatcatgat 5280 aagaatttgt ctttcaaact gaaacgaact caattccctg ttcgagcccc tgacggccga 5340 atcacaaata ttgtttttac cgaaattctt taaacgaatg ttttagaaac cttgatttga 5400 tttaataaaa attatttaaa aaaaattttt ttcaaaattt ttcacttcct tatttttaaa 5460 ataaaatttc aaacgaccag tcgcggccgc gccggaggcg cggtcagcgg ctgg 5514 // ID PALTTTAAA1 repbase; DNA; CEL; 593 BP. XX AC . XX DT 30-NOV-1999 (Rel. 4.1, Created) DT 30-NOV-1999 (Rel. 4.1, Last updated, Version 1) XX DE PALTTTAAA1 is a putative nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; PALTTTAAA1; KW TIRs; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-593 RA Kapitonov V.V. and Jurka J.; RT "PALTTTAAA1."; RL Direct Submission to Repbase Update (NOV-1999). XX DR [1] (Consensus) XX CC PALTTTAAA1 is a putative non-autonomous DNA transposon; CC it forms almost perfect palindrome. CC There are about 20 PALTTTAAA1 copies ~88% identical to the CC consensus sequence. CC This element is 60% identical to MER46B; CC PALTTTAAA1 and MER46B share identical 14-bp termini. XX SQ Sequence 593 BP; 198 A; 85 C; 91 G; 215 T; 4 other; tttaaaggtg gagtaccgaa atctgggaaa tatttttaaa tgactccaaa ttttcccctg 60 attccgaata tctatgtgaa aaaattcaaa aaaatttccc tgattttata tttgagcttg 120 aaatcgcgat tttcatttgy gtacccatga gatttttcaa atgcgcgccg aaataaatta 180 tccttggagc gcgtttgcct catttgattt tctccattta tttttatttt tttcagtttt 240 tcagctattt tcattcattt ttgtcgtatt ttatggattt ttttttgaaa aaaatatttt 300 ytttcgaatc aattgaacat tttattagra caaaaatgaa tgaaaatagc tgaaaaactg 360 aaagaaataa aaatttatgg agaaaattaa atgaggcaaa cgcgctccaa ggagaattta 420 tttcggcgcg catttgaaaa atctcatggg tacrcaaatg aaaatcgcga tttcaagctc 480 aaatataaaa tcagggaaat ttttttgaat tttttcacat agatattcgg aatcagggga 540 aaatttggag tcatttaaaa atatttccca gatttcggta ctccaccttt aaa 593 // ID LINE2A_CE repbase; DNA; CEL; 4082 BP. XX AC . XX DT 31-AUG-1999 (Rel. 4.07, Created) DT 02-JUN-2010 (Rel. 15.07, Last updated, Version 2) XX DE LINE2A_CE is a non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; ORF1; KW reverse transcriptase; ORF2; LINE; CR1 superfamily; LINE2A_CE. XX NM LINE2A_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-4082 RA Kapitonov V.V. and Jurka J.; RT "LINE2A_CE."; RL Direct Submission to Repbase Update (31-JUL-1999). XX RN [2] RP 1-4082 RA Marin I., Planta-Rengifo P., Labrador M. and Fontdevila A.; RT "Evolutionary relationships among the members of an ancient class RT of non-LTR retrotransposons found in the nematode Caenorhabditis RT elegans."; RL Mol. Biol. Evol 15(11), 1390-1402 (1998). XX DR [1] (Consensus) XX CC LINE2A_CE is a non-LTR retrotransposon related to the CR1 CC superfamily. Two open reading frames, ORF1 (position 71-883) and CC ORF2 (position 953-3955) encode two proteins. The second CC protein is a CR1-like reverse transcriptase. Protein encoded by CC ORF1 is C.elegans specific. There are about 10 copies of CC LINE2A_CE CC in the C.elegans genome; they are ~98% identical to the consensus CC sequence. Therefore, this family was active very recently. CC LINE2A_CE does not generate target-site duplications upon its CC integration in the genome; its 3'-tail ends by (TATA)n. CC This family is reported as Sam3 in [2]. XX SQ Sequence 4082 BP; 1207 A; 1074 C; 720 G; 1081 T; 0 other; cactcgtcct cccgttctcc ccagcggtcg gtcgcatatt gttcttgctc tcgtatcgat 60 ttttgccgaa atgcccccac ccgatgacga catgatagtt gattcttcac caactgctac 120 aattccgcct acttgtgaca atattctgcg aaatcaaaac ttgccatcaa caccagcttc 180 cggacagcca agtatcaagg aattgatcga acgaatcacc attttggaga aaactgtcaa 240 ggaacaatcg aagaaaattg ccgagttgga agctactaaa ggtttcccat taatcaccaa 300 tgatgctagc aaaagtaaga gtaaacttta ctccgctgta gtccaaaacg accctcaatc 360 tgtgaaaatc atcgaaaagg ctcacttcgc tgctgatctc cgaaagctcg gggaaaactc 420 gatttacgcg attattgaga atgtaccgga ttgcaagaaa gaagaacaaa cgacaattga 480 tgcctcactc atggaaaact tagccaagct ggatactctt ccgaaacctg aacaattctt 540 ccgaatcaaa tgcaaaaaac cggatgttcc ttctcgccct ctcaaggtta agtttgctac 600 tgaatatcaa cgggatactt tcattcgaca attttccaag gcacttcaca atcttcctga 660 acgcccagta tcgtctcgaa ctatcagatg ccggagagac atgtcgcccg aagaacttat 720 actgctcaaa cagaggcgag ctacagccta tgaggaaaac cgtaaggctg gagtaatcaa 780 atattacgtc cgtgatcttg atatttgtga actctccaca cctcgtccgc tgacagcaca 840 gataacacca acatcggcgc caggtctctc gagctccaca taggaattct gccaccggac 900 atattcaacg aaacttgcaa acagttcttc atcatcgtca cccactacat ctatgataga 960 gatttctatt actcgtcaat gtcctttggt ggccaaaatt acacctgttc actcaactgc 1020 caaatcatca aatgccaagt caactgtatc cactgccgcc atgctataca ctaaaaagtc 1080 aacaacccta tcaaattcga ctcaaaaagc tgctacgaag cgacaagcaa ccgctccaac 1140 acctgctaat catcaatcga catcctctgt ggactctctt agtaagctca attgtgcttc 1200 tgccaacatc agatcaattg cttcggcgga gcgtttgaaa tacatccagg attatatacg 1260 ggaagaaaat atcgatattc ttttcttgac cgaaacattt ctgtcaccgg atacgccatc 1320 ttcgctatgc tctactatgg acctgatgtg cattcgttcc gaccgcctac aatctcaccc 1380 gaaatccaga ggaggaggag ttgccatttt ccacaaaccc tcacttatta tgtcgcaaat 1440 tgattcgctc cagaacggat actacgacag acacttctgc gacatcttag ctgttgatca 1500 caaaccatcg aaggccaggt tcatacttgt gtatcgtccc ccagatacat ctatccaaca 1560 aactgctgcc ctctacagca acctgtccga gctcatcagc agtcctgcta actaccactt 1620 catactggga gacttcaatc tgccaaatct catatgggac aaactggatc aactaccaat 1680 aaacattcac caggacctgt ctgatcttat gagctctcac aatcttgcgc aaattataaa 1740 gaagcccact cgtaccgcca tgtctggaaa gcaaaacttc cttgacctcc tgttcacgga 1800 ttctccttca ctgatctcaa atgtcagtat cgactcacca ataatgctct ctgatcactc 1860 cacaataaga ttcaacttgg tcctgaatta tgcgagaact gcacgtcgat taaacaggcg 1920 aactactatt cttcagttcc ggaagtgtaa ctttgaagct ctgaacaacc atcttctgat 1980 cttcaactgg gctcgacaat tttcctactt ctcccgatgc gaaacgaaac tgatccattt 2040 cctaaaaata ttcaacgagc tgatacgtga attcacacct gctgccaaac tgactaatac 2100 aatatctccc aacttcaaga agaacttgcg aaagagagtc aaacagcgac ggctcaggag 2160 tccaccttcc gatcaaaaga agtatatcaa agctcgtcta cgctctatca agaagatgct 2220 tgcaaaagag gaaaatcgaa ttgttgagtc taaaaatccc agacaactgt tatcaatggt 2280 gaagaaacga acgtctactc cgtctcatgt gacttgtctc gtggtcaaag gacaattatc 2340 tacaaactca gtggctatcg cggatgaatt tctcaactct tttgccaaat ccttcacacc 2400 accctctgat ccgttcccag ccctgcctgc ccaaaagccc atcgcaattg atccggactt 2460 tacgccaata aatatatgca ggatcatcca gaagttacga cccaaaatcg gattctcgca 2520 ggataatatc aatttcttcg tcatcaagaa gtgtgtgcac tcgctctctg tacctctatc 2580 actcattttc tctgagtcgt atgcctctgg ccaatttccg gaaatttgga aatcttctat 2640 cattgtgcct gtgcataaaa aaggttgccg cacggatgct aacaattatc gaccgatctc 2700 tttgacacac ccactatcga gagtgttcga gaagttcata gttgagaagt taagaaagga 2760 atgcagctcc aaaatatcta aatcgcaatt tggcttcatg aattctcgct catgcaccct 2820 tgccctgctc aatgcctgct caaaaatcct cgactccttg acgatccgat caaagtacgt 2880 tgatgcgata tacttggact tcaaaaaagc cttcgatagc gtgccacaca acctactgct 2940 ctgcaagttg gaactattcg gcctggatgt caaaatgtgc aattggtttc gctctttcct 3000 cagcaatcgg acctcatcca taaaagtatg cgaccacgtt tcaaagaaca aacttgaggt 3060 gctgtctgga gtgcctcaag gctccgtctg tggaccgttc ctgtttttga tatatatcaa 3120 tgacttgctc ggtatgctcc ctcctgatgt tcaaatatca gcatttgctg acgacataaa 3180 gatatatggt gacaacagca attcaatcca aaagtctatc gatattgtca cggattggtg 3240 cagaaaatgg agtctcaact tggcagaaaa caagtctgta gtcgttcatt atggaaaaaa 3300 taatccgaag tttgtctaca ctgcgaatgg tatcatcatc gctaagaaga aatcagtgaa 3360 ggatctcggc atattcgttg acgacaaact aaatttccac ggccacatca cttatgtcac 3420 aaatgcagct ctactcaaat gccggcaact cctcaaggct tttcgctcaa caaatgccag 3480 cctatacttc aagctgtaca atatttatgt tcaaccaata ctcgattatg gatgcgaaat 3540 ctatagtccc acctcggggg ctctaatcaa acaactggaa aagcctctcc gtttcttcac 3600 caggctcgtc ttccaacgct gcaacataaa gtattcttca tacgaggatc gtttggccca 3660 agccaatttg aagtcagtgc aacataggcg ggtcttgcag atccttcgca cctaccacaa 3720 catcataact ggaaacttcc actacccaaa tgtgtcatcg tcggtgaaga aagctgtaac 3780 tccaagatac ccctacatgc tcagatctgt tggcgaaaca aacaaaggat tcctcagagt 3840 caacctcgcc acctggaacc gcctagcaaa gcaaatcccg gaaaaattaa atcgctctat 3900 gtttgcttcc cggttaaatt ctttccccct taatattctc attcccccaa cttgatctct 3960 caaactagta atggtttatg ggatcatttt atcatccacc aattcttccc tgtgattatt 4020 tgtatacgat ctccccttat atgttttgcc ttgaagacgt gaatgaatga tatatatata 4080 ta 4082 // ID RC123 repbase; DNA; CEL; 369 BP. XX AC L00588; XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE C.elegans satellite-like repetitive DNA. XX KW SAT; Satellite; Simple Repeat; CELREPS02; RC123; insertion site; KW Repeat region; satellite-related repeat; Transposon Tc2. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-369 RA Ruvolo R.V., La Volpe A., Naclerio G., Coulson A. RA and Levitt M.A.; RT "Tandem repeats and SINE-like elements associated with insertion RT sites of the Tc2 transposon of C. elegans."; RL Unpublished (1992). XX RN [2] RP 1-369 RA Naclerio G., Cangiano G., Coulson A., Levitt M.A., Ruvolo R.V. RA and La Volpe A.; RT "Molecular evolution of clusters of satellite-like DNA sequences RT in Caenorhabditis elegans."; RL J. Mol. Biol 226(1), 159-168 (1992). XX DR GenBank; L00588; Positions 409 777. XX SQ Sequence 369 BP; 94 A; 26 C; 122 G; 127 T; 0 other; agtttggggt tagtaggggg atatggtcgg ggtactgtag tagtactgta ggagtactgt 60 aggagtacgg taggattact gtagttttgg aaaaattgag tttttgcctt ttgaagagat 120 ataggtttgg agttattagt gggatatggt cggggtactg tagtagtgct gtaggagtac 180 tgtaggagta cggtaggatt actgtagttt aggaaaaatt aagtttttgc cttttgaaga 240 gatataggtt tggagttatt agtgggatat ggtcggggta ctgtagtagt actgtaggag 300 tactgtagga gtactatagg attactgtag tttgggaaaa attgactttc cgtctttgga 360 agggaaatt 369 // ID Vingi-1_CE repbase; DNA; CEL; 2938 BP. XX AC . XX DT 17-AUG-2010 (Rel. 15.08, Created) DT 17-AUG-2010 (Rel. 15.08, Last updated, Version 3) XX DE A family of Vingi non-LTR retrotransposons - consensus sequence. XX KW Vingi; Non-LTR Retrotransposon; Transposable Element; Vingi-1_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2938 RA Kojima K.K., Kapitonov V.V. and Jurka J.; RT "Recent Expansion of a New Ingi-Related Clade of Vingi non-LTR RT Retrotransposons in Hedgehogs."; RL Molecular Biology and Evolution 28(1), 17-20 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 79..2871 FT /product="Vingi-1_CE_1p" FT /note="includes endonuclease and reverse FT transcriptase domains and a CCHC zinc-finger FT motif." FT /translation="MEISNVYVPPRSSSSNHARLMTDFSNIFQTKSKSIIS FT GDVNAHHSAWHSEGSEDTRGRELAELIDLHPDLIIQNEQVHTRADTYSISS FT PDITICTADLATKCHWSTLYKLGSDHIPMKLKINQAATPKRPKRLVANFKK FT ANWQLFRDHIESRINGYSGPMDAIPMEKFFNKVILEADKKAVPKGSIDPKL FT KVDPKPLRLLAKEIADRETRNPLDPEIPILTRELATKRAERKNSEWRKKAH FT DQNAKRKSDVSSFWKLIKGVTKGRQYKGPVAFGDGLPKTGATIANKMAKSL FT VEAGRTKISHDEELKWRKNVRRNKRISKIAPLQDRITSIELHNMAIKMSPS FT AAIGPDGIAQIHIRKLPMCGYSFLAYLYNVSLSTNIIPDKWKKANVIMIPK FT PKKDPAIASSYRPISLLSPIAKLLEKAILKRIKNSIESPAHQHGFKPEHST FT TTAVIQVTNDIIGGLNMKNPPERAIMACLDLRAAFDKVPTKKLCNDLMEAQ FT GIEPKIKLWLGNYLHKRLIRTSHDNHWSKWHTLLGGVPQGSVISPNLFTFY FT LKDMPTQQDTMLISYADDMSIIARDKKIEKAAEKVQLHIDEIAKYLKERGM FT SISAEKSTVTVFSCDPKEHKTKPDIYWMDDPIPVINAPKLLGITLNTMTGT FT KDHVGNAIKSMQNKTRVIKSMAGTSWGNDRETMLYTTQALIKPTALYGAPA FT WTSLLSDTNLEKLEIAYRSALRACVGLTKDTPTDHIYQECRVLPLKEEYKL FT ATQQMYLAAIKSNTHPCRDLKAKGQLERTSKPPRIPPLKLSDSEKLRLDNI FT PGDKTETQKKNHTKLVADFIRNAPNNRILNKPPPRVNYRDERTLPKETRCE FT LARLRCGHSLLVEKYKARLEERQIKKCLNCGDEEGDVQHLLSCLQTPVPLE FT ELWTSPLTVAAALGLPTKTPFDPGGGH" XX SQ Sequence 2938 BP; 981 A; 802 C; 638 G; 517 T; 0 other; cgaacaaaat ggagatctcc aacgtgtacg tgcctccaag atcctcctcc tcaaaccacg 60 cccgactgat gacagattat ggagatctcc aacgtgtacg tgcctccaag atcctcctcc 120 tcaaaccacg cccgactgat gacagatttc tccaacatct tccaaacgaa atccaaatct 180 atcatatccg gggacgtaaa cgcccaccac tctgcctggc acagtgaagg gtcggaagac 240 accagaggta gagaactggc ggagctcatt gaccttcacc cagacctgat catccagaac 300 gaacaagttc acactcgcgc cgacacctac tccatctctt ctccggatat caccatctgc 360 accgcagacc tcgccacaaa atgccattgg tccactctct acaagctagg ctccgaccac 420 atcccaatga aattgaagat caaccaagca gcgaccccca aaagaccgaa gcgactggtg 480 gccaatttca agaaagccaa ctggcaactt ttccgggatc atattgaatc cagaatcaat 540 ggctactctg gcccaatgga tgccattcca atggaaaagt ttttcaacaa agtcatcctc 600 gaagccgaca agaaagcagt cccaaagggt tcaattgatc caaaactgaa agttgacccg 660 aaaccactca ggctcttggc gaaagagatc gcagaccgcg aaacaaggaa cccattggac 720 ccggaaattc ccattcttac cagggaactg gcgacaaaga gagcagaaag gaaaaattca 780 gaatggagaa agaaggctca cgaccaaaac gccaaacgga aaagcgacgt cagctccttc 840 tggaagctga taaaaggtgt tacgaaaggg agacagtaca aaggcccggt agctttcgga 900 gacggtctcc ccaagacagg agcaacaatt gcgaacaaaa tggcgaaaag cctagtcgaa 960 gccggtagaa ccaaaatatc gcatgatgag gaactgaaat ggagaaaaaa cgtacgacga 1020 aacaaaagaa tctccaaaat agcccctctc caagatcgga tcacaagcat tgaactccac 1080 aacatggcca ttaagatgag cccatcggca gcaatcggac cagacgggat tgcacagatc 1140 cacatcagga agctcccgat gtgtggctac tcgttcctcg catacctgta caatgtctcg 1200 ctatccacaa acatcatccc agacaaatgg aagaaggcaa acgtgatcat gataccgaaa 1260 cctaaaaagg accctgcaat tgccagctca tacagaccga tctccttgct ctctccgatt 1320 gcaaagctcc tcgaaaaagc catcctgaag cggatcaaga acagcattga atcgcctgct 1380 catcaacatg gattcaaacc tgagcactcg acaaccacag cggtgatcca agtaacaaac 1440 gacataatcg gaggcctaaa catgaagaat ccaccggaac gtgcgataat ggcctgtctc 1500 gatctgcgtg ccgcatttga caaagtcccg acgaagaaac tgtgcaatga cctgatggaa 1560 gcccaaggca tcgaaccgaa gatcaaactc tggctcggaa attaccttca caagagactg 1620 atcagaacaa gtcacgacaa ccattggtca aaatggcata cactcttggg aggagtcccg 1680 caaggttcag tcatcagccc aaacttgttc accttttatt tgaaagacat gccgactcaa 1740 caagacacaa tgctgatctc atacgcggac gatatgtcaa tcattgccag agacaagaag 1800 attgaaaaag cagccgaaaa ggttcagcta cacattgacg aaattgccaa atatctcaaa 1860 gaacgtggaa tgtccatctc cgccgaaaaa tctacagtca cagttttctc atgtgatcca 1920 aaggagcaca aaaccaagcc agacatctac tggatggacg atccaatccc agtgattaac 1980 gccccaaaac tcctgggtat cacactgaac actatgaccg gaacaaaaga tcacgttgga 2040 aatgccatca aaagcatgca gaacaaaaca agagtgataa aatcgatggc cggaacgagc 2100 tggggaaacg atcgcgaaac gatgctatac acgacgcaag ccctaatcaa accaacagct 2160 ttgtacggag caccagcttg gacttcactg ctctcagata cgaatctcga aaaactcgaa 2220 atcgcgtaca ggagtgcact cagagcctgt gttggactaa cgaaggacac gccaacggac 2280 cacatctacc aagagtgcag agttctccct ttgaaagaag aatataagct ggcgacgcaa 2340 cagatgtact tggcagcgat caagtcgaac acgcacccgt gcagagacct gaaagcaaag 2400 ggccaactgg aacgaacctc aaaaccgcca aggatcccgc cgctgaagct atcggactct 2460 gaaaagctca gactcgacaa catcccaggc gacaagacgg agacccaaaa gaagaatcac 2520 acgaagctgg tagcggactt catcaggaat gcgccaaaca acagaatcct caacaaacct 2580 ccaccaagag tgaattacag ggacgaaagg acgctgccaa aggagactcg atgtgaactg 2640 gcacggctaa gatgcggtca ctcactcctc gtcgagaagt acaaagcccg gctggaagaa 2700 aggcaaataa agaaatgttt gaattgtgga gatgaagaag gagatgtgca acacctacta 2760 agttgcctcc aaactccagt ccccctagag gaactctgga cttctccact cacggtggcg 2820 gcagccttag ggctccccac aaaaactcca ttcgaccctg gaggaggtca ttagctatgc 2880 gagacatagc tggctgctgc gcagccattg ttgttgcaca acaacaacaa caacaaca 2938 // ID CER16-2-LTR_CE repbase; DNA; CEL; 492 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER16-2-LTR_CE is a long terminal repeat of the CER16-2_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER16-2-I_CE; KW CER16-2-LTR_CE; CER16-2_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER16-2_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (MAY-2003). XX DR [3] (Consensus) XX SQ Sequence 492 BP; 109 A; 151 C; 80 G; 152 T; 0 other; gatataggcg cttggtcagg ttcacccacg aacccacaat caccatccgc atgagtcgct 60 cactctctgt tccattcgtt cactcagaaa catttcattc tttaattaat taaatattat 120 tatttgaatt agcgccattt gcgtatttcc taatgcaagt gcgccccccc cgcacacttt 180 cgttattgcg gagcgtcgta ctcccgtaaa tccacacaaa tatttagatg cctttttagg 240 ccaattctat ttagtcttca cacgtttcac cacggatgcc acccgtcccc acgtcatccg 300 tcagtttccc gttgatattt ccatttattt attcataata aacttggatt tatctaaaat 360 gtgtactgtt attccctctg caccgctcat cgtcatccgc cgctcgtcca agctccgtgc 420 cctccgcgcg aacacacatt tcgccgttct cacccgttgg gtctgactcc ccagcgactg 480 cgggtgtcaa ca 492 // ID RC14 repbase; DNA; CEL; 169 BP. XX AC L00589; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 01-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE Caenorhabditis Rc14 repetitive element. XX KW RC14; Repetitive element. XX NM RC14. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-169 RA Ruvolo R.V., La Volpe A., Naclerio G., Coulson A. RA and Levitt M.A.; RT "Tandem repeats and SINE-like elements associated with insertion RT sites of the Tc2 transposon of C. elegans."; RL Unpublished (1992). XX RN [2] RP 1-169 RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and La Volpe A.; RT "Molecular and genomic organization of clusters of repetitive DNA RT sequences in Caenorhabditis elegans."; RL J Mol Biol 226(1), 159-168 (1992). XX DR GenBank; L00589; Positions 104 272. XX SQ Sequence 169 BP; 38 A; 55 C; 19 G; 57 T; 0 other; ctcccatgct tacatttccc atgcctacaa ttcccatgct tacacatgtt cccgtacata 60 caaatgttcc catgcttaca ctcagtagta acagtgttcc catgcgttca gctcattttt 120 tctcgttccc atgctacatt tcccatgctt acacgtattc ccgtacata 169 // ID NPAL0A_CE repbase; DNA; CEL; 286 BP. XX AC . XX DT 01-SEP-1999 (Rel. 4.08, Created) DT 01-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE NPAL0A_CE is a putative non-autonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; MITE; KW NPAL0A_CE; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-286 RA Kapitonov V.V. and Jurka J.; RT "NPAL0A_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC NPAL0A_CE is a putative non-autonomous DNA transposon. CC It does not generate identifiable target-site duplications, CC unless it has strong CTAG target-site specificity and induces CC a duplication of the central TA. CC NPAL0A_CE is flanked by 118 bp-long terminal inverted repeats (11 CC mismatches only) and it forms non-perfect palindrome with a 50 CC bp-long central loop. CC There are about 20-30 copies of NPAL0A_CE fixed in the C.elegans CC genome very recently since they are ~98% identical to the CC consensus CC sequence. XX SQ Sequence 286 BP; 59 A; 83 C; 87 G; 57 T; 0 other; ctactccgca aatcgctcct cctggagcgc ggacggggcg cgactagatt acgggttcgc 60 gctccagtca cacgtttttt cgcgctcctt ccgcgccacg ttggcgctct gatcgcgtcc 120 cataaaattt cgtaagaatt tttcaaattc ggaatgcgcg cggagcaaac gcgattggag 180 cgcggacgtg gcgcgtaagg agcgcgaaaa aaagggtgac tggagcgcga acccgtgatc 240 tagtcgcgcc ccgtccgcgc tccaggacga gagatttgcc gagtag 286 // ID CELE12A repbase; DNA; CEL; 368 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE CELE12A nonautonomous Tc2-related DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE12A; TC2; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-368 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification of putative nonautonomous transposable elements RT associated with several transposon families in Caenorhabditis RT elegans."; RL J. Mol. Evol 43(1), 11-18 (1996). XX DR [1] (Consensus) XX SQ Sequence 368 BP; 119 A; 64 C; 66 G; 119 T; 0 other; taccgtattt cctctatcag tcttgcatgc aagactaatt ttcaattgac ccgtaggggt 60 gcaagactaa tagagactgc aagactatta gaggctgcaa gactaatttt cgaaggcccc 120 ataaccttgt aaaattggcc atttatcgtc aaaaatttga cttgaaacag tctaattcgt 180 gcaaaataca tacttttcta ttgttttatc tattatttga acactttaaa atttaattta 240 tctaaaaaat ggcgaaattg ttggagtttt cacaaaatac tgctggtctg gcaagaatag 300 gggtgcaaga ctattaccgt atttcctcta atagtcttgc atgcaagact aataggggaa 360 atacggta 368 // ID HAT4_CE repbase; DNA; CEL; 592 BP. XX AC . XX DT 14-JUN-2000 (Rel. 5.05, Created) DT 14-JUN-2000 (Rel. 5.05, Last updated, Version 1) XX DE HAT4_CE is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT superfamily; HAT4_CE; TIRs; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-592 RA Kapitonov V.V. and Jurka J.; RT "HAT4_CE."; RL Direct Submission to Repbase Update (JUN-2000). XX DR [1] (Consensus) XX CC HAT4_CE is a putative non-autonomous DNA transposon. CC It generates 8-bp duplications of target sites upon CC integration in the genome. HAT4_CE has perfect 12-bp CC terminal inverted repeats. CC The length of the target site duplications indicates CC that transposase encoded by the autonomous HAT4_CE-like CC element is related to the HAT superfamily. CC There are about 70 copies of HAT4_CE per haploid genome. CC HAT4_CE individual copies are, on average, 92% identical CC to the consensus sequence. However, this family is composed CC of several minor subfamilies; and there is >95% identity between CC repeats and consensus sequences which belong to the same CC subfamily. XX SQ Sequence 592 BP; 123 A; 123 C; 123 G; 219 T; 4 other; caagggtgtc aaktcccgtg tcccgtttgt cccgttgtcc cgttttttgg gtgttttcac 60 gggaacggga mttcccgttg tcccgttttt gaaattttca cgggaacggg aattcccgtt 120 gtcccgtttt traaattttc acgggaacgg gacattttcc ctcatttttt ggctttttaa 180 atatctgttt aatattgaaa aaatcgataa aaatgtttat aatgcagttt tttttatata 240 atttgcctcg aatacactaa aatttacgcc aactatgcac tctttttttt aaaccgccca 300 gtttgtcccg ttttttgagt gttttcaaaa aaacgggaat tcccgttgtc ccgtttttga 360 aattttcacg ggaacgggaa ttcccgttgt cccgtttttg aaattttcac gggaacggga 420 attcccgttg tcccgttttt gwaattttca cgggaacggg atgtcccgtt gtcccgtttt 480 ttgagttttc acgggaacgg gaagtcccgt tgtcccgttt ttaaaatttt cacgggaacg 540 ggaattcccg ttgtcccgtt tttgtaattt ttacgggaca ttgacaccct tg 592 // ID CER3-I_CE repbase; DNA; CEL; 7872 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER3-I_CE is an internal portion of the CER3_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER3-I_CE; CER3-LTR_CE; KW CER3_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER3_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 4862..5401 CC /note="predicted Integrase region" CC gene 2406..2723 CC /note="predicted Protease region" CC gene 4168..4521 CC /note="predicted RNase region" CC gene 3361..3879 CC /note="predicted Reverse Transcriptase region" CC gene 2013..2213 CC /note="predicted Gag region". XX SQ Sequence 7872 BP; 2337 A; 1717 C; 1787 G; 2031 T; 0 other; ggtagctagc gagtgaaccg aatttcgcaa catttcgaaa atcctaaaat tatttatcgt 60 ttggaattct aaatcctaaa atttttctcg aattttcgat gcctaaaatt caaatttaaa 120 atttaatcct aaaatttaaa attaattttt aattttctgg cctaaaattt ttcttttctc 180 tttttcgctt cgctttttcg ttttgatttt tcgcttttat ttttcttctc gattattctt 240 ttcagcgccc ttttcgtttt gtagcttttc ggtatatacg atccaccacc cgtccctatt 300 atttcgttta agttcccttt attagtttag tttgtaggta gctttagcat ttacatttag 360 ccttcgtaga gaagggccct ctctttttag gtctaaattg tctgtaagta atttggtagt 420 agcgcttgtg aggcttagca ttaacacaac acttaagtca agcctaactg ggattacggg 480 gttttattcc tatttctcca aaaaaaggtc tgtcctctca cgcgctccct tactgaggtc 540 taggtgaggt gtttgtagga aggtgcgcac cagtttagtt ccgctaaagc gaacggccta 600 gtagtctagc ttctagtcag tactcttaac ttcaaactcg taggtttagt taagttttcg 660 aggttccgta accatgtccc tttaggaaag tcttatcttt aaatgatcac tcgtctcttc 720 gtttcctcaa ttacgctagt acacctagga ttataggtag ttcaagtatg tcccgcaacc 780 cgcagttcaa tacaccccca cccaatgctc cagatggcgc ttatcgctcg aacagtcgtt 840 tgattgatgg ctccgtcgtc accactgctt atctcgtttc gctcaaagct ggtaccgtca 900 aggatgtcaa ctcgttgtat ggttccatcg tggagaacaa ggcaacagtg aaagcagtcg 960 acgataaggt cgtggtcatt ggcaacaagt gtgatgatgt cagcgcaaag cttgcagctt 1020 tgcaaaatcg aattcgcgag ctcgaggaca tggtcaagcg tcaaggtgag gagagtgctc 1080 aggctggtga ggagcgtgct ggaggtcgcg gtacctctgg cccttctcag gtagctggcc 1140 tgagcctagg taggccactt ttaaattcta atttaacccc aagagacgtg ttgtctggaa 1200 gcagtgcagg tagtaacgct actgactctt ctggaagtgg atccccgatg ccatcgagtc 1260 tccagaatcg tgacgtcgtc gaagaacaat cggaagatca gcaactcaat agtggtgagc 1320 ttgcgggaga tcaagatgat ctagaagaag caacaaattc aattcgtgga atttcgcctt 1380 caattggagt gatcggtttc aagacagccg ccactttgcc tccgttcaat ggaaacttga 1440 ctgaagattt cacaaccttc gtgcgaaagt tcaaggatcg gctagcagcg tctacagaga 1500 tggacaacga tcaaaagagg tctacattcc ttttattttt ggacaatcgc gcgcggagta 1560 cggccgatgg aataattgga gcacagcctg cgattacact cgaggatctt atcacaaaga 1620 tgtcagcagt gttcgagaat gcagccgtgc aggaacatag agaagagcag ttgagatatt 1680 gcacacagca aaagggtgag tcggttgaac ttttctttca gagaatcgat aagctgcaaa 1740 gacttgcgag tactggcccc aggagtgaag agatgaagaa ggtcgcgttt cgtgctttcc 1800 tcgaaggact cggtgaacat ctcaactacg ctattcgatc gaggcgcccc aaggatatgg 1860 cggcggcgct cgatgaagca ctacatgaag aaatgcttgt caatgcgaat gtggctcgga 1920 gagcagtacc aacagcagcg gcatacatgg catcaatcgg gaatcaaggc aagaagcagc 1980 agcagcagaa gcagtcgcag tcgccgaagt tccttggaga atggcattac tgtcacaaga 2040 agggtcacaa actcgcagat tgtcggaaga gagcagctga tcgtggtgtg gcaaaatcct 2100 cagtgcaaaa tcccgtggag catcatcaag tcaagcaggc agcagcagtt tcgggcacag 2160 aaaattggca acttctggaa cagaacacgg ctctgcaagc gcaagttcga gaccttgcag 2220 gtcggcttga gcgtttccaa cctagctttg acggatcttc ggggagcaac gtttgcagtc 2280 tgacttggtc gaagagtaag tcttttactt tcgaggatgc cactcctcaa cagaattctc 2340 aaagtaattc ttttatcacc gctcacattc cagttaaagc taatcagata ccgtattccg 2400 ctcttgtgga tacgggtgca aacattacag taacttcgaa agatcttttg aaaggtcttg 2460 gcattagcag aactttaccg atgctcaatg cgcaagctgt tggccttgga ggtcacacag 2520 tggatatggt tggatcagca gtcgttcaat tcaagattgg aactcacacg atcgaccatc 2580 gggtacactt cacaactgga cgttgcatgc cggaaatcaa tggcgaatat caattcatta 2640 ttggaaacga tatgctctcg aagcttccaa tgttcgtgtt caactacaag gatgctcttt 2700 tccacatcgg agatgacgtt cttcctctcg gacaatcatt cggacaacct tcaaagccaa 2760 ggaagttcaa cgtcaaagtg tgtgaggata ctatcattcc agctggaacg gaaaaatacg 2820 tcaaatgctc cgtcagtcct caaccgaagg atcgagaagt tattttgatg atcgactcag 2880 tttcaagtgg aatcgataat ttgtttgtgt cacccgcggt agtcatggct caggatgcag 2940 tcattttggt gtccaaccca acggatcagg acatgacgct caacaattcg gaggttgctg 3000 cgtcggctag tcaggtaacg caacatgacg gagtcttgtg ttgtctttga cagtgttgct 3060 cactgcagct ctccaaagat taacacagtc gagaacgcag tttctcaaaa ctctttttct 3120 atagacctct ccagtgttaa tatctctagc aatgagagag ctcagttaca agctctcatc 3180 gatgaatatc atgacgtatt ttcgaaaaat gcgtatgatt tgggtagttc gaaaactgac 3240 ccagtgcata tttatacaac tacagaagta ccggtaagaa gtcgagctta tcggacgcca 3300 gttagatatc aagctgagct tgagaagcac atcaactcac ttattcagtc aggacgaatc 3360 gtcgaaagca acacgccttg gaccagccca atcgtgctta taaagaagaa aaacggatca 3420 ctcagagttt gtttggattt ccgtaagcta aatgaggtca cagttcccga taactatccc 3480 ttacctagaa tagattctat tattgagcaa ataggaggtt ctaaatattt tacatcttta 3540 gacatggcaa atggatactt gcagctgaga ttggatgcag aatcatcaga caaatgtggt 3600 ttcatcactg agaacaaggt ctacgcctac acacatctac catttggact acggtcagca 3660 gcgtcatact tccagagaga tttgaaacaa gttctatagg gattggagaa ggaagtaacg 3720 gtctacatcg acgacgtgct gatcttcagc aagacgtttg aggagcattt aaactcgttg 3780 agaaaagtgc tcgagaggtt caggaagttt aatctaaaag tttcaccgaa caaatgcgaa 3840 tttttcaaac aatcaattac atttttaggt cacgagatca acgctgacaa ctacaagccg 3900 aatcgcatca acgtcgatgc aatcgtcaac ctgccaacac caagaaacgt cggagacgtc 3960 agaagattca tcgggatgtc aggattcttc agaaaattcc ttccaaattt ctcggagatt 4020 gctgagccgc tgacaaggtt cacaaggaaa ggacacaagt tcgtgtggaa agctgagcaa 4080 caaaaagccg tggacactct gaagcaagct ttaattagta agccaatttt agtttttcca 4140 gattatgaca aagagtttca catttttact gacgctagtg ctgtagctca gggggcagtg 4200 ctcatgcaac aaatggaaga tagtactaag gactatgcag cgatagctta tactagtaga 4260 accctttcag ataccgaatc tagatggccc gcaattcaga cagaattagg cgctattatt 4320 ttcgcgttaa gacagtttag gccttacata ggacagtcta gaactacgat tcatagcgat 4380 catagacccc ccatgtactt gcttggtaag agcaaagtta atgataactt ggctcgttgg 4440 ctcatcgagt tggctcaata tgatacgagg attgtgcaca tcgatggcaa gaagaacaca 4500 gtagctgatt gtttatcacg tgcaaaggat gaagttgcac ctctggatgg agtcgagatg 4560 gaagacatca ttagttttcc agtctgtatg ccaatacata tgtctaatat tcaagcttcg 4620 atggcattca ccccggttgg gacgctcgtt tggatttgct cgaggagcaa gagaaggatc 4680 cgatcctgaa atgcatcaag gacttccgac gccagcaacg ccgatcgaca tcgttccttc 4740 gacatgggca ggtgtgctgg agcacatcaa gcttactgag tctgggttgc tcgccgtagc 4800 aacccccggt agccgcccta aaacagttat tccccaacaa cttaaatcat tgattttaaa 4860 atcgttccac gagagtttac tcggtggagg acactttgga tataggaaaa cgttgcacaa 4920 aatcaaacgc aattattatt ggccaaatat gaggtcggac gtgctcaagt ggacgttgca 4980 gtgcaagatt tgtcaacaga agagaaaccc tcatccatct acaagagaac tacagaagat 5040 cgtaatcacg acgaaagtat tcgagaaggt tggagtggac ctgacaggcc cactgagaat 5100 gacagcttca ggtaacaagt agtacatggt ttgttggttc acaaaatttg taatttctgt 5160 gccgttacct aatgcgtcaa cagaaacagt tgctgatgca atgatgaaag agttactttt 5220 aaagttcgga acaccaagtc aactcgtttc agatagagca tcaaccttca catccgaggc 5280 attccgagcc ttctgcagga agctcgagat ccagcaacac ttggcaattc cgtatcacag 5340 caaaggaaat ggtgcaacgg agaggacgtt tcgaacattc cacaatatgg tttcgaaaca 5400 cgtgaataaa actcacacgg attgggatca gatactaccg tatatgacgt tcgtctacaa 5460 cacgacagtt cacgacacga caggagagac gccattcttc ctgatttttg gtcgagaccc 5520 cgtgtttgca atcgataaaa tcatgcatcc atccccacca aaggaaggag aagaagtcga 5580 cattccggct tggaaagagc acctcatcac aactctacgt ttggctcgaa aggaggctgc 5640 tgaaagatcc ctgaaagaac aagaagctcg acagaaagtg gcaaacgttg gagctaaagg 5700 atcgaagatc gtggttggag atcgagtctt cttccaaaat cacaaatcga aagcgaattt 5760 gtcgcggaaa atggtacttc catggatcgg agagttcgaa gtcattagca tcgatcatcc 5820 caaagcagtc atcaaggact tggaacatcc atcaaaacct gaaaggacag tgcacctgaa 5880 tcaaatcaaa aagattctag aaagagaaga tccaggcgcc gcactcaaag attcggacga 5940 ttccatctcc gaagtagaag caatcaaaga ggagcttgct acagctcaac ctgaaattaa 6000 tcaggccccc acccatcaaa ttgaagataa tatcgtcgag gatatttgca ttccacatgc 6060 acaagaggtc gaaacgagga aaaacccgtc acgtcagcgc aggagaccga agagatttga 6120 ggactgagtt aaaaaatgtt ttttatttat ttttaatcca aaaattcatt ttctacttca 6180 tttaggcgta attttccaaa ttagcgcgca tttaatttaa aggcgcagta ggtgacgtag 6240 gctagtggtc ccgccacgaa aaaggtacag taacccgtag gaggcattgc tcactcttaa 6300 attcaaattt taggtgtttt tgtcatcatt ttacctttat ttacccgttt ccactcattt 6360 ttcgtagttt aattacataa ttcttgcagt ttagcatatt ttaggccgcg tttatgtttt 6420 tcacccgttc taagcttaaa tttggttagt ttttggcttt aacactagtt taaacaattc 6480 ttagccctaa aatcttgctt agtttagcaa atacattagt gtaatctcac aaaatgcgta 6540 acgtatacct tagagtttca ttttcggaat gtttcgtctt cgcgtactta aaaggtttgt 6600 ttttcagagc acatcgccat gggcagccgg caaagcagta tgactactcg tttgcaggcg 6660 gagatcggag gaacaagctc ggaaggtgcg acagcggtac gaggacgtag ctcctcaggc 6720 catgtcgccg cacaagagtc ttcgatcgaa gcacaccaca ctccacgcaa taatcttaat 6780 attaaatatc tcgcattgtt acaattagtt ggacaaattg tcaacggaaa tgccgatgaa 6840 gcacaaagaa gagaagacga tgacgccttc ccattcgagc tcgtcgtaca tgaagcagag 6900 atggaggtcg aagaagtcga ctcaccacag gattctccag atgaattcga ggaagatatt 6960 gcggaagaag tcaaggataa atcggcatcg gaagaggaag aagaagatga acatcaagtt 7020 ttccagttcg aaaatccacg agatttagaa gatcctctgc gaagaagagt cccgtacgag 7080 atccacttcc cattccgaga gccaccacaa gaagatgcac caatgagaat ggctcaaaat 7140 ccagtcgacc aggaagccat cgatgatatc gatcaaggac tcatcgatag atatcatcaa 7200 ggattggtcg cggcagatcc gcatcaacga ttcgtgacac cactcacgag agccgcaaga 7260 gaatatttga gagctcaagt cgactaccag gagcgcagag ctactcacca tgaaggatgg 7320 acgcagttcc tcgaagcagt accagagttc gggagagctc gccactacta catcgacgag 7380 agaagtcgat caatatttaa tcgatgcatg gagaagagga cgtcgtgcag atcttgaaga 7440 cgccgagttt cgcctaccat tggaaagaga tgaaagatat tggctcttca accagattaa 7500 gaatcaacgc cgagagctgt tcgaagccgc ccaatggaga gctcaccatg cacatcaccg 7560 tcgcaacgaa caagagcaat agctcgcagc agctacaaca attttattga agaacaaaag 7620 aagaaagaaa aaaaaaaaaa aagaacaaca acaacacttc gttttggctc atccagtcga 7680 gtcacatcat catcaaggat ccttcgagct ggataaagca actgaaaagg tgaagtttcg 7740 ggtgaatcct taggaagttt ttactaaaag tcccaaaaac tttacgcttt tccaaagcaa 7800 tctttcgtac aaccctttag tcattttgaa attgaggaga tcagggacga tcccttttca 7860 aaagggggga gg 7872 // ID HAT1_CE repbase; DNA; CEL; 495 BP. XX AC . XX DT 09-JUN-1999 (Rel. 4.05, Created) DT 09-JUN-1999 (Rel. 4.05, Last updated, Version 1) XX DE HAT1_CE nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; CELHAT1; KW hAT superfamily; HAT1_CE; TIRs; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-495 RA Kapitonov V.V. and Jurka J.; RT "HAT1_CE."; RL Direct Submission to Repbase Update (JUN-1999). XX DR [1] (Consensus) XX CC HAT1_CE is a putative non-autonomous DNA transposon; CC it generates duplication of 8 bp-long target site upon CC integration in the genome. HAT1_CE has 18 and 140 bp-long CC perfect and non-perfect TIRs, respectively. CC The length of the target site duplications indicates CC that transposase encoded by the autonomous HAT1_CE-like CC element is related to the HAT superfamily. CC HAT1_CE individual copies are, on average, 85% identical CC with the consensus sequence. CC (Oct. 1999 - name was changed from CELHAT1 to HAT1_CE). XX SQ Sequence 495 BP; 142 A; 102 C; 98 G; 153 T; 0 other; caggggtgtg cggcaaatct caaaatttgc cgagctcggc aaattcggca aatctctttt 60 ttcaatattt gccgagcacg gcaaattcgg caaatttgcc gagctcggca aattcggcaa 120 attcggcaaa tttgccgtgc ttaacaaact cggaaaaatt tgatattttt tgatgttttt 180 tggagcacca aaactactga attcttaaca cacatctggt ttctgaataa gttccgtgta 240 gtatgtctgc ttaagcatca aaataactca attttgtgtc attttactaa atttttggcg 300 aaaaaatcaa tagttttagt caaaattgta ctgtcaaatt tttgacgtgt gcggcaaatt 360 tcgaaatttg ccgagctcgg caaattcggc aaatctactt ttttgaaatt tgccgtgctc 420 ggcaaattcg gcaaatttgc cgagctcggc aaacggcaaa ttcggcaaat tcggcaaatt 480 tgccgcacac ccctg 495 // ID RC35S repbase; DNA; CEL; 210 BP. XX AC . XX DT 27-JUL-1999 (Rel. 4.06, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 1) XX DE C.elegans satellite-like repetitive DNA (a consensus). XX KW SAT; Satellite; Simple Repeat; RC35S; Repetitive DNA. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and La Volpe A.; RT "Molecular and genomic organization of clusters of repetitive DNA RT sequences in Caenorhabditis elegans."; RL J. Mol. Biol 226(1), 159-168 (1992). XX RN [2] RA La Volpe A.; RT "A repetitive DNA family, conserved throughout the evolution of RT free-living nematodes."; RL J. Mol. Evol 39(5), 473-477 (1994). XX RN [3] RP 1-210 RA Jurka J.; RT "RC35S."; RL Direct Submission to Repbase Update (JUL-1999). XX DR [3] (Consensus) XX SQ Sequence 210 BP; 75 A; 9 C; 42 G; 78 T; 6 other; aawattttgg mgggaaattt aaattttctg tgaaaaatat tttggcggga aatttaaatt 60 ttctgagaaa aawattttgg mgggaaattt aaattttctg tgaaaaatat tttggcggga 120 aatttaaatt ttctgagaaa aawattttgg mgggaaattt aaattttctg tgaaaaatat 180 tttggcggga aatttaaatt ttctgagaaa 210 // ID CELE2 repbase; DNA; CEL; 325 BP. XX AC . XX DT 20-SEP-1999 (Rel. 4.08, Created) DT 20-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE CELE2 nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE2; MITE; KW nonautonomous DNA transposons; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 26-300 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification and characterization of putative transposable DNA RT elements in solanaceous plants and Caenorhabditis elegans."; RL Proc. Natl. Acad. Sci. U.S.A 92(19), 8886-8890 (1995). XX CC CELE2 is a palindrome. Its copies are ~80% identical to the CC consensus sequence and they are flanked by duplications of CC TA target sites. XX SQ Sequence 325 BP; 103 A; 59 C; 59 G; 104 T; 0 other; tattacccag acgcgaaatt ttgcgatttt tgcccaaaaa tacggtaccc ggtctcgaca 60 cgacaaattt ttgttaaatg caaaaaggtg tgcgccttta aagagtactg taatttcaaa 120 cttttgttgc tgcggaattt tcattgattt ttcaaatcaa gtttttaata aataaaatcg 180 atgaaaattc cgcagcaacg aaagtttgaa attacagtac tctttaaagg cgcacacctt 240 tttgcattta acaaaaattt gtcgtgtcga gaccgggtac cgtatttttg gcgcaaaaat 300 cgcaaaattt cgcgtctggg taata 325 // ID CER8-LTR_CE repbase; DNA; CEL; 559 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER8-LTR_CE is a long terminal repeat of the CER8_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER8-I_CE; CER8-LTR_CE; KW CER8_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER8_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 559 BP; 122 A; 174 C; 95 G; 168 T; 0 other; tgttgtgaga tgggctgggt tcccatccca tctctcaaat tagtgacctt ttgaggatgt 60 ctccatgact ctatcatgat cgccatacct caaaacctct aatacaattc cattcacccg 120 cgtttcttcc gttaactcgt ctccagtacg cttcctcatt tgaattccct ccaattggtc 180 tcgccgcgcc cttaaagctc cccccttttt gtataattat ccctggccta aaatgtaagc 240 cgctccgctc acttgctact ccaacaacac aagtcatcaa cgtccacctc atctggctcc 300 acacattcca ccatcaatgt cggcctcgtt ggatgtcggc cgctccaccg tgagcgaccg 360 actctggacc tatggaccct aattacttgt tattctatga gcaataaagg ggttttgtgg 420 catacaactg tgttctctta ttcacatcgc ttatctcttc ttcctcgatg gcactccgca 480 accgccagta tctctctaga cgcttaatgt atttgacgtc agaggtgttt ccccaacgca 540 ctaatctgag ctgtcaaca 559 // ID TC1 repbase; DNA; CEL; 1610 BP. XX AC K01135; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE C.elegans transposable element TC1. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW Repetitive sequence; TC1; transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1610 RA Rosenzweig B., Liao W.L. and Hirsh D.; RT "Sequence of the C. elegans transposable element Tc1."; RL Nucleic Acids Res 11, 4201-4209 (1983). XX DR GenBank; K01135; Positions 46 1655. XX SQ Sequence 1610 BP; 458 A; 312 C; 337 G; 503 T; 0 other; cagtgctggc caaaaagata tccacttttg gttttttgtg tgtaactttt ttctcaagca 60 tccatttgac ttgaattttt ccgtgtgcat aaagcgaaat gttacgcaaa tttgcggacc 120 aaacattaca tgattatcga ttttttctga attttatttc aattttttga ttttttcgtt 180 tttccaattt tcattatttt ttttgaatta tcaataaaac gcactctgtt tgttgcactg 240 gatttgtttg gttgataaat tatttttaag gtatggtaaa atctgttggg tgtaaaaatc 300 tttccttgga cgtcaagaaa gccattgtag ctggcttcga acaaggaata cccacgaaaa 360 gctcgcgctg caaattcaac gttctccgtc gactatttgg aaagtaatca agaagtacca 420 aactgaggtg agttcgaaaa atattatttt ttaataataa atgtttagaa atccgtcgct 480 ttgagaatct cgcccggcag gcctcgagtg acaacccata ggatggatcg caacatcctc 540 cgatcagcaa gagaagatcc gcataggacc gccacggata ttcaaatgat tataagttct 600 ccaaatgaac ctgtaccaag taaacgaact gttcgtcgac gtttacagca agcaggacta 660 cacggacgaa agccagtcaa gaaaccgttc atcagtaaga aaaatcgcat ggctcgagtt 720 gcgtgggcaa aagcgcatct tcgttgggga cgtcaggaat gggctaaaca catctggtct 780 gacgaaagca agttcaattt gttcgggagt gatggaaatt cctgggtacg tcgtcctgtt 840 ggctctaggt actctccaaa gtatcaatgc ccaaccgtta agcatggagg tgggagcgtc 900 atggtgtggg ggtgcttcac cagcacttcc atgggcccac taaggagaat ccaaagcatt 960 atggatcgtt ttcaatacga aaacatcttt gaaactacaa tgcgaccctg ggcacttcaa 1020 aatgtgggcc gtggcttcgt gtttcagcag gataacgatc ctaagcatac ttctcttcat 1080 gtgcgttcat ggtttcaacg tcgtcatgtg catttgctcg attggccaag tcagtctccg 1140 gacttgaatc caatagagca tttgtgggaa gagttggaaa gacgtcttgg aggtattcgg 1200 gcttcaaatg cagatgccaa attcaaccag ttggaaaacg cttggaaagc tatccccatg 1260 tcagttattc acaagctgat cgactcgatg ccacgtcgtt gtcaagctgt tattgatgca 1320 aacggatacg cgacaaagta ttaagcataa ttatgttgtt tttaaatcca attgctcata 1380 ttccggtact ttaattgtca tttccttgca acctcggttt tttcaatatt tctagttttt 1440 cgattttttt gaatttttct gaagtttttt caaaatctgt tgaacatttt tgatgaatat 1500 tgtgttttta gattttgtga acactgtggt gaagtttcaa aacaaaataa ccacttagaa 1560 aaaagttaca cacaaaaaac caaaagtgga tatctttttg gccagcactg 1610 // ID TIR9TA1B_CE repbase; DNA; CEL; 274 BP. XX AC . XX DT 15-JUN-2000 (Rel. 5.05, Created) DT 15-JUN-2000 (Rel. 5.05, Last updated, Version 1) XX DE TIR8TA1B_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TIR9TA1B_CE; KW nonautonomous DNA transposon; putative TA target site. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-274 RA Kapitonov V.V. and Jurka J.; RT "TIR9TA1B_CE."; RL Direct Submission to Repbase Update (JUN-2000). XX DR [1] (Consensus) XX CC TIR9TA1B_CE is a putative non-autonomous DNA transposon; it CC has perfect 9-bp terminal inverted repeats and putatively CC generates duplications of TA target sites upon its integration CC in the genome. CC There are ~100 copies of TIR8TA1B_CE in the genome ~88% identical CC to CC the TIR9TA1B_CE consensus sequence. CC TIR9TA1B_CE is 72% identical to TIR9TA1_CE consensus CC sequence (69% to TIR9TA1A_CE). XX SQ Sequence 274 BP; 53 A; 90 C; 86 G; 45 T; 0 other; tagggcttcc atgtaggcgc caaaacgcct gccgcgcccg ccttgcggcg acctccgcct 60 gccgcctcga gccgcgcctt ctgtatagtg cggcgcggaa cccgaaaagt gtcggccgcg 120 gcgaaagaac cacctttcgc actatgtggt gtaaacacca agcctttgac ttctcgccat 180 agtgcagaaa tgaggcgaga ggccggcgtg aggcgggcag gttggaggca ggcgtcaggc 240 cctgaaaccg cgcctgcctc ccatggaagc ccta 274 // ID CER17-I_CE repbase; DNA; CEL; 3820 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER17-I_CE is an internal portion of the CER17_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER17-I_CE; KW CER17-LTR_CE; CER17_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER17_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 3101..3778 CC /note="predicted Reverse Transcriptase region" CC gene 1517..1750 CC /note="predicted Gag region" CC gene 2261..2575 CC /note="predicted Protease region". XX SQ Sequence 3820 BP; 1366 A; 956 C; 573 G; 925 T; 0 other; tggcgaccac taaataaaaa gatcctgcaa tcttcaaacc aacaaagatg agtctaatcc 60 ccagacaaca aataggcaga tgtcttgcga gactgaagga tcttcaagaa gaagcaaaag 120 aagagattga aaacgttgaa gaactccaga aaggaccaat cgatgctcta tttggaacaa 180 tcatcgacat tgtcgacctc gcagatagaa taaacactga atgttatcgt attgagcaat 240 aatcgacaaa atggaaactc ctgttcgatt ccgacccgtc cgaaaaagaa gttatgagaa 300 attacaaaaa tcagcacggt gattttgaag ttgaattgaa caagacaaaa gaagtacgtc 360 ttcaacttca caagatgtac aagaaatacg tagaaaccca caaaactcga actcccacaa 420 ctgtcacgga gtacccgcaa ttccctggag acgaagctcg gatagctaca tctcaatcaa 480 ctcaacagct catttcacaa ggaaatcaga gcaccccagt ctacaacctt gaacctattc 540 aattcatctc aattctgcta ctactaattc caacaccaca attcctatgt caactcaatc 600 tgctattcct ttttattcaa gtcaatcttc aaatcagcct actattccta catatttaat 660 tcaaccttca attcaatcta ctattccatt gcaaacagtg ccgtctacac aacaaagctg 720 tcaacatact ttaccacaac aaacacttgc ttctgcacaa caaccaattc aacaaacact 780 accgtatgtt caaaactcga attatcaaca ttatcaatct gccatccaaa atggtcaaca 840 agttttccaa caacctttca atcttccaat tcaattacct caaatgggac tcatgaaatt 900 tgatggagac ttcactaaat atcattcatt tatcgagatg ttttcatcac tattcgattc 960 acttccaatt ggagacgttt ccaaacttca atatttaacc atggctctgg atggtccagc 1020 ccgcgacatg attgctcatc tttcattgat tggaaccaac tatgccattg cgagagacgt 1080 cttacacgaa caatacgggg atacgaagag aactcgtcat caccttatta gaaaactaca 1140 agacctacca gcaatgtcga acaaccactc aatgagccaa cttcaagaat tatggtccaa 1200 cgcatcaaca atcttctacc aactacgcaa tctacaagct gactcagaca acgtcacaac 1260 tgccgacatc atttcaagaa aacttccgaa acggtacatc aaattattgt acactggaac 1320 aaattcggaa cgaaactaca gtgcttcaga actactacaa acagtcagtg gctacattaa 1380 gtcagaaact ctagtcaaca cgatttcaaa tgacaacaaa gaaccactcg atcgaaaact 1440 gaccacaatg tctgtgcatc aacatccacg ccatactcat cagcaactgc caaacaaaac 1500 taccaatgga caaacattgt ctccttgcat cttctgctca tcaacttcac acgttcatcg 1560 acatgaggaa tgtcctatct tcaatacagc cgaagcaaga atacagaaag cacgagaaat 1620 aggactatgc tttggatgtc tccgatcagg tcaccaacgt tcaaagtgca gtagaccaag 1680 aacttgcaac cattgcaaag gaaaccatca cacagtcttt tgccaatctc acaatcgcaa 1740 tgccacgaga aacccaaatc aaagagaatt tggtagccaa caaatgaatt ctaatcatag 1800 aatcaatgga gcattgacaa gtggaccgaa tacgagaaac ttccgagatt cccgatcaca 1860 atctccatca agatactcaa gagaaccgat tcgcaacagt caaggattca catttggaag 1920 acaaattcca tcaaaacaaa caaggtatca aaatcaatca tcaagcgaac ggatacattg 1980 tccaacccga agagttcagt ttggaactac agccatcgcc gaatctcaag acttcccaat 2040 tcattcaaca gaaaattcaa caacatctat gctaacattg acatcgatga aatacactcc 2100 agatgcccca attgctcaac taaatgaagc agaagaaaca ccacaactac cttccgatgt 2160 cccgatgcta tttcaagaga gagaacttcc agtcagcatg atgtctacaa ctgtcaatgt 2220 agaaaacatc aaaggagaac aaattccagt caatgtattc ttcgactcag gaagcaacaa 2280 atcatacatc acggaaaatc tacgagacca actggatcta ccatcaatta gcaaaaaaca 2340 actgaaaatt gccacattcg gttcggacaa tcttcagtcc atcaaatcaa aaggttacct 2400 tgtgaacttc ctcatcaaag acaagaagat accgattgcc ctcagcagtg taccatcaat 2460 tgtcaactcg ataacaacag cgaaaataaa tgaagaaaca gtccaagatt tgatacacaa 2520 cgacaatgcc attttgccac gtacaacaac tttacctgac attctgattg gactcgactt 2580 catgagtaga atactcggcg aaacaacttc aaagatttta ccaaatggaa ccacaatcaa 2640 ctacacagat gttggcatca tcgtcactgg aactgaaaat caatcaacca aagaagctac 2700 attagattcc gatcaatatg caaacaaaca caactaccac cactacacca caagttctac 2760 aaccgatcct cacggtatct tgcatattga ccctactgaa gaacatgaag ctcttcacaa 2820 gctacttgaa aaattctggt cattggaatc atgcttcatc tttgaaaatc cgaaaacaaa 2880 ggacgatgag ctaaccgacg aactattcaa atcaacaaca actcgtgacc cagatggaag 2940 gtatgtgtgc aaatggcctt tcaaaggtga caaatcaact cttcccgata atcgtcaact 3000 cgcctacttc cgtcttcagt caacattgaa acgactaaac aaagatattg aactctacaa 3060 gaaatacgaa gaaatcatca aagatcaaga aagaagagga ttcatagaac tcgttcaaga 3120 cgaatttcaa ccatcttatc agagacaata tttatctcac cacccggtga taaagcctac 3180 ttccacatca accaaagtca gaatagtgta cgacggatct gccagagcca cgaaaacatc 3240 gaaatcactc aacgccattc tacacactgg tgaatctctg ctaccaaatc ttactggagt 3300 tctactacgc attcgacaac cttcaattct agtgagcagt gatttggaaa aagcctttct 3360 gcagttggga ctccaccatg aagatcacga cttctgcagg ttcttgtggc aaccctggaa 3420 tccaggaaca caaagatatc aactaacacc tcaatcacca aagatcattc ttcatgaacc 3480 ttcaacttgg aatgtactat caatttggat tcaatggaat cttggaaact cgaacaatta 3540 tagactgcca atcaaacaag actcccactt cttctctcat ttgccgcggc tcccgagcgc 3600 cgccaatctc caatttaatt tcgattgtcg gaaatttcaa cactgaccat gatcgatgtt 3660 tctcaagcaa ctgtaagctg agttatttta tgttgtttaa tttgtttaat attatctgtc 3720 taatactaga aaacaattgc cctcaatgta ttgtatccca aaatctccaa aatttgaatt 3780 ttctgaataa tcctccttcc ccctctttgc ccctcgtgaa 3820 // ID LINE2B_CE repbase; DNA; CEL; 3259 BP. XX AC . XX DT 10-SEP-1999 (Rel. 4.08, Created) DT 10-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE LINE2B_CE is a non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW CR1 superfamily; LINE; LINE2B_CE; ORF1; ORF2; KW reverse transcriptase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3259 RA Kapitonov V.V. and Jurka J.; RT "LINE2B_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC LINE2B_CE is a non-LTR retrotransposon related to the CR1 CC superfamily. CC LINE2B_CE does not generate target-site duplications upon its CC integration in the genome; its 3'-tail ends by (TCAA)n. XX SQ Sequence 3259 BP; 835 A; 935 C; 431 G; 1058 T; 0 other; ggtttttcac gtgtttctcc acctactgct cgttagataa ttcatcctct tcctcttcct 60 atctctctcc tccctccaat cctattcctc aaacttctga ttacacctct ttgacagttg 120 atactcatat ctctgaacgt ttatcttcct ctcagcaatt ttcctcgccc cccggtagct 180 ttataccgtc gctagataac ctaactctaa aaccaatgtc tcttgcaaat gtacgcggtg 240 tagctgtcct acctaagata attatgctca aagaattctt cacttcatca gacctggata 300 tcctctttct tactgaaact tttcttaaca atacagtacc ttctggtctt ttcgcttccc 360 ccctattttc tttcgtaaga gtagaccgta atccagaact tcattctaag gtggcggagt 420 tgctattatg tacaagaatt ctttaaaaat ttcaattcgt aacattaact ttgaaaccca 480 ctatcccaag catcgctgcg aaattctggc ctttgatatc ctcaacacgc attctcctgc 540 gcccccactt acggttttct tagtttatag accaccttgc tgttccgtag ctgaaaacgc 600 ctctctaata gcccatcttg aagattattt accactcagc cgcactctcc tcacaggcga 660 ttttaacttc ccccaaatca attggtcatc tcccaacgca tcgtcgcacc catttgcctc 720 ctttctgtgc tcaagtgatc tttcgcaaag agttcgattc ccaaccagag tttcttgcac 780 ctccagtaat attctagacc tagtggcctg ctcctccgat atctctgttc tggatctggc 840 cccacagcct gcgcttctga actcggatca cttatctgtt gaatttaaga tcccacttcc 900 ctatcctact gtaatcccta gttgttcttc ctcccaagat tctcgcaatg ttttattgta 960 tgattataaa aaatgcgatt ttccctcact caatagagac ctggcttgca ttaattggca 1020 gttcgaattt agcttcctaa actcggtttc tgataaattt gaacacctcc tttcaattct 1080 agagaacctc ctaactcaat actgcccaac caaatcaatg atcccgacct cttctatccc 1140 tccttcttca aattccttgt ggaaacttaa acgcctacgt caaaagatta gtaagcttgt 1200 tctctctagg aattgtcctc ccgaaaccat tataattgcc caagccaaat accgcaagtt 1260 gtaccgcaat tacaaaaact cccttcgacg ttctgaaaac ggaattgtct ccagctcaaa 1320 ctacgctaaa attaggcgtc tcatcaatag tcgcctcaaa acgctagacc ttgttccagc 1380 aattatatct gacaataaaa ccatagttag tgactctgat aaaagcgaaa tttttgctaa 1440 aatgttcgct tcacactact cccctgaaaa cacccagctt cctgacccgc ttctcaatag 1500 tcctcgcaat tctgattcta ccaactccaa ctcgactacc aactctatta ggaatgattc 1560 cttccaaccc tacataatag aaagcgttct atctaaactc ccacccaaat gcgggttctc 1620 cccccactgc gcaaattact tagtactgaa gaaatgcgcc accccgctcg cactgcctct 1680 ttctctaata ttccatcaat catttgctga atctactatt cccgcccgat ggaaacaggc 1740 aataatcatc cctattctca aaaagggtaa tccctcatcc cctcacaact acagacccat 1800 ctcgttgacg gacccctttg tgcgcgtaat ggaacgcatc atctgctccc gaatccgcac 1860 ggatttttcc catctacttt ccccccatca acatgggttc ttacaactta gaagctgtcc 1920 ttcctctcta gtccgctcca ttactctcta tcactctata attagacagg aaaaatcact 1980 tgacattatc tttttcgatt ttgccaaggc gttcgataaa gtctcccatt ctatcctcct 2040 caaaaaactt gcttcatttg gattagaccc cctcacttgt tcttggttca aagaattcct 2100 taatcaaaga acgttctcag taaaaatcaa taagttttta tctaaaaaca cctaccccat 2160 ttcttctgga gtcccgcagg gttccgtttc gggccctctc ctcttcattt tattcataaa 2220 cgacttactg attgatcttg aacactctat aaatgtctcc tgctttgcgg atgacattaa 2280 aatctaccat cacaaccctt taattatcca acaatctatc gacactattg ttagctggtc 2340 caagaagaat gagctgcccc tcgcccctgc caaatctgct tccctttctc tgggttccct 2400 aaacaccaac cactcatact cagttgatgg tgttcccata atcccctctt ctactgttcg 2460 cgacttaggt cttgtcaccg atcccaagct taaatttgag gcccacattg ccaaagttag 2520 ttccctagcc atgcttcggg ctaaacaact cctaaaagca ttctcttcca actcacctca 2580 cttctacggg tttcttttca aaacttacgt ggctccaatt ataaattact gcgctgaggt 2640 ttactctccc tcccctaatt cattgctctc gactaaactt gagaaacctc ttagacactt 2700 taccaagcgc gttctgcaac gatgtaatac taaattctcc tcttatgaag atagacttag 2760 cataatgaaa ctcttctcca ccagacacac tagaattaaa gctcaaatga aactactcta 2820 tagattcctc actggatcgt cccacttccc tagtcttaat caatttgtct catttgctag 2880 ctctaatagg caccctatga ttctcgttag aaaggacagt tgcaccgctc actttttcgc 2940 ccaatcagtt cccatttgga ataatctttt taaaaacgtc ccggttttcc tctctccttt 3000 ccaattctct cactttctcg atcttcacat tccccggtat tgatactatg tgatcccgag 3060 tccatgttct tttctttata attcttcatt ctctattctt cttgttttat gattcttacc 3120 tcacccattt cccacactcg agatgaactc ttgctgggtt tgtctcgttt tattaattta 3180 ttttctcatt aaacctaaat tgtactacct tttttaatgt attacctttt ttggcttatg 3240 ttgaaaataa atcaatcaa 3259 // ID CER12-1-LTR_CE repbase; DNA; CEL; 529 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER12-1-LTR_CE is a long terminal repeat of the CER12-1_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER12-1-LTR_CE; KW CER12-1_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER12-1_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (MAY-2003). XX DR [3] (Consensus) XX SQ Sequence 529 BP; 121 A; 146 C; 91 G; 171 T; 0 other; tgttgtgact tgggctccgg agagccggcg catcccaact ttgcggatgt atagttaatt 60 aattggtcag aatcacccac gatcccacaa ttaacatccg ccgccgaata ctcacaatgt 120 cttccgtatt cttccgaaaa cattcgctct aataatcatt taataaacaa ttattatttg 180 aattaccgcc acgcaatgca ggcgcgctcc acggacaaca acggagtgtc gtagcgcgct 240 ctctttagcc attgctcaca ttaattattg gcctaaattt gatgatgcaa ttatcgtcac 300 tctcctccga ttctcccatc gtgtgatgcc gtactccttc taattaatta ttattcgact 360 cctggctgct ctgcagcaca ttctctacgt ttttcctgat caataaacag gtttttattc 420 gtactaattg ttctgttata ttcacaccgc tcatccctcc tctcgttccg tccgtcgtct 480 ttcttgtcac tccgaggttc tggctacctc tcggattgga gtgtcaaca 529 // ID MARINER3_CE repbase; DNA; CEL; 1253 BP. XX AC . XX DT 21-OCT-1999 (Rel. 4.09, Created) DT 21-OCT-1999 (Rel. 4.09, Last updated, Version 1) XX DE Autonomous DNA transposon from C. elegans - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW Autonomous DNA transposon; mariner superfamily; MARINER3_CE; KW TA target site; TIR; transposase. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1253 RA Kapitonov V.V. and Jurka J.; RT "MARINER3_CE."; RL Direct Submission to Repbase Update (OCT-1999). XX DR [1] (Consensus) XX CC MARINER3_CE is a prototype of autonomous DNA transposon that was CC active in C.elegans genome very recently. There are about 100 CC copies of the element in the genome and they are ~99% identical CC to CC the MARINER3_CE consensus sequence. CC MARINER3_CE is flanked by TA target site duplications and CC it has perfect 31 bp-long terminal inverted repeats. CC There is no a single copy of MARINER3_CE, fixed in the genome, CC that preserves the transposase encoding ORF free of internal CC stop-codons. XX SQ Sequence 1253 BP; 363 A; 284 C; 254 G; 352 T; 0 other; ttaggttggt cgaaaagtct ttgcaaaatt tgtacttttc ttcatttctc agcgttaatt 60 catttttttc tgattctgtt tttttatttg catagtacgt aatgtgcatg tttaatatat 120 ttaaaaaaaa cttttgggtg ctgttcaaaa tgaccgagaa tttgctcgcc gaacgtcacg 180 ccctccgatg ggtttttctg tacgagttcc cccaaatctg caattgcaac gaagctcgtc 240 gtaacatgtg tgcagtgtta ggcaagaact ctgtcaccta taataccatg aagttttggt 300 tcgaaaagtt cacgaaaaag aactacgatc tcgatgataa accaagataa gatcgctctc 360 gtttgaatat cgacgaggat atttcgagag ccctggaaga tgatccaaga gcaacgaacc 420 gcgaactttc tgcgactctc aagcatcccc aaaaaaccat catcaaccat ctccacgaaa 480 ccggaagggt agaaaagttc ggtcaactcg ttcctcacaa tttgcccgat tctcagaaaa 540 attgttttgt gacctctctc tttcgctgct cactaggaaa cgaacaacgg actgggttaa 600 ggatatcatt actggaaatg ataaatgggt attgtatgtt agccatacca ggaaaaaaga 660 gtgggtcccg gtcgaggaaa ccgcgacacc tgacctcaaa ttagaacttc acggaaaaaa 720 gtgcttctct caattgggcg ggacagtaag ggtgtcattt ccagagagct tcttccagac 780 tttgctacaa tcaacgctgg cctatactgc atttaattgg aaaagatggt ccatgctcat 840 cgattacatc gccctagagg atcaaagttg ttgctgctcc atgacaacgc aagaccgcat 900 acaactttta agacccgcca gaagctccag acagtcggaa tccaaatttt atcttaccca 960 tcgtattcgc cgggcttggc tcctactgac taccatctgt tccgctcact ccagaatcac 1020 cttgccgggc agaagtttca tgatcgaaag gtcgtcgaaa cggggttgga cgacttcttt 1080 gcctcctaat cgcaggagtt ctaggcggag ggtactgttc aacttccgtt gtgttggcaa 1140 gaagtcatag gcattaatgg taaatatatt acccattgaa tcttgttcgc tttgaaaaaa 1200 tactacttcc aaaaaaaaat caaaattttg caaagacttt tcgaccaacc taa 1253 // ID PAL8C_3 repbase; DNA; CEL; 211 BP. XX AC . XX DT 09-JUN-2000 (Rel. 5.05, Created) DT 09-JUN-2000 (Rel. 5.05, Last updated, Version 1) XX DE PAL8C_3 is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; PAL8C_3; KW TIRs; nonautonomous DNA transposon; putative HAT superfamily. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-211 RA Kapitonov V.V. and Jurka J.; RT "PAL8C_3."; RL Direct Submission to Repbase Update (JUN-2000). XX DR [1] (Consensus) XX CC PAL8C_3 is a putative non-autonomous DNA transposon. CC It generates 8-bp duplications of target sites upon CC integration in the genome. PAL8C_3 has perfect 30-bp CC terminal inverted repeats. CC There are ~15 copies of PAL8C_3 in the genome; CC they are ~94% identical to the consensus sequence. CC PAL8C_3 is ~73% identical to the PAL8C_1 and PAL8C_2 CC consensus sequences. XX SQ Sequence 211 BP; 79 A; 33 C; 28 G; 71 T; 0 other; ggtacttttc cttttctacc ccgcattttt aaaagaatat tcaaaccatt gaattattcg 60 aatttaattc tcacgtgaat gtttatttta atactatttt catttttagg cttagaaaac 120 aacattacta agcctaaaat acagaaaaag ttcacgtttt tatcaaagaa aatgttccag 180 aaaaaatgcg gggtagaaaa ggaaaagtac c 211 // ID CER17-LTR_CE repbase; DNA; CEL; 294 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER17-LTR_CE is a long terminal repeat of the CER17_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER17-I_CE; KW CER17-LTR_CE; CER17_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER17_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 294 BP; 80 A; 50 C; 26 G; 138 T; 0 other; tgtcaagaat gatgtttcat gaatcattct tgaatagtat tatgttatat tttgtattat 60 gttatccttc gctttacttt tatttatttt tgccaaaatc atgtttatat taaaaaggct 120 tattcatcga tttacatttg tatataaacc ctaccattta atgttccttt tacgttccct 180 tttattcctg atatatcatt cctttcactc cagcttaaag cttcattgtt tatattcttc 240 agcaataaat aattattatt ctcctactgt tttctcttat tgatttcaat aaca 294 // ID PALTTAA2_CE repbase; DNA; CEL; 182 BP. XX AC . XX DT 09-DEC-1999 (Rel. 4.11, Created) DT 09-DEC-1999 (Rel. 4.11, Last updated, Version 1) XX DE PALTTAA2_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; PALTTAA2_CE; KW TIRs; TTAA target site; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-182 RA Kapitonov V.V. and Jurka J.; RT "PALTTAA2_CE."; RL Direct Submission to Repbase Update (DEC-1999). XX DR [1] (Consensus) XX CC PALTTAA2_CE is a putative non-autonomous DNA transposon; it CC forms almost perfect palindrome and is flanked by putative CC duplications of TTAA target site. CC The C.elegans genome contains about 100 copies of PALTTAA2_CE. CC PALTTAA2_CE copies are ~77% identical to the consensus CC sequence. PALTTAA2_CE copies are distributed non-randomly and CC they CC form dispersed clusters. XX SQ Sequence 182 BP; 56 A; 35 C; 33 G; 57 T; 1 other; ttaatgaatt cagctcaccg agagctttaa tttaagtata atcatgacta ggttccgtca 60 attttgggtc ccrccacgaa aactacagta gttttcgtgg cgggacccaa atttttcaaa 120 attgacggaa cctagtcatg attatactta aattaaagct ctcggtgagc tgaattcatt 180 aa 182 // ID PALTA2_CE repbase; DNA; CEL; 1534 BP. XX AC . XX DT 25-AUG-1999 (Rel. 4.07, Created) DT 25-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE PALTA2_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; PALTA2_CE; KW TA target site; TIRs; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1534 RA Kapitonov V.V. and Jurka J.; RT "PALTA2_CE."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [1] (Consensus) XX CC PALTA2_CE is a putative non-autonomous DNA transposon; it CC has almost perfect 690 bp-long terminal inverted repeat and CC generates duplication of TA target site upon its integration CC in the genome. Presumably, multiple transpositions of PALTA2_CE CC were induced by Mariner/Tc-like transposase. CC PALTA2_CE copies are ~93% identical to the consensus CC sequence. There are ~50 copies of the repeat in the C.elegans CC genome. PALTA2_CE shares common (~90% identity) 70 bp-long CC terminal inverted repeats with PALTA1_CE. Their internal portions CC are 60% identical only. XX SQ Sequence 1534 BP; 540 A; 223 C; 224 G; 546 T; 1 other; ttagggtgtt cgataagttt tttcgttttt tctcaatttt gattttttct caaatatttt 60 tttgtgatta catagtagtt agtgagtagt gaaaaattta taaataccaa ccagttccgg 120 caacttccag actgcatcat ctgccttcga gtggtcgaga agaaaaaata attttttcga 180 tttttttggt tttaggtttg aatggaaatg ttagtgtttc atgcagttgg aaaacttttg 240 tataacgcag attaaaaata tcaggcaacg aaaagcactc cttgatatct taaaatttga 300 aaaagtaccg aaaattagtt gcaaaaatta caattttctc ttcagagtgg tatatcttag 360 actgcgtgac agatttctat ttatgaataa ctacaaaaat gttaagcatt tcaaaatgaa 420 caaattatat gttgacaact ttttggtatc tttcaccgtt taaaagttac acatctgaga 480 agcctgccaa ctcgaacttt tagagttctg aaaattggac gtttgtctga atatctcgat 540 ttctagcaaa tctatcaaaa aatttataac tacaaaaata atgaccaatt atttctacac 600 atttttgtag ttggtcattt taagatatct tttatagcaa cagagttatc gctcctataa 660 gaaaaaaacc gaaaaatttg ataatttgat ggaagaatga taaaaattcg ttgaagaatg 720 aaaaaacttg tgcggaaatt gtgttttttt gtttgatttc tataatatta ccataataaa 780 aacattaata tccatanaag atccatagaa aacatttttt tcggttttca attcaatttt 840 ttcaattaaa ttatcaaatt tttcagtttt tttcttatag gggcgataac tctgttgtta 900 taaaagatat cttaaaatga ccaactacaa aaatgtgtag aaataattgg tcattgtttt 960 tgtagttata aattttttga tagatttgct agaaatcgag atattcagac aaacgtccaa 1020 ttttcaaaac tttaaaaaat tcgagttggc aggcttctcc gatgtgtaac ttttaaacgg 1080 tgaaagatac caaaaagttg tcaacacata atttgttcat tttgaaatgc ttaacatttt 1140 tgtagttatt cataaataga aatctgtcac gcagtccaag atatacccct ctgaagtaaa 1200 aattgtcatt tttgcaacta attttcggta ctttttcaaa ttttaagata tcaaggagtg 1260 cttttcgttg cctgatattt ttaatctgcg ttatacaaaa gttttccaac tgcatgaaac 1320 actaacattt ccattcaaac ctaaaaccaa aaaaatcgaa aaattatttt ttcttctcga 1380 ccactcgaag gcagatggtg cagtctggaa gttgccggaa ctggttggta tttataaatt 1440 tttcactact cactgactac tatgtaatca caaaaaaata tttgagaaaa aatcaaaact 1500 gagaaaaaac gaaaaaactt atcgaacacc ctaa 1534 // ID CER1 repbase; DNA; CEL; 7881 BP. XX AC U15406; XX DT 06-FEB-1997 (Rel. 2.01, Created) DT 06-FEB-1997 (Rel. 2.01, Last updated, Version 1) XX DE Internal part of retrotransposon CER1; gypsy/Ty3 group. XX KW Gypsy; LTR Retrotransposon; Transposable Element; CER1; LTR; env; KW gag; gypsy/Ty3 group; pol; retrotransposon CER1. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1970-7881 RA Wilson R., Ainscough R., Anderson K., Baynes C., Berks M., RA Bonfield J., Burton J., Connell M., Copsey T. et al.; RT "2.2 Mb of contiguous nucleotide sequence from chromosome III of RT C. elegans [see comments]."; RL Nature 368(6466), 32-38 (1994). XX RN [2] RP 1-7669 RA Sulston J., Du Z., Thomas K., Wilson R., Hillier L., Staden R., RA Halloran N., Green P., Thierry-Mieg J. et al.; RT "The C. elegans genome sequencing project: a beginning [see RT comments]."; RL Nature 356(6364), 37-41 (1992). XX RN [3] RP 1-7881 RA Britten J.R.; RT "Active gypsy/Ty3 retrotransposons or retroviruses in RT Caenorhabditis elegans."; RL Proc Natl Acad Sci USA 92(2), 599-601 (1995). XX DR GenBank; U15406; Positions 493 8373. XX CC LTRs of CER1 are named as LTRCER1. CC CDS 730..7548 CC /product="gag, pol and env protein precursor" CC A Gypsy/Ty3-class retrotransposon (Cer1) has 492 bp terminal CC repeats and CC one long (6819 nt) open reading frame. Alignment with other CC Gypsy-class CC elements and with retroviruses indicates that an env gene CC occupies the CC 3' 1.2 kb of the open reading frame. Since Gypsy of Drosophila CC has been CC shown to be an infectious element, it is possible that Gypsy-like CC elements are active in C. elegans. XX SQ Sequence 7881 BP; 2639 A; 1158 C; 1888 G; 2196 T; 0 other; tgggggccga accgccgttg gtttccgtgg aattctcgtt tttcgttgcg aaaatcgagt 60 ggattttgaa gaatttggat cgtctcgagg atttttcttt ttgttagctt gctctgtacg 120 agtagttgat aaagagaaaa caaaaaagcg atttttctgg agctgaacgc tcaggaagtt 180 ctgatttttt ccagaattct gtttctgtct ctgtaagagc ggggatagtg atttctgccg 240 aatttttttg aataaaagtt ctggaatttc gaagattgta aaattcctag tgaattttat 300 cgattttttg agttgaaagc ttgatttttt caagtttttg ctgagaaatt caatttctca 360 tcaagaaatt ggtgaatttt gctgattttt gcatcgaaag tgcgaaattt tgcggaaaag 420 ttaaagtttt cgtgaaattt tgagattttc gaaaacaatc tttgattttt cggtgatttt 480 attcgattat tttggttaga aataggaaaa tcgctcgccg atttttcatt tttctggaaa 540 aagtcgtgaa tttccgcaat tggaaaaaat cgataattgg atttcactgg attattcgcc 600 tttgagtttg caatcgtttc gtgggatttt tcacatttct tgttgatgta ctctcaagtt 660 tttgctcatt tcttcattta attgctgtaa tctaaacgaa tttatttttg tttcagtgaa 720 atctgaagaa tggaggtgaa cgagggacag gatactgaag gaggatcatc gagagcacaa 780 actctcacac caccaccaaa tccacagcaa caactgtatg atgaagagga tttgctgcgg 840 gagtcgatgg ataccacgga gaagacattc gaaaatggtt tccaggtgag aaaagcggaa 900 catgaagtaa aaaagaagga tgtgattaaa catattcaaa attatgcaaa ggcaaatgaa 960 gcccaaaccg cattgatggt ggaaccattt ataaaaatca taaaagagga agaagatata 1020 attgaaatta gagaaaaatc gataatgatg ttgaaaaagg tggtgaatga acaaggaatt 1080 acaatttctg atttgcagat tcaaaaggag caaatccgtc agcatctaca agattcttct 1140 caaagaggga cagccgagga tgccgaaact caaaaaatga agcaattttt ggatacgaat 1200 gagcttcata atgtgagtga tttggaggaa ataattaaag agtattcagt ccttaagatg 1260 aaagaagaaa aagaaaagca atgtttacag atggcgagtg attcctgggc aatgatgcgc 1320 gaggaaatta tggaaaagag ggaaacaaat cgtgatttaa atcgacagct gaaggagaaa 1380 agtgaggagt tgatgcagaa aagtcaaatc ttggtggaaa ctacactgaa gctgaaggct 1440 gttgaagaag aaagagataa gcggaagaag gaagaacaat tcagagaagc tgacgctagg 1500 tcgaataact atgcacgaaa aggtgaaatt agttccaaca ttgaacaaaa gaatcatcaa 1560 aatattcaga tcatggatac aagatgcaca acatcgagct caagaatgaa tacaccagca 1620 caacgtatcg gtgagaattt gagcacttca aatgtgggaa ataatgttgt tcgggaaact 1680 gtaagagagt attgtgaaga gacaggggag attcttgagg actttgaagt aaatcaaaat 1740 gatagtgttc tgactgaaag gaatgtgact ggtagtgtaa gaaatgggga ttcccaggtc 1800 cagacgaata gtttagaaag gatgactcaa atgatgctcg cacaaagcct gccggaacca 1860 gcaaagttca cggcagaaga gggttctatc agtattgaag catttgagaa aacttttaaa 1920 ctaaaatttg gaactttttc ggatgagcaa caagtagcaa ttctggagtc gaagtacttg 1980 gagggaagag cacaaaaagc ttatcgttcg ttgacagcgg gagagaaggt gaaagtgaaa 2040 gtggttttaa atgcattagc taataggttg agattatctg tagaagatga gaatcatcgg 2100 gctaagcaaa aatggaacat tctttctcgt aagccagatc aatcgtgtga agattattgc 2160 ttgttgatcg atgacattgc tagaatcgct ttccgcagag tttctcctga agaattgtca 2220 tcgatgaagt atgtcaagct tctagatgaa gtcactgaca tgcatcttcg ctgctcgatt 2280 gataataaga taatggacac tgatgaaatt aatcactatg atgtatgtcg tgaaatgata 2340 atacgtcatg aatggaatgt ttcaaagatt aatgaaaaac agtgcttgaa ttctaaattg 2400 tcggagaaga gagggaaagt tcaaaatgcg gaaaattttg tgaatcaaaa tacacagaac 2460 aattttaaac cgttttctcc aaacaaggct gctgataatt ctcggaactc ttggaacaat 2520 aattcacaaa ataattcggc ggcttctcag aatatctcga gagagcaaag ttggaaaact 2580 atttcagttc ctcagaaaca tcagaatcct tcggataggt gcagtgactg tcagcaaaga 2640 ggatggcata tgttttggtg ctcgaaaaag tctaaagata atgcatcaca aaaatgtgat 2700 gaatgtcaac aaagtggatg gcacatggcc agttgcttca aattgaaaaa cagggcatgt 2760 ttccggtgta acgaaatggg gcacattgct tggaattgtc cgaagaaaaa tgaaaataca 2820 tcagaaaagg aagctccggt ggcaaaagta gagacaatcg aaggtgtaag aatgaaggat 2880 tgtttgctaa tggtgaaatc ggaaaaatct gaatctgagg ttacaagaag tctggaaaag 2940 ggacaaatag gaaaagctaa tgtagaaatt ctgttggata gtggtgctag tatttcgcta 3000 atgagtaaaa atacctggga gaaaattgtt gaagtcaatg gaaagtcttg ggaacaggac 3060 cagatatatg aggagttgga gtataaaaca gctagaacag caaataatca gttgttcact 3120 ttgctcaggg cggtaatggt agaaataaaa atgcaaacaa agtctgaagt tatcaagttt 3180 catattggag acatggacag ggaaaatgtt attatcggag ctggtcactt tgagcaaatg 3240 gggattcaaa tgaatatgat cattgaaccc agaatagttc gtatcgacga agacgtggaa 3300 attccgccga gaagttgtca gttagtagaa gttaatgtta ctggtatcat aagagaagga 3360 gcatattgtt tgataacacc aacaatgaga catgtggaaa atgcggttgt acgactgaat 3420 gaacaaggga aagcttgggt tagaattgtg aatcaattca aacatatgct gtccttgaaa 3480 aaaggggagg tgattgggaa aggagagaca ggaggctttg aggtgttatc aaataaagcg 3540 gaacaagata ttacagtgga agaggtgtta aatgatccta ctcttttttc ggaaattgag 3600 acagatacaa acagttgtga agtagtaaag acggctgaaa catatgagcg tttcacaaca 3660 atttgcgagc atcttaagag agaaaatggg gatgatagga aaatatggga tgttattgaa 3720 caatttcaag atgtgtttgc gatctcagat gacgagttgg gacgaaattc tggtacggaa 3780 tgtgtaatag aattgaagga aggagcagag ccaattaggc aaaagcctcg tcctattcca 3840 ttagcattaa aaccagagat tcggaaaatg attcagaaaa tgctcaatca aaaagtgatt 3900 agagaaagca agtcgccttg gtccagtcct gtagtgttgg tgaaaaagaa agatggctct 3960 attagaatgt gtattgatta tcggaaagtt aataaggtcg tgaagaataa tgctcatcca 4020 ctccccaata ttgaagcaac gctacaatcg ctagcaggaa aaaaactata cacggttttt 4080 gatatgatag caggattttg gcagattcca ttagatgaaa agagcaagga aatcacagct 4140 tttgctattg gtagtgaact ctttgaatgg aatgtacttc ctttcggatt ggttatttcg 4200 ccggcactgt tccaaggaac aatggaagaa attattggag atttgttagg agtatgtgca 4260 tttgtatacg ttgatgattt gttgattgct agtaaggata tggagcaaca tctgcaagat 4320 gttaaggaag cgttgacaag aatcagaaaa agtggaatga aacttcgtgc tagtaaatgt 4380 catattgcaa agaaagaagt tgaatatttg ggacataaag tgacactaga cggagtggag 4440 acacaagaag taaagacgga taaaatgaaa cagttctcga ggcctactaa tgttaaagaa 4500 ttacagtcat ttttgggatt agtcgggtat tatagaaaat tcattttgaa tttcgctcaa 4560 atagcttctt cattgacttc gttgatttct gcaaaagttg catggatttg ggaaaaagag 4620 caagagatag cattccaaga attgaaaaaa ttagtttgtc agacaccagt gttggcacag 4680 cccgatgtgg aagctgcttt gaaaggggat agaccattta tgatttatac tgatgcaagc 4740 agaaaaggca taggagctgt gttagcacaa gaaggacctg atggacaaca acatccaata 4800 gcatttgcat caaaagcatt aagtccggca gaaaccagat atcatatcac ggatctggaa 4860 gcactggcaa tgatgtttgc gttgcgtcgg ttcaagacga ttatttatgg aactgcaatt 4920 acagtattca cggatcataa gccattgata tcgttgttaa aaggatcccc actggcggat 4980 aggttgtgga gatggtcgat tgaaattttg gaatttgatg tgaaaattgt gtatttagcc 5040 ggaaaagcca atgcagtagc tgatgcgttg agtcggggag gttgtccacc taatgagtta 5100 gaagaagagc aaactaagga attgacgagc atagtgaatg cgattcagac agagctacct 5160 gatatacttg acagttcttg ctggttagaa aggttaaagg gagaagatga aggatggaaa 5220 gaagtgatag cggctttaga agggggaaag acgaagggaa ccttcaagat tgttggcatt 5280 gagtcggaaa tttctctgga atattataag attgttgggg gagtgctcaa gaatactgag 5340 atagaagaac agagtagaag tgtagttcca gagaaaatac gcacaccatt actgaaagag 5400 ttgcatgaag gaatgttagc ggggcacttc ggaattaaga aaatgtggag aatggtgcac 5460 agaaagttct attggccgca aatgagagtt tgtgtcgaaa attgtgtgag aacctgtgca 5520 aaatgtctat gtgctaatga tcattcaaag ttgacttcgt cattgacacc ttatcgaatg 5580 acgtttccgc tagaaatagt agcttgtgat cttatggatg taggcttatc agtacaaggg 5640 aatagataca ttctcactat cattgattta tttacaaaat atggaaccgc ggtacctata 5700 ccggacaaga aagcagaaac ggtgttgaaa gcttttgttg aaagatgggc aataggggaa 5760 gggagaattc cgttgaaatt gttgactgat caaggaaaag agtttgtaaa cgggttattt 5820 gcacaattca cacacatgtt gaagattgag catataacaa ctaaggggta caacagtaga 5880 gccaatggag ccgtggaaag atttaacaaa acgattatgc atataatgaa gaagaagact 5940 gcggttccta tggaatggga cgatcaagta gtgtatgcgg tgtatgctta taataattgt 6000 gtacacgaaa atacgggaga gacaccgatg tttttgatgc atggaagaga tgttatgggc 6060 ccacttgaaa tgtccgggga agatgccgtt ggcattaatt atgctgatat ggatgagtat 6120 aagcatttac ttactcagga gttgttaaaa gttcagaaga tagccaaaga gcatgcgatg 6180 agagaacaag aatcttataa aagtttgttc gatcagaagt atgcatcaaa gaaacacaga 6240 tttcctcaac cggggagtcg agttttgtta gaaattccat cagagaaact aggagctcaa 6300 tgtcctaaat tagtaaataa gtggagcgga ccgtatagag ttatcagttg ttcagagaat 6360 tcggcggaga tcacacctgt actgggaaaa cggaagcaca ttttgcaaat tccgtttgaa 6420 aatttgagag ttataccaga agcaatgcca gatatactaa tagtgactaa gaaggggaga 6480 tctaaaaaac ctgaaccaga aatttattgt gatgaaatta ctgttgtatc tgaaaataat 6540 gaaaattctt gtttcagttg tcgatatatc tgccgatgtg ctctgaagcc ttgtatgttc 6600 aatatgactc ttgtccctga agcccatacg ccatcaccaa ctcagcttta tcgaatgtac 6660 tgcattatgg agaaatcggg aaatagaaag atcgatccaa aacagctgat ggcaatgtca 6720 agccgcccgc tgccgtcacc attgcaaata actctgccgg ataagatgat ggataactta 6780 ttcaaggata tgatcggatg ctcctctctg tggacatacg tcgctgaact tggatgggaa 6840 aattcgtaca atcgctacgt ggacaagttg ctcaatgaaa attgcggtga cattttaaat 6900 ggtcccggaa ctatgctcat tctagctgat ggattgcgat tggaagacct gccggtttcc 6960 acgaagaact gcttcgtctg tactgactat gatgaggaaa ctctgattgc tcttcagaag 7020 aaatgttgcc gagaaagatt caagatgata gttctggtta ttccattcac aattgatgtc 7080 gagctagttg attgttggaa tcggcttatt gcaaaaatct cggaagaaac taaaatactg 7140 gtagtgtcca acatgactcc tgacgaattg gaagatcatg ctcttctagt ggaattcaca 7200 tccatcctgc aaaaatgtcg tcgagtagat gatggttatt tggaaataat atcgcttcat 7260 gataggttgg aagctcatcc aagaaagact ctcgaaatga cagctttagc cggtaaagtg 7320 gagtactgga aagctgtaca aactcgtgct aaggaagttg gaatggaatg gaaagcgttt 7380 gagttaaaac gctgtaccag tgatactcct gtcaaaaact cagactgtga agcctcgacg 7440 tctatgaagt ctgcgtccac tgtaagaacg tttgaggata gaatggtcaa acgaggaaat 7500 cacaatcgtg tatatcatca tttcactcca tacgggcgta aaaaatgagc ccttaattga 7560 ttttgttgaa cgttataatt ctttgttctt atgttaatta tcatttcata attatttgtt 7620 gttttaataa ttttttggtt atttttctta taattttgtc ccccaccctt gagttctagg 7680 ttattaattc ttaagttagg atccgtctat tatttgaatt catttccctc atttccctat 7740 taggttccgt agtattttta attttttttt atattttccc cttgctgatg agtgctgtga 7800 taatgatcag ttgtgttatg cctaatgggt gagatgtacg tcatttttga ataagggagg 7860 ttgagcttga aaaaccggga g 7881 // ID NDNAX2_CE repbase; DNA; CEL; 2844 BP. XX AC . XX DT 12-NOV-1999 (Rel. 4.1, Created) DT 25-MAY-2001 (Rel. 6.04, Last updated, Version 2) XX DE NDNAX2_CE is a family of non-autonomous Helitron DNA transposons DE - a consensus. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW HELITRON class; NDNAX2_CE; nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2844 RA Kapitonov V.V. and Jurka J.; RT "NDNAX2_CE."; RL Direct Submission to Repbase Update (NOV-1999). XX RN [2] RP 1-2844 RA Kapitonov V.V. and Jurka J.; RT "Direct submission."; RL Direct Submission to Repbase Update (MAY-2001). XX RN [3] RA Kapitonov V.V. and Jurka J.; RT "Rolling-circle transposons in eukaryotes."; RL Proc Natl Acad Sci USA 98(15), 8714-8719 (2001). XX DR [2] (Consensus) XX CC NDNAX2_CE is a non-autonomous Helitron DNA transposon; CC its copies are 98% identical to the consensus sequence. CC There are about 100 copies of NDNAX2_CE in the genome. XX SQ Sequence 2844 BP; 797 A; 632 C; 524 G; 877 T; 14 other; tctgcctcct tcctaaaacc cccatcttgc cacacacacg cgttttcggt gttcaaaaaa 60 tgttcaaaaa atgtccgaaa aaattgtaat acaactttga acgtttatat ctcggctctc 120 actgaaccga tttttgaaat tcttttttct ctacattcac aaaatatagc gacgtgtgtt 180 catctaaaag ggggttgatt ttggagcatc ctgtcggctc cacaagcgga aaaccacacc 240 cckcttgttt ttgcgcacac cgaagtactg gccgaaaacc actttaaatt gaaggaaaat 300 tcgagtttat gttcaaaacg tttgagactc gaaacttaag ttctcctttt caatgcctat 360 ccattgtatc taagttcatt gtgatatctc tattcgttat tttaaaaacg ataattaaag 420 ttttgccaaa atttctgcac ctcccccctt ttctcacaca cacacaccca cctccccttt 480 tcatatttgg tgtgttttcg gccacatttt caccattgtt tcggtcaaat ctccgttagt 540 cttgctcttt ttccaaattt ttttttggaa aaatgttcat cacatcgaga ggaagcggaa 600 aatataattt tttgtttcaa aaaaggtaaa taatagtctc aaaaatagaa aaaatgaaca 660 cttctttcaa acaatttaca tttttccata aaattcacaa actttccact atcgttccgg 720 tttttatgtt aaatttagaa ttaatagaat gatttaaaat ggaatctcaa atcacagctc 780 gatccgatca tttttcgcgg agataggtta tttaattttt gcacatggaa atccagtttt 840 cctctgttcc tggacaagtt ttcaactatc cttttgattt tatgttttaa ttttgataat 900 tttcactaaa gcaaggtaga caaactaaat ttggtcggtc ttgacaactt tcagcacatt 960 trkggaagtt ttgagcttcc aggccttccc agaaggtgat atgtttcaat ttttgttatt 1020 tttaaaaata ataaccgccg tattattcta ctttttacgc caatttcaaa aattatttwa 1080 tttttctcaa actcctacta tgagcaaagt tatgacggat tgaagtttta gtgcataact 1140 cctcacaggc cgcctctcct cacaagaggc tgcatgcgtg cgggtggctg gaaaatgttt 1200 tctttccaaa aaaaaacata gaaaggcact ttaactctaa gaakattcag ttcactttga 1260 acatatactt caggcattgc aggccggagg cctgctgtgc gattacaact gtttcaagcc 1320 cagattgtct ggattgagtg ccgcaggcac tcttactctt agaagacgca gtcaacctta 1380 aatcattata ccaaggcagg ccggaggcct gatatgcgat tactcttaga taagtttgat 1440 ttagaaagct tttattattt tcagttttct aaaacaatcc ccattcccga agggctgggg 1500 gcccacaggc gttcacgatt ttccaaaaaa gtgaaatgtt tattttttgg ttatttttca 1560 aatttgaaaa caccatacga ttttggttta aatactaaat tgaaatagaa atttcgctct 1620 cttacgatct acgccctacc cactgctaag cctggcaagt cgcagatttg ctatcaacac 1680 tattggaaac tgtgggctgg ctttgatcct attccaagca gatctgttat tcactcaagc 1740 ctaataacca ggtttgaggg ccgcaggccc gaatactctt graaactgtg ggctggcttt 1800 ratcctattc taagcagagc aggctgtagg cctgcgacgc ggttgttatt yactcaaacc 1860 taacaaccag gttcgagggc cgcaggcccg aatactcttg gaaactatgg gctggctttg 1920 atcctattct aagcagatat gttattcact caagcctaat aaccaggttt gagggccgca 1980 ggcccgaana ctcttgraaa ctgtgggctg gctttgatcc tattctaagc agatctgtta 2040 ttcactcaag cctaataacc aggtttgagg gccgcaggcc cgaacactct tggaaactat 2100 gggctggctt tgatcctatt ctaagcagat atgttattca ctcaagccta ataaccaggt 2160 ttgagggccg caggcccgaa tactcttgaa aactgtgggc tggctttaat cctattctaa 2220 gcagagcagg ctgtaggcct gcgacgcggt tgttatttac tcaaacctaa caaccaggtt 2280 cgagggccgc aggcccgaac actcttggaa actatgggct ggctttgatc ctattctaag 2340 cagagcgcgt atgttattca ctcaagccta ataaccaggt ttgagggccg caggcccgaa 2400 tactcttgaa aactgtgggc tggctttaat cctattccaa gcagagcagg ccgtattcat 2460 gagttttcct tttccttcac cgttttttgt agttcacagt gatgattgaa racacgctta 2520 ttttattgat aatacattca aattattctt gaacaaaatc tcagcctaag ttattcgtac 2580 agtgtcggtc ctggagcaat gaatcaaagt ttaaagagat ttcactttac acagtaagag 2640 cttcgaggcg aagcctcaaa cctgaatagt caagttttct aaacatatac cccactcctg 2700 aaataatttc aargagatct cgaataaccc swtaacaggt tcgaggcgaa gccgagaacc 2760 tgcgccagct accttgctaa taacttagag cgaattaacc ctatccccat tcccgaaggg 2820 ctggcgggcc cataggcccg ctag 2844 // ID PAL8C_2 repbase; DNA; CEL; 213 BP. XX AC . XX DT 09-JUN-2000 (Rel. 5.05, Created) DT 09-JUN-2000 (Rel. 5.05, Last updated, Version 1) XX DE PAL8C_2 is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; PAL8C_2; KW TIRs; nonautonomous DNA transposon; putative HAT superfamily. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-213 RA Kapitonov V.V. and Jurka J.; RT "PAL8C_2."; RL Direct Submission to Repbase Update (JUN-2000). XX DR [1] (Consensus) XX CC PAL8C_2 is a putative non-autonomous DNA transposon. CC It generates 8-bp duplications of target sites upon CC integration in the genome. PAL8C_2 has perfect 31-bp CC terminal inverted repeats. CC There are ~10 copies of PAL8C_2 in the genome; CC they are ~99% identical to the consensus sequence. CC The PAL8C_2 and PAL8C_1 consensus sequences are 90% CC identical to each other. XX SQ Sequence 213 BP; 70 A; 37 C; 31 G; 75 T; 0 other; ggtacttatg ggtttcgttc cccccaaaat gatttttaat tatttaacac taccttttta 60 attttaatcc acacgtgaat gtttatttta atactgtttt catttttaca aggcttagaa 120 aatcatttcc ctaagcctga gaatgaaaaa aagttcacgt ctcaattaga aaatatggaa 180 atcattttgg ggggaacgaa acccataagt acc 213 // ID CELE46A repbase; DNA; CEL; 438 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE CELE46A is a putative nonautonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE46A; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-438 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification of putative nonautonomous transposable elements RT associated with several transposon families in Caenorhabditis RT elegans."; RL J. Mol. Evol 43(1), 11-18 (1996). XX DR [1] (Consensus) XX CC CELE46A is a putative nonautonomous DNA transposon flanked by CC 32 bp-long terminal inverted repeats. XX SQ Sequence 438 BP; 149 A; 66 C; 62 G; 161 T; 0 other; ttaaagttgg agtggcgcca gtgtggaaat tgttaaaaac cactcttttc gttccaaaat 60 gaccaaatat aacaataaaa tgtttcaaaa aaaattgaaa atatttttat ttattgccaa 120 aaagtgacaa ttactcagtt tttgcaactc ataattttga aagtcgacca aaaaaaaaat 180 ttttttccta cattttttat actgtaattt tgtttcaatt atttggtttt gaccattgta 240 ggggtcggaa catgcgacat tgttttggat ttcctcaaaa gctcttgttt ttcaaaattt 300 tggcaatttc ccaaaacttc gaccagaaat tatgaaaaat ctataatttt ttgaaatgtt 360 taatcatgat atttggtagt ttcgaatcat aattagcgta tttagacaaa atcgctactg 420 gcgcaactcc acctttaa 438 // ID CER13-I_CE repbase; DNA; CEL; 8008 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER13-I_CE is an internal portion of the CER13_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER13-I_CE; KW CER13-LTR_CE; CER13_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER13_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 4042..4416 CC /note="predicted RNase region" CC gene 3064..3726 CC /note="predicted Reverse Transcriptase region" CC gene 2224..2535 CC /note="predicted Protease region" CC gene 4837..5430 CC /note="predicted Integrase region" CC primer_bind 3..21 CC /note="predicted Primer Binding Site (Arg)". XX SQ Sequence 8008 BP; 2471 A; 1737 C; 1737 G; 2063 T; 0 other; tttggtgacc caacattctt atattcggaa acgtccgctg tacgttttca aacgcttcca 60 accaacaaca tcaaagtcaa cggaaattga cactcaagcc tcaagccaag ctcgaagtac 120 tgagcaaccg caaatcaacc aacacacgga attgtgtgaa tcaaacaact gtcatacgaa 180 agtcaacatt catatgagag tagtgcatca acaagcaaga aaccagttat gagctctgtc 240 tatcgagcac aaattggtat caatcgtcgt cgactcaaag atgctcttga cactgtggag 300 acattcgatg acgtggagga cgtccaggat ggttcggaga aagagatctt tgacgacatc 360 gcagaaatag cggatacagc atatcttatt cggtcttcaa cggataagat caagtcatat 420 gacaaacaat ggaagaaact tatttcaaga gattcaaagg aggcggaagt tatgaaggat 480 tataagcagc gtctcggaga ttacgatgct gatatacaag aaggttccat caagttcaat 540 cagttgaagg agaaatatca atcattcgtc gcacttcatc agaagaaaag tccggcgact 600 acagaatatc cagcgttcag tggagagatg tctatatcaa gggatccatc tttatctcct 660 caaacaccag taatcgaaac agatcctatc actattacta ttccaaggga aacacgggtt 720 aatccaaccg ttatttcgtc tactattgat ccacttattc atcaccaatc tgtcgtcaac 780 aattctatac aatctaatca agggtttact acgtcacttc cagtcaaatt gccggcaatc 840 ccgatgccca gttttgatgg cacatacacg caatatcact cgttcatgga attattctct 900 tcattgattg atgagcaacc aattagtgat gtagccaaat ttcattatct caaggcagca 960 ttacaaggag aagccaagct tattgtaaag cacttaccac ttacgtcagc aaattatcaa 1020 gtggcaagag aattgctgca tgaacaatat ggagatgtta tgcgtactcg tcatcacctt 1080 caaagagagc ttcaacatct accatccatg gcacagatca ggaacacgtc ccaattacaa 1140 gagttctgga ccacagcatc tgctattttt cagcagtgga agaatatcga accatcttgt 1200 gataattcta caaatgcaaa tattattatg gccaaacttc caaagcggta tgtggaaaag 1260 ttattcactg gagataatgc acgaagaacg tatacggcgt cagaactgtt tcgacaaatc 1320 agtgagtaca ttcgctggga ctctcttgtc gtaaccatct gcaatgagaa ttcttccaat 1380 gacaaaagga caactacgat gatggctcac ggtcaacaac atcagcatcg ccattttcaa 1440 aagtcgtcca accacacaaa tgcgggatca caaggaagag tgtcccatcc gtgcgtcttt 1500 tgtgcatcgc caactgaatt tcatcgttat gaacaatgct cgatgtatcc aactgctgaa 1560 gaccgtcacc ggagagcaag ggaacttaag atgtgtttcc gatgtctacg tgttgaccac 1620 aaaagtagac agtgtactcg cttccggcct tgtttccatt gcagagaaaa ccaccattcc 1680 gccatttgtc agttgagaaa tcgagtcacc atgccaggaa gtcaatctgg gcagaattcg 1740 aatatgagat ccacaacacc caacggttcc aacggttcat ttcgttcaaa ctttaacaca 1800 caacctcaac aacttcaaca acaaccagtt cgtcagcaac atacaccata tcagcgacaa 1860 gatgctcgag cacaaccacc aatgagaagt tatcagagaa acgaaaatgg atcactggca 1920 aacagacgag tacagtttgg aaacactgct gcggtaacaa gtcaagtctt tcaaggcatc 1980 tcagcgaatg aggatagcga ggaaacggat agagaggaag atattaccgt gactttcaat 2040 gcggcgtcat ttcctacgga ggcacaacaa ttcgaaatag atgagcacga ggaggcacct 2100 agtgtgggag atcaattggc aacaggacca gcgaagactt tggagtcgct tcctattgca 2160 atgatgggaa aagaaatctt tgtggagaat agttcaggcc aactgatacc tactactgtc 2220 ttctttgata gtggtagtga tcgcagttat atcaccgaga aactgatgaa ggacttgaga 2280 ctggacccaa tggattcaaa acgcatccat attcaaacat tcgccagtcc gaccgtcaaa 2340 actatcaaag ccaatcgata catggtaact attcatgcgg agaatcacaa aattgctgta 2400 ccactcacgg aaattgatac gattgcaaaa tccataacaa tcgggaagat cgacgaacca 2460 acagtcggga atctcttcaa gaatgcgtca gcaatagtcc caagaaccca tgaagaaccg 2520 gggatattaa taggattgga ttctatgtca cagttacttg gcaacacgga gtcgcgaaga 2580 cttcccaatg gaacgaccgc acatatgact gaatgtggaa ttatgatcac cgggattgaa 2640 aaaacaccaa ccaacgtgct agaagagaat agtgcgtttg caaatgatga ggaattattt 2700 catttcactg ccaaatctac tagttcacca atcggcatac tgagcgccaa tcctgaagag 2760 gaaaacgcag ctcttcgagc tctactggaa agattttgga atctggagca tactatggtt 2820 atggataatc cgagaacatc agacgaggaa attgcacatc aattctttga agcaactact 2880 accagaacag aggatggacg gtaccagtgt aaatggccgt ttaagtcaga tacatggtcc 2940 ctcccggaca atcgctactt ggcataccat cgtcttcttt ctacactgaa acgacttcaa 3000 aaagacaaag aaatgtattt gaaatacgat gcaattatta aggatcaatt ggaccgaggt 3060 tttattgaaa tcgttcaaga tgaacacaag ccgacgaata ccaaggttca ttatttgagt 3120 catcatccgg tgttcaaatt gtcttcggtg tcaacgaagt tgcgtattgt ctacgatgcg 3180 tcggctcgag caaacaaaaa ttctcagtct ctcaacgatg ctttgcacac tggtgagaaa 3240 ctgcttccaa aactcaatgg ggttcttatg agaattcgag tgccaccaat tcttgtcagt 3300 tctgacatcg aaaaggcttt tttgatgttg gaactacatc cccaggatcg agattgttgt 3360 cgatttttgt ggcaacctcc aggacagaca caaccggttt gctatcgctt cttaagggtc 3420 cccttcgggg ttaaaagctc ccctttcctt ctcaatgcgt cgatcaagaa gcaccttcgg 3480 tcggaacaat cggagctagc gaagaaaatt gcacttaaca tttatgttga taacgtgttt 3540 atgggagttg ccaatgccaa ggaaggagta caattctata gagagtcaaa gaagatattt 3600 gcaaaagttc agatgaattt gacccagtat gagtctaact caaaagagct caacgagatg 3660 attggaatta tcgaagggaa agcagcagat caaaaggtca atctcaaact gttaggaact 3720 tcatggaaca cgaacaagga tgagattggc gtcaatattg ctccaccgat tacatccaat 3780 ttgtcgaaaa gacagattct tcagagttta gctagtacgt atgaaccgct gggaataata 3840 tcaccagtcg tgctcaaagg aaagctgttt ttccaaaagc tctggaataa ttctggcaat 3900 tgggattcgc cgttatctga ggacgagata aaagaatgga aggccattga aacaagttgc 3960 tgcggagaag ccatcacgat caatcgaaaa tacttttcaa caccaaatac ttcggaacat 4020 caatttgaaa ttcattgttt cggtgatgcg agtgaggcgg ctcacggagc cgtggcatat 4080 atcagaagga taggagtcca ttctatcgag accgcgtttg tttgttccaa gtcgacagta 4140 tctccattga agaaaaccct atctattccc caagcggaac ttgttgcggt ggaacgggga 4200 gctcggttgg ctcacactat tcagcaagaa ttggatttac cgatatcgat gattgtcatt 4260 tggagtgata gcctttgctc ccttgatcaa gttgcgagca attctgcgaa gaacgttttt 4320 tgtcgcaatc gaccacggga aattcatcgg ttaactccga atgcggtttt ttcgcatgta 4380 cccgggaaac tcaatccagc ggacattctc agtcgtggat gcggaatcga ggaacttcga 4440 gatcatcctc tttggtttca cggaccacct ttcctctgca aaaaggacct acctatccga 4500 acatcgagca ttcaacaagc tgtggccatg tcaatcattt cacaacaatc tgatgacacg 4560 attatcaata tcgatccaac gaggtttagt tcatttcatc gattgttcaa tgtcgtgagc 4620 aggatggtaa aacttttcaa caaacacgaa ggtgaacatg tgatcaagca gaaggctaaa 4680 cgtgtaatca ttcaattggc acaacagttg cacccgccgt ctgaagcaac catcaacaat 4740 ttgcgcttgg aaaagatcga caacatatgg tattacgttg gacgtatacc tgatagaaaa 4800 gtaccattct taccttcgaa tcatattgcg aggttgtttg tacgggcaat acatatcaac 4860 aacttccaca gcagcccaat ttacacattg tcaaaaatcc gagaagaggt gtggattacg 4920 aaaggacttt cgtttgtgcg taaagcaatt cgggaatgca tggtttgcaa aaggatgatt 4980 tctcgaccta gctatcaacc ggatttccca atgctgccta catcacgaac cacatggtct 5040 aagcccttta ccatttgcgg actggactat ggcggaccaa tacaagcaac caatcaagga 5100 cacaatcgaa agtactggtt cattcttctc acttgccttt ccacgcgatt cacggtggta 5160 gaactagtac cgtcattgga cgcagaacat ttgctgaaag ttatgaaaag attcgcgagt 5220 cagtatggca caccacagac tatcatcaca gacaatgctg ctcagatcaa gatgctggct 5280 caagtcacgg aagaagcaca aaaacaaatg tcatcaatgt caagttccca aactctgccc 5340 actttcaagt tcattccggc actttctcca tggagcggcg gattctatga aagaatgatt 5400 gctctcttta agaactgttt aattcgagcg ggatctacca aaacgttact ggacgaagaa 5460 gatttaagga ctttactcaa agaagcagaa gctgttgtga ataacagacc actcacctat 5520 gtttctgccg acgatattcg ccctcttcgt ccatcggatt ttgtgtttcc acagaaaaga 5580 gacgggaaat tgcttacagt ggaggaaacg ctggatgcgt ctcttctgaa gacatcgcat 5640 ggacagctga tagaaatttg gatgagaagt agttctatga tagaatactt tattcgtcgt 5700 tggaaggagg agtatattca gttactgcat tcgaggactc aaacaaaaca tcgacaaaat 5760 ccacacgcag ttacacgaga cttatgtgtt ggagacgtgg tgatgatcga atcggattcg 5820 aacaaaatga attggccatt ggcaaaagta gaagaagtgc ataaacgatc tgcaaaattg 5880 ttcacgcctt gtactgggaa agttgttgaa cgaccattaa caaagataca tcaactggaa 5940 atagatgtca cagagagtcc aatcagggca acatcggatg aagcaaggaa aggatcggaa 6000 gattctggat cagccggtct tcgaagatct cgtcggattt ctggaagcac gacagctatg 6060 acaatggcta cattacttgc aatttctttc ttcattccaa caacgagagc ggcaacactt 6120 cctacacaca gcgacagtca caagtcatca caagcaagcg ttacggaatt ggtgcatatg 6180 ggattgataa tggtggtttt ccttggagga gtatttattg tatccactat tctacaactg 6240 gcgctcagct tgtgtcgttg tggtcatatc atcgttggga caatgatcag tattgctcgg 6300 acaatatact ggggacttca actattcatg atacagtttg ctcaatactg tcgtcgtcgt 6360 ggtcgtcgtc aaaagataca agattatcga atcttgatgg tggttatctt catacaattg 6420 cagttcacat tggcgtgtaa cgacatcgct catcttacgg cgagcgaaaa cgcgtgtttt 6480 caaaccgagg aaagaacaaa ctgcattctc aacagcgtct ccatcgttac cgtgaaggca 6540 aatgcgtcac agtcttgttt gatgataaag acgaaggagg acatcgaggt cgagactctc 6600 aagatcacag ccacagcact ggtcagttat tgccagaagc acacggtctt cttctccagg 6660 gacttcaagc tcgaatatga atatactcgt agatgcgatt cagcaggcag ctgctcggtg 6720 gagaaatgtg gcaaaatgtc aggtgacgag aaccttccgg aattgagtca agcagcaaaa 6780 agtaagccag gattcacggc ttgtgcgcca ggatgtggag ggattacatg cagctgtttt 6840 tatgtggatc caagctgttt attctaccgc tattatgtcg tcccgacgag taacacgatt 6900 tatgaaatct ttacttgtcc aagttggaca aatcggttgc acgttgaaat atctgttgga 6960 gacaagacgt tcactacgga gatgacacca ggagttaaat ttcaagtacc gggaaccaat 7020 gtgagcatta caccaatttc acatacgagc gtcccgcttc aagcccactc agctactttc 7080 attacggcat tttcatttgg aactatgaaa gatcaatgga cggcgttcac ttacacacca 7140 ccgtcggccc ccgggagccc cgcaaagggc tttacgggtg agcttcaatg taaagatcgc 7200 aaaactgccg aagatttcaa atgcatattt gatccagatc tatgtcgttg ctcaggtttc 7260 tcaacaactg taaactgtca gtgcacaaac gaagaaatag gaggccattc caaaaagaac 7320 agacttccag ttcgtggaat aaaccatcag gttatgaaga tcaaggatac tgttgtcacg 7380 tcagctattt tagaagcagt ggtatcgttc aatgtagagt ttcggaatgc ttctgtgtca 7440 agactaactc aagtacaatc gtgtcaagtg aaacagatcg gacagctgag cgggtgctac 7500 tcttgtgcca ctggaggaca ggctcaactg tcatgtaaat caaagcggaa aatgatggca 7560 agagttctgt gtaatgcaat ggatggctcg gttcaatgcg gtcctgaagg aacagcaatg 7620 atttggacat tctcaacccc ggatcaaggt gttttggtca attgcactgc ggattgtggc 7680 ttacggacta actggatgct tagcggacga ctaaaggatc cacccacctt ctccatgaat 7740 aagtcatatt ttgcgaaccc ttttgaaact gttaagtctt cctccttctc tttcggtcaa 7800 gttctcttta atttcgccaa aaatatcttt gagggaatcc aaaacatttt gattgtatta 7860 gggttagttg ttgtatttat aatttctttt attattgtta ggaaacttgg atctagatat 7920 tttataaccc gtcgtaggaa ccaaagacta agatttcgca ctaaatattt ttagtaggtc 7980 ctgattccaa cctttatttg cccgggag 8008 // ID RCA1 repbase; DNA; CEL; 243 BP. XX AC . XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 27-JUL-1999 (Rel. 4.06, Last updated, Version 2) XX DE C.elegans satellite-like repetitive DNA (a consensus). XX KW SAT; Satellite; Simple Repeat; RCA1; Repetitive DNA. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Naclerio G., Cangiano G., Coulson A., Levitt A., Ruvolo V. RA and La Volpe A.; RT "Molecular and genomic organization of clusters of repetitive DNA RT sequences in Caenorhabditis elegans."; RL J. Mol. Biol 226(1), 159-168 (1992). XX RN [2] RA La Volpe A.; RT "A repetitive DNA family, conserved throughout the evolution of RT free-living nematodes."; RL J. Mol. Evol 39(5), 473-477 (1994). XX RN [3] RP 1-243 RA Jurka J.; RT "RCA1."; RL Direct Submission to Repbase Update (JUL-1999). XX DR [3] (Consensus) XX SQ Sequence 243 BP; 96 A; 39 C; 33 G; 63 T; 12 other; gacgtcacag actacaaaga ctacatagac tacaaactmy ggacagacga actgtatttt 60 tttatatata rktaatgata agacgtcaca gactacaaag actacataga ctacaaactm 120 yggacagacg aactgtattt ttttatatat arktaatgat aagacgtcac agactacaaa 180 gactacatag actacaaact myggacagac gaactgtatt tttttatata tarktaatga 240 taa 243 // ID CER5-LTR_CE repbase; DNA; CEL; 211 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER5-LTR_CE is a long terminal repeat of the CER5_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER5-I_CE; CER5-LTR_CE; KW CER5_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER5_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 211 BP; 62 A; 51 C; 23 G; 75 T; 0 other; agacgtgaac gtgttcgcac tacttggcga taacaactcg acaataagaa gcacgtgaac 60 tctcgaactc tcttattatt atttcacgta ttatccacat gtacattcct tgatcgatcc 120 atcatattcg atcattcctc actattctat catcctcttc tgctttctaa taaagcttct 180 tctattctta ttacaaattg aatatataac a 211 // ID CeRep58 repbase; DNA; CEL; 48 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE C. elegans repeat 58. XX KW MSAT; Satellite; Simple Repeat; C. elegans; CeRep58; KW minisatellite; repeat. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-48 RA Jekosch K.; RT "C. elegans repeat 58."; RL Direct Submission to Repbase Update (25-APR-2002). XX DR [1] (Consensus) XX CC Putative novel minisatellite repeat, found 198 times in the CC C.elegans genome. For further analysis compare CC http://www.sanger.ac.uk/Projects/C_elegans/repeats. XX SQ Sequence 48 BP; 22 A; 9 C; 4 G; 13 T; 0 other; cctcatcttc agaaaaatat aaattacggg caaatcaaca attaaatt 48 // ID IR1_CE repbase; DNA; CEL; 379 BP. XX AC U86946; XX DT 07-SEP-1999 (Rel. 4.08, Created) DT 07-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE Putative nonautonomous DNA transposon, consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; IR-1; IR1_CE; KW nonautonomous DNA transposon; TIR. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-379 RA Devine E.S., Chissoe L.S., Eby Y., Wilson K.R. and Boeke D.J.; RT "A transposon-based strategy for sequencing repetitive DNA in RT eukaryotic genomes."; RL Unpublished. XX RN [2] RP 1-379 RA Devine E.S.; RT "IR1_CE."; RL Direct Submission to Genbank (21-JAN-1997)Molecular Biology and RL Genetics, Johns Hopkins University School of Medicine, 725 N. RL Wolfe Street, Baltimore, MD 21205, USA. XX DR GenBank; U86946; Positions 1 379. XX CC IR1_CE copies are flanked by 2 bp-long target-site duplications; CC IR1_CE has 143 bp-long terminal inverted repeats. About 20 copies CC of IR1_CE are dispersed in the genome. XX SQ Sequence 379 BP; 104 A; 87 C; 98 G; 90 T; 0 other; ctcggcattc agaggcaact tcctgccaca ctttggcaac ttatttagtg ccactaaaac 60 aagttgccga attgctggca ggaagttgcc gagttgttgg caggaagttg ccacgtagtg 120 gcagcaagtg ggaaaaaact ctcagcaaca tccaaatgtt ggcaggaatg tgcaaaagtt 180 tggcaggaag ttgcaagtaa gcaaagtttg gcaggaagtt gcctctgcat gccgaggagc 240 gttttttccc actagctgcc actacgcggc aacttcctga aaacaactcg gcaacttcct 300 gccagcaatt cggcaactta ttttagtggc actaaataag ttgccaaagt gtggcaggaa 360 gttgcctctg aatgccgag 379 // ID PALTA1_CE repbase; DNA; CEL; 1466 BP. XX AC . XX DT 23-AUG-1999 (Rel. 4.07, Created) DT 23-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE PALTA1_CE is a putative nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; PALTA1_CE; KW TA target site; TIRs; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1466 RA Kapitonov V.V. and Jurka J.; RT "PALTA1_CE."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [1] (Consensus) XX CC PALTA1_CE is a putative non-autonomous DNA transposon; it CC has almost perfect 600 bp-long terminal inverted repeat and CC generates duplication of TA target site upon its integration CC in the genome. Presumably, multiple transpositions of PALTA1_CE CC were induced by Mariner/Tc-like transposase. CC PALTA1_CE copies are ~90% identical with the consensus CC sequence. There are several subfamilies related to PALTA1_CE CC in the C.elegans genome. XX SQ Sequence 1466 BP; 502 A; 227 C; 223 G; 499 T; 15 other; ttaggtcgtt caataagttt tttcgttttt ttctagtttt ggttttttct tggatattat 60 tttgcggttg tatagaagac tgttagtaat gacaagccca acaatatcag ccatatcggt 120 caacttccag attatttaaa actccgaact atgacgacga aaaaaaaatt tttttttcga 180 attttttcat ttctttatga aatcaaaaac tttgtatgcg tagaggtatt agacagttag 240 ttaaacaaaa ctgaaaaatt tcagactcag gcatagctta ttggtagcgc tacatctgtt 300 ggaagtgccg aaaacgcttg caaaattgcc ttttttatgt ttgagcggtg ctaactttga 360 aaccatgaga gatatacaaa agtaatgtat tacaacatta atgtacacat tatmagctac 420 awtttatcag tatgttacat tgttgkmcct cctcacgttt ctgagttaca gtcgatagaa 480 gtgagcttgg tccaattttc aaaaaaccct aatttttcat gttttcgcct gtaactccga 540 aatcaggcgt ctgatcaaaa atctttaaat taaaaaataa gtggttttgc aaadaacaaa 600 aaatgcatga tcgaaaaaat atgaaaaata ttgcaaaaaa ctattttatt ctttccaaaa 660 caacatcgtt haaaaataca tgtaatattt ttattactaa ttaggctatc ataatcaaaa 720 aacagttaga ggttatcttg taaaaaaatt ttttggatwt ttttttcgat gtttttttga 780 aaattttctg aaatttccgc aacttttctt tgacgattca tatctttgcg cacatatgag 840 atatgacaaa acggtcaact gagtacctgt ttataattwa aaagcactta wtttttaatt 900 taaagatttt tgatcagacg cctgatttcg gagttacagg cgaaaacatg aaaaattagg 960 gttttttgaa aattggacca agctcacttc tatcgactgt aactcagaaa cgtgaggagg 1020 kacaacawtg taacatactg ataaawtgta gctkataatg tgtacattaa tgttgtaata 1080 cattactttt gtatatctct catggtttca aagttagcac cgctcaaaca taaaaaagkc 1140 aattttgcaa gcgttttcgg cacttccaac agatgtagcg ctaccaataa gctatgcctg 1200 agtctgaaat ttttcagttt tgtttaacta actgtctaat acctctacgc atacaaagtt 1260 tttgatttca taaagaaatg aaaaaattcg aaaaaawttt tttttttcgt catcatagtt 1320 cggagtttta aataatctgg aagttgaccg atatggctga tattgttggg cttgtcatta 1380 ctaacagtct tctatacaac cgcaaaataa tatccaagaa aaaaccaaaa ctagaaaaaa 1440 acgaaaaaac ttattgaacg acctaa 1466 // ID CeRep54 repbase; DNA; CEL; 228 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE C. elegans repeat 54. XX KW MSAT; Satellite; Simple Repeat; C. elegans; CeRep54; KW minisatellite; repeat. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-228 RA Jekosch K.; RT "C. elegans repeat 54."; RL Direct Submission to Repbase Update (25-APR-2002). XX DR [1] (Consensus) XX CC Putative novel minisatellite repeat, consists of repeated CC 7mers, found 29 times in the C.elegans genome. CC For further analysis compare CC http://www.sanger.ac.uk/Projects/C_elegans/repeats. XX SQ Sequence 228 BP; 65 A; 65 C; 34 G; 64 T; 0 other; gacgctatac gctatacgct atacgctata cgctatacgc tatacgctat acgctatacg 60 ctatacgcta tacgctatac gctatacgct atacgctata cgctatacgc tatacgctat 120 acgctatacg ctatacgcta tacgctatac gctatacgct atacgctata cgctatacgc 180 tatacgctat acgctatacg ctatacgcta tacgctatac gctatacg 228 // ID CER2-LTR_CE repbase; DNA; CEL; 643 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER2-LTR_CE is a long terminal repeat of the CER2_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER2-I_CE; CER2-LTR_CE; KW CER2_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER2_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX SQ Sequence 643 BP; 144 A; 171 C; 132 G; 196 T; 0 other; tgtaaggttc taaccttctt tacatctctc ttccttcccc aatcccccag tgatagatta 60 ggagtagagt agattagaat aaactcattt atttgcagga agatcggtca gaagtcagtc 120 acgcattgga tcctacgcgt tcattgttac cagcccatcg gcacatcttc ggatccaacc 180 gtgcacaaga gccgaacgct atcgcagcgt caattcctca ataaccgagt tatttcgcct 240 actctatgcc cacaaacttc cccctaccgt aaccctgtgt gtgtgtgcgc acgcaagtgt 300 gcgcgcgcgc gggggagcgc gcaccttgtt tacgtcttct ggacctttct gcggaggaat 360 ccagggctcc gccctgccac cgcagagggg tatataagac ggcatttcta tcactcgaga 420 tcttctttta ctttcttgta tccccctttt acctgtccct cctgttattc tttcttctgt 480 cacccactct gttgggggtt taattaatat aataaatgta ttctttagca catatgtgtt 540 ctgttgtagt atgggatgca acaactcgtt tgtcatcatg ataattcggg ggaacacgct 600 taaaccttta ctggtagcag ctagtgcagc ttaaaccttt act 643 // ID CELE14A repbase; DNA; CEL; 177 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 03-OCT-2005 (Rel. 10.11, Last updated, Version 2) XX DE CELE14A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE14; KW CELE14A; nonautonomous DNA transposon. XX NM CELE14A. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-177 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification of putative nonautonomous transposable elements RT associated with several transposon families in Caenorhabditis RT elegans."; RL J. Mol. Evol 43(1), 11-18 (1996). XX RN [2] RP 1-177 RA Kapitonov V.V. and Jurka J.; RT "CELE14A."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [2] (Consensus) XX CC Original classification of CELE14A as a Mariner-related CC nonautonomous CC DNA transposon [1] is wrong since CELE14A generates 6 bp-long CC duplications of non-specific target sites. CELE14A has 51 bp-long CC terminal inverted repeats; its copies are ~92% identical to the CC consensus sequence. XX SQ Sequence 177 BP; 45 A; 40 C; 41 G; 51 T; 0 other; cacgtggtgt caggctgtct cattgcggtt tgatctacga aaaatgcggg aatatttttc 60 cagaaaaatt gtgacgtcag cacgctctta accatgcgaa atcagatgag atgtctgcgt 120 ctcttctccc gcatttttcg aagatcaaag cgaaatggga ctttctgact ccacgtg 177 // ID LONGPAL1 repbase; DNA; CEL; 2591 BP. XX AC . XX DT 09-JUN-1999 (Rel. 4.05, Created) DT 09-JUN-1999 (Rel. 4.05, Last updated, Version 1) XX DE LONGPAL1 nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; LONGPAL1; KW TIRs; nonautonomous DNA transposon; palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2591 RA Kapitonov V.V. and Jurka J.; RT "LONGPAL1."; RL Direct Submission to Repbase Update (JUN-1999). XX DR [1] (Consensus) XX CC LONGPAL1 is a putative non-autonomous DNA transposon; it CC forms almost perfect 1200 bp-long palindrome and generates CC duplication of 9 bp-long target site upon its integration CC in the genome. CC The length of the target site duplications indicates CC that transposase encoded by the autonomous LONPAL1-like CC element may be related to Mudr-like family. CC LONGPAL1 can be still mobile since some of its individual CC copies are more than 99% identical with the consensus CC sequence. CC There are subfamilies distantly related to LONGPAL1 (60-70% CC similarity) which also have been transposed relatively recently. XX SQ Sequence 2591 BP; 908 A; 390 C; 376 G; 917 T; 0 other; ggcaaatact gaaatactac cttgtgttaa tactactttt gcttgaatac tactattgct 60 agaatactac ctttgctaga atactaccaa attgctagaa tactactatt gctagaatac 120 tgccattgct aaaatactac ttttgttgct taaatactac cttttcagaa aattttataa 180 ttttattcat tttttcattg gaattcttat taagtactat ttctggtcag atattttgaa 240 taagaaataa ttcacacagc attttgtatt gtgttaagca ctaaaagtgg cttgaacggt 300 tttattttgg gatccaatag gagagtatta atttgtagtg aaataggttt aaattaagct 360 gtggaccgtt ctcacaaaaa tactccgatt tcaaaaattg ccaatttgtg attgaaatgg 420 ttgtaaaaca acaataaact aattgcagaa ttcaaatatt tcttcccaga gtgaggttgt 480 aaatacccac acttgttcat agttttacga ctacatatcg tctccttttt gttttgatga 540 cacacagtta tggctaactg attcaaacat atagctacta aagtatttca ctttcaaaca 600 ctatatctac aagcaaacat atttgcttat caatatcagc catattggac accttaaaaa 660 catatcccat cttcaaactc tcacgatggc tgaagaaaaa cttttttttt ttgaattttt 720 caagtttttt tttaagttta ggtattaatt tgctctgatt tatcaagata ttaaacagct 780 atttgatttt ttcatacaat ctggtttgtc gcctattgat ggcgctctgt tcataatccc 840 aaaaaaaact tacaattaga aaagttattt tcaactttga acggtgttta ctcaagaaac 900 ttaaaagtta gaaaaaatta aatattgcaa tactgataaa cacactacga gctacatttc 960 tggtgttaac cattttcctg tatctcttct ggtttctgag ctacgttgtg tttatatact 1020 cagacgctgt taaaatttca tttcatacta ttccaaactc tagcctgtag ctcaaatttc 1080 agttgtctaa taaaaatgct gtcaactgac aaaacatttg tcgcactatt tggcacattt 1140 ggtttactgc gaataatgtc aaatggtgta ttcttgcaga cttatgttca atttgagaca 1200 aaaacaacat aaattgtaaa atttcactct ttcagatttc ttcacggata cacaatgcgg 1260 atacgtagag agaagtttta attgattaag aaataaaatt attttcatat aagtttttaa 1320 ttgtttcttt aaaattatgc tcaccatttt ttcgcaaact catcttgtgt aattgagcat 1380 acataattat gtaagaatac acaattttgc attactcaca ttagaaaaat gttcaaaatt 1440 aaaaaccaga tgttttgtca gttgactgtt ttttgattag acaactgaac tttgaattac 1500 aggccagaat gtgtaatagc atgatgtgaa attttaacag tgtctgagtg cataaacaca 1560 acgtaactca gcaaccgaaa gagatacagg aaaatggtta actgaagaaa tgtatgtagc 1620 ttttaacatg ttcattaatt ttgtaatgtt ttatttcttc tactttttaa gtttcttaaa 1680 tgagcgccgt tcaaatttaa aaataacttt tcctagcctt ttttgagatt gagcatcatc 1740 aacatgcgac aacaagaatg tatgaaaata tcaaatagtt gtttaatgtc ttgatacatc 1800 agagcaaatt aatacctaaa ctttaaaaaa aacttgaaaa attcaaaaaa aaagtttttc 1860 ttaagccatc gtgagagttt gaagatggga tatgttttta aggtgtccaa tatggctgat 1920 attgataaga aaaaatattt ttgcttgtag atatagtgtt tgaaagtgaa atactttagt 1980 agctatatgt ttgaatcagt atttgtgtgt catcataatt gtcataatca taatcataat 2040 catcataatc ataattgtgt gtcatcaaaa caaaaaggag acgatatgta gtcgtaaaac 2100 tatgaacaag tgtgggtgtt tacaacctca ctctgggaag aaatatttga attctgcaat 2160 tagtttattg ttgttttaca accatttcaa tcacaaatag ctaacaacta aaagcggagt 2220 tactttttga gaacggtcca cagcttaatt taaatcaatt tcactacaaa tgaatactct 2280 cgaatttgat cccaaaataa aaccgatcaa gccacttcta caatacaaaa tgctgtgtga 2340 attatttctt attcaaaata tctgaccaga aatagtactt aataagaatt ccaatgaaaa 2400 aatgaataaa attaaaaaat tttctgaaaa ggtagtattt aagcaacaaa agtagtattt 2460 tagcaattgc agtattctag caatagtagt attctagcaa tttggtagta ttctagcaaa 2520 ggtagtattc tagcaatagt agtattcaag caaaagtagt attaacacaa ggtagtattt 2580 cagtatttgc c 2591 // ID CER3-1-LTR_CE repbase; DNA; CEL; 432 BP. XX AC . XX DT 25-JAN-2005 (Rel. 10, Created) DT 25-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE CER3-1-LTR_CE is a long terminal repeat of the CER3-1_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER3-1-LTR_CE; KW CER3-1_CE CER3-1-LTR_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-432 RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RP 1-432 RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RP 1-432 RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER3-1_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (MAY-2003). XX DR [3] (Consensus) XX SQ Sequence 432 BP; 114 A; 101 C; 81 G; 136 T; 0 other; tgtaaggatt ccaccctttt cacttctcat tttacataaa taaatactaa tttatttgtc 60 tgtaaattct ctctgataat ttacagaaag aacggacgaa atcggtcgag gacaacgatt 120 tctacagcga aggcaatccg cgaggaggag aaactccagt tcaaaccgcc gccaagctac 180 agaaggtcgc attgtcgcgc cgagacccgt cgtcgttaaa ggctgcgtct ttaacccgga 240 aatgcctata taaggagagg acgacgctca tttgtgatca tttcttaact ctttaacccg 300 ttacttgtcg ttttcccttt tctctcgttt actctgtttc gacgatgcaa ttagtctgtt 360 cccttttact taataaaccc ttataagatc gatcagtgtg cgtgagtggc agttacatcc 420 tataccttta tt 432 // ID CELE14B repbase; DNA; CEL; 187 BP. XX AC . XX DT 16-SEP-1999 (Rel. 4.08, Created) DT 03-OCT-2005 (Rel. 10.11, Last updated, Version 2) XX DE CELE14B nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE14; KW CELE14B; nonautonomous DNA transposon. XX NM CELE14B. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-187 RA Kapitonov V.V. and Jurka J.; RT "CELE14B."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC CELE14B is a second major subfamily of CELE14 repeats. CC CELE14B and CELE14A are 89% identical to each other. CC CELE14B is flanked by 30 bp-long imperfect terminal inverted CC repeats and 6 bp-long duplications of non-specific target sites. CC CELE14B copies are ~91% identical to the consensus sequence. XX SQ Sequence 187 BP; 50 A; 46 C; 38 G; 53 T; 0 other; cacgtggtgc caggctgtcc cattacggtt tgatctacaa aaaatgcggg aatttttttc 60 ccaaaaaatt gtgacgtcag cacgttctta accatgcgaa atcagttgag aactctgcgt 120 ctcttctccc gcattttttg tagatctacg tagatcaagc cgaaatgaga cactctgaca 180 ccacgtg 187 // ID LINE2G_CE repbase; DNA; CEL; 2835 BP. XX AC . XX DT 13-SEP-2005 (Rel. 10.09, Created) DT 13-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE CR1-type non-LTR retrotransposon from C. elegans. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; LINE2G_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-2835 RA Jurka J.; RT "LINE2G_CE: CR1-type non-LTR retrotransposon from Caenorhabditis RT elegans."; RL Repbase Reports 5(9), 262-262 (2005). XX DR [1] (Consensus) XX CC This sequence is a composite of a truncated copy extracted from CC Genbank (AC084158 104839 - 107560) and an additional 5' 113 bp CC sequence. Therefore, it is labeled as "consensus." The 113 bp CC fragment is shared by other copies which are not 5' truncated. CC The sequence contains a large ORF and appears to be a recent CC insertion. XX FH Key Location/Qualifiers FT CDS 402..2579 FT /product="LINE2G_CE_1p" FT /translation="MPTRKSKILDVALSNSVISNLNIMAPIGSSDHNTIKF FT TVQLLQNISVPKKMIPDFSKLDWPCCATYFENIEWGKVFDTHSHVNDIYEC FT FLGHIRTMFSQAVPTMPMPETSKPVPQYLRSFEQMVKNQFSKTIMSKTMND FT FIEYIYLARKYRKKLRKYLSNLEVKLFKKKGNSKFSKHAKTLLKPRNCSVP FT ALIDNDENEIVSELDKANLLASYFEAQYTNQNNTNFALDDDFVSDQPIPWV FT TDAEVLKVIMASKKSCSVTSDQVPFTFIKILSPLISSVLAQIFNLSMMRGQ FT VPHAWTESIVIPLDKITNANNPTDFRPISLTSHLCRVYERCILAKIELFLI FT DNKFWSQNQHGFRPKKSTVTNMLESLNDWSKSIDLGEQIDIIYLDFSKAFD FT RLPHDLLLDKLVSLRMNKNLLIWLNEFLSNRSFRVRIGQTLSSRKLATCGV FT PQGAVLSPLLFGIYVNDISSILPEKVACKQFADDTKLYVSTPKTESENNLQ FT SALDAVVDWTRGSKLTLNQSETVHVTVGKRRTDFKYHLDGYPIERKAITRD FT LGFLISEKLDFSEHWRKCINLAKFQLANMFNQYSTSNKKLMILLYKTFIRP FT RLEYGTVVSSPTKKSDEKAIESVQNAFTRRLYSRIKKRYINPTDKDYKTAM FT ERNELFDLQTLSTRRKIIDRIQVIRMNNGKVDLKTSEFFEQQETHTRSKKK FT YIWSTGKSKLRRHFFVNRTLATMKQH" XX SQ Sequence 2835 BP; 960 A; 545 C; 492 G; 838 T; 0 other; agtgacgggt ctcagtaatc aatactctgt cttccgtacg gatcgtgcta aaggtaaacg 60 aggtggaggt gtttgtaatg tggtaaaagt agaaatccca tgtgaaacag tagtttcagt 120 tgaaaacgct taacgcttat gaaattctcg taaatgacct gaacattgac gtagttccct 180 ttagaatcat aaacatatat cgtgctcctt cttgcactta tcaaaatact gttagattaa 240 ttagtaaaat tgaagacctt gtggaacatc aaggcaaagt tgtagttgtg ggagatctta 300 atattggagg tatcgactgg ggcctcaacg actctaaaac taaattaggt tctttattag 360 tagatctgtg caaaaactat aatctggaac aatatgtcaa aatgccaacc cgcaaatcga 420 aaatactgga tgttgctctt tctaattccg taatcagcaa tttaaacatc atggccccaa 480 tagggtctag tgaccacaat accattaagt ttacggtgca gctcttacaa aatatatcag 540 tccctaagaa aatgatacca gatttttcga aactggactg gccctgctgt gcaacatact 600 ttgaaaacat agagtggggt aaggtatttg atacccattc ccacgtgaac gatatttatg 660 aatgctttct aggccacatt agaacaatgt tctctcaggc agtcccaact atgccgatgc 720 cagagacttc aaaaccagtg cctcaatatt taaggtcttt tgaacaaatg gtaaaaaacc 780 aattcagcaa aacgataatg tcgaaaacta tgaatgattt tatagaatat atttacctag 840 cgagaaaata tagaaaaaag ttgcggaagt atctaagtaa cctggaggta aagttattta 900 aaaaaaaggg caatagtaag tttagtaaac acgcaaaaac tcttctgaaa ccgcgaaact 960 gtagtgttcc agccctgata gataatgacg aaaatgaaat tgtctctgaa ctagacaaag 1020 caaatcttct agcatcctac tttgaagctc agtatacaaa tcaaaataac acgaattttg 1080 ctctagatga tgattttgta tcagatcaac caattccctg ggttactgat gcagaagttc 1140 taaaagtgat catggcctcg aaaaagtcgt gctctgtaac aagcgaccag gtcccattta 1200 cttttataaa aatactatcg cctctcatca gtagcgttct ggcccagatt ttcaacttga 1260 gcatgatgag agggcaagtc ccgcacgctt ggaccgaatc cattgtaata cctcttgata 1320 aaatcactaa cgctaacaat ccaaccgact tcagacccat tagcttgacc tcgcacttgt 1380 gcagagttta cgaaagatgt atattagcaa aaattgagtt atttctaata gataacaaat 1440 tttggtcaca aaatcaacat ggattccgcc ccaaaaaatc aacggtgact aatatgttag 1500 aatctttgaa tgattggtct aaaagtattg atctcggaga gcaaatcgat atcatatacc 1560 tagacttttc caaggctttc gaccggctcc cacatgactt acttttagat aaactcgtat 1620 ctttaagaat gaacaaaaat ttactgatat ggttgaatga atttctgtca aatagaagct 1680 ttagggttag aattgggcaa acgctgtcat ccagaaaatt ggccacctgt ggcgtcccac 1740 aaggcgcagt tctgtcacca cttttgtttg gtatctatgt taatgatata tcatcaatat 1800 tgcccgaaaa agtagcatgt aaacagtttg cagatgacac gaaattgtat gtatcgaccc 1860 ctaagactga aagcgaaaac aatttacagt ctgctctaga tgctgtagta gattggacaa 1920 ggggatcaaa actcactcta aaccaatctg agacagttca cgtcacagta ggaaaacgta 1980 gaacagattt caaataccac ctagacggtt accctattga gcgaaaagct attaccagag 2040 accttggatt tttaatctct gaaaagctgg atttctctga gcattggagg aaatgtatca 2100 atctagctaa atttcaactg gccaacatgt tcaaccagta cagtacctcc aataagaaac 2160 taatgattct cttatacaaa acttttatcc gccctcgttt agaatatgga acagtagtgt 2220 cttctccgac aaaaaaatcg gatgagaaag caatagagtc tgtacaaaac gcattcacta 2280 ggcgccttta tagtagaatc aaaaaaagat atattaatcc aactgataaa gattacaaaa 2340 ctgccatgga acgtaacgaa ctattcgacc tccaaacctt aagcacgaga cgtaaaataa 2400 ttgacagaat ccaagtaatc agaatgaata atggtaaagt agacttaaaa acatctgagt 2460 tttttgagca acaagagacc cacactcgtt ctaaaaagaa atatatctgg agcacaggta 2520 aatctaaact taggagacac ttttttgtca acagaacact agccacaatg aagcaacact 2580 gatctaatgt taccttatcc tcacttctat ccatcattta ttcttgcctt ttatacatcg 2640 atatttgtgt tttttgcttc ttctccaatg tagccattat tttcttctca tctctttttt 2700 atttttgttc cgttttattt atgtaattta tttatgtaac cacttgtgat ggtttgtatc 2760 tatttatgtg tgtttttttt ctacctcacg atatcttact atcgtaataa attaaattaa 2820 attaaattaa aactt 2835 // ID CER11-I_CE repbase; DNA; CEL; 7438 BP. XX AC . XX DT 06-AUG-2003 (Rel. 8.07, Created) DT 06-AUG-2003 (Rel. 8.07, Last updated, Version 1) XX DE CER11-I_CE is an internal portion of the CER11_CE LTR DE retrotransposon. XX KW LTR Retrotransposon; Transposable Element; CER11-I_CE; KW CER11-LTR_CE; CER11_CE. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RA Bowen J.N. and McDonald F.J.; RT "Genomic analysis of Caenorhabditis elegans reveals ancient RT families of retroviral-like elements."; RL Genome Research 9(10), 924-935 (1999). XX RN [2] RA Ganko W.E., Fielman T.K. and McDonald F.J.; RT "Evolutionary history of Cer elements and their impact on the C. RT elegans genome."; RL Genome Research 11(12), 2066-2074 (2001). XX RN [3] RA Ganko W.E., Bhattacharjee V., Bowen N.J. and McDonald F.J.; RT "CER11_CE: a family of LTR retrotransposons from C. elegans."; RL Direct Submission to Repbase Update (APR-2003). XX DR [3] (Consensus) XX CC FEATURES Location/Qualifiers CC gene 5120..5522 CC /note="predicted RNAseH region" CC gene 4136..4802 CC /note="predicted Reverse Transcriptase region" CC gene 3266..3593 CC /note="predicted Protease region" CC gene 2972..3185 CC /note="predicted Gag region" CC gene 6131..6752 CC /note="predicted Integrase region". XX SQ Sequence 7438 BP; 1999 A; 2193 C; 1597 G; 1649 T; 0 other; tccagagatg gattgtaggt tccgccggtc cctggaagag ggtgattagg agacttctcg 60 gctcaacacg ttcctcggcg acccaggtgg cggattcagc ccagctaagg tcagaacgtg 120 ttcaccatgc atcactgtga agagctgaga agagtcacat atagcctcag cttatgctcg 180 tccaagacgc tatagcttga gagcgtcaca ctatgtgtgg ccaataaaaa attacaaaac 240 tagcgtaaaa agttatgtgt tccatacttt ccgaatctta cgctgcgtca ccctgttccg 300 ctcatgatcc tgtcgaggtt gtccacacac aaacaaggag ctaggactgg aatgtttcgg 360 tgcatgaata ttccggtcaa cccaaaccgg agcaggtgct gacgcaaata tggatatatt 420 ctaaacgtgg gccgcccaca tacttatcca tgctaccagc gtcccccaac gcatggtaaa 480 tccctttcag atccccgaaa ggctctgtct cgtactgtga tgggtcagag atcacccaat 540 ctctgactag gcgtctcatc agacccgcat cattaggtgg cagaccactg tctgttacgt 600 tctaaaaaca gtagaaaatc caacaggttt tgtagacccc tatctaccta acttctcact 660 tttcagttaa ctggcttata ttgtcttcga caatcaccct gaaaatgagc tccaattcga 720 gtcagaaaca tcgttcgaaa gtcctaccgg atgaatctca acctcgcgag gtcatcccgg 780 aggcatcctc ggctaccctc ccaggtaacc aggtaaccaa cccgcaacgg aggtcacgcc 840 tctcaggcaa ctcgaggccg tccaagacgc cagagtgcgg gctccaaacg aagccccgcc 900 gcccctatca aaggttacat ctcccgtgtc gcaaacaatt tgcggtcgtg ccatgacgac 960 gccacggtcc tgatgatgga acatcttgga gttgacccca atccactcca tcccattgac 1020 caatacacgc ttctcgctgc taacaagatt ctgcaaagaa ccaagcggta tgctgaagcc 1080 ctggagaatc tacgccacta cgtagatgac aagttccaag aacctgtcct gcgaggctcc 1140 ccactcaagg acgtgtacca cgaacaagtc caagagcatc tcgcacggct ccagccaaag 1200 tcactcttga ctgaggcaaa acgagacatc actatgctcg aaagagagct gctcaatcac 1260 ggctttccca tcaccactag tgatcctcag gagctcgtac tcacccccta tgagtacgag 1320 acatcggaag acggtacctc ctcatccgac gtcgacgatc ttgccagctt cgatggcgcg 1380 ttcgacaatc tccgagaaac catgggatct gaccacgttc agatcgacca tcaaaatccc 1440 aaccctcgtg taacgatccc gagcgcgatt ctatcgcctc ccacaaacgg ttcgtcacac 1500 ttgaactacc gtacggtctc acaaccctct ccgctgacag ttcatagaga ttccgcactc 1560 ggatctctgt cccgccaacc ctctttggca gatgaacttc aagacgaacg ccatcagcat 1620 cgtcttagtc aaatcaggct gagagctctc gaagacactt tcatagctcg ccggcaagcg 1680 gatgaggaag ccgaggagct tggaagacag cagctctctc agtacagaga gatgagagcc 1740 gcaagagagc ggagactcga agagatgcgc tcccagcctt ctcctccaca agctccggca 1800 ccagctcctc gacgcccaca caccgtacac agcggagcgg aacccacgac cccggatctg 1860 gttccagctc cagctctcac tggctaccca actccggagc aactgctccc ccaggctatg 1920 ttgcaagcca tgacggagat gggaagactg atctcgcagc ttcaacgcga tcaaacccaa 1980 gccagaagag agcaaacttc gtttatgaat gaatgcaggg aacacctgcg tccccccgcg 2040 gaaggcagca tcgggcagag cgcctactcg ccagatgatg agggtgaaga gcagagccag 2100 cgtggatctt caccaccggt tcagcccatt cccgattcca gatccccgtc aggagttatc 2160 aacttcgaga ccaacgccaa gaacctcccc aagttcgatg gcactggaaa cttccgcgca 2220 ttcagaaacg gcttcgacac tgtcgtccta gacgacccac ggctcccctc ggtgaccaaa 2280 tgcaacctcc tacgcaatca ccttgtaggg aatgctcagc agtgcatctc gcacgatgac 2340 gatccactcg tggcctacca aaccacgatg gatatgctgg aatctgttta tggtaaagga 2400 gacactcagc gtggcctcct tgagcgcttc aggaaactga agttccacca gtccaaccct 2460 gaacagatga agctggatct cacttcacac cagctcttgg tgcaacgtct ggtctctacc 2520 ggactgagcg cgaccgacga ccgcatcact atgggtctta tcgggaaact ccctatctcc 2580 ttcagagaca aggtcacaga gttttacact gacatggatg atcatccgtc agcgatcgca 2640 ttctaccaga ggattcgaaa gcacatcaac agcttcgaaa atggcctcat tgcagcctct 2700 ctgcaaccac tccacgtggc gccagtgaat gagatcccgt ctcactacgt caaaggatcc 2760 gtccatgtcg tagatcaaaa acaacagcct aagaagggag aactgcgcca cccaaccagt 2820 tcctcgggtg gtcagaaaga gcgcgacact tccgcctttt acatcgaccc cgccaccggt 2880 gcccagctcg gaggtcactt aagaccaggc aaaagaggcg tccacctaac cctcattgcc 2940 cgcactttcc ccttgccaga tgagacatcc aaaaaacctt gtgctgcctg tggaggctcc 3000 catagcccca caagatgtca tctcactagt caagctttcc gtgaagcgac tgctcaaaag 3060 gggctgtgtg ccaactgctg cggaaaacac gccatcgaac agtgcaaatc acacttcacg 3120 tgatcatact gcgatggaca ccaccactct ggcggatgcc cgaaaaaaga gtactataga 3180 gacttggcta actaccctcc agggcaaagg cccctatcca gttatttcgt gacgtccctc 3240 tcccacgaga gaggggaagc cgcaacacct cccactaaat cattgtgaac tcatctcaca 3300 tctcaaggca aagcaggtcg gcttgatctc ctcgggttcc gctaagctct ccttacaagg 3360 ctttcatgag gcgtcgactt cgatatcaaa aattttcaaa atatcattaa aaaataaaca 3420 ctcttcggaa agtgtagggt tcctcatcgc aggatactcc gacctcccag ctacagcctt 3480 cagagctccc gcgttctctg atcaagatgc acatcatcta gactccctag agatcgatca 3540 cctatcgatc tcgtctcaga gaacgttcga tggcaaacgt attgatatga tcctgggaaa 3600 cgatgtccta acatgcctcc acggcgacag gcacacccgg cgtcaccagc taccctccag 3660 acgagtcgtc gatgacactc gtattggcta cattgtccac tccgtcccgt cgctaatact 3720 ctataccagt gatgagcgga agtgggtctt caatgatcaa aatggcctaa cgcactccct 3780 tatgctagcc aacatggtac tggatcatca atacgtggaa gaccccgagc taaaactgca 3840 ctggtccatt gaacagttat ggaaattcga gaatctcgga atcgaaccca tcccattggt 3900 cgacgagaca aagaaatcta cccaagatct tcttgccgag ttccagcaga acgcctgtta 3960 cactaacggc gtgctcgaag tcgccctacc tttcaatggg aacgaagaga aacttaaaaa 4020 taattatgcc atcgcctaca agcgactgtg tagtctccac gagactctca ccaaaggcaa 4080 aaatttaatt actaaatatg acagggtgat caaagaccag ctactggcag gcattattga 4140 gttggtcacc ccagagatga aaccagattc cccgatcgag tatttcatgc ctcatcgcgc 4200 ggttatcaaa gagtcctcta acaccacgaa gcttagagtt gtacttgacg catctagccc 4260 catcggcaaa gatctctcgt tgaatgattg cctccacgca ggaactaatc tcttgacccc 4320 actctacggt atcttgttac gatcgcgctg ctaccgatac atcatcgtag cagacatcga 4380 gaaggctttc catcaagtcc gactccaagt aaaacaccga agtgtgactc aatttctatg 4440 gctagccgac ccttcgcagc cagccaatgc cgacaacgtt gtccggtacc gtttcactag 4500 gattccattc ggggtcgcca gcagcccctt tctgctcggc gccgctatcc accactttct 4560 ggggcgcaac ccccacaggc tcaacaacga aatcagagac aatctgtacg ttgataactg 4620 tatgctcgga actgatgact tcactaaagt catgcccact gccatggctg ccaagtccat 4680 cttccgaaag atgaacatga atcttcgcga attcgtcacg aattgcgacg gcatcatgca 4740 gcacatccga gcagaagacc gagccgagtc cagagatatc aaactcttgg gttgcatgtg 4800 gaattccaac gaaaccgttg acacgtattc aataaaaatt gccgttctgg atattgatca 4860 tccgaccaag agagaagtag cctccaaact cgcagagacc tttgatccgc ttggtctggt 4920 aactccaatt ctagtacagt tcaaacgact catccagcaa ctctggattg caggcgtaag 4980 ctggaaagat cgcattccga tcgaactgct gcccttatgg cgaaacttgc aaaagtcatt 5040 cgtagataag tcaatccacg ttgaacggcg tctaacattc gtcaacgagg aagtcatcga 5100 ctgtcagttg atcatcttca ctgacgcttc tcaagacatc tacgcggcag cagcgtacgc 5160 ccatttcacc tataagaaat ggccgccagt taccaggctt atcactagca aaagcaagat 5220 caaagaagtc tctgccgcga attacacaat tccgaagttg gagctactcg ggattttatg 5280 tggatcaaac ctcgcagtaa ccttatcgaa agagcttcgt ctaccgatct catcgattaa 5340 actcttcaca gatagctcgt gcgcactata ctggatcctc tctgccaaga acacacgtgc 5400 ttgggttcac aacagagtcc agaagtatca cgaaaattgt gcccggatgt cagagtgcgg 5460 cctttcaacc tctctacatc acgttcctac caaagagaat cccgctgact tagcgactcg 5520 aggaatgtca actaccgagc ttcagaagag tttattctgg ttccgaggcc ccagattctt 5580 ggctaaccca ccagaatcct ggccacaaaa gattgaaggt acaataactt gccccgccga 5640 gtttcaagac ctggtgtaca aggagatcat agataccagt acagagaaga agaagtccaa 5700 acccctcatc gaaaaggcta ttccagcggc tcccaaagct acggagtcag ttcttcatct 5760 cactaccggt cccttcaaat cttttatccc gttcgattac tcgaactctc taaccaaatt 5820 atcaaatatt atatacagaa cattgaactc agtgtgcaaa atgtttcctg gaaagtcttg 5880 ggactccgaa atcatggttg agttcaaaaa ctccgagtca gctctccata gacgcaaact 5940 cgtgaggaag ctcattatcc ttcaccatta tcgtgagtcc gaagccctcg gtcttaagct 6000 cccagcagac ttagactatt acgtagacag ccacggtttc tacctcgtaa agaaacaagt 6060 gacctctcac gcgcttcccc aagaagccaa cgagccggtt atcctcttca aggaccatcc 6120 cttggctacc ctcgttatgc gtgagacgca cgtcatcaac ggacacagtt ccgaactgta 6180 cacggtctcc gctgccaaaa ccatgttttg gatcccgcat atcaaggttc ttgcaaaatc 6240 cgtagtctct aactgcgttg actgcaaaaa ggttcacgga ctccctttcc gttaccctaa 6300 ttccaaaacc ttgccggaaa agcgaacgtc gccctccaag ccgttcgcga cagcgggtct 6360 agactatatg ggtccaattg agtacctaaa agatgacggc gtcaccattg gcaaatctta 6420 cgttctagtc tatacttgcc tggtcactag aggcgccatg ttacgtgtgt tacctgatgc 6480 caccaccgag acctacctca tgggtctgag aagcatcttt cattgtgtcg gctccccaac 6540 cgatatctat tccgacaacg cagccatctt caagctcgga gcgtcgatgt tgaatgatga 6600 cattctctct ggagatgagt taagcgactc gttgactagc tacctggcta gccaacaaat 6660 caacttcttc tatatcactc ccttgtcccc ttggcaagga ggagtatatg aaagaatcgt 6720 gggacttcta aagcaccaac tctataaagt cagctcggtc gaaaaactct ctatgttctc 6780 tctccagtac ctagtctccg gtgctcaggc gatgatcaac agcaggcctt taacccctca 6840 tgctcggtct ccaaacgaca tgattgcact tcgtccaatc gatttccaac ttcccggagt 6900 tatgcttgac atacctttcg tgcaccccac gggtaacgga aggggagctg aagaacgtgc 6960 ccgagcacat ctcgctcaac tcgagactgc tcttaacaga ctctggcaaa tctggaccct 7020 aggttacctc tttcacctac ggaaagcgaa acatcgcaat aagaagtgca cttcgataaa 7080 accagctgtg ggacaggttg tcctcatcga cacaaaccac gtaaataggc acaaatggcc 7140 ccttggagtc attctccaag tacacgagtc caaacgcgat cacgaagttc gaaccgctac 7200 agtcaaagct cacggcaagc gctgcctacg ctccgtttgc cagctcatcc ctctcgaagt 7260 acaagcttca gaggatttca cctccgccga tcccccatcg gaaggagatc ttgtggaact 7320 cgaagagcac gattgcgacg atccaacgtc cgatatcccg actcaagcat acttcgaaca 7380 ctcccgctcg accgctcgaa cattgctccg tgtctctccg cgcgtatctg aaatacgc 7438 // ID HELITRONY3_CE repbase; DNA; CEL; 193 BP. XX AC . XX DT 25-MAY-2001 (Rel. 6.04, Created) DT 25-MAY-2001 (Rel. 6.04, Last updated, Version 1) XX DE HELITRONY3_CE is a family of non-autonomous Helitron DNA DE transposons - a consensus. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW HELICOPY3; HELITRON class; HELITRONY3_CE; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-193 RA Kapitonov V.V. and Jurka J.; RT "HELITRONY3_CE."; RL Direct Submission to Repbase Update (AUG-2000). XX RN [2] RP 1-193 RA Kapitonov V.V. and Jurka J.; RT "HELITRONY3_CE."; RL Direct Submission to Repbase Update (MAY-2001). XX RN [3] RA Kapitonov V.V. and Jurka J.; RT "Rolling-circle transposons in eukaryotes."; RL Proc Natl Acad Sci USA 98(15), 8714-8719 (2001). XX DR [2] (Consensus) XX CC There are about 50-100 copies of HELITRONY3_CE in the C.elegans CC genome. CC They are ~99% identical to the consensus sequence. CC This family is a nonautonomous HELITRON-like DNA transposon. CC Several copies are identical, therefore, it is likely that CC HELITRON-like transposons are currently active in the C.elegans CC genome. HELITRONY3_CE and HELITRONY2_CE share identical 22-bp 5'- CC and 38-bp (one mismatch) 3'-termini. Their internal portions CC are different. CC A 49-bp 5'-termini of the autonomous HELITRON2_CE transposons is CC 94% identical to the HELITRONY3_CE 5'-termini. These transposons CC share 84% identical 38-bp 3'-termini. XX SQ Sequence 193 BP; 48 A; 44 C; 48 G; 53 T; 0 other; tcatttacta tatataaagc gcgtgtcctt ctgtccctat gtagtttgat ctctgatcag 60 agcaacgaaa ttttgggaaa ccttatgcta aatgcgcagc agacatcgat cgaggtccgc 120 gatagacacc gtatatctaa tgtgttttct caaaaaagtc gggggccgcg ccgtaggtgc 180 ggtccacggc tgg 193 // ID CERP16 repbase; DNA; CEL; 233 BP. XX AC M14334; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE C.elegans CeRep16 repetitive element. XX KW CERP16; CeRep16 repetitive sequence. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-233 RA Jones D., Russnak H.R., Kay J.R. and Candido P.E.; RT "Structure, expression, and evolution of a heat shock gene locus RT in Caenorhabditis elegans that is flanked by repetitive RT elements."; RL J. Biol. Chem 261, 12006-12015 (1986). XX DR GenBank; M14334; Positions 125 357. XX SQ Sequence 233 BP; 83 A; 48 C; 34 G; 68 T; 0 other; taaatattct tacaaagtaa gcaaattcta tgaaattatc taaaaaatcg gggaaacatt 60 tcaaaacggc acagttttaa gtttttctgt cttataaaaa attcctctaa acatttctgc 120 cttcttccta tgtgaaatcc ggcaaatctg atatccggca aaccgcgaat cggaaaattg 180 ccggaactta aaatttccgg caaatcggca aaccggcaat ttcccaaatt tga 233 // ID SINE1_CE repbase; DNA; CEL; 261 BP. XX AC . XX DT 08-SEP-1999 (Rel. 4.08, Created) DT 08-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE SINE1_CE is a putative SINE-like retroposon - a consensus. XX KW SINE1/7SL; SINE; Non-LTR Retrotransposon; Transposable Element; KW SINE1_CE; non-LTR retroposon; retroelement. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-261 RA Kapitonov V.V. and Jurka J.; RT "SINE1_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC SINE1_CE is a putative SINE-like retroelement. CC It's possible that this repeat is a 3'-tail of CR1(LINE2)-like CC non-LTR retrotransposon since it does not generate duplications CC of target sites upon its integration in the genome and has CC AATAAA(TTA)n 3'-tail. CC There are about 50-100 copies of SINE1_CE in the C.elegans CC genome; CC they are ~96% identical to the consensus sequence. XX SQ Sequence 261 BP; 72 A; 55 C; 45 G; 89 T; 0 other; attccgtgat tattttcatt ccgtgattat tttcattccg tgattatttt cattccgtga 60 ttattttcac tcgggaatcg ctcgcccact atgggggagt ctacgcaagg acaacgcaag 120 gacaaggaca acattctaat ggaatggaaa caattgcccg actgcaccaa ttctagttca 180 agcgaacaat gataactttt gtattctgta ttccttcacg tctcccagcg agcgtaataa 240 attattatta ttattattat t 261 // ID TIR23T5A_CE repbase; DNA; CEL; 1658 BP. XX AC . XX DT 14-SEP-1999 (Rel. 4.08, Created) DT 14-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE TIR23T5A_CE is a putative non-autonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TIR23T5A_CE; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-1658 RA Kapitonov V.V. and Jurka J.; RT "TIR23T5A_CE."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [1] (Consensus) XX CC TIR23T5A_CE is flanked by 23 bp-long terminal inverted repeat. CC There are about ~50 copies of TIR21T5A_CE in the genome; they are CC ~97% identical to the consensus sequence. CC TIR23T5A_CE generates 5 bp long duplication of target-sites upon CC its integration in the genome. TIR23T5A_CE, is one of two major CC subfamilies. The second subfamily, TIR23T5B_CE, is younger than CC TIR23T5B_CE since its copies are ~98% identical to the CC TIR23T5B_CE CC consensus sequence. Both consensus sequences are 98% identical CC to each other; the TIR23T5B_CE consensus sequence is kept CC in the appendix file. XX SQ Sequence 1658 BP; 579 A; 256 C; 269 G; 554 T; 0 other; cacagcgcct caaatcggga gagttgtgaa aatcgagaat tgaaaagtcc aacggagcgc 60 gcttgcacac ttttacggta ttaaatgaat ttcagacgaa atttcgcgat tattcgagtt 120 ttcaagatga aatttggaaa aaataattaa atattgggat tattattgct ttaaaaaagt 180 tttaaaaaac attttattga acaaattcaa tatttggttg atttatcatt gattttcgaa 240 ccgattagaa caagttaact ctatgacaaa atttctacgc gaaacaaatt ttttcaagca 300 ttaatatgtt tgattccaat ttttaaacgt cagcttgata gaaataacac ggtttcaagt 360 cttgaaattg ctcaaaaatg cccaaatcaa cggaaactgt ttagttggca gttgatcttc 420 actaaaccaa cactaaatgt tcagatcaaa tcaaaacttg ttttatcatt tgaaaaaaca 480 tttttgaatc attgtttccc acaaaaatca gatgataagt agtatagcat gatgagaatc 540 catatgattc ttcatatgcg catagctata gatactgtaa cttcgacaac tcctatctcc 600 tcagagaggc aagatatcaa aaagtgatta actaagaaat tgtagaacat atcaaaaact 660 ataataatta attaaaatca tgtgtttatc tttcaagatg agggagttag agcagccagt 720 agaataacaa aaagtatgca atacaaatgt aagtctaact cccccatctt gaaagataaa 780 aacgtaggtt aaattgaata tcatagtttt tgatgtgctc tacaagtttt cagttgatca 840 ctttttgata tctcgcctct ccgagtagat aggagttgtc gaagttgagg tatctagagc 900 cgtagagggt aggaagaacc tgatggttcc ttattaagga aaagaaaata taacacgaaa 960 cttgtaaaaa atgttaattt gatcatttat gtaagtattt gagccatttt ttaaggaaaa 1020 ttggagagca acacttacat aagaaatttg gcgtgtcata gatcacttgg tagattggaa 1080 cattgaaatt tgatcagaac tatttgtatt tttagtgatt ttgattgatt tgtagcttta 1140 ttctattgaa atctttgatt tcataaaagt ttttaaacca attttgcaag atttcccttg 1200 ataaatgttg ctttactttt ttatcgattt cgttgtaaag aaaaatcatt tttctgtgtt 1260 atcagtcaaa ttaatgacta aaactcttta aaaagattca aactattaat aaaaattcta 1320 aaattaaaca tttatgcaat aaattgctct aaaatcaatt tttatttgca aaattctgtt 1380 tttggtcaaa aaaaattttc agattcgccg aaattcagag ctcatttcga gcttatttgg 1440 ggaaaaattt atgttaaaat gaagtaaatc gcatcgcgaa cagtctctgg gcggtcccgc 1500 atgttgaaaa ttgctttaaa atccctaaaa tccagcaaaa acgcgatatt ttactgtttt 1560 gctatacatt aattccaatt cgtgcgacgt ctgcagcaaa cgcgctcccg cgccaaaacc 1620 acttttttcg ctcttctctc ccgatttgag gcgctgtg 1658 // ID CELE4 repbase; DNA; CEL; 470 BP. XX AC . XX DT 20-SEP-1999 (Rel. 4.08, Created) DT 20-SEP-1999 (Rel. 4.08, Last updated, Version 1) XX DE CELE4 nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; CELE4; MITE; KW nonautonomous DNA transposon. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 19-120 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification and characterization of putative transposable DNA RT elements in solanaceous plants and Caenorhabditis elegans."; RL Proc. Natl. Acad. Sci. U.S.A 92(19), 8886-8890 (1995). XX RN [2] RP 1-470 RA Kapitonov V.V. and Jurka J.; RT "CELE4."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [2] (Consensus) XX CC CELE4 copies are ~80% identical to the consensus sequence. CC CELE4 sequences are flanked by 11 bp-long identical terminal CC inverted repeats, 140 bp-long subterminal inverted repeats and CC duplications of TA target-sites. XX SQ Sequence 470 BP; 152 A; 84 C; 90 G; 143 T; 1 other; ttatggggac gtggaaacgc gctccacggc caattgaaaa cgctccgccc ccgaaccatg 60 ggtctcgtta ggtatttggc ggcaaaaccg taaattcaaa cgttttcaat tagtttcgcc 120 gatttcgcca attttaaaga ttttttgtta tgttttccgt tgtttttaat gatttttatg 180 ccgaaattga tgaaataaaa atttgagtaa aattgaaaaa atcattagca tttcatttta 240 aaacatcttt tcgacataaa aatcattaaa aaatgaaaaa caaaccaaaa aaagacgaaa 300 aatgtttgaa aattggcgaa atcagcaaaa ctaattgaaa acgtttgaat ttacggtttt 360 gccgccaaat acctaacgag acccatggtt ngggggcgga gcgttttcaa ttggccgtgg 420 agcgcgtttg cacgtcgatt tcaaaagtgt ttttttctgc gtccccataa 470 // ID CELE45 repbase; DNA; CEL; 266 BP. XX AC . XX DT 23-AUG-1999 (Rel. 4.07, Created) DT 16-SEP-1999 (Rel. 4.08, Last updated, Version 2) XX DE CELE45 is a putative SINE-like retroelement related to RTE1 - a DE consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; CELE45; SINE; KW non-LTR RTE1-like retrotransposon; retroelement. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-266 RA Oosumi T., Garlick B. and Belknap R.W.; RT "Identification of putative nonautonomous transposable elements RT associated with several transposon families in Caenorhabditis RT elegans."; RL J. Mol. Evol 43(1), 11-18 (1996). XX RN [2] RP 1-266 RA Kapitonov V.V. and Jurka J.; RT "CELE45."; RL Direct Submission to Repbase Update (SEP-1999). XX DR [2] (Consensus) XX CC There are several hundred copies of CELE45 in the C.elegans CC genome [1]; these copies have been multiplied in the genome few CC million years ago since they are ~97% identical to the consensus CC sequence. About 20% of CELE45 copies (in a random sample of 30 CC sequences) are flanked by long direct repeats [2]. The length of CC these repeats varies from 24 to 265 bp, and different CELE45 CC copies are flanked by different repeats. These long duplications CC of non-specific target sites resemble RTE1 non-LTR CC retrotransposon CC Therefore CELE45 can be classified putatively as a RTE1-related CC SINE-like retroelement. XX SQ Sequence 266 BP; 62 A; 70 C; 74 G; 60 T; 0 other; aaaggattaa aaaaaaggat taaaggacga tccgttcttc aagtgctatg cactgcggat 60 ctgggattca ggtacactgc ctggtggtga tccctctggg ctgtaattta agccacgtcc 120 tagccgggga ctgtggccga taatccagtc gtggattgct ccacttccca atagaggctg 180 ggtgaaccta gggggtgagg ccggacttga actcgtgacc tccagactgc tagcggccac 240 cactaccgac tgagctatct gccccc 266 // ID LONGPAL2 repbase; DNA; CEL; 3232 BP. XX AC . XX DT 25-OCT-2000 (Rel. 5.09, Created) DT 25-OCT-2000 (Rel. 5.09, Last updated, Version 1) XX DE LONGPAL2 is a nonautonomous DNA transposon - a consensus. XX KW MuDR; DNA transposon; Transposable Element; Nonautonomous; KW 9-bp target site duplication; LONGPAL1; LONGPAL2; KW MUDR superfamily; PAL9A_CE; TIRs; nonautonomous DNA transposon; KW palindrome. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; OC Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-3232 RA Kapitonov V.V. and Jurka J.; RT "LONGPAL2."; RL Direct Submission to Repbase Update (OCT-2000). XX DR [1] (Consensus) XX CC LONGPAL2 is a non-autonomous DNA transposon; there are ~10 CC copies of LONGPAL2 in the genome; they are ~97% identical CC with the consensus sequence and are flanked by duplications of CC 9-bp target sites. LONGPAL2 has 970-bp nearly identical CC terminal inverted repeats. LONGPAL2 shares identical 10-bp and CC 5-bp termini with LONGPAL1 and PAL9A_CE transposons, CC respectively, CC which are also flanked by 9-bp target site duplications. CC Presumably, transpositions of LONGPAL2-like elements were CC triggered by MUDR-like transposases. XX SQ Sequence 3232 BP; 1070 A; 534 C; 529 G; 1097 T; 2 other; ggcaaatact cttttactac ctcatctttt actacgttgc tcttttacta cggtttactc 60 ttttactacg gccgtatttt ttactacgtt ttccaatttc gattttttac tacgaaatgt 120 atcttttact acggtcgttt tttttactac ggtttccgaa tttgattttt tactacgaat 180 cgtatctttt actacggccg ttttttttac tacaatggca ggttggattt attactacgt 240 tttgtatctt ttactacgat taacgatttt tactacctaa tgaaaagaag aatatcaaaa 300 gttggaattt gaaagttacc ttgtagcaga ttttaggtat aattacaatt tcattctgaa 360 atcaacatga aagtttcata gctgtgttct ttttccaact tttgaaatat actactgtct 420 gagcggtttt taactgaatc ctacgatatc ataacgggaa aatgaaattt atgaaaattt 480 tccaaacaaa actataaatt aagcactaat cagttattgt agaaaaattc taaatctgtc 540 gtttttgaaa acattcatgt caactttgtg atttggcata atattaagaa actcgacaag 600 tagggctcta tatacccaca tatcttttcg agcagaggga atgtgaaaaa ttgatcaatc 660 gcaaacttgt tagctcaaga taaggaacgc ttttcccgtg caaactattt gaaaaaaaaa 720 tctgaatgaa tccagattcg aaccagcaat gacactactg aaaagcgcgc gctttaccac 780 ttctccaaac aggcgagtgg cggccgccgt tcccaggaat atgtgaaacc aaccgagctg 840 tacgaagtag gaagcttcaa ttactttttc agatatttag ttctgaaaac tttgttaact 900 ctccttatca acaactaaat ggcccttttt gtagttattt ctactggaaa aaatgtgcct 960 gtaaataata tcctgtttat aaacgttgtt gcccaatgaa tacctaccct tgacctgaaa 1020 tttatcttga aaactacttt tttgagtgta tttttaaaca ttctacaaat tccactgaat 1080 aacagctcgg gaacttctgc gattcctcga ataagttatt ttcaaaaaaa aattaggccc 1140 aaaatttgtt tctgaaaaac ggccccaatc ttgtattttc cagtaataaa tgcaatctga 1200 atttttttcg aaattatttt tcgcatggaa tgtctgtttt tgggcttaaa tcgcactttt 1260 ttgtcaattt tgcttttaaa aaatatgaag tgtcgtccaa taaaatgcac ggtacgcaaa 1320 agtacgcaaa caccaacgtg taccgtactc aggaattttt tcaatttcga tttttaaata 1380 acaaaaagcg acgaaaaagt tacgtttaaa gccggaaaat gggattttga tacgaaaaat 1440 tatttttaaa aatatactga ttatataaac ttttggaaaa ataagaattg tagtggtttt 1500 tgagaaaaac attttaggcc caaagattat tttcatatgc ttcatttctg tatctcgtat 1560 caacgttcat gtacaatttt ctagctgaat atacaaccaa acacacaata tatttagaat 1620 tgagatggtt tcagaaaata ttttaggcca gaatgtatat tttgctccaa aatacaaaaa 1680 aaacaagcat acttatcgtt tttttgtgct caaaattaaa tggtcatatk ctcgacatga 1740 aaaaccgaca aaatagtcgg agaacttcag tttcgtagag aatttcacag tttgatatga 1800 ggagatatac agttttggat tggaatgctt atcaacttca tgcaaactca gtttcattat 1860 ggtcttcgag gaggcgcgat gactttaaca tggctggtgt agtggtagtg agtggagttt 1920 gcatcctaca tgtcatatgc tcaaaccttg ccagacaaat tgttttttgt tccgacagag 1980 tgtactgaat taatatattt taaatatttt aaagtcatta gaatcctatt tcaccatgtt 2040 ttacgcgagt ttccggtttt cgatccaaaa aaatttccag attttttttg ttggaatttc 2100 ttcaaaatat atttaacttc atggaaaaaa cacaattttg gtaattctac aagtttcgac 2160 attcacatca cattttcagt ctaaaaatca cctaattaga gttgattgag catagagaca 2220 tcctactttc atagacttat aactctgttg gcgtttgatt gatattattt acaggcacat 2280 tttttccagt agaaataact acaaaaaggg ccatttagtt gttgataagg agagttaaca 2340 aagttttcag aactaaatat ctgaaaaagt aattgaagct tcctacttcg tacagctcgg 2400 ttggtttcac atattcctgg gaacggcggc cgccactcgc ctgtttggag aagtggtaaa 2460 gcgcgcgctt ttcagtagtg tcattgctgg ttcgaatctg gattcattca gatttttttt 2520 tcaaatagtt tgcacgggaa aagcgttcct tatcttgagc twacaagttt gcgattgatc 2580 aatttttcac attccctctg ctcgaaaaga tatgtgggta tatagagccc tacttgtcga 2640 gtttcttaat attatgccaa atcacaaagt tgacatgaat gttttcaaaa acgacagatt 2700 tagaattttt ctacaataac tgattagtgc ttaatttata gttttgtttg gaaaattttc 2760 ataaatttca ttttcccgtt atgatatcgt aggattcagt taaaaaccgc tcagacagta 2820 gtatatttca aaagttggaa aaagaacaca gctatgaaac tttcatgttg atttcagaat 2880 gaaattgtaa ttatacctaa aatctgctac aaggtaactt tcaaattcca acttttgata 2940 ttcttctttt cattaggtag taaaaatcgt taatcgtagt aaaagataca aaacgtagta 3000 ataaatccaa cctgccattg tagtaaaaaa aacggccgta gtaaaagata cgattcgtag 3060 taaaaaatca aattcggaaa ccgtagtaaa aaaaacgacc gtagtaaaag atacatttcg 3120 tagtaaaaaa tcgaaattgg aaaacgtagt aaaaaatacg gccgtagtaa aagagtaaac 3180 cgtagtaaaa gagcaacgta gtaaaagatg aggtagtaaa agagtatttg cc 3232 //