ID LTR4_CR repbase; DNA; CHL; 352 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE LTR4_CR is a long terminal repeat of unknown LTR retrotransposon. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW Interspersed repeat; 5-bp TSD; solo LTR; LTR4_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-352 RA Kapitonov V.V. and Jurka J.; RT "LTR4_CR, a long terminal repeat from the green algae genome."; RL Repbase Reports 6(5), 257-257 (2006). XX DR [1] (Consensus) XX CC The genome harbors about 20 copies of LTR4_CR; they are ~90% CC identical to the consensus sequence. LTR4_CR is a solo LTR CC characterized by 5-bp target site duplications. The LTR CC retrotransposon it was derived from is not known yet. XX SQ Sequence 352 BP; 68 A; 90 C; 98 G; 83 T; 13 other; tgtaacgcgg gtacttcccg ttcctgacaa cttcgaaggg ccttttcccg tcgttacaat 60 tggtatcgag accacccarw gttccgatgk tryrrmwgmc graggtcayk ggttcgagtc 120 ctgcgtacgt cggaatttgg cttcgctcaa aaaaccggca ctaggggatt gcataccgtg 180 gtgccttctc gacgcggtga cgggtaatgc cggccgtggt gtcaaaatta ggtttctggc 240 gagcagtcac attcgcagat ccacccgagg acgtgtgatc caaggggggg agaatgtaac 300 gcgggtactt cccgttcctg acaacttcga agggcctttt cccgtcgtta ca 352 // ID Mariner-N6_CR repbase; DNA; CHL; 414 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE Mariner-N6_CR is a family of nonautonomous Mariner DNA DE transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW mariner superfamily; Mariner-N6_CR; nonautonomous DNA transposon; KW TA target site duplications. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-414 RA Kapitonov V.V. and Jurka J.; RT "Mariner-N6_CR, a family of nonautonomous Mariner transposons in RT the Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 134-134 (2004). XX DR [1] (Consensus) XX CC Mariner-N6_CR is a family of nonautonomous transposons. The CC consensus sequence is less than 5% divergent from ~50 CC Mariner-N6_CR copies. Features of Mariner-N6_CRs include CC imperfect 79-bp terminal inverted repeats and TA target-site CC duplications. XX SQ Sequence 414 BP; 95 A; 115 C; 98 G; 106 T; 0 other; tggctgtgtg cacattattt gacatggtca ggttgcgaca tgtcgacatg tgaaatgcga 60 catacaacat tgcgacatac caagactcgc ccacatttgc atagctacgc gtcctggagc 120 ccatttaagt aacatctccc agctttgcgg cggtgtattg cgctagcgta gccacaagct 180 cctcccactg gccgcaccct acgccacgcc accaccaccg tggcaaccga gacacgtttt 240 gttgtgtgcc acggcgttcc acgcttacca gaggcatatt gaatgcatgc acaatatgtt 300 tggcgggttt ggatgcgttt gggcggtatg tgcattatgt cgcaatgtcg tatgtcgcaa 360 ttcacatatc cacatgtcgc atcctgacca tgtcaaataa cttgcaagct gcta 414 // ID SINEX-1_CR repbase; DNA; CHL; 535 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE SINEX-1_CR is a putative nonautonomous non-LTR retrotransposon - DE a consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; SINEX-1_CR; nonautonomous non-LTR retrotransposon. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-535 RA Kapitonov V.V. and Jurka J.; RT "SINEX-1_CR, a putative SINE-like non-LTR retrotransposon from RT the green algae genome."; RL Repbase Reports 4(2), 45-45 (2004). XX DR [1] (Consensus) XX CC About 100 copies of SINEX-1_CR are present in the C. reinhardtii CC genome. They are ~98% identical to the consensus sequence. CC Usually SINEX-1_CR are flanked by 15-bp target site duplications. XX SQ Sequence 535 BP; 119 A; 126 C; 161 G; 129 T; 0 other; atcaccggat cgctaactca gttagcgatt tgactcgtcc ttacggcaga agggacccgg 60 gtacgaatcc ggataaaagc ccaattatgc aaaaggcgaa agattggtgc gaaaccgatt 120 ggtgatccca cgaacgatag gtaattgccc ttagtggcaa ttgcgggctt atgcccgctg 180 caacctagaa aggtcgtggt gcagaagtcc gatttagtgg cgaggtccaa ggttcaagac 240 aaggctcaag atccaaggct cgaggaggag cgccatggct cctcggtttg cacgaactgg 300 cagtgctcca ctactataac gcggcgtttc cctagctcga tatgctaggt gtgcaggctt 360 ggatgtagtg gactttgaag agcggcctag gacttggagg ttgtagtttc ggaggttgtg 420 actctttcgt ggtgaggcgt cagcgtgagg ggggcgggcc ctctcgccct agtcaccttg 480 ccccgttaat ccatgccagg ccctatgggc cggcgttgta attattatta ttatt 535 // ID Novosib-N1a repbase; DNA; CHL; 781 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 17-AUG-2004 (Rel. 9.07, Last updated, Version 1) XX DE Novosib-N1a is a nonautonomous DNA transposon - a consensus. XX KW Novosib; DNA transposon; Transposable Element; 8-bp TSD; KW Novosib superfamily; Novosib-1; Novosib-N1; Novosib-N1a. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-781 RA Kapitonov V.V. and Jurka J.; RT "Novosib-N1, a family of nonautonomous Novosib transposons from RT the green algae genome."; RL Repbase Reports 4(7), 195-195 (2004). XX DR [1] (Consensus) XX CC Novosib-N1a is a subfamily of Novosib-N1 nonautonomous CC transposons. The Novosib-N1a and Novosib-N1 consensus CC sequences are 76% identical. Novosib-N1a elements are flanked CC by 8-bp target site duplications and contain 17-bp terminal CC inverted repeats. This subfamily is composed of ~20 copies ~97% CC identical to their consensus sequence. XX SQ Sequence 781 BP; 161 A; 233 C; 221 G; 166 T; 0 other; caacccggtc tcacccggga cgtgcccata cacgcggcac cgtccaaggc ccaaaccgtc 60 ccaagtgcct attcatgaca tatgtatgtt atgaatagct gttatgggcc ctgccgcggc 120 ctgtccgctc gccgagctcg cgaaatgcag ctgcatgccc cgacattttg cgcgaagagc 180 tgcttttagc ccccgtagag actctctagc atctacctat agctacgtat aggcacggca 240 acgttgagat attgaggaaa tggctgcttg agggacctcg gtcgcgaaat ggccggtcgg 300 ctgattctcg cattttgctc cgggatacat tccaatgcgg tataatgtgt ctaaatagca 360 tgttgagtga gtttatgccc cctccagcac gtttccgctc gaccgtgccc gcacgtaccc 420 atcgagctcg ctggctggct ggccaaaacc cgtccatttc gaccgcaggc gagcgcagac 480 cgccctgcca ctcaaccagc aagtcaatgc tatgtatgag aagtgcctga gagcgcgggg 540 gagtcagggg tgaccttttt gccgtcgggg aaccccccta tctgccccca agaccccccg 600 cacggaaagg gccgtacggc actggggcaa atgtcggact acggatctga cacgctggca 660 aggtttgcgc aagctgccca taaggttaga ttcctagtcg gacttttgcc ccgaccggtt 720 ttgcggggtt gaagcgagtg caaatttgga cggtgcccac aacgcgggtg agaccgggtt 780 g 781 // ID Copia3-I_CR repbase; DNA; CHL; 5340 BP. XX AC . XX DT 13-MAY-2004 (Rel. 9.04, Created) DT 13-MAY-2004 (Rel. 9.04, Last updated, Version 1) XX DE Internal portion of the Copia3_CR retrotransposon. XX KW Copia; LTR Retrotransposon; Transposable Element; 5-bp TSDs; KW Copia3-I_CR; Copia3-LTR_CR; COPIA superfamily; endonuclease; KW protease; reverse transcriptase. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-5340 RA Kapitonov V.V. and Jurka J.; RT "Copia3_CR, a family of copia LTR retrotransposons present in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(4), 106-106 (2004). XX DR [1] (Consensus) XX CC Copia3-I_CR is an internal portion of the Copia3-CR LTR CC retrotransposon, whose LTR is deposited in Repbase as CC Copia3-LTR_CR. Copia3-I encodes a 1670 Copia3_CRp polyprotein CC (pos. 69-5078) composed of the C2HC zinc finger, protease, CC endonuclease, reverse transcriptase, RNase H. The PBS is CC complementary CC to an internal portion of tRNA Met. XX FH Key Location/Qualifiers FT CDS 69..5078 FT /product="Copia3_CRp" FT /translation="MDDSSSGLVKFKQLQDDAANYFTWLIQLYDFLTTRKV FT PLGVYLYTSTPPGRTALELADENGQALSWLRTSCTPNVARQLWQAPDAVTA FT LAYIQGLFRASAAQKLQQCDMERSCFQLEEGENMSQYFARASDLRDRIYAL FT GGTWEEPAVRLRLLNGLPLPAYEVLVEICNSQPQQTVAELRGYLMQHEVTK FT RFRQANGGHVGNVNRSRGGNAGGGGGAGGSSSSGGIVCHYCEQPGHIKPNC FT AVRQADVSAGIQRASIRSPVQLQQQQQQQQQQKKHGGGGGGGGGSGSGDGA FT KKRGKGGGGGSGARGGARSAASRPAHVLVVSSVDDWAEDYPSSSLIVHTFG FT AEPASSGAISSYSMADDGVPSASVIGDHGGCSSTSTFSYSSAGNGLSSSSY FT SGSAAPPPGAGSSLLSNTYRAVTRDFGGSNSSSTGAGVSSCSSGVAGPAGL FT GGRVLVVCDVSRGSMLSPSAGGGGSDDASGSVGGSFASAAAFDVPVIDSGT FT FAHITPHRRLLHDFVVRPRVSHVLWGDGRSSPVGGSGTLRMLSGQQRVDVT FT GVLYVPDAHLSLLSVRRLASSGAKVAFEGESATVELAGRSILSGRVRDGQY FT PLSVTMLPVLLAAPAMAPPAIGGGTAAPVSGGGAAATGSDSSFLPSSAAAP FT SGSGGTTSSSLNSSGASSSSNGTAASSGGGGTVSSSSSSSVTALSAAHRLH FT RRFGHVGWHSLMQMVNGSLVTGLDVDLSALSQAAESVCSTCVEAKAASSPF FT PDSSSEPQQPLALAHSDVCGPMPVMGRGGSRYFITLLDDATGVSAVRMLTT FT REHAGEALQEMIVQLENCHPGGGKLRNLRSDNGGEYRSEELQQWLRERGTV FT QQFSAPYMPQQNGAAERLNRTLMDRTRAILFDAALSSSFWPEAVTYASHLR FT NLSPSSSCSSTPWEALTGVKPDISSLRTFGCRVYVTLPADQRSKLNPRADI FT GTYLGLQRNSAAYRVMVGGKVVVSRDVRFDEDVRGPASRLAGVPFGSNSSA FT AAPAAVNPPAVAESPAPSPAPAAPPSRPATRSQRPVLTPPLSAFASAAGGG FT SHSNSSNSAASAAAAAAAQLFDEDSDDDETPPLAPPSDDEDSYGVSTATTA FT GVGEPSSYAEASSGPHAADWRKAMEEEMESQRDNKTWELAAPPPGVRLLAN FT RWVYKLKPQPGGAPRFKARLVVKGFAQREGIDYSEVFAPTSRYVSLRAILA FT IAAARGLSLHQMDVKTAFLNGDLDEELWMQQPQGFEVSGDGGGGDTGGGGG FT ADGGGSGGSSHRSVPLACRLLKSVYGLKQAPRCWYRKLSEELGGLGFTPAT FT ADPALFVRHDEAGPVYVLVHVDDLLIAAGCSAQLAAVKAAIGKCFEVRDLG FT EASTYLGMEIKRDPSTGDILLQQRRYVNELLQRHGMTDAKPRSLPLPAGTR FT VLAASEQQPVLDDGGPYRSLIGGLNYVAVSTRPDIAYALSVLARHMAAPTK FT AHLALATGVLRYLKHTVDMGLRFSASASGGGGGAGSSSAGGGAAAGYDAGS FT FVGYCDADWAGDPNTRRSQTAFLFALGKTVVSWCSQQQRTVAASSVESEYQ FT AAAAATKEALWLRKLASDLGLRSGAVAIRCDSQGALSLARNPIASSPLSKH FT IDIQHHLVRERVARGEVAVEYCPTEQMIADALTKALPEAKFFFCRAAMGVS FT T" XX SQ Sequence 5340 BP; 847 A; 1736 C; 1847 G; 910 T; 0 other; ggttatgagc cccgcttgtc gttaattcgc tcgcggttgg caagccttat tatcgctagc 60 taggtaggat ggatgactcg tcctcgggac tggtcaagtt caagcagctt caagatgatg 120 ctgcgaacta tttcacttgg ctcatacagc tctatgactt cctcacgact cgtaaggttc 180 cgctgggagt gtatttgtat acctcgacgc cacctggtcg tactgctctc gagctggccg 240 acgagaacgg ccaggcgctg agctggctcc gcacgtcgtg cacgcccaac gttgcccggc 300 agctgtggca agcccccgat gccgtcaccg ccctggcgta cattcagggg ctgttccggg 360 cctctgcagc tcagaagctg cagcagtgcg acatggagcg cagctgcttc cagctggaag 420 aaggggagaa catgtcgcaa tactttgcgc gtgccagtga cctgcgcgac aggatttacg 480 cgctcggcgg cacctgggag gagcccgccg tgcgactgcg cctgttgaat ggtctgccgc 540 tgccggcgta tgaggtgctg gttgagatct gcaactccca gccccagcag actgttgcgg 600 agctgcgtgg ctacctcatg cagcacgagg tgaccaagcg cttccggcag gccaacggcg 660 gacacgtggg caacgtcaac cgcagccgcg gcggtaacgc tggcggtggc ggcggcgcag 720 gcggcagcag cagcagcggc ggcattgtct gccactactg cgagcagccc gggcacatca 780 agcccaactg cgccgtgcgc caggccgacg tcagcgctgg catccagcgc gcctccattc 840 gctcgcccgt gcagctacag cagcagcagc agcagcagca gcagcagaag aagcacggcg 900 gcggcggcgg cggcggcggc ggcagcggaa gcggagatgg ggccaagaag cgcggcaagg 960 gcggcggcgg gggttcgggt gccaggggtg gcgcccggtc cgccgcctcc cggcccgcgc 1020 atgtgctggt tgtgagcagt gttgacgact gggctgagga ctaccccagc agcagtctga 1080 tcgtgcacac ctttggcgcg gagccggcga gctccggcgc cattagcagc tacagcatgg 1140 ccgatgacgg cgtgcccagc gcctccgtga ttggggacca cggcggctgc agtagcacca 1200 gcaccttcag ctacagctcg gccggcaacg gcctcagcag cagtagctac agcggcagcg 1260 ccgcgccacc cccgggcgcc ggcagcagcc tgctgtccaa cacctaccgc gctgtcaccc 1320 gggacttcgg cggtagcaac agcagcagca ctggagcagg agtttccagt tgcagcagcg 1380 gcgtagccgg gcctgcgggc ctcggcggcc gcgtgctcgt cgtctgcgat gtgagcagag 1440 gcagcatgct gtcccccagt gcgggcggtg gcggcagtga tgatgcgtca ggaagcgtcg 1500 gcggcagctt tgcgagcgcc gccgccttcg atgtgccggt gattgactcc ggcaccttcg 1560 cacacatcac gccgcaccgc cgcctgctcc acgacttcgt ggtgcgccca agagtgagtc 1620 atgtgctgtg gggtgatgga cgcagcagcc cggtcggggg atcggggact ctccggatgc 1680 tctctgggca gcagcgcgtt gacgtgacgg gcgttctgta cgtgccagac gcccacctct 1740 ccctgctatc ggtgcgccgg ctggcgtcca gcggcgctaa ggtagcgttc gagggggagt 1800 cggcgactgt cgagctggct ggccgctcca ttctcagcgg ccgcgtgcgt gacggtcaat 1860 atccgctgtc tgtgactatg ctgccggtcc tgctggcggc gcccgccatg gcgccgccgg 1920 cgattggcgg cggcaccgcc gcgcctgtga gcggcggcgg cgccgctgcc actggcagcg 1980 acagcagctt tcttccttct tccgccgccg cgccctcggg cagcggcggc accaccagca 2040 gcagcctcaa cagctccggc gcaagcagca gcagcaacgg caccgccgcg tcttcgggcg 2100 gcggcggcac cgtcagcagc agctccagct cgtcggtgac tgctctgagc gctgctcacc 2160 ggctgcaccg gcgtttcggc catgttggct ggcattcgct gatgcagatg gtgaacggct 2220 ccctggtcac tggccttgac gttgacctga gtgctctcag ccaggcagcc gagtcagtgt 2280 gcagcacgtg cgtcgaggca aaggcagcct cttcgccctt cccggacagc agcagcgagc 2340 cgcagcagcc tctggctctt gcgcattctg atgtgtgcgg ccctatgccc gtgatgggcc 2400 gcggtggctc tcgctacttc atcacgctgc tagatgatgc cacgggggtg tcagctgtcc 2460 ggatgctcac caccagagaa cacgctggtg aagcgctgca ggagatgatt gtgcagctgg 2520 aaaattgcca ccctggcggc ggcaagctgc gcaacctccg cagcgacaac ggtggtgaat 2580 accgttcaga ggagctgcag cagtggctgc gcgagcgcgg taccgtgcag cagttcagcg 2640 cgccttatat gccgcagcag aacggagccg ctgagcggct caaccgcacg ctgatggacc 2700 gcacgcgcgc cattctgttt gacgctgcgc tctcctccag cttctggcct gaagctgtca 2760 cgtatgccag ccacttgcgc aacctgagcc cctcctccag ctgcagtagc acgccgtggg 2820 aggcgctcac cggcgtcaag ccggacatct ccagcctgcg cactttcggc tgccgcgtct 2880 acgtcaccct gcccgccgac cagcgctcca agctcaaccc gcgcgccgac atcggcacgt 2940 acctgggcct gcagcgcaac tctgccgcgt atcgcgtcat ggtcggcggc aaagttgtgg 3000 taagccggga cgtgcggttt gatgaggacg tgcggggacc ggcgtcgcgc ctggcgggcg 3060 tgccgttcgg tagcaacagc agtgctgcag cgccggcggc tgtcaacccg ccggcggtcg 3120 ccgagtcgcc ggcgccctcg ccggcgccgg cggccccgcc ctcacggcca gccacccgca 3180 gtcagcgccc tgtgctcacg ccgccgctca gtgctttcgc ctcggcggcg ggcggcggaa 3240 gtcacagcaa cagcagtaac agcgccgcct cggctgcagc agcggcagca gctcagttgt 3300 tcgatgagga cagcgacgat gacgagacgc cgccgctggc gccaccctcg gatgacgagg 3360 acagctatgg cgtcagcact gctacgactg cgggggtggg cgagcccagc agctatgctg 3420 aggccagcag tgggcctcat gcggctgact ggcgtaaggc tatggaggag gagatggagt 3480 cccagcgcga taacaagacc tgggagctgg cggcgccgcc tccgggtgtg cgtctgctgg 3540 ccaaccggtg ggtgtacaaa ttgaagccgc agcctggtgg cgccccgcgc ttcaaagcgc 3600 gcctggtagt caaaggcttt gcccagcgcg agggcataga ctacagcgag gtgttcgcgc 3660 ccaccagccg ctacgtctcg ctgcgggcca ttctggccat cgccgccgcg cgcggcctat 3720 cgctgcacca gatggacgtc aaaaccgcct tcctcaacgg cgaccttgat gaggagctct 3780 ggatgcagca gccgcagggg tttgaggtca gcggcgacgg cggcggcggc gacactggtg 3840 gcggcggtgg tgctgacggc ggcggcagcg gtggctcaag ccaccgcagc gtgccgctgg 3900 catgccgcct gctaaagtcc gtgtatggac tgaagcaggc accgcgctgc tggtaccgca 3960 agctcagtga ggagctgggc gggctgggct tcacgccagc cacggcggat ccggcactct 4020 ttgtgcggca cgacgaggcg ggtccggtct atgtgctggt gcacgtggac gacctgctga 4080 ttgccgctgg ctgctccgcg cagctggcgg cagtcaaggc cgccattggc aagtgcttcg 4140 aggtgcgtga cctgggagag gccagcacct atctggggat ggagattaag cgggacccca 4200 gcaccggcga catcctgctg cagcagcggc gctacgtcaa cgagctgctg cagcggcacg 4260 gcatgactga tgccaagccg cgcagcctgc ctttgccggc tggcacgcgt gtgctggcgg 4320 ccagcgagca gcagcctgtg ctggatgacg gcggccccta ccgctcgctg atcggcggac 4380 tgaactacgt cgccgtcagc acccggcccg acattgccta cgcgctgagc gtgctcgcgc 4440 ggcacatggc ggcgccgacc aaggctcatc tggcgctggc gacgggagtg ctgcgctacc 4500 tcaagcacac cgtggacatg ggcctgcgct tcagcgcttc cgcctctggc ggcggcggtg 4560 gcgcaggcag cagcagcgcc ggcggcggag cggcggcggg ctatgacgct ggctcctttg 4620 tcggctactg tgatgccgac tgggccggcg accccaatac ccgccgctcg cagaccgcct 4680 tcctctttgc cctgggcaag acggtggtca gctggtgcag tcagcagcag cgcacggtcg 4740 ctgcctcttc tgtggagtcg gagtaccagg ccgcggccgc agcgaccaag gaggcgctgt 4800 ggctgcgcaa gctggcgtcg gacctgggcc tgcgcagcgg tgctgtagcc attcgctgcg 4860 acagccaggg cgccctcagc ctggctcgca accccattgc ttcatcgccg ctgtcgaagc 4920 acatcgacat ccagcaccac ctggtgcgtg agcgcgtggc gcgcggcgag gtggctgtgg 4980 agtactgccc cacggagcag atgattgcgg atgcgctgac caaggcgctg ccggaggcca 5040 agttcttctt ctgccgcgcg gcgatgggcg tcagcacttg agcagggttc tgtcaaggac 5100 agatgccgct gcggcggtat gttagcaggg aggttagtta ggaatgggac cgtgctgggt 5160 tctgccacgt gcagtaaggc tgaggcctag tgcagtgact tgcggtgact tcagacggtg 5220 ctcagtttga gggggagtta tctcctcctt tatggccggt gattggcctt gtttgtttcc 5280 cctgtcagtt ggctgtcagt tggcggtttg ctattggacc cagttcggtc cgagggggag 5340 // ID Gypsy-8-I_CR repbase; DNA; CHL; 11552 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-8-I_CR is an internal portion of Gypsy-8_CR LTR DE retrotransposon - a fossilized copy. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 4-bp TSDs; Gypsy superfamily; Gypsy-8-I_CR; KW Gypsy-8-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-11552 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-8_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 6(5), 239-239 (2006). XX DR [1] (Consensus) XX CC This is an internal portion of Gypsy-8_CR LTR retrotransposon. It CC encodes one 2903-aa polyprotein composed of gag, reverse CC transcriptase, PHD zinc finger, and integrase. This polyprotein CC is characterized by unusually long distance between the reverse CC transcriptase and integrase (~900 aa). Gypsy-8-I_CR encodes also CC a second putative ORF downstream of the polyprotein. CC Corresponding protein does not match anything known (it can be an CC artifactual ORF due to the high GC content). XX FH Key Location/Qualifiers FT CDS 44..8752 FT /product="Gypsy-8-I_CR-1p" FT /translation="MSDEDGFPRRARPSAFHTDLQGLDYERITQHCERFDP FT QWHSVRDWLESVDELAEEFGWTESRKLQLASKRLGPTAKEWYRSWKLANPM FT ASSTWEGFCDALRERWGVSDRELHLALANCTQGPKETVREYADRYLGLVTQ FT LRLDYNRDPTHMHNFMRGLHPDIYDSVYDMEPHNLTAAIRKAIYASEKGKL FT SKADGRDTRGRSDSEERRVRFADRSPSNEPRHSNGSSTAPPRYSRPFGRDD FT RRDSRDTRSDQGPRMANRGPNNSSAGRGGGGVEDLSTRMSRMQLLHRRTPS FT RTLLFQAARPLYSPYRDAYALGYESEDSGGVMSPQGSTTAPVSYYTNAEAY FT TKRVGDFEPITLPRKRVPVPIPGPEPMEVDRLAPRRFTAGHADQRREREQR FT PAAAAPASSAAAPAAGDGQGPRAPAGLEARPTGRRPTPGRGRTQATTPAHA FT PTRARSPAADPAAPAARPPRTDPIPTGDAGEEEKRVADEICAKINAYPIPL FT SRALRVNPANIYTKIGGRLIGLGRQHSRELPTPVAAAPPTRPPTRAPAAAP FT TTRSPKATAPGPPGHPAAHAHRATAATLTPVVSGGEWDVVEAEVLLGKADG FT KFIRARAVVDTGASNTVIPRVTLRRLGRVEDIEPAEQTFTNADGVRSESGG FT LVRNLVIATGDMVSTVDAYVSDATSYVILLGSDFLAPIKADISYRKKMLEY FT TNDLGRRSALPIRYHFPPTPMACMVGRRARVVSDPPSEGEEAAAPAPPAPA FT TRSASRPRPSPFALCQVPGYVVPGPSPVLPPVPPPLTGAETLELAGILDSM FT LSHASPTAETARSAGARRVCMHRVLDTMDNGSPRASSLSSHPREDWPLDME FT AELDALVAGRPMEDTSIDEQRWQDIQDGRAPLPLSQPRSPDRRAPNPSGGT FT NAGPLTEQANPLQYPPGDPVGSPDGPEEAQRLLTELAAGQGSPEKVARVAR FT ILHWTASRELQVLQAAAFLAGARELARESSTDSHTDDTWWEDLPLGPSPRS FT VSSTVTTELDPNPSANTKEEEGATEEDEEGATRPNDPCKPPPACGYDPEGD FT AGGAAPPSRAFLTRQGQESGVTWDDCEDPFLPPLDPLSMPGDSTPDDASAP FT TFHADYVSDTESDGPPSLMDASDSGSDEESRGEEPAFPRRASHYPRFLVSY FT SAYRGQNPHEPGHSRVCVALGRQDGFLSDVASGSTLSAGYEAPLLDSDSGS FT GPPSPVTSAHFASPAPSGAKLDASWEDDDCCSPRSQSWAAPRTPIYYASGT FT PQTPRTSLPTLADVLLQRPTKEEVLRLSPTQHLLAAREYGQRVRDTQGELP FT ADDERRQLLLATLSPDLTPEHSTALAKVLLDNYDVFAASNQDLSVTSWLEA FT SIDTGDAAPVCTNPYRLSKGERDALEAEVQKMLASGVIQPSSSEWCSPMVM FT VKKRPAGSDKAGGGAADAPPVPPQWRACIDLRAVNDLTKPLRFPMPHVQTM FT IDTVAPPMGEKRIYSTVDLIAGFWQVPVRQEDRCKLAFQAPSGLYEFTVLP FT MGARQSPALFQRLLSLVLRPILFYGPGESTPGKCCALFIDDICVASKDPVQ FT HAEDLQNMFDCLRLANLKMSIKKCHFATTEIEFLGHAISGATGQYTLAPRN FT AEAIRNYPKLKTVRQIKAFLGLANYYRSMVPNFALISRPLYDCLGKLGYQW FT GPEQQAAFEQLKTALTTEPVLRAADVTRPFRLATDFSCNAVGACLSQLDDD FT SREYAVSYASKRLTGAETRWSSTDGEAYAAVWAVKKYHEYLASARFQLITD FT NTALTYIMRAKDLTGKLARYALRLQGYDMEIVHRPGTKHGNVDGLSRLGHL FT AEDGGREDTDDVEGSGGDQGLSTELLTSSASPGPLAGYVATAHRQPNGRMS FT TFLAGRSLSQEFEQVWSLPAARSHREVRIFMRRATSGQVGEQKAAGRAGPS FT QPAAEADRAAAGPSTAREDGAAEVPGGEGAGPGSPAAAGSAPPRAEPQAGH FT PASPLQRLLAMCRSPNPLSPGKRSRNAPPQPAPQTTSLRSPSGEPLRRRRL FT PQPPAHLQDYTEQGESEEGESPKCARCQQGKPEWNMLLCDFPGCSSAYHTT FT CLRPPLRRVPEGNWLCPEHATGGNPNAGKQHGDDLGVAAGTGTGGISGGAG FT DAAPAPANRRLVRVVRRLNLPSGAPNPGPVGAPALTTPPVAAIPAATTAGP FT GHQEAVARAAATASLAATNMPTAMATALTATAAPSGFTNFPTDELVRAYVG FT EGDSSEDEEEWEQEDGTATEGEGEVSEDVLAVNGVPNYSAGEADVWHDAGL FT LHYLQYRSFVARPDLSGTDFYKECARISRKARRYRWENEEEWEQEDGTATE FT GEGEVSEDVLAVNGVPNYSAGEADVWHDAGLLHYLQYRSFVARPDLSGTDF FT YKECARISRKARRYRWENGQLYKRATRNRPEVRILRHSERQAALAEVHNLA FT HPGVRATFDLCRSRFWWKGMGHDCRVFVEHCRQCHPTQHVLVRHQPLHPLP FT IVQVFHRINVDLSGPHALTPRGNRYIAVAIDAFSKYPVVGALPNKESATTA FT RWLWESVLCHWGSVAVVMTDQGTEWQGEFASLLARERVRHVKSGPRCPAQN FT GQVERFMGIMRSSLVRLCQEGVEANWDLYIHQVALSYRAARQRSTGCSPAL FT LLYGKELALAQQKPPLAQQTADPGEEWDSEGEGAEAVADERRQAAGVRKEL FT LDAAADGARGNMEKAQTRMVRDHARRTHVNVMPKPRPGGEGAYGSPDDTPA FT DGQAQGQPATAATTRRASGSDSDNTKTPSPPPSAVLSRGGAGSAAPGGRGR FT GRGRGAGSRGRGDRTTTQQPAAPKPALPTNSHDSAPEDDPALMHLPCLEKG FT TLVYRQRPSRSKLQSTREGPYRWLLWNRTGTMALVEDDQSTRFTIPVAQLL FT VPRGTMVPAR" XX SQ Sequence 11552 BP; 2175 A; 3953 C; 3775 G; 1649 T; 0 other; tggtttcgct accgggactt cttccatccc ccagtagcca ctcatgtcag acgaagatgg 60 ctttccccgc cgagcgcggc ctagcgcttt ccataccgat ctgcaaggac ttgactatga 120 acgtatcacc caacactgcg agcgcttcga cccgcagtgg cattctgtgc gtgattggct 180 ggagtccgta gatgagttgg cagaggaatt cggttggact gagtcacgaa agctccaact 240 tgcctctaaa cggcttggcc ccacggccaa ggagtggtac cggtcatgga agctagccaa 300 ccccatggca tcgtccactt gggagggctt ctgtgacgcc ctgcgcgagc gctggggtgt 360 gtcagaccgc gagctgcatc tggccctcgc caactgcaca caaggcccca aggagacggt 420 gcgcgagtac gccgaccgtt acctgggcct cgtgacccaa ctgcgcctcg actataaccg 480 tgaccctacc cacatgcaca acttcatgcg gggcttgcat cccgacatct acgactccgt 540 gtacgacatg gagccgcaca acctaactgc tgccatccgc aaggcgatct acgccagcga 600 gaagggcaag ctgtccaagg ccgacggccg ggacacacgc ggccgatctg acagcgagga 660 gcggcgagtg cgctttgctg atcgctcgcc cagcaacgag ccacgccaca gcaatggctc 720 tagcaccgct cctccccgct acagccggcc ctttgggcgg gatgaccgcc gagacagccg 780 ggatacccgg tcggaccaag gcccacgcat ggccaaccga gggcctaaca acagcagtgc 840 cgggcgtggg ggcggcgggg ttgaagacct cagcacccgc atgtcacgca tgcagctgct 900 gcaccgccgc acccccagcc gcaccctgct gttccaggcg gcccgcccgc tctacagccc 960 atacagggac gcttacgccc tgggctacga gagcgaggac agcggcggtg tcatgtcacc 1020 gcaaggctca accaccgctc ccgtctccta ctacactaat gctgaggctt acaccaaacg 1080 agtgggcgac ttcgaaccca tcaccctgcc tcgcaagagg gtgccagtcc ccatccccgg 1140 gcccgagccc atggaggtgg accgcctagc ccctcgccgc ttcaccgcgg gccacgctga 1200 ccaacgccgg gagcgtgagc agcggccggc cgctgccgcc cctgccagct cggccgctgc 1260 gccggctgca ggcgacgggc aaggcccccg tgccccagct gggttggaag cccgccctac 1320 gggacggcgg cctacccccg ggcggggccg gacacaggcc acaacccccg cccacgctcc 1380 aaccagggct cgcagccccg cggctgatcc cgccgcccct gcagcccgtc ctccccgcac 1440 ggaccctatt ccgacgggcg atgcaggaga agaggagaag cgagtggccg atgagatctg 1500 cgccaagatc aacgcatacc ccatcccctt gtcacgggcg ctgcgagtca accccgcaaa 1560 catctacacc aagattggtg gacggctgat cggtctgggg cgacagcact cccgggaact 1620 gccgacaccc gttgctgccg caccaccaac ccgaccccca acccgcgccc cggctgccgc 1680 tcctactacc cggagcccca aagcaaccgc cccggggccc ccgggccacc ccgctgctca 1740 cgcccaccgg gccacggcag cgaccttgac cccggtggta tctggaggcg agtgggacgt 1800 ggttgaggcc gaggtgctcc tgggcaaggc cgatggcaag tttatccgtg ctcgagcagt 1860 cgtggacacc ggtgctagca acaccgtgat tccacgggtg acgctacggc gcctgggccg 1920 agtggaggat atcgagcccg ccgagcagac cttcaccaac gctgatgggg tgaggtctga 1980 gtccggtgga ctggtgcgca acctggtgat cgccacgggt gacatggtta gcacggtgga 2040 cgcgtacgtg agcgacgcca ccagctatgt catcctgcta gggtctgact tcctggcacc 2100 catcaaggct gacatcagct accgcaagaa gatgttggaa tacaccaacg acctagggag 2160 gcgcagtgcc cttcccatcc gctaccactt tcctccgacc cccatggcat gcatggtggg 2220 gcggcgggcg cgagtggtca gcgacccccc aagcgagggc gaggaggctg ccgctccggc 2280 ccccccagcc ccggcaacac ggagcgcctc gaggcctcgc ccctcgccgt ttgcgctttg 2340 ccaagtgcca ggatatgtgg tgcccggtcc gtccccggta ctgcctccag tacccccgcc 2400 gctgacaggc gcggagaccc tggagctggc cggcatccta gacagcatgc tgtcgcacgc 2460 atcacccacg gcggagaccg cccgctccgc tggggcgcgg cgtgtctgca tgcaccgagt 2520 actggacacc atggataacg ggtcccccag ggcatccagc ctcagcagtc accctcgtga 2580 agactggccc ttggacatgg aggctgaact ggacgccctg gtagctgggc ggcccatgga 2640 agacacgtcc atcgacgagc aacggtggca agacatccag gatggccggg ctcccctgcc 2700 cctgtcccaa cctcgcagcc cagaccgcag ggctcccaac cccagcggcg gtaccaatgc 2760 tgggcccctg accgagcagg ccaacccttt gcagtacccg ccaggtgacc cagtaggttc 2820 accggacggg cctgaggagg ctcagcgact gctaacggaa ttggcggctg ggcagggctc 2880 acccgagaag gtggcaagag tggctcgcat cctacactgg acggcgtcac gagagctgca 2940 ggtgttgcag gccgcggcat tcctagccgg ggcccgcgag ctggcgcgcg agagcagcac 3000 cgattcccac accgatgaca cgtggtggga ggatctgcca ttgggcccat cgccgcgctc 3060 cgtaagcagc acggtaacta ctgagctgga tccgaacccc tcggccaaca ccaaggagga 3120 ggagggggct actgaagaag atgaggaggg ggctacccgc cccaacgatc cctgcaagcc 3180 tcctccggct tgtggctatg accctgaggg ggacgcagga ggggcggctc ccccttcccg 3240 ggcttttcta acccgccagg gccaggagag cggggtaaca tgggatgact gcgaagaccc 3300 ctttctcccg cccctggatc cactgagcat gccaggcgac tcgacacccg acgatgcgag 3360 cgcaccaacc ttccatgcgg attatgtgtc tgacaccgaa tctgatgggc ctccatcact 3420 tatggacgcc agcgactccg gcagcgacga ggaaagcagg ggcgaggagc cagccttccc 3480 ccggcgggcc tctcactacc cccgcttcct cgtcagctac tcggcctata ggggccaaaa 3540 cccacacgag ccgggccact cccgcgtctg cgtggcccta ggccggcaag acggcttcct 3600 gtctgacgtg gctagcggca gcaccttatc cgcgggctac gaggcccctc tgctggattc 3660 agattcaggg agcgggccgc cctccccagt caccagcgct cacttcgcct cccctgcacc 3720 gtccggggca aaactggatg cctcatggga ggatgacgat tgctgcagtc cccgcagcca 3780 gtcgtgggcg gccccacgta cgcccatcta ctatgcgtca ggcacgcccc aaaccccacg 3840 cacgtccctc cccacgttgg ccgacgtgct gctacagcgg cccaccaagg aggaggtgtt 3900 gcgcttgtcc ccaacccagc acctgctggc cgctcgggaa tatggccagc gcgtgaggga 3960 cacacaggga gagcttccgg ctgacgacga gcggcggcaa ctgctactcg caaccctcag 4020 cccggacctg actccggagc actcaaccgc tctggcgaag gtgctcttgg acaattacga 4080 cgtatttgca gcatccaacc aggacctgtc cgtcaccagc tggctggaag cgagtatcga 4140 caccggggat gcggcaccag tgtgcaccaa cccataccgg cttagcaagg gcgagcggga 4200 cgcgctggag gcagaggtgc agaagatgct ggcctctggg gttatccaac ccagctcgtc 4260 ggaatggtgt tcccccatgg tgatggtgaa gaagcggccg gcaggcagcg acaaggccgg 4320 cggaggcgcc gcggacgcgc ccccggtccc accccagtgg cgcgcctgca tcgacctccg 4380 cgctgtgaac gacttgacca aaccccttcg cttccccatg ccccacgtcc aaactatgat 4440 agacacggtg gcaccgccga tgggcgaaaa gcggatttac tccaccgtcg acctcatcgc 4500 cggcttctgg caggtgcctg tccgccaaga agaccgatgc aagcttgctt tccaggcgcc 4560 ctctgggctc tacgagttta cggtgcttcc gatgggagcc aggcagtccc cggcgctttt 4620 ccagcgcctt ctgagtctgg tcctgcgccc tatcctcttc tacggccctg gagagagcac 4680 accaggcaag tgctgcgcgc tgttcatcga cgatatctgc gtcgcgagca aggacccagt 4740 ccagcacgcg gaggacctgc agaatatgtt cgactgcctg cgcctggcta acctgaagat 4800 gtctatcaag aagtgccact ttgcgacaac agagattgag ttcctgggcc atgccatttc 4860 gggcgcgacg ggccagtaca ccctggcccc ccgcaatgcc gaggcaattc gcaattaccc 4920 aaagctaaag accgtgcgcc agatcaaggc cttccttggt ctcgcgaact attaccggtc 4980 aatggtaccc aacttcgcgc ttataagccg accgctatac gactgcttag gcaagctcgg 5040 ttaccagtgg gggcctgagc agcaggcagc gtttgagcaa ttgaagactg cccttacgac 5100 tgagcccgta ctgcgtgccg ccgatgtcac ccgccccttc cggttggcca ccgacttcag 5160 ctgcaacgcc gtgggcgcgt gcctgtcgca gctggacgac gacagccggg agtatgcagt 5220 ctcctacgcc agtaagcggc ttactggagc agaaacacgc tggtccagca cggacggcga 5280 ggcttatgcc gctgtgtggg cagtgaagaa gtaccacgag tacctggcgt cagcacgctt 5340 tcagctgatc acggacaaca cggcgctcac atacatcatg cgggccaagg acctgacagg 5400 caagctggcc cgatacgctt tacggctgca gggctatgac atggagattg tgcaccgacc 5460 tggaacgaaa cacggaaacg tggacgggct gagccggctg gggcacctgg cagaggacgg 5520 gggcagggag gacacagacg acgtggaggg gtctggtggc gaccaggggc tcagcaccga 5580 gctgctgact agctccgcct caccgggccc ccttgcgggc tatgtcgcca ccgcgcaccg 5640 tcagcctaac gggcgtatgt ccactttcct ggctggacgc tctctgtccc aggaattcga 5700 gcaggtgtgg tcactgcctg ccgcccgaag ccaccgggag gtgcgcatct tcatgcggcg 5760 ggcgacgtca gggcaggtgg gcgagcagaa agcagccggg cgggcagggc cctcgcagcc 5820 ggcggcggaa gcggaccgcg ccgcagccgg cccgtccacc gctcgtgagg atggagcggc 5880 agaggttcca gggggtgagg gggccgggcc tggcagccca gcggcagcag ggtcggcccc 5940 gcccagggct gagccccaag ctggccaccc tgcctcaccc ttacagcggc tgctggcgat 6000 gtgtcgctcc cccaaccccc tgagccccgg caagcgtagc cgcaacgccc cgccccaacc 6060 ggcgccgcag accacctccc tacgcagtcc gtccggggag cctctgcgcc ggcggcggct 6120 gccgcagcca cctgcgcacc ttcaggacta caccgagcag ggtgaatcgg aggagggcga 6180 gagccccaaa tgcgctcgat gtcagcaggg taagccagag tggaacatgc tgctgtgcga 6240 cttcccgggc tgtagcagcg cataccacac cacctgcctg cgaccaccgc tgcgccgtgt 6300 gccggagggc aactggctct gccccgagca cgctactggg ggcaacccga acgcggggaa 6360 gcaacacggg gacgacctgg gggtggcggc cggcaccggc acaggaggca taagtggggg 6420 cgcgggggac gcagccccag ccccagctaa ccgccggctt gtgcgggttg tcaggcgtct 6480 caacctgccc agcggcgccc ccaacccggg ccccgtgggc gcgcctgctc tcaccacgcc 6540 acccgtggcc gccattcccg ccgccacaac cgcaggtccc ggccatcagg aagcagtcgc 6600 aagggcggcg gctacggcct ctctggccgc aaccaacatg ccaaccgcca tggcgacagc 6660 cctaacggcg acggccgcgc caagcggctt taccaacttc cccaccgacg agctggtgag 6720 ggcatacgtg ggcgaagggg attccagcga ggatgaggag gagtgggagc aggaggatgg 6780 cacagcgacg gagggcgaag gggaggtgtc agaagacgtg ctagccgtga acggagtgcc 6840 gaactacagc gcgggcgagg ccgatgtgtg gcacgacgcc gggctgctgc attacctgca 6900 gtaccggagt ttcgtcgctc gccccgacct ctccggcaca gatttctaca aggagtgtgc 6960 gcgcatttca cgcaaggcgc ggcggtaccg atgggagaat gaggaggagt gggagcagga 7020 ggatggcaca gcgacggagg gcgaagggga ggtgtcagaa gacgtgctag ccgtgaacgg 7080 agtgccgaac tacagcgcgg gcgaggccga tgtgtggcac gacgccgggc tgctgcatta 7140 cctgcagtac cggagtttcg tcgctcgccc cgacctctcc ggcacagatt tctacaagga 7200 gtgtgcgcgc atttcacgca aggcgcggcg gtaccgatgg gagaatgggc agctgtacaa 7260 gcgtgccacc cgcaaccggc ccgaggtgcg tatcctgcgg catagcgagc ggcaggcggc 7320 cctggccgag gtgcacaacc tggcgcaccc cggcgtgcgg gctaccttcg acctgtgccg 7380 ctcccgcttc tggtggaagg gaatggggca cgattgccgg gtgtttgtcg agcactgccg 7440 ccagtgccac cccacgcagc acgtgcttgt gcgacaccag cccctgcacc cgctacccat 7500 cgtgcaagtg tttcaccgca tcaacgttga cctcagtggc ccgcacgcac tcacgccccg 7560 cggcaaccgg tacatcgccg tggccatcga tgcgttcagc aaatacccgg tggtgggtgc 7620 cctgcccaac aaggagagcg ccaccaccgc acgctggctc tgggagagcg tgctgtgcca 7680 ctggggttcg gtggcggtgg tgatgacgga tcagggcacc gagtggcagg gcgagttcgc 7740 ctccctgctg gctcgggaac gagtgcgcca cgtgaagtca ggccctcgtt gcccagcgca 7800 gaacggccag gtggagcgct ttatgggcat catgcgctcc tcgctggtgc ggctgtgcca 7860 ggagggagtg gaggcaaact gggacctcta cattcaccag gtcgccctgt cgtaccgggc 7920 tgcgcggcag cgctcaacgg gctgctcgcc ggctctgctg ctgtacggca aagagctggc 7980 gctggcgcag cagaagccac cactggcaca gcagacggcc gaccccggtg aggaatggga 8040 ctcagaggga gaaggggcgg aggcggtggc cgacgagcgg cgacaggccg ctggggttag 8100 gaaggagctg ctagacgcag cggctgatgg ggctcgaggc aacatggaga aggcccaaac 8160 ccgaatggtg cgtgaccacg cccgccgcac gcacgttaat gtgatgccaa agcctcgtcc 8220 gggtggggag ggcgcctacg gtagcccgga cgacacgccg gcggatgggc aggctcaggg 8280 gcagccggcc accgcagcca cgacgcgcag ggcaagtggc tctgacagtg acaataccaa 8340 aacgccatca ccacccccct cagcagtctt aagtaggggc ggcgcgggtt cagcagctcc 8400 cggcggcagg gggcgggggc gcggacgcgg cgcgggaagt agggggcgag gcgatcgcac 8460 cacaacccag cagccagctg cgccaaaacc cgcgctcccc accaacagtc acgactcagc 8520 ccccgaagac gaccccgcac tcatgcacct gccgtgccta gagaaaggca cgctggtgta 8580 ccgccagcgc ccgtctcgca gcaagcttca gtcaacccgg gagggaccgt ataggtggct 8640 gttgtggaac cgaaccggca ctatggcgct agtggaggat gaccagtcaa cccggttcac 8700 tatcccagtc gcccagctcc tggtgccgcg cggcaccatg gtgccggccc gctaggccat 8760 tcgctatggc acgcgtgtta ggacgcgtgc ccgggagaca gcgggccacc ctccacagag 8820 gtccacccgc agaatttagg atagaatatg gtcttattga cagcggcaac gggacccgca 8880 gcgaacagga acagatcacc gttgggctcc aaggctaccc aggtccccgg aattgaccac 8940 ccaaccaagt aggaaacaag ggcttctacc gtttacccaa gcgtggcagg ccgcaagggc 9000 ttgccccccg gcgctcgagt gagccttact tcgcagtcgc aaccccgacc gcgcagcgaa 9060 cccgaccacc gactacgctg atagagaaca ccgcgctaca caatcccgca atacgagaaa 9120 gcagggaaac cggcgacgta aaggggtgcc ggcagttggt cgcatgaggg gaactggcgc 9180 tggcaactca tatacaccga cggaggctca gtcctccaaa atgcatgttc acagcgatta 9240 tatccggtcc gccttaaagc agcacgcaga agcagaactt cgcaaacgac ttgtcgcaca 9300 ggcgatggga accgataatc ctcacggctc tccgccgtgg ccgctggcaa ctgagtatcc 9360 ttcaccggcg cagatggtcc aggctttccc tggggcggcg ggccccgccg cggcgctgac 9420 ggccgcgacg cacgcctcgc caggttcatc cccgatgggt gtcgcgtcca acggcgaggt 9480 gaggactcga gcgcgctatc agggagggcg gaaggcagaa tagaaccaga cgctaaggca 9540 aaagcggagg cgcaaccagg ggggtggacg agagccaggg agaagcaggg cgaatcgcca 9600 atagtaggga aggccaaccg gagctccggc gggcggaggc aggccacgtc aagggcgttc 9660 agcagcaaga cgagggcgag tgtattgggg accagcaggg agtggcctac ctggcgtctt 9720 ccgcctaacc gcgaccatca accgctccgc gccccagcag gcggtcccgc aaaccaccga 9780 cccgatggcg tacgccatcg cccacggcgg cccccgggcg cgtgcactcc tggcgcttgg 9840 gggcaagcgc ggcctcacgt cgccctatgg cggctttcgc tccagcccag cgaccaagat 9900 cagcgccggc gcgcaggcct cgcccacggt ctcccagggc ccggcgcggc tgcgcaccgg 9960 gccgccacag ccggcccagc ccaacgtggc gctgggtgcc gcggttggac gcggcaaaat 10020 tcaggcactg ttcgccgcgc cgccggcctc cagcgagcgg atgagcccaa ccaacacggt 10080 ggcggggggc tatccgcctc caccgccacc gccaccaccg tcgccgccgc cggggcctat 10140 gcagccgcca ccgccgccgc cgccgccgcc gccgccgccg ccgcccctga tgcagcggcc 10200 gccgccgcca ccgccacctt cctctccacg gcaacagccg gccgagcagc agcaggtctc 10260 gccatgcccg ccattgccgc cgccgtacct ggggcagcag ctgccgccgc cgtacctggg 10320 acagcagctg ccaccgccgt acctggggca gcagctgccg ccgccgccgg cgttcatgcc 10380 gccacacttc gtgccgatgg ccggcatggt cccgggccct tttctcccgg cactcgaccc 10440 gtttctggcc ccccagctga tgcaaatgct gttccaccgc gtggtccagc tggagcaggc 10500 tgctgccttc ggcgccttcg gccaggcggg gccgcccaca gcggcagccc aaccgatggc 10560 ggcgcagccg ctgctcgccg accctgattg ggcgggcacg acgtcggacc tgggcgcggc 10620 cggtattagc gacatatcac cccgccggct gctggcggac attccagagg aggactctag 10680 cagcgaggtg gcgctggcca tagagccgct ggccgccatg gcgccagcgg aggagggccc 10740 ggcgggcgga ggggcggcgc agagcggcgc accgccggtc gcagtcgtgg cgggcacagc 10800 cgccgcaggc gcagcagtgg gagtaggcgg cggaatggtg gggccggggg cgccgccggc 10860 gagggaggcg gagagggcgg agtcctctat ctccaccagc agcaacagca gctccccgcc 10920 gccgccaccg ccgccgccgc cgccgccgcc accaccgccg gcctccgagc agggggagac 10980 ggatgagggt gacagcgtgg aggtgcagat ggacagcgag ggtgatttcg agtgggcggc 11040 aggccaagca gcctatttag gacagcagga cgcgagctgc ggggctctca atcaggccct 11100 tgaggcgatc ggccactgcg gcatgggcct ccacatcagc caagcacggg ccctggtaag 11160 ctgcgccgtg gccggcgtca ccttcactca gccggccatc ctgatcgcgg gccggccctg 11220 ccacccgtgc aaatcggagg cactggcgca cgacgtcctg caagcggcca tggtcgccat 11280 gatcaacggc ctctacggtc gccacctggt ggagggcagt gtcgacatgc gcgcctacgt 11340 gagcaactgg gtggtcgagg cgattggcgt ccgcaggctg tgcactaccg agcccgacga 11400 ggaatttgag gagaggaggg aagagacaga gaccgaggag ggaggagagg agatggaggc 11460 ttagggcggg aaagggcgag agtggagtca acgtttgcgg caccctaggg ccgcactggg 11520 gcgcgcggct taagtagggg cggaccgaca gc 11552 // ID TE1-1_CR repbase; DNA; CHL; 683 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE TE1-1_CR is a family of nonautonomous transposable elements - a DE consensus sequence. XX KW Transposable Element; Nonautonomous; TE1-1_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-683 RA Kapitonov V.V. and Jurka J.; RT "TE1-1_CR, a family of unusual nonautonomous transposable RT elements in the Chlamydomonas reinhardtii genome."; RL Repbase Reports 5(3), 66-66 (2005). XX DR [1] (Consensus) XX CC TE1-1_CR is a family of nonautonomous transposable elements. The CC consensus sequence is less than 5% divergent from CC TE1-1_CR copies. Classification of this family is not known. It CC is CC possible that it belongs to a novel superfamily (TE1_CR). CC Features CC of this superfamily include subterminal inverted repeats (pos. CC 33-45 CC and 679-667). The genome harbors several families of TE1_CR-like CC elements (e.g. TE1-2_CR). They all are characterized by conserved CC termini. XX SQ Sequence 683 BP; 149 A; 209 C; 193 G; 131 T; 1 other; acaaggggtg tcagctttgt acagctggaa aatgggctga gccccatgaa gcgcaaggga 60 tgtgacgatg ggcaccggcg acatgaacag cacccttgca agcccatcca gcgggactgg 120 ccacggggta tgccaacgcc gtgccgaggc accgacatgc cctgtccacg ccacgccacg 180 ccaccttacc cacgccactt cacggtccat gatatcccaa atgcacctca ytccacatcc 240 cactcaagca ccgcaaacgg ctagccgggc tcgggcgcgg gatcccgggc cgcgacacat 300 ttaaggctcg ggatcggcac gcatggttcg gctgttgcac aaaatttcga cattcttgcc 360 ccaagacgct tgtcgtcgac atgttttgat acatggatgt aagatattca gggcccgaga 420 gctatactcg cggactgaag aaagtcaaga tgtccatgga ctcgcgaggt cgcttctcgc 480 tccggccgag ttttccctgc cgcctatttt tctacaatag ggaatagcaa ttaataatac 540 gtggcgctcg cggtttgtgc cagtttggca ctttttgtcg acgcaggcag gcacgcaggc 600 aggcaggcag gcacggacag gggcaatcct ggggggcttc gcccccccct ggatcgcttc 660 gctcgggggg ctcagcccaa aaa 683 // ID DNA-1-9_CR repbase; DNA; CHL; 630 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE DNA-1-9_CR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; 9-bp TSD; DNA-1-9_CR; KW unclassified superfamily. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-630 RA Kapitonov V.V. and Jurka J.; RT "DNA-1-9_CR, a family of nonautonomous DNA transposons from the RT green algae genome."; RL Repbase Reports 4(2), 31-31 (2004). XX DR [1] (Consensus) XX CC Some 20 copies of DNA-1-9_CR are present in the C. reinhardtii CC genome. They are ~99% identical to the consensus sequence. CC DNA-1-9_CR is characterized by 37-bp TIRs and 9-bp TSDs. CC The genome harbors several young families of DNA-1-9_CR-like CC elements CC (less than 100 copies). XX SQ Sequence 630 BP; 140 A; 177 C; 175 G; 138 T; 0 other; cacacgggta tgacgccggt ctgaagaggg gcaaaaacgt gtccacacgt gtttctccac 60 gctcgcgtag acttagcctc cctctttcag gaattaactt atatgacata taaggatgca 120 aaagtttgac tcccgtcgca ttctgcgagg gtggctgtca ccacatttcg cgaggcatgc 180 tttcgccagg gtcgcactct taagcccggg agcatgtttc tagtctcctt gcatgtatgt 240 ttgatgacgg caacaataat tggcatcaaa gcgcggtaat tgcgcagact tgagaagcct 300 gttcaccgcg acgtcaccga ataggccgca aacaaaatat ttgaggtctc tatagtatca 360 aggtgatact tgtagaacgc tggtagctga ttcggaggtc ggcctgacga ccgtcgccga 420 ccgtgccaag ctgggcggaa cgcatccccc ggacgccacg cgagctagac cagtctaggc 480 cgtgtcgtcc gcccgggacc acgttcgccg tgccttaaat gggtagtgcc cgacctatag 540 ccggagtttg gggaccggac gacaaggcca aggcaggtaa ccaggctgtg ccgtttgacc 600 cctctcccag accggcgtca tacccgtgtg 630 // ID TCR1_LTR repbase; DNA; CHL; 164 BP. XX AC AF029671; XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 2) XX DE Chlamydomonas reinhardtii retrotransposon TCR1_LTR, long terminal DE repeat. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW TCR1_LTR; retrotransposon. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-164 RA Ferris J.P., Woessner P.J. and Goodenough W.U.; RT "A sex recognition glycoprotein is encoded by the plus RT mating-type gene fus1 of Chlamydomonas reinhardtii."; RL Mol. Biol. Cell 7(8), 1235-1248 (1996). XX DR Genbank; AF029671; Positions 1 164. XX SQ Sequence 164 BP; 37 A; 47 C; 53 G; 27 T; 0 other; gagctatggg cggctcccca gggacctttt tgcaccgggc tggggagggc agaaaagacc 60 gaaagctcta ggggctgggg acctaggcat ctccccagga acaatttgta atcaaccgcg 120 atggcagcac agctctcccg acctgcaggc atgcaagctt ggcg 164 // ID Gypsy-5-I_CR repbase; DNA; CHL; 3664 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-5-I_CR is an internal portion of Gypsy-5_CR LTR DE retrotransposon - a fossilized copy. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 4-bp TSDs; Gypsy superfamily; Gypsy-5-I_CR; KW Gypsy-5-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-3664 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-5_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 6(5), 236-236 (2006). XX DR [1] (Consensus) XX CC This is an internal portion of Gypsy-5_CR LTR retrotransposon. CC The internal portion is incomplete. It encodes integrase and PHD CC domain. There are only a few copies of Gypsy-5_CR present in the CC genome. They are over 98% identical to each other. XX SQ Sequence 3664 BP; 654 A; 1090 C; 1342 G; 578 T; 0 other; tctgcgcgaa ccggggggac cggctatgac aaggcctcgg atgtcgtgac gcatatatcg 60 ttttgatatt gcgtgaatca agtgcttgga ttttttctgt acactccggt cgccgctttt 120 gtgctttcgc atgccacatg ccgcagactg cactgaccgc tggcgcgctc taacaattct 180 ttgcggaggg agcagaccct tggcagagag ttcagtgagc cagcggtgca gaggcatggt 240 agtggccgcg gcttgggacg cttttaggtg gctgtcgcaa cgaatggcag acggaggaca 300 gcctgggcgc gctcagcgtc gatgcggtgg cagcatcagt acagtaccag gggcggtgca 360 gcttggagat gcagaagccg cgggcggcca ggcagacgca gcaccaacgg cactgaccca 420 gggcgcagcg gcaccggcag acagtgccac agctgcagca agcggcgtca cgcctgtgat 480 tgtggcgcag gctgctctag ctgcgacctc tggtgcagcg caggctacag caggggccac 540 gggtgcagcc tcatgcgcag cgccaggctt gatgctgccc tccaggtcca acgttgtgga 600 gcgggccagg ctcgccgtca acgcagcgtt tgccatcgac gggctgcagg acaaggagga 660 tgaaggctgt gtccccacgc caggcccagg cagcatgggc atcgccggta gcgactcagg 720 ggtgtcggaa tcaacgtatg aggagtatga ggggtatgag aagtcgatgg tttcgactgt 780 cagccaggtg ggcgagtcag aggtgatgcg ccagttacgc accagcgcag agatgcgagg 840 tgcatggggc gttgaccggg ctgaccgacg gctgcgcgcg ggcagccttg tggcgcggcc 900 gttggcggtg gagggcatgc ctgttaccgc ccccagcgcc gcgagccgcg gccacagctc 960 ggccatcgga gccctccgcc tgcagcagcg atttaaccgg aagcggcgcc gtcacaagca 1020 gctgtcggag cgagcggcgg accctttgcg aggttgtggc gtccagcggc tgccccagcg 1080 gggagacagc aacccggcga cgtgcgcagg cctgccaccc acagcgggtt tacctcaggc 1140 tggcggatcc aggcggcacg ccacgccacg ccgcgtgact gggaagcgct catggggcac 1200 gtgccgggcg catcctccgg cctaccgccc agcgcagcgc tagccgcact gcgctcagcc 1260 ccgccgccgc tgaccctcag cgccgtccta gtagccgccg tcgtcctgca gcaccactac 1320 ctcgcgccta atgaatgaca cacacacgcc gccgcctcaa gtgctggcac ctgcagacgc 1380 aggaccggta ctgccgctgg ggggggagtg acgacgcaga ggagctggcg cgaatgctga 1440 tacagcggca gcccggcagc gacctgctgg tatttgccac catggtggcg gcgcagactg 1500 acgcgacgga ggccgcgcta gcggccgcca acaggcggca gcctgacgtg tggctagacg 1560 cagaggtgat ggcgctgatt caagggctgg cggggcaagc cgcagcctcc cgccgcgcgg 1620 tgcggcgggc cagctcatac aggtgggacg ggcagcaact gctgcggtgc ctgccggacg 1680 ggcgcgccag agtctgccca ccgcctgcag agcgattagg aatagcggag cgcctgcatc 1740 tctcgcttgg ccacctcggt gtccggcgca cgcgggccct gcttcagctg gggtactggt 1800 ggttcgacat gggagccaca gcagctcacg tggtgcgcaa ctgccgggca tgcgacctca 1860 gccgcgtgac gggagacgcg aggccagcgg agctgcagcc gctgccggtc agggggctct 1920 tctaccgctg gggcgtagac ctggccgggg agctcacgcc caccgagccc gacggctacc 1980 gctactgcgt catcgccatc gagcatttca gtaagcatat cgaggtggtg ccaatcgtgg 2040 acaagaccgc agaccgcgta gcgcgggtgt ttgcggacat catggcgcgc tttggggcgc 2100 cagcagaggt ggtgacagac aacggatcgg agtttgaggg cgagtttgca gcgctgctag 2160 agcgctgcta cgtggatcac cgacacgcct cacccgggca cccacaagca gatggcgctg 2220 cagagcgcat cgtgcgagtg gtgaaggacg cgctgcgcaa ggcgtgctac gagtccgccg 2280 acccagcccg ctgggaccgg gcgctgccgc atctactgct cggctaccgg tgctcgccgc 2340 aggcggccac gcagtattcg ccgttcaggc tgctgcacgg aggagtggag ccccaggtgc 2400 cagcagccgt cagggagggg ttcgcggagc ccctgagctt tgaggacgca gcggcagcga 2460 cggagagcct gcgggcaagg gcggcctggg tgcagcgtca ctacccggag gccgccggca 2520 acctgctggt ggctcagcat cgcgatacgc ggcgctatgc ggccgcgcgc gcggcgcgcc 2580 gcctgacaaa cacgccagaa ttcagcgctg gcgactacgt ttacatcaag aagctgaagg 2640 tggacaacac cctgcaattc cagtattacg acacaatact cagggtgaag agcgtggggc 2700 cgctgggtgt agcggtgctc ataggacggg acgggacgca gcagctgcgt cggagggtgg 2760 agcagctggc gccctgccac ctggacgtgg acccggcgat tgagccgcag ctgtttcgcc 2820 ccggcaagga cctggcctgc gaagcgtgca gctcgccggg gcagccggcc aggatgctgc 2880 tctgcgacgg ctgcaaccag gggtggcaca cgcactgcct gcggccgccg ctgcgggagg 2940 tgcccacagg cgcctggcta tgtccgaggt gcctggcgca cttatcgcag cagccaccga 3000 cagaggaggc gacgcgcgag tggcccgagc agctaggcca gttgctgttc cccaaggcca 3060 gcacgcgcag gttggatgat gaggcccggc agatggacgg cacggtggtg actaggcggg 3120 tgcggacggg caggggcaag ggcgcggggg agacggagca gaaggccctg ctgcagttcc 3180 gtggcgcact ctaccggccg cactactttg ccgtgaagtg gccaggcgag gtggagatgg 3240 agtactggac tctggagcat gtgcggcgcg ggcgggcgcg cgcataagga gagtggaagg 3300 accagctggg gggacgaggg cgtagctagc gttggatagt tgcgcaggta ttcgggatgg 3360 agggcccagc tggggggacg aagcggtatg aaggatggta ttgtggattc ggatgacggt 3420 gggaggagcg atcattcgct atttggcttg ggttgagatg tctaaggatg acgtgtgttg 3480 cgtatttgtg gcctctctga agaggaccga acgtggactg acaatttgga cgtattgacc 3540 atgcgtgctt gcctgtccct gcgtgagcca acctcggttg ccatgcttgg cctctctcgg 3600 ttgtgggggg aaatggcggg acaggtggcg gagacattca taaccggcag aatgacacga 3660 cagc 3664 // ID Gypsy14-LTR_CR repbase; DNA; CHL; 365 BP. XX AC scaffold_34; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy14-CR; KW Gypsy14-I_CR; Gypsy14-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-365 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 101-101 (2008). XX DR Genome; scaffold_34; Positions 758862 759226. XX SQ Sequence 365 BP; 89 A; 110 C; 79 G; 87 T; 0 other; tgttgcagcg tgcaatatca tttatcgaac gatatagctt ttattaagct acgacatgca 60 aaaattgctt gtgacggttc agtgactcta cgaacggggc cagctgtccc cgtcgattca 120 accccgaaac gccgggagtc cgtcctgcgg accagactct ctaagttccc ccgattccgt 180 acccgctacg gtgaaccccc gagccttgaa atagtctcag ctctctatgc aacgataagc 240 aagcttactc aaacacttgt ttattcggag agtgccactg gttccgaccc cagtgtgccg 300 cgggagcgaa gcccgcctaa gactctcagc tttcaaaagc ttatccgttc tcccgaacta 360 caaca 365 // ID L1-4_CR repbase; DNA; CHL; 1609 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE L1-4_CR is a 3' terminal portion of a putative L1-like non-LTR DE retrotransposon - a consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-4_CR; KW putative L1 superfamily. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1609 RA Kapitonov V.V. and Jurka J.; RT "L1-4_CR, a family of L1-like non-LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 4(2), 42-42 (2004). XX DR [1] (Consensus) XX CC The L1-4_CR elements are flanked by 5-12-bp target site CC duplications. CC There are several hundred copies of L1-4_CR present in the CC genome. CC They are massively truncated at 5'. Many elements are just 20-50 CC bp CC long 3' terminal portions. The consensus is not complete at its CC 5' end. Some copies are identical. XX SQ Sequence 1609 BP; 204 A; 631 C; 438 G; 336 T; 0 other; gcctgcatcc tactgcctta gacgacaaca cacactcgtt tcgtccgcgc tcgccgttcc 60 gccttccatg tagtgctctg ggtgtcagtc agcctgatcc ttttacctac ttgtgtctat 120 ccgggcatca gcatggctaa cagcaaacgg cgccgcgtgg gggtcaccga ggcatccggc 180 atcgcggagc aggtcatcgc gctccatcgc cgcccgcgca cgcttcgccc ccggcggaca 240 tgccgccctc tgcgacaacc cgcggggtca tctacgccgc ccccgcggcc gcatcggcta 300 cgttattggt gattacttgg tttactcctg ggttctgtct acttggtttt tggttcatgt 360 gccgccgccc ccgcggccgc cttggcgacg acgtgccgcc gcccccgcgg ccgcatcggc 420 tacattccgc cgcccccgcg gccgccctgg cgctgcgccg ccgcccccgc ggccgccctg 480 gttctgcgcc gccgcccccg cggcagccct ggctacgtgc cgccgccccc acggccgccc 540 tggctacgtg ccgccgcccc cgcggccgcc ttggtcacgt gccgccgccc ccatggcggc 600 gtgtcgccat cgcgtcatcc caaaaggagt attcgttgcg ggcgtgcgcg gcggcggcct 660 cccaacccat acacagccct ctggagatga cgccaccggc gataccggcc gatccagggg 720 cttcagtgtg tcgagttccg tgtgccttgc ctgcagtgcc cctagcccgc gcatgcacac 780 cccactaacg cacctgcccc cacccggccg gcagccgccg ccccgtgcgg ctgccggttt 840 cctgacgggt gtggggaacc gcaatgcggt ccgcatccgt cacctgcttg ctgtctgcac 900 cccttgcctc cttgtctgcg tatgcttgcg tcccttgccg cccctgcgga ggattcgttt 960 gcgtttgctg cttgtggctt cctggtgtcg cacggcatgc accgcacgcg cccatgccga 1020 ccgcagtgcg accgggatga gccatgcagc gatcgcgctg tcgttacttc ctcggccgcc 1080 cggcgccctt gctcatttat ttaaccgcat ccatcacgtc cccgtactta gtgttcttcg 1140 tttagtcacc ccgcgcagcg cacagccata acagccacag cccgcagccc ggcttgcgca 1200 ccatgtcctc gcggttgctt tggggttgac gttgcttgcc tcgcgcctag cgccatagct 1260 tctacgctgc agtgcatcac gcctcctgtc cctccctccc tccctccctt acatgtcgtg 1320 ctgggcaccg gtggcgctgg tgttctccag gttggtttcg ggcgcatcct ttctggtagt 1380 cccaacgcca gcccggccgg cgtcgtccag cccagccatc ccaatacagc agccaccttc 1440 cgtcagccag ccatgggcgc gaccgtccac agcgtttacc gtcggttacg aggtaacatg 1500 tgaattcgca acttgcgcta ctgactgcct actctcgtgc cgcctgcaag cccactccgc 1560 cttccgctct ggtctacgta cgcattagtt ccgcaacact cgtcaataa 1609 // ID Gypsy-12-I_CR repbase; DNA; CHL; 5773 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-12-I_CR is an internal portion of Gypsy-12_CR LTR DE retrotransposon - a consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Interspersed repeat; 4-bp TSDs; Gypsy superfamily; Gypsy-12-I_CR; KW Gypsy-12-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-5773 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-12_CR, a family of nonautonomous gypsy LTR RT retrotransposons from the green algae genome."; RL Repbase Reports 6(5), 232-232 (2006). XX DR [1] (Consensus) XX CC This is an internal portion of Gypsy-12_CR LTR retrotransposon. CC Terminal parts of the internal portion are derived from Gypsy-8/ CC Gypsy-9_CR-like autonomous element. XX SQ Sequence 5773 BP; 1151 A; 2005 C; 1455 G; 1162 T; 0 other; gggggagttg cgggaaattg tcctgagctc ctctgcgtcc ccttatcgtc attcccagct 60 ttcctggcta cgcccggatt tatccgtcta ctgcgacagc taaaatggtt gactccgacg 120 ccgaatccac ttcatctgcg tcaagccggc gttcgagccg ttctacccaa ctcaagaagg 180 gcatttcgcc ctctgctctt aatgctgtcc ccacgttcaa cccctcctct gatgacccga 240 ctgattggat cgattctatt gaacgggttg gacaggtgtt tggttgggac gatcccacca 300 aactagccat tgcccgctgc cgcctgctac attctgccag ggtttgggac tctgctggta 360 aggacatcga ctcctggcca aagttcagtc aagccttcct cgaccgcttt gccctacgag 420 aaacagaaat ctatggccaa ctggcatcct gccgccaggg cctacgcgag tcagtacgtg 480 attactccga ccgtttccgc tatcttctgg cccgtcttgg tatcaggcct ggccgcgatt 540 acatgtacca ctacctgcgt gggcttcacc cccgcaccta ccgcaaggtt cacctcttgc 600 gccctcaaac tctggaagat gcgattgacg gcgccatcta tgtcagcgaa gttgaagatt 660 gggacaacgc cagcacaggg cgcagcgccc cggcacctcc cgaacgcagg gtgcggtttg 720 atgaacaacc gccaaacacc cgcagcctac cctctccggc accccccacg cctcctcaga 780 ctcgccatga ccgccaccgt ccactttcat cctccctctc ccgccgtgat gacagcggcg 840 tacgcccccg ccgtgatgag aagtatactg accgttcgcg catgccgccg cggtccgcct 900 ctcccccagc cccagctcct gactcctttg agcgtgagct ggaggatctg cagtctaagc 960 tcgccaagat gacgttggcc taccgtcacg gccatgattt gtccaccaat atgtatgaac 1020 gtgaggagga tggtgctgtg cctgcatacc tctctcaccg ctacgctagt gactccgact 1080 ccagaaccga ggatggcagg gagtcagagc tgtactcccg tgaacacgcc gtcttcacca 1140 ccaagcgtat cagcgattcc gaacccgtac gcttgcccaa caagcgtgcg cccgtagaca 1200 ccgacgccca acctatggag atcaacaatc ggccgccgcc gcgccgcgtc agggctaccc 1260 gtgccccgtc caaggcccct gccgacacgc cagtcccctc aactcaccga gccgcgggtc 1320 ctggggtgcc catagatacc tctcccgatg ctgctgccag cagggcggcc gtgccttccc 1380 ccgcgcgcaa agacggcaat gacaaatgcg acagggtttc cgctggccat cgtgacgttg 1440 gccgcgcaat ccgtagtccc gttggtggga ctacccgtgc cgaaaacacc ctggaaacac 1500 cgtcttcaac tcctcctcag cagcaaccgc cacctgcggc ggtccctctc atcccggcta 1560 cggccgagcc aagcccgccg ccagtctcct gcacaccttc ttggcctctc gttctcaaga 1620 cttaactgct tcggacccta tgcccatcct ggatgcgcac accgcacggt gggattctct 1680 gcctctctac aacctggaca gccgtctccc caagttctca gtgtgccggg ctacggtcac 1740 actaaccaac gtccgtggag agaaagagag catcagggca gtggttgaca ctggcgcctc 1800 ctacagtgtg gtgcctatgc acatcgtgcg ccgcctggac ctgttgcctt atgtcgctga 1860 ctcacgcgcc atgttcctga atgccgatgg tatcaagaag aaggcaaccg gtcaagtacg 1920 tggcatgcag gtggggctgg gccaagatct gatgtacact atggacatgt acgtgtccca 1980 agcccaaaac tacgatttac tgctgggaat ggacttcctg taccccatca aggcttccgt 2040 ggatttcggc ggccagcggc tgctgtatac caacgactac aaccaacgca gtcacgtgcc 2100 cattgcatgt gtggacgtcg aggagcctcc tcccctcgag acctccctgc ccgggctgca 2160 aggaatgcca tctatcggat acaattctgg gacctacggc cacggacggg atgcaatccc 2220 cgccctgcaa ttcgtcagcc ctgtgggcca cctgaaagag aaggcaggta aggcactgta 2280 taccctcaag agcgccgcca tggaggctgc ctctactgcc tcgtccggca tggaacagct 2340 actccaacac ctgccgcgta ccgctaaggt gcgctcttgg ttgtttacca cctgggagaa 2400 cacgttgaca aaatggacag ccaaggagta cggcaccgtg tcaaaccaga ttacacacga 2460 cgccgccgtc gtagcaagca cccctaagac gcccctttct tccgacagcc cttcaaacaa 2520 gaccgctacc gtgaagaacc tgcctctcct cgggacgccg cccttggcgc ccgtgcgcat 2580 cccgcgccct aacctttccg ccagaccaag ccctagctta cccccaactt ctcctggcct 2640 cctctacgat gcacccgcac acttctcctg cggcatctcg ttgatagaga tgccggcgac 2700 tgccggcgaa tctgcagcgg ctgaggcctt aattgccttg gcccatcagg ctggcgcgtt 2760 cctgccacct gaagcttcca acggcgaatg ttcccctcct ccttctcgct ccgcctctgc 2820 gtcgctgaac gtgttggaag gcgttaaagc cacatgtcaa cgtgtgggag acgagttacc 2880 tgagctttgc gacggcgcca cacccatcat cgccgctcct ccgccgtctt cccgtgctcc 2940 aagcaagatg tccccaaccg agcaagagca gattcgtgca gccgccttac cgccggtgcg 3000 tgcaaaccca cttatgacct gcgatcagct ggcccctgca gcatcccagc aggagccagg 3060 agttgtgccc ctcttcctta gcgacgggaa cactgatgcc cgccgtatga attttatctc 3120 agactcccca gctcggcctc ctcttccacc gccccagcag gctgccaaca cgcccaagga 3180 agctacgtcg tctgagaaag ttacacccga accggaacca gcgcctgagc ctgagcctgg 3240 ccctgaacct gatgatgagc ctgaacccga accccgcacc ctgccacgac gcgtaaaggt 3300 gcgctctgag gctgcacagc accctcagtc tgccgcggcg gttgcccttg ccaccgctgc 3360 cgccactcca cctccagcac tccagccaac cgtaacggga gcccctacaa ggcctgtgtc 3420 ggtagtgttg ccgccgcgcc gcgtcaatgc tccccaatta ctccatacag acttgtcaac 3480 gccgtccgtc agccaagaga ttattgcgac cgaagagccc aacgctgacc cgaccttgtt 3540 ccctagtgga gacggacgtg atccaagcca agtttccgta caccccaagc cggctctcga 3600 ggagtcgcca gccatgcgta cccatatgct caaccgcaac atgtttgatt tcgagatgat 3660 accccctgac tgctgtgaca gcgaacctac ggaggaccta cagtgtgaca tctgcggcaa 3720 aggagagccg cccccaaaca gcctgcttat ctgcgacaga cctgctgaac gccctgccca 3780 tgatggctgt gacccgtact gcctgtggcc gcgggtactt aacgtactat atggctcttg 3840 gtattgtccc cgacaccaga tggaaacggc cagcctctca ggagggcgtg gctcgaacaa 3900 gggcccggta ccggccttac ccggtcgtgc gccgagcata ggcccggtac gggccgccgc 3960 cacggcgggt gctgttgcgg ccgcggcggc tgccgctgca accccgcaaa acgacggcgc 4020 tgtcatccct accgccgcca ccgtcaccaa gactcaaggg cgcctgcgcg ccaccgcaca 4080 tggaatggaa tcccatccgg ataaacagtt acgcgatctg ccccccgcta ccaaattgtc 4140 cgtccaacgg cccctacgtg tacagaagct ggcaactgcc actgtacctc ctatcgtcct 4200 gcatcgaccc aaaacggacc gcaaccgata ccctctcacc agtctccccg accaaggtat 4260 tttattttac tgctggcgca cgagcccaat caaatggagg ggcctgcgtc tcctacagcc 4320 gctaagcaac gctggtcatc aatcgcctgc cgcacggctc aagatcattt caacgcgttt 4380 tatgtagtgc aagccgacct gccgccgcaa gcaagcagaa atgcctgggt cagcacgcta 4440 ctgggtcatg caagggacgc atgactgaag gcccggtagc gacagctgtg gcgacaaaag 4500 atgcacttcg taaatccgcg gctgacccat ctttgctact gcagaggaaa gagtagaaag 4560 tgccgcactg caacaggttc ctgctttccc caatccccgc cagtgcgggt gtgtgtgtcc 4620 catgccccct cgggtagccc aggttcgcca acaccgcccc ttccaccaaa ggcccacggt 4680 ccagcaagga tcccgtcttg agcatggcga gcccaagcca ctacgtatag cttccagcgg 4740 cgccgagacg tgagtcgcgg cgctgctcgt ccaagttcca ccgaccaaga ccaatcaaga 4800 tccaaggcca ttgtggatca gcaaaatatg ataaatagta ttgagcgaga tgatgagcat 4860 agcagcggca ttaggccgcg tcgagttggt tgatctggta gttcgcgatc gaagtttccg 4920 gacttgctcg atgtttctag tggcgcccca ggctagccta gaattttcga agaagttgtt 4980 tagcacaccc accatcgagc gacgttttgg gtagttgacg aggtcggttg aaggtctggc 5040 acgcccgcca ccggtagtag ttacctagcg atatctaggg gcctgacgca cattccaggt 5100 cgactcccac agctgcacag cacctagcgt tgcctgagga agcctaagtc tacgaccaag 5160 cagccagaga cgctagcggt agccgccccc ctcacaacta ggtcgcacca agcctgtgct 5220 tagctggctc cttgcccgaa cattcgtggc caactctgac tccacagcct ggttgcatca 5280 aggcaccagg gctcgtttgg cgtgtgctgt gtcttgcggt cacaacccgt ggctgacggc 5340 agcatacttg taacggttga gtgcaaatcc ccctccgcgg gttgcccctt ccccttctcc 5400 tgcgaccgac gcgtgcggac cggactggac ctaagccctt cgccatagtg cgcggcctct 5460 agtgtgtcag gcttcccccg tcaccccggc gcctacagtc tcccttctcc cttctccggt 5520 ttcccttcga gattgcgact tcctaccgtg atccagcctc tgcctccctt ctcccttctc 5580 cggtttccct tcgagattgc aacatcctac tgtgatccag cctctgcctc ccttctccct 5640 ccaccggtct cccttcgaga ttgcgactga ctaccgcacc cggctcctgc tttcttctgc 5700 cagccatctg gtctgcatac cgtatccacc agccttgggt gccagcagct cgcatctgtt 5760 gtaccgactt gcc 5773 // ID hAT-N3_CR repbase; DNA; CHL; 468 BP. XX AC . XX DT 13-MAY-2004 (Rel. 9.04, Created) DT 13-MAY-2004 (Rel. 9.04, Last updated, Version 1) XX DE hAT-N3_CR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; 8-bp TSD; KW hAT superfamily; hAT-N3_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-468 RA Kapitonov V.V. and Jurka J.; RT "hAT-N3_CR, a family of nonautonomous hAT transposons from the RT green algae genome."; RL Repbase Reports 4(4), 121-121 (2004). XX DR [1] (Consensus) XX CC About 30 copies of hAT-N3_CR are present in the C. reinhardtii CC genome. They are ~98% identical to the consensus sequence. CC hAT-N3_CR is characterized by 14-bp TIRs and 8-bp TSDs. CC Some copies are less than 1% diverged from each other. Therefore, CC it is likely that this family and its autonomous counterpart are CC still active. XX SQ Sequence 468 BP; 99 A; 116 C; 139 G; 114 T; 0 other; cagggttgcc tagttcgtag tacgcgtact atgtactacg tactgtacta cgtactgtac 60 gctgccctcg tcggctagta cgttggaaaa tacgttgtgc tggggaaacc agcggggtgg 120 gccgtggcag cgccgtggct gggcgagggg cgcgaggggt gctggaaagg atgtccgtcg 180 aagtttgcga accatggctg ctctccgcac tgcagctgtg ttctcttaca acaatatgag 240 cataataagt tgaaggaaat tccgctagaa gactatgacc gcgcaaatcc aaattcacac 300 cgattcgggc atggtgcggg cacgcgtgcc gtgtttgcgt cggcttcccc cccggatttc 360 ctggcccagc gtattatgca ccgtatttta ccgtactgta ctaggaatcc tagtacgctg 420 tattaatacg cgtattatgg agagagggtc gcgtactagg caaccctg 468 // ID MSAT-2_CR repbase; DNA; CHL; 184 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE MSAT-2_CR is a minisatellite unit - a consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-2_CR; Interspersed repeat; KW minisatellite; nonautonomous; tandems. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-184 RA Kapitonov V.V. and Jurka J.; RT "MSAT-2_CR, a family of minisatellites from the green algae RT genome."; RL Repbase Reports 5(3), 57-57 (2005). XX DR [1] (Consensus) XX CC The MSAT-2_CR is family of minisatellite DNA. XX SQ Sequence 184 BP; 37 A; 67 C; 41 G; 39 T; 0 other; tggggtccac accgcccagc catcaccaca caccccacct gccacaccca cccttgtgca 60 ctgttgtttc acattttcat atgtgcatgt tgcctgacct atttgcaatg cagacacgag 120 cagggagcca tgttgccagc cctcacagtg ccttcagtgc ccctgcacgc ctggacaagg 180 cggg 184 // ID EnSpm-N3_CR repbase; DNA; CHL; 6787 BP. XX AC . XX DT 28-MAR-2006 (Rel. 11.03, Created) DT 12-APR-2006 (Rel. 11.03, Last updated, Version 1) XX DE EnSpm-N3_CR is a family of En/Spm nonautonomous DNA transposons - DE a consensus sequence. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N3_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-6787 RA Kapitonov V.V. and Jurka J.; RT "EnSpm-N3_CR, a family of En/Spm nonautonomous DNA transposons in RT the Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(3), 122-122 (2006). XX DR [1] (Consensus) XX CC Copies of this transposon are less than 2% divergent from the CC consensus sequence, which is ~75% identical to EnSpm-N2_SP. It CC also contains an array of the MSAT-3B_CR minisatellites. XX SQ Sequence 6787 BP; 1061 A; 2076 C; 2348 G; 1288 T; 14 other; ctctagcact ataaggacgg gtttgtgcgt tacgcatgct tattgtgcgc tacgcatgcg 60 tctgggactg gacgcatgcg tgccagaacg catgaagcat gaggggcggg gggccgggtc 120 gggcgcacag ccgtgcgtac aagcacgcat gcgggccgca tgcaggcgca tgcgcaccga 180 cctgacgcaa gcactaagag acgcacgtat tcctgtaagt cacgacatag tcgtagacag 240 caagcacggc tcgctcgctc gtagggtcgc gtgatcgccg gacgcggcac tccaacactt 300 caacagcgtc agaccctgct atatgatgca atgtgtatat caaaacgaac agtataagta 360 ggctagcagg ccttgcgaag tattcactcg gacatagtgc tagactttta gtctttagac 420 ggtcgggcct aaacgagtga acgcatgtgt atatatggta tgctgtacta tctgggcgtc 480 gcaccaagca accagcatgg aggctgcacc tgacaaggcg ggttgtgtgg ctggcgtgta 540 gagggctggc cactgccgct gccgcgcact gctcgcaagg ccggggactg cgaggcagct 600 caaggtgtgc gtagagtgcg taggcgtggt tgttgctgcg acagtgcctt tgttaagcag 660 ccgcggcggg ggttatggtt atgagatagc actggcgtgg gactgggcgt gcacgatgaa 720 ggtcgcgggc ggcgacggcg caactgagtg caacggtggc actgcggtgg cggcccccca 780 catgccatgg ctgccgttgt cgccagcgct aacgttggac gcgtgctgcc ccccgccggc 840 gtcacccccg cggctgcacc atcagctgcg ccatcgcatc aggcgacgag acagcactgg 900 cgtgggsctg ggcgtttgtg attggtgccg cgggcggcgg cgtcgtggga gagcgcaacg 960 gcggtgctgc ggcggtggcc ccccacaggc catggctgcc gttgtcgcca gcgctaacgt 1020 tggacgcgtg ctgccccccg ccggcgtcac ccccgcggct gcaccatcag ctgcgccgtc 1080 gcatcaggcg acgagacagc actggcgtgg ggctgggtgc ttgggattag tgccgcgggc 1140 ggcggtggcg ccagtgagcg caacggcggt gctgcggcgg tggcccccca cakgccatgg 1200 ctgccgttgt cgccagcgct aacgttggac gcgtgctgcc ccccgccggc gtcacccccg 1260 cggctgcacc atcagctgcg ccrtcgcatc aggcgacgag acagcactgg cgtgggcctg 1320 ggcgtatgtg attggtgccg cgggtggcgg tggcgtggga gagcgcaacg gcggtgctgc 1380 ggcggtggcc ccccacatgc catggctgcc gttgtcgcca gcgctaacgt tggacgcgtg 1440 ctgccccccg ccggcgtcac ccccgcggct gcaccatcag ctgcgccgtc gcatcaggcg 1500 acgagacagc actggcgtgg ggctgggtgc ttgggattag tgccgcgggc ggcggtggcg 1560 ccagtgagcg caacggcggt gctgcggcgg tggcccccca caggccacgg ctgccgtcgt 1620 caccagcgct aacgttggac gcgtgctgcc ccccgccggc gtcacccccg ctgctgcacc 1680 atcagctgcg ccgtcgcatc aggcgtcaag acagcactgg cgtggggctg ggcgtttggg 1740 attggggccg cgggcggcgg cgtcgccaga gagcgcaacg gcgtggggct gcggcggtgg 1800 ccccccacag gatggctgcc gttgtcgcca gcgctaacgt tggacgcgtg ctgccccccg 1860 ccggcgtcac ccccgcggct gcaccatcag ctgcgccgtc gcatcaggcg acgagacagc 1920 actggcgtgg ggctgggtgc ttgggattag tgccgcgggc ggcggttcgg gtgcaacggc 1980 ggtgctgcgg cgtcccccca caggccatgg ctgccgttgt cgccagcgct aacgttggac 2040 gcgtgctgcc ccccgccggc gtcacccccg cggctgcacc atcagctgcg ccgtcgcatc 2100 aggcgrcgag acagcactgg cgtggggctg ggcgtttgtg attrgtgccg cgggcggcgg 2160 cgtcgccagt gagcgcaacg gcggtgctgc ggcggtggcc ccccacaggc catggctgcc 2220 gtcgtcgcca gygctaacgt tggacgcgtg ctgccccccg ccggcgtcac ccccgcggct 2280 gcaccatcag ctgcgccgtc gcatcaggcg acgagacagc actggcgtgg ggctgggygt 2340 ttgggattag tgccgcgggc ggcggtggcg cagtgagcgc aacggcggtg ctgcggcggt 2400 ggccccccac aggccatggc tgccgttgtc gccagcgcta acgttggacg cgtgctgccc 2460 cccgccggcg tcacccccgc ggctgcacca tcagctgcgc cgtcgcatca ggcgacgaga 2520 cagcactggc gtggggctgg gcgtttgtga ttggtgccgc gggcggcggt ggcgccagtg 2580 agcgcaacgg cggtgctgcg gcggtggccc cccacaggcc atggctgccg ttgtcgccat 2640 cgctaacgtt ggacgcgtgc tgccccccgc cggcgtcacc cccgcggctg caccatcagc 2700 tgcgccgtcg catcaggcga cgagacagca ctggcgtggg sctgggcgtt tgtgattggt 2760 gccgcgggcg gcggygtcgt sggwgagcgc aacggcggtg ctgcggcggt ggccccccac 2820 aggccatggc tgccgttgtc gccagcgcta acgttggacg cgtgctgccc cccgccggcg 2880 tcacccccgc ggctgcacca tcagctgcgc cgtcgcatca ggcgacgaga cagcactggc 2940 gtgggcctgg gcgtttgtga ttggtgccgc gggcggcggt ggcgccagtg agcgcaacgg 3000 cggtgctgcg gcggtggccc cccacaggcc atggctgccg ttgtcgccag cgctarcgtt 3060 ggacgcgtgc tgccccccgc cggcgtcacc cccgcggctg caccatcagc tgcgccgttg 3120 catcaggcga cgagacagca ctggcgtggg gctgggtgtt tgggattggk gccgcgggcg 3180 gcggtcgtcg tcgccagcgc tactggacgc tcccggggct gcactgctca gcagtggcgt 3240 ggatgtgccg cgggcgcggc gtcgtgggag agcgcaacgg cggtgctgcg gcggtggccc 3300 cccacaggcc aggctgccgt cgtcgccagc taacgttgga cgcgtgctgc cccccgccgg 3360 cgtcaccccc gcggctgcac catcagctgc gccgtgcatc aggcgacgag acagcactgg 3420 cgtgggctgg ggtttggacg ttgggctgcg gcggtggccc cccacagcca tggctgccgt 3480 cgtcgccagc ctaacgttga cgcggtgccc cccgccggcg tcacccccgc ggctgcacca 3540 tcagctgcgc cgtcgcatca ggcgacgaga cagcactggc gtgggcttgg gcgtttgacg 3600 ttggggcagc gggctggccg ctgccgctac tcacattccc ggggactgcg aggtagctca 3660 aggcagtgga taggtgcagt tgtggctgcg acgatgcttg tccgcagcag cagccgcggc 3720 aggggctatg gtcgcagacg ccatatcacg gtgcttttaa gcgcctcaag tcctgtgtta 3780 tgagtttgag gattgcacag caccgagccc cgaccacaaa gcgtttcccc ggctgtcccg 3840 tgaaaggcgc taggcacgag cggcggtgcg tgtgcctgag agctttaccc cgtatccagc 3900 ccagggcata aggtgttccc ccggctgtcc ctaaggcgtt gggcgtgagc ggcggtgcgt 3960 gtgagagctt taccccgtat ccagcccagg cacaagggtt cccccggctg tccctgataa 4020 ggcgttgggt gtgagcggcg gtgcgtgtgc ctgagagctt taccccgtat ccagcccagg 4080 gcacaaggcg ttcccccggc tgtccctgat aaggcgttgg gcgtgagcgg cggtgcgtgt 4140 gcctgagagc tttaccccgt atccagccca gggcataagg ygttcccccg gctgtccctg 4200 ataaggcgtt gggcgtgagc ggtggtgcgt gtgcctgaga gctttacccc gtatccagcc 4260 cagggcataa ggcgttcccc cggctgtccc tgataaggcg ttgggcgtga gcggcggtgc 4320 gtgtgcctga gagctttacc ccgtatccag cccagggcac aaggcgttcc cccggctgtc 4380 cctgataagg cgttgggcgt gagcggcggt gcgtgtgcct gagagcttta ccccgtatcc 4440 agcccagggc acaaggcgtt cccccggctg tccctgataa ggcgttgggc gtgagcggcg 4500 gtgcgtgtgc ctgagagctt taccccgtat ccagcccagg gcacaaggcg ttcccccggc 4560 tgtccctgat aaggcgttgg gcgtgagcgg cggtgcgtgt gcctgagagc tttaccccgt 4620 atccagccca gggcacaagg cgttcccccg gctgtccttc ataaggcgtt gggcgtgagc 4680 ggcggtgcgt gtgcctgaga gctttacccc gtatccagcc cggggcacaa gacgttcccc 4740 cggcttttcc tgctgcggca ctgggcgtga gcggcggtgc gtgtgcctga gagcagtgcc 4800 ccgtatccag ccccgagcac aaggcgttcc ccggctgtcg gcgttgggca tcagctgagg 4860 cgcgtgcgca gagaaggccc cgtgtgcgtt catattaatc cccctttcaa caccgcacat 4920 ggcaagacaa gcaagtatca agtatgcggc caaccaagtt taagtgttgc atggggcact 4980 gcttccaaac tctttgcggc acttccttct ttgccagttc aatttatata tactgtatcc 5040 ttctatgcat atcacacatt actaaagact acagacacgt caaggtgtca gtatgtagag 5100 acccgcaaac agttctctgc cactcggcag aggacgccct cgccctgcta tgtttgcttt 5160 taaggcccag tctccactag taacgctatg tggcgcacag gtgcgccttg cagcggccat 5220 aacgctgggt tgttgcaggg gtggtgacgt cgttgaccga gcgcaatatt tatcgcccac 5280 cttcaattcg ttcaggttca ggctcaacaa aagcatttca aaaggctaca gagccatcac 5340 gcgtggcgca ggtgaggcgg gggtaccatc cgcgcgactt ggttgctagc ttccgaggct 5400 gtcagctgag agcatgcgca tgctaaggtg tctgtataat ggcaccgtac aagcctgtca 5460 tcagttcatg ctcagttctc agtcctggta ccctcccgcc ctccctcagg actccggccg 5520 cacgaagggc cacccgtgcg gaccacccct gggtgggcta gacgtggcgc actcacacgt 5580 taaccccaac aggcaagtct atgtccagat attatttcgc aattgctgaa gtaagcactg 5640 catgagtagc gtgtctgcag ctgaacatgg ggggaaaggt accttgcgtc ttaccctctg 5700 atatgcctgg ttttgtggtt tggttcctga ccgaatcgtc ctgaacacta tgcaggctta 5760 cgaagggcag cacgggtgtg gctagctgtg gactggagcg ctggcaccgg cagcatgcgg 5820 agtctgcggt ggcactgtgc agcaggtcgg acgttgtggt tgaggtcgtt gtgtgtttgg 5880 caagacttag tatgggtcca cctgtgccgc ttatgccttg gcgggtggcg tttggttgtt 5940 ccgcggaagt ggctacatgc acatctgtgg ctagctacgg actggagtgc cggcaccggc 6000 agcatgcaga gtctgcggta gcactgtgca gcaggtcgga cgttgtggtt gtcgttgtgt 6060 gtttggcaag actttagtat gggcacacct ctgccgctta tgccttggcg ggtggcattt 6120 ggttgttacc cggaagcggc tacatacgca tctgtattca tgttacggta gggcacgttc 6180 ggtgagcgag gaggtcgatg ggtgacaggg catgtggagc gggcagactt cctttccagg 6240 gtgtgcgatg gcggactaaa agcctaagga gaggctggtg gatggaggcg gactcacgcc 6300 ttcagagaaa caggccgaat cctcgtttac actggacgcg cggcacgtgc cgttgcatga 6360 actgtaacac aaatgagcag ttgagggaat tggcttcgtg tgccccacgt gtgcgtgcat 6420 gcatgcgggt gggaagcgtg cagttggggg cagctgcttg cgttaggtaa cgcacatgcg 6480 tcccgcatgc gccatggcat gcgtagtact tcacgcaacc ggcatatgtg cttcaagcat 6540 gcgtatggca tgcgttcggg acgcatgcgt acatccagaa agagtcgctg cggccgctag 6600 ccaacaccag catgcgttag gtaacgcatg ctctggtagc gcacgcatgc gtatgggcgg 6660 gcaccgcatg cgtacagcac atgcttttca cgcgttaagc atgcgtttgg gccatgcggc 6720 acgcatgcta acgcatgctc taacgcatgg acccgtcgtt atagtgtagt gctcggcccg 6780 tgtcgtc 6787 // ID LTR3_CR repbase; DNA; CHL; 357 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE LTR3_CR is a long terminal repeat of unknown LTR retrotransposon. XX KW LTR Retrotransposon; Transposable Element; 5-bp TSD; LTR3_CR; KW solo LTR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-357 RA Kapitonov V.V. and Jurka J.; RT "LTR3_CR, a long terminal repeat from the green algae genome."; RL Repbase Reports 4(5), 128-128 (2004). XX DR [1] (Consensus) XX CC The genome harbors about 20 copies of LTR3_CR; they are 99% CC identical to the consensus sequence. LTR3_CR is a solo LTR. CC The LTR retrotransposon it was derived from is not known yet. XX SQ Sequence 357 BP; 81 A; 100 C; 104 G; 72 T; 0 other; tgtaacggga gactcgctcc cttgtcaacc taggcgacaa tcctcttgtt acattggtgc 60 gaaatgccag gagtcaccct gactgttgag gtcccaggat cgatcctgga tgggtgcatg 120 ggtccgaact agtgcacaag agggttacac cctccttgcc ctgaggtgga tggcccgtga 180 gagcgaggca acaacgcaac tgggccaaag gcagtaaacc caccccgcgc ctgcagtgag 240 cctaaaaaca tctacggatg cgctggtggt ctcacgagga cgttcgcttc caggggggga 300 ggactgtaac gggagactcg ctcccttgtc aacctaggcg acaatcctct tgttaca 357 // ID TE2-7a_CR repbase; DNA; CHL; 743 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE TE2-7a_CR is a subfamily of nonautonomous transposable elements - DE a consensus sequence. XX KW Transposable Element; AAC target site duplications; TE2-7_CR; KW TE2-7a_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-743 RA Kapitonov V.V. and Jurka J.; RT "TE2-7_CR, a family of nonautonomous transposable elements in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 142-142 (2004). XX DR [1] (Consensus) XX CC TE2-7a_CR is a subfamily of nonautonomous transposable elements. CC The consensus sequence is less than 5% divergent from TE2-7a_CR CC copies. Classification of this family is not known. It is CC possible that it belongs to a novel superfamily. Features of this CC superfamily include preferential insertions into AAAC targets CC that are accompanied by the AAC target-site duplications, a CC 18-21-bp CC palindrome separated by 3-5 base pairs from the 3' end; and the CC AGCC CC 3' termini. XX SQ Sequence 743 BP; 146 A; 215 C; 233 G; 148 T; 1 other; acctaccgct acgcagtacc ccggcttgcc ccctccccct gtcccattgc gcaccattgc 60 aggccaagta tgacgggaat tagcccctga gccgagctat cggctatggg ctcattccca 120 acgtccagtt cagcgcagtc gcagttgtgc gaacggggtg ggaggggggt gcggggggag 180 ggatgcccgg tgcccggact gtgcgagcca aggcgagaga gcgcgggcgg ggcgggggcg 240 ggggcgggcg ggtggggggg tgggggcccc gagatgttgc acatgatttc gcgaaatcgt 300 caccacgata gtatgcaccg ttgataccac aaaatcagcg ctgctgatgc atcgagatga 360 agcaaacggc gtcgctgctt ttgccgatcg gcgagggcgc atgcgcttcg cactcacaat 420 gtcttgtaaa atgtttattg ccttgaggta tcatcgtcta ccacaagtcc gcggctgaca 480 agaattgcaa cccgatggcg ctatcgagcg ccgggatcct ccaatccctt cgcctctagt 540 ctgacgtctt cctcgcgttt ccgcaaagtt atgcatcgct tgggacaaat tgaawggcgt 600 attatttgca caagaactct ggctacgttc gggtttcccg acgtaactgc acataaaact 660 ggaataaccg agggggcccc gcctgggact cgatgcgacc gcaatggctt ccgcccctcc 720 cccttcgggg gaaggggcaa gcc 743 // ID DNA-8-2_CR repbase; DNA; CHL; 2426 BP. XX AC . XX DT 29-MAR-2006 (Rel. 11.03, Created) DT 29-MAR-2006 (Rel. 11.03, Last updated, Version 1) XX DE DNA-8-2_CR is a family of P nonautonomous DNA transposons - a DE consensus sequence. XX KW P; DNA transposon; Transposable Element; Nonautonomous; P-1_CR; KW DNA-8-2_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-2426 RA Kapitonov V.V. and Jurka J.; RT "DNA-8-2_CR, a family of P nonautonomous DNA transposons in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(3), 110-110 (2006). XX DR [1] (Consensus) XX CC This is a consensus sequence of a DNA-8-2_CR family of CC nonautonomous P DNA transposons. DNA-8-2_CR elements are less CC than 2% divergent from their consensus sequence and they are CC characterized by 146-bp terminal inverted repeats and 8-bp target CC site duplications. DNA-8-2_CR is a nonautonomous derivative of CC the P-1_CR transposon. XX SQ Sequence 2426 BP; 463 A; 684 C; 761 G; 511 T; 7 other; caggggaggg gggtgtgcaa ctcggcctgg atccagggtc gcgctgcatg ggacccggcc 60 aaaaggtgtt cggacccctc ccaaggagct cccggccatt catttgtgta tgtaagacaa 120 ctgtatagtc attagaactg atagcgaagg gccgcaggkc cccctccaac atttcgaggg 180 tttagagcca ttcctgagcg atttcatgac ttgacgtgtc ctatggcgtg aaatactgaa 240 atgaactgga acgctcgttt cgcggggatc tggcgtgtcc tggaggccct gggcgccctg 300 cgagcgacgg tgagaccagg ggagaacggg gaaatggacc ctttgcgggc tcgctgagcc 360 tcctaaggag ctgccagcgc cgacctaggc atctgcatcc caataccatg tcccagcggg 420 ggtgggtgcg gccttaggag gccaggaggg tgctggttgg gccgtttcca ggaccgcatg 480 gaaaggcccc cagcgcggca caagctcggg tggcgtcggc tatcgctcct ttcccgaggt 540 taggtgtcag aatggagcca actgtatgag aagccagaag cgcgaatccc gaccatgttt 600 actaggagta tcaggccaca cgcgatgtcc ggcgaaatgc gacgattcag ttcacagggg 660 gccaaatcgt cgcagccata tcactgcgcc tgcacgattt cgatatgtgt tggtggtact 720 tggaccatag cgcgcatgcc ggggtcctta gagggaggca ttcttcccca ccagaatgca 780 ggcgcaattt tgcatggcgg gcatcgcgcg ttcctagata tcatgtgctt aggtgtcaga 840 atggagaaca tgtatatggg agcccatagc gcttttcaga ctgcaaaaag gttcggcgta 900 gtgttcacac cagatgtccg gggactttgg gtgcacgatt ggaccggtac tgggctacac 960 cgggctatcc ctggcatgtg ggcagggctg tggttaggcc acagttgcgg ttcaggatgt 1020 tattacatgg cctgggggct tcgggcgggt ggtctcccct gtttacgtgc gaaaacctgt 1080 tgggctgggc tgttcctcgc tgccgcccgc ccatcccgca tcaggccctc ccctgacccg 1140 catcctgccc cgcgcagttc tccgttggtc tgcctcgatg agcatggttg ctgttgacgg 1200 cggcggcggc agcagtcatg atggcggcaa ctgccggtgg agacaactgc ccccacgcgc 1260 gggcgcacat ggactgcctg cckcrgacgt gccgcctccc gtgcagcggc cagtggcatg 1320 gcgcacgcaa gcggcagccg cccgcacgcc tgaggtacgt ggaagtaaat catgctttay 1380 tcctctgcca gccatggtct ggctgattgg caacaacgag gggacacttt gggattgcgt 1440 cggcagggca cgtgagtgat cctgggttcg tgtatctgac acgaaccggg acacaagttc 1500 cactgttgag tcctgataac tcaagaactg caagtgcact tcgttgtagc cgggggaggg 1560 gcctcccctg gttgtagtcg cctaggctga gtgcgttggc cgtgttgcgc ggcacagtgg 1620 cacgcacggg ccttcaacgc ccatcccggt agttctcact acctccctca gttggaacag 1680 tcagctaaag gaagcacgca gggcagcgta agacatttgc aacttgcacg ttcgttagga 1740 gcagcggggc gcagggacct ggcttgggac ctgggaaggc caggagatgc atacctraca 1800 ttgtcatcta aaggggcaca ggggaggcat gacacgcaat atawgtagca tgtatgcgtt 1860 ttacgtccgg tgaggtgctg ggggatcccg gtccaggttc cctctcctcc cagcgctccc 1920 aggcgctcct ccagggctgt cttccggcgc gcagacccgc gcacccctgt taacaccaat 1980 agacacggac caaacccctg aatatggcat ctgtacgtat gtctagagcg gacggcaggg 2040 gccgctggct tcggcgggcc tcggaaaggg ctccaaaacg gccatgtctc tgaagggtct 2100 gcccagtctc tggtctcgca acctaggagc tgcgcccgtc gctttctgga ggtctcatca 2160 gtttcatagt tagcggaaga gcctttgggg tgattgcaag cagccggaat ttcgctcgtg 2220 gtgtttccag tgcgcgcaat ttgtcaatgg ccggcccgct atatggactg gtttctggcc 2280 cgctatcagt tctaatgact atacagttgt cttacataca caaatgaatg gccgggagct 2340 ccttgggagg ggtccgaaca ccttttggcc gggtcccatg cagcgcgamc ctggatccag 2400 gccgagttgc acacccccct cccctg 2426 // ID NonLTR-6_CR repbase; DNA; CHL; 1226 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE It is a putative nonautonomous non-LTR retrotransposon - a DE consensus sequence. XX KW Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW Interspersed repeat; NonLTR-6A_CR; NonLTR-6_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1226 RA Kapitonov V.V. and Jurka J.; RT "NonLTR-6_CR, a family of non-LTR retrotransposons in the green RT algae genome."; RL Repbase Reports 6(5), 261-261 (2006). XX DR [1] (Consensus) XX CC This is a young family of non-LTR retrotransposons, characterized CC by frequent truncations of 5' terminal portions and insertions CC into (CA)n microsatellites. Average length of target site CC duplications was not determined. Numerous copies of NonLTR-6_CR CC are identical to each other; probably, this family is currently CC transposable. XX SQ Sequence 1226 BP; 205 A; 378 C; 386 G; 257 T; 0 other; aacacaaggg cactacctgt gtcatctgtg gatcgtcagg ggccatcagg gcttcaggaa 60 gtcaggacgg gtcttggcgc ttctggctgg ggctcgcttt tccaatgagg gacctatggg 120 cgggcctttg ggctgagggc ttttaagccc ccgggtattg cttttgctaa actgtactgt 180 acctttcaaa tgggccgggt gcgcggagct ggggtagttt gggcgacctg gcggactggg 240 cgacgacgag ctcaacttca acaccttcgt cgccctgttc ccgctggtct cggtaccagc 300 accacgctat ttgtctctta agcccgcgtc acaggtccgg gaccccgtcc ggtgcccgcg 360 atgcatgcgc gatatgccca tcggtaccgc tcggtcacct tctgcgacca cctgatggct 420 ctggtggctc tggcggacgg cggacctcgt cacggagtac ggccaggtcg ccgaggcgta 480 cggcgcggca ctacccgcga cacgtcgcgg cgcgtgttca ctatcagggg ggtaactgga 540 ttctgggggg tgggggatgg ggggctcgtc acgaacagcc gacgcggcca ctgcggccgc 600 gagcgtgcct agcctcggtg ccgccgtgga gtggggagcg ggaggccagc tcccgccagc 660 ccacggcgca cacacgcaca cgcacacaca ccagcttgac acgtctggaa acgctgcaac 720 cgccgcccgc cacgtccccc atgcagcctc gaccccgttt agtcgcagtg ccaccgctaa 780 ctcgccctct tccaatagtg ttattgtact tagtgggcgg gatttgggac gcgtgatacc 840 tagcttgtct gctgcacgcc gcctctcttg tcgccgttgc cattcacatg cccgtgacac 900 ttcctcgctg ccgccctatg cccatgcttc cacttttgcc tataggtgag ctcgactacg 960 accagcgggc gggaatccgg gaattaggcg tgctgggtaa acgttgttta gcgcgcccag 1020 ggtgcggagc atggacgcat acaggtgcat cgcggggtca ggccgaatgg tcgcccgtgg 1080 ggttccaggc ttatcgcggg taagtatgac tacctgggtc acgacagtca cgcgacgctc 1140 ggtaaaatgg tgggtatcgg taagtaactc cgtcgcgtca ttgggcgggc tttcgtttcg 1200 ttttttaaca cacacacaca cacaca 1226 // ID Gulliver repbase; DNA; CHL; 7144 BP. XX AC . XX DT 13-MAY-2004 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gulliver is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Interspersed repeat; KW 8-bp TSD; hAT superfamily; Gulliver. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1..246 6675..7144 RA Ferris P.J.; RT "Characterization of a Chlamydomonas transposon, Gulliver, RT resembling those in higher plants."; RL Genetics 122(2), 363-377 (1989). XX RN [2] RP 1-7144 RA Kapitonov V.V. and Jurka J.; RT "Gulliver, a family of autonomous hAT transposons from the green RT algae genome."; RL Repbase Reports 6(5), 227-227 (2006). XX DR [1] (Consensus) XX CC It is a consensus sequence of the Guliver hAT transposon. It was CC identified cytologically as a mobile element by Ferris, P.J. CC (Genetics, 1989; he reported only sequences of 246-bp 5'- and CC 370-bp 3'-terminal portions). It was suggested [1], based on 8-bp CC TSDs, that Gulliver belongs to the hAT superfamily. Final support CC of this classification came after identification of a transposase CC encoded by Gulliver [2]. The exons coding for the transposase CC have been predicted by using FGENESH and comparisons with known CC hAT transposases. XX FH Key Location/Qualifiers FT CDS join(778..822,2970..3014,3238..3414,3706..3891, FT 4055..4273,4444..4684,4781..4853,5052..5183, FT 5345..6536) FT /product="Gulliver_1p" FT /note="hAT transposase." FT /translation="MEKRGTKLGANDNVFHPGAKASVWTKGGNKVMNMTLF FT NKHLLSKNHLDVIGAAKNHTKMSNIVEKQKVKLNPTLCSLIRSAYACGKDA FT MPLTHYVKMVKLQAANCADCKHDCKAHTCRKCDASKACGSGITISGPYHTA FT EKASEMLACLSEAISEEQLKNIRASPVISMMIDESTDRTVSHNLAVYITYV FT APDDSIKTEFLQLEAMNNGATAVNIYDRLKEVFTESKIDWSKLVAFTSDGA FT NVMVGKHCGVATRIKTDWPCVLTSHCAAHRLALACADFFKEFPALVKVDNM FT LSKIYNYAKTSTVRTAALNDMYKERKAKAYKILKPHTVRWLSRSECVRRIK FT ITYPILLAFFNERKKDKKDVAAAEIYEWLRQVDNLLLITCIDGVLAATAEL FT SKWFQQSDLALVDVHQYLELGLRHLINAYTYHKEGETSNAPPSFTAPIQAL FT IKDLAAKDGVFHGHQMVLTTPPTTIAPATTTTATTTAESDSDNDGSDTNTD FT TTAESGSDDDDTAPTAPSWAAFPQTLFRSKLALSSMVKKLVENIKMRFPAD FT VSVAAKFGVLGPRALAADSGVPKYGEEEVAALAQHFKPVLGDDCLLAVDQW FT VMARARLIAVAKEQKKSGDVMKARPFYERLLSWLSGMGRELTVLVQIMLVL FT QPSTAEVERGFSAMNDIKTPGRASMKLGTLDVLMRVRLVGPPIAYQQRPVA FT GVSLAPYAEFDATLLGPAVQKFAAKLGRVPQRSSHNARPSRVKHRVCEIDV FT KALLKEAEEEAVQNVDVSC" XX SQ Sequence 7144 BP; 1500 A; 1876 C; 2203 G; 1562 T; 3 other; cagggctcct atcttaatgt ctccagacat taaacggcca ttttggccat tttccagaca 60 aacggagggg ggggttcacg cacgcttttg aacaaaacaa gcggtgtctg aggagaggca 120 aactctacca tagtgacata tatattttgt agaaagtgag ggaatgtcat ggtcctttag 180 gagattttcg gcgatatgac gaggatgaag gacaccccga tcaagtcttc ctttttgcag 240 atgcgcagag ccggtgacga ggatttgccg gggatgccgc attttcagtc acgcagagtt 300 gtcaggggtg caaagcccgc tgcaactcct tccacagtcc atggtccagc taacccgcgc 360 acagtgaaga aaagctgaga cgatagaggt caggaatgaa cagaagtcag ggatggaccc 420 cgtgcacggg cggaaccgtg gagctgcgga ggggttgaag aaaacaggca gggacgaggc 480 gcgcgggaga gggggtatac agcagccttg acagcagctt cgtattggag tgcactgaac 540 cactcgcact cagggcgggg ctgctgtcaa gctcaaccat gctactcctc catgcgtagg 600 ggaatcaaca agaacgggac ctgggaaagg acctgggaaa ggaccgggga aaggaccggg 660 gaaaggacca gggaattgac cgaggaatgg ggagccatca cgggacattg actaggacac 720 agtgatatta ggcatgcaac aggcaacaca ttacattctg ccgtcggcac acaacaaatg 780 gagaagcggg gcacaaaatt aggagcaaac gataatgttt tcgtgagtag ctgctgggct 840 acagtgtcaa tagtgtttta gtgcatctgg aagctgatcc ccgttcagcg acggggtctt 900 cggagcagag taccgaccgc ttactgggtc ggctgtaccg gcccgtccgg ccggcgtgtg 960 gtgacgttct ggtcttcttg tggtcccgga gcccgttggc ccccgacgac tcttcctcaa 1020 cgtgactttt gcacgggtgc tagaacggtc ctctacgctg ggcatgcatg gggatttggg 1080 ggtgggttca gctggcgccg gattcccatc tcaactagac accgctcccg actagggggc 1140 aagttggcac gcggaggtgg ggaccgcatg ctaccatagt gcgggatatg atggtgcggg 1200 agatgatgtt gttgcaaacc ttgtactgct agggacgcat gcagggcgtc aagcattctg 1260 gagggcgcgg gcgatgcaag aggcatgggc tggcattgcc gggcgtgtga ttttggacgg 1320 ttgcatgtgc ggcctggtgg gcagcggctt ggaatgccct gtctgcggtt tctccgcctg 1380 acaacatgcg tgtgtcatgg caaaccttgt actgctaggg acgcatgcag ggcgtcaagc 1440 attctggagg gcgcgggcga tgcaagaggc atgggctggc attgccgggc gtgtgatttt 1500 ggacggttgc atgggcggcg tggtgggcag cggcttggaa tgccctgtct gcggtttctc 1560 cgcctgacaa catgcgtgtg tcatggcaaa ccttgtactg ctagggacgc atgcagggcg 1620 tcaagcattc tggagggcgc gggcgatgca agaggcatgg gctggcattg ccgggcgtgt 1680 gattttggac ggttgcatgg gcggcgtggt gggcagcggc ttggaatgcc ctgtctgcgg 1740 tttctccgcc tgacaacatg cgtgtgtcat ggcaaacctt gtactgctag ggacgcatgc 1800 agggcgtcaa gcattctgga gggcgcgggc gatgcaagag gcatgggctg gcattgctgg 1860 gcgtgtgatt ttggacggtt gcatgggcgg cctggtgggc agcggcttgg tagaggctnc 1920 atncngcccc caccagcgac tcccgaagct tccagcgctg taatcacatg gtaccatgtg 1980 tttgcgcgcg cgttctacac gccaggcata ccgcatcaac ctacaggccc cgtgcggatg 2040 ggcgcgccgc ggccctgtcc ctgtttatgg ttcctgacat tgcctgagca cccagtgccc 2100 ctgccggcgg cagaggctgc attccgcccc caccagcgac tccgaagctt ccagtgctgt 2160 aatcacatgg tacaatgtgt tggggcacgc gttctacacg ccaggcatac cgcatcaacc 2220 gacaggcccc gtgcggatgg gcgcgccgct gccctgtccc tgcttacggc tgccagcagc 2280 agtggctgca tacggcccac accagcgact ccccaagctt ccagcgctgt aatcacatgg 2340 taccatgcgt ttgcgcgcgc gttctacacg ccaggcatac cgcatcaacc tacaggcccc 2400 gtgcggatgg gcgcgccact gccctgtccc tgtttacggt tcctgacatc ggctgagcac 2460 ccagtgcccc tgccggcggc agaggctgca taccgccccc accagcgact ccgaagcttc 2520 cagtgctgta atcacatggt acaatgtgtt ggggcacgcg ttctacacgc caggcatacc 2580 gcatcaaccg acaggccccg tgcggatggg cgcgccactg ccctgtccct gtttacggtt 2640 cctgacatcg cctgagcacc cagtgcccct gccggcggca gaggctgcat accgccccca 2700 ccagtgactc cccaagcttt cagcgctgta atcacatggt acaatgtgtt ggggcacgcg 2760 ttctacacgc caggcatacc gcatcaaccg acaagccccg tgcggatggg cgcgccgcgg 2820 agacactcca cctgcttgca gattggaata atgaatcagt gcgtattgcg aagttcgtca 2880 acatcaagaa caccagccgt tggaatggga tcgggcacaa tttcgtcagg actcctgagt 2940 gcacgttctt gtgccttgcc tgcatgaagc accccggtgc gaaggcctct gtttggacca 3000 agggcggcaa caaagtgagt gtgcttgcaa ccatgtagcc tggcaatgca gcgtgtgagc 3060 cggattagga gagactaggg gtgtgcgcat actgcacgca agcaagcgca tgcactgcgg 3120 cgcaccgtct ggggtaccca tttacagctg cagcctgaca gctaagacag gacctgctac 3180 ccagccgcct ccccgcatgc ttgcgtaagc ttgcaaatcc cttgtgttct gctgcaggtc 3240 atgaacatga ctttgttcaa caagcacctt ctctccaaga accatctgga cgttataggc 3300 gcggctaaga accacacgaa gatgtccaac attgtcgaga agcagaaggt caagctcaac 3360 ccaacgctgt gcagcctaat ccgttcggct tacgcctgcg gcaaggatgc catggtgcgt 3420 ttgtgagcgc ggcgcgtggg caattgtgca tggcgcacat gtgggcgcgg cgcgcgtgcg 3480 tgctggtgtg tgttcgtttg gtgtgttggt gtgtggtggt gtagcaacgt ttgtggtgtg 3540 ttggtgtgtg gtggtgtagc aacgtttgtg gcacatttgg taagcggtac tagtgggcgt 3600 agtgtgctgt agtgtatgca tacggcatgt acaacaggca tggcgttcgg ctgcatcttt 3660 atcggctgac acgttacaac gtgcaccctt cctgcttgct cgcagcccct gacccactat 3720 gtgaagatgg tcaaactcca ggctgcgaac tgcgctgact gcaagcatga ctgcaaggcg 3780 catacgtgcc gaaaatgcga tgcgtccaag gcatgcggca gtggtatcac tatttccggt 3840 ccataccaca ccgcggagaa agcgtcggaa atgctggctt gcctgtctga ggtaagacct 3900 gaacatatgc agctgtggcg ggagcttacg ggattgggtt cgtttttggc gtgatggtat 3960 gccattgctt ggatgctggt gctgtccttg tatcataccc tcgccaagta taccggtgcg 4020 catgctaacg actgacaacc cgcgatctgc gcaggcaatt tcagaggaac aactcaagaa 4080 catcagggca tctcctgtca tctccatgat gattgatgag tctactgacc gcactgtttc 4140 gcacaacctg gccgtgtaca ttacatacgt ggcgcccgat gacagcatca agactgagtt 4200 tctgcaactg gaggcaatga acaatggtgc cacggccgtg aacatctatg acaggctcaa 4260 ggaggtgttc acggtgcgtt cttaacagtg tgaacacgtg gccttctcga gcagtagatt 4320 gtgcggttac gcagggacag atcatgcatg tagagccacg gacgtgcgga cagtgtcggc 4380 tgtgtgtgct gagctgtgtg ttctgctccc ccgcgtgtgc ctgctgcaac caattacgta 4440 caggagagca agattgattg gagtaagctt gtcgccttca ccagtgacgg tgccaacgtt 4500 atggttggga agcactgtgg agtggcgacc cgcatcaaga cggattggcc ctgcgtcctg 4560 actagccact gcgctgcgca ccgactggct ctggcctgtg cggacttctt caaggagttc 4620 cctgccctgg tcaaggtgga caacatgctc tctaagatct acaactacgc caagacctcg 4680 acaggtgcgc ttgggcgcgc tagtggttgt tgtgtggcga gttgtgagtg aatggcttct 4740 ttgatgtatt gctgtaactt gtgcacaccg tgtcctacag tgcgaaccgc tgctctgaac 4800 gacatgtaca aggaaaggaa ggcgaaggca tacaagatcc tgaagccgca cacgtaagcc 4860 gcacacgtcg tgccttgcta cctgctgaca tgtgcgctgg tgtgtgggtg taagtgtgta 4920 tgtgggtgtg tggacggcgg tagcctagcc tgtggagtag ggtgagacgg cttcacaagc 4980 tgcgtgctac ctgctgctga ggctgagggg ttgatataca tcccgcttgc acactgcgca 5040 aacttacgca gggttcggtg gctttcccgg tctgagtgtg tgaggcggat caagattacc 5100 tacccgatcc tgcttgcctt cttcaatgag cggaagaagg acaagaagga tgtggccgcg 5160 gccgagattt acgagtggct caggtgtgtg gatgccctgg gatgtcagga atgcaagcgt 5220 gtgcaagtag gcgtgggcgc gcactcgtgc tctagtgcgt atgtccgctg catgtgtgtc 5280 cgtgactgcg ttcctgtctt ggactaaaac gcgatcacta ccgcatgcta tttcacacct 5340 gcaggcaggt ggacaacctc ctgctcatca cctgcattga cggagttctg gctgcaacag 5400 cggaattgag caagtggttc cagcagagtg atctcgccct cgttgacgtg catcagtacc 5460 tggaactagg gctacggcac ctgatcaacg cctacaccta ccacaaggag ggcgagacgt 5520 cgaacgcacc tccatccttt acggcaccca tccaggccct gatcaaggac cttgcagcga 5580 aggatggcgt gttccacggc caccagatgg tgctgacaac ccctcccacg acgatcgcgc 5640 ctgccaccac caccaccgcc accacgacag cggagtctga cagcgacaat gacggctctg 5700 acaccaacac tgacacgaca gcggagtctg gcagcgacga tgatgacacg gcacctacgg 5760 caccgagctg ggcggctttc cctcagactc tattccgcag caagcttgct ctcagcagca 5820 tggtcaagaa gctggttgag aacatcaaaa tgcgattccc cgctgatgtc tcagtggctg 5880 caaagtttgg ggtgctgggc ccccgtgcat tggctgctga ttccggtgtg ccgaagtatg 5940 gcgaggaaga ggtggccgct ttggcacagc actttaagcc cgtcttgggt gatgattgcc 6000 tgttggcagt tgaccagtgg gtgatggctc gggcgcgcct gattgcagtg gcaaaggagc 6060 agaagaagag tggtgatgtt atgaaggcga ggccgttcta tgagaggctc ttgtcgtggc 6120 tgtctggcat gggccgtgag ctcactgtcc tggtgcagat tatgcttgtg ctgcagccat 6180 ctacggcaga agttgagcgt ggcttcagcg cgatgaatga catcaagacg cctggccggg 6240 caagcatgaa gcttggcacc cttgacgtac tgatgcgtgt gcgccttgtg ggcccaccaa 6300 ttgcgtacca gcagcggcct gtggctggtg tctcccttgc cccgtatgct gagtttgatg 6360 caacgctgct ggggcctgca gttcagaagt tcgctgcaaa gcttggccgc gtgccccagc 6420 gcagcagcca caacgcccgc ccgtcccgtg tgaagcatcg tgtgtgtgag attgatgtca 6480 aggcacttct gaaggaggct gaggaggagg ccgttcagaa tgtggatgtc tcgtgctaat 6540 aatttggcat acgaacgttt gaattctttg aaagcggtcc atctagcaaa tacatacccg 6600 atacgattgt gtatgattga aacgttgaaa gaagggacga tagtgtattg gggattcgat 6660 gcacgagggt gtgtgccgac gtgtgaagat gactaaactg gaagcggtta ggccgcgatg 6720 gcgctctgcc tgtgtgtatg tgcggtagaa ctgtgactgg tcatggcttt ggccccggca 6780 cttcggtgac agcaagcagt tagcaggtat agcatgtacg ccgcgcaggc aagcgaaggc 6840 tcagagttgc ttggggaaac gcgctgcgac atgcctgtgc atatgatcgc atttggactt 6900 gttcattagt gttgcactgc gtgaaacaga tacaaaagcc gccgtttaca gttgcgaaaa 6960 agcgaaattg caacatgtca agatgtctag acattttaca tggcttccac aaaggcactt 7020 tgtctagaca aaaaaatcgt ggatttgggg gcctgtttgt ctaccccctt gacaaaaatt 7080 tccgctccag aggccaggaa tgtctgggtc aagacattcc cccgcaaaca agatacgacc 7140 cctg 7144 // ID Copia1-I_CR repbase; DNA; CHL; 6068 BP. XX AC . XX DT 13-MAY-2004 (Rel. 9.04, Created) DT 13-MAY-2004 (Rel. 9.04, Last updated, Version 1) XX DE Internal portion of the Copia1_CR retrotransposon. XX KW Copia; LTR Retrotransposon; Transposable Element; 5-bp TSDs; KW Copia1-I_CR; Copia1-LTR_CR; COPIA superfamily; endonuclease; KW protease; reverse transcriptase. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-6068 RA Kapitonov V.V. and Jurka J.; RT "Copia1_CR, a family of copia LTR retrotransposons present in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(4), 102-102 (2004). XX DR [1] (Consensus) XX CC Copia1-I_CR is an internal portion of the Copia1-CR LTR CC retrotransposon, whose LTR is deposited in Repbase as CC Copia1-LTR_CR. Copia1-I encodes a 1627 Copia1_CRp polyprotein CC (pos. 435-5315) composed of the C2HC zinc finger, protease, CC endonuclease, reverse transcriptase, RNase H. The PBS is CC complementary CC to an internal portion of tRNA Met. XX FH Key Location/Qualifiers FT CDS join(435..2084,2088..5315) FT /product="Copia1_CRp" FT /translation="MAGKLIKPSQVSELRAGDKDAFWDWVSNLKIVLKNNR FT MLDVVMPGIGVDGNGNFDAEDLKLDADARDFIESKVDSTLRLALTNAGTAK FT AAYDTIYAKFNPATVARKAELSRQLNMLRMQRGESVAVYCARARKLAHELL FT ELEIIMKELTEGEDILAASESSETNLVMWVLTGLPESWENFKDRMMDSASS FT TGDLGVTMDQLQSKMENREQRYQDGSGVGVSASAVAAIQVPPAPIYPAAQS FT QEDQVAVIAAALNRLGFQPRGAGHGGNNGGGDRPRKDMLRVKCYRCKQFGH FT YADKCPQGQQQQGQLGLGGGAGGSGSGQVPPLVAALAQLQGLYAYMGGAPQ FT KAAPMMQQPAQTPQVAAMAAAAPAVAAPAAAAAAAATGVAPQQGGAERVRT FT YRMFMMMAQSQAQTLPHGPSDWLLDSGCSSPVSYDPSDLFDLRPLLEPTFV FT QVADGRPCAATHIGNMILRTSLGNAEADAMMKDVLLVPELGVKALSISAMA FT RAGVETTMGNAGAVCKSRDGEVVMRGRLCPRSGVYVISGRALQPSAEQWAQ FT FQQXRGDPSPPPHQEDQAVGGAAGCWAVVDRSGELWRELQAARSRADMLAV FT DRIHRCFGHLGHTRMLQLPDVVSGLPVTRSQIQLWAATRGRCDACGLGKSA FT RRRLSRQDQPRGATAPLQLVHTDVCGPLPAAACGSKYIVTVVDEYSKYSVL FT RFTRDRETVPYVLQSVLVLMETQCAPLRVKEIRSDRGGEYVGSVLADFCAA FT KGILQNFSAPYTPQLNGVAERLNQSLQRAGRAMLVDSKLDVKYWRLAAEAA FT NAVRLRSPVRGLAATPHQLFTGVKPDASHMHVFGSKVYVHTPEPQRDHKFS FT PTGRAGRFIGWEPDGNYRVLVDSKLVVSRDVVFDEDRAANAGEPATATAAA FT SAPPLDLRVDSSDSDNDSPPAEAAGVSGGPPAGQSNEQGRRNTSQQGAREQ FT REAAPSAQELHVERQPGSGEAAAEASTSAVASEPHARRRRRRKPLKPPAQG FT GAAAGEATAAGGAPNLAAATPPDKAERGGGVPATALPPQPRRSDRSRAAPV FT RLGFSAIVVVRGALVQVLAAAVKGPVGQDLPGGDEPTFEQAKQRPDWPLWQ FT AAIQEELDSLRSLEVFQPAHPPAGAKALAVRWVLKVKRDANGKAARWKARL FT VVKGYQQRPGRDFGETFAPTGRLASLRFLLAIVAARNLELHQIDVKTAFLH FT GRVEEEIYADLPPGCDLGDSELKLLLRKSLYGLRQAPRAWYGKLKETLERM FT GFVESDADPGLFVLWAEQGILFVLVHVDDMLMAAELLPLMDSTKKALNGHF FT AITDLGEASFYLGMRIDRDRANRTIKLSQERYTLDLLEAHGMSESKPVDTP FT MTVGTPLMRADKGGGELLGLGEAQRYRGIVGSLLYLANCTRPEISLAVGTL FT ARFMAEPTEEHLKAAKRVLRYLKGTPGTGLVYGGHGLVLAGYAEPPSDAEL FT SAESAVELKLAGYADADYAGCPNTRRSTTGYVFTLNGAAISWQSKLQPTVA FT CSTVEAEYMAAGSATREALWLRKLGGDFGVLGSGPTVLLGDNQGAISLTRN FT PLHSAQTKHIDVMYHFVRERVAMGDLAVEYVPTSEMVADVLTKPLDAVKFK FT KFRGELGLK" XX SQ Sequence 6068 BP; 1100 A; 1660 C; 2241 G; 1067 T; 0 other; ggttatgagc cccgtgtacg gcatttccgg cacggttatc ttcagaggtt gaggtcgtca 60 gttcaatgcc tggacgctac gttaggggcg gggtagcctc ggagtagttt ggcaatgtgt 120 aagtgctgcg gataggcaag agtgccttgc agcaggacac taataaagtg acgatgagac 180 gctgaggacc cggatcgaat ggtgtatcgg tcgcgggggg gtgttgtata tcgacctcta 240 ccccattctc gctatatcag gggatggaat gactcgcaag taataggctc tgttgagccg 300 gcgcttttgg ggtcattcca gaatacaaca ggttatgagc cccgcgctct tgcggtggaa 360 tagcgatatc tctcttttga ttacttttcg ggcttcagcc taaatctaga gtagtgtagc 420 cgccttcaac aaccatggcc ggcaaactca ttaagccgtc gcaagtctcc gagctgcgtg 480 ctggggacaa ggatgccttc tgggactggg tgtcgaacct taagatagtg cttaagaaca 540 atagaatgct agacgtggtc atgcccggaa ttggagtgga cggcaacggc aacttcgacg 600 cggaggacct gaagctcgat gcagatgctc gggacttcat tgagagcaag gtggactcaa 660 cgcttcgcct ggccctgacc aatgcgggca cggccaaggc agcttacgac accatctacg 720 ccaagttcaa cccggccacc gtggcgcgca aggccgagct gtctcggcag cttaacatgc 780 tgcggatgca gcggggcgag tccgttgcgg tgtactgcgc tcgtgcccgc aagctggctc 840 acgagctgct ggagctggag attatcatga aggagctgac ggagggcgag gacattctgg 900 cggcgtcgga gtcctctgag accaatctgg tcatgtgggt gctgacgggc ctgccggaaa 960 gctgggagaa cttcaaggac cggatgatgg actcagctag ctccactggc gacctgggcg 1020 tcaccatgga ccaactgcag agcaagatgg agaaccggga gcagcgctac caggacggtt 1080 caggagtggg cgtgtcggcg agcgctgttg cggctatcca ggtgccgccg gcacctatct 1140 acccggcagc gcagagccag gaggaccagg ttgcggtgat tgcagctgca ctgaaccgcc 1200 tgggcttcca acctcgcggg gccggccacg gcggcaacaa cggcggtgga gaccgccccc 1260 gcaaggacat gttgcgggta aagtgttacc gctgcaagca gtttgggcac tatgccgata 1320 agtgtccgca gggacagcag cagcagggtc agctggggct gggcggtggt gccggcggca 1380 gcggatcggg acaggtgcct ccactggtcg cggcgctggc gcagctgcag gggctgtacg 1440 cctacatggg tggtgcgcct cagaaggcgg cgccaatgat gcagcaaccg gcgcagacac 1500 cgcaggtggc agcgatggcg gcggctgcac cagctgtggc cgctccagcg gcggcagcgg 1560 cggcggcggc gacgggggtt gccccgcagc agggcggcgc tgagagggtg cgaacgtacc 1620 gcatgttcat gatgatggcg cagtcgcagg cgcagactct gccgcacggc ccatctgact 1680 ggctgttgga ctcgggctgc tccagccccg tcagctatga ccccagcgac ctgttcgacc 1740 tgaggccgct gctggagccc accttcgtgc aggtcgctga cggccggccg tgcgctgcca 1800 ctcacatcgg caacatgatc ctgcggacca gcctgggcaa cgcggaggcg gacgccatga 1860 tgaaggacgt gctgctggtg ccggagcttg gcgtcaaggc gctatcaatc agcgccatgg 1920 cgagggcggg cgtggagacc accatgggca acgctggagc cgtctgcaag tcgcgcgacg 1980 gtgaggtggt gatgcgcggg cggctgtgcc cgcggtcggg cgtgtatgtg atcagcgggc 2040 gcgcgctgca gccgtcggcg gagcagtggg cgcagttcca gcagtgacgg ggtgacccgt 2100 cgcctccgcc gcatcaggag gatcaggcag tgggcggagc ggcgggttgc tgggctgtgg 2160 tggacaggtc gggagagctg tggcgtgagc tgcaagcggc gcggtctcgg gcggacatgc 2220 tggcggtgga ccgtatccac cgctgctttg gccacttggg acatacgcgc atgctgcagc 2280 taccggatgt ggtgtcaggc ctgccggtga cgcgcagtca gatacagctg tgggcggcca 2340 cgcgtgggcg gtgcgacgcg tgtgggcttg gcaagtcagc gcggcggcgg ttgagccgcc 2400 aggaccagcc gcggggtgcg acagcgcccc tgcagctggt gcacacggat gtgtgcgggc 2460 cgctgccggc ggcagcatgt ggcagcaagt acatagtcac tgtcgtggat gagtacagca 2520 agtactcggt gctgcgcttc acgcgcgacc gcgaaacagt gccttatgtg ctgcagtcag 2580 tgctggtgct catggagacg cagtgcgcgc cgctgcgggt caaggagatc aggtcagacc 2640 ggggagggga gtacgtcggc agcgtgctgg cggacttctg cgcagcgaag ggcatcctgc 2700 agaacttctc agcgccgtac acgccgcagc tgaacggagt agcggagcgg ctgaaccagt 2760 cgctgcaacg ggcggggcga gccatgctgg tggatagcaa gctggacgtc aagtactggc 2820 gactcgcagc tgaggccgct aacgcggtgc ggctgcggtc gccggtgcgt gggctggcag 2880 cgacaccgca tcagctgttc acgggtgtca agcccgacgc ctcgcacatg catgtcttcg 2940 gcagcaaggt gtacgtgcac acgccggagc cccaacgcga ccacaagttc tcgccaacag 3000 gccgcgccgg tcgcttcatc gggtgggagc cggatggcaa ctaccgtgtg ttagtggaca 3060 gcaagctggt ggtgtcgcgg gatgtggttt tcgatgagga cagggctgca aacgcggggg 3120 agccggcaac ggcgacagcg gcggcgtcag cgccgccact ggacctgcgg gtagacagct 3180 cagacagtga caacgacagc ccgccggcgg aggcggcggg tgtgtcaggc ggtccgccgg 3240 ccgggcagtc gaatgagcag gggcggcgca acacgtcgca gcagggcgca cgggagcagc 3300 gggaagcagc tccgtcggcg caggagctgc atgtggagcg acagccgggc tctggagagg 3360 cggcggcgga ggcatcaacc agtgcggtgg cgtcggagcc gcacgcgcgg cggcggcggc 3420 ggcgcaagcc gctgaagccg ccggcgcaag gtggagctgc agcaggagag gcaacggcag 3480 ccgggggagc gccgaacctg gcggcggcga cgccgccgga caaggcggag cgcggcggtg 3540 gtgttccagc caccgcgctg ccccctcagc cgaggcgctc agaccgcagc cgcgccgcac 3600 cagtgcgcct aggcttctca gccattgtgg tggtgagggg agcgttggtg caggtgcttg 3660 cggcagcagt caaggggccg gtagggcagg atctgccggg cggcgacgag cccaccttcg 3720 agcaggcgaa gcagcgcccg gactggcccc tgtggcaggc agcaatccaa gaggagctgg 3780 actcgctcag gtcgctggag gtgtttcagc cggcgcaccc gccagctggc gccaaggcgc 3840 tggccgtgcg ctgggtgctc aaggtcaagc gtgacgccaa cggcaaggcg gcgcggtgga 3900 aggcgcgcct ggtggtgaag ggctatcagc agcgaccagg ccgggacttc ggggagacat 3960 tcgcgcctac cgggcgcctc gcgtcactcc gtttccttct ggcaatcgtg gctgctcgca 4020 acctggagct gcatcagatc gacgtcaaga ccgccttcct acacggtcgc gtggaggagg 4080 agatctacgc ggacctgccg ccgggctgcg acctaggtga ctcggaactc aagctgctgc 4140 tgcgcaagtc gctgtacgga ctacggcagg cgccgcgggc ctggtacggg aagctcaagg 4200 agactctgga gcgcatgggc ttcgtggagt cggatgccga tccgggtctg ttcgtgctgt 4260 gggctgagca gggcatcttg ttcgtgctgg tgcacgtgga cgacatgctg atggcggcgg 4320 agctactgcc gctgatggac agcaccaaga aggcgctgaa cggtcacttc gccatcaccg 4380 accttggcga ggcgagcttc tacctgggca tgaggatcga ccgcgaccgc gccaaccgca 4440 ccatcaagct gtcacaggag cgctacacgc tggacctgct ggaggcgcac ggcatgtcgg 4500 agtccaagcc ggtggacacg cctatgactg tgggcacgcc gctgatgcga gcggacaagg 4560 ggggcggcga gctgcttggc ctgggtgagg cgcagcgcta ccggggcatc gtgggcagcc 4620 tgctctacct agccaactgc acgcggccgg aaatctctct ggccgtgggg acgctggcgc 4680 gcttcatggc tgagccgact gaggagcacc tgaaggcggc gaagcgggtg ctgcgctacc 4740 tcaagggcac gccggggacg gggctagtgt acggcggaca cgggctggtg ctggcgggct 4800 atgcggagcc gccatcggat gcggagctgt cagcggagtc ggcggtggag ctgaagctgg 4860 cggggtacgc tgacgcggac tacgctggct gccccaacac tcgccgctcg accactggct 4920 acgtcttcac gctcaacggc gccgccatca gctggcagtc caagctgcag cctactgtgg 4980 cgtgctcaac tgtggaggcg gagtacatgg ccgccggttc agcgacacgt gaggcgctgt 5040 ggctacggaa gctgggcggt gacttcggtg tgttgggctc agggcccacg gtactgctgg 5100 gcgacaacca gggcgccatc tcgctgacgc gcaacccgtt gcactcagcg cagaccaagc 5160 acatcgacgt gatgtaccac ttcgtgcgcg agcgtgtggc gatgggcgac ctggcggtgg 5220 agtacgtgcc gacgagtgag atggtggcgg atgtgctgac gaagccgctg gatgcggtga 5280 agttcaagaa gttccgtggg gagcttgggc tcaagtgaag cgtgttaggt tagggttgat 5340 ttctgtagag ttggaggtta ggtgatatgt tgctggtact gcaattggac acggacggtt 5400 gttacggacg ttagtcgtga tatgggaata gggacgtaat tggtggatgg aacgggctaa 5460 tgactaatga caattgcggc ggaaaccatg cggattgact tgacccaaca ttcatttgac 5520 ggacttgcag tgtgatcgca gcgcggacgc gcgcacagtg tgatatgctt ctgacggatt 5580 ggtgttgaga gcgttacgtg tgggcacatg cgtgaatgcc attattctta ggcgtgctta 5640 gtgcatgctt cacgtggccg agggttgtcg gctggcagca gtatgtttcg agctggttgt 5700 ggctgacttg ctgttgtgca tatgaaatcc acgtggtacg gtgcggtgtg ctatgcccaa 5760 acagagacat gcagggcgaa ctgcatggcg tgagactact gggctgtcat gagtagtcgc 5820 ccaaacagag gcgcacgggg ttaagcgtgt gacgaatgga agatatgccg ggtgcctgat 5880 cagctggagg tatgtcgagg ctgtagcggg tggtttaact cgttgcggta acggtagtga 5940 cgtggtgtag cgttgctgct tgagctggtg cgctggctgt gcgctggatg gctcaatggt 6000 gatttgcgat aatatttcag gttacactta gtaatgtggc tgcgtggaat gggcagcccg 6060 cgggggag 6068 // ID TE2-4_CR repbase; DNA; CHL; 1022 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE TE2-4_CR is a family of nonautonomous transposable elements - a DE consensus sequence. XX KW Transposable Element; AAC target site duplications; TE2-4_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1022 RA Kapitonov V.V. and Jurka J.; RT "TE2-4_CR, a family of nonautonomous transposable elements in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 139-139 (2004). XX DR [1] (Consensus) XX CC TE2-4_CR is a family of nonautonomous transposable elements. CC The consensus sequence is less than 5% divergent from TE2-4_CR CC copies. Classification of this family is not known. It is CC possible that it belongs to a novel superfamily. Features of this CC superfamily include preferential insertions into AAAC targets CC that are accompanied by the AAC target-site duplications, a CC 18-21-bp CC palindrome separated by 3-5 base pairs from the 3' end; and the CC AGCC CC 3' termini. XX SQ Sequence 1022 BP; 154 A; 273 C; 337 G; 246 T; 12 other; ctccccgctc cgcagcgcga tggctcgccc tttgtgcgga tttctcgctt gaggggaagt 60 gcgcaggcga ggcttcagcg ctagrrgcgt gtagccgagg ctggctgtga gcgcaaggtg 120 ttrgatagcg atccctggat tgacagacgt gcaccagagc attcaagcgg ggaaagttac 180 agcgtgctga gctggtaggc gtgacgaggc aggagacagc acaagtgtgc gctgcgcgcg 240 cagggggggg gcggaaatcc aagttagtgc gtgtgttttg ggtgggtaca aatctaggat 300 ctaggatgct gacacgccct agcgctggtg ctgcgcttcc tgctgttgcc tgtgcgtgca 360 ctgcacttgc mracttgcta tgcgacgtca gctatggtgg gggctggagg cgagattcag 420 gcgcattcac acgcccaggc ggcggcccac gggcaacatc gctctgctgg gtcgcctcgg 480 ccttgcaatt tcgcgctcgc gctgcgcttc ctgcccttgc ctgtgcgtgc actgcacttg 540 tcatctatgc cacgttggct atggtagggr ccgggggcga ggttcaggcg cgttgcagca 600 gcacggttgc ggctccacac gccccgkcgg cggcggcggc ggcccacggg caacaccgct 660 ccgctgggtc gacattcttc gacttcggct tggctttggc tttggctttg gctttggctc 720 tggctttggc tttggcttgc ggctttggct taggctaggc ttgtggcttc ntgcctttgg 780 cttcgctggg ctttggcttg gtgatgggtc cacatcaagg ccttggactt ggcctgggag 840 gtgcgcgttt tctaccagtt tttctgtgtt gcatgttttc aaaggtttct tsgcttggca 900 aatacaccaa agctgccgct ayttgcatac angctttcac gaaaacgcaa aaacagmgaa 960 caaccctaat gtatggttct ggtcgaggac taaggctgtc gccccccttt ggggggcgag 1020 cc 1022 // ID Gypsy-9-LTR_CR repbase; DNA; CHL; 3195 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-9-LTR_CR is a long terminal repeat of Gypsy-9_CR LTR DE retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 4-bp TSDs; Gypsy superfamily; Gypsy-9-I_CR; KW Gypsy-9-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-3195 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-9_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 6(5), 242-242 (2006). XX DR [1] (Consensus) XX CC It is a long terminal repeat of the Gypsy-9_CR LTR CC retrotransposon, CC whose internal portion is Gypsy-9-I_CR. XX SQ Sequence 3195 BP; 673 A; 931 C; 950 G; 641 T; 0 other; tgtaggtgca aagcacgcgg cacctcctgt cagcgccagt acatgttttg taccagtcct 60 cgccagccac gtagcccgct aaaaattgtt acgggctagc atacaagtgc gtgctggggg 120 ttccaacgcg gggtaacacg gggagatcag gatgggtgaa ggggtgaggc cgcggcaggg 180 acgggcacac gcaacgtcgg tccaacctgg aacgcgaggc gtgagcccgt gtgtactata 240 ttggcaacca agttaaatgg gcggggtaaa tacagatgga gtgcaccaag cagacggcgt 300 gagagacaag gcgaattctg gattccgcaa agcaaaccgt gaggaaaagg gctggcagta 360 cacaggccgt ttgaagattg attgattgtg ttgagctatc tggatgtaca acaagaagcg 420 agcgagtagg ggagcagtgc ggtaagctgt cggcacaaga cgaaaggcaa gcaggtggcg 480 aactccaggg cccgaagcat tacatttggt agtatgtatg caatacgtaa tcaaggtgac 540 aaaaagaagg cagtttgagc aaggagtcgc tgaagggttg cggaccgagc aagtggtcga 600 aagccgcact cggcgacgaa gggtcggctg cataaaacgg tgaatgcggt gatcagttaa 660 ggctcccagt tgctgagcag caatggctgg ctcagcaacc gggtccaggc cggagcagcg 720 acagctgagc gcggcctggg catgcggcgg gcaagacgaa ggctggcgag aggccaggtg 780 tgtcggtgtg caggtgtttg tgtgtttgac ataagggcgc gcctgaatgc gctggtcgat 840 ggtgtcggag ggcctcgcag gcctgaggaa acagactgac tgtaggagcc ctcaaggcgc 900 taggcctgag gcggtgagag tagcaggata ggtgatgatg tgatgtgcgt gttaggaggt 960 ttggcggagg cgggcgctgc gcagatggcc tgcgcggctt ggcggccact caggccagcc 1020 aagccaagca ggacaggggg caggccgggc aagccagcca ggccacaggc cacagtcctc 1080 ggcctgaacg gcaagtccca agcgccgtac gacccattga ggggagtata tataggcgct 1140 gccggcgcag tgttcgtcac ccagcagtag agttgtattc aagacttacc taaccaaaac 1200 agaataactc ttcttacaga atcgggcctg gcaagcccac cgctgtggtc atagttagtg 1260 tattgccaag acagttgtga cgctttgaat aggttacgtg tgcacgaggg aaccaagccc 1320 tctggaccaa ctttgtagcc ggccatctgc ggccaggctg accgtgtaac gtgtgcgtgt 1380 gtgtgaatca acctggtcta cagttcccct gcctcgacaa cgcctagcca tccagccggc 1440 aacgtccagt gcagctctct gatctagacc tttggaagcg ccagctatta taccccccac 1500 actctccgcc tcccccatcg tgaggtaacg tgtagggttc tgtaaacaac ctgtcgtcag 1560 ctgtgcacct gccatcttaa gggctccagc tgttatacgt agagggatta tattgtggga 1620 cgctctctgc cccccgcctg tctgcctgtt tgtggtttgt gttcgtaccc cggccgtagg 1680 tggccgcagg gtttctcttc agtttctgtt gcaccgcagg tgcccacgtt tggagtcgca 1740 tagccccgtg tcaactgcta catcgcgtgg tcgtgggtca gtggattctg acacccacct 1800 ctcaaggcct caccgtttgg tcatcccgtt cccctcagtc ttaggcataa aggaggaggt 1860 atcgtgcttc tgtgaggccc acccacacac tagggcagca agtccccgaa gccccaggcc 1920 tagcgagtaa aaggagcctc gcagcctccg ggttgactcc cgcgcgtaag acgggcaagg 1980 agccccgcag ccagtagggt cggtgtgtcg taggtaacaa gggggttgag tttcccccct 2040 gggataagca gggactgact gggataacgt cgggatatca ggttaagggt gtaggtagcc 2100 ccagggatcg ggcgtcttgc agtggtgcca tccggtgccg taatcaggac agcagttgtg 2160 aacgcgcaac caggtcagca gctattgtca gcacacgcag ggccgcgctc tgccattgtg 2220 tttcgtgtcg caggcactag cctgcgtcgg tgtcccctcc ctcgcaggca tttcccctct 2280 ctctttcgtg tgcaggttca atcttcccct gccacctaaa caggtttccg ttacaggtcc 2340 gcccccgtca gtgacggctg cgccgtcact gcaactggcc tttccggcgt caagcaatct 2400 gtcccaagtc gcccgcgcac gcctaaatta gcagcaggct cctagccact gccgtgacca 2460 gtcccctgtg cccctaccgg cgggagggtc tcgtcaagca aaaacaagtc aagacagcgg 2520 ataaccctag ccagcggaga cgcagccagg cctaaactaa gttccggtag cattctgtgg 2580 gtcgccttcc cctggccttc ctctctttcc ctctctctcc tcctacccct caagacccac 2640 ctgctctggc cccgtgcccc aatcccggcc aagccgactc ccccgaaagg atattcttcc 2700 ttccccacct ctagctccca catcgtggag tgcgcaggac aggccagcgt tgagtgagac 2760 agcgtctgcc ctctggtgcc cggtccttgc aagccgagcc cgcctgtgcg gcgccggtgc 2820 gtgtcagcac ccggcgcccc ttactttctg tatccttact tctaaccgcc tagaccaaga 2880 aaacaactaa acaagaccgt atcccagcgg cgacaacaac ttctgcggag cgcacgtgtg 2940 tggcaacacc gtctggcaac accgaccgcg tctgaccaag gcccgctctc gtcggccccc 3000 acacccgcct ccgtctccct tctcccccct tctcagttaa aacagcaccg ggcgtcatca 3060 tacgaccggt gttgggtggc tagccatctg gaagagacct gcctacgaca agcatcttgc 3120 ctgcaaaagc tgtctggctg agcgaaaaaa ttagcctggc ggctgtaggg gggctgtgcg 3180 tcgttttaac cgaca 3195 // ID Gypsy18-I_CR repbase; DNA; CHL; 6137 BP. XX AC scaffold_48; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy18-CR; KW Gypsy18-I_CR; Gypsy18-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-6137 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 98-98 (2008). XX DR Genome; scaffold_48; Positions 502446 508582. XX CC Positions [4922-5404] - Integrase core CC 'CGTAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 306..2996 FT /product="Gypsy18-I_CR_2p" FT /translation="MADPVPPGVLAGWRMCLRKYYEVYELPIPDILEDGNA FT RVLRYLASSHDFDLPPDALAAQALLHRLNAERQNFMAATTQRDDMPANSDG FT EPGARVGTAEGTGAAYAGPARTGIGGALALTGTGTAATAAVDAAAALATAP FT PPPTGLGNDAPSAHNPLSPLPPPHAELQPQLQPPRVVNASLAANPAGGTRY FT HTAASSPAHSAGSDGQDAEADPERLRQERDAALAELARVQRQQERDRATAV FT AAQTRLAEEARRLREAQENGTTAAAAADAPPRPAASAPAGPAPAAGRPGRL FT LGGIHAAPGALPIPHADRQELQQHHAHVVPQQTVQHQTAMPAASAHGAQAM FT SISQDREEGEVHHSRPAVDEPRPATRQRAEPQPRHHAGQPLRLDALNTHIN FT AFCSSDIGASAMDLSRDVAAPVLMAFLKRHMDDMGVHYEPHIDLLMKGKGG FT EWAQDPPVVRIANGDDLGFQEVQPRRVRFVDEAGALGSNGVDTVGTAAPLP FT TSAGGTLGVGAPALAGATDGTPPGSTAQMSPVTSPMRMQLHAQGQTHTAPD FT PAPAPAPAPHYRDRHDHRDEPRVVIHAFTGVLKGTITNIATWFDVLVMILS FT RRGIDPVASFLMYLGGQAQQWGLTWFRAWEREHNRQPTVDELRHDFLMQWD FT NPHLHTANAARQRLHNHEVTQTGTVSAYITQFRLIMQDIPDMHMNDRVAHF FT RAGLQKGLQAKYAFKPGSTVPWDNVYELMNHILEMTTATNFANGKTQPSVA FT AADTPASRSPSLPARGGIKKGGRARNHGNGGRAGGGGRHGGGGHGAGGHAG FT GGRGGGGGHGGGGHGGYGRGGGGYGRGGGGHDRGGHGGGHGRGGHQHGRGG FT GGGRGRGNLTAADREAMLAYLHAQREADNMQAGLNPNVRSILGM" FT CDS 3644..6070 FT /product="Gypsy18-I_CR_1p" FT /translation="MIFSKTPEEHLRHLEEVFQVLQTHNLRAKLAKCEFNK FT PELKFLGHIVGRDGLKVDPDKVRSVAEWAVPQNRKQVRSFLGLANYFRKFI FT QGYSSLVAPLTALTSEKSPWLWCDACQRAFEGVKRALITAPVLALPNTSEP FT FEVKTDASIYGTGAVLLQNGNAIAYLSHRFTPAEKNYTTTDQEALGVIHAL FT QEWRCYLEGAPDVTVVTDHQPLTYLDSMKANGLLSRRQARWIEFLQDKNVQ FT WSYQPGRINVADPLSRIYEDVVPEVPRAQLAMLVLQFRAQHPDLVLDPHVQ FT AITRGQRTRDAPPPAGTTIPADIRKQLADGYASDPKFRTRNWVAKLKKVGD FT LYYQGGQVMVPAVDTLRRSLLAEYHDGGSSGHPGHERLYEAIRRDYLWEGL FT GADCKRYAAACPDCQRNKASTAKPRGLLQPLPIAKRPWGSVTLDLITQLPE FT TTSGHTAIMVFVDRLTKMVHFAPTTNEIDAEEAAWLYLQNIFRHHGLADEV FT ISDRGPQFAGKFWPEVHKLLRTKVRLSTAFHPQTDGQTERMNRLLEETLRH FT YVGPRQDDWDECLPMIEFAINNARSKSTGVSPFELYTPYQPRTPGNLGLPP FT KHTFSVPMASSLVKHMGERLKRAIACLTSAQARQKQAADARRQDSTFAVGS FT LVLLSTKNLALKAPTGGTKKLMPKFIGPFTVLEKVGPVAYKLSLPEGYRIH FT PVFHVSLLKPYTSDGRYQPPQAPKLFDDDGNAYWEVEDIIGHRDRKLGGSR FT SKSVREYLVKWLGFGAEHNSWEPSTVLCEDELVEQCINDYLERVNLRNAKR FT QRRATGT" XX SQ Sequence 6137 BP; 1330 A; 2000 C; 1763 G; 1044 T; 0 other; agtggtatcg agttccagga ctagggatta gctactagcg agcgagcccg agcaggccga 60 ctctaatacc aaaagccggc ccggccgcca ggccagggcc cggcgccaaa ggcaaccgac 120 cttcgggcgg tcttaaaagg ctctcagcag ccataagtgg cctggtcgct ttgagcgcac 180 ctctattggc cctgcattgg cggcccgata cggataagcc tcgcctagtc gcttggtacc 240 gatcgttgga cgcacatatt gccttttggc ttgatcaaat agcttatagc cagcagcacg 300 ccacgatggc agacccagtt ccaccgggtg tgcttgccgg atggcgcatg tgtttgcgca 360 aatactatga agtctacgaa cttcccatac ctgacatcct tgaggatggc aacgcgagag 420 tgctacgcta ccttgctagc tctcacgact ttgacctgcc tcccgatgcc cttgccgctc 480 aggctctcct ccaccggcta aacgccgagc gccagaactt catggcggca accacacaac 540 gtgacgacat gccggccaat tctgacggtg agcccggcgc acgggtgggc acggctgagg 600 gcacgggcgc cgcatacgcg ggccctgccc gcacggggat tggtggcgca ctagccctca 660 caggcacagg caccgcagcg acagccgctg tcgacgctgc ggctgcatta gcaacagccc 720 cgccaccgcc cacgggctta ggcaacgacg cgccaagcgc tcacaatccg ctgtcgcctc 780 tgccacctcc ccatgcagaa ctgcagcccc agctgcagcc gccgcgcgtg gtcaacgcca 840 gcctggctgc taacccggct ggcggcacac ggtaccatac cgcagctagc agccccgcgc 900 acagcgccgg cagcgacggc caggatgcag aggcagatcc agagcgtctc cgccaggagc 960 gcgacgccgc cctcgcagaa ctcgcgcgcg tgcagaggca gcaagagcgc gaccgtgcca 1020 cggccgtcgc cgcccagacg cgcctcgccg aggaagcccg gcgcctgcgg gaagcccagg 1080 agaatggcac cacggccgcc gccgcggccg acgcccctcc tcggcccgca gcctcagccc 1140 ccgctggccc agcaccagca gccggccgac ccggccgcct gctgggcggc atccacgccg 1200 cgcccggcgc cctgcccatc ccccatgcgg accggcagga gcttcagcag caccacgcac 1260 acgtcgtgcc acagcagacg gtgcagcatc agacggcgat gccggcagcc tccgcgcacg 1320 gcgctcaggc catgtccatc agccaggacc gcgaggaagg cgaggtgcac cactcgcgcc 1380 ccgccgtcga cgagccgcgg cccgccacgc gccaacgcgc cgagccgcaa ccgcgccacc 1440 acgctggcca gccgctgcgc ctcgacgcac tgaacacgca catcaacgca ttctgcagca 1500 gcgacatcgg cgccagcgcc atggacctca gccgcgatgt ggccgcgcca gtcctcatgg 1560 cgttcctgaa gcgccacatg gacgacatgg gcgtccacta cgagccccac atcgacctgc 1620 tcatgaaggg caagggcggc gagtgggccc aggacccgcc agtcgtccgc atcgccaacg 1680 gcgacgacct gggcttccag gaggtccagc cgcgccgggt acgttttgtc gatgaggctg 1740 gggcactcgg tagcaatggt gtagacacgg tgggcacggc tgcgccactg cccaccagtg 1800 ccgggggaac gctaggcgtg ggggcacccg ccctggcggg cgccactgac ggcacaccgc 1860 ccggctccac ggcccagatg tccccggtca cgtctcccat gcgcatgcag ctgcacgcac 1920 agggacagac gcacaccgcg cctgatcccg cccccgcacc ggcaccggca ccgcactatc 1980 gcgaccgtca cgaccaccgt gatgaacctc gtgtcgtcat ccacgcattc acgggtgtgc 2040 tgaaggggac tatcacaaac atcgccacat ggttcgacgt gctagtgatg attctctcac 2100 gccgagggat cgatccagtg gcgagcttcc tgatgtactt gggaggtcag gcacagcagt 2160 ggggactgac gtggttccgc gcgtgggagc gcgagcacaa ccgccagccc acggttgacg 2220 aattacgcca tgatttcctg atgcaatggg acaacccgca cctgcacact gccaacgctg 2280 cacgacagcg cttgcacaat cacgaagtga cgcagacagg cacggtgtcc gcgtacatca 2340 cacaattccg cctcattatg caagacattc cagacatgca tatgaacgac cgcgtggctc 2400 atttccgcgc cggcctccaa aagggcctgc aagccaagta cgcattcaaa cccggctcaa 2460 ccgtgccgtg ggataacgtg tacgagctca tgaaccatat cctcgaaatg accactgcca 2520 ccaactttgc caacggcaaa acccagccat cagtcgctgc cgctgacacg cctgcctccc 2580 gttccccttc cctccccgca cgtggcggta tcaagaaggg cgggcgcgcg cgcaaccacg 2640 gaaatggtgg ccgcgccggc ggcggcggcc gccacggcgg cggtggccac ggcgctggtg 2700 gccacgccgg tggtggccgc ggcggcggcg gtggccacgg cggtggtggt cacggtggct 2760 acggccgcgg cggcggcggc tacggccgtg gaggcggtgg ccacgaccgg ggtggccacg 2820 gtggcggcca cggccgcggc ggacaccagc acggccgcgg cggaggcggt ggccgcgggc 2880 gcggcaacct cactgccgcg gaccgtgaag ccatgctggc gtatctccac gcccagcgcg 2940 aggctgacaa tatgcaggcc gggctgaacc ctaacgtgcg ctccatccta ggaatgtaga 3000 gtgccggcga gcgggttgag ggcacgactt tggtagatac cgagtcagtt gagagagatg 3060 ttgcggcgat aggagagttt acgccggcgg cggccatggc aatagccgtt cagatggcgc 3120 cgcgtacgtt gggtctcttg ccacaagcgc gttcggtggg tgactacggc cccacaacgc 3180 aggcgaaaca ggctaataca acaacagttg ctcgtattgc aaaagcagta gtcggaactg 3240 ctagccagat tgacgagaag cctgtgtaca tttggtgcgg accggacagg atggtagatc 3300 cggccgtatt tcgcctcctg caagagataa ctggtgttgc gttctcgatg gacgcgtcat 3360 gcaaccccaa cggtaccaat tctctctgtg cccgctacgc cacgccggat gagttctgca 3420 aaacagacgt agccggcgaa gccctctggt gctgcgcctc accaacagat gtgagacagt 3480 ttcagcagca ctaccagcgc tgcaagagca gcaacccaaa caacacctgc gcagtgtttg 3540 tcgtaccgaa gtcgcatcat ttgaagcagt acttctctca gcttgggtac aagctagtga 3600 taaacaattg ggcaaatctg ttcttgtgta cctcgatgac gttatgattt tctcaaaaac 3660 accagaggaa cacttgcgac acttggaaga agtcttccaa gtgctacaaa cgcataactt 3720 gcgtgccaag cttgcgaaat gcgagttcaa caaacctgaa ctcaaattct taggccacat 3780 tgtgggccgc gacggtctca aagtcgaccc cgacaaagta cgctcggtcg ccgaatgggc 3840 cgtgccgcaa aaccgcaagc aagttcggtc gttcttgggc ctagccaatt acttccggaa 3900 attcatacag gggtattcct cccttgtagc accgctcaca gccctgacta gcgagaagtc 3960 accctggctt tggtgtgacg cctgccagcg ggcattcgag ggtgtgaaac gtgcgctgat 4020 caccgcgcct gtgcttgcac tacccaacac ctccgaaccc ttcgaggtga aaacggacgc 4080 atcaatctac ggaacaggtg cagttctgct gcaaaacggc aatgccattg cgtacctcag 4140 ccaccggttt acaccagctg agaagaacta caccaccacc gaccaggaag ccctgggggt 4200 gattcacgct ttacaagaat ggcgctgcta tctcgagggc gcgccagatg taaccgtggt 4260 caccgatcac caacccttga cttacctgga ctctatgaag gccaacggac tgctctcacg 4320 ccgtcaggct cggtggatag aattcctgca ggacaaaaac gttcagtggt cctaccaacc 4380 aggccgcatc aatgttgcgg accctctaag ccgcatctac gaagatgtgg ttccagaagt 4440 gccacgcgcc caactagcaa tgttggtact gcagtttcgg gcgcagcacc ccgatctcgt 4500 cctagatcca cacgtgcaag caatcacgcg cgggcaacgg acccgcgatg cacctccacc 4560 tgcgggcaca accatccccg cagacatccg caagcaactt gcagatgggt atgcatccga 4620 tccgaagttc cggactcgga actgggtagc aaagctgaaa aaagtcggcg acctgtacta 4680 ccagggggga caggtgatgg tccccgccgt tgacacgtta cggcgcagtc tcttggctga 4740 ataccacgac ggtggttcca gcgggcatcc agggcacgaa cgtctatacg aggctataag 4800 gcgcgactat ctctgggaag gtctgggcgc agactgtaag agatacgccg ctgcctgccc 4860 agactgccaa cgtaacaaag cctcaaccgc aaagccgcgc ggtttgcttc agccactacc 4920 catcgccaaa cggccctggg gaagcgtcac actggatttg atcacccagc ttcccgaaac 4980 cacctccgga cacactgcta tcatggtttt cgtggaccgt cttacaaaga tggtgcattt 5040 tgcgcccacg acaaacgaga tagatgcaga ggaggcggca tggctctatt tgcaaaacat 5100 tttccgccac cacggattag cggatgaagt gatttccgac cgcggaccac aattcgcggg 5160 taaattctgg cctgaagtgc acaagctctt gcgcacaaag gtgcggctgt ctacagcctt 5220 ccacccacaa actgatggcc aaacggaacg catgaaccgc ttacttgagg agacgctacg 5280 gcattacgta ggccccaggc aagacgactg ggatgaatgc ctgccaatga ttgagttcgc 5340 tatcaacaat gcgcgcagca aatcaactgg ggtaagcccc ttcgaactct acacgccgta 5400 ccaaccgcgt actcccggca acttgggttt accacctaag catacctttt ccgtacccat 5460 ggcatcgtcc ttggttaagc acatgggcga acgcctgaag agagcaattg cctgtctcac 5520 cagtgcccag gcacggcaaa agcaggccgc tgacgcccga cgccaggact ccaccttcgc 5580 cgtgggatct ctcgttttgc tgagcacaaa gaatctcgcc ttgaaggctc caaccggcgg 5640 tactaaaaag cttatgccca aatttattgg accctttacg gtccttgaga aagttggccc 5700 ggtagcttat aagctatcgt tgcccgaagg gtaccgcatt catccggttt tccacgtgtc 5760 gctgctgaaa ccgtacacca gtgacggccg gtaccaaccc ccgcaagccc cgaaattatt 5820 tgatgatgac ggcaatgcat attgggaggt tgaagacatt attggacacc gcgaccgcaa 5880 actgggcgga agtcgctcca aatcggttcg cgagtacctc gtgaaatggc ttggcttcgg 5940 cgctgagcac aattcttggg aaccctcaac agtcctttgc gaggatgagc tcgttgagca 6000 atgcattaac gactacctcg aacgggtgaa cctgcgcaat gctaagcgcc aacgtcgcgc 6060 cactggcacg taaccaagtg tgtccggtta caacgcgtca cggctcccga ggacggttcg 6120 cttctgaggg gggggaa 6137 // ID SINEX-4_CR repbase; DNA; CHL; 518 BP. XX AC . XX DT 13-MAY-2004 (Rel. 9.04, Created) DT 13-MAY-2004 (Rel. 9.04, Last updated, Version 1) XX DE SINEX-4_CR is a putative nonautonomous non-LTR retrotransposon - DE a consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; SINEX-4_CR; nonautonomous non-LTR retrotransposon. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-518 RA Kapitonov V.V. and Jurka J.; RT "SINEX-4_CR, a putative SINE-like non-LTR retrotransposon from RT the green algae genome."; RL Repbase Reports 4(4), 120-120 (2004). XX CC About 30 copies of SINEX-4_CR are present in the C. reinhardtii CC genome. They are ~99% identical to the consensus sequence. CC Usually SINEX-4_CR are flanked by 11-15-bp target site CC duplications. Th 58-bp 3' terminus of SINEX-4_CR is 91% CC identical to that of SINEX-1_CR. The putative Pol III internal CC promoter (pos. 50-73) was derived from tRNA. This region is CC similar to the Pol III promoter of RATHE2 (pos. 55-78), a SINE CC element identified in the A. thaliana genome (GenBank, AY033702). XX SQ Sequence 518 BP; 115 A; 136 C; 151 G; 116 T; 0 other; atcgaccacg gcagaggttg cttagcgata agttggtgtg taacgtcggt tcagacccga 60 gttcgatcct ccccaaattc ggtacagggg aaacctccgt gcgtattcaa atcacgcaca 120 ggcgctccgc agggaccgca cggctcttca gtcgtgtcct tgtcatccgt ggctaacgtc 180 agataagaga gcggtcgtga agtgccggca aaggggccgg actctggagc gatccagagt 240 ttcagttgag atgttgcccg acagtcggca ttacctgatc ccccgatctc aggtaccgaa 300 agccgtgagg gtagatcatc cgagctgaac atggatagga caccaggggc ttaatccacc 360 ccgctcccac cggtgggcag gaccggcaaa tgataaggtg gtcgtggtga cttcccgcct 420 tctctcgaac tgggttgaga gatgatcagc gaaggcgttg ccccgttaat acatgctagg 480 ccctatgggc cagcgttggg attattatta ttattatt 518 // ID Gypsy-1-I_CR repbase; DNA; CHL; 6701 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE Gypsy-1-I_CR is an internal portion of Gypsy-1_CR LTR DE retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; 5-bp TSDs; KW Gypsy superfamily; Gypsy-1-I_CR; Gypsy-1-LTR_CR; chromo domain; KW endonuclease; gag; protease; reverse transcriptase. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-6701 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-1_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 4(2), 35-35 (2004). XX DR [1] (Consensus) XX CC Gypsy-1_CR can be still active retroelement. It is flanked CC by identical LTRs. The internal portion encodes the 778-aa CC Gypsy-1_CR1p (pos. 37-2370) and 1331-aa Gypsy-1_CR2p (pos. CC 2664-6656) proteins. Gypsy-1_CR1p includes the gag-like domain. CC Gypsy-1_CR2p is composed of the protease, reverse transcriptase, CC endonuclease, and chromo domains. XX FH Key Location/Qualifiers FT CDS 37..2370 FT /product="Gypsy-1_CR1p" FT /translation="MGQKSGGSAPPATKQKSAGATRTTHGKPTGATTAAGK FT RASDQTSDHVKSRRAIAQGAEMGAAKDTSDVMDDANVVSGGSPAGSGKRRI FT PGQHGSNQGSPGQGHRDSNGDTVMGEVVGPGAGGTRAGAAGTQPPASTLPP FT SALNATDRRKRLMAAYLAAYARVNGHVQHYTRAAQPIVTADDVVRAHELAT FT VMGTSLTLTNQTMLDYILDVAQNEALPPVPTTAGAPADAPAGPMPAGGGNR FT AVLPTAVTPMDPVMAEIKQSLTRLDTALTDLKLQAQHDRKELEDLIALNAP FT PPAPATDTDLTNTASADSSSTSESEPSEAVDDEPSEVEPPAAEPPVQAAAP FT AADAPPPIRLLAFGAKPAAAKTAPTSTDQHRRRRQKYVSLGVSRFRQSPQY FT VPLRDDLYRHQNRASIKAFTKAFVKSVDAYLTAKRLDKMQALDELRAYEAM FT DIEMPLQYLRGENAEKLTENDRRRFYTQRITAPPTYAAAAAAAPSTIDEVI FT AQRDQVGTGGTGKKPKVPEPQKFSGVTDKHITCTTTWFDTFVEYCDEQKLE FT PSRHLLFFCTGTAQEWMVGYRHDMQLAKMAITTSRLRTAFLQQFGDVRRHT FT PAEARRRLMMLEHKQQPGERIATYTQRFRLIARDARDMAELDRIEHYMNGL FT AAQFQAVCRTDSEGKDWTSLDSLVSYALGQESRKTQSKGATLAIANTSRPP FT FRSRPRKQGCSTGGGGGGGGGGYGGKRPGGGARDSGKPKKIRADRNTCRHC FT FEHFANGFPAQHLQDCAVLKRKKAMAA" FT CDS 2664..6656 FT /product="Gypsy-1_CR2p" FT /translation="MRSFCLSVTFIPRIGCGGRNPLVSRAVRAVDSIRGSL FT PHVQERKPTDTSAVFVAPQWPELEQYFTKSGFRLLKSYPTGTQLFTIAGSN FT LTTLAAPWPVQLWYDPPEPDAKLSAIAPKVSYTMTFRGRAGHTDVTTLVDS FT GACTVGTADGYISQKAATQMGLRATSSPTPAVTLADGTRKALGGVVQTRLQ FT MGAYHGTVHLLVLPDTVTGAQIILGTDWLNRHGALMDWESHTLTLRRGHKR FT VALRPAYTESPDMLAGSVYADPMQVMDYVRALLHDASTPRNLISKNEFARC FT VTTYGEHAFLAVVRWPDPTDEAWTPTCALAAATLPTLGDTGLIPAHDLQNL FT LTEHADVFEELKGIRKDTTGISHTIPLVEGAHPPSKRMYRLSQGETTEVNK FT QVADLLAKGFIEPSNSPFGAPVLFVQKKDGTLRMCLDFRALNKLTVKRNYP FT MPNIQDLFDQLEGAKVFSSLDLQSGYYQIPITKEDQEKTAFITPKGQYQFK FT VLCFGLTNAPASFQATMSKIFKKQLGKTVLVYLDDILVFSKTPEEHLAHLR FT EVLGVLREHEFKAKLPKCDFNKSELKFLGHIVGRHGVRVDEDKVKVVKDWA FT TPTNLKQLRSFLGLANYFRRFIQGYSSLVAPLMALTGTKVPFVWTPECQAA FT FEGVKVALTSAPVLALPRFDLPFEVWSDASIYGTGAVLLQDGKPVAYTSAK FT FSSAEHNYTTTDQECLGVIHALTEWRCYLDGASHVTLVTDHQPLTYLQDQK FT SSNLLSRRQARWMEFLSRFSFTWEYRPGRINVADPISRIWEPMCAALTRSE FT ISSDLLDRIRKGYSGDSLFSDSRASASLQLKTSAGLWYRDTQVVIPAADGL FT RELILTEFHNSPTAGHRGARRLKEAISRRYWWPRLDADCTEYVASCPDCQR FT NKPSNQKPAGKLQPLPIPEANWDSVSLDLITKLPTTLAGHDAIVVFVDRLS FT KMTHFAPTHSDVDAEELAEIFLQYVFRHHGLPINLVSDRGSVFTGRFWKEL FT FSRLQTKLHFSTAYHPQTDGQTERMNRLLEETLRHYIGPLQDDWDKHLPLI FT EFAINNSQSGSTGATPFELNYGRRPRVPGDLALPSSNVPAAQNFSDAMRQR FT MAHAKKCLRAAQDRQRVATDSCRRDSKFDVGDDVLLSTKNIKLRIPGAAKL FT MPRYIGPFKIQRRIGAVAYELALPPGYKIHNVFHVSLLKAWKPGQAYNPPP FT AILEDNGDTYWSVESLLLHRDRKCGRKTLREFLVKWEGYGPEHNSWEPEAN FT LRESAPLEEAIEEYLARTATKQAAREKRINTVAQRRADRKAAATQRYATRK FT ATPPAAPTKAPRQTRGRAT" XX SQ Sequence 6701 BP; 1508 A; 2017 C; 1934 G; 1242 T; 0 other; agtggtatcg agggtccagg gggccttcgc gaccaaatgg gacaaaagtc cggcggctcc 60 gcgccgccgg cgaccaagca gaaatctgct ggtgctacgc gcaccacaca tggcaagccc 120 acgggagcta ccacagccgc tgggaagcgg gcgtcagacc aaacttctga ccatgtgaaa 180 tctcgacgtg ctattgcaca gggagccgag atgggagctg ctaaggacac ctccgacgtc 240 atggatgacg ccaacgtggt aagtggcggt tcgccagcgg gctcagggaa gcgccgcatt 300 cctggtcagc atgggagcaa ccagggcagt ccaggacagg gccatcgcga ttccaatggg 360 gataccgtga tgggagaggt agttgggcct ggtgcgggag gcaccagagc tggggcagcg 420 ggaacgcagc cgccagccag cacattgcct ccgtctgcct tgaacgctac ggaccggcgc 480 aagcgcctca tggctgctta cctagctgct tacgctaggg tcaatggcca tgtccagcat 540 tacacacgcg cggcgcaacc aattgtgacc gccgatgatg ttgtgcgcgc tcatgagctc 600 gccactgtca tgggtacgtc cctgacactg actaaccaaa ccatgctgga ttacattctg 660 gatgtcgcgc agaacgaggc attgcctccg gttccgacga ccgctggagc gccggctgac 720 gcgccggccg gcccaatgcc ggctggtggc ggcaaccgcg ccgtcctgcc gacggccgtc 780 acgccgatgg acccagtcat ggcggagatt aagcagtcgc tgacgcggct ggacacggcg 840 ctcacggacc tgaagctgca ggcgcagcac gaccgcaagg agctggagga cctgatcgcg 900 ctcaacgcac cccctccggc gccagcaacg gacaccgacc tgactaacac agcgtctgct 960 gatagcagca gcaccagcga gtcggagccc agcgaggcgg tcgatgacga gccctcggag 1020 gtggagcctc cggcggcgga gcccccggtg caggctgccg ctccggcggc ggacgccccg 1080 ccgccgattc ggctgctggc gttcggcgcc aagccggcgg ccgccaagac ggcgcccacc 1140 tccacggacc agcaccgccg gcggcgccag aagtacgtga gcctgggcgt ctccaggttc 1200 aggcagtcgc cgcagtacgt gccgctgcgc gacgacctgt accgccacca gaaccgggcg 1260 tcgatcaagg ccttcaccaa ggccttcgtc aagagcgtgg acgcctacct gaccgccaag 1320 cgcctggaca agatgcaggc gctggatgag ctccgcgcct acgaggcgat ggacatcgag 1380 atgccgctcc agtacctgcg cggcgagaac gcggagaagc tgacggagaa cgaccgccgg 1440 cgcttctaca cgcagcgcat cacggcgccg cccacgtacg cggcggcggc ggcggcggca 1500 ccgtccacca tcgacgaggt catcgcgcag cgcgaccagg taggcacggg tgggacgggc 1560 aagaagccta aggtgccaga gccgcagaag ttctcggggg tgactgacaa gcacatcacc 1620 tgcactacca cctggtttga cacgttcgtt gagtactgtg acgagcagaa gctcgaaccc 1680 tcacggcatc tccttttctt ttgcactggg acggcgcaag agtggatggt ggggtacaga 1740 cacgacatgc agctagctaa gatggccatt accacatcac ggttgcgcac agcgttcctc 1800 caacagtttg gggatgtgcg gcgtcacacg ccggcggagg cccgccgacg tttgatgatg 1860 ctagagcaca agcagcagcc gggggaacgc attgccacct acacgcaacg cttccgcctc 1920 attgcacgtg atgcacggga catggctgag ctagatcgca ttgagcacta catgaacggg 1980 ttagcggctc agtttcaggc tgtatgccgc acggactccg agggcaagga ctggacgtcg 2040 cttgacagcc ttgttagcta cgcgctgggg caggagtcac ggaaaaccca atctaaaggt 2100 gccacactcg cgattgctaa cacgtcacgc ccgcccttcc gatcccgtcc ccgcaaacag 2160 ggctgcagca ccggcggcgg tggcggaggc ggaggcggcg gctacggcgg caagcggccc 2220 ggcggcggcg cccgcgacag cggcaagccc aagaagattc gcgccgaccg gaacacctgc 2280 cgccactgct tcgagcactt cgccaacggc ttcccggcgc agcacctgca ggactgcgcg 2340 gtgctgaagc gcaagaaggc catggcggcc tagggggctg gcagtgagcg agagggcgcg 2400 ggtcgtgtgg atagcggctc gcctggtctt ggtagtcctt tagccatggc ggtgacgccg 2460 gagactgtgg tcatccctac cgttacgacg aatggggtag ctgcaatagc tgcgggtacg 2520 acgcaggcta cgtttccacc accgccactg ccaggaggct ccgatagcgc attcagtgct 2580 gacgagttcc gcctggtgga gaaatggctt ggcacgaaat tcacgtttga tgcagcgtgt 2640 aatgaaaagg gtaccaacgc tagatgcgct cattttgcct ctccgtcacg ttcattcctc 2700 gaatcggatg tggcgggaga aaccctctgg tgtcacgcgc cgtacgagca gttgacagca 2760 ttcgcggatc attaccgcac gtgcaagaac gtaaaccgac agacactagt gctgtattcg 2820 tggccccgca atggcctgag ctagagcagt actttaccaa gagcgggttt cgtctgttga 2880 agtcataccc tactgggaca cagctgttca cgatcgcggg aagcaacctc acgacgcttg 2940 cagcgccgtg gccagttcag ctgtggtatg acccgccaga gccggatgct aagctatcag 3000 ccatagcacc gaaggtcagc tacactatga cgttccgtgg gcgcgcaggt cacaccgacg 3060 tgaccacgct ggtcgacagc ggcgcttgca ccgttggcac agccgacggg tacatcagcc 3120 agaaggcagc cacccaaatg ggactgcggg caaccagctc tcctacccct gccgttacac 3180 tagctgacgg cacgcgcaag gcgctaggtg gtgtggtgca aacccgcctg caaatgggag 3240 cttaccacgg gactgttcac ttactagtgt tgcctgacac cgtcaccgga gcacaaatca 3300 tcctgggcac tgattggctt aaccgccacg gcgcacttat ggattgggag agtcacaccc 3360 tcacgctacg tcgtggacac aagcgggtgg cattacggcc ggcatacacc gaatccccag 3420 acatgttagc gggctcagtt tacgctgacc caatgcaagt catggactac gtgcgtgccc 3480 tattgcacga tgcctcaacc ccccgcaacc ttatctcgaa gaacgagttc gcgcggtgcg 3540 ttaccacata cggggaacac gcctttcttg cggtggtccg atggccggac ccgaccgacg 3600 aagcgtggac ccccacatgc gcactggccg cagccactct ccccactcta ggtgacacgg 3660 gactaattcc cgcacacgat ctgcaaaacc tcttaacaga acacgcggat gtgtttgaag 3720 agctgaaagg gatacgcaag gataccaccg gcatttcaca tacaatcccc ctggtggaag 3780 gcgctcaccc gccctcaaag cgtatgtacc gcctgtctca gggtgagact accgaggtta 3840 acaaacaggt ggccgatctg ttagcgaaag gctttatcga gcctagcaac tctccattcg 3900 gagccccagt cctgttcgtg cagaaaaagg atggcacact acgtatgtgc cttgatttcc 3960 gggcactgaa caagctaaca gtgaagcgca attaccctat gcccaacatt caagacctct 4020 tcgaccaact cgaaggtgca aaggtcttca gctcgctcga tttgcaaagt ggctattacc 4080 aaattcctat cacaaaagag gatcaggaaa agacggcttt cataacgccg aaggggcagt 4140 atcaattcaa ggtgttgtgc ttcggcctca ccaatgctcc tgcttctttt caagccacaa 4200 tgagtaaaat cttcaagaag caattgggga aaaccgtgct agtatacctg gacgatatcc 4260 tggtttttag caaaactcca gaagaacatc ttgcacactt gcgagaagta ctcggtgttt 4320 tgcgtgagca tgaattcaag gcgaaattgc cgaagtgcga cttcaataag tcggaattga 4380 agttcttggg gcacatcgta ggccgacacg gcgtgcgagt tgatgaggac aaggttaaag 4440 tggtgaagga ctgggccaca cctacgaatc taaaacaatt acgttcgttc ctcggactgg 4500 caaattactt ccgccggttc atccaggggt actcctccct ggtcgccccc ctcatggcct 4560 taacgggaac gaaagtaccg tttgtctgga caccggagtg ccaggcagca ttcgaaggcg 4620 tgaaggtagc acttacaagc gcacccgtac tagccctgcc tcgctttgac ctaccctttg 4680 aggtatggtc cgacgcgtca atttacggca cgggagctgt tctcttgcaa gatggcaagc 4740 ccgtagcgta taccagcgcc aagttcagct ctgctgagca caattacacc acgactgatc 4800 aggagtgcct aggagtaatc cacgcactta ctgagtggcg ctgctacctg gacggagctt 4860 cccacgtgac ccttgtcaca gatcaccaac ctctcaccta tttgcaagac caaaagagca 4920 gtaacttact gtccagacgt caagcacgct ggatggagtt cttgtcccga ttctccttta 4980 catgggagta ccgccctggc agaattaatg tggctgaccc aatatctcgt atctgggagc 5040 ccatgtgcgc agcgctcaca cgctcagaga tttcttcgga cctccttgac cggatccgta 5100 aaggctattc aggtgacagc ctattctctg attcgagggc atcagcaagc ctgcagctta 5160 aaacctcagc gggtctctgg tacagagaca cccaggtggt gatccctgcc gctgatggat 5220 tgcgggagct gatattgaca gagttccata acagccccac agccgggcac cgtggtgcac 5280 gccgcttaaa ggaagcaatt tcgaggcgct actggtggcc acgtctggac gcagactgca 5340 ctgagtacgt tgcctcgtgc ccggactgtc aacgcaacaa gccttccaac caaaagccgg 5400 ctggaaagct gcagcccctc cccattccgg aagcgaactg ggacagcgta tcacttgatc 5460 tgataacgaa attaccgaca acccttgctg ggcacgacgc tatagtggtg tttgtggacc 5520 ggctgagcaa aatgactcac tttgctccca cacacagcga cgtcgatgct gaggaattag 5580 ctgaaatttt cctccagtac gtgttcagac atcacggttt gccaatcaac ctggtgtccg 5640 accgtggctc tgtgttcacg ggtcggttct ggaaagagct gttctcacgg ctacaaacta 5700 agctgcattt ctctacagct tatcatccgc aaacggatgg ccagactgaa cgaatgaatc 5760 gccttctgga agaaaccctg cgccattata tcggcccact ccaagatgat tgggacaaac 5820 acctgcctct tatcgagttt gctatcaaca actcgcaaag cggtagcact ggggcgacac 5880 ccttcgaact caactacggt cgacgacctc gggtgcccgg tgacctggcc ttgccttcta 5940 gcaacgtgcc agcagcgcaa aacttctcag atgccatgcg gcagcgcatg gcgcacgcta 6000 aaaaatgcct gcgagcggcg caggacaggc agcgtgtggc cactgactcc tgccgtcgcg 6060 acagcaagtt cgacgttggg gatgacgtac tgctatcaac caaaaacatc aaactccgca 6120 ttcctggcgc agcgaagctg atgcctaggt acatcggtcc gttcaaaatt cagcgacgga 6180 tcggtgcggt ggcctatgaa ttggctttgc cacctggcta caagatacac aacgttttcc 6240 acgtatcctt gctgaaagca tggaagcccg gccaggcgta caatccgcca ccggccattc 6300 ttgaggataa cggtgatacc tattggtcag tagagtctct tctattgcac cgtgatcgca 6360 agtgcgggcg caagacccta cgggagttct tggttaagtg ggagggctac gggcctgagc 6420 ataactcttg ggagcctgag gctaacctgc gggagtccgc tccgttggaa gaagccatcg 6480 aagaatacct tgcacgcacg gctactaagc aagccgcgcg cgagaaacgc attaacactg 6540 tcgcacagcg cagggctgac cgcaaggctg ccgctactca gcggtacgcc acgcgtaagg 6600 ccacgcctcc ggcggcccct accaaagctc cacgtcagac acgtggcagg gcaacttgag 6660 ttactaccgc tcccgggacg ttcgctccaa ggggggggga a 6701 // ID MSAT-1B_CR repbase; DNA; CHL; 72 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE MSAT-1B_CR is a minisatellite unit - a consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-1B_CR; MSAT-1_CR; KW Interspersed repeat; minisatellite; nonautonomous; tandems. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-72 RA Kapitonov V.V. and Jurka J.; RT "MSAT-1_CR, a family of minisatellites from the green algae RT genome."; RL Repbase Reports 5(3), 53-53 (2005). XX DR [1] (Consensus) XX CC The MSAT-1B_CR is a subfamily of MSAT-1_CR minisatellite. XX SQ Sequence 72 BP; 11 A; 16 C; 42 G; 3 T; 0 other; ctgggaggcg cgggagacgg aggcacaggc gagggaggag ggctgggagg cgccgggcta 60 ggcggcgccg gg 72 // ID hAT-N11a_CR repbase; DNA; CHL; 249 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 17-AUG-2004 (Rel. 9.07, Last updated, Version 1) XX DE hAT-N11a_CR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; 8-bp TSD; KW hAT superfamily; hAT-N11_CR; hAT-N11a_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-249 RA Kapitonov V.V. and Jurka J.; RT "hAT-N11_CR, a family of nonautonomous hAT transposons from the RT green algae genome."; RL Repbase Reports 4(7), 203-203 (2004). XX DR [1] (Consensus) XX CC hAT-N11a_CR is a subfamily of hAT-N11_CR nonautonomous CC transposons. The hAT-N11a_CR and hAT-N11_CR consensus CC sequences are 84% identical. hAT-N11a_CR elements are flanked CC by 8-bp target site duplications and contain 15-bp terminal CC inverted repeats. This family is composed of ~100 copies ~98% CC identical to the consensus sequence. Some copies are identical CC to each other. Therefore, this subfamily may be still actively CC transposable in C. reinhardtii. XX SQ Sequence 249 BP; 55 A; 52 C; 81 G; 61 T; 0 other; cagtgctcgt atcttccatc tacactgtag aaatctgccc aaaatccgcc gaaggcggag 60 aaaaattttt gggtttgggt gccacgcatc tacgcagtgt agacggtttt ggccggtttt 120 ggcgcaaaaa tgggggaagc gtagacgttt tggggggctt gggggcttaa gtgtagatgc 180 cgcggtcagg gtcatctacg catgtgtaga tgccgggtgc ctgagggcat ctacaagata 240 cgagcactg 249 // ID RTEX-1_CR repbase; DNA; CHL; 5811 BP. XX AC . XX DT 26-FEB-2010 (Rel. 15.02, Created) DT 26-FEB-2010 (Rel. 15.02, Last updated, Version 1) XX DE A family of RTEX non-LTR retrotransposons - a consensus sequence. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; RTEX-1_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-5811 RA Kapitonov V.V. and Jurka J.; RT "First examples of RTEX non-LTR retrotransposons found outside RT metazoans."; RL Repbase Reports 10(2), 138-138 (2010). XX DR [1] (Consensus) XX CC RTEX-1_CR belongs to the RTEX clade of non-LTR retrotransposons. CC The consensus sequence is <1% divergent from several copies of CC RTEX-1_CR. The 3' terminus is composed of the (CA)n CC microsatellite. Some RTEX-1_CR elements are flanked by ~10-bp CC TSDs. XX FH Key Location/Qualifiers FT CDS 1..1422 FT /product="RTEX-1_CR_1p" FT /note="ORF1." FT /translation="MSTNDNSWYKYVEQTQREPEKPCAGGLVLAFLSKLEE FT LRTQTELNTSAVAVLRERQASSEKRVAEVESKIKHNVAQPVSANSFYFTDR FT LIIKGNMSNTSAETLSKIVCTKLSSDEVHIANLTVRKRSSRLAVLYLGACQ FT DLRGNLLATRLKLQGTGFKLDDCLTTTGLAERRRLQTTHDEAYQAGMRPEW FT RRGALLYARPEPSAPAVPWADLGYGSGSDMDAEAEFVPSGGERTPPHQQPP FT PPPPPKPGQQQQQQQQRQQDQGTSQGQAGGSWAARVANSGSSPTSPVRPFH FT GFSSATPGWSSRGPLPASPIRPFQGFTRYEPASTHQPGDKRGSRTPQSTEQ FT ARLPKPAKPSAHATVADVRNHAPTRAADLPRSKRDPRAPQQQAQAQAQAQA FT QAQAQAQVQAQAQAQVQAQAQAQAQVQAQAQAQMQAQAQVQAQAQTQKPAQ FT GQAALFAAAYTEYKRHSRPTPPPQSGRPP" FT CDS 1866..5702 FT /product="RTEX-1_CR_2p" FT /note="composed of the APE, RT, and CCHC domains." FT /translation="WDGGRGGISVWAWNAHGLLTGGSCKLHDRNINKAARA FT HDILVISEAHLTHRVPDGRIELDGYSVFRAHRSQEAIRSDFGGVAVYVREC FT LMGGVMHLRTDVSLTGCEVVWMRIRSKDGDSLLLGCCYLAPETSRVYKDGG FT KTRVAREATAEAVFGRLQQAISAMRQNGEEVLLVGDLNARMPAGLSDIPDL FT DQLAALEQVEHITALGGALTSMPSPEDYAGLPARASQDKHANCFGEFLARM FT CRSQGFVLLNGRAPGDESGRITFPKGEEGGSVIDLCIASPTLFQSVTSVDV FT GELLRWARRGAGYASDHRPVTLTLSWEVEGQASTEGQAKAKRPRTAFNAQK FT SERYSNLFEQDESPVVSKLTELKGHLEQGRYSTTEAVEALSKCLSGVLEKA FT FGQSRPAHLRETETPWWNEECAVARAALVQAKVALGKPVVKEGERWEAMRA FT ARSVYCRAKRRARAAHDAQVMRDRVARCRADAKALWKMIEERCTSKSPITA FT DGFRDHFARLLNDGAGTVDDSAAKRLLAYCCDEDGWRDSMYDDEEWAELDS FT ILNSDISIDEVTHALERLPNGKAPGTEAAPSECYKYAKTQGDPRADPPIPP FT VNRVAPVLEVLFNRIWRAQDGDESFPEQFTTTVLTPIYKRKGDVKTPGNYR FT GIAVGGALAKCYASILLNRLARAGELFKWRHPAQAGFRRKYGTAHHLFVLR FT HLVTKHTRAGAPPMIVVQIDFEKAFDKVPRPLLWLRLREKGVSGRLLEAIQ FT AAYEKVMMTVKADGKLSAAFEATQGVKQGCPLSTELFGLFIETLAEYIDAH FT EDWLDTASTAGTPELNGKKLSLLMYADDVSLLATTPERMRHLLSLVDTFCE FT AFGMKANVAKCERLVFTSDDQERRRLNDECSGLRLAGQPIPAVDKARYLGL FT VYGPGRAFAACRETLCEAARRAMYALTNRLNRLRIFSPDIRMRCFEVQVRS FT ILAYGCEVWGPDVLAEMLDGGPPPRRRDSNNLAHGPFEACLKDEAVKLQVQ FT YMRMTVGTKRPSHRLLFAELAQLPLHFFFAKLCIGFYNRIAVQKDSLAHDA FT LIDEVQDALVHPEGDGWCARLFRFISAHGVDVWQGRMHMIRPEREESRAGS FT PLPEGQIVSAFRESLMKAWKHERLQSEPSTFPSDNKQPGVQMSKYKHWMGL FT CAEGAAPLTMQGHSRAFIPVAHHKALMRFRLCCWPLTANRAYGRPREERIC FT PLCVANEVEDENHVLMRCTAYDQLRLGSEIDFTGGMQAVMQNADPARLAAL FT LDSIWEHRSISTPIRGPN" XX SQ Sequence 5811 BP; 1364 A; 1554 C; 1873 G; 1020 T; 0 other; atgtcgacaa acgacaatag ttggtacaag tacgttgaac aaacccagag ggagccggaa 60 aagccatgtg ccggcggcct ggtcctcgcg ttcctgagca agctggagga gctcaggaca 120 cagaccgagc tcaacaccag tgctgtagca gttctgagag agcgtcaagc tagctcggaa 180 aagcgggtag ctgaggtaga gtctaagatc aaacacaatg tcgcacagcc agtgtccgcg 240 aacagctttt acttcacaga taggctgatc atcaagggca acatgagcaa taccagcgca 300 gagacgctaa gcaagatagt ttgcacaaaa ctcagcagtg acgaggtcca catcgcgaac 360 ctgacggtcc gcaagcgcag ttcccgcctt gcagtgctct atctgggggc atgccaggac 420 ctccgcggga acctcctggc cacgcggctg aagctccagg gcaccggctt caagctggac 480 gactgcctca cgaccacagg cctggcggag cggcggcggc ttcaaaccac ccacgacgaa 540 gcataccagg ccggtatgcg cccggagtgg cggcgtgggg cactgctgta tgcgaggccg 600 gagcccagcg caccggccgt gccatgggcg gacctgggct acggatcggg ctcggacatg 660 gacgcggagg cagagttcgt gcccagcgga ggggagcgca cgccgccgca ccagcagccg 720 ccaccgccac cgccccctaa gcctggacag cagcagcagc agcagcagca gcggcagcag 780 gatcagggca ccagtcaggg tcaggcggga ggctcctggg cagcccgcgt ggccaactcg 840 ggctcgtctc ccacctcacc agttcggccc ttccacggct tctcgtctgc cacaccgggg 900 tggtcgtccc ggggtccgct acctgcctcc ccaattcggc ctttccaggg cttcacacgc 960 tacgagccgg catcaacgca ccaaccgggc gataagcgag gctcccgtac accgcagagc 1020 actgagcagg caaggctgcc caagcctgca aagccctctg cacacgccac ggtggcggac 1080 gtgaggaacc acgcgcccac gcgcgcggca gacctaccgc ggtcgaagcg agaccccagg 1140 gcaccgcagc agcaggcgca ggcacaggcg caggcgcagg cgcaggcgca ggcgcaggcc 1200 caggtgcagg cgcaggcgca ggcccaggtg caggcgcagg cgcaagcgca ggcgcaggtg 1260 caggcacagg cgcaggcgca gatgcaggcg caagcgcagg tacaggcaca ggcgcagacg 1320 cagaagccgg cgcaggggca ggcggcgctg ttcgccgctg cgtacacaga gtacaagcgg 1380 cacagccggc cgaccccgcc gccgcagagt gggcggccgc cttgactgcg gccaccgcca 1440 ccactgccac cgtagccgct ccggccgcgc cagcagctgt ggcggccact aaggccgcca 1500 ccgcctcggt ccccggcacg actaccccta ccgcggccgc cgccgcaggc gcctcgacgg 1560 agccagtggc aggtgacttc agggtcagcg ctggggggga ggtgctcgca gtgccagcaa 1620 acccgaaagg gaagagcgct gtggacgacg atgggtggca agtacagctc ggacgcagcc 1680 aggccggaag gggaggtggt agcagcactg gcaggggtgg aggcagggcc ggaggacgca 1740 cagcgggcgg tggcagggcc aggggcgccg gacacacgac gacaggggtg aatcgatttg 1800 taccgctgca cgacatgggt ctgagcgcgg atacggatac ggacatgggt aatgcccagg 1860 attagtggga cgggggccgg ggaggcataa gtgtctgggc atggaacgct cacggacttc 1920 tcactggagg ctcttgcaag cttcacgacc gcaacatcaa taaagccgca cgagcgcacg 1980 acatcctggt catttccgaa gcacacctaa cacatcgtgt gccagacggc cgtatagaac 2040 tagacggata ctcagtcttt cgtgcacata gatcgcaaga agctataagg tcggattttg 2100 gcggcgttgc agtttatgtg cgtgagtgtc tgatgggcgg agtgatgcat ttaagaacag 2160 acgtgagtct tacagggtgt gaagtcgtgt ggatgcggat acgttccaaa gatggagaca 2220 gcctgctttt aggatgttgt taccttgcgc cggaaacctc gcgcgtatac aaggacggcg 2280 gcaaaacgcg tgtggcaagg gaagccacag ctgaggccgt ctttggaagg ctccagcagg 2340 ctatctcggc gatgcggcag aacggcgaag aggtgctact agtaggtgac ctcaatgcgc 2400 gtatgccggc agggcttagt gatataccgg atctcgacca actagcagcg cttgaacaag 2460 tggagcatat cacggcactg ggtggagcac tcacgtcgat gcctagtcca gaggattacg 2520 cggggttgcc ggccagggct tctcaggata agcacgcaaa ctgttttggc gaatttctag 2580 cgcgtatgtg ccgttctcaa ggttttgtgt tactgaacgg cagggcacca ggggatgagt 2640 cgggtcgaat caccttccca aaaggggagg aggggggaag cgtcattgac ctatgcattg 2700 catcgccaac tttattccaa tcggtgacgt cagtagacgt gggtgagctg ctgagatggg 2760 cgcgcagggg cgccggttat gcgagtgatc acaggcctgt tacgctgact ttgagctggg 2820 aggtggaggg tcaagcaagt actgaaggcc aggcgaaggc aaagcgtcca cgcactgcat 2880 tcaatgcgca gaagagtgag cgctatagca atcttttcga acaagatgag tcgccagtag 2940 taagcaaatt aacagagcta aaaggccact tggagcaagg gcggtacagc actaccgagg 3000 cggtcgaagc gttgagcaag tgcctaagtg gagtattgga gaaggcgttt gggcagagca 3060 ggcctgctca tctgcgagag accgagaccc cgtggtggaa tgaggaatgt gcggtggcgc 3120 gtgccgcgct cgtccaggcg aaagtagcgc ttggcaaacc agttgtgaag gaaggagaaa 3180 ggtgggaagc gatgcgggcc gcgagatcag tgtactgccg cgcaaaacgc agggctagag 3240 ccgcccatga tgcgcaagtg atgcgggata gggtagcacg gtgcagagcc gacgctaaag 3300 cactgtggaa gatgattgag gagcggtgca cgagcaaatc ccccatcacg gcagatggct 3360 tccgtgatca ctttgcacgg ctactgaatg atggggcagg aacagttgac gacagcgctg 3420 caaagcgttt actggcgtac tgctgtgacg aagatggctg gcgagattcg atgtatgatg 3480 acgaggaatg ggctgagtta gatagcatat tgaacagcga tatctcgata gacgaagtga 3540 ctcatgcttt agagaggcta ccgaatggca aggccccagg cacggaagcc gcgccatcgg 3600 aatgctacaa gtacgcaaag acgcagggag accccagggc agaccccccc atcccgccgg 3660 tgaaccgggt agcacctgtt ctagaagtgt tgtttaaccg catctggcgg gcgcaagacg 3720 gagatgaaag ttttccggaa cagttcacaa ccacagtgct gacgccaatt tacaagagaa 3780 agggcgatgt gaagacgccc ggcaactaca ggggcattgc agtaggcgga gcgttggcta 3840 agtgttatgc atctatcctt ctgaacaggc tagcacgagc aggcgagttg ttcaagtgga 3900 ggcacccagc tcaggctggt ttcaggcgga aatacggtac tgcccaccac ctgtttgtcc 3960 tgaggcacct ggtgacaaag cacacacgtg caggagcacc accaatgatt gttgtacaga 4020 ttgattttga gaaggcgttt gacaaggtgc cgcgccccct cttgtggcta cggctgcggg 4080 aaaagggcgt gtcagggcgg ctgttggagg ccatacaagc cgcatatgaa aaggtcatga 4140 tgacggttaa agccgatggc aaactgagcg ctgcttttga ggcaacgcaa ggagtcaagc 4200 aagggtgccc actgagcaca gagctgttcg ggctctttat tgaaactttg gcagagtata 4260 ttgatgcgca cgaggactgg ttggacactg caagcacagc gggcacccct gagttaaacg 4320 gtaagaagct gtcgctccta atgtacgctg acgatgtttc gctgctagcc accacccctg 4380 agcgtatgcg gcacctgttg tcacttgtgg atactttctg cgaagcattt ggtatgaaag 4440 caaacgtcgc aaagtgtgaa cgtctggtgt tcacttcaga cgaccaggag cgtcgtagat 4500 tgaacgatga gtgcagtggg ctgcggctgg cagggcagcc catccctgcg gtggacaagg 4560 cacggtatct gggactagtc tacggccctg gacgtgcttt tgccgcctgc agagagacgc 4620 tatgtgaggc tgcgcggcgt gctatgtacg cgcttactaa tagattaaac cgtttgagga 4680 ttttctcccc cgacatacgc atgcgttgtt ttgaggtgca agttcgctcc atcctagcat 4740 atggttgtga agtgtgggga cccgacgtat tagcggaaat gctggacggc ggcccaccac 4800 cgcggcggcg tgacagcaat aacctggcgc acggaccgtt tgaagcatgc ctgaaagacg 4860 aggccgtcaa attacaagtg cagtacatga ggatgacagt gggtacgaag cgaccatcgc 4920 atcgcctgct gtttgctgaa ttagcacaac taccactcca tttctttttc gccaagcttt 4980 gcattggatt ctacaacagg attgccgtgc agaaggatag cctagctcac gatgcactaa 5040 ttgatgaagt acaagacgcg ttagtacacc cagagggaga tgggtggtgt gcacggcttt 5100 tccgttttat ctcagcgcat ggcgtagacg tatggcaagg ccgtatgcac atgatcaggc 5160 cggaaaggga ggagagccga gcaggtagcc cgctgcctga agggcaaata gtatccgcct 5220 ttcgagagag tctaatgaag gcgtggaagc acgagcggct gcagtctgag ccaagcactt 5280 tcccatcaga caacaagcaa ccaggcgtgc agatgagcaa gtacaagcat tggatggggc 5340 tgtgtgcgga aggagcggca ccactgacca tgcaagggca cagtagagca tttataccag 5400 ttgcgcacca caaggccttg atgaggttcc gcctatgctg ctggccgctt actgccaacc 5460 gcgcctatgg acgacctagg gaggagagga tttgcccgct atgtgttgca aatgaagtcg 5520 aagatgagaa tcatgtgctc atgcggtgta cggcctacga ccagttgcgt ttgggtagcg 5580 agatcgattt tacaggcgga atgcaggccg tcatgcagaa tgcggaccca gccaggttag 5640 ccgcgttact agattccatt tgggagcaca ggagcataag cacccccatt cggggaccaa 5700 actagctgca tatataagtg ttgcaggcgt tataagggcc ccccggcccg ggcctaggtt 5760 tctacaagga caggaatgca cgtcgtgctc accaccttgt aaccacacac a 5811 // ID Gypsy19-I_CR repbase; DNA; CHL; 6527 BP. XX AC scaffold_8; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy19-CR; KW Gypsy19-I_CR; Gypsy19-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-6527 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 99-99 (2008). XX DR Genome; scaffold_8; Positions 461786 468312. XX CC Positions [5265-5756] - Integrase core CC 'GAGGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(39..2165,2169..6440) FT /product="Gypsy19-I_CR_1p" FT /translation="MDPAAEAAAQAAAQAAQAAAAQQALAQQAAAAAAAAA FT AAAAAAPRAGSGGSAAPSSVHLPPRPASAPGMGSNNDDEMLSAGEGDEVSG FT MFRNILSLPTSTALALMSRVHNALQSTLRRSPTRAATQSLAEPTGTSINPL FT QPTSNGSDGSHESHGPRTASLARSAPEQALTMGPSGIYSRVRASLAAGPST FT GTGAPVSSDLPPTASDPSAANAPPRAARARLRPPPPPNNLSSAPQTHSHAP FT PPTAAPAPQYGYDHQPSAPMAPASYGPGPTYGHAGPAFGTGGPAFHQPPSV FT MSQLDTLRHRQNGHGHLLEDVQRTGHGTQQAVESLRNGLQNMHSLLVNTNT FT DVANLRNSQAPLNAAVHEIQLVSPGSAPSTAVPYARHLPRPSKLAGDPDLD FT KSIGDREVWFASVTSYMGRTGMNLLNDFEFFLEGPAATWFKNLRAYYHSTG FT KALDATALQAEFKRAYGDLENQSEPQRVRERISAGEARQTPGMPVSQYVQK FT FTTLLLEAPDMAEADRIQYFFDNMLPSLARDAIVDGDNKRHKTLASAIDAA FT YAAELGYRMRKRASASAAVASIPKGTDSDDCNNASSSDDEGSDGDGDATVA FT YVKRLPHRGKPHTHSKHAPRTHAFKHTKRPDRYVPLRNARKTQALGRLENK FT FDALRDRMGDGAGGSAKRQRSEEDKSIAGLKTLGFCTKCAREDVSRKWNTC FT RDHNPKFAAGVAQREGLGGSPAAQPSFNEHESLGSGQCTHPAPHQGCQHPS FT HKHTHATAATAPGTTHTHKADRMFSRAEFLSLQTASGRDFSFDAACNDEGS FT NAHCATFACPKRSFFKSDVGGHHVWLNPPHGQLQYWVKHYQRCKAARPFDT FT SAVIIAPKGSLSGKLTTGMTLLQEYATGTLLFATSADSNQLDPAPQDLQAW FT YDPPQQPKLRVASPDNALIFKCRIGHTQHKVLIDTGASHSIVSKNALPPGL FT TVAPHGAKEVEVADGKRVTLEGTAQLPLHIQKYTAAVPALVMPSLLPGVDL FT ILGMDWMRANGVKLDIPNLTCLLTKPANGSTKHILLVADSCNKGSTAGGED FT TNAGVAHMKAICAMLAATTPDLMSASAARRAMRKNVPGMLVVVREAKPDVL FT SRLDTESSAPPACAAQLRTNPPIFPETPHLCATHTTPTNPTSPSPTPTIPE FT TQLNALLDEFKDVFEPIQGPPKDRGIEHTIDLEPGAKPVFSRMYRLAPNER FT EEVSKQVQELLRLGLIQPSSSPWGAPILFAAKKDGGLRMCIDYRALNKVTV FT KNRYPIPNPEDLFDALHGATIFSSIDLQSGYHQIRINDKDRQKTAFRTPDG FT LYEFLVLSMGLANAPSVFQAVMNQVFQPYLNKFVLIYLDDVIIFSKTPEEH FT IEHLRTVLTALRQEQLQAKRSKCEFNRTELKFLGMIVSKDGLKVDESKVAT FT VRDWPTPKEVSSLRGFLGLANYFRKFIQGYSSLVAPLTQLTGSTAEWQWGT FT QQEEAFNGVKFALTNAPVLRFPDPNKHYEVISDASLAGTGAVLMQEGHPVA FT YTSSKFTPPERNYTTTEQELLGVIKALKQWRCYLEGPQITLVTDHNPNTFI FT DSQKSLAKLSRRQVRWIEYLSRFHYTWEYRPGRINVADPLSRMHATVLAAK FT AQSRYSLTSSLPGEIRVSYSHDPLFVDARALARHGVTYDQTDGFYRCQGKI FT VVPNYNDLRTRIIRELHDSPYAGHRGIERTLELVQREFWWPGITSDVRRQV FT LGCELCQRNKPLMQKPAGLCKPLELPRHVWTHVSMDFITHLPTTRDGHDTI FT VVFVDRLTKMCRVAPTTETITAEDFAQLFLETVWRSHGFPEEVVSDRGSVF FT VNKFMAELYRLTGTKQNVSTAYHPQTDGQTERMNRVLQEILRNYVNPTHDD FT WDKKLPLVEFAINNTYQESIKATPFQLNYCRNPRLPTLGSADSKVPAAARF FT CSDIEESIQRAKRCISAAQERYKHYADRNMREVSYKVGDRVLLNTQHINLK FT HPGTKKFLPLWIGPYTVTQTIGPVAVKLDLPANYRIHPVFHVSRLKPHKQA FT PGSVWTPPPPVAVLDDGAYWSVDRLLAHRDKKRGGKTVKEYLVSWEGYGPE FT HNTWEPEAHITESAIDEYWASQATRRSKRKVAADTAPAPAAGVTPTAPPRK FT RGRTA" XX SQ Sequence 6527 BP; 1446 A; 2248 C; 1674 G; 1159 T; 0 other; gtggtgcgaa acgcctttcg ttctagcccc tgatagctat ggaccccgct gctgaagccg 60 cggctcaggc tgccgctcag gctgcccagg ccgctgctgc tcagcaggct ctcgcccagc 120 aggctgccgc tgcggccgct gccgctgcag ctgcggctgc cgccgcgccg cgtgctggct 180 ctggtggctc cgctgctccg tcgtctgtcc acctgccccc tcgccctgca tccgcgccgg 240 gcatgggctc gaacaacgac gacgagatgc tctccgccgg cgaaggcgat gaggtcagtg 300 gaatgtttcg caacatactg tcattgccta cgtccactgc actcgctttg atgtcgcggg 360 tgcacaacgc attgcaatca actcttcgcc gctcgccgac tcgtgcggct acacaatcgc 420 tagctgaacc tacgggcact agcattaatc cactccaacc tacctcaaac ggtagcgacg 480 gcagccacga atcgcatggc cctcgcactg catcgctcgc tcgttccgca ccggagcagg 540 ccctcacaat gggcccgagt ggcatctata gccgcgtccg tgcgtccctc gcagcgggac 600 ctagcacggg cacgggcgct ccggtcagct ccgaccttcc accgaccgcg tcggacccgt 660 cagccgctaa cgcgcccccg cgtgcagccc gcgcccgctt gcggccgcca ccaccaccca 720 acaatctgtc aagtgctcca cagacgcact cacatgctcc cccacccacg gccgcccctg 780 ccccgcagta cggctacgac caccagccct ccgctcccat ggcgccggcc agctacggcc 840 ccggccccac ctacggtcac gccgggcctg cctttggcac cggcggtcct gccttccacc 900 agccgcccag cgtcatgtcg cagctggaca ccctgcgcca ccggcagaac ggccacggcc 960 acctgctgga ggacgtccag cgcaccggcc acggcaccca gcaggcggtc gagagtctgc 1020 gcaacgggct gcagaacatg cactcgctgc tggtcaacac caacactgac gtggccaacc 1080 tccgcaactc tcaggctccg ctgaacgccg ccgtccacga gattcagctg gtgagtccgg 1140 gttcggcacc tagcaccgct gtgccatacg cgcgccacct gccccgtccg tctaagcttg 1200 cgggtgatcc cgacttagat aagagcattg gggacaggga ggtgtggttc gcgtcggtta 1260 cctcatacat gggccgcacg ggcatgaacc tgcttaacga cttcgagttc ttcctagagg 1320 gacccgcggc cacatggttc aaaaacctgc gcgcatacta ccactccact ggcaaggctc 1380 tagacgccac cgcactgcag gctgagttta agcgcgcgta cggcgaccta gagaaccagt 1440 cggaacccca gcgcgtgcgc gagcgcatca gtgccggtga ggcgcgccag acccccggca 1500 tgcctgtctc gcagtacgtg cagaagttca ctaccctgct gctcgaagcc cctgacatgg 1560 ctgaggcaga ccggattcag tacttcttcg ataacatgct acccagcctt gcgcgcgatg 1620 ccatagtcga cggggacaac aagcgccaca agacccttgc ctccgcgata gacgccgcgt 1680 acgctgccga gctagggtac cgcatgcgca aacgtgccag cgccagcgcc gccgtagcca 1740 gcattcccaa aggcacggac agcgacgact gcaacaacgc ttccagcagt gacgatgagg 1800 gttctgacgg agacggggat gcaaccgtcg cgtacgttaa gcgcttgccc caccgcggca 1860 agccgcacac gcacagcaag cacgctcccc gcactcacgc attcaagcac actaagcgcc 1920 ctgaccgcta tgtgcccctc cgcaatgctc gcaaaacaca ggcgctcgga cgcctggaga 1980 acaagttcga cgccctgcgc gaccgcatgg gcgacggcgc cggcggctcc gcgaagcgcc 2040 agcgctcgga ggaggacaag agcattgccg gcctgaagac tctcgggttc tgcaccaagt 2100 gcgcccgtga ggacgtcagc cgcaagtgga acacctgccg cgaccacaac cccaagtttg 2160 ccgcgtaggg tgttgcgcag cgtgaaggtt taggtggcag ccccgctgcg cagccatcat 2220 ttaatgagca tgagtcgcta ggtagtgggc agtgtacgca ccccgcaccg catcagggtt 2280 gtcagcaccc atcgcacaag catacgcacg ccacggctgc tacggcaccg ggcaccacac 2340 acacgcataa ggcagatagg atgttcagtc gagctgagtt cttgagtttg caaactgcct 2400 caggacgaga cttctccttt gacgctgcgt gtaatgatga aggctccaat gctcattgcg 2460 ccacctttgc ttgtcccaag cgctcgttct ttaagtcaga cgtcggcggt caccacgttt 2520 ggctcaaccc accgcatggt cagctgcagt actgggttaa gcactaccag cgttgcaagg 2580 cagctaggcc attcgacacc agcgcagtga tcatcgctcc taagggctcg ctcagcggca 2640 agcttaccac gggcatgaca ctcctgcaag aatacgccac gggtacactt ctctttgcaa 2700 cgtcagcgga cagcaatcag cttgaccctg caccgcagga cttgcaagca tggtacgacc 2760 ccccgcaaca gcccaagctc cgcgtggcta gcccagacaa cgctctcatt ttcaagtgcc 2820 gcattggcca cacacaacac aaagtcctga tcgatacagg cgcatcccac tccatcgtta 2880 gcaagaacgc acttccccct ggcctgaccg tcgcaccgca cggcgccaag gaggtagaag 2940 tcgcagacgg caagcgcgtg acccttgaag gcaccgcaca actaccattg cacatccaga 3000 aatacaccgc agccgtccca gcactcgtca tgccatccct cctgcccggc gttgacttga 3060 ttctgggtat ggactggatg cgcgccaacg gcgtgaagct agacatccct aaccttacct 3120 gcctgctcac caagcccgcc aacggcagca ccaagcacat actgctggtt gctgactcgt 3180 gcaataaggg ttcgacagca ggtggggagg acacgaatgc cggcgttgca cacatgaagg 3240 ccatctgcgc catgctcgcc gcaaccactc ccgacctcat gtccgctagc gccgcccgcc 3300 gcgccatgcg caagaacgtg cctggcatgc tggtggtcgt gcgcgaagcc aaacccgacg 3360 tgctgtcacg gctcgatacc gaatcatctg caccgccagc gtgcgccgca cagctccgca 3420 ccaacccacc catctttccc gagacaccac acctgtgcgc cacgcatacc acccccacca 3480 acccaacttc cccttcaccc acacccacca ttcccgaaac acaactcaac gcacttcttg 3540 acgaattcaa agacgtgttc gaacctattc aagggccgcc taaggaccgc ggcatcgaac 3600 acacaatcga cctggaacct ggcgcaaaac cagtgttctc ccgcatgtac cgcttggccc 3660 ccaatgagcg cgaagaagtc agcaagcaag tacaagagct gctgcggcta ggccttatcc 3720 agccatcatc gtcgccttgg ggcgcaccca ttctgtttgc cgccaagaag gacggtggcc 3780 tgcgaatgtg catcgattac cgtgcactga acaaggttac ggtgaagaac cgctacccca 3840 tccccaaccc ggaggacctt tttgacgccc tgcacggcgc tactatcttc tcatccatcg 3900 acctgcagtc cggctatcat caaatacgca tcaacgacaa ggacaggcag aaaaccgcct 3960 tccgcacgcc cgacggactg tacgagttcc tggtcctaag catgggttta gccaacgcgc 4020 ccagcgtttt ccaggccgtc atgaaccaag tatttcaacc atatttaaac aagtttgtcc 4080 tgatttacct ggacgacgta atcatcttca gcaagactcc cgaggaacac attgaacatc 4140 ttcgcaccgt gctgacagcg ctgcgccaag agcagctaca agccaagcgc tccaaatgcg 4200 agtttaaccg caccgagctc aaatttctgg gcatgatcgt cagcaaagac ggactgaagg 4260 tggacgaaag caaagtcgcc accgtacgcg actggcccac ccccaaggag gtgtcaagcc 4320 tgcgcggttt cctcggtctt gccaactact ttcgcaaatt cattcagggg tactcaagcc 4380 tcgtggctcc actcacacag ctcacaggca gtacagccga atggcagtgg ggcacacagc 4440 aggaagaggc attcaacgga gtcaagttcg ccctcaccaa cgcacctgtg ctgcgttttc 4500 ctgaccccaa caagcactat gaagttatca gcgacgcctc actagcaggt acaggcgctg 4560 ttctcatgca ggaagggcac cccgttgcat acacaagctc caagttcacg ccccctgaac 4620 gcaactacac cactactgag caagaactgc ttggtgtgat caaagcgctc aaacaatggc 4680 gctgctatct ggaaggtccg caaatcacac tcgtcacgga ccataaccca aacacgttca 4740 ttgattcaca aaagagcctg gccaaactgt cccgccgcca agtgcgctgg attgaatacc 4800 tttcccgctt ccactacacg tgggaatacc ggcctggccg tatcaacgtc gcagaccccc 4860 tctctcgcat gcacgccacg gtacttgccg ctaaggcaca gtcccgttac tccttgacct 4920 cttctctgcc cggcgagatt cgcgtgagct attcacacga cccgctattc gtcgacgcaa 4980 gggcactcgc ccgccacggt gtcacctacg accaaactga tggtttctac cgctgccaag 5040 gcaaaatcgt tgtccctaac tacaacgatt tgcgcacccg cattatcagg gagcttcacg 5100 acagccccta cgccggccat cggggtatcg aacgaaccct cgagcttgtc cagcgagagt 5160 tctggtggcc gggcatcact agtgacgtaa ggcgccaagt cctaggatgc gagctctgcc 5220 agcgaaacaa accgctcatg cagaagcctg cgggtctgtg caagcctttg gagctccctc 5280 gacacgtatg gacacacgtg tcgatggatt ttattacgca cctgcccaca actcgagacg 5340 ggcatgacac tatcgtggta ttcgtcgacc ggctaaccaa aatgtgtcgt gtagccccca 5400 cgactgagac aattacagct gaagactttg cacagctgtt ccttgaaact gtttggcgaa 5460 gccacggttt tccagaggag gtcgtatccg atcgcggctc cgttttcgtg aacaagttca 5520 tggccgagct gtatcgcctc acaggcacca agcaaaacgt gtctacagct tatcatccac 5580 agacggatgg ccaaacggag cgcatgaacc gcgtgctgca agaaattctg cgcaactacg 5640 taaatcctac tcacgacgac tgggataaga agctaccact cgtcgaattc gcaatcaaca 5700 acacatacca ggaatccatc aaggcaacgc ctttccagct gaactactgt cgcaaccctc 5760 gcctgcctac tctcggctct gcagacagca aagtccctgc tgctgctcgc ttctgctccg 5820 atatcgagga aagcatccag cgtgctaagc gctgcatctc cgctgctcag gagcgctaca 5880 agcactacgc cgatcgcaac atgcgcgagg tgagctacaa agtaggcgac agagtgctac 5940 tgaacactca gcacatcaac cttaagcatc ccggcacgaa gaagtttctg ccgctctgga 6000 ttggacccta tacggtcact caaacaattg gacctgtcgc agtcaaactc gacctcccag 6060 cgaactaccg cattcaccca gttttccatg tttcgcgcct caaacctcac aaacaggcac 6120 ccggctccgt ctggacgcct ccgccccctg tggccgtgct tgacgatggc gcatactggt 6180 ccgttgaccg cctcctggcc caccgcgaca agaagcgcgg cggtaagacg gtcaaggagt 6240 acctcgtctc ctgggaaggt tacggtcctg agcacaacac ttgggaaccg gaagcacaca 6300 tcactgagtc tgcgattgat gagtactggg cttcccaggc cacacgcagg tccaaacgca 6360 aggttgcagc cgacacggcc ccggccccgg ctgcaggggt tacaccaact gcgccaccac 6420 gcaagcgtgg gcgtactgct tagggacgtc ctgccacggc agaccccaac actccgaagg 6480 ccccctgacg caagctcgca gggactctcg ctctaggtgg ggaggac 6527 // ID hAT-N9_CR repbase; DNA; CHL; 694 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE hAT-N9_CR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; 8-bp TSD; KW hAT superfamily; hAT-N9_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-694 RA Kapitonov V.V. and Jurka J.; RT "hAT-N9_CR, a family of nonautonomous hAT transposons from the RT green algae genome."; RL Repbase Reports 4(5), 146-146 (2004). XX DR [1] (Consensus) XX CC More than 30 copies of hAT-N9_CR are present in the C. CC reinhardtii; they are ~99% identical to the consensus CC sequence. The hAT-N9_CR family is characterized by 17-bp TIRs CC and 8-bp TSDs. XX SQ Sequence 694 BP; 179 A; 149 C; 209 G; 157 T; 0 other; cagggttgca cagtactcag tacgtgttag tacgtactag tacgtactga gtactagtac 60 gcctctcctg ggtctcagta cggcgcaaaa tacgccaggg gggcgggggg cgggggtgtg 120 ggggtcgccg gcgggtgccg gtggccggtg gtcgtgccgt gtcaatgcgt cggtggcaca 180 gtgcagtggc gagaggcaac atagacgcaa cataggcaac gtcttaagca taatatgatt 240 ctgtgtgtgt agttgctcac aaccaaaagc ctgatatcag tgacacgatg gccgatgcga 300 tgcgcgaacc gagcctgccc tattggatac gcgcaatgga gctctggatg gataaacttg 360 atggccaaac agtaggcaaa acaggctgcg cagttgtcac tccgctcatt tctcgacatt 420 taataacaag tatgcgtttc caacagcata gcgagctgta acggcctcga gtatggtcat 480 gtatagtagg atacagggaa ggatactaag gaggaacacg gagagcgaaa tttgcaatgg 540 gaccgggcat aggctgcatg ggacgtacta tagcacgtac taacccttgc tcgccggcgt 600 attataagtg aattatggtg tattgtacta gaaagctgtg tactgcgtat taatacgcgt 660 attataagaa aatggctagt acagtgcaac cctg 694 // ID hAT-N7_CR repbase; DNA; CHL; 1125 BP. XX AC . XX DT 13-MAY-2004 (Rel. 9.04, Created) DT 13-MAY-2004 (Rel. 9.04, Last updated, Version 1) XX DE hAT-N7_CR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; 8-bp TSD; KW hAT superfamily; hAT-N7_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1125 RA Kapitonov V.V. and Jurka J.; RT "hAT-N7_CR, a family of nonautonomous hAT transposons from the RT green algae genome."; RL Repbase Reports 4(4), 125-125 (2004). XX DR [1] (Consensus) XX CC About 20 copies of hAT-N7_CR are present in the C. reinhardtii CC genome. They are ~93% identical to the consensus sequence. CC hAT-N7_CR is characterized by 15-bp TIRs and 8-bp TSDs. XX SQ Sequence 1125 BP; 234 A; 289 C; 361 G; 240 T; 1 other; cagtgctcgt atcttccttc ttttcgacag aaatttgccc aaaaatgcct ccggcggaag 60 aaatttttga ttcttgtggc caaaaatctg tggaaacgct tcaggtttgg aggggtcttt 120 ctgagattcg cagaagacat tttagggcgt ttctgtcact tgtccgggcc caaggcaagg 180 gtgactgagg ttccccagga aactctatct acggccgtgt cctgctggca ccgtgggggc 240 acaccaagat agggcagcct tctactaggg gagtcattac gtgggtgggt gagcgtgatg 300 agcgtgggag agggcgctgg ggtttggcgt ggggatgggc cacgttttgc catggagcca 360 tagagccggt gctttttcag caacgtagtg ttaacgtggc aacagcaacc tcaccccgca 420 cgctgtaaca ggggcacatc ccgtgcggcg gaatattttg ctgtgccgga gccgaaacgt 480 gggctggggg caacagcaac ctcaccccgc acgctgtaac agcggcacat gccgtgcggc 540 ggaatatttt gctgtgccgg agccggaacg tgggctgggg gcaacagcaa cctcaccccg 600 cacgctgtaa cagtggcaca tcccgtgcgg cggaatattt tgctgtgccg gagccggaac 660 gtgggctggg ggcaacagca acctcacccc gcacgctgta acatgggaca tcccgtgcgg 720 cggaatattt tgctgtgccg gagccggaac gtgggctggg ggcaacagca acctcacccc 780 gcatgctgta acagaggcac atsccgtgcg gcggaatatt ttgctgtgcc ggagccggaa 840 cgtgggctgg gggcaacagc aaccttaccc cgcacgctgt agtttgtgtg gtgtggagtt 900 tctgttttga gatggatgca cggcacgaat gggagattga gagattgctg acacggcctg 960 ccacacccct ggcttgacac gaaccgtgca taggtcgggg gcggagcccc cggaaaattt 1020 ttgggatgaa agatgcggcg gcagcgggtt tctgaaaatt aaagatgccc tatttcggaa 1080 atctgtcact ggcgccgcac ctgccttctt aagatacgag cactg 1125 // ID Gypsy15-I_CR repbase; DNA; CHL; 6115 BP. XX AC scaffold_7; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy15-CR; KW Gypsy15-I_CR; Gypsy15-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-6115 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 95-95 (2008). XX DR Genome; scaffold_7; Positions 167178 173292. XX CC Positions [4981-5463] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 341..1312 FT /product="Gypsy15-I_CR_2p" FT /translation="MDPTLGDAPSAAAGAAGLIPAGDAAAGSSGAAGTAGM FT DVDLPSLTTEQQGAAVSAAAAAVVAGAPVIAGVPAAADLPPTVTIASTALV FT SLLTQVGQQVAQQVAATAAPAALPAPAEPAAPPAPVASAAPAGYAAAAAAP FT AAAAQPAAPVPAQVGLGGHSAQMGGSSRNPQTLPTFSPTSDVDDFWFGFER FT YMRYERQLTPYDALLSACHSFADAFKFLRHNVLDGERLTLSELKDVFLRRF FT DFRVTTKAEDARDDLQKGRVKMDLAKGVEEYAARFLERVTVAGFTPGLDLK FT HDFMLMPFFRDGLIDKLQSKCVLDGDRSPLLV" FT CDS 2059..6078 FT /product="Gypsy15-I_CR_1p" FT /translation="MCKSRMIGFAYARASAAHKSLSGHTILLSPSVRKLQL FT CIARYRQYKAEDPSISAVMLMPALSDRHRSLLRGWNLKHTFAPGTQLFTPN FT CGGSCPSPVTTEPWQVWYDPPKAEPKEPSCAVTEVPEGPAQLDLFAGRLSH FT APVVVLADTGATHCFIAKALADRLGLSLRPSSLAPVRLADASKCDAVGECV FT VPLRLGTFSTEIRVHVLPSLTSSADLVLGQTFLKRFCVQMTPHLGTLHLLR FT PSGRTAVIRALNAPDPGDPEPRELPPLEEGEPACQLLEQPPVSAKVVSAKR FT AIRAASMEGAKAVLTFVRALSPSATHAPAPELCTAEAPLQSMLNSAAPPAP FT DPPLNLPPEVEQNPTLKTVLTEFRDVFAPISGLPPDRGVHHTIPLEPGSAP FT THRPQFRLNPIELAEVDTQVKDLLAKGLIRPSTSPYGAPILFVAKKTGELR FT MCIDYRALNRITVKNRYPLPRIDDTLDRLAGAKYFTALDLASGYHQLLIKP FT EDIPKTAFTTPLGHFEWLVLPFGLTNAPATFQATMNRIFAPFLNRFVTVYL FT DDILIFSRTAEEHEQHLRQVLACLRKWKLHAKLSKCEFWRSEVKYLGHIVS FT SDGIRMDPKKVEQIRQWPLPRDLSELRSFVGLANYFRRFIAGFASLAAPLT FT NMFSLARLPDTWPPQALHAFEQIKERLSTDVLLRYPDFSKPFVVMSDASLN FT GTGAVLLQEDRPVAFTSKKFSPAERNYSTGEQELLGVVNALREWRCYLQSA FT LPFTLYTDHHPLIYLKTQAHLSRRQARWVELLSEFHFDWVYRPGKLNVVAD FT ALSRHPSLSFLHAMQTRRQAAAAASSPPVPSATPSAPRARATHAPAPVSSG FT PVATFTTRIAAAVRADTWFQDAANVRGLATSPDGLFLREEEGSWRILVPND FT PDLKRDIIASCHSERLAGHQGRDRTTELVRRTFTWPTLTRDVADFVASCDA FT CQRSKPASGKPAGKLQPLPIPELPWESVSMDFITGLPRTQRHNTAICVFVD FT RLTKMVHIAPCSHEINGQQTADLFFDSVVRLHGLPREIVSDRGTVFTGAFT FT TALTKRLGIKQSLSTAFHPQTDGQTERINRILNDMMRNFCDGNHANWDRYL FT SAAEFAINNAQNRSTGKSPFFLTYGVHPKTPLTLDVGNSVPAAKHYTDNLM FT ERLNAAKRCLQAAQDRARQDADRHRRAVTFTEGQMVLLSTKNFREYQGKHK FT KLMPRWVGPFPIEKMINPVAARLTLPPGEEKHNVFHVSLLRPYRQPADATP FT VRVLPLPVTSNGTPTLRAERILDHETRKLRNREIHRYYVKFVGRDMENNQW FT LDESDFPDRTLIDAYWASRQVASSVPST" XX SQ Sequence 6115 BP; 1080 A; 1884 C; 1665 G; 1486 T; 0 other; tttggtgccc taagagtgcg ccgaaccgct ctcttagcag ttctttctgt ttatcgcgta 60 ccctggattg gccatcatac ctacggccgt ttctctgcgc cgccttgctc cgccggtcga 120 ctgtcgacca atacgggacc tcagcgccgc acatgtctca ggaagaggtc gtgcgcgtta 180 ccaagttgct ggagggtatc tctttcagca tctcgattag cacgacgact actacgggta 240 cgcgccgtta acgcgccgcc atcgcgtttg tgaatttcgc acttgtgcgc ctcgcgacct 300 gtgcctgttt cttttgagcg ctagctcctg atcacctctt atggatccta cgttaggtga 360 tgccccttct gccgcggctg gcgccgccgg cctgattccc gctggcgatg ccgcggctgg 420 ctcctccggt gctgctggca ccgctggaat ggacgtcgat ctgccgtccc tgactaccga 480 gcagcagggt gctgcagtct cggctgctgc tgctgctgtc gtcgccggag ctcccgtcat 540 cgctggagtc cccgcagccg ccgacctgcc gcccaccgtg accatcgcgt ccaccgctct 600 ggtctcgctg cttactcagg ttggccagca ggtcgcccag caggtcgcag cgaccgctgc 660 cccggccgcg ctgcccgcgc ctgctgaacc tgctgcgcct cccgcgcctg ttgcgtctgc 720 tgcgcctgcc ggctacgccg ctgccgctgc agcgcccgcc gccgctgcgc agcccgctgc 780 gcctgttcct gcccaggtgg gcctgggagg tcatagtgct cagatggggg gaagctctcg 840 caaccctcag acgttgccta cgtttagccc gacctctgat gttgatgatt tctggttcgg 900 ttttgagcgc tacatgcgtt acgaacgcca gctgacaccc tacgatgccc ttctttccgc 960 atgtcactcc ttcgctgatg ccttcaaatt cctgcgccac aacgttctcg acggtgagcg 1020 cctgactctg tcggagctga aagatgtgtt cctgcgtcgc tttgactttc gggtcacgac 1080 aaaggctgag gacgctcggg atgacctgca gaagggccgg gtgaagatgg atctagccaa 1140 gggtgtcgag gaatacgccg cgcgtttcct tgagcgtgtg accgttgcgg gattcacgcc 1200 tgggctagac ctgaagcatg acttcatgct gatgccattc ttccgtgatg gcctgatcga 1260 caagctgcag tcgaagtgtg tgctcgatgg ggaccgaagc cctttactcg tctgatggac 1320 ctgatctctt gggcttctag ccaggagcgt gtgttgcgcg ccgagaaggt gcttgccccg 1380 ccgtcggttc ctaccgtcgc ttacgccgcg caaacccctc gccccaaaat ccgcacgcag 1440 actgctactc gcgcccgtca acagcgcgct gccacccgcg cgtctgcgcc tgctgctgcg 1500 accgcccgcg cggacccaca accctctggc cgtgatggct ggcagcctgc taatcgccgc 1560 ggctccaagc gcgccgccga tggccctgcc ctgaaccccc tgcagctgac gcgcatccgt 1620 cacagctcgg gccggctcct ctccatggtg gagaaggaga tgtgcgacac ccgcggtctg 1680 tgttacttct gctttggcca gaaccaccgc tcgtctgagt gccctctggc tcagcgcgat 1740 gggaacggtg gtggtggtgg tggcggtggt ggtgccgctg gtggtggtgg tggtggtggt 1800 gttcgccgcc gcgagaacgg caacggtcct cgccgggctc aggcccaggt ggccgaggcc 1860 gctgacgccc gcgattaggg tgttgcccag cgctgtgatt gtggccagcg cgttgggaca 1920 tgttgctctg tgaagtttgc aagcttgaac gttacgcacg cgcgtgactt gtgctttgtg 1980 gatgacgccg ctctctcttt ggtggaagag aagattggtc ggcctgtcac ttcggatgct 2040 tgctccgctg cggtgcgcat gtgtaaatcc cggatgattg gctttgcgta tgcccgtgcc 2100 tccgccgctc acaagagcct gtcaggccac acaatcttgc tcagtccgtc tgttcgcaag 2160 ctgcaacttt gtattgcacg ctatcggcag tataaagcag aagaccccag catatcagca 2220 gtaatgctca tgcctgccct ttcggatcgt catcgttcct tgcttcgtgg ctggaacctg 2280 aaacacacgt ttgcgccagg aactcagttg ttcaccccta actgcggtgg ctcttgccct 2340 tcgcctgtta cgacagaacc ctggcaagtg tggtacgatc ctccgaaagc tgaacccaag 2400 gagccctcgt gtgcagtgac agaagtcccg gaaggacccg cacaactaga tttgtttgct 2460 gggcggctgt cgcatgcccc tgttgttgtg cttgcagata ctggagctac gcactgcttc 2520 atcgccaagg ccctagcgga ccgcctaggt ttgtcactgc gcccttcctc tcttgctcct 2580 gtgcggttgg cagacgcatc aaaatgcgat gccgtgggtg aatgcgttgt gcctttgcgt 2640 ctaggaactt tctctactga gattcgtgtt catgtcctgc cctctctcac atcgtctgct 2700 gaccttgtgc tgggccagac gttcctcaag cgcttttgcg ttcagatgac acctcatctc 2760 ggcactcttc atctgcttcg cccctctggt cgtacggctg tgattcgtgc tttgaacgca 2820 cctgatcccg gcgaccctga gccacgggaa ttgccgcccc tagaggaggg tgagcctgca 2880 tgccagctac ttgagcagcc acctgtttcc gccaaggtag tgtctgccaa gcgtgccatc 2940 cgtgctgcct cgatggaagg cgctaaagct gttttgactt ttgtgcgcgc cctgtccccg 3000 tccgcgacac atgcgccggc acctgagctt tgcacggctg aggcacctct tcaatccatg 3060 ctcaactctg ctgcgcctcc tgcgcctgat cctcctttga accttcctcc ggaagtggag 3120 cagaacccca cgttgaaaac cgtgctgact gaattccgtg atgtgtttgc acccatctct 3180 ggcttgccgc cggaccgtgg tgttcatcac acgatacccc tcgaacctgg ctctgcccct 3240 acacaccggc ctcagttccg gctgaacccc attgaattgg ctgaagttga tacccaggtt 3300 aaagacctgc tggctaaagg cttgattcgt ccgtcgacgt cgccctacgg cgcgccgatt 3360 ctcttcgttg ccaagaaaac tggggagctg cgcatgtgca ttgattaccg agccctgaac 3420 cgcattacag tcaagaatcg gtatccattg ccgcgtattg acgatacgct ggaccgactt 3480 gctggcgcga agtactttac ggctctcgat cttgcctctg gttatcacca gctgttgatt 3540 aaacccgaag acatacctaa aacggccttt acaacgcctt tgggtcactt tgagtggttg 3600 gttctgccat tcgggctcac caacgctcct gctacgtttc aagccactat gaaccgcatc 3660 tttgcgccgt tcctgaaccg gtttgtgact gtgtatttgg atgacatcct cattttcagt 3720 cgcaccgctg aagaacatga acaacacctc cgccaagtgt tggcctgcct ccgtaaatgg 3780 aaactacacg ccaaacttag caaatgcgag ttctggcgca gtgaggtgaa atacttgggg 3840 catattgtca gttcggatgg aatccgcatg gatccaaaga aagttgagca aatccgtcag 3900 tggccgctgc cgcgtgacct gtctgagttg cggtcatttg ttggtttagc caactatttc 3960 cgccggttta ttgctggttt tgcatccctg gccgcaccgt taaccaacat gttctcactc 4020 gcgcgcctac cggacacttg gcctccgcag gccctgcacg cgttcgaaca aatcaaagaa 4080 cgcctgtcaa ccgatgtgct tttgcgttat cctgatttct cgaagccctt tgtcgtcatg 4140 tccgacgctt cgctgaacgg aacgggtgct gtcttgttgc aagaggaccg accggtcgct 4200 tttaccagca agaagttctc ccctgctgaa cgcaactaca gcactggtga acaagaactg 4260 ttgggtgtgg taaacgccct gcgtgaatgg cgttgttacc tgcagagcgc gctacctttt 4320 actttgtaca cggaccatca tccgttgatc tacctcaaga cccaagcaca cctttcaaga 4380 cgtcaagccc gctgggtgga gctactgtct gagttccatt tcgactgggt ctaccgccca 4440 ggcaagctca acgtagttgc tgatgccctg tctcgacacc cgtcgctgtc cttcctgcat 4500 gctatgcaaa cccgccgtca agctgccgcc gcggcttcct cgccgcctgt tccctctgcg 4560 acgccctctg cgcctcgtgc ccgtgctacg cacgcgcctg cgcctgtttc ctctgggcct 4620 gtcgccacct tcacaacgcg cattgctgcg gctgtgcggg ctgatacctg gtttcaggat 4680 gcggctaacg tgcgtggtct ggctacgagc cctgatggtc tctttttgcg ggaggaggag 4740 gggagttggc gcatccttgt gcctaatgat cctgatctga aacgtgacat cattgcgagt 4800 tgccattctg aacgcctcgc cggacaccag ggtcgtgatc gcactactga gctggttcgc 4860 cgtacgttta cgtggcctac cctgacccgc gatgtggctg acttcgtagc gtcttgtgac 4920 gcctgccaac gttccaagcc tgcttctggc aagcctgctg gcaagcttca accgctgcct 4980 atcccggaac taccttggga gagtgtgtct atggacttca ttacgggttt acctcgtacg 5040 cagcgtcaca ataccgctat ctgcgtgttt gtggaccgcc tgactaagat ggtgcacatc 5100 gcgccctgtt cccacgaaat caatggacag cagactgctg acctattctt tgactctgtg 5160 gttcgcctgc atggcctgcc tcgtgagatt gtgtctgacc gcggcacagt attcaccgga 5220 gctttcacta ccgcgctgac taagcgcctg ggaattaagc agtctttgtc taccgctttc 5280 cacccccaga ctgatgggca gactgaacgg attaatcgta tcttaaatga tatgatgcgg 5340 aacttttgtg atggaaatca cgcgaattgg gaccggtact tgtctgccgc tgagtttgcg 5400 attaacaatg cgcagaaccg ttccaccggc aagtcccctt tcttcctgac ttacggcgtc 5460 caccctaaga caccgctgac actggacgtt ggaaattccg tccctgccgc taagcattac 5520 acagataacc ttatggaacg tctgaacgct gctaagcgct gtttgcaagc cgcgcaagac 5580 cgcgcccgcc aagatgcaga ccgacaccgc cgtgccgtga cttttacgga aggccagatg 5640 gtgttgctga gcacgaagaa ctttcgtgag tatcaaggca agcacaagaa gcttatgccc 5700 cgatgggttg gaccctttcc aattgagaag atgattaacc ctgttgcggc tcgcttgacg 5760 ctaccgcctg gtgaagaaaa gcacaacgtg tttcacgtct ctttgctgcg accataccgc 5820 cagcctgctg acgccacgcc tgttcgtgtg ctacccttgc ctgttacgtc caatggtact 5880 cccactctgc gtgctgagcg cattctggac catgagacac gcaagctgcg gaaccgtgag 5940 attcaccgtt actacgtgaa attcgtagga cgagacatgg agaacaacca atggttggat 6000 gaatctgatt ttcctgatcg caccttgatt gatgcctatt gggcttcacg acaggttgcc 6060 agttctgtac cttctacttg agctgcggga cgctagctct tggaggggga gcagt 6115 // ID MarinerL-1_CR repbase; DNA; CHL; 15988 BP. XX AC . XX DT 08-JUN-2004 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE MarinerL-1_CR is a family of nonautonomous Mariner DNA DE transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW mariner; Interspersed repeat; MarinerL-1_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1..49 15889..15988 RA Wang S.C., Schnell R.A. and Lefebvre P.A.; RT "Isolation and characterization of a new transposable element in RT Chlamydomonas reinhardtii."; RL Plant Mol Biol 38(5), 681-687 (1998). XX RN [2] RP 1-15988 RA Kapitonov V.V. and Jurka J.; RT "MarinerL-1_CR, a family of nonautonomous Mariner transposons in RT the Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(5), 258-258 (2006). XX DR [2] (Consensus) XX CC MarinerL-1_CR likely belongs to a family of Mariner nonautonomous CC transposons. The consensus sequences is still incomplete because CC of assembly problems. This transposon is active, it characterized CC by 2-bp target site duplications. Like other transposable CC elements in the chlamy genome, MarinerL-1_CR is GC-rich; as a CC result, it contains long ORFs that most likely do not code for CC any real proteins. This element is probably still transposable CC (over 99% identity between different copies). The 49-bp 5' and CC 100-bp 3' termini of MarinerL-1_CR are 84% and 96% identical to CC the left end of Tcr3 [1]. XX SQ Sequence 15988 BP; 3018 A; 4837 C; 4547 G; 3262 T; 324 other; cactgtaacg acgggaaact cgcgtctcgg cgcgtctcag cgcgtccgag cgcgtcagga 60 tgcgtcaggg ggcgtctggc cgcgtatcgt gacgcgagtg ctgagacgcg aacagtgacg 120 cgaactactt tgaaggggac ccataccgtg gtgcgcacgg tgttttacac tagcctatac 180 tgctattgcg acaaatgtag ccttgcccgg cagtatgtgg gctgtcatgg gcagattggc 240 ccctgcacct gcgcctgctg ccgcaagccc ctgcacccct ggccctgcac ccgggcgggc 300 tggccagccc cgcctctgcc ccgtcgcccc aaccaacctt taggcgcagt tcagccagcc 360 tgcgccagcc agccgatatg tgccgatact gcttacaaca agcctataag cctagataca 420 gtaagcaaag gcattggcac aggcgagcat cggtgacacc ggtgcaatcg gtcgcccacc 480 accaagtgtt tgccccttgc ctcgtttgac cgctattctc ttgaaaacat aaacatgact 540 gacgaactca tgaatcactc atgatacgtc tgccaacggg ccttgttccg ttggtggcgg 600 cctggggcct tcagagcgcg cttcacagcg ggcttgtatg gctccaggcc gaccaaccca 660 cctcagcgct tactgcgcac gtagtgctgt caatgttaac attagtgtta gtgctggcgg 720 ctctggcaac ggttcacctg agctttgctc ctcgcgctat cagccaaagt tcacaaacgg 780 aagagctaca aatcttcatt aggcttcctt cagggcaact cgtcacgttc gtgggacctg 840 tcgatgccac tgtgggctct gtcttcgact ttatcgcttc cacgaccctc ttccatctag 900 ataacagccg gatatggcgc ttggtcgcgc gcggcggcga cacgctgctg gagcgttcga 960 ccgtgctggc acgtcgtggc ctggcgtccg gttccgagct gcaagtgctc gcacggctaa 1020 tgggtggcac tccaaagaga aggccatcag caccctcagg tatagttgtt gggagtcaag 1080 aggcttggca cggcaaagga cttacaacca caccgtgtgc acgcagacaa ctggctcagc 1140 ctccaccgct tcacccgctt ccgaaaccac ttcggagacc ccgtctacac gtgcaccaac 1200 cctggatgcc gttcgccaac gcagttcaag gatagagcgg cttgctctcg ccacgaaaaa 1260 tgctgtgagg gtgcacctcg ggacctccct ctccctgccg gtaccggtgg cgaaggcggc 1320 ctaggagacg ctgacgttcc tgagggaggg agggattcgc ctgcgggttc tgttagtgag 1380 agcgacagca tggacactaa cggtagcggc accggcagcg gtagcggcac cggcagaggt 1440 agcggcaccg gcagcggtag cggcaccggc agcggtagcg gcaccggcag cggtagcggc 1500 accggcagcg gtagtggcac cggcagcggt agcggcaccg acggcggcgg cggcgggaat 1560 gttggcaatg gcagtgacag gagtggtggt ggctacagta atagcggcag cgcccgggtg 1620 gacagcgact acctatactc cgatgaggaa gaggaagatg ctgctgacgc tgaaccgggt 1680 aagtatcgcg caggttgcgg tcctcctttg cgcgccgcaa gcccgggtct gggtgctggg 1740 tcatgcctct gtcctggggt tctcgcttcc ggacatggct tatgttcaga tcgttgagga 1800 gcgacatcta actttgcatt accctgatgt cccgcccaac cctgcccccc cacccccaat 1860 ccacaggtca tttcctggcg gcagctctac accgcgttgt ggcttctctc gaccaggacg 1920 accctgccaa catcaacacc attgagctgc tgaatgctat tctgcagcaa ttcgggcacc 1980 tggcgcctgt tgacggcgac ggtgatgctg atgaccaagc cggtgcggtt gccaacgacc 2040 ttgcggcgca gggcatggtc atgcaaccca ccaacatctc cgaactacag catggtgcgc 2100 ggtctgtgca gcacttcaag ggcatcatgt cagagccgct gtggagaaca ggcactcaat 2160 gtaagtcggg atgtgggtcg ggtcgggtcg ggcggggttg ggacagcggg atttcgacag 2220 gctttgacaa gggagctctg gggtattgcg tggtgggcgc agcagcaggc gcgggcaaca 2280 ctggaatgga atgggtattc gaactgccac tgcaacgcca cggaattctg tgcatgtaac 2340 ggcgcacttg attgacgcaa tgcacctccc tttccgtggc tttcaccctc tccgctggca 2400 cactactgca attccggtac gacacctatc tgcgcagctg cacctatcac tgtggggcag 2460 ttatggctcc aggccgacca acccacctca gcgcttactg cgcacgtagt gctgtcaatg 2520 ttaacattag tgtagtgctg gcggctctgg caacggttca cctgagcttt gctcctcgcg 2580 ctatcagcca aagttcacaa acggaagagc tacaaatctt cattaggctt ccttcagggc 2640 aactcgtcac gttcgtggga cctgtcgatg ccactgtggg ctctgtcttc gactttatcg 2700 cttccacgac cctcttccat ctagataaca gccggatatg gcgcttggtc gcgcgcggcg 2760 gcgacacgct gctggagcgt tcgaccgtgc tggcacgtcg tggcctggcg tccggttccg 2820 agctgcaagt gctcgcacgg ctaatgggtg gcactccaaa gagaaggcca tcagcaccct 2880 caggtatagt tgttgggagt caagaggctt ggcacggcaa aggacttaca accacaccgt 2940 gtgcacgcag acaactggct cagcctccac cgcttcaccc gcttccaaaa ccacttcgga 3000 gaccccgtct acacgtgcac caaccctgga tgccgttcgc caacgcagtt caaggataga 3060 gcggcttgct ctcgccacga aaaatgctgt gagggtgcac ctcgggacct ccctctccct 3120 gccggtaccg gtggcgaagg cggcctagga gacgctgacg ttcctgaggg agggagggat 3180 tcgcctgcgg gttctgttag tgagagcgac agcatggaca ctagcggtag cggcaccgac 3240 agaggtagcg gcaccgacag cggtagcggc accggcagcg gtagcggcac cggcagcggt 3300 agtggcaccg gtagcggcac cgacggcggc agcggcggga atgttggcaa tggcagtgac 3360 aggagtggtg gtggctacag taatagcggc agcgcccggg ttgacagcga ctacctagac 3420 tccgatgagg aagaggaaga tgctgctgac gctgaaccgg gtaaggatcg cgcaggttgc 3480 ggtcctcctt tgcgcgccgc aagcccgggt ctgggtgctg ggtcatgcct ctgtcctggg 3540 ggtctcgctt ccggacatgg cttatgttca gatcgttgag aagcgacatc taactttgca 3600 ttaccctgac gtcccgccca accctgcccc cccaccccca atccacaggt catttcctgg 3660 cggcagctct acaccgcgtt gtggcttctc tcgaccagga cgaccctgcc aacatcaaca 3720 ccattgagct gctgaatgct attctgcagc aattcgggca cctggcgcct gttgacggcg 3780 acggtattgc tgatgaccaa gccggtgcgg ttgccaacga ccttgcggcg cagggcatgg 3840 tcatgcaacc caccaacatc tccgaactac agcatggtgc gcggtctgtg cagcacttca 3900 agggcatcat gtcagagccg ctgtggagaa caggcactca atgtaagtcg ggatgtgggt 3960 cgggtcgggt cgggcggggt tgggacagcg ggatttcgac aggctttgac aagggagctc 4020 tggggtatgg cgtggtgggc gcagcagcag gcgcgggcaa cactggaatg gaatgggtat 4080 tcgaactgcc actgcaacgc cacggaattc tgtgtatgta acggcgcact tgattgacgc 4140 gatgcacctc cctttccgtg gctttcaccc tctccgctgg cacactactg caattccggt 4200 acgacaccta tctgcgcagc tgcacctatc actgtggggc agtttgctta cgcttggatg 4260 aaggagaaga ttgatgggcg cgtacgggat aatgttgcgg accgtcagct gcgtttcctt 4320 cgtgacgtgt gctttccacc tggcaataac ttgcccccgt cgctgtacat aatgaagaag 4380 atgttggaca ttccagacgc acgtgacttt gagaagcacg tttgtttgag cgacaagtgc 4440 ttgttccctg atttccctcg agatgagtgg caccagcacc tgcaagatga gtgcggctgc 4500 ggccatcgac ggtttaagcc tgcccggcca ggacaccaac cagttccgaa caaggtacgt 4560 tgcatgcttt acttgttgcc catgcggagc ctccccatag caataagtct ccccaggtct 4620 cccaatcctg catgcccttg ttccgtcaac ccttcttgac ttcagtgccc tttgccatga 4680 ccgcgtgcac gtgctgtctt cccacagagg ttctacgact tcggtgtcga gaatgtcgtg 4740 cgcggcatgt tcatggaccc acggttctgc acgtttcgcg gcacagggcg cgacaacaac 4800 cctgatgact tctacggctc gcggtacgcg caagacatca atgccaagac ttccggcgag 4860 ttcttcgagc ccgacagctc ggcttacgac ctgggctttg acttcggcga ggtcttctca 4920 ttcaagaagt acagctgcgg gatgctgttc attcggtgcg tgtgacggac ctcagcctgt 4980 tgcatagtat gggcaattgc tttcctgtgc acctacagtg tctccgtgtg tgcatttgat 5040 gtaatgggca tcgttcagta tgtgcctatg ctgtaatcat cacccaccag accttattgc 5100 tcccctcttt aactgctgtc ttcaggtgcg cggacctgcc agtgactcaa cgcagcaagc 5160 gccgccactg cttccccctg atcatcacgc ccggaccttc ggagcctgac tcaatggaag 5220 tctacttgcg cccaacactg acagcattca aggccttcgg gcctggcacc gagggtatga 5280 ccgtggtgga cgctgcttcg ggccggacct ttgggcacaa gatgtttctg ggcagtatct 5340 ttgctggtga gtgggctttt agtccgtgct ggggcgggag cgtttgggga cggtagtggt 5400 ggggcggggg caggagtgct caggcgtggg cgggaatgct agggtggtgt gggagacctg 5460 gggcaggggt gggggtgcag gggcgggggc ggggcgcggg gcaggtgtgc tggggtgggg 5520 tgggagtgct ggggtgaggg caggagtgct gtggcttagg tgggcatggt ggggcggggt 5580 tgggctggtg gtggtggttc tggggcgggg gtgggaggac tgggggcggg agtgcttggg 5640 tccatatgtt gacctctgct tggtctgcat gcagatacgc ctgctaaccg gacgttgagt 5700 ctttggctct cccatgcggc caagaacgca tgtgggcact gcatgctgct gggagagact 5760 ggccctaacg gccatggtac atactttctg gggtacaaca agccggcgga catcaacaca 5820 gcgctgaacg tcctgttgcc acctgactgt cagatgccag agactgcact atgcggtgac 5880 agcaggatcc ggatctctca tgcgcaccac atggatcggg cacaggttgt ggaaagtaat 5940 catgacttgg catcagcgct gggttgccat ggcatttcct gcatagcgaa ggagctgtcg 6000 tacatagaca tgaataacgt gtgggttgtg cccatagcgc atgcagctct gttcggtgtc 6060 atcaagaagt tctggcactt tgtgctgggc gagtccgtca agaaggctga gagtgacttt 6120 gtgttaagtg tagaggctcg tcgcagaatg actgcacgtg catcccattt catcattact 6180 aatgacttta accgacccta ccgtgacatt gtcccctgct cgtactggac catggaagaa 6240 tggctacact ggacggactg cttctcccta tacgtcctgc actcgcacgg aggggatcct 6300 aataaagtta agatgccggt agttacacgg aaagatgatg cagggctgga tcgtgagttt 6360 gacctgggga gaatgtggga gatgttgcgc acggccatgc tgcattatct gcggtatgac 6420 caattctcgg aggctgcgtg tgacagtgca gcagtaacct tcaaacagtt ctgctgtgac 6480 tcggagctcg tgtttggcat gaagtttgct acctacaaca tgcatctcct ggcatgtagg 6540 ctacgcgagc aggagcgtgc acgtgggcac gtggcgtttg caactgagtt ctgggtcgag 6600 cgcggtgtgc agcaggtcaa aagtggggtc aagttccgca caaccaggtg tccagagcaa 6660 attattgtga gtgggttcct tgcaagtgcc cggctcgctg agatgcgagc tgagcgtggc 6720 aggggaggtg gcaccgcgga tggtggtgca acggagccag ccggggtaga ggccggccat 6780 gcactggctg cgggtgggag ccagtacaag tcatttgatg agtgggtgcc aaattaccgt 6840 gcagcacacg tcacattgac gggggatgtt tgcgcggatg tagaggatgg tgagggttcc 6900 cagctgcttg ggcgcggata tgtggcgggt ccgaccgagc ggacagttgt cttcgaagca 6960 ttcatgaagg cttttagtaa agaggacttc gagacacgtt caggtgtggt ggagggtgac 7020 ttgcagaacg caagtgttat tatattcaaa cgtgcacttg tacgacgtga ccaggtttta 7080 ctatctacaa agtataactt gaccactgca cgtgagtcat tttatgtgaa gactaagtat 7140 gcgccatccc ttgacacccc agaggacgat caggacgtgt ctgtgtatgt tggagaggtt 7200 gcattctatg ctcgtgtgca aaccgtgcgt ggggttgtat gccgtgtggc aatttgcgaa 7260 ttgcataagg tggaggtggt acggccgcag tcgttgtact atgctgcaga ctttagtcgc 7320 cgcaaggcaa cccccaacca agaacgccag cagacgttcc tgaactcgtt ccccgtgcta 7380 ttgcaagcca ttgactcgaa ggtgatgcgc tgcaagctcg acgagaataa aagttacttc 7440 attacgtacc acttgttttc tcagttgcca accaatgatg gtgattggca aggagcagtg 7500 caagttgggt agccgctcac gcggctgatg cagtctgtaa catgatgatg tcttctgggc 7560 ttgtgtcttc cggnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7620 nnnnnnnnnn nnnnnnnnnn nccgtgcatc ccatttcatc attactaatg actttaaccg 7680 accctaccgt gacattgtcc cctgctcgta ctggaccatg gaagaatggc tacactggac 7740 ggactgcttc tccctatacg tcctgcactc gcacggaggg gatcctaata aagttgtgat 7800 gccggtagtt acacggaaag atgatgcagg gctggatcgt gagtttgacc tggggagaat 7860 gtgggagatg ttgcgcacgg ccatgctgca ttatctgcgg tatgaccaat tctcggaggc 7920 tgcgtgtgac agtgcagcag taaccttcaa acagttctgc tgtgactcgg agctcgtgtt 7980 tggcatgaag tttgctacct acaacatgca tctactggca tgtaggctac gcgagcagga 8040 gcgtgcacgt gggcacgtgg cgtttgcaac tgagttctgg gtcgagcgcg gtgtgcagca 8100 ggtcaaaagt ggggtcaagt tccgcacaac caggtgtcca gagcaaatta ttgtgagtgg 8160 gttccttgca agtgcccggc tcgctgagat gcgagctgag cgtggcaggg gaggtggcac 8220 cgcggatggt ggtgcaacgg ggccagccgg ggtagaggcc ggccatgcac tggctgcggg 8280 tgggagccag tacaagtcat ttgatgagtg ggtgccaaat taccgtgcag cacacgtcac 8340 attgacgggg atgtttgcgc ggatgtagag gatggtgagg gttcccagct gcttgggcgc 8400 ggatatgtgg cgggtccnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8460 nnnnnnnaga acgccagcag acgttcctga actcgttccc cggtgctatt gcaagccatt 8520 gactcgaagg tgatgcgctg caagctcgac gagaataaaa gttacttcat tacgtaccac 8580 ttgttttctc agttgccaac caatgatggt gattggcaag gagcagtgca agttgggtag 8640 ccgctcacgc ggctgatgca gtctgtaaca tgatgatgtc ttctgggctt gtgtcttccg 8700 gggccgccaa ccgttccact aaagaaaact tcccggctgc cgacagcgca cgcagccctg 8760 caccggtgct gtatgacgat gcagccccag cccccttccc gccgttccac tcaagaaaac 8820 ttcccggctg ccgacagcgc acgcagccct gcaccggtgc tgtatgacga tgcagcccca 8880 gcccccttcc cgcctttcca ctaaagaaat cttcccggct gccgacagcg cacgcagccc 8940 tgcaccggtg ccgtatgacg atgcagcccc agcccccttc ccgccttcca gtcaagaaaa 9000 cttcccggct gccgacagcg cacgcagccc tgcaccggtg ccgtatgacg atgcagcccc 9060 agcccccttc ccgcctttcc actaaagaaa tcttcccggc tgccgacagc gcacgcagcc 9120 ctggaccggt gccgtatgac gatgcagccc cagccccctt cccgcctttc cactaaagaa 9180 atcttcccgg ctgccgacag cgcacgcagc cctgcaccgg tgccgtatga cgatgcagcc 9240 ccagccccct tcccgccgtt ccactaaaga aatcttcccg gctgccgaca gcgcacgcag 9300 ccctgcaccg gtgccgtatg acgatgcagc ctcagccccc ttcccgcctt tccactaaag 9360 aaatcttccc ggctgccgac agcgcacgca gccctgcacc ggtgccgtat gacgatgcag 9420 cctcagcccc cttcccgcct ttccactaaa gaaaacttcc cggctgccga cagcacacgc 9480 agccctgcac cggtgctgta tgacgatgca gcccctgccc ccttcccgcc tttccactaa 9540 agaaaacttc ccggctgccg acagcgcacg cagccctgca ccggtgccgt atgacgatgc 9600 agccccagcc cccttcccgc ctttccacta aagaaaactt cccggctgcc gacagcgcac 9660 gcagccctgc accggtgccg tcgtatgacg atgcagcccc agcccccttc ccgcctttcc 9720 actaaagaaa tcttcccggc tgccgacagc gcacgcagcc ctgcaccggt gctgtatgac 9780 gatgcagccc ctgccccctt cccgcctttc cactaaagaa atcttcccgg ctgccgacag 9840 cgcacgcagc cctgcaccgg tgctgtatga cgatgcagcc ccagccccct tcccgccttt 9900 ccagtaaaga aaacttcccg gctgccgaca gcgcacgcag ccctgcaccg gtgccgtatg 9960 acgctgcagc cccagccccc ttcccgccgt tccactaaag aaatcttccc ggctgccgac 10020 agcgcacgca gccctgcacc ggtgccgtat gacgatgcag ccccagcccc cttcccgcct 10080 ttccagtaaa gaaaacttcc cggctgccga cagcgcacgc agccctgcac cggtgccgta 10140 tgacgatgca gccccagccc ccttcccgcc tttccactaa agaaatcttc ccggctgccg 10200 acagcgcacg cagccctgca ccggtgccgt atgtcaacgc agccccagcc cccttcccgn 10260 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10320 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10380 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10440 nnnnnnnnnn nnnnnnnnnn tccgccgcca actccaaccg cccgcgcgcc tcggccctcc 10500 ccactttcac atcagactgg cgtgacacaa cgccagagcc caccgtcacg ccgcacggca 10560 cgcccgcctc atcgaattta gcgttaagtg gcacacgcaa gtcagcaccg tgcaccacct 10620 gccgcgcgtc gaacaccagc aggtcgctgt cctgggtgcg ggtgtaggcg ctgaccccac 10680 caaatgtgaa caggttgcgg ctgacatgcg ccgcacgctc cacatcgtcc aggaagtcct 10740 tgcactgcat ggcgatgagg ggtgagctgt catacagcag gccagggcca tacccataag 10800 cgcgccagcg agcccacgcc caaattgtca agcaccggct ttggtcaggg caatgctccc 10860 tggctggcat ttgcatcctt cctccaggcc acaaaaacat tccgccgggt cccttctcca 10920 ccccaccagc tcgtccaacc cttgacgcct ggcagctcac ccggcgccct gttgccgact 10980 tccacagagc caggaacacg acagttggcg ttgcgtcgtc cgggtccaga tgcgggtgtg 11040 tgaagtagtc cagcccgacc ggaaccaggc cccatggcgc ccctgtcgcc aggattgcag 11100 gcgtacccgc gacggtctcc ttcgcgctcg tcacgaggta gtcccacatc gccagcagca 11160 gcatggcgat gcacaggccg aacgcaccaa tgaccacccg cgcttcgtgg ctcctcgatg 11220 tgtgcgcata cctgtcaagg aatcaggccg tgcaccttgc atctgttgcc aagccatccc 11280 gaacgccgcc caaagccctc gcctccttgg atttcccctt aggttccgct ccaccaccgc 11340 tatgcacgcc cctccctcgc tgccatcgct tgccagtccc ataccactca catggtcacg 11400 cggctcgcct tgttggtctg cacatgccgg tagcctgcca tgtgagtttc gccgccacac 11460 tctttcagcc gaccacgggg gaccttcttc agcgtctgcg cctccatcag gcgccgctgc 11520 atgttgcgcg ccgctcgcac caacaccggg tacttgctct gcatcaccgg ccccggcaca 11580 tacaggacca ggaggactgg gttgaacacg ccgtcagtcc ccgtcatgtc aatccagaac 11640 aggcatgccc tgcccgtcat gtcccagtac acggtcagcg ctagctccgc tgatgtgcgg 11700 ggtctccccg cagattgcgc ctccaggtcc aacgccgcac gagctttggc gcgcgcattg 11760 cacgcgtcga gcacgttcgg ccgcagttgc cgcacgttgc ctcgcagtag gtcgcggagg 11820 gcatcgggga ctcgcagggc gatgtacagc tcccttagca aggtgccatc ggttagctct 11880 ccagcagcaa cctggtgcgg acacacggtc acaaatggca ttcaagacat ccaacacaca 11940 attgcagtgc tcaggacccc tgcacacgcg gcctgttggg gggcctgggg taatggcggt 12000 cccacgccac agatacccaa agttcccccc tcaccccggg ctccggcagg tctgtcacaa 12060 catcagtgcc agagcgggca gcaccagcac cactgcttgc tgcttccaca tcagccattc 12120 cctcctttgg cgtcaccaca ttcaatggtg gtgcagccac cgcagtgccc tccaccgcgg 12180 ctgctgccgt agtcgatgaa gctggccgcc catcctccac gccaccggac tccccctcct 12240 ccaggcacgc gccctgtgca ggcattcgtg catgagggcg tacgcatgcc gaccaggccc 12300 gcctgccatc gaggccacat cacgtgcatg catgcatgtc cacattacgt gcaccctatg 12360 gccttgccgc cccatcccgg ctgcgcacct tcagctccag gatcgcggac agggacacga 12420 tctcgttgtc ctccaccatg cccatgatgc tctcctccac gtccggcgcc acatccccca 12480 cctcatcgcc cattgcctgg ggcctggcca tgggtggcag ctccagccgt accagcttat 12540 ccgcaatgcc ggcgaaaagc gtcaacttgt ccccccgttc cctggccstt cttagctccc 12600 tgcgagagcc ggcggattgt gtgggatttc agtcagcgcg cccagcccct cgcgccgagg 12660 cctgagcccc cacatccgtg ctccgggcag ccccttccaa ttcgggttag cccggtcggc 12720 cccggacgcc cccgctcccc cccgccatgc cccatctgag gcctccgccc aatccaggcc 12780 cgcctccggc gatgtgacgt accccggcca ccacagccca tcmatccaag ccctcccaac 12840 catggcctgk acctcccccc agcctcacct tacgagctcc tcccggccgt gacgcctcca 12900 cttctcgccc agcgccaaca ggaaccttgg ggggcacccg cggtgggcgg ccttggaagg 12960 atccctcctc caccgctgct cctcctcgac tcgtgccgcg aggcatgagg ctttaggcat 13020 gcccggcatg tccagccgca caacgccgcc ggccctcgca agtgccgccc tgcacatgca 13080 gcccgcatgt gattttgcgg aagtcccgcg agtgtccccc ttcccctcct caaatctcac 13140 cttactatta cggcctcatc agatgacagc gacagcgtcc ccttcttcaa aaccgtctca 13200 gttatgtcca aatccttgcg ggcctcgccc tgctccaggc caacctcttc cgagctagac 13260 ggctccattc ctattaggtg cttcttccaa atcaagaggc accgggcccg cgaacgccgt 13320 tcaacgcgac gttgccaccc ggagccctgg gtggcaaagt cggcgcgcct gtaactccgt 13380 ggsrgtcagt gctgctacta tgtactcttg cttatattgt ttatgcccca ccatttgtca 13440 ctcgcgtgcg aacgccggac tccgcgctgt tcatggaggc cgtccgtcgc ttctacccct 13500 tcgccaagcg cggccctggc gaaggccccc gcctgctcaa caccagactg gccgccaatg 13560 gcgagttgaa aaagtattgg caagacaaga tggccgcggc ctacgcctgg agcctgaagc 13620 gaggcatcac gttgtcgcgc gaagacttcc aggccgactg gacaaagtga gtttttgttc 13680 ttctgtattg gcatgggaag gacgctagaa tgcggaggaa tgtcagtagc atgagcgatg 13740 aggaataggg ggatgagagt ggacttgacc gcgttgactt gttgaactga ccgtagttaa 13800 ttctttatac ataatgttgc ttcatcctac tactctttca caacatatcc aggaaggcaa 13860 ccgacttcaa cgccaagcgc accacttacg agaagacgac cgcagccgcc cgcgctgtca 13920 aggacagtga gtctgctttc taattgtata tactgtataa gcaccttgca atgagtgcgt 13980 tgagagacat gcgcgagagg cagactttgc gtcagtttga cctggacctt ctgctccatc 14040 ctccatcctc caccgcgtgc ttgcagagaa ccagcccaag aatgagcgtg gctggtacaa 14100 gaacgcatgg cgcagccagt ggggcaagat gtgggacggc tacaagcgcg accctgtgcg 14160 cttcacggct ggcgagaagc agacgctgac ggaagaggag gtcggtcttt gcaaggagct 14220 tgagtctttg ctgccaagcc gggcttccgc gagcgtcaag gacgccaagg tgtcctacgg 14280 tgccatcgac gccgggggcc ttctggtggg ttactttctt acatagccag actgtcagtg 14340 cgtgcttcgc ttacgagcgc atccttactg ctgatacgac caggcgcggt tgttggtgaa 14400 tgtctgttgc gcttacacaa ctctgctgct tcttgcaggg accgctggcc aaggccttga 14460 gcggcctctc ttcaaagacc aagcgctcct ctctggggtc cacaggcgcc gtcgacgtcg 14520 ccgccgcgcc ccagcccagc ccgcccatgc ccccgcccat gccccagtcc agcccaggcg 14580 ccatcgacaa gctggccgcg gcgggtgccg agcgcgtgaa ggtaagacca ccgtgatcat 14640 gttctctgcg cgcgtcgtct aatatggctc gctgcgtaag tcgtttactt ctgcactgga 14700 gatggtccct tgcggcatgg gtgccaggca ccactgcgga cggaccgtcc cttcgtgcaa 14760 gttcgctttc acagcattgc ggcacacttg ctgtcatgca ggcgctgggc gacaagaagc 14820 ggacggatgc gaagaagcgc aaggatgaga ggacggccga ggcgccagct aaggtgaatg 14880 ggggatcgag cttccgtcgt ctggttcagg gtgtgtgtgg cagcattgtt ggtgctctcg 14940 ggtggggttc agcagaatgc attcttaatg catgtggttc ttggtctacg ccagtgtact 15000 cttgcgtgtt ttctttcacc aatttatgat ggcgaatcat tgttgcgcct acgcccccat 15060 cccatgcaga agcacaaggg cgatgaggtg gtagccccgg ctgccaaggc caagaccgat 15120 gccaagactg ccgctgaggg caaggctgcc aaggttccgg ccgcgaaggc tggcaaggcc 15180 gccaggggca aggccgccag gggcaagtag actgccgatg ctggtgctac atggattgta 15240 caatgttcgg actaggccta gtacggctta ttctggactt agctaccgag cattaggatg 15300 tttggaccct gtgtagacca cctagcggta gtatgtttgg accctgtgta gacatccact 15360 ccagcatgag cgagtggcat ttgaggactg ctgtgctgca tgtgcatgac agtgagactg 15420 tgatgggcga cagagcgcct ttgactgtga tgcacttact gtcagatcgt agttgcgtgg 15480 aaagcgagtg agatggtcgg ttcaggtaat cctgaaacat gtgtgcagcc agggtccgcc 15540 gcagatgtgg gatagaggca gtgtaacgtg ctctggaatg aagagtttca tgtgtttctc 15600 cgtggattgt acggttcccc cgtgtttctg ttttcgaatc gcgtctcaag acgcagtgcg 15660 tctcgtgcat cgcgtgtgtg tgagacgcgg cgcgtctagg gtatcaggct agcgtctggt 15720 gtgacgcatc gcgttagcta agacgcgctc tccatgcaat tgtaccgcgt cacgactcgc 15780 gttccattgc gtctcgcaga cgcagtgaaa cgcgctatct gcaatggtgc cccgcttcac 15840 tgtttctggt tgcgtcccca tcgcgtctct cccgcgtctc ccccgcgttt cccccttaac 15900 gcgcagcgtc tcggcgcgtc tgacaccgcg tctcacaggt cagacgcggc ccagacgcgc 15960 ccagacgcgg tttcccgttg ttatagtg 15988 // ID RandI-1 repbase; DNA; CHL; 10812 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 22-JUL-2010 (Rel. 15.08, Last updated, Version 2) XX DE RandI-1 is a family of non-LTR retrotransposons - a consensus DE sequence. XX KW RandI; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; RNaseH; RandI superfamily; KW RandI-1. XX NM RandI-1. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-10812 RA Kapitonov V.V. and Jurka J.; RT "RandI-1, a family of RandI non-LTR retrotransposons from the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(7), 196-196 (2004). XX RN [2] RP 1-10812 RA Kojima K.K. and Fujiwara H.; RT "An extraordinary retrotransposon family encoding dual RT endonucleases."; RL Genome Re 15(8), 1106-1117 (2005). XX DR [1] (Consensus) XX CC RandI-1 is a family of RandI non-LTR retrotransposons. CC Approximately 100 copies of RandI-1 are present in the genome. CC They are ~1% divergent from the consensus sequence and are CC likely mobile. The RanI-1 elements are usually flanked by CC 10-15-bp target site duplications. The 5'-terminal portion of CC the consensus sequence may be still incomplete. It encodes a CC 3571-aa RandI-1 protein (nucleotide pos. 2-10714) composed of CC the AP endonuclease (aa pos. 990-1250), reverse transcriptase CC (aa pos. 1600-1900), and RNaseH (aa pos. 3040-3200) domains. CC This consensus is >99% identical to DualenCr1 in [2]. XX FH Key Location/Qualifiers FT CDS 2..10714 FT /product="RandI-1p" FT /translation="RMHTNTPLTNLASSSLGNMWKMVARRKVERKSAGICV FT LICFALCMLTTNATTAGNASGSAAADRWFDRGPPRHSSAFALLGNHDRTMP FT IQKWSARGLTTCTESSAKGGCSHVCNSRATGAQLPALSHTDGPIATHGINH FT AHGSVWRENFSFAHSMFTVHSNPANATLLSDRCSQKGIFDTSILHTTCYTA FT SKRAGAEPFGMSPKRPSQSPTAAKHPSFEELISSKHIPVASEPLANDVPER FT GRAEPPVFTANSQLYENASLLTEPPPTSSERRQYRSPSGLLSRAKVKPKPG FT CTNIRNVMITALQPVGEGLLSTNLKIAQACRSGVTKLCTAEGAYRSYATKT FT IGLLGESLLTSVLALGDLGSRHGGTMLRSGASALHSMAETAYGPQPTLNCS FT APWENSPLLRLHLTLVVHFHWAGAGHVRAAMHSHTIAIRVGAHLATRSRLA FT IYQTAAQTGKALHTILDKAIVSLTIITATGAMYMLWTHLLHAETRTASVNS FT EICANGAILSLLLWGPTLTTLALSPLCVIALALGITAIPAALGLLFALTVT FT SILAVLCLPALLKHVVQVTRQIASATWARRKLLLWGTAMAIILRNLYLTLP FT DPCSVQPAQPALWGPALVTGLLTASQATTLLTLTYAVNRTHTMPYVKPTLR FT GALPTDPTAPPAPADPRTHAAVNIRPPAALHWPPNPNTTYRETQRQHFCSV FT HALNNSLGLAWLDPLDVLSYAKRVHAHLTATQDPNALFWKECYCPNSGAFS FT EFLLNHYLYHNATISNIFAYPNRKLIMRRTHFPRLNGDISKEKVLESLPVA FT ARTRGFTVHQYTVRHTIAVRYEAGQWRVIDSVNSPIHNTVLHDNTWNTLDG FT EVWCLDAVRASDTATLQDALGQLREIVPPPRPPPHMEATRPVQTGPAATGR FT PAPEPAQHAQDNSPLRAAHRAPQHAAPHTTAPAQPHNANAGIAPRLNTTGT FT RQTILNWLVRPTARNHAHMEPNQAHTNTRPPSSNTTLHIVTHNVRGLLSEI FT LSTGQPGNLSFTLCNLAQWKADIVVLTETKLTGKTDSIKRAFRNEGYRLYC FT STTPRVATARGSAGVAVAISARYSDLGCVTHHTPPPTLHGYVSHVQIKTPG FT STPLTVLGIYAPEDMQTRKAIYTYCQQVVGRADACGHHLVTAGDFNAVARA FT HERDSPIDTADRAHQRFLADSGLRPIRGDTTTTAEWSYEQTRPGMAPYHSR FT IDDILLCPATRAACTEAREYTSTVAGNFDHKPVHAELLAADLQLWPAPQAG FT ARNPAPQQQTQQRWAEVALPVTQKQLAAAAIRLEEALVEATADLHSATRQA FT TQSIEHALTRHSMDPTGYPASVMHRDLAQDTSIQKADINQLAEQLASALDT FT GLTCLLEECTRKAPFTGKHHASRSTARVLKPLWDKEVALKTQLTNLTQGPQ FT ALPEVDAANAAAKLRNEIKACQAEHRQLVADRAKAQREAAATALQHTLATR FT PAQGHKRIFQKEDMERGLPAVRNPETGEVTTDSTSILAILETHFRKLSAPP FT RGTRTGDFRLPSNATRGYPFEKADATDQFTLDRNRHPDTHSMLPSMADTAN FT FEQCISHLSRNKATGPDGIPNELLRILPSGMKRNLHCILQIMYVKSQIPET FT WAASETVLLPKPGDALDIKNKRPIALANTCYKLYTSMLTLGIGELAGPLQL FT FSEAQEGFRAYCNTERQVLNLVHALEDAALFGKDVYAVYVDYSSAFNTIDQ FT DRLLQIMFDLGLPTDLIRAVRNLYAHATTRIRTEHGSTSAIPIERGTVQGD FT TLSPVLFILFMEPLVRWLHAGGRGYHYGCLTPSENLQYHCSAAAYADDLAA FT LTNSLDDLQVQCDKIASYAEWASLRVNHTKCATTAIWHDKSRSDPNLDGPT FT GKATLAAMRRNMTNTIKIGTTPVPYFPPTQPYKYLGVQLTFSLDWSAHVAR FT VTEIVKDKGTAIATSLATPAQRLRMIQQCVHTTVAYGFAAMPFTKQDITTL FT DTTLAGFAKRCYGLPRSFPTRTSLLPANEYGLGLGSLLPQYARVAQRALVL FT ALNDSGRLGIVTRALLPRQASIAGPTQAHLLPAHRSHHLTTLKQMTLAKEY FT GVTLYQNGSAFTAPTWSIAAALEAEAEARGVEPLPIEYVLPLADLRLELSH FT LVDRNTGKHLITSSDLEKHMGASRVRHKHKVALNRLSLALSMAARAGENAP FT AHGSPAPLTTAQRALPDVAAIVALAMRADPLPLGMANPLDRYLQPLPEAPP FT AQANTTQAPADLAPPSPAGAAQAHPGAQGAAAPEVPADGATQPSLRPPARA FT RKPATTRRPPQNQTRARRVTKHAAQRARIAETTRTTPEDLLTGDRAAINHA FT LRLTNGDESLPVGTRGYETACSVFQALAKDMTGHMRICHPHPQDPPSLAHA FT VEPRQQRRGLPPALTSRLTANAANPEPNAISEYLNNTGLPMEVNQAAAHND FT PMDADSPTPPLTAPAAQAAGGHGARPSWRLARARITHDLVPRPADNRTNGD FT TQLTYDLDNDGQELAQVLHSENKQTVTDKELTRKRKAREDTSGTREKYWKV FT QWKPSICPAGIVSAFVRMGYATIQNARCYKHPLATRPGPQPAQTEAEPPAE FT TQAHAISPEEPAQAAEPAAPQPTGAPLFSSWTTKLLRHVTWAPSTDRERDL FT KHSPQWEELLEECKTRLANGTNQGPRPRTVRPAPDRNLTARQRQGRHDAEP FT LDTASRARDIRTTTTITTSPCNPYKDIVAPGAYTITTTGGTRRDPAEAHVH FT EPSGRWLGTITYPRLLTLWERFRHTGNQRPNAFEEAVAALIMRYRYDPAQQ FT DRKAMPMHQVSLPAGTVAAIVQCLRVTQAVHIREMFASPLNSSTAAHEYWT FT RDPADGAFGALHDAYQTAWTGLQYAHPPSTPHDARKALMWALACAEAMRDS FT QEPTLTVLALPKAATFPHTQWLQHPLCHELANWSAGTAGLDTGLGTNTQAE FT RQKGLRLVIVGNPAGLQCFAPRLKRLVDTLKRGANAPTHISDPRTWTHTAT FT APTCPALPNSLLKQAKYQHANPKTIHAMADEARRFPNARFQTHHALAHDVN FT GTVWTDGSVSKIKTDNGKEVQVAGACAWFSDSRVVYVNPNGAGCTNTITRA FT ELAAIRAALAEFGGEGKEFATKKLTIASDSVASLYLIKRAINEPRRLHLSK FT HRDLLDSVVALLHARHARGAPTALIKVISHTGLHGNEEADKGAAAVATQEK FT PADVSEPADNDPHARHWWPTFTVTRDDDSTATHYVSDLNRGLTRAMAPSCR FT GGYANKTLYTEKWAEAAPSLDPRASNAYMTNSDASHGLRRFIQNARQGGLI FT CPARLALFKLRDSDKCGLCAHTQRSRGSDPERGTAGHLAGHCAHPQLVGAR FT IAKHNTAVRTIAECLHHGHNGGGYMIMDATSRAELPEYCAGMRPPSWLCPQ FT VPAADLNRMRPDILFIPNLPRSEAERFMTSPPANKGAYPVYILEVGYTSDL FT HHSEKLIQKQAQHATLATAMRAAGWTVHYDTKHVITLGHGGTVPRTLETLL FT KDLGAKPQAAKACCTRLHMHSVTTLRGTANLYYRLEREMGIANTRCRPGGR FT TQGTANGGPRAGEPG" XX SQ Sequence 10812 BP; 2610 A; 3799 C; 2691 G; 1712 T; 0 other; tagaatgcat acaaatacac ctcttaccaa cctcgcaagc tcgtcgctag gaaatatgtg 60 gaaaatggtt gctcggcgaa aggtggagcg aaagtccgca ggtatctgcg ttctgatatg 120 ctttgcactc tgcatgctga caaccaacgc gaccacggct gggaacgcgt ctggctctgc 180 cgcggccgac cgctggtttg accggggacc gccccggcat tccagcgctt tcgcgctatt 240 gggcaaccat gaccgtacca tgccaatcca aaagtggtct gcaagaggtc tcacgacctg 300 caccgagagc tcggcgaaag ggggatgttc gcatgtttgc aacagccggg caactggcgc 360 acagctgcca gccctaagcc acactgatgg ccccattgcg acacatggaa ttaaccacgc 420 acacggtagc gtgtggaggg agaacttcag ctttgcgcac tcgatgttca cagtacactc 480 gaaccccgcg aacgcgacgc tcttgtcgga tcgctgctct cagaaaggca tatttgatac 540 cagtatactg cacacgacgt gctacacagc tagcaagcgt gctggcgccg aacctttcgg 600 catgtcgcca aagagacctt cgcaaagccc aactgctgca aaacacccgt cttttgagga 660 gcttataagc tccaagcaca taccagttgc ctcagagccg ctcgccaacg acgtcccaga 720 aaggggccga gccgaacctc cagtatttac agctaattcc cagctctatg aaaatgcgag 780 tttgctcaca gaaccaccac cgacaagctc cgaacgaagg caataccgaa gccctagcgg 840 actcctgtcg cgcgcaaagg ttaagcccaa gcctggatgc accaacatac gcaatgtgat 900 gataacggct ttgcagccgg taggggaggg gctcctttct acaaatctga aaatagcaca 960 ggcatgcaga agtggcgtga caaagttatg cacagcggag ggagcatacc gttcatatgc 1020 gactaagaca atcgggctgc ttggagagag cctgttgacg agtgtgcttg cgctgggcga 1080 cctgggcagt cggcatggag gaaccatgct gcgctccggt gcttcggcac tgcactccat 1140 ggcagaaaca gcctacgggc cgcaaccgac ccttaactgt agcgccccct gggaaaattc 1200 accactcctt cgcctgcacc ttacactcgt cgtccacttc cactgggctg gagcagggca 1260 tgtgcgggca gccatgcaca gccatacaat tgccatcaga gtaggcgcac accttgctac 1320 ccggtctaga ctcgccatct accaaacagc cgcccagact ggcaaagccc tacacaccat 1380 acttgacaaa gccattgtca gcctcacaat catcaccgcg acaggggcca tgtatatgct 1440 ctggacccac ctcctgcatg ccgaaacccg cactgcctca gtcaactcag aaatctgtgc 1500 aaacggtgca atcctcagcc tgctactatg gggtcccaca ctcaccactc tcgctctctc 1560 ccctctgtgt gtaatagcgc ttgccctcgg gataaccgcc atccccgcag ccctcggtct 1620 cctgttcgca ctgacagtga catccatcct tgccgtcctc tgtctcccag ctctcctcaa 1680 acatgtggtc caagtgacac ggcagatagc ctctgcaacc tgggcccggc ggaagctact 1740 actctgggga acggccatgg ccatcatcct gcgaaacctg tacctcacac tgccggaccc 1800 ttgcagtgtg caaccggcac agccggcact ctgggggcca gcattggtca ccggcctact 1860 tactgctagc caggcaacca ctctcctcac gcttacatat gcagtaaacc ggacacacac 1920 aatgccctac gtcaagccaa cactgcgagg tgcgctgccc acagacccga cagccccgcc 1980 ggcgcccgcc gaccctcgca cgcatgcggc agtgaacatc cgaccaccag ccgcactgca 2040 ctggccgcca aaccccaaca ccacataccg tgaaacacag cggcaacact tctgcagtgt 2100 ccacgccctc aacaactctt tggggctcgc atggctcgac cccctggatg tcctctctta 2160 cgctaaacga gtgcacgctc acctcacagc cacgcaagac ccgaatgccc tgttctggaa 2220 agaatgctac tgccccaaca gtggcgcctt cagcgaattt ctcctgaacc actacctgta 2280 ccacaatgcc accatcagca acatattcgc ctaccccaac aggaagctga ttatgcgccg 2340 aacccacttt ccgcgactca acggcgacat cagcaaagaa aaggtcctag agagccttcc 2400 ggtagccgcc cgaactcgcg gctttaccgt ccaccagtac actgtgcgac acacaattgc 2460 tgtgcgttac gaagcaggac aatggagagt aatagactcg gtgaacagcc ctatccacaa 2520 cacggtctta cacgacaaca cttggaacac acttgatggg gaggtctggt gccttgacgc 2580 ggtgcgtgca tctgacacgg caacacttca ggacgcactt gggcaactcc gtgagatcgt 2640 cccgccacca cgcccgccac cccacatgga ggccaccaga cccgtgcaga cagggcccgc 2700 cgccacagga aggccggcac cagagccagc tcaacatgcc caagacaact caccgctgcg 2760 ggcagcacac agggcgccac agcatgcagc accccacact accgcacccg cgcagccaca 2820 caatgcaaat gccggcatag caccccgact caacacgaca ggcacccgac aaactatcct 2880 taactggcta gtacggccca ctgcacgcaa ccacgcccac atggaaccca accaggcaca 2940 cacaaacacg cgacccccca gcagcaacac caccctacac atcgtgaccc acaatgtacg 3000 gggactgctc tcagaaattc tgtccaccgg ccaacccggc aacctcagct tcaccctctg 3060 caacctagcc cagtggaaag ccgacatagt tgtcctaacc gagacaaaac tgacagggaa 3120 gactgactct atcaagcgag ccttccgaaa tgaaggatac cgcctctact gtagcactac 3180 tccccgcgtg gccacagcgc gcggtagcgc tggcgtagcg gtggccatct ccgcccggta 3240 cagtgacctc ggctgtgtca cacaccacac gcccccacca acactccatg gctatgtgtc 3300 acacgttcaa attaagaccc ccgggagcac gcctctcact gtactgggca tctacgcacc 3360 agaggacatg caaacccgca aggcaatcta cacatactgt cagcaggtgg taggccgggc 3420 ggacgcgtgc ggacaccacc tagtcaccgc gggagacttt aatgccgtcg cccgagcaca 3480 tgagcgggac agccccatcg acacggcgga ccgggcccac cagcggttcc ttgccgatag 3540 tggcctgcgc cctatccgcg gagacaccac taccactgcc gaatggtcat atgaacaaac 3600 acgcccgggc atggccccat accacagtcg catcgatgac atcctcctat gccctgccac 3660 gcgcgctgca tgcacagaag cgcgtgaata caccagtaca gttgcaggca actttgacca 3720 caaaccagtc cacgcagagt tactagccgc ggacctacag ctatggccgg caccacaggc 3780 cggcgcccgg aacccagccc cccagcagca gacccaacaa cgatgggccg aggtcgccct 3840 gcccgtcaca caaaagcaac tggccgccgc agctatccgc cttgaggaag ccttggtgga 3900 agcaacagcc gacctgcatt ccgcaacaag gcaggcaacg caatcaattg aacatgcact 3960 cacgaggcac agcatggacc ccacaggcta ccccgccagc gtcatgcacc gagacctggc 4020 gcaggacacg agcatccaga aggccgatat caaccaactt gcagaacaac ttgcctccgc 4080 actggacacc gggctaacat gcctccttga agagtgcacc cggaaagccc ccttcactgg 4140 caaacaccac gcatcacgct ccaccgcacg ggtgctaaag cccctatggg acaaggaagt 4200 ggccctgaag acacaactga caaacctcac gcaagggccc caggccctac ctgaggtaga 4260 cgccgccaat gcagcggcaa agctgcggaa cgaaataaag gcctgccagg ccgagcaccg 4320 ccagctagtg gcggaccgcg ccaaagccca gcgagaggcc gctgccacag cgctgcaaca 4380 taccctggcc acgcgccctg cacaggggca caagagaatc tttcagaagg aagacatgga 4440 acggggtctc ccagcggtcc ggaacccaga aactggggag gtgacgactg actccaccag 4500 catcctcgcc atactcgaaa cccacttccg aaagctgtcc gccccaccac gcggcacccg 4560 cactggggac ttccgactac ccagcaacgc cacacgcggc tacccatttg agaaagccga 4620 cgcaacagat caattcacac tggaccgcaa ccggcaccct gacacacact ccatgctccc 4680 cagcatggca gacacggcaa attttgagca gtgcatttct cacctctcca gaaacaaggc 4740 gacaggccct gacggcatac caaacgaact gctccgtatc ctgccatcgg gcatgaagcg 4800 caacctccac tgtatcctgc aaatcatgta cgtcaagtca caaatacctg aaacctgggc 4860 cgcatcggag accgtgctac tccccaaacc tggagacgcc ctggacataa aaaacaaacg 4920 gcccattgcg ctcgcaaata cctgctacaa gctgtacacg tccatgctta ccttgggcat 4980 tggcgaactg gctggccccc tccagctatt cagtgaagcg caggaaggct tccgtgccta 5040 ctgcaacacc gaaaggcagg tcctaaattt agtacatgcc cttgaggatg ccgcactgtt 5100 tggaaaagat gtctatgcgg tatacgtgga ctatagctcc gcgtttaaca caatcgacca 5160 ggaccgatta ctacaaatta tgtttgacct tgggctgcca acggacctga tccgcgcagt 5220 ccgcaaccta tatgcacatg cgaccacccg catccgcacc gaacacggaa gcacatccgc 5280 cattcctata gagcgaggca ccgtgcaagg ggatacactc tcacctgtgc tctttattct 5340 gttcatggaa cctcttgtcc gctggctcca tgcgggcgga cggggctacc actacggctg 5400 cctgaccccc agcgaaaacc tgcaatacca ttgcagtgcc gccgcgtacg cagacgacct 5460 ggcggcgctc accaactcgc tcgacgacct gcaggttcaa tgtgacaaaa tcgcatctta 5520 cgccgagtgg gcgagcctgc gtgtcaatca caccaagtgt gccaccaccg ccatctggca 5580 cgacaaatct cgctcggacc ccaaccttga cgggcccaca ggaaaagcta cccttgccgc 5640 catgcgccgc aacatgacca acacaattaa gattggcacg acccctgtcc cgtacttccc 5700 cccgacccag ccgtacaagt acctgggagt gcagctcacg ttctcgctgg actggtcggc 5760 acatgtcgca agagtcaccg agattgtgaa ggacaagggc acagccattg ccacatccct 5820 ggcgacaccg gcacaacgcc tgcgcatgat acaacagtgc gtgcacacca cagttgcata 5880 tggattcgca gcaatgccct tcacaaaaca ggacataaca acgctcgaca ctaccctggc 5940 tggttttgca aagcgctgct acggactccc ccgcagcttc cccacccgga cctccctcct 6000 gcccgccaac gaatacggct tgggcctggg ctccctcctg ccgcaatacg cgcgcgtggc 6060 ccagcgagcc ctggtccttg ccctaaatga ctcgggccgt cttgggatag taacccgagc 6120 ccttctgccg cgccaggcaa gcatagctgg ccccacacag gcccaccttc tcccagccca 6180 taggtcacac caccttacga cgctgaaaca gatgaccctg gcaaaagagt acggtgtcac 6240 actgtaccaa aatgggtctg cctttactgc gcctacctgg tccattgccg ctgcccttga 6300 ggcagaagct gaggcccggg gcgtcgaacc tctccccatt gaatacgtcc tccctcttgc 6360 ggacctgagg ctcgaactga gccacctggt ggaccgcaac acaggcaaac acctgattac 6420 tagctcggac ctcgagaagc acatgggagc gagccgggtc cggcacaagc acaaggtagc 6480 cctgaacagg cttagccttg ctctcagtat ggctgcacgt gcaggggaaa acgctccagc 6540 ccacggcagc cctgcccccc tgaccaccgc acagcgtgca ctgcctgacg tggccgcaat 6600 cgtggcctta gccatgaggg cggaccccct ccctctcgga atggccaacc ccctcgaccg 6660 gtacctgcag cccctcccgg aggcgccccc cgcccaagca aacaccacgc aagcccccgc 6720 cgacctagcc cccccctccc cagcaggggc ggcccaggcc catcccggcg cacaaggtgc 6780 tgctgcaccg gaagtccccg ctgatggggc aacgcaaccc tccctacggc caccggccag 6840 agcccgcaag ccggctacaa cacgccggcc accacagaac caaacacgcg cacggcgcgt 6900 tacaaaacat gcagcgcaac gcgcacgcat tgcagaaact acacgcacta cccctgagga 6960 cctactcacc ggggaccgcg ccgccatcaa ccatgccctg cggctcacca acggcgatga 7020 gagcctgccg gtgggcacgc gggggtacga aacggcatgc tcggtattcc aagcgctcgc 7080 caaagacatg acaggacaca tgcggatctg ccacccacac ccacaagacc caccctccct 7140 tgcacatgct gtcgaaccac gccaacagcg ccgtgggcta ccacccgccc tcacatcccg 7200 cctcacggcg aatgccgcca accccgaacc caacgccata agcgagtatc tgaacaacac 7260 aggattgcct atggaagtca accaggctgc cgctcacaac gaccccatgg acgctgatag 7320 cccgacgccg cccctgaccg ccccggcagc gcaagctgcg ggaggccacg gggcccgtcc 7380 cagctggcga ctggcccgcg ccaggatcac acacgatcta gtcccacggc ccgccgacaa 7440 ccgcactaac ggtgacactc aactcaccta cgacctggac aatgacggac aggaactggc 7500 acaggtcctg cacagcgaaa acaaacaaac tgtcacggac aaagaactga cacggaagcg 7560 caaagccaga gaggacacct ctgggacacg tgaaaaatac tggaaggtgc agtggaagcc 7620 gtccatttgc ccggccggca ttgtctctgc ttttgttcgc atgggctacg caactatcca 7680 gaatgccagg tgctacaagc accctcttgc gacgcgccca ggcccacaac ctgcgcagac 7740 cgaggctgag cctccggctg aaacccaagc ccacgcaatc tcacctgaag agcctgccca 7800 agctgcagag ccggcagcac cgcagcccac tggggccccc ctattctcgt cctggacgac 7860 caaactcctg cgacatgtca cgtgggcgcc gagcacggac cgtgaacggg acctaaaaca 7920 cagtccacaa tgggaggagt tactggagga gtgcaaaacc cggctagcaa acggtaccaa 7980 ccaaggcccg cggccccgaa cagtgcgccc cgcacctgat cggaacctta ccgccagaca 8040 gcgccaaggc cgccacgacg ccgagcccct ggataccgca tcccgcgcgc gtgacatccg 8100 cacgaccacc accatcacca caagcccgtg caacccctac aaggacatcg tggcccccgg 8160 agcatacaca attaccacca ctggtggcac ccgccgtgac ccagctgagg cgcacgtcca 8220 tgaacccagt ggccgctggc tgggcaccat cacctacccc cgcctcctta ccctgtggga 8280 gcgcttcagg cacaccggca accagcggcc caacgccttt gaggaagcgg tcgctgccct 8340 catcatgcgc taccgctatg acccggcgca gcaggaccgc aaggccatgc caatgcacca 8400 ggtgtcgctg ccggcaggca cagtagcagc catcgtccaa tgcctgcggg tcacacaagc 8460 agtacacatt cgggaaatgt ttgcgtcccc actaaactcc tccactgcag cccacgaata 8520 ctggacacga gaccctgctg atggggcctt tggcgcactg cacgatgcct accagaccgc 8580 atggaccggc ttgcaatatg cccacccccc ctctaccccc cacgatgcac ggaaggcgct 8640 catgtgggcc ctcgcatgcg ccgaggctat gcgtgactca caggaaccaa cccttactgt 8700 attggcgctc cccaaagcgg ccactttccc ccacacgcag tggcttcaac acccactttg 8760 ccacgagctg gccaactggt ccgcgggcac cgcgggcctt gacacaggac ttggcaccaa 8820 cacccaggct gaacgccaga aggggctccg actggtcatt gttgggaacc ccgccggact 8880 ccagtgcttt gccccccgcc tgaaacggct cgtggacaca ctgaaacggg gggcgaacgc 8940 acccacgcac atcagcgacc cgcgcacgtg gacacacacg gccacggccc caacctgccc 9000 ggccctccca aactcacttc taaagcaagc caaataccag cacgctaacc caaagacgat 9060 acacgccatg gctgacgaag cacgacgctt tcctaacgcg cgattccaga cccaccatgc 9120 tctggcccac gacgtgaatg gcactgtctg gacagatggt tcggtaagca aaatcaaaac 9180 tgacaatggc aaagaagtcc aggttgccgg cgcctgcgcc tggttcagcg attctcgcgt 9240 tgtctacgtg aatccaaatg gcgctgggtg cacaaacact attacgagag ctgaactggc 9300 agcaattcgt gctgccctcg cagaatttgg tggcgaaggt aaagaatttg caactaagaa 9360 gctcactatt gcatcagatt ctgttgcaag cttgtaccta atcaaacgag caattaacga 9420 gcctcggcga ctgcatctga gtaaacatcg ggacctcctg gattcagtcg tggcattgct 9480 ccatgcacgc cacgcccggg gcgcacccac cgctctgatt aaagtgatct cgcacaccgg 9540 gctacacggc aacgaggaag ccgataaagg ggcggctgcc gtcgccaccc aggaaaaacc 9600 cgccgatgta tccgaaccgg ccgacaacga cccgcacgcc cggcactggt ggccgacctt 9660 cacggtcacc agggacgacg atagcacagc cacgcactac gtgtcggacc taaaccgggg 9720 cctcaccagg gcaatggccc ccagctgccg tggtggatac gccaacaaaa ccctgtacac 9780 tgagaaatgg gccgaggcag cacccagcct tgacccacgc gccagtaacg cgtacatgac 9840 gaactcggac gccagccacg gcctacgccg gttcatacag aatgctcgtc agggaggcct 9900 catctgcccg gcccggctgg ctttgtttaa actacgagac agcgacaaat gtggtctctg 9960 tgcccacacc cagcgcagcc gcggatctga cccagagcgt ggcaccgcag gtcaccttgc 10020 cggccactgc gcgcaccccc aactcgttgg cgctaggata gccaaacaca acactgcagt 10080 gcggacgatt gccgaatgcc tacaccatgg tcacaacggt ggtggctaca tgataatgga 10140 tgccacctcc cgagctgaac tgcctgagta ctgtgcgggc atgcggccgc cgtcctggct 10200 ctgcccgcaa gtgcccgctg cagacctcaa caggatgcgg cctgacattc tgtttatccc 10260 gaaccttccc cggtccgaag cagaacgctt catgacatcc ccaccagcca acaaaggggc 10320 ctacccagtg tacatccttg aagttggata cacctctgac ctgcaccaca gcgagaagct 10380 cattcaaaaa caagcacagc acgccaccct tgccactgcc atgcgagctg cagggtggac 10440 tgtgcactac gatactaaac acgttattac actgggccac ggtggcactg tcccccgtac 10500 ccttgaaact ctcctcaaag acctaggagc caaaccgcag gcagccaagg cctgctgcac 10560 acgtctccac atgcatagcg tcaccacact ccggggcacc gcaaacctgt actaccggct 10620 agagcgagag atgggcattg ccaacacccg atgccgcccg ggaggccgca cccaggggac 10680 cgccaatggc ggcccccgcg caggggagcc ggggtaggga aatcaagttt tccgatggtg 10740 gggtgagctt gtctcactcc tccttaagac acgcgggcac ctgtgccctg cgtagatatt 10800 attattatta tt 10812 // ID P-1_CR repbase; DNA; CHL; 9810 BP. XX AC . XX DT 29-MAR-2006 (Rel. 11.03, Created) DT 29-MAR-2006 (Rel. 11.03, Last updated, Version 1) XX DE P-1_CR is a family of P autonomous DNA transposons - a consensus DE sequence. XX KW P; DNA transposon; Transposable Element; P-1_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-9810 RA Kapitonov V.V. and Jurka J.; RT "P-1_CR, a family of P autonomous DNA transposons in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(3), 162-162 (2006). XX DR [1] (Consensus) XX CC This is a first example of P transposons in viridiplantae. P-1_CR CC encodes a 1822-aa TPase composed of the DDE domain conserved in CC all P transposases (aa pos. 537-1121) and PHD zinc finger (pos. CC 1670-1715). The P-1CR TPase is predicted to be encoded by 15 CC exons (based on FGENESH and comparison with other P transposases CC ). P-1_CR is characterized by 8-bp target site duplications and CC 146-bp TIRs. The genome is populated by several families of CC nonautonomous transposons derived from P-1_CR (DNA-8-1_CR, CC DNA-8-2_CR, DNA-8-3_CR). XX FH Key Location/Qualifiers FT CDS join(251..319,1149..1550,1737..1911,2047..2321, FT 2469..2833,2871..2909,3185..3586,3743..4192, FT 4301..4491,4597..4752,4886..4952,5197..5503, FT 5572..5690,5797..6001,6117..8360) FT /product="P-1_CRp" FT /translation="MGTKAGRQAIRRALELLWALKATILLVYLSVSPQFIT FT FSSAMGRDEQKQYSHCYTCGNKVLSYSFIRVWRGGSQPDWCKGGQGCCSLT FT VNGSKHFNSCQLHINGSHYLSATRGPGNPELPITLQPPPAKVGLRVCFECA FT QHNSTVQAELQQGYEEEQQHVLRRSHRRNASSVSGVDLDTFDPPPSDKQSE FT TPAAPSPPALEQLTPAPTTGDAVAADSGFTIRRQKSSSDSAFYDAAAPLYS FT YITGLDSPGSEFSFTSRQMKYFVKTVNLSNGLHSRKRSNYFVKQTDLFMTS FT PAGRRLTTITPPMSEAHLNTSPAAIMAAAAEAAAAFDVLECIAPSERLDSL FT LAAFSPTAKAGWETWDTRVTAAWRCQLVQQHSHTGDKHLRFDCTTGSHYHN FT RLSVVLKPDLGFSVQHALVAVQLPADAPSADRLTTAVGHATAARQQQSAEE FT HKAVQSKADAAAAAAASAATAAADRYAQALKNSEARIVELRVQNFTLGQDL FT AATQEQLLAVQAELEASRAKMTELSADESADLVTLLQHEDVQAHMKENPGQ FT AAFWRDQLEYVRRARQGGKMVNGMRWHNSTLQFALGLLNQGGAGLYERIGD FT QLSLPSISQLLNYKGFTACGSGWHTPVMEQAAELVKMAGAQPDGGLAFDEM FT KISSGLVFNIATNSFVGYSDVDVGTEAQRLHQLLGGKPPLPSAAGAAAMGS FT ESVPPPALATHVLHLTYTTLGPKTVRASVGYFFTNGVSAVELAPLVSHGIT FT QLRAAGLRVHYTVCDGAPENRKWMELMADGELAKQVAEECNITMPSANPTA FT HLRCFRDPADPTLPIIMLTDGPHLIKKGRNNIERSRSGTGANGRNTVEMLW FT PDDSGWVELSCPAPPSPWAPPSPAPPSPAPPSPAPPSPAPPSPAPPSPAPP FT SPWALPSPAPPNPAPPGPAPPSPWALPSPAPPNPAPPNPAPPSPAPPNPAP FT PSPAPPSPQWPTAAAQPVAATPEEVGVDARTAGEPPADIKAAALRLRGFVR FT LFNSTMMSLEREEPIESTDDSFLVQLLRNGETVYSWHEKLRQDEKDKKLPT FT TGAGRKGLSQECYSDFQVTCYGVVALVRSYISNSPQGSQPRSISLATLNQN FT CVENAFSQLRGHGQNRCPDARRVMDGEQHLRTNEVLIAANLTRARHQRNGS FT YAPASAASAGSASVDAVDSLLHCRRRGPTRAAAASSTWSLADLSNSTQPAG FT DQPSWTATVAAAAGLRQLAWQDLVGTGIFKAHCSAVPQLVAALQPDSCVSH FT AWGAYCAWLLPAMEDALVKSNGTTAATKHVFGAQHAAPCYHAWSWLCAQPE FT VGPISRPPFSATTPLVPARGRNTTAAEPPSATMAEWQRLGQSACLVHSYVF FT AAAARVGVLRGLLLPAAQPAAAPSAGRPKKQPPKQQPALPPKEATFELFSN FT RTACSRVTGWALHSTLQNATNLAKRSKTAAASSQARLVEELIEARTDGCFR FT RSFDDDLLPSLKLGRIVTPKPSITDFFEGLQRGLISHVKSASSVLRGGPLV FT VQGWIDEVRYDLAAWGSFQAACASAGVSVQHSNAQQQQHYSLADLRALMGV FT LVTKYVHANLEGLLRHLKVLVQADKASDRALRDERRADAATAAKKGKATTA FT AGGADGQRRGRKTAAPAAEAPEVESADEPDSSDGSDGSDSSDCESTYGDDF FT VCQLCNEPEPADAMLLCDSCDKGYHNFCLTPRVDGIPDGQWFCPNCAQLAA FT PAAAIARKRAVDAMPAEAKRTRSAAAAGAERNMAASGVGAAAAGGAVVAVV FT ISAGAAGLATGAAVEPLAPKRGRAKRGLPAEAPPAQREGARKRLPSVRLND FT " XX SQ Sequence 9810 BP; 1887 A; 3039 C; 3115 G; 1769 T; 0 other; caggggaggg gggtatgcaa ctcggcctgg atccagcccg ggctgcatgg gacccggcca 60 aaaggtgttc ggacccccca gcgcctgcgc ctgagacttt attattgtta cctggacaaa 120 tggttagtca tttaaactga caggcgtcgc taggaatggg gcgcaggccc ccctgtaaca 180 tttcgagtgt tgggagccac ctacggccgt tttgtgccac cgaccgatct atatagcatg 240 gtatagtgga atgggtacga aagctggtcg ccaggcgatt cgccgcgcac tggagctgct 300 gtgggccctc aaagcaacgg caagttgggg gtaatcggcc ctttgcgggc tcgctaggca 360 ttctaaggag ctgccagcgg ctaccagggt accgcaagga caataagaag ccatggcttg 420 ggctggcgtg gccttagagg tctcagcaac gcggccggtg tggcattaat tcccgcccgg 480 acatgcccct ggcgtcgcga acgtccatat ggcgtcgcct ccggctctcc tgccgtggtt 540 aggtgtcttg gccaaataga catctgtatg aacactgtgc gcctggcccg atatagttta 600 gacgtgcgca gacggtctgc actaaggtcg cgaaattcca ggggattcgc gttgccggct 660 tgcgtagttt cgctaaatca ctgggacgcc gttgaatcct acaatcacat gaggtccctg 720 atatacggag ctcattgccg gggccttagc gtggaccagt ggctccaacc agaatgctgg 780 cacattcttg catcgccagg accgcccaag tgaaattccc tggtggtttc gtgtcttggc 840 caactagaga tgcaagggat cctaacgcgc ttgtcgcatc gacaaacggt tcagattcgg 900 ggacatgcac taaggtcgcg tttttccgtg ccatttcgcc gccctatggc gcccgcccgc 960 gtgccccggt atccgcatac cagcatgtgg gttgagcgcc aaacactgcc cgtgaccgta 1020 gttcggtggt atgaataagt gggaggcgcc aggcgccaag gggtggtgtg taatggcact 1080 cgtgtgccct ttcccacatc gcctttcacc ctccctgtcc tgctgcctgc ctctcttccc 1140 tgccgcagat tctcctagta tacctgagcg tcagcccaca attcataaca ttcagttcag 1200 ccatgggccg ggatgagcag aagcagtact cgcactgcta cacgtgcggt aacaaggtgc 1260 tctcgtacag cttcatccgc gtctggcgcg gcggcagcca gccggactgg tgcaagggtg 1320 ggcagggctg ctgcagcctg accgtgaacg gcagcaagca cttcaactcc tgtcagctgc 1380 atatcaacgg ttcgcactac ctcagcgcca ccagggggcc cggcaatcca gagcttccca 1440 ttactctcca gccgccgcct gcgaaggttg gcttgcgcgt ttgtttcgag tgcgcccagc 1500 acaacagcac ggtgcaagca gagctacagc agggctatga ggaggagcag gcacgaaacc 1560 agcatcgggg ccacaatgca cgcagcacac ttcttgtcgc cggcttggtg aacaccagca 1620 tcgcggctgg cgcagcggct cagccgcgtg catttggcac ggagatccag ctgcagcagc 1680 aggcgccggg aagcactgta gtccctgtgg ctgctgctgc agccgctgct ccgcagcagc 1740 atgtgctgcg gcgcagccac cgcaggaacg cttccagcgt cagcggtgtc gacctcgaca 1800 ctttcgaccc gccaccatcg gataagcagt cggagacgcc tgctgcgccg tcaccgccag 1860 cactggagca gctaacaccg gcacccacca caggggacgc cgttgctgca ggtgtgacgg 1920 cggtgccaga accggtggca gctcagctgc gcccagccct gcgggccgtg tcagaccagt 1980 ggactggcac cacgccatct gcatgcgagc tggtgctgta tgctatcctc cgccttcact 2040 tcgcagacag tgggttcacg attaggcggc agaaaagcag cagcgactcg gccttctatg 2100 acgcggcggc gcccctctac agctacataa ccggcctaga cagccctggc agtgagttca 2160 gctttacatc gcgacagatg aagtacttcg tgaagaccgt caacctcagc aacgggctcc 2220 acagccgcaa acgctcgaac tactttgtta agcagactga cctcttcatg acttcaccgg 2280 ctggccgccg tctgactacc attacgccgc cgatgtcgga ggtaagtgaa acacgccggg 2340 atgggtggtt gggtgtgtcg cctacgcatg gtggcgactg cgctaggaat ggtggctggg 2400 gggcaagcgg taactggaaa ggcgtatcga cattattcac gtgcgccgtg tgtacttcgt 2460 ttacgcaggc acacctgaac acctctcctg cggccatcat ggcggcggcg gcggaagccg 2520 cagctgcctt tgatgtgctg gagtgcatcg cccccagtga gcggctggac agcttgcttg 2580 ccgctttcag tccgacagcg aaggccggct gggagacgtg ggacacacgc gtcacggctg 2640 cgtggcgatg ccagttagtg cagcagcact cacacacagg cgacaagcac ctgcgctttg 2700 actgtaccac gggcagccat taccacaaca ggctcagcgt ggtgctgaag cccgatctgg 2760 gattcagcgt gcaacacgcc cttgtcgcgg tgcagctgcc agccgacgcc cccagcgcgg 2820 atcggctaac aacgcaggca gccctgtcca acctcctgct gtaccttcag agctgtcggc 2880 catgctacgg cggcgcgtca gcagcagagg tgtgtgcgta caacggccag cgcagcagct 2940 ccaatcacac ctcggactca gtccctgtac ggcgctttgg caaaagcccg cgtgcggtcg 3000 tctaccccgc cggcgggcac cttgcgctgc ccaccgacga cccgctgctg cggaccagct 3060 acgagtcacg tcaggtgcac ttcagcgccg actgcagcag ccttgtcagc ggtgcaagct 3120 gctgctctcg ctgcaccagc tgccatgcac acgccccgtc cctgcgcgcc accctccgcc 3180 gcagtgcaga ggagcacaaa gcggtgcaat caaaggctga tgcagctgca gcagcagccg 3240 catctgcagc taccgccgca gcagatcggt acgcgcaggc gttgaagaat tcagaggcga 3300 ggatagtgga gctgcgggtg cagaacttca cgcttggcca ggaccttgcg gcgacacagg 3360 agcagctact ggcagtccag gcggagttgg aggcatcacg tgctaagatg acggagctgt 3420 cagcagacga gtcggcggat ctggtgacgc tgctgcagca cgaggacgtg caggcgcaca 3480 tgaaggagaa ccctggccag gctgctttct ggagggacca gctggagtac gtccgccgcg 3540 cgcgccaggg cggcaagatg gtgaacggca tgcgctggca caacaggtgt ttagggttgg 3600 gtctgggttg gggttggggt tggggttggg gcgacgggag gggaggtgca gcgcggggtt 3660 gggcggtgtg gtgggtgcgg ttcgttctta acctccatgc tgtttgtgct tccgcttgct 3720 ccttttccaa ctcgttccac agtactctcc agttcgcgct cgggctgctg aaccagggtg 3780 gcgctggcct ctatgagcgc ataggcgacc agctttcgct tccgtccatc agtcagctcc 3840 tgaactacaa gggcttcacg gcgtgcgggt ctggatggca cacgcctgtg atggagcagg 3900 cagcggagct ggtgaagatg gctggcgcgc agcctgacgg tgggcttgcg ttcgacgaga 3960 tgaagatcag cagcgggctg gtgtttaaca ttgcaaccaa ttccttcgtg ggctactcgg 4020 acgtggacgt gggcactgag gcgcagcgcc tgcaccagct cttgggcggc aagccgccgc 4080 tgccctcagc agctggcgct gcagccatgg gcagcgagtc agtgccgccc ccagctctgg 4140 cgactcacgt gctgcacctg acttacacga cgctcggccc caaaaccgtc cggtaggaat 4200 gtgcatgatg cttgtcgggt ttgccgtgca aatgggtaga gcatggggta accatctatc 4260 acctatcatc aatttgtgtg acttgcgtga ctggccccag ggcttccgtc ggttacttct 4320 tcaccaacgg tgtgtcggct gtggagcttg ccccgctcgt ctcgcacggc atcacgcagc 4380 tgcgcgcggc tgggctgcgc gtgcactaca cggtgtgtga tggcgcgcct gagaaccgca 4440 agtggatgga gctgatggcg gacggggagc tggcgaagca ggtcgcggaa ggtaggagtt 4500 cgcgggctag ctagcacgag ctagctacaa tcatggtaca tgctgctgag ttccttttat 4560 gccactgtta tgcgacattg cctgtccgtt gaacagaatg caacattacg atgccgtccg 4620 ccaaccctac cgcgcacctg cggtgcttcc gcgacccggc cgacccaacg ctgcccatca 4680 ttatgctgac ggacggccct cacctcatca agaaggggcg gaacaacatt gagcgtagcc 4740 gcagtgggac aggtgtgcga ccagttcggg gaggagcagt aacttgggca ttacctctag 4800 tccatgctgt ttgtcactga tatggccctg gatgccttga atgggaaccc cataatcgta 4860 tgacttgatg ccgcattctg tgcaggagcg aacgggcgca acactgtgga gatgctgtgg 4920 ccggacgaca gcggctgggt ggagctgagc tggcgggatg tccagacagt cgtggaatgg 4980 gacttgcacc actctccaca gctgatggcg catgtgaccc acagccacat ccacttgacg 5040 agctggacac gcatgcgttg ccatctggtc tcgcagcttt tccgtgggct ggaggagttc 5100 gtgcactggg tagacacgga gggatggaag cgtagcgccg cccaacccgt tgacgccgcc 5160 cagcccagcg ccgcccaacc cgttgacgcc gcccagccca gcgccgccca gcccgtgggc 5220 gccgcccagc ccggcgccgc ccagcccggc gccgcccagc ccggcgccgc ccagcccggc 5280 gccgcccagc ccggcgccgc ccagcccggc gccgcccagc ccgtgggcgc tgcccagccc 5340 ggcgccgccc aacccggcgc cgcccggccc ggcgccgccc agcccgtggg cgctgcccag 5400 cccggcgccg cccaacccgg cgccgcccaa cccggcgccg cccagcccgg cgccgcccaa 5460 cccggcgcca cccagcccgg cgccgcccag cccacaatgg ccggcgtgcg gcccacgcgg 5520 cgggtggcag cagcccagcg tgcacctgtg gctactgttc acctggtcca gacagcagct 5580 gcgcagccag tcgccgctac accggaggag gtgggggtgg atgcgaggac cgctggagag 5640 ccaccggctg acatcaaggc cgctgctctg cgcttgcgcg gcttcgtcag gtgagcgctt 5700 gaaattccag ggcctaaagt cggtgctgct ttggtacgta gcttcaaccc atttaattgc 5760 tgtactgctc tgaacacgtc ccaccgtttc ctacaggctg ttcaacagca cgatgatgtc 5820 gctcgagcgt gaggagccca tcgagtcgac cgacgacagc ttcctggtgc aactgctcag 5880 gaatggcgag accgtgtaca gctggcacga gaagctccga caggatgaga aggacaagaa 5940 gctgcccacc actggcgcag gtcgcaaggg cctttctcag gagtgctaca gcgactttca 6000 ggtgaggcag cacagccaga cttcgtttcg gacaggactg cttgctaggt ggccacgcag 6060 cacgtgtatc ggagggctta caataacccg tttgctctgg ccgtccgcgc acacaggtca 6120 cgtgctatgg cgttgtggcc ctggtgcgca gctacatcag caactcgcct caaggcagcc 6180 agccgcggtc aatctcgctt gcgacactca accaaaactg tgtcgagaac gcgttctcgc 6240 agctccgtgg ccatggccaa aaccgctgcc cggacgcgcg acgtgtgatg gacggcgagc 6300 agcacttgcg caccaacgag gtccttatcg ccgccaacct cactcgtgcc cgacaccagc 6360 gtaacggctc gtacgcgccc gcatccgcag ccagcgcagg cagcgccagc gtggatgcag 6420 tggacagcct tctgcactgc cgccggcggg gcccaaccag ggccgcggcg gcgtccagca 6480 cctggagcct ggcggacctg tcgaacagca cacagccggc cggcgaccag ccatcatgga 6540 cggcaaccgt agctgccgcg gctgggctgc ggcagctggc gtggcaggac ttggttggca 6600 cgggcatctt caaagcacac tgctcggcag tgccccagct tgtcgcagct ctacaaccgg 6660 actcatgtgt cagccatgca tggggggcct actgcgcatg gctgctgcct gcgatggagg 6720 acgcgctggt gaagtcaaac ggcaccacag ctgccacaaa gcatgtgttt ggtgcacagc 6780 atgctgcgcc gtgctaccat gcctggagct ggctgtgcgc ccaaccggag gttggcccaa 6840 tcagcaggcc cccgttctcg gccacgacgc ccctagtgcc tgcacgcggc cgcaacacta 6900 ctgcagcaga gccgccgtca gcaaccatgg cggaatggca gcggctgggg cagtcagcct 6960 gccttgtcca cagctacgtc ttcgcggccg ctgcacgcgt gggggtgctg cggggtttgc 7020 tgctaccagc ggcacagcct gcggcggccc cctcagcagg gcgtcctaag aagcagccgc 7080 caaagcagca accggcgttg ccgcccaagg aggccacgtt cgagctcttc agcaaccgga 7140 ccgcctgcag ccgtgtcacg ggctgggcac tgcacagcac cttgcagaac gccaccaact 7200 tagccaagcg gtccaaaacc gcggcggctt catcgcaggc gcgcctggtg gaggagctga 7260 ttgaggcgcg cacggacggc tgctttcggc gcagctttga cgacgacctg ctgccgagcc 7320 tgaagctagg ccgcatcgtt accccgaagc cctccatcac tgacttcttc gagggcctgc 7380 agcgcggcct gatatcgcac gtcaagtccg ccagcagcgt gctgcgcgga ggcccgctgg 7440 tcgtgcaggg gtggattgat gaggtgcggt acgaccttgc ggcctggggc agcttccagg 7500 cggcgtgtgc gtccgcaggg gtgtcagtgc agcacagcaa cgcgcaacag caacagcact 7560 actcgctcgc tgacctccgc gcccttatgg gcgtactggt gaccaagtac gtgcacgcca 7620 atctcgaagg gctgcttcgt cacctaaaag tgctggtgca ggccgataag gccagtgacc 7680 gggcgctgcg ggacgagagg cgggcagatg cagctacagc ggcaaagaag ggcaaggcga 7740 ccacagctgc tggcggcgca gacggacagc gtcgtggccg caagacggca gcaccagcag 7800 cagaggcgcc ggaggtggag tctgcggacg agcccgacag cagcgacggc agtgacggca 7860 gtgacagcag cgactgtgag tccacttacg gtgacgattt tgtctgtcag ctgtgcaacg 7920 agccggagcc agcagacgca atgctcctgt gtgacagctg cgacaagggc tatcacaact 7980 tctgcctgac acccagggtg gacggcatcc cggacgggca gtggttctgc cccaactgcg 8040 ctcagcttgc ggcgcccgcc gctgcaatcg cccgcaaacg ggctgtggac gctatgccag 8100 cagaggcaaa gcggacgcgg tcggcggcag cagcaggcgc agagcggaac atggcggcga 8160 gtggtgtggg tgcagcggct gctggcggag ctgtggtggc ggtcgtcatc agtgcggggg 8220 cggccggctt ggccacgggg gccgccgtgg agcccttggc tcccaagcgc gggcgcgcca 8280 agcgcggact ccccgcagag gcaccgcccg cccaacgcga gggagcgcgc aaacggctgc 8340 catctgtgcg cctgaacgac tgagtgcata tgggattggt tactgatgtc gcaccgtaac 8400 caccgtaggg actagcaagg tagagctgtg atgggcatcc ccattctttg caggcgtttc 8460 gcatgtacta catgaataaa cacagcagca gcctgaacgc cgcgcacggc atgcatacag 8520 catgcgcgcg ggactccata gtgaggactg gtccaaatgt ctcgattttg tatgtacgtc 8580 gtgtagcaca tcacttataa gtacatgcgc acatactggc atacgtagct ttgctttggg 8640 cgccgatgcg taccatatgc aaccttgaga ttcagtgcaa acttatgcat tcatggcaaa 8700 cattctctca ttcttagtac cagggtctag taccgtacag taaagtatgg agtcggaagt 8760 gaacaacgca atgaacacgg aagggaagaa cgccgtggcc gtacaatgta cggtagtatg 8820 gagtcggaga agtgaagggc agcatggctg agagtggagc gtcagcacag catgtagcat 8880 agcaggggag ccctgtagca tagtagcgat gacacagttt acgacttgta catgtacata 8940 gaggcaggtg atattgaagg cggcagtgaa ggtggccgtg ttttgtttgt ccgcatcata 9000 cgacctgtgc gatgtagatg accatacgtg gccacgcacg agatgcagat acgggcaggg 9060 taggcatcag gggaggcctt cccctggtag gtattacacg catggtgggg gtgtgcacac 9120 acgcgagtag gtgctcaagg agaccggagc atgagcacaa gagtcacgag tcccccggtc 9180 aggaggtgca taccagcgca gccctgctgg cgtgcacaaa gcacagggga gacatgtaac 9240 gaacttttag cagcatcctc gtgtttcacg gctgtgggct tcacgctttg cgtccagggg 9300 taaggtgcct tccgtccagt gacccttgcc tggaagcgcg ggacgcacac gcgcgcagac 9360 accaacagac acggaccaaa cccctaaata tggcatcggt acgtatgtgt agagctgtgg 9420 gcaggggccg ctggcttctc tgggcctcgg aaaggccacc aaacggccat ttccgcgaag 9480 ggtctgtggg gcatcgggca tcgctcctga gaggccgcgc cctccgcttt ctggcgttca 9540 cttttattac acaggtagct ggaggcccgc tgaggtgatc acaagtgtcc ggaatttcgt 9600 tccccgtgtt tcaagcgcgc gcacattggc catggccgac tcactcgacg cactcgtttc 9660 tgggcgcctg tcagtttaaa tgactaacca tttgtccagg taacaataat aaagtctcag 9720 gcgcaggcgc tggggggtcc gaacaccttt tggccgggtc ccatgcagcc cgggctggat 9780 ccaggccgag ttgcataccc ccctcccctg 9810 // ID DNA-8-6_CR repbase; DNA; CHL; 1465 BP. XX AC . XX DT 29-MAR-2006 (Rel. 11.03, Created) DT 29-MAR-2006 (Rel. 11.03, Last updated, Version 1) XX DE DNA-8-6_CR is a family of P nonautonomous DNA transposons - a DE consensus sequence. XX KW P; DNA transposon; Transposable Element; Nonautonomous; P-2_CR; KW DNA-8-6_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1465 RA Kapitonov V.V. and Jurka J.; RT "DNA-8-6_CR, a family of P nonautonomous DNA transposons in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(3), 114-114 (2006). XX DR [1] (Consensus) XX CC This is a consensus sequence of a DNA-8-6_CR family of CC nonautonomous P DNA transposons. DNA-8-6_CR elements are less CC than 2% divergent from their consensus sequence and they are CC characterized by 151-bp terminal inverted repeats and 8-bp target CC site duplications. DNA-8-6_CR is a nonautonomous derivate of the CC P-2_CR transposon. XX SQ Sequence 1465 BP; 350 A; 395 C; 406 G; 314 T; 0 other; caggggtgcg gggtggccaa ctcgccccat gtgatcggca acggccggca accgccccca 60 attcccagtt gcgaccaggc cccggcccga tttctttgca tagacctata aggagtaccc 120 catagatgac ccttggcgcg tcgggcgact aaaataaggc ctccgacaac agagcaatat 180 ttcatctgcc attgctgcgt gcaccacctt ggtgacgcgc ttggcgcgcc agatcatgct 240 ctattagcaa aaggaagaag atgaggtgct caggaggggg cttttaagct tcgaggctgt 300 gggcaatgct catggcggca gaaagctgcc acagggcaaa agaggagctt tgcaactgcg 360 acgttgctta tcatccgtcc tcagctcaat tattctgttg gaatgtaaat ctactcgccg 420 cagggttgaa ctgaacttgc attaagccgc ttccctctca tttttggtgt cccaagttct 480 aggaatcatt tacgatacat aggggcaaac ctgaggcaat tgccatgcca atagagccat 540 cgccgactgc acaaagtcgt cgcggaggtg aaaggccggg tgacaggggg ccgggaagcg 600 cgaacgccca aaagtagcag ccgtaactgc atggatgtta tgtatggcat cattcatgca 660 taattagccc cttggaggat caaaaacgcg agatataccc ttgtttcaat aaggatgtca 720 cttgtgtgca tagactaata gatcgcatag ccctaggaat catctaggat aggcaacgaa 780 gtggatgatg taccggggga tacgcaagcg tgagaaacgc gtataaaaag tcgtcgcggg 840 tgtttcaggc gcctagaatt ggtgcctgga ccgcgcgggg cgcacatttc acccctgatc 900 caccggcact aactatattt ccatctatag atgctcccgg gctccaaggg agagcgaggc 960 ggggaagaac gagcttgcca acagccagca tgacgctagc agcgtcgggc aaggccgtcc 1020 tggcccggcc ccagttgtgt tatgcgccca aactgatttt tgacatgtca aggtcaaagt 1080 agtgctggcg gctgctattg cgcagtcctg ctttcgaact taggctcaga attgcacctc 1140 gccgtgcgca tggaagcccc tctctgcctg cagtaggcac ccggagacct gaaacaaaga 1200 ttcttcagac ctccaataca taacaaggcc atagattgct atcgcgcaac ttttcctgcc 1260 tccaagccga gactcaaatt tgcgtcgaca agcatgtatt cggccaccgt ttcatagtcg 1320 cccgacgcgc caagggtcat ctatggggta ctccttatag gtctatgcaa agaaatcggg 1380 ccggggcctg gtcgcaactg ggaattgggg gcggttgccg gccgttgccg atcacatggg 1440 gcgagttggc caccccgcac ccctg 1465 // ID Gypsy-12-LTR_CR repbase; DNA; CHL; 649 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-12-LTR_CR is a long terminal repeat of Gypsy-12_CR LTR DE retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 4-bp TSDs; Gypsy superfamily; Gypsy-12-I_CR; KW Gypsy-12-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-649 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-12_CR, a family of nonautonomous gypsy LTR RT retrotransposons from the green algae genome."; RL Repbase Reports 6(5), 233-233 (2006). XX DR [1] (Consensus) XX CC It is a long terminal repeat of the Gypsy-12_CR LTR CC retrotransposon, whose internal portion is Gypsy-12-I_CR. XX SQ Sequence 649 BP; 108 A; 213 C; 156 G; 172 T; 0 other; tgcctgtgcc tgcagtctgt agcactcgca taggattgct tgcttgtaaa cgcacatacc 60 tgttgtacag acttgcctgc ctgggagctt agcttgctca gctcagccgg gttgcgcatc 120 agcagcgcgt acgactccag cgggtactca gccttatagg gtagtctcgc tcctctctct 180 tatccaaata cctctggcgt tctacacgcc ctaacaacta cgcccaggct ccgcctgagg 240 atacggcttc tatcccctct ttctctcgtg caggtccggc ctgccggccg ccaagtctct 300 ctcccttctc tctctcccta tccctttcct ttacaggttc cccagttctc cgtgtacata 360 cgactgaatc atctctcgac cacacctctg gtatctccga ctattgctgt gatcgccgca 420 ggcgcatctt gatcaccccg caagtgccca ggacagcggc tggtatacag ggtcgggtgt 480 ggcggccgtc cagcctctcc acgctccgtg ctcataccct gagaaggtca aagcggtgca 540 ggtctgtcgt catacagctg catcgtggcc agtgcctcag tgacagcccg cgttacagcg 600 caaaggtttt tatagttccc ggttttttac gggtgctgcg cttttagtg 649 // ID SINE-6_CR repbase; DNA; CHL; 405 BP. XX AC . XX DT 20-MAR-2008 (Rel. 13.03, Created) DT 20-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE SINEX-6_CR is a putative non-autonomous non-LTR retrotransposon - DE a consensus. XX KW Non-LTR Retrotransposon; Transposable Element; SINE-6_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-405 RA COGNAT V., DERAGON J.M., VINOGRADOVA E., SALINAS T., REMACLE C. RA and MARECHAL-DROUARD L.; RT "On the evolution and expression of Chlamydomonas reinhardtii RT nucleus-encoded transfer RNA genes."; RL Genetics 2008 - Accepted. XX DR [1] (Consensus) XX CC 18 copies of SINEX-6_CR are present in the C. reinhardtii genome. CC The 100-bp 3' terminus of SINEX-6_CR is 56% identical to the CC RandI-5 3' terminus. Target site duplication are not present. XX SQ Sequence 405 BP; 82 A; 93 C; 131 G; 99 T; 0 other; tccccggtag ctcaattggt agagcatgcc gctgtcacat ggcagaccca ggttcgaatc 60 gcggattcgg ccgggttgag gctgacaagt atagatgcag gttcggatcc tgcccgggga 120 accgagtcag tattccggta tggagtccgc ggtactgacg gaagcgttgt agcgactctc 180 tgggttcgga tcccattgtt gcaacgtgga aacttcacga tggccgaatt cggagagttg 240 gtaggccgat aggtccagaa cttcggttcc tatggactga gtgaaggtgg atgcgtgggg 300 tgcctcttgc caaggtccca cagagatacg gtagggttag cctcgatggg actcttcctt 360 aaggcacgcg ggaccttggt ccgtgcgtag gtattattat tatta 405 // ID SINEX-2_CR repbase; DNA; CHL; 484 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE SINEX-2_CR is a putative nonautonomous non-LTR retrotransposon - DE a consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; SINEX-1_CR; SINEX-2_CR; KW nonautonomous non-LTR retrotransposon. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-484 RA Kapitonov V.V. and Jurka J.; RT "SINEX-2_CR, a SINE-like non-LTR retrotransposon from the green RT algae genome."; RL Repbase Reports 4(2), 46-46 (2004). XX DR [1] (Consensus) XX CC Several copies of SINEX-2_CR are present in the C. reinhardtii CC genome. They are ~99% identical to the consensus sequence. CC SINEX-2_CR are flanked by 14-bp target site duplications. CC SINEX-1_CR and SINEX-2_CR share a common ~330-bp 5' terminal CC portion. XX SQ Sequence 484 BP; 114 A; 106 C; 152 G; 112 T; 0 other; atcaccggat cgctaactag tttagcgatt tgaatcgtcc ttacggcaga agggacccgg 60 gtacaaatcc ggatgaaagc ccaattatgc aaaaggcgaa agattggtgc gaaaccgatt 120 ggtgatccca cgaatgaaaa aggtcgtggt gcagaagtcc gaattagagg cgtccaaggt 180 gcggtccaag gtccaagatc caaggtcgaa gcaggagcgc catggctctg gcatgttgca 240 actaactggc agtgctccgc agctttagtg cggcgttccc tagcgacata gggtgatggc 300 taggataggg cggctcggat tggctaggat aaggtggctc ggatgtgcta ggatgatggt 360 ggctaggatg cggtggtgca gaccgtagag tctgcttggc agtgctgggg ttgcactcgc 420 aagcgattct ccgataatct cacgctagga gcttaggccc agtgtaacct ttctttcttt 480 cttt 484 // ID Gypsy15-LTR_CR repbase; DNA; CHL; 284 BP. XX AC scaffold_7; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy15-CR; KW Gypsy15-I_CR; Gypsy15-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-284 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 102-102 (2008). XX DR Genome; scaffold_7; Positions 166894 167177. XX SQ Sequence 284 BP; 67 A; 67 C; 79 G; 71 T; 0 other; tgttgtagag catattgtta tatgcgatat atgatatgac acgctacggt gtccagaatt 60 ggggctgttc tagctatgag tctagaacct gcaatgtgtc cagagatggg ccgctgcatg 120 gagtgcagta cgctcggcga cacttgaaat agtcctaact ggccggcagc cagtaacata 180 ggtgacctgt gcactagccc tgctaaggtc ttgctattgt tagccgggtc ggcaccggct 240 tagggcaata gctctcgcta attggcacag agccgcctac aaca 284 // ID Copia1-LTR_CR repbase; DNA; CHL; 217 BP. XX AC . XX DT 13-MAY-2004 (Rel. 9.04, Created) DT 13-MAY-2004 (Rel. 9.04, Last updated, Version 1) XX DE Long terminal repeat of the Copia1_CR retrotransposon. XX KW Copia; LTR Retrotransposon; Transposable Element; 5-bp TSDs; KW Copia1-I_CR; Copia1-LTR_CR; COPIA superfamily; endonuclease; KW protease; reverse transcriptase. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-217 RA Kapitonov V.V. and Jurka J.; RT "Copia1_CR, a family of copia LTR retrotransposons present in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(4), 103-103 (2004). XX DR [1] (Consensus) XX CC Copia1-LTR_CR is a long terminal repeat of the Copia1-CR LTR CC retrotransposon, whose internal portion is deposited in CC Repbase as Copia1-I_CR. XX SQ Sequence 217 BP; 55 A; 55 C; 61 G; 46 T; 0 other; tgtcgtattt ggggggctca gcggttcggt ctgccccttc ccccatgcgt gtcattgcga 60 tggtctaagg agcctatagg gccgagagta agcccgacct agactaagtc gatataaggc 120 aacattgcac taagtggttg gtatggcacc aaaggaacgg agcctgtaaa cctggagttg 180 cattgcccgg tagaacgacc cacaccagaa tacaaca 217 // ID RandI-2 repbase; DNA; CHL; 9543 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 22-JUL-2010 (Rel. 15.08, Last updated, Version 2) XX DE RandI-2 is a family of non-LTR retrotransposons - a consensus DE sequence. XX KW RandI; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; RNaseH; RandI superfamily; RandI-2; KW Dualen. XX NM RandI-2. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-9543 RA Kapitonov V.V. and Jurka J.; RT "RandI-2, a family of RandI non-LTR retrotransposons from the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(7), 197-197 (2004). XX RN [2] RP 1-9543 RA Kojima K.K. and Fujiwara H.; RT "An extraordinary retrotransposon family encoding dual RT endonucleases."; RL Genome Research 15(8), 1106-1117 (2005). XX DR [1] (Consensus) XX CC RandI-2 is a family of RandI non-LTR retrotransposons. The CC 5'-terminal portion of the consensus sequence is incomplete. CC The reconstructed consensus sequence encodes a 2064-aa CC RandI-2p protein (nucleotide pos. 2-6193) composed of the CC reverse transcriptase (aa pos. 90-350) and RNaseH (aa pos. CC 1570-1730) domains. The genome contains ~100 copies of RandI-2 CC that are ~1.5% divergent from the consensus sequence. CC This original consensus is >96% identical to DualenCr3 in [2], CC but shorter. The 5' 3209 bp is added from DualenCr3, but still CC likely 5'-truncated. XX FH Key Location/Qualifiers FT CDS 1..9402 FT /product="RandI-2p" FT /translation="MSSNPMTLQIGTQPGQQQPDLTSPSHPLRGGNNTERT FT TTPAAENGHIPVPLSWPRQIPSTYFREQQHLNFCTIHAINNMLGAPWLRAH FT AVLAFCQVVHTALADTPELQATWGHHFKHNGLFHEAAINHYLRHRPAPIDT FT TGTRLLLRLAAPNLDPGHSQDTILGLLPECSRNMGFMVHDGRHAYAIRKEE FT DSWFLVESFVHIGRKPMTPDLWQAVKGDIFCLHPYHLDMPLDPRTYPTIPN FT LPPPPQPPSPPPPNQTKPKRGPTQRALSTMFARQTTQLPTNHSAGDNAAPT FT PHPGPAETTVTRPLTQATLNITRREATEPARTQNERPATPNSAARPTRRPR FT HDNQLDPEPTSPTRANTTTRPRANANMHAPNTNPTQPTANNTTQPTWAQVV FT ATHAEAAGNALRTVGRITTQAPHQRPETPPSAARPTQRPRADPPDSPTTHQ FT TVNPNTPRRTPPGPTPDPGAGAQDNTETRPDNTTWLHIATHNVRGLGTNLL FT STICHLREWNADIVILTETKLGPRTRWLKDTLRQEKLSYRTFTSTKPGTEQ FT YKRRSAGVLMAISKRYHSGSNVHTATIPPHLHGYIVHCTIRTPHSTPLHIL FT GVYAPEDMPTRRRIYKYCEETINTANKTDAHVIIGGDFNAVLNEGDRTGPP FT DEADKHHARFVSKNKLKRLPEFGGTISMSFYQTRDGTPQPTSRIDDILICQ FT NIHSALPTASRTPATLQVQPCGGLFDHYSLHARVPTFPIRLWPSHTATQAD FT TQPPETPGPWRTVVLPIPKSTLEAARTKIEHTLANPIATLASAIQPATTAI FT QQHLAQHTAGATNASELQHALQTDPAIQTPDLDALANQLLEILQDGLSILV FT DMCPKKPPTRGVFAPRKINKRIRESHNEIKQLRTQCDALLQQQHPYANITT FT TTAPNTGIPLPDPTTEQDAENPPPQQPPPAHDATTTAEPQRRPRPPPATPA FT IANLIDKIKRAGDTLRQILKKQTQHDIQKARDRFQHKLATRAKQAHKDIFS FT TQEQSRGIPALIHPQNKAHCTDKASILDAIHAYFQGQGASITGPRSGKYKP FT EDRTQVYPFDPSLKNSAPDTFTLHSATTPPTHALLPSIADSTNFDSCIDHL FT SRNKSPGPDGIPNELLRALPQTLRTTLHNLLTLMWVRAHTPASWTASETVL FT LPKPGDPLLLKNYRPIALANTTYKLWTSLITVAISDISHDLDLFSETQEGF FT LRYRNTERQTLNLVHALEDAGLTKQDIYVMYADFSSAFNTISHDRLLEIMW FT DLGMPHDLIRVVQNLYSNARTYIRSEHGLTAPIYIERGTVQGDTLSPVLFL FT VFIEPLIRWLHVGGRGYSYGCLPPHLNARYNCSSAAYADDLAVLTNTLSDL FT RIQCDKIHRYSEWAGLQVNHAKCRVTGILHRRAMQDKGLNGPTCNRTLRSM FT LENKIHIGTTSIPYLPATEPFKYLGVQVTMTLNWGPQFAYLCKNITDKSLS FT LQTCQASPEQCLRIIKSCIQSVAAYSFAVMPYTENDIRTLDNMIARLARTC FT YRLSRGFPTRAVLTPAEMSGLGVGSLLPIYVQKATRALTLALNDPGRLGMV FT THALLELQTRIAGTTDMHKLRHESHYYTTLKQLSIMRAYNLTLWDHGAPYK FT GPPNTLLTAISRSPHPISHHLVLTLTQAGLDLRNLVGKERTPEGRACLIPS FT DILFQHLPRNQCQHKHRVALNRLSLCLSAECDGAREAPFNQAESYKCHLPL FT PQRYRALPNTPAFAALLPRNATPGELTTQSTMDAHLQHMTALHPNTEAQTP FT QHPNGLGPRQRQTTLAHMLRTTGAPPTAPTQSPTATLTQTPTGTLPTPPLA FT QPMPPPLTQPTPPLTTPAGPDAPNTQTPPTVASLTTAHQQPKSKKQRPNAI FT KRLQQKIDADPRHDPLPPTDMRSHTEKSGFILNNDPDENDSGPPNFVMWAP FT HYFANVQKDMQGTNRMPRLGIPRHHSRFPQNRTRPNPTRTIPAHLADPYFL FT GSQVDDTNPLNPVTAYISANPPTTPPTCAYDSRPRWADTGDFMIAPEPPNW FT PNITGATPGDSHDPRQADHDPQGQMRAEYMQERERLRAHLDDHPPQPRPFK FT RWMGIRLQLLTGKLPWPTQAYHVAQAAKQLVYSLYHDIEEISHIQAGPITK FT AVTAPEKGASLHTERYYEVAWKDTTIVKGAAEAYTRMKYTGTAAAHMGYGQ FT ACHKYFLRMKWDPTSEPVDKCPTLQGWTPAISEHLRKLAEPPTPTPQPKPK FT DARLTPAQQQGRWTQADVAASHVRRTTKQNTRIDTNPCNPYLDVHPTGRYT FT IQIPDTYTDERTLAHVYNPTGRHIASVTLGTLLTLKARYEHALSTHHDATS FT PHRSQGGFPEDLAKLLLRTRVDSKGKTCKSAPARERSLPESLKLALYNCLA FT STHSDPDRPVGLVERFASPLNCSGRATKYYSQHAADSLFGASHDAYTAPFD FT GLSLAHPDPDPDACTKALAWATASIEHLEHQATPAITFMIIPQGKGAASRC FT LASPHVTILALIPPGVSASSMLDGLGAADEAASSKQGLQLLLIANTTGVQA FT LNSNNNLTRLQTSLAELGAVPPPTSQPPGRRQPLEWPARVTLPRDRTAPPG FT RPYSHRAKARAKQLTRPQAPAARGLDTCPTPMPTLSALYTTRAAFPTRSSY FT LYDAATIVWTDGSCIKVAAGEDGNEMNKLGACAWSQQTEHIYVEPGGLNST FT NTIQRAELSAILVALEYIRPRLCLLGMTQVTIASDSLVSLYLIRRALTAPQ FT RLLLSKHKNLLENILHELQMYADAGVPVQLLKVRSHTGLHGNEEADKGAAK FT VAANDMEAMGQAIRLAPDEPWSNIWWPKRNSDNFYLSDLNRGVLNSLPPEA FT QQGFTNNTPVLEQWAEASANMCPTLSNRTLASSTRHPWLVKQILYARYGYL FT FNAQLKRRYRLGGNGNCPLCHTPDSGGHILGGCTHPKMKGAYINRHNWAVQ FT RVAKCISKGTYGNGFMVMDAGPLADLPHYASAQRPPAWLAAERTSSPLITR FT LRPDILFIPNLPAHETHREGYRPPLNKARLPIYILEIGYSSDVRHLDKREE FT KSQQHQELKRLLEAAGWTVHYSTNECLALGVTGAIPCNLTKLLTSQLGVTH FT TAARVCGERLHTHAVNSAGGIIKMRRVTEQELKPRGG" XX SQ Sequence 9543 BP; 2603 A; 3434 C; 2022 G; 1481 T; 3 other; atgagtagca acccaatgac tctacaaatc ggcacgcagc ctggacaaca gcaacccgat 60 ctaacgtcac cctcacatcc gctacgaggc ggaaataaca cagagcggac cactacacca 120 gcagccgaaa acgggcatat cccagtacca ctaagctggc ccagacaaat accaagcaca 180 tacttccgcg aacaacaaca cctcaatttc tgtacgatac acgctatcaa caacatgctg 240 ggagcaccgt ggctaagggc ccatgccgtg cttgctttct gccaagtcgt gcacactgcg 300 ctggccgaca cacccgaact acaagcgacc tggggccatc acttcaaaca caatgggctt 360 ttccatgaag ccgccatcaa ccattacctc cgccaccgcc cagcgccaat cgacactacg 420 ggaacacgcc tgcttctacg cttagccgcc cctaaccttg acccaggcca ctctcaggac 480 acaattctag gcctactccc agaatgctcc cggaacatgg ggtttatggt gcacgatggg 540 agacacgcct acgcaatacg caaggaagaa gacagctggt ttctagtcga atcctttgta 600 cacattgggc gtaaacctat gacaccggac ctctggcaag cagtgaaagg cgacatattc 660 tgcttacacc cctaccacct agacatgccg cttgacccaa ggacataccc aacaattcca 720 aacctaccgc cgccgccgca accaccatca ccgccgcccc cgaatcaaac caaacccaaa 780 cgaggcccca cccaaagggc cctctccact atgttcgccc gacagactac ccaactaccg 840 acaaaccaca gcgcaggaga caacgcagcc cccacacccc acccaggccc cgccgaaact 900 acagtgactc gccccctcac ccaagccacc ctcaatatta cccgaaggga ggcgactgaa 960 ccagcgcgaa cgcagaacga acgccccgcc acccccaaca gtgcagcacg gcccacccgc 1020 cgcccacgcc atgacaacca gctcgacccg gagcccacaa gccccacacg cgcgaacacc 1080 acaactagac cgagggccaa tgcgaacatg catgcaccga acaccaatcc cacacaacca 1140 acggccaata atacaacaca accaacgtgg gcgcaagtag ttgctacgca tgccgaagcg 1200 gcaggaaatg cactccggac agttggccgg atcacaacac aagcacccca ccaaaggcca 1260 gagaccccgc caagcgccgc acgacccaca cagcgcccac gcgctgaccc acccgacagc 1320 ccgacaacgc accaaacagt taatcccaac accccacgac gcacaccgcc tggccctaca 1380 ccagaccctg gcgctggcgc ccaagacaac actgagacac gccctgacaa cacaacttgg 1440 ctacacatcg ccacacataa cgtacgcggg ctgggcacaa accttctttc caccatatgc 1500 cacctacgcg aatggaatgc agacatcgtg attcttaccg aaaccaagct gggcccacgc 1560 acacgctggc taaaggacac cctacggcaa gagaagctat cataccgcac tttcacgtcc 1620 acaaaaccag ggactgaaca gtataaacgc cgctccgcgg gcgtcctcat ggccatctcc 1680 aaacgctacc actcggggag caacgttcac acagcaacga tcccaccaca cctccacggg 1740 tatatcgtcc attgcaccat acgcacacca cacagcaccc ccctccatat actgggagtt 1800 tacgctccag aagacatgcc cacacgccgt cgtatataca aatactgtga agaaacaatt 1860 aacaccgcca acaaaacgga tgcacatgtt attattggcg gcgacttcaa tgcagtcctg 1920 aatgaggggg acagaacagg gccccctgac gaggcggaca aacaccacgc ccgctttgtc 1980 tcaaagaaca aactcaaaag gcttcccgaa ttcggcggca caataagcat gtccttctac 2040 caaacacgag atgggacccc tcaacccacc agtcgtatag atgacatctt aatctgtcag 2100 aacatccact cggccctccc cacggcaagc cgcacaccgg caaccctgca ggtgcaaccc 2160 tgcggtggcc tcttcgacca ctattcactc cacgcccgcg tgccaacctt tccaatccgc 2220 ctctggccct cacacaccgc gacacaagca gacacacaac ccccagagac gccgggcccg 2280 tggcgtactg tggtgctacc aatccctaaa tcaacacttg aagccgcgcg aacaaaaatc 2340 gagcacacgc tagccaaccc aatcgcgacc ctcgccagtg ccattcagcc tgccacaaca 2400 gcaattcaac agcatctcgc acaacacacg gctggcgcca ccaacgcctc ggaactacaa 2460 catgcccttc agaccgaccc ggccattcaa acaccggacc tagacgccct ggctaaccag 2520 ctcctagaga tccttcaaga cgggcttagt atcctagtcg acatgtgccc caaaaaacca 2580 cccacccgcg gtgtgtttgc acctcgaaaa ataaataagc gcatccggga gtcccacaat 2640 gagataaaac agctacgcac acagtgtgac gcactactcc aacaacagca cccatatgct 2700 aacatcacga cgacgacggc gcccaacact ggaatcccat tgcctgaccc aacgacagag 2760 caggatgctg aaaatccgcc accccaacag ccacccccgg cgcacgacgc caccaccacc 2820 gcagaaccac aacgccggcc tcgacctccc cccgccacgc cggcgatagc caacctgatt 2880 gacaaaataa agagagccgg agatacgctg cggcaaatcc tgaaaaagca aacacagcac 2940 gatatccaga aagcacgaga taggttccaa cataaacttg ccacgcgcgc caaacaagcc 3000 cacaaagaca tattttccac acaagagcaa agcagaggca tacccgcgct aatacaccca 3060 caaaacaagg cccactgtac ggataaagcc tccatccttg atgcaatcca tgcctacttc 3120 cagggccagg gagccagcat cacaggacca cgcagtggta aatataaacc cgaagatcga 3180 acgcaagtgt acccattcga ccccagcctc aagaacagtg cccctgacac ctttacccta 3240 cacagcgcaa ccacaccacc aacgcacgcg ctgctcccca gcattgctga cagcaccaac 3300 ttcgactcgt gcattgacca cctaagccgc aacaagtccc caggaccaga tggcataccc 3360 aatgagttgc ttcgcgcctt accgcaaacc ttgcgtacca ctctgcataa cctccttacc 3420 ctgatgtggg tgagggcaca cacaccagcc tcgtggacgg ccagtgaaac agtcctgctt 3480 cccaaaccag gagacccgct cctcctaaaa aattaccggc ccattgcact tgccaatact 3540 acatataaac tctggacaag ccttataacc gtagccatat ctgacatctc acacgaccta 3600 gacttattca gcgagacaca agaagggttt ctgcggtacc gcaacacgga acgacagact 3660 ctaaacctag tgcacgcact agaagatgca ggcctgacca aacaagacat atatgtgatg 3720 tatgccgact tctcaagcgc cttcaacaca atctcacacg acaggctttt ggaaattatg 3780 tgggacctag gcatgccaca cgacctgatt cgtgtggtcc aaaacctcta ctccaatgcg 3840 cgcacctaca tacgctctga acatggcctg actgccccca tctacattga acgtggcacc 3900 gtacaaggag acacactctc acccgtccta ttcctggtgt tcatagaacc actcatccgc 3960 tggctgcatg tcggggggcg cggctactcc tacggctgcc taccacccca cctgaatgcc 4020 cgctacaact gctcctccgc agcatacgcc gatgacctgg cagtgcttac aaacacctta 4080 agcgacctcc gaatccaatg cgacaaaata caccggtact ctgagtgggc aggcctccag 4140 gttaaccacg caaagtgcag agtcactggt atcctgcacc gcagagccat gcaagacaag 4200 ggtctgaacg ggcccacatg caaccgtaca ctaagatcaa tgctggaaaa caaaatccac 4260 attggcacca caagtatacc ctaccttcca gcgacggaac cgttcaagta cctcggtgtg 4320 caagtcacaa tgaccctaaa ctgggggccc caatttgcgt acctatgcaa aaatatcacg 4380 gacaaaagcc taagcctcca aacatgtcaa gcctcaccag agcaatgcct gcgcatcatc 4440 aaatcgtgca ttcaatccgt tgctgcatat agtttcgccg tcatgcccta cacggaaaac 4500 gacatacgca cacttgacaa tatgatcgca cgactggcca gaacatgcta tcgcctcagc 4560 cgcgggttcc ccacgcgagc cgtcctaacg ccagcagaaa tgtctggcct aggagtcggc 4620 tccctgctcc caatctatgt gcagaaagca acacgggccc taacattggc cctgaatgac 4680 ccaggccgcc tgggaatggt cacacatgcc ctgcttgaac tacagaccag aatagccggc 4740 actacagaca tgcacaaact ccgccatgaa agccactact atacgacact gaaacaactc 4800 agcatcatgc gcgcctataa cctcacccta tgggaccacg gggccccata caaagggcca 4860 cccaacacac tgctaaccgc aatcagccgc agcccacacc ctatctcaca ccacttggtc 4920 cttaccttga cacaagccgg ccttgatttg cgaaaccttg taggtaaaga gcgtacaccc 4980 gaggggcgcg catgcctcat tccttctgac atcctcttcc aacacctgcc acgcaaccaa 5040 tgccagcata agcaccgagt ggccctaaac cgccttagcc tatgcctgag cgcggaatgc 5100 gatggtgccc gtgaggcacc tttcaaccag gctgaaagct acaaatgtca cttacccctt 5160 ccgcagcgat acagagccct ccccaacacg ccagcctttg ccgccctcct accccgcaat 5220 gccaccccgg gcgagctaac aacccaatct acgatggacg cccacctcca acacatgact 5280 gcactacacc ccaataccga ggctcaaaca ccccaacacc caaatggcct aggcccccgc 5340 caacgccaaa ccacgttagc ccacatgctg agaacaacag gtgccccacc cacagcacca 5400 acccagtcgc caacggcaac tctaacccaa actccgacag gcacactgcc taccccgcca 5460 ctagcgcaac ccatgccacc gccgctaaca cagcccacgc ccccactaac aacaccagcg 5520 ggcccggacg cccccaacac ccagacacca cccacagtag cgtccctcac aacggcccac 5580 cagcagccaa aaagtaagaa acaacgaccc aacgccatca agcgcctaca gcaaaagatt 5640 gatgctgacc cccggcacga cccactgcca cctaccgata tgcgcagcca cacagagaaa 5700 tctggcttta tactaaacaa cgaccccgat gaaaatgact ccggtccccc gaactttgtc 5760 atgtgggcac cacactactt tgccaatgta caaaaagaca tgcaaggaac caaccgcatg 5820 cctcgacttg gaattccccg ccaccacagc aggttcccgc aaaaccgcac tcgccccaac 5880 ccgacccgta caatcccagc ccacctcgca gacccctact tcctgggtag ccaggtggac 5940 gacaccaatc cgctaaatcc cgtcacagct tacatatccg caaacccgcc aacaacacct 6000 cccacctgtg cttacgactc ccgaccgaga tgggcagaca ctggggattt catgattgca 6060 cctgagcctc caaactggcc gaacatcact ggtgccacac cgggggacag ccacgacccc 6120 cgccaggctg accacgaccc acaaggacaa atgcgtgcag aatatatgca ggagcgtgaa 6180 cgcctccgtg cacacttgga tgaccaccca ccccaaccaa ggcccttcaa gcgttggatg 6240 ggaatccgac tccagctcct gacaggcaaa ctaccytggc cgacgcaggc ataccacgtt 6300 gcccaagcag cgaagcaact agtgtactcc ctgtaccacg acattgagga aatctcgcac 6360 attcaagctg ggcccatcac taaagccgtc actgccccag agaagggtgc cagtctccac 6420 accgagcgct actatgaggt agcttggaag gacacaacca ttgttaaagg agcggctgag 6480 gcctacaccc gcatgaagta tactgggact gctgccgcac acatgggata cgggcaagcg 6540 tgccacaaat atttcctgcg catgaaatgg gaccccacca gcgaacccgt agacaaatgc 6600 cccaccctcc aagggtggac ccctgccatc tccgaacacc tgcggaaact tgcagaaccc 6660 cctacaccca cgccacaacc caagcccaaa gacgcccgtc ttacacctgc ccaacagcaa 6720 gggcgatgga cgcaagccga cgtggccgcg tcccacgtcc gacgcaccac caaacagaac 6780 acgcgcattg acaccaaccc gtgcaatccc tacctcgacg tccaccccac cggaaggtat 6840 acaatccaaa tcccagacac atacactgat gaacgcaccc tcgcccacgt ctacaaccct 6900 accggacgcc acatagcatc cgttacgcta ggtaccttgc tgactcttaa agctcggtat 6960 gaacacgccc tatctaccca ccatgacgca accagcccgc accgatctca agggggcttc 7020 ccggaagact tggctaaact gctactccgc acacgggtag acagcaaagg caagacatgc 7080 aagtcggccc ccgcacgtga gcgctctctc cccgagtccc tgaagctggc gctgtayaac 7140 tgccttgcct cgacacactc agaccctgac cgcccagtgg ggctggtgga acgstttgcc 7200 tctccactca actgctcggg ccgggccact aaatattaca gccaacacgc agctgactcc 7260 ctcttcgggg cctcacatga tgcgtacacg gccccgtttg atggcctttc cctcgcccac 7320 cctgaccctg acccagacgc ctgcacaaag gcactggcgt gggcaacagc atcgatcgaa 7380 cacctagaac accaggccac ccccgccatt accttcatga ttatcccaca aggcaaaggc 7440 gccgccagcc ggtgccttgc gagtccacat gtcaccatcc tggccctgat tccgcctggc 7500 gtatccgcct ctagcatgtt agatggcctc ggggcggccg acgaggctgc cagcagcaag 7560 cagggcctgc agctactcct cattgcgaac accacagggg tacaagcact caactccaac 7620 aacaacctga cacgtctcca aacctccctg gccgaacttg gagcggtgcc gccaccgacc 7680 tcgcaacccc cagggcgccg acaaccactg gagtggcccg cgcgcgtaac cctcccccgt 7740 gaccgcacag ccccaccggg ccgaccatac agccaccgtg caaaagcccg tgcaaagcag 7800 ctaactagac cccaagcccc agcagcacgg ggcctggaca catgccctac gcccatgccc 7860 acgctctctg ccctctacac cacacgagct gccttcccca ctcggagcag ctacctgtac 7920 gatgccgcca ccattgtttg gaccgacgga tcttgcatca aagttgcggc cggtgaagat 7980 ggcaacgaaa tgaacaaact cggtgcttgc gcgtggtccc aacagaccga acacatctac 8040 gtcgaaccag gcggcctcaa cagcaccaac acaatccaga gagccgaact atcagccatt 8100 ctcgttgcgt tagaatacat ccgcccccga ctctgtctgc tgggcatgac ccaagtaacc 8160 atagccagtg attccctagt gtctctctac ttgatccgcc gagcccttac tgcaccacaa 8220 cgactcctac tatccaagca taaaaacctg ctggaaaaca tactccacga gctacaaatg 8280 tatgccgacg ccggggtccc agtccagctg ctgaaagtcc gctctcacac agggctccac 8340 ggcaacgagg aagctgacaa aggtgccgcc aaagtcgccg ccaacgacat ggaggccatg 8400 gggcaggcca tacgcttagc accagacgaa ccgtggtcca acatctggtg gcccaaacgc 8460 aactctgaca atttctacct aagtgaccta aaccgtggag tcctgaactc attgcccccc 8520 gaagcacaac aaggctttac caataataca ccagtgctgg agcaatgggc ggaagcatct 8580 gccaacatgt gccccaccct cagcaaccgc accctcgcca gctccaccag gcacccctgg 8640 ctggtcaagc aaatactcta cgcccgctac ggctacctct tcaacgcaca gcttaaacgg 8700 cgctaccgcc tgggagggaa cggcaactgc cccctctgcc acacccctga ctccggaggg 8760 cacatcctcg gaggctgcac acaccctaaa atgaaaggcg cgtacatcaa tcggcacaac 8820 tgggcagtcc agcgtgtcgc caaatgcatc tccaaaggga cctacgggaa cgggttcatg 8880 gtcatggacg cgggccccct agccgacctg ccacactatg cgtctgcaca acgccccccc 8940 gcctggctgg ctgcggagag gacatcatca ccccttatta caaggctcag acctgacatt 9000 ctgttcatcc caaacctgcc cgcccatgaa acacatagag aagggtaccg cccgcccctg 9060 aacaaagcac ggctgcccat ctatatcctt gaaatcggtt acagctctga cgttcgccat 9120 ttggacaagc gggaagaaaa atctcaacaa caccaagaac tcaagcgact gctggaagcg 9180 gccggatgga cggtccacta ctccaccaat gaatgccttg cgctcggagt gactggggcc 9240 atcccttgca acctcactaa gctgctcacc tctcaacttg gggtgaccca cacagccgca 9300 cgagtatgcg gagagaggct ccacacacac gctgtgaaca gtgcgggcgg cattatcaaa 9360 atgcgccgag taaccgagca ggaactcaaa ccgagaggtg ggtgagccac ggctccccca 9420 cacctcctcg gagtgcgtgg tgaggtggca gcgcgagggg gatgggcctc ctcgccctag 9480 ccaccttacc ccgataatcc acgccaggcc ccatgggccg gcgtttgaat tattattatt 9540 att 9543 // ID RandI-5 repbase; DNA; CHL; 7466 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 17-AUG-2004 (Rel. 9.07, Last updated, Version 1) XX DE RandI-5 is a family of non-LTR retrotransposons - a consensus DE sequence. XX KW RandI; Non-LTR Retrotransposon; Transposable Element; KW AP endonuclease; RNaseH; RandI superfamily; RandI-5; KW reverse transcriptase. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-7466 RA Kapitonov V.V. and Jurka J.; RT "RandI-5, a family of RandI non-LTR retrotransposons from the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(7), 200-200 (2004). XX DR [1] (Consensus) XX CC RandI-5 is a family of RandI non-LTR retrotransposons. CC Approximately 50 copies of RandI-5 are present in the genome. CC They are ~1% divergent from the consensus sequence and are CC likely mobile. The RandI-5 elements are usually flanked by CC 10-15-bp target site duplications. The 5'-terminal portion of CC the consensus sequence is incomplete. The consensus sequence CC encodes the 2457-aa RandI-5p protein (nucleotide positions CC 1-7367) composed of the reverse transcriptase (aa pos. CC 460-750) and RNase H1 (aa pos. 1930-2100) domains. XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="RandI-5p" FT /translation="XXNAADRTGPLDSADRLHQRFLAQTGLLPARGTMTTE FT GEWSYEQTQHGVTTHRSRIDDILMCTTTLTACPSPHEHVSTVAGNFDHKPV FT HIKLPADALHLWEAPQPAAPPATTSEDSPRWSRVTLPITKKQLAAAAAHLE FT DIMVEATADLHNASKGAVQVIDNALKQHTKNPTTYPANTMQWDLAHNRIVQ FT QTDVNQLAERLGKALEDGLTCLLEECTLKPPHTGKHHLSRSAARHLKPLWD FT KEVELKQTLTNLTAGGAAPDTTTEGEAATIRGQIKDCQAQHRQLVADRAKA FT QREAAAAALQHTLATRPAHGHKRIFQKEATERGLPAVRNPETGEVVTDPPS FT ILTALDGHFRKLSAPPRGPRTGEYRLSPGATRGYPFELPNATDAFTLERNR FT HPEAHSMLSSMADPANFDHCIAHLSRNKATGPDGIPNELLRILPHGMKRAL FT HNTLKIMYVKSTIPESWATSDTVLLPKPGDALDIKNKRPIALANTCYKLYT FT SMLTLGIGELAGPLQLFSESQEGFRAYCNTERQILNLVHVLEDAALYGKDV FT YAVYVDYSSAFNTIDQDRLLQIMFDLGLPDDLIRAVRNLYAHATTRIRTEH FT GCTSAIPIERGTVQGDTLSPVLFILFMEPLVRWLHAGGRGYRYGCLSPEDN FT ARYHCSAAAYADDLAALTTTLEDLQIQCDKISDYADWASLRVNHTKCAATA FT IWHNKSRSTPTLDGPTGQATLNAMHRELTNKLRIGNTTVPYYPPTKPYKYL FT GVQLTFSLDWKAQVSRVTETVKEKGTAIATSLATPAQRLRMLQQCVHTSIA FT YAFAAMPYTKQDIANFDATLAGYAKRCYGLPRSFPTRTTLLPIDEYGLGLG FT SLLPQYARVANRALVLALNDTGRLGVVTRALLPRQHSLAGASLAHHLPADR FT GRFFTTLQQMTLAREHGIQLFYKGTPYATPTWSIVEALDAAAAARAAAPLP FT VEHVLPLAELQLEFSHLVDRNTSRHLISTRELEKLLGASRVRHKHKVALNR FT LSLALVATAGLDGDQMPPEKHGSPNPLPPTQRVIPPLEELQALSARNDPRP FT PGMAPPLTALMTGGPAPQPLTPAPNTTTQMDIEEASPLYHNPPLQYTPTAL FT GQMAPTPAPAADTAAPVAQPEANLRTRPRKTPVMRTPTHPTAHSRRVTKHA FT AQKARIAENTRTTPDDLLTGDKAAISTAMRLTNGDESLAVGTRGYETARSV FT FTALAKDMSRHNHLCKPHPRDPPSFNNRTPVTRTQRPLAPAITTLLTANAT FT MPSTGAVQTYLHQGQPAAAAAEDPMDVDAAAPEHDQMEVDDSQHDLMEADD FT RQQASPLRLSRRRGKQTIWKLARERVVKDLVHPPTDDCTNGDAQLTYDLDN FT DGQELAAILYRTNQQTATDKEIARKRKAGEDTTNTREPYFAVQWKPSVIPP FT GLVTAFSRMGYATVTDVRCYKHPLQARPRATQPHPDGAGTTDADTATAAQP FT AGILPAHAPPTAPAPGPLETTHAPRAALFSSRSTRLLRQVTWANSNERERD FT LKHDLEWEELISEYTNRLAAGPDAPARPTRPPKDRNLTMRQRQGRHDDDPD FT DQATRARVTRAATTISTVPSDPYHDVSPPGSYAITTPTAGRALPREAHVYE FT PSGRWLGMLTYTRLQTLWESFNRTNNNRTKAFEEAVAALLHRYRRDPGQTD FT HKTAPQHQTSLPSATVRAVVRSLRNTGRGRVKEMFASPLNHTTDADEYWSR FT EAADVVFGAQHDAYSTAWTGLHYAHPPANPDASRQALVWALASAETVNNSP FT EPTLTILVLPKCATYPHTQWLTHPLCRRLATWTNGELDSPGTAGMEPGLGT FT DVPLQKQRGAILVAVGNTAGLNAFTQSWKGLRSSLVAGPHAPATVADTHLG FT SGTLNTSCPHIPRHTSKHAKYLATHLNIATVVTPDEAPATPRLNRTQFQRT FT LPLAHNPEEIVWTDGSVSKTTTENGRELQVAGACAWHKDRVVYVNPNGFGC FT TNTITRAELAAIRAALEEFGGETSMFAKKTLTIASDSAASLYLIKRAIDEP FT RRLHLSKHKALLSSIADLLHARAKRDAHTVFLKVVSHTGLHGNEEADKGAA FT DVATSTKPADVSELADNNPHAAHWWPATTRTREDGSTTVHYISDLNRGLLK FT TLDPSSHGGHSKHTLYTTKWAESAPHLDTRASTAYMTHPGAGHSLRRFIQN FT ARQGGLICPARLALFKLRASDTCALCAPIHQRAGSPPERGNAGHLAGHCAH FT PQLVGARIAKHNAAVRMIAECLHHGANGGGYMLMDASSADARPEYCAGTCP FT PKWMLRNNIPEADRKRMRPDILFIPSLPRSDVGASGPRGLTTAGRRQHKVY FT LIEVGYTSDLHHSEKCDQKQAQHTRLADALRDAGWDVVYKKEQIVTLGHGG FT TVSNTLEPLLRSLGATTTSAKSCCSRIHMHSVISLRTTSLLYYRLEREMGI FT VNSRQVGPTGGATAAGPSPRDPG" XX SQ Sequence 7466 BP; 1833 A; 2609 C; 1847 G; 1177 T; 0 other; ctaacgctgc agatcgcaca gggccgcttg actctgctga ccgactacac caacgctttc 60 tggcacaaac tggcttatta cccgcacgag gcaccatgac cacagagggc gaatggtcct 120 atgaacaaac gcaacatggt gtcacaacgc accgcagccg catagatgac atactcatgt 180 gcacgacaac gctcaccgcg tgccccagcc cacacgagca cgtaagtaca gtcgcgggta 240 actttgacca taaacctgta cacataaaac tgcctgctga cgcgctgcac ctatgggaag 300 caccgcagcc cgcagcgcca ccagcaacaa catctgaaga cagcccacgc tggtctcgcg 360 tgacgcttcc aatcaccaag aagcagctgg ccgccgctgc cgcccacttg gaggacataa 420 tggtcgaggc cacggcagac ctgcacaatg ccagcaaggg cgcggtgcag gtgattgaca 480 atgctctaaa acagcacact aaaaacccaa ccacctaccc ggcaaataca atgcaatggg 540 acttggcgca caacagaatt gtacaacaaa ctgacgtgaa ccagctggct gagcggttag 600 gcaaggctct cgaagacggc ctcacctgcc tacttgaaga gtgcacacta aaaccgccac 660 acactgggaa acaccatctt tcccggtcag ctgcccgaca tcttaaacca ctatgggaca 720 aggaggtgga actcaagcaa accctcacta acctgacagc cggcggtgca gcgccagaca 780 caacaacgga aggagaggct gccaccatac gaggtcagat aaaggactgc caagcccaac 840 accgccagtt ggtggcggac cgagccaaag cgcagcgtga agctgctgcc gctgccctcc 900 agcacacgct cgccaccagg cctgcgcacg ggcacaaacg catctttcag aaagaagcaa 960 cagaacgtgg cctccccgcg gtccgcaatc ctgaaacagg cgaggtggta accgaccctc 1020 ccagcattct aaccgccctg gatggccact tccgcaaact gtcagccccc ccacgagggc 1080 cccgcacagg tgaataccgc ctgtcacccg gcgcaacgcg tgggtacccg ttcgaactac 1140 ctaacgctac ggatgccttc acgcttgagc gtaacaggca cccggaagca cactccatgc 1200 tttctagcat ggcagacccc gccaactttg accactgtat cgcccacctc tctcgtaaca 1260 aagccacggg cccagacggc attccgaacg agctgctccg catcttgcct cacggcatga 1320 agcgagccct acacaacaca ctgaagatca tgtacgtcaa gtccacaatc ccggagtcct 1380 gggccacttc ggacacggtc ctgctcccaa aacctgggga cgctcttgac atcaagaaca 1440 agaggcccat cgctctagcc aacacttgct ataaactgta cacttctatg ctcaccctgg 1500 gcatcggcga actggcaggc ccactccagc tgttcagcga atcacaggag ggcttccggg 1560 cctactgcaa cacggaacgg cagatactga accttgtgca tgtactggag gacgctgccc 1620 tgtatgggaa agacgtatac gcagtatatg ttgactacag ctctgcgttt aacacaatcg 1680 atcaggacag gctactccag attatgttcg atctgggcct gcctgacgac ctgatacgcg 1740 cagtccgcaa cctatacgcg cacgccacca cccgcattcg tactgaacat ggatgcacat 1800 ccgccatccc aatagaacgt ggcacggtgc aaggtgacac cctgtcacct gtactgttta 1860 tcttgttcat ggaaccgctt gtgcggtggc tccacgcagg tgggcggggc taccgctacg 1920 gatgcctcag cccggaagac aatgcacggt accactgcag tgccgccgcg tacgctgacg 1980 acttggcggc cctgaccaca acgctggaag acctgcagat ccaatgtgac aaaatctcgg 2040 actatgcaga ctgggcaagc ctgcgagtca atcacaccaa gtgcgcagct accgctatct 2100 ggcacaacaa gtcccggtcc accccaaccc ttgacggacc caccggccaa gccacgctga 2160 acgcgatgca ccgtgaactg actaacaagc tacgaattgg taacaccact gtaccgtact 2220 acccgcccac aaaaccctac aaatacctgg gggtgcaact caccttctca ctagactgga 2280 aggcgcaagt ctcccgcgtc acggaaactg tgaaagaaaa gggcacggcc atagccacgt 2340 cactggccac gccggcacaa cgcctacgca tgctgcagca atgcgtccac acctcaatcg 2400 cctatgcatt tgctgcgatg ccatacacga aacaggacat cgcaaacttt gacgctacac 2460 tggccggata cgcaaaacgg tgctatgggc tgccacgcag cttccctacc cgtaccaccc 2520 ttctcccaat cgatgaatat ggcctaggac tgggctcact cttaccccag tacgcgcgtg 2580 tggccaaccg cgcgcttgtc cttgctctaa acgacacagg ccgactgggc gtggttaccc 2640 gtgccctcct gccacggcag catagcctgg cgggtgcgag cctcgcacac cacctacctg 2700 cagacagagg gcgcttcttc acaactctgc aacagatgac cctcgctcgt gagcatggta 2760 tacagctctt ttacaagggc acaccgtatg ccacccctac ctggtcaatt gtcgaggctt 2820 tggacgccgc agctgcagcg cgtgcggcag cacccctccc tgtcgaacac gtgctcccgc 2880 tagcggaact ccagctcgaa ttcagccact tggtcgaccg caatacgagc cgccatctga 2940 ttagcacccg ggagctcgag aaactcctgg gggcaagcag ggtccgccac aaacacaaag 3000 ttgcactcaa caggctgagt ctggccctgg ttgccacggc gggattggac ggagaccaaa 3060 tgccaccgga aaaacacggg agcccgaacc cgcttccgcc tacccagcgt gtgatcccac 3120 ccctggaaga actccaggcc ctctccgcac ggaacgaccc tcgacccccg ggcatggcac 3180 cacccctcac cgcactcatg acgggaggcc ccgccccaca gcccctcacc ccagccccaa 3240 acacaacaac ccaaatggac attgaagaag ccagcccact ctatcacaac ccgcccttgc 3300 aatacacgcc aactgccttg gggcagatgg ccccaactcc tgccccagcc gcggataccg 3360 cagctcccgt cgcccaaccc gaagccaacc tgcggacacg gccacggaag acgcccgtca 3420 tgcgcactcc cacacaccca actgcccact cccgccgcgt cacgaaacac gctgcccaaa 3480 aggcccgcat tgcagaaaac acgcgcacaa cacccgacga cctgctcacc ggtgataaag 3540 ccgcaatcag cacggctatg cgcctcacaa atggagacga aagcctggcc gttggcacac 3600 gtgggtacga aacggcacgt tcagtgttca cggccctggc aaaagacatg agcaggcaca 3660 accacctctg caagccccac ccaagggacc ccccttcatt caataaccga accccggtta 3720 cccgcacgca acggcccctg gcccccgcaa taaccactct gctcactgcg aatgcgacca 3780 tgccgtctac tggcgccgtg caaacgtacc tccaccaagg gcagccagcc gcggccgccg 3840 ctgaggaccc aatggacgtc gacgccgcag cacctgagca cgaccaaatg gaagtggacg 3900 atagtcagca cgacctgatg gaagcggacg ataggcagca agcctcacca ctcaggctgt 3960 ctcgccggcg gggcaagcaa acaatttgga aactagcccg ggaacgtgta gtcaaggacc 4020 tggtccaccc tcccaccgat gactgcacca acggagatgc gcagcttacc tacgacctgg 4080 ataatgatgg ccaggagctg gcggccatcc tgtaccgcac aaaccaacaa actgccacag 4140 ataaggaaat tgcacgcaaa cgcaaagcgg gcgaggacac aaccaacacc cgcgagccat 4200 actttgcggt ccaatggaag ccatctgtca tcccacctgg cctagtcacc gccttctcac 4260 gcatgggcta cgccacggtc acagacgtgc gctgctacaa gcacccactg caggctcgcc 4320 cacgcgcgac gcagccccac ccagacgggg ccggcacgac agacgcagac acagcaacag 4380 cggcccaacc cgccggcatc ctacctgcac acgcgccgcc caccgcacct gcgcccgggc 4440 cactcgagac aacacatgcc ccacgtgcgg ccctcttttc atcgaggtcg actcgtctcc 4500 tgcgccaagt tacgtgggca aacagcaatg agcgtgagcg tgaccttaag catgacctgg 4560 agtgggagga gctgatctca gaatacacca accgacttgc agcgggacct gatgcacccg 4620 cccgccccac acggcctccg aaggatcgca acctgacaat gcggcaacgc cagggccgac 4680 acgatgacga cccggacgat caagcaacgc gtgcacgggt gacccgcgca gcaaccacca 4740 tctctaccgt ccccagcgat ccctaccacg acgtttcccc cccaggcagc tacgccatca 4800 ccacccccac agcaggccgg gccctccccc gagaggccca tgtctatgaa cccagcggac 4860 gctggctcgg catgctcacc tacacacgcc ttcaaacgct ctgggaaagc ttcaaccgga 4920 ccaacaacaa ccggaccaag gccttcgagg aggcggtggc tgccctgctg caccgatacc 4980 ggcgcgaccc ggggcaaacg gaccataaga cggcaccaca gcaccagacg tccctcccct 5040 cggcaactgt ccgagccgtt gtgcgcagcc tgcgaaacac cggcaggggg agggtcaagg 5100 aaatgtttgc ctcacccctg aaccacacga ccgatgccga tgaatactgg agccgggaag 5160 ccgctgatgt ggtgttcggc gcccaacacg atgcctacag cactgcatgg acggggctcc 5220 actatgcaca ccctcctgca aaccccgatg cctcacgaca ggccctcgta tgggccctcg 5280 ccagtgccga gacagtcaac aactccccgg aacccacact gaccattcta gtcctcccaa 5340 agtgcgcaac ttacccacac acacagtggc taacccaccc gctctgccgc cgactcgcca 5400 cctggacgaa cggagaactg gactccccag gcacagctgg gatggagcct ggactcggca 5460 ctgacgtgcc cctgcagaag caacgcggag ccatactagt tgccgttggc aacacagctg 5520 ggctaaacgc cttcacgcag agctggaaag gccttcgctc ctcactggta gccggtcccc 5580 acgcgcccgc aaccgtcgct gacacccacc tcgggtcagg tacgttaaac acgtcgtgtc 5640 cccacatccc acgtcacacc tccaaacatg ccaagtacct cgcaactcac ctgaacattg 5700 ccactgtggt cacgcctgac gaagctccag cgacaccgag gttgaatcgg acacaattcc 5760 aacgtacact gccgctggcc cacaaccccg aggaaattgt atggaccgat ggctcggtat 5820 ccaagacaac gactgaaaac ggtagagaac tacaagtggc tggggcttgt gcttggcata 5880 aagaccgtgt tgtctacgta aacccaaatg gatttggctg cactaacact attacacgcg 5940 ccgagctcgc tgcaattcgt gcagccctgg aagaattcgg aggtgaaact agtatgttcg 6000 cgaagaaaac gctaaccatt gccagtgact cagctgcaag cttgtatcta attaaacggg 6060 ctattgacga accacgtcgc ttacatctga gcaaacataa ggcacttttg agttcgatcg 6120 ccgatttgct ccatgcacga gcgaagaggg acgcacacac cgttttcctc aaagtcgttt 6180 ctcacacggg tctgcacggt aatgaagaag ccgataaagg tgcagcagac gtagctacaa 6240 gcactaaacc ggctgacgtc tcggagctcg ctgacaataa cccccacgcc gcacactggt 6300 ggccagcaac gacccggacg cgggaagacg gctctaccac cgtgcactac atctcggacc 6360 tcaatcgagg cctcctgaaa acactagacc cttcctccca tggcggtcat tccaaacaca 6420 cactctacac gaccaagtgg gcggagtccg ccccacacct ggacaccagg gccagcaccg 6480 cctacatgac gcaccccggc gctgggcaca gcctccgacg cttcatccag aatgccaggc 6540 agggaggcct catctgcccc gcccggcttg cgctgttcaa actacgtgcg agtgacacat 6600 gtgcactctg cgccccgatt caccagcgcg ctggatcccc ccccgaacgg ggcaacgcag 6660 gccaccttgc cggccactgt gcgcaccccc aacttgttgg ggcccgcatc gccaaacaca 6720 atgccgccgt ccgtatgatc gcggaatgcc tgcaccatgg cgccaacggt ggcgggtaca 6780 tgttgatgga cgcaagcagt gcagatgcac gccccgaata ctgtgcgggc acctgccccc 6840 ccaaatggat gctgcgcaac aatatccccg aggccgaccg aaaaaggatg aggcctgaca 6900 tcctgttcat cccatcccta ccacgctccg acgtgggggc tagtgggccc cgcggcctta 6960 caaccgccgg ccgacggcaa cacaaagtct acctgataga agttggttac acatcggacc 7020 tccaccacag cgaaaagtgt gaccagaaac aagcccagca cactcgccta gccgacgccc 7080 tgcgggatgc aggctgggac gtagtatata aaaaggagca gattgttacg ctgggccacg 7140 gcggcactgt gtcaaacacc ttggaacccc tccttcggtc gctgggtgcc accaccacat 7200 cagcaaaatc ctgctgctcg cgcatacaca tgcacagtgt catcagcctg cgcaccacat 7260 cccttctcta ctaccgcctt gagcgcgaaa tggggattgt gaactcacgc caagtcggcc 7320 ccactggcgg cgccacggct gctggcccca gccctcgcga tccaggctaa ctttccacct 7380 tttgacacgg tggggtgagc aaaactcact cctccttaag aaacgcggcc tccttcgtga 7440 accgcgtaca tattattatt attatt 7466 // ID TE1-2_CR repbase; DNA; CHL; 664 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE TE1-2_CR is a family of nonautonomous transposable elements - a DE consensus sequence. XX KW Transposable Element; Nonautonomous; TE1-2_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-664 RA Kapitonov V.V. and Jurka J.; RT "TE1-2_CR, a family of unusual nonautonomous transposable RT elements in the Chlamydomonas reinhardtii genome."; RL Repbase Reports 5(3), 67-67 (2005). XX DR [1] (Consensus) XX CC TE1-2_CR is a family of nonautonomous transposable elements. The CC consensus sequence is less than 5% divergent from CC TE1-2_CR copies. Classification of this family is not known. It CC is CC possible that it belongs to a novel superfamily (see TE1-1_CR). XX SQ Sequence 664 BP; 128 A; 215 C; 183 G; 138 T; 0 other; acaaggggtg tcagctttgt acagctggaa aatgggctga gcccccgtac gcttgggagc 60 taataggcgt gtaacgcccc gtcagagcct ccatgccgct acgacgcgca ggcgcatcgg 120 ctagggacac tcgggttggg gccgtgcggg atttcccgcg ccacgcatgc gtgccttgtc 180 acgttcccaa acaccccaag catttctttc gcctcctgcc atactcaaca cctccccggc 240 gccccacgct caaagggcta ggcgcggggc gggggcgacc tcggggcttc tcggcattct 300 ggcgcgtcgg caggcatggc tcagcaagtc gctcaaaact ttcgatatac ttccccgctt 360 tgcttgccgc caacatagtt tgccttatag actgcatatc agcgggcacc ggatgctttc 420 ctcgcgacct cgcgaaagtc aattgtccat ggacgcgcga aggcacttct cgctccgccc 480 gaattttccc tgccacctat ttacatgcaa taggcattaa caaatactaa tacgtagcag 540 tcgcggtttg cgccagtatg ctatgttttg ccgacgcacg aacgaaccag ccacggacag 600 gggtgttcct ggggggcttc gccccccccc tggaacgctt cgctgggggg gctcagccca 660 aaaa 664 // ID DNA-1-7_CR repbase; DNA; CHL; 710 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE DNA-1-7_CR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; 7-bp TSD; DNA-1-7_CR; KW unclassified superfamily. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-710 RA Kapitonov V.V. and Jurka J.; RT "DNA-1-7_CR, a family of nonautonomous DNA transposons from the RT green algae genome."; RL Repbase Reports 4(2), 30-30 (2004). XX DR [1] (Consensus) XX CC Several hundred copies of DNA-1-7_CR are present in the C. CC reinhardtii CC genome. They are ~99% identical to the consensus sequence. CC DNA-1-7_CR is characterized by 11-bp TIRs and 7-bp TSDs. CC The genome harbors several young families of nonautonomous CC transposons CC related to DNA-1-7_CR, DNA-2-7_CR, DNA-3-7_CR, and DNA-4-7_CR. XX SQ Sequence 710 BP; 156 A; 232 C; 173 G; 149 T; 0 other; cagggggggg ggtaggaaac aaagtgccag accccatgcc ccgaaatccc ccgggacccc 60 ccttgagggg acgaccgacc ccccgacgga caaaaccttg ttgttcaggg gttaaaaccc 120 ctccatttaa taccaaagtg gtgtcaatcg actcctcttg acgagcacta tcactccgtg 180 tcatcagcgg tcctgttctg agacttcggt caccggcggg atcctatcca tggaaatccc 240 ctcgacccag accaccccgc gctcaactct ctgccgcgtg acgaatggac aatgtgcatc 300 ggcagccttg ttagacgtga actcgtgcgc accagcgctt cagagcccag ctgtgtcagt 360 gcgcatgtgt tgaatgtgga gacatgtccc gacatgtcca cgcccggacg cgcaacttag 420 ccctcccacg ctttgtttgt gcttattacg tcataacgtg cacatatgag cctcgtactg 480 ctgcgtttgg cccctagttc tgcttgtggg gcgcttcccg aaatccgcca ggatgggctc 540 ccgaaacccg acggaccccc agggacgagg accgaccccc caactttggc ggcccagaac 600 ttcctcacca gacaccccct taaaaaacta aaagtacgtg cataatgctc tcctaaagct 660 ctgtaatttt gagcatgggg taagaatctg agctttctac cccccccctg 710 // ID CRTOC1 repbase; DNA; CHL; 5662 BP. XX AC X56231; XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 2) XX DE C. reinhardtii transposon. XX KW CRTOC1; transposon. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-5662 RA Day A.; RT "CRTOC1."; RL Direct Submission to Genbank (06-DEC-1990)Day A., Genetics RL Laboratory, Biochemistry Department, University of Oxford, South RL Parks Road, Oxford OX1 3QU, UK.. XX RN [2] RP 1-5662 RA Day A. and Rochaix D.J.; RT "A transposon with an unusual LTR arrangement from Chlamydomonas RT reinhardtii contains an internal tandem array of 76 bp repeats."; RL Nucleic Acids Res 19(6), 1259-1266 (1991). XX DR Genbank; X56231; Positions 1 5662. XX SQ Sequence 5662 BP; 1015 A; 1685 C; 1655 G; 1307 T; 0 other; gcgctgactt atcgcagcgc agtggagata agtctagtta ttgcgacgta actgccgtgt 60 tgcgttagag tcacgcacgg cgcaggacgc tcgggtacgt gcctgtgcat ggggccgaac 120 cgagctgggt cttgtacgcg tcaggagcac acggcgcctt atctgccgtt gtgcttctgt 180 actgtatttc ggatcgtccc tctgccggga cggtgacctc agtgtgtcgc acttaaacgt 240 tccctacatt tctggacttt ctttgcaatc ctatacctgg ttctaactat acttgaccat 300 gtatggaccg aataagcgtt taatatatac tcagacggag ttgcagcgtt ttgttgcgcg 360 atcctgctca atggaacccc ttagcttgat cacgctcgct ctctgatcgt aagggaatgc 420 ccttcgaagc ttctctggcg ctttgaacca cgctttggtt cgggggccgc attcgggagc 480 aaatcggagc agagcggagc tttcaagcgg agcaaaggcg cgcgaagcgt tgcggacaag 540 gcgttcggca agtcactgaa agcaaaaggg catgcacagc tgtgcgggcg ggctacttgc 600 ttgccatgcg cggtcctgct tgccgtgcct tcgtgtctac ccgtcgcttt acagttcaca 660 gctttgtgca atacctttcc acatcttcca ttgtgccacc cccacctccc caagaccctc 720 aggacttttg gcgcggtact tctcctgtct gcctatccag gccgcagggc ccgcgtgccc 780 ttggggaagg ggcgtgtgtg ccgttgggat ccggcctgtg cgccgcaagc aacgggcttt 840 gcgcccttgc cttatggaca atggacggca tacgtgccct tatgatacgg cctgtgtgcc 900 gcaagcaatg ggctccgcgc ccttgcttta tggacaatgg acggcatacg tgcccttatg 960 atacggcctg tgcgccgcaa gcaacgggct ccgcgccctt gctttatgga caatggacgg 1020 catacgtgcc cttatgatac ggcctgtgtg ccgcaagcaa cgggctccgc gcccttgctt 1080 tatggacaat ggacggcata cgtgccccta tgatacggcc tgtgcgccgc tagcaacggg 1140 ctccgcgccc ttgctttatg gacaatggac ggcatacgtg cccttatgat acggcctgtg 1200 cgccgcaagc aatgggctcc gtgcccttgc tttatggaca atggacggca tacgtgcccc 1260 tatgatacgg cctgtgcgcc gcaagcaacg ggctccgcgc cctggcttca tggacaattt 1320 acggcgtacg tgccctcatg atacagcctg tgcgccgcag gcaacgggct ccgcgccctt 1380 gctccatgga cacttcacgg cgtacgtgcc ctcatgatac ggcctgtgtg ccgcaggcaa 1440 cgggctccgc gcccttgctt catggacaat gcgccgcgta cgtgttctta tgatacggcc 1500 tgtgcgccgc aagcaacggg ctccgcaccc ttgttttatg gacaattcac ggcatacgtg 1560 cccgtatgat gtgacctgtg tgccgcaagc aacggcttcg cacccttgct tttgggtaat 1620 agatggcata cgtgccctta tgatacgacc tgtgtgccgc aagcaacggg ctccacactc 1680 ttgcgttgtg gattatagac ggcattgaaa tgcttacgtg ccttcgttgt acatgccttt 1740 gcgttgtgga caatgtgtgg tctgagcgcc acgttcggat acggcgtgtg tgccgccagc 1800 aacaggcttt gcgcctcgca tcatgtgtct tgcgatatgg cctgtgtgcc gcatgcaatt 1860 atgctgcctg ccctgtcgtt atggacgctt cgacttgttg cgtgccctgc tgcgtgccct 1920 gtcgcaatac gccttgagtg taccgtgcac ggcaagcctg cgcctcgcta ttgcttcgtg 1980 ttgacaacgg agcgggctta cgtgatcatg cgtcaccctg tacgtcttga ggtccgcacg 2040 cacatcatac tatcacgcgg caccaccctt gtagtttggc tgacgcaccc caagccaacc 2100 tatatgcatt cgatgtgtgc actaggccca agtgccgaat ttgtttttcc ggatatttcg 2160 ccctcagtga gcgatgtgga gttttgtgca gttcggccag catgctatgc ccagccaata 2220 acaataccgc atgacgcata actataccgc atgacgcata aacatgcctt cgtgccctgc 2280 accaggcatc ggacgctgtg tcacgcagtg agcccgaccc tgcgcaacca acattttgtt 2340 gcgagatacg gtcggagctg ggattacagc ctgcctggtg ggtttggatg gcgcccgtgt 2400 gttgggctgg gctgttgctg ctcgcggtgg ggcccaccac caagtcacgg cacccatccg 2460 ccctcccctc ttgttggccc acccgcctgt acacatgcca gtcacccgct cgccatcctg 2520 tgaaagcggg tagccgactt ggcaagcgct tttcctgaca cttggcgcag gtttgagtgg 2580 gataccagaa tggtctgaat gtagttgttg gataaccagt acactgcggt gtgtagctgg 2640 ttagcgggag tgccgtgcat gaaacacgct actcgacccg ccatgcccgc gcgatggtac 2700 caccaaccgt tcaacccaga tccatgccgg ggtagcatcg accccacagt cagactgata 2760 gctcctatcc aggtgtcagg cgccatgtat gtatctgtgg acgcgtcaag ctggcttgtg 2820 ccgtagcgtt ggccgcctgt atggcacggc atctgtgtca cgttatggcc tcatgcttac 2880 cgtagtcacg cggcttgcgt gctgtgcggc acgctccctg ccaatccttc aggacatgta 2940 tgcatacatg ttacttcgtc agagccatag caggggcagc gtgttctgtc aatgcctcat 3000 gaacccagag acccaagcca acgtacgcat tagttccgca acgcacgtca atgccaactg 3060 tatgtgtcgc ctgcccactc gcgagtggac gcctagggaa ccaaccttgg ttcctttcag 3120 ccccggcctt acttcacccg gcggggcaat tacttatcac cgaagtgcta ggagcagtgt 3180 gctatatgtc attactatta agagcgtatg gcgacacagg ctcacatgtg ggtagccagg 3240 cttggcaggc atcccaactc agcccggcct cctcacagca gtaccacgac gtgcccgtac 3300 gtggtcgagt gcggagtttg gctgccggcg tggctgtatc atctctcaca ttggatgacc 3360 catccgccac tgctgttcac tactggcacg tccctcgagt cgctcaccca ccggctccgc 3420 ccagcgttcg ctccctttgg ctgggccggg gcccgtggcg catccaaccc gccatcgcgg 3480 ccccgagtgc tccttatttc ctcccatcac tacgccttct atcactatag atacattgcg 3540 cgttccacgc gtgccgggta tccttcaccc ctccgcgccg ctcgaccagg ccagccttgc 3600 tggggttgct gaggtgttac ccttcatgtt gccctccctg ctattacggt acaccccaca 3660 gctgccgtgg cgtccggtat cggcacgtac gggacattgt gtgcatgcat ccccgcggcg 3720 tttggaggca aacattcacg tgcgcgcctg tcctgcgtcc gccggggtga tgctatctat 3780 gggtgtacct actgcttgat tggtagtgac tcttatgcaa gacactgcaa atctcaagca 3840 tggcacctag ctagcaagaa agaaattagt gttcgtggcc atgctgcacg gctgggcatg 3900 gctgcccgca tcctacacca cgacggcgcg ggtgaacgaa gggcaggttg ccgcgcgtga 3960 ctcgcgtacg taaaaccgct ctagtgttgc aactcgcgcc ttctcctgcg tggcgcatgt 4020 tggctagcct gtcccagctt cgagtcacga cgttgttatt attcccaagg ttgttccgag 4080 cagcctacaa cgtcaacacg tgttatggca tggccctggg ggccggtaga gagtaccgag 4140 gtctccagtg gttcgtgcca acacgtgcca acacgcactg ttacctttcc tgggcacacg 4200 gacggccaca gctgcccaca agccacacac ctgaacaagg atgcatgtgt ttccctgtaa 4260 cgccccggcg tcgtctgcat ggctggcgca cgcgggataa cgcatgtgtg tttctgtcgt 4320 ggccattggt gcacctgata cgtttgtgag tctggtatca tggcccttgc aaagccagtc 4380 gtgttcctat tgctgcttgt cttctggtag tgaccattgg ccgcccatga ccgacggagt 4440 gtggcgctgt caggccccgc gttggcgtcg ccctgcgcct gcagcaggtg ccggcggcgc 4500 ctccggcggc gctcatcccc gcgtgatggt gctgctcgtg cagccaatat ccccaagcac 4560 gaagctcgtt ctattgaccg ctgttgagtg tgcaactagg accgtacgtt cgtgcgcaag 4620 ctaggcgatg ggcggagcgc tccgcggtgt tcgagacaca tgatttcggt agcgcaaggg 4680 cacgaacgcc accgccatca ccgccgaccg caccttggtt tgcatgaccg gccgttgggc 4740 caagcgcttt gcgagaagag ctgcatacgc gaagccaatc aagcccagcc actagggctg 4800 ccgtcgcccg caccatgacc tcccggcgtt gaggactact accaaactct ggcagcactt 4860 tcggccacta gtgcaacctc aacacgggcg ggctggggcg ggcacggcgg acttggtggg 4920 gttatcggga gctgcgaggc cggaggtagg aggccgctga gggccacgaa tgagttgcta 4980 ggccgcttga ggcatgagtg gaggctattg tcggtttgag agattgggat tgtcgtttgg 5040 ggccgtggcg gtttgtaacg ctacacggca gtaaggagtc aataagcgct gacttatcgc 5100 agcgcagtgg agataagtct agttattgcg acgtaactgc cgtgttgcgt tagagtcacg 5160 cacggcgcag gacgctcggg tacgtgcctg tgcatggggc cgaaccgagc tgggtcttgt 5220 acgcgtcagg agcacacggc gccttatctg ccgttgtgct tctgtactgt atttcggatc 5280 gtccctctgc cgggacggtg acaacccacc cgccccccct ggtgccgccg cggattaatg 5340 tggtggcacc cgtgggcgct gcggcgtgcg tggttgtctg gactctgctg ctatcaggca 5400 cttcatacat gcgacacacc cagtactggc agcactttcg gccactagtg caacctcaac 5460 acgggcgggc tggggcgggc acggcggact tggtggggtt atcgggagct gcgaggccgg 5520 aggtaggagg ccgctgaggg ccacgaatga gttgctaggc cgcttgaggc atgagtggag 5580 gctattgtcg gtttgagaga ttgggattgt cgtttggggc cgtggcggtt tgtaacgcta 5640 cacggcagta aggagtcaat aa 5662 // ID DNA-4-7_CR repbase; DNA; CHL; 709 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE DNA-4-7_CR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; 7-bp TSD; DNA-4-7_CR; KW unclassified superfamily. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-709 RA Kapitonov V.V. and Jurka J.; RT "DNA-4-7_CR, a family of nonautonomous DNA transposons from the RT green algae genome."; RL Repbase Reports 4(2), 34-34 (2004). XX DR [1] (Consensus) XX CC 100 copies of DNA-4-7_CR are present in the C. reinhardtii CC genome. They are ~98% identical to the consensus sequence. CC DNA-4-7_CR is characterized by 13-bp TIRs and 7-bp TSDs. XX SQ Sequence 709 BP; 148 A; 226 C; 176 G; 157 T; 2 other; cagggggggg gtacgaacaa tagttccaga ccccaccccg caaaacctcc cgaccccccc 60 tgaccgacga ccgaccgtgc cgtaccgtgc cgtacacccc ctaaaycccc tctgttttct 120 tccaggatgg tgycaaacga ctgctttcag cgagctctat tagcatgtaa acacggaaag 180 ttcgtcgggc gcgtcttgac cacagccatg ttgtcgacat gttgtgaaca tcttgcggcc 240 gttcagttgc acccgtgtaa agcacgctta cgagacttac ctacgcatgg gacatacaca 300 tgcgttttcc tccggcgcgc aagcaccggt ccgcgacttg tggcacgttc tggtcaggcg 360 agacatgtcg caccccaaca tcccaaccca acatgtgttg acgggcggta gcccttcgtc 420 ttctggtagg ctcctgacgt ttcccagccg ttaaaagcat ccattacatg ctttgggcaa 480 cggtctgggc ttgcccggct gtcttatgag gcccagacga aagtttccgg cacaccgtga 540 cgaaaccttg gggaccccgg ggaccgacga ccgaccctga actttggggg cccatatctc 600 cgcaattact accctgatca aaattctgaa agcggattct gaacgctctc gtcaagctct 660 ttcaatttga agggtggggt atgtctgcgg cattcttacc ccccccctg 709 // ID L1-3_CR repbase; DNA; CHL; 1991 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE L1-3_CR is a 3' terminal portion of a putative L1-like non-LTR DE retrotransposon - a consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-3_CR; KW putative L1 superfamily. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1991-1865 RA Day A. and Rochaix D.J.; RT "Characterization of transcribed dispersed repetitive DNAs in the RT nuclear genome of the green alga Chlamydomonas reinhardtii."; RL Curr. Genet 16(3), 165-176 (1989). XX RN [2] RP 1-1991 RA Kapitonov V.V. and Jurka J.; RT "L1-3_CR, a family of L1-like non-LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 4(2), 41-41 (2004). XX DR [2] (Consensus) XX CC The L1-3_CR elements are flanked by ~15-bp target site CC duplications. CC There are several hundred copies of L1-3_CR present in the CC genome. CC They are massively truncated at 5'. Therefore, the consensus is CC not complete at its 5' end. Some copies are identical. XX SQ Sequence 1991 BP; 220 A; 763 C; 570 G; 406 T; 32 other; actcgctgcc gcccgccgct acyatgtrct ccaacraccc cttccacggc cagctgyagc 60 ccccgcacac cmtgcagatt cygccggagc agccwctgcg cgcgcccccc gcagctgcgc 120 gcgccacggc cgacgccgcc cacccgcctt ccttacgccc cgttcgccgg cgcctccctt 180 caacccgccg gcgctgctcg acctgcgccc nctctgccac tnccgtrctt gccagacatg 240 gtgctgtccc ttccacgccc ccacggccca gccacctccc cagcctcagc ctcctcctcc 300 ccagcctcag ccttagtccc ggccggcctc ctcctcctca gccctcagcc ccaggccggc 360 gccgccgccc aaccagccgc cgtctgacag cccccggccg ccgacgcacg cctgcttgcc 420 gccaacacca tcgtgttcgg atgcgtgccc cggtaccccg cccccgccgc ctctgccgct 480 gccgccacag ccgccgccag ctgcgcctcc ccctccgcca tctctcctcc ctgccgccgc 540 cccctctgcc gcctcctccc ctcccatggc ctccgactgg tcaccccgcc tagcactctg 600 ctggcataac cgagtccgcc ccctcctccg tctccgccgg nccgctcgcc mgcaacagcc 660 cgcggrcaag tcggccgcag cctccagcta cagcccgcgt gacagcmccg gcgcacrtgg 720 agctctgggc ctkggtagcg gcggtaccga gcggtgcggc ggcwwtagta agtgcatggg 780 ggtggggcgc cggacgcggs cagggcacgc cgctcggcgg tgtgccsgtc gccgcgccgg 840 tgttcctaac gccgcggctg ccaggcgagg tctggtaacg ktggtarcgc gsktktgtst 900 gggggcttgg cgccgggtmg cgggtcgtcg atcgygccgg ccctgggctg ctggcgcggt 960 cgcttcggta cctggccttt cggccggcgg gctgcgcang tgtgtgtgcg tgggcgtgtr 1020 tgtgtgtgtg tgcgngtgtg tgtgggtgtg ggtgtgggcg tgtgtgtgtg tgngtgtgcg 1080 tgtgcgtgcg tgtttagtgc gtgcgcgtgc gtgcgcgtcc gtgcgtgtgc gcgtgcctgt 1140 gtgggcgctt ctgggcctgc tgtgtggctg ctgtgtcatt tttaacgcgc gtgcctctgt 1200 gcctccgggg agctcttggt gagcccggac tcccgtaacc gtgcgcacca cacaccgacc 1260 aggcagctgc tacagaccgt gacagtttca ctcgccagct gtgcccgccg tccccgcctc 1320 tcctgacagc taaccgcctc ctgtgcctca catttcatat ctgcctacag tgttcacact 1380 ccagcgccag ctcgagggcc agctccagcg tgcgacgcca gcaaactttt tcgcgaaccc 1440 gcattcggtc cttagggcgc gcggtgcagc cttatcccga acttcagccg cgacattccc 1500 aatccaaccc gcgcttcctc acactccggt tgtgtttgtc tcgcactcgc attcgattcg 1560 cccgccactc atccgtacat tcgggctccc ttttctgcct tgccgcgccc tgttttcgcc 1620 cctgcttgtc atactgtcgc ccggccattg ctttgctgca tactgttgca ttttctctat 1680 gttgcatttg gtgctcatcc gtggacgctg gtcatcttcc tgcaggtgcg gtgcggctct 1740 gcatcgcgag cgagtgtggc gttgcggtgt gtctgtgtgg tgctatcgcc tggccgggcg 1800 cccggagcgt gaggaatgcg catcagcaaa cttccagcca gccagcaact tgtcggtaca 1860 gtaactccgg cgttcgacca gccatgcact cctgctctgc cgctgcaaac tgtttgctgt 1920 ctgcttgggg tacgggcgtc agctgcaccc tatgggacct cgccaacctg cctgaaccct 1980 gtcccatcac a 1991 // ID DNA-8-3_CR repbase; DNA; CHL; 2870 BP. XX AC . XX DT 29-MAR-2006 (Rel. 11.03, Created) DT 29-MAR-2006 (Rel. 11.03, Last updated, Version 1) XX DE DNA-8-3_CR is a family of P nonautonomous DNA transposons - a DE consensus sequence. XX KW P; DNA transposon; Transposable Element; Nonautonomous; P-1_CR; KW DNA-8-3_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-2870 RA Kapitonov V.V. and Jurka J.; RT "DNA-8-3_CR, a family of P nonautonomous DNA transposons in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(3), 111-111 (2006). XX DR [1] (Consensus) XX CC This is a consensus sequence of a DNA-8-3_CR family of CC nonautonomous P DNA transposons. DNA-8-3_CR elements are less CC than 2% divergent from their consensus sequence and they are CC characterized by 23-bp terminal inverted repeats (plus CC subterminal inverted repeats at pos. 66-79 and 2807-2792) and CC 8-bp target site duplications. DNA-8-3_CR is a nonautonomous CC derivative of the P-1_CR transposon. XX SQ Sequence 2870 BP; 575 A; 793 C; 900 G; 602 T; 0 other; caggggagag gggtatgcaa ctcggttggc acgggctgca tggggctgca tgggaggccc 60 aaaccgtgtt cggacccccc agctcacgct tccgacttat tatttagata ggatggacaa 120 atgtgtacat atacttgttc agtctaggtc gtcggggcac ggcccctatt acttgaccat 180 tgcggacccg gacgggtgtt attgtaacac aactgcggtg ctttgacgta agtatcatct 240 gtggaagcct atagtgatag cgcggcccct tggagcctgc tggtcgcgct gctcacggta 300 agcatgcgtc cacaaatggc ttcggctggc tgtctgggca gatttccgta gcggcgaccg 360 cgccacaggg ctgcctatgc gcatatcatg cggaattatg ggagggcagc tcaagagcag 420 tcgccagggg cagtcgcgcg gcgcattttg gtaggagcac gcctcgcggc gcgaaacccc 480 ccgacttcgg ggaggtcgcg cgctggagcc ctgctagctt ttacagtttt caaacgctgg 540 ggccgtgtca ggtgctatac ctatcatata tcatccggcg gaacagcatt tagcagggtg 600 cgcgggtcgc agtctctgat cggctggtcg cagggctttc cggcgacatc aaccggcgtt 660 gccaacatat atgtgtagtt tctgctgtgt gcccgcatgt tttggggatc ttgtacctaa 720 atgtccattg taggcttgaa aggatgttga aaggcttcga ccctagtcgc cctggcatgc 780 atgcactggc atgtgatgcc gcggctcaga ggggtacgcg atttgctggt tatgtagcat 840 gggccgtggt aatacacgtc aggaaccgga ttccgttttc gggcaatgta ttactcttgt 900 ctttgggaag caatctctga tcggctgcac cggaccggtg aaaagaggcg tgggttgggc 960 ccgcggccgc acagaatcac gtctggtagc agtgacaacg atgacaggac ttgcccttcc 1020 ctccgccaac cactgtgtgt gcgcggcctt cctcctgcag gcctggctgc tctgcctgga 1080 cgtgtgcctg caccaccggc acatcagcgt cggtaggtgc cccattttct cctgcggctc 1140 ccaatcagca gcaacaggca gcagctgcac tcaattaact attggtaggg tggaaacagt 1200 ggcagctgct ccgcggctgc cgcctgtcac tggcgcctgg actggcactg tgccagcggc 1260 cgcggaggtg ttgctctatg ccatccatcg tgtgcactcc gcggaaagtg gctttagtaa 1320 gaaggtggac agcaccagcg gcagcagcag catcagcgcc tacgatcatg cagtgccgct 1380 ttgcagccgc atcgccgtgc tggacagccc tgacagcccg tacagactca ccccagcatc 1440 tgaagcattt catgcaagag aggaacccga accaacgttt gccaggcatc gttgcgcgct 1500 tggttctggc ccattgtggc attgtatgaa ggtgagctgc tgcggcctag gcttccataa 1560 tccaggtttt ctgcctgaag cccgcacctg gctgcgttct ggacggggag tggctgtggc 1620 tgtgcgcagt ggcagtggca aggcctgctg cggcggcacc tgctgtcgct gcaggccgct 1680 ggaccccgtg ccagaagagg ggaagcgtgc gcagttgcaa gcgggccctg ccactacagg 1740 tgtaaacccc tttgggttta acagctcctc tctcctctct cccggcgctt cttagtgcca 1800 catactacat ctcggcatag tctcttcttc tctaccttga ttctcaaagg caagtatcat 1860 gcgtactggt gattctggtg atactggttt ctagctttta ccagcttaca gcaacaccac 1920 aaaccgtagc cgaagggagc gggcgcaggt gctgtgggca cttgcaagca ggtggtaggg 1980 ccgtagggcc aacatctcgg caggcgaaag ctgtattgat aggacaagag tttagccagg 2040 tagacgtgtg gactggtggc acgcatgaca tccggagtgc atggcaagga ggcatcgggc 2100 atgcggggct ttgcaaggga atgtcgaacg gactgacggc gacacagggc gaagtgctag 2160 gggagagcat gggagagcac acccgtatgc aactgccact cctgcacgta accgtgtggg 2220 agtcaaatgc accaggaagc gggcctggcc gcgtgggcag gcaaccatgt atcaatacac 2280 gaatctctat gtgttgtgtt gccattgtgt cgacaaaggg gagtagcggg acgcgagcgc 2340 gggagtgggg ccggcgaccg ctgcctggcc tgtgcatggc cctgtgcaag atgtccacgg 2400 agaccgcaga cccggagaca ccagcagaca cgcaccaaac cccctgacta aggacactca 2460 tatgccccat tggctctggc cggcgtcgct cggttcggag agcttcggaa gagcaccaaa 2520 gccgggcatc gtcgcgaggg gtccttcagg cctgggccag ggggccggcg acacggcgcg 2580 tggctcatct taggccgcac atcaattgaa ctaacaggag caacatgtcc gcgcaggaag 2640 cacgaaacag gaatttcatt tgacgcgtgt gaagagtgcg aaatgccgcg atggtcgagg 2700 tcccggcagg cgggcctctc gggtcgagac caatttaaat ggttccattg gagtccaagt 2760 tacagtttca atcactcggg agcctgggaa agggggtccg aacactactt aagcccggat 2820 ccaggcagga tccagggctc ggggcgggag ttgcataccc ctctcccctg 2870 // ID TE2-7b_CR repbase; DNA; CHL; 772 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE TE2-7b_CR is a subfamily of nonautonomous transposable elements - DE a consensus sequence. XX KW Transposable Element; AAC target site duplications; TE2-7_CR; KW TE2-7a_CR; TE2-7b_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-772 RA Kapitonov V.V. and Jurka J.; RT "TE2-7_CR, a family of nonautonomous transposable elements in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 143-143 (2004). XX DR [1] (Consensus) XX CC TE2-7b_CR is a subfamily of nonautonomous transposable CC elements. The consensus sequence is less than 5% divergent CC from TE2-7b_CR copies. Classification of this family is not CC known. It is possible that it belongs to a novel superfamily. CC Features of this superfamily include preferential insertions CC into AAAC targets that are accompanied by the AAC target-site CC duplications, a 18-21-bp palindrome separated by 3-5 base pairs CC from the 3' end; and the AGCC 3' termini. XX SQ Sequence 772 BP; 158 A; 226 C; 218 G; 170 T; 0 other; ccctaccgct acgcagtacc ccggcttgcc ccctccccct gtcgcattgc gcaccattgc 60 aggccacgca tgacgggaac ttaacccccg atcccagcat ttcatcggct atggtcacgt 120 tcccaacgtc cagttccgcg cagtcgcagt tgtgcgaacg ggatgggagg gagatgcggg 180 gctggtatca ggggagggat ttgcgcgggg aggcgccggt gaacacagtt gcatttgcgg 240 aaggcgagag agcgcgggcg gggcgggggc gagtcggggg cagggggccc gagagttcgc 300 acacgatttc gacaaatcgt caccacgata gtatgcaccg ttgatatcac aaaactcagc 360 gctgcttatg cattgcgatg aagcacattg cctttctgct atcgccgatc ggagagatcg 420 catgcgcttt gctgtttgcg ctcgcacata tccaagtcta tatgtttgcc ttgaagaata 480 tcgtctaccg caattcccgg aaatgcaaag ccaatcgcgc tatcgagcgc cgggatcctc 540 caacgcctct gatctcttcg cctctagtct tacgccttcc ttgtgttttt gcaaagttat 600 gcgttgctac ggacacattg aaaggcgtat tatttgcaca agaactctgg ttacgttcgg 660 gtttcccgac gtaactgcac ataaaactgg aataaccgtg agggcccccc tccggcgatc 720 gatgcgaccg caatggcttc cgcccctccc ccttcggggg aaggggcaag cc 772 // ID hAT-N6_CR repbase; DNA; CHL; 488 BP. XX AC . XX DT 13-MAY-2004 (Rel. 9.04, Created) DT 13-MAY-2004 (Rel. 9.04, Last updated, Version 1) XX DE hAT-N6_CR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; 8-bp TSD; KW hAT superfamily; hAT-N6_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-488 RA Kapitonov V.V. and Jurka J.; RT "hAT-N6_CR, a family of nonautonomous hAT transposons from the RT green algae genome."; RL Repbase Reports 4(4), 124-124 (2004). XX DR [1] (Consensus) XX CC About 100 copies of hAT-N6_CR are present in the C. reinhardtii CC genome. They are ~98% identical to the consensus sequence. CC hAT-N6_CR is characterized by 15-bp TIRs and 8-bp TSDs. XX SQ Sequence 488 BP; 102 A; 112 C; 167 G; 107 T; 0 other; cagcgctcgt atcttccttc gttcggaacg aaatttgccc aaaaaggcct ctggcggacg 60 aaattttttt gatcttgact tggaatttcg gccaaaacgc acgggatctt ggggggtctt 120 tcggcaaaca gacgaagaca tttttgggat ttcggccact tttggccaag gacagggggt 180 tccagcggct actccacggc cgaaacctgc gtggcatatg gtaagaaggc tcaaagggct 240 gtatctatgg gtgttgacgg cgggagacag gggttcccct ggcgggagag tgaggttggg 300 tcggcgggaa gcaggggagg agtgggcgtg tcgtgtcggt gtcgacatcg tgcgggtcgg 360 gggcggagcc cccggaaaaa ttttgggggt gaacgaagcg gcggcagcct gtttcgtcaa 420 ctaaacgaag ccattttaag gaattcggcc actgacgccg cccctgcctt cgtaagatac 480 gagcgctg 488 // ID MSAT-3A_CR repbase; DNA; CHL; 92 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE MSAT-3A_CR is a minisatellite unit - a consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-3A_CR; MSAT-3_CR; KW Interspersed repeat; minisatellite; nonautonomous; tandems. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-92 RA Kapitonov V.V. and Jurka J.; RT "MSAT-3_CR, a family of minisatellites from the green algae RT genome."; RL Repbase Reports 5(3), 58-58 (2005). XX DR [1] (Consensus) XX CC The MSAT-3A_CR is a subfamily of MSAT-3_CR minisatellite. XX SQ Sequence 92 BP; 17 A; 32 C; 31 G; 12 T; 0 other; ggctggatac ggtgtatggc tcacatgcac gcgcacctcc gctcacgccc accgccgcat 60 gaaggaaagc cgggggaacg cctggtgcgc cg 92 // ID L1-5_CR repbase; DNA; CHL; 852 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE L1-5_CR is a 3' terminal portion of a putative L1-like non-LTR DE retrotransposon - a consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-5_CR; KW putative L1 superfamily. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-852 RA Kapitonov V.V. and Jurka J.; RT "L1-5_CR, a family of L1-like non-LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 5(3), 50-50 (2005). XX DR [1] (Consensus) XX CC The L1-5_CR elements are flanked by 5-10-bp target site CC duplications. CC There are several hundred copies of L1-5_CR present in the CC genome. CC They are massively truncated at 5'. Many elements are just 20-50 CC bp CC long 3' terminal portions. The consensus is not complete at its CC 5' end. XX SQ Sequence 852 BP; 115 A; 300 C; 214 G; 223 T; 0 other; tctgggggtg tagttcatac gcatgtggca cttatgcatg catgcttgcc attgcatccc 60 ccacccggcc ggcgctgcta ccctgtgcag ccgccggttt tgttcgacgg gtgtggggaa 120 ccgcattgcg gtccgccccg tcgattgtcc tgttttgtcc tgtttaaggg gtttattatg 180 gtgccttgct gcgtgttgcc tgcgtaccag actgcagcca tgcccacgta gctcactgca 240 ttcatgccca catggcccat gccgcgtgca gtaccgccgg gatgagccgt gcagccatcg 300 cgctgtcatc actattacgc ggcctccccg tttccagtac ttgcattgcc acacgtattc 360 attcctcccc atttacatac ttgctttccg ctccgtccac tccgcctcca tagttccgtt 420 ccagcccgcg ccagccccaa ccgcagcgac agccagccgc ccagcccgcg gttacttgct 480 gtgctgtttt gcttcctgtc tggcccgctg cttccttccc ggtttcttac gcttccactc 540 actccctccc tcacatgtgc tgctggacga cggtggcgct ggtgttttcc aggttggttt 600 cggatgccgc gcgccccgca gctccgctcc gcgccagccc gccagccgtc cgacagccta 660 gccatcgcaa cctggttata tcagtgatgg gtcggcggcg ctcctctatt gcccgtggcg 720 tttaccgtct ggatgaggta acatgtctgc tgtgaccgcc tcctgctaac cggccttcta 780 tgcgtgctag cgactacaac ttctcttcgt gtcgcgcaca ttcgcgttcg catttcgaca 840 gcactccgct aa 852 // ID EnSpm-N2_CR repbase; DNA; CHL; 6161 BP. XX AC . XX DT 28-MAR-2006 (Rel. 11.03, Created) DT 12-APR-2006 (Rel. 11.03, Last updated, Version 1) XX DE EnSpm-N2_CR is a family of En/Spm nonautonomous DNA transposons - DE a consensus sequence. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N2_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-6161 RA Kapitonov V.V. and Jurka J.; RT "EnSpm-N2_CR, a family of En/Spm nonautonomous DNA transposons in RT the Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(3), 121-121 (2006). XX DR [1] (Consensus) XX CC This transposon is characterized by 3-bp TSDs and 25-bp TIRs; its CC internal portion contains an array of the MSAT-3A_CR CC minisatellites. XX SQ Sequence 6161 BP; 1019 A; 1749 C; 2276 G; 1114 T; 3 other; cactataagg acgggtttgt gcgttacacc tgctttttgt gcgtaaagca tgcgttgggg 60 gtgggacgca tgcgtccacg gaacgcaaga agcatgaggg ccggggaggg cggcaaacct 120 ctcggttgtg cgtgcgaaac acgcatgcgg aacgcatgca gacgcatgcg ccccacgaat 180 acgcaagccc ccgacgcaag ccccgtcaaa gccttcacct caagtacttg ctcaactcac 240 agagtgtaga cacatagcga acgatgactt tgccaaacct cgcgatcgcc caactgcact 300 gcaacgtgca aactattgct tctacatttg ctgcgttaac aaacagtata ttagcacgta 360 taacgcaatt gccatgagaa aggcgttagg gatgggcctt gttaaacaaa ggcaaagcaa 420 cgagtgccag gctctatgtt gtatgtatgt atgaagagtt aaacctaggt actaactagt 480 aatcgcggta aacgggggga gcgccggcgg tgcgtggtgg tgagcagccg cggcagaggc 540 tgtggctgcc gcggctgccc atggccagca gtcaccggtt gcgcggccgc tggtggggag 600 gaaggctggc gactgccgct gctgcgcgct gctatcagcc cggaatctgc acgggagcgg 660 gaggcatgag cagggtaaac gggggagcgg ccggggttgc ggcggcgcgt ggcggtgact 720 gttccaagca gccgcggcag aggctgtggc tgccgcggct gcccatggcc agcagtcacc 780 ggttgcgcgg ccgctggtgg ggaggaaggc tggcgactgc cgctgctgcg cgctgctatc 840 agcccgggag ctgcacggga gcgggaggca tgggggtacg ggagcggccg gggttgcggc 900 ggtgcgtggc gatgactgtt ccaagcagcc gcggcagagg ctgtggctgc cgcggctgcc 960 catggccagc agtcaccggt tacgcggccg ctggtgggga ggaaggctgg cgactgccgc 1020 tgctgcgcgc tgctatcagc ccggaagctg cacgggagcg ggaggaacgg ggggagcggc 1080 cggggttgcg gcggcgcgtg gcggtgactg ttccaagcag ccgcggcaga ggctgtggct 1140 gccgcggctg cccatggcca gcagtcaccg gttgcgcggc cgctggtggg gaagaaggct 1200 ggcgactgcc gctgctgcgc gctgctatca gcccggaagc tgcacgggag cgggaggcat 1260 gggcggggta aacgggagcg gccggggttg cggcggcgcg tggcggtgac tgttccaagc 1320 agccgcggca gaggctgtgg ctgccgcggc tgcccatggc cagcagtcac cggttgcgcg 1380 gccgctggtg gggaggaagg ctggcgactg ccgctgctgc gcgctgctat cagcccggga 1440 gctgcacggg agcgggaggc atgggtaaac gggagcggcc ggggttgcgg cggcgcgtgg 1500 cggtgactgt tccaagcagc cgcggcagag gctgtggctg ccgcggctgc ccatggccag 1560 cagtcaccgg ttgcgcggcc gctggtgggg aggaaggctg gcgactgccg ctgctgcgcg 1620 ctgctatcag cccggaatct gcacgggagc gggaggcatg ggcggggtaa acgggagcgg 1680 ccggggttgc ggcggcgcgt ggcggtgact gttccaagca gccgcggcag aggctgtggc 1740 tgccgcggct gcgcatggcc agcagtcacc ggttgcgcgg ccgctggtgg ggaggaaggc 1800 tggcgactgc cgctgctgcg cgctgctatc agcccgggag ctgcacggga gcgggaggca 1860 tgagcagggt aaacgggagc ggccggggtt gcggcggcgc gtggcggtga ctgttccaag 1920 cagccgcggc agaggctgtg gctgccgcgg ctgcccatgg ccagcagtca ccggttgcgc 1980 ggccgctggt ggggaggaag gctggcgact gccgctgctg cgcgctgcta tcagcccggg 2040 atctgcacgg gagcgggagg catgagcagg gtaaacgggg ggagcggccg gggttgcggc 2100 ggcgcgtggc ggtgactgtt ccaagcagcc gcggcagagg ctgtggctgc cgcggctgcc 2160 catggccagc agtcaccggt tgcgcggccg ctggtgggga agaaggctgg cgactgccgc 2220 tgctgcgcgc tgctatcagc ccgggagctg cacgggagcg ggaggcatga gcagggtaaa 2280 cggggggagc ggccggggtt gcggcggcgc gtggcggtga ctgttccaag cagccgcggc 2340 agaggctgtg gctgccgcgg ctgcgcatgg ccagcagtca ccggttgcgc ggccgctggt 2400 ggggaggaag gctggcgact gccgctgctg cgcgctgcta tcagcccgga atctgcacgg 2460 gagcgggagg catgggcggg gtaaacggga gcggccgggg ttgcggcggt gcgtggcggt 2520 gactgttcca agcagccgcg gcagaggctg tggctgccgc ggctgcccat ggccagcagt 2580 caccggttgc gcggccgctg gtggggaaga aggctggcga ctgccgctgc tgcgcgctgc 2640 tatcagcccg ggatctgcac gggagcggga ggcatgagcg gggtaaacgg cagcggccgg 2700 ggttgctgcg gcacgtggcg gtgattgtgc caagcagccg cgatggagcc gaaaccaaac 2760 cggtgcttgt cagcaccttt attgacagtg tcgggcagaa ccacggtgac cgtgtgcatg 2820 tgagccttac cccgtatcca gcccggcgca ccaggcgttc ccccggcgtg tcttcatgcg 2880 gcggtgggcg tgagcggagg tgcgtgtgca tgggagccct acaccgtatc cagcccggcg 2940 caccaggcgt tcccccggct tttcttcatg cggcggtggg catgagcgga ggtgcgtgtg 3000 catgtgagcc ttacaccgta tccagcccgg cgcaccaggc gttcccccgg cgtgtcttca 3060 tgcggcggtg ggcgtgagcg gaggtgcgtg tgcatgggag ccctacaccg tatccagccc 3120 ggcgcaccag gcgttccccc ggcgtgtctt catgcggcgg tgggcgtgag cggaggtgcg 3180 tgtgcatggg agccttacac cgtatccagc ccggcgcacc aggcgttccc ccggcgttct 3240 tcatgcggcg gtgggcgtga gcggaggtgc gtgtgcatgt gagccttaca ccgtatccag 3300 cccggcgcac caggcgttcc cccggcgtgt cttcatgcgg cggtgttaat gacgagtgtt 3360 gcggtgcatg cggaggtgcg tgtgcatcgg agccttaccc ggcgcaccag gcgttccccc 3420 ggctgtcttc atgcggcggt gggcgtgagc ggaggtgcgt gtgcatgtga gccttacacc 3480 gtatccagcc cggcgcacca ggcgttcccc cggcgttctt catgcggcgg tgggcgtgag 3540 cggaggtgcg tgtgcatggg agccttacac cgtatccagc ccggcgcacc aggcgttccc 3600 ccggcgtgtc ttcatgcggc ggtgggcatg agcggaggtg cgtgtgcatg tgagccctag 3660 caccgtatcc agcccggcgc acccggcgtt cccccggcgt gcggtgggcg gtgggcgtga 3720 ggtgagtgtg cgtgtgcatg acaccctaca ccgtatccag cccggcgccc ccggctttcc 3780 cccggcggtc ttcatgcggc ggcgggcgtg agcggaggtg cgtgtgcatg ggagccctac 3840 accgtatcca gcccggcgca ccaggcgttc ccccggcgtg tcttcatgcg gcggtgggcg 3900 tgagcggagg tgcgtgtgca tgggagcctt acaccgtatc cagcccggcg caccaggcgt 3960 tcccccggcg tgtcttcatg cggcggtggg cgtgagcgga ggtgcgtgtg catgggagcc 4020 ctacaccgta tccagcccgg cgcaccaggc gttcccccgg cgtgtcttca tgcggcggtg 4080 ggcgtgagcg gaggtgcgtg tgcatgtgag ctctacaccg tatccagccc ggcgcaccag 4140 gcgttccccc ggcgtgtctt catgcggcgg tgggcgtgag cggaggtgcg tgtgcatggg 4200 agccttacac cgtatccagc ccggcgcacc aggcgttccc ccggcgtgtc ttcatgcggc 4260 ggtgggcgtg agcggaggtg cgtgtgcagc cccgtatcca gcccggcgca ccaggcctcc 4320 cccggccttc cttcatgcgg cggtgggcat gagcggaggt gcgtgtgcat gtgaggcttt 4380 ccccgttatt gagaaagctg agacgaacca tgcgaagccg aggcskgtmt ccagcgccgg 4440 tggtccctaa gcacaagccg cggaggcgca ggcggaggcg agacccgtgc ccggaccggc 4500 gctgaagact gacgtaccgg cctgctgtag attaatttaa ggtctgaccg gagcatgcac 4560 agcaaatcaa gacccagcat ggagcgcgtt ccttgttgca tggggctgca tggggctcat 4620 atatttcgag ttcagttctt cacttctctt caagcaacct gtctttgtca catgtagcta 4680 catacctatc gttcagcacc atgtagagac ttcgtggtcc tctgccagtc ggcaatggac 4740 ttgctttcaa agttggcgct caagtctaca gggcgagtct tcaagtcctg ctaagccctg 4800 gcatcaatct ttggaaggta ttttatgagc tgctggtgct ggggctgggc tgcgctgtcg 4860 cagggggtgt cggaccctcg ttgaccgggt ctacaactaa gccaggtctt gacgctgctg 4920 ttacaggatg taaatcaagc atcagagacc aagtctggaa tcaacagtgc caggtgaggc 4980 aggtcaggac acatagcgac agccatttca gggggtcgac aagcgacaac aaattcaggg 5040 ggtcaagcga gctagcatcc atgcgcagta tggacgaagt tgtaagagca taagcagaga 5100 tgcagagtgt gaagctcccc ccctccccct ccccttgtcc cctcaggtcc ggcactggaa 5160 ttcgactcgg ggcgcagcgc gcgtaaatgc gtggcccttc taaccctcac caggcaagtt 5220 tgatcggaaa ttgctacaca actcataata ggatgaatga taggatttga atgacgtaca 5280 tgggaagtta gttctagggg accgaatgcg gtctaggccc tgcgtgtacc tagtgccacc 5340 atgcactgac caacttaccc caaacaccat gcaggccgag ctcaagtacg gaggcggcta 5400 gcggcaagcg gcagacttca ggtggcggca gcggaattgg caggcggcgc tgtgcgcgca 5460 tggtgttatg gtgtgatcgg cgggtggctg ggtgttgtag tgctggcagg ggatccgtat 5520 cggttgcctt cacacttgcc tcgagaccct gagtatgcct ggtctgtacg tttagtgttt 5580 tgtagcaatg gtaggccttg cagaatgtgt gcgtgcagca cgtggtgggt tgcgggtcgt 5640 gtggacggag cgggcgggag agccgtgcgg acttgcacca ttttggcaca ccacgacacc 5700 agtacgaggg gcacgtgtcg gagcgcagat gcatgaactg taacacatca agagagcttc 5760 aggtgcctca tgtgccccat tttatgtgcc gtgcgggacg cgcacagtat cgggcgggac 5820 gcacacagcg ccgggtggca gcttgcttta ggtaaagcac aggcgtgccg catgcgtgag 5880 accatgcgta gagctctaag caaccggcat ttttgcttta tttgtgctta atcgtgcgta 5940 tgctaagcat gcatacgcat gcgtgccagc cagaggggag cggcgcggca agtcctggca 6000 actaccatgc gttagctaac gtaagtgctt gtccttgacg catgcgagcg gtgtgcttcc 6060 tcctgcgtac cgcacgcact tttcgtgctt gccgcatgcg gaatttccat gcttaacgca 6120 tgctaacgca tgccataacg cacaaacccg tccttatagt g 6161 // ID DNAX1_CR repbase; DNA; CHL; 1281 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE DNAX1_CR is a family of nonautonomous DNA transposons - a DE consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; DNAX1_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1281 RA Kapitonov V.V. and Jurka J.; RT "DNAX1_CR, a family of nonautonomous DNA transposons in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(5), 218-218 (2006). XX DR [1] (Consensus) XX CC DNAX1_CR is a family of nonautonomous transposons. The consensus CC sequence is less than 5% divergent from ~10 DNAX1_CR copies. CC Features of DNAX1_CRs include imperfect 13-bp terminal inverted CC repeats and 8 or 9-bp target-site duplications. Its termini are CC distantly repeated to those in TOC2. XX SQ Sequence 1281 BP; 266 A; 407 C; 341 G; 266 T; 1 other; ggagggtggg aaagttgccg ccggggggca tcttgggggg tgggtctggt cgggcatctg 60 tctcgcgcgg catcttggtt gccgccgggc ggcatcttgg ttggcggcag gggtgccgct 120 gcgcatctca gctgccaccc gccccaatct gagtttgggg gaccggtccc gccgcgcatc 180 tcagttgccg cccgccccaa tccaagtttg ggagaccagt cccgccgcgc aactcggttg 240 acacgcatcc caatctgagt ttagcagggc ggccccgctg cacaactcag ttggcgcgcg 300 ctcatatctg gatttgagag ccaagcatcg tgctgtagtg tttgcacggt agtaggtgca 360 cggctgcatg gcggcacgca cagcggaagc cgggtagccc aagaacgggc agcccgcttc 420 catccccgtc aactctctca catataggct gcagacacag actgcagaca caggctctat 480 gcancaaact atattcgtgt gcgaagcctt gaaagaaggc aaggaagcaa gaactctggt 540 gatcagcagt tccttcagcg cgcttcatag tcctaggcag tgctttaatt gagtaccact 600 cagcaactgg aatgatcgta gacaccccgg gcgcccttaa atgggcaaag ttgccagcaa 660 gctgccgcga accagtggtt cgtctttggc aaacaccccc aaactgccta cagttttaat 720 ggtcctgtag tgttacatac tgtcaaccat atcccttact atggagcttc taactcgctc 780 agttcaggga ctgaaccaag agaggggagc tgccagaagc cttgctggga aatgcgagca 840 tatatcaata cttcagcagc ttaaatatgt atatgaacat ttaggcaggc aatccctccc 900 cagctgccgg gttgtcagca ggtgccgcta tccaccgcca tcacccggcc cgccatccac 960 tcaccatcct gtgaatcccc ccgaccgtcc tgtgttctcc taagctcctt tctgagggga 1020 tatcccatct gatgtcggga aactaggttg tggatacctc ccccgcgcgc aatcttcaat 1080 ggcgtcggac cgcatctgag ttgccgcgcg ggggccccgg ccgcatctga cacggccccc 1140 cggcccaatc ttagttgtcg gcacaggccg ccgccccaaa cttggacccc ccgccaacca 1200 caactaagtt gctgccacag cccggtttcc gcaaccgtat gaccagactc aatccacaac 1260 tttccggatt tcccaccctc c 1281 // ID NonLTR-2_CR repbase; DNA; CHL; 1207 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE NonLTR-2_CR is a 3' terminal portion of a non-LTR retrotransposon DE - a consensus. XX KW Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; NonLTR-2_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1207 RA Kapitonov V.V. and Jurka J.; RT "NonLTR-2_CR, a family of non-LTR retrotransposons from the green RT algae genome."; RL Repbase Reports 5(3), 63-63 (2005). XX DR [1] (Consensus) XX CC The NonLTR-2_CR elements are flanked by 5-10-bp target site CC duplications. CC The consensus is not complete at its 5' end. NonLTR-2_CR elements CC are CC ~95% identical to the consensus sequence. XX SQ Sequence 1207 BP; 267 A; 365 C; 385 G; 190 T; 0 other; tgcagccgca aacacaacac aacacaaccg ccacaaaatg ggcaaacacg gtcaccgtgt 60 aaacaaaact acgacccaac tcgctgatca acctagggca gcgtggcgag cgtcccagga 120 atcgcctcaa tcgtctctat cgcactcagg acactcgccg gagatggcga gcagcggacc 180 cgagcgtcgg ctggacactc acagcagttc cagcagcagt ggagacaggc aggcctccgg 240 acagcacgct acttgcagcc cgatgagcat cggcaccgtc gactcgagcg agcagcccac 300 atccaagcgc gctgccgact ggttacggag catcaaactt cggcttcagt tctcgaccag 360 cctgatcgtg gacgcactgg caaacactga cacactccca gcatcctcaa tttgttctca 420 ggagttcgag cgtgcgctgg agcagttcga gctacaggtc gcagaacttg tatttagcgc 480 cccaaggcag gcaccggtgc cacacgcctc ggccaacgag gcacagaccg ccgcacccgc 540 ggcggagcgg cggcccaacg acctcgcagt gctgcgggta caggtggagg agcaacgggc 600 tgcgctggaa gccgtcaaac gctccctcgc ccgcgtgggg gatgcggccg ctgtggacga 660 gagcaagatc ctgctgaagt ctgtcagcgg cgagctgcta gagcgcatcc gggcgtccgg 720 gaacctagag gcggttgcta gcacgagtgg gacgtcggtg agcgggcgct atgtccgtgc 780 gtggcaatcg gcagggcggg aggtgtggaa cgtcgtggtg atggtgacgc cccaggaacg 840 cagcgcattg atcagggccg cacgcctggt ccgcagcaca ggtgtggtgg tggcgcctta 900 cctgacgtcc gtagggtgtg cactgcgcaa ggagcaactc ggagctttcg aagggtttcg 960 agcagcaggg ctgaagcccc gatggaaggg aggggcgggc attcggtgcc gaggcaccta 1020 atcacctagg ctgacgggct agcagttgac gtagttggca ttgagtagta cagtcgaaca 1080 cgcctaactt cgttgcgcaa agtaagtttc ccggcgttat aagggcccac cggcccgggc 1140 cactggtact tacggggaca ggatagcacg catgctcgtg cacaccacct tgtaaccaac 1200 aacaaca 1207 // ID SINEX-3_CR repbase; DNA; CHL; 507 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE SINEX-3_CR is a putative nonautonomous non-LTR retrotransposon - DE a consensus. XX KW Non-LTR Retrotransposon; Transposable Element; L1-1_CR; SINE; KW SINEX-3_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-507 RA Kapitonov V.V. and Jurka J.; RT "SINEX-3_CR, a putative SINE-like non-LTR retrotransposon from RT the green algae genome."; RL Repbase Reports 4(2), 47-47 (2004). XX DR [1] (Consensus) XX CC About 30 copies of SINEX-3_CR are present in the C. reinhardtii CC genome. They are ~98% identical to the consensus sequence. CC The 125-bp 3' terminus of SINE-3_CR is 79% identical to the CC L1-1_CR 3' terminus. Target site duplications are not present. XX SQ Sequence 507 BP; 110 A; 116 C; 148 G; 133 T; 0 other; gggggggtcg tctaaatggt taagacactc aagccgattt cgttaaggct tcgagagatc 60 ctgggttcga atcccggtca ccccaccagg aagggccctg gtttaccaaa acccatccac 120 accttggtgg tgtgtagacc ccccctgtgg tgcagggagt ctctggggta caaggctgga 180 gctctgggcc tgcaactgga gagctttcaa gttcaagatc caagattcaa gtggagctct 240 gggcctgtca actggagagc tctagatgat ctgtgtggtc atggcagtgg ctgtagttgt 300 cacctagctg ttgagaactc cagtaggtgt gtggtgtcac ctggctgttg aggtttccag 360 taggtgtgtg gtggaacaaa aacttctctt ctcttctctg atctctggcc ctatcgaggt 420 tctgtgatat gatccaggct gccttggcag tagggtggta gtgcgtctac caccgcaacc 480 ttgtaaggat gactttaaaa aaaaaaa 507 // ID NonLTR-6A_CR repbase; DNA; CHL; 980 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE It is a young subfamily of non-LTR retrotransposons - a consensus DE sequence. XX KW Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW Interspersed repeat; NonLTR-6_CR; NonLTR-6A_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-980 RA Kapitonov V.V. and Jurka J.; RT "NonLTR-6_CR, a family of non-LTR retrotransposons in the green RT algae genome."; RL Repbase Reports 6(5), 260-260 (2006). XX DR [1] (Consensus) XX CC This is a young subfamily of the NonLTR-6_CR family of non-LTR CC retrotransposons, characterized by frequent truncations of 5' CC terminal portions and insertions into (CA)n microsatellites. CC Average length of target site duplications was not determined. XX SQ Sequence 980 BP; 157 A; 323 C; 290 G; 210 T; 0 other; cagcatcacg ctcgtatcct atttattcag cccgcgtcac agggccagga ccttgtccgg 60 agccgcgatg caatcatgac atgccatcga tggcgcgcgc gcggcgtcga ctgctgcgag 120 cttctgggac tacatgatga ctctcctcat cgacgggctt tgttacgaac gtgctgtcgc 180 ggcggtgtac ggcgcagcac taccatgcgc ccgccccctc gcccagcccg tggactggtg 240 gacacacctc ggcgcgcttt tcctttcagg ggagtatatg gatgtccccc ggggggctgg 300 tttcggacag ccgacgcggc cactgcggcc gcgagcatgc ctagcctcgg tgccgccgtg 360 gagtggggag cgggaggcca gctcccgcca gcccacggcg ccacacactc gcacacacta 420 gcccacgtcc agcagttccg ggggttcgct gctttctagt gctccgcaac cgacgcccgc 480 cgacttgacc ccatgcagcc gatgactcct tataagtttc tcgacgacca cagcctcgct 540 ctcttttccc gcaatcatta gcgggccggg actgggatgc gtgtcctagc ttgcttgctg 600 cacgccgccc ttaacatcgc tgtcgccatt ttccctgctc gtgacacttg ctcgctgccg 660 cccctgtgcc tgcaattcct ctatcacccg taggtgagct cgactacgac cagcgggcgg 720 gaatcagggc attaggcgtg ctgggtcaac gttgtagcgc gcccagggcg cggcgcatac 780 aatgcatatg ggtgccactc ggggtcaggt cgttcgtcga tcgcccgagg gggggttcca 840 ggctttttgc gggtaagtat gactaccggg gtcgacgcgg ctacaagacg ctcggtatcg 900 gtaggtatcc gtaagtaact ccgtcgcgcc gtctgggcgg gcttcgtttt gttgttttct 960 tttcaacaca cacacacaca 980 // ID TE2-3_CR repbase; DNA; CHL; 542 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE TE2-3_CR is a family of nonautonomous transposable elements - a DE consensus sequence. XX KW Transposable Element; AAC target site duplications; TE2-3_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-542 RA Kapitonov V.V. and Jurka J.; RT "TE2-3_CR, a family of nonautonomous transposable elements in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 138-138 (2004). XX DR [1] (Consensus) XX CC TE2-3_CR is a family of nonautonomous transposable elements. CC The consensus sequence is less than 5% divergent from TE2-3_CR CC copies. Classification of this family is not known. It is CC possible that it belongs to a novel superfamily. Features of this CC superfamily include preferential insertions into AAAC targets CC that are accompanied by the AAC target-site duplications, a CC 18-21-bp CC palindrome separated by 3-5 base pairs from the 3' end; and the CC AGCC CC 3' termini. XX SQ Sequence 542 BP; 115 A; 125 C; 178 G; 124 T; 0 other; ctcccagctt cgcagcgcga tggctcgccc tttgtggggc tgcgaagtca ggcactgtat 60 agtttgggat tgaaacgcgc ggcatgcagg tgcaaggagt tatacagcgt gttgaactgg 120 caggagagga tgtgcagcac caggtgacag gagacaggaa cgacaggagt cagggggaga 180 caggaggcga caggaggcag ggggaggcag gagagacagg aggggacacg gcacgggcag 240 ggcaaggatt aacacaggtt gaggattccg ctttgtggct ttctgctttc agcttgactt 300 tatggctctt atggctcttt gagcggctct ctgagcggcg gcggcctcgt ggcctggacg 360 gtgcacttta catactagtt tttctgtgtc gcatgccttc aagggttttt gcacttggca 420 aatactctgt tacagcctct attcttgtcc atgttctacc aaaaagcgca aaactggaca 480 gaaccctaat gtaaggttct tgccgaggac tatggctgtc gccccccttt ggggggcgag 540 cc 542 // ID Mariner-N5_CR repbase; DNA; CHL; 742 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE Mariner-N5_CR is a family of nonautonomous Mariner DNA DE transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW mariner superfamily; Mariner-N5_CR; nonautonomous DNA transposon; KW TA target site duplications. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-742 RA Kapitonov V.V. and Jurka J.; RT "Mariner-N5_CR, a family of nonautonomous Mariner transposons in RT the Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 133-133 (2004). XX DR [1] (Consensus) XX CC Mariner-N5_CR is a family of nonautonomous transposons. The CC consensus sequence is less than 5% divergent from CC Mariner-N5_CR copies. Features of Mariner-N5_CRs include 28-bp CC terminal inverted repeats and TA target-site duplications. XX SQ Sequence 742 BP; 160 A; 218 C; 176 G; 166 T; 22 other; ccgtagtgct acaaaggttt cggcgttcga aatttcggcg aggcgttgcc gaartgctgg 60 aaytattang atatttttta atttcggggg gccgccggrt tatttaacam waatgttttt 120 aagttcggcg gcccgcccac agccgcggac ggggcccacg gaaggggcct agcaagcact 180 gcatcaaggg gctacccgct accacacgcg tgtcctactg cgccacgcgc gtgtcctact 240 gttccagcag tacggtaccg gtgctgtgca ractggcaac ccctgctcag ctgtcaccgc 300 acacacaccg cacaccgcgc ttcccacacc gcacaccgcg ctatgaaacc cccagcaacy 360 cacccagcaa gcactaattc atctccagcr cgctcgtttg rgtcgtggtc cttcggcagc 420 acaggcatga aggccctgat ggcagtttgt ttgctcattt ttrswykmtt ggtcgattgy 480 gctacttaca actagtatat agaaaatatc ggttctctgc agcagagcga ctagataagc 540 ttaagataag gtcgtgcgtg gcgtgccaaa cccctccacc ccctcccttg cgcgcgcgcg 600 cgcgcgcgcc acgcgggcga aattgttccg atgatattaa ttttcggcga tattttnata 660 aaaaaaaatt gtaatwrttt crgcgctcaa cttcggcagc cacccccgaa ctttgaacgc 720 cgaaaccttt ktagcactac gg 742 // ID Gypsy14-I_CR repbase; DNA; CHL; 3161 BP. XX AC scaffold_34; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy14-CR; KW Gypsy14-I_CR; Gypsy14-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-3161 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 94-94 (2008). XX DR Genome; scaffold_34; Positions 759227 762387. XX CC Positions [2036-2518] - Integrase core CC 'CAACT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 944..3145 FT /product="Gypsy14-I_CR_1p" FT /translation="MRVLAHRSAISGSYRDPKKTESIRNWPLPRDLSELRS FT FVGLANYFRRFIAGFGALAAPLTDLFRLPKLPGTWPPQALRAFAEIKKRLS FT TDVLLHYPDFDHPFAVVSDASLNGTGAVLLQDLRPVAFTSKKFSPAERNYG FT TGEQELLGVVNALREWRCYLLSGISFTLVTDHHPLTYLKTQATLGRRQARW FT LDLLEEFKFDWVYRPGRLNVVADALSRLPSLAHIVCGAMRTRRHPEQPTSS FT TDADSLFSRIVEATERDAWFQDPRHTDSLLRHSTGFYQKQAADCEFRVVVP FT NDPSLKRDIISRAHSSELSGHPGRDRTLKLLQRTFWWPTMGADVADFVASC FT DTCQRVKPASGKPPGLAQPLPVPDMPWESVSLDLITGLPTTQKGHNAILVL FT VDRLTKMVHVVPCKKTLSAEQTADAFFETIVRLHGMPKIVVGDRDTRWTGL FT FIPALLKRCGARLNLSTAFHPQTDGQTERMNRTLQDMLRAFTSDDPRSWDR FT FLPAAEFAINNLPNRTTGQSPFYLNYGFHPRTPLQLELGDVLPAAKAFADN FT FSDRIRTAQRQMTAAQDRAKAYQDQRRRDVEFAVGQQVLLSTRNFKVQKGQ FT RKLLPRWTGPFEMTEMINPVAARLALPADYPPMHDVFHVSLLRQYQPPAEG FT AAPPPIIAFDPTTMQPVLPTVEKILNHETKKLRTKTLHRYYVLLRGRSHGE FT SQWWDEADLLPEHQALLDAYWAAQPNGRTADVPA" XX SQ Sequence 3161 BP; 671 A; 1030 C; 816 G; 644 T; 0 other; tcttttgaaa gctgagaccg gccaccgcaa caaggactgc aaggaacccg ccgcggagcc 60 ccccgtgcgc cccacggcgt agagtgctgc ccaaccgccc agtgaccagc gtgttggagt 120 aggccttgct gctgctgccg ttgcgttccg tgccttgcgc caacaggggg ccctaggtgt 180 gctgttgtgg gtctcgccag gggccgcacc ccctgcagac acgccacctc cggcactgaa 240 cgccgccgaa ctaccgactg agacatcccc gaatcccgaa ctgcaaacaa tcctatccga 300 atacgaggat gtctttgccc ccatcaccga gttacctccc gaccgtggcg tttcccacac 360 gattcccctt gaaaagggcg cacaacccac gttcaggccg ccataccgac tcaccccgct 420 tgagaaggct gaggtgaacg gctgacctgc tcgaacgaaa atttatccgg ccctctgcct 480 ctccttacgg agccccgatc ctctttgtca caaagaaaac aggcgagcta cggatgtgca 540 ttgactaccg cgccctgaac aagatcacgg tgaagaaccg gtttcccctg ccgcgcattg 600 atgacacgct agaccgcctg gcaggcgcga agtacttcag tacactcgac ttggccagtg 660 gctactggca gtgccgcatt caccccgacg acatcgaaaa gacggcattc tcgacgccgc 720 aagggcactt cgagtggctc gtacttcctt tcgggttgtg caacgccccg gccacgttct 780 ctacgatgat gacccaaatc tttacaccat atctcaataa gtttgcgacg gtgtatctcg 840 acgatatcct gatttactcg aaaaccgccg aggaacacgg aagacattta cgactggtcc 900 tggacttgct gcgcaagcac aagcttcatg ccaaaatgtt aaaatgcgag ttctggcgca 960 ccgaagtgcg atatctgggt catatcgtga tccaaagaaa accgaatcga tccggaattg 1020 gccacttccg cgcgatttat ccgaactacg gtcatttgtt ggcctagcca actactttcg 1080 acgctttatc gcaggcttcg gcgctctcgc ggccccgttg acggacttgt tccgtctccc 1140 gaaattgccg ggtacatggc cgcctcaggc actgagagcg tttgccgaaa taaagaaacg 1200 attatcgact gatgtgctgc tgcattaccc cgatttcgac cacccctttg cagtcgtgtc 1260 cgacgcgtca ttgaatggca cgggcgccgt gcttctgcaa gaccttcgcc ctgtcgcctt 1320 caccagcaag aagttctccc ccgctgagcg caactacggc accggcgagc aagaactgct 1380 gggcgtcgtc aacgcacttc gcgagtggcg ttgctatctg ttgagcggca tctccttcac 1440 cctcgtgacg gaccaccacc ccctcaccta tctgaaaacc caagcaaccc tcggtcgccg 1500 acaggcccgc tggctcgacc tgcttgagga gttcaagttc gactgggttt accgccccgg 1560 tcgtttgaac gtggtggctg atgcgctctc tcgactgcct tccctggcac acatcgtgtg 1620 tggtgcgatg cgcacgcgcc ggcatcccga acagcccaca tcctcgactg atgccgattc 1680 cctcttctcc cgaattgtgg aagctaccga acgcgatgcg tggttccaag accctcgcca 1740 caccgactct ctgctgcgtc actccaccgg cttctaccag aagcaggcgg cagactgtga 1800 gtttcgcgtg gtggtgccga atgacccgtc cctgaaacgt gacatcattt cccgggccca 1860 ttcctccgaa ttgtcgggcc atcctggccg tgaccgaacc ctgaagctgc tgcaacgcac 1920 cttctggtgg cccacgatgg gagcagatgt tgcagacttt gtggcgtctt gcgacacgtg 1980 ccaacgtgtg aaacccgcat ctgggaagcc ccctgggctt gctcaaccac ttcccgtacc 2040 cgatatgcct tgggagagcg tctccctcga cctgattaca ggcctcccga ccacacagaa 2100 aggacacaac gccattctgg tgttggtcga ccgcctcacg aaaatggtgc acgttgtgcc 2160 ctgcaagaaa accctgagcg ccgaacagac tgcggacgca ttcttcgaaa ccattgtgcg 2220 cttgcatggt atgccgaaaa tcgtggttgg cgaccgcgac acacgctgga ccgggttgtt 2280 tattcccgct ctactcaagc gttgtggcgc tcgcctgaac ctatcgactg ctttccatcc 2340 ccagaccgat ggtcagaccg aacgcatgaa ccgaacactg caagacatgc tacgcgcatt 2400 caccagtgac gaccctcgca gctgggatcg cttcctgcct gctgccgaat tcgctatcaa 2460 caacttgccg aaccgcacga caggacagag cccgttctac ctgaactacg gcttccatcc 2520 ccgcacgccc ctacagttgg agctcggaga tgtcttgccc gcggcaaagg cttttgctga 2580 taacttctcc gaccgaatcc gaacggcaca gcggcagatg acagctgctc aagatcgtgc 2640 caaggcgtac caagaccagc ggcgccgtga tgtggagttt gcggtgggcc aacaggttct 2700 gttgagcaca cgcaacttca aggtgcagaa gggtcagcgc aaactgcttc ctcgctggac 2760 tggacccttc gaaatgaccg aaatgatcaa ccctgtggct gctcgactgg ccctaccggc 2820 agattatccg cctatgcatg acgtgttcca cgtttcgttg ctacggcagt accaaccccc 2880 tgccgaaggt gccgcacctc ccccgatcat tgctttcgat ccgaccacga tgcaacccgt 2940 actgcccacg gttgaaaaga tcctgaacca cgaaacgaag aagctccgaa ccaagacgtt 3000 gcaccgttac tacgtgctgc tacgaggccg ttcccatggc gaaagccaat ggtgggatga 3060 ggctgatctg ctacccgaac accaggcgct gctggacgct tactgggctg cacagcctaa 3120 tgggcggacc gcggacgttc ccgcctagaa aagggagcag a 3161 // ID Gypsy-2-LTR_CR repbase; DNA; CHL; 1254 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE Gypsy-2-LTR_CR is a long terminal repeat of the Gypsy-2_CR LTR DE retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Nonautonomous; KW 4-bp TSDs; Gypsy superfamily; Gypsy-2-I_CR; Gypsy-2-LTR_CR; gag; KW nonautonomous LTR retrotransposon; protease. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1254 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-2_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 4(2), 38-38 (2004). XX DR [1] (Consensus) XX SQ Sequence 1254 BP; 253 A; 341 C; 328 G; 332 T; 0 other; tgtagcaggt aagcgcggac ttcgtgtaaa cgcgtctgcc atgtttccta gctatgctca 60 taaattgtac gaggttcgtg tttggagctt tcaaggttat tcaccaagca gcgactactg 120 atcaagattt gctggacaag agcgtgtgcg tgtgtagggt gagtgcttgt gcggcgcggc 180 aatcggcact tcccagtgtt tccgatgcgc atgcaagccg gcgtttctag aagcccagta 240 cgtgctggtg ttattcttat tctttaagcg tgtttattta ggtaagcaaa cagacccacg 300 cctgcgcggc gttggaggtg gacttcacgt tgaagaaggt ctgtttacga atgtgcactg 360 cgcgcaggtg tgaagtacgg cagttgggcc ccggtggtcc ccatattgag ggttggaccc 420 accatcgaag aaggtggttg gcagttgccc aaaagtactt aagggctgac cgcgccgagc 480 tgtgacaagc tattgccaca agagcctagc caagtacaag agtaagcccc ccaaccgact 540 gccaagtcac acatgctcaa gtgagccgtc tgtctgtgtt ccaatctttg aataggtagt 600 gccggccgta ggccaggcac ggcgcacggt cgcgcccttt aggtttctgt gtttgattgc 660 tctgcgggac ccaaccccag tgccttcttt gtaagcgtca agtaatacta ccggtacacg 720 gtctttgcct gtcccgcctt cctctctaac tcccactcag cgtctcagac ctgagacgct 780 gagcaagtca gccttagcta ttggaatagt cagcaacaag tcagcgagtg ttctgcttgc 840 gtgtatatcc gtttcccttc ccggcacaca ggttctccct gccaggggta tcaagtcaag 900 tcccgaccaa gactcctttc cctaccgcct cctcaccacc tactctcacc ctcaatccct 960 cgtctagtct tcttcttctc ttcgagtgtc catcgcctac ttctgtatac tatcgtgtct 1020 gtctcgctcg tgtgtgagcg caagccacgt gttaggtgtg tcgcgtgtca gtcaggtctt 1080 cccatacctg tgaacgctag tgtaccggta tactgaacca agggaaggtc aaatcgtagt 1140 gggtctgtct tcatacagct gctgcgtggc tctagagact cggtgacgcc tgcgccagta 1200 cgtgaaaaaa tctgggttcc tgtatatagg gctcgcgcgt cttttgatgc taca 1254 // ID Gypsy19-LTR_CR repbase; DNA; CHL; 271 BP. XX AC scaffold_8; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy19-CR; KW Gypsy19-I_CR; Gypsy19-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-271 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 106-106 (2008). XX DR Genome; scaffold_8; Positions 461515 461785. XX SQ Sequence 271 BP; 65 A; 77 C; 57 G; 72 T; 0 other; tgtaacaaga ggtctacgga cttaaactct ctcgacacac ggttggaaga caagggcatt 60 ccgtatcaca ttccgtatgg cgttcgtcca ttgctcggac gttcttcgcc taaaccctag 120 taactacggt tacacggttt cccgccaacc gtgcccaccg ggctacttaa gcgcctcgga 180 aacaccccgg tgtatcttaa caccaccttt ctcggttgtt tatataggta gtctataggt 240 cagattctaa gtctagcaca cgactgttac a 271 // ID Pioneer1 repbase; DNA; CHL; 2816 BP. XX AC CRU19367; XX DT 24-MAR-2011 (Rel. 16.03, Created) DT 24-MAR-2011 (Rel. 16.03, Last updated, Version 1) XX DE Tyrosine recombinase-encoding DNA transposon in Chlamydomonas DE reinhardtii. XX KW Transposable Element; Pioneer1. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-2816 RA Graham J.E., Spanier J.G. and Jarvik J.W.; RT "Isolation and characterization of Pioneer1, a novel RT Chlamydomonas transposable element."; RL Current Genetics 28(5), 429-436 (1995). XX DR EMBL/GenBank/DDBJ; CRU19367; Positions 1 2816. XX CC This family is found in C. reinhardtii strain 224, but it is CC absent in strains 137c and 21 gr. XX FH Key Location/Qualifiers FT CDS 2023..1076 FT /product="Pioneer1_1p" FT /note="tyrosine recomibase." FT /translation="MCSCTLPSTGCRAIRAGRRRLRGPAPSSPTCRCSAAP FT WRALGGTGATTPSPARATRVRARGWRTTARATSGSRCWRATRRFQRCRCRC FT RSTRRSCDTCGARFASRPATPWPYWCTCATSCASCSCGRPLSAPTTLASWA FT WATSWTQCGRTSPTRAFRCLRRGSTCAVCPASASRAGYQDVPPGSCACHLA FT GASTCGAGLLHTARAGVVHVVVPPAGVPPAVRSWTCCSGPSLRTSAASRTP FT LRTSTLSARLKLHLQAAGLYAGESCHSFRRGALQAAAGGGATSEELMGLSH FT IRTPAVLGRYLDEERHLRDSRSQL" XX SQ Sequence 2816 BP; 654 A; 922 C; 878 G; 362 T; 0 other; gaacgtgact tcaatgctcg gacgcgcgga tgggcagtac acgactggca cccccctcag 60 ccagcactcg ttcccggtat agctattgct aaacgcttag ccgctatcac cccctactct 120 gcaagggcga agggcgacac ggaagcccga gcgcgccaaa ttttacccac ccaccatccc 180 aatgtcaatt tggggcctga ccgcaggtgg agacaaggga gaacggcgcg ttggagacga 240 caccagcagg cggaggcagg agcaggaggg gacaacgaca gggcaatagg agttcgtgac 300 agtgacaggc cgaggcgcat agcgggcggg cagtagtagg cagtagcggg gacggcagac 360 cgcctagtga caggctacgc gagagcacgg tgcgccggca gcaatagcaa gagtaagcag 420 ttgcaattca caagccgcaa cggtagcaga ctacggtagg ggacggcagt gacggaggca 480 ggcagtagca atgcagatgg taggggtagc gggctgtggc agggacggca gggcagcaat 540 aggcgtgcag gtggcagtcg gcagaaacaa ctggctgggc cgtagcaggc tgcgacgggg 600 cggccacacc agtaagaggc tagacgtaga aattggccgg ttgagctgag gcaagcaggt 660 aggagagacg aagagcaagg tagcctgcag agagtaaccc tttgcaggcc tcagcgcacg 720 actcattgtc actcgacgtc atagtacggt gctgatcgca cacagcaccc ggccttcctc 780 cccctaccga cgagacggct tccagccaga gactcaggcc accgcggaac taccatcaat 840 aatacaccaa catcaccaca ggcggcacca atgcacacgc agccgcctaa acagcaagca 900 gacgcaacct taccccacac tcgaacctac agtcctgcac cgagcaaatc atccgaccgc 960 cctgaaccgc caccaccgcc tgtaactcag cacgcacacc cgagctacgg ccacagccgc 1020 ccgcacattg gcacccacta gcgcaagcag ataatccact cacaccacca ttctacagct 1080 gcgagcgcga atcgcgcagg tgccgctcct catccaggta tcggcccaag acggcgggcg 1140 tgcgaatgtg gctcaggccc atcagctcct ccgaggtcgc cccgccgcca gccgcggcct 1200 ggagagcgcc gcggcgaaag ctgtggcacg actcacccgc atacaaaccc gccgcctgca 1260 ggtgcagctt caaccgagcg ctcagggtgg aggtgcgcag gggcgtcctt gaagctgcgc 1320 tggtccggag tgaggggccg gaacagcagg tccacgatcg gactgccggc gggacgccgg 1380 ctggcggcac aaccacatgt acaaccccag cacgcgcggt atgcagaagt ccggcgccgc 1440 aggttgaggc tccagccaga tggcaggcgc acgagccagg cggtacgtct tggtaccccg 1500 ctcgcgatgc agaggctggg cacaccgcac acgtgctccc acggcgcagg cagcggaaag 1560 ccctggtagg gctggtccgg ccgcactggg tccacgaagt cgcccaagcc cagcttgcca 1620 atgtcgtggg cgcgctgagc ggtctgccac aggaacagga tgcacaagag gtcgcgcacg 1680 tgcaccagta cggccaggga gtcgccggcc gcgacgcgaa tcgagcccca caggtgtcgc 1740 acgagcgcct cgtacttctg caccgacaac ggcaccgctg aaacctcctg gtagcccgcc 1800 agcatctgct gccgctggta gcccttgcgg tagtcctcca cccacgcgct cgaacacggg 1860 ttgccctcgc cggtgacggc gttgtagcgc ccgtcccgcc cagcgcgcgc cagggcgccg 1920 ctgagcagcg acaggtggga cttgatggcg ctgggccccg gagcctccgt ctgcctgccc 1980 gaatggctcg gcagccagtg ctcggtaatg tacacgagca catcgtccgg agtgcagtcc 2040 aggacagtac gcccgccccc gaagcgagct agccacgccg cgaactcccg cgccgccgcc 2100 tcctgccgcg agtaagtgct agccgctaag ctggcttcca cagtgcgcgc ggcggagacc 2160 aagccacgct gatacacggc ctcacggtcg gccgccgccg ccgcaggcac aggcgctcgg 2220 ggagcccagc cagcttgccg cagggcagcc aaagccggca cccgggacag aagcggaacc 2280 ggtgggagcg gaagcggcgg ggcagcggga cgaggatcga gctcgtggcc acagccggga 2340 accctactgg cggcacctgt cgcagaaaca cagaacaaca ttagcaaagg gagccgacgc 2400 ggaaggagcc gagcgaagga acctaccatc cgaaacccat ccaacaggcg cctagcaaac 2460 cccttatcga tcggctatcc aaaggcagaa tcgctcaccg cattagcatc gggcagcaga 2520 gacgcgaagc cagagacacc ggggccagga aaaggcgcca gctgaccctg aaagcgaggc 2580 gtaaatcaca gaagttaaac gcctacatcg attcaatcat ataataccac aagcagcagt 2640 tcagagctta ccgtgcccgg attacaccaa aggctcgaac tgccggtttc gcccaaccca 2700 acccgcccta ttaaattagg cagaccagga aacattggtc acggtgggaa gcgatacgcg 2760 ggagacacgc cacacaaagt tagccggtgt atgacgagca tttgcgttgc agcact 2816 // ID Copia3-LTR_CR repbase; DNA; CHL; 327 BP. XX AC . XX DT 13-MAY-2004 (Rel. 9.04, Created) DT 13-MAY-2004 (Rel. 9.04, Last updated, Version 1) XX DE Long terminal repeat of the Copia3_CR retrotransposon. XX KW Copia; LTR Retrotransposon; Transposable Element; 5-bp TSDs; KW Copia3-I_CR; Copia3-LTR_CR; COPIA superfamily; endonuclease; KW protease; reverse transcriptase. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-327 RA Kapitonov V.V. and Jurka J.; RT "Copia3_CR, a family of copia LTR retrotransposons present in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(4), 107-107 (2004). XX DR [1] (Consensus) XX CC Copia3-LTR_CR is a long terminal repeat of the Copia3-CR LTR CC retrotransposon, whose internal portion is deposited in CC Repbase as Copia3-I_CR. XX SQ Sequence 327 BP; 66 A; 94 C; 97 G; 70 T; 0 other; tgttggtaat ttgggttatt gggcagcact gccggtgagg tggtagctgc ggtccgaact 60 tggggccctg ggaagttagc gaggcgcctg ccctgatcag gcagaagtgc ctccttgtct 120 gcccgcctgt agcgcctccc cgactgccag tacggcattg tgtcggttcg cggaccgtcg 180 ggaagactta agacccgacc aagcgtgttg ctgataacta cacacgcgca ttatataccc 240 agactaccga aggtgccagg cagcgtagct gcatcgtgtg taacgcaccc gctagcaatt 300 caggcttgcc gaaccaactt cgcaaca 327 // ID MSAT-1C_CR repbase; DNA; CHL; 90 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE MSAT-1C_CR is a minisatellite unit - a consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-1C_CR; MSAT-1_CR; KW Interspersed repeat; minisatellite; nonautonomous; tandems. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-90 RA Kapitonov V.V. and Jurka J.; RT "MSAT-1_CR, a family of minisatellites from the green algae RT genome."; RL Repbase Reports 5(3), 54-54 (2005). XX DR [1] (Consensus) XX CC The MSAT-1C_CR is a subfamily of MSAT-1_CR minisatellite. XX SQ Sequence 90 BP; 13 A; 20 C; 48 G; 9 T; 0 other; ctaggcggca tcggcgacgg cggctccgga gatggaggct cagggctggg aggggccggg 60 ctggggggca gtggcgaagg cggatgcgga 90 // ID Novosib-3_CR repbase; DNA; CHL; 6736 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 01-JUN-2006 (Rel. 11.05, Last updated, Version 1) XX DE Novosib-3_CR is an autonomous DNA transposon - a consensus. XX KW Novosib; DNA transposon; Transposable Element; KW Interspersed repeat; 8-bp TSD; transposase; Novosib superfamily; KW Novosib-3_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-6736 RA Kapitonov V.V. and Jurka J.; RT "Novosib-3_CR, a family of autonomous Novosib transposons from RT the green algae genome."; RL Repbase Reports 6(5), 264-264 (2006). XX DR [1] (Consensus) XX CC Novosib-3_CR is a young family of autonomous transposons. The CC consensus sequence was derived from several copies that are ~99% CC identical to each other. These transposons are characterized by CC 8-bp target site duplications and 14-bp terminal inverted CC repeats. XX FH Key Location/Qualifiers FT CDS join(163..574,853..939,1127..2046,2186..6202) FT /product="Novosib-3_CR-1p" FT /translation="MTELKVSYSDGTLCNYCLVCAKVARARGKGIRTVNLE FT NYAPCGVNAAVLAVRRQQGSRAPEPCVHSTCYHNTLQPRQKELQAQAQAQP FT RTRAAVKLQPAPPAIRAPRAAVGKKEKLEELQREARAVTGLLSPYQGRAGG FT PVRMDTSAPPSPEQLGTTAAAAGVQSPGGPVRMDTSAPPSPASGPAAGGPA FT AGGPAAGGPAAGGPAAGGPAAGGPAGYTGVTRDQSVHGYKARSGRQFGSTG FT NALYLGLYATCHDAALAVNHFHEMLAERVGNRADVPPPPNAVPAGASLSAA FT EVAVVRNAVDKAVLAFWDKGVSAGKEQLEALRSSVADFLRARSLVSELDAA FT VGAIASGDVKPDDMVVQYTMAAIPNAAKLREGGATASGVRWPGVVKTVTAA FT ASAQRSGEGCLNVLRGEVKDPGLPFPSTSTIRRSMGEEEVDPQAVGVLRRG FT IQNFVARCREAGRLGAEATMAADGVDIPGMLEKEMLPSGTVRFDGDACHGD FT AISVAMLATKEREYNSLVQRLDAAVAASGGDSHPAADAAAAFASVQFVATW FT IQHEVLPLAHARLKTAQDAAAAQSARFQARNGVGEQRLKSQQVSHKCASLQ FT RIGRLQAVVAQAEGLLSGAQLVRDVNLVTLDMCAPSPADGSVTVSLSTAGS FT NAEAHCAALVGMLARNMRAAVSSLYDVTRMAATSLLVAMLTSPDSRVSEMV FT ARMHISSKASAQLMRWLVSSVKQELEAQGGKLLLVVADGEHFSLTNAAASG FT ELSSNLAQLNRESQRLCAAAEQARHASAKRHGWQRRDLAWQDYAQHVPLLP FT AQRVLPRGASMSALTRISIASTPRAAPLLEAAAALEQHQSLVAATVAAAAT FT AAAATAAATAAAATAAATAAATAAAATAAATAAAATAAAATAAAAVAKAAG FT PPPPLAQQQTQSRPKRTAAAPARLADYATEGMKPAVAAKHAKTAEVSVACI FT PDTTVASAAVAAAAPVSVVAVPAVGTTECDQQYDHYCLVCGDKDFTERNKI FT LLCDGPRCRCESQYAGWHQHCLKPALSKVPSGDWLCPQCVEADHRVIEEQA FT AAPLPPVALQPAVDVAGGLAAVAASAAAATMPPAAAPYVRHVASLRPGVQP FT LALDLFSPAARLSGSIEQSRNAPALVIQLQSHYEKMKEIEAHIPWRQLQRP FT VQQLSHEITSHRESAQCPVSMQRVPQGIITAFAAYMAAPLTPTLTVQTPLA FT GSSGSFRAKGLSRHCPGGVDAAAARKDVHAANAAARAAPSNALHEALKAEA FT AAARQLVLKLPIEEGTAVVSRLIWRSQYLARQAVLPGDRGQVNPAGTSFPL FT ASAAAESEAYWVFRLTHGMPPAGLGQARQEVIVRHAIDQARLDRMLLTCKM FT AGRDFGRQFQFTLPGLPSIVIDEPHKLKTAKPQTNDCCFLGKLDGLAAMQR FT AAAQCRSGWASDGDDSSTSAAVAGGAAAAAVHGHAAGAAGVAAAQGQAGAA FT PGAVAHGQATKAAGVAACGQAEAAASAAACKWGEAFLDKGVLLAVANELAS FT EYQATCPAERVERYGNLDPGALLAIIRGETDSQSVPCAQYWIGCVPLHAAL FT RTYGFVREAVVLRTLIEAFEAFDECGWEPVERALRILRRTYMLQRMLGNTL FT FTARVLHGSAKPSLPTYVNGFTLQNVTSWLVNGDTRLEVLQRLPVEWRAEL FT CERSFSSDCCELFFSLLVCLCGGKPSLRVIKGATPRITFMHRLRQQARATR FT GFSMRKRRTRGAYKEHELMLAATLADAGGAAGRVDGSKWNSGVKIVNAALK FT HKREGAINKRVASRVRGKMGWLRNTVHHMLKLVTGQPVPKA" XX SQ Sequence 6736 BP; 1240 A; 2061 C; 2264 G; 1171 T; 0 other; cacgcgggta ccaccccagg gtttgtgggc ggggtcggtg tgtccaagct ccaattccgt 60 ttctcggaac atacatatac atagaatcat agctcaactc gaatataact agctttggca 120 cctaccgtgt actgttgtta agtcgcgatg taattcccca caatgaccga gctcaaagtc 180 tcctattcag atggcacgtt atgtaactac tgtctggtat gtgcaaaggt cgctcgtgca 240 agaggaaaag gcattcgcac tgtcaaccta gaaaactacg cgccatgcgg ggtcaacgct 300 gcggtgctcg cagtccgccg gcagcagggg tccagagctc cagagccatg tgttcatagc 360 acatgttacc ataacacgct gcagccgcgt cagaaggagc tgcaggcgca ggcgcaggcg 420 cagcctcgca cccgtgccgc tgtcaagctc cagcctgcac cccctgccat tcgggcgccc 480 cgcgctgcgg tgggcaagaa agagaagctg gaggagctgc agcgggaagc tcgcgctgtt 540 acaggcttgc tttctccgta ccaggggcgg gcaggtgtgt acgggcagcg caagcaatgt 600 cggaagttac gcacgctgcc tggcgctggg catgggccaa ggctgtaatt gacgccctgt 660 ccgtttgtga ttgcagccac acgtgcgcag gctatcagca tgctggtgac cgctggggct 720 caatcgcctg caggcgttgc cagcccagac tcgccaggtc cgatgggggc gggcgcaatt 780 gttgcctggc gctgtaggct gagcttgaac cgtttcctac tgtgtatgtg acccgcccct 840 cttaccgtgt aggtggcccc gtgcgcatgg acacgagcgc acccccatct cccgagcagc 900 tggggacaac agcagcggct gctggagtcc agtcgccagg ttggtttggc tttattcggc 960 cgcgtggtgg gtatgggggt ggctggatta gagctggcac gatgtgggtt cggcaagggt 1020 gcggggttct cgcagccctc ggacccacat caacagcccc ccggtttatc gccctgcatc 1080 tctcgcccgc gtcgcatgct gacacatctt gccctgctcc atgtaggtgg ccccgtgcgc 1140 atggacacga gcgcaccacc gtcaccagct tccggcccag ctgccggcgg cccagctgcc 1200 ggcggcccag ctgccggcgg cccagctgcc ggcggcccag ctgccggcgg cccagctgcc 1260 ggcggcccgg cgggctacac gggagtgacg cgggaccaga gcgtgcatgg gtacaaggcg 1320 cgcagcgggc ggcagtttgg cagcactgga aacgcgctct acctgggcct ttacgccact 1380 tgccacgatg cagccctggc cgtgaaccac tttcacgaaa tgctggcaga gcgggttggc 1440 aatcgggcgg acgtcccgcc gccacctaac gctgtgccag ctggcgccag cctcagcgca 1500 gcagaggtgg cggttgtgcg gaacgcggtt gacaaggcag tgctggcatt ctgggacaaa 1560 ggagtatccg ccgggaagga gcagctggag gccctgcgca gctctgtggc tgacttcttg 1620 cgagcgcgca gcctggtgtc cgagctggac gctgcagttg gtgccattgc cagcggcgac 1680 gtcaagcccg atgatatggt ggtccagtac accatggccg cgataccgaa cgcagccaag 1740 ctgcgcgagg ggggggccac tgctagcgga gtgcggtggc cgggggtggt caagacagtc 1800 accgcggctg catcagcaca gcgcagcggc gagggttgcc tgaacgtgct gcgtggcgag 1860 gtgaaggatc ccggactgcc attccccagc accagcacca tccggcggtc catgggcgag 1920 gaggaagtgg atcctcaagc tgtcggcgtg ctgcgccgtg gcatccagaa cttcgtggcc 1980 cgctgccggg aggctgggcg actgggggcg gaggccacca tggctgccga cggtgtcgac 2040 atcccggtac ggcaaagtcg tcaaagttct gaacgtaacc ggttgtagtc acgcacaccc 2100 attttgtcag cccgtacgcc gccgtgcgtt tttgactgcc aacagctttt gatcctcagc 2160 cctctctgcc ggctggttga tacagggcat gctggagaag gagatgctcc ccagcggcac 2220 tgtgcgcttc gacggcgatg cctgccacgg cgacgccatc agtgttgcga tgctggccac 2280 aaaggagcgc gagtacaaca gccttgtgca gcggctagat gcagccgtgg cagcgagcgg 2340 cggggactca caccctgctg ctgatgctgc cgcggcattt gctagcgtgc agtttgtggc 2400 gacatggatc cagcatgagg tcctgccgct ggcacacgcg cggctgaaga cagcgcagga 2460 cgcggcggca gctcagagcg cgcggttcca ggcacgcaat ggcgtggggg agcagcgctt 2520 gaagtcacag caggtctcgc ataagtgcgc cagcttgcag cgcattgggc ggctccaggc 2580 tgtggtggcg caggcggagg ggcttttgag tggggcccag ctcgtgcgcg atgtcaacct 2640 ggtgactctt gacatgtgcg ctccgtcacc cgccgacggc agtgtcacag tcagcctttc 2700 cactgctggc agcaacgctg aggcccactg cgcggcactg gtgggcatgc ttgcacgcaa 2760 catgcgtgcg gcggtgtcat cactctacga cgtgacgcgt atggcagcca cttcactgct 2820 tgtcgccatg ctcacttcgc cggacagccg ggtgtcggag atggtggcgc ggatgcacat 2880 cagcagcaag gcgagcgcgc agttgatgcg gtggctggtc agcagcgtca agcaggagct 2940 ggaggcgcaa ggaggcaagc tgctgcttgt cgtggcagac ggcgagcatt tctccctcac 3000 caacgctgca gccagcggcg agctgtcgtc caaccttgcc cagctcaatc gcgagagtca 3060 gcggttgtgc gctgctgctg agcaggcccg acacgccagt gcgaaacggc acggttggca 3120 gcggcgggat ctagcctggc aggactacgc gcagcacgtg cctctgcttc cagcccagcg 3180 ggtgctgcca aggggtgcgt ccatgtctgc actgacgaga atcagcatcg ccagcacacc 3240 acgggcagcg ccgctgctag aggccgcagc ggcacttgag cagcaccaat ccctcgtcgc 3300 ggctactgtg gcagcggctg cgactgccgc ggctgcgact gcggctgcga ctgccgcggc 3360 tgcgactgcg gctgcgactg cggctgcgac tgccgcggct gcgactgcgg ctgcgactgc 3420 tgcggctgcg actgccgcgg ctgcgactgc ggcggcggcg gtggcgaaag ccgcagggcc 3480 gccgccgcca ctggcgcagc agcagacaca gtcgcgacca aagcgcactg cagcggcccc 3540 agcccgactt gctgattatg caacggaggg catgaagcct gcagtagctg ctaagcatgc 3600 caagacggcg gaggtgtcgg tggcctgcat tcctgacacc acggttgcat cagcagcggt 3660 agcagcagca gcacctgtgt ccgtggtggc agtgccagca gtcggcacaa cagagtgcga 3720 ccagcagtac gaccactact gtctcgtgtg cggcgacaaa gacttcacgg agcgcaacaa 3780 gattctgctc tgtgacggcc cgcgctgccg ctgcgagtcc cagtacgctg gctggcacca 3840 gcactgcctg aagccagcac tgtcaaaggt gccatccggt gactggctct gcccgcagtg 3900 tgttgaggcg gaccatcgcg tgatagagga gcaggcagcg gcaccacttc caccggtagc 3960 gctgcagcca gccgtggacg tcgctggggg gcttgctgct gtggcggcta gcgcagcggc 4020 ggccacaatg cctccggcag cagcgccata cgtccgccat gtcgccagcc tgaggccggg 4080 ggtccagccc ttggcactgg acctcttctc tccggccgct aggctgtcgg gaagcatcga 4140 gcagtctcgt aacgccccag cccttgtgat tcagctgcag tcgcattacg agaagatgaa 4200 agagattgag gcgcacatcc cgtggcgcca gctgcagcga ccggtgcagc agctgagcca 4260 cgagatcacc tctcaccggg agtcggcgca gtgccccgtc agcatgcagc gggtgcccca 4320 gggcatcatc acggcgttcg ctgcctacat ggcggcacct ttgaccccca cgctcacagt 4380 gcagacacca cttgccggga gttctggtag cttccgagcc aagggcctgt cacggcattg 4440 ccccggcggc gtggacgcag cggccgctcg caaggacgtg catgcagcaa atgcggctgc 4500 ccgcgccgcc cccagcaatg ccttacacga ggcgctgaag gcagaggcag ccgccgcacg 4560 gcagctggtg ctcaagctgc ccattgagga gggcacggct gtcgtctcgc gattgatctg 4620 gcggtcgcag tacctggcga ggcaggcggt gttgcctggc gaccgcgggc aggtaaaccc 4680 agcaggcacg tccttcccac tcgccagtgc agcggcggag tcagaggcct actgggtctt 4740 ccgccttacc cacggcatgc caccagcagg cctggggcag gctaggcagg aggtaatcgt 4800 ccggcacgca attgatcagg cgcggctaga ccgcatgctg cttacgtgca agatggccgg 4860 ccgcgacttc gggcggcagt tccaattcac tctgcccggc ctgcccagca tcgtcatcga 4920 cgagccgcac aagctcaaga cagcaaagcc gcagacgaac gactgctgct tcctcggcaa 4980 gcttgacggc ctcgcagcca tgcagcgtgc tgctgcccaa tgccgtagtg ggtgggctag 5040 tgacggcgat gacagcagca cctcagcagc agtagcggga ggagcggcgg cggcggcagt 5100 gcacggccac gcggccgggg ctgcaggagt ggcggcggcg caaggccagg cgggcgcagc 5160 accaggggcg gtggcgcacg gccaagcgac caaggctgca ggagtggcgg cgtgcggcca 5220 ggcggaagca gctgcttctg ctgcggcatg caagtgggga gaagccttct tggacaaggg 5280 cgtgcttttg gctgtggcca acgaacttgc gagcgagtat caggccacct gcccggcaga 5340 gcgtgtggag cggtacggta acctggaccc aggcgcactc ctggccataa tccgtggcga 5400 gactgactcc cagtcagtac catgcgccca gtactggata ggctgtgtgc cgctgcacgc 5460 agccctgcgg acgtacgggt tcgtgcggga ggcagtcgtg ctgcgcaccc tgatcgaggc 5520 ctttgaggcc tttgatgagt gcggctggga gcccgtggag cgggcgctac ggatcctgcg 5580 gcgcacctac atgcttcagc gcatgttggg caacacgctc ttcaccgcgc gcgtgctaca 5640 cggcagcgcc aagccttccc tgcccaccta cgtcaatggc ttcacactgc agaacgtgac 5700 cagctggctg gtgaacggcg acacgcggct ggaggttttg cagcggctgc cggtggagtg 5760 gcgtgcagag ctatgcgagc gctcattcag ctcggactgc tgcgagctgt tcttctcgct 5820 gctcgtgtgc ctgtgcggtg gcaagccgtc cctgcgcgtg atcaagggtg ccacacccag 5880 gatcacattc atgcaccgcc tgcggcagca ggcgcgcgcc acacggggct tcagcatgcg 5940 taagcgcagg actcgtggcg cctacaagga gcacgagctg atgctagctg ccacgctcgc 6000 ggacgcaggc ggcgctgcag ggcgtgtgga cggcagcaag tggaacagcg gggtgaagat 6060 cgtgaacgca gctctgaagc acaagcggga aggggcgata aacaagcgtg tggcgtctcg 6120 tgtgcgtggc aagatgggct ggctgcgcaa cactgtgcat cacatgctga agctggtcac 6180 gggccagccg gtgcccaagg cgtgataggc ggggcggggt acatgtatga tcgcgggtgg 6240 atggctggtg tagagggctg cggtatgcac tagtataggg acgacataag taagttttgg 6300 agtgatgagg tccaggccaa gcacgtctga ggacaaccag gagtgagcag gtcacatgcg 6360 ttagcgggag aggcacgagg ggggcacgag gagggacaag gaaccccagg tgacccaatg 6420 acatttttgc acataagttt ttgatgcatg ttctttttac aatggcgatt gaataccctg 6480 catacactgt acgtacgcta tccatcacgc taattacatt ttacttttca cgacaaaaaa 6540 aattccgtgt cccattgcac accccataca tggctgcagc cgtacggcct ttcaggtttc 6600 atctgctcag ctccgcacca tagtaaccgc atactacaca acgaggtatg ctggcggtct 6660 ggtcagatga tacctgtggg gtctaccagg ctgtcaaagt ttggacacag caaatgacca 6720 gagtggtacc cgcgtg 6736 // ID Gypsy-10-I_CR repbase; DNA; CHL; 6207 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-10-I_CR is an internal portion of Gypsy-10_CR LTR DE retrotransposon - a fossilized copy. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 4-bp TSDs; Gypsy superfamily; Gypsy-10-I_CR; KW Gypsy-10-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-6207 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-10_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 6(5), 228-228 (2006). XX DR [1] (Consensus) XX CC This is an internal portion of Gypsy-10_CR LTR retrotransposon. CC The internal portion is incomplete. It encodes a long polyprotein CC composed of gag, reverse transcriptase, PHD zinc finger, and CC integrase. XX SQ Sequence 6207 BP; 1329 A; 1807 C; 1831 G; 1088 T; 152 other; ctggatgtgg ctgtctgccg gctgggagac tctgccgcaa cgtggtattc tggggttgag 60 aactccatta ccacctgggc acaattccaa aacatttttc aagaacgatt tgaggcagat 120 aaggaggagt tgtacaatca gctggcacgc tgccaacagc ggcggcgtga atccgtacgg 180 gaatacgcgg atcggttccg cgacctggca tcccagctat cactcaagac cacggaccct 240 gtccacacct tttccttcct ccatggactt caaccagata tctatgaaca agtgtttttg 300 atgaagccca agacgctaga agccgctatc cgcgatgcca tatatgtgga cgagggtatg 360 cagcgtcgtg tctcaacccc tgccatcaca aatcgccctt ggcgcgagga gccgcgtgac 420 cgctgggcag accgctcgga tcgctacccg ggtcggttcg accgtcgcta tggtcgggat 480 cgtagcgagg aaattgaaga tcaacgtggt aactaccgac caccagaccg tcgggatacc 540 cgccccttcc aacccccagt acgcaactcc atgccactac ctcgaccggg gtcgcaggat 600 gtgcggcgag accagggtcg accgggagac cggcgggatg cgccgcagca gcgccagaat 660 caggccggtg gccaaaacag cgccattgat gacttggcca aacagatggc gcgcatgacg 720 ctcctgttgg agggaatcgc cggggcagga acgcacctga tggagatgtc tgggggcaac 780 caggccgagg cggatgacac gtacattgaa gaagactacg aggcctacca caacaagcgc 840 gtgtcggact ttgactacat tcccccgccc cgacggcgtg tggctgccgg tcccaatcaa 900 atggagcccg agcgcccccg cgttgcgccc ccggcacgcc cagcgcccca agccgtgaat 960 gcaccctcac ctttccctgc gcctcccatg cgtggacccc gccaggccgc tgcgccaccg 1020 ccagtcgcca ctggcgccgc cgtgtatcga ccccgccgag caccaggcga gccgtctggt 1080 cgcacggatc ctgtgccggc agcgaatcct gcgttcgtgg accaacaaat tgcggatgaa 1140 attgtgcaaa gggtgggcaa gtacccagtg cccctgcgca ctgcggtgga gtgcgacccg 1200 gcacgcattt acgctaaggt agggagtcag ctgctgcaaa cagctcgcca gaagcaaccc 1260 atcaacactg cagccgcggc cgctgccatg cctatgatgg cgccaaaccc cgtgccaccc 1320 tggcggcata cggtgacacc ggcgggtccc aatactgggc acgagcccca agcccatcga 1380 gacggtccga cccctatgga gaccaacacg gctgtgcatg ccgcacctgc tggctatcca 1440 gcccagttcc aggtgtgtaa ggcgagggtc ctggtcgcta acgggcgggg cgaatggaca 1500 ccggtgactg ccatcgtgga ctcaggcgct agccatagcg taatcacacg tgacacggtg 1560 cggcagctgg acctgctgga tgacgtgcag gattcgaacg ccacgttctt gaatgcctgt 1620 ggnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1680 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1740 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnncactga actacgagct gctgctgggc 1800 actgatttcc tttacccggc acgggcaaac atttgctacc accagcagcg tctggagtac 1860 acaacagatg ggaacgcccg cgctagcatc cccatccagt ttatgcgact gagtcatccc 1920 gccgctttgc ggacggaaat tgagcaggag ctagagcggg aagtctacct gtcacaccca 1980 ccggtaatgc acgccccact ggagacggtt gaagagactc ggtggggtga agccgtggtt 2040 gagatcgcgg cgacacatgc ggcaccgcct caggagacca cgcagtcatc atctgacgct 2100 tttgtcacct gtgaggaggg caatgacgac aagccgtggt caacagcatc cgtacacgcg 2160 gacagtgagt ccagtggctg gcagtctgac gaagaactgc tgcccccgct gtccgaccaa 2220 cccggcagcg agtctgaggc cagcggcccc gaagaggaga agactaccac ggctgagtcc 2280 ctgaccccgc cggctgttgc cgctgggaca cagcatgagc ccatgatcct aaatggaccc 2340 cgtttcgtcc aggaaacctt gggtacaacc ccggtgccgt ctggcgctga ccatgacgga 2400 cacctggacg ctgcgcgcat gtatggcatc cgtgcccgag cgctactacg aggagtactg 2460 ccctcccccg aattacgtga agccatcttg gctactgttg atcccaagct atccacacat 2520 cagcggcagt tgttggcttg gtgcctgatg cgcaatcacg atgtgtttgc tctcactact 2580 cggcaactgg gctgtactac gtgggtggac tttaaaatcg acactggaga ctgtcctccc 2640 atcgcgcagc agccctaccg catgtcacga gttgagcgcg aggcggttga cagggccatt 2700 gccgagatga aggcggatgg ggtggtcgaa gattcgacga gcccttgggt gtcgcccatc 2760 gtggttgtgg caaagaagaa cggccagcta cgcccttgtg tggacctgcg gaaggtaaac 2820 gcggccacaa gggatgtacg ttacccgctg gggcatattc aggacatttt ggacagcgtg 2880 gcccctcccg ctggtgagga acgctggctg ggctcaatcg acatgctgag cggttactgg 2940 caagttccta ttgtggagga agcgtccagg attcgcactg gcttcgcatc cccaacggga 3000 caataccagt atcgaaggat gcccatggga gctaaggggg ctgcggctgt gttcagtcac 3060 ttagcacgca agatgctggg tccggttctg gacgcgcctg ccccgggtca caacccactc 3120 actctcatcc caccacccga gaacgatgcg gagctggaaa tgtacccgcg tgagcgctgc 3180 actgcggtgt acctggatga tatcttgatc tccagccagt cgttcgaagc gcacctgtgg 3240 catgtgtgca tggtgctaga tctgatacgg tacgctggac tcaaggcgtc tgtgtccaag 3300 tgcgaattcg ggaggcggaa ggtggttttc ttgggtcacc aactggatgg agagcgtggg 3360 gttattacac ccagcccgcg caatgtggcc aacatcgccg cattgaagat tcctacgtct 3420 gtgcgggagc tacgcgcttt cctgggcaca gtgggctact atcgcaacat gattgcgggt 3480 ttcgccttaa tcgcgcgtcc ccttcatacc ctgctcaccc acgagactcc gtggcgatgg 3540 acggcgcgcg agcaggaggc gtttgacact ctgaagacgg cgctcactac cgaaccggtt 3600 ttgagggcgc ctgacttctc acgaccattc atggtacagg tggactggtg tcgtacggcg 3660 gtcgccgcct gtttggcgca gaaggatgag gaggggcaag aatatgcggt tcaattcgca 3720 agtaaagcca tggcaggagc gcagttgaac tactgcagtg cggacggaga ggcgtacgcg 3780 gctgtgtggg cggtaaagaa gttccacccc tacatttacg gaacccgcgc catacttgtc 3840 acggattcca tggcagttcg ctatctgcag actgctaacg ccagtgacct tcacggcaag 3900 ctgggccgct acgcactgat cttgcaggcg tacgacttgg acatctagca caaagccggg 3960 aagaagaatg ggaatgtgga cggcctgagc cggctggggc acctattggg accggatgag 4020 gacgaagggc aggaggagga ttatgaggcg gagacgtact tggatggaca cgaactaccg 4080 gtgttcatgt cagatggcat gactgattcc gccacacgga tagaagggat acgttctgcg 4140 cctagcccag tggtcgtagc cctcaaccta cccacgatgt cttcacgcga accaacgacg 4200 gctcgggacc ttggagcggg tggcccccag gaacccattc ccagggcaca tgatctacct 4260 tttccacacg tgcgcactta cgcggccggc gtggaaacga cgggtggaca cacagtcagc 4320 gctacgacca ggggtcagac gcagttgacg cgcacggctg cacaggcgcc gacgtcggca 4380 ccaaacaccg ctgcaccaac cagccccatg cctatggaag ggcgagatga gaagcggtgc 4440 gtcgtgtgtg agcacgccaa acccagcagt aatctggttc cgtgcaatgc accaggatgc 4500 aacctgtact ggcactttca ctgcttggac ccgcccctcc ctggcgtacc aaaaggccgg 4560 tggtactgtc cgatgcatct aacccagccc tacgatgatg atccccgccc tggcccaccc 4620 tctacgagta gtgacgatga tgtcgacgtg tccagctcca cagctaccac tggcaccggc 4680 gagggcgccg gatctgcagt ggaggagggc gacatggagg cggtcgtcgt gtcagaagaa 4740 gcagaagtag agtcagaaga cggcattccc gcagaaggtg ggcctcccca agccctcaaa 4800 ccccgacgtg gtggcaggac acgagggtcg tctgaagtat ggaacgacac cgcgctgctt 4860 accttgctaa aatccggcgt actgcccatc gatcctgcga agagcggcaa tgaaaacttc 4920 actgagatga tgcggctaca caacaagaag aaggggtatg tgtggagcga agatcgactg 4980 tacaagttga tcaagggtca acgagtatgg tgtccggagc ctggtgagcg cgagtcgctc 5040 attcgcgagg cgcatgggct agcccaccaa ggcattgaca aggtcatgag cttactagcc 5100 ccaagctgct actggcccaa catgcggaac acggtcaagg ccttcatcga gaagtgcacg 5160 gagtgcaagc ccggacgagc taagttgatt cgcaacttac cactgcgacc gcttcacgtc 5220 gtaccaatgt gggcggtagt tcatatcgac ctggccggac cactgaagac ttcgacgggc 5280 ggcaatcgat atatcattct ctgcgtggac ggatggagta agtggctgga agtgggtgcc 5340 attcccaaca aggaggcctc ggtgtgtgct aagtggttct ggcagacgtg gatctgtcgc 5400 tacggtactc ccggcgcggt agtcaccgac cgagggcgtg agtgggatgg cgagtttcac 5460 aacctcctgg tggcgcagcg gattacccac cgccacacag cacccaacaa cccccaagct 5520 aacggccaat gtgaacggct agttggtgta atgatggaag ggttacggaa gaaggtcaat 5580 gagaatgagg acgattggga ggagcgaatt cacgaggtgg cctacgccta ccgagcagca 5640 cggcaagata gcacacgcgt atcacccgcg ctgtgcgtgt ttggacgaga actcgccctt 5700 gccgctaccc gtgcagctcc ggacccgaac ccacctgctt ggggagaaga cgaggaggag 5760 gcacagggtg tacaaccgca ggattatctc gcattggatc agcggcgtga agcactcgaa 5820 gcgctggtgc cgcgtgtgga cgagaacctg gtgcgggcaa aagagaggaa tgagaagaat 5880 taccacaaga ggcgcgcgcc ggcgaaaccc cgcgggcccc cctccaagcg caggaaggcc 5940 gggaacgaag gcaaccccgt tatttcggct gcggcagtca aaaatgtcct acctgccctg 6000 gacaccgctg gacctggacc ttcccagccc ccggttccaa ctgctaccac tgggaagccc 6060 cgcgcccata tgatggcgtc gcccccgcta cagcccgccg gacccctccg gataggcaac 6120 actgcggtaa aaccctccga cccgccggag cagcaaccgg tccaatcatc ggacgctctg 6180 agtcatcttc ccgacttacc agagaat 6207 // ID Mariner-N3_CR repbase; DNA; CHL; 654 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE Mariner-N3_CR is a family of nonautonomous Mariner DNA DE transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW mariner superfamily; Mariner-N3_CR; nonautonomous DNA transposon; KW TA target site duplications. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-654 RA Kapitonov V.V. and Jurka J.; RT "Mariner-N3_CR, a family of nonautonomous Mariner transposons in RT the Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 131-131 (2004). XX DR [1] (Consensus) XX CC Mariner-N3_CR is a family of nonautonomous transposons. The CC consensus sequence is less than 5% divergent from CC Mariner-N3_CR copies. Features of Mariner-N3_CRs include 25-bp CC terminal inverted repeats and TA target-site duplications. XX SQ Sequence 654 BP; 149 A; 226 C; 145 G; 131 T; 3 other; ccgtagtgtt acaaaggttt cgggattaat actttcgggg agggagccta aaacatttat 60 gaaaatattc gggaaaataa atattcggga cgcccgccac ggagattttc gagcttgatt 120 cgataattcg gggcaccgct cggaatgggc ttgcggcacg gcctgtggcc cttgtgacag 180 ggcgcctctc gtgtaccatt caccagaacc gcgtactcct catgccattg tgcatacaca 240 gcaccagcca cgagccacca gtcaccacca gcttaccccg cctactcccc aagcccgcac 300 ccacgcttgc acccctcacc cgcaccccca ccggcacccc acgcccacac cggcacaccc 360 ccacctgcac cccacgccct caccccacgc ccgcaaccct gtgtcyaagt tgtagagaaa 420 atttctaagc gaaaatcgct gcatttacgc tggtcccatc gccttgcgtg ccgctgtgac 480 ccgtccgcac cccccacccc ggcgcgcgga gcgcgcccgc gaaaaaccaa atttcgtgct 540 gttgatactt attttcggga agccagggcc attaatawtt atytaatagg ttcgggaaac 600 agattcgggg cctctccacg aacttagttt cccgaaacct ttgtaacact acgg 654 // ID DNA3-1_CR repbase; DNA; CHL; 1989 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 17-AUG-2004 (Rel. 9.07, Last updated, Version 1) XX DE DNA3-1_CR is a nonautonomous DNA transposon - a consensus DE sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; 3-bp TSD; KW DNA3-1_CR; nonautonomous DNA transposon. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1989 RA Kapitonov V.V. and Jurka J.; RT "DNA3-1_CR, unclassified nonautonomous DNA transposon present in RT the green algae genome."; RL Repbase Reports 4(7), 193-193 (2004). XX DR [1] (Consensus) XX CC The genome contains ~20 copies of DNA3-1_CR that are 99% CC identical to the consensus sequence. This transposon is CC characterized by 3-bp target site duplications and 30-bp CC terminal inverted repeats (4 mismatches). XX SQ Sequence 1989 BP; 410 A; 539 C; 570 G; 470 T; 0 other; ccctatggca cccaaatggt ggtaacgatc caaaaacggg ggtgacgatc cccggacagg 60 gggtaacgat tgtttcgggc ccgcacggcc tggcacgccc aggacgcaag ccaagtggtg 120 ctgtggtcac ctcttagcta cttgtcatgc atacgcacat tgcggactgg gcggagtgcc 180 tttcgtcgcc gacaagtgaa catgccgaca ttctagtctg caaagtgcgt attgtagtga 240 cagtttgtgt taaaacagtc taatatgtga agccttaaac ttatgtagtc tttgcgctcg 300 cgaacgagga agtctggggg ctgtaggccg actgtgtgac attgtgacag ggacccttgg 360 gcgtggtaac gaccctaagc caggggtaac ggggccttcg cgaggggtaa cagggttcct 420 gcatggggtt ctcgcgggcc tgacttaaca agcagcatag agcggcccct gatactacta 480 cacatatact gatatagtac catacatata ctgatacact gcacttcatg ccggtgccgc 540 tgcacttctg ttcctctccc ccactttgct ctctcgcgct tgctacttct cttcgtaaca 600 caaatttatg ctagctcaaa tgtcagtaga acaagagctg gcggcagacg ctgcttggct 660 ggcttgggac gcgagtcctc acgctgctga ctgctcgttg accggcttgg caacgctgct 720 cacacgtcag ctgcagccag ctattacgtg agagaaacga ctatgcctca taatctatca 780 gctgcgcttc aaatgtccga ccgagaagtc tcggtcgccg gcgtcaacaa cgccattcaa 840 catcgcgttc tgtcaacatc accctcgccc cgcacgccat agtccgcagg ttagtccgca 900 ggttagtccg caggttgaat gacgatggga tgcgttgaac ctagtggtac ctcgcaagcc 960 agtgtgcatg gttcatcttc tcgcgctgtg ttgggttgtc tggtggagcc cccgcctaca 1020 actagtgact gtctgctgcc tcccttacca gcgctcgctg taaggtgttt ttatggctcc 1080 cttgccacgg tgccttgtgt ctcacatacc accccgcccc cctccgcacg cacacaccct 1140 gccccgcgac agcttgctgc ccaacccctg ttgtacccta tggaccgtac ggttggctgg 1200 actgccccta cacctccagc attgtgctca tcaacggtga gtgtccgtga cacgtgtggg 1260 cgcacacaca tgcacatgtg cacacagaca cgccttgtgt ggtaggcccc gtgcactgcc 1320 aagcactctg tccatggctg gccttgacat gcattggcac ctacccgtct tgtggcagac 1380 gattaactgg gtgtgtctat gtgtgtgcac gcgtgcatgc aggcgctgcc ggtggaagca 1440 aaactacagg agccagcgtg acaccagctg tctggcgcta gagatggagg cgggcgcggc 1500 ggcgccagtg gcatagtttg gcatggtagc tgtggtggcg gcgggcgtca aggtggtggg 1560 tcagagacgt tgtgcccagc agacatgcaa gtggttgctg ccagtggcta gttgcacgtt 1620 cacacgcaga catgacttgg cgtattgttg catgtacgta tggacttgcg gtagtgttgc 1680 ataagtgttg catgagcatc tgcacttgtc gtgctatcaa gccttgacag atggctagca 1740 tgtggagtag cggtaggact acatagttct tacgttagta ctgatgtagg ccttcagcaa 1800 agcacaaggc tgccatgtaa cgggcctcaa agtgatggtt actggtgtgc cttgtagacc 1860 gtgacgaggc tgttaccatg gctggttaag atggtaagca gatcgtagcg agatcgtaac 1920 gatgccggag gggtaacgat ctacgaccta ggtaacagcg atcgtaacga tcgtttgggt 1980 gccataggg 1989 // ID EnSpm-N1_CR repbase; DNA; CHL; 757 BP. XX AC . XX DT 28-MAR-2006 (Rel. 11.03, Created) DT 28-MAR-2006 (Rel. 11.03, Last updated, Version 1) XX DE EnSpm-N1_CR is a family of En/Spm nonautonomous DNA transposons - DE a consensus sequence. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N1a; EnSpm-N1_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-757 RA Kapitonov V.V. and Jurka J.; RT "EnSpm-N1_CR, a family of En/Spm nonautonomous DNA transposons in RT the Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(3), 120-120 (2006). XX DR [1] (Consensus) XX CC This is a nonautonomous EnSpm DNA transposon characterized by CC 3-bp TSDs and 25-bp TIRs. XX SQ Sequence 757 BP; 162 A; 218 C; 231 G; 146 T; 0 other; cactagggcg caaaagtcgc gtccaaacag acaccggcgc cagcccgacc tctcccgcgc 60 gggagcgcgt tgatcccgac ccggtaagtc ccatcgtggg acagtacccc caaggcccct 120 tccatagcgc agatgttcct gggcgctctc cagcgtgcat gcgggttcta ggtagccacg 180 cccacgctgc gaggctggga aaaaggccca ttgtgacagc aagccagctt tcgcgccatt 240 ggttgccggc gacagccctg tggtgtagca gcttgggtat gcatagcttg ttggggtacg 300 ggtgggcaat gactcccaac cttgcgcgcg taggatttat gagggcatcc tgccccctgc 360 gcaaaagcat gaatttgcac ccatgcaatg cgcgagggga gcagctaccc accgcgcgct 420 gccctgctgc caggtatcaa ctggattatc aatcccaata cccaggcatt tagctggcat 480 ggctgggaag cgcgagggaa aaaccggcat tcctgaaacc cgccactgac tgtagccttt 540 tctcaacacc attcgaaagc gtttgtcaaa ccgtacagca ggctggacaa tacgcgcagg 600 gaggacggtg cgtttaatag gggtacggca gcaagcgttc tcggcctagg aagcagctta 660 cagagtgcgg ggcagttctg taaatagagg gtatcaaatt tgatgggcgg agtgccttgg 720 ggtgcaggtc gctggacgcg acttctgcgc cctagtg 757 // ID RandI-6 repbase; DNA; CHL; 13822 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 17-AUG-2004 (Rel. 9.07, Last updated, Version 1) XX DE RandI-6 is a family of non-LTR retrotransposons - a consensus DE sequence. XX KW RandI; Non-LTR Retrotransposon; Transposable Element; KW AP endonuclease; RNase H1; RandI superfamily; RandI-6; KW reverse transcriptase. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-13822 RA Kapitonov V.V. and Jurka J.; RT "RandI-6, a family of RandI non-LTR retrotransposons from the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(7), 201-201 (2004). XX DR [1] (Consensus) XX CC RandI-6 is a family of RandI non-LTR CC retrotransposons. Approximately 50 copies of RandI-6 are CC present in the genome. They are ~1% divergent from the CC consensus sequence and are likely mobile. The RanI-6 elements CC are usually flanked by 10-15-bp target site duplications. The CC consensus sequence encodes a 4527-aa RandI-6p protein CC (nucleotide pos. 44-13624) composed of the AP endonuclease (aa CC pos. 2010-2290), reverse transcriptase (aa pos. 2660-2960), and CC RNase H1 (aa pos. 4025-4190) domains. A 144-aa portion of CC RandI-6p (aa pos. 1463-1616) matches the Josephin domain (E < CC 0.001; pfam02099) and is conserved in other RandI polyproteins. XX FH Key Location/Qualifiers FT CDS 44..13624 FT /product="RandI-6p" FT /translation="PARTHRLAPTGPSPEDSVAEPCPSLPSNDQKLGKTGA FT GSEAANCPGNAETRPVARLPCKGASHPYHKGHGSKLLGITVLRRDPEPTFS FT RSSYRPDFHQPPRARQHPLSDGKTKQQHSHQQRHTKKAQTSQTTTSQTPRN FT ARAPRLTTNNTQQHSVTPQHQHTRTRPLEPTQDNPTPKNPTPDNDALTPRK FT LNSQRQNNPTNPQNTEIKRKPLPSHLSSFCFLTLTLALPMLLLTPFASIEL FT TSAPQMHTSPVTLEPASAMRPHAAAGLTQTAAPSHHASLSLATKMKPFHVD FT FTGDIILRPDLHYLYNAAGLTVGILHPLPEGNIALWQTQRLRPAAGPIPAS FT VLGLADGKQRILLMTACAGCLNRAMSLSSLLEPAGNYLRVIQQNLLHTKID FT PRLRQLAEAAITKFNEVLSSRGDLVNSIADAAIKAIGMPATNEAINNIRSL FT IYEARAGSQTWTLSDNQKVAALGSKRTAFTAGELLLHSACITRHPILILLL FT TRRHCLAVAITLPVAHAAAFLLDLQTHERWILELGPDNSITPAKRRPTVAI FT LAGASRSAPNSEGKVSFLAASEGRHDLEKVITCSYTTDQAHSPVYSPLFRH FT APNYRLKCSQLLPAYTSTTPPYPQVLPFTAIMVVQCAARAQDAAGDPNKVV FT AHMAYEAMDKICAGKPTTPLSTIVEGTKHHSKAPGNYTSTQTYGSSPPIAH FT ALSPNTPLQAPPTPPPVMPRSTEAMTAKDPGETATANIRDWWGSPLGACVD FT VRFVKLHDADLLASLGANRNQLYGDVFYAHFSSADVCEAIQRTGGLYLFSD FT TAYLRAVCLQSRPDGGAATPEMMDLLEEIGIQIELLDPLLSDTGVNELGAR FT VAQQLEQGMKALFPDIAQHDHLTNRIGLYTTFGTVQSLTTLDRDNPVEFLF FT ARKGGKLAPIIDLQPAAWQANDRQLFHLSDSRLYLLPCNALSAAAILICIN FT GGRNLASTMLWPELNRGRVYLTDAALDYQYKLNTGSADELWEHPCAHLGLA FT AGLALTGQAGAGMVPDFVGAKPIHAARAKTAVRHQLQAADAPEPLPMPGLQ FT LNLGESGPVLDAKTVEAAAEDEMEEMDLASLLVSSALGLKPGSTAPAFTAS FT TTTHPSSTHPHLTDKPTESATPPNCLTPHSSQCRAPSASAEPTPRHNAPPA FT TCRLGRLRSLPHTPGYAGILINASGYSGTHSTAYWDRVAQTRVAQVTEQGN FT DAGSSKDRRMHDYAVRQARGDVGPDCAVVTMHGTIGTLTVLTHAGLARDTN FT GVCREVHLAQINGWTNGPLASRMLLFYVFLLLPIAFRRYGQYARGGPKDDK FT CVSAAVDSILRAADGRLRPGLLAEDSKANCTRAMRVRNPPLHIASCSQDNA FT RPSVIRRHHELCLAHATATRRTNSPALTKTRCCPPSPARIRLYFPHASRSA FT PRPKPPALADARQAVQGQAKGRGTQCLLFTAGSKPPASSSAPPSSHSGRIY FT FEQQELSFCWIHAINNLLGAAAVSPHRVLLYLEQLLRQTRGRENTTEIFAL FT SQAFTPAIGQFSIHAVNHYLMYNCHRANQALALHAHDAVRPGDSPHSVTRR FT HAQATNGFLIFTPGYSSGIYGHVLAVKRHGANWIILDSADTQPRTIQHTTT FT LDMAGTIFTLSTRTPDTPPLLPLCLNGIGPPANWAPDRMGQPSFRALNQLA FT NHLTANGHQLLDTCSWGNNCAIFGTLAVLPGSRVPLIPSDPAQWAPMTGAI FT RACIASRLEHLTAPGTPGSDEMRTHILGLYEESPHPENTIRTHCRNIRDTS FT MLTIPDLIALQSLLPGVQIHVISPGYTDWLTLPTPQPNDTRHDVYLAHVPH FT SWALAGFPTAFPVTDCNHYIAVTRHTWTPGSVTQDPSPLFHESIWDGGPCT FT RDIIVTHAKSPTRSPLPKKARTNTITHALLRAMGSPTNPRKAAKRALSLSP FT PAPRKTTRATTNCEAKRAPNTRAPANPHTPTATPAVTNISPQQSPPAAPPN FT PPPRLAVKARQATLQAFLKAAQAPREDPATQVAPTHTDACHLVTLNARGLL FT SNLTAIRDELETWDADILILTETKTRAHRTRTERDIIKNSLKAYTSFVHST FT PRPPPLKGQKQDPRRHHPSGGVTLAVHTRYARKEHITCLPTPAQLQGYMVH FT LRIATHSSLPLHIIGIYCPPDNKSTRDALYTYITRATKSAARLKEPLLTGG FT DWNAAAYDTDRSTRTGTQGDKTHQTFLTRTSLKPAAGAHTQPRLYTYTQIQ FT HDTITHESRIDDFLANTHALRQDSPAREWTTECAGTTDHLALHLQTTLPYK FT PPRDASPHTEREPRRQPDDASPTLANIVKPITQSQSRAISQQWEAEGTTAL FT VTLHATLALPNCSAQQIIDSTVAAISHHTNSTASTRTAHDIITGARAQLSA FT IDLNAASTALYGALAVGAKLMLQTCPQKPPRTSRAKPSRTEAKKLKRLIAV FT EKQKKQQYLALVRAAQLQEPAIPPPTAPNPSQPEPPATPAPATAARQEWQD FT ALKTRRTTQKETSAKAASKWLTKISKLFATSQSKAAKIMLGKSGPRQNLTA FT VRHPATQHIATTSSDINDALHAHFQAAAEPTGGQRTGRYRPEDKDTTFPWN FT EPGAPDGGFLLARTAHPDAQSLLPNVADSSNFQRCMQALSRNKSPGPDGIP FT NELLKCMPEALQSAVHTLMCVAWIKAELPAGWSTSDTILLYKKADPLSISN FT YRPIALCNTLYKLYTSLVTRAATEACTDLLIHSEAQEGFQKHKGTERQIMN FT LLHVIEDAALTKRDLYVAYIDFSNAFGTIDHDRLLTIMHHLGLPTDFIEVV FT RNLYSSANTTIRTTHGRTNPVHIRRGTMQGDTLSPLLFIIFLEPLLRWLHY FT GGRGYRYGCLTNSENDEHNCSSLGYADDLTVAANSRDNLRCQTDKIARYAA FT WGGMRINRDKSACTGILHHYASANSTDKSIKGPACDRTLKRLLAGHIHVGT FT CTKDGKRSAITLPYIPPHKPYRYLGVEITMTHPQLKAVKADLTDAGVELSR FT YRKLPRQCYMLIKQCLKAKVAYSLPTMAYTQPDIRALDAQIAGIARRCVGL FT PRDCATRAILLPAELGGLGIGSLREEYVQRASRLLVISLNDTGRLGVVTRR FT LLHLQAAHAGSDLPLHDMKRAGQFFVTLKQLTLLQDHDLLAMLQGKEFTAA FT LPALVSKLRAAEATTPAYLRADWISPLTIDLGLNLADLVDANTGRHIIDTT FT QLDRRFPKGRVKDKHKRALNRFTVALCKPAEAPPKSMGHKPLPMQSRTLPA FT GVSKADLNSAGDLPKPIPQRTLTSLLARWQEALPSPAAPPKPSLEVLQAKP FT KQTAKQRHSHFIKHDTRTRNPEVEANDDALPGSTVHDPSDPTKDLAFYMSA FT AREFAPLAQQCKTLGTTRWTGTWRSPIRQMPARALITGPHTHTVHALLHTR FT PYQQPAAPTYQRLYLPDGCPATTPPPAPAHSHNRRKRGSQPTRCPYTHHAI FT QPRAPTRAAAPRRDSAAAQPSSRPAWRKVRDNALNGRLPQSIAGVQAAQHI FT TAHLTNRDDIIAAVVSGPMTHKTNAPPPKRKRKQGPPPEEPPWAERCRYRV FT RWQPTLTTNGILRAYELQNYNATSTTKPPCRLWGQGIRRFMVQVHWAETED FT PLSSLSENPGWAAALARYTLAQAVDAPPDPTPPPALDNHLTSKQRQGRWRD FT PAPSEAARKRAILRCSTIDVKSCNPDTDIHPTGQYCIQMGTREPGSATGTP FT SGPPQDPDPPARVYGPDGRCIGTVSAARLALLHARYRTLNPTHPPERFAAA FT LASLIIRYRPSKRHKESSTNPAHEVTLCPDLMSTLTSHIKFQTEAFASPLN FT AHCDIPAYYTTHPQDSEFGAAHDCYSRTLDGISFVHPPPQPEHTSLAMAWA FT VASAEAATPHTPHLSVVVVPYGDTSGYTAWLQHPLVHDLGHIKPAHTWAAM FT ASGLGQPQELPAHRRGLHLLCVTNERGRLLLNQALESGLALALADALGHTP FT HLATRKPPTDPDTPNPWPPTPRKLKRRILAQSSQTPQPQPPRDAPTLPSPT FT ELATLYRAYVPLAHTRGSAVFTDGSKVPHTREYTGLGAAVWDASDNRATHI FT DEDIAFNTPLSTILRCESSAIRTALRLHRDTPHLRVYTDSLVSLFILRRAI FT MEPESLHVSKHREMFGTIVDQLAERALAGHKTSFYHVRSHQTDTCVGNQKA FT DEAAQAVATRTVQPDATETLYSHPYEPLEGWIAQKEIVEHNGNLEERLRYA FT NDLNRDLKKMARKSTATGYTKSGLYAELWEAARKHIDERYGLNLFDKFSFE FT EARCAFKYRWGLSWNAKRAARVKHPMPGCAGGECPLCRGDDGGTHTLGGCT FT HPRMKAAYIARHNRAVQRIAKAISQGHYGGCVMYMDAGKDAPTYSGGTRIP FT AWMLPSTPESVRKKLRPDLLVIPTLRMHRSGLPARAPRTRRQRSAHKVYVI FT EVGYCSDLNHPDKRAEKLQQHEDLVAALRDAGWDVMYSPDTVISLGHTGTV FT TPHLHALLKTLGCPTQSAHSTCRKLVQHAVHTTTAITALRREIYSHKWPGR FT PP" XX SQ Sequence 13822 BP; 3345 A; 5165 C; 3208 G; 2104 T; 0 other; agaatgttgc caatactctt gccaatcgat gctggctagt tagccagctc gcactcacag 60 actggcgccg actgggccaa gtccggaaga tagcgtggct gagccatgcc ctagccttcc 120 tagcaatgat caaaagctag gaaaaacagg agcaggctct gaagcggcaa actgcccagg 180 gaatgcggag acacgaccgg tagcgagact cccctgtaaa ggggcctcac acccctacca 240 caaaggtcac ggttcaaagc tccttgggat cacagtgttg cgacgcgacc ccgaacccac 300 gttctcacga tcatcctatc gaccggactt ccaccagccg cccagggctc gccagcaccc 360 tctgagtgat ggcaagacta aacaacaaca ctcacaccaa caacgacaca ctaagaaggc 420 acagacgagt cagaccacga catcgcaaac cccgcgcaac gccagagcac cgcgacttac 480 caccaacaac acacaacaac actctgtaac accacagcac cagcacacgc gaaccagacc 540 cttggagcca acccaagaca acccgacccc caaaaacccc acccctgaca acgatgccct 600 taccccaaga aaattgaatt cccagagaca aaacaatccg actaacccac aaaacactga 660 aatcaagaga aaaccactgc cctcacacct ttcttctttc tgcttcctca cgctcacact 720 cgccctcccc atgctattac ttactccatt tgcctccata gaactcacaa gcgctcccca 780 aatgcacaca tcaccggtga ccctggagcc tgcaagcgcg atgaggcccc atgccgccgc 840 cggccttact caaacagcgg cgccctcgca tcatgcttct ttatctctag caacaaaaat 900 gaaacccttt cacgttgact ttaccgggga catcatcctg cgcccggacc tgcactacct 960 ctacaatgca gccgggctca ctgtaggaat cctgcatccg ctccctgagg gcaatatcgc 1020 gctatggcaa actcagaggc tgcgtcccgc cgcgggacca atcccggcat ccgtgctcgg 1080 ccttgcagac ggcaagcagc gcatcctgct catgaccgct tgcgccggct gcctcaaccg 1140 cgccatgagc ctgtccagcc tgctggagcc cgccggcaac taccttcgcg ttatccagca 1200 aaacctgctg cacaccaaaa tcgacccgcg cctgcgtcag ctggccgagg ccgccatcac 1260 caagttcaac gaggtcctct catctcgcgg cgaccttgtc aacagcatcg ccgacgccgc 1320 catcaaggcg atcggcatgc cggccaccaa tgaggccatc aacaacatac gctcgttgat 1380 ctacgaggcg cgcgctggca gccagacctg gacgctgagc gacaaccaga aggtggccgc 1440 ccttggaagt aagcgcacag ccttcaccgc tggcgagctt ttactccact ctgcttgcat 1500 cacccgacac cctatactca ttctgctcct aacccgtcgc cactgcctcg cagtggccat 1560 tacgctgccg gtcgcccacg ccgcggcgtt cctgcttgat ctgcaaaccc acgagcggtg 1620 gattctggag ctgggcccgg acaacagcat cacgccggcc aagcgccggc cgactgtggc 1680 catcctggcc ggcgcgagcc gctcggcgcc caactccgag ggcaaagtct ccttcctcgc 1740 ggcctctgag ggacgccacg acctagagaa ggtaattacc tgcagctata ccacagatca 1800 agcgcactcc cctgtctata gccccctctt tcgacatgca cccaattatc gcctcaagtg 1860 ctcacaactg ctgcctgcct atacgtcaac aacacctccc tatcctcagg tactcccctt 1920 cactgccatt atggtagtgc agtgcgcagc tcgcgcccag gacgccgcgg gcgaccccaa 1980 caaggtggtc gcgcacatgg cttacgaggc catggacaaa atctgcgccg gtaagcccac 2040 cacgccactg agtactattg tcgagggaac aaaacaccac agcaaggctc cggggaatta 2100 caccagcacc cagacctatg gctcctcccc accgatcgca catgccctct cgcccaacac 2160 cccacttcaa gcgcccccta caccgccccc cgtcatgccc cgatccacag aggccatgac 2220 ggcgaaggac cccggtgaga ctgccaccgc gaacatccgc gactggtggg gctcaccgct 2280 gggcgcctgc gtggacgtgc gcttcgtcaa gctgcacgac gccgacctgc tcgcctccct 2340 cggcgccaac cgcaaccagc tgtacggcga cgtcttctac gcccacttca gcagcgccga 2400 cgtctgcgag gctatccagc gcacgggcgg cctctacctg ttctccgaca ccgcctacct 2460 gcgcgcggtc tgcctccagt cccgcccgga cggcggcgcg gccacccccg agatgatgga 2520 cctgctggag gagatcggca tccagatcga actgttggac cccctcctca gcgacacggg 2580 agttaacgag ctgggtgcgc gcgtagcgca gcaactggag cagggcatga aggcgctctt 2640 ccccgacatc gcgcagcacg accacctcac caaccgcatc ggcctctaca ccaccttcgg 2700 caccgtgcag agcctgacca ccctcgaccg cgacaaccct gtcgagttcc tgttcgcccg 2760 caagggcggc aagctggcgc cgatcatcga cctgcagccc gccgcctggc aggccaacga 2820 ccgccagctg ttccacctgt ccgacagcag gctttacctg ctgccctgca acgctctcag 2880 cgcggcagcc atcctgatct gcatcaacgg cggccgcaac ctcgcctcca ccatgctgtg 2940 gccggagctc aaccgcggcc gcgtctactt gacggacgcg gctctggact accagtacaa 3000 gctgaacacg ggctcggccg acgagctgtg ggagcacccc tgcgcacacc tcggcctggc 3060 tgccggcctg gcgctcacag gccaggccgg cgcgggcatg gttccggact tcgtgggcgc 3120 gaagcccatc cacgccgccc gtgccaagac ggcggtgagg caccagctac aggcggccga 3180 tgcccctgaa cccctaccca tgcctggcct gcagctcaac ctgggcgagt ccggccccgt 3240 cctggacgcc aagaccgtgg aagccgctgc ggaagatgag atggaggaga tggacctggc 3300 cagcctgctg gtaagctccg ccctgggcct caaacccggc tccactgctc cagcctttac 3360 agcgtcgaca acaacccacc cctcaagcac acacccacac ctaaccgaca aacctactga 3420 gagcgccacc cctcccaact gccttacccc ccacagttca caatgccgag ccccaagcgc 3480 cagcgcggag ccgacgccgc ggcataatgc gccgcccgcc acctgcagat taggacggct 3540 gaggtctttg ccgcacactc ctggatacgc tggcatattg attaatgcat cggggtactc 3600 ggggacgcat tcaaccgcat attgggacag agtagctcag acgagagtag cacaggttac 3660 ggaacaaggg aatgacgcgg gtagtagcaa ggaccggaga atgcacgact atgcagtacg 3720 acaggcgcgt ggggatgtag gccccgactg cgcggtggtg acgatgcacg gaacgatagg 3780 cactttgact gtgcttacac atgcgggact ggcaagggat accaatggtg tgtgcaggga 3840 agtacatctc gcccaaatca atgggtggac aaatggcccg ctcgccagca gaatgcttct 3900 gttttatgtc ttcctgcttt tgcctattgc ctttcggcgg tatgggcagt atgctcgagg 3960 cgggcctaag gatgacaagt gtgtatcggc ggctgtagat tcgattttga gggcggcaga 4020 cggacgcttg cgacctgggc tgcttgcaga agatagcaag gcgaattgca cgcgggctat 4080 gagggtcaga aacccgcccc tccatattgc aagctgctcc caggacaacg cccgccccag 4140 tgtaatccga cggcaccatg aactctgtct ggcccacgct accgccacca gacgcacgaa 4200 ctctccagcc ctgaccaaaa cacgctgctg cccaccatcc cccgctcgca tacggctgta 4260 tttccctcac gcctcccgca gcgccccacg gccgaagcca cctgcactag cagatgcgcg 4320 acaggccgta caagggcaag cgaagggacg cggaacacaa tgcctcctct ttacagccgg 4380 cagcaaacca cctgcttcct cctccgctcc acctagctcc cactctggac gtatctactt 4440 tgaacagcaa gagctctcct tctgctggat acacgccatc aacaacttgt taggggcagc 4500 tgccgtatcc ccccaccgtg tcctgttgta tctagagcag ctcttacgtc aaacccgagg 4560 acgcgagaac accaccgaaa tttttgcatt gtcgcaagcc tttacaccgg ccataggcca 4620 gttctcaata cacgctgtca accactatct catgtacaac tgccacagag caaaccaagc 4680 actagccctc catgcccacg acgctgtacg ccccggcgac tccccacaca gtgtcacacg 4740 ccgccacgca caagccacca atggattcct catcttcaca ccaggctact ccagcggcat 4800 ctacggccat gtacttgcag tcaaacggca cggcgctaat tggatcatcc tcgactcagc 4860 cgacacccaa ccccgcacca tacaacacac cacgaccctc gacatggccg gcacaatctt 4920 cacgctgtct acccgcaccc ccgacacgcc acccctctta cccctctgcc tgaacggaat 4980 tggaccaccc gccaattggg cccccgaccg catggggcaa ccatctttcc gcgcactaaa 5040 tcagcttgcc aaccacctca ccgctaacgg ccaccaactc ctagacacat gctcctgggg 5100 caacaactgc gcaatctttg ggaccctagc cgtcctacca ggcagccggg taccactcat 5160 accctccgac ccagcacaat gggcgcccat gactggcgcc atacgagcat gcatcgcatc 5220 caggctcgag catctcaccg caccaggcac cccgggcagc gacgagatgc gcacacacat 5280 cctcggcctt tacgaagagt ccccacaccc agagaacacc atacgcacac actgccgcaa 5340 tatccgcgac acctccatgc ttaccatccc tgacctgatt gccctccaat ccctcctacc 5400 aggcgtccaa atccacgtca ttagccccgg ctacaccgac tggctcactc tcccaacacc 5460 ccaacccaat gacacccgcc atgacgtgta tctagcacac gtaccccact cctgggctct 5520 cgccggcttc cccaccgcgt ttcccgttac agactgcaac cactacatcg ccgtcacacg 5580 tcacacctgg acccccggct ctgtcaccca agatccctca cctctgttcc acgaatctat 5640 ttgggacgga ggcccgtgca cgcgagacat cattgtcaca cacgccaaat cacccacccg 5700 cagcccacta cccaaaaaag cacgcacaaa caccattaca cacgccctcc tgcgtgccat 5760 gggctccccc accaacccac gcaaggctgc aaaacgcgcg ctctccctat cccccccggc 5820 accacgcaaa acaacacggg ccacaaccaa ttgcgaagca aagcgagccc caaacacccg 5880 cgccccggca aacccacaca cacccactgc aacaccggcc gtaaccaaca tttcccccca 5940 acaatcacca cccgccgcgc ctcccaaccc cccaccgcgt ctggctgtca aggcacgaca 6000 agccacactc caagcattcc tcaaggctgc acaggcgcct cgggaagacc ctgcaacgca 6060 ggtggcaccc acacacaccg atgcatgcca cctcgtcaca cttaacgccc gaggcctgct 6120 ctcaaatctg acagcgattc gagatgaatt agagacctgg gacgcagaca tcctcatact 6180 caccgaaacc aaaacgagag cccatcgaac caggacggaa cgagacatca tcaaaaacag 6240 cctcaaagcg tacacgtcgt tcgtgcacag cacaccccgc cccccaccac tgaagggcca 6300 aaaacaagac ccgcggcggc accatccctc cggcggcgtc acactagccg tacataccag 6360 atacgcccgg aaagaacaca ttacatgctt gcccacacca gcacagctgc aaggatacat 6420 ggttcacttg cgaatcgcaa cacattcatc cctaccgctg catatcatcg ggatatactg 6480 cccaccagac aacaaatcca cccgcgatgc attatacaca tacatcaccc gtgccaccaa 6540 gagcgcagcc cgcctaaagg aacccctcct aaccggcggg gactggaacg ccgctgctta 6600 tgacacggat cggagcaccc gcactggtac ccaaggcgac aaaacgcacc agaccttcct 6660 gactcgcaca agcctcaaac cagccgcagg cgcccacaca caacccaggc tgtacacata 6720 cacacagata caacacgaca ccatcaccca cgagagccgc atagacgact tcctggccaa 6780 tacccacgcc ctgcgccagg acagccccgc tcgagaatgg accacagaat gtgcaggcac 6840 aactgatcat ctcgctctcc acctgcaaac taccctaccc tacaaacccc cgcgcgacgc 6900 ctccccacac actgaacggg aaccccgaag acaaccagac gatgccagcc caacacttgc 6960 aaacattgtc aaacctatca cgcaaagcca aagtcgcgcc atctcgcaac aatgggaggc 7020 agaaggaacc accgcgcttg taacactaca cgccacactt gccctaccca actgcagcgc 7080 acaacaaatc atcgactcca ccgtagccgc aatctctcac cacaccaact ccacggcgtc 7140 gacacgcacc gcccatgaca tcataaccgg cgcacgcgct caactctccg ccatcgacct 7200 caacgccgcc tctaccgctc tatatggagc gcttgccgtc ggggcaaaac tcatgctgca 7260 aacgtgcccc cagaaaccac ctcggacctc cagagcgaag ccctcccgaa cagaagcaaa 7320 gaaactcaag cgcctgatag cagtcgaaaa acaaaaaaag caacagtacc tcgccctcgt 7380 gcgggcggct caactgcagg aacccgcaat cccgcctcca acggccccca atccctcaca 7440 acccgagccc cccgcgacgc ccgcgcccgc cactgccgcc cggcaggagt ggcaagacgc 7500 actcaaaaca cgccgcacca cacagaaaga aaccagcgcc aaggccgcta gcaaatggct 7560 gactaagatc tccaaactct ttgcaacctc acaaagcaag gcagccaaaa ttatgctcgg 7620 aaaatcaggg ccccggcaaa acctcacggc tgtgcggcac cccgcaaccc aacacatagc 7680 cacaaccagt tctgacatca acgacgcgct ccacgcccac ttccaggcag ccgcggaacc 7740 caccggaggc caacgcaccg gccgctacag acccgaggac aaggacacaa ctttcccgtg 7800 gaacgaaccc ggagcgcccg acggcggctt cctattggcc cgcacagcac acccagacgc 7860 tcagtccctg ctgcccaacg tcgccgactc cagcaacttc cagaggtgca tgcaggccct 7920 ctcccgcaat aaaagccccg gcccggacgg catccctaac gagctactga agtgcatgcc 7980 agaggcgtta caatccgctg tccataccct catgtgcgtc gcgtggatta aggccgaact 8040 ccctgccggc tggtccacaa gcgacaccat tctcctgtac aaaaaggcag acccgctctc 8100 catctctaac tacaggccta ttgcgctgtg caacacactg tacaagctgt ataccagctt 8160 agtcacccgg gcggcaactg aagcgtgcac agatcttctc atccacagcg aagcacaaga 8220 gggctttcag aagcacaagg gaacagaacg ccaaatcatg aacttactgc atgtaatcga 8280 agacgcagcc ttaacaaaac gagacctata tgtcgcatac attgatttct cgaacgcctt 8340 cggtaccatt gaccacgacc gtttgcttac aataatgcac cacctgggcc taccaaccga 8400 ctttattgaa gtcgtacgca atctgtactc ctccgcaaac accacaatcc gcacaacaca 8460 cggccgcacc aaccctgttc acatacggcg cggcactatg cagggcgaca ccctatcgcc 8520 cctactgttc attatcttcc ttgaaccgct gctgcggtgg ctccactacg ggggacgagg 8580 ctaccggtac ggctgtctca ccaacagcga aaacgacgaa cataactgct cgtccctagg 8640 ctacgcggac gacctgactg tagctgccaa ctcccgagac aaccttaggt gccaaactga 8700 caagatcgcc cggtacgctg cctggggcgg catgcgcatc aaccgcgaca agagcgcgtg 8760 cactggcata ctacaccact atgcaagcgc caacagtacg gacaagagca ttaagggccc 8820 cgcctgtgac cgcactctca aacgcctcct cgcgggccac atccatgtcg gcacatgcac 8880 caaagacggc aagcgatccg ccatcaccct cccttacatc cccccacaca agccctaccg 8940 atacctcggt gttgaaatca ctatgacaca ccctcaactg aaagcggtca aagccgacct 9000 aaccgacgcc ggcgtggagc tgagcaggta ccgaaaactc ccacgccaat gctacatgct 9060 catcaaacag tgcctgaaag ccaaagtggc ctactcctta ccaaccatgg cctacacgca 9120 gccagacatt cgggccttgg acgcccaaat cgcaggcatc gctcgaagat gcgtcggtct 9180 accacgcgat tgcgccaccc gcgccatcct actacccgcc gagctaggcg gcctcgggat 9240 cggctcactc cgggaagaat acgtgcagag agcctcgagg ctacttgtaa tctcgctcaa 9300 cgacaccggc cggcttggtg tcgtgacacg ccgcctgctc cacctacagg ccgcgcacgc 9360 aggaagtgac ctcccactgc acgacatgaa aagggcgggt caattttttg tcacccttaa 9420 acaactcaca cttctacagg accacgacct acttgccatg ctgcaaggca aagaattcac 9480 tgccgcccta ccagcccttg tctctaaact acgggccgct gaagccacca cgcccgccta 9540 ccttcgcgcc gactggatca gccccttgac catcgacctt ggactgaacc tggcggacct 9600 ggtggacgca aacacagggc ggcacattat tgacaccacg cagcttgacc ggcggttccc 9660 aaaaggcagg gtgaaagaca agcacaagag ggccttaaac cgatttaccg tcgccctatg 9720 taaaccggca gaagcacccc ccaaatctat gggccacaaa cccctcccca tgcaaagccg 9780 caccctgccc gccggcgtct ccaaggccga cctcaattca gctggagacc tacctaaacc 9840 cataccccaa cgaactctca cctccctcct ggcccgctgg caagaggcac taccgagccc 9900 cgcggctcca ccaaagcctt ccctagaagt cctgcaggca aaaccgaagc aaacggcaaa 9960 gcaacgtcat tctcacttca tcaaacacga cacacgcaca agaaaccccg aggtcgaagc 10020 caatgacgac gccctacccg gctctacggt tcatgacccc tccgacccca caaaggacct 10080 agctttctac atgagtgccg cgagggagtt cgccccactc gctcaacaat gcaaaaccct 10140 agggaccacg cgatggacgg ggacctggag gtccccaatc aggcaaatgc cagcccgcgc 10200 actaatcacc ggcccacaca cccacacagt ccacgcgcta ctacataccc ggccatacca 10260 gcaacctgct gctcccacgt accagcggct gtacctaccc gatggctgcc cagccacaac 10320 cccaccgccc gcgcccgcac actcgcataa ccggcgtaaa cgcggctccc agcccacccg 10380 ctgcccatac actcaccatg ctattcagcc gcgggcccct acacgggcag ccgcacccag 10440 gagagacagc gcagccgcgc agccaagctc ccgccctgcg tggcgcaaag tcagagacaa 10500 tgcgctgaat ggccgcctac cccagtccat cgctggggta caggcagcac agcacatcac 10560 cgcccatcta actaaccgcg acgacattat cgcagccgtc gtcagcggac ccatgaccca 10620 caaaactaac gccccgccgc ccaagcgaaa acgaaagcag ggcccaccgc ccgaggagcc 10680 cccctgggca gaacgctgtc gctatagggt tcggtggcaa cccaccctga ccacaaacgg 10740 catcctccga gcctacgagc tacagaacta caacgccact agcacaacca aacccccgtg 10800 ccgactatgg gggcaaggca tccggcgctt catggtccaa gtgcactggg ctgaaacgga 10860 agaccccctc tcctcactct ccgaaaaccc cgggtgggcg gccgccctgg cccgctacac 10920 tcttgcccaa gccgtagacg cgccccctga ccctaccccg cctcccgccc ttgacaatca 10980 cctcaccagt aagcagcgcc aggggagatg gcgagaccca gcacccagcg aagccgcacg 11040 caaacgtgcg atcctccggt gcagcaccat cgacgtcaaa tcctgtaacc cggacacaga 11100 catccacccg acaggtcaat actgtattca aatgggcaca cgtgagccgg gcagcgccac 11160 aggcacgccg tccggccccc cccaagaccc cgaccccccc gccagggtct acggcccgga 11220 cggacgctgc ataggaaccg ttagcgctgc ccgacttgcc ctactccacg cccggtacag 11280 gacccttaac cccactcacc ccccggaaag gttcgctgcc gccctggcct ccctgattat 11340 acggtaccgc cccagcaaac gccacaagga gagttccacc aacccagcgc acgaggtcac 11400 cctctgcccc gacctgatgt ccacgctcac gtcccatatc aagttccaga cggaggcatt 11460 cgcatctccc ctcaacgccc actgcgacat cccagcatac tacactaccc acccgcaaga 11520 ttcagagttc ggggccgccc acgactgcta cagccgcacg ctagatggaa tctccttcgt 11580 tcacccaccc ccacaacccg agcacaccag cctggccatg gcgtgggcgg tcgccagcgc 11640 cgaggcggca acaccgcaca caccgcacct gtccgtcgtg gtggtcccgt atggcgacac 11700 cagcggctac acagcctggc tccagcaccc actggtccac gacctggggc acatcaaacc 11760 ggcgcacacg tgggccgcca tggcctccgg cctaggccag ccgcaggagc tccccgctca 11820 ccgacgtgga ctccacctcc tgtgcgtgac aaacgaaagg ggccgcctcc tcctcaatca 11880 ggccctggag agtgggcttg cactcgccct agcggacgcg ctcgggcaca ccccgcacct 11940 agctacccgt aaacccccca cggacccaga cacacccaac ccgtggcccc caaccccccg 12000 gaagttaaaa cggcgcatcc tcgcccagag cagccagacg ccccagcccc agccccctcg 12060 cgatgccccc accctaccat cccctacgga gcttgcaacc ctctaccgcg cttatgtccc 12120 gctcgcacac acacgaggct ccgcggtctt taccgacggg agcaaagtac cacacacacg 12180 agagtacacg ggtctgggag cggcggtctg ggacgccagc gacaaccgcg ccacgcacat 12240 tgatgaagac atcgccttca acacacccct gagcaccatc cttcggtgtg aatcatccgc 12300 catacgcacg gcgctccgcc tccatcgcga cacacctcac ctacgggtct acaccgacag 12360 tcttgtcagt ctgttcatcc tccgtcgcgc cattatggag cccgagtcac tccacgtaag 12420 caagcaccgc gagatgtttg ggaccatagt cgaccaactt gcggaacgtg ccctagctgg 12480 acacaagaca tcattttatc acgtacgttc gcaccagaca gacacatgcg taggaaatca 12540 gaaagcggat gaagcggcac aggctgtggc caccagaaca gtgcagccgg atgcgacaga 12600 aactctatac tcgcacccat atgaaccctt agagggatgg attgcccaga aggaaattgt 12660 ggagcacaat ggcaatttgg aagaacggtt gcgctacgcc aacgacctca atcgcgacct 12720 gaagaaaatg gcccgtaaat ccaccgcgac cggttatacc aaatcaggac tgtacgcgga 12780 actatgggag gcagcgagga agcacatcga cgaacgctac ggcctaaacc tgttcgacaa 12840 gttctcattt gaagaggccc gctgcgcgtt caaataccgc tggggcctct cctggaatgc 12900 caagcgggcc gcaagagtaa aacaccccat gcccggctgc gcaggcggcg aatgccccct 12960 gtgccgcggc gacgacggtg gcacgcacac cctgggcggc tgcacccacc cacgcatgaa 13020 agcagcctac attgcgcgtc acaaccgcgc tgtccagcgg atagcgaaag caattagtca 13080 gggccactat ggaggctgcg ttatgtacat ggacgccggt aaggatgccc ccacttacag 13140 cgggggaacc cgcatccctg cgtggatgct ccccagcacg ccagagagcg tgcgcaagaa 13200 gctccgtcca gacctgctgg tcattcccac actccggatg cacaggtccg gactgccagc 13260 ccgtgcaccc cgcacacgcc gacaaaggag cgctcataag gtctacgtca tagaagtcgg 13320 atactgctcg gacctaaacc acccggacaa acgcgcagag aaactgcaac aacacgaaga 13380 cctcgtggcc gctctccgcg acgcaggctg ggacgtgatg tacagcccag acactgtaat 13440 tagccttggc catactggca cggtcacacc ccaccttcat gccctcctca aaacactcgg 13500 atgccccaca cagtccgcac acagcacatg ccgtaaacta gtgcagcacg ccgtccacac 13560 cacaaccgcc atcacagccc tccgccgcga aatctacagc cacaaatggc cagggagacc 13620 accataaaac cgtatgaccc gctgtgggaa aggggccagg cccagtacgc ctaggcgcac 13680 tggcaacctc ccctctctca cttgggggac aggggctcag cggcggagac catctatgcg 13740 gtggtctctc acgccctcac acccctacct ccgcagaaca cgccctgtca tgcagggagt 13800 acctattatt attattatta tt 13822 // ID MSAT-3B_CR repbase; DNA; CHL; 92 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE MSAT-3B_CR is a minisatellite unit - a consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-3B_CR; MSAT-3_CR; KW Interspersed repeat; minisatellite; nonautonomous; tandems. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-92 RA Kapitonov V.V. and Jurka J.; RT "MSAT-3_CR, a family of minisatellites from the green algae RT genome."; RL Repbase Reports 5(3), 59-59 (2005). XX DR [1] (Consensus) XX CC The MSAT-3B_CR is a subfamily of MSAT-3_CR minisatellite. XX SQ Sequence 92 BP; 18 A; 31 C; 29 G; 12 T; 2 other; ggctggatac ggggtaaagc tctcaggcac acgcaccgcc gctcacgccc aacgccktat 60 cagggamagc cgggggaacg ccttgtgccc tg 92 // ID DNA-8-5_CR repbase; DNA; CHL; 2527 BP. XX AC . XX DT 29-MAR-2006 (Rel. 11.03, Created) DT 29-MAR-2006 (Rel. 11.03, Last updated, Version 1) XX DE DNA-8-5_CR is a family of P nonautonomous DNA transposons - a DE consensus sequence. XX KW P; DNA transposon; Transposable Element; Nonautonomous; P-2_CR; KW DNA-8-5_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-2527 RA Kapitonov V.V. and Jurka J.; RT "DNA-8-5_CR, a family of P nonautonomous DNA transposons in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(3), 113-113 (2006). XX DR [1] (Consensus) XX CC This is a consensus sequence of a DNA-8-5_CR family of CC nonautonomous P DNA transposons. DNA-8-5_CR elements are less CC than 2% divergent from their consensus sequence and they are CC characterized by 151-bp terminal inverted repeats and 8-bp target CC site duplications. DNA-8-5_CR is a nonautonomous derivate of the CC P-2_CR transposon. XX SQ Sequence 2527 BP; 526 A; 678 C; 802 G; 521 T; 0 other; caggggtgcg gggtggccaa ctcggcccat gtgatcggca acggccggca accgccccca 60 attcccagtt gcgaccaggc cccggcccga tttctttgca tagaccaata aggggcacct 120 catagaagac ccctggcgcg ccgggcgact agtttactgg cctttgaccg acgtttgaaa 180 ttcaccaagc aattctgcag cctttccctt tttggagcgc ccggcgcgcc aggagccctc 240 catgtaatca tatagactag atggagccat tacctggtcg cgggcggggc ctaacactct 300 gcttggcgct actgctgccg gtgagacgcc ggcctcatgc taagttgccg ggataacgga 360 aatgtcctac tggggcctgc ataagtagga tgggctgcgt ctcataacaa tatgtggtgt 420 caacgcgaca aatcaaatgc gcattcgggt caggacacac acggaagctt gcacaacccc 480 cgaataggca tctcagatac atatggtagc gtatgagcca ggcacgcaac atgcagcggc 540 gccaacgcgt atacaactct gtggcgtaga atttcaatgc gcgggatcag ctggggctgc 600 agcccgcggc tgccaggtgc gccatatctg ctgacaaggc taagaatgag aggtcccagg 660 ggtcagggtc ttcgatacta ggatattggg agacatgcac gatgcatgtg tgcatttgcg 720 tgcaggggaa gccctttttc cgttacactg tccaaatagg catctaccat attgtagcaa 780 gtgttaagtt gtattgtcct gtccggagcg cgggcgagtg ctatgaaaga ctgtggcgta 840 ggtgcggggt gctcgttcgc gcctggtttc cggtgttcct tcaacgtttg tttggcctcc 900 tgccccccct ctacaggtcc tggtgtgcag tatgtcgaag catggtggca agaagcacaa 960 ttggggccgc ggcaggggca ggggcagggg acagcccgct gccaacaagg acgcggcggc 1020 accggcacca gcggccagcg tgggccgagg gcggggccgt ggacggggcg ggggccggga 1080 tggaacgggc cgacaggaca cgcaaaccca tggatgttga cggacctggc gacgtgctca 1140 tggcggcaac gccggagccg gagcccgagc agctgcgcac gcaacctaag caattgtgtg 1200 acgcgggatg tgacgcagga gccgcaatgt ggtatgcgca catacatcac gaataagtca 1260 catagtggtg cactgtacgc gatatgatgt cgactagtac ttggtcgggt ctggcatgaa 1320 tcaaccgctg cagtgagtgc atgacaagca ggggtggcgc acgtctttat ggcgggggcg 1380 cgcacgtgat tttgacgtgg tttcttttac gcagctgtac atgtctgctg ccatttttgc 1440 acgcggcgta ctagtggccc ctctgtacat ctggaatgca accgtgaact ggtgatgtgg 1500 gctgtgtccg ccgtagcctg accatgtatg tacatgtacg cgctgattgt gccccggatt 1560 ctaatcaaag tgccatgcat gctttttcat tgcagtagta acctagtgca aatgacgcac 1620 gatgtggtct gaggagtgag tgaggcgtaa gggtgtgcca cgttcgaggt ggggaggcgc 1680 tggagggaat ggagaggtta cagggcaagc aatgcagatg tatgggagtg tgttagagca 1740 atgtgtgtgg atgcatgcaa agggcgcctg gccgcctgca acgagtgaaa agtactacga 1800 gcatggaatg cagcatgggc aaggcgccca ggcacccctg ctgcgcacag ctgcgcaggg 1860 cgcgcaagca acggacctgg tttggcacat gtgtttgggc gctcaactta gcaagccagc 1920 actgtgaatc cattatgtaa ccaacaagct ccagacgcat cagccgggtg cgcggggatg 1980 gcttcgtgcc ggttgcgggg aatgagaggg gcacagtgcc cagcggctaa acacagccgg 2040 cacagggggg cgcgaagttg gcgacgtccg gcaaggccgt cctgaaccgg ccccacttgt 2100 tttatgcgcc caaattgatt gttacatgcc aaatatcaaa gcggcgtcgg cggctgctat 2160 tgcgcagtct tgcttttgaa ctccggcttg aaattgcacc tcgccggtcg catggaagcc 2220 cctccctgct tgtagtaggc ccccgaagtc ctggaacaaa gctgcttcag acctccagta 2280 cataacaggt ccataggctg ctatcgcgca gctctccctg cctccaagcc gaggttccaa 2340 tttgcgtcgg caagcatgtt ttcgaccacc gtcatatagt cgcccggcgc gccaggggtc 2400 ttctatgagg tgccccttat tggtctatgc aaagaaatcg ggccggggcc tggtcgcaac 2460 tgggaattgg gggcggttgc cggccgttgc cgatcacatg ggccgagttg gccaccccgc 2520 acccctg 2527 // ID L1-2_CR repbase; DNA; CHL; 5292 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE L1-2_CR is a L1-like non-LTR retrotransposon - a consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1 superfamily; KW L1-2_CR; endonuclease; reverse transcriptase. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 5228-5292 RA Day A. and Rochaix D.J.; RT "Characterization of transcribed dispersed repetitive DNAs in the RT nuclear genome of the green alga Chlamydomonas reinhardtii."; RL Curr. Genet 16(3), 165-176 (1989). XX RN [2] RP 1-5292 RA Kapitonov V.V. and Jurka J.; RT "L1-2_CR, a family of L1-like non-LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 4(2), 40-40 (2004). XX DR [2] (Consensus) XX CC Several hundred copies of L1-2_CR are present in the C. CC reinhardtii CC genome. They are ~98% identical to the consensus sequence and CC constitute CC ~0.5% of the genome. Since some elements are identical to each CC other, CC the genome harbors currently active L1-2_CR elements. CC The consensus is incomplete at the 5' end. The reconstructed CC portion CC encodes the 1439-aa L1-2_CRp protein (pos. 639-4955) composed of CC the endonuclease and reverse transcriptase domains. CC A subset of more than ten L1-2_CR elements is characterized by CC a remarkable target site specificity. They are inserted CC in different copies of the DNA-2-7_CR transposon at the same site CC (position 593 in the DNA-2-7_CR consensus sequence; usually the CC 593-601 region of DNA-2-7_CR is duplicated around the L1-2_CR CC inserts). CC The DNA-2-7_CR sequences that harbor the L1-2_CR elements are CC less CC identical to each other than the L1-2_CR sequences. Therefore, CC the L1-2_CR and DNA-2-7 elements cannot be different parts of the CC same CC composite transposon. XX FH Key Location/Qualifiers FT CDS 639..4955 FT /product="L1-2_CRp" FT /translation="MRRPKQAAPGPRPLRVGSHNVRGLRKRVPHLGCSRLH FT ALLSCWERQRLDVVCLQETHVLDSDIDLVTADLAAAAQRLHAREWTAWWAP FT AQKPEGRRVTGGTAILIRSSLLQQGAMVLAGGAAAVSVAAGAWAGRGVSLQ FT LQWGGHTFTLTSAYFPLASTAQQAFIREWVGLRASGSGEHLLAADFNFVAD FT VALDTVTGRARSDGPAAAALAAACPGLIDVLRRRHPARRVCTFFHPHGASR FT LDRILCSGGLEPQVLECGVAAGVPSDHMLVTVALAASPEAAPPARSLPRAH FT LGFRDFKDLDRDYRAWLGTALAARPTDPVELLEWWPALKRAAATSANRLSR FT EAVTRRVAASQREAAALEAAAAAAAAVEAGGDVQVAAQAAVRARCTAAEAA FT VAAAAGAARRTRHAWLRGGEKPCPLLTRMLRPAGGPRVIAKLKQPDGTTTS FT DPTVMGQVMARYWRDVSAAPPPAPDARTQVLDALQQHGRRCTAEEADQLGR FT VAVSAAEVRAALAAAPAGRAAGPDGIPVELYQHYDSDFAPVLADVFTAVGE FT GAGVPDGLLDGVWSFFFKSGDPTEPSNYRPITLTDTDYRTLARVLCLRLQP FT VFGRVIDPEQTAFLTDRRIADNVLLLQLTPGLLKAAKKAPAVAAFLDFYKA FT YDTVDRSFLLACLERMGVGAGFLTWVSRLLTDTRGAALVNGRVSGWVPLMA FT GVRQGCPLAPLLYLAVAQALLSWLRSRGHGVLANLASVLASQYADDCTPFL FT EGFQAVPGFLADMDVFRRASGQRLNMTKVELMVMGTVGGAGALPAGMAGPP FT LPPGWRVVPAAKSLGVHYGDWGACSPPVTFEAVMGVLSKIARMPVSMFGRA FT AAASAYALGKVLYHLEFAGLPQLYVVDRLLARVAAVVDRRLSPAQFDANPH FT ARPAGLSIELMQLPPAVGGAGLLPLVPHVKARQAGLAVRCVLGVCGLLPYV FT PPWTRVAAAWLRQVHPAATPLRLLVRGAADRVLNKPLTPAGKPFARLLKAM FT TALPAVTLVAPPEPGAWVSHVPLWANPALCEDGRTWEVAFADLFALPGLAC FT VGQLVAAHDGLNELRQALTRPWAEGSRSGEACADMYVTAVWRRVLHWSTRA FT RLPSPLPGPASPQEAADRFAAAVALLPAGWAAAARAAQLARGPAAALPLPV FT ADVVSATAESVQRVVRGLGWLQCGGPPILLTAYTVKAGTVLQMAPQLAALK FT AKHLQYVCDAGVSGVGAALAAGAFVCTLARLWSLKWENHHKEALWRVAANA FT CWAFPRHASERARGVTVDPCWACGVDMQDGDRRHWFWDCTVALSLRESMGM FT AMGFLPEEALSAFSREELWLVRPPAGLAPPVWDVVCLAAMSALDFGRQRVV FT MAGLAARAKLPSARVLSIGLAVVADFWGRLQTFVTLGIRPKGWDAVPFAHP FT FISRAVGDGMVLRLPYDADSPPPSP" XX SQ Sequence 5292 BP; 654 A; 1572 C; 2036 G; 1030 T; 0 other; cccttcttgt aatgtcgcct tcataacttt gacgatgagg tgatgccgga ccggacgccg 60 actgcgctgg acgctgcggc agctgcggcc cggcgtgcgg cccccaagcg gagccgggaa 120 caggcgcagg cggcggccgc ggcgggcgag gcggcgctgg cgctggccat ccgggagagc 180 aaggcggtca aacaggcctt ggcgacggcg gcggcggtgc gagccgccgc gacagccgcg 240 gcggcggcgg cagcagcggc cggggccacg ggcaccccga cggcggccga ggccggggcg 300 gccgcggcgg tggtcgcgac agcaggcgtg agtggcacga tggccgccgt agaggccacc 360 gctgcggcgg gcgtggagcc ggcctcgccg ccggccgcgc cggtgtcaac ggcgggcttg 420 gggggcatcg tgggcccggc agtcctgccc gcggggtgga gcagcgacgg tggtgtgggt 480 gatgagggcc tggcgggcgg cgtcgggcag gggctgctgg gggatgcgga gacagcgccg 540 gatgccaacg cgaagctgct gtgtcagggc cggcggggcg gtcctggcga cttcatggac 600 gttagctcgg ttgccccgcc cgggggcggg cagcagtaat gcggcggccg aagcaggccg 660 ctccggggcc ccggcccttg cgggtggggt cgcataacgt gcgggggctg cggaagcggg 720 tgccccacct ggggtgctcg cgcctgcacg ccctgctgag ctgctgggaa cggcagcgcc 780 tggatgtggt ctgtctccaa gagacgcacg ttcttgactc ggacatcgac cttgtcacgg 840 cggacctggc ggcagccgct cagcggctgc atgcacgcga gtggactgcg tggtgggcgc 900 cggcccagaa gccggagggc cgtcgtgtta ctggcggcac ggcaatcctc attcggtcga 960 gccttctgca gcaaggcgcc atggtgctcg ctggcggcgc agctgccgtg tcggtggcgg 1020 ctggcgcgtg ggcgggccgc ggtgttagtt tgcagttgca gtggggtggc cacaccttca 1080 cgctgacgtc agcctacttt cccctggcat ccacagcgca gcaggctttc attcgtgagt 1140 gggtgggcct gcgggcgtcg ggctcgggcg agcacctcct agcagctgac ttcaactttg 1200 tggcagatgt ggcgctagac acagtcactg gccgggcgcg cagcgacgga cccgcagccg 1260 ccgcgctggc ggcggcgtgc cctggcctca ttgacgtgtt acggcggcgc cacccggctc 1320 gccgcgtgtg taccttcttc cacccccacg gggcgagtcg gctggaccgc atcctgtgca 1380 gcggcggcct cgaaccgcag gtcctggagt gcggcgtcgc cgccggggtg ccctccgatc 1440 acatgcttgt gacggttgcc cttgcggcgt cgcctgaagc cgcgccaccc gctcggagcc 1500 tgcccagggc ccacctcggt ttccgggact tcaaagactt ggatcgtgac taccgtgcct 1560 ggctgggtac ggccctggcg gcacgcccta ctgatcctgt tgagctgctg gagtggtggc 1620 cggcccttaa acgggctgcc gccacgtccg ccaaccgcct gagccgggaa gctgttacga 1680 ggcgtgtggc cgccagccag cgggaggcgg ccgcgcttga ggcggcggca gcagccgccg 1740 ccgcggtgga ggctggcggt gatgtgcagg tggcggcgca ggcggccgtg cgtgcgcgat 1800 gcacggcggc ggaggccgcc gtggccgcgg ccgccggcgc ggcgcgacgt acgcgccacg 1860 cgtggctgcg tggcggcgag aagccctgcc ctctcctcac gcgcatgctg cggccggcgg 1920 gtgggccgcg cgtgattgcc aagttgaaac agcccgatgg caccactaca tccgacccca 1980 cggtcatggg gcaggtgatg gcgcggtact ggagggacgt cagtgccgcg ccgccgcccg 2040 ctccagacgc ccgcacgcaa gtcctggatg cgctgcaaca gcacgggcgc cggtgtacag 2100 ccgaagaggc ggatcagctg gggcgggttg cggtctcggc ggcggaggtg cgcgcggcgc 2160 tagccgctgc gccggctggg cgcgcggcgg gccctgacgg cataccggtg gagttatacc 2220 aacactatga cagtgatttt gcgccggtcc tagcggatgt gtttacggcg gtgggggagg 2280 gggctggggt gcctgacggg ctcctggatg gcgtgtggtc tttcttcttt aagtcgggcg 2340 accctaccga gccctctaat tatcgcccca tcacgctgac ggataccgat taccgcaccc 2400 ttgcccgggt gttgtgcttg cgactgcaac cggtgtttgg gcgtgtcata gaccctgagc 2460 aaactgcgtt tctgacggac cgtcgcattg ccgataacgt cctcttgctc caacttactc 2520 ctggcctgtt gaaagccgcc aaaaaggccc cggctgtggc ggccttcctc gacttttaca 2580 aggcgtatga tacggtggac cgctcttttc tgctcgcgtg tctggagcgc atgggcgtgg 2640 gggcgggctt tcttacgtgg gtgtcccgcc tgttaactga tacacgtggg gcggcgttgg 2700 tgaatggacg tgtgtcagga tgggtgccgt tgatggcggg cgtgcggcag ggctgcccgt 2760 tggctccgtt gctctacctc gctgtggcac aagcccttct gtcatggcta cgcagccggg 2820 ggcatggtgt gctcgctaac cttgcgtcgg tgctggcgtc ccagtatgcg gatgattgca 2880 cgccgtttct ggaggggttt caggcggtgc cggggttcct ggcggacatg gatgtgtttc 2940 ggcgggcgtc tggacaacgg ttaaatatga ctaaagttga actgatggtg atggggacgg 3000 tcggcggtgc tggcgccctg ccggccggca tggctggccc gcccctgccc cctggctggc 3060 gtgtggtgcc ggcggccaag tcactagggg tgcattacgg ggattggggc gcatgctccc 3120 caccagttac gtttgaggcc gtgatggggg tgctgagcaa aatcgcgcgc atgcccgtgt 3180 ccatgtttgg tcgggcggcc gcggcgtcgg cttatgcttt gggcaaggtc ctgtatcatt 3240 tggaatttgc gggtcttccc caactgtacg ttgttgaccg gctgctagcc agggtagcgg 3300 cggtggtgga tcggcgactg tcgccggccc agtttgatgc caacccccat gcacgcccgg 3360 ctggcctctc cattgaactt atgcaactgc ctccggcggt ggggggggcg ggcctgttgc 3420 cgctggtgcc gcatgtgaaa gcgcggcaag ccggtctcgc ggtccggtgt gtgttgggtg 3480 tgtgtggtct gctgccgtac gtgcccccgt ggacccgggt cgcggcggcg tggctgcgcc 3540 aggtgcaccc ggcggcaacg cccctgcgtt tgttggtgcg gggggccgcg gaccgtgtgc 3600 tgaataagcc gctcaccccg gcgggcaagc cgtttgcgcg cctgctgaaa gccatgactg 3660 cattgcccgc ggtgactctt gtggcgccgc ccgagccggg cgcgtgggtg agtcatgtgc 3720 cgttgtgggc taacccggcg ctgtgtgagg acggccgcac gtgggaagtg gctttcgcgg 3780 atttgttcgc tctgccggga ctggcgtgtg tgggccagct ggttgcagcg catgacggcc 3840 tgaacgaact gcggcaggca ttgacacgcc cttgggcgga gggctcccgg tctggtgaag 3900 cgtgtgcgga tatgtatgtg actgctgtgt ggcggcgggt gctgcactgg agcacgcggg 3960 cgcgcctgcc gtcgcccctg ccggggccgg cgtcgcctca agaggcggcc gaccggtttg 4020 cggcggcggt ggcccttctt cccgctggct gggctgccgc cgcgcgtgcg gcccagctgg 4080 ctcgggggcc tgccgccgcg ctgccgctcc cggtggccga cgtcgtgagt gctacggcgg 4140 agtcggtgca acgggtggtg cggggacttg gatggttgca gtgcgggggg ccgcccatcc 4200 tgctcactgc ctacacggtg aaggcgggga ccgtgctaca aatggctcca cagttagccg 4260 ccctgaaggc taagcatttg cagtatgtgt gcgatgcggg tgtgtcgggg gtgggggctg 4320 ccctggcggc gggggccttc gtgtgcacct tggcccgctt gtggtcgctg aaatgggaga 4380 atcatcataa ggaagcactg tggcgtgtgg ccgccaacgc ttgctgggcg tttccacggc 4440 atgctagcga gcgggcgcga ggcgttactg ttgacccgtg ttgggcgtgt ggggtggaca 4500 tgcaggatgg ggaccggcgt cactggtttt gggactgcac tgttgcgctg tcactgcggg 4560 aaagtatggg gatggctatg ggtttcctgc cggaggaggc tctaagtgcc ttctctcgtg 4620 aggagttgtg gttagtgcgc ccgcctgcgg ggcttgcgcc acctgtgtgg gatgtggtgt 4680 gtctcgctgc tatgtctgcc ctggactttg gtcggcagcg tgtggttatg gccgggttgg 4740 cggcgcgagc gaagctgccg tcggcccggg tgctgagcat tggacttgcc gtcgtagctg 4800 acttctgggg tcgtctccag acgtttgtga ctttgggtat caggccaaag ggttgggacg 4860 ctgtgccgtt tgcgcatccc ttcatatctc gggctgttgg tgacggcatg gttttgcgct 4920 tgccgtatga cgctgattcc ccgcccccct cgccgtgaga ggttgtgcgt ggttggtggc 4980 tctgtgtgtg ggcaggatgc acgtgtgggc gaccgttcta gtacctggac gcttgcgccc 5040 tggttagtgc gtgcgctgtg cgtgtgtcct gcggcctaga cgtttcccgg cactcctgtg 5100 cctaggcttg tggcgttgag gcacagcggt ggggctctgg cgggttgagg cttggctagg 5160 actcactgtg cgcggagtca cacggacttt gcccgcggga gggtttgcag caaagtcggg 5220 gttgggccct gatagcatgt gatggctcgg cgtcttcggg agctgggact gtcccctctg 5280 taacatccgc at 5292 // ID Copia2-I_CR repbase; DNA; CHL; 5509 BP. XX AC . XX DT 13-MAY-2004 (Rel. 9.04, Created) DT 13-MAY-2004 (Rel. 9.04, Last updated, Version 1) XX DE Internal portion of the Copia2_CR retrotransposon. XX KW Copia; LTR Retrotransposon; Transposable Element; 5-bp TSDs; KW Copia2-I_CR; Copia2-LTR_CR; COPIA superfamily; endonuclease; KW protease; reverse transcriptase. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-5509 RA Kapitonov V.V. and Jurka J.; RT "Copia2_CR, a family of copia LTR retrotransposons present in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(4), 104-104 (2004). XX DR [1] (Consensus) XX CC Copia2-I_CR is an internal portion of the Copia2-CR LTR CC retrotransposon, whose LTR is deposited in Repbase as CC Copia2-LTR_CR. Copia2-I encodes a 1591 Copia2_CRp polyprotein CC (pos. 63-4835) composed of the C2HC zinc finger, protease, CC endonuclease, reverse transcriptase, RNase H. The PBS (pos. 1-14) CC is complementary to an internal portion of tRNA-Met. XX FH Key Location/Qualifiers FT CDS 63..4835 FT /product="Copia2_CRp" FT /translation="MSDTSGKPQTILAEDATNFVSWRRDLSYFLQTKRLHL FT YLQTPIPPQRALYALEEDQQALGHLGMSCSEGIWDLISGARTTVDAIGIIE FT GEYRRASDGKLVALDRDRSSFAMLVGERLPAYFQRARRLRDDIRAAGGNWP FT ESDMRLRLLMGLRSEYHPAKSTAVGMADRSLLALQRMLEDWESLAPQPGGL FT GEAGYALTARPSGLGSGGSSSNAVGVAARGLPAAGPWGHSRGGGSAGYQGG FT SSSGGSNEQCWYCFVQGHQRRNCNVRVLDEKAGIFRANIRVPPPPGKGQPG FT GGGSGGSKGPGGGKGRGRGGKGGGSGKPFTGRGTVLAVQDGGEGAAEGSSD FT DDPNCNCADASGDMFGGSSCSVANDGTHHWGVGQAHDSGSSSSTAGYSHAH FT AGGGSSSSSVSAGLGCAGGCERVGCCGVSGDSTLHLPNSGGSSSVVPGRAD FT GCSAAGVRMPLLPGYSRTDDDGDGSSNSSSRSGGSAACCFTGFPGLGFRRL FT FKSRYSRDSRGSSCSSSSSAEPASAGGCDVAEDSNGGACYSHTLPPNTSSS FT SSSSSSRSCRTGFCAPTADGAGDGAATTYQLLVDSGAFRHITAHRHLLSNY FT TPSRVKMVRWGDGHIGAVQGEGMMSMLSGGQLLHVHNVLHVPTSELSLLSV FT IALTSNGSKVVFEGDGVQVVRGRRLLLQGRNRNGQYILSNLRLLSTSTEVP FT IMAPATASTAESSGSSTSNGTSTAAPAGGGGGNGRFAAPTGGGDNSSGAAA FT PTGGGETDSSAATQEVAGSSSSSRSSSGLSAQVQAALRLHRRLGHLGFSSI FT KRMVSQGMVTGADVSLSAVRDAAELVCSKCMEAKAASAAFPRSSSGATTVP FT LQLAHSDVCGPFPHQGRGGAQYYITLLDDATGYGLVRILQQRGQAYTALQE FT MVAEMESLYPGGSRLRTLRSDNGGEYVSLQLEQWLRSRGTVHQYSAPYTPQ FT QNGAAERVNRTLQDRCRALLLEADLPNGLWPDAVSYAAYIRNLSPSAGRNV FT TPYEAFTGKKPDVSSLRTFGCRVFVMVPAEQRDKLGARANVGTYLGPERNT FT AASRVKVGNGMVVSRDLKWDEAIRGPASRRAGTQFGDGVGDSSSNDAAHAA FT AAPAPGVPAQQTQMGPPPHPSPPSRPATRSQHPNLLPPINPVTAAASAVAG FT GGSSSSTATAAAAATESDDEEVPPLAPMSDDEDYEPAGCSSATAAVAEPFP FT GEPQSYEQAQRARDADKWLEAMQEEMASQRENGTWVVEPLPPGTRVLANRW FT VFKLKPGDETQPPRYKARLVAKGFQQREGIDYDEVFAPTTCSVSLRTLLAL FT AAEQGLLLEQLDVKTAFLNGQLDEELWMQLPQGFEEETASSGNGGAGGDGT FT EMASNGSGGSSCSARAAQECAGCSKNSSTSVSGNEGGRGGEPIMACRLLKS FT IYGLKQAPRCWYIKLCEQMGGLGFTPSAADAALFVRHDAEGPVYVLVHVDD FT MLIAAGSAQQMATVKAAINGCFKVRELGEARVYLGMRIIRRPDGKEIKLSQ FT RAYIEDLLQRHGMVDAKPRSLPLPPGTRVLAAALKDLVLSDSKQYSALVGE FT MNYLATNTRPDIAFALSLLPVGHA" XX SQ Sequence 5509 BP; 1047 A; 1626 C; 1945 G; 891 T; 0 other; ggttatgagc cccgcgtggc cctaaaaccg cgcgctttta ggctatctag gtgttctcta 60 atatgtcgga cacaagcggc aagccgcaga cgattctggc agaggacgcc acgaactttg 120 tcagctggcg ccgcgacctg tcttatttcc tccagaccaa acgtctgcac ctatacctac 180 aaactcctat ccctcctcag cgggcactct atgcgctaga agaggaccag caggcccttg 240 ggcatctcgg gatgtcatgc tcagagggga tttgggacct gatttccggg gcccggacca 300 ccgttgacgc tatcggcatc atcgagggag agtaccggcg cgcgtcggat ggcaagctgg 360 ttgcgctgga tcgtgaccgc agcagttttg ccatgctcgt cggcgagcgg ctgccggcct 420 acttccaacg cgcgcggcgg ctgcgggatg acatccgcgc tgccggcggc aactggccgg 480 agtccgatat gcgcctgcgt ctgctgatgg gcctgcgctc ggagtatcac ccggcgaaat 540 caacggcggt cggcatggct gaccgctcgc tgcttgcact gcagcggatg ctggaggatt 600 gggagtcttt ggcgccccag cccggtggac tgggtgaggc gggctacgca ctgacggcca 660 gaccctcggg tctcggcagc ggcggcagta gcagcaacgc agtaggtgtg gcggcacgcg 720 ggctgccagc ggccggaccc tggggtcaca gccgcggcgg tggaagcgcg ggctatcaag 780 gtggcagcag cagcggcggc agcaatgagc aatgctggta ctgcttcgtg caggggcacc 840 agcgccgcaa ctgcaacgtg agggtactgg atgagaaggc gggcattttc cgggccaaca 900 tccgcgtgcc gccaccgccg ggtaagggac agcctggcgg gggtggaagc gggggaagca 960 aggggcctgg aggcgggaag gggcgaggcc gcggaggcaa ggggggcggc agcgggaagc 1020 cgttcactgg ccgcggcaca gtgctggccg tgcaggatgg gggggagggg gctgccgagg 1080 gtagcagtga cgatgacccc aactgcaact gcgccgacgc gtccggggac atgtttgggg 1140 gcagtagctg cagcgtggcc aacgacggca cgcaccactg gggcgtgggg caggctcacg 1200 acagtggaag cagcagcagc accgccggct acagccacgc ccacgcgggc ggcggcagta 1260 gcagtagcag cgtcagcgcc gggctaggtt gcgccggtgg ctgtgagcgt gtcggctgct 1320 gcggcgtcag cggcgacagc acgctgcacc tcccgaacag cggtggcagt agcagcgttg 1380 tgcctggtag agctgacggc tgctcggcgg cgggagtccg catgccattg cttcccggct 1440 acagccgcac tgacgacgat ggtgacggca gtagcaacag cagtagcaga tccggtggga 1500 gtgccgcatg ctgcttcacg gggttccctg gtttgggatt ccgcaggctg ttcaagtccc 1560 gctacagccg cgacagtcgc ggcagtagct gtagtagcag tagcagcgcc gagcctgcga 1620 gtgccggcgg ctgcgatgtg gctgaggaca gtaatggagg tgcctgctac agccacacac 1680 tgccacccaa caccagcagc agcagcagca gcagcagcag ccgtagctgc cgcactggct 1740 tctgtgcccc tacggcggac ggcgccggcg atggcgccgc aaccacctat cagctgctgg 1800 tggactcagg ggcatttcga cacatcacgg cgcaccggca cctgctatcc aactacacgc 1860 cgtcacgcgt gaagatggta cggtggggag atgggcacat cggagcagtg cagggggagg 1920 ggatgatgag catgctgtca ggcgggcagc tgctacatgt tcacaacgtc ctgcatgtgc 1980 ctacgtcgga gctttcgctg ctgtcagtta ttgctctcac aagcaacggc agcaaggtgg 2040 tgtttgaggg agacggcgtg caggtggtca gagggcgtcg cttgctactg caagggcgca 2100 accgcaatgg gcagtacatc ctgtccaacc tgcgcctgtt atctacttca acggaggttc 2160 ccattatggc gccggctact gccagcaccg ctgaaagcag tggcagtagc accagcaacg 2220 gcaccagcac tgccgcacct gcgggtggcg gcggtgggaa tgggaggttc gccgcaccta 2280 cgggtggcgg cgacaacagt agcggtgctg ccgcgcctac gggcggcggc gaaaccgaca 2340 gcagtgctgc aacgcaggag gtcgccggta gtagcagcag tagcaggagt agcagtggcc 2400 tcagtgcaca ggtgcaggcg gcgctgcgac ttcaccgccg gcttggccat ctgggcttta 2460 gcagcattaa acggatggtg agccagggaa tggtcacagg ggcggatgta agcctcagcg 2520 cagtacggga cgccgccgag ctggtgtgca gcaagtgcat ggaggctaaa gcggcatcag 2580 cagcttttcc ccgcagcagc agcggcgcca caaccgtgcc gctgcagctc gcacactcgg 2640 acgtgtgcgg cccatttcct caccaagggc gcggtggagc gcagtactat atcactctgc 2700 tggatgatgc cactggctac ggcctagtgc ggattctgca gcaaaggggg caggcataca 2760 cagcgctgca ggagatggtg gcggagatgg agtcgcttta ccccggcggc agcaggctgc 2820 gcacgctgcg cagcgacaac ggcggcgagt acgtgtcttt gcagctagaa cagtggcttc 2880 gctcgcgcgg cacggtgcac cagtactctg caccatacac gccgcagcaa aacggagcag 2940 ctgagcgggt caaccgcacg ctgcaagacc gctgccgtgc gctgctactg gaggctgacc 3000 tgcccaacgg tctctggccg gacgcagtgt cttacgccgc ctatatccgc aacctgtcgc 3060 cgtcagcggg ccggaacgtg acgccgtacg aggcgtttac aggcaagaag cctgacgtct 3120 ccagcctgcg gactttcggc tgccgcgtct ttgtgatggt gccggcggag caacgcgaca 3180 agcttggcgc gcgtgctaac gtgggcacct accttggacc agagcgcaac acggcggcat 3240 cacgtgtcaa ggtgggcaac ggaatggtgg tcagccggga cctcaaatgg gatgaggcca 3300 tccggggtcc ggcgtctcgg cgggcgggaa cgcagttcgg ggacggcgtg ggagacagta 3360 gcagcaacga tgctgctcac gctgcggcag cgccagcacc tggggtgcca gcgcaacaga 3420 cccaaatggg cccacccccc cacccatcac cgcccagccg gcccgccacc cgcagccaac 3480 accccaacct cctgccgcca atcaatcctg tcaccgctgc agcatctgcg gtggcggggg 3540 gaggaagcag cagtagcacc gccacagctg cagcagcggc aacagagtca gatgatgagg 3600 aggtgccgcc gctagcgcct atgtcggatg atgaggatta cgagccggcg ggctgcagca 3660 gtgctacagc ggcagtagca gaaccgtttc cgggtgagcc gcagagctac gagcaggctc 3720 agcgcgcgcg tgatgcagac aagtggttgg aggcgatgca ggaggagatg gcgtcgcagc 3780 gcgagaacgg cacctgggtg gtagagccgc tgcctccggg cacgcgtgtg ctcgccaatc 3840 ggtgggtttt taagctcaag cccggcgatg agactcagcc gccgcgctac aaggcgcgcc 3900 tagtggccaa gggcttccag cagagggagg gtattgacta tgacgaggtc tttgctccca 3960 ccacgtgctc agtgtccctg cggacgctgc tggcgctggc ggcggagcaa ggcctgctgc 4020 tggaacagct agatgtcaag actgcgttcc taaatggaca gctggacgag gaactgtgga 4080 tgcagctgcc gcagggcttc gaggaggaaa ccgccagcag tggaaacggc ggtgctggag 4140 gagatggcac tgagatggcc agcaatggca gcggcggcag cagctgcagt gcccgagctg 4200 cacaggagtg tgccggttgc agcaagaaca gctcaaccag cgtcagcggc aacgaaggtg 4260 gtcgcggcgg cgagcccatc atggcctgcc gcctgcttaa atccatctac ggcctcaagc 4320 aggcgccgcg ctgctggtac atcaagctct gcgagcagat gggcggactg ggattcacgc 4380 cttctgccgc ggacgctgcg ctgtttgtgc ggcatgacgc agagggcccc gtttatgtgc 4440 tggtgcatgt agacgacatg ctcattgcgg ccggcagcgc gcagcaaatg gcgaccgtga 4500 aggccgccat caacgggtgc ttcaaggtgc gggagctggg ggaggctcgt gtgtatctag 4560 gaatgcgcat aatccggcgg ccggacggca aggagatcaa gctgtcccag cgcgcgtaca 4620 tcgaggacct gctgcagcgg catggcatgg tggatgccaa gccgcgcagc ctgccgctgc 4680 cgccgggcac gcgagtgctg gcggcggcgc taaaggacct ggtgctgtcg gacagcaagc 4740 agtacagcgc gctagtcgga gagatgaact atctggcaac caacacgcgg cctgacattg 4800 cgtttgccct aagcctgcta cctgtcgggc acgcgtgaga tggggctgtg cttcaggggc 4860 gagacagcgg ctgcaggcat tgccagtgga atgggaggtg gtcgcggcgg tggctctgac 4920 ggctgcagct tcgtcggcta ctcagatgcg gactgggctg gcgaccccgt aacgcgccgc 4980 tccaccacgg cctttgcgtt cctgtgcaac catgtggcga ttgcctggtg cagcaagcgt 5040 cagaagacgg tggcggcgtc cactgtggaa gcggagtatc aggccggtgc ggccgcggcg 5100 cgcgaggcgc tgtggctgcg taagctagct gctgacctgg acctggagga gggagcgatt 5160 ccgatcaaca ttgacaacca gggcgcgctc agcgtggcgc gcaaccccgt cacgtcgtcg 5220 cattccaagc acatcgacgt gcaccaccac ctactgcggg agcgggtggc gcgtggcgaa 5280 gtggcagtgc gatactgccc cacggagctg atggtggcgg acattctaac caaggcactg 5340 cccgccaaca agttctgctt ctgccgcgcg gcgatgggcg tggcataaaa gtagttgcat 5400 attcacatcc catgccgaac agaaggggct gcatgcctag gtgatgaggc tgtgctatct 5460 ttgcagtttg atagcaagtg tggctccggc agttcagccc gagggggag 5509 // ID LTR1_CR repbase; DNA; CHL; 491 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE LTR1_CR is a long terminal repeat of unknown LTR retrotransposon. XX KW LTR Retrotransposon; Transposable Element; 5-bp TSD; LTR1_CR; KW solo LTR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-491 RA Kapitonov V.V. and Jurka J.; RT "LTR1_CR, a long terminal repeat from the green algae genome."; RL Repbase Reports 4(5), 126-126 (2004). XX DR [1] (Consensus) XX CC The genome harbors about 15 copies of LTR1_CR; they are 95% CC identical to the consensus sequence. LTR1_CR is a solo LTR. CC The LTR retrotransposon it was derived from is not known yet. XX SQ Sequence 491 BP; 119 A; 131 C; 140 G; 101 T; 0 other; tgtgggcgga acacggaagt tctacgacac acaagtatat aaccctgcgg acggcgccct 60 ggggcatcta gatgtgtctt ttaccgactg tcctactaca tttggtatcg agttcagggc 120 ccaacgtgcg ttatctgcgg acaatgcgca cgcgaccaca taaggtcggg ggttcaaatc 180 ctggtgacgt cggtcaaagt ccatcacttc ggctatgcga gagtcgtaaa ggcaccaaca 240 aacagttggc tactggcgat gctcaggtag cgggtatacg cgacgtgggc ctcgtcaccc 300 cactgccaca gaaacacaaa cggttgccaa agaagtactc tctacgctag cgtggtgact 360 gcccgggacg tgcagtccag aagggggaga gtgtgggcgg aacacggaag ttctacgaca 420 cacaagtata taaccctgcg gacggcgccc tggggcatct agatgtgtct tttaccgact 480 gtcctactac a 491 // ID Gypsy-2-I_CR repbase; DNA; CHL; 2974 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE Gypsy-2-I_CR is an internal portion of Gypsy-2_CR LTR DE retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Nonautonomous; KW 4-bp TSDs; Gypsy superfamily; Gypsy-2-I_CR; Gypsy-2-LTR_CR; gag; KW nonautonomous LTR retrotransposon; protease. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-2974 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-2_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 4(2), 37-37 (2004). XX DR [1] (Consensus) XX CC The internal sequence encodes gag and protease. There are only CC a few copies of Gypsy-2_CR present in the genome. They are CC over 95% identical to each other. The internal sequence is CC incomplete at its 3' end. XX SQ Sequence 2974 BP; 735 A; 798 C; 789 G; 652 T; 0 other; tggggggagt tgcgggaata gtcctgagaa actctattgt cccctaccca ttttcttcac 60 tatctgaaca aaaaaaaatg gcgaatcaag tagactcgga cagtgaatcg tccggttctg 120 taggctctac ccgtgccacc accatttcac cgtcctctct cttcgcaatt ccgctcttca 180 ccccctcctc tgaggacccg agcaactgga tagacaccat cgaacgtatg gggcgtgtgt 240 ttggatggga tgatacgtta aagctattta tagcgaagtg ccgtcttagt catgatgccc 300 gcatctggga ctcaggcatg gaaactcagg gactcaggca tggaaacaaa catggaaaca 360 aacacgtggg atgcgttctg tcgggcgttt cttgcccgtt ttgcagtaaa agagacaact 420 ctttacgccc aactggcggc atgccgccag aaagcaaatg agtctgttcg tgagtactcg 480 gataggttcc gtcacctact tgcccggttg cgcattaagc ccgacacaat acaaatgtat 540 aactacctga ggggtttaca ttccagcata tacagccagg tgtatctcat gcgcccttcc 600 agtctcgaaa cggccattga ggatgctatt tacgctagcg aggtagaaga gtccgcacga 660 aacccaaaca atttctcaca acagattcca ttacccatat cccgcacgga ctaccgtcca 720 gcagaccagc ggcctcgccc atcaaacttc aacctaagcc gcagggatga aaagttccgt 780 tcacaagagg aacggtttaa taaccggtca actaggcccc tgccgccgcc atctgctagc 840 agggtttcac aggatctgga gaaagacatt gaaagcatac aaaataagct ggcaaacctg 900 accctagcgt tgagacagca gggggaggtg gccacataca tgcatgatag gtgtgaatcc 960 gaaggggatg atggttggga ggatgcggaa tgttattact ctcatgggtg tggctatgac 1020 agtgatgatt gggatgactg cgatgacggt gatcctgagt tatatgcacg tgagcgtgct 1080 ttgtattcta ctatctaaac gagtgagtga ttttgagccc atcaggctgc ccaacaaacg 1140 tgtgcctgtt gaacctacac agaccctgcc catggagatc aatgaccggc ctgcaacccg 1200 tcggttcagt gctgcccgta ccgctggagg tgcttttggc tcgcccacac ccctgaaaga 1260 gactcccccg cctgacacac gggccacaga ttctccccag cctgatacag ctacggcctt 1320 tcgtgtgacc aacacggcaa cacctgcgcg acgggaacgg actgacaagg gtttaggagc 1380 gggcggagga agggcaacca cagggtaccg tgatattggt cgtgcaccca gagggccggg 1440 gggtgctgct acacgcgtgg accccattcc attcgatgcg tccgaagatg agaccactca 1500 gcgcattgtg gctgacgtgg tatctaaaat aaacaagtac tcaatccctt tgggagccgc 1560 agtccgggta aatccggtgc aaatttatag caaggttggg gggaaattga tgggtgtcgc 1620 acgtatggtg ggcacacgcc caagggacga acccgccagc agggatgcta atctatatga 1680 aagccaccga tgtaaggatg acgtgcttcg ccccctttct gatttcgatg tagggtggcc 1740 tctgcctgtt agtgtgccga ctttccacga cgcggagccc gcctccaagg caccaacaca 1800 cacaaccaga ggggcccgtg tccctgggct tgcccgcgtg acgtctggct attcgccacc 1860 gtccgccccc acctatcaac aaacaaacac attgccctgc tacaatctgg atactcgtcc 1920 ccagaagttt tcggtttgca ggggctatgt ccgcctgacc aacgccaggg gagagaagga 1980 agtaatccgt gcggtcattg acacgggcgc ctcgtacagt gttgtacctg tgtacatttt 2040 gcggcgactg gacctcatgc cgtatctagc tgagtcacgg gcaatgttcc tgaatgcgga 2100 tggacttaag aagaaagcca cagggcaggt acgaggcatg caagttagtc tagggggtga 2160 tttgacgtac accatggaca tgtatgtgtc ccaagctcaa aactacgaat tgcttttggg 2220 aatggatttc ttataccctc tcaaggcctc cattgatttt gaggggcaag tactggaatt 2280 cactaatgac tcaaaccagc gcagcaccat acctattaca tgtgttgtgg aggaagccgc 2340 ggcgccacgc accacttccc ttagggagga atgccctgat ggagtgcccg agatctacat 2400 gctggatggt gattcccctg caaccaaggc actcgtggcg cctgatgaaa ccgctggtgt 2460 aatcagtcct gcacgcaggc attctcgcat tatttccttc agggctagaa aacggactgt 2520 cccagcagat catatgcgca tgcggcaggt tttccgttgt gccaaaccca ttgtttccct 2580 ggctccgtgt gggccctgtt ttcatagcaa aatagagaac tcaacaggta atggaagtta 2640 cccatattta ccgcaccggc aacgcagccg tcaggtgaca tggttatgcc ctcagcatcc 2700 ctctccaaag ggatggtgtg gcccagacaa gggcccggta tggactgcca gtccccccgt 2760 gacattgacg taggcccggt acaggccctg acaatggtgc atattggagc cgtaggggtt 2820 cccctaagat gaagcgtatg cacgcacgta ttaccactca aaataaaacc taaaaaatca 2880 caaacaaacg gagtatagtg ttaggagcag gccgtggcat ccacgtgcag ggttggacgg 2940 ggacgcccaa ctgtaggccc ggtaatgcta cagc 2974 // ID TE2-6_CR repbase; DNA; CHL; 784 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE TE2-6_CR is a family of nonautonomous transposable elements - a DE consensus sequence. XX KW Transposable Element; AAC target site duplications; TE2-6_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-784 RA Kapitonov V.V. and Jurka J.; RT "TE2-6_CR, a family of nonautonomous transposable elements in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 141-141 (2004). XX DR [1] (Consensus) XX CC TE2-6_CR is a family of nonautonomous transposable elements. CC The consensus sequence is less than 5% divergent from TE2-6_CR CC copies. Classification of this family is not known. It is CC possible that it belongs to a novel superfamily. Features of this CC superfamily include preferential insertions into AAAC targets CC that are accompanied by the AAC target-site duplications, a CC 18-21-bp CC palindrome separated by 3-5 base pairs from the 3' end; and the CC AGCC CC 3' termini. XX SQ Sequence 784 BP; 159 A; 211 C; 251 G; 161 T; 2 other; ccctaccgct acgcagtacc ccggcttgcc cctcccccct caggcgcccc gcccgcggtg 60 taagaggggg ggggggggca gttaccagcc caagccaatg gaggggaatg gaggggtggt 120 gcaagggggt atgtacagcg attttctagg aagaccaacg cattggcaca tccatgatct 180 cgcattcagt agttatgtag tyatggtmtt agcgctgatc tgatcagcac ggggcgccta 240 tcgcgggggg gggggggcgg gacacggcgg gcggaagggg ggcgagagtg gcgtgaggat 300 cggggcacag ccgtgggagt atgcgactgg ctgctccgct gcgggctacg agcatagtac 360 cagccattat cgcgcatgaa gaatgaaaag cgcttatcag atcagtgggg cgctgtattt 420 cgcgcgagga tttaagtgtc ggagggccgg atcctccgat gcctccgatt ccgtcgtcta 480 tcgccgggcg cctgctcgtt aaattttaca aataacgtaa catacaccac gcttggactc 540 tagcactgca tcggtacttg tgcgttacac caattcctcg ccatttcact cgtttttgca 600 aagttatgcg tcgccacaag tacagtgaac agcttcttaa atttagcacg cattttggct 660 ggagcaacgg ggcgtgccaa gtgcacttta ccggttactg gaataaccgg gagggccccc 720 cgcggcgagc tcgatgcgac cgcaatggct attgcccctc ccccttcggg ggaaggggca 780 agcc 784 // ID Gypsy-5-LTR_CR repbase; DNA; CHL; 1000 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-5-LTR_CR is a long terminal repeat of Gypsy-5_CR LTR DE retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 4-bp TSDs; Gypsy superfamily; Gypsy-5-I_CR; KW Gypsy-5-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1000 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-5_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 6(5), 237-237 (2006). XX DR [1] (Consensus) XX CC It is a long terminal repeat of the Gypsy-5_CR LTR CC retrotransposon, whose internal portion is Gypsy-5-I_CR. XX SQ Sequence 1000 BP; 214 A; 327 C; 267 G; 192 T; 0 other; tgtcgtgtca ttctgccggc tttgcacggc gccgccaccg gtcacaccac ttccccccac 60 agcagagagg acgaatactt gcagtcccga ggctgtcacc atgtgtgacc gcctgtccac 120 aagtccatag cgaaaggggc tcaaacgcgc ccgaaagccc catacagctc atcagcatcc 180 aggtgagtag cttctatgtg cagctcgtct caagggcacg cctgaatgtg acgcttgtct 240 ccggccaccg gctgcggggc acacgaaacc gcgggcacgc ggtcccccca gcttacgtcc 300 ccggtgatgt cagatgcgcg tccagaacca ccgtggacat gtcacaatga caacatacaa 360 agaccactcg caagcagtct cgcacggtcc gccgagcatg ttcatgtcca ttcaccaccc 420 tctgacgcgg ctcaataact ccataaagcc gctctccatg aaaggccagg atcctcactg 480 ttggaattct tctactgttt agagaactct ctaggctgct ttctcgttac ttggtggctg 540 caggataggc cgaccggcgc ctccgcaggc tgcccgagca gcgcgctcga gaaccggtat 600 ccgagacttg cgaagccggc ggctgtctcc tcgcctaact agtccgtttg gcagctcagc 660 tgacggtgat acgtggttgc ttaggctaga gtgacgttgc tatgcgcctg cgcgcctaga 720 cttggctgca acccgaagac gccccaacag atgacttccg gaacccagtg cgctgtccac 780 cagagttcgt tacagacgtc gatccattgt tggtgcacat aagagtaaac ctgtgcacag 840 ttgggagtaa cgagaaagca gcgccggtac aggccggcgc tgcgccctta tagccgggcg 900 acagcctcat cccgctgcat ggcacgaact aacccccaca accatggcac ggttagcccc 960 tgctgccggc gaaatgccat gccggcataa tggcgcgaca 1000 // ID Mariner-N4_CR repbase; DNA; CHL; 568 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE Mariner-N4_CR is a family of nonautonomous Mariner DNA DE transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW mariner superfamily; Mariner-N4_CR; nonautonomous DNA transposon; KW TA target site duplications. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-568 RA Kapitonov V.V. and Jurka J.; RT "Mariner-N4_CR, a family of nonautonomous Mariner transposons in RT the Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 132-132 (2004). XX DR [1] (Consensus) XX CC Mariner-N4_CR is a family of nonautonomous transposons. The CC consensus sequence is less than 5% divergent from CC Mariner-N4_CR copies. Features of Mariner-N4_CRs include 23-bp CC terminal inverted repeats and TA target-site duplications. XX SQ Sequence 568 BP; 124 A; 178 C; 135 G; 131 T; 0 other; ccgtactgcc aaaaaggttt cgttttatta aagttcggga gggctgctaa atattgctgg 60 aataaattcg atacttttta atttcgtggg cggttaggga aaagtaacaa ctctgatcca 120 aattttcggg gacgccaccg cctccgtcac cgccaccgcc agccatcgtg cgcctgggta 180 gcatcccgcc acgcacttgc ctacagcaaa gcaagtgccc tctcattgca aaccaagccc 240 actagcaccg ccgtgccacc actaaagatg ccttcgccag caactgccag cgcagcgcgc 300 cgctgccaat cacgcgcgca gcacgcgcgc aagctcctca tcaatgggtg tgtcagcgtc 360 gtccgactca gcgccctcgt gctccaccgt gcctcctcgc agtctccccg cactcccccc 420 gcgcggcagc gcgcgctaca cgggaaattg ttgctagtgg tatttaattt cgggggaatt 480 ttcatttaaa taagctgaaa ttgtttcggc gctcaatttc gtggactgtt cacgaacttt 540 ggatcaccaa acctttttgg cagtacgg 568 // ID NonLTR-3_CR repbase; DNA; CHL; 1070 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE NonLTR-3_CR is a 3' terminal portion of a non-LTR retrotransposon DE - a consensus. XX KW Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; NonLTR-3_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1070 RA Kapitonov V.V. and Jurka J.; RT "NonLTR-3_CR, a family of non-LTR retrotransposons from the green RT algae genome."; RL Repbase Reports 5(3), 64-64 (2005). XX DR [1] (Consensus) XX CC The NonLTR-3_CR elements are flanked by 5-10-bp target site CC duplications. NonLTR-3_CR elements are ~98% identical to the CC consensus sequence. CC The consensus is not complete at its 5' end. CC The 3' terminus is composed of ATGCTC repeated two-three times. CC The NonLTR-3_CR and NonLTR-2_CR consensus sequences are 65% CC identical to each other. XX SQ Sequence 1070 BP; 212 A; 311 C; 366 G; 181 T; 0 other; ccgccatggg cgcgggctgc taactctcga cagtggaacc tcagcccgat cgtcgcccca 60 gcccgagctg ccgccgagca cggcgagcag tgggcctggg tgccggttgg aaagctttgg 120 gagcttcagt agtgtcagcg accagcagga ctcctgcccg gctgccagca ctagtgcgat 180 gagcatcggc actccggact ctggcgcggc cttcccattt cggcgcgcta cggaactgct 240 gcagcgtctc acgcgcctgt cttacacgac ccgcctgttc gatctcgcca caacaccagc 300 gctcgcgacc atccaggagt acctagcggc cttggaggag gcacttgggc agctagagca 360 gcaaacatca gcagccacgc agcacggcga cagcattcag cagctgcagc tacaagcgct 420 gacgctccac gcctctgtct cagaggccca gggcgctgcg cttgcagctg aacggcggca 480 gaaggagctg caaggcacgg tcgcagcgct gcaagtccag atggaggagc aacgttgcgc 540 gctagaggac acccagcgcg ccctggcccg agcagggtgc tccggggggg tcgacgaggg 600 taaagtcctg ctgaagtcgg ttagccctga gctgctcgag aaactcaagg cggctggaaa 660 tctggaggcg gtggccagtg cgagtggcgc aaacgtctcc ggacgatacg tgcgcgcgtg 720 gcggtcaaag ggtagggagg tgtggaatgt cgtggtggag gtgacacagc aggagcggag 780 cgcgctcatt cgggtggcgc agcaggtccg tgcggctggt gccgttgtcg caccgtatct 840 gacccgggta gggtgtgaga tgcgcaagga gcaactgaag gcttttgagg agcttcgtga 900 gaaggggctc cgaccccgat ggaaaggtgg agcgggcatt acgtgctcag gcgcgtaaac 960 agcgtataga cgctttagaa gacactttgc tttttgctat caagcgcctc cggactcaga 1020 agagttagtg gggccacatg gactccgacg ccatgtaaat gctcatgctc 1070 // ID Gypsy16-I_CR repbase; DNA; CHL; 3910 BP. XX AC scaffold_45; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy16-CR; KW Gypsy16-I_CR; Gypsy16-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-3910 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 96-96 (2008). XX DR Genome; scaffold_45; Positions 491923 488014. XX CC Positions [673-1167] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 337..2568 FT /product="Gypsy16-I_CR_1p" FT /translation="MHLLRNGTHREGTSPLERRRAARRAERYRMQGGDLIL FT MGLDGRTRKVPPPPERLAIVKKTHEDTGHFGMRRTQALLLNSYWWPQISND FT VETVLKHCEACSRVNATFGSRNPQLTPLPIGGLFYRWGVDLCGPFQPTQRG FT HRYVMICIEHFSKYVILVPLTDKEASCTSFAFRQHVLGLYGACAEVVTDQG FT SEWKGEFASLLQESLIDHRQTSADHPQANGLAERSVQVCKRSLRKIAHSNT FT GSEQWDKFLPYIMLGYNCSVHSSTKVSPYSILHAVDPVTPPAIKERFAEEV FT NLDDPEAAARSIMQRGTAVKRNMTIAGGNLLTAQHRDKLRYAALRTGGMSP FT VLRRFEVGDYVYYRNTTARTSLEATARPDIYRVIEARPSGVLVLEGKCGTT FT ITAHVSHCAPCHLPIADQVVDPRLARPAPTHHCEVCKFPDGEEWMLLCDAC FT GTGWHTYCLKPPVKEIPEGTWVCPNCTKKGVTPEEVDAKGPTPAPRSLQPP FT MHLRKLVGATVMREGKGRNRRSKRQLGVASYAGRRGRAHYFTVEYEDGVSE FT MLSLPELRNRITTAKPTKHAAAAASVGELGDLTNVEHARGALCASMPGEHM FT AEKAERLACAARTGRVRAQPMTASEVEQLCEVIPLDLCGSVFVPWKCSDEV FT VEVLRSNGCRVRGSMEKEDVRCVEEKSFEKAKSDGVFNGVALLDVSPEMAD FT VALPAASKHSSVCVAAKLPWEYVTQADSARLKWCARCSGSRSWW" XX SQ Sequence 3910 BP; 806 A; 1014 C; 1228 G; 862 T; 0 other; ttccgatgtt tgcagatctg ttatgtcggc ctctgcaaat ggccgtgttg gccacggcgt 60 aggcggcctg aagcgcatgc gtagcgagcg agagggtggc gcaagacgat cacgttccgg 120 cttacatgtt gcctcagagt ttaacgtcca cgcaacacgg gcacggtgct atgctcccat 180 gtagccgtga gtgcacccac aggcgggctt gccgccgcct ttgtgtccgc ggcatggcct 240 acttgcccgt tagcggcatt ggccactgag ggcctagacc cagagtctcg caagacgggc 300 gattcaacta aggacatttg ggatgatgac gctaccatgc atttgttgcg caacggcact 360 caccgcgagg gcacatcgcc actagagcgg cgccgcgcag cacgacgcgc agagcgctac 420 cgcatgcaag gtggtgatct aattctcatg gggttggacg gccgtaccag gaaagtgccg 480 ccgccgcctg agcgcctagc aatagtgaag aagacgcatg aggataccgg gcacttcggt 540 atgcgtcgca cacaggcgtt attgctcaat tcgtattggt ggccgcaaat ctcaaacgat 600 gtggagacag tgctcaagca ctgtgaggcc tgcagcaggg tgaacgctac ctttgggtca 660 cgcaatccac agctaacgcc cttgccaatc gggggtctgt tctatcgctg gggcgtggac 720 ctgtgcggac ctttccagcc cacacagcgt ggccatcggt acgtgatgat ttgcattgag 780 catttcagca agtacgtcat tttagtgccc ctgactgata aggaggcatc ctgtacttcg 840 tttgccttcc ggcagcacgt gctgggactg tacggtgcgt gcgccgaggt ggtaacggac 900 cagggcagtg agtggaaggg ggagtttgcc tctctcttgc aggaatcgct tattgatcac 960 cgccagacct cagcagatca tccgcaagcg aacggcttgg ctgagcgctc tgtgcaggta 1020 tgcaagcggt ctctgcgtaa gatagcacac agcaataccg gcagtgagca atgggacaag 1080 tttctgcctt acatcatgct cggttacaac tgctctgtgc attcttccac taaagtcagc 1140 ccatattcta tcttacatgc cgtggaccca gtcactccac cggcaatcaa agagcgtttt 1200 gccgaagaag ttaacctcga cgaccctgaa gcggccgctc gttctatcat gcaacgtgga 1260 actgctgtta agcgcaatat gacaatcgca ggcggtaact tgctgacggc tcagcaccgc 1320 gacaagcttc gttacgctgc gttgcgcact ggaggcatgt ccccggtact acgtaggttt 1380 gaagtaggtg actacgtgta ctacaggaat accacagctc gcactagctt agaggcgaca 1440 gcgcggccgg atatttatcg tgtcattgag gcgcgtccct caggcgtgct ggtgctggag 1500 ggtaagtgcg gcacgactat tacagcgcat gtgtcgcact gcgcgccctg ccatctgccg 1560 atagcggacc aggttgtgga cccgcggctg gcacggccgg cccccacgca ccactgtgag 1620 gtatgcaagt tcccggacgg ggaagagtgg atgctgcttt gtgacgcgtg cggcacggga 1680 tggcatacgt actgcttgaa gccacctgtg aaggagatcc cggagggaac ctgggtttgt 1740 cctaactgca ctaagaaggg tgtcactccc gaggaagtag atgccaaggg cccaacacct 1800 gcaccacgct cactgcaacc gcccatgcat ctgcgcaaac tggtgggagc tactgtcatg 1860 cgtgaaggga agggcaggaa caggaggtca aaacggcaac tgggcgtcgc cagttacgcg 1920 ggaaggcgcg ggcgtgctca ctatttcact gtcgagtatg aggatggtgt gtcggagatg 1980 ctgtcgctgc ctgagctccg caacaggatt accaccgcca agcctaccaa gcacgcagct 2040 gccgctgcaa gtgtgggcga gctgggagac ctgaccaacg tggagcatgc gcgtggagct 2100 ctctgcgcgt ccatgccggg cgagcacatg gctgagaagg cggagcggct ggcctgtgcg 2160 gcgcgcactg gacgggtgcg tgcccagccc atgacggcgt ctgaggtgga gcaattgtgc 2220 gaggtcattc cgctggatct gtgtggaagc gtgtttgtgc cgtggaagtg cagtgatgag 2280 gtagtggaag tgcttcgtag caacggttgt cgggtgaggg gtagcatgga gaaggaggat 2340 gtgcgatgtg ttgaggagaa gagctttgag aaggccaagt ctgatggtgt gtttaatggt 2400 gtggctttac tagatgtatc cccggagatg gcagatgttg ccctgccggc ggctagcaag 2460 cattcgagtg tgtgtgtagc tgccaagttg ccttgggagt atgtcacgca ggctgattcg 2520 gcgcgcctga agtggtgcgc tcgatgcagc gggagcagaa gttggtggtg attccgtgtg 2580 gcacttgtgt gtggattgtg atctttgcat ctgtgatggc gcggagtgcc tttgtgcgca 2640 ccactggcgg tccggttgtg agtgtttcca tcttttaagt gcgatcttta tgcgtgtttt 2700 tcaacttagc gcctacttat agcagtctcg gcgtctttcc aatacgctgt tgactggggg 2760 gggagtcccg cgtaggcacg tggaatgtct gtgtacacac gtacgggcgc gacagctgtc 2820 gtgtcctggc gtgtgagcgt gcgtgccgtg tcacaccgac cattgggtcc ggtctttgca 2880 tttcctcttt atcattagcg tgcagacaag acaataagat ttacgtcctg catgatttgg 2940 agtcacgctt gtatgcgcgg actatagaaa gatctgtttg gataagagcg tcgtactgca 3000 tggaggcagc gacgtctcga ggccaagtgc gtgggagaga gccaccagcc cgtcttttgg 3060 ctctcgtccc gaacgtgcgc cccgtattac agaccaagcc tagtctcgtt gtaatatttg 3120 agagtctgat caactcctac gacgggaata gcgacatctc cggccggagt ctggagctaa 3180 gccacacagt acgttgagca gggctgcgta aacgcatttg ccttgtgtgc ttgtgtaggt 3240 aacagagaca caaaccccgc tggcgctcaa atcattctaa cggaagaatg cccacagagc 3300 gttggggagt ttaagccagt ttgaggacgg agatcgccat gaggcggacg ttgccgagtg 3360 ccggagcgaa acgccgcggt ggagagcatc tgcaaaatgc gagaactcgc gcagtcgaag 3420 gtgttgacga gttctgtagg ttcatacatg gcagtcattg aagttgcgta tgtcgtgctc 3480 atgtctttca gtgtttgaga cgcacacgtt gcatcgatgt tggaggcgct gtatgcccca 3540 gctcgcccag gagctcatag gagtggtgcc ctgaatgccc tgaggctcac ttggttgtgt 3600 tatgtctgag gttcgccatt agccgccgtg aacggagcga caggctcaac gcgcatggct 3660 atcacgcccg cgagccctct ccattgcggt tcacaccaag gcagtgtagg catggtttgt 3720 gagactttca gacgcatgac tgcactgaga cagacgtaca cgcacgcaga ctggcaatat 3780 gcatacggca caccctgcac aggcccagac gttgcagacg agcgccgcat tgacattgcg 3840 ccgcacgctg gggggggagt cgccgtcagg cacacgaaac gttcattgtc ctatcatagg 3900 gcgcgacagc 3910 // ID Gypsy-4-I_CR repbase; DNA; CHL; 4701 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-4-I_CR is an internal portion of Gypsy-4_CR LTR DE retrotransposon - a fossilized copy. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 5-bp TSDs; Gypsy superfamily; Gypsy-4-I_CR; KW Gypsy-4-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-4701 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-4_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 6(5), 234-234 (2006). XX DR [1] (Consensus) XX CC This is an internal portion of Gypsy-4_CR LTR retrotransposon. CC It could be incomplete and encodes remnants of gag, integrase and CC chromodomain. There are only a few copies of Gypsy-4_CR present CC in the genome. They are over 98% identical to each other. The CC internal sequence is incomplete and may contain insertions of CC external TEs. XX SQ Sequence 4701 BP; 914 A; 1471 C; 1436 G; 880 T; 0 other; attggtatcg agaccaggtt gagtacctgg gccacatcgt gtctgctgat ggtgtgcgga 60 tggacccgaa gaagtcctcg gcggtacgga actggccggc gcccaggaac ctgctggagc 120 ttcgcaagtt tttggggctg accaactact tccgtaagtt cattgagcgt tactcaattc 180 tggctgcgcc gctcacggca ctgttgaaga agggcgcgtt tacgtcggct tctgcttgga 240 cagcctcctg tcaagccgcg tttgacgcta tcaagcaggc tgttgcttct gatatcttgc 300 ttgcttaccc tgattactcc cagccattcc gcgtagaggt tgtgactgac gcctcgctgc 360 ttgggtccgg cgctgttctg ctgcaaaacg gccgccctat tgccttttcg agcaagaagt 420 tcacaggtcc ggagactcgc tacaccacgg gtgagcagga gtttctggcg gttcttcatg 480 cgctcaagga gtggcgctgt tacctggagg gccaaccgtt tacggtaaaa actgaccaca 540 agcccttaac atttctggat ggtgtaccga cactgaaccg tcgacaagcc cgctggatgg 600 agtatcttgc gcgctttaac tacacctggg agtacctgaa ggggaccttg aacgttgctg 660 acgccctgag tcgccacccc cttctgcagg gtatctggtt acacatggtc acgcgtggac 720 aagcgcgaaa ggcttctggg ctgggcacgg cgccctctgc cgcggctaag cccgcgcaga 780 cgcaagcgcc tgctccgaag accgccaagc ctacaccgtc gcctgctgta ccggctccgg 840 ctgcgcctga caagaagccg gccgctggga ctaagcgtgc taccgctact gctcccgcgc 900 ctgctcctcc tgtcaagacg cgtcgcctgg tgagtccacc ggcctgggct gctgcacccc 960 cggtagcgac tggtgacccc gtgaccgccc cggctgattc ccctgaggtg tctgtatctt 1020 tcggggaccg gattcgcgct gcctatgcct ctgatccgtg gttctctagc gagaccaaca 1080 cacgcgacct ggcgtgtgac ggtgggctgt ggtatcgcct tgtccttgga ggagggcctt 1140 acaagcagga gagagccaag agaacacaag cccccagcga ggccagcgca gcgaaaccca 1200 cgccaacgac cccgccgcag cgcgggcaga aaacccacgc aaacccgacg aaaaaaacca 1260 gaaaagcgca gcgcgcacac caccacagag aacacccata agcgcccaat ggcgcccaaa 1320 caacagccct gacaagcgaa agaagacgcc ccgcaggagc gccaaggaaa aaactcggaa 1380 aggccaaagc cacgcacaca cctgcggaca ggggacagag gggaagcagc ggccgagcgg 1440 ggcaagaacc aagaacagcc aggcgcaagc acgacaagca gacgacccac aggcgggccg 1500 agaaacaagg caggagtgga gacaagcagc aaagaggagc gcaccgggcc agaggcgggc 1560 gggcagggag gcacggggct tgagcgagac ggccacaaga ctggccgaga aggctctcgc 1620 tgtgtaggca gaatgctgga acagcgagta acgaaaccga taaccagcga cagctggtac 1680 gacttccatc tggaagctcg cttagaggct ccattccctc cgttacggct tcgccttcac 1740 tacagtcctc cgcctctaac cccacttcca gactgcagtc tccccagctg ccgctgatgg 1800 ctgaccaaat gccgctgggc tgggaaggga ccaggaaccg accagctagg aggagggcct 1860 tacaagcaga acccgaagag ataccacggc tggacgaggc ccgcccggtg ctggccaccg 1920 ctggaccccc ccccccaacc cgactgaccg agagtcgcgc acgcgatcga ggctcgaagg 1980 tttacagaac caggggtgcg cgcagaggcg ccactgctcc gcaacgccgc cgcagcggaa 2040 gtaacttgca ccgaaaccga caacagccaa ttaccgagcc gacccgcacg cgagatgtgc 2100 agcgaaagac cgagggcggc gagaaccgag ggcacgcggg ccatagcgaa ctccgaccgc 2160 gaccactact tcttgtcctt gccgctgtcc ttatagttgt cgcgcttgta gctgctgtag 2220 cggtcgtcct tgtagcggtc cgagcgggag cgctcgcgct cgcgggcggc agccgaggac 2280 cactcgtcgc gcttggcctt gacctcgtcc gcggcgaagc gcgaccagtc gaagggcgcc 2340 tccttggtgg cctcgtagct cttgggcagc gcgcgcaggc cgatctcggt ggcggtcgac 2400 atgccatggc ggggctccgc gccgagcttc tccatcgcgt actccagcac ctgaaagaag 2460 tcttccagag tggacaggtt cttgcgtagg gaggggaggt cgccagcgtc aaggagcagc 2520 cgcgagtaat gcacaatgcg cttggtcgcc gagtaggtgg ccaggtggag ccccttagag 2580 tccgaagggc gcgaaggcgc ctcgatgaga gtctccgcgc gccgacgagc gcggtccacc 2640 acgtcgcgcg aagcggccga tgagggctgc ttgggcgcgt cgtcctcgcc accgcccgcg 2700 ccaccgccgg cagcgggctc ctcacctacg tccgtctcga agtgctgcag cagctggtcg 2760 actgagacct cgatgccgga gaccttcgca tggatggtgt ccagcagggg cttgagcgcg 2820 gccgccagct tgttgaactt ggtctggctc actagcactg gtttgatcgc gctctggagc 2880 gcggccatcg ccgggctggg cgaggcaccc gccgcgcgcg cggccggggc atcatcggcg 2940 agtttctcct ccgccccggt cgcctcctcc aggtccgccg tgtagttgtc gtcctcagcc 3000 gccagagcag cgccgcggcc gcggcctgtg ccgcggccgc cgcgaccgcg gcctcggccg 3060 cgatcactag cggtactcat ctccagcacg gggcttgagc gagacggcca caagactggc 3120 cgagaaggct ctcgctgtgt aggcagaatg ctggaacagc gagtaacgaa accgataacc 3180 agcgacagct ggtacgactt ccatctggaa gctcgcttag aggctccatt ccctccgtta 3240 cggcttcgcc ttcactacag tcctccgcct ctaaccccac gtggtatcgc cttgtcaatg 3300 gcgcgcgtgt gatagttgtc cctgctgatg ataacttgcg ccgtgacatc atttctgcgt 3360 gccactctgg ccctctgtct gggcatgtcg gcactaagcg tactctagac attgtgtcgc 3420 gctcgttctg gtggcgtggc ctgtctgttg cggttgactc ctttgtccgc gcctgtgacc 3480 tatgtctgcg cagcaagtct gtctccggta agaccgctgg tacgctgcac tctctgccca 3540 ttcctgatgc gccttgggag tctgtttccc ttgatttcgt ttgtgcgctc cccaaaactg 3600 aagggggata cgacacatgc tgtgttatgg ttgaccggct aacgaaaatg gtgcatcttg 3660 tccctacgcg aggcacacct gattccccta cgacagctcg cctcttcttc gacaacgtcg 3720 tgcgtctaca cggtgtgccc aagtctctgg tgtccgaccg tggtccgcag tttacaagta 3780 agttttgggg atctctgtgc acgcttgtgg gcatgcggag caacctgtct accgcatacc 3840 atccccagac tgatgggcaa actgagcgca tcaaccgggt gttgggagat atgctgcgca 3900 actttactgt gggcagccct aacacttggg acttgtatct gactgctgcg gagtttgcga 3960 tcaacaatgc tgtgaaccgc tccactggct tttccccgtt cttcatgaac tacggatacc 4020 acccggcaac cccggtgtgg cgtgagcttg acgtgtctgc ccccgctgcc cgcgagtttg 4080 ctaaatctta cgtgcagagg atggctgagg cgcgcgcctg tctggaggcg gctcagagcc 4140 gtgctgcgaa ttactacgat cgcaacaaga aggacgtcac gtaccggcct ggcgacttgg 4200 tcctgctcaa caccaagaac ctgcgcgctc gggctaaggg accgcgcaag ctgctaccgc 4260 gctggattgg gccttttact gtggtacgtc cggttagtag tggggctgct gttcagcttg 4320 cactgccggc gtcgctgaag aacatccatc acacgttcca cgtgtccatg gtgcgcaagt 4380 acacgggcac ggcaggaggg cgtgctgggc ctgggtctga tgatgggcct ggacctgttg 4440 catggattga cgaccaaccc ctatggacag tagggcgcat cctggacttc cgccgtcgcg 4500 ctgtggcgtt aggtaacaac cgccggcgat tggttaagga gtacttggtc aaatggaagg 4560 gctactcctc taagcataac agctgggagc ctgaagctaa cttcaccccc gatatggctg 4620 ctgacctgca gagcgctcgc gagcgggcga ctcgcaccat cgaggattga ccgaggacgc 4680 tcaatcctga agggggatag t 4701 // ID hAT-N5_CR repbase; DNA; CHL; 783 BP. XX AC . XX DT 13-MAY-2004 (Rel. 9.04, Created) DT 13-MAY-2004 (Rel. 9.04, Last updated, Version 1) XX DE hAT-N5_CR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; 8-bp TSD; KW hAT superfamily; hAT-N5_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-783 RA Kapitonov V.V. and Jurka J.; RT "hAT-N5_CR, a family of nonautonomous hAT transposons from the RT green algae genome."; RL Repbase Reports 4(4), 123-123 (2004). XX DR [1] (Consensus) XX CC About 30 copies of hAT-N5_CR are present in the C. reinhardtii CC genome. They are ~97% identical to the consensus sequence. CC hAT-N5_CR is characterized by 16-bp TIRs and 8-bp TSDs. XX SQ Sequence 783 BP; 190 A; 193 C; 214 G; 185 T; 1 other; cacgcgggta ccactctggt cttttgctgt gtccaaactt tgacagcctg gtagacccaa 60 caggtatcat ctgaccaggc cgccagcata cctacggctt cgtctcgttg tgtagtatac 120 ggttactatg gtgcggagct gagcagatga aacctgaaag gccgtacggc tgcagccgtg 180 catggggtgt gcagtgggac acgcaaatat tttcgtcgtg aaaactaaaa gcgtgatagc 240 gtgatggata gcgtacgcac agtgtatgca ggatacgcaa tcgccactgt aaaaagaata 300 ggcatcaaaa agttatgtgc acaattgtcc gggggggtgc atgctcctag ccaaacccgc 360 acgctcggag ggctctggga gtcgcccgag gcaaatccta ccgcagtatt tagacgccgc 420 ctttttgtag gtacataact tacatcatta gtggacatta gcgctcggca gaggacaaaa 480 catgttggtc agggctcaag tcgcgaaatt gaccagagcc catatctcgc ataggcggta 540 ttttgggcct gtatctagtg ccgtcgacta aggttcacct atakatgtca tccggacagc 600 gtagggcgaa ttccagtaag tcctggcgag cgcgacattt agcatttcga cttcggctcc 660 cgctcgagtt tatgcccgga aatcattcag ggaccttatt atatgtcaca tgataacatt 720 tcggcagtct agggcattgg acacacgcag tccctaggga cttgccggag tggtacccgc 780 gtg 783 // ID SINEX-5_CR repbase; DNA; CHL; 404 BP. XX AC . XX DT 20-MAR-2008 (Rel. 13.03, Created) DT 20-MAR-2008 (Rel. 13.03, Last updated, Version 1) XX DE SINEX-5_CR is a putative non-autonomous non-LTR retrotransposon - DE a consensus. XX KW Non-LTR Retrotransposon; Transposable Element; SINEX-5_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-404 RA COGNAT V., DERAGON J.M., VINOGRADOVA E., SALINAS T., REMACLE C. RA and MARECHAL-DROUARD L.; RT "On the evolution and expression of Chlamydomonas reinhardtii RT nucleus-encoded transfer RNA genes."; RL Accepted to Genetics (March 2008). XX DR [1] (Consensus) XX CC 11 copies of SINEX-5_CR are present in the C. reinhardtii genome. CC The 100-bp 3' terminus of SINEX-5_CR is 49% identical to the CC RandI-5 3' terminus. Target site duplications are not present. XX SQ Sequence 404 BP; 77 A; 99 C; 128 G; 100 T; 0 other; gggggggtag ctcagtaggt aagagcactt ccttatcacc ctgcggaccc gggttcgaat 60 ctcgtattcg gcccgtttcc cggcggataa ggttgaggtc gtgggtcgat cccacccccc 120 tcaccgtagg tgttccggta tggagtccgt ggcaccgaag aatcgttgta tcgaccctct 180 gggttcggat cccattggtg caacgaggaa actctacgct ggccgaattc ggagggttgg 240 taggccgata agctcagatc ttcggatcct atgagctgag tagagggtga tgcgtggggt 300 gcctctagtc agggtcccac agggtacggt agggttctaa gaaggactct tccttaagac 360 acgcggggcc gttggcccgt gcgtagctat tattattatt atta 404 // ID MSAT-1D_CR repbase; DNA; CHL; 120 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE MSAT-1D_CR is a minisatellite unit - a consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-1D_CR; MSAT-1_CR; KW Interspersed repeat; minisatellite; nonautonomous; tandems. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-120 RA Kapitonov V.V. and Jurka J.; RT "MSAT-1_CR, a family of minisatellites from the green algae RT genome."; RL Repbase Reports 5(3), 55-55 (2005). XX DR [1] (Consensus) XX CC The MSAT-1D_CR is a subfamily of MSAT-1_CR minisatellite. XX SQ Sequence 120 BP; 18 A; 27 C; 68 G; 7 T; 0 other; ggggggcgca ggcgaaggcg gctccggcga gggaggcgcg gggctaggag gctccgggct 60 gggaggctcc ggcgagggag gagcaggcga aggtggctcc ggcgagggag gggcaggcga 120 // ID Mariner-N2_CR repbase; DNA; CHL; 519 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE Mariner-N2_CR is a family of nonautonomous Mariner DNA DE transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW mariner superfamily; Mariner-N2_CR; nonautonomous DNA transposon; KW TA target site duplications. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-519 RA Kapitonov V.V. and Jurka J.; RT "Mariner-N2_CR, a family of nonautonomous Mariner transposons in RT the Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 130-130 (2004). XX DR [1] (Consensus) XX CC Mariner-N2_CR is a family of nonautonomous transposons. The CC consensus sequence is less than 5% divergent from CC Mariner-N2_CR copies. Features of Mariner-N2_CRs include 27-bp CC terminal inverted repeats and TA target-site duplications. XX SQ Sequence 519 BP; 116 A; 134 C; 146 G; 119 T; 4 other; ccgtagtgat acaaaggttt cgggttttga atgttcggga cagggtacaa atatatgttg 60 aaagttattc gggaaattaa atattcggga cggccgccgg cgaaaaatgc aagtctgatc 120 cagaatttcg ggactcccgc cggcagtccg gccaggccgc cttcatgccc cggacacgca 180 tgcgaccgtg tgtcgtgcac aatggccccg tggctcaggc tttcgtgcac gccaagtgta 240 atgctacray rrcagtcctt tagtatgtca cttgattgca ctccgtgcat cggtgcatgg 300 cgcattcgaa gtaaggccgg cgtgctggcc ggcgtgtggt ggtaagaccc ccttgacccc 360 ctccaggcgc gcgcagcgcg cgctccggaa aatagtatat ccgaaaggtg atatttattt 420 tcgggaccga gaaggcgaaa taattaatta aataggttcg ggagcggatt cgggaccgcc 480 tcccgaactt tgctccccga aacctttgta tcactacgg 519 // ID Novosib-4_CR repbase; DNA; CHL; 6379 BP. XX AC . XX DT 00-0000 (Rel. 11.05, Created) DT 01-JUN-2006 (Rel. 11.05, Last updated, Version 1) XX DE Novosib-4_CR is an autonomous DNA transposon - a consensus. XX KW Novosib; DNA transposon; Transposable Element; KW Interspersed repeat; 8-bp TSD; transposase; Novosib superfamily; KW Novosib-4_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-6379 RA Kapitonov V.V. and Jurka J.; RT "Novosib-4_CR, a family of autonomous Novosib transposons from RT the green algae genome."; RL Repbase Reports 6(5), 265-265 (2006). XX DR [1] (Consensus) XX CC Novosib-4_CR is a young family of autonomous transposons. The CC consensus sequence was derived from several copies that are ~98% CC identical to each other. These transposons are characterized by CC 8-bp target site duplications and 20-bp terminal inverted CC repeats. XX FH Key Location/Qualifiers FT CDS join(162..567,894..1765,1894..5937) FT /product="Novosib-4_CR-1p" FT /translation="MTSLEAFFPDSSPCNYCLVCQHVIQIGRKGIRVVNLE FT TYACCGLSDDVLAVRKEQGVRAPLAAVHGACYISQLQPRLRVLREQAAAVP FT RTRAAVKSLPPKPARRAPAGAGQKLGQLQNETAHITGVLPSASQRRTGGGV FT THMETSPQESPGRGGVAAPGACTSSSGHTGVSWDNSKQAYRARSSCKLSGV FT SGQPRHLGHYPTSHDAAAAVNLYHEVLAWCLTWGVEVEPPPNAVPAGVALG FT RAEEAYVRSCALNAELDFWRAKEAALTAQLAEERRSSCADYLTACQLVPEL FT ASFVTTIATGGVLPGDLVVEYAMDAVTNAAKLREGGPAANGVRWRSRIKVV FT AAAAAAQCSGEATLNVLRGDVKDPGLPFPSTSTNRRTMGEDGVVPDATVGV FT LPTGIRKFVARCHVAGRPNADACMGTDGVDIPALLEKEIYPSGVRFAGDAC FT LGDSIGVAALAQKEREYNSLVSRLDSSAARVSDAAGSHASVQHVAAWVQRE FT VLPRARTSLREAQVASEAQDARFQRTNGGAMEAGEQQRLKQQQVVHKCASL FT QRIERAQTVVSLAEGLVRGAQRMHDIQLVSVTVGVLSPEGRITVALTPAGS FT DAEAACAEAVVLLARDMRAALPQLYDVLRHAATSMVVAFLTSPDSSVSELV FT ARMYVGGKAGEQTMRRLIADVAVALATEAKAQQQERRLAEKEQRRQEEERL FT QQLLAKAAPQPAVLPAAQQAAALPVAAAAAEEEPPLPPAGSLLMVVTDGEH FT FSLTNAAGGGAHSSNLAQLNRECQKQCAAAALVRKRTARGQEEVQLALWRD FT FVQHVPLNPPRQVLPRGAATSAAPAGSIAKMPHADALLGVVAAHNKHQAAV FT KAAVAAAAAAAAAAAKAPAVSAAATATEVPPPVQQVPPPAPTQAAHPAAPA FT RPKRTAVAPARYNSGETAVPAKQAKTSTTAVAPLALPAAAPATAADPLYQQ FT LGAFCMVCGSKHSSDSNDMLFCDGPRCSNESRYAGWHQKCLTPAVTMVPQG FT DWFCPRCVTAESRCKEQGHRSGAQAPTPRQQQAGARLLPAALPASRLTPGL FT QPLVSDLYSSAGRLAGSIERRADIYAISPQQVQEHGEKVKDIEAHVPWRQV FT ERPVQQLARELEAHSQAAQQPGSMDGVSPQVAAGYHAYMAAALRPQLPWGA FT DGGAGVAEPPPAAGAAVAVRGGGGTAAATATGAATSTAFRAPGLANGCSGC FT VDAAADRKDTHASRAASRAAAAAASDSVLDAEAAGVLQLVRQLPVREGTAV FT VSRLVWRSQYLARHALLLGDRVSTGADGPSFPFASGMAEAEAYWVYRLTHG FT MMPAGLAPATAEQIVQQAVDQARLDSAVLVCKEAGRDFTRQFHYTAEGIAC FT MVVDEPHKLKTAKPQDNPCCFLAKLDNLASMQRVATHGRLQPGTAAASTAL FT ATATPLSSAAALAAAAARVAAPCSPQLSPSEAEVAAAAAACKYGESCLDKG FT VLLEVAAELQREYQTATLVTRAAKYPGLNPSHLIAILRGETDAQSVNCAQY FT WIGCAPLQEALRERGFVREAVVMRTLMEAFEAFDECGLEPVERALRITRRT FT IMLQRMLGNTLFTARVCRGGAQPTLPQYVDGFTLQNVCSWLINGDSRLEVV FT ERLPAERRSELCERALSSDCCELFFSLLTTLCGGRPALRVIQGAMTRITFL FT LGLRQQTRTARGFSMRRRRTRGAYKEHELMVAAALLDPETVSAGRVGISKW FT NSALKIKNRCLKIKRQKSICKTAGARANGKMGHTRQVMFNKPKLATGVHGA FT ARK" XX SQ Sequence 6379 BP; 1200 A; 1900 C; 2174 G; 1105 T; 0 other; caagcgggtc ccactccagg gttcgtgccc ctaacgggca tgtccccact gattaccgcc 60 aggaaacaac atacatgtaa tagtttcata tgtattactt cattacttct ttaaaagccg 120 tagacgtggc ctgaacgagt caaaatgcaa attcttccac gatgacaagc ctcgaggcct 180 tttttcctga cagtagcccg tgcaactact gtctggtatg tcagcatgtc atacagatag 240 gtcgcaaggg catccgtgtc gtcaatctgg agacttacgc ttgctgtggc ctcagcgatg 300 atgtgctcgc tgtccgcaag gagcagggcg tgcgggctcc gttagcagca gttcatggag 360 catgctacat cagtcaactt cagccgcgcc tccgggtgct gcgggagcag gctgcggcgg 420 tgccccgcac ccgcgcagcg gtcaagtctc tcccccccaa gcccgctcgc cgggcgccgg 480 cgggggcagg gcagaagctg gggcagctgc agaacgagac tgcacacatc accggtgtgc 540 tgccgagtgc ttcccaaagg cgcacaggta tgtgtgggtg tgccgctgag ctttggtgta 600 tgcatgaatg gacatgtcca gtcaacgcta atccacttgc gcaactccac cgggccacag 660 ccgcccgcgc gcaggccacc agtatgctgc ggttggctgg cgaacaagcg cctgcaggag 720 ccgcgggtac gttatgtgga gtgtaagggt gattttgggg gcatgcagtt gtgtgcgggt 780 atgggatgga gcgcgacgtg ggttgggcct gttgctgcgt tggtaggtgt tgttgacggc 840 gctaaccgtg tgtctgctgt gtgtcgccct aactgcgtgt ggacactctg caggcggtgg 900 cgtgacgcac atggagacca gcccacagga gtcgccaggg cgcggcggtg tggcggcgcc 960 gggcgcatgc accagtagca gcggccacac gggcgtgtca tgggacaaca gcaagcaggc 1020 gtatcgggcg cgcagcagct gcaagctcag cggcgtcagc ggacagccgc gccacctggg 1080 gcactaccca accagccatg acgcagcagc ggcggtgaac ctctaccacg aggtgcttgc 1140 ctggtgcttg acatgggggg tggaggtaga accacctccg aacgctgtgc ctgctggcgt 1200 ggcactgggc cgcgcagagg aggcatatgt acgctcgtgc gccctcaacg cagaactcga 1260 cttctggcgt gcaaaagaag ctgccttgac agcgcagttg gcggaggagc gccgcagcag 1320 ctgtgccgat tacctcactg cttgtcagct agtgccggag ctggccagct tcgtgaccac 1380 catcgccacc ggcggcgtgc tgcccggcga cctggtggtt gagtacgcga tggatgcggt 1440 caccaacgct gccaagcttc gtgagggcgg gcctgcagcg aacggtgttc ggtggaggtc 1500 gaggatcaag gtggtcgccg ctgccgccgc agctcagtgc agcggcgagg ccactttgaa 1560 cgtgctgcgg ggcgatgtaa aggatcccgg cctgccgttt cccagcacca gcaccaaccg 1620 gcggactatg ggcgaggacg gggtggttcc ggacgctact gtaggtgtgc tgcccactgg 1680 cattcggaag ttcgtggccc gctgccacgt cgccggccgt ccaaacgcgg atgcttgcat 1740 gggcaccgac ggcgttgaca tcccggtgag tggtgtgtgt ttcagttggt ggttcggtag 1800 tgtgcatggt agcgtaacat gcgtagtcac gagcgagaga tcgcttgcgg cccctttcca 1860 actattgcct cgcattgcct tgtatgcttg caggccctac tggagaagga aatatatccc 1920 agcggcgtgc ggtttgctgg ggacgcgtgt cttggtgaca gcatcggcgt tgcagcgctg 1980 gcgcagaagg agcgcgagta caacagcctt gtgtcgcggc ttgacagctc agccgcgcga 2040 gtatccgatg cagcggggtc gcacgcaagc gtgcagcacg tggcggcatg ggtccagcgc 2100 gaggtgctgc cgcgtgctcg cactagcctc cgtgaggcgc aggtagcatc ggaagcgcaa 2160 gatgcccgct tccagcggac caacggcggt gccatggaag ctggagagca gcagcgactg 2220 aagcagcagc aggttgtcca caagtgcgcc agtcttcagc gtatcgagcg ggcacagacc 2280 gtggtgtcgc tggcagaagg acttgtgcga ggagcgcagc ggatgcatga catacagctc 2340 gtgtcggtga ccgtgggtgt cctctcgccc gagggacgca tcaccgtcgc cttgacccca 2400 gctggcagcg atgcagaggc ggcatgcgca gaagcagtgg tgctgctagc gcgtgacatg 2460 cgcgctgcgc tgccacagct gtatgacgtg ttgcggcatg cagccacctc gatggtggtc 2520 gcgttcctca cgtcaccgga cagcagcgtt tccgagctag tggcccgcat gtacgtcggt 2580 gggaaagcgg gtgagcagac gatgcgacgg ctcatcgccg acgtcgcggt ggcgcttgcg 2640 actgaagcca aagcgcaaca gcaggaacgg aggctggcgg agaaggagca gcggcggcag 2700 gaggaggagc ggctgcagca gctgctggcc aaggcggcac ctcagcctgc agtgcttccg 2760 gcggcgcagc aggcggcggc gctcccagtg gcggcagcgg cggcggagga ggagccgccg 2820 ctgcctccgg cgggcagtct tctgatggtc gtgacggatg gagagcactt ctcgctcacc 2880 aacgcagcag ggggtggggc tcactcctcc aaccttgcac agctcaatcg tgaatgccaa 2940 aagcagtgtg ctgctgcggc actcgtgcgc aaacgcacgg cacgtgggca ggaggaggtt 3000 cagctggcac tctggcggga tttcgtgcag cacgtgccat taaacccgcc acggcaggtg 3060 ctcccgaggg gagcagcgac ttctgcagca ccggcgggta gcattgctaa gatgccacat 3120 gctgacgccc tgctgggggt agtggctgca cacaacaaac accaggccgc ggtgaaggcg 3180 gctgtagcgg ccgcggctgc agcggccgcg gctgcggcca aggctccggc cgtgtcggca 3240 gcagcaacag caacagaagt cccaccgcca gtccagcagg tccccccgcc cgcccccaca 3300 caggcagcac acccagcagc accggcgagg cccaagcgca cagcggtggc tccagcacgt 3360 tacaacagtg gcgagacagc ggtgcccgca aagcaagcaa agaccagcac cacagcggtg 3420 gcgcccttgg cgctgccagc agctgcacca gctactgctg ccgaccccct ctaccagcag 3480 ctgggggcct tctgcatggt gtgcggcagt aaacacagca gcgacagcaa tgacatgctg 3540 ttctgcgatg gtccccgctg cagcaacgag agccgttacg cgggctggca ccagaaatgc 3600 ctcacgccgg cagtgaccat ggtgccgcag ggcgactggt tctgcccgag gtgcgtgaca 3660 gcagagtcgc gctgcaagga gcaggggcac cgctcgggtg cacaggcgcc aacacccagg 3720 cagcagcagg ctggcgcacg gctgttgcca gcggctctgc ctgccagccg cttgacgccg 3780 ggcctgcagc cgctggtgtc ggacctgtac tcgtccgcag gacgccttgc ggggtccata 3840 gagcggcgtg ctgacattta cgccatctcc ccgcagcaag tccaggagca tggcgagaag 3900 gtgaaggaca ttgaagcgca cgtgccatgg cgtcaggtgg agcgccccgt acagcaactg 3960 gcgcgggagt tagaggccca ttcccaggct gcacagcaac ctggcagcat ggacggggtc 4020 tcacctcagg ttgctgcagg ctatcacgcc tacatggcag cggcgctgag gccccagctt 4080 ccctgggggg ctgatggcgg cgcgggcgtg gcggagccgc cgcctgctgc tggtgcagcg 4140 gtggcggtgc gtgggggcgg gggcacggcg gctgcaactg ctaccggcgc agcaacaagc 4200 actgctttcc gggcgccagg gctggcgaat ggctgctccg gctgtgtgga cgcagcagct 4260 gatcggaagg acacccatgc ctcccgtgct gctagcaggg ctgcagcagc agcagcttct 4320 gattcggtac tggatgccga ggctgcgggc gtgcttcagc ttgtccgaca gctgccagtg 4380 cgggagggta ccgcggtggt ctcgcgcctg gtgtggcgta gccagtactt ggcacggcat 4440 gcactgctgc tcggggatcg cgtatccaca ggcgctgacg gtccttcctt cccatttgcc 4500 agtgggatgg cagaggcgga ggcctactgg gtctaccggc tcacgcacgg catgatgcca 4560 gcaggtctgg cgccagccac ggccgagcag atagtacagc aagcagtcga ccaggcgcgg 4620 cttgacagcg ctgtgctcgt gtgcaaggag gccggccgcg acttcacgcg ccagttccat 4680 tacacagcgg aagggattgc atgcatggtc gtggacgagc cgcacaagct gaagacagca 4740 aagccgcagg acaacccttg ctgtttcctg gcgaagcttg acaaccttgc ctccatgcag 4800 cgcgtcgcca cccatggccg cttgcagccg ggcacagcag ccgcatctac agccctggct 4860 acagccactc cgctcagttc ggcagcagca ctggcagcag cagcagcccg ggttgcagcc 4920 ccgtgcagcc cacaacttag cccgtccgag gcagaggtgg ccgctgcagc ggccgcctgc 4980 aagtacggag agtcctgcct ggacaaaggc gtgctgctgg aagttgcagc agagcttcag 5040 cgcgagtatc agacagccac acttgtcacg cgtgcggcaa agtaccctgg gctcaacccc 5100 agccacctca tcgccatcct tcgcggtgag acggacgcgc agtccgtcaa ctgcgcgcag 5160 tactggatcg gctgtgcccc gctgcaagag gccttacggg agcgcgggtt tgtgcgggag 5220 gcagtcgtga tgcgaacgct gatggaggcg tttgaggcat ttgacgagtg tggcttggag 5280 ccggtggagc gggcgctgcg catcacacgg cggaccatca tgctgcagcg catgctgggc 5340 aacacgctct ttactgcgcg tgtttgccgc ggcggcgcgc agcccaccct gccgcagtac 5400 gtggatggct tcacactgca aaacgtgtgc agctggctca tcaatggcga ttcgcgtctg 5460 gaagtggtgg agcggctgcc agcggagcgg cgttcggagc tgtgcgagcg agcgctcagc 5520 tcagactgct gtgagctgtt cttctcgctg ctgacaacac tgtgcggtgg ccgtccagcg 5580 ctgcgagtga tccagggcgc catgacccgt atcaccttcc tgctcggact gcggcagcag 5640 acgcgcacag ctcgtggctt cagcatgcgc aggcgccgca cacgcggcgc gtacaaggag 5700 cacgagctca tggtggcagc tgcgctgttg gacccggaga ctgtcagcgc tgggcgtgtg 5760 ggcatcagca agtggaacag cgcgctgaag atcaagaatc ggtgcctgaa gatcaagcgc 5820 cagaagagca tttgcaaaac tgctggcgca cgtgcgaatg gcaagatggg gcacacgcgc 5880 caggtgatgt tcaacaagcc gaagctggcc actggggtgc atggggcggc gcggaagtag 5940 agtacgggag gtcacaagta acgcataggt agaggctgct gaatgtctgc ttaggtacac 6000 gtgatgaata gggaaacgga gcagggcagg gcggagcagg ggtgaccagg ggcacaccgt 6060 gggacccccg gtgacaattc ctggcataac tttttcactc ggacgacttg cataatgatt 6120 cttaaatact ctgaatgtgc tcggcaagtg caatacatca cgctgtaaat ttacagttct 6180 ggtgaaaaat aatttttggg tcccccatgc aacacacccc ggacggcacg cggtgaaaaa 6240 cccctgatag attgtccttc gacgagacgt agccgtagac gtggcggcat ccttgaagga 6300 caatctatca ggggtttttg gtgccgtcgc gttggcgcct ccctggggac acactttttc 6360 ctggagtggg acccgcttg 6379 // ID hAT-N4_CR repbase; DNA; CHL; 645 BP. XX AC . XX DT 13-MAY-2004 (Rel. 9.04, Created) DT 13-MAY-2004 (Rel. 9.04, Last updated, Version 1) XX DE hAT-N4_CR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; 8-bp TSD; KW hAT superfamily; hAT-N4_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-645 RA Kapitonov V.V. and Jurka J.; RT "hAT-N4_CR, a family of nonautonomous hAT transposons from the RT green algae genome."; RL Repbase Reports 4(4), 122-122 (2004). XX DR [1] (Consensus) XX CC About 50 copies of hAT-N4_CR are present in the C. reinhardtii CC genome. They are ~95% identical to the consensus sequence. CC hAT-N4_CR is characterized by 16-bp TIRs and 8-bp TSDs. XX SQ Sequence 645 BP; 142 A; 174 C; 191 G; 138 T; 0 other; caagcgggtc ccactccagg aaaaagcgtg tccccaggga ggcgcccacg ggatggcacc 60 aaaacccctt ggtagattgt ccttcaagga agccgccacg tctacggact catctcgtcg 120 tcatctcggc gcctatggct tgctgatgct tgaaggacaa tctaccaggg gggtttcacc 180 gcgtgccgtc cggggggtgc gtttccccag ccaaacccgc acgccgagga ggccaggaga 240 cgtcgccccg agaaaaatgc agagcgggat acagactccg ccacttaatg tatatgttac 300 atgcaatctg ttggagcaat agcgcttgac agcaggctaa aacgtgctcg tcgaggctca 360 agtcgcgaaa ttgaccgaag ccaacatcct acataggcgt tgttttcggg ctttatcgat 420 tgccgtcaac acagatatat gcataggtgt agttaggcta ctgtctggcc tggtatggcg 480 tgtcctggcg aacgcgacat ttagatgttc gaggtcggcc ccggctcgat tcaatgcccc 540 cgggttactc taatgacctt atgtgtggta taagatgcta taggaaaggt attgggctgg 600 ggacacgacc ggcccgtagg gccgtgcgcg agtgggaccc gcttg 645 // ID Gypsy-8-LTR_CR repbase; DNA; CHL; 2044 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-8-LTR_CR is a long terminal repeat of Gypsy-8_CR LTR DE retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 4-bp TSDs; Gypsy superfamily; Gypsy-8-I_CR; KW Gypsy-8-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-2044 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-8_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 6(5), 240-240 (2006). XX DR [1] (Consensus) XX CC It is a long terminal repeat of the Gypsy-8_CR LTR CC retrotransposon, whose internal portion is Gypsy-8-I_CR. XX SQ Sequence 2044 BP; 401 A; 625 C; 598 G; 420 T; 0 other; tgtgggtgct ttgcacgggg cgccgctgca cgttgacggg acggattttg cacggacgtg 60 tttggcgcat agtcatataa acaatgagcg catgagccgc agcaccagag ggtcgtagcg 120 gcaggacgcg gatagcgaga gcaccgagca agtggatcga ctttggatct agcaaaatag 180 cggagcaagc agggccgccg gagcttgcca gtaaatcggg tggagaagat attacagtac 240 taatcgagca aggctgcggt gcaagactag gactggctgc cttaagctgt cggagcggcg 300 aagggccgcg aggagcgcaa gcgacgcgca ggcttagtat ggttgacttg tagtggatga 360 tagtttaagt tgacaacggt cccgtgaccg gttagcgtgt gcaaggtggt aagcttaatg 420 ctggacctag acgactgaga actttacgga tgcctaggag aacgccgcag gtgaagcgcg 480 gctgtgtttg ggagctgaaa gggtattagt tgcgcgcagc gatgcaagtg ttggcgcagt 540 gtatgtgtga gggacccgat gggcccttag cgaagttggt agcacttggg cccctcggcg 600 cagttgctta gcacttgcga tttggagcct aggcctgtac gacctgtacg agtcccatcc 660 ggccaatgca ggggccgacg ggaatatata agcgagcgag ttggggtagt tcagtcactc 720 tcctcctccc aactcctcac accaactact cgagagtcac tcttaaatag actctttaat 780 tacgagtctc tagtattgta atacaagttg gacaaagaac gatcccttgt atcgcggccc 840 cttgtgctcg gcgcgaagtc gcccgctcgc gcaagtctag gctagccttt cccctccccc 900 acagggtgcc tcccggcaca accaagtacc cgttcgaagc ccccttcccc ctccccttac 960 tatctctcac ctactccggt tcatccgagc cccagactct cccctctccc tatctctccc 1020 ctaccccaca ccgctgtgcc tgttgccccc gacctatccc cgtaagtcct gagccgttac 1080 ccgcagcccg tcagtgatag ccaaggcccc tgcccacact agggtcaaga gtaccccaac 1140 ccgcatagcg tacgccagca gagcctctgc gccaggttgc atacgtgtag tgtctggcag 1200 cgattaggta cagcagctga tagcagggcg agctgggagc gagccagagc agggtccggt 1260 tcggtagttg ggcctgagcc cgttcggtta gtggcagtag cagggagtga gggagacaga 1320 gacgggcgta agatagcgtg agatagtaca gggcctgggc ctaggcagtg gggcggacgg 1380 cattggaccg ggcacgcacg ccttgtctgc gggttgggtc gtcgggggct ctcccccccc 1440 tttcccttgg ggtgttcgcc tcccctgccc cagattgcag gtcgactgcg acacctctgg 1500 tctgaagaag ttttgcttat ccctgcacct ggtctacctg acctgcgtgt gaccaagtcc 1560 ccgacacgcc gtgccggctt gcgggacctg cttgctttcc tgtctcccct ctcttctccc 1620 ttcccgtctc taccacccgc cgcgtgcccc gcacgcgaag ggtaacccca gctaccgccg 1680 gcggcggtga ctctacttgg gcaacccgcc cgaccttatc tctctaatct ctgattaccc 1740 tgggcggcaa ggttgagtga gacagccttt gccacggctg tgcgcctgca cgcacgccac 1800 ccatccggct ctgtacgagc ccctgctcct gttatccaac ccgcctagac caaacagcga 1860 agcccctccc gtgagtaaaa cagctaccga cgtccccata cggcggcgct gggcactgct 1920 gggatcgaag agccccctct acggcacgcg atctgccccg acagctgttg acacgcggcg 1980 ataaagttag gcgaaaaaag tgtcctgtca tctctggggg gtcggcacgc ctttgtgacc 2040 gaca 2044 // ID LINE-8_CR repbase; DNA; CHL; 1881 BP. XX AC . XX DT 24-FEB-2008 (Rel. 13.02, Created) DT 23-MAY-2009 (Rel. 14.06, Last updated, Version 2) XX DE Putative non-LTR retrotransposon - consensus. XX KW Non-LTR Retrotransposon; Transposable Element; LINE-8_CR. XX NM L1-8_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1881 RA Jurka J.; RT "L1-like non-LTR retrotransposon from green algae."; RL Repbase Reports 8(2), 82-82 (2008). XX DR [1] (Consensus) XX CC Originally named L1-8_CR and classified as L1-like. However, it CC lacks many features of L1 elements and its classification is now CC open. XX SQ Sequence 1881 BP; 330 A; 544 C; 630 G; 377 T; 0 other; acaccgccac cagcgccagg cgactggtgc tgggcagcac cattgtgggg caatccgctc 60 ctgcctgagg gtgaccacac ccgacggcct ggccttgagg cgcaccatac acgcctcgcg 120 gcgtgtcgcc acctgcgcac cgtaggtgac ctggtgcaag cactgacagc ggcggcagca 180 tggcgcgcaa cacggccagt gggagcgact gggggtgtgg catgggagga atgggtgcgg 240 cagcaccttg acccggagac gcagctacgc gagcagccgg cgctcgaggg cctgaatagc 300 cttgcagcag acataccaga gacctggctg gcagcagcac gtgcagtaca ggaacggctc 360 caggctgggt tgggccccgc cccgccctcg gagtcggcag tggttcggct gatcatgcct 420 tgccttggct ggaggatacc tggcatgcag caacccctgg cactggagaa gctgtctgta 480 cgtgttgcta ccgtcatgca gctgggagac gtcggggtag cccgcagtca actgcacacg 540 gcatatgcag ctgaggcgcg cgagacacca ccagcagccc ccccacacac aacgctggat 600 tccccccttc gaccggctct ggcggatacg ctgggagaac gcccacaagg agacagcttg 660 gcgggtgtca atcgacggca ttggcattgc cggtaacacg catctcacac atgcacctgg 720 tccggtgccg tgtggatgcg gtgtgacgcc aacccacagc cccaggctac acgcgttttg 780 gcagtgcgca gtagctcagg cggttgtgca gcagctcgca gcgcagcttg cggccccgcc 840 caccagggag caggtgtggt tgtttgaggc cccgactggc gtgcaacaag cagtgtggga 900 tgtagtggct atggcggcgt tagcggctat ggagaggggg cggtgcgcgc tgtacgccgc 960 aaggcgccgt ggcctagcca acgctgccac agcaccactg ccgcaacaac ctctacagca 1020 gggccagtta ccgcagcagc agccgcagca gcagccgcag cagcagcagc agcaacaaca 1080 ggctcaaata cagacggaca cccaacaagc gattgacacg gcagtgatcg acttgtgggg 1140 tcgcttaaaa gcattcacag cactgggagt gccgcggaag ggctgggatc aagtgggtgc 1200 gcagcatccc ttcctgagag tagtaaacgg ccagttgcgc tgtcacacag gtatgaacgc 1260 ggaacaaccc gctcaggcca tcacggacag tgacgaggac taagcacggt tccgcaggtg 1320 caagagggtg ggctagagag ggtcagggtt tctgctttct gttttctgtt tatgtttgtg 1380 gttgggctgg ggggcttcgc cagggccagc gatgcagggt gtggtttccg ccagagagcg 1440 gcccctgctg gtccttttga gggggcttgg ttagcttgct ttatgcgccc taagcgcctt 1500 gctttccctc tgttgtggcc aggggtgttg tttgtggcgg ggcggccttg tgtgtgtttc 1560 tggccttggc gggcggtgtt ttcctggcgt gtgttttggt ctggtgtgta cgtgtggtgt 1620 ggtgtgaagg aggcagttgt gtctgcccct ctgaggctcc gccctagcct gctggctgtt 1680 gtgctttctt ttggtggaga aggcgtgagc cttgtgtctc cgtgtctctt ttgtcttgac 1740 caggactggg gccttatccc ctgctgtgtc aggcgtgctt aggggtgagc gtgtggaacg 1800 cgctcaccca acactagggc ttgtcagccc tagtcatcag cggcttcgcc actaccaagt 1860 gtaattattc tctccctccc t 1881 // ID DNA-8-4_CR repbase; DNA; CHL; 2825 BP. XX AC . XX DT 29-MAR-2006 (Rel. 11.03, Created) DT 29-MAR-2006 (Rel. 11.03, Last updated, Version 1) XX DE DNA-8-4_CR is a family of P nonautonomous DNA transposons - a DE consensus sequence. XX KW P; DNA transposon; Transposable Element; Nonautonomous; P-2_CR; KW DNA-8-4_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-2825 RA Kapitonov V.V. and Jurka J.; RT "DNA-8-4_CR, a family of P nonautonomous DNA transposons in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(3), 112-112 (2006). XX DR [1] (Consensus) XX CC This is a consensus sequence of a DNA-8-4_CR family of CC nonautonomous P DNA transposons. DNA-8-4_CR elements are less CC than 2% divergent from their consensus sequence and they are CC characterized by 151-bp terminal inverted repeats and 8-bp target CC site duplications. DNA-8-4_CR is a nonautonomous deletion CC derivative of the P-2_CR transposon. XX SQ Sequence 2825 BP; 565 A; 789 C; 840 G; 631 T; 0 other; caggggtgcg gggtggccaa ctcggcccat gtgatcggca acggtcggca accgccccca 60 attcccagtt gcgaccaggc cccggcccga tttctttgca tagacctata aggagtaccc 120 catagaagac ccctggcgcg ccgggcgact agtttactgg cctttgacga aacttgacca 180 tttgaacagc attaccgcat ttcccgcctc gatgcagcgc ccggtgcgcc aggtgaccat 240 ctagaagtgt tacgaagtag atggagcgaa gcggacaagg cggtcgctgt tggctgctct 300 gggcgctgga gggctctgct ctgacggtag ggtcgccggc accgccctta gtcttgttcc 360 gatgaatgat atgggctatc ctttgctcct atatttagcc tagcgcatcg ggcatatcat 420 ctgaggcgtc gctggctttg gttgcttcac attttaacgc tctgctgctt cttatacctg 480 tatcaagcct ttccgctatt acctatacca tatcaacgag gtggtcacgg acgaaatctg 540 caaagcgttg cccgcgcgtc caggatttta gctcgcagtt tgaaccgggg gtccgggtgc 600 agaggggcca gcttttggta cccgctacca gcaccacatg catcaacacc agggatcatg 660 gcacctgcag cgcactgcag tttaaccaat gcagtttcga actcggggcg ccatgcaaca 720 ttccttgcac gtcataaatt gtttgccttg gctcgttccg ctatttctag tatactcagg 780 atgccgattg actggatgga gggccgagcg cacctacggc acgccagcca tgtcagctcg 840 cagcatcggg atgcactggc cccgggcaaa cgcatgcata cagccgtgcc cacattgtgt 900 tttgttatca tatgagcata tggtcaacat agggctagtt ccctgtggtt aacaacgccc 960 ggtctggtgg tggcgtgccg attgcatgtc atcgacccac attcctagac gtgggcaacc 1020 attacttgct gttatctgag gtgtatgaat tgggttctct gttgtgcatg caggcgttcg 1080 ctgcaatggg cgtcgagaag ggctggtgcg ttctgtgcgg cacgtacgtc aagtcgaagg 1140 acccaactac tgcaatggaa ccggcagcgc accatcctag atgcagcagc agcggcagca 1200 acagaggcga agggaggcac gctggagcag ttccctgagc atgagcgctt tgtagtttgc 1260 acgcatcatg cacagcttca acggccccaa gtgggaccgc gcggtccagg atgactggca 1320 ctgcaacgtg caccgcaacg caagtgacgg tttggtgttg cggttccaat ccagcaacag 1380 ctgcgcacac gacctcaact cgcttcagct tcatctgtac cccgacttca catactgcat 1440 tttcagctct catgcaacag tgcagatgcc gcctgacgga cccagcagcc cgcgcctctg 1500 cacggagcag cacctcactg acgccttggt gtttccgggg gcttgccgcc cctgtgccgg 1560 cggccccact gcagctgcct ttgcaccata cctggcgcac aaggcagctt gcaacgaggt 1620 tgtgctgcag cgttaccggg ctgagtcgcg caccgtgcac cttccgtgca gccgcgccag 1680 tggcgagact gctgccactc agaaacttga cgtgcatccc agtatcgcgt gcgatggctt 1740 ggtttgttta cgcagctgta catgtctgct gccatttcaa aacacggcat actagtggcc 1800 cctctgtaca tctggaatgc aaccgtgaac tggtggtgtg ggctgtgtcg gtcatagtgc 1860 cacgcacccc tttttaattg caacctagtg caaatgacgt acgatatggt ctgaggagtg 1920 gtatgccacg ttcgaggtgg gttaagggta tgccacgttc gaggtggggg aggcgctgga 1980 ggggatggag aggtcacagg gcaagcaatg cagatgcatg ggaagtgtgt tagagctata 2040 tgtgtatgta ccggtatgcg tggatgcatg caaagggcgc atggagcatt tcatccgcct 2100 gcaacgagtg aaaagggcga ggagcatgga gtggcagggg tgcctaggtg cgcgtgggtg 2160 cgctagcaag ggacctggtt tggcacatgt gtttgggtgc tcagtttagc aagcctgcac 2220 tgtgaatcca tcatgtaacc agcaagctcc aggcgcatca gccgggtggg cggggatggc 2280 ttagtgccgg tcgcggggag tacgaggggc aaagtgccca gcggcttaac acagcctgca 2340 caggggggcg cgaagttggc gacgtccggc aaggccgtcc tgaaccggcc ccacttgttt 2400 tatgcgccca aattggttgt tacatgtcaa gtatcaaagc agcgtcggcg gctgctattg 2460 cgcagtcttg cttttgaact ccggctcgaa attgcacctc gccggtcgca tggaagcccc 2520 tccctgcttg tagtaggccc ccgaagtcct ggaacaaagc tgcttcagaa ctccaatgta 2580 agacaggtcc ataggttgct atcgcgcagc tctccctgcc tccaagccga ggttccaatt 2640 tgcgtcggca agcatgtttt cgaccaccgt catatagtcg cccggcgcgc caggggtctt 2700 ctatggggta ctccttatag gtctatgcaa agaaatcggg ccggggcctg gtcgcaactg 2760 ggaattgggg gcggttgccg accgttgccg atcacatggg gcgagttggc caccccgcac 2820 ccctg 2825 // ID hAT-N10_CR repbase; DNA; CHL; 574 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE hAT-N10_CR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; 8-bp TSD; KW hAT superfamily; hAT-N10_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-574 RA Kapitonov V.V. and Jurka J.; RT "hAT-N7_CR, a family of nonautonomous hAT transposons from the RT green algae genome."; RL Repbase Reports 4(5), 144-144 (2004). XX DR [1] (Consensus) XX CC A few copies of hAT-N10_CR are present in the C. reinhardtii; CC they are ~95% identical to the consensus sequence. hAT-N10_CR CC is characterized by 11-bp TIRs and 8-bp TSDs. XX SQ Sequence 574 BP; 89 A; 191 C; 160 G; 133 T; 1 other; cagtgtttcg gcaagttttt agggccttta ctttcctagt gcacgggtgg gcattggggt 60 gcggtgactt taatttccaa tgcccaccag ggggcgcagc ccccccacct cgtacggggg 120 gtccgggggg cggaagcccc ccggaaaatt ttttcactat gggtatcaaa atacgcgtat 180 tttcgtatgt catrccccta gtcctcgagc agcgtttgtg ctagccgaca aggcatcacg 240 catgcgtgcg cacgccacac atcttctctt catactgcca caccgcaact gatgagtctt 300 tccccatcct gccgtgtgcc catccgggcg tgcccgtcgc cccggccccg ccggtccggc 360 ccggccccgg cgcgcccctc tcccttcctg gcgcgtgcac cgggccgggg atttcgcagg 420 atgggtgggc acttgagggg tcagggaatt attttttata gtgcccaccg ggcactgggg 480 ctccaggcat ttaattttcc ggtgcccgcc ctccgggccc gacttatttt tccagtgccc 540 acgtgggcac taatgcccac ctcccgaaac actg 574 // ID L1-7_CR repbase; DNA; CHL; 2678 BP. XX AC . XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE L1-like non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-7_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-2678 RA Jurka J.; RT "L1-like non-LTR retrotransposon from green algae."; RL Repbase Reports 8(2), 81-81 (2008). XX DR [1] (Consensus) XX CC Probably 5' truncated. XX FH Key Location/Qualifiers FT CDS 968..2440 FT /product="L1-7_CR_1p" FT /translation="MHGSAAGTYVPPWTAAATALAVQQLHTAAHPLCVLTM FT QRAAGGVGPAADGCAIFGQPVAATCPALVRLLGALSYLPPVTILRPAVLVP FT GPWCFALPLWGNPVLPCAAGLQAGGLERDFPALMALPGLLTLGTAVRCWEA FT LAAVRQLVASKPAGALGPRLARRCTIQYKRSVLRPILRITDMARVPPELQS FT GPDAAAQFSALLARVPAAWRVAASATLHAPGGAASAPPAPQVWLLLAQSLG FT WRHGAADVPLLTLTVKQATQLQLAPAYDALRVRHLAFIQEAYSGAAPPAEA FT IHALRAALARLWALVWEPRHKEPLWRLAVNGFTGFGMLAAWAADGRVEKCP FT CGTQMTAGARVHHFWDCVVAEALRDVMREHANVDITRNQLWLVQAPPGLSQ FT AVWDIVCLAAVAALEYGRQRLYACRDAADRTAEVAVVRRIGVEVIADFWSR FT LAAFVSLRRPPRRWDLVPNQHPFLASDDVGGVILVGPTADSPPASP" XX SQ Sequence 2678 BP; 446 A; 820 C; 882 G; 530 T; 0 other; tttgcgccga tggggagggg gggggggggg ggagggccct ccccccgcac gggatgcaag 60 catcgtacgg tagtgtaggc accagcatac aagcacgggc ggtgttcccg ccccccccat 120 acaggagggc actcgcacgc gaacggacgg cgcgcgtcag tcaggcacgt tcggggcgaa 180 taggccgcgg ccagcacacc agaaacatag agagcttcca aatatattat gacgaatgcc 240 aaaccgcaaa ggttgagcga ttaaggatgt cagtcaatca gtcagcagga gcactgtgcg 300 cacataatta gttctacaaa tagatcctta ttcaatacat ccgaagatct cagcaatgca 360 cagtaagtcg cgtccgcggg caagcttgct ggtggtgcat ttggctcgct cgctgcgctt 420 caggggaacc gtccgcgcgt agataataca gatcatctcg atctatctaa tgaatacgat 480 gaatcacgca tatgcgcctt ggggagattg cacaatgggg ggcaaaccgg tcccgcgctt 540 cgcatccctt cggccgatct tggcgcttgc aaaaaagtaa aggtgctggg cgccctggct 600 attcttcaac ataagagtca acgcgtttcg ttcacccacc ccacaaagtg catctccttt 660 gaccagcgtt ttctcacagc aaacgctgtc aaaaattaat ggcgcgccgc tgggttgggc 720 gtcagggcgc aaatagtcag ttacacttgg tcgcagcctt gaaagcgggc tggggcgcac 780 gccacacccc aaaaacacgc caggagacac actttggaac agcatgtgta gatgagtggc 840 ccgacctccc ggcctcccct ccgactgcgt cgccctgccg ccacgcgatg gcggctttgg 900 actgctgccg gtggtggaac acgtgcgcgc ccgacacgct gtgttggctg tgcgctggct 960 gcagcacatg cacggctctg cggctggcac ctacgtaccg ccgtggacag ccgccgccac 1020 cgccctggcg gtgcagcagc tacacacggc cgcccacccc ctctgcgtcc tgacgatgca 1080 gcgggcggcc ggcggtgtcg gacccgctgc cgatggctgt gcaatcttcg gccagccggt 1140 ggcggcaacc tgcccggccc tcgtgcgcct cctgggtgcc ctcagctacc tgcctccggt 1200 cacaatccta cgcccggctg tgctggtgcc agggccctgg tgctttgcac ttccgctctg 1260 gggcaatcca gtcttgccgt gtgcggcggg gcttcaggcg ggtggcttgg aacgtgactt 1320 cccagctctc atggcgctgc cgggtctgct cacgctcggc acggcagtgc ggtgctggga 1380 ggcgctggcc gcggtgcgcc agctggtggc ttccaagccg gctggtgcgc tcgggccgcg 1440 cctcgcacgt cgctgcacta tacagtacaa gcggtccgtc ctgcgcccca tcctgcgcat 1500 tacggatatg gcgcgcgtgc cgccggagct gcagtccggc ccggatgctg ctgcccaatt 1560 ctccgcgctg ctggcccgtg taccggcggc gtggagggtg gcggcgtcgg ccacactgca 1620 tgcgcctggc ggcgcggcct ccgcgcctcc tgccccccag gtatggctgc tgcttgcgca 1680 gagcctgggg tggcggcatg gcgcggctga cgtgcccttg ctgaccctta cggtgaagca 1740 ggcaacccag cttcaacttg cgcctgcgta cgacgcgctc cgcgtgcgtc acttggcctt 1800 catccaagag gcctacagcg gcgccgcgcc gccagcggaa gcgattcatg cgctgcgggc 1860 ggccctggcg cggctgtggg ctttggtgtg ggagccgcga cacaaggaac cgctctggcg 1920 tctggcagtc aatgggttca cgggctttgg gatgcttgcg gcgtgggccg cagatggacg 1980 cgtggagaaa tgcccgtgtg ggactcagat gactgctggg gctagggttc atcacttctg 2040 ggactgtgtg gtggcggagg ctctgcgtga tgtgatgcgg gagcatgcga atgtggacat 2100 cacacggaac cagttgtggt tagtacaggc gccaccaggg ctatcgcagg cggtgtggga 2160 catcgtgtgc ttggctgctg tggcggccct ggaatacggc cgacagcgcc tttacgcttg 2220 ccgtgatgct gcagaccgga ctgcggaggt tgctgttgtg cgaaggatcg gtgtggaggt 2280 gattgcggac ttctggtcac ggctagctgc gtttgtgagt ctgcgtcggc ctccgcgccg 2340 ctgggacctt gtcccgaacc agcacccatt cctagcgtca gatgatgttg gaggggttat 2400 cttggtaggg ccaacagcgg actcgccacc ggcctcaccc tagcttgtca gcttagggac 2460 tcgcaagagg tagtcttatg acgccgaccc acggcttagg tgagcagcgc tagcgtttgc 2520 ggtgagccgg gcctggggtt cctcccctct ccggaggcga ggagcatggg ggtcattcgg 2580 ggatctctcc tcgggtgagc gtgcgtgtct cgtacgtttt tggggccctg gcttagtcca 2640 cggctgtcgt cccacatgta acctctatca gctaataa 2678 // ID Gypsy-10-LTR_CR repbase; DNA; CHL; 931 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-10-LTR_CR is a long terminal repeat of Gypsy-10_CR LTR DE retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 4-bp TSDs; Gypsy superfamily; Gypsy-10-I_CR; KW Gypsy-10-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-931 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-10_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 6(5), 229-229 (2006). XX DR [1] (Consensus) XX CC It is a long terminal repeat of the Gypsy-10_CR LTR CC retrotransposon, whose internal portion is Gypsy-10-I_CR. XX SQ Sequence 931 BP; 224 A; 241 C; 257 G; 209 T; 0 other; tgtgacaaac catttagggc attgaacttt gaataagttg atataagttg tgacttaact 60 caagaaagaa ttcgcaagta ggtcggcgat gtgtggacta ggcagccgca tgcacgcatg 120 cccgggttga acgaacacgc cacccgcgct aggaggttat gggcttaacc ctacctcgca 180 tgcatctttg agacttaagc gcatgttgct acaagggaag ggtagagaag gaaggaggtt 240 atggatgcgt aggtacggac gtgtgtatgc agacgtgtgg ggattacttg caggtcttcc 300 ccccctgtac gctcccttct ggagacggga gggttcgcac taacccaggt tgcgagtgta 360 gaaggtgcgt gtacgtaatt gtcagaccag taggcagtca gacggaggcg gtcagactgg 420 ggatagatta cgtaagtggc aacgttgcac ggatggtctg ttacgtaatt gacaacgttg 480 cacagggatt ctagatccat gtacgaccaa tgcgggtcgc ccctgagtat ttaagcggcc 540 gaccgcttgt tgtttgggca tagaaccggg tgttagcctg aagaaccata ccaggcaacc 600 tttcccaggt cgaacggtag tgccgagtgg acagatcttc ccacttcgga cacaagtgta 660 aaccaccatt gacattactg ttcaatctct cacaaccggc aactcgcggt tgttaagcgg 720 gaagggaaag tgcctaagtc tgcgggacac gcagcagcgg ctttcaccgc tggttttcct 780 caacgctaag ctgacccttc ctctggtcgc cctgcctatc cccaagctcg gcaacccaac 840 caaggacatc gcgctagggt gctaaacccc ccccccccct ggcagtctcg ataccttagt 900 ctccacagct atccaacccg ttaagaggac t 931 // ID Novosib-6_CR repbase; DNA; CHL; 3392 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Novosib; DNA transposon; Transposable Element; Novosib-6_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX CC incomplete at both ends. XX SQ Sequence 3392 BP; 650 A; 885 C; 1180 G; 677 T; 0 other; ctgttgggct tgtgcgggaa ctcggtgcgc ccccagtggc tgagcatgat gcccggccac 60 acctggtgtg ggttggagtg cagcgacaca caccacacat caccatcacc acacatcaca 120 ccatgattac accatgataa cacatgaaca tgaatcaagc aaaagagaga gggggagggg 180 gaggggagag ggggagggga gaggaagaag ggggtttgcg gggttggggg cacacggggg 240 cggccccatg ccgagggggc aaaatatgtt catcttactt gttacagcga gttagaagcc 300 cagaccggtt ttaatagcct ttactacgtc aatgtatatc aactattaac tctggcaatg 360 taaagctgcg tgtgccccgc caacttcctt cgatgctgaa tgcgaaatgt ctcgcgagcc 420 acgagcaagg tcagaaccac cacagagagt ttgtatagga tgccatagta ccgaacatgc 480 actcgcgcgc aagaaagtat ctatcagcaa cgttccagag acttggctat tgcctccatg 540 ctccagcgcg ctgccccaac gggcggaggg catagcgttg gactgcgctt gtagaaactg 600 cgttcgaaag aatgcagacc gcatggaggc tgctgccctg gaggctcaag cggcagccga 660 ccgagcggct gccgtgctgg cggaggcacg cgcacagaag cgcacgcgtg cccagcacgc 720 cgcacccgag cccccgcccc cagctcccat cccgaccatg tttcagcgcg tgctggacaa 780 gctccggacg cgcacacaaa agcagaaaca ggacgtggac acggcactga cggccgggca 840 ggtggatgtg ttgacagctg agcttgctgc tcaaggcatt accacccgct ctgggctgga 900 gcttgggcag ctgcggcggg aggatctcgc gcgggagccg tacagtactg tgtaagttgc 960 tgaccacggt tactgttatg tgggtgtact tatgggcatg tgctaagctc ccaattcttc 1020 gaagggaggt gaagcgactg ctatgaagtt gctcactgat tgcttgcttg caggcagacg 1080 gagtccacag ctgctggggg accggagctc aaacgtgcgc gtggcggctt gacagctgag 1140 gaactgcccg aggcacggga gctcctggcg gatgctgcca agccgccact ggtcagcccc 1200 gccggaaagc acaggagaag cagcggagga tgaccgggaa ccaatggagg cagaggccgg 1260 tggtggcagt gccgcaacag acagcggcaa cggcagtagg ggacggggca ggggccgggg 1320 ccgcggtcgg ggccgcagcc agggcgagca gccgcctggc gccgccggtg gtgccatggt 1380 ggaagcaggt gccggcagca gtggcggccg gggccggggc cgtggtcggg gccgcagcca 1440 gggcgagcag ccgcctggcg ccgccagtgg tgccgtggtg gaagcaggtg ccggcggccg 1500 gggcggccgt ggccggggcc gcggtcgggg ccgcagccgg ggcgagcagc cgcctggtgc 1560 cgccggtggt gccgtggtgg aagcaggtgc cggcggccgg ggccccagca ctggtgctta 1620 cggaggcgac ccggaggcag tagccagtgg tgccggcgag aaagcagcca cagacagctt 1680 ctcagccagc accagtgggc tgtttcccag ggacgctact gcctgggccc agtttgctac 1740 agcacaaata tttggtgggg aagtccccac tgtacagcag gccctgatgg cgttccgcct 1800 cgctcatgca tcacaacttc ctggcgtgtc gcctgggcat ctggcggatg cagctacgcg 1860 gtgtgcttat gacacagcac gcatgcatct gctcgcgcag gggagggact tctacagcca 1920 cgtatacata gcggaacact ctagcataac cggcgctgta ctgtacgacc atgaggacta 1980 cgtgcacaag ctaaaaaatc tcgtttctca gctgcgcggg gcgccgtcgg agggcgcagc 2040 accgggtgca ctcctgacac gcggccgcat cgaggcggct gcgcaggggc gaccacacct 2100 ggcgcacgtc acggcgggcc ttgccaagac cgtggacgct cagaatgtgc ccatctgtga 2160 ggcgctggtg tacgacgagg agctgcaggc gcagcttctg caggcacgca ggggccacag 2220 ccgtgcttgc ctcgatgctt tggtgctgcg agtggtggga cacgcgcacc aggcatggga 2280 tgcaccacat ctcaccaacg ccacgcggac gcgcaggttg cagaatctgc acattctgct 2340 gcgctcggtg ttgctgtcgg actttgacga cgtgttcaag atggcaacat tcgcttctcg 2400 cactgtcaaa gtgcgtggct tcaccacaga gctgctgttc gtgctgatgg ccaacattga 2460 ggcgcgcgag cagctactgt gcgagctgcc aggtgctggc cggctgctgg tggagcgcag 2520 cttgtctacg gacgatctgg agaacgagtt cagcctcatc gtgctgggct gtggcggcta 2580 caagcccacc gtggagtccg tgatgggctt cttggagcac atggacttcc tgtacgtgct 2640 gcggcgcgtg ggcaccgaga aggggctggt gatgccgaag agtgccaagg agcactacac 2700 ctatcatgag gcgacgcgac gcagtgatgt gcgctggaat gacggggcat ttctgggcac 2760 gggtgacgag gcactgcatg cgcggctgcg cctgtcggag agggtcatta cccgtgcatc 2820 agaagtgctg ggcatcacgc gcgagccctc catccgcttc cgccatcgca taggtgttcg 2880 ctaaataaat gaactgtctg tatgttttaa tgagcagcag cagcagcggt gtgtgtgtgt 2940 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgggtgtat 3000 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 3060 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 3120 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtac acatgacgga 3180 ctaggacttt attggaccaa aggcatgatc ataatgtgtc aacgcattat atgaacaatt 3240 gtattcatat ttgattgaaa tcacccgcat ctgggcgtaa aatggtgact tctgaggcca 3300 gggatgtgga aagagtgagg gaggtcacac cttcgaaccc cccaacttgc aagtcatata 3360 acttttgcgc gacaacgtcc ccagtctccc ga 3392 // ID hAT-N1_CR repbase; DNA; CHL; 893 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE hAT-N1_CR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; 8-bp TSD; KW hAT superfamily; hAT-N1_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-893 RA Kapitonov V.V. and Jurka J.; RT "hAT-N1_CR, a family of nonautonomous hAT transposons from the RT green algae genome."; RL Repbase Reports 4(2), 48-48 (2004). XX DR [1] (Consensus) XX CC About 20 copies of hAT-N1_CR are present in the C. reinhardtii CC genome. They are ~98% identical to the consensus sequence. CC hAT-N1_CR is characterized by 14 TIRs and 8-bp TSDs. XX SQ Sequence 893 BP; 189 A; 277 C; 228 G; 199 T; 0 other; cagtgtttcc aaaagcttat acaaggtatt agcgttatag taccccacgg gtacctgcct 60 gctcaagaca ttttacttcc agtacccggg ggccgaaggc cccgtctccc cttcgtaggg 120 ggggttcggg cgggcggaag ccccccccaa aatttttttt cacgaccata ccaaaataca 180 ctgttttgag tgaattgcat gcgtttaagt cttgccacgt cgtcctgtct tgcttccaaa 240 tcttgagtat cagccagccc gaaactgttt ccgccccgct acatgcagct tcctgcgtca 300 gtctgcacac ccacagccac agcaaccaca actactaagt gcattaatcg ccccgccaca 360 gcttagtccg cccagcggaa ctcggcactg tcacgccagt gcttcagggc tgccgcgtac 420 gggaagtccg acacagagaa ccaccgctgc gccatcatgc gcacagccag gcacaggtgc 480 cgctgctcca gccagttgcg ctgcttgttc ttgatgtagc tcatggtgct gaacacacgc 540 tccgcctcca ctgatccggg tacaatggta gccgccagct tggccaggcg cacgaactca 600 ctgcagctgt cggcaatgta ctgcgcgcga gtcagcgatt gccagaacgc cacgaagctg 660 ccctgcggca gcgtcacggt gccagatgca gcagcctcta agacagagct cgcgtggtcg 720 attgcctccg tgctgaacag ggcccgggta gtatagggtc cggcggatta aatttttggt 780 acccgtgtgg gtactcgagg ccccaataat tagtttccag tacccgcgtc aatttcggcg 840 aattattttc tcagtacccg aatgggtact agtacccggt tttggaaaca ctg 893 // ID TE2-2_CR repbase; DNA; CHL; 764 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE TE2-2_CR is a family of nonautonomous transposable elements - a DE consensus sequence. XX KW Transposable Element; AAC target site duplications; TE2-2_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-764 RA Kapitonov V.V. and Jurka J.; RT "TE2-2_CR, a family of nonautonomous transposable elements in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 137-137 (2004). XX DR [1] (Consensus) XX CC TE2-1_CR is a family of nonautonomous transposable elements. CC The consensus sequence is less than 5% divergent from TE2-1_CR CC copies. Classification of this family is not known. It is CC possible that it belongs to a novel superfamily. Features of this CC superfamily include preferential insertions into AAAC targets CC that are accompanied by the AAC target-site duplications, a CC 18-21-bp CC palindrome separated by 3-5 base pairs from the 3' end; and the CC AGCC CC 3' termini. XX SQ Sequence 764 BP; 125 A; 209 C; 234 G; 196 T; 0 other; ctccccgctc cgcagcgcga tggctcgccc tttgtgggga ttttccgctt gcagtcgtgg 60 tgtaacaggg gaagtgcgca ggtgaagctt cagcgctaga gacatttttc ggcccacggg 120 caacatcgct ccgctgggtc gctgcggcgt tgcgcgctcg cgctgcgctt cctgcccttg 180 catgtgcgtg cactgcactt gctatgcgac gttggctatg gtgggggctg ggggcgaggt 240 tcaggcgcat cgcagcagca cggttgcgga gtgttgcaac gagcacgtcc gcgtcgtcca 300 gtcgttgtgt gcatgcccat gaacaccagc agcagccaca ctaccgcgtc gaagaagcaa 360 tctcctacaa tgtagggcac tggccaccgc atcaatacat tggcggccag gacagcgtcc 420 agttctgggc aaggcgacat tttttggctt tggcttcggc tttggctttg gctttggctt 480 ggcttggctt ggcgtgtggc tttttggctt tggcttggct tggctttgat ttcgacgctt 540 gcctgcaaat gctgatggct ccacatcaag ggcagcggcc tcggtcggag gtgcgctttt 600 tctaccagtt tttctgtgct gcgtatttcc acacgtttct tgccttggca aatagaccta 660 agctgccgct atttgcatac agtcgaaatg aaaaacagag aacaacccta atgtatggtt 720 ctggtcgagg actaaggctg tcgcccccct ttggggggcg agcc 764 // ID Copia2-LTR_CR repbase; DNA; CHL; 305 BP. XX AC . XX DT 13-MAY-2004 (Rel. 9.04, Created) DT 13-MAY-2004 (Rel. 9.04, Last updated, Version 1) XX DE Long terminal repeat of the Copia2_CR retrotransposon. XX KW Copia; LTR Retrotransposon; Transposable Element; 5-bp TSDs; KW Copia2-I_CR; Copia2-LTR_CR; COPIA superfamily; endonuclease; KW protease; reverse transcriptase. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-305 RA Kapitonov V.V. and Jurka J.; RT "Copia2_CR, a family of copia LTR retrotransposons present in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(4), 105-105 (2004). XX DR [1] (Consensus) XX CC Copia2-LTR_CR is a long terminal repeat of the Copia2-CR LTR CC retrotransposon, whose internal portion is deposited in CC Repbase as Copia2-I_CR. XX SQ Sequence 305 BP; 73 A; 74 C; 101 G; 57 T; 0 other; tgttggtatt ctagggtcag ggggagcctg actgagctct gtgcgggtgc aggggaagtg 60 cagccactag gaggggttgc gggcatgaac cgctggagca gtgactcaga ggggcagcgc 120 accgggaccg tgtcggaggc gcaatgcagt gtgccccggg cgctaggcca agttgcgcaa 180 gtcagttgca gtagcagata caacaaattc ttctgacata tgagacactg aacagaacag 240 tcaaacacgc gactgtgtaa agtgtcactt gcgaacccta gcttggccga cctgtctcag 300 caaca 305 // ID Copia10-I_CR repbase; DNA; CHL; 2907 BP. XX AC scaffold_67; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia10-CR; KW Copia10-I_CR; Copia10-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-2907 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 93-93 (2008). XX DR Genome; scaffold_67; Positions 484612 487518. XX CC Positions [2062-2556] - Integrase core CC 'GCCGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 100..2598 FT /product="Copia10-I_CR_1p" FT /translation="MSEDANVAKPFQLTSANAHLWRSALEIYARWKGLALA FT VDNEIAPAADIEKLTALALSHVRSGDQMIYNEVAQAVATHNATAKDDAKWN FT KARKLLDKFEGHYKRMAVLHHSLWEKELRELRMQKDEEISHLFQRVTDLVM FT KLVWSGCRGFAEDKREELIVLNVFSALPKEYAAVAYAATEQYRGSMYSVVD FT AETRIIGFHNTLASMGNEDQGSGEPSTSVAAVGVGGYRGGGGGYRGGRGGR FT GGRGGRWGRGGRGQEDLSTVECYKCHQFGHYQGQCPLNAAGNGGGGGGGGG FT RGRGDYGGRGRGRGGAHVGRDGGDRRMWAITAVPGVHTASNTGVASSSGRA FT DKYSWVMDTGSPMNVSPLRELMSDFKESSVQLRGFNGAQSASIGVGSMILH FT VLGADGVVNYLRLNDVQCVPEADCNLMSRETMLEAGAEWNRTYRAVGSEEH FT YFDCDSNVAFMTAESRAGKLWVRVVGVGPPPTNIGGGMDGVHNTDPADGAA FT GPSSGANSGSQPSPGASKGVSSSSDKGVGVSTSSSAGVDSSGRSAGGSAST FT QRRQSGLEMLLSASRSFKPIIEPVPAPAEKEQPQLSREQLRLRLCAGRWHR FT RYGHLSMGALAGMVRNGSVTGLDVPAEVFEQLAREGHCDTCALTKQVSMSY FT KATGSRAAAPLDLLHTDIWQGDMKSRGGHLYVLTVLDDYSKLSRIALMKHK FT SEAPTKLMDMIREWQCHLGRKVKEVQSDRGSEYMSQQMQEFFREQGIVHLT FT SAPYTPQQNGAAERVQQTIARTARAMLADADLSADLWGEAMVTANVIRNRS FT PTTGNSKTPFELFTGRLPSLEMMRCVKSQVR" XX SQ Sequence 2907 BP; 627 A; 742 C; 1033 G; 505 T; 0 other; ggttatgagc cccgcgtgag tggcgggttt gccctttatt tgttaagagt ctaacaaaat 60 aacctaagtg gccaagtggc cagcctgtga ggcacaaaga tgagcgaaga tgccaacgtc 120 gctaagccgt tccagcttac atcagctaat gcgcacctat ggaggtctgc gcttgagatt 180 tatgctcgat ggaagggcct cgctttggcg gtggacaacg agatagcgcc ggcggctgac 240 atcgagaagc tgacggcgct tgctctaagt cacgtgagga gcggcgacca gatgatatac 300 aacgaggtcg cacaagcagt ggctacgcac aacgctaccg ccaaggacga cgccaagtgg 360 aacaaggcga ggaagctttt ggataagttt gagggccact ataagcgcat ggcggtcctg 420 caccacagcc tttgggagaa ggagctgcgg gagctgcgca tgcagaagga tgaggagatt 480 agccacctgt tccagcgcgt caccgaccta gtcatgaagt tggtttggtc gggatgccgt 540 ggcttcgccg aggacaagcg cgaggagctc atcgtgctga acgtcttcag tgcgttgccc 600 aaggagtacg ctgctgtcgc gtacgctgcc acggagcaat accgtggcag catgtacagt 660 gttgtggacg cagagacccg gattatcggg ttccacaaca ccctggcgag catgggcaat 720 gaggaccagg gcagcggcga gccgtccaca tcggtggcgg cagtgggcgt cggtggctac 780 cgcggcggtg gtggaggcta ccgcggcggg cgtgggggtc gcggcggacg cggcggccgg 840 tggggccgcg gcggccgcgg ccaggaggac ttgagcaccg tcgagtgcta caagtgccac 900 cagttcgggc actaccaggg acagtgcccc ctgaacgctg ccgggaacgg cggcggcggc 960 ggcggtggcg gtggccgcgg ccgtggcgac tacggtgggc gtggccgtgg acgcggaggt 1020 gcgcacgtgg gccgggatgg tggtgaccgg cgcatgtggg cgattacagc agtgccaggg 1080 gtgcacacag ctagcaacac gggagtagct agcagcagcg ggcgagcaga caagtacagc 1140 tgggtcatgg acacgggatc gcccatgaac gtatcgccgc tgcgggagtt gatgtccgac 1200 ttcaaagaga gcagcgtgca gctgcggggg ttcaacggcg cgcagtcggc aagcatcggg 1260 gtcgggtcta tgatcctaca cgtgctgggg gccgacggag tagtcaatta cttgcgcctg 1320 aatgacgtgc agtgcgtccc tgaggcggat tgcaacctca tgtcgcgcga gactatgctg 1380 gaggctgggg ctgaatggaa ccgcacctac cgcgcggtgg gttccgagga gcactacttt 1440 gactgcgaca gcaacgtcgc attcatgaca gcggagtcac gcgccggtaa gctatgggtg 1500 cgcgtggtcg gcgtgggtcc gccgccgact aacatcggcg gcggcatgga cggtgtgcac 1560 aacaccgacc ccgccgacgg tgccgctggt ccgtccagcg gcgccaactc cggcagtcag 1620 cccagcccag gggccagcaa gggagtcagc agtagcagcg acaagggggt cggcgtaagc 1680 accagcagca gcgctggtgt cgatagcagt ggtcgcagcg ccggcggcag cgcaagcacg 1740 cagcgcagac agagtggtct cgagatgctt ctcagcgcta gcagatcctt taagccaata 1800 atagagccgg taccagcgcc ggcagaaaaa gaacagccac agctgagcag ggagcagctg 1860 cgcctgcggc tgtgtgccgg cagatggcac cgccgatacg ggcaccttag catgggcgca 1920 ctagcaggga tggtgcgcaa tggcagtgtg acgggcttgg acgtgcccgc ggaggtcttc 1980 gagcagctcg cccgcgaggg gcattgcgac acctgtgcgt taaccaagca ggtcagcatg 2040 tcgtataagg cgacagggtc cagggcggcg gcgccgctag acctcctcca cactgacatc 2100 tggcagggtg acatgaagtc ccgcggcggg cacctgtacg tgttaacggt gctagacgac 2160 tacagcaagc tatcacggat agcactaatg aagcacaagt cagaggcgcc caccaagttg 2220 atggatatga ttcgagagtg gcagtgtcat ttggggcgca aggtgaagga agttcagtct 2280 gatcgtggga gtgagtacat gagtcagcaa atgcaagagt tcttcaggga gcagggaatt 2340 gttcacctga cgtctgcccc gtatacgccg cagcagaatg gcgcggcgga gcgtgtgcag 2400 cagacgatag cgcggacggc gcgggcaatg ctggcggatg cggatttgag cgctgatctg 2460 tggggtgagg ccatggttac ggcgaacgtc attcgcaacc gctcgccaac caccgggaac 2520 agcaagaccc cgttcgagct gttcacaggt cggctcccga gcctggagat gatgcggtgt 2580 gtgaaatcgc aggtacggtg aagtgaagcc gtaagtacag cgaatgtggg agcattcgga 2640 cacgggggag ccgtgcagat atgattgagt atggatttcc ggatggaatt gattgcggta 2700 gaaaccgtat tgatttgtgc tgattatgga cgatgcagtg aggaagacgt gatcttgagg 2760 acacgagaac atgggacgat tgcagtagca cagtgactac aagcggcaag cgtacgaggt 2820 atgcaaggaa ttgtttgtag cacgtgattt gaggtaggag gctattgcga tacgtaattt 2880 gagtatggag ttctgcccgc gggggag 2907 // ID Gypsy17-I_CR repbase; DNA; CHL; 4710 BP. XX AC scaffold_567; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy17-CR; KW Gypsy17-I_CR; Gypsy17-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-4710 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 97-97 (2008). XX DR Genome; scaffold_567; Positions 9007 4298. XX CC Positions [3396-3893] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 451..1851 FT /product="Gypsy17-I_CR_1p" FT /translation="MPEPMEADLHGGAAGTELALALPVDPPAPAPDPNLAL FT ILARIAELEAQVAAKTAADAVVAATAGAAGAGPSNAPVLALPAPAAAAPSP FT VSATGAQQNQGIGDIPVPVVLEQAFGAAHMQDVMQRRAVQDAAIATAACAA FT AAQGGTDRSLAEAVAREWPSLKSAQQRALYLSLADWAGCMYSYSNQKLKEM FT ECAELPPQLAHSWLYPIAAGAYEQARQIATDRAAARNQQVTDGTGAGPSSG FT VGTPQRAGAMGEVAEPGAGGTVPSAEYAVPEGFDARDMENVNIPKFSGTVG FT GNIQSAKTWFEVLMRYLALRGRNPIAHFLFYLSGIVLEWGSTFLAEQDAKT FT AAGTPPLTVAGLRNEFLSVFDNPLHSCPQEARRKLSRLKMGSSYITYLTQF FT RILMLQAGHMNEVDKIHYFQEGLTGALVAQPSYRRAYRLSPEELVECKRQM FT SICLPKDLSSPAILRLARRCCL" FT CDS 1770..4535 FT /product="Gypsy17-I_CR_2p" FT /translation="MQTANVDLLAKGFIEPCHSPFGAAVLFVKKKDPDSPT FT GHKLRMVIDYRELNKITVKHRYPLPNIQDIFDKLAGSAVYSTLDMKACYHQ FT FRIDPKEIPPTAFITPFGQYAWRVCPMGLCNSPSHVMAAVNTILSKAGLGE FT YTMAYLDDVIIYSKSAEDHAKHLDAVLTALGQAHLRVKLSKCVFNRPELPF FT LGHIVGRSGLRMDPKKVQVVQDWIAPTDLKQLRRFLGLANYFRRFIQGYAS FT LVAPLTALTGSKTPWQWCDACQDAFEAVKKALISAPVLALPDLNKPYEVIS FT DASLYGTGAVLMQEGRPLAYLSHRFTPAERNYTTTDQEALGVINALREWRC FT YLEGAPNVTVVTDHQPLTHLATLRADGLLSRRQARWIEFLQSQMVTWQYRP FT GRINVADPLSRVYEGAPDMREGPPPNQPQVAAMTRSSAHTDVHKRISSSYQ FT RDAWFRKPSNTAELRKVNGLYYRGDQVVVPNDDELRTMLIAAHHDDECSGH FT AGPERTAEALKRVYSWRGLDADVARYVAQCSHCQRNKASTAKPRGLLQPLP FT IAKRVWGSVTMDLITHLPTTPAGYDSTAVFVDRLSKMTHFAPTHATVDAEG FT MAHLYVQNVFRHHGLPDEVISDRGPQFAGKFWAELQRLLKTSVKLSTAYHP FT QTDGQTERMNRLLEETLRHYINPAQDDWDDHLALVEFAINNAMNKSTRASP FT FSLNSPYAPRTPGTIGVAQSKVPLAASFAAEMQARLQRATMCLERARTRQK FT QAYDSHRQDASFAVGQSVLLSTKNIALKTPGVKKLLPKFIGPFPIEAKIGE FT LAYKLTLPPGYRIHPVFHVSLLKPYHCKGNYQPPPPAFLDDDGNAYWTVHD FT VIEHRDRRVGRKPVRDFLVKWEGFGPEHNSWEPEANLREDELVADIVDKYL FT VRVSLRPPPRAAKKPRLT" XX SQ Sequence 4710 BP; 1009 A; 1446 C; 1337 G; 918 T; 0 other; agtggtatcg agaccagaac tagtcgcatt ttttctttgt tagcgtgaac taacgcaggg 60 gcgcgaagaa ccgcgtcccc attggcggcc ccgctccggc ggcggccccc aactttgtta 120 gtttcactga gaccgtcttt cagaccgtca atacaaaccg gttcgcctca ggcaccgcgt 180 tgagctgagc ccgcgctaag cgccggcctt atccacaatg gctatgattc gcgagggctt 240 cgagcccagc gataagctgt gtggcttgtg ggttaagatc attacccaaa aggctgcaat 300 tctcaatgcc gcctcttcaa ctctcccctg cctcaatttg aacttgcggc ggcagagatt 360 ggtgaaaacc tcccgctctc tcgtgctacg acccgagctc gtttgcgagt actgatcggc 420 atcgattatg ccctttcaca gggccttgag atgccggaac ccatggaggc cgacctgcac 480 ggcggcgctg ctggcaccga gcttgcgctc gcgctgccgg tggacccccc ggcaccggcc 540 cccgacccca acctggcgct catcctcgcg cggatcgccg agctggaggc acaggtggcc 600 gccaagacgg ctgctgacgc cgtcgtggcc gccacggcgg gcgccgcagg cgccggcccc 660 tccaacgcgc cggtgctcgc gctgccggcg ccggccgccg cggcgccctc cccggtgagt 720 gcgacgggcg cacaacagaa ccagggcata ggcgacattc ctgtgccggt agtcctagag 780 caggcctttg gcgccgcaca catgcaagac gtgatgcagc gccgcgccgt tcaggatgcc 840 gctattgcca ctgctgcctg cgccgctgct gcacagggtg gaaccgacag gtcgctggcc 900 gaggccgtcg cgcgcgagtg gcccagcctc aagtcggcgc agcagcgcgc cctctacctc 960 agcctggctg actgggccgg ctgcatgtac tcctactcca accagaagct gaaggagatg 1020 gagtgcgccg agctcccgcc ccagctggcg cactcgtggc tgtacccgat cgcggccggc 1080 gcctacgagc aggcgcgcca gatcgccacc gaccgcgcgg cagcccgcaa ccagcaggta 1140 actgatggga cgggcgcggg gcctagctcg ggtgtgggca ccccgcagcg cgcgggtgcc 1200 atgggcgagg tggctgaacc gggtgccggc ggcacagtgc cgtcggctga gtacgccgtg 1260 ccggagggct ttgacgcccg ggatatggag aacgtgaaca ttcccaaatt cagcggcacg 1320 gtgggcggaa acattcagtc cgctaagacg tggttcgagg tgctgatgcg ctacctagcg 1380 ttgcgcggac gcaaccccat cgcgcacttc ttgttctacc tgtcgggcat cgtgctagag 1440 tggggatcca ccttcctagc ggaacaggac gctaagactg cggcgggcac cccgccgctc 1500 acagtcgcag gcctccgcaa cgagttcctg agcgtgtttg acaacccgct gcacagctgc 1560 ccgcaggaag cgcgccgaaa actctctcgg ctgaagatgg ggtcctcgta catcacctat 1620 ctcactcaat tccgcatcct gatgctgcaa gctggccata tgaacgaggt ggacaaaatc 1680 cattactttc aggagggatt gacgggcgcg ttggttgcac aaccgtctta ccgacgcgct 1740 taccgccttt ctccggagga gttggtcgaa tgcaaacggc aaatgtcgat ttgcttgcca 1800 aaggatttat cgagccctgc cattctccgt ttggcgcggc ggtgctgttt gtgaaaaaga 1860 aggaccccga tagtcctacc ggtcacaaac ttcgcatggt tattgactat cgcgaactga 1920 acaaaattac agtaaagcac cgttatccgc taccgaacat ccaggacatt tttgacaaat 1980 tggcaggctc ggcggtgtat tccacgctcg atatgaaagc gtgctatcac caatttagga 2040 ttgatccaaa ggaaattcca cctacggcgt ttattacccc gttcggacag tacgcgtggc 2100 gtgtgtgccc gatgggactg tgcaacagcc cgtcacacgt catggccgcg gtgaatacaa 2160 ttctctccaa ggctggccta ggagagtaca cgatggcgta cttagacgac gttatcatct 2220 attccaaatc cgcagaagac catgcaaagc accttgacgc cgtgctcacg gcgttagggc 2280 aggcacacct gcgtgtcaaa ctgtccaagt gtgttttcaa ccgtcccgag ctgccctttc 2340 tggggcacat tgtggggcgt tccgggttac gcatggaccc caaaaaggtt caggtggtcc 2400 aagattggat cgcgccaact gaccttaaac agctgcggcg tttccttggc ttagccaact 2460 acttccgccg ctttatacaa ggctacgcgt ccttagtggc gccgctcact gccctcactg 2520 gatctaagac gccttggcaa tggtgtgacg cgtgccaaga cgcattcgag gcggtgaaga 2580 aagccctgat ctccgcacct gtgttagcac tgcctgacct caacaaaccg tatgaggtta 2640 tttcggacgc atccctgtat ggaaccggcg ccgtccttat gcaagaagga cgcccactcg 2700 cgtatctgag ccacaggttt acacctgcag agcgcaacta tactaccacc gaccaggaag 2760 ctttgggtgt cattaacgca ttacgtgagt ggcgatgcta ccttgaagga gcgcccaacg 2820 taacagtcgt tacagaccac caacccctta cgcacctggc cactttgagg gccgacggac 2880 tgctctctcg ccgtcaggca cggtggatcg agttcctaca gagccagatg gttacttggc 2940 aatatcgccc gggcagaatc aatgtggctg acccactcag tcgagtttat gagggcgcac 3000 ctgacatgcg cgaagggccc cctccaaatc agccccaggt agcggcaatg acacgctcgt 3060 ccgctcatac tgacgtgcac aaacgaatta gctcctccta ccagcgggat gcgtggttca 3120 ggaagccgag caacacggca gagctgcgaa aagtcaacgg cctgtattac aggggggacc 3180 aggtggtggt tcccaacgat gacgagttgc gtacaatgct tatagccgca caccatgacg 3240 atgagtgcag cgggcacgct ggcccagagc gtacagcaga ggctctcaaa cgggtgtact 3300 cgtggcgagg tctggacgca gacgtagcac ggtatgtagc ccagtgctcg cattgccaac 3360 gcaacaaagc ctcgaccgcc aagccgcgcg gtttactgca gcccttgcca attgcaaaac 3420 gagtgtgggg cagcgtcacg atggacttga tcacccatct ccccaccacc cctgcgggtt 3480 atgactctac tgctgtgttt gtggaccggc tgagcaagat gacccacttt gctcccactc 3540 acgcaacggt agatgccgag gggatggctc acttatatgt gcaaaacgtt ttccgccatc 3600 atggcctccc agatgaggtc atctcggacc gtggcccgca attcgccggc aagttttggg 3660 ccgagttgca gcgcttgctg aagacttcag ttaaactaag cactgcatat catccgcaaa 3720 cagacggtca aaccgaaagg atgaaccgcc tgttagaaga gacacttcga cattacatca 3780 acccggcgca agacgactgg gacgatcact tagcgctggt cgagtttgca attaacaatg 3840 caatgaacaa gtctactcgg gcatcgccct ttagcttgaa ctcgccatac gcgccgcgca 3900 ctcccggcac catcggggtc gcgcagtcga aggtgcctct ggccgcttcc tttgccgctg 3960 agatgcaggc tcgcttgcag agagctacaa tgtgtcttga acgggcccgg acacgacaaa 4020 agcaagccta cgactctcat cgccaggatg cgagctttgc ggtgggtcaa tccgttttgc 4080 tgagcacaaa gaacatcgct ttgaagacac ctggtgtgaa aaagctatta cccaagttca 4140 ttgggccgtt cccgatcgaa gcgaagatcg gagaactagc gtacaagctg acactgcccc 4200 cggggtatcg aattcacccc gtctttcacg tttcactgtt gaaaccgtac cactgcaagg 4260 ggaattacca accccctccg cccgcattct tagacgatga cggcaacgca tactggacgg 4320 ttcacgacgt gattgaacac agggacaggc gggtaggccg caaacccgtg cgggatttcc 4380 tggtgaagtg ggaaggcttc gggccagaac acaattcctg ggaaccagag gcgaatttac 4440 gggaggacga attggttgcc gacattgtcg acaagtacct tgtccgggta tccctacgcc 4500 cacccccacg cgctgccaag aaaccgcgcc tgacctaagc agtgcacaca cgggcgaggc 4560 acacctctca tagctgtccg cacgcacagt gccgacacgc atcacagccc ataatcacac 4620 ttcgttggtg tcccggggac gggccacttc agaggggggg gagtgtaagg aggattccgt 4680 tacacgagca agtctctctc tcccgagccg 4710 // ID MSAT-4A_CR repbase; DNA; CHL; 1171 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE MSAT-4A_CR is a minisatellite unit - a consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-4A_CR; MSAT-4_CR; KW Interspersed repeat; minisatellite; nonautonomous; tandems. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1171 RA Kapitonov V.V. and Jurka J.; RT "MSAT-4_CR, a family of minisatellites from the green algae RT genome."; RL Repbase Reports 5(3), 60-60 (2005). XX DR [1] (Consensus) XX CC The MSAT-4A_CR is a subfamily of MSAT-4_CR minisatellite. XX SQ Sequence 1171 BP; 191 A; 392 C; 385 G; 192 T; 11 other; gagctcsctg gccgggcacg gggcgcatac gtggcgcgtg ggtgggcatg ccgacatgtt 60 tgccgggtcc gcatttcgtt aggattcggg ctacccccac ttcaaggtag cttgtttgcg 120 gtcatttggg cccacctata tgmtaatagg caaatgaagg aaccgacgcc aggaaagtgg 180 cgacttttgg ccagaccctc agcggctcgc gaccgggccg gccgcgcgag aattgcaaaa 240 acaactcctg aagcatgacc raatgcagct cgcccagtgc ttcgtatggc ttactcgtat 300 atctgtatat ttgatttctc gccgcgttca ccagcaccac cacaaacgtg tgccaggtcg 360 ggccgtacca cggtgccctg gccgtgccct gcgygcactt cctgggcccg ccacgcccct 420 ctgctgcatc ccactcctgc ccccagccct ccaggcgcct ctgggtacat gcatagcagt 480 cgtcagcatc acacctcggc aaagcagctc gcacacggac gccccctagg cggcgcccag 540 cccaggggtg cgtgcaaggc tacacactgc gcccgragct ggcaggggct gccgcggctg 600 ayttcgcakg ccgccatggc gtggatgccc gacatagcgc ctgtgcattg cggtgcaggc 660 ggctctgggc agcctagagc cgccgacacg caccaatgcg ctcggctcac cgcccaccgc 720 cgcccacggc gcmcacgcag cgcatcctca gtcgcggccc gggcgtggga gtggctgaag 780 gggtgctgcc ccgcttcaga tcacacctag agacgggcag gtcacgtggg gatgcgcgag 840 ggggcggggc ggacaaggcg tgtgcggagg catgcaggct tcgtgtgggg ggccttgagg 900 gcgtgtggct gcggcggtgg ggaatgggcc gcttccgtgc cttggagcct gggcgctgta 960 cctgcccggc tcgcagtggt aagccccggt agmtggcatg ctataacggc tctcgggccg 1020 ccaggaatca acacggccgg gccacacggt ggccgggcgt ccggccaccc cacacgccga 1080 agagcccacg gacgggccca gcgcccgcgc ggcsgccgca caaatgccca ggcctgcttg 1140 ttccatacgt gtgtccatkc aggcgtgcta g 1171 // ID Gypsy16-LTR_CR repbase; DNA; CHL; 190 BP. XX AC scaffold_45; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy16-CR; KW Gypsy16-I_CR; Gypsy16-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-190 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 103-103 (2008). XX DR Genome; scaffold_45; Positions 488013 487824. XX SQ Sequence 190 BP; 33 A; 34 C; 75 G; 48 T; 0 other; tgtcgtgacc tggcatgtga ccgtgcatat cgtgtgactg agtgatttgg gtgtggtttt 60 ggcgtgtggg aataggggaa cgtgcccgag gttggttggc tcgacacggt gtattgcatg 120 ggaggccgga gtacgggtga tccgagagag gatataggtc gcgtctgaga gcggctagag 180 ccgtctctca 190 // ID L1-1_CR repbase; DNA; CHL; 7343 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE L1-1_CR is a L1-like non-LTR retrotransposon - a consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1 superfamily; KW L1-1_CR; endonuclease; reverse transcriptase. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-7343 RA Kapitonov V.V. and Jurka J.; RT "L1-1_CR, a family of L1-like non-LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 4(2), 39-39 (2004). XX DR [1] (Consensus) XX CC Several hundred copies of L1-1_CR are present in the C. CC reinhardtii CC genome. They are ~98% identical to the consensus sequence and CC constitute CC nearly 1% of the genome (some elements are less than 1% divergent CC from CC each other). L1-1_CR elements usually are flanked by 2-14-bp CC target CC site duplications. Many elements are inserted as a 15-50 bp long CC 3' CC terminal portion of the consensus, including the polyA tail. CC L1-1_CR1 encodes the 350-aa L1-1_CR1p (pos. 1057-2106) and CC 1534-aa CC L1-1_CR2p (pos. 2109-6710) proteins. CC L1-1_CR1p is not similar to known proteins; L1-1_CR1p is composed CC of CC the L1-like endonuclease and reverse transcriptase domains. XX FH Key Location/Qualifiers FT CDS 1057..2106 FT /product="L1-1_CR1p" FT /translation="MALPRRRPGVGDGCNQPRTHRRRRRSNSPAPKGAVAP FT PSSRSRGQGAWAAGCLPRRTVLARRRTCPLGGRPGDSRLVGAAGRGARVEE FT RGGQCPSRRPRRGRHCRRHNRRVVVRAGAGVSADTAGRAGGGAGSEAEYGA FT VCLTRDTCVAIGSDLVTHQNRCADVQGYEWAQLGRDTLGGRLLAYLRDQGA FT TVPDWAVCRVPAGRTAVLAQLHDEFTGWWRLYSAQPADTPLPSDVTEQLDD FT AAQQVQTAYMDYVLLDARTLRSAKRGPADPGGASGPSRRQRQHRSRSTSSL FT MSLGSAPLRPGGASSGAASMSTSSGGAPPSQGGGRRGKRHSSNSNRNRHGA FT AVAGGGS" FT CDS 2109..6710 FT /product="L1-1_CR2p" FT /translation="MRPPVGAANLRLLTVNVNGLGSPFKARALVSHLQQVG FT ADVAMVQETHATDTTALESCLRAAQGACLPWRHCLAASPAASPHSCGAAIL FT ARSRLSLPGCVLQPPSTDAAGRVVCWDWDVGHLRLRFVCVYAPTAVADKPA FT FFAGLHPHLATDRVLVVGGDWNCVTDASQEAAPSPSRAAGAPQLASLLAQF FT SLVDPWASKRGGAKGYTHPATPKPATPARLDRWYVSATAAPWVVDVARTYG FT APGDHNGVLLTLSLPDLPHAHREQWRFPTYLLFHPSLRLELEQRLEAHVAA FT NPVASTGDGACTQWEADKFFLREAATSIHRRHARQTRDGLHGVVLAADAAA FT ALADRPGASAAQRQAAAMANLAVREERAAAAAASHNARAALMEEHGERGTR FT WFHRQADEPAAGAQEPITHLKVPGQPAPVALTGPGTRNTVSAAAAAMYSST FT SPTGLFRVQPVCTASQQQLLAAIDRKVPADLQAAAEGSGDGALSDAELMAA FT LAGSANGKAPGSDGVPYEVYKVFWALLGPRLCAAAAAAFAAAADAHDGGEM FT AAALPASWREGIITLIYKGKSLDRAELASYRPITLLNCDFKMVSKAVSARL FT QPALDAVVDELQTAFITGRWIGDNALYLQGLIEWMRLDVGADGTPRQGGAL FT YFLDIEKAYDRVHRQWLYASAEGLGFGPRMLRWIRLLTANGSARVCVNGML FT SDAFPVLNGLPQGSTASPPLWVIQMQPLTSFLRRQVEQGALRTPLLPSGEQ FT APPAAHHADDTTLTARDPAVDGPVLMAAVQLFCRASNARVHPDKSKAMGLG FT RFAHLTGPCPHTGVPFTTGAVTHLGVPLSWDSDAAAADLYTRRARGMAFVA FT RLWAALSLTLVGRVHIAKQVLAAKLAYHFSFLNPSPAQLKELTDLVDHFAA FT RSMHAEDASLVSHGNPLLLPKRETACLPYKDGGVNHVDLPAFLSALQAKTF FT ALLAQPGRQPWKMLTRALLTHVRPDSATTWAWVYSDAPAPAGLPARLAAAV FT GHVRSAGVEQHPPQPATQPPAAPPQWRVSLDQLWVANAAGAVSYVHYTGRL FT LEPGPGVLPPAVDGAWQPACVLQHRKPRHLWTFEERAAYDAASPGERAGAW FT PRAPYFLAPEAGVVVHPEHCRIAGVSLADYTVRDVRRAITAANPAAPPAPA FT RPAAMPCPAPAQQAGGSGTQPAAQSRLAEREAEWQRAAAQLTTTAAQHFHN FT NPVALDPWLHRTSAAAGLQNTPARELQSYASPSQQSGEGPRRSARLQEQAA FT GGAGPSTGPATAAAAAAAAVEGDPRMPPPDASLLRGTWRRLWDSHASRGAK FT VLVYRLQHAYLPCGLYRAGKGIRPRVTTGCGGLGAHCPHPACGPPGPRAWA FT SLTHIFLECPAYAQARTWLQQLWACVAPQAAAPPVTDAGFMLGDRMGMWAS FT GPRGAGALLWSTLRATFLYAVWCAYWSREPAKQTSEHVVREVVSELRRVMQ FT LRFTAATLTPETLSALPTQLLTAQLKAAKLEHFVAIWSAGGALCEVEEVQG FT GSPKLNLRLTLASPVQAP" XX SQ Sequence 7343 BP; 1005 A; 2416 C; 2727 G; 1195 T; 0 other; acagcggggc cccgcccccg gtgccagttc tggtggcggg gggaagcaac cgtgctgctg 60 cggctgccct gcgggccgcg ctgcagcagc agccctcttc acagcagcgg gtggcagcgg 120 gagggcttag cggggtccag ccggcggcag gcgtgggggg gccgctggcg gccgggcctc 180 gccctgccgg tgctgggccg gctggcgtgc agcagcagca cagtggcgct cccgcgccac 240 tgccaccctt ggatcagcgg gcagccgccg ggcaggatca ggagcagacc gctgtgatgc 300 ggcggtactt ggccgagctg gcggagctca atcttcagcg gcaacagcgg ttacagcagc 360 agcagcagca ggagctgcag cgccaggggc agtcgcccag gccccggacc ggctcccagg 420 ttgtggtcct ggccacgctg gggggacagg acggggcggg cggtggcgct ggcacggcgg 480 cgtcacggcg tgggcaacgt gttgacatag ttgaccacgt gggggacgtg cagatgcagg 540 aggcggcggg tgtgggtgac gcagcggctg ccgcgggtgg cgccagcgga ccacccgctg 600 tggcctcgcc ctcgctgcca gctctggtgg cggggggaag cacccgggct gctgcggctc 660 ccctgagggc cgagcggcag cagcagccct cttcacagca gcgggtggca gcgggagggc 720 ttagcggggg gcagccgtcg gcaggcgtgg gggggccgcc ggcggccagg cgtcgcactg 780 ccgatgctgg gccggctggc gcgcagcagc agcccagtgg ggctcccgcg ccacagccgc 840 cctttgatca gcgggcagcc gccaggcagg atcagcagct gaaagctgcg atgcagcggt 900 acttggccca caaggcggac ctccatctcc agcagcaacg gcgggtacag caacagcagc 960 agcaggagca gcagtgccag gtgcagccgc ccaggccccg gaccggctcc cggtttgagg 1020 tcctggccac gctgggggga caggacgggg cgggcgatgg cgctgccacg gcggcgcccg 1080 ggcgtgggcg acgggtgcaa ccagccgcgc acgcaccgcc gccgccgccg cagcaacagc 1140 cccgctccaa agggcgcggt ggccccgccc agctcccgca gcagagggca gggcgcttgg 1200 gcggcgggct gcctcccccg caggacggtg ctagctcgcc gccgcacgtg cccactgggc 1260 ggccggccgg gggacagcag gctggtgggg gcagcgggtc ggggcgccag ggtggaggag 1320 cggggcgggc agtgccccag ccgccgcccc aggcggggcc gccactgcag acgtcacaac 1380 cggcgggtgg tggtaagggc aggggcaggg gtttcggccg acacggctgg gcgcgctggt 1440 gggggtgcgg gcagtgaggc ggagtacggg gccgtttgcc tgacccggga cacgtgtgtt 1500 gccatcggct ctgacctcgt gacgcaccag aaccgctgcg cagatgtgca ggggtatgag 1560 tgggcgcagc tgggccgcga cacgcttggt ggccgcctcc tcgcctacct ccgtgaccag 1620 ggcgccaccg taccggactg ggccgtctgc cgcgtgccgg ccggccgtac cgccgtcctg 1680 gcacagctgc acgacgagtt cactggctgg tggcgcctat actcagccca gcctgcagac 1740 acgcccctgc cttccgacgt gacggagcag ctggatgacg ccgcgcagca ggtgcagacg 1800 gcctacatgg actacgtgct cctagacgcc cgcaccctgc ggtccgctaa gcgtgggccg 1860 gcggaccccg gcggggccag cgggcccagc cgccggcaac gccagcaccg cagccgcagt 1920 acctccagcc tcatgagcct gggcagcgct ccattgcgcc ccggcggcgc cagctcaggc 1980 gccgctagca tgagcaccag cagcggcggc gcgcccccca gccagggtgg cggccgccgc 2040 ggcaaacgcc acagcagcaa cagcaaccgc aaccgccacg gcgctgccgt ggccggcggg 2100 ggctcataat gcggccacca gtgggcgcgg cgaaccttcg gctgctgaca gtgaatgtca 2160 acggcctggg ctcgccgttc aaggcgcggg cgctggtctc gcacctgcag caagttgggg 2220 cggatgtggc gatggtgcag gagacccacg ctactgacac gacggcgctg gagtcttgcc 2280 ttcgtgccgc gcagggggcg tgccttccgt ggcgccactg cctcgctgcc agcccggcag 2340 cgtcgcccca ctcctgtggg gcggctatcc tggcgcggag tcggctgtcc cttccaggct 2400 gcgtactgca gccgccgtca acggatgcgg cgggccgtgt ggtatgttgg gattgggacg 2460 tgggtcacct gcgcctgcgc ttcgtgtgtg tgtatgcgcc cacggccgtg gcggacaagc 2520 ctgctttctt tgccgggctg catccccacc tggccacgga cagggtgctt gttgtcggtg 2580 gggactggaa ctgtgtcacc gatgccagtc aggaggcggc ccctagcccg tcacgtgctg 2640 caggtgcccc gcaacttgcc agcctcctcg cccagttcag tctggtggac ccttgggcga 2700 gcaagcgtgg cggcgccaag ggctacacgc atccggccac gcccaagccg gccactcccg 2760 cacgcctgga tcggtggtat gtcagtgcca ccgcggcgcc gtgggtggtg gatgtcgctc 2820 gcacgtatgg ggcgcctggg gaccacaacg gtgtgctgct caccctgtcc ttgcccgacc 2880 tgccacatgc gcaccgggag cagtggcgct tccccacata cctgctgttt cacccctcgc 2940 tgcgtttgga gctcgagcag cgcttggagg cgcacgttgc cgcaaatccc gtggccagta 3000 caggtgacgg cgcttgcacg caatgggagg cggacaagtt cttcttgcgg gaggccgcca 3060 ccagcatcca ccgtcggcat gcacgccaga cccgggatgg gctgcatggc gtggtgctgg 3120 ccgcagacgc ggccgctgcc ctggccgacc ggccgggtgc cagcgccgcg cagcgtcagg 3180 ctgcggccat ggccaacttg gcggtgcggg aggagcgggc agctgccgca gcggccagcc 3240 acaatgcccg cgctgcactg atggaggagc atggggagcg gggcactcgc tggttccatc 3300 ggcaggctga cgagccagca gccggcgcgc aggagcccat cacgcacttg aaggtgccgg 3360 ggcaaccggc gcctgtggcg ctcacggggc cgggcacgcg caacactgtc tccgcagccg 3420 ccgcagccat gtacagcagc accagcccca ccggcctgtt ccgcgtgcag ccggtctgta 3480 cggcgtcgca gcaacagctt ctggcggcca ttgaccgcaa ggttccggcg gatctgcagg 3540 ccgccgcaga ggggtccggt gacggcgccc tcagtgatgc cgagctgatg gcagcgctgg 3600 ctggctccgc caatggtaaa gcacctgggt cggacggggt cccgtacgag gtgtacaagg 3660 ttttctgggc gctgctgggt ccgcgcttgt gtgctgctgc tgccgctgcc tttgctgccg 3720 ctgcagacgc ccacgatggc ggcgaaatgg cggcggcgct gcccgcctcc tggcgggagg 3780 gcatcatcac gctcatctac aagggcaaaa gcctggaccg cgccgagctg gcgtcctacc 3840 ggcccatcac gctgctcaac tgcgacttca agatggtgtc caaggccgtc agcgcccgcc 3900 tgcagcccgc cctggatgca gttgtggatg agctgcagac cgcgttcatc accggccgct 3960 ggattggaga caacgcgctg tacctccaag gcctgatcga atggatgcgc ctggacgtgg 4020 gcgcggacgg cacgccacgg cagggtggtg cgctgtactt cttggacatt gaaaaggcgt 4080 atgaccgggt gcaccggcag tggctgtatg cgtccgcgga gggacttggg tttgggccgc 4140 gcatgctgcg ctggatccgc ctgctcactg ccaacggctc tgcccgcgtg tgtgtgaacg 4200 ggatgctctc tgacgccttc ccagtgctga acggcttgcc gcagggcagc accgcctcac 4260 cacccctgtg ggtcatccag atgcagccac tgacgtcctt tctgcggcgg caggtggagc 4320 aaggggcact gcgcacgccc ctgttaccca gcggcgagca ggcgccacct gctgcccacc 4380 acgctgacga cacgaccctc acggcgcgcg acccggcggt ggacgggccg gtcctgatgg 4440 cggcagtaca gctgttctgc cgcgcgtcca acgcccgtgt ccatccggac aagagcaagg 4500 ccatgggcct tggcaggttt gcgcacctga cgggcccttg cccacacacg ggggtgccgt 4560 ttaccactgg cgccgtgacg cacctgggtg tgcccctgtc gtgggactct gatgcggctg 4620 cagctgactt gtacacccgg cgggctcgcg gcatggcgtt tgtggcgcgt ctgtgggctg 4680 ccctgtctct gactctcgtt ggccgtgtgc acattgcgaa gcaggtgctg gcggcgaagc 4740 tggcctacca cttcagcttc ctcaacccgt cgcctgcgca gctgaaggaa ctcaccgacc 4800 tggtggacca ctttgctgcg cgctccatgc acgctgagga cgccagcctg gtgtcgcacg 4860 ggaacccgct cctgctgcca aagcgggaaa cggcctgtct gccgtacaag gatgggggtg 4920 tcaaccacgt cgacctgcct gcgttcctgt ctgccctgca agctaagact ttcgccctcc 4980 ttgcccagcc aggccggcaa ccctggaaga tgctcacccg ggcgctgctt acccatgtgc 5040 gcccggactc cgccaccacg tgggcgtggg tgtacagcga cgcgccggcg ccagcggggc 5100 tgcctgcccg gctggcggcc gcggtcggcc acgtgcggag cgcgggcgtg gaacagcatc 5160 cgccgcagcc agccactcag ccgccagcgg cgccgccaca gtggcgggtt agcctggacc 5220 agctgtgggt ggctaacgct gcgggggctg tgtcctacgt ccactacacg gggcggctct 5280 tggagcctgg gcctggcgtg ctgcccccgg cggtggatgg ggcgtggcag cctgcctgtg 5340 tgctgcagca tcgcaagccg cggcacctgt ggacctttga agagcgggcg gcgtacgatg 5400 cagcatcacc aggggaacgg gcgggggcgt ggcctcgggc gccgtacttc ctggcgccgg 5460 aggctggggt ggtggtgcac ccggagcact gccggattgc gggtgtcagc ttggcggact 5520 acacggtgcg ggacgtgcgg cgggccatca ccgcggccaa cccggccgca cctccggctc 5580 cggcccgccc cgcagccatg ccgtgcccgg cgccagcaca gcaggcgggg ggttcgggga 5640 cccagccggc ggcacagtcg cggctggcgg agcgggaggc ggagtggcag cgtgcagcgg 5700 cgcagctgac caccacggcg gcgcagcatt tccacaataa cccggtggct ctggacccct 5760 ggctccaccg cacctctgcg gcagccgggc tgcagaacac gccggcgaga gaactgcagt 5820 cgtatgcgtc cccgtcccag cagtcgggtg aggggccgcg gcggtccgcg cggctgcagg 5880 agcaggcggc gggcggggcg gggcctagca cggggcctgc cacggcggcg gcggcagcag 5940 cggcggcggt ggagggcgac cctcgcatgc cgcccccgga tgcgtccctt ctgcggggta 6000 cgtggcggag gctgtgggac agccacgcca gtcggggggc aaaggtgctg gtgtaccggc 6060 tgcaacatgc ttacctgcct tgcgggctgt acagggcggg caagggcatt cggccacggg 6120 tgaccacggg gtgcgggggg ttgggggcgc actgtcctca ccccgcctgc gggccacctg 6180 gcccgcgggc gtgggccagc ctgacgcaca tcttcctgga gtgtccagct tatgcgcagg 6240 cgaggacgtg gctgcagcag ctgtgggcct gcgttgcgcc ccaggcagcg gcgccgccag 6300 tgacggacgc gggcttcatg ctgggggacc gcatgggtat gtgggcctca ggcccgcggg 6360 gggcgggcgc gctgctgtgg agcactttgc gggccacctt cttgtacgcc gtctggtgtg 6420 cgtactggtc ccgcgagcct gctaagcaga cgtcggagca tgtggttcgg gaggtggtca 6480 gcgagctgcg cagggtgatg cagctgcgtt tcactgccgc cacgctaacc cctgaaaccc 6540 tgtcggctct gcccactcag cttctcaccg cacagctcaa ggcggctaag ctggagcact 6600 ttgttgccat ctggtcggcg ggtggcgcgc tttgtgaagt ggaggaggtt cagggtgggt 6660 caccgaagtt gaacttgcgg ctgacgcttg catcacctgt gcaggccccc taggtttcct 6720 tttgggcggc gtttctttcc aggcgctggc gtggcgtatt atcgtcatca gcttatctgg 6780 cgcccatgtg ttttagctct ggctggcgtc gcagcgcctg ggcggctgtg ggacttcctg 6840 cacagcgcgt cttgcagcgg gctggagccc gcttagtcag cgccacatct ggcccgttta 6900 gttttctgct tgtgtctcct ctgccggttc tcctggggtg ttgcttaagc aaccgacttg 6960 tgggggtggg gtgggtttgg gcggggcgag tgcgtagcgc tggtgttttc ttttctggcg 7020 cgcggtggtg gtggttcttg cggttaggct gtggtgtagg ttggtgcttg ggttaggtct 7080 ggcggtgttt ttgtgcaacc cctcccgggg ggcggggggg ccagcctcct gctctgtctg 7140 ccgggtcggg gtggggtggg gtgggatgga tgggggtggt tggtggcggt tttcgaggtg 7200 tttgctttct gttttcttct cttttcttct cttcttgccc tgtcagggtt ctggtctcgg 7260 ggaccaggct gctttcaggc agcagggggg cagcgcgtct gcccctgttc aaccttgtaa 7320 ggatgattcc tcaaaaaaaa aaa 7343 // ID Novosib-1_CR repbase; DNA; CHL; 4776 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 01-JUN-2006 (Rel. 11.05, Last updated, Version 1) XX DE Novosib-1_CR is an autonomous DNA transposon - a consensus. XX KW Novosib; DNA transposon; Transposable Element; KW Interspersed repeat; 8-bp TSD; transposase; Novosib superfamily; KW Novosib-N1; Novosib-N1a; Novosib-1; Novosib-1_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-4776 RA Kapitonov V.V. and Jurka J.; RT "Novosib-1_CR, a family of autonomous Novosib transposons from RT the green algae genome."; RL Repbase Reports 6(5), 262-262 (2006). XX DR [1] (Consensus) XX CC Novosib-1_CR is a young family of autonomous transposons. The CC consensus sequence was derived from several copies that are~99% CC identical to each other. These transposons are characterized by CC 8-bp target site duplications and 15-bp terminal inverted CC repeats. The genome contains also nonautonomous elements derived CC from Novosib-1_CR (see Novosib-N1 and Novosib-N1a). The main CC hallmark of the Novosib superfamily is a Novosib transposase that CC is not similar to any known proteins. So far C. reinhardtii is CC the only species whose genome is populate by Novosib transposons. CC We identified four families of autonomous Novosibs (Novosib-1_CR, CC Novosib-2_CR, Novosib-3_CR, Novosib-4_CR). Elements from each CC family are over 95% identical to each other and less then 65% CC identical to elements from other families. Despite the high CC divergence, elements from all four families encode conserved CC Novosib transposase. XX FH Key Location/Qualifiers FT CDS join(319..540,646..786,943..1245,1373..2095, FT 2146..4062) FT /product="Novosib-1_CR-1p" FT /translation="IVKEQTGADAATVRLHNKCEKELQVLLVELQSAKENA FT RTRAQAVALAAVPDPRALAMPGAAEKAARLVLEAAAAPAASKRPAPRAVLG FT DITNGPVAASTPPACLPLPLGAQHEEEQLDTSGADGTSPPPPTPAATTAPA FT TMPTTVPVDVGAEVDVVAVAGVAAAEPASPPVPPAAATVTAADTLSQPTSQ FT GRRLYGVSLWRGRFKARVTLGRQEYNLLTTKDSGPDPSNRLQLLRQVETTA FT AEERASRQVGTGGKAAALKALWGCYAAAFTPLPVDPFILQAAAPMMPLLRG FT VRLRTVRNSSKPLLCAAPLPGPGPQLGPGKVQKKKRPASKRKPRNAKALEA FT APAPALVVGATARPRRNVRQPEWLREAATAREDREEDESEPHSEQEDASEG FT EEVQGVLPTVAAADVALDEGQRCQLCYSAASDDSQLHSTMLLCGDEQCTRG FT CNAGYHIGCLAACLPYDQRQGWHGRRTGAGAAPGGGPAAAEAGGGAGTGAE FT PEPGAAPGGGPAAAEAGGGAAEGAGGGAGAGDAAAAVEAIAATARRKWFGR FT VLLCRDLRGGAAYAASIQGIPHLEQQRQRFLKLAEAFPETRVVDPLEELIR FT EVRADPAAAERPPSMTDPVLHERISCFMQKSGGGGGSDMDMLDAEAAALVA FT VVPVTEGQRVVGHLLQCMRGTSVPQLATAAQLERGYWAFRLTHGQWPSRLR FT PAQALATMNSAVAQEQHHAKAAALKRTGVDFSLAHSHFRSGKAANLIDEPH FT KLKTAAKSQAQGSIFPPGASLEVMQDIAALRPCGFTGSARPGPMRGGNPGP FT TEPPRTGELGMDSCVVQAVLQQLVQDWQETPVMQRRQKFMDEDPKVLLAIA FT QGRTDTQSVACAQALLGSRLLATALRDRGFTREALWMETLIGAFRAWDEAQ FT LDPLERAMRNELRYFLISRCLGNSWSVVGGSCSHVCGLSLANAASFLNNTD FT ARLHVLENMRPQDREHLNERAISTDANEGHFACITGPGGQKPSVMVLQGRE FT RRIAHHVAMRQRPEQARGWSYRRARTRGGYAMRPTDHLHSSGSGRRLGKWN FT NGAKLADPGLSAKREGAIAKRAAKKAKGRQCSIRAVQFNKAKMMA" XX SQ Sequence 4776 BP; 904 A; 1440 C; 1663 G; 769 T; 0 other; caacccgggt tcactagggg gcgtgcccat acacaagcct ccgtccaaac tgcaatactt 60 ccacatacac taatttaggt catacaaaca ctatatcgac tctcagaact gctctgggtc 120 gtggtttgac cgatacgagt gcacatggat tgcgaagaat gcttatgtgc gtatcttaac 180 ctcaatggaa agctcgtacc taactattac tgtgctcgat gtgcgaaaac agtgaatcct 240 tcgcgttcag gcaaccgggc ttgcaaatgt caaagattta gctgcgttcc cgacgcgttt 300 gacgcacgtg ttctagcgat agtaaaagag cagactggag ctgacgctgc aacagttcgg 360 ctccataaca aatgcgagaa agagctccag gtgctgctgg tggagctgca gagcgccaag 420 gagaacgcgc ggacccgggc ccaagcggtt gccctcgcag ctgtgcccga tccgcgggcg 480 ttggcaatgc cgggagctgc agagaaggct gcgcggctgg tactggaggc ggcggcggca 540 ggtgcggtgt aaccatgtgt gaaggtgggg ccgtgtttgt gacgcctcat gcccatgtct 600 cacccaaggt ctcaccccac agcgccccgt cccaacccca cagagcctgc agcgagcaag 660 cgcccggctc ctcgtgctgt gctcggtgat atcaccaacg gccctgtcgc tgccagcacg 720 ccgcctgcct gcctgccgct gccacttggg gcccagcatg aagaggagca gcttgacacg 780 agtggcggtg agtttatgca tgagtgcccc tgtgggtcag gggctcgacc cgagaacaag 840 cgttggaagg ctacagtgaa cgatcagcgg tacttctgtg gttgatgcat tcacgcaggc 900 cctggcacgg agtctgctgg cgcactgcct gccgcaccag ctgctgacgg cacctcgccg 960 ccgccgccta cacccgcggc aacgactgcg cccgcaacca tgcccacgac cgtgcctgta 1020 gacgtaggcg cggaggtgga tgtggttgcg gtggcgggag tggcggcagc tgaaccagcg 1080 tcgcctccag ttccgccagc agcagccaca gtgactgctg ccgacaccct gtcgcagcct 1140 accagccaag ggcgccggct gtacggcgtg agcttatgga ggggtcgctt caaggcgcgc 1200 gtcacgctgg ggcgccagga gtacaacctg ctgaccacca aggaccagtg ggtggcggcg 1260 gcggctgtga acagcttccg cgcgcggctg ctagagctgg accccgctgg ttgcggcatc 1320 agcagccgga ccacggacca cggacggcga gcacttctcc cttaccaaca gctccgggcc 1380 tgaccccagc aaccggctgc agctgcttcg gcaggtggag actacagcag cagaggagcg 1440 ggcgagcagg caggtgggaa caggcggcaa ggcggcggct ctgaaggcgc tctggggctg 1500 ctacgcagcc gccttcacgc ccctgccggt ggaccccttc atccttcagg cggcggcacc 1560 aatgatgccg ctgctgcgtg gcgttcgtct gcgcacggtc cgcaacagca gcaagccgtt 1620 gctgtgcgcg gcgccactgc cagggccggg gccgcagttg gggccgggga aggtgcagaa 1680 gaagaagcga ccggccagca agcgcaaacc gcgaaatgcc aaggcgctgg aagcagcacc 1740 ggcgcctgct cttgtggtgg gcgccacagc gcgtcccagg cgcaacgtgc gacagccgga 1800 atggctccgg gaggcagcga ctgctcggga agaccgagag gaggatgagt cggagccgca 1860 ctcggagcag gaggatgcat cggagggcga agaggtgcag ggcgtgctgc caaccgtcgc 1920 ggcagctgac gtggcgctgg acgagggaca gcgctgtcag ctctgctaca gtgcagcgag 1980 tgatgacagc cagctccaca gcaccatgct gctatgtggg gacgagcagt gtacgcgtgg 2040 ctgcaacgcg ggctaccaca ttggttgcct tgcagcctgc ctgccgtacg accagcgtca 2100 gtgcctcacg tccgtcccgg agggtgactg gctgtgcccc cagtgcgtca gggctggcac 2160 gggcgccgaa ccggagctgg ggcggcgccg ggcggcggcc ctgctgccgc tgaggccggc 2220 ggcggggctg gcacgggcgc cgaaccggag cctggggcgg cgccgggcgg cggccctgct 2280 gccgctgagg ccggcggcgg ggcggcggaa ggtgccggcg gcggggctgg cgcgggggat 2340 gcagcagcgg cagtcgaggc catcgctgcc actgcccgcc gcaagtggtt tggccgtgta 2400 ctgctgtgcc gggacctgcg tgggggtgcc gcttatgctg caagcatcca gggcatcccg 2460 cacctggagc agcagcgcca gcgatttctc aaacttgcgg aggcgtttcc agaaacgcgt 2520 gttgtggacc cgctggagga gctgatcaga gaggtgcggg ccgacccggc agccgcggag 2580 cggccgccat ctatgacgga cccggtgctg catgagcgca tctcgtgctt catgcaaaag 2640 agcggcggcg gtggcggcag cgacatggac atgctggacg cggaggcagc tgcccttgtc 2700 gctgtcgtgc cagtgacgga gggccagcgc gtggtcgggc acctgctgca gtgcatgcgc 2760 ggcacatctg tgccgcagct ggcaaccgct gcacagctgg agcgtggcta ctgggccttc 2820 cggctcacac acggccagtg gcctagccgg ctgcggcctg cacaggcgct tgccaccatg 2880 aacagcgccg tcgcacagga gcagcaccac gcaaaggcag ccgcactcaa gcgcacgggc 2940 gtcgatttct ctctggcaca ctcgcacttc cggtcgggta aggcagccaa tctgattgac 3000 gagccccata agctcaagac ggctgctaag tcgcaagccc agggcagcat cttcccgccg 3060 ggtgcgtcac tggaggtcat gcaggacatt gcggccctgc ggccctgcgg cttcaccggc 3120 agcgcgcggc cagggcccat gcgaggcggt aatcctgggc caacagaacc accgcgcaca 3180 ggcgagctcg gcatggacag ctgcgtggtg caggcggtgc tgcagcagct ggtgcaggac 3240 tggcaggaga cgccggtgat gcagcggagg cagaagttca tggacgagga ccccaaggtg 3300 ctgctggcca tagcccaggg ccgcacggac acgcagtccg ttgcgtgcgc acaggcgctc 3360 ctaggcagcc ggctgctggc gacagcgctg cgggacaggg gcttcacgcg ggaggcccta 3420 tggatggaga cgcttatcgg cgccttccgt gcctgggatg aggcgcagct ggacccactg 3480 gagcgcgcca tgcgcaacga gctgcgctac ttcctgatca gccgctgcct gggcaactcg 3540 tggtcggtcg tcggcggcag ctgcagccac gtgtgcggcc tgagcctggc caacgccgcc 3600 tcgttcctga acaacacgga cgcgcgactg catgtgctgg aaaacatgcg gccgcaggat 3660 cgtgagcacc tcaacgagcg cgccatcagc acggatgcta acgagggcca cttcgcctgc 3720 atcaccgggc caggtggtca gaagccgtca gtcatggtgc tgcagggccg cgagcgccgc 3780 atagcgcacc acgtggccat gcggcagcgg cctgagcagg cacggggctg gagctacagg 3840 cgcgctcgca cgcgcggtgg ctacgccatg cgccccactg accacctaca cagcagcggc 3900 agtggccggc ggctgggcaa gtggaacaac ggcgcgaagc tggcggaccc cggtttgagc 3960 gccaagcgtg agggcgccat tgcgaagcgg gcggcgaaga aggccaaggg taggcagtgc 4020 agcatccgtg ctgtgcagtt caacaaggcc aagatgatgg cgtagagccg gattagagga 4080 tggtcgggac catcgggaag atgggcgcgc gggagtagga cgcgcggaga ggcagaggag 4140 aggggagtcg gcggggagga gggatcttga tggattgttg gactcgcgat gtgctcgcgt 4200 tgatatatgt ttgaattggt caatatattg tgcgagattg gcacaaagtc gggaaatgct 4260 tcaaaatgcg tcaaggatgc gacatgtcgg tccgcgaagc ttaacatagt gcatcgggag 4320 atacttcggt gcaatattac atatgtacgt agcatattga gacatgtcac ggtctctcct 4380 gtgcactcgg cgtggatggc tttggcacga acccaccggg ctcgctggct gcatcaccga 4440 aacccgtcat ttatgcctca ggcgagccca gatttgcctt ccatacaatc agcaagtcta 4500 tactatgtat ccaaagggtc tgagagcgtg ggtggaagca gggggggccg ttcaggcgtc 4560 gtggaacacc ccctgcccac ccctatgacc ccccgcacgg gaagggccgt acggcagtgg 4620 tagtactgtc ggacggtgaa tctgacacga caggcgtgtg cgtgtggggg taggcaaggt 4680 tggtttccac gtcggactgt actaccacga ctaaacacag gtccaaaggt tgtgcagttt 4740 tgggacggtg ccccaaactg aagtgaaccc gggttg 4776 // ID DNA-2-7_CR repbase; DNA; CHL; 710 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE DNA-2-7_CR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; 7-bp TSD; DNA-2-7_CR; KW unclassified superfamily. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-710 RA Kapitonov V.V. and Jurka J.; RT "DNA-2-7_CR, a family of nonautonomous DNA transposons from the RT green algae genome."; RL Repbase Reports 4(2), 32-32 (2004). XX DR [1] (Consensus) XX CC 100 copies of DNA-2-7_CR are present in the C. reinhardtii CC genome. They are ~98% identical to the consensus sequence. CC DNA-2-7_CR is characterized by 14-bp TIRs and 7-bp TSDs. XX SQ Sequence 710 BP; 161 A; 226 C; 163 G; 151 T; 9 other; cagggggggg gtaggaacaa acttttttga ccccatgcca caaaacctcc cggacccccc 60 gaccgaccga ccgaccgacc ccccctgccg tacgaccccc caaaaccccc tcagaacccc 120 ttagtttttt gtcaggttgg tgccaatcga ctgctctcgg tgagctctat cgctatgtaa 180 matgatccgc cttgggaaca cactttgacc gaccgacatg tggacatgtg gacatgtgca 240 cccgtatgcy gacaagtatc ctgatagact ggcctgctat gcaacggctt aytccaatgt 300 tacttgatra tacgaaatgt gctgacacgt tcaccatgca aaacccggcc ggcaccggcc 360 tgtccctgac cagttcccca ggcatctccc gavatgccga catgccgctc cccaaactgc 420 cccaaactat gcacgtgtgc tgagtctggg gtatacctay gcacgcatcg ttgagctgca 480 cccattgcgc cgcgygctgc aggacagaag cgctaggaac aaaacttggc acgaacctgt 540 gcaaaacctc ktgggacccc agaccgaccg accgaccggc mactttgagg ggtcataact 600 ttgcacccag aaacccattt tcaattttga aagcggattc gtgattttct cgttgagctc 660 tatccatttg aggcatgggg tctgtttcgg aagattctac cccccccctg 710 // ID L1-6_CR repbase; DNA; CHL; 5286 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 31-MAY-2006 (Rel. 11.06, Last updated, Version 2) XX DE L1-6_CR is a 3' terminal portion of a L1-like non-LTR DE retrotransposon - a consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; putative L1 superfamily; L1-6_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 3061-5286 RA Kapitonov V.V. and Jurka J.; RT "L1-6_CR, a family of L1-like non-LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 5(3), 51-51 (2005). XX RN [2] RP 1-5286 RA Kapitonov V.V. and Jurka J.; RT "L1-6_CR, a family of L1-like non-LTR retrotransposons from the RT green algae genome."; RL Direct Submission to Repbase Update (31-MAY-2006). XX DR [2] (Consensus) XX CC The L1-6_CR elements are flanked by 5-10-bp target site CC duplications. There are several hundred copies of L1-6_CR present CC in the genome. They are massively truncated at 5'. Many elements CC are just 20-60 bp long 3' terminal portions. The consensus is not CC complete at its 5' end. Some copies are less than 0.5% divergent CC from the consensus sequence. XX FH Key Location/Qualifiers FT CDS 460..4911 FT /product="L1-6_CR_1p" FT /translation="QWGWWRRRWRRHAAGGGGRPCRCAPRPRPHRGDLGCP FT AGGSAVVRRPRQQAAGPRPLRVGSHNVRGLRSRVQHLGCPRLHALLRCWER FT QQLDVVCLQETHVRAEDIDLVTADLRAAALRLHVREWTAWWAPAPTPHQRV FT SGGTAILIRTCLLEQGAVTFPGGVAAPSVADGAWVGRGVSLPVQWGGHDFT FT LAAAYYPLASPEQRAFTQQWVASAVGGEGNHLLAADFNFVSDVALDTASGV FT PRPGDPAAATALAAACPGLIDVLRVRRPGQRVCTYIHSHGASRLDRILCSG FT SLVAQVLACAVEAGVPSDHRLVTLTLAATPEALPPSPGLPKARLGFLDFKD FT LTKDYRTWLTAALGARPGAPAELLAWWPQLKRAAAASAARLSRVAAVRRIA FT ASEEEARALAAAAAAAAAVEVGAALPEAAAAAVRARCAAATAAVQAAAGAA FT RRTRHEWLRGGEKPCPVLTRMLRPSGGPRTIARLKRPDGSATSDPTVMGQV FT MATYWQDVCAAPPPAPAARAQVLAALQQHGRRCTTEEAAQLGQLAVTAAEV FT RAALAAAPAGRAAGPDGIPVELYQYYESDFSPVLADVFTAVGEGAGVPGGF FT LDGVWTFFYKAGDPLEPSNYRPITLTDTDYRTLARVLCRRLQPVFGRVIDP FT EQTAFLSDRRIADNVLLLQLTPGLLKAAGQNSGVAAFLDFYKAYDTVDRSF FT LLACLERMGVGAGFLTWVSRLLTDTRGAALVNGRVSGWVPLAAGVRQGCPL FT APLLYLAIAQALLSWLRSKGHGVLAGLASALASQYADDCTPFLAGMQAVPG FT FLADMDVFRCASGQRLNMSKVELLVLGTSAGLGVAPAAVAGPPLPPGWRVV FT NTAKSLGVHFGDWKTATPPVAFDGVVGVLGLIVRMPVSMFGRAAAASAYAL FT SKVLYHLEFAGFPSPWAPVGKLLGQVAAVVDRRLSPAKYAVAPHARPVGVS FT LDIMQVPPALGGVGLLPLQLHVQARQAGLAVRCVLGACGLLAFVPPWTKVA FT EAWLRARHPAATPLRLLVRCDPSRVLDKLVTPECKPFLRLLGAVAHLPPIT FT LVAPPAAGDWVYHVPLWANPALCDGAQTFDVAFADLFALPGLVYVGQLVAA FT HDQLEAVRAAHTRPGGWGTRFGEAMLDLYLTSVWRPVLGWGPRRHLPSPLL FT GPASPLAAAERIAAAVALLPGGWEAAARAARLARGRAWVMPLPADAVRQAA FT AGALTQVVRGLGWVLDGGPPLLLLNYSVKAGTAMQLGPQVAALRSKHLLYV FT RDAGVSADGALAAAGAFAGTLARLWALKWENYHKEALWRVAANACWSFPKN FT AAQRAAGVAMPLCRACGRDMCDGDRRHYFWDCVVARALRENMAMAMLGSPP FT DTADDCFSRADLWLVQPPAGLAPPVWDVVCLAALSALDSGRQQVIMTGLRV FT RRALTSAEVMRISVSVIADFWGRLQSYVSLGLRPKAWATVPAHHPFLSRGA FT GDCVLLVLPDDVGSPPPSP" XX SQ Sequence 5286 BP; 607 A; 1614 C; 1999 G; 1066 T; 0 other; ggcggcggcg gaggcggcgg cggcggcggc tgccctggcg gcggcgggtg cggcggcgcc 60 tggtggtgcg gcacccggtg ccgcggaggc ccccgctgcc ccggccgccg ggccggcgat 120 ggcacccggt ggcgccgccg cggccaccgc cccggcagtg gcgaccgacg cggcgcctgc 180 ggccgctgca gccgcggtga cgctgccgcc cgcgtccgcc cctctggctc tggcgctggc 240 gcccggcgct ggtgccggtg ccgccgcatc tgcggcggcc gccacggatg gcagcggcag 300 cggaactggc agcggcagcg gcagcggcca cggcggcggc ggcggagtgg cggcggcggc 360 gacggcagct gcggcggccg cggccatgac aatggggatg gtgggcggcg gcggcagtgg 420 cggcggcggc gacggcagct gcggcggccg cggccatgac aatggggatg gtggcggcgc 480 aggtggcggc ggcacgccgc cggtggtggc ggccggccct gccggtgcgc cccgcgcccc 540 cgcccccacc gtggtgatct cggctgcccc gccgggggct ccgcagtagt gcgccgtcct 600 cggcagcaag cggccggccc ccggccgctg cgggtagggt cgcacaacgt gcgggggttg 660 cggagccggg tccagcatct gggctgcccg cgcctgcacg ccctactccg ctgttgggag 720 cggcagcaac tggacgtggt ctgcctgcag gagacccacg tccgggcaga agacattgac 780 ctcgtgaccg ctgacctgcg tgctgctgca ctgcggctgc acgtgcgtga gtggacagcc 840 tggtgggccc ccgcccccac gccccatcag cgtgtctccg gcggcactgc catccttatt 900 cgcacttgtc tgctggagca aggtgccgtc acgttcccgg gtggcgtggc ggccccctcc 960 gttgctgacg gcgcgtgggt gggccgcggc gtcagtctcc ccgtgcagtg gggcgggcat 1020 gacttcacgc tggcggcagc ctactatccg ttggcaagtc ctgagcaacg tgccttcacc 1080 cagcaatggg tggcgtcggc ggttggcggt gagggcaatc acctgctagc tgctgacttc 1140 aactttgtta gcgacgtggc gctggacacg gccagcggtg tcccgcggcc gggtgacccc 1200 gccgcagcga cagcgctggc ggctgcgtgc cctggtctga tcgacgtcct ccgagtgcgc 1260 cggcctggcc agcgtgtttg tacttacatt cattcgcacg gcgcgtcgcg ccttgaccgc 1320 attctctgca gtggcagcct cgtggcccag gtcctggcct gcgccgtgga ggcgggtgtc 1380 ccgtccgacc accgcctggt gacgctgacg ctggcagcga cgccggaggc cctgccacct 1440 tctccaggcc tgcccaaagc tcgcctgggg ttcctcgact tcaaggacct gacaaaggat 1500 taccggacat ggctgactgc ggcgcttgga gcccggccgg gtgccccagc tgagctgctg 1560 gcgtggtggc cgcagctcaa acgggcagcg gccgcgtcag ctgcccggct gagccgcgtg 1620 gcggccgtcc gccgcattgc agccagtgag gaagaagcgc gtgcgctcgc ggctgctgct 1680 gccgccgcgg ccgcagtgga ggtcggtgcg gcgctgccgg aggccgcagc ggcggcggtg 1740 cgcgcgcggt gtgccgccgc caccgccgcg gtgcaggctg ccgcgggggc ggcgcgccgc 1800 acgcgtcacg agtggttacg gggtggggaa aagccatgcc cggtgctgac ccgcatgctg 1860 cgcccgtcag gtgggccgcg taccatagcc cggctcaaac gccccgatgg cagcgctacc 1920 tcggacccaa ccgtgatggg ccaggtcatg gcaacctact ggcaagacgt ctgtgcggcg 1980 ccgccgcctg ccccagccgc ccgcgcgcaa gtgctggccg cgctgcagca acacggccgc 2040 cgctgcacga cggaggaagc cgcgcaactc gggcaacttg ccgtcaccgc cgcggaggtg 2100 cgcgctgcgt tggccgccgc gccggcgggg cgcgccgcgg gccctgacgg cattcccgtg 2160 gagttgtatc agtactatga aagtgacttc tcacctgtgc tggctgatgt gtttactgct 2220 gttggcgagg gggcaggggt ccctggtggg tttctggatg gtgtgtggac atttttctac 2280 aaggcggggg atcccctgga gccctctaac tatcgcccca ttaccctcac ggatactgat 2340 tatcgcaccc ttgcacgtgt gctgtgtcgg cgcttgcaac ctgtgtttgg gcgcgtcatc 2400 gaccctgagc aaactgcctt cctgtctgac cgccgcattg ctgataatgt actgctgctg 2460 caactgactc ctgggttgct gaaagcggcg ggacagaatt ctggtgtggc ggccttccta 2520 gacttttaca aggcgtatga tacggtggac cggtcgtttc tcctcgcctg ccttgaacgt 2580 atgggcgtgg gggccggctt cttgacgtgg gtgtcgcggc tgctgactga cacgcgtggg 2640 gcggcgctgg ttaatggacg tgtgtctggt tgggtgccgc tggcggcagg agtccggcaa 2700 gggtgcccgc tggccccgct gctgtacctc gccattgcgc aagccctgtt gtcgtggctg 2760 cgcagtaagg ggcatggcgt gcttgctggt ctggcctcgg ctttggcgtc ccaatatgcg 2820 gatgattgta cgccgtttct cgcgggcatg caggctgtgc cggggttcct ggcggacatg 2880 gacgtttttc ggtgtgcgtc ggggcagcgg cttaacatgt ctaaagtgga gctactggtg 2940 ttgggcacct ccgcgggttt gggggtggcg ccagcagctg tggctgggcc ccccctgcct 3000 ccgggttggc gtgtggtcaa caccgcaaag tcgctcgggg tgcactttgg tgattggaag 3060 acggccaccc cgcctgtggc ttttgacggg gtggtggggg tgcttggcct gattgtgcgt 3120 atgcctgttt ctatgtttgg gcgagcggcg gcggcctcgg cgtatgccct gagcaaggtc 3180 ctgtaccatc tggagtttgc ggggttcccg tcgccgtggg cgccggttgg caagttgtta 3240 ggccaggtgg cggcggttgt tgatcggcgg ctgtcgccgg ctaagtatgc ggttgcaccc 3300 catgctcgtc cggtgggggt gtcccttgac atcatgcagg tgccgccggc tctgggtggt 3360 gtaggactgc tgcctcttca gctgcacgtg caggcgcggc aagctggact ggcggtgcgg 3420 tgtgtgctgg gcgcgtgtgg tttgctggcg tttgtgccgc cgtggacgaa ggtggcggag 3480 gcttggctgc gggcacgcca tccagcggcc acgccactgc gcctgttggt gcggtgcgac 3540 ccgtcacgtg ttttggacaa gttggttacg cctgagtgca aaccgttttt gcgtctgttg 3600 ggggcggtcg cccacttgcc gcctatcacg ttggtggcgc cgcccgcggc tggggactgg 3660 gtgtatcatg tgccgttgtg ggcgaatccg gcattgtgtg atggtgcaca gacttttgac 3720 gtggcgtttg ccgatttatt tgccctgccg ggccttgtgt atgtgggaca gttggtggcg 3780 gcgcatgacc agttagaggc cgtgcgtgct gcccacaccc gcccgggtgg gtggggcacc 3840 aggtttggtg aggcgatgtt ggacttgtac ttgacgagcg tgtggcggcc tgtgcttgga 3900 tggggacctc ggcggcacct gccgtcgccg ctcctggggc cggcgtcgcc cctggcggcg 3960 gcggaacgca tagccgcggc ggtggccttg ctcccaggtg gctgggaggc cgcggcgcgt 4020 gcggctcgac tggctcgcgg gcgggcgtgg gtcatgccgc ttcctgcgga tgccgtgcga 4080 caggcggcgg cgggggcctt gacccaggtg gtgcggggac tgggctgggt cctggacggg 4140 ggcccgcctc tgctgctgtt gaactactcc gtgaaggcag gaacggccat gcaactgggc 4200 ccccaggttg cggcgctgcg gtctaaacat ttgttgtatg tgcgtgatgc tggtgtttct 4260 gcggacggcg cgcttgcggc tgccggcgcc tttgccggca ctctggcccg tttgtgggcg 4320 ttaaaatggg aaaattatca caaggaagcg ttgtggcggg tggcggccaa tgcgtgctgg 4380 tcgttcccca aaaatgccgc ccagcgtgca gctggggtgg caatgcccct gtgccgggcg 4440 tgtgggcggg atatgtgtga tggggaccgc cggcactact tttgggactg tgttgttgct 4500 cgggcgctgc gggagaacat ggctatggca atgctggggt cccctcctga cacggctgat 4560 gactgtttct cccgtgctga cctttggctg gtgcagccgc ctgctgggct ggcgccgccg 4620 gtgtgggatg tagtgtgttt ggctgcgctg tctgcccttg actcgggccg gcagcaggtg 4680 attatgactg gcttgcgtgt gcgtcgggct ctgacgtcgg cggaagtcat gcgcatcagt 4740 gtgtccgtta ttgctgattt ctggggccgg ctgcaatcgt atgtcagtct cggcctccgc 4800 ccgaaagcct gggccacggt gcctgcgcac catccgttcc tctcgcgggg tgcgggtgac 4860 tgtgtgctgc tagtgttgcc tgatgatgtg ggctccccgc ccccctcgcc ttaatttgct 4920 cgtgggtggt gggagcggtg tgcctctggt gtgtgcgctg tgggtgtggg gcgtcgacgt 4980 gggcgtcgcc ggactacgtg cccggtgtac ctgcgttgac gcctcgccgg tttgattgcg 5040 cctcgtgcgc tagtgcggtt tgctgtacgt tggtggggct ttggcgggct tgggcttagc 5100 ttcgactcgc tgtgcagctg ccgagacacg ctttacgcga agacctgttg cacggttgtg 5160 gaaggtcggt agccaactgg ggggagggtg gtttcctttc gggccggaga tggtccctct 5220 tgacctgggt gcacgggcac ccttgcttgg tcggattgtc ccttctttgt aatatcgcaa 5280 ccataa 5286 // ID Mariner-N1_CR repbase; DNA; CHL; 528 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE Mariner-N1_CR is a family of nonautonomous Mariner DNA DE transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW mariner superfamily; Mariner-N1_CR; nonautonomous DNA transposon; KW TA target site duplications. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-528 RA Kapitonov V.V. and Jurka J.; RT "Mariner-N1_CR, a family of nonautonomous Mariner transposons in RT the Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 129-129 (2004). XX DR [1] (Consensus) XX CC Mariner-N1_CR is a family of nonautonomous transposons. The CC consensus sequence is less than 5% divergent from CC Mariner-N1_CR copies. Features of Mariner-N1_CRs include 24-bp CC terminal inverted repeats and TA target-site duplications. XX SQ Sequence 528 BP; 137 A; 141 C; 132 G; 118 T; 0 other; ccgtagtgat aaaaaggttt cgggactaat attttcggga cagggtcgga taatatattt 60 caaactattc gggaaattaa atattcggga cggccagcgg cggtaattgc aaatctgatc 120 caaagtttcg ggactccgtc cggaccgcgt ctgggctgtc tggcctcagc catgcgcggc 180 cccactgcat gcacgccccc actataagat ttcgcactcc tcaagtgaga gacagcaaac 240 cacagccgtt ctacagccac acagcagaca ccatgcccgc atgtgaatat gctgcgctaa 300 acggcgcgta catgggctta ggatgctggc gtgccaaacc ctgtgccgcg cactgagacc 360 ctcatcccct gtaagcgcgc gcagcgcgcg ctccgaaaat atgatgtgat gtggggtgat 420 attatatttc gggacaagag caattaaata attagttaaa taggttcggg actcaacttc 480 gggacgctct cccgaactta gaaacccgaa acctttttat cactacgg 528 // ID LTR2_CR repbase; DNA; CHL; 593 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE LTR2_CR is a long terminal repeat of unknown LTR retrotransposon. XX KW LTR Retrotransposon; Transposable Element; 4-bp TSD; LTR2_CR; KW solo LTR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-593 RA Kapitonov V.V. and Jurka J.; RT "LTR2_CR, a long terminal repeat from the green algae genome."; RL Repbase Reports 4(5), 127-127 (2004). XX DR [1] (Consensus) XX CC The genome harbors about 20 copies of LTR2_CR; they are 95% CC identical to the consensus sequence. LTR2_CR is a solo LTR. CC The LTR retrotransposon it was derived from is not known yet. XX SQ Sequence 593 BP; 113 A; 148 C; 197 G; 135 T; 0 other; tgtcacgccc cagcgggtta cctagcatag cgtgtcaccg tgctgtgttt agctgtgccg 60 atgcgcgtac ggcagacgga agggcaagac ccggggttcc gcaagaggct gtgaggggag 120 ttgccggtcg tgccagtcgc gtgtgtcgtg tgttgtgttc gtagaaccgg gtggctgagc 180 cgttgactgg gctcaggccg ccgcgtttcc cagtgtatcg tgtgtatgct atgttgcatt 240 agcctcgtgt gtttgggcgc cagtcgttag cggtcaggct tgactggtgg gcggtggctg 300 aactggcccg cgtcccatgc ggagatagcg aggttagcgc gtgagagagc tcaagcaagc 360 ttgtctttgc ttgagctact tgaacgacct ttcatttagc gatttcagag ttaaagaccc 420 gtcagagcag tcacgtcaca gactgcgaat aattgtaata taccgagacg gagcacgtat 480 tgtccgtcga ccgcctgttc tgctggtccg cggagcaagg gaccagagca gaccacagct 540 caacaactag cggggcacgt gcgggttaca tcagcccatg ggaaggcgtg aca 593 // ID Novosib-N1 repbase; DNA; CHL; 787 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 17-AUG-2004 (Rel. 9.07, Last updated, Version 1) XX DE Novosib-N1 is a nonautonomous DNA transposon - a consensus. XX KW Novosib; DNA transposon; Transposable Element; 8-bp TSD; KW Novosib superfamily; Novosib-1; Novosib-N1; Novosib-N1a. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-787 RA Kapitonov V.V. and Jurka J.; RT "Novosib-N1, a family of nonautonomous Novosib transposons from RT the green algae genome."; RL Repbase Reports 4(7), 194-194 (2004). XX DR [1] (Consensus) XX CC Novosib-N1 is a family of nonautonomous Novosib DNA transposons. CC It CC is composed of ~50 copies that are ~95% identical to their CC consensus sequence. Novosib-N1 elements are characterized by CC 8-bp target site duplications and by imperfect 17-bp terminal CC inverted repeats (3 mismatches). Novosib-N1 shares ~80% CC identical 5' 50-bp and 3' 250-bp termini with the Novosib-1 CC autonomous transposon. Therefore, transpositions of Novosib-N1 CC elements were mediated by a Novosib-like transposase. XX SQ Sequence 787 BP; 178 A; 232 C; 217 G; 160 T; 0 other; caacccgggt tcactcgggg cgtgcccata cacgccgcac cgtccaaaga ccaaaccttg 60 cctaaggctt atacatgaca tatgtatgtt atgacttgtt gttttaagcc ctggcgcggc 120 ctgcccgctc gccgagctcg cgaaatgcag cgacatgccc cgacaatgtg cacgaagagc 180 tgcttttagc cctcgcaaag ccttgaattc ccattcttat agctgaaagt agacacggca 240 acaatcacgg tttgaaaaaa tggctcctcg agggacctcg gtcgcgaaat tgccggtcgg 300 cagagcctca cattttgccc cgggatacat tccaatgcaa tattgggtat ctaaataaca 360 tattgagtga gttactgctc tctccagcac actttggccc gatcggaccg acacgcgccc 420 atcgggcgca tgcctggcgg gccaaaaccc gcccatttcg acctcaggcg agcgcggacc 480 gccctcccac ttgaccagca agtcattact acatgcacgg aggggctgag agcgtggggg 540 actcagggga gaccagtttg gccacgggga cccccctgcc cacccccatg accccccgca 600 cgggaagggc cgtacggcag tggtagtact gtcggacggt gaatctgaca cgacaggcgt 660 gttcatgcga tttgtggggg gtaggtaagg ttggtttcca cgtcggactg tactaccacg 720 actaaacaca ggtccaaagg ttgtgcagtt ttgggacggt gccccaaact gaagtgaacc 780 cgggttg 787 // ID Gypsy-3-LTR_CR repbase; DNA; CHL; 881 BP. XX AC . XX DT 24-MAR-2005 (Rel. 10.02, Created) DT 29-MAR-2005 (Rel. 10.02, Last updated, Version 1) XX DE This is a long terminal repeat of the Gypsy-3_CR retrotransposon DE - a consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3-LTR_CR; KW Gypsy-3_CR; Interspersed repeat; Long terminal repeat. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-881 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-3_CR, a gypsy LTR retrotransposon from the green algae RT genome."; RL Repbase Reports 5(2), 41-41 (2005). XX DR [1] (Consensus) XX SQ Sequence 881 BP; 138 A; 250 C; 309 G; 184 T; 0 other; tgtcgtgcgc cgttgccggc gtgtgtcatt caccggcaac cagtgcctga gtatgagtca 60 tgaggcgggt aacagcgtgt gaacttagga gcgggaagtt accccagcgg ttgagaactg 120 acgagctgtg ggtgggttgt gcgaactacg tcccggtgtg tggctagagg tttggcgtct 180 gtggtttaat tgtagccgtg tgacggtggt gtctctctgg ctctggcgtc accttcgcct 240 gcactgcgca gggcgaaggc tcgtgggctc gtgttggagg cgttgggcag ctggtcatgc 300 gggcgctgag ctgaggagga gctgcgttgg gggtctgttt gccgtggtgc tttataagtg 360 tgtgctcagg ggagcttggg gtccaggttt cgtagacggg cggttggctt gccctgctgg 420 gtggtgagcg ttgggtgtcg ggcagggggc cgcccatgca ggaggcggtg gctccctagt 480 cccgcaacac cggccctggc ccggtgttgc ggtgttgctc tctctcacac gacctatcgt 540 gcgtagcttt gtgaaagcat cggagtgatc aaccgactgc tgcctcagac tcaacacctg 600 gggccgggct gcggagcgcg ttcacgcaac caaactcaac cacggggcat cagttaccag 660 ccgctcgcac gcagccgctg tagctagctc aattaaggcc cgctcggctt agtccaccga 720 gcgcccagcc agctgggaaa gaccgccctc gcccccggag tcgccatcga cagctgacga 780 gcagtcgacc gtccgccctc tggccctagc gagtggccgt cgccaaagcg ccttcagccc 840 ccggtggcag ccctagagca aaacatggct ggggcgtgac a 881 // ID EnSpm-N1a_CR repbase; DNA; CHL; 5347 BP. XX AC . XX DT 28-MAR-2006 (Rel. 11.03, Created) DT 28-MAR-2006 (Rel. 11.03, Last updated, Version 1) XX DE EnSpm-N1a_CR is a subfamily of En/Spm nonautonomous DNA DE transposons - a consensus sequence. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N1_CR; EnSpm-N1a_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-5347 RA Kapitonov V.V. and Jurka J.; RT "EnSpm-N1_CR, a family of En/Spm nonautonomous DNA transposons in RT the Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(3), 119-119 (2006). XX DR [1] (Consensus) XX CC This is an interesting composite EnSpm transposon. It is composed CC of two EnSpm-N1_CR elements split by a ~5-kb internal sequence. XX SQ Sequence 5347 BP; 917 A; 1397 C; 1400 G; 955 T; 678 other; cactagggcg caaaagtcgc gtccaaacag acaccggcgc cagcccgacc tttcccgcgc 60 gggagcgcgt tgatcccgac ccggtaagtc ccatcgtgga acagtagccc caaggcccct 120 tccatagcgc agatgttcct gggcgctctc cagcgtgcat gcgggttcta ggtagccacg 180 cccacgctgc gaggctggga aaaaggccca ttgtgacagc aagccagctt tcgcgccatt 240 ggttgccggc gacagcccag tggtgtggca gcttgggtat gcatagcttg ttggggtacg 300 ggtgggcaat gactcccaac cttgcgcgcg taggatttat gagggcatcc tgctccctgc 360 gcaaaagcat gaatttgcac ccatgcaatg cgcgagggga gcagctaccc accgcgcgct 420 gccctgctgc caggtatcaa ctggattatc aatcccaata cccaggcatt tagctggcat 480 ggctgggaag cgcgagggaa aaaccggcat tcctgaaacc cgccactgac tgtagccttt 540 tctcaacacc attcgaaagc gtttgtcaaa ccgtacagca ggctggacaa tacgcgcagg 600 gaggacggtg cgtttaatag gggtacggca gcaagcgttc tcggcctagg aagcagctta 660 cagagtgcgg ggcagttctg taaatagagg gtatcaaatt tgatgggcgg agtgccttgg 720 ggtgcaggtc gctggacgcg acttctgcgc cctagtgacg ggcgggggcc acggcaaacc 780 ggcggggttt gatccagggt tcgccacgag cacgtggcat acccatgttc agactggcac 840 caccccaccg cctgtatgcg tccgcccatc actctaccgc tctcccctcc gcccgccctc 900 tgcaccaggt aggccggcat cacgccctgg cgctgcaact ggggggccat gcacgattgg 960 ccacagcact atgcgtcctg gtccccatcg tggcgtgacc agggatggga ggaaaaagcc 1020 gcggtggtaa tggtgcgctg ttggcgctgc cctnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1140 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1200 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1260 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1320 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1380 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1440 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1500 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1560 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1620 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1680 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nttgatggga 1740 agttgggggc cgtgctgtat agttaggctg ctgcatgccg gttgcgaacg gttgagggtg 1800 cttgacgtgt gcgagtgcgg ggatgctgac atgtctgcga cggtggtacc actctaccgg 1860 cggtgcaccc gaccacgcca cttcgtgtgt agttgtatac attggttgtg cgcaatatgt 1920 agcggcccgc accagagagc attactgtgg cgtggcctga caggcaatcg cagcttggtg 1980 gtcggcacgc gtgtcaacca ccctattgca caacccatgc ggcatggaga ccgccccctg 2040 gctgcgatgt acttgcacat acaggctctc tatagtagag ggcaggactg ccaagaagcc 2100 tttgtatatc ggtgacgctg gcgggcactc catttttgca tgtcgcgggg catgcctaga 2160 caaagttacc cttgcagccg gcactgccgc accattcctg ctataatcta agtgagatgc 2220 accattatgt gtagaccaca tcgcgtagcc caatcccaag ggctcgcgcc aaatgccgac 2280 cgaaatgcaa gcaattagca acgctgccca gcgccttttc gctggccgct tgcaccaaca 2340 attctcacat aggccataga tagcagcact ggaagcctgg agctcatgcg cgcgggcccg 2400 ccctttcaat cctggtgccg ctttccccgt ctgacgggcg ggggccacgg caaaccggcg 2460 gggtttgatc cagggttcgc cacgagcacg tggcataccc atgttcagac tggcaccacc 2520 ccaccgcctg tatgcgtccg cccatcactc taccgctctc ccctccgccc gccctctgca 2580 ccaggtaggc cggcatcacg ccctggcgct gcaactgggg ggccatgcac gattggccac 2640 agcactatgc gtcctggtcc ccatcgtggc gtgaccaggg atgggaggaa aaagccgcgg 2700 tggtatggtg cgctgttggc gctgccctgt ggtgcactgc atggccgtgt tcaggagtta 2760 gatccagcgc ctggccttcc actgctgctt tgactctggg ccgatggttg tatgatggga 2820 agttgggggc cgtgctgtat agttaggctg ctgcatgccg gttgcgaacg gttgagggtg 2880 cttgacgtgt gcaagtgcgg ggatgctgac atgtctgcga cggtggtacc actctaccgg 2940 cggtgcaccc gaccacgcca cttcgtgtgt agttgtatac attggttgtg cgcaatatgt 3000 agcggcccgc accagagagc attactgtgg cgtggcctga caggcaatcg cagcttggtg 3060 gtcggcacgc gtgtcaacca ccctattgca caacccatgc ggcatggaga ccgccccctg 3120 gctgcgatgt acttgcacat acaggctctc tatagtagag ggcaggactg ccaagaagcc 3180 tttgtatatc ggtgacgctg gcgggcactc cattttagca tgtcgcgggg catgcctaga 3240 caaagttacc cttgcagccg gcactgccgc accattcctg ctataatcta agtgagatgc 3300 accattatgt gtagaccaca tcgcgtagcc caatcccaat ggctcgcgcc aaatgccgac 3360 cgaaatgcaa gcaattagca acgctgccca gcgccttttc gctggccgct tgcaccaaca 3420 attctcacat aggccataga tagcagcact ggaagcctgg agctcatgcg cgcgggcccg 3480 ccctttcaat cctggtgccg ctttccccgt cagacgggcg ggggccacgg caaaccggcg 3540 gggtttgatc cagggttcgc cacgagcacg tggcataccc atgttcagac tggcaccacc 3600 ccaccgcctg tatgcgtccg cccatcactc taccgctctc ccctccgccc gccctctgca 3660 ccaggtaggc cggcatcacg ccctggcgct gcaactgggg ggccatgcac gattggccac 3720 agcactatgc gtcctggtcc ccatcgtggc gtgaccaggg atgggaggaa aaagccgcgg 3780 tggtatggtg cgctgttggc gctgccctgt ggtgcactgc atggccgtgt tcaggagtta 3840 gatccagcgc ctggccttcc actgctgctt tgactctggg ccgatggttg tatgatggga 3900 agttgggggc cgtgctgtat agttaggctg ctgcatgccg gttgcgaacg gttgagggtg 3960 cttgacgtgt gcgagtgcgg ggatgctgac atgtctgcga cggtggtacc actctaccgg 4020 cggtgcaccc gaccacgcca cttcgtgtgt agttgtatac attggttgtg cgcaatatgt 4080 agcggcccgc accagagagc attactgtgg cgtggcctga caggcaatcg cagcttggtg 4140 gtcggcacgc gtgtcaacca ccctattgca caacccatgc ggcatggaga ccgccccctg 4200 gctgcgatgt acttgcacat acaggctctc tatagtagag ggcaggactg ccaagaagcc 4260 tttgtatatc ggtgacgctg gcgggcactc cattttagca tgtcgcgggg catgcctaga 4320 caaagttacc cttgcagccg gcactgccgc accattcctg ctataatcta agtgagatgc 4380 accattatgt gtagaccaca tcgcgtagcc caatcccaag ggctcgcgcc aaatgccgac 4440 cgaaatgcaa gcaattagca acgctgccca gcgccttttc gctggccgct tgcaccaaca 4500 attctcacat aggccataga tagcagcact ggaagcctgg agctcatgcg cgcgggcccg 4560 ccctttcaat cctggtgccg ctttccccgt cactagggcg caaaagtcgc gtccaaacag 4620 acaccggcgc cagcccgacc tttcccgcgc gggagcgcgt tgatcccgac ccggtaagtc 4680 ccatcgtgga acagtagccc caaggcccct tccatagcgc agatgttcct gggcgctctc 4740 cagcgtgcat gcgggttcta ggtagccacg cccacgctgc gaggctggga aaaaggccca 4800 ttgtgacagc aagccagctt tcgcgccatt ggttgccggc gacagcccag tggtgtggca 4860 gcttgggtat gcatagcttg ttggggtacg ggtgggcaat gactcccaac cttgcgcgcg 4920 taggatttat gagggcatcc tgctccctgc gcaaaagcat gaatttgcac ccatgcaatg 4980 cgcgagggga gcagctaccc accgcgcgct gccctgctgc caggtatcaa ctggattatc 5040 aatcccaata cccaggcatt tagctggcat ggctgggaag cgcgagggaa aaaccggcat 5100 tcctgaaacc cgccactgac tgtagccttt tctcaacacc attcgaaagc gtttgtcaaa 5160 ccgtacagca ggctggacaa tacgcgcagg gaggacggtg cgtttaatag gggtacggca 5220 gcaagcgttc tcggcctagg aagcagctta cagagtgcgg ggcagttctg taaatagagg 5280 gtatcaaatt tgatgggcgg agtgccttgg ggtgcaggtc gctggacgcg acttctgcgc 5340 cctagtg 5347 // ID RandI-3 repbase; DNA; CHL; 11237 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 17-AUG-2004 (Rel. 9.07, Last updated, Version 1) XX DE RandI-3 is a family of non-LTR retrotransposons - a consensus DE sequence. XX KW RandI; Non-LTR Retrotransposon; Transposable Element; KW AP endonuclease; RNaseH; RandI superfamily; RandI-3; KW reverse transcriptase. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-11237 RA Kapitonov V.V. and Jurka J.; RT "RandI-3, a family of RandI non-LTR retrotransposons from the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(7), 198-198 (2004). XX DR [1] (Consensus) XX CC RandI-3 is a family of RandI non-LTR retrotransposons. CC Approximately 20 copies of RandI-3 are present in the genome. CC They are ~1% divergent from the consensus sequence and are CC likely mobile. The RandI-3 elements are usually flanked by CC 10-15-bp target site duplications. The consensus sequence CC encodes the 3671 RandI-3p protein (nucleotide pos. 98-11110) CC composed of the AP endonuclease (aa pos. 1030-1300), reverse CC transcriptase (aa pos. 1650-2000), and RNase H1 (aa pos. CC 3170-3340) domains. XX FH Key Location/Qualifiers FT CDS 98..11110 FT /product="RandI-3p" FT /translation="MATMLVLITYVMLILGVLVRTSHWPEEVRPFHTPEVL FT QALAGLLARSWTPSLTSGGPEEVATTAMHRMPPQKYAPFCIPATLVATNPA FT ICSAGSRLSSGSYGSQQMNVRSAAPTPTQQSQQPLPGTRKQRLCNETNTTL FT PVSLRPPAPDVQNLDLSRDRRRPATTSPRAANSCTSRTLKSPTLQHYVSTS FT LQPPASDGQNSDMSRGPATTSPMAANPFSPKTLKSPASQHYSTAERPPDDT FT PHDPGPGSSARGPRAPAALLCHTRTASPALQLWCQTMGPYPIVQLQMHNDT FT DPLIATVSYISAHITSALCWTAVGTFHGSCALLRHTIASLRTLLTHTAAYM FT TTCTGTLIAHSWRTAASITLWPPALTLVAASAARLAWSARTMLSHPSAGNL FT TTLTFTLLATLTLSQPLLLLISTHQHISTTLTLLTTTAAIYATLRKTHGAP FT LILLGLPVYATCRAVAVKPAYTMRTLLTILLLLTNIQATAEAPELTPAAAP FT LAMQLLTRIPGSPTLNTGVVGLTYLLSQTHIHPQPACLALTTNLATYRYLM FT DVTTPWSHTLGSPIPPPIQLRGGNPIEHTPATNRVAAATESGRKPERIQWP FT RQIQHTFFREQQQLNFCTVHAINNMLGKPWLLATDVLAFCQVSHTAMADTP FT ALYTLWGHHFRYHGLFHEAAINHYLRHRPPAVDHTGARLLLRTVAKTLTPG FT HTQDHIISLLPDCSKGLGLMVHDGRHAYAIRKENDTWYLVDSLAHIGRKPM FT TQELWAATKGDLFCLHPLHPDMPPDPLTYPTIPGLPEPPPPPTPTTQPRPR FT KGTVQLTLNTTMTRTLNRAPLNPNPENSTTSNRAQTRPQTQTTLANLVARS FT QTHAEPQGTQLTPPEDPQEPPPRTHAPTHEHGRPPTPPSAARPTRRLRRND FT QISPHSPTRHRTQQDATTENQQPQAPPPANDPVPTPQPRPTWAQVLTNNAS FT AVGTATLHVLTRLSPAPRHRPDTPTSAARPTQRPRQELPGSPTTHTATAPP FT DPPPTNPTATPPVTEQAAEPAASWLHVVTHNVRGLSTNLLSTVCHLREWDA FT DIVILTETKLGPRTRWLKDTLRHEKLSYRTFTSTKPGTAQYKRRSASVLMA FT ISKRYHSGSNIHTATIPPHLLGHLAHCTIRTPHSIPLHILGVYCPEDMPTR FT RRIYTYCTSTLKAAAAAGEHVLIGGDFNAVLTAADRTGPLDDADRHHARFV FT SSHRLHRITEPNGTTSMTYYQTRDGHPQAISRIDDILICQHTSNALVAAAT FT EPGGNTPTLKVQPSGGLFDHSSVHIHLPTFPMRLWTAPTGNRTGNTTQPPT FT GPWPQVVLPISTSTLEAVRTKIEHTLASPIANLAAALQPASTAIRDHLDRY FT TTGATNATELDTQLKGGPTVQAPNLDALAGQLHDILKDGLNILVDMCPKKP FT PRTGAFAPRRISKKIRRAHEELRQLRAAIADNDTLHTPGNAATGAPTTPQP FT SPCPPTEGEPTTPTPTRGPRPPPPNQTVSDLINKIKHVGSALRQTLKKLTQ FT SEVQKSRARFQHKLATRAKQAHKDIFSNQEQARGIPAVLHPHTKAHCTDKA FT SILEAIHTYFQDQGATITGPRTGNYKPEDRHPNTRYPFDPSNRTAPDTFKL FT HSTCSPTSHALLPSISDSANFDTCIDHLSRNKSPGPDGIPNELLRALPTPL FT RTALHNLLILMWVRAHTPASWTASETVLLPKPGDPLLLKNYRPIAQANTTY FT KLWTSLITVAISDISHDLDLFSETQEGFLRYRNTERQTLNLVHALEDAGLT FT NQDIYVMYADFSSAFNTISHDRLLEIMWDLGMPHDLLRVVQNLYANARTYI FT RTEHGLTSPVHIERGTVQGDTLSPVLFLMFIEPLIWWLHVGGRGYSYGCLP FT NHLNNRFHCSSAAYTDDLAVLTNTLSDLRIQCDKIHRYSAWAGLQVNHAKC FT RVTGILHRRAQQDKGLNGPTCNRTLKSMLENKIHIGDKPVPYLPATEPFKY FT LGVQITMNLHWGPQFAYLCDAIKEKSANLQTSLASPEQCLRIIKSCIQSMA FT AYSFAVMPYAENDIRTLDAMIARLAKKCYRLTPGFPTRATLAPADMSGLGV FT GSLLPLYAQKATRALTLALNDPGRLGMVTHALLELQSRIAGTAAIHKLRRE FT SHFYTTLKQLSIMREYDITLWDHGAPYAGPINTILQAVNRNPRPIPHHLVH FT TLAQAGLDLRALISTERTKEGLTCLIPSDSLFLRLPRNQCKRQHRIALNRL FT SLCLSAEGAPTRAAPYDQPETYNCHLPLPQQYRALPATPAFAALLPQSATA FT TGAPQTTLMAHRQHMTTLHPEPNSTQPHAPTAFNATAPPRAAQPRGVPTRQ FT AALTRHLRPVETACTLATIPAPPTQAGTPDPNQTATPIALAQGTAPTLPPA FT DTPATQPQTQTNTNKRKRINLQVKKVKGPSASKRLERRIEAEPRHDPLGTR FT GQRDHTGRHGFVLDHGEPVAGDTVPADFVMWAPRFYSMLQKDMSGKTHIPR FT LGIPPGKQKFPTTLPRPRYTRQIPTHLTTPYFLNDGEPHNPVQTYLRDHNA FT TKAPLCAYDTPARWAQAGDFIIEPEPPRWPTITGADPANPDDPREADPDPN FT GMSRKAYMAERALLRQRTQHQDGRPRLITRWMGIRAQVMTGGLQWPKNTSD FT ATRAASQLIFSLYYDLESVSHVHAGPIHKLIAAPARNERPQTALYYEVAWK FT PTTIVRGAAAAYTIMKYTGTPHAHLAYGQSCSRYFVHMHWNVTSEPASKCP FT TLQGWRAAHAAHLQRLSKPPSEPTPPPPRDSHLSPAQRQGRWTPPDVEGAA FT RRRTTRQNTQINTKPCNPYHDIYPTGAYTIQITDTYTDAQTLAHIHGPDGR FT HISSITMGTLHTLHTRYKHALASNPTETASLRSPGGFPEDVAKLLIRTRVD FT EKGKPCKTAPALERSLPTCLKTTLLDALAAAHPDETNPALMERFASPLNCS FT NRVCQYWSAQPADALFGATTDVLSAPFTGLSIAHPGPNPAAYTKALAWAVA FT TTELLEGQMDPAITLMVIPHNHKSPHTRLFSSPHVTHIAYLPPDTPATNLD FT EGLGDLSPAITLRHGLDLLLVANTTGVQAFTARNNLANLQAALTDIGAQPP FT RPTRTTAAVARWPRRITLPRDRHQPTSRPYSHRAQTRARQATHRTPPTPPA FT PPTCTVSPPIQAELNALRDCFPAISRPLYDAATVIWTDGSCIKIAVGDDGA FT EMNQLGACAWTATTGEPLYVEPGGLNSTNTIQRAELSAILAALRHYRPRAQ FT LLGLTKLVIASDSLVSLYLIRRAISDPMKLTLSKHKNLLSDIVKELNECAD FT ELLPVVFLKVRSHTGLAGNDEADKGAALVAQRDAGAMSQAVHLPPDEPWAN FT IWWPKRNSDNFYISDLNRGVLSSLPPRAQYGFTDSTTVLDNWAAAADGACP FT QSSNLTLASPARHPWLVKQILYARYGYLFNATLKRRYGLGGNGLCPLCEAP FT DSGGHILGGCSQRQMKGAYINRHNIAVQRVAKCISKGSYGGGYMVMDAGPL FT ADLPEYASGQRPPAWLFRNGTPPEVIARLRPDILFIPALDTTTTQREDYRP FT PRNRSRTPIYILEIGYSSDSQHLHKREEKSQQHQELKQLLEEAGWTVHYSP FT AECIALGVTGTIPANLVPLLTTLGVSPKAAKTCGHKLHHHAVNSAGGIIKA FT RRIMEYGHTRTPG" XX SQ Sequence 11237 BP; 2815 A; 4338 C; 2391 G; 1693 T; 0 other; aacgttctaa aactccctat tgcctctcca cactacgaga gcgccgacga gctcaacatt 60 tgagcgggac gtgccgaacg cggggatctt ccccaaaatg gccacaatgc tggttcttat 120 cacctatgtg atgctaatct tgggagtact cgtccggacg agccactggc cggaggaggt 180 ccgacccttc catacccctg aagtgctaca agctctcgca ggccttttgg cccgatcttg 240 gacgccgtct ctgacatcag ggggtccaga ggaagtggcg accacggcta tgcaccggat 300 gccaccccaa aaatacgcgc cattttgcat ccccgcgacc ctcgtcgcga cgaatcccgc 360 aatatgctct gcaggctcga ggctctccag tggaagctat ggatctcagc agatgaacgt 420 gagaagcgca gcgcctacgc cgacgcaaca atcgcagcaa cccctccctg gaactcgcaa 480 acagcggctt tgcaacgaga ccaatactac tttacctgtc agcctccgac ccccagcgcc 540 tgacgtgcag aacttagact tgtcgcggga ccgtcgcaga ccagcgacca cctcgcctag 600 ggctgcaaac tcctgcactt ccagaactct caaatcgccg acactacagc actacgtatc 660 taccagtctc cagccaccag cgtccgacgg acaaaactca gacatgtcgc gaggaccagc 720 gaccacttcg cctatggctg caaacccttt cagtcccaaa actctcaaat cgccagcctc 780 acagcactac agcacagctg aaagaccacc ggatgacacg ccccacgacc ccggccccgg 840 cagctcggcc cggggcccgc gcgcccccgc tgcactcctg tgccataccc gcaccgccag 900 ccccgccctc cagctgtggt gtcaaactat gggaccgtac cccatagtcc agctccagat 960 gcacaacgac actgaccccc tcattgccac cgtgtcctac atatcagcac atatcaccag 1020 tgcgctatgc tggacagctg taggcacctt ccacggatcc tgtgccctcc ttcgccacac 1080 catagcctcc ttgcgtaccc tcctcactca cactgccgcc tacatgacca catgcaccgg 1140 cactctgatc gcccacagct ggcgcactgc agcaagcata accctgtggc cacccgccct 1200 aaccctggtc gcagccagcg cagcacggct ggcctggtct gcccgtacca tgctctccca 1260 cccgtcagct ggcaacctaa ctaccctaac cttcaccctt ctcgccaccc taacgctatc 1320 ccaaccactc ctcctcctca tcagcacaca ccagcacatt agcaccaccc tcacgctcct 1380 caccacaacc gccgctatct acgccaccct caggaaaacc cacggcgccc cgcttatcct 1440 gctgggccta cctgtatacg ccacgtgccg cgccgtagct gttaaaccgg cctacaccat 1500 gcgtacactg cttaccattc tcctcctgct tacgaacatt caagccacgg ctgaagcacc 1560 cgaactaaca cctgcagctg ccccactcgc catgcaactc ctcacccgca tcccgggttc 1620 accaaccctc aacaccggag tcgtggggct cacctacctc ttatcgcaaa cccacataca 1680 cccccagccc gcgtgcctcg ctctcacaac caaccttgcc acctatcgct acctgatgga 1740 tgtcacaacc ccctggtcac acacactcgg cagcccgata ccccctccca tacaactcag 1800 aggaggcaac ccaattgaac acacaccggc aaccaaccga gtggctgctg ctactgagtc 1860 cgggcgcaaa cctgaacgaa tccaatggcc cagacaaata caacacacct tcttccggga 1920 gcaacaacaa ctcaacttct gcactgtaca cgccataaac aatatgctgg gcaaaccgtg 1980 gttactggcc acagatgtgc ttgccttctg ccaggtctca catactgcaa tggccgacac 2040 gcctgctcta tacaccttat ggggccacca cttccgatac catggactct tccatgaagc 2100 cgctatcaac cactacctgc gacaccggcc cccagccgtg gaccacacgg gggctcgact 2160 actcctgcgc acagtcgcca aaacactaac cccgggacac acgcaagacc atatcataag 2220 cctcctcccc gactgctcca agggccttgg tcttatggtg cacgacgggc gccatgcgta 2280 tgccatccgc aaagagaatg acacctggta cctcgtcgac tcactggcac acattgggcg 2340 caagcctatg acgcaggaac tctgggctgc caccaagggg gatctcttct gcctccaccc 2400 cctacacccg gacatgcccc ctgaccccct aacatacccc accatcccgg gcctcccgga 2460 acccccacca cccccgacgc ccaccaccca acccagaccc aggaaaggga ctgtccaact 2520 aaccctcaac accacaatga cccgcacctt gaaccgtgca ccactgaacc ccaacccgga 2580 aaactccacg accagcaaca gagcgcagac ccgcccccag acccaaacca ccctcgccaa 2640 cttagtggca cgatcccaaa cacacgccga gccccaaggc acccaactca caccgccaga 2700 agacccccag gagccgcccc cacgcacgca cgcacccaca catgagcacg gccgcccccc 2760 cactccccca agtgcagccc gccccacaag gcgcttgcgc cgcaacgacc aaatcagccc 2820 ccacagcccc accagacacc gtacccaaca ggatgccacg acagaaaacc aacaacccca 2880 ggccccaccg cccgccaacg acccagtccc cacgccacaa ccccgcccaa catgggccca 2940 agtcctcaca aacaatgcct ccgcagttgg taccgcaacg ctccacgtgc taacccgcct 3000 atcacccgcc ccacgacacc gacccgacac ccccaccagt gcggcccgcc ccacacaacg 3060 cccgcgccag gaattaccag gcagccccac aacccacact gcaaccgctc ccccggaccc 3120 accgccaacc aaccccaccg caaccccgcc cgtaacggaa caggcagctg aacccgccgc 3180 cagctggctc catgttgtca cacacaatgt gcgcgggctg agcaccaacc tcctgtccac 3240 cgtgtgccac ctgcgggaat gggacgcaga tattgtgatc ctgaccgaaa ccaagctggg 3300 cccacgcacg cgctggctaa aggacaccct aagacacgaa aagctatcat accgcacttt 3360 cacgtccaca aaaccaggga ctgcacagta taaacgccgc tccgccagcg tactcatggc 3420 catctccaaa cgctaccact cggggagcaa cattcacaca gcaacaatcc caccacacct 3480 cctaggccac ctggcccact gcactatccg cacaccacac agcatccccc tgcatatact 3540 aggcgtatac tgccccgaag acatgcccac acgccgccgc atatacacct actgcacctc 3600 cacattgaaa gcagcggccg ccgcaggcga gcacgtccta atcggtggcg acttcaacgc 3660 cgtactgacc gcagcagacc ggacagggcc ccttgatgac gctgacagac accatgcacg 3720 gtttgtctca tcccaccggc tgcaccgcat caccgagccg aacggcacca ccagtatgac 3780 gtactaccaa acacgtgacg gtcaccctca agccatcagc cgcatcgatg atatcctcat 3840 ctgccagcac acgtcgaacg ccctggtagc tgccgcaacg gagccaggcg gaaatacgcc 3900 aacactcaaa gtccaaccca gcggcgggct attcgaccat tcctcagtcc acatccacct 3960 tcccactttt cccatgcgcc tatggactgc acccacgggc aaccgcacgg gcaataccac 4020 acaaccccca acggggccct ggccacaagt ggtgctacct atctccacct ccaccctgga 4080 ggctgtaaga acaaaaattg aacacacact tgcctcaccc atagccaacc tcgctgctgc 4140 tctccagccg gcctccaccg ctatccgaga ccacctcgac cgctacacta ccggagccac 4200 caatgccact gagcttgaca cccaactaaa aggaggtccc accgtccaag ccccgaacct 4260 cgacgcactg gcaggacaac tccatgacat actcaaagac ggcttgaaca tcctcgtgga 4320 tatgtgcccc aagaaaccgc cgcggaccgg cgccttcgcc ccccggcgga ttagcaagaa 4380 gatacgccgg gcacacgagg aactacggca actccgagcg gccatcgcag acaacgacac 4440 cctacacaca cccggaaatg ccgccacggg cgcgccaacc acgccacaac ccagcccctg 4500 tcctccgaca gagggcgaac ccacaacccc aacccccacc cgcggccccc gaccgccacc 4560 gcccaaccaa accgtatccg accttatcaa caaaatcaaa catgttggaa gcgcactacg 4620 ccaaaccctc aaaaagctca cacaatcaga ggtacaaaaa tccagagccc gcttccaaca 4680 caaactagcc acgagagcta aacaagcaca caaagacatc ttctccaatc aagaacaggc 4740 acgtggtatc cccgccgtcc tacaccccca caccaaagca cactgtacgg acaaagcatc 4800 catcctggag gccatccata catatttcca ggaccaaggg gccaccatca ccgggccgcg 4860 caccgggaac tataaacccg aagacagaca tccgaacacc aggtaccctt ttgaccccag 4920 caacagaacc gcacccgaca ctttcaaact gcacagtaca tgcagcccga ctagccacgc 4980 actgctacca agcatctccg acagcgccaa ttttgacaca tgcatagacc acctaagcag 5040 aaacaaatcc ccgggacccg acggcatccc aaacgagcta ctacgcgcct taccgacacc 5100 cctgcgcacg gcactccaca acctacttat cctaatgtgg gtcagggccc acacacctgc 5160 ctcgtggact gccagcgaga ctgtcctgct ccccaaacct ggagaccccc ttctgcttaa 5220 gaactaccgc cctattgccc aagccaacac cacgtataag ctctggacta gtctcatcac 5280 ggtggccatc tctgacatat cacacgacct ggacctgttc agtgaaactc aagaaggctt 5340 cctacgctac cgtaacacgg aacgtcagac cttaaacctc gttcacgccc tagaagacgc 5400 cggcctgacc aaccaggata tctatgttat gtacgcagac ttctccagcg ccttcaacac 5460 catatcacat gaccgcctac tggaaattat gtgggatctg gggatgcccc atgacttgct 5520 acgcgtcgta caaaaccttt acgcaaatgc ccgtacttac atacgcactg aacacggcct 5580 tacctccccg gtacacattg aacgaggcac agtccagggc gacacactct cccccgtact 5640 ctttctgatg tttatcgaac cgcttatatg gtggctccat gtaggaggcc gcggctactc 5700 ctacggttgc ttaccaaacc acctcaacaa caggttccac tgctcctcag ccgcctacac 5760 cgacgacctg gcggtgctta caaacacctt gagcgaccta cgcattcaat gcgacaaaat 5820 ccaccgctac tcggcatggg cgggcctcca ggtgaaccac gccaaatgcc gagtcacggg 5880 aatcctgcac cgaagagccc agcaggacaa aggcctgaac ggtcccacct gcaaccgtac 5940 cctcaaatca atgctcgaaa acaaaatcca cattggcgac aaacctgtgc cttacctccc 6000 cgcaaccgaa cccttcaaat acctgggagt acagataacc atgaacttgc actggggacc 6060 ccagtttgct tacctatgtg atgccatcaa agaaaaaagt gccaacctgc aaacgtctct 6120 cgcgtcacca gaacaatgcc tgcgaattat aaaatcctgc atacagtcta tggcagcata 6180 cagctttgcg gttatgccgt acgcagagaa cgacatccgc accctcgacg ccatgattgc 6240 gcggctggca aagaagtgtt accgcctcac ccctgggttc ccgacacgtg ccacacttgc 6300 ccctgcggac atgtcaggcc ttggggttgg ctccctcctg cccttatatg cccaaaaagc 6360 cacacgggct ttgacccttg ccctgaacga ccccggcaga ctaggcatgg tcacgcatgc 6420 cctgctggaa ctacaaagcc gcatcgccgg cacggcagcc atacacaaac tacgtcggga 6480 aagccacttc tacaccacac tcaaacaact cagtatcatg cgagaatacg acatcaccct 6540 gtgggaccac ggagcgccgt atgccggccc catcaacacc atcctccaag cagtgaaccg 6600 taacccccga cccatcccac accacctcgt acatacgctg gcgcaggccg gcctggacct 6660 cagagccctg ataagcacgg aacgcacgaa agaaggccta acctgcctga ttccctccga 6720 ctccctgttc ctccgcctcc cccgtaacca atgcaaacgt caacaccgta tcgcgctgaa 6780 caggctgagc ctatgcctta gtgccgaagg tgccccaacc cgcgcggctc catatgacca 6840 gccggagacc tacaactgcc acctgcccct ccctcagcaa taccgtgccc tcccggcgac 6900 acccgccttc gcggccctgc tcccgcaatc cgccacagcc accggcgccc cccagaccac 6960 tctcatggct caccgtcaac atatgaccac cctacacccc gagcccaact cgacccaacc 7020 acacgcgccc accgctttta acgcaaccgc accaccccgg gccgcccagc cccgaggcgt 7080 gcccacgcgc caagctgccc tcacccgcca ccttcgcccc gttgagactg catgcacact 7140 ggcgaccata cccgcaccac cgacgcaggc aggcacgccc gacccgaacc aaaccgccac 7200 acccatcgcc ctagcccagg gtacagcacc caccctgccc cccgccgaca ctccagccac 7260 gcagcctcaa actcaaacca acaccaacaa acgcaagcgg attaatctgc aggttaagaa 7320 agtgaaggga ccgtccgcct ctaaacgcct ggaacggaga atcgaagcgg agccacgaca 7380 cgacccactg ggcacgcgag gacaacgcga ccacacaggg agacacggct ttgtcttgga 7440 ccatggcgaa cccgtggcag gagacaccgt acccgctgac tttgtcatgt gggccccgcg 7500 cttctactcc atgctgcaaa aagacatgag cgggaaaaca cacatacccc gccttggcat 7560 cccacctggg aaacagaagt tcccaactac gctaccgcgc ccacgctaca ctcgacaaat 7620 acccacacac ctgaccaccc cgtatttcct aaacgacggg gaaccacaca acccagtcca 7680 aacctacctt cgcgaccaca atgcaacaaa ggccccactg tgcgcgtatg acacgcccgc 7740 ccgatgggca caggccggag acttcattat tgaacctgag cccccacgct ggcccaccat 7800 aaccggggcc gatcccgcta accctgacga cccccgtgaa gccgaccccg accccaatgg 7860 catgtcccgc aaagcttata tggcagaacg tgcactcctg agacaacgca cacaacacca 7920 agacggacgc ccacgcctga tcacacggtg gatgggcata cgcgcgcaag taatgactgg 7980 cggcctccaa tggcccaaga ataccagcga cgccacacgc gcagccagcc agctgatctt 8040 ctccctctac tatgacctgg aatccgtatc acacgtccac gctggcccaa tccacaaact 8100 gattgctgcg ccggcccgca acgaacgccc tcaaaccgcc ctatactatg aagtggcctg 8160 gaagccgacc accattgtgc gcggagccgc tgcggcctat acgatcatga aatacacggg 8220 aacaccacac gctcaccttg cctacgggca gagctgcagc cggtacttcg tacacatgca 8280 ctggaacgtc actagcgagc ccgctagcaa atgccctaca ctccagggat ggcgagccgc 8340 gcacgctgcc cacctccaac gactatccaa accccctagc gagcctacgc cccccccacc 8400 ccgcgactca cacctgtcac cagcccagcg acagggccgg tggaccccac cagacgtgga 8460 aggagcggcg cgccgacgca ccaccagaca gaacactcaa atcaacacca aaccgtgcaa 8520 cccgtaccac gacatctacc ccacgggcgc ctacaccata cagattacgg acacatacac 8580 agatgcacaa acccttgcac atatacacgg ccctgacgga cgccacattt ccagcatcac 8640 catggggacc ctacacaccc tccacacccg ctacaaacac gccctggcca gtaacccaac 8700 cgaaaccgca tctctccgtt cccccggggg tttccccgaa gatgttgcca aattactcat 8760 ccgcacccgt gttgatgaaa aaggcaaacc atgcaaaaca gcgccggccc tcgaacgctc 8820 cctgccgacg tgcctaaaaa caaccctact cgacgccttg gctgcggccc accccgacga 8880 gaccaacccg gccctcatgg agcgcttcgc ctccccactc aactgctcca accgtgtatg 8940 ccaatactgg agtgcccaac cggccgatgc cctcttcggg gccacaacgg acgtcctgtc 9000 tgcccccttc accggcctct ccatcgccca ccccggcccc aaccccgccg cgtataccaa 9060 agctctggca tgggccgtgg ccaccacgga actgctcgaa gggcagatgg atccggccat 9120 caccctcatg gtcatcccgc acaaccacaa gagcccccat acccgcctct tcagtagccc 9180 tcatgtcaca cacatagcgt acctcccgcc agacacaccc gcaaccaacc tagatgaggg 9240 gctaggcgac cttagcccgg ccatcaccct acgacacggg ttagacctcc tactcgtagc 9300 caacacaaca ggggtgcagg ccttcacggc ccgcaacaac cttgccaacc tccaagccgc 9360 ccttacagac atcggggccc aaccgccacg ccccacccgc accacagctg ctgtagcgcg 9420 atggccgcgc cgaatcaccc tccccagaga ccgccaccaa cccacctcac ggccatacag 9480 ccaccgagcc caaacccggg cacgacaggc aacacaccgc accccgccaa cacccccggc 9540 cccgcccacc tgcacagtct ccccgcccat ccaagcagaa ctgaacgccc tccgggactg 9600 cttcccggcc atctcccggc cgctctatga cgcggccact gtcatctgga cggacgggtc 9660 gtgcatcaaa atcgcagtag gagacgacgg tgccgaaatg aaccagcttg gcgcctgcgc 9720 ctggacagcg acaacgggcg aaccactcta cgtcgaacca ggcggcctga acagcaccaa 9780 taccattcag cgcgcagaac tatccgccat cctggctgcg ctaagacact accgcccccg 9840 cgcgcaactc ctggggctga ctaagctagt gattgccagt gactcgctgg tctcactgta 9900 ccttatacgc agagcaatta gcgacccaat gaaactgacg ctgtccaaac acaaaaacct 9960 actcagcgac attgtcaagg aacttaacga gtgtgccgac gaactcctcc ctgtcgtatt 10020 cctaaaggta cgctcacaca caggcctggc aggaaacgat gaggccgaca aaggcgcggc 10080 cctggtagcc cagcgtgacg cgggcgcaat gagccaagcg gtgcacctgc cacctgacga 10140 gccctgggct aacatatggt ggcccaaacg gaactccgac aacttttaca tcagcgacct 10200 gaacagggga gtcctgtcct cactaccccc gcgggcacag tatggattca cagacagcac 10260 aaccgtcctc gacaattggg ccgccgcagc agatggcgcg tgccctcaat caagcaacct 10320 aaccctggcc agccccgccc gccacccctg gctggtaaaa caaatcctat atgccagata 10380 cggctaccta ttcaatgcca ccctcaaacg ccgttatgga ctgggcggaa acgggttgtg 10440 ccccctttgc gaggcacccg actcaggagg gcatatcctt ggagggtgct ctcagcggca 10500 gatgaaaggc gcatacatca accgccacaa catagctgta cagcgagtgg ctaaatgcat 10560 ctccaaagga agctatggcg ggggctacat ggtaatggac gcgggcccac ttgcggacct 10620 gcccgagtat gcatccggcc aaagaccccc cgcctggctt ttccgtaatg gcacaccccc 10680 agaggttata gccaggctcc ggccagacat tctcttcatc cccgccctcg acactacaac 10740 gacacaacgc gaagactatc ggccgccccg caaccgcagc cgcacaccaa tctatatcct 10800 tgaaatagga tacagctctg actcccaaca cctacacaaa cgagaagaga agtcccaaca 10860 acaccaagaa ctcaaacaac tcctagaaga agctggatgg accgtccact actcacccgc 10920 cgagtgcatc gccctaggcg tcactggtac catacccgca aacctggtcc cactcctgac 10980 caccctaggc gtgtccccca aggcggccaa aacctgcgga cacaagctcc accaccatgc 11040 cgtcaatagt gctggaggca taattaaagc ccgcaggatc atggaatacg ggcacacgag 11100 gaccccaggg tgatgtcccc aaccggcccc ccttgcgtgg tgaggcggca gggtagggga 11160 tgggccccta cactgacccg ccttgccccg ataatccacg ccaggcccta cgggccggcg 11220 tttcgattat tattatt 11237 // ID Gypsy-11-LTR_CR repbase; DNA; CHL; 2815 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-11-LTR_CR is a long terminal repeat of Gypsy-11_CR LTR DE retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 4-bp TSDs; Gypsy superfamily; Gypsy-11-I_CR; KW Gypsy-11-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-2815 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-11_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 6(5), 231-231 (2006). XX DR [1] (Consensus) XX CC It is a long terminal repeat of the Gypsy-11_CR LTR CC retrotransposon, whose internal portion is Gypsy-11-I_CR. XX SQ Sequence 2815 BP; 562 A; 808 C; 847 G; 598 T; 0 other; tgtagctgct tcgcgtgtgc cgacgcatag cgctgtccgg acgcttattg cattgcacgt 60 ccgcccgagc gcgctgtggg caccttgact ccgtagagcg gggctacgca cgcattatga 120 cggctagggt acgtgccgac ggttggaccg gagagtgagc tacactggtt acggtgatca 180 gaggcgataa gtatgcagcc tcatgcagtt tgctcaagcg gccagagcac gggagagacg 240 tgaagctgtg aggtaatgca gcgacggggc gtgaacatgg accgcctgcc aagtgccttg 300 cgagatgtgg gcacgagacg atatggttct ataggaccac agagatgtta ttggccggtc 360 tgagcggcaa tgagagaggc ggccaggaga gacgtcagca tcggttgcag acctcccccc 420 ccccgcccct gacagaacgc ggtcggcggt gaactggact gcgcgagtat gatttgcaca 480 taaagctgta cataaaggtg tggcataacg aggaaagggc tgcagtctgt cggcagcagc 540 ccgggcatga actggcggta gctcgctgca taggggactt gaatgcttga gtcctacagc 600 gatgtctgaa gacgataagt ggttagggcg aagcggtagg agcgttttcc ccgaggagcg 660 cttgtcgatg cggagccatt atgcggttcg gccctccggc taccctgggc gcggcgacgg 720 tctttaggcc gaagcctgtg ccagtggtcc gggagccttg cgcgcttgct tttgtgtgac 780 tttgctaggc ggcaatagcg tttgcaggcc tagactttga gggacgaata gagaacaagg 840 caggttgagg gatggagact gggcattgat gctcacaccc ttccctagct tgccccgtgc 900 ccttggtggc cctggggcgt tgtttgagct acggtgcgca tttccaaata gcgacgcaag 960 cagggggtag ttcaggcgct gcatggggac gctgcatgag ggcaaagtct gagtggaagc 1020 tggcccctgc accttccagg cctagctccc ctgcgccaat cgcgatggtg gaaggtgccc 1080 ggggcaatat aggagcgctc cggagcgcca agaaactaac ccggcactcc tgtgtaatac 1140 gccacgagta attgacaagc acgaaccttg ctcataccgg aagaacagtc tcccagccct 1200 caagggctct tctcccctga cagcgggcct gtcagcaagt ttccagcgcc aaggcgcgac 1260 agtgattgag gcctaaggca atccgccaga gcctcggcga gctgcggcgt ctccctagct 1320 cgatatgcta ggtgtgcagg cttggatgta gtggactttg aagagcggcc taggacttgg 1380 aggttgtagt ttcggaggtt gtgactcttt cgtggtgagg cgtcagcgtg aggggggcgg 1440 gccctctcgc cctagtcacc ttgccccgtt aatccatgcc aggccctatg ggccggcgtt 1500 gtaattatta ttattattat ccgccagagc ctcctcttca gcgggctcct gtgccccgtc 1560 ttccgcattc gcacgcccgg tcgcatcgac cccgtgcgcg actcttcgtg acccgtctgc 1620 tcccgcgacg tggtcctgtg gtatcttacg gacttcgact acgtatatcg gcggctacca 1680 gcgcctggac ggcaacaacg agcctcgcga gcccctcggc tgagctcatc gccacctacc 1740 gagacctccc gacagctctc tccaggctgt ggcctattcc ctggattctg cttccccctt 1800 ggtttatcta caaatcaaga ctactctgag acttccctga cgaccccaac gtgaagaccg 1860 gcttgcgagc atcgattcgt cgccgactgc actgggtcaa gaccctgagt tgttcaagac 1920 ctccaataag acctctggtt ggctcttgag ctctgtttgg taacaactga ctgcatactg 1980 agattccggt tccattacgg ttgcaaggct cgcttgactc taggcttgca gcagccgcat 2040 tgtacctatt gtttgggaga aggcagctga gtgagacagc tcccaaagcg cccgttgctc 2100 ggactcgtac cccggcttga cacacgtgtc gttgcccgca gggagtccgc ttcggcgcat 2160 tctccaacag gtcctgcgcg acctcggccc tcacggcccg gcgccagcgc ctttatacat 2220 atctgaccgc caacgacgat tcagtggagt aacaaggcct tcaagacctg gtttattggg 2280 tgtaccgtct gacatcaacg gcgggcatca ccaacgtatt caagttcaag cgcatccccc 2340 ccttgcgacg cacgaaacag ccgcaggcgt cgtcatacga ctgttgctgg gagaagctcc 2400 gttgtagtcg gccccaacac atcgcgctgc aaaaggtgtc cgcccagcag gacgtggccg 2460 gcccgcgcag agtagcgaca gggctgcgtg ggctgcgtgg cgctgtggac gggcggggca 2520 cacacacgaa acacacgctt gccgatcccg cggttgaggg tcaatgcctt tcagcctata 2580 cgcaatatgc aaaatggatt tgtgcatgct catatttaca tatgcaccca gcctgcgctg 2640 ctggcctcag ctttgcggca cctccgcgag tgccggcatg atcatgtgtt gctcgcgcga 2700 gcacagcaag caaacgtctc aagtgcacca atggtgcact gaacgagttt aagccaacta 2760 taatgaaact cgcgcgcgta gtagtgccat gtgatggatc gaatagctat agtgt 2815 // ID TE2-5_CR repbase; DNA; CHL; 751 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE TE2-5_CR is a family of nonautonomous transposable elements - a DE consensus sequence. XX KW Transposable Element; AAC target site duplications; TE2-5_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-751 RA Kapitonov V.V. and Jurka J.; RT "TE2-5_CR, a family of nonautonomous transposable elements in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 140-140 (2004). XX DR [1] (Consensus) XX CC TE2-5_CR is a family of nonautonomous transposable elements. CC The consensus sequence is less than 5% divergent from TE2-5_CR CC copies. Classification of this family is not known. It is CC possible that it belongs to a novel superfamily. Features of this CC superfamily include preferential insertions into AAAC targets CC that are accompanied by the AAC target-site duplications, a CC 18-21-bp CC palindrome separated by 3-5 base pairs from the 3' end; and the CC AGCC CC 3' termini. XX SQ Sequence 751 BP; 146 A; 200 C; 241 G; 164 T; 0 other; acctaccgct acgcagtact ccggcttgcc cctcccccct cagggcgctc catgccgtgc 60 gccccggaat gcggtgtggg gatggggtgg ggtgggatgg actggcggat ggtggaagta 120 tcatataggt ctgctggggg ctgcgagcag ccatactacc ggtcatgata ttgccttcgg 180 tagctcaggg cgccgcgggc tgggggtggg taagtgagcg gggatggggg ctcggactca 240 gcgggggcac ggactcaggg cgtggaagcc agcgagaagc gggcgagagt aacgccgatg 300 tgccgggtga tgtgcgagca gaatacagaa cgcggcattc gtcattacgg ggaaactaaa 360 acagatagcg ccaacctttt gcgcctcgcg ctttcgcgcg caatttccgg gatcctccaa 420 cgcgtcgatt cctccgtctt gcgtccttcc gtgccgggag ccttgattat cgatacaatt 480 tgtgatatat tgtatgttgc gctggagctc taatgcaaac gctgcgcttg tgcgtttacc 540 ccgctctggc atagggcttg cgttttcata atgttacgct ttgcaataca tacatagaaa 600 ggcgcattat ttgcacgaca aatttcgctg cgctcgggtt tcaaggttat aactcaagtc 660 aaaactggaa taaccgggag ggccccccgc ggcgagctcg atgcgaccgc aatggctatt 720 gcccctcccc cttcggggga aggggcaagc c 751 // ID NonLTR-1_CR repbase; DNA; CHL; 761 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE NonLTR-1_CR is a non-LTR retrotransposon - a consensus sequence. XX KW Non-LTR Retrotransposon; Transposable Element; 5-15 bp TSDs; KW NonLTR-1_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-761 RA Kapitonov V.V. and Jurka J.; RT "NonLTR-1_CR, a family of non-LTR retrotransposons from RT Chlamydomonas reinhardtii."; RL Repbase Reports 4(5), 135-135 (2004). XX DR [1] (Consensus) XX CC The genome harbors several hundred copies of NonLTR-1_CR1. They CC are over 85% identical to the consensus sequence. NonLTR-1_CR CC elements are characterized by a high rate of the premature CC termination of their reverse transcription. Above 14% of all CC NonLTR-1_CR elements are shorter than 35 bp. Only a few CC elements are as long as the consensus sequence. Most likely, CC the consensus sequence represents the 3' non-coding terminus of CC an autonomous NonLTR-1_CR elements. Because of the frequent 5' CC truncations, a coding portion is unknown. NonLTR-1_CR elements CC are characterized by 5-15 bp target site duplications. XX SQ Sequence 761 BP; 93 A; 279 C; 202 G; 177 T; 10 other; gtgtgcgcgt gcgtgtgcgt gcgtttgcgt gctcgtgggt ccgtgcgtgt gcgtgtgtgt 60 tggtcccatc ggcgcacggc ctgtgctgat gacggggttc cgccacggcg mtgcccggtt 120 ctgaacatgg ccaacccgcc ttcgcatcgc ccccatcgca ttcacgcatc catgccccag 180 tgggcgcgca ctcatgggtg tgcgcgtccw sgccgrcagg gtwgcgaggg agccgttcgg 240 cggcccgcgc cctgtccgca cncacccacc catcccgccg cttcgtctcc ccgcccatgc 300 cgcggccgcc ccatctcacg ccgagcccat gcactcttgc cacttaacac tgccgccctc 360 cctcctttct ccctctttct tcctttgtca agccgcgcgc gggatcccat agccttgtac 420 tgccctgctt cgctacttcc gcgacttcta ctcatactcg ccgccttgcc atcccccaca 480 tgtccgtcgc gcccggctgc tggacgcttc tacctggcgc tnctncgttc gcrtccaggt 540 gagtatgtta gtatagattt gcaactagtt gggcgacggg gtccgcgcgc ggctcgcact 600 gctgacatga cttcggagct gctctggact gctccttcca cctgcygcta cactgcccaa 660 gcccgcccga ctacctatgg tgatgtacca cagctcgcat atgcggctgc agtttctcct 720 tgcagcgcga ctgtttcctt acgggattgg ttcaaagtta a 761 // ID hAT-N2_CR repbase; DNA; CHL; 238 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE hAT-N2_CR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; 8-bp TSD; Gulliver; KW hAT superfamily; hAT-N2_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-238 RA Kapitonov V.V. and Jurka J.; RT "hAT-N2_CR, a family of nonautonomous hAT transposons from the RT green algae genome."; RL Repbase Reports 4(2), 49-49 (2004). XX DR [1] (Consensus) XX CC About 100 copies of hAT-N2_CR are present in the C. reinhardtii CC genome. They are ~97% identical to the consensus sequence. CC hAT-N2_CR is characterized by 21 imperfect TIRs and 8-bp TSDs. CC hAT-N2_CR shares common termini with Gulliver. XX SQ Sequence 238 BP; 49 A; 68 C; 63 G; 58 T; 0 other; cagggctcct atcttaatgt ctccagacat taattggcca ttttggccgt ttcctagaca 60 ttcctcccgg gggctaatgt ctggggcgag actttgttgg cccgggtttg gggcgggttt 120 tgtccgaatt cctatggaga agcccccaaa ccgccatgtg cccagacaaa agacggcggc 180 cacccggccg tttttgtctg gagctagaca ttaccccgaa aacaagatac gacccctg 238 // ID NonLTR-4_CR repbase; DNA; CHL; 588 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE NonLTR-4_CR is a 3' terminal portion of a non-LTR retrotransposon DE - a consensus. XX KW Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; NonLTR-4_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-588 RA Kapitonov V.V. and Jurka J.; RT "NonLTR-4_CR, a family of non-LTR retrotransposons from the green RT algae genome."; RL Repbase Reports 5(3), 65-65 (2005). XX DR [1] (Consensus) XX CC The NonLTR-4_CR elements are flanked by 5-15 bp target site CC duplications. 5'-CT is always a 5' terminus of any target site CC duplication. The consensus is not complete at its 5' end. Because CC of frequent 5' truncations, a coding portion is unknown. CC NonLTR-4_CR elements are >95% identical to the consensus CC sequence. XX SQ Sequence 588 BP; 149 A; 133 C; 182 G; 124 T; 0 other; gagtccacat ccaagatacg gcggctcttg aggacgactt acttgacttg attgtcctac 60 gggaacactt atggctgaca actaagccgc atcatacaat tgaacaagct gtctgggatg 120 tagtctgcat ggcagcaacc tcggcgatgg aatatggtag gagattgcta tatgcagcca 180 ggaacgagca aggagtgatc gatgggaacg cagtaatggc gcgcatcaaa tcgcaagtta 240 tagcagactt ctggtctagg ttacaatcgt atgtgtgtct tggtgccaaa ccgagggggt 300 gggaccgagt ctcgccaggc cacccttgga ttgagcgaag agcagatgga agtatgaagc 360 tgaacagacc acgaagtgtc ataatggaag aggaggtgga gtgatggcac ctagtaacat 420 gttcggtatg gcgggaggag cgtcctatgg gagcaagcct actagagcta gagtaggccg 480 acggcgctgc aggcggcacg cccctggggt ccagagcagg gtgttcgttc tccacgtcac 540 tggctcgtga actgcgtcag caggtccctc ttgtaaccaa tgcaatct 588 // ID hAT-N8_CR repbase; DNA; CHL; 646 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE hAT-N8_CR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; 8-bp TSD; KW hAT superfamily; hAT-N8_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-646 RA Kapitonov V.V. and Jurka J.; RT "hAT-N8_CR, a family of nonautonomous hAT transposons from the RT green algae genome."; RL Repbase Reports 4(5), 145-145 (2004). XX DR [1] (Consensus) XX CC ~20 copies of hAT-N8_CR are present in the C. reinhardtii; they CC are ~99% identical to the consensus sequence. The hAT-N8_CR CC family is characterized by 14-bp TIRs and 8-bp TSDs. XX SQ Sequence 646 BP; 142 A; 182 C; 170 G; 152 T; 0 other; cacgcgggta ccaccaggcg gctggatggg gtctacggga gccccttctt ccgcgggcct 60 tttcactatg agcaataact cgtacaagga tgctgacccg acctaccccc cacaattcga 120 tagcgctcta tccgcctggc tagttagatg tcttgtacga gctattgccc gtggtgaaaa 180 ctgggctgcc gtccgggggg tgcgttttcc cagccaaacc cgcacgttgg acgttgcccc 240 ggggcaaatc ctaccgcacg gattagacac cgcctttctg taggtacata accaacatca 300 tcatcagcca gaagtggtcg gcaaaggtcc aaattatgct tatcggggct caagtcgcga 360 aattgaccga agcccatatc tcgcatatgc gctgtttggg gcctgaatct attgccgtcg 420 acattaattc tcgtatagat gtaatcaaaa tagcttcagg ctaagttggc ggggtcctgg 480 cgagcgcgac atatagcatt tcaacttgag ctctcgctca aaattatgcc cgagcaccat 540 ccagggacct tattatgtgt aatgggatgt caattcatgg tcggggcgac agtctgggca 600 tagaccttgc gatcccgccc ttgacccccg gagtggtacc cgcgtg 646 // ID RandI-4 repbase; DNA; CHL; 7807 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 22-JUL-2010 (Rel. 15.08, Last updated, Version 2) XX DE RandI-4 is a family of non-LTR retrotransposons - a consensus DE sequence. XX KW RandI; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; RandI superfamily; KW RNase H1; RandI-4; Dualen. XX NM RandI-4. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-7807 RA Kapitonov V.V. and Jurka J.; RT "RandI-4, a family of RandI non-LTR retrotransposons from the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(7), 199-199 (2004). XX RN [2] RP 1-7807 RA Kojima K.K. and Fujiwara H.; RT "An extraordinary retrotransposon family encoding dual RT endonucleases."; RL Genome Research 15(8), 1106-1117 (2005). XX DR [1] (Consensus) XX CC RandI-4 is a family of RandI non-LTR retrotransposons. CC Approximately 50 copies of RandI-4 are present in the genome. CC The 5' terminal portion of the master RandI-4 element is not CC completely reconstructed. Most of RandI-4 elements are shorter CC than 100-bp and represent the 3' terminus of the RandI-4 CC consensus sequence. The consensus sequence encodes the 2548-aa CC RandI-4p protein (nucleotide pos. 1-7644) composed of the AP CC endonuclease (aa pos. 130-350), reverse transcriptase (aa pos. CC 770-1070), and ribonuclease RNase H1 (aa. pos. 2030-2190) CC domains. CC This consensus is >97% identical to DualenCr2 in [2]. XX FH Key Location/Qualifiers FT CDS 1..7644 FT /product="RandI-4p" FT /translation="KLGPTPAALAQDPPTNPTGGPPAGPNTTNSPSPPTSA FT NPGPAARSNTATRIVPKPAGPTKRPARRTPNPTPKLRKLATGTPSAPPPAK FT RLHQTTIATSHKRRHTATLQNTPTEPTNGQAQASPSPPPDGRISILTLNTR FT ELLSHHESILHTVTTHTPDILFLTETKIPTRLHYSMGSAKKGLSSHYQTYT FT SNAPYNPREGTRPHAGVMVGVTHRFSRPSGTTDMNAHNTDSRLAGHVCHIR FT IAPPHSTPVHAIGIYAPSDDATRAAIYKYCTDQAAKTTAAKEHLLIGGDWN FT AVLFPSDRSTGQLNATDRAHIAFTTSAGLHPLTPPTHEHPRPHTYLQDQAT FT APHMSRIDDLLVSHHTHATITTTGGGQEAVHECGDALDHRALLSSVPLAAL FT RLYPFPTAPAAPSDPDAWVNYHPPTKDQREKARIKMEETIGQEIADLLNTL FT HPAHLAITNKLYPAGLTPAQAERNTPSPAEFRALPEVSAADTTSISNTLES FT ILQKCLDVLKSYCDKKPPRASTTSPRHHTSKTVTRLLTATSKLKERLTAAL FT TVTNAAGATPPTTSPHPAPTPPPNTTSPTPTPPTPAEADTPTAIRSQLLNA FT RTLLRAAKKEARKLNTKRAARAFRRLVATQPKRAHQKIFDETDQPRGTPAL FT TDPTTKEVHTAPEKILSSIHQHFTAQLLPINGPRTGHYDIPHSEKQYPWSR FT PDAPDRMPALSKPTDAPATLLPGIADSTTFFGCVNHLSRRKAPGPDGIPNE FT LLQTIPTSLQTAIHQLMTLMWVKAHIPTHWTQSDTVLLPKKGDPLLIQNKR FT PIALANTLYKLYTSLITVNAALFSEQAHIFTESQEGFLRGRNTERQVQNLL FT HAIEDAATTGRDLFLLYIDFTSAFNTIDHDKLLIVMHDLGYPTDLIEVVRD FT LYGKAATSVRTEHGTTPPIPIQRGTVQGDVLSPLLFIIFIEPLLRWLHVGG FT RGYQYGCLTKAENDKHNLSSGAFADDLVTATTQISNLRIQVEKIEKYAAWG FT GLKVNVSKCQLQGILHGRARADPTLGGPRSKPHQAALARMLEGQIRIDGTA FT IPYKPPHESYKYLGIHITPTLDWSAQVQYIREKIKKRGQRIATSLATPDQK FT LRMIQLTIHPAVAYSLPFMPYSTHDVTSLDSLIASTAKMCYGLKRGFPTRA FT ILQPTNALGLGVGSIRELYVRRAGKALVQALNDGGRLGAVTRAMLHLQHSV FT AQKIPADQLRTQCRFYTSLRQLSLITDAGLELTQQGSAYHTTLHGLSALLA FT RTVPPGADAEQALPFRLIQPLLELTSDFRELLTPDRTCVIDANTLKNTWPR FT AQRRHRLALNRLTLALHAETRTRAKTHNSTAPLTTAQRTLPDGHTFTGLHT FT AHTAPAAVDWGTLNRFLRQRAPATGQTANAGETAAATTAAATTAAAEPEED FT HHAGENHAPAQTRRPGTQQRKRACIKMVREPEGRGSPPADAPAAFDWDPSL FT DPLTEAHCADLIHTAPSLLGKSKPQNPAAHLRGLLLGPTPPATFTHRHKLS FT QEEQDAYLAATSDTWKHFSETYRTKALTTTNTLYGQQERVQQVIAHTTSKG FT APFYRVQWEPTVITPGSLTLYARNRYRPKPGGLTWLLTPGNEPRVLVEWED FT LEEPAAILDAQDNKQELLDALNARLAVPLGPRPPPPATDTHLSHDQRQGRW FT TDASNDTARGQRTLRHYVDINVEPCNPSRDIHPTQAYTVQLGTVLDPGSPA FT CGTHLAHVYDPRGRCIGTISAEQLSLLWQRYQHAQTTWPKVHAKHDQGRGF FT AAAIARLLLRYKPGGGSKSHPVTEKGECMLCEGLTQTLHTAIGSTTELFAS FT PLNVSDAATTYYTLYEEDMLFGANYDAYSQPWDGVAVACPEPNTEDGAHAV FT RWALAWAEHTETLTNTRNPPTTATLLILPTQRNAPYTRYLAHPRVTLLGSI FT SATTTHALLGQIHNDTGITLKHDLVLVSNTPGAQLAQPIRDALGTSLYLSG FT MPVDPQPQLTWAHTRWVTKQVRRMTRTRPAAPRVTPPPVPATPATLAPFAA FT TQPLRLNPYEGFYTDGSVQKDALGQPSAWGAAAIDARTLRCYQIDVNPGQH FT GCNTITRCELSAIHQVLKLDEVQALPEVTVYTDSAASLFLIRKTINQPQLL FT TFSKHKALLDNILALLKRRSHAGRRTNLLKVKAHSGIAHNELADQAAVKTA FT KGEQIPGAEWLVEESDNDPRSKYYWIRKIIKPTVEGVPREKNEGHYAHDLN FT RGIATLAAPHIAQGHTRETLYTGFVANAIPDLDIKSSTRTMNLQSGRHRKA FT KRCMDYLYGQLWNNKLAARFGRTVTGRRAPRAGPDTPAPCPLCRGPDSGGH FT ILGGCHNPQLRAMYIKRHNIAVQTIARAISRGDCGGCYMVMDATSKADLPD FT YAADNRLPPWLLPDQEGKPAGRHLHYRPDILLIPSISLAAALNPDFVVLPS FT EHDTIHIIEAGYTADTNHAAKQHEKAQQHQALAADLREAGWKVQYTPQSAI FT SLGFAGTIRKDLHPLLTSLGVTSHKARQCCDTLHDHAVATLNHIVLTRRRL FT ERGLPPGDPGGT" XX SQ Sequence 7807 BP; 1945 A; 3122 C; 1696 G; 1043 T; 1 other; aaactcggcc ccacacccgc agccctcgca caggaccccc ccactaaccc taccggcggc 60 ccccccgccg gccccaacac cacaaactcc cctagcccgc ctacgagcgc caacccaggc 120 ccagccgccc gaagcaacac tgccacgcgc atcgtgccca aaccagcggg ccccacaaaa 180 cgccccgcca ggcgtacgcc caaccccacc cctaaactac gcaaactcgc caccggcacc 240 cccagcgcac cgccgccagc caagagactc caccagacca ccatagcaac atcgcacaaa 300 agaagacaca cggccacact ccaaaacaca cccaccgaac ccacaaacgg ccaagcccag 360 gccagcccct ctccaccccc cgacggacgc atctccatcc tcaccctcaa caccagggaa 420 ctgttgagtc accatgaaag catcctccac acggtcacca cacacacccc cgacatcctc 480 ttcctcaccg aaaccaaaat tcccacccgc ctccactaca gcatgggcag cgctaaaaaa 540 ggcctctcct cgcactacca aacctacacc agcaacgcac cctacaaccc acgagaggga 600 acccgccccc atgccggcgt catggtcggt gtcacccacc gctttagccg cccgtctggc 660 accacagaca tgaatgcaca caacacagac tcacgcctgg caggccacgt ctgccacata 720 cgcattgcac caccacacag cacgccagta cacgccattg gcatatacgc ccccagcgac 780 gacgccaccc gtgctgcaat ctacaaatac tgcaccgacc aagccgccaa aacgacagcc 840 gcgaaagagc atctccttat tggaggcgac tggaacgcag tgctgttccc ctccgaccgc 900 agcacaggcc agctgaacgc tacggaccgc gcccacattg ccttcaccac ctcggcgggc 960 ctgcacccac tcacaccgcc cacgcacgaa cacccccggc cccacacgta cctacaagat 1020 caagccacgg caccacacat gagccgcatt gacgacctac tagtaagcca ccacacacac 1080 gccaccatca ctacaacagg aggtggacag gaagcagtgc atgaatgcgg tgatgccctc 1140 gaccaccgtg ccctcctctc cagcgtcccc ctcgctgccc tccgcctcta ccccttcccc 1200 acggcacctg ccgccccctc cgaccctgac gcatgggtaa actaccatcc accaaccaaa 1260 gaccagcgcg agaaagcgcg catcaagatg gaagaaacca ttgggcaaga gatcgccgac 1320 ctactgaaca cgctacaccc cgcgcacctc gccatcacca ataagctata cccggccggc 1380 ctcacccctg cccaagccga gcggaacacc ccaagccctg cggaattccg cgcccttccc 1440 gaagttagtg cagccgacac cacgtccatc agcaacacac tcgaatccat cctccaaaag 1500 tgcctggacg ttctcaaatc ctactgcgac aagaaaccgc cgcgcgccag cactacaagc 1560 ccacgacacc acacgtcaaa aacagtgacg cgcctcctga ctgccaccag caagctcaaa 1620 gaacgactca cagccgccct tactgtcaca aacgcagcgg gcgccacccc acccaccacc 1680 tcacctcacc ccgccccgac gcccccaccc aacaccacca gtcccacacc taccccgccg 1740 acccccgcag aagccgacac acccaccgcc atccggagtc aactactcaa cgcacgcacc 1800 ctcctccgtg ccgcaaaaaa ggaagcaagg aaactcaaca ccaaacgggc agcccgcgcc 1860 ttccgacgac tggttgcaac acagccaaag cgcgcacacc agaagatctt cgatgaaacg 1920 gaccagccac gcggcacgcc cgccctcact gaccccacca ccaaagaagt ccacaccgca 1980 cccgagaaaa tcctgagcag catccaccag cacttcaccg ctcaactcct cccaatcaat 2040 ggcccccgca caggacacta cgacataccc cacagcgaga aacagtaccc ctggagccgg 2100 cccgacgcac ctgatcgtat gccggcccta tccaagccaa ccgacgcacc cgccacccta 2160 ctgcccggca tagccgacag cacaacgttc tttggctgcg tcaaccacct gtcccgccgc 2220 aaggccccgg ggcccgatgg aatacccaac gaactcctcc agacaatacc aaccagcctg 2280 cagacagcca tccaccaact catgaccctc atgtgggtga aagcgcacat ccctacacac 2340 tggacacagt ccgacaccgt cctcctaccc aagaagggag accccctgct gatacagaat 2400 aagcggccta tcgcgctggc caacaccctc tacaagctgt acacctctct catcaccgta 2460 aacgccgccc tgtttagcga acaagcccac attttcacgg agtcacaaga aggatttctc 2520 agaggccgca ataccgaacg gcaagtccaa aacctgctcc acgcaataga agacgccgcc 2580 accaccggac gcgacctctt cctactgtac attgacttca cctccgcatt taacacgatc 2640 gaccatgaca aactgctcat cgtcatgcac gatctaggct accccaccga cctgattgag 2700 gtggtacggg acctctacgg caaggcagcc accagcgtgc gcactgaaca cggcaccaca 2760 ccccccatcc ccatccaacg cggcacggta cagggcgatg tcctgtcacc gctactgttc 2820 atcatcttta ttgaaccgct gctccggtgg ctacacgttg gcgggcgcgg gtaccaatac 2880 ggatgcctca ccaaagccga aaacgacaaa cacaacctca gctccggcgc cttcgccgat 2940 gacctggtca cagccacaac ccaaatctca aatctacgaa tccaagttga aaaaattgag 3000 aaatacgcag catggggcgg tttgaaggtc aacgtctcca aatgccaact acaaggcata 3060 ctccacggcc gcgcgcgcgc cgaccccacc cttggcggcc cccgcagcaa gccccaccag 3120 gcagcgctcg cccggatgct tgaaggccaa atccgcatag atggcaccgc tatcccatac 3180 aagcccccgc atgaatccta caaatacctt ggcatccaca tcacccctac cctagactgg 3240 agcgcacaag tacagtacat ccgggaaaag attaagaagc gagggcaacg catcgccaca 3300 tcacttgcca caccagacca aaagctccga atgatacaac ttaccatcca cccagcagtg 3360 gcatacagcc tcccgttcat gccgtacagc acacacgacg tcaccagcct cgactccctg 3420 attgccagca ccgccaaaat gtgctacggc ctgaaacgag gcttccccac ccgcgctatc 3480 ctccaaccca ccaacgccct tggcctgggc gtaggcagca tccgcgagct gtatgtccgc 3540 cgcgccggca aggctctcgt ccaggccctg aacgatggag gcaggctggg cgccgtcacc 3600 cgcgccatgc tccacctgca acacagcgtg gcacagaaga tacccgcaga tcagctccgc 3660 acacaatgcc gcttctatac cagcctgcga cagctgtcgc tgatcacaga cgcgggcctg 3720 gagctgaccc aacagggcag cgcataccac actacactac acggtttgtc ggccctcctt 3780 gcacgcaccg tgccgccagg cgccgacgcg gaacaggccc tgcccttccg attgattcaa 3840 cccctccttg aactcaccag cgacttccgc gagctcctca ctccggaccg cacctgcgtc 3900 atcgatgcca acactctaaa aaacacatgg ccgagggccc aaaggcggca caggctagcc 3960 ctcaaccgtc tcactctcgc actccacgct gaaacccgaa ctcgcgccaa aacccacaac 4020 agcaccgctc ccctcaccac tgcccagcgc actcttccag atggacacac cttcactggc 4080 ctccacacgg cccacaccgc acctgccgcc gtcgactggg gcacgcttaa ccgcttcctc 4140 cgccagcgcg cccccgccac cggacagact gccaacgccg gagaaaccgc tgcggccacc 4200 accgccgccg ccaccaccgc ggcggccgag cctgaggagg accaccacgc cggcgagaac 4260 catgcacctg cacaaacacg ccggccaggc acacagcaac gtaagcgggc ttgcattaag 4320 atggtcaggg aacctgaagg tcgtggatca ccgcctgctg atgcgcccgc agcatttgat 4380 tgggatccat cgctcgatcc gcttacagaa gcacactgcg cagacctgat ccataccgcc 4440 cctagcctcc tcggcaaatc caagccccaa aaccccgcgg cacacctcag aggcctgctc 4500 ctcggcccca cacccccagc cacattcaca caccgacaca agctcagtca ggaggagcaa 4560 gacgcgtacc tagccgccac gtccgacaca tggaagcact tcagcgaaac ctaccgcacc 4620 aaagccctaa ccaccaccaa caccctatac gggcaacagg agcgagtaca acaagtgata 4680 gcgcacacca ccagcaaagg cgcgcccttt taccgagtcc agtgggaacc caccgtcatc 4740 acccccggct ccctcaccct ctacgcccgc aaccgctacc gccccaaacc cggcggcctt 4800 acctggctcc tcaccccagg taacgaacct cgcgtccttg tggaatggga agacttggaa 4860 gaacccgccg ctatcctgga tgcacaagac aacaagcagg aactcctaga cgcccttaac 4920 gcccgcctcg ccgtccccct gggcccccgc ccgccacccc ccgctacgga cacccacctc 4980 agccacgacc agcgacaggg ccgctggaca gacgccagca atgacacagc acgagggcag 5040 cgcactctgc gccactacgt agatatcaac gttgagccct gcaaccctag ccgagacatc 5100 caccccacgc aagcatacac cgtccagctc ggtacagtcc tcgaccccgg ctccccggca 5160 tgcggcaccc acctggcgca tgtctacgac ccccgcggac ggtgcatagg caccatcagc 5220 gcagagcagc tcagcctcct gtggcaacgc taccagcatg cacaaacaac ctggcccaaa 5280 gtgcacgcca agcacgatca gggcagaggc tttgcggccg ccatcgcccg cctgctcctc 5340 cgatacaaac ccgggggcgg ctccaaatcc caccctgtta ccgaaaaagg cgaatgcatg 5400 ctgtgtgaag ggctgacgca aaccctgcac acagcaatag gcagcacaac ggagctcttt 5460 gcatcacccc ttaacgtctc ggacgccgcc accacctact acacactgta cgaagaagac 5520 atgctctttg gggccaacta tgatgcgtac tcacagccat gggatggagt tgctgtcgcc 5580 tgccccgaac ccaacaccga agacggagct cacgcagtcc gctgggcact agcatgggcc 5640 gaacacaccg aaacccttac aaacacccgc aacccgccaa ccactgccac gctgctcatc 5700 ctccccaccc aacgcaacgc gccatacaca cgatacctcg cccaccccag agttacactc 5760 cttggcagca tctccgccac aacaacacac gcccttctcg gccagatcca caacgacaca 5820 ggcatcaccc taaaacacga cctagtgctg gtcagcaata cacccggagc ccaactcgcc 5880 cagcccatac gcgacgccct aggcacatcc ctatacctct cgggcatgcc cgtcgacccg 5940 cagccccaac tcacgtgggc ccacacgcga tgggtaacca agcaagttcg ccggatgacc 6000 cgcacccgtc cggcggcacc ccgggtaaca ccccctcccg taccggccac gccagccacc 6060 ctggccccct tcgcggccac ccagcccctc cgcctcaatc cctatgaagg cttctacacc 6120 gatggctcag tgcagaagga cgccctcggt cagcccagcg cgtggggggc cgcagcgatt 6180 gatgcccgca cgctgcgctg ctatcaaatc gacgtcaacc ctgggcaaca cggctgtaac 6240 accatcacac gctgcgagct ctcggccata caccaagtgc tcaagctaga cgaagtgcaa 6300 gcgctccccg aagtcacagt atacactgac tccgctgcgt cgctgttcct catccgcaaa 6360 acaatcaacc agccccaact cctcaccttc agcaaacaca aagcgctgct ggataacatc 6420 ctcgcgctac tgaaacgccg ctcccacgcc ggccgccgga ctaatctcct caaggtcaaa 6480 gcacattcag ggattgcgca caatgaactc gcagatcagg cggcggtgaa aaccgccaag 6540 ggggaacaga tacccggggc cgagtggctg gtcgaggaat ccgataacga cccgcgcagc 6600 aaatactact ggatacgtaa aataataaaa ccgactgtgg aaggggtgcc acgagagaag 6660 aacgaaggac actacgcgca tgacctcaac cgcggcatcg ccacgctcgc ggccccgcac 6720 atcgcccaag gccacacccg cgaaaccctc tacactggct tcgtygccaa cgccatccca 6780 gacctggata tcaaatccag cacccgaacc atgaacctgc aatcaggccg gcaccgcaag 6840 gccaaacgct gcatggacta cctgtacggg caactgtgga acaacaaact cgcagcccgc 6900 ttcggccgca cagttacagg ccgccgagcc ccccgcgctg gccccgatac ccccgcccca 6960 tgcccactct gccgcggacc cgactccggc ggccatatac tgggaggctg ccacaacccc 7020 caactccgtg caatgtacat taaacgacac aacattgcag tccagaccat cgcacgagcc 7080 atctcccgtg gtgactgcgg cggctgctac atggtaatgg acgccacgtc taaagccgat 7140 ctgccagact acgcggctga caaccgcctg ccgccgtggc tgctccccga ccaggagggc 7200 aaaccagcag gacgccacct acactaccgc cccgacatcc tactaatccc ttccatctcc 7260 ctcgccgcag ccctaaaccc agactttgtc gtgcttccca gcgaacacga caccatccac 7320 atcatcgaag caggatacac cgccgacacg aaccacgccg ccaagcaaca cgaaaaagca 7380 caacagcacc aggcccttgc agccgacctg cgggaagcgg gatggaaagt gcaatacacc 7440 ccgcagagcg ccatcagcct cgggtttgca ggcacaatcc gcaaggacct ccaccccctc 7500 ctcacctccc tgggcgtcac ctcccacaaa gccaggcagt gctgcgacac cctacacgac 7560 cacgcagtcg ccaccctcaa ccacatagtc ctgaccagac ggcgcctcga acggggactg 7620 ccacctggcg acccgggcgg gacgtaacat cctggcgggg tgcaggcctg gcctgcacgc 7680 ccgccttcac atgctcggta gtgggggcgt acaggaggcg ggcctggccc acctcctcat 7740 ctccccccac tcccttaatc cacaccactt gcctacggca gggtgtctct atttctattt 7800 ctatttc 7807 // ID Gypsy-11-I_CR repbase; DNA; CHL; 10206 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-11-I_CR is an internal portion of Gypsy-11_CR LTR DE retrotransposon - a fossilized copy. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 4-bp TSDs; Gypsy superfamily; Gypsy-11-I_CR; KW Gypsy-11-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-10206 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-11_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 6(5), 230-230 (2006). XX DR [1] (Consensus) XX CC This is an internal portion of Gypsy-11_CR LTR retrotransposon. CC The internal portion is corrupted by mutations. It encodes one CC long polyprotein composed of gag, reverse transcriptase, PHD zinc CC finger, and integrase. XX SQ Sequence 10206 BP; 1994 A; 3412 C; 3117 G; 1683 T; 0 other; tcaagaagtc gttatgctga cagagcgtag aaaaagaaga gttgtgtgcg agtgccacac 60 gtacacacac gcacgcatgc cgtcgaacca accccatgtc gatgccacct ggcctggggc 120 tgctctgttt acatcgctag aacactttgc aagatcgcga gcccctatct gtgctgggct 180 ggtagagccg ggtggggtgt cggcacattc aggattgcca aagtgggggc ggttcggcaa 240 gcccggattg ccatcgactt cctcatcgac ttctcttcgt cgtcgtcaag atctatatct 300 acaagcttga gtatttgact acaactacag cccgagacct atttcgtgac tacgtatttt 360 ctgccaaact catcggcaaa tacttctgct tccagttttg agcgattcct ctttgtccac 420 aacatcgtgt ggactctgga agtgctcata catttacagt gagactctta ccgactcctg 480 agcatctgcg acctttggcc tttctgtcct tgacgcgtcc aacggcgttc ctttgagtac 540 tttctggttc aacaacgtcg tctttgacta caatatacca agtgatacta ctggtgcacc 600 cagcctaaac acacacacat ccgcatcggt tggtcttgtg ttaggcggtc acagccctac 660 cgctaccacg gcaaccctgt ctctacagcc gtccggtagt cccttcgttt ccccagatcc 720 tctggtgggt ccccctgacc caagcggcac cccgaccaag tcgtcggccg ccaaccggcc 780 ccgcccgggg ccggccatcc cctccactcc tcctccatct tcgggccttc cccccgtcga 840 catggcctca ctgctgaccg acgcgatcgc cagtcttggc cggtggcagg cctttcgcag 900 ctatgacgga gaatccagca cccacgagtg gctggaagac ctcgaagaga ccatggacaa 960 gatgggctat cggcagagca gcttcgtcga gcgtgcctca gagaagttgc aaggcgaggc 1020 caagtcctgg taccgctggt ggcgtgagta cgaccagccg ggcccactgt cgtgggagga 1080 tttcaaggcc accttcctgg acgagtttgg cctatcgacg tcagactggg aggagcgcct 1140 ggtcacctgc cagcaatgcc tgcaggagcc cacccaggcg tactccaacc gcttcaccac 1200 ctacgcccgg cgcgccaacc gcatgaacga cccctccctg gtcactgtct acgtccgcgg 1260 cctcaagaag tccctctccg aggccctgga cctgttcgga cggcgcaccc ctacctccac 1320 gctcaaggag gcgatgcgtg cggcagccaa cctcagccgc tgcaactggc gcgcgctgct 1380 ggctgacgaa gaactcccgc ccgagcctgc cggccgacgt gtgcgctttg aggactcacg 1440 cacgaccgac cgcccccgct caccctggac tcgcaacgcc gacgtacgcc gtgaggagga 1500 gcgacggtac tcgcgcccca gcagtaacgg cagtagcagt ggcagtggcg gcggccaccg 1560 gggcggcagc agcaccgcgc ccgttggtgg cggcggcggc aacggaggcg gcggcaaccg 1620 tgaccgagac cgcaacgacc gtaacttaca ctgctggcgg caggctggac gagaccccgg 1680 cctccaagct gccacgcccc cacgggccca ctggcgggcc aagccgaccg ctcttgcagc 1740 gtctagtgac ggcaacggcg cctccaccgc gcccagcagc gggactgtca cgggcgcctc 1800 cgccccaggc ggcggggcct gcgcccgcag cgcgtggtgt gcgagaggcg gcggcacggc 1860 cgctgcctga caccgaccgg ccccgtcccg cacccaacac gagggcaccg ggagcagggc 1920 cagcaccagc acgcgaggag ccgcgcgctg tacccctgcg ctcctctgcc gcagcggcga 1980 cgcaacccca gcgagcccta acccctgacc ccaagccccc ggcaccgcgg gtgccgccgc 2040 gcacagaccc tgtgacgcgc acggaggagg ctcaggtggc tgaagacgtc tttgccaaga 2100 ttgcggctca ctctatcacg ctggccaagg cggcgcagtg tgacagccgc gctatcctgg 2160 ccaaggtggg cggcaagtgc ctgggcctcg ctcgcagtgc cagcggcggc gccggcggca 2220 gcaacaccgg cgtcaagagc ggcctacttg ccgaagccca acctgcggct cctgagcacc 2280 gcagtgacgg cgaggaagac agccctggcc ccaagccccc gcagaaggag taccagatct 2340 gccgcgtgcg ggcggcgtgc ctggacatca acggcaacga gaaggagcaa cgcacgtgtg 2400 ccgacactgg cgcttccaac tgcgtcatcg ctcgtcgcgc cgtatccggc ctgggcctgc 2460 tcaactacat tcgcgacacc cgcaccaagt tctaccacgc caacggctct gaggaccggg 2520 ccaagggcaa ggttagcgtc acggtgtgcc ttcccggcct cacgcgggct gtggagatgt 2580 acgtgtcaga cgccgagaac tacgacctgt tgttgggcaa cgacttcctg gggcctctgc 2640 gtgtggacac gcgctacagc aacatgaccc tggagtacga cgaccactcc gggcaacggc 2700 gggccgtccc aatcaagatc acccgcagca gtgggcgtca gcgcaacgtg ccgttagcac 2760 ggctggacac gcacgcctac gtggtgtcca ccacctcaag caactacaac agtggcgctg 2820 aagaggcggc ggcgtgtgag gaggcgggtc ccgacaccac cccctcgccg ctgcagccga 2880 cggcggcagc cacaccgccg gcgttggggg cccacgcccg gcggctgcca cggcgccacc 2940 acaccggtgt gcggccgaca cagctgttcc tgctggacgg caacggctcc tccccctcgg 3000 ctatggcggt ggacgcaccc acccccgatg cttcggactt ggagccggtt gtgccaacgg 3060 cgggcagcga caaggacggc accagcagca cccaggctga gctaccaccc gccaacatgg 3120 agaccaccag tgagcccgcc accggtggcg ccggagaagt cttgggggtt ggtgataccg 3180 tgtgctgtga gccgctgttc ctcaactcca aggagataac ctctcaagac ctcacgggcc 3240 tggtggcgat gctggcagac acgccgccga aggaggcagc ccctactccg gctgccccaa 3300 cgaacagccc aagagccaag gtggtgcgtc agctgttcag ttggccagcg gacgaggcgg 3360 cgccacccga agacgcgatg tgcttcttgg tggcagttac caccccaacc gtccagcagg 3420 ccaacccgag ccacgtcaca cttgcggagt ggctggacaa ggcacctgac gtcaccagcg 3480 ggctggagcc aatcctgccc ccaacggcca cgccgctacc ctctggcctg agcccacagg 3540 agaagcagca gcggtggttg caagcctgca ccccgccggc ggatatgcct gtccgcatgg 3600 tgaccaccgc caaccccgcg acgggtgccc aacaccctgt accagaggct ttcgcagcag 3660 gagtcaacga cctgccgctc aaggagatac tggtggccgt ggacggtcac tttggcggtc 3720 tgctggtgag ccctgagaag gaaggcctag tccctctctc accattcacc gcaacggaca 3780 gcagcggcag ctacctgtcc agggccgact cccccggctc ccctcccagc tacatgggcc 3840 tgctgcaaga cgagggtgac gactggtacg gcggcgccac cggtctaccc ctgccacgcg 3900 gcaagcgtgg gcggaaggcc tttgtattcg tggccgactg ccacagccca ctgtccgtgt 3960 ggagcggcat gtcaggaggc gcggagtctc cccccctgat attcacccag ctgtcggagc 4020 tggacgacga atgggaggtc ccacccctgg cgccaccgtc cgacgacgag gacgaggcgc 4080 ccgcacttat gggggcacat gatgagtacg acccggcagt gggctggagt cgcggcggca 4140 gctcgcgctc tccacccacc aaccaaagtc cctttggcgc caacttcagc ggccgtagtg 4200 gtggtgccac cagtgacggc agcggcggcg gcagcctcag cagccacgcc cacgccgacc 4260 ccatggacgc agcggtggtg gagctcttcg tcatggacgc tgacggcttc cttcgccgtc 4320 cctcccgtga tgaggttctc tctctcccac cgacagagca cctgcagctg gctcaagagt 4380 atggccggcg ccaacgggag ttacgcggcc cgccagacca ggggttcggg gtgtccaact 4440 cggcggttgc gatcggcaac cgcccggatc cggcctgtct cggaccctgg aagtgggccc 4500 cgcgcaagcc tgccagcgca gtctaatagc ggacacatag aaaaatagtc ctgacgtgtc 4560 gggtggctaa aatgtgtgct tctatatatt gccgatagta gaatcagcat gtgctcaacg 4620 tcgctgccat acgcagcctg aagcgattcc tatgccatga tattgcaatt gttgctgcga 4680 caaggcacct ggagtcggct tgccgtgggc agggtcgcca caaggggcca aaaacaccgg 4740 ccatgctggc aattttgggt cgtttcccag aaacgaagca agcagaactg tttcgtcatg 4800 tccctgggga caaatacctg gtactgtccc cttttatgtt tctagctcga gcgaccagct 4860 aaatccaagc ctacatggcc tcaagctgcg ccgctcaagc accaaacgag cgcgcccggg 4920 gcttgctcgc tctctgtcct tcacgcggtt actatatatg tcaaatgagc ttctttcggc 4980 tgctcgcact cgttgtcgac gcccctaaag agctgatccc aactcccgaa acccgcgcgc 5040 agctggtgta gacgtgatcg ttgcagcatt ggcaggctct catgcgcaag ctatttgtta 5100 tgccataatc cgtgttccta cgggcctcat tactcaaaaa tccgcttgga tgcagtcgcc 5160 caacttcata ctcccactcc gagactggtt aatgtcattt agatatcatt tacgcgtttg 5220 tctacgcatg agaggacaac caaaaccaac agaacgaaag aaagcttcag gtgttggtga 5280 acatggtgcg cgcgcgatcg gcatttggcg atcggcgttt gtgtgccttt ttaatcggcc 5340 gagatgttct atgcctgaat catggctaca taagatgctg cgcactctac agcgtttcag 5400 aaacgtcagg agaccctcta tcgatgtcga ctcttcctcg caaaccacga taatgtattt 5460 ctgagcccat agaaaccagc tatatgtata cacatgttcg tttgccgatt gttagcgtca 5520 aaaagagccc ttcaagggaa gaacctgctg cgcgcgcctg tccgaacgcg cattcgcgct 5580 ctactgtgct ctgcgcaggt ccccaaccag cgcgcatgga cccaggcgta cagttatcat 5640 acttgggggg aatgagcatg cgcggggtcc aggggacgcg gtttcaaggc ttcatgcctg 5700 cccgcgaagt gctaattcga gagcatccgt cttgactcct gtcaacgtgt caggggagct 5760 taaggatatc atacctatag gtttcaagct tagtcgcacg agggaagcca cttccagggg 5820 gggtcggggc caaatgtcgg tggcaaccga cagctaaccg ccgagttgga caccccgaac 5880 ccctgcgcca gaccccaacg acgcgcgagt gcagaccatg cgcgagcggt ggacctgcga 5940 cggcaccgag ctgtcaccag agctgcagca agtgttgctg gacaactacg acctctttgc 6000 tctggacgac tcacagcttg gccgtacgga ctgggtcgag gccgctattg acaccggcga 6060 cgcgccgccg gtgtgcctca acccctaccg tcacagccgt gtggagaccg aggccatgga 6120 ggccgaggtg cggcgcatgc tgggcgctgg catcatccgc cacagcaact ctgcctggtc 6180 ctcccccgtg gtgatggtgc gtaagagtga cggcacgcac cgcttctgcg tggacctgcg 6240 cggtgtcaac aacgtcacgc agccggtgcg cttccccatg ccacacgtgc gtgacctcat 6300 cgacacgctg gcaccgcccc tgggcgagca gcgcgtctac tcctgcatgg acctcaagag 6360 cggcttctgg caagtgggcg tgcaggagac agaccgctgc aagctggcct tccaggctcc 6420 cagcggcctg tacgagttct gcgtgctgcc catgggtgcc cgcgcctcac ccgctctctt 6480 ccagcgcctt atgaccctgg tgctgcggcc ggtgctggct tctggggcgg gcctgggcta 6540 ccaccccacg ccgaccgagg agacgccacg caagtgctgc gtgctgttca tcgacgacat 6600 ctgcatctgg tcaccgtcgc agcaggagca cgccgcggac gtgctgacgg tgctggacct 6660 gctgcgcctg gccggcctgg tgctgtcgct caacaagtgc aagttcggca cgcccaagtg 6720 ccagtttctg ggccacgtgg tggacggcgt gatcggccac atcaccccct ctcccgacaa 6780 cgtggaggcc atccggtcct accccaagct tcgcaccgtg cgtcaggtca aggccttcct 6840 cggcctcacc tactaccgtg agcaggtgga ccactttgcc aacgtggcgc tgccgctcta 6900 ccagtgcctg ggcaagatgg gcttccactg gggtgagcgc caggctgcgg cgtttgaggc 6960 gctgaaggag gctatcacga gcttccccat cctgcgcgca cccgacttca cgcgcccctt 7020 cacgctggct actgacttcg ccaagacagc ggtggcggcg tgcctgtctt agcacgacga 7080 ggacgggcgt gagtacgtgg tgagctttgc ctccaagaag ctcaacgaca cgcaggccaa 7140 ctggagctcc acggacggcg aggcctttgc ggccatgtgg gctattacca aggtctacca 7200 cgcctacctg tacggcaccc acttcacctg gtgaccgaca actcggcgct cacttacatc 7260 atgaagacca aggacctgac gggcaagctc gcccgctacg cgctcaagct tgcgccctac 7320 acgttcacca ttgtgcaccg ggcaggacgc aagaacggca acgccgacgg gctgagccgg 7380 ctggggcacc tgcgggacgc tccctctgac gaggaagacg aggacagccc acagcggaag 7440 cccaagcgcc cagcggcggc gacggccatc gacctggagt ccctcgccga cgcctttgag 7500 gcggagacgc tgctgttcac ggcgggttcc acgtctcccc cgcgctacgc tgaggtgtgc 7560 gtgctcgagc ggctaccttc gctgaccgac ggtatcggcg gcgagctgtt cgaggacgag 7620 tgggacggct ggccggcgtc gtggcgcgag cggctgggag aggtggaggt tgacggcccc 7680 gcatcacagc aactccccgc cacgccgccc acgtccctgg ccgccatgga cggcagcacc 7740 agcgccccca accccatggt ggcgttcaac cttcccggtg agcaggacgc tgagagcagc 7800 cacactccgg ggccaggggc agcaaccgca aagggcgtcg cttcaccacc accagttggc 7860 ggcggcggcg gctacggcgg cggcaaaagc agcggcggca acgtgagcag cgacaacgca 7920 gtcacgccca ccctgaccat tcaaccgctg accctactca gctccactga gctttcaact 7980 gaccactggt ggcggcagtg gtggggtctg ccgaaggcct ctccgtctac cgacagcgct 8040 gacggcgcca ccaccagtac cgccagcggt gacccggtcc acctgaacat ggtggaggcc 8100 ggcgcatccg gtggcgacga ggagcgaccc accagccggc gccgccctct ctccagcttg 8160 gcgagcccag cggcacagac agtgcccggc aacacaaacc aagcaccccc ggtggcgctg 8220 cctccggcag ctacagcagg tggcggcaac gccggcggcg gcagcggcgg caggccccac 8280 cgcaaccgcc aacgccaacc ccgtagcctc aacggcagtg ttgacgtcgg ccttcgcaac 8340 ggctgcggtg ccgactacgc ctccgcgcct taagccccca ctacccggtg cgcccgcgtc 8400 ctccaatgcc ggtaccactg gcaacgcaag cgcaacagca acggcggctc acaccacaac 8460 ggcaacccgc tcggcggcta caccatcagc cacggcaggt gtggcgccag gcaacacgag 8520 ggcagacagc ggtgaagaca gtgacgctga ggatgacggc gacggggcgg cctcagacga 8580 tgacggaggt tacgagtctg cgggcagcgt gctcgagatt gacggcgtgc cgcagttcca 8640 ccccaacgag tcgcgcgtgt ggcaagacca ccgactccta cagttcctgc gcgccgaggc 8700 ttcaaacaaa agtctgcggc ggcagcttga gaaccaggcg gttgagacac gcagcctgca 8760 acagcagaag cagtcgctac agcagcagct gagcaacaca gttcacgcag agaacctcca 8820 actgatgcag caggtgcaga acctgcagca ggagctgacc cacacccggc aggagctgac 8880 ggacgccgtg agcgacggca aacggctgtt caacgagacg gagccctacc tcctctccaa 8940 cgtgctcagc ctgcggcagc ggctgacgcg agcggaggcc gctcccctgg cggccgcgct 9000 cgcaaagctc accggcgtgg aggagaacca ggaggccatg cagctgggcg tgttcaaccc 9060 cacggaggcc gtgtacaaga tcaaagactc ccagcagtgc atccacttct accccaccac 9120 cgcctaccac tttgagtccc tgagcgccgt gaccctgaac gcggcggcgg gagtgctaca 9180 gacgacggaa ggcagccgct gcttcgtgcg cagcgtggtg gaccacctgc agctccgccc 9240 gctgggtacg gcggaatcgc cggaggtggc ggcggcggcc gaggacctgc gcctcctgca 9300 gcgccagctg gccaacccca tggccgctct gctaccggag gatgacgagt acttcgacaa 9360 ggcgactacc gcgcaagaca agctagccaa gctgacgcag cgcgcggcga ccgcccagga 9420 gcgcctgcac cgtgccacca ccatcccttt cgtgtgcgtg cccctgcacg acacaaacac 9480 cacggagtaa gcgcggctgg ggcagcagtg agtaccgagc aaggctggct tcaaacacgg 9540 gacggacggt gcggggagca tgggtgtacg gtatgctgag tacgggggtg tggcggcgcc 9600 agggtggcac ttgcgtacgc caccgcccac cacgaggctt gacgcagtcc tgtcggccca 9660 gtaagaattt aggtgcctcc ataagaagtt aaggcatgtc gggtatgggt tgtttttggg 9720 ttagggcgca gaggggctta cgacaccagt gcctgtggac acgaccggca cgacgcgcgg 9780 cacaacacgc acaaaaaaat gatgggacct gaggagttga acggcgatcc gactgcatca 9840 agttaaaggc tatacggtta caaccatttg tgtgaggttc gggccacaag gcaccgatgt 9900 acagtatggc atgaacgcgg ggaggtgggg atcaccgcgt gtgtgacgtt atacgcatgg 9960 ttacggcggg caactggcag tgtggcaagc atccttcgga atcaaaaaaa gcatcctgca 10020 aggcacgttt gacgtatggg gatgcaacgc catgacctgc tatgtgactt tggagcggca 10080 aggagacggg caaggcaagg cgcttgcacg ggtggtgaac cgagggtgga agcgcgtgcg 10140 tagcagctcc gcgtcggctt aggggccgtg ctgggactca cggcttacac gagggcaagc 10200 gacagc 10206 // ID Gypsy17-LTR_CR repbase; DNA; CHL; 195 BP. XX AC scaffold_567; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy17-CR; KW Gypsy17-I_CR; Gypsy17-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-195 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 104-104 (2008). XX DR Genome; scaffold_567; Positions 4297 4103. XX SQ Sequence 195 BP; 38 A; 55 C; 49 G; 53 T; 0 other; tgtcgaggcg gacacgcggc tccgcatggg caccgtacag aacggtgcgc gctcttaagg 60 gccgctctcc tactttttag gtgtatgtat cgtatcgtgt gtagccttct caagactccg 120 gtgactctca agacttccca agtaagacgc atgcattcta gtatctcctg ggctcctagc 180 atttgctagc ttaca 195 // ID DNA-8-1_CR repbase; DNA; CHL; 3539 BP. XX AC . XX DT 29-MAR-2006 (Rel. 11.03, Created) DT 29-MAR-2006 (Rel. 11.03, Last updated, Version 1) XX DE DNA-8-1_CR is a family of P nonautonomous DNA transposons - a DE consensus sequence. XX KW P; DNA transposon; Transposable Element; Nonautonomous; P-1_CR; KW DNA-8-1_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-3539 RA Kapitonov V.V. and Jurka J.; RT "DNA-8-1_CR, a family of P nonautonomous DNA transposons in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(3), 109-109 (2006). XX DR [1] (Consensus) XX CC This is a consensus sequence of a DNA-8-1_CR family of CC nonautonomous P DNA transposons. DNA-8-1_CR elements are less CC than 2% divergent from their consensus sequence and they are CC characterized by 146-bp terminal inverted repeats and 8-bp target CC site duplications (NNNNNNNC target sites). DNA-8-1_CR is a CC deletion derivative of the P-1_CR transposon. XX SQ Sequence 3539 BP; 737 A; 1008 C; 1077 G; 716 T; 1 other; caggggaggg gggtatgcaa ctcggcctgg atccagcccg ggctgcatgg gacccggcca 60 aaaggtgttc ggacccccca gcgcctgctc ctgagacttt attattgtaa cctggacaaa 120 tgtgtagtca tatgaaccga caggtgtcgc taggaatggg gcgcaggccc ccctgtaaca 180 tttcgagtgt tgggagccat cttcggccgt tttggaccac cgaccgatct atatagcatg 240 gactagtgga atgggctcga aagctagtcg ccaggcgata cgccgcgcac tggagctgct 300 gtgggccctc aaagcaacgg caagtcgggg gtaatcggcc ctttgcgggc tcgctgggca 360 ttctaaggag ctgccagcag ctaccagagt accgcaagga caataagaag ccatggcttg 420 ggctggcgtg gccttagaag tctcaaggac gcggccggtg tggcattagt tcccgcccgg 480 acatgcccct ggcgtcgcga acgtccatat ggcgtcgcct ccggctctcc tgccgtggtt 540 aggtgtcttg gccaaataga catctgtatg aacactgtgc gcctggcccg atatagttta 600 gacgtgcgca gacggtctgc actaaggtcg cgaaattcca ggggtttcgc gttgcggact 660 tgcgtagctg cgctaaatca ctgggacgcc gttrtgtcct acaatcatat gaggttcctc 720 atatacagag ctgattgccg gggccttaga gtggaccggt ggctccaacc agaatgctgg 780 cacattcttg catcgccagg accgcccaag cgaaattccc tggtggtttc gtgtcttggc 840 caactagagg tgcaggggat cctaacgcgc ttgtcacatc gacaaacggt tgagatttgg 900 ggacatgcac taaggtcgcg ttttccgtgc catttcgccg ccctatggcg cccgcccgcg 960 tgccccggta tccgcatacc agcgtgtggg ttgagcgcca aacactgcct gtgaccgtag 1020 ttcggtggta tgaataaatg ggaggcgcca ggcgccaagg ggtggtgtgt aatggcacct 1080 gtgtgccctt tcccatatcg cctttcaccc tccctgtcct gctgcctgcc tctcttccct 1140 gccgcagatt ctcctagtat acctgagcat cagcccacaa gttacaacat tcagtttagc 1200 catgggtcgg gatgagcaga agcagtactt gcactgctac acgtgcggta acaaggtgct 1260 ctcgtacagc ttcatccgcg tctggaacca ccggttcgca ctacctcagc gccaccatgg 1320 agcccagcaa cccagtgctt cccattactc cccagccgcc gccggcgaag gttggcttgc 1380 gcattcgttt cgagtgcgcc cagcacaaca gcactgtgca agcagagcta cagcagggct 1440 atgaggagga gcaggcgcgg aaccagtatc ggggccacta tgcagcacac ttcttgtcgc 1500 gggcttggtg aacaccagca tcgcggctgg cgcagcggca cagccgcgtg tatatggcac 1560 ggaggtacag ctgcatcagc aggcgccggg gtcaagaagc gctgtagtgc ctgtggctgc 1620 tgctgcagcc gctgctccac agcagcatgt gctgcggcgc agccaccgca ggaatgcttc 1680 cagcgtcagc ggtatcgacc ttgacacttt cgacccgcca ccaccggata agcagttgga 1740 gacgcctgct gcgccgtcac cgccagcgcc ggagcagcta acagctgcac ccgccacaga 1800 ggacgccgtt gctgcagttg tgacggcggt gccagaaccg gaggcagctc agctgcgccc 1860 agtcctgcgg gccgtgtcag gctaatggac tggcaccacg ccatctgcat gcgagctggt 1920 gctgtatgct atcctccgac ttcacttcgc agacgttgtg gcggtcggca tcagtgtggg 1980 ggcggccggc ttggccgcgt tggagcccgt ggctcccaag cccgggcgcg ccaagcgcgg 2040 gctccccgca gaggcaccgc ccgcccagcg cgagggagtg cgcaaacggc tgccatctgt 2100 gcgcctgaac gactgagtgc attggtttct gcagtgcatt ggttactgat gtcgtaacca 2160 ccatatgatc tagcaagata gagctgtgat gggcataccc aattctttgc aagcttttcg 2220 catgtactac atgaatgaac acagcagtct gaacgccgcg cccggcacgc atacagcatg 2280 cgcgcgggac tccatagtga ggactgatcc aaatgtcttt attatgtatg tacgtcgtgt 2340 agcacatcac ctagaagtgc atgtagcttt gcttcgggcg cgcatgcgta ccgtaaggaa 2400 ccttgacatt catggccgat attctctcat ttcttagtac cagggaagcc ttccctagta 2460 ccgtacagtg cagtatggag tcggaagtga acaacgcaat gagcacggaa gggaagaacg 2520 ccgtggccgt acaatgcagt atggagtcgg aagtggaaag aagtgaaggg cagcatggct 2580 gagagtgcag cgtcagcaca gcatgtagca tagaagcgat gacacggttt atgacttgta 2640 catgtacaca gaggcaggtg atattaaagg cggcagtgag ggtgttttgt ttatccgcac 2700 ggaagcgtgg agatgggctg aggcagcacg cccctggcca accaagggat gcacacagta 2760 ggcatcaggg tagacgacca tacgtggcca ctcacgagat gcaggtacgg gcagggtagg 2820 catcagggta ggtagtacac gcatggtggg ggtgttcata catgcacaag gagaccggag 2880 catgagcaca ggagtcacga gtcccccggc caggaggtgc ataccagcgc agccgtgctg 2940 gcgcgcacaa agcacagtgg aggcatgtaa cgaactttta gcagaaatcg tgtgtttcac 3000 ggctgtgggc ttcacgcttt gcgcccaggg gtaaggtgcc tcgtccaggg gccctcgcct 3060 gaaagcgcgg gacgcacgcg cgcgctgaca ccaacagaca cggaccaaac ccctaaatat 3120 ggcatcggga cgtatgcgta gaggtgtggg caggggccgc tgggttctct gggcctcgga 3180 aagggcacca aacggccatt tccgcgaagg gtctgaaggg caccgggcat cgctcctgag 3240 aggccgcgcc cttcgctttc tggcgttcac tttaattcaa cagggagcag caggccttct 3300 gagttgatca caagtgtccg gaatttcgtt ccccgtgttt caagcgcgcg cacattggcc 3360 atggccgact cactcgacgc actcgtttct gggcacctgt cggttcatat gactacacat 3420 ttgtccaggt tacaataata aagtctcagg agcaggcgct ggggggtccg aacacctttt 3480 ggccgggtcc catgcagccc gggctggatc caggccgagt tgcatacccc cctcccctg 3539 // ID DNA-3-7_CR repbase; DNA; CHL; 857 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE DNA-3-7_CR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; 7-bp TSD; DNA-3-7_CR; KW unclassified superfamily. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-857 RA Kapitonov V.V. and Jurka J.; RT "DNA-3-7_CR, a family of nonautonomous DNA transposons from the RT green algae genome."; RL Repbase Reports 4(2), 33-33 (2004). XX DR [1] (Consensus) XX CC 100 copies of DNA-3-7_CR are present in the C. reinhardtii CC genome. They are ~98% identical to the consensus sequence. CC DNA-3-7_CR is characterized by 13-bp TIRs and 7-bp TSDs. XX SQ Sequence 857 BP; 188 A; 264 C; 215 G; 190 T; 0 other; cagggggggg gtacgaacta aacctccaga ccccatgtcc caaaacctcc cgaccccccc 60 tccggaccga ccgaccgacc cccaccaccg tgcaacccct caaaaccccc tcaaaaccct 120 ttggttttta gccaggtcgg tgtcaatcga ctgctctcgg cgagctctac aattgtatac 180 ctccggttca gctggaggca caacttgact ggcagccatg ccacacatgt ccatgtgtcc 240 atggccgacc accgaccctt ctgtcagact accggggcgt gcgcttaagc ttacaattgc 300 gctgtacata tgatgtttgg gattgtgtgc cggaagagat tggtcgactt tctcccggga 360 tcatgtgaag aatcccggga ttcatgtcac gctcgatggc cggcggcagc cttgtcgcca 420 tgacccacac gaagccttgg ccgttacacg ttagcatcag atcgcatagt ctccatgaca 480 ccagtgagag acgctcgagt ggggctctac gctactacga tgccgatgtg gacggacatg 540 cggccgcttt ccagaggcca aacgttctgc aaatgcttta gtatgtgcat gctaatgtgt 600 gctagtactt gtgctaatgg gcggagcgtt tagaggggcg tttagggcgc agagacgaaa 660 tctggcagtc ccacccacat ccgcgtgacg aaaccttggg gacccccaac cccgacgacc 720 gaccctgaac tttgagggcc cataactttg caattactac cctgattgaa actctgaaag 780 tggatttgga gtcctctcgt caagctcttt cgttttagaa catggggcct ttcccagtcg 840 aaactacccc ccccctg 857 // ID MSAT-1E_CR repbase; DNA; CHL; 120 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE MSAT-1E_CR is a minisatellite unit - a consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-1E_CR; MSAT-1_CR; KW Interspersed repeat; minisatellite; nonautonomous; tandems. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-120 RA Kapitonov V.V. and Jurka J.; RT "MSAT-1_CR, a family of minisatellites from the green algae RT genome."; RL Repbase Reports 5(3), 56-56 (2005). XX DR [1] (Consensus) XX CC The MSAT-1E_CR is a subfamily of MSAT-1_CR minisatellite. XX SQ Sequence 120 BP; 16 A; 33 C; 63 G; 8 T; 0 other; ctgggcggct gaggcgaagg cggcacaggc ggtgcaggac tgggcggcgc ggggctaggc 60 ggctccgggc tgggaggcac cggcggcgca ggcgaaggcg gagcgtccac aggcgcgggg 120 // ID Gypsy18-LTR_CR repbase; DNA; CHL; 312 BP. XX AC scaffold_48; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy18-CR; KW Gypsy18-I_CR; Gypsy18-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-312 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 105-105 (2008). XX DR Genome; scaffold_48; Positions 502134 502445. XX SQ Sequence 312 BP; 72 A; 91 C; 67 G; 82 T; 0 other; tgtaagcgta gggtgcgcta catgtccgtt acacacgagg gaatgccaat acacacccag 60 agggtgccag acacgcgtgt gggtttcaca cggcggagtg tcccatgcac gagtcacaca 120 gcgtttatca acgctacacc ttcagagcct tacaagtatt cttacacggc gtatacatac 180 gccacctatc cttcggtttc tcatatagtt ccttactctc cgcctagttg cctagcatta 240 caagtcctcc tagctctagt tctctcctgg taagcaccta gtatgatcct gtctagctgg 300 tgacagctta ca 312 // ID Copia10-LTR_CR repbase; DNA; CHL; 216 BP. XX AC scaffold_67; XX DT 22-FEB-2008 (Rel. 13.02, Created) DT 22-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE LTR retrotransposon from single celled green alga: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia10-CR; KW Copia10-I_CR; Copia10-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-216 RA Kohany O. and Jurka J.; RT "LTR retrotransposon from single celled green alga."; RL Repbase Reports 8(2), 100-100 (2008). XX DR Genome; scaffold_67; Positions 484396 484611. XX SQ Sequence 216 BP; 59 A; 49 C; 64 G; 44 T; 0 other; tgttgagata tcggtcagaa ctaagggaga acgtagggga gtccgagagt gcgtgtgcga 60 tacaccatgc aaggacttga cccgggaccc atgcggaggg tataagctca gtgagctacc 120 cgatacagtt agtagtgagg taacgtgtaa tgtgcaaagc agtctgactg ttagccgccc 180 ctactccggc taggatacag ccgtaaactt ccaaca 216 // ID P-2_CR repbase; DNA; CHL; 9309 BP. XX AC . XX DT 29-MAR-2006 (Rel. 11.03, Created) DT 29-MAR-2006 (Rel. 11.03, Last updated, Version 1) XX DE P-2_CR is a family of P autonomous DNA transposons - a consensus DE sequence. XX KW P; DNA transposon; Transposable Element; Interspersed repeat; KW P-2_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-9309 RA Kapitonov V.V. and Jurka J.; RT "P-2_CR, a family of P autonomous DNA transposons in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 6(3), 163-163 (2006). XX DR [1] (Consensus) XX CC This is a first example of P transposons in viridiplantae. P-2_CR CC encodes a 1680-aa TPase composed of the DDE domain conserved in CC all P transposases (aa pos. 453-1024) and PHD zinc finger (pos. CC 1502-1547). The P-2_CR TPase is predicted to be encoded by 13 CC exons (based on FGENESH and comparison with other P transposases CC ). P-2_CR is characterized by 8-bp target site duplications and CC 151-bp TIRs. The genome is populated by several families of CC nonautonomous transposons derived from P-2_CR (DNA-8-5_CR, CC DNA-8-5_CR, DNA-8-6_CR). XX FH Key Location/Qualifiers FT CDS join(261..317,1062..1205,1686..2237,2405..2635, FT 3134..3528,3621..4055,4151..4341,4440..4601, FT 4729..5314,5407..5608,5734..7818) FT /product="P-2_CRp" FT /translation="MERSRQGGRCWLLWALALTAFAAMGVEKGWCVLCGTY FT VKSKDPFATLSTGGMRPDYCNGTNDCCSYVSPPPLSHPSPPPPPAHPSPLP FT PLPPPSPPPPLPPLHWPTTAALPNPPPPPFPPPASSFLPADRPKWTEKVGA FT HMEPPTAVEIIFYSLLNAVFKDTTYKDITLSNVTRTDEVCARVAPLFAYLV FT NLTCTPKQLERFLSDFRTGKGNAAVTAWDRGWKRCTKYGIFIRDFSASAVG FT SFLVELLEQEDPLAVYQSVYETDIPPATILNAAAAAATEAKGGTLEQLPEH FT ERFARIMHSFNGRAQRGDPKWDRAVQDDWHCSVHRNRQASAVATAARKEAA FT AKARIAEQLQSALDAHQHAEAQLAQARKENCQLQARNTSLQADCERYAEEL FT DQSRARMRELSADHSSALLHLLEKPAARQVIEQHPGLAAMWADQLEYLGRS FT ASGGKATRGMRWHNSAIQLGLTVLNKGGEGLFEELRSFMPLPSINTLLAYK FT HFAIDGSGFHRAHAERAAELLRAGKADTRVGLAFDEMKLSRGLTFSTTSDS FT LTGFTDVDAAAYAQRLEGLLDSSGGAASADPDAPSVCRALATHVLHFEVTA FT LGERPTRYSVGYVFTDGIKAVELQSVVSEVIVLLEAAGLQVAYTVCDGASE FT NRAWMKMMADRDLAVQLAKDTEIKLPCPDKDVTECLRCFRSPANPERPVIM FT ISDAPHLIKKMRNAIEKSFGGTGKDGKNTSEVFFPEDVTPSTAPPPTGLPP FT AAPPPIPVPVAAPLPTGLPPAAPPPIPVAVAAQPPTGLPPAAQPPIPVPAA FT PPPPVPTPPPPVTTQHTEPVPPPVPAPRWRPIMWRDVVKCVRWDMHSLPRV FT MGNVTNAMTDLDSWSRMRNHLASKLFAGLEMFCKWTAEMGWVRDNQKAPPQ FT HIIDGARDLRPYVRLALSTKLSTDRVTPVRDASDPHLRQLLHNGTVVYDWH FT QALKAVPPDDDDLRPGVRHGLSHQCQSDFQVTCFGTVALARWYLRTGGTSI FT ILGTINQNCVENAFSQLRGHGQDRAPTAAAVVSGEASLRMNKALAVLNSGL FT GAHKSGSSYQAAHNGSLVAPDAAAAAELLRRRMRVALPGQRKEPAASTWQL FT RDLDTARAEPAAGLQSDARRAQALAQLIAPPLIGKKGPLAENPQLATALRA FT AEAGKPSPAFVQYSCWLLNSTAAAAQDAASKGTAADTFAPVAAARHGPACA FT AAWAWCVRNLTASEPQGSMPQLPLVSKGAAATDAASAAALRGLGDAACLVH FT GACFNAAYSLGVLNAVLMPDAHHGGTKAQQEEKAVHAQVFELYSSTTACER FT VLGWALHSAIKATPEADPTRRILQQFVSGRADGTHRRVFNDELQESLPQGK FT LYVPHPALVSFFVELHPTIIERFQCANMVLQDGAYAVKDLIGRIRCDQGCW FT ERFQAAYQQAAKDSTVGTGSSSHVRAAMGVLVSKYIHANLVGLLRHTQLLA FT GKDDAAARALRDERRADAAKKSKGGKPTKLVDASTCCLVCERPQPAENMLL FT CSRCSSPYHTHCLDPPIKMPALEEEWFCHECSAALTGAAQPLAAGSTSVMG FT AGGAGGRGRGRGRGRGRGQPAADKDAVALAPAASMDSGRGRGRGRGRGRGG FT TGGQDTNTQPMDVDGPGDARMAVPVPEPEPEQLRTQPMRVRFQPKRLQAVS FT PVKPADKRKRP" XX SQ Sequence 9309 BP; 1809 A; 2869 C; 2763 G; 1868 T; 0 other; caggggtgcg gggtggccaa ctcgccccat gtgatcggca acggtcggca accgccccca 60 attcccagtt gcgaccaggc cccggcccga tttctttgca tagacctata aggagtaccc 120 catagaagac ccctggcgcg ccgggcgact agtttactgg cctttgacga aacttgacca 180 tttgaacagc attaccgcat ttccagcctc catgcggcgc ccggtgcgcc aggtgaccat 240 ctagaagtgt tacgaagtag atggagcgaa gcagacaagg cggtcgctgt tggctgctct 300 gggcgctggc tctgacggta gggtcgccgg caccgccctt agtcttgttc cgatgaatga 360 tatggggtat cctttgctcc aatattcagc ctagcgcatc gggcatatca tctgaggcgt 420 cgctggcttt gattgcttca cattttaacg ctctgctgct tcttatacct atatcaagcc 480 tttccgctat tacctatacc atatcaacga ggtggtcagg gtcgaaatct gcaaagcgtt 540 gcccgcgcgt ccaggatttt agctcgcagt ttgaaccggg ggtccgggtg cagaggggcc 600 agcttttggt acccgctaca tgcaccacat gcatcaacac cagaaatcat cgcacctgca 660 gtgcactgca gtttaaccaa tgcagtttcg aactcggggc gccatgcaac attccttgca 720 cgtcataaat tgtttgcctt ggctcgttcc gctatttcta gtatactcag gatgccgatt 780 gactggatgg agggccgagc gcacctacgg cacgccagcc atgtcagctc gcagcatcgg 840 gatgcactgg ccccgggcaa acgcacgcat acagccatac ccacattgtg ttttgttatc 900 atatgagcat ttggtcaaca tagggctagt tccttgtggt taacaacgcc cggtctggtg 960 gtggcgtgcc gattgcatgt tgacccacat tcctagacgt gggcaaccat tacttgctgt 1020 tacctgaggt gtatgaattg ggttctctgt tgtgcatgca ggcgttcgct gcaatgggcg 1080 tcgagaaggg ctggtgcgtt ctgtgcggca cgtacgtcaa gtcgaaggac cccttcgcca 1140 ccctttccac gggcggcatg cgccccgact actgcaatgg aaccaacgat tgctgttctt 1200 acgtggcaaa cggcagaaac ccccagccag tccacctgca cggccgcttc catgacgagc 1260 agcacctgaa gccgctggcc attaccctcc ctccttctgg gtcaaagcaa ggcacacgga 1320 tctgttctca gtgtgccgtt gacaaccgcg gagagcacaa gagactgctg cacgcgcacg 1380 ttgcggcagc acaggcgcca ccacaaccag catccgtgat gcaagctgca agtgcggtca 1440 acctcctcat cgccattgac gctctgtcag ttggcatcaa caccgacacc agtaaccaca 1500 gcgggcctgg ccttggcctt ctcaaccgtg cggcagtgga gagcattcca gctggtggac 1560 cctcgccatc tgcaccaatg ccgccaccct ctgctgtcgc cgcgcctgta cctatgcctc 1620 ctccacccaa gtcaccgccg cctcctttgc cgccgctcgc cccactcccc ccacctgcgc 1680 cccagtcgcc gcctcctcta tctcacccct caccaccgcc ccctccagca cacccctcgc 1740 cgctgccccc tcttccgcct ccctcaccac cgcccccgct ccctccactt cattggccaa 1800 ctacggccgc actcccaaac ccgccgccgc cacctttccc tccgcctgca tcgtcctttc 1860 tgccagcgga tcggcctaaa tggactgaga aggtcggtgc gcatatggag ccgccaaccg 1920 ccgtggagat tatcttctac tctttgctca acgccgtgtt caaggacacc acctacaagg 1980 acatcacgct ttccaacgtc acgcggacgg acgaggtttg tgcgcgcgtg gcgccgctgt 2040 ttgcgtacct ggtcaacttg acctgtacgc caaagcagtt ggaacgtttc ctgtcagact 2100 ttcgcacggg caagggcaac gctgcagtaa ctgcatggga ccggggttgg aagcgctgta 2160 ccaagtacgg catcttcatt cgtgactttt ccgcatctgc cgtgggaagc ttcctcgtgg 2220 agttgctcga gcaagaggtg aatggctctg gtttgggctt tgcaccgggc tctgatgggt 2280 gcttgggcaa gggttgggtg tatcagtatg caagagctgg aattgtctgc aggtgcgctg 2340 gatgcacgcg tcggtctgcc aaacacattg ctaactcgtt gctgtgctca ccattgccct 2400 gcaggatccc cttgccgtgt accagtctgt gtacgaaacg gacatcccgc ccgccaccat 2460 tctcaacgca gcagcagccg cagcaacgga ggcaaaggga ggcacactgg agcagttgcc 2520 tgagcatgag cgctttgcac gcatcatgca cagcttcaac ggcagagcgc agcgaggcga 2580 ccccaagtgg gaccgcgcag tccaggatga ctggcactgc agcgtgcacc gcaacgcaag 2640 tgacggcttg gtgttgcggt ttcaacccag caacagctgc gcgcacgacc tcaactcgct 2700 tcagcttcat ctgtaccccg acttcacata cagcattttc agctctcatg caacagtgca 2760 gatgccgcct gacggaccca gcagcccgcg cctctgcacg gagcagcacc tcactgacgc 2820 cttggtgttt ctgggggctt gccgcccctg tgccggcggt cccactgcag ctgcctttgc 2880 accatacctg gcgcacaagg cagcttgcaa cgagacggag tcagttgtgc tgcagcgtta 2940 ccgggctgag tcgcgcaccg tgcaccttcc gtgcagccgc gccagtggcg agactgctgc 3000 cactcagaaa cttgacgtgc atcccagtat cgcgtgccat ggcttggtca gcggtcacac 3060 ctatgcaact cgctgcccat gctgcgaagc tcacaaccac tcgctcaggt cgaccctctc 3120 gtcatttgcc cagcgccagg cgtcagctgt cgccacagct gcaaggaagg aggccgccgc 3180 caaggcccgg attgcagagc agctgcagag cgcgctggac gcacaccagc atgcagaagc 3240 gcagctggca caggcgcgga aagagaattg tcagctccag gcgcgcaaca cgtcgctaca 3300 ggcagactgc gagcgctacg cggaggagtt ggaccagtcg cgagcccgta tgcgggagtt 3360 atcagctgat cacagctccg ccctactcca cttgcttgag aagccagcag cacgccaggt 3420 catagagcag caccctggct tggcggctat gtgggctgat cagctcgaat acctggggcg 3480 ctccgcgtct ggcggcaagg caacgcgtgg aatgcgctgg cataacaggt ggggctcatt 3540 ggagtcggga aggggtgggc ggggctgggt gcgtgcggga tctggctaac agtaccgtaa 3600 ttcgtcccgt gcccttgcag tgcgattcag ctgggcctga ctgtactgaa caagggcggt 3660 gaagggctgt ttgaggagct ccgatccttc atgccgctgc catctatcaa cacgctgctg 3720 gcctacaagc acttcgctat agacggctcc ggctttcacc gtgcgcatgc agagcgagca 3780 gcagagctcc ttagggcagg caaggcagac acgcgcgttg gccttgcttt tgacgagatg 3840 aagctgtctc ggggcttgac gttcagcact acttcagatt cgctcacggg ctttactgat 3900 gtcgacgccg ctgcgtatgc acagcgcctg gagggtctac tagatagcag cggcggtgct 3960 gcttcagctg acccggatgc gccgagtgtg tgccgtgccc tggcgactca cgtgcttcac 4020 tttgaggtca cagctcttgg ggagcggccg acacggtgag ggctctcccc gtgcacgccg 4080 gacgatgtgt gctccaggct gcgtgcctcc cgttcatgac caaactaccg tacgttcctt 4140 ccctctgcag gtactcagta ggctatgtgt tcacggacgg catcaaggcc gtggaactcc 4200 agtctgttgt gtccgaggtc atcgtacttc tggaagctgc aggactgcag gtcgcgtaca 4260 ctgtttgtga cggggcttct gagaaccgtg cctggatgaa gatgatggcc gacagggatc 4320 tggccgttca gttagccaaa ggtgcggtgt cgtgcagtga tggcgcgtca cacgtgttgt 4380 tagcctcggg tgcaaccaac ttctgacttc cacttcccgt ttattgccct ttcatgcaga 4440 taccgaaatc aagttgccct gtcctgacaa ggatgttaca gagtgcctac gctgtttccg 4500 cagcccggcc aaccccgaac gccccgtcat catgatttcc gatgcaccgc atctgatcaa 4560 gaagatgcgc aacgccatcg agaagagctt tggtggaaca ggcaagccgg gcgccggtct 4620 tgtatgtagg aggtggtcgt gagtgcagac cggaggcatt ggccatcgca aaagttctgt 4680 accagtggta gatgacacat gacgcagctt gctcatatct gtccacaggc aaggatggga 4740 agaacacgtc tgaagtcttc tttcccgaag acgtcacgcc atccaccgcg ccgccgccca 4800 ctgggctgcc gcccgccgcg ccgccgccca tccccgtgcc cgtggccgca ccgctgccca 4860 ctgggctgcc gcccgccgcg ccgccgccca tccccgtggc cgtggccgcg cagccgccta 4920 ccgggctgcc gcccgccgcg cagccgccca tccccgtgcc tgccgcaccg ccgccgcctg 4980 tacccacgcc accaccgcct gtcaccacgc agcacaccga accggtgccg ccacctgtac 5040 ccgcgccccg atggcggccg attatgtgga gggacgttgt taagtgtgtg cgctgggata 5100 tgcactccct cccccgtgtc atgggcaacg tgaccaacgc tatgacggat cttgactctt 5160 ggtcacgcat gcggaaccat ctcgcgtcaa agctcttcgc tggacttgag atgttctgca 5220 agtggactgc ggagatgggc tgggtgaggg acaaccaaaa ggccccacca cagcacatca 5280 ttgacggtgc ccgagatctg cgtccatatg tcaggtgcgg agcattactg ggcgctttgt 5340 ttgggggctt tgttttactc gccccacgta gattcaagcg ttctttctcc taattccctt 5400 atgcaggctt gcgctcagca ccaagttgtc aacagatcga gtcactcctg tcagggacgc 5460 gagtgaccct catctgcgcc agctactgca caatgggacc gtggtctacg actggcatca 5520 ggctctgaaa gcggtgcccc ctgacgatga tgaccttcgg cctggtgtcc gccacggctt 5580 gtctcaccaa tgccaaagcg atttccaggt gcggcaatcc aggttgcagc tcgtttggcc 5640 aggtgcatgt tcatctaata agcccgggat gttggcatgt tcgccctatg tcgcgcgtga 5700 ccaacatgtc gctcttcacg tttgtgcttg taggtcacct gctttggcac ggtggctctg 5760 gcccgatggt acctgcgcac cggtggcacc agcatcatcc tcggcactat caaccagaac 5820 tgtgttgaaa acgccttcag ccagctgcgc ggtcacggcc aggatagggc acctacggcc 5880 gcagctgtgg tgagcgggga ggccagccta cggatgaaca aagcgctcgc agtgctcaac 5940 tccggcttgg gtgctcacaa gtccggtagc tcgtatcagg cagcccacaa cggcagcctc 6000 gtggcgccag atgctgcagc ggcagcagag ctgctgcgcc gccgcatgcg tgtagccctg 6060 cctggacaac gtaaggagcc tgctgccagc acatggcagc tgcgggacct cgacacagcc 6120 cgcgcagagc cagcagccgg cctacagtcg gacgcgcggc gcgcgcaggc cctcgctcag 6180 ctcattgcac cgcccctcat cggcaagaaa gggcccctag ctgagaaccc ccagctggcg 6240 accgctctgc gtgcagctga ggccggaaag cccagcccgg cattcgtgca gtacagctgc 6300 tggttgctca actcaaccgc agctgcagct caagacgcgg ccagcaaagg caccgccgct 6360 gacacgtttg cgccagtggc cgcagctcga catgggccgg cgtgtgctgc ggcgtgggca 6420 tggtgcgtgc gcaacctcac agcctcagag ccacagggat cgatgcccca gctcccgctg 6480 gtgtcgaaag gtgctgcagc tactgatgcc gcctctgcgg ccgctctccg gggcctgggc 6540 gatgcggcct gcctggtcca cggggcctgc ttcaacgccg cctacagtct gggcgtgttg 6600 aatgcggtcc tgatgccgga cgcacaccac ggcggcacaa aggcacagca ggaagagaag 6660 gcggtgcacg ctcaggtgtt cgagctgtac agcagcacga ccgcgtgcga gcgcgttctg 6720 gggtgggcgt tgcacagcgc gatcaaagcg acacctgaag cggatcccac gcgacgcatc 6780 ctgcagcagt ttgtcagcgg cagagcagat ggcacccacc ggcgcgtatt caacgatgag 6840 ctacaggagt cactgccgca ggggaagctg tatgtccccc atccggcgct agtcagcttc 6900 ttcgtggagc ttcaccccac cataattgag cgctttcaat gcgcgaacat ggtgctgcag 6960 gacggcgcgt acgcagtaaa ggacctgatc gggcgcatcc gctgcgacca gggttgctgg 7020 gagaggtttc aggccgcgta ccagcaggct gcaaaggaca gcacagtcgg cacaggcagc 7080 agctcgcacg tccgcgcagc tatgggtgtc ctagtcagca agtacatcca cgccaacctg 7140 gtgggcttgc tgcgccacac ccagctgctg gcaggcaaag acgacgcggc agctcgcgca 7200 ctgcgggatg agcgtcgtgc ggacgccgcc aaaaagtcca agggcggcaa gccaaccaag 7260 ctggttgacg cctcaacctg ctgcttggtt tgcgagcgcc cacagcctgc agagaacatg 7320 ttgctgtgct ctcgctgcag ttcgccgtat catacccact gccttgaccc gccaatcaag 7380 atgcctgcac tggaggagga gtggttctgc catgagtgct ctgcggccct caccggggcg 7440 gcacagcccc tggcggctgg cagtacgtcg gtaatgggtg cgggcggtgc gggcgggcgg 7500 ggccgcggca ggggcagggg caggggccgc ggacagcccg ctgctgacaa ggacgcggtg 7560 gcactcgcgc cagcggccag catggacagt ggccgagggc ggggccgtgg acggggccgg 7620 ggccggggtg gaacaggcgg acaggacacc aacacccagc ccatggatgt tgacggacct 7680 ggtgatgcgc gtatggcggt gccggtgccg gagccggagc ccgagcagct gcgcacgcaa 7740 cctatgcgtg tgcgctttca gcccaagcgc cttcaggccg tgtctccggt gaagccggct 7800 gacaagcgca agcgcccatg agcatatgtc tcccaaatgc atacatgttt acagtatttt 7860 catgcattca tgatggctgc tttcatactg agcaatgaaa gtgaagtaca caacagcagg 7920 ctgggacctt ttacatgcat ttgattgtgt gacgcgggat gtgacgcagg agcagcattg 7980 tagtatgcgc acatacacca cgaataagta acatagtagc gcgctgtacg cgatatgatg 8040 tcgactagta cttggtcggg cgtggcacga ataaaacaag tgcagcaagt gcatgacaag 8100 cagggattgc cgcacacaca ctttatggca gtggcgcaca cgtgttttcg acgtggtttg 8160 tttatgcagc tgtacatgtc tgctgccatt tcaacacacg gcatactagt ggcccctctg 8220 tacatctgga atgcaaccgt gaactggtga tgtgggctgt gtcagtcata gcggccttag 8280 tggcctgacc atgtatgtac acgtacgcgc tgattgcgcc ctggattcta gttgtgccac 8340 gcaccccttt ttaactgcag tagtaaccta gtgcaaatga cgtacgatat ggtctgagga 8400 gtggtatgcc acgttcgagg tgggttaggg gtatgccacg ttcgaggtcg gggaggtgct 8460 ggaggggagg tgctggaggg gatggagagg tcacagggca agcaatgcag atgcatggga 8520 agtgttagag gtatatgtgt atgtacggta tgcgtggatg catgcaaagg gcgcatagag 8580 catttcatcc gcctgcaacg agtgaaaagg actaggagca tggagtggca ggggtgactg 8640 ggtgcgcgtg gatgcgctag caacggacct ggtttggcac atgtgtttgg gcgctcaact 8700 tagcaagcca gcactgtgaa tccattatgt aaccaacaag ctcagccggg tgcgcgggga 8760 tggcttcgtg ccggtctcgg ggaatgagag gggcacagtg cccagcggct taacacagcc 8820 tgcacagggg ggcgcgaagt tggcgacgtc cggcaaggcc gtcctgaacc ggccccactt 8880 gttttatgcg cccaaattgg ttgttacatg tcaagtatca aagcagcgct ggcgcctgct 8940 attgcgcagt cttgcttttg aactccggct cgaaattgca cctcgccggt cgcatggaag 9000 cccctccctg cttgtagtag gcccccgaag tcctgaaaca aagctgcttc agaactccaa 9060 tgtaatacag gtccataggt tgctatcgcg cagctctccc tgcctccaag ccgaggttcc 9120 aatttgcgtc ggcaagcatg ttttcgacca ccgtcatata gtcgcccggc gcgccagggg 9180 tcttctatgg ggtactcctt ataggtctat gcaaagaaat cgggccgggg cctggtcgca 9240 actgggaatt gggggcggtt gccgaccgtt accgatcaca tgggccgagt tggccacccc 9300 gcacccctg 9309 // ID TE2-1_CR repbase; DNA; CHL; 471 BP. XX AC . XX DT 08-JUN-2004 (Rel. 9.05, Created) DT 08-JUN-2004 (Rel. 9.05, Last updated, Version 1) XX DE TE2-1_CR is a family of nonautonomous transposable elements - a DE consensus sequence. XX KW Transposable Element; AAC target site duplications; TE2-1_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-471 RA Kapitonov V.V. and Jurka J.; RT "TE2-1_CR, a family of nonautonomous transposable elements in the RT Chlamydomonas reinhardtii genome."; RL Repbase Reports 4(5), 136-136 (2004). XX DR [1] (Consensus) XX CC TE2-1_CR is a family of nonautonomous transposable elements. The CC consensus sequence is less than 5% divergent from CC TE2-1_CR copies. Classification of this family is not known. It CC is CC possible that it belongs to a novel superfamily. Features of this CC superfamily include preferential insertions into AAAC targets CC that are accompanied by the AAC target-site duplications, a CC 18-21-bp CC palindrome separated by 3-5 base pairs from the 3' end; and the CC AGCC CC 3' termini. XX SQ Sequence 471 BP; 91 A; 111 C; 156 G; 113 T; 0 other; ctcccagctc cgcagcgcga tggctcgccc tttgtgggga ctcacagtca agtgtggaat 60 agcggggtgg cagtgtgtag ctgggacggg ctgtgagcgc attgtgtcgt gagcgaggtt 120 gggagaggag actggagcag gggagggttg gatcgtgttg gagcgccagg gcaagcaact 180 ttgtaagtgc ggcctcaaag cgttgagccc agcaggtgct atccctcgag gatatttggc 240 attggctttg cctagacctc agcttggctt tcgtcggctt tggcctcatg gcccggggcg 300 gtgcacttta gtgcccgttt tcatgtgttc attgttttca agagtttctg cattcgataa 360 atagcccgtc gaaggctgta ctggtagccg aaacaagcaa aaaagacaac aaccctaatg 420 taaggttctt gccgaggact aaggctgtcg cccccctttg gggggcgagc c 471 // ID MSAT-1A_CR repbase; DNA; CHL; 72 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE MSAT-1A_CR is a minisatellite unit - a consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-1A_CR; MSAT-1_CR; KW Interspersed repeat; minisatellite; nonautonomous; tandems. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-72 RA Kapitonov V.V. and Jurka J.; RT "MSAT-1_CR, a family of minisatellites from the green algae RT genome."; RL Repbase Reports 5(3), 52-52 (2005). XX DR [1] (Consensus) XX CC The MSAT-1A_CR is a subfamily of MSAT-1_CR minisatellite. XX SQ Sequence 72 BP; 8 A; 21 C; 37 G; 6 T; 0 other; ctgggcggcg ctggcgacgg cggctccgga gacggaggcg ggctagacgg cgcagggcta 60 ggcggctccg gg 72 // ID Novosib-2_CR repbase; DNA; CHL; 3280 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 01-JUN-2006 (Rel. 11.05, Last updated, Version 1) XX DE Novosib-2_CR is an autonomous DNA transposon - a consensus. XX KW Novosib; DNA transposon; Transposable Element; KW Interspersed repeat; 8-bp TSD; transposase; Novosib superfamily; KW Novosib-2_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-3280 RA Kapitonov V.V. and Jurka J.; RT "Novosib-2_CR, a family of autonomous Novosib transposons from RT the green algae genome."; RL Repbase Reports 6(5), 263-263 (2006). XX DR [1] (Consensus) XX CC Novosib-2_CR is a young family of autonomous transposons. The CC consensus sequence was derived from several copies that are~99% CC identical to each other. These transposons are characterized by CC 8-bp target site duplications and 17-bp terminal inverted CC repeats. XX FH Key Location/Qualifiers FT CDS join(155..548,647..769,966..1336,1464..2231, FT 2268..2924) FT /product="Novosib-2_CR-1p" FT /translation="MLSAYRFDKKRNRRLNIYCARCAKSITATQPGNRTRK FT CADFLCIPDAFDQRVIDVIKEQNNGDVVTVQLHSKCEKELAAKLDVLRGAK FT ESARTRAQAVALADVPEPQRRTVPGAVAKVERLVEEAEAVGATGSLKCRPP FT RPVLGDITNIGAASPPLQPRTPDADMQQEPAGAGRQLYGVSQWRQYFKARV FT TVAGTDYHLLTSANLCVAGLAVNVFRARLLQLDPKCGLSSVRENHVPGSTT FT LSSSERLKVEMQVSSAMASRASRIAAVSAAQQHQHQQAPGAAHLQAALAKL FT GAVLSFLKMSLASDGAGGSNSSSSSTEAEAAALVAVVPVAEGIQVVWHYMQ FT CAQGSASPSLVNAAQLERAYWAFRLTHGQKPSRLRPAQTLAAMYSAVAQEQ FT LHARSAALKRKGRDFSLAHTYYPAGKAAKVIDEPHKLKTAAKEQEQGSIYP FT EGAPLAAMQDIASLRPCGFTGGPRQGPARGGNPGPRPPARGGQLSMDSQVL FT VAVLQHLVQEWQDTAAADQMEKFMGEDPKVLLAIAQGRTDTQSVACAQALL FT GSRLLAVCMETLIGAFRAWDEAQLDPLERALRNEQRYYLINRFLGNAWSVV FT GGSCSHVCGLSLANAASFLNNTDARLHVLAHMLPHEREHLNERATSTDANE FT GHFACMTGPGGQKPLVAVLQGRESRVSFLAAKRQQPEEVRGWSYRRPRSRG FT GYAMRATTHVRVGNGRRQSKWNNGAKLADPGLSVKREGAIQKRAARKAEGK FT RVSVRVAVHNKAKLMNQGSG" XX SQ Sequence 3280 BP; 645 A; 978 C; 1105 G; 552 T; 0 other; caacccggtc tcacccggga cgtgcccata cacgcggcac cgtccaagct tgtaaagtag 60 gtattatact tacttgttat atacaaaccc gtacacaaca agtcccactg cgtttagctg 120 ccgttctgct cgacgcgaac catcatgctc aagaatgtta tctgcatacc gcttcgataa 180 gaagcgcaac cgaaggttga acatttactg tgctagatgc gcaaaatcaa tcactgctac 240 gcaacctggc aaccggactc gaaaatgtgc cgacttcctg tgcattcctg atgcgtttga 300 ccagcgcgtt atagacgtaa tcaaagagca aaacaatggt gatgtggtca cggtccagct 360 ccacagcaag tgcgagaagg agctcgccgc gaagctggac gtgttgcgcg gcgccaagga 420 gagcgcacgc acccgggcac aggcagtcgc gctcgctgat gtccccgagc ctcagcggcg 480 cacagtgcca ggagcagtcg caaaggtgga gcggctggtg gaggaggcag aagcagttgg 540 ggcaacaggt gagagccctt gggggtgcgg gtgcagcatg cacgtacgtg tgcctgcacg 600 ccccctttca tgtgctgatc aatgcacgcg tgttcccacg ccgcaggttc tctcaaatgc 660 cggccaccgc gcccggtgct cggcgacatc accaacattg gagccgccag ccctcccctc 720 cagcctcgaa cacctgatgc cgacatgcag caggagcctg caggcgcagg tgtgtatggc 780 tttgctcgtt gcttcatgag tcttcatgca tggtaccgta taggttcaag gatgatctgg 840 gtccttccta ccctgtgctg taggccctcg atctacggcg gacgacatgg cgactttgcc 900 tgctgcatcg ccgcatcccg tcccagcagt ccagcccgca cagccagcac cacagcaacc 960 cgcaggcagg cagctgtatg gtgtgagcca atggcggcaa tacttcaagg cacgcgtgac 1020 ggtggccggt acggactatc atctgctgac ctctgccaac ctgtgtgtgg cggggctggc 1080 ggtgaacgtc ttccgtgcgc ggctgctgca gctggacccc aagtgcggac tcagcagcgt 1140 gcgggagaac cacgtgccag gcagtacgac actcagcagc agcgagcgcc tgaaggttga 1200 gatgcaggtg tccagtgcca tggcgtcccg tgcatcacgc atcgctgccg tgagcgcagc 1260 gcagcagcac cagcaccagc aggcaccggg tgccgcacac ctgcaggcag ctctcgccaa 1320 gcttggggcc gtgctggcga gtaggcgaga gcagctgctg cagttgctgg agtgcctggc 1380 aaccggggag ctgccagtcg acgccttcag tagccgagcg gccgcagtcc ctcacgtgcc 1440 cgatgctgca tgggcgcgtg cagagcttct tgaagatgag tctagccagc gacggggcag 1500 gcggcagcaa cagcagcagc agcagcacag aggcggaggc tgcagcgctc gttgccgtgg 1560 tgccagtggc ggagggcatc caggtggtgt ggcattacat gcagtgcgcg caaggctcag 1620 cttccccgtc gcttgttaat gcggcgcagc tggagcgcgc ctactgggcg ttccggctga 1680 cacacggcca gaagcccagc cggctccggc cagcacagac gctcgccgca atgtacagcg 1740 cggtggccca ggagcagctc cacgcacggt cggcggcgct caagcgcaag ggccgggact 1800 tctcgcttgc gcacacctac tacccagcgg gcaaggcggc aaaggtgatt gacgagccac 1860 ataagctgaa aacggcggcg aaggagcagg agcagggcag catctacccc gagggcgcgc 1920 cgctggcggc catgcaggac atcgccagtc tccggccatg tggcttcacc ggcgggccgc 1980 ggcaggggcc tgcacgggga ggcaatcctg ggccacgccc accggcgcgg ggcgggcagc 2040 tcagcatgga ctcccaggtg ctggtagccg tgctgcagca cctggtccag gagtggcagg 2100 acacggcggc ggcggaccaa atggagaagt tcatgggaga ggaccccaaa gtgctgctgg 2160 ccattgccca ggggcgcacg gacacgcagt cagtcgcgtg tgcgcaggcg ctcctgggca 2220 gccggctgct ggcgacgacg ctgcgggaca tgggcttcct gcgctaggct gtgtgcatgg 2280 agacgctcat tggggcgttc cgcgcctggg acgaggcgca gctagacccc ctggagcgag 2340 ccctgcgcaa cgagcagcgc tactacctga tcaatcgctt cctgggcaac gcctggtcgg 2400 tcgtgggcgg cagctgcagc cacgtgtgcg gcctgagcct ggccaacgcg gcgtccttcc 2460 ttaacaacac tgacgcgcgg ctgcatgtac tggcgcacat gctgccgcac gaacgcgagc 2520 acctcaacga gcgcgccacc agcacggacg cgaacgaggg ccacttcgcc tgcatgacgg 2580 ggcctggagg gcagaagcca ttggtcgcgg tgctgcaggg tcgcgagtcg cgcgtgtcat 2640 tcctggcggc aaagcggcag cagccggagg aggtgcgggg ctggagctac cgcaggccgc 2700 gcagccgcgg tggctatgcc atgcgtgcca cgacgcacgt acgcgtgggc aacggtcgcc 2760 ggcagagcaa gtggaacaac ggcgcaaagc tagcggaccc ggggctgagc gtcaagcgtg 2820 agggcgccat tcagaagcgg gcggcccgca aggcggaggg gaagcgggtc agtgtgcggg 2880 tggcagtgca caacaaggca aagctgatga accagggcag cggctgagtg aacacattgc 2940 ctaagtcttg tgtgtacgtg ggcaggcctt ggggcaggcg tgtgtatggg tacgggcgag 3000 cgggtggtgg gcaggtgcac ttgggattac aaatacacac tgacacacca gcttggcaca 3060 ttgtttatca acctttatat gacgtattta attaatacgg caccgttggc tggccggttt 3120 gaccatcaaa ataagctacc ttggatggct gtgcggctgt gcgatttgtg gggggtaggt 3180 caggttagat tcctagtcgg acttttgccc cgaccagttt tgcggggttg aagggagtgc 3240 aaatttggac ggtgcccaca gcgcgggtga gaccgggttg 3280 // ID DualenCr4 repbase; DNA; CHL; 10346 BP. XX AC . XX DT 22-JUL-2010 (Rel. 15.07, Created) DT 22-JUL-2010 (Rel. 15.07, Last updated, Version 2) XX DE DualenCr4 is a family of non-LTR retrotransposons - a consensus DE sequence. XX KW RandI; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; RNaseH; RandI superfamily; Dualen; KW DualenCr4. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-10346 RA Kojima K.K. and Fujiwara H.; RT "An extraordinary retrotransposon family encoding dual RT endonucleases."; RL Genome Research 15(8), 1106-1117 (2005). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1..10149 FT /product="DualenCr4p" FT /note="includes josephin-related cysteine protease, FT AP-like endonuclease, reverse transcriptase, FT ribonuclease H, zinc-finger motif and FT restriction-like endonuclease." FT /translation="MLLGITVVRRAPEPTFPRSYCQPGYQPPPASPKLSGG FT KQTTEQHKQQRFNTKADRPLIGRPNASRTATSGLAASKMTNLIRYTTKFSM FT LTISLLCALLLSSSSHPSQASASVSHNTLTNQPHHSTHNSKTKNPLVGRQS FT ARTCFLSNVSPNHALNPTTESLQQAHDNTFIEPLENDTQNPTVRSPIVPIV FT KACRPSFIAILLLSAWAALVTWMPEAWRLGPLCNNDVHHSHGPNRQEPEPQ FT ASCQPALPKPPTHGARKAHRLAKGRPSNPESPYRPVTTTPRRTATQLYTAG FT SRDHTAPPSRTHPNRLPPAFFEQQQHSFCWIHAINNLLGCAAISPYRVLTY FT AEQMLHQAQGANQPAQTHAWTQAFTPVIGQFSVHAINHYLKYNCHRANQAL FT AIHVHDNIRPDDSPASIARRHKQTEHGFLIFTPGYSSGIYGHVIAIKRHGT FT SWLVLDSADTQPRAIRRTTALETTGTVFTLSARTPDTPPLHPLCLNATGPP FT ANWAPDRMRSADSNTPLPEPPEYRSLTQLAHRLTSQGHRLLDTCAWGNNCA FT LFATLAALPGSQVPLMPTEPASWASMTGAIRTCIASKLRQLTAPSSPGSDE FT LRLHILGLYEETETPERTVQTHCRNVEDTAMLTIPDLIAFQSLLPGVQIHI FT IGPGYTDWLPLPKPTAYDRPQHNVYLAHVPHSWALAGFPPTFPTAECNHYI FT AVTHSTWTHGPRTQDPSLLRWGPIWDGGPCTQDIIDKHANTPVRSPPPKKA FT RTNTLTHALLRAMGSPINRRKAAKRALSLSPPSPHKPTRPTTNLPAQRTPT FT KAPPSQPTPGTPPNHASTPAQQPPPVIHNPPQRAPGKKRQATLQACLEAAA FT LPQAQRPAPATPAHTSTCHLVTLNARGLLSNLAPIREESETWDADILVLTE FT TKTRAHRTRTERDIIQNSFKSYSTFVHSTPRPTPTKGPKQDQRRHQPSGGV FT TLAVHTRYARKEHITCLPTPATLQGYMVHLRMAPSSSTPIHIIGVYCPPDD FT KRTREALYTYITRASRSAARLKEPLLAGGDWNAAAYDTDRSTNTSTTNDTT FT HQNFLKQTGLSPAARTRPRPHTYTQIQHDTITHQSRIDDFLANNHAQGCPA FT ATEWTSACGGTTDHLGLHLKANLPYKPTGTTTAPAEPDIPSADTSTGPTLA FT NIVKPITQTQSRTITQRWEADGTSALETLYAALTIPGRSARQIIDSTVTAL FT SSNTHHVANTRTAHNIVTDARAQLSTLDLNAASTALYGALAVGAKIMLDTC FT PQKPPRTSRAKPSRAEAKKLKRLLATEKQKKQLYLALVHASPAQPQEPPTP FT PPTAPAPVPPVAPTNPTPAAAAKRDWETALKTRRTTQKEANAKAASKWLTR FT IRKLFATSQGKASKIMLGKSGPRQKLTAVRHPTTQHIATTSTDINDALHTH FT FQAAAEPTGGPRTGRYRPEDTTTTFPWNEPGAPDGGFLLARAQHPDTHSLL FT PNVSDASNFQRCMQSLSRNKSPGPDGLPNELLKCMPDALQAAVHTLLCVAW FT IKAELPDGWSTSDTILLYKKADPLSISNYRPIALCNTLYKLYTSLVTRAAT FT EACTELLVHSEAQEGFQKYKGTERQIMNLLHVIEDAALTKRDLYVAYIDFS FT NAFGTIDHDRLLTVMHHLGLPTDFIEVVRNLYSTANTTIRTAHGRTSPVQI FT RRGTMQGDTLSPLLFIIFLEPLLRWLHYGGRGYRYGCLTNSENDQHNCSSL FT GYADDLALAANSRDNLRCQTDKIARYAAWGGMRINRDKSACTGILHSFASA FT NSSDKSMKGPACERTLKRFLDGQIHIGSNVKDGKLTKIALPYVSPHKSYRY FT LGVEITMTMDWRPQLKAVKTDLTDAGEELSKYKKLPRQCYMLIKQCLKAKV FT AYSLPAMAYTQHDIRALDAQIAGITRRCVGLPRDCATRAILLPAEMGGLGI FT GSLREEYVQRASRLLMISLNDNGRLGVITRRLLHLQAAQAGSDLPLHDMRR FT AGKFFVTLKQLTLLQDHDLLAMLQGKEFAAALPTLVSKLRAAEASTPAYLR FT ADWISPLTIDLGLNLADLVDANTGRHIIDTVQLDRRFPKGRVKDRHKRALN FT RFTHALCKLADTQPKPTAGTKPLPLHNRALPNGVSKTDLRTAGDLPKPIPQ FT RTLTSLLERWQQSTPPRPSGAPARPSLEVLQAKPKPTAKQRHNHFIKYDTR FT TRNPAVEANDAALPGSTLHYPSDPTRDLTFYMHAAREFTPLVKQCKTLGTT FT RWTGSWRIPPRRVPAHAEAAGPHTHATHELLASRPYQAVAAPAHQRLELPD FT GCPTAATSPTHTKPPNRRKSSARPTRCPHTHHAAPARGLPPAAAQRGNRAT FT TQPSARPAWRKARDTALNGALPQSDAGKLAAQHITAHLANRDDIIAAVVSG FT PMTHKTNAPPPKRKRKLDPPPEEPPWADRCRYKVRWQPTLTTNGILSAYKI FT QNYDAINATKPSCMLWGQGIRRFMAQVHWADTEDPLSSLSQNPGWAAALAR FT YTLAQAEAPPDPTPPPALDTHLTSKQRQGRWRDPAPSMAARKRATLRCCTI FT DTNSCNPDTDIHPTGQYCIQLGTRTPGLAAGTATDPDPTARVYGPDGRCVG FT SISATRLALLHARYKALNPTQPPKAFAAAVASLLTRYRPSKRHKESSTNPA FT HEVTLCPDLMSTLTSHLKFTTEAFASPLNATGNIPAYYTAHPEDAVFGAAL FT DCYSRLLHDVSFLHPPPQPEQTNLAMAWAVASAEAAAAQAPHLSVVVVPYG FT ETSGYTAWLQHPLVHALGHIRPAHTWAAMASGLGQPLELPAHRRGLHLLCV FT ANEPGRLILNTALGNGLAAALSGALGHTPQPAARKPLPNPDAPKPWPKTPR FT KLERRLRTHTTPTPLPQPPPAAPTPPTPTELAALYPITIPLAHTCGSAVFT FT DGSKVPHSSDYTGLGAAIWDETENRTTHIDEDIAFDTPLSTILRCESSAIR FT MALSLNLDLPHLRIYTDSLVSLLILRRAILEPESLHVSKHREMFGAITDIL FT AQRGLAGLKTYFYHVRSHQADTCVGNQKADEAAQAVATRETPPDAVETLYS FT HPFETLEGWIAKRETVEHNGNVEERLRYANDLNRDLKKMARPSTAQGYTKT FT GLYAELWDATRQHIDARYGLNLFDGFPFEEARCAFKYRWGLSWNAKRAARV FT KHPIPGCAGGECPLCRCDDGGTHTLGGCTHRRMKAAYIARHNRAVQRIAKA FT VSQGQNGGCVMYMDAGKDAPAYSGGTRIPAWVLPGTPESVRLRFRPDLLVV FT PSLRMTRSGQPTNMPRTRLQKSAHKIYVIEVGYCSDLNHLDKRAEKLQQHD FT DLATALRTAGWDVIHNQDTIISLGHTGTVTTHLHALLKTLGCSTQSAHRTC FT QKLVQHAVHTTTAITALRRELCSHPWPGRPP" XX SQ Sequence 10346 BP; 2607 A; 3854 C; 2312 G; 1573 T; 0 other; atgctccttg ggatcacagt tgtgcgacgc gcccccgaac ccacgttccc ccgatcctac 60 tgtcaaccgg gctaccagcc gcccccggca agcccaaaac tgagtggagg caagcaaacc 120 actgagcagc acaaacaaca acgcttcaat accaaggcag accgaccgct aataggtcga 180 ccgaacgcaa gcagaacagc gaccagcggc ctagcggcca gcaaaatgac caatttgata 240 agatacacca cgaagttctc tatgttaact atctctttgc tgtgcgcact cctcctctcg 300 tcgagctcgc acccctccca agccagcgcc tcagtatcgc acaacacgct aaccaatcag 360 ccccatcatt caacccataa cagcaagact aaaaaccctc ttgtgggaag acagagcgcc 420 agaacctgtt tcctctcaaa tgtaagcccc aaccacgcgc tcaaccccac aacggaaagc 480 ttacagcaag cacatgataa tacgttcata gagccactgg aaaacgacac tcaaaaccct 540 accgtgcgct cccccatcgt cccaatcgtg aaagcctgcc gcccgagctt catcgccatc 600 ctgctcctta gcgcttgggc cgccctcgta acctggatgc cggaggcatg gcgcctgggg 660 cccctctgca acaatgatgt tcaccactcg cacggcccca acaggcaaga gccagaaccc 720 caggcctcct gtcagccagc attgccgaag ccgcctacac atggcgcgcg aaaggcacac 780 aggctggcga agggacgccc atctaaccca gaatcgccct accgtccagt cactaccaca 840 ccacgtagaa cagccactca gctatacaca gccggcagcc gcgaccacac cgccccccca 900 tcccgcacgc acccaaacag attaccgcct gcctttttcg aacagcaaca gcattccttc 960 tgctggattc acgccattaa caacctgcta ggatgcgcgg ccatctcccc gtaccgcgta 1020 ctcacgtatg ctgagcaaat gctgcaccag gcgcaggggg caaaccaacc agcacaaaca 1080 cacgcgtgga cgcaggcctt cacaccagtc atcggccaat tttcagtcca cgccatcaat 1140 cactacctca aatataactg ccacagagcg aaccaagccc tggccatcca cgtccacgac 1200 aacattcgcc ccgatgactc ccctgccagc attgcacgta gacacaaaca gaccgaacac 1260 ggcttcctaa tcttcacacc cggctactcc agtggcatct atggccatgt gatagccatc 1320 aaacggcacg gcaccagctg gctggtcctg gactccgctg acacccaacc gcgggccata 1380 cgccgcacca ccgcacttga aaccacgggc acggttttta cgctatcagc tcgcacacct 1440 gatacacccc ccctacatcc gctgtgcctg aacgcgaccg gcccgcccgc taactgggcg 1500 cccgaccgca tgcgctcagc agacagcaac accccacttc ccgaaccccc cgaataccgc 1560 tccctaaccc aactagccca ccgcctcacc tctcaaggac accgcttgct cgacacgtgc 1620 gcatggggca ataactgcgc cctctttgct accctagcgg ccctacccgg cagccaagtc 1680 ccactcatgc ccactgaacc tgcgagctgg gcatcaatga caggggccat acgcacatgt 1740 attgcatcca aactcagaca gttaaccgca ccaagcagcc cgggcagcga cgagctgcga 1800 ctgcatatcc tgggtctcta cgaggaaacc gaaacccctg aacgaaccgt gcagacgcac 1860 tgccgcaatg tggaagacac agccatgctc accattccag acctcatcgc attccagtcc 1920 ctcctccccg gagtacaaat ccacattatt ggcccaggat acacagactg gctcccgctc 1980 ccgaaaccca cggcatatga ccgcccccaa cacaacgtat acctagcaca cgtaccccac 2040 tcgtgggccc ttgcgggctt cccccctacc ttcccaaccg ccgaatgcaa ccactacata 2100 gcggtaacac actcaacttg gacacacggc ccccgtaccc aggacccctc cctcctccgc 2160 tggggcccca tctgggatgg cggtccatgc acgcaagata tcattgacaa gcacgccaat 2220 acccccgtcc gcagcccacc acccaagaag gcacgtacaa acacactcac tcacgctctc 2280 ctgcgagcca tgggatctcc catcaaccgc cgtaaagcag caaaacgcgc cctttccctg 2340 tccccacctt caccgcacaa accgacacgg cccaccacaa acctccctgc acaacggacc 2400 cctaccaagg cacctccctc gcaacccacc cctggcacac caccgaacca tgcctccaca 2460 cccgcgcaac aaccaccacc agttattcac aaccccccac agcgcgctcc tggaaagaaa 2520 agacaagcca cgcttcaggc atgtttggaa gccgcggcac tgccgcaggc ccagcgcccg 2580 gcaccagcta cacccgccca cacaagcaca tgccatcttg tcacgctcaa cgcccggggg 2640 ttgctatcca acctggcccc cattagggaa gaatcagaaa cctgggacgc ggacattctc 2700 gtactcactg aaactaaaac tcgagcccac aggaccagaa cagaacgcga catcatccaa 2760 aacagcttca aatcgtactc cacgtttgtc cacagcacac cccgaccgac accgacaaag 2820 gggccaaaac aagaccaacg gcggcaccaa ccctcgggcg gagtcacgct tgcggttcac 2880 accagatacg cccgcaaaga acatatcacc tgcctcccca cgccggcaac actccagggc 2940 tacatggtcc acctgcgcat ggccccttcc tcttcgacgc ctatccacat catcggagtc 3000 tactgcccac ccgatgacaa gcgcacccgc gaagcattgt acacctacat cactcgagcc 3060 tcacgaagtg cagcccgtct gaaagaaccc ctcctagccg gcggggactg gaacgctgct 3120 gcttacgaca cggacagaag caccaacacc agtactacca acgacacaac acatcaaaac 3180 ttcctgaagc agacgggctt atcacccgcc gcgcgcacac gccctaggcc acatacgtac 3240 acgcagatac agcacgacac aattacacat cagagccgca tcgacgactt cctagcaaac 3300 aaccacgccc agggctgccc tgcggctacg gaatggacct cagcatgcgg gggcaccacc 3360 gatcacctgg gcctacactt aaaggctaac cttccctaca aacccacggg aaccaccact 3420 gcacccgcgg aaccagatat cccgagcgca gacacaagca ccggccctac actagcaaac 3480 atcgtcaagc ctatcactca aacccagagt cgcaccatca cacagcgatg ggaagcagac 3540 ggaacctctg cccttgaaac cctgtatgcc gcgctcacca tacccggtcg cagcgcacga 3600 caaatcatcg actccacagt aaccgcactt tcatccaaca cacatcacgt agcaaataca 3660 cgcaccgccc acaacattgt caccgacgca cgcgcacaac tctccacgct tgacctcaac 3720 gccgcatcca ccgcccttta tggagcacta gccgtcggag ccaaaatcat gctcgacacg 3780 tgtccccaaa aaccgccccg aacctccaga gcaaaaccgt cccgggcgga ggccaagaag 3840 ctaaagcggc tgctagccac cgaaaaacag aaaaagcaac tctacctagc tctcgtgcac 3900 gcctccccag cgcaaccaca ggagcctccc actccgcccc ccacagcgcc agcacccgta 3960 ccacccgtgg cccctacgaa ccccacaccc gcggccgccg ccaaacggga ctgggaaact 4020 gccctcaaga cacgccgcac tactcagaaa gaagccaatg ccaaggccgc cagcaaatgg 4080 ttaacacgga ttagaaagct attcgcaacc tctcaaggca aagcctccaa aataatgctc 4140 ggcaaatcag gaccgcgaca aaaactgaca gccgtacgcc accctactac ccaacacatc 4200 gccaccacca gcaccgacat caacgatgcg ctacacactc atttccaagc tgccgctgaa 4260 cccactggtg gcccacgcac tggacgctac cgaccagaag acaccaccac caccttcccc 4320 tggaacgaac caggagcacc cgacggcggc ttcctgctgg cccgcgcaca acacccggac 4380 acccattccc tgctgcctaa cgtctctgac gcgagcaact tccaacggtg catgcaatcc 4440 ctctcccgca acaagagtcc tggccctgac ggcctcccaa acgagctact caaatgcatg 4500 ccagatgcgt tacaagccgc cgtccatact ctcctatgtg tcgcatggat taaagccgaa 4560 ctaccagacg gctggtccac cagcgacacc attctcttgt acaagaaggc agacccgctg 4620 tccatctcca attacagacc cattgcactg tgcaacacgc tctacaagct ctacaccagt 4680 ttagtcactc gggcagccac cgaagcatgc actgagctcc tcgtccacag cgaagcccaa 4740 gaaggctttc aaaagtacaa aggaacggag cgccaaatca tgaacctact gcatgtgatt 4800 gaggacgcag ccctcacaaa gcgcgaccta tacgtcgcgt acatcgactt ctcaaacgcc 4860 ttcggcacca tagaccacga ccgcttgctc actgtaatgc accacctcgg cctcccaacc 4920 gacttcatag aagtcgtgcg caatctgtac tctaccgcaa acaccaccat ccgcacagcc 4980 cacggccgca catcccccgt ccagataagg cgcggcacca tgcaaggaga taccctctca 5040 ccactgttat ttatcatctt tctggagcca ctcctgaggt ggctccacta cggcggccgc 5100 ggctaccgat acggctgcct cacaaacagt gagaatgacc aacacaactg ttcatcacta 5160 ggctacgcgg atgacctggc gctagcagcc aactcccgtg ataaccttcg gtgccaaact 5220 gacaagattg cgcgatacgc cgcctgggga ggtatgcgca tcaaccgcga caagagcgcc 5280 tgcacaggca tcctccacag cttcgcaagt gccaacagtt cagacaaaag catgaaaggc 5340 cccgcctgcg aacgcaccct caaacgcttc ctggatggcc aaatccacat tggctctaac 5400 gtcaaggatg gaaaactcac caagatagcc ttgccgtacg tgtccccaca taaatcatac 5460 cgctatctcg gcgttgaaat caccatgacg atggactggc gccctcaact taaggcggtt 5520 aaaaccgacc taactgacgc cggcgaagag ctgagcaagt acaaaaaact ccctcgccag 5580 tgctacatgc tcatcaaaca atgcctcaaa gcaaaagtgg cttactccct cccagccatg 5640 gcatacacgc aacacgatat ccgggcactg gacgcccaaa tcgcgggtat cacccgtcgg 5700 tgcgtcggcc taccacgcga ctgcgccacc cgcgccatcc ttctgcctgc cgagatgggg 5760 ggcctaggaa taggctcgct ccgcgaagag tatgtacaac gcgcctccag actgctcatg 5820 atttcgctga acgataacgg caggctaggc gtaattacgc gccgcctcct ccacctccaa 5880 gcggcgcagg cggggagcga cctcccgctg cacgacatgc gaagagcggg gaaatttttt 5940 gttaccctga aacaactcac cctccttcaa gaccacgacc tgctggccat gctgcaaggc 6000 aaggaatttg cagccgccct accaacacta gtgtccaaac tccgagccgc tgaggcgagc 6060 acgcccgctt acctgcgagc tgactggatc agccccctga caatagacct tggcttgaac 6120 ctggccgacc tggtggacgc aaatacggga cggcacatca ttgacactgt acagctggat 6180 cggcgatttc caaaaggccg ggtaaaagac cggcacaagc gcgccctgaa ccgcttcact 6240 cacgccctat gcaaacttgc agacacacaa cctaagccga ccgccggtac taaacccctc 6300 cccctgcaca accgagccct acccaatggc gtctccaaga cagaccttcg gacagctggc 6360 gacctgccta aacctatacc ccaacgcact ctcacctctc tactggagcg ctggcagcag 6420 tcgacacctc cgaggccctc cggagccccg gcgcggccgt ctttggaagt cctgcaggcc 6480 aaaccaaagc caacggcaaa gcaacgccac aatcacttca tcaaatacga cacacggaca 6540 cgcaatcccg cagtcgaggc caatgacgct gctctgcccg gatccaccct ccattaccct 6600 tccgacccca cgagggacct aactttctat atgcacgcag cacgggagtt taccccgctc 6660 gtcaaacaat gcaaaaccct cggcaccaca cgatggaccg gctcctggcg gatcccgcca 6720 aggcgggtcc cagcccatgc cgaggcggcg ggcccccaca cccacgctac ccacgaacta 6780 cttgcttccc gcccctacca agcagtcgct gcacctgctc accagcggct cgagctacca 6840 gacgggtgcc ccaccgcagc cacatcaccc acgcacacga aaccgcccaa ccgccgcaaa 6900 agcagcgctc gccccacccg ctgtccacac acgcaccacg cagcgccggc gaggggcctc 6960 ccaccggcag ccgcgcagag agggaaccgc gcgaccacgc agccaagcgc cagacctgca 7020 tggcgcaaag ccagagacac agcattaaat ggcgcactgc cccagtccga cgctgggaag 7080 ctagcggcac aacacatcac agcccacctg gctaaccgcg atgacatcat tgccgcggtt 7140 gtcagtgggc ccatgacaca taaaaccaat gccccaccac ccaagagaaa acggaaactg 7200 gacccgccgc cagaagaacc cccctgggct gaccgctgcc gctacaaagt tcgctggcaa 7260 ccaaccctca ccacgaacgg tatactcagc gcctataaga tacagaacta cgacgccatc 7320 aacgccacaa aaccctcatg tatgctatgg ggacagggca tccgccggtt tatggcccag 7380 gtgcactggg ccgacacgga agacccgctc tcctccctct cacagaaccc cggctgggcg 7440 gccgccctgg cacggtacac ccttgcccaa gctgaagcac cccccgaccc caccccaccc 7500 cctgcgcttg atacccacct caccagcaag cagcgccagg gccggtggcg ggacccagca 7560 cccagcatgg ccgcacgaaa acgtgccact ctccggtgct gcaccattga caccaactcc 7620 tgcaacccag acacagacat tcatccgaca ggtcagtatt gtatccagtt gggcacacga 7680 acacccggtc tagccgccgg cacggcgacc gaccctgacc ctaccgccag agtatacgga 7740 cccgacggcc gatgcgttgg atccatctcg gctactcgcc tcgccctcct ccacgcccgc 7800 tataaggcgc tcaaccccac ccaacccccg aaagcattcg ccgccgccgt agcctccctg 7860 ctcacacgat atcgccccag taaacgccac aaagagagct ctaccaaccc agcgcacgag 7920 gtcaccctct gtccagacct aatgtccacc cttacctccc atcttaagtt cacaacagaa 7980 gcatttgcct cccccctcaa cgctaccggc aacataccgg cctactacac ggcacacccc 8040 gaggacgcag tgttcggggc cgccctcgac tgctacagcc gactcctaca cgatgtgtca 8100 ttcctccacc caccaccaca gcccgaacaa accaacctgg caatggcctg ggcggtcgcc 8160 agcgctgagg cagcggcagc gcaagcgcca catttgtcag tcgtggtcgt gccgtatggc 8220 gaaaccagtg gttacacggc ctggctccag cacccactgg tccacgccct gggacatatc 8280 cgaccggcac acacgtgggc cgccatggcc tccggcctag gtcagccgct ggagctcccc 8340 gcccaccggc gcggactcca tctcctgtgt gtggcaaacg agcccggccg gttaattctc 8400 aacacggccc tggggaatgg actagcggcg gcgctgtctg gcgcccttgg acacactcca 8460 cagcctgccg cccgcaagcc cttgccgaac ccagacgcac ctaaaccgtg gcccaaaacc 8520 cctcggaagc tggagcgtcg cctccgcacc cacaccaccc cgacacccct accacagccc 8580 cccccggccg cccccacacc cccaaccccc acggagctcg cagccctcta ccccataacc 8640 attcctctcg cacacacgtg tggctccgcc gtcttcaccg acgggagcaa ggtcccacac 8700 tcgagcgact acacaggcct gggagcagcg atctgggacg aaactgaaaa ccgcacgacg 8760 catatagacg aagatatcgc ttttgacact ccccttagca ctattctcag atgcgagtcg 8820 tcggctatac gcatggcact tagcctcaac cttgacttgc cccacctgcg gatttacacc 8880 gacagccttg tcagcctgct tatccttcgc cgtgccattc tggagcccga gtcgctacat 8940 gtaagcaaac atcgcgaaat gttcggggcc atcactgaca tactggccca acgtgggttg 9000 gcaggtctga agacatactt ctatcacgtc cgttcacacc aagcagatac gtgcgtagga 9060 aatcagaaag ccgatgaggc ggcacaggca gtggcgacta gagaaacccc gccggacgcg 9120 gtggagacct tgtactcgca cccgttcgag acactagagg gctggattgc taagagagaa 9180 actgttgagc acaatggcaa cgtagaggaa cggttacgat acgccaacga cctcaatcgc 9240 gacctgaaga aaatggcccg gccctccacg gcccagggat acacaaaaac aggcttatac 9300 gcagaactgt gggacgctac caggcagcac attgacgcga gatatggact aaacctgttc 9360 gatggcttcc cgttcgaaga agcacgctgt gccttcaaat accgatgggg gctctcctgg 9420 aacgccaagc gcgccgcccg tgtaaaacac cccatacccg gctgcgcggg gggcgaatgc 9480 ccgctgtgcc gatgcgacga cggtggcacg cacaccctgg gcggctgcac gcaccggcgt 9540 atgaaagccg cgtacatcgc gaggcacaac cgcgccgttc aacggatcgc gaaggcagta 9600 agtcaagggc agaacggcgg atgcgttatg tacatggatg caggaaagga cgcccccgct 9660 tacagtgggg gaacacgcat acccgcatgg gtgctgccag gaacgccaga gagcgtgcga 9720 ctacgatttc gcccggacct gctggtcgtc ccctccttac gcatgacccg atctgggcag 9780 cctactaata tgcctcgcac gcgactccag aaaagcgcgc ataaaatcta cgtaatagaa 9840 gtgggctact gctctgacct caatcacctg gacaaaaggg cagaaaaact gcaacaacat 9900 gacgaccttg ccaccgcact ccgcaccgcc ggatgggacg tgatacacaa ccaggacaca 9960 atcatcagcc tcgggcatac cggcaccgtt accacccacc tccatgccct gcttaaaacg 10020 ctcggatgct caacacaatc tgcgcaccgc acctgccaaa agcttgtcca gcacgctgtt 10080 cacacgacta ccgccattac agctctccgc cgggaactct gcagtcaccc atggccagga 10140 cgcccaccct agcacagtac gccccgttat gagaatggga ccaggcccag cgcgcctagg 10200 cgcgctgcca gcctccccac tctccttggg ggacaggggc ccagtggcgg agaccactaa 10260 cgtggtggtc tctcatgccc tcacgcctct acctcctcag aacacgccct gtcacgcagg 10320 gagtaccatt attattatta ttatta 10346 // ID MSAT-4B_CR repbase; DNA; CHL; 1184 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE MSAT-4B_CR is a minisatellite unit - a consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-4B_CR; MSAT-4_CR; KW Interspersed repeat; minisatellite; nonautonomous; tandems. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-1184 RA Kapitonov V.V. and Jurka J.; RT "MSAT-4_CR, a family of minisatellites from the green algae RT genome."; RL Repbase Reports 5(3), 61-61 (2005). XX DR [1] (Consensus) XX CC The MSAT-4B_CR is a subfamily of MSAT-4_CR minisatellite. XX SQ Sequence 1184 BP; 224 A; 383 C; 373 G; 203 T; 1 other; agcatactgg ctgggcaggg cgcacatcgc tggttcgcgt gtgggcatgt cggcatggtt 60 ggcgggtccg catcttggaa tgagtcggcc tacccccact tcaaggtatc ttgtttgcgg 120 tcttttacat gaacgtatat gctgataggg atatggagga accgacgcca ggaaagtggc 180 gacttttagc ctgaccttgc gtggctcgcg cgagggtcgg tcgcgctgga aaggcgaaaa 240 caattcattg caccagcccc gaggccactt gccccacacg caacatagat acatattata 300 tatgtaatat gaaatactcg cacaggcaag cagccgcatg gcgcaggtgt gccacgccgt 360 gccgtcgccg gcgaccctcg ccgcgccacc tcggggccac cggggcccgc ccggcccctc 420 cgccgcgtgc acagtcggcc cttgggctcc cagctagctt gcactatgag accagcagcc 480 gccactgcag tcgcgctcgg cagggcatag cacgcaggag cccaggccgc ggcgccatgg 540 tggggccctg agcgctgccc agccaggcga gcagacgctc ttgtgccata gaccggcgcc 600 cacaggggcc gtaaggctat gtgacaaggc tgtgatatac atcagggaca gaggacagca 660 aaaggggcga cataaagccc gtatgatgta cacacgtccc tctgtgcacc ttcctctcat 720 gcaccctgca tcacctgccg ccgccgccgc tcggtaaacc agtcacgggc ccggcgtgcg 780 accgggcgca ggccggctgc cggcgtcata atacgggaaa catatgtgac atgtcagctg 840 ctatgcacca gagggtccag ggcatgtggt tgtgggcgcg caggagtgca tgcctttgtg 900 ggtgtgcgtg tgggcgtgcc cgctgcgggg cagcrtgtgt agcacaagtg ccccggaggc 960 tgtgcggcgc gcctgccccg ccccgctatg tcaagctaca ctaggtggca tgcaaacacg 1020 gcactcagcc cgacattact agcggtgccc tgaccacacg gtggccgggc gcccggcagc 1080 ccctcccggc gcagtgccca tagacgggcc cagcgcctgt gtggcggccc cacaaatgcc 1140 ccaacctgct tggactaggc ttgtgaccat agacgcatgc ccac 1184 // ID Gypsy-6-I_CR repbase; DNA; CHL; 6090 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-6-I_CR is an internal portion of Gypsy-6_CR LTR DE retrotransposon - a fossilized copy. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 4-bp TSDs; Gypsy superfamily; Gypsy-6-I_CR; KW Gypsy-6-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-6090 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-6_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 6(5), 238-238 (2006). XX DR [1] (Consensus) XX CC It is an internal portion of Gypsy-6_CR LTR retrotransposon. It CC may be incomplete and encodes integrase and PHD domain. There are CC only a few copies of Gypsy-6_CR present in the genome. They are CC over 98% identical to each other. XX SQ Sequence 6090 BP; 880 A; 2403 C; 1824 G; 983 T; 0 other; tgcaacacaa cccacagttg ctattggcga actcggcgct gcatcccact gcgctctcgc 60 ccttcttctc cgcggctact tctcactgga caacaaggct ttgctcattc atcccttcct 120 ctcgcttatc tacatctacg aagcatgtca gatgcatcag tggtctcgcc ggatatagcc 180 acggctccgt cgagtcgggc gcggcgccca ttttcacttc gccggctgac gaccgcgaag 240 cttgctacac tcggcgcacc ttattggcag gctctcggga catattgcag atggcagacc 300 aggtcatcgc aactctaccg caggatggtt actctggcca tgcggattcg cccgccagct 360 gaggccctcg gctccgacag cgctggcgac gctgcacctg agtccgcgca cgcggccatt 420 ggcgccctca cgctcgcctc agcagcactg tcagcgccag ctgcctcctc cagcagctcc 480 ctgcctgcag ccgccagcgc tcccgcgccc gccgccgccg ccagcgccgc gcccagcagt 540 gacgccgccg ccgtcgaccc ggtcagctct gccgtctcca tgtacgcgcc ttacgcctac 600 gccttcccca tcctcgcggg ccacagcagc cctggcgact ccactgcgcc cgctgctgca 660 ccctgcgccg ccgccaacgg tgcatctgca tcttctgact tcctcggcgc tgcatcccag 720 agcgccattc tcaccgccgc accctcgtgt ggcttcgcct ccatccctca agatgctgca 780 tacgtcttca ccccttcacc tggtcctgcc tcaagcagca ccatgggcgc tgctgcatcc 840 cagagcagcg ccggcgtcgc cgcctccgcc atcggcgctg ccgccgcact ccccgcggct 900 ggcgccgcag ccggcgacgc ctcctcccgt gccgcccccg ctggcggcac acctcaagcc 960 acacaggcca ccggtcccgc gtccggctcg cgcctgccgc tcggctttga gcaattcgag 1020 atgcggccgg ccaattcaga ggtcctgccg ctcctcgcca cttcttcgcc tgcccctcct 1080 gttcttctcc tgcccctcct gcttcgctcc tgtatcacca cctctcgctg cggggcggcc 1140 ccgccttcgc tgctgggggg gggggaccgc tgggccgcgt gcacagcagc agctcagcca 1200 acgccgccgc caccagcgtt tccccaccta agccgccgcc cgccactgtg gccggcggcc 1260 gctcgccatg agcgccccag acggcgcgca gccgtcagca ggcgtcaacc cgtctgatag 1320 cgcatcagcg cctggcagct ccgcgcacgc cggctcccgc cttgagcggc agccagatgg 1380 gtgggcctat tgcgagtgtc ccgacgtcac ggcctcccaa cgtgacgcat tccttgggtt 1440 gttggcggcc accgccaccg ccttcacctc aagtgcctcg ggcagcacgc aaccagcagc 1500 ggcagctgca gtggcagtag caggagatga ggcgccagcc ggcgcctctg ccccgggcag 1560 cttcacaccc gacggccctc gctcgctccc gcctgacggc cggttcgagc ggcagcctga 1620 tgggtgggcc actgcgagtg tcccgacgtc acggcatccc aacgtgacgc attcctggcc 1680 ttgttggcgc caccacgagc gccttcacct caggtgcgtc ttgtgggcta gcagtggcaa 1740 ccagcagcgg cagctgcagt ggcagtagca ggagatgagg cgccagccgg cgcctctgcc 1800 ccgggcagct tcacacccga gggccctcgc tcgctcccgc ctgacggccg gttcgagcgg 1860 cagcctgatg ggtgggtcca ctgcgaccga caagacgtca cggcatccca acgtgacgca 1920 attctggcct tcatgaacgc cagcacgagc accttcgcct acaagttagc agatcttgtg 1980 ggctacagtg gcgacgtcgg ccttgagctt cgccacggcc gtgcaatcta tgagcgcccg 2040 cgcccgcaca ccgcagacga gctcacaatc atggagtcca agctactcct ggcctacggg 2100 gacccggcgg ccgccgcctg gctgccggga gctgcagctg gggggacacg ccgaccaccc 2160 acagccgcca ctgggggggc accggagcct cacgctgggg gggcgcctgt ggagggcgcg 2220 cccatcggcg ccgatggccg gccagcgtct cagcaagtgc tcgcggctgc catcaccgcc 2280 ccggccctgg tcaaccctgc cggcgcgctc atccaaagcg cgcgggcccg ctgggcacag 2340 ctgcaggcgc tgccctccgc ggtctcggtc aacgccaccg ccatcgtgcc cgacgcgaac 2400 gagctgtttg ccggccacaa cggcggcccc actgacgcca cggaagcccc gccggctgag 2460 cgcccgcctg accaactcct catccaactc ctcactgtca tccaagccgc gcgtgtgcag 2520 cttcggcagc tgccacggtc cccgccactg cccctccgcc tgcgctccgg cacgggcgac 2580 gtcctggggc tggacacgcg gctggtgcag cacaacttct tcgcagccgc catgggggat 2640 gcgggcatca cgctgctgga gctgcccggc ggcctggcct cagggctgca agccgccttg 2700 cggatgggct tccgcatccg gcgctacgtc cacgtcgaca ccgaccccgc cgtgcgtgcc 2760 tctgtcgccc ggctgctgcc catgctcgcc acgcagcatg caccgctgct gccgccgcag 2820 tcgtggcagc tgcactctgg ccccgaccgc ctcggcctca caccggcgga cctggtgccc 2880 tgggcactcg ggccggaggg agcggagcag ctgcggttgc acgacggcag ccaatggctt 2940 gtggtcacca gctgggcctc gcacgcgctc tggcgcgact cgctcgccct ccagctcctc 3000 ggcggcctgc agcagcagct agccagccgc cgccttcccc agctcgcctt cctcgcggag 3060 ggtccctggt gtgcggagtc accagacgcc tcgccgccct tcgggccgcc cgccggcatt 3120 gatgctgtgc aggccggcag ccccgcgcat cgcgtcacct ctctctacac caacctggcc 3180 gactcatccc aagtcgactt tgccttctct cacctgcgcc ctcttcccag ccgcctgctt 3240 gctgccgtgc tgccgcacgg ccgcacgcct ggtgctgcgc aacggcccct ccctcctcct 3300 tacgtggcgc tacaggctcc cggccaaccc ctcgccgcgc tgcccccgcc ttccccgccg 3360 cccgccgccc cggagccgcc accgctgcgt gccggccctg atggcctgcc ctgcgccccg 3420 gacgcggcgg actggctcgc cgtctccacc caccactcct ccctctcctt gcccgccgcc 3480 ttcgccgtcc agctgctccg ccgccacacg ccgccggctg tcttcacggc cgccttggct 3540 gccgccacgg ctctccaccg cctcttcttc acgcgtcacg acatgcagcc gcttccccct 3600 cgtcagatcg acgccccggc ctccacgctg ggggggggag ctgccgcctc gggcccagca 3660 gaccatctgg agaccgcctt tgcagcgctg gtgctgtcag acgagccgtc accactgctg 3720 gacgcgaccg agaccactct cgcggttgcc ctgcttcacc agcgtctcct cggggcgcgc 3780 atgggtgcgc agctgcatgt ctacgccgca gtggcggaag ctgcggagca agcggactgg 3840 gctgacgccg cactgccggc gtcggcggcc ccagcaccac catcgcctcc gcagccctgg 3900 gcccgtcaac ccctgacacc gcagacgtct ggaacgacgc ccccgtcatg gacctgctgc 3960 gcctgggcgc ccagcctgac aacctcagcc gcgaggagcg gcaacgtgtg cagcgccgcg 4020 ccgccagcta ccgctggacc ggagagcagc tcatccgcct catgcccgac ggcaacacac 4080 gcgtctgccc gccgccggag gcccgtgacg ccatcgtcca gcaaacgcac gagatcggac 4140 acctcggcga gcgccgcacc ttggcgctgt gccagctggg ctactggtgg gccggccagc 4200 gggcgacggt gcaaggcgtt gtgcagcgct gcaagctgtg cgacatggcc aacaccggag 4260 gcatggtgcg gcctgttcaa ctgcagccgc tgccaatccg cggcatgttc taccgctggg 4320 gcatcgacct ggcgggcaac ttgcacccga cttcccccga tggctacacc tacgtcatgg 4380 tgtgcgtgga gcacttcacc aagtggatcg agcttttccc gctcaccacc aagtccgcca 4440 gcgagaccgc acgagggctg gtggagctcc tgacgcgctt cggctcgccc gcagaggtca 4500 cgactgaccg cggcaccgag tttgagggtc cgttccaggc gctgtgcgag gccaacgcca 4560 tcgaccaccg gcgcacgcag cccgaccatc cgcaagccaa cggcctagcg gagcgcatgg 4620 tgcaggttgt gaaagcggca ctgcgcaagc actgccgcag cacgcagtcc aacgacacgt 4680 gggataaggc gctgccttgg ctcgccctgg cctaccgctg ctcgcctcaa gcttcaaccc 4740 gctactcgcc ctacaagctg ctctacggcg tggaccccat catcccgcct gccgtccgcc 4800 cgcgctttga ggagcacgag ctagccttca ccggcgacgg cagtgagcaa gcttacacgg 4860 aggcgctcct gcagcgctcg gagtggctgc ggcggcacgc gcccgccgca gccggcaacc 4920 tgctcatcgc acagcaccgc gacacccagc gctatgctct cacccgcagc ggcggctacc 4980 gcgcgcgggc actggagttt gtgcccggtg actacgtgta cctgcgcaac gcaaatcccg 5040 acaacacgtt ggagatgccg gtacgcgacc aggtgctgcg cgtggagcat gtgggcgaca 5100 tgggcgtggc cgcgctagtc ggccgcgaca ccgcacgtct ccgtgcgcga gtggagcagt 5160 tggtgccgtg ccacctgccc aacctggacc cgattcagga cgcccggctg ctacgcacca 5220 gcaacgacct ggagtgcgaa gtgtgccaca gcccgcagca gccctcccgc atgctgctgt 5280 gtgacggctg cggcaccggc tggcacctcc gctgcctcac gccgccactc agcagggtgc 5340 ctcctggcgc ctggatgtgc ccgggctgtg tggagctggg caaggctcca cccactggcc 5400 ccgcgccgca gcggccctcg gcgggcccag tgctgttccc caacgctgat acgcgacgcc 5460 gcgacgcgga agcagctacc ctcgacggcg tgtcagttgt acgggcagtt cctggcccag 5520 ccggaacacc gcggcagctg cagaacggca ctctgcgctt ccgcgggccc ttgtcacgcc 5580 ccttctactt catggtggag tggccgggag gctccgccga ggccatcacc ctggccgatg 5640 caaaacgcct gcaacggcaa acagtgccat cgtcacctgc atcagcccct gcccagggcg 5700 gcgagcccac ggccaagcgc cgtgttcagc ctcgccgctg atgcaccaac cctgccggca 5760 tcccaacggc aggacctcgc gcacgccggc atcccaacgg cgtgtgccgc gctggcatcc 5820 caacagcgcg cacccacgga catctgctcg gagctggctt ggactcgaag ccagcaatga 5880 ggattgacta acccagtgac gtgtgactag ctaggtgtct gttctcgttt gtcgtgtggc 5940 acgaactcac tcgcatatca ttctggtgtc atgcgtttgc tgtcccgtgc gtctgtgtgt 6000 gtttccgact ttgccagtct cgctgactcc cctgtcttca gtcgtggggg ggatctctcc 6060 gcccgggagc ggcgttgtgc aagttcggct 6090 // ID Gypsy-1-LTR_CR repbase; DNA; CHL; 484 BP. XX AC . XX DT 04-MAR-2004 (Rel. 9.02, Created) DT 04-MAR-2004 (Rel. 9.02, Last updated, Version 1) XX DE Gypsy-1-LTR_CR is a long terminal repeat of the Gypsy-1_CR LTR DE retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; 5-bp TSDs; KW Gypsy superfamily; Gypsy-1-I_CR; Gypsy-1-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-484 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-1_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 4(2), 36-36 (2004). XX DR [1] (Consensus) XX SQ Sequence 484 BP; 92 A; 134 C; 141 G; 117 T; 0 other; tgtaacacgg gcctggcgga cttagccagg gttcgtgccc cgagcttgcg acacgcaaga 60 ctagctttgg agagcaccta tgtgtgcgta tcaatagctt agaacattat caatggctgt 120 gatttacagg gcacagtatt aaggagggcg cgtatgcgcg tccccatctc tgtacggggc 180 taggcgaacc cggctatatt tagcactcgt cagggccgct tggaggttac tcttagtatc 240 ggaaggatcg tctccttccc ggtgccttta tgctacggga gcgcagcttc ctagtatcct 300 ctctctgctc tctcctcaac ggactttccg tccgcgtccc tgtgtcgcgg cctacgaagg 360 tcctggaggt aagcttgctc tgaatgcttc taagtagtta gagccggcgt tacaataagc 420 tggacactcc aggaagatgt gcgttaggct ggcccacgcc cgcgggccag gtggcccgca 480 ggcg 484 // ID hAT-N11_CR repbase; DNA; CHL; 268 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 17-AUG-2004 (Rel. 9.07, Last updated, Version 1) XX DE hAT-N11_CR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; 8-bp TSD; KW hAT superfamily; hAT-N11_CR; hAT-N11a_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-268 RA Kapitonov V.V. and Jurka J.; RT "hAT-N11_CR, a family of nonautonomous hAT transposons from the RT green algae genome."; RL Repbase Reports 4(7), 202-202 (2004). XX DR [1] (Consensus) XX CC hAT-N1_CR is a nonautonomous DNA transposon. Its copies are CC flanked by 8-bp target site duplications and contain 15-bp CC imperfect terminal inverted repeat (2 mismatches). This family CC is composed of ~20 copies ~95% identical to the consensus. XX SQ Sequence 268 BP; 52 A; 56 C; 93 G; 67 T; 0 other; cagtgctcgt atcttccatc tacgctgtag aaatttgccc caaatccgcc gaaggcggag 60 aaattttttg gggtttgggg ccgcgcatct acggagtgta gacgttttgg gcggatttgg 120 cggtatctta gacagggagc gtagatgttt tgggggtttg gggggcgtgc ccctagggaa 180 aattttctgg gcgtagatgg cgcgggaggg gtcatctacg ctgtgtagac gccgggggcc 240 taaagccatc tacaagatac cggcactg 268 // ID Gypsy-4-LTR_CR repbase; DNA; CHL; 363 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-4-LTR_CR is a long terminal repeat of Gypsy-4_CR LTR DE retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 5-bp TSDs; Gypsy superfamily; Gypsy-4-I_CR; KW Gypsy-4-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-363 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-4_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 6(5), 235-235 (2006). XX DR [1] (Consensus) XX CC It is a long terminal repeat of the Gypsy-4_CR LTR CC retrotransposon, whose internal portion is Gypsy-4-I_CR. XX SQ Sequence 363 BP; 61 A; 107 C; 107 G; 88 T; 0 other; tgtagtgcca agtctggggc ttacaacgtg ggacgtctga gtccgtggca ctcctggtac 60 gtgccggcag agtctctgtc tgtctgtccg actctggggc gttacagctg gcgtaaccgc 120 ccggggcaca gctctggcgc tgtgtgcagc aatctgtgct gttagtgagg gttctgactg 180 tgtcctcaac atcgggtttc tgtaactgta tacgagtctt ccgacaagct ggcacctgcg 240 ctggcgtcgg gcatggtagg gtcgggtgaa gcctccggcg gctacaccga ctacttacac 300 ggccttgcct tccaaacccg gtagttctca ctacctccat acacacggtg ttccagcacc 360 aca 363 // ID Gypsy-9-I_CR repbase; DNA; CHL; 12404 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE Gypsy-9-I_CR is an internal portion of Gypsy-9_CR LTR DE retrotransposon - a fossilized copy. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; 4-bp TSDs; Gypsy superfamily; Gypsy-9-I_CR; KW Gypsy-9-LTR_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-12404 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-9_CR, a family of gypsy LTR retrotransposons from the RT green algae genome."; RL Repbase Reports 6(5), 241-241 (2006). XX DR [1] (Consensus) XX CC This is an internal portion of Gypsy-9_CR LTR retrotransposon. It CC encodes one 2689-aa polyprotein composed of gag, reverse CC transcriptase, PHD zinc finger, and integrase. This polyprotein CC is characterized by unusually long distance between the reverse CC transcriptase and integrase (~800 aa). The internal portion CC contains also a second ORF2 (pos. 9768-11813), which may be CC artificial because of the high GC content. XX FH Key Location/Qualifiers FT CDS 48..8459 FT /product="Gypsy-9-I_CR-1p" FT /translation="MDDDDAPSSRRPRTSGYTPASSAGATQGIDYERIAAH FT CERFDPSTHSVRDWLDNVEELAEEFGWDEARKLHLAAKRLAPTAKEWYRAW FT KLANPMASSTWEGFCDALRERWGVSDRELHSQLANCSQRRGETVREYSDRY FT LSLVTQLRIDYNRDPTHMHNFMRGLEPDIYDEVYIMEPHNLTAAIRKAVYA FT SEKGRSAPHRTGSGNGRGDGGSESDTRGRSDSRDGRRVRFDTRSPSVGPRY FT GTKFDSERRYDRDTRDRRPDQHTPNRAGTGGGGSAGNGNNRPGGSGASDVD FT RLSSRMARMQLLHRLDPSVNLYQSLLARAGPGRYHDHVQAADSDEDSDDGG FT AMSEEDYFSSSSSGGSTRHYPPSLRTPSYRGYDYTAPEKAASAYTKRVGDF FT EATTLPRKRVPVPVPGPTPMETDRSPARRFTAGHGEQRRERELAREREQAR FT EREGPAPRPTQAAAAAASTSAAMPRGGPGRANPSAPATARPTTPGRRPRGD FT KPPAASAAAPRPRTAAAGGTAAQAATPRPRTDPIPAIESGDEERKVADEIC FT SKINAYPISLGRALRISPAGIYSKLGGRMLGIARHVGGAQDARPAAAAATP FT NLNGSGRGGPTAGATARVNNCRSQDPPAFKAPVVTTADDWDVVETEVLLGR FT ADGRFMPIPAVVDTGASNTVVPRITLKRLSRQGDVVRSDVPFTNADGVRAP FT SCGLVRGLLISLGDMVCSVDAYVTEATNYGILLGSDFLAPIRADISYSRGL FT LEYTNDLGHRSARRIQFSRLSGPAAHLAHKRPAPTEQPMREAEEPEAPTRQ FT PPLAGGNGRVQPLSPFAVHQQQIAPPELSNLSATASLARAERYDQMLEALQ FT QERTTRVRIPAPQSGRRVLMMDLDSVPTAPADNPVVPPAEPMNIEEEWQRL FT LEGQPPGPPPPSTTVSVEEQRWHDLMSAASDAHERPRSQPPTPRRTPSAVT FT GHHNPQSFETPGRQAREARLPPDYSPTNSLPSPDLPPEALPLLHELAAGRG FT SIDKVERVARILKWTASHELEVLEAAAFLAGQRDLQLPGCDDEENEEWWHN FT LDAYSLEGTDISDDDEGTGDYLTQAPQALTLRRSPSVDDLMEADPSQAETP FT SLISASNSDIPRDPFSFPAWAKAWENPATATERGEGLIEPDGTTNQRGAAN FT PIAGLPTSTRAGARSPDQYVCKATLHPQPRRDGTDSDDDGPSLRYESWDKS FT GSEDEETAFCFFAGTIGRPSRSPSLAVPDRGTDPVATITQARPTRDQVLGM FT TPDQHLTAARAYGARLREARGPIRPDDERRQMLLGSIDPTLPPDAQALLAK FT TLLDNYDLFAVSNQDLSVTDWLEATIDTGDAAPVCTNPYRLSKGERDALEA FT EVQKMLATGVIQHSSSEWCSPMVMVKKRPAGSDKAGGGATDAPPAPPQWRA FT CIDLRAVNELTKPLRYPMPHVQTLIDTIGPPMGEKRIYSTMDLIAGFWQVP FT VRSEDRCKLAFQAPSGLYEFNVLPMGARQSPALFQRLLSMVLRPLLWYGPG FT ETTGRKCCALFIDDICVASPDALAHAQHLQDTFDCLRLANLKMAVKKCHFA FT TSKVDFLGHTIDQFGTYTLAARNADAIRNYPKLKTVRQIKAFLGLANYYRS FT MVPNFAVISRPLYDCLGKAGYRWELEQQEAFDRLKLALTTEPVLRAADVTR FT PFRLATDFSCNAVGACLSQVDDEGREYAVSYASKRLVGAETRWSSTDGEAY FT AAVWAVKKYHEYLACGRFELITDNSALTYITKAKDLTGKLARYALRLQGYD FT MIIVHRPGTRHGNVDGLSRLGHLEEDEEATGPTDHPPQQQDTAVDSPADGK FT LHLVGLAMAPQRARRLGGSLSTFLHGRTLSQEFDRVWSAQTPRGGARAFLA FT LRAPPLAAEGSPVVTAAPAAETAALAGDGGSAAAAKAPHSDATTKAAAHDN FT AAGLRIRPATAAGFQQALAARAQQASPLLSPGKRGRVGAASTAATASASLR FT TPSGAPQRRRRLPCAPQHLQDYTEKGGSDSEGTSPKCGRCQQGKPEWNMLL FT CDFPGCEQAFHTTCLQPPLRRVPEGDWFCPQHQQQRAPNQGGGAGGGQGKG FT GLTPNHPPAAEGGGSGAMLSGAANVGGGPATTAADPLSPRRKLKIVRRHLP FT AAAVTTGRAAARDIHEGELPPTAAAGTAAADLQAGPGSWTATADPPEMELP FT ASTQAHDLSCRTFVGAQEGAAGETKPSATTARAPPPELVAAAVARDSSDED FT PEWEPGSGSVSEAEVSEDIIAVDGVPRYSTRESAIWADTGLLHYLQHRSFM FT ERPDLTWNEFYKECARIISKARRYRWANGQLFRSGTRNRPEVRVLRLSERQ FT QALAEVHGLAHPGARGTFDLCRSRFWWEGMGHDCKVYVEHCSQCHPAQHVL FT LRNLPLRPLPIMQVHHRVNVDLSGPHVITPRGNQYIIVAIDAFSKYPLVGA FT LPNKESATTARWFWEHVVCHWGSVAVVMTDQGTEWQGQFAELLGRERIRHV FT RTGARCPQQNGQVERFMGVMRSALVRLCQTGEETEWDTHLPQVALSYRAAR FT QRSTGCSPALLLYGRELTLGQQKPPLDQQAAEPGEDVDSADEGEQEVAEAR FT RQAANARRSILDGAAAAARANMESAQERMKRDYANRTFKNAPPKPHTAAAA FT APPTDAIAVAGDTDSDRTVTPTRSPRRPTTTEAHTGDVALGGGAVEHARGA FT RGSNPGGRGQDRSSGRGRGRRGQGGSAPPASASPERSHESIPDDDPALSHL FT PSLAKDALVYRVTKGRTKLQRDTEGPYKWKMWNRTGTLALVEDQQGRQFSV FT PTAQLLVHRGAAR" XX SQ Sequence 12404 BP; 2572 A; 3952 C; 4070 G; 1810 T; 0 other; tggtttcgct accgggacct tttccctatc gacagccacc actgagaatg gacgacgacg 60 acgcgcctag ttcgcgccgt ccccgaacgt ccggatatac ccccgcatca tcggcggggg 120 caacccaggg cattgattat gaacgcatcg cagcccattg cgagcgcttt gacccctcca 180 cgcactccgt gagggattgg ctggataatg tggaagagct ggccgaggag tttgggtggg 240 atgaggcacg caagctgcac ctggcagcta agcggcttgc tcccaccgcg aaggaatggt 300 atcgtgcatg gaaactagcc aaccctatgg cctcctccac ctgggagggc ttctgcgacg 360 ccctacgcga gcgttggggt gtgtcagacc gcgagctgca ctcgcaactg gccaactgct 420 cccaacgacg cggcgagacg gtgcgcgagt actcggaccg gtacctgagc ctggtcaccc 480 agctacgcat cgactacaac cgcgacccga cccatatgca caacttcatg cggggactag 540 aacccgacat ctatgacgag gtctacatta tggagccgca taacctgact gcagcgatcc 600 gcaaggcggt ttatgccagc gagaagggcc gctcggcgcc tcaccgcacc gggagtggca 660 acggaagagg tgacggcggc agtgagagcg atacccgcgg ccgcagcgac agccgtgatg 720 ggcgccgtgt acgcttcgac acacgctctc ccagcgtggg accgcgctac ggcaccaagt 780 tcgacagcga acggcgctat gatcgagaca cgcgtgaccg ccgccccgac cagcacaccc 840 ccaaccgggc gggaacaggg ggaggcggta gcgccggtaa tggcaacaac cggcccggag 900 gcagcggagc ctccgacgtg gatcgcctga gcagccgcat ggcacgcatg cagctgctcc 960 accgtctcga ccccagcgtc aacctctacc agtccctact ggccagggca ggaccaggac 1020 gatatcatga tcatgtccag gccgccgact ccgacgagga ctcggatgac ggtggggcca 1080 tgtcggagga agactacttc agcagcagta gcagcggtgg atctacccgt cactaccccc 1140 cgtcgctgcg gacccctagc taccggggtt atgactacac agccccggaa aaagctgcct 1200 ccgcctacac gaagagagta ggcgactttg aagccactac gctccctcgc aagcgggtac 1260 cagtaccggt gcctggcccc actcccatgg aaacggatag gtctccagcc aggcgcttca 1320 cggccggtca cggtgagcag cgccgagagc gggagttggc acgggaacgt gaacaggctc 1380 gggagcggga aggcccagcc ccgagaccca ctcaagcagc agccgcggcg gcgagcacct 1440 cagctgctat gccccgcgga ggaccggggc gcgcaaaccc cagcgctcca gccacggcac 1500 ggccgaccac acctggtcgc cgcccccgtg gtgacaagcc ccccgctgct tcggctgcgg 1560 ctccgcgccc gcgcacagca gccgctgggg gaactgcagc gcaagccgcg accccgcggc 1620 cacgcaccga ccccattcca gcgattgaat cgggggacga ggaacgaaag gtggcagacg 1680 agatttgctc caaaataaac gcctacccca tctcattggg gcgggctcta cgcatcagcc 1740 cagcaggcat ctacagcaaa ctgggcgggc gcatgctggg catcgcccgg catgtgggcg 1800 gagcacagga cgcaaggcca gcggcggcag ccgccacacc caacctcaac ggttcgggtc 1860 gcggcggtcc gacggccggg gctaccgcgc gcgtgaacaa ctgccgctca caggacccgc 1920 cggcattcaa ggcacctgtt gtcacgaccg ccgacgactg ggatgtggta gaaacagagg 1980 tgttgttagg tagggcggac ggacggttca tgccaatccc agccgtggtg gacaccggcg 2040 ccagcaacac tgtggtgcca cgcatcaccc ttaaacggct gagtaggcaa ggagatgtgg 2100 tacgatcgga cgtgcctttc accaatgcag atggggtacg cgccccctca tgcggccttg 2160 tgcgaggcct cctgatctcg ctcggcgata tggtgtgctc ggtggatgca tacgtcaccg 2220 aagctaccaa ctacggcatc ctgctgggct ccgacttcct ggcgcccatc cgggccgata 2280 tcagctacag caggggccta ctcgagtaca ccaacgacct aggccatcgc tcagcacgac 2340 ggatccagtt ctcacgactg agcggaccgg ccgcccacct agcccacaaa cggccggcac 2400 ctactgagca acccatgagg gaggctgagg aacccgaagc gccgacccgc caaccaccct 2460 tggcaggggg caacggccgg gtacagccac tctccccgtt cgctgtgcac cagcagcaga 2520 tagccccacc tgagctctca aacctctccg ccaccgcctc cctggcgcgg gcggagcggt 2580 atgaccagat gctggaggca ctgcagcagg agcgaaccac cagggtccgg atacctgcac 2640 cacagtcggg caggcgagtg ctaatgatgg acctagattc agtccctaca gcacccgccg 2700 acaatcccgt ggtcccacct gcggagccta tgaacatcga ggaggaatgg caacgactgc 2760 tggaaggcca accgcctgga ccccccccgc cttcaacgac agtctcagtg gaagagcagc 2820 gttggcacga ccttatgtcg gctgcaagtg acgcccacga gcggccacga tcacagccgc 2880 caacaccacg acggacacct tccgcggtca ccgggcacca caacccacag tcctttgaga 2940 ccccagggcg gcaggcgagg gaagcacggc tgccaccaga ctacagcccc actaacagcc 3000 tcccatcacc cgacttgcca ccggaggctc taccccttct gcacgagctg gcggctgggc 3060 gaggttctat cgacaaggtg gagcgcgtgg cgcgcatcct gaaatggact gcctcacacg 3120 agctcgaagt gctagaggcg gccgccttcc tagccggaca acgggacctg cagctaccag 3180 gctgtgacga tgaagagaat gaggaatggt ggcacaactt ggatgcctac agcctggaag 3240 gcactgacat ctccgatgat gacgagggca cgggcgacta cctaacccag gccccgcaag 3300 cgctaaccct gcgccgctcg ccaagtgttg atgacttgat ggaggccgac ccctcacaag 3360 ctgaaacgcc gagcctgatc tccgccagca actcggacat cccccgagat cctttttcct 3420 tccccgcctg ggctaaagct tgggaaaacc cggctacggc gacggagcgc ggtgagggac 3480 tgatcgagcc tgacggcacc acgaaccaac gaggcgcagc caacccaata gctggtctgc 3540 ccacgtctac ccgagcgggg gcaaggagcc cagaccagta tgtgtgtaaa gcaactctcc 3600 atcctcaacc ccgaagggac ggcaccgact cggacgacga tgggccatcc ctgcggtacg 3660 agagctggga caagagtggt agcgaggatg aggaaacagc gttctgcttc tttgcaggca 3720 ccatcggcag gccctcacga tcaccctccc ttgccgttcc agaccgtggc actgatcccg 3780 tggccacaat cacccaggcc cggccaacgc gtgaccaggt actcgggatg actcccgacc 3840 agcacctgac cgcggcaagg gcatacggcg ccagactaag ggaggcgagg ggccccatca 3900 gaccagacga cgagcgacgg caaatgctcc tgggctccat tgaccccacc ttaccgcctg 3960 acgcccaagc cctactggcc aaaactctgc tggacaacta cgacttattt gccgtctcta 4020 accaagacct gtcggtcaca gactggctag aggcgacaat cgacacgggg gacgccgcac 4080 ccgtgtgcac caacccctac cggcttagca agggcgagcg tgatgccctg gaggccgagg 4140 tgcagaagat gctggcaaca ggcgtcatac agcacagctc ttcggaatgg tgctccccta 4200 tggtgatggt aaagaagcgg ccggcaggca gcgacaaggc gggcgggggc gccacggacg 4260 caccacccgc acccccccag tggcgggcgt gtattgactt gcgcgctgtc aacgagttga 4320 ccaaaccgct tcgctacccg atgccccacg tccaaacact aatcgacacc atcggcccac 4380 cgatgggtga gaagaggatt tactccacta tggacctcat tgccggtttc tggcaggtcc 4440 cagtacgcag tgaggatcgt tgcaagctag cgtttcaggc cccgtctgga ctctacgagt 4500 tcaacgtcct tcccatgggg gcccggcagt caccggcact gttccagcgc ctcctgagca 4560 tggtgctacg tccactgctc tggtacgggc ccggggagac tacgggccgg aagtgctgcg 4620 ccctgttcat cgatgatatc tgcgtcgcca gcccggacgc cctggcgcac gcacaacacc 4680 tacaagacac ctttgactgt ttacgtctgg ccaacctgaa gatggcagtg aagaaatgcc 4740 acttcgcgac gagcaaggtg gacttcctgg gtcacaccat cgaccaattc gggacataca 4800 ccctggccgc ccgcaatgcg gacgccattc gcaactaccc aaagttaaaa actgtgcgac 4860 aaatcaaggc gttcctcggc ctggccaact actatcgctc gatggtgccc aactttgcgg 4920 tcataagccg accattatac gactgcttag gcaaagccgg ttaccgctgg gagctggagc 4980 aacaggaggc gttcgaccga ctgaagttgg cactaaccac ggagcccgtg ctgcgtgccg 5040 cggatgtcac ccgccccttc cgcctcgcca ctgacttcag ctgcaacgcg gtgggcgcct 5100 gcctttcgca agtggacgat gaaggacgtg agtacgcggt ctcctatgcg agcaagcggt 5160 tggttggggc ggagacgcgg tggtccagca cggacggcga ggcctacgcg gccgtatggg 5220 cggtgaagaa gtaccatgag tacttggcgt gtggacggtt tgagctgatc accgacaact 5280 cagccctcac ctacatcacg aaggcaaagg acctgacggg caagctggca cgctacgcgc 5340 tgcgcctgca gggttacgat atgatcatcg tacaccggcc cggcacccgg cacggcaacg 5400 tggacggact tagtcggctg ggacacttgg aggaggacga ggaagctacg gggccaacgg 5460 atcatccgcc gcagcagcaa gacaccgccg tcgactcccc cgctgatggg aagctgcact 5520 tggttgggtt ggcgatggca cctcagcgcg cccggcgcct tggtggcagc ctatctacct 5580 tcctgcatgg ccgcacgcta tctcaggagt tcgaccgggt gtggtcggcc cagacacccc 5640 ggggcggggc gagggccttc ctggcactgc gagcaccgcc tctggccgca gaaggtagcc 5700 cggtggtgac cgccgcccca gcggcagaaa ccgcggccct ggccggcgat ggggggagcg 5760 cggcagccgc aaaggccccg cacagcgacg caaccacaaa agctgcagcc cacgacaatg 5820 cagcagggct acgtatccgg ccggccacag ccgcgggatt ccagcaggcc ctagcagcgc 5880 gggcacagca ggcgtcccca ctgctgagcc caggaaagcg gggccgtgtt ggagccgcgt 5940 ccacagcagc cacggcgtca gcgtcccttc ggacgccctc gggggctccg cagcgccggc 6000 gtcggttgcc ctgtgcccca cagcacttac aggattacac cgagaaaggg ggcagtgaca 6060 gcgagggcac tagccccaag tgtgggcgat gccagcaagg gaagccggag tggaacatgc 6120 tgctttgtga cttccctggc tgtgagcagg catttcacac cacctgctta cagccacccc 6180 tgaggcgcgt ccccgagggc gactggttct gcccccagca ccagcagcag cgggctccga 6240 accaaggcgg gggcgcggga ggcggccagg ggaaaggagg gctgacgcct aaccatcccc 6300 cggctgccga aggtggggga agcggcgcca tgctcagcgg cgccgccaac gtaggcggcg 6360 gtcccgcaac cacagcagcc gaccccctgt caccacggcg aaaactcaag attgtgcgac 6420 ggcacctccc tgctgccgca gttacaaccg gcagggcggc ggcgagggac atccatgagg 6480 gagagcttcc cccaacagcg gctgccggga cagcggcagc agaccttcag gccggacccg 6540 gcagctggac ggccactgct gacccacctg agatggagct ccctgcttcc acccaagcgc 6600 acgatctcag ctgtcgaacc tttgttggcg cccaggaggg ggcggcgggg gagaccaaac 6660 cctccgccac cacagcacgg gcaccacccc ccgagctagt ggcggcggca gttgcgcggg 6720 actcctcaga tgaggatcct gaatgggaac caggcagcgg aagcgtatca gaggcagagg 6780 tttctgaaga cataatagcc gtcgatggcg tgccgcgcta cagcacccgt gagagcgcca 6840 tctgggcaga cacggggttg ctgcactacc tgcaacaccg cagcttcatg gagcggccag 6900 acctgacctg gaacgagttc tacaaggagt gcgcccgcat tatcagcaag gcaaggcgct 6960 accgctgggc gaacggacaa ctattcagga gcggcactcg caaccggcca gaggtgcggg 7020 tgctgcggct gtccgaacgc cagcaagcgc tggcggaggt gcatgggctt gctcacccgg 7080 gggcgcgcgg cacgttcgac ctgtgccgct cccgcttctg gtgggagggc atggggcatg 7140 attgcaaggt gtacgtggag cactgctcgc aatgccaccc ggcacagcat gtgttgctgc 7200 gtaacctgcc gctgcggccc ctacccatca tgcaagtgca tcaccgcgta aacgtggacc 7260 tgagtgggcc ccacgttatc acgccgcggg gcaaccagta cattatagtg gcaattgatg 7320 cattcagcaa gtacccgcta gtgggcgcgt taccaaataa ggagagcgcc acgaccgcgc 7380 ggtggttctg ggagcacgtg gtgtgtcatt ggggctcggt tgctgtcgta atgactgatc 7440 aggggaccga gtggcagggt cagttcgccg agctgctggg gcgcgagcga atccgccacg 7500 ttcgcacagg cgcccgttgc ccacagcaga atgggcaggt ggagcgcttc atgggcgtaa 7560 tgcgctcggc tctggtgcgg ctatgccaaa ccggagaaga gacggagtgg gatacgcacc 7620 ttccccaggt cgccttgtct tacagggcgg cgcggcagcg ctccactggc tgctcgccgg 7680 cactgctgct gtatggaagg gagttgaccc tgggacagca gaagccgccg ctggaccagc 7740 aggcggcgga acccggcgag gatgtggatt cagcggatga gggcgagcag gaggtggccg 7800 aggcccggcg gcaggcggcc aacgctaggc ggtccatttt ggacggcgcc gctgccgctg 7860 cacgcgccaa catggaatca gcacaagagc gtatgaagcg cgattacgcc aatcgcactt 7920 tcaagaacgc tccacccaaa ccacacactg cggcggcggc agccccgccc acggatgcca 7980 tcgcggtggc aggggacaca gacagtgacc gcactgtgac cccgaccagg agcccgcggc 8040 gtcccaccac cacggaagcc cacaccgggg atgtcgcctt aggagggggc gcggtggagc 8100 atgcacgcgg agcccgaggc agcaacccag gcgggcgagg ccaggaccgc agctccggac 8160 gggggcgtgg taggcggggc cagggcggct ctgctccccc agcctcggcc agccccgaac 8220 ggagccacga gtcaataccg gatgacgatc cagcgttaag ccacttgccg agtctggcaa 8280 aagacgcgct ggtgtaccga gtgacgaagg gcaggactaa gttgcagcgt gacacggagg 8340 gcccgtataa gtggaagatg tggaatcgaa cgggcactct ggcgttagtc gaggaccagc 8400 agggccgaca gttctcagtg cccacggcgc agctcttggt gcaccgtggg gcggctaggt 8460 aaccaggggt ggaggcgggt ctgctgcgcg gcagcgacgg ggtgggcggt aaaccgccgt 8520 cggggaaacc gcctaaccac cgcgaagcac gagcagcgtg agatgttggt gtggagagta 8580 gtgactgaat acagtgtata ggtggccagg tattaaattg tttacagcgc aggcgcgtaa 8640 cctttgctgg agtaaaagaa ggcgggccca ttaataaagg gctaggctat tcaaaagcaa 8700 cctcacttgg ggcgattgcg tcaacgagac gctggtttat attgggctca gttggaaaga 8760 gggccgggag gaggaaccgt gttcgcgctt tatgacgtgt gtgcgcttta cgccctgcgc 8820 cccggctacc agcgcgggag gctactggtg aaaagaacaa caccaaacag gcttgacagt 8880 aagattgggc ctacggccac cctagagatg gaggggcaag aggactggca gttggcgcga 8940 acagcatgcg aggcaagata gccagctaag ccagcaaggg cgctgtagcc cagagcgcaa 9000 agccaccatt gctagtttga gtacttacgt tgataacgct tgagcagtaa ctaaatctta 9060 gagcatagct gaaagtagtt gagttgcaag gtctagaggg agtttgagga cagaagtagg 9120 cgcataggct gtcatgccgg cgcttttagg caggatggtt ggggcagcag cggccggcgc 9180 ggagggcgac tctccgcccc tgcagcacaa cgtctggcga gccagccagt tgacgcccga 9240 tccaccagct ccgttgcgcc ccggcgcaac gccagtaaac tcaacgacca ccgaatatgg 9300 cgggtcggcg agcagcgggg tgagatcagg agggggaggg aaacgtgtca gacgatgcga 9360 agtggggttc gattgtagag cccgtgagtt gtcgagttag ggtttaaacg agaacattaa 9420 ttgtcttgta ttatcggtga cggccgttgc agtgacgcgg tcctaacgca ttcgaatggt 9480 tgccgcctat gtgcgtagtg gacggacgac ctgccaggcc atgaccaggg catgccaaac 9540 aacggccgcc tcgagttcac ggccgatgac gaggtgcggc cgggaggcgc gctatgcaat 9600 gagccttcca agtatgattc aagtgactag aacgttggcg tttgtgagag gcatttggtt 9660 tgacgaatat ctaaacgcat ggggtgacgg acttgacgac cacgcaggcg cgtgcaacac 9720 agcggaatga gctgggtctc accagagacg agcaaaacat ggctgagatg ctggcgcgac 9780 cgctcggccc agagcagcga acagctttct gggcacgtct ggccgagcag accgcagcgc 9840 tggcaggaac cccggcggaa gccggggcgg cgtcaggggc gtgctaccga gccgccgacg 9900 ccaaggccgg cgcagcggcc gacgccgagt cggacgacct ggaggggggt gttggggcct 9960 tcttcggcgg cttcggagct gtggccggca cagcagcggc agtggctgca gccggaaggc 10020 acgccaacgc tagcagctta atggccggcc caaccagcag caccaccagc gagtcggcgc 10080 tcctcactcc ggaacaggaa gaggcacgcc ggcgggccgg agcaagccgc accccgggtc 10140 gggaacggag cgagcggatg tgggcagccg cgtcaacaat gcccagcagc cggctgcgcg 10200 gcagccagct gggcttcgag gtcgttcctc gggagagcga cggcctgctc gtcccctatt 10260 ggatgccgcc aatcaccacc agccgcggca ccccaacgcc ggcgccacca ccgcagccac 10320 cacagcagca gccgcgggag ccgagcgatt cgggcggctc cgtggtcaca ttcggcctgc 10380 tggactcccc cagcctggga tcgccgccca gcaccgccaa ccaggagctc caactggcag 10440 cacgcatgat ggttacctca ccgtcggaca gcttcgtgcg ggatacgtgc gcgctggtcg 10500 cagccctgca gcggcgcgag gaggctcgcc tgccggcagg ggcagagggc gcagcaggac 10560 acatcacgac cgtagcagct gatcactgcg gaacgcgggt ggaggcccaa gcggagaccc 10620 cggcggtacc ggggccggca gagccggtgg aggaccggcg acgaccgctc tccatgtctt 10680 caggctcgga gatgggtcct cagccgcagc ggcaaaaggc agcgccccca ccggccgccc 10740 tagtgcccca gccgctgact ctactgcgga gcattcagag tagctatctg tcgccgccag 10800 ggatcccact ggtgccgccg ctcatgatgg ccaaggcggt cgcccgctcg cagcagcagc 10860 agcagcagca gcagcagcag cagcagccgc agccgatgca gcagcgacag cagcgacaag 10920 tacccggaga gcaggagcca cggccgccca gctcgccgac tcggccgcag tcacagcggg 10980 cacccgcagg tgcggccccg tcagtaccgc agccgcgaca ggaggcacgg gcgcattcct 11040 accggcgtgc cgggtactcg cgctggcgct ctcggtccaa cccgggggca gcaccgccgc 11100 cggtgcaggc gcaggcacca ccgccggtgc aggcgcaggc accaccgccg gtgcaggcgc 11160 aggcaccacc gccggtgcag gcgcaggcaa caccgccggt gcaggcgcag gggccgccgc 11220 cggtgcaggc gcaggggccg ccgccgcctc gggcactgcc tcaggacccg cggcgacgac 11280 agggcggctg gctgccgctg agcgctgtgc ccctgccgtc acaagtgaag accgaaccgg 11340 tggaccatgg tggtaccagc agcgtcaacc gccggctgtt ccggccggtg caagcgacgg 11400 ctggtaccag cgcccagccg ggcgctgcgg agcagacagc ggcggtggct cgcgccccgc 11460 gtaacctggg tacaggcccc agccgcgagc caccgcgccc atattggctg gagcagggtg 11520 acggcgtgcc cctttggctg caaaatgtgg cgatggggct ggcagctcgg gccctgcacc 11580 tggcattccg ggagtcggag gacgccgggt gggtgcacgc gatgctccag gggagcgaaa 11640 tgccgcccgc caccgtcacg ctgcgccaca gccgaggccg cctggacgtc gtcaaccgcg 11700 aaaaagcgac gcgcctggtc gagacgctgc tggaggccgc actggtgacc gccacccagc 11760 acgagccggg atgggcctcc aagatcgcgg ccaccttcga ccgcggtgcg tactagaagg 11820 caatacagca gccagggaga gacaggctag ctgagttaaa cagcgtcggg acccacccgg 11880 cagactgcgg actaagcaag ggaacccaag cggactgcac gtaagggacg gtttagatgg 11940 gatagcactg tagggattaa ttacgacgga atcaggatga acggagatat ggcaaggcca 12000 gggtaaaggg gacactacaa ctgttaataa tgaggggagt agctaggagc gtctgggggc 12060 agggagccct tgaccgcagc acacggacgc gtgcaggggt acgtactagg ggtagcattg 12120 ggcgccacat gtctgagtgg caggagcagc ctaagagaac atctgtgcgg gatttattga 12180 aggtttgaat agccatactt gagcagtgga gagttacgtg ggatcaacta agggcaccac 12240 ttcatggggc ccgcttgagg atgagggaag gggtctgtgt cggataaggg agtcatgtgc 12300 tggatcacgg tatgtttaaa ggtgcgggtc tccccggctg gcgccactcg gtgccgaagg 12360 ggccgcactg gggcgcgcgg cgtaggtggg ggcgaaccga cagc 12404 // ID DIRS-1_CR repbase; DNA; CHL; 6454 BP. XX AC . XX DT 04-APR-2005 (Rel. 10.03, Created) DT 10-APR-2007 (Rel. 10.03, Last updated, Version 1) XX DE DIRS-1_CR is a putative DIRS retrotransposon - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; DIRS-1_CR; KW Interspersed repeat; putative DIRS superfamily. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-6454 RA Kapitonov V.V. and Jurka J.; RT "DIRS-1_CR, a family of DIRS retrotransposons from the green RT algae genome."; RL Repbase Reports 5(3), 49-49 (2005). XX DR [1] (Consensus) XX CC It is an incomplete DIRS retrotransposon. It encodes CC the DIRS-like integrase and remnants of reverse transcriptase. CC DIRS-1_CR harbors a copy of L1-2_CR, which is masked by "N". XX SQ Sequence 6454 BP; 950 A; 1639 C; 1742 G; 1076 T; 1047 other; agtgctgaag ctcaaatgct cgtcaccgta tcagctaact ttgagtggcc ttcaccgttc 60 ctcgcgccaa cagccccagc atgcttcagt tcagcatacc ctctcgcgaa gagggcggga 120 cgggtgggag ctcggttgcg gtcagttcga gccgatacgg agcgagcaat gctatggtaa 180 gagtttaagc aagtagtcaa aataggcgaa gcaagtattg acggcgaaca actgggcgaa 240 caacttaaca cctttcgcct cgatatgcgc tcaccactgt aacaggcaaa tccgaatggc 300 agtttattat tatcggtatt gcgagctcgg gcgagcgagc tatcgaaacc tgtgagttct 360 ctcggcacga accgtgtcac cgtgcctccg tcgcgcaccc aatcactacc ttcgtgtgtc 420 gcgcaggtca caccgttaag cggggctcgt tatccgcctc tcgggtactt aaacctcgcc 480 tcgagcagcg accttctgac tccccacaac tttctcagtg tcggtttccc gccggtgggt 540 ggctcgaagt cggatgcata agggtgcata actggctacc ggcctttccc aacagttcaa 600 gctttcgaac aagccctttt atccgggcaa cgacggccag cctacccaac gcgcctccac 660 tggcggaata cggcgtgagt ggttaggggt cgagctgttc ttgcggtcgg ttactgaccc 720 gtttccatgt gcttgcaggg gtacctggct cagctgcggc tggggcaggt cgcttccctg 780 gtggttgatc ccggtgcaac acctgatgcc accgagcggc atgccgttta ccaggcgggg 840 ctggaggccg cgcacgagct gcgtcagtca gctttggcct ctgcgactgt cgccaggcaa 900 aacaaagccg cggcggagtt gtggggctgg ctccaacgct accggccggg cgtctcaccc 960 gctacctgtt ccgacgagga cctagcggca tacgtcatga aggggtggct tcccaaccac 1020 cattctcgcg cttcggcgga cgtgccggcc ggtcccagcg ctctgaaatc tcacctatcg 1080 tcactgagcg gcttcttcaa ccgcatcgac cgcggaggcc cctacaactc gtctactagc 1140 cgtggcaacc cttgcgagtc ttctatcatc gaggacctgc gcactgctta ccagcgggag 1200 caggtgcagg cgggttatgc ggaggtctct gcggtgccca tgacggaggc caagtaccgc 1260 gctctggcct tctacctgtg ggctcagtac agcgcggcag ccaccgcaat cgagcggcta 1320 gtagcgcttc gagaccactt atgcgtgttg ctgctgtggc agacggctgt gcgggggcac 1380 gacttgggca agctgggcac cggcgacttc gtaaaccccg acaacacggc gctgcccttc 1440 cagggtttcc cgctgctgcc gccatggcag tggggctcgt acctcggccc catcctgtgc 1500 ttctgcgagc gcggaaccaa gactcacaag ctggcgcgcg cacctccgat cttcctgatg 1560 cccgatgtgg cggagcctcg tctcagcata ccgcgcgtac tggcgctgta catggcgctt 1620 tgctctgctg ccgatgcgcc tcctggtagc gcagtggcgg acctgctgtt ccggcctctc 1680 gccccggatg gaaagaggtt taaggagact cccctgagca gttctgccat gggtgctcgg 1740 ctccgcatgc acctggttgc agctggtctg tacggcggcg aaaccgtgca cagctttcgg 1800 cgcggttcat tgcagaatgc gcaggcggtg gggttgccac cgtcttcact gttagacttg 1860 gggcagctgc gcactcccgc cgtgctagag cgttacttgg actctagccg tcacttaggc 1920 cgcgacgtgc gggctcgggt agaatgaatg tacagatggc aggagtggtt gcgaagtcta 1980 gggtgtgcaa gccatcgact gaaccgtgga tgtatgggag tgcaaaggga cagggtttag 2040 gcaggtggat gtatgggaga ggtggtggac acacacgccg attgcgtgtg acggtaccaa 2100 cgaccgcgtg ctttggagtg cccgggtgct gggcatggga agnnnnnnnn nnnnnnnnnn 2160 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2220 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2280 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2340 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2400 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2460 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2520 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2580 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2640 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2700 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2760 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2820 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2880 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2940 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3000 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3060 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3120 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3180 nnnnnnnnnt ctttgaagtg ggatgaagag gtgacggtac ggtagcgcgt ctgtgattgg 3240 tttgcttgag gctagatatg ccagactaca cgaggcacgc ctcgcaacta ttcggtggcg 3300 gcacgttgga ctgaaagggc gggtgggggg aggcttttga gagcaattag gttatgcagg 3360 tgcctgctat gttcaaactc aagcgcagtg taaagcgtag tgtgatgaca ataagcttaa 3420 gcttcagcta ttgacagggg atgccttctg ggcatttccc accgtactgc ccattgccga 3480 gcattccagt caccttgagt gctgaagctc aaatgctcgt caccgtatca gctaactttg 3540 agtggccttc accgttcctc gcgccaccag ccccagcatg cttcagttca gcatacccaa 3600 cgacaccgcg ccggcggctc tcagcgtaca ttcagcgggc catgtaccac tcctgaaacg 3660 ctgcaatgcc ttcgctcctc acgtacaccg cctccaccgc atagtgcgga gcttccggcc 3720 gcgggttggc cggcacttgt ctgcccggca cttgcctgcc tgacacagcg cctgaaccgc 3780 tccggcagcc gactgagttt gctcacccac acgactggcc acaccggcac gatggtaaca 3840 cagctcaccc gcttatcctc cactttcagc actactcttc ccatctcatc aaatggcgca 3900 ttgacgacaa cacgccgacc tgcgcccact gagctcagtc ttacgcgaac gcattcgccc 3960 ctgaagtgtt ggggctccag aagcgggaac agaagcgacc cggcagcgtc cccactcatc 4020 ctcctatcta gtcacgccgg ctgcagcccg cccaaccaca cgcgcgtgga ggggcgtgca 4080 tgccactgcg ggctcaatgg ccgtgatgcg gcccgccacg cgcgcagtgc aggtaggaga 4140 tggtgctggc ggcttgccga gtccaaggcg tgggggcgct aaatgcccac cagtcaggac 4200 gcaacctcgc gcggccagcg gcccgccgcc caccagcacg ggcctctccg gtggcggtca 4260 tgatggtctg gctgttgcgc atggaggccg cgtcgagctg ttcgcggggt ggcccagagt 4320 agtctgaagg cgctgcaaac gtcgggcagg cagggtagca gcagcggcct ccgaggggtg 4380 acgagtgttt cgtcggggta ccgatttgac ctacgcgcgg tggcagtgtc ttgttgcgcg 4440 agcgccgtgg tggatgccgc actcgtcagc tgttgcggca gcgagcctcg cgcgtgaagc 4500 tgtcgtagca ggctgctcaa gaagtcagcg ttgtccagca gtaaggccag gctcgaggtc 4560 aagctctgaa gtagggtcag ggaaggcgcg atgcctatgg cgcgcgggcg accggcaccg 4620 gcgtcagcgc gtcgcggcag cggggcaggc caggcctagg cggcgagtgc aggtagaact 4680 gtggctgctg ctgcggctgc tgctgcggct gctgctggct gctgctgcgg ctgctgctgc 4740 tgggattggg ctgctgcggc ggccagtatg ggcctgcggc cggggcggac ctttgcggga 4800 agccgagcgg cgccgccagg aggccctcct agaccagggc gcggccgcaa taaggcaggc 4860 gccgcagcct tcagctcagg ctgcgctatg gctggggcgg aggtactgcc gcaggccggt 4920 ctgcgccgcc agtggcgctg cggcgagggg tgcgcggccg gagcgggcgg ctccacgact 4980 gaagcggatg gcgctgcggc cgaaccggct tgggcggcaa gcccagggct gctgggtggg 5040 cagcgggccc gcttggtctg ctgcaagctc gatgaggcca ccttggcccg ggatcatcgc 5100 cggcggtgct gctgcggcgt tgatgtcggg gccagcctcc gttgctaagg cagctgcgag 5160 cggcggctgc tggtgaggcg cggcccctga gtcctgccca acccctactg cctcggcctg 5220 caagggaaaa gggggaggtg gcagaacgtg ccgtcagttg tatgcgtggg ctggaagagg 5280 gcgcggcctg gctgcggcat tgccgaagac atttgcccgc tgtacacgag cgaacggtgc 5340 tgctcaatcg gaacgcgggg tgctgccccc caagtcgcgt gtgcgggtgc tgcccgagac 5400 tcggcctctg tcttgggttc tgccctcgta cgtacgacga tgcggaagcg ccgaagaacg 5460 cgtgcgggtg ctgcccgaga accggtcaga tgggttctgc ccaaggggcc tgcctgctag 5520 gcgcgaacta gggtactgcc ctgccgtaat gcgggtactg ccctgtcgta atgcgggtac 5580 taccctgtcg aaagcgtgcg ggtgctgccc gagttgctag acgctgggtt ctgcccagat 5640 cagcgcgggt tctgcccggc ttaccgcaag cgcttcatct tctccagtcg cggcgccagc 5700 gcaggaggcg tcggccgcgt cggctgagtg gaacccggcg cgtacgttgc ggcctctgcg 5760 gggggtcaag gggcgtcaaa aggcgcctag gctctagcgg ctgccggagg tcaatctgga 5820 catgtaagaa atgggcttca gcgcgccggc tggggccctc tgaggctgtt tgctacacgt 5880 tcgcgcgcaa gcaccatcgc agctgcgccc ggctcctcgg ctcccttgcg cctgtgcgcg 5940 ccgaccaatc gacgctgaca aatgactgct cgtgtgtttc aggatctccc tttcccgcca 6000 gaggctatag ttttctggcg atgccgatta cccctgcacc caaggacacc gcttcagcgg 6060 ctctcctcgc tgacgctctc ttacggcaaa acctagggcg cctcctaaac cgctcctagc 6120 gctaaacctg acgtgttcgc gacagagcgc ttttgagagc aattaggtta tgcaggtgcc 6180 tgctatgttc aaactcaagc gcagtgtaaa gcgtagtgta atgacaatag gcctagggcg 6240 cctcctaaac cgctcctagc gctaaacctg acgtgttcgc gacagagcgc ttttgagagc 6300 aattaggtta tgcaggtgcc tgctatgttc aaactcaagc gcagtgtaaa gcgtagtgta 6360 atgacaataa gcttaagctt cagctattga caggggatgc cttctgggca tttcccaccg 6420 tactgcccat tgccgagcat tccagtctcc ttga 6454 // ID NonLTR-5_CR repbase; DNA; CHL; 5274 BP. XX AC . XX DT 31-MAY-2006 (Rel. 11.05, Created) DT 31-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE It is a putative nonautonomous non-LTR retrotransposon - a DE consensus sequence. XX KW Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW L1-3_CR; NonLTR-5_CR. XX OS Chlamydomonas reinhardtii OC Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; OC Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas. XX RN [1] RP 1-5274 RA Kapitonov V.V. and Jurka J.; RT "NonLTR-5_CR, a putative long nonautonomous non-LTR RT retrotransposons in the green algae genome."; RL Repbase Reports 6(5), 259-259 (2006). XX DR [1] (Consensus) XX CC This transposable element is a putative non-LTR retrotransposon, CC characterized by ~15-bp target site duplications and frequent CC truncation of 5' terminal portions. Despite its length this CC element does not encode recognizable reverse CC transcriptase/endonuclease. Most likely it is a nonautonomous CC non-LTR retrotransposon. Although NonLTR-5_CR contains long two CC long ORFs in different frames, they seem to be meaningless. CC NonLTR-5_CR (pos. 3468-5270) is 69% identical to L1-3_CR (pos. CC 456-1990). XX SQ Sequence 5274 BP; 730 A; 2288 C; 1284 G; 971 T; 1 other; ccccgccgtc gccgcagctc ccacgcccgc gcctacagcc agccccagcc ccagcacccc 60 cagctccacc gcctgcactc ccagccccca cgccgcgctc cccgccacct cccccgcgcc 120 ccgccccctc gtgctcgact atccacccct tccactgcct cggcctgacg ccaccggcat 180 catctatggc atcactttcg ccgctgaccc gcgcagccgc gtctacatcg gcagcaccgt 240 cggcaatctt accgctcgcc tacgcgggca tgttgccgct gcgcgcggcg cccacaccgc 300 tgctacgccg cacctacggc ccatcctgcg caacatcatc cagtacggca ccgacaccat 360 gtacgccacc ccgctcagcg tcgcccccaa acgcatcccc cgcagcaagg acgaccccaa 420 ctggattgca tggtcgcgcc gccttgaagg cctgtggacg cgtgccgttg atgctttcgc 480 tgcaagaggc ctcaatgtcg cgcgcatcgc ccgcgccctg gccaacccca ccggactcga 540 cttccttttc gccactggcc ctcccccact cccgcccgcc ctcctccccg acgacggcca 600 accccgacgg ctttttgcct ctcgcaactg ggcccgccgc atctccgttg tccaccacca 660 catgcgcgcc ggccgcatcg ccctgcccga ccgcctccac agcctggcca cctacagcat 720 caactcattg cagcgcatgc ttctcgcttt gcgcagcctg gccccatcct cccttcaccg 780 cacagcaacc ccagccgaca tcgcagccat tcgccaggtc ctcactgccg cccacgccgc 840 cgcgcagcgc cgccagcttg acgcccgccc caagcagccc ctggtcctcc gcctcccagc 900 cccgcttcag gctcacctcc tcgccgccga cgccagagcc ctcctcgcct cgccaggcat 960 tgcccgcttc atgccggacc gcatgcccgc cgtctccacc gtcaccttcc tcgcacctcc 1020 gtccctgcgc cacctcctct gcaacgccgc aaaagccagc cgcgaccccg gcgccaccac 1080 ctcctgcacc tgccatctct tccccgccct tccccgcgac cctgcccacg gccacatctt 1140 cacccaccgc ctacagcccg ccttgacgcc ggccgccgct gccctctggt ccatgggagc 1200 caacacgcgc ccaccgcctg accccagcgc gcctctcccc acgtccctgg ccgacgacct 1260 gcggaagtca ctcttcggat actgccgacg cgccgcccgc gccgcacgca tcaccaccga 1320 ctccgccaca tcttgggtca acgccgccgc cgctgccctg cgcgccactg tcgacccttc 1380 cgccgccgcc tccgtcgatg ccgacatgct cctccacccc gcccccacac cgccgcctcc 1440 accggcattt ccgctgtccg ccgcagatgt cgccgccgcc accgccacct tccgcagcgt 1500 ccgccgcaac ctcatcatca cggtcatgga caagtgcccg gacaactttg tcgccgtgtg 1560 cccccaccta tatcatgcca agctcgccgc cgacttgcaa gcatccccct tctacgccgt 1620 cgccacccct gctgactact ccgccagcct taccgccgtc gggcttatgc tgcagccgct 1680 ggggctcccc ttccacgccg cccggccgcc acctgtcaac tacggcaccg gcaagtgcca 1740 caagaccccc ttcggttttc gctacatcac cgcctctccc gccatcccca ccaccgatgc 1800 cgccgtcgtc ctcaccggct tcctccgtac gctggacgcc accctgccga aactgtacgc 1860 agccctcttt ccgcgcctgc ccctgcggcc gtggcacgtc aacggcccct tcgtctccgc 1920 yagcctgctg cgccacgcgt ccctcgcccc caggccgatc aaccctctcc ccggcgccgg 1980 caccaccacc ttcacgccca caccccccgc cgccaccacc cccgcaccga ctacccccgc 2040 cccggacttc ccttgtgctg tctgcggccg caccaatggt cacggcctca tcctgtgtga 2100 tcaccccggc gctcgcaccg acggccgtgc cacatgcctc gtcggcgccc acggcgcatg 2160 cttacggccc gcaatcagca cacgcgccgc ccaccgcatg ggcacctggc actgccccct 2220 gcacgcgccg cagcctgttg aggaccccgc gacatcctgc ccctacactg gcaccctcac 2280 cttcggccgc acgatccggt acccgccccc tgcccctggg ccacccatcg ccaccgtcac 2340 ctacaccgcc tcgactgccg cgccatgtca gctgcccggc cctatctccg cgccggccgt 2400 gtgcacatat gacgtggaac ggctcttcac caacctgcct attcctacct gcatcgagca 2460 tttggtcaac ctcttctccc tcgtcctctc caccctgcat gatggagccg ccggcctcca 2520 gttctacccg tacaaccgtg cagccgaagc ccagtggcct accgaagccg catggcttgc 2580 ccgcgccagc cccaaatcca cctccgccgc ccgccgcggg cgccccccgc ctgctcccca 2640 tccgcatcga ggtctcgacg ctcacggccg atactatatg tggacgcccg gcgtcctccg 2700 cacggtgctg cgcgccctgc ttgcgcatgc ctacacctcc taccgcggcc agccctacct 2760 gcaaattcgc ggcgtcgcca tgggcgccaa cttcgcttcg tacgttgcca acctggtcct 2820 tgcctatgat gagttgcgct ggcagcacgc actctacacc cgcgtcttca tgccctccgc 2880 cagctccttg tgcgcggaag cttccctcgc ccttgatgtg ctactggcgt tccaagacac 2940 ccagcggtac accgacgacc tcctcggctg ctgcaacccc ttccttcccc acctgctgct 3000 gcaatcccaa tcgcttgccg gcatccctgg catctactcc accgacctca ccttggcggc 3060 ctccggcgcc acacccgccg ccggcgtcgc cacgccttac ctcaacttcg ccatcacccc 3120 ccacagcagc aacatctacg ggcacgtcgt gtacgaccta cagccgtacg acaaacgcga 3180 cagccccaag tttgcgcagc tcggcatctc ccgcttcacc ccgttctact cctgcatccc 3240 ccggcacgtg cgcttcaacg ttgtcatcgg cgccctcgtc acccttgcgc ggctctgcac 3300 caccctcggc accttcatca cgtcagcacg gcgcacgctg cggcggctgc atctgcgcgc 3360 ctacccgcgc cccttcctcc gcaaggccct catgcgcttc tatggtcggc atcgccaact 3420 gctgccgggt accatgacct cacgcgggct cctcagcctc ctgccgcccc cggatctgcc 3480 ccctcctccg ccggcggccc cgccgccgcc tccccctccg gcccccgcgc caccagtcgc 3540 cgccgccgtc gtcctgccgg ccctgcccgc accaccgctt cccttccctc ctgctggcca 3600 cgcggcctgg tgccacggcg acaacaacat cggctcaggc gacgacatgt ccatctcctc 3660 tgacagcggc agcgatatgt ccatctgtta gcagcagccc cagatgtgac gcagcagcgt 3720 cagcctcagc cagcacgaca gccgtggagc ggcgacctgg gcgacggtag cggcggtacc 3780 gagcggtgcg gtggcttgta gtaggtgatt ggggggtggg gcgccggtcc cggctgccct 3840 gaaccgctct gcggtgcggg gacggccgtg ccggtgttcc cgtgctgggc ttctggtttc 3900 cagttttacg gtagggacac aggtcggtgc taagtttggg gcttggcgcg cgtgtgggct 3960 ttttccttgt cggttcggcg caggtcgggt tcttgggttt gcgttagggg cgctggtttt 4020 cgggagtgtg ctgtcccgcc ggccgtgtgt ggggcgtgcg gtgcgtgatc gccggcttct 4080 cggccgtctt gtgggttgtg tgtgcgcgtg tgagtgcctg tttgcggcgg gcgcgtgttg 4140 gctgcgtggg cgggctcctt ggcgggcttt tcgtcctggc tgtgttttgc cctgtgtgtg 4200 tgcgcgtgtc ccctctccgc gctcccgtgc cgcagggctg ctggttttaa gacgccgcct 4260 cctgtcatgc caccggggag ctcttcgtga gcccgggccc catccagccg cgtacccaag 4320 tacccaccta cgtaccagcc ccagcgctgc cagcctctgc agccacccag cgtcccatgc 4380 cagctgtccc gtgcctctcc ccagctgtcc caccccagca gtcccacatc cgctgtcccc 4440 tcccagatgt ccctcccagc tgtcccttcc agtcgtccct tccagttgtc ccttccagct 4500 gtcccacccc agctgtccct tcggctccct tctcaaacgc agcttcagcg aaatctacag 4560 cctgcccgac tcccggcgtg tgcggccttc cgcccgcctt ccaaaacgca acagccctcg 4620 cctctcctcg ccagccgctc acagccgaaa ctctgcttct ttcttaatca gccacaaaca 4680 tcagccacaa acaatgcctc ggaaacaaca cctcggaaac ttcctcccat ctagtgcgcg 4740 gccacagctc ccctaccgaa cgtcgcagca cctatcctca ccatcagcaa cgcttcctca 4800 cctccggttg tgttgttggc gtgttgcacc gcattcggcc tgcccgccac aattgtcgta 4860 cacattcggg cccctcttct gctttgcttc gccttgttct tgcccctgct tgtcacactg 4920 tcgcccggcc attgttttgc tgcacgccgt tgcattacgc tagtgctcgt ctacacagct 4980 catccgtgga cgccggtcga ctgcctgcag gtgcgcatcg gctatgctat ccgcgcggtg 5040 tggcgtcgca ttgagtctgt gtggtgctat cgccttggcc gggcgcccgg agcgtgagga 5100 atgcgcgtca gcaaaacatc gacacaacct gccagctacc gtcggtacag taacagcgcg 5160 cgaacgacgc accaatgcct tgtcccttct tgctacgcat ttatgtgggc acgggtggga 5220 gccacacaca tcgggactac gctttcctgc cccttgccat gttccatcac gagc 5274 //