ID Copia-14_MLP-LTR repbase; DNA; FNG; 720 BP. XX AC AECX01002284; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-14_MLP_; KW Copia-14_MLP-I; Copia-14_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-720 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002284; Positions 21737 21018. XX SQ Sequence 720 BP; 172 A; 139 C; 131 G; 278 T; 0 other; tgttggagaa tggtattctc gagtgatgtt tcaagtctat taaggctgta gtttatttag 60 accaatccgg taagaaggaa aggtgaagag gttgtgaagg cttaagtaac atatcaattc 120 aagaggagga ttaaggaagg atgtccgctt ttggttagat taaaagagtt ttgtttaaaa 180 gttgtttcag taattgtttt acgttgtgtt tgttcaaaat gtgtcctggt tgcttttccc 240 ttatcaaccc cccctctctc ccttccgaat ctttttcccc atcgaggaat tagattttga 300 aggtgagagt cttacaaaca tttttccctt atacaaaacc attactaaaa attaactttg 360 ttcttttaca tactctggtt tctttttctt ttaatctttc tttcgtcttt acttctctag 420 tttttcttta cctttatatc ttcctcgcaa cctcgtcagg ttcgtctaaa ggagtttagc 480 actcactttc atctgtcagc ccgtgtctcg tcaacaccag gctaagtgtc ttttacacct 540 ttttaaaggt tagtagttat tattgtgtgt gtgatatgat gatcgaggaa ttagattttg 600 aagtttttct ttacctttat atcttcctcg caacctcgtc aggttcgtct aaaggagttt 660 agcactcact ttcatctgtc agcccgtgtc tcgtcaacac caggctaagt gtcttttaca 720 // ID Gypsy-89_MLP-I repbase; DNA; FNG; 5604 BP. XX AC AECX01000270; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-89_MLP_; KW Gypsy-89_MLP-LTR; Gypsy-89_MLP-I. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5604 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000270; Positions 51012 56615. XX CC Positions [2701-3120] - Reverse transcriptase CC Positions [4405-4884] - Integrase core CC 'GAGAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 311..1348 FT /product="Gypsy-89_MLP-I_2p" FT /translation="MQVEDSVDPLTRIMARLEAMDARLSEETRRRELAEQG FT RAEAEQRLAAMQQSVNAPQSQPLHQPTIAVTAPPKAPKVATPDKFDGTRGS FT KAEIFSNQVGIYIMMNPAQFPNDKTKIGWTLSYMTGKGGEWAKPITQKLLS FT TDPSDSLTWDGFVKNFKATFYDSERVAKAESAIRALKQTSTVLAYSLQFRD FT LALIVKWQESVLITQFEQGLKREIQVQMVRDVFTSLDQISELAIKIDNKLH FT NRGDEGSLGIKELPAVDPDAMDCSALSFSISKDEYRRRSEGGLCFKCGRSG FT HIARACGTGNLRGGWRGRWRRNVESRISTTEVKEEESAKESNRADESKNGV FT ARG" FT CDS 2137..5508 FT /product="Gypsy-89_MLP-I_1p" FT /translation="MGNARNFDEGVCTLSSTLAPPQREFARTIPFIRSEIA FT GKQSSILQHSQIPTSRPIKSYLSKQPVLLTPQLDAARASWNVSAKLAAEKS FT ETKSSLSAAELVPKEFHEYLPMFEKSHSKVLPPRRPYDFCVDLLPGATPQA FT GRAIPLSPAENEVLDEMIKEGLATGTIRRTTSPWAAPVLFTGKKDGKLRPC FT FDYRKLNALTVKNKYPLPLTMELVDSLLNADRFTSLDMRNGYNNLRVREGD FT EAKLAFICKAGQFEPLTMPFGPTGAPGYFQYFVQDIFRNRIGRDMAAFLDD FT ILIYTKPGEDHEKAVKDALDTLREQNVWLKPEKCKFAQKEITYLGLKLSHN FT KILMDNDKVKAVTEWPTPANVSDVQQFVGFANFYRRFISNFSRIARPLHDL FT TQKKVPFEWTVERQEAFQALKDAFTRAPILKIADPYKQFVLECDCSDFALG FT AVLSQISNDDGELHPVAFLSRSLIQAEQNYEIFDKELLAVVASFKEWRHYL FT EGNPHRLSVIVYTDHKNLESLMTTKELTRRQARWAETLSCFDFEIRFRPGK FT NSTKPDALSRRPDHKPHDDVKLTVGQLLKPSNLPEDAFVSELDVVDNWFEE FT EEEDISYFFEVEEEVHAVDDVLEVEDDVDKLIWDDSVILNEVRKKLKKDER FT LSVLIATCEAQKEGVWEDYAYTGGLLYFKGLVEIPNDHELKRRIVESRHDS FT QLAGHPGRMKTLLLVKRCYHWPSMKAFVNAYVDGCHSCQRVKTRSTRPFGA FT LQPLPIPAGPWTDVCYDLITDLPESSGRNCILTVIDRLTKMCHFVPCNTTM FT SSEELAKLMVKHVWKYHGTPKSITSDRGNIFISKLTKELNQQLGIRTQSST FT 
AYHPQTDGQSEIANKAVEQYLRHFVGYKQDNWYELLDMAEFAYNNSPHTST FT GISPFKANYGYDLSYSRIPSKEQCIPAVEEMLSQLKEVQDELRESLHLAQK FT TMKDQYDKRHGKSPEWAVGSKVWLDSRHISTTRPSAKFLHKWLGPFVIAEK FT VSTNAYKLTLPESMSRVHPVFSVGLLRPYVPSTVSGQLQPPPSTIIIDEEE FT EFEVIAILDKRKRGMKTEYLVSWKGYGPEDDTWEPASGLKNARELVDEFDR FT KYPEADKNYRRTRRLK" XX SQ Sequence 5604 BP; 1758 A; 1188 C; 1365 G; 1293 T; 0 other; tattgtagca tctttattat tagacgccag agaagctgaa gtataagaag aaccgcaagt 60 aaaagtaatc attgaagaaa atcgaaaatt aaaagaagaa ttgaaagaag cacgattcag 120 aagtagaagt tgatctcata ctggaaacct taaattcaaa cagaagaaga tttaaacttt 180 aaaaccttat taatcccgat caatccccaa aacgccagac tatacaatcc ccggcagtcc 240 ttcgagctcc ttatctccga cgtacgcttc agctgtatcg accgaatccg aacttgagaa 300 ttcaatcaaa atgcaagtag aagactcggt agatccttta accaggataa tggctcgatt 360 agaggctatg gatgcaaggc tgtcagagga aactcgtaga cgggagctag ctgaacaagg 420 ccgtgccgaa gctgagcaac gcctagctgc catgcaacaa tccgttaacg caccacagtc 480 ccagccccta caccaaccaa caatcgcagt taccgctcca cctaaagctc caaaagtggc 540 tacacccgat aaattcgatg gcacacgggg tagtaaggcg gagatctttt caaatcaagt 600 cggcatctac ataatgatga accccgccca gttccccaat gataagacca aaattggctg 660 gacattgtcg tatatgaccg gcaagggggg tgaatgggct aaaccaatca cgcagaagct 720 actgagtacc gacccgtcag attctttaac ttgggacgga tttgtaaaga atttcaaagc 780 aacattctac gactcggaaa gagttgctaa agctgaatcg gcaatacgag ccctcaagca 840 aacaagcacg gtactagctt actcgttaca attcagagac cttgcgttaa tcgtcaagtg 900 gcaggaatca gtcttaatca ctcagtttga acagggattg aagcgggaga ttcaggtaca 960 gatggtgcga gatgtgttta cttcattaga tcaaatttcg gaactcgcta ttaagattga 1020 taataagcta cacaatagag gtgatgaagg gagtttggga ataaaggagt taccagcggt 1080 tgatccggat gcaatggatt gttcagcatt aagttttagt atttcaaaag atgaatatcg 1140 aagacgatca gaagggggtt tatgttttaa atgtggcaga agtggtcata tagcaagagc 1200 gtgtgggaca ggtaatttaa ggggaggatg gagaggaagg tggagaagga atgtagaatc 1260 aagaatcagt acaactgagg tgaaagaaga ggagagtgcg aaggagtcaa acagagcgga 1320 cgagtcaaaa aatggcgttg ctcgaggatg aaggttgttc cttcctcgag ctcattagtt 1380 gaagggtcgg atatagatat 
gggggtaatt gaagcagacg ttaactcact tgaaatgaaa 1440 gataattgtt tatttgctac tgtgcccatc tatgacccaa acttagagac aacccatttt 1500 gcccgcgcca tgcttgacac gggagccact cacgatgtaa tgaatgagtc ctttgtagat 1560 cgtaccaacc tgactaccac aaaacttccc aatcccaaac ccgtcactgg atttaacggc 1620 gcaagatcat caattacgca tataggacac tacatcttag acatcgacag tgaaggcaaa 1680 cccaccccgt ttttgatctc gcgtctgaag gactctatcg attgcatcat cggagtagat 1740 tggatcagaa gacatcataa gaaattagat tggaagaacc gtactttgaa gttggaccaa 1800 ggaaccattg cggctaatga gatagcctcg tcacgactga aaacaatccc ggattggcct 1860 tgagaggagt ccctgggaca cgctaggaat cgtaacgagg gggtgtgtat cgtgagtgat 1920 acgctaacgc ccccgcaatg tgagtttgat acattacctt tattcaatcc tgtcgaaaca 1980 gctggcaagc ttgatttttc cgcatctatc agtataaagc agcctacaac gcacaccgac 2040 gatttcgaac atcaggacaa aaagcttacc gttgcggctg gtgaaccagt gtcgtcagaa 2100 ccgaaaaaca ccccaggagc accaggaagg gtacaaatgg ggaatgctag gaattttgac 2160 gagggggtgt gtactctgag tagtacgcta gcacccccgc aacgtgagtt tgctagaacc 2220 attcccttca ttcgtagtga aatagctggc aagcaatcct ctatcctaca gcacagccaa 2280 attccgacgt caagaccaat taagtcatac ctcagcaagc aaccggtatt actcacaccg 2340 cagttggatg cagcacgcgc ctcatggaat gtttcggcga agctagcagc ggagaagtcg 2400 gagacaaagt caagtttatc agcggccgaa ctagtaccca aggaatttca tgagtactta 2460 ccgatgttcg agaaatccca ttctaaagtc ctacccccaa ggagaccgta cgatttttgt 2520 gtcgacctac ttcctggtgc gactccacaa gcgggtagag caatcccgtt gtcgccggca 2580 gagaatgaag ttctggatga aatgatcaaa gaaggattag ccacaggcac gatcagacgt 2640 acgacatcac catgggcggc cccggttctc ttcacgggaa agaaagatgg aaagttgcgg 2700 ccttgcttcg attacagaaa actaaatgca ctcacagtta agaataaata tcccctgcct 2760 ttgacaatgg aattggttga tagtttgctt aatgcggata gattcacatc actagacatg 2820 aggaatggct ataacaacct acgagttaga gaaggggacg aagcaaaatt agcattcatc 2880 tgtaaagcag ggcaattcga acctctcacc atgccgtttg gacccactgg agctccaggg 2940 tactttcaat attttgttca agacattttt aggaaccgta ttggaaggga tatggctgca 3000 ttcttggacg atatccttat ttacaccaaa cctggagagg atcacgaaaa agcggtgaag 3060 gatgcgctag atacactacg agagcaaaat 
gtgtggttga aaccggagaa atgcaagttt 3120 gctcaaaagg agatcacgta cttgggattg aaactttcgc ataacaagat cttgatggac 3180 aacgacaagg tcaaagctgt aacggaatgg cctacaccag caaatgtcag cgatgttcag 3240 caatttgtgg ggtttgcaaa cttttaccgg aggtttatca gcaatttctc tcgcatagca 3300 cgtccactac atgatttgac ccagaagaaa gtaccgttcg agtggacagt agaaagacaa 3360 gaagcatttc aggccttgaa agatgcattc acacgcgcac ctatcctcaa gattgctgac 3420 ccctataaac aatttgtgct cgagtgcgac tgctccgatt ttgcattggg agcggttctg 3480 tcgcaaatct cgaacgatga tggtgaatta cacccagttg cgttcctgtc cagatctctt 3540 attcaggctg aacaaaacta tgagattttt gacaaagagt tgttagcggt tgtggcctcg 3600 ttcaaagagt ggaggcatta tctagaagga aatcctcacc gattgagcgt gatagtctac 3660 acggatcaca aaaacctcga atctcttatg acaacaaagg aactcacgcg ccgccaagca 3720 cgatgggcag aaacattgtc atgctttgat tttgaaattc gttttcgacc tggcaagaat 3780 tcaaccaagc ctgacgcttt atccagaaga ccagaccaca agcctcacga cgatgttaag 3840 ctaacggtgg gccaattact gaagccttca aacctaccag aagatgcatt cgtgtcggaa 3900 cttgacgttg tggacaattg gtttgaagag gaggaggagg atatatcgta tttcttcgaa 3960 gttgaggaag aagtccacgc ggtagatgat gtattagagg tggaagatga tgtagacaaa 4020 ctgatttggg atgatagtgt gatattgaat gaagttagga agaaattgaa gaaagatgaa 4080 agactttcag tattgatcgc aacttgcgag gctcagaagg aaggcgtatg ggaggattac 4140 gcatacacag gcggactgct gtatttcaaa ggcctagtag aaattcctaa cgaccacgag 4200 ttgaagagaa ggatcgtaga atcgaggcac gacagccaat tggcgggaca tccaggccga 4260 atgaagacat tattattagt caagagatgt tatcattggc cttcaatgaa agcctttgtg 4320 aatgcttacg tagatggatg tcactcatgt caacgagtga agacccgatc aactcgtccg 4380 tttggagcct tgcaaccact gccaatccca gcagggccat ggacagatgt atgttacgat 4440 ttaatcacag acttaccgga atcaagtggt aggaactgca tcttaacagt catcgatcga 4500 ttgacaaaga tgtgtcattt cgtgccgtgc aacactacga tgtcgtccga agagctggca 4560 aagttaatgg ttaaacacgt atggaagtat catggaacac ctaaatcaat tacgtcagat 4620 aggggaaata tcttcatatc gaagctgaca aaagaactga atcaacaact aggtatccga 4680 acccagtcat caaccgcgta ccatccacaa accgatggcc agtccgagat tgccaacaaa 4740 gctgtcgaac aatatttacg gcactttgtt ggatacaagc 
aagacaactg gtacgagctc 4800 ctggacatgg ctgaattcgc ctataacaac agcccgcata catcgacagg catctctcca 4860 ttcaaagcaa attatggata tgatttaagt tactcaagaa ttccatcaaa ggagcagtgt 4920 atcccggcag tggaggagat gttaagccaa ctcaaagaag tccaagacga actcagagaa 4980 tccctgcatt tggcgcaaaa gacgatgaag gaccaatacg acaaacgtca cggcaaatca 5040 ccggagtggg cagtgggatc taaagtctgg ctggactcaa ggcatatctc cacaacaaga 5100 ccaagcgcta agtttttgca taagtggttg ggaccctttg ttatcgctga aaaggtgtcg 5160 actaatgcgt ataagctaac actgcctgag tctatgagcc gggttcatcc tgtgttctct 5220 gtaggattat tacgaccgta tgtaccaagc acagtaagcg ggcagctaca acctccgcct 5280 tccactatca taatcgacga agaggaggaa ttcgaagtca ttgcaatcct tgacaaaagg 5340 aagaggggga tgaaaacgga atatcttgtg agttggaagg gatacgggcc cgaagacgac 5400 acatgggaac cagcgagtgg attgaagaac gcaagagaat tagttgatga gtttgataga 5460 aaatatcctg aagcggacaa gaattataga aggacacggc ggttaaagtg agggcaatgc 5520 tttttcccca caggggtttt ttaatgctag cctggggaaa gacgtcagct cagcaagagg 5580 gagtgggacg tagaggggag gtag 5604 // ID Copia-1_AM-I repbase; DNA; FNG; 5332 BP. XX AC ACDU01003803; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Allomyces macrogynus genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_AM_; KW Copia-1_AM-LTR; Copia-1_AM-I. XX OS Allomyces macrogynus OC Eukaryota; Fungi; Blastocladiomycota; Blastocladiomycetes; OC Blastocladiales; Blastocladiaceae; Allomyces. XX RN [1] RP 1-5332 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Allomyces macrogynus genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; ACDU01003803; Positions 70216 64885. XX CC Positions [2195-2725] - Integrase core CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 302..5308 FT /product="Copia-1_AM-I_1p" FT /translation="MTVTMTSSVSVKTPISGQPATIEMLKADGSNWPTFAD FT RFLGFLTADQLEDLLELKSFKILVDADFEVVSLQQQEEMTDAQFKKLIELD FT SDVKFVADAIFWNGPAGAERKKCMAHNSYKAKSCIRSIIKVVLPSAVASEL FT LPVFSDATCVADAWALLRARFDKRSAQAHLANVKGLFTKPMSGSGAAAVRE FT HIADLQRRFDVIRRHERAKSSSHLRTDFADSFLVLALLSSLPPEYSPTVLS FT LGERYEEMAIVELQTILEGAADQLERSSGTETQALVVTKRKDTNACHNCGK FT IGHYVRFCPNRDHQRKNGAETANWRKKKGANVVAKDGDEHAMVHLSVDEYE FT RLMASARGADAAGVVFGAAEFDGFAHYPIVALCPEIVRLERQSHQGMATLP FT GTYPPIIHVDASNYEGPDPHHHRAHALMPLSARARLAQSPKVDSAAGAHFC FT GDKSLFATLNDMRPGEGFLVSLGDHRVCRPIGKGTMRIRTPSGNIVHLDPV FT WYIPGFKTLVSVSELYDRTAVRVIFGDRDVQFVHCETGDLLAKGMRHGDTW FT IVPGVCEKGPTKDARVLDVWGKRPVGAHVADAVKKVSLAELWHARLFHVGH FT DRVSAVCTLEHGIKDNVGVAKPLHECMTCLVANAHSQPHPLRHLRANRLFG FT VLHADLVSMKTVGMGGVNYFATVTDEYSRWCFVLLLRRKGDFGPQFLDLLK FT HAESFFGVSAGRLHSDHGGEFIGNSLRSILLSRRMHLSTTSPNSPESNGLA FT ERTQGVLKSMVRAAMTAAKAPDSLWPECVRAACYVRNRVPSDSLDGRSPYQ FT VVFGASPNMSKCKVWGSLGGAVVPTTQVRQGASGFAAGVAVRLVGYGDELP FT WVTRDGYRVWDGQKVFVTRDVKFAERLCVAESRVVAGAPRTLPSAKVKVGS FT PVRVDLGSLDVPSTVFDGVGGDEVVDEEIGRGARRDQVQVPMAVPDQLPAV FT NLIAELPLAGPALERDNLQVGERPGLPQVDQGPVGVLADVGEPGRQADAPE FT VSNGGVGALRPDAQAPVARQSRRLRGLQPEVARRFDPDAIARVRSTPDVGE FT VDPEPKRGVERAADAVEKVRVLRLNSDCTDASAPLVRSLAGPGVDSGSPLS FT GIVDLDLPTLFLRAHPDSDLSKFVESASAVLACVAAAQVGAQGAVWEPKTV FT EEALAHPDWRAPTFDEYNSLVENGTWVLVNRRPGNKVMRSGWIFKWKLGAD FT GKPVRAKSRLVAKGYTAIPGVHYTDTFAPVASFDTLRLILSLAAYLKVLPL FT QLDIKTAFLYGYIKERVLMEQPPGFVDKQNSDKVCMLVKVLYGTKQAPRMW FT FCRLADYLILLGFAESKYDKCLFMHKTDKGVIFIGVYVDDLPIVASTKELR FT KWIVDKLKSEFKVHDLGDAAWLLGCKITYSDEGSIYLDQSQYVKSMGKRFQ FT IPEKMRPVIPMRPNDFKELLELPKIIDPVVKTEYLAIVGSLLFAATKTRPD FT IAAATAILGRFMHAPGTLHLEAAKRLVGYLLGTLNLGIKFLGKDMKLDAFA FT DADWAKDRGDRKSTSGFVIRVGGSPVAWRSKKQTLVAKSTMAAEYVSASLA FT GDTILTIRALLEELGYKQDGPTPLMEDNESAERVAKDALLSSKHVEIHAHW FT LQDQVAKKTIDLRRVDTENQIADILTKPLAAPAFRKLRDMIGVAEVIAPRA FT " XX SQ Sequence 5332 BP; 1003 A; 1497 C; 1664 G; 1168 T; 0 other; accttgcgaa aggttatgag ccctgagcgc 
atcccaagcg ctcacggacc gaacgaacgc 60 tgctgatatc ctcgtcgtgg acccgctgct gctgtgctcg ccgatttctg cgagaaacgg 120 accgatttga ccgaccgacc gagtgggggc cacgctggac ccatccgtca ccacgacgac 180 gacgacgaca accgaaactc gaaccaccac cacccgaaac gtgctgcatc cgctcgagac 240 agtgcgacgc cgattttttc gactcgattt cgacaatttt tgccaacttt cgacctgcac 300 catgaccgtc accatgacct catctgtctc ggtcaagacg ccgatctccg gccaaccagc 360 gacaattgag atgctcaagg ccgatggctc gaactggccg acgtttgccg accgttttct 420 cggtttttta actgccgacc agctcgaaga cctgctcgaa ctcaagtcct tcaagatttt 480 ggttgacgcc gatttcgagg tcgtgtcact gcaacagcaa gaggaaatga cggacgcgca 540 attcaagaaa ctcattgagc tcgactcgga cgtcaagttt gttgccgacg cgatcttttg 600 gaacggtcct gcgggtgctg aacgcaagaa gtgcatggcc cacaacagct acaaggccaa 660 gtcgtgcatc cggagcatca tcaaggtcgt tttgccgtct gccgttgcgt ctgaactcct 720 gccggtcttc tcggacgcca cctgtgtcgc tgatgcgtgg gcgctgctgc gtgctcgttt 780 tgacaagcgt tccgcgcaag ctcatctcgc caatgtcaag gggctgttca caaagcccat 840 gtcgggctct ggtgctgctg cggttcgcga gcacattgcc gatttgcaac gccgttttga 900 cgtgattcgg cgccatgaac gtgccaagtc gtcctcgcat ctgcgcaccg actttgccga 960 ctcgttcttg gtcctcgcac tgctctcctc gctgccgccc gagtattcac caacggtgct 1020 ctccttgggc gagcgctatg aggaaatggc gatcgtcgaa ttacagacga ttcttgaggg 1080 ggctgccgac cagctagaac gcagctctgg gactgagacg caggcacttg ttgtgacgaa 1140 gcgcaaggac acgaacgcct gtcacaactg tggcaagatt ggtcattacg tgcgcttctg 1200 cccgaaccgg gaccaccagc gcaagaacgg cgctgaaacc gcgaactggc gcaaaaagaa 1260 gggcgcgaat gttgtggcca aggacggcga cgaacatgca atggtgcatt tgtcggtgga 1320 cgagtatgag cgtttgatgg cgtctgcccg tggtgcggac gccgctggtg ttgtttttgg 1380 tgctgctgaa tttgacggat ttgcgcacta cccgattgtc gccctctgcc ccgaaatcgt 1440 gagacttgaa cgtcaaagcc atcagggcat ggcgacgctg cccggcacgt acccgcccat 1500 cattcatgtc gatgcatcga attatgaagg accggatcct caccatcatc gtgcgcacgc 1560 gctgatgcct ttgtcggcgc gcgcacgtct tgctcagagc ccgaaagtcg actcggccgc 1620 tggcgcgcac ttttgcggcg acaagtcgct gtttgctacg ctcaacgaca tgcggcctgg 1680 cgaagggttc ctcgtctcac tgggtgacca ccgcgtgtgc cgtccgattg 
gcaagggcac 1740 gatgcgcatc cgcacgcctt cgggcaacat tgttcacctc gacccggtct ggtacattcc 1800 cggcttcaag acgcttgtct cagtctctga gctgtacgat cggactgctg tgcgagtcat 1860 cttcggggat cgagacgtcc agtttgtgca ttgcgagaca ggagatctgc tcgccaaggg 1920 catgcggcat ggggatacgt ggatcgtccc aggcgtgtgc gagaagggac caaccaagga 1980 cgcgcgggtg ctcgacgttt ggggtaagcg acctgttggt gcgcatgtcg ccgatgcggt 2040 caagaaggtg tcgctcgccg agctgtggca cgcgcgactc tttcacgttg gtcatgatcg 2100 cgtttcggcc gtgtgtacgc tcgaacatgg tatcaaggac aacgtgggtg tggccaagcc 2160 gctacacgag tgtatgacgt gtcttgtggc caacgcgcac agccagccgc atccgctgcg 2220 ccatttgcgt gcgaatcgac tatttggcgt cctccacgct gatttggtgt cgatgaagac 2280 ggtcggcatg gggggtgtca actactttgc gacggtcacg gacgagtact cacgctggtg 2340 cttcgtgctg cttctccggc gcaagggcga ctttggaccg cagtttctcg atttgctgaa 2400 gcacgccgag tcgttcttcg gcgtgtctgc gggccggctg cacagcgacc atggtggcga 2460 gttcattggc aacagtttac gctcgattct gctctcgcgt cgcatgcact tgtcgacaac 2520 gtctccgaac tcgccagagt caaatgggct tgccgagcgg acgcagggtg tgctcaagtc 2580 gatggtgcgt gcggccatga cggccgccaa ggcgccggat tccctctggc cagagtgtgt 2640 gcgcgcggcg tgctatgtgc gcaaccgtgt gccaagtgac tcgctcgatg gtcgctcgcc 2700 ataccaggtt gttttcggcg cgtcacccaa catgtcgaag tgcaaggtgt ggggctcgct 2760 tggtggcgct gttgttccga cgacgcaagt gcgtcagggc gcgtccggat tcgctgcagg 2820 tgttgcagtg cgcttggttg gctatggaga cgagttgccg tgggtcacgc gcgacggcta 2880 ccgcgtgtgg gacggacaga aggtgtttgt cacgcgcgat gtcaagtttg ccgagcggct 2940 ctgtgttgca gagtcgcgtg ttgttgctgg tgctcctcgc acgttgccgt cggccaaggt 3000 caaggttggg tcgcccgtgc gagttgattt ggggtcgctg gacgtgccgt cgactgtgtt 3060 cgatggcgtt ggtggcgatg aagttgttga tgaggagatc gggcgtggtg cgcggcgtga 3120 tcaagttcaa gtgcctatgg cggttccgga tcaacttccc gcggtcaacc tgattgccga 3180 gctcccactg gcggggccag cgttggagcg cgataatctg caagttggcg agaggcccgg 3240 tttgccgcag gtggaccagg gaccagtggg ggtcctcgcg gacgtggggg agcccggtcg 3300 tcaggccgat gcgccagaag tgagcaatgg gggtgttggt gctcttcgac ctgatgctca 3360 agcgcccgtc gcacggcaat cgcgccgttt gcgcggccta cagcccgagg tcgctcgccg 
3420 ttttgatccc gacgccattg cccgcgtccg cagtacgccc gacgtggggg aggtggatcc 3480 tgaacctaaa aggggcgtcg aacgtgctgc tgacgcagtt gagaaagtgc gcgtgttgcg 3540 gctgaatagt gattgcactg acgcttcggc ccctcttgtt cgttccctag ctggccctgg 3600 cgttgactca ggctcgcccc tgtccggaat tgtcgatctc gatttgccga ctctgtttct 3660 acgtgcgcac ccggactcgg atctcagcaa gtttgttgag tcggcaagtg cggtgctcgc 3720 atgcgttgcc gcagctcaag tcggcgccca gggtgcagtt tgggaaccca agacggtcga 3780 ggaagcgcta gcgcacccag actggcgcgc gccgacattc gacgagtaca actcgctcgt 3840 cgagaatggg acatgggtgc ttgtcaaccg ccgcccaggc aacaaggtga tgcgctccgg 3900 ttggatcttc aagtggaaac tcggggcaga cggaaagccc gtgcgcgcca agtcgcgact 3960 tgttgccaag ggatacacgg ccattccagg cgtacattac acggacacgt ttgcgcccgt 4020 tgcgtcgttt gacacgctgc gtctgattct gtcgctcgcc gcgtatctca aggtcctgcc 4080 gctgcagctc gacatcaaga cggccttcct gtacggttac atcaaggagc gcgtgctcat 4140 ggagcagcca ccaggcttcg ttgacaagca gaactcggac aaggtctgca tgcttgttaa 4200 ggtgctgtac ggcaccaagc aggccccacg catgtggttc tgccgtttag ctgattacct 4260 gattttgctc ggattcgcgg agtccaagta cgacaagtgc ctgttcatgc acaagacgga 4320 caagggcgtg atcttcattg gagtctatgt cgacgacttg ccgattgtcg cgtcaaccaa 4380 ggagctgcgc aagtggattg tcgacaagtt gaagtccgag ttcaaggtgc acgacttggg 4440 cgatgcggca tggttgctcg ggtgcaagat tacgtacagt gacgaagggt cgatttacct 4500 tgaccagagc cagtacgtca agtcaatggg caagcggttc cagatccccg aaaagatgcg 4560 gccggtgatc ccgatgcgac ccaacgactt caaggagctg ctcgagttgc ccaagatcat 4620 tgatccagtt gtaaagaccg agtacctcgc cattgtcggg tcgctgctct ttgcggcgac 4680 gaaaacgcgg ccggacattg ctgctgcgac tgcaattctt ggccgtttca tgcatgcacc 4740 tggtacgcta catctcgagg ctgcaaagcg gcttgttggg tacctgttgg gcacattgaa 4800 ccttggcatc aagtttttgg gcaaggatat gaaacttgac gcgtttgctg atgccgattg 4860 ggccaaggat cgtggcgacc gcaagtcgac atcgggtttt gtgatacggg ttggcggtag 4920 ccctgtcgcc tggcgctcga agaagcagac gctcgtggca aagtcgacca tggctgctga 4980 gtacgtgtcg gcaagcttag cgggcgacac gatcctcacg atacgcgcgc tgctggaaga 5040 gctcgggtac aaacaagatg gaccgacgcc gctcatggaa gataacgagt cggcggagcg 5100 
tgttgccaag gacgcgctgc tctcgtcgaa acacgttgag atccacgccc actggttgca 5160 ggaccaagtt gctaagaaga cgattgacct gcggcgcgtg gacacggaga accagattgc 5220 cgacattttg accaagccgc ttgctgctcc tgcgttccgc aagcttcgtg acatgattgg 5280 agttgcggaa gtgattgctc cgcgcgcgta gggaatgctt gccgtggggg ag 5332 // ID Gypsy-14_RO-I repbase; DNA; FNG; 4841 BP. XX AC AACW02000082; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_RO_; KW Gypsy-14_RO-LTR; Gypsy-14_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-4841 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000082; Positions 33236 38076. XX CC Positions [3272-3748] - Integrase core CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 137..4393 FT /product="Gypsy-14_RO-I_1p" FT /translation="MNRDQRSWFNEVLKGRSLTWSEVRKIIVKTYAAQDVA FT QELEYMDQLLSLKMLPTESIEAFTDRFQRIRRAAKWDDDIRTASIYKRALP FT GFLRQEVSRSLLNLGRDQQDSVTKVAAKARMVLSSNLCSESGSSNRQESGL FT KISSSPLLSKGTEASKYNPNNLETSNNQLGNGKRVSMGSMKNKFRCAIHGI FT ANHPTERCNKYKDLLKQNSSSVITPSAASNRSLSFVSVAKNCYRCLGNVPW FT SKEHAARCPRDKPYHGPTKAIRSVRLDTANSNGNKPNLTVTPQARPQQASS FT KVSSGDSNLMDVDDEGYPVSYDCKKLTKQNEVFKNTNHSLIVPITIENVNT FT MALVDSGSSFSSIDTKFVNKYNIAVDRNVSGSVILATSDNVSKRFGTTKFP FT LSVIYGDNDKNLIHTSHSFEVLPLSLDTEVVIGLDLMHKLNILVTNLAIRH FT PNLAPVIDKEITDDTPEPNKAPFGTLEQQAHFHNAIKPFVDQNALIPKNSF FT CTVPESVIRLDTVKGKTAYRAPYRTPFKLLPIMRECIDTWLKDEVIERASP FT NSDWNSPLTLAPKKDLLGNLTGHRPCLDPRLLNSILVSNDRHPIPKIEEIF FT DQLQGSTIFTTLDLRQAFHRFQIYEPDRVKTTFTFEGQQYQFRGCPFGLKH FT IASRYQRVINIVLRDLPYAQAFVDDIIIFSKSYEEHITHVQNIIQRLTKVN FT LILNPDKCHFAQSTVYLLGFCVDAKGSRLDPRKVTNALTWPRPSSGKEIQR FT FLGLVNYFRKYLPNISEVTAPLDKLRFEGKLDKLWTSEQESAFEKIKALLS FT SAPLLHHPDLEQPFYVATDASNYSIGAVLYQVIKNETRYIGFMARSLSSSE FT KNYSTTKRELLAVIFALKKFHPFLWGNPFTLYTDHKALTYLHTQPVANAMM FT INWLDTILDYNFKIIHRPGIQNILPDALSRLFEPEKTLEGDNKTIKTIVTS FT QIINSNGSILTSRMMMPADLMTPAPEDRQKLLMDTHLEGHRGAQAIVTALH FT SDGIHWTKLKEDALEIIRSCPDCQKFNIAKHGYNPLTSIYADAPWDHICID FT TAGPFPTSVQGNQYILLVVDVFTRYCVLKALPDKSSLTIALALRSILSLFG FT RPKIIQSDNGTEYVNEIVRLYVESSGIDHRLISAYHPRANGIVERWVGKAK FT NILHKRLQGRTEDWDLYVDSTQEALNNTHTALHGTRPFSLMFARRPNENKD FT YNNVLDKTKSPETMKQLETRINEFNDTVLPAIREKIKTSQAASRDKFNQTH FT RILTDIPTGSQVTLKNVNRVAKSDPLYVGNYTVKRKTQGGSYVLVDATGAL FT LPRDVPPSQIKVISQEVSLSNTDQSESYDVEAVLHHKGSPGNYLYKVRWKG FT YGEEDDTWEPASHFHDYRPIQKYWLRISEQEPAREVQLVPKKRYNQEKKKC FT ASQCN" XX SQ Sequence 4841 BP; 1526 A; 1043 C; 858 G; 1414 T; 0 other; atggaggggt aatgtctgga gaaaagatgc cgatgtacat gactccgtag aagacttact 60 tgacaccttc tccctaatca ttgagtcaaa tggtctatct gttgattcga gttggtctcg 120 cctggtccca atcaagatga accgagatca acgttcatgg tttaacgagg ttctaaaagg 180 gcgtagtttg acttggtctg aagtaagaaa gatcattgtc aagacctatg ctgcacaaga 240 cgtagctcaa gaattggaat 
acatggacca gctcctgtca cttaaaatgc ttccaaccga 300 gtccattgaa gcctttacag atcgatttca acgtattcga agggctgcca aatgggatga 360 tgacatcagg accgcttcca tttacaagcg tgctttacca ggctttttac gtcaggaggt 420 ctctcgttcc ttgctaaacc ttggtcgaga tcaacaagac tccgtgacaa aagttgctgc 480 caaagcacga atggtcctgt catcaaacct ttgttcagaa agtggctcat ctaaccgtca 540 agaatctggg ttaaaaattt cctcctcgcc tctcctttct aagggaacag aagcatcgaa 600 gtacaatcca aacaacttgg aaacctcaaa caaccaactt ggcaatggta aaagagtgtc 660 tatgggtagt atgaagaaca agttccgctg cgctatccat ggtattgcca atcatcccac 720 tgaaagatgc aacaagtaca aggatttgct gaagcaaaac tcttcctcgg tgatcactcc 780 ttctgctgca tctaatagat ctttatcatt tgtgtctgtt gccaagaact gttatcgatg 840 tttgggtaat gtcccttggt ccaaagaaca cgctgctcgc tgtcctcgtg acaagcctta 900 tcatggtcca actaaggcca tccgttctgt tcgcttggat actgcaaatt ccaatggtaa 960 caagcccaac ctcaccgtca ctcctcaagc tagaccacaa caggcgtcct ccaaggtgtc 1020 tagtggtgat agcaacttga tggacgtgga tgacgagggt tatccagtta gctatgactg 1080 taagaaatta acaaaacaga atgaagtatt taaaaatact aaccattctt taattgtacc 1140 tatcactata gagaacgtga acactatggc cctggttgat agtggttctt cttttagctc 1200 gatagacaca aaatttgtta ataaatataa tattgcagtt gatagaaatg tatctggttc 1260 cgttatactt gctacaagtg ataatgttag taaacgtttc ggtacaacca aatttccatt 1320 aagtgttatt tacggtgata atgataaaaa tcttattcat acatcccatt cttttgaagt 1380 ccttcctctt tccttggata ctgaagttgt aataggatta gacctaatgc acaagttgaa 1440 cattcttgtt accaacttag ctatcagaca tcctaatctt gcaccagtta ttgacaaaga 1500 gattacggat gacacacctg aacctaacaa agctcctttc ggtacacttg agcaacaagc 1560 tcatttccat aatgctatta aaccatttgt tgatcaaaat gccttaatac caaagaattc 1620 attctgtact gtacccgagt cagttatacg tttagatact gttaaaggaa aaaccgctta 1680 ccgtgcacca tatcgtactc cttttaaatt actaccaatc atgcgtgaat gtattgacac 1740 ttggttaaag gatgaagtta ttgaacgtgc atcacctaat tcggattgga attctccttt 1800 aacattggca cctaagaaag acttgcttgg taatctcaca ggacatcgac cctgtctcga 1860 tcctcgcttg ttaaattcta ttctcgtgtc aaacgataga catcctattc caaagattga 1920 agagattttc gatcaattac aaggatcaac tatctttacc 
actcttgatc ttcgacaagc 1980 atttcatcga ttccaaatct acgaacctga tcgtgttaag accacattca cttttgaagg 2040 tcaacaatac caatttagag gttgtccatt tggattgaaa catattgctt cccgttatca 2100 gcgagtgatt aatattgtat tacgtgattt accttatgca caggcttttg ttgatgatat 2160 catcatattt tccaagtctt atgaggaaca catcacacat gtgcaaaaca taattcagag 2220 attaactaaa gtgaacttga tccttaaccc agacaaatgt cattttgcgc agtctactgt 2280 ttacttgctt ggtttctgtg ttgatgccaa agggtctcga ctcgatcctc gtaaagttac 2340 aaacgctttg acttggccta gaccttcatc tggaaaagag attcaacgct ttttaggact 2400 ggtaaattat tttagaaaat acttgccaaa tatttctgaa gtcactgcgc ctttggacaa 2460 attacgtttc gaaggtaaac ttgataaact ttggacatct gaacaagaaa gtgcttttga 2520 aaaaatcaag gcattattgt cttctgcgcc acttcttcat catccagatt tagaacaacc 2580 tttttacgtt gctactgatg ctagtaacta ttcaattggt gctgttcttt accaagttat 2640 caaaaatgag actcgatata tcggtttcat ggcacgctca ctttcttcat ctgaaaagaa 2700 ttattctacc actaaacgtg agctattagc agtgatattt gcactaaaga agtttcatcc 2760 atttttgtgg ggtaacccat tcaccttata cactgatcac aaggccctga cttatctcca 2820 cactcagcct gtagctaacg ctatgatgat caattggctg gatactatac tggattataa 2880 cttcaaaatc atccatcgac ctggtattca aaatattctg ccagatgctt tgtcgcgtct 2940 ttttgaacct gaaaaaacgc tggaggggga taataaaacc atcaaaacaa ttgtcacttc 3000 acaaattatc aattctaacg gatcaattct tacatccaga atgatgatgc ctgctgactt 3060 gatgacccca gcaccagaag acagacaaaa gctactcatg gatacacact tagaaggaca 3120 ccgtggagct caagcaatcg taacagcttt acatagcgat ggtattcact ggacaaagct 3180 caaggaagat gcacttgaaa taatacgtag ttgtcctgat tgccaaaaat tcaatatcgc 3240 aaagcacgga tacaatccct taacctcaat ctatgctgat gcaccttggg atcacatctg 3300 tatcgacaca gccggacctt tccctacttc agtacaaggt aatcaataca tccttctggt 3360 agtggacgtc tttacacgat attgtgtact caaagccttg cccgataaat cctctctcac 3420 tattgcactt gcattaagga gtatcttatc gctctttggt cgaccaaaaa ttatccaaag 3480 tgataatggt actgagtatg tcaatgaaat cgtcagactg tatgttgaat catccggtat 3540 tgatcatcgt ctcatatcag cttatcatcc ccgagctaac ggtatagttg aacgatgggt 3600 tggcaaagca aaaaatattt tgcacaaaag attacaaggc agaactgagg 
attgggactt 3660 gtatgttgac agtacccaag aagccctcaa caatacacat acagctttac acggcacacg 3720 acctttctca ttgatgttcg caagacgtcc caatgaaaac aaggattata ataacgtgct 3780 ggacaaaact aaatcgcctg aaacaatgaa acagttagaa actagaatta acgagtttaa 3840 tgacactgtc ttacctgcta ttcgtgaaaa aataaaaact tcacaagctg cctctcgaga 3900 caagttcaac caaactcaca gaattctaac cgatatcccc actggtagtc aagtcacgct 3960 caagaatgta aaccgtgttg caaaaagtga tcccctctac gttggtaatt acactgtcaa 4020 aagaaaaact caaggtggtt catacgtgtt ggttgatgca actggagcac tcttaccaag 4080 agatgtacct ccatcgcaaa taaaggttat ttcccaggaa gtatctttat ccaatactga 4140 tcaatctgaa tcatacgatg ttgaggcagt tttacaccat aagggatcac cgggtaatta 4200 cttatataaa gttcgatgga aaggttatgg agaagaagat gacacttggg aaccggctag 4260 tcatttccat gattacagac cgatccaaaa gtattggtta cgtataagtg aacaagaacc 4320 agcgcgcgag gttcaacttg ttcctaaaaa aagatacaac caagaaaaga aaaaatgtgc 4380 atcgcaatgt aactaactct aaaagaaaca gacgttaaca cattgacctt cttccaccca 4440 actttcatta actatacata tatgttacca gcaaataacg caattataaa atatttacaa 4500 ggatagtaaa aaatggaatt ttttctcatg ccttaatatt tttcaatttg cacgtttgta 4560 cgcagtcaca tttacagaca aacatcactt tagcaataaa ttcctgttta gggtttatta 4620 tcgcctaaat gctttgttcc attcttgaac actatacttt atatgtttat ataaagagcc 4680 actatacgtt atatgacttc accctaccaa gcacctattt cttacaacaa atagagctca 4740 tcttgacatc tcaaacctca ttttgacatt cagttaaaca aacatctttt tacttgccac 4800 attaattcca gaaaagataa ttacaacctg gaagggggca a 4841 // ID Copia-60_MLP-I repbase; DNA; FNG; 5492 BP. XX AC . XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-60_MLP_; KW Copia-60_MLP-LTR; Copia-60_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5492 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR [1] (Consensus) XX CC LTRs are 96% similar to each other. The original sequence CC (Accession No AECX01000558, pos. 149273-142487), contains a 1396 CC bp insertion of a non-autonomous EnSpm element (deleted). Due to CC the reconstruction, it is listed as "consensus". XX FH Key Location/Qualifiers FT CDS join(159..1268,1272..4835) FT /product="Copia-60_MLP-I_1p" FT /translation="MSDREDLDSLNQLPTPATTPAPFSNLDESDAEIQSES FT HSETSNSTVHSDQTTVPVMASVPIDTDTPAQKQSIKVNSILHKITINTKLS FT ATNYNPWSDDVRFGLAAASYDQYLIVKDVPTVITDPEVILATKKCIFHWLL FT ASMESAQSTRFISMISTFENGIKNTPSSPFLLWKTVRDYHISNSESVKLML FT RSDITDLSQGSTKDLLEYIDMFRAKIDAYLGANGEMSEEEQARQFVRSLNR FT DWAEKGCDLLDAGHVKFRNLETELKKTYQTRKMFNSNRQQSSCTIEASEAS FT QGRKNGRWQTCSKNRCLGREHPTKPHEQSECYHHPNNSSKMEAWKKSKKQA FT GEWIEYPRGGRGSSNSRGRGRRDAHFSGSSNFTHGSDFPDSSELKSVFDGL FT RLEDCEISYHVSVDGKFSCSAKPQVACVGDQCNSVALIDTGASHHMFHDAS FT LFEVGTMIANEDPGAKLNLAGGGATLDIHSIGNVNLLNSKGEDIKLKECLY FT VPDLSRNLIAGGRLIRAGAITKVLEDPNFRIDHGKKELFIGKFIGEGSMMY FT VAIRSLVSHEFHQSSISKTNQFTILKLHYSLGHPSEKYLKRMWSLGYFNDV FT LPKTVTTKEFEIISKCPVCPLAKNHRLPFPFTRPRATNFLENVHVDLSGII FT RTTAVNQEEYYILFTDDFSSYRVSFGLPNKSAETVFECFKRYIAFAERQTG FT QPLKMFSLDGGGKFINGLLSPYLEELGIVARVTSPHTPEENGVAERSNRTI FT NTKARCMLIQSCVPIRYWYYAVSYSVMLQNRTITTSLNLKETPQSVWKKHK FT SSMKRFQPFGCLAYRHVRKEVRGGKFEPVSRPGVLLGATEDNHNFIVLDQE FT SNQIHISHDVTFQPLIFPFMKDADKGPDWIFIEDLPLDSAEEKQLVIEPNS FT TPNVLRYDSDEEDPVIQESLPPRITEVLNETPESELEPTDQIINQPQINEI FT PEKEPVTKQPVQEQPRRSTRERQPVNRYTPSNSTSSNFVNIPMRSIVPKIS FT QRYSESQHSYYCREAMRARAEPKSYKMAMKAPDADKWKEACDKEMKNIKEM FT GVWEIVDRPTNAPVVGGRWHFKYKINPDGSISKHKARYVAKGYTQTEGIDY FT NETFAPTGRLASFRIMVSVAAAKSWDIKQMDAIAAFLNSTLKEEIYLELPE FT GYDQERANGKVARLRKALYGLKQSARCWNEEVKDKFTQIGLKQNPHDGCLW FT YGRDRNGMETLIYLHVDDMAITGDKIKETKELLKLKWRMEDLGPAHCIVGI FT EIHSTTDGGYSLSQPAFIQTVLERFHYEDCKPASTPFPGGNKILKASDADV FT LEFNQSGLPYNSLVGSLMYIAQGTRPDVAYAVGVLSQHLSRPSKEAWNMGL FT 
HVLRYLKGTQSLGLIYSSSNSQIEGNQSWSFPECHADANWAGDPSTRCSTT FT GYLFKLNGAAISWKSRLQPTVALSSTEAEYKATTEAGQEVVWLRGLLSNIS FT LTQESPTILCSDSTGAVSLTQKAIFHARTKHIEVQYHWIREQVEKGAIKMR FT HVNSKNMFADTLTKPLHPGPFRELRDQVGLDVIHGHLKQGEC" XX SQ Sequence 5492 BP; 1708 A; 1179 C; 1118 G; 1411 T; 76 other; rrrkrvrcst vsnrvakkak vkmdagravk tvvsnnmnnn attvrwkhgr nrnkvngntv 60 vaaatavvms ahmvtsrans nmvyvurkah ydratmrmas kcamtggtag cgagagtcta 120 gttcagctct cagttatctc aatcgaaacg caaatcgcat gtcagataga gaagaccttg 180 actccctaaa tcaattgcca actcctgcca ctaccccagc ccctttcagc aatctcgacg 240 agagtgacgc cgagattcag agtgaaagtc attcagagac ttcgaattcc acagtccatt 300 ctgatcagac caccgttcct gtcatggctt ccgtgcctat tgatacggac actccggctc 360 agaaacaatc gatcaaggtg aattcaatac tacacaagat aactattaat accaaattat 420 ctgccacgaa ttacaacccg tggtctgatg atgttagatt tggattggct gccgcatcat 480 acgatcagta tctcattgtc aaggatgtcc caaccgttat caccgatcct gaagtcattc 540 tagctacgaa aaagtgtata tttcattggt tgttagcaag catggaatca gctcagtcta 600 ctcgctttat atcaatgatc tctacctttg aaaacggcat aaagaacacc ccttcatctc 660 catttctcct ttggaaaacc gtacgcgact atcacataag caattcggaa tcagtcaagt 720 tgatgctacg aagtgatatc acagatctgt ctcaaggttc aactaaggac ctactagaat 780 acatagatat gttccgcgct aagatcgatg cttatttagg agcgaatgga gagatgtcgg 840 aagaagagca agctcgtcag ttcgtgagat cacttaaccg tgattgggcc gaaaaggggt 900 gcgacttact agatgctgga catgtcaaat ttagaaattt agaaactgaa ttgaagaaga 960 cgtatcaaac gcgtaagatg ttcaattcta accgtcagca atcgagttgc actattgaag 1020 cctccgaagc tagccaaggt cgaaaaaatg gacgctggca gacctgcagt aaaaaccgtt 1080 gtctcggaag agaacatccc actaaacctc atgaacaatc cgaatgctat caccatccaa 1140 ataactcaag taagatggaa gcatggaaga aatccaagaa acaagccggt gaatggattg 1200 aatacccccg tggtggccga ggcagctcta actcaagagg cagaggtcgt agagatgctc 1260 atttctcctg aggctcatca aattttactc atggttcaga cttccccgat tcgagcgaac 1320 tcaaatcagt ttttgatggt ttacgtctag aagattgtga aatcagctat catgttagtg 1380 ttgacggtaa attctcttgc tcagcgaagc cacaagtggc atgtgttgga gaccaatgta 1440 actcggtcgc tctcatcgat acaggcgcat 
ctcatcatat gtttcatgat gcatctcttt 1500 ttgaagtcgg aaccatgatt gcaaatgaag atccaggtgc aaaacttaac ctggcaggcg 1560 gaggagccac cctagacata cactcaattg gaaacgttaa tctgctgaat tccaaaggag 1620 aagacatcaa actcaaggaa tgcttatatg tacctgattt gtctcgaaac ctcatagccg 1680 gcggaagatt aattagagcg ggtgcaatta ctaaagtact agaagaccca aattttcgaa 1740 ttgatcacgg caagaaggag cttttcatcg gaaaattcat cggagagggt agtatgatgt 1800 atgtggcgat acgatctttg gtcagtcatg aatttcatca atcatctata tcaaaaacca 1860 accaattcac cattttaaaa ctccattatt ccttaggtca cccaagcgag aagtacctaa 1920 agaggatgtg gagccttggt tattttaatg atgtactacc aaagactgtt actacaaaag 1980 aatttgaaat tatatcaaaa tgtcctgttt gccctctagc caaaaatcac cggttaccat 2040 tcccattcac tagaccaaga gctacgaact tcctcgaaaa tgtccacgtt gatctgagtg 2100 gtataataag aactactgca gtcaaccaag aagaatacta tattctgttc actgatgact 2160 tcagcagtta tcgtgtttct tttggtttgc caaacaaaag tgccgagacg gtattcgaat 2220 gtttcaagcg gtacattgcc tttgccgagc gtcaaaccgg gcagcctctg aaaatgtttt 2280 cactagacgg aggaggcaaa tttatcaacg gcctcctcag cccgtacctt gaggagctag 2340 ggatagttgc aagggtcacc tctcctcata ctccagagga aaatggagta gcggaacgct 2400 caaatcgtac tatcaatacg aaagctaggt gtatgctaat acagtcatgc gtaccaatta 2460 gatactggta ctatgctgtg tcttattcgg ttatgttgca gaacaggacc atcacgactt 2520 cattaaatct aaaagaaacc cctcagtcag tatggaagaa gcacaaatcg agcatgaaac 2580 gtttccaacc ctttgggtgc ctagcctacc gacacgtcag gaaagaagtt cgaggtggga 2640 aatttgaacc cgtctctcgt cccggtgttt tgctaggagc cacagaagat aatcacaact 2700 tcattgtatt agaccaagaa tcaaatcaga tccatattag tcacgatgtg accttccagc 2760 cactcatttt cccttttatg aaggatgccg ataaaggtcc agactggatt tttatcgagg 2820 atctcccgct tgactcagcg gaagagaaac agttagttat tgaacccaac tcaacaccaa 2880 acgtattaag atacgattca gatgaagaag atccagtgat acaagagagt ctccctcctc 2940 gtatcactga agttttgaac gaaacacccg agtctgaact tgaacctaca gaccagatca 3000 tcaaccaacc tcaaatcaac gaaattccag aaaaggaacc tgtaaccaag cagcccgttc 3060 aggaacaacc aagacgatca acgcgggaac gtcaaccagt caatcgatac actcccagta 3120 actcaacttc aagcaacttc gtcaatattc ctatgcgtag 
cattgtaccc aaaatatcac 3180 aacgatactc tgaatcacag cactcatatt actgccgaga ggccatgaga gctagagccg 3240 agcctaagag ttataagatg gcaatgaaag ctcctgacgc agacaaatgg aaggaagcct 3300 gtgacaagga aatgaagaac atcaaagaga tgggggtatg ggaaatagtg gaccgtccaa 3360 ccaacgctcc agtcgtgggt ggtcgatggc atttcaaata caagatcaat cctgacggct 3420 ctatctcaaa gcataaagct agatatgttg cgaaaggtta cacccaaacc gagggtattg 3480 attacaatga gacttttgct ccgaccggaa gactagcatc ttttcgcata atggtctccg 3540 tcgcagcggc taagagctgg gatatcaaac agatggacgc tatagctgca tttctcaaca 3600 gtactttgaa agaagaaatt tacctagaat tacctgaggg ttatgatcaa gaacgcgcaa 3660 atggcaaggt agctcgattg cgaaaagcat tgtatggact aaaacaatca gccaggtgtt 3720 ggaacgagga agtgaaggat aaattcacac agattggttt aaaacaaaat cctcatgatg 3780 ggtgcttatg gtatgggaga gacagaaatg gtatggaaac attaatctat ctacatgtag 3840 atgacatggc gatcactggt gacaagatta aggaaacaaa agaattacta aagttgaaat 3900 ggagaatgga agatctaggg ccggcgcatt gtattgtagg cattgaaatc cattccacaa 3960 ctgacggtgg atattctctg agtcaaccgg cattcattca aacagtgtta gaaagatttc 4020 attatgagga ctgcaaacct gcatcaactc catttccagg aggtaacaag atactgaaag 4080 caagcgatgc tgatgtacta gagttcaacc aatcaggcct cccatataat agcttagttg 4140 gaagtctgat gtacattgct caaggaaccc gacctgatgt agcatatgca gtgggtgtcc 4200 tatctcaaca cctgtcacgc ccatcaaaag aagcgtggaa tatgggatta catgttttac 4260 gctatttgaa aggcactcaa agcctaggat taatttactc atcaagcaac agtcagattg 4320 aaggtaatca aagctggtct ttcccggaat gccatgccga cgccaactgg gcaggggatc 4380 caagtactcg ttgctcaact actggatatc ttttcaaact caatggtgcc gccatcagct 4440 ggaaaagcag actgcaacca acagtggcac tgtcctcaac ggaagctgaa tacaaggcaa 4500 ccactgaagc agggcaagag gtagtatggt tgagagggtt actgtcgaac atctcactaa 4560 cgcaagaatc acctactatt ttatgcagtg acagcaccgg agcagtatcc ttgactcaaa 4620 aggctatttt tcatgcacgt acaaaacaca tagaagtgca atatcattgg attagagagc 4680 aagtagaaaa aggagccatc aagatgaggc atgtcaatag taagaacatg tttgctgata 4740 cactaaccaa acccttacac cctggaccat ttcgagaatt aagagaccaa gtggggctag 4800 atgtgattca cggacatctg aaacaggggg agtgttgaga gttacattca 
cgtataactt 4860 gggtttcaga cgccgtgaaa gttatgtatt aagttcaatt ctcaaggatc aaaagttcaa 4920 attaaattaa acttaatctc taaatacaaa ccataatcat catcatttaa actatgacgc 4980 cactccgaca cgcgtcttac tcttttcctc atccgtctat cacggtatag gaaaagagta 5040 agtacccttc ccagttcaat tcatccattc atattttcaa ttcattcatg ctacagaaga 5100 cactaattga tttcattctt gttattctat tcttgtcttt catctgatac tatgtcgaca 5160 ctagaatcaa atactgagac ctgcttgtgt caatttcctt ccttgaatat catatcctga 5220 cttaaacctt taggttagac tatatttctc aactctctat atcaatttct tttcttctaa 5280 ttctgtttgt ctgttattgt tatgttttct ttttatcttg actctttagg ttagtctgtc 5340 aggtagttta aagacgtgag gtgtcaggag actcacctct aggattcgtc tattgggtat 5400 caggtgaaat ttatccgtct atcacggtat aggaaaagaa atcaaatact gagacctgct 5460 tgtgtcgatt tccttccttg aatatcatat cc 5492 // ID Mariner-5_AN repbase; DNA; FNG; 1315 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 09-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE DNA transposon. Mariner superfamily. Tc1 clade - a consensus DE sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW mariner superfamily; Mariner-5_AN; Tc1 clade. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-1315 RA Kapitonov V.V. and Jurka J.; RT "Mariner-5_AN, a family of nonautonomous DNA transposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(12), 213-213 (2003). XX DR [1] (Consensus) XX CC DNA transposon. Mariner superfamily. Tc1 clade. CC TA target site duplication. 40-bp TIRs. CC The 350-aa Mariner-5_ANp transposase is encoded by a single CC ORF (pos. 159-1208). CC The transposase is closest to the Impala transposase found CC in Fusarium oxysporum (GenBank AAB33090, 49% identity). 
XX FH Key Location/Qualifiers FT CDS 157..1206 FT /product="Mariner-5_ANp" FT /translation="MGRRGKELTPDIRARLCELKAIGWTFAMIHQRYPHIP FT YSTIRNTIKRENDRKDQKSSPQSGRPKKISLEEQRHLLALIEQDHHIKMRK FT LSEAVQSSPSVRTVQRLFHTLHIQKWKQCERPEITPENAEKQLRWAETYAG FT YTAEDFNRVIWSDECLVERGAGIRRIYTFRSPRQQIIKRDVHAIRCGKGVK FT QMFWAAFGFNRRTGLLPLDGDPDAARGGVTAWVIRGVYKSFLPDILGPGDI FT FMHDGASVHRAYIVQQLLEDMGVEVMVWPPYSPDLNPIKNLWALMKAKIYE FT LHPELERAPDTEETLQLLIKAAIEAWHAIDERVLQNLCHTMPHRVQAILRA FT DGWYTSY" XX SQ Sequence 1315 BP; 364 A; 284 C; 330 G; 337 T; 0 other; cagtggtatg acaaaaatat tggcgagttc agacgcccca cagtaagcgc tagctatgaa 60 ccgcttccta attaggccta gctgaagata ttctgccgac caaaacaacg cgtagatggt 120 ctggcctggt gttgaagcta tagagcatct tacaatatgg gccggcgagg caaggaattg 180 acgccagata taagggccag gctctgtgag ctcaaagcca ttggttggac atttgcaatg 240 attcatcagc ggtatcctca tattccctat tcaactattc gcaacacaat taagcgcgag 300 aatgatcgaa aagaccagaa gtcttcccct caatctggac ggcctaaaaa gatctctctt 360 gaggagcagc gacatctact ggccctgatt gagcaagatc atcatataaa gatgcgtaag 420 ctctctgagg ctgtacaaag cagcccttct gtacgtacgg ttcagcgcct tttccatacg 480 cttcatatac agaaatggaa gcaatgtgag cggccggaga ttacacctga aaatgcggaa 540 aagcagctaa gatgggcaga aacatatgca ggatacactg cagaggattt caaccgagtt 600 atctggtcag atgagtgctt agtcgagcgc ggagctggta ttcgaaggat ctatacattt 660 cgatcaccta ggcagcaaat tattaagcga gatgtccatg ctatacgctg tggaaaaggc 720 gtgaagcaga tgttctgggc tgcctttggc ttcaatcgac ggactggcct cctaccactg 780 gacggggatc cagatgctgc tcgtggagga gttactgcat gggttattcg tggtgtttat 840 aaatcattct taccagatat cctgggccct ggggatattt ttatgcatga tggggcttct 900 gtacacagag cctatattgt acagcagctt ttggaagata tgggcgtgga ggtcatggta 960 tggccgccat attctccgga tttgaatcct attaagaatc tatgggcgct tatgaaagca 1020 aagatttatg agcttcatcc agagctagag agggctccag atacagaaga aaccctacag 1080 ctgcttatca aggcagctat agaggcctgg catgcgattg atgagagagt gcttcaaaat 1140 ctatgccata caatgcctca tcgcgtacag gcaatccttc gagctgatgg ttggtataca 1200 agttactgag caatgattag tttcatacta gcctaatcag gaaggcggaa 
tctgttttca 1260 tactagcgtt aattatgggg cgtctgaact cgccaatatt tttgtcatac cactg 1315 // ID Copia-1_LBS-LTR repbase; DNA; FNG; 170 BP. XX AC ABFE01000677; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_LBS_; KW Copia-1_LBS-I; Copia-1_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-170 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000677; Positions 85413 85582. XX SQ Sequence 170 BP; 40 A; 45 C; 32 G; 53 T; 0 other; tgttgaacca ttgagcgctg cagcgtgatt gtgatcgaac cttacataac ccactactat 60 aagtactgta caggggcata ctactttcct cagtcgttct atcatctggc tgaggtgagt 120 attgtgccct gccgttctta caccattagt acttacccac atatctctca 170 // ID Gypsy-1-I_PPl repbase; DNA; FNG; 6502 BP. XX AC . XX DT 09-APR-2009 (Rel. 14.04, Created) DT 09-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE An internal portion of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-1-I_PPl. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-6502 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from fungi Postia placenta."; RL Repbase Reports 9(4), 923-923 (2009). 
XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 66..2372 FT /product="Gypsy-1-I_PPl_1p" FT /translation="MSGTGQSPPQHGRNPSRQVIYSPMPRPHSKHAPRFKG FT TSLDDFLEDFEAIARNAGVLNTEWPRTLPRYCVQEVRDVIEHLPEFQGSDW FT AIAKAALTELYQSNDKRRRVTADKLREFVADSAKTQSFRSRKDLDVYTRQF FT LAMSGELKRRNLITDNEINLRFYKGLPREVKLRIRQDLKSTTASSAPAMAD FT VLMLVRKLYSEDDIDAESDHEDYLDTDDEDDSEVSTDEERDDEPAKKKRKD FT KEKSSNPRVKATPDSVVPGPSGKSQENVLDALTKQMEELRIQVAEQARQAQ FT TRRICWICDTDSHRVGYRNCPEADKLVEEGLLRYSQGKLVRIDGSDLPLVR FT PGTGGVANALRTERRAARGKERDPPPHQSMNVGVISDTGEDVLTGEVYAIA FT VPFSDDEDAYSYPAARTPKKDVRFDPLAKKDDGKATGIKLKISGKGQPPSI FT TVQHPERPPAQSTPPTVEKAPATKAPVAKRKPPAASVADDSDVGMQSSSSK FT APKAKTPSYRFTSDVQESVNIEQVQKTILDTPITLPLRELLASSADLQKRF FT ANLTKTRREYAAKYGEYMEADSDQIEGARNAYLSLRWGEDAETIIGRYANS FT VSVRPTRLFAMTTGRFEAKIADTNLSAMIDTGSELNLISATAFEHLNIPLD FT IDGARWSLRGISGEPVSLQGCARDVPIEIGGHRFGHHFFVTQVGGSIGKQD FT VILGQPWLQWYAAMIRYSRAGPMTLTIYPKGDEGDGPAVSLKLVAPHHDRN FT VDRLVQSHERHDVQDFY*" FT CDS 2312..6403 FT /product="Gypsy-1-I_PPl_2p" FT /translation="QKCRSPRTVSRAARRAGFLLTGPARPDANSHAVPQAS FT IGRTFYTLCNAYNDPDNIADLISSLEPPEQYDSFLRQFYYTAPTNTTPRES FT PWVDRALGPNPTSAPVKRYKPVDRKVRPVPSYMPNPSAQTFKPIPSPELEP FT LPLEPPSLANFEPSERLTRERLQIILETVPSGFLNAREIDLLVFVLKERQF FT ALAFTDAERGTFSPRYFPDYEIPTIEHVPWQLPPNRVPRAIEDDVYTLLTE FT QRDAGKYEPSSASYRSRIIIVEKKTGKYRICHDLQPLNGVTIRDSALPPNV FT NDFAESFVGYAIYMIADLFSGYDNRRLAEISRDLTTFDCLIGAHRLTTLPQ FT GFCNAVQEFQRCITHVIRDEVPHHAGAFVDDVGIKGPCSTYNNMPVVPGSP FT IRRFVYEFLTTVNRILCRFELAGITASGYKLVAATPELHIVGSIASLLGWH FT LAHGVVNKVLKWMSCTNVSEVRSFLGIAGVARRWIQGFSIIVRPLTRLTRK FT SVDFAWTNDAQEAMDLIKQRVTTAPVLKPIDYQLARAISPHDGRPSDHGIV FT IVAVDSSKYGAGWVLYQMRETDKHPALFGSCTFSATESRYSQPKLELYGVF FT RALKELRHRIWGLYFRLDVDAKFLKQMIHSADLPNAPMTRWVSYIQLFDFY FT MQHVPAMAHQASDALSRRPQAPDDTDESNAEAYLDKVFAGMELRNALTLFG FT HRLRTTRSREFFLDTTSLSTASNLSTLGVSSAGEYALTTLTDSSLLGEDPW FT TSASVSLQDLDGQFFIGNDFMQRNVPQEHSEVYQLGDEEVECLVTEYRYAY FT WIAPQRVGESLGSPYGNAPSQRVSETRVFYPDISSERHVSAGHRFGRPDEE FT NEGQWAEITAYLQDSTFPSHLETQAARRKFISMCARFTLSDGRLWYVKKGK FT 
MPRLVITDQLRRKTLLEEAHNQCGHRGRDATYLRLSERYYWPNMFVNVSDF FT VRSCNACQFRSKSRPIMSYNPSWTVAVLRKFHIDTVYMPKGRRGMRYLLQA FT MEPAIGWPEARAAKRNTSKAWAKFIYTDIICRFSCIPVFVCDNGPEFKGAV FT KYLFDKYNIICVLVAPYHPEANGVAERAHPTLVNSILKASGTNASDWPLYL FT HGALLAMRTTTSRMTGYTPYYLLYGQNPVFGFDVADRTWSALDWYAVKDTA FT DLLALRLKQITRREVDIGKAMDRVNENRKKAVEAHERRYAGKIQKEMWPVG FT TLVLVHQTWLDNQHGHKGALRWAGPYVVRRVVDRFYELTELDGTVMKGAFA FT ADRVKKFFYRYEKQSMMDLPRHRERLAPADADRSSRAVEAVGALENENTRI FT FGSLREDLPCWRIGDIAREESVLVDNNELITIMASNLDDLYNMAPGRHPWW FT *" XX SQ Sequence 6502 BP; 1636 A; 1743 C; 1702 G; 1421 T; 0 other; actggtgacc accgcagggg gtgtgcaatt gtggtgttac atactaacgt ctccatactg 60 tcatcatgtc gggaacaggt cagtcgccgc ctcaacacgg gcgcaaccct tcgaggcagg 120 tgatctactc accaatgcct cggccgcatt cgaagcatgc tccgaggttt aaaggtacgt 180 cgttagatga ttttcttgaa gatttcgaag caattgctcg caacgctggg gtgttgaata 240 cggagtggcc ccgaacgttg ccgcgttact gcgtacaaga ggttcgggat gtcatagaac 300 atctgccaga gtttcagggc agtgactggg ccatcgctaa ggctgctctc acggagctat 360 accagtcgaa tgacaagcgg cgacgggtga ctgcagataa gcttcgcgag tttgtagctg 420 acagtgcaaa gacacaaagc tttcgctcta ggaaggacct cgatgtgtat actcgacagt 480 ttctggccat gtcaggcgag ctcaagcgga ggaacctcat cacggacaat gagatcaacc 540 tgcgcttcta taaaggactt ccacgggagg tcaagctgcg aattcgacaa gacctgaagt 600 ccacgacggc atccagcgcc ccagcaatgg cggacgtcct tatgctcgtt cgcaagctct 660 actccgaaga tgacattgat gcggagtcgg accatgaaga ttatctggac acagacgatg 720 aggacgacag cgaggtcagc acagatgaag aacgcgatga cgaacctgcc aagaagaagc 780 gcaaagacaa ggagaagagt tccaatccgc gagtaaaggc aacaccggac tctgtggtac 840 ctggtccatc cggcaaatca caggagaatg ttctcgacgc tctgacaaag cagatggaag 900 agttgcgaat acaggtggca gaacaggcac ggcaggcgca aacaaggcgg atctgctgga 960 tttgtgacac tgactctcat agggttggct ataggaactg cccggaagct gacaagcttg 1020 tcgaggaagg cctcctccgc tactcgcagg gcaagttggt gcgaatagac ggctcggact 1080 tacccttggt acgaccagga acaggaggtg tggctaacgc tttacggaca gagcgacgag 1140 cagcacgggg caaagagcgc gatccgcccc cgcatcaaag catgaacgtc ggcgtcatct 1200 cagatacagg agaagacgtc ctaacgggtg 
aagtatatgc aatagcagtc cccttctcgg 1260 atgacgagga cgcttactct taccctgcgg cacgtactcc gaagaaggac gtccgtttcg 1320 atcctctggc aaagaaagat gatggcaaag caacaggaat caagctcaag atctccggga 1380 aggggcagcc gccttccatt acagtccaac acccggaacg acctccagct caatctacac 1440 cgccgacagt agagaaagca ccggccacga aagcacctgt agcaaagcgc aaaccaccgg 1500 cagcctctgt cgctgacgac agtgatgtcg ggatgcagag cagctcctct aaagcaccta 1560 aggctaagac tccttcttat cggtttactt ctgatgtgca ggaatcggtg aatattgaac 1620 aggtacagaa gaccatactc gacacaccaa tcactttgcc tctccgagaa ctgctggcat 1680 cttcggcaga cttgcagaaa cggttcgcga acctgaccaa aacacggcgg gagtatgcgg 1740 ctaagtatgg tgagtatatg gaagctgaca gtgatcagat cgaaggcgcg cgcaatgcgt 1800 acctctcgtt aagatggggc gaagatgcgg agacaatcat aggtaggtat gcgaactcgg 1860 tatccgtacg tccaacgcgg ctgttcgcaa tgacaacggg acgctttgag gccaaaatag 1920 cggacactaa cctttcagcc atgatcgaca ctggatccga gctcaacctc atatcagcaa 1980 cggcgttcga gcatctgaac ataccccttg acatcgatgg ggctcgctgg tcacttcgcg 2040 ggatcagtgg cgaaccggtc tccttgcaag gatgcgcgcg cgatgtaccc atcgaaattg 2100 gaggacacag gttcggccac cacttcttcg tcacccaggt cggagggagt ataggaaagc 2160 aagatgtcat tcttgggcag ccgtggctac agtggtatgc tgcaatgata aggtatagtc 2220 gtgcagggcc tatgactcta accatctatc cgaaaggaga tgagggtgac gggccggctg 2280 tctctcttaa actggttgcg cctcatcatg acagaaatgt cgatcgcctc gtacagtctc 2340 acgagcggca cgacgtgcag gatttttact aacgggacca gctagacccg atgcgaattc 2400 gcacgctgtg ccccaagcgt ccatcgggcg tactttctat accctgtgca atgcgtacaa 2460 tgatcctgat aatattgctg accttatttc ttctctcgag ccaccagaac aatacgactc 2520 attccttcga caattctatt atacagctcc cactaataca acaccccgtg aatccccgtg 2580 ggtcgatcga gctcttggac ccaatccgac atccgctccc gtaaaacgtt acaagcccgt 2640 tgatcgcaaa gtccgaccag taccctcata catgccgaac ccctctgcgc aaaccttcaa 2700 gcctattcca tcaccagaac tcgagccgct tcccttggaa cccccatcat tagccaattt 2760 tgagccttct gaacgcctca cgcgagagcg tcttcaaata atactggaga ctgtccctag 2820 cgggtttctc aatgcacgtg agatagacct tcttgtcttc gtactaaagg agcgccaatt 2880 cgcactcgca ttcactgacg ccgaacgcgg caccttttct 
cctagatact ttccggacta 2940 cgaaattcct accatcgaac atgttccttg gcagctgccg cccaaccgag ttccgcgagc 3000 aattgaagat gatgtataca cgctcctgac tgagcagcgt gatgcgggca agtacgaacc 3060 ctcatctgcc tcatataggt ctcgcatcat tattgttgaa aagaagacgg gcaagtacag 3120 aatatgtcac gacctgcagc ccctcaatgg agtcacgatt cgcgactcgg cactcccacc 3180 taatgtcaat gatttcgcag aaagctttgt cgggtatgcg atctacatga tagctgacct 3240 tttctcaggc tacgacaaca ggaggttagc cgagatttca agggatctga ccacattcga 3300 ctgccttatt ggggcacacc ggctgacgac acttccgcaa ggcttctgca acgcagtaca 3360 ggagtttcag cgctgcataa cccatgtcat tcgtgatgaa gttcctcacc acgctggagc 3420 attcgtggat gatgtcggca tcaagggtcc ctgtagcaca tacaacaaca tgcctgttgt 3480 tccgggatca cccattcgtc ggtttgtgta cgaatttctc accacggtta accgcattct 3540 ttgcaggttt gagctcgcgg gaattactgc gtcaggttac aaactggtgg cagcaacccc 3600 ggaattgcac atagtaggtt ctatagcatc gctgctcggg tggcatctgg cacatggggt 3660 tgtaaacaaa gtcttgaagt ggatgtcctg tactaatgtt tcagaagtgc gcagctttct 3720 aggaattgct ggggttgctc gacgatggat acaagggttc tcgattatcg tgcgaccact 3780 caccagactc acgcgaaagt ccgttgactt cgcatggaca aatgacgcac aagaagcgat 3840 ggatctaatc aagcaacggg tgacgacagc accggtattg aagcctatcg actatcaact 3900 cgcgcgagca atctcgcctc atgatggacg acctagcgat cacggaattg tcattgtagc 3960 agtcgactcg tcaaaatacg gtgccgggtg ggtcttgtat caaatgcgtg agactgacaa 4020 gcatcctgca ttgttcggat catgcacctt ctcagcgaca gagtccaggt attcgcaacc 4080 caagttagag ctatacggag tgttccgcgc tcttaaggag cttcggcatc gcatctgggg 4140 cctctacttc cgtttggatg tagatgccaa gttcctgaaa cagatgatcc attcagctga 4200 ccttccgaat gcgcccatga cgcgctgggt atcctacata cagttgttcg acttctacat 4260 gcagcatgta cccgcgatgg ctcatcaggc atctgatgca ctgtcaagac gtccccaagc 4320 tccggacgac acggacgaat caaatgctga ggcatatctt gataaggtat ttgcgggtat 4380 ggaacttcga aatgcactta ctctctttgg tcaccgtctc cggacgacga ggagccgaga 4440 attctttctt gataccacat cgttatcaac agccagcaac ctctcgacac tcggggtctc 4500 atccgctggt gaatacgcgc tcactaccct caccgacagc tcgcttctgg gagaagatcc 4560 atggacgtca gcaagtgtat ctcttcagga cttggatggg cagttcttca 
tcggcaacga 4620 cttcatgcaa cggaatgtgc cccaagaaca ttcagaggtc tatcagctag gagacgagga 4680 ggtggaatgc ctagttacag agtatcgcta cgcatattgg attgcaccgc aaagggtcgg 4740 tgagtcgttg ggttctcctt atgggaatgc cccctcgcaa agggtgtccg agacgcgggt 4800 gttctaccct gacatttcca gcgaaagaca tgtgagtgcg ggccatcgct tcggcaggcc 4860 tgacgaagag aacgagggac aatgggccga aatcacggcc tacctacaag acagcacatt 4920 tccgagtcac ctcgaaacac aagcagcccg tcgtaagttc atatcgatgt gcgctcgctt 4980 cacgctgtcc gatggacggc tctggtatgt caagaaaggg aagatgcccc gtctcgttat 5040 tactgaccaa ctcaggcgca aaacactgct cgaagaagcg cacaatcagt gcgggcaccg 5100 gggacgcgac gcgacgtacc taaggctctc ggaacgatat tactggccaa atatgttcgt 5160 gaacgtctcc gactttgttc gctcctgcaa cgcgtgtcaa ttccggtcga aatcaaggcc 5220 gatcatgtcc tacaacccgt cgtggactgt cgctgtactc cgcaaattcc acatcgatac 5280 agtatacatg cccaaaggac gacgaggcat gcgctacctt ctgcaagcaa tggagcccgc 5340 aattggatgg cctgaagctc gcgctgccaa gcgcaatact tctaaggcat gggcaaagtt 5400 catctatact gatattatct gcaggttctc atgcataccg gtatttgtct gcgacaatgg 5460 acctgaattc aagggagcgg taaagtatct attcgacaag tataacatca tatgtgttct 5520 tgtagcccca tatcacccgg aagccaatgg cgtggccgaa cgtgcacatc ccacgctcgt 5580 caactcgatc ctcaaagcga gcgggacaaa cgcgagcgac tggccactat acctccacgg 5640 tgctctgctc gcaatgcgca caacgacgtc gcgaatgaca gggtataccc cttactacct 5700 tctgtatggg cagaaccccg tctttggctt tgacgttgcg gaccgaacgt ggtcggcgct 5760 cgattggtat gccgtcaagg atactgctga cctcttggct cttcgcctga agcagatcac 5820 gcggcgcgaa gtcgacatcg ggaaagcaat ggatcgggtc aacgagaatc gcaagaaggc 5880 tgtcgaagcg cacgaaagga ggtacgcagg taagatacag aaggagatgt ggccggtagg 5940 aacgctcgtc ctggttcacc agacgtggct agacaatcag cacggacaca agggtgcgct 6000 gagatgggcg ggcccgtacg tagtacgacg ggttgttgac cgcttctacg agttgacgga 6060 gctcgacggg accgtcatga agggagcctt cgctgctgac agagtcaaga agtttttcta 6120 caggtacgaa aagcaatcaa tgatggacct cccaaggcat cgtgaacgtc ttgcaccggc 6180 cgatgcggat cgttcctcac gagcagtcga ggctgtgggc gcgctcgaga acgagaacac 6240 gcgcatattc ggatccctac gagaagactt gccctgttgg cgaattggcg acattgcaag 
6300 agaagaatct gtccttgtcg acaacaacga gctcataacc atcatggctt ccaatttgga 6360 tgacttgtac aatatggctc ctggccggca cccctggtgg tagggtggtg gtataatttc 6420 tctcacttct cttcttccct tctgaataat agctgatgta ttggttggcg atgctgagga 6480 cagcattgaa ctgtcgccca tc 6502 // ID Gypsy-44_MLP-LTR repbase; DNA; FNG; 186 BP. XX AC AECX01001168; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-44_MLP_; KW Gypsy-44_MLP-I; Gypsy-44_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-186 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001168; Positions 79941 79756. XX SQ Sequence 186 BP; 56 A; 50 C; 29 G; 51 T; 0 other; tgttatgact cgtattacac gtatagatga cactcactca gcacatgtca cagatcatga 60 tgtactttag attacgcact acagacctga ttgtattcct catcggacaa tctcgttatc 120 aatcataaga acctcacgag gctaactcaa tccttgtcga cgaacccctg agacccagtc 180 ttaaca 186 // ID Copia-6_LBS-I repbase; DNA; FNG; 8410 BP. XX AC ABFE01001950; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_LBS_; KW Copia-6_LBS-LTR; Copia-6_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-8410 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001950; Positions 31644 40053. 
XX CC Positions [1505-2032] - Integrase core CC 'GGTCT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 5..2623 FT /product="Copia-6_LBS-I_1p" FT /translation="MGPGSGSVEMSTSPTKAFKFAKLNGENYALWFQHMQS FT SLQAHYLWLIVTGDEPCPTKHSDTQPTEPTALTAWKATKKEWLDWSLRDQA FT AQGLMKGAAEPSQWPHIAQMKTSKEMWDAWKKMYVTNQQHINVHYYFEDLY FT TRKYDESTSMADHIAAMLDLGLKIKAAGEEIPDIHIARALVLSLPRTQTWE FT LVKVQLFNKDKLTSEIVSTELQATANRTAHEKKSETAFLAGKKQASGKGDS FT KKKGGRGPKADDECRYCHKKGHWINKCPQHEEDEKKNNGGSVNLAVSNLRD FT LGAREIGWVFMAGVNVKESGISTELILDCGATSHMFCDRHFFTSYTPMSTP FT ESISVGDRRDIPVAGRGTICFQARLPDGYRSISLHSALHVPKLAANLISLG FT TLQRQGVGFSSYQNGIVIKLGEEELFRASFIDKSDTLYHVEVAQMQTESAY FT VATSGSLRLWHRRLGHISLDTIRRMHRKNMVEGLSINSLNQYDHLCEGCAL FT GKSHRLQFPKASTTKYELMDLIVLDLTGPMSVQTWGGASYALVIVEVSCRK FT PVGRLLHSKKEAYAALREVVAMLERQSGKKLKRMRADSGTEFVNELVNTFC FT QKNGIVLETTVPYTPEQNGIAERAIAVFFKMVRCMLHASKMDLRYWGEAFL FT YAVYIRSLTYTSALDDIVPAHAWSGIKPDVSHLRIFGSVAYANIPKKVRGG FT KLEVTSIKCRLLGWWADETKGYRLEDVETGKIITARDVRFAEDDSPGDLAV FT IETRGVAPTEAEINRLVPDDVFGKTAGSVSPISQSKSSPQPAAPVEPPVSS FT VDDASPSLEADVSVPKKTSKWDNLPPRDHPRRERKPAAPPGDEATEEEFLL FT ASNHTLDNRAFVTYPNEPQT" FT CDS 4628..5929 FT /product="Copia-6_LBS-I_2p" FT /translation="MSSSNETFSATPNSGPTPNTGAAPHTTSSGEQSSNQD FT EEHINSLASSTEGQNPVQQSGNVTGEGDPGMPGNFPVANAVLASCNDILKD FT FYSSRVSKGVALARIYTILLDGIPETEETGAHVEDAFERYLVIIENHQNHL FT NEAENRGRRQRSQSPANKGIEIDDEEIPPPKRAKPDDTQYPWVVSDFIHGA FT TLSPSLTTTLELLKLYAIDPKGTKRSLVNSPSCPEFPDSEWTNVLLGRAVN FT LDAVLSGYYSTSNNDERIEEIGELEIRFGSVSPAKLVSSAGEWSIAWNRTS FT RAICTAFPHRAGELAEYAEYIIGVFAATDVHFHDRVISFDKAVRRRVGSRR FT DLELTNFNRFADLRSTHMDSIGAAVIQRASIAVANKSGSSSGRGKKLPEPC FT NRWNEGLCTLENNQCRRLHVCNKCLLSGHTSKQDKCPPSQ" FT CDS 4641..6011 FT /product="Copia-6_LBS-I_5p" FT /translation="MKLFLLLLTLGLLLILGQHLTRPLVVNRVQIKMKSTS FT TPWRVQPKVKIPFSKAEMSQERVTLVCLGISRLPMPFSHPAMTSSRTFIHL FT EYQKGLRSRESTQSYSTASPKLKKLGHTLKMPSNVTLSSLRTIRTTLTKQK FT TAGAVSGHRAQLTKVLKLMMKRFRLLNVRNPMTRNTHGWYQISYTAPLYHR FT 
RLPPPSSYLNSMQLIPRAPSDLLSILPVARNFLTQNGPTSFWVVPSIWMLF FT SADTIQHQTTMNGLKKLENWKSASAVSPQQNSCQVLENGVLRGIVHLEPYA FT LPSHIGQVNSLNTLSTSLESLLRLMSTFMTALSHSTKLFDVVWVHVETWNS FT PTSTVLLISGQRTWIQLEPLLFNVRASPSLINLAVPQVVVRSFRSHATAGT FT KGYARLRIISVAGSTFATNVCSLDILVNKTSVPHHSNLQLKTAHHDVHDHI FT QPPRMTLGAYKK" XX SQ Sequence 8410 BP; 2119 A; 2168 C; 1980 G; 2143 T; 0 other; ggttatgggc cccggcagcg gatcagtcga aatgtcgact tcaccaacca aagccttcaa 60 gttcgccaaa ctcaacggcg aaaactatgc gctgtggttc caacacatgc agtcttcgct 120 ccaggcacat tatctctggc tgatcgtcac cggagatgaa ccctgcccca cgaagcactc 180 agacacacag cctacggaac cgacggcgct cacagcttgg aaggcgacga agaaggagtg 240 gctcgattgg tcgctgaggg atcaggcagc tcaggggttg atgaagggag cggcggagcc 300 gtcgcaatgg ccccatattg cgcagatgaa aacttcgaag gagatgtggg acgcatggaa 360 gaagatgtac gtcaccaacc agcaacacat caacgtacac tactacttcg aagacctcta 420 tacgcgcaaa tacgacgagt cgacgtcgat ggccgaccac attgcggcta tgctcgatct 480 tggcctcaag atcaaggcgg cgggtgagga aattcctgat atccacatcg cgcgcgccct 540 cgtcctctcc ctccctcgaa ctcaaacatg ggaactcgtc aaagttcagc tcttcaacaa 600 ggacaagctc acctcagaaa ttgtttcgac ggagcttcaa gctacggcta atcggactgc 660 tcatgagaaa aagagcgaga ccgcattcct tgctggaaag aagcaggctt ctgggaaagg 720 tgactccaaa aagaagggag gccgaggccc gaaggccgac gatgagtgtc gctactgtca 780 caagaaaggg cattggatca acaagtgccc tcagcatgag gaggacgaga agaagaacaa 840 cgggggttcg gtcaacctcg ctgtttccaa cctgcgggat cttggagctc gtgaaatcgg 900 ttgggtgttt atggctgggg tcaatgtgaa ggagtctggt atcagcactg agttgattct 960 cgattgcggc gccacctccc acatgttttg tgaccgccac ttctttacct catacacacc 1020 gatgtctacc cccgagtcca tatccgttgg tgatcgccgt gacattcctg ttgcaggccg 1080 cggtaccatc tgcttccagg ctcggttgcc agatgggtat cgttctatat cccttcacag 1140 tgctctccac gtccccaagc tcgccgccaa cctcatcagt cttgggactt tacaacggca 1200 gggtgtgggt tttagcagct atcagaatgg gatcgtgatc aagttgggag aggaagagct 1260 tttccgtgca tcgttcattg ataagtcaga cactctgtac cacgtcgagg ttgcccaaat 1320 gcaaactgag tctgcctatg ttgccaccag tgggagtctt cgtctctggc accgtcgcct 1380 tggccatatc agtctggaca 
caattcggag aatgcaccgg aagaacatgg tggaaggctt 1440 gtccatcaac tcgctcaacc aatatgatca tctttgcgag gggtgtgccc tcggcaaatc 1500 acatcgactt caattcccaa aagcaagcac caccaaatat gagctgatgg acctcatcgt 1560 tctcgacctc actgggccca tgtcggttca aacttgggga ggggcctctt atgcccttgt 1620 cattgtcgaa gtcagctgtc gaaagccagt tggtcgtctt ctgcattcca agaaggaggc 1680 ctatgcagcg ctacgggaag ttgttgccat gttagagaga cagtcaggga agaagctgaa 1740 aaggatgaga gcggacagcg ggacggagtt tgtgaacgaa ctcgtgaata cgttctgtca 1800 gaagaatggg attgtcttgg agacgacagt cccttacact cctgagcaga acgggattgc 1860 agagcgcgct atcgcagtgt tcttcaagat ggtcaggtgc atgctccatg ctagtaagat 1920 ggacctgcgc tattggggag aagccttcct ctatgctgtc tacatccgaa gtttaaccta 1980 cacatccgcc ctcgacgata tcgtccctgc tcatgcctgg tcaggcatca aacctgatgt 2040 ttcccacctt cggatatttg gatctgttgc ctatgccaat atcccgaaga aggttcgtgg 2100 tggaaagcta gaggtcactt ctatcaaatg tcgccttctt ggatggtggg cagatgagac 2160 gaaggggtac cgtctggaag acgttgagac tgggaagatc atcactgcgc gggatgtccg 2220 ctttgctgaa gatgatagtc caggggacct tgcagttatt gaaactcgag gtgttgctcc 2280 gacagaggct gagatcaaca gattggttcc ggatgatgta tttgggaaaa cggctggttc 2340 ggtttctccc atttctcagt ccaaatcaag tcctcaacct gcagctcctg tggaaccccc 2400 tgtatcttca gtggatgatg ctagcccctc tctggaagct gatgtttcag ttccaaagaa 2460 gacgtcaaaa tgggacaacc taccaccacg agaccaccct aggcgcgaac gaaaacctgc 2520 agctccccca ggagatgaag ctactgagga agagttccta cttgcatcca atcacaccct 2580 cgacaatcgc gctttcgtga cctatccgaa cgagcctcaa acatagtgtt tgaaagtccg 2640 gttaggttag gttattgggt gcccaggggc tctaaccgag accgagaccg gttagggttc 2700 gttcccagac ccaaaataac ctaactggac cgaagataac cgacgaaaac cggttagttg 2760 cagttactaa ccggttagga ccgaaaaaat agaaaatcga ttaaaacggc gtatatttct 2820 catattcctt gcagtaaaca ttattgccat gcgtcgtagc tttaaaacaa gcccaagaac 2880 gctaaatccg gtgaagaatt gacaatttta taataatatt cttgaaagcc tactaattta 2940 gccaatattt tagagatttg accataactc tgtcagattt ttagcgtttc tcatgctttt 3000 agatgcgttt tgaaggggta taatagggtc ggttgtaacc ggtcataacc ggttttttaa 3060 cggttagcta actgcaaaat aaccggttat 
ttcatttcca gaattgacta accactaacc 3120 acggtccggt ttttttgcgg ttaggtccgg tctggtctcg gttattttcc agttaattaa 3180 cctaacttcc aagcactact caaacataca aacaggctag tatgtggtca ccctactcgc 3240 agcagtggga gaaggcgatt gcgacggaat acgatacact aaggaatacc gacacttttg 3300 aatgggttcc tagtctccct catggacgaa aagcggttgg cagccgtttt gttttcaggg 3360 agaaacacga tggcaatggg gatctggcca agcacaaagc tcgcatcgtt gcaaaaggct 3420 actcgcaggt gcccggagag gacttttcag aaacgttctc gtctgtggca aagttcacca 3480 ctctacgcat gctccttgct ctcgtcgcac actcggatct cgagcttcat caagttgacg 3540 ttgttggcgc ttatcttcga gggttgttgg atgaggagat ttacatggag gtgcctgagg 3600 gggtgaagga ggaaggaaag gaggggtggt actggaagtt gaagaaggca ttgtatggtt 3660 tgaagcaggc aggaaggcag tggaaggcga agctcgacgg ggtgatgagg aatctcgggt 3720 ttgaaaaggg tcaagccgat gattgcctat atatcctccg agagaaggat gagattgtct 3780 tgttggtgct ggtttacgtt gacgatatgg ccgttgctgg acgttcgctc gctcgcatca 3840 cccagttcaa gactgacctc acaaaggtgt tcgacatcac ggaccttggt gagctgaagt 3900 acatattggg cattcaagtg aagagggaca gaaaggcacg caccatctca ttgaatcaga 3960 ccgcttacat ccatcacgtc ctcgcacgct ttggcatgca ggattgcact cccgtgtcaa 4020 cgccgctcgc tgtcaaacac aatctgtcaa tttcgcagtc acccaaaacg gaggaggagc 4080 gcaccgaata caccaagtac gcgaatggga ttcactacct ggaggtcgtt ggctcgttac 4140 tctatgccac acagacgcgg ccagacattc aatttccggt cggtctcatc tcgcagttcg 4200 ggggaaaccc agggaaacct cacctcaaag ctgcaaaacg aattatgcag tacctacagt 4260 gccttgcata atgctttgca gcgtgaattt ttttgaattg tatttggctg tgttgaaact 4320 ggacctgact caggcctatt tcggacaggg atcatttcgt caagtttaaa aatattttta 4380 ctatagtcta atcggggcac acgggtgcct ggggacaaaa acacatctcc tctccccgag 4440 atgcctagtc tcagagacta gcgatcctgt gcaggcgatc tgcatattta agacttatcc 4500 acggtggatt tgtctcttct ttcttcactc ccactcgagc ttgctatcgg atagctctcg 4560 tttgcgttca caccatcaac ctatagttgc atcagactca gtcttcaacc ttttgatcta 4620 tctgtctatg tcgagcagca atgaaacttt ttctgctact cctaactctg ggcctactcc 4680 taatactggg gcagcacctc acacgacctc tagtggtgaa cagagttcaa atcaagatga 4740 agagcacatc aactccttgg cgagttcaac cgaaggtcaa 
aatcccgttc agcaaagcgg 4800 aaatgtcaca ggagagggtg accctggtat gcctgggaat ttcccggttg ccaatgccgt 4860 tctcgcatcc tgcaatgaca tcctcaagga cttttattca tctcgagtat caaaaggggt 4920 tgcgctcgcg agaatctaca caatcctact cgacggcatc cccgaaactg aagaaactgg 4980 ggcacacgtt gaagatgcct tcgaacgtta ccttgtcatc attgagaacc atcagaacca 5040 ccttaacgaa gcagaaaacc gcgggcgccg tcagcggtca cagagcccag ctaacaaagg 5100 tattgaaatt gatgatgaag agattccgcc tcctaaacgt gcgaaacccg atgacacgca 5160 atacccatgg gtggtatcag atttcataca cggcgccact ttatcaccgt cgcttaccac 5220 caccctcgag ttacttaaac tctatgcaat tgatcccaag ggcaccaagc gatctcttgt 5280 caattctccc agttgcccgg aatttcctga ctcagaatgg accaacgtcc ttttgggtcg 5340 tgccgtcaat ctggatgctg ttctcagcgg atactattca acatcaaaca acgatgaacg 5400 gattgaagaa attggagaac tggaaatccg cttcggcagt gtctccccag caaaactcgt 5460 gtcaagtgct ggagaatgga gtattgcgtg gaatcgtaca tctcgagcca tatgcactgc 5520 cttcccacat agggcaggtg aactcgctga atacgctgag tacatcattg gagtctttgc 5580 tgcgactgat gtccactttc atgaccgcgt tatctcattc gacaaagctg ttcgacgtcg 5640 tgtgggttca cgtcgagact tggaactcac caacttcaac cgttttgctg atctcaggtc 5700 aacgcacatg gattcaattg gagccgctgt tattcaacgt gcgagcatcg ccgtcgctaa 5760 taaatctggc agttcctcag gtcgtggtaa gaagcttccg gagccatgca accgctggaa 5820 cgaagggcta tgcacgcttg agaataatca gtgtcgccgg ctccacgttt gcaacaaatg 5880 tctgctctct ggacatacta gtaaacaaga caagtgtccc ccatcacagt aatctccaac 5940 tcaaaacagc tcatcacgat gtccatgatc acatccaacc ccctcgaatg acgctggggg 6000 catacaaaaa atgatttcaa tttcacgtgg ttcccgtcaa ccccaccggg agccatggac 6060 attggaccgt cttgtccatg aacggtccat cgccttgggg ttatctattg acaattcatc 6120 tcatctcacg tacacctcag cactaaactc gtacctatca ttttgcaaga tccacaattt 6180 ccccattgaa cccactgaag aaacagtcag tttctttgtc gtgtttatgt ctactcatat 6240 taaacccgat tcagtcaatt catatctctc cggaatttgc aatcaactcg aagtatattt 6300 cccgaatgtc cgaaaaaacc gcaacagcat cttagtatct cgaactcttg ctggatgtcg 6360 gcgacgtttt ggaacaccaa cacggcgaaa gcaacccctc tcatctcatg atctcggaaa 6420 ggttttatct cgaattggat cttccaaaaa ctatgatgac cgactttttc 
tcgcaattct 6480 cttcactggt ttccatggat tgtttcgcct gggagaatta accttccccg acagagtcgc 6540 ttcccgtgat taccgcaaaa tcatgtcacg acatactgtt gaccttaccc aatcctctta 6600 ctcattgttg ttgcctggcc acaaggccga ccgcttcttt gaggggagta ccgtcataat 6660 tcagaaaaca aatctcacta cagatccaca caacatcttt gtgtcataca taacatctcg 6720 cgatgcaatc cacccataca aacctgagct ctggttgcgg cagtctggca ttgtgccgac 6780 acgctcctgg ttcatcaaaa aattgcgtaa ttttttccca tctcatatcg cagggcaatc 6840 catgcgtgct ggcggcgcca cttccctagc tgaagcaggc gttcccccac acatcatcca 6900 ggccatcggc cgatggtcct ctgatacata caaaatatac attaggaaaa atccggtcct 6960 tcttcaagct atgctgtttg ggcggcctgt ccatcaacaa tgctagcttt gctctaactc 7020 tcacaaccca tagacatttt ttttgaactt ttatttatac ttgaacagtg cgttcttcca 7080 ttttctggta tctcacaaca tccctccgct tccactccac tcaacttccc ttttcttttt 7140 tctttttctt tccaaatata caaacactct cgctcccctt cccttccctt ccctttattg 7200 tttttaatag gtataatagg acatcaatac cagcaatgga tgtttctcgc gagttctcca 7260 ggcgcagtac tgcgcctgga ggactctgca tatgcctatt tagcttttca ccttacttga 7320 gatttttgga cccttaaccg ttacacgatt agtcgagttt aaaaatattt ttactatagt 7380 ctaatcgggg cacacgggtg cctggggaca aaaacacatc tcctctcccc gagatgccta 7440 gtctcggaga ctagcgatcc tgtgcaggcg atctgcatat ttaagactta tccacggtgg 7500 atttgtctct tctttcttca ctcccactcg agcttgctat cggatagctc tcgtttgcgt 7560 tcacccgccc tcctgcatcc ctggtttccc tcgtacatca agtatagtac tcagtcatgt 7620 acaggacatc aataccagca atggatgttt ctcgcgagtt ctccaggcgc agtactgcgc 7680 ctggaggact ctgcatatgc ctatttagct tttcacctta cttgagattt ttggaccctt 7740 aaccgttaca cgattagttg ggccttggtt gttgcttggt tgtgacattt cagggtcatt 7800 acaaacataa tcaaatacaa tttgaaaaaa ttcgtgctgc aaagcattat gcaagacact 7860 gtaaagggca ccgcacattt cggtcttgtt ctcggacgtc gcggggaagg ttcatttgac 7920 ctagttggtt ggacagattc cgattgggct caagaccccg attcacgtcg ttcagtggga 7980 ggatttgtgt ttgatgttgc tgggagctca gtttcatggt cgtcgaagaa gcaaccgaca 8040 gtcgcgttgt ccaccgtgga agccgagtat gtggcatcat caaatgcgac gaaggaagca 8100 atctggctga gagttttgtt ggaggatatg ggatatcccc agaccacggc tacccttatc 
8160 cacgcggaca atcaagggtg catagccctc gctcgcaacc ccgttgctca ctctcgtgca 8220 aaacacatcg atatccgtca ccatttcatt cgagaacggt tacaaaattc ggagatacgg 8280 ctggaatact gttcgacgaa ggacatgcta gctgacatct tcaccaagca gctcccacgc 8340 gaggccttcg aaacgttccg ttcggcgcta ggcgtactag ctgtgtgaga gataactctc 8400 cgagtgggag 8410 // ID DNA-4_AN repbase; DNA; FNG; 4562 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 09-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Nonautonomous DNA transposon. Putative classification: MuDR DE superfamily - a consensus sequence. XX KW MuDR; DNA transposon; Transposable Element; Nonautonomous; KW DNA-4_AN; nonautonomous DNA transposon; KW putatively MuDR superfamily. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-4562 RA Kapitonov V.V. and Jurka J.; RT "DNA-4_AN, a family of nonautonomous DNA transposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(12), 206-206 (2003). XX DR [1] (Consensus) XX CC Nonautonomous DNA transposon. Putative classification: MuDR CC superfamily. CC 9-bp TSD. CC Two identical copies. 
XX SQ Sequence 4562 BP; 1358 A; 893 C; 966 G; 1345 T; 0 other; gtgactacag ctcaggcgac cgcgctaaaa cctcaggcga ccttgagggt ccgtgcgctc 60 acctgacctc gcttaaagcg accgcggtcg tttgaggtca atacaacgca cgtaacttga 120 ataccacatg ctaacaccat attctaatcc ccgtgccccc ataatggcca aagtcaagaa 180 agaccctaat tacattcgat atactaagat ttcgaaagct gaattagacc ttcctgactt 240 caaggcagat gtgagcttgc gaaccagttt aagaaaccac catgagtgat atgattaaac 300 tagttgctaa ctaattcttc ttagaatcac ggctacctga ttccacatgt cggagagatt 360 tactgccgcg cccctgcatg caccaattcc gtactgctgc tgttgctgtt cgcctacaac 420 tagttactaa ctagttccag actcggtttc tgaacacaaa caacctcaag aagcacatca 480 gaaaagcgca cgtggataaa ttcgatctct tagagaaaga aggaggtggc cgtcctactg 540 caaaagagga ggatgatgcg attggtgatt acagcaagat tgttgttgcc agaactagtt 600 cctaactagt tcagtatttt acaaagccgt gctcgaagcc tatgatgcaa gacaagagga 660 tggggtaggg aagcctgaga tcccccgccg tcgcgacgga aaggtatgat tattctacaa 720 cgctttaaaa ccagttgcta actagttata gataaaccag tcagcagtca aaaaatacgt 780 tcgtgagcag ggttattcag tcccttgtga tgcatgcaag gaagcagata aggcaaaagg 840 tatgcagcaa tctaaagtaa aatacattaa ctggttacta actagtttag actgctgtag 900 ggaggctaat ctggatgttt gtgaccactt tgagctcttt gagccttacg aggatgaaga 960 ggaggccgag agtaacgagg tcttttaact agttgataac tagttccacg atttcttaat 1020 cagtccagac ccaactctcg tcgacgaagc atattttcaa gctctaattt ttcagcctct 1080 gccttctcct ttcgtattcg aacttgttga gcttctagtt caagttgttg tctttgcagc 1140 tcaaggtttt gaacctgtaa attcgctgta gctgatcgct ggaggatagg tgttctaaaa 1200 tgtagaacct ggggtcagcg gaactaatcc agaactagtt tcaacaagta agaatagcat 1260 actctgaaga agacctggat ggtgactgat gtgctggtgt attataacga gaacttttta 1320 gaagttagaa tcatcatgaa ctagtttcgg actagtttat agtacgaacc tgcttccccg 1380 tgatctagat cttgaacctc gaccacggga actagatcgg cgtggaggaa tgaatcctga 1440 agaactcgcc ctaaaaagaa tactattatc ttcaactgaa gatagaacta gctcattatc 1500 ctgagccagg cgttttcgtt tccgtgtgcc tggggtaact agttagtaac tagtttaaat 1560 aggtgaaaaa taacttgcct tcacgcctga tactattata aaaccgggta tccatacttg 1620 tgatccgtga taaatgacta gttccatatt 
gatttcgagc aagatattgg tcaatatcac 1680 gctggtcaag gattcgggag ctagagatat cagttgcaat cataactagt tgacaactag 1740 ttcttacctc aaggtcgctc ctaataacga gtcataagtt cccatataat atgacttata 1800 atggctttgc tccactgcat tcgtaatatt cctaattttc tcaaaatatt ctggctttat 1860 caggctacag gctttattta aacctgcagc aataactgaa tgccgtttat gccttgccca 1920 ctctgcaatt ttaggggttt cgtgagctgt ataacagtta gaactagttc ccaactagtt 1980 gggtaagatc cttaccttct aataattcaa tcaggtgaaa gtattcctct cgggttgagc 2040 aagtaagaag gctttccatc cgacatctag ctgaagaatg ataatcagat agatctccag 2100 tctgttgaac tagtttagaa atagttctta gaaaatgaac ccggcataaa atcagaatat 2160 tcataagctg ccatgtccag ggacgctgaa gaggatcaag ctcttgcagg taatggccaa 2220 gtcctgtata ctagttaata actagtctat ataataacta ggggctgctc ttacctgaca 2280 tctgctttgc acacatatct gtaataatag ccttaagtcc cctattatgt agataaaaga 2340 attgtgttgt acatccttgg atatcagcaa ttaactggaa gacccgtgta aagaggagtt 2400 tatagctaaa tgctgactca tgatccatga aaacccgaag taaagttata actgtagttg 2460 ttagctatgt agaactagtc tataactagt tcaggtactt actcttccca tgctgaggta 2520 aaaaggttgc aaaaataacc tcattaaagt cccctttaac tcgtttataa gacatatcaa 2580 cttcaaatga atcaagggat ccaaggagct ggatttgctt attaaatcca caaataatca 2640 tgacaatatc atggtctgta tagattctgc gaacatattc ctaagtatga ttagcagaga 2700 actagtcatg aactagttta catactcgat attctggatg ccgatccaac tcgcagcgta 2760 cacctgcaag gccacggcct ttaggatata gtaaagcctt ttcctttgct ataatagctg 2820 caaatcggtc ttgatttaca aagctattat ggatttgtga aagagtctgg ccattatatt 2880 tttggcaaaa ctgtttcagt gccggactcc gtagaaataa acctaaacta gtttagcaca 2940 tgctcaacta gttgtgaact agctgcactt actaggggtc aagtcaggac tctgcatctt 3000 tcgaataagg tccaaaatct catctagtat aagctgcgga ggcttatttg gaggaggagg 3060 aggatgtgaa tgaataccac gtgatacaaa gactgtataa gggcattctg caatatttaa 3120 tggtataaag aaagtaaatg ttgtactgca tctgaatttg ttgatctttc ccggcccttg 3180 cggatggtca acaccttaca aggactagtt agtaactagt tattcaaaga tagaatagac 3240 aggtccttac cacagtttgc tctccgagtt gtttgttgat caatcatttc acaggcttca 3300 ttagctacct ctagcggcct tgaaaataga tcttccaata 
atgaaatgtc aaggttagaa 3360 gccctgtgag aaaggctata ggtaaaatgc tgctgaaata tatctgatga ggcattgata 3420 caagcaatat agtaatggct tgctccggag atatcctgaa ctagttcaga actagtttcc 3480 ataggttaga agcaaggaca cttacaggct cagagtgttt gcgaagaact ggtttgcaac 3540 tagtttgcga tgctaggcat gcaacgctat tttggaagga cttgcagact gaacggtaga 3600 ggctgctagt aattagttag ggactagtta gggactagat atagttacct ataggttgta 3660 cgagtctctg gccgctctcg ctgtgatata atgtcccttc ttttttggaa aatagtattg 3720 aacatcgcat cagtaacctc ttggtgattc atatccagta aatctggatg taaatattca 3780 caggctttaa tgcctgtaca tgtataatga tatttcttga cacagcactg caaaaatgta 3840 gaaaagattg tacgtggagt tccagcaatt ttatgggaat attgaatcta aactagttag 3900 taactagtcc atctgataat gacgattagg tttaatagaa ctagttccga ccgagcggga 3960 actagtttac attacttact gatgtgagtg cactgcgaat cttttcaggc tggctttgat 4020 gaaaattaat cacatatgca tatccttgtg gatgagtttg tggatattca ggcaaatgat 4080 tagtatattc aattttttgt gctgagactg gcgaaggtgc tcctgttgtt aaaggaactg 4140 gcataggatc ctagaactag ttagaaactg gtatacccaa aatatatatt cctacctcct 4200 taacctctac ctcggcctgg tttcgctcat gtaaaattct atcatcattt tcatgatata 4260 tttcagcttc attctcattc aaagaatcac tctcagatga gattgataaa tttattaagt 4320 cattaatgac ggtttggtcg ttgaggtaat tgtactcggt atccattgta aactagttca 4380 aaactaattg ggagaatcat aggttgtatc gaggctggat tggtgtaaag tgattacaga 4440 gtgaaatatt tatttttttg agcgaggtcg tctgacctca acgatttagc gaagtcacgt 4500 gagctagcgg tccacaccaa ggtcgcctga ggttttagcg cggtcgtttg agctgtaacc 4560 tg 4562 // ID Copia-19_MLP-I repbase; DNA; FNG; 4234 BP. XX AC AECX01002810; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-19_MLP_; KW Copia-19_MLP-LTR; Copia-19_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. 
XX RN [1] RP 1-4234 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002810; Positions 3793 8026. XX CC Positions [1603-2100] - Integrase core CC 'CCAAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 82..1335 FT /product="Copia-19_MLP-I_1p" FT /translation="MSQPTFSTKAAEKEDDNESSSGSDNHHSSKVKITKLT FT RKNWHEWQIRFDNILVSKGYDDLLDETWVKNNLNKPLFKRKNAWAINKLFE FT TVSIELRAILLKNSKSFFNAYTALSKSCGVSSILMIGMKVQHWMSIRYIPG FT GSIQEHCDQFSGNYSSLTASLAATTPAVLTISTELAAIFFLLSFNDDETLT FT PLIQNCWDLSPFNFETCYDKMIMEHNRRDSQEQDVINFSKVKGKNNEKPSS FT SQNKPNSSTFNPKSIVMTKKPFVKNFETIDEKVARLVAQHLSKKSEANLIQ FT QNSNSDKEQVDHVQDNLSDDIKNSGFMLTDSINYLGTEKFPLVYDSGATKS FT TVNNLDLLIDPAPISKSLNTYGGSVAISHVGKLNFGGTIIHPVYYSENGPR FT NLISTTQLEDHGMNIIHSHRKVQV" FT CDS 1414..4233 FT /product="Copia-19_MLP-I_2p" FT /translation="MTQEVNFTEAKDWHIILGHPSDVYLRKFLQLNDIKSS FT AVQDSTGCLICLRCKLKNSPHSNPIPSASKPFEKIHSDLLQITPVTGQGIK FT YVLVLIDDFSRFNRIYLLKKKSEAPNRILHYIAEIKSKTGSNPAYFHSDRG FT GEFTSTFLVNEFAKLGITIEQGPANSPKTNGLSERFNQTLLVKMRCILAQS FT SVPINFWDEAARYASQLINILPSKPLKWTSPVSILSDLNLSIEPLRKLTKL FT IPFGLKVFVRQQQESKIQPPSNSMLFLGYEPFSDAGRFLNLKNRHIVISRD FT YSASPLNFPYTSESVIKKPVETLPNRKVTYKSENVVVNLKTPKKITATSSI FT EGTPAIPSSRPETPPLPPTQPVAPAAPKKSHHVHIPYSDKAPKDISSKIDS FT SNILKTTRRKANSNNGDEEEELEVNLAIDVSLRKALMNPEEAPKWKEAMQS FT EFNSLVSKETGTLVPPPNHDKVIGGMWRLKKKLNEFGEVSRYKARWVCLGN FT NQEEGIHYFRTYASVARNESFKLLLVMVVQGHYYAYQFDIETAFLEGLIDA FT PIYLSQVSGFEEIGREAFVWLLYKSLYGTKQALRRWKAKLVEVLLMAGMVP FT SQGDESLFVNQSKTMFLHIHIDDGFVISKDKNVIMKLFQDLKKEFTMKIKE FT RPTQHLGYSLDWKTGGSILLHQTDFAKKTLDRFDMTECNPIKTPAPMNIHA FT LVASKGDPVKVLLCQQALGMLNHLSLHTRPDLTFTVNLLTQFTTNPNTFHW FT SAICHLFRYLKGTISLGLHYTKSQSPLRPELCGWADTDYATSFVTKKSTSG FT FTLTLYNNPICWTTKKQSVVAQSTTEAEYISSNKCAKQLRWMSILMTSLNL FT KISKPPLLINDNTGAVTISQEAQLNPNSKHIEVRYQYLRDLVMKNLMSIEQ FT VPSAEMIADVLTKPLGTVKHQVAMNQLKMACCDLGG" XX SQ Sequence 4234 BP; 1361 A; 1002 C; 
731 G; 1140 T; 0 other; tggtagcgag agactcctct tggacaaaac tttatattat ttctctaatc attcgcagtc 60 cagaacaaat cgatcgacaa catgtcccaa ccaacttttt cgactaaggc cgctgaaaag 120 gaagatgaca acgaatcatc ctccggatca gataaccatc attcctctaa agtcaaaatc 180 accaagctga ctcgtaagaa ttggcacgaa tggcaaatcc gctttgacaa tatcctggtc 240 agcaaaggct atgacgactt actcgatgaa acttgggtaa aaaacaactt gaataagcct 300 ctattcaaac gaaaaaacgc gtgggcaatc aataagctct tcgaaaccgt ctcgattgaa 360 cttcgagcaa ttctactcaa gaatagtaaa agtttcttca acgcttacac cgctctatca 420 aaatcctgtg gagtttcttc aatcttaatg atcggaatga aagttcaaca ttggatgtcc 480 atccgttaca ttcctggagg ttcaattcaa gaacattgcg accaattctc tggcaattat 540 tcctctttaa cagcatcgtt ggccgccacc actcctgctg ttctcaccat atcaactgaa 600 cttgcagcta tctttttcct cctaagtttc aacgacgacg aaacacttac tcctctcatt 660 cagaactgct gggatttaag tccttttaat tttgaaactt gctacgataa gatgattatg 720 gaacataatc gtcgagattc acaagaacaa gatgtcatta atttttccaa agtcaaagga 780 aaaaacaacg aaaagcccag ttcatctcaa aataaaccta actcttcaac ctttaatcct 840 aaaagtatcg ttatgactaa aaaaccattt gtcaaaaatt tcgaaacaat tgacgaaaaa 900 gtcgctcgac ttgtcgctca acacttatct aagaagtctg aagctaacct cattcaacaa 960 aattccaact ctgacaaaga acaagtcgat catgttcaag ataatctttc tgacgatatc 1020 aaaaattcag gatttatgtt gactgattct attaactatc ttggaactga gaaatttcca 1080 ctggtttatg attcaggagc caccaaatcc actgtcaaca acctagatct actgatcgat 1140 cccgccccta tctccaagtc tctcaacact tatggaggct ctgtcgcaat ctctcacgtc 1200 ggcaaactca actttggtgg tacaattatt catccagtat attactccga aaatggcccc 1260 agaaatctca tctcaacaac tcagctcgaa gatcatggta tgaatatcat acattctcac 1320 cgaaaagtac aagtttgatt aggcgacaaa atcatttttc gattcttttg agaaggcgac 1380 ttgtacgtat caaaaaacac ctcctctctc aatatgactc aagaagtcaa ttttaccgaa 1440 gccaaagact ggcatatcat cctcggtcat ccctccgatg tctacttaag aaaatttctt 1500 cagttaaatg acatcaaatc atctgcagtt caagatagca ctgggtgtct catctgtctt 1560 cgctgtaaat taaaaaactc tccacactcc aatcccattc ccagtgcttc caaaccattt 1620 gaaaaaattc actctgatct gcttcaaata actccggtca ctggtcaagg tattaagtat 1680 
gttctagtac ttatcgacga tttttcaaga tttaatagga tttatttatt aaagaaaaaa 1740 tcagaagccc ccaacagaat cttacattat atcgccgaaa tcaagtccaa gaccggaagc 1800 aatccagctt actttcactc cgatcgtggc ggggaattta cctcaacgtt cttggtgaat 1860 gaattcgcaa agcttggaat caccatcgaa caaggacctg ctaactcccc caaaacgaac 1920 ggtctctcgg aacgtttcaa tcaaactctc ctggttaaga tgagatgcat tctggcacag 1980 tcttcagttc caatcaattt ttgggatgag gctgccagat atgcgtctca attaatcaat 2040 atactaccct caaagccact taaatggact tcaccggtca gtatactcag tgatttgaac 2100 ctcagcattg aacctcttcg aaaacttacc aaactcatcc ccttcggtct gaaggttttt 2160 gttcgacaac agcaagaatc gaagattcaa cctccttcaa actccatgct gtttttagga 2220 tacgaaccgt tttccgatgc tggaagattc ttgaacttga agaatcgaca catcgtgata 2280 agtagggatt actcagcatc acctttaaat tttccctaca cgtctgaatc agtcatcaag 2340 aagccagtgg aaactcttcc gaatcgaaag gtgacttata agtctgaaaa cgttgtcgtt 2400 aatctcaaaa ctccgaagaa aattaccgct acctcctcaa ttgaaggtac cccggcaata 2460 ccttcgtcca gacctgaaac accaccgcta cctcccactc agcctgtcgc tcctgctgcg 2520 cctaagaagt ctcaccacgt tcacattcct tactcagaca aggcaccgaa agacatcagt 2580 tccaagattg attcatcaaa tatcctcaaa acaactcgtc gcaaagcaaa cagcaataac 2640 ggtgacgaag aagaagaact tgaagtaaat ctggccattg atgttagcct tcgaaaagca 2700 cttatgaatc ccgaagaagc cccgaaatgg aaggaagcaa tgcaaagtga attcaactca 2760 ttagtttcaa aagaaaccgg cactcttgtt cctccaccaa atcacgataa ggtcatagga 2820 ggcatgtgga ggctgaagaa aaagcttaat gaatttggtg aagtttcgcg ctataaagca 2880 cgctgggtat gtctgggtaa caatcaagaa gaaggaattc attatttcag aacttacgct 2940 tccgttgcaa gaaacgaatc cttcaaactt ctcctcgtca tggttgttca aggtcattac 3000 tacgcctatc aattcgatat tgaaacagca ttcctggaag gactgatcga tgctcccatc 3060 tacttatcac aagtgagtgg ttttgaagag atcggcagag aggcatttgt atggctatta 3120 tacaaatcgc tttacggtac gaagcaagct ctgcgtcgtt ggaaggccaa actggttgaa 3180 gtcttactga tggcaggtat ggtaccgtcg caaggtgacg aatcattgtt tgttaatcaa 3240 agtaaaacaa tgtttttgca tattcacatt gatgacggat tcgtaatcag taaagacaag 3300 aatgttatca tgaagctctt tcaagattta aaaaaggaat tcaccatgaa gatcaaggaa 3360 agacctactc 
aacatttagg gtatagtctg gactggaaaa ctggcggttc aattctctta 3420 catcaaactg atttcgccaa gaaaactctc gaccgctttg acatgactga atgcaaccca 3480 atcaagacac cggctccgat gaacatccac gccctagttg caagtaaagg cgacccggta 3540 aaagttttgc tttgtcaaca agcattggga atgcttaatc acttatctct tcacactcgt 3600 cctgatctaa ctttcaccgt caatcttctg actcaattca caaccaatcc taacaccttt 3660 cactggtcgg caatctgtca cttattcaga tacctcaaag gtaccatatc tctggggcta 3720 cactacacca aatcgcaatc tcctcttcga cctgaactat gtggatgggc tgacactgac 3780 tacgctactt catttgtaac caaaaaatca acttccggtt ttactctgac attgtacaac 3840 aatcccatct gctggacgac taaaaaacaa tctgtagtcg ctcagtcaac aaccgaggcg 3900 gaatacatat catcgaacaa gtgcgccaaa caactcaggt ggatgtcgat attaatgaca 3960 agcttaaatc tgaagatatc aaaaccgcca ttgctaatca acgacaacac cggtgcagtt 4020 actataagtc aagaagcaca attaaatccc aactctaaac atattgaagt acgatatcag 4080 tatctacgag atctagtcat gaagaatctt atgtcaattg aacaagttcc ttctgcggaa 4140 atgatagctg atgttctaac aaaacctctg ggtacagtaa aacatcaagt agctatgaat 4200 caactcaaga tggcgtgttg tgatttaggg ggag 4234 // ID I-6_AO repbase; DNA; FNG; 4498 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE A family of I non-LTR retrotransposons. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-6_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-4498 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-4498 RA Kapitonov V.V. and Jurka J.; RT "I-6_AO, a family of I non-LTR retrotransposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 14-14 (2006). 
XX DR [2] (Consensus) XX CC Its 5' terminal portion is incomplete. XX FH Key Location/Qualifiers FT CDS 2..480 FT /product="I-6_AO_1p" FT /translation="PRFAVVAHRTPTEDFLAPEDEKDFISSIMEENKMELH FT GFRINRIAWLKSRDKPIGKHGSLAIWFDTREAAEWTMDNGLLVRQTCVRVE FT PYKFKERRCHRCQGFGHLAWACRGKARCGHCAGAHELRHCPPGVRARCLDC FT TGEHPTNDRSCPNPAKHPYSX" FT CDS 481..4356 FT /product="I-6_AO_2p" FT /translation="MTTPNLRLVQLNIWKSRAGMEALINDSQTQNLDILLI FT QEPPLTAYSTHVNHSAWHLYQSTCQEDTPKKRSLLYVNRRISTASHRQIKC FT NHPDVTAVKMWTMERQMLIFSIYVPTTNHSQPMEEVSIQAMLEEIETCIQQ FT AIETTDKPTTIIMAGDFNRHHPMWSKNRIYHVAIEHAEELVTFFHKHGLQP FT CLPRGTPTYWSMSYPGSNSTIDLTVTDTPESLIKCHLYHDHYGSDHRAVYS FT EWSLDPSRNAEREPRRAYDRADWKQIGESVQAQIAQTPPIHTEAELEGAVA FT KLITCTTFAVDQHTPMAKPSPYSKRWFSPELKEQQREVNRARRKWQESCAE FT KGRQDPMSLILFGDMRTKRRAWTRAIEKAKATHWKDFLDSAGEGHLWKAAS FT YMRPRESYGNIPPLKQDKGETADNTEKARLFMRTFFPRMAPPEDTMEMEQR FT EEIPWTPITEQEVYRALRAAKPMKAPGEDGIPMLVWRQLWQWLRTEILRIF FT TASVNLGYYPEKWKRARIVVLRKPGKPDYTLPGAYRPISLLNTLGKVLEAV FT MAKRLSYYAEEYGLLPNTQFGGRPGRNPEQALLVLRNAIDRAWLASKVITL FT VAFDLKGAFNGVSSNILDRQLKAKGIPTKLRAWVASFMEGRSASISFDDFE FT SARSLLENAGLAQGSPLSPILFIFFNSNLVNQRVDYHGGASAFIDDYFRWR FT AGKSAEENIRKLQEEDIPRIEQWAKLTGSCFAAEKTELIHFTRKKREQTKG FT RLVIQGATIEPSATAKLLGVVCDQELRWKEHVQQAVNRATKVNIALAGLRH FT LRPGQMRQVYQACVTPIMDYASTVWHNPLKDKRHLRVLDTVQRSALIRILS FT AFRTVATATVEVETYTLPTHLRLKQRAQRVIVNLCTLPRDHPIQDVISRAR FT RRRDNVGSQPRFPLAESMKTMRLEQLDGLETIDPKPMAPWKPPGFLEIDIE FT PDREKAKDKATALQASSNMVVFSDASGQNNQLGAATVILDHNKDVVESRQL FT SIGSMANWSVYAAELIGIFYAISLVLKIVSSRPRTPTTSQQEPATILCDSM FT SALQAIRNPGNKSGQRIIHANLQAAAELKARGIPLRLQWIPGHCDDPGNDA FT ADKLARMAVGLDKMHPFPRLVSQERASIRKQILKEWEHEWKTCKKGSHLRR FT IDPKLPAIRTRRLYDSLPRNQAYLLTQLRTGHCWLAPYGKLHGHREDDKCE FT CGAKETVTHVLLDCSKLRIPRQKLRRELGEAFGDIPAMLGGKGETSHVKAV FT LDFAEASQRFRSRGPRGPQRQNSRQTATTGP" XX SQ Sequence 4498 BP; 1338 A; 1257 C; 1131 G; 772 T; 0 other; accgcggttc gcggtggtag cacatagaac accaacagag gactttctgg ctccagagga 60 cgagaaagat ttcatcagca gcatcatgga agaaaacaag atggaactgc acggcttccg 120 catcaatcgg 
atcgcctggc ttaaatcaag ggacaaaccg ataggcaaac acgggtcgct 180 ggctatttgg ttcgacaccc gcgaggcggc ggagtggacc atggacaacg ggctcctagt 240 gagacaaaca tgcgtcaggg tggagccata caaattcaag gagagaaggt gccaccgctg 300 ccaaggcttc ggccatctcg cctgggcatg cagaggaaaa gcaagatgtg ggcactgtgc 360 aggcgcgcat gagctccggc attgcccgcc tggggtaaga gcccggtgtc tggactgcac 420 gggagaacac cctaccaacg atcggagctg ccctaacccg gctaagcacc cgtactctca 480 atgacgacac ccaacctgcg actagtacaa ttgaacatat ggaaatctag agcaggcatg 540 gaagccctga ttaacgattc ccaaacacaa aacttggaca tactcctcat ccaagagccg 600 ccactcacgg catacagcac acacgtcaac cacagcgcct ggcacctcta ccagtcaaca 660 tgtcaggagg acacccccaa gaaacgtagc ctactctatg tgaacaggcg aatctcaaca 720 gcctcacatc gacagataaa atgcaaccac cccgatgtca cagcagtcaa aatgtggacc 780 atggagcggc aaatgctgat attctcgata tatgtcccaa ccacgaacca cagccaaccc 840 atggaagagg tctcaatcca agcgatgctg gaagagatcg agacctgcat ccaacaagcg 900 atagaaacca cagacaagcc aacaacaatt atcatggcag gagactttaa ccgccatcac 960 ccaatgtgga gcaaaaaccg catctaccat gtcgcgatag aacacgctga agaattagtc 1020 actttctttc acaagcacgg actccagcca tgcctccccc gaggcacccc tacatactgg 1080 tcgatgagtt accccggaag caactccacc atcgatctca cggtaacgga tacacctgaa 1140 agccttataa aatgccacct ttaccatgac cactatggat cagaccatcg agctgtttac 1200 tctgaatggt cccttgaccc atcacggaac gccgaacgcg aacctcgaag agcatatgac 1260 cgcgcggact ggaaacagat tggagaatct gttcaagccc aaatagcaca aacaccgccc 1320 atccatacgg aggctgaact tgaaggagca gtggcgaagc tcattacgtg cacaaccttt 1380 gcagtggacc agcacacacc aatggctaaa ccgtcaccat actccaaacg gtggttctcc 1440 ccagaactca aggagcaaca acgcgaagtt aatagggcac ggagaaaatg gcaagaaagc 1500 tgcgctgaga aagggcgtca agatccaatg tcactaattt tgttcggaga catgcggacg 1560 aaacgcagag catggacaag agccattgaa aaggcaaaag ccacccactg gaaggacttc 1620 ctcgacagcg ccggagaggg ccacctgtgg aaagcagcct cgtacatgcg ccctcgggaa 1680 tcgtatggaa acatcccacc attgaaacaa gacaaagggg agacagcgga caacacggag 1740 aaagcaaggc tatttatgag aaccttcttc cccaggatgg caccaccaga agacaccatg 1800 gagatggaac aaagagaaga gatcccatgg 
acacccatca ccgagcaaga ggtctacagg 1860 gcccttcggg ccgccaaacc catgaaagca cccggagagg acggaatacc aatgctggta 1920 tggagacagc tgtggcaatg gctgaggacg gagatcctcc ggatcttcac cgcatcggtc 1980 aacctagggt actatccaga aaaatggaag cgagcaagga tcgttgtcct ccggaagcca 2040 gggaagcctg actataccct gccgggtgca taccggccga tctctctcct gaacactctg 2100 gggaaagtcc tcgaagctgt catggccaaa cgcctttcat actacgcaga agaatacggc 2160 ctgctaccaa atactcaatt cgggggtcga ccaggccgta accctgaaca agcgttgctt 2220 gtgctgagga atgcgattga ccgagcatgg ctagcgtcga aagtgataac actagtggcc 2280 tttgacctca aaggggcatt taatggggtt agcagtaaca tccttgaccg acaactcaag 2340 gccaaaggaa tccccaccaa gctgcgagcc tgggtagcca gtttcatgga aggccgatca 2400 gccagtatct catttgacga ctttgagtcc gccagatccc tgctagagaa cgcgggcctg 2460 gcgcaaggtt cgcccctatc gccaattctc tttatcttct tcaactccaa cctggtaaac 2520 cagcgggtgg actaccacgg cggcgcgtcc gcatttatcg acgactactt tcggtggcgc 2580 gcggggaaat ccgccgagga gaacatcagg aaactccagg aggaagacat cccccgcatt 2640 gaacaatggg caaaactaac aggctcatgc ttcgccgcgg agaaaaccga gcttatccac 2700 tttactagaa agaaacgcga acaaaccaaa gggagactgg taatccaagg cgcgaccatc 2760 gagccctcag cgacagccaa gctgcttggg gtggtgtgcg accaggaatt aagatggaag 2820 gaacacgtac agcaagcagt gaatcgagcc accaaggtca acatcgccct ggcgggactt 2880 cgacacctcc gtcccggaca aatgaggcag gtctaccagg cctgtgtgac accaatcatg 2940 gactatgcat caacagtctg gcacaaccct ctcaaagaca agagacacct tagggtgctc 3000 gacacggtcc aaaggtcagc cttgattcga atattgtccg cattccgaac agtcgcaaca 3060 gcaacggtgg aagtggaaac atacacgctg ccaacgcatc tgcgactcaa acagagagca 3120 cagagggtga ttgtcaacct gtgcacgctc cctagagacc acccgattca ggatgtgatt 3180 tcacgagctc ggaggcgacg cgataacgtg ggatcccaac cccggttccc gctggcagaa 3240 tcaatgaaaa ccatgaggct ggaacaacta gatggactag aaacgatcga cccgaaacca 3300 atggcccctt ggaaaccccc aggctttttg gagattgaca ttgaaccaga ccgggagaaa 3360 gcaaaggaca aggctactgc cctacaggcg tcatccaata tggtagtctt ctcggatgca 3420 tccgggcaga ataaccaact tggcgcagcg acagtgatac tagaccacaa taaagatgta 3480 gtcgagtccc gacaactctc cattggatcg atggccaatt 
ggtcagtgta cgcggcggag 3540 cttattggga tattctacgc cattagcctt gtcctaaaga tagtaagctc aagaccaaga 3600 accccgacaa cctcacaaca agaaccagca acaatcctct gcgacagcat gtcagcacta 3660 caggccatca gaaacccagg caataagtca ggacaacgga tcatccatgc caatctgcag 3720 gccgccgcag aattgaaagc gcgaggcatc cccctccgcc tccaatggat cccgggacac 3780 tgcgacgacc ctggaaacga tgcagcagac aaactagcca gaatggcagt tggtctggat 3840 aaaatgcacc ccttcccccg ccttgtctcc caagaaagag caagcattcg aaagcagatc 3900 ctcaaagagt gggaacatga atggaagaca tgcaaaaagg gtagccacct acggcggatt 3960 gacccaaagt tgcctgctat ccgcacccga cgactatacg actcgttgcc acggaaccaa 4020 gcctacttac ttacccagct gcggactggc cactgctggt tggctccgta cggcaaactt 4080 catggacacc gagaagacga taaatgcgaa tgcggtgcca aggaaacagt aacccacgtc 4140 ctactagact gctcaaaact tagaatacca cgacagaaac tacgcaggga gcttggagaa 4200 gcgttcggtg atataccggc catgttagga gggaaaggag aaaccagcca cgtgaaggct 4260 gtactggatt ttgcagaagc atctcaaagg tttcgtagcc gcggaccaag gggcccgcaa 4320 agacagaact caaggcagac ggccaccaca ggcccctagg cgaggctcaa agttcgcccg 4380 taagaaatgg gatcatgtac aatagagata caagggaaga tatatagttg taggtagcct 4440 gcatatcgct ttagtgcgag aggtcagggt aaatgaaggc atccatccat ccatccat 4498 // ID TKM1_LTR repbase; DNA; FNG; 385 BP. XX AC AJ439546; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Kluyveromyces marxianus retrotransposon TKM1_LTR, long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Long terminal repeat; RNaseH; TKM1_LTR; gag; integrase; pol; KW protease; retrotransposon; reverse transcriptase. XX OS Kluyveromyces marxianus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Kluyveromyces. XX RN [1] RP 1-385 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). 
XX DR Genbank; AJ439546; Positions 1 385. XX SQ Sequence 385 BP; 137 A; 85 C; 57 G; 106 T; 0 other; tgttgatcct ggccgtccta cgacgttgtc tagggcgcca gatattatac caactataaa 60 agctaaggct aaagcctaga tatactagga tcaaggctaa agcatttgat aattgaattg 120 attccaaaaa gtatataagt agaaggtttc ttcatccaat tcccactttc gaattgacaa 180 cttacttact atccaattat aagtcaagtc agaagaaacc aaggagaaca acagtaaata 240 cttacttatc tagtgaatcc aacaattcaa ttaccaatta attagtccaa cagcttcaag 300 tcaaaatggc atccaacgac attatttcta ctaacgtccc ttctaaggtc cctactaccg 360 atactgagga atatccagtc caaca 385 // ID Copia-1_UM-I repbase; DNA; FNG; 4441 BP. XX AC AACP01000055; XX DT 28-JUN-2010 (Rel. 15.06, Created) DT 28-JUN-2010 (Rel. 15.06, Last updated, Version 1) XX DE LTR retrotransposon from the Ustilago maydis genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_UM_; KW Copia-1_UM-LTR; Copia-1_UM-I. XX OS Ustilago maydis OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina; OC Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Ustilago. XX RN [1] RP 1-4441 RA Kamper J., Kahmann R., Bolker M., Ma L.J., Brefort T., RA Saville B.J., Banuett F., Kronstad J.W., Gold S.E. et al.; RT "Insights from the genome of the biotrophic fungal plant pathogen RT Ustilago maydis."; RL Nature 444(7115), 97-101 (2006). XX RN [2] RP 1-4441 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Ustilago maydis genome."; RL Repbase Reports 10(6), 840-840 (2010). XX DR EMBL/GenBank/DDBJ; AACP01000055; Positions 4020 8460. XX CC Positions [1582-2097] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 772..4113 FT /product="Copia-1_UM-I_1p" FT /translation="MNHCTGRCYENASSTHFTRKSAPPTNRSHSRTSQTCS FT ASIRPNTSHWAPPSSLEMPRPGSPRSTKPRHRLTTLRPSPASRAQRKRQQT FT CLRCEQRGHSRQQCKATVILPPPSEYRKRVFTAGSQVLKAPLIGDAQVQTE FT ATDEGARVTLDREGGQIYLADGIPLRMVKNRKRRLFEFQGETWKEHALVAS FT PDLFMGVEENFASAERKPKLTATELWHERLGHPGRDKTRKLLQLLKNERDI FT KLDAETPTNCAGCIQAKSTKARHGAGSGERADAPLDLVHIDLIIDSSRETE FT FTCTLVAVDDYSKYVYVQPLVSKADAFLELQRMVSFLELQTGRQLKALRSD FT QGSEWTKAEARIWARDKGILWQMTVSYDSKQNGRVERMNRSLQEKMRALLI FT QRKLPARFWPYALRTAAFLINLTPNVDDKIPYAQIFSKTPTRFIRLLRVFG FT CLAWVNIPKAKRNRQKIDERSVPAIFIGYSLERKGWLFYSPNYSPNIFWSN FT SAKFMEVQSWAERTAWRPIDISPPHAPTHEEDVPDFGYTEIDQFDETANAP FT IDDFVDIDAIPDDADGMLPPPETTQEAAPVSSDGAQASREVTPVSSVGATD FT GTDTRDVLQPAGPSVDLPFRDNEAILSEKYAAIQNSKNEWGHMINFFAPQL FT TESIQDGFPERPTDAQPDDADPETWPRHLRRAPLSVNLRTGHALAITTSPA FT VNLSPTLNQAKRGEDWPLWQEAMRSEIGGLEANDTWIVADLPPGQNLVDSK FT WVLKIKTDANRVPTRYKARLVARGFTQREGVDFDEIFAPVAPLEAIRGILA FT VATVFDWEVDGIDVVQAYLNSTLHHDVYMKPPAGVNVPPGQVLRLVKGLYG FT LKQSGREWNLVLDAHLRKIGFHAVPSAPCFYSRGEGGSRTILTCYVDDILI FT TSPSRAEVDRTKAEIAEKWNIEDNGPVKEFLGIKIDRDRKNRHLSLDQSAY FT IKEMANNWLANDVRKTAKSPFAKYFEVGPDMACDAKQAKKYQELVGQLLWI FT SNTVRPDVAFAVGTLARYMSDPIEPAWLAALQVVRYLNFTADYRLILGPYR FT NMEQPVVTHTDANWASDRNTQRRSTSGFNDIHLWMPSQLEIPRPKNASPCR FT QSKPSS" XX SQ Sequence 4441 BP; 1156 A; 1333 C; 1137 G; 815 T; 0 other; agtgattaag agtctgaatt ctctgcgaca gagtccacat caagaattcc aaaaaatggt 60 taacacgcgc agccgccacg ggcgctcacc aagcttggac gatctcgaat acgaaccaag 120 cgctgaaagc aacaccgagc agcaaacgcg tgagtacgaa ctccccggaa gaactccggg 180 ggtggacgac aacaacgacg aaaccgacac ggacgatgcc gacaacatcg gcaagggccg 240 tacagtggtc ttcgagaccg ccgctgaaac cagcggtgaa gacgaacgcc gcgaggtcga 300 acccgtcgtc tcatacccaa gtatgaggga ggtcgccaaa cgaatggtgg ccattccgaa 360 actgactgtg cgaaactacc acaattggcg catcatgctc ctcaacacca tgcgcctcat 420 cccgaaagcc tcgggctaca ttaacggaac gatcaagcct tcctccaaga aattcaaccg 480 agaattcgac aattgcctgg tatctggtat ccagcagact ttcgccttgg atggggagca 540 
caacgtgaac tgggtactgc ttgaactaag cactaaacac cgcggctcgt atcgactcct 600 ccagaagatc gaagaaaagc tctcaagccc ccgcgaacgc gccatgcgca aagtggcact 660 catcggacag gtcacacaat gtcaagatgt accacaatga cgtccgcaag ctttgcatag 720 agctacgctc catccgttca gaaagcgtca tcattggcaa acccgcaagt gatgaatcac 780 tgtacggggc gttgctacga gaatgcttca agcacccact ttacaaggaa gtctgcgcct 840 ccgaccaatc gctcacattc gagaacctcg cagacttgct ctgcatccat cagaccaaat 900 acgagtcatt gggcaccgcc atccagcctg gaaatgccca ggccagggtc gccgaggtcg 960 acgaaacccc ggcacaggct cacaacgctg aggccaagcc ctgcaagcag agcccagagg 1020 aagagacaac agacctgcct acgatgcgag caacgcggtc attcgcgaca acagtgtaaa 1080 gccactgtca tcctcccgcc ccccagcgaa tatcgcaaac gcgtgttcac agcggggagc 1140 caggtcttga aggctccact tattggggac gcacaggttc agacggaggc tactgacgaa 1200 ggtgcacgag tgacactaga ccgcgaagga ggccagatct atttggccga tggcatacca 1260 ctgaggatgg tcaagaaccg caaacgccgg ctgttcgaat tccaagggga aacctggaaa 1320 gaacatgccc tggtagcgtc cccggacctc ttcatgggag tcgaggaaaa ttttgcaagc 1380 gcagaaagaa aaccaaagct cacagcaacg gagctgtggc atgaaagact cggccacccg 1440 ggtcgcgaca agacgaggaa actcctgcag ctcctcaaga acgaacgaga catcaagtta 1500 gatgccgaga ccccgactaa ctgcgccggg tgcatccagg caaagtccac gaaggcacgc 1560 catggcgccg gcagcgggga acgcgccgat gcccccttgg atcttgtcca catcgacttg 1620 attatcgatt cttctcggga aaccgagttc acttgcacgt tggtcgccgt ggatgactac 1680 agcaagtatg tgtacgtcca gccgttagtt agtaaggcgg acgcgttcct cgagctacag 1740 cgcatggtct ccttcctgga actccagaca ggcagacaac tcaaggcact caggtccgat 1800 cagggctctg aatggacaaa agctgaggcc cgcatctggg ccagggacaa gggcattctc 1860 tggcaaatga cggttagcta cgactcaaag caaaacggcc gcgtcgagcg gatgaatcgt 1920 tcactccagg agaagatgcg agccctcctc atccagagaa aactcccggc tcgattctgg 1980 ccatatgccc ttcggaccgc ggcgttcctg atcaacctta ctccaaacgt cgatgacaaa 2040 atcccttacg cgcagatctt cagcaagacg cctactcgtt tcatcaggtt acttcgcgtt 2100 tttggctgcc tcgcctgggt aaacatcccc aaggcaaaga ggaatcgcca aaagattgac 2160 gagcgctccg tacctgcgat tttcatcggg tacagtttgg aacgcaaggg ttggctattc 2220 tatagcccga 
actattcccc aaacatcttt tggagcaact cggccaaatt catggaagtc 2280 caatcttggg ccgaacgcac agcatggcga ccgatcgata taagtccacc ccacgcgccc 2340 acccacgagg aagacgtgcc cgactttgga tacaccgaaa tcgaccaatt cgatgagacg 2400 gccaacgcgc ctatcgacga ctttgttgat atcgatgcca tccctgacga tgcggacggc 2460 atgttgcccc caccggagac gactcaggag gcagcgccgg tatcttccga tggcgcccag 2520 gcgtcgcggg aagtaacgcc ggtgtcctcc gttggcgcca ctgacggcac cgacactcgc 2580 gacgtcctgc agcctgccgg accgtcggtc gatctcccgt tccgagacaa cgaagccatt 2640 ctcagcgaaa agtacgcagc tattcagaat tcgaaaaatg aatggggcca catgatcaac 2700 ttcttcgcac ctcagttaac ggagtccatt caggatggtt ttcctgagcg tccaactgac 2760 gcccaaccgg acgacgcaga ccctgagacc tggcctcgcc accttcgccg cgcgccgctt 2820 tcagtcaacc tccgcacggg ccatgctctg gcaatcacca cttcgccagc tgtcaatcta 2880 tcccccacgc taaaccaagc gaagagaggc gaagactggc ccctctggca agaagccatg 2940 cgcagcgaaa ttggcggcct cgaggcaaat gacacttgga ttgttgctga ccttccccca 3000 gggcaaaacc ttgtcgactc aaaatgggtc ctcaagatca agaccgacgc aaaccgggta 3060 ccgactcgct acaaagcacg actcgtcgcc cgcggcttca cccaacgcga aggcgttgac 3120 ttcgacgaga ttttcgcccc agtggcgccg ctcgaggcga tccgtggcat tctggccgtt 3180 gctactgttt tcgattggga ggtggacggt atcgacgtcg tacaagcgta ccttaactct 3240 accctccatc acgacgtcta catgaagcca ccggccggag tcaacgtccc cccgggccaa 3300 gtcttaagac tggtcaaagg cctatacgga ctgaaacagt cgggccgaga atggaaccta 3360 gtcctcgatg cccacctccg caagatcggt tttcacgccg tgccgagtgc accttgtttc 3420 tactcgcgcg gcgagggggg cagcagaacg atcttgacat gctatgtcga tgacatcctc 3480 atcacatctc ccagcagagc cgaagttgac cgaacaaagg cagagattgc tgaaaaatgg 3540 aacatagagg acaacggccc agtcaaggaa ttcctcggga tcaagatcga ccgtgaccgc 3600 aaaaatcgac acttatccct tgaccagtcg gcttacatca aggagatggc caataactgg 3660 ttggccaacg acgtacgcaa aacagcaaag agtcctttcg caaagtactt tgaggtcgga 3720 ccagacatgg cctgcgacgc gaaacaggcg aagaagtacc aagagctcgt tgggcaacta 3780 ctttggatct caaacacagt ccgaccagac gtcgccttcg ccgtcggaac actggcgagg 3840 tatatgtccg acccgatcga acccgcctgg ctcgcagcac ttcaagtagt tcgatacctc 3900 aacttcacag cggattatcg 
gctcatcctt ggaccatatc gaaacatgga acaacccgtg 3960 gtcacacata ctgatgcgaa ctgggcatcc gaccggaaca ctcaacgtcg aagcacttcg 4020 gggttcaatg acattcattt atggatgccc agtcagctgg aaatcccacg tccaaaaaat 4080 gcgtcgcctt gtcggcagtc gaagccgagc tcgtagctgc ctccgaagca gcaagagaga 4140 acatgttctt tgcccaccta cttcgggact tgcggttgtg gtgaggtcaa accactcctc 4200 cctcaccgat agtctgggct gctgcctaag taagtaagga tcccgccaaa cactggaaac 4260 tcaagcacat cgacacccga tacttctttg ttcgaaacgc tgttcaagac ggtgacattg 4320 ctatcgagca cattgggacg gcagaaaacg cggccgacat tctcaccaaa ccgttccaac 4380 ccgaacccct ccgcaaggct gttaacaggc taggattggt acgaccattg agggggggag 4440 t 4441 // ID Gypsy-1_GDe-LTR repbase; DNA; FNG; 304 BP. XX AC AEFC01000931; XX DT 12-MAR-2011 (Rel. 16.03, Created) DT 12-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Geomyces destructans genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_GDe_; KW Gypsy-1_GDe-I; Gypsy-1_GDe-LTR. XX OS Geomyces destructans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Leotiomycetes incertae sedis; Myxotrichaceae; OC mitosporic Myxotrichaceae; Geomyces. XX RN [1] RP 1-304 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Geomyces destructans genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AEFC01000931; Positions 3746 4049. XX SQ Sequence 304 BP; 63 A; 76 C; 101 G; 64 T; 0 other; tgtcacgggt aggttaagag gaggtcagta ggaaggggga gctggttgcg gtcccgcatc 60 ggagacgtcc cgctacggtg cctctagcgg cctaaggctg ctaggggggc cttagggcct 120 taggctgcta ttaggctgtt gtcgtggctc cgactcgcaa ccgagacggc gggggcccga 180 ctccggggga gtagataaag tagctgagcc ccctactagg aagggctcag agattagatt 240 gttgaatacg atactggcta ctcgaaccta cggtcttagc acgctgtcta cctaaaccct 300 taca 304 // ID Gypsy-26_LBS-I repbase; DNA; FNG; 5564 BP. XX AC ABFE01003004; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 
16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_LBS_; KW Gypsy-26_LBS-LTR; Gypsy-26_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-5564 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01003004; Positions 8363 2800. XX CC Positions [4293-4781] - Integrase core CC 'CGATT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 603..5435 FT /product="Gypsy-26_LBS-I_1p" FT /translation="MSSMNLATVARHPGHKVAPVLSDGVTTPGALLDWENA FT CEDFFGACKEPIADDKKVSKVTGGLQNNRISIYVRNNRTRLHALKFPEFMT FT ELRETFLPADWSKDTHRKILAARMSLDQSFYNFCTDLVSSNNLLDGNPLHL FT SEQRLREQIFNNITEDLREKLEDSATELTALNALPFQKWMNAICDTDVKMA FT KAIKRNMKRYALELEKEEKKKRILSGPSRAVNASSTGATDNRRQYTQNASY FT KPRNLPPLTEEERALIYEHKGCLKCRRLYADHIGRDCNNDFPDTHVAITPA FT LAAAAKAEWEKHRPYKGKPLNRGPSASTSAPAVAAIMEESSEVSGDEDDDN FT DEGHSGIVAMTYPSAVALGDSSSDDSVSPPLTIPNLYWNATAPGRDGLFVP FT VKAMIDSGAHIVLIRPDVVDALGLEKKRLRKPQVINVAMKDDKKKKKIEPV FT LLSHYVSLSLSTLDNSWTSKPVLAVIAPGLCTKVLLGLPFLVHNRIIVDHE FT LPSAIVKETSIDLLNFVPTPRKIARPVKSPRRKRIEVRLLHRDLLKELKWR FT CAMIKKKLDRMMDIEHDDQEFVAVNFIAAVKGKLESLELQQKYNALEEKLK FT TEFKETFEPIPHVHNLPDDVYCRIKLKDATKTISKRTYGCPRKYREAWKTL FT IDKHVESGKIRLSNSEFASPSFIVPKSDPTALPRWVNDYRELNSNVVIDNH FT PLPRVDDILADCAKGKIWAVIDMTDSFFQTRVHPDDIHLTAVTTPFGLYEW FT TVMPMGYRNAPSIHQRRVTAALRKYLGKICHIYLDDIIVWSQDIVEHEKNV FT RLILQALIDAKLYCNKKKTRLFAFRVNFLGHTISQDGIEADEKKAEKIENW FT PVPTTATEARAFLGLVRYLNAFLPKLAVQSDVLSVFTTKEAEKKFPEWLPK FT HQLAFDTIKAIVTSRECLTSINHENMGQNKIFVTTDASDRVSGAVLSFGPT FT WESARPVAYDSMTFKGPELNYPVHEKELLAIMRALRRWKVDLLGSEFLVYT FT DHKTLLNFDRQKDMSRRQLRWMEELSIYNCKFVYVKGEDNSVADALSRLPY FT KFVEKCEQPNAEADAEYPFSYHPEKPITVFAPASEPIMCGIVAALVEAAPK FT 
NSFKVTIDDNVLTKLKASYKDDPWCKKLLRASQGLPNVQQKDELWYIGDRL FT VVPKDSGLREIIYRLAHDNLGHFGFHKCYDNIRESYFWPNMRRDLEEKYIP FT GCQECQRNKAPTTKPVGPLHPLRVPDGRCESIAMDFIGPLPEDEGYDCILT FT ITDRLGSDIQIIPTRTDITAEELAVIFIDNWFCENGLPLEIISDRDKLFVS FT RFWKSLNKLSGVNLKMSTTNHPETDGSSERTNKTVNQCLRFHVERNQRGWK FT SALPRIRFFLMNTVNKSTGYSPFQLKYGRSPRIFPHFDTSTPTDKNDLDAL FT EIVERIQKNVADAKDNLMLAKISQAYFANIHRGPEIEYKMGDKVMLSTENR FT RREFKEKPGDGRVAKFMPRFDGPWEITEAHPETSNYKLNLPHTTNIFPTFH FT ASQLKPFVPNDNDIFPSRKRPDIPDPILVDGVLENYVDRILDFKRINRKPS FT YLVRWVGFGPEDDEWLPASMLEDNEALDRWIDFGGTSHFTSKPT" XX SQ Sequence 5564 BP; 1648 A; 1325 C; 1182 G; 1409 T; 0 other; ctttttttta aacatcatcc aaaccgtgtt tcgctctctc gtcctagtct tcactggcga 60 caaaccaagg tcgatacgaa gtcagtttcg caaagcgaac acgcgattgt cgcgaacttg 120 tgtcgtactc tagtcctctt cctgtctagg cgtcttttct gacgcatata ctcgtcttcc 180 tgtccaggcg tctttttcca agcgcatata ctcgtgttca gctccagtcc agtctcacct 240 ttgtctaaac tctcagatca cgcgcatata ctcgccttgt ctcgcatata ctccagctaa 300 acatctcttc aacactggac aaccggcgtg cgacgagaaa gattgtggcc caacaaggct 360 tactcctcga accacctctt ctcagtgcta ctcgtcctcg cagaagatcc atacccgcca 420 aatcattctc cccctccccc ccttccttgt catcacctct cctcttgaat agatcatcga 480 ctcttcttaa caataaacca cgactgcaac ccacaccaga ttcaccagac tccacaacca 540 gttcttccac gtgcagctcg cttcttgatt ttgctcttgt cactctttcg catcaaagca 600 gtatgtcttc catgaatctc gcaacagtcg cacgtcatcc cggtcacaag gtggcgcctg 660 ttttgtcaga cggcgttaca acaccaggcg cgcttctcga ctgggagaat gcatgcgaag 720 atttctttgg cgcgtgcaag gaaccgatcg ccgacgacaa gaaagtctct aaggtcactg 780 gcggtcttca gaacaaccgc atcagcatat atgtcagaaa taatcgcacc cgtttacacg 840 ctctcaagtt tccggaattc atgactgaat tacgcgaaac ttttttaccg gcagactggt 900 ctaaggatac gcaccgtaaa atcttggcgg cacgcatgtc acttgatcag tccttctata 960 acttttgcac ggatctagtc tcttccaaca atcttctcga tggcaaccct cttcacttgt 1020 ctgagcaacg tcttcgagaa caaattttca acaacatcac tgaggacctt cgcgagaaat 1080 tggaagattc cgcaaccgaa ctcaccgcgt tgaacgcact cccattccag aaatggatga 1140 acgctatctg tgacacggac gtaaagatgg cgaaagccat caaacgaaac 
atgaagcgtt 1200 acgcgttaga actcgagaaa gaagagaaga agaagaggat cctctcggga ccctctcgtg 1260 ccgtgaatgc atcctccacc ggcgccacag ataatcgacg ccagtacacg cagaatgcat 1320 cgtataaacc acgcaatctg cctcccctta cagaagagga aagagctctc atttacgaac 1380 acaagggatg tctaaaatgt cgtcgcttgt atgcggacca cattggccga gattgtaaca 1440 atgatttccc ggacacccac gttgcgatca cacctgcctt agccgccgca gcgaaagcag 1500 aatgggagaa acatcgtcca tacaagggaa agccactcaa tcgaggacct tcggcaagca 1560 ccagcgcacc cgccgttgca gctatcatgg aggaatcaag cgaagtctct ggtgacgagg 1620 atgacgataa cgatgaagga cattccggaa tcgtcgcgat gacttatcct tctgccgtgg 1680 cgcttggaga ctcgagctcg gatgatagtg tgagtccgcc tttaacaatt ccgaatcttt 1740 attggaacgc caccgcccct gggcgtgatg gtctctttgt accagtcaag gcgatgattg 1800 acagcggcgc tcacattgtc cttattaggc ctgatgtcgt cgacgcgctc ggtttagaaa 1860 agaaacggct tcggaaacct caagttatca acgttgcaat gaaagacgat aagaagaaga 1920 agaaaatcga acctgtactt ttatcccatt atgtctcgct ctcgctctcg actttagaca 1980 attcttggac ctcaaaacct gttctcgcag ttattgcccc tggcctttgc acaaaagttc 2040 tcttagggtt acccttcctt gtacacaacc gcatcattgt agatcatgaa ttgccaagcg 2100 ctattgttaa agagacttct atagacttgt tgaattttgt accaacacct cgtaagattg 2160 cacgaccagt aaaatcaccg cgccgaaaac gaattgaagt cagacttctt catcgggatt 2220 tacttaagga attaaaatgg cgatgtgcaa tgataaagaa gaaattagat agaatgatgg 2280 acatagaaca cgatgaccaa gaatttgttg cggtgaattt tatcgcagca gtgaaaggga 2340 aattagagtc attggaactc caacagaaat acaatgcact cgaagagaaa ttaaaaacag 2400 aattcaaaga aacctttgaa ccgatcccac acgttcacaa tctgcccgat gatgtgtatt 2460 gtcgaataaa attgaaagac gccacaaaaa caatcagtaa acgtacatat ggatgtccgc 2520 ggaaatatcg tgaagcttgg aaaacactca ttgataaaca cgtggaatca ggaaaaatca 2580 gactatctaa ctcagaattc gcatcacctt ctttcattgt tccgaaatct gatccaacgg 2640 ctttaccgcg ttgggtgaat gattatagag aacttaattc aaatgttgtt atcgacaacc 2700 acccactgcc tcgtgtggat gacattcttg cagattgcgc aaaaggaaaa atttgggcgg 2760 taattgacat gacagattct tttttccaga caagagttca cccggatgat attcacttga 2820 cagcggttac aaccccattt ggcctgtatg aatggactgt tatgcccatg ggctatcgga 
2880 acgccccatc tatacaccaa cgccgtgtaa ccgccgcgct tcggaaatat ctgggcaaga 2940 tctgccacat ctatcttgat gacattatcg tctggtcaca agatatcgtc gaacatgaaa 3000 agaacgtccg tttaattcta caagctttaa tagacgccaa actgtactgc aataaaaaga 3060 agacccgttt atttgccttc agagtcaact tcctgggcca tactatttct caggacggaa 3120 ttgaagcaga cgagaagaaa gctgagaaaa tagagaattg gccagtacct acaacagcca 3180 cagaggcacg cgcgtttcta ggcctcgtcc gttatttgaa tgcctttcta cccaaattgg 3240 ctgtgcagag tgacgttttg tcagtgttta caacgaaaga agcggaaaag aaatttccgg 3300 aatggctacc taagcaccag ttagctttcg atacaataaa ggcaatagta acttctcgtg 3360 aatgtttaac atcaatcaac cacgagaaca tgggacagaa caagatcttt gtaacaacag 3420 acgcgagtga tagggtgtca ggcgccgtcc tatcgttcgg tcccacttgg gaatcagcac 3480 gtcctgtagc atatgattcc atgactttta aaggccccga attgaactat cctgtgcacg 3540 aaaaggagtt actagcaatt atgcgggcct tgagaagatg gaaagttgac ttactagggt 3600 ccgagtttct agtatacaca gatcataaga ctctcctaaa tttcgatagg caaaaggaca 3660 tgtcccgaag acagttgcgt tggatggaag aactctccat ttataactgc aaattcgtgt 3720 atgtaaaagg tgaagataat tctgtcgctg atgcgttatc aaggttgcca tacaaatttg 3780 tcgagaaatg cgagcaacca aacgcagaag cggacgccga atacccattt tcataccacc 3840 cagaaaaacc aatcacagtc tttgcgcctg catctgaacc aattatgtgt gggatagtag 3900 cagcattggt cgaggctgca ccaaagaatt catttaaagt aaccatagac gacaatgttc 3960 tcaccaaact taaagcaagt tataaggacg atccctggtg taaaaagtta ttgcgagcaa 4020 gtcaagggct accaaatgtg caacagaagg acgagctttg gtatatcgga gatagattag 4080 tagtaccaaa agactcagga ctacgggaaa taatttaccg attagcacat gataaccttg 4140 gacatttcgg atttcacaaa tgttatgaca acattcgcga atcgtatttc tggccaaata 4200 tgcgaaggga tctcgaagaa aaatacattc caggttgcca ggaatgtcag aggaataaag 4260 ctccaacgac gaaacctgtc ggtccattac atccgttaag agttcccgat ggaagatgtg 4320 aatctattgc aatggatttc ataggcccgc ttccggaaga cgaagggtat gactgtatat 4380 tgacgataac agatagactg ggttctgata ttcagataat ccctacaagg acggatatca 4440 ccgcagagga actagccgtt atttttattg ataactggtt ttgcgaaaat gggttaccat 4500 tagagataat ttcggatagg gacaaacttt tcgtttccag attttggaaa tcattgaata 4560 
aattgagtgg tgtcaacctg aagatgtcaa ctacgaatca tccggaaaca gatggtagta 4620 gtgaacgcac taataaaaca gtcaaccagt gtttacgttt tcatgtggaa cgaaaccaaa 4680 gaggctggaa aagcgcgtta cctcgaattc gtttttttct gatgaacacc gttaataaat 4740 caacagggta ttcacctttc cagttaaaat atggacgctc tcccagaata tttccgcatt 4800 tcgacacttc gacaccaact gataagaatg acctggatgc attggagata gtggaaagaa 4860 ttcaaaagaa tgtagcagac gcaaaggaca acttgatgtt ggcaaagatt tctcaagctt 4920 attttgcgaa cattcaccga ggtccggaaa tcgagtacaa aatgggggac aaggtcatgt 4980 tatccacgga aaataggcga cgtgaattta aggaaaaacc aggcgatggt agagtcgcca 5040 aatttatgcc aagatttgac ggaccttggg aaataacaga agcgcatcca gaaacttcaa 5100 attataaatt gaacctgcct cacactacta acattttccc taccttccat gcatctcaac 5160 tcaaaccttt cgttccaaac gacaacgaca tatttccttc gcgcaaaagg cccgacatcc 5220 cagatccgat tctggtcgac ggagtgctcg aaaattacgt tgatcgtatc ctcgacttca 5280 agagaattaa cagaaaacca tcataccttg tccgttgggt gggttttggt cctgaagatg 5340 acgagtggct ccctgcatca atgctcgaag acaacgaagc cctggatcgt tggatcgact 5400 ttggaggaac ttcgcatttt acttctaaac caacctgacg gtagcttttt cccacagggt 5460 tttttaacgc acccagtcag ttttacttac tttagcagaa tttttctctc tctcctctct 5520 ccgttgatga ttgatggcga taattttttg tggaggggga gggg 5564 // ID Gypsy-1_SPDB-LTR repbase; DNA; FNG; 572 BP. XX AC ACOE01000078; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Spizellomyces punctatus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_SPDB_; KW Gypsy-1_SPDB-I; Gypsy-1_SPDB-LTR. XX OS Spizellomyces punctatus OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; OC Spizellomycetales; Spizellomycetaceae; Spizellomyces. XX RN [1] RP 1-572 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Spizellomyces punctatus genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; ACOE01000078; Positions 77128 77699. 
XX SQ Sequence 572 BP; 173 A; 156 C; 119 G; 124 T; 0 other; tgtagcgaac gactctgata aaccggttcg ctcaaaggca tcaggtgacc acgattgagg 60 tttccgcaat tgtaacctaa gcttcgaaac ccctaagatc aagctcggtt ctgggctaca 120 ccagcgcagc atgaaggata cgttacgcag ccatgatcga tcatggcgat ggtccgagat 180 gggaagatcg gacgccgcca ccctaatgaa ggctacggaa actagtgtag acattaccgt 240 agcatattct tgtataaaaa gaggaattca ataaaataag tcggtggcca gaagccgtag 300 cgacactaaa tccccacaac tagcttctca actccaatct taggttaaca ggaacctaac 360 agagcgcttc aatggcctga tcacccacac tgaagcctct aaatcctgct cggttcacaa 420 ctaactgact ctccttgcct ctcgccctca agtcgctacc gatgcccaaa gaaaggattt 480 ggagactacg attcctagta cggacaccca cacaacctgc taggagaccc aagttgtctt 540 taccatagtc taccaacaaa aaatctgtta ca 572 // ID Ginger2-1_MG repbase; DNA; FNG; 2035 BP. XX AC . XX DT 03-FEB-2010 (Rel. 15.01, Created) DT 03-FEB-2010 (Rel. 15.01, Last updated, Version 1) XX DE Ginger2 DNA transposon from Malassezia globosa. XX KW Ginger2/TDD; DNA transposon; Transposable Element; Ginger; KW Ginger2; integrase; Ginger2-1_MG. XX OS Malassezia globosa OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina; OC Exobasidiomycetes; Malasseziales; Malasseziaceae; Malassezia. XX RN [1] RP 1-2035 RA Bao W., Kapitonov V.V. and Jurka J.; RT "Ginger DNA transposons in eukaryotes and their evolutionary RT relationships with long terminal repeat retrotransposons."; RL Mobile DNA 1, 3-3 (2010). XX DR [1] (Consensus) XX CC The 5'-end and 3'-end are not determined. 
XX FH Key Location/Qualifiers FT CDS 160..1863 FT /product="Ginger2-1_MG_1p" FT /translation="MGYQASLHEGLEHASSQAHVPIQEGFTNEHRNLFDEW FT LMSNTFKSVVIREQYLMRLHVLHALANGHNSRQLLKDGVCSLSFIQKTRQN FT YSLNERGELYYHARKGKATLLVIPDDEVFDVIVREHNAIHHQGTSKTWYEI FT SRRYHGIPKRAVDWVLRHCLLCHDHRPGPRPAPTQKISSHEVMERVQMDLI FT DMRREPDRKYRWILHIKDHYSRFCMLYPLRRRRGKHVVRCFLQWIATMGPP FT MILQTDNGVEFVNALITQVAVQHQILIRRGQSGRPRSQGLIERANGHVRNL FT IAKWCNRFGENRWSRSLPSIALACNTSVHSSTRRTPFEMVFGRAPSWQRLA FT RLDPLDPTLTNMELGLEYEQQQSQSSVDTNGPPTPSPVRSVVPPMALSPPS FT ETLPSPSPPPSPSPSPSSSSSQRISSDDPSALVSQSRKTTSVPIVNQEPIP FT REVYFRRGTRVSVSLTGVDRKPLDVYRLPALILSHRKNKGYRVCTAWGILS FT NRVPARHVLPLPEGHPFPARAFRYLQPGRRTPRVDVSFCMQQLRLEEDTVS FT ATTAADNDATTDTDIANVTNI" XX SQ Sequence 2035 BP; 526 A; 509 C; 451 G; 549 T; 0 other; ttcgaaagtg aacatagtgt ccaaatcttg aattactttt caccttaatc atgaagtcgt 60 cttaacaata ttcactggca ttcctaactc taagtgttta gaatatgcag caaactttgc 120 tggtcaatct tttctcccgt cgattatatt cggtggaaca tgggttatca agcctctctt 180 cacgaagggt tggagcatgc ctcttcacag gctcatgtac caattcagga gggatttacc 240 aacgagcatc gaaacctgtt tgacgagtgg cttatgagca atactttcaa gagcgttgtt 300 attcgcgaac aatacttgat gagacttcat gtactgcatg cacttgccaa tggccacaac 360 tcgcgacagc ttttgaaaga cggcgtatgt tcgctgtcat ttatccagaa gacgcgtcag 420 aactactctc tgaatgagcg tggtgagctc tactaccatg caagaaaagg aaaagcgaca 480 cttcttgtaa ttcctgatga cgaagttttt gatgtcattg ttcgtgaaca caatgccatt 540 caccatcaag ggacaagtaa gacatggtat gaaatctccc ggagatatca tgggatccca 600 aagcgggcag tcgattgggt gcttcgtcat tgtttgctat gccatgatca tcgcccgggg 660 cctcgacccg caccaacgca aaaaatttca tcgcatgagg taatggaacg tgttcagatg 720 gaccttattg acatgcgtag agaaccagat cgcaagtatc gctggattct tcacataaaa 780 gaccattact ctcgtttttg catgctgtac cctttgcgtc gtcgacgcgg aaaacatgtg 840 gtgcgctgct ttttgcagtg gattgctact atgggcccac ccatgatatt gcaaactgac 900 aatggcgtag aatttgtaaa tgccctcatc actcaagtgg ctgtgcaaca tcaaattctt 960 atcagacgtg gccagtcggg tagaccgcgc agtcaaggcc tcatcgagcg tgccaatggt 1020 cacgttcgga atctaatcgc caaatggtgt 
aatcgatttg gtgaaaatcg gtggtctcgt 1080 tcattgccat cgattgctct ggcgtgtaat acttcagtgc actccagcac ccgaagaaca 1140 ccgttcgaga tggtatttgg tcgtgcccca agttggcagc gacttgctcg attagaccca 1200 ctcgatccaa ctcttacgaa tatggagctc ggattggaat atgaacaaca acaatcccaa 1260 agttcggttg ataccaacgg gccaccaact ccgagtccag tccgctcagt tgtgccaccg 1320 atggcattat ctcctcctag tgaaactctc ccttctccat ctcctcctcc atctccatcg 1380 ccatcaccgt cttcatcatc atctcaaagg atatcgtccg atgatccttc cgcactggtg 1440 tcgcaatcgc ggaaaacaac ttctgtaccg attgtaaacc aagaacctat tccacgtgag 1500 gtctacttcc gacgaggtac acgagtcagt gtaagtctca ctggcgttga tcggaagcct 1560 ttagatgttt atcgacttcc cgctctaatc ttgagccatc gaaaaaataa aggctaccgg 1620 gtgtgcactg catggggaat tttgtcgaac cgtgtacctg cgcgtcatgt attgccacta 1680 ccagaaggac atccttttcc tgctcgtgca tttaggtatt tacagccagg caggcgaaca 1740 ccgcgtgttg atgtgtcttt ttgcatgcaa caacttcgtc tggaagaaga cacggtctcc 1800 gccactactg ctgctgataa cgatgcaaca acagacactg acatcgctaa tgtaaccaac 1860 atttgatatt tcctctaatg atatttgtga ctaatcattc tacctcgtct acaattaggc 1920 agcagccgcc aggggcaaat ggactgttgt agctccggtt cctgagcgca caggtcggcc 1980 aacaacaatg ttacttgatg gtgaagtaag gtcgtccatc tcttggaata aagac 2035 // ID Gypsy-60_MLP-I repbase; DNA; FNG; 5959 BP. XX AC . XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-60_MLP_; KW Gypsy-60_MLP-LTR; Gypsy-60_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5959 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR [1] (Consensus) XX CC Positions [4901-5380] - Integrase core CC 'GATTC' target site duplication CC LTRs are 98% similar to each other. 
CC It included an insertion of DNA3-1_MLP at positions 4006-4388 CC (masked by x). The insertion interrupts continuity between the CC second and the third ORF. XX FH Key Location/Qualifiers FT CDS 370..1413 FT /product="Gypsy-60_MLP-I_1p" FT /translation="MDGSMEQPPNPADPLAEVLARLANLDLRLAEESRLRQ FT NAERELAELRAATSASTQPSPVNVQSPPPQPVYQAAVAKTPKVATPDKFDG FT ARGSKAKIIANQVGLYILMNPTQFPDDRTKIGWTLSYMTGKGGEWAKPFTK FT KLMSNEANDLTWDGFSQSFEATFFDSERVAKAEQGIRALKQTNTVLAYSLA FT FNNLALIVKWPEPVLITQFQQGLKREIQVQMVRDKFEKLEQISELAIKIDN FT ILHKRSDDTSLNDVKESSVDPNAMDCSAYRFNISNEEYQRRWDSELCFKCG FT KSGHRARECGGGFRGRWRGKFRGGRGAKVSTTELNVEDSGKESGRAEESKN FT GVARG" FT CDS 4388..5860 FT /product="Gypsy-60_MLP-I_3p" FT /translation="XSWLIDKSDINEWFEDCEEVADYGESDYKEAEEVFEL FT EEAEVPQVWDDTLIIEEIRNKSKEDERLMRIMENCLKSPDHSYEDHALVDG FT VLYFKGLIEVPDNEDIKLKIMKSRHDSVLAGHPGRLKSLQLVKRQYHWPSM FT RVFVNAYVDGCHSCQRVKPRTTKPFGSLQPLPIPAGPWTDICYDLITDLPI FT SEGKDCILTVVNRLTKMCHFIPCTTTMTSEELAKLMIKYVWKYHGTPKSIT FT SDRGNVFISKITKELNKQLGIVTQASTAYHPQTDGQSEVANKAIEQYLRHF FT VTYKQDNWANLIDMAEFSYNNSPHTSTGISPFKANYDKKTQESPDWEVGSK FT VWLDSRHISTTRPSAKFAHKWLGPFEISHRISKNAYKLILPLSMKRVHPVF FT SVGLLRPYKASVISGQLQPPPAPIIVEEEEEFEVSDIINKRKRGSKIEYLV FT SWKGYGPEEDSWEPAENLGNAQDVVDRFNIKYPRAETKYKRTRRVR" FT CDS 2457..4006 FT /product="Gypsy-60_MLP-I_2p" FT /translation="NISARLASQDQETKPKLSAADLVPKEYHDYLEMFEKS FT KSNVLPPHRPYDFCVDLIPGATPQASRAIPLSPAENEVLEEMLAEGLSNGT FT IRRTTSPWAAPVLFTGKKDGKLRPCFDYRKLNALTVKNKYPLPLTMELVNS FT LLNADRYTALDMRNGYNNLRVREGDEAKLAFICKAGQFEPRTMPFGPTGAP FT GYFQYFVQDIFKDRIGRDMAAFLDDILIYTKPGEDHEKAVKDALDTLRRQN FT VWLKPEKCKFGQKEISYLGLRLSYNKISMDKSKVEAVTAWPTPKNVGDIQQ FT FIGFANFYRRFIADFSQIARPLHDLTQKDVTFEWTTAREKAFQDLKEAFTT FT APVLKIADPYKQFILECDCSDYALGAVLSQVSEDDNELHPVAFLSRSLIQA FT ERNYEIFDKELLAVVASFKEWRHYLEGNPHRLNVVVYTDHKNLESLMTTKE FT LTRRQARWAEILGCFDFEIRFRPGKKSAKPDALSRRPDLKPDEECKLTFGQ FT LLKPSNLPSDTFISELEAAEX" XX SQ Sequence 5959 BP; 1850 A; 1179 C; 1273 G; 1274 T; 383 other; tattgaagca tcttaaatca catacgacat caggagaagt aagaagaacg aatatcagaa 60 
gttaagaagt taaattaaag aaagttgaaa caaagactca agaaagttaa aattagaaaa 120 ttaaagttag aaggaagaag attagaacct taaatcccct caaactccga ccagatctga 180 ccagattaaa gggattcaat tagatattcg aagatcatca gaaacaatta acccgaacaa 240 accttattca acctaatcta gttcacaaca cccaacttta ctatcccggg aagtcgatcc 300 cccagcccta ctccaacaca taccaccggc ttcgagacag ctgtatcacc ggaggctgaa 360 gaaggaatta tggatggatc aatggaacag ccaccgaacc cagcagatcc attggcggaa 420 gtcctagcta gattagcgaa cctggatctg agattagcgg aagagtctcg actacgtcag 480 aatgctgaaa gagagttagc ggaactacgt gcagcaactt cagcgtcgac tcaacctagc 540 cctgtcaacg ttcaaagccc ccctccgcag cccgtttatc aagctgctgt cgctaaaaca 600 ccaaaagtag cgacacccga caagtttgac ggtgcgaggg gtagcaaagc caaaataatt 660 gcaaatcaag ttgggttgta catattaatg aacccgaccc aattccctga cgaccgtact 720 aaaataggat ggacattgtc ctatatgacg ggcaaaggtg gcgagtgggc aaagccgttt 780 acgaagaagt tgatgagtaa cgaagcaaat gatctcacct gggacggatt tagtcaatct 840 tttgaggcca cattttttga ttccgaacgt gtagcgaaag ctgaacaagg aattcgagcc 900 ttgaaacaaa ccaacaccgt cttagcgtac tcactggcat tcaataatct tgctctcatc 960 gttaaatggc ccgaacccgt attgattact caatttcaac agggtttgaa gcgggagata 1020 caggtgcaaa tggttagaga taaatttgaa aaattagagc aaatatcaga actagcaatt 1080 aagatagaca atattttaca caaaagatca gatgatacta gtttaaatga tgttaaagaa 1140 tcatcagtag atcccaatgc tatggattgt tcggcttaca gatttaatat ttcaaatgaa 1200 gaataccagc gtagatggga tagtgaactc tgtttcaagt gtggaaaatc aggtcataga 1260 gcaagagagt gtggtggggg ttttagagga agatggagag gaaaatttag gggaggaaga 1320 ggagcaaaag ttagtactac tgaattaaat gttgaagata gtgggaagga aagtggaaga 1380 gctgaagagt caaaaaatgg cgtagctcgg ggatgaaggt tgttcctacc ccgggcttaa 1440 ggcatgatga gttagttata gatttaggag ccattgaaat agatattcaa acaattgcaa 1500 tgaaagaaaa tcgtattttt gccactatcc ctataattga cccatcccaa gaagcaacct 1560 attttgcaag agccatgttt gactcaggag ccacacatga tgtcatgaat gaagcctttg 1620 ttagacgcac caacatatcg accaccaagt tagatcaacc caagccagtc accggcttca 1680 acggatctca atcctttatc accgaggtag gacatcacat agtcagcatt aatggaagaa 1740 gaacgccctc gaccttttta 
atatcaccca tcaaggactc catcgactgc attattggaa 1800 ttgagtggat atgcgcacat cataaattaa ttgattggaa gaaaaggaca ttgttgaagg 1860 aaaatgaaat tgcggtcgtc gagacgacct tgacatcacc gaaaaaccct ccggggattg 1920 ttcaactgga acacctggag aaagctagga atgatggcaa gggggtgtgt atcccagtgg 1980 atacattaac acccccgcaa tgtgagtttg atagcatttt ttcaactgtt ttcaaagaaa 2040 aaagaagcaa tcaggatcct ctcccagatt ttagatccaa acagaagaac gacatacccg 2100 acgaaacaga cgtaccagag cccatcgaac cccaagttgc agcctctatg gaggcttcgt 2160 cgaatccgaa aaaatccctg gacgaaccca tagtggagaa ccaagggaaa gctaggaaaa 2220 acgacgaggg ggtgtgtatc caaacagata cgctaacacc cccgcaacgg aagttcgata 2280 gttcttcaaa gattaaagta aaagaagcgg ctagcaagca gtgtttttcc agacagttca 2340 gccaccagca cacgaatccg agcaacttga ccatgaccac cgctaaagga cgaacttacc 2400 tgcctcgacc caagatccta gcaacccctc atctagacgc agcaaaagcc tcatgaaaca 2460 tttcggcacg actggcgtct caggatcaag aaacgaagcc caaattatca gctgctgatt 2520 tagtaccaaa ggagtatcac gactacctgg aaatgtttga aaaatccaaa tctaatgtcc 2580 taccaccaca cagaccatac gatttctgtg tggacttgat cccgggagca acacctcagg 2640 ccagtcgagc tataccactg tcacctgcag agaacgaggt gttggaagag atgttagcag 2700 aaggtttaag caatggtaca atccgacgca ccacatctcc ttgggccgca cctgtactat 2760 ttacgggcaa gaaagatggg aagttaaggc catgttttga ctacagaaaa ctaaacgcac 2820 tcactgttaa gaacaaatat ccgttacctt tgaccatgga actggtcaac agcctactca 2880 acgctgatag atacactgca ttggatatga ggaatgggta taataattta agggttagag 2940 aaggagacga ggctaagtta gcgttcatct gcaaggctgg tcaatttgaa ccacgtacca 3000 tgccgtttgg acccacagga gcgccgggtt acttccagta cttcgtacag gacatcttca 3060 aggaccgtat tggtagagac atggctgcat tcttagacga catactaatc tacactaagc 3120 caggagaaga tcacgagaaa gcggttaagg atgcattgga tactttaagg cgtcaaaacg 3180 tatggcttaa acctgaaaaa tgtaaatttg gtcaaaagga aatatcttac ctaggtctaa 3240 gattgtctta taacaagata tccatggata agtctaaagt cgaggccgta acagcgtggc 3300 caactcccaa gaacgtgggt gacatacagc agtttattgg ttttgcgaac ttctatcgaa 3360 gattcatagc cgatttctct caaattgcaa gacctttaca cgacttaaca cagaaggacg 3420 tcacatttga atggacaacc gcgagggaaa 
aagcttttca ggacttgaaa gaagcattca 3480 ctaccgcccc ggtactcaaa atagccgacc cgtataaaca atttatactg gagtgcgact 3540 gttcggacta cgcattggga gccgtacttt cacaggtctc agaagacgat aatgagttac 3600 acccagtggc ctttctatct cgttctctca tacaagcaga aagaaattat gagatttttg 3660 acaaggagct actggctgtc gtcgcgtcat tcaaagaatg gcgacattac cttgagggaa 3720 atccacaccg actaaatgtt gtggtttaca ccgaccacaa gaatctggaa tcactgatga 3780 ccaccaaaga gttaacccgt agacaagcgc gctgggctga gatcctggga tgtttcgatt 3840 tcgaaatacg gttccgtccc ggaaagaaat cagcgaaacc tgacgcgttg tctcggagac 3900 ccgacttgaa gcctgatgaa gagtgtaaac tgacctttgg ccaattattg aagccaagta 3960 acctgcctag cgacactttc atctcggaac tagaagcggc ggagaxxxxx xxxxxxxxxx 4020 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 4080 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 4140 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 4200 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 4260 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 4320 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 4380 xxxxxxxxag agttggttga ttgacaaaag tgacataaat gaatggtttg aggattgtga 4440 ggaggttgca gattatggag aatcagatta taaagaagct gaagaagttt tcgaactgga 4500 agaagcagaa gtacctcaag tttgggatga caccttaatc attgaggaaa tcagaaataa 4560 atcaaaggaa gatgaacgtc ttatgcggat aatggaaaac tgcttgaaga gtccagatca 4620 ttcatacgag gaccatgcgt tagtggacgg agtgctctat tttaagggcc tgatagaagt 4680 accggacaat gaagacatta agctcaagat tatgaaatca agacatgata gtgtcttagc 4740 aggccatcca ggccgtttaa agtccctaca attggtgaag aggcagtatc actggccgtc 4800 gatgagagta ttcgtgaatg cgtacgtcga cgggtgtcat tcctgtcaga gagtgaagcc 4860 aaggacgact aaaccttttg gaagcctaca accactgccc ataccagctg gtccctggac 4920 tgatatctgt tacgatctta ttactgacct acccatatct gaagggaaag attgtatcct 4980 tacagttgtc aaccgcttga ctaagatgtg ccatttcatt ccttgcacaa caacaatgac 5040 ttccgaagag ctagcaaaac taatgatcaa atacgtatgg aaataccacg ggaccccgaa 5100 gtctattaca tcggataggg gtaacgtttt catatctaag 
ataacaaagg aactgaataa 5160 acagttaggc atagtgacac aggcttcgac ggcgtatcat cctcaaacgg acggccaatc 5220 cgaggtagca aataaggcta tcgaacagta ccttcgtcac tttgttacgt ataagcaaga 5280 taactgggct aacctgattg acatggcaga attttcatac aataacagtc cgcacacatc 5340 aacaggcata tcaccgttta aggcaaacta cgataagaaa acacaagagt cccccgattg 5400 ggaagtagga tcaaaagttt ggctggattc gagacacatc tcaacaacgc gaccaagtgc 5460 aaagtttgca cataaatggc taggaccttt cgaaatttct catcgaatat caaaaaatgc 5520 ctataaactg attctacctc tttcaatgaa gcgagtacat ccagtgttct ctgtaggcct 5580 actacgacca tacaaagcaa gtgtcatcag tgggcaactt caacctccac cggcaccaat 5640 catagttgaa gaagaagaag aattcgaagt tagtgatata atcaacaaga ggaaaagagg 5700 gtcaaaaatt gaatatctag tcagctggaa gggatatggc ccagaggagg actcatggga 5760 accggccgag aatttaggta atgcgcaaga tgtagtggat aggtttaaca tcaaataccc 5820 aagagctgag actaaataca aaaggacacg gagagtacgt tgagggcaat gctttttccc 5880 cacgtgggtt ttttaacgct agcccgggga aagacgtcag ctcagcaaga gggagcggga 5940 gacgtaaagg gggaggtag 5959 // ID Gypsy-121_MLP-I repbase; DNA; FNG; 5710 BP. XX AC AECX01000855; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-121_MLP_; KW Gypsy-121_MLP-LTR; Gypsy-121_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5710 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000855; Positions 3301 9010. XX CC Positions [4510-4989] - Integrase core CC 'ATCCT' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 371..1522 FT /product="Gypsy-121_MLP-I_3p" FT /translation="MSSVNLEEIMKKLEELSVKLAEETTLRQKAEMELNLM FT RQQQPAPMDQEQPKASIHQQPASLPVAQPAKPKPPKIATPDKYDGSKGPKA FT EIFMNQLGLYIQMNHESFSNDQARVAFALSYTTGKANMWGQVFTDQLLDTA FT TVHLVTWKLFVKSFKATFFDSERIPKAEDNIRALKQTKSVTEYWIRFSELS FT LVVKWPESVLISQFKQGLKREITVHMVRDEFEKVEDAAKIAIKIDNNINKR FT HQDYSLATHDQHTPSTTTPDPDAMDCLAYKLNISGDEYRRRGQNGACYSCG FT KTGHLIADCLMKKRGRSGYSGGYSNRYSNSYRNGGNVGGRGGYSNSSNSRI FT DELETQLKARLDEIDVQLGKSDKGKGVEGRSDQSKNGDAQD" FT CDS join(1588..2820,2824..4389) FT /product="Gypsy-121_MLP-I_1p" FT /translation="MHDTRIIDIIQLFDPKTATTKPARALVDSGATHEAIS FT RKYVSDVLFETAELPQKRSVTGFSGHESIVTHTGDYCVNNNQVETTFIITT FT LRDKYDVILGMPWIRQNHHLIDWENSKLKSTEESKIATVSTVSSTPKKSLQ FT DHELRLGGKARFSDEGVQALCLLSPPQCEFDASISKNVNELAVDQDLPLEL FT SSKEPGIMDTLQSLNPKTTPTDHVMRPQGNARNIEMGVELQKNSLKPPQSE FT SLNFLNPMSKKNVSKQFPFWKRSVNTGSMLRRNLSKMVTHVPTQAMIDAAE FT SSWNVSTRLAVEASKGKVERTAQELVPECYHDYLEMFEKSKSNVLPPHRPY FT DFRVDLVPNATPQAGRIIPLSPKETEVLHEMLEKGLANGTLRRTTSPWAAP FT VLFTGKKDGNLRPCFDYKLNSLTIKNKYPLPLTMELVDSLLDADEYTSLDM FT RNGYNNLRVQEGDEAKLAFICKEGQFEPLTMPFEPTGAPGFFQFFIQDILR FT TRIGRDVAAYQDDILIYTKPGVDHKKVVKEVLDILKAQNVWLKPEKCKFSQ FT KEIAYLGLIISRNQIKMDETKVKAVREWPAPKNLSEVLTFLGFSNFYRQFI FT HHFSKIAKPLHKLSQADVKFDWTEERNNAFEALKHAFTTAPVLSIADPYRP FT FVLECDCSDFALGAVLSQVSESDNVLHPVAYLSRSLIKAERNYEVFDKELL FT AVVSAFKEWRHYLEGNPNRLSVIVYTDHKNLESLMTTKELTRRQARWAEIM FT GSFDFEIRFRPGKQSTKPDALSRRPDLKPADGDKLTFGRLLKPENLPADAF FT IDALDLVDSWFVNEESINIDIDEINGLGEIEDIREDEEDKEDGVWTDARII FT QEIKKNSLDDDRVKHITELCQEMPNSKIISDYAVIDDVLYFRDRIVVPDNN FT NIKLQILKSRHSSLMAGHPGRMRTLMLVK" XX SQ Sequence 5710 BP; 1971 A; 1136 C; 1222 G; 1381 T; 0 other; tattgcaacg tctacaatct agtagaacaa ggaggattcc gaagatcaat caataagaag 60 aagaaactca aagttttaat tcagattgaa aagtttaaga gaaagtttaa agtaaattaa 120 agttagcaag atctcagatc tcaaagttta gaagtaaatt aaagtttaaa gtttcgaaga 180 caaacccaag tgatcacgta atcatattca atcctctgac caaatcattt accaattgaa 240 tacttctcgc gaattctctg cagtcattca accggaacat 
cgtttcaaaa cgccgaacta 300 tacattaccc cgcagctcta ccccgagtga agaggaagat aaattcgtcg acgcacctga 360 aagcaaaacc atgtcgagtg taaatttaga agagattatg aagaagttag aggagttgag 420 tgtgaaacta gctgaagaga cgacgctccg tcaaaaagct gagatggagt taaatctgat 480 gagacaacaa caacccgctc ctatggatca agagcaacca aaggctagta tacaccaaca 540 gcctgcgagc ctccctgttg ctcaaccggc taaaccaaaa cctcctaaga tagctacacc 600 tgataaatat gatgggtcca aaggaccgaa agcagagatc tttatgaatc aattaggact 660 ctacatacaa atgaatcacg aatcattcag taacgatcag gcaagggtcg catttgcatt 720 gtcttacaca acaggcaagg caaatatgtg gggtcaggtg ttcacagatc aactattaga 780 taccgctaca gttcatttag tcacttggaa attgtttgtc aaatctttta aggcaacatt 840 ttttgattca gaacgaattc ctaaggctga agacaacatc cgtgcattaa agcaaaccaa 900 gtctgttacc gagtattgga tccgattctc agagttgtct ttagttgtga aatggcctga 960 atccgtctta atatcccaat ttaaacaggg tctcaaaaga gaaataacgg tacacatggt 1020 cagagatgag ttcgaaaagg tggaggatgc tgctaagata gctatcaaaa tcgacaacaa 1080 catcaacaaa agacatcaag attattcatt agcaactcat gatcaacata ctccttcaac 1140 gactacaccg gaccctgacg caatggattg cttggcatac aaactcaaca tatctggaga 1200 cgagtaccgc cgtagaggac aaaatggagc ttgttattca tgtggtaaaa cgggtcactt 1260 gatagcggat tgtttgatga agaagagggg aaggagtggg tactcaggtg gttacagtaa 1320 taggtattca aatagttata ggaatggagg aaatgtagga ggaagaggtg gatactcaaa 1380 tagtagtaac tcaagaattg atgagttaga aacccaattg aaagcgcgct tagatgaaat 1440 agatgttcag ttaggcaaaa gtgacaaagg aaagggggtt gaaggcagat ctgatcaatc 1500 aaaaaatgga gatgctcaag actgaaggtt gtgcctccct caagcgtaaa cttaggtgtt 1560 gaagaaaatg ttattgctag cttagaaatg catgatacaa gaattattga tattattcaa 1620 ctctttgacc ctaagactgc cacaacaaaa cctgcccgtg cccttgtaga cagcggagcc 1680 acccacgaag caatcagcag gaagtatgtc tctgatgtcc ttttcgaaac tgctgaacta 1740 cctcaaaaaa gaagtgttac agggttcagc ggtcatgaat ccattgtaac acacaccggc 1800 gattattgtg taaacaataa tcaagttgaa acaactttca tcatcactac cttacgtgac 1860 aagtatgatg tgatcctcgg tatgccttgg atccgtcaaa atcatcattt gatcgactgg 1920 gaaaactcaa agttgaaaag cactgaagaa tccaaaattg caactgtttc tacagtttcg 
1980 tccacaccga aaaaatcctt gcaagaccac gaattgaggc ttggagggaa agctaggttt 2040 agtgacgagg gggtgcaagc tttatgctta ttatcacccc cgcaatgtga gttcgatgct 2100 tctatttcta aaaatgtgaa tgaattggct gtcgatcagg atctcccttt agaactgtca 2160 tccaaggaac ccggcataat ggacactctg caatctctaa atccaaaaac aaccccgact 2220 gaccacgtta tgaggcctca ggggaacgct aggaacattg agatgggggt tgagcttcaa 2280 aagaactcgt taaaaccccc gcagagtgag tcacttaatt ttcttaaccc tatgagtaaa 2340 aagaatgtta gcaagcagtt tcccttttgg aaaagatccg taaacacagg ttcaatgcta 2400 cgacgaaatt taagcaagat ggtgacacac gtaccaacac aggctatgat tgatgcagcc 2460 gaatcttcgt ggaatgtctc tacaagatta gcggtcgaag cgtcgaaggg aaaagtcgaa 2520 cggactgctc aagaattagt gcctgaatgt tatcatgact acctcgaaat gtttgaaaag 2580 tcaaagtcaa atgttttacc acctcatcga ccatatgact ttagagttga tttagtccca 2640 aacgccacac ctcaagcagg caggattatt ccgttgtcac caaaagaaac cgaagtttta 2700 cacgagatgc tagagaaggg ccttgcgaat ggaacattgc gccgaacaac ttccccgtgg 2760 gcggcgccag tgctctttac aggcaagaaa gacggaaatt tgcgcccgtg ttttgattat 2820 tgaaagctga actcattaac tatcaaaaac aaatatccat tacccctcac gatggaacta 2880 gtggatagcc tattagatgc tgatgaatac acaagtttag acatgagaaa tggctataat 2940 aacttaagag tacaagaagg tgatgaagcc aagctagctt ttatttgtaa agagggtcaa 3000 tttgaacctc tgacaatgcc gttcgagcca actggagccc caggcttctt tcaattcttc 3060 attcaagata ttctaagaac tcggatagga agggacgtgg cagcatatca agatgacata 3120 ttaatttata caaaacctgg agtggaccac aagaaagtag tcaaagaagt tctagatatc 3180 ttgaaggctc aaaacgtatg gctcaaaccg gaaaaatgca agttctctca aaaagaaatt 3240 gcatacttag gtctcattat ctcacgaaat caaattaaaa tggatgaaac caaagttaaa 3300 gcggttcgag aatggccagc accaaagaat ttatcagaag tgctcacgtt cttaggcttc 3360 tctaatttct ataggcaatt tatacatcac ttctcaaaga tagcaaagcc attacacaaa 3420 ctctcacaag ctgatgttaa attcgattgg acggaagaga ggaataacgc atttgaagca 3480 ctgaagcacg cctttacaac agcaccagtc ttaagcatcg ctgatccgta ccggcctttc 3540 gtcttagagt gtgattgctc ggattttgcc ctaggggcag tcctttcgca agtatctgaa 3600 tccgacaacg tgcttcaccc ggtggcatac ttatcaaggt ctttaattaa agccgagagg 3660 
aattatgaag tctttgacaa ggagttattg gcagtagtgt cagccttcaa ggagtggaga 3720 cactaccttg aagggaaccc aaataggcta agtgtcatag tatataccga tcacaaaaac 3780 cttgaatcgc tgatgaccac taaagagcta actagacggc aggcaaggtg ggcggaaatt 3840 atgggtagtt tcgatttcga aatccggttt cgcccaggga aacaatccac taagccagac 3900 gcactctctc gacgtcctga tctgaaacca gcagatggcg acaaactgac attcggacga 3960 cttctaaaac ctgagaacct tccagccgat gctttcattg acgctttgga cctagtggac 4020 tcgtggttcg tgaacgagga atctatcaac atcgacatcg acgaaatcaa tgggctagga 4080 gaaattgaag acataaggga agatgaagaa gataaagagg atggagtatg gactgacgca 4140 cgcatcatac aagagatcaa aaaaaattcg cttgacgacg accgtgtaaa gcacataact 4200 gaactgtgtc aagaaatgcc aaattcaaag attatctctg attatgctgt tattgatgat 4260 gtgctctatt ttcgagatag gattgtagtc cccgacaaca acaacatcaa gttacaaatc 4320 ctcaaatcac gacacagcag cttaatggcc ggtcatccag gaagaatgag aactctaatg 4380 cttgtaaaat gaatgtttca ttggccatca atgaagatgt atatcaacaa gtatgttgaa 4440 ggttgtcagt catgtcaaag agtaaaatca agaaccagca aaccttttgg atcactgcag 4500 ccgctaccaa tcccgttagg tccttggact gatgtgtgct atgatatgat tactgaccta 4560 ccagattcag gaggatgtga cagcatacta acagtagtcg acagatttag caagatggct 4620 cattttttac cgtgtaagaa atcgatgagc tctgaagaac tagcaaaagt catgctacag 4680 aatatctgga agattcacgg cactcccaaa tcaatcacat cagatcgtgg gaacattttt 4740 atatcaagat taacgaaaga aatgaataaa ttattaggca tcaagactca agcatcaacc 4800 gcctatcacc cacaaacgga cgggcagtca gagatcacga acaaggcagt cgaacagtat 4860 atacggcact tcacatccta taaacaagac aactggcaag aattattacc attagctgag 4920 ttttcgtata ataataatct ccatgtgtca atagggatgt caccgttcaa agccaattac 4980 ggcttcgatg ttagtctgac aggaacaatc aggactgatc agtgtctacc agcggttcaa 5040 gaatcaatca cccaaatacg agaagtacaa gaagacctaa gatatgcaat gaaataagct 5100 caagacgaaa tgaaacaaca attcaataag aaagtgcttg cgacgccgaa atgggacaaa 5160 ggagattatg tgtggctcaa cagcaaacac atttcaacta caagacctac ggctaagtta 5220 tctcatagat ggctgggccc gtaccagatt gtagaacaga tttcaactaa tgcttacaaa 5280 ttgtcattac ctagggaaat gaagaatgta caccctgtat ttcatgtcaa actactcaga 5340 aaattcaaca 
agagtgaaat ccctggacaa atagtagaag aaccagaacc ggtaatgatt 5400 caagagcaag aagaatttga agtagaagaa attctgaaca agaggaagag aagaggtaaa 5460 actgaatatt taataagttg gaaaggatac gaatcaaatc acgactcatg ggaaccagaa 5520 ggaaatttag acaatgcaaa ggaattagta aatgttttca acacaaaata tcctcaagca 5580 gaaagtaatt atttcaggac acggagaaga taagagaggg tgaggctttt ttcccaagtg 5640 gtttttaatg ccaacccgtg ggaagatacc taacccatca agagggggtg gaggtataaa 5700 ggggggatga 5710 // ID Gypsy-3_PPM-I repbase; DNA; FNG; 6926 BP. XX AC ABWF01002000; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_PPM_; KW Gypsy-3_PPM-LTR; Gypsy-3_PPM-I. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-6926 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01002000; Positions 17032 23957. XX CC Positions [5756-6232] - Integrase core CC 'CAGCGG' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 153..2678 FT /product="Gypsy-3_PPM-I_2p" FT /translation="MTASQPSSPSRSRSNTNINPSTDTSILTLQNAQRLLT FT ARMTTVAPVPMPAPGSRRAPKFRGKGLEAFLVDFEKLAKRAGLKDGDLPTE FT VLSYCSTSVRELLRRNAKFKGKDWEDAKKAMRLFYRDKSEEQVTVVGLRDF FT AERMRRKNKVNTSRALDEYAIAFGKRMGDLVAQGQMPDKERDVLFFRGLGK FT NIRMAIRPELRQLKGKKPLVLDPPTMEEVLKEAQDHFNVLDIDYDPESEDD FT ETGSDSEDSEGESSGNESNSSGVSDSKGKDKKRGGKVVRVKTVKSRAPTPD FT SGALALESVARLSEQVRQLTLTMGQAQGASSMVNNGLTNVPPSFSEGGRFC FT WGCGKKEGIDLDHALNIKRCPQTLDLMREGLIRFSQETGRLVRADGSQLPP FT SNGMPGGFAAILRREQWERKAKDRDRDAPPHQQGKAANVFAMGLMRNDEDV FT LRGNVFAASCEEVYSFPAQTRAQSKAMGKGSNTDEIQKSVRFEFEHVGGKS FT GAKSTEPMVTVREPAIPTRLTPAREKESSEADVQPHQANSETGWRERERKK FT REAQREARQQDGGDERPKFGKSPHFRFTSDIQDGVSLDGVRDKILDTPVSL FT PLREVLGMAPELQKRFAALTRTRKEMTVRSVELQVDDKQEGTDEPGTAMVA FT YSTAEELFGLLERYAGAVAVGSRRYYAMACGVLEGKFGSEQVMFLVDSGSE FT LNLITRRVWEQSGVEMDSDGSRWSLKGINGDPVPLLGCCRDASVEIDGWHF FT DHHFFVSTREFGNYDGILGQPWLQWHSTQVEYNRGGPVNLLIYPTGDRTDK FT PIKLKILGPAHQRNSDKLIMDEPDKQRVVSSVEEVSDEGF" FT CDS 2765..6874 FT /product="Gypsy-3_PPM-I_1p" FT /translation="MVGEVSIVSIDDVCRLVQYDPPNSFDSILRTTLSISS FT VGSVPIPSPWVKRVMSNCVASVSVGRRYKPVDQKVRPVPTYMPNPSAQKFK FT PIDVSNLPPLPTNPPSRFEFRPTERLTQDRLDAIIDTVPEGFLTSAELDLM FT VYIVDNCQQAIAWTDSERGTFSREYFPDYEMPVIEHTPWVRAPIPVPRAIE FT GKVREMLRKQIEAGKYEYSSASYRSSTFTVEKKSGDLRNVHNLKDLNGVTI FT RDSALPPRPDKFAESFVGYVIYLLADLLSGYDARVLAETSRDLTTFHSLVG FT PLRLTCLPQGYTNSMQEFQRCTTHVLGNMCPDKAKVFVDDMAAHGPSSRYN FT EETIPENDDIRRFVWEYAHVVYELFSRLERSGITASGTKIVLATPALDVLG FT AIASEEGLQLHHGLVNKVLKWTYPTNVSEVRGFLGTAGVGRKWIKGFSIIA FT RPLTVLTRKSEDEFIFNEEAKRAMDELKRLVTEVPVRKSIDYELGRLITQG FT TRASNHGLVVLAVDSSIYGAGWVLYQYVHVDKHPVLFGSCTFNPTESRYSQ FT PKIELYGVFRAIKEVRHRIWGLHFRLEVDAKYLKQMIYEPDLPNAPMTRWV FT SYIQLFDFELEHVPANKHAAADGLSRRPRSADDSDESDGEQELERLLGHGW FT YAIGSEIRHSLFLQLLPDLKMSGGLMGFDAEHGRYNSSIGCRMVVEMGEFE FT SARADRISRRSAAARNASAEVRVFDINVVGGGYAAWSVATWGMEDTKADEV FT ASSMLCTLLRMEDDVSYIGNEFQVRKVPRERTDRFLFAGELVTLEYTSWRP FT AWATSERKQGIELEKAWERERNAVQSSFVRRGDTMVYARAVSSRYLYEDLP FT 
APASLRDMRCVTHEGRVRDRDDPAMFEEIRKYLQEGELPERCANDSRVRGT FT FVRLARSFILNDRRLWRVSKGELPKLVVLDVERRREIIAEAHNDCGHRGRD FT PTFRKVADRFWWPNQYDEIAFFVRSCNACQYRAKSRSLQPLSITISPSILR FT RFVLDTIHMPQGSNGHRYLLHASDDVSRWPEAVSSRKNNAETWAKFIWKVL FT CRFGCIPVFVCDGGPEFKGAARTILLRHGVSVILSSPYHPEGNGIAERDGQ FT TLMRAVMRSCGKRTKDWPLFLEAGLLAVRTTTSRATGYTPYFLLYGKQCLF FT PFDLTDRTWSILEWDKVRTTEDLLTMRLQQIARKDELVEDAVKHLIKSRRR FT SADDYNKKHARSMTEAFEPGMWVLVHETWLDNQHGNKGALRWAGPYVIQER FT HPSGSYAIRELDGVVLKEAVAASRLKLFYYRNDHQVMMSGLSSDWRDRFPP FT GLPSFLFTSSTTAIVDFARNDELEELSYTEDHPTGVTRRSRSNLRELLRTQ FT KEVEPWW" XX SQ Sequence 6926 BP; 1775 A; 1464 C; 2092 G; 1595 T; 0 other; caagtggtga ccaccgcagg gggtaagtaa agtggttttc cgtaaaggtt gacattctgg 60 ctagatcgtc aataacctcg attccgattc caacggcgca aggcggacgc aggcgcttgg 120 acgttcaacc atcggatcgc caaacacggt cgatgactgc atcgcagccg agttctccaa 180 gtcgctcacg ttcaaataca aatatcaatc cttccacaga cacgagcatt ctcactctac 240 aaaacgcgca acggttatta acagcaagaa tgacaacagt agcacctgta ccgatgccgg 300 caccagggtc caggcgcgcg cccaagtttc gaggtaaggg gctggaggca ttcttagttg 360 actttgaaaa attagcgaaa cgggccgggc tcaaagatgg agacttgccg acggaggttt 420 tgtcgtattg ttctacgtcg gtccgagaac tacttcgacg gaatgcaaaa ttcaaaggta 480 aggactggga ggacgctaag aaggctatgc gtctgtttta cagggataag agcgaagagc 540 aggtgacagt ggtagggttg cgagactttg ctgaacgaat gcgtaggaag aataaggtaa 600 acacgagccg agccttggac gaatatgcga tagcatttgg taagcggatg ggggacctag 660 ttgcgcaagg ccagatgccc gacaaagaac gggatgtttt attctttagg ggcctgggaa 720 agaacattcg catggctatt cgcccggaac taaggcagct taaaggtaag aaaccccttg 780 ttctggatcc tccaaccatg gaggaggtgt tgaaggaagc gcaagaccac ttcaacgttc 840 tggatatcga ctatgacccg gagtcagaag atgatgagac ggggagcgat tcggaagata 900 gtgaaggtga gtcgtcgggt aatgaatcga actcatcggg tgtatcggat agcaaaggga 960 aggacaagaa aaggggtggg aaggtagtgc gagtgaagac tgtgaagtca agggcgccta 1020 cgccagattc gggtgcgctc gcactagaaa gtgtggcccg gttgtcagaa caggtgcgcc 1080 agctgacctt gacaatggga caagcgcaag gtgcgtcgag tatggtgaac aacgggctga 1140 caaatgtgcc gcccagcttt tcagaaggcg gaaggttctg 
ttgggggtgt ggtaagaaag 1200 aaggtataga cttggatcat gcgctgaaca tcaagagatg cccgcaaacg ttggatttaa 1260 tgcgcgaggg tttgatcagg ttctcacagg aaacggggcg actcgttaga gccgacggat 1320 cacagttgcc tccatcgaac gggatgcctg ggggtttcgc agccattttg cgccgggaac 1380 aatgggaacg caaagcgaag gatcgggacc gtgacgcacc tccgcatcag caaggtaaag 1440 cagcgaacgt gtttgccatg gggttgatgc gcaatgatga ggatgtgttg aggggtaacg 1500 tgtttgcagc ttcatgtgag gaagtgtatt ccttcccggc tcaaacaaga gcgcagtcta 1560 aagcgatggg aaaggggtcg aacactgacg agatacagaa atcagtcagg tttgagtttg 1620 aacatgtcgg tgggaagtcc ggagcaaagt cgacagaacc aatggtgacg gtgcgtgaac 1680 ccgcaattcc gacaaggctg acccctgcga gagaaaagga gtcctcggag gcggatgtgc 1740 aaccacatca ggcgaactcc gaaacaggat ggcgtgaacg agagcggaag aaacgtgaag 1800 ctcaacgcga ggctcgtcag caggacgggg gagatgagag gccaaagttt ggtaagtccc 1860 ctcatttccg attcacgtct gacatacagg atggggtatc gctcgatggg gtccgtgata 1920 agatccttga cactccggta tccctgccat tgcgtgaggt attggggatg gctccggagt 1980 tacagaaacg gtttgctgca ctgacgagaa cgaggaaaga gatgacagtg cgaagtgtag 2040 agttgcaggt cgacgacaaa caagagggta cagacgaacc agggacagcc atggttgcat 2100 attcaacggc cgaagaactg tttgggttgt tggagcgcta tgcgggcgca gttgcagtgg 2160 ggtcgcgccg atactatgca atggcatgtg gcgtcttaga agggaagttt ggatcggaac 2220 aggtgatgtt ccttgttgat tcggggtcgg aattgaattt gatcacgcgg cgggtctggg 2280 agcaatctgg tgtcgagatg gattcggacg ggtctcgctg gtccctgaag gggattaatg 2340 gtgatcctgt gccattactc gggtgctgtc gtgacgcatc ggtcgagatt gatgggtggc 2400 attttgacca tcatttcttc gtaagtacga gagagtttgg taactacgac ggtatcctag 2460 gacagccttg gctccaatgg cattcgacgc aggtcgagta caatcggggg ggaccagtta 2520 atttattaat ctatcctaca ggagaccgta cagacaagcc aatcaagctg aagattttag 2580 ggcctgcgca tcaacgaaat tcagacaagc tcatcatgga cgagccggac aagcagagag 2640 tagtgtcgtc agtcgaggag gtttctgacg agggtttcta gatctggacg tgcatttaaa 2700 accgcgccct ttcacgaggg gtacgtccag acatattccg cggataggaa gaacattata 2760 cgcaatggtg ggtgaagtat cgattgtttc gattgacgat gtttgcaggc tagtacagta 2820 tgatcctccg aattcatttg actctattct acgcacaact ctttctattt 
catcagtagg 2880 ttcggtacca atcccaagtc catgggtcaa gcgggttatg agcaattgcg tggcttcggt 2940 atctgtaggg aggcggtaca aacccgtcga tcagaaagta cgtccggtac ctacgtatat 3000 gcctaatcct tcagcacaga aattcaagcc gatcgacgtg tcgaatttac ctcctcttcc 3060 tacgaatccg ccttcaaggt ttgagttcag accaacggaa cgacttactc aggatagatt 3120 ggacgcgatt attgataccg tgccagaagg gttcttgaca tcagctgagt tggatctgat 3180 ggtttacata gttgataatt gtcagcaggc gattgcatgg acagattctg aacgaggtac 3240 gttctctcgt gaatacttcc ctgattatga aatgcccgtt atagaacaca cgccgtgggt 3300 acgcgcgcca atacctgtgc caagggccat tgaaggaaag gtacgtgaga tgcttagaaa 3360 gcaaattgaa gcaggaaaat atgagtattc ctcggcgtcg tatcgatcgt caacgtttac 3420 ggttgaaaag aagagtggtg acttacgcaa cgtgcacaac ttgaaggatt tgaatggggt 3480 aacgatacgg gattctgcat tacctccgcg accagataaa tttgcagaaa gctttgtggg 3540 ttatgtcatt tacctattag ctgaccttct atcaggatat gatgcgcggg ttcttgcgga 3600 aacatcaaga gatttgacta catttcattc gctggtcggt ccgttaaggc tcacatgttt 3660 gccacaggga tataccaatt cgatgcaaga attccagcgc tgtacaacac atgttttggg 3720 aaacatgtgc cccgacaaag cgaaggtatt tgtggatgat atggcggcac atgggccgag 3780 ctcgaggtat aatgaagaaa caattccaga gaatgacgac atccgtagat tcgtttggga 3840 atatgctcac gtcgtctatg aattgttttc taggttagaa cgctctggga taacggcatc 3900 gggtacgaag atcgtcttag caacgcctgc gcttgatgtt ctaggtgcta ttgcctccga 3960 agagggtctt cagttgcatc atgggctggt caataaggta ctgaagtgga cgtatccgac 4020 taatgtgtcg gaggtgcggg gctttctagg aacggctggt gttggacgca aatggatcaa 4080 aggtttctca ataattgcac gaccactgac agtgcttacc aggaagtcag aagatgagtt 4140 catattcaac gaggaagcga aacgcgcaat ggacgagctc aaacggttgg tgactgaagt 4200 gccggtacgc aaatcgattg attatgaact aggcagacta attacacagg ggacgcgagc 4260 atcgaatcat ggacttgtgg ttctagcagt agactcctcg atatatgggg ctgggtgggt 4320 gctttatcag tacgtgcacg ttgacaagca tccagttctc tttgggtcgt gtacattcaa 4380 tccgaccgaa tcgcgctatt cgcagccgaa gatagaactc tatggcgtgt ttcgggcgat 4440 taaggaggta cgtcatagga tatggggctt acatttccgt ttggaggtag acgccaagta 4500 tttgaaacaa atgatatatg aaccggactt accaaatgcg ccgatgactc gatgggtgtc 
4560 atacattcaa ctcttcgatt ttgaactaga acatgttccg gcgaacaaac atgctgcagc 4620 ggatggcttg tcaaggcgtc cccggtcggc tgatgattca gacgagtcag atggggaaca 4680 agagctagaa cgtttattgg gtcatggatg gtacgcgata gggagcgaaa tccgccactc 4740 actctttcta cagttgttac cagaccttaa gatgagcggg ggtttaatgg ggtttgacgc 4800 tgaacatggt cggtataaca gctcgattgg atgcaggatg gtggtcgaga tgggtgagtt 4860 tgaaagcgca agggccgaca gaatcagcag acgcagcgca gcagcgagga atgcatcagc 4920 ggaggtaagg gttttcgata ttaacgtggt aggtgggggt tatgcagcat ggtcggttgc 4980 gacttgggga atggaagaca cgaaagcaga tgaggtggcc tcctcgatgt tgtgcactct 5040 gttgcgcatg gaagacgacg tgtcgtatat tgggaatgag ttccaggtgc gtaaagttcc 5100 ccgtgagcgg actgatcgat ttctgtttgc aggggagtta gtgacacttg aatacacaag 5160 ctggcgacct gcctgggcga cgtcagagag aaagcagggg atagaattgg agaaagcctg 5220 ggaacgggag cggaatgcag ttcagagctc gttcgttagg aggggggata cgatggtata 5280 tgcgcgtgcg gtgagtagtc ggtaccttta tgaagactta ccggcgccgg cttccttaag 5340 ggatatgcga tgtgtgacgc acgaggggcg tgtacgggac cgcgacgacc cggcaatgtt 5400 tgaggagatt aggaagtatt tgcaggaagg ggaacttcct gagaggtgcg caaacgattc 5460 tcgagtacgc ggcacattcg tgcgactggc gcggtcattc atattgaatg atcgccgtct 5520 ctggcgggtc tccaaaggag aactgccaaa actcgtggta ctggacgtgg aaaggcggcg 5580 ggaaattata gccgaggccc acaacgactg tggtcatcgc ggtcgtgacc ctaccttccg 5640 aaaggttgca gataggttct ggtggccgaa tcaatacgat gaaatcgcgt tctttgtgcg 5700 gtcttgtaat gcgtgccagt acagggcaaa gtctcgttcg ctgcagccat tgtcgatcac 5760 gatctcgccc tcgatcctgc ggcgttttgt gttggacaca atacacatgc cgcagggttc 5820 gaatgggcac agataccttc tgcatgcctc ggatgacgta tcgaggtggc cggaggcggt 5880 ctcttcccgc aagaataatg ctgaaacgtg ggcaaagttc atatggaagg ttttgtgccg 5940 gttcggatgc attccagtgt ttgtgtgtga cggaggacca gagtttaagg gagcggcgcg 6000 cacgattttg ttgaggcatg gcgtatccgt tatattgagc tcaccgtatc atccggaggg 6060 caatgggatc gcagagcggg atggtcaaac cttaatgcgg gcggtcatgc gatcttgtgg 6120 caagcggacg aaggattggc cgctgttcct tgaagcaggg ctattggcgg tgaggacaac 6180 gacgtcgaga gccacagggt acacacccta tttcttgcta tatggcaaac aatgtctgtt 6240 
tccttttgat ctgacagata ggacttggtc cattctggag tgggacaagg tgcgaacgac 6300 agaagattta ctcacgatgc ggttgcagca gatcgctcgc aaggacgagc tggtcgaaga 6360 cgcggtcaaa catctcataa agtctcgacg acgatccgca gacgattata acaagaagca 6420 tgcgcggtca atgaccgagg ctttcgagcc tgggatgtgg gtgctcgtgc atgagacctg 6480 gttggacaat cagcatggaa ataagggagc tcttcgatgg gctggtccgt acgtgattca 6540 ggagcgacac ccgtcggggt cgtatgcaat acgagagttg gatggagtcg tgctgaaaga 6600 ggcggtcgcc gctagtcggt taaagctttt ctattacagg aacgaccatc aagtcatgat 6660 gtcggggtta tcaagtgact ggcgcgaccg attccctccc ggcttgccta gctttctgtt 6720 cacttcgtcg acaacagcaa tcgtcgattt tgcgagaaac gacgagctgg aagaactctc 6780 gtacacagaa gaccatccca cgggcgtaac aaggaggagt cgatccaatt tacgagaact 6840 cttacgcacg caaaaggaag tcgagccatg gtggtagtcg aagtggcgtg cggatcttga 6900 ggtcaagatt aaaagttcgc ccctct 6926 // ID Gypsy-91_MLP-LTR repbase; DNA; FNG; 418 BP. XX AC AECX01000209; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-91_MLP_; KW Gypsy-91_MLP-I; Gypsy-91_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-418 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000209; Positions 29158 29575. 
XX SQ Sequence 418 BP; 89 A; 115 C; 70 G; 144 T; 0 other; tgtgaaacct ccggatccgt ttaacaaagc ctcgccggat ccgggtttcc tgtagtggtg 60 aaccatcaga tggttcacgc gacgcgcatg ttttatttac ttacttccat atatattata 120 tggatatctt ctcgatatcc ttttccatat cttgtttcct tatttgttcg ccgcttccgg 180 cctttatctt gtgcgctctt atcttgtagc ctagaatgag atttcactgc gaaagtctcg 240 cctataaaag gcggaccttt cttcgctgca attcaatccc cagcttgctc tctactgatt 300 caccgcacag tgccttttgc ctaatcactt ccttataatc ctcgaatccg ctattctcgc 360 attataagcc tttacccgaa ttagactctc ggtacttctg aaccctagta gattcaca 418 // ID Copia-50_MLP-I repbase; DNA; FNG; 4585 BP. XX AC AECX01002655; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-50_MLP_; KW Copia-50_MLP-LTR; Copia-50_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4585 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01002655; Positions 528 5112. XX CC Positions [1538-2053] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(86..2626,2630..4315) FT /product="Copia-50_MLP-I_1p" FT /translation="MSDGQTTTSSAPATPSDFTIQQDHTHITQVMSNHTST FT SEAMHIAPLEHDWQRWSPLMLAHFMEGDLDGIVDGTEEEPAEDATAAIKLD FT YQRRRKKAAGFISRKLSPENHALVINTANIKDPKAIWDALVKLYASTKARN FT RARILRKFLNLKCSDNALSSFLTEYRKITHEMTEVSFKIDDDILAHMLLYK FT LSPRYHSTHDLLAHTAETADTLLTLDQVFEHLQQIVLDYQPSSVPPPAALV FT AERRTPAYPRCLNGVHNPLTSHTEEQCYQAHPELKPIRNINRSNNRSSANA FT TISGTVLTTYVFNATLTGKPVLDSGASQSMFSSRLCFTDYRPCHAAIHVAN FT GQTINAVGTGTVSGSHDGKAISFTNWLHVPELSTNLVSMVALVRKGCTFNF FT TPDSSFNVIVGSDVVLTGDTKSGVMEVNFDMGKSPSPSLSLTAATQINSVT FT LHRRLGHPGRVPLEKAFPGVIFSDSCEPCILSKHHRLPSKSHLPLACDRLS FT VLHSDLSGIISPPSLGGGRYYFKITDQKTNFKFVYILKTKCQTFSYFVQFK FT ALVENQTNLKIKTLVNDNGGEYTSKQFSEFISHEGIKMDFTAPYTPQQNPI FT SERGNRTTTEKARALLKQANLPLVFWAEAVSTSVYLENITPIARNNFLTPF FT KSWFGKSPSYSHLRVFGCLAYVHVGKERRTSKFSDVAKKGVLLGSQGSMHN FT YRIYLLDEHRVVYSHDVVFNEEVFPFADPTTLHAFNKKFGSFDNHFLEDLN FT DDVSYEELLPSSPIHPLSEDEDMSISKIPSFDAVPSTSSSDQPVQNESTTD FT QQPSDTPVIKDSQIEVEPQVEDQPIEDEPNEDEEPRQREELTSHPYAYIPA FT SVPAPKAISSSIDTSNIITSRRRANLAVNTLTSTRRALLVNQNSEPQSYRE FT AMNRTDSLKWKEAINKELNSLIEMGVFTEMELPEGAHALGTTWAFRRKTDE FT NNVVIKHKARLCAQGFSQIPGLDYNETYAPTGRAASMRIALSICGIDDLEL FT RLMDAVGAFLNGIPEEVLYIKIPQGYIPKLTGKNIVLLLNRSLYGLKQSPR FT CWYKMVKTFFLSINFSPSKSDPCLFISKDPNWRCFVHVHVDDMLIMGKDTA FT RFSKLIQTRFKMEDLGDVSFYLGMRLERDRVARTITLTQDKYILSMLDEYG FT MRDCHPVSTPMVPGSYLAPATDEDHKLFLETGQNYNRAVGLLNYLVLCTRP FT NLAFTAGQLAQHLKKPGPEHWAAFKRVLRYLQGTYQDGLVLGGGVIDLKIY FT ADSDYAGCPATRRSTTGYISKVGNGCVSWRSRKQASVSTSSTQAEYRAAYK FT AAQEAVWLRRILTDLGCPQSGGTTFLCDNQSSLALQKNPLFKDRSKHFAVH FT LHWIREQVDANVITPTYIPTKEMLADVCTKSLPRPQHQYLTDQIRN" XX SQ Sequence 4585 BP; 1293 A; 1132 C; 868 G; 1292 T; 0 other; gtcacgatct cattcctatc tcattaggtt atgagcccag cccaatcatc ttaatcagat 60 tcagcgctat caacaatttc aattaatgag cgacggtcaa accacgactt cgtccgcacc 120 tgctacacct tccgatttca ctatccaaca agatcacacc cacattaccc aagtaatgtc 180 gaatcacact tctacctctg aagccatgca catcgctcca ctggagcatg attggcaaag 240 gtggtcccct ctcatgctcg 
ctcacttcat ggagggagac cttgacggta ttgtcgacgg 300 caccgaagaa gaaccggctg aagatgcaac tgccgccatc aaactcgact atcaacgtcg 360 acgtaagaag gcagctggtt ttatctctcg taaattaagt ccagagaatc atgcgttagt 420 catcaatacg gctaacatca aggatcccaa agctatctgg gatgccttag tcaagctcta 480 tgcctccacc aaagcgcgta atcgagctcg tatattacga aaatttctca atcttaaatg 540 ttccgacaat gctctttcat cctttcttac tgagtatcgt aaaatcactc atgaaatgac 600 tgaggtctcg ttcaaaattg acgatgacat cctggcacat atgctattat acaagctttc 660 tccacgctat cattccactc atgatttact cgctcacacc gctgaaactg ccgataccct 720 tctcacgctt gatcaagtat tcgagcacct tcagcaaatc gttcttgatt accaacccag 780 ctctgttcca ccacctgccg ctctagttgc tgaacgccga actccagctt atccacgttg 840 cttaaacggt gttcataatc ctttaacgtc tcatactgaa gagcaatgtt atcaagccca 900 tcctgaactc aagccaattc gaaacatcaa tcgttcgaat aatcgtagct cagcgaatgc 960 gaccattagt ggtacggttc tcactactta cgttttcaac gctactctca ctggaaaacc 1020 tgttcttgac tctggcgcct cccaatccat gttcagctct agactttgct ttactgatta 1080 tcgaccttgt cacgccgcca tccatgtcgc caacggacaa accatcaatg ccgtgggaac 1140 cggaactgtg tcaggatctc atgacggtaa agccatttct ttcacaaatt ggcttcatgt 1200 accggagctt tcaacaaacc tggtcagcat ggtcgccttg gttcgaaaag gctgtacatt 1260 caactttact ccagactcta gtttcaatgt tattgttggt tctgatgtgg tacttactgg 1320 ggacaccaaa agtggagtta tggaagttaa cttcgacatg ggcaagtcac cttcaccttc 1380 actctcactc accgctgcaa ctcaaattaa ttcagttact cttcacaggc gcctaggcca 1440 tcctggacga gtccctttgg agaaagcgtt tcctggagtt atcttttctg attcttgtga 1500 accttgtatc ctgtccaaac atcatcgcct tccatccaaa agccatcttc ctcttgcttg 1560 tgatcgtctc tctgtgttac atagcgactt aagtggcatt atctctcctc cttctctcgg 1620 tggtggaaga tactatttta aaataaccga tcaaaagacc aatttcaaat ttgtttacat 1680 tctgaaaacc aagtgtcaaa ctttttccta ctttgtgcaa ttcaaagctc ttgtcgagaa 1740 tcaaaccaat cttaaaatta aaactctcgt caatgataat ggtggtgaat atacatcaaa 1800 gcaattttcc gaattcatta gccatgaagg aatcaaaatg gatttcacag ctccatatac 1860 tccacaacag aatcccatct ctgagcgtgg aaatcgtaca actactgaga aagccagggc 1920 tttactcaag caagccaatt tacctctcgt tttctgggct 
gaagctgttt caacctctgt 1980 gtacctggaa aacatcactc ccattgctcg aaacaacttt ctcactccat tcaaaagctg 2040 gttcgggaaa tccccttctt attcccatct acgtgtgttt ggatgtcttg cgtatgttca 2100 cgttggtaag gaacgtagaa ctagcaaatt ttcagacgtt gctaagaagg gtgtactact 2160 tggctcccaa ggctcgatgc ataactatcg aatttatctt ctcgatgagc atagagtagt 2220 gtatagtcat gatgttgtgt tcaacgaaga agtatttccc ttcgctgatc caactacatt 2280 acatgcgttt aacaagaaat ttggcagttt tgataaccat ttccttgaag atcttaatga 2340 tgatgtatct tatgaagaat tactaccttc ctcacctatt catccactta gcgaagacga 2400 agacatgtca atttctaaaa ttccctcatt cgacgctgta ccctcaactt catcctcaga 2460 tcagcctgta caaaatgagt ccactactga tcaacaacca agcgacacac ccgtgatcaa 2520 agattctcaa attgaagtgg aacctcaagt cgaagatcag cctatcgaag atgaacctaa 2580 tgaagatgaa gaacctcgtc aacgagaaga actcactagt cacccttgat atgcttacat 2640 tcccgcctct gtacctgctc ctaaagccat cagtagctcc atcgatacat ccaatatcat 2700 cacatctaga agaagggcca atttagctgt caacacactc acctcaactc gaagagctct 2760 cttagtcaat caaaattctg aaccccagtc ctatcgtgaa gccatgaatc gtacggattc 2820 actgaaatgg aaagaagcta tcaacaaaga actcaattca ctcatcgaga tgggagtctt 2880 tacggaaatg gagcttcccg aaggcgctca tgctttaggt acaacttggg cctttcggcg 2940 gaagacggat gaaaacaatg ttgtcattaa acacaaagcc cgtctatgtg cacaaggttt 3000 ctcccagata ccaggtctgg actacaacga gacttatgct cccactggac gagcggccag 3060 catgcgaatt gctctcagca tctgtggtat cgatgactta gaacttagac tcatggatgc 3120 agttggagcg tttctgaatg ggattcctga agaagtctta tatatcaaga tacctcaagg 3180 atacatacca aagctaactg gcaagaacat cgtcctactg ttgaatcgat ctttgtacgg 3240 tctcaagcag tctcctcgct gctggtacaa aatggttaaa accttcttcc tctcaatcaa 3300 tttctctcct tctaagtccg atccgtgcct gtttatttca aaagatccta actggcgatg 3360 ttttgtgcat gtacatgttg acgatatgtt gatcatgggc aaggacacgg ctcgcttctc 3420 gaaactcatt caaactaggt tcaagatgga ggacttaggt gatgtatcat tttacctagg 3480 tatgaggctg gaacgtgatc gtgtcgccag aactattaca ctaactcaag acaaatatat 3540 tctaagtatg ctcgacgaat acggtatgag ggactgtcat cctgtttcaa cacctatggt 3600 acctggctct tatctcgctc cagcaactga cgaggatcac aagttatttt 
tagaaacagg 3660 tcaaaattat aatagagcag ttggactcct caattattta gttttatgca cgaggcccaa 3720 cctagcgttc acggctggac aattagctca gcatctcaag aagccgggtc ctgaacactg 3780 ggctgctttc aagagggtgc ttcgctatct tcaaggaacg tatcaagatg gattggtgtt 3840 agggggtgga gtcatagatt taaagattta cgcagattca gattatgcag ggtgtccggc 3900 aacaagacgc tcgacaacgg gttacatctc aaaagtgggt aatggttgtg taagttggcg 3960 gtcgagaaaa caagcatcag tgtccacttc ctcaactcaa gctgaatacc gtgctgccta 4020 caaggcagct caagaagcgg tatggctacg gcgtatttta actgacttag gctgtccgca 4080 atcaggtggt actacatttc tatgcgacaa tcaaagttct cttgcgcttc agaagaatcc 4140 tctatttaaa gacagatcta agcacttcgc ggtccatctc cactggattc gagaacaagt 4200 ggatgctaat gtcataacgc ctacttacat accaacaaaa gagatgctgg ctgacgtgtg 4260 tactaaatca cttcctcgac ctcaacatca atatcttact gatcagatta gaaactagga 4320 tgttatcaat tgaggggggg tattgaatta tcatcacaat agttaacatc cactcaaggc 4380 cgtagcaact attccgcatt ggtcatagta tccaacgtct ccttatttct ctctttacca 4440 ttctcatatg ttttatcttt gtcccaaagc ggcgcatgag aagcgcaaac ctgatgaaag 4500 attgttaacc tatgtatcgc aaagttatca ctatataaag accaattgtt tgcgtgttat 4560 tgtattttct tcactcaaac attat 4585 // ID TY3-1p_LTR repbase; DNA; FNG; 365 BP. XX AC AY198187; XX DT 15-APR-2008 (Rel. 13.04, Created) DT 15-APR-2008 (Rel. 13.04, Last updated, Version 1) XX DE Saccharomyces paradoxus Ty3-like retrotransposon, Long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; TY3-1p; KW TY3-1p_LTR. XX OS Saccharomyces paradoxus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. XX RN [1] RP 1-365 RA Fingerman E.G., Dombrowski P.G., Francis C.A. RA and Sniegowski P.D.; RT "Distribution and sequence analysis of a novel Ty3-like element RT in natural Saccharomyces paradoxus isolates."; RL Yeast 20(9), 761-770 (2003). XX DR EMBL/GenBank/DDBJ; AY198187; Positions 1 365. 
XX SQ Sequence 365 BP; 136 A; 80 C; 54 G; 95 T; 0 other; tgttgtatct caaaatgaga tacctcagcg ttactagatt caccaaccta gacataaaac 60 atgtatgaaa cacgtacgaa acaatagctc caaaacggac aatattgagt atactaggca 120 gcctacttgc ctaagacgaa ccaaaccaac caaacgtata aatacctgaa caattagttt 180 agatccgaga ttctgcgctt ccacccttta gtgaaatcca gatcttatat agattatata 240 agacaagtaa catcaagtaa catttctgtg aatcacgtta ataataagtc tgacaacaag 300 ttactctcct aaacgacttt aggattgtca agacatccgg tattactcga gctcgtaata 360 caaca 365 // ID Harbinger2-1_TMe repbase; DNA; FNG; 2731 BP. XX AC . XX DT 13-AUG-2010 (Rel. 15.09, Created) DT 13-AUG-2010 (Rel. 15.09, Last updated, Version 1) XX DE A family of autonomous Harbinger transposons - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW Interspersed repeat; Harbinger2; Harbinger2-1_TMe. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-2731 RA Kapitonov V.V. and Jurka J.; RT "Harbinger2, a novel clade of Harbinger transposons in protozoan, RT fungi, choanoflagellate, and metazoans."; RL Repbase Reports 10(9), 1219-1219 (2010). XX DR [1] (Consensus) XX CC Harbinger2-1_TMe belongs to a novel clade, Harbinger2, of CC Harbinger DNA transposons. This clade includes transposons CC present in protozoan (brown alga), fungi, choanoflagellate, and CC metazoans. CC Harbinger2-1_TMe is a consensus sequence of a family of CC autonomous Harbinger transposons that were active in the Tuber CC melanosporum genome recently. The consensus was derived from 6 CC copies ~98% identical to it. The genome contains only three CC full-size copies of Harbinger2-1_TMe; they are flanked by the TWA CC target site duplications. This transposon codes for the 399-aa CC TPase (2 exons). XX FH Key Location/Qualifiers FT CDS join(189..416,453..1418) FT /product="Harbinger2-1_TMe_1p" FT /note="Harbinger TPase." 
FT /translation="MDIRNKAFHRKVVLLFLLLIAHIHLIQRRRVRLQVES FT ERQGRLQPNPSLSRPISYQPQVFLLDTCGWNDVDFREYFTGVNEDYELRAN FT KLVFSFSKEEIRRILPCLQLERVQWRYRFHPHPEEALCILLYRLSYPHRLK FT DCLKIFQCSRTKLSVIFNDTITYLISRYAETLRWDKKRLTITTLQQYAQAI FT QNATGLNGIWGFVDGTMRPFCRPGENQGTFYSGYKKSHAFKFQSIVSPDGL FT LCSLIGPVPGPVGDWIVWRSSGIVEILQNMFEASGISEDERLYVYGDSAYA FT PAFGVMGPYVERVNKPLTMEEEAANLVMSGERIVVEWGFARVVNYWALNSF FT KSGLKVGLSPIAGYYMVATLLSNILLCISGGNQISEKYHLSPPSLEEYLYI FT SDQV" XX SQ Sequence 2731 BP; 725 A; 597 C; 572 G; 837 T; 0 other; agggaggttt tgctgcgacg gtcgtcgctt ggatcgtgct aaaagtctcg gcaaagttca 60 gtgtggcaac cggttttttt ttcaggacat tttgcggcga ctgtcgtaac ttgaactcaa 120 ccctacagtt aagttaatcc ttcagatctg ctgcacttaa ctcttaccaa acctgtccac 180 agtcctacat ggatatcaga aacaaagcct ttcaccgaaa ggtagtcttg ctttttctcc 240 ttctgattgc tcacattcat ctcattcaac gaaggcgagt acgattgcaa gtggagtctg 300 agcgacaagg gcgactccag ccaaatccat ctctgtcacg gccaatatct taccagcctc 360 aagtatttct tcttgatact tgtggatgga atgatgttga ctttagagag tatttccggt 420 atgttttctg actcagatat actttgccta gtacgggggt caacgaggac tatgaattaa 480 gggctaacaa gctggtgttt agcttttcaa aggaagaaat acgacgcata cttccatgcc 540 tgcagttaga gcgagtccaa tggcgctacc gcttccatcc acatcctgaa gaagctttat 600 gtatcctact ctaccgactc tcctatccac accgtttgaa agattgtctt aaaatctttc 660 aatgctcgcg cacaaagctc tcggtcattt ttaatgacac catcacatat cttatttcaa 720 gatatgctga gactctcagg tgggataaaa agcgacttac gataaccacg cttcaacagt 780 atgcacaggc aatacaaaat gctacaggat tgaacggaat ttggggcttt gtggatggga 840 ctatgagacc attttgtcgt ccgggtgaaa accaaggtac tttttattct ggatataaga 900 agtctcatgc atttaaattt caaagtattg tttctcctga tggtttactt tgttctttga 960 ttggtccagt tcctggacca gtaggagatt ggatagtatg gagatcttcg ggaattgtgg 1020 aaatcttgca gaatatgttt gaagcaagtg ggatatcaga agatgaaaga ttgtatgtat 1080 atggtgattc agcatacgca cctgcctttg gggtaatggg gccatatgtg gaacgggtta 1140 ataaaccatt aaccatggag gaagaagcag ctaatctggt tatgtctggt gagagaattg 1200 tggtagaatg gggttttgca cgagttgtta attattgggc attaaattct tttaagtcag 1260 gactaaaagt 
aggactctca ccaattgcgg gatactatat ggtagctact ctgttaagta 1320 atattctatt atgtatttca gggggaaatc agatttcaga aaagtatcat ttgtctcctc 1380 caagtttgga agaatacttg tacatatcag atcaagtata aaggaaaaaa aaaaaagcta 1440 tttggatttt aaaatccgca aaatttccac tgtattactt tgcacttcct ctaatgccct 1500 ctccgttgtt tccaagcgtt cctccacact cccaagacga ttttgtattt cctctttctc 1560 aaagataacc tcctgaccca ctttctctgg agtagaaccg ctatcgggga tcagagtatt 1620 tgccacacgt tcaagaacag tagttagacg actactttct tcagccaaac tggtatctaa 1680 aagctcaccc cattcccgga tttttcttct cttctctcgt aatcctgctg ctttatcctc 1740 cgaagcttta cctgtattct ccttcttctc ttcggtttca gtatggtgac tatcaagggc 1800 aacaatcgca gactcgtaat ttctttttcc tttcatcgta gaaagtagtc ctttccggca 1860 tgcttctgca gcttcttggt ctttcaactt tctcgctttt gtagcttctc cttgtgaact 1920 gtgacctaaa ttgcctcgtc catctccact ctagtaatat tagcaatttt ttttcttttt 1980 taatagaaaa gtctggtata taagagtatc aacctacact ttctacccgt gccttccatt 2040 gtgccattgt ttgctcatat tctccgtcag tctgagcagt tccactttcc tttaagtctt 2100 gatgaacttt aacttcgtaa tcggcaagca ttcgtgccat tgtactccga ggattgctca 2160 caggacggga gactgcattt gagaacaatt gttgaatctt taaccaaaac cgttcacgtc 2220 ctccccgaat tgaaaattct cctccatgaa gtacacacag tcgggcaaga agaagcttct 2280 ccaattgact atatcgatta cgtggagcaa cttgtactgc aggtacaatt tcaagttcag 2340 gtgactccaa ggatgcccga acaggctgca gagattctcc ggcaggctgt aaagctgccc 2400 cagcagactg tgaagcttct ccagagggcc atgaagatga ttctgtaggt tgcgataata 2460 atctggccgg ctctggggag gatctggtag gctgcgataa tggcatttga agcttcgttg 2520 aggatagact ctgacctgga tatgaaagta aaagattttc atgtcaacaa actagtaaac 2580 ataactcctc ggcgactgtc ttatttaaga actgttctta agactgtgca aagctgagcc 2640 accaagcgct atcgcacaat tggaaaggtt agtgccgttg ctgtggtgag aaggaaagtt 2700 cgcacgacga ccgtcgcagc aaaacctccc t 2731 // ID Copia-2_TMe-I repbase; DNA; FNG; 4643 BP. XX AC CABJ01001613; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: DE internal portion. 
XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_TMe_; KW Copia-2_TMe-LTR; Copia-2_TMe-I. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-4643 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01001613; Positions 445689 450331. XX CC Positions [1794-2105] - Integrase core CC 'GTTTT' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 80..1339 FT /product="Copia-2_TMe-I_1p" FT /translation="MSEDSSNNAYTDKRFHCPLLTEKNYPTWERSIKMRLI FT AENCWEIVIGDEEAPNSPVLADGSSRAAETAYAIALKEFKTELVDFRKREG FT KAASIINSTVSSGIEFYAKDTINPREMWNVLQNKLTLVDNWGLQRTLKRDF FT YKMSYDGKETITKYINHLRVFQQQLQGTNNEISNNELVNRIMTSLPASWEQ FT RIITLDDRRDLTLDDLERALRSHQAKIADIPTQATKAFMVTRHASYQRGRG FT RGYRGRGRNFRPVDRGGFEHRIRSCWYCLKPGHSQNDCGVKKKAEEAKKDR FT IRRHTGKEADSGRISASISLADTHALMLKRESVKYAPGEWFIDSGATDHMC FT NEHSDFFSLKRLSSLIHVVLGDGTTVYAYGVGSIHLSPQILLNQVLYVPSL FT GMKLLSVSAITRLGYQVIFNDLGCQV" FT CDS join(1944..3494,3498..4457) FT /product="Copia-2_TMe-I_2p" FT /translation="MHTVMERARTMLLESQLEDRFWAEAVNTSVYLHNRCP FT TRALDGCTPYEAWHETKPALQHLRPFGCNAYVHVPAQRRKKLDAKSRLCTH FT LGYVHNTTKLWQVWDNASSRAVQVADVVFDENSFSGRTYTHSLPPLSTLLV FT DEIDYTSVNDMFTEAGSLPSEPTSGNTISQMYHDGPPEVDDTSGRMRTNML FT EDRISVSDVPNPGTHTHASPMPTGSRVVRPMTLNSMAPEEGEPTTSHRVAH FT DRHQLPAPRKSTRARKPSFWLRDSVTFAASATVGEDPMSYRDALEGPNRKQ FT WEMAIREEFKSHIDNGTWELAELPPGKNDITCKWIFKLKSNADNSTRFKAR FT LVIRGFEQVPGIDFHETFAPVAKFVTVRVLLALATHYDWEIEQMDVKTAFL FT DPQLQEEVFMAIPEGYAEYSDMPHAAGEYPVVHLLKALYGLKQAPRAWYED FT IHKFFTEAGLSRSSEDHSLYFSADMIVILYVDDLLLFAKDMQSIDRMKLKL FT ATAYLMTDLGPIQQFLGLQINNCQAHSLELCQSSYIQTVLTRFQMSNCKGI FT STPMESNLQFPRSLDKDEIHDRPAYQSKIGSIMYMMLGTRPDLAFTISALS FT KHNDRPSHSHHIALQRVFRYIQQTRNTGIRYQSSSGEPGTFPKSICYTDSD FT WAGDTSDRKSTGGYVFILCGGAISWKTRKQDVIATSSTEAEYVAITEAAKE FT 
AVWLRRLLIELESRVVNSSMLNITSNHFHGTDEQWESLTNHNITKQSISKL FT PWSSSQPQTIYADNQGAIKLSDNPQFHARTKQIDIRHHFIREARERKEVTV FT TYVPTADMTADILTKALTKEKHLQHMRGMGMVDLL" XX SQ Sequence 4643 BP; 1335 A; 1046 C; 1051 G; 1211 T; 0 other; ggttatgagc ccggattgac gctgcaaagc cgaaatcctc gcccactgtc cttgagtaga 60 gaaaagataa atcccaacga tgtctgaaga ctccagcaac aatgcatata ccgataaacg 120 attccactgt ccacttttga cggaaaagaa ctacccaacc tgggagagaa gtatcaagat 180 gcgcctgatt gctgaaaact gttgggagat tgttattggt gatgaagaag cccccaattc 240 ccctgttctt gctgacggtt cctcacgagc tgctgaaact gcctatgcca ttgcgttaaa 300 agagtttaag actgaattgg tagactttag aaaaagagaa ggaaaagctg cctctatcat 360 taactctacc gtctcctccg gcattgagtt ttatgctaaa gacactatta atcctagaga 420 aatgtggaat gttcttcaaa acaagcttac tttagtagac aattggggcc tccaacgtac 480 actcaagcgc gacttctaca agatgagcta tgacggtaag gagacgatca caaagtatat 540 aaaccatctt cgggtatttc aacaacagct ccaaggaacg aacaacgaga tctcgaacaa 600 tgaacttgtc aaccgaataa tgacctcgct tccggcaagc tgggagcaac gaatcatcac 660 cctcgatgat agacgagatc ttacacttga cgacttagag cgtgcccttc gaagccatca 720 agcaaagatc gccgatatac caacgcaagc tacgaaggct tttatggtta ctagacatgc 780 aagttaccag cgaggcagag gaagaggcta ccgtggccgt ggtagaaact ttcgaccagt 840 tgaccgcgga ggtttcgaac accgtatcag gtcttgttgg tactgtctca agccaggaca 900 ctcacagaat gactgtggtg ttaagaaaaa ggcagaggag gcaaagaaag atcggatacg 960 acgtcataca ggcaaggaag cggattctgg aagaatcagt gctagcatct cgcttgctga 1020 tacccatgcc ttgatgttga aacgagagtc agtgaagtat gcccctggtg agtggttcat 1080 cgattctggt gctaccgacc atatgtgcaa tgagcatagt gacttcttct ctctgaaaag 1140 attgtcttca ttgattcatg ttgtgttggg tgatggtaca acagtatacg catatggggt 1200 gggatcgatt catctaagtc cccaaatcct tctaaatcaa gtactctacg ttccttcttt 1260 agggatgaag ctactctctg ttagtgcgat aacacgactt ggatatcagg ttatcttcaa 1320 cgacttaggt tgtcaagtct agaaagaaga aaatgagata ctctcagcct cactcgaggg 1380 taacctattt aaggttaacc aaatacgtct aaacttgtgt aggataccaa atgtggaaac 1440 aaataacgtc acaccggtgg caagaatgtc acctgcaagt ggattcaact tgcaactttg 1500 gcatcaacga ttagggcacc 
tgaacgctcc tgatgtacaa cgtttagaga acctttctac 1560 gggacttcga atcaacaaat ccaaacaagc atcttcctta tgccgttctt gccttgatgg 1620 caaacagcaa cggtcattta accggaagaa cgtttcttca cgggtattgg agaaattggc 1680 cttggtccat tcagactcct gtggaccttt tactacacca cccattgcgg gagcaaagta 1740 tttcattatt tatgttgatg attatacacg tatggtctgg tgttatttct taaagcagaa 1800 ggctgcttca gaggtattgg atatatttaa aatgtttaag gtgctggttg agaagcactc 1860 ggaacttcct tgcctccgaa ggaatatcct acgaaccatc agctccgtac acgcagcatc 1920 agaatggaat cagtgagcgc accatgcata cagttatgga acgagcacgg acaatgctcc 1980 ttgagtcaca actcgaagat cgtttttggg ctgaggcagt taacacctct gtctatctcc 2040 ataatcgttg tcctacacga gcccttgatg gatgcacccc gtatgaggcc tggcatgaaa 2100 caaaacccgc cttgcaacat ttacgaccgt ttggttgtaa tgcttatgtc catgttccag 2160 cacagcgtcg aaagaagctg gatgcaaagt cgcgactttg cacacacctt gggtatgttc 2220 ataacacgac aaaactgtgg caagtatggg acaatgccag cagccgagct gtacaggttg 2280 cagatgtggt atttgatgag aatagcttta gtggacgtac ctatacacat tctctgcccc 2340 ctttgagtac actgttggtt gatgaaattg actatacttc tgtcaatgat atgttcacag 2400 aagccggtag tctaccgagt gaaccgacca gtgggaatac aatatcacag atgtaccacg 2460 atggtcctcc tgaagttgat gatacttcag ggagaatgcg caccaacatg ttagaggata 2520 ggatcagtgt gtccgatgtg ccgaatcctg gtactcacac acatgcttca cccatgccta 2580 cgggaagcag agtggtaagg cccatgactt tgaacagcat ggcacctgag gaaggcgagc 2640 ccactacttc tcacagagtg gctcacgata ggcatcaatt accagctcct cgaaagtcca 2700 cacgagctcg gaagccttcc ttttggttaa gggatagtgt cacttttgct gcgagtgcaa 2760 ctgtgggtga ggaccccatg tcatatcgcg atgctcttga agggccaaat cgcaagcagt 2820 gggagatggc aattagagaa gagtttaagt ctcacattga taacggtaca tgggagcttg 2880 cggagctccc cccaggaaag aatgatatca cttgcaagtg gatttttaag ctaaaatcaa 2940 acgcagataa ttctacaaga tttaaggctc ggcttgtaat ccgcggattt gagcaagtac 3000 cgggtattga tttccacgaa acctttgccc cagtggctaa gttcgttacc gttcgtgttt 3060 tgcttgcact tgcaacccac tacgattggg aaattgagca gatggatgtt aagacagcat 3120 tcctggatcc gcaattgcaa gaagaagtct tcatggcaat tccggaggga tatgctgagt 3180 attcggatat gccgcatgct gcgggagaat 
atccagttgt gcatttattg aaagcgcttt 3240 atggtttgaa gcaagcacca cgggcgtggt atgaagatat tcataagttc ttcacagagg 3300 caggcctgtc aaggtcaagt gaagatcata gtttatactt ctccgctgat atgatcgtta 3360 tcctttatgt tgatgaccta cttctatttg ctaaagatat gcaatctata gatcgtatga 3420 aactgaagct tgctacagct tacctgatga ccgaccttgg accaattcaa cagtttcttg 3480 gtctacagat caactgaaat tgtcaagcac attcccttga actctgccag tcatcgtata 3540 tacaaactgt tcttactcgc ttccagatgt ccaattgcaa gggaatatca actccgatgg 3600 aatccaatct tcaattccct cgaagccttg ataaggatga gatccatgat aggccagctt 3660 atcaatccaa gatcggaagc attatgtaca tgatgcttgg aaccaggcca gatctagcct 3720 tcacgatatc agccttaagc aaacacaatg accgaccctc tcactctcat catatcgcac 3780 tccaacgagt cttccgttac attcaacaaa cacggaacac tggaattcga tatcaaagtt 3840 caagtggtga acctggaact tttccgaaat cgatctgcta tacggactcg gactgggcag 3900 gtgatactag tgatcggaag tccaccggcg gatatgtctt tattctctgt ggtggggcaa 3960 tatcatggaa gacaagaaaa caagatgtta ttgcaacgtc tagcacagaa gcggaatatg 4020 tcgcaataac tgaagcagct aaggaagctg tctggttaag gcgattgcta attgaattag 4080 aatcgcgagt tgtgaatagc tctatgctca atattacaag caaccacttc catggcactg 4140 acgagcagtg ggagtccctg accaatcaca acatcacaaa gcaaagtatt tctaaactcc 4200 cgtggtcttc ctctcaaccc caaacaatct acgccgataa tcagggagct attaagctct 4260 ccgataaccc tcagttccat gcacgtacaa agcaaatcga catacgacat cactttatcc 4320 gagaagcacg agaacggaaa gaggtaactg taacctatgt tcctactgcc gatatgaccg 4380 ccgacatcct caccaaagct cttacgaagg agaagcactt gcaacatatg agggggatgg 4440 gaatggtaga cctattgtag gaaggagtac aagatttttg tttatttttt cttttcacac 4500 ccaacactct ttttccctgg caagcaattt ttttttccca agggtttttt ccttacctgg 4560 tttttaatga ggtcgtggta atttcatagt ttattgcatt tcttggttga ctggcttcaa 4620 tcgatactac aaggaagtgg gag 4643 // ID Gypsy-15_LBS-I repbase; DNA; FNG; 10808 BP. XX AC ABFE01001144; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_LBS_; KW Gypsy-15_LBS-LTR; Gypsy-15_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-10808 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001144; Positions 8510 19317. XX CC Positions [2798-3283] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 2690..3925 FT /product="Gypsy-15_LBS-I_1p" FT /translation="MPTMSKDAEIYATSCDVCQKIKTDHRAKMGALRPTHI FT PSRPFSTVTMDMITGLPPSREQKFTAILVIVDKLTKFAIIIPTHTTLTQEG FT FAKLFVERVVNVYGLPEVIISNRDRQWATIFWRSVVSNYGSVMALSSVHHP FT QTDGQTEVLNATIKQMLRAYVSTDKENWSNWLSVLAYSYNSSVHSSMKYSP FT NFLLMGYNPRTSVSAIVPEINPAHRPFLPSQTAEDFVEALEIHHNSAKDAI FT ALAQDRQAKAYDKKRRLVQELNVGDYALVNPHSLELVDVEGTGKKLIQRMI FT GPFEVVEKINPVVYQLRLPDTYSMHPVFNLEHLKKYVPSPEQFGERSELPS FT TRDLRASEEYEVKAILGHRLVSKRKANCRMFLVRWKGYGPVDNSWVSEYDL FT RNSPLLKREYLDSQGLSI" FT CDS 8097..9728 FT /product="Gypsy-15_LBS-I_2p" FT /translation="MPDANARTRMADDHYLGVWLNGMEERQARWYLKEGIP FT CFIVREIMPLERAQLAAPETMIDFAAGSSAAPLHWGVNDYNSLALSRGDLS FT LRDTSSFHNPGWVWSEPPVQKNKSPAKEPPRTQSSGWAWSGIQASKDKSPV FT KEQPKSQTIDYGPPPPETVIIAKDQVPWIKPPPVKRATPSRPGAPPHEQKK FT WIKWVENHQPEGTFCKVGAKFAPNHRSHSMYDRDKHRHLFFLHPPRAPEGC FT VSNPDVFGIPCPKGIYKDMTKTQHSQPFWIYKTQEPQSSDVGKVAPVPRPE FT DLPRLNETPRPPPDNDSDSDDSYYPDLDFLRQSVQNKATSSKVTSESAIPT FT PAAQVVEARAPLPAQIVESPPVTPSVRTEPAALGTTVELRVNLSEPTRIPP FT PLPVPVTNCTQNEDEISLGDEDSIHEAMGPQIPPLTFTVDSEIRMEEAPVT FT DPLEFASAFLMLYGLPSSEGFSTTQSLITSIAARLHLTVRQIFRANSGHNQ FT SFWFEMELVDQARQMRTYMHHRRENDRELLVTYADYEDYVGALARSSL" XX SQ Sequence 10808 BP; 2998 A; 2772 C; 2133 G; 2905 T; 0 other; gactatcgtg gtaaaagatt tgaaaggaac tccctgttat gctaggctcg aagtggcctg 60 ggaagctgga aaggaagtac ctcagttaat aacgagacta cataaagaaa cacatgaagt 120 caaacatcca gacgagtggg gagccatgtt 
ggatgaaact gttgtccatg tggaaagagt 180 tatcacctat tggagcagaa cctttaaatc tgctgaacgg aattatagtg ctactgaaca 240 ggaagtgctc gccgccaagg aagcacttgt caagtttcaa cctttcattg aaggagaaga 300 aattacttag tatgggtgtg agtctatgag aatgccaatt gtcatctagc tgcctggggg 360 gcggtattcg ctgcctaccc tggactcaaa atagtgcatt gagccggccg aatacattct 420 aacatcaatc ccttatcccg cttgatacaa attccaccac acgattctcc gcttagcgac 480 gacattacac ctatagaaca ggatcctgtc aaacgtaata tcactcaaaa agctgaagac 540 cgaatattcc gtgcagctgc tcctaaggca gcattcagtg ttatgtggtg ggaagacgta 600 attgacaaac acgcatctgc ggtccggaca agacgacaaa ccactgccga agaaaggcgc 660 aaaactatga ggtctttacc atcccccatg agttccggcg gaatcatgga attccaatgg 720 aaactagttg gctggagcct cagccattct ggttcccaat tccacagaaa ttccaatgga 780 atccgacgga atccggtgga aatggtcgga atccgggagc gtcccggaac agtttccatt 840 ggaatccatt ggaattccaa ccagattcca tggaattcca atggaaagta tattccagtc 900 aaattccaac agattccgag agattccaat ggattccaat atttgttgtt gttattataa 960 tatataaaaa aagaagtgga tgcacaagga ttgaactcta gcctcgggaa acgtcacgtg 1020 aagtccaagc agcaacattt ccactacacc atacggccgt taaaattcta caaacattta 1080 ttctatacaa gccaccctca cgcccacctt cctaaacgag tcacggcgtt tgccggcgtc 1140 accatcagca tctccgtttt gctgtcaacc acaaccacaa ccacaaccat aaccacaacc 1200 accaccaccg tcaccgcgac cagcgcggtc acctcgcatc acgcccacgc cagcggacaa 1260 cgacgacgac gatggacgac gacacgacga cacgacgacg acgacggacg acgacacgtc 1320 agcgacgacg aggccggcca cacgacgacg gacgacgacg acgcgtcacg tcaacacacg 1380 acgacgacga cgacgacgac gacgacacgt caacggacga cgacgacgac gacacatcaa 1440 cggacgacga cgacgacgga tgaggacgac gacacgtcag cgacaacgag acggccgcac 1500 gtcaacggac gacgacgagc ccccttcctt ttccctcccc ccttcctatt ccctcctcct 1560 tcctattccc tcccctttcc tattccctcc cctttcctat tccctcccct ttcctattcc 1620 ctcccctttc ctattccctc ccctttccta ttccctcccc tttcctattc cctccccttt 1680 cctattccct cccctttcct attccctccc ctttcctatt ccctcccctt tcctattccc 1740 tcccctttcc tattccctcc cctttccttt tcccttttcc tttccccctt ccctcctcat 1800 ttcctgcccc ttccttgccc atcccctccc tctctagtat acactgtatg 
tagttaaata 1860 tattttgtat tgtaattgaa atacatcatt atttgatcta aatcacttgt gaattttata 1920 gggaggtttg aatttctgac ttcaaaaaat acagctcaag tcacctgacc tctacctcag 1980 ttctgatcag cactaatcag tcttctggaa ttccattcag attccggtgg aatccattgg 2040 aactccattc agaatccagc ggaatccatt ggaattccat tcagaatccg gcagaatcca 2100 ctggaatttg agattcctac cattccggcg gactccggat ggaattccga cattctatgg 2160 gagtccgccg gaactcatgg gggagggtaa agtactggaa actatcacta aaccaaccag 2220 tgatgatcaa aatgagtctg actctgtgat agacgaagag acctcacctg ttgactccgg 2280 cgagcaaggt gaagtcttgc ccttccccaa atccgaccat tggacctacc cagcaggaac 2340 taagccctca gaattaccac caaatgaaga atggatgagt cgaacgcagt tgctcatagc 2400 ggtagaccca atcatttgca aggaatttgc agaaggttac gaaagggaca aattcttcgc 2460 cccccggtat atccaaaaac aacctaatga acaaacaata atctctgcta gccactttca 2520 gagaggtcaa gacagtctac tttactttat caatgcaggt tggagaacat gactttgtgt 2580 acctagtgca aagatcaact acgtgttaag gtggatacat gaatctcctt atgagagtgc 2640 ccatgctgga ccccgccact tcttagcgca acttcaagaa ctattctaca tgccgaccat 2700 gagcaaggat gccgaaatct atgctacttc ttgcgacgtt tgtcagaaaa tcaaaacaga 2760 ccatcgtgcc aagatgggcg cgctaaggcc cacgcatatt ccatcaaggc ccttttctac 2820 tgttactatg gacatgatta ctgggctccc tccatccaga gaacagaagt tcaccgctat 2880 actagtaata gtggacaagc ttaccaagtt cgccatcata atccccactc ataccacttt 2940 aacccaggaa gggttcgcca aactctttgt ggaaagagta gttaacgttt atggattgcc 3000 tgaagttata atatcaaacc gagacaggca atgggctact atcttttgga ggtcggtagt 3060 ctctaactat gggagtgtca tggcgttgtc ttctgtgcat cacccccaaa ccgatggaca 3120 aactgaagtc cttaatgcca ctatcaaaca aatgcttcgc gcttacgtct ccacagacaa 3180 ggagaactgg tcaaattggc ttagtgttct agcctattcc tacaacagca gcgtgcattc 3240 gtcgatgaag tactccccta acttcttact tatgggctac aatccaagga cctcagtgag 3300 tgcgatagtg ccggaaatca accctgctca ccgccccttc ctgcctagtc aaacagccga 3360 agactttgtt gaagctctag agatacacca taactctgcc aaggatgcta tcgctttagc 3420 ccaagataga caagcgaagg cgtatgacaa gaaaagacgt ctggtgcagg agttgaatgt 3480 cggagattac gcccttgtga atccacattc tctcgaattg gtagacgtcg aaggcaccgg 
3540 gaagaagctt atccaaagga tgattggacc ctttgaagtt gtcgaaaaaa tcaaccctgt 3600 ggtataccaa ctacgactac cagatactta ctctatgcac cccgtgttca acttagaaca 3660 tctaaagaaa tatgtaccct cgcctgaaca atttggcgaa cgatccgaat taccctcaac 3720 acgtgatctt cgagcttcag aagaatacga ggtcaaagcg atcctgggtc accgtttagt 3780 cagcaaaagg aaagctaact gcagaatgtt cctagtccga tggaaaggat atggacctgt 3840 ggacaattct tgggtatcgg agtacgactt acgaaattcc cccttgctca agagagaata 3900 tctcgattcc caaggcctat ctatttgatg ctccttggac gaccctccaa atattaccct 3960 ccgaacccta aattctggtt agggtcatta ataattctgt acagttcgat caaaaaactt 4020 tgatactccg atttctcttt agtacatagt gttttgaacc tttcttcctc aagtccccac 4080 agagctctca tcagaagtgt cctgctgctt tcaagcgtga ttgcacttga gccgcatatt 4140 atggctgggg agtggtagct accactttta tgataaaata caggcaattg cctgcacacc 4200 gacatccaca ctgcctaatg aatttatgaa atgagaagag taggcaaaac ttatgaaatg 4260 tatttcattt ctttaccagt gcttctcatg actatttcat aactgtcctg agacatattt 4320 caatactact tcattgcaat ttcattactg cctggccttc catttcattg tctacccagc 4380 attgtttcat tgctatttca ttcttgcctg acttttaatt tcattgccta cccagcctta 4440 tttcattgcc ttttcatatc cttgcctggg ctacttcata ttcatttcat atccctgcct 4500 ggacaatttc atatccctgc ctgggcagtc tcatatccct gcctgggcaa tctcatatcc 4560 ttgcctgagc aatctcatat ccctgcctgg gctacttcat attcatttca tttcttgcct 4620 taggcaattt catatccctg cctgggtgat ttcatatccc tgcctgggta atttcatatc 4680 cttgcctggg caatctcata tccctgcctg ggctacttca tattcatttc atttcttgcc 4740 ttaggcaatt tcatatccct tcctgcataa tttcatatcc ctgcctgggc aatttcatat 4800 ctatttcatt ccctaccttg ggcaatttca tatccttgcc tttgcaatct catatccctg 4860 cctgggctat ttcataccta ccctactgca tttatcattg ccccctcaga atgtcaggca 4920 ctgccttcag ggtccatttc agggtccatc ccaaagagga aacatcacct ggtttgacca 4980 aaagtgaaga aaaccaagag gagaatgttt ttacctatca gggtccatat aatgggatgt 5040 tgggatgatg gtcacatgac caattaccaa aaatggtcaa aatgaaagtt gtagccctac 5100 actatagctt cccgaatctg tttgaattat gttctgaagt gctcccattg tgaagatatg 5160 aggtaaaaga ttttcaaaag tgttgcttct tttttgtgtt tttttttttg catttttctc 5220 
catgaaaaca aaaccaaaca acatgatttc agtatatttg gaaagctata aaaaaaaagg 5280 gctacaatta ttgtatcgtg ggaaaaatga ccaaaaaagg aagatgcatg ccctagaaaa 5340 gaaattgaca ctgatagact ataatagtca gatatagact accataacta ccaaaaagtc 5400 agacatagac tacaaagttt actaatagcc agacttagac tacaaggttt actgatagac 5460 tacaaaagtc agacttagac tacaaagttt actgatacac tacaaataac tacaaatgtc 5520 agacatagac tacaatgttt actgatacac aaaagactac aaaagtcaga cttagactac 5580 aaattttgct gatagactac aaaagaccac aaaagtcgga cttagattac acagtttact 5640 gatagactac aaaagaccac aaaagtcgga cttagactac acagtttact gatagactac 5700 aaagtatact gataaactac aaaagactac aaaagtcgga cttagactac aaggtttact 5760 gatagactac aaaagaccac aaaagtcaaa aacttgtact tagactacaa agttagtagt 5820 gtctgctttt tgagactaca aaagtttgta gtgtgcgctt tttcattaga aaagtttgta 5880 gtgtccgctt tttgggacta caaaagtttg tagttgacat gtagtttatg gtgttcttgg 5940 ctctgtccag attatggggg tgaggttgaa gggttgacat gatcttactt acatattacc 6000 aatgctgctg cattagtttt tgacattatg cacaagttgt cttcacatta ggaaaggcta 6060 tcctatgcct tcataatggc acctactgtc ctatctcaaa atataattat tataagttca 6120 tgtggagata tggctgaagg aataccatga ctttacatat tgctattgcc acatcaccac 6180 tttttgacat catggccaag ttggctttca catcaggaca ggctattctc tgctttcaca 6240 ttgatagtta ctgctctgtc tgctgtcaca ttgatagcta ctggcctgtc ccaaactttg 6300 atatcagctc tttgagggct atagaatgat gggctgattc atccttggag gtgtccttaa 6360 gttaggtcat gttgtaagtt gaccatgtat tgagacagaa cagtagatat caacatgaat 6420 ggcctcctct ggcttgacac ccatcttgaa catgatgtga aaaagtgatg atgtggcaat 6480 agcaatatgt aaagtcatgg tattccttca gccatatctc cacatgaact tataataatt 6540 atattttgag ataggacagt aggtgccatt atgaaggcat aggatagcct ttcctaatgt 6600 gaagacaact tgtgcataat gtcaaaaact aatgcagcag cattggtaat atgtaagtaa 6660 gatcatgtca acccttcaac ctcaccccca taatctggac agagccaaga acaccataaa 6720 ctacatgtca actacaaact tttgtagtcc caaaaagcgg acactacaaa cttttctaat 6780 gaaaaagcgc acactacaaa cttttgtagt ctaagtacaa gtttttgact tttgtggtct 6840 tttgtagtct atcagtaaac cttgtagtct aagtccgact tttgtagtct tttgtagttt 6900 atcagtatac 
tttgtagtct atcagtaaac tgtgtagtct aagtccgact tttgtggtct 6960 tttgtagtct atcagtaaac tgtgtaatct aagtccgact tttgtggtct tttgtagtct 7020 atcagcaaaa tttgtagtct aagtctgact tttgtagtct tttgtgtatc agtaaacatt 7080 gtagtctatg tctgacattt gtagttattt gtagtgtatc agtaaacttt gtagtctata 7140 tccaactttt gtagtgtatc agtaaacttt gtagtctaag tctgactttt gtagtctatc 7200 agtaaacctt gtagtctaag tctggctatt attaaacttt gtagtctatg tctgactttt 7260 tggtagttat ggtagtctat atctgactat tatagtctat cagtgtcaat ttcttttcta 7320 gggcatgcat cttccttttt tggtcatttt tcccacaata caataattgt agcccttttt 7380 tttttatagc tttccaaata tactgaaatc atgttgtttg gttttgtttt catggagaaa 7440 aatgcaaaaa aaaaaaacac aaaaaagaag caacactttt gaaaatcttt tacctcatat 7500 cttcacaatg ggagcacttc agaacataat tcaaacctaa ctccttccag aatcctttct 7560 gccctggatc ctgactgact aacttgaacc tcagactggg gcatttcaag gaacaacatc 7620 gccattacga cggtcacatg ggaccatttg atcctaccag gaacactcaa ctctggttcc 7680 ctgggaagga atggagggcc cttatcctgt cgcctgctgc ggagactcca cgtgatttcc 7740 cagaatacga acttgtgacg aattgctgga agagcgatcc atcccccgct tttgatactg 7800 gctgcatcca cccttcctat gtcgacaaac tcatccattt gaatcaagat gtggaaagga 7860 gaatggaagc acttcacaag gagtacgttg atcatcattt cctggagccg tcgcactgag 7920 cattatggag ctctgccata cgacctctca acccttcctc agagaacctg aactctctga 7980 aacaaattag atgattcccg ctagctgtag attgggtcac cgaagctcaa cgaggtctga 8040 aagataaatg tgccttcatc aattatgtct cctggctaca agcttccccc ttttggatgc 8100 ctgatgctaa cgcccgtacc cgaatggcag acgatcacta tctgggtgtc tggctaaacg 8160 ggatggaaga acgccaagcg aggtggtatt tgaaggaagg catcccatgc ttcatagtaa 8220 gggaaatcat gcccttggag cgcgctcaac ttgcagctcc agaaactatg atagacttcg 8280 cagcaggttc cagtgcagcg ccgttacatt ggggtgtcaa tgactacaat tctctagccc 8340 tgtctcgagg tgacctatcg ttacgcgaca catcttcctt ccataatccg ggttgggtat 8400 ggtctgaacc ccctgttcag aagaacaaat ccccggccaa ggagcctcca aggactcagt 8460 cctctggatg ggcatggtct ggaattcaag catcaaagga taagtctcct gtcaaggaac 8520 agccgaagtc tcagaccatc gactatggac cccctcctcc agaaaccgta atcatcgcca 8580 aagaccaggt tccctggatc 
aaaccgccac cagtcaagag agccactcct agccgaccag 8640 gtgctcctcc gcatgaacag aaaaagtgga tcaaatgggt agaaaatcac cagcctgaag 8700 ggaccttctg caaagtgggc gccaagtttg ctcccaacca ccgttctcat tccatgtatg 8760 acagagacaa acatcgacac ctcttctttc ttcacccgcc tcgagcgcca gaaggatgcg 8820 tatcaaatcc ggacgtcttt ggtataccct gtccaaaagg aatatataag gacatgacca 8880 agactcaaca ttcacagccc ttctggatct acaaaactca ggagccacaa tcatctgatg 8940 taggcaaggt agctcccgtt cccaggccag aagatctccc acgtttgaat gaaaccccac 9000 gacctccacc tgacaatgac agtgacagcg acgacagcta ctatcccgac ctggacttct 9060 taagacaatc agttcaaaat aaagcaacgt cctcaaaggt aacctctgag agcgcaattc 9120 ctacacccgc tgctcaggtt gtcgaagcta gagctccact tccagcccaa atcgtcgaat 9180 cacctcctgt cactccatct gtcagaacag agcctgctgc cttaggcacg accgtagaac 9240 tgagagtcaa cttgtctgag cccactcgaa ttccccctcc cttgcctgtc ccagtaacga 9300 attgtacgca gaatgaagat gagatctcgt tgggggatga agattctatt cacgaagcta 9360 tgggtccgca gatcccacct ctcaccttca ctgtggattc ggagatcagg atggaagagg 9420 ctcccgtaac ggaccctctc gagttcgctt ccgcatttct gatgttatat gggctacctt 9480 cctcggaagg attctcgacc acccaaagtc tgataacatc aatcgctgct cgtcttcatc 9540 ttacggtcag gcagattttc agagccaact ctggccacaa tcagagcttt tggttcgaaa 9600 tggagttggt ggatcaagca cgtcaaatga ggacctacat gcatcacaga agagaaaatg 9660 acagggaatt actggtcacc tacgcagatt atgaagacta tgtcggagcc cttgctcgtt 9720 cttccctctg atggccagac cccgcggcgg ttaccgaaga gcaaaagcct tcctcgattc 9780 cagaacccat gaccagttca tcctccctgg gtagggacat caaggaacgt cacaagtctc 9840 gggacgtcca gcgatgatct ccctcagcag accgctatca catcaacaga agatcttcac 9900 cccttctgcg acgacgatca cctgtgcatc ccaaatattc tagaagacga tatcagcgga 9960 ctccttctcc tcgctatcat gaatgtgccc cttaccgtcg atctaggtcc ccagaaccct 10020 gccctctctg tcactcacat aactcaatcc cgatgggccc ctgggaaaat cagaataaac 10080 tcgactctgc tgctgttccg aatgtggact caaatacaac catgcagatc cctccattgc 10140 ccgtattacc cagtggacct gccttcgtag gattaccgca taacaccccg ctgccctttg 10200 gcgcaaacgt tgccttcatg tggtcaccat caggcaatac cttcagccct gtccttttac 10260 agggaaatac gactgtacta 
cccttcccta tgccatcgtt ggaggcaacc cctagtgctc 10320 tgttaccatg gccagtggct gcggcattgc cttcaatgac cgtgccattg tctttcgccc 10380 ttgaatggcc tacagagcca ttagcttcca gaattttgac tacaagacaa atgccttctg 10440 gatccccacc tcccaccccc acacagccaa ccctcatgtc tcagatgaca gaacgtctat 10500 ccgtcaggtt gagcaacaca gcatcttccg atctagctgc cagactctcg gatcccactc 10560 agtgggtagc tgccgaatga gtgtcggagg atttgcctat gggaagccct gtcccttgga 10620 atagtcatat gtcttgggga ggtgcagttc agccaccaac ggccccactg gtcactccgg 10680 atgatctgca tcctgcgcag gattccatgg acgaggatcc cgtcgatctt gatgatgtcg 10740 gagaatcaga ctataagagg acaaagaggg gtcgtcagag tgggcagaag atccaggggt 10800 acagatag 10808 // ID Gypsy-60_MLP-LTR repbase; DNA; FNG; 150 BP. XX AC AECX01001396; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-60_MLP_; KW Gypsy-60_MLP-I; Gypsy-60_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-150 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001396; Positions 40103 39954. XX SQ Sequence 150 BP; 39 A; 38 C; 26 G; 47 T; 0 other; tgttatgatc cctatactgg ggatatgctt atattgagta caacctcacc gccatagttc 60 tgtgcgcgtg ccatccttgt tgtaacctca tgcttcaatc tcgttataat aataagcaca 120 cctgatctcg tagaagcctt gaccataaca 150 // ID Gypsy-2_BFB-I repbase; DNA; FNG; 6141 BP. XX AC AAID01002526; XX DT 25-FEB-2011 (Rel. 16.02, Created) DT 25-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Botryotinia fuckeliana genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_BFB_; KW Gypsy-2_BFB-LTR; Gypsy-2_BFB-I. 
XX OS Botryotinia fuckeliana OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Helotiales; Sclerotiniaceae; Botryotinia. XX RN [1] RP 1-6141 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Botryotinia fuckeliana genome."; RL Direct Submission to RU (25-FEB-2011). XX DR Genome; AAID01002526; Positions 8678 14818. XX CC Positions [4755-5243] - Integrase core CC 'ACCAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 172..1587 FT /product="Gypsy-2_BFB-I_1p" FT /translation="MAAQAPAPGLSPQFLFLNHVLNSAETDVSSFTDLFND FT PNAVHFAAAQAVFESIESITSSLQAQAAAAEAQVTDAQSKIVELKDTIEIL FT KLAIRNNPQAPPRRKYLTAPEKFSGVEQDIAKRQDQYTAFRSAVNRCLTAD FT YENESEFQKITFLANLLSGPAYTNNQIHFDAITDNPHDSNLWPSGWRTVAS FT IWCELNKQYQTLDLSRKASMDYDKCKMKKIPFGNFITEFKQLAIKCKKTAE FT QQVADLKLKVSQELIDASVNRPNKPSAADIDAWTIWWQGIYDDLEEKTHLD FT SQRKNQGTNFNRDQPKPHDRKQAPVATLPTPDLSSDPMQLDAARQGQRPPS FT RYTREQCIDLGLCLYCKKPGHIKSNCEEKRANDAKYNVNTNANMRYTPYTN FT RVPATNQYQGPGPYRNAAAPSHQNPFSSANRFSSTPLRYASPIPPSLTQEV FT DLGGYIEGSVHSEESGSSFPQPGMLKD" FT CDS 1659..6071 FT /product="Gypsy-2_BFB-I_2p" FT /translation="MSRDPLYLRSINSKSLRISTEISLAPNVIIPLVSLID FT SGAAGYGFIDRKVVSELRIPTRPLPYTRYLLLADGKPSDVLSQYALLDIRI FT NDHREIGLFYVTTLSTADPMILGLPWLQRHNPSIDWSAMTLRFTSTYCSRY FT CCPASTMATTVPDIANTHKITSQESREVASVDPKPPLRKLAPKPASMEEVT FT DESYENTSLPPRDQSTILEHIETQNARRNLDITNPKAPLEPNPYHTRCISG FT EAETRAKMIPNRPALKPAPARTVAGFRIHRRPPRNKNILPLLHTPIPSSPI FT DESFDVLRGIRPEMQDIKFFTANSFIQFCKSPDAYVTKVSWDELDRACELH FT HVMTGDAVQLRRGLVDSDIDKFMEKTDRPVPSATEIQDKLPPWLRNLYPGF FT LPSLANELPPRRSWDHKIEIIPGKEPPYNKGRPMSPAELRVVRRWLDDNLS FT KGFIRESRSRSAAPLLLAAKPGGGVRICQDYRGLNNVTIKNRYPLPLIKET FT LDALCHAKIYTKLDIIAAFNKLRIAEGHEWKTAFTTRFGLFESLVMPFGLC FT NAPASFQNYINHVLFDLLDRTCTAYLDDILIYSENVTDHRQHVREVVQRLI FT DAGLQIDIDKCEFETKRTKYLGLIITPGGIEMDPEKVSTILEWKPLTKLKD FT LQRFLGFANFYRRFIRDFSRIAEPLNRLLKKEQTWDWTSDQDDAFKTLKDA FT FSSAPVLTIFDHNRRTVVETDASDWAAGGVLSQYDDEGRLRPVAYFSSKHS FT 
AAECNYEIYDKELLAIIKSLEEWRPELYGAQEPFEIITDHKNLEYFTSTKM FT LNQRQARWAEFLSGFNFRIIYRPGHKAVRPDALSRRAEDRPAHADPNDDRI FT KNRLQTILPERVFDTAAFKDIIRQANSDLDLTIAPMGMIIPDTDKPIDDLI FT DKAYESSELVTTMRTTLRDPSARHWPLSIRKDLRIALQDCRLVNGKIFYRD FT RLFVPPSAELRTQIIYRTHSTGPAGHPGRVKTVDLVSRTYWWPNMSREIET FT FVQACQLCFRTKASRLAAPGFLEPLPVPFRPWSDISIDYITPLPVSERHGK FT KYQHIITVVCRLTKMRHFIPVVGLSAEELADRFVEKIYCLHGAPDNIVSDR FT GSQFVSEFWKHLSERLSIALKRSSSFHPESDGQTERINAMVEQYLRAFMNF FT HQNDWPDWLPLAEFALNNTTSETTGISPFFANYGFHPRLGIEPSTPVPPNL FT SYQRKREFLKANEVADRFGLILTKLKALAAQSIQRYEDYANKTRSDAPLYK FT EGDKVWVSTKNMKTNRPMKKGDDKWDGPYKVLKVYKRACLLQLPANFKIFP FT VFHNSLLRPLHRSPGLPGQDIINVTESRRNQGRVLERDDETHEETERWEFE FT EILDCHNEDGFHYQIKWKHHPASWQPAEDLRGQENVIIAFHLANPGKPDPP FT AWVGFKRPIAPDLLNPPPVPTPNATPKRGRGRPKKLRQLTTTHKEVHFHPV FT VGIRVF" XX SQ Sequence 6141 BP; 1662 A; 1757 C; 1317 G; 1405 T; 0 other; gtttgtacca tcttcataag aagctgtcac agcttcattt taattaaaaa ctccaccgac 60 gtcgcatccc gatcgcgctg atcgtaccca ccactggccc gccacgcacc ctaaccgctg 120 ttgtttaaga actctgcagg gttgctttgc gaccttgaga gtgtcctaag gatggccgcc 180 caagcacctg cacctggtct atccccacag ttccttttct tgaaccatgt tctcaactcc 240 gccgaaactg atgtctcttc gttcacagat ttgttcaacg atccgaacgc tgtacatttc 300 gcagctgctc aagctgtgtt tgaatcgatc gagtcgatca cgagttcact acaagctcaa 360 gccgctgcag cggaagccca agtcaccgac gcccaatcga agattgtcga attgaaagac 420 acaattgaga tattgaaact tgccattcgt aataacccgc aagccccgcc ccgccgaaag 480 tacctcactg cccccgagaa gtttagtggt gttgagcaag acattgcaaa gcgtcaggat 540 caatacacag cgttccgtag cgcggtaaat cgttgcttaa ccgccgacta cgagaacgaa 600 tcggagtttc aaaagattac attccttgct aacctgctct ctggtcctgc ttatacgaac 660 aatcagatac acttcgatgc aatcactgac aacccgcatg attccaacct ttggccctcc 720 ggatggcgca ccgttgcgag catctggtgc gaattgaaca agcaatatca gactcttgat 780 ctttcacgaa aagcctctat ggattacgat aagtgcaaga tgaagaagat cccgtttggt 840 aacttcatca ccgagttcaa acagcttgct ataaaatgca agaaaaccgc tgagcaacaa 900 gtcgctgatc ttaagctcaa ggtttcccag gagctgatcg atgcttcggt caaccgccct 960 aataaaccct ctgctgccga catcgatgca 
tggacaatct ggtggcaagg tatatacgat 1020 gatttggaag aaaagactca cctcgatagc caacgaaaga accaaggtac caatttcaac 1080 cgtgatcaac cgaaacctca tgatagaaag caggctcctg ttgcaaccct tcctacaccg 1140 gatctctcta gcgatcctat gcaattggat gctgctcgcc aaggtcagcg tcccccatct 1200 cgttatactc gagagcaatg tattgatctc ggcctctgcc tctattgcaa gaagcccggt 1260 catattaaga gcaattgcga agagaaacga gccaacgatg caaagtataa tgttaatact 1320 aatgctaata tgagatacac cccttacacc aatcgtgtcc cagcaaccaa tcaataccaa 1380 ggccccggac cgtatcgcaa tgccgctgcc ccctcgcacc agaatccgtt ctcgtctgcc 1440 aaccgtttct cgtccacgcc attacgctac gcctctccga tcccccctag cctcacccaa 1500 gaggttgatc taggagggta catcgagggt tctgttcact cggaggagag cggaagctca 1560 tttccgcagc cgggcatgtt aaaagattaa cctcggttca cagtcgcgaa tcgaggtcct 1620 ttgtgggaaa tttacctttg tgggttagaa gatcggttat gtctagggat cctttgtatc 1680 ttagatcaat aaatagcaaa tctcttagaa tctcgacgga aatctccctt gcgcctaatg 1740 tgatcatccc gcttgttagt ctcatagata gtggagccgc gggttatggc tttattgata 1800 ggaaagtcgt ctccgagctc cgtatcccta cacgccctct cccttacact cgttacttac 1860 ttctagccga tggaaaaccc tccgatgtcc tgtcgcagta tgcactccta gatatacgaa 1920 tcaacgatca cagggagatc ggcttgttct atgtcaccac tttatcgact gcagatccta 1980 tgatcctcgg tctcccatgg ctccagcgcc acaaccccag catcgattgg tcggcgatga 2040 cactcagatt tacgtctaca tattgctcga gatactgctg ccccgctagc actatggcaa 2100 ccactgtccc cgacatcgcc aatacccaca agataacttc gcaagaaagt cgcgaagttg 2160 catcagtcga ccctaaacca cccttgcgga agctcgctcc gaaaccagcc tctatggaag 2220 aagtcaccga cgaaagctac gagaatacct cgttaccacc tcgagatcaa tccacgatcc 2280 tcgaacacat cgagacccaa aacgctcgga gaaacctcga tatcaccaat cccaaagctc 2340 cgctcgaacc gaatccatac cacactcgct gtatcagcgg cgaagcggaa acccgagcta 2400 agatgatccc gaaccgacca gccctgaaac cagccccggc tcgaacggtt gccggcttca 2460 ggatacacag gagaccacca cggaataaaa acatcctacc cttattacat acaccgatac 2520 catcgagccc catagacgag tcgttcgacg tcctgcgagg tatccgacct gaaatgcagg 2580 atataaagtt ctttacagcc aacagtttca tccaattctg caagagccca gatgcttatg 2640 tcacaaaagt gtcctgggac gaattggaca gagcttgcga 
gctacatcat gtgatgacag 2700 gagatgctgt tcaattgcgc cgtggcttag tagatagcga tatcgataag ttcatggaaa 2760 agacggatcg acctgttcct tccgctacag agatccaaga taagctacct ccctggctcc 2820 gcaacctgta cccagggttc ctaccgtccc tggcaaatga gcttccaccg cgacgctcct 2880 gggaccacaa gatcgagatt ataccaggca aagaaccacc atataacaaa ggccgtccga 2940 tgtccccagc cgaacttcga gtcgttcgcc gatggctcga cgacaacttg agcaagggat 3000 tcattcgaga atctcgctcc cgatcggccg ccccactcct tctagctgcc aaacccgggg 3060 gaggcgttcg aatctgtcaa gactaccgcg gtctaaacaa cgttaccatc aagaaccgat 3120 accccttacc attgattaag gaaactcttg atgctctttg ccacgcaaaa atctacacca 3180 agcttgatat catcgccgcc ttcaacaaac tacggattgc agagggccac gaatggaaaa 3240 ccgctttcac tactcgcttc ggtctattcg aatcattggt gatgcccttt ggactctgca 3300 acgcgcctgc ctcattccag aattatatca atcatgttct ttttgatctc ctggacagga 3360 cttgcacggc gtatttggat gatattctaa tctactccga gaacgttacg gatcataggc 3420 agcatgttcg tgaggtggtt caacgactaa ttgatgctgg actacagatc gatattgaca 3480 agtgcgagtt cgaaacgaag agaactaagt acctcggctt gatcatcact cctggaggca 3540 tagaaatgga tccggagaag gtcagtacta tactcgagtg gaaaccacta actaagctca 3600 aagatctcca acgcttcctt ggatttgcta atttttaccg acgtttcatt cgcgacttct 3660 ccagaatcgc agaacccctc aatcgactcc taaaaaagga acagacgtgg gattggacct 3720 ctgaccagga cgatgccttc aagacattga aggacgcttt ctcctccgcc ccggtcctca 3780 ccatatttga tcacaatcgc cgcacagtcg ttgagaccga tgcttccgac tgggctgcag 3840 gtggcgtcct ttcgcaatat gatgatgaag ggcgcctacg cccggtagca tacttctcat 3900 caaaacatag cgctgctgaa tgcaactatg agatctacga caaagagctc ctggccatta 3960 taaagtcctt agaagaatgg agaccggaat tgtatggagc tcaggagcca ttcgagatca 4020 tcactgacca taaaaacctc gaatacttta cgtccacgaa gatgttaaac caacggcagg 4080 ctagatgggc cgaattcttg tctggattca acttccgtat tatataccga cccggccaca 4140 aagccgttcg ccccgacgct ttgagccgca gagcggaaga tcgcccagca cacgccgacc 4200 ctaacgacga ccgaatcaag aaccgcttgc agacaatcct cccagagaga gtatttgata 4260 ccgctgcatt caaagacatc attcgacagg cgaacagtga tctagacctg actatagccc 4320 caatgggtat gatcatcccc gataccgaca agccgatcga cgacctcatc 
gacaaggcat 4380 acgaatcatc cgagctggtt accaccatgc gcacaacact tcgggatccc tctgctcgcc 4440 attggccgct aagcatccgt aaggaccttc ggatcgcctt acaagattgt aggctagtaa 4500 atggaaagat cttttatcgc gaccgcctct ttgttccccc ctccgccgaa cttcgcacgc 4560 agattatata ccgaacccat tcaaccggac cagccggaca cccgggccgc gtcaaaactg 4620 tcgaccttgt ctcccgtacg tattggtggc ccaatatgtc aagggagatc gagacattcg 4680 tacaggcttg ccaactctgc tttcgaacaa aagcttcccg cctcgccgca cctggattcc 4740 ttgaacccct gccagtaccg ttccgcccct ggtccgatat ctctatcgac tatatcacac 4800 cactaccagt tagtgaacgc catggaaaga agtatcaaca catcatcaca gtcgtctgcc 4860 gcctgaccaa gatgcgtcat ttcataccgg tcgtggggct gtccgcagag gagctcgcgg 4920 atagatttgt cgaaaaaatc tactgcctac acggtgcccc tgataatatc gtgtccgaca 4980 ggggctcgca attcgtctct gagttttgga agcatttatc cgagcgattg tcaattgccc 5040 tgaaacgatc ttcctcattc catcccgagt ccgacggcca gacagaaagg attaatgcta 5100 tggtcgaaca atacctgcga gcttttatga acttccatca gaatgattgg cctgattggc 5160 ttccactcgc agaatttgca ttgaacaata ccacgtcgga gaccaccggc atctccccat 5220 tcttcgctaa ctacggtttc cacccacgtc tcggcatcga gccatctaca cctgttccac 5280 caaatctgtc ttatcaaagg aaacgagagt tccttaaggc caacgaggtt gcagatcgat 5340 ttggtctcat cttaacgaaa ttgaaggcac ttgcagccca atcgatccag agatacgagg 5400 actatgccaa caaaacccgc tcggatgccc ctttgtataa ggaaggtgat aaagtatggg 5460 tgagtacaaa gaacatgaaa accaatcgac cgatgaaaaa aggtgatgac aaatgggacg 5520 gcccttataa ggttctcaag gtttacaaac gtgcttgcct gctccaactc ccagctaact 5580 tcaaaatctt cccggtattc cacaactcac tgctccgacc gctccaccgc tcaccaggat 5640 taccgggcca agacatcata aacgtcacag agtcacgcag gaaccaaggc cgagtccttg 5700 aaagggatga cgaaacacac gaggaaactg aacgatggga attcgaagaa atcctcgatt 5760 gccacaatga ggacgggttt cactatcaaa ttaaatggaa acaccacccg gcgtcctggc 5820 aacccgccga agacctccgc ggccaggaaa acgtgatcat tgccttccat ctagcgaatc 5880 caggcaagcc agaccctccc gcctgggtag gtttcaaaag acctattgct cctgatcttc 5940 ttaaccctcc accagtacct acaccaaacg caaccccgaa acgaggccga ggcagaccta 6000 agaagcttcg tcaattaaca actactcata aggaggttca ttttcatccg gtagtgggga 
6060 ttagagtatt ttagtgttat tttgttacct tagattaatt tatattgcga tcgtggacca 6120 atcgccttga gcggggggta c 6141 // ID FOSBURY repbase; DNA; FNG; 253 BP. XX AC U15189; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 11-NOV-1996 (Rel. 1.1, Last updated, Version 1) XX DE Magnaporthe grisea Fosbury retrotransposon, 5' LTR. XX KW Gypsy; LTR Retrotransposon; Transposable Element; FOSBURY; KW Gypsy group; LTR; Retrotransposon Fosbury. XX OS Magnaporthe grisea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-253 RA Shull V. and Hamer E.J.; RT "Genetic differentiation in the rice blast fungus revealed by the RT distribution of the Fosbury retrotransposon."; RL Fungal Genet Biol 20(1), 59-69 (1996). XX RN [2] RP 1-253 RA Shull V.; RT "Mitotic Instability in Magnaporthe grisea can be due to the RT Transposition of fosbury, a Novel Rice Pathogen-Specific Gypsy RT Class Retrotransposon."; RL Thesis (1994) Biological Sciences, Purdue University. XX RN [3] RP 1-253 RA Shull V.; RT "FOSBURY."; RL Direct Submission to Genbank (28-SEP-1994)Verel Shull, Biological RL Sciences, Purdue University, West Lafayette, IN 47907, USA. XX DR GenBank; U15189; Positions 338 590. XX CC This is a LTR sequence from Gypsy group retrotransposon Fosbury. CC 5 bp terminal inverted repeats. XX SQ Sequence 253 BP; 51 A; 86 C; 60 G; 56 T; 0 other; tgtcacagac ctgaaggaca gccactgggc tgtcgcttag tcatgacccc tgtcacgtga 60 aggcgcgagg gccgagcgcc tcgcgcgcgt atatctacgc gaaatctctg cagtctccga 120 tatgtagctc ctcgagctac gtcttcgtcg acaaccacct gctacactgt acctggaaga 180 ctagatagcc aatatactct gtcctgttac ctgatcgccc gcacctacct gtccgccctg 240 cctgcccgtg aca 253 // ID Gypsy-7_LBS-LTR repbase; DNA; FNG; 767 BP. XX AC ABFE01000288; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_LBS_; KW Gypsy-7_LBS-I; Gypsy-7_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-767 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000288; Positions 35001 34235. XX SQ Sequence 767 BP; 238 A; 186 C; 133 G; 210 T; 0 other; tgttatggta catgtgctct cactacagtc ttattccctc tcatcctaca gtacttacct 60 cctacctctt ttcttaactc cttacggact cccatcagtg tacttagtct attcttgtgt 120 ttaccaactg tctgtaaata gcgcgtagta tacccggatt caacacgcta tctccaccta 180 acgcgttgtg atcagtctcg acgcgtgtaa tcagttgtga cgcgacgcga ctgcttggca 240 acgtttggat aagagctgtt tggtcaatca aggggtagcc aaacaaggac atatccaaac 300 acgccgaaca agaacaaagg ctcgatatat cttgaacgtt ccaaggtatt ctataaagat 360 ctagaaatac gaaaataggt acttaagcac acttgtaaga tagaagaaat caatcatctt 420 ttaccccaaa tataactgcg tgaactaagt tcttgagaaa gactaaacac gacgctacag 480 caagtattta gtaaagccgc gcgctactaa ctacttcaag tccgaaacgt taacgtcttt 540 agactaacgt ttctgattgg cgccgcttaa gtcgaccaca agaaacatag gtctcacagg 600 atctagagca agcccaaaca cttacactca cacttgacta tcagccttcc ataaaggttt 660 attctaactt acaagtctac atagtgagtc ttcattttat aaagctaact aacaagttgt 720 ttgtattcac aggaatacac caacgccaga gtccgagaac tctaaca 767 // ID Gypsy-51_MLP-I repbase; DNA; FNG; 7977 BP. XX AC AECX01002217; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-51_MLP_; KW Gypsy-51_MLP-LTR; Gypsy-51_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-7977 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002217; Positions 44798 36822. XX CC 'ACGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 370..6798 FT /product="Gypsy-51_MLP-I_1p" FT /translation="MSSIIQIHDQEGGEEGDSELELPPTSHYKRTPRSSRT FT YALENPVNYSTSLPSGSLPPPPGPASGSTVRGKQREESTSEDPIPPSAFTG FT FREQETLLPSDHEEEPDWQQQFINQQQHYLRREREMADIINLIRLKEEKGK FT KERVKEVIKVVKKGKGKKISRKKEVIKEEESDGDSSASSSSSDDSSSTSEE FT SEEEEGNERSRSHVKPRKIDNSDTSVRFEVGGNVKKFIINYHSQAEANGAS FT ELDMVRQITSFVVGEELKEEIRDMEGWGVGEWSWKKVKEQLIARYQTTTEQ FT PRYSIDHLRTLSERTMRNGGVTCKSEYQTFRINFDKIHQYLARHDYVDSSD FT EEVARYFYEAFSDELQKKFKKRMMKKVSSGRYKLPNLEKLKKVVDHYIEEE FT VLLVFEDIGEEKYTIKNEIAGVKSEVIQKDTSSKVAKIGGMQAKANQQMEI FT LCTDLKNLQINHVQNPLQLPLNNSYNFQIPISNALQNQIHQQVVPNQRNNV FT QYPPNNNSRPQQFNNNFQPNRAPNNSNFNRFNNNNNNRPVNNPLPNNNNQI FT PAINNQNFMPQQNTMPFCSYCNMEGHSMGRCKWVIQDKQAGLLQDRMDGLF FT LPRSNERMYKKANEGIRQQVIEYSEDQANAMARNNMQQQQQNNQPVNNPPV FT TNTAVVPPVIPTSVLTREEGKDKVHELKTNCGVLEEWSVPEFTKKCTEVKA FT GAGTKLYGVEMKTRSGKAVDEGDVLNKLKKKKKVTINDMIEEIDAMDVDED FT NNVQDQHDILDGSLRRIHKDEESNKSEDGSAQTKKPTYIIPNDGDFVKDRL FT SKGLPAFANKAKIVSEVVEKIMNGAIELQIKELCTISPVISDEVKKWVSKR FT RLNLNQTTMASVANNSILSDFEYETESDSSLEVNVPQATLYSTPLGHVEIT FT IGKMKVKALIDSGSQINIIPLKIMQLLPVQTLVNLKSGVRGISGHRTPLCG FT IAENVKIHVGANIDGIVHFFVAEDDDTPILLGRPFLFDFEAQLNFEEDVGE FT RMTLKDSRGVWMKIRLCEPDKGTWERKLNVSIEERQGFMLENDGGNKLFEE FT LREELIEDEIYGLKFENEIHMEQTLIDEINIIMNLENEDIFENQGGIEYRS FT FGAKYKSVERKVKPVNAPMPQYLNFPLQRPALSRDPYKTPLIPNPPEFIET FT DKTTVERLKQCNFGPSGWLSDEEWKLFLFILVLREGAVAYCEEERGLLKHS FT YGLPYVVPVIDHEPWQIRPIPIPTAIRNDYIELVRQRIKTGLYEQSSSSYS FT SPVFCVLKGDGKLRIVHDLQVLNKVTIKDAGIPPSPEEFVESFSGRACYGL FT GDIMGGYDERELAQVSRPLTTFETPLGRLQLTRLPQGATNSVAVYQAQMMW FT ILQEELPKHAGIFIDDGGIKGPESDYNNEVLKENNGRRKFIWEYAIILERI FT LFRIEEAGLTVSGKKFAVCVPALEIVGHIVSGKGRSVSIKKKNKIQTWPIP FT TNKTQVLGFLGTCIYVRMFIPNFGHLAAPLRRLTRMNVDWEWNQDCDKAFN FT 
ILKKIVGEDIVLAALDYSEGANKIILAVDSSYIAAGGVLMQENDEGEIRPV FT FYESEVFTEVESRYSQPKLELCGVAKIMRKFKTKLWGQHFELQVDAKALIQ FT MINAPSLPNAAMTRWVAYIQLFSFDLVHKHGKSFGMPDGLSRREPGSESDE FT AESFDEDKKEIKVYESFQLQVGSSELEEEEVILEENSMWSQEGIWRNLEEY FT LTTLIRPKDCEDAEFQKIKSKVPLFYVDGSRLKKRGVPQGRLVISRIEGQE FT HILKKLHEELGHRGIEETYRRCLIRFWWPEMKESVRNWVVSCEECQKKSSK FT KEKEVGRATGENTVFGRVSLDAIHIKAGSHDYAIIARDDLSGWVEAAPLKK FT LTATAVAKFITKEWIMRYGSVKCFTVDGGSEFKGIFREAVTMAGSKVVEST FT AYWPQAEGMVERGHKDIKGALIKLSDESGTSWEAFLPQVLFADRISTKRTT FT GMSPYEMIFNQVAVLPIDLEAGTFLGIDWARITTGEELLKARMEQLLCREE FT IINKAYDKMMKARVQGIIYWDKKNAHRMRKPLKEGDLVLTYNKILEFQWGK FT LFKNRWNGPFRIIKQHVGGSYILEELNGVELSRRYAAEHVKRFYPRGIIPT FT AEEEEEVTDEEEAQL" XX SQ Sequence 7977 BP; 3060 A; 946 C; 1689 G; 2282 T; 0 other; tattggtgac tccactgggg ctctttactt tatacaagca agatccgtca attatattta 60 ataatttccc taattattat cttaaaacat caacaattta ttattaaatc atcaataatc 120 aaattaataa taatacaact catcatttcg aaagaaaatt atattcaaca aatatcaaca 180 aaattcaatt aataaaattc aatatcaact aataacatat taaataacaa taatcaatct 240 ttaaaagaaa gaatataact taaaaaccac gcatatattc acttttaatt cattataact 300 ttcaaaatta tcaatatttc gagtttcttt attttataat atcattttat aataattaaa 360 atcaatatca tgtcttcaat cattcaaatt catgatcaag aaggaggaga agaaggggat 420 tcagaattag aattaccacc aacttctcat tataaaagaa ctccacgtag tagtagaact 480 tatgctttag aaaatcctgt gaattattca acatctttac cctctggctc tttaccacca 540 ccaccaggac ctgcttcagg atcaactgta cgtggaaaac aacgtgaaga atcaacttct 600 gaggatccaa ttccaccttc agcatttact ggatttagag aacaagaaac attattaccc 660 tcagaccacg aggaagaacc tgattggcaa caacagttta ttaatcaaca acaacattat 720 ttaagaagag aaagagaaat ggcggatata attaatttaa taagacttaa ggaagaaaaa 780 gggaagaagg aaagagttaa ggaagttatt aaggtagtta agaagggaaa aggaaagaag 840 ataagtagga agaaggaggt tattaaggaa gaagaaagtg atggagattc aagcgcctca 900 agcagctcaa gtgatgattc aagctcaact tcagaagaaa gtgaagagga agaaggaaat 960 gagagaagta ggagtcatgt taaaccacgg aaaattgata attctgatac aagtgttaga 1020 tttgaagttg gaggaaatgt taagaaattt attatcaatt atcatagtca agctgaagca 
1080 aatggcgcct cagaattgga tatggtcagg caaattacca gttttgttgt tggagaagaa 1140 cttaaggagg aaatacgtga tatggaaggt tggggagttg gtgaatggag ttggaagaag 1200 gtcaaggagc aattaattgc cagatatcag actaccacgg aacagccaag atattctatt 1260 gatcacttaa gaactttatc tgaaagaaca atgagaaatg gaggtgttac gtgtaaaagt 1320 gaatatcaaa catttagaat taactttgat aaaattcatc aatacttggc tagacatgat 1380 tacgtggatt caagtgatga agaagtagca agatatttct atgaggcttt ttcagatgaa 1440 cttcagaaga aatttaagaa gaggatgatg aagaaagtat caagtggaag atataaatta 1500 cctaacttag agaaattaaa gaaggtggtt gatcattata ttgaagaaga ggtgcttttg 1560 gtatttgaag atattggaga agagaaatat accatcaaga atgaaattgc tggagttaaa 1620 agtgaagtta ttcaaaagga tactagtagt aaagttgcaa agattggggg aatgcaggct 1680 aaagcaaatc aacaaatgga aatattatgt actgatttaa agaatttaca aataaatcac 1740 gtccaaaatc ctttacagtt acctttaaat aattcttata actttcagat tccaatttca 1800 aatgctttac agaatcaaat tcatcaacaa gttgttccaa atcaaaggaa caatgttcaa 1860 tatccgccaa ataataactc aaggcctcaa caatttaata ataactttca acccaacaga 1920 gcgcctaata attcaaattt taacaggttt aataacaata ataataatag acctgttaat 1980 aatcctttac ccaataataa taatcaaatt cccgccatta ataatcaaaa ctttatgcct 2040 caacagaata ctatgccatt ttgttcttat tgtaacatgg agggacatag tatgggaagg 2100 tgtaaatggg ttattcagga taagcaagct ggattacttc aagatagaat ggatggttta 2160 ttcttaccta gaagtaatga aaggatgtat aagaaggcta atgaaggaat tagacaacaa 2220 gttattgagt actctgaaga tcaagcaaat gctatggcac gtaataatat gcaacaacaa 2280 cagcagaata atcaacctgt aaacaatcca cctgtaacaa atacagcagt agtaccacct 2340 gtaattccca cgagtgtttt aactagagag gaaggaaagg ataaggtaca tgaattaaaa 2400 actaattgtg gagttttaga agaatggtct gtacctgaat ttactaagaa gtgtactgaa 2460 gttaaagctg gagctgggac taaattatat ggcgtggaaa tgaagacaag gagtgggaaa 2520 gctgtggatg aaggagatgt attaaacaaa ttaaagaaga agaagaaagt tacaattaat 2580 gatatgatag aggaaattga tgctatggac gtggatgaag ataataatgt tcaagatcag 2640 catgatatat tggatggaag tttaaggaga attcataaag atgaagaaag taataagagt 2700 gaagatggtt cagctcaaac aaagaaacct acatatatta ttccaaatga cggggatttt 2760 
gtgaaggaca gattaagcaa aggattacca gcatttgcaa ataaggcaaa gatagtttct 2820 gaggttgtgg aaaagattat gaatggtgct attgaacttc aaattaaaga attatgcacc 2880 atctcacctg taatctctga tgaagttaag aaatgggttt caaaaagaag gcttaattta 2940 aatcaaacaa ccatggcttc agttgcaaac aattctatat tatctgactt tgaatatgaa 3000 acagaatcag attcctcttt ggaagttaat gttcctcaag caactcttta ttcaacacct 3060 ttgggacacg tagaaattac aattgggaag atgaaggtta aagctttaat tgattcaggc 3120 tctcaaatca atattattcc tttaaaaatt atgcaattat taccagttca gaccttggtt 3180 aatttaaaaa gtggcgtgag aggaattagt ggacatagga cacctctttg tggtattgct 3240 gagaatgtca agattcatgt aggcgccaat attgatggaa tagttcattt ctttgtggca 3300 gaagatgatg acacgcctat cttacttgga aggcctttcc tctttgattt tgaagctcag 3360 ttgaattttg aagaagatgt tggtgaaaga atgactttaa aagatagtag aggtgtatgg 3420 atgaagataa gattgtgtga acctgataaa ggcacgtggg aaaggaaatt aaatgtttca 3480 attgaagaaa gacaaggctt tatgttggaa aatgatggag gaaataaatt atttgaagaa 3540 ttacgtgagg aattaattga agatgaaata tatggactta agtttgaaaa tgaaatccat 3600 atggagcaaa ctttaattga tgaaattaat atcattatga acttggaaaa tgaagatata 3660 tttgaaaatc aaggaggaat tgaatataga agttttggcg ccaaatataa atcagttgaa 3720 agaaaagtta aacctgttaa tgcgcctatg cctcaatatt taaattttcc tcttcaaaga 3780 ccagctttat caagagatcc ttataagact cctttaattc caaaccctcc agaatttatt 3840 gaaactgata agaccacagt ggaaaggttg aagcagtgta attttggacc ttcaggatgg 3900 ctttcagatg aagaatggaa actatttctt tttatattag tcttaagaga aggagctgtg 3960 gcttattgtg aagaggaaag gggtttattg aagcactctt atggattacc ttacgtggtt 4020 ccagtaattg atcatgaacc ttggcaaatt agaccaatcc caatacctac agcaattaga 4080 aatgattata ttgaattagt aagacaaaga attaagactg gtctttatga acaatcctct 4140 tcaagttatt ctagtccagt tttctgcgtg cttaaaggtg atggtaaatt aagaatagta 4200 catgaccttc aagtattaaa taaagtaact attaaagatg ctggaatacc tccttcacct 4260 gaagaattcg tggaatcttt ctctggtaga gcttgttatg gattagggga tattatggga 4320 ggatatgatg aaagggaatt ggcgcaagtt tcaagaccat taacaacttt tgaaactcct 4380 ttgggaagat tgcaattgac aagattacct caaggtgcaa ctaattctgt agcagtttat 4440 caggcgcaaa 
tgatgtggat attgcaagaa gaacttccta aacatgcagg aatatttatt 4500 gatgatggag gaattaaagg acctgaaagt gattataata atgaagtatt gaaagaaaat 4560 aatggtagaa ggaaatttat ttgggagtat gctattatat tggaaagaat attattcaga 4620 attgaagaag cagggttaac agtgtctgga aagaaatttg ccgtgtgtgt accagctttg 4680 gaaattgttg gacatatagt aagtggaaaa ggaagaagtg tatcaattaa aaagaagaat 4740 aaaattcaga cttggccaat tccaacaaat aaaactcaag ttcttggatt cttgggcacg 4800 tgtatttatg tcaggatgtt tattcctaat tttggacacc ttgctgcacc tttaagaaga 4860 cttacaagaa tgaatgtgga ctgggaatgg aatcaagatt gtgataaggc ttttaatata 4920 ttaaagaaga tagtgggaga agatattgta ttggcagctt tggattatag tgaaggagca 4980 aataaaatta tattggccgt ggattcaagt tatattgcag ctggaggagt tttgatgcaa 5040 gaaaatgatg aaggagaaat aaggcccgtg ttttatgagt cagaagtatt tacagaagtt 5100 gaatcaagat actctcaacc taaattggaa ttatgtggcg tggcaaagat tatgaggaag 5160 tttaaaacaa aattatgggg tcaacatttt gaattgcaag tagatgcaaa ggctttaatt 5220 caaatgatta atgcgcctag tttaccaaat gcagctatga caagatgggt agcttacata 5280 caattatttt catttgatct tgttcataaa catggaaaga gctttggaat gcctgatgga 5340 ttatcaagaa gggaacctgg aagtgaatca gatgaagcag aaagttttga tgaagataag 5400 aaagaaatta aagtatatga aagttttcaa ttacaggtgg gaagttcaga attagaagaa 5460 gaggaggtta tacttgaaga aaattctatg tggagtcaag aaggaatttg gcgtaatttg 5520 gaggaatatt taacaacttt aataagacca aaagattgtg aagatgctga atttcaaaag 5580 attaagagta aagtaccttt attttacgtg gatggtagca ggttaaagaa aagaggagta 5640 cctcaaggaa gattggtaat atcaagaatt gaaggacaag aacatatatt aaagaagtta 5700 cacgaggaat tgggacatag aggaattgaa gagacttaca gaaggtgttt aataagattt 5760 tggtggcctg aaatgaaaga atcagttaga aattgggtag tatcatgtga agaatgtcaa 5820 aagaaaagtt caaagaaaga aaaagaagtt ggaagggcta caggtgaaaa taccgtgttt 5880 ggaagagtca gtttggatgc tattcatatc aaagctggaa gtcatgatta tgctataatt 5940 gcaagggatg atttatctgg gtgggtagaa gcagcacctt taaagaaatt aacagctaca 6000 gcagtagcca aatttattac aaaagaatgg ataatgagat atgggtccgt gaaatgtttt 6060 acagttgatg gtggatcaga atttaaagga atatttagag aagcagttac tatggctgga 6120 tcaaaagtgg tggaatcaac 
agcttattgg cctcaagcag aaggaatggt tgaaagagga 6180 cataaggata ttaaaggcgc attaattaaa ttaagtgatg aaagtggaac atcttgggaa 6240 gcatttttac ctcaagtctt atttgctgat agaatttcca cgaaaagaac aactggaatg 6300 tctccttatg agatgatatt taatcaagta gcagtattac caattgattt agaggctggg 6360 acatttttag gtattgattg ggcaagaatt accacgggag aagaattatt aaaagcaagg 6420 atggaacaat tattatgtag agaagaaatt attaataagg catatgataa aatgatgaaa 6480 gcaagagttc aagggattat atattgggat aagaaaaatg cgcatagaat gaggaaacca 6540 ttaaaagaag gagatttagt attaacttat aataagattt tggaatttca atggggaaaa 6600 ttattcaaga atagatggaa tgggcctttt agaattatta aacaacacgt gggaggatct 6660 tatattttag aagagttaaa tggtgtagaa ttgtcaagaa gatatgcagc agaacacgtc 6720 aaaagatttt atccaagggg tattattcca acagcagaag aagaggagga ggtcactgat 6780 gaggaagagg cgcaattata attacttaaa gtatataata atattaattt caaataacta 6840 aatataaaaa tatataataa gactaaaaga atagaagaaa agaatttgaa aatataataa 6900 agaataagaa taatggatat gtttcactag ttatcaatgt acatttaata gtaaatagta 6960 atattaagaa ggagaagatg ggtaaaggcg tggatcaagc caactgggaa gttcataaat 7020 ggtaggagta tattcagaat aattataatc agtatatggg gaattaacag aaggaattac 7080 gggtggatca ttattaacta ttggaagttc agtattagga ggattagtag aaggagtatt 7140 ataaggggat aataaagcag catttgcacg ctcagcatga ttaataaatg gaataaaggt 7200 ggatacagtt aaattttcat cctcaccctt tctggaaaca tattcagcaa tagaatgggc 7260 acgcaaaaga tgataatgat tattagtaat gttgggtact tcatatccag cttctttaaa 7320 ctgtcttata gtatcatcca agatcctcca agaatgccaa ttttgataag taatatagtc 7380 aagtttccaa gctaagttat gttcatataa tttttcgaaa aaggggtcag aagtagtaaa 7440 attggtttca attgatgaag ttattgaagg gtttggagaa tcaggagctg aaggtgtatt 7500 gtgttgatta atattataag gatttaagaa attgactctg ataggtgtat taggcgtgtt 7560 tggaggtggg aaattgtaag caggagaacc aggtacaggt gaaatattgg tagaaggcgc 7620 tgctggatag gctagaatta aaagaaaatt ataaaagaaa gtataaaagg aataaaatta 7680 attaatataa ataaaaacat tgaagattat tatagtaaaa gaaatgaaag aaaagactca 7740 cagcatatta taccaaggtt atggcatctt tgacacgcag gatttatttg atataagcag 7800 gaagaatgga ccatgcggca ttgaacacaa 
ggtgaagcaa cagatggagg tgtagtgata 7860 accatattta taattaagaa ggatatttaa agatttgagg aaaaattcaa atgtatggag 7920 tagatgattt attttattga ttggtaaact gggggcagtt tgggaagaag aggagga 7977 // ID Copia-24_MLP-LTR repbase; DNA; FNG; 802 BP. XX AC AECX01002457; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-24_MLP_; KW Copia-24_MLP-I; Copia-24_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-802 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002457; Positions 8124 7323. XX SQ Sequence 802 BP; 160 A; 195 C; 148 G; 299 T; 0 other; tgttatgata ctgtctaagg tagtcaagtc catcccgatt aggaaatcaa acgctacagt 60 attaggtctg aattcaggta tcttcaagtc gtgtagaatg tttccctttt gggaacatta 120 tttgtgtgtg ggctttatca aaatggttgt gtacgcgtgt ctagtctgta tctgtgtatc 180 cccttgcttc agggtgaaaa ttgtgtgcga ttcagtttct ctcttatata cccagggtct 240 ctcgagctcc ttgttccccc ctcatcgtgt ggaacttgga agcttagagt acgcacctta 300 ttttctattt ttttctcttt tcatatttca cttcttggat gagtgctcat gtgtcacgat 360 agacctccct accactcttt cctcctgctc actcgcaggt aaggcgtcga ctttctccgc 420 cagtctttcg tgaatatttc tttgagtctt acaactcatc tataacctgt ctgtccaggt 480 aaatcttgtg ttctcttaaa gttcttttat ttttctcttt actagaagtg gaatgttatc 540 atcgtgtgga acttggaagc ttagaacctc cctaccactc tttcctcctg ctcactcgca 600 ggtaaggcgt cgactttctc cgccagtctt tcgtgaatct ttctttgagt cttacaactc 660 atctataaca tgtctgtcca ggtaaggcgt cgactttctc cgccagtctt ttgtgaatct 720 ttctttgagt cttacaactc atctataacc tgtctgtcca ggttacacgc gttgaatttc 780 ttccgcgtct ctcatcattt ca 802 // ID Gypsy-15_MLP-LTR repbase; DNA; FNG; 341 BP. 
XX AC AECX01001312; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_MLP_; KW Gypsy-15_MLP-I; Gypsy-15_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-341 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001312; Positions 7692 7352. XX SQ Sequence 341 BP; 76 A; 76 C; 81 G; 108 T; 0 other; tgtaaggagt acacatagac ttagcagtta tgggaagtgt agcatgggat tagccaggtc 60 atactttatg tttctttcca acgcccggct tcggagagtc ctcactactc agagagccaa 120 ttgtgggact ttctggagct aggtgagttt ctcttcatct tgttgtttct tttcttttgg 180 actttgtcgc tgactactca gagagccaat tgtgggactt tctggagcta gatcgtaatt 240 tcatataacg atgttccaag tttatcgccc gtagtgctct ctcgaggagt cgagccagaa 300 ccttatctgg accccattga aggttcccac ggaaccttac a 341 // ID Gypsy-84_MLP-I repbase; DNA; FNG; 8705 BP. XX AC AECX01001040; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-84_MLP_; KW Gypsy-84_MLP-LTR; Gypsy-84_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-8705 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001040; Positions 117176 108472. 
XX CC Positions [5792-6211] - Reverse transcriptase CC Positions [7499-7978] - Integrase core CC 'TGAGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 4457..8602 FT /product="Gypsy-84_MLP-I_1p" FT /translation="MKKEELSNWREVTAFDGRRSFIRFKVEIRVEDEDDTE FT EFRVAELKGQYDVILGMPWFRKHGHLINLSDSTIGCITSVPQQVPTEDGIV FT DKTRKLEEGANRLSVVTPPRCESVCPNSLDKRTFTGKDPILEQVVLPEKVT FT IEEESPDAPMAGQLGKLGGQVSKSDPGVAAISPSANAEPPPSESDCTISVV FT DKARAGKDLLLEQVDHQDESTQAMGQTQMKSEKTKIHEKKALTSKQKVRRN FT KRRNLARKKKKALKSPDASQAGQLGKLGGQASTCDSGVAAISPSANAEPPL FT SESEFLFVMDDYEDVIASKFFRGHVEQPNSNDVKVDALKSSWSKSAELAAS FT DPKLQAERPAAELVPEEYHEFLPLFEKKSSLRLPPRRKYDFRVDLLEDATP FT YAGRIIPLSPAENEALYKMVEEGLKAGTIRRSTSPWGAPVLFTGKKDGKLR FT PCFDYRKLNSMTVKVKYPLPLTMELVDSLLDAEEFTSLDMRNGYNNLRVAK FT GDEAKLAFICRAGQFEPLTMPFGPTGAPGYFQFFIQDILKDRIGKDVTAFL FT DDMMVYTKKGSVHKDAVRHVLQVCKDQSLCLKPEKCKFSQSEIAYLGLIIS FT KNRVKMDPKKVEAVTDWPAPKNVSEVLSFVGFANFYRRFIDNFSKVARPLH FT ELTKKDTKFEWNQQRQDAFDTLKRLFTSAPVLKIADPYKAFVLECDCSDFA FT LGAVLSQVSDVDGELHPVAFLSRTLVPAERNYEIFDKEMLAILASFKEWRH FT YLEGNPNRLDVIVYSDHKNLESFMTTKQLTRRQARWAETLGCFDFEIRFRP FT GKQSQKPDALSRRPDLQPRGEDKLTFGQLLRPNNLREDTFIADITIDSFEL FT LFEDEIGGLQFEDNFEDEECVEICALSSGEDIWDDEMILSSLRNKLKDDDK FT IQHLIKSITLRNAKQDNSEEEYEFNDGLLYFRKKVVVPRDDKLKFEILRSR FT HDSKLVGHPGRARTLSMVERNFYWPSMKKFVNRYVDGCSSCQRVKSRNTKL FT FGSLQPLPVPAGPWTDICMDLITDLPLSEGKDTILTVVDRLTKMGHFIACN FT KTTDSIGLARLLIDNVWKLHGTPKSIVSDRGSIFVSRITREMNAQLGIKTK FT PSTAFHPQTDGQSEVSNRSVEGYIRHFANYDQNNWVKNLSTFEFAYNNLDH FT TAIGMSPFKSNYGYDLSLGRVIHGERCVMAVEERLKQLTEVQEEIKAQMER FT SQEIMKLNYDQKVKQEEEWEEGSKVWLNSKNLSTTRPTAKFSHRWLGPFKI FT EKKISPVAYKLCLPLSMKSLHPVFHTSLLRQYEPDTIEERKQAKPEPIVLE FT GVEEFEVEEILDCRRRRGVLEYLLSWKGYGPEEDSWEREESLENAQDLLND FT FVKRYPEAENGYKRTRRKK" XX SQ Sequence 8705 BP; 2645 A; 1776 C; 1930 G; 2354 T; 0 other; tattgtacca tcttaaaagt acagacgcca gaagacaaat caatattaac tttaaatccg 60 aaacctcatc tcacattata accttatcaa cgatacaaac taacgccaca atctacaaac 120 tcgacagtac aaacctacgc cacaactcaa 
taaccccaac tgtacgaaac aactgtataa 180 ctcttggata cagactagtc tcactcgtaa ttaggccctc gccccggtcg atccccgtcc 240 cctaattagg ggacgaggaa tgatccccga attaggggac gggcaatgat ccccttcccc 300 taattaggtt aggggaacaa tccccgtccc cgaatttgga gacggggaat gacatcggta 360 ttttcgggga atatcgccct tttgcaaata caaaaacgaa cgattagacc gttcacggta 420 gaccacaaag gaatcaagga ccttcaacaa cgatacttaa caaaatctca aggttattcc 480 aatcaatgtc gagtcgatgg gatcaatgga agtttgaaag taaggttcaa ttgtgatcag 540 gtctttcatg aagaatccta caaacatctg agctaatcat tttatcatcg gttctttatc 600 attgcttcta tttcaagttt tagattcctt taaacggatc ttgattcttc atgtcattca 660 gatgtttgga aattataatt tataacgtac aaaaagagta atttgaaaga gaagtgatac 720 ttaaaagttg aaacttttca atttgtttaa tttttgttgt ctttcaatgc aatgagtgcc 780 aatgacacca tcaaatcagt ttcttcaaag tggttcaact catccaaaca aactaagact 840 gttgaaataa atacacacta aatcaattga aagaataaag accaagatgt attttaatca 900 ggaatcgaga ggatttttga gaacgatttt cagcttcaat ttcaatttca atatcagttt 960 caatttcagt ttcaatttct tcttgagatt catcttgaga atgtttatgt tcttcttctt 1020 gtgctggtta attttgattt ggaaatgagg ttgaagtaat ttgatcagtt ctagttttga 1080 tagttgtact tcgagatctt cttctaaact ttgatcgatg tgatttgaat tcgacattac 1140 ttcctgagct cgtcatcctt tgggagtcgt tcgaagactc ttcggcctac gatttctcaa 1200 gattgattcg cagaatcccc ggcattcccc gtccccgtcc ccgaatttcc ccgaattcga 1260 ggaacagaaa cgatccccga attcggggac ggggataggg gatctccgaa tgaatcgggg 1320 acggggcgag ggcctactcg taattcgaaa caacgacgaa ctacgggata actcatagcg 1380 acgatcagaa ttctcaggat tctcaagatt caccgagttc tacatcttcg ttatacaccg 1440 ctgaaacaag tgattcaatg gcaaacttag aagaaatcac ccgccagtag gccctcgccc 1500 cggtcgatcc ccgtccccga aaccggggtc ggggatcatt ccccgaatta ggggtcgggg 1560 aatgatcccc gcccccgaaa ttggggatcg gggatcaatc ggggaatttc ggggaataga 1620 gcattttccc agaacaaaaa acaaacgatc gtccattacc caacaagata ccttcaccga 1680 tatcatctaa cctcaaacaa gatcaagtgg aaaggatcat tagatcctca cattcttcaa 1740 aaaccaatca aaaaaaatat tcaaaatgga agcatgtcaa aaaagttagg ttgtaaatta 1800 gtaatccaat ctctatcatg attgattcat cgaatcatca tttttgaacg 
ttttgagcta 1860 attcttttgg atttcatcat ccgttcactt ttagattcct ttacccagat atttaaattc 1920 atccgactgt attcacttca aaaatcatca tactgactca aaagacaaaa ttcattgagt 1980 agtgattcaa ccttgccgat ttgttttttt ttgttgttct aacttcttga aatgcaatcc 2040 aattatgaag gaatacatta tatcaagttt tgatagcttt acttcaagat ctcctaaact 2100 ttgatcgatg tgattctttt gatgtggaaa tgagattgaa gtgattcttt tgttttggaa 2160 atgagattga agtgaactga tcagttctag ttttgatagt tgtgcttcga gatcttctaa 2220 gctttgatcg atctgattcg aattcaccac tacgtgccga gttcatgatc ctttaggagt 2280 catttgaact tcaaagactc atctgactac gattcttcat gatccattcg gggaatcccc 2340 ggcattcccc gatccccgtc ccctaaatag gggacgggga tcgccccccg tcccctaggg 2400 gtaacttcgg ggatccccga atcattcggg gacggggcga gggcctaccc gccagattgc 2460 ggagcttaac gcacgcctca atgccaagac agcttgacgt ctcgaggaaa ctgcgcgaag 2520 agttgaagcc aaagcccgtc ttgaggcatc tatctctcaa acgacggtat cacaaccttt 2580 agcccaaacc ccgagcatgc ctgtcactca catggcgaag atggctaccc ctgataagtt 2640 tgatgggaag cggggagcta aagccgagat gtttgttcag caagtttccc tgtatgtctt 2700 agagcatcca ctatcgttgc tgtaaacagc ctcggtactg taaatttgac tgaaatttac 2760 cgtaccgagg ccaaaaatcc ccaatcgtgt acaaagtaca gatcggtagc tacagattcg 2820 gtcaaaaatc tgtagaggtc cagtaaatct actgcacctc tacagaatct gtaccgaatc 2880 ggtatccacc aaatctaccg attctacata gggattcatc accaccttcc caaaatcacc 2940 ctctactatc cttcaatctc tatcaccctc tcatcgatca actctttcat cttctcatcg 3000 atcaaatttt tcatcttctc atcgatcaaa tttcttgcct tcttgtcaat caaccatttc 3060 aaccaatctt ctcaataatc gacttgatca aatcaaagtg acttcatcaa atgaacatct 3120 tgtgctattt ttttgctttt tttgattgtc cttgttgttg tacaatcgtc catcagacaa 3180 tgatggatcc ttgaggtcat tcatcagacc atgatggatc tcagcatctc gtaaatctag 3240 atactttgag actatccatg tttctgagcc aaggtcaata gcttgcagat tatttactga 3300 atgtacaatt ctctatgatt gtggggcagg gtcatttctg tacggtacgt accgtacaga 3360 acgatattac tgactgtcac tcacatggcg aagatggcta cccctgataa gtttgatggg 3420 aagcggggag ctaaagccga gatgtttgtt cagcaagtta gtggatgctc ttaaccaacg 3480 aacgctcgtt cccagccgag tacaacaaag tcgcgttcgc cgtatcttat ctcacgggcg 
3540 atgcaagtgt atgggctgcg cctttcgttc gcaagatcat aggcgatggg aaggaatcag 3600 tcacgttcaa aagtttttta gatgtcttca aaggaaccta ctttgatccg cattgtgtaa 3660 gcaaggccaa gaacgccatt cgtgcgttat ggcaaacgaa gtcagtcttg gaatactcga 3720 ctcgctttaa tcaattagcc agcattgtga gctgggagga ttcgactttg atgagtcatt 3780 tcaaaggaca tttgaagccc gagatcacgg ttcccttgat tagagacact tcgtcgactt 3840 tagaagaact gattaaggcg gctatcgagg tggatcactt gttacaccca aatcgtgatg 3900 gaatttcttt gtttcatgag gcgtcggaag agaacgaagt tgttattaaa ggaggagttg 3960 acagtggagg tgctagaagc gtgattaaat cagaagatgc catggactta tcagcagtac 4020 gattcaattg ttcttatcaa gagtatcaac gtcgcaagaa agaaaatctc tgtttttatt 4080 gtggcaaagc gaaccattca gtatctcgct gtactcaggc taaggctgat aaaggaaaga 4140 agactagctt tagaggtcaa ttttcagaag tcgtgattgg taacggtggg ggaagtagta 4200 gtagtgttgt tagtagtgtt gtagagccga aaaatgggag cgctcaagag tgaatgttac 4260 gccacccttg agcagtttta gtggttttcg tgtcggtgtt ggtgcaaata aattgttaag 4320 taattcagac ccaagaattt ttaagtgcat ctccatccgt gatcccaccc aagccgtaac 4380 ccatagagct ctttgcctac ttgactgtgg ctcgactcac aatgtcatta aagaaaaatt 4440 tgttgatgaa aaaagaatga agaaagaaga attatcaaac tggagagagg taactgcctt 4500 tgatggaaga aggagtttca ttcgatttaa agttgaaatc cgagttgagg atgaagatga 4560 cacagaagaa tttcgagtgg cagagttgaa aggccagtac gatgtgatac tgggtatgcc 4620 ttggtttaga aaacatggtc atcttatcaa tttgagcgac agtacgatcg gatgcatcac 4680 ctcagttccg caacaggtcc ctaccgagga tggtattgtg gacaaaacta ggaaacttga 4740 ggagggggct aatcgtttat cggttgttac gcccccgcga tgtgagtctg tttgcccaaa 4800 ttccctagat aaaagaactt tcactggcaa ggatcccatc ctagaacagg ttgtcctacc 4860 ggaaaaggtt acgattgaag aagagtcacc tgatgcgccc atggccggcc agttaggaaa 4920 actaggaggc caagttagca aaagtgaccc gggggtagcg gctataagcc catctgctaa 4980 cgcagaaccc ccgccgagtg agtctgactg cactatctct gttgttgata aggcgcgcgc 5040 tggcaaggat ctccttttag aacaggttga tcaccaggac gaatcgacac aagcgatggg 5100 tcaaactcaa atgaagtcag agaaaaccaa aattcatgaa aagaaagcac tcacatcaaa 5160 gcagaaagtc agaagaaaca aacgccgcaa cttagcacga aaaaagaaga aagcactcaa 5220 
gtcacccgat gcgtcccagg ccggccagtt aggaaaactt ggaggccaag ctagcacatg 5280 tgactcaggg gtagcggcta taagcccatc tgctaacgca gaacccccgc tgagtgagtc 5340 cgaattcctt tttgttatgg atgactatga ggacgtcata gctagcaaat tttttcgtgg 5400 gcatgttgaa cagccaaact ccaacgacgt gaaagtggac gcattgaagt cttcatggag 5460 taaatcagct gaattagcag cgtcggatcc aaagctacag gctgaaagac cagcggccga 5520 gttagtgcct gaggagtatc atgagtttct acctctattc gaaaagaaat catctctacg 5580 actaccaccc cgacgaaagt atgatttcag ggtggatctt ctggaggatg ctactccgta 5640 tgctggtagg ataattcctt tgtcccctgc ggaaaatgaa gccctgtata agatggtaga 5700 agaaggactc aaagcaggaa caatcagacg ctcgacatct ccctggggcg ctcccgtgct 5760 gtttactgga aagaaagatg gcaaattacg accatgcttt gactaccgta aactgaattc 5820 aatgacggtg aaagtaaagt atcctttgcc tcttactatg gaactcgtcg atagtttgct 5880 ggatgctgaa gagtttactt cattggatat gaggaacggc tataacaact taagggtcgc 5940 caagggagac gaagctaaat tggcctttat ctgtcgagcg ggacaattcg aacctttgac 6000 gatgccattt ggtcctaccg gcgcacccgg atattttcaa ttctttatac aagacattct 6060 gaaggataga attggaaaag atgttaccgc tttcttggac gacatgatgg tgtataccaa 6120 gaagggtagt gtgcataagg atgcggttcg acatgtgctt caagtttgca aagatcagtc 6180 attatgttta aaaccagaaa aatgcaagtt ctcacaatct gagattgctt atctaggttt 6240 gatcatatca aagaaccgtg tcaagatgga ccccaagaaa gtggaggctg taactgattg 6300 gccagcaccc aaaaatgttt cagaagtgct gagttttgtt ggttttgcaa atttctaccg 6360 ccgttttatt gacaactttt caaaagtagc gcgcccacta cacgaactaa caaagaagga 6420 tacaaaattt gaatggaatc aacaacgtca agacgctttt gatacgttga aacggctatt 6480 cacttcggca ccggtactca aaatcgctga cccctataag gcttttgtgc ttgaatgcga 6540 ttgttcggat ttcgcattgg gagccgtcct gtctcaagtg tctgacgtcg atggtgaact 6600 gcacccggtc gccttcttgt caaggacgtt agtaccggcg gaacgaaact acgaaatttt 6660 cgacaaggag atgttagcaa tcttggcctc cttcaaggaa tggcgacatt acctagaggg 6720 taatccaaac cggttagatg tgattgtata ctctgaccat aagaatttag agtcatttat 6780 gacaacaaag caactgacgc gtcgacaagc aagatgggcg gagaccttag gttgtttcga 6840 cttcgaaata agattcagac ctggtaaaca atcccaaaaa cctgacgcct tgtcacgacg 6900 gcccgatttg 
caacctcgag gagaggataa gttgacattt ggccaacttt taagacctaa 6960 taacttacgt gaagatacat tcattgctga catcaccatt gactcttttg agttactgtt 7020 tgaagacgaa ataggaggtc tacaatttga ggacaacttt gaagacgaag aatgtgtaga 7080 gatatgtgct ttaagctctg gtgaagatat ttgggacgac gaaatgattt tatcatcact 7140 aagaaacaag ttgaaagacg atgacaaaat tcaacattta atcaaatcta taacattaag 7200 aaatgcaaaa caagataact ccgaagaaga atacgagttt aatgatggac tcttgtattt 7260 ccgtaagaaa gttgtagtac cacgtgatga taaactgaaa tttgaaattc tgcgttcaag 7320 acacgacagt aaactagtgg gacacccagg tcgagctcga acattgtcaa tggttgaacg 7380 gaatttctac tggccgtcaa tgaagaaatt cgtcaatcgc tatgttgatg gatgttcttc 7440 ttgtcagagg gtaaagtcac ggaataccaa attatttgga tccttacaac ctctacccgt 7500 ccctgccgga ccatggactg acatctgcat ggacttaatt actgacctac cattgtctga 7560 aggaaaggat acaattttaa ctgtggtcga tcggttaact aagatggggc actttattgc 7620 ctgtaacaag acaactgact ccattggctt agcaagatta ttgatcgaca atgtatggaa 7680 acttcacggt acgccaaaaa gtattgtgtc agatagaggt tcaatatttg tcagtaggat 7740 cacgcgagag atgaacgccc aacttggcat caaaacgaag ccgtcaacgg cgtttcatcc 7800 tcaaacggat ggtcaatcag aagtttcgaa tcgcagcgtt gaaggctata taagacattt 7860 tgctaattat gaccaaaaca attgggtaaa gaacttatcg acgtttgaat ttgcttataa 7920 caatttagat cacacggcaa ttgggatgtc acctttcaaa agcaattacg ggtatgattt 7980 aagcttagga cgtgtgattc atggtgaaag atgtgtaatg gcagttgaag aaaggttaaa 8040 acaactgact gaagtgcaag aggaaatcaa ggcgcaaatg gaacggagtc aagaaattat 8100 gaaattaaat tatgatcaaa aagtgaagca agaagaggag tgggaagaag ggagtaaagt 8160 atggttaaat agtaaaaatt tatcaacaac aaggccgacg gcgaagttca gccataggtg 8220 gctgggacca tttaaaatag agaagaagat ttctcctgtt gcttataaat tatgcttacc 8280 tttgtcgatg aaaagcttac atcctgtttt tcacacttct ttgttaagac agtacgagcc 8340 tgatacaata gaagaaagga aacaggcaaa gccagagcct attgtattgg aaggtgttga 8400 agagtttgaa gttgaagaga tactggattg tcggagaaga agaggagtat tggaatattt 8460 gttgagttgg aaaggctacg gaccggaaga agactcatgg gaacgtgaag aatcattaga 8520 aaacgctcag gatttattaa atgattttgt gaagagatat ccagaagctg agaatggtta 8580 caaacggaca cggagaaaga 
agtgagaggt tcaagctttt tccctgaaag tgagggtttt 8640 ttaatgctga accgtggaaa ggatgcagag ctgcaagagg agcttgggca ttaatggggg 8700 agtaa 8705 // ID Gypsy-19_MLP-I repbase; DNA; FNG; 5876 BP. XX AC AECX01000924; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_MLP_; KW Gypsy-19_MLP-LTR; Gypsy-19_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5876 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000924; Positions 228363 234238. XX CC Positions [3098-3637] - Reverse transcriptase CC Positions [4772-5251] - Integrase core CC 'CATG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 530..5716 FT /product="Gypsy-19_MLP-I_1p" FT /translation="MEDPPQYVQIPTEDLRMIRAQLEQFGIDAHRRDQNHA FT QEIGNLRNQHENVLQQQANQINHLEHQLNDRINNPAPPKPDAFRAEPRKFT FT GYGDNAQLWFSELESNFKMHRYPEASWGEMIGPYLDEESLMFWHEARKAQG FT GTISDYEDFKAHFLNQYDFDLMIEETLEKIKVCYYKGDINDYILRFRKLMA FT YVPPDILSFQARKFSFADKLPVHYREKINNKADVRTKKDMELIYAAAREAE FT RTVKINNRSFKNYDQKASTSHKTSDKKHSFNFNRNNFRSHSHSSHFGRPHT FT VTNGAGPMDLDVVDLSNVECYSCHQKGHMSDKCPRGHKKKTFHGSKNRQNN FT FDRRKPNLLLMEHFPYSFGTPDSNSLDSFIPPFQDHELQEESPFLTPFSDA FT DGHVFRVSQESSESLNGPTEDGRIDLEEYKRNYLETLRDLDPEMLLDLEKE FT REAVARVIKEADHYRCCKSCKAYGNEDSPPTISTGIKFFDMSPREINNANN FT KRPIKEDTSSGSENDQDMFCVDMENKRLRLDPSQIDGYRFTNERTPSPFPT FT LEPVALAERELENVEIWNQILATSYTNWADRVEFGSNCTADEEDSSTAAIE FT SSSNFGVEYSSVCTDYDDMKAQAVDSEPSNVSPYDLSILELAIHEDAHKSS FT LPLYPVLFRNMQLDAIIDSGAAANYVSAEEVEKMKEINPDRIKIRSVSNQG FT VRLANGSRETCDKVAIFEAEAEDEFKDNNFKFEIKAFILSLPNISLILGLP FT WLREHQPYIDYTTGKYTIKSGVKTFHLSPKQYEPKLFTIDNKCWTEHLERL FT ARGIAPKCWMEDIIIKDRNWKHFIDTGDAKPIKSHGRPHTPLEHSTIKQFI FT EESLEQGIIERSDSPWSSPLLLVKKPDNSTRVCVDYRALNKVTRKNAYPLP FT RIDDAYLFLSGAKCFSTIDLKSGFWQIPMAPEDKQKTAFTCRLGHFQWRVM FT PFGLCNAPATFQEMMNSILKDVIDKFALVYLDDIIVYSKTEEEHETHIKAI FT FEILKKEGLVVSAKKCQLGKSSLLFLGHVVDGDGIRTNPDKIQKIVEWPIP FT TNISQVRGFLNLCTYYKRFILKFSSIASPLYKLTEGSPRPGSSIVWGDEET FT LSFEKLKEALTKTVPLQHPVPFKPFVLDTDASGTNIGAVLQQDTDYEIPKD FT VSFDYNLYQKKLKNNNLRPIAFESRKLSKTEQNYSAQERELLAIVHGLKHF FT RGYVEGSPVLVRTDHESLKHFKTQKHINRRLARFVDEIEFFDNHIVYRPGK FT EQLAADSLSRKPDTKFDADPPETADALFTINDCGDQYQQLLKYKLQLQSGM FT DPSEVGSGDLTLDGDRLIKINLDNPLLNTLTVPTSKLEAEKICYRVHEHLG FT HRNVKDVIRSTKARYWFPDLDKTVKEAIDLCKACQVHAAPTKQQGMSLQFV FT PRGKPFRKWGMDFVGPLPKTANGNQYMITAIDYGTGWAYAKPLRSTSSDAA FT VALVKEITLNHGLPEEILTDNGSEFISQNFKTYLKDNNVKQKNTSPYHPQT FT NGLDERFHGTLVNALRKFCSPYKQNLWDEYLNTCLFAYRASYSHSMKASPF FT YMAYGEEARLPDEAVSMKFDSSYENLELIYRQRNIAVDKLKASRETLIQEL FT NERAQERSKESEEGYTERNLRPGDRVLRRFEGRPSKLHPKWDGPFIVKDAD FT PHGTFTLMTSNGHVLKAKVNGSRIKKFKGTSDQFFFASQRLHERDAAANEG FT SSRNVNNVI" XX SQ Sequence 5876 BP; 1798 A; 1448 
C; 1095 G; 1535 T; 0 other; caaatatata tttgtgtgtt tttttattct ttttccctct aactctaata tcatcgtcta 60 cttgatatcg aacaaggttc ctcatttatc tgtgatttct aactctgtgt ttctcttata 120 tccttttctt tagacaccct tatcgaaaac cctataacca gaaacaaaca gcaaccccac 180 aaatcacaaa taaacccgtc gtaagtcttc gtaattccct agctagttat tccgatctct 240 ctcgacacct ttccctcgat cagttagagt aactaaaaga attcctctat cctttacgca 300 caatagaatc aaccaaccgt tttaccaaca aacaacccat tcatcgcgat tttttaccac 360 tagatttttg aatctcttgt taacaaatac acctattttc tttttgaaga acttagtatc 420 gtacattttt tatcttagac tctgatcaag tcttaaaact atctctcacg aatcacttta 480 tttttatctt tttattcgac ttgttatatt ttcaggtgcc cattcagaaa tggaagaccc 540 acctcaatac gttcaaatcc ccaccgagga tctccgcatg atccgtgcac agcttgaaca 600 gtttggcatc gatgctcacc gccgcgatca aaaccatgct caagaaattg gcaacctcag 660 aaaccaacac gagaacgtcc tacaacaaca ggctaatcaa attaatcatc tcgagcatca 720 actcaacgat cgcatcaata acccagcccc acctaaaccg gatgctttta gggccgagcc 780 tcggaaattc accggatatg gtgataatgc ccagctatgg ttttccgagt tagaatctaa 840 tttcaaaatg catcgctacc cagaggcttc ctggggtgag atgattggtc catacttgga 900 cgaagaatct ctcatgttct ggcacgaggc tcgcaaagct caaggcggta ctatctcaga 960 ttatgaagat ttcaaggctc actttttaaa tcaatacgac ttcgatctca tgatcgaaga 1020 gactttggag aaaatcaaag tctgctacta caagggtgat atcaacgatt acatccttag 1080 gtttagaaag ttgatggcat acgtaccccc agacattctt tccttccaag ctcgcaagtt 1140 ctccttcgcc gacaaactcc cggttcacta tcgagagaaa atcaataaca aggccgatgt 1200 cagaactaag aaggacatgg aattaattta tgccgctgct agagaagccg aacgcaccgt 1260 caaaatcaat aaccgcagtt tcaagaacta cgaccagaag gcctcgactt ctcataaaac 1320 ttccgacaag aaacactctt tcaatttcaa tcgcaacaac tttcgctctc acagtcactc 1380 aagccatttt ggtagacctc acactgttac caacggagca ggtcctatgg acctagacgt 1440 agtcgatcta tctaacgtcg aatgttactc ctgtcaccaa aaaggtcaca tgtctgataa 1500 atgtccccga ggacacaaga agaagacttt tcatggatcg aagaatcgac aaaacaactt 1560 cgatcgtcga aagcctaact tactcttgat ggaacatttc ccttattctt ttggaactcc 1620 cgattctaat tcacttgatt catttatccc tccattccaa gatcatgaat tacaagaaga 1680 
gtccccgttt ttaactcctt tctcggacgc tgatggacac gtttttagag tgtcccaaga 1740 atcatcagaa agcttaaacg gtcctactga agatggaagg attgatttag aagagtacaa 1800 aagaaattac ctggaaacct tacgtgactt agatcctgaa atgctacttg atttagaaaa 1860 agaacgtgaa gccgtcgcta gagtcatcaa agaagctgat cattaccgat gctgtaaatc 1920 ttgcaaagct tatggtaatg aagacagccc tccaactatc tcaaccggta ttaaattctt 1980 cgacatgagt cctcgagaaa tcaacaatgc caacaacaaa aggcccataa aagaagacac 2040 atcatccggc agtgagaacg atcaagatat gttctgtgtt gatatggaga acaaaagact 2100 cagattagat ccctcccaaa ttgacggcta ccgtttcact aatgaacgga ccccttctcc 2160 attccctact ctcgaaccag tcgcattagc agaaagagag ttagagaatg tcgaaatctg 2220 gaatcaaatc ttagcaacgt catataccaa ttgggcggat cgggttgaat ttggatccaa 2280 ttgtaccgcc gacgaagaag actcaagcac tgccgccatc gagtcaagca gcaatttcgg 2340 agtagaatat agttccgtct gtaccgacta tgacgatatg aaggcacaag ctgtcgactc 2400 agaaccaagc aatgtttctc catacgatct gtcaatacta gagctcgcca ttcacgaaga 2460 tgcccacaag agctcacttc ccttataccc agttttattt cgcaacatgc aactcgacgc 2520 tatcatagat tcaggcgcgg ctgctaacta cgtttccgca gaagaagtcg agaagatgaa 2580 agaaatcaac cccgaccgta tcaaaatcag gtccgttagt aaccaaggag ttaggctagc 2640 taacggtagt cgtgaaacat gcgacaaggt cgccatcttc gaggcagaag ctgaagacga 2700 atttaaagat aataacttca agtttgaaat caaggctttt attctttccc ttcccaacat 2760 ttccttaatt cttggactac cctggttgcg agaacaccag ccctacatcg attacacaac 2820 gggaaaatac actatcaagt ccggtgttaa aaccttccat ctcagcccaa aacaatatga 2880 acccaagctg tttactatag acaacaagtg ctggactgaa catctagaac gcctcgctag 2940 aggcatagcg cctaaatgtt ggatggaaga catcatcatc aaggatcgta actggaaaca 3000 ttttattgac acaggcgacg caaaacctat caagtcccat ggtcgccctc atactccctt 3060 agagcatagc acgatcaagc aatttatcga agaaagcctt gaacaaggca taattgaacg 3120 ctctgattcc ccttggtcat cccctttgtt actagtcaag aagcccgata attctaccag 3180 ggtatgtgtt gattatcgag cactcaacaa ggttacaaga aagaacgctt accctttgcc 3240 ccgcatcgat gatgcttatc tttttctttc cggtgcaaag tgtttctcaa cgatcgactt 3300 gaagtcaggc ttctggcaaa tccctatggc tcccgaggac aaacagaaga cggcttttac 3360 ttgtaggctt 
ggtcactttc agtggcgggt gatgccattt ggcctttgta acgcacccgc 3420 tacgtttcaa gaaatgatga attcaatctt gaaggatgta attgacaaat tcgctcttgt 3480 ttatttagat gatatcattg tctactccaa gaccgaagag gaacacgaaa ctcacatcaa 3540 agctatcttc gaaatcctca agaaagaagg attagtggtc tctgccaaga aatgtcaact 3600 cggcaagtct tcattattat ttttaggtca cgtcgtcgac ggagatggca ttcggacaaa 3660 ccccgacaaa attcagaaaa ttgttgagtg gccaattcct actaacattt ctcaagttag 3720 aggctttttg aacctatgca cttactataa acgcttcatt cttaagtttt cctctattgc 3780 ttcacctctt tataaattga ctgaaggatc ccctaggcca gggtcgtcaa ttgtctgggg 3840 ggatgaagag actctctcat ttgaaaaact taaggaagcc ctcaccaaga cagtccccct 3900 tcaacaccct gtccctttca aacccttcgt tcttgatacc gatgcgtctg gaaccaatat 3960 aggcgctgtt ttgcaacaag acactgacta tgagattccc aaggacgtaa gttttgatta 4020 taatctctat caaaagaaac tcaagaacaa caatctaaga cccattgctt tcgaatcccg 4080 taaactctca aaaacggaac agaattattc agcgcaagaa cgtgagttac ttgcaatagt 4140 tcatggtctc aaacatttcc gaggctatgt tgaaggatcg cccgtcttag ttagaactga 4200 ccatgaatct ttaaaacatt tcaaaaccca aaaacatatc aatcgacgcc ttgcccggtt 4260 tgtggacgaa atcgaatttt ttgacaatca catagtctac agaccaggta aagaacagtt 4320 ggctgcagac tccctctctc gcaaacccga caccaaattt gacgccgatc ctccagaaac 4380 cgccgatgct ttattcacta tcaatgactg cggagatcag taccaacagt tactcaagta 4440 caagctccaa cttcaatccg gcatggatcc ttcggaagtt ggttcaggtg atctcaccct 4500 agacggtgat agactaatca aaatcaatct tgataacccc ctacttaata ccctcacagt 4560 accgacaagc aaactcgagg ccgagaagat ctgctaccgt gttcatgaac acttaggaca 4620 tcgaaatgtc aaagatgtca tcaggtcaac gaaagcccgc tactggtttc ctgacctcga 4680 caaaacggtc aaggaggcca ttgacctctg taaagcatgc caagtacacg cagccccgac 4740 caagcaacaa ggtatgtctc tacagttcgt tccccgtggt aaacccttta gaaagtgggg 4800 aatggatttt gttggtccct tacccaagac ggccaacgga aatcagtata tgatcaccgc 4860 catagattat ggcaccggtt gggcatatgc taaacctttg cgttctactt cctcagatgc 4920 tgctgtcgct ctcgtcaaag aaatcacatt gaatcacgga cttcccgaag aaatcctaac 4980 ggacaacggc tctgaattca tatctcaaaa cttcaaaact tacctcaagg acaacaacgt 5040 caagcaaaag aatacttcac 
cctatcatcc tcagaccaac gggttagatg aacgatttca 5100 cggtactctc gtgaatgccc ttcgtaagtt ttgctcaccc tataaacaga atctctggga 5160 tgaatatcta aatacttgcc tcttcgcata tcgcgcttct tactctcatt ccatgaaagc 5220 ttccccattt tacatggctt atggcgagga agctagatta ccggacgaag ctgtaagtat 5280 gaaatttgat agctcctatg aaaacctcga actaatctac agacaacgta acatcgcagt 5340 cgacaagctc aaagcctcaa gagaaaccct tatccaggaa ctcaatgaac gtgcgcaaga 5400 aaggtcgaag gaatcggaag aaggctacac ggaacgaaac ctccgcccgg gcgacagagt 5460 cctcaggcgg ttcgagggac gcccatccaa actccatcct aagtgggatg gccccttcat 5520 cgttaaagat gccgaccccc acggtacgtt caccttaatg acttctaacg gtcatgttct 5580 gaaagctaaa gtaaatggct ctcgtattaa aaaatttaaa ggaacctccg accaattttt 5640 ctttgcctcc caacgtcttc atgagcgaga tgcagctgct aatgagggaa gctcaaggaa 5700 tgttaataat gtcatttgaa gattttcaaa cttgtttaga acatgttccc aacagcttta 5760 tcgaggatta cgtggaaagg ctggctgtca tgaacttagc cgcactaaga gaattatcct 5820 ggcgcagcaa ataggtaata aactaggaag tttatggtct taaggagggg atggtg 5876 // ID Copia-2_GDe-I repbase; DNA; FNG; 4692 BP. XX AC AEFC01001875; XX DT 26-MAR-2011 (Rel. 16.03, Created) DT 26-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Geomyces destructans genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_GDe_; KW Copia-2_GDe-LTR; Copia-2_GDe-I. XX OS Geomyces destructans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Leotiomycetes incertae sedis; Myxotrichaceae; OC mitosporic Myxotrichaceae; Geomyces. XX RN [1] RP 1-4692 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Geomyces destructans genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AEFC01001875; Positions 591 5282. XX CC Positions [2101-2385] - Integrase core CC 'GTTTC' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 3078..4187 FT /product="Copia-2_GDe-I_3p" FT /translation="MIQIQLKKQSSKQTGQNGKPPSKRNTETSNTRALGSL FT CSELNYHHMRKCYLILKTKRDKNGSIMKRKARWVAKGFRQKQGTDFDQTYA FT GVCKTTTWKLAIAIAAFFGLEIEQVDVVGAFLNSDTDTEIFMDVPPDWEVN FT GSILENAPNWMCHLQKALYGLKQAPRLWQRELAKALHGLGFTTCASDYSVY FT CNQKTGILIVTYVDDMLIIGKQITTIQGLKKELMQKFEIEDLGPASYSVGV FT RITRNREAGTISLCQDAYITKILHRYGMENCKAVDTPVATGAAEVMIPFEG FT KASHAEIKLYGSKIGSLMYLAVQTRPDISYGVSVLSRFLTNPSPAHMKAGD FT RILHYLKGKDAWNRIWHARRYASLWIL" FT CDS join(280..2097,2101..3078) FT /product="Copia-2_GDe-I_1p" FT /translation="MGEGNGGRNNLPENMKLRGEENYIQWKTAMEDLADAN FT DLVHFVHPKGKAPPQVDEFDEKVDIEKLATWKSWKAGDASMKLIIQTNVKK FT TPSQLLAGCKPAREMWVILQSQYEGTGTVLNFNAIETYTKIKYEDYANLEQ FT FIIGFKKAIEKLANLEISPPDAWHPILFIMALSDAWPIWAERQCSNTRSEA FT TKLTLSALIEDITDEARKKDKKADGSAHYSGKPNQKGKPNSKPKDSAKGKG FT GKMCKNCDNPNARHEPENCFITNKKLRHEWEQRTGKKFMPFKPSNSSSKSN FT KLKKAKNDNSSDEEDEDRQFGKNATTYICVQLNHSRSPIQNGTLFDTGAET FT HIASSIEEFNSGTYKTSLTLPSIDTASGITEPLGQGERTLQCATDDGGIHT FT LHLTKVQYLPNCRMNIFGARKLLGRGELHITDKNLVVNKKGIPMFRFDENM FT MIIVASPRAYAFTATTHKNEKNPPHSIRLWHRRFGHLGLANVQKTSKMVRG FT MAISNEEDLTPQEQEDEDLQLCDPCERGKAKKHIRKQAVTQNLQPFDEVHI FT DVVMITPEGIGKKKYATIFTEKATTVRWAYFHRSKNEAYDALIKYQKMVNT FT QFGKIIKKRMDGGKEYSPKKLTELAEELGQIVEMTTPYNPEQDGTSERSIG FT IICERTRTAMIDMAIPQFLWPLILESIVLITNRTATSKLKGKTPYEALTDN FT LHPGQDNHPSVAHFRVLGCKTYVQIPKEKRVTSAKVTERAEVGILVGYEGT FT HIFKIYVPTRRGSLENCIVRSSNVRFDEGGLITKPFPDEDEIESDNEDASI FT TLPHIARGEANYDRDLIQRVSKTQHEEHHSEIENESEISESENSPQELELH FT ELDQYSEVDDEDIELMPTITARRTRKVYIPKPEFNRVTRNQATTTATASYE FT AHSTVTEPEFELSIHCSIHSWSTYIYLR" XX SQ Sequence 4692 BP; 1617 A; 1088 C; 1029 G; 958 T; 0 other; ggttatgagc ccggctatcg caaagccaaa agttatcata ttcagagaag tgctcattac 60 agcacataca atctgaattc gtggacaagc gctaaacata cagcacatac actccacgaa 120 aacctcaaga cgaacttaga gcctctagca gatgatcgat tcaactcagc aagaagagga 180 agaagaagat caattctaca atctaggaag ccccagattg aattcatcat ccatcatcca 240 tctctcacaa actcgaaatt ccttacaaga cctttcacaa tgggtgaagg caatggaggt 300 
cggaataacc tcccagagaa catgaaactc cgaggagagg agaattacat acaatggaaa 360 actgcgatgg aggacctcgc ggatgccaac gaccttgtcc atttcgtaca tccaaaaggc 420 aaagctcctc cacaagtaga tgagttcgat gagaaggtgg acattgagaa actcgccaca 480 tggaaatcat ggaaagcagg agacgcatca atgaaactca tcatccaaac taatgtgaag 540 aaaacaccat cacagctgct tgcaggttgc aagccagcac gcgaaatgtg ggtgatctta 600 cagtctcaat atgaaggaac tggcacagtt ctcaacttca atgcaattga aacatacacc 660 aagatcaagt acgaagacta cgcaaacctt gaacaattca tcattgggtt caagaaagca 720 attgagaagc tcgccaacct cgagatctct ccaccagatg cgtggcaccc aatcctcttc 780 atcatggcac tttcagatgc atggccaatc tgggcagaac gccagtgctc gaatacacgc 840 agtgaagcca caaaactcac actgtctgca ctcattgaag atatcacaga tgaggctcgc 900 aagaaggaca agaaagcaga tggaagcgca cactacagtg gaaagccaaa tcagaagggc 960 aagccaaatt cgaaacccaa agatagtgca aagggcaaag ggggcaagat gtgcaagaat 1020 tgtgataatc caaatgcgag gcatgaacct gaaaactgct tcatcacaaa caagaagctc 1080 cgtcacgaat gggaacaacg gactggaaag aagttcatgc catttaagcc gtcaaactca 1140 agtagcaaat caaacaagct aaagaaggca aagaatgaca actcatcaga tgaagaagat 1200 gaagatcgtc aatttggcaa gaacgccacc acatacattt gtgttcaact caaccactca 1260 aggtcgccaa ttcaaaatgg cacattattt gacactggag cagagacaca tatcgccagc 1320 tcaattgagg aattcaacag tggcacatac aaaacctcat tgactcttcc atccattgac 1380 acagcaagtg gcatcaccga gccacttggc caaggcgaaa gaacattgca atgcgcaaca 1440 gatgatggag gcattcacac cctacacctc accaaagttc aatatcttcc aaactgcaga 1500 atgaacatct ttggagctcg aaagctctta ggaaggggtg aattgcatat tacggacaag 1560 aacttggttg taaacaagaa aggtattcca atgttccgtt tcgatgagaa tatgatgata 1620 attgtagcat cacctcgagc ctatgcattc acagcaacta ctcacaaaaa tgagaagaat 1680 cctccacact caattcgact ctggcatcgc agatttggac atcttggcct cgcgaatgta 1740 cagaagacct caaaaatggt aagaggcatg gcgatctcta atgaggaaga ccttacacca 1800 caagaacaag aagatgaaga tctccaattg tgcgatccat gcgagagagg aaaggcgaag 1860 aaacacattc gcaaacaagc tgtaactcaa aacttgcaac cgttcgatga agttcacatt 1920 gatgtggtca tgatcacgcc agaaggaatt ggaaagaaga aatatgcaac tatctttaca 1980 gaaaaggcaa caactgttcg 
atgggcatac tttcatcgct cgaagaatga ggcatatgat 2040 gcactcatta aatatcagaa gatggttaac acacaatttg gcaagatcat caagaaatga 2100 agaatggatg gaggaaagga atacagtcca aagaagctca ctgaacttgc agaagaactt 2160 gggcaaatcg tcgaaatgac aactccttac aacccagagc aggatggcac atcagagcgg 2220 tcaattggca ttatctgcga gcgtacgcgg actgcgatga tcgacatggc tattccacag 2280 ttcctctggc ctctcatact tgagtcaatt gtacttatta ccaaccgcac agcaacttca 2340 aagttgaaag gcaagacacc atacgaagct cttacagaca accttcatcc tggacaagac 2400 aaccacccat ctgtagcaca ttttcgagtg cttggatgca agacttatgt acaaatccca 2460 aaggaaaaga gggtcacaag tgcaaaggtt acagaacggg cagaagttgg aattcttgtt 2520 ggatatgaag gcacacatat ctttaaaatc tatgtgccta cacgcagagg atcactcgaa 2580 aattgcatag tgcgatcctc aaatgtccga ttcgacgaag gaggactcat tacgaagcca 2640 ttcccagatg aggacgaaat tgaaagcgat aacgaagatg catctatcac actgcctcat 2700 atagccaggg gtgaggcaaa ttatgacaga gatctaattc aacgagtctc taaaacacag 2760 catgaagagc atcactcaga aatcgaaaat gagagcgaga tctcagaatc tgaaaactca 2820 cctcaggaac tagaacttca tgagttggat caatactcag aagtcgatga tgaggatata 2880 gagctgatgc ccactatcac agcgaggaga actcgaaagg tttatatccc aaaacctgag 2940 ttcaacaggg tcacccgcaa ccaagcgacc acaacagcca ccgcaagcta tgaagctcac 3000 tcgacagtca cagaaccaga attcgagctc agcattcatt gcagcattca cagctggagc 3060 acatacatct atctacgatg atccaaatac aattgaagaa gcaaagcagc aagcagactg 3120 gccagaatgg caagccgcca tccaaaagga ataccgaaac ctcaaacaca agggcacttg 3180 gaagcttatg cagcgaactc aactaccatc acatgcgaaa gtgctaccta atcctcaaaa 3240 ccaagagaga caagaatggc tctatcatga agcgcaaagc gagatgggtt gcaaaaggtt 3300 ttagacaaaa acagggaact gattttgatc agacatatgc tggagtgtgc aagacaacaa 3360 catggaaact cgcaatcgca atcgcagcat ttttcggact tgaaattgag caagtggatg 3420 tggttggagc cttcctcaat tctgatacgg acacagaaat ctttatggat gttccaccag 3480 actgggaagt caatggcagc atactggaaa atgcgccaaa ttggatgtgc catctacaga 3540 aagcacttta cggattgaag caagcgcctc gactttggca aagagagctt gcaaaggcac 3600 ttcatggact tggattcact acatgcgcaa gcgattacag cgtttattgc aatcagaaga 3660 ctggaatcct catcgtcaca tatgtggatg 
atatgctgat cattggcaaa caaatcacaa 3720 caatccaagg gctgaagaag gagctgatgc aaaaattcga aattgaagac cttgggccag 3780 catcatactc tgtaggggtg aggatcacac gcaacagaga ggcaggaaca atttccctct 3840 gtcaagatgc atacatcaca aagattctgc accgctatgg catggagaat tgcaaggcag 3900 tggacacacc tgtggctaca ggagctgcag aagtcatgat cccatttgaa ggcaaggctt 3960 cacacgcaga aattaaactc tatggatcga agattggatc gctcatgtac ctcgctgtcc 4020 agactcgacc agatatctca tatggagtct ctgttctgtc cagattcctc accaatcctt 4080 cgccagctca catgaaagca ggagatcgaa tccttcacta cttaaaaggc aaagatgctt 4140 ggaatcgaat atggcatgca aggcgatacg catctctatg gatactgtga ctcagactat 4200 gcaagcgata aagcaaaccg taaatcggtc cttggaaatg ccttcttctt cgccggaggt 4260 gtgatttcac acacttcaaa gcgacagcag accgtggcac aatcgaccac agaagcagag 4320 tactatgcac tcgcaaaggc agtgtcagaa gcactatggc tacaacagat catgaagcag 4380 atgatgtaca caggaaatga cattcaagct acgaagatct atggagacaa ccaaggctca 4440 ttgagcttgg ccgaaaaccc caaattccac cagagaacca agcatatcga tgtcaagcac 4500 cacttcattc gggagcacat tgaaaacggg tcaattgacc tctggtatgt ggaaacagcc 4560 aacatggcag cagatggatt gaccaagccg ctacccgcca cacaacatga gaagttaata 4620 cagcagcttc agatgaaggt gatcacagcc taggaaatgg gaatggcctc aattcagtca 4680 tgcaaagggg tg 4692 // ID Gypsy-39_MLP-I repbase; DNA; FNG; 12779 BP. XX AC AECX01002289; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-39_MLP_; KW Gypsy-39_MLP-LTR; Gypsy-39_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-12779 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002289; Positions 18702 5924. 
XX CC Positions [10250-10750] - Integrase core CC 'ACTTC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 230..1864 FT /product="Gypsy-39_MLP-I_1p" FT /translation="MSAPTFTSGEKGKSPDSSLAPSSASRPAGRQTESSIR FT TESKVLPSLQYSNERLEATGVKPLEAPGPESNYHQWQFVMGTIIRGSSFGY FT VLRENQPSPLPASEEHDRCKVSALILRYVHPSNYEFLSSYEDDPRSQWKAL FT REAHQDALSGSIMYWLKKLVTAKMGDLSITEHLESMSADYQKFKSLVTPDK FT PLTADTVFATAVSLSLTADWQPVLAPLLQRESVTSTTILKVLREEATRRSI FT SSEEIEVIAAAKDGKSKDVSCSHCSKSGHVMEDCRILKSKLEKYKVLKAEK FT AEQDIPKSKPTKNQVRRSKAAKAKAAAIAELSSASEASININSEISSAAVI FT KASSGKTTGWLIDSGTSQVMTPCSDVVTRKKTDNTNISLANGSIVKSISKG FT FVSLPFSDFEDIKSLYVPSLSEPLLSVSKLADKKITTVFNSSTVSFVKNCK FT ITGDVVGTGYRHGNLYYLNQEVCHSKPSDQTSKAMVSDVHLWHSRFNHPGY FT AALKRKLNSLNIKVDEAELRKLQTCSICVQGKVRRRNMSSRAGWRSIKL" FT CDS 6767..9619 FT /product="Gypsy-39_MLP-I_3p" FT /translation="MCQWREGVLYDPAVEKMLENKLLEKKKEFLNERRKDL FT VAAVLDNVDVVEKASDLSYREMKMLKEFEVLFPDVLPDVESLDYAEFFPPE FT VQSPSSKVRHNIVLTDPNVVINERQYPYAPRYMDAWKKLVDEHVAAGRLRR FT SDSPYALPSMIIPKADPTAAPRWVCDYCRLNKFTVKDRSPLPNVEEAVRLV FT GTGKIYSILDQINAFFQTLNRPEDIPLTAVKTPWGLYEWVVMPMGLTNAPA FT THQRRCEEALGDLVNTVCVVYIDDIVVYSQSVEEHEQHLRLVMERLKAAKL FT FCSIKKTKLFRRRIKFLGHNISAEGISPDEDKVEKVASWRTPKTAKQLKEF FT LGTVQWMKKFVDGLAQYAGHLTPLTSVKNAAKKFQWGDKEQDAFDNIKRII FT TTLPTLKNLDYESNDPIWLFSDASGHALGAALFQGAEWETSSPIAYKSRQM FT SPAERNYPVHEQELLSVINALNKWKMLLLGMKVNVMSEHHSLTQLMTQQGL FT SRRQARWLETLSQFDLDFRYVKGLDNSVADALSRMEDVATLEVKSELSTDL FT TDQIKEGYRTDSFCERLAKVLPLRDNCVIQDGLMFMDGRIVVPTVDGLQRE FT LIGVAHNSVGHLGSLKTAERVRREFYWPHLLQDVEDFVKKCDSCQRNKART FT TRLPGKLQSTQVPLRPMADIALDFVGPFPKVQGYDMLLTCTCRLTGFTRLI FT PTCQTDNAEKTARRLFGSWLTIFGAPQTMIGDRDKIWTSKFWKELQSLLGV FT RVHLSTAYHPQANGRAERTNKTVGQMLRHHVGGKHGKWLPALPAVEYAINS FT AINTATGVSPMQFVFGYVPKLFPIDVKQAEVSEGVREWVEKRQEGWAAWRD FT KLWAARASQAVSYNARRGADIVLREGDWVLIDSKDRQQSVKGPVAKLRARY FT DGPYEVIEVLNEGRNVKVRLDDGNRSHDMFHQSKLRKYCSDELEMDA" FT CDS 10376..12751 FT /product="Gypsy-39_MLP-I_4p" FT 
/translation="MKSKNEVLSKFKLFFNAMTTKVDGVIAELRSDNGTEY FT INKEFTSFCAEKGITQTTGPPDSPQLNGVAERCNRTVKEKVCCCLIESSLP FT DSFWGDALEYAIETLNNMPTRTNEKFQSPISLLGIHERQEHTFHAFGCQVW FT FHVIKPNSTLKPRARHAIFLSYLNNNKGAIVWDTEKKTALKATSLVFLDKV FT FPGLKSPKSCKTQEPIGNDTLPWPTLESDEQIITKDSLPSDSITQEESSRP FT KRTTKPPTRFGNFVNSASTRKQPKRYGEYGRAAFIESEKEPSTYKQAKKSV FT NWDLWKLAAEAEMCSLVGKETWKLVPRPARRKIIQCKWIFKTKRNIDMSID FT KLKARLVALGFSQIKGIDFSEVFSPTSRQESIRLFLSLMAKNGWKGVAVDI FT KTAFLNGDLDEEIYMEQAEGFVDEDHPDWVYRILRSLYGLKQSPRQWNIKL FT HEYLISIGYTRSAHDPSIYFKKDDQGEINSMLVTHVDDIAVTGTDKEIDNF FT CVLIKKRFEVSSDKPLSHFLSLNISKVKENKITVDQTHYISNLQKQFMKYN FT VKCSNTPNDKSFKNLVPTSNEDTISEDLYSSLIGGLLWVAQCTRPDIAFVV FT NRLSQFLKKPLTIHWNSALRVLGYLVKTKELKLNLGGSKLNPTAYSDADWA FT EDRHERRSTTGYVFMLGIGPVSWRSRKQRTTSLSSTEAEYMAMSDSCREAR FT WLVSILNELNISRGKSFKLCVDNEGAEALAQNPSHHSRTKHIHTRYHFVRQ FT CVEEGIVKLVHVSSSKMLADMFTKVLSKILLIKHRLSLSIF" XX SQ Sequence 12779 BP; 3736 A; 2646 C; 2815 G; 3582 T; 0 other; ggttatgagc ccacctcagt gactctataa gattgcactc tcggaaaacc gtttcaataa 60 ctatcaacaa actttaacaa gtatagttcg aaactgtcta gaagactgta aaggaaaagt 120 ccgtctgata cagacaaaaa actaaatcga ttgtcagaaa taaactccaa aaccctaaaa 180 acctctagtt taccactctc tgttttgatc ttcgataatc ttcatcatca tgtctgcacc 240 aactttcaca tctggtgaga agggcaagtc gcccgattct tcgttagcac cgtcatctgc 300 ctctagacca gctggtcgtc aaaccgaatc ttcgattcga actgaaagca aagtattgcc 360 atctctgcaa tattcgaatg agcgtttaga ggcaactgga gttaagccat tagaagctcc 420 tggcccagaa tcaaactacc atcaatggca gtttgtaatg gggacgataa tcagaggatc 480 aagttttggt tatgtacttc gggaaaatca accatcccct cttcctgcat ctgaagaaca 540 tgatcgatgc aaagtatctg cattgatact tcgatatgtt catccttcga attacgaatt 600 cttgtcatca tacgaagacg atccgagaag ccagtggaaa gctctgcgtg aagcgcatca 660 agatgctttg tcaggatcca ttatgtattg gcttaagaaa ttagtcactg cgaaaatggg 720 tgatctgagt ataacggaac atctagaatc aatgtcggcg gattatcaaa aattcaaatc 780 gttggttact cccgataagc cactcactgc cgatactgtc ttcgccacgg cagtaagtct 840 gtcattgact gctgactggc aaccagtttt agctccatta cttcaacgtg aatcagtcac 900 atcaactact 
atcttgaaag tgcttcgaga agaggcgacg agaagatcga tttcctctga 960 ggaaatagaa gtcatagcag ctgctaagga tggaaagtcg aaagatgtat catgctctca 1020 ttgttcgaag tctggtcatg taatggaaga ttgccgaatt ttaaaatcga agctggaaaa 1080 gtacaaggtt ttaaaggcgg aaaaagctga acaagatatt ccaaagtcta aaccgaccaa 1140 gaatcaagtg agaagatcga aggcagcaaa agcaaaagca gctgctattg ctgagctctc 1200 ctctgcttca gaagcttcga tcaatatcaa ctctgaaatt tcttctgccg ctgtaatcaa 1260 agcgtcgtcg ggaaaaacca ctggatggct aatcgactcg ggaacgtctc aagtgatgac 1320 tccatgctca gatgtagtta ccagaaagaa aactgacaat acgaatatca gtctagcaaa 1380 tggttcgatt gtcaaatcaa tttcaaaagg atttgtatcg ctaccgtttt ctgattttga 1440 agatatcaaa tcactctatg ttccttctct gtccgaacct cttctatctg tctctaaact 1500 tgccgataag aaaatcacta ccgtattcaa ttcatcgact gtttcgtttg ttaaaaactg 1560 taaaatcacc ggtgacgtag ttggaactgg atatcgacat ggcaatctat attatctcaa 1620 tcaagaggta tgtcattcga aacctagtga tcaaacgtcg aaagcaatgg tctctgacgt 1680 acatttatgg cactcaagat ttaaccatcc tggttatgca gctttaaaac gaaaactcaa 1740 ttcattaaac attaaggtgg atgaagctga actacgtaag ttgcaaactt gtagcatatg 1800 tgttcaggga aaagtgagac gacgtaacat gtcaagtcga gcagggtgga gatcaatcaa 1860 actgtaaggt gtcacactta caaacacaca tgtcacatac tatacagtac agtagataca 1920 attagagtct agactaccct tgtttccttt cctcttctct ttttctcttt atcggtttct 1980 tcagcttctt cagtttctat tcatgttatc agggttgttg tgatctgatg ttctttatga 2040 actaggtatg ggtttccttt cctcttctct ttttctcttt atcggtttct tcagcttctt 2100 cagtttctat tcatgttatc agggttgttg tgatctgatg ttctttatga actaggccta 2160 gttttcaaaa ttcactttgg ttctagataa ctttcgtgcc tgctcgtgcc caatcagtgt 2220 tgttcctcca taggaacttc cacttttttt atctattacc aaatcgatac caatacgaat 2280 ccagattttt tttcgaatca actgaaacta ctgcgccagc gaattgaatt accccgcatc 2340 cccgatatag aattgaattg actgaactat cttgaatact gaactgaact gaaaccgatc 2400 tgctggtgac cgatcacctg ggacactagt ctatgccaaa cacccgccag aataggaccc 2460 tggaaagccc ctatataccc gcctaattgg gtatgtagga attggccttt ttgggtaaaa 2520 atggtcctta tgttctgtgt aggaattcta aaaatttgaa aaaaaaagtc acacaggagc 2580 cttattttgt atgtaggaat 
atgtcaaaaa gtgtggaaaa aaaaattagg tttcttacac 2640 attaggtagg aaagagggtt ctcagggcca ccaggctacc cacttcaata cataaggtga 2700 gtagcaatga tcatcgagtc acaatgaaac ctgatgttat tgtgtataca taccaatcat 2760 gatagtattg agttcgaagc tcaggtaaga caattttttg cacattttta tgtaaaaggg 2820 tgtggaaaaa tgtcatatta tgacagttga tggtcattga aatcgagatg tctgacagtc 2880 tggcatgggt cagcaagaag gttgtttcaa ccagtttcaa ccgagtccaa acatttgaaa 2940 tggcctgaaa gttgctgaaa cagcttgatt ctgctcgatc acaacatttt ttttgtttga 3000 tcttgatagt tttcgttgat attagacaca attactatat ctcgtgtagc aaaaggaaaa 3060 ggtaatgaac gaaagaaaaa aaacaaccac caggatcctg tggcttctga tcacgccaat 3120 gcaaatggag ccaccaaatt ggaccctcgg acatgtagta aattggtagc catcatgctg 3180 aaaaactctg agcagatgga caaggctctg gatcgcttac atcgatggaa cattcatgta 3240 aagcttgacc tcaattggat tgcaaaaaat acatcaacaa ctctcgacat cgcaacacaa 3300 acactttata caagtggcaa tgtatcagaa gacaatccac ttgaagtaaa caagtttgtt 3360 gtggttataa aaaacaatgt atgtcatccc ctgtttcaaa tatgaagctg ccgactctga 3420 tcatttaatg tactcttgat tcgttttaga tgcttttttt gggtaagatc tgtgctctct 3480 acatttacca gaatggtaag cacgagtgga tcaccaaatt gtccactcga tctaagctct 3540 catatgcatc tacccagctt tacatgtata aagggaatcc tggagcgtgt agttttgaag 3600 aaaactttcc aaatcgcctc atattctcgc atcttgatat agcagatatt cactatgtgt 3660 ttccccccac ctcatcaatg aatattactg aacccaaggc taacaccata tggattcctc 3720 aagcttatcg tgctatgtta aaggctatga gccagaaaca gtacatcaag gcaattctcg 3780 aacaacttgc tgttgaacac gctaaagcta gaaaaaaata agctatactt gaattgtaat 3840 atctaacttg gtttttattt ctctccacct tatctagaga tttgattttg tgttatttct 3900 tcaagttgtt tcgcttcatc tctcccaaat tcaaaaaaat accagtctct ttagtccctt 3960 ttctgaaacg tttgactgct ctttttctac ttcttgatat tgaattctca gaccctgatc 4020 tccctcatgt gtagagattc actttgatgt ttttacctgt tacagcagtg gaccgactgc 4080 agtggctggc caggaactgg aaacacagtc cctccagtgg ctggccagga attgaattac 4140 tcgttttttt taaaactcaa ctacataaca ctacatagct atactactca ctagtctatt 4200 tctctctcaa caacttcaac tcttgtgatt cttctaccaa attcccaaaa aaatattcaa 4260 gtttttccgg aaaaaaaatc aagatgtcta 
attgctctga aacgcccaaa aacacagatt 4320 ttgagggtcc ttatgcagat ggaaggaata ttgggacctt tctccgaaaa aaaaatggtc 4380 ctttgtcaac aggtaggaat ccgagttttt tttgtgaaaa ccggcgaaat ttgtgtaggt 4440 aggaggggaa aatttcggga tttccattag gagggtatat cggggctttc cagggtcctg 4500 ccagaatacc gacaaggtcg agcaaaggct tgacgatccc gaacagctgc tgcgtggacc 4560 tcgagcaaat tctgcttcca actctgggtc cactaactgt ccgccacagc tttattctga 4620 tcttgcaaga acgttacatc cgaaaacacc acagcctggc gagaaaatct tagcagcctc 4680 caaatctgta gccgagctgt ttcatcttaa ccccgttcca ttgggaagcc tctcacccag 4740 ctcatccgtc ttactatcca atatcctcca ggatcgacct tctacgcgtc gaaaatcttt 4800 tgttcctggt tccttgccca ataacactgg tatttcgatc gacgcgacag tcttacgtcc 4860 cctacctacg cctgatcaga caccgaatat gtcagaagag gtaggaggag tgacatctgc 4920 tgacgacgta caatctgaaa tcaaacgtat cctggaggag agcgtagagg ctttgactcg 4980 aatctccgcg gacaacgatg acttacgtgc agagatggcc gatcttaagg aattgataca 5040 tcagatgatc caacaaaacc gtgcagacag cgcaccgcct gaagatcgtg gtccaacagc 5100 caccacctcg actcctgccc atgagacagg tgggttgcca ccttcgcccc atcctgactc 5160 gacgatgcga gcgcaggctc aggagggatc aaacgcgtcc actggtgacc aaccctcctt 5220 cgctcgaacg aaccctttcg ctgcgttttg caccccacct gtaaaccgcc aacctgctcc 5280 actaccccaa ccgtacgttc atccgaatgt ggccccaccg atctcatcta tgccaggcaa 5340 cacttatgcc tattggcatc cagcggcggc tcctgttgag cgcgaagtac tcgcgcgtca 5400 cctcaaggat agcgagatac cacaatttac gtgcgcctat ggtgatgttg caggatttag 5460 attgtggcga taccgcattg aagcccgctt taaagtcaaa ggtctcgaca gcgatacgga 5520 gcgactcaag attttagcgg ctgcgttggc agtgcctcat gctgttcaat ggcaccgtac 5580 gcacgaggca gagcttagtg gcaaatcgtg gaatgaagcg atggaaatgt ttagtcatgg 5640 agttttacca tccgggtggt tgcgagatgc acgtcaagcg ttacaagatc tagcccagaa 5700 gcctggtgag actatgtcga actacatcat aagagcgcgt gggattcaag atgtggtgac 5760 tacggcggtg tgtgaagatc gagatttagc tgagagaatt gtgggggaac aggtctaatg 5820 tttcgagagt cagcggccaa ggatgattga ttagaaaatg tgattgattt acgtacggga 5880 acctggtcct tctctttatt cgaaaagcgc gcccttgata ctgctcgtta tcttcaagcc 5940 attgattcca acccgtttcg tttgtcagca cgtaatcaac 
cggcacctaa tcgacaaact 6000 aatggctccg gcacgtctac tgcatatacc ccttatgtgc ggccgccagt tgatcctgac 6060 gcgaggagag aatggtgggc agcgtttatg cggtccacag gcaggtgccc tcgttgcaag 6120 gtgcagtgtg ctaagtggtt ggggggatgc gaggaacggc cgaacatgac cttcgtatca 6180 atgcctttag attttccccg ggcaccacct tatcctcctc cgaaagcggt acagaaccca 6240 ccagcgaatg ctaccaactc aagccgacca ggagcgccta ggagagggcc agtcattccc 6300 atcgcgagcg ctcaagtcgt atcttaacct ggcgcgacct ctactttcca agagttcccg 6360 gatttaggca agacggatgt ggccgcttat ggtgaactgg tcgcggcgtt gggaatggag 6420 aatgatgggt atgaaattga ggctcctgtt gcacctatcg tgttgaaact aacgataaat 6480 ggtgtcgaaa tccgcgcatt aatggacaca gcggcggcga caaacctcat gtcgaatcga 6540 ctagcacgaa agctgaggct ggtacggcgt aaacttctga aaccaactct aatcagacta 6600 gcgattgaca caagtgaggc agatgttagc ttaactgatt acgctatcgc ctcggtcaaa 6660 agtactaaaa tgacctttgg tgccacgttc ttcaaactag ccgaattgca cgatggccat 6720 tacgacgtaa ttttaggaac gcctttcctc aagaaacacg agttggatgt gtcagtggcg 6780 cgagggagtg ttgtatgatc cagcggttga aaaaatgtta gaaaataagt tgttggaaaa 6840 aaagaaagaa tttttaaatg aaagaagaaa agatttagtt gcggcggtgt tagataatgt 6900 agatgttgtt gaaaaagctt ccgacctttc ttatcgtgaa atgaaaatgc tgaaagaatt 6960 tgaagttctc tttcccgacg tattgccgga tgtggagtcc ctggattatg ctgagttttt 7020 tccaccagag gtacaaagtc caagctcgaa ggtccgtcat aacattgttt tgacagaccc 7080 taatgtagtc atcaatgaga gacaataccc ttacgcacca aggtatatgg atgcctggaa 7140 gaaacttgtg gatgaacacg ttgcagctgg gcggttgcgg aggtcagaca gtccttacgc 7200 tttgccttca atgatcatac cgaaagcaga cccaactgct gcacctcgat gggtgtgcga 7260 ttattgtcgt ctgaacaaat tcactgttaa ggatcgatca ccactaccaa atgttgagga 7320 ggcggttaga ctggtaggaa cgggaaaaat ttattccatt ctcgatcaga ttaacgcgtt 7380 cttccagaca ttgaacagac ctgaagatat tcccttgacg gcggtcaaaa cgccatgggg 7440 actctatgag tgggttgtca tgcccatggg attgacgaat gctccagcca cccatcaacg 7500 acgctgcgag gaagccctag gtgatttggt taacactgtc tgcgttgtct acattgatga 7560 catcgtggtc tactctcaat cagttgaaga acatgaacaa cacctacggc tggttatgga 7620 acgactgaaa gcagctaaac tgttttgttc aatcaaaaag acaaagcttt 
ttcgtcgtag 7680 aatcaaattt ttaggacaca acatcagtgc agagggaatt agtccagatg aagataaagt 7740 tgagaaggtt gcaagttgga ggacaccgaa gacagcaaaa cagttgaagg agtttttggg 7800 aacggtgcag tggatgaaga agtttgtgga tgggttggct caatacgctg gacacttgac 7860 acctttgacg agtgtgaaaa atgcggctaa gaaattccaa tggggagata aagaacaaga 7920 tgcctttgat aatataaaga ggattattac tacactacct actttaaaga atttggatta 7980 tgagtcaaat gatcctatct ggttgtttag cgacgcgagc ggccacgcgc tgggagcggc 8040 actgttccaa ggcgccgagt gggaaacctc gtctccaatc gcgtacaaaa gtcggcaaat 8100 gtcacctgcg gaacgaaatt atccagttca cgaacaggag ctactatctg tcatcaatgc 8160 gttaaataag tggaagatgc tgctattggg tatgaaggtc aatgtcatgt cggaacacca 8220 ttcattaacc caattgatga cacaacaagg tctcagcagg cgacaagccc gatggcttga 8280 gactttatcg caatttgatc ttgacttccg ttacgtaaaa ggcttagata atagcgttgc 8340 agatgcattg tcgaggatgg aggatgtggc tactcttgag gtgaaatcag aattgtcaac 8400 cgacttgacc gatcaaatca aggagggtta caggacggat tcgttctgtg agaggttagc 8460 aaaggtgttg ccattgcggg ataactgcgt gatacaagat ggcttaatgt ttatggatgg 8520 tcgaatagtt gtgccgacgg ttgatgggtt acagagggag ttgataggtg ttgctcataa 8580 ttctgtgggc catttgggat cgctgaaaac tgctgagcgg gtgaggagag agttctactg 8640 gccacacttg ttgcaggacg tcgaagactt tgtgaagaaa tgtgacagct gtcagcgcaa 8700 caaagcgcgt acgacccggc tacctggcaa acttcaaagc acgcaagttc ctcttagacc 8760 tatggcggat attgcgttgg acttcgtggg accctttcct aaagtccagg ggtatgatat 8820 gttgctgaca tgcacgtgta ggctaacggg attcacgcga cttattccaa catgtcaaac 8880 tgataatgcc gaaaagacag caagaagact cttcggaagc tggctgacga tctttggtgc 8940 tccacagacc atgattggcg acagggacaa aatttggacc tcgaaattct ggaaagagct 9000 gcagtctcta ctaggggtac gtgttcacct ttccaccgca taccatccac aagccaatgg 9060 tcgagcggag cgcaccaata agacggtagg ccagatgtta cgtcatcatg tgggggggaa 9120 gcatggtaaa tggttgcctg ccttacctgc tgtcgaatat gcaattaatt cagcgatcaa 9180 cacggcgaca ggcgtttctc ctatgcagtt tgtttttggt tatgtaccaa aactgttccc 9240 aattgatgtg aagcaggcag aggttagtga aggagtgagg gagtgggttg agaagagaca 9300 agagggttgg gctgcatgga gagacaagct gtgggcggca agggcgagtc aagcagtcag 
9360 ctataatgca cgacggggag ccgacatcgt tttacgcgag ggagactggg tcctcatcga 9420 cagtaaggac cgacaacaat cagtcaaggg tccagttgca aaactccgag caagatacga 9480 tggaccatat gaggtgattg aagtattgaa tgaaggtcga aacgtcaagg tccgattgga 9540 cgacggcaac cgcagtcacg acatgtttca ccagtcaaag cttcggaagt actgttcgga 9600 cgaactggag atggatgcat aacaaggagt ggcaagtact tgccgtagca gaagtacgtt 9660 tttctccttc gtatgcaccg cctagtgggt ttctaccgtt tgtaagcaaa acctcagcca 9720 cgctgtgagc acaagatttt cgccaatttc ttttgtagta atgcaacggc cgggctcagc 9780 atcgacaact tgggcggatc gagtttcttt tatttctttt ctcttttctt tcttttcttt 9840 tattttaatt ttctttttgt acttttgttt gctttcagtt gcttttaact ttctgttttg 9900 attttccgtt tttctttttc ttctttttct tttattttaa cttggaaatt ttgtgggggg 9960 ggatttttct tttataggag gggagggtgt aaggtgtcac acttacaaac acacatgtca 10020 catactatac agtacagtag atacaattag agtctagact acccttgttt cctttcctct 10080 tctctttttc tctttatcgg tttcttcagc ttcttcagtt tctattcatg ttatcagggt 10140 tgttgtgatc tgatgttctt tatgaactag gcctagtttt caaaattcac tttggttcta 10200 gataactttc gtgcctgctc gtgcccaatc agtgttgttc ctccatagga acttccacaa 10260 acctctcagt gtgattcact ctgatgtttc atcttattct gttgtatctc gaagaggatt 10320 tgaatacttt gtaaccttta ttgatgatta tacaagattc actaaaattt atctaatgaa 10380 atcaaaaaat gaagttcttt cgaaatttaa attattcttc aatgcgatga caacgaaagt 10440 tgatggagtt attgctgagt tacgttctga caatggtacg gagtatatca acaaagaatt 10500 cacttcattc tgtgcagaaa aaggaatcac tcaaacaacc ggaccacctg actcaccaca 10560 actaaatggt gttgctgaaa ggtgtaatag aactgtaaaa gagaaagttt gttgctgttt 10620 gattgaatct tctttgccag atagtttttg gggagacgct ttggagtatg ctattgagac 10680 gttgaacaac atgccgacaa gaacaaacga aaaatttcaa tcaccaattt ctttactggg 10740 tattcatgaa cgtcaagaac ataccttcca tgcatttggt tgtcaggtat ggtttcatgt 10800 tataaaacca aattcgacgc tcaaacctcg agcacgtcat gctatttttc tttcttattt 10860 aaacaataac aaaggtgcta tagtttggga caccgaaaag aagactgctt taaaagcaac 10920 ttctttagtg tttcttgata aagtatttcc aggattaaaa tcaccaaaat cttgcaaaac 10980 acaggagcca attggaaacg acactcttcc atggccaact cttgaatctg 
acgaacaaat 11040 cattacaaaa gattcattac catctgattc aatcacacaa gaggaatcat ctcgaccaaa 11100 acgaacgaca aaacctccga ctcgttttgg taactttgta aattcagcgt ctacacgaaa 11160 acagcctaaa aggtatggag agtacggtag agctgctttt attgaatcag aaaaagaacc 11220 atctacttat aaacaggcca agaaatcagt caactgggat ctttggaagt tggcggccga 11280 agcagaaatg tgttcattag tggggaagga aacttggaaa ctggttccta gacctgctcg 11340 acgtaaaatc atccaatgca agtggatttt caagacgaaa agaaacatcg acatgtcaat 11400 tgataaatta aaagcaagac ttgtcgcatt aggattttct caaattaagg gaattgattt 11460 ttcagaagtt ttttcaccaa cttcacgtca ggagtcaata agactatttc taagtctgat 11520 ggcgaaaaat gggtggaaag gagtagcagt agatattaaa actgcattct tgaatggaga 11580 tctcgatgag gagatctaca tggagcaagc tgaaggcttt gtagacgaag atcatcctga 11640 ttgggtttac agaatcttaa gatctttata cggtttgaag cagtctccgc gtcaatggaa 11700 cataaaatta catgaatatc ttatttccat tggttataca cgatcagcac acgatccttc 11760 aatttatttc aagaaggatg atcaaggtga aatcaattca atgttggtaa ctcatgtcga 11820 tgacatagca gtaacaggaa ccgacaaaga aattgataac ttttgtgtac tcatcaaaaa 11880 gagatttgag gtttcttcag ataaaccatt atctcacttt ctttctctta atatttcaaa 11940 agttaaagaa aacaaaataa cagttgatca aacacattat atatcaaatc ttcaaaaaca 12000 atttatgaag tataatgtca agtgttcgaa tacaccgaat gacaaatcat tcaagaattt 12060 agttcctaca tcaaatgagg atacaatttc tgaagatctg tattcaagct taatcggagg 12120 tctgttgtgg gttgcccagt gcactcgacc tgatatagct tttgttgtta ataggctttc 12180 gcagtttttg aaaaagcctt tgacaataca ttggaattca gcattaaggg tattagggta 12240 tttggttaag acgaaggaat tgaagttaaa tttaggagga agtaaattga atccgactgc 12300 gtattcagat gcggattggg cagaagatcg tcatgaaaga agatcaacta caggatatgt 12360 ttttatgtta ggtattggac cggtttcatg gagatcgagg aagcaaagga caacgtcatt 12420 gtcgagtaca gaggctgagt atatggccat gagtgactca tgcagagaag cacgttggtt 12480 ggtatcaatt ttaaatgaac tcaacatttc aagagggaaa tcatttaaat tatgtgtgga 12540 caacgaaggg gcagaagctt tagcacaaaa tccgtctcat cattctagaa caaaacacat 12600 tcatacaagg taccattttg ttcgtcagtg tgtggaagaa ggaattgtta aactagttca 12660 tgtgtcttcg tcaaaaatgc tagcggatat 
gtttacaaaa gttctttcta aaattttact 12720 aataaagcat aggttatcat tgagtatttt ttaacaagta tgttattagc acggggggg 12779 // ID Copia-1_AM-LTR repbase; DNA; FNG; 177 BP. XX AC ACDU01003803; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Allomyces macrogynus genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_AM_; KW Copia-1_AM-I; Copia-1_AM-LTR. XX OS Allomyces macrogynus OC Eukaryota; Fungi; Blastocladiomycota; Blastocladiomycetes; OC Blastocladiales; Blastocladiaceae; Allomyces. XX RN [1] RP 1-177 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Allomyces macrogynus genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; ACDU01003803; Positions 64884 64708. XX SQ Sequence 177 BP; 31 A; 42 C; 40 G; 64 T; 0 other; tgtcgaagtg ttcgtctcgc atttgtgttt cttgttattt gtcttccctc attgtttcct 60 tgttggagct gttcgagttg gattcttgtg taaatgatgg tggcaaccta ggttatggtt 120 gccaccacca cctgaagtgc agagtgcaca caccaaatgc attccttctc ccgctca 177 // ID Gypsy-112_MLP-LTR repbase; DNA; FNG; 192 BP. XX AC AECX01000667; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-112_MLP_; KW Gypsy-112_MLP-I; Gypsy-112_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-192 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000667; Positions 413 222. 
XX SQ Sequence 192 BP; 56 A; 54 C; 29 G; 53 T; 0 other; tgttatgaac tgaacacgtg acactttaag atcacatgtc acagatttag attatcactt 60 gtgactgcgc cacgttgtac tttctctttc ccttcatccg acaatcatca taaggaaaga 120 agatactaga taaagaaccc tgattgactc cttcaacccc gtcgccatta cccctgagac 180 ccgcccttaa ca 192 // ID Gypsy-2_RO-I repbase; DNA; FNG; 5146 BP. XX AC AACW02000235; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_RO_; KW Gypsy-2_RO-LTR; Gypsy-2_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-5146 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000235; Positions 282704 287849. XX CC 'TGGTT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(48..1427,1431..4979) FT /product="Gypsy-2_RO-I_1p" FT /translation="MESPKQDNRPMEQDPSDMEIEQASQDPSLSDDGSEVI FT PKINANAHEITSDHDDIDAEVSKLLRYKKEESARLYQKHIMAGEEEQADLA FT FNKMRKYGAQLLALSGKLVPLTSTPVKKEIGLRLTQNDIPKFQLTSWPDIP FT FPGEKCYESVEHFLRSFEKVIYSAALDIQEVWRRYLPVSLAFDHDDWVEKD FT LKRCDTWTSARIVFTNKFLTDNARTDAMHRLFTMTMNKHESISSYMTRFLS FT TCIEAGANKNDPMVAARFKASFSEPVLEKCKTAFVLKENKNVPWTVERAAE FT IARNLLGDDPRTYTSSSYANIASGSKRIHTNAGPFSIPKRNKGHGSSGAFF FT CSRHGGSNANHNEDDCFSKTNKKNDAASTVKNKYYGNKNFPQKASTSKPCR FT YCGRSWSHGHQCQEYADFKRKLHVDPKKPRDSDKSIYAIVRTTNDENDDDH FT DMTEAENHAYDCKALNKITEDEDTNPFKLLTPILIEGKKLIGKIDTGSDIT FT CISKICLNNKLNIDKVNSVNGSLNFLSSDNNCKRIGQTDPLTLTYMNNISF FT KHSFEVVNFNETMTQEFDVLLGTDVLGKMNIGLTGVAYKYPGNEDDYSNRL FT KEDAQFLNMNFDANNEIEPNNSPYGTEDQQKQFMAYIQNAIDDNQNIPAGS FT FCTVPESVVELPMEKGAKSFKRQYPIAHRQMPELEEQINEWLKAGIIKENR FT ANTSFNSPLLMVPKKDENNEITSHRVCLDVRGINKLIPDVVYPVPKIRSIF FT DNLAGGEVFTKIDLKNAYNSFLVAPKDQHKLSFQFKNRTYTFQGCCFGLKT FT VTAIFCKVMKILFADMDFIETYVDDCVIKGTADNHASNVKKVIERLTSVNL FT KINVKKCSWYQRSIYLLGFVVESGGIIKVDPRRLTNIDSWPIPRNKKEVMR FT LMGIVSYMRDFIPLISRVAAPIDRLRNDPDVQNNWTQEHTDAFIALKEILK FT SKTLLHTPDLSKKFYVATDASQYGVGAVLTQRDELNRTLHIAFASKSLSPS FT QRKWNTTKRELYAVVFALEKYREFLWGNKFELRTDHKALMYLHTQEQANPM FT MIGWLETLLDFNFDVIHIQGILNQLPDLLSRLYEPCLNNEFMLGGGDAAQK FT ENKKEKKEIKNKNQNKIKVKRQTNSTAKRHNLIYSQDKKLNVLAVKIKNKK FT FANNPTTDYMTPPPEKRARLLEDAHTFGHFGSTSIVEQLHSQGLHWNNIYK FT DATETIKSCIECQRHNISKKGYHPLKNILAYLPFDHIGIDLLGPLPVTANE FT NVYVLVMIDVCTRYIILRPLKNKQSDTVAKELIKIFGDYGIPKIIQSDNGR FT EFKNSLMHSLSKHLGIDRRYSTAFHSRGNGVSESAVKTTLNTIRKMVNSNS FT NDWDDVLPIVQLASNYKIRHRSKSAPFSLMFARRLNSFEDYGDQSIADKIP FT NRPVTEQELIERTDRMYQIVFPAIRERTQKIIEEASKRFNDKNMIIDLPKD FT TAVMVRLPGRTSKLSPLYEGPYIVVRKTQGGSYVLKDEQNELLHREYVPSE FT LKIVSIDETAIEDTYYEVEDIRDHRGGPGQREYLVKWAGYGERENTWEKAS FT SFSSTVPIDKYWLKRKELKELEQARKQKMIANEKDYTKKTSARPNVTEENK FT NHKRKDMVQDSTVLRKSKRIRNNRN" XX SQ Sequence 5146 BP; 1861 A; 903 C; 986 G; 1396 T; 0 other; ttttttaaaa ttaatcatcc ttctggttct taccctggta 
aactgctatg gaaagcccta 60 agcaagacaa tcgacccatg gaacaagatc catctgatat ggagatcgaa caagctagtc 120 aagacccttc tctttctgat gatggctctg aagtaatccc aaaaataaat gccaatgcgc 180 atgaaattac ctctgatcat gatgacattg atgctgaagt ttcaaagctc ctaaggtata 240 aaaaggaaga atctgcccga ctgtatcaaa aacatataat ggctggtgaa gaagaacaag 300 cagacttggc tttcaacaaa atgagaaaat atggtgcaca gttgcttgct ctctctggga 360 aactggttcc ccttacgtct actcctgtca agaaggaaat tggtttgcgg ttgacacaaa 420 atgacattcc aaaattccag cttacctctt ggcctgatat cccatttcct ggtgagaaat 480 gttatgaatc tgtcgaacac tttttgcggt cctttgaaaa ggtgatttat tctgctgctt 540 tggacataca ggaggtttgg cgccggtatc tccctgtttc gttggctttt gaccacgatg 600 actgggtcga aaaggatctg aaaaggtgtg atacatggac ctcagctaga attgtcttta 660 cgaacaaatt cctgactgac aatgctagaa cagatgcgat gcatcgtcta tttacaatga 720 ccatgaataa acatgaatct atctcttcgt atatgacccg ttttttaagc acttgtattg 780 aggctggggc gaataaaaat gatccaatgg tggctgctag atttaaagca tcattctctg 840 aacctgttct ggagaagtgt aagacagctt ttgttttgaa agaaaataaa aatgtaccat 900 ggactgtaga gagagctgct gaaatagcaa gaaatttatt aggtgatgat ccaaggacat 960 acaccagctc atcctacgcc aatatcgcca gtggtagtaa aagaattcac acaaatgctg 1020 gtccttttag tattccaaaa agaaacaaag gtcatggttc gtctggtgct ttcttttgct 1080 caagacatgg tggttcaaat gcaaatcaca atgaagatga ttgctttagt aaaacgaaca 1140 aaaaaaatga tgctgcgtcg accgtaaaaa ataaatatta tggcaacaaa aattttcctc 1200 aaaaggcaag cactagtaaa ccctgccgct attgtggtag aagttggtcc catggccatc 1260 aatgccaaga atatgcagac tttaaaagaa aacttcatgt cgacccaaag aaaccaagag 1320 attcggacaa gtcaatctat gccattgttc gtactacaaa tgatgaaaat gacgatgatc 1380 atgatatgac agaagctgaa aatcatgcat atgactgtaa ggcattatga aataaaataa 1440 ctgaagatga agatactaat ccttttaaac ttttgactcc tatattaata gagggcaaga 1500 aattaattgg caagattgac actggcagtg atatcacttg catatcaaaa atttgtttga 1560 ataataaatt gaatattgac aaagtgaatt ctgtaaatgg ttctttaaat tttttgtctt 1620 cagataataa ttgtaaacga attggacaaa ctgatcctct cactttaact tatatgaata 1680 acatctcttt taaacattct tttgaagttg taaattttaa tgagacaatg acacaagaat 1740 
ttgatgtctt gcttggaact gatgttcttg gaaaaatgaa tatcggtctt actggagtag 1800 catacaagta cccaggtaat gaagatgatt attcaaatag attaaaggaa gacgcacaat 1860 ttcttaatat gaattttgat gccaataatg agattgaacc taacaactca ccctatggta 1920 ccgaagatca acaaaagcaa ttcatggctt atatccagaa tgcaattgat gacaatcaaa 1980 atattcctgc tggatctttc tgcacagtac cagaatctgt agtagaatta ccaatggaaa 2040 agggagcaaa gtcctttaag agacaatatc cgattgctca ccgacaaatg cctgaattag 2100 aagagcaaat aaatgaatgg ttaaaagcag gtatcattaa agagaaccgc gctaacacat 2160 cctttaatag tcctttatta atggtaccaa agaaagatga gaacaatgag ataacgagcc 2220 acagagtatg tttagatgtt cgtggtatca acaaactaat ccctgatgtc gtttatcctg 2280 ttccaaaaat aagaagcata tttgacaact tagctggtgg tgaagtgttc actaagattg 2340 acttgaaaaa cgcatacaat agtttccttg tagcccctaa agatcaacac aaactctcct 2400 tccaatttaa gaatcgaaca tatacctttc aaggatgttg tttcggtcta aagacagtta 2460 ctgcaatatt ctgtaaagtg atgaaaatac tcttcgcgga tatggatttc attgaaacct 2520 atgttgatga ctgtgtaatc aaaggaacag ctgataatca tgcttcaaac gtaaaaaagg 2580 tgattgaacg attaacttcc gtcaacctaa aaatcaatgt aaagaagtgc agttggtatc 2640 aacgatcaat ttatctttta ggatttgtag ttgagagtgg tggtatcatc aaagtagatc 2700 cacgccgcct aacaaacatc gatagttggc ctataccacg caataaaaaa gaagtgatgc 2760 gattaatggg tattgtatca tatatgagag actttatacc tttaatctct cgtgtagctg 2820 cacctatcga cagacttcgt aatgatcctg atgtacaaaa caactggacc caagaacaca 2880 ctgatgcatt cattgcttta aaagagattc taaagtcaaa gacactgttg cacactccag 2940 acttgtcaaa gaaattctac gtggctactg atgcctcaca atatggtgtt ggtgcagtat 3000 tgacccaaag agatgagctg aatagaactt tacacattgc atttgcatcc aaatctttgt 3060 caccctctca acgtaagtgg aatacaacta aacgtgaact gtacgcggta gtgtttgcac 3120 tagaaaaata tcgtgaattt ttatggggaa ataaatttga attacgtact gaccataagg 3180 ctttaatgta tttacatacc caagaacaag caaaccctat gatgataggc tggttagaaa 3240 ctttactaga tttcaacttt gatgtaattc acatacaagg aattctcaat caattaccag 3300 acttactctc tagactttat gaaccttgct tgaataatga gttcatgctg ggagggggtg 3360 atgctgcgca aaaagaaaat aaaaaagaaa aaaaggaaat taaaaataaa aatcaaaata 3420 aaataaaagt 
taaaagacag acaaactcta cagcaaagag acacaacttg atctatagtc 3480 aggacaagaa actaaatgtg ctagctgtga aaataaaaaa taaaaagttt gctaataatc 3540 ccacaactga ctatatgact cctccacctg agaagagagc acgtctactt gaagatgctc 3600 ataccttcgg acacttcggt agtacgagta ttgtggagca gttacattca caaggacttc 3660 attggaataa tatttataaa gatgctactg agacaatcaa atcatgtatt gaatgccaga 3720 gacacaatat aagcaagaaa ggttatcatc ctttaaagaa tattcttgca tatctaccat 3780 ttgaccatat tggaattgat cttttaggac ctttacctgt tacggcaaat gaaaatgtct 3840 atgtccttgt gatgatagat gtatgtacca ggtacatcat actgcgtcca ctaaaaaata 3900 aacaaagtga cactgtagca aaagaattaa taaaaatctt tggtgactat ggcataccta 3960 aaatcattca aagtgataac ggcagagagt ttaagaatag cctcatgcac tcattatcca 4020 agcacctagg aatagatcgc agatacagta cagcctttca cagtcgaggt aatggcgtga 4080 gcgagagcgc tgtaaagacg acattaaaca ctataagaaa aatggttaat tctaacagca 4140 atgattggga tgatgtatta ccaattgtcc aactagcgtc gaattataaa ataagacacc 4200 gttcaaagag tgctccattt tccttgatgt ttgctcgaag acttaatagc tttgaggatt 4260 acggtgatca gagtatagct gataagattc ctaatagacc tgttacagaa caagagctaa 4320 ttgagagaac tgatcgcatg taccagattg tgtttcctgc cattcgtgaa agaacacaga 4380 agatcataga ggaagcaagc aagaggttca atgacaaaaa tatgatcata gatcttccga 4440 aggatactgc agttatggtt agattaccag gacggactag taagttatca cctctttatg 4500 aagggcctta cattgtcgtt cgaaaaactc aaggtggttc ctatgtctta aaagatgagc 4560 aaaatgaact gttacatcgt gaatatgtcc catcagaact caagatagtc tctattgacg 4620 aaacagctat agaagataca tactatgaag ttgaggatat tagagatcac agaggcggtc 4680 caggacaaag agaatacctt gtaaaatggg ctggttatgg tgaaagagag aatacgtggg 4740 aaaaagcatc atccttctcc agcacagtac ccattgataa atattggctt aagagaaaag 4800 agttgaagga attagagcaa gcacgaaagc aaaagatgat agctaatgaa aaagactaca 4860 ctaaaaagac ttctgctaga cctaatgtga ctgaagaaaa caagaaccat aaaagaaagg 4920 acatggtaca agatagcaca gtccttagaa aaagtaaaag aataagaaac aatagaaact 4980 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aagttcatgt agcagagcgg ccgctgactt 5040 cataaacttg ctgagcatca agatacataa attaaccaat aagaaaataa catttagtat 5100 ctgcatgaat gccgacttga 
ctattgtcaa gtctggggga ggataa 5146 // ID Copia-5_MLP-I repbase; DNA; FNG; 4187 BP. XX AC AECX01002168; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_MLP_; KW Copia-5_MLP-LTR; Copia-5_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4187 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002168; Positions 6209 2023. XX CC Positions [1567-2067] - Integrase core CC 'AGGCA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 13..4125 FT /product="Copia-5_MLP-I_1p" FT /translation="MSTDPDTTSSAPPKTKRSYEAPATMNPALAAATTALA FT AAPAAIRMQLTDKNYNVWTTWMESEMDNLNIFSVVDGTYVLQENAPADELV FT TYSKLERKAYNFIVRHLDKNNLALATQFSSKAGTAKTGAGLWKHLKEKYMP FT NNVFHQTHVLHTFLHIKYQTSTQFCSEVRSALATLSRVGLDIPERVLVVLI FT LARLPRSMESFIRVVSHSMNAKTEVNFILNRLEQDAHQQATMDEVLPSTAL FT ISTGNKNPSHMCTHCKRPGHLEPRCWIAHPELKPSYARGRKVNANIASYEA FT PVATSSANVYSTPTAVYAGVDLEDPLSAPSFLTSNFALNASLTPVSTLLDS FT GCSDVMYANRDAFSSYSDFLCNINIGEVGRTVQAIGRGVVTVQGAGGIVTF FT ENALHVPVLPYNLIPLSLLWQKGSQILYHGDDTFSVVRGDRLVFNGVVKNR FT LLFPSISTIISTLDNTNIVSFNLWHDRLGHINHDYLKSTAAQTEGVDDLPG FT KDFCESCAISRSTRLPFSGKIPRPSHPLDVVHTDLSGRINPPTPSGYEYFM FT KITDGSTSYRWLFLLRFKNEALDKFKEWKAQAERSSGRVVKKIVSDGGGEY FT VNKLFYEYLKNEGISHDVTASHTPQQNPFSERGNRTTNEKARTMLHRASMP FT AFMWGAAVLTAVYLENRTVSKSCDGMTPFEALYGSVPSIGHIRIFGCAAYR FT HLTHGRDGKFGPRAEKLILIGFVEGMKNYRLYDPSTGTISTSHDVRFNEAE FT FPYLQLSNDPNAEIVPFTEEEILGREPLRIRLHQPVIAEEIRQERHILTAT FT ALISIIGNSIDESANLSTLDLEPNPPTYARALLTRNSSKWQRATFEEFDSF FT AANHVFEIVDSLPPGHKPISTRFVYTRKFDDSGFLNKFKARCVARGFLQEE FT 
GKDFEETFSPTGRLATLRAVFGHAATEDLEIVQADFVTAFLNSTLGEGEVV FT YILIPDGFVDWIKQLPLESPFRAWLPALIERCGSLWLKLLRSLYGLKQASR FT SWYLTIKAWYYKHGFIVSDADACLFVWVGKNGDRVLIYMWVDDIVLVGSNV FT RWVLDELKKDFRIKDLGPVKMMLGMEIKRNREAKILSICQSKYINELLKVY FT GMEDCKSLGTPLQSNLPILPGTEEEVLSFKSSGLNYRRAIGSLNYLSQCTR FT PDLSHPVSLLSQFLDKPTMDHWNHFKCVLRYLRGTSTLCLTYGVQPVNETL FT LKFQPDISGPPVAFSDSNWAGCTISRRSTSGYTFIFNGGAVSWRCKKQPTV FT ALSSTEAEYKGFLDAGQDGMWIRRLMADFGYDQLISTTLFGDNQGSIALSR FT NPVFHSRTKHIEIQFHWIREKVQDKSIDIKFCSTNDMIAIFSQSPSTGQNF FT NNSDVISA" XX SQ Sequence 4187 BP; 1086 A; 976 C; 848 G; 1277 T; 0 other; gtactttact ccatgtcaac cgatcctgat acaacctcat cggcaccacc aaaaaccaaa 60 cgatcttacg aagctcctgc tacgatgaat ccggctcttg ctgctgcgac gaccgcgcta 120 gccgcggctc ctgctgccat tcggatgcag ttaactgaca agaactacaa cgtttggacc 180 acttggatgg aaagtgaaat ggacaacttg aacattttta gtgtcgttga tggaacttac 240 gtgctccaag aaaacgcgcc agctgatgaa ctcgtcactt actcgaaact tgaacgcaag 300 gcctataact tcattgtccg acacttggac aagaataact tagctttggc cactcagttt 360 tctagcaagg cgggtacagc taaaaccggc gcgggactat ggaagcatct caaagaaaaa 420 tacatgccga ataatgtatt tcatcaaacg cacgttttac atacctttct ccacatcaag 480 tatcaaactt caacacaatt ttgcagcgag gttcgatctg cccttgctac tctttcacgt 540 gtcggtctcg atatcccgga acgcgtcctt gtcgtcctta ttcttgcgcg tctcccccgc 600 tcaatggagt cgttcattag ggtagtctca cattccatga acgcgaaaac cgaggttaac 660 ttcattttaa accgactcga gcaagatgcg catcaacaag ctactatgga tgaagtttta 720 ccttctactg ctcttatttc gactggcaac aaaaatccca gtcatatgtg cactcactgt 780 aaacgacctg gccatcttga gccacgatgc tggattgctc accctgagct taaaccttcc 840 tacgctcgtg gacgaaaagt gaacgccaat atcgcttctt acgaggcgcc tgttgccact 900 tcttcggcca acgtttactc tactcctact gctgtctatg ctggagtaga tcttgaggac 960 cctttgtcag caccatcttt tctaacttcc aacttcgctc tcaatgcatc acttacgcct 1020 gtatctacac ttctcgactc aggttgttct gatgttatgt acgccaaccg tgacgctttc 1080 tcaagctact ccgattttct atgcaacatc aatattggtg aagttgggag aactgttcaa 1140 gccattggca gaggagttgt tacggtacaa ggtgccggtg gaattgtcac ttttgaaaac 1200 gctctacatg tacctgtatt 
accctacaac ttaattcctc tttcactttt atggcagaaa 1260 ggctctcaaa tcttatacca tggtgatgat actttttcgg ttgtacgcgg tgataggctt 1320 gttttcaatg gagttgtcaa gaatcgtctt ttatttcctt ctatttccac tattatttca 1380 actttggaca atacaaacat tgtttctttc aacttatggc atgatcgcct tggacacatt 1440 aaccacgact atttaaaatc tacagctgcc caaactgaag gagtagatga cttgccaggt 1500 aaagattttt gtgaatcctg tgctatcagc cgatctaccc gacttccttt ttctggaaaa 1560 attccccgtc cttctcatcc tcttgacgtt gtccacactg atctttctgg tagaattaat 1620 ccacccactc catctggcta cgaatatttc atgaaaatta ccgatgggtc cacatcttat 1680 cgttggttgt tcttgctacg cttcaagaat gaagctctgg acaagttcaa ggaatggaaa 1740 gctcaagcag aaaggtcttc tggcagagtg gtgaagaaaa tcgtcagcga tggaggagga 1800 gagtatgtca acaaactctt ttatgaatat ctcaaaaacg aaggtatctc ccatgacgtt 1860 accgcttctc acacaccaca acaaaaccct ttttctgaac gtggcaaccg tacaactaac 1920 gagaaagccc gcacaatgct ccatcgtgca tctatgccag cattcatgtg gggtgctgcc 1980 gttttaactg ctgtttatct tgagaatcgg acggtttcaa aatcttgtga cggtatgact 2040 ccttttgagg ctctctatgg ttcggtgcct tccattggtc acattcgaat ttttgggtgt 2100 gcggcttatc gacatctcac tcatggacga gatggcaaat ttggtcctcg agctgaaaag 2160 ctgatcctca ttggtttcgt tgaaggcatg aagaactatc gactttatga tccttccact 2220 ggaaccatct ctacgtctca tgatgttcgg tttaatgaag ccgaatttcc atatttacaa 2280 ctttctaacg atcctaatgc tgagatcgtt ccttttactg aagaagaaat cttgggtcga 2340 gaacctttgc ggattcgact tcatcaacct gtcattgctg aagaaatacg acaagaacgt 2400 catattctta ctgcgacagc tttaatttca attattggca actcgatcga tgaatctgcc 2460 aacttatcta ctctggatct cgagcctaat cctcctacgt atgccagagc ccttctcacg 2520 cgaaactcct caaaatggca acgcgcaact ttcgaagaat tcgactcttt tgctgctaac 2580 catgtttttg agatagtgga tagtctccct cctggtcaca aacccatttc tacccgtttt 2640 gtatacacgc gcaagtttga tgactctgga ttccttaaca aatttaaagc cagatgtgtt 2700 gctcgtggat tcttgcaaga ggaaggcaag gacttcgagg aaaccttttc tcctacaggt 2760 cgacttgcta ctcttcgtgc tgtttttgga cacgctgcaa ctgaagattt ggaaattgtg 2820 caagctgatt ttgttactgc ttttcttaat tctacccttg gagaaggaga agttgtctat 2880 attctcattc ctgatggttt tgttgactgg 
atcaaacaac tgccattaga atcccctttt 2940 cgagcttggt tacctgcgtt aatagaacgc tgtggatctt tatggctcaa acttcttcgt 3000 tctctttacg gccttaaaca agcatctcgg agctggtatt taacaatcaa agcttggtat 3060 tacaagcatg gttttatcgt tagtgatgcg gatgcgtgtc tttttgtatg ggttgggaag 3120 aacggtgata gagtcttgat ttacatgtgg gtggatgata tcgtgcttgt tggtagcaat 3180 gttcgatggg tacttgatga attaaagaag gattttagga tcaaagattt aggaccggta 3240 aaaatgatgt taggaatgga aatcaagaga aacagggaag caaagattct ctctatttgt 3300 caatcaaaat acatcaatga actactcaag gtgtatggca tggaagactg caagtctttg 3360 ggtactcctc tccaatcaaa cttaccaatc ttacccggta ctgaagaaga agtgctatcc 3420 tttaaatcca gtggcctcaa ctatcgacgc gctattggtt ctctcaacta tctctctcaa 3480 tgtactcgcc cagacttatc ccatcccgtc agtcttttgt ctcaattttt ggacaaacca 3540 accatggatc actggaatca cttcaagtgt gtgctccgtt atttacgggg tactagtact 3600 ctctgtctca cttatggtgt tcaacctgtt aatgagactt tacttaaatt tcaacccgac 3660 atatctggac ctcctgttgc tttttcggac tccaactggg caggctgcac catatctcgt 3720 cgctctactt caggttatac ctttattttt aatggtggag ctgtcagctg gcgatgtaag 3780 aaacaaccaa ctgttgcctt atcctctact gaagcggagt ataagggatt tcttgatgct 3840 ggacaggacg gcatgtggat tagaaggctt atggctgatt ttggatacga tcaacttatc 3900 tctactactc tgtttggtga caatcaaggt tcaattgctc tttctcgcaa tcctgttttt 3960 catagtcgaa ccaaacatat cgaaatacaa tttcattgga ttagagagaa ggttcaagac 4020 aagagcattg acattaagtt ctgttctact aacgacatga tcgctatatt ctcacaaagt 4080 ccctcgacag gtcaaaactt caacaattca gacgtgatct cggcttgatt gagggtactg 4140 atttggctaa cgaggggggg gtttcggtaa ttaatcagtt tttgcct 4187 // ID Gypsy-20_LBS-LTR repbase; DNA; FNG; 562 BP. XX AC ABFE01001469; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_LBS_; KW Gypsy-20_LBS-I; Gypsy-20_LBS-LTR. 
XX
OS   Laccaria bicolor
OC   Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina;
OC   Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae;
OC   Laccaria.
XX
RN   [1]
RP   1-562
RA   Jurka J. and Kohany O.;
RT   "LTR retrotransposons from the Laccaria bicolor genome.";
RL   Direct Submission to RU (29-JAN-2011).
XX
DR   Genome; ABFE01001469; Positions 2998 2437.
XX
SQ   Sequence 562 BP; 137 A; 148 C; 109 G; 168 T; 0 other;
     tgtaacggat gtgattacag aacacttctt catcccttcg tactattatt tccgattcta 60
     cgctaactag cagagctctg ccctcgtggt cacttagtca cggtagtcac acttgcttac 120
     aattcttcca gtcacgcatg tagtatctca tctttttatc catcattctt gctactgctt 180
     gtagttctcg agtatcgtcc tcacgactcg atcacacgcg taacagggat attgtaagaa 240
     ggtgagtagc atagtgacat tcagaggaga gctcgtgact ctccagtagc tgattctttt 300
     atttcattcc gtcttccgca gtaagatctt actacatctt cttgcggcat tccgaggaag 360
     aagagtacgt ctacgctcgt ggcttaccag agcatcagta gctactcgcg tagtaagact 420
     gactcataag actcagatta gatctttgac agaggtcttc acaccatagt ggtaaccttc 480
     accacagaag ctcaccagtt ctaccttgtc gttgtcgctc gaaccccata aatctacgtg 540
     gtcctccgaa caggttccca ca 562
//
ID   Copia-8_LBS-LTR repbase; DNA; FNG; 298 BP.
XX
AC   ABFE01002315;
XX
DT   29-JAN-2011 (Rel. 16.02, Created)
DT   29-JAN-2011 (Rel. 16.02, Last updated, Version -1)
XX
DE   LTR retrotransposon from the Laccaria bicolor genome: long
DE   terminal repeat.
XX
KW   Copia; LTR Retrotransposon; Transposable Element; Copia-8_LBS_;
KW   Copia-8_LBS-I; Copia-8_LBS-LTR.
XX
OS   Laccaria bicolor
OC   Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina;
OC   Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae;
OC   Laccaria.
XX
RN   [1]
RP   1-298
RA   Jurka J. and Kohany O.;
RT   "LTR retrotransposons from the Laccaria bicolor genome.";
RL   Direct Submission to RU (29-JAN-2011).
XX
DR   Genome; ABFE01002315; Positions 15370 15073.
XX
SQ   Sequence 298 BP; 71 A; 83 C; 40 G; 104 T; 0 other;
     tgttgaggat tagaacccta cgctacgtac gctctttatc tctatttacc aactccttca 60
     tttacttccg tacttacaat tttctcgcta cttccgtacc tcgcacctca gtcgtttacc 120
     actacgtgta accgtctgag gtgtgtattc ttttacttat atatcttccg atactttact 180
     tactatctct accttagcta ccactaacgg tcttatccta acaggtagag attgacaata 240
     cagtcgttta ccactacgtg taaccgtctg agctaccact aacggtctta tcctaaca 298
//
ID   Gypsy-8_MLP-LTR repbase; DNA; FNG; 189 BP.
XX
AC   AECX01002120;
XX
DT   22-APR-2011 (Rel. 16.04, Created)
DT   22-APR-2011 (Rel. 16.04, Last updated, Version -1)
XX
DE   LTR retrotransposon from the Melampsora larici-populina genome:
DE   long terminal repeat.
XX
KW   Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_MLP_;
KW   Gypsy-8_MLP-I; Gypsy-8_MLP-LTR.
XX
OS   Melampsora larici-populina
OC   Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina;
OC   Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora.
XX
RN   [1]
RP   1-189
RA   Jurka J. and Kohany O.;
RT   "LTR retrotransposons from the Melampsora larici-populina
RT   genome.";
RL   Direct Submission to RU (20-APR-2011).
XX
DR   Genome; AECX01002120; Positions 167573 167385.
XX
SQ   Sequence 189 BP; 55 A; 50 C; 33 G; 51 T; 0 other;
     tgttacgatc ctcaagagac atgtaactgt tagtgaaata ccgggtagta gatgtcacag 60
     acacgagatt agactatgtt gtacaaacca cagttgtctt ttctctttct tcttcatacg 120
     acaatctgca tagcttatta gacacccaac cttgatcccc gccgagaacc ccacaccgag 180
     gccttaaca 189
//
ID   Copia-1_BFB-I repbase; DNA; FNG; 6059 BP.
XX
AC   AAID01001576;
XX
DT   25-FEB-2011 (Rel. 16.02, Created)
DT   25-FEB-2011 (Rel. 16.02, Last updated, Version -1)
XX
DE   LTR retrotransposon from the Botryotinia fuckeliana genome:
DE   internal portion.
XX
KW   Copia; LTR Retrotransposon; Transposable Element; Copia-1_BFB_;
KW   Copia-1_BFB-LTR; Copia-1_BFB-I.
XX
OS   Botryotinia fuckeliana
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina;
OC   Leotiomycetes; Helotiales; Sclerotiniaceae; Botryotinia.
XX
RN   [1]
RP   1-6059
RA   Jurka J.
and Kohany O.; RT "LTR retrotransposons from the Botryotinia fuckeliana genome."; RL Direct Submission to RU (25-FEB-2011). XX DR Genome; AAID01001576; Positions 1728 7786. XX CC Positions [3214-3747] - Integrase core CC 'CTACT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 433..5949 FT /product="Copia-1_BFB-I_1p" FT /translation="MAGVFPESPQTERYVPENPASSAGSKMASVIATEEYE FT ELSRRWTDHPIDIQSANAKEIRDFVKYKAYEYQRQKYYDQELWADYNLAFD FT GFTPAHFSQVSMRILYKLRHELRSCGVYVDKPEECTTRPVYEALVDTHRSP FT YRAWTPKEIIEAISEKEFVFRSRSIQQRIVIEKLRAPLGDPEMSRVKDLHA FT GPSGFNTYVLPQLPSISQRLERADKGKGVDIEHKVSIEPQPKTPPQPKTPP FT QLPSFAPGSGSIARKYYSPRKPPPLPTFQGFTGPPPGPPRINLEQPSGASP FT FYHNNQPSQQSPQQWEQHHDQHNPNQQYMHQRMHQSGHPGGEPSGSNGGSS FT DGGRGNGGGGNGGGYDGDNPEYSDGIQNDLPRIPQRAINVGQLLGNLTKLY FT EKADKYSGQDDNFTHKYSIFINNCQRIDLPEYLLPKAFPTMLVEVALDFYH FT TNLGNIPDTIEEICRMMETTFEGPEHRRNLMRQWEGITLKSVMAEPDNAGK FT SMSHCLQSMVMKLRHLHKGLSPTMRTDELLHQKIVNACAPVSACSIACFRP FT AKDIPSLLEELRSAIMTWEQSHANETTTFYTDRKYQRGRGGYSQRSFGHND FT RRPFKPRTKKKCFVCRKEDCWSTRHSKEERDESRSKYKEQFKDRNNFDKSF FT EQFLVEYEGPNQDDDEDTDPDTDSEVAAYFAQFELQESPAQEPHSSFITHT FT GPINGRETMMNLNNKIFEHALTRRVRKERTTYISRYDAREFQGIIVDTAAA FT EWSTVGYEQYMAYKTTVENVSLDESRAGVANVVFGIGRAKSIGSFHLDLPP FT GTVEFHVMNAETPFLLSLKDMDKLGIYYNNIIDAVVVNKTGRSFPCVRRNK FT HAVLLWRINVQQFVQQTIDDNECYLTDIELRRLHRRFGHPSAHRLLRVLQR FT AGHSDVSQQEIDKLTKFCQYCQKHGKSPGRFKFSLKDDVDFNHSIIVDVMY FT IDTSPILHVIDEGTRFQAAQWLTNISAKHTFDTLRKCWMDTYTGPPDYIIH FT DAGKNFVSKEFIQYAATMSTKTKSVPIEAHWSIGAVERYHAVIKRAYAIIA FT SELESSTTKEMMLQMAVKAVNDTAGPDGLVPTLLVFGAYPRMADLDPPAPT FT VLQRQSAIRKAMNEINKLRATRQVSDALSQRNGPSTMALYDLPINSKVWVW FT REHKGWTGPHHLLGIRDQICTVKLSSGPTEFRTTQVKPHVQEEQDRPQGEP FT EHIEPVQQEDGGTILPDNPDDVTSQEGREGMQPPPSTEEPPSNEPNGPQAR FT SLLQGRIPPIRLLRAHANAQHFFSSESEEHETPPSLALYLPDEDEIDDIEY FT TECYAASAVPPTSTIFVESRRKEVNGLMENGVFRLVKLVDVPENTRIFNSR FT FVDEIKNEGTDKAFPKSRLVIQAWKDQGKSAVLTQSPTIQRVSQRIMLSVM FT AMLPKNEVTMSIRDITQAYTQSASNLVRKFYARPPAEMDVPEGFILQVVRP FT 
LYGVPEAGNHWFSTYHKHHTDALKMVQSTYDPCLLHTTKDDTAFGIVGLQT FT DDTLIVANKEFADREQTELKNAKFMAKERETLTSSTPLKFNGGLITQHENG FT SIYLNQQKQCENLRLVKTEPSDMHGTRGKLRKSATAKDQYVAQRARGAYVA FT TMCQPEAAFDLSFAAQTVNPKEEDANTLNKRIQWQIDNSERGLQFVPLHMD FT SLKLVAFTDASFANNKDLSSQIGYIIVLMDKYDKANIIHWSSIKCKRVTRS FT VLASELYGMVLGFDISAAIKGTVDKIFPARKIPLVICIDSKSLYDCLTKLG FT TTNEKRLMIDVMAIREAYEKREIVEIKWIDGESNPADAMTKSKPCQALKDL FT VDNNTITIKVTEWVDRD" XX SQ Sequence 6059 BP; 1927 A; 1461 C; 1365 G; 1306 T; 0 other; ctgatcagtg atatcacaga tcaggagcct ggcaccttaa tttcaagcgc aacgtgtgtt 60 tgtgactaag gttggtattt ctttgaagag acacaagctc ttgtatcaat ccatcgtggt 120 ttgatacctg cccttgtgtg aataagaacc gacggcaacg tggcggcagt tcttacgcaa 180 gtattaatcc aacgtggttt gatacctgca ccatcgagag aagtcggcaa cgtggcgacg 240 actctcacac taactttagt ctacattcga ctcttctttg caacgtgtat tgaagaagta 300 gaaaccagta gcaccagctc gagctcagca acatcataca caatataact tcgcaaactt 360 ttcgcgaagt tgtatcccaa ctacactcat accatcgacc tacctcatca tcacccgcct 420 gctatccaca aaatggcagg agttttccca gaatcccccc agacggaaag atatgtccct 480 gaaaatcctg cttcaagtgc tggatcgaaa atggcatcag tcattgcaac tgaagaatat 540 gaggagttat cacggagatg gacagaccac cccattgaca tacaaagtgc aaatgcgaaa 600 gagatcagag attttgtcaa atacaaggct tatgaatacc agcgtcagaa atactacgat 660 caagagctgt gggcggacta caatcttgca ttcgatggat ttacaccagc gcatttttca 720 caagttagca tgcgcatcct gtataagctc aggcacgaat tacgctcgtg cggagtgtac 780 gtcgataaac cagaagaatg tacaacccga ccagtttatg aagctttggt tgatacacat 840 cgatcacctt atagagcatg gacacctaag gagattattg aagcgatttc cgaaaaggaa 900 tttgtgtttc ggtcgagaag catacaacag agaatcgtga ttgaaaaact acgtgcacca 960 ctcggagacc ccgagatgtc tcgagtgaag gacctgcacg ccggtccttc agggttcaat 1020 acatacgttc taccacaatt accttccata tcacagcgat tggaaagagc tgataaaggt 1080 aaaggtgttg atattgaaca caaggtatcg atagaaccac agccaaagac acctccacaa 1140 ccaaagacac ccccgcaact cccaagcttt gcaccaggaa gcggatcaat tgcacgtaaa 1200 tactacagtc ctaggaaacc accaccactt ccgacatttc aaggattcac cggcccacca 1260 ccaggaccac cccgaatcaa ccttgaacaa ccctcgggag catcaccttt 
ttaccacaat 1320 aatcaaccat cacaacagtc acctcaacaa tgggaacagc accatgatca acataacccg 1380 aatcaacagt atatgcatca gcgtatgcat caaagcggac atccgggcgg agaaccgagc 1440 ggcagcaatg gcggttccag tgatggcggc agaggtaacg gtggcggagg gaacggagga 1500 ggatacgacg gagataaccc cgaatatagt gatggcattc agaacgattt acctagaata 1560 ccacaaagag ccatcaacgt tggacaatta ttgggaaatc tcacgaaact ctacgagaaa 1620 gccgacaagt acagtgggca ggacgacaat ttcactcaca agtactcgat attcatcaat 1680 aactgccaac gaatcgactt acctgaatac ctactgccga aagcattccc tacgatgctc 1740 gtcgaagttg cactcgactt ctaccataca aatttaggta atatacctga cacgatagaa 1800 gagatctgtc gaatgatgga gactacattc gaaggcccag agcacagacg caatcttatg 1860 cggcaatggg agggcattac actcaagagc gttatggcag aaccagataa tgctggcaag 1920 tcgatgagtc actgtctgca atcgatggtg atgaagcttc gtcatctgca caagggactc 1980 agcccgacga tgagaactga tgagctattg caccagaaga tcgtcaatgc atgtgcacca 2040 gtatcagcat gttccattgc ttgctttcgc cctgcgaagg acataccaag tcttcttgaa 2100 gagctacgat cagctataat gacatgggag caatctcatg caaacgaaac gacgacattc 2160 tacaccgatc gtaaatatca acgcggtcga ggtgggtaca gtcagagaag tttcgggcac 2220 aacgatcgtc gcccattcaa accacgaacg aagaagaagt gtttcgtctg cagaaaggag 2280 gactgttggt cgactcggca cagcaaggag gagcgcgacg aatcaagaag caagtacaag 2340 gagcagttca aggatagaaa caactttgac aagagctttg agcaattcct tgtcgaatac 2400 gaaggaccta accaagacga cgacgaagac accgatccgg ataccgacag cgaagtagca 2460 gcctactttg cgcaattcga actacaagaa tcacctgcac aagaaccaca ctcaagtttc 2520 atcacccata ctggacctat aaatggcaga gagactatga tgaacctaaa caacaagatc 2580 tttgagcatg ctttgacacg tcgagttcgt aaagagagaa cgacctacat cagcaggtac 2640 gacgcaaggg aattccaagg catcattgtc gacacagccg ctgcagaatg gtctacggtt 2700 ggctatgagc aatacatggc atataagacg accgttgaaa atgtatcact ggacgaaagc 2760 agagcgggag tcgccaatgt ggtgttcgga atcggcagag ctaaatcaat aggctcattc 2820 catctcgatt taccaccggg aactgtcgaa ttccatgtaa tgaatgcgga gacaccattc 2880 ttattatcac tcaaagatat ggacaaactc ggcatatact acaacaatat catcgatgca 2940 gtagtggtga acaagacagg gaggagcttc ccatgcgttc gccgaaacaa gcatgcagtg 
3000 cttctgtggc gcataaatgt gcaacaattc gtgcaacaaa ccattgatga taatgagtgc 3060 tatcttactg atatagaatt acgtcgtctg caccgtcgct ttggtcatcc atcagcccac 3120 agactcttac gagtgctgca acgagccggc catagcgatg tctcgcaaca agagatcgac 3180 aaacttacca agttctgcca atactgccag aaacacggta aatcacccgg tcgattcaag 3240 ttcagtctaa aggatgatgt cgatttcaac cattccatca tagtggatgt gatgtacatc 3300 gacactagtc cgattctaca cgttatcgac gaaggtacac gatttcaagc agcacagtgg 3360 ttaacgaaca tcagtgcgaa gcacacattt gatacactta ggaagtgctg gatggacacc 3420 tacaccggtc caccggatta cattatacat gatgctggaa agaatttcgt cagcaaggag 3480 tttatccagt atgcggcgac catgtcgacg aagacaaaaa gtgtcccaat cgaagcccat 3540 tggtcaatcg gtgcagtcga acgataccat gcagtcatca agcgcgcgta cgccatcatc 3600 gctagcgagc tggagagttc aacaacgaag gagatgatgc tacaaatggc tgtgaaggca 3660 gtaaacgaca ctgcaggacc agatggattg gttccaacac tcctagtttt tggagcatat 3720 cctcgaatgg ccgatcttga cccaccagca cctacagttc tgcaacgaca atctgcgatc 3780 agaaaggcga tgaacgagat caataaacta cgggctacac gacaagtttc agatgctcta 3840 agccaacgca atggaccatc cacaatggct ctgtacgacc tacctatcaa ctcgaaggtc 3900 tgggtatgga gagaacataa gggttggacg ggaccacacc atttactagg cattagggat 3960 cagatatgta ctgtgaagct gtcaagtgga ccaacagagt ttcgtaccac tcaggttaag 4020 cctcatgtcc aagaagagca ggatcgaccc cagggagaac ctgaacatat cgaacctgtg 4080 caacaagagg atggagggac tattttacca gacaaccctg atgatgtaac ttcgcaagaa 4140 ggtcgcgaag gtatgcagcc tcctcctagc actgaagaac ccccttccaa tgaacctaat 4200 gggccacagg ctcgcagtct tttacagggc agaattccac caatacgatt actacgagcc 4260 catgcgaatg cacaacattt cttctctagt gagagcgaag aacacgaaac accaccgagt 4320 ttagcactct accttccgga cgaagatgaa attgatgaca tcgaatacac tgaatgctat 4380 gcagcatctg ccgtaccccc aacatctaca attttcgttg aatcaagacg aaaggaggtc 4440 aatggactaa tggagaacgg cgtatttcga ctcgtcaaac tcgtggacgt acccgagaac 4500 acaagaatat tcaactcacg attcgttgat gagattaaga atgaaggaac cgacaaggca 4560 ttcccaaagt ctcgactagt catacaagca tggaaggatc aagggaagag cgctgttttg 4620 acgcaatcac ctaccataca acgcgttagc caacgtatca tgctatccgt gatggcaatg 4680 
ctacccaaga acgaggtaac catgagcatt cgcgacatca cccaagcata tacacaatcg 4740 gcatctaatt tagttcgaaa attttatgca cgaccaccag cagagatgga cgtccctgag 4800 ggctttatcc tgcaagttgt acgaccatta tatggcgtcc cggaagcggg caatcattgg 4860 ttcagcacct accacaagca ccatactgac gctctcaaaa tggtacaatc gacatatgac 4920 ccatgccttt tgcacacaac caaggacgat acagcatttg gcattgtagg actccaaacg 4980 gacgatactc tcattgttgc caataaggag ttcgcagatc gcgagcaaac tgagcttaag 5040 aatgcgaagt ttatggcaaa agaacgtgaa acactgacat caagtacacc actcaagttc 5100 aacggaggtc taatcactca gcacgagaat ggatcaatct accttaatca acagaagcaa 5160 tgcgagaact tacgacttgt caaaacagaa ccttctgata tgcatggaac ccgtggaaaa 5220 ctgagaaagt cagcaacggc aaaggatcag tatgtcgcac aacgtgcccg tggagcatat 5280 gtcgcaacca tgtgtcaacc agaagcagct ttcgatcttt cctttgctgc acaaacagtt 5340 aacccaaagg aagaggatgc taacaccctt aataagcgca ttcaatggca gatcgacaat 5400 tcagaaagag gattgcaatt cgtaccactc catatggatt ctctcaaact ggttgcattc 5460 actgatgctt cttttgctaa caacaaagat ctgtcctccc agataggata catcattgta 5520 ttaatggaca agtacgacaa agctaacatc atccattggt catcaatcaa gtgcaagaga 5580 gttaccagaa gcgtattggc ttcagaactc tacggaatgg tcttaggatt tgatatcagt 5640 gcagcaatca aaggcacagt cgacaagatc ttcccagcac gcaagatacc attagtcata 5700 tgcatcgatt ctaagtcact ctatgactgt ctcacgaagc ttggtaccac taatgaaaag 5760 agactaatga ttgacgttat ggcgattcga gaagcatatg agaaacgaga gattgtggag 5820 atcaagtgga ttgatggaga atctaaccct gcagacgcga tgactaagtc aaagccgtgt 5880 caagcactca aggacctggt cgacaacaac accatcacga tcaaggtcac agagtgggtt 5940 gacagagact agaaggcgtg gcagggggag tgtaagacta ggaaggaagg tggttacaca 6000 aaggaattta aggaattagc tagaaggaca cgttaccagc gactttagag agttgccag 6059 // ID Gypsy-1_LBS-I repbase; DNA; FNG; 6363 BP. XX AC ABFE01000018; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_LBS_; KW Gypsy-1_LBS-LTR; Gypsy-1_LBS-I. 
XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-6363 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000018; Positions 58657 52295. XX CC Positions [5136-5654] - Integrase core CC 'GCGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 487..1740 FT /product="Gypsy-1_LBS-I_1p" FT /translation="MSGPLIAVPEFGGEPEGSIQSTDFLKKFRRFMMYLHI FT TEDERMVESFGDHLKSSSPAEEWFKEQDASKKAFKVVEAAFLERFPPVAKA FT KKTETELERDLCELRLKVEELGKMEKYQGEDVWSHVVFAEKALSLAKQAKI FT STGTNSIWKVRDELPEIIRQKVKETYANWTEFCSAIKEVEMSHIRDGVKKY FT QKEKEEKEKVDAAIAGLHRAQQQQRRPPQPTVPLSPMSNASNALQSMTIRN FT TRSTPAATTNTTQPAAATNANPFTSSTGGRGNLFPPVTNTDRDTLRNSLTF FT YPLQPNTQEGNRVWIEQAREWMAKHGNGPVTIATGFPLRPGGAPPGSGECY FT GCGYTGHHRNSGQCTTELINHRERTFRTICGRILRSASAAQVNLVDDAVDN FT FQSWLGDSALTSTMNQGNGEGPPA" FT CDS 1554..6362 FT /product="Gypsy-1_LBS-I_2p" FT /translation="MHHRAYQPSRTHFPNHLWANPALCISGPGQSRGRRSR FT QFSKLARRQCANIYNESGKWGRAACINGERAAQISVEPTEIDRPNSSDNYT FT VVHSHANSSVIDLYSVGDDDTTWGSLKKKEVPFRHWLCLHGPRGEVVRVNA FT LFDGGAMVGAMCSLFFEKIKHRLDGQAKPSNRLLRVANGTIVQSQAVWKGM FT LELNGIRAEGEFEVFNSGGGWTFLFGKPLLRRFQAVHDYHTDTVSIRSGDK FT STVLHSNTIVAPAGVSLKSDVEGRRNLVGGSSSANPPSRQVLHLDTCDPLV FT RNDKSVFAADVMEFTMEEVTEENMEGDDVAWIEECVVETGEQSEGKQEEEH FT SMDQGGGDIPPSREVQDYIPASEEAKMADDPHTIVSEYAVDVEEDQKSVEV FT PVVCQIELPELEHEDLSGGSAEPPSRGVPTHPADHQKATLADAPCLVLPVT FT NPTEASPDEAIFTRHTDPFLQTRVSKILDLVQIGEDITETQCNEVKALISE FT FADCFALSLSEVNLIPNAVHKLNIPEDATFRTKIPQRSFNPDQKVFMAAKV FT QEMLKGGIIRPMHPGEVRCVAPSVLAQKAHDNTGLSQDELKHKVNDECVKH FT GLPMAFDLPPRPPPADNTPTPTSPKKWRLCQDFGEINKVTTIAPVPQGDIR FT AKQLRLSGHRYVHIFDFAAGFYGIAIHPDSQPYITFYLEGHGYFAYERMPF FT GVTGGPSEFGHTVGQRMHDLITDGTCENFVDDGGSAADTFEEGISKLRRIL FT ERVRREKLSLSPGKLQVFMTEAVFAGARVGPDGVSPDSSKLTAVVNWKIPE FT 
DASHLEGFLGLTAYFRDLVKGYAALEKPLRDLLRAVDIPNGTKKAAYQRIM FT KAYKLQQHWKDEHTATFINLKARLVSEPVLTAPRFDGTHFILTTDACKDAF FT AGVLSQKIATTLPGGKVVTRLHPIGFASKRTSSSEEKYKPFLLEFAALKYS FT FDKFSDIVYGYPVEVETDCQALRDILLSDKLSATHARWRDGVLAHNIVDVR FT HIPGKVNIADGVSRQYEGTDKTPGDGSEWTVTPDWEETTGLVHDLYHVADL FT PDLTVLKERFKGEPLFLDVIDAIVGLSSNNATVRDKKRAQHRKTQYMLEDG FT KLWFVGGGSGTRARARRECVSRVEAVQLARLEHEQGGHWHRDAMKLALLDR FT YYSPKLDESIIKAIMDCARCKNFGGTHLHSLLQPIVRRHPFELLVGDYLSL FT PVGKGGYHTAGIYLDTCSQHVWGYKFKTHGSATTTNRSLDDIFHNFAPPDT FT FMADGGKHFKNREVEENCERWGTKLHTVAAYSPWVNGLVEGTNKLLLYVLA FT RLCAPEIGEDGWQAVTWDKLPATWPDHFDKAIRILNWRILPALKFCPKEIL FT LGLVVNTSKTPIEVSSSFLPPSDIDIHMTYTAQQRLDGYAEAVQHAVRRKA FT AFDRKVTKSRAGVIEFKKGELVQVYNNKLALTLSTERKITPMWSPPRRVVE FT RLLNSYKLETLEGTPLEGLFNARRLRGFTPREGTELAVEQKEFEEKLASEE FT EGSGELSGAQEEASAAVEGGGLADEELEDQDLELEDSELEVPDEEGVGDPG FT FFYNGEEQEEQEDEDSGIGARVAARRRGRLHNGGGQ" XX SQ Sequence 6363 BP; 1684 A; 1613 C; 1774 G; 1292 T; 0 other; cctggtgatg agagcgtgat tcgcacttgt acacgcgcgc ttctttcgtc tatatacgtc 60 catccgtcta tcaccgtctt aagtccaccg tcccacgtca acgtcatacc gtccaacgtc 120 accgcatctg acgcccgacg gcctacaatc tccgaaaagc agctcgtgtc tccagcgtgt 180 tctaaacgtg caccccacac cttgtctaca gcactcataa gcaacctggc gcccaacaac 240 tcttggaagg accagccacg tcagaaaggg aagttcgctc ctagagatag agcagctacc 300 atcacaccat cgtcaaccac accgtctaca ggcacatcca caccatcaac agaaagctcc 360 cttacactct cgaacccaga cagcctctct ccgtttgagg accttttccg gccgccatca 420 gcttctcagc aaccgtcgcc gcaacagcct cctcgtccca gctcttgtca tcccccgaag 480 cctttcatgt caggtccact cattgcagtg cctgaatttg gaggtgagcc ggaaggctct 540 atccaatcta ctgatttctt aaagaaattc cgtcgtttca tgatgtacct gcatatcacc 600 gaggacgaac gcatggtgga gagcttcggc gaccacctca agtccagctc cccagcagag 660 gagtggttca aggagcaaga tgcctcaaag aaggcgttca aagtcgtcga ggcggcgttt 720 ttagagcgct ttccgccagt ggcgaaggca aagaagacgg agacggaatt ggagagggac 780 ttatgtgagc tgcggcttaa ggtggaggag ttggggaaga tggagaagta ccagggggag 840 gatgtctggt cccacgtcgt cttcgccgaa aaggcgctca gtctcgcaaa gcaagctaaa 900 atcagcacgg gcaccaactc 
catctggaaa gtccgcgacg aattgccgga aatcatacgc 960 cagaaagtga aggagacgta cgcgaactgg accgagtttt gcagtgcgat aaaggaggtg 1020 gagatgagcc atattaggga cggggtgaag aagtatcaga aggagaagga ggagaaggag 1080 aaagttgatg ccgcgatagc aggcttacac cgcgcacagc agcagcaacg ccgtccacca 1140 cagcccaccg ttcccttatc gccaatgtcg aacgctagca atgcattgca atcaatgact 1200 attcgaaaca cacgaagcac cccagctgca accaccaata ccacccagcc tgcagccgcc 1260 accaatgcca accccttcac aagctcaaca gggggccgag gcaatctctt tccgccagta 1320 accaacactg accgagacac actccgaaac agcctcacgt tttaccctct gcagcccaac 1380 acgcaggagg gtaacagggt gtggatagag caggctcgtg agtggatggc gaagcatggg 1440 aacggtcctg tcaccattgc tacaggattt ccgctgcgtc caggaggagc gcccccaggc 1500 tcaggagaat gttacggctg tggttataca ggccaccacc gtaacagtgg gcaatgcacc 1560 acagagctta tcaaccatcg cgaacgcact ttccgaacca tctgtgggcg aatcctgcgc 1620 tctgcatcag cggcccaggt caatctcgtg gacgacgcag tcgacaattt tcaaagctgg 1680 ctcggagaca gtgcgctaac atctacaatg aatcagggaa atggggaagg gccgcctgca 1740 taaacggcga gcgggcggcc cagatttcgg tagagcccac agagatcgac cgccccaaca 1800 gctcggacaa ttacactgtt gtacatagtc atgcgaattc atctgtgatt gacttgtatt 1860 ctgttgggga tgatgatact acatggggat cactcaaaaa gaaggaggtg ccgttcaggc 1920 attggttgtg cctgcacggg cctcggggcg aggtagtaag ggtgaatgcg ttatttgatg 1980 ggggagcaat ggtcggcgcg atgtgttcgc tgttctttga gaagatcaag catcgccttg 2040 acggccaagc caaaccctcg aaccgactcc tgcgagtagc gaatgggacg attgtccagt 2100 cacaagcagt atggaaaggc atgttagagt tgaacggaat acgcgcagaa ggggaattcg 2160 aggtcttcaa cagtggaggc ggctggacgt tcctatttgg gaaaccactc ttgcggcgtt 2220 tccaggcggt gcatgactac cacaccgaca cggtctccat tcgatctggc gacaagtcaa 2280 cagttctcca cagcaacact atagtggcac cagcaggggt gagcctcaaa agtgacgtgg 2340 aggggcgaag aaacttggta gggggctctt cgagcgcgaa tcccccttcg aggcaagttt 2400 tgcatctgga tacatgtgat ccattagttc ggaatgacaa gtccgttttt gctgcagatg 2460 ttatggaatt taccatggaa gaggtgaccg aggaaaacat ggaaggtgat gacgttgcgt 2520 ggatcgagga atgtgtggta gagacaggag agcagagtga agggaaacag gaagaggagc 2580 atagtatgga ccaggggggt ggtgacatac 
ccccctcgag ggaagtacaa gattatatcc 2640 ccgcttcaga ggaagcaaag atggctgacg accctcacac tatagtgtct gagtatgcag 2700 ttgatgttga ggaagatcag aagtccgtgg aagtgcctgt tgtatgccaa atcgaactgc 2760 ctgagctgga gcatgaagat ctaagtgggg ggagtgcaga acccccctcg aggggagtac 2820 ccacacaccc cgctgatcat cagaaggcca cattagctga cgcgccctgc cttgttttgc 2880 cagtcacaaa ccctacggaa gcttcacctg atgaagcaat ctttacacgt cacacggacc 2940 cgtttctgca aacgcgtgtc tcaaagatat tagacctggt acaaataggc gaggacatca 3000 ctgaaaccca atgcaacgag gtcaaggcgc tcatttcgga attcgcggac tgctttgcgt 3060 tgtcgctcag tgaagtgaac ctcataccga atgcagtcca caagctcaat atacccgagg 3120 atgccacatt tcgtacaaaa attccccagc gatccttcaa cccagaccaa aaagtgttta 3180 tggcagcgaa agtccaggag atgttgaaag gaggtataat tcggccaatg caccctggag 3240 aagttcgttg cgtggcaccg tctgtcttag ctcagaaggc acacgataac acgggcctgt 3300 cacaagatga gttgaagcac aaggtcaatg atgagtgtgt gaagcatggt ctacccatgg 3360 cgttcgacct tccgcctcgt ccaccacctg ctgataacac gcctacaccc acgtcaccca 3420 aaaaatggcg cttatgccag gatttcggcg aaatcaataa agtcacgaca atcgctccag 3480 taccccaagg ggatatccgt gcaaagcagt tgcgcttatc tggccacaga tacgtccaca 3540 tcttcgactt tgcagctgga ttctacggca ttgcgattca cccggactct caaccatata 3600 ttacgttcta cttggaagga catgggtatt tcgcatatga aagaatgccc ttcggcgtta 3660 cagggggccc ctcagagttt gggcatacag tgggccaacg catgcatgac ctcatcaccg 3720 atggcacttg cgaaaacttt gtcgacgacg gcggatcggc agcggatacc tttgaggaag 3780 gaatttcgaa gttgcggcgc atactggagc gtgttcgtag ggagaagctg tccctatcac 3840 cagggaaact gcaggtcttc atgacggaag ccgttttcgc aggcgcaagg gtcggaccag 3900 acggtgttag tccagactcc agcaagctca cggcagtggt taactggaag attcctgaag 3960 acgcatcaca tttggaagga ttcttaggac tcacagctta tttcagggat ctggtgaaag 4020 gctatgcggc acttgagaag cctctccgtg atcttttgcg tgcggtagac atccccaacg 4080 gcaccaagaa agcggcatac cagcggatca tgaaggctta taagcttcag cagcactgga 4140 aggacgaaca cacggcgaca ttcataaacc tgaaggcacg gttagtctca gagccagtcc 4200 tcacggctcc tcgttttgat gggacacatt tcattctgac gacggatgcg tgcaaagatg 4260 cgtttgcagg ggttctttca cagaagattg caacaacatt 
acctggaggg aaggtggtca 4320 ctcgactgca ccctatcggc ttcgcatcta aacgaacatc gtcatcagaa gagaaataca 4380 aacccttctt gctcgagttc gcggcgctaa aatattcctt tgacaagttc tcagatattg 4440 tatatggcta ccccgtggaa gtggaaacgg actgccaggc tttgcgcgat atcctgctca 4500 gtgataagct aagcgcgaca catgcacgct ggagggatgg agtactagct cacaacatag 4560 ttgatgtcag gcatataccg gggaaagtga atattgcaga tggtgtcagc aggcaatacg 4620 agggtacgga caaaacccca ggcgatggca gtgaatggac ggtcacccct gattgggagg 4680 aaacgacagg actggtgcat gacctgtacc acgtggcaga cttgccggac ctaacagtcc 4740 tgaaggagcg gttcaagggc gaaccgctgt tcctcgatgt catcgacgcc atcgtgggct 4800 tatcgtctaa caatgcgaca gtcagggaca agaaaagagc acaacaccgg aaaacccagt 4860 atatgctcga agatgggaag ctttggtttg tgggcggcgg cagtggtaca cgggcaagag 4920 ctaggcgtga gtgcgtatcg cgagtggagg cagtccagct ggcgaggcta gagcatgagc 4980 aagggggaca ctggcatcgg gatgccatga aactcgcgct cctggatcga tactacagtc 5040 ccaagttaga tgagtcgatc ataaaggcga tcatggactg cgcacgatgc aagaattttg 5100 gaggtacgca cctccactcc cttttgcaac cgatcgtcag gcgtcaccca ttcgaattac 5160 ttgtgggtga ctatctctcc cttcccgtgg ggaaaggtgg ataccacaca gctggaatct 5220 acttggacac ctgctctcag cacgtgtggg ggtacaaatt caagactcac ggaagcgcaa 5280 caacaaccaa caggtctcta gacgatatct tccacaactt cgcaccaccg gacaccttca 5340 tggccgacgg gggcaaacat ttcaagaacc gagaagtgga ggagaattgt gagcgatggg 5400 gtactaagct gcacacagtt gcagcctact caccgtgggt caatgggctt gttgaaggta 5460 ctaacaaact cctcttgtac gttcttgcaa ggttgtgtgc tccagagatc ggcgaggatg 5520 gctggcaagc ggttacgtgg gacaaactgc ctgcaacgtg gcctgaccac ttcgacaaag 5580 ctattcgcat tctcaactgg cgtattttgc cagctctcaa gttctgcccc aaggaaatct 5640 tactggggct ggtggtgaac acatcaaaaa cgcctataga agtcagcagt tcatttctgc 5700 cgccatcgga cattgatata cacatgactt acacggcaca acaacgtctt gacggatatg 5760 cggaggcggt ccagcacgct gtacgacgga aggccgcatt cgatcgcaag gtcaccaagt 5820 caagggctgg ggtcattgaa ttcaagaaag gggagctggt acaggtatac aacaataagc 5880 tagcgctaac cctaagcacg gaacgcaaaa taacacctat gtggtcgccc ccgcgtcgcg 5940 ttgtggaacg cctcttgaac tcgtacaaac tggagacatt ggagggcacg 
ccattggaag 6000 ggctgttcaa cgcaaggcgt cttcgaggtt tcacgccgag ggaaggaacg gagctggccg 6060 tggagcaaaa ggagtttgag gaaaagctcg catcagagga ggaaggttca ggagagttga 6120 gtggggcaca agaggaggca tcagcagcag tagaaggagg aggtctagcc gatgaagagt 6180 tggaggatca ggacttggag cttgaggatt cggagttaga ggtcccagat gaagagggtg 6240 ttggtgatcc gggattcttc tacaatgggg aggagcagga agagcaggaa gatgaagaca 6300 gtggaatcgg tgctagagtg gcggcaagga ggcgaggacg cctccataat ggaggggggc 6360 aga 6363 // ID Copia-2_MVPL-I repbase; DNA; FNG; 4232 BP. XX AC AEIJ01000987; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Microbotryum violaceum genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_MVPL_; KW Copia-2_MVPL-LTR; Copia-2_MVPL-I. XX OS Microbotryum violaceum OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Microbotryales; Microbotryaceae; OC Microbotryum. XX RN [1] RP 1-4232 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Microbotryum violaceum genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AEIJ01000987; Positions 10945 6714. XX CC Positions [1657-2238] - Integrase core CC 'AACAA' target site duplication CC LTRs are 96% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 475..3603 FT /product="Copia-2_MVPL-I_1p" FT /translation="MSAQISALTTELYARKFSSGDRGQLLHHLTTIQGILQ FT DLEACGSGMPPAQQLTLMLDSVFDPQHESFTSSLRRPLALPSEIVSEARSY FT QFKVSLESRINGLSAMTSGRAMAPSPVPALVAQGESQKGARHKRRLKKGNR FT DGQGNRRPTDSTCHACGAKGHWSCDAVCPEKRSKSSAAPALVAMNSYSVYL FT AGSSSRAPDADFVWIADTGAGHHFVGDCLLLSDFKESPIQVQMADNSLGQA FT TGYGRMSVKASQGHLLNFKEMYYLPGSKYGLISMSSLRAGGAKLVYGHGGD FT TQQVWLDGSLVAETVKTKAKQPSYVFDFEIVSPPSPVLVAATRASVGASLM FT EWHWNFGHMAPSSILQLVKSKAIVGIRLLDKKIEDCEPCIIAKARAGSHKH FT FSQKPGHVLQRVSIDLGFVNDDDTNGRSIYTVIVDQLSTYKWAFPLGSKSA FT SEVLKVWRGFQAQVERQSGQKIKFVQSDNGGEFEVFFGKQPDVSHLRAFGS FT TAYVLVPKAIRKKLDDHTVKGTFLGHSGEYNYKVRVEHRRSFKIVVSSDVT FT FSDKAPSIADSAVEPRIYEVEPIQEHELVHQLPFPGLEQISALRPALVPEL FT GPAFRPAPAPLLRFQDMDDDDVAPPPLPAVGPLAEDAEVEEDPIDHRREEA FT VEAIDSDDDNGPAEHDQPLPPEDGYEYRPYRVGRNPGALENVDAGNILPTR FT LRAQRRADALRVVSTRSVYRHPPELIAMVSSCHTPLPKTFAEAIASSEAKH FT WMAASEKELGSFKLHKVFKLTRLPTGARALGYRWVFTRKEDAEGNIISYKA FT RLVIQGFAQRLGIDYNETFAPVSSITTILFLIAMSAAVGLVLEQFDYDSAF FT LNGIMEEDICMKVPEGWTGGSRPGHALKLLKSMYGTKQAPRQWNAALHKLM FT TDRGYTRSNVDACLYFKYHGKSFAIITLYVDDGLAASNDQAFLDSEISAFD FT AVYKLKRLGPVKTFRLFEPRISSSSTNRSTFEAYSNTILSKTSRRSRSRLP FT WRIASSRLLPHRSPTSLYINRRWELYNTLRNVFVPIS" XX SQ Sequence 4232 BP; 915 A; 1163 C; 1084 G; 1070 T; 0 other; ttgtttctgc gataggttat gagctcaacg ggtagcgtag gaggcccggg tcagaaaaac 60 gaatcccatt gcaccaccat cggtatggac accgattcac cgtccaccga aacgagtgat 120 cgtggattgc atgtcccaat ccttcgcgca tccaattaca tggattggcg ccgacgtctg 180 attggtcgtc tgggctccaa ggacttgcat ctttgcgcat cgaggccgtt gagcgacgct 240 gcgaaggccg ttcttcaggt cgcaatcaat gatcaatcgc gcggcttcta ctcgcctctc 300 gagcggctcg acttcgcagc tatggacgcg aagatcgcca aggagctaat gaaggaggtt 360 tgaactcaag ctgaaatcat tcagtttctc gatgaggcca atcttcgctt ggtcaggggt 420 tgttacacgt cttacaacat actcgacttg ctgaacacgt tctacctcgg aacgatgagc 480 gctcagattt ccgccttgac gactgagctc tatgcgcgca agttctcgtc tggcgatcgt 540 gggcagttac tgcatcactt gaccaccatc caaggaattt tgcaagatct tgaggcgtgt 600 ggctctggga 
tgccaccagc tcaacagttg acgctgatgc tcgattccgt cttcgacccg 660 caacacgagt cgttcacaag ctcactgcgt cgacctctgg ctctaccttc ggagattgtt 720 tccgaggcac ggtcttacca gttcaaagtc tcgctggaaa gtcgaatcaa tggactatct 780 gccatgactt ccggacgcgc catggctccg tcgcctgttc ctgcgctcgt tgcgcaaggc 840 gaatcacaaa agggcgcgcg tcacaaaaga cgtctcaaga aaggcaatcg tgacgggcaa 900 ggaaatcgcc gtcctaccga ttccacctgt catgcgtgtg gagcgaaggg acactggtct 960 tgcgatgcag tgtgcccgga aaaacgctcg aagtcatctg cagcccccgc attggttgca 1020 atgaactcct attccgttta ccttgctggt tcttcctcga gagctcccga tgccgacttc 1080 gtttggatcg cggataccgg agctggtcac cacttcgttg gcgattgttt gctgctctcg 1140 gatttcaaag aatccccaat tcaagtccag atggccgaca actcgcttgg tcaagccact 1200 ggatacgggc gaatgtctgt taaggcttcg caaggtcacc ttctcaactt caaggagatg 1260 tactacctcc ctggatcgaa gtatggtcta atctcaatgt cgtcgctccg cgctggaggc 1320 gcgaaattgg tttatggcca tgggggagat actcaacagg tttggcttga cggctctctc 1380 gtcgccgaaa ctgtcaagac gaaggccaaa caaccaagtt acgtcttcga ttttgagatt 1440 gtctctccac cctcgcctgt gcttgtcgca gctacccgtg cttccgttgg agcttcgctc 1500 atggaatggc attggaactt cggacacatg gcaccctctt caattcttca actcgtcaaa 1560 tccaaggcta ttgttggaat tcgtttactc gacaagaaga ttgaagactg tgagccgtgc 1620 attatcgcga aggcgcgcgc gggctctcac aaacatttca gtcaaaaacc cggacatgtt 1680 cttcaacggg tttcgatcga tctgggattc gtcaatgatg acgataccaa tggacgatcc 1740 atatatacgg ttatcgtgga ccaactttca acctataagt gggcttttcc acttggttcc 1800 aagtcggcat cagaagtcct caaggtctgg cgtggctttc aagcgcaggt cgagaggcag 1860 tccggtcaga agatcaagtt tgttcaatct gacaatggag gcgagttcga ggtcttcttt 1920 ggcaagcaac ccgacgtttc gcatcttcgc gcgttcggat ctactgccta cgttctcgtt 1980 cccaaggcga ttcgtaagaa gctcgatgac catactgtca agggtacttt ccttggccat 2040 tcgggggagt acaactacaa ggttcgggtt gaacaccgtc ggtctttcaa gatcgtcgtc 2100 tcttccgatg tcaccttctc cgacaaggct ccttcgatcg cggactcggc tgtcgagcct 2160 cgcatctacg aagttgaacc tatacaggaa cacgagttag tacatcaact gccgtttcct 2220 ggcttggagc agatttcggc tttacggcct gcgctcgtac ccgagcttgg tccggctttt 2280 cggccagctc ccgctcctct 
tcttcgtttc caagatatgg acgatgacga cgttgctcct 2340 cctcctttac cggctgtcgg cccgttggcc gaagacgccg aagttgaaga ggacccgatt 2400 gatcatcgac gcgaggaagc agttgaggct atcgacagtg acgatgacaa cggccctgcg 2460 gaacacgatc aaccattacc tcctgaggat ggttacgagt atcgacctta ccgtgtcggc 2520 cgcaaccccg gtgcgctcga gaacgtcgat gccggcaata ttttaccgac tcgcctgcgt 2580 gctcagcgtc gtgcagacgc ccttcgtgtc gtgtctactc gctcggtcta ccgtcatcca 2640 ccagagctca ttgctatggt ttcgtcttgt cacaccccgc ttcccaagac gtttgcggaa 2700 gcgatagcct cttctgaggc caaacactgg atggccgctt ctgagaagga gttgggctca 2760 ttcaagttgc acaaggtctt caagcttaca aggctaccca ccggggctcg cgccctgggc 2820 tatcgttggg tgttcactcg gaaggaagac gctgaaggca acatcatcag ctacaaggct 2880 cgtctcgtca tccaaggctt tgctcaacga cttgggattg actacaacga gacgtttgct 2940 cctgtctcgt cgatcacgac aatcttgttt ctcatcgcaa tgtctgccgc tgtcggactc 3000 gtcctcgaac agtttgacta tgattctgcg tttctcaacg ggatcatgga ggaggatatc 3060 tgcatgaagg ttcctgaggg atggactggt ggttctcggc ccggtcacgc tctcaagctc 3120 ctgaaatcca tgtatggcac caagcaagct cctcggcaat ggaacgcggc tctccacaag 3180 ctgatgacgg atcgcggcta tacgcgctca aatgtcgatg cttgtctgta tttcaagtat 3240 catggcaagt cgtttgcgat catcacattg tatgttgatg atgggcttgc tgcatcgaac 3300 gaccaggctt ttctggactc cgagatttcg gctttcgatg cagtctacaa gctcaagcga 3360 cttggcccgg tcaagacgtt ccgtttgttc gaaccaagga tttcatcttc gtccaccaat 3420 cgaagtacat tcgaggctta ctcgaacact atactttcga aaacaagtcg aagaagccgg 3480 tcgcgactcc catggaggat cgcgtcatct cgtcttctac cgcaccgttc tccgacatcc 3540 ttgtatatca atcggcggtg ggagctttac aatacgctgc gcaacgtgtt cgtcccgata 3600 tcgtgacttc cgtttgtgcg gtggcaaagt gcgtggctgc tccaaccgaa ggagactgga 3660 tctgcgtcaa gcgaatcttt cgctatcttt ctgacaccgt cgactatggg ctactctacc 3720 gcgtaggcgg ttctaccaag ttcgaggtgt acagtgacgc gtcgtttggt tgcgatcaca 3780 acaacggcag atctgtggga gcgtatgtcg ttatcatggc tggcgctgct atttcctgga 3840 agtcgaaaca acagacgatg gttgcttctt ctacagcaga gtctgagact ctggctgctt 3900 caaccgcagc aaaggaggct atcggtcttc gcaacctggc gtccaaactt cgcatcaatc 3960 aaggtcaatc tactgttctg cacaaggaca 
accaaccttg catcgatttg gcgaagaatc 4020 ctggagctcg aggtcgtact tgccattgga atgtacatca tttctattta cgcgaacgaa 4080 tcgaggttgg cgatatcgat ctccgctact gtccgaccaa tctgaacacg gccgacatct 4140 tgacaaaacc cctatccaag ctcaagttct ctacgcatcg cgaaggactc gggatggtct 4200 cgctggctac cttggcatgt gggagtgtcg tt 4232 // ID NHT1 repbase; DNA; FNG; 2198 BP. XX AC U78574; XX DT 08-FEB-1999 (Rel. 4.01, Created) DT 03-JUL-2010 (Rel. 15.08, Last updated, Version 2) XX DE DNA transposon NHT1. XX KW Mariner/Tc1; DNA transposon; Transposable Element; transposase; KW TIR; Fot1 transposon; NHT1. XX NM NHT1. XX OS Nectria haematococca OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Nectriaceae; OC Nectria; Nectria haematococca complex. XX RN [1] RP 1-2198 RA Enkerli J., Bhatt G. and Covert F.S.; RT "Nht1, a transposable element cloned from a dispensable RT chromosome in Nectria haematococca."; RL Mol. Plant Microbe Interact 10(6), 742-749 (1997). XX RN [2] RP 1-2198 RA Enkerli J., Bhatt G. and Covert F.S.; RT "NHT1."; RL Direct Submission to Genbank (15-NOV-1996)Botany, University of RL Georgia, Plant Sciences Building, Athens, GA 30602, USA. XX DR GenBank; U78574; Positions 1 2198. XX CC DNA transposon has 100 bp-long TIR and it codes for transposase CC similar to the transposase encoded by Fot1 transposon. 
XX SQ Sequence 2198 BP; 598 A; 514 C; 566 G; 520 T; 0 other; ccatccacaa cccctctttc ggcatacccc ctctttcggc aagcaaaata aaaaaagctt 60 caccaaaact caccaacttc acttatacca gaattggcca aagtaaatag ctacttagag 120 gacaatgcct ctatcgatta tagaaatgat tggagttaac gactttagct ctcttaggct 180 gtgcggggta cgcgatctat gcatgctgtg ctatggaaat agtgttggtt taatcatcaa 240 actcctcacc gctctgttcc gtgttttttg acctttgtct gcgacaacga caagaaaatg 300 cctaggtcga gacctcgatg gaaatataca gaggaaaaca tggcagaggc tatcttggat 360 gttactgatg atggcgtttc acctcctcaa gccgctcaga gacggggagt gcctcgaagc 420 actctagtcg acagacttaa cggccagaaa gccgtgaaag agcagattca gcctcgccga 480 cgtttgtcca agaatcaaga ggacagattg gctttctgga tcctccgtca ggaatctctg 540 ggctatgctc cgtcccacaa tcagatccgc gcttgcgtca cgggcttgtt gagacagcag 600 ggcgaacatc ccgagttagg acgcaattgg gtgactagat ttatcaagcg ccgcacagac 660 ctgacgacca agatgggtag acgccaagaa gccaaaaggt ctgactcttt cacgcctaag 720 gcagttcatt ggtactttga tatcagggag ggccagtatg gctggatcaa gcctgaaaac 780 accgtcaacg ttgacgaagg gggtattatg actggtttcg gtaagcatct atctgtatac 840 cttgtcagtg cttgaaattt gaccgactaa tcctttttat ttaggcttag atagcctggt 900 tgttggaagc gcggacccaa ggcggaaggc atttctcaag ggaccacaaa ctcgaaactg 960 gacttcattt atcgaagctg tcactgctga cggccgcgcc ttaatccctg gcataatctt 1020 caaggggaaa gaactgcaga aacagtggtt tgttgaggaa ttcaaggaga tagcagactg 1080 gtattacata acttcgccta acgggtggac tgacgaccac attggcgttg aatggcttga 1140 aagagtctat ctgccccaga caatgccggc cgacgactct gatgcgcgtc tgatcatatt 1200 agatggccat ggaagtcatg caacggtatg ttcttccttt tcctcagctc aaggtcacta 1260 ctaagtaagg gaaggatgaa tggatggcca cgtgcttttt gaataacgtt tattgttgct 1320 atctgccagc acactgctct catgggctcc agccgttgga caatggagta ttcaatgcct 1380 ccaaagccgc atatcgacga gagttggaga atttcgcttc actgactgat tccactccaa 1440 tggacaaggt caatttcatc agggcctacg ctaaggcgcg tcgagttgga atgactgaga 1500 agaacatact ctccggctgg agggttactg ggaattggcc gatctcacgt gccaaagcgc 1560 tgccacaccc agaaatccaa caagataggc caaatggcag cccaagagcg actcccgaac 1620 ccaggccgta ttttgactcg gacgatacac 
caaagacgag ccgtcaaatt cgtgatcttg 1680 ggctgaacaa aacaccaaag acacgaagac ggtataacgt aatagcaaag ggctttgagg 1740 ctcaacagca gacggtagcg gcgcatactg cgaggattgc tagcctagag gaggaattgg 1800 ctcgcctgaa gagagggaag aagaggaagg cggtgccgaa tcctaacaaa cgttttatga 1860 ctcttggtga gactctagct gctggggaag ccatatccga agaggagact caaaatatgc 1920 ctgttgcggt ggaatgtggc cgttcagggg agccggggtc agaatcggag gctgcgtcgg 1980 tcattgaagt cagagaggaa accatacccc atcaactaac cacgcggtca gggcggctta 2040 tcaaaagacc caggattcaa tagagagatg caactattat tcacaagacg caaattcttg 2100 cacgcgccta tcaatttggc caagttctgt gtaagtaaac atgttgattt ttggtggtgc 2160 tttttttttt tgcctggcga aagggtggtt gtgcatgg 2198 // ID Copia-2_SPDB-LTR repbase; DNA; FNG; 252 BP. XX AC ACOE01000236; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Spizellomyces punctatus genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_SPDB_; KW Copia-2_SPDB-I; Copia-2_SPDB-LTR. XX OS Spizellomyces punctatus OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; OC Spizellomycetales; Spizellomycetaceae; Spizellomyces. XX RN [1] RP 1-252 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Spizellomyces punctatus genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; ACOE01000236; Positions 11413 11664. XX SQ Sequence 252 BP; 73 A; 66 C; 39 G; 74 T; 0 other; tgttagcaat cacaggtagc agacctctaa gccactgaac tgctatgcca tgcaggcatg 60 tcttcactgc gccatgcgcc acatgcgcca tgcatgtcgt tagatttctc ttgcatcgct 120 agtagtatat taaagaatga cttcttaaca tacaatagac ttcttaacat ctatacaaga 180 gtagagttca ctctgttcct actctctatc actaatacat tccctatagg aatataccct 240 cactagctaa ca 252 // ID Coprina_Cc1 repbase; DNA; FNG; 3033 BP. XX AC . XX DT 29-APR-2007 (Rel. 12.04, Created) DT 17-MAY-2007 (Rel. 12.04, Last updated, Version 1) XX DE Coprina_Cc1 is a Penelope-like retroelement. 
XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-like element; reverse transcriptase; Coprina_Cc1. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-3033 RA Arkhipova I.R.; RT "Distribution and phylogeny of Penelope-like elements in RT eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX RN [2] RP 1-3033 RA Gladyshev E.A. and Arkhipova I.R.; RT "Telomere-associated endonuclease-deficient Penelope-like RT retroelements in diverse eukaryotes."; RL Proc Natl Acad Sci U S A 104(22), 9352-9357 (2007)in press. XX DR [2] (Consensus) XX CC Coprina_Cc1 is a Penelope-like retroelement from the inky cap CC mushroom, Coprinus cinereus (aka Coprinopsis cinerea). Its single CC ORF contains homology to reverse transcriptases. No associated CC endonuclease has been found. Most copies are associated with CC telomeres and are 5' truncated by addition of reverse-complement CC C. cinereus telomeric repeats, (TAACCC)n. 
XX FH Key Location/Qualifiers FT CDS 127..2549 FT /product="Coprina_Cc1_1p" FT /translation="TTPSLTAAPWQVEQTSEPPRKRRKTTEDVQRRPEQAE FT EGRQGKGKGQLAVGVLPSDFRYDNPSSYPDWLLTVPYPVAISTILLNTPAR FT ILEAAQFRNYVHLSPGVNVPLEYQYALSVGLRYMFPTPRNELLIRDAWKDF FT ERRIRWRLFFTFSNEDNSLFDPDYEVPKQKSSAPPRLPAYLEHGLQRGELF FT VNQTIAKIRRLPVEAPTYKSLKPNRDDLFEFLTSNNYVITGTDKNLGIAVS FT ERTWIDERCKEILDVRSDYREIHIIQLNQICNEQCRQMELIAQLATATHPN FT GKQLGEFFRSKITEKCSDTSGREYGDHTVPIFYGIPKIHKVPTKMRPIIPC FT HSAIQNPAAKFVSKNLKFLIKESPTILHGSKDLAQKLSNVKLKPGRRWFFI FT SGDVVAYYPNIPREDCLREVFKMWSDARFGRTLDDNDEGTAEHYEFSLMYN FT ALMTGNKKLVFRYGSKYYEQIRGLAMGVADSPDLANLWGVRSEISCGVLTN FT PLIEFYGRYIDDCLGVVYAHSEEEALSIAESIKIEGCTIEWTASEQYVHFL FT DMTVYRDSYSQLQWQPFRKAGNHQERIPWISAHPLDVKRGTFLGEMSRLAT FT LSSQYDTYREALHGLAALYTKRGYPSELVSKWLKDNASDKWEKRLSDRNHD FT STGTDGVLVLKSTFNTAWNYFSARELGDTILGYWKTYAEKAKKDQLGGIHW FT QQFSDNVGDFTDVPDELMSLFRTTKGLRYMPDVSKTNIWQRKVLVSRKRTR FT NLFDLTSLWKKQVLSKLEEDILMDVDSDSDQMSVDTPSDRSDGIDPNFFIN FT YTLGRT" XX SQ Sequence 3033 BP; 816 A; 812 C; 679 G; 726 T; 0 other; taaccctaac cctaacccta accctaaccc taaccctaac cctaacccta accctaaccc 60 taaccctaac cctaacccta accctaaccc taaccctaac cctaacccta accctaaccc 120 taacccacca ctccctcgct taccgccgcc ccgtggcaag tcgaacaaac ctcagaaccc 180 ccgaggaaaa ggagaaaaac gaccgaagac gtccaacgcc ggcccgagca agccgaagaa 240 gggcgccaag ggaaagggaa aggccaacta gccgtcggag tgcttccgtc ggactttcgg 300 tacgataatc cgtcgtctta cccagactgg ttgctgacag tcccctaccc agtagcgatt 360 tctaccattc ttctaaatac tcccgcgcgc attctcgagg ccgcccagtt tagaaattat 420 gtacatttat cgcctggtgt taacgtcccc cttgaatatc aatatgcttt atccgtcggt 480 ttacgttata tgttcccgac cccgcgtaat gagttgttaa ttcgtgacgc gtggaaagac 540 ttcgaacgtc gcattagatg gcgtctattt tttacgtttt ctaatgagga taactcgtta 600 ttcgatccag actacgaggt gccgaagcag aaatcctctg ccccacctcg tttacccgct 660 tatctcgaac atggactcca acgtggtgag ctcttcgtta accaaacgat agccaagatc 720 cgtcggttac cggttgaggc gccaacatac aaatctctca agcccaaccg agacgacctg 780 ttcgagttct taactagtaa caattacgtg ataactggta ccgataagaa 
ccttgggatt 840 gcggtgtccg agagaacctg gattgatgag cgttgcaagg agattctgga tgtgagatct 900 gattaccgtg agatccacat aattcaacta aaccagattt gtaacgagca atgtagacaa 960 atggagttaa ttgcgcagct cgcaactgca acccacccca acggcaaaca actcggtgag 1020 ttcttccgga gcaaaatcac cgaaaagtgt tccgacactt cgggtcgaga atatggagat 1080 cacactgtgc ccatcttcta cggtatccca aaaattcaca aggtgccaac gaagatgcgt 1140 ccgatcattc cctgtcacag tgccattcaa aaccctgccg ctaaatttgt ctcgaaaaat 1200 ctcaaattcc ttatcaagga atcgccgacc attctacatg gatccaaaga tctggcgcaa 1260 aaattgagta acgtcaaact taaacccggt cgccggtggt tctttatctc cggcgatgtc 1320 gttgcgtatt acccgaatat cccccgagag gactgtctac gagaagtctt caagatgtgg 1380 tctgatgctc gcttcgggcg cactctcgac gacaacgatg agggaacggc tgaacactac 1440 gaattcagtc tcatgtacaa tgccctcatg accgggaata agaaactcgt tttcaggtat 1500 ggtagtaagt attacgaaca aatccgcggc cttgcaatgg gcgtcgcgga ctcaccggac 1560 ctcgccaatc tctggggcgt ccgatccgag atctcgtgcg gagtacttac gaacccgttg 1620 atagaattct acgggagata catcgatgat tgcctaggcg ttgtttatgc gcattcggag 1680 gaggaagccc tatcaatcgc cgaatcaatc aaaatcgaag gctgtaccat cgaatggact 1740 gccagcgaac aatacgttca cttcttggat atgaccgtgt atagggactc gtattcccaa 1800 ctacaatggc aaccattccg gaaagctggc aatcatcagg agagaattcc atggatttcc 1860 gcccatcccc tcgacgtgaa aagagggacg ttcttgggcg agatgtcgag gttagccacg 1920 ttgagctcgc aatacgacac gtatcgcgag gcgttgcacg gcttggcagc cctgtacact 1980 aaacgcggtt acccttcgga acttgttagc aagtggctta aggataatgc ttccgacaag 2040 tgggaaaaac gattaagcga tcgtaaccac gattcgaccg gcacggatgg cgtgttggta 2100 cttaagtcca cgtttaacac cgcatggaac tacttctccg ctcgtgaatt aggagatacc 2160 atactcggat attggaaaac ttacgccgag aaggctaaga aagatcagct cggtggcata 2220 cactggcaac aattttccga taacgtcggg gactttactg acgtgccgga cgagcttatg 2280 tctctgttcc ggacgacgaa aggtcttcga tacatgccag acgttagtaa aacaaacatc 2340 tggcaacgca aagtattggt ttcacgaaaa cgtactcgca atctctttga cctgacaagt 2400 ctttggaaaa agcaggtact ctccaagttg gaagaagaca tcctcatgga cgtcgattcc 2460 gattctgatc agatgagcgt tgatacccca tcagatcgat cggacggaat cgatccaaac 
2520 ttctttatca actatactct gggtagaacg taacgtatac acgactcagg gggctttgtt 2580 gacatcccag tgatcggctt gcactaccga ttcccaggat attcggagac gagtactagt 2640 taccgttagt ggttcggact tcattcggcg gaggcacttc ggtgcgtttg tcggggaacc 2700 cagctttgcg gcggtgttct aggaagagtc cgtctaaatc ttggtgtcgg cccgatcaca 2760 aggggctctg tccccctgtt tgggcagcct ccgcccccat ttaaaactga acacgaaatg 2820 cgttaacgat atccagatat taattcacga gttcaataag ctttaaccta tcccggtagg 2880 tactacgatt gtctgtcgct tacagacggt attaaccatg ggtgtttagg aagcctatct 2940 aaccctaaac ctaactctaa ccctaaccgt acgtaccttt tgaacttaat cgttcaagct 3000 gatttgtcta accgatcaat ctctagctaa ccc 3033 // ID Gypsy2-I_AO repbase; DNA; FNG; 6419 BP. XX AC . XX DT 25-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE An internal portion of the Gypsy2_AO LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2_AO; KW Gypsy2-LTR_AO; Gypsy2-I_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-6419 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-6419 RA Kapitonov V.V. and Jurka J.; RT "Gypsy2_AO, a family of Gypsy LTR retrotransposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 3-3 (2006). XX DR [2] (Consensus) XX CC This is an internal portion of the Gypsy2_AO LTR retrotransposon. CC Its long terminal repeat is Gypsy2-LTR_AO. Two ORFs encode a CC 378-aa Gypsy2_AO-1p (zinc knuckle) and 1999-aa Gypsy2_AO-2p (RT, CC integrase and chromo domains) proteins. 
XX FH Key Location/Qualifiers FT CDS 32..1165 FT /product="Gypsy2-I_AO-1p" FT /translation="MVSTRSSSALPPVDPQGADGTATQSSATQGVSQREFP FT AAQNARTSDVPMEPIVEEDESETPERGPTSERQSEEPMLSLSAIARVAEVL FT KTIMQTSSADNGFGGLKLAAPTPFEGKTIREVQNYIADMEDHFAASASAQR FT NDKDKVLYATSYLGEKAKQAWRLKRESLGEPSWEEMKDFLYTFVDPEADRH FT RNVAYRLINEKQNGRNMREWCARSLEVWQAFPNNSLTAQTWIQTLDQEVQR FT EITRAARQPQTLGEAHDMAIQYWSMLHRERSLRDSEGSGRKRADGGNQGPA FT GFNRGRKRPRENETPATRPRGSRSSPGQPPFKARRWNDGTQAAHERKRDWQ FT KKEGRCFRCNEMGHRIWECPKAADGAPGKDASREQ" FT CDS 1169..4906 FT /product="Gypsy2-I_AO-2p" FT /translation="ELRPLELLVLRHPKEPTAFPILGTVELQLQESPSISA FT DIKPIGGWDHTVQTVKALLDSGADDNFISYRYLLQHGITVSKDHLQPPVRV FT RYADGTPAPCYGKLIVKSQVTDWENQTRGYEMEYYITDLQDPDYQIVLGRK FT WMRHADPDVVFSAGTWRYRKEAPKISIENPKRFAKTMRHTLTMMVMYRPDT FT DDGPPELPPRYRKWAHVFSEKEAAELPDEKAAHPIPLVEGGVPPYGPLYNL FT SQHELQVLREYLDKMLERGWIRHSTSAAGAPVLFVRKPDGSLRLCVDYRGL FT NAVTVKNRYPLPRIDELMDRLVGAKYFTKLDLRDAYHRIRIQKGDEWKTAF FT RTRYGHFEYTVMPFGLCNAPATFQAYINEAMKGILDDYCVVYLDDILIYSQ FT TEEEHVRHVGEVLSRLERANLYAKLSKCNFHQEEVTFLGFIVGRDGIRVDP FT ERVRTVGEWPALESFHDIQVFLGFTGYFRRFVHQYARITKPLSDMLKGMQK FT GRKTGPFILTTEAREAFDELKAAFSGPPILRHFDPALRIKLITDASGFAIA FT GILLQPVDGVEEPRRRSDWQPVAFYSRKLTEIEGRYPVHDQELLAIVECFK FT VWRHYLEGAPHAIRVQSDHESLQYFYSKKTVKLDKRQARWAELLAAYDFQI FT EYRPGHLNPADAPSRRKDYEDVHVQRNVGLLPTLQRKLRAVPDDRNDVGAP FT SVAESEISIVSSRGPELLALRTMAREVAREEVVCEATSHPLRDAILQAQQG FT DAFVGQIRGKLLPGEGKKESGHKSDVAETSGDLTGRWRVDGGLLYRGETIY FT VPPCSALRQEILRVHHDDPFAGHFGREKTLELIRRKFYWDGLRTDIENYVR FT DCPVCQKMKVPRKLPQGELASLPVPEGPWQDLTMDFVVGLPPSGRKGRVYD FT AILVVVDRYTKAARYLPTTGTVTAEELANMFLDEIVCKTGSPRSLVTDRGS FT LFTSAYWMQFCQGLRIKGRLSTAFHPQTDGQTERQNQTLEAYLRMYTNYQQ FT SDWADWLPTAEFAYNNAKNASTGYSPFMAWQGMEPAVPGLEAIGPTTSNAS FT VELRLKGLADIRTKLEENLKKATERQAEGYNKRHKATQLRVGQEVLLSTKN FT IKSWRPNKKLDLKYDGPFMITEAVGKQAYRLRLPKAYGHIHPVFHVSLLKP FT YHRRAGVEAGEAQPIVIDDQEEWEVEEVLAHRVYYRKLQYLVKWKGWPSYE FT NSWEPEENLKNAAETVAAYRKASEVPEAPRRSRRRA" XX SQ Sequence 6419 BP; 1655 A; 1766 C; 1845 G; 1153 T; 0 other; gttcgaatcg cttcaaccca 
acgacttcaa gatggtcagt acacgaagtt cgagcgcctt 60 accgccggtg gacccccagg gggccgacgg cacggccact cagtcatccg caacccaagg 120 agtctcccag agggagttcc cggcagccca aaatgctcga acctccgacg tcccaatgga 180 accgatcgtc gaagaggacg aaagcgaaac tcccgaaagg ggaccgacgt cggagcgcca 240 gagcgaagaa ccaatgttgt cgctcagcgc gattgcacgg gtggctgagg tgttgaaaac 300 aatcatgcag accagctccg cggacaacgg gttcggcggt ctcaagctgg ctgcacctac 360 ccctttcgaa ggcaaaacta tacgtgaagt acagaattac atcgcggata tggaagacca 420 cttcgccgcg agtgcgtcag cccagcggaa cgataaggac aaagtcctgt acgctaccag 480 ctacctcggc gaaaaggcca aacaggcgtg gcgcctgaaa agagaaagcc tcggggaacc 540 aagctgggaa gagatgaaag acttccttta tacgttcgtc gacccggagg cggaccgaca 600 tcggaacgtc gcgtatcgcc tcataaatga gaaacagaac ggaagaaaca tgcgcgaatg 660 gtgtgcccga agcttagaag tatggcaagc cttcccaaat aactctctca cggctcaaac 720 gtggatccaa acgctcgatc aggaggtcca gagggagatc actcgcgcgg ctcgccagcc 780 acaaaccctg ggagaagcac acgatatggc aatccaatat tggtcaatgc ttcatcgtga 840 acgcagtcta cgcgacagtg aaggaagcgg ccgaaagcgc gccgacgggg gaaaccaggg 900 tccggctgga tttaacagag gcaggaaacg ccctcgtgaa aacgaaaccc ctgctacgcg 960 tccccgcggc agccgctctt cgcccgggca accgcctttc aaagcccgtc gatggaatga 1020 cggcacgcag gcggcccatg agcgcaaaag ggactggcag aagaaagagg gacgctgttt 1080 ccgatgcaac gagatgggcc acagaatctg ggagtgcccc aaggccgccg acggcgcgcc 1140 ggggaaagat gcctcccgcg aacaatagga actgcggcct ctggagctct tagtcctgcg 1200 tcacccgaaa gaacctacgg cgttccctat cctcggcacg gtggagctac aactgcaaga 1260 atcacccagt atcagcgccg atattaagcc gataggggga tgggatcaca ctgtacaaac 1320 ggtgaaggca ctactagact ccggggccga tgacaacttc atctcctatc gatacttact 1380 ccaacacggg atcaccgtgt cgaaagatca tctccaacca ccggtgcgcg tgcggtacgc 1440 ggacggtacc ccggcgccgt gctacggcaa gctgatcgta aaatcccaag taaccgattg 1500 ggagaaccag acaagggggt acgagatgga gtactacatt accgacctcc aagacccgga 1560 ctatcagatt gttttaggac gcaagtggat gcgccacgct gatcccgacg tcgttttctc 1620 ggcgggaaca tggcgctacc gaaaagaagc tccaaagatc tctatcgaga atcccaaacg 1680 gtttgcaaag acgatgcgtc acacccttac catgatggtt 
atgtaccggc cggataccga 1740 cgacgggccg ccggagctgc cacctcggta tagaaagtgg gcccacgtct tttccgaaaa 1800 agaagcagct gagcttccgg atgagaaggc cgcacatccg ataccattgg tagaaggggg 1860 agttccgcca tacggacccc tatacaacct ctcgcagcat gaattacagg tgttgcggga 1920 atatctcgac aagatgctcg aacgaggatg gatcaggcat tccaccagcg cagccggagc 1980 gccggtgctc ttcgttcgga agcccgatgg ctcgctaagg ctgtgtgttg actatcgggg 2040 actgaatgcc gtgaccgtga agaatagata cccattgcct cgtatcgacg agctgatgga 2100 tcggctcgtg ggagcaaagt atttcactaa gcttgattta agagacgcat atcatcgaat 2160 tcgcatccag aaaggcgacg agtggaagac ggcgttccga acgcgttacg ggcattttga 2220 atacacggtg atgccgtttg gcctatgtaa cgcgccagca accttccagg cgtacatcaa 2280 cgaagccatg aaaggaatcc tggatgacta ctgcgtcgtg tacttggacg acatcctcat 2340 ctactcgcag accgaggagg aacacgtgcg gcacgtcggc gaggtgctgt cgcgcctaga 2400 gcgggcaaat ctatacgcca agctgtccaa gtgcaacttc caccaagaag aagtaacgtt 2460 cttaggattc atcgtgggac gcgatggtat acgcgtcgat ccggaacgcg tgcgtaccgt 2520 cggcgaatgg ccggcgttag aatcctttca cgatattcag gtgttcttag gattcacggg 2580 gtactttagg cgcttcgtgc accagtacgc ccggatcact aagcccctgt ctgatatgtt 2640 gaaaggaatg cagaagggaa ggaagacagg gccgtttatc cttaccacgg aggcgcgaga 2700 agcttttgat gaactgaaag cggcgttcag cggccctccg atcctgcgac actttgaccc 2760 tgcattacga atcaaattga tcacagacgc ctcaggattc gctatcgcgg gcatcctcct 2820 acaacctgtt gacggggttg aggagccccg ccggcgttcc gactggcaac cagttgcttt 2880 ctactcaagg aagctgacgg agatagaagg tcgttatcct gttcacgatc aggagttgct 2940 tgccatcgtg gaatgtttca aagtgtggag acactacctc gaaggcgcac cgcatgcgat 3000 ccgcgtgcag agcgaccatg aaagcttgca atacttctat tcgaagaaga cagtgaaact 3060 ggataagcgt caggcccgat gggcagagct cctggcggct tacgatttcc agatagaata 3120 ccggcctggg cacctgaacc cggccgacgc gccttcgcgg cggaaagact atgaagatgt 3180 ccacgtacag agaaacgttg gactgttacc cacgcttcag aggaagttac gggcggtacc 3240 ggatgatcgg aacgacgtcg gagcgccttc ggtggccgag tcggagattt cgatcgtgag 3300 ctcaagaggt ccggaactct tggctctcag aacgatggca cgagaagttg ccagagagga 3360 agtcgtttgc gaagcaacgt cgcaccctct tcgagatgcg atcctgcagg 
cccaacaggg 3420 ggacgccttt gttggccaaa tccgcggcaa actgctgccc ggtgaaggaa agaaggagag 3480 cgggcacaaa agcgacgttg ctgaaacaag cggcgacctg acggggcgat ggcgcgttga 3540 cggcggattg ctgtaccgcg gagaaaccat ctatgtgcct ccgtgctctg cactcaggca 3600 agagatcctg agagtgcatc acgatgatcc atttgcgggc cactttggcc gagagaagac 3660 tcttgaattg atcagaagga agttctattg ggacggtctg cgtacagaca ttgagaacta 3720 cgtccgggac tgtcctgtct gccagaagat gaaggtccct agaaagctgc cgcagggcga 3780 gctggcctcc cttcctgttc cagaagggcc atggcaagac ctgacgatgg acttcgtggt 3840 cggcctgccg ccatcgggcc gaaagggccg cgtatatgac gcaattcttg tggtggtaga 3900 ccgatacacg aaggccgcga gatacctacc cacaaccgga accgtgactg cagaagaact 3960 ggctaacatg ttcctagatg aaatcgtctg caagaccgga tccccgagaa gtctggtcac 4020 tgacagaggt tcgctcttta ccagcgcgta ctggatgcaa ttctgtcaag gtctccgtat 4080 aaaaggccgc ctgagtaccg ctttccaccc gcagacggac ggacagactg aacgtcagaa 4140 tcaaacgctc gaagcatact tgcggatgta tacaaactac caacagagcg actgggcaga 4200 ttggttgcct accgcggagt tcgcttacaa caatgccaag aacgcgtcca cgggatattc 4260 gcccttcatg gcttggcaag gcatggaacc ggcggtcccc gggctcgaag ccatcggacc 4320 gacgacatca aacgcatcag tagaactgcg gttgaaaggc ctggccgaca tccggaccaa 4380 actggaagaa aatctgaaga aggcaacgga gcggcaggca gaaggatata ataagcgcca 4440 caaggccacg cagctgcgcg taggacaaga agtcctgcta tcaacaaaga atatcaagtc 4500 atggcggcca aacaagaaac tggacttgaa gtacgacggc ccatttatga ttacggaggc 4560 cgtcgggaaa caagcgtacc gcttgcggct gccaaaggcg tacggtcata tccaccccgt 4620 ctttcatgta tcactgttga agccatacca ccggcgcgca ggagtagagg ctggagaagc 4680 tcaaccgatc gtcatcgacg atcaggagga atgggaagtg gaagaagtgc ttgctcatcg 4740 agtatactat cgaaagctgc agtatcttgt caaatggaaa gggtggccga gttacgagaa 4800 ctcgtgggaa ccagaggaaa accttaagaa cgctgctgaa actgttgcgg cttaccggaa 4860 agccagcgag gtgccggagg caccccgtcg gtcacggagg agagcatgat ctttttccca 4920 cagggttttg atatgcctcc gaaagacgcg gcggcattac tcttcctctt tcacccacag 4980 ggttttgagg tttaatgagg atgtcctgag ctactacacc catatcgtca acaagaatac 5040 tcatgcagga agagcaaaac aacttcatta agttaagaaa agaaaaacag aaaaacggcc 
5100 cgaaggcaca acgcaggttg cccgccagcg ggccccaaac aaacaaacag aaaaccaaac 5160 aaacgaacta ccacaaactc tcccagtcca ccaggtcgcc ctcaggaaga ggctccatac 5220 cgccgagctc ccgctggacg tttaccagac ggaataggtt gcggtttata tcccgcaacg 5280 ccgctagaca cggatggacc ttgttcagct taccgaggcg gttggtagcc tcctcgatat 5340 gcttggtcag gtcatccgtg gccttcacgg cctcttgact cggcggcagg gcctgcagac 5400 gagcgacctc ggcccgcgcg ggtcccggca gcgcaagaca cggcttcttc aggcgcgcac 5460 accgcacgca gggccgcccg gcaccccgag acacgcaaac caagggtccc tcgacaccag 5520 aagaagtgcc aacgaacaac gacttggcgc agcgcctaca gggcacctat aaaccgtcag 5580 cgtcaccccg tcgcgccgca ctcggattca gactgcctac ttaccggggg ggaactggga 5640 ggcgccgggg gaggcgacgg agaccgcggc ggcggcgaag gcgaccccca ggggggcaaa 5700 ggagagtccc ccccaggggc cggaggcagg agatcatcaa gatccccgta agggggccgg 5760 ggcaacgtat cctaaaccac ctgaatcagc cagaacgcac ccctgccaga ctgaggaaca 5820 cttacctccg aaagaggagg gcccaacggg gcaggaggcg ggggaggggg cgcgagcggg 5880 ggcacggggg gctaataagc atattagctc tgtttccctg cgaaggcgca ccgacggctc 5940 cccgaaagat gaggatattt agaaaggagt actcacaggg ttctgcaggg ccaaccgggc 6000 ccgctttgcc tcctgagagg ctttcgcaac tccacgaacg gcgttcttac gtttacccat 6060 cttcaaaaca acagagacgc cgtcaattcg caaccctagc ggcgccccta cgggaaagga 6120 agccaaaact taccgcgctg tctgcacttt caaagtgatc tgaagaggtg aacatggcta 6180 cgagaaatgt tacaatttat aggaataaag gaccctacga cgctatcccg aaagcgttcc 6240 gtcagtgttc cgactattca ggactattgt gaagcaatgt gttggacccc tacaatccta 6300 ttgttgagcg tttgaataag gcgcgttttt gtgttgcgcg gcctatcgaa gcgaggtgtt 6360 gagcgcctca gaactcaggc acgaaccggg acggttcgtt tttccgaagg ggggatagt 6419 // ID Copia-2_LBS-I repbase; DNA; FNG; 4566 BP. XX AC ABFE01000771; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_LBS_; KW Copia-2_LBS-LTR; Copia-2_LBS-I. 
XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-4566 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000771; Positions 42671 38106. XX CC Positions [1778-2299] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 188..4564 FT /product="Copia-2_LBS-I_1p" FT /translation="MSTSSSTSIKKLTHGNFHAWEPLVTAELQRLGVWRMC FT TGDEAESLVKPTPMSLPETATVSERISAQRNHADALQAYEAACHRNDQAIG FT MIKTHIEPSQFEGIDKLGTAKEVWEKLVLRHKDTHTGLSAFYTKVGILEKK FT YADGEDMHAHLDFLTMENRKLDKKAFDDEFLAQIMLMSLPRDSTWETVVVV FT LLQSASDTTPLSTTVVSTKLMQEYRRITGGEQADSALAARQAKASSSKAKS FT KKQCGYCKFLGHGELDCRKKKRDLENGTTEGKDSGKKDRQKKLSTANVAQV FT TATSEPEQAAENIASVYAEFPSDDNDDDVHVFIATEVVALLSHQSHLETYI FT DSGCSRHLTPRRELFIDNTFTKLEKPIKVHLGDTSVIPAVGRGTIHYLMET FT PLGVVPALIPNALWVPELAASLLSVACFTDHGKHDILFDNDDCFIRSKPSG FT KCVASAHKTSGSLYRLIARPMTSKEYANAAITSRHLDINLLHRRLGHLGYD FT NVKQLVDKGMVDGVTSVGGRIEFCEACVHGKQHRLPFPPSKKRARRKLDLI FT HSDVCGPFPNSIGGKRYFVTFIDDCTYKGWIYFMAKKSEVYTMFRKFKLLV FT ETATGQKIKVFRSDGGGEFTSFEFEGYLESCGIFHEKTTPYTPEQNGVAEI FT MNRTIVERGRCMLFEGNLSSGFWTYAFECSMYLSNRSPVSRLPDSTPEEAW FT SETKPVITSFRPFGCPAYAHIPKAKRTKLAWKTRKCIMVGYEVGTKAYRLW FT DLKTRSIVISRDVIFDERINPPAVPAPPVDLSEIVWDGELTGDVQGLTQVG FT DAWDKADVESPPSAPVTNTQPIEHLIDEEENQLQDLDPLPNPLPVEPPHQE FT HPLPLPDLPAQPRRHRTELELLGDPPIIEGPRPRRLPERYRADSPPPEAEV FT PLDNGDADDDAFAQIAFVFAASATSTGYTDPMTLTEAMGSPDADSWQQAID FT EEVKSLQAMGTFVIVEKLPPGQKAVSSKLVFRVKRDNIGAIERFKARMVAK FT GYSQIPGMDFDETFAPVVKLTSIRVLCALAVTLKLHFHHLDVDTAFLNGIL FT KEEIYMRLPQGVGPLSGKIARLLRSIYGLKQASRVWNELLDHELAKLNFRR FT INADYCIYILQEGDFIVFLAVYVDDMGLLCNDLDFMQKIKDRISKIFKIKD FT LGSISQLLGVAIDYDHEAGILQLSQSRYIQQSLERYGFNDGRTHPTPLSSG FT AKISKSDCPSTPSEVEFMQNFPYQSLLGTLMYSMLGTRGDIAFAVGALSKV FT ASNPGKTHWDEAVHVLRYLGGTCDYCLVFDRKKAGEMTSFILGYSDSDWAG FT 
DLDTRRSTGGFVFLACGAAICWSSKLQNTPALSSTEAEYMAFTCASQEVIW FT LRQLLEQLGFKQDKPTLLLGDNQGAIALSKNPGNHPHTKHIQLRYHFIRFA FT VEDGQILLDYVPTKDMVADGLTKGLTGEKHMRFVSMLGLKPRMSG" XX SQ Sequence 4566 BP; 1111 A; 1259 C; 1021 G; 1175 T; 0 other; ttaggttatg ggccccggac ctacctacct gtgaaaacct cggatacaac acaccactca 60 cgcatattcc ccatcggaaa ccatccatat ctcgcatacc atttccacac acgcctaagc 120 gctcgttcat ccactttcag ctctcggatc accttatccc cacaatcacc accaaacgtt 180 cccaccgatg tcaacgtcat cttccaccag catcaagaag cttacccacg ggaatttcca 240 tgcttgggaa cctcttgtga cggcagagct ccaacgcttg ggagtttggc gcatgtgcac 300 cggcgacgag gccgaatcgc ttgtgaaacc cacgccaatg tcccttcctg aaacggcgac 360 cgtttccgag aggatttcgg ctcaacggaa ccacgcggac gccttacaag cttacgaagc 420 cgcctgccat cgcaatgacc aggccattgg catgatcaag acccatatcg agccatcgca 480 gttcgagggg attgacaagc ttgggacggc gaaggaggtt tgggagaagc tggttcttcg 540 gcacaaggac actcatacag gcttgagtgc gttctatacc aaggttggga tattggagaa 600 gaagtatgct gacggtgagg acatgcacgc ccatctcgac ttcctcacca tggagaatcg 660 gaagctcgac aagaaggcat tcgacgacga gttcctcgct caaatcatgc tcatgtccct 720 tccccgagac tccacctggg agacggttgt cgtcgtcctc ctccaatctg cttccgacac 780 cactccactc tccaccactg tcgtctccac caagctcatg caggagtatc gtcgtatcac 840 tggtggtgaa caggctgact ctgcgctcgc tgctcgccag gccaaggcgt cttcctcgaa 900 ggcgaagtcc aagaagcagt gtggctattg caagttcctt ggtcatggtg agttggattg 960 ccgtaagaag aagcgggatt tggagaatgg gactactgag gggaaggact ctgggaagaa 1020 ggatcgtcag aagaagttgt ccaccgctaa cgtcgcacaa gtcaccgcca cctctgaacc 1080 agagcaagct gcggagaaca tcgccagtgt ctatgcagag ttcccttctg acgataacga 1140 cgacgacgtt catgtcttca tcgccaccga agttgtcgct ctcctctccc accagtctca 1200 cctcgaaacc tacatcgatt ctggctgttc ccgccatctc acccctcgtc gtgaactttt 1260 catcgataac accttcacca agcttgagaa acccattaag gttcatcttg gggacacttc 1320 ggtcatacca gctgttggtc gagggactat tcattatctg atggaaaccc ctttgggagt 1380 tgttcctgcc ctcataccca atgctttatg ggtacctgag cttgcagcct ccctcctctc 1440 tgtcgcttgc ttcactgatc acggcaaaca tgacatcctc ttcgacaatg acgactgctt 1500 cattcgctca 
aagccttctg gcaaatgcgt tgcatcagcg cataaaacca gtgggagcct 1560 ctatcgcctc atcgcgcgtc ctatgacatc caaggagtac gcaaatgctg ctatcacctc 1620 tcgtcacctc gatattaacc tccttcatcg tcgcctaggt catctcggtt atgacaatgt 1680 caaacaacta gttgacaagg gcatggtgga tggtgttaca tcagtgggag gtcgtataga 1740 attttgcgaa gcttgtgtac atggcaaaca acatagactc ccctttcctc cttccaaaaa 1800 gcgcgctaga cgaaaattgg atctcatcca ttcagacgtc tgtggtcctt ttcccaacag 1860 cattgggggc aagcgctact ttgtcacctt catcgatgat tgcacctaca aaggttggat 1920 ctacttcatg gctaagaagt cagaggtgta caccatgttt cgaaagttca agttgcttgt 1980 tgagactgca actggccaga agatcaaggt atttcgttca gatggtggcg gcgagttcac 2040 ctcatttgag ttcgaaggct atctggagtc ctgtggcatc tttcatgaga aaaccactcc 2100 ttatacccca gagcagaacg gtgttgctga aatcatgaac cgcactattg ttgaacgtgg 2160 tcgatgcatg ctttttgagg gtaacctctc atctggattc tggacctatg cttttgaatg 2220 ttccatgtac ctcagtaatc gctccccagt ctcacgtctc ccagactcta ctcctgaaga 2280 ggcctggtct gagaccaaac ctgtcatcac ctcctttcga cctttcggct gtcctgctta 2340 tgctcacatt cctaaagcca agcgcaccaa acttgcctgg aaaacaagga aatgcatcat 2400 ggtcggctat gaagttggta ctaaggccta tcggctctgg gatctgaaga cacgatccat 2460 tgtcatctct cgggacgtca tttttgatga gcgcatcaat cctcctgcag tcccagcacc 2520 accagttgat ttatccgaga ttgtttggga tggtgaactg actggagatg tccaaggctt 2580 gactcaagtg ggagacgcat gggacaaagc tgatgtggag tctcctcctt cagcacctgt 2640 caccaacact caacccattg aacacctcat cgatgaggaa gagaatcaac ttcaggacct 2700 agatccactt ccaaatcctc tccctgttga accaccacat caggaacatc ctcttcctct 2760 tcctgatctt cctgctcaac ctcgtcgaca tcgaactgaa cttgagctac ttggggatcc 2820 accgataatt gagggaccac gaccacgtcg tcttcctgag agatatcgtg ctgattctcc 2880 ccctcctgaa gcagaggttc ctttagacaa tggtgacgca gatgatgatg catttgctca 2940 aatcgctttt gtatttgccg cttcagccac atccactggt tacactgatc ccatgacgct 3000 tactgaagcc atgggcagtc ccgatgcaga ttcgtggcaa caagccattg atgaggaggt 3060 taaatccctc caagctatgg gcacatttgt cattgttgag aaactacctc ctggacaaaa 3120 ggctgtgagt tccaaactcg tatttcgagt caaacgggat aacattggtg ccatcgaacg 3180 cttcaaagca cgcatggttg 
cgaaaggcta ctctcagatc cccgggatgg atttcgatga 3240 gacttttgca cctgtggtaa agctcacttc aatccgtgtt ttgtgcgccc tagcagtcac 3300 cctcaaactt cacttccatc accttgatgt tgacacagca ttcctcaatg gtattctcaa 3360 agaagagatc tacatgcgcc ttcctcaagg tgttggtcct ctctctggga aaatcgctcg 3420 gctacttcgt tcgatctatg gattgaagca agcctctcga gtttggaacg aactccttga 3480 ccatgagctc gcaaagctca attttcgtcg aatcaacgcc gattactgca tctacattct 3540 tcaagaaggg gacttcattg tattcctcgc tgtctatgtt gatgacatgg gattactctg 3600 caatgaccta gacttcatgc agaagatcaa ggatcgcatc agcaagatct tcaaaatcaa 3660 ggatcttggt tctatcagcc aactccttgg tgttgcaatt gattatgatc atgaagctgg 3720 catccttcaa ctttcccaat cacgctacat ccaacagtcc cttgagcgct atggtttcaa 3780 tgatggtcga actcatccca cacccctcag ctctggagcc aagattagca aatctgactg 3840 cccctcgact ccctctgaag tcgaattcat gcagaacttt ccttatcaaa gcctactcgg 3900 taccctcatg tacagcatgc taggaactcg tggagacatt gcttttgcag tgggagcact 3960 cagcaaagtt gcttccaacc caggtaaaac tcattgggat gaagctgttc atgttcttcg 4020 ctatcttgga gggacttgtg actactgcct tgtcttcgat cgcaagaagg ctggtgagat 4080 gacttcattc atcctgggat attccgactc tgattgggct ggcgatctcg atactcgacg 4140 atccactgga ggatttgtct ttcttgcatg tggtgcagct atctgctgga gcagcaaact 4200 ccagaacaca cctgcactct catctactga ggcagagtat atggcattca catgtgcttc 4260 tcaggaagtg atctggcttc gtcagctcct cgagcaactt ggcttcaagc aggacaagcc 4320 cactttgctc ctaggtgata accaaggagc tatcgcactc tcaaagaatc ccggcaatca 4380 tcctcacacc aaacacattc aacttcgcta tcacttcatt cgctttgctg tcgaagatgg 4440 tcaaattctt cttgattatg ttccaactaa ggatatggtt gcagatggat taacgaaggg 4500 tcttactgga gagaaacaca tgcggttcgt ttccatgctc ggtttgaaac cacgaatgag 4560 tgggag 4566 // ID Gypsy-5_RO-LTR repbase; DNA; FNG; 259 BP. XX AC AACW02000062; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_RO_; KW Gypsy-5_RO-I; Gypsy-5_RO-LTR. 
XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-259 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000062; Positions 100638 100896. XX SQ Sequence 259 BP; 104 A; 37 C; 32 G; 86 T; 0 other; tgtcatgcct tttgctgata tttttggtaa tgttacagtt cattcctcat taagattatt 60 gtgcaagtta ctgaagatga ctcacgaagc agataacaat gaactatcaa catgactcac 120 aaaatataat tattactgta tataactctt gtttgaaaaa taatataaat actcgaagaa 180 tttgaatatc aataaaacaa gaccttaaaa aactttattt attattttaa agagtaaaaa 240 cataagttca agcacgaca 259 // ID TCA1_I repbase; DNA; FNG; 4838 BP. XX AC AF043301; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Candida albicans retrotransposon TCA1_I, internal region. XX KW LTR Retrotransposon; Transposable Element; TCA1_I; KW internal region; internal portion. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-4838 RA Chen y.J. and Fonzi A.W.; RT "A temperature-regulated, retrotransposon-like element from RT Candida albicans."; RL J. Bacteriol 174(17), 5624-5632 (1992). XX RN [2] RP 1-4838 RA Chen J., Wang Q., Fu Z., Zhou S. and Fonzi A.W.; RT "Tca1, the retrotransposon-like element of Candida albicans, is a RT degenerate and inactive element."; RL J. Bacteriol 180(14), 3657-3662 (1998). XX DR Genbank; AF043301; Positions 394 5231. 
XX SQ Sequence 4838 BP; 1870 A; 742 C; 831 G; 1395 T; 0 other; gattagaagc ttggtaaagt tctgctttgc tcaataggtt tcagattcag aaagattgtt 60 aaaacttaga tcatcttcgt tcatcacaaa ccaagaactt tacggaatgt acgaatatca 120 ctttcattag tagataattc gttacttaat ccagtgatta atcttgaggt tcgaaagatg 180 gttaatagaa atttatttga caattacgac taaggttaca taataaatca ttggtatcac 240 ggctatgaaa gcttccaaga tgtgatttta ataacagagt gtttttggtc tcaacagatg 300 agaatacatt gaatttaatg aattgctaac aaaagtcatc aattagtcta cgactgaaat 360 ggaatactat tcatattgtg ttaatgattg acattaaaaa tgaacaaaat aaaaggtatc 420 atagatttta tgctattcaa agtgatggtg gtgttgacaa aacgttttga aaagtaacga 480 tatttggact aaaaaggatt caagaattat gtcattgctg aataattttg gagaacctgc 540 cttaataatt ttatgaggtg tctactatat ggtaaagttt tgatgaacaa aagaaatgaa 600 attataatga ttgccaaatt tgtgcacatt cccattaaag ggataagagt ctacaataaa 660 gcaacagtag ttttcatgaa cactacttaa acaccaagga agaacaagat ctatgaggtg 720 ccatttttaa aaaattaagt tgaaggggaa aatctaaaag tacctatcaa tttatggaag 780 tttatttgtt aatctgtcaa ctatgtgaag gaacgtatag tcgaactact tggtaactca 840 tttcgttaat ccacacatga acgtttaaaa cctaaaaaat gaagtcatac atcttttcat 900 acaatgatct tatcaattca agacatacct ttgtgatgtt ataattttgt aagtcattca 960 aaggggagat ttgcactgaa taactcatct atgctcatac atgctggtat tgaaaaattg 1020 attttcacgt ttgttacctt tcagaagtct atgcattaca atagtgaaaa ttcaatgcga 1080 acccattgct aaatttacca gactcaaaag agaaaatgat tgagataaaa aattacagag 1140 attattcaca aatcgtccag tattgttaag agaaaagtga attagatgat aatcataagc 1200 ataggaatca acttcatgat gtcagataaa cccattatgg tattttatct atcattattc 1260 caacatgata tcccagaata catgtgataa tgaaattcaa taaactggtt aaagagaaat 1320 tttgaaatat ggcttcttta agaaatttta taaaatgaaa agagttgctg aaatccatgc 1380 tattacacac tttttcatat tcctgagaag tgaaagctac gtcacacagt cttcgttaga 1440 taagaactct aaatgttgga atatttgtac caggacatgc ataagcatcc agtcacaatg 1500 gccataaaca tgagaaacct acccaatgaa gactaccaat gaattatata ctgaaaaaat 1560 gtccaagata tgagtattaa ttaactattc accttatgaa tatcccaaat attcagacct 1620 atcataatga ttatttcata gacaaaaatg 
agtaccaatt ccacattatg agttgttgaa 1680 tgttgtggag tatgaaatta tgatgaacat ccacgtataa attagttgtc gagaattatc 1740 gatcgaacag atattagacc tagagctgat cccacctggc aaccgtacct gatgccgtca 1800 tacatcaagt acaccatact gtacagacac ctgatcatgg ggtgtaaaga taccatgatc 1860 aatcacaccg actattacga tctggggagg gtaattaccc cggacaccag gtgcgcaccg 1920 aaaaattggg aattcgagat cgcgggccta ccaccctaaa cactgcgatt tgatgttagg 1980 tgtaaacgac gaaacatacg ataccgtgat acatcgagga atcatattgg ttctctagat 2040 tccaagatga ttgccacacc atctttacca aaacgaatag ataatcaaaa tgatatcaat 2100 tcagggaaga tgacgttact gcaaagtttc ttaaagcagg taacgaactg aaattatata 2160 agacattgtc ataaagcaaa acaacatgta catgacattg ccgcaaacat tgacttatgc 2220 ataagaaata catcagatga gataagaaac tgaaaaagta atctcatttt ttctacggca 2280 aattttacaa agaagacaat atgggaataa aatagaccat attaccaaat tatgaaatgg 2340 acactctcta aagacttatc aatcattaca agctgctttt aacgtggaat tttctaagtg 2400 ttcgacattg atgttgtttt tatcactatc atcatatttc caaagttttc agggtcgatg 2460 attatagaac aagagctaac ctcaacgaga gcactgtttg aaagcacccg acaccccgag 2520 gattcaataa aattgaggat agaacggtaa aatagtaatc actaacgaag acatgctaga 2580 aatttttaca tgaaagacat tacatttgcc cctatgttca atgaacacga atgaaatttt 2640 ggatatacaa ccacatttta agtaataggt tatgggtata aaactaaact atagctagtt 2700 ggtattggta aaggagaaag gtccaatgac atgattttgt caataagtgc ttcaagaaaa 2760 taatattagc caacggataa gattgatcta gacaggagca ataacaacac gataacactg 2820 ataaattgca agaaccaagt aggtctcaat atttgatggt tattatcact ctttccccaa 2880 caaaagaata caaatgttca aaccaacacc acttcagggg aggagaattt atgaataaaa 2940 ttgatatgaa gtagtggatg atttggaaaa aaattcggta atgttatgaa tttaccaata 3000 gaataagata acattactac ccctgattgg tatattagac agaacttgtg atgatactaa 3060 aactcgaaaa gagatgtttc aagcaggggc atataatgtg caaaatgtca caatgactct 3120 aagattcaaa tggcatcatg atttctgaac taaataccag ccaaacactc tcggttgata 3180 cgtccaattt aaaagacggt caccgaaaac aaagtgaaag taagtgaaat catatgaata 3240 gattcacaag aaatcgtaat ggaaacacaa taatcatttc ataaatcttt gaagtataca 3300 caagaattga ggtcaataat tatctattaa aagactacca 
caaggttgac tttatgatcc 3360 caatacacca atagaggaga ttgaaacatc ctaggtcaac gtcgaagatg gaaatttgag 3420 acaaagatga ataaaaggtc gaccgataac attgtttgat gaatccaaag agaattggta 3480 cttcaataca ttaaataaag atcttccgtt aaatgtaaga tacgaaattg atgtgaatcc 3540 gttacttaaa tatctaaaga agaataatgc aacttatgag atatgttgaa aacatgaatt 3600 agcttaaatc acactaaagt gagaagtttg attttaagaa acataccata gagtatttat 3660 aatatcgtcc atcctactgc aaaaagacta ttccatgcta tactcaaatc aaaaaactgg 3720 gtacatactt aatgatgatg actaaattca aaccaagtgt caccatttct agatggctac 3780 tcaaaaatga gaatgatggc aaacaaatgg aagatagatc aatgtttgaa ttgcttagac 3840 agagattttg gctaaaacac cagcagacta tgaatgcgga atacaagtga taaagtttta 3900 caacttcacc aaattgtgaa tataatcaat ctggaatccc ggaaatagaa atcaaagatt 3960 gaaaattttt cgaaatcgac taataaaatg catgatttcg tcaaatttaa acaacacata 4020 cgcaaatata ttaaattaga ataccgatgt ggattgaaaa tggaggtgaa tacgtgtaag 4080 tacggttaaa attccacagt tttaattgtg tttcatggca tgtgtcaaga ttatccattt 4140 gcataaactt atcgaaatat tggtagtaag gaaacaaggt gaatgccggc cggatttacc 4200 aagtactgaa actcatatta ggtttggatg tatgaaaatc gagatatctg ataactacag 4260 aaaacacata tactacttaa aaaatacccg aagtcatttt catattacaa tcggatagtt 4320 aaacaagggg agctaacgct caaagagaga gtcacatatt tgaatatgac tgttgacaca 4380 gatctttatc tagtgcgggt tcatagatct agtgctaaga aaatagattg agttaatatg 4440 atcattagtg aatgaagaaa tgataacgtg aagcgatgtt acaagaagat tacgatacaa 4500 aagtgatact tacgttaaaa ttattcagta tttgcaagca tattagatta ataaggttaa 4560 aattaacaaa aaacaattag gtccataagc ccatattacg aaattattct tagaaaaagt 4620 tatgattaat tttcaatcca acattacaat attatctcaa aataaagatg ctaggttaaa 4680 acacactatt gtccttagag aaatggaagt tatgtggagt taaaaaataa tcagtggaaa 4740 gaatcaccga cacaatgatt tgacatcaat tgttgataac aaagtcgcta gaagagatta 4800 aggaatcaat gatgaaccat gattactgaa tcagggag 4838 // ID Gypsy-16_LBS-LTR repbase; DNA; FNG; 1036 BP. XX AC ABFE01001152; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 
16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_LBS_; KW Gypsy-16_LBS-I; Gypsy-16_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-1036 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001152; Positions 71093 72128. XX SQ Sequence 1036 BP; 202 A; 262 C; 194 G; 378 T; 0 other; tgtggaagtg caccttgttc cattcatggc gcacttcgca tatttttact tttttgttat 60 ttatgttttt gagacgagtc ctttactaaa aatcttaaac gtcgagttta gtctaggttg 120 gacattagac gtatgagcca cgtgacgcat cgacccacct tcgcgttcaa cgctttcctt 180 ttgaagggct aggacttgat cagtacgctt tcattttttt tgtctttgta tttattattc 240 ttttagtgct tgtatttttg acaaagtccg ttgaaccgac gctcgttctg aagactagac 300 ggtagggtag agcatcaacg ttcgggacat ccgcactgag ggcggcagtc gcgcacgcaa 360 atagacctct cccgacgttg tacgttcccc gagttattcc tagtccgagg attgtgctga 420 gggaatgcgt gagtgatgtc atttttaatt ttaattcttc tttcgctttt ctgtacagaa 480 cttaatgact tctttctatt tttacctttt ccccaaagtt ttggtcaggc tgcatgagag 540 tacgtgcgta gccttgtgat ttcgatattt ttgtacagta taaaggtgcc ctcccacctc 600 tcgaatggag ctgtcttctc cccctcacct ttcagtctga gcctcagtcg accattggca 660 ttctcataag attttaagag ttttacgtcg ttttctcaag ttattttcaa gctttccgct 720 ttacgtgcac gaagtatctt tacgattcgt cgaagcaacc ttcaacggcg cgttcctgga 780 cgcgctcacc gctaagttcc tcatttagca tctacctttc tccctccgtt ttaaggcctt 840 cccacacttt caccggcgct tccgttttcg tccttccctt tgtcttcatc ttatttcctc 900 gctcgacgta cctcttgtgc tttttgtgca aacctttcga acgttcttct cactcctttc 960 ttttctgtca ttttactctg gttcttcgta gacaaccaga tcacctgtcc ttcaattaaa 1020 attctgttct tccaca 1036 // ID TDH2_I repbase; DNA; FNG; 5173 BP. XX AC AJ439551; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 
7.1, Last updated, Version 1) XX DE Debaryomyces hansenii retrotransposon TDH2_I, internal region. XX KW LTR Retrotransposon; Transposable Element; RNaseH; TDH2_I; gag; KW integrase; internal region; pol; protease; reverse transcriptase; KW internal portion. XX OS Debaryomyces hansenii OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Debaryomyces. XX RN [1] RP 1-5173 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439551; Positions 377 5549. XX SQ Sequence 5173 BP; 2034 A; 763 C; 771 G; 1605 T; 0 other; atagattaga agtcaaacta agtgttacac ttcaatgtca gaacaattta cgtattctga 60 tcctactcat gtcaaacatg caaaaaccgt taataacaga gtctatgaat cgagtaatga 120 agaagaaatg gggtcaaaaa cctataaaat ccatcattct aaatcttctg atgtgatcaa 180 taaatttgat tcatcatcgc tattgaaatt taaccaaaaa gaaattgttg ataatgccat 240 tgaacaagca aaagaattgt accaatgtta ctttgagaaa ataccttttg ttcacgttaa 300 tgatgtttca actttgttat tactccaaaa tgcgtttaat aaattgttat caaataacaa 360 aggtcttgga aaattattgt taattaatga tccaacgatc caggactatg gaaaagtaaa 420 gatgtcttca gtatattgtc aaaataataa tctccagatt tggtttcccg tttttagaga 480 attattatct caatcattat tccataaaac actggcgata cagaaatatt atccagagta 540 tgttttttct ttaaactata cgtataaata tattcgtgat aatagagagg tatttgttag 600 tgaaatttta ttaaaactta ttaatttcta tattcatgca atggataatc cttttatgga 660 cgtaactgat atttctgcaa ctgctactta tggaactggt caagaaattg atttggaaat 720 aaatgatttt attaaaaaat tggacctata tcgaataatc ttaatttata aaagacttaa 780 gactaatata aagaacctgg aagcatttat tgatgaatta acatttgctt atcttgagtt 840 tccaaacaac aataacttgc taaatctaga tcaaattatt gtaaatttga aaagaaatac 900 aaggtttatt aaacccaaat attttcagga gaaaactgat aattattcaa acagattcga 960 taataataaa aagaagcaac tgaagaaaaa taataaatat gaacatagga atcaaaattc 1020 ttcaaaggat aatgatgaaa attaacaaat tggtagtcac 
aatactgata taaaattatc 1080 tagtttttca ataaataata ttatttcaag tgaccaacca atcaataatc agtacatcta 1140 tgataataat aaaatagatg atagcaaaaa ttatgttttc gacagcggag caagcgttac 1200 tgtggtaaat gatataaaac aattaataaa tataaaggaa gatttagaag atagtgaaag 1260 gctattctat actgtttcaa atgtaccaat ttatgcaaaa gctaagggtg atttagttat 1320 aaaaatcaaa aactgcaaat ttacaattaa taacgcttat tatattccag gaatccaact 1380 gaatctaata agtttcaatg atttattgaa taaaggtttt ctaatattat ttgacaacga 1440 ttatttattt cttattaaaa aatctaatgg aaaatgtttt aatattgcaa aacgagatac 1500 gaattctaat ttgtttaaca atattagatc aagagatttt ttcagtatga aactatctcg 1560 aaaaattata gatgaattaa ataatatgaa tcagaaaatt atcaacaaga aaacattgaa 1620 ttatttcaat aataagtaca ttaagatttc taacatgaat catgaaatcg caaacccgtt 1680 taaacgcaaa cgcgatctaa tgtactatca tcttatgggt aaccatatgt ctcttgaaag 1740 tatgaaatac ttaattaaaa gtgggcatat aaaaatgagt tctgaaataa cggcttcgga 1800 agaagaaagg gttaaatctt gtaatgaatg cttagcaatc aattcaaaac aatcctcaca 1860 taaccatact catttcacag cacccagaag gttatttaga ttacattcag atacgttggg 1920 aatatttagt caccgaggca aaaagtatta cattacaacc ttaattgatg aatactccgg 1980 ttacttaaat acaatatggt ctgaacataa atcaattcaa gatttattat ttgaaaaaat 2040 aagaatttgg aataataaat ttgtagatgc aaatgttgcg ttctttcgaa ctgacaatgc 2100 attggagatg cctactaaag atcaattagc agaaatcggt attgaaaaag atgaaattgc 2160 atcttacagt ccagaattaa atggaattag tgaaagaact aatagatcca taattcaatt 2220 tataagaaaa gccctcttac ccatacaaga tactagaact ctttatcttc tacctaagat 2280 tgttgactat gtaacataca taaggaacat gacacctgtt cgttcaaaag gaggtctgtg 2340 tccgtatgca ttattctatg atacgaataa gttccattat aatcctatac aatttggtct 2400 agacgtaatt gttaagttaa gttccacagt ggaagctaaa aagtatggag tcagcaactc 2460 aaaaactagt ccatgtattt tatttggtac atttattggt tacggtacag acgtacacgt 2520 ttataaagtt attttatcta ccaaagactt tccaataata gtgacaccaa atattacatt 2580 aatgaaatca atggataatt taaaagtcta tctaaagagt ctcgattacc ttcaagacaa 2640 agatgctcaa gacatcgata tttcactcgg aaaacttaca gatagaacag atgaaaaatt 2700 atctatagca atagagcata tgaatcaaga tacaatttat gaatgtaatg 
agtatgatac 2760 ccaggaaagt ctagtaaatt caaactcaac gattcaagac gtagctcaac atatattcga 2820 aagaacacat gaatcaaatt cactaggaat gtcaaatcca gtccactcta attatacaat 2880 ggtgactgat acgtcaaatg attttgatta taccatagag gaaactatga acgaagttaa 2940 tctgaatcca tctgatacaa atgaaacgac aactgattca aatatcacaa aaactaattt 3000 caacgactca ctgaatagta gtgataattc acaacaaaaa tcgtctactc atcaaacgtt 3060 acctcatcca ctggccccgg ataaaaatga tgaaataaat aagataaatg aagatggtaa 3120 tacacattta tcattagagt cgagtacatt atcggaaaag acaccgacaa ctcaaaataa 3180 tgtacgggtt tcaactagaa gtaaaaaagc cattgtagag acggttgaac ccagaggaaa 3240 aataacctca agtaaagatt cgagagttaa aaaggatcca cataaaacat ctttacgcaa 3300 ttcttcaaaa tcctcgaaga aaataaatga cttacaatcg catgcaccta aagtaaataa 3360 gtcattagaa actattccat atactgtcat aactcgatct atgtcaagga aactaaactc 3420 acaatcatcc aaatccaatt ctttagtcac tcggaaaatt aatactgtat atcgaaaagt 3480 cgatttaaca gataacaact ggaagcagtc aatacaacgc gaattagata cttttaagaa 3540 atatgaagta tatacggttg tgaaaaatcc taaaaatgtc aaacctattc caactacttg 3600 ggttcataca cataaaatta acgatctcaa agaagttcag tataaatcac gttgcgttgt 3660 acagggcttt aggcaaattg caaatgaaca ctatgatacc tcgaaggtgt catctcctgt 3720 gattgaatta tccataattc gtttacttac agcgatagca gttgaatatg aatggccgat 3780 acatcatctt gatatatctt ccgcatattt acatgcagat atcgactatg agaaatccat 3840 atttgttaaa ccaccacctg gatcgaatat tgattctggt aaatgttggc aattaaacaa 3900 atctgtttat ggaatgaaac aagcagggta tatgtggtat caatgtataa ctaaggttct 3960 tatggatctt aatttcgaac ctgatactgc cattagcgga atgttttgta aatattttgg 4020 tgaaaataag aagctcatcg ttgcactata tgtggatgat atgtttttaa cttcgtctaa 4080 tattacaatt cttaacgatt tcaaacttga actcgctaaa catttcgacc ttaaatattt 4140 cgctgacata tctgaattct taggaattga attcattcaa attgcagggg ggtatagatt 4200 atcacaacac aatttcttga attctgtaat taaaaaattc aatttaacaa ataattatgg 4260 aaagtatatt ccaataatta aagagaaaaa taaaatttcc gaaactcaac taaagagtga 4320 atttactaat gacgaagtca ttaaaaatga atcattgcta aatgaagaag ataaaagact 4380 atatcaatca ggtgttggtt cgttattatg ggcagctaat aacacgcggc cagacataag 
4440 ttttgctgta aatcaattga gctcaaataa tcaaaatccg acatcagtcg atcttgaaaa 4500 attgatatat tgtttacgtt atgtaaaaca aacaatatct tttagtttag agtataagcg 4560 gaatcgattt tcacataaaa agggatcatt tatcattcaa actttttcag atggatcttt 4620 tgcacccaca aaagatagaa gactgattac aggttataca gtttatttaa atggtaatct 4680 aataaattgg tctaccataa ggcagaaggt tatcactgat agttctgcag cttgtgaaat 4740 caatgcacta cattcagctg tgagaagtac acttaaatca agacaagcta tattagatct 4800 aaatttagtt attgatgaaa ttacattatt tgaagacaac gccgcagtta tagcgaactg 4860 taataatgaa ggtacttcat attcaaggcg tatggtcgat atcaaattaa aatttatcag 4920 acagttagtt tctgaaggta tattaaaatt aaaatatgtt aacacaagta ttaatatagc 4980 agatatgtta acaaaggcat taagccgaaa attatttgaa aatcttcgtt cactactatt 5040 tgaaagaaat gatttaaata aagaatagaa tcatttacta tatgtcaaag agtaccaatt 5100 gttttttacc acaaagttag gttttacaat ataaagatct tatgattaaa ctaatctata 5160 agatctaggg gag 5173 // ID Gypsy-116_MLP-I repbase; DNA; FNG; 5588 BP. XX AC AECX01000868; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-116_MLP_; KW Gypsy-116_MLP-LTR; Gypsy-116_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5588 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000868; Positions 47114 41527. XX CC Positions [4389-4868] - Integrase core CC 'AAGAG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 274..1362 FT /product="Gypsy-116_MLP-I_1p" FT /translation="MDDLSANIANILQQLTNLNSKLEEETAQRKATQDELA FT LVQQQLKNFSQASLQPPPTTPQPPPHVDNPMTSDVKEPITAADFKSLIGST FT RGPKVGTPDKFDGTKGDLAEAFVNQVGLYLLANSHAFPDDKTKVIFTLSYL FT TGEANQWAAPFYKRLLRPENEAPLTWEEFAATFEATYFDSDRQSRAQRDLR FT ALQQTGSVSEYTTKFMSLAARTGWGEMEHVSHYKLGLKQEVRVNMILKTFT FT SLPEITAYALAIDNELHPGANRTINRNTTITPTPARDPNAMDLSSARFSVS FT KEELARRSERELCFKCGRGRHKAADCGRKDERGNWIRSGGGNARIAELETQ FT LAELKGKGRADESKNGGTRD" FT CDS 1458..5489 FT /product="Gypsy-116_MLP-I_2p" FT /translation="MRLPLSQPQILPSLATSDSPIALCLLDCGATHEAISE FT AFVRKHGIHTSKMNTPCTVSAFDGQTKQLTEEAHLIINDDTTPTRFIVTQL FT KNSYDVLLGMPWFQTHGNQIDWVNGTFTTSSPTDIAAAETVSLLPTTPLTS FT KEDQRDARNSSEGVCIGADDSNNTLIPPQCESFLDPVKYDLETAGDLSHFL FT ELSHPRLTVIPTHEPMEPEIATEISVSSIPTTPLKPKEEQRDARQCEEGVC FT IGADDSNNTLIPPQCEFDNVSDPTHPETAGKCDPFQNRSSPTATLPELCTA FT KASWSTAARIAAEGHKDVSKTPVKELVPTRYHKFINMFTKSKAQGLPPRRK FT FNFRVNFIPGATPQAGKIIPLSPAENEVLDAMVDEGLANGTIHRTKSPWAA FT PVLFTGKKDGKLRPCFDYRRLNALTVKNKYPLPLTMDLIDSLLDSERYTKL FT DLRNAYGNIRVAEEDEDKLAFIFRAGQFAPLTMPFGPTGAPGCFQYFIQDI FT LLGRIGKDVEAFIDDIMIHTKEGVDHEQAVESVLEILSKHSLWLKPEKCEF FT SQPEVEYLGLIISKNKIRMDPAKVQAVTDWSTPKNTSEVLRFIGFSNFYQR FT FIGQFSKMARPLHDLTRKDVTFSWNKERQKAFNELKHAFTSAPVLKIADPY FT QPFILECDCSDFALGAVLSQKGNDGEIHPVAYLSRSLIPAERNYEIFDKEL FT LALVASFKEWRQYLEGNPNRLDVVVYTNHKNLESFMTTKQLTRRQARWAET FT LGCFDFVIKFRPGRHSTQPDALSRRPDLQTSPGDKLTFGQLLRPSNITHDT FT FSEIDAFDAQFDLEETVEHPQAPEWFQLDVLGIEATGEQEDQMEFKSDQQL FT MNEIRTAMKEDLNLQELKTVVLNPVSTKVKAALTEYQIVDELLYKGGKVVV FT PENESLRADILRTYHDSKLAGHPGRTKTLNLVKRNYTWPGMKNFVNRYVEG FT CASCQRVKPSLMKPFGALEPLPIPAGPWTDISYDLITGLPASQGKDSILTI FT IDRLTKMAHFVPCKETDGAEQLADIMMREVWRLHGTPKTIISDRGSIFVSK FT ITSQLNKRLGIKLLPSTAYHPRTDGQSEIANKVVEQYLRHFVSYHQDDWAA FT LLPPAEFAYNNSTHTSTGVSPFKANYGYDLNLGPIPSEDQCVPAVEERLQR FT LAEVQMELQCCIAGAQESMKLQFDKHVRSTPTWKVGDKVWLSSRNISTTRP FT SPKLDYRWLGPFSIVKPISKSAYKLKLPLTMKGVHPVFHVSVLRKFEQDTI FT QPRKTVPSPPIEVQGEEEWEVDTILDSRKRYNKLEYLVSWKGYGKEDDSWE FT 
PASNLSNAQDMITEFNQRFPNASSKHKRSRRR" XX SQ Sequence 5588 BP; 1791 A; 1286 C; 1272 G; 1239 T; 0 other; tattgcagtg tctcactacc tgcagcggac gactagatcg aatctcgaat tagaatacgg 60 ggtcgaaatt gaaaattaga attgaaactg aattagaacc gaaagcttaa attgattgat 120 tgaaacttta tctgaagaaa tagcaagaag acccggacta tcaccgcaaa aacggcgaac 180 tttacagctc ctgcagaagg acaagaatca gataactcaa ctgaattcga aactgtatcg 240 gcgcaagaag taactttaga agaatccgaa gcaatggatg atctatcggc caatatagcc 300 aatatacttc agcagctgac aaacttaaat tcaaaactag aagaagaaac tgctcagcgt 360 aaggcaactc aagacgaatt agccttagta caacaacagt tgaaaaactt ttcccaggct 420 tcattgcaac cccctcctac taccccgcaa ccgccacctc acgtagacaa tcccatgact 480 agcgacgtta aggaacctat cacggcggca gatttcaaat ctctcatagg ttcaacccga 540 ggtccgaaag tgggaacccc ggataagttc gacggaacta aaggagatct cgccgaagcc 600 tttgtaaatc aggttggctt gtacttacta gcaaattctc acgccttccc ggacgacaaa 660 acaaaagtca tctttacatt atcgtatctg acgggagaag cgaaccaatg ggcggctccg 720 ttctacaaac gactattgcg gcccgaaaac gaagcaccac ttacttggga agaatttgca 780 gctacctttg aagctacgta ttttgactca gaccgtcaaa gccgagcaca acgcgatctt 840 cgagcactac agcaaacagg atcagtgtcg gaatacacga caaaattcat gtctttggca 900 gctagaacgg gttggggaga gatggaacac gttagtcact acaagctagg attgaagcaa 960 gaagtcaggg tcaacatgat attaaaaacc ttcacgtcat taccggaaat aacggcttat 1020 gcgctagcaa tcgataacga actacatcca ggggcaaacc gtacaatcaa tcgcaacacc 1080 accatcacac caaccccagc ccgagacccc aacgctatgg atttatctag tgctagattc 1140 tcagtctcaa aggaagagtt ggcaagacgg agtgaacgag agttgtgttt caaatgtggt 1200 agaggtagac ataaagcagc tgattgtgga aggaaagatg agaggggtaa ttggattaga 1260 agtggaggag gaaatgcaag aattgcagaa ttagagactc aattggcgga attgaaagga 1320 aagggtagag ctgatgaatc aaaaaatgga gggactcggg attgatagat gtgcctatcc 1380 cgagtaatga ggaggagatt attggaattg gagctgtcac ttatttgaaa cgcaatgcac 1440 gcgattcgcg cctttttatg aggttaccat tgtcccagcc ccaaatccta ccctcccttg 1500 ccacatcaga ctcaccaata gcactttgcc tcctcgactg cggcgccacg cacgaggcaa 1560 ttagtgaagc atttgtcaga aagcacggaa ttcacacgag caagatgaac acaccctgta 1620 
ctgttagtgc tttcgacggt caaaccaaac aactcaccga agaagctcat ctcatcatta 1680 acgacgacac gacaccaacc cgattcattg taacgcaatt gaagaactca tacgatgttc 1740 tgttaggcat gccgtggttt cagacgcatg gtaatcagat tgattgggta aatggaactt 1800 ttactacaag ttcacctact gatattgcag ccgcagagac ggtttcgctg ctaccgacaa 1860 cacccttgac gtcaaaggag gaccagaggg acgctaggaa tagtagcgag ggggtatgta 1920 ttggcgctga tgacagcaat aatacactaa tacccccgca atgtgagtca tttttagatc 1980 ctgttaaata tgacctagaa acagctggcg acctttcaca ttttctagaa ctgtcccatc 2040 cacgactcac ggttataccg acgcacgaac caatggaacc cgaaattgcg actgaaatat 2100 cagtctcttc cataccgaca acacccttga aaccgaaaga ggagcagagg gacgctaggc 2160 aatgtgaaga gggggtatgt attggcgctg atgacagcaa taatacatta atacccccgc 2220 aatgtgagtt tgataatgta tcagatccta ctcaccctga gacagctggc aagtgtgatc 2280 cttttcaaaa tagatcttca ccaacggcaa ccctacccga attatgcacg gcgaaggcct 2340 cttggtcaac ggcggcgagg atagcagctg aaggacacaa agacgtcagc aaaacaccgg 2400 tcaaagaact cgtaccaact agataccata agtttatcaa catgttcact aaatccaaag 2460 ctcaaggact tccaccaagg aggaaattca atttcagggt caacttcata cccggagcta 2520 caccacaggc gggcaaaata ataccgctat ctccagctga gaacgaggtg ctcgacgcta 2580 tggtggacga aggattggca aatggaacca tccaccgtac aaaatctcct tgggcggctc 2640 cggtgttatt cacgggcaaa aaagatggta aattaaggcc atgtttcgat tacagaagac 2700 ttaatgcact caccgtcaag aataaatacc ccctacctct taccatggac ctcatcgaca 2760 gcttacttga ttccgaaaga tacaccaaat tggatctaag gaatgcttat ggtaatatac 2820 gggtagcgga ggaggacgaa gacaaattag cgttcatatt tcgagctggt caattcgccc 2880 cattaaccat gccctttggg ccaacgggtg cgccggggtg ttttcagtat ttcatacaag 2940 atatactact tggtagaatt ggaaaagatg tggaagcttt tatcgacgac ataatgattc 3000 acaccaagga aggagtcgac cacgaacagg cggtagaaag tgtcctagaa atacttagca 3060 agcactcact atggctgaag ccggagaagt gtgagttttc acaacctgaa gtcgagtacc 3120 tgggactgat aatctcaaag aacaagatca gaatggaccc agctaaggta caagccgtga 3180 cagattggtc aaccccaaag aacacgtctg aagtcttacg ttttattgga ttctcaaact 3240 tctatcaacg gttcatagga caattctcaa aaatggcaag acctcttcat gatttaacaa 3300 gaaaagatgt 
cacgttcagt tggaacaaag aacggcagaa ggcatttaac gaactaaaac 3360 acgcgttcac gtcggcacca gtactcaaaa tcgccgatcc ataccagcca ttcatccttg 3420 aatgtgactg ttctgatttt gctttaggtg cggtactgtc acaaaaaggt aacgacgggg 3480 aaatccaccc ggtagcctac ttgtcgcgct cactcattcc ggccgagcga aattacgaaa 3540 tattcgataa agagctcctg gcactcgtgg cttcattcaa agaatggcgt cagtatttag 3600 agggaaaccc aaatcgatta gatgtggtgg tttacaccaa ccataagaat cttgaaagtt 3660 ttatgacgac caagcaactt actagacgac aagcacgttg ggcggaaaca cttggttgtt 3720 ttgatttcgt aattaaattt cgcccaggcc ggcattcaac acaaccagat gcattatcga 3780 gacggcctga tctacaaact tcaccggggg ataaactgac gtttggtcag ctgttaagac 3840 catcgaacat cactcacgac actttctccg agatcgacgc attcgacgcc cagtttgacc 3900 tcgaggagac agtagaacac ccacaagcac ctgagtggtt tcaattggac gtgctaggaa 3960 tagaggctac tggcgagcag gaagatcaaa tggaattcaa atcagaccag caattgatga 4020 atgaaatccg tacagccatg aaagaagatt tgaacctaca ggaattaaaa acagtagtcc 4080 ttaaccctgt ttcgaccaag gtaaaggcag cattgaccga ataccagatt gtcgatgaat 4140 tgctatacaa aggtggaaaa gtagtagtgc cggaaaatga atctttacga gctgatatcc 4200 tacgcaccta ccacgacagc aagctagccg gacacccagg gagaacgaag accttaaacc 4260 tcgtgaaaag aaattacacc tggccaggca tgaagaattt tgttaacaga tatgttgaag 4320 gctgcgcgtc atgtcagaga gttaaaccct cactgatgaa accctttggc gcacttgagc 4380 cattaccaat cccagcaggt ccatggaccg atatcagtta cgatttaatc actggactac 4440 cggcctcaca agggaaggac agcattttga caatcatcga ccgtctcaca aaaatggctc 4500 attttgtacc gtgtaaggaa acagatggcg ctgagcaact ggctgatatt atgatgcgag 4560 aggtgtggcg gcttcatggc acaccaaaaa ctatcatatc agatagaggt agtattttcg 4620 tatcaaagat tacatctcag ctgaacaaac gtcttggcat caagttactc ccttcaaccg 4680 cgtatcaccc acggacggac ggccagtctg aaatagcaaa taaagtagtg gaacaatact 4740 tacgccactt tgtttcgtac caccaggatg attgggctgc attactccca ccggccgaat 4800 ttgcgtacaa taacagcacg cacacctcaa caggtgtatc accattcaaa gcaaactacg 4860 gctatgacct taacctcgga ccaatacctt cggaggatca gtgtgtacca gcagtagaag 4920 aacgactaca gcggttagcg gaagtacaga tggaactaca atgttgcata gcgggagcgc 4980 aggagtcaat gaaactccaa 
ttcgacaaac acgtacggtc aacaccgaca tggaaggttg 5040 gggataaggt gtggctcagt agtcgcaaca tctcgacaac acgaccaagc ccaaagttgg 5100 actatcgctg gctaggacct ttcagtatcg ttaaaccaat ttccaagtct gcttataagc 5160 taaaactgcc gctgacaatg aagggagtcc accctgtatt tcatgtctct gttttaagga 5220 aattcgagca agacactatc cagccacgga agaccgtacc atcacctcca attgaagtgc 5280 aaggagagga agaatgggag gtagatacaa ttttagattc aagaaaacgc tataacaaat 5340 tggagtatct ggtgagctgg aaggggtatg ggaaggaaga tgattcgtgg gaaccagcaa 5400 gcaatttaag caacgcacaa gacatgatta ctgaattcaa tcaacgattt ccaaatgcct 5460 caagtaagca caaaaggtcg aggagaagat aagtgtgggt caagcttttt cccactgggt 5520 tttttaacgc tgacccaggg aaagaaggca ggacaacaag aggagtctgg gccttaaaag 5580 gggagtac 5588 // ID Gypsy-3_LENY-LTR repbase; DNA; FNG; 356 BP. XX AC AAPO01000087; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Lodderomyces elongisporus genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_LENY_; KW Gypsy-3_LENY-I; Gypsy-3_LENY-LTR. XX OS Lodderomyces elongisporus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Lodderomyces. XX RN [1] RP 1-356 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lodderomyces elongisporus RT genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; AAPO01000087; Positions 8513 8158. XX SQ Sequence 356 BP; 94 A; 95 C; 68 G; 99 T; 0 other; tgtcgcatac agggcatcta cgtcgtgatt aagctcgcta ccaacctata agtagctccc 60 ttactaacct cttgtagaac gcctgaaacg ccttagcatc ttgccgcctc gagtatgtcc 120 gcctctgggc atcagactag cgaactgcag atacgatagc caattagaga gctgatattt 180 ctcgagataa gctttgtgct acgtcttctg aggttgtcga gaagagctta aatagccacc 240 aagatcccag ttagatagac ctcctctgta cttacctcct tagatagtta atacattcga 300 acccctatca acctttagtt tgacacttac gctatttccc ggatcaagat gtcaca 356 // ID Copia-1_VA-I repbase; DNA; FNG; 5075 BP. 
XX AC ABPE01003719; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Verticillium albo-atrum genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_VA_; KW Copia-1_VA-LTR; Copia-1_VA-I. XX OS Verticillium albo-atrum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetes incertae sedis; Phyllachorales; OC mitosporic Phyllachorales; Verticillium. XX RN [1] RP 1-5075 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Verticillium albo-atrum genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; ABPE01003719; Positions 31010 25936. XX CC Positions [2007-2507] - Integrase core CC 'TTCA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 3214..5037 FT /product="Copia-1_VA-I_1p" FT /translation="MRNGSDDLPEERLTIYRNAGPIEVWKAAFAAGRLANP FT LGAIDGEQIDKAKWLRLLRNPRTLHRRQMPPLPKHHGDLATHPMKTMFEEA FT EAAHLRSHEEMNSWAEISKHDLRVRGYQILDCMWVYVYKFDKHGQFQKCKA FT RLVVRGDQQRKSTHEETYAATLAGRSFRTLMAVAARFDLELIQYDAVNAFV FT NAKLDKNVFMRMPPGHRKPGMILMLRKALYGLRESPLLWQRELTATLTELG FT FKPVPHETCCMTKGGVIIFFYVDDIVLAYRKQREDEVERITRALQAKYKLT FT GGTPLQWFLGVEIIRDRENKLLWLSQSDYLDKIANLIDRTDLRHDVPMKQE FT ELLPHDELASCGSIRSYQRKIGSILYAAVITRPDVAFAASRLARHNSNPGP FT EHHTAADQVLLYLRKTKTMALQFGGDDVFTMASDASFADNTLDRKSSQAYA FT MTLFGGTIGWRANKQSTVTTSTTEAELLALSQAAKEAMFVSRLIKELGVTL FT DDERIRIRCDNQQTIKLVNKDIATLRTKLRHVDIHNHWLRQEAQKNRILVE FT YIPTANMIADGLTKPLNHQRLQIFTAQIGLIDISDRLEGRLLRPIEKQELW FT ERMEELEIGD" FT CDS join(51..1313,1317..2741) FT /product="Copia-1_VA-I_2p" FT /translation="MPNDPTAKALLRTFEDWGRWDEEFQTKATSLHQWEYI FT DPDSEQSLLEKPQRPEFSKYARRVPARSTEPDANSGLRGRRSGQSSQTISQ FT QSSEPIVSLPETTDPHQRARNIAELTAEDKVSFQFEWRVYEQDYKEYKEQE FT TNCEKLKAWVIETVDYGLRQSSCRPQWDLRRWYLNLKTSVGTTDREQQTAA FT RRKYQEATSILTKAPKDFANWITTWEIATSHALVWKTGGVEDPNTWFDDLT FT KAIRPILGHWVTIYRGIYKDKLEDKTLSIGEVAKDLRTEAEQQNLLHPQEK FT 
RPKIVRGSFGPTFAGESSVGSGTGVQDENITSRPQEEERTARKRKAQDTLT FT TTRRTRAATSNRGDAACRACDGRHDLGNCYYIFPTKAPEVFRERKVIREAV FT DSRLRTDKSLKDDVNRLKKVNQLEEAELGRGVGQCTASFVVAALNAAFSIT FT QYPLKNSAILDSGSTIHIFNNNNRFTRFQQAEEGDYVFAGKRRVPIRGYGI FT VDIQTRGPKGPKVIRLCDVALCENFPCNLVSLRQLHKRGYWWDNRPRQNCL FT RDRNHAIVSRLAVRHDQFVIEDIPLGCPDSAFVTRRHRADSWSERTSKADA FT RKWHLRLGHPGPRALEHLANTSRGARVKGTTTVECDGCALSKMTRQVSRRP FT RKLGHAPGDRLAIDFHDVTRDQEGQRSVMLVTDRWSGMIWDYYLTDRRTEV FT LTAAIKDLLGVLKRQYELAPKVIECDNEIMKHSLTTIAFLRTLHIKVEPSA FT PYTPAQNGSAEVSGSIIKTKARAMRVGARLPEHLWVEIYRAAVYLHNRTPK FT YIYSWRTPYDGFHTFLAHRDGVVVENRKPYQAHLRVYGCKAYAMTREAQLG FT TQKKQTQPQGMDWVSRGIPIHEHIPNLESTYWQDHIYEGRHLQ" XX SQ Sequence 5075 BP; 1383 A; 1391 C; 1373 G; 928 T; 0 other; attaagagtc tcatacgctc acgagccgct gcgccggacg acgatacgcg atgccgaacg 60 atccgaccgc gaaagcctta cttcggacct tcgaagactg gggcaggtgg gatgaagagt 120 tccagacaaa ggccacaagt cttcatcaat gggaatatat cgacccagac agtgaacaga 180 gcctgctgga gaagccccaa cgaccggagt tctccaagta tgcgaggagg gttcctgccc 240 gctccacaga accggacgcc aattcggggt tgcggggccg aaggagtgga cagtccagcc 300 agacaatcag ccaacagtct tctgaaccca tcgtatctct gccggagacg accgacccac 360 accagagagc tcgaaacatc gctgagctta ccgcggagga caaggtctcc tttcaattcg 420 aatggagagt ctacgaacaa gattacaagg aatacaaaga gcaggagacc aactgcgaga 480 agctgaaggc ctgggttatt gagacggttg actacggtct gcggcagagt tcctgtcgac 540 cacaatggga tctacggagg tggtacctca acttgaagac aagcgtgggc actaccgaca 600 gagaacaaca aactgccgca cgccggaagt atcaagaggc tacaagcatc ttgactaaag 660 cgcccaaaga ctttgccaac tggatcacca cgtgggagat agccaccagc catgcactgg 720 tgtggaaaac tggaggtgtc gaagacccca acacttggtt cgacgacctt accaaggcca 780 ttcggcccat tctcggccat tgggtcacaa tctacagggg catttacaag gacaagctcg 840 aagacaagac tctgtcgatc ggagaggttg ccaaggacct tcgaacggag gctgagcagc 900 agaatcttct ccacccacaa gaaaaacgtc ccaagatcgt cagaggctcc tttggcccga 960 catttgctgg tgaatcatca gttgggtcag gaacgggcgt ccaagacgaa aacatcacct 1020 cacgcccgca agaagaagag agaacggccc gcaagcggaa ggcgcaagac actctgacaa 1080 caacacgacg aactagagcc 
gcgacctcta accggggcga tgctgcctgt cgagcgtgcg 1140 atggcagaca cgacttgggc aactgctact acatcttccc gaccaaggcg ccggaggtct 1200 tcagggaacg gaaggtgatt cgagaagcag tcgacagccg tctacggaca gacaaaagtc 1260 tgaaagatga cgtgaaccgt ctaaagaagg ttaaccaact ggaagaagcc gaatgactcg 1320 ggaggggtgt tggacaatgc actgccagct ttgtggtcgc tgctctcaac gctgcatttt 1380 ccatcaccca gtacccgctc aagaactcgg ccattctcga ttcaggttcg accattcaca 1440 tcttcaacaa caacaatcgt ttcacgaggt ttcagcaggc agaagaagga gactatgtct 1500 ttgcggggaa aaggagagtt cccatccgtg gatatgggat cgtggacatc cagactcgag 1560 ggcccaaggg accgaaggtc atcagactgt gcgatgtggc tctctgcgaa aacttcccct 1620 gcaaccttgt ttcgttacgt caactgcaca aacgcggata ttggtgggat aaccgcccaa 1680 ggcagaactg tctccgagac cggaatcatg cgatcgtttc cagactggcg gttcgtcatg 1740 atcagtttgt catcgaagac atacctctgg gatgccctga cagcgcattc gttacccgac 1800 gacaccgagc ggattcttgg tcagagcgta cttccaaagc tgatgcccgg aagtggcact 1860 tgcgacttgg ccatcctgga ccgcgggccc ttgaacatct agcgaacacc tccaggggtg 1920 cgagagtcaa gggaacaacg accgtggaat gcgacggttg cgcactctcc aagatgacac 1980 gacaagtcag tcgccgacca aggaaacttg gccatgcgcc cggcgaccga ctagccatcg 2040 actttcatga tgtgaccagg gatcaggagg gccaacgcag tgtcatgctc gtcaccgatc 2100 gctggtcggg aatgatctgg gactactacc tcacagacag aaggacggaa gtcctgactg 2160 cggcaatcaa ggacctgctt ggcgtcctga agcggcagta tgagctggcc cccaaggtca 2220 ttgaatgcga caatgagatc atgaagcaca gtctcaccac cattgcattc ctccgcacgc 2280 ttcatatcaa ggtcgaacca tcagcgccgt atacacctgc ccagaatggg agtgccgagg 2340 tgtctggaag catcattaag acgaaggcca gagcgatgcg cgttggcgcc agactgccgg 2400 aacacctgtg ggttgagata taccgagccg ctgtctacct ccacaaccgg actccgaagt 2460 acatctacag ctggcggact ccatacgatg gatttcacac attcctggca cacagggacg 2520 gagttgtcgt ggagaaccgc aagccatacc aagcacacct ccgcgtctat gggtgcaagg 2580 cttatgccat gacccgggaa gcgcaactag gtacacaaaa gaaacagact caaccccaag 2640 gcatggattg ggtttctcgt gggataccga tccacgaaca tataccgaat ctggaatcca 2700 catactggca ggatcatatc tacgagggac gtcatcttca atgaagacga actcttcaat 2760 ggggatttga accagctcaa ggatgatttg 
ctccacatca gtcagcaaga cttgatccat 2820 ctactcgata gggtagacca gcctgaacca gaccgtgcca ccgaggcagt agaagtgcct 2880 gacggtgcct ttctacccca acgaacttgg gagggccttc ctggagagga cgaagacgtt 2940 gaaccccagg agaactggga ggacttcgcc gttcaggaag cggaagggga cacagaagct 3000 gaaccgtcag aggctgcgtc tgatcaacca gccaggactg gcgagcgctt tgggtcaacc 3060 ggggatgcgg aaatcgcact gggactacca gagggcggca cggagagcca ggttacggcc 3120 ggcaccgatg accaagccaa gaactctgac gcccgggcta gagaatacgc tacccctgtg 3180 tcacttccac cggcagcact cctgtctttg gtgatgcgga acggctcgga tgatctgcct 3240 gaggagagat taacgatcta cagaaacgcc ggaccaattg aggtgtggaa agctgccttt 3300 gctgctggac gtctggccaa tccactcggc gcgatagacg gagaacagat cgacaaggcc 3360 aagtggcttc gactgctacg aaatcccagg acgctccaca ggcgacagat gcctccacta 3420 cccaagcacc atggagactt ggcaacacat cccatgaaaa caatgtttga ggaggcggaa 3480 gcagcacacc tgcgcagcca cgaagaaatg aactcatggg cggagatttc caaacatgac 3540 cttcgagtgc gaggctatca gatccttgac tgcatgtggg tttacgtgta caagtttgac 3600 aaacacggac agttccagaa gtgcaaagca cgcctggtgg tcagaggaga ccaacaacgt 3660 aagtcgactc acgaagagac atacgcagcg accctagcag ggcgctcctt tcggacactc 3720 atggcagtcg cggcgcgctt cgacctggaa ctgatccagt atgacgccgt caacgcgttt 3780 gtgaacgcca agttggacaa gaacgtcttc atgcgcatgc caccagggca ccggaaacca 3840 ggaatgatcc tgatgctccg gaaagcgctc tatgggctga gagagtctcc actactgtgg 3900 cagagggaac tcacagctac actgaccgaa ctcggattca aacccgttcc tcatgagacc 3960 tgctgcatga ccaagggcgg tgtcattatc ttcttttatg tcgacgacat tgtcctggca 4020 taccgaaagc agagggagga cgaggttgag cgcatcacac gagcgctgca agctaagtac 4080 aaattgacag gaggaacacc attgcaatgg ttcctcggcg ttgaaatcat cagagacaga 4140 gagaacaagc tcctctggct ctcccaatcc gattaccttg acaagattgc taacctcatt 4200 gaccgcacag atttgcggca cgatgttccg atgaaacagg aagaactcct gccacatgat 4260 gaactcgcaa gctgtgggtc aatccgaagc taccagcgaa agatcggctc aatactgtac 4320 gctgcggtca tcacacgacc agacgtggca ttcgccgctt cgcgacttgc gcgacataac 4380 tcaaacccag gacccgagca tcatactgca gctgaccagg tgctgctgta tctaaggaaa 4440 acaaagacga tggcgttaca gttcggcgga gacgatgttt 
tcacaatggc aagcgacgca 4500 tccttcgcag acaatactct tgaccgaaag agctcccaag cttacgccat gaccttgttt 4560 ggcggcacaa tcggctggag agccaataaa caatcaacgg tcaccacgtc tactaccgaa 4620 gcggagctcc ttgccctctc acaggcagcc aaggaagcga tgttcgtctc tagactcatc 4680 aaggagctcg gggttacctt ggacgatgaa cgcatccgaa taagatgtga caaccaacag 4740 acgatcaaac tggtcaacaa agacatcgcc acattgcgga ccaaacttcg acacgtcgac 4800 atacacaacc actggttgag gcaggaagca cagaagaaca ggatcctggt ggaatatatc 4860 ccgacggcaa acatgatcgc cgatggactt acgaagcccc tgaatcatca acgcctccag 4920 attttcacgg cacagattgg actcatcgat atctcagatc gactggaggg aaggctactc 4980 cggcccatcg agaaacagga actttgggaa agaatggaag agctggagat cggagactga 5040 cagtttccac ctgcgaggcc tcaactgggg gggtg 5075 // ID Gypsy-37_MLP-I repbase; DNA; FNG; 3809 BP. XX AC AECX01001016; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-37_MLP_; KW Gypsy-37_MLP-LTR; Gypsy-37_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-3809 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001016; Positions 18740 22548. XX CC Positions [2648-3160] - Integrase core CC 'GGGAC' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 801..1766 FT /product="Gypsy-37_MLP-I_2p" FT /translation="MAPNTRSSSSRASSIAGSAGAIGDDSGRRATSAVDSR FT KPSGSTNVSRRSRSQLDPGGIQSPREVSGQLPHLVQSQAGERFQNIQEIHT FT TRQEHVLPIIEDDRLQMFDGTQLKLESIGLGLEDVREASARSEIPAPDALF FT TTDTTITTSGEDSRRRDHRDHSSSRHGSEECERTILPLDRELSNTFAPAFQ FT LYLAPIKLSVETLHKEIKAIHSLVRTREPSKRLDIVDKTIANLVESINSQV FT RALNQVGTVVKQSSEDMTATKNGLDHWSKLAIEAIDMGMNGTRDQLMEKNK FT ERQHGKTSRDCLQQLHVYCSRISRNHLSSM" FT CDS 2030..3778 FT /product="Gypsy-37_MLP-I_1p" FT /translation="MLRWQMAIQEYKSYMTITHRPGILHSNADALSRMAMP FT NDSNNPAWEAEDEDRDVPIMGINLCDLSDKFFTKIEQSYTKNSNTTILLKI FT FSLEHRDQSLESTLEIQWREPFTEGKFSLISGLLYHKERHTNVLVICDKDD FT IEQLISVCHDDFMSGHLSEDRVVERMSSTAWWPDWRKDVQQYVESCERCQK FT ANRQTGKRFGLLQRIEEPKFPWEVINMDFVTALPPAGSDNVNSVLVVVDRF FT SRRTRFIPCHKEIDAMGTALLFWQYIISDCGLPRVIISDRDPKFTSEFWKG FT LTKLIGTTLAMSTSYHPQTDGLAERMIQSLEDMLRQYCAFGLSFKDKDGYT FT HDWKTLLPALELAYNTSTHSTTGRTPFELERGYNPRTPKDMVGNKGIQVHP FT TSLSFHTMLSKARQHAKDCIEEAVKYNKDRWDKTHREPEFVIGDKVLISTI FT NFNNIQGPRKLKDSFVGPFTVVKLHGPNAVEVALTGEFARKHPTFLVSLLK FT HFKQSDTNKFPNRMEAKDAIPFDDDAPKTVLKVLDQKRVKISGKDVRLYLV FT RYKGKSADGDEWLPEDKITNAQQVLRKFRRDKKNSS" XX SQ Sequence 3809 BP; 1239 A; 823 C; 828 G; 919 T; 0 other; tttgggggcc tcatctattt tacaagtttc ttttttacaa ccaaagaaaa acaagcaccc 60 gactctatct cctaccatct ttcttagaat cataccaaac aagattgctt ccttctatct 120 gaaccaatca tcgattgatt tgtttagaga actaccgtac tatttgcata caacagaaaa 180 cccgttcagt gacgaagtct acgatccagt tttcggagct catactttaa gaccgaatca 240 tccgttctta cgacttgtca gcttagatca agaaccagaa ctcgcacttt tagaagatgt 300 catcgacgtt tgcgacctct gatttcctct cagaaacaaa tcaataactg ttaagatacg 360 agtaagcagc gcctttataa actatcacaa atagcaaggc tgaacgacaa gttttttgtt 420 tgtttaacaa aaactcctct tacatatata gtgcccatcg atacgattca accaattgca 480 ccttataacc caggagagga tctgattgac tacgaagaag ccaaaagtga gcatccattt 540 agtagctatt aaaagttttt tgacattttc gtctttttga gaaaataacg atttgattgt 600 ttgttatatt ggacagtgaa tacgactaac aataatccta gcttgactaa agacgtaccg 660 tcagtaaacc aagcttttca 
ggaattgtcg aagttcacga tacctaagaa aaccgcagaa 720 gctatcagtg tgtagaattt cgaaattttt ttttagaaga gggaaataac ccattacaca 780 tacgcattcc tgactgatta atggcaccta acactagatc cagctcctcc agagcatcga 840 gcattgcagg atcagcgggg gcaatcggcg atgattctgg acgaagagcc acaagcgcag 900 tcgatagcag aaagcccagc ggatccacca acgtatccag gaggtccaga tcgcaacttg 960 acccaggagg aatacaatct cctcgagaag tatcaggtca actaccacat ttggtacaaa 1020 gccaggcagg agaaagattt caaaacattc aagaaatcca cacaacacgc caagagcatg 1080 tattgccaat tattgaagat gatcggttac aaatgtttga tggaacgcag ttgaaactgg 1140 aatccatcgg actgggattg gaagacgttc gagaggcgag tgcgaggtca gaaattcccg 1200 ccccggacgc cttattcacg accgacacaa ccatcacaac atcaggggag gacagtagga 1260 gacgagatca tagggatcat tcaagcagcc ggcatggcag tgaagaatgc gagaggacca 1320 tcttaccctt agatagagag ttatcgaata cttttgcgcc tgctttccaa ttgtatttag 1380 cccctattaa gcttagtgtt gaaacgttac acaaagaaat taaagctatc cattcactcg 1440 ttagaactcg tgagccaagc aaacgcttgg atatcgttga caaaaccata gctaatttag 1500 tagagagcat caacagtcaa gtaagagcat taaaccaagt tggtacagta gtcaaacaat 1560 ctagtgaaga tatgacagca actaaaaatg gtctggacca ttggtcaaaa ctagctattg 1620 aagcaataga tatggggatg aacgggacgc gtgatcagct catggagaaa aacaaagagc 1680 gacagcatgg gaaaacgtca agagactgct tacaacagct ccatgtctat tgcagccgga 1740 tttcacgaaa ccatttatcc tctatgtaga tgcgagcttc tttggtttag gtgcagcact 1800 gcatcaaaaa caagtgagtg gaggcaaaac cgttaacggc ccagtctgct tcattttgag 1860 acagctgaag gagagcgaga aaaagtatgg tgcaccccag ctggaatgcc tggcgcttgt 1920 ttgggccttg aataaactgc attactacct cgatggcatt tattttgagg ttatcacgga 1980 ctgtcaggca attaagtcac ttttaaatac caagacgccg acgagacaca tgctccgctg 2040 gcagatggcc attcaggagt ataagtcata catgactatt acccacagac ccggaatact 2100 acatagcaac gctgacgcgc tgagtaggat ggcaatgcct aacgattcca ataatcctgc 2160 ttgggaggca gaagacgaag acagggacgt accgattatg ggtattaacc tgtgcgacct 2220 ttcagacaaa ttttttacga aaatcgaaca gagttacacg aagaattcta acaccacaat 2280 tttattgaaa atattcagtc ttgaacatag agatcaatct ttagaaagca ccctggaaat 2340 acaatggaga gaaccattta cggagggaaa 
attctcccta atatcgggac tactatatca 2400 taaagagaga cataccaacg tcttagtcat ctgtgataaa gatgacatag aacaactcat 2460 ctcggtctgc catgacgatt tcatgtctgg tcacctgagc gaggacaggg tcgtagaacg 2520 aatgagctcg acggcttggt ggcctgactg gcgaaaagat gtacaacaat atgtcgagtc 2580 ctgcgaaaga tgccagaagg caaatagaca gacaggcaag agattcgggc ttctgcaaag 2640 gatcgaagaa cccaaatttc cgtgggaggt cataaatatg gacttcgtga ctgctctacc 2700 accggcagga agcgacaacg tcaacagcgt ccttgtagta gttgacaggt tttctaggcg 2760 cactagattc attccttgcc ataaagaaat cgatgcaatg ggtacggcgt tattattttg 2820 gcaatatatc ataagcgatt gtgggctacc tcgagtcatc ataagcgata gggacccaaa 2880 attcacgtcg gaattttgga aaggcctgac taaacttatt ggcaccacgc tagcaatgtc 2940 tacgtcatat cacccccaaa cggatggcct tgcagaacgc atgatacaga gcctagaaga 3000 tatgcttagg caatactgcg cgttcggtct gtcgttcaaa gacaaagatg ggtacaccca 3060 tgactggaag actttgttac cagctttgga gttggcgtac aatacgagca cccatagtac 3120 aacgggaaga actccgttcg agttagaacg agggtataac ccaagaacac ctaaggatat 3180 ggttgggaac aaaggtattc aagtacaccc gacgtccctc agtttccaca cgatgttgtc 3240 aaaagctagg caacacgcta aagactgcat agaggaagcc gtcaaataca ataaggacag 3300 gtgggacaag acacacaggg agcctgaatt cgttataggt gataaagtat tgatttctac 3360 tatcaacttt aataacattc agggtcccag gaaacttaag gattcctttg tagggccctt 3420 taccgtagta aaattgcacg gcccaaacgc tgtcgaagta gcattaacag gtgaatttgc 3480 aagaaaacac ccgacttttc ttgtatccct gctcaaacat ttcaagcaat cggacacaaa 3540 caaattccca aacagaatgg aggcaaaaga tgcgatacct ttcgacgacg atgcaccaaa 3600 aacggttctt aaagtattag atcaaaaaag agttaagata tcaggaaagg acgtacgctt 3660 atatctagta agatataaag gtaaaagcgc tgacggcgac gaatggttac ccgaagacaa 3720 aataaccaat gcccaacaag ttctgcgtaa gtttaggaga gataagaaaa attcctctta 3780 ggctttttct tttttccgac cccggtgag 3809 // ID Copia-30_MLP-LTR repbase; DNA; FNG; 312 BP. XX AC AECX01003112; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. 
XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-30_MLP_; KW Copia-30_MLP-I; Copia-30_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-312 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01003112; Positions 1021 710. XX SQ Sequence 312 BP; 84 A; 63 C; 41 G; 124 T; 0 other; tgttggaaca aatcacgttc acaaaatggt ttatgtcaca tttctatttc atattatgtt 60 tactaatgat ttaatacatg ttgtaatttg ttacatgact ttactcacga actttgtaag 120 tttccaaacc taatcacgtg atgtaccgct agttttagtt catcaatcca tccggcctga 180 gagaagcttt ctttcttttc attcaactat attgaaagct tcttctctac aaccattata 240 tcatctcatc actccggatt actcattcct tcttgacttc tgactgatat aattgttgtt 300 tgggattgat ca 312 // ID Gypsy-2_MLP-I repbase; DNA; FNG; 8007 BP. XX AC AECX01001632; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_MLP_; KW Gypsy-2_MLP-LTR; Gypsy-2_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-8007 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001632; Positions 135530 143536. XX CC Positions [6822-7334] - Integrase core CC 'GATTC' target site duplication CC LTRs are 95% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(792..4169,4173..7964) FT /product="Gypsy-2_MLP-I_1p" FT /translation="MPPRKSDRVRSIESQSGRPNYSNTARRASSVSSVPGR FT RNARGHTGRSITTDGPPRATTTVDFTSRTQLQELEVGPGYESQQHPPNLDL FT SWIMEDVSRRLNDHPREGGSTEGELRLGSVCRREEALGEDQQWFHRVEPEP FT SRISKPQQTLPVSRLERDPRTSSVDIVGRTSPSRQSSANQGTVQEATPSGR FT RCPEHALDVGVQQGVLLSSERDQQIQRQGASEECERPTFSQIRTVFTATPS FT PFLSSHRILGDRNPVIENNVTRSPILHVVDKRVVQPTKEGNQRDYETNEMN FT HEVVHEQNRHLSPRFEQRESTVPTNVPSFDNASKTESISDRIKYRDDLERH FT DQQDQVSNNSLRKRKPGNLIQESKFIAHPKSIEDILRETEHGISPFVAAGP FT VRDNTDGLSSTSLDKDFSLANKLRKLRLQTESLSPKYNPTAINHGEEVSDT FT IRNLQKSIEQKLDNDIRILSEKWDFGINMLNDELTLNFDDIKKDVLEIREE FT ILSERNAGTHAEGKHSIEKILDKHSLACKQMIEIHSTHLSEMMCRLGEKLD FT NQAHQIVLLHRHVAKIISMIDRQANDRDEVPVQVLGNNPFVDYSEKHEAQH FT TTVAPKLETAVLLSTAQEAEQQKKDAAAAAAEERKALQKGQIPAKDWPTFS FT GEGEYNHLEWIEWIDNTREDTCIPDSLITCKLGVILTDSARSWYTVKRRNI FT KEKKSWDDWKKMIMDHFGTPIWKRRMATAFDRDTFKWENREKPLAWLLLQR FT RRMDAAWPFLSISEQINKILGLCNGDIKHAVQSRITDHDNFEAFMNIFEEV FT VTNTLIGRSLLKNKDYQNNTFSQRPGNSFSSRDRNSFSSRERSSSFREHRS FT IEHPKGQDIGKSKDYKNHKNLGTVTGKVSFRDNKSNRFEKKSIHAISNETD FT KEQNEEYLTEKFNDEIEDDSESSDNDELDLCVGNIDMLAVHQEYDSPPSVM FT IHTEETVQVQTPAVSLEDLAENLRIAEETVSSTDHPPFIERVTFNNTESRP FT FVAISISGYSSEMLLNTGCEPLIISTKVLNKYWPTWQTDTKTTNVSAKYAD FT DSDMIGDILIPIKLEHAYNACFLMVNLIVLIDDNLDHLILGSQDLREFGFS FT VELKDRTHCRIRTRYRSFFPIIPQKIWDEQSGEKAHGDISTITVESEEGLD FT IAELQSQASIPQQWEENAKPSSQISDARLLMSKPEKGRAHTIGQHCITSAL FT ISERQKIEVLLDSGAACSIVGTDYLNRFVKDWKESMMPPSNMTFRGCSSKL FT RPLGVIEIPMIFPHHLGSVRIRPEFVVMDNATSNYFILGGEFLRLYGFDIV FT HSKEKYFTIGNENKRKKFLLHSSREQKIAAVSLKLDESFEKAFGECNISPR FT LKATELYSLQAVVRKYKKAFTYGENPIGTIKGHKLKRRLNIEKPYPPILKK FT QAYPASPRSRDEIEKHINELVQYGILRKVGADEEVDVTTPVIIAWHNGKSR FT MCGDFRALNTFTIPDRYPMPRISHCLTNLGKAMYITTMDVMKGFHQNEVDE FT LTRYLLRIISHMGIHEYLKMPFGIKGAPAHFQRMMDTEFAAELSEGWLIIY FT IDNIIVFSETWEEHLEKLELVFKKLVRMGMTISLQKTNFAFEELKALGHVV FT SGLWIAVDQNKVAAVLKKQIPQSVKEVQQFLGFANYYRSHLEGFAKASGPL FT YKLLQKGVTFEMTADGVASWNILKKKLTEAPFLLHADFKLPFKVYVDASFD FT 
GLGATLQQIQIVDGKPFEGLISCISRKLKQSELNLYGATQLENLCLVWALE FT KFHYYLDGALFEVITDCTALKSLLNMKTPNRHMLRWQIAIQEYRSCMTITH FT REGKKHENADALSRMALENDIENPAWDPEDIGRDMPIMGITISDLSEEFYD FT KVMHDYTKEPNTVKLLSIMRQGCKALDLSSSLDKEWKKSFDEGRFTLLSGI FT LYHREKHSCAMVITSQRLKTDILKVCHDDIMSGHLSSERTIERVKQIAWWP FT GWKALTEDYVSSCDRCQKANRATGKRLGLLQHIEEPQIPWEVINMDFVTGL FT PPAGHDNVDCVLVIVDRFSKRCRFLACHKAATAMDVALLFWERIVSDSGLP FT RVIISDRDPKFTSEFWKGLFSLAGTSLAMSTAYHPQTDGLAERMIESLSDL FT IRRYCAFGLEFKDNNGYTHDWKSLLPALEIAYNTSVHSTTGKSPFEVERGF FT NIRTPSKMIRHKESTFHPTAMDFHNMLEKARKHAGDCIIAAKEYNKQRWDK FT THKEPSFKIGDLVLISTVNFTNLQEPRKMKDSFIGPYVIKALHGTNAVEVI FT LTGQLARKHPTFPVSLLKKYKKSDESKFPNRTEVEQIDPPMEEENIGAIKK FT IIGDKYVKVNGKSTRLYLARFKGKDTDEDKWLEAKDIPDSDRFLRKFRVER FT RETRAQP" XX SQ Sequence 8007 BP; 2813 A; 1599 C; 1799 G; 1796 T; 0 other; attgggggcc tcattgtgtg ttaataaccc aagactaaca aaagaaaacg tgctttagtt 60 ttcaatatat attttttttt ttgctttaga aaaccaataa aaaacccaca aaaaaatatc 120 aatttttttt ttctgttatc attttctatc ccacaatttc tttccatctg atagatcagt 180 gctcgttaga tctattcaaa gaactacctt actacctttt tgtccaagaa aatcctttca 240 cggacaagat ttataatcag tgttttggat accacgtgct cagtcctaga ttacagtatc 300 tctttcaaaa tttcggacga agcaaactca gtgttaactc aactcctagc gattaataaa 360 tcattaggaa gtctaatctt tacattcgta agggttaaat tatttaacga accactaccg 420 gcggagttgt atagagacct ctggaagctt ggcaagaaac taaaagcttt ggccataaga 480 ctcgacgcag gagaatacga gtgagtacaa aatagtaaaa caaaagaaaa gatcaaactc 540 actgataaaa cattacggtc cacctagtga ccaaaacgca gcagtccaat caggctctaa 600 cgaaaggaga atagggagta ggcaagatcc cattgaactc gacagcccag agcccgaaca 660 accagccagt ccagggtcat acaacgggta caacccacac gacgacattg aagaccttgt 720 ctataggtca acctggtaag taacattaaa actttaccta gttctgcttt aacttcgaac 780 taagcctaaa tatgccacct agaaaatctg acagagtcag gagcattgag agccagtccg 840 gcagaccgaa ctacagcaat accgcccgaa gagcaagctc agttagctca gtaccagggc 900 gacgcaatgc aagaggacat accggcagat caatcactac cgacggtccc ccaagagcaa 960 caacgactgt tgatttcact tcacgaacac aacttcaaga attggaagtc ggcccaggat 1020 acgaatccca 
acaacacccg ccaaatctcg atctttcttg gattatggaa gatgtctcac 1080 gacgccttaa cgaccatcct agggaaggag gaagtacaga aggggagctt aggctgggat 1140 ccgtatgtag aagagaggaa gctcttggcg aagaccagca atggtttcat agggttgaac 1200 cagaaccctc acggatcagc aaaccccaac aaaccttacc agtctccaga ttggaacgag 1260 atccaagaac cagttcagtc gacatcgtcg ggaggaccag tccgtcacga caatcatcgg 1320 caaaccaagg aaccgtacaa gaagccaccc caagcggaag aagatgccca gaacatgcac 1380 tcgatgttgg cgttcagcaa ggcgtactat tgagcagtga aagggatcaa caaatccaaa 1440 ggcaaggggc atcagaagaa tgcgaaagac caacattctc acaaataaga actgtgttta 1500 cggccactcc aagcccattt ttgtcatctc atcgtatctt aggtgatcga aatcctgtta 1560 tagaaaacaa tgtaactaga agtcctattt tacatgtagt agacaaaaga gttgttcaac 1620 caactaagga gggtaatcaa agagattacg agacgaatga aatgaatcat gaagttgtcc 1680 atgaacaaaa cagacatctg agccccagat tcgaacaaag agagagtact gtacccacaa 1740 atgttccatc atttgacaat gcgagtaaga ctgagagcat ttcagaccgg atcaaatacc 1800 gggatgactt agagagacac gatcagcagg atcaagtaag taataattcc ctgaggaaga 1860 ggaaaccagg caacttgatt caggaatcaa aattcattgc gcaccctaaa agtatcgaag 1920 atatactgag agaaaccgag cacggtatat ctccgtttgt agcagctgga ccggttaggg 1980 ataatacgga tgggctcagt agcacgtcat tagacaaaga ctttagctta gcaaataaac 2040 tccgtaagct aaggctacaa acggagagcc tgtcgccgaa atacaaccca acagcgataa 2100 atcatggaga ggaagtatca gacacgataa gaaacctcca gaaatcaata gaacagaaac 2160 tagacaacga tataagaata ctgagtgaga aatgggactt cggtattaat atgctgaatg 2220 atgaactaac cttgaatttt gacgacatta aaaaagatgt gctagagata agagaagaaa 2280 tactctcgga aagaaacgca ggtacacacg cagagggtaa acacagcatt gagaaaattc 2340 tagataagca tagcttagca tgtaaacaga tgatcgagat acacagcaca catcttagcg 2400 aaatgatgtg tagactaggt gagaagctag ataaccaagc acaccaaata gttttgctgc 2460 accgacacgt agccaaaatc atctccatga tagatagaca agcaaatgac agagacgaag 2520 tacctgtaca agttttagga aataatcctt tcgtagatta tagcgagaag cacgaagcac 2580 aacatacaac ggtagctccg aagctagaaa cagctgtatt actttccaca gcacaggaag 2640 cagaacaaca aaagaaagac gcagcggcag cagcggcaga ggaaaggaaa gccttacaaa 2700 aaggccaaat cccagccaaa 
gactggccga cttttagcgg agagggagag tataatcacc 2760 ttgaatggat tgagtggatc gataatacaa gagaagatac atgtattccg gactcgttga 2820 taacttgtaa gttaggagta atactcacag attcggcgcg aagctggtat acagttaaaa 2880 ggcgcaatat aaaagaaaag aaatcatggg atgattggaa aaagatgatc atggaccact 2940 ttggtacgcc aatctggaaa agaaggatgg ccacagcttt cgacagagac accttcaaat 3000 gggagaatag agagaagcca ttagcctggc ttcttctgca gcgtaggaga atggatgctg 3060 catggccatt tctctcaatt agtgaacaaa tcaataagat actaggacta tgtaacgggg 3120 acatcaaaca cgcagttcaa tcccgcataa cagaccatga caacttcgaa gccttcatga 3180 atatattcga agaagtcgtg accaacacct tgatcgggag aagcttgctc aagaataaag 3240 actaccaaaa caataccttt tcgcaaaggc ccggtaacag tttctcgagc agagatcgaa 3300 acagtttctc gagtagagaa agatcatcta gttttagaga acacaggtct attgaacatc 3360 cgaaaggaca agatatcggg aaatcaaaag actacaaaaa tcacaaaaac ctaggaactg 3420 taacgggaaa ggtgtcattc cgtgacaata agagcaatag atttgagaag aagtccatcc 3480 acgctataag caatgaaaca gacaaagaac aaaacgaaga atacttgacg gagaagttca 3540 atgacgaaat agaggatgac tcggaatcat cagataacga cgagctggac ctttgcgtgg 3600 gcaatattga tatgctagca gtacaccaag aatacgattc ccctccttct gtaatgatac 3660 acacagagga aacagtacag gtacagactc cagctgtatc attagaggat ctagcggaaa 3720 atctaagaat agctgaggag acagtatcgt caaccgacca tccaccattc atagaaagag 3780 taacgttcaa caacacagaa agtagaccgt tcgttgcgat cagcatctca ggctacagca 3840 gcgagatgct ccttaatact ggctgtgagc cattgataat ctcaactaaa gtactgaaca 3900 aatattggcc aacatggcaa actgatacaa aaacgaccaa cgtatcagcg aaatatgcag 3960 atgattcgga catgatagga gatatcttga taccgattaa actagaacat gcatataatg 4020 cgtgtttctt gatggtcaac cttatagtgt tgatcgacga caatttagac cacctgatat 4080 tgggtagcca agacctacgg gagtttggat tttctgtgga gctgaaggat agaacacatt 4140 gcagaatacg tacgaggtac cgaagcttct gattcccaat tataccccaa aagatatggg 4200 atgaacagag tggagaaaaa gctcacgggg atatctctac gatcacggtt gaatcagaag 4260 aaggactgga tatagcagaa ctacagtcgc aggcgtctat accccaacaa tgggaagaga 4320 acgcaaagcc atcatcacaa atatccgatg caaggctcct tatgtcgaag cctgaaaaag 4380 gcagagcgca cacgataggg cagcattgta 
tcacatcagc cctgatcagc gaaagacaga 4440 agattgaagt cctgcttgac agcggagcag cctgctcaat agtgggaaca gattatctga 4500 atagatttgt gaaagattgg aaggagtcta tgatgccacc aagcaatatg acattcagag 4560 ggtgtagcag taagctacga ccattaggag taatagaaat accaatgata tttcctcatc 4620 atttaggatc agtaagaatt agacctgaat ttgtagttat ggataatgca acatcaaact 4680 actttatact gggaggagag ttccttagac tatacggctt tgatattgta cacagtaaag 4740 aaaagtattt tacaattggc aatgaaaaca aaaggaagaa gtttctgttg cactcttcga 4800 gagaacagaa aatagcagca gtatcattaa aattagacga aagctttgag aaagcttttg 4860 gagaatgtaa catctctcct agattgaaag ctacggaatt gtactcccta caggcggtgg 4920 ttaggaagta taaaaaggct ttcacatacg gcgaaaaccc aatagggact atcaaaggtc 4980 ataaacttaa acggaggctc aatattgaga aaccgtatcc tccaatactg aaaaaacagg 5040 cgtacccggc tagccctcgt agcagagatg agattgagaa gcacatcaat gaacttgttc 5100 agtatggtat cttgcggaag gtaggggcag atgaagaggt ggatgtaacg acaccagtca 5160 tcattgcctg gcacaacggg aaatcccgaa tgtgtggtga ctttcgtgca cttaatactt 5220 ttacaatacc ggacagatat cctatgccca ggattagcca ttgtttgacg aacctaggga 5280 aagccatgta catcacgact atggatgtaa tgaaggggtt ccatcaaaac gaagtagacg 5340 agttgactag atacttgcta agaatcatat ctcacatggg gattcacgag tatttgaaaa 5400 tgccattcgg tattaaaggt gcaccggcac attttcaacg catgatggat accgagtttg 5460 cagcagaact gagtgaaggt tggctgatta tctatataga caacataatt gttttttctg 5520 agacatggga agaacattta gaaaaactag aactggtttt taagaaactc gtaagaatgg 5580 ggatgacgat atccttacag aagactaact tcgcctttga ggaactgaaa gcattaggac 5640 atgtagtttc aggtttatgg attgccgtag atcagaataa agtggcagcg gtactcaaaa 5700 aacaaatccc gcaatcagtt aaagaagtcc agcaatttct gggattcgcg aactattata 5760 ggtctcactt ggagggattt gccaaagcca gcggaccgct gtacaaatta cttcaaaaag 5820 gggtaacatt cgaaatgacg gcagatggag tagcgtcatg gaatatacta aagaagaagt 5880 tgacagaggc tccattcctg cttcatgcag atttcaaact gccttttaaa gtgtacgttg 5940 acgcaagctt tgatgggtta ggagccacac tgcaacagat ccagattgtg gatggcaaac 6000 cctttgaggg attgatcagt tgtatttcaa gaaaattgaa acaatccgaa ctaaacttat 6060 atggggccac tcagctagag aatctgtgct tagtatgggc 
actggagaag tttcactact 6120 atctagatgg ggccttattc gaagtcataa ctgattgtac agcgctgaaa tcacttctga 6180 acatgaaaac tcctaataga catatgctgc gatggcagat agctattcaa gagtacagat 6240 catgcatgac aataacccat agggaaggaa agaaacatga gaatgcagac gccttgagta 6300 ggatggcact agaaaacgac atagaaaacc cggcatggga tcccgaggat attggtcgag 6360 atatgccgat tatgggaatt acaatatcag acttgtcaga ggaattctac gataaggtca 6420 tgcatgatta tacaaaagaa ccaaacacag tgaaactgct aagtataatg agacaagggt 6480 gcaaagcatt agatctgtca tcatctttag ataaagaatg gaagaaatca ttcgacgagg 6540 gcaggtttac cttactcagt gggatcctgt atcaccgaga aaaacactcg tgtgcgatgg 6600 ttataacttc gcagagactt aaaacagata tacttaaagt atgccatgat gatatcatgt 6660 caggacactt atccagtgaa aggactattg agagagtcaa acaaatagcc tggtggccag 6720 gttggaaggc tcttacagaa gattatgtga gctcctgcga taggtgccaa aaagcaaata 6780 gagctacagg aaagaggcta ggcctccttc aacatattga agaaccccag attccgtggg 6840 aagtcataaa tatggacttt gtaacgggat tgccaccagc tggacacgac aatgtggact 6900 gtgtcctagt catagtggat agattttcga aacgctgtag atttttggca tgccacaagg 6960 ctgctacagc catggacgta gcactactat tttgggaacg aatagtctca gactcaggct 7020 taccaagggt gatcattagt gatcgggacc cgaaatttac ttcggaattc tggaagggat 7080 tgttcagcct agcaggaaca tcattagcga tgtctacagc gtaccacccg caaacagacg 7140 gcttggcgga acgcatgatt gagagtctca gcgacctcat aaggagatat tgcgcatttg 7200 gtcttgagtt caaggacaat aatggataca cgcatgactg gaagagcttg ttaccagcct 7260 tagagatcgc gtataatacc agcgtacata gcactacggg aaaatcgccc tttgaagtgg 7320 agagaggatt taacattaga actccgtcaa aaatgatacg acacaaagag tcaacgttcc 7380 atcctacagc aatggacttc cataacatgt tggaaaaagc aagaaaacac gccggcgatt 7440 gtataatagc tgcaaaagag tataataagc aacgttggga taagacgcat aaagagccat 7500 catttaagat tggggacctg gtattaatat ctacagtaaa tttcacaaat ctgcaagaac 7560 cacggaaaat gaaggactca tttataggac cctatgtaat taaagcctta cacgggacaa 7620 acgcagtaga agttatatta acaggacaat tggcacggaa acacccaacg tttccagtct 7680 cactgttaaa gaaatacaaa aaatcagatg agagcaaatt tcctaataga acagaagtcg 7740 aacagataga cccacctatg gaagaagaaa acataggagc cattaagaag 
ataatcggcg 7800 ataaatacgt gaaagttaat gggaaaagca ccagacttta tttggcacgc ttcaaaggga 7860 aagacacgga tgaagacaaa tggttggaag ccaaagacat accagactcc gatagattcc 7920 tcagaaagtt cagagtagag agacgagaaa ctagggcaca accctgaggt tggggcgtca 7980 ttagattaca cccctgaggt tggggaa 8007 // ID Copia-52_MLP-I repbase; DNA; FNG; 4665 BP. XX AC AECX01000262; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-52_MLP_; KW Copia-52_MLP-LTR; Copia-52_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4665 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000262; Positions 17249 21913. XX CC Positions [1988-2371] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(146..2371,2375..4639) FT /product="Copia-52_MLP-I_1p" FT /translation="MNEEEFPFPVDFYTSSPLNSSTSSAEDFQSTSDIDSV FT GEQTILEGLIPNMASSTLPPTNVGTATVTSLFGNKRYQIASHMSNSLLKIS FT ITEKLNDTNYPNWSSEVRNGTKSLSFTNFLLSDKDESGINDDLKIFNDVVR FT ESLTTWMISKLDQPNRNWFEPFITVETGIGPDLESSPSKLWIKIRDHHAPR FT TPAHQMLMRRILFQLNQGSSTLSKHLDNFHRAYTAFTTSGGSIGQVELGQQ FT LLMSLNSEWIKVAEDIADKDTFEYNSVVTALNVRITNREMLHPNTKNLPII FT EANAVSPMRRRQGFVQTRCSPKKCVSLTHTEADCIRNPKNIDKYKAWVASK FT KAKGEWVERGPPSTSSVSVHQAEAITVPQSFPSLSSLQDALERSEFSASAS FT AVYIVNSNADIENEHFGIVDSGTTHHMFKSIKYFDVKSFIASSSPSERVGL FT AGGSTTLEIQGRGNVTLIGPSGDLTTLTDCLLVPSLKQNLIAGGRLLYEGW FT ITTRRDNGSFVIQKNGKTALTGKVDPTSMLLHVNAVRTLSSEPSPSSAAIT FT DNITETLQLLHHQLGHPNLAYLKQMIKGDIVLGIPVSLSKVVLPQTLPCNS FT CDLAKAHCQPHSQTRTRSLHPLDNVHIDLSGIMRTPALCRSLYFILFTDDH FT TSFCHITGLKSKEKEEVHDAIQTYLSLVERQCDRKVKCLTLDGGGEFLNDV FT MLPWCQSQGIYLRITAAYTPEENGVAERSMRTIVSKGRAMLLANLPIRFWM FT EAVKNAVFLNNRTMTTTIPKNKTPFEMWYGRKPDISHIRAFGCLAYVLIRK FT PDRDNKYAPNSEHGVLAGHTEHNRNYKIFTLKDSKIVITHDASFREDVYPF FT TQLPTFDISHLTADEENVDQLVLQTEPPVLPATEPDNDIDEQLVTVADVPI FT EPLPDVLPQPSLPRRSTREVRPVDRFIPNSNFVFWYNNKLYDDVFIPRGDV FT SAYAFAVKSNVRLITEPGSYKQAMKSTDSAEWKKACDKEIENMTRKNVWTI FT VDRPLDAPVVGCQWHFKIKHRPDGSIIKYKARLVAKGFTQTFGVDYDKTYA FT PTGKPASYRVLVAIASYFGWSIHSMDAVAAFLNSLLKEKVYMEQPEGYESL FT GSEDHVCELLRALYGLKQSAHEWNDEFKTMLFTIDFVQAEGDECVFIRRRS FT QFDIIIFYLHVNDMAVTGTDQSIPGFKKEVMAFWEMDDLGEATCVVGIETL FT RVSKHHYAIHQRSMTEALLLQFGLSDCKPASTPIQGGLKLTTSTSEEAASF FT AKLNRPYRSGVGSLMYLAQCTRPDIAYAVGVLSRHLDTPCERHWDALQHVM FT RYLRGTLSLGIHYHAGDTTIFKMQASWNVPITNVDSDWAGDKSSRRSTTGY FT LTTIFGGAISWRSRLQQTVALSSTEAEYRATTEAGQETQWLRNLFRDVGLE FT YSGPININCDNLGALDLASNAVHHGRTKHIEIEHHWIREQVNQGQINLVYC FT KSSEMTADLLTKPLHPGPFWEHMKGVGLRRCV" XX SQ Sequence 4665 BP; 1358 A; 1080 C; 950 G; 1277 T; 0 other; ctggtttatt aggttagttc cttagaggtc tgcatacctc gagctggaga agagtgctta 60 cctttagcac tggtttatta ggtgttctca ttcatttaca tggtagcgag agagacagtt 120 tcgaccgcat caatcgttat aacatatgaa tgaagaagag ttcccatttc cagtagattt 
180 ttacacctct tctccactaa actcatcaac ttcatccgct gaggattttc aatcaacatc 240 tgatatcgat tctgttggcg aacaaaccat cttagaaggt cttataccca acatggcatc 300 atctacctta ccgcctacta atgtaggaac tgcaactgtc acttcacttt tcggtaacaa 360 acgttaccaa attgcttctc acatgtcaaa ttcattattg aaaataagta tcactgaaaa 420 gctcaacgat accaactacc ctaactggtc atcagaggtt cgaaatggta ctaaatcatt 480 atcttttacc aatttcctcc tctcagataa ggatgaatca ggcatcaatg atgatctcaa 540 gattttcaat gatgtggttc gagaatccct cactacgtgg atgatatcca aactcgatca 600 accgaatcgt aattggtttg aaccttttat aactgtagaa actggcattg gtcctgatct 660 cgaatcttcc cctagcaaac tatggattaa gatcagagac catcacgctc cccgtactcc 720 agcacaccaa atgttaatga gaagaattct attccaactc aatcaaggat cttctactct 780 atcaaaacac ctagacaatt ttcatcgagc ctatacggct tttactacgt ctggtggaag 840 cattggtcaa gtagagttag gacaacaact tcttatgtcc ttgaattctg aatggatcaa 900 agtagctgag gatattgcgg ataaagatac gttcgaatat aattcagtgg tcactgctct 960 caatgttcgg attacgaatc gagaaatgtt gcatcctaac accaaaaact taccaattat 1020 tgaagccaac gcagttagtc ctatgagaag acgccaaggg tttgttcaga ctagatgttc 1080 tcccaagaaa tgtgtatctt taacacacac cgaagctgat tgtatcagaa accccaaaaa 1140 tattgataag tacaaagcgt gggttgcatc caagaaagca aaaggggagt gggttgagag 1200 aggaccgcca tcgacatcat cagtctcagt acaccaagcc gaagcaatca ctgttcctca 1260 gtcttttcct tcactatcat ctcttcaaga cgcattagaa cgatctgagt tttccgcgag 1320 cgctagtgcc gtgtatatcg tcaattcaaa tgcagacatc gaaaacgagc acttcggaat 1380 agttgattcc ggaactacac atcatatgtt caaatcaatc aaatactttg atgttaaatc 1440 cttcattgct agctcatctc ccagcgaacg agttggattg gctggcggtt caactacgct 1500 tgaaattcaa ggaagaggaa acgtcacctt aataggacct tccggtgatc ttacaacatt 1560 gaccgactgt cttcttgtcc catctttgaa acaaaactta attgctggtg gtagactttt 1620 atatgaagga tggatcacta ctcgtcgcga taatggttca ttcgttattc aaaagaatgg 1680 aaaaacggca cttaccggaa aggttgatcc aacgtctatg ctactccacg tcaatgcggt 1740 tcgtacatta tcatctgaac ccagtcctag ctctgccgca ataaccgaca acatcactga 1800 aacactccaa cttcttcatc atcaactagg tcacccgaat cttgcatatt taaagcaaat 1860 gatcaagggg 
gatatagtgt tgggaattcc tgtttcttta tcaaaagttg tattgccaca 1920 aacactacca tgcaattcat gtgacttagc aaaagctcat tgtcaaccgc actctcaaac 1980 tcgtactaga tctctgcatc cattggataa tgtccacata gacctaagtg gaataatgcg 2040 tacacctgcc ctatgtcgaa gcctctattt tatccttttc actgatgatc atacatcatt 2100 ttgtcacatt actggcctta aatcaaaaga gaaggaagaa gttcatgatg ctattcaaac 2160 atatctttcc ctagttgaac gtcaatgtga tcgaaaggtg aaatgtctta cccttgatgg 2220 tggcggtgaa tttcttaacg acgtaatgtt accatggtgt caaagtcaag gtatatacct 2280 tagaattacc gctgcttaca ctcctgaaga aaacggtgtt gcagaacgat ccatgcgcac 2340 catagtctcc aaaggtagag caatgctact ttgagcaaac ttaccaattc ggttctggat 2400 ggaggcggtg aagaatgcag tcttcttaaa caaccgcacc atgaccacga ctattccgaa 2460 gaacaaaacc ccttttgaaa tgtggtatgg aagaaaacca gatatatccc atatccgggc 2520 ttttgggtgt cttgcctatg tgctcatcag aaaacctgac cgcgataaca aatacgcacc 2580 aaattctgaa cacggtgttt tagcaggtca taccgaacac aatagaaact acaagatttt 2640 tacgcttaaa gattctaaga ttgttatcac acatgatgct tcttttcgag aagatgtcta 2700 tcctttcaca caacttccta cttttgacat atctcatttg acagcagatg aagaaaacgt 2760 cgatcaactc gtcttacaaa ctgaaccacc tgtattgcct gctactgaac cagacaacga 2820 tatcgatgaa caactagtca ctgtagcaga cgttcccatt gaacccttac ccgatgttct 2880 tcctcaacct tcactcccac gtcgttcaac tagggaggtg cggccggtag accgttttat 2940 tcctaattct aatttcgtct tctggtataa caacaaactt tatgatgatg tattcatacc 3000 cagaggggac gtcagcgctt acgcattcgc cgtcaaatca aatgttcgcc taataactga 3060 acctgggtca tacaagcaag ccatgaaaag tactgattca gctgagtgga agaaggcgtg 3120 tgataaggaa attgaaaaca tgactcgtaa aaacgtctgg accatcgtcg accgtccgct 3180 ggatgcaccg gtggtaggtt gtcaatggca cttcaaaatc aaacaccggc ctgatggatc 3240 catcatcaaa tacaaagcac gtcttgtagc aaaaggattt actcaaacat ttggtgttga 3300 ctacgataaa acttatgcgc caactggtaa acccgcctcg tatcgagtac ttgtagcaat 3360 tgcatcttat tttggttgga gcattcattc tatggatgcg gtggccgctt tcttgaattc 3420 attgttaaaa gagaaggttt acatggagca accggaaggt tatgaaagtc ttggtagcga 3480 agatcacgtc tgtgaactat tgcgtgcgct ctacgggctc aagcagtcgg cccatgaatg 3540 gaacgatgag tttaaaacta 
tgctgttcac cattgacttt gttcaggctg agggtgatga 3600 gtgcgtattc attagacgtc gctcacagtt tgatattatc attttttact tgcacgtcaa 3660 cgacatggca gttactggca ccgaccaatc cattcccggt ttcaagaaag aagttatggc 3720 tttctgggaa atggatgacc tcggtgaggc gacttgcgtg gttggaatcg aaaccctgcg 3780 tgtcagcaaa caccactacg ctatccacca acggtcaatg acggaggcac tcctattgca 3840 attcggccta tcagactgca agccggcctc aacacccatt caaggaggct taaaactcac 3900 cacctcaact agcgaggagg ctgcttcttt cgccaaactc aaccgaccct atcgatccgg 3960 agtcggtagc ttaatgtacc tggcacaatg cacacgcccc gacatcgcgt acgccgtggg 4020 tgttttatca cgtcatctgg ataccccttg cgagcgccac tgggatgcct tgcaacacgt 4080 catgcggtac ttgcgaggta cgcttagcct tggcattcat tatcacgccg gggacaccac 4140 cattttcaag atgcaagcta gctggaacgt acctatcaca aatgtcgact ctgactgggc 4200 tggcgataaa agttcaagac gttctactac aggttacttg acaactatct ttggtggggc 4260 catatcttgg aggtctcgtt tgcaacaaac agttgcactt tcttcaacag aagcggagta 4320 ccgcgccaca actgaagctg gtcaagaaac tcaatggtta cgaaacttat ttcgcgatgt 4380 tggtcttgag tactctggcc caatcaacat taactgcgac aacttgggag cattggatct 4440 ggcatctaac gcggtgcacc atggaaggac caagcacatc gaaattgaac atcattggat 4500 acgtgaacag gtcaatcagg gtcaaatcaa tttagtttat tgcaagtcct ccgagatgac 4560 agcggacttg ctaacaaaac ctttgcatcc tgggccgttt tgggagcata tgaagggggt 4620 aggattaaga agatgtgtct aatttgtgtc ttgattgagg gggtg 4665 // ID Copia-3_PPM-LTR repbase; DNA; FNG; 414 BP. XX AC ABWF01002028; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_PPM_; KW Copia-3_PPM-I; Copia-3_PPM-LTR. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-414 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01002028; Positions 71984 72397. 
XX SQ Sequence 414 BP; 82 A; 114 C; 85 G; 133 T; 0 other; tgtcgcaatc tgagatgtga tttcttgtac ggcgtgttta caatacacgc atggtcgact 60 tccgtacgta tttcttattc tattgttttc ttgtgttctt gttaccttat catgtatctt 120 gaatctcgat tacgatacaa gatcactcta cggtacataa ggctacctgt cgaatgagga 180 atgctccaga gtcttacacc tggagcatcc ctcgtatgtc ccattacttc tgttcacatg 240 tacttatact tacgagccgt gcagtgacct ctgtgtcgct gcaggtatga atacagagtc 300 ttacacctgg agcatccctc tgacctctgt gtcgctgcag ctcctgctct ccggtcttgt 360 cacctcaccc agccgcgcca ctgcgtttac ggcgttagct gtaccctagc gcca 414 // ID BOTY_I repbase; DNA; FNG; 5464 BP. XX AC . XX DT 17-APR-2011 (Rel. 16.04, Created) DT 17-APR-2011 (Rel. 16.04, Last updated, Version 2) XX DE Botryotinia fuckeliana gypsy-type retrotransposon: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Long terminal repeat; Gypsy superfamily; retrotransposon; KW BOTY_LTR; BOTY_I. XX OS Botryotinia fuckeliana OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Helotiales; Sclerotiniaceae; Botryotinia. XX RN [1] RP 1-764 RA Diolez A., Marches F., Fortini D. and Brygoo Y.; RT "Boty, a long-terminal-repeat retroelement in the phytopathogenic RT fungus Botrytis cinerea."; RL Appl. Environ. Microbiol 61(1), 103-108 (1995). XX RN [2] RP 1-5464 RA Jurka J.; RT "Consensus sequence of the internal portion."; RL Repbase Reports 11(4), 1129-1129 (2011). XX DR [2] (Consensus) XX CC Positions [3754-4251] - Integrase core. CC LTRs are 93% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 52..4905 FT /product="BOTY_I_1p" FT /translation="MASRATATGQSTGDTNDIEMTDAPKEITINETLKIAL FT PDKYQGSRQELDTFLLQLEIYFRFNEDKFTTKESKSIWAASYLRGEATKWI FT QPYLRDYFEHDDKDRMQPTRTIFNSFEGFKTEIRRIFGNSNELEVAEDKIF FT NLKQTGSALKYATEFRRYAGTTKWDEIAIMSHYRKGLKPEVRLELERSAES FT TDLNDLIQDSIESDDRLYRYRQSQRSYKPQGNQKQGRYRKNEGRPRYNPQR FT YGDPMELDATHYTNGNDDSEKRRRRENNLCFECGKAGHRAADCRSKKTGGK FT RGNFKPKFGKGQLNATFTIPENPTKSENTETFTVEEFQQLLKELPRNQEGM FT NAIDLWEQEYYRTPTPSVTEESHQDEAEADHATMSWTACYDEFCGIHRSDK FT EATGWFPKKRKTKNHQNNVTCEDLTPNITSQEVRKVTQQLNATGQAGQIYC FT KVQINGHIQSAMIDSGATGNFIAPEAAKYLEIPLQTKQHPYRLQLVDGQLA FT GSDGKISQETIPVRMGITQHTEVIQLDVVPLGQQQIILGMPWLKAHNPKID FT WAQGVVTFDQCKSGHRDTLEASARRNTRQGELNANNTGDVGHPVQGPPLRA FT KASTPPLQMQKPTTRHEIAIEAKERPTIPEQYKKYEHVFKEPGIHEALPEH FT KPWDHEIILEEGKMPVHTPIYSMSADELKRLREYIDDNLAKGWIRESASQV FT ASPTMWVPKKDGPDRLVVDYRKLNALTKKDRYPLPLATELRDRLGGATIFT FT KMDLRNGYHLIRMKEGEEWKTAFKTRYGLYEYQVMPFGLTNAPATFMRLMN FT NVLSQYLDTCCICYLDDILVYSNNKVQHIKDVSNILESLSKADLLCKPSKC FT EFHVTETEFLGFTVSSQGLKMSKDKVKAVLEWKQPTTIKEVQSFLGFVNFY FT RRFIKGYSGITTPLTTLTRKDQGSFEWTAKAQESFDTLKQAVAEEPILLTF FT DPEKEIIVETDSSDFAIGAVLSQPGQNGKYQPIAFYSRKLSPAELNYEIYD FT KELLAIVDAFREWRVYLEGSKYTVQVYTDHKNLVYFTTTKQLNRRQVRWSE FT TMANYNFRISYVKGSENARADALSRKPEYQENKTYESYAIFKKDGESLVYN FT APQLAATHLLEDNHLRKQIQSHYDKDATATRIRKTIEPGFTIENDTIYFHG FT KVYIPSQMTKEFVTEQHGLPAHGHQGIARTFARIREISYFPRMRTIVEEVV FT GNCDTCIRNKSSRHAPYGQLQTPDMPSQPWKSITWDFVVKLPLSKDPTTGI FT EYDAILNIVDRLTKFAYMIPFKETWDAEQLAYVFLRIIVSIHGVPDEIISD FT RDKLFTSKFWTTLLALMGIKRKLSTSFHPQTDGQTERTNQTMEAYLRCYVN FT YRQDNWVELLPMAQFAYNTSETETTKITPARANFGFNPQAYKIPIPQEVNA FT ESAIVQVEQLKDLQEQLALDLRFISSRTAAYYNTKRSMEPTLKEGDKVYLL FT RRNIETKRPSNKLDHRKLGPFKIDKVIGTVNYRLKLPDTMNIHPVFHISLL FT EPAPPGAPNAPFTEIEPVNPNAIYDVETILDCKYVRNKVKYLIKWLDYPHS FT ENTWELKEDLSCPEKLRAFHLKYPHLPTKPQAPHQTTKATKDRRNRRKKNH FT " XX SQ Sequence 5464 BP; 1895 A; 1276 C; 1187 G; 1106 T; 0 other; ctttgagcac catttctgct tcaagtaccg attcgataac caaccgctaa tatggcatcc 60 agagctaccg ccacaggtca 
gtctaccgga gataccaacg acatcgagat gaccgatgcc 120 ccaaaggaga tcactatcaa cgaaaccctt aagatcgcct taccagacaa gtaccaaggt 180 agtcgacaag agctcgatac tttcctctta caacttgaga tctacttccg attcaatgag 240 gacaagttca ctaccaagga atccaagagc atatgggccg catcatacct tcgaggtgaa 300 gcaaccaaat ggattcaacc atatttgcgc gactatttcg agcatgacga taaggatcgc 360 atgcaaccca cccgaacaat cttcaatagt tttgaaggat ttaagacaga gattcgtaga 420 atcttcggaa attccaacga gttagaggta gcggaagata agatcttcaa cctcaagcag 480 acaggatcag cattgaaata tgctacggaa tttcgaagat atgctggaac aaccaagtgg 540 gacgaaatcg ctatcatgag tcactaccgc aagggactca aaccagaagt cagactggaa 600 ttagaaagat ctgccgagag tacagatctg aacgatctaa ttcaggactc catcgaatca 660 gatgatcgtc tctacagata tcgacaaagc cagagatcat acaaacccca aggaaatcag 720 aagcaagggc gttaccgcaa gaatgagggt agaccacgtt acaatccaca gagatacgga 780 gaccccatgg aactagacgc tacgcactac acaaacggga acgatgactc agaaaagaga 840 cgaagacgag aaaacaactt atgctttgaa tgtggaaaag cagggcaccg agcagcagac 900 tgccgaagca agaagacagg aggaaaaagg ggcaacttca aacctaagtt cggcaaaggc 960 caacttaacg ctacctttac aatcccagaa aacccaacta aatccgaaaa tactgagact 1020 ttcaccgttg aggaattcca gcaattacta aaggaattac cacgaaatca agagggcatg 1080 aatgcaatag acttatggga acaagagtat tacagaaccc caacaccctc tgtgacagaa 1140 gaaagtcacc aggacgaggc agaagcggac cacgccacga tgagctggac agcttgctat 1200 gatgaattct gcggaatcca tcgatcagat aaagaagcaa ccggatggtt ccccaagaaa 1260 aggaagacga agaaccatca gaataatgta acatgcgagg atttaactcc caatataact 1320 tcgcaagaag ttcgcaaagt tacccagcag ttgaatgcta cgggacaggc aggacagata 1380 tactgcaagg ttcagataaa tggacacata caatcagcca tgatagattc aggggctaca 1440 ggaaatttta ttgcaccaga agctgcaaag tacttggaaa taccacttca gacgaaacaa 1500 cacccctatc gattgcagtt agttgacgga cagctagcag ggtctgacgg aaagatttcg 1560 caggagacaa tcccagtacg aatgggcata acccaacata cagaggttat acagcttgac 1620 gttgtgccat tgggccaaca acagatcatc ttaggaatgc catggttaaa ggcacataat 1680 ccgaaaatag attgggcaca aggagttgtg acatttgatc agtgcaaaag cggtcacagg 1740 gacacgctag aggcgtccgc gagacgtaac acgcgccaag 
gagagttgaa cgcgaacaac 1800 accggcgacg taggacaccc agtccagggt cctccattaa gagcgaaggc cagtacacct 1860 cctctacaaa tgcagaagcc aacgacacgg cacgaaatcg caatcgaggc aaaagaaagg 1920 cctacgatac cagaacagta caagaaatat gaacatgttt tcaaagaacc agggatccat 1980 gaggctttac cggaacacaa gccatgggat catgagataa tattggagga aggcaagatg 2040 cctgtgcaca ccccaattta ttcaatgtca gccgatgagt taaaaaggct cagagaatac 2100 atcgacgaca atttagccaa gggatggatc agggaatccg cgtcccaagt ggccagtcca 2160 actatgtggg tacccaagaa ggatggaccc gatagactag ttgtagacta tagaaagctt 2220 aacgcactca ctaagaagga tcgatatcca cttccattag ctacggaatt aagagatcga 2280 ttaggcggag ctacgatatt caccaagatg gacctacgta atggttacca cttgatcaga 2340 atgaaggaag gcgaagaatg gaaaaccgct ttcaaaacaa gatacgggct atacgagtac 2400 caagttatgc cattcgggct aaccaacgca ccagctactt tcatgaggct tatgaacaat 2460 gtgttgtcac aatatttgga tacttgctgt atatgctact tggacgacat cctagtatat 2520 tcaaacaaca aggttcaaca cattaaggac gttagcaaca tcctcgaaag cctatccaag 2580 gcagacttgc tgtgcaaacc aagcaaatgc gaattccatg tcacagagac agaattcttg 2640 ggattcaccg tatcaagcca agggctcaag atgagcaaag acaaggttaa ggcagtgctc 2700 gaatggaagc agccgaccac aatcaaggaa gtacaatcct ttctagggtt cgtcaacttc 2760 tacagaagat ttatcaaggg ttattcaggg attactacac ccttgaccac gttaaccaga 2820 aaagatcaag gaagcttcga atggactgcc aaagcacagg agtcattcga tacgctcaaa 2880 caagcagtgg cagaagagcc aatactgttg acttttgacc cagagaaaga aatcatagtg 2940 gagacggact cctcggattt cgctatagga gcagttctga gccaaccggg ccagaatgga 3000 aaataccagc caatcgcatt ctactcccga aaactatcac cagccgagtt gaattacgag 3060 atatatgaca aagaattact ggcgatagtc gatgcattta gagaatggcg agtatatttg 3120 gaaggatcga aatacacagt acaggtgtat acagatcata agaacttggt ttacttcacc 3180 acaacgaagc agttaaacag acgacaggtc agatggtcgg agaccatggc caactacaat 3240 tttagaattt catatgtcaa aggatcagaa aacgctagag ccgacgctct tagccgaaaa 3300 ccagaatatc aagaaaacaa aacgtacgag tcatacgcta tattcaagaa agacggcgaa 3360 tcactggttt acaatgcacc acagcttgca gcaacacacc tgttggaaga caaccacctc 3420 aggaaacaga tccaatcaca ctacgacaag gatgctactg ccacacgcat 
acgcaagaca 3480 atagaaccag gattcactat agaaaatgat accatatact ttcatggaaa agtatacatt 3540 ccgagtcaaa tgaccaagga atttgtgacg gaacaacatg ggttgccggc acatggacac 3600 caaggaattg caaggacatt tgcaagaata cgggaaatca gttacttccc acgaatgaga 3660 acgatagttg aagaagttgt tggaaattgt gacacctgca tacgaaacaa gtcatcacga 3720 catgctccgt atggtcagct ccagacccca gacatgcctt ctcagccatg gaagtccatc 3780 acatgggact ttgtggtcaa actaccactc tcaaaggatc ctactacagg aattgagtac 3840 gacgcgatac tcaatatagt agacaggcta acgaaatttg catatatgat accattcaag 3900 gaaacatggg atgctgagca actagcatat gtgttcctaa ggatcatagt aagcatacac 3960 ggagtaccag atgagataat ctcggatcga gacaagctct ttacctcgaa attctggact 4020 accttattag cacttatggg tatcaagaga aagctatcga catctttcca cccacaaaca 4080 gatggtcaaa cagagaggac caatcagaca atggaagcat atcttagatg ctatgtaaat 4140 tatcgacaag acaattgggt agagctatta cccatggcac agttcgcata caatacatcg 4200 gaaacggaaa ccacgaaaat cacaccagca cgagctaatt ttgggtttaa tccacaagcg 4260 tataaaatcc cgataccaca agaagttaat gccgaatcag cgatagtaca agtcgaacag 4320 ctgaaagatc tccaagagca actggctctt gatctaagat tcatatcttc cagaacagca 4380 gcgtactaca atacgaaacg tagtatggaa cctacgctta aagaggggga taaagtttat 4440 ttgctacgac gaaacatcga aaccaagaga ccaagcaata aactcgacca caggaaacta 4500 ggaccattca agattgataa ggtaatagga acggttaatt atcgattgaa attaccagac 4560 acaatgaata tccacccagt attccacata tccttgctcg aaccagcacc accaggagcg 4620 ccaaatgcgc catttacaga aattgaacca gtcaacccaa acgccatata cgatgtcgaa 4680 acaatactag actgcaaata cgtcagaaac aaggtcaagt atttgatcaa atggttagac 4740 tacccacatt cagaaaacac atgggaactc aaggaagatc tcagctgccc tgagaaacta 4800 cgggcattcc acctgaagta cccacatctg ccaacaaagc ctcaagctcc gcatcagaca 4860 acgaaggcaa cgaaggatcg aagaaatcga aggaagaaga accactagga gtaggcgcag 4920 caacagcggt tgctgtttct ttctccacac gctccttttc caacttttcc ttttccaatc 4980 tttcattctc ctcagctaga tcaagctcat ctagagtacg aagaccacga cgaatcatct 5040 ccttttcacg ctcgcgcaga aatcgttgtt gcttgcgcaa tcgaagaatc cgggcttaat 5100 ctcctgcgac tagccatagc ctcttcctcc tcctagcgca gccgctcctg ctgtcgatcc 
5160 aaagattccc agtcgaggca tctctgaccc aggaccgcaa cgaccttttc ttgacggata 5220 cactcggcac agcgcggaca aactatcgtc aacaacgcaa cgacgattgg atttacgcac 5280 aaagagcaag aaccatttcg ataccagaac gctcgacgcg gcgaaccaag ctaaggatat 5340 ttgacttaag ttgttcgtgt aggaggcata ttgaagaagg aaacatcaca atcattaagt 5400 aaacgaaagg gaagacgtac taatataaac gcttgggaca agcgctaggc taagaagggg 5460 atag 5464 // ID Gypsy-43_MLP-I repbase; DNA; FNG; 11229 BP. XX AC AECX01002299; XX DT 18-MAY-2011 (Rel. 16.04, Created) DT 18-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-43_MLP_; KW Gypsy-43_MLP-LTR; Gypsy-43_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-11229 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002299; Positions 22477 11249. XX CC Positions [10031-10510] - Integrase core CC 'TTCAG' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 257..1309 FT /product="Gypsy-43_MLP-I_2p" FT /note="gag." FT /translation="MSEETPTLTLLMQQVADLYIRLTNETNLRQAAELSQQ FT QADLRLSQLEANLAANQPVNPPTAQQNYVPTPHAPKTPKVATPDKFSGSVG FT LLAEGFISQISLYILTNHAMFTTDKSKVTFALSYPSGHALRWAQPTLGLVL FT NPTTADSVTFESFVKAFKGVLFDPMRKTRAEKELRALKQTGTVAEYSVKFN FT QWAAATEWDIPVLVSQYKQGLKLEVRTGMLQKSFKALEEITNLAAEIDSEI FT HGERLNVQTSTTSRPSPARDPDAMDLSATRFPISNEEYKRRMEDGKCFKCG FT GTNHYARNCGKFRKWKGKSGGRVAELEARLAEFEGRSSSNSGNSLGEGSEA FT DKSKNGDA" FT CDS 1342..2526 FT /product="Gypsy-43_MLP-I_1p" FT /note="retropepsin." 
FT /translation="MKRSEDLIMVDAGKIICNSNDPRLFAPITLSNPSCPT FT SKIDATALIDCGSTHKVLGENYAQEHRIRTETLPEAKNISGFDGKNRKITE FT DTRLIIDDDKEPSRFIVTKLKDSYDAILGMPWIRKHGNKIDWRTGKFKEVT FT DVAAIETALSDPITPQVKKDTVVGEARNCDEGVCTHSSTLAPPQCEYGSPL FT SILNLEAAGKCIPPLNNLEPRIVDAAQASRVSQDPKTPSKDLKEVERKQAR FT QCDLGVCIHNDGTLTPPQCEFNAFSNPIHKEERTGKLAHLQNNCRTSISAA FT KASWSTSARIAADTKASNPEKTAIELVPTCYHKYLHLFTKSKAMTLPPRRK FT YDFRVKLLPGATPQASRIIPLSPAENDVLEEMIKEGLANKTIRLGRPCSIH FT R" FT CDS 4434..3358 FT /product="Gypsy-43_MLP-I_3p" FT /note="tyrosine recombinase." FT /translation="MPPYRQTSQPTRHNPIRLPKLPGHFDDAFRNASNAGA FT HMAKSSWAETTLKGYSSGLSKFIEYVESTGKAFDERLRIQPNDIYNFITWA FT GRSGAVSEVQPKRKIASKTIDKYIDGIQVWHLVKHVRKIKLDKAIVKLLLK FT ATRKEEEGELLTTHKKSVTVQQLFKLLDYCEGRSEHHEFVAAIALIAFWGL FT MRLGEIFRMKIEDGAILRHHVEFGRSELGDWVKIHLRKGKTAAPGEFQIIH FT LHEQPSVLDPVAAMRRLMQMNATEDRDEFLFKTKSGILKKRKFLKTLEEAW FT GSSSTQTWSGHSFRVGGASLRANLGTKERRLKKLGRWKSDSHKRYVKLFTL FT KELVDTKEFLTAIKLR" FT CDS 8807..11131 FT /product="Gypsy-43_MLP-I_5p" FT /note="ribonuclease H, integrase and chromodomain." 
FT /translation="MDPAKVKAVKEWPAPKSVTELQRFVGFANFYRRFISQ FT FSKTARPLHELTCKDAKFEWNKKRDKVFKDLKEAFTTAPILRIADPYKPFV FT LECDCSDYALGAVLSQVADDGLLHPVAYLSRLLIAAEKNYKIFDKELLAVV FT ASFKEWRQYLEGNPNCLDVIVYTDHKNLESFMPSKQLTRRQARWAETLGSF FT DSQIRFRPGREATRPGALSRRPDLAPPSDEKLTFRQLLKPENIVDNTFLGQ FT IDSFEAHFEDETIVNENSEEWFKIDVLDLDSERDTSIKSDVSILKSIRNAS FT ATDNSVKELIQVVERPLSNQMKNLVADYSTQEGILYHKDCVVVPADNNIKT FT EIVRSRHDSPMAGHPGRAKTLSLVKRHFRWKGMKAFVNRYVDGCDSCQRVK FT TSNQKPFGMLKPLPIPAGPWTGISYDLITELPESNGKTCILMVVDRLTKMA FT HFIACNNTTSAEDLAELMLRNVWRLHGTPKTIISDRGGIFISQLTQSLYKK FT LGIKSTPSTAYHPETDGQSEIVNKAVEGYLRHFTSYKRDNWEPLLAMAEFA FT YNNSTHSLTGLSPFKANYGYDPTYTTIPLSEQCIPAVKARIQQINKVQSEI FT TESLKSAQELMKYYHDKNRRTNPEWKVGAKVWSSSKNVSTTRPSAKMDHRW FT LGPFKIIKCVSEVAYKIELPLSLHRLHPVFHVSQLREYKADTIKEQIQLPP FT PPIELQGEQEYEVEAVLDGRMRGKKEEYLISWRGYGREHNSWEPVTNLGNA FT KQLVDKFKIKYPEAMKTHRRRRRM" FT CDS 6527..4773 FT /product="Gypsy-43_MLP-I_4p" FT /translation="MNWCHRVTRSPIPNETNSPIPNETNEQSHEIIFNRRG FT ENVRSWPGKPSYDMNLDVWTEELRKVNLLEKYRHILKMFKKGFTQGIVNAD FT IPNKRFYCPPNHTSAQQAKQKIEDNFKTELEAGRMYGPWTQEEVYKKIGFF FT RTNPLGAVINGDGSFRPISDLSYPRHLRNIHSVNANVDKEEFATSWDDFKK FT VAVFFIEYKGKLLLGIFDWAKAYRQIPVAVDQWRFLMVLDFNNEVLLDTRV FT QFGGVAGCGTFGWAAEAWKELMVHKFKMLAGFRWVDDTLFVKPVEDTQAGS FT MADVVEASGRMGVVTNEGKWREFSFKQKYLGFNWDGENKTVGLPDKKLKLR FT ITQIEEYSKQNRQSFKETGKMVGRLQHTAYVFPQMKVYCAAMYRWLQQWKH FT KYALRDIPDEVSEDLQIWKETLMTANPLRVIPKPETRTIGWVGDASTSYGL FT GIIVGKKWSRFKLKDDWDKTDKEGLKRGIAWAETVAIRLGWIMVKSLKNVE FT GNTYLALTDNTVSEGAINNGKSRDTLVNRQWKLIQADLLEHNSAIEAKRVR FT SKDNAADALSRGLDSTKRKQDEIKVKIPGDLVEFLYQT" XX SQ Sequence 11229 BP; 3009 A; 2583 C; 2386 G; 3251 T; 0 other; tattgtggta tcttcaacta ggagaaatcc acaaacttaa tccgaagaag tagattgaag 60 tagattcaaa cttaatctaa taataaaaga agacgataca gaagaaggaa actttaaaac 120 ttaattataa ttgaaacctc ttcggaagat cattaaagtt aaagaaaatt cagaagaaga 180 cgccaaaata ccgacattca cggcaagaga agaagaaccg gacagcgaag attttgaatc 240 agcggctagt ctagagatgt cagaagaaac gccaaccctc acgttgctca tgcagcaagt 300 cgctgatctg tatatccggc ttaccaacga 
aacaaatttg cgtcaagctg ccgaactcag 360 ccaacaacaa gctgatttaa gactgtctca actcgaggct aacttggccg cgaaccaacc 420 tgttaacccg cctaccgctc aacagaacta cgtgccaacc cctcatgccc ctaaaacccc 480 taaagtagct acaccggata agttttctgg ctcggtagga ctgttagcag aaggattcat 540 cagccaaatt agtctgtaca tacttacaaa tcacgccatg tttaccaccg acaaatctaa 600 agtaacgttc gcactgtcat atccatctgg ccatgcttta agatgggctc aacctaccct 660 aggactcgta cttaacccga ctacagcgga ctctgtcact tttgagagtt ttgttaaagc 720 gttcaaggga gttctctttg accctatgag aaagacccga gctgaaaaag aattacgcgc 780 cttgaaacaa acgggaaccg tcgccgaata ctctgttaag tttaatcagt gggcagcagc 840 aactgaatgg gacataccgg tacttgtcag tcagtacaaa caaggattga aattagaggt 900 cagaacggga atgttacaga aatctttcaa agctttagaa gagatcacaa acctggcagc 960 tgagatcgat agcgaaatcc acggtgaaag actcaacgta cagacctcga ccacatcccg 1020 acccagtccc gctagagacc ccgatgccat ggatctatca gctactcggt tccctatttc 1080 aaacgaagag tataaaagga ggatggagga tggaaagtgt tttaaatgtg gtggaacaaa 1140 tcattatgct aggaactgtg gtaaattcag gaaatggaag gggaagagcg gaggaagggt 1200 ggctgaatta gaggcaaggt tagctgaatt tgagggtaga agtagtagta atagtggtaa 1260 ctctttagga gaaggaagtg aagctgataa gtcaaaaaat ggagacgctt gagcaggaag 1320 gatgtgtccc actcgagcga gatgaagagg agtgaagatt tgataatggt tgatgctgga 1380 aaaatcatat gcaattcaaa tgacccacga ttgtttgcac ccataacctt gtccaatccc 1440 tcgtgcccca catccaaaat tgacgctaca gccctgatag actgtggttc tactcacaaa 1500 gtcttaggag agaactatgc tcaagagcat agaatcagaa ctgaaaccct acctgaagca 1560 aagaacatat caggttttga cggaaagaac cgaaaaatta ctgaagacac aagattgatt 1620 attgatgatg ataaggaacc ctcaagattc attgtcacta agttgaaaga ttcatatgac 1680 gctatactag gcatgccgtg gatcagaaag cacggaaata aaattgattg gaggacagga 1740 aagttcaagg aagttactga tgttgcagcc atagaaacgg ctttgtccga tccgataaca 1800 ccccaagtca aaaaggacac agtggttggg gaagctagga attgtgacga gggggtgtgt 1860 actcatagta gcacgctagc acccccgcaa tgtgagtatg gtagtccctt atcaatcctt 1920 aaccttgaag cagctggcaa gtgtattcct cctctgaata atttagaacc cagaatcgtg 1980 gatgcggcac aagcatcgag ggtatcacag gatccgaaaa caccttccaa 
ggacctgaaa 2040 gaggtggaac ggaagcaagc taggcaatgt gatctggggg tgtgtatcca taatgatggt 2100 acattaacgc ccccacaatg tgagttcaat gcatttagta atcctatcca taaagaagaa 2160 cggactggca agcttgcaca tctccaaaat aactgcagga catccatatc agcggctaaa 2220 gcgtcgtggt caacatcggc gcgcatcgca gcggatacaa aagcttccaa ccccgaaaag 2280 acggcaatag aactagtccc gacttgttat cacaaatatc tgcacttgtt cacgaaatcc 2340 aaagcaatga ctctacctcc acgcaggaaa tacgatttca gagtgaagtt actacctggc 2400 gcaaccccgc aagctagtcg aatcataccc ttgtcaccgg ctgaaaatga tgtactagaa 2460 gaaatgataa aagaaggtct agctaacaag actatcagac tgggccgccc ctgttctatt 2520 caccggtaaa aaggacggta aactcagacc ttgttttgac tacaggaagc ttaatgcatt 2580 gactgttaag aataaatacc cgctaccact caccatggaa ttagtggaca gtctcttaaa 2640 tgcggagaag tttactaaat tagacttgag gaacgcttac ggtaatctgc gggtagctga 2700 agaagacgaa gatatattag cattcatatg ttgagccggc caatttgcac ctctcacgat 2760 gccatttgga ccgacaggcg cacctggcca cttccagtat ttcatccaag atatactact 2820 gggacatata gggaaggaca caggagcttt tctggatgac ataatggtat atacaaaaga 2880 aggtgtcaat catgaagatg tagtagagga gatactggaa atcctaagca aacatcaatt 2940 atgggtgaaa cctgagaaat gtgaattttc aaaggcggaa gtcgagtatc cattctcttt 3000 ttcattaagg ctcagaacta tgttttgact attgtataca atagtgtcgc cttccggcga 3060 cacttagtga catttcacct tgcactgtcc ttgccttcgt ggtgaggaca atgctatgtg 3120 ggggaagcct gggcccgcct cttggttggt ttcccacaac caacttctat gagtgcgggc 3180 cgctttcagc tttcaccccg cccgccctcc aataggtgtc tggatgggtt gaccgacatc 3240 ccttctacca aggtgttcag tgggtaggtg taccgtcctt ggcctgtcgg tcaaccccat 3300 ctacggtgcg tctatctagg gctttgatta gcaagtgcgc ttctcaggtt cacgctatct 3360 aagttttatt gcagtcagga attcttttgt atctactaat tctttcagcg tgaaaagttt 3420 gacatatctc ttatgtgagt ctgacttcca tctccccaac ttcttcagcc gtctctcttt 3480 cgtgcctaaa ttcgctctta aggatgctcc tcctactcgg aaggaatgac ctgaccaagt 3540 ttgggtactt gaagaccccc aagcttcttc taacgttttg agaaattttc ttttcttcaa 3600 aattcctgat ttcgttttga acaagaactc atctcgatct tccgttgcgt tcatttgcat 3660 caatcttctc attgcagcta ccgggtctag tacagaaggt tgttcgtgga ggtgtataat 
3720 ttgaaattcg cctggtgctg ccgttttccc cttccttaaa tgaattttca cccagtctcc 3780 taattctgac cttccaaatt ctacgtggtg tctcaaaatt gctccgtctt ctattttcat 3840 cctgaatatc tctccaagtc tcattaatcc ccagaacgct attaacgcta ttgccgctac 3900 aaattcatga tgctccgatc ggccttcaca gtaatctaaa agtttgaata attgttgaac 3960 tgtaactgat ttcttatgtg tcgtgagtag ttctccttct tcttctttcc ttgtagcctt 4020 gagtaaaagc ttgactatcg ctttgtctag ttttatcttc ctgacgtgtt tcacgagatg 4080 ccacacttgg atgccgtcaa tatacttgtc gatcgttttc gacgcgatct tcctttttgg 4140 ttgaacttcg ctaacggctc ctgaacgtcc tgcccacgtg atgaaattat agatgtcgtt 4200 cggttggatt cttaaacgtt cgtcgaaagc cttccctgtt gattccacat attctataaa 4260 tttcgacaat cctgatgaat agcctttcaa ggttgtctct gcccagctac tcttcgccat 4320 gtgggctcct gcgttcgaag cattcctaaa ggcgtcgtcg aagtggccag gtagtttcgg 4380 taaacgtatt gggttgtgtc tggtgggttg actggtctgt cggtaaggag gcatgtgaga 4440 tgttttgtta aggatggttc ccgtatatgg gattcaagtt ccggtgggtc tacgccctct 4500 cccccgaccc gtctcgtcgg gtcggtttca tatctttgtt ggtatctcgc gaggtcttat 4560 ctagttctca cgactatcct tcgtactcga cgatcacttc gacatccccc ttactatctt 4620 tgtctttcat caccaccttc ctctgtgtat agtctttttg tcttactcgt ctatgtgatc 4680 catagaactg atctttatgc ggcgggcgct cactctttct aaggcaactc acttctgaca 4740 ctttccgtaa gtcatttcca atttacattt caagtttggt acaagaactc tactaaatca 4800 cctggtatct ttaccttgat ctcgtcttgt tttctctttg ttgagtctag gcctctcgac 4860 aaggcgtctg cggcgttgtc ttttgagcgt actcgcttcg cctctattgc gctattgtgt 4920 tccaataggt cagcctgaat cagcttccat tggcggttta ccaacgtatc tctcgacttg 4980 ccgttgttga tagcaccttc tgatacggtg ttgtctgtta atgcgaggta agtgtttcct 5040 tccacgttct ttaaactttt caccattatc catcccaacc taatcgctac tgtttctgcc 5100 cacgcaatcc cccgttttaa tccttctttg tcagttttgt cccagtcgtc tttcaactta 5160 aatctggacc acttcttccc aactataatg cccaatccat atgaggtcga agcatcgcca 5220 acccacccta tcgtacgagt ttcaggtttt gggatgactc gaagcggatt cgccgtcatt 5280 agcgtttctt tccatatttg taagtcttcc gacacttcat ctgggatgtc tcgtaaggcg 5340 tacttatgct tccattgctg taaccatcta tacatcgctg cgcagtagac tttcatttgc 5400 
gggaatacgt acgctgtgtg ttgtaatctt cctaccattt ttccggtttc tttaaaggat 5460 tgcctgtttt gcttagagta ctcttcaatt tgagtaattc ttaacttcaa cttcttgtca 5520 ggtaagccaa ccgtcttgtt ttctccgtcc caattgaatc cgagatattt ttgtttgaaa 5580 ctgaattctc tccactttcc ttcgtttgtg actactccca ttcttccgct tgcttctacg 5640 acgtcggcca tactacctgc ttgcgtgtcc tcaactggtt ttacgaacaa cgtatcatct 5700 acccaacgga aacctgccaa catcttaaac ttatgtacca tcaactcctt ccaagcctct 5760 gcagcccatc caaacgtccc acaacctgcg acgcccccga attggactct tgtgtccagt 5820 aacacttcat tgttgaaatc taacaccatc aagaatctcc attgatcaac tgccactggg 5880 atttgccgat atgctttggc ccagtcgaag atcccgagta acaatttccc tttatattcg 5940 atgaagaata ccgccacctt tttaaagtcg tcccaagagg tggcgaattc ttctttatct 6000 acgtttgcgt tcactgaatg aatgtttcta agatgtcttg ggtacgataa gtcactgatg 6060 ggtcggaatg atccatcgcc gtttatcaca gcgccaagtg gattcgtcct gaagaatcca 6120 attttcttgt acacttcctc ttgagtccac ggcccataca ttctccctgc ctccagttca 6180 gttttgaaat tgtcttctat cttctgtttc gcttgttgag ctgacgtgtg atttgggggg 6240 caatagaatc ttttgttagg tatgtctgca ttgacgatac cttgtgtaaa ccctttctta 6300 aacatcttta agatgtgtcg atatttttcc agtaagttta ctttccttaa ttcttctgtc 6360 cacacatcta aattcatgtc gtatgatggc ttccctggcc agcttcttac attttcccct 6420 cttcgattga agattatctc atgactctgc tcgttagtct cattaggaat cggtgaatta 6480 gtctcattag gaatcggtga acgtgttact ctatgacacc aattcatgat tcagtaggca 6540 tggcaagaat tttggcaaat agggtacggg gtggggcaat tttcacttgt tagctaccat 6600 ttaaggtaat acatttcatt tctatccttc tacacaaagt agcttacgaa cgcttccatt 6660 tcccgtccga tcttttctca taacgcctct cggggctctt actacgttct ctctttttct 6720 cctctcccca tctgggtcct cggttttcaa agtttgatct actgttgaaa ccggctctgc 6780 tttcgtatcc tccacctctc gacctacggt tccctccatt tccttggtac gtacttccac 6840 ttgcgtgtct gttgaacgtt ggattgttgt tgttggggtt gttgtagagg ttaccgtaag 6900 gtgagtggtt tgtgcttgta gatggcttac tcgtgattga ccttttgacc tggtttgtat 6960 aggggcatat gtgcgccttt ggaccacctg cggcgtatgg gttggttcgg taatgtcgct 7020 cgttatagct ttcagccttc gtcttagcct gtgctagcag gtgttcttgc agttccgccc 7080 agttcccgag 
ggcgctatcg atggtttccc tcattacccg cgaccatcca cccgaagtct 7140 ttcctcaact gctttacaat catcatatgg cttttgaacc tatctgctac cgggctccaa 7200 tcttgatctc gtaaatgtat gcagaacagt tccatacagt cgctccaaga ttcaaaatcc 7260 atggtgagtt cttctttcgg gccttctcct ccgtacattc tcttttctct ttctgacttc 7320 tcctctgact tggggggtaa caaggaccat gccatttggt ctcgaatcag gaattcttcg 7380 tcgaatactg gtaaaggtat gaacgacttc aggcctttta gcttaacggt taaatgaggg 7440 gataatggag ttatggtgtc tcccatcttc gccggaccgg atttgaagac tcttccgttg 7500 tgaatcatgt ctccgttaga caaggtgagg cttccttttt tcgaggtcat cattgcttta 7560 gccttgtctg actcttcttg gcttccacat cccgggtacc aatcttgtct tttcccgtct 7620 acgtcttcat taagtgtcgt ttcttggact accattacgc taggtgtgtt cggtatcttc 7680 ttctgcgcca ctaggaagga tctgacattt gcttcgtcat ctgaagattt cgatgagtcg 7740 tcgtcaacca cgtattggtt cattggatct tggtccatgt cctccttgtt cagacgatct 7800 gtcaggcgtt tagcgttttc ttcgtcattc cgatcgtaag cgtcttggat taattcgact 7860 aaagcttctc gtcgagcgtc tgttcaacca aaccacgttt ccgttagcgc ctattccgct 7920 ctttactgtg tttctatgct acatattgtg agcctgtgct ggctgggccc tattctttaa 7980 gccaggccag gctgaggtat ctattgtaag ccgaccctgg ctgggcacct atttgaagcc 8040 gagccaggct atttccagta tgcgtgacga tatagtcgat ttgttgcgaa tctatgtcga 8100 tttcgagggg aagagagaag gaatagagag atttaaatga agacagtgag tcttaccttt 8160 cttaattctt tctgttatac gatttaacgt cgtgccttct tccaacgttt catcttctaa 8220 ttcctcttca ttctcctctt ctacttcctt tattcctccc cattcctcac tttcttcttc 8280 ttcttcttca ttcctttctt ctacttgatt caaccttcgt cgtggtatgt gtgatctatt 8340 tccgtcactt tcttcttctt ctactctcct tctcgcttga ttcctcatcc taccatctcc 8400 tcttctaggt gttacaagcc tttcttctaa ttgagctcca gttgctgccg cttctgctct 8460 agtcaatcgt cgttgtatgt tatttgctgt tctatcgatt tgttgtctta gggccatttt 8520 ggtgtacttt aaattctaat aaaaattaag aaacgagttc tagcaaagag ttgactgagc 8580 gctggaaaaa atgaaaaagc ttctttcaaa gcgtgaaaac taagagtgtg aggagatgat 8640 gaaggttaag gttgaaatct agaatttcaa atctttacct gaggatctgt attgatctaa 8700 ctttcacacg ttagccttag tctttcacag gaatggacac ctacaccatt tcgtgcctta 8760 tggtgtatat tatctaggtt 
tacttatttc aaaaaacaag attcggatgg accctgcaaa 8820 agtcaaggca gtcaaagagt ggcctgcacc aaaatctgtg acggaattac aaagatttgt 8880 tggttttgct aatttttata gaagattcat cagtcagttc tcgaagacag caagaccatt 8940 acacgaactt acctgcaagg acgcgaaatt tgaatggaac aagaagagag acaaagtgtt 9000 caaagatctg aaagaggcat tcacaacggc tccaattcta cgcatcgccg atccttacaa 9060 gccattcgtg ctggaatgtg actgttcgga ctacgcactc ggagcggtat tatcgcaagt 9120 tgcggatgat ggacttctcc acccagtagc ttatttgtca cggttgttga tagcagcgga 9180 gaagaattac aaaatcttcg ataaggagct tttggcggta gtggcttcct ttaaggaatg 9240 gagacagtac ttagaaggaa atccaaattg cctggacgta attgtttata cggatcacaa 9300 gaacttagag agttttatgc caagcaagca gctgacgcgc cgacaagcaa gatgggcaga 9360 gacattaggg agtttcgatt cccagatcag atttagacca ggaagagagg caaccagacc 9420 cggcgctcta tcgagaaggc cagatttagc acctccctct gacgagaaac ttacattcag 9480 gcagctatta aaacctgaaa acatcgttga caatacattc ctaggacaaa tcgacagctt 9540 cgaagctcac tttgaagatg aaaccattgt gaacgagaac tcagaagaat ggttcaagat 9600 agacgtactg gacttggatt cagaaagaga cactagtatc aaatccgatg tcagcatcct 9660 caagtcaatc cgcaacgctt cagctactga taactcagta aaagaattga tacaagtggt 9720 cgagagacca ctatcaaatc agatgaagaa tctagtagca gactactcaa cacaagaagg 9780 tattttatac cacaaggact gtgtggtagt accagccgac aacaatatca aaactgaaat 9840 tgtaagaagc cgccacgata gcccaatggc tggtcacccg ggtcgtgcaa aaactctaag 9900 tcttgtcaag cggcatttca ggtggaaagg aatgaaagcg tttgtaaatc gttacgttga 9960 tggatgtgac tcatgtcaaa gggtcaagac ttcaaatcaa aaaccttttg gtatgttgaa 10020 accccttcct ataccggcgg gtccttggac aggcatctcg tatgacctca taacggagtt 10080 gccagagtca aacggcaaga cgtgcatact aatggttgta gacagactaa cgaagatggc 10140 gcacttcatt gcctgtaaca acaccacttc agccgaagat ctagcggaat tgatgctacg 10200 taatgtatgg agactgcacg ggacgccaaa gacaatcatc tcagacagag gtggtatatt 10260 catttcacaa ctgactcaat cactttacaa gaaactaggt attaaatcca caccatcgac 10320 agcgtatcac ccagaaactg acggacaatc agagattgtg aataaagccg tagaagggta 10380 tctgagacat ttcacaagtt ataaacggga caattgggaa cctttattag ctatggctga 10440 atttgcttac aataacagta 
cgcacagttt gacaggatta tccccgttta aggcaaatta 10500 cggctacgac ccaacttata cgactattcc attgtcagag caatgtatcc cggctgtcaa 10560 agcaagaatc caacagatta acaaagttca gagtgaaatc acagaaagcc ttaaatcagc 10620 acaagaattg atgaaatact atcatgacaa aaacagaagg acaaaccctg aatggaaggt 10680 gggagcaaag gtgtggtcaa gcagcaagaa cgtatccaca acacgaccaa gcgcgaaaat 10740 ggaccatcga tggctagggc ctttcaaaat tattaagtgt gtatcagagg tagcttacaa 10800 gattgaattg cctctcagtt tgcatagatt acacccagta ttccacgtat cccaactacg 10860 agagtacaaa gcggatacaa tcaaagaaca aattcaacta ccaccaccgc caatcgaatt 10920 acaaggagag caggagtatg aagtagaagc agtactagat gggaggatgc gtggcaagaa 10980 ggaggagtat ttaattagtt ggagagggta cggtagagag cacaattcgt gggaaccggt 11040 aacaaattta ggaaacgcaa aacaattagt agacaaattt aaaatcaagt atccagaagc 11100 aatgaagaca cacaggagaa ggagaaggat gtgagagcca agctttttcc ccaagtgggt 11160 tttttaatgc tggttcgtgg aggagtgcag ggcaatcatc acagagcctg ggcgttaaaa 11220 gggggatac 11229 // ID Gypsy-6_MLP-LTR repbase; DNA; FNG; 1021 BP. XX AC AECX01002027; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_MLP_; KW Gypsy-6_MLP-I; Gypsy-6_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-1021 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002027; Positions 15412 16432. 
XX SQ Sequence 1021 BP; 328 A; 208 C; 158 G; 327 T; 0 other; tgtcagccct gagtactaaa ccagtgtacc aatgggcgac aaatatatta aaaaccttaa 60 aaacagcaat tacaactcaa ggctcagttt caagagtcaa gttacatatg atttggcata 120 tgtaacatgt gcaaaactca catcaaagag tgtgtacgta cacaactttg atgtgaacca 180 aaactacata tatttcacag acgtaacaca tatgtaacaa gtgaaatata cttataaccc 240 aatagtaacc gtgttacttt aattaaccta accttagaac cacaatttgt gcgcacttgt 300 agtccaaaga ttctcaaggt tcagactata taaaccctga agatcacacc agcttttgaa 360 tcatcacatt aacattcaac aactttacaa attgctcatt tcatcttcat agaactttta 420 aactttgtct aacatctaca aactttaaaa gcctgattct acgaactttg aatcttcaac 480 cttgtgtctt gttgacttta ttgtgtgcgc atcaaaactt cgtttctatc tgattagact 540 ctacaatcag ttagttttgt aatatccatt tggtggaatt acgtataagg aagtagtcct 600 tgagtaaggc ttatatcaga tcagggattt atctgatata agaaactcta aatctacggg 660 gaacctctat tagtcagagt ttaaagttcc tggtgttaag atagacttta acacctcctt 720 tattgatcat tctttgtaca agaagtgact caatttaggc ttgagaactg cgcgtcagtg 780 gctctatttt aattgttgca aataagaaca attacccaaa atccctgggc gtgaaatcct 840 ttgttaaaag gtcatttaac tggagtcttt acttgtttat tacaagtaaa gcttctacgc 900 ccgcgtcttc acatactcca actcgtggat tgtaatcccc acgagtctag tagaagtttg 960 tttaaacttc tatctccttt ttattgactc tactagccca acagacgcta gtaggctcac 1020 a 1021 // ID Gypsy-18_MLP-I repbase; DNA; FNG; 5633 BP. XX AC AECX01000171; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_MLP_; KW Gypsy-18_MLP-LTR; Gypsy-18_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5633 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000171; Positions 124551 118919. 
XX CC Positions [4420-4899] - Integrase core CC 'AGCTC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1396..3507 FT /product="Gypsy-18_MLP-I_1p" FT /translation="MREIDTVSNHEIKGSDPRVFTSISLSKSHLATSHKPL FT TPPAQALIDCGSTHEVLCTTFVERSGLPVSKLKTTGDVYGFNGAPRTVAHD FT AELYIDNDEEKTRFLVTKIKDLYDAILGMPWLKKNGHRIDWKNGILRPKSE FT STQIAALNLATENHQSNTDLLEKHDHLATLEVVSSSPKNTPDRQCARKGKA FT WDSDKGVVSIASVKPPQGEFDNGPRPLLLATDDKPLPFRIQTLPAIPETSM FT PKEPTAPATEERIKPKVSPTPTNTPDNVSVRKGEARKIGEGALLLQEIKPP FT RGESDTPIPSVVEADHKLLHPGIRFDASIAAANASWNVSAKLAAEASKDKI FT EMSAEELVPTQYHRYISMFIKTKSMTLPPCQRYDFRVELVPGATPQASKMI FT PLSPAEETALNTMIDEGLSKGTLRRTTSPWAAPVLFTGKKDGNLRPCFDYR FT KLNALTVKNRYPLLLTRELIDSLQDAENYTSLDMRNGYNNLRVWEGDEKKL FT AFICKRGQFEPLVMPFGPTGAPGYFQFFISDIFRDRIGKDLAAYLDELLIY FT TPAGVDHESVVEEVLKILESHSIWLKPKKCKFSRKEIDYLGLLISKNKVRM FT DPLKVSAVTDWPAPANVTQVQQFLWFAKFYRRFIKNVSKVTRPLHDLTCEE FT KLFVWTKSREDVFQTLKTSFTTAPILRIVDPYKPFILECDCSDYALGVVLS FT QNDD" XX SQ Sequence 5633 BP; 1720 A; 1322 C; 1376 G; 1215 T; 0 other; tattgtagca tctaactcaa gacagaatcc agaggaagag ttatacaaga agaagaaagc 60 aagaaaagaa agaagaaaga aaatttttaa ggaaagaaaa aagaagacac aaaaagacat 120 tagaagaaga agaaattaaa agaaaagaaa gacgaaagaa gacaaaagta tatatataga 180 tagtcgcatt agttcattga cgtaatcccc aaatcccaat catcaccacc acgccttcat 240 tcgaagtacc atcagacgac ctcacgtcgt tagaagacga ttcctggtac gatcccctac 300 tagacaaacc cgtacaacca tcagcatcaa tggcaactgc cgaagacatg gagcgaatct 360 caaggcaaat ggcggacctg aacgccagac tcaccgagga aaccgctctt tgtcaggctg 420 ccgaagcacg actaaatcag atcgaggcag cccgagcacc accaacaaac cccggtcctg 480 cacccaccac caccgccacc cctatggttg ctgctgtacc accaccaccg gtgaccagca 540 atacttgcat tcaagtcgcc cgtcctgaac gatttaccgg cacgcgagga gcggccgcag 600 aggcctttgc gagtcaagtc ggactctacg ttgtggtaaa cgccagcctc tttacctcgg 660 acaccacaaa agtcatgttc gcgttgtcat acctaggagg cgatgctatc cagtgggcgc 720 agcctttcct tcgacgggcg ctacaccctg ctgaatcgga gcctgtcacg ttcgccaaat 780 ttaccgaggc attccaggct gtgttttttg attccgacga 
cggcaacacg ctgagaaggc 840 attacgtgcg ctgaagcaaa ccaagtcagc cgctgagtac acaattcagt tcaatcaact 900 cgcgcctacc acgggatggg aactttcgac tctcattagt cattattgtc aagggttgaa 960 attagatgtt cgtgtagcaa tggtgcggga gcaatttgag aagctggaag acatcacaag 1020 attagcgtgt gcaattgatg gagaattaag gggtgaagtc agcacttaca caccgccaac 1080 ccgctcagca ccctcggacg cgatggatat atcatcagct aggttttcta tttcgtcaga 1140 ggagtatcaa cgtagagtgg gggagcaact atcttttcat tgtgggaagt ctgggcatag 1200 agcgagatgg tgtaagggga agggtcgtga taagggaaga gttggaggaa agattgctga 1260 actggaggct aagattgcgg ctttagaagg agagaagacg gagagaatca ggttgtcagc 1320 tgctgatgag tcaaaaaatg gaggcgctcc gcagtgacgg acgtgccacc cctgggccta 1380 ggcaggaatg aagagatgag ggagatagat actgttagca accatgaaat caagggctcc 1440 gacccacgtg tgtttacctc catctccctc tccaagtccc accttgccac gtcccataaa 1500 cctttgaccc ccccagccca agctttgatc gactgtggtt caacacatga agtgctatgt 1560 accacatttg tcgagcgcag tggtctacct gtatccaaac tcaagaccac aggtgatgtc 1620 tatggattca acggtgcccc acgcactgtt gcccacgacg ccgaactcta cattgataat 1680 gatgaagaga agacccgatt tctggtaacg aaaatcaagg acttgtatga cgccattctt 1740 ggcatgccct ggttaaagaa aaacggccac cggatcgatt ggaagaacgg catattacgc 1800 ccgaagtcag aatcaactca gatagcagca ctcaaccttg ctaccgaaaa ccaccaatcc 1860 aacaccgact tacttgaaaa acacgatcac cttgctaccc ttgaagtggt atcgtctagc 1920 ccgaaaaaca ccccggatcg ccaatgtgct cggaagggga aagcctggga cagtgacaag 1980 ggggttgtta gcattgctag cgttaaaccc ccgcaaggtg agttcgataa tggacccaga 2040 cccttactcc ttgccacgga tgacaagcct ttaccttttc gaattcagac cctaccagcc 2100 atacccgaga catcaatgcc aaaggaacca actgcacccg ccactgagga gcggataaag 2160 cccaaagtgt cgcctactcc gacaaacacc ccggataatg tcagcgtgcg gaagggggaa 2220 gctaggaaaa ttggcgaggg ggctctctta cttcaggaga tcaagccccc gcgaggtgag 2280 tccgatacgc ccattcccag tgttgttgaa gcagatcaca agcttttaca ccccgggatt 2340 agatttgacg catcaatagc cgcggccaac gcatcctgga acgtgtcggc taaactggcg 2400 gctgaagctt caaaagacaa gattgagatg tcagcagaag aactcgtccc aacccagtat 2460 cacagatata tctccatgtt tataaagacg aagtcaatga cattacctcc 
ctgccaaagg 2520 tacgatttcc gcgtagaact cgtgccagga gcaaccccgc aagccagcaa aatgatcccg 2580 ttgtcacctg ccgaagagac tgccctcaac accatgattg atgagggact gagcaaagga 2640 accctacgta ggacaacgtc gccgtgggcc gcacctgtct tgtttactgg aaaaaaagat 2700 ggtaacctcc gtccttgctt cgactaccgg aaattaaatg cactcactgt gaagaaccgc 2760 tacccactcc tgttaaccag ggagctaatt gacagcctgc aagatgccga gaattacaca 2820 agcctcgaca tgcgcaatgg ttataacaat ctgcgggtgt gggaaggaga tgagaaaaag 2880 ttagctttca tctgcaagag aggtcaattc gaaccactag taatgccctt tgggccgacg 2940 ggagcccccg gctacttcca attttttata tccgacatat ttcgagacag gattggaaag 3000 gatctagccg cttacctgga tgaactactg atatacacgc cagcgggagt tgaccacgag 3060 agtgttgtcg aagaagtgtt gaaaatactt gagtcgcaca gtatctggtt gaagcccaag 3120 aaatgcaagt tttcaagaaa agagattgac tacctcggac tactgatctc aaaaaataaa 3180 gtacggatgg atccattgaa agtatcagcc gtcaccgact ggcctgcacc tgcaaatgta 3240 acccaagttc agcaattctt atggtttgct aaattttata gaagattcat caagaatgtt 3300 tcaaaagtga cacgaccctt acacgactta acttgtgagg aaaaactgtt cgtgtggacc 3360 aaatcgcgag aagatgtgtt ccaaacacta aagacttcct tcacaacggc accgatactg 3420 agaattgtgg atccctataa gccctttatt cttgagtgtg actgttcgga ctacgccttg 3480 ggcgtagtac tttcccaaaa cgacgattga ggggttctcc acccagtagc attcctgtcg 3540 agatctctag tccaggctga acaaaactat gaaatttttg ataaggagtt gttagctgta 3600 gtagcgtcct tcaaagaatg gaggcactac ttagagggga atccaaacag actagaagtt 3660 acggtattta ctgaccataa gaacctcgag acgttcatga caaccaagca actcacaagg 3720 aggcaggcta ggtgggctga ggtattagga tgcttcgatt tttatatttg attccgacca 3780 ggacgagatt cgaccaaacc caatgcctta tcgaggagac cagacttgga accgacacag 3840 gatgagaaac tatcattcgg gagtcttcta cgaccggaga acctatctga gtcttcgttc 3900 aacgccgacc tagacagcat tgaggcctgg tttgaggagg tggaggactg gtttgagcaa 3960 gacgtaatgg aatcaacgcc agaaacaatt gaaattgacg ctattgaaag aacgcaagac 4020 tcacctgtct ggactgatga agccatactc gacagaatac gggaacaatc gccactcgac 4080 tcacgtattg aaaacttaat gaaagtagtg aagacaatga agggaaaggt gttataagat 4140 gcagttaagg gttatgaggt ccatggaggt atactttaca aagatggcct gatagaagtc 
4200 ccaaacgata gtcgctgtaa gtttgagatc ctgagaagtc gacacgacag tgccttggcg 4260 ggtcacccag gacggatgaa gaccttgagc ttggtacaga gacaatacca ctggccatcc 4320 atgaaaatgt acgttaacaa gttcgttgat gggtgcaact cttgcctaag agtaaaacca 4380 tcaaaccaag tacccttcgg atcgctggaa ccgctaccta taccagcggg gccttggaca 4440 gacgtcagct atgacttcat tacagaactt ccattgtcaa acggtaagaa ctgtatattg 4500 acagttgtag attgccttac aaaaatgggg cacttcatac catgcacgac agaaatgaat 4560 tcagaagagc tcgctacact gatgctgaag tatgtatgga aattgcacgg cttacctagg 4620 acaatcgtat ctgaccgagg gagcgtgttt gtgtcaaaaa tcaccgaatc actgaacaac 4680 caactgggca tcaagctgca cccgtcgacg gcgtatcacc ctcagacaga cggacagacg 4740 gagattgtta acaaagccgt agagcagtat ttgagacatt tcatatctta ccgacaaaat 4800 gattgggaag agcacttacc gcttgctgag tttacttata ataacagtac acatttgtcc 4860 acaggtgtat tgcccttcaa agccaacgtg ggatatgacc tatcgtttgg aaggatccca 4920 tcgactgaga ggtgcatccc ggttgtggaa gagcgactga agactattga agaagtacag 4980 gacgaattaa aagaatcatt aatgcaagcc caagaaatta tgaaaaagaa tcatgatact 5040 cacacaaggc ctacatccaa ttggaagata ggtgatcatg aagtgtggtt gaatagccgt 5100 aatatatgaa ctacaagacc tagtagcaaa ctcgaccatc ggtggttagg tcccttcagt 5160 atcgtcaaga aagttttgac ctcggcgtat aaactggcgt tacccataag catgagtaaa 5220 gtccatccag tattccatgt atcagttttg aggaagcact caccagacgc catcaaggaa 5280 agagtagaga cagcgccacc agcaatcgaa atcgagggtg aagaggagtg ggaagttgag 5340 gaaatcctag acaagcgtcg aagaggcgga aaaatcgagt atttgatttc atggaagggg 5400 ttcaatcgaa ctgaggattc atgggaacca ctgattaacc tacagaatgc aaaggagatg 5460 atcgacatgt ttgatttaag gtttccgaaa gctgaggaag atcacagagg aaccaggagg 5520 gtacggaatt agagagaggg gctgaagctt tttcccaccg ggttttttaa cgccaagccc 5580 cggggaaaga cgtccggcag caaagagggg ccggggcgta aaaggcggga tag 5633 // ID CACTA-2_Ccinerea repbase; DNA; FNG; 1872 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW EnSpm; DNA transposon; Transposable Element; CACTA-2_Ccinerea. 
XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-1872 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX CC incomplete. XX SQ Sequence 1872 BP; 462 A; 518 C; 464 G; 428 T; 0 other; ttgggacatc tgggatgcaa ctgcacttcg agatttcaag gacagcgacg ggacgacatt 60 cgtcaaggct ccaaaggggg agagtcgtct tgtctttagc ctcaatatgg atggctttaa 120 cccataccgc aacaaagagg caggaaagaa ggtgacggta ggggctctgt acatggtctg 180 cctaaaccta ccaccgacga tcagatacga tcttgaaaac atctaccttg ttggcatcat 240 tcccggtcct caaggtcctt cccaacacga actcaaccac cttcttcgtc cgcttgtcaa 300 ggatctgctg aggttatgga gccctggggt ttacctcacc caaacgtttc gcttccctga 360 aggctgcact gtccgcggcg ctttggtccc attggtatgc gatctgccag cttcgcgaca 420 aacgagcggc tttgcccact acaaatccaa caagttctgt tcggaatgcc atcagtgttt 480 ggacgacatc aatgacctcg acgtcgaaaa ttggagaagg cgaagctgga aagaccatct 540 cgaatgggct aggaaatgga aatcggcatc ttctctagcc gaacgagaag cgatcaccca 600 tgaacatgga gtgcgctggt cggagctgct gcgattgcca tattgggatc ctaccgcctt 660 caccatcatc gactcgatgc acgctttcta ccttcgatta tttcaacacc actgtcggtc 720 gatttggggt atggatgtca ctctcgaaga tggggacggg atcaccttcg atcgcacagg 780 cacacagcct actgaagccc aggttcgaca tggcacgcat atcctccgcc acggcacaga 840 ggaagcgctc aaagggctat ctgttcccgt tctccgtgag ctatgtcgcg agacatctac 900 cttgaatttc cggggaaaga aaaagtttct tgtcgacaga ttgcttcaat atgtacgtac 960 ctcgtctaga tatctcgaca aagtactgac cctcagatag cgagtccggc agggctggtt 1020 cacggacaca ggtcgatacg tacccccctc ctcgtctgaa gatcctgccc ctgtcagcaa 1080 cgatgaatct gaactttcca aagggcgaac aattgaagac cttttcgcga acgggagcaa 1140 gacggacatc aggaagctca cgaaggtaga cgcacttatc gtgttccgta gcaaggttat 1200 gccgatgatt tctccaccca tgtcggaaga acttgtatcg agactgggga aggaggttct 1260 caaagagtac atccaaaacg aggttagtgc acttacgcct ccagagtacg 
ggttttctca 1320 ttaccatgct cactttctcg tagcggaagc ggctaggctt gatatcgtcg gacggatcga 1380 gccctgcaac tgcgccaagg aaaacgcgtg tcctgggtag aggagtcctg gaggaaatac 1440 gacgcgatat ggtgcaactg cgggtcccca cttggcaagc gcttgctcca aagcgaccgg 1500 gagaaaagaa atggggcaag ttcaccgcag atcaatggag gaccttctgt atgattaacc 1560 ttccaatcac cctcatccgc ctgtggggaa gcaagccttc tggctctatc gaacgccgcc 1620 gcctcgagaa cttcatgcac cttgtcagcg ctgtcaagtt agccaccatg caccgattaa 1680 acgaagaacg tattcgccaa tatgaatttc acatccgcca gtacctcacg acgctgcttg 1740 agctgtatcc cggtactacc atcacccctt atcaacacct cgctctccac tttggacggc 1800 aactgcgctc tttcggccca gtacatgcgt ggcgttgttt cccgttcgaa cgctacaact 1860 acattatgca ga 1872 // ID hAT-N2_AN repbase; DNA; FNG; 397 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 09-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Nonautonomous DNA transposon. Putative classification: hAT DE superfamily - a consensus sequence. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; hAT superfamily; hAT-N2_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-397 RA Kapitonov V.V. and Jurka J.; RT "hAT-N2_AN, a family of nonautonomous DNA transposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(12), 219-219 (2003). XX DR [1] (Consensus) XX CC Nonautonomous DNA transposon. Putative classification: hAT CC superfamily. CC 8-bp TIRs; subterminal inverted repeats; 8-bp TSDs. 
XX SQ Sequence 397 BP; 109 A; 95 C; 102 G; 91 T; 0 other; tagggttgtc aacccgcacc gcaacccgca gcgggccacc gcaccgcacc gcagcggtgc 60 ggtgagggtt gcaaaattgg cagtccgcgc gggttgcggg ttctaatagg ggaacgcgtg 120 cgggtttgcg ggccaccgcg cgggtagaaa aatacataaa aatacataaa attcataaaa 180 attcacaaaa ttgcataaaa tatcataaat atgtataata ttgtgaaaat tatgaaaatt 240 atgtattttt atgtattttt taacctgcgc ggtggcccgc agatctagta gtacaggttg 300 cggtgcagtg cgggttactt atctagaacc cgcaaacccg cgcgggcaga tttctaaccc 360 tgcgggttgt acccaacccg caccgagtgc accccta 397 // ID Gypsy-21_MLP-I repbase; DNA; FNG; 5506 BP. XX AC AECX01001225; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_MLP_; KW Gypsy-21_MLP-LTR; Gypsy-21_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5506 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001225; Positions 107727 102222. XX CC Positions [4305-4784] - Integrase core CC 'CAAGA' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(1602..4163,4167..5216) FT /product="Gypsy-21_MLP-I_1p" FT /translation="MKDTRIIDIITLHDPQKATTTLARALIDSGATHEAVS FT LSFVQTHSLSTSQLQESQTVVGFSGHESRINEVGKYKINKDEDSTTFIVTN FT LHDKYDLILGMPWIKKNYEIIDWKSGKLKKEEEEIATVETVLSGLQKAWLL FT SKEPLRHAKNRDEGVEFKFNSLTPPQCECNATISNPVSKTGSDLVSPLMNR FT QRILQGLTNIEKRYTKPTQFHHKFNQSYPTYKIAAAKTSWNISEKLAADEK FT KDQVVKTAAELVPATYHEYLHMFEKNNSNVLPPHRPYDFRVDLIPGATPQA FT GQIIPLSPKENDALNEMIEKGLSNGTIRRTTSPWAPPVLFTGKKDGNLRPC FT FDYRRLNVLTVKNKYPLPLTMELIDSLLDADEFTSLDMRNGYNNLRIREGD FT EGKLAFICKAGQFEPLTMPFGPTGAPGFFQYFIQDILKNHIGRNVAAYQDD FT ILIYTKPGEDHKAVVKEVLNILRQQNVWLKPEKCSFSCSEVSYLGLVISRN FT QIRMDETKVKAVKDWPAPKNLTEVQMFLGFANFYRHFISHFSKIAKPLHEL FT SRKNEPFSWNKGREEAFEKLKIAFTSAPVLKIANPYRPFILECDCSDFALG FT VVLSQISIDNEQLHPVAFLSRSLLQAEKNYEIFNKELLAVVTSFKEWRQYL FT EGNPHRLNVIVYTDHKNLQSLMSTKELTRRQARWAEVLGSFDFEIRFRPGK FT QSTKPDALLRRPDLMPSKGEKLTFGQLLKPANLPTDAFLDELDLIESWVDD FT DLDEISILANKISEDNTDEQDKVWNDSQIIMEIRKKSTTDQKIKELIQLCK FT DMPVSRLLDGYNTEEGLLYRHGRIVVPNNDKLKLEILRSRHDSRIAGHPGR FT NTLALVKRSYYWISMKSYVNKYIDGCQSCQRVKARMTKPFGTLQPLPIPYG FT PWTDICYDMITDLPESNGFDCILTVVDRLTKMAHFLGCKKTMNSDNLTTIM FT IKQVWKLHGTPKSITSDRGSVFISKITKDIHRKLGISTQSLTAYHPQTDGQ FT SEVTNRAVEGYLRHYTTYMQDDWHSLLPIAEFSYNNSHHVAIGMSPFKANY FT GFDVTFTGVPNSGQYLPIVEEQFKRIKEVQQELKSAMEQAQESMTIQFNKK FT VQASPMWKINDKVWVNSKNIATTRPSAKFAHRWLGPYSIVKQINANAFKLK FT LPDSMKEIHPVFHVNLLRRFEESEIKNQTNKPPTPVVINNEMNMK" XX SQ Sequence 5506 BP; 1950 A; 1096 C; 1121 G; 1339 T; 0 other; tattgtagat aggataatac aaaatcaacg gactcattca aaattagatt aagctcaaaa 60 gtacaagtat tcaagttgaa agtttaaaag aaagaagaag aataaagtta atagttaaga 120 agaaagttta aaagtttaaa agtttaaaag tcaattaaag tttcaagatt cagtatcaag 180 tagaaatcta gatctctcta aatccgaagt ataccactca ccattcacaa accgcaagta 240 aagttagcag tattcaacca acctcgttca tattcgcacc acatcattaa aatcagtcac 300 aacgcccgat ttcaaagctc ctactaattc ttcggcctcg gagtcgagtg aaagctctga 360 aaagtttata gaagtctttc acgaaccggg taatttgaca atggactctc agggagatcc 420 cactatggcc gaaattctac ggcaattaaa 
tcaaatgaat gctcaattca ccgaagtcaa 480 agctagccta gccgaagaat ctctgaaacg ccaagaagct gagaagcgaa gcgaagaagc 540 tatgcaacgc ctgcaacagt atgaaatctc gatgtcgaac gctacacaac ctgctatgcc 600 tatgatgccc gatcctacta cttcaacatt gcctactcaa cctgtatatg ttaatcaaac 660 gacgccatct cagaaactcc cgaagatgtc aactcctgac aagtttgacg gtagcaaggg 720 ctcaaaggca gaagtgttta tgaatcagtt aggtctttac atgcaattaa acagcaattt 780 attcgccaat gaacaagcaa aagtagcgtt cgcattatcg tacacgactg gcaaagctag 840 tatctggggt caaagcttag tagatcaact attagattct gagaatgcgc atctagtaac 900 ctggacgaaa tatatcaact ctttcaaagc aacctttttc gactcggaac gaattgcgaa 960 ggctgaaaag gaattacgtc cctaaaacaa aaaggatcag tatccgatta ttggatcaag 1020 ttttcagaac tgtcacttgt agtaaaatgg ccagaagctg tgttacgatc tcaattcgag 1080 caaggtttaa aaacggaaat ctcggtatac atggtgagag atgtttttga caatgctgat 1140 gaaatggcga agactgctat caaattagac aataaaatca ataagcgtca agattacaac 1200 tcctatccga caacgtcaaa catcactact cctgcctcta cgccagctat tgaccccgac 1260 gctatggact gttcggctta tcaactaaac atatcaggag aagagtatag aagaagagga 1320 acctcatcag cctgttattc atgtggaaaa accgatcatt acatagctag ttgtccagat 1380 agatcaagga ggggaagggg aggcagtcgc tttagcagag ggggtagtag atttaaagga 1440 agggtggcgg aaatggaaag tagcttagga ggaagtgaaa gtgaaaagaa ggatgaaagt 1500 agagcagagt cttcaaaaaa tggcgatact cgggagtgaa cgttgtgcct cccccgagca 1560 atcaatttgt ggatgaatta ggagatgtta gtagtttagt aatgaaagac actagaatta 1620 tagacatcat aaccctacat gacccacaaa aagccacaac tacccttgcc cgagccctca 1680 ttgacagcgg agccactcac gaagcagtaa gcctaagctt tgttcaaacc cactcactta 1740 gcacctctca gttgcaagaa tcacaaactg tggtaggttt tagtggacat gagtcacgta 1800 tcaatgaagt aggcaagtac aagatcaaca aagatgaaga ctccacgacg ttcatagtca 1860 ctaatctcca tgacaagtat gacttaatct tgggtatgcc ttggataaag aaaaattatg 1920 aaatcattga ttggaaatct ggtaaactca aaaaagaaga agaagaaatt gcaactgttg 1980 aaacagtttt gtcaggtctg caaaaagcct ggttgttgtc taaggagcct ctcaggcacg 2040 ctaagaaccg tgacgagggg gtggagttca agtttaactc tttaacaccc ccacaatgtg 2100 agtgcaatgc aaccatatca aatccagttt caaagaccgg 
tagcgatctt gtttctcctt 2160 taatgaacag acaacgaatc ttgcaaggat taacgaacat tgagaagaga tacacaaagc 2220 cgacccaatt tcatcacaaa tttaatcagt catacccgac ttataaaatc gctgctgcca 2280 aaacgtcatg gaacatttca gaaaaattag cagctgatga aaagaaggat caagttgtca 2340 agaccgcggc tgagctagta cccgctacct accatgaata cctgcatatg tttgagaaga 2400 ataattcaaa tgtcctaccg cctcacagac cttatgactt cagagtggat ctaatccctg 2460 gcgctactcc gcaagcaggt caaataattc cgttatcacc aaaagaaaat gatgctctca 2520 acgaaatgat tgagaaagga ttatctaatg gaactatcag aagaacgaca tcaccttggg 2580 cccctccggt actgttcact gggaaaaaag atggcaattt acgcccttgt ttcgattaca 2640 gaagattgaa tgtgttaacg gtcaaaaata agtatccatt accattaaca atggaattaa 2700 ttgatagtct attagatgca gatgaattca caagcttgga catgagaaac ggttacaaca 2760 acttacgcat tagagaaggt gatgaaggca agcttgcgtt tatttgcaaa gcagggcagt 2820 ttgaaccgct aaccatgcct tttggaccaa cgggagcccc tggcttcttt cagtacttca 2880 ttcaagatat tttaaagaat cacattggac gaaatgttgc tgcttatcaa gacgacatcc 2940 tcatctacac gaagcctgga gaagaccata aagcagttgt aaaagaagtc cttaacatct 3000 tacgtcagca aaatgtgtgg ctgaaacccg aaaaatgctc attttcatgc agtgaggtct 3060 cttacttagg actggtgata tctaggaatc aaatcagaat ggatgaaact aaagtgaagg 3120 cggtgaaaga ctggccagca ccaaaaaatt taaccgaagt gcaaatgttt ctaggatttg 3180 ccaactttta tagacatttc atttcacatt tctccaaaat agccaagcca ctacacgaat 3240 tatcacgcaa aaacgaacct ttctcatgga acaaaggacg agaagaagcc tttgaaaaac 3300 tcaagatagc tttcacctca gctccagttt taaaaatagc caatccgtat aggccattca 3360 tcttagaatg tgactgttca gattttgcac taggtgtggt actatcacaa atatcaattg 3420 acaacgaaca acttcacccg gttgcattcc tttcaagatc attattacaa gctgaaaaga 3480 attatgaaat attcaacaaa gaattactag cggtagtcac ttcattcaaa gaatggagac 3540 aatatttaga aggaaatccg caccgactca atgtaatagt ctacacagat cacaaaaact 3600 tgcagtcctt aatgtcaacc aaagaactaa cgagaaggca ggccagatgg gcagaggtcc 3660 taggaagttt tgattttgaa atacgtttcc gccccggcaa acagtccaca aaaccagatg 3720 ctcttttgag gcgtcctgat ttgatgccaa gtaaaggtga aaagctaacc tttggacaac 3780 tgctcaaacc tgcaaacctc ccgaccgacg cattcctaga cgaacttgat 
ctaattgaat 3840 catgggtaga tgatgacttg gacgaaatct cgatattggc caacaaaatc agcgaagaca 3900 acacagatga gcaggataag gtgtggaatg actctcaaat cattatggaa attaggaaga 3960 agtcaacaac agaccagaaa atcaaggaac taattcaatt atgcaaggac atgccggtgt 4020 cacgattact agatggttac aacactgaag aaggactatt gtaccgccat ggaaggatag 4080 ttgtccccaa taatgacaag cttaaattag aaatcttacg atcaagacat gacagcagaa 4140 tagcaggcca cccaggccga aattgaacgc tagctttagt aaaaagatca tattattgga 4200 tatcaatgaa atcatatgta aacaaataca ttgatggttg tcaatcatgt caaagggtaa 4260 aagcaagaat gactaaacca tttgggactc tacaacctct accaatacca tatggccctt 4320 ggaccgacat atgctacgat atgataacag acttacctga atccaatgga tttgattgta 4380 tactaacggt ggtagatagg ctaactaaaa tggctcactt cttaggttgt aagaaaacga 4440 tgaactcaga caatttaacc accattatga tcaagcaagt atggaagtta cacgggacac 4500 caaagtctat tacttctgac agaggaagtg tcttcatatc gaaaatcacc aaagacatcc 4560 atcgtaaatt aggcatcagt actcagtctt tgactgcgta ccaccctcaa acagatgggc 4620 aatcagaagt aacgaaccgt gctgtagagg gctatttaag acattacacc acttacatgc 4680 aggacgactg gcactcatta ctaccaatag ccgagttttc ctacaacaat agtcatcatg 4740 tggccatagg catgtctcca tttaaagcta attatggttt tgacgtaact ttcacaggtg 4800 tgccgaactc tggacaatac ctacctattg ttgaggaaca gttcaagaga attaaagagg 4860 tgcagcaaga acttaaaagt gcaatggaac aagctcaaga atcgatgacc attcaattca 4920 ataagaaggt acaggcttca ccaatgtgga aaatcaacga caaggtctgg gtaaacagca 4980 agaatattgc gacgacgaga ccatcagcca aatttgcaca tagatggttg ggtccttatt 5040 caattgttaa acaaattaac gccaatgctt tcaagttgaa gctaccagat tccatgaaag 5100 aaatacatcc ggtctttcac gtaaacttat taagaagatt tgaagaaagt gaaattaaga 5160 atcaaactaa caaaccacct actccagtag tgatcaacaa tgaaatgaat atgaagtaaa 5220 tgaaattttg aacaaaagga aaaggtatgg aaaggtggaa tacttgatta attggaaagg 5280 ttacggcccg gagcaagatt cgtgggaacc agaagcagga gttcaaaatg caaaggactt 5340 agtagatgaa tttaataaaa ggtttcctgg aaaagaaaaa tcatatagaa aggcaagaag 5400 gttagtgaga gggtgacact ttttcctcaa gtgggttttt taatgcaaac ccgtggaaaa 5460 gatgtcaggc tcagcaagag ggagctggga cataaagggg gagtga 5506 // ID 
DIRS-1_MLP-LTR repbase; DNA; FNG; 910 BP. XX AC AECX01001077; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 24-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW DIRS; LTR Retrotransposon; Transposable Element; Copia; KW Copia-45_MLP_; Copia-45_MLP-LTR; Copia-45_MLP-I; DIRS-1_MLP-LTR. XX NM Copia-45_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-910 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX RN [2] RP 1-910 RA Kojima K.K. and Jurka J.; RT "DIRS-type retrotransposons from fungi."; RL Direct Submission to Repbase Update (24-MAY-2011). XX DR Genome; AECX01001077; Positions 182454 183363. XX CC [2] Re-classification as DIRS based on the presence of tyrosine CC recombinase. 
XX SQ Sequence 910 BP; 351 A; 157 C; 195 G; 207 T; 0 other; tgtaaagatg agaggtgcag gctcgagatg agctgagtat cgtggctatg ccgaaatata 60 gcagatctgg ggaaatcagg tctgatgtga gtttgggcaa caggcctcac tcagctaaaa 120 gatgctaata agaatgagac aaacctgggg aaatcaggtc tgatgtgagt ttcggcaaca 180 ggcctcactc agctaaaaga tgctaataag aacgagacaa acctgtgcga atgcaacaag 240 gtcaagtaaa gtgatcaaat gaaaagatga gtaaacacta aaataagatt cttttcgatg 300 atttgtaaac taatcatacc tggggaaatc aggtctgatg tgagttttgg caacaggcct 360 cactcagcta aaagatgcta ataaaaacaa gacaaaccta gcgcaacaac aattagtaaa 420 aatgaaagaa agagaaagat atagtgaaat gacacaaacc tgtgcgaatg caacaaggtc 480 aagtaaagtg atcaaatgaa aagatgagta aacactaaaa tactgaagaa acatatataa 540 gagctaatgg tcaggctgat gatgttagga aggataggtg taaggaaaat cactcacaga 600 ttctttttga tgagggaaaa gaatctgggt gttgaacaac agtagctagt tatacaagca 660 tcacacacaa tcgtcaatac gtaacatatt aaaagagaaa gaaaaacaca cagctcaagt 720 cacacgactt attcccaaaa caaacataca cttccgtaac agatcatttc ataactcaac 780 tctccttcat ttcctcttga tttgtaacag tcgtacataa tgtgtaataa atgccggaag 840 tccttgacga gtgtaattag agatgtcaaa gcggacaccg tggtaggttc agtatcacag 900 tatcacaaca 910 // ID Gypsy-3-I_AF repbase; DNA; FNG; 5726 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Internal portion of the Gypsy-3_AF LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-3_AF; KW Gypsy-3-LTR_AF; Gypsy-3-I_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-5726 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-5726 RA Kapitonov V.V. 
and Jurka J.; RT "Gypsy-3_AF, a family of gypsy LTR retrotransposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 64-64 (2006). XX DR [2] (Consensus) XX CC This is an internal portion of the Gypsy-3_AF LTR CC retrotransposon. Two proviral copies are 94% identical to each CC other. All 374 mismatches are transitions. Therefore, most of the CC mutations was induced by RIP. Its PBS (pos. 2-13) is CC complementary to Gypsy-3-LTR_AF (pos. 207-218 (self-primed CC reverse transcription). XX SQ Sequence 5726 BP; 1715 A; 1385 C; 1286 G; 1340 T; 0 other; taatactaag ctttctttat ttctcttttc ttccctgcta ctagattgca gagatcttca 60 ccacactcgt taaacaacgc gttgtggtcc tttctctgtt ctctggaagc aagcattttt 120 ctgcatgcca acgacgctta acagttggac caaacgaact gtgcgaacca acgggcgaat 180 caagatgaat gaaaagacca agaacaaatt accgatgaac gactgaatgg ataacaaaat 240 ggctaaacca gctgagatcc ccctatcaga gggcagccac gcccatggat tcactagacg 300 actgaccctt ctcatgaacc aagttgagag taatgggatg gatgaagaca ctagtggcga 360 ggcggaatct gtctcctatg aagaattcac tgagcattgg aagactgatc caacccgctg 420 ttaggaggcg atcacccaga tctatagaaa tctcaagaac tggtcagacc agcgaggcga 480 ggacctcact ctcatgaacg aatgagtcat tgaactgaac aaagaagccg ataccctgaa 540 ggactaaaag gaagcagtaa ttcttgagcg tgactgtctg actgcggcac tgaatgctgc 600 taactaggaa cgagacgatg ctctcaagca gttggactaa gtatggcgtg aataagatga 660 atttgcccta cagattgccc gtggctttgg aagtattagc tcccagactt ttccagctgg 720 aattagcagc aagttaatta agatccctga ccctctgctc ctaggaaaca gcaaagaccc 780 ttgatttgaa gactagtttc tggcgattaa acagaagttg taagctaatg ctgatcatta 840 caacactcca accctctgaa tcgcctatgt tgcaagccaa actaaaggaa acgcccgcaa 900 gcatattact ccttgtctct gagacgatac cccaaacccc tacaaggacg ccaccgatat 960 gctcgtttat ctagagaatg tcttcgctga ccccaaccac gagcgagtgg ctaagcagaa 1020 gtataacacc ctgtatatga ggcctagcac aaagtggcat gacttcctct ctgagttctt 1080 gtatctggcc gtggaagcag gagtggtgga agagacttag aaagatgacc tgtataacaa 1140 gctcacactc cgcatgcaag aattgacaat gcctgcctac aatgatgaca ctaaatcttt 1200 ctgggagttc actgactact 
gcaaccaaac agccaccaac attgagaaca tggaacagat 1260 ttgctcttgc atgaacaaca ctagtggagg aacccacagc aaaggaaccg cgaccactgg 1320 aggaacaacc acccaagcag ctaagacaaa cctcaccacc accgcagcca agggacgcac 1380 ggctacccct agcctggatg aggaaacccg caagcgcctg atggtggaag gaaagtgctt 1440 ccactgcttt aagccaggtc atattagtcc aaattatcta gagaagcaaa ttaagcagct 1500 gaaagtgctg gaacagctag ttcctgagga acaaggggtc tcagaaaacg accaagccta 1560 ggacaagtct ccttcctagg cgaaaagcga atcgaactct gtaacctaga tctgaggaac 1620 ctactaggcc atgaagagga tagttttaca atgcaaacac atcttttaat aaatggattc 1680 tcagtatctg ctggttcatt gattgattgt ggagccaatg cctatgcctg tatcaacact 1740 tcactagcta tcagcatctc ttaatgattt gggacaccta ctatactact ggaaggggaa 1800 catgctgtta ctggattcga tggaaagcac tcagttccac tgacacatgc tattctcctc 1860 accctagcca ttaataattg cgtgtaacaa cagataccct tcttgatact agacataggg 1920 cgccatgaca tcatcttagg tcgaatgtgg ctagcaaaac atcagatcct ggtgaactgt 1980 gcggctaagt aacttattta gtcctcccct gcctcctact aagaagaaat agctattcag 2040 atgaatcagg acgtgcctcg cagttgcctg ggtcgacgat ctatcaatca ggaacaccag 2100 agagacgcta tcagacgaga tcgatatttt tatcgcatgc aagatggtta caacaataaa 2160 tcccttctca aatctcccac ttcacttaat caaatacagc aagagagtaa tgagcaatac 2220 cgttgcgacg tcagtaggag cacttcccga ccacccccat accgagcgcc acgaactgaa 2280 ggaataaacc aacgacatgc tttgcttaag atggagcgag ctttccgcag tgcccaagaa 2340 gtcaagcctg tgcctgctat ctggattata gccagactaa tatctcagaa gatcactgtt 2400 aatatagcag cagtgggagc tgtagccttc caccaatgtg cgcagaagca agagactact 2460 gttttcacca cttctctata taagctagat taattaattg aggacaagag aggtccagaa 2520 gagccatccc acacagagat tctgcgtcta gcagccagag agttgaacaa gccagcaagc 2580 atactggcta gtgagccgct aagcgagtct ggaagggaga ccactgttga tagcatggta 2640 atggaagagc taagtgagat taagcaagtg ctaccaaagc cttatcatga cctcgcagac 2700 gtcttcctga aatcagaatc tgacatctta cctccgcacc aaggcgaatt caaccatcag 2760 attgaactag actaggcaaa caatgtgggc tacggccctc tgtacaagat gaatgcagaa 2820 gagctggagg ctacaagaga gtacatcatt gacaacctgc ataaaggctt catagttcct 2880 agcaatgcac tatttgcttc tccaatcttg 
atggcagaga aatcaggagg aggactccgt 2940 ttctgcgttg actatcggaa acttaatgct atcacaaaaa aagattatta tccactacca 3000 ctcatcgacg aggtgttaga tcgaatatct caagcaaaaa tcttcactaa gctggacatc 3060 cagcagggct ttcatcggat ccaaatggac cctgaatcag aggatttgac gactttccga 3120 tctcgatatg ggagctataa gtattgagtg atgcccttta gacttacaaa tggactcgct 3180 gcattccaaa gatttgtcaa tcatgtcttc attaactatc tagataagta tttgacggcc 3240 tttgttgacg atctgctgat ctactcagat aatgaattag aatattaatt gcatgtacga 3300 acggtgctgc aacaattgcg ggaaaatgga ttgcaggtgt ccttgaagaa atgcgaattc 3360 cacgtcacag agacgcggta tctgggcttt attatctcta ctgaaggcat caaggttaat 3420 cctaggaagg tagaagtcat ccaaaaatgg gcagttccca caactgttaa gggagtgcaa 3480 tctttcctgg gattttgcaa tttttaccgc cgcttcattg aggcatatag ccggatagcc 3540 agacccctaa tcaatctgac gaaaagcgac actccgttta aatagacccc agagtgtgaa 3600 gcagttttcc agaagctgaa gcagcgactg gtatcagcac ctctgctatg actatatgac 3660 ccatctcttc caacgcaagt agaaactgat gcttctgcgg aggttgtcgc agcagtgcta 3720 tcccaaaggc atggtattaa agactggcat ccagtggcat tctattcaaa gactatgtcc 3780 ccagcagagc agaattatga cattcatgat tgagagatgc ttgcaatcat cagagccctt 3840 gaggaatggc gagcagaatt agaaggatta cagcgtgcag agctatttaa tgtcttttct 3900 gactaccaag ccctatagta tttcatgacg tccaaacagc tcaacgcacg ccaggcccgt 3960 tgggcggaat tcatctctcg attctgattc atcatctaat actgtccagg acggtggaat 4020 actcttgcag atgccctctt gcgacctgcc acaacgcacc agaagggaga gaatgatcac 4080 cgtatccgta ctttgctgaa acccgaatac cttagcccct agataaggtc agagatatca 4140 gcattatcca gatcaacaga ggtagtcgtc agagtgctag aagcaaaccg aacagcagag 4200 gaattggatt ctgacagaga gaaagcccgt gcggcagcgg atccccaatg gacattgcaa 4260 ggagatcatc ttctcttcca agggagattg gtcgtgccag atatgggaga tctccgagcc 4320 cgattgctgg atgaaatcca ctgttaaccg tcgacagtgc atccagggaa ggggaagatg 4380 agataactag tcaaggattg atactactgg acctcctgga gcaaggatgt tgattaatat 4440 gtggtgaact gccttatctg ccagcgatta aagacacaac gtgacttgcc tccaggactt 4500 ctgcagctgt tactaatacc agatcgacca tggcagcaca tatccatgga tttccgctcc 4560 ttcccgcgtt caaaaaatga gtttgatgcg gcctttgtag 
tcgttaattg attaactaaa 4620 cgccctgtcg ctgttccctg ctataagacc actactgtga aagacatggc ccggctcttc 4680 ctccagtaca tctacccatg gacagggctg cccgaaacta ttgtattaga ttgaggagga 4740 caattcatat cggagttctg ggatgaactg tgcaaaatcc tttaaataca gattaaatta 4800 ttatcagggc agcatccaca aacaaacagc caaacagaaa ttatgaacca gtatatcgcc 4860 caacgcctgc gaccttttgt tagttattat caggataatt gggatgaatg gttacctatc 4920 ctgaactttg cagcagcagc actccctagt gattctactg gtctatctcc cttcctggtt 4980 gaaagaggtt acaaaccacg catgtcattt gattggacga aagcctcctc accccagacg 5040 ttgcagatag atcgtcaaga ggcccaacaa ctggtaagga gaatggaaca gatatgggag 5100 ctagcaaaat caaacatgca gacagctcag cgttgctaaa aagtacaggc agataaacac 5160 cgcagagagc tggattttga tgtgggggac tatgtttttg ccaccaccac tgactggcag 5220 caaagccacc cgacccggaa gctcagtgat cagatggcag gtccgtatcg gatcttggag 5280 aaggtgggaa atacttattg actcgaattg ctgcctagcg tcaaagtgca tctgatcttt 5340 gcgcctgaaa aattacagaa agccccaagc agcgaaccat taacagggca gcatcttgac 5400 cctcccccac ccatcgaagt taaggggcaa gacgaatggg aagttaatga gatccttgcc 5460 gtctgattgc actaccgcaa gctgcaatat cgcgttaggt ggaaaggcca tgatacagac 5520 ctgacttggt atccagccaa caacttcaag cacgccccag aacagatccg ggacttccat 5580 gcccgatatc ctgacctacc tggacccccg aaacgactgc cggaatggct acgagctgct 5640 gaaactaatg aattcatccc tgaccaccct gacgacgacc aaccacttgt cgaatcggga 5700 cgatcccagc ctaaggaggg gggtga 5726 // ID Copia-13_MLP-I repbase; DNA; FNG; 4822 BP. XX AC AECX01000951; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-13_MLP_; KW Copia-13_MLP-LTR; Copia-13_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4822 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000951; Positions 11249 16070. XX CC Positions [1985-2509] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 77..2551 FT /product="Copia-13_MLP-I_1p" FT /translation="MSSREDTSEDSLLPTPASTPAPENEFEETDAESDTEG FT NNKDTSSESFHSTHTLVPNNPSPALIMANNQATNYETPAQRQSIRVNGILS FT KITINIKLDATNYAPWSDSIRFGLGAAASYDEYLDSESAHSDGVDPSVHLA FT TKKSIFHWLLANMEQTQSTRFISMISTFENGVKKTPTSPMILWKSIRDYHI FT SNSESVKLMLRSEITDLSQGSTKDLLEYIDMFRAKVDAYLGANGEMSEEEQ FT ARQFVRSLNREWAEKGCDLLDAGHIKFRDLEIELKKTYQTRKMFNSGRQQS FT SRVVDTSEASHGRRTGRWQTCSKNRCIGRDHPTKPHDPSECYHHPNNSSKM FT EAWKKSKRDAGEWVEYPRGSRGSSRGRGRGGAHFAGGFSRHIGGSDFPDSR FT ELESAFEGLKLEDREFSYHVDIDGHYSCSAKPQLACVGDQCSSVALIDTGA FT SHHMFHDASLFEVGTMMANDDPKAKLNLAGGGATLDIHSIGNVTLLNSRGE FT EIKLKECLYVPELSRNLIAGGRILRAGAVTTVLEDPNFRIDHGKKELFIGR FT FIGEGSMMYVTIRSAVSRESHQSSKSSVNQLTILKLHYSLGHPSEKYLKRM FT WNLGYFHDVLPQNINPKDFEIISKCPVCPLAKNHRLPFSSTRPRATTFLEN FT VHVDLSGIIRTTAVNREEYYILFTDDFSSYRVSFGLPDKSAETVFDCFKRY FT IAYSERQTGEKLKMFSLDGGGEFINGLLSPYLESLGIVVRVTSPYTPEENG FT VAERSNRTINTKARCMMIQSCLPIRYWYHAVSYSVLLQNRTITTSLNLKAT FT PHSMWKGRQSNFKRFQPFGCLAY" FT CDS 3149..4822 FT /product="Copia-13_MLP-I_2p" FT /translation="MRSIVPKMTQTPLRIFECREAMRARAEPKSYKMAMKS FT ADAVKWREACDKEMKNIQDMGVWEIVDRPKDAPVVGGRWHFKYKLNPDGSI FT SKHKARYVAKGYTQTEGIDFNETFAPTGRLASFRVMVAVAAGKDWDIEQMD FT AIAAFLNSDLKEEIYLELPEGDDDERANGKVARLKKALYGLKQSARCWNDE FT VKDKFQQIGLVQNPHDACLWYGKDENGDETLIYLHVDDMAITGDKIQEIKR FT LLKLKWRMEDLGPAHCIVGIEIHSHEDGGYSLSQPAFIQTVLERFNCEDCK FT PASTPFPGGTKISKASDSEVLDFKQSGLPYNSLVGSLMYIAQGTRPDVAYA FT VGALSQHLSRPSMTAWKMGLHVLRYLKGTQSLGLVYSSQATSVEGIQSWNF FT PKCHTDSDWAGDPSTRRSTTGYVFKLNGEAVSWKSRLQPTVALSSTEAEYK FT ATTEAGQEVVWLRGLLSCISLGQENPTILCSDSTGAVSLTQKAIFHARTKH FT IEVQYHWIRQQVEKNVIKMRHVSSKNMFADTLTKPLHPGPFRELRDQIGLE FT VIHGHLKQGVC" XX SQ Sequence 4822 BP; 1495 A; 1043 C; 1088 G; 1196 T; 0 other; ggacgagctt 
cgagactttt catggtagcg agagattagt tcatctctca attattgaag 60 tcaaaagccg aagcctatgt ctagcagaga agatacctct gaagatagct tactacctac 120 accagcttcc accccagctc cagaaaacga gtttgaagaa acagacgccg aaagtgatac 180 agaaggcaac aacaaagata catcttcaga atctttccat tccacccaca ctttagtccc 240 aaacaatcct tctcccgcgt taatcatggc caataatcag gctaccaatt acgagactcc 300 tgctcagcgc cagtcgatca gagttaatgg tattctttcc aaaattacca tcaacataaa 360 gcttgatgct accaattacg caccttggtc cgacagtatt cgattcgggc taggagctgc 420 agcatcatac gatgaatacc tcgactcgga atcagctcat tctgatggtg ttgatcccag 480 cgtgcaccta gctaccaaga agagcatatt tcattggtta ctcgccaata tggaacagac 540 tcagtccact cggttcatct cgatgatatc cactttcgaa aacggtgtca agaaaacccc 600 aacatctcca atgatacttt ggaagtccat tcgcgattat cacatcagta actctgaatc 660 agtcaagctt atgttaagaa gcgaaatcac cgacctttcc caaggatcga ccaaagacct 720 actcgagtac attgacatgt ttcgagctaa ggttgatgct tacttgggtg caaacggtga 780 aatgtctgag gaagagcaag ctcgtcaatt cgttagatct cttaacagag agtgggcaga 840 gaaaggatgc gacttacttg atgctggtca catcaaattc agagatctcg agattgaatt 900 gaagaaaacc tatcaaactc gaaagatgtt caactcagga agacagcaat ctagtcgtgt 960 agtggataca tcagaagcga gccatggaag acgtaccggc cgttggcaga cctgtagcaa 1020 gaatcgatgc attggccgag atcacccgac aaagccacac gatccatctg aatgttatca 1080 tcatcccaat aactccagta agatggaagc ttggaagaaa tccaaacgtg acgctggaga 1140 atgggtggag tatcctcgtg gaagtcgtgg tagctcaaga ggtcgcggaa gaggtggcgc 1200 tcactttgct ggtggattct cccgacacat cggtggttct gactttcctg attctagaga 1260 actcgagtca gcattcgagg gattaaagct agaggatcgc gaattcagtt atcatgttga 1320 tatagatgga cattactctt gctcggcaaa acctcaactc gcatgtgtag gtgatcagtg 1380 tagttcggta gcactcatcg atactggtgc atcgcatcat atgttccacg atgcttcact 1440 attcgaagtt ggtactatga tggcaaatga tgatccgaaa gcaaaactca acttagctgg 1500 aggaggcgcg accctggaca ttcattcaat cggaaacgtt acgcttctta actcaagagg 1560 tgaagaaatt aaattgaagg agtgtttata tgttcctgag ttatctagaa atctcatagc 1620 tggtggaagg attcttaggg caggcgcagt caccacggtc cttgaagatc caaactttcg 1680 aatcgatcat gggaagaaag aactattcat 
tggacgattc attggcgagg gcagtatgat 1740 gtatgtcact atacgatcag cggtcagtcg agaatcccat cagtcatcaa agtcatcagt 1800 caaccaactt acaattctga aactgcatta ctccttaggt cacccaagcg agaaatacct 1860 aaaaaggatg tggaatcttg gttactttca tgatgtactg ccacaaaata taaaccccaa 1920 agactttgaa ataatatcaa aatgccctgt ttgtccttta gccaaaaatc atcgattgcc 1980 attttcctcc accagaccca gagccaccac ttttcttgag aacgttcacg tcgatctaag 2040 tggaataatc agaaccacag ctgtcaaccg tgaggaatat tatatactat tcacggatga 2100 cttcagcagt tatcgagtat ctttcggact gcctgataag agtgcagaga ctgtctttga 2160 ctgttttaag cgttacatag cttactctga acgacagacc ggggagaaat tgaagatgtt 2220 ctccttagac ggtgggggag aattcatcaa cggtctctta tctccttatc ttgaatcact 2280 tggaattgtc gtgcgagtca cctcaccgta cacacctgaa gaaaatggtg tggcagaacg 2340 atcaaaccgt accataaaca caaaagctag gtgcatgatg attcagtcat gtctaccaat 2400 tagatactgg tatcatgcgg tttcttactc agtactactt cagaacagaa ctattaccac 2460 ttcgctcaat ctcaaagcta ctcctcattc aatgtggaaa ggaagacaaa gtaacttcaa 2520 acgctttcag cctttcggat gcttagccta ttgacatatc agaaaagaaa ttcaaggcgg 2580 aaaatttgaa gctgtatcaa gacccggggt tctattagga gccacggaag acaatcataa 2640 tttcactatc ctcgatttag aaacaaatca catacatacc agtcacgatg tgacatttca 2700 acccttggtg tttcccttta tgaaggatgc caataagaat ccagactggg tttttattga 2760 agatctaccc ttgttgacta gacaggaaga agaacctacg gaagatcaga ttccaggtcc 2820 gacacatcat aatcatgact cagatgatga agatgatccg tttgataata caaacaatcc 2880 tgaagttgta caagaacaga catatgagtc agacgatgat aacgacattg ttgaggagag 2940 tctcactccc cagatcgtta ccaaacctgt tttaacccaa gaaaagccaa ttcaggaatc 3000 aaataagact ccgacaagcc ctgatgctga tagtcccccg tctcagcccc aacaagcgtc 3060 aagaccagaa ccccgaagat ctaatcgaga tcgtcaacaa gtgaattgat acgtgcctgg 3120 tgattctaat ataatccata ctccggctat gcgcagtata gtacctaaaa tgacacaaac 3180 gcctcttcga atctttgaat gtagagaagc gatgagagcc agagccgaac ctaagagcta 3240 taaaatggca atgaagtcag ctgatgccgt taaatggaga gaggcgtgtg acaaagagat 3300 gaagaacata caggacatgg gggtctggga gatagtagac cgaccaaaag atgcaccggt 3360 agttggtgga cgttggcact tcaaatacaa gctcaaccca 
gacggatcta tttcaaagca 3420 taaagctcga tacgtggcga aaggttatac tcaaactgag ggtatagact tcaatgaaac 3480 ctttgcgcca acgggacgac tagcttcttt cagagtaatg gtggcggtgg ctgccggaaa 3540 ggattgggat atagaacaaa tggatgctat cgcagctttc ctaaacagtg atctgaaaga 3600 ggaaatctat ttggagttgc ctgaaggaga tgatgatgaa cgtgcgaatg gaaaggtagc 3660 tcgtttgaag aaggcgttgt acggtctgaa acagtcagct aggtgttgga atgatgaagt 3720 aaaagataaa tttcagcaaa ttggtttggt gcaaaatcct catgacgctt gcttatggta 3780 tgggaaagat gaaaatggtg atgagaccct aatttacctg catgtagatg atatggctat 3840 caccggggac aagattcaag agatcaaaag actactcaag ttgaagtggc gaatggagga 3900 tttaggacca gctcattgca tagttgggat tgagattcac tcacacgaag atggaggtta 3960 ctcgttaagt caaccagctt tcattcaaac tgtcttggaa aggttcaact gtgaagattg 4020 taaaccagca tctacgcctt ttccgggggg aaccaaaatt tcaaaagcca gcgattcaga 4080 agtgttagat ttcaagcaat caggactccc atataatagc ctggttggaa gtctgatgta 4140 catcgcccaa ggaacccgcc ctgatgtagc ttacgcagtt ggagcactgt ctcagcatct 4200 atctagaccg tcaatgacag cgtggaaaat gggactccat gttttacgtt acttaaaagg 4260 aactcaaagc ctaggtctcg tttattcatc tcaagctacc tcagttgaag gtattcaaag 4320 ttggaacttt cccaaatgtc atactgactc cgattgggca ggggatccaa gcactcgacg 4380 ctctacaact ggatatgtat tcaaactcaa cggggaagcg gtaagctgga aaagtcgatt 4440 acaacctaca gtagccttat cgtcgacgga agcagagtac aaagctacga ctgaagcagg 4500 ccaggaggtt gtatggctgc gaggactatt gtcatgtata tcattagggc aggagaatcc 4560 gaccattcta tgtagtgata gcactggggc agtatcattg acgcaaaagg caatctttca 4620 cgctcgcaca aagcatattg aagtgcaata tcattggatc aggcagcaag tcgagaagaa 4680 tgtcatcaaa atgagacatg ttagcagtaa gaacatgttt gccgacaccc tcactaagcc 4740 tctacatcca ggtcctttcc gtgaacttag ggatcagata gggttggaag tcattcacgg 4800 acatctgaaa cagggggtgt gt 4822 // ID Gypsy-14_MLP-LTR repbase; DNA; FNG; 184 BP. XX AC AECX01001397; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_MLP_; KW Gypsy-14_MLP-I; Gypsy-14_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-184 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001397; Positions 15255 15438. XX SQ Sequence 184 BP; 54 A; 53 C; 30 G; 47 T; 0 other; tgttacaact acacatgtaa ccaggcgtga cacacttagg aagagatgtc acagaggaca 60 cgtattagta tctagatcac aatgcttgta ctttcctctt tctcctcatc tgacaatcta 120 catagaggaa ccaggatacc acgtcttcac actccgtccc cagatccccg tcctgatcat 180 aaca 184 // ID Gypsy-12_CCO-LTR repbase; DNA; FNG; 828 BP. XX AC AACS02000012; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_CCO_; KW Gypsy-12_CCO-I; Gypsy-12_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-828 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000012; Positions 880710 879883. 
XX SQ Sequence 828 BP; 215 A; 153 C; 299 G; 161 T; 0 other; tgttagactc tacggaactg cggagtctgc ggagtcgaag ggaccttggc ctcaacgtat 60 tgcgttgtgc ccacgctggt ggtatatggt ccaacccgaa aggagagggg aggagaggga 120 gagagagacg cgccttacgg cggaagacga ggagagagag gagagagcct tgcggagcgg 180 aatcaggagc actgtgcgga tatgcggagt atgcggaaag ctctagcaag cctgaaggct 240 caagactgga cagaggagaa gttctgtata agtctgattc gagattccgg atgagtatct 300 actacagaaa tctagctagc ttatatatct gtagatactc tggatatact ctgggataag 360 tctgggacaa gagaaagaac tgagtcataa gaaatgcgga ataccgcgga atacgaaagg 420 gatgagtaag tcgattccaa agatacaata gcaaagcaag gaagggagtc ataaagaagt 480 ataagaggtc acatgcagag tctgggcgcc tagagggcgg ttacgaggtt tctccgcacg 540 gtccgcaccg gccctcagtt tgcgggccgt gtcgtaccgg cggcgctccg cataaatgcg 600 gagcggagtg tctagcgccc atagactccg catgcagatc gactgcggag catgcgggaa 660 tgcggaggga agggtcgcgg agatatgggg atgtgggatg cggagagatg gtggcgagtg 720 tggggatgtg cgggatcatg cgggagatca tgtgagactg tgcggggagt gtgggtcaag 780 gctggtgggt tgtgggtgga tgtgggtgga tgtttcgaga ccctaaca 828 // ID TCG3_LTR repbase; DNA; FNG; 964 BP. XX AC AY673967; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Candida glabrata transposon LTR-retrotransposon Tcg3 (LTR DE portion). XX KW Gypsy; LTR Retrotransposon; Transposable Element; TCG3_LTR. XX OS Candida glabrata OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Nakaseomyces; mitosporic Nakaseomyces. XX RN [1] RP 1-964 RA Neuveglise C.; RT "A LTR-retrotransposon in C. glabrata."; RL Unpublished, 2004. XX DR EMBL/GenBank/DDBJ; AY673967; Positions 1 964. XX CC LTRs differ by a single bp substitution. 
XX SQ Sequence 964 BP; 320 A; 175 C; 179 G; 290 T; 0 other; tgtaagcatt atgatattag tgcactacat gtgacctatt taagtagatt cgtaaggttg 60 taggaagcta aacgctctga tgagtgtgcg tcatgcataa gtcatgttat tcagttcaca 120 cttgattgac ggatgagatg tatccttaaa gccggcaaca tataaggaga aacctcagaa 180 gtaggaaagt ggagaattga cgtgacatga cgaggtatct cataagatac ctcgggaacc 240 ttgtggggca taaacggatg ccgcctaata gctagcagcg tgccccacta aattaaagat 300 gcggagatta aagttatgtt agcatgacgt acatatgacg aatcaaatct gaagcaaggt 360 gtgacttaac ctcaagatca gataagcgct taataaaata attaccccaa ttacttaccc 420 caactaaaag gaaactccgc ggggtaaaag gagatgttaa gattctccat tcggccataa 480 gttatctaat ccgagaagga tataaatagc atccgaattc tcagacgtga gaattaaaat 540 tcgtttcttc agtaaattgt attattatac tttccttatt ctttcagtag aaagatcaca 600 ataagaacag taataagaaa taccaaaaag cttattctag ttttctaata agacttatta 660 tcatttaaca ttttagtttt gatacttggt acaagttcag cttatatcga gtcacacatt 720 actattgttt cagagttata agttttaagt taaaccagga gaatctttgt tcgaagaacg 780 agtctttata agtcacatag acttattagg ttcctcttct gtgacacata agtgaaagtc 840 cgaaccgttt atatattact gacctgtatt tctattatcc aaactattac atcccgtgat 900 acgtaccgat ctacggctat aatccgctga atacctgtga cgaactcgta ccctgtatct 960 taca 964 // ID OCCAN_MG repbase; DNA; FNG; 2688 BP. XX AC AB074754; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Magnaporthe grisea DNA transposon OCCAN_MG. XX KW DNA transposon; Transposable Element; OCCAN_MG; transposase. XX OS Magnaporthe grisea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-2688 RA Kito H., Takahashi Y., Sato J., Sone T. and Tomita F.; RT "Direct submission."; RL Direct Submission to Genbank (22-NOV-2001). XX DR Genbank; AB074754; Positions 1 2688. XX CC 76bp terminal inverted repeats. 
XX SQ Sequence 2688 BP; 710 A; 701 C; 649 G; 627 T; 1 other; tacttggggg ctgttcgttg tgggccccct caaagagggg gttacaaaac cttgattttc 60 gaaaaaaccg tgggccaccc cgcatcagtt ttttacctag aaaatcgtta acaatttaac 120 ttatgccgag tgtaatcccg ttccataaca acacaattgc acttaccgca aatataccta 180 tatcatgacg tccccagaat ctccagctat cttaggagcc cgcatgtatg cccaatatgt 240 cgaaactcac cgcagtaggg cggataccgc gcgggagggc agatccaagc cagaatcgct 300 acgatcgttc tgtaaaagga acggttaccc gtattcgagt actcagcggc atgctcgtaa 360 tctcctgatc aatggcattc ctaagatatc gatagagcgc agcgggcggc cttcattgtt 420 gactgatgct gaagatcaag ctctcgttgc gtatatcatg cagcttgata aacttggtgc 480 atatcccgac tcacaacacg taatatctgc ggccaatctg atgcgtcagt cacgtatacc 540 gccctgtggt ccagttcaaa tgaactggta cggccgttgg cgtaaaaaac acccggaact 600 gcgacttacc aagcaacaac ccgtcgagat tactcgcctt agttgggagc aacagatcga 660 tcttgtggag gcctggtatc gccgcacggc ggctcatgcg gtagagcttg ggatcacggc 720 ttcagcggcc tggaacgctg acgaatgcgg tactagaatc ggtgtgagag atggccggat 780 accggtattg gtcgtgacga aaaagaggca tgaaaagcca cgcaccgcgg atccggctaa 840 ccgtgagagt tgtacgttga taggcggtgg caacgctgtt gggcagtcgc taccgccctt 900 ctgtatcttc acaaaatggc caacgagcga ttggcttgat ctggatctgc ctgagggtat 960 cgtttttacg cggtcagaaa cggggttttc caacggagat atccagctca cctggataca 1020 gcatttcaac cggtattcgt ggccagaagt tgctgccgtg caatcgatcg gatctcccac 1080 gttgaaagag tggtttggtc atgactttga cgtcagattc gactttgtga accgcactgc 1140 ggccaaaatc gatccaaatt cacagcgggc caaaacacgt atttggaggt ggctttttct 1200 cgacggcttt accggccact ttggcatgga gatacttgac tattgcctca aatttgatct 1260 tgagatcgtt attttaccgc ctcattcaac gcattatatg cagcccatgg acgtatccgt 1320 gttcactcac ctgaaaaacg agttgcaggc ggtcctgcat gagcatataa acaccggtat 1380 tccggtgttc acccgctcaa atttcgttgc tgcgattcga gcaagtcgtt ttcgatctac 1440 aggttaattt agcggtaaac tgatatatct tcccttaact tagcgctgct ggcaaacggc 1500 cttcacaact ggccacgtaa taagcggttt ccaagataca ggcttatttc ctgtcgacgg 1560 caccaaagtg ctcgccaaac tgcgtggcca cgccacggca gcgaatacac cgcgatatcc 1620 ggactccctt cctaccgctg aacgattttc 
acgggccaaa tatgcggcaa gccgtatggc 1680 ggccaaggcg caccacttta gttcagatac cgttgatgct atctcggatt tacaagccgt 1740 tgccaacgag gcgatcatcc tacagcaacg cgttgcccaa gagcagacga ataagcaaaa 1800 acgccttcaa cgcgcttcac gttacaaagt taaaatgacg ttgaagtcaa aagacaagca 1860 gttccagacc gctaatacgt tagaagatat gcgggtcgct cacgaagcca aacaggcgtt 1920 gatcgaggaa acagccctat accaacaaga aaagttcgtc aaagatgcgt ggtataggga 1980 taagaatcag gctattcaac gctggcgaga aacagagggc atgcccaacg ggattaagat 2040 acaacagaag agatatctgg aggacctcag cttcacgccg gagaacgttc cggcggtcag 2100 ggaacgccct ggaaaaagga agaagaccga ttcacagccc tcacagtcat ggttcgtcga 2160 taaagggcca atttcagctg gaaatacaag tacccagatc aatattaccg tggataccga 2220 ctctgcctgt tctatcagca tatgcgtatc acgatcctcg caatcagcct cacctcccgc 2280 cccaaacagg ccttccaaac cgttcccacg gtatatacca ttggctacac cctctccccg 2340 tcaacttcag gatacgagac cgcttkaaga ccgttcctcg ccgccaatac cgggtggtag 2400 aaaggatgag ttggaagtac ccgggtcgcc ttgtaaccgc ttgagcgacc ggaaaagggc 2460 gattgagcca aaactccaga aatagatatc tacagcgcaa ttcacggctt caatttatct 2520 gcaaattcta acctttttcg tgggtgtagt gtttacgaac gtgaatcgaa ctttcatgag 2580 ctactttgtt agcgatctgc taacaaattt gctggcccac ggttttttcg aaaatcaagg 2640 ttttgtaacc ccctctttga gggggccaca acgaacagcc cccaagta 2688 // ID Gypsy-5_LBS-I repbase; DNA; FNG; 6145 BP. XX AC ABFE01000277; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_LBS_; KW Gypsy-5_LBS-LTR; Gypsy-5_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-6145 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000277; Positions 133147 127003. 
XX CC Positions [4877-5365] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 598..1527 FT /product="Gypsy-5_LBS-I_1p" FT /translation="MSSTNIATVARHPGHKVAPVLSDGVTTPVALLDWENA FT CEDFFANCKEPIPDEKKVSKVTGGLQNNRINEYIRNNRVRLHSLKFPEFMS FT ELRETFLPTDWAKETHRKILAARMALDQPFYNFCTDMVSLNNLLAGSTLHL FT SEQRIKEQIFNNITEDLREKLEDSPTELAALNVLPFPKWMNFISETDVKMA FT KAIKRNMKRYALELEKEEKKKRILSGPPHTANASSSGGADNRHPYTQNAYR FT SRNTLPPLTEQERALIYEHKGCLKCRRLYTDHIGRDCNNDFPETHVVITPT FT MAATAKAEWEKRRQNKGK" FT CDS 2243..5746 FT /product="Gypsy-5_LBS-I_2p" FT /translation="MTWPSAVALGDSDSDESVSPPLTFPNLFWDARAIGRD FT GFLVPVKAMIDNGAHIVLIRPDVVEKLGLERKQLRKPQVINVAMKDDQKKK FT NSNPVLLSDYVSLSLSTLDNSWTSKPVTAVIAPGLCTNILLGLPFLVHNRI FT VVDHESPSAFVKETSIDLLNFVPSPRKIARRVKSPRRKRLEIRLLHRDLLK FT ELKWRCSVIKRKLDRMMDIERDNDEFIAINFVAAVKDRLESLELQQKYNAL FT EDKLKAEFSKTFEPIPHVDELPDDVYCRIKLKDATRTISKRTYGCPRKYRE FT AWKTLIDGHVNSGKIRLSNSSFASPSFIIPKSDPTALPRWVNNYRELNSNV FT VIDSHPLPRVDDILADCAKGKIWATIDMTDSFFQTRVHPDDIHLTAVTTPF FT GLYEWTVMPMGYRNAPGIHQRRVTAALRKYLGKICHIYLDDIVVWSQNIEE FT HERNVRLILQALIDAKLYCNKKKTKLFCFRVNFLGHTISQDGIEADEKKAE FT KIENWPVPTTATETRTFLGLVRYLNAFLPKLAIQSDILSVFTTKEAEKKFP FT EWLPKHQLAFDTIKAIVTSRECLTSINHENMGNNKIFVTTDASDRVSGAVL FT SFGPTWESARPVAYDSMTFKGPELNYPVHEKELLAIMRALRRWKVDLLGSE FT FLVYTDHKTLLNFDRQKDMSRRQLRWMEELSIYDCRFVYVKGEDNSVADAL FT SRLPYKYVEKNEQSNAENDASYPFSYHAEHPITVFAPKEKPAMCAIVAALV FT DAAPKNSFRVTIDDELLAKLKASYKTDKWCQKLLRASQGLPSVQSKDDLWY FT IGERLIVPKDSGLREIIFRIAHDNLGHFGFHKCYDNIRDSYFWPNMRRDLE FT EKYIPGCQDCQRNKAPTTKPVGPLHPLKVPDGRCESIAMDFIGPLPEDEGY FT DCILTITDRLGSDIQIIPTRTDVTAEELAIIFIDNWYCENGLPLEIVSDRD FT KLFVAKFWKSLHKLSGVELKMSSTNHPETDGSSERTNKTVNQCLRFHVERN FT QKGWRKALPRIRFFLMNTINKSTGYSPFQLKYGRSPRILPHFDKSEMNDND FT DIDALEIVERIQQNVADAKDNLMLAKISQAYFANGKRRAEIKYKVGDKVML FT STANRRREFKEKPGDGRVAKFMPRFDGPWEITEAHTETSTYKLNLPHTTNI FT FPHFSCVTD" XX SQ Sequence 6145 BP; 1848 A; 1448 C; 1326 G; 1523 T; 0 other; cttttttttg aacaccatcc aaaccgtgct ttccactcgt cctggtcttc actgtcgaca 60 aagtctagag ctcaacacac 
attcgatttt acagagcgag tacgcgaatt tcacgagctt 120 tcaacgctct ctagttctct tctcgtctag gcgttttttt ccgacgcata tactctggtt 180 ccggtccagg cgtcttatta caagcgcata tactcgtgtc cctcgccagt ccagtcccgc 240 cttgtcgtaa ttgtccctta cgcacgcata tacttgcccc ccccactcat actccagtta 300 cagagcctct ttccaatact ggacaaccgg cgcgctacga gaaagaacgt tgcacagcaa 360 ggcctacacc ttgaacctcc tcttatcgtt ccaacccgcg tccgcaagaa atctgtacct 420 gccgaaacga gctccccttc cctatcacca cttccactct caattgcatc ccctcctgtt 480 ctctccaata acccaatacc acgccccacg ccagattcac ccaacttcag ttccagttcc 540 tctacgagca gctccctctt gaacttctct cctgtcgttg tatcgcatcg taaccggatg 600 tcttctacga atatcgccac ggttgcacgc catcccggcc acaaggtagc gcccgtctta 660 tccgatggcg tcacaacacc ggtcgcgcta ctcgactggg aaaacgcgtg cgaagatttc 720 ttcgctaatt gcaaggaacc tattccggac gaaaagaaag tctcaaaagt caccggcggt 780 ctacagaaca accgcatcaa cgaatatatc agaaacaacc gcgtccgttt acactctctc 840 aaatttcctg aattcatgtc cgagttacgc gaaacgtttt tacctacgga ctgggcgaaa 900 gaaacacatc gaaaaattct ggcagcgcgt atggctctcg atcaaccgtt ctacaatttt 960 tgcacggata tggtctcgtt gaacaacctt cttgcgggca gcactctaca cttgtctgaa 1020 caacgcatca aggagcagat tttcaataat attactgagg atctccgcga gaagttggag 1080 gattcaccca ccgagctagc agcgttgaat gtgcttccgt ttccaaagtg gatgaacttc 1140 atcagtgaaa cggacgtcaa aatggccaag gctatcaaac ggaacatgaa gcgctacgcc 1200 ttggagctcg aaaaagaaga gaaaaagaag aggatcctct cgggtccacc tcacacggcg 1260 aatgcatcct cgagcggcgg tgcagacaat cgacaccctt acacgcagaa cgcatatagg 1320 tcccgcaaca cacttccacc tctaacggaa caggaaagag ccctcatata cgaacacaag 1380 ggttgtttga agtgtcgacg cttgtatacg gaccacatcg ggcgggattg taacaatgac 1440 ttccctgaaa ctcatgtcgt gattacgcct accatggccg ccacagctaa ggccgaatgg 1500 gagaaacgtc gtcagaacaa gggtaaatag tgtttggaag tccggtctcg tgactggaaa 1560 aagaccgtaa ccagaccgga ccaagaccgc aaaagaccgg acctgcggtt acggtctttt 1620 atttttgaaa tgtaaagacc gcaaaaagac cggttaacgt gaaccggtct tgaccggttt 1680 ggaccggttt tttgtagccc cttaaatacc ctttaaaacg cacccaagaa cgtatgattt 1740 ggtaaaaaaa cgagagagat atgaattaaa atgtaaatta 
cttttgggaa taatagatgt 1800 accagactta ttttatagaa gcaccaatac ctaacgggat ttaacgttct tgggctcgtt 1860 ttgaagctat tagagatgtt aataaattta ccgattcata attttaaaat attggcctaa 1920 agaccggtca taaaccggtt aaaactgatg cagaccgaaa ccggtctata gaccgcaaaa 1980 aaccgcggtc tgcggtcttt ttgcggtccg gtctggtctt ttgacttctg gggaaaaggc 2040 agaccggtta cggttacggt taacgcccct gggcatcaaa aaaccggacc gaaccggact 2100 ttcaaacact aagggtaaac ccctcaatcg aggtccagct ccgatcacca gcgccccagc 2160 cgtcgctgct atagtcgaag aatcaagtga accctcgggg gatgaggagg acgatgagaa 2220 cgaaaaatcc ggtattgttg caatgacttg gccctccgca gtcgcgctgg gcgattcaga 2280 ttctgacgag agcgtaagtc ctcctttgac cttcccgaat ctcttttggg atgcgcgcgc 2340 aattgggcgt gatggtttct tggttcctgt caaagcaatg attgacaacg gtgctcacat 2400 tgttctcatc aggcctgacg tcgtcgagaa acttggtcta gaacgaaagc aactccgtaa 2460 accacaagta atcaacgtcg caatgaagga tgatcagaaa aagaagaatt caaatcctgt 2520 attattatcg gactacgttt cgctctccct ttccacctta gataattcct ggacttcaaa 2580 acccgttacc gcagtgatcg cccctggact ttgcacaaat attctgttag gccttccgtt 2640 tctcgtacac aaccgtattg tcgtagacca tgaatccccg agcgcttttg taaaagagac 2700 ttcaatcgac cttttgaact ttgtaccctc tcctcgtaaa attgcacgtc gtgtgaaatc 2760 accgcgccgt aagcgactcg aaattagact actacatcgc gatctactta aggaactgaa 2820 atggaggtgc agcgtaatta aaagaaaact agaccgcatg atggatatag agagggacaa 2880 tgatgaattc attgcaatta attttgtagc agcagtaaaa gacagattgg aatcattgga 2940 acttcagcag aaatataacg cactagagga taaattgaag gcagaattta gcaaaacttt 3000 tgaaccaatt ccgcatgtcg acgagttacc tgatgatgtc tactgtcgaa ttaaattaaa 3060 agatgctaca agaactataa gcaagagaac atacggctgt ccacgcaaat accgtgaagc 3120 atggaagaca ctcatagacg gacatgtaaa ctcaggaaag attagattgt caaactcttc 3180 ctttgcttcg ccttccttta tcattccgaa gtcagatcct acagcgttac cgcgttgggt 3240 aaacaattac agagaattaa attcaaatgt ggtcattgac agtcatcctc tacctagggt 3300 tgatgacatc cttgcagatt gtgcaaaagg aaagatatgg gcgacgatcg atatgacaga 3360 ttcctttttt caaacaagag tgcatccaga cgatatccat ttaactgcag tcacgacacc 3420 ttttggcctt tacgaatgga cggtaatgcc catggggtat cgcaatgccc 
ccggtattca 3480 tcagcgtcga gtgacagcag cattgcggaa ataccttgga aagatctgcc atatatatct 3540 ggacgatatc gttgtatggt ctcaaaacat agaagaacat gagagaaatg tccggttgat 3600 tttgcaagcg ttgattgatg cgaaactata ctgcaataag aagaaaacaa agctgttctg 3660 tttcagagta aattttttgg gacatacaat ctcacaagat gggattgaag cagatgagaa 3720 gaaagcagag aagatagaaa actggcccgt acctactacg gccaccgaaa cacgcacatt 3780 ccttggtctc gtgcgttacc tgaatgcctt tttacccaaa ttagcgatcc aaagtgatat 3840 attatctgtt ttcacgacaa aagaagctga aaagaaattt cctgaatggt taccaaagca 3900 ccagttagca tttgacacaa taaaagcaat cgtaacctca cgagaatgcc tcacgtcaat 3960 taatcatgaa aacatgggga acaacaaaat ttttgtaaca acggatgcaa gcgatagagt 4020 ttccggggca gtcctttcct tcggcccaac ctgggaatca gcacgacctg tggcatatga 4080 ttcgatgaca ttcaaaggtc ccgaattaaa ttaccctgtt catgagaaag aactcctcgc 4140 tattatgcgc gcattacgaa gatggaaagt agacttactg ggatccgaat tcctagtgta 4200 caccgaccac aaaaccctcc tgaattttga cagacaaaaa gacatgtctc ggcggcagtt 4260 acgttggatg gaggaacttt caatctacga ctgtcgtttt gtgtatgtaa agggcgagga 4320 caattctgtt gcagacgcac tgtctagatt accctacaaa tacgtcgaaa agaatgagca 4380 atcaaacgca gaaaatgatg caagctaccc cttctcatac catgcagaac acccaatcac 4440 agtattcgct cccaaagaaa agccagcgat gtgtgcgatt gtagcagcgt tagtcgatgc 4500 cgcacccaag aactcattca gggtaacgat agacgacgaa ctcctggcaa aacttaaagc 4560 gagctataaa actgataaat ggtgtcaaaa gttattgcgt gccagtcaag gtctgcctag 4620 cgtccagagc aaggacgatc tgtggtatat tggtgaaaga ttgatagttc caaaagattc 4680 aggcttacga gaaataatct tccgcatcgc acatgacaac cttggacatt ttggtttcca 4740 caagtgctat gataacatcc gtgattcata tttttggccg aacatgcgca gagatctgga 4800 agaaaaatat attccaggtt gtcaagattg tcaaagaaat aaagcaccca ccacgaaacc 4860 tgtgggacca cttcacccat tgaaggtccc tgatggaaga tgtgaatcaa ttgcaatgga 4920 ctttataggc ccgctgccag aagatgaagg atacgattgc attttaacga taacagacag 4980 acttggatcg gatattcaaa taattccgac aagaacagat gttaccgcgg aagaactggc 5040 tattattttt attgataact ggtattgcga gaatggctta cctttagaaa tcgtgtcaga 5100 tcgagacaaa ttgttcgttg ccaaattttg gaaatctttg cacaaattga gcggcgtgga 
5160 actgaaaatg tcgagcacca accatccaga aacagacggt agtagcgaac gcacaaacaa 5220 aacagtaaac caatgtttac gttttcatgt ggagcgaaac caaaagggtt ggagaaaagc 5280 cttacctcga attcggtttt tcctgatgaa cacgattaat aaatccacag gatattcacc 5340 ttttcaattg aagtatggcc gctcaccccg gatattaccg catttcgaca aatctgaaat 5400 gaacgataac gacgatatcg acgcattgga aatagtggaa agaattcagc agaatgttgc 5460 cgatgctaaa gataacttga tgttggcgaa gatctcccaa gcatatttcg cgaatggcaa 5520 acggagagca gagataaagt acaaagtggg agacaaagtg atgttgtcaa cagcgaacag 5580 acgtcgagaa ttcaaagaaa aacctggtga tggaagagtc gcgaaattta tgccgagatt 5640 tgatggaccc tgggaaatta cagaggcaca tacagaaaca tcgacatata aattgaattt 5700 accacatact actaacatct tcccccactt ttcatgcgtc acagattaaa ccatttattc 5760 cgaatgacga cgaaatattt ccttctcgca agaacgcaga aattccggaa ccggtcctgg 5820 tcgacggagt actcgaaaac tacgtggatc ggattctcga tttcaagaag agaaacagga 5880 aaccttcata cttagtccgt tgggttggat ttggccctga acatgacgaa tggctcccag 5940 catcgatgct agaagacaac gaagccctgg atcgttggat cgaattcgga ggaacttcgc 6000 atttttcttc taaatcaacc tgacggtagc tttttcccac ggggtttttt aacgcaccca 6060 gtcagtttta cttacttagg taaaattttt ctctctcctc tattttttgt tgcgttggcg 6120 ttgaaatttt gtggaggggg agggg 6145 // ID Gypsy-50_MLP-I repbase; DNA; FNG; 5906 BP. XX AC AECX01001249; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-50_MLP_; KW Gypsy-50_MLP-LTR; Gypsy-50_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5906 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001249; Positions 71761 77666. 
XX CC Positions [4725-5198] - Integrase core CC 'AAGAA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 310..1407 FT /product="Gypsy-50_MLP-I_1p" FT /translation="MATMEDFNRLSQQLTDINNRLNLENAQRNDLAAKLAE FT ETNLRTQAEARVAQLEATQATTSAPSNTPPTQPAQTPTLPMPTPSAGPTLP FT TIKVATPDKYEGARGALAEAYASQVGIYIAMNKALFTTDDSQVMFAISYLT FT GEAIKWAQPFQQRILDPAEPKTNPVTYLEFTLAFESVFFDSDRQKRAEAAL FT RALKQTKSAAEYTIRFNQLAPTTKWELPTLISHYRQGLKSAVRIPTIRDQF FT DDLESITRLACAIDNDLRGEAADPATIARLSNPDAMDISSARFNISSDEYN FT RRMSGRLCFKCGKPDHMARWCGTKSREGGRGRFGNKGKVAELEAKIAALQV FT GLGNQSGSSSSAADLSKNGAARE" FT CDS 1452..5198 FT /product="Gypsy-50_MLP-I_2p" FT /translation="MELDSFNVDAINKNDPRVFTSISLSETPCTTSPKSCS FT FQAHALIDCGSTHVVLGKKFAERTGIPVSELPSVGEVYGFDGAPRAVAHDT FT NLFINDDKHKTNFLVTSIKDSYDTILGMPWLRKNGHRINWKTSTLKPHDSP FT KNIAAINTALSSPPTSPDCLEAQVGKTRERDKGALCDLEITPPRCESRNPI FT PPLLFDTDNKQDVTISQLQCMKTDHALHLPDTLQRPICLNGNPETIAASET FT ASSIPTTSPNRLEAQVGNARDSHEGALCITETKPPRCESKNPFWPLLLDTD FT DDQDLLPQHRCTETTNLPDPHETRGPVIKPPVTTIATTKVVSPTPKTTPNS FT HKAQLGTTRKCGEGALCNNEIKPPRCEYRNALDPARDSVDKLLTHNNRLKP FT SMCAASTSWNLSAKLAADDARDKPEKSAEELVPTRYHWYINMFQKSKAMTL FT PPHRRYDFWVDLIPGATPQASKIIPLSPAEETALNSMIDEGLAKGTIRRTT FT SPWAAPVLFTGKKDGNLRPCFDYRKLNALTVKNKYPLPLTMELVDSLQDAE FT NYTSLDLCNGYNNLRVKEGDEQKLAFICKRGQFEPLVMPFGPTGAPGFFQF FT FISDILREKIGRELAAYLDDLLIYTPEGVDHKQVVEEVLQILQSHNIWLKP FT EKCKFSKKEIEYLGLLISKNKVRMDPLKVSAVNDWPIPKNVNQIQRFLGFS FT NFYRRFISEFSKITSPLHDLTRDDIPFLWTKAQQQAFDTLKTSFTTAPILK FT IANPYKPFILECDCLDYALGAVLSQLDDKEVLHPVAFLSRSLVQAERNYEI FT FDKELLAVVASFKEWRHYLEGNPNRLEVTVYTDHKNLETFMTTKQLTRRQA FT RWAETLGCFDFHIQFRPGRQSTKPDALSRRPDLEPPAGEKLSVGSLLRPEN FT LSEASFNVNVDSVEAFFVDESIEHEDVEEWFEQDICQNRIEEKLEVDAFES FT TKDSPIWTDDEIMERIRETSKTDTRIQGLITTLKGPDVPEVEEVLQGYEVR FT DRVLYHKGIVEVPNDKTCKFEILRSRHDSVLAGHPGRAKTLSLVQRQYKWK FT LMKTYVNNYVDGCDSCIRVKPSSLSPFGSLEPLPIPAGPWTDISYDLITDL FT PISQGMNCILTVVDRLTKMGHFLPCTTEMSSSELADIMMASVWKLHGAPKS FT IVSDRGSIFVLKLTESLNQQLGIAIHPSTAYHPQSDGQTEIVNKAVEQYLR FT 
HFVSYRQDDWTNHLPLAEFAYNNSTHTATGFSPFKANLG" XX SQ Sequence 5906 BP; 1760 A; 1451 C; 1436 G; 1259 T; 0 other; cattgtagcg tctatctgtg acggactgaa ggaatcaaat tgaagtacaa gaagataaag 60 aagaagaaaa agttaaagaa atagaaaatt aagatagaag aaaaaatttt aagaagaagt 120 aaagaaagaa gaaaattgat tgaagaacaa gtaaaagcag taagaaagtt cattcaagta 180 gcaaccttgt agaagaacaa acccacaccc acacccacca caccgtcctt ccaattacca 240 accgaggaga ccgagtcgac tgatagcaat tcctggcaca acgctgatac ggtagacctg 300 agacccaaca tggccactat ggaggacttt aaccgactgt cccagcagct gaccgacatc 360 aataatcggc ttaatctaga gaatgctcag cgtaatgacc tcgctgctaa attagctgag 420 gagaccaact tacggactca agctgaagca cgggtcgctc aactagaggc aacccaagcc 480 actaccagcg ctccatccaa cacccctccc acgcagcctg ctcagactcc gactctcccg 540 atgccgacac ccagtgccgg tcctacgttg ccgacgatca aagtcgctac ccccgataag 600 tacgagggtg ctagaggggc attggccgag gcttatgcga gccaggtggg aatctatatt 660 gctatgaaca aggctctgtt cacgaccgac gattcccagg ttatgttcgc aatttcctat 720 ttgacgggcg aggctattaa gtgggctcag cctttccaac aaaggattct ggaccctgcg 780 gagccaaaaa ccaaccctgt cacctacctg gaattcaccc tagcgttcga gtccgtgttc 840 ttcgactccg accgtcaaaa acgcgcagag gcagccctac gtgcattgaa acaaaccaag 900 tcagcggcgg aatacactat ccgcttcaat caactcgcac ctaccaccaa gtgggagctc 960 ccaacgctta tcagccacta caggcaaggc ttgaagagcg ccgtccggat ccccacgatt 1020 cgtgatcaat tcgacgatct cgagagtatc acacggcttg cttgcgcaat cgacaacgat 1080 ttacgtggag aagctgccga cccggccacc attgctcgtt tgtcgaatcc ggacgccatg 1140 gacatttcat cggcgcgctt caacatctct tctgatgagt ataaccgtcg gatgtcaggc 1200 cgtctttgtt tcaagtgtgg gaagccagat catatggctc gttggtgtgg tactaagagt 1260 agagagggag gtcgaggaag gtttgggaat aaggggaagg tagccgagtt ggaggccaag 1320 atagctgctt tgcaagtggg attgggaaat caatcgggat cttcatcatc agcggcggat 1380 ctatcaaaaa atggcgcagc tcgggagtga cagatgtgcc acccccgggc cggagtgagg 1440 agaaatcaac aatggaatta gattcattta atgtagatgc aataaacaaa aatgaccctc 1500 gtgtgtttac ctctatttca ttgtctgaga ctccctgcac cacatccccc aaatcctgta 1560 gttttcaagc ccatgcgttg attgactgcg gctccactca tgttgtgctc 
ggcaagaaat 1620 ttgcagagcg caccggtata cctgtgtccg aattaccatc cgtgggtgaa gtgtacggct 1680 ttgatggtgc cccacgcgct gtcgcccacg acacaaacct atttattaac gacgacaagc 1740 acaagacaaa tttcttagtc acctccatca aagattctta cgacacgata ctcggtatgc 1800 cgtggctacg caagaacggg caccgtatca attggaagac aagcacgttg aaacctcacg 1860 actcaccaaa aaacatcgca gccatcaaca cggctttgtc cagcccgcca acatcccccg 1920 attgccttga ggcccaagtg gggaaaacga gggaacgtga caagggggct ctctgtgatt 1980 tagagattac gcccccgcga tgtgagtcca gaaatcctat cccaccgcta ctatttgaca 2040 cagacaacaa gcaggatgtt actatatcac agctccagtg tatgaagaca gaccacgcat 2100 tgcacttacc ggacacccta caacgaccaa tctgtttgaa cggcaaccct gagaccatcg 2160 cagccagtga aacggcttcg tccatcccga caacatcccc caatcgcctt gaggcccaag 2220 tggggaacgc gagggacagc cacgaggggg ctctctgtat tacagagact aagcccccgc 2280 gatgtgagtc caaaaacccg ttttggccgt tattgcttga cacagatgat gatcaggatc 2340 tcttgccaca gcaccggtgt acagaaacga cgaacctacc agacccacac gaaaccaggg 2400 gaccggttat caaaccacca gtaacgacca tcgccaccac taaagtggtg tcgcctactc 2460 cgaaaaccac cccaaacagc cataaggccc aattggggac cactaggaaa tgcggcgagg 2520 gggctctctg taataatgag attaagcccc cgcgatgtga gtacaggaac gctctagacc 2580 cggcacgaga ttcagttgac aagcttttaa ctcacaataa caggcttaaa ccgtccatgt 2640 gcgctgccag cacgtcctgg aacctgtcgg ccaaactcgc cgccgacgat gctagagaca 2700 aacctgaaaa gtcagcagaa gagttggtgc caacgcgtta tcattggtat attaacatgt 2760 tccagaaatc gaaagccatg actctacctc cacaccgacg atacgacttt tgggttgatc 2820 tcataccggg ggcaacacca caagccagca agataattcc tctatcgccc gccgaagaaa 2880 ctgcgctaaa ctccatgatc gacgaaggtt tggccaaagg gacgatacgt cgaactacct 2940 ccccctgggc cgccccagtc ctcttcacag ggaagaaaga tgggaatctc agaccctgtt 3000 tcgactatcg gaaattgaat gcgttaacag tgaagaataa gtatccacta cctctcacta 3060 tggagttagt ggatagctta caagacgcgg aaaattatac aagtctggac ctttgtaatg 3120 ggtataacaa cctgagagtg aaagagggag atgaacaaaa gttagccttc atatgtaaaa 3180 ggggacagtt tgagcctctt gtcatgcctt ttgggcccac aggtgcaccc ggcttctttc 3240 aattcttcat atccgacatc ttacgcgaga agataggaag agaattggct gcttacctag 
3300 atgacctgtt aatctacacg ccggaaggag tggaccacaa gcaagtagta gaggaagttc 3360 tccaaatcct gcagtcacac aacatttggt taaaaccgga aaagtgtaaa ttttcgaaga 3420 aggaaattga atacctcggc ttactcatat caaagaataa agtccgtatg gaccctctca 3480 aagtatctgc agtgaatgac tggccgatcc cgaagaacgt gaaccaaata cagcgattcc 3540 tgggattttc aaacttctac cggaggttca tcagcgagtt ctccaaaatc acaagcccgc 3600 tgcacgattt gactagagac gacatacctt ttttgtggac gaaagcacaa caacaagcat 3660 ttgacacact caaaacctct ttcacaacag ccccaatcct gaagatcgca aatccctata 3720 aaccctttat ccttgagtgt gattgcttgg actatgcgct tggtgctgtc ctctcacaac 3780 tcgacgataa ggaggtgtta catccggtgg cattcctgtc gcgatcccta gtgcaggcgg 3840 agcgaaacta cgagattttc gataaagagt tgttagctgt ggtggcttct ttcaaggaat 3900 ggaggcatta ccttgaaggt aaccccaacc gactggaagt tacagtttat acggatcaca 3960 aaaatcttga aacttttatg acgacgaaac aacttacgag gaggcaagcc cgctgggctg 4020 agacgctggg gtgcttcgat ttccatatcc agttccgacc aggccgacaa tcaacaaagc 4080 cagacgcgtt gtcgagaagg cccgacctag aaccccctgc aggagagaag ctgtcggttg 4140 gtagcttgtt aagacccgaa aacctatcgg aagcatcgtt caatgtcaat gttgatagtg 4200 tggaggcctt ttttgtggac gagtcaattg aacacgaaga cgttgaggag tggtttgaac 4260 aggatatttg tcagaatcgt atagaggaga agctggaagt cgatgctttt gaatcaacaa 4320 aggactcacc tatttggact gacgacgaaa ttatggaacg aatccgagag acatcaaaaa 4380 cagacacacg tattcaaggc ctaataacaa cactgaaggg acccgatgta ccagaagttg 4440 aggaggtgtt acaaggctat gaagttcgag acagagtgct gtaccataag ggtattgttg 4500 aggtaccaaa tgacaagacg tgcaaatttg agatcttgag gagcaggcat gatagtgtcc 4560 tagcaggtca tccgggaaga gcgaaaacgt tgagcttggt gcaaaggcaa tacaaatgga 4620 aactgatgaa aacttacgtg aataattatg tagacggttg tgactcttgt atacgagtca 4680 agccctcgag tttatcaccg ttcggatcct tagaaccttt acccatacca gcgggcccct 4740 ggaccgatat cagctatgat cttattactg acttgcctat ctcacagggc atgaactgca 4800 tattaacagt cgtagaccga ttaacgaaga tgggtcactt cctcccctgt acgacggaaa 4860 tgtcctcaag tgaattagcg gatatcatga tggcaagcgt gtggaaactc catggcgctc 4920 ctaaatcgat agtatcggac agaggcagca ttttcgtatt gaaactcacc gaatcactaa 4980 
accagcaact tggtattgcc atacacccgt cgacggcgta ccacccgcag tctgatggac 5040 aaacggaaat tgtcaacaag gccgttgagc aatatctgcg tcatttcgtg agttaccggc 5100 aggacgactg gactaaccat ttacctttgg ccgagtttgc gtataacaat agcacgcata 5160 cggccaccgg attctcaccg ttcaaagcga acttggggta aatcccagcg acagagcgat 5220 gcataccgga ggtggaggag agactaaaga acatagaaga gattcagagt gagctaaaag 5280 agaacttaat gagagcacag gaaacgatga agagaaatca cgacaagaag gtacgaccta 5340 ccccggagtg ggatgtcgga gacgaagtat ggctcagtag ccgacacatc tcgacaaccc 5400 gaccgtcagc aaaactagac catcggtggt tggggccgtt cagtatcgaa aagaaagtat 5460 cgacatcggc ctataagtta agattgccaa gcagtatgag taaagtccat tcagtttttc 5520 atgtttcggt attaaagaaa cactcaccag acatgattga agagagacag caagcaccgc 5580 cgtatccagt tgaaatagag ggcgaagaag agtgggaggt ggaggctatt ctagacaagc 5640 gaatgaggaa taggaaagtc aaatacctaa ttagctggaa aggctttgga agatcagaag 5700 attcgtggga gccagcggca aacgtgacaa actctaaatc attgatcgac gagtttaatt 5760 tgaaatatcc gaaggcagca gaagaataca ggcgatcaag gcgtatgtga gaggggcgac 5820 gctttttccc accgggtttt ttaatgccag ccccggggaa gaacgcaggg ccgccaagag 5880 ggagcctggg cgtaaacggg ggatag 5906 // ID Copia-1_GDe-I repbase; DNA; FNG; 5436 BP. XX AC AEFC01001314; XX DT 12-MAR-2011 (Rel. 16.03, Created) DT 12-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Geomyces destructans genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_GDe_; KW Copia-1_GDe-LTR; Copia-1_GDe-I. XX OS Geomyces destructans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Leotiomycetes incertae sedis; Myxotrichaceae; OC mitosporic Myxotrichaceae; Geomyces. XX RN [1] RP 1-5436 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Geomyces destructans genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AEFC01001314; Positions 5316 10751. XX CC Positions [1810-2307] - Integrase core CC 'CTCCG' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 197..1105 FT /product="Copia-1_GDe-I_1p" FT /translation="MEYAERPRMTKLSGRNFRNWAKQLRLTLKDRKVLSAI FT EPDCKTDDDVDTAPAKDELPELAQARKDQIADRRNTRACSIIIECCIQSIV FT DKIIKFDTAKEMWDYLHKTYARDGLQQLIAKKDTFNSYSPSKTMPVSDVAS FT TLDDFEEDIAQINPNERPTELTKTGHLIRIMLAHGGKYDIAAAQIKAAKVS FT DYQDIIANFADIEEQIKESKAVIESARQASTSTENTRGNSGRAAAEDTEGG FT EHRKVITAHVITAGRPATSSETAPRKEEENQSLLGHRLARSLHQAVERGYH FT PNRLRKPTLLE" FT CDS join(2396..3232,3236..4165) FT /product="Copia-1_GDe-I_2p" FT /translation="MRYGLRWLDTCQPRSSTGYMIRSQTGSSLQHRRFVED FT QRLDLPDASNGPHEDVGFDPMEAVPQGDISIDAGDISNPRWDTPIGGLENP FT CEDIPVMDSGRSRDLSARVPRTGRPAPTANPANQEAASDNLSLAQQHQHGD FT DGSDYAVRRRPVASEVSCDGSPLGPATSPSPVLMSGPDDNSRFDREADRSG FT DIFAEDEELSETADDQLCAEAEASREETPHDEEFIARRSRRARQPTRPYEE FT AREALFEEELGAHHAASSKLPIPDRYSKAISDPEHGADTLAVQEELAQLQS FT LGTWERAKLPRGKKALGCRWVFAVKYTPTGLLDRFKARLVAQGFGQIPGDD FT YLETFSPTIRAESLRLLLAIGAYEDMNIRQIDVVSAYPQSELHAEVYMRPP FT EGLYCPEDCVLRLRKSLYGLKQSGREWYIEACKGLGQLGFQPVFREPSIFV FT TPDRRILIGLYVDDMLILSKESSDIDRVVKGISKRWKIKDLGEVNMILGIR FT VTRDRKNHRIYLDQEGYIDEVIKRFHLGMATAFPIPASTDQSQLLKGNDSE FT AEADQHLYQRGVGSLAWLAICTRPDIAYAWGQLSQSCSRPTI" XX SQ Sequence 5436 BP; 1438 A; 1410 C; 1480 G; 1108 T; 0 other; ggttatgagc ccgggtcgct caggagcttc ggaatactga accaggaatt gattccgaaa 60 atctcacatc gataaagtga gaagctgagc aaccagtcag aattgcccag ccaaaaaagc 120 ctgcaagagt aagccatcgt ttggctgcag caaaacccgg tacaacaagc ctggaaaagt 180 gataactcca ttgaatatgg agtacgccga acgccctcgg atgaccaagt tatccggccg 240 aaacttcagg aactgggcaa agcagcttcg acttaccctc aaggatcgga aagtcctgag 300 cgccattgaa ccagactgca aaaccgacga cgatgtcgac acagctcctg cgaaggatga 360 gttgccagag cttgctcaag cccggaagga tcaaatagct gatcgaagga acacccgggc 420 ttgctccatt attatagagt gctgcataca gagtatcgtc gacaaaatta tcaaatttga 480 tactgcgaag gagatgtggg actacctcca caaaacctac gcccgtgatg gccttcaaca 540 gctgatagcg aagaaagaca ccttcaacag ctactcgccc tcaaagacta tgccagtgag 600 cgacgtagcc agtaccctcg acgactttga agaagatatc gcccaaatca accctaacga 660 gcgacctacg gaactgacga 
aaaccggtca ccttatcaga attatgttag cccacggagg 720 caaatatgat attgctgcag cccagattaa ggctgccaaa gtatcagact accaggatat 780 catcgccaat tttgccgata tcgaagaaca gatcaaggag tcgaaagctg tcatcgaatc 840 agcccgacaa gcgtctacga gtactgaaaa tacgagggga aatagtggaa gagcggccgc 900 ggaggataca gaggggggcg agcaccgcaa agtgataacc gcgcatgtta tcactgcagg 960 aagaccggcc acttccagcg agactgctcc tcgaaaagaa gaggagaacc aaagtctact 1020 gggccatcga ctggcccgct cgctgcacca ggcggtggaa aggggttatc accccaaccg 1080 tctgaggaag ccaacgctgc tggaataagg accaccccag taacggaaac ctcttggatt 1140 gcttctatca aggatccaag cagtgtcact tctatcaagg atccaagcag tgtcacttcg 1200 aactgggttg gagaagcaag agaccaaggc gactcctgga tcatggattc gggttgctca 1260 cgccacatga cattcgcaag aaaagcgttc gtcaattact atcctcttgg caagccaatc 1320 gctgttagat tggctaacgg aacagagatc caagctgtcg ccgaaggcac tgttagcttc 1380 gatatagctg tcaacggtgt caaacgccgg atccagcttc acgaggtgct gcatgttccg 1440 cagctcgccg gaagcctcgt ctcagtaccc caccttcaag ataggagtat tatgacaagg 1500 acaacccgcg agggaaagat ggtccttgag cttgacgaac agactatcgg cgtcgccgtt 1560 cgagttggcc gttcctacgt ccttaacgga acctaggaag gcaccatcac tgcgctgcgc 1620 gcagtggcga tgtcaccaaa agatgccatg atttggcatc aacgcttcgg acacttgagc 1680 tcaaagagtc tgagcttggc tcacactgct gttgacggtc tgccaggacc aattggtgat 1740 ctggcggacc cgtgtggaga gtgcctgctc aataagagta tgcgggttat aaattgacag 1800 gcatcagagc atgcaaaagc ccccctcgac aggatacata gtgatatctg ggggccttac 1860 agagtaccgt caatcagtgg taagaacgtc tattttgtga cgttcactga cgactacaca 1920 cggaagactt gggtctacgt tatgcagtca aggggccaac tacgaggtat ttttactgag 1980 ttcaggactg aggtgaaaca ccaaaccgac aggaagatca agatcgttcg ctgcgataat 2040 ggcagcgagt acgaagctct tgagcgagac tttgggccct cgcacggtat cctgttcgag 2100 tttacaatgc catatacgtc atatcagaat ggtgtttctg aacgcctaaa ccgggccctg 2160 gtcgcagtga tcagggcgat gctcgcaggc gcgcagcttc caaaatggct ctgggctgaa 2220 gctgtgatgg cagcctccta tctgaggaat aggctgccaa ttggacccgg aggaaagaca 2280 ccggaagagg cgtacagcgg taagagaccg tctgtcgctc accttcgtgt ctggggctgc 2340 gttccctacg ccaatctctc attggatcag 
aggcagggtg acaagcttgc acccaatgcg 2400 atacggactg cgttggttgg atacatgcca acctcgaagc agtaccggtt atatgatccg 2460 gtcgcagacc ggatcctcac tgcaacatcg gagattcgtg gaagatcaac gacttgatct 2520 tcctgatgcc agcaacggtc cccacgaaga tgtgggattc gaccccatgg aagctgttcc 2580 ccagggggat atctccatcg atgcaggaga tatcagcaat ccccgctggg ataccccaat 2640 agggggcctt gaaaatccct gtgaggatat cccggtcatg gattcaggca ggtcacgtga 2700 cctgtcagcc agggttcccc gcacgggacg acccgccccc acagccaacc cagccaatca 2760 ggaagcagca agtgataatt tatcacttgc acaacaacat caacacggcg acgacggcag 2820 cgactacgct gtgcggcgac gccctgtagc atcagaggtg agctgtgacg ggtctccact 2880 gggacctgca acgtcaccgt cgccagtgtt gatgtctggg cctgatgaca actcacggtt 2940 tgaccgagaa gctgatcgat caggcgatat ctttgctgaa gacgaagagc tgtctgagac 3000 agcggatgac cagctgtgcg cagaagcgga agcttcgcgc gaggaaaccc ctcacgatga 3060 ggagttcatc gcccgacgaa gtaggcgggc aagacagcca acgaggccgt atgaagaagc 3120 gcgcgaagcg ctcttcgaag aagagttagg agcgcaccat gcagcgtctt ccaagctgcc 3180 gattcctgat cgctattcga aggcgatcag cgacccagaa cacggtgcag actagactct 3240 tgcagtccag gaagagctgg cgcagcttca atccctcgga acatgggagc gagcgaagct 3300 cccgcgaggc aagaaagccc ttggttgtcg atgggtattt gctgtcaaat acacaccgac 3360 agggttgctt gacaggttca aagccagact cgtcgctcag gggtttgggc aaataccagg 3420 cgacgattat ctcgagacct tctcaccgac aatccgcgct gagtctctcc ggcttctcct 3480 agcaattggt gcttacgagg atatgaatat ccggcagatc gacgttgtga gtgcttatcc 3540 acaatcagaa cttcatgcgg aggtctatat gagacctcct gaagggctct actgccctga 3600 agactgcgta ttacgactac ggaagtcact ctacgggtta aagcaatctg ggagggaatg 3660 gtatattgag gcctgcaaag gactcggtca actaggattc cagccagtct tccgcgagcc 3720 tagtatcttc gtgacgccag accgaaggat cttgatcggg ttatacgtcg atgatatgtt 3780 gatacttagc aaggaatcat cagatattga ccgtgtggtc aaagggatta gcaaacgctg 3840 gaagatcaag gacctcgggg aggtcaatat gatacttggt attagggtga cccgcgatcg 3900 gaagaaccat cgaatctatc ttgatcaaga aggatatatt gacgaggtta tcaaacgctt 3960 ccacctagga atggcaacgg cctttccaat tcctgcgtcc accgatcagt cgcagctgct 4020 caagggcaac gacagcgaag ctgaagccga ccaacacctc 
tatcagcgag gagttggatc 4080 actggcctgg cttgctatat gcactcgccc ggatatcgca tacgcctggg ggcagctcag 4140 tcaaagctgc tcgagaccta cgatatgaaa ttggaacggg gttcttcacg tattgcacta 4200 catcaaagga actaggactt tacgactttc gtttgggggg atagggggca attctacccc 4260 ctatttacaa ggatacagcg actcagacta cgctggtgac cagactgatc gacactcagt 4320 atcaggccaa atattcatgt tgaatatggg gcctgtcagc tggaattcga cgaagcagcg 4380 ctgcgtcgca acatcgatga ctgaggctga atatattgcc ctctgcaaag ccagcaagca 4440 aggacagtgg cttcacaccc tcctgcacga acttggtcgc aggaagctcc ttggtggccg 4500 tggatatcaa gtccagatcc tcagtgacaa tcaggctagt cttgcgatcg ctgcggaccc 4560 gatgtctcac cgaagaacga agcacattga cgtccgttat cactacgtgc gtcaattgat 4620 tgccttcaag aaaatgagcg ttgactatat accaactcaa gatatgttgg ccgacgtcct 4680 caccaaaccg ctggcgcaac cagcgttctc gcactgcact cagggatatc tgatgcagca 4740 gacttaaaag acacctgaac ccccgtctcc cccgctcaac ggaagaggac agtgacccct 4800 gacaccctcc gcatggcatg cgagagccgt ttgccgcagg cgctgaattt ttggggggcg 4860 ccgagggctc ggcgccctat tttggatata tcagctcttg gcgtgttact gggcgggcgg 4920 gagagggggg gggtgggcgg gcagaagcgg cgttggaaac gggttgaaaa aggggtataa 4980 gaacgtggtt cccggccccg aaaaggcgaa caagacgaca tcaaggccgc gccccattca 5040 cattgggggc tggccttgat gaggccacga agaaaagtgt ggtgcgatgt ggtgaaccac 5100 caatgccgca tgaagcggcg acaagaagca actcggcgac aacactgagc gcaagaagga 5160 gcaggtccaa caccgagtct tcgtctctgt ccagccaggg acaaaagaaa atgacccagc 5220 ggacgaaccc cccccccccg agctgccaga aatcaagatt gatttgagaa agaaagaggg 5280 ggagagcttc ttcgaagaag cttttatatg attctcgctc attattgagc tggttgccgg 5340 tttcgacggc gagggtggtg gtggacttgg tgattttcca gtccaccgac caatttcctt 5400 atttttatca cttttctacg agctacgtaa gggggc 5436 // ID Gypsy-76_MLP-I repbase; DNA; FNG; 18348 BP. XX AC AECX01001117; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-76_MLP_; KW Gypsy-76_MLP-LTR; Gypsy-76_MLP-I. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-18348 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001117; Positions 83393 65046. XX CC Positions [17301-17615] - Integrase core CC 'ATGGC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1547..2713 FT /product="Gypsy-76_MLP-I_1p" FT /translation="MSQFPRATQPFPFDALFLIDSGATHNVLSESKATETN FT IIRLADHTTRVVTGFDGSRSQSSHEINLFMKNDSSLTNFIITKIKDSYDGI FT LGIPWIRQNYHLINWSTGEIANTHTDVAAIEALSRPTPPLQDLDMEPVREA FT RQSDKGMCIAFDTLTSPQRAYDSVSSIHSFETVGDHRIPMNSQHTTRTTTS FT NDDQHTKPGIAAVNSASSNPKTTPEDPDVMLTGKARNLDEGMCALDGKLTS FT PQCEFDNETNPLSPETASKLLLIPNSQIPPSNDVILEDTKNSPVLETSIAT FT VATVSSALKKPPTDPILRPTGHARKLDEGMCVDTDTLTSPRCESDIPLPST FT HFVRKPLFTARKKKPEAPFFWGITRWYPRVPESFCAKHGPDGLLPG" FT CDS 6551..9139 FT /product="Gypsy-76_MLP-I_2p" FT /translation="MQSPKIGKPFYEKDAGIMYLNAVARYQGYVLSVKSHK FT TDMHRETVRIYLECSRAAKPTKKKKKNKSTGVKDNEDDHDDADHNEADEPK FT EKRASSSKKTGCTFEVSLNFHASKGRWFVTSDADRNPLKAKHNHPAFEHLY FT EESRYRQINPENDKIISFLTSAGVRPRNIRPILKVQSGTDVLVRDIYNSRA FT RQKKCWLNGRTPIQGLFDLVTKQKWAFRTVTSDNGTLQSFFFASPEAIQLA FT RRFPTVLGIHCTYKTNRFNLPLLHIVGTTNAHKSFTVALCFLHSETEDAYE FT WAMQQLKSIVFDSDETVSLPRTFATDAEPALFNAINTIFPDSVHILCIWHI FT NENIGKNCKKNFKTNDEWEDFMKVWNQFTCSTTPEEYDINLKALTVLAKKH FT NILEYLERNWLHRTNKFVSLFTSSTPHFGNSSTSRVEGSHFGLKTFLEGRN FT NDFLTCFNAFKRHGEHQYNEIMISFGIEKGNTLQDIPPFLKDLHGVISHFA FT IKDIAAQYEKISQTQLQPDRPCTKTFRTAWGMPCSHDIAAAREVSDTLSPD FT LIHDQWRLVMDIAPHSRQGLHNAARSKFNALLNLPEHSLRKLYEEVDLIKT FT GRYSLVPILPPEVKHNNRGRPRTGSQKQPAVSKNGRWKSQHEVEETKRKKK FT EKAKRRRLEEDMTREEAMLDEDSEDQGVSESHSKPEGVTKESDTPSTLSNP FT INGPSTSLILIATPQSSNPIPTTKLYKCSKCRQPGHRANKCPVNQSHSQPT FT DLFNATQPQPDDFFDATQPTDLFATQPTNDCEQGLFFQEESEHNKALLLFT FT EDSESQSGSDTENDEALCPFCDEEMPANPSDKLLSLRAELLAMPEITQGIG FT 
RSGSRSLPVS" FT CDS 13518..14573 FT /product="Gypsy-76_MLP-I_4p" FT /translation="MRCLLSYGYKGFEVIVTKLQLVFLSDSSDAQLNTKNT FT APLGPDLYIRRVLLPELSLSLIAEDLGLPRTHHKVNATLEASRKFGAWVFP FT IEYEYEYKNTSSTPNNINNSKSLHSNPLDHSNPLDQSNPLNQSNTLDKSCL FT VQPKDQLILKQLPVVIVPYIKSIHDPPPHGNCGFYAIAASMNMYTEDAYLV FT IRREMLAELLSHQEDHTKLISTPSQRPRTRKQQSDAEEVVQDLMRRLDHEG FT ISCTDEYWLRMPSLGFVIASAFNRPVLLFNPKRSNCYTFFPYRTPPNKQAP FT IVLAFVDGIHFVSLGLGHLLVPFPEVYGQLRNAAILETTLVPQWIKEYKAE FT LERWTSSHH" XX SQ Sequence 18348 BP; 5850 A; 3873 C; 3622 G; 5003 T; 0 other; tattgtcgga tctcttcaat taggaatcaa ggaagaagaa ctagattaaa ctcagaagaa 60 accgattaga taatttaaag ttaaacttac tgaactgaac tgagaccaag attaagatta 120 ctgaatcata caatcgactg aatcgaagta ttgagaactg aattgaatta tcaagattag 180 atacaattga aacttagatt gaactgaaaa cttacttcgg aacgaacgcc acaacatcac 240 catcttacag aacccccaca ctcggagacg ctgactccga cgacgacgaa actgaaacca 300 ctcacagtca cccattcgtc gattctgtaa ctcatcttat tactgatccc gctgtcggaa 360 tggaagatat acagagacaa ttaaatgaac tgtcggcatc actagcaaac aaacgcaacc 420 ttcgtgaaca agctgagcgt cgatcaaatg aagccgaagc aagactcgct gctattgaag 480 cgagtgctcc taactctcag caatctactg ctgcgcctcc tcctgcccca acttacactc 540 accaggaagt cgcacctgtt cccaaagggc caaaggtagc aacgcccgat aagtttagcg 600 gcactcgagg aggtcctgct gagatctttg caacccaagt ccagttatat ctcctggccc 660 atccgtatca attccctgat gaccgcagta aagtcgtatt cacggtttct taccttaccg 720 gtgcagcaag cagctgggct cacccgttaa ctttggaatt actggataac tcaacatctc 780 atttggtcac attggattgc tttatcacta atttcaaagc aatgtatttc gacacggaaa 840 ataaatcaaa ggcggaacgt gcgttgcgta ctttaactca gaaaggatca gtggccacgt 900 aaacacatga attcaacttg tacgcgactg ccacgggttg ggaggttcct actctcatca 960 gccactacgg acagggattg aaaaaagaaa ttagagtcgc gatggttatg gttcaagaaa 1020 aattcactag tattgaacag atagcgaacc ttgcaatcaa actcgatagc aaaatccacg 1080 gagctactag caccgccaca acctttcatg ttcctaccgc tgatccaaat gcgatggaca 1140 tatcgtcagg tttcgttaaa ttatctgacg aagagcgtac tagacgacta aggactgggt 1200 cttgttttca ttgcgctggt caaggtcata tagctgcaga ttgtcctgtg aagagaaatg 
1260 agaggaaggg aaagaagcgt ggaggtttta atggtcctag tagtttcaga cataagatag 1320 cagaattaga agctaaggtg gcagcgatgg gatcttcagt tgtgaatcaa ttagatcaag 1380 tagaagggag cagtaaggcc gatcaatcaa aaaatggcgg tgctcaggag tgagtgtgcc 1440 tatcctgagc caccagaggg aattagatga cataaaatta ggagcgagta gatttcattc 1500 atgcaattta aatgatccac gtctgttttt gcgtacctca atttcaatgt cccaattccc 1560 tcgagccaca caaccttttc cctttgatgc cctgttcctt atcgactctg gtgccacaca 1620 caatgtgttg agtgaatcca aagccaccga gacaaatatt attcgattgg ctgatcacac 1680 taccagagtg gtgaccggct ttgatggatc aagaagtcaa tcctctcacg aaatcaattt 1740 atttatgaag aacgactcat cactgacaaa ctttattatc acaaagatca aagattctta 1800 tgatggaatt ttaggcatac cctggatcag acagaactat cacctcatca actggagtac 1860 tggagagatt gccaacactc atactgacgt tgcagccatt gaggctttgt ccagaccgac 1920 accaccctta caggaccttg atatggagcc cgtgagggaa gctaggcaaa gtgacaaggg 1980 gatgtgtatc gcatttgata cgttaacatc cccgcaacgt gcgtatgatt ctgtctcctc 2040 gattcattct tttgaaacag tcggcgatca ccgtattcct atgaattcac agcatacgac 2100 gagaacaact acttcaaatg acgatcaaca caccaagcca ggcattgcag ctgttaactc 2160 agcttcgtca aatccgaaaa ccaccccgga ggatcctgac gtgatgctaa cggggaaagc 2220 taggaattta gacgagggga tgtgtgcctt agatggcaag cttacatccc cgcaatgtga 2280 gttcgataac gaaaccaatc cattatctcc cgagacagct agcaagcttc ttttgattcc 2340 aaattcacag atcccaccct cgaatgacgt gatattagaa gatacgaaaa acagcccagt 2400 actagagacc tccatcgcaa ctgttgcaac agtttcgtca gctctgaaaa aaccccctac 2460 ggatcccatt ttgaggccta cggggcatgc taggaaactt gacgagggga tgtgtgtcga 2520 tactgacacg ttaacgtccc cgcgatgtga gtccgatatt cctcttccat ccacacattt 2580 tgtaaggaaa ccactattta ccgcccggaa aaaaaaaccc gaagccccct ttttttgggg 2640 cattacccga tggtacccga gagtacccga atctttctgc gctaaacacg gccccgacgg 2700 gctgctcccg ggctgagaat gcgggcagaa cggcgattca cagcccatgg gggcttatca 2760 ccaccagacc gggcttaggg tgcgggctac tagccctcat atgaatgata ccacgcatac 2820 agatacctca ccatccattg aaacttcatc atcacatcca tcacatccat cacccaatcg 2880 aaccgaccag ggaaacaaac gttgtggaaa gcaattccta tgatcacgtc ttaatctcat 2940 
atcatttttt ttattattac attagatcaa ctcacaacag aaaaaagaaa agatccataa 3000 atcacactct tcaattttta tcaaggcatg tggactcaag acaaataatc aaaatcatca 3060 tttcaattca aattatcact atatggatct tgttattctc aatgaacatc aatcttacat 3120 tcaaaatggt atattttata cagacataga agttcaaata aaaacaatac atacaaaaaa 3180 gataaataaa tgagatacat acataccttg gaaaaaaaaa gagggtcaaa aattgaaaat 3240 gtcaacattg atgttacaca aataagatta gaattgatga atcaagaatg aaaactcacc 3300 tagtagatta atcaatcccc aaacagttga tctacaagaa agaagtcaag aatcgaaatt 3360 ggaatctatg cagctatgaa ctaacattaa agcaagtcag gagaaagtgg agaagtcagt 3420 ttaaatgatg caacagaata gaagaaaaaa aaaacttacc taaatgatca agaagtcgaa 3480 attaaaattg aagtcaagag agttgaaaat tgattaagaa ggcctacaat caaatttaaa 3540 tgaattaatc ttaaacaatt gatgcaaagt tcaaaatcga atgacagaca aataaagaaa 3600 caatataact tgtcattatt gtgtaagaga gtggagaaaa atgttaagaa cagacctcaa 3660 aatcaaaatc tagatagaca aagatgaaag agaaagtaac aaaagactta caagatagat 3720 cagagctcaa gatcgaaatt gaattgaatg tttatgaaga ttgtagttga aacaggctga 3780 gatcgaaatg gagttggtgt agttagagac tgaattgatt gatattgaag tgaaggtggt 3840 tagagttgag tagtgatcga gatagagggg tcctatggaa gcttttgttt caacttatca 3900 aatagatacc tagaagcgaa atgaaaaggg aatcaaaatt ttaaacaaag aagataaagt 3960 tagtaaacaa gaacaaacaa agataaagaa atgaaagaac ttacttaaat gaagatcaag 4020 atcaaaggtc agaatcgaaa ctgaatccat caatctattc atcagataca ataagaatag 4080 aaaaaaaagg aaacagaata gacattcgat tttgaggtct gaagttaaaa ctaatgaaaa 4140 acttaataga aggatcgatt ttgagggttt aaattgagat agagaaagag ggtggtgtca 4200 gtgttgtgca agatgaatag tagaaagaaa gttaagagag aaaagactta catatttatc 4260 aagatctaaa tcaaagatca gaatcgaaat tgaaactaaa taggaattcg attttgaggt 4320 ccaaagttaa aactaatgaa aacaaagaaa ggaggtaaag gagtcagtat gagaacaaac 4380 aaagaagtga atgggaaaag aaacttactt aaatgatgtc aaaatcgaaa tctaaataca 4440 taaagtcaag gatgggatag tgtagaagaa gagaaaagaa atcaaactta cattgtaaat 4500 cgatagtgtg aagatcgagc tggagattga ggatggagtg cgatcgagag gtccagatcg 4560 aaagagagtg gaggaggttg aggtagaggt ggagacgagt gaatgaataa aataaaacta 4620 aatcgaaaat 
cgacggaaat ccagttgtgt cgtgacttac aagtgaagtc cgatcggagg 4680 tagtaagtgg aggtcaaatt gaaatttcaa aagaagaggg gtcttgttcg ttcgtgtgtc 4740 tgtggaaaga atcgcatcat tttcattctt ttttgggatc catcggaggg atcatcgtgt 4800 tttttttgtg ttatttgcgt atcagtgttg atcaaagagt gttctaagtt tttttttaaa 4860 gtgttctagt gttctgattg tcgtatcagt gttcgtgtct gggtctaagt gttataatcg 4920 tgtcttcatg tgtccttgag aacgtatcag ttttttttag tgttctatca aagaattaga 4980 atatcatctt tagttagtgt gtccgattaa atcaagttct ttgagtatta aaagataaga 5040 aagatagatg gagaaaacat aacctctatt gttgtttaga ttatcatttg tagtgattag 5100 tgtcttgaag aggatcattg tgtttttttt tcttatattt tctgtttgct gagctcggtc 5160 agaaatcagt catcaactcg tcaggttaga ttacagtttg aaattacctc agaccgggtt 5220 gagtcagggc tgtagttgtt acatttacaa tttttacaat ttcactttta cattaaaact 5280 atttacaatt tcacttttac attaaaacta tttacaataa aactcaacat caacatagtt 5340 ttgtatcacc ctctcatata gtgtagtgtg gggtaccgtt ggaatccgcc agatcttgcg 5400 gatcttctag tgatattaga atagtgttac ggattagatt tgagaaggta gaaatttgtt 5460 agagaattgc ttccgtgatt tttttatttt tttgaacttt tatttaaagg ttgagggcag 5520 tctacgacaa ccatagtttc acctgtgtgg ttattctttc tgcgcggtga ttcttcccct 5580 ctccggctca gcctacgcta cacagcgtag gagttctaca acgtgtatcc accggatttg 5640 catgaagatg attttgctta cgatggtaag caattttccc tacattgcac ttttcacttc 5700 ctctagtcat actaacccca tcgtatcttc tatctctatc gtatcctctt cagctgtcct 5760 caacgctccc tttcagctgg actaacccct ttcaaaccgg caacaacaaa ccctctgcgt 5820 tcaccatcta tataatgaag cgaagtcgca aaggacagac cctgaacgat gaccgaccga 5880 agggttcgct gaacccaact ctcacttacc ctatgccaca acccagctca acactaccga 5940 ttacattacc aacaaacaac tctgttatcc accaccacgt ctatgtccca tctcaaacta 6000 attatgtagc tggacctttg gctcaacccc acgtctatat cccgtctcaa acgaattatg 6060 tagctggacc ttcgcctcaa ccccatctcg tgtggcactc aactcctggc ttcccatacc 6120 caccatcgac tcagtttgtg ttttcgaatc cccctggaac accatctttt ccacccactg 6180 tgtcacaaat acaacctcag catctcgatt tgtcaagagg tcaagctcaa gctgtatcaa 6240 gcggaacaga tatcctttca cttcctatgc actcgatcac gcgtgtcaac aatgcgaacc 6300 ttccatcatc acacttgaac 
cctactagta ccacctgtga aagcgatagt gttgttacaa 6360 atgaagtacc acctgaatta aacgagagta gcatcttaca agagtttgat ttggccaacg 6420 acaagcgaga caatgaatgg cctcctattg gtgagttcag ttcgcttttc tttttcttta 6480 ttcaaatcta tgatagttat tcgatctgat tcctcatcaa ctcaaaacag gagcggctcc 6540 agttcggttg atgcaatcac caaaaatcgg gaaacctttc tatgagaagg acgctggcat 6600 aatgtatctg aatgcggtcg cacggtatca aggatacgtt ttatctgtca agagccacaa 6660 gaccgacatg cacagagaga cggtacggat ctatctagag tgttcccggg cagcaaaacc 6720 gacaaagaag aaaaagaaga ataaatcaac tggggttaag gacaatgagg atgatcacga 6780 cgatgctgat cacaacgaag ctgatgagcc aaaggagaag cgtgcatcct cttccaagaa 6840 gacgggatgc accttcgagg tttcactcaa ttttcacgcc tctaaaggac ggtggtttgt 6900 cacttcagat gccgaccgaa accctctaaa agcgaaacac aatcatccgg cttttgaaca 6960 tctatacgaa gaaagtcgat atcggcaaat caatccggaa aacgacaaga tcatttcatt 7020 cctaacttcg gctggagttc gacctcgcaa cattcggcca attctcaaag ttcaaagcgg 7080 taccgacgtt cttgtgcgcg atatctacaa ctcccgggca cgtcaaaaga aatgctggct 7140 caatggacga acacccattc aagggttgtt cgacttagta accaaacaga aatgggcttt 7200 ccgaacggtt acttcagaca acggcaccct tcagtctttt ttctttgcaa gtcctgaggc 7260 catccagctt gcaagacgat ttcctaccgt cttgggcatc cactgcacct ataaaaccaa 7320 tcggttcaac ttaccactac tccacatagt cggtaccacc aacgcccaca agtcttttac 7380 tgtagccctt tgtttcttac acagtgagac cgaagatgct tacgagtggg ccatgcaaca 7440 gctgaagtcg attgtgtttg actccgatga aacagttagt ctccctcgaa cctttgccac 7500 cgacgccgaa ccagcacttt ttaatgcaat caacactatc tttccggact ctgtacacat 7560 attatgcatc tggcatatca atgagaacat tggcaagaac tgtaagaaga atttcaaaac 7620 caacgatgaa tgggaagatt ttatgaaagt atggaatcag tttacttgct caactactcc 7680 cgaagaatat gatatcaact tgaaagctct cactgtcctc gctaaaaaac acaacattct 7740 cgaatacttg gagcgcaatt ggcttcatcg aacgaacaag tttgtatctc tttttacatc 7800 atcaacaccc cacttcggga attcctctac atcccgggtc gaaggtagtc actttggttt 7860 gaaaactttt ctcgaaggcc gcaacaacga ttttctaaca tgttttaatg cattcaagcg 7920 acatggggaa catcagtaca acgagataat gatatctttc ggaattgaga agggtaacac 7980 gcttcaagac ataccaccat tccttaagga 
ccttcacggc gtgatatccc attttgctat 8040 caaggatata gccgcccagt atgagaagat atctcagact caactgcaac ctgaccggcc 8100 ttgcactaag acattccgaa ccgcttgggg aatgccatgt agtcatgaca ttgctgctgc 8160 acgcgaggta tctgacactc tatcgcctga cttaattcat gatcaatggc ggcttgtgat 8220 ggacatcgcc ccccattctc gacaaggcct tcataatgcc gcacgttcaa aattcaatgc 8280 cttattgaat ctaccagaac attccctccg taaactatac gaggaagtcg atcttatcaa 8340 aaccggacgg tactcattag ttcctatatt accacctgaa gtcaagcaca acaatcgagg 8400 tcgacctcgc actgggagcc aaaaacaacc agctgtctct aagaatggtc gatggaaatc 8460 tcaacatgaa gttgaagaaa cgaagagaaa aaaaaaagaa aaagcaaaac gaagacgact 8520 tgaagaagat atgactcgag aagaggcaat gttggatgaa gactctgagg atcaaggagt 8580 atctgaaagc cattcaaagc ctgaaggtgt cacaaaagag tctgacactc cctctacctt 8640 atcaaatccg atcaatggtc cctctacctc gttgattctg atagctaccc ctcaatcatc 8700 caacccaatc cccaccacca aactttacaa atgtagcaaa tgccgccaac ctggacatcg 8760 ggccaacaaa tgtccggtta accagtcgca ttctcaacct accgatttgt ttaatgccac 8820 tcaacctcaa cctgacgatt tctttgatgc cactcaacct accgatttgt ttgctactca 8880 acctaccaat gattgtgaac aaggtttgtt ttttcaggag gagagtgaac acaacaaggc 8940 attgttactg tttactgagg attccgaaag tcaaagtggg tccgacacag aaaatgacga 9000 agcactttgt ccattttgcg atgaagagat gcctgcgaac ccatccgata aacttctctc 9060 tttgcgggcc gaactgctgg ctatgcccga aataactcag gggattggtc gctcaggatc 9120 aaggagtctt ccagtaagtt aaaccccatt ctaacaattg tattcaattg ttgaccttat 9180 ggatttcatt tttcagttcg cacaaacggc aactttttgc ggccttcatg aggcagagcg 9240 ccatataatt cccaaaggta tcagtgaagg ctggccaacc tcaatcaact tcaacggtct 9300 tgcgaggtga gacccaactt gtactttgtc tgtaacacca ctggctgaca tcatataagt 9360 taaaggcaaa ccgtgttcgc cggcaaaact cgacaaaact cccaattagt actttcaatt 9420 tggaggttta cgcgtccttt gagtaaaatc cctggctcaa gcgttcgcct catcatctaa 9480 tgtcaagctc ccatagagat catatcatct caatcctgtc caatccttca acacttcaac 9540 ttcaactgca atcatgaact ataatcagga cctcacttca atctaaaccc cttatcttct 9600 ttcccaactc tcaatcagtt cgttctcatc cgctcacctt cctcccattt gaatcaaagt 9660 gaccacccaa tctttgaacc atcaagcatg tttttgatga 
gggccattcg actgttggat 9720 tggtgtcaat caaatcagaa tcagccacct ttgaactttc tttgtgagtc cattacctga 9780 aaaatttgaa tgttcatcta agtttggctg atgtctagct tctattcgaa aaaactgact 9840 atctgacctt ttcttttctt ttgatccttt atgcatcttg gatagagtaa tatcttgaaa 9900 ctcacctcaa ctttgataca aacacctcta aagccatcta ccaatccaaa tgttaccttt 9960 cactggtttg gagatttccg tattctgttg acttaccata cgtggtacac ccttagcaaa 10020 gcttattgta catcaaaacc gatccatcaa gctctttggg gtctatttgg aattgcgagc 10080 aatggtacag cgactaaaag atccacagta cgaaaagaag atcatcaaaa aggtgaagac 10140 actttgattt ctctgactcg atatttgatt tctcaacttg cttatacagt ttgtatgctt 10200 ggattagtca ctcaggagtc atcggattca aagtttgact tcattgatcc gttgggatga 10260 tcagattgat tataaaagga aagtagcctg gatgaaacat gaatcaattg gtctgatgct 10320 tcaaaactca aatcaccatc aatttcatca acaagcccga aacctgactc gattaatgaa 10380 caaaatttct aatgtacata ttcaacattc aaattgatca caatcttaca gaagttactt 10440 caatagaaga aatagaagaa tgagattggt tagctatcaa attagtttta cttcaagatg 10500 aaagcatcct ttccgaaagg atttcaggag aatttggttt actaaaattt aatcataatt 10560 tttttgaatt acttcgatct agaattaata aaaatatgaa tcgatcaacc atggaagatg 10620 gtttaaaact gatggatcaa atttgacaca tttgaatcaa gttaatattg atcgatcaca 10680 tttgattcgg cggggtctcg acagaatgat ttcatctatt gtgtttacgt cgtttgtgag 10740 tcactttctt tttcccttgt tcacttcctg tttactttca attttctttt ctttcttttt 10800 gattctgatc tttctttgtg tatcttttga tttcaccgat catcacgacc agccagttcc 10860 agcctatacc gtgcttcttg attaagtacg gttgagtccg ctccatcaag aaaacatgac 10920 tccagccgat cacaagtccc agtgaaacat ccgtccttga atcttctact ggtacctcct 10980 ccaagccgcg agtttcaccc gtcaatcgag cagtaacatc agtgaacaaa ttttcagaac 11040 gatttttggt ttgcattacc tactgtaatt acaatcaaag ctattgtgag accgatcgaa 11100 cttgggttta ggtatcattt tgatatccga agatctaact atcaatcaga taaggtaaag 11160 ctcaaccttt caagttgttt tttattcatt taacttattg cttggtttcg ttttcttgga 11220 tagcctgaat ggtatcttga tcatgtactt cattgaattg aagaacatga aaggtccgta 11280 taaatagata ccccaaacct aatgaacatg aacaaatata atcatataga tgcaatggta 11340 agtccatcaa gcctaatccg caacaagaaa 
acgagttaaa ctgaatgcag gattttttct 11400 cttaaactga aatttcaagt ttgttgagtg aaatgatttg aaagcactta aaacagagtg 11460 taccacaagt atcagaaatt aaatcacttt tagctcatgc aacttgataa ttgaacaaat 11520 tttagatcaa tcaaagacag catttcaacc tatcaacaat caattcaaga ttatgagaat 11580 cttaaatgtg tctgaatcac catatagttt gattggattc agttgatctt cagtatcagt 11640 aaaagttgaa ggaaatgaaa gtaattccag gattggctag gtgacagttt ttaaataagg 11700 tttttgttag ttgtaacagg atgttaagtg ttcttaatca atgtaatgat tattttgtaa 11760 gtcattcttt ctttacattc atcaaagtca aattatatca ttcattaatg ttaacttaca 11820 ctgatcaccc aattaatcaa accacttaca atctgtactc attcaattaa atcaataagt 11880 acatcaattc ataaaagaga acaaaaacta tttttgaaaa ccagtcaaga atgctaattc 11940 aaatataaat acaaccatac acaggattgg aattagatct acaatgatgt atgaagttcc 12000 tcaattgaag tttgatgttg aaaattgtgg gattaataca atacaagaaa atttaaaggt 12060 ttgtgaaatc agctggaaaa gatcatttga tgctattcaa gttttgtctt gagacaatat 12120 tttggataat caaagtgaaa ataaatggaa tattattcaa gtttgtgaag tttgttttga 12180 tgaagattaa atcaaggatg atgaatttaa gacaaaggag ttggagattg atgggaatga 12240 gtggtgtata gatgaagtct atgctttttt aaaaagaagc cctgagtgtt ggaggtaagt 12300 aaataagttg agattggctt ttgtgtacaa ttattgtgtt tgttcttttg taaagatttg 12360 aggagtttat caactttttg cgtttgaatt atgttggagt tttctttaat tttgtaattc 12420 agaaggttta cacatattat atacctgagt taaagaagca aaagaaggag gacaatagga 12480 caacacaagt attgaaccag taaattacag atgaaagtct tgaatatatt ttggtgaagt 12540 tgcaaaaaac aaagacccag aatctccata gttactgttg attgtgaatg tagagtatgt 12600 tgacccaatt gatttggaaa agcactcact tcacaatcag taatggacaa agactgatgg 12660 gatcagcact tggtcagaca ctggtaatat gcatctacat aggcaggaat gaatgcagca 12720 ggtgcccaaa ggcatgaagg cacctgtgta gttccataaa aaaggaacta aggagcacag 12780 ggcttagaca gactcggtac ctgtgttgca gagtaccaag cctagatggg gaattcaaat 12840 ttccccaaca gaaacaaatt tgggaaaaat taacttcatc ctttttccct aaatcttttt 12900 cctctcactc actcacacca acaaccacca atcactacga gtaagatagt gagagtctca 12960 caatttctta taatttttct taacctccca cccgcccctg caccctgcca cacagagagc 13020 ctaccccatc 
agaacaccag cggcccgcag gctcccccca aacatgtgac aagacttaga 13080 aatatccaaa tcttgtgact tttggtagtt tgattaggta gcttagtgaa taaagaatgt 13140 ataatacaga tcgaaataaa aagaaaacaa aaacttggta aaaataacca ataagaaaaa 13200 catgttccac atagagttgt gagactctga ttgatggaaa ggagatcaga ctgatgtgag 13260 acgttgatgc ggcgcccgcc gcccagatgt gctagtactt ttgtatggta caacagccgc 13320 atagactcgc aggccaattt tttgtcaaag ataataaaac gtgaaacgaa gtctcactat 13380 cttgacctag ctttaacaaa atggaagagc cgtgatcgag gtcagatgtt acaatttgaa 13440 ggttttgatg tcgagcttcc tggctagtga gtttatgagt caaaccctac aaataagtcg 13500 tgtggtttaa tgtgctaatg cgctgtttac ttagttatgg atacaaaggt ttcgaggtca 13560 tcgtaaccaa actccagcta gtattcttat ccgattcatc agatgcacaa ctaaacacaa 13620 aaaacacggc ccccctcggc ccagatttgt acatccgacg tgtcctgctt cctgagttaa 13680 gcctatcttt gatcgctgag gatctaggat tacctcgaac ccatcataaa gtcaatgcga 13740 cccttgaagc aagtcggaag tttggcgctt gggtgttccc aatcgaatac gagtatgagt 13800 acaagaacac ttcttcaaca ccaaacaaca tcaacaactc caaatctctt cattccaatc 13860 ctctcgatca ttccaatcct ctcgatcaat ccaatcctct caatcaatcc aatactctcg 13920 acaaaagctg tcttgttcaa ccaaaagatc agttgatatt gaagcaacta ccagttgtaa 13980 tagtaccata catcaagtca atccacgacc ctcctcctca tggaaattgt gggttttacg 14040 cgattgcggc atcaatgaac atgtacaccg aagatgctta ccttgtcatt cgccgagaaa 14100 tgcttgccga gctcttgtcg catcaggaag atcacacaaa actgatatcc acgccttctc 14160 agcgcccccg gacgaggaaa cagcaatctg acgcggagga ggtggtacaa gaccttatgc 14220 gtcggcttga ccacgaggga atctcttgca ctgatgagta ctggttgagg atgccttcat 14280 tgggctttgt gatagcaagt gcgttcaacc gacccgtgct tctattcaat ccaaaacgaa 14340 gcaattgtta tactttcttc ccatatcgaa caccacctaa caaacaagcc cccatagtac 14400 tcgcgtttgt ggatggtatt cacttcgtca gtctaggtct gggtcatctt ttagtaccgt 14460 ttcctgaagt ttatggacag ctccgaaatg cggcaatact tgagaccact ctggttcctc 14520 aatggataaa ggaatacaag gctgagttgg aaagatggac atcttctcac cattgactct 14580 catgtggttt catttcttct ttttttttgc tttccaactt gttgttcttt cttgttcagt 14640 tgttctgttt gttgcttctt cttgcttgtt gtttacttct gatgagaaga aaagatgtgt 
14700 tttgtttgtc tcaagagttg cttgtaatcg atcctctatt gatactttgt tttcaatcca 14760 aaactacaat acacatggga gaaagaccca aaaaaaacat gaagctggac ttgaacttga 14820 acttgaaagc ttcacaagga gattgactgc caacaataca tggtaaatct caaaaggtca 14880 ttttggagag cagcccgctt tgatttcaaa gcgggctgag ggaagacaca gcccgcgaaa 14940 tacccggaaa gggggcttcg ggtttttttt tccgggcggt aaatagtggt ttccttttgt 15000 aacagctggc cagcgtgatc atctcctaaa taggtctgat acaaacatcc aatctgcgaa 15060 cgcatcctgg tccacttccg caagattggc agcagacatt aagatgaaag aacctgtgaa 15120 gacagtggaa gaactcgttc ctatgttcta tcatcgtcac ctgcatatgt tcaggaagtc 15180 gaattctcaa caattacccc ctagacgtaa atacgacttc aaggtggatt taattccggg 15240 tgctcagcct caagtgggtc gtattatccc actgtcacct gctgagaaca aagttttgga 15300 caaaataatc gaagaaggtc tcagcaacgg aactatccgc cggactacct ctccatgggc 15360 tgcccccgtt ttatttacgg ggaagaaaga tggcaattta aggccatgct ttgattatcg 15420 aaagctcaac tctttgacaa ttaagaataa atatccgcta cccttaacta tggatttggt 15480 agatggtctt ttagacgccg ataaatacac caaacttgat cttaggaatg cgtatggaaa 15540 cttacaagta gctgaaggct atgaagacat tttggcattc atttgtaaac aaggccaatt 15600 tgccccgctt acaatgcctt ttgggcccac tggcgcaccg ggttatttcc agtacttcat 15660 tcaagatatc ttacttcgcc gaattggtaa agacacagca gcctatcttg atgacacaat 15720 gatttatact aaaaccagag ttcatcacga aagcgctgtt gatagtgtac ttgatgtatt 15780 ggataagcat gatctgtggc tgaaacccga aaaatgcaaa ttttccaagt ctgaagtgga 15840 gtatttagga cttatcatct caaaaaacag agtcaagatg gaccctacaa aggttaaggc 15900 cgtgaaatag tggccagcac cacgaaacgt caccgaatta caacgcttta tcggattctc 15960 aaacttttac cgtcgattta tagatcattt ctccaagacc actcgtcctt tacataactt 16020 gaaacgacta aaaacgccat atgtatggga cgagaggtgc aataaagcct ttgagtcact 16080 caaaaccgcc tttacatcag ccccgatcct taagatagca gatccgtaca agccattcat 16140 ccttgaatgt gattgctcgg atttcgccct aggtgcggta ttatctcaat attgcgatgc 16200 tgataaggaa cttcacccgg tagcatatct atcacgctca ctggtacaag cagaacgcaa 16260 ttatgaaata tttgacaaag aactattggc aattattgcg tccttcaagg agtggcgtca 16320 ttatttggag ggtaatccta accggttaga agtgatagtt 
tacactgatc acagagacct 16380 ggagactttc atgacaacaa agcagttgac aagacggcag gcgcactggg ctgagattat 16440 ggggtgtttt gattttgtga ttaaatttag accagggagg aatgcgacca aacctgatgc 16500 attgtctagg aggccggatt tagcacctag cgaagctgac aaactaacat ttggacaact 16560 cattaagcca gagaatctag cccacgactc gtttcaaatt gaaatagcca gctttgatac 16620 aaattttgaa gatgaggcta tccagcttga tgacgcggag cattggtttg aggttgatgt 16680 cttaggcatt gacgaggata tcgaggacat agaagaaaaa tctgacaata agctgacgga 16740 gtttatgaca gatgaagata tcaataattt aatatgaaaa gctaacgaag aggatgatag 16800 aatcaaagag ctgattcaag cagcaacaaa ccctatatca tctaagataa agaaggctct 16860 caaatcttac gacgtcaaag atggtatttt atacaacaac agtaaaattg aagtaccgaa 16920 tgacgagcac atcaagtacc tcatagtccg cagttgtcat gatagtctac ttgctggtca 16980 cccaggtcga tccaagaccc tcagcctggt acgtcgaagt tttacatggc cgtctcaaaa 17040 gtcatatgtc aataaatacg tcgacagctg tgattcttgc ctgagagtga agtcaagcac 17100 ccagaaaccg ttcggtactc tagaaccact cccgatcccg gcaggtcctt ggaccgatgt 17160 tagctacgac cttataacga aactgccaac ttctgacgga tacgacagca ttttgactgt 17220 ggtggacagg ttaacaaaga tgtcacattt tcttccatgt cgagaatcta tgacagcgaa 17280 cgaactggct gacatcatga tcaaagacgt ttggaagctg cacggtaccc caaaaaagta 17340 ttgtctcaga caggggtacc atctttgtct cacagatcac gaaacaatta gacaacaatt 17400 taggaatcaa atcacatcca tctaccgctt tccacccacg aacagacggg cagagtgaaa 17460 tagtgaataa ggtaatagaa caatacctta gacatttcgt agcttacaaa caggacgact 17520 gggcatcgct attacctaca gctgagtttg catacaacaa ccgtgatcat gaatcaaccg 17580 gcatatcacc attcaaagcc aactatggat atgacccgac tttcaacagg atcccatcaa 17640 gcgagcaatg cataccgctt gttgaaacta gacttcaatt gatagaagaa gtacagcagg 17700 aactgaccag ttgtttagaa tttgcgcagg agtcgatgaa aaaacagttt gacaaacacg 17760 tacagaaaac tccggagtgg caagtaggag atcaagtgtg gctcaatagc aagaacatat 17820 caactaccag gccaactcct aaattggagt acagatggct aggtcccttt tttattactg 17880 aaagagtttc aacgtcagct tacaaattga atctcccgat gtcaatgaaa ggtgtacatc 17940 ccgtcttcca tgtgtctgta ctacgaaaac acgacattga cccaattagc cagaggaaac 18000 aagttccacc accaccaatc 
caagttaacg aagatgaaga atgggaggtc gaggaaattt 18060 tagattgtag atcacgccat aagaagttag aatatttagt gaagtggaag gggtacgaat 18120 tgaatcacaa ctcatgggaa ccagaatcaa atttagataa ttgtcaagac ttggttaatg 18180 attttaagag tagattccca gatggagcgt caaaatatag gagaagacgg ggaaaatgag 18240 agagggcaag ctttttccca cagggttttt taacgctgcc cagggaaaga atgcagagct 18300 cgcaaaaggg agcttgggca ttaaaagggg ggcttaaagg agggataa 18348 // ID Gypsy-97_MLP-I repbase; DNA; FNG; 5404 BP. XX AC AECX01000463; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-97_MLP_; KW Gypsy-97_MLP-LTR; Gypsy-97_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5404 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000463; Positions 53547 58950. XX CC Positions [4207-4686] - Integrase core CC 'CGGGC' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1291..5307 FT /product="Gypsy-97_MLP-I_1p" FT /translation="MDVSHSSVLERKKDLFMVDAGKLICNSNDPRLFAPIT FT LSDPLCPTSKTNAKALIDCGATHEVLGESYVRKHQIKTNPLPEAKDVSGFD FT GKNRKITEDAKLCIDDEKVPSRFIVTKLKDSYEAILGMPWIKRNGNKIDWK FT KGTFKEIQDIATVKAVSSDLKKPQVEMETVVGDAKNRDEGVCIHSGTLAPP FT QCEFDTYIPDPHVETVCKLNGPMNSSNDSDGSTESSAPKTPSGDLREVIRE FT DARQADEGVCIPRSTLTPPQCEFDTSIHTLSNESAGELVPLLDNCRNSISA FT AKVSWSTSARIAAEAKAKTQEKKVDELVPTRYHKYLHLFKKSQAMTLPPRR FT KYDFKVKLIPGAVPQASRIIPLSPAENDVLEEMIKEGLANKTIRHTTSPWA FT APVLFTGKKDGKLRPCFDYQKLNALTVKHKYPLPLTMELVDSLLDARKFTK FT LDLRNAYGNLRVAEEDEDKLAFICRSGQFAPRTMPFGPTGAPGHFQYFIQD FT ILLGHIGKDTGAFLDDIMIYTKVGVNHEDVVEEILKILEKHQLWIKPEKCE FT FSKDEVEYLGLLISENKIRMDPAKVKAVKEWPAPKSVTELQRFVGFANFYR FT RFINQFSKTARPLHELTCKDVKFEWNKRRNKSFEDLKEAFTTAPILRIADP FT YRPFVLECDCSDFALGAVLSQTAEDGLLHPVAYLSRSLIAAEKNYEIFDKE FT LLAVVASFKEWRQYLEGNPNRLDVIVYTDHKNLESFMTSKQLTRRQARWAE FT TLGSFDFQIRFRPGREATRPDALSRRPDLAPSQEEKLTFGQMLKPNNIVED FT TFLPEVDCVEAWFEDELIENEKADEWFEADVLNHEVASPIESDTTIIEKIR FT QASTEDHQISEIINAVRKPISKKAQGWTEEYSVEDGVLYINNCVMVPENNS FT IKTEIVKSRHDSPMAGHPGRAKTLSLVRRNYRWKGMKSFINRYVDGCESCQ FT RVKSSNQKAFGQLKPLPIPAGPWTDISYDLITELPESNGKTCILTVVDRLT FT KMAHFIPCTNTTSAEDLADLMLRNVWRLHGTPKTVVSDRGGIFISQITTSL FT YKQLGIKSTPSTAYHPETDGQSEIVNKVVEGYLRHFTSYKQDDWEPLLAMA FT EFAYNNNTHSSTGVSPFMANYGYDPTYTTTPSINQRIPAVEERIKQIQEVQ FT AEIKESLAQAQESMKYFYDRNKRKAPSWKIGSKVWLNSKNISTTRPSAKMD FT HRWLGPFKIIKCISEVAYELELPLSMHRLHPVFHISRLREYKADTIAGRKQ FT PPPPPIELDGEEEYEVESILDRRTRRKTEEFLVSWKGYGREHDSWEPLRNL FT KNAKELVEAFKIKYPEAAKNYRTRRRM" XX SQ Sequence 5404 BP; 1832 A; 1078 C; 1247 G; 1247 T; 0 other; tattgcagta tctttaaatc ccaagaactc aagaagatct caaacttaat ctgaagaaag 60 aagaacggta cagattgaat taattgaaac gttaaagtta aaaaaagtga agatccttgt 120 ggagaagatc gaccaaactt aaagtacccc gcaagaaaga caccaagata ccgacattca 180 cagtcggcga agaagaacag gacagcgaag actttcgatc tgcgattagt ctagagatgt 240 ctgaagaagc acaagtgcca aatatcgcac tacttgtgca gcaggtggcc gacctaaaca 300 cccggcttac aaatgaaacg aatctacgcc aagccgccga 
actcagcagg caacaagcag 360 aacttcgatt aactcaattg gaggccaact cggctgccca aactgctcag aacccggccc 420 atccacctaa tgtacaaccc acctacgtac cagtcccacc ggctcataag acaccaaaag 480 tagcaacacc tgataaattt accggcaccg ttggactgtt agctgaagga ttcattagtc 540 aaatcacgca atgtttccca ccgataaatc taaagtcact ttcgctcttt cttacttatc 600 tggagaagcg ctacgatggg cccaacctac cttaggactt gttctgaacc cagagacagc 660 agacacagtt actttcgaaa actttgtaaa ggcatttaaa ggcgtattct tcgaccccat 720 gagaaagact agagctgaaa aagaactacg agcattgaaa caaactggaa cggtagctga 780 ttattcaatt aaattcaatc aatgggctgc agccacgaaa tgggaagtac caattcttat 840 tagtcagtat aagcaaggtt tgaaattaga aattagaatg ggaatgctac agaagacatt 900 cgaaggttta gaagatatca caaatttagc agctgaaatt gacaatgaaa ttcatggtga 960 acggatggtg attcataatt caagtacaac acgtaacacc cctgctagag acccggacgc 1020 aatggaccta tcagccgcac ggttccctat ttctggcgaa gagtatagaa gaaggggtga 1080 agcgggagaa tgttataagt gtggaggaac aaatcattac gctagaaact gtggaaaatt 1140 taagaagtgg aaaggaaaag ggaagagtag agtagctgaa ttagaagcaa gattagctga 1200 atttgaaagt aggggtagta gtagtggtgg aaattctcat ggtgaaggaa gtgaagcaga 1260 tctgtcaaaa aatggagacg ctcgagagtg atggatgtgt cccactcgag cgtattagag 1320 aggaaaaagg acttatttat ggttgatgct ggaaaactta tttgcaattc aaatgatccg 1380 cgtttattcg cccccataac tttgtccgat cccttgtgcc ccacatccaa aactaatgcc 1440 aaagccttga tagactgcgg tgctactcat gaagttctcg gtgagagcta tgttcgcaaa 1500 caccaaatca aaaccaaccc cctacccgaa gctaaagatg tttcaggatt cgacggaaag 1560 aaccgaaaaa tcactgaaga cgcaaaatta tgcattgatg atgagaaagt gccttcaaga 1620 ttcattgtaa ccaaactcaa ggattcatat gaagcgatcc tcggtatgcc ttggatcaaa 1680 cgaaacggca acaagatcga ctggaagaaa ggaaccttca aggagattca agatattgca 1740 accgtcaaag cggtttcgtc cgatctgaaa aaaccccaag tcgaaatgga gacagtggtt 1800 ggggacgcta agaatcgtga cgagggggtg tgtatccata gtggtactct agcacccccg 1860 caatgtgagt tcgatactta cattccagac ccacatgttg aaacagtttg caagcttaat 1920 ggccctatga atagctctaa tgacagtgac ggatccacgg aatcatcggc gccgaaaaca 1980 ccttctggag acctgagaga ggtgatacgg gaggacgcta ggcaagctga tgagggggtg 
2040 tgtattccaa ggagtacatt aacgccccca caatgtgagt tcgatacgtc aattcatacc 2100 ctgtctaatg aatcagctgg cgagcttgta cctctcctgg ataactgtag aaattcgatt 2160 tcagcagcaa aggtgtcatg gtcgacgtca gcccgcatcg ccgccgaagc caaggcaaag 2220 actcaagaga agaaagtgga tgaattggtt cccacacgct atcacaaata cctgcacctc 2280 tttaagaaat cacaggcgat gactcttccg cctcgccgaa aatatgattt taaggttaaa 2340 cttatcccag gagccgtacc acaagctagt agaataatcc ctctatcacc agccgagaat 2400 gacgtattgg aggagatgat aaaagaggga ttggctaata aaacaataag acacacgact 2460 tcgccatggg cggcccctgt gctttttacg gggaaaaaag atggcaaact ccgaccatgt 2520 ttcgattatc aaaaattgaa tgctcttacg gttaagcaca aatacccgct accactcacg 2580 atggagttag tagacagttt attggacgcg aggaagttta ctaaattaga cttgaggaat 2640 gcttatggta atctgagggt agctgaagag gacgaggata aattagcatt catttgtcga 2700 tctggacaat ttgcacctcg aaccatgccg tttgggccga caggagcacc tggacatttt 2760 caatatttta ttcaagatat cttattagga catattggaa aggacacggg tgcattttta 2820 gatgatatca tgatttatac taaagtagga gtcaatcacg aagatgttgt tgaagaaata 2880 ttaaagattt tagagaaaca ccaattatgg ataaagccgg agaaatgtga attttcaaag 2940 gatgaagtag aatatttagg acttcttatt tcagagaaca aaatccgaat ggacccggcc 3000 aaagtcaagg cggtgaagga atggcctgca ccaaaatctg tcacggaatt acaacgattt 3060 gtagggtttg caaatttcta tcgaaggttt ataaatcaat tctcaaagac ggcacgacca 3120 ttgcatgaac ttacatgtaa ggatgtcaaa tttgaatgga acaaacgacg aaataaatct 3180 ttcgaagatc ttaaagaagc tttcaccaca gcgccaattc tacgaatagc cgacccctat 3240 agacctttcg tattggagtg tgactgctcc gacttcgcat taggggcagt gttatctcaa 3300 acggcggaag atggcctatt acacccagta gcttacttat ccagatccct aatagcggcc 3360 gaaaagaatt acgaaatatt cgataaggag ctattagcgg tagttgcttc cttcaaagag 3420 tggaggcagt atttggaagg taatccaaac cgcctcgacg taattgtgta tacagatcac 3480 aagaatctgg aaagtttcat gacaagcaaa cagctaactc gaagacaagc tagatgggcg 3540 gaaacgttag gaagttttga ttttcaaatt agattcagac cagggagaga ggcaacacga 3600 ccagacgcgt tgtcacgaag accggattta gcgccttcac aggaagaaaa actcacgttc 3660 ggacagatgc tgaagcctaa caacattgtc gaggatacat tcttaccaga agtagactgt 3720 
gtagaggcct ggtttgaaga tgaattgatt gaaaatgaga aggcagatga atggtttgaa 3780 gcagatgtat taaatcatga agtagcaagt ccaattgaat cagatactac aattatagag 3840 aaaattcgtc aagcctcaac agaagaccat cagataagcg aaatcatcaa tgctgtaaga 3900 aaacctatat ctaagaaggc gcaagggtgg acggaagaat attcagtgga ggacggtgtg 3960 ttatatatca acaattgtgt aatggtgcca gagaacaaca gcatcaagac tgagatcgtc 4020 aaaagtcgac acgacagccc aatggcaggt cacccaggtc gcgcaaaaac tctgagtctg 4080 gtgaggagga attacagatg gaaaggtatg aagtcattca taaaccgcta cgtagatggg 4140 tgtgaatcgt gtcagagagt caagtcttca aatcaaaaag cttttggtca attgaagcca 4200 ttaccaattc cagccggtcc ttggacagac atttcatacg atctcattac ggaattacct 4260 gaatcaaatg gtaagacttg tatactaacg gttgtagaca ggttaacgaa gatggcacac 4320 ttcatcccgt gcaccaatac cacttcagca gaagatttgg cggacctgat gttaaggaac 4380 gtgtggaggt tgcatggaac gcctaaaacg gtagtatctg accggggagg gatattcata 4440 tcacaaatta ccacatcact atataaacaa ttgggcatta agtcaacacc gtcaacagct 4500 tatcacccgg agacggacgg tcaatctgag atagtaaaca aggtagtaga agggtatctg 4560 cgccatttca ctagttataa acaagatgat tgggagccgc tgttggctat ggctgaattt 4620 gcgtataata acaataccca cagttccaca ggagtttcgc cgttcatggc aaactatggc 4680 tacgatccga cgtacaccac gacgccgtca attaatcagc gcattccggc tgtagaagaa 4740 cgaattaaac aaattcaaga agttcaagcg gagattaaag aaagtttagc gcaagcacaa 4800 gaatcaatga aatatttcta cgacaggaac aagaggaaag cacctagctg gaagattgga 4860 tctaaggttt ggctaaatag taaaaacatc tcaaccacga ggccaagcgc aaaaatggat 4920 cacagatggc tagggccctt taaaatcatt aaatgcattt cagaagtagc ttacgagctt 4980 gaattacccc tcagtatgca caggttacac ccggtattcc atatttcaag attaagagag 5040 tataaagctg atacaatagc aggacgcaaa caaccgccac caccaccaat tgaattagac 5100 ggagaagaag agtacgaagt tgaatctatt ctagatagaa gaacaagaag aaagactgaa 5160 gaatttttag ttagttggaa aggttatggg agagaacatg attcatggga accgttaaga 5220 aatttgaaaa acgcaaaaga gttagtagaa gcatttaaaa ttaaatatcc agaggcagca 5280 aaaaactata ggacacggag aaggatgtga gggccaagct ttttcccaag tggtttttta 5340 acgctggtcc gtgggaaggg cgcagggcaa ttcattataa gcctgggcgt taaaaggggg 5400 atac 5404 
// ID CACTA-1_Ccinerea repbase; DNA; FNG; 7869 BP. XX AC . XX DT 12-MAY-2011 (Rel. 16.05, Created) DT 12-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW EnSpm; DNA transposon; Transposable Element; CACTA-1_Ccinerea. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 7869 BP; 1734 A; 2130 C; 1976 G; 2029 T; 0 other; cactatgaaa ccagaaacga atacaaaagg atgcgcgcaa gggggggaat gtacccactt 60 cgcgcataca aaccaggcgt ttcgcatgat ttgtatgcgc ccaacggaca gagagcacta 120 tccgttgggc gcatacaaat catgagcaaa atgtcttaac tatccgtccc tggatcggcc 180 cagggatttc tgtctgcgga cacttttcgt cggttttatg ggttgtttta cgtcgcgtac 240 catatatcat ccagcggcgt ttgctcctca ggtcgagtta tttgtatccg catgccagac 300 atttctaacc agaaaaaatg tgttcattag aaagccgtat actacaaatc gaagctgggg 360 tacaaataaa ctgggtgata cttctacaaa tcgacgcggg ttataaaaga gtaatgatcc 420 agacgtatag ggtattgaaa cggatgaagg tgagggtgct tatcagtcag agaaatcggc 480 ctcatcttct tcaaacatgg ccttccatgc atcctccttg cccttgccct tgcctttacc 540 tttggccttc ccctttccct tcccggcggt tagctcttcc tcgaaatcct cgtcaccctc 600 ctcatcctcg gtcgatttgc ccttcgtctt gcgggcacgc gtctctttgt gtggagacac 660 tcgggaacga gtgattctac cggtgcttga ggacgcagga aacggctgtt ggcttctttt 720 gtcgaagagg tcggtgtcgc tatgagccgc agcctttcgt ggtcttgggg ctggtctctt 780 gggtgatggt gtagccacgg tatcgccatc gttgtcatct ccttctgggg cagaggataa 840 aggcttcctg gcgagcgcta cagcacggct ttgactaaag atggagttag agctggatga 900 ttgaggaaag ggaggagcgg catccagttg gtcctggttg gccgcgagct tcttgcgctt 960 gggtgctggc ggagaggcgt tgtcgtcgtc ctcggcgggg gtgggttgct tgcgcttgtg 1020 cttgggtgct ggctccccgt cgtcatcgtc gccgtcgtcg tcgccgtcgt cagagactgg 1080 ggcgggcttc tttgccttgc gcttggtctt 
gggcgtcggg tggtcgtcgt cgtcgtggtc 1140 gccgtcagag actggtgcgg gcttcttcgc cttgcgcttg gtcttgggcg tcggctcgtc 1200 gccgtcgtcg tcgtcgccgt cgtcgtcgcc gtcagagact ggtgcgggct tcttcgcctt 1260 gcgcttggtc ttgggcgtcg gctcgtcgtc gtcgtcgtgg tcgccgtcag agactggtgc 1320 gggcttcttc gccttgcgct tggtcttggg cgtcggctca tcatcgtcat cgtcgtcgtc 1380 gttggagatt ggggcgggct tcttcgcctt gcgcttggtc ttgggcgtcg gctcgtcatc 1440 gtcgccgtcg tcgtcgtcgc cgtcagagac tggggcgggc ttcttcgcct tgcgcttggt 1500 cttgggcgtc ggctcgtcgt cgtcgtcgtc gttggagagt ggggcgggct tcttcgcctt 1560 gcgcttggtc ttgggcgtcg gctcgtcatc gtcaccgtcg tcgtcgtcgt cgtcgttgga 1620 gactggggcg ggattcttcg ccttggcctt ggtcttgcta gagccaacgg cgaatgcgtc 1680 cagtatctgc ttgtgcctgt ctttcctgct acgctggtca tcctcgtcat cagctgttgc 1740 atccgagtca ctctcatcgc tcgacttgtt gtcactacca gctgttgctg gcatcgttcc 1800 aagctgcttc tcgagtctgc gagcagacgc tcgcttacca cggagacagc ccttgatgat 1860 cgaagcagtc ggccagttgt tcttgaatcg ggcaagataa gggtgtctga tcatcatctt 1920 attggacatg tcatttgatt atttgtgtag tggaataaat ggaagacgta ccatgttaaa 1980 tatcttccca accttcccta gatcctggtt ccgaaagtgc tccaaaggag gaaggcgtgc 2040 tatgtcaata aactcgcgaa cagaagcctg aaaaatgcaa gttagaagaa tccggtaaac 2100 cagattgcac gtacgaggat gttgaggtat tcgcgatagt ggaatttccc atagcgctgc 2160 ttgtcgagat gcatctcatc tacaacagtg tacccgcctc gtcctggttc ccctggctgt 2220 ccaggaggct tttctatctc ccacagtttc ttgttggcgg tttcatggac ccagagtgaa 2280 agctgcgatg gctcgaggtt gactggaaga acaacagtca ttttctcgta gaaacagata 2340 cggcagcact taccagattc ataccaccta cggcgcttcg cccgtgttga tttcttgcgc 2400 tcaccctgag atttggtccc cgacttgagt gttttcctct ttgcgcttat gtagtcctgt 2460 ggctgagatg gaggcggtgg ggaaagtgtt gggttgtcta ttggggacgg ttcggctttc 2520 ttcttgctgg aagaagaggt ctgcttcgaa gatttcttgc cgtttgattt cttcgaagct 2580 gtatccctgc tacctgcctg tgcaggcttc gttcctgagc catattcagg attagccccc 2640 aaaatagtat attcgccttg agggtcactg acctttatct ttttgcttag gcggcatggc 2700 tgaaaacgac ccgttgagga agggcggaga gggaagaagt caacgttcaa cgaccgcgtt 2760 gctgagaggc tctctcggcg tggttcccac aggtcacatg 
gtcgcggact tgtgcaaaat 2820 tctttgactt ttcgtcgcgt tcaacttcta cacatccaca ggcaaacgcc aaaatcaaat 2880 tactctctct ttacttgcgt cgactcaatg gcccgacctc caaaatccgt cgaatctgac 2940 agcgaaaagg aaaatgaggg agaaagtgaa ggggagaacg aagagaagga tggtgaagtc 3000 aacttggaac cgtgccctca ttgcggcaag atgctctcga cacgtcaaat tagccgccac 3060 ctggagggaa agagtgtgcc cttcttgatc agggccagcc aaatggctaa gaacttgaag 3120 gcaaagactg agaaactggc aaagaaggtt tcgtcaacct tttcacggtc gcctacaccc 3180 tcatcaacct acaccccacg ctctggaccg atgtcttcac cggcattttt cagctctccc 3240 atggtcatct ctcctgttat tgcccaatcc ctcaatagtg tttctatggc gagtcctatt 3300 ctcaacaccg aaacggagca tccgccagaa cgccgttgga gaacgacagt agaggatgtg 3360 cctgaggaag acgaggaact tgatgtgggg gcaacagagc tggaggaagt tggattaggt 3420 ctgccagatg ttgagatcga agagtatgcg agtgacgagg gggaagagga tgatttttgg 3480 ggtattcccg aggcatctga gtctgaggaa agtcttgggt tgggctacag ttcggacgag 3540 gactacaaca actacgacga tgacgaggag ttcaacagac tgtgggctgc ggccatagca 3600 cagggtatgt attccggtat ccattctcaa ttagtgttgc taatgatcac ggggtgcagg 3660 cggtatgact ctcactgccg aggatagctc ggttctgaga gcagtgatgt acaaacttcg 3720 tcactgcctc tcagacagag cattcaacga actcccgtgg gtcttcccag accaggtcgg 3780 gcttacgacg ctcaagaata gcaagtcgca gctccgcttt ctttgcgact tcaagcccga 3840 gtcctaccat tgctgcatca acgtctgtgt cctctatgtc gacgattatg ccgaccttac 3900 ccactgtccc aatccgaaat gccgccatcc tcgatacaac gacaatggtc agccctacaa 3960 gatgttcaca tatgttcccg ccatccctcg cctcattgcc cttgcagcca atcctgaagc 4020 cgagaagcta atgggctatc gaaaccaaca ctttccgcca gaccccgatg acgacattca 4080 aacaccaaaa ccgaagatga ctgatatctt cgatgggacc aactatcggc gactagtgaa 4140 gaaaagggtc gtggtcgaca atgtccggta tccccatcga ttttttgagc agaagaccga 4200 catcgccctc ggcttttcga ctgatgggtt ttgcccattc cgccgccgga agaagacatg 4260 ttggccactc cttctcttca attacaacct tcctccggag attcggtttc atgcaaaatg 4320 ggtgatgtgt cttggggtca tcccgggacc aaacaagccg tgggacagca attcttttct 4380 tcgcccgtat gtcaatgaga tgaagaaact cgccatcggc gtccccgcct tttccgcaca 4440 atccaactcg aatttcaagc ttcgcggctt ccccgtcgac ggtttcggag 
acatccccgc 4500 aatttcaatg gttttatgcc tcaagggtca caacgggaag tgtccctgcc gcgactgcaa 4560 gatccaaggc atccgggatc cagctgcaaa ggcgactacc cactatgttc cacttgcacg 4620 cttcccaccc gagcctccat acgatcccaa aaaattgcct cgccggaccc acgccgagtt 4680 tattgcagat gggaaagctg cacaggacaa gacggtctct gctgctgagc gggagcgtcg 4740 ttccaaagca tccggcatca agggtctcac cgtcttgtcg gagataccct ccctcttctt 4800 cccagattca ttcccattcg actttatgca tcttatctgg gagaacttaa tcaagaactt 4860 ggtgctattt tggaccggca gcttcaagga attggaccac gccaaccaag actatgtcat 4920 tgaccgcacg gtatgggatg caatagccaa agcaggttcc caggcagggg ccacaattcc 4980 caacgcatac gccgcccgtc ccaaggacat cactaacgac cgcatcgcca ccacggccga 5040 cacctggtcc ttttggacct tgtttctcgc accagtcttc cttcgccgac agtttaagaa 5100 cagagagtac tacgaccatt tcatcgaact cgtcaagatt atcaacattt gccttcaatt 5160 cgaatatacg ctggaggata ttggcaaaat tcgagaggga ttcaatgact gggttctgga 5220 atacgaacgg taggtcatat cctttgttta agcacattta atctaaccaa cacgcttagc 5280 ttgtactttc agtacgaact cgaccgttta ccggcctgtc caaccaccat ccattccctc 5340 ctccatatcg ccgatggtat caaggccaat gggccggtat gggcttattg ggcttttgca 5400 atggagcgtt actgtggacg actccgccct tggattaaga gccgacgaca tccttgggcg 5460 aacctcgata acaatgtcgt tctggctgca caagccgacc aaatcttgca caactacaac 5520 ttacacagtg tcttcgactc aaactccaat aaggaaagca tccgagactt cgcaatccct 5580 gacgatttct gtgagttcaa ctcttccccg cgcccgtctg ctcatttttc aatgctctta 5640 gatgctcaat ggtacacact ggcatcaccg cgcgtcaaaa acccggaaat cgaaccagga 5700 gttgaacgca agttggctgc cgccctatca actaggtata cacctgaagg gaagcctgct 5760 atttcgctga gaacagtgcg gaagtacttc aaagtcaaga atgtgcagct tttcggacga 5820 atcaagatgc tgcatggcgg agacaccatt cgtgcagcac ggttatgcag gactaccgcg 5880 aaggatagtc gggaagcttc atttgttagg gtatgtggcc gtgctccttg actgtgactg 5940 acatagctca cgccagggtc tctaatctag tacgagctgg cagtagaccg gaatgcagca 6000 ctctcctacg acaaggccga cgaggacctc gttctagaga cctatttcgg tgaggtacag 6060 caagtatttg tgatcaagat gcctcgaagc aaagagctgg gtatcaagga ggaggagatc 6120 gtcgttttcg tcgaaattca gccatgcaac acgcgcctca acgagatcga tcttacgatg 
6180 tgcgcctcac gcccggggaa actgaaagga ttggacatcc atatctaccc tggtatgaaa 6240 gcgggtaccg agattgtgga tcttacgact atcatggctc ttattggtcg gttcaagtgg 6300 gagggtatgt gggttatcat tgatcgtagt gggaatgtga accgtccaga gcttgtgcac 6360 gaccaactgt tgtaattttg ttaacttttc aactttaaat tacccttaat tctattgatt 6420 ttcttgtcta tttcctcgta gtgaacctgt agtaatctac aaattcctcg agtaggtggg 6480 ttcgaggttt aattggggtt gtttgcccca gcctgtctga tgatgcccgc cgcgtcgacc 6540 acgtctttat caacgaccac cacattcctc aatctattcg ctcactattt actgtctttt 6600 ctgcaaactc agcgctttct cgtacgcgcc tttacggttc tcacctggcg atgtcaaaac 6660 gaccctttag cgagatccaa aacaacgcga gtccacagcc ccgccggtcc cgtcggttgc 6720 aggatctctt cggccccttg gaccaaccgt cagtgtatac gcgtaccgat tgggactcca 6780 aggagctcct ctatcacctt gagcaacaga agaggtatag aaggctccag gtacgtccct 6840 ggcacaagtc atttcatacc tagctgactc taaacaccag cgtcgttctg aatcggagcc 6900 tccgctgcca gatagccaac tgccggatcc acactgggac cctcaagcac cgtttgcatg 6960 gcctaccgtc ccccgaaccc cacgagtccc cactccacta tctcagcccc cgccttcacc 7020 aaccgaccaa aatccggagt cccgccttga atcacagctg cccgagcatc aaaccccact 7080 atcctcacct catccatccg accactttca atctattgga ccggcaactc accacatcca 7140 tgacagcgat aggcccatgg ttggccagac accatcgcat ttcgaaatca cccactcatg 7200 gcgagacccg tcactccgcc aaactctcga cattcccaac agcacccctg tctctagagc 7260 atgcccattc tgcatacccg ttggacacga aagcttctcc agccttttta gagaccccac 7320 ggcgaagatt tgtagctacc atcgtcgaat ggctgtatgg gaaatacaaa tgaacctgct 7380 tcgccttggg ccactgatca ctcatgcgac aaggcatcgc gatgcgcttg tcgaagaacg 7440 agaatcatat catcagctac tggaacagat ggagcactcc gggtccgcgt ataactaggc 7500 cactcgcttt tcacttttct ttcttgtgtt tggcccttgt atcaccaccc tcctttgatt 7560 tgtgggcctc tatcattact tatggtcgcc ttgtagccta ctgtgtttca attccctttc 7620 gatttgtagt atatagatcc taattcacgt tgttctcctt aaatcgagca cactttctta 7680 aggtggttca agtatccgga caccgtgtag tttatacctg tccaaaaaga ctgcccgata 7740 caagaaatcc gtttgtgttt tggacctaat caagcttttt tttgacgatg caaagtgtcc 7800 gtgaatcgga tggcaattac atgtatccgt gaatctatat acattgtatc cggctaggag 7860 
ttcatagtg 7869 // ID Copia-2_MLP-I repbase; DNA; FNG; 4523 BP. XX AC AECX01002036; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_MLP_; KW Copia-2_MLP-LTR; Copia-2_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4523 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002036; Positions 12239 7717. XX CC Positions [1823-2335] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 115..1575 FT /product="Copia-2_MLP-I_1p" FT /translation="MASSPIDSGSPAQLSPSSDLDQLSDSSDLESSSQSTT FT RPLSSIMATTNTQEDRINDRIKIPELRDGNFPHWNKRLHFALQTRGLSSFL FT TSDAPPTDPTALLLYRRQQGRVMEIMVESLDTANDSLVKITDTPKIAYETL FT CKAHGSSGGVLAAAAICEIATARLKPGQSLSDFIIRIRNLHNQLAQYSAED FT KEIALSSKLLAIFLLNGLGKEFEFITAPFFADLSNLTVQMVMDRLVIETAK FT RSSSGVTSSTNAFTAKVVPPAQTTNRVVRRIGPGPNDVCNLENHRGLPHTN FT RNCHFQNSTGPKAKAKSSSPNPSSNKLSESEMAKRYLEIEAAHLAKKNPPP FT ASAFNVTTSNQFSALELADDFPDKGFAAHAYSAQTDGSSYGSILADTAASR FT NIVSNKHMFTTLKAIDSVKINGISNVPQYATHIGTIVIPGFSEVTHQPVDI FT LIPEVLYVPTMTVNLLSLSQLCANGASFSGSDESITVVGLPR" FT CDS 1520..4513 FT /product="Copia-2_MLP-I_2p" FT /translation="MGPPSRVLMKALRLLVYPGDAYFICRKTSDDCLWDCE FT VRIMSSINAYSASAECWHHRLGHLHYDAIKRLAAKGDIKISGSVSSNRSPC FT NTCQKAKISQLPFKSHFPSSSAVLNRIHSDVIGPFPPSLGGAKYLVTFIDD FT CTRYATIFPIKLKSEVFHCFVNFKTRVELIFESKIKVFHSDRGGEYQSKEF FT LSYLTQHGISLEQGPAETPEQNSISERYNRSLIERVRCNLHHASLPTNMWA FT EIALATTFTLNHSPHSFLNHETPLTKWNSFLPGKGYHGPDPSFLRTLGCSA FT VYLAPRITGKLGLKGRNGVLVGYELMAKAYHIWDLLDRKIVVTRSVLFNEA FT EFPFVHLPNIVSSPSFIILQDPGSDIPSISTLSSTSTVSNLNHSKDSNSLL FT STSDSSSTSTLNVENVSNSPIHNPVLKTLLLPTLCSPSSSQSPREHSLSPT FT 
PIKTRPIGNASKPQRLGNFIAHAGSDMNDEPSYKQAMEASDADEWRKAMQL FT EFDSLTQHSVGKLVARPEGARVIGGMWVLKKKRDEKGNVLKYKARWVCFGN FT RQVEGVDFHDTYSAVGKSDMFRLLVAIASYLKCLVIQFDIMTAFLHGLMKE FT CVYIQQVKGFVTPGTEGMVVQLGKSLYGTRQGARDFSDDLRRKLKAFGFIT FT SKADNCLFIFRRASSFLYLHIHVDDGFLVSNSDNLVEEFRLHLLKSYEVKW FT KTKPLLHLGMRIQYHQDGAISIDQAHFLQDMLDRFGFNTLNPVKVPLPLGI FT KLRNGTPEDVAAASYLPYQSLIGSLNWAAISTRPDIAFAVSQLSRYNSCYT FT FEHWNAAKHLVRYLKGSISRGIMFKGSMPADLKGYGDADYANDPTDRRSVT FT GYLFTYGGAILSWRSRCQKSTALSTTEAEYMSISDCARQAIWFKTLFHDLN FT LPVSAVSIASVGNAIQLFNDNRGTVFLCKEPVINNRSKHIDVRYHFIRDHV FT RLNNIKASHVPTTSMPADFLTKPLSVEAFERCCVQVSNVECSS" XX SQ Sequence 4523 BP; 1200 A; 998 C; 889 G; 1436 T; 0 other; taggttatga gccgttttac cgagtaaaca cagtttatac acatacgaca ttggggattt 60 ttaatttctc ctctatatcc catctggttt tccgtctgct tatttcgcat ctatatggct 120 tcttcgccta tcgactccgg atcacctgct cagttatccc cttcatccga ccttgatcaa 180 ctctccgatt cgtctgatct cgaatcgtca tctcagtcga caactcgtcc tttatcttca 240 atcatggcca ctacgaatac tcaagaggat cgtatcaatg accgcatcaa aattcctgaa 300 cttcgcgatg ggaattttcc tcactggaat aagcgtctac acttcgctct tcaaactagg 360 ggtttatctt cttttctaac ttctgatgct cctcctaccg atcctaccgc ccttcttctg 420 taccgacgac aacaaggacg agttatggaa atcatggttg agtcactgga cacggcgaat 480 gattctttag tcaagattac cgacacgcct aagattgctt atgaaacttt atgtaaagct 540 cacggcagta gtggtggtgt tttggcggct gctgctattt gtgaaatcgc tacggctagg 600 ctcaaacctg gtcaatctct gtccgacttc attatccgaa ttcgtaatct gcacaatcag 660 ctagctcaat attctgctga ggataaagaa attgctctgt cctctaagtt attagcgatt 720 tttctcttga acggtttagg gaaagaattt gagttcatca ctgctccttt cttcgccgat 780 ttgtccaatc tcactgttca aatggttatg gatagattgg ttattgaaac ggctaagcgt 840 tcgtcatcag gagtcacttc ctcgactaac gcttttactg ctaaagtggt accacccgct 900 caaacaacaa atcgtgtggt tcgacgtatt ggtcctggtc ctaacgacgt atgcaatttg 960 gagaatcatc gcggccttcc tcatacaaat cggaactgtc attttcaaaa cagtacaggt 1020 ccaaaagcta aagctaaatc ttcatctcca aatccaagtt caaacaagtt gtcagaatct 1080 gaaatggcga agcgttatct tgaaattgag gctgctcact tagccaaaaa gaaccctccg 1140 
cctgcttctg cctttaatgt aacgaccagt aatcaattta gtgctttgga gctggccgat 1200 gactttccag acaaaggttt tgctgctcat gcatattcgg ctcagacgga tggttcttct 1260 tatggatcga ttcttgcgga tacggctgct tcacgtaata tcgtttccaa caaacacatg 1320 ttcacaacct taaaagcaat cgattcggtg aagatcaatg gtatatcaaa tgttcctcag 1380 tatgcgactc atatcggaac tatcgtcatt ccgggttttt ctgaagtcac tcatcaaccc 1440 gtcgacatct tgataccgga agttctatat gttcctacta tgactgtcaa tttgttatcg 1500 ttgagccaat tatgtgccaa tggggcctcc ttctcgggtt ctgatgaaag cattacggtt 1560 gttggtttac ccaggtgatg cgtatttcat atgtcgtaag acatccgatg attgtttatg 1620 ggattgtgag gttcgcatta tgtcttcaat aaatgcatat tctgcctcgg ctgaatgttg 1680 gcatcaccgt ttaggtcacc tccattacga tgctatcaaa agattagctg caaaaggcga 1740 cattaaaatc tctggctctg tatcttctaa tcgttcacct tgtaatacat gtcagaaagc 1800 caagatttct cagcttcctt ttaaatctca ctttccatca tcatctgctg tgcttaatag 1860 aatacattca gatgtaatag gtccttttcc tccttctctt ggtggtgcta aatacttagt 1920 cacatttata gatgattgta ctagatatgc caccattttt cctatcaaac tcaagtcaga 1980 agtatttcat tgttttgtca attttaagac cagggtagaa ttaatttttg aatccaagat 2040 caaggtattt catagtgatc gtggcggcga atatcaatct aaagaatttc tatcttattt 2100 gacccagcat ggcatttctt tggagcaagg tcccgcggag actcctgagc agaattcaat 2160 ttctgaaagg tataatcgat ccttaataga acgggttaga tgtaacttac atcatgcttc 2220 attaccaaca aacatgtggg ctgaaatagc gttggcaaca acctttactc tcaaccactc 2280 gccacactct ttcttaaatc atgaaactcc acttaccaaa tggaatagct ttcttcccgg 2340 aaagggttat cacggacctg atccgtcgtt tttaagaaca cttggatgtt ctgctgtgta 2400 tctagctccg cggatcacag gtaaacttgg attaaaaggc agaaacgggg tgttagttgg 2460 atatgaatta atggctaaag cgtaccacat atgggactta ctggatagga agatcgtggt 2520 cactagatca gtactattta atgaggccga gtttcctttc gtgcatcttc ccaatatcgt 2580 atcttctcct tcttttatta ttttacaaga tcctggtagt gatattcctt ctatctcaac 2640 attatcttca acctcaacag tttcaaatct aaatcactcc aaggattcaa attctctttt 2700 atccacctcc gattcatctt ctacttccac tttaaacgtt gaaaatgttt ccaatagtcc 2760 tattcacaat ccagttctta aaacattgtt gcttcctact ttatgctctc cttcatcatc 2820 tcaatctcct 
cgtgaacatt cactttcgcc aaccccgatt aaaactcgac ctataggaaa 2880 tgcaagtaag ccacaacgtc tgggaaattt cattgctcat gctggatcag atatgaatga 2940 cgaaccttca tataaacagg ctatggaagc atccgatgca gacgaatggc gaaaagctat 3000 gcagttggaa tttgattcct tgactcagca ttctgttggt aaacttgtgg ctagacctga 3060 gggtgcgcgt gttataggag gtatgtgggt tttaaagaag aaaagagatg agaagggtaa 3120 cgtgctgaaa tacaaagctc gttgggtttg ttttgggaat cgtcaagttg agggagttga 3180 ttttcatgat acgtattcgg ctgttggaaa atcagacatg tttcggttgc tggtagctat 3240 agcatcatat ctcaagtgtt tggtaattca attcgatatt atgacagctt ttcttcatgg 3300 attaatgaag gaatgtgtat acattcaaca ggtcaaaggt tttgttacgc ctggtactga 3360 aggaatggtg gttcaactcg gtaaatccct ttatggaact cgtcaaggag ctcgtgactt 3420 tagcgatgat cttcgtcgaa aattgaaagc tttcggtttt atcacatcca aagccgacaa 3480 ttgtttattc atcttccggc gtgcatcatc attcttatat cttcacattc atgttgatga 3540 cggttttctg gtatcaaact ccgacaacct cgttgaagaa ttccgccttc atcttctgaa 3600 atcttatgag gtaaaatgga agactaaacc attgttgcat ctaggcatgc gaattcagta 3660 tcatcaagat ggtgccattt caatcgacca ggctcatttt ctgcaggaca tgcttgatcg 3720 attcggattc aacactttaa atccagtcaa ggttcctcta cctcttggca tcaaattacg 3780 aaatggtacc ccggaagatg tagctgcagc gtcctattta ccttatcaat ctctcatagg 3840 atctctgaac tgggctgcaa taagtactag gccagatatt gctttcgcag ttagtcaatt 3900 atcacgatac aattcttgct atacatttga gcattggaat gccgcaaagc accttgttcg 3960 atatctcaaa ggatcgattt ctcgcggtat catgttcaaa ggatcaatgc ccgctgacct 4020 caaaggatac ggtgatgctg attacgccaa cgatcccaca gatcgaaggt ctgtaactgg 4080 gtatttgttc acttacggag gagctatctt gtcttggcga agtcgttgtc aaaagtcaac 4140 tgcattgtcc actaccgaag ccgaatatat gtcgatttcc gactgcgctc gtcaggcaat 4200 ttggtttaag actttatttc atgatctcaa tctaccagtc tccgctgttt ctattgcttc 4260 tgttgggaat gctattcaac tgttcaatga taatcgtgga acagtgtttt tatgtaagga 4320 gcctgtcatc aacaataggt cgaagcacat tgatgtgcga tatcatttca ttcgtgacca 4380 cgtcagactc aacaatatca aggcatctca tgtgccaacc acgtcaatgc cagcagattt 4440 tcttactaaa ccactttctg ttgaggcttt tgagaggtgt tgtgttcagg tttcaaacgt 4500 agagtgctcg agttaggggg 
gaa 4523 // ID Gypsy-59_MLP-I repbase; DNA; FNG; 6026 BP. XX AC AECX01001344; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-59_MLP_; KW Gypsy-59_MLP-LTR; Gypsy-59_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6026 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001344; Positions 221474 227499. XX CC Positions [3083-3502] - Reverse transcriptase CC Positions [4829-5308] - Integrase core CC 'ATATC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 4934..5929 FT /product="Gypsy-59_MLP-I_3p" FT /translation="MGHFIPCNESMSAEDLADLMVKFVWRLHGTPKTITSD FT RGSIFISQITKELDKRLGIRLHPSTAYHPRTDGQSEIVNKAVEQYLRHFIN FT YRQDNWEELLPIAEFSYNNKDHASTGVSPFKANYGYIPNFGGVPLGEQCVP FT SVEERLKLLAEVQTELTESLHLAQEEMKIQFDKGVRSTPDWKVGDQVWLNS FT KNLSTTRPSHQMDFKWMGPFNIIEKISQSAYKLTLPASMRGVHPVFHVSLL FT RKHEPDTIQQRQQEEPKAIIIDHEDEWEVSEILDCRKKFNKKEYLVSWKGF FT GMEHNSWEPEVNLKNSNDLLNEFKMKFPNAVKNYKRTQSK" FT CDS join(2219..2809,2813..4597) FT /product="Gypsy-59_MLP-I_1p" FT /translation="MDPRREARNFSEGMCASTSALTSPQSESFLLTHSKSH FT LEKDGKLVSSGMIQPNRTTLPATRHDFDSHLEGQDQATEDCLHTQDLIVEA FT SVPHDTTEGIAADTASSSPKNNPGSPREEPTGHARNCDEGACTFLSTHMPP FT QCEFDPHSLNSAPETAGKPFCHLNNSDHYNIDAANASWSTSARFAADEKSK FT LAKRTVEMVPTAYHRYLNMFKKSKAQCLPPRRKYDFKIELIEGAQPQASRI FT IPLSPAENAALDEMINTGLANGTIRRTTSPWAAPVLFTGKKDGNLRPCFDY FT RKLNAVTVKNRYPLPLTMDLIDSLLDADEFTKLDMRNAYGNLRVFEGCEDI FT LAFICRAGQFAPLTMPFGPTGAPGFFQYFMQDILLKHIGKDAAAFLDDIMI FT YTKKGIQHKTAVLEILYILEKHQLWLKPEKCEFSKSEVEYLGLVISHNKVR FT MDPTKVKAVTEWPAPRTVNELQRFIGFANFYQRFISHFLQTTRPLHNLTKN FT 
NTPYIWDEQCNEAFEGLKAAFTSAPILKIADPYKQFMLECDCSDFALGAVL FT SQYSEDDGQIHPVAYLSRSLVQAERNYEIFDKELLAIVAAFKEWRHYLEGN FT PHRLEVIVYTDHRNLETFMTTKQLTRRQARWAETLGCFDFQIKFRPGRHAA FT KPDALSRRPDLEPSQNDKLTFGQLLKPDNIRPDTFTEISTLESFFKNEDVN FT LENSDYWFEVDVLGVSEAELGECDSSIDSSDTIEPQVSILNDNDLISMIRT FT ATHNDEQLREIMTATEHPISSKMRQAVSGYAIKDGILYNHG" XX SQ Sequence 6026 BP; 1812 A; 1344 C; 1387 G; 1483 T; 0 other; tattgtcggt tctaatccta acgggcatcg aggatatcag aagattagaa acgaattaaa 60 ttgatcaaag ttaaaattag aagatatcag attgaacctt atcgattccc aaagtctcga 120 tttagaaacg aagagaaata aaattgagac tcaaacttaa aagaagacca gatttattaa 180 gattaaaatt agaatcagaa tcagaatact tttgaattgg attgattaga ttaagttaaa 240 ctgagaaatt agaagaccga actcaacctt agaagattac cgaatcttga cgatcaccac 300 caaattcaac cagccacgtc tcccacttac agaaccccag acggtgacga tcaggatact 360 gattccgagg ctgaaacacc tttcgtcgac gttaacgccg gtctacccgt cacagaactc 420 acggatatcg agatggaaga cattcaacat cagttgaacg aactccaagc gtcgttagcg 480 agtgaacgcg ccttgcgtga acaagccgaa gctcgaggtc gacaagcgga ggagcgttta 540 gccgctattg aggctaatcg tcagaaccca aaccccactc cttcgcctac catggctact 600 cctcaaccag ctacaatccc agctgccaag ggtccgaaag tctccacccc ggataagttt 660 aatggtaccc ccggtagccc agcggaggtc tttgccagtc aagtccagct gtatatgctg 720 gcacacccgt ctttattccc agacgacaga actaaagtcg tatttgcttt gtcatatctt 780 actggcactg cgagcgcgtg gtcccagccc ttaatgactg aattactcga cgaatccact 840 gctcatctgg tcacgttcga gcgattcgtg cgtaatttca aagccatgta tttggcacgg 900 aaaagaaagc gaaggccgag aaagctctac gatctctgtc tcagaagact acagtagccg 960 cgtatactca tgagttcaac ttgtatgcta cgaataccgg ctgggaagtg ccgacgttga 1020 tcagccagta cgaacaaggt ctaaagcgtg atattagagt cgctatggtt ctagtccagg 1080 acgaatttac caccatcgag cagatctcta acttggctat taagttggac aacaagctac 1140 acggtgttgc cgacacttct acttcatcat caactccagc tcgcaaccca aacgccatgg 1200 atatatcttc atcctatact cgattgactg aagaagaacg tactcgccgt ttacgcacgg 1260 gttcttgttt taaatgcaat gggcagggtc atattgccaa tgcttgtcct aataatagag 1320 gtggtgatcg tcgaggaaga ggtagagggg gttatcaagg aggctatcgt gctagagtgg 1380 
ctgagttaga gatgaaagta gctgaattaa gtggaaagga gaatgaggcg agagatttag 1440 gagaaagtgc tagtcgtagt gatgcgtcaa aaaatggagg cgctcaagcc tgaaggtagc 1500 actgccagca tagatgcctc gggttgagaa atccgaggca tcatcggagg caaatcctcg 1560 gacagacctt gtccgaggtt tttgcccgag gtattttttt gccctgagaa ttacctcgga 1620 tcaaaacctc gggaagcttc tgtccgaggt tttgcatccg aggatacctc ggattttgta 1680 atccgaggat aaatatgttg gcggtgtagt gcctagcttg agcttagggg gaatgggaga 1740 ttcaattagt ttaggtgcta gtaatgttgt aacttgcaat aacaatgacc caagactatt 1800 tttatgagtt tcactttctt tgacccataa accccgcgcc acaccattct ttagaccatc 1860 agcccgactc ctgattgatt caggagccac tcacaatgtc ctgggagaca cttttgctcg 1920 agaagcggac cttctacgcc acggagtaag cacaacgagg gaaatcacag gtttcaacgg 1980 ctcgaagact acttcatctc atgaaatcga cttacttatt gactttgaca aatcaccgac 2040 gcatttcatt atcacgaacc tcaaaaatac ttatgatggc atactcggta tcccatggat 2100 ccgcgacaat agccaccgaa ttgactggac aagtggtatc gtaaccacca ctgatatttc 2160 tgctgcctct acttatgtag agtcgttaaa accgccaaca ccctctgtgg accccgcgat 2220 ggaccctagg agggaagcta ggaattttag cgaggggatg tgcgcttcaa cgagtgcatt 2280 aacatccccg cagagtgagt cctttttgtt aacccattca aaatcacatt tagaaaagga 2340 tggcaagctt gtttcttccg gaatgataca gcccaatcgt acgacgcttc ctgccacacg 2400 ccacgacttt gattctcacc tggaaggaca agaccaagcc acagaggatt gtttacacac 2460 tcaagacctt attgtagaag cctctgttcc acacgatacg actgaaggca ttgcggctga 2520 tacagcctcg tcgagtccga agaacaaccc tggaagccca agagaggagc ccacagggca 2580 cgctaggaac tgcgacgagg gggcgtgtac ttttttaagc acacatatgc ccccgcaatg 2640 tgagttcgat ccccattcat tgaattccgc gcccgagaca gctggcaagc ccttttgcca 2700 tttgaataac agtgaccatt acaacatcga cgcagcgaat gcatcatggt caacatcagc 2760 tagattcgcg gctgacgaga agtcgaagtt ggccaagcgg acagttgagt agatggtccc 2820 tactgcctac catcgttacc ttaacatgtt taagaagtcg aaggcgcagt gtttacctcc 2880 tcgacgtaaa tacgatttta agattgagct gattgaaggt gcacaaccgc aagccagtcg 2940 gattattcca ttatccccag cagagaatgc tgcgctggat gaaatgatca atactggact 3000 ggctaatgga acaatccgta ggactacatc gccttgggcc gcgccggtct tattcactgg 3060 aaagaaagac 
gggaatttga ggccttgttt tgattatcga aagctcaatg cggtaactgt 3120 caagaaccgc taccctcttc cattgacgat ggacctcatt gacagtctac tggatgccga 3180 cgagtttacc aaattagaca tgcggaacgc ttacggtaat ctacgagtgt ttgaaggctg 3240 cgaagacata cttgctttca tatgccgagc gggtcaattt gctcccttaa ccatgccctt 3300 tggaccaact ggcgccccgg gttttttcca atacttcatg caagatatcc tgctgaaaca 3360 cattggtaaa gatgctgcag ccttcttaga tgatatcatg atctacacca agaagggtat 3420 tcagcacaaa acagcggtac tggagatatt atacattcta gaaaagcatc aattgtggct 3480 gaaaccagag aaatgtgaat tctctaagtc tgaagttgaa tatttgggac ttgtcatttc 3540 tcacaacaag gttagaatgg atcctaccaa ggttaaagct gtgactgaat ggccagcccc 3600 tagaaccgtg aacgaacttc agagattcat tggattcgct aatttctatc aacgcttcat 3660 cagtcatttt ttgcaaacta ctcgaccttt acataatctg acaaagaaca acacaccgta 3720 tatatgggat gaacaatgta atgaagcgtt tgagggattg aaggcggcgt tcacatcagc 3780 gccaatcctg aaaattgccg acccttacaa acagttcatg ttagaatgtg attgctcaga 3840 tttcgccttg ggggcagtct tgtcgcaata ctccgaggac gacgggcaaa ttcatccggt 3900 ggcctatcta tctcgatcat tagtgcaagc agagcgcaac tatgaaattt tcgataaaga 3960 gctacttgcc atcgtcgcgg ctttcaagga atggcgtcat tacttggagg ggaaccccca 4020 tcgactagaa gtgattgttt acactgacca ccgcaacctg gaaactttca tgacgacgaa 4080 acaattgact agacgtcagg cacggtgggc ggaaactttg gggtgttttg attttcaaat 4140 caagttcagg cctgggagac atgcagccaa accggatgca ctctctagac gacctgattt 4200 agaacctagt caaaatgaca agctcacttt cggccaatta ttgaaacctg acaacattag 4260 accagacaca ttcacggaga tttcaacatt agaatcattt ttcaagaatg aagatgtaaa 4320 ccttgaaaac tctgattact ggtttgaagt cgacgtttta ggggtgtctg aagctgaatt 4380 gggggagtgt gattcatcaa tcgacagcag tgataccatc gagccacagg ttagcatatt 4440 gaatgacaat gacctaatat caatgatccg cacagccacg cacaacgacg aacaactacg 4500 cgagattatg acggcaacag agcatccaat ctcttcgaag atgcgacaag cggtgtcagg 4560 atacgctatc aaggacggta tactgtataa tcatggttga atcgaggtcc cagacgtaga 4620 cgacataaaa tttcaaattc taaggagtag acatgatagc ttactagcag gtcatcccgg 4680 taggaataag acactcaact tagtgcgaag aagtttcatt tggccatctc agaaggcgta 4740 tgtcaataga tatgttgacg 
gttgtttgtc atgcttacga agcaaaacaa gtaaccagaa 4800 gccgttcgga tcactggaac ctcttccgat accgactgga ccttgggtag atattagcta 4860 cgatctcatt acgaaattac cgaaatcaaa cggcaaagat agtatactga ccgtagtaga 4920 cagactgacg aagatggggc attttatacc atgcaacgag tcaatgtcag cggaggatct 4980 ggccgattta atggtaaaat tcgtgtggag attacatgga actccaaaga cgatcacgtc 5040 ggaccggggg agtattttta tatctcaaat caccaaagaa ctcgataaga gattgggtat 5100 tcgcttacat ccatccacag cttatcatcc tcgcacagat ggccaatcgg aaatcgtcaa 5160 taaggcagtg gaacagtacc tacgccattt tatcaactat cgtcaagaca actgggagga 5220 gttgctgcca attgccgaat tctcatataa caacaaggac catgcttcaa ctggggtatc 5280 gccatttaag gctaattacg ggtatattcc gaatttcggt ggagtaccac ttggggagca 5340 atgtgtccct agtgtagaag agaggctgaa gctattagct gaagtgcaga cagaactaac 5400 cgaaagttta catttagcac aggaggaaat gaagatacaa ttcgacaagg gagttagatc 5460 aacaccagat tggaaagtgg gtgatcaggt gtggctcaac agcaagaatt tatcaacgac 5520 aagacccagc caccaaatgg atttcaaatg gatgggtcct ttcaacatta ttgaaaaaat 5580 ttcacaatct gcgtataaac tgactttgcc tgcttctatg aggggtgttc acccggtgtt 5640 tcatgtatct ttacttagga aacacgaacc tgacacaatc cagcaacgtc aacaggagga 5700 accaaaagct atcatcattg atcacgagga tgaatgggag gtatcagaaa ttttagactg 5760 tcgaaagaaa tttaataaaa aggaatactt agtcagctgg aaaggatttg gcatggaaca 5820 taactcatgg gaaccggaag tcaacctcaa gaatagtaat gatttattaa atgaatttaa 5880 gatgaaattt ccaaatgcgg tcaagaatta taaaaggaca cagagtaagt gagagggcaa 5940 gctttttccc acagggtttt ttaatgctgc ccgtggaaag aatgcagaac tcgcaagagg 6000 gggtttgggc ataaaaaggg ggataa 6026 // ID Copia-2_MVPL-LTR repbase; DNA; FNG; 227 BP. XX AC AEIJ01000987; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Microbotryum violaceum genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_MVPL_; KW Copia-2_MVPL-I; Copia-2_MVPL-LTR. 
XX OS Microbotryum violaceum OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Microbotryales; Microbotryaceae; OC Microbotryum. XX RN [1] RP 1-227 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Microbotryum violaceum genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AEIJ01000987; Positions 6713 6487. XX SQ Sequence 227 BP; 29 A; 57 C; 62 G; 79 T; 0 other; tgatatgggt ctgtgcctgt ggtatctgcg cagatgcgcg tgcgtttagt cattagtcac 60 tttggtattg gagtttgatc tgtttgagtt agggagcgcg cggggctttt cttcccgcgc 120 ggctgtttac tcttggtagc tttctcatca ctcgctccac attgtggtcc tgacttctcg 180 gtgttgtgca ccctcaccta tcggaaggct ttacttccgc gctctca 227 // ID Gypsy-41_MLP-LTR repbase; DNA; FNG; 314 BP. XX AC AECX01002345; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-41_MLP_; KW Gypsy-41_MLP-I; Gypsy-41_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-314 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002345; Positions 9777 10090. XX SQ Sequence 314 BP; 72 A; 65 C; 72 G; 105 T; 0 other; tgtaagggta tgcccttaca ggcagtgtgt acagaagcgt atgggtttag acagtagtat 60 agttagtcta ttcatcctga ctcccaagtg actcttttct ccttaggaga ctctccactc 120 ggagtcaggt aagtcatcat tcagttcttt ctattttagt tctcttgttg tgattaggtt 180 gtctggttgc tgattagagg agactctcca ctcggagtca gttttgggaa ttctatagag 240 acgttatata gttgacttcg agcctctgag ctgtcccaca agaacccagt gaaggttcta 300 tcctgaacct taca 314 // ID Copia-1_CCO-LTR repbase; DNA; FNG; 233 BP. XX AC AACS02000001; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 
16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_CCO_; KW Copia-1_CCO-I; Copia-1_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-233 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000001; Positions 4069281 4069513. XX SQ Sequence 233 BP; 52 A; 49 C; 43 G; 89 T; 0 other; tgttgaattt cggtaaagtc tcgtaggact attcgtccta tttcgtcttc cttattgttc 60 gtccatcctt gtatcttcgg gaggacttat atcgtcttca tatcgtctag aattcgggta 120 attcttaggc tacaacctcg ggtattcata ggtgacttga attgtattta agctgggtat 180 ttacatcgga atacatcatc gttctactcg ataactcgtc tactttctca aca 233 // ID HOBS_LTR repbase; DNA; FNG; 750 BP. XX AC DQ370139; XX DT 09-MAR-2006 (Rel. 11.02, Created) DT 09-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Ustilago maydis retrotransposon HobS LTR retrotransposon (LTR). XX KW Copia; LTR Retrotransposon; Transposable Element; HOBS_LTR. XX OS Ustilago maydis OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina; OC Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Ustilago. XX RN [1] RP 1-750 RA Kamper J., Kahmann R., Bolker M., Saville B.J., Banuett F., RA Kronstad J.W., Gold S.E., Perlin M.H., Woesten H.A.B. et al.; RT "Living in pretend harmony: the genome of the biotrophic fungus; RT Ustilago maydis."; RL Unpublished (2006). XX DR EMBL/GenBank/DDBJ; DQ370139; Positions 1 750. 
XX SQ Sequence 750 BP; 196 A; 225 C; 187 G; 142 T; 0 other; tgttagaata tctacgattg gcgtttagtc tgatcgccca caatcatgca acccttgcag 60 taacctaacc ggtatacacg agtgaccgga cgggaacgag taggtgtctc agtacgcgcg 120 ctatacgcgc gcgacacgca ctcaatacac cagcggtacg tccgcggtac gcgtgcgata 180 cactcgcagt acgcgcgcca tacgcgcgcg acacgcgctc aatgcacccg cagtacgacc 240 gcggtacgcg tgcgatacac tcgcagtacg cgcgccatac gcgcgcgaca cgcgctcaat 300 gcacccgcag tacgaccgcg gtacgcgcgc aagtatagta actcgccaga gtttccccgc 360 ttcacaacat agcgctggta gcgagaagcc cagagtagat tcacacaagt cgctggacga 420 ctgtgggaac acgccgttgg ctacagactc gtaccaagtg gaagaacctg agggagaaca 480 gactcatgca tctggccagt tgtggccaca gctagtcagt tgtatgagga gctgacgtat 540 agggagaagc tcatgtagga tactccagtc gactcatgta cataaataca gagggaacgc 600 tcacactttg tagttagtcc ctcgtccaat tcacactatc caaatacatc ccaaaccatt 660 ctcgacccta ctcagtccca actcgcctct ctaaggaaag ggagacttcg atcatccagc 720 gcccgctaag gattcaccgc gaatattgaa 750 // ID Copia-65_MLP-I repbase; DNA; FNG; 5932 BP. XX AC AECX01000622; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-65_MLP_; KW Copia-65_MLP-LTR; Copia-65_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5932 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000622; Positions 41702 35771. XX CC 'GTTTT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1312..2319 FT /product="Copia-65_MLP-I_1p" FT /translation="MLLALIPDVADYLEPTMIPGDENYRPKYAEVVNSILY FT WTMHDELSRMVDSIEHPCGRLAELRKQFSKVSFAGRCAKLIELNDLKYDSG FT VGSLESHLNVLRKKKEDLKMIGLDLSDEMFAIIVRNSMPADFPNVDAAFED FT RIAQNPATVITTSDVQKAITAADIQYRRVKGPEALKVTTRIPPTSGNSSAS FT TSFNTYRPQRCYNCNGFNHFANQCRRKRQFQSREKASTSGSVKTNEVELEM FT AELDLQPWDDPKDVTVSDVRLNTSVDDAIFDTGATHHIFNSPETFMNLKKI FT SPIKSISYRLGKRLRYGKFYRTKGQNSQSCQNHASVPTESVEIL" FT CDS 2982..5903 FT /product="Copia-65_MLP-I_2p" FT /translation="MVADGSTKAVITGVGSVRISDMDNPLLNCVLEPVYLC FT EGLRHNLVSGIAIHDQGINFESDKTGLKLRMKDGVMMKAKRVGQRWIFKVL FT KSSVETMAIGSYKLWHERLGHPNERVLRKMIADGACHGLPMKLGPTERCEV FT CADAKSTKTNRIGPSFMSHDQPLSLIVADLCGPFQTKSVGGASYFLNIRDV FT HSTFIKVYPLNQKLDTIACVKRYIAEVERLTGLKVKRWRNDGGKEFMNQAL FT ETYLASLGIVLEKTMRYHHEQNGTSEQSQRTIQSIMRCLLFQEDIPKSFWG FT FAVAAAAYLHNRTPNSNTNGKSPQEILFKEKPQADHLCVFGSWAFVHVPQE FT IRKKLDHRAVKCRFIGYPSGSKGWKFWDPKTNMFLESAHAEWLEEEASNEF FT VKHSTDEWPIPDKTSGIQHLLNDLNISHEEEELLSVLEVAFELDDSKITEG FT VREQDVLVQHIQMLAAGIAHKIPSGFKTAMKSSEKDQWKQACDKEIEMLKR FT MNVWVEVALPEGKRAVSTKWVFAKKLDGDGEVKKYKARFVVRGFNQEQGVD FT FHETFAPTARFASLLIILAISVCRGWIVKGFDVVSAYPHSPIDEEIYILAP FT EGYPCLREGDVLKLRRALYGTKQAARCWWKFFSNVLSKVGCKYCVNDQSIY FT VLKSEGDVAIVWIHVDDGLVCSSSDKMINYLRESLEKFFELVWQDEVEQIV FT GVKLVKKDDGLFLTQEVLATQIVTEAGFLMSSAETPMVAGLSLESAKDDEV FT ALEASKYLSLIGSLSYLAVGTRPDIAFTVNYLARFSAKPLSSHWTALKHLL FT QYVSGTRTNGLFFSRKDPNEFIEAYCDANWGGEFSRSTHGYVIFLFGCPVA FT WASRRQSCVATSTCHAEYMALGTTARELVWILNVVEEMLGKRIKGKMLCDN FT TAAVKVAKDLHMTKRSHHVAREFHYVNELVFDGIVEVLWIDGRNQKADIMT FT KPLGHVLFNFFKPLLCMNG" XX SQ Sequence 5932 BP; 1728 A; 991 C; 1393 G; 1820 T; 0 other; ggttatgagc ctgcgtctaa aatagctgca ttgaaattcc cagatattct agattcttca 60 aacttaatct tgaaaattca tttttattaa tagagtgacc acccgatcaa tcaaacttaa 120 atcggatatt cgacctggaa tctaaaccga agatgactcg tgcagctgca aaggcattcg 180 ccaataggtt tgcatcaatg actaaagatc tgccgattaa gttaagcagc atcgcgaaac 240 tggaagtcga cggttcaaac tttacaacat gggaacgtaa tattacaaat tacttagagc 300 
atccgcactc gtgcggtaaa tcgcgtgcgg taaggtaaaa tttaccgcaa ctgcggggcc 360 tttcccacac tggtgagaaa ttaggcccgc ggtagttaac ttttgatcga tttgcgttag 420 cgaatcggta aatttaccga ttggctaacg catgtggtag ccaaattacc agccccctct 480 tcacttctca agcacaccaa cttgccaact tgccaacttg actcaactcc gactcaactc 540 ctcttgattc ctctcgactc aaaacacccg acttaagaat caatggagac ccgactgtct 600 tctaattcat caaactaatg ggtatgtctg gtttcttact gtatgattct ctttatccaa 660 cattgattaa gattgttcta ctttacatgt gattatatat atgttcattg agattttaga 720 tttatcaata tcgaattggg tttaaaaggg ttttagatca aagtggatca atttctgatc 780 gtttgttttg attttagaag gttttgaaca tgtcttgtcg tttggattga cttcaaaagg 840 atttgaaaaa tagatatcac attcaactgt agaggggaac tatgcaaaag ggtcttcatt 900 atctgcattt cttttcaaat tctcaatggt cttcaacatg acaactttta attgatggtg 960 caatgaattt ttgtttgctt gtctacaaac ctctctgtgc caaccatcca gtggtgatga 1020 gatctcaagt tactctcaaa tgaacttccc tgtcccaaag ctcttcttcc ctggcaggga 1080 tcgcttgaaa tgcactgatt gactgaaaag tcgattgatt tttttgtttg gtatattttg 1140 agtctgtaaa ctctgcatca gtgtggaaac tcaaattacc tctcaagtta gttaactacc 1200 gtgcaggtaa ttttggcacg agtgtgaaat ttggtcacgt taagtaattt tttacttaat 1260 gtagaggttg ttaccgcaca tgaccaagtt aacaaccgca aacgagtgcg gatgctctta 1320 gctttgattc ccgatgttgc cgactatctt gaacctacta tgattccggg ggatgagaat 1380 tacaggccca aatatgctga agtggtaaac agtatcttgt actggacgat gcatgacgag 1440 ctgtctagga tggtggacag tattgagcat ccgtgtgggc gtttagctga gcttaggaaa 1500 caattttcaa aagtttcgtt tgccggaaga tgtgctaagt tgattgaact caatgacttg 1560 aaatatgata gtggtgtagg aagtttagaa agtcatttga atgtgttaag gaagaagaaa 1620 gaagatttga aaatgattgg attggattta tcggatgaga tgtttgctat tatcgtcagg 1680 aattcaatgc cagcagactt tccaaatgta gatgcagctt ttgaagaccg tattgcgcaa 1740 aatccggcaa ctgtaatcac aactagtgat gtacagaagg cgattacggc ggctgatatt 1800 caatatcgaa gagttaaagg tccagaggct ttgaaggtca cgactcgtat tcctccgaca 1860 agtggtaact cgtcagcatc cacttcgttc aacacgtaca ggccacaaag atgttacaac 1920 tgtaacggat ttaatcattt tgcgaatcag tgtagacgca aacgtcaatt tcaatctcgt 1980 gagaaggcat ctaccagcgg 
atctgtcaag actaatgaag tcgagttaga aatggctgaa 2040 ttggatcttc aaccttggga tgatcctaaa gatgttactg ttagtgacgt tcgcctgaat 2100 acttcagttg atgatgctat ttttgatacc ggggctaccc accacatttt taattcacca 2160 gaaacattca tgaatctcaa gaagatatca ccgattaaga gcatctccta tcgtctcggt 2220 aaacggcttc ggtacggtaa attttaccgt accaagggtc aaaactccca atcgtgtcaa 2280 aatcatgcat cggtacccac agaatctgta gaaattctgt agaggtccag taaatttgct 2340 ggacctctac caattctgca gaggtcggta tctgcaaatc tgctcgactt ctaccgagac 2400 gatcctacaa ctcatcctca aatcagtccc actctatccc actacacctg tatcaccatc 2460 aaacaccaat ttcatcttgt aaattctccg ttttctgtga tcaatggttt tcaaatggtc 2520 atatagataa cttggctgtt ctctaattga tctttttcta attgatcttt ttctaatcga 2580 tcgttttttg attgactttt ttcaatcgtt tgttttcagc tcgtttgatt tcagctcgat 2640 tgttttcagc tcgattgttt tcaagtcaat tgttatgaag tcaatcatct tgttttcaaa 2700 gtgattgttc tgaaatcaat tgtcatcaga tcaattgtct ttgcaaactg attgtgctcg 2760 gatggattgt tgcgtgatag aacatgagat ttgcagtttt ttttaccgaa catacgatag 2820 tggcaatgag gtgtttctgt acagtatgta ctgtactttg tttgatataa tgcaacgata 2880 gtggatttcc accgttctgt atcaagtgat accgtaccga gtcagtatgt acagaaaatt 2940 tacagaaaat ttactcgaac gataggagat gctctaatgt gatggttgct gatgggtcta 3000 ctaaggcggt gattacaggt gttggaagtg ttaggatatc agatatggat aatcctttgt 3060 taaattgtgt attagaaccc gtttatctgt gtgagggttt gcgtcataat cttgtttctg 3120 gtattgcaat tcacgatcag ggaataaatt ttgaatctga caagactgga cttaaattga 3180 gaatgaaaga tggagtgatg atgaaagcta agagagttgg tcagaggtgg atttttaaag 3240 ttctgaaatc aagtgtagaa acgatggcga ttggttctta taagttgtgg catgagaggt 3300 taggacaccc gaacgaacga gtgctccgta agatgattgc tgatggagca tgccacggac 3360 tgccaatgaa gcttggtccg actgaaagat gtgaagtttg tgctgatgcg aagtctacaa 3420 aaacaaatcg gataggacca tcatttatgt ctcatgatca gcctctcagt ttaattgtag 3480 cggacctgtg tggacctttt caaacgaaat ctgtgggggg ggcgtcgtat tttttgaaca 3540 tcagggatgt acattcaact ttcatcaagg tttatccgct taatcaaaaa ttggacacga 3600 tagcttgcgt taagaggtac attgctgagg tagaaagatt gaccggatta aaagttaaga 3660 gatggagaaa tgatggtggt aaagaattta 
tgaatcaagc actggaaacc tatctagcaa 3720 gtctaggaat tgtactagag aagactatgc gttatcatca tgaacaaaat ggaacatctg 3780 aacaatcgca aagaacaatt cagtcaatca tgagatgttt gctatttcag gaagacattc 3840 cgaagtcatt ttggggtttt gcggtggcag cagcagcgta tttacacaat cgaacaccaa 3900 actctaacac gaatgggaaa tcacctcaag agatactatt caaggaaaaa ccacaagctg 3960 atcatttgtg tgtttttgga tcttgggcat ttgttcatgt acctcaagaa attcgtaaaa 4020 agctggatca tcgggctgtc aagtgtagat tcataggtta tccatctgga tcgaaaggat 4080 ggaaattctg ggaccctaaa acgaatatgt ttttggagtc agctcatgct gagtggttgg 4140 aggaagaagc tagtaatgaa ttcgtaaaac actctaccga tgaatggccg attccagata 4200 agacttcagg gattcagcat ttactcaatg atttaaatat cagtcatgaa gaagaagaat 4260 tgcttagtgt acttgaagta gcgtttgaat tagatgatag taagataact gaaggagtcc 4320 gagaacaaga tgtattggtt cagcacattc aaatgctcgc agcaggtatt gcgcataaga 4380 taccatctgg tttcaagacg gctatgaaga gttctgagaa agatcagtgg aagcaggcgt 4440 gtgataagga gattgagatg ttgaaaagaa tgaatgtgtg ggttgaagtg gcgttacctg 4500 aagggaagag agctgtgtct acgaagtggg tgtttgctaa aaagttggat ggagatggtg 4560 aagtgaagaa gtataaagca agatttgtgg tcaggggttt caatcaagaa caaggtgtgg 4620 attttcatga gacgtttgcg cctactgcta gatttgcatc tctattgatc attttagcaa 4680 tatcggtgtg tagaggatgg atagtgaagg ggtttgatgt ggtttcagcg tatccgcata 4740 gtccaattga tgaagaaatt tacattttgg cacctgaagg ttatccgtgt ttgagagaag 4800 gagatgtttt gaagttaaga cgtgcattgt atggaactaa acaagcggcg aggtgttggt 4860 ggaaattctt ttcaaatgta ctctcgaagg ttggttgcaa atactgtgtt aatgatcagt 4920 cgatctatgt gttgaaatca gaaggagatg tggcgattgt gtggattcat gtggatgacg 4980 ggttggtttg ttcatcaagt gacaagatga ttaattattt gcgagaatct ctcgagaaat 5040 tttttgaact ggtgtggcaa gatgaagtag agcagatagt aggtgtgaaa ttggttaaga 5100 aggatgatgg tttattttta actcaagaag ttttagcgac tcagatcgtg acggaggctg 5160 gatttttgat gtcgagtgcg gaaacaccta tggtggcagg tttgagtctc gaatcagcaa 5220 aggatgatga agtagcatta gaagctagca agtacttgtc tttaattggg tcactgagtt 5280 atttagctgt tggaacacgt ccagatatag cgttcactgt caattattta gcacgattct 5340 cggcaaaacc attatcgtct cactggacag cattgaaaca 
tttactgcaa tatgtatcgg 5400 gtacgagaac aaatggtttg tttttcagca ggaaagatcc aaacgaattc atagaagcat 5460 actgtgatgc gaattggggg ggggagtttt cgagatctac gcatggatat gttattttct 5520 tatttggttg tccggtggct tgggcatcga ggaggcagag ttgtgttgca acatctacat 5580 gtcacgctga atatatggca ttaggtacta cagcacggga attagtgtgg atattgaatg 5640 tggtggagga aatgttgggg aaacgtatca aaggaaagat gttgtgtgat aatacggcgg 5700 cagttaaagt agcaaaagat ttacatatga caaaaaggtc tcatcatgtg gcgcgtgaat 5760 ttcattatgt gaatgagtta gtttttgatg gtattgtgga agttttatgg attgatggga 5820 ggaatcagaa ggcggatatc atgacgaaac cgcttggaca tgttcttttt aattttttca 5880 aacctttact ttgtatgaat gggtgatgtg catcgctagt gaaagggggg gg 5932 // ID Gypsy-36_MLP-LTR repbase; DNA; FNG; 391 BP. XX AC AECX01001004; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-36_MLP_; KW Gypsy-36_MLP-I; Gypsy-36_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-391 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001004; Positions 64623 65013. XX SQ Sequence 391 BP; 100 A; 86 C; 66 G; 139 T; 0 other; tgtaagacct gcggccatta caatacataa atactaagat acatatcaaa actacttaat 60 tattaacaga cagatttcta tttcctttat atagttgttt tgacttctac gcttggttcc 120 tcagtgattc gggactgacc ttcctacccc ttatcctttg gaaccaggta aagtccttat 180 ttcaaatact tgttttcctt tactttcaat cagacttgta ggagtgattc gggactgacc 240 ttcctacccc ttatcctttg gaaccagtct tacgtactgt gttttgatag atttgaaata 300 ataaggtctc ttgggttcta gaaactctcg tgctgtgatc atcacagcgt cccttgtgcg 360 atttccagtg aaggttctta ggaaccttac a 391 // ID Gypsy-73_MLP-I repbase; DNA; FNG; 5689 BP. 
XX AC AECX01001139; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-73_MLP_; KW Gypsy-73_MLP-LTR; Gypsy-73_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5689 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001139; Positions 9125 14813. XX CC Positions [4492-4971] - Integrase core CC 'GCTTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 347..1465 FT /product="Gypsy-73_MLP-I_2p" FT /translation="MNPEHEQPSMAEILRQMRELNTQFLEVKASLAEESQK FT QEEAERRSDEAEQRLRQFETTRTTSTQLPHSSAPIAQTSVPQTISQIKPPK FT IATPNKYDGTKGQKAEVYVNQVCLYMQMNAAAFVNEQAQVAFALSYLDGKA FT SIWSQNFTDQLLDQDKMKQVTWKKFIESFKATFFDSERLAKAEKEMRALTQ FT TKAVVDYWIKFSELALIVKWPENVLLSQFKQGLKKEITVHMVRDEFKEVEE FT MAKLAIKLDNEINKRDSNTHHPIVSTNTTSTSTTITDPDAMDCSAYRVNIS FT NEEYNKRGAMGMCYSCGKDDHFIRDCPTRKRRGGRGGWRGRGGYGGYGKFK FT SKVAEVDGVKEEEKDDGRAEASKNGGAREC" FT CDS 1636..5595 FT /product="Gypsy-73_MLP-I_1p" FT /translation="MSHRFVSANKFKTDPLPQTRSVTGFSGHESRITHTVD FT LCVNSIESTPTTFIITELRDKYDIILGMPWIRQNFEKIDWKNGQLKENHSL FT IAVVESTSSRPDKASRNHKEEPMRHARNLDEGVEPENSLTPPQCAYISNYS FT NDKKEAASKPLHHVENLTHTNSNEHIQLPEQEKRNETAADRKAALLVPKNP FT LEDHERRDPTRQARHHDKGVEFKLSSSQPPQSMCKHSSISNVNEHAGKRFS FT FQPQHAIQRLNSSLNRQNKVMDQLRTLQPPTKSMIDVMKTSWNLSAKIAAD FT QVKDSPTKNAAELVPECYHDYLPMFEKSNSDVLPPHRPYDFRVDLIPGASP FT QAGKVIPLSPKETEVLNEMLEKGLKNGTIRRTTSPWAAPVLFTGKKDGNLR FT PCFDYRRLNAITVKNKYPLPLTMELIDSLLNADEFTSLDMRNGYNNLRVRE FT GDEAKLAFICKSGQFEPRTMPFGPTGAPGFFQYFIQDILKAHIGRNVAAYQ FT DDILIYTGPGVNHQEVVKEVLEILRRQNVWLKPEKCKFSKKEISYLGLIIS FT 
KNQIRMDESKVKAVKDWPTPKNLSEVQTFLGFANFYRRFIDQFSKIARPLH FT ELSQKNVAFEWNEQRQRAFETLKTAFTTAPVLKIADPYKAFVLECDCSDYA FT LGAVLSQVSDDDNELHPIAYLSRSLIQAERNYEIFDKELLAVVASFKEWRQ FT YLEGNPNRLNVIVYSDHKNLQSLMTTKELTRRQARWAEILGSFDFEIRFRP FT GRQSTKPDALSRRPDLVPAEGTKLTFGQLLKPENLPADAFIDELEIIDKWF FT EEEELLEQDEENNSEEEDNNQMLMTDREILQIIKMKSKSDPKINEMIRLCA FT EMPMSKHIKDYQTIDGILYFKNKAVIPSDTDLKLHILRSRHDSRLAGHPGR FT MRTLALVKRAYHWPSMKAFINKYVDGCGSCQRVKSRTEKPFGSLQPLPIPE FT GPWLDICYDLITDLPKSGGYDSILTVVDRLTKMVHFVACRKSMKSEELADL FT MIKEVWRLHGTPRTVTSDRGNIFISRITKDFHRRLGIKTQSSTAYHPQTDG FT QSEITNKAVELYIRHFTAYKQDDWLTLLPFAEFSYNNSEHLAIGVSPFKAN FT YGFNVNFTDVPSSEQCLPLVEQRIDQIKNVQRELKDAMGLTQELMKMQHDS FT KVRDTPNWKKGDKVWLNNKHLSTTRPTAKFAHRWVGPFVISARVSKNAYKI FT VLPKSMSKVHPVFHVNLLRKFEESKIPGHHKMPPPPIVMNNEDEFEINEIL FT DKRKRGRKLEYLINWKGYGEEHDSWEPEEGLKNAKQLMNDFNRKYPDAEVR FT YKRTRRKK" XX SQ Sequence 5689 BP; 2003 A; 1108 C; 1212 G; 1366 T; 0 other; tattgctaag tctcatcaaa agagatccaa gagaagaatc accgaaagta agacgaaatc 60 aaaacgaacg attaagaaga aagttaaaag aagttaagaa gaaagatttt caaagttaat 120 aaagttaaat taaagtataa acttaatctg cactccgcaa gcaaagttag aagaaagttg 180 attactcagt attaccacag aagtaacatt taaagaagat aacttatacc ccgcataatc 240 tcgaacacca ctgcatcctt ccaaaacgcc aaactttaag cttccctcta actcaacgga 300 ctcggagtca tcggaaaaag aattctacga gacagaagag accgcaatga atccagaaca 360 cgaacagccc tctatggccg agattttacg gcaaatgaga gaattaaaca ctcaattctt 420 agaggtcaaa gcaagcttag ctgaagaatc gcagaaacaa gaagaagctg agagaagaag 480 tgacgaagca gaacaacgac tacgtcaatt tgaaacaacc cgcactacct ccactcaact 540 acctcacagc tccgccccga tcgctcagac ctctgttccg caaacgatta gtcaaataaa 600 gcctcctaag atcgccactc cgaataaata tgatggtact aaaggacaga aagccgaagt 660 ttatgttaat caggtctgct tatatatgca aatgaatgct gctgcatttg tgaacgaaca 720 agctcaagtt gcgttcgcgt tatcttacct tgatgggaaa gccagtattt ggagccaaaa 780 ttttaccgat caactactcg accaagataa aatgaaacaa gttacttgga aaaaattcat 840 tgaatcattt aaagcaacgt tctttgactc tgaaagattg gctaaagcag aaaaagaaat 900 gagggcgtta actcaaacaa aggcagtggt agattattgg 
attaagtttt cggaattagc 960 gttgatagtt aaatggcctg agaatgtatt actatctcaa tttaagcaag gattaaagaa 1020 agaaatcacg gtgcatatgg tacgggacga attcaaggaa gttgaagaaa tggcaaagtt 1080 agctattaaa cttgataatg aaattaataa aagagactcc aacacgcacc acccaattgt 1140 ttcgaccaac acaacttcaa cctcaaccac tattactgat ccagatgcaa tggattgttc 1200 ggcttatcgt gttaacatat caaatgagga gtataacaag agaggagcaa tgggaatgtg 1260 ttatagttgt ggtaaagatg atcattttat tagagactgt ccaactagaa aaagacgtgg 1320 aggtagagga ggttggagag gtagaggagg atatggaggt tatgggaaat tcaaaagtaa 1380 agttgctgaa gtggatggtg tcaaggaaga agagaaggac gatggaagag ctgaagcctc 1440 aaaaaatgga ggcgctcggg agtgctagtg gtgccacact cgagcggtaa ttacttaact 1500 gaagattgta gtagcttaaa ttcacttgaa ataaaagata cgcgaatcat tgattttgtt 1560 actattttcg aagttaagag tgccaccccc attgttgcga gagccttagt agacagcggt 1620 gctacacatg aagcaatgag ccaccggttt gtctctgcca acaagttcaa gacggaccca 1680 ttacctcaaa ccagaagtgt aactggtttt agtggtcacg agtcaagaat cacccatact 1740 gttgacctct gtgtcaactc cattgaatca acaccaacca ccttcataat cactgaattg 1800 cgcgacaagt acgacataat tcttggaatg ccttggatac gacaaaattt tgaaaagatt 1860 gattggaaga acggtcaact caaggaaaat cattcactca ttgcagttgt agaatcaact 1920 tcgtcaagac cggacaaagc ctcgaggaac cacaaggagg agcctatgag gcatgctagg 1980 aatcttgacg agggggtgga gcctgaaaac tcattaacac ccccgcaatg tgcgtatatt 2040 tctaattatt caaatgacaa aaaagaagca gctagcaagc cattacatca tgtagaaaac 2100 ttaacacaca cgaattcgaa cgaacacatt caactgccgg aacaagagaa acgaaatgaa 2160 actgcagctg accgtaaggc agctttgttg gtgccgaaaa accccttgga ggaccacgaa 2220 agacgggatc ctacgaggca agctaggcac catgacaagg gggttgaatt caagttgagt 2280 tcgtcacagc ccccgcagag tatgtgtaaa cattcttcaa tctcaaatgt gaatgaacat 2340 gctggcaagc gtttttcttt tcaaccacag catgctatcc aacgactcaa ctcaagccta 2400 aatcgtcaaa acaaagtgat ggatcaacta cgaactttac aaccaccgac aaaatcaatg 2460 attgatgtaa tgaaaacgtc atggaacctg tctgcgaaga tagcagcaga tcaagtgaag 2520 gactcaccta ctaagaacgc ggctgaactc gtacccgaat gctatcacga ctatttacct 2580 atgttcgaaa agagcaattc tgacgtttta ccacctcatc gcccatatga 
tttcagagtt 2640 gatttgatac ctggagcatc gccacaagca ggaaaagtta ttcctttatc accgaaagaa 2700 actgaagtgc ttaatgagat gcttgaaaaa ggactgaaaa atggaacaat tcgccgtaca 2760 acttctcctt gggcagctcc tgtcttattt accggcaaga aagacggaaa tttacgaccc 2820 tgtttcgatt acagacgact caatgccata actgtaaaga acaaataccc ccttccactg 2880 acaatggaac taatagatag cctactaaat gcagatgaat tcacgagttt agatatgagg 2940 aatgggtata ataatctaag ggtcagggaa ggcgacgaag caaaactagc atttatatgt 3000 aaatctggcc aatttgaacc gcgaacaatg ccctttggac caacgggagc gcctggattt 3060 tttcagtact tcattcaaga tatactgaaa gctcatatag gccgaaatgt ggctgcgtat 3120 caggatgaca tattaatcta caccggtcca ggagttaatc atcaagaggt cgtgaaagaa 3180 gttttggaaa tattaagacg tcaaaatgta tggttaaaac ctgaaaaatg taaattctca 3240 aagaaagaaa tttcgtattt aggattaatc atctcaaaga atcaaattag gatggatgaa 3300 agcaaagtca aagcagtaaa agactggcca accccaaaaa atttatctga agtacagacc 3360 tttttgggat ttgcaaattt ttacagaaga ttcattgatc agttctcaaa gatcgcacga 3420 ccactgcacg agctgtcaca aaagaatgtt gcgtttgaat ggaatgaaca acgtcagaga 3480 gcattcgaaa cactcaagac tgcgttcacc accgccccgg tattgaaaat tgcggatcca 3540 tacaaagctt ttgtattgga atgtgattgc tctgactatg cgctaggagc ggtactttct 3600 caagtgtctg atgatgacaa tgaactacat cctattgcat acttatcaag atctctaatt 3660 caggcagagc ggaactatga aatattcgac aaggaattat tggcagtagt agcatctttc 3720 aaagaatggc gacagtatct cgaggggaat ccaaaccgcc tcaatgttat tgtgtatagt 3780 gatcataaga atttacaatc cttgatgaca accaaagaac tgacacggcg gcaagcaagg 3840 tgggccgaaa ttcttggcag ctttgatttt gaaattagat ttcgaccagg tagacaatcc 3900 accaagccag atgcgttatc tcggagaccg gatttagtcc cagctgaagg aacaaagctc 3960 acatttggac agctactaaa acccgaaaat ttgcctgctg atgcatttat tgatgaatta 4020 gagattattg ataaatggtt tgaagaagaa gaattgctgg aacaagatga agagaacaac 4080 agtgaagaag aagataacaa tcaaatgctt atgaccgaca gagaaatatt acaaattata 4140 aagatgaaga gcaagtccga cccgaaaatc aatgagatga ttaggctttg tgctgaaatg 4200 ccaatgtcga agcatatcaa ggactatcaa actattgatg gcattttata tttcaaaaac 4260 aaggctgtta ttccgtctga cactgatttg aaattacaca tccttcgttc tagacacgat 
4320 agcagattag caggtcatcc ggggagaatg cgaacgttag cactggtaaa gcgcgcatat 4380 cactggccgt caatgaaagc attcatcaac aaatacgtgg acggatgtgg atcatgtcaa 4440 cgcgtaaaat ctaggaccga aaagccattt ggaagtctcc aacctttacc tatcccggaa 4500 ggaccatggc tcgacatttg ctacgaccta atcacggact tacccaaatc tggaggttat 4560 gacagtatcc taactgttgt tgacagactc acaaagatgg tgcactttgt tgcatgtagg 4620 aagagcatga aatccgaaga gcttgctgat ttaatgatca aggaagtttg gaggttgcac 4680 ggcactccac gaacggtaac atctgatcga ggaaatatct ttatatcacg tatcactaaa 4740 gatttccaca gaagactagg catcaagacc caatcctcaa cggcgtatca tccccaaaca 4800 gatggccaat cagaaattac gaacaaagcg gttgaattat atatcaggca cttcaccgca 4860 tacaagcaag atgactggct gacgttatta ccatttgctg aattttcgta taacaacagt 4920 gaacacttgg cgataggcgt gtccccgttc aaagctaatt atggattcaa tgtcaacttc 4980 accgatgttc cgtccagtga acaatgttta ccgctggtgg aacaaagaat tgatcaaatc 5040 aaaaatgttc aacgtgaatt aaaagatgca atgggcctaa ctcaagaatt aatgaagatg 5100 caacatgata gcaaagttcg agatacgcct aattggaaga aaggagacaa ggtctggctc 5160 aacaacaagc acctatcaac aacacggccc acagctaagt ttgcccatag atgggtagga 5220 ccatttgtta tttcagcgcg tgtgtctaaa aatgcttaca agattgtgtt acctaaatct 5280 atgagtaagg tacaccctgt ttttcacgtc aacctattaa gaaaattcga agaaagcaag 5340 ataccaggac atcacaaaat gccgccacca ccaatagtta tgaataatga agatgaattt 5400 gaaattaatg aaatacttga taagagaaaa agaggaagaa aattagagta tttaattaac 5460 tggaagggat atggtgaaga acatgattca tgggaaccag aagaaggtct caagaatgca 5520 aaacaattaa tgaatgattt taatagaaag tatccagatg ctgaagttag atacaaaagg 5580 acgaggagaa agaagtgagg gtatggcttt ttcccactgg gttttttaat gccacccggg 5640 gatagatgct ggcctgcaag aggaggtaga gacataaaag ggggagtgg 5689 // ID Copia-3_LBS-I repbase; DNA; FNG; 4289 BP. XX AC ABFE01000651; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_LBS_; KW Copia-3_LBS-LTR; Copia-3_LBS-I. 
XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-4289 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000651; Positions 140189 144477. XX CC Positions [1829-2110] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 368..1825 FT /product="Copia-3_LBS-I_1p" FT /translation="MIHISGATTAQEMWDQLTMVKESKGTLGVLATHRALY FT RMSADEGIDLVEYISKLRKLQEELHLMENMVSDEDFVMILITSLPESWDNY FT TSSYLGASGNKLTLKSHELIAILIEEYHRRQGRNEETSGGLSMQARHQQKG FT GRNTSSQGKKKTDVECYNCHKKGHMSNECWAKGGGREGQGPKGRRGPNRGD FT RSHQTQDSMNNSLGDCAYMVNQNIHNFSKYDWVLDSATTSHICTIRDAFTD FT YEPLNSEVDGIGGSAMTQGRGTVAVNFRIDGQSIKHTLQNVLHVPKAPNCL FT LSIPRLAIGGGRVEFNGNGCQLYDKTNRVIGKGKLTNMLYLLDACAELPNR FT ESAQYTSIRTHSWDQWHRLYGHISVNAIKNLKSNEMVDGLKIDESTIPSLT FT CEACIQAKQAYKPFPQEAKNRSEIPGEHIMSDVWGPAQTESIGQWKYYVSF FT TDDGARLTAVLFLKNKGQAFDRIKEYVAIIEWKFGKPPRFM" FT CDS 2747..4288 FT /product="Copia-3_LBS-I_2p" FT /translation="MQTEEKEHWKTAVEAELNVLEKMGTWKMEDLPVEREA FT IGCCWVFDKKRDEHRNIVKYKAQLVAQGFSQKPGTDFSHNGMFAPVMRFET FT LCTMLALAAINKWDMHQLNVKSAYLNGKLTEEIYMKQPPGFSDSSRRVCRL FT QRALYSLKQAGHAWNHEFNGAMADLGFTQLRTDYCCYIRREGEKFAILLVW FT VDDILTLTNNPAESDRVETELKSKYEIKALGQPSLLLGMKVTHDTTNHTIS FT LSQTHYINKLLKKFNLKNLNPVTMPLDPNANLNQDDEPSENDSTQGTGIYA FT TMIGSLMYTALGMRPDIGYATNRLAQFTSQPQPKHWTAVKRVFRYLKGTKN FT YMLTYGGQDSNADVNIYCDADWASHADRKSVSGYVVTIAGGAVAWSSKKQN FT TVALSTAEAEYTSATHTAKQVLWHRSLFNELKIPQPKTSTIFTDNQAAISI FT GHNPEFHARTKHIDIALHFLHDHVEKGNLDLIYISTHNNLADLFTKGLPRI FT VHQDLTYEIGVIPDQGGV" XX SQ Sequence 4289 BP; 1443 A; 885 C; 934 G; 1027 T; 0 other; ctctcaacag gttatgggcc ctggcctata gaacttatac acatattcgc atacacacgc 60 atatgcacaa actttatctc gtacgcttat ctacattacg taacctacta caatgtcaga 120 agaatataca acgggatctt accagatgga aatgctgaag gccacaaatt ggatgccatg 180 gaagcgacgg atgatggccg tgctttgtga 
ccttggactg gagaagtata ttgcaaagga 240 tgcaaaacct ccaggatcag ccgatccctg aagcccgaca acagaagaat tagcagcaac 300 gaagaaatgg gccgaaggtg acgccaaggc gcagacaaga atagaactcg ctattagcga 360 tgcagaaatg atccacatca gtggcgcaac aacagcgcag gaaatgtggg atcaactcac 420 catggtcaag gaatccaagg gcacgctagg ggtgcttgca acacatcgcg cactatacag 480 aatgagtgcg gatgagggaa ttgatttggt tgaatatata tcaaagctaa gaaagcttca 540 ggaagaattg cacctaatgg aaaacatggt atcagatgaa gatttcgtca tgatcttgat 600 cacatcactc cctgaaagtt gggataatta tacttcatcg tacctgggtg catctggaaa 660 taaactgacc ctgaaatcac atgagcttat tgctattttg atcgaagaat accaccgtcg 720 tcaaggacga aatgaggaga catctggagg attgtctatg caagctaggc atcagcaaaa 780 gggcggaaga aatacaagct cacagggaaa gaagaaaaca gatgtggagt gttacaactg 840 ccacaagaaa ggccacatgt caaatgaatg ttgggccaaa ggtggaggcc gtgagggaca 900 aggtcctaag ggccgtcgag gaccaaatcg aggtgacaga tctcatcaaa cacaagattc 960 catgaacaat tcactaggcg attgtgccta catggtcaat caaaatatcc acaatttttc 1020 gaaatatgat tgggttcttg attcagccac gacgtcacac atttgcacaa tccgagatgc 1080 atttactgat tatgagccgt taaactctga ggttgatggc attggtggat cagcaatgac 1140 acagggacga ggaacagtgg cagtcaactt tagaattgat ggccaatcaa tcaaacacac 1200 tttgcagaat gtgttgcatg tcccgaaagc accaaattgc ttactctcaa ttcctcgctt 1260 ggcaattgga ggaggaaggg tagaatttaa tggcaatgga tgccaactat atgacaaaac 1320 taatagggtc attggaaagg gaaagttgac gaacatgtta tatttgcttg atgcatgcgc 1380 ggaattgcca aatcgggaaa gcgcccaata cacatcaatt cggacacact cttgggatca 1440 atggcatcga ctttatggcc acatctctgt caatgcaatt aaaaacctga agtcaaatga 1500 aatggtggat gggctcaaga ttgatgagtc tacaatccca tcattaactt gtgaagcttg 1560 tattcaagca aaacaagcat acaaaccttt tccacaagaa gcaaaaaatc gatctgagat 1620 acctggtgaa cacatcatga gtgatgtatg gggaccagcg caaacagagt caatcggaca 1680 gtggaaatat tatgtttcat ttacagatga tggcgcacgt ttgaccgccg tactattcct 1740 aaaaaacaag ggacaagcat ttgaccgcat taaggaatat gttgccatta ttgaatggaa 1800 atttggaaaa cccccaagat ttatgtgatt tgataatggg aaggaactgg ttaatgagaa 1860 acttcgaaat tgggctgcag aaaaaggaat tattattgaa acatctgccc 
catattcacc 1920 ttctcaaaat ggagttgcag aaaggtttaa tcgcacactt ttggaacttg cacgggcaat 1980 gttaattgcg aaaaaccttc cagtattttt atgggacgaa gcagtcacgc acgcggcata 2040 tttacgaaac cgtgcgccca catgcacttt aaacggcaaa acaccgtatg aagcttggac 2100 aggtaataaa cctgatattt cgcatctcag ggagttcgga tgcgacattt gggtgctaga 2160 cgagaccaaa ggtcgttcta agcttgcacc aaagtcaaac aaatttattt tcgtaggatt 2220 tcacgatgga ttgaagtcag ttcgttatta caatgctaaa actcgaaaaa ttaaagtctc 2280 atgcaatgtg gcatttaacg agaatgaaga gccacgagaa ctcgaaatta aagctaattt 2340 accgggtttg cgcgtcgagg gggagccaga ggaaaaatca aacactcaaa cacctcaaga 2400 aatcgaaaat ccattaaatc ctatccctac aacagttcct gcacaagaaa tgcgaagtcc 2460 tattgaaccc ataaattccc cagctcttag acctcgtaga actcacactg aacacgatta 2520 tcgactccta aataatccac aggcacgacc aacagatcac acacttgaaa atgcgacacc 2580 acctgatatt acatgacaca cagagtcatc agctgcgaaa ataatttcta aaaatcgcac 2640 tcaccttgcc ttcgaagatt ttatcagatt ttcaaccgaa aaatcattcc aagtgctctc 2700 aggaatttct gataaagatg gattgccaga aacacttgaa gatgcgatgc agactgagga 2760 gaaagagcat tggaagactg ctgtggaagc agaattaaat gttttagaaa aaatgggcac 2820 atggaaaatg gaagacttac cagtagaaag ggaagcaatc ggatgctgtt gggtatttga 2880 caaaaaacga gacgaacaca gaaacatcgt aaaatacaag gcacaacttg tagcgcaagg 2940 tttctcgcag aaaccaggaa cagattttag ccataatggc atgtttgcac ctgttatgcg 3000 cttcgagacg ctatgtacga tgcttgcact cgccgcaata aataaatggg atatgcatca 3060 gctcaacgtg aaaagtgcat atttaaatgg aaaactcaca gaggaaattt atatgaagca 3120 acccccaggt ttttcagaca gcagcagacg tgtttgtcga ttgcaacgtg cactttacag 3180 tttaaagcag gctggacatg cgtggaatca tgaatttaat ggtgctatgg cagacctggg 3240 attcacgcag ttaagaacgg attattgttg ctatattcgg agagaggggg agaaatttgc 3300 gatattattg gtatgggtgg atgacatact cacactcacc aataaccctg cggaaagtga 3360 tcgtgtcgaa acagaactca aatcaaaata tgaaatcaaa gcattgggcc aaccttcatt 3420 actgttaggc atgaaggtta ctcatgacac tacaaaccat actatctcac tttcgcaaac 3480 tcattacatc aataaactcc tcaagaaatt taatctcaaa aacctgaatc cagttaccat 3540 gccacttgat ccgaacgcca atttaaatca ggatgacgag ccatcagaaa atgacagcac 
3600 acaaggtaca gggatctatg caacaatgat tggatctcta atgtacactg cattgggaat 3660 gcgcccagat attgggtatg caacaaacag acttgcacaa tttacaagcc aaccccaacc 3720 gaagcattgg acagccgtta aaagagtatt ccgttatttg aaaggcacga aaaattatat 3780 gctcacgtat ggaggacaag actctaatgc agacgtgaat atttattgcg acgctgattg 3840 ggcttcacat gctgacagaa aatcagtcag tggatatgta gtaaccatag ctggtggagc 3900 tgtggcttgg agctcgaaaa agcaaaatac tgttgctctt tcaactgctg aggcagaata 3960 tacttctgct actcatactg ccaaacaagt actttggcat aggtctttat tcaatgaact 4020 caaaattccg cagccaaaaa catccacaat tttcacggat aaccaggctg caatatctat 4080 tggtcataat ccagagtttc atgctcggac aaagcatatt gacatcgcgc ttcatttctt 4140 acatgatcat gtcgaaaaag gaaacctaga tttgatctac atatcaactc acaacaatct 4200 cgcagattta ttcacgaagg gtctcccgcg gattgtacac caagacttaa cttatgagat 4260 aggtgtaata cctgaccaag ggggagtgt 4289 // ID Copia-5_MLP-LTR repbase; DNA; FNG; 702 BP. XX AC AECX01002168; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_MLP_; KW Copia-5_MLP-I; Copia-5_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-702 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002168; Positions 2022 1321. 
XX SQ Sequence 702 BP; 187 A; 141 C; 107 G; 267 T; 0 other; tgttactttc gactgcttta gtccaaactg atattttctc gttacgacca aacccttact 60 ttatttcaat attaactgac ctaacatgga cttaactgcg actaaatgat gatatgaaat 120 gacgcaccca gagtggcttg tcaatggtga tggatattaa aatggactat ggtattaggg 180 attgtagcgt tgaatgcgtg tttgagaatt tgtatgactg tataagactt atgtataaat 240 agtagtcttc taagcattgt atttttgttt tctcactcac tcatctcgct atatcttctt 300 tcgtccatcg acttccacac tattcaggtg tgtttcaagc ttgttgttaa ctaagtccaa 360 caaagctata tattaacacc agcgctttta ggtctgcatc atgttatttt tctagtttct 420 attttctttt tctattttga atgaaatata actcactcat ctcgctatat cttctttcgt 480 ccattttatt ctttttcttc catcgacttc cacactattc aggtgtgttt caagcttgtt 540 gttaactaag tccaacaaag ctatatatta acaccagcgc ttttaggtgt gtttcaagct 600 tgttgttaac taagtccaac aaagctatat attaacacca gcgcttttag gtttttgcca 660 ctttctgccc ctcacagatt taggagcata caaaacatat ca 702 // ID MuDR-2_FO repbase; DNA; FNG; 2980 BP. XX AC . XX DT 03-JUL-2010 (Rel. 15.11, Created) DT 03-JUL-2010 (Rel. 15.11, Last updated, Version 3) XX DE Mutator-like transposon, partial consensus. XX KW MuDR; DNA transposon; Transposable Element; MuDR-2_FO. XX OS Fusarium oxysporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; OC mitosporic Hypocreales; Fusarium; OC Fusarium oxysporum species complex. XX RN [1] RP 1-2980 RA Jurka J.; RT "DNA transposons from Fusarium oxysporum."; RL Repbase Reports 10(11), 1847-1847 (2010). 
XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(128..1408,1450..2829) FT /product="MuDR-2_FO_1p" FT /translation="MIATQPTKMDQSHRLAFPDDALPPEGIYKSRESLLAA FT INSWAKPRGYAFTTGKSLKTSSGRIKVIFACDRNKLPPSTSITRQRRTCSR FT RTGCKFSVLAKQSLDGNTWVLSHRPDKEYAHHNHPPSPDASAHPAHRQLNE FT RDAAIISSLTTAGAAPRDIRTYLHNSSDTLATQQDIYNRIAAMRRDLREGQ FT SSIQALVDQLQGEGFWCRVRLDSDNRLTAIFFAHPDSVAYLQSNPDVLLLD FT CTYKTNKHAMPLLDMVGVDSCQRSFCIAFAFLSGESEEDYSWALHHLKSLY FT HHELPSVVLTDRCLAAINATATWFPSSKALLCLWHVNKAILQYCQPDFVLK FT SADTSGRVEDEKWDEFYGFWHTIVDSPTEEIFRERLAKFELKYAKRYPRAV FT GYIKLYWLEPHKERIIKAWVDKYLHFGNVATSRLPSRAEGIHSLIKSYIKT FT STFDLFDSWQAMRHAVTNQLKELNHIRASQQIRTPLDISGGMFEAVQGWVS FT HQALRKVQEQRKLALASPQAPCSQSFTSSHGLPCSHTLKRLEEEQRSLLLD FT HFHPHWNLKRGADRPRPILEPRRAPQQGIIRVRGQPATSTKREPSGFEISQ FT PGKKAPSTCSRCHAVGHARSSRACPLRFQDLLIQEAPASKSIPQPDVTPVA FT VPSPADPIEAISVSQGPSLVEGMPVLECIAEATARSPATAHTPLAQSSHTV FT QVSLTAPQHSXVPDCPVGTVPGPSGEECLNHEPSDSRNLPSQPLLRHDSPE FT AIYGRYVAARSAWYAAQPAGSIKTNQQYRRAMGLPLRYDKQSYEWCLDYKQ FT MSKRCITSTGSREWTKEEMMAYLDWSKAEDERIEAQVVKEMGKNPLANKRR FT GMTEIWRKAEMDNIEQETLYAAEDKAELCIVVKR" XX SQ Sequence 2980 BP; 794 A; 792 C; 741 G; 652 T; 1 other; ggcggccaat acatgggcac ctaggtgccc aaataaaata gcgcactagg tgcccaaatt 60 atgtcagcaa tcgtcgcccc gattggccag cgcgcgtttg acatgaccga cgcgacctgc 120 gctctgtatg atagcaacac agccaacgaa aatggaccaa agccaccggc ttgccttccc 180 ggacgatgcc ctcccgcctg aaggcatata caaatctcgt gagtctctat tggcagccat 240 taattcttgg gcgaaacctc gaggttatgc gttcacgacg gggaaatcat taaagacgtc 300 aagtggccga atcaaagtca tatttgcctg tgatcgaaat aaactcccac cgagcacatc 360 tattacacgc caacgccgca cttgtagccg aagaaccggc tgtaagtttt cagtgctggc 420 caagcaatca ttggatggaa atacgtgggt cctaagccat cgaccagaca aggagtatgc 480 ccaccacaat catccaccga gtccggacgc gtctgcacac ccggcgcatc gccaacttaa 540 cgaaagagat gcagcaatta tttcgagcct gacaactgct ggcgctgctc ctcgagatat 600 caggacgtac ctccataaca gttcagatac ccttgcgacg cagcaagaca tctataatcg 660 gatcgcggca atgagaagag atctgcgtga gggccagagc agtatccaag cgttagtcga 720 ccaactacag 
ggagagggct tctggtgccg ggttcgattg gactcggata atcggctgac 780 agccatattc tttgcccacc ctgactctgt tgcatatctg caaagcaacc cggacgtact 840 gctattggat tgtacctata agacgaacaa gcacgccatg ccgctgcttg acatggtcgg 900 agttgactct tgtcagcgtt ccttctgcat cgcatttgca tttctgtccg gtgaatcaga 960 agaagactat tcatgggcac ttcatcatct caaatcactc taccatcatg agctcccttc 1020 tgtagtctta actgaccgat gtctcgctgc aataaatgca actgcaacct ggttcccttc 1080 ctccaaggcc ctgctttgcc tctggcatgt taataaggca atcctgcagt attgtcaacc 1140 ggactttgtg ctgaaaagtg ccgacacaag cggccgggta gaggacgaaa aatgggacga 1200 attctatggt ttttggcaca ctattgtgga ttcgccaaca gaagagatat tcagagagcg 1260 cctagccaag ttcgagctca agtacgccaa aagatacccc cgggccgttg ggtacatcaa 1320 attgtactgg ctcgagccgc acaaagagag aatcattaaa gcatgggtgg ataaatacct 1380 gcattttggt aatgtggcca cctcgaggta gctatccagg gtcagatata gaggtcggaa 1440 agctaataat tgccatctag ggccgaagga attcactccc taatcaaatc atacatcaag 1500 acctccacat tcgacctctt cgattcttgg caggccatgc gccatgcggt caccaaccaa 1560 ctgaaggaac tgaatcacat acgagcctct caacagatac ggacaccttt ggatatctct 1620 ggaggaatgt tcgaggctgt ccagggttgg gtttcccacc aggccctgcg caaagtgcaa 1680 gaacaacgga agttagcttt ggcatctccc caagctccct gcagtcagtc tttcacctcc 1740 tcacatgggc taccgtgttc tcacaccttg aagagactcg aggaggaaca gcgcagcctt 1800 ttgcttgatc attttcatcc tcactggaac ttaaagcgag gcgcggatcg gccccggccc 1860 atactagagc cacgtcgagc cccgcagcag ggcattatca gggtgagagg tcaacctgcg 1920 acgagcacca aacgagaacc ctctggattt gagatctctc agcctggcaa gaaagcgccc 1980 agcacatgta gcagatgtca tgctgttggt cacgcgaggt cctcacgagc gtgtcctctg 2040 aggttccagg atttgctcat tcaggaagcg ccggcctcga agtcaattcc gcagccggac 2100 gtaaccccgg tcgcagtccc aagtccggct gacccaattg aggcgatttc agtgagccaa 2160 gggccctcat tagttgaagg catgccagtg ctagaatgta ttgcggaagc tactgcacgc 2220 tctcccgcta ctgcacacac cccacttgcc cagtcttcac atactgtaca agtttcgctt 2280 acagctcctc aacattctgn cgtcccagac tgtcccgttg gaactgtgcc aggcccttct 2340 ggcgaagaat gtctgaatca cgaaccatcc gattcgcgca accttccttc gcaaccacta 2400 ctgaggcatg actcgccaga 
agccatatat ggaagatatg tcgctgcaag aagcgcgtgg 2460 tatgcggcgc agccggctgg cagcatcaaa acaaatcagc aatatcgcag ggcgatgggg 2520 cttccactca ggtacgacaa gcagagctac gagtggtgtc tggactacaa acagatgtcc 2580 aagcgctgta tcacgtcaac aggatcaaga gaatggacga aagaagagat gatggcgtat 2640 ttggattgga gtaaggcgga ggatgaacgc atcgaagccc aagttgttaa ggagatggga 2700 aagaatcctc tagcaaacaa aagaagaggc atgacagaaa tatggcgaaa agcagaaatg 2760 gataatatag agcaagaaac cttatatgca gcggaggata aggcagaact ttgtattgtt 2820 gtcaaacggt aactagttgg cttgtgcttt gaaaatatta gaaaacagca ttacaacctt 2880 agacccgtcg ctggccaatc ggggcgacga ttgctggcat aatttgggca cctggtgcgc 2940 tattttattt gggcacctag gtgcccatgt attggccgcc 2980 // ID TCN5-I repbase; DNA; FNG; 4888 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 30-MAR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR retrotransposon - internal consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW reverse transcriptase; TCN5-I; internal portion. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-4888 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-4888 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-4888 RA Gentles A. and Jurka J.; RT "C. neoformans LTR retrotransposon TCN5."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX CC 591 bp LTR deposited as TCN5-LTR. 
XX FH Key Location/Qualifiers FT CDS 30..3387 FT /product="ORF1p_TCN4" FT /translation="ALLSHQKIFFDTITPIEDQSKPRLRQDTPFPEMTTPP FT TDKESLQRKIKQIEDERKREREEHKREMAELQAQMRGLLKREAEKRDLPPH FT NPATSSLPAKSESPHPPIMYDSLPQPKYSFPVLSEHADAAAIHQHISKLRS FT IFTLMAAGYRYEPTAFEARKLAHAAQSLTGIRHLQFLEGDGLQCRTFDDWA FT KAFKSAMLPLGWVSETEKKIYSLQPQLLRLDKIPLAIMDFKQWFALLKDSD FT SPMSEDVATHWLRNHMTPGLLAELERDFGGENQLRQASLAELLKHMEICAT FT RLLRYQSAFAPIAPTRTPIAAVSPEASSPIDIAQWLNPRLPLPKGGAGRRA FT RAHLASEQRCFLCRQPGHKSPDCPKRKEPTAIAAVSTFDHEEAEFEQEEGE FT MFAEVLATHLAPELSVPPILLECRIGTNGSPFLALFDTGATVTLVDPSLIT FT THQLQTYPSEQRRVVSLAGGARGPALQRRVGVEVCIQNQVFALHGYVMPLH FT ARYKVILGLDFIRSHGLLSGASRLGNLAPDLLGPVVASVTTADGYXDLRLA FT ILHEYHDIFPDNIGEVANYPPICDANSKVRHHINLTPGAIPFKSASYRSPH FT MWRQQLIEEIQKHREAGRLRPSSSPWAAPAFLVKKENGKFRFICDYRGLNK FT VTTPDSTPVPNVDDILHRAACGKIFAKIDLSDAFFQTLMHEPDIEKTAITT FT ELGLFEWVVMPQGACNSPATQQRRLNEALRGLLGDSCEAYVDDIIVWAADA FT EDLDKRLRAVLAALRKSGLVCSPTKSEFFRHKVKFLGHVISANHIGPDPAK FT LRTIASWPLPQSVKELRSFLGLLQYLRKFIPSLATHTRTLTALLPPTPAAE FT KAWEKQQRALRKGQSPKDVLSWVWAWSSEATAAFEILKAKVAEISGLRPLD FT YAAALSGECPIYLFTDASNHGTGAWLGQGPDPDHAFPVAYDSRSLSAAERN FT YPTHEKELLAIVRALKLWRPLLLDVPIQVQTDHFTLKWFLQQRDLSERQKR FT WLGXLSRFDLRIDHISGVNNFIADALSRLGGVDDEQDGMETAEVSVAVLGL FT LGQDTSTITKVAQGYAQDQVMGAWLQEEDRAPGVTLENVENGQGRSTSGVA FT MGRQAMCARYQX" XX SQ Sequence 4888 BP; 1118 A; 1481 C; 1133 G; 1139 T; 17 other; ctttttttaa gcgaatctca actgcatgag ctctgctgag ccaccagaaa attttttttg 60 acaccattac acctattgaa gatcaatcca aaccacgact tcggcaagat acaccattcc 120 ccgaaatgac aacaccccct acagacaagg aatccctaca gaggaagatc aaacagatag 180 aggacgagag aaagcgggag agagaggagc acaagcgcga aatggcggag ctacaagcac 240 aaatgcgagg tcttctcaaa cgcgaggccg agaagcgcga cctccctccg cacaatcccg 300 ccacttccag cttacctgcc aaatcagagt cccctcatcc tcctattatg tatgattctc 360 tcccccaacc caaatattcc tttccagtcc tctctgaaca tgccgacgca gcggctattc 420 accaacacat ctcgaagctt cgcagcatct ttaccctcat ggccgctggt taccgctatg 480 aacccaccgc ttttgaggcc cgcaagctcg cccacgccgc ccagtctctt actggtatcc 540 
gccatcttca atttctcgaa ggtgatggcc tccaatgtcg cacctttgat gactgggcaa 600 aagctttcaa gtctgcaatg cttccgctcg gctgggtatc cgagaccgaa aagaaaatct 660 attctcttca accccagctc ctccgacttg acaaaatccc attggccatc atggatttca 720 aacagtggtt cgccctcctg aaggattccg acagccccat gtccgaagac gttgctacac 780 actggctgcg aaaccacatg acacctggtc ttttggcgga gcttgaacgg gactttggtg 840 gtgagaatca gcttcgtcag gccagtttag ctgaacttct caagcatatg gaaatctgtg 900 ccaccagact ccttcgctac cagtccgctt tcgcccccat cgcgcccacc cgcacgccga 960 tcgctgctgt ctcgccngaa gcgtcttcgc ccatcgacat tgctcaatgg ctcaacccac 1020 gacttccgct ccccaagggt ggtgctggtc gccgcgctcg agctcacctt gccagtgaac 1080 agcgntgctt cctctgccgc caacccggtc acaagtctcc cgactgccca aagcgcaagg 1140 aacccacggc aatcgctgct gtaagcacct tcgaccatga ggaagcggaa tttgagcagg 1200 aggaggggga gatgtttgcc gaggtattag ccacacacct tgcgccggag ctgtcagtcc 1260 cacccatcct tctcgaatgt cgcataggca ctaatgggtc acccttcctc gccctcttcg 1320 acacaggagc aacagtcact ctagttgatc cgtccctcat caccacccac caactccaaa 1380 cttatccatc agagcaacgg cgggtggtat cgttagcagg aggagcacgg ggtccagctt 1440 tacaacgtcg tgtaggagta gaggtatgca tccagaatca agtcttcgca ctacatggtt 1500 atgtcatgcc cctacatgcc cgctataaag tcatcctagg cctggatttc atccgctctc 1560 atgggctcct cagcggtgct tctcggctag gcaatctcgc tcctgacctc cttggccccg 1620 tggttgcntc agtgacgacc gctgatgggt atgangacct ccgcctagca atcctgcatg 1680 agtatcacga cattttccca gacaatattg gtgaagttgc aaattaccca ccgatctgtg 1740 atgccaactc aaaggtccgc caccatatca acctcacccc cggtgcgata cccttcaaat 1800 ccgcgagtta tcgatcccct cacatgtggc gacagcagtt aattgaagag atccagaaac 1860 atcgggaggc tggccgattg cgaccctcca gttcgccgtg ggcagcaccg gcgttcttgg 1920 ttaagaagga gaatggaaaa tttcggttta tctgtgatta tcggggcctt aacaaagtca 1980 ctactccaga ctccacgcca gttcccaatg tcgacgacat cctccatcga gctgcttgtg 2040 gtaaaatctt tgccaaaatt gacctgtctg atgccttctt ccaaacactt atgcatgaac 2100 cagacatcga aaaaacagcc attactaccg aactcggcct atttgaatgg gtagttatgc 2160 cacaaggtgc ttgcaactca cctgctactc aacaacgccg actaaatgag gccctccgtg 2220 ggttgctggg 
ggattcttgt gaggcttatg ttgatgatat cattgtctgg gctgctgatg 2280 ccgaggatct tgacaaacgg ttgcgagcag tgctggctgc cttacgaaaa tcggggcttg 2340 tgtgctctcc caccaagtct gaattttttc gccacaaagt caaatttctt ggtcacgtga 2400 tatccgccaa ccacatcggc ccagaccccg ccaaactccg caccatcgcc tcctggccgc 2460 tccctcagtc cgtcaaagag ctgcgatcct ttctaggcct ccttcaatac ctcagaaaat 2520 tcataccgag cctagctacc cacacacgaa ccctcaccgc cctcctcccg ccgactccag 2580 cngccgagaa agcctgggag aagcaacagc gtgcactccg gaagggccag tcccccaagg 2640 atgtcctgtc atgggtctgg gcatggtcta gtgaagcgac cgccgcgttt gagattctga 2700 aggcgaaagt ggcggagatt tctggcctcc ggccattgga ctatgcagct gcgttatccg 2760 gcgagtgccc catctacctt ttcactgatg caagcaatca cggcactggc gcctggcttg 2820 gtcaaggccc tgaccccgat catgcatttc cagttgctta cgactctcgc agtttatcag 2880 ccgccgaacg caactatcca acacatgaga aagaactctt ggccatcgtc cgtgccctca 2940 aactctggcg gcctctcctt ctggatgtcc ccattcaagt tcagaccgac cacttcactc 3000 tcaaatggtt cctccaacag cgcgacctat ccgaacgcca aaagcgctgg ctgggcntcc 3060 tatcccggtt cgacttgcgt attgaccaca tctcaggtgt caataacttc atcgcagatg 3120 ccctctctcg gcttggcggc gtcgacgatg agcaggatgg tatggagaca gctgaagtca 3180 gtgtagcagt cctcggtctc ctcggccaag acacatccac catcaccaag gtcgcgcaag 3240 gctacgctca agatcaggtc atgggagcat ggttgcaaga ggaggatagg gcacctggag 3300 taacgttgga gaatgttgag aatggtcaag gacggagtac atcaggtgtt gcgatgggaa 3360 ggcaggctat gtgtgcccga taccagtgag ctccgagaag gtttcatccn ccagtgtcat 3420 gatgatgtgg gccatttcgg cgtagcgaag acattggaga tggtgcgccg cagctattac 3480 tgggagggca tgaggcagga tgttgtggac tatgtaagct cgtgtgcacc ctgtcagact 3540 tcnaagagca ctactgtcaa gcccgcaggc cgccttcact ctctaccagt tccacaggcg 3600 aagtttctcg acattggcat tgacttcgtg ggccctctcc ccacgtcgca aggttttgac 3660 caactcattg tcatcaccga ccgcctcacn ggttatgtgg tcctcattcc taccaccatc 3720 acagcaaatg cgcgagaggt tgcccgactt ttttacgacc actggctttc caaattcggc 3780 tgccctcgct ccatcgtctc cgaccgcggn agcatcttcc agtctggggt ttggcgtcat 3840 gtcaccaagc atatcgacac caagtctatg ctttcaacnt catatcaccc ccagacngat 3900 ggtatctctg aacggtccaa 
caaatcagtn atcgagtcgc tncgaaccct cactgacatc 3960 cgtgggcgta catgggctga ccatttgcag cgcattgctt tngctctcaa caaccatgtc 4020 cgngcatcaa ccaagcacac tcctgccgag ctggtttttg gcaagcgtct ctctcatgtt 4080 cctcccttgg tggctgacac tccagcgaaa attgccgaag ccttgcagtt catggtacct 4140 tcggaagacg aatgggaagc agcagctcgg cgtatgcggc tcgacgaagg ggaagcacga 4200 gacaatctcc tcatggccaa acatcgacaa gcagttcaag ccaacaaaca tcgccggcct 4260 gatcccattt acaaggtcgg tgaccaagtg ctggtgaaca cccgtcatgt caggcatgaa 4320 tacaagtcct catcaggctt caaaaattct gctaagttca tcccacgtta tgatggtcct 4380 tacactattg tcaaagcttt tccaaatcaa tccctctatg aactcgatgt ccctgctcgc 4440 gccaatgaca ccgctcgacg ccatgtctcc ctcctcaagc cttacgtggt ctctgaacgt 4500 tatcattcct cgtcacctcc cctccaaccc caagtcccag ctccacctcg cccccaggtc 4560 ctcgaggtcc ttgacatccg caccaggaag aacactgagg aggtccgagt ccggttagtt 4620 ggtgagccag cccttggcaa gtggctaccc cgagttgaag tcgagttatg ggatggtttc 4680 aaggccgcgt gggaggtgta tgatggccca gatgagttgt tgttggagta attttgatgt 4740 gttttttctt tcttctttca aacatggacc ccacttatta ccctttagcc tagttgtttc 4800 accaatggac cccacttatt tttttaccta gtttttcntg ctctcgcctt ttcttccaag 4860 accatggcct ttttcaggaa ggggaggg 4888 // ID Copia-2_BDJ-LTR repbase; DNA; FNG; 534 BP. XX AC AATT01000295; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Batrachochytrium dendrobatidis DE genome: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_BDJ_; KW Copia-2_BDJ-I; Copia-2_BDJ-LTR. XX OS Batrachochytrium dendrobatidis OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; Chytridiales; OC Chytridiales incertae sedis; Batrachochytrium. XX RN [1] RP 1-534 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Batrachochytrium dendrobatidis RT genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; AATT01000295; Positions 5154 4621. 
XX SQ Sequence 534 BP; 180 A; 112 C; 66 G; 176 T; 0 other; tgttggaata tccagacgga tactcttaag cggtaaccta tggcaaagct aattctataa 60 aaaagcacca acttaaaatc ccaaacaatg gatggtatgg atgctgcata aactaacaag 120 cattgtataa caaataagct atacgcaagg tgaaacttat ttaaggcaat gactatcgtt 180 ccctttatac aagaactggg aagcttagga ccctcaacgg cgaagcttag agtgaagatc 240 ttatatctcc actctctccc ttaactactt cttattttac tataaaagct agaatgctta 300 cttttatcaa taaacaactc ttcaaactat ttcatcttcc taactacttt attacatctt 360 ttcactactc ttgaaaagac taagtttgct gatcttgatt taaagtgcaa cttcctctta 420 cacttctgat ttgttctctt ctgataacta ctaacagtta cagattatat tctactttat 480 actttatact attccatcat taaaacaaat cacgaagctc taaatattct gaca 534 // ID Merlin-3_Roryzae repbase; DNA; FNG; 2210 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Merlin; DNA transposon; Transposable Element; Merlin-3_Roryzae. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-2210 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). 
XX DR [1] (Consensus) XX SQ Sequence 2210 BP; 667 A; 425 C; 469 G; 649 T; 0 other; ggatctatga aagatttacc gtccatctac tatatatata tatacaaact ccgcatgcgg 60 agtttggtat gcttaatatg gtaattgggg tcacagaaca cgattacatc atttttaagc 120 ggagtttatt aagccaaaat tggccaattg cgagatctac taagaacagt cattttttct 180 tccttgttca ttacttcact caagagggat gacaagtaga aggtcattaa aaaagtccgg 240 accgagatac gacttctgca ggtcgagaaa aacagctcca aaaaccaaaa attgtatgac 300 aaaaaggctt gcgaaacgag aatcggcaaa tcaacatttg tggttattgg gttatgctaa 360 tggcttctca ttctaaaggc gttggttcga ttcccggtag acgctcagca aacatagtct 420 actttttgca ttatttttgg gtctaagtaa cttattattt tttttggaaa tatcaaaagt 480 tttcggaaaa attttatcgg ttctgtatgg taaaacgaat tcccgttgcg gagatgctgc 540 ctatctatcg gctgatttcg gacagataac aaaacatgta aaaaaaatat aaaaaggcct 600 gttatttctg tttttttttc tttccttcat ctttttatca gttttatcaa tctccaatgt 660 cttctaacgc tgctaacaac atgccaactc ttgcatacat cgctgaaaga actatgacgg 720 ctgaatgtac attgaagtaa gtttattttt taagtgggag ggaaaataac taaaggcaac 780 tggtaaaagt tacttgaagg agaacggtgc ctttttttca tcgcgcaagt gcaaggaagg 840 ctccgaaatg cgcttgaacc agtccagcag ggatggattc ttttggcgat gcggctcaac 900 cacctgtgcg tgcggtcgta gccgtatttc ttacctggat ggatccttct ttcaaggtcg 960 caagtcaatt atccatcaaa cactgttagt agtgtatctc tttctgctgc aagttccaaa 1020 cggaacaata tctaccatga ctggacttag cctgccaact gtccgcagca ttgtcaaaga 1080 tatctatcaa gtaatggagg cagatcttcg aattgaagat gttcaagttg gtacgtgatg 1140 ctttgttgat atcagagtat gtaattaatt agtcatactt aggtggggtt ggcagtgatg 1200 gacagccaat tgttgttgaa atagatgaga gcaaatttgg taaaagaaag tacaataagg 1260 gaaagagggt agacggtgtt tgggttgttg gaggtgtaga gcgcacgccc gaacgtaagg 1320 tgttcttgct gacggtacct aaccgcaacc aaaatacctt gaagctcata atcgatactt 1380 ttgctaagga tggtaacatt tgaaatatgt atatacttga tgatcagtgc acctaaccct 1440 tgtttacagg ctcacttgtc atggtagact gctggaaagg ccataagggg attgacagcg 1500 atccaagccg aaatttagtt gttcaaacgg tcaatcactc caaaacattt cgcgatccaa 1560 agacaggtgc ctgcactaac acaatagaag gtacatactt ctgtagatca tagaattctt 1620 aaacaatgct 
aacaacctgc tattaggtac ttggaatggt atcaaacgag gagtgaccag 1680 ccgtcatcgc acagcatcga tgatgccatg gaaattagtg gaattcatct ggcgaagaaa 1740 acatgctggc aatcactgga aggcaatgct tgcttgcttt tcgcaagtgt cgttcaccag 1800 agcaggcatt gccgaccaag gtccattggt atcgttgctc acaaccgtct ttgaagatgc 1860 cactggtgat cagggcgatg aagagccatt tatttttatc tcagatgacg attccgatag 1920 cgaatctgat gaaaatgata cctcagcacc tagtacgccc gtaaaaaaac tccgaagtgc 1980 ttaaaataaa taaagaaatt tttcattgat aactataaaa atcagggcct atagcgcctg 2040 ttttctttcc taattatttc ccgcatatta ccatttttgg aaattaatgt tacatcatct 2100 aaaaatagac gacccaggct ttcgaatgat gtatagcata cgtctgtacg ctcaatttta 2160 gcgccgctat aggcttccaa agagatggac ggtaaatctt tcatttagcc 2210 // ID Copia-54_MLP-LTR repbase; DNA; FNG; 394 BP. XX AC AECX01000274; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-54_MLP_; KW Copia-54_MLP-I; Copia-54_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-394 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000274; Positions 51523 51130. XX SQ Sequence 394 BP; 93 A; 91 C; 45 G; 165 T; 0 other; tgagataatt aacaagtcta gaacttcatt tctattttgt catagttaat atctacatga 60 tgtcaacttc atctatttca tttcttctct gacataatta tgtcttctcc ttctctatct 120 attcatgtac cttggttcta cgagatcact tttatatata agactgaatg tttcctcaac 180 tattgtttcc tttttctcat tcattcttat ctgtgtttct cgtaccacta ggtactttgc 240 ttttccacat cactcgtgtt cttttagcac ttactaaaga actctcgtgt tcttccaaag 300 gtactttgct tttccacatc actcgtgttc ttttagcact tactaaagaa ctctcgtgtt 360 cttccaaagc tcgtactctt atagaaagtg ccca 394 // ID PiggyBac-1_Nha repbase; DNA; FNG; 2235 BP. XX AC . 
XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW piggyBac; DNA transposon; Transposable Element; PiggyBac-1_Nha. XX OS Nectria haematococca OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Nectriaceae; OC Nectria; Nectria haematococca complex. XX RN [1] RP 1-2235 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 2235 BP; 632 A; 590 C; 510 G; 503 T; 0 other; ctcctgagtg cgcatagccc tcccaagatg tcacgagacc cacccgaccc cggtaacccc 60 ggaagtacat gaggggtacg aaaagtttaa tcccagaagg aattcctgct aattccccgc 120 caaaacacca tcaattcatt ttgataccat caatgacttc ccctccaatt ctagaagaga 180 tacaggtcat caacccctat gatgacaata gctttatcaa ggaagagcac agtcattgct 240 tgtggcagga tgaccccacc caggccgtta atcttccggc tgaaaatgac cggggcacca 300 acttcaagcc tttccacgta gaaatacgca aatttcgcat cagcccactg cctccaacgc 360 ctctacaact attccagctt ttcctaccta tatcactcgt tgaaaaatgg gtatcttaca 420 ctaattcatg gattacatgg ctcaaagaaa acggcgtcgt tgacagctgg aataacccga 480 tggggaagac ctcatatctg cacaaatggg aaggaacaac ggtctcagag gtgttgacga 540 tgattggtgt gctaatctat atggatgtgc ataaggaaaa gacgatacga agctattgga 600 atccccccaa gcccggtgtt caacgccctg cgcactcatt tatcaagttc atctcataca 660 acaagtttca gctcatccac aggcgtcttc gtccctttga ccacaccaaa tatgacgaaa 720 ccgcacctat tccgaaggta ttccaatgcg ttgaagagtg gtctgaccac atacaagccg 780 tatctatgca gatattcctg ccggggtctc accttgccgt cgatgagtgc atgatccgct 840 atacggacag aagcgatgac ataacggtta tcaaaagcaa gcctgacccc gtgggcttca 900 agatatgggt cattgcccaa tacggcttct ttattcggtg gatttggcac gtcaaggaga 960 aaccacacgg tgccgttggc gttgaatttc aactcagaag tcatcatcac aaggtcggcc 1020 aagcaagcga agaaaagtca cagtcgaagc ccctgatata gaagacgagt ctttctcacc 1080 gaattcgact caggctatcg tcgttgccct tacaaatatg ctgccaaaag cgaaatatca 1140 
cgtctttgtg gataacctct tctcgtcgtc tcccctcttc cgcaacctcc gtaaccacgg 1200 cctcggggcc acaggcacgg ctcgcacgaa tagcggcatc catgaggaac tcgtacaaga 1260 taagaacaac gacggcaaag tcaagaaaat gtacgaattc aatacggtca aggcgatacc 1320 aacccctgac aaccaggtac tgacttcaga cacttttcac tgagaaatcc ctactaattc 1380 gtcttcgtgc cgtaacccta ggtcaaccaa atcgcttgga aagacaacaa gcttgtgtta 1440 tttttgacca ccgtgttcac tggtgccgac gatgagcgtg tcatacggca gaggaagaag 1500 ccgtcgtcac ggaagtcaga agccaagcca atacgccgct ttttcggtga cgaagctgtc 1560 aagatgatca acatcccgat agtggctgct gagtataacg atcaaatgaa tcacgttgac 1620 cgtggcgacc aactgaggtc atattataag tacgatcatc ccctacgccg tggtgcgtgg 1680 caggcacttg catggacttt cctcctcgac gtggccctcg ttaacagcta tattgctcaa 1740 cttcacggac cgcagccaaa ttggaagaga tataccaatc agagggaatg gcgagagtgc 1800 atctataacg aattattcaa cgcctacggt catgacagtc aggcaaggca gcgataccga 1860 ccaggagacg agcgtgatct acaaaaccct gaattacaga gggatcacat caatcgggag 1920 atcaatcatg tcgaccgtca tgtcaaatca gactgcctag cctgtcaggg ctgtcggcag 1980 gggcagctga gagcaaaaag tgaggcttgg agccctctta cggagaccag cggtaataag 2040 aggaggcaca atgtacggag tcagacctca catggctgcc gtatttgcaa ggttgcgatt 2100 tgtaataacc aacattgttg ggacttctat caccacctaa tttaggagag gaattaagta 2160 ccctttattt ggcgccttct gattggacca gacccctcat cgtgacatct tgggagggct 2220 atgcgcactc aggag 2235 // ID Gypsy-5_LENY-I repbase; DNA; FNG; 6442 BP. XX AC AAPO01000113; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Lodderomyces elongisporus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_LENY_; KW Gypsy-5_LENY-LTR; Gypsy-5_LENY-I. XX OS Lodderomyces elongisporus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Lodderomyces. XX RN [1] RP 1-6442 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lodderomyces elongisporus RT genome."; RL Direct Submission to RU (12-FEB-2011). 
XX DR Genome; AAPO01000113; Positions 42129 48570. XX CC Positions [5408-5911] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 136..1752 FT /product="Gypsy-5_LENY-I_1p" FT /translation="MSEGEDESPGLKRTEAPQINQVEINEILDLKSEMMMM FT QEQMKQMTAMFKAAMIQTVPQLSTPVPARNDEANKAQISDDNNSITLPTDV FT QMKVHPEEHTFSPGRNFPTSRPHSLASPLTHQSRQSGTPSAYGSDRSVASY FT DGILYHGEVVHSGFWVEGHEEACQIINEARQRYPHGEDVRVIPYIRVPDIA FT VEKAFMAYYDKLLAKQGRTQNDNMALGQYDIAKRVPLKFSDEEGFMFWLKS FT VLWHKHTYGVPDKLIINELRNGANLTKSESLKDMIEMSCKSEGPFEQPVVD FT YLPILGNGGTPPETNQVTDVMTAIHDALKKKHNPRVKMTIIGIAVKKYLRI FT TNTSHLAQETWISIFKTLFERMGSVMDQIIMYMTPEVKIEKPGRTGPIVTF FT EDASIIKIGLGLLKNKPNDKIIQGVKSYDLTFERIWDAVNATFQTRMHLIS FT DDEEKDAGTTLVSKGDKSQKTTKCNFCGKYRHTVDSCHLLVTALDEKLVKK FT VDGKYYLANGEPLVMDFYKCPLLKHLSLKSLSSNARHRTPPTK" FT CDS 1724..6304 FT /product="Gypsy-5_LENY-I_2p" FT /translation="MRGIARPQQSKSKAAQVMEVEVIPKSTVTSNSRENTH FT HERLPTEVMQSNSHIMSDSIDEVIKPGPIMIDDVPVYLVHAANAQSAKKVV FT KDWTAKKAHLPSSRKDAHNKLKADKVKQQLNAAVRQRQEMERVERESAPSP FT VTEQSMQESRRELDTESPPIGEQSPPYDSHLPFLELQFPDENIDVDPDYEY FT EEPSSAGIRFEPSALTMDRELENLAMEDTVDPDEELKLFLSEGKGIEQTST FT PQENGNMMSHPQEEVPKVKSDVVVNEPLNEEQANATADDGVDSPVADDSIK FT LFSVEDILRENQLDELAKLSATSDHQNQLADDSTDPQRKSQRRVRTSKLTI FT LDKQAAFKDAMKHNFFVFMDLSKYLGMHDDARAMMRESTKAMLVDRDVLQK FT ALKDGIAKFKKKPVVTEVNLTTTSTGYHQPGVTQELLCFPASIHGRRIEMK FT YDTAAQLSLINPQTLKGLPIKPFPLSQPVFVQGVNNDTKAITQGVFLDVTV FT HFISLPAILYLHEQIPVGQVLLGLPFQDAHKLSIGFTDDGDKRELRFRANG FT NIHKFPIRYDGDSNYHIDSFPTVQVSQTVVIPPLSEMLRPAFTGSRASEDD FT IRYFVDQCAEVSDVFYIKGGDPGRLKPEIHPPVRINLRDPNVHWRMKSIPL FT GSKRTAAVEILQEMIRQGQLVYSDATYRNPWFLISKKDGRHRLLIDLRELN FT KNVELEGGHPLSVDDLTTEISGCWFISTIDVQNAYFQIPLDAATSDVTSFN FT SPLGLLKYAVLPQGYINSVSEFSSILQKILSPVAKDVMCFIDDIAIVGPKV FT DELTDSLVREHLDKIVEVFRLLTNAGLKINPAKLKIAVPECDFLGYHISPA FT GKTLIRGQVDALLNYPLPNTVKQLESLLGLVNYYRQLIVGHAELTAPLYNL FT VNQARKEPKHQIHWDPTTKRFFHQIITVLTNQPILQPLNFKDLITVHTDAS FT TDSWGGVLQNTNAAGESKLVLCYSGKFHGSEKNYTIYEKELFSIYKTFDAI FT 
HPLLFGFTGVIHLYCDNKALVLVMNKPLDNSHFVNRVYKWLNFIRTFNYQI FT HHIDGLKNIIADALSRCHTSTPQTDHYEVREAIAEFRAKLDPKVLAEVNVV FT TIEAEPNNIYKKIPLQALRKYLEARTIPVSHNAPADARRFINRALEFYLHD FT GILYKLGRTGSYTRRVVYENNQVEHIFNLAHRDRGHPGVETTFNLINTGFY FT IPNLYRRLADYIKRCVYCQKDQPATRQRDPLYLNYPAGMFHTIVCDCVKLR FT DATVVVARDEFLGWPEAQVLPKVTAEAVADFIYSNFIARVGTFSVLKTDNG FT REFHNEVLQQLLSNYGISPSYSVPYHPQGNGMIEASHKRLIRFIKLVPENK FT NLSKVLNAALWVDRTTVRKRTGFTPQYLVFGFEGNNLLQSLMQTPSDIQQY FT TEDELFKFRVGQFYFRKQQEDSAKETQRRDRDHQKEVFDKRYDTNVPIKKG FT DLVLVYDAATNKMGLNWSGPFVVRKVLSRIYHLSNLQGIPMKRQYTREMLK FT PFVAAPLSTTK" XX SQ Sequence 6442 BP; 1921 A; 1395 C; 1423 G; 1703 T; 0 other; tcttcaacaa aagattctaa gacctctaat cctactccaa ggagtatcaa ccctggtcca 60 agtaaaccag accttgctgg tgtcgactca tcttcaattt ccgggcagtc tatttctacc 120 tcgggaagag ccgatatgag cgaaggtgaa gatgaatctc ctggtttgaa aaggacagaa 180 gctcctcaaa taaatcaggt tgagattaac gagatcttag acttgaaatc tgagatgatg 240 atgatgcaag agcagatgaa acaaatgact gctatgttta aagccgctat gattcaaaca 300 gtcccgcaac ttagcactcc tgtaccagct aggaacgatg aggctaacaa ggcccagatt 360 tcggatgata acaacagcat aaccctgcca actgacgtgc aaatgaaagt tcatccggaa 420 gaacacacat tttcaccagg acgtaacttc cctacctcga ggccacactc actcgcttca 480 cccttgactc accaatcccg tcaaagtggt actcctagtg cttatggatc tgatagatct 540 gttgcatctt acgatggtat cctttaccat ggagaagtag tccatagcgg attctgggtt 600 gaaggccatg aagaagcttg tcaaatcata aacgaggcaa gacagaggta ccctcatggg 660 gaagacgtca gggtaattcc atacattcga gtcccagaca tagctgtcga aaaggcattc 720 atggcgtact acgacaaact ccttgccaaa caagggagaa ctcaaaatga caatatggca 780 ctcggtcagt atgacattgc aaaacgtgtc cctttgaagt tctccgatga ggaaggtttc 840 atgttctggt taaagtcggt tttgtggcac aaacacacct acggtgttcc tgataagctc 900 atcattaatg agttaaggaa cggagctaat ttgacgaagt cagagtcgtt gaaggacatg 960 attgagatgt catgtaagag tgagggcccc tttgaacaac cggtggttga ttacttacct 1020 atcttgggca atgggggcac tcctccagaa acgaaccaag ttactgatgt tatgactgct 1080 atccatgatg ctttgaaaaa gaaacataat cctagagtga aaatgaccat tattggtatt 1140 gctgtcaaaa agtatttgag gatcaccaat 
acatctcacc tagcacagga aacatggatc 1200 tcaatcttta agaccctttt cgaaaggatg ggtagtgtca tggatcaaat aatcatgtat 1260 atgaccccgg aagttaaaat cgaaaagccg ggaaggactg gaccaatagt cacgtttgaa 1320 gatgcgtcga tcataaagat tggccttgga ctcctcaaga ataaacctaa cgataaaatc 1380 attcaaggtg tcaaatcata cgatttgaca tttgagagga tctgggatgc tgttaatgca 1440 acatttcaaa cgaggatgca tcttatctca gatgacgaag aaaaagatgc cggtactaca 1500 ctggtgtcta aaggtgacaa atcacaaaag actactaaat gtaatttttg tggaaagtac 1560 agacacactg ttgatagttg ccacttattg gtgactgctt tggatgaaaa gttggtaaag 1620 aaagtcgatg ggaagtatta cttagcaaac ggtgaaccgt tggtcatgga tttctataag 1680 tgtccccttt taaaacacct gagcttgaaa tctctaagtt ctaatgcgag gcatcgcacg 1740 cccccaacaa agtaagtcca aggcagcaca agttatggag gtagaggtta ttcctaagag 1800 tacagtcact agtaactcta gagaaaatac ccaccacgaa cgattaccga cagaagtgat 1860 gcaatctaat tctcatataa tgtctgattc aatagatgaa gtgattaaac ccggacccat 1920 catgattgat gacgtcccgg tttacttggt gcacgcagcc aatgctcaat ctgctaagaa 1980 ggtagtgaag gattggactg cgaagaaagc ccatctccct agctcgcgaa aagatgctca 2040 caataagttg aaggctgata aggtgaaaca gcaactcaat gcagcagtta gacaacggca 2100 ggagatggag agagtggagc gtgaaagcgc acctagccct gtaacagagc aatctatgca 2160 ggagtcccga cgggaattag acactgagtc cccgccgatc ggagagcagt ctccacccta 2220 tgactcacat ttaccattcc tggaactcca gtttccagat gaaaacatag acgtagaccc 2280 agactacgag tatgaagaac ctagctcggc tggtattcgg ttcgaaccct ccgcactgac 2340 aatggatagg gaattagaaa acctagccat ggaggacacc gtagatcctg atgaagagtt 2400 aaagttattc ctatccgaag gaaaaggaat tgaacagact tctacgccac aggagaatgg 2460 aaacatgatg agtcaccctc aagaagaagt gcctaaggtg aaatctgacg ttgttgttaa 2520 tgaaccactt aatgaggaac aggcaaatgc taccgctgat gatggagttg actccccagt 2580 ggctgacgat tcaattaaac tctttagtgt cgaagacatc ttgagagaga atcagctaga 2640 tgagcttgcc aaattaagtg caacgtcaga tcaccaaaat caacttgctg acgattccac 2700 tgatccacag cggaaaagtc agcgccgagt acgtactagt aaattgacca tactagacaa 2760 acaggctgca tttaaagacg ccatgaaaca taacttcttc gtgttcatgg atctttccaa 2820 gtatttggga atgcatgatg atgctcgagc catgatgaga 
gagagtacta aagctatgtt 2880 ggttgacaga gatgtcttac agaaggcact taaggatggt attgccaagt tcaagaagaa 2940 accagttgtt actgaagtaa atctgacgac cacttcgaca ggctaccacc aaccaggtgt 3000 cactcaagaa ctactctgtt tccccgcatc catacatggc cgtaggatag aaatgaaata 3060 cgacaccgcg gcccagttga gtttgatcaa cccgcaaaca ctaaaagggt taccaattaa 3120 gccttttccc ttgagccagc cagtatttgt tcaaggtgtt aataatgaca ccaaagcaat 3180 tactcaggga gtgtttttag atgttactgt ccatttcatc tccttacctg caatcctcta 3240 tttgcacgag caaattcctg ttggtcaggt actactcggg ttgccattcc aagatgcgca 3300 caaactctcc attgggttta ctgatgatgg tgataaacga gagttacgtt tccgcgcgaa 3360 tggcaacatt cataaattcc ctattcgtta cgatggggat tctaactacc acatcgattc 3420 atttccaact gtccaagtga gtcaaaccgt cgtcattcct cccctttcag agatgcttcg 3480 acctgcattt accgggtcta gggcgagcga agatgatatt cggtatttcg tggaccaatg 3540 tgctgaggtg tctgatgtct tctacattaa aggtggagac cctggcagac ttaagcctga 3600 aatacaccct cccgttcgta tcaacttgcg agatccgaat gttcactggc ggatgaaatc 3660 gatcccatta ggatctaaac gtaccgctgc agtagagatt ctccaagaaa tgatacgaca 3720 ggggcaattg gtctatagtg atgcgactta tcggaaccca tggtttctca taagtaagaa 3780 ggatggaagg catcgattgt tgattgacct tcgggaactc aacaaaaatg tggagttgga 3840 agggggtcac ccactctccg tggatgatct cactacagag attagtggtt gttggtttat 3900 aagtactatt gatgtgcaga atgcgtattt tcaaatacca ttggatgctg ctactagcga 3960 tgttaccagt tttaacagcc cgttaggttt gttgaagtat gctgtccttc ctcaaggata 4020 tatcaattca gtaagtgagt ttagttctat cttacaaaag attttgagcc cagttgcgaa 4080 ggatgtgatg tgttttattg acgacattgc catcgtgggt cccaaagtag atgaacttac 4140 cgactcactg gttcgagaac acctagataa gattgtcgaa gtgttccgtc ttctaaccaa 4200 tgctggcttg aagattaacc ctgcgaagtt aaagatcgct gtacctgaat gtgattttct 4260 tggataccat atttcacctg cagggaaaac gttaattagg ggtcaagtgg atgcattatt 4320 gaattaccca ctccctaaca cggtcaaaca gctagaaagt ttgcttggct tggtcaatta 4380 ttaccgacag ttgattgtag ggcatgctga gttgactgct cccctctaca acttagtcaa 4440 tcaagcaagg aaggaaccca agcatcagat tcattgggat cctacaacta agcggttttt 4500 ccaccaaatc atcacagtac taaccaacca accgattctc caacccctta 
attttaagga 4560 cctcattact gtccacactg atgcttcgac agactcttgg ggtggtgtgt tgcaaaacac 4620 caatgcggct ggagaatcga aactagtcct gtgctattct ggaaagttcc atggttctga 4680 aaagaactat actatttacg aaaaggaact tttcagtatt tataagacgt ttgatgcaat 4740 ccatccactc ttgtttggat ttactggggt cattcatttg tactgtgaca acaaagctct 4800 tgtccttgtg atgaataaac cactcgataa ttcacacttt gtaaatcggg tttacaaatg 4860 gttgaacttc atcaggacct tcaactatca gatccatcac attgatggac tgaaaaacat 4920 cattgccgat gctcttagcc gatgccacac gtctactccg cagactgatc attatgaggt 4980 cagagaagct attgccgagt ttagggcgaa gttggatcct aaagtactag cagaagttaa 5040 tgttgttacc attgaagctg agcctaataa catttataag aagatcccac ttcaagcgct 5100 tagaaagtac cttgaggcac gtaccattcc cgtttcgcac aacgcacctg ctgacgccag 5160 acgtttcatc aaccgcgcac tagagttcta tttacatgat ggaatccttt acaagttagg 5220 tcggaccggt tcatacactc gacgcgtcgt ttatgagaac aatcaggttg aacacatatt 5280 caatcttgct catagagatc gtggtcaccc aggagttgag actacgttca atctaataaa 5340 tactggattc tacattccga acctctatcg cagacttgcc gactatatca aacggtgtgt 5400 ttattgtcaa aaggaccaac ctgcaactag gcagcgggat ccattgtact tgaactaccc 5460 ggctggtatg tttcatacaa tcgtctgtga ctgcgttaag ctccgtgacg ctactgttgt 5520 cgttgcacgt gacgaatttc tgggatggcc cgaagctcag gttttaccaa aggttactgc 5580 agaggcagta gccgacttta tttattcaaa ctttattgct cgtgtaggaa cctttagtgt 5640 tttgaaaacc gacaatggtc gtgaatttca caatgaggtc ctccagcaat tgctctcaaa 5700 ttatggcatt tctccatctt attctgtacc gtaccatccg caggggaatg ggatgatcga 5760 agcgtcgcac aaacgactta tcagattcat caaactggtt ccagagaata agaatttgag 5820 caaagttttg aatgctgcat tgtgggtgga tcgtactacg gttcgtaaac gtacgggatt 5880 cacaccacaa tacttggttt ttggatttga aggtaataat ctgctccaat cattgatgca 5940 aacaccatct gatatacaac aatacacgga ggacgaacta ttcaagttcc gcgttggtca 6000 gttttatttc cgtaaacaac aggaagactc tgctaaggaa actcaacgcc gcgatcggga 6060 tcatcagaag gaagtttttg acaaacgtta tgacactaat gttccgatca agaaaggaga 6120 cttagtctta gtttacgatg ctgctaccaa caagatgggc ttgaattggt cgggtccttt 6180 cgttgtccgg aaggtgctgt cgaggattta ccatcttagt aacctccaag gcattccaat 
6240 gaagcgtcag tatactaggg agatgttaaa accgtttgtt gccgctccgc ttagtactac 6300 aaaatgattt cctactttgc agggggggag gccacaacgg ttatcatcat tattgtacag 6360 ttactttacc ccaaatcaat tttttccgaa acacaataaa aaatggccgt atcgctcatt 6420 cagactctga agggggggag ga 6442 // ID Copia-51_MLP-LTR repbase; DNA; FNG; 453 BP. XX AC AECX01002953; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-51_MLP_; KW Copia-51_MLP-I; Copia-51_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-453 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01002953; Positions 4949 5401. XX SQ Sequence 453 BP; 111 A; 76 C; 59 G; 207 T; 0 other; tggttattgg ttattatgaa ttacggttat gtttcattat ttgattttcc tatttctgaa 60 ttatgatttg atttgttttc ttttttgatc tctgttacta tgctacggta tttgataata 120 caccattctg gttacattat atttcataat acacatgtca ttgtctttta actctctctt 180 ctattagtat ttgtggctac atatacctgt gaggttttgc accaagattc gaacaccatc 240 tgttttgtct cttttacttt tataagctca attaattgac gtgcttataa ttccgtgcat 300 tactcacagg taagcttact ttcatgtttt ctttatctct ttttctattt tatcatattg 360 tatcaaagaa ataaagaaca cgatctgttt tgtctctttt acttttataa gctcaatgaa 420 ttgacgtgct tataattccg tgcattactc aca 453 // ID Copia-1_LENY-I repbase; DNA; FNG; 3405 BP. XX AC AAPO01000012; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Lodderomyces elongisporus genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_LENY_; KW Copia-1_LENY-LTR; Copia-1_LENY-I. 
XX OS Lodderomyces elongisporus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Lodderomyces. XX RN [1] RP 1-3405 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lodderomyces elongisporus RT genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; AAPO01000012; Positions 70767 67363. XX CC Positions [1497-1994] - Integrase core CC 'AAAAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 762..2519 FT /product="Copia-1_LENY-I_1p" FT /translation="MVRLIKKTLESYPKRQSTSDTDKKKVIYSKQSDNWGT FT NNNSGSTEFSKANTVMYANRVATNRDLFIIDTGASVTMVKSQDLLHDYIPF FT DEPRPVKAANDEVMEVLGQGTLICAVDDKQLHIPNVGYVPKLMQNLISAPA FT LLNDDDTISMTSTALIHSRLGKIGILRDVCESIMSVVHSVPTVALAQSTSP FT LLLLHEAWGHPNNNAFKYMLRQQSYSYNENDLQFHCQSCLQGKQFQNIPNI FT ATDDKALTPLEVIHADVIGPLPTSITGDMFALVVSDEYSRYMCCIPMQRKS FT DVSELLIILIHNMETHHQCQVRQLRSDNGGEFQNNQLKLFCETKGIQHTFI FT TPHMHHQNGLAERANRSVQDKARVLMLQLKLPEVYWPYAYKTAAFLLNATP FT KQVLNNISPFELWFGKPVDYNLLKPFGIQCYAHIPEHYRSRLDPNGRLCIL FT LGFPDNRKAYTLLDIEAGEVVDSSHVKFLDKYYDGEFPTVQQQSMSLPHGI FT TAIPRTTTNIRDNSQEEHDRLSSTQSGEDTALHDNDTSSLDTPELSSLRNF FT LDINDPSDANLSSEESDHEDMQITPLDNLSWFHNDHSDF" XX SQ Sequence 3405 BP; 1172 A; 650 C; 569 G; 1014 T; 0 other; ggttatgagc ccactcgctt gcttcgctaa cgtaacggta tcagagatga gtaaattgaa 60 agattcatct aattataaac tatggaatat tagttttctt tcatatgctg gtctcgcacc 120 gcccgaattc aaagcctttg tgcttgatga cacaactcat gcaggagagg ttcctggaag 180 aaattccact tttgagagta tggttgatca tttagtggtc attacagtta gtaatgacat 240 tttggaagaa gttagagata agtctctcat tgggaaagct gcttatctat atattaaaga 300 ccaatatgga gtattgcaaa tacatgaaca gattcaatat attaccgatc tttatgataa 360 aattttgagc aacagtgtta agtttgagaa aaagatgctg tattttaaca acatctggaa 420 gttcctcaca aaaaaaaaca aatgaatcgg aacgtgcttg tttaaaacat ttcatctgga 480 ttcaccaaac ttccaccagc tttgagcaac actttaaagc tcaaaatcca atcatctcgg 540 agaaagagct cacaaagaca atcagaattt ttcctgatct agatgtcaag actgttgctc 600 
ttgccacgca cggatctaaa ggaagagctc tgtcagaggg ccagcagggg acactgcata 660 tctggggatg gcaatgcttt aattgttttg gactcggtca taactataag caatgttcat 720 cacccagacg acttagtgct attcctgatg ttgaagaaca catggtaaga ctcatcaaga 780 aaacactaga gtcttaccca aagagacaat caacttctga cacggacaag aagaaagtga 840 tatactctaa acagtctgat aattggggta caaataataa ttctggttct acagaatttt 900 caaaagccaa cacagtcatg tatgccaata gagttgctac aaacagagat ttgtttatta 960 tagatacggg agcatcagtc actatggtta agagtcaaga tcttttacat gactatattc 1020 cctttgatga acctaggcct gtcaaagctg ctaatgatga agtgatggag gttcttggtc 1080 aaggaacctt aatttgtgct gttgatgaca aacagctcca tataccaaat gttggttatg 1140 taccaaaact tatgcaaaac ttgatttcag ccccagcgtt acttaatgat gatgacacca 1200 tatctatgac gagtactgca ctcattcata gtagactcgg aaaaatcggt atacttagag 1260 atgtatgtga atcaattatg tctgttgttc actctgtacc aacggttgca ctcgctcaat 1320 caacgtctcc actactatta ctacatgaag catggggtca tcctaacaac aatgccttta 1380 aatatatgtt gcgtcaacaa agttattctt acaatgagaa tgatttacag tttcactgtc 1440 aatcttgttt acaagggaaa caatttcaga atataccaaa tattgctact gatgacaaag 1500 ccttgactcc tcttgaagtt attcatgctg atgttattgg tcctttacct acctcaatta 1560 ctggagatat gtttgcactt gtagtctcgg atgaatattc ccgatacatg tgctgcatac 1620 ccatgcaacg caagtctgat gtatcagaac ttttgattat tctaattcac aatatggaaa 1680 cacatcacca atgtcaagta cgtcaattac gatctgataa tggaggagaa tttcaaaaca 1740 atcaacttaa attattctgt gaaactaaag gtatacaaca tacttttatt acaccacata 1800 tgcatcacca gaatggattg gctgagagag ccaacagaag tgtccaagat aaagcacgag 1860 tcttgatgct acaactgaaa cttcctgaag tatactggcc ttatgcttat aagactgctg 1920 cttttctcct aaatgctaca ccaaaacaag tacttaacaa tatctcacca tttgaattgt 1980 ggtttggaaa gcctgtcgac tataaccttc ttaaaccctt tggaattcaa tgttatgctc 2040 atattcccga acattaccga agtagacttg atcccaatgg tcgtttatgc atcctcttag 2100 gctttcctga taatagaaaa gcttacactt tattggacat tgaagcaggg gaagtagttg 2160 atagcagtca tgtcaagttt cttgataaat attatgatgg tgaatttcca actgtacaac 2220 aacaatccat gagtttacca catggtatta cagctatacc tcgaacgact actaatattc 2280 gcgacaattc 
tcaagaagaa cacgatagat tgagttctac tcaatcaggg gaagatactg 2340 cactgcatga taatgatacc tcctctcttg acactccaga attgtcctca ttacgcaact 2400 ttctagatat taatgatccg agtgatgcta atttatcttc agaagaaagt gaccatgaag 2460 atatgcaaat tacacctctt gataatctct cctggtttca caatgatcac agtgattttt 2520 gattttacac ccatcacact ggtttttggc tccacatcca ccgattctga tgtttctgac 2580 ccactagatc ttgaaattga tatgacactt ttaacatcgc actgcgaggg gaagaaaaat 2640 gaatgcaata cttttggaag catctcagtt gttagcaaga ttaatgatga tagtcgcata 2700 aagacaaata tacagtccta ctcaaaacaa gaccaaccta tcaatatcga gacaaaatta 2760 ttttacataa atactaattt ggcacagtgc aatgtgaaaa gtgataatta tctaccatac 2820 acttttataa tgggtgaaac atatattatc tccaaaccag aaagtaatgt tcccaaaacc 2880 tataaacaag tattacaatc atctgaatct ataaagatag tgttagcaat ctcagcaaca 2940 tgtaacttca taattcatca gatagaacta attttgaaat tgttatatgc aagcatcacg 3000 gtaagaggtg ttatagcata tattacaaat aaattaataa gacattttca acaatcaaat 3060 gaactacacc ttagatatgc gaatcatgtg ctacgctatc taatactgac taaacattta 3120 gatttgctct acaagaaaag aaagaatcat ttttgaaaaa gcacaaacat tatgaagaat 3180 aaacactatg aataaccaac atatattata tcatacacaa acgaaaccta tcaatacaag 3240 atatcattct ttttaaagca catactggac ataacaatta ttgttaaaat cattcaaaca 3300 caattaaaga tttctggtac attgacaaga acaatacaac aaagaattca atttaataaa 3360 tctacttgaa ctatctgagt atacgtccct ttgattaagg ggaag 3405 // ID Gypsy-97_MLP-LTR repbase; DNA; FNG; 162 BP. XX AC AECX01000463; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-97_MLP_; KW Gypsy-97_MLP-I; Gypsy-97_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-162 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000463; Positions 53385 53546. XX SQ Sequence 162 BP; 37 A; 38 C; 34 G; 53 T; 0 other; tgttatgatc cgtagatgtc acgagatcag tacttacatg tcatgggtcg cgagaatatc 60 tccgtcccac agttgtacag cgttgagctt tttccctcat actgcaatct tagttatcat 120 cgcaagagtt ctgttgcact ctgtatagtg ctgtccgtaa ca 162 // ID OPHIO3 repbase; DNA; FNG; 1849 BP. XX AC DQ649005; XX DT 06-AUG-2006 (Rel. 11.08, Created) DT 06-AUG-2006 (Rel. 11.08, Last updated, Version 1) XX DE Ophiostoma novo-ulmi subsp. novo-ulmi transposon OPHIO3. XX KW Mariner/Tc1; DNA transposon; Transposable Element; OPHIO3. XX OS Ophiostoma novo-ulmi OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Ophiostomatales; OC Ophiostomataceae; Ophiostoma. XX RN [1] RP 1-1849 RA Bouvet G.F., Jacobi V. and Bernier L.; RT "Direct Submission."; RL Direct Submission to EMBL/GenBank/DDBJ (21-JUN-2006). XX DR EMBL/GenBank/DDBJ; DQ649005; Positions 1 1849. 
XX FH Key Location/Qualifiers FT CDS 977..1222 FT /product="OPHIO3_1p" FT /translation="SSIAIEVIQPMLSCLHTYKIKSGFLSCLLTALTSFNR FT SISLTSLLSRPVTVSVFLPCPLKKRIVTRISATSLPVIEKPVPLL" XX SQ Sequence 1849 BP; 577 A; 398 C; 368 G; 500 T; 6 other; agtcggtggg gccccttttg ggccaccccc tgttttgggc caccattaat aataaggcgg 60 tatacaaatt aggttagata aatggcaata taaattcgaa taaaaatcgt aaaaatcaaa 120 tttttgaaaa ttatgaaaat atatgatttt ctacatccaa cgatgcctct atataccgaa 180 acggatgtng ccaatgccct ggctgacgta gatcgaggtg tattttttng agtagtagct 240 aaaagctata acgtaccnag aagtactctt taagcccgtt aaaatggacg ttaaggctat 300 aaagtagggg tagagtcctt ataaatcctc tctntacgcc ttgaagctgt gcttgtagag 360 tgggtgacgg tataagcatc gctaggcatg gcttttacat acggctaaat gagagacgtn 420 gtagagcatt ttttgactgt taatggcagg ccgtaaaagc tnggaaagac ttggtaagaa 480 gcgtttttaa aacgtaatcc agctattaag acccttaagc gtgtattagt cgaatctcaa 540 cgtattaata gcgcttcaaa accaaagatt gaagcatttt tcgagctttt ataaaacgaa 600 gcaattacta atatcccact acaataccgg tataatataa atgagacaag ccttacagaa 660 agtctaaggg ccaatagctt aatcgtaggc cggttaaaca aacgtcgaaa ggtagtaaag 720 tcaagcggct cgtgtatcta agtaacaaca tttaaataca tctctgccgc cggctttata 780 ctctctcctt taattatttt tgctacgaat acagtctagc atcaatattt ccttttagat 840 ataaaagaat atcgtcccta aaagtttacc aacactcaga ctagctagac aaacaacgat 900 atagctaaag agtagctcga taaagtattt ctcctagaaa cgcagccaga ttcgcccgag 960 gaatagcgcc tactaatcct cgatagccat agaagttata caaccgatgc tttcatgctt 1020 acatacatat aaaataaagt ctggcttctt atcttgcctc ctcacagctc tcacgtcctt 1080 caaccgctcg atatctctta cttctctcct ctcaaggccc gttaccgtgt ctgtttttct 1140 tccctgccct ctaaagaaga gaatagtcac tagaataagc gcaacttcct tgcctgttat 1200 cgagaagccc gtacctctgc tctaagtgcc acaaatatca aaagcgggta gagagcggcc 1260 ggcttatggc ccgtcagtct tcgcaaagct ttagcgaatc catttgttat catagagaca 1320 ctaaacgacc ttttagataa tcgtcaagac cagcctgaaa tgcctatcac accgagaaaa 1380 actcgttctc aagcaatcgc tttccagacg ccgttatcta gcgttcaagt ccgtcgacat 1440 gtctctcaaa tccaatacgg agattacgaa agcttacgga gacttttagc gaagactagc 1500 
aatgccttag acgttaaaac aggtgaaatt accgtcttac agcgccaaat caaagcctta 1560 gaggcccagc tagcctctta taaaaataca aaaaaaaaag aagtcaggcc cgatcctaac 1620 actaggttcg tcacaatcga agacgtacag acggtcagaa atgctatcga attggaagaa 1680 aatgaaagga cggctcggga gatagcagac ggtgatatag ccgatttttt gtacactttt 1740 gataggttta cgggggctga agtgccgttg tagttcttaa tacatgtttt tttatttcta 1800 ttataagtgg cctagaaggg ggggtggcct aaaaggggcc ccaccgact 1849 // ID Copia-27_MLP-I repbase; DNA; FNG; 7849 BP. XX AC AECX01002580; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-27_MLP_; KW Copia-27_MLP-LTR; Copia-27_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-7849 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002580; Positions 14357 6509. XX CC Positions [5104-5628] - Integrase core CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 3976..7794 FT /product="Copia-27_MLP-I_1p" FT /translation="MDDLQSAIHLIDSITNLPSTTVELIHNVVKPLTRKDV FT IKYLRDYDTRQNNFSTEATREVHHVEANTSSAGMPYNRTSKIMCTESVCRG FT PHAPERCWSRPENFKERDDFLARRRTRGSWNPRGRQIPNRTSQSNSSTTIR FT GMQRVDTPNANSVSDSIQMLSLNVEFTSLVEDAEANSTESTSSDSIWALHD FT TGATHHMFNDSSLFVQSSLKKVDDVNKRLKLAGGDVSLAVDSIGKVQLKAG FT DGTVFELNDCLFVPELSKNLISGGTLKMKGVREVYDDSELSCFALVKDGLA FT LFNGVILKNGLMNVKINPVSFSNQLPQSSTTVSSSVIHHRLGHLSESYLKQ FT MKRNDSVIGIGEILSTPESCDTCNLSKNTKIPFNHTRPRALRFLENVHVDL FT SGIYRVNGFNNENYFILFCDDFSSYRHIFGLTSKSKEEVFESFKAYIAVSE FT RQTGLKIKQFTLDRGGEFVNSLLGSHLRDLGIELHLTSGYAPEENGVSERG FT MRTINTKARCLLIQSKLPSIFWFLACKTAVFLTNRTITRSLPDFKTPFEMW FT HFRKPTLSHLRIFGCQAFRLIRKEIQNSKYSPVSSEGVLVGFEQDNFNYQI FT CDLSDNKFHTSHHVTFNENVFPYHNSKYNQESSNDIVRSIYSDNEEEITAL FT KPLIENKMTDLNDEDNNAKSTEKEDSSSSNDETIENPVPTIATQNKFNNLI FT PRKSTRSQTKINYKGMGGFAELNNKIKSNVARFDDDDFCISSYFDCLPTCF FT NAFLNSTPDPKSFKRAMSAGDSPEWKLACDKEMSSLLKKDVWTLVDRPTHK FT SVIRGMWVFRKKINSDNSVKYKARFVAMGNTQVEGEDYGDTFAPTGKPTSL FT RLLLAMAAINGWEVHQMDAVTAFLNGILEDEIYIQQPEGYVVVGLESKVLK FT LNRSLYGLKQSPKIWQDDVTQFLIYTLCFEQCIIDPCIYFRSDEEKQLFTA FT VYIHVDDMAITGNDSSTFKSEISSKWEMEYLGLAQTIVGIKIDRPGPFIYS FT LTQTKFALTVLKRFDMLDSKPASTPLTPNLKLYQSSDEECSSFSSLGLNYR FT SAVGSLMYLSQCTRPDLAHSVGVLSQHLDRPSLQHWNAALQVFRYLKGTIH FT LGIVYSGEDNLKISGQRSFSYPVSHCDSDWAGDKSTRRSTTGYIFTLAGGA FT LSWKSRLQPTVALSSTEAEYRAITEAGQELLWLRKMMEYFGCRDSNPTVLQ FT SDNQGAIHLTKKSIFHGRTKHIEIQYHWIREVVKNGDLIVEHCPTSEMVAD FT LLTKALGKQQFI" XX SQ Sequence 7849 BP; 2307 A; 1342 C; 1558 G; 2642 T; 0 other; gtcgtccatc ttagtctgtc agccgtgtct cgtcaacacg tagctgaagt atttctgtta 60 ccttttaaaa gttatttgtc tacagtcaaa agtaaaatcc tacatggtag cgggagcttt 120 cgatccgccg tccaatctct ttgtctaaat ttatgtcatc gaacactgaa gaaaatccag 180 aaacagtcaa tccagaaaca accaattccg aatctgaaac atcctcttct tcaactatcg 240 gtcgaaccgt catcgataac ttaagaactc aacaactaac aaatccaatc atggcatcag 300 ctgatgattc tgtgtttcct ctgtttggaa acgccattca aaagtacggt ttgcaactca 360 ctaatgcact cagtaaattc aagatctcag aaaatcttga 
ggatggtaat tatcccactt 420 ggatatagat gtaaaacgtc gtgtgacaca agcgtgaaca gctgatccag agctgtcact 480 tgaatgcgaa gagatgcgag gatgaaagtg tcacgaaggg aatactccga agagcgaaag 540 catatcatgt ggatctaata atgatattaa tttcttatct gtgtcaaata ggaatgaaat 600 gaatcacaag aatagatata ttatatacaa gagttctgag ctgaggcaaa ggatcgggaa 660 aactgattct tacaagatga acaaaagaaa aactcagcat gtgattctga tcacaagaag 720 actgagtgct gagactgagg ataagtgtcg agtattttgt gatacactat tgagagtatc 780 aacatccccc cttgcttgat acttttgtag ggataggaag tgagttcgta gatataagat 840 ggagttcgtt ggtatctaca ttaccaatta agaaccgaca tcgattgagc atttctttcg 900 gagcattctt tgtgaggaaa tcagctggca ttaactttgt atcgatcata tgagtagata 960 tttgatgtga ggaaattaaa tcgcgaatga agtgatgcct aatgtctatg tgctttgatc 1020 gactgtttgt agccgattct tgagagagaa atattgctcc attgttatcg ttaaaaatac 1080 ttagcggagg tagagatatt ggagatgttg gaatgggggt ttgagtcagg tgagatatca 1140 ttctacgaaa ccataagcag tgtttagcac aatctccaag tgccatatac tctgcttctg 1200 tggtagatag agctacagtg ggttggcgac gacttttcca gctgactaga gatccgccta 1260 acgtgaatac ataaccagtt gttgatctgc gagtttcggt acaggctgca taatcggcat 1320 cagcgtaggc acgaagttcg gttgatggtg tgttgtgctt cttgaataat ataccacgtt 1380 ctaaagttcc ttgaacgtat ctgaaaagat gttttacagc ttgccaatgt ctgtatgtgt 1440 atttagatga ataacgagct aacgttgata cggcatattg aatatccggt cgagtgtgga 1500 cagcggccca gttgagacag ccaatggctt gctgataggg taggtgagct gcatctaaga 1560 tttcttcgct ggtaccagga gataattcaa cggagtttgg gagaggagta gatgtagtat 1620 tgcagtgtgt catttcgaaa cgatcaagaa cgttcttcag ataatgttct tgagataaat 1680 gaattgtacg attctttctg tctcgcgtga tcttgattcc aaggtgtaag gtaggttcag 1740 tggtccacac tagcttgtat aagttttcta gtttttctct agtttctttg atcagggatt 1800 ttgaattcga aaatactaga ccgtcgtcaa caaacatagg gatgtggatg atttcattac 1860 ctcttcgaaa agtaaagagg gagttgtcgt ctttgtaggg aataaaaccg atgcttcgaa 1920 gtttttcatc aaggtcggca ttgaattctc ggtgagcttg tttcgttccg tagatggatt 1980 gtttgagtag ccagactttc ttaggaaact ttgagttata aaaaccttga acttgtcttg 2040 tgtaaacagg cattttgttc tttccgttga ggaaagcagt acatatatcg aattgttcca 
2100 tttcccaatc ctcggaagca cagagagaga ataacatgtt catagtgtct gattgaacaa 2160 cggatgcaaa tgttttatcg tagtcaacac cccatatttg gtgattgcct agagctaccc 2220 atctagcttt gtattttatg atgtttccgt gttcgtcacg tttccttttt aaacgccaca 2280 taccacctat gacattcgct tctgaggggg catcgactag ttcaccaacg ttgtgagagg 2340 ttaatgaatt aaattcgctt tccatggcct tgatccagtt ctctcgttcg cttcctctca 2400 ttgcctcagt gtacaaagga caatcggcac ttctttttgc tgcgtagatt gtgaggtttc 2460 cgtaacgtaa aggaggtttt ctattgcggg cagggcgtag agaggtaggg ttggaaatga 2520 tttttggttg ttgtttattt gatttttgct cttgaggttt ctcagaagga ttggaagtat 2580 tgttatctga tttgtgttgg tcttctgata taggatctgt attttcattg ttggtatttg 2640 tttcattctt ctttttcgga atttcttttt ctttttcttt attgttctga atttcattga 2700 tattgttatc ttttggtctt ttgattttga tgattatttt gggttgttct ttttcttttt 2760 cttttgcttg tttttcttct tcaattcttt gtgtttctct ttctttctct aattttgatt 2820 ctaattcttt ttgatatctg atttggtttt gttttgctgt ttctaatttt aattctaatt 2880 ctttttgttg tttaatttgt ttttgttttt ctttagctaa ttttaactct agttctttct 2940 gatgtttgat ttgtttttct ttttcttttt ctttttgttg ttctttttgt tgttcttttt 3000 ctttttctaa ttttgactct agctcttttt gatgtttgat ttgtttttca ttttcatttt 3060 gttgttttga ttttaattct aattcttttt gattttgatt ttgattttga atttcttttt 3120 ctttttgtat ttgtacttgg ttttctgttt cttttgatga tgtggtttta ggtgcatgta 3180 ttgattgtga tagaggaatg agttgcttcg gggtgaattc aattactgat tcgggactgc 3240 gaggtggata attttggata tcagacagtg aaagttccga gaaatttgga gagatgggtg 3300 tcagttgttg aggagatagg aaaggggatg aattgattga tatggtatca ttttggcgag 3360 gaattgactg caagtttgtt ttgtcaaatg ggagtgaaat gaatcacaag aatagatata 3420 ttatatacaa gagttctgag ctgaggcaaa ggatcaggaa aactgattct tacaagatga 3480 acaaaagaaa aactcagcat gtgattctga tcacaagaag actgagtgct gagactgagg 3540 ataagtgtcg agtattttgt gatacactat tgagagtatc aacattggag tcgtgcagtt 3600 tttggtagtc tcgacaatct tgagctgcat cactatctta tcatcaaaga ttacaaagat 3660 tccaaattaa ataatgccca aattgttaaa actaacaaaa tcatagttgg ctttatcttg 3720 aatcatctgg ataagagtaa tcacactcaa gccataaatc atttaactga caagaataac 3780 
actcttcaaa ttatttatga cccgtttagt ctgtgggagt ttctcaagga tagacatttt 3840 ctaattaatg atcagcgatt agcctctatc tctaaaactc tcaatactgt tactattcat 3900 agaagtgact cgttatctag ttatctcgac aagtttgaaa gtctttttat agaattcact 3960 cgttatggag ggaaaatgga tgacttacaa tcagctattc accttattga ctctattact 4020 aatctcccat ctactactgt tgaattaatt cataatgttg taaaacctct aactcgcaaa 4080 gatgtgatta aatatctcag agattacgac acccgtcaaa ataatttctc aactgaggct 4140 acccgcgaag ttcatcatgt ggaagcgaac acttcaagtg ctggaatgcc gtataatcga 4200 acttcaaaaa tcatgtgtac tgagtcggtt tgtagaggtc ctcatgctcc tgaacgatgc 4260 tggtctcgac ctgaaaattt taaagaaagg gatgactttc ttgcacgaag aagaactcgg 4320 ggtagctgga atccaagagg tcgacaaatt cctaatcgaa cttctcaatc caactcttct 4380 actaccatcc gaggaatgca acgagttgat actcctaacg cgaactcggt ttcagactca 4440 atacaaatgt tgtcacttaa cgttgaattc acctctctag ttgaagacgc tgaagcaaat 4500 tctaccgaat ccacatcttc tgattcgata tgggcacttc atgatacagg tgccactcat 4560 cacatgttta atgatagtag tttgtttgtt caaagtagtc ttaaaaaggt ggatgatgtt 4620 aacaaacgac taaagttagc tggaggagat gtttcattag cagtagatag catcggaaaa 4680 gttcaactaa aagcaggtga tggaactgtg tttgagctca acgattgttt gtttgttccc 4740 gaacttagca agaatttaat ttcaggtggt actttaaaaa tgaaaggtgt cagagaagtt 4800 tacgatgatt ctgaattatc ttgttttgct ttagtaaaag atggattagc tttatttaat 4860 ggagtcattt taaagaacgg actcatgaac gtcaaaatca atcctgtaag cttctcaaat 4920 caattacctc aatcatcaac tacagtaagc tcatcagtca ttcatcatcg cttaggtcat 4980 ttaagtgaaa gttatcttaa acaaatgaaa aggaatgata gtgtgattgg aattggggaa 5040 atacttagta ctccagaatc ttgtgatact tgtaatctgt ctaaaaacac caagattcct 5100 ttcaatcata cccgtcctcg agctttaaga tttcttgaaa acgttcatgt tgatctcagt 5160 ggaatttatc gtgttaatgg tttcaataac gaaaattatt ttatcttgtt ttgtgatgac 5220 ttctcgagtt ataggcatat ttttggtctg actagtaaat caaaagaaga agtttttgaa 5280 tcatttaaag cttacattgc tgttagtgag agacagactg gattaaaaat taaacaattc 5340 acacttgatc gtggaggaga gttcgtaaat tccttactcg gttctcatct tcgtgatctt 5400 ggtatcgagt tgcatttaac ttctggttat gctcctgagg agaacggagt gtcagaacga 5460 ggcatgagaa 
ctattaacac caaagcaaga tgtttactca ttcaatctaa acttccttct 5520 atcttctggt ttttggcttg caagaccgct gtatttctca ccaatagaac tatcactaga 5580 tctcttccag actttaaaac tcctttcgaa atgtggcatt ttcgaaaacc tactttgtct 5640 cacttacgaa tttttggttg tcaagctttt cgactgatca ggaaggaaat acaaaattca 5700 aaatattctc cagtcagttc ggagggagtg ttggttgggt ttgaacaaga caactttaat 5760 taccaaattt gtgatttatc tgataataaa ttccatacgt ctcatcatgt cacattcaac 5820 gaaaatgttt ttccttatca taattctaaa tacaatcaag aatcttcaaa tgacattgta 5880 agaagcattt actctgacaa tgaagaagaa atcactgcgt tgaaaccttt aattgaaaat 5940 aaaatgactg atctaaatga tgaagataac aatgctaaat cgactgaaaa agaagactct 6000 tcaagttcta atgatgaaac aattgaaaat cctgttccta ctattgcaac tcaaaataaa 6060 ttcaacaatt taatacccag aaaatctaca agaagtcaaa ctaaaattaa ctataaagga 6120 atgggtggtt tcgctgaact taacaataaa attaaatcta atgtagccag atttgatgat 6180 gatgatttct gtatctcgtc atattttgac tgcttaccaa cctgttttaa cgcctttcta 6240 aattcaactc cagatccaaa atcattcaag agagctatgt cggcgggtga ctcccccgaa 6300 tggaagcttg catgtgataa ggaaatgtcg tctttgctaa aaaaggacgt ctggacgtta 6360 gttgatagac caactcacaa atctgtaatc cgaggaatgt gggtgttccg taagaaaatc 6420 aattccgata actctgtcaa atacaaagca aggtttgttg ctatgggcaa tacccaggtt 6480 gaaggtgaag attacggtga tacttttgca cccaccggaa aaccaacatc tcttcgactt 6540 cttcttgcta tggctgctat caacggttgg gaggttcacc aaatggatgc agtaacagca 6600 ttcttaaatg gtattttaga agacgaaatc tatattcaac aacccgaagg gtatgttgta 6660 gttggacttg aaagtaaagt gttgaaacta aatagatcat tatatggact gaaacaatca 6720 ccaaaaattt ggcaagatga tgttactcag tttcttatct acactctttg ttttgaacaa 6780 tgcatcatcg atccttgtat ctattttaga tcagatgaag aaaaacaact gttcacagcg 6840 gtttatatcc atgttgacga catggcaatt accggtaacg actcttcaac tttcaaatct 6900 gaaatttcct ccaaatggga aatggaatat cttggactgg ctcaaacaat tgttggaatc 6960 aaaatcgaca ggcctggccc attcatctat tctcttactc aaaccaaatt tgcgcttact 7020 gttttaaaac gttttgatat gctcgactca aagcctgctt caactcctct tactccaaac 7080 ctcaagttgt atcaatcttc tgatgaagaa tgttcaagtt tctcttcact gggtctcaac 7140 tatagaagtg cagttggttc 
attaatgtac ttgtctcaat gtacaagacc tgatctggct 7200 cattctgtag gagttctttc tcaacattta gacaggccta gcttacaaca ttggaacgca 7260 gcgttacaag ttttcaggta tctcaaagga actattcatt taggaattgt ttattcagga 7320 gaagataatc ttaaaatttc tggtcaacgt agtttctctt atcctgtttc acactgtgac 7380 tcagattggg caggagataa atctacacga cgatcaacca ctggttatat ttttacttta 7440 gccgggggtg cactttcatg gaaaagtagg ttacaaccaa cagttgcttt atcatctact 7500 gaagctgaat atagagctat aacagaagct ggtcaagaac ttttatggtt aagaaagatg 7560 atggagtatt ttggttgtcg tgattcaaat cctacagtac ttcaaagtga taatcaagga 7620 gctattcatc tcacaaagaa atcaattttt catggaagaa caaaacatat agaaattcag 7680 tatcattgga ttagagaagt agttaaaaat ggtgatctta ttgtagaaca ttgtcctact 7740 tcagaaatgg tagctgatct tcttactaaa gctcttggta aacaacaatt catttgactg 7800 cggagtagac taggaataaa gtagtaactg gcaagttctt gagggggtg 7849 // ID Copia-44_MLP-LTR repbase; DNA; FNG; 637 BP. XX AC AECX01001150; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-44_MLP_; KW Copia-44_MLP-I; Copia-44_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-637 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001150; Positions 79900 80536. 
XX SQ Sequence 637 BP; 173 A; 132 C; 112 G; 220 T; 0 other; tgttgagaga caacccagtc tctcatgatc ttacttaggt ttcagctata gtgaaagtct 60 aattgacact tgacttatta caaatacgct accccgctta aacttagatc atagtggctg 120 aaactgaact aaatctaaca atactattaa gtcaaaccta aacctagaag ttcctttatt 180 tgaattgcgc ctccatcaat ttgggctaat actcttttcc ttatccgcta ttctacggaa 240 ggaaaagagt aagccccttt cactatttca attctatcat attctttctt tatctttctt 300 tcatttctat atggatgtta ctaattgttt ctcgtcattc atatcttcat gtttcacttg 360 tttgttaatc ggcgctagaa tcgtagctaa aacttatacg cgccgattcc ttttctccat 420 cttattctgt gacctgacta tacctcatag gttagtgaat caggtcttgt gactgaggtg 480 attgagactc atctctaggt attgtcacgg ggaaaacagg ttagtgaatc aggtcttgtg 540 actgaggtga ttgagactca tctctaggta ttgtcacggg gaaaacagac ctcataggga 600 aaacagatat ctcgactagg agacttgtcg cttttca 637 // ID Gypsy-106_MLP-I repbase; DNA; FNG; 7929 BP. XX AC AECX01000915; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-106_MLP_; KW Gypsy-106_MLP-LTR; Gypsy-106_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-7929 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000915; Positions 20134 12206. XX CC 'ACGG' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 372..6788 FT /product="Gypsy-106_MLP-I_1p" FT /translation="MSSVIQIHDQNVADDGDEDIGLPPTSHYKRTPRVSGS FT RIYVQENPVNYNVLPPPPPVSGSTARGKQREASASEDPIPPTAFTGFREQE FT ALVPSDHEEEMAWQQQFFTQQQYYLQREKEMADMINSMRRIKEENEQKKKD FT KEVIKVVKKEKGKKVVKKIIKEEDSDGDSSVSSSSSEEDSSVSENSSEEER FT NEEVRSHVKPRKIDNSDTSVRFEVGGNVKKFIINYHSQAEANGASELDMVR FT QITSFVVGEELKEEIRDMEGWGTGEWSWKKVKEQLIARYQTTTEQPRYSID FT HLRTLSERTMRNGGVTCKSEYQTFRINFDKIHQYLARHDYVDTNDEEVARY FT FYEAFSDELQKKFKKRMIKKKMMKKVSSGRYKLPNLEKLKKVVDHYIEEEA FT LLVFEDIGEERYTIKNEIAGVKSEVIQKDTSSKVAKIGGMQAKATQQMEVL FT CSDLKNLQINHVQNPLQIPAPNSYNFQLPISNALQNQIHQQVGPQQRPINN FT QYAPNNNRNQHYNNNQHYNNQQPNSFQPGRAPNQQNFNRFNNNVNKPYNPN FT NPHTNNQLPAVNNNNNFNNQPITLFCAYCSTEGHTVGRCRWVIQDKQAGLL FT QDRMDGFFLPRSDVRLSKKGEDGIRQQVIKYSEEQANAVARGNIQQQQQNP FT VPPPATNTAVVPPPVIPTSVITKNDGKDRVHELKASCGILEEWSVPEVNKK FT QVEVKAGAGTKLYGVGMKTRSGKDVEEGSGFKLKKKKKVAINDVVEEIDAM FT DVDEEDVEDQRRGILEDTLRRINKDNESEKSEEGSQQTNKPSYIIPNDGDF FT VRSRSERGLPAFADKTKVVSEVVEKIMKGAIELQIKELCAISPVISDEVKK FT WVSKRRLNLNQNTMASVVQSTAVSDFDYESDSDSSLEVNVPQATLYSTPLG FT FVEITIGKKKIKALIDSGSQINIIPLKIMQLLPVQTLVNLKSGVKGISGHR FT TPLCGIAENVKVSIGTNISGLVHFFVAEDEDTPILLGRPFLFDFEAQLNFE FT EDVGERMTVKDNRGVWMKIRLCEPDRGTWERKLNVSIEERQGFMVEEFKQE FT ELFEELREELNEDEVYGFKFEDEVKLDEGLVNELNLIMNMEDEDMLSRDYE FT GRDYRSFSTKYKSVDKKVKPVNAPMPQYLNYPLQRPPLSRDPYVTPLIHNP FT PEFIETEKTTVERLKQCNFGPEGWLSEEEWKLFLFIMVLREAAIAYCEEER FT GLLKHSYGLPYAIPVVEHVPWQIRPIPIPTAIRNDYIELVRQRIRTGLYEQ FT SSSSYSSPVFCVLKGDGKLRIVHDLQVLNKVTIKDSGVPPSPEEFVESFSG FT RACYGLGDIMGGYDERELAEFSRPLTTFETPLGRLQLTRLPQGATNSVAVY FT QAQMMWIWQEELPQHVGIFIDDGGIKGPESDYNNEVLKENNGIRKFIWEYA FT VVLERILFRIEEAGLTVSGKKFAVCVPALEIVGHIVSKNGRSISIKKKNKI FT QTWPTPTNKTEVLSFLGTCIYVRMFIPNFGHLAAPLRRLTRMKVDWEWTQD FT CEDVFNKFKKIIGEEIVLGALNYSEGAPKIILAVDSSYIAAGGVLMQPVFY FT ESEVFSEVESRYSQPKLELCGVAKMMRKFKTKLWGQHFELQVDAKALIQMI FT NTPSLPNAAMTRWVAYIQLFSFDLVHKHGKGFGMPDGLSRRVLGSESEEAE FT SFDEELKEIKVYESFSLEDMEIRSEEEEKIEDNMWDQEGIWRNLQEYLLKM FT 
TRPEDCMDAEFQSIKSKAPLFYVERDRLKKRGVPQGRLVILKLIGQNFILK FT KLHEELGHRGVEETYKRCLIRFWWPEMKESVRRWVMTCEICQKKGGKKQKE FT VGRATGESTIFGRVSLDAVHIKAGSYDYAIIARDDLSGWVEAAPLKKLTST FT AVAKFIVKEWIMRYGSVKCFTVDGGSEFKGIFREAVIMAGSKVVEATPYWP FT QAEGMVERGHKDIKGAVVKLSDETGTSWEAFLPQALFADRISTKRTTGMSP FT YEMLFSQLAVLPVDLEAGTFLGIDWEDVYTGEELIKARMEQLLCREEVIEK FT AYKKMMKARVEGIIYWDKRNAHRMRKPLQEGDLVLTYNKSLEFQWGKLFKN FT RWNGPFRIVRQHVGGSYILAELNGVELSRRYAAEHIKRFYPRGVIPTAEEE FT EVVSVEEEAQI" XX SQ Sequence 7929 BP; 2848 A; 892 C; 1808 G; 2381 T; 0 other; tattggtgac tccactgggg ctctactttc aacggctaca tctgtcattt aaaactgaat 60 aatttcccta attattatct ataacattaa caaatttatt attaaaatta actattacat 120 taataataaa attcaagatt caatttcgaa gaaaaattat tatcatcaag tatcaagaat 180 taattcattc agtcaactta aacaattttt atatttaaat catcaaacag ttcaagattt 240 ctttataaga tttaattaaa ttaacacgca tatatttata tttaattcaa gaagaatatt 300 caatattgag attatttcga attttttttc ttcaaattaa attgttttaa aaaaatttag 360 cagaagaaag aatgtcttca gtaattcaaa ttcatgatca aaatgtggcg gatgatggag 420 atgaagatat aggattacca cctacttctc attataagag aacaccacga gtttctggta 480 gtaggattta tgttcaggag aatcctgtaa attataatgt attaccacca ccaccacctg 540 tttctggatc aactgctcgt ggaaaacagc gtgaagcatc tgcttctgaa gatcctattc 600 cacctacagc tttcacggga tttagagaac aagaagcttt ggtaccctct gaccacgagg 660 aagaaatggc ttggcagcag caattcttta ctcaacaaca atattatttg caaagagaga 720 aggaaatggc ggatatgatt aattcaatga ggaggattaa ggaagaaaat gagcagaaga 780 aaaaggataa ggaggttatt aaggtagtta agaaagagaa aggaaagaag gttgtaaaga 840 agattattaa ggaagaggat agtgatggtg attcaagcgt gtctagtagt tcaagtgagg 900 aggatagttc agtttcagag aatagtagtg aggaagaaag aaatgaagaa gttagaagtc 960 atgtaaaacc acgtaaaatt gataattctg atactagtgt aagatttgaa gttggtggta 1020 atgtaaagaa gtttattatt aattatcata gtcaggctga agcaaatggc gcctcagaat 1080 tggatatggt aaggcagatt accagttttg tggttggaga agagcttaag gaagaaatac 1140 gtgatatgga aggatgggga actggagaat ggagttggaa gaaggtgaag gagcaattga 1200 ttgcaagata tcagactacc acggagcagc caagatattc tattgaccat ttaagaactt 1260 
tatctgaaag aacaatgaga aatggaggtg ttacgtgtaa aagtgaatat cagaccttta 1320 gaattaactt tgataagatt catcaatatt tagctagaca tgattatgtg gatacaaatg 1380 atgaagaggt ggcaaggtat ttttatgaag ctttctctga tgagcttcaa aagaaattca 1440 agaagaggat gattaaaaag aagatgatga agaaggtatc aagtggaaga tataagttgc 1500 ctaatttgga gaagttaaag aaagtggtgg atcattatat tgaggaagaa gcgcttttgg 1560 tctttgaaga tattggagaa gaaagatata caattaagaa tgagattgct ggagttaaaa 1620 gtgaggttat tcagaaggat acaagtagta aagtggctaa gattggcgga atgcaagcta 1680 aggcaactca gcaaatggag gttttatgtt cagatttgaa gaatttgcaa attaatcatg 1740 tgcagaatcc tttacaaatt cctgcgccta attcttataa ttttcaactc cctatctcta 1800 atgctttgca aaatcaaatt catcagcaag ttggacctca acaaagacct attaataatc 1860 aatatgcgcc taataataac aggaatcaac attataataa taaccaacat tataataatc 1920 aacaaccaaa ttcttttcaa cctggaagag cgcctaatca acagaacttt aataggttta 1980 ataataatgt caataagcct tataatccta ataatcctca taccaataat caattgccag 2040 ctgtgaataa taataataat tttaataatc agcctataac tttgttttgt gcgtattgta 2100 gcactgaagg gcatacagtt ggtagatgta gatgggttat tcaagataag caggctgggt 2160 tacttcagga tagaatggat ggatttttct taccaagaag tgacgtgaga ttgtctaaga 2220 agggtgaaga tggcattagg caacaagtaa ttaagtattc tgaggagcaa gcaaatgctg 2280 ttgcacgtgg taatattcaa caacaacaac aaaatcctgt tccaccacca gctactaata 2340 cagctgtggt accaccacca gttattccta cgagtgttat aactaaaaat gatggaaaag 2400 atagagtaca tgaattgaaa gctagttgtg gaattttaga ggaatggagc gtgcctgagg 2460 ttaataagaa gcaagtagaa gttaaggcag gagctggaac aaaattatat ggagtaggaa 2520 tgaagactag aagtggaaag gacgtggaag aaggttcagg atttaaattg aagaagaaga 2580 agaaggtagc aattaatgac gtggtggagg agattgatgc tatggatgtt gatgaagaag 2640 atgtggaaga tcaaagacgt ggaattcttg aagatacttt aagaagaatt aataaggata 2700 atgaaagtga gaagagtgaa gaaggaagtc agcaaacaaa caagccttct tatattattc 2760 caaatgacgg ggattttgtc agaagtagat cagaaagagg tttaccagcc tttgctgata 2820 aaacaaaagt agtttcagaa gtggtggaaa agataatgaa aggtgctata gagcttcaaa 2880 tcaaggaatt atgtgctatt tcacccgtga tatctgatga ggttaagaag tgggtatcaa 2940 aaagaagact 
gaatcttaat cagaatacta tggcttcagt tgttcaaagt actgctgttt 3000 ctgactttga ttatgaatca gattcagatt cttcattgga agttaatgtt cctcaagcta 3060 ctctttattc tacacctcta ggattcgtgg aaattactat tggaaagaag aagatcaaag 3120 ctcttattga ttctggatct caaattaata ttattccttt gaagattatg caattacttc 3180 ctgttcagac tttggtaaat ttaaaaagtg gcgtgaaggg aataagtgga cataggactc 3240 ctttgtgtgg aattgctgag aatgttaaag tcagtattgg aacaaatatt agtggattgg 3300 ttcatttctt tgttgcagaa gatgaagaca cgccaattct tttgggaaga ccttttctct 3360 ttgactttga agctcagctt aattttgagg aagatgtagg agaaagaatg actgtgaaag 3420 ataatagagg tgtttggatg aaaattagat tgtgtgaacc tgatagaggt acgtgggaaa 3480 ggaaattaaa tgtgtcaatt gaagaaaggc aaggttttat ggttgaggaa tttaaacaag 3540 aagaattatt tgaggaatta cgtgaggaat taaatgaaga tgaagtatat ggattcaagt 3600 ttgaagatga agttaaatta gatgaaggat tggtaaatga attgaattta attatgaata 3660 tggaagatga agatatgctg agtagagatt atgaaggaag ggattataga agttttagta 3720 ctaaatataa gtcagttgat aagaaagtta aacctgtgaa tgcgcctatg ccacagtatt 3780 taaattatcc acttcaaagg cctccattat ctagagaccc ttatgttact cctttaattc 3840 ataaccctcc tgaatttatt gaaactgaaa agaccacggt ggaaagattg aagcagtgta 3900 attttggacc tgaaggttgg ctttctgaag aagaatggaa gttgtttttg tttattatgg 3960 tgttaagaga ggcagcaata gcgtattgtg aggaggaaag aggtttattg aaacatagtt 4020 atggtttgcc ttatgcaatt ccagttgtgg aacacgtgcc ttggcaaata agacctatac 4080 ctataccaac tgcaattaga aatgattata ttgaattggt tagacaaaga attaggactg 4140 gtctttatga acaatcctct tcaagttatt ctagtcctgt gttttgcgtg cttaaaggtg 4200 atggaaagtt gaggatagtt catgatcttc aagtattaaa taaagttact attaaagatt 4260 ctggagtacc accttcacct gaggaatttg tggaatcttt ttctggtaga gcttgttatg 4320 gacttggtga tatcatggga ggttatgatg agagggaatt agctgaattt tcaaggcctt 4380 taacaacatt tgagactcct ttaggaagat tacaattaac aagattacct caaggagcaa 4440 caaactctgt ggcagtatat caggcgcaga tgatgtggat ttggcaagaa gaacttcctc 4500 aacacgtggg aatttttatt gatgatggtg gtattaaagg acctgaaagt gattacaata 4560 atgaagtatt gaaggagaac aatggtatta gaaagtttat ttgggaatat gccgtggtat 4620 tagaaagaat attattcaga 
attgaggaag caggtttgac agttagtggg aagaagtttg 4680 ccgtgtgtgt acctgcttta gaaatagttg gacatatagt gagtaagaat ggaagaagta 4740 tatcaattaa aaagaagaat aagattcaga cttggccaac tccaactaat aaaactgaag 4800 tattgagctt tttgggtacg tgtatttatg taaggatgtt tattcctaac tttggacatt 4860 tggcagcacc tttgaggagg ttgacacgga tgaaagtgga ttgggaatgg actcaagatt 4920 gtgaagatgt ttttaataaa tttaagaaga ttataggaga agaaatagta ttgggagctt 4980 taaattatag tgaaggtgcg cctaaaatta tattggctgt tgattcaagt tatattgctg 5040 ctggtggagt attgatgcaa ccagtatttt atgagtctga agtcttttct gaagtggaat 5100 caaggtactc tcaacctaaa ttagaattgt gtggcgtggc taaaatgatg agaaagttta 5160 aaactaaatt atggggacag cattttgaat tgcaagtgga tgctaaagct ttaattcaga 5220 tgataaacac gcctagttta cctaatgctg ctatgacaag atgggtagct tatattcaat 5280 tattttcatt tgatttggtt cataaacacg ggaagggatt tggaatgcct gatggtttat 5340 caagaagagt tctaggaagt gaatcagaag aagctgaaag ttttgatgaa gaattaaaag 5400 aaattaaagt atatgagagt ttttcattag aggatatgga aataagatca gaagaagagg 5460 agaaaattga agacaatatg tgggatcaag agggtatttg gcgtaattta caagaatatt 5520 tattaaagat gacaagacct gaagattgta tggatgctga gtttcagagt atcaagagta 5580 aggcgccttt attttatgtt gaaagagaca ggttaaagaa aaggggtgta cctcaaggaa 5640 ggttggtaat tttgaaatta attggtcaga attttatatt aaaaaaatta catgaagaac 5700 ttggtcatag aggagttgaa gagacctaca agaggtgttt aattaggttt tggtggcctg 5760 aaatgaaaga gtcagttaga aggtgggtga tgacgtgtga aatttgtcaa aagaaaggtg 5820 ggaagaagca aaaagaggtt ggaagagcta ctggagaaag tactattttt ggaagagtaa 5880 gtttggatgc tgttcatata aaggcgggta gttatgatta tgcaattatt gctagagatg 5940 atttgtctgg ttgggttgaa gcagctcctt taaagaagtt aacttcaact gctgtggcta 6000 aatttattgt taaagaatgg attatgaggt atgggtccgt gaaatgtttt actgttgatg 6060 gtggatctga gtttaaagga atatttagag aggctgtaat tatggcagga tcaaaagtag 6120 ttgaagccac gccttattgg ccacaagctg aaggtatggt tgaaagaggt cataaagata 6180 ttaaaggtgc tgttgtaaaa ttaagtgatg aaactggtac atcttgggag gctttcttac 6240 ctcaagcttt atttgctgat aggatttcta caaaaagaac tactggaatg tcaccttatg 6300 aaatgctttt ctctcaattg gctgttttac 
ctgtggactt agaagctgga actttcttgg 6360 gaattgattg ggaagatgtt tacacaggag aagaattaat taaagcaaga atggaacaat 6420 tgttatgcag agaagaagtt attgagaagg cttataagaa aatgatgaag gcaagagttg 6480 aaggaataat ttattgggac aaaagaaatg cgcatagaat gaggaaacct ttgcaagagg 6540 gtgatttggt tttaacttat aataaaagct tggagtttca atggggaaag ttatttaaga 6600 acaggtggaa tggacctttt agaattgtta ggcaacatgt aggtggatct tatatattgg 6660 cagaattaaa tggagtggaa ttgtcacgta gatatgcagc tgaacatatt aagagatttt 6720 atccacgtgg tgtgattcca acagcagaag aagaggaggt ggtcagtgtt gaagaagagg 6780 cgcaaatata aagatattta aaggtatata ataatattaa tttcaaagtt taaaagaata 6840 aaaagattaa aagataagaa tataaattta ttaagatatg tttcactagt tataaagtat 6900 aatgttatgt taaatgaaat attaaatcaa ataaaaagga taagttaaaa gtgttttaaa 6960 attaatcaaa taagcgtgga tcaagatggt taggagtaga aggaacatca gagttatcag 7020 gacttgggta attaagagtt atattgtcaa tattattaaa atcaggagga gaaagagaag 7080 gagggccagt attaggcgga tttgaaagag gtgaatcagg agcatgagta ttattattat 7140 gaagtctgtt ggcattttca gcgtggtgga tccaaggaat aaaagtattt atagttggag 7200 ctccatcaag accttttcta acaatatatt catctatagc agcttcacgt agaaaattat 7260 aatgattatt aactatgttt ggaatgatat taccattatc tttgaattgt cttataacat 7320 catcaagaat gcgccaagaa ttccaatttt gagaagtaat ataatcagtc ttccatggta 7380 aattgtgatc ataaagagaa agaaataatg ggtctgtagt aggaaaggta gtttcaacag 7440 aagaagaaat tgaaggaact ggtgatggag gtaaatccac gtgttgtaaa tcatgatcaa 7500 atatattatt taagggtgtt gaagagcctt catagaaata ttgatgagta gtttggtggt 7560 tgaaatgggc aactgaataa agattaaata aaagaaagaa agaataaaaa acattagtaa 7620 aaattaaaaa attaaataaa tagtaggaaa tttgaaatta tgacttacca catatagaat 7680 ttaatattac acatctttga catcttaaag gttgaataat gctaagagga cttaattcat 7740 aaaagcaatc tacttcttca aggcggcatt gttcacatgg agctcttata gttggatgag 7800 aagtcatttt taggtgtttt tgtcaaaatt tattaaggaa gttaaaagaa gagttttaga 7860 attgaaggat atgcaatttt tgaattgttt taatggtaaa ctggggacag tttgagaaga 7920 agaggagga 7929 // ID LMR1_LM repbase; DNA; FNG; 7373 BP. XX AC . XX DT 17-NOV-2004 (Rel. 
9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE Leptosphaeria maculans putative retrotransposon, partial DE consensus. XX KW Non-LTR Retrotransposon; Transposable Element; LMR1_LM; KW gag-pol domain; putative retrotransposon; Repeat region. XX OS Leptosphaeria maculans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Leptosphaeriaceae; Leptosphaeria; Leptosphaeria maculans complex. XX RN [1] RA Taylor L.J. and Borgmann E.I.; RT "An unusual repetitive element from highly virulent isolates of RT Leptosphaeria maculans and evidence of its transfer to a weakly RT virulent isolate."; RL Mol. Plant Microbe Interact 7(2), 181-188 (1994). XX RN [2] RP 1-7373 RA Gentles A. and Jurka J.; RT "Putative retrotransposon from L. maculans."; RL Direct Submission to Repbase Update (OCT-2004). XX DR [2] (Consensus) XX SQ Sequence 7373 BP; 2606 A; 1157 C; 1222 G; 2123 T; 265 other; ataaggntat ctagcnccta ggcttantac tctatagtan taaccctttt aaatannntn 60 ctangaacaa atangtatta ttctaanagg ttttaggata actanngaga gttttantat 120 ntctagacta ntaagtatta gaggactttn ctaacannta aagcttacaa gnaatagtgc 180 nactanntag agaaacttta nacntattan ttaanctgta aaccttacta agnttaggaa 240 aantttagga gatanatant tanctagtag ttaagnttat anaaggtnaa nanttagnaa 300 cncttcctta ctagagctct tncttagtag taannaaagt taagaaggan taantagnnt 360 aagatgctat anaanttagt taaaaggact anaacntacc tctaactata gntaaanctt 420 ttattaaagt tgctagttan aagatantat ccttaataga ctanagggtt aaattagntt 480 accctttagn tagtagggat gcaactacta tatacacctc tatctgccta ntttaancct 540 ttactatagn ttaagtagta ataaactaaa cancagnntt ttagtnaaat ataatanana 600 tactatanat ncnctnccct tatatntact attaataaat antattaaag aagagggtnt 660 atagngcntn taatagttaa gngcttaann nntantagca tagctaagat taagaannta 720 ggggacntaa aaagaaatat cnttagtnta gtttntatta tantaaaact aancnaagat 780 anctaaactn tacgcagnaa gtctagtant ctataactcc ttaagtagag gnncctagcn 840 ttggtactaa gcanagctan cttaacacaa 
agnatctata naaaganctn tanactaana 900 gagctctata ctantaaact aatagcccta aactttagta nagaagtang taggtaagag 960 ntaagagctt cnctagnana aatagtagaa gnntanctnt cctagantaa aggtanctng 1020 taggaaanan ctaaagcnaa tagtactcta ctaagctagn gtgtagagta ntgctatagg 1080 taagggcgcc ttaagctctt aannagaaga taaggaanct agctncttta ggattctcct 1140 acagattcta ggcaagngga acctagagct cccnntagan tttaaggact gcnttnagcg 1200 cgtaatagnc cttagggaag naacctaata aannacaaag aggtaggnta tanntagtgt 1260 tgcaggaggg atatcaacta gatcacggag catcttgatc cgatcaatcc tagttaatac 1320 ccctgtataa cccactatct aatcgactgc cagcttactg ctaagtgcct aacaaaaagc 1380 aggagctcta gtaaggaagg tttactaatt atttaccttc tagtaagggt agctaaagct 1440 ctatatatat gagtccaatc gtagtagata caatctaata atctatctcc taaattattc 1500 ctaaccttag taatctaaat ttctatagtt ataagcctag atcgcgcttt accttattaa 1560 agatcgtcct atctatttaa gcgtattgtt taacttggac actaagctag caagctttac 1620 tctttatact atttcttaag caaaagctta cttattactc ttctactata agcgactaaa 1680 tcctaaaggg ggatatccta cttactagat taactaacta gcctacctgg tacatttaat 1740 tcagatacta tgccctaact aaagggatct aggcagaggt agaccctaag gctctagatg 1800 ctaaaaacca ctttaagaat aaacctatta agcctaacat gctaacagat attgcgctat 1860 ctaatataac aagcaagcag tataaactct aaatagcata gtactaagct tgcttattag 1920 attaggcatt aaagttatta gtaatctaga acctctttac ataggttaat aatatagtag 1980 attaagggat ccttacccta gcctatatag agcttgttaa ttaaggaaga gtaaccctct 2040 aagcactaat ctaagtgctc cgtaataacc tagcgctaac taacttaagc gctgcaacac 2100 tagcaagatt gcagtataaa gcttatcttg ctaaggcaaa gataagtagc taagacgcta 2160 atcgctagca gaaggagtgg ataatcctgt atacgcaagc taaggtatac aacgttccta 2220 atattaaagg atagcttgta gtacaggatt atcttaaagc tctatctgct aggatcttac 2280 tagattaggg ttaaactatg ttttaaagga ttattaaaaa gttagtgata ggagagctaa 2340 cttataacct cctataggtt gctaagatct atttattact gctctaagag aacaatataa 2400 gattaagctc taagggtcct agagtgtttg caacaatagg agctctacct tctaattaac 2460 ctaagaagga gtctttatac ctatatagta gatcttacct atagaaacct cctaaataca 2520 aaaayrttat taaagcttag actaatacag gtaacaagct 
tactactact taataattag 2580 agattaagaa aaagattaat aggaatccaa aatttaaaga gttaagaaaa gctctaatta 2640 agtcctatcc tattaatcaa gattatagta gtaaggyaar cgcgatcacc tatctaggya 2700 gtatctaggc ggcactaatt aaccctatat taatagcgga gaaaacaaac aggatctatt 2760 taactataga ctttaaagct taygctctat taaagagcac tatctttaay aataggggtg 2820 cagtgcacct agttaataat ataagctacc tagaagaaag cttgtttaga ttagttaaat 2880 ataagayagt taaagcagga acttaagctt ttctaatctt aggcagaggg actagagtaa 2940 tccctaatrc tcttaataga cyaagaggtc caaaaacaga agatttagtg cttactaacg 3000 tggtgttagt agaaggtttt tatgtraata ttatattaga agcttaattg cttaaagcag 3060 gagtttrgtt ccttaggcta gatgcyacct tgcggtttag attattagaa aagagcgttg 3120 tattagctaa gttactgcgc aagtttaact taactttcct agaatayaag ccctctacyc 3180 cttatywaat aatctaaagc atagtgccta aacaacccta ayaatcctag acgacttacc 3240 taagacacga yagtgaggag ctttagcact aayaattagg ctatttagga cctaaggcgc 3300 ttaaagctct agttaagtta gcwataaacg ttaggattaa aggaactcct aggagcaaat 3360 gcgagcactg cgctattaca catgcctaac aggttatatt aagataacta agggaaagat 3420 tacyayrtct atactactag gtattatagg atctatttaa yatgctaaca ggtatagctt 3480 ataagcaata gatcttagta ttaaagtgcr actacttagg aaagctttat acctatctac 3540 tgcaagctaa aaaccttaat aagattatrc rggtgtttaa aaattttaag agcttaatac 3600 ttaactaata taagcttagc atagttaaga ttatgcaaga caacaacgtt gcaacgctcc 3660 cttagcgtgg caaatctygc ttttagatct aggyagctaa yaatagtatt aaaattaaga 3720 gcttacctgt atatacctat aaacctaata gaggagcaga aagagtaggg yaggagatta 3780 taacaaaatt rattaaaata aggattagtg ctaacctact aacaaagctc tagcctaaaa 3840 ttrttaaagt agcracttag ctctataata taagcctrtc ctatgcttay aatataatat 3900 cacctaayaa agtgctagat tgttagttta ctagatactt taggtrgtrg caactagagc 3960 agataaggga ggyaactact aatctccgcc ctaattagag cggaatatac gcctatagct 4020 rttaagctta cccccttaat agagattaag tagctaggcg ttacaagagg gtttttaagg 4080 tgaaccctta gggttatatt agatatctag taggatayag agcatctaay atatatagga 4140 tatagatccc cttrcttaat tagattatta taacgyrgaa crttaccttt aataaggatc 4200 ttttctayaa agagaaagat ctagagyagc tttaatagtt agaggcttaa 
aagatagtta 4260 ayrttattag caaagataag atctataata taggagaagc atataaagag cttaatatct 4320 ttaattagct ttgtattgca gcagagcaat ataaggagtc tagtaagcaa graggactaa 4380 acctagcgca ggagctaggg ggtagggtag agctagagga ggtagataat taggctagct 4440 agcctagtaa ttcacaaccc cctaagcaga cacctctagc gtaggtacta agcacrgcta 4500 gcttagcaya aaggatctat ayaaagtccc ctaagctaat agggctctaa actcctaaac 4560 taacactaga actaaacttt agtataggag ayaaaggatc tatrcaggtt atagattagg 4620 gtcttactac ctaagatagt agtaatctaa cttactttaa cgcyayaata ggtagctcct 4680 aggaaagacc tagaggggag agtgctatta gtaggttact ctataagagc acrgagatag 4740 gtaagggcac cctatctagc tcttataggg gtrgtaagta gagctcctac ctacctgctt 4800 taggactaga ttctgtatct cctgcagatc ccttacaagc ggcacctaga gctcctagaa 4860 gaagtagaaa accgcragtt aaaggactgc cactaacgyr tgttagtaag caagcttagg 4920 gacaggaccc taataaacya ctaggaggct ctataagtrc ggggatatat atagtaatag 4980 atctattaga tatagatcta attagaaatc ccttatacaa tcttttagta tatcctaaac 5040 taaacgctat aatccacgct gtratygyag tagtaatagg gagyaaatcy cctaaaaacc 5100 ctaaaagraa tacgcactag gacgctctat aaaaagagct aaaataatag aaggatctct 5160 ataactacta aataggatag caatttagag acgyagtata taaagaaatt aatactctac 5220 taaaagctag tacctaggag gagattaata ggctaamtat aggagagtgt ctactcctac 5280 ttaaataggt gtttayatac aagcttaatt aggatagtta cctaattaag tgcaaagcaa 5340 ggatagtagt aagaggagat ctatagctta ctaacttaat ttatttaacc tacgcagcta 5400 ccctagtagc ttaaaccttt aggactataa tagctattag agctaagttt aaccttaaga 5460 tatattaata taacgttgtt agagctttcc ttaacgcctt aagggattaa caccctatag 5520 ttatctrcaa gctacctaaa ggatattaaa tacctaggaa gtgcgttaag cttaaacgag 5580 ctctatatag attaaaagac ttactattat tatagtataa taagctctta actacactct 5640 aagaaaataa gcttattgct tctaaagarg agcyatgcct attctttaay agagatcgca 5700 gtatcttgtt aatattctat gtrgacaata tcctatcgct ctatcacyaa aactacgcaa 5760 gctaagctya caaagttatc taagctctaa agcaaagata tactatagaa gaaaagggac 5820 ctgtaagcta gtttctaggg gtaagagtaa tctaggatag aaagagatag ayaataacgc 5880 ttrtttataa taaatacatt aacaagattr caaagaaatt taatctagya gagataggaa 
5940 aattccctac tatactacta ttaagtaaag atattaaaaa gagcactaga gaagccacta 6000 aaaargagat taaggactat taggagcgcg ttagattaat cctttacacc ttaattatag 6060 tgcgccctaa tattrcctat gcagcctcct agctctccta ataccttact aacctatcta 6120 aayaacactt taatgcrgtt aattaagtaa tyrtctatct atactraact taatactaat 6180 taatctaata taggaatagg gatcctaata agcttataat atatagtaat rcgttatttg 6240 ctaataatat taatacttag crattattat atagatacct aatyacgctc tttagaggac 6300 ctattattta gaaggyagct taacaagcaa ctgttactac tttaactact aaggyagagc 6360 tccttgcgct taagyaagta agtaaagaag caatagcgtt aaaacgcttt ttaactaaaa 6420 tacaccttac tttagatact acctagataa ttaattgtaa taattaataa actattaggt 6480 tagtagtagg caataataaa aggatcacta ctaarctrcg ctatgtagat atttaaaaca 6540 tatagcttag acaagagtat awaaagggat ctttcyatat tacctacyta cctactagta 6600 atatactagc taataggctt actaaaaacc taactataca ataatttata aggtttaggg 6660 agcacctaaa sttatataat agtagagyat atattatata gtattaatta aagtaaggta 6720 gtatataaga tctattaata tataatataa gaataactaa ctatagccta cctctctatt 6780 attaactagg tagcttccct aaggagctct attacgcgcg caaagcgtgc cttagagtct 6840 atagggagct rcctaggttg ccctaaccta gaatctatag ggggaacctt agaggagcta 6900 gagtccttat cttctaataa ggagctctag gcgcccttag ctatagtatt agccttgcgc 6960 gyaagcttag cagnagttag ctcctactac ttagccttac tctccttaag ttatccttac 7020 tgcctacttc cttactaggg ctattattag tatagatctt actactagag aagnactact 7080 agtactaagt ttagacttat taaagagcgt nnagtagata agtgtataat aaatactgct 7140 agagttagag ctaagaagct aaggnaggag cncccttcct tagcttctnn atctttagta 7200 gtagctttag ctcacgctct tctttagtat agcnacgtag gtaggtagag taagtanaag 7260 tagtagtttt taactttaac taanatanag gttagtangg atataagtat agtttagatc 7320 tttagagtta taaatcttat ataggtagat agagttatag agcctantct aga 7373 // ID DIRS-2_MLP-LTR repbase; DNA; FNG; 112 BP. XX AC AECX01000664; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 24-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. 
XX KW DIRS; LTR Retrotransposon; Transposable Element; Copia; KW Copia-66_MLP_; Copia-66_MLP-LTR; Copia-66_MLP-I; DIRS-2_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-112 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX RN [2] RP 1-112 RA Kojima K.K. and Jurka J.; RT "DIRS-type retrotransposons from fungi."; RL Direct Submission to Repbase Update (24-MAY-2011). XX DR Genome; AECX01000664; Positions 571 682. XX CC [2] Re-classification as DIRS based on the presence of tyrosine CC recombinase. XX SQ Sequence 112 BP; 24 A; 28 C; 27 G; 33 T; 0 other; tgttgtagag ttatggggtt acacgactgc accacagcgt gtcacgtgat gttgtacagt 60 ctctgctttg tagtcaagtc tcatctctcc gggagaagct tccacatcac ca 112 // ID Mariner-2_AN repbase; DNA; FNG; 1876 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 19-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE DNA transposon, Mariner superfamily, Tc1 clade - a consensus DE sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW mariner superfamily; Mariner-2_AN; Tc1 clade; transposase. XX NM Mariner-2_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-1876 RA Kapitonov V.V. and Jurka J.; RT "Mariner-2_AN, a family of nonautonomous DNA transposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(11), 195-195 (2003). XX DR [1] (Consensus) XX CC DNA transposon. Mariner superfamily. Tc1 clade. CC The consensus sequence was reconstructed based on multiple CC alignment of 3 copies. They are 99% identical to the consensus. CC Mariner-2_AN elements are characterized by TA target site CC duplications and 37-bp TIRs. 
CC It encodes a 393-aa Mariner-2_ANp transposase (2 exons, pos. CC 291-1116, 1318-1673). The transposase is closest to mariner/tc1 CC transposases from frog and fly (GenBank, AAP49009.1 and CAA82359, CC respectively). XX FH Key Location/Qualifiers FT CDS join(291..1115,1317..1670) FT /product="Mariner-2_ANp" FT /note="transposase" FT /translation="MPRGGFHPVELRVQVLTLSAIGFSTEKISKSLNLSPR FT TVQSIVKKGRDRGYRPEVSLRVQLEFVEDRKRSGRPVEITEATQNTVITSV FT TADRAGREKLSEILAYEAGISHSSVLCILHSHGFVIAKPSWKPGLTEAACL FT RRLEFCLAHQHWTLEDWKRVIFTDETGVILGHRRGAIRVWRTVKDSHTRNC FT VRRRWKACSDFMVWGCFSYNKKGPLHIYKPETAAMRKQADIEIEAMNRELE FT PLCREEWELATGLSRVHLRPNRGRVPKWNWNEKNEDSAPAHCHRIQQHVYK FT AEDVQKILDWPGNSPDLNAIEPCWAWMKKRTTSRGAPRDKKTGEAEWRQAW FT ADLPQETIQHWIERLIRHIQIVIELEGGNEYKEGREDRDTRSWAGRRIKG" XX SQ Sequence 1876 BP; 484 A; 418 C; 449 G; 525 T; 0 other; cggaggcgtg caccaaaagt ttttgcgcga cgctagtgtc acgccatgta atactgttac 60 gccctacata acaatttcta tggcaacccc actaccatac atacattaat attttgacat 120 atattgcttc ttcattgttt cagctgcctg cagcctcttg aagctattca cactgctcat 180 atattctatt tgactgatca atttggtgtt ttagcacgct tagcacgctt tttatactaa 240 ttcaacttat cctgccacag ttgctctcct tcctcaacac cagcttcatg atgcctcgcg 300 gcggctttca tccagtagaa ctccgtgtcc aagttcttac tttatcagct atcggattta 360 gtacagagaa gatctcaaaa tctttgaatc tctctcctcg tacggtccag agcatcgtaa 420 agaaaggcag agatcgtggc taccggccgg aagtaagcct gcgcgtgcag cttgaatttg 480 ttgaggatag aaagcgatct ggccggcctg ttgagattac tgaagctact cagaatactg 540 ttattacttc agtaactgca gatcgagcag ggcgcgagaa attatcagaa attcttgctt 600 atgaagctgg tatctcccat tcttctgttc tttgtatcct tcattctcat ggctttgtta 660 ttgcaaaacc ttcctggaag cctggtctga ctgaagctgc ttgtcttagg cgtcttgaat 720 tctgccttgc ccaccaacat tggacattag aagactggaa acgcgtgatc tttaccgacg 780 agactggtgt tattcttggc caccgccgcg gagcaatacg agtgtggagg actgtgaaag 840 attcacatac aaggaattgt gtacggaggc gctggaaggc ctgctctgac ttcatggtat 900 ggggttgctt ctcatataat aagaagggcc ctttacatat ctacaagccg gagactgctg 960 ccatgcggaa gcaggcagat atagagattg aagccatgaa 
tcgtgagctg gaacctctat 1020 gccgggagga atgggagttg gctacaggtc tttctcgtgt tcatttacgc ccaaatcgcg 1080 gccgtgttcc taaatggaat tggaacgaga agaacggtaa gcttatacgt aaaggtaaag 1140 gggggattga ttggtggaga tatcaaacag tttgttccct tatctctata attctctatt 1200 atagagtagt taagcacgtg ctaattactt attctactgc ctaggaagtc cttaaacctc 1260 ttcttattcc atttgcaaaa gaatgcatga ttgagcgccc aaatactatt gttttagagg 1320 atagcgcgcc tgcccactgt caccgaatcc agcagcatgt ctataaagca gaagacgtgc 1380 aaaagatcct tgactggcct ggcaattcac cggatctcaa cgcaattgag ccgtgctggg 1440 cttggatgaa gaagcgtaca acatcccgcg gtgcgccccg cgataagaag acaggagaag 1500 cagaatggag gcaggcttgg gcggatctcc cacaggagac tatacaacac tggattgagc 1560 gtctaattcg tcatattcag attgttatcg agctagaagg gggtaatgaa tacaaggagg 1620 gccgtgagga tcgcgatacg cgtagttggg caggcaggcg gattaaaggg tgactatcac 1680 cacgtgtaga cctcgctcta cagccaatag aggcccctga atagcttcat ttctcttgtt 1740 tttgatttcg gggtttatgc ggatatagtt agttgtgggt caaaaaacat gttgctatag 1800 taatttgtat gtaagcttgt tacgtcggcg cattaaatta ctagcgtcgc gcaaaaactt 1860 ttggtgcacg cctccg 1876 // ID Gypsy-55_MLP-LTR repbase; DNA; FNG; 179 BP. XX AC AECX01002782; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-55_MLP_; KW Gypsy-55_MLP-I; Gypsy-55_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-179 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002782; Positions 5334 5156. 
XX SQ Sequence 179 BP; 45 A; 41 C; 34 G; 59 T; 0 other; tgtaatgacc taaagatccc aatactggga ttatgcttat actgtagtat ctactttgta 60 ctatagtcag tagggagagg agggacactt tgtcctctct tccctcactt tgcaatcttg 120 ttattaaggc tatagtaccc tgttgctcct atccctcgag agctagtcag accattaca 179 // ID Gypsy-1-LTR_RO repbase; DNA; FNG; 464 BP. XX AC . XX DT 27-FEB-2009 (Rel. 14.02, Created) DT 27-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE An LTR portion of the Gypsy LTR retrotransposon from Rhizopus DE oryzae. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-1-LTR_RO. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-464 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Rhizopus oryzae."; RL Repbase Reports 9(2), 631-631 (2009). XX DR [1] (Consensus) XX SQ Sequence 464 BP; 198 A; 86 C; 43 G; 137 T; 0 other; tgtgagaatg gtctcttttt ggacgaagta cgaaacaact ccgacatgga ttgttaaaat 60 aacttaagtt attttaacaa ccattgtcaa agaaataaca atagccgcaa aaataaaaaa 120 aaaaaaaaaa aaaaaagaaa aaaaaaatta tcatatatat atcccatctt tttatgaaga 180 aattgtcagt cagtttaaat aaatcatata tttctactca ataattaaca aactttaaca 240 agaaagtccc tgtttaccta aactttttac tctactttaa cttttatcaa gaaacatcgt 300 attgatctta cacataagat ctggtggtca ctacttatca agcatctgct taaattccat 360 acaaaacaac aacttacaag gcaatcacaa ccttaaatca agctttacaa gcaaacttta 420 cccaatttaa ctcacttact tacaagccaa cctttaaaat aaca 464 // ID Gypsy-54_MLP-I repbase; DNA; FNG; 7908 BP. XX AC AECX01001611; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-54_MLP_; KW Gypsy-54_MLP-LTR; Gypsy-54_MLP-I. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-7908 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001611; Positions 38881 30974. XX CC Positions [5752-6222] - Integrase core CC 'CCGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 253..6690 FT /product="Gypsy-54_MLP-I_1p" FT /translation="MSSVIQMHNQDVADDGDEELELPPTSHYKRTPRISGS FT RVYVPDVSTSINILPPPPPPGPSSGSTIHGKQREEPPASEDPIPPSAFTGF FT REQEPLLPSDHEEDVDWQQQFYNQQQYYLQREKEVADIITNNRIKEEREKK FT EKEKEVVKVVKKSKGKKVIKKIIKEDDGGDYSSSSSSSDDDSTSNGDVSSE FT ERNEEARSHIKPRRIDNTDSSIRFEVERNVKKLIINYHSQAEANGASELDM FT VRQVTSFVVGEELKEEIREMDGWGTGEWSWKKVKEQLIERYQTTTEQPRYS FT IDHLRTLSERTARTGGVTNKAEYQTFRINFDKIHQYLTRHDYIDSNDEEVA FT RHFYDAFSDELQKKFKKRMIRKKLMKKVDSGRYRLPNLNKLKKVVDHYIEE FT EALLIFEDIGEEKFAIKNEIAGVRSEVIQKDTSNKVAKIGGMQAKATQQME FT VLCSDLGNLHINHVQAPLLLPPVNALHPYLPVSNSLQTQIHQQAEPFRAQH FT QQNPSNYQRVNQPYNNNNNFNNQPRNFPAQNYKGQQPNYSRPYNNNNVNRP FT YNPQNPNTNNQNTYQPPPPPNGNQGSTVFCVYCNTEGHTVGRCRWVIQDKQ FT AGLLQDRMDGFFLPSGNVRMFKQGEDGIRQQVIRYSEQQANDATRQNIQQQ FT RVTTPAVHVNTPVVPPVTPTAIITKNDGKDKIQELKTSCGILEGNWRMPEI FT LRSNGNELLGGGGTKIYDITENTCSGKTVTQAEPKKKKKVTINEEAEDIDD FT GMDLDIGNNIEEAGINEMEESLRRIHADDDNRNIKEGSNKTIRPSYIMPGH FT GEQKNKLERPVFADNPRVVTDVVQKLMKSVIDIQVKELCAISPVVSEEVKK FT WVSKRRLNMNQETMASAVNSTALSDFEFDSESNSSLEVNLPQSPLYSTPLG FT YVTITIGKIKLKALVDSGSQISIIPQKFMQLLPVQTMVNIKGAVRGIGGQK FT TVLCGIAEKVKVRIGRNIEGLVQFYVTEESQTPIILGRSFLFDYAVQLDFK FT KDLGEMMTLKDNRGVWMKVRLCEPDKGNWERELNVEIEERQGYAARDLCLG FT IDYDIVEDEDDEEEVFGLQFEKESVDYDLINELSLIMCLEDQEIFGRGREL FT KDLRSYSTKYKTVDKKVKPVNAPMPQYLNLPLQRPPLSRDPYITPLIKNPP FT VFKETEKTTVERLKQCNFGPAEWLSEEEWKLFLLILVLREAAIAYCEEERG FT LLKHSYGQPYAIPVVEHEPWQIRPIPIPVAIRNDFIELVRQRIRTGLYEQS FT TSSYTSPVFCVLKGDGKLRVVHDLQVLNRVTIKDSGVPPAPEDFVESFAGR 
FT ACYGLGDIMGGYDERELAEFSRPLTTFETPLGRLQLTRLPQGATNSVAVYQ FT AQMMWILQDEILQHVGIFIDDGGIKGPESDYNNEVLKENNGIRKFIWEYAV FT VLERMLFRIEEAGLTVSGKKFAVCVPALEIVGHIVSKKGRSISIKKKNKIQ FT SWPTPTNKTQVLGFLGTCIYVRMFIPNFGHLAAPLRRLTRLKVEFIWSQEC FT EDAFNLLKKIVGEDIILSALDYNEEASPIILAVDSSHIAAGGVLMQADGNG FT EVRPVFYESEVFTEVESRYSQPKLELCGVAKILKKFRTKLWGQHFELQVDA FT KALIQMINTPSLPNAAMTRWVAFIQLFSFDLVHKHAKSFGMPDGLSRRMRG FT SDSDSAESFNEEDKDIKVYESFKLQNEDLTSEKEEEESGTIWAQEGVWKYL FT QEYLTTLKRPEGCLDDEFQRLNNRAPSYYVENERLKKRGVPYGRLVVSNLE FT GQRFVLEKLHEELGHRGIEETYKRCLVRFWWPEMKEFVRDWVLTCEVCQKK FT SSKKEKEMGRATGESTLFGRVSLDAIHIKAGNYNYAIIARDDLTGWVEAAP FT LKKLTAPAVAKFITKEWLMRYGSVKCFTVDGGSEFKGVFRDAIAMSGSKLG FT ESTAYWPQSEGMVERGHKDIKGALVKLSDETGTSWETFLPQVLFADRISTK FT RTTGLSPYEMLFNQLAILPVDLEAGTFLGIDWDIIYTGEDLLKARMEQLLG FT REELIKKAYEKMMKARVDGIIYWDKKNAHRLRNPLEKGTLVLTYNKSLEFQ FT WGKLFKNRWNGPYKVVEQHMGGAYILEELDGTQLSRKYAAEHVKRFYPRGV FT AILAEEEEVGIEEEEAQL" XX SQ Sequence 7908 BP; 2957 A; 940 C; 1722 G; 2289 T; 0 other; tattggtgac tctgcccggg cttgacaaat taaaatcaac ttataattaa atcaatttct 60 cttttaatca ttatcaacaa gcaaattata ttatattaca tcaagattca ataaaaatta 120 ttcacgtggt tatcaacaga agatctttaa aattaaaatt attcaaatca atcataaaca 180 ttatctttca ttctcattta aaagtatatc aaaattagtt cttatttaaa gaatttatta 240 tataaagaag ctatgtcttc agttattcaa atgcataatc aagatgtggc ggatgatgga 300 gatgaggaat tagaattacc accaacatct cattataaaa gaactccacg aatttctgga 360 agcagagttt atgttccaga tgtttctact tctattaaca tattaccacc accaccacct 420 ccaggacctt cctctggatc aactattcat ggaaaacaac gtgaagaacc acctgcttct 480 gaggatccaa ttccaccttc agcctttact ggattcaggg aacaggaacc attattaccc 540 tctgatcatg aggaagacgt ggattggcaa caacaattct ataatcaaca acaatattat 600 ttacaaagag aaaaagaagt ggcagacatt attactaata atagaattaa ggaagaaaga 660 gagaagaagg agaaggaaaa agaagtggta aaagtcgtga aaaagagtaa aggcaagaag 720 gttattaaga aaattattaa agaagatgat ggaggagatt attcaagttc ttcaagttct 780 tcagatgatg attcaactag taatggagat gttagtagtg aagaaaggaa tgaggaggct 840 agaagtcata ttaaaccacg tagaattgat aatactgatt 
caagtattag atttgaagtt 900 gaaaggaatg ttaagaagtt gatcattaat tatcatagtc aagctgaagc aaatggcgcc 960 tcagaacttg atatggttag acaagttact agctttgttg ttggagaaga attgaaggag 1020 gaaatacgtg aaatggatgg ttggggtact ggtgaatgga gttggaagaa agttaaggaa 1080 caattaattg aaaggtatca aactaccacg gagcaaccaa gatattcaat tgatcattta 1140 agaactttgt cagaaagaac tgctaggact ggtggtgtta caaataaagc tgaatatcaa 1200 acatttagaa ttaatttcga taaaattcat caatatttaa caagacatga ttatattgat 1260 tcaaatgatg aagaagttgc acgccatttc tatgatgctt tctctgatga acttcaaaag 1320 aaattcaaga aaagaatgat tagaaagaag ttaatgaaga aggtggatag tggaagatat 1380 aggttaccta atttaaataa attaaagaag gttgtggatc attatattga agaagaagct 1440 ttattaatct ttgaggatat tggagaagag aagtttgcaa ttaagaatga aattgctggt 1500 gttagaagtg aagtaattca aaaggatact agtaataaag ttgctaaaat aggaggaatg 1560 caagcaaaag ccacgcagca aatggaggtt ttatgtagtg acttgggaaa tttacatata 1620 aatcacgtgc aggctccatt attattacca cctgtaaatg cattgcatcc ttatcttcct 1680 gtttcaaatt ctttacaaac tcaaatccat caacaagctg aaccttttag agctcaacat 1740 caacaaaatc cgtcaaatta tcaaagagtt aatcaacctt ataataataa taacaatttt 1800 aataatcagc cacgtaattt tccagctcag aattataaag gacaacagcc caattacagc 1860 aggccttata ataataataa tgttaataga ccttataatc ctcaaaatcc aaatacaaat 1920 aatcaaaata cttatcaacc tccaccacct ccaaatggaa atcaaggaag tactgtcttt 1980 tgcgtgtatt gtaatactga aggacatact gtaggaagat gtagatgggt aattcaagat 2040 aagcaagctg gattattgca agataggatg gatggtttct tcttaccttc aggaaatgta 2100 aggatgttta agcaaggaga agatggaatc aggcagcaag ttattagata ttctgaacag 2160 caagccaatg atgctacacg tcaaaatata caacaacagc gtgtaacaac tcctgctgta 2220 catgttaata cacctgtagt accacctgta actcctacag ctattattac taaaaatgac 2280 gggaaagata aaattcaaga gcttaaaaca agttgtggta ttcttgaagg gaattggagg 2340 atgcctgaaa tattaagaag taatgggaat gaattattgg gaggaggtgg aactaaaata 2400 tatgatatta cagaaaatac atgtagtgga aagactgtaa ctcaagctga acctaagaag 2460 aagaagaagg ttactattaa tgaagaagct gaggatattg atgatggtat ggatttagat 2520 ataggaaata atattgaaga agctggtata aatgaaatgg aggaaagctt 
aagaagaatt 2580 catgctgatg atgataatag aaacattaag gaaggttcaa ataaaacaat tagaccttct 2640 tatataatgc caggacacgg ggaacagaag aataaattgg aaaggccagt ctttgcagac 2700 aatccaagag ttgtaacaga cgtggttcag aaattaatga agagtgttat tgatattcaa 2760 gttaaggaat tatgcgccat ttctcctgtg gtatctgaag aagttaagaa gtgggtttca 2820 aagagaagat taaatatgaa tcaagaaaca atggcttcag cagttaatag tacagcttta 2880 tctgactttg aatttgattc tgaaagtaat tcatctctgg aagttaattt acctcagtct 2940 cctttatatt ccacgccttt gggatatgtt actattacaa ttggtaaaat taaattaaaa 3000 gcattggttg attcaggatc tcaaattagt ataattcctc agaagtttat gcaattgtta 3060 cctgttcaaa ctatggttaa tattaaagga gcagtaagag gaattggcgg acagaagact 3120 gttttatgtg ggattgctga gaaagtgaaa gttagaattg gtagaaatat tgaaggcttg 3180 gttcaattct acgtaactga ggagagtcaa actccaataa tattgggaag atctttctta 3240 tttgattatg ctgttcaatt ggatttcaag aaagatttag gggaaatgat gacattaaaa 3300 gataatagag gagtatggat gaaggtgagg ttatgtgaac ctgataaagg aaattgggaa 3360 agggaattaa atgtagaaat tgaagaaagg caaggttatg cagcacgtga tttatgttta 3420 ggaattgatt atgatatagt tgaagatgaa gatgatgaag aagaagtatt tgggttgcaa 3480 tttgagaagg aaagtgtgga ttatgattta ataaatgaat taagcttaat tatgtgcttg 3540 gaagatcaag aaatatttgg gagagggcgt gaactcaagg atttaaggag ttatagtacc 3600 aaatataaga ctgtagataa gaaggttaaa cctgtaaatg cgccaatgcc acagtattta 3660 aatcttcctt tacaaagacc acctttatca agggatcctt atataactcc tttaattaaa 3720 aatcctccag tcttcaaaga aactgagaaa accacggtgg aaaggcttaa gcaatgcaac 3780 tttggtcctg cagaatggtt aagtgaggag gaatggaaat tattcttatt aatattggta 3840 ttaagggaag cagcaatagc ttattgtgaa gaagaaagag gattattgaa gcattcatat 3900 ggacaacctt atgcaatacc cgtggtggaa catgagcctt ggcaaattag acctattcct 3960 atacctgtag caataaggaa tgatttcatt gaattagtta gacaaagaat aagaacaggt 4020 ctttatgaac aatccacatc aagttatact agtcctgtat tctgtgtact taaaggtgat 4080 ggaaaattaa gggtagtaca tgatttacaa gtattaaata gagtaacaat taaggactct 4140 ggggtaccac ctgctcctga agatttcgtg gaatcctttg ctggaagagc ttgttatgga 4200 cttggtgata taatgggagg ttatgatgaa agggaattag cagaattttc acgtccttta 
4260 acaacttttg aaaccccttt gggaagatta caattaacta gattacctca aggagcaact 4320 aattcagtag ctgtatatca agctcagatg atgtggatat tgcaagatga aattcttcaa 4380 cacgtgggaa tttttattga tgatggtggt attaaaggac ctgaatcaga ttataataat 4440 gaagtattaa aagaaaataa tggtataaga aaatttattt gggaatatgc cgtggtttta 4500 gaaagaatgt tgttcaggat tgaagaagca ggacttactg tgtctggaaa gaagtttgct 4560 gtgtgtgtac ctgctttgga aattgttgga catattgtaa gcaagaaagg aaggagtata 4620 tcaattaaaa agaagaataa gattcaaagt tggccaactc caacaaataa aactcaagta 4680 ttaggatttt tgggcacgtg tatatatgta agaatgttta ttcctaattt tggtcatttg 4740 gcagcacctt taagaaggct tacaagatta aaagtggagt ttatatggag tcaagagtgt 4800 gaggatgctt ttaatttatt gaagaagatt gttggggaag atataatatt atcagctttg 4860 gattataatg aagaagcaag tccaattatt ctagccgtgg attcaagtca tatagcagct 4920 ggaggagtat taatgcaagc ggatggaaat ggggaagtca ggccagtatt ctatgaatca 4980 gaagtattta ctgaggtgga atcaaggtat tctcaaccca aattagagct atgtggcgtg 5040 gcaaagattc ttaaaaagtt cagaacaaag ttatggggtc aacattttga attacaagtg 5100 gatgcaaagg ctttaattca gatgattaac acgccaagtt tacctaatgc tgctatgaca 5160 agatgggtag ctttcattca attattttca tttgatttag ttcataaaca tgctaaaagc 5220 tttggaatgc cagatggatt atcaagaaga atgcgtggaa gtgattcaga ttcagctgag 5280 agttttaatg aggaggataa agatattaaa gtatatgaaa gttttaaatt gcagaatgag 5340 gatttaacat cagaaaaaga ggaggaagag agtggaacaa tttgggcgca agaaggtgtt 5400 tggaagtatt tacaagagta tttaacaact ttaaaaagac ctgaaggatg cttggatgat 5460 gagtttcaaa ggcttaataa tagagctcct tcttattacg tggaaaatga aagattgaag 5520 aaaagaggag taccttatgg aagattggtt gtttcaaatt tagaaggaca aaggtttgtt 5580 ttagagaaat tacatgagga attaggacat agaggaattg aggagactta taaaagatgt 5640 ttggtaagat tctggtggcc tgaaatgaag gaatttgtta gagattgggt attgacgtgt 5700 gaagtatgtc agaagaaaag ttcaaagaaa gaaaaagaaa tgggaagggc aactggtgaa 5760 agtactttat ttggaagagt gagcttggat gcaattcata ttaaggcggg taattataat 5820 tatgctataa tagctagaga tgatttaact ggatgggtag aagctgcacc tttaaagaag 5880 ttaacagcac ctgcagtggc caaatttatt acgaaagaat ggttaatgag atatggttct 5940 
gttaaatgtt ttactgtgga tggaggttct gaatttaaag gggtatttag agatgcaatt 6000 gctatgtcag gatcaaaatt aggggaatcc acggcatatt ggccacagtc agaaggaatg 6060 gttgaaagag gtcataagga tattaaagga gctttggtaa aattaagtga tgagactggg 6120 acttcttggg aaacatttct acctcaagta ttatttgcag acaggatttc cacgaaaaga 6180 accactggat tatctcctta tgagatgtta tttaatcaac tagcaatttt acctgtggat 6240 ttggaagctg ggacattttt aggtattgat tgggatatta tttacactgg agaagattta 6300 ttaaaagcaa gaatggaaca attattggga agagaagaat taattaagaa ggcttatgaa 6360 aagatgatga aggcaagagt agatggaatt atatattggg acaagaaaaa tgcgcataga 6420 ttaagaaatc cattagagaa gggaactttg gtattaactt ataacaaaag tttagaattt 6480 caatggggaa aattatttaa aaatagatgg aatgggcctt ataaggtagt tgaacaacat 6540 atgggaggtg cttatattct agaagagtta gatggaactc aattatcgcg aaaatatgct 6600 gctgaacatg ttaaaagatt ttatcctagg ggtgtggcaa tattagcaga agaagaggag 6660 gtgggcattg aagaggaaga ggcgcaatta taagtataaa ttaccactaa taatattatt 6720 aatttcaaaa gataaaaata tatataaaag attaaaagaa ttaaaagacc agaataaaaa 6780 tataagataa gatttaaaaa aaaaaaaaat aaggatatgt ttcactagtt atcagtataa 6840 taattatatt aaaattttaa aagaagatta aaaattaaaa atgtacaagt taaaagatat 6900 aaaaaaaaaa ttaaggagaa gaagggaaaa ggcgtggatc aagccagctt ggaagttcat 6960 aaatagtagg agtatattca gaataattat aatcagtata aggagaattt acagcgggaa 7020 ttacaggagg actattatta acataattaa ttagaggagg ttcagtatta ggaggattgg 7080 tagaaggagt actgtagggg gataaaatag cagcatttgc acgctcagca tgattaagga 7140 atggaataaa agttgctaca gtcaaacact catcttcacc ccttcttgaa acatactcag 7200 ctatagaatg tgcacgcaac aaatgataat gattattaat tatgtttgga acttcaaagc 7260 ctgcttcttt aaattgtctg attgtatcat ccaggatcct ccaagaatgc caattctgat 7320 aagttatata atcagttttc cacgccaggt tatgttcata taatctttca aaaaatggat 7380 cagatgtagt aaagttggtt tcaattgatg aagtaataga gggattaggg gaatcaggtg 7440 ctgaaggcgt gttatttaaa ttaatattat aagggttaag gaaatttatt ctgataggag 7500 tattaggcgt gtttggtgga gggtaattgt tggcaggaga attaggtaca ggtgaattat 7560 tggatgaagg cgctgagggg taggctatta aaattaaata aagaaaagac attaaaaatt 7620 agtaaaagaa 
ttaaataaaa acattgaaac ttaatataaa gaaatagaag aaagaaattg 7680 gacttacagc atattattcc tagactatgg catctttggc aagcaggatt gatttgatag 7740 tggcaggagg agtttaccat ccggcattgg acacaaggtg aagctacaga aggtgaaaca 7800 gtgttgacca tattggaaat aaaggaaaat tcgagaattt agtgatttgg agtgtttaat 7860 tgaattgttt aattgataaa ctgggggcag tttggaagaa gaggagga 7908 // ID Gypsy-15_RO-LTR repbase; DNA; FNG; 512 BP. XX AC AACW02000285; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_RO_; KW Gypsy-15_RO-I; Gypsy-15_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-512 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000285; Positions 5705 5194. XX SQ Sequence 512 BP; 183 A; 97 C; 66 G; 166 T; 0 other; tgtgacgcag ttggtgtaac ttttgctgaa gaaaggactc agtcctagga aggactcagt 60 cctaggaggg actcagtcga aggagggacc tcagtcgaag gagggacctc agtccctaaa 120 gataaacaaa aaaaaaaaat ataataatca tataaatatc ccctcttata atttgactga 180 acttcacttt taaaaatatc ttttattcaa ataaacatcg agttcctctt tatcaaaact 240 ctttattttt attttataac tcaagcaagt caagtccttt gaagtcaaat tatcttcatc 300 gtatcacttt tatattctct tatcaagaaa actctttatt accttttact gtttgatcta 360 caatacaaga tctggtggtc acttactaca agcaaaccaa atttattcat ttaaacacaa 420 atctttgatc tcaaattctt ttaaactcta aagaaaagtc aaatttactt ctaaactctt 480 aaggccatta caacaaaaac gttaaagttg ca 512 // ID GYMAG1_LTR repbase; DNA; FNG; 208 BP. XX AC AACU01000830; XX DT 01-SEP-2005 (Rel. 10.09, Created) DT 01-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE GYMAG1: Gypsy-type LTR retroelement from Magnaporthe grisea (LTR DE portion). 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; GYMAG1_LTR; GYMAG1_I. XX OS Magnaporthe oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RA Dean R.A., Talbot N.J., Ebbole D.J., Farman M.L., Mitchell T.K., RA Orbach M.J., Thon M., Kulkarni R., Xu J.R. et al.; RT "The genome sequence of the rice blast fungus Magnaporthe RT grisea."; RL Nature 434(7036), 980-986 (2005). XX RN [2] RP 1-208 RA Jurka J.; RT "GYMAG1: Gypsy-type LTR retrotransposon from the rice blast RT fungus Magnaporthe grisea."; RL Repbase Reports 5(9), 243-243 (2005). XX DR EMBL/GenBank/DDBJ; AACU01000830; Positions 10606 10813. XX CC LTRs are identical and there are two distinct ORFs. CC This appears to be a recent insertion. XX SQ Sequence 208 BP; 45 A; 61 C; 47 G; 55 T; 0 other; tgtgaggatc aggcccctag acctaccctc agaatcacga ctcggctcca caccgagcca 60 gggatcggac aaggatccac tacctccgga tttaagcgcg tctcttgagc gcgtcgtcga 120 ttgtaatttc tctcttctct tcagtttaga attttcgtcg atagcgccta atacactgag 180 gttctccaat gtatgcctgg ccgtgaca 208 // ID Gypsy-66_MLP-LTR repbase; DNA; FNG; 356 BP. XX AC AECX01002913; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-66_MLP_; KW Gypsy-66_MLP-I; Gypsy-66_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-356 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002913; Positions 3578 3933. 
XX SQ Sequence 356 BP; 72 A; 71 C; 75 G; 138 T; 0 other; tgtaaggaag agtatgggta cgagttagtt acatatagac tatggggtta cagagtagag 60 tgtagtgttt cttatataga ttgtccatcc tctttccttt ttggtttctt ctcttgagga 120 acccttttcc tccttctctc tactagtagc ttttcttatt gattgttcgg tgttttatag 180 gtatgactcc ctcaggagga acccttttcc tccttctctc tactagtagc ttttcttatt 240 gattgtttgg tgttttatag gccttttaag aaatcatact caggtgctac tttcgtctgg 300 cttcggccgt gtgccctcag aatcagtgaa ggtcgactag agagatcgac cttaca 356 // ID Gypsy-44_MLP-I repbase; DNA; FNG; 10949 BP. XX AC AECX01001168; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-44_MLP_; KW Gypsy-44_MLP-LTR; Gypsy-44_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-10949 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001168; Positions 90890 79942. XX CC Positions [9747-10226] - Integrase core CC 'CCACT' target site duplication CC LTRs are 100% similar to each other. CC Contains an insertion of a Mariner-type transposon at positions CC 415-594 (masked by "x"). 
XX FH Key Location/Qualifiers FT CDS 618..1652 FT /product="Gypsy-44_MLP-I_2p" FT /translation="MEDIQRQLAELTNALNEERTLRKQAEDRLMAQMQGVQ FT SEAAGTAAQQAPPPPPNPSATAPTPKGPKVSTPDKFAGARGSQAEVFASQV FT QLYILAHPHMFLDDKSKVIFAISYLTGNASSWAQPMTRELINPATSHLVTF FT DQFVSNFEAMYFDTEKKSKAEKALRSLTQKTTVAAYTHDFNMHAVNSGWET FT PTLVSQFEQGLKKDIRVAMVLVQETFTTIEQISNLAIKIDNKLHGVADTSS FT AFHTTTPDPNAMDLSAVYTRLSDKDRAECMRTGSCFKCNVQGHRSRDCPNK FT KFSKSGGSKNHYKSKIAELEVKVAEMSRGRDDFNDRGEGSSRADASKNGGA FT QE" FT CDS 1652..3886 FT /product="Gypsy-44_MLP-I_1p" FT /translation="MKVVPILGERREEELMELGASQIVMCNIKDPRLYFMT FT TLSTSPLSRATTIKTPRANFLIDSGATHNVISLSFAREHGLLEHAINSTCD FT ISGFDGSTSKSSHEIDLIFENDTTPTTFIITKLKNTYDGILGMPWLRRHGH FT SIDWTLSNNDTNPTCIATTSAVSPSPKTTSTDASLGPLGTARRLDKGMCIY FT PDTLASPQCEYDLTLHHPQIEAASKLLSPQNTTYRTVTDNHTDIADDTGHL FT NTPFNSETPTIASDIPDSSIPPQNPVDPLEEPQGHVRTSDEGARVSDDKIT FT PPQCEYNLTTPSHIAETAGKLVFPQKYDHTHRPNTDFRAGQHKPTTPTPST FT FQAADSKHIASVTPDSSIPPQNPVDPLEEPQGYVRTSDEGARVSNDTITPP FT QCEYDLTTSPHVTETAGKLVFPRTPLNLEIDTTKTSWSTSARLAAEVNRKT FT PLKTVEQMVPSCYHRHLHLFNKSQAQRLPPRRRYDFRVQLVPGAQPQASRI FT IPLSPAEDAALDEMIKTALANGTIRRTTSPWAAPVLFTGKKDGNLRPCFDY FT RKLNSLTVKNRYPLPLTMDLIDSLLDADKFTKLDLRNAYGNLRVAEEDEDK FT LAFICKQGQFAPLTMPFGPTGAVGFFQYFMQDILLSRIGKDTAVYLDDTMI FT YTKKGEKHEPAVDSVLETFGKHQLWLKPEKCEFSKTEVEYLGLLISYNKVK FT MDPTKVKAVTEWPAPKNVSELQRFIGFANFYRRFIDHFSATTSNG" FT CDS 9009..10850 FT /product="Gypsy-44_MLP-I_3p" FT /translation="MTTIQLTRRQARWAETLGCFDFIIKFRPGRQATKPDA FT LSRRPDLAPTKEGKLTFGQLLRPDNITPDTFIDSIEVASIESFFENEDIDL FT QDTDKWFEIDVLGVSNPTDNLVTEESRAPSDEEIINEIRQATSNDQRIQEL FT INVVQNPISSKLKSAVSKYNVKDGILYNQNRIEVPQVNAIKLLILKSRHDS FT LIAGHPGRSKTLSLARRCFSWPGMKAFVNRYVDSCDSCLRVKTSNQQPFGS FT LEPLPIPAGPWIDISYDLITKLPESNGKDSILTVVDRLTKMAHFIPCKESM FT SSDELADIMVKEVWRLHGTPKSIISDRGSIFVSQLTKELNTRLGIRLQPST FT AFHPRTDGQSEIVNKAIEQYLRHFVNYRQENWESLLPTAEFSYNNKDHVSI FT GVSPFKANYGYNPTFGGIPLKEQCIPKVEERLKLLQDVQSELSECLKLAQE FT EMKNQFDKSVRPTPDWKIGDQAWLNGKNISTTRPSPKLSHRWLGPFDITEK FT ISPSVYRLKLPSTMKGVHPVFHVSLLRKHNTDSIEGRQPATPTPITIDGND FT 
EWEVEEVLDCRIKNGRREYLISWKGFGAEENSWEPLSHLKNSDGLVKDFDK FT EYPKAAGQHRRRRRKR" XX SQ Sequence 10949 BP; 3357 A; 2428 C; 2452 G; 2532 T; 180 other; tattgtccaa tccatccaga caagcattga agcatcaata acgaattcga attacagaaa 60 tcgagacccg actagatacc actcaccaga acttagatta tagaactcag attatctaat 120 catagatagt tttagaattg atcagaactt aattgactca ccgatatcag atctgaactt 180 accgagactc cggattgtgc attcagaaca aactagaaac ctcaagatta ataagttaaa 240 aacttagaat taccgaaact ttaaacagcc tctgaaccgc cttacaccca cctttactcc 300 gacaaccacg tctccctcat acagaacccc gccagacaac gtcgatctcg acagcgacaa 360 cgaacaaacc ttcatcgacg ttcgcaccga atcttcaacc tcacatcaca ctacxxxxxx 420 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 480 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 540 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxacctcg 600 tgattcacac cgaagccatg gaagatatcc aacgccaatt ggctgagctc actaatgctt 660 tgaacgaaga acgcactctt cgcaagcaag ctgaggaccg attaatggct cagatgcaag 720 gtgtccaaag cgaagcagct ggtaccgcgg cacaacaagc accaccaccg ccacctaacc 780 cttctgccac cgctcctacg cctaaagggc caaaagtctc tacgcccgat aaattcgcag 840 gggctagagg aagccaagcc gaggtgtttg ccagtcaagt acaactgtac atcctggccc 900 acccccacat gtttcttgac gataaatcga aagtcatatt tgctatatcg tatttaacgg 960 gaaacgcgag cagctgggct caaccaatga ccagggaatt gatcaaccca gccacttctc 1020 atttagtcac ttttgaccaa tttgtgtcca acttcgaagc tatgtacttc gatacagaaa 1080 agaaatcaaa agccgaaaag gctctccgtt ctttgactca aaagacgacc gtagccgcgt 1140 acacccatga tttcaacatg cacgccgtga actctggttg ggaaacacct actttagtta 1200 gtcagtttga acaaggtctc aagaaagaca ttagagtcgc tatggtcctg gtacaagaga 1260 cgttcactac aattgaacag atctctaatt tggcaatcaa gatcgacaac aagttacacg 1320 gcgtcgccga tacctcgtct gcatttcaca caactacgcc cgaccccaac gccatggacc 1380 tgtctgcggt ctacactcgt ctatctgaca aagaccgggc tgaatgtatg cgtactggat 1440 catgttttaa gtgtaacgtt caaggtcata gatcaagaga ctgtccaaat aagaagttta 1500 gtaaatcagg tggtagcaag aatcattata aatcaaagat agcagaattg gaagtgaaag 1560 tagcggagat gagcagagga 
agagatgatt tcaatgatag aggagagggt agtagtcgag 1620 ctgatgcctc aaaaaatgga ggcgctcagg aatgaaagtt gtgccaatcc tgggcgaaag 1680 aagagaggag gaattgatgg aattaggtgc tagtcaaatt gtaatgtgca atataaaaga 1740 tcccagatta tacttcatga ctacactatc cacaagtccc ctatccagag ccacaaccat 1800 taagaccccc cgagccaatt ttctaattga ctctggtgcc actcacaacg tgattagcct 1860 ttcattcgcg cgtgagcatg gactgttgga acacgcaatt aacagtacct gcgacattag 1920 cgggtttgac ggttccacat ccaagtcatc tcacgaaatc gacctcatat tcgaaaacga 1980 caccacccca accactttta tcattacaaa actgaagaac acatacgatg gcatattggg 2040 tatgccctgg ctacgacgtc atggccattc tatcgactgg acgttatcaa acaacgacac 2100 gaacccaacc tgcattgcca ccacatctgc ggtgtcgcca agcccgaaaa caacctcaac 2160 ggatgccagt ttaggcccat tggggacagc taggagactt gacaagggga tgtgtatcta 2220 tcctgatact ttagcatccc cgcaatgtga gtacgatctc acacttcatc acccccaaat 2280 cgaagcagct agcaagcttc tctctcccca aaatactacc tacagaaccg ttacagacaa 2340 tcacaccgac atagccgatg atactggtca cctcaacaca ccattcaatt ccgaaactcc 2400 tactattgcg tctgacatac cagactcgtc aattccgcca caaaaccccg tcgatccttt 2460 ggaggagcct caggggcacg ttaggactag tgacgagggg gctcgagtca gtgatgacaa 2520 gattacgccc ccgcaatgtg agtacaatct cacaactcct tcccatattg ccgaaacagc 2580 tggcaagctt gtttttccac aaaaatacga ccatacacac agacccaaca cagacttcag 2640 agcaggacag cacaaaccaa caacccctac tccaagtacc tttcaggccg ccgacagcaa 2700 gcacattgcg tctgtcacac cagactcatc aattccgcca cagaaccccg tcgatccttt 2760 ggaggagcct caggggtacg ttaggactag tgacgagggg gctcgagtca gtaatgacac 2820 gattacgccc ccgcaatgtg agtacgatct cacaacctct ccacatgtta ccgaaacagc 2880 tggcaagctt gtttttccac gaacacctct gaatctagaa atcgacacca caaagacttc 2940 gtggtcaacg tcagcccgtt tggcagccga ggtaaacaga aagacaccac tcaagaccgt 3000 ggagcagatg gtcccgtcgt gctatcacag gcacttacac ttattcaaca aatctcaagc 3060 tcagcgatta ccgccaagac gaagatacga cttccgagtt caacttgtac caggagcaca 3120 gccccaagct agtcgcataa taccgctttc tccagctgag gacgctgcgc tcgacgagat 3180 gattaagaca gcactggcaa acggcacgat acgtaggaca acatcccctt gggcggcccc 3240 tgtcctcttc actgggaaaa aagatgggaa 
cttacgtcca tgctttgatt accggaagct 3300 aaattcacta acggttaaga atagatatcc gctgccgtta actatggatc tcatagatag 3360 ccttcttgac gccgacaaat tcactaagct cgacctacgt aacgcatatg gaaatctaag 3420 ggttgcagaa gaagatgaag acaagttggc ttttatttgt aaacaaggtc aatttgctcc 3480 tctcacgatg ccatttggcc cgacaggagc agtaggattt ttccaatatt tcatgcagga 3540 tatactccta tctcgtattg gaaaggatac cgcggtgtac ttagacgata ccatgatcta 3600 caccaagaaa ggcgaaaagc atgaaccggc agtcgacagt gtactcgaaa cttttgggaa 3660 acatcaacta tggttaaaac cggaaaaatg tgaattctca aagactgaag tcgaatacct 3720 tggattactc atatcgtaca ataaagtgaa gatggatcct accaaagtga aagccgttac 3780 tgagtggccc gcacctaaga acgtatcaga attgcaaaga ttcatcggat tcgcaaactt 3840 ttataggcga ttcattgatc acttctctgc aaccaccagc aacggttgaa gcgccccgct 3900 ccgccccgct ttcaggcgtt tttttagcag cgggacgctg tcccgctaca taggggaatt 3960 gctcccgccc cgctttatgt tcagacccgc tacatcattg atgcagcggt gcgtgtgctg 4020 gacccgctct cgccccgcta ccttcgtttt gaagaaaagg aagcgggatg aggcggaaga 4080 ggtatcgaat tttttttaat atgcacaaag aaatctatat tgacatagga aaattaaagt 4140 actcaaaaaa cgttgataag gagtgactac aaataatagg atgaaatcga gtataggagg 4200 acaagcaaca aaacataaaa atttcaaaaa cagtgacaat gagcgacttg aaacattaat 4260 aaagtaaaga aaaaataaac aaaactaagg aaaaaaacat aaagtattca aaaagattat 4320 taaggagcaa cttgaaacat tagtaaagga aaaaaggaaa taactgaaga aacacttgag 4380 aagatagatg agataaaaca acaaaaggtt gacattatga gtaagcttgc tcagttttgt 4440 ttgtgagcat gagtttgaaa ggtggcaaga tatttgatag cttctgagaa cttgtcatca 4500 gtggtaatgt catgtcttaa cgattctcga cagccaacag cacgagagat ggtttttggt 4560 ttcaacgaac cacaggcagg aacagcaata tcagctgcca aagaaaaaac acgttcctcg 4620 gcgcatgagg tagctgaaat gctgagatag tcacgggcca ttgacgcaag gacagggtag 4680 tggcaagaat tatcctaaga caatgaaaca tacgggtcag ttcttgatta ttgtgtgtag 4740 gttatgagag aaaagataac tcacgtccca ccaagacagc tcgctttggc ccgggaggat 4800 tgggtggttc ccttggaggt atagctcgag ttcggccgaa cggacagaat ctgacgtttg 4860 tacggatgcc attgaagtga atacatcaaa ctggtcgaat acttccccag aaacagggga 4920 tggagtcggt tcagggctgg aggctggagt agcaggccaa 
gcagcaagag tcttctgaaa 4980 gatttcttca atgcactctt tggcttgtgc agcttcattg gggtagtgta agtcgaaaaa 5040 cttgagtcga tacttaggat tgagtatggt gaccactatg atcacttcgg cggatacagc 5100 ttcattgcgg tacttgacca gttggtttac catggtttct accatggttg cgaggcttgg 5160 aaagattgtc gtgtcggata tctcactgag cagatcgctc actctgagat actcggccag 5220 aagcaaggcg ccagttggcg tattgccctc catgcgctta gtcgaatgac gaaaaaccta 5280 gaaagcaaac atcagaactc tgatcacgaa aagaacatac ccattgagaa attgaaaatg 5340 aaagaaagaa gtacttactt ctagaacttt acataatgtc tccacagcat cccactcatc 5400 catcgaaatc attatgcctt gatagtactt ggttttccta tcagcaataa ggatggaatc 5460 gaccacctga aatgatatgt acatttggtc agtaatgctc caatcaatgt ttcacaaaaa 5520 aaacttacat ctcggccagc cacgagccga gagaacatct ccagatcaat gttccaacgg 5580 actccccaac ccgcaattgg acctggtcca cgatagccca tctgctcagc tcgagcttga 5640 aactcacctc gcctggcatt gctgcgagtg aggagagcat tgaattcttt cagctacaaa 5700 cggaagaacc gattgaatca attttaagaa gacataacac attagataaa gataaaaaat 5760 acctaccttt aatgcacctt ttaccacaac gcagccatta gtaggaggat cctcgtaatc 5820 tggatcatcc tcgtgctcat cgtcttcagc tttttcatgt cttgattgaa attcagattg 5880 agagttgatg acatcgtctt catcatcaga ttcgtcgtga tgaacacttt cggtcagatc 5940 atcattgagg ataagagcag gaacagggac tgaagtgttg gccggggtag taggcttgat 6000 gtgacctgca cataggccta gtactttgag gcccttcttg acaactaagc caagcttatg 6060 ggcgtagcac cgaacgtggt gttgcgagga atcccaatca atcggattgt caccttcgga 6120 aaacataatt tccatttcgc gtgccatggt gttgtttgaa gcacctgaat cagttgtctg 6180 gcctaaaatg taagaaattc aacaaatcaa acattagtta cacggaagga tgaatgagaa 6240 gaaatgaaaa attacgtatc tttttgtgca agccgtgagt catcaagaaa cgaccaactg 6300 gtttggctaa aagatagcca tatttattcc aagctaccaa cttcaaggta agatgaacag 6360 tccgaaaatt ccactcgtca tcgatgaaat ttgctgatac accaatgaac gcatagcgat 6420 ttcccttggt ggtccaaacg tcatggatca atgagaatct acttcgattg tcctgttata 6480 catatcattt gtcagtaccg ggtatgaata agttggccag agatagaaaa gaaagaactt 6540 acctttacta aggcaatagc cgcggtactt aagctattat acaactcctt cgccacacca 6600 gcagcccaag tagcactcct caaatcggcc cgacggttgg ccagcttgaa 
tgcagcacgc 6660 aggggggaat cgtagaaacg attaaatggg agggcgtgta aaaggatcca cattaccagc 6720 acaatgttgc acaactcgac attgaaggtg gtacgtgata aaaatgaatc gagcgtaggt 6780 tgaccatggg cttcgtttga agaagtatcg agcgtggcct gttcttgagc ccacgatggt 6840 ggtaattggc agccagcttg tatggcacgt gatctttgag gacatgctgg ccgaccctta 6900 gctccatcgc ggtgagcata aagattgtag tgagtccgac gtccacgggt gaaccaagtt 6960 gcgcaccacc ggcacttgta acggcggggg tcatcaccgg tctacatgaa acaaaagatc 7020 aagtgttagt agctgataat ctgacagtag aaaaaaacga tggataaaga aaaaaagaaa 7080 attacaaacc tcacccttga catgaattgg agggtgatag tactccaaca gcttggagac 7140 ccgtgccagt tgggcacggc gaaccccttc agtttgaaca acctcgatat ccgaatcctg 7200 attgatatcg atacaggcag ggaccccagt ctttaggata tggtgaggat tgaaagcgaa 7260 gagatgatta gaagaaaaga aataaaaggg caaacgcaac aaaagtaaaa gagaagatat 7320 gagtcgacaa attgaagtaa agggtgaaac tgaaaagcga cttactggga ctggtacagg 7380 aacgactttc ttggtcttgg ccttctttcc tggaccttga accaaagtca ctgaggcacc 7440 agagccccga ggtcgacgct tggcacgagt gctgttcacc ttgacgatct gagtcttctt 7500 ctttttggta atctttgcca ggaagtcttg aggttgagct atctccacga cggaatcaga 7560 gtcggattca ttggtagtgt gagcaggatc gaaggcacga ctatcaaagt tgatgtcgga 7620 aggtaattca ctatcgctgt cttcgttgtt gtcgccctta ccggctccgt cgctggaagg 7680 acgttggact gagacttgcg agtctgggga ttgaatcaca aaaccaggct taggagttgg 7740 actagtaggg cttctagggc gaagcgtagg accggtactg gtaggaggag caagagttga 7800 catagagtga gccatagttg ttattggaag gtatcaacag tacgatgaga aagaattgag 7860 aacggtgttg ttattggaag gtatcaacag tacgatgaga aagaattgag aacggtaagt 7920 ttttgtttga aaagaataca atcagtaaaa taattgctag ttttttttat gatagattgg 7980 tcaatgatcg tgtgtgggag gaaggatcaa ggtactcagc ttcttatgaa gatcgagtac 8040 ggtgatatga acagcttctg tcacctgata cgcctaaaag aattgagtta cgataaccgg 8100 aggggttgca tgaaagaaat ggggttgatt gggtcagttt ggtgcgcttc aaagacgtga 8160 ccaaagaaga aaaagggctc acaatatgct tcaggccatt cgagtctgcc ggacatcaag 8220 attagcgtta ggtcgaagat ttggcttgtc gagtggagga taagggatga tattgtgttt 8280 gtttcgtgct cgttcgatgg ggttgcaggc gctgtcaaag atttggctgg tcaaaaattg 
8340 tgtatcgagg gtttggaagc ctgttcacgc ttgtcggttg gggttgcagg cgccaaggat 8400 cgatatgagc tggtctgaga atgatatcaa ggttgttatc ttgcatcatc tgagattaag 8460 cgctacatcc cgctctgagc aaggggtgcg gggcgcaaat ctaaatcgcc ccgccatttt 8520 tacagcccgc tacaacccca gatttagatt ggggcgatgg cgggttgctt tgacccgcta 8580 caaaaaaaaa cgtcaagcgg tgcgcttcaa ccgttgccac caggccactg cacaatctca 8640 ccaaacacaa gactgtcttc aattggagcg agcgttgtaa tcaagctttc gaacacctga 8700 aaaccgcctt tacaaccgca ccagtcctca agatagctga cccgtataag gcctttatac 8760 tcgaatgtga ctgctcagat ttcgcactgg gagctgttct ttctcaacga agtgatgacg 8820 atggtgaaat acaccctgtg gcctatttat cacgatccct aattcaggcc gaacggaatt 8880 atgagatctt cgataaggaa ctgttggcaa ttgtcgcggc attcaaggaa tggtgacatt 8940 acctagaggg aaacccaaac cgactggaag tcgttgttta tacggaccac aggaacttag 9000 aaacctttat gacgactata caactgacaa gacgtcaggc tcgttgggcc gaaacgctgg 9060 gttgtttcga ctttataatc aagttcagac ccggtagaca agccaccaag cccgatgctc 9120 tatcaagacg tccagaccta gctccgacga aggaaggaaa gctgacattc ggccagttgc 9180 tacgcccgga caatataacc cctgatactt tcattgattc cattgaagtc gcttcaattg 9240 agtcattttt cgaaaatgaa gacatcgacc tacaagacac tgacaaatgg tttgaaatcg 9300 acgtcctagg agtatctaat ccaactgata acctagtaac cgaagagagc cgcgcaccgt 9360 ccgacgaaga aatcatcaac gagatacgtc aagcaacatc aaatgaccag agaattcagg 9420 aacttataaa tgtcgtgcag aacccgatat cgtcaaaatt gaaatcagcc gtatcaaaat 9480 acaacgttaa ggatggaata ctttataatc aaaaccgaat tgaagtacct caagtcaatg 9540 ctatcaagct actcatatta aaaagccgac atgacagctt aatagcgggt caccctggaa 9600 gatcaaagac tttaagcttg gcccgtaggt gcttctcctg gccaggaatg aaagcttttg 9660 tgaatagata cgttgacagc tgtgattcat gtttgagagt taaaaccagc aatcagcagc 9720 ctttcggatc cttagaaccc ctgccaattc cagcaggccc gtggatcgac attagttacg 9780 acctaattac caaattgcca gaatccaatg gaaaagacag catcctcacg gtcgttgaca 9840 gattgacaaa gatggctcat ttcataccat gcaaagaaag tatgtcctca gatgaattgg 9900 cggatattat ggtaaaggaa gtatggcgtc tgcatggaac accaaaatca attatttcgg 9960 acagaggctc aatcttcgta tcacagctga caaaagagct gaataccagg ctaggtatac 10020 
gattgcaacc gtcgacggct tttcacccga ggacagatgg ccaatccgag atagtcaata 10080 aagctatcga acaataccta agacacttcg ttaactaccg tcaagagaat tgggaatcac 10140 tactaccaac agctgagttt tcatataata acaaagatca cgtgtctata ggggtttcac 10200 ctttcaaggc taactacggg tacaatccga catttggagg tataccacta aaagaacaat 10260 gtatcccaaa agtggaagaa agattaaaat tacttcaaga cgtacaatca gaattatcag 10320 aatgtttgaa attagcacaa gaagaaatga agaatcagtt tgataagagt gtgaggccaa 10380 cgccggactg gaagattgga gatcaagcgt ggctgaacgg gaaaaatatt tcaactacta 10440 gacctagtcc caaattaagt cacaggtggt taggaccgtt tgacatcact gaaaaaatct 10500 ctccttctgt ctacagattg aaattaccaa gcaccatgaa gggagtgcac ccggtgttcc 10560 acgtgtcctt actacgaaaa cacaacaccg acagtataga aggacgacaa ccagcaacac 10620 caactccaat taccatagac ggaaatgacg aatgggaagt agaagaggta ttagactgtc 10680 gaataaaaaa tggaaggcgt gagtacttga taagttggaa agggtttgga gctgaagaaa 10740 attcttggga gcctttatca catttgaaga acagtgacgg gttagtgaaa gatttcgaca 10800 aagaataccc taaagcagca ggacaacata ggagaagaag gcgaaaacgg tgagagggta 10860 aagctttttc ccatgtggtt ttttaatgct acccggggat aaggatgcag agctggcaag 10920 aggaagcttg ggcattaaat gggggataa 10949 // ID Harbinger2-1_AAp repbase; DNA; FNG; 3550 BP. XX AC . XX DT 13-AUG-2010 (Rel. 15.09, Created) DT 06-JAN-2011 (Rel. 16.02, Last updated, Version 3) XX DE A family of autonomous Harbinger transposons - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW Interspersed repeat; Harbinger2; Harbinger2-1_AAp. XX NM Harbinger2-1_AAp. XX OS Ascosphaera apis OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Onygenales; Ascosphaeraceae; OC Ascosphaera. XX RN [1] RP 1-3550 RA Kapitonov V.V. and Jurka J.; RT "Harbinger2, a novel clade of Harbinger transposons in protozoan, RT fungi, choanoflagellate, and metazoans."; RL Repbase Reports 10(9), 1213-1213 (2010). XX DR [1] (Consensus) XX CC Harbinger2-1_AAp belongs to a novel clade, Harbinger2, of CC Harbinger DNA transposons. 
This clade includes transposons CC present in protozoan (brown alga), fungi, choanoflagellate, and CC metazoans. CC Harbinger2-1_AAp is a consensus sequence of a family of CC autonomous Harbinger transposons that were active in the CC Ascosphaera apis genome recently. The consensus was derived from CC a few copies ~96% identical to it. The genome does not contain CC any full-size copies of Harbinger2-1_AAp. However, it contains CC ~10 full-size copies of nonautonomous elements that share termini CC with Harbinger2-1_AAp. Most of these copies are flanked by the CC ANT target site duplications. Therefore, the same should apply to CC Harbinger2-1_AAp. This transposon codes for the 370-aa TPase and CC 375 unclassified protein. XX FH Key Location/Qualifiers FT CDS 279..1385 FT /product="Harbinger2-1_AAp_1p" FT /note="Harbinger TPase." FT /translation="MDRLILLHILQENARLNANHDDNPRRQVRRRRQGQRT FT HRGNYPRFKFNLDRLHDQDCKLKFRMYKHEIRAILPYMRLQYIIYRYHNKP FT SQELAFCIFLARLSYSTRLYELPYYFGVSVSYIDAVFNDVLDHFGSIYSEF FT MRWDHSRLNFSQMRQYADAIGEGCIWGFVDGTTQKIARPTVNQRMFYSGHK FT RYHCYKYQAVVTPDGLVSSLAGPVEGVRGDWALFHESKIADDIIEEFDSNG FT VPIGEELCLYGDPAYAMTGVTMSSYRRPPGGELPPEYVEWNRHMSTKRIAV FT EHFFSTVKNRWGLMSLHTINRIGSTPVGNLYRAACFLNNCLTCLRGYNQIS FT LQFQCQPPSLEQYLALVDRADASESS" FT CDS 3153..2032 FT /product="Harbinger2-1_AAp_2p" FT /translation="MSYNTGTLFFANETPFGPVSQPVAVPDSIEIGDEWAD FT TAAPPSPTQSIESESNPRRVSSTPTSSFPPSHDGQSQLSQAEDTMPVNNHV FT TAFRPHGDGTKIPDNEKLSLCQFAIAHAEEYRAKRKCDFNDMLGEFCRREL FT GRHVSRPGDVLVRVVKQVEQREKYLLKSTGVAIPDSDLLQAGASWRDIVKD FT VEKIEEAWQDEVNKKRVSGMRENMVTSQHKRANDEPANERLKRQNTNDELS FT RLAQTMLRSEELKVQRQQVELDSARTQRIQEASLEERMGTLESRIDEKLVR FT MEEMMEKTLSSVETLQYQRRQPLPKLTPQEPRQHVWSASQPVSMAASPSQQ FT PSLRAQATQMNPFGLHPPPRFTYGNSNRWDM" XX SQ Sequence 3550 BP; 917 A; 799 C; 878 G; 956 T; 0 other; aggggttttc tgtagcagac ggtcaaatcg acgacggtca aatctgaacc gtggaacggg 60 tagcctgatg cagtttagtg aattacatca aatacgccgc tagtattctg ggcgttttgt 120 gtgtaaaatt ctcggcatag tggataatgt gcatttcatt aaatcttcta gaagatattt 
180 aaatcgacgt cgatcagtcc agatctatag accctgaaga aaacaacaac ctcaattctc 240 aatctaatct cagactgtcg tgtcataata acaacatcat ggatcgtctc attcttttgc 300 acatcttgca agaaaacgcc aggcttaacg caaaccacga cgataaccca cgacgtcagg 360 ttcgtcgtcg tcgacaagga cagcgaacgc atcgaggaaa ttatcctcgc ttcaagttta 420 atcttgatcg actccatgat caagattgca aactcaagtt ccggatgtat aagcatgaga 480 ttagggcgat acttccctac atgcgtctcc aatatataat atataggtac cacaataagc 540 cttctcaaga gcttgcgttt tgcatatttt tggctagatt gtcttattcc acaagactct 600 atgaactgcc atattatttt ggagttagcg tgtcatatat tgatgcagtg ttcaacgacg 660 tgctggatca ttttggcagc atatattcgg agttcatgag gtgggatcac tcgagactta 720 acttcagtca gatgagacaa tatgccgatg caataggcga aggttgtatc tggggtttcg 780 tcgacggtac gactcagaag atagcgaggc cgacggtgaa tcaaaggatg ttttactcgg 840 gtcacaagcg gtatcactgt tataagtacc aagcagtcgt gacccctgac gggctggtgt 900 cgtctttggc agggcctgtc gaaggtgtac gtggtgactg ggctttgttt cacgagtcga 960 aaattgccga cgacatcatc gaagaattcg attctaatgg tgtgcccatt ggtgaagagt 1020 tgtgtctgta tggggatcca gcttatgcca tgacaggggt gacaatgtcg tcttaccgac 1080 gaccacctgg aggagagctt ccacctgaat acgtcgaatg gaaccgtcat atgtcaacaa 1140 aacggatcgc tgtggaacat ttcttctcga cagtgaagaa cagatgggga ttaatgtcgt 1200 tacataccat caatcgaatt gggagcacgc ctgtcggtaa cttatacagg gcagcttgtt 1260 tcctgaacaa ctgtctaact tgtcttcgcg gttataacca aatatcgctc caatttcagt 1320 gtcaaccacc ttcattggaa cagtatttgg cgctcgtgga tcgagctgac gccagcgaga 1380 gctcgacata gagacaggga cagggatagg ggggaagggg agaatagaca cagagtgagg 1440 taaatgatat gggtgagaat tgaacgactt agtcaaattc aagagcttgt atgggagaag 1500 aggaaaggga gagagagagg gatagagaga caattcgctt attctttcta taccctactt 1560 tcttcaatct cctgaattgg acatcaagag cttagtcgag ctctagagag gcaattgagg 1620 atggcaaaca ggccaatttg tttagttaac gcgtcttttg tctttagcaa acagctttta 1680 ttacctagag aggcagtgtt caatatcaat agtgtgcgag tgcagagatt tcaatgtgcg 1740 agtgcagaga tttcattcat gcagctggtg atggtgaagt gaattgcaat tcaggaggtt 1800 aggggttaat catgtagaat tcggaaactg accaatcaga gcgcgtcatt tgcgggctcg 1860 gcgcggaatc 
aattaaaatc caaccaatca cagcgcgacg aaggacttag agaaaaaaaa 1920 agtgatttca aagacaataa cacaaagctc tgaatttgtc taagtacaac tctcacccaa 1980 ctctctaaac cacctcaatt cagtcaattt gtctcttttt cagcttcaaa tcatgtccca 2040 tctgtttgag ttcccatacg tgaagcgagg aggcgggtgc aagccgaatg ggttcatctg 2100 cgtcgcttga gcacggagac tgggctgttg agacgggctt gcagccatcg agactggctg 2160 agaagcagac caaacatgct gacgaggctc ttgaggtgtc agttttggca gcggctggcg 2220 tctctggtat tgcagagtct caacgctaga caacgtcttc tccatcatct cctccattcg 2280 cacgagcttc tcatcgatcc gagattcaag agtccccatc ctctcctcga gggaagcctc 2340 ttgaatcctc tgtgttcggg cagagtccaa ttcgacttgt tgtcgttgga ccttgagctc 2400 ctcggatctc agcattgtct gagcaaggcg gctgagctcg tcgtttgtat tctgtcgttt 2460 caacctctca ttggcgggct catcgttggc tctcttatgt tgactcgtca ccatattctc 2520 tcgcatccca cttactcgtt tcttattcac ctcgtcttgc catgcctcct ctatcttttc 2580 cacgtcttta acgatgtccc tccacgacgc cccagcctgc agaagatcgc tgtctggtat 2640 tgcaacgccg gtggatttca gaagatactt ctcacgttgt tcgacctgct tgaccacacg 2700 taccaaaacg tcacccgggc gagagacatg tctccccagc tctctgcgac agaactcacc 2760 cagcatatcg ttgaagtcac attttctctt cgctcgatac tcttcggcgt gggctatggc 2820 aaattgacag agactaagct tctcattgtc cggaatcttt gtaccatccc cgtgaggcct 2880 gaatgctgtg acatggttgt tgacgggcat ggtgtcttcg gcttgggaga gttgactctg 2940 tccatcgtga gaagggggga aagaagacgt tggcgtcgac gagacacgtc tggggtttga 3000 ctcagactca attgactgtg tcggcgaagg gggggctgcg gtgtcagccc attcgtctcc 3060 tatctcaatg ctgtcaggga ctgcgacggg ctgagagaca ggaccgaagg gcgtctcgtt 3120 cgcgaagaat agagtcccgg tattgtacga catgttggcg ttgaattggg ttgacttggg 3180 ggatttgaag tcgactttga gtaggtgata attcactgaa taaagtctct ttctgcaagg 3240 tatgttgttg ttgttgtttc ttcgctccaa attcaattcc tttgattaaa caaatctata 3300 tacttcccta ctaattataa caagctttcg cctagattac cctacagggc tgatccagac 3360 tttgacaagg agttgacggg tggaaaccaa ggctacggtc aactgaccgt agtacaacta 3420 cgatacagtc gcgacggtca gaacagtgga gtagctgaaa agctcaaaac cgagttaatc 3480 ggtgctcttt cctaatcggg taaaaaaaac tggagaccgt cgacttgacc gtatgctaca 3540 gaaaacccct 3550 // ID 
Copia-36_MLP-LTR repbase; DNA; FNG; 801 BP. XX AC AECX01001637; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-36_MLP_; KW Copia-36_MLP-I; Copia-36_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-801 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001637; Positions 1259 459. XX SQ Sequence 801 BP; 186 A; 164 C; 145 G; 306 T; 0 other; tgttggaggt taatagttgt cggtcaagac aatcaccagt tgccttattt agaaaaacat 60 aaacattgca agaagaagtg tcaaatatga agattgaact agaagacgca gggaggtgat 120 gaaaattagt tgcagattgt tattcagggg tttctttttg ttcatttcta attttcttat 180 atgtagcgct ctctttgttc ccctttagaa atcttgttcc tcatatctta aattatcgga 240 acaagatatc taaaggtata gtgtctttat ttttctcatt cattactttt gtttactaac 300 tgttatcttg ttggttatgt ttcacttgac gtcttgttag ttttatttct ttaccatcac 360 cctctgtctt caggtcagtg agcttagagc tccgtagtct tctatcatct cagcggtgtc 420 tcgtcaacac gcgactgagc tattgtcttt acctttcgtc aaaaggtctg ttagtgttat 480 gtttttctac tttcttcctg tgttgttttt ccggttcctc atatcttaaa ttatcggaac 540 aagatatcta aagttttatt tctttaccat caccctctgt cttcaggtca gtgagcttag 600 agctccgtag tcttctatca tctcagcggt gtctcgtcaa cacgcgactg agctattgtc 660 tttacctttc gtcaaaaggt cagtgagctt agagctccgt agtcttctat catctcagcg 720 gtgtctcgtc aacacgcgac tgagctattg tctttacctt ttgtcaaaag gtcgtcacag 780 tcttcatctt gaaatctcac a 801 // ID Gypsy-64_MLP-I repbase; DNA; FNG; 5891 BP. XX AC AECX01002907; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-64_MLP_; KW Gypsy-64_MLP-LTR; Gypsy-64_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5891 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002907; Positions 8464 2574. XX CC Positions [4691-5170] - Integrase core CC 'CTCCA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 306..1256 FT /product="Gypsy-64_MLP-I_3p" FT /translation="MSGTTKPLNLESMARRIEQLSAALAAKTERRVAAETQ FT QVPAPPPAPPARTPKVATPDKFDGSRGSKAEEFASQVGLYILTNEALFPSD FT RSKVTFALSYLTGDAVKWAQPFLNRVLNPSEGDTVTYDEFAEAFESVFFDS FT DRQKRAEAGLRSLRQTRSAAEYTVMFNQLAPTTKWEIPTLISHYRQGLKRE FT VRLAMIRESFTDLPSITALACAIDNDIRGEYNSSSSIVQPSDPDAMDVSST FT RFDIPSNEYKRRGEEGLCYKCGKPGHRARWCGRQGGKSKSRGGGKIAELEA FT KIAALQSRVGEGSRSDESKNGDARE" FT CDS 1658..2677 FT /product="Gypsy-64_MLP-I_1p" FT /translation="MPWLRENGHRINWKTGEIRPKTEHKTPSAHTAAIASP FT IPATTPDGTESNTGKARNIGEGALTVEPNSGIKPPRCEFDLTPSTIPITDD FT KLYHFRNNFEQIRNLEDNPTNKDETARPNEEELTAAHESQSPTLTDKARKE FT DPPNELEELQPPVMNERPTIAADEPASPVPKNTPDCPEAPMGNIRKRGEGA FT LTEDHISEIKPPQCEHDPVSSLTVNADDNPIRPWNSYAQTPTCKVPSVPNA FT SANLIAAASTASSGPKTTPEGPEPRKGNTRNRDEGALTEDHISGIQPPQCE FT HDPVPSVILEADNKLLRPWNNRNPEICTATASWNISAKLAADASKGNV" FT CDS 3032..5791 FT /product="Gypsy-64_MLP-I_2p" FT /translation="MELVDSLRNAERYTSLDMRNGYNNLRVHEGDEPKLAF FT ICKRGQFEPLVMPFGPTGAPGYFQFFISDIFRDRIGKDLAAYLDDLLIYTP FT AGVDHEQAVEEVLKTLAEHSIWLKPEKCKFSQEEIAYLGLLISKNQVRMDP FT VKVSAVKEWPAPKTVNQTQRFLGFANFYRRFISNFSKITRPLHELTRDNVR FT FKWTPARDKAFESLKEAFTTAPILKIADPYKAFVLECDCSDYALGAVLTQE FT DIDGVLHPVAFLSRSLVQAERNYEIFDKELLAVVASFKEWRHYLEGNPNRL FT EVVVYTDHKNLETFMTSKQLTRRQARWAEMLGCFDFHIRFRPGRQGTKPDA FT LSRRPDLETSTNEKWLFGSILKPTNLSDSSFIAELDCIEAWFNDEGIENPA FT 
VEEWFEEDIKQENSPDDIHIDAIDRTTTSNGALWTDDLILQRIQEVSKEDP FT RIQDLIMAVKNQSKHIPLGYVVADQVLYKKGIVEVPDDPKVKLEILRSRHD FT SLLAGHPGRAKTLSLVQRQYNWPSAKVYVNCYVDGCESCVRTKASTASPFG FT TLEPLPIPAGPWTDISYDLIPDLPLSNGMNCILTVVDCLTKMGHFVPCTTE FT MNAEDLATLMLKNIWKLHGTPKTIVSDRGSVFVSKITESLNRQLGIELHPS FT TAFHPRSDGQTEIVNKAVEQYLRHFVCYRQDDWEELLPLAEFAYNNSTHSS FT TGVSPFKANLGYDLSLGRIHSNEQCIPTVERRLKVIEEVQEELTENLKRAQ FT IAMKAQFDEGVRPTPEWNIGDEVWLSSRHISTTRPSTKLDHRWLGPFSITK FT RVSTLAYRLSLPASMGRVHPVFHVSVLRKHTPDSVEGRVQEQPEPVQVEGE FT EEWEVHEVLDKRRRRNKDEYLISWKGFGRADDTWEPLTNLKNAIELIDEFD FT RQFPKAAEEHRRRRKV" XX SQ Sequence 5891 BP; 1776 A; 1440 C; 1439 G; 1236 T; 0 other; cattgtggca tctcatcaag gcaaaccgag aaaggaagat cacaacaaaa gaagaagttg 60 aaaaagttta aatacgattt agaagaagtt gaattttttt ttcattgaaa ttttttttct 120 tgaaaagcat ataagaagag aagacgacac tattagaagt aaagaagtta aaattatagt 180 gtgaagacga caccctacta gaccaaccaa tcatctaatc cgccacgcct actttccaat 240 taccatctgt tcactctgcc actccgcctg attccgaaat atccgagtac aaatctgtag 300 ccgacatgag cgggaccacc aaaccactta acctggagtc gatggccaga cggattgagc 360 aacttagcgc cgcccttgct gccaaaaccg aacgacgagt cgccgccgaa actcaacagg 420 tgccagctcc accacctgct cctcctgcca gaactccaaa ggtagcgacg ccggataaat 480 ttgatggctc ccgaggttcc aaagccgagg agtttgccag tcaagtgggt ctctacatcc 540 tcactaacga ggccctattc ccgtcggacc ggtctaaagt cacattcgcc ctgtcctact 600 tgaccggcga tgctgtcaaa tgggcgcaac ccttcctcaa tcgcgtccta aacccatcag 660 aaggcgatac tgtcacttac gacgaattcg cggaagcctt cgagtcggtg ttcttcgact 720 cagatcgcca aaagcgagcg gaagcgggcc tcagatcatt gaggcagact cggtcagcag 780 ccgaatatac ggtcatgttc aaccagctcg ctcctacaac aaagtgggaa atcccgacct 840 tgataagtca ttaccgtcaa ggattgaaaa gggaagtgcg acttgccatg attcgcgaat 900 ccttcaccga cctccccagt atcaccgcct tggcttgtgc gattgacaat gacatccgcg 960 gtgaatacaa ctcctcgtca tcgattgtcc aaccatcaga ccctgacgcg atggacgtct 1020 ccagcactcg ttttgatatc ccatcaaacg agtataaacg ccggggagag gagggtttgt 1080 gctataagtg cggaaagccg ggtcacagag cgagatggtg tggacgtcag ggaggaaaaa 1140 gcaagagtag aggaggtggg aagatagcag 
agttagaggc taagatcgca gcgttgcaga 1200 gtagggtggg ggaaggtagt aggtctgatg aatcaaaaaa tggagatgct cgagagtgac 1260 ggacgtgcca cccctgagcc gagacgtaaa ggaactgtta gaagtagctg gtaatagtca 1320 tcatgcaatt aaaaaatcca atgacccacg gatctttacc actatttcac tatcagagtc 1380 cccaagcgcc acgtcccatt ttaaccccag acccaaacct agtgtgaaag cacgagccct 1440 cgtcgattgc ggatcaacgc atgaggtcct aggtaccaaa ttcgccaacg aagctggcct 1500 ccccctcacc aaattggctg cagccggcga cgtctacggc ttcgatggac aacctaggag 1560 tgttgcccac gacgccaaac tatttgtcaa taatgagcaa aaccccacca gatttctcgt 1620 caccaagatc aaagacactt acgacgtgat ccttgggatg ccatggctcc gagagaacgg 1680 ccatcgcatc aactggaaga cgggcgagat cagaccgaag accgaacaca agacaccatc 1740 ggcccatacc gcagccatcg catcgcctat accggcaacc accccagatg gcactgaatc 1800 caatacgggg aaagctagga atataggcga gggggctctc actgtagaac caaacagtgg 1860 gataaagccc ccgcgatgtg agtttgatct gaccccgtcc accattccta tcacagatga 1920 caagctgtat cattttcgga acaactttga gcagattagg aacctagaag acaacccgac 1980 gaataaggat gaaaccgcaa gaccaaacga agaagaactt accgcagccc atgaatcaca 2040 gtcacccaca ctaactgaca aagcaaggaa agaagaccca ccgaatgaac ttgaggaact 2100 acaaccgcct gtaatgaatg aaagaccaac tattgccgct gatgagccag cgtcgcccgt 2160 accgaaaaac accccggact gtcctgaggc acctatgggg aacattagga aacgaggcga 2220 gggggctctc actgaggacc atatcagtga gatcaagccc ccgcaatgtg agcatgatcc 2280 cgtatcttca cttaccgtta acgcagatga caaccctata cgcccctgga atagctatgc 2340 acagacaccg acctgcaaag tacctagcgt accaaatgca tcagccaatc taattgccgc 2400 tgcctcgaca gcgtcgtccg gaccgaaaac caccccagaa gggcctgagc cacgtaaggg 2460 gaacactagg aaccgtgacg agggggctct cactgaggac catatcagtg ggatacagcc 2520 cccgcaatgt gagcatgatc ctgttccttc cgttattctt gaagcagata acaagctttt 2580 acgcccctgg aataatagga accccgaaat ctgtacagca acagcatctt ggaacatatc 2640 agctaaactg gcagcggacg ctagtaaggg taatgtctaa aaaacagcag aggagctagt 2700 ccctacgaga taccatcgtt acatcaacat gttcaggaag agcaaggcca tgaccctccc 2760 accacaccaa cgttacgatt ttcgtgtcga cctagtacca ggagctacgc cgcaagcagg 2820 gaagatcata ccattgtcac ctgcggagga agtcgcatta 
gacaagatga ttgatgaagg 2880 tctggagaaa ggaaccatat gacgtaccaa atccccttgg gcagccccgg tcctcttcac 2940 cggcaagaag gatggaaatc ttcggccgtg cttcgactac cgaagactaa acgctctgac 3000 cgtgaaaaat cgttatccat taccattgac catggaactc gttgacagtt tgcgcaacgc 3060 cgagaggtac acatccctcg atatgaggaa tgggtataat aacctacgag tgcatgaagg 3120 agatgaacca aagttagcgt ttatctgtaa gagaggacag ttcgagccac tcgtgatgcc 3180 tttcgggccg acaggagcgc cgggatactt ccagttcttc atatcagata tattccgtga 3240 taggatcggg aaagacctag ccgcttacct ggacgacctc ctcatctata cgccagcggg 3300 tgttgatcac gaacaagctg tagaagaagt actgaaaacg ttagctgaac actcgatatg 3360 gctgaaaccg gagaagtgca agttctctca ggaggaaatt gcttacctag gactcctgat 3420 ctcaaagaac caagttcgga tggatccagt aaaagtatcc gctgtgaagg aatggccagc 3480 accaaagacc gtaaaccaaa ctcaacgatt tctagggttc gctaacttct acagaaggtt 3540 catcagcaac ttttccaaga ttactcgacc cctgcacgag ttaaccagag ataacgtacg 3600 gttcaagtgg accccagctc gagacaaggc ttttgaatcc ctgaaggagg ccttcactac 3660 cgcgccaatc ttgaagatag ctgatcccta taaggcattc gtcttggaat gcgattgctc 3720 ggattacgcg ctcggagctg tgctgacgca agaggacatt gacggggtac tacatccagt 3780 tgccttctta tctcgttcct tagtccaggc agaaagaaac tacgaaattt ttgacaagga 3840 attgttagca gtggttgcct catttaagga atggagacat tacctggagg gaaacccgaa 3900 tagactggag gtagtggtgt acacagatca taagaattta gaaacattca tgacaagtaa 3960 acaactcaca agaaggcaag ctcggtgggc ggagatgctt ggttgttttg attttcacat 4020 ccggtttcga cccgggaggc agggtactaa gccagacgca ctttcacgga gacctgacct 4080 agaaacttcg acaaacgaaa agtggctgtt tggatcgata ttgaagccca caaacttatc 4140 agactcatct ttcatagcag aactcgattg cattgaagca tggtttaacg atgagggcat 4200 agagaaccca gcggtcgagg agtggtttga agaggacatc aaacaggaaa actcaccgga 4260 tgatattcac atcgatgcta tcgaccgtac gaccacctca aatggagcac tgtggactga 4320 cgacttaatt cttcaacgga tacaagaagt atctaaagaa gacccccgaa tccaagacct 4380 aatcatggct gttaagaatc aaagcaagca tattccgtta ggctacgtag tagcggatca 4440 ggtgctatat aagaaaggta ttgtggaagt accagacgat ccaaaagtga aactcgagat 4500 tctacggagt cgtcatgata gcttgctggc tggacatcct ggtagagcaa 
aaacactaag 4560 tttagttcag cgtcagtata actggccgtc ggcgaaagtg tatgtcaatt gttatgtaga 4620 tggttgcgaa tcttgcgtac gaaccaaagc atcgacagca agcccatttg ggacgctgga 4680 acctctacca attccagcag gcccatggac cgatatcagt tacgacttaa tccccgacct 4740 acccctatcg aacggcatga actgtattct tacagtagtt gattgcctga caaagatggg 4800 tcactttgtt ccatgcacca ccgaaatgaa tgcggaggat ctggcaacac tgatgttgaa 4860 aaacatatgg aaactgcatg gcacacctaa aacaatcgta tcagatagag gtagcgtatt 4920 cgtttcgaag atcacagagt cgctgaatag acagttagga attgagctgc atccatcaac 4980 ggctttccac cctagatcag atgggcagac agaaatcgtc aacaaagctg tcgaacaata 5040 cttacgccat tttgtatgtt atagacagga cgactgggag gagctcttac cgctagccga 5100 attcgcgtac aataatagca ctcactcctc tacaggagtg tctccgttca aggccaattt 5160 agggtatgac ctgtcgttag gacgaattca ttctaatgag cagtgcattc caacagtaga 5220 gagaagacta aaagtaatag aagaggtgca agaggaactc accgaaaatc tgaaaagagc 5280 tcagatcgcc atgaaagcac aattcgacga gggagtacga cccacgccag aatggaacat 5340 tggagatgaa gtgtggttga gtagtcgtca catatcgaca acaagaccaa gcacaaaact 5400 tgaccatcga tggttaggcc ccttttcgat cacgaagaga gtgtcgacct tggcgtatcg 5460 cttatcccta ccagcgtcga tgggtagggt tcatccggtc tttcacgttt cggttttgag 5520 gaaacacact ccggattcgg ttgaaggtcg agtgcaggaa caaccagagc cagttcaagt 5580 agagggtgaa gaggaatggg aggtgcatga agtactagac aaacgaagaa ggagaaataa 5640 agatgagtat ttgatcagct ggaagggatt tggaagggca gatgatacat gggagccact 5700 aactaacctt aaaaatgcta ttgaattgat cgatgagttt gatcgacagt ttccgaaggc 5760 tgcagaagaa cacagaagac gaagaaaggt gtaagtgagg ggtacggttt tttcccaacg 5820 ggttttttaa taccaccccg gggaaggagg cagggccgcg aacagggagc ccgggccaaa 5880 agcggggata g 5891 // ID Gypsy-4_RO-I repbase; DNA; FNG; 5078 BP. XX AC AACW02000293; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_RO_; KW Gypsy-4_RO-LTR; Gypsy-4_RO-I. 
XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-5078 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000293; Positions 219628 214551. XX CC Positions [3769-4242] - Integrase core CC 'CTTTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 43..5043 FT /product="Gypsy-4_RO-I_1p" FT /translation="MSATGNSTPATPNAPAPIDYAMELINSPEQVADAVLK FT ETEDYASSLNELSINEDNRAISSSSEDNLLEMKNAKRVAYDLLERVRVELT FT RMYANPQASMDEINQGREVVQQAKYRYNGIVELYNECKKNRMHKPMLSHAG FT NPRSAVNKEHTAQDEWSSFNNSKISRSELPILHLSVDKKDSDPDDWKYFNT FT VEIFFKTFERIFRINQVNILVHWRDYMALSIGEANLDWYQETIESHSPSYS FT WEEAKDIIRKHFENPAKAIEMFGRLLSSKQRNNETVSDFTARFNRLAQTAQ FT CSDSNMLARIYINSLHDDLNLHIRTVLSSNYGATFLQDINSIREVEKLASG FT LEVHKHQPYAASTTYKHSKGERHGFKGAKRGHSESSNGDFIKKQKSSNGHF FT TSSSSSDSSSSKKCLYCKDTWFKGHKCMEFFAQKNKSTVRKIEKRKQNVKK FT LDHPSTKAWNEFAKRQSNNKKSSTDDELEDKMQLGKCIRAIKQEKNDLLTH FT NSFSLYTPIIIQTNRALGLIDTGAEISVINRKFIEINKVKFYHKTGLINLA FT GKDNVVKRIGITEPLELKYNGKTFHHSFEIMDFNEDDKCNVILGLDILSHL FT GIALTGVAHNWDDNEVIFDDSIDDTVKPNNSPAGTELERSQFIEKIHPLLA FT ENMNIPKDAFCTVPESIIHLPTEKGKIVNHKQYPIAYKLKPVLDEAINKWL FT DNGTITKAPVNTAWNSPLTLADKKDANGNKTGKRPCLDPRHINALLPDDKF FT PIPLITDIFHELSGSSIFTTLDLTAAFHRFKIHDEDQPKTAFTYNGQQYMF FT VGAPFGIKFLSNVFQRVMTQIFKDMPFVRTFIDDIVIMSKNMEEHAVHVKA FT AIEALTRVNLVLNVDKCHFAQSCVYLLGFCISAKGSSLDTRKLTNIEEWPR FT PTCGTDVQRFLGVVNYFREHIPKAAMLTAPLDHLRNSKIITEKDWTPLHQT FT HFEAIKKVLLSNVVLSQPDLRQPFFVATDASNYGIGAVLYQEHVELTSNVD FT STDVVKNDNVLKDKGKKKTTIKYIGFMARSLSKSERNYSTTKRELLAIIFA FT LKKFHKFLWGNPFTLYTDHKALTYLHTQKYANTMMMNWFDTILDYQFKVIH FT LPGMDNILPDHLSRLFPTDQMLRGSEQDTRTKNGIIHTAIRAIRSNKHKKF FT AVKKPAQYKQCDYMTPPKNERKQLMLRAHLMGHFGAENIVKVLHNDGLHWK FT NMIVDAVELVKSCVACQKHNITRKGYHPLRPVYSYLPGDHWSMDLAELPTT FT PNDNKYLLVLLDICTRFCILRCLPNKKAKTIAGTLVQVFSDFGYPRILQSD FT 
NGTEFKNVTMKLLTESTGIDHRLISPYHPQANGSAERWVKTAKIAIAKSVQ FT GTGQDWDFYVPSVQLFINNKVSKRLNTSPFSLMFARKMNGFEDYRKKDPKK FT AVPLSYEELLERIDHMNQVVFPAIKDKTDKYVENMKKQFEKHNTIVDDFKE FT GSHVMAKVPTMTGSLNPVYDGPYTVIRKTRGGSYVLQDEMGLLMSRDYTPS FT ELKPIDLHNINEDNEEVYEIEGIVDHRGRGNNREFKVRWKGYSKDEDSWLT FT PDKITHTSTIENYMKRMGLHPQNKFKKDPVKNKPLIHKHNNEKGILHKATE FT SLKSLYDKDTKVNQDNKYKLSKDINKTHNKHKRKAIDINDLRRSKRNRT" XX SQ Sequence 5078 BP; 1782 A; 945 C; 927 G; 1424 T; 0 other; tttttttaag tgaatacacc acgatattgt tctaaaatca agatgtctgc tactggtaac 60 tctactcctg ctactcccaa tgctcctgct cccattgact acgctatgga attgatcaac 120 tctcctgagc aagtcgctga tgctgtctta aaggaaacgg aagactatgc tagctcttta 180 aatgagttgt ctataaatga agataaccga gccatttcct cttcttctga agacaattta 240 ttggaaatga aaaatgctaa aagggtggca tatgacctcc ttgaacgagt tcgggtggaa 300 ctcactcgta tgtatgcaaa tcctcaggct tctatggatg aaattaacca aggaagagag 360 gtagttcaac aggctaaata tcgatataac ggtatagttg agctatacaa tgaatgcaag 420 aaaaatagga tgcataagcc tatgttgtct cacgccggga accctagatc tgctgtcaat 480 aaggaacata cggcacagga cgaatggtca agtttcaata acagcaagat cagccgctct 540 gagcttccca ttctgcatct ttcggtggac aagaaggatt ctgaccctga tgactggaaa 600 tatttcaaca ccgttgaaat atttttcaag acctttgagc gcattttcag gatcaatcaa 660 gtaaacatct tggtccactg gcgtgattac atggccttgt ccattggcga agccaattta 720 gattggtacc aagaaacaat cgaatcacac agcccttcgt actcttggga agaagccaag 780 gacattatac gcaagcactt tgaaaaccct gccaaggcaa ttgaaatgtt tggacgccta 840 ctctcatcca aacagcgaaa caatgaaacg gtgtctgatt tcactgctcg tttcaaccgc 900 cttgctcaaa ccgctcagtg ttcggattca aacatgcttg cacgaatata catcaactct 960 cttcatgatg atttaaacct tcacataaga acggtattgt catccaacta tggtgctact 1020 ttcttacaag acatcaattc tatcagagaa gttgaaaaac ttgcttctgg cttggaagta 1080 cacaagcacc agccctatgc tgcttctacc acatacaagc actctaaagg tgaacgccat 1140 ggctttaagg gtgcaaaaag aggtcattct gagtcctcaa atggtgactt tataaagaaa 1200 caaaaaagtt ctaatggtca cttcacctct tcaagctctt ctgactcctc cagttccaaa 1260 aagtgtctat attgtaaaga tacatggttc aagggacata agtgcatgga attctttgct 1320 caaaagaata 
agagcacagt aaggaagata gaaaaacgaa agcaaaacgt taaaaaactt 1380 gatcatccct ctactaaggc atggaatgaa tttgccaaga gacaatcaaa taacaagaaa 1440 tctagtactg atgatgaact agaggataaa atgcaactcg gtaagtgcat aagagcaata 1500 aaacaagaga aaaatgactt gttgactcac aattcctttt ctttatatac tcctataatc 1560 atccagacga atagagcact tggtttaatt gacactggtg cagaaatatc agtcataaat 1620 agaaaattta ttgaaataaa taaagttaaa ttctatcata aaactggtct tattaattta 1680 gctggaaagg ataatgttgt aaaaaggatt ggtattactg aaccacttga attgaaatat 1740 aatggaaaga ctttccacca ttcatttgag attatggatt ttaatgaaga tgataaatgt 1800 aatgtcattc tcggccttga tattttaagt catttgggta ttgctctcac tggagtcgct 1860 cataattggg atgataatga agtcatcttt gatgattcaa ttgatgacac tgttaaaccc 1920 aacaactctc ctgctggcac tgaacttgaa cgatcccagt tcatagaaaa aattcatcct 1980 ttacttgcag aaaatatgaa catacccaaa gatgcattct gcacagtacc tgaatccata 2040 atccacttac ccacggaaaa aggaaaaatt gtaaatcata agcaatatcc catagcatat 2100 aaattgaaac ctgtcttgga tgaagccatc aataaatggt tagacaatgg taccatcaca 2160 aaagctcctg taaatacggc ctggaattct ccattaacct tagctgataa aaaggatgca 2220 aatggaaata aaactggcaa aagaccatgt ctagatcctc gacacataaa tgctctatta 2280 ccagacgata agtttcctat acctctcata acggatattt ttcatgaatt atctggatcc 2340 tcaatcttta ctactttgga tttgactgct gcatttcatc gttttaagat tcatgatgaa 2400 gaccagccta agaccgcatt tacatataac ggtcagcagt atatgtttgt tggtgctcct 2460 tttggaataa agttcctatc aaacgtattc caaagagtaa tgacgcagat atttaaagac 2520 atgccctttg tacgaacttt cattgacgat attgttataa tgtcgaaaaa tatggaagag 2580 catgctgtac atgtaaaagc tgctatagaa gcgcttacta gggtgaatct cgttctcaac 2640 gtcgataaat gtcattttgc acaaagctgc gtttatttac tcggtttttg tatctctgct 2700 aaaggatctt cgttagatac tcgtaagcta acaaatattg aagaatggcc tagacctaca 2760 tgtggcacgg atgttcagcg attcttaggt gttgtaaact atttccgcga acatattcct 2820 aaagcagcca tgttaactgc tcctttagat catcttcgta acagcaaaat aataactgag 2880 aaggattgga cacccttgca tcaaactcat tttgaggcaa taaagaaagt tttactttcc 2940 aatgtggtgt taagtcaacc tgatctcaga cagccatttt ttgtcgcaac tgatgcttca 3000 aattacggta ttggtgctgt 
actatatcaa gaacatgtgg agttgacatc taatgttgat 3060 tctacagatg tagttaaaaa cgataatgtt ttaaaggata agggaaagaa aaagaccaca 3120 ataaaatata ttggtttcat ggcacgttct ctttccaagt ccgagagaaa ttatagtacc 3180 acgaagagag aacttctggc aatcatattt gcactcaaaa agtttcataa atttctatgg 3240 ggaaatcctt ttacacttta tacagatcat aaagctctga cctatcttca tacacaaaaa 3300 tatgcaaata ccatgatgat gaattggttt gataccattc ttgattacca atttaaggta 3360 attcatcttc ctggtatgga taacatatta cctgaccatt tatctcgcct tttccctacg 3420 gatcaaatgc tgagggggag tgaacaggat actagaacta aaaatggcat tatacataca 3480 gcgataagag ctataagatc aaataaacat aaaaaatttg cagtaaaaaa acctgcacaa 3540 tataagcagt gtgattatat gactcctcct aaaaatgaga gaaagcaact tatgctaaga 3600 gctcatctaa tgggtcactt tggtgctgaa aatatagtaa aagtattaca taatgacgga 3660 ttgcactgga aaaacatgat agtcgacgca gtcgaacttg ttaaatcatg tgttgcttgt 3720 caaaagcata atataaccag aaaaggatat catcctctaa gaccagtata ttcataccta 3780 cctggtgatc actggtcgat ggatttagct gaattaccca caactccgaa tgacaataaa 3840 tacttattgg tgctattaga tatatgcaca cgattctgta ttctaagatg cttaccaaat 3900 aagaaagcca aaactattgc tggcactctt gtccaagttt ttagtgactt tggttacccg 3960 cgtattctac aatcagacaa cggaaccgaa tttaaaaatg tcacaatgaa attgttaact 4020 gaaagtacag gcatagatca cagacttatc tcaccttatc atccgcaagc aaatggcagt 4080 gcggaaagat gggttaagac tgctaagatt gctattgcta aatcagtaca aggaacagga 4140 caagactggg acttttatgt acctagtgta cagttattta taaataacaa agtatctaaa 4200 aggcttaata cttcaccttt ctctttgatg tttgcgagaa aaatgaacgg atttgaagat 4260 taccgaaaaa aggatccaaa gaaagccgta ccgctctcat acgaagaatt gttggaaaga 4320 attgatcata tgaatcaagt agtcttccca gccattaaag ataaaactga caagtatgtt 4380 gaaaacatga aaaagcagtt tgaaaagcat aatacaatag ttgatgactt caaagaagga 4440 agtcatgtca tggctaaagt tcctactatg actggttcac taaaccctgt gtatgatggt 4500 ccatatacag tgattcgtaa aaccagaggt ggatcctacg tacttcagga tgaaatggga 4560 ttactaatgt ctcgagacta tactccttct gaattaaagc caattgatct gcataatata 4620 aatgaggaca atgaagaggt ttatgagatc gaaggaattg tagatcatag aggtagaggt 4680 aataacagag agttcaaagt acgttggaaa 
gggtatagta aggatgaaga ctcatggctt 4740 acgcctgata aaatcaccca tacctctact attgagaact atatgaaacg aatgggatta 4800 catccacaaa ataaattcaa gaaagatccc gtaaaaaata aacctttgat acataagcat 4860 aataatgaaa agggaatttt acataaagct acagagtcac ttaaatcact ctatgataaa 4920 gacaccaaag ttaatcagga taataaatat aaattatcta aagatataaa caagacccat 4980 aataaacata aaagaaaagc catagatatt aacgacttga gacgaagcaa aagaaatcgt 5040 acataaataa acgattgtcc caatctggca gggaggga 5078 // ID Gypsy-5_PPM-I repbase; DNA; FNG; 5878 BP. XX AC ABWF01004803; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_PPM_; KW Gypsy-5_PPM-LTR; Gypsy-5_PPM-I. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-5878 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01004803; Positions 698 6575. XX CC Positions [2989-3420] - Reverse transcriptase CC Positions [4668-5147] - Integrase core CC 'CCCGC' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 2092..3096 FT /product="Gypsy-5_PPM-I_1p" FT /translation="MVYELDQGSQHFHIPVRLQGKRRHKDIIAMVDSGATT FT KFINKRFIVENKVQTRRLKEPIPLYNIDGTLNKDGSISEVAVLQMQIGEHV FT EKMVFTVTDIGPEDVIIGLDWLREHNPEINWEEGSLKLSRCPETCSARRTG FT RMEQAATARDTGVRPTARRCSPKSKRSVIGACRSKVLPDFEPEEDPVGVEW FT DEADLIEAWEQGITLPGAPQLFVAAGHTYSQLFTEEEIKKKVVKTAEESVP FT NQYHEFLKVFSKEASERLPERKLYDHAIELVPGYSTFHSKVYPLSNNEQEE FT LDKFLKEQLAKGYIRESKSPISSPFFFIKRRKGCFDLCRTTIG" FT CDS 3855..5864 FT /product="Gypsy-5_PPM-I_2p" FT /translation="MQKQDDGKWHPITFRPHAMDPAQRNYEIYDKEMLAII FT EALEDWRHYLEGLPNQFEIVTDHKNLEYWRTSQHLTHRQARWSLFLAHFDF FT RITHKAGTSNGKADALSRRSDHQVGDEDDNTDQVVLKPERFRIASTRHGHA FT SVTADQSLLKRIRECSDKDRVVAEALEKVQNLGPPRLQKGFEEWNAEQGLL FT LFRGMVYVPKNAELRRDIIKIHHDSPIAGHGGRAKTLELLSRNYWWPGMSK FT FVNEYVSTCDVCNRTKTFPAKPQGPLKPNEIPERPWQIITTDMIVGLPKSD FT RFDSILVTADRFTKQVHFSACHETLTAEGAADLYIRDVFKHHGAPAKVISD FT RGPQFASKYLHAVYKGLGIEPALSTVFHPQTDGQTKCWNQEIEQYLHAFTS FT HRQDDWAKQLPLTEFALNNRVHSATGHSPFFLHQGYHPEIQVRINPMSAMP FT AAADRLKILKEIRDDTRSALELSAERMKVYYDKHVQEAPVFAPGDKVWLDA FT RNLKLKQPSKKLSPKRLGPYAVRRKLGDLDYELVLPKSVPVHPVFHVSLLS FT KYTRSEIPGREPEEPPAIEVEGDEEYEVERIKDSRIFRRQLQYLVKWKGYD FT DSHTSWEPARNVANAPALIADFHRKNPTAPCRLNAAVFGGLNFQPMPSPLT FT EARCRSRWEMGKGASHEVMRR" XX SQ Sequence 5878 BP; 1380 A; 1747 C; 1602 G; 1148 T; 1 other; ttaggtcgaa actccccata gctgccggcg ttccttggaa caccaacact cactctacac 60 ttcctacagg ttgggtggtg cgcaccctta ctatacccta cgatacatag atagcatacg 120 atacatcgcc accacttagg gaatgaactc ttcagtacct gcacttcagt gggcggtact 180 tgtcctcacg gacgcatgga ttaggttcct gctgcatact agccaaaagc agtactgggc 240 cccacgtgaa ctgcgaaccc tagtgttacc tcgggcgccg taggtgtaag ctagtctaac 300 acggcctttc gtagcacgcg cacctcccct tcctcccccc atcaccgtgg ctgacgaatt 360 cttcgactgc tctcgacttg agcacgcctt cgacacccag gagcaaatcg acgtctcccg 420 cgacgaccct gagagcgacg ggaatcaagc tcccttcttt atctgtgatc gacaaccgtg 480 ccctaaccgc actccgcgat ccatcactac cgacgaaggg cgctacgtgc ccatccgacg 540 agtccagcac cccagcggcc tggcacttcg atacgctggc acctcgagat 
ccacatctcg 600 ccacgtcact cccgtccctt cccgacctgc ctccccaggc acccgtatcc cccaacctgt 660 caccagtaca ggtcaaacac gaggagatcc caatctccct cgaggaattg aggcagtcgc 720 acagcctaca gcgacccctc aaacgatctn ccagccctcc ttgggtgtac ggttggaaag 780 gaacccgaag cctgagccca gcaacagtcc agcggttact tgggccagct cgtcatcagc 840 aatctcgtcc tcgactccag tccctgtcgt acgtcacccc gccaccggac ttcctccgtc 900 gcctccaccg ccgtcgccac cgcgaggccg ctcgagcact cgctcctctc aaagttcgcc 960 cggcggacag tcacaacagc cgtcttcgcc cgcgggcagt cctccgtccc cctcgtcgcc 1020 ggtaatgtcc tcaccagcgt ccccccctga taaggacacc ctaaaactcc tcctcccgct 1080 ccgatacgat ggcaagaccg tcatcaagtg cgacaggttt ctgtcgcaac ttcgcatcta 1140 ctggctggtt aacacgtcgt taaccaccat cgaacttaaa gtgcaggttg cactaagcct 1200 gcttgacagc gatgcccgca cctgggccac tccttacttt gcccagctcg tatcagtgca 1260 actcggtgtg cagggagtaa cgaccccctt cgcaaatgag gcggcctttg ccacggcctt 1320 caaggcccgc ttcggcaatc tcgacgacga agcagcggcc caggtagaac tggccaaact 1380 ctgcgcggac aagtcggtcc gcgaaaaacg cactgctgcg gagttctccg cgctgttcaa 1440 gggtccggcg gaccattccg ggtatgggga cctggaacta tgcgacaagt acctgagcgg 1500 catcccctcc cacgtctacc gcaaaatcga gctcgaaacg ttcgctacgt gggaagacgc 1560 tgacaagcgc gccacggagg tcgagcagat cctcgacatc agtcaggcct gacggcccga 1620 gttgaacaac ttcttctcgg ctcgaggtcg aggacgtggt ggggcacgtg gtggtgcacc 1680 ccagtcacac ggagcttcgg ccagcatcaa tgcagccgtc ggaaaaggaa acttccccag 1740 cacttgcttt ggctgtggga agcaagggta ccgacgcttc gagtgcccca attgtaagga 1800 caagccttac acaaagcgcg ccgacgcgcg ggctacggtt gcctcgggtt ccacccaggc 1860 cgcgacaagt gcccccgtca cgacatcacc ctcggcaacg ataagtgccg cttcagcgaa 1920 atccgagcaa tcggagctgg cggacttaat ggcacaggtg aagtcgatgc gtgaggagct 1980 cgagcactat cggacaatga aggaggaggg tttttgatct ggtccgcgct tcgtgccgca 2040 cgcgtcgggc cgcatatatc ctgtaacaag tatgatgtat tgtcgactgc aatggtgtat 2100 gagttggacc aggggtccca acactttcac attccagtcc gactccaagg gaagagacgg 2160 cacaaagaca taatcgccat ggtggacagc ggagccacca caaaatttat caacaagcga 2220 ttcatcgtag aaaataaggt gcagacgcgg aggctaaaag agcctattcc gctctataat 2280 
attgacggca ccctgaataa agacggaagc atctcggaag tagccgtgct acagatgcag 2340 ataggagaac acgtcgaaaa gatggtgttt acggtcacgg acatcggtcc agaggatgtg 2400 atcatcgggt tggattggct acgtgagcac aaccctgaaa tcaactggga ggaaggttcc 2460 ctgaagttat cccgatgccc tgagacgtgc agcgccagga ggactggccg gatggagcaa 2520 gcagcgactg cgcgtgatac gggggtcagg cccacggcgc gcaggtgttc gccgaagtcc 2580 aaacggtcgg ttataggcgc ttgccggagt aaggtactcc cggacttcga acccgaagaa 2640 gatccggtgg gagtagagtg ggacgaagct gacttgatcg aggcctggga gcaaggtatt 2700 acgctgcctg gcgcccctca gctgttcgtc gcagctgggc atacatactc ccagttgttc 2760 acggaggagg agatcaaaaa gaaggtcgtt aagaccgccg aggagtcagt gcctaatcag 2820 taccacgagt tcctgaaggt cttctcaaag gaagcatcag aacggttacc cgaaaggaag 2880 ctgtacgatc atgcgatcga actcgtgccc ggctactcga cattccattc aaaggtctat 2940 cccctgtcga acaacgagca agaggagctt gacaagttcc taaaggaaca gttagcgaaa 3000 ggttacatac gggaatcgaa gtcacctatc tcgtccccct tcttcttcat caaaagaagg 3060 aagggatgct tcgacctgtg caggactacc atcggttgaa cgcgatcacg gttaaaaatc 3120 gctacccctg gccgctgatc gccgaaatgg tcgacaaact ccgcggcgcc acgctgttca 3180 cgaagttcga cgtccggtgg ggatacaaca acgtccggat caaagccggt gacgagtgga 3240 aagcggcctt tgtgaccaac cgagggttgt acgagccctt ggtgatgttc ttcggactca 3300 ccaactcccc tgcgaccttc caggcgatga tgaacgagat cttccacgat ctcatcatcg 3360 gcggcaagat cctcgtgtac ctggatgaca tactcgtctt ctctaccaac aaggaggagc 3420 atgagaaggt cacacgcgag gttctgcacc gccttcagga taacgatcta ttcctgaagc 3480 ctgagaagtg cgaatgggat gttcccaagg tcgactacgt cggctatgtg ttcggtggcg 3540 atgaggtagc gatggatccc gcgaagttga aaggaatcaa tgagtggccg gtaccccaga 3600 acaaaaaaga tgttcagaag ttccgggggt tttccaactt ttaccgccgc ttcattaagg 3660 acttcgctaa gatctcgtga ccactcgacc gactcacagg gaacgaccca tggcattggg 3720 gtgaagagga gcagcgcgcc tttgatgaac tcaaacagct ttttgtgacc actccagtct 3780 tggcattgta cgaccctaac cgggagacgc gaattgaagt cgatgcctca ggatacgcca 3840 ccggcagcgt attgatgcag aagcaggacg atggcaaatg gcatcctatc accttccggc 3900 ctcacgctat ggaccctgcc cagcgtaact acgagatcta cgataaggag atgcttgcga 3960 tcatcgaggc 
attagaagat tggcgccact acctcgaggg attacccaat cagttcgaga 4020 tcgtaaccga tcacaagaac ctcgagtact ggcgcacgtc acagcacctc acgcatcgtc 4080 aagcgcgatg gtcactgttt ctggctcact tcgacttccg catcacgcat aaggccggaa 4140 cgtcgaatgg aaaggctgat gccctgtcac gccgatcaga tcatcaggtc ggcgatgagg 4200 atgataacac cgatcaggtg gttctcaaac cggagagatt ccgcatcgcc tccactaggc 4260 acggtcacgc ctcggtgacg gctgatcagt cgcttctgaa gcgcatcagg gagtgttcgg 4320 ataaggatcg ggttgtggcc gaggcacttg aaaaggtgca gaacctcgga ccacctaggc 4380 tgcagaaagg ctttgaggag tggaatgccg agcaagggct tctgctgttc cgtgggatgg 4440 tctatgtccc aaagaacgcg gagctccggc gggatattat caaaatccac catgactcgc 4500 ccatcgccgg acatggtggt cgcgccaaaa cactggagct cctgtcacgg aactactggt 4560 ggcctgggat gtctaaattc gtcaacgagt acgtaagcac ctgtgacgtg tgcaatcgta 4620 ccaagacatt cccagccaag ccccagggtc cgctgaagcc gaacgagatc cccgagcgcc 4680 cgtggcagat catcaccacc gatatgatcg ttggcttgcc gaagtctgac agattcgact 4740 cgatcctggt caccgccgac cgctttacca agcaggtcca cttctcagcc tgtcatgaga 4800 ccctgacggc agaaggtgcg gcagatctgt acatccgcga tgtgttcaag catcatgggg 4860 ccccggccaa ggtcatatca gaccgaggtc cccagttcgc ttcgaaatat ctccatgcag 4920 tgtacaaggg actagggatc gagccggcgc tgtccacagt gttccacccc cagacggacg 4980 gccagaccaa gtgctggaat caggagatcg agcagtacct gcatgctttc acaagccaca 5040 ggcaggatga ctgggcaaag caattgccgc tcacagagtt tgcgctgaac aaccgagttc 5100 actccgcgac cgggcactcc cccttcttcc tgcatcaggg gtaccatcct gagatccaag 5160 tgcggatcaa tccgatgtct gcgatgcccg cggccgctga ccggctcaaa atcctgaagg 5220 agatccggga cgacacgcgc tctgcacttg agctgtcagc cgagcgcatg aaggtgtact 5280 atgacaagca cgtgcaagag gcaccggtct ttgcgccggg ggacaaggtc tggcttgatg 5340 cccggaacct taagctgaaa cagccgagca agaagctcag tcctaagcgc ctcggaccat 5400 atgccgtgcg gcgaaagctc ggagacttgg attatgaatt agtcctgccg aagtcggtgc 5460 cggtgcaccc ggtgttccat gtctccttgc ttagcaagta cacacgtagc gagatcccgg 5520 gccgagagcc agaggagccg cctgccattg aggttgaggg cgacgaagag tatgaggttg 5580 agcggatcaa agattcaagg atcttccggc gtcagctgca gtacctcgtc aagtggaagg 5640 gttatgacga ctcgcacacg 
tcatgggaac cagcgcgcaa cgttgccaat gcgccagccc 5700 tcatagccga cttccaccgg aagaatccga ctgccccttg ccgcctcaat gccgcggtct 5760 ttggtggtct caactttcag ccgatgcctt cacctctgac tgaagctcgc tgccgctcca 5820 gatgggagat gggcaagggc gcctcgcatg aggtcatgcg ccgttgaagg gggggtaa 5878 // ID Gypsy-52_MLP-LTR repbase; DNA; FNG; 308 BP. XX AC AECX01001645; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-52_MLP_; KW Gypsy-52_MLP-I; Gypsy-52_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-308 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001645; Positions 10445 10138. XX SQ Sequence 308 BP; 75 A; 69 C; 67 G; 97 T; 0 other; tgtaaggatt gggttacagt atgggattag acaggttgac ttctctatac tcactatcag 60 ccctggctcc acgttgggtt ctcttctctt cacgaacccc gactggagct aggtaagagt 120 cattatcatc actctcatat tcatttatac ttgttctgat tgtttcgctt ctcttcacga 180 accccgactg gagctaggct agcgagcttg taatgcaaac aagagttcta gaatagtttt 240 cagtggttaa agtccaccgt gagagttccc gagagtctag tgaaggttct tttacagaga 300 accttaca 308 // ID Gypsy-6_LBS-LTR repbase; DNA; FNG; 956 BP. XX AC ABFE01000274; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_LBS_; KW Gypsy-6_LBS-I; Gypsy-6_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-956 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000274; Positions 2998 2043. XX SQ Sequence 956 BP; 209 A; 216 C; 178 G; 353 T; 0 other; tgtggaagtg cgtcacgttc cattcacgac gccttctata ttttgagttt tcgcttttgt 60 ctcataattt ttacatttta ttttgtcttt tgtattttaa ctgttcttta ctgaaagtcc 120 aaagcgaaag aaacacctat tttcttacag atcaagaaga atgtagccaa acagtgattt 180 ctgctgcttg gagaagatgt caaacacact tgctagttgc tcaacaacac gcacctttct 240 tcgaagtgga cttgaggttt ccttatgtat tcgtatgctc aactttattt tcttttcttt 300 tattttattt tctttgcttt tcttttttta cttatgacag aacccttttc gctcaacgca 360 ttttctgacg gctagacggt agggtcgagc gtcaatgtta ggaatgtccg cactgaggca 420 ggcagtcgcg catgaaaata gacctcttcc tacattgtaa gttccctgag tcatccctag 480 cttgaaggac tcccggcgag aggagggagc gacgttggat tcgttttatt ctttggacat 540 tttctcttgt tggttatttt tctattttta gacgttttct gtttattttg gtcagactgc 600 atgagagaac atgtgtaggt ctcgtgatcg tttgtaaagt atatataggt tcattgcgtc 660 ttttgaaatt gattcttccc cccccatcca atactcaagt actcttaagt tttcaagtct 720 ttactttttc actgccgacg tgttctgtcc tttttgacga caactcatat cgacaacacg 780 cgtgcaagtt tatacggaga ttagacggag acgttgggcg cgttcacacg tcactcacgc 840 taaacgctca gtcttctttt ctttgatccc cattcctttc gtacttcatc ccccatattt 900 ttctgtggca gacgaccagt gacccctccc tgtccttgaa ctttcgttct tccaca 956 // ID TY2_I repbase; DNA; FNG; 5295 BP. XX AC U20162; XX DT 22-AUG-2005 (Rel. 10.08, Created) DT 22-AUG-2005 (Rel. 10.08, Last updated, Version 1) XX DE Ty2 LTR-retrotransposon from yeast (internal portion). XX KW LTR Retrotransposon; Transposable Element; internal portion; KW TY2_I. XX OS Saccharomyces cerevisiae OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. XX RN [1] RP 1-5295 RA Dujon B., Albermann K., Aldea M., Alexandraki D., Ansorge W., RA Arino J., Benes V., Bohn C., Bolotin-Fukuhara M. 
et al.; RT "The nucleotide sequence of Saccharomyces cerevisiae chromosome RT XV."; RL Nature 387(6632 Suppl), 98-102 (1997). XX DR Genbank; U20162; Positions 4724 10018. XX FH Key Location/Qualifiers FT CDS 227..1273 FT /product="TY2_I_1p" FT /translation="MMTPNKAMASNWAHYQQPSMMTCSHYQTSPAYYQPDP FT HYPLPQYIPPLSTSSPDPIDLKNQHSEIPQAKTKVGNNVLPPHTLTSEENF FT STWVKFYIRFLKNSNLGDIIPNDQGEIKRQMTYEEHAYIYNTFQAFAPFHL FT LPTWVKQILEINYADILTVLCKSVSKMQTNNQELKDWIALANLEYDGSTSA FT DTFEITVSTIIQRLKENNINVSDRLACQLILKGLSGDFKYLRNQYRTKTNM FT KLSQLFAEIQLIYDENKIMNLNKPSQYKQHSEYKNVSRTSPNTTNTKVTTR FT NYHRTNSSKPRAAKAHNIATSSKFSRVNNDHINESTVSSQYLSDDNELSLR FT PATERI" FT CDS 1233..5270 FT /product="TY2_I_2p" FT /translation="MTTNLVLGQQQKESKPTHTIDSNDELPDHLLIDSGAS FT QTLVRSAHYLHHATPNSEINIVDAQKQDIPINAIGNLHFNFQNGTKTSIKA FT LHTPNIAYDLLSLSELANQNITACFTRNTLERSDGTVLAPIVKHGDFYWLS FT KKYLIPSHISKLTINNVNKSKSVNKYPYPLIHRMLGHANFRSIQKSLKKNA FT VTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLH FT TDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILA FT FIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVA FT ERLNRTLLNDCRTLLHCSGLPNHLWFSAVEFSTIIRNSLVSPKNDKSARQH FT AGLAGLDITTILPFGQPVIVNNHNPDSKIHPRGIPGYALHPSRNSYGYIIY FT LPSLKKTVDTTNYVILQDNQSKLDQFNYDTLTFDDDLNRLTAHNQSFIEQN FT ETEQSYDQNTESDHDYQSEIEINSDPLVNDFSSQSMNPLQLDHEPVQKVRA FT PKEVDADISEYNILPSPVRSRTPHIINKESTEMGGTIESDTTSPRHSSTFT FT ARNQKRPGSPNDMIDLTSQDRVNYGLENIKTTRLGGTEEPYIQRNSDTNIK FT YRTTNSTPSIDDRSSNSESTTPIISIETKAVCDNTPSIDTDPPEYRSSDHA FT TPNIMPDKSSKNVTADSILDDLPLPDLTHKSPTDTSDVSKDIPHIHSRQTN FT SSLGGMDDSNVLTTTKSKKRSLEDNETEIEVSRDTWNNKNMRSLEPPRSKK FT RINLIAAIKGVKSIKPVRTTLRYDEAITYNKDNKEKDRYVEAYHKEISQLL FT KMNTWDTNKYYDRNDIDPKKVINSMFIFNKKRDGTHKARFVARGDIQHPDT FT YDSDMQSNTVHHYALMTSLSIALDNDYYITQLDISSAYLYADIKEELYIRP FT PPHLGLNDKLLRLRKSLYGLKQSGANWYETIKSYLINCCDMQEVRGWSCVF FT KNSQVTICLFVDDMILFSKDLNANKKIITTLKKQYDTKIINLGERDNEIQY FT DILGLEIKYQRSKYMKLGMEKSLTEKLPKLNVPLNPKGKKLRAPGQPGHYI FT DQDELEIDEDEYKEKVHEMQKLIGLASYVGYKFRFDLLYYINTLAQHILFP FT SRQVLDMTYELIQFMWDTRDKQLIWHKNKPTKPDNKLVAISDASYGNQPYY 
FT KSQIGNIFLLNGKVIGGKSTKASLTCTSTTEAEIHAVSEAIPLLNNLSHLV FT QELNKKPIIKGLLTDSRSTISIIKSTNEEKFRNRFFGTKAMRLRDEVSGNN FT LYVYYIETKKNIADVMTKPLPIKTFKLLTNKWIH" XX SQ Sequence 5295 BP; 1962 A; 1124 C; 830 G; 1379 T; 0 other; tggtagcgcc tatgcttcgg ttacttctaa ggaagtccca tcaaatcaag atccgttagc 60 cgtttcagct tccaatttac cggaatttga tagagattcc actaaggtta attctcaaca 120 agagacaaca cctgggacat cagctgttcc agagaaccat catcatgtct ctcctcaacc 180 tgcttcagta ccacctccac agaatggaca gtaccaacag cacggcatga tgaccccaaa 240 caaagctatg gcctctaact gggcacatta ccaacaacca tctatgatga cgtgttcaca 300 ttatcaaacg tcacctgcgt attatcaacc ggacccacac tatccgttgc cacagtatat 360 cccaccactg agtacttcct cacctgatcc aatcgattta aaaaatcaac actctgaaat 420 acctcaagct aagacaaagg tgggaaataa cgtcttacca ccacacactt taacatcaga 480 agaaaacttt tctacatggg ttaaatttta catcagattt ttgaagaact ctaatctcgg 540 tgacatcatt ccaaatgacc agggtgaaat caaaagacaa atgacttatg aagaacatgc 600 gtatatatac aataccttcc aagcatttgc cccatttcat ttattgccaa catgggtaaa 660 acaaatttta gaaattaatt atgctgacat ccttacagtc ctttgtaaaa gtgtgtccaa 720 aatgcaaact aacaatcaag aattaaagga ttggatagct cttgccaacc ttgagtacga 780 cggaagtaca tctgctgata catttgaaat tacagtcagt acgatcattc agaggctaaa 840 agaaaacaat atcaatgtta gcgacagatt ggcctgtcaa ctaatactta aaggtctatc 900 cggtgacttc aaatacctac gtaatcaata tcgtaccaaa acgaacatga aactttccca 960 attattcgct gaaattcaat taatatatga cgaaaataaa atcatgaatc taaataaacc 1020 gtcccaatac aaacaacaca gcgaatacaa aaatgtttct cgcacatctc caaacacgac 1080 taacacaaag gttacaactc gtaattatca tagaacaaat agttcaaaac caagagcagc 1140 aaaagctcac aatattgcta catctagtaa attctcaagg gtgaacaatg atcacattaa 1200 tgaatcaacc gtttcatcac aatacttaag cgatgacaac gaacttagtc ttaggccagc 1260 aacagaaaga atctaagcca acacacacaa tagactcgaa tgacgaacta cctgatcacc 1320 ttcttattga ttcaggagct tcgcaaacgc ttgtcagatc agcccattat ttacaccatg 1380 caacacccaa ttctgaaata aacatagtcg atgctcaaaa acaagacatt cctataaatg 1440 ccattggtaa tcttcacttc aactttcaga acggcaccaa aacatcaata aaagcactac 1500 acacaccaaa catagcctat 
gatctattaa gtttgagtga gctggctaac caaaatatta 1560 ctgcctgctt taccagaaac actttagaaa gatcggatgg tacagtacta gctcccatag 1620 tcaaacatgg agacttttac tggttatcta aaaaatacct aattccttcg cacatttcaa 1680 agctaacaat aaacaacgtc aacaaaagca aaagcgtaaa taaatatcca tatccgttaa 1740 tacatcgaat gcttggacat gctaacttcc gaagtattca gaagtctctt aagaagaatg 1800 cagttacata tttgaaggaa tcggatattg aatggtctaa cgctagcaca tatcaatgtc 1860 ctgactgtct aatcggcaaa agcacgaaac atagacatgt caaaggatca cgactaaagt 1920 accaagaatc atatgagcct tttcagtact tgcataccga tatatttggt cctgtacatc 1980 acttaccgaa aagtgcacct tcttacttta tatcgtttac agatgagaaa accagattcc 2040 aatgggtgta cccattacac gaccgtcgtg aagaatctat cctcaatgtt tttacatcga 2100 tattagcatt tattaagaac caattcaatg ctcgcgttct agttatccag atggatcgtg 2160 gctccgagta cactaacaaa actcttcata agttctttac gaacagaggt attactgcat 2220 gctatacaac cacggcagat tctagagcac acggtgtcgc tgaacgatta aatcgtactt 2280 tattaaacga ttgtcgcaca ctgcttcatt gcagtggtct accaaatcat ctatggttct 2340 cagcagtcga attttctact ataatcagaa attcattagt ctcaccaaaa aacgataaat 2400 ctgcaagaca acatgcaggt ttagctggac tggacattac tactatacta cctttcggtc 2460 aaccggttat agttaacaac cataatcccg actcgaaaat acatcctcgt ggcattccag 2520 gttacgcctt acatccgtca cgaaactctt atggctatat tatctatctt ccatcattaa 2580 aaaagacagt agatactacc aattacgtta tattacaaga caaccaatcc aaattggacc 2640 aattcaatta tgatacactc acctttgatg atgatctcaa tcgtttaaca gcccataacc 2700 aatcttttat tgaacaaaat gaaacagagc agtcatatga tcaaaataca gaatctgatc 2760 atgactatca atcggagatt gaaataaact ctgatcctct agtgaacgac ttctcgtccc 2820 aatcaatgaa ccccttacaa ttagaccacg aaccagtcca aaaagtacgt gcaccaaaag 2880 aagttgatgc cgacatatct gaatacaata ttcttccatc tcctgtacga tctcgtacac 2940 cccatatcat taataaagag agtaccgaaa tgggtggtac cattgaatca gatactactt 3000 cacctagaca ctcgtctacc ttcactgcac gaaaccaaaa gcgacctggt agtcccaatg 3060 atatgattga tttgacctca caggatagag ttaattatgg acttgaaaac atcaaaacta 3120 cacgtttggg tggtacggag gaaccatata ttcaacgaaa tagtgataca aatatcaaat 3180 acaggactac aaatagtacg ccctcaatag 
atgaccgttc gtccaacagt gaatccacta 3240 ctcccatcat ctccatagaa acaaaggctg tatgtgataa tacaccctcc attgatacgg 3300 atccgccaga atatcgatct tctgaccatg cgactcctaa tataatgcct gacaaatcct 3360 caaaaaatgt tacggctgat tctattcttg acgacctccc acttcctgac ttaacccata 3420 aatctcctac ggacacttct gatgtttcaa aagatattcc acacatacac tctcgtcaga 3480 ctaattccag tttgggtggt atggatgatt ctaatgttct gactactacc aaaagtaaga 3540 aaagatcatt agaagataat gaaactgaaa ttgaggtatc ccgagacaca tggaataata 3600 agaatatgag aagtctggaa ccaccaagat cgaagaaacg cataaattta attgcagcaa 3660 taaaaggagt gaaatcgatc aaaccagttc gaacgacctt aagatatgat gaagcaatta 3720 catataataa agacaacaaa gaaaaagaca gatatgttga agcttatcat aaagaaatta 3780 gccaactatt gaaaatgaac acttgggata caaacaaata ttatgataga aatgacatag 3840 atcctaaaaa agtaataaac tcaatgttta tatttaacaa gaaacgtgat ggtacacaca 3900 aagctagatt tgttgcaaga ggcgacattc aacaccccga tacatatgat tctgatatgc 3960 aatccaatac cgtacatcac tatgcactga tgacgtcact gtcaatcgca ttagacaacg 4020 actattatat cacacagctg gacatatcct ctgcttactt atatgctgat atcaaagaag 4080 aattatacat aagacctcca ccacatttag gtttgaatga taaattacta cgtttgagaa 4140 aatcactcta tggtttgaaa caaagtggtg caaactggta tgaaaccatt aaatcatatt 4200 taataaattg ttgcgacatg caagaagttc gcggatggtc atgcgtattt aagaatagtc 4260 aagtaacaat ttgcttattc gttgatgata tgatattatt cagcaaagac ttaaatgcaa 4320 ataagaaaat cataacaaca ctcaagaaac aatacgatac aaagataata aatctgggtg 4380 aaagagataa cgaaattcag tacgacatac ttggattaga gatcaaatat caaagaagca 4440 agtacatgaa attaggtatg gaaaaatcct tgacagaaaa attacccaaa ctaaacgtac 4500 ctttgaaccc aaaaggaaag aaacttagag ctccaggtca accaggtcat tatatagacc 4560 aggatgaact agaaatagat gaagatgaat acaaagagaa ggtacatgaa atgcaaaagt 4620 tgattggtct agcttcatat gttggatata aatttagatt tgacttacta tactacatca 4680 acacacttgc tcaacatata ctattcccct ctagacaagt tttagacatg acatatgagt 4740 taatacaatt catgtgggac actagagata aacaattaat atggcacaaa aacaaaccta 4800 ccaagccaga taataaacta gtcgcaataa gcgatgcttc atatggtaac caaccatatt 4860 acaagtcaca aattggtaac attttcctac tcaacggaaa 
agtgattgga ggaaagtcga 4920 caaaggcttc gttaacatgc acttcaacta cagaagcaga aatacacgca gtcagtgaag 4980 ctataccgct attgaataac ctcagtcacc ttgtgcaaga acttaacaag aaaccaatta 5040 ttaaaggctt acttactgat agtagatcaa cgatcagtat aattaagtct acaaatgaag 5100 agaaatttag aaacagattt tttggcacaa aggcaatgag acttagagat gaagtatcag 5160 gtaataattt atacgtatac tacatcgaga ccaagaagaa cattgctgat gtgatgacaa 5220 aacctcttcc gataaaaaca tttaaactat taacaaacaa atggattcat tagatctatt 5280 acattatggg tggta 5295 // ID MGRL3_LTR repbase; DNA; FNG; 250 BP. XX AC AF314096; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Magnaporthe grisea gypsy-type retrotransposon MGRL3_LTR, long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Long terminal repeat; MGRL3_LTR; gag; pol; KW retrotransposon. XX OS Magnaporthe grisea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-250 RA Kang S.; RT "Organization and distribution pattern of MGLR-3, a novel RT retrotransposon in the rice blast fungus Magnaporthe grisea."; RL Fungal Genet. Biol 32(1), 11-19 (2001). XX DR Genbank; AF314096; Positions 1122 1371. XX SQ Sequence 250 BP; 47 A; 67 C; 62 G; 74 T; 0 other; tgttacggag tggcgtgcct gggcgactgc caaccgtgac tagctccacc tcacgtgaca 60 aggcgccagg gtgagtttgt tatttgttcg cttaggcgcg tgtacgcgat tggcactgcg 120 tgcctttctt cttcctttac acttagcttt agatcttcga atagaatcgc ccctatattg 180 cagccttgtc tctctgggat cgctttcgtg acattatcta aagctcaacc aatcagctac 240 cgggaacggt 250 // ID Gypsy2-LTR_AO repbase; DNA; FNG; 272 BP. XX AC . XX DT 25-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy2_AO LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy2_AO; KW Gypsy2-I_AO; Gypsy2-LTR_AO. 
XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-272 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-272 RA Kapitonov V.V. and Jurka J.; RT "Gypsy2_AO, a family of Gypsy LTR retrotransposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 4-4 (2006). XX DR [2] (Consensus) XX CC This is a LTR of Gypsy2_AO. Solo LTRs and proviral copies are CC flanked by 5-bp TSD. XX SQ Sequence 272 BP; 64 A; 80 C; 68 G; 60 T; 0 other; tgtaacggcg ggttgcctca ggcaactaga tcccccgcag gcgagccgcc agcgggccgt 60 tggcttacgt aacgcacaaa tatatttatg ttagaatcgc gcagaggatt ccgatcctca 120 agcgattcgt tctgttatac tagatagggg aactgccata caatcctaaa caaccttgag 180 cttcctagta ctgccttcct gaagcccgtt ggccgaccgc gctccgcagg cctaccgcct 240 atagatagta gtacgccggc agagcgctta ca 272 // ID Gypsy-100_MLP-LTR repbase; DNA; FNG; 333 BP. XX AC AECX01000471; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-100_MLP_; KW Gypsy-100_MLP-I; Gypsy-100_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-333 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000471; Positions 1152 820. 
XX SQ Sequence 333 BP; 79 A; 54 C; 56 G; 144 T; 0 other; tgtcatagaa ggacgcctgg tcgtttcatc tgtttaattt agtttcatta tgttagtagc 60 tacaaatgtt atgtcctgag atcacgtctt gagattacgt gatacgttag atgtctgtgt 120 tatgttattt ttctttgttc ctgttcctgg tattttggtt ttgttacata tatgtagttc 180 tgttttcttt cttttagatt tctctttttc tttatcgaaa ttattattat cttcctgtct 240 ttttagaaaa aaacttttac ttcagaccag acttttagaa cactagttgt tcaagaccag 300 acctgtatcg aggatacaag gtagaccatc aca 333 // ID Gypsy-91_MLP-I repbase; DNA; FNG; 5727 BP. XX AC AECX01000209; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-91_MLP_; KW Gypsy-91_MLP-LTR; Gypsy-91_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5727 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000209; Positions 29576 35302. XX CC Positions [4599-5078] - Integrase core CC 'GTAC' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(408..4073,4077..5546) FT /product="Gypsy-91_MLP-I_1p" FT /translation="MDPATATLIAQLQEEVNRLSLNASQQNQQPPHTHTER FT VETIRVDLPIFNGKNDPEIWLTKVEAMLTSRKYPLERWTNTIVGCMKDEAE FT TWWYNLAKEYGYEGMSWELFKSKLLEHYNYSFKQLEARQAIDNLKFRTAEE FT YIEKFKRLSFKIPKEKLTDWETQYLFIRNLPHDLQRRVLSEKGETLDALAK FT SLRENERLSRSVFSNKSHSSPLPSDSRWQTNSNFRNRQPNHSHNNFRNSSA FT TPMDLDVAEVSKSRCYNCNKTGHFSKECRAPRKKTNQPNSNHRTQNKFSFN FT LMEDSDRIEDSGEGTVEVDLHYQEETTQMELDLHYLEGTPEIGFNPHFIDE FT NNIPDKLLDLVFNTPDNRTPMERLKNNLEVDEDVTHLHPEAIEFTPGLRDH FT YTRRNVNHTRGLLAQDEMMAVKKLWCETMSDGDEDLEHFLHEIEDVYLNPV FT SSHYETDIAAYSNQNESLDSHITTKTTEKDASSCSSSVLPPSLWDGKEQVY FT IQEDYQLNVDELLEIFAGYGDKESTPERIVIDLTKDEEEPQPLFVKEVEDF FT IEETPSSSQITLVDATEDIKEISIEEWRKSVSNGKRKYEPDILFPQEEISP FT RKKFPRRGTPYPYLPSNSTLSFNYLELNLSEGETQLPRFEFLAGGLRATTI FT IDTGAGTIYISKKFARTACLQGRMKVEICPARQVKVANGQTTIIRHKAKFS FT LRWGQDYLQHCEALITDLPGFDLVLGLPWLRKNEVVPDYKTLGWNFKYHHK FT DIKILPQNHPEGKQQTLFILDAEGRSSESIRMEAIARKFVPNCFKESISTD FT TKRKWSHIIDTGDAKPVKVHGRPYTPPEHLLIDKFVKEGLEEGIIRPSNSP FT WSAPIILVKKANGETCVCVDFRSLNALTRKNSYPLPRIDEAYQFLKGAIWF FT TCIDLKSGFWQVSMDPSSIEKTAFGTKKGSYEFLVMPFGLCNAPATFQSMM FT NDILRPVLDKCALVYLDDVVIYSSSFEQHIKDVTKVLQLLDGVNLVLSPKK FT CKWAHTDLIFLGHKVNGQGIQVDDEKIKKIQEWPTPTNITEVRGFLNLSTY FT YKRFIKDFSKIATPLYALTEGSPKKRAAISWGKEHDAAFKSLKHHLANTAL FT LSHPIPFRPFVIDTDASGFNIGAVLQQDPNSELIDSSFSLVEYNKKAKNST FT LRPIAYESRKLSKTEQNYSAQERELLAIVHALKKFRGFIEGSPILVRTDHE FT SLKHFKTQRHVNRLARFVDEIEFFNVLIIYRPGSQQLAADALSRRPNTEAD FT VDPPETAAPLFVTKQETLDSFDTITKYRDQLLKGVDPSLVGSGRFSIQDNT FT LMIQNLDNEDKLPVPSNRREAEEIIKGIHTDLGHRSVKDTLDAVQKRLWCP FT DMTKVVEGVIEVCNPCQLSAPPTSKQTTSLRVIERGGPFKKWGMDFVGPLP FT QTANGYQYLVTAIDYGTAWACAIPLKKQSAKAAIQMVKSLILQFGLPEEIT FT TDNGPEFDSYEFINFLKANLIQHNQTSPYHPQSNGLVERFHQTLISSVRKL FT VSPDSQNLWDEVLEQALFGYRVSKNATLGKSPYFLVHGIEPRLPHNPRFFG FT PVPPISTQEAELIYRKRNLDSKELNIARQEAINTANEKAQRRAAQSEETYV FT EQNFKEGDLVLRRFEDRPTKLHPRWDGPFVIQDIHPNGSCVLMTSGGHSLM FT LPTNQDRLKHYKGPPNNFFYASADVKRRDKWALQRRGMLSYNN" XX SQ Sequence 5727 BP; 1781 A; 1379 C; 
1060 G; 1507 T; 0 other; gaaatatttc tgcgattttt tttcgtagca tccgtttatt ttttatcgtt atctttcctc 60 gcctgatcag caggaaccgt tgaaagctta tagaagctta gtatcggaac cgtatctttc 120 cgaaggatct gttggaaccg ttctttccaa ggatcattaa tcgttttttt tccgttggag 180 tcattctcca aggactatta ttatatctct tacactccga atttagtgta agccgttttt 240 aagactatct ctctaattta tatatattca cgaccaagtt cactatactg acacagcttt 300 gcaaaccgtg ttgggataag tatagtccgg aagaacttgc cgaaatcttt tcgcgaagag 360 tctcccctaa aaatcaacat actagagatc agtcacttat atcagcgatg gaccccgcaa 420 ccgccacttt aattgctcaa cttcaagagg aagtcaacag actatccctc aacgcgtctc 480 agcaaaatca acaaccacct cacactcaca cggagcgtgt cgaaaccatc cgtgtggact 540 tacccatctt caatggtaag aacgatcccg aaatttggct aactaaagta gaagccatgc 600 ttacctctcg aaaataccct ttagagcgtt ggaccaacac tattgttggt tgcatgaaag 660 atgaggccga aacctggtgg tataacttag ccaaagaata cggctatgaa ggcatgagtt 720 gggaattgtt taaaagtaag ctactggaac attataacta ttcttttaaa caactggaag 780 cccgacaagc cattgataat ctcaaatttc gaaccgcaga ggaatatatt gaaaagttta 840 aacgcctcag tttcaagatt cctaaagaaa aacttacgga ctgggaaact caatatttat 900 ttataaggaa cttacctcac gatctccaga ggcgagtcct gtctgagaaa ggtgagactt 960 tggacgcctt agctaaatct ctccgtgaaa atgagaggtt gtctcgttca gtgttctcta 1020 acaagtctca ttcttcccca ttaccatctg attcccgttg gcaaaccaac tcaaatttcc 1080 gtaaccgtca accaaatcat tctcacaaca attttcgaaa ttcctcagcg actcccatgg 1140 atttagacgt cgccgaggtg agtaaatcaa gatgttataa ttgcaataag accggacatt 1200 tctcgaaaga atgtcgcgct ccaaggaaga aaactaatca accaaattcc aatcatcgca 1260 cacagaataa attctcattt aacttaatgg aagactccga tagaattgaa gattccggag 1320 aaggaactgt agaagttgat ctacattatc aagaagaaac tactcaaatg gaactggact 1380 tacattattt agaaggaact cccgagattg gatttaaccc tcactttatc gacgagaaca 1440 acatacccga taaactccta gacctagtat tcaatacccc cgacaataga actcctatgg 1500 aaagactcaa gaataactta gaagttgatg aagacgtcac tcacctacac cctgaagcca 1560 ttgaattcac gccgggtcta cgagaccatt acacccgacg caacgtaaac catactagag 1620 gtcttcttgc tcaagacgag atgatggctg tcaagaaatt atggtgtgaa actatgtctg 1680 
atggagacga agatctcgaa cacttccttc acgagatcga agatgtctat ttgaatcctg 1740 tctcttccca ttacgagacc gacatagccg cttactcaaa tcaaaatgaa agcttagata 1800 gtcacatcac cactaagaca accgaaaagg acgcctcttc ttgcagctct tcagtcctgc 1860 ctccttctct atgggacgga aaagaacaag tttacattca agaagattat cagctcaatg 1920 tagatgaatt attggagatc ttcgccggat atggtgataa agaaagtacc ccagaacgta 1980 ttgttataga tcttactaaa gatgaagaag aacctcaacc cttgttcgta aaagaagttg 2040 aggacttcat agaagaaact cctagttcaa gtcagatcac tctagttgac gcaacagaag 2100 acatcaaaga aattagtatc gaagaatgga gaaaatcagt cagcaatggt aaaagaaagt 2160 atgaacctga tattctattt ccacaagaag aaataagccc acgaaagaaa tttcctcgtc 2220 gtggtactcc ctatccttac cttccatcga attcaacttt gtcattcaat tatctcgaat 2280 tgaacctctc tgaaggagaa actcaattac cccgatttga attcttagcc ggtggtctcc 2340 gagccaccac aattatcgac actggcgccg gtacaattta tattagcaag aaattcgcca 2400 ggaccgcatg tcttcaagga aggatgaagg tcgaaatctg cccagctcga caagtcaaag 2460 tagctaatgg tcaaactacg atcatccgac acaaggctaa gtttagcctc cgatggggcc 2520 aagattacct ccaacactgt gaagccctca tcactgattt gcccggattt gaccttgttc 2580 ttggtctacc gtggttaaga aaaaatgaag ttgttcccga ttacaagact ctcggatgga 2640 attttaaata ccatcacaag gacatcaaga tcttacctca aaaccatcct gaagggaaac 2700 aacagacctt atttatactc gacgcagaag gaagatcctc cgaatcaatc cgaatggaag 2760 caatagctag aaagtttgtt ccaaattgtt tcaaggaatc aatatcaact gacaccaagc 2820 gtaagtggtc acacataatc gacactggcg acgccaaacc agtaaaagtt catggcagac 2880 cttatacgcc accggaacat ctcttaattg ataaatttgt gaaagaagga ctcgaagaag 2940 gtattattcg tccttctaac tctccctggt ctgctccgat aattcttgta aagaaagcta 3000 atggagagac ttgtgtttgt gtagatttca gatctctcaa tgcattgaca agaaagaatt 3060 cgtatccact ccctagaatt gatgaagcat atcaattcct caaaggcgca atttggttca 3120 cgtgcataga tttaaaaagt gggttttggc aggtttcaat ggacccatct tcaattgaaa 3180 agacagcctt cgggaccaag aaaggatcat atgaattctt ggtcatgcca tttgggctgt 3240 gcaatgcgcc agcaaccttt caatctatga tgaatgatat actgagacct gttttggata 3300 aatgtgcttt ggtttatcta gacgatgtag tgatctattc atcatctttc gaacaacaca 3360 tcaaggatgt 
gacaaaagtt ttgcaacttt tagatggtgt gaaccttgtc ttatcaccaa 3420 agaagtgcaa atgggcacat actgatttga tttttttagg acacaaagtc aatggtcaag 3480 gtattcaagt tgacgacgaa aaaatcaaga aaatacaaga atggcctacc cctacgaaca 3540 ttaccgaagt tcgtggattt cttaatttat caacctatta caagcgtttt atcaaggatt 3600 tttcaaaaat tgcaacccca ctgtacgcct tgacggaagg ctccccgaaa aagcgggctg 3660 caatttcgtg ggggaaggag catgatgctg ccttcaaatc cctcaagcat catcttgcca 3720 acaccgccct cctatcccat ccaattccat tcagaccttt cgttattgat actgacgcct 3780 ccggtttcaa cattggcgcc gtcttgcagc aagatccaaa ttcagaactt attgactcct 3840 ccttctcact tgtcgaatac aacaagaagg ccaagaactc gacccttagg cccattgcct 3900 acgaatctcg aaagctctcg aagacggagc agaactactc agcgcaggag cgtgaactcc 3960 tggctatcgt acatgcattg aagaaattcc gaggattcat cgaaggatcc cctatcctcg 4020 tacggactga tcacgagtcc ctcaagcatt tcaaaactca acgccacgtc aattgaagac 4080 tcgcaagatt tgtggacgaa attgaattct tcaatgttct aatcatttat cgacccggct 4140 ctcaacagct cgccgccgat gctctctctc gacgcccaaa cactgaagca gatgttgatc 4200 cacccgaaac cgccgctccc ttattcgtta ccaaacagga aactttagac agtttcgaca 4260 ctatcaccaa gtaccgcgac caactcctta aaggcgtcga cccctctctc gtaggatctg 4320 gaagattctc cattcaagat aataccttga tgatccagaa cttggacaac gaagacaaac 4380 tcccagtgcc tagcaaccgc cgcgaagccg aagaaatcat taaaggcatc cacactgatt 4440 taggacatcg aagcgtcaag gatacattag acgcagtcca gaaaaggctt tggtgccctg 4500 atatgaccaa agtcgtggaa ggtgtcatcg aagtctgcaa cccttgccaa ctttccgcac 4560 ctccaacctc aaaacaaacc accagcctcc gagtcataga gcgtggtgga cctttcaaga 4620 aatggggaat ggactttgta ggtcctctcc ctcaaaccgc aaacggttat caatacctag 4680 tcaccgccat tgattatggc acagcttggg cttgtgcgat cccgctcaag aagcagtctg 4740 ctaaagctgc aatccaaatg gtaaagagtc tcatcttaca gtttggactt cccgaagaaa 4800 taaccactga caatggccct gaattcgact cctatgaatt cattaacttc ttaaaggcca 4860 atctcatcca acacaaccaa accagccctt accaccctca atccaatgga ctcgtcgaaa 4920 gattccatca aactttaatc tcatcagttc gaaaacttgt ctcgcctgac tctcaaaatc 4980 tctgggatga agttctcgaa caagccctct ttggctaccg tgtgtccaag aacgcgacat 5040 taggaaaatc tccctatttc 
ctagtccacg gtatagaacc acgcctacct cacaatcctc 5100 gcttctttgg tccagtacct cctatctcaa ctcaagaagc cgaattaatc taccgaaaac 5160 gcaacctaga ttcaaaagaa ctcaacatag cccgacagga agctatcaac accgccaatg 5220 agaaagctca aagacgtgcc gctcagtctg aagagactta tgtcgaacaa aacttcaaag 5280 aaggcgatct cgtcctgaga agattcgaag accgcccaac caaacttcac cctcgatggg 5340 acggaccctt tgtcatccaa gacatccatc caaatggaag ttgtgtgctc atgacttctg 5400 gaggacactc tctaatgtta ccaaccaacc aagaccgctt aaaacattat aaaggtcccc 5460 ccaataactt tttctacgcg tccgccgatg ttaaaagacg cgacaaatgg gctctgcaac 5520 gcagaggaat gttaagctat aacaattaag aattgtttct tttccttttt atttcttata 5580 tttatttttc tctcaattgt tttcttataa ctaacttgta gtagtataat ggatatgttt 5640 caatttatat tttatttgtt ttcctttata cttctttcca aatatttctc ttcttgaata 5700 cggtgcttcg aactggagaa gggatga 5727 // ID Gypsy-68_MLP-I repbase; DNA; FNG; 8568 BP. XX AC AECX01001167; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-68_MLP_; KW Gypsy-68_MLP-LTR; Gypsy-68_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-8568 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001167; Positions 23706 15139. XX CC Positions [5820-6299] - Integrase core CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1610..3817 FT /product="Gypsy-68_MLP-I_1p" FT /translation="MLAHPHLFCTKHSKVVFALSYLTGAAIAWAQPLTQEL FT IDGSKSHLVTFDCFVQNFKSMYFDTKKKLKAERALQNLSQKTSVSAYAHEF FT NVHATATGWETPTLISQFKQGLKKEIRVAMVMVQNPFESIEEIANFAIKID FT SKLHGVSENTISSSVAVDPNAMDISASFVRLSDEERERRLNTGNCFKCNGH FT GHRARYCTENRGSQARGNGKGKSGFRARIADLESQLEAAGGRGESQVKQES FT TSRAETSKKWRSSGMTVVPILSQQGESDGIGLGANTFVLCNEDDPHLFLHT FT TISISHFPGGTPQIMFPARLLIDSGATHDVLGEFFATRRGLIQHTTQNTRV FT VTGFDGSKSHASYDIDLHLDNNDQATPFIKLKDSYDGVLGIPWIKKNTDWI FT DWRKGTLTQQNHIAALSQALYCPKKPSDNHEMDPRREARDIYEGMCIDTNT FT LTSPQCELNLPITPIHPRTVGKLPFPTTSQIESTTAHESPDLLTEYKDKIL FT TTTPVATVVALSIPHHTPEDLERKPTGHTRISDQGASILTDATVPPQCECD FT PVPISNVTKSAGQPLHHLNNRPLVIDKAKTSWSTSAKLAAEAKKLEPVKTV FT EELVPSYYHRHLDMFWKSKSQGLPPHRQYGFKVDLVPGARPQASRIIPLSP FT AENKALNKMINSGLANGTIRRTTSPWAAPVLFTGKKDGNLRPCFDYQKLNA FT LTVKNKYPLLLTMNLVDSLLDADTFMYQVKSTERIR" FT CDS 3998..4909 FT /product="Gypsy-68_MLP-I_2p" FT /translation="MTYTKKGTHHKEAVDSVLDILSKQRLWLKPEKCEFSK FT SEVEYLGLLISHNKVKMDPMKVKAVTNWPPPHNSQELQRFIGFSNFYCRFI FT NHFSSTTRPLHNLTRTNTPFIWDEACDKAFKKLKNMFTSAPVLKIADPYKP FT FILECDCSDFVLGVVLSQVCDKDGELHPVAYLSRSLVQAEQNYEIFDKELL FT AIVASFKEWRHYLEGNPNQLDVIVYTDHRNLESFMTTKQLTRRQARWAQIM FT GCFDFQIKFRPGRQAAKPAALSRRPDLAPDAAENLMFGQLLRSENIGPNTF FT SLRTPTKERLGV" FT CDS 5316..6623 FT /product="Gypsy-68_MLP-I_3p" FT /translation="MQLNNSSEKGVDLANIETFFKDENVELENAEHWFEID FT VIGISEDKTTDKTILTNDKSISKIRTLNTNPVSSKIHQLTKNYSFVDGILY FT NQGRIEVPEDNHIKFQIIMSRHDSLLAGHPGQAKTLSLVRRSFIWPLLKAY FT VNKFVDGCDSCLRVKSSTKQPFGTLKPLPVPAGPWTNISYDLITKLPVSNG FT NNSILTVLDRLTKMAHFVACKETMSSEDLADLMLKNVWKLHGTPKSIISDR FT GSIFVSQITQEVDKRLGIRLHPSTAFHPQTNGQSEIVNKVIEQYLRHFVDY FT RQEDWEAILPMAEFTYNNKDHESIGISPFKANYRFNPIFNKVPSAEQCVPA FT VEARLKLLEEVQTELTTCLTLAQDTMRHQFNKHVRKTPEWKVGDKVWLSSK FT NISTTRTSPKLDHRWMGPFPIVKRCRTYNPTRVIQSWKRIIY" XX SQ Sequence 8568 BP; 2774 A; 1902 C; 1730 G; 2162 T; 0 other; tattgtcgca tcttccaacg ggattcaagg accaagccta gatcgattca tcagcaggcc 60 aaaccaccaa attcacagat ttagaacccc 
cgctgcaacc ttgtttatta catcagatta 120 tgaaatgagt tcatgcggac agatagtcga cgcgtagaaa tgtgtggtac ctggacagaa 180 caggtacagt atgagttcta gtgaactcgt aaggtagtaa agagagggcg gagaaagtcg 240 acgccttacc tggacagaac aggtacagta tgagttctag tgaactcgta aggtagtaaa 300 gagagggcgg agaaagtcgc cgccttacct gcgaaaagca ggatacgata gagtggtagg 360 aagtttaaga acttcagttt ccacgcgatc tatttcaaga gataacacac acacaatagt 420 acttagagaa atatgaaaag aaatattcac ctggacagaa caggtacagt atgagttcta 480 gtgaactcat aaggtagtaa agagagggcg gagaaagtcg acgccttacc tgcgaaaagc 540 aggatacgat agagtggtag gaagttctat cgtagagagt gagtacagtc caggattcat 600 attgagagaa aaagattaga gaaaaatatg ctcactaaga acttcagttt ccacgcgatg 660 gggaagaaac cgaagtttga gagtggcagg gattataaga gaaattttcg acaacaaatt 720 tttggagggc taatcccaaa taggaaaggt aaaacgcaca caaaacagga acaaaagaac 780 attgactaat ctggacaagg atgaattaaa gctctacgct ttgggatcaa tctgacatga 840 tcttctaaat cacacaatga aactactcca tacctccttc ttctacattc tccccagcta 900 ccaccggtgc cttctacatc gcatcatgat ggacttgacc tcttgagaac tgtaccacaa 960 cagattagat tgaatccaaa ctcacatcag attgaaaaga ttaagattga aactcaagag 1020 actgtcagaa gattgtcaga agattaagat tagaaataga aatagattaa gaagcacacc 1080 aaatacctgc tttaacacca ttgtcaatct caaagatatt ccaaacccaa accttactac 1140 aattagatca tcaatttagc cgacttctca aaccttaatc tagtacaagc aagacatttg 1200 ccaccacatc ctcttcttac agaaccccca cagccagcaa ctctgattca gatactgaac 1260 ccgttaacta ctttgttgac gccggcaccg ttatctctga caccacgtac tcacctgcca 1320 aaatgggggg aatacaaagc cagttggcag aactgaccat gctgttagct gaagaacggc 1380 ttgccagaca acaggcagaa gcttgatacc agcaatccga ggcttgtgtt aatgcgttat 1440 tagccaacca ccaggatgtt cctaccaacg ccccccccct cagcctgcat ccgcaaccat 1500 acccgtgcaa gaacagcatg agaagggacc caaagttgca accccagata agtttagtgg 1560 cactaggggc agaccagcca gaatctacgc tagccaagtc caactatata tgttggcaca 1620 cccccacttg ttttgcacca aacacagcaa agtcgtgttt gccctgtcct acctgactgg 1680 agctgcaatt gcttgggccc aaccgctcac tcaggagctc atagatggat ccaaatctca 1740 tctcgtgacc tttgactgtt ttgttcaaaa cttcaaatct atgtattttg 
atacaaaaaa 1800 gaagttgaag gcagaaagag ctctacaaaa cctctcccag aagactagcg tttctgcgta 1860 cgcacacgag ttcaatgtcc atgccaccgc taccggatgg gaaactccta ccctgatcag 1920 tcagttcaaa cagggactga agaaggagat aagagtagct atggttatgg ttcaaaatcc 1980 atttgagtca attgaagaga tagccaattt cgcgatcaag attgatagca agcttcatgg 2040 agtgtccgaa aacaccatct ctagctcagt agccgtcgac cccaatgcaa tggacatatc 2100 agccagtttt gtacgattga gtgatgaaga gcgtgaacgt cgtttgaata ctgggaattg 2160 ctttaaatgt aacggtcacg gtcacagagc tagatattgt actgagaata gaggtagtca 2220 agcgcgagga aatggaaagg ggaagtctgg ttttagagct aggattgcag atttagagag 2280 tcaattagag gcagctggag gtagaggtga atctcaagtg aagcaagaaa gtacgagcag 2340 agctgaaaca tcaaaaaaat ggaggagctc aggaatgacg gttgtgccaa tcctgagcca 2400 acaaggggaa tctgatggaa taggattggg tgctaacaca tttgtattgt gcaatgaaga 2460 tgatccacac ttatttttac acaccaccat ttctatttcc cacttccccg gaggcacccc 2520 acaaattatg ttccctgccc gcttgttgat tgactccgga gctacccacg atgtcctagg 2580 tgagttcttt gctacgcgca ggggtctcat ccaacacact acacaaaata caagagtcgt 2640 cactggattt gacggatcta agagtcacgc ctcttacgac attgacctac accttgacaa 2700 caacgaccaa gccacacctt ttattaaact caaagactca tatgatggag tcctaggtat 2760 tccctggatc aagaagaaca ctgactggat tgactggcgt aaaggcaccc ttactcaaca 2820 gaaccacatt gcagccttgt cacaggcttt gtattgcccg aaaaaaccct ctgacaacca 2880 tgaaatggat cccaggaggg aagctaggga catttacgag gggatgtgta ttgataccaa 2940 tacattaaca tccccgcaat gtgagctcaa tttgcctatt actcctatac atcctagaac 3000 agttggcaag cttcctttcc ctactacatc acagattgaa tcaacgacag cacacgaatc 3060 accagacctt ctgactgagt acaaagacaa aatcctgact actacacctg tggctactgt 3120 agtagccttg tccattccgc accacacccc tgaggacctc gagaggaagc ctacagggca 3180 cacaaggatc agtgaccagg gggccagcat tcttacagat gctacagtgc ccccgcaatg 3240 tgagtgtgat cctgtgccta tctcaaatgt aaccaaatca gctggccagc ctttacatca 3300 tttgaataac aggccattag tcattgacaa agctaagaca tcatggtcaa cttcagctaa 3360 gctcgcagca gaagccaaga agctggaacc agtgaagacc gttgaggaac ttgtaccaag 3420 ttactaccac aggcacttag acatgttttg gaaatccaag tctcaaggat taccaccaca 
3480 ccgccagtac ggcttcaagg tagacctagt accaggtgca cgacctcaag ccagccggat 3540 tatcccatta tcacccgctg agaacaaagc actcaacaaa atgatcaact caggacttgc 3600 taacgggact atacgcagaa ccacatcccc ttgggcggcc ccagtactat tcacaggtaa 3660 aaaagatggt aatctgaggc catgctttga ttatcaaaaa ttaaatgctc tcaccgtgaa 3720 gaacaaatac ccactgctgc tcaccatgaa cctagttgac agcctactgg acgccgacac 3780 gtttatgtac caagttaaat ctacagaacg catacggtaa tctccgagtg gctgaggaag 3840 acaaagacac gcttgcgttc atctgcaaac aatgccaatt cgcacccctg acaatgccgt 3900 ttggaccaac tgaagtcccc ggatacttcc agttttcata caagacatac tggtgggacg 3960 tatcggcaag gactcagcag cttacctaga tgacacgatg acatacacaa agaaaggtac 4020 tcaccacaag gaagctgttg attccgtact ggacatcttg agcaaacaac gactctggct 4080 caaaccggag aaatgtgagt tctccaagtc cgaagttgag tacctaggcc tcctgatctc 4140 gcataacaag gtcaagatgg acccaatgaa ggtcaaagca gtcaccaatt ggccgccacc 4200 gcacaactca caagaactcc agcggttcat tggcttttcc aacttctact gcagattcat 4260 caatcacttc tctagcacaa cacgcccgct gcacaactta actcgaacta ataccccttt 4320 catctgggat gaagcctgtg acaaagcttt caagaaactc aaaaacatgt tcacctctgc 4380 cccggtattg aagatagcgg atccgtataa gccattcatt cttgaatgtg actgttcaga 4440 ttttgtgttg ggggtggtac tctctcaagt gtgtgataag gacggcgaac tacatccagt 4500 tgcctatcta tccaggtctt tagtgcaagc agaacaaaac tacgagatat tcgacaaaga 4560 attgctagca attgtggcct ctttcaagga atggcgccat tacttggaag gcaacccaaa 4620 ccagcttgat gttattgtat acaccgatca caggaactta gaaagtttta tgacgaccaa 4680 acaactcacc agaaggcaag ctagatgggc acaaatcatg ggctgtttcg actttcaaat 4740 aaagttcagg cctggaagac aggccgcgaa acctgccgca ttgtcacgac gcccggactt 4800 agcccctgat gcagcagaaa acctcatgtt tgggcaactc ttgcgttcag aaaacattgg 4860 acccaacaca ttttcattga gaacaccgac caaggaacga cttggtgtgt aattccatga 4920 aaggaattgt gtaagaattt atgcttatag tgggggcaac tggttgtgga gcgcaagcgg 4980 aagttggaaa attgttttct accacaccag tctcccacga gcttaagcag ccgagtctta 5040 cacatctcaa tctcgcgtaa ttgtttagtc acaatctctt aagtcatact tcgttctgaa 5100 atgacctccc acccacccgg caaagccagg ggcatcaagg tttgcaaagc gagaaggatt 5160 
ttctcagcca cgcaaaccaa gaggcccggc ggcggcgccc ccccgtacta gttgtgctca 5220 gcaactcaaa gtgagcgcaa cgggtggagg tagtttcagg atgcggtagg catctgggaa 5280 caaacaataa tcctacaaag gattgtaaaa cataaatgca attgaacaat tccagcgaga 5340 agggagtaga ccttgcaaat attgaaacat ttttcaaaga tgaaaacgtt gaactggaga 5400 atgctgagca ctggtttgag attgacgtaa taggaatttc agaagacaaa actacagaca 5460 aaactattct cactaatgac aaaagtatta gcaaaatcag aaccttgaat accaacccgg 5520 tctcttctaa aatacaccaa ctgacaaaaa actactcatt tgtagatggt atattataca 5580 accaagggcg gatagaggta ccggaagaca accacatcaa atttcagatc attatgagcc 5640 gacatgacag tcttttagcg ggacacccag gacaggcaaa gactttaagt ctagtacgga 5700 ggagcttcat ctggcctttg cttaaagctt atgtgaacaa gtttgtcgac ggctgcgact 5760 catgcctaag ggtaaagtct agcaccaaac aaccgtttgg cacactcaaa ccgctaccag 5820 tccctgccgg accttggacc aatataagtt acgaccttat aacaaaacta ccggtatcca 5880 acggcaacaa cagcattctg acagttttgg accgtctcac taaaatggca cattttgttg 5940 cctgtaaaga aactatgtca tctgaagatt tggcggattt aatgttaaag aacgtatgga 6000 agttacatgg tacaccaaaa agcatcatct ctgacagggg cagcatcttt gtatcacaga 6060 tcacacagga ggtggacaaa cgtcttggaa taaggctaca cccttcaaca gccttccatc 6120 cacaaaccaa tggacaaagt gagattgtca ataaggtgat tgaacaatac ctgagacact 6180 ttgtagatta tcgacaagaa gactgggaag ctatattacc catggcggaa ttcacataca 6240 acaacaagga tcatgaatca attggaatat caccattcaa agcaaactac agattcaacc 6300 caatcttcaa caaagtgccg tcagctgagc aatgtgtacc agcagttgag gctaggttga 6360 agttattaga agaagtacaa acagagctaa ctacttgtct aacattagca caggacacaa 6420 tgcgccatca attcaacaaa cacgtccgga agacaccgga atggaaagtt ggtgacaaag 6480 tgtggcttag tagcaagaac atctcgacaa cgagaactag ccctaagttg gatcacagat 6540 ggatgggtcc tttccctatc gttaaacgtt gtagaactta taacccaaca agggttatac 6600 agagctggaa gaggataatc tactaataaa aatgactagt agaagttgtc taaggactgc 6660 tcttctaaaa caagtcaatc aaaaagagaa acctgttgac ttgagtactg acaaggcacc 6720 agagaaaagc acctgattta attaatctca acacaatttt agtgttattc cagtcgagtc 6780 aaaatgtaga aaaataatct caatatcttt ttgatattat tccagaatga cagtgtcata 6840 tgaagcacat 
aaaacaaatt tctcctctga tgaagggctg gtttgagtga cctagtactg 6900 ctataataaa tttattaaga taaataaaat taaagctctg taagagatga agaagaaagg 6960 aaacataggt tcgcacaaag ggcacaagtt tataaacaca tcaatagaaa ttgttagagt 7020 ggtaaacagc tcttcagaat cctctttacc atctaaaaaa ccaaaagatg actaaaatgt 7080 tggtacacag ctatgatcta atgattcaac cactctctac actcctaaat ttcgccatca 7140 gtagctcatt gacccattat gatcattact atgatgtaat gtgatcttct attcatcatc 7200 tgataagatt tattactaat ttgaccagta gagatgacta attcctattc ttatgatcaa 7260 atgctatgta tatgctatgt atatgctgtg tatatgctgt gtatatgctg tgttttgaat 7320 gttctagaat gaatattgat ttgatcttat gatgattcag catgtcaatg atgtaattgc 7380 ttgttattaa tgatttgtga tcttgttgat gtgtaatcaa atcctttgat tccttataac 7440 ttttgtcata cacctctgtt accaatgatt taaaggttgt acattatgaa ttgaatgttt 7500 ttgcgtcgaa tgaactatga ctcatcatgt tatgacgtca tttggcacat gtaatcagct 7560 gaaaaccttc attttcgact attttcacct attttcttca ctttacacac cttttactga 7620 gattttcttt ccttctgtaa cattacacag taatctaaaa ccgcggaaat ccctaaaaag 7680 actcttggta acctcattca ttactaatac ccctgttaca tatgtaaaac tacctgtaaa 7740 cccttcttta ccaacttcga ctttttccca tttccttaat aacttttatt ttctcttatc 7800 aaccaccaag attaaagttg ttactgacga aatattgctc ctctttcaca ttctaaccct 7860 aataaaccca taaacctaat atcaataccc taaaaaccat ttccaacctg tatacctcta 7920 ttttcaagtt tttcttatgt ttttttcttt ctttcttttt gtaatctgtt tgttaatagt 7980 ttataattat cttattaagt ttgtttaaat ggtacaatga tgtaaatatt aaactatatg 8040 taatataatt gatgtaattc aaaattcgat tttctttgat tttttgattg gtctcataga 8100 tgattaccac aacatgtatc agcatctact tacaaattga agttacccgt gtccatgaga 8160 aggattcacc cagttttcca tgtattggta ctacaaaaac ataatcccga agccattgat 8220 cagagaagaa gaagcacacc tccagcaatc gttatagaag gacaggacaa atgggaagtc 8280 ttgaccatct tagacagaag acaagaaggg aaaagcagtg agtacttagt agcgtggaag 8340 ggttttagca cagatcacaa ttcgtgggaa ctggaaatga acctcaagaa taaatagtca 8400 agatttagtc aatgaattta atttaaaata cctaaaggca atagagaaac ataggaggag 8460 gagaaggcgg tgagagggaa tagcttgttc ccattaggtt ttttaatgct gcccggggaa 8520 ggattgcaga actcgcaaga 
gggagtttgg gcgttaaaag ggggataa 8568 // ID Gypsy-2_TMe-I repbase; DNA; FNG; 5882 BP. XX AC CABJ01000834; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_TMe_; KW Gypsy-2_TMe-LTR; Gypsy-2_TMe-I. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-5882 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (13-FEB-2011). XX DR Genome; CABJ01000834; Positions 154286 160167. XX CC Positions [4221-4709] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(753..3005,3009..4823) FT /product="Gypsy-2_TMe-I_1p" FT /translation="MHVKRITDVATDLIMYKDRMERGSISKYILKRRFYDT FT MHPTLRREVEPLEEDGDDIAVIIARAERKDAILHETKAYKTQTSSSRNSQP FT RHKASSNSGDDTPRVFKKKTKSSKESRREKGECFECGKTGHMAKNCPSKRG FT KGKAIKQEASANNVESDNAQSSKYEEIHIGYINVECNSVRPPATFKAHTAL FT EGTILISGKKARVLFDTGAIGRDMISNAFVSVHGLPTRDLNQPIPVNMAVK FT GSRSTSHKTCSATIRLGSAVLPPQEMLIGNLAKYDALIGMPFLNRHKAIIR FT CGESTIDFPDQKVRVSCKPTSREYRAAVCEATTQKLLDEYSDVFPDQIPEE FT LPPMREVNHHIRLRDPSNLKNQPTYSIPDAYKSPLCDWINQQQRRGVIYRA FT EAPGAAPMFVQPKADGKRIRPLVDLKLRNENTVMDSSTIPNQTQIQNAVGR FT ARFRSKIDLSDAYFQTRLDPESKRYNSFKTPFGSFISRVMLQGDMNAPATF FT IRIMENLLSEYLHHFVWVYIDDILIFSNTATEHAYHIGLVCTKLREAKFYA FT SRNKSILFADKLDILGHVIDDEGIHAAPGKIACLTEWSTPRNQKELMRFLG FT LVNYISQFLPHHATVTAPLTDLTGNAEFVWTHTHDQAFTNIKELIKSVNVI FT QPIDYSKPEPIWLITDASDVGVGAWVGQGASHETARPAALHSRKFTNSQMN FT YGTTDKEALAVIDALAAFNHILMGNTFTIVTDHQPLTNLKTERIPGRRRTR FT IGEISKYDAKIIYTPGRTNYLADALSRLYENQDLANPPLVHDHTRQSEDSP FT DDTSYEQETSMAAASSTTPSNPFSYTMYSAAGTERGKDVDEEWKDNCTLRS FT TPPHELEQSRLHWSHCYHDTCPFHQTEIKFGNRYQRPYYMPADDSKNTSFD FT TVEGAEEVINEDESTQPPYTLNPVPTRYQGLEELEQAAAYVDAISYLHPPQ FT 
SAPPSSPPIIDSRPLYGGPLRVYNNEIIVSVIPPTIPEHGRQLYETFEEEE FT APIEPTRGSLIEEERDVRSTTEELTSSFRSQMIAGYRHDPIYFKAMDASRH FT GDLAPYNISNRMVFTSTRYGEHYLYVQKGHMFNGMTIREYVISEIHNKGHH FT SADRNLQYASEYVYWPEMRKDFRDFVRQCELCQANKERTQLPEGKAQTLQL FT PCEIFESYAIDFAGPFNKSAGYDTILVAVDRFAGYTWLIPMKVTDSSKDTW FT NNLQRHLFTPHGFPFSIVSDADPRFTTRFWKQTLTSLGINHIMAAPGHHET FT NGQAERKIREVKTGTRNMINERQTNWEPALVELAVYINAGFSDTLGMSPYK FT TVFGRQYPLLPTTLFKNSSVPASDDHLNRHQAIRNDAFQAVKAA" XX SQ Sequence 5882 BP; 1861 A; 1532 C; 1211 G; 1278 T; 0 other; tttttttaac ctcagcaagt ggttaagaac ctgtccctgg atttaggcct gatcaaccta 60 tcgatatcca cctcagaacg ctagcggaga gaagcggcag aatcacagca cattcggctt 120 catctcaaac caagatagag tctttgacga ccaggagtgg gaagcaaatc ccacggcaag 180 accttacaag aaagaaacaa ccccggagct ctgccaaaga gccgggcgat cctgcaaaca 240 cggaagacac acaaccccca ggagaattcc cttctactcc agcaaagaag ccttcctttg 300 aacctccaaa cgacccagaa tctagtagcg aatcagatat ggctagcaca ggaggagggc 360 cttctggagg agggaaatct cgggcaaggc ctgaagaaga ggaggtaccc gcagcatttg 420 aaccccctaa gtttaaactt cacctcccag cgaagttcac aggcgaaggg aaagacctca 480 aacctgaagc ctttgaacga tgggccacag aactctcaca cttcctactc ttacaaggaa 540 aagatgtgga agaccctaca accggactta tcattggatc attctgtgaa ggaaaggcaa 600 ccgacgcata ccaccaatgc tataaggact tcggcgggaa tgtaagatta agaggatcta 660 cagttctcga ttacctgaaa ggatgattcc aatcctctat aaccaacgac ctcctataca 720 acaaattcaa ccagctccag caagttaacg gaatgcatgt caagcgaatc accgatgtgg 780 ctacggactt gataatgtat aaggaccgaa tggagagagg aagcatctcc aaatacatcc 840 tgaagcgacg attctacgac accatgcacc ctaccctccg gcgagaagta gaaccactcg 900 aggaagatgg cgatgatatt gcagtaatta ttgcaagagc agaacgaaag gacgccattc 960 tccatgagac caaggcctac aaaacacaga cttcctctag tagaaatagt cagccaagac 1020 acaaagcctc cagcaactct ggagatgaca ccccacgggt tttcaagaag aaaacaaaat 1080 cctccaaaga atcgagacgg gaaaaaggag aatgttttga atgtggaaaa acaggacaca 1140 tggcaaagaa ctgcccttcg aagagaggaa aaggaaaggc aattaagcaa gaggcatcgg 1200 ccaacaatgt tgaaagcgat aacgcccaat ctagtaaata tgaagaaatc catatcggat 1260 atatcaatgt ggaatgcaac 
agtgttcgac ctccggctac attcaaggca cacaccgctc 1320 tggaaggaac aatcctaatc agtggtaaaa aggctcgagt cctattcgac actggcgcaa 1380 tcggaagaga tatgatcagt aatgcctttg tctctgtcca tggactcccc actcgagacc 1440 ttaaccaacc cataccagtc aatatggctg taaaaggatc acgatccaca agccataaaa 1500 cctgtagtgc aaccattcga cttggttcag cagtcctacc cccccaggaa atgctaattg 1560 gaaacttagc caaatacgat gccctaatag gcatgccatt cctgaaccgc cacaaagcta 1620 taatcaggtg tggggaatca accatcgatt tcccagatca gaaggtcaga gtctcatgca 1680 agccaacaag cagagaatac cgagccgcag tctgtgaagc aactacccaa aaactattgg 1740 atgaatactc tgatgttttc cctgatcaaa tacccgaaga actgccacct atgagagagg 1800 tgaaccacca catccgcctc cgagacccct ccaatctcaa gaaccaaccg acctactcta 1860 ttcctgacgc atacaaaagc ccactttgcg attggatcaa ccaacaacaa agaagaggag 1920 tcatctaccg ggcagaagcc ccaggtgccg ctcctatgtt tgttcaaccc aaagcagatg 1980 gcaagaggat ccgtcccttg gttgatctca aactacggaa tgaaaacacg gttatggaca 2040 gctccactat cccaaaccaa acacagatcc aaaatgcagt tggacgggct agattccgaa 2100 gtaagattga cctaagcgat gcctacttcc aaactaggtt agacccagaa agcaaacgct 2160 acaacagttt caaaacaccc ttcggaagtt tcattagccg agttatgtta caaggtgata 2220 tgaatgcgcc cgccaccttt atacgaatta tggaaaacct cttaagcgaa tacctccacc 2280 attttgtttg ggtctacatt gatgatattc tcattttctc gaacaccgca acagaacacg 2340 cctaccatat aggtttggtt tgtaccaaac ttagggaagc taaattctac gctagcagaa 2400 acaaatcaat cctgtttgcc gacaaactcg atattctagg acatgttata gatgatgaag 2460 gaatacatgc agccccagga aaaatagcct gtttaacgga atggtcaaca ccccggaatc 2520 aaaaggaact catgcgattc ctaggactag tcaactatat ttcccagttc cttccacacc 2580 atgccactgt aacagcgccg ctcacagacc ttaccggtaa tgcagaattt gtctggaccc 2640 acactcatga ccaagccttt accaacatca aggaattgat taaaagcgtc aatgttatcc 2700 aacctatcga ctatagtaaa ccagaaccga tatggttgat tactgatgca tcggatgttg 2760 gagttggagc atgggttggg caaggggcct cccatgaaac ggcaaggccg gcggcattac 2820 atagcaggaa gttcactaac tcacagatga actatggaac aacagacaag gaagcccttg 2880 ctgtgattga tgctctcgcc gctttcaacc acatcctgat gggcaatacc ttcaccattg 2940 ttaccgacca ccagcctctc actaatctta 
aaacagaaag gataccagga agaagaagaa 3000 caagatagat aggcgaaatc agcaaatacg atgcaaaaat tatctacacc cctggaagaa 3060 cgaactattt agctgatgct ctttcccgcc tatatgagaa tcaagacctc gcaaaccctc 3120 cccttgttca cgaccacaca agacaaagcg aggacagccc tgacgacacc tcctatgaac 3180 aagaaacatc gatggctgcg gccagctcta ctactccctc caatcctttc tcttacacca 3240 tgtacagtgc tgccggaaca gaacgaggaa aggacgttga tgaagaatgg aaagacaatt 3300 gcacactccg cagcacacct cctcacgaac ttgaacaaag tagactccac tggtcccatt 3360 gctatcacga cacctgccct tttcaccaga ctgagatcaa gttcggcaac agataccaac 3420 gaccgtacta tatgcctgct gatgattcca aaaacaccag tttcgacacc gttgaaggag 3480 cagaagaagt gattaatgaa gatgaatcaa cccaaccccc ttataccctt aatcctgttc 3540 caacacgata ccaaggtctt gaagaactcg aacaagcagc agcctacgtg gatgcaatat 3600 cctatttaca ccccccacag tcagccccgc catcctctcc tcccataatc gattcacgac 3660 cactatatgg tggaccctta cgagtataca acaatgaaat cattgtctcg gtcatacccc 3720 ctactattcc cgagcatggg cgtcaactct acgaaacctt tgaggaagag gaagccccta 3780 tcgaaccaac aagaggatcc ctaattgagg aggaacgaga tgtcagatca accacagaag 3840 aattaacctc ctctttccga agccaaatga tagcaggata ccgccatgac ccgatttact 3900 tcaaagccat ggatgcctcc aggcatggag acctcgcccc ctacaacata tccaatagaa 3960 tggtctttac cagcacccgt tatggagaac attacctata cgtacaaaaa ggccatatgt 4020 tcaacggaat gacgattaga gaatatgtca tatccgagat tcacaacaaa ggacaccata 4080 gtgcagatcg caacctccaa tatgcctcag aatatgtcta ctggcctgaa atgcgaaagg 4140 acttccgcga ttttgtcagg caatgcgaac tctgccaggc aaataaggag cgaacacaac 4200 ttcctgaagg gaaagctcag accctccaac taccatgtga aatatttgaa tcctatgcca 4260 tcgatttcgc aggaccattc aacaagtccg ctggatatga caccatattg gtagctgtag 4320 atcgattcgc tggatacacc tggttaatac caatgaaggt taccgactcc tctaaagaca 4380 cctggaataa cctacaacga cacttattta ccccccacgg gttccccttt tctattgtca 4440 gtgatgctga tccccgcttt accaccagat tctggaagca aaccttaacc tctctcggaa 4500 taaaccacat tatggctgcc ccaggacacc atgagactaa tggccaagca gaacgaaaaa 4560 tccgcgaggt caagaccggg actaggaaca tgatcaatga acgacaaacc aactgggaac 4620 cagccctagt tgaactagcc gtatatatca atgccggatt 
ttcagatacc ctgggaatgt 4680 caccatataa gacagtcttt ggccgccaat accctctctt acctacaact ctattcaaga 4740 actcctcagt ccctgcctcc gatgaccatc taaaccgcca ccaagcaatt cggaacgatg 4800 ccttccaagc tgttaaggct gcctgattcc gaagcaccac tactgcccag aaacgacgtc 4860 gtaaacatgt tcccatctca ccaggagaca tgctcatggt acatggtaac atgtttatta 4920 ccaatgtcgg acgcagcaag aaactacaac cccgatggcg aggacccttc aaagtcatat 4980 cctttgacga gcatacagag aattacactg tccagatgga tggacgcatg taccgccgca 5040 atactgcagt ctttcacgcc tcagctgtca aacgattcca tcccaatgac gactccaagt 5100 ttcctggaag aacccactct tggcctgcac ctatcatcat caacaagaag gatgaatggg 5160 aagttgaaga aatcaaagac caccgccttt ggcatggaaa tgatcagttc ctagtcaaat 5220 ggaaagtata tccaacgagc aaaaacagtt gggaaccagt ggaaggactt gagcacgcta 5280 tggatatggt gaatagatgg tggaaagaca acatgccaaa cgaatccctc cccccaacaa 5340 tcaattatat caccatgtgc tggacaccta caatgccaga aactgagcga tggcacgcag 5400 aactcgaaac ttgagatgga ttttgggcac cacactttga gtcagagttc gaaagtagcg 5460 atgaggagaa tcaagctatg gagcaggatc tcgacaactt tatctagtca ccaatcacca 5520 tccgaatacc ataacgatgt caaactatgt ggagctcaat gggaggcagc gacaacaggt 5580 tgttggacgg ttgttgaacc gcgatgttat ccaacttgaa gactacttcg aaatcctctg 5640 gattgactac tgtcctgatt gcgatcatgc ttgggttgat gtacagcacg aatctggtat 5700 tcgtgactac tgtgatttag acttctttgt gtctaagtgt ttccttagag tttccgcttg 5760 tccttggtgt gatacctatg atgaagagtt tttggcggaa actgaagcca tgagtttctc 5820 ggaatccgcc tttgatgatt attaggtttt gtttgtttaa acagttgttt tgttttgggg 5880 aa 5882 // ID Ylli repbase; DNA; FNG; 6872 BP. XX AC YLI319752; XX DT 28-JUN-2005 (Rel. 10.06, Created) DT 03-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Yarrowia lipolytica non-LTR retrotransposon. XX KW L1; Non-LTR Retrotransposon; Transposable Element; Ylli; L1-1_YL. XX NM L1-1_YL. XX OS Yarrowia lipolytica OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Dipodascaceae; Yarrowia. 
XX RN [1] RP 1-6872 RA Casaregola S.; RT "Direct submission."; RL Direct Submission to Genbank (28-JUN-2005). XX DR Genbank; YLI319752; Positions 1 6872. XX FH Key Location/Qualifiers FT CDS 365..2548 FT /product="Ylli_1p" FT /note="gag-like protein." FT /translation="MNIYNRARPEKTFTMETAQAQASAGNSLGAPNPPPPD FT LSRGDGATSTETPNQTDIEKIQKNDKNDENNKKNDKNDKNDKKNDNNITLN FT AWKSKAEVKALLTSNPSTLKFNLNKERISEPLAAPNGRHTSGRFNVSYVAS FT KENFFRRALDRSATPDWNLPISMFLEHLDNAAEVDEKDTISVLEGVIEKFD FT EIKGQVGLAEFDLSRHNLDIVPHLIDQINLEQDPEDCYQVGNGWHYLTSDA FT LTDPHEQRMAVILETVSLIVEANTQDYRIMRKGSANPAGRRVLYYFNLPSY FT MKMERQTEDAFKIHLNAILNQFDVDIDQAIPGSSLLSGTLLMTSANHQNIS FT GPVIAVRALLGSTLPQTIPDFFVNLDPTKARLTGSKLIKMHFANGDNICVK FT CKSTKHIREACPEKDMVTPKIFRTRAHQGTLPQRGKALASIHAPSTVEPPV FT QTRKFHHTPATHQWETVGTKSPRHRRTRDTSPQPTGQKPLRSYYNFEVLSD FT KTGEDTPEETEQTNQLPSTNQQHGTQNLPINIPASEEDTVPDDQETEQADV FT SMNGFDTQPDALAHPEVAPDVTQFLPPATPLNTEATNNGLPHDHTSPNTTN FT QTQPPPGSIHEGRPRGGHTTSSFSGEPSHTTLGKKHIAVFQTASSEVPVQI FT LNPDRSENRLCWLPSQEVAKFIDGRCPTFLSTCHFKVFHDGATLSSVDEIV FT EPPNSPPNPPRVTTLHEVDTMLRPGQNQ" FT CDS 2515..6453 FT /product="Ylli_2p" FT /note="reverse transcriptase." 
FT /translation="GGHNAPTGAEPVSMSFTSTTTPSKAPCKSTQAKMKSP FT NVKVITANIGGFKMAEALNRLPDTIVNIIRTTHYPDLVLLQETNWVDSSLQ FT TAQQIISNSPYPGGRHYTLLGSTAGVSNRSVGVGLVYSDNVTITNFTTVFE FT YFPALDNRLCLADVHIKGTDRHLSLINVYAPNEQGASPFTNRQFYQSLDDY FT LRLQPCQYPLIAAGDWNAVASNDGRIGEKVTTHLKDFLANWDLLDTYTLIS FT KHRKGLYTHTNNSRGAGRRLDQLHISSTLSQWIKSTQLVTNKQGHNITKSS FT HHAVQFVFNFGNLTQRQDRGPGTWRMPWWVFDTEYIAWLKFHVKSILKRYG FT PLKPSLKLQCLKENIKVTIQQEAKNRALRDSNHPDNRRREALMATASDWQA FT YPANEPYPMLHARVEQSKQQMEIHSLRNHQGRVKTDTSSLCDISARYFDNI FT FDRAADINDTDDSAFLDLFPEETRRVNTANPSLDSSFTKEEIFDTIKASTH FT NSAPGPDGIPYRFYRDCWDELGDLMTDVYNEAGADSPISTERNTAIIKLLY FT KSGDQADISNYRPISLINTEVKIYTQLINKAIQNILPDSIHGAQNGFVPGR FT HITNNLDTMDHFCNAYSQLNMGWVVGSLDFRKAYDTISQSWVIKTLRNVGV FT SELMINRILAVQQNAVTRINIRGVLSRPVRIKIGVRQGCPLSPTLFIIAVD FT ALVRRLDREMFGLAPGLPLSHHHTTGHRPPQPPTHPEAVAAGHMKVSAFAD FT DIAVFLNNIQDVATVGRVLCLFQRVSGLTLNPSKTVLQKIGPPDCFVPLTF FT IDAEWQRTIDATWPTDNNTRQAPTLRTSDNIFRYLGVHFGNRERLDTHYAQ FT IKEDLKESLRRLSLWGLPYYSKAWMINIYFFSKLNFLGPYVSQIDNTFIRE FT LNELACDKINKLSPDKTRKTFSNGFIQTPVGRGGLNLRDMGRFMVCLKAAR FT AYRFFHGSSPALWDCTFQFYSRTHSLTQEKPHQRVDCRFPTGQAWGYAASP FT TMRDAIRASFELNKPLTAEHANPREDHEPSTTDTVNHRIWERNQVNVRAAE FT SFREAHVDIAAARRYHPVRMVSTAEIDHLRWSMGPLTLENFMFHKTSSPDL FT PNFPHTLARPKKWTQRSDYGGISDDMVWQEVMHDLRKHYIADANKAQVLHL FT MRISRLPLVKWRYPDDHVFTKKNPGCGLCDKAIIQDLHEHIFCKCEVLLSM FT LTRMKIPSVDSLKDWIFTESKCGVLPSFPGHVNSDAPRKHQQVKTRKYLRE FT LAYGIWKMERSLRYSGDDATLGHVQQGLLQFLKEAQMCFYDGQPPALHDER FT E" XX SQ Sequence 6872 BP; 1913 A; 2075 C; 1519 G; 1365 T; 0 other; aacccaagag gatgaataga accataccat ggccggggga aacacgtgtc tatcaatatc 60 tccgcgatct cctcgtagac ttgctaagat acttgcgctg ccacggaccc atttcagatg 120 attgcttcag gacccgtaat cagcaagtga tctgtaacca gcaagtgatc tgtagtgata 180 tttttccttg cccgcaatca atgcaattct ttaccaaaat ctgaatacat gtcctttttt 240 tttctttctt ttttttcttc tttttttctc actttattga ttctgaacag ttacaataat 300 acttggtggt tcaagtaaga tttatatagt tgaaacaata ctttcacagc aaagttataa 360 ttaaatgaat atatacaatc gagcgcgacc agaaaaaacg ttcaccatgg aaacagcgca 420 
agcccaagct tcggcgggta actccttggg cgctccgaac cccccacctc ctgatttatc 480 gcgaggtgat ggagccacct caaccgaaac ccccaaccaa accgacattg aaaaaatcca 540 aaaaaacgac aaaaacgatg aaaacaacaa aaaaaacgac aaaaacgaca aaaacgacaa 600 aaaaaacgac aacaatatca ccctcaacgc ctggaagagc aaggccgaag tcaaggctct 660 cctgacatcg aacccctcca cactcaaatt caacctcaac aaggagcgaa tttcggagcc 720 ccttgcggcg ccgaatggta ggcatacctc cggccgtttc aacgtgtcct atgtagccag 780 caaggaaaac ttcttccgaa gagctcttga ccgctccgcc acccctgatt ggaatcttcc 840 tatctccatg ttcctcgagc acctcgataa cgccgctgaa gtcgacgaga aggacaccat 900 cagtgtcctc gagggcgtca tcgaaaagtt cgacgagatc aagggtcagg ttggtctggc 960 ggagtttgac ctgtccaggc acaacctaga catcgtcccc cacctgatcg accagattaa 1020 cttagaacag gaccctgaag actgctacca agtgggaaat ggttggcact acctgacgtc 1080 ggacgccctc actgaccccc acgaacaacg aatggctgtc attctggaga cagttagtct 1140 cattgttgag gccaatactc aagattatcg catcatgagg aagggctcgg ctaaccctgc 1200 tggacgaaga gtactttact acttcaacct accctcgtac atgaagatgg agcgtcagac 1260 ggaggatgct ttcaagatcc acctcaacgc tattctcaac cagttcgacg tagacattga 1320 tcaggccatt ccaggttcct ccctcttgag tggcaccctt ctgatgacct ctgccaacca 1380 tcagaacatc tcgggtcctg tgatcgcagt tcgtgctctc ctaggctcca ccctgcccca 1440 gaccatcccc gacttcttcg tcaacctgga tcccaccaag gctaggctta ccggctcaaa 1500 gttgatcaag atgcacttcg ccaacggcga taacatctgt gttaagtgca agagcactaa 1560 gcacatccgg gaggcctgcc ccgagaaaga catggtcact cccaagatct tccgcacccg 1620 cgctcaccag ggtaccctcc cacagagagg taaagccctt gcttccatcc atgccccatc 1680 caccgtggag cccccggtcc aaactcgcaa gttccaccac acgcctgcta ctcaccagtg 1740 ggagaccgtc ggcactaagt cccctagaca tagacggact cgagacacgt ccccccaacc 1800 caccggtcag aagcccctcc gatcttacta caactttgaa gtcttgagcg acaaaaccgg 1860 agaggacaca cccgaagaga ccgagcaaac taaccagctg ccctctacga accagcagca 1920 tggaactcag aatctaccca tcaacatccc cgcctccgag gaagacacag ttcccgacga 1980 ccaggaaacg gaacaggctg acgtatccat gaacggcttt gacacccagc ctgacgcact 2040 tgctcaccct gaagtagccc ctgatgtcac ccagtttctt ccccccgcta ccccgcttaa 2100 caccgaggcc 
accaacaacg gccttcccca cgaccacacc agtcccaaca cgaccaacca 2160 aacacaaccc ccccccggaa gtatccacga gggccgaccc cgaggagggc acaccacctc 2220 cagtttctct ggcgagccct ctcacaccac gctcggcaag aagcacattg cggtcttcca 2280 gaccgcctcg agcgaggttc cggtgcaaat cctgaacccg gacaggtcgg agaacagact 2340 ctgctggctg ccttctcagg aggtagcaaa gttcattgac gggaggtgcc cgacattcct 2400 gtctacatgc cacttcaaag ttttccacga cggtgcgaca ctcagctcgg tagacgagat 2460 tgtagaaccc ccgaacagtc ccccgaaccc ccctagggtg accactctcc atgaggtgga 2520 cacaatgctc cgaccggggc agaaccagta agcatgtctt tcacctctac taccacccct 2580 tctaaagccc catgtaagag cacccaggcc aaaatgaaat ctccgaatgt taaagtcatc 2640 acggctaaca tcggcggttt taagatggcg gaagcgctta accgattacc cgacaccatt 2700 gtaaacatta ttcgaactac tcactacccc gatctcgtcc ttcttcaaga gacgaactgg 2760 gtggactcct ccctacaaac cgctcaacaa atcatttcaa actcgcccta cccggggggc 2820 agacactaca ccctcctcgg ctccacagct ggtgtttcta atagaagcgt cggcgtgggg 2880 ctagtctatt ccgacaacgt caccatcaca aactttacaa cagtttttga atacttccca 2940 gccctagaca acagactgtg tttggccgac gtgcacatca aaggcaccga cagacaccta 3000 tcgctgatca acgtatacgc accaaacgag cagggagcct cccccttcac taaccggcag 3060 ttctaccagt cattagacga ctatctacgc ctccaaccct gccaataccc cctgatagcg 3120 gcaggcgatt ggaacgcggt cgcctccaac gacggcagga ttggcgaaaa agtgacgact 3180 caccttaagg acttcttagc taactgggac ttactagaca cctacactct aatcagcaaa 3240 cacagaaaag gcctctacac tcacaccaac aacagccgcg gagctggcag gcgcctggat 3300 cagctgcaca tctcttcgac actctcacag tggatcaaat ccacacaact ggtaaccaac 3360 aagcagggtc acaacatcac gaagtcctct caccacgcgg tccaattcgt ctttaacttt 3420 ggcaacctca cccagcgaca agacagaggc ccaggcacct ggaggatgcc ctggtgggtt 3480 tttgatacag agtacattgc ttggctcaag ttccacgtca aatccattct gaagcgatat 3540 ggtcctttga agcccagcct gaagctgcaa tgcctgaaag aaaacattaa ggtcaccatc 3600 caacaggaag ccaaaaaccg cgcacttcga gattccaacc acccagacaa cagacgccgc 3660 gaggccctca tggcaacagc ctcagactgg caagcctacc cagctaacga gccatacccg 3720 atgctccatg ctcgggtcga acaatccaag caacagatgg agatccactc cctacggaac 3780 caccaaggcc gagtgaagac 
cgacacctcc tctctctgcg acatctcagc tcgatacttt 3840 gacaacatct tcgacagagc tgccgatatc aacgacaccg acgacagcgc ctttttagac 3900 ctcttccccg aagaaacccg gagggtgaat acagctaacc caagcctgga ctcgtctttt 3960 accaaggagg agatcttcga taccatcaag gcgtctacgc acaacagcgc gccagggcca 4020 gacggcatcc cgtacaggtt ctaccgcgac tgctgggatg agctggggga cttgatgacg 4080 gacgtctaca acgaagctgg agccgactcg cccatcagca ccgaacgcaa caccgctatc 4140 atcaaactac tgtacaagtc gggagatcag gccgacatct ccaactacag gcctatttct 4200 ttgatcaaca cagaggtgaa gatctacaca cagctcatta acaaagccat ccagaacatt 4260 cttccggaca gcattcacgg cgcccaaaac ggtttcgtac cgggtcgaca catcactaac 4320 aatctcgaca ccatggatca tttttgcaac gcctactccc agctgaacat gggttgggtc 4380 gtaggatcac tggacttccg gaaagcctac gacactatca gccagagttg ggtgataaag 4440 acgcttcgaa atgtgggtgt ctcggaactg atgatcaacc gtatcctagc tgtacaacag 4500 aacgcagtaa cgagaatcaa catcagaggt gtcctatctc gcccagtccg catcaagatc 4560 ggagtgagac aaggatgccc tctctctcca accctcttca tcatcgcagt agacgcactt 4620 gtgcgtcgtc tggaccgaga gatgtttgga ctggccccag gcctgccact atcccaccac 4680 cacaccactg ggcacaggcc gccacaaccc cctactcacc cggaggcagt ggcagctgga 4740 cacatgaagg tgtcggcttt cgcggacgac atagcggtgt tcctcaacaa catccaggac 4800 gtggccacgg ttggtagagt gctctgcctc ttccaacggg tctcaggcct cacactaaac 4860 ccgtcaaaga cggtgcttca gaagatcggc ccccccgatt gttttgttcc actaaccttt 4920 attgacgcag aatggcagcg aacaattgac gccacctggc caaccgacaa caacacacga 4980 caggccccga cgctccggac atctgacaac attttccggt acctcggagt acacttcgga 5040 aatcgcgaaa ggttggacac gcactacgct cagatcaagg aggacctcaa ggagtcgctg 5100 cgacgccttt cactatgggg tctcccctat tacagcaagg cgtggatgat caacatctac 5160 ttcttctcga aactgaattt ccttggaccc tacgtcagcc agatcgacaa caccttcatc 5220 cgagagctga acgaactagc gtgcgacaag atcaacaaat tatcacccga caaaacacgc 5280 aaaaccttca gcaatggttt catccaaact ccggtaggca gaggaggact gaacctgcgg 5340 gacatgggac gtttcatggt gtgcctgaag gccgctcggg cttacaggtt cttccatggc 5400 tcatcgccgg ccctttggga ctgcaccttc cagttctaca gcagaactca cagtctcaca 5460 caggaaaagc cacaccagcg cgtcgactgc 
cgctttccga caggccaagc ttggggctac 5520 gcagcctccc ccaccatgcg agacgcaatc agagcttctt tcgagctcaa caagcccctc 5580 accgcggaac atgccaaccc tcgagaagat cacgagccct ctacaacaga cacagttaat 5640 cacagaatct gggaacgaaa ccaggtgaac gtgagagcag cagaatcttt ccgggaagca 5700 cacgtcgaca tcgcagccgc gagacgatac catccggtca gaatggtgtc cacagcggaa 5760 atcgaccacc tccggtggag catgggacca cttactctag agaacttcat gtttcacaag 5820 acctcgtctc ccgacttgcc aaacttccca cacacacttg cacgaccgaa gaagtggact 5880 caacgaagcg actacggggg catctctgac gacatggtct ggcaggaggt aatgcatgac 5940 cttcgaaaac actacatcgc cgacgccaac aaagcacaag ttcttcacct catgcggata 6000 tctcgacttc cgctcgtgaa atggagatac cctgacgacc acgttttcac caagaagaat 6060 ccaggatgcg ggctctgtga taaggccatc attcaggatc tgcacgagca tatcttctgc 6120 aagtgcgaag ttcttctttc catgttgacc aggatgaaga ttccgtcagt ggactctcta 6180 aaggactgga tctttaccga gtcgaaatgt ggagtactcc cctctttccc agggcatgtg 6240 aacagtgacg ccccccggaa acatcaacaa gtcaagacac gcaaatatct gcgggaattg 6300 gcttatggga tctggaagat ggagagatcc ctccggtact caggggacga cgccaccctg 6360 ggacatgtcc aacagggttt gcttcaattc ctgaaggaag cccagatgtg tttctacgat 6420 ggacaaccgc cagcacttca cgatgagcga gagtagacac agtagcaagc gtaaaaggcg 6480 gccgaggcca ccgagagaac agcgtagcag ggcgcgtagt caccacaggg gacgcagaac 6540 caaaccaatg acgaagaaga accacaagga gaagttttca aaggcaatgc aaatgaagag 6600 ggcaatggaa ggattgagat tagagaactg gagactggag tggcgttttc ccgatgaacg 6660 aacaaacacg cgaagctatg tggaccaaca tacaacacgg actgaaccag gtttttttat 6720 gattttttta ctggaaatag gtacgtgcca agttggacca tgacactaaa cgtgtttaat 6780 tagtaatatt cgtgtaagcg tacattcatt tcaaaggtta ttctttcacg gcaaagttat 6840 aattaaatga atgtatatgc agaaaaaaaa aa 6872 // ID Copia-62_MLP-LTR repbase; DNA; FNG; 656 BP. XX AC AECX01000577; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. 
XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-62_MLP_; KW Copia-62_MLP-I; Copia-62_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-656 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000577; Positions 9472 8817. XX SQ Sequence 656 BP; 166 A; 137 C; 95 G; 258 T; 0 other; tgtcgaaata caatcaagac aaggtagagg tgaatgcagt ttttgttaat gtagttgtga 60 caagtgttgt tttggatatg tttcgttctc actttttctc gtctgtgatt ggatattatg 120 agatcacccc aacttgacta agatgtacat caatcccaat gaatttcctt ctcattgtta 180 ttgatatatg tgctgttgtt tttgtttgcc ccttttctct ttttctctct ttcccatctt 240 tcggaaagag agtgagtcat ttcatacctt tcacattacg tagaattact aacgttataa 300 caccactaga atccgcttag atcactattc gagcacattg ctcattaaac ccttttcatt 360 tctcacttta acaatttcct ccattaacta ggtacctatc tcttgaatct tttccttttt 420 cacttatacc taactacaaa cgtgttattc acttttcatt tctttttctt cttatttatc 480 ttgttaggtt agtaccaagg tctgcatacc tttggtttgg agaagagtgc tcacgtacgc 540 actgctgtta tcaggttagc taagaaatac aattctatct ttcggaaaga gaaatccgct 600 tagatcacta tttgagcaca ttgctcatta aacccttttc atttctcact ttaaca 656 // ID Gypsy-1-I_RO repbase; DNA; FNG; 6867 BP. XX AC . XX DT 27-FEB-2009 (Rel. 14.02, Created) DT 27-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE An internal portion of the Gypsy LTR retrotransposon from DE Rhizopus oryzae. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-1-I_RO. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-6867 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Rhizopus oryzae."; RL Repbase Reports 9(2), 630-630 (2009). 
XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 171..1325 FT /product="Gypsy-1-I_RO_1p" FT /translation="MAYINGHFALANANMNANYRPALIDFHGYEGEDFRHF FT VESLESYFAINNITQEPRKLIILKAQLRGAAKVYYEKEILKRIPGINYEQA FT VDLLKNYYITPELIQSYELEFNEMYQGEQEHPQIFLARLKEAADLANITND FT AVIESRFRAGLLREIKQFCIQSSSKNLQDWINHAEGWWNANRPRKIAMVDN FT PFIPRNANQALIYHNENYDQHHPSSNHNIDLIDTNEHAMPALAYNQIHYNN FT AQPSSNYYYNNAHGTNQLTTMDTSRNRNQLHQAQYIGNQHIRPNQQQDLIG FT LIQQAIRQELNSQQYQQQPTRNNRYNGYNNNYNRSGRYGNNDNNGRYDNSN FT KNYMSNQRRSYDQPASQDQRQPNNQPINQHNNQQNKQTKNY*" FT CDS join(1307..3448,3381..4394,4734..6104,6121..6867) FT /product="Gypsy-1-I_RO_2p" FT /translation="TDKKLLGSVAFDNSQKNGQLNTQKDTKPINNLQQQQL FT NAILTDKYNSKQKDLYAAIRPERPPDVATTTPYPNSRPTNKEKGKQVAKPT FT ITRKVTTRSHIEEVKNPTPVQQMQATRTEAMDTDLPIQRKVTEKSNKPRTR FT RIEPDIKYDIISDVLKQKADIEIGDLITVAPSLRKKLVDECRPKRKSRQSQ FT QVAQQTMALIEDEEINTTAAYSTVSIGDKNIKALVDSGASKTCMSKALADA FT LELEIDSASENVFTLGNGTKQPALGLIYDVPIEVKEDMVIPCTIEVLPSCP FT SHLILGSNWLNRAKAKIDFNSSSLKVKYKNQKAELPIHFIRKSTPLPKMKT FT FHQDYQHPISLTNSNSEKHVHFEDRDSEGSYSSSEEEDESESESDDDIEME FT VTALERNYEQESLQVLENDKYEEVVITNSQEMYSIKSTDTGFLMQAHSSKH FT ISLDKPKEDTQNILYDFYITHPKLMQSSGYFDRSSSFIVNKKSLDICLYNR FT TEKDIYLKPGEEIGILEILDPNKDTIINAYEMDKHSTLCTLEHDNIKQNPN FT KQEQDTLKPQLLEKLEIGDINESMREELLKLLKRYQHIFDWDNDTIGRTNL FT INHKIIIEENTLPISHRPYRISPLEAEHLQKELDKYCKLGVISPSNSPWAA FT PVILVKKKNGDYRMVIDYRKLNAKTKKDAYPLPRIDDLLDTLGKAKVFSAL FT DMRSGFSSSTIRREEKPKYFQHLTCALVFHQVPLDENSKEFTAFTTKYGTY FT HYNTLPMGLVNSPATFQRLIDLCFRPLINKCLVAYIDDLNVYSHNEQEHII FT HLEQVFQCIENANLKLNPEKCFFFKDHLKFLGYIVTNQGIQTDPDKIKKIV FT EYPIPTTITQVRSFLGIASYYRRFIKNFAAIARPLHDQTKTKKKIPWTQAT FT TASFETLKKLLTTAPVLARPDFNRSFILVTDASKLGLGCVLTQLDDDGKEH FT PIIYASRGLKPNESNYAPTKLECLAVIWAVKLFRPYLLGKKFMIITDHSAL FT TGLLKTPNPTGIIARWIVTLSEYDFDIKYRPGRVNESADFLSRLGHYIKNL FT ILPKMEQQQIDLVKQYLQELRLPEDITHKQKRYLQKQAHKFTIYKDKLYRY FT NTDNGIIRKVLNKQEAEEIMYSYHQHPLGGHLAYNNTLHKIASRYYWENMT FT KDIMEYVKKCHRCQRHGKKSLKEELYPVPVSVKPFDRIALDVKHVQASRSG FT NRYIIAGIDYLTKYVEARPIRFQTASEIALFLYEEIICRHGCPTIIVSDNG FT 
KPFVSKLIQQVCKNFSIIHKTTTPYNPQSNGLIERFNRTLGQILQKRTKEE FT KDDWDSYLPAALFAYRTIKQGSTKNTPFFLLYGYEPKTPFDIDHHVYERNS FT PKFEAILRHRTIHQIYNLNRIRDAGVQNIQRAQESQKKQIENKILDERKEL FT KPPFKLGDIVLIYRDYLSTSWSAKLQDKWEGPYVIQHILGKGTYHIKSMDP FT HDIKLRRIHGNRMKPYLLPKVQWCQENERSIMTNLDEQTNDLLHLQLNNIN FT KRRRRKRRKETKTIYKDKEKFKKKKKKQHQNMNAANNNKCLSKDFSMDMDE FT YLTRIYKEQGYEVTMTAIMDFLMMWEDCNVGPTEVQYEMDGKTCTREEYAV FT YTIENISAMVSAEKKEEGVINIYEANWEDEDDLELLNNLEGLIKNQHKQFV FT DTLHEFEMVQELKEQMCDLLATNINHWAHFRFRRPNNTPESVIAYLAAKQI FT MLKAIIPDYRLHRALTLKIKQKDNWARIVGDSSGTAGI" XX SQ Sequence 6867 BP; 2791 A; 1343 C; 1145 G; 1583 T; 5 other; tttggtggtc actacgaggg aataaaccaa ataaatcaaa aaaatcacag aaaaatatat 60 aacacraaca acatcgatca gaaattaaaa caattcaatc aacwctataa attcawcaaa 120 aacttayaaa aacgaaactt taaaaactat caaatctatc tgaaactgaa atggcttata 180 taaatggaca cttcgcgttg gctaatgcta atatgaacgc aaattaccga cctgctttaa 240 ttgatttcca tggctacgaa ggagaagact tccgtcattt tgtagaaagt ttggaatctt 300 actttgctat taataatatt actcaagaac cacgcaagct tattatactc aaggcacaac 360 taagaggagc cgccaaagtg tattacgaaa aagaaatact caaaagaata cctggaatca 420 actacgaaca agccgtagac ctcttgaaaa attattacat aactccagaa ttgatacaga 480 gttacgaatt agaatttaat gaaatgtatc aaggcgaaca ggaacaccca caaatattct 540 tagccagact aaaagaagcc gcagaccttg ctaacataac taatgatgca gtcatcgaaa 600 gccgatttcg tgcaggactc ttacgagaaa tcaaacagtt ctgtatacag agtagttcta 660 aaaacttaca agactggatc aatcacgccg aaggatggtg gaatgccaat agacctcgaa 720 agattgctat ggttgataat cccttcatac caaggaacgc caatcaagct ttgatatatc 780 ataatgaaaa ttatgatcaa catcatccat caagcaacca caatattgac ttaattgaca 840 caaatgaaca tgctatgcca gcactagcat ataaccaaat acattataac aatgctcaac 900 ctagtagtaa ttactattac aacaatgcac atggtacaaa tcaattgaca acaatggata 960 catcaagaaa ccgtaatcag ttgcatcaag cccaatacat tggaaatcaa catatacgtc 1020 ctaaccaaca acaagatctt attggtctta tacaacaagc tattcgccaa gagcttaata 1080 gtcaacaata tcaacagcaa cctactagaa ataacaggta taatggttat aacaataatt 1140 ataataggtc tggaagatat ggaaataacg acaacaatgg acgatacgat aacagcaaca 
1200 aaaattatat gtcaaatcaa aggagatcat atgaccaacc agcaagtcag gatcaacgac 1260 aaccaaataa tcaacccatc aaccaacata acaatcaaca aaataaacag acaaaaaact 1320 attagggtca gttgcttttg ataactcaca gaagaatggt caactgaaca cccaaaaaga 1380 tacaaaaccc attaacaacc ttcaacaaca acagctcaac gcaatattaa cagataaata 1440 taatagtaaa caaaaggatc tctatgcagc aatcagacca gaaagacctc ctgacgttgc 1500 cacaactaca ccctatccaa attctagacc cacaaacaaa gaaaaaggaa aacaagtagc 1560 aaaacctacc ataaccagaa aagtcactac acgaagtcat atcgaagaag tgaaaaatcc 1620 cacaccwgta caacaaatgc aagccactcg aactgaggct atggatacgg atctccctat 1680 tcaacgcaag gttactgaga aatcaaataa accaagaacc agaagaatag agccagatat 1740 caaatatgat ataatctcag atgtgttaaa acaaaaagcc gatatagaaa ttggagattt 1800 gatcacagta gctccatctt tacgcaaaaa gcttgtagat gaatgtcgac ctaagagaaa 1860 atctagacaa agtcaacaag tagctcaaca aacaatggct ctaatcgaag atgaagaaat 1920 caataccact gcagcctact caactgtgag tattggagac aaaaatatta aagccctagt 1980 agacagtgga gcatcaaaga catgtatgtc aaaggctcta gcagacgctc ttgaattaga 2040 aatagattca gcatcagaaa atgtattcac actaggaaat ggaaccaagc aacctgctct 2100 cggattgata tacgacgttc caattgaagt caaggaagac atggtcatac cctgtacaat 2160 cgaagtccta ccatcctgcc cttcacatct catactagga agcaattggt taaaccgcgc 2220 taaagccaag attgatttca acagctcttc attaaaagta aaatacaaaa accagaaggc 2280 agaattacca atacacttca tacgcaaaag cacaccgtta cctaaaatga aaacatttca 2340 tcaagactat caacatccta taagcctcac aaactcaaat tccgaaaagc acgtacattt 2400 tgaagatcgt gattcagaag gcagttattc atcaagcgaa gaagaagacg agtctgaatc 2460 agaaagtgac gatgatattg aaatggaagt aacagcctta gaaagaaact atgaacaaga 2520 atcattgcaa gttttggaaa atgacaaata cgaagaagta gttattacaa actcacaaga 2580 gatgtattcg ataaagtcaa cagacacagg atttttgatg caagctcatt cgtcgaaaca 2640 tatcagccta gacaaaccca aagaagacac tcagaacata ctatatgact tttacatcac 2700 acatccaaag ttaatgcagt cttcaggata ctttgatcgg tcatcttcat tcatagtcaa 2760 caagaagtca ttagacattt gcttatataa tcgcacagaa aaagatattt atctaaaacc 2820 tggagaagaa ataggaatat tggaaattct tgatccaaac aaggatacaa taatcaacgc 2880 
atacgaaatg gacaaacact caacattatg cactttggaa cacgataata ttaaacaaaa 2940 tccgaacaag caagaacaag atacgctaaa gccacagcta ttggaaaagt tggaaatagg 3000 agatataaac gaatcaatga gagaagaact gcttaagtta ctaaaaagat accaacatat 3060 atttgactgg gataatgata ctattggacg tacaaatttg atcaaccaca agattattat 3120 agaagaaaat acattaccta ttagtcaccg tccgtacaga atcagtcctt tggaagcaga 3180 acaccttcaa aaagagttgg acaaatattg taaactagga gttatatcgc catcaaatag 3240 tccgtgggca gcaccagtaa tactagtaaa gaagaagaat ggtgattaca gaatggtcat 3300 tgattataga aaactaaatg caaagacaaa gaaagatgca tatccattac caagaataga 3360 cgacttattg gacaccctag gaaaagccaa agtattttca gcacttgaca tgcgctctgg 3420 tttttcatca agtaccatta gacgagaata gcaaggaatt cactgctttt actaccaaat 3480 acggaacata tcactataat actttgccta tgggacttgt aaattcgcca gcaacctttc 3540 agcgtttaat agatttatgt ttccgcccgc taatcaacaa atgcctagta gcctatatag 3600 acgatcttaa tgtatactcc cataacgaac aagaacatat aatacatcta gaacaagtat 3660 ttcagtgtat agagaatgcc aatctcaagt taaatccaga gaaatgtttc ttctttaaag 3720 accatttgaa atttttggga tacatcgtta ccaaccaagg catacaaacc gatccggaca 3780 aaataaagaa gattgtcgaa tatccgatac caaccactat tacgcaagtt agatctttcc 3840 taggaatagc atcgtattat cgaagattta taaagaattt tgcagctatt gcaagaccgt 3900 tgcatgacca aaccaaaacg aaaaagaaaa ttccgtggac gcaagctaca accgcttcat 3960 ttgaaacact caagaaattg ttaacaactg cacctgtcct ggcaagaccc gattttaacc 4020 gaagttttat cttggtcaca gacgcctcta aactaggact aggatgtgtc ctaacccaac 4080 ttgatgacga cggaaaagaa catcccatta tatatgcaag ccgtggactc aaaccaaacg 4140 aatccaacta tgcccccaca aaactagaat gcttagcagt tatttgggca gtgaaactct 4200 ttagacctta tttacttgga aagaaattca tgatcattac agatcactca gccttaactg 4260 gacttctcaa gacaccaaac ccaactggaa tcatagcccg atggattgtc acattatcag 4320 aatatgattt tgacatcaaa tatcgacctg gacgtgtcaa cgaaagtgca gatttcttat 4380 caagacttgg acattaaata acatacaaca caacactgga atttacaaca ttaatatata 4440 catctatata attatcaact acatggagga agggagggta gatgatacat aaacttaaga 4500 aaaatccaaa aacaactcaa agcacgcata aaaaagaatc atattatcaa ctacatggag 4560 gatgggaggg 
gaagttgaaa caaaatgtac aaaatcatca aaaacaatag aaaaatacaa 4620 aaatcataag agaaggaagg acggcctaga aaacaaacaa tcgtgagaag aaaaacctta 4680 acaacactat aattggaaat aacatacata aaatcaagac cctgaaacaa taatatatca 4740 agaaccttat attaccaaaa atggaacaac agcaaattga tctcgtgaaa caatatttgc 4800 aagagttgag actaccagaa gacatcacgc acaaacagaa aagatactta cagaagcaag 4860 ctcacaaatt taccatatac aaggataaac tatatagata caacaccgac aacggaataa 4920 tacgaaaagt gctaaacaag caagaagctg aagaaatcat gtattcatat catcaacacc 4980 ctcttggtgg acatttggca tataacaaca ctctgcataa aatcgcatca cgttattatt 5040 gggaaaatat gaccaaagac attatggaat acgtgaagaa atgtcacaga tgtcagagac 5100 atgggaaaaa gtcgctaaaa gaagaattat accctgtgcc agtatcagta aaaccctttg 5160 atcgcatcgc tttagacgtt aaacatgtgc aagcttcacg atccggaaac agatatatca 5220 tagcaggaat cgactacctt actaaatacg tggaagcaag acccattcgt ttccaaactg 5280 catcagaaat tgctttattc ttatacgaag aaatcatatg tagacatggt tgtcccacaa 5340 ttatagtttc ggataatggc aaaccatttg ttagcaagct gatacaacaa gtatgcaaga 5400 atttttcgat tattcacaag accactacac catacaatcc tcaaagcaat ggattgattg 5460 aacgtttcaa cagaactcta ggacaaattt tacaaaagcg tacaaaagaa gaaaaggatg 5520 attgggattc atatctacca gccgctctgt ttgcctatcg aacgattaaa caaggctcaa 5580 cgaagaatac accatttttc ttactatatg ggtacgaacc gaaaacacca tttgacatag 5640 atcatcatgt gtatgaacga aattcaccca aatttgaagc tattctacga caccggacaa 5700 ttcaccaaat atacaacctt aacagaataa gagatgcggg agtgcaaaac atacaacgag 5760 ctcaagaatc gcagaaaaag cagattgaga ataagatact cgatgaacgc aaggagctga 5820 aaccaccgtt caaattaggg gatatagttc tcatttaccg agattatttg tcaacatcat 5880 ggtcagcaaa gttacaagat aagtgggaag gaccatatgt gatacagcat atacttggaa 5940 aaggaacata tcatatcaaa agcatggatc ctcatgacat aaagcttaga agaatacatg 6000 gaaataggat gaagccttat ttgttaccta aagttcaatg gtgccaagaa aatgaaagaa 6060 gtatcatgac caacctggat gaacaaacaa atgatctgct tcattgaaga atactgatga 6120 cttcaactca ataacataaa caaaagaaga agaagaaaaa ggagaaaaga aacaaaaaca 6180 atatataagg acaaggaaaa attcaaaaaa aaaaaaaaga aacaacacca aaacatgaac 6240 gccgcaaaca acaacaagtg 
cctttccaaa gatttctcta tggatatgga cgaataccta 6300 acaaggattt acaaggaaca aggctacgaa gttactatga ccgcgataat ggacttctta 6360 atgatgtggg aagattgcaa tgtcggccct accgaagtac aatatgaaat ggatggaaaa 6420 acttgtacaa gagaagaata cgctgtttat actattgaaa atataagcgc aatggtgagc 6480 gcagagaaaa aggaagaagg tgtaattaat atctatgaag caaactggga agatgaagac 6540 gacctagaat tattgaacaa cctagaagga cttatcaaaa atcaacataa acagtttgta 6600 gatactctgc atgaatttga aatggtacaa gaattaaaag aacagatgtg tgatctactt 6660 gcgacaaata taaaccactg ggcacatttc cggttcagaa gacccaacaa tacaccagaa 6720 tcagtaatcg catacctagc agcaaagcaa atcatgttga aagctatcat accagattac 6780 agattgcatc gcgctttaac tctcaagatc aaacagaaag acaattgggc cagaatcgtg 6840 ggcgattctt ctggtactgc ggggatc 6867 // ID PYRET_LTR repbase; DNA; FNG; 475 BP. XX AC AB062507; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Magnaporthe grisea Ty3/gypsy-type retrotransposon PYRET_LTR, long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Long terminal repeat; PYRET_LTR; Ty3/GYPSY superfamily; KW gag-pol pseudogene; retrotransposon. XX OS Magnaporthe grisea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-475 RA Nakayashiki H., Matsuo H., Chuma I., Ikeda K., Betsuyaku S., RA Kusaba M., Tosa Y. and Mayama S.; RT "Pyret, a Ty3/Gypsy retrotransposon in Magnaporthe grisea RT contains an extra domain between the nucleocapsid and protease RT domains."; RL Nucleic Acids Res 29(20), 4106-4113 (2001). XX DR Genbank; AB062507; Positions 1 475. 
XX SQ Sequence 475 BP; 157 A; 115 C; 90 G; 113 T; 0 other; tgttacgacc agacagttgg gccggtacca gggaggccag gtggggtcgt gcccctcctg 60 acgacactcg cccaaaaaaa gtcaccgacc ggcaaaacag ggattacgta aggacaaaac 120 cacgtgacag cacgtgactg ccaagacaat tgatctcaaa cggacaaggg atgatctcaa 180 agatcacaac ctgatctcca gagaccctgg ggagtcttta agatcagaga cctttcagga 240 gagatcacat tcggaatata atttatataa ggcacgctca ttaccatata aataaattcg 300 attttaggat tgtttagcag caattaaact ttaattaacc tcaatattta tttgagaacg 360 attataattt cacccccaat tattaaaaac cccagttacc caattagacg ctcaattgct 420 tcaacccact gtactttacc ctaagaacag gttacccgat accttgatcg taaca 475 // ID Gypsy-101_MLP-I repbase; DNA; FNG; 7917 BP. XX AC AECX01000545; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-101_MLP_; KW Gypsy-101_MLP-LTR; Gypsy-101_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-7917 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000545; Positions 47908 39992. XX CC Positions [6744-7256] - Integrase core CC 'ACCTT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 775..2085 FT /product="Gypsy-101_MLP-I_1p" FT /translation="MPPRKSSRVSSLERPNYSDSIRRSSSIGSKRGCSNAV FT KPTTRTVNPGDPGGSGLGTTRSYSEVARVPLHQVEKCANVESQQHSEHHDL FT SEVVEIDSRNSEDSDRRGSGEDAMPPLGPLRGREEPSSEDLGRVHRQRAVP FT QTAARSESPTNPSRTIHRSNETPSAQTDERPLQQTSPSRGQSPQYATDVVV FT QPSLLSSCEGDERSQQREREAEPPTMSKAFYTTPSPFLTSHRFISDLVPKY FT DDTNVVRSSSVNVKPMPIVHPQPMPVTKNVVQVPSTTVDIVTSNSAVINEN FT IVQSAKNLPHQRQITQTQHSVEIVSQHEKKDQMSLRTDNKFDKDRFLADLQ FT KSAKNSKSIEQILRETEVKTPPLFNKEVPREIVDDKSSVSVEAEKSLNTKF FT RDLRLQTESLSPKSMSSLNDHRYDLETRIIRCKIILRTKSKNP" FT CDS 2139..7733 FT /product="Gypsy-101_MLP-I_2p" FT /translation="MNIHFEEMRQSINESDRKRTFEETNKILSDHLDSMDD FT IMSNTIMVNKEIVEANSRHLAGLVKIIDDKMIEQSLTCSLIHRNVVRILSS FT IDKLTTKIETQTGEQSILQNIVPDATVANSNNPFAHEFDVIPKVTHPSERR FT YAEQEESIRLEEKTSMTKTSERSQMDLEILKLLRKEYSQSRDWPRFTGEGE FT YNHLDWIDWIDGVQEEFNMPDALITCKLAVVMTGTARAWYQSKKKTMLKTS FT WAEWKKAIMQDFGTPLWKRRMASAFERDKFKNEYRSRPNPWLLSQRKRLEA FT AWPFLTTSEHIDKILGLCSGELEHAIQSRIKDRSNFEAFMNIFEEIVTTTS FT IGRITDKASNLRVTFSNSREGGGSFPREVNKDLRGSFTKDFRSKDPDSTKA FT METEQKNAFRSPFRSRGDRQRYGKKPINAVNCEEDQLSYHDEMSEQGESIQ FT KSINSETSDDGEDFCISNVDMFQRGDGVLQENDTDTESEEDVTHMTQEISL FT ETLANNISKAESMANSIPITPFIARRKLPTSDIDTFLRISIRGFEEVMLID FT TSKQISTIPKFILEQCWPTWETDMSTLPDLHTSDEKSEYGKIGSINLPIKL FT EHDTRPCFIHMNFVITEDEPSIFLRLGCQSLKELEMTIHMGRESSCRIWET FT NESFSFKLWTLDKYNSNRAREKSTQVIATITTEPGDGLDIAELQSQAATPQ FT QWDGEIKSKQISDARLLMSKPARGRAHKIGHHCITTVLLDNRIEANILLDS FT GAACSIVGTNYLDMIFPEWKENILPPSTMTFRGCDGKLKALGIIEIPIVFP FT HHLGSVRIMPEFVVMENASSNYFILGGEYLRLYGIDIVHSKEKFFTIGNEN FT KKKKFLLSLLKEGKISAIQEVDEEFEKTIGDCNISPRLKSSELHSLKAVMR FT KYRKAFAYSKNPIGTIVGHELKLRLTVQKPYPPILKKQAYPASPWSRDEIE FT KHLDELISYGIIRKVGADEDVDITTPVIIAWHNGKSRMCGDFRALNTFTVP FT DRYPMPRISHCLTNLGAALYISTMDVMKVFHQNTLDEETRKMLRIISHKGI FT HKYLCMPFGVKGAPAHFQRMMDNEFAAELSEGWLIIYIDDIIVFSKTWEEH FT LRRLIQVFEKIIKMGMTISLQKTNFAFQELKALGHVVSGLWIAIDQNKVAV FT VLQKQIPQSVKEVQSFLGFANYYRSHLEGFARVSGPLYKLLQKGVSFEMTH FT NRVESWNTLKKRLTEAPFLLHADFKLPFKLYVDASFDGLGATLQQIQIIDG FT 
KPYEGLIGCISRKLKQSELNYGATQLENLCLVWALDKFHYYLDGAFFEVIT FT DCTALKALLNMKTPNRHMLRWQIAIQEYRSCMTITHREGKKHDNADALSRM FT ALDNNESNPAWEPEDDIRDIPIMGISISDLSEEFFEQIKEDYKSEPNTVKL FT VEILRADHKVHELTSSLDGIWKKSFEEGRFTLLSGVLYHREKHSCVMVITS FT ETTITDILKICHDDVMSGHLSLDRTLDRVKQTAWWPNWRKLTEEYVITCNT FT CQKTNQTTGKKLGLLQQITEPKVPWEIINMDFVTGLPAAGHDNVDCVLVVV FT DRFLKRCRFLPCHKNATAMDIALLFWERIISDAGLPRVIISDRDPKFTSEF FT WKGLYSLTGTKLSMSTAYHPQTDGLAERMISSLEDLIRRYCGYGLAFKDGE FT GYTHDWKTLLPALEIAYNTSTHSTTGKTPFEIEQGFNIRTPKDFLRPKDST FT FHPTASNFHTMLTRARMHAEECIKDSTKYNKERWDKTHKEPDFKVGELVLI FT STINFNNLQGPKKLKEAFIGPFVIKELHGKNAVEVILTGKIERKHPTFPVS FT LLKRYEKSDEKKFPHRKVSDTADIPIKDEVQGEIQKIIDEKLV" XX SQ Sequence 7917 BP; 2862 A; 1568 C; 1649 G; 1838 T; 0 other; attgggggcc tcatcgcgtg tttaataacc cgagtctaat aaaaactatt cttttttttt 60 tagttttttt tagatataat attaaaaccc ataaattttt tttatcattt tcctttttac 120 cttttcttcc gtttaaaaaa aaagcttctt tctatttttt ccgaaaatct ctttcgaatt 180 aataaatcaa cgttcactag aactatttca acagttaccg tattttgatt tcgtacacga 240 aaaccctttc agcgacgaaa tatacaaccc ctgtttcgga attcatattc taaacccaag 300 agtccaattc atcttacaaa atttcgcgat caacgaattc aacagcattg gagcttcttg 360 atattagcag agagctaagg aaggtaatta atcccttagt cgagatccaa gcatcaaagt 420 atcataagac agatatcata ggtgatatac ttgaagttat caagagactt gaatcagaag 480 cggaaaaact ccgaaagtta gtcgtaatta ccaagtgcat catttgtgca ttaacgagta 540 ctaacataag actacaacag tgaacaagaa cagcaattcc aatcaagacc tatccgaaca 600 gactcaggca gcagggacaa ccccatcagt cttgtcagtc cagaaccaca gcaaccagac 660 agccccagct cctcatacca cgggtacgac gacgacgaag actttgtccg cagatccatc 720 tggtaaatat tttttaccaa ataagaagtc cggcctataa agacgtgatc tgacatgcct 780 cctcgtaaat catccagagt cagcagcctc gaaaggccca actactcaga ctccatcaga 840 agatctagct cgatcgggag taaacgagga tgttccaatg cagtcaaacc taccaccaga 900 accgtcaatc caggagatcc aggaggctca ggacttggaa caacaagatc atatagtgaa 960 gttgcacgag taccacttca ccaagtggag aaatgcgcaa acgtcgaatc ccaacaacac 1020 tcggaacatc atgatttatc tgaagttgtg gaaatcgact cacgaaactc tgaagattct 1080 gatcggcgag gaagcggtga 
agacgctatg ccacccttgg gacccctacg aggaagagaa 1140 gaaccttcta gcgaggacct tgggagggtt catcggcaaa gagcagtacc tcaaaccgca 1200 gcaagatcag aatcaccaac aaacccaagc aggaccatcc acaggtccaa cgagacacca 1260 agcgcacaaa ccgacgaacg tcccttacaa cagaccagcc caagcagagg acaaagccca 1320 caatatgcaa cagatgttgt cgttcagccg agcttattat cgagctgtga aggggatgag 1380 aggtcccaac aaagggaaag ggaagcagaa ccaccaacaa tgagtaaagc tttctatacc 1440 acccctagcc cattcttgac atctcataga tttataagcg atctagttcc taagtacgat 1500 gataccaacg ttgtaagatc ctcttctgtg aatgttaaac ctatgccaat tgttcatcct 1560 caacctatgc ctgtcacgaa gaatgttgtt caagttccca gtactactgt agatatcgtc 1620 acttcaaact ctgctgtaat caatgaaaac attgttcaaa gtgccaaaaa cttgccgcat 1680 cagagacaaa taactcaaac acagcatagt gtcgaaatcg tttctcaaca tgagaaaaaa 1740 gaccagatga gtttaagaac tgataacaag ttcgacaaag atagattctt agctgattta 1800 cagaaatcgg ctaaaaactc taagagcatt gagcaaattc tgcgagaaac agaagtaaaa 1860 acaccaccac tcttcaacaa agaagttcct agagaaatcg tagatgataa gagttctgtc 1920 tcagtagaag cagagaaaag tctcaacact aaatttagag atctaagact acaaaccgaa 1980 agtttgtcgc ccaaatccat gagttctttg aacgatcatc gatatgacct agaaacaaga 2040 attatccgtt gcaaaatcat attgagaaca aaatcgaaga atccataacg gtactgaata 2100 agaaatggga cctaggtctg aacattctga acgacgaaat gaacatacat ttcgaagaaa 2160 tgcgacaaag cataaatgag agcgacagga aacgcacatt tgaagaaacg aataagattt 2220 taagtgacca tcttgatagt atggacgata taatgtccaa tacaatcatg gtaaataaag 2280 agatagttga agctaatagc cgacacttag caggactagt caaaataata gacgataaga 2340 tgatagagca aagtttgacc tgctcgttga tacatcgaaa tgtggtaaga atattatcat 2400 caatagacaa gctaacaacg aaaatagaaa ctcaaactgg agaacaaagc atcttacaga 2460 acattgttcc agatgcaaca gtagcaaata gtaacaaccc gttcgcacac gaattcgatg 2520 tgatacctaa agtaacacat ccatcagaaa ggcgatatgc agagcaagaa gagagtataa 2580 gactcgaaga aaagacaagc atgactaaga cgagcgagag atcacaaatg gacctggaaa 2640 tcctgaagct actacgtaaa gaatattcac aatcaaggga ttggccgaga ttcaccggtg 2700 aaggggaata taatcacctg gattggattg attggataga cggagttcag gaggagttca 2760 atatgccgga cgctttaatc acatgtaagc 
tggcggtagt gatgacaggt acagcacgtg 2820 cttggtacca aagcaagaag aaaaccatgc tcaaaacaag ctgggcagaa tggaagaaag 2880 caatcatgca agacttcgga acacctttgt ggaaaaggag gatggcatca gcattcgaac 2940 gtgataaatt caagaacgag tacagatctc gaccgaaccc ttggctacta tcacaacgca 3000 aaagattaga agcagcgtgg ccttttctaa caacatcaga acatatagac aaaatactag 3060 gtctatgtag cggagaacta gaacatgcga tccaatcgag aatcaaagat cgatcgaatt 3120 tcgaagcgtt catgaatatc tttgaagaga ttgtaactac tacctctata ggaaggataa 3180 cagacaaagc aagtaacttg agagtcacct ttagtaacag tagagaaggc ggaggttctt 3240 ttccgagaga agtaaacaaa gatctgagag gatcctttac gaaagacttt agatcaaaag 3300 atccagactc gaccaaagct atggagaccg agcaaaagaa tgccttcaga tcacccttca 3360 ggagtcgagg agatagacag cggtatggca aaaaaccaat caacgcagtg aactgcgaag 3420 aagatcagct ctcatatcac gacgagatgt cggagcaagg agaatccata caaaaatcga 3480 taaatagtga aacttcggat gacggagaag atttctgcat tagcaatgtc gacatgttcc 3540 agcgagggga tggtgtatta caagaaaacg atacagacac tgaaagtgaa gaagatgtta 3600 cacacatgac acaagaaata tcccttgaaa cgttagcaaa caacatcagc aaagcagaat 3660 cgatggctaa ttcaattcca ataactccat ttatcgctcg aagaaaactc ccaacatcag 3720 atattgatac cttcctcagg atatctatca gaggattcga ggaagtgatg ttgatagaca 3780 cttccaagca aatttcgacg attccaaaat tcattttaga acaatgctgg ccaacgtggg 3840 aaacagatat gagcacgctt ccagatttac atacatctga cgaaaaatca gaatacggaa 3900 agataggatc aataaattta cctatcaaac tagagcatga tacgagacca tgttttatac 3960 atatgaactt cgttatcaca gaagacgagc cttcgatatt tttaagactc ggatgtcaaa 4020 gtttgaaaga actcgaaatg accattcata tgggaagaga atcatcgtgt cgaatatggg 4080 aaacgaacga atcattttca tttaagctgt ggacgttaga caaatacaac agcaaccgcg 4140 cacgagaaaa atccactcaa gtaatagcga cgattaccac tgaaccaggt gatgggctag 4200 atatagctga actgcagtct caagcagcta ccccgcagca atgggacgga gaaattaagt 4260 caaagcaaat atcagacgct agactattga tgagtaaacc agcaagagga agagcccata 4320 aaataggcca tcactgtatc acgacagtgc tattagataa tagaatagag gccaacatat 4380 tgcttgacag tggagcagcg tgctcgattg tgggaacaaa ctatctagat atgatattcc 4440 cagagtggaa agagaacata ttaccaccca gcacgatgac 
atttagagga tgtgatggga 4500 agcttaaggc tctagggatc atagagatac cgatagtgtt tccacatcat ctgggatctg 4560 tgaggatcat gccagaattc gttgttatgg aaaatgcgtc atcaaattac tttatattag 4620 gaggagaata cctgagattg tatggaatag acatagtgca tagcaaagaa aaattcttta 4680 ccattgggaa tgagaataaa aagaagaaat ttcttctcag ccttctgaaa gagggaaaaa 4740 tatcagcaat ccaagaggta gacgaagaat ttgagaaaac cataggtgac tgcaacatat 4800 ctcccagact gaagtcatca gaactacatt ctttgaaagc agttatgagg aaatatcgaa 4860 aagctttcgc atacagtaaa aaccctatag gtactatagt aggccatgag cttaaactaa 4920 gattgacagt acaaaaacct tatccaccaa ttcttaagaa acaggcgtat ccagcaagcc 4980 catggagccg agatgagatt gagaaacact tagacgagtt gatcagttat ggtatcatta 5040 ggaaagtagg agctgacgaa gatgttgaca ttactacacc tgttattatt gcttggcaca 5100 acgggaaatc ccgcatgtgt ggggatttta gagcattgaa tacgtttacc gtaccagaca 5160 gatatcccat gccgcgaata tcacattgcc tgacgaattt aggagcagca ttgtacatct 5220 cgacaatgga cgtgatgaag gtttttcatc agaatactct ggatgaagag accaggaaaa 5280 tgctacgtat tatctcacat aaaggaattc acaaatacct ttgtatgccg tttggagtta 5340 aaggagcacc agctcacttt caacggatga tggacaacga attcgctgct gaactgagtg 5400 agggatggct tatcatctat attgacgata ttattgtatt ttcgaaaaca tgggaagagc 5460 atctgcgtag actgatccaa gtgttcgaga aaataattaa aatgggaatg accatatctc 5520 tacaaaagac aaactttgca tttcaagaat tgaaagcact gggacatgtg gtatcgggac 5580 tatggatcgc gatagatcag aacaaagtag cagtggtatt acagaagcag atacctcaat 5640 cagtgaaaga agtacaatct ttcttaggtt tcgcgaacta ctacaggtca cacttggagg 5700 gattcgctag agtaagcggc ccattatata aacttctcca gaaaggagta tcgtttgaaa 5760 tgacacacaa tagagtggag tcatggaaca ctttaaagaa gagactgaca gaagcccctt 5820 ttctgttaca cgcagacttt aaactaccat tcaaattata tgtcgatgca agcttcgacg 5880 gtttaggtgc gacattgcaa cagattcaga taatcgacgg gaaaccctat gaaggattga 5940 ttggttgtat atcaaggaag cttaaacaat ctgagttaaa ctacggagcc acgcagttag 6000 aaaacttgtg cttagtatgg gcactagata aattccatta ttatcttgat ggagcatttt 6060 tcgaagtcat aacagactgt acagcgttga aagccctttt aaacatgaaa actccaaacc 6120 gtcacatgtt acgatggcag atagcgatcc aagaatacag atcatgtatg 
acaattaccc 6180 atagagaagg aaagaaacat gataacgctg acgctctcag cagaatggct ctagataata 6240 atgagagcaa cccagcctgg gaaccagaag acgatattag agatataccc attatgggaa 6300 tcagcatatc agacctatca gaagaattct ttgaacagat taaagaagat tataagtcag 6360 aaccaaacac agtgaaattg gtcgaaatac ttagagcaga tcacaaagta catgagttaa 6420 catcgtcttt ggacggaatt tggaagaagt cctttgaaga aggaaggttt acactattga 6480 gtggggtatt ataccacaga gaaaagcatt cgtgtgtcat ggtaattacg tcagaaacaa 6540 caatcacaga tatattaaaa atttgtcacg atgacgtgat gtcaggacat ctatctctag 6600 acagaacgct ggacagagtt aagcaaacgg cttggtggcc gaactggaga aagctcacag 6660 aagaatatgt cattacgtgc aatacatgtc aaaagacaaa ccaaaccacg ggtaagaaac 6720 taggtcttct tcaacagata acagaaccta aagtcccatg ggagattatt aatatggatt 6780 ttgtcactgg cctaccagca gcaggacacg ataatgtgga ctgtgttcta gtagtagtcg 6840 atagattttt gaaaagatgc cggttcttac cctgccataa aaacgctaca gcaatggaca 6900 tagccttatt attttgggaa aggataattt cggacgcagg gctaccgcga gttataatca 6960 gcgacagaga tcccaaattc acctcggaat tctggaaagg tctgtactcg ctgacaggaa 7020 caaaattgtc tatgtcaaca gcataccatc cgcaaacaga tggcctagcg gaaaggatga 7080 taagcagtct cgaagacttg attagaaggt attgcggcta cggccttgca ttcaaagatg 7140 gcgaaggata cacccacgac tggaagacac tgctaccagc attagagatt gcatacaata 7200 cgtctacaca tagtactacc ggaaaaacac cattcgaaat agagcaagga tttaatataa 7260 gaactcctaa agacttcttg agaccaaaag actccacctt ccatccaaca gcatcaaatt 7320 ttcatacgat gttgactaga gctagaatgc acgcagagga atgtataaaa gattcaacta 7380 aatataataa agaacgatgg gataagaccc acaaagaacc tgatttcaaa gttggagaat 7440 tagtcttaat atcgaccatc aatttcaaca acctccaagg acccaaaaag ctgaaagaag 7500 cgttcatagg accgtttgtt atcaaagagc ttcatggaaa aaatgcagta gaagttatac 7560 ttacaggaaa gattgaaagg aaacacccga catttccagt atcattacta aagagatacg 7620 aaaaatcaga tgagaagaag ttcccacata ggaaagtgtc agatacggcg gacataccca 7680 tcaaagacga agtccaagga gaaattcaga agatcattga cgagaagctt gtttgactga 7740 acggcaaaga tgtacggttt taccttgtta gattcaagaa cagtacagca gacagagacc 7800 aatggttaga aatgaaagat atatcacaag caacgctatt gctgaggaga tacagagtat 
7860 ctaaacgaca ataatgtaaa aaatcagagt ggaaaactcc accttgtggt tggggaa 7917 // ID TCA2_LTR repbase; DNA; FNG; 381 BP. XX AC AACQ01000023; XX DT 04-AUG-2005 (Rel. 10.08, Created) DT 30-AUG-2005 (Rel. 10.08, Last updated, Version 1) XX DE Copia-like LTR retroelement from Candida albicans (long terminal DE repeat). XX KW Copia; LTR Retrotransposon; Transposable Element; TCA2_LTR. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-381 RA Jurka J.; RT "TCA2: A recently inserted Copia-like element from Candida."; RL Repbase Reports 5(8), 227-227 (2005). XX DR Genbank; AACQ01000023; Positions 19319 19699. XX CC LTRs are 100% identical. This appears to be a very recent CC insertion. XX SQ Sequence 381 BP; 121 A; 65 C; 66 G; 129 T; 0 other; tgttcgctat agggtaggtc ttccaagcta attttacccg acacaagatg aaatattttc 60 tgttgagcac tcgttgtcga cagtgaaaaa ttttcactca agaaaatatt ttatcatcac 120 tttttctaga atggaggttc aagtgttgga gaatagacag cgaacacctg atattcccaa 180 ggtcgaatta gattgaaaga taaataatag tcatatttat tttgtattta gtcaataaat 240 tatcttttta tatttaaatt cttagtattg tcataccacg tagattgata cggacatact 300 tagcacattt aacatatatt aagcaccgat tacctgtgac attccggagt ttactgtttc 360 gcgcacgctg gcagacgaac a 381 // ID I-1_PPl repbase; DNA; FNG; 2423 BP. XX AC . XX DT 30-JAN-2011 (Rel. 16.02, Created) DT 30-JAN-2011 (Rel. 16.02, Last updated, Version 1) XX DE Non-LTR retrotransposon from the Postia placenta genome. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-1_PPl. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-2423 RA Jurka J.; RT "Non-LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR [1] (Consensus) XX CC >97% identical to each other. Low-copy. 
XX FH Key Location/Qualifiers FT CDS 155..2026 FT /product="I-1_PPl_1p" FT /translation="MTELARPDNGEGTSGFVDDSTFLAEAKTFEDTHHKLD FT QMMTRPDGALHYAREHNVQFELTKTALLDLTRKRERPTGHPTRTRPMTRPT FT SVIAGQPITPSASHKXIGVILDQELRFKEHAAYALGKGQFWAAQLRRLSKP FT TQGMPLRFSRQLYLSVAVPRMLYAADVWCAPTLTSTKNGRRPSQHVRKLIG FT VQRLVTMQSLGALRSTATDLLDAHADLLPIDLLVDRIVMHSALRLATLHPS FT NPMCAVAKKTARRLVKRHRTALHHTFRIFSVDPDRTETINPTRLSPNWEPR FT FRVHIAPSAEKAVEEAESCRDGHKIWTDGSLIDGGVGAAAVLEKNDEHKAV FT LRKYLGSGTEHIVFDGENVGLALGLELVRKERNVRRVSAFVDNKAALLATG FT NTHARPGHHLTDIVHKQIDHLFRCHPHARLTLRWXPSHVGVPGNEAVDHQA FT KMAARQRPDPPNMTALPRVLRKSLPMSVAALKQHRTANXKREALTRWGTSP FT RCDRLRRIDPTLPSNKFLKLAGTLPKQQTSLLIQLRSGHAPLNKHLFRLGK FT ADSARCSCGTDDETVIHFLLRCPNWNRARAPLRRAFPPSNLQLRTLLNDPD FT ALTHLFDYIKATGRFAAGRNDRLNPH" XX SQ Sequence 2423 BP; 611 A; 807 C; 549 G; 453 T; 3 other; gaaagagcta actgactgga cctacaggaa actcacaggg cgcacgacca ccatcgcctt 60 tgacggctac accactgaac tgtacacaat tcccagtgga cttgaccagg gatgtcccct 120 ttcggtaata ttccatcact tctataacgc gcgcatgact gaactcgctc gtcctgacaa 180 tggcgaggga acgtccggat ttgttgatga ctccaccttc cttgcggaag ccaaaacctt 240 cgaagataca catcacaaac ttgaccaaat gatgacacgc cccgatggcg cgttgcacta 300 cgcacgcgag cacaatgtcc aatttgaact tacgaagacg gcccttttag acctgactag 360 gaaacgagaa cgaccaacgg gccacccgac tcgaacacgc cccatgacgc gacccacctc 420 agtcatcgca ggacagccaa taacgccctc agcctctcac aagtncatcg gcgtcatcct 480 cgatcaagag ctgcgattca aagaacatgc ggcctacgcc cttggcaaag ggcaattctg 540 ggcagctcaa ctgcgacggc tatcgaaacc aacgcagggc atgccccttc gcttctcccg 600 ccaactgtac ctatcagtcg cagtcccacg catgctctat gctgctgacg tttggtgcgc 660 gccaaccctg acgagcacaa agaacggacg acgcccgtca cagcatgtgc ggaaacttat 720 aggtgtacaa cgcctggtaa ctatgcagag ccttggagcc ctacgctcca cagcaacgga 780 tcttctcgat gcacatgctg acctactacc aatagatctt cttgtagatc ggatcgtaat 840 gcactctgcg ctacgactcg caaccctcca cccgagcaat cccatgtgcg cagtggcaaa 900 gaagaccgca cgccgccttg tcaaacgaca ccgtactgct ctgcaccaca ccttccggat 960 cttcagtgtc gaccctgacc ggacagaaac gatcaatcca actcgactca 
gcccaaactg 1020 ggagccacga ttccgggtgc acatcgcgcc cagcgcggag aaggcggtag aggaggcaga 1080 atcatgccga gatggacaca agatctggac ggacggatct ctaatcgacg gaggagttgg 1140 agctgcagca gttctggaga agaacgatga gcacaaggct gtcctgcgga aataccttgg 1200 gagcggcacc gagcacatag tgtttgatgg cgagaacgtc ggcctcgcac tgggattgga 1260 gctggtacga aaggagcgca acgtgcgcag ggtgtccgct tttgttgaca acaaagccgc 1320 gctattagct accggcaaca ctcatgcccg accagggcac cacttaacgg acatcgttca 1380 caagcagatt gaccatctgt tccgatgcca cccccacgca cgtctcacgt tacgatggnt 1440 tcccagccat gtcggcgtcc cgggcaacga agcggttgat caccaagcga aaatggcagc 1500 gcgtcagagg cctgaccccc ctaacatgac agcgctgccc cgcgtactcc gcaaatcact 1560 gcccatgagc gtcgccgccc tgaaacagca cagaacggcc aacntcaaac gagaagctct 1620 cacccgttgg ggaacatccc ctcgctgcga cagactacgc cgaatcgacc ccactctccc 1680 atcgaacaaa ttccttaaac tcgccggtac cctccccaag cagcagacta gcctcctcat 1740 acagttgcga tctggacacg cccccctcaa taagcacctg ttccgacttg gcaaggccga 1800 ctcagcgcgg tgctcctgcg gcacagacga cgagacagtg atccacttcc tgcttcggtg 1860 cccgaactgg aaccgtgcgc gcgcaccact ccggcgagcc ttccctccct cgaacttgca 1920 actacgcacg ttgctcaacg accccgacgc tctcacgcat ctctttgact acatcaaagc 1980 gaccggccgc ttcgcggccg gacggaacga ccgcctgaat ccccactgac tcaaagtcga 2040 ggtatgacag tcttcccgtg acgaacgacc actccacgca ctcccactac tccttgcacg 2100 aacacatccc tcatcatata ctgattatta ctgtacattt cttaatctac tacctacata 2160 ctattactct tacctcaccc tgcgcccatg cctttcacta cgccctcacc cggccccttt 2220 gacaagctga actcgagaga tgtcctcacc caccccccac gtcgatacac gttagataga 2280 tcgctcgcgc tccacttact ccatgctacc ggacagccca tcgatactac ctcccacgct 2340 acgcagtgta ctcgcaggta ttatccggca tagggcgaag ccctcaagcg gatataaaac 2400 tggatttaaa aaataataaa aaa 2423 // ID Gypsy-3_AM-LTR repbase; DNA; FNG; 169 BP. XX AC ACDU01007691; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Allomyces macrogynus genome: long DE terminal repeat. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_AM_; KW Gypsy-3_AM-I; Gypsy-3_AM-LTR. XX OS Allomyces macrogynus OC Eukaryota; Fungi; Blastocladiomycota; Blastocladiomycetes; OC Blastocladiales; Blastocladiaceae; Allomyces. XX RN [1] RP 1-169 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Allomyces macrogynus genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; ACDU01007691; Positions 31734 31902. XX SQ Sequence 169 BP; 35 A; 55 C; 42 G; 37 T; 0 other; tgttggcgcg tatgcaagcg tcccccgctt gcgccctgcc aacacgtggt ccccaattag 60 acgcatagtg tgaggcgtct tcctacataa gaaggacgct gctgtgcgaa atacatcctt 120 acgcaatcct ccccgaagta tcagtcaggg ccccgtcccg gtcgtatca 169 // ID Gypsy-8_RO-LTR repbase; DNA; FNG; 343 BP. XX AC AACW02000181; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_RO_; KW Gypsy-8_RO-I; Gypsy-8_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-343 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000181; Positions 1745 1403. XX SQ Sequence 343 BP; 99 A; 48 C; 94 G; 102 T; 0 other; tgatatgtgc tatgttggtc aggaccgcct cttttttcca aatgaaacac cgatgggtcc 60 gggagaagat gaccggaata gactagagta tggactggaa ggttatttga ccctgagtgg 120 tggtcgtgct gagactgaaa tgaaggtgga agttcagggt cctaaagagg gtccttatcc 180 taatgcttgg tcagtcgcta atgaagatgg taatggtggc cttatgtatt ttttttgaag 240 aataatggtg gccccattaa attgtagatg tataacggag tggctagcag ccaatgaaaa 300 gtataaaagg atgtttgatt ttttattgaa taaacgatat aca 343 // ID Copia-17_MLP-LTR repbase; DNA; FNG; 254 BP. XX AC AECX01001143; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-17_MLP_; KW Copia-17_MLP-I; Copia-17_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-254 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001143; Positions 7958 8211. XX SQ Sequence 254 BP; 58 A; 58 C; 60 G; 78 T; 0 other; tggtcttgac tacaatcctg tatacgtagt agcatatcct tgtagcgggg ttaggttaga 60 tcacttatat agcctcgctc cacccggttc tttcctttct tcactatcga agattggatt 120 agttctcgga ccgggtgaag cgacgagcgc tgacctttag ctttaacagg ttagtaacct 180 cttgcaatac tatcgaagat tggattagtt ctcggaccgg gtgaagcgac gagcgctgac 240 ctttagcttt aaca 254 // ID Gypsy-118_MLP-I repbase; DNA; FNG; 5884 BP. XX AC . XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-118_MLP_; KW Gypsy-118_MLP-LTR; Gypsy-118_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5884 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR [1] (Consensus) XX CC 'CTTG' target site duplication CC LTRs are 99% similar to each other. CC This is a reconstructed sequence (insertion of an EnSpm like CC transposon is deleted). Therefore it is listed as "consensus". CC Original coordinates: Positions 10620 17319 Accession No CC AECX01000825. 
XX FH Key Location/Qualifiers FT CDS 462..3248 FT /product="Gypsy-118_MLP-I_1p" FT /translation="MEEHNGFVQIPAEDLRLIKAQLQQNNTDAQVQAQAIA FT ELCNQNQQTHQELASAQNHIYQLEQTQHTRSTQPAPLKTDAFRAEPRKFGG FT FGDNAELWISELESNFRTHRYPEADWGEMIGNYLDEESRMFWHQVRDTANG FT GTCTDYHDFRNQFLAKYSCGMLKEEVREKLKLCFYKGDVGKYILRFRKLIC FT QIPDKDLPFFEKSYLFLDKIPASHRQDINKTEYKKDKDMEILYAAARESER FT NMRINQRKIQPNDRKFTSDRKFNFQKPYNNFHNRRPSIPFGKPPMSTNSAV FT PMDLDMADLSKVECYKCHKTGHFSRNCTEKSSHKNPRNGNSRGDRRKPNLL FT LLDLVQAIREPEFVAPFQDHELQELAAFITPFHDANGQTFQASRESSESLD FT GPCEDGREALAEYQRHYLDALRDMEPEDFLDLQKERDAVEKLFKKASENTV FT RCCGACYAYGCEDSPPTTSKGIEYFNLEPHELPSGPGKRPREDIGPEVETL FT PAHDPDGVNELYYMNLEKFINNRLDKKKIKLDRTLIDGYKYTGTPFNSPVE FT EKEQFISPEKELENVETWNKILSTAFKAKRLEDLTMSGFNEDIEDPVIWTV FT DSIETNKIIFDLEDSKDEQEDNSTGVWDLFHIDEAKQEVPDLNLIDLTKPE FT EKEIIELNIVDLAAHETKYTSLPIYACYFRNMPVDAILDTGAAANYVSEAK FT LNAMKRMFPHRIKIKTVEGRGVRLANGAQEVANKVAIFEVEMDHHEKNGSF FT SFEIEAFVLPLPNISLILGLPWFKEFNPSIDFKTGVYMVDAKVGNKTFSFG FT PRSDEPKLFTIEDKSFGNQLLESAKKIAPYCFKDDIEIHERKYKHYIDTGD FT SKPIKSHGRPLTPPEHELINQFVEEGLKQGIIEKTDSPWSSPLLLVKKPRR FT QHPGLCRLPRPEQGNKKKHVPLTPH" FT CDS 3160..5706 FT /product="Gypsy-118_MLP-I_2p" FT /translation="KNQDGSTRVCVDYRALNKVTRKNTYPLPRIDDAYLFL FT SGAKIFSTIDLKSGFWQVQMADEDKDKTAFTCRQGHFQWKVMPFGLCNAPA FT TFQEMMNDILKDIIDKFALVYLDDVVIYSKSEKEHQEHIKMLFEILNKHNL FT VVSGKKCQWGLPSLLFLGHVVDGNGIRTNPDKIKKIVEWPIPANISHVRGF FT LNLCTYYKRFISKFSMIASPLYKLTEGSPKPGSPIKWGEDEQQSFEKLKIS FT LSETVPLQHPTPFKPFVLDTDASGTNIGAVLQQDTAVDVPKDVSFDYLDYQ FT KKLKNGNLRPIAFESRKLSKTEQNYSAQERELLAIIHGIKHFRGFVEGSPV FT LVRTDHESLKYFKTQKHINRRLARFVDELKFFDAYIIYRPGKEQLAADALS FT RKPNTDSDPDPPETANALFNINQEKDAYTELIQYRLKLEAGVDRTLVGDGT FT FDLGSNALIKYDPTDENIGPLTVPTNKADALALCFGLHEDLGHRNEKDVIS FT AVKKRYWFPEVTATVKEAISLCSACQFAAKTTSQHYLPTQSISGGEPFRRW FT GMDFVGPLPKTANNNQYLITAIDYGTGWSYAQPLKSTSAEAVVDMTKNIIL FT NHGVPNEIVTDNGSEFLSNKFKDFLKDYDIKHVTTTPYHPQANGLVERFHG FT TFMNALRKFCSPYNQLLWDEYVNTCLLAYRTSYSHSMKASPYYMAYGSEAR FT LLSEKVSIMFDSSFENLDLIHKQRNLSVHKHRIKRQDLIQELNQRAKEKDK FT 
ESEEGYTERNLRPGDRVLRSYEARPSKLHPRWDGPFIIHNAYPNGTFSLMT FT SNVHILNSKTNGCRLKIFKGTSDEFYFASQRLQQRDTAARQQHRQSS" XX SQ Sequence 5884 BP; 1891 A; 1522 C; 1170 G; 1301 T; 0 other; aataattatt tgtgtgtatt tcttttttat cttctaactt tcaatcaaca cgtaccttga 60 tatcgaaaaa aaaaaattgt tgatctatcc ctctttgttt acctctgtaa cctccctatt 120 tcctagatcc cataccggaa agataaacct acagaagcac ccaagacccc catagatccc 180 gtcgtaagtt caccggattc cctagctagt tatccatacc cttcacaaca tttctttcga 240 ttagttagac ctcctgatag acgatccctt actctattgc acacagaaaa caacagcgta 300 tcctaagctt taatcgttat agactaagtt ctatcctttt tcttgttgaa gaaaataaaa 360 gtacccgtac gtacccttaa agactctgat caagtcctac ccagtatatc ttatagtaac 420 ccttccatta tcttttcgac gcacagaaaa caccctccat aatggaagag cataatggat 480 ttgtccaaat cccagctgaa gacctcagac tcataaaggc tcagctacag caaaacaaca 540 cggatgctca agtccaagca caagcaattg ccgaactctg taaccaaaac caacaaaccc 600 accaggaact agccagcgca caaaatcaca tctaccagct tgaacaaacc caacataccc 660 gcagcactca accagctccc ctgaaaactg acgcatttcg cgcggagcca cgtaagttcg 720 gcggcttcgg cgataatgct gaattatgga tctcggaact agaatccaat ttcagaaccc 780 atcgctaccc tgaggctgac tggggtgaga tgatcggtaa ctacctcgac gaggaaagcc 840 ggatgttttg gcaccaagtc cgagacaccg ccaacggtgg aacctgcacc gactaccatg 900 acttccgcaa tcagttcctc gccaagtaca gctgtggaat gctcaaagaa gaagttcgag 960 aaaaactcaa gttatgtttt tacaaaggcg atgtcggcaa atacatcctg cgattccgga 1020 aactcatctg tcaaatacca gacaaagacc tccccttctt cgaaaagtct tatttatttc 1080 tcgataagat ccctgcctct catagacaag acatcaacaa aactgaatac aagaaggaca 1140 aggacatgga aatcctgtac gccgcggccc gcgaaagtga aaggaacatg cggatcaacc 1200 aacgcaagat ccaacccaac gaccgaaagt tcacatcaga ccgcaaattc aactttcaaa 1260 aaccatacaa caacttccat aaccgacgcc catctatccc attcggtaaa ccccctatga 1320 gcacaaactc cgccgtgccc atggacctag acatggccga cctaagcaag gtcgagtgct 1380 acaagtgtca caagaccgga cacttctccc gcaactgtac tgagaaatct tcccataaga 1440 accctcggaa cggcaactct cgaggcgacc gtaggaaacc caacctcctt ctacttgacc 1500 tcgtccaggc catcagagag cctgagtttg tggcaccctt ccaggaccac gaactacagg 1560 
aattagcggc cttcatcacc ccattccacg acgccaatgg tcaaaccttt caggcctcgc 1620 gagaatcctc agagagcctt gacggaccct gtgaagacgg aagagaagcc ctggccgaat 1680 atcaacgcca ctacctagac gctctaagag atatggagcc agaagatttc cttgaccttc 1740 aaaaggaacg agacgccgta gaaaaactct tcaagaaggc ctccgaaaac actgtacgtt 1800 gttgcggagc ttgctacgct tacggctgtg aagatagccc accaactacc tcgaaaggca 1860 ttgagtactt caacttagaa cctcacgagc tgcccagtgg ccccggcaag cgccctagag 1920 aagacatcgg accagaggtt gaaaccctac cagctcacga tccagatggt gtcaacgagc 1980 tatactacat gaatctagaa aaattcatca acaatcgact cgacaagaag aaaatcaaac 2040 tcgatcgcac cttgatcgat ggatataaat acaccggcac ccctttcaac tcgccggtag 2100 aagaaaaaga acaattcatc tcgcctgaaa aagaactcga gaacgtcgag acctggaaca 2160 agattctgtc aaccgccttc aaagcaaaac ggctcgaaga cctcactatg tctggcttca 2220 acgaagacat agaagacccc gtaatctgga ctgtggattc catcgagacg aataaaatca 2280 tctttgatct cgaagatagc aaagatgaac aagaagacaa ctcaacggga gtttgggacc 2340 tgttccacat agacgaagca aaacaagaag tccccgacct caatttgatc gacttaacta 2400 agccggaaga aaaggagatc attgagctca atattgtcga cttggccgcc catgagacta 2460 agtacacgtc cctaccaatc tacgcatgct attttagaaa catgccagtt gatgctatct 2520 tggacactgg tgcggcagca aattacgtct cagaagcaaa gcttaacgct atgaagagga 2580 tgtttcctca ccgtattaag atcaaaacgg tcgaaggacg aggagtgaga ttagccaacg 2640 gcgcacaaga ggtcgctaat aaagtcgcaa tttttgaagt ggagatggac catcatgaga 2700 agaacggttc tttcagcttt gagatagaag ccttcgtgtt acccctacct aacatctcgt 2760 tgatcttagg cctcccctgg ttcaaagagt tcaacccgag tattgacttc aagacaggtg 2820 tctacatggt ggatgctaag gttggcaata aaaccttctc tttcggacca cgatcagatg 2880 aacctaaact atttactata gaggacaagt cattcggcaa ccaactactc gaatcagcaa 2940 agaagatcgc gccctactgt ttcaaggacg acattgagat tcacgagcgc aagtataaac 3000 attatatcga cacgggtgac agtaagccta tcaaatctca cggtagacct cttaccccac 3060 cagagcacga actgattaat caatttgttg aagaaggctt gaaacagggt attatagaga 3120 aaacagactc cccttggagc tcgccgctgt tgttggtaaa aaaaccaaga cggcagcacc 3180 cgggtctgtg tagattaccg cgccctgaac aaggtaacaa gaaaaaacac gtacccctta 3240 ccccgcattg 
acgacgcata cctatttctc tcaggtgcta aaatcttctc tactattgat 3300 ttaaaatctg ggttctggca ggttcagatg gcggacgagg acaaggataa gactgccttc 3360 acgtgcaggc agggtcactt ccaatggaaa gtgatgcctt ttggactatg taatgcccca 3420 gccacttttc aagaaatgat gaacgatatc ctgaaggata ttattgacaa atttgccctt 3480 gtttatttag acgacgtagt aatctattca aaatctgaga aagagcacca agaacatatt 3540 aaaatgttat ttgaaatctt aaataaacac aacctggtag tgtcgggcaa gaagtgccaa 3600 tgggggctac cctcactact gttcttaggt cacgtggtag atggtaatgg aataaggaca 3660 aacccagaca aaattaaaaa gattgttgag tggccaatcc ctgctaatat ctctcatgtg 3720 agaggttttt tgaacttgtg tacttactac aaacgcttca tttcgaagtt ttccatgatt 3780 gcctctccgc tttataaact cactgaaggc tcaccgaaac ccggatcgcc tattaagtgg 3840 ggggaggacg agcagcaatc tttcgaaaaa ctcaaaatct ccctctcaga aactgtccca 3900 cttcaacacc caacaccttt caaacccttt gtactcgaca ctgacgcatc aggtaccaac 3960 ataggagcag tcctacaaca agataccgct gtcgatgtac caaaagatgt aagctttgac 4020 tacttagatt accagaagaa actaaagaac ggcaacctaa gacccattgc ctttgaatca 4080 cggaaacttt caaagaccga acaaaactat tctgcccaag aaagagaact attggccata 4140 attcacggga tcaagcactt tagaggtttt gtagaaggtt cgcccgtttt agttaggact 4200 gaccatgagt ccctaaaata tttcaaaacg caaaagcata ttaacagaag acttgcccgg 4260 tttgtagatg agctcaaatt cttcgatgca tacatcatct atcgaccagg taaggaacaa 4320 ctcgcggcgg acgccctgtc cagaaagcca aacactgact ctgaccctga tccccccgaa 4380 acagccaatg ctctattcaa cattaaccag gaaaaagacg cctacacgga gctcatccaa 4440 tataggctca agcttgaggc cggagtcgat cgcaccttag ttggcgacgg aacctttgac 4500 ttaggaagca acgccttgat caaatacgac ccgacggatg aaaacattgg gcctctgacc 4560 gtccccacca acaaggctga cgccctagcg ctctgttttg gattacacga agatttaggc 4620 catagaaacg agaaggacgt catcagcgca gtgaagaaac ggtactggtt cccagaagtc 4680 acggcgaccg tcaaggaagc aatttccctc tgcagcgctt gtcaattcgc agcaaaaact 4740 acaagccagc actacttgcc aactcaatct atttctggag gtgagccctt cagacgttgg 4800 ggcatggatt tcgtaggacc attgcccaaa acagctaaca acaatcaata ccttatcacc 4860 gcaatagact acgggacagg atggtcttac gcacaaccac tcaaatccac ctccgccgag 4920 gccgttgtcg atatgacaaa 
gaacatcatc ctaaaccacg gtgtaccaaa tgaaattgtc 4980 accgataatg gatcggaatt cctgagcaat aaattcaaag acttcctcaa ggattacgac 5040 attaaacatg ttacaacaac accatatcac cctcaggcaa acgggctagt ggaacggttc 5100 cacggaacct tcatgaatgc gcttcgtaag ttctgcagcc catataatca actcctgtgg 5160 gacgaatacg taaacacgtg cctactagct taccgaactt cctactcgca ctcaatgaaa 5220 gcatccccat actatatggc ttacggcagc gaagcgagat tactctctga aaaagtaagt 5280 ataatgttcg atagttcttt cgaaaactta gacctaattc ataaacaacg aaacttgtca 5340 gtccacaaac ataggatcaa gagacaagac ctcatacaag aactcaacca aagggcaaaa 5400 gaaaaagata aagagagcga agaaggctac acggaacgca acctgcggcc tggagaccga 5460 gtactacgat catacgaagc aaggcccagt aagctccacc ctagatggga cggacctttc 5520 atcatccata atgcctaccc aaatggtacg ttcagtctaa tgacctcgaa tgttcacata 5580 ttgaacagca aaactaacgg gtgtcggcta aaaatattta aaggaacctc cgacgagttc 5640 tacttcgcat cacaaagact gcagcaacga gatacagctg caaggcagca acaccgtcag 5700 tcatcttaaa gaaacggctg ccgccttcga aaaaattaaa caacacgtgt ccgagcagta 5760 catctttgac tacatagaaa ttttagataa cctcaacgct tcaaccctta gagaactcat 5820 ccgacgagaa aaagggaaat accccgaggt aaactaggaa gtttacgtct taaggagggg 5880 atgg 5884 // ID Gypsy-10_MLP-LTR repbase; DNA; FNG; 180 BP. XX AC AECX01001617; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_MLP_; KW Gypsy-10_MLP-I; Gypsy-10_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-180 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001617; Positions 63840 63661. 
XX SQ Sequence 180 BP; 49 A; 54 C; 28 G; 49 T; 0 other; tgttatgatc cgggtttcac agagctgaca tatagtacat acatatcaca gacttgtagc 60 cagtatgttg tactcaccca cgtggtacat atgtcctttt ctctccatcc gacaatcaca 120 ataccagata gataggaccc acctcttccc ttgtcccaac ccctgaccgt gaccataaca 180 // ID LTRKT2 repbase; DNA; FNG; 417 BP. XX AC AJ439557; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Kluyveromyces thermotolerans retrotransposon LTRKT2, long DE terminal repeat. XX KW LTR Retrotransposon; Transposable Element; LTRKT2; KW Long terminal repeat; retrotransposon. XX OS Lachancea thermotolerans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Lachancea. XX RN [1] RP 1-417 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439557; Positions 1 417. XX SQ Sequence 417 BP; 107 A; 94 C; 87 G; 127 T; 2 other; tattgaggcg agtctgggtc attggtcgaa agggtcttaa attccttggc tagggtttcc 60 ataatgatct aggtaaacta cttcgcggat gatggtacta aatatctccc cgctatctcc 120 tttgttcaag aaatacgtcg tccataccaa agaaatgatc cggatttata caaaatttct 180 gggacgtggc tgagtcttaa tcataaggtg ataattgtac tggtggcacc actccttgcc 240 ttctccaagc cactgtggtc tgatctggac tatgaacggt caatgtgggc cttmtacgtg 300 acttagctgg gtatccgtcg tatttacgat atgtmcatgt gccatatcat ttcataccaa 360 tttgtcatta tgcaacttgg cactatcaca caccaacgaa ccgctcacaa tctgaca 417 // ID Merlin-2_Roryzae repbase; DNA; FNG; 4582 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Merlin; DNA transposon; Transposable Element; Merlin-2_Roryzae. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. 
XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 4582 BP; 1633 A; 726 C; 761 G; 1462 T; 0 other; gggtgattgg cacttcaggc cacgatggat aaatcataag ttctacagct atagcggagc 60 tagaataacc atccaaaaat ggaaactaag gttataagac aaggataatc gaatggcgct 120 gtttaatttt tgaaattccg aatatatccc gatttagtca agaataatgg ggcatttctt 180 aagtgaagcc cctttattcc aagttggttt tcatatcatt tggtacatta gtaattttta 240 aaagtagtca acaccatata gtgcttttaa acttgcttaa cagccatcaa aatatttgta 300 acaaatgatc agatgtatct aatgcattat ttctcagaaa ttacaaaata gcttaattgg 360 caatgccctc ctttctaatc ttgatgtcgt gagttcgaaa catcctgtta ccattttata 420 cttctgtgat aacgaattac gttttccttt ttttttgttt cggagctcaa aggattttgt 480 aagccacaaa aaaagttttt ttgtggcttc atgttttttc ttttttcttt tgtttttctt 540 ttttttagtt tttaataaaa ggtaagttga agataatata agtaaccatt attcattaag 600 ttcttttttt ttttataaaa aactatacta ttgaaacatt caacttcaaa ctagatacag 660 tatacaagta tgtcagcaaa caataacaat ttaatttcca atggtccaag ctcaagaatt 720 gtacctaatg gtgaattgga cagagaagct attacaattt ctttcgattc ttgttggagg 780 caaatatctt tttttttaac aaaataagat tgttaatttt ttttctttag ttttttaaga 840 agcattggtg ctatttctac tcaaccagag gttagatgtc ctaaatgcag cgaaatcatg 900 cgtggtccaa catggagagc atctcgtaat cagtatcaat ttcgctgcta taatcgtgct 960 gcgcatgcaa cagaagtctc gcaatctgtg acaaaaaact caatgtttta ccgaaaaaga 1020 tgtgatattc gtataacatt gaaggttata tttcgcctta ctatgaacca agaaaggttt 1080 tcaaccatct atagagatct ttcaattcac tacaaggtga ttgcccagtt gtaccgcgac 1140 ttgattgttg tttttgaaag cgacttgtta agacataatc caattatgtt aggtaagtta 1200 tgagtttttc ttttatatct tatttacatg atgtaggtgg gtcaggtgta gaccatgtcc 1260 aaattgatga gtcgaaattt ggtaagagga aataccatag aggccatcga gtggaaggcg 1320 tatgggtttt gggcatggtc gaagctattg ctcttggtac aaatagggtt gtgacagtat 1380 cacaagaaaa cggggagact gaaacaagat tggtaccaca atttaaagca ggccgcagat 1440 ttttatgcac tgttccaaac cgaaacgcac 
atacccttat acccatcatt cgaaagtacg 1500 ttgctccagg tacaacaatc agaacagatg gctggcgtgc ctacagcggc ttgcacccta 1560 gggagcgata caatgcccgc actggagctc ttcaagttgc aatgatggac agtgagcagc 1620 atatctacag acatcaagta gttaatcatt cccttgggtt tgctacggaa gatcaagtac 1680 gccaaaacaa tactcaaggg atgatcaata caaacatcat agaaggattg tgggccgatg 1740 tcaaaaggga aatgcaagct agacatcgta caaaggttga atgtccatat cgcctactgg 1800 agtacttatg gcgttatgaa aatcgcaacc gaatttgggc cgctctagtg cgtggtttag 1860 gagaagtatc gtttcctaat atgccaaggc agagaaatag agacgaagaa gttgaaaatt 1920 atcaacttat aacagaagaa gaaggaagtg aagaggtgat agaaggggat ccaacagatg 1980 aaatgaacaa caatcatgaa attgacagta attatttata caaaaaaaaa tgatagcctt 2040 atatttgtct ggatcaatgt aatactaatt tctgcataat agtcgatatt ggtgatgaca 2100 cagatacgga tgatactgat gatgatccag attatcaaga agtacaagta caaccacctc 2160 cagcacgtcg ccgtaggatt aatagaagat ctgatgtcgc agtcgtgata gaagatgaaa 2220 gcaacgcttt acctagttct ggtaacagta gtgctctttc agtagaagaa gctacaagta 2280 gattaatgga tgttgtacgt aaccgagaat taaataactt tcgagaagca tcagaagcat 2340 tatacaatgc tatgatcgca gagcaaaata ctccaaatgc agtataataa acgtaacata 2400 tattaaaaaa tataggcatt tttttgttaa ggtatttgct acgtacttcg tcaatttatt 2460 agggtcattt tactgtatct tatttaatta aaccaatatg aagtggctat cctttattta 2520 atgtagtcaa tacgtaacta tatgctaact ttaatttgca aagcaaaagc gctattatca 2580 tgtgctcttg cttaattata attcttcttt gcctaaaaag tacttactgt tgtgtttttt 2640 cattacatct actatgaact atagacacac atctaaaaaa aaataaatag caaagaaata 2700 taccgtatcc agtaaaaata atttatattt tttttaattc aaattataat gtgaataaca 2760 tatttaataa gaacacttaa aatttaatta agaatagatt tgtaattaaa aaagcattat 2820 aaaaacggta tgtattaata tttttaaatg atattcaaga atgatatcac gttattatat 2880 aacagtatat agtttattac ttttgtaaaa cagtaactga ttgtagttga cattaataaa 2940 tacaagcatt aatacaagat cataattaaa ataaaacaaa atataagttt atttaaataa 3000 gatcataccg ctgtagaaca gtaactgact gtagttgaca ttaataaata taagtcttaa 3060 tacaagatca taattaaaat aaaataagat acaaatttat ttaaataaga tcatacaatc 3120 aattccaatt aaacatcttc ttttagaata tctattgtat 
tcatgaaagt tctttgataa 3180 gaagaagtat caactaaaat aaccaaatcc aatccaatgt caggaatatt gctaatcttt 3240 cacataattg aaataacgcg taaaataacg ttattcaaat cagtgaggaa ttcccagaat 3300 aaaagatcat ctatgacaga catacaaaag aacaaatgaa aagttttcat gatcataatt 3360 agagataaca cttgctaaag tacctgaata ttagaaaaac gatagcaata gaagatatca 3420 agccactcac tcaataagat ataaatacct gccttcaatt agaataattt ttactttaac 3480 ttaaactgaa ataaagatca gactcgaaac gttcacaaac tgtttcttta tttttgtttt 3540 tacctcttgt tttctatgat cttaacaatt acaattagtt tttttacaac taatgtttga 3600 accataacga agaaataacg atattttaag aatcaaggaa acaatgtgat tcttacataa 3660 taataatgat gttatcaagt taaaaaactt agtatctata ttaaatgcat caaaagatgt 3720 ttaggaataa gcttccattt ggtataaatg ctcggacagt ggatgatgcc tctttgcaag 3780 atatccttat tatagcatat acgctacgga ccctaattag gtcgcaaaaa ttttggcggc 3840 catatcgata tttcatgtaa ctgccataca tcaataatat tagagttcaa gaaacataca 3900 aaggagttgg cgattgctaa tccttcaaaa aagtaaatga gggcaagctg agaaaaccag 3960 atgcttacag tgccagatgt gccccaaaat tcaatttttc agattccatt caaactttta 4020 cccaatgtag catattattg ggtaagtcat ttcccaaatt ctcggagtgt gaaagcaaat 4080 attcgctccg caataataga aaacgtgcgg atatgctata aatttaagga acacataaaa 4140 ttctttctac tcgatagttt ttgataaaat tttgcatact atatgcacta acaaataaaa 4200 atgtttatgc aaaatctccg ccaaaatcgc cgacattttt atttttttgc tgctgttttt 4260 agcatgtttc tcttcatcaa aacacaatat gctaacaaat ttaaaagtta tatttttgga 4320 gtatacagtt acacaaatgt tcaaattttt attaaaatct gtaaagatgt cattgatata 4380 aagcttgcca aaaaaaattg tgtaccgcct attttcagac gcaaaaattt atagcaaaga 4440 gtcggaattc gaaaacgaaa gtggtaccaa tcgatgcgca atggccaaaa gtatatactt 4500 ccatatttgg aatgtagttc tagcgccgat tgaactgtag aacttatgat ttatccatcg 4560 tggcctgaag tgccaattgt cc 4582 // ID Copia-47_MLP-LTR repbase; DNA; FNG; 318 BP. XX AC AECX01001103; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. 
XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-47_MLP_; KW Copia-47_MLP-I; Copia-47_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-318 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001103; Positions 338254 338571. XX SQ Sequence 318 BP; 90 A; 66 C; 44 G; 118 T; 0 other; tgttggaacc aagtttctgt tataagattt cattgtgttg cgtaggatga attacaatta 60 catattaaaa taaacttgta caatgtctga tactcaatac attttgtttc taaacctaac 120 cttgttttgt atcggcagat ctagtatatc aaacttgtct ctcgagaaaa cttcttactt 180 cttcttcatc aatcttagaa gttttctcac acaacaaaca tcatctccaa ataacctctc 240 cggaccatca ctgtccctcc tagtttatct gactgatatt tgttgttgtt tgatcttgat 300 cagggatctg agatccca 318 // ID Copia-1_ATe-I repbase; DNA; FNG; 5040 BP. XX AC AAJN01000194; XX DT 12-MAR-2011 (Rel. 16.03, Created) DT 12-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Aspergillus terreus genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_ATe_; KW Copia-1_ATe-LTR; Copia-1_ATe-I. XX OS Aspergillus terreus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-5040 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Aspergillus terreus genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AAJN01000194; Positions 6838 1799. XX CC Positions [2145-2627] - Integrase core CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 363..4934 FT /product="Copia-1_ATe-I_1p" FT /translation="MTDRISTILSTSSDWLLWFRSIKTLALNKGVWGYINP FT DQPEEPILPERPAYPFPSRIQGEATSILDLSPANRELYRDLLREYELQMRE FT FDHVNKGLTLIHMIIQNSISQQNKRAIINTNSAYQSLRILRDKLAPTNEAR FT KRDVIGRYHQLRAQGSSVRSPNLESYISQWENLYSEAQELDISEVNPPIRA FT LYDFLEAIHTVQPYFSTLWHDKLIEKSERNEPIPTFYEVTKRFRDHIKLSP FT KRNSHMANIANATFMGQSDNPNARPRIPRNRPRKCPCQSEPNDGFHDFSEC FT LYTNQSKRPIGWSPDPEAQKRFERANQNPGFKRAYENALQKFKTSQTTNLT FT VTMATVALATAQTTDSRSDWAFDTASNTHVCNDLSAFKDYTPRESKVLIGD FT THAEILGFGTVLMKPSNCYGKQTTFELKDCAYVPKFHINILSAMKAKKAGI FT FYNSRVPCLEGLDGSKVCQIYETGGISLVRWGEYSLSAHFTMGNPYKSESF FT GQNDLPDRFSTLELEDSRKQTNQTSQTYQIQRSDQPIISKGSIELWHQRLG FT HPGIKALKHLEQATSGAELTNQSEELGPCETCLVSKSKRQISRRPIGRGDQ FT PFETIHWDLIHMKRGMRNMNYISHIYDPMTHFHMAQTLQTKEQVSRSLNSM FT ILLVKNQYRVSPKRVHSDNDLNLSEFIDQSEGLIFNTSPPDQPEQNPFSER FT SGGLIIMKSRLLLVDSGIPMFLWPEVVSAATHILNRTPIRALDWKTPYEAL FT YTALPEKPGYMTSKPDISNLKRFGCRAYTRIINIPRLSKLQPRALIGYLVG FT YRASNIWRVWIPSRNKVISARDCVFDESRVYKDDPTGQLGELDPDLGLEVK FT ALTDSEINDLINSIEAESDESENAEMSLDLDDSRTLEDSADVLEPLENQEK FT EENEEYLEDSIGQDDIVLGDLPDSPPQSPVPPAPDVVIDNSQFKFDDYDDI FT MKPLGLKRQRSPSPDPEPSKIPRNYAILQAFSTDLGHKKLHKNDLPPPPAN FT WREAVNSPFRDDWIRAAEVEYQQLTNMDTWTPVSIDSIGPDQRALPLKWVF FT TYKFDQSGFLSKFKARICVRGDLQPISDLDTRAITLGSRTLRVVLALVAAF FT DLEAVQLDAVNAFLNSQLDETVYVSPPPGLGPKNRIFKLKKALYGLRRAPF FT LWQREIGETLKRLGLDPIPEDPCVYVNEFMILVIYVDDIIIIFRKENRERV FT EKVKQALCSTYQIKDLGEMTQFLNMKIHRKKSKRKLWISQKDYIEKIAHRF FT HLTNRKPPKTPLPAENLDISADETTLGPKEIKLYQQKVGSALYAAIISRPD FT IAYACQKLSEFLTKPTSRHMEAANHLISYLYGTRHWSIAYQGPIPPSQLGP FT FRVASDAAFADSAGRKSSEGYIFTLYSGPVDWSARKQKTITLSTTEAEFLA FT LSTASKQFLWWKRFFSAIGFNPSQKLAIECDNRQTVDLLIKQDSAYKSKLK FT HVDIHRHWLRQEVQNGSIQIQWVPTNQMIADGLTKRLTYQKHQEFMKHLNM FT EDISELSP" XX SQ Sequence 5040 BP; 1492 A; 1236 C; 1065 G; 1247 T; 0 other; tccgaggtgc ctgaggctcc tagagttcga tagttatgag cccggtccgc tacccctatc 60 cgacctgacc gaccttatca acctgttttg accattctga ttggtccgat ttattcgacc 120 tattcgacca atttgaccga tttgatctgt ttgacctgtt tgacctgttt 
gacttgaccg 180 attcgaccga tctgattgga ccaattcgac cgatctgacc agttgacccg attctaccga 240 cctctatcga tttctaccga ctaatctgac ctgattatcg attctatcga ttctgaccga 300 cgtcatcaac cgaattcgac cattctagaa ggctctgaca gctctagaac gttctatcga 360 aaatgaccga ccgaatctcg accattctat cgacctcatc tgattggctc ctctggtttc 420 gatcaatcaa aacccttgcc ctaaataaag gggtgtgggg ctacatcaat cctgaccaac 480 cagaagagcc tattctgccg gaacggcccg cctacccgtt tccatcaagg attcagggag 540 aagctacgtc aatcctagac ctttctccag ccaataggga gctctaccga gacctcctac 600 gcgagtatga acttcagatg cgcgaatttg accatgtgaa taagggtcta accctaattc 660 atatgattat tcagaattcg attagtcagc aaaacaagcg agcaatcatc aatacgaatt 720 ccgcctatca gagccttcga atcctccgag ataagctggc accgacaaat gaagcaagaa 780 aacgcgatgt gattggtcga taccaccagc ttagggctca gggctcctct gttcgaagcc 840 ctaatctaga gtcctatatt tctcagtggg aaaaccttta ttccgaagcc caagaactcg 900 acataagcga ggtcaatcca ccaatcagag ccctctatga cttcctagag gcgatccata 960 ctgttcagcc ctatttttca accctctggc acgacaaatt gatcgaaaaa tccgagcgga 1020 acgagccaat cccgacgttc tacgaggtga ctaagcggtt cagggaccat atcaaactaa 1080 gcccaaaaag gaatagtcac atggcaaaca ttgcaaacgc gacattcatg ggccaatcag 1140 acaacccgaa tgctaggcct agaatcccta gaaataggcc tagaaaatgt ccctgccaat 1200 cagagccaaa tgacggtttc catgacttca gcgaatgcct ctataccaac caatcaaaac 1260 gccctattgg atggtcacct gatcctgagg cacaaaaacg atttgaacgg gccaatcaga 1320 accctggatt caaaagggcc tatgaaaatg cgctgcagaa gttcaaaacc agtcaaacga 1380 ccaatttaac cgtcactatg gcaactgtcg ccctagcaac cgctcaaaca accgattcta 1440 gatctgattg ggccttcgac accgcttcaa acacccatgt ttgcaatgat ttgtcagcct 1500 tcaaggacta tactcctagg gaatcgaaag tcctaatagg cgacacacat gcagagatcc 1560 taggttttgg aacagtcctg atgaagccgt caaactgcta tggaaagcag actacctttg 1620 aactgaagga ctgtgcctat gtgcctaaat tccatataaa tattctctct gcaatgaagg 1680 ccaaaaaggc gggaatattc tataattcta gagttccttg cctagagggc ctagatgggt 1740 cgaaagtctg ccagatatat gaaacagggg gaatctccct agttagatgg ggcgaatact 1800 cactttccgc tcactttact atgggaaacc catataaaag tgagagtttt ggtcagaatg 1860 
acctaccgga ccgattttcg actctagagc tggaagactc tagaaaacag accaatcaga 1920 ccagtcagac ctatcagatt cagcggtctg atcagccaat catctcaaag gggtcaatag 1980 aactatggca tcagcgatta gggcatccag gcattaaagc cctaaaacat cttgaacagg 2040 caactagtgg ggcagaattg accaaccaga gcgaggaact aggaccctgc gagacctgcc 2100 tagtttccaa gtcaaaacgc cagatttctc gacgtcctat tggtcgaggc gatcagccat 2160 tcgagacaat ccactgggac ctgattcata tgaagcgggg catgcgaaat atgaactata 2220 ttagtcatat ttatgatcca atgacgcatt tccacatggc ccagaccctg cagaccaaag 2280 aacaggtttc taggtccctg aattcaatga ttctattggt caagaaccag tatagagtct 2340 cgccaaagcg ggtccattca gacaatgacc tgaatttaag cgagttcatc gaccaatcag 2400 aaggcctgat tttcaatact tcaccgcctg accaaccaga acaaaaccca ttttctgaac 2460 gctctggagg actgattatc atgaaatctc gcctcctatt ggtcgactcc gggatcccca 2520 tgtttctatg gccagaagtg gttagcgcgg cgacccacat cctgaaccga acaccaatca 2580 gagctctaga ttggaagacc ccctatgaag ctctatatac tgccttaccc gagaaacccg 2640 gctatatgac gtcaaaaccc gatatctcaa acttgaaacg tttcggttgt agggcctaca 2700 cgcggattat caatattccg aggctgagta agcttcaacc gcgggccctg attggatatc 2760 tagtcgggta tagggcttca aatatctgga gggtctggat tccatctaga aataaggtca 2820 tttctgcaag agattgtgtg tttgatgaat ctagggtcta taaagacgat cctactggtc 2880 aattaggcga acttgaccct gacctagggc tagaagtcaa agccctaact gactcagaaa 2940 ttaatgactt aattaattca attgaagctg agtcagatga gtcagaaaat gcagaaatgt 3000 ctctagatct agatgattct agaaccctag aagactctgc agatgttctg gaacctctag 3060 aaaatcaaga aaaagaagaa aatgaggaat atctagagga ttctattggt caggacgaca 3120 tagttctagg tgatttacct gattcaccgc ctcaatcacc tgttccgcct gcccctgatg 3180 tcgtcatcga taatagtcag ttcaaattcg acgattatga cgacattatg aagccgctag 3240 gcctaaaaag gcaacgatcg ccatcaccgg accccgaacc atcgaaaatt ccgagaaatt 3300 atgcaattct tcaggcattt tcgactgatt taggtcataa aaaactgcat aaaaatgacc 3360 tacctcctcc acctgccaac tggcgagaag cggtcaatag tccatttcga gatgactgga 3420 ttcgagctgc agaggtagaa taccagcagc ttactaatat ggacacatgg actcctgttt 3480 cgatcgattc tattggtcca gaccaacgag ctctaccgct gaagtgggtc tttacctaca 3540 agttcgacca 
atcaggcttc ctatcgaaat tcaaagcaag gatttgcgtt agaggggacc 3600 ttcagccaat cagtgacctc gatactaggg ctataaccct aggttctaga accctgcggg 3660 ttgttcttgc cctagtagct gcttttgacc tagaagcggt ccaattagat gcggtgaacg 3720 ctttcctgaa tagtcagctt gatgaaacag tctatgtttc accgcctcca gggctaggcc 3780 ctaaaaatag gatttttaaa ttgaaaaagg ccctatatgg actgcgtagg gcgccgtttc 3840 tatggcaacg agaaataggc gaaaccctaa aacgactagg tctagaccct attccagagg 3900 atccctgtgt ttatgtcaat gaattcatga tcttagtcat ctatgtagat gacatcatca 3960 ttatttttag aaaagaaaac agggaaaggg tcgaaaaagt gaaacaagcc ctctgttcga 4020 cctatcagat caaggatcta ggcgaaatga cgcaatttct gaatatgaaa attcatcgaa 4080 aaaagtcgaa aaggaagctc tggatctcgc agaaggacta tatcgaaaag atcgctcacc 4140 gctttcacct gaccaatcgg aagcctccaa aaacaccgct cccggcagaa aatctagata 4200 tctcggctga tgaaacgacg ctaggcccaa aagagataaa gctataccaa caaaaagtag 4260 ggtcagccct atatgcagcg attatctcta ggcccgatat cgcctacgca tgtcagaaat 4320 tgtcagaatt cctgactaag ccgacttcta gacatatgga ggcagccaat catctgatct 4380 cttacctcta cggaacacgc cattggtcaa ttgcctacca aggacctata ccgccttccc 4440 aattaggtcc attcagggtt gccagcgacg ccgcctttgc tgattctgcc ggcagaaagt 4500 cttctgaagg gtatattttt accctatata gcggaccagt tgattggtca gctagaaaac 4560 agaagactat aacactttcc accactgaag ctgaattcct cgcgttatct accgctagta 4620 agcagttttt atggtggaaa cggtttttct cggctatagg gttcaaccct agtcagaaac 4680 ttgcaattga atgcgataat aggcagacag ttgacctgct tattaagcag gactctgctt 4740 ataaatcgaa actaaaacat gtggacatcc accgccattg gctgcgccag gaagttcaga 4800 atggatctat tcagattcaa tgggttccga ccaatcagat gattgctgac ggtctgacta 4860 agcgtttgac ctatcagaaa catcaggaat tcatgaaaca tctgaatatg gaagatattt 4920 ccgaattgtc accttgactt cacttttcta tgggattcct acctaaaagt gaggttttga 4980 tcaaactgtc tgaaaagtct gcaggttcta ctattttcaa tattagaacc tgggggggtg 5040 // ID Gypsy-22_LBS-LTR repbase; DNA; FNG; 523 BP. XX AC ABFE01001864; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 
16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_LBS_; KW Gypsy-22_LBS-I; Gypsy-22_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-523 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001864; Positions 10775 10253. XX SQ Sequence 523 BP; 127 A; 144 C; 93 G; 159 T; 0 other; tgtaacgaag gattcaactc cttcatgttc attcctttca tctcttctat tacttccatt 60 tactttccac tgattaaacc acttcggaat taagtcacac tagatcacgt gcttctcgta 120 tttgtaacac gttgtttaca catgcaatgc atcgattctt tactctcacc accactgcgt 180 tctgcacact cgaagtggcg ttctatcgtg cttccaacct cagaaagtaa gtggtgtagt 240 tcaggtcctt cgctaacgct tctattatat agtaagcgtc agatcagtct caatctacgt 300 tctcacgtat agcgtccgat atccatcttt ccgtggattt cccggaaagt agaagaggtt 360 gtagccccgg atttccagag ccatcacaac ctcctctcgg atacgcccgc gaccgtcttc 420 gacaagaggt catcacacct ttgcagatct aactctcttg acgttcgtca ggactacgcg 480 ttgttggtaa gttactacga aaactcacag agcacctgct aca 523 // ID Copia-37_MLP-LTR repbase; DNA; FNG; 540 BP. XX AC AECX01001651; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-37_MLP_; KW Copia-37_MLP-I; Copia-37_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-540 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001651; Positions 97068 96529. 
XX SQ Sequence 540 BP; 133 A; 97 C; 120 G; 190 T; 0 other; tgttacgata ttatctggtt agctggtcaa agatcacaat cccttaaaag gaaagtgaaa 60 ggtgttggat ttagaagggt gtatgtggtg aaagtagact aatgcgctgt gggtaaacta 120 ggaagttgga caaagggata agcgatgtgg aaaatgtaga caggggtgtt gggaaaatcg 180 tttgggagtt ttctaatatc ttttgtgttt ctctttccct cttttctgat tcttttctta 240 tcgaaaagaa tcagtaagtt attttacttt tacatcatca tagttacttc tcgattgtgc 300 actactaact ttactcaaat acttagattt ttatttcaac tcttttgttg tctgtttaag 360 acctttccct tacgcgcgat ctcaattcat ttcgagctcg caggtcagtt attattgttc 420 tcgacgtgac tagcggatag ctgtagtctt acgccggaac ctgtgccctc aggtcagtta 480 ttattgttct cgacgtgact agcggatagc tgtagtctta cgccggaacc tgtgccctca 540 // ID TDH5_LTR repbase; DNA; FNG; 457 BP. XX AC AJ439552; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Debaryomyces hansenii retrotransposon TDH5_LTR, long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Long terminal repeat; RNaseH; TDH5_LTR; gag; integrase; pol; KW protease; retrotransposon; reverse transcriptase. XX OS Debaryomyces hansenii OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Debaryomyces. XX RN [1] RP 1-457 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439552; Positions 1 457. 
XX SQ Sequence 457 BP; 145 A; 78 C; 78 G; 156 T; 0 other; tgttgaaata tattagtata ttttatgcga agtcagaata aattgagaaa tcatgtaaag 60 cggctgcaaa ttcttttata tctcaataga accagaatgc cgtttggctt tctattgtta 120 taccataaag attacgcccg ccttatgatt gttgaagcta tcattggata gcagtttttc 180 ttggaagaaa ggcactgata aaatgccttt cggtttccat atatctgcct tcccattcgc 240 tgtttacctg gaaacaacgg gaagcgtatt cctattcttg tgcaaatatt ggactatata 300 aagaggtctt tggacctcct tgaatacaga atatatatag agattaagta aattatatat 360 tataattgac actgactaac tgattaactg actgactgat tacctaacct gatactgacc 420 tgataatgta tgctactttc gagaatttac ttcaaca 457 // ID Gypsy-3_GDe-I repbase; DNA; FNG; 6238 BP. XX AC AEFC01000450; XX DT 26-MAR-2011 (Rel. 16.03, Created) DT 26-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Geomyces destructans genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_GDe_; KW Gypsy-3_GDe-LTR; Gypsy-3_GDe-I. XX OS Geomyces destructans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Leotiomycetes incertae sedis; Myxotrichaceae; OC mitosporic Myxotrichaceae; Geomyces. XX RN [1] RP 1-6238 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Geomyces destructans genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AEFC01000450; Positions 429 6666. XX CC Positions [3937-4272] - Integrase core CC 'AATTG' target site duplication CC LTRs are 95% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 74..976 FT /product="Gypsy-3_GDe-I_1p" FT /translation="MPPRTRTQTADTEGSGSREAASLPPAESQQGIEDQLA FT TAQAIEEELLAKLQLKEAAERIERLRKRLEDGDVAEVLPPGLPVDDQSVAS FT SSSVSYGGLPKSIRPRQIAPYKGRTVREHSEFVTSCELSFRLEPSLNKSDE FT LRCDLAATWLEGEPRDAWLQHEKEPDFQKTWKGFKKFLLDLVEDPVNQGLS FT TVIKYEEARQRPGQSAQTFATYLETLEGELKAYSEEHRVQHFLAKLRPDLR FT LRVITYPEIPRTRSALIALASRFEQANLPKGQEVEVRRTTRKKRGGRILTR FT GVRAEKERE" FT CDS 1985..3856 FT /product="Gypsy-3_GDe-I_2p" FT /translation="MTVKNRHPIPLISEILDRLSGAAVFTKLDAKDAYQRM FT RIRKGDEWKTAFLTRYGLFEYLVMPFGLCNAPASFQAYINEALTGLVDVMC FT IVYLDDVLIYSKRKEEHPQHVRTVLERLRKFKIYCNLKKCKFSTKAKEVEF FT LGFIVSMKGVSMDLSRVETITDWPVPTTFRDLQVFLGFANFYQRFIADYSR FT IAQPLTELLKGSEKGKKVGPFLWNVEAAGAFQTLKDAFTSAPLLRHFDPAQ FT LIRLETDGSGMAIAGILMQPYTDIAGVTRWHPIAFYSRKLKDAEIRYDTHD FT IELLAIVAAFKHWRHYCQDSNFSVIVRTDHNNLIYFMKKVSINNRQAHWIE FT FLAGFDFQIEHRPGRTNPADGPSRRPDYVETQHQAFDGLLPTLQQKLARGY FT SEAVENSEGDASQKSTRIRAVFAKAGERTVTQDILHEVDKLKLHPHKGHAE FT DTQAFLPRCHVMREAATEAPYEEPSLDLRKLIVQVQSGDPFLKDRALRMTK FT RRPRMEGVEREWALHQESGVLSREGIIPFPRDPALIEELLRIHHDDPLAGH FT FGVAKTTELLTRKYKWPKMREDIKEYVKTCQICQKMVVKRHRPYGELQPLP FT QPNGPGEEISMDFITGLPPSLHSLTN" XX SQ Sequence 6238 BP; 1637 A; 1583 C; 1779 G; 1239 T; 0 other; ttctgaacgc tccaccattc ctacgattag gaagagtagc ccagtgaggg aagtcaaaag 60 agaagccgaa cccatgccgc cacgcacccg cacacagacc gcagacaccg aagggagtgg 120 cagtcgcgaa gcagcgtctc tgccgccagc tgaaagtcag caagggattg aggaccagct 180 agcaactgcc caggccatcg aggaagaact ccttgcgaag ctccagctca aggaggcggc 240 cgagcgaatt gagcggctaa ggaagcggtt ggaagatggg gacgttgcgg aagttctacc 300 gccaggcctc cccgtcgacg accaaagtgt tgcatcatcc agctccgtct cttacggggg 360 actgccgaag agcataagac cccggcagat cgcgccatac aagggaagaa cggtcagaga 420 gcactcggaa tttgtcacaa gctgcgagtt aagctttcgg ttagagccaa gtctgaataa 480 atcagacgag ctccgctgtg acttggcggc aacgtggctc gagggcgaac ctcgagatgc 540 atggcttcaa cacgaaaagg aacctgactt ccagaagact tggaaggggt tcaagaagtt 600 ccttcttgac ctcgttgagg atccagttaa tcagggcctc tcgacagtga tcaaatacga 660 
ggaggcgcgc cagcgacctg gacagtctgc gcaaaccttt gcgacctacc tcgagacgtt 720 agagggggag ttgaaggcat acagtgaaga acatcgagtt cagcacttcc tcgcaaagct 780 ccggcccgat ctacgcttga gggtaatcac ttatcctgag attccgcgga cgcgaagcgc 840 ccttatagcg ctcgcctccc ggtttgaaca agccaaccta ccaaagggcc aggaagttga 900 agtcagaagg accacaagga agaaaagagg ggggagaatc ctcacaagag gggttcgagc 960 ggagaaagag cgggagtgaa gcgctcgcgt ctctcagatg aggaagagaa gcggcgaagg 1020 gacaacaacc tctgtttcca ttgtggaaag gagggacact gcactgctca gtgtttctta 1080 aggaacggcg agggtgcgca gcgcatccga gcagtacagc aggggagcca gctaaaaggc 1140 atcctgcaac caaaaaacta gggccctcta tccaaatccc gatggatgga gcggttaccc 1200 aggggaagaa cagcctgtcg attctcgtcg acgtacgagg agtggaagtg tggtgacccc 1260 ttcgtgcgct cgttgacagc ggcacaaccg ccaacctcat atcccactta gtggcaaaag 1320 agctcggggg agtgtgggaa ccaagctaca ccacagcaag gggtgtagga agccaggaaa 1380 tacctctctt tgggaagtcc cagctcaccc tcagcatcaa ggatgatggg ggcaagatgc 1440 aagaaagaga acgcctcttt gcgtctacgg ttatggagga atacgacctc atgttagggt 1500 atccatggct tctccgcgaa aacccagtga ttaattggac gcaggggacg tggagattcc 1560 ctaccgacaa catccaatat ctctcaaacg aggaggcgga agagctgcgg gggaaaggcg 1620 gaccagtgta tgtagcaatg ctacaccgaa tggacgaccc ccaggaaccc cgacctacac 1680 caacagttcc aagggaatac caatcttttg aggacgtctt tgacgatgaa ggggcagcgt 1740 cgctcccaga atactcaaag agggcagagc atgccataga tattgaggag gggaagcaac 1800 ccccgtgggg accaatctac agcctctcga gaaggagttg gcgacgctta ggggttacat 1860 tgaggattcg cttcagaggg gctggatccg tcactcgacg agcccagctg gagcacccat 1920 ccttttcgtc ccaaagccag atggctcgaa gcgcctatgt gtcgattatc gagggcttaa 1980 cgccatgaca gtgaagaatc gacacccaat ccctcttatt tcggaaatac tcgaccgcct 2040 aagcggggcc gcagtgttca cgaagcttga cgcaaaggat gcgtaccagc gtatgagaat 2100 ccgaaagggg gatgagtgga aaactgcctt cctaacaagg tatggcctgt tcgaatattt 2160 agtgatgccg tttgggctgt gcaatgcccc cgcatcattc caggcgtata tcaatgaagc 2220 gctcacaggc cttgtagacg tgatgtgcat tgtctacctt gacgacgtct tgatctattc 2280 gaagcgcaag gaagagcatc cgcaacacgt ccgcactgtc ctcgaacgcc tacggaagtt 2340 taagatctac 
tgcaatctaa agaagtgcaa attctcaaca aaagcaaaag aggttgagtt 2400 tctaggcttc attgtgtcta tgaagggggt ctcaatggac ctatcacggg tcgagacaat 2460 cacagactgg ccggtgccaa caaccttccg agacctacag gtcttcttag gctttgcgaa 2520 cttctaccag aggttcatcg cggactattc acgtattgcg caaccgctca cagaactcct 2580 taaggggagt gagaaaggga agaaggtagg cccgttcctc tggaacgttg aggcagcggg 2640 ggcttttcag acgttgaaag atgctttcac gtccgcccct ctcctccgcc acttcgaccc 2700 agctcagctg ataaggttgg agaccgacgg ctcaggaatg gcgattgcgg gaattcttat 2760 gcagccatat accgacatag cgggggtcac acggtggcac cccattgcct tctattcaag 2820 gaaattgaag gatgcagaga tcaggtatga cacccatgat attgagcttc tcgcgatcgt 2880 cgcggccttt aagcattggc gtcactactg tcaagatagc aatttttccg ttatagtacg 2940 gacggatcat aataatctga tttacttcat gaagaaagta tcaatcaata atcggcaggc 3000 ccactggatc gagttcctag caggtttcga ttttcagatt gaacataggc ctggccgcac 3060 caaccccgcg gacggtccga gtcgcagacc ggactacgtc gaaacgcaac atcaagcatt 3120 tgacgggttg ttacccacac ttcagcagaa gttggcccgt ggttatagtg aagctgtaga 3180 gaacagcgag ggcgatgcgt cgcaaaagag tactcgaatc cgcgccgtct tcgcaaaagc 3240 gggcgagcga acggtaaccc aagatatcct ccacgaagtg gacaagttaa agctgcaccc 3300 ccataagggg cacgcagagg atacgcaagc tttcctgcct cgctgtcacg tgatgcgaga 3360 ggcagctaca gaagcaccgt acgaggagcc atcccttgat cttcgcaagc tcatcgttca 3420 agtccaaagt ggtgacccat tcttgaagga tcgagcgttg cgaatgacaa agaggcgtcc 3480 acggatggaa ggcgtagaga gagagtgggc tcttcatcag gaaagtggag tcctttcgag 3540 ggaaggcatc atcccctttc ctcgagaccc agccctcatt gaagagctcc tccgaatcca 3600 tcatgatgac ccattagcag gtcactttgg tgtggcgaag accacagaac tacttactcg 3660 gaagtataag tggccaaaaa tgagggagga cataaaggaa tacgttaaga cctgtcaaat 3720 ctgccagaaa atggtagtga agcgtcaccg cccgtatggc gagttgcagc cgcttccaca 3780 accaaacggc ccaggagaag agatctccat ggactttata acaggcctcc caccaagcct 3840 tcactcattg acgaattaag gcgtacgatt ccatacttgt agttgtggat cgatacacaa 3900 agtacgcctt atatatcgca acaaataaga cggtgaatgc caataatttg gcaacgctta 3960 tcctccgcta cgtcatcaca gagttcggga ttccaaattg tctccgatag gggatctgta 4020 ttcacaagta gctactggtc 
ttgcctctgt tatttcttaa agattcgcca gcgtttgagc 4080 accgctttcc acccgcagac ggatggacaa actgagcgcc agaatcaaac aatcgaacac 4140 taccttcggt cgtattgcac aagctcccag gacgagtggg tagtactcct acccattgcc 4200 cagttctcat ataataactc cttccattca acaattcaaa caacaccttt ccgggcatta 4260 aaaggctttg atcctcctat gccagatgtg agtgtcgcgg acgcccctca aaagggggac 4320 gcactcgagc cgtttgaaag aatcagaagc tccaggagga aagggaagtg ctggcgcacc 4380 attggctccg agcgacgcag tcgcagacaa agcactataa tctcaaaagg aagccaaagg 4440 aatatagcat gggggaccaa gtcctccttt ccacaaagaa catcaagcta cggaggccaa 4500 acaggaagct tgcggagcag ttcgtaggac ccttcaagat cgtagagatc attgggaagc 4560 aggcctatag gcttgagttg ccttgggaca tggggatcca cccaaccttc cacgtgtcat 4620 tgctagagcc acaccacaag aggcagtaag ctcacgcaca ggtcacggat agtaaacaca 4680 agagtagaaa agccacatag ggcatcaatt cgctattcta gaaccggagc tcaactatct 4740 aatgtggcgc aagcgttatc aggcggcgac atcaccatct tcaacgtcgg agaagtcgtc 4800 gttggagatg tgggagcgag cgggggaggg ctcacggctg acaggccagc ttggcctcca 4860 gctctaggtc gttgtcatct ttgccaggga aagcctctga ctccgcctcg ctatcggagc 4920 tctcatactc ctcgcaggtg gcaacagcag gattgtgctg gtgctaagag aggttagttg 4980 gagaacagga agggaaacgg gagcgacgcg tacctttgcc tgatacatgt cggccttgaa 5040 gcgcgcgaca cgctgaagtg ccttgatgtt gtcggcaaca gcaaggaggg cgaggccgat 5100 gtcgatcttc tcgggcttac tgtccgcacc gaaggcgcgg cggcgacgaa ggagattctt 5160 gccacgctcg acgaactggg tgtacttggc ctggcgggcc ttgaggattt gggagcgggc 5220 cttgatgaca gggacaacca tctcatccac gttccaggaa gccatgaatt gcgcgtgcgc 5280 aaccatcatg caattgaatc tcttgatccc aaacttgggg atctaaaaca gattaggccg 5340 aggacaatca agcgatgcgg gggatgtgca gaggactcac gggcaagctg cgacagtgtt 5400 gatccgagca atactggcat cgggccatgc tgttcgcgcg ggtgcatttc agctccgtct 5460 tgaacttgaa gacgttcttg gagcagcgta gacacattgg gatgacgggc ttcttgcagg 5520 gggcatgcga ggggctgaac tccccaccaa caacggaagg gttcgcgggg cggttgaagc 5580 gctgtggcgt gcccatagcg gggaaggcaa tcaggcaaag agcagctcct gcttggcttg 5640 gggcgccatc ctcacttgcg acggtggccg agcgtgtctg gcgcggggtc gcctcaggcg 5700 cgggacgacg gggaacgagc tcatcattgt 
cctcgacaga tggaaggaca gctttaccct 5760 tctccttggc cttggcagtg ggcttgcggg ggcgagtctt cttgggggcg gtggtctttc 5820 cacctggctt ggaggggcca gcgggagtga gcccggctgg gtccttgttg tcacctttgc 5880 agggaggcat ggtggcggca gcgaagcaca agagagtgtg gagtgtgaag ctgtttgaat 5940 tagaaaaggt tcaagaagac tcaaggagtc cacgtaccaa aaacagagga gctgtgaggg 6000 tgcagcgttg tggtgtggaa gaggttgtgt tgtatcaaaa caaaaggaga agcgaagcca 6060 gaaagtggga aattcaccac agcaaaccaa aacaacaaca acaacgcgta ggcgggtggg 6120 ctccgcttac caagtgtgtc tcgctcaata agcggccacc accacacttt acaatacggg 6180 ctgagcaaat ctcaggattg ggggctcaaa ggggcttgag cctcaaaggg ggggatat 6238 // ID Copia-17_MLP-I repbase; DNA; FNG; 4530 BP. XX AC AECX01001143; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-17_MLP_; KW Copia-17_MLP-LTR; Copia-17_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4530 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001143; Positions 8212 12741. XX CC Positions [2251-2514] - Integrase core CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(79..2247,2251..4401) FT /product="Copia-17_MLP-I_1p" FT /translation="MITRSSGSKKKPSDNNDDTQVPENKTNPDSDSDKDVD FT LLPSKGQEDLDSDIKLKDSSSDIKIEDLENSSVLVSSSKHSDLKTQISQVT FT VSDMSKDPPPHQVSDHYTSMDFFVKMPTLVNKAVATLDENGSNYLHWKNDL FT YVLIDFITKIPDYLDQDRKQTTGDDVITQMIRVSVCETLRLQVDPKESAFT FT VFNRLKSLFHFPTRSTHLSLWKEILQSKVDSPDAVAAHLSKMKAKVEELQR FT TGFTFTKDSFLSIVMQLGLSQQFANVNTILDSRLRTNPDTAISSRETEEAI FT RNETYRLADSAIDLLPNISALNINDRGHNYATRSRVQTHASQSSQRRYQTP FT TQSSGKTYPSKSNFRVGPNTPAHLKNCFDCGGEHWSKTPFCPKSPNYDPEL FT ARQELSQQSSSRGTFRANMADASHGYDASSPLPHSVYSSEASVTEDWNVGE FT AVLDSGATHHVTNEVGALLNYTHLRSPIPLHVATESEGNFLTGIGSLILKS FT DSGASIKIKDVYYCERAKGTLISMAALVEDGYVFSFSGYDLSLSFGSHTLT FT SKYKNRRWLVPLITNPPKPPACQTHVSSCRARVFTKEESNEALRWHKRFGH FT VGLRVIKRLLSKEMTTGLPEHLVNAPFQCPDCLISKSLRHQTMGPSGRERP FT QPLELIVSDIAGPFDGELFGFKYMITFRDLATTFTDIMILKSRDHAPSVFQ FT EFVLKMERQTSFKVKQLRTDGAGFNSATFLNWLKAHGIAKEKTLPYEHNQN FT GVAERSIRTIGDMGRTMLIASKLPKFFWCFAYLTAGFIHNRIPNVNTGDKT FT PFELLFGKKPHLEGLRTFGEEAFVHIPLERRGKLDPRAQKCIFIGYLPGSK FT GWKFFNFETKKVVESSMAVFSDDKEPVLEEIKEPEQSKGNLSHILNALQLG FT SFDAEEAVMQQDALVTSVENSLKNGDPTIPKTYKEAMSRSDADKWRAACEK FT EMDQMREMQVWEVCDLPKDRKVTDAKWVFDIKRDGTYKARYVARGFTQIEG FT VDYHETFAPTATFAALRIIVTVAAMFGWNTFGFDVTAAFLHSPLEEDVWVR FT QPAGFSYGPVSKVLKCLRALYGTKQGMRCWWKFIAKKLEKMGFRPSQFDAS FT FYILRRGSDTCLIWLHVDDGAVTGSSTRLLMEIEGLLKAEIKIKWETSLDS FT IVGVEINRPSPDRFEMSQPKLANRILKDNADLLSVLSAENPLSVSTRLETS FT VNVPPVDGGRYLSIIGSLSYLAVGTRPDLSYSVNFLARYGKTPQEAHWNAL FT KHLLRYLRKTASMGILINPRKTKYSAPMETFVDANWGGEFARSSYGHVTRL FT FGVPVAWVARRQACVATSTCHAEFMAIGAACRDSVWLHSLATDMLPDLPPP FT LLLCDNSSSVHVSKDNAANKRTRHAEREYYYINEQLHKKKVDIKWIPASQQ FT VADIFTKPLGPTKHKEAIKMLGMTGS" XX SQ Sequence 4530 BP; 1326 A; 965 C; 1019 G; 1220 T; 0 other; ggttttgagt cccagttgtg ataccagctg tagtttgttg tagatgaacg gttctgctta 60 aaattttttg aacgatcgat gataaccaga agttcaggtt ctaaaaagaa accctcagat 120 aacaacgacg acacccaagt gcccgagaac aaaaccaatc ccgattccga ctccgataag 180 gacgttgact tgctaccctc caaagggcaa gaagatttag attcagatat 
caagttaaaa 240 gattcaagtt cagatatcaa gatagaagat ttagaaaaca gctcagttct agtttcgagt 300 tcaaaacatt cagatttaaa aacccagatt tcacaagtta ctgtttcaga tatgtctaaa 360 gatccaccac ctcaccaagt ttccgatcat tacacctcga tggacttctt tgttaaaatg 420 cccactctag ttaacaaggc tgttgccacg ctagatgaga atggatcgaa ctatttacac 480 tggaaaaatg acttgtatgt tttaattgat tttataacca agattcccga ttacctagat 540 caagatagaa agcaaactac tggtgatgat gtgataactc agatgataag ggtctcagta 600 tgcgaaacac tcagattaca agtcgacccg aaagaatcag cttttaccgt tttcaatcgc 660 ctaaaatcat tatttcattt tcctactcgt tctactcatc tcagcttatg gaaagagata 720 cttcaatcta aagtcgattc accagatgcg gtggcagctc acctgtcgaa gatgaaagct 780 aaggttgaag aattacaacg caccggtttc acttttacta aggactcctt tcttagcatc 840 gttatgcagc tcggattatc tcaacaattt gccaacgtta acacgatctt agactccaga 900 ttacgcacaa accctgatac agcaatctca tctcgagaga ctgaagaagc aatccgtaac 960 gagacttatc gattggcgga ctcggccata gatctgttac cgaacatctc cgctttaaac 1020 atcaatgatc gaggacataa ttatgccaca agatctagag tacaaacgca tgcaagtcag 1080 tcatctcaaa gacgttacca aactccgact cagagttcag gcaaaacata cccttcaaag 1140 tcgaacttca gggttggacc gaacactcca gctcatctga agaactgttt cgattgcggt 1200 ggtgagcact ggtctaaaac accattctgc cccaagtcac caaactacga tcccgagctc 1260 gctcgacaag aactctctca acaatcgtca tcccgaggta ccttcagagc gaacatggct 1320 gatgcgtctc acggttacga tgcgtcatct ccactacctc actcggtcta cagtagcgag 1380 gcatctgtca ccgaagattg gaacgtggga gaggcggtac ttgactcagg agcgacgcac 1440 catgtcacaa atgaggtggg tgctttgctc aactacactc atcttcgatc cccaatccca 1500 ctgcatgttg caacagaatc tgaaggcaac tttctcacag gaatcgggtc gttaatcctc 1560 aaaagcgact cgggtgcatc tattaaaatc aaggatgtgt attactgtga aagagcgaaa 1620 ggaaccctta tttccatggc agctctggtg gaggatggat atgttttttc tttttccggc 1680 tatgatttat ctttgtcttt tggatctcat acattaacgt cgaaatacaa gaatcgccga 1740 tggctagttc ctctcatcac aaacccacca aaaccacctg cctgtcagac tcatgtttcg 1800 tcttgccgtg cccgagtgtt tacgaaagag gagagcaatg aagctttgag atggcataag 1860 cgcttcggac atgtaggtct tagagtgatc aaaaggctgc taagtaaaga aatgacgaca 1920 
ggacttcccg agcatttagt gaacgcaccg tttcaatgcc ctgactgtct gattagcaag 1980 agcctccgac atcaaacgat gggtccgtcg ggaagagaac ggccccaacc gcttgagtta 2040 attgtttcgg acattgctgg tccttttgat ggagagttat ttggttttaa gtacatgatc 2100 acctttagag acctggctac aacttttacc gacattatga ttttgaaatc aagagatcac 2160 gcaccaagtg tttttcagga atttgttctc aagatggaac gtcagacatc atttaaggtc 2220 aagcaactac gaactgatgg cgccggttaa tttaactcag caacttttct taactggctc 2280 aaagctcatg gcatcgcaaa agaaaaaacc cttccgtatg aacacaatca aaacggtgtt 2340 gctgaacgct ccatcaggac gattggtgac atgggtcgaa ccatgctgat tgcgtcaaaa 2400 ctaccaaagt tcttttggtg tttcgcctac ctaactgctg gctttattca taaccgaatc 2460 ccgaatgtga acaccggcga taaaacacct ttcgaattac tttttgggaa gaaaccacat 2520 cttgaaggct tacgcacgtt tggagaagaa gcttttgttc acataccttt ggagaggcga 2580 ggtaaactgg atccacgagc tcaaaagtgt attttcattg gttacttacc gggcagtaaa 2640 gggtggaaat ttttcaactt tgaaactaag aaggttgttg aatcgtctat ggcggttttc 2700 tctgatgata aagagccggt acttgaggaa atcaaagaac cagaacagtc aaagggtaat 2760 ctcagtcaca tcctgaacgc tttacaactt ggaagtttcg atgccgagga agcggtcatg 2820 caacaagatg ccctggtaac ttctgtagaa aacagtctga agaacggtga tccaactatc 2880 ccaaaaactt acaaggaagc aatgagtcgt agtgatgcag ataagtggag agcggcctgt 2940 gagaaggaga tggaccagat gagagagatg caggtgtggg aagtgtgtga tttgcctaaa 3000 gatcgtaagg tgactgatgc gaagtgggtg tttgatatca agagagatgg aacttacaaa 3060 gccagatatg tggctcgggg gtttactcaa atcgaagggg tcgactacca cgagactttc 3120 gcaccgaccg ctacctttgc agctttgaga ataatcgtaa cggttgcagc aatgtttggc 3180 tggaacacct tcggttttga tgttaccgca gcattccttc atagccctct ggaagaagat 3240 gtttgggtaa gacaaccagc cggtttttct tacggaccgg tctcgaaagt actcaagtgt 3300 ctccgagctc tgtatggaac taagcaagga atgaggtgct ggtggaagtt tatcgcgaaa 3360 aagcttgaaa agatgggttt tcgacctagt caattcgatg ctagtttcta catcctaaga 3420 cgcggatccg atacttgcct catctggctc catgtggacg atggtgctgt aaccggtagt 3480 agcacacgat tgctgatgga gattgagggt ttactcaaag ctgaaatcaa gatcaagtgg 3540 gaaacaagct tagattcaat tgtgggtgtt gaaatcaatc gaccatcacc tgaccgcttt 3600 gaaatgtcac 
aacctaagct tgcaaaccgt atactcaaag acaatgctga tctactatct 3660 gtcctaagtg ctgaaaaccc cctgagtgtg tcaacccgat tagaaacctc tgtgaatgtt 3720 cctccagttg atggggggag atatctttcg ataatcggct cactaagcta tcttgcagta 3780 ggaactcgac cagacctgtc ttactcggtg aatttccttg cgcggtatgg caagacgcct 3840 caagaagctc actggaacgc gttaaaacat cttttacgat atttaaggaa gacagcaagc 3900 atgggtattt taatcaatcc tcgaaagacg aagtactcag caccgatgga gacttttgtg 3960 gatgctaact ggggcgggga atttgctaga tcatcttatg ggcatgtgac gagattgttt 4020 ggggtaccgg tagcgtgggt ggcgagaagg caggcgtgtg tagctacttc aacttgtcat 4080 gcggagttta tggcgatagg agcggcatgt agggattcag tttggttaca ctcattagca 4140 acggacatgt tacctgattt accaccacca ttacttttat gtgataattc atcgtcagta 4200 catgtttcaa aagataatgc ggctaacaag agaacaagac atgcggaaag ggagtactat 4260 tacataaatg aacagcttca caaaaagaaa gtcgacatta aatggatacc agcgagccag 4320 caggttgcag atatatttac taagccctta ggacctacaa aacataaaga agcaatcaag 4380 atgctaggca tgactgggag ctgaagtttt actttttttc atttcatttc atttattttt 4440 tcatttttat attttgagtt ttatttttgt gtgttttctt taggaggatt tttgtctagt 4500 cctggggggg agtgttgaag cctatatgtg 4530 // ID Copia-4_MVPL-LTR repbase; DNA; FNG; 337 BP. XX AC AEIJ01000810; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Microbotryum violaceum genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_MVPL_; KW Copia-4_MVPL-I; Copia-4_MVPL-LTR. XX OS Microbotryum violaceum OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Microbotryales; Microbotryaceae; OC Microbotryum. XX RN [1] RP 1-337 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Microbotryum violaceum genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AEIJ01000810; Positions 5630 5966. 
XX SQ Sequence 337 BP; 72 A; 84 C; 88 G; 93 T; 0 other; tgttgtggga aattagtaaa gggtcagcgg tccagagggg cttgtggcat tgggaagttg 60 gcgcgcgcat ggcgcggcgc cggatggagg cagatggcgg tgtggcgtat tgattggact 120 cagaggtggc ttgcacgcgg atactatcta aggctctggt ctcgttcact ctctccttct 180 tctttaacac acactcgata tctatctcgc ccatttgcct gctacaagtt gattgacctt 240 ggcgatactc tatctcacaa taggtcatct atcaacgccg tgtagctaca gcgaatcttc 300 attcattcat tggtactcca agtccaaatc cacacca 337 // ID TCN1-I repbase; DNA; FNG; 5165 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version 4) XX DE C. neoformans LTR retrotransposon - internal consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; reverse transcriptase; internal portion; KW TCN1-LTR; TCN1-I. XX NM TCN1-I. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-5165 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-5165 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-5165 RA Gentles A. and Jurka J.; RT "C. neoformans non-LTR retrotransposon TCN1."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX CC 501 bp LTR deposited as TCN1-LTR. 
XX FH Key Location/Qualifiers FT CDS 164..1462 FT /product="TCN1-I_1p" FT /translation="MSGPSTRSGRAVEKVEKGKEVEHTLDDDHNPAGSFNT FT NPQSDPFTADNTSNDTQTQLEMMRNHIARLEQQKEELTHKLEESKVERQMF FT DNSEDNEGENEQELDEEDEEPRSSRDLSAQTPYPEVKREYSRQRTSVPSSR FT PQEPKVSQPEYYHGQYTKLSTFITQVTMVITLQPSRFPTETSKVLYAGSFL FT RDTPFLWFQPFVTIDPQPKFMLDFKKFCAELRKNFGDPDEEQTAERQLNTV FT RQQGSVSSYLATFMRYATLVQWNDEAKKACFYKGLKDDIKDELARLPKAKS FT FKNLQDMAIRIDSRRYERVLAKRDQQPKALFNATRSDYTRTSYNNNHPNNF FT RRFSAANSMPIRSTTANTTFNKEVTPAVNLRAAFVPSSARITRRGRLTPEE FT YQRRKDHNLCLYCADKNHQVAKCPVVPSQQSNTTLPSKN" FT CDS 1468..4947 FT /product="TCN1-I_2p" FT /translation="MLLSAKVEEGSREQERTIKPANTDSCEYLQTLEDNNK FT NNKNQLTIDFLFHNNVYQALIDSGASTNFIDKRFVQTFNLKTTKIEDSIPL FT YLFNAAGQRTIIEEEANILVNFQKPFGHTLLRLLITDIGSYPIVLGITWLQ FT EHNPSISWETLSIHPPVSQTTSANLAMVITNDKPPKENTDAEIVPKEYHQY FT LDVFDKKSADTLPEHRSFDHHIPLEEGKNPPFGPIYNLSETELEALREYLD FT ENLKKGFIRPSESPAGAPILFVKKKDGSLRMCVDYRGINKITIKNRYPLPL FT IAELLDRLKSAKVFTKIDLRGAYNLLRIKAGEEWKTAFRTRYGHFEYLVMP FT FGLTNAPASFQHLMNHNFRDLLDIFVIIYLDDILIYSPDLETHQSHVIQVL FT DRLRQTQLYAKASKCEFHQTSVEFLGFVVSDQGLSMDTKKVKSITEWPTPR FT NLRDTQSFLGFCNFYRRFIKNYSSIAKPLIDLTKKDLPFVWEEPQRTSFEA FT LKKSFTSVDLLRHYDPTKQLILETDASDYAIAGILSHEIDKKLEPVAFFSH FT KMLPAELNYPIHDKEMLAIVSAFKEWRHYFEGARETIRVYTDHRSLEYFMT FT TKQLNRRQARWSEFLADFDFNIIYRPGVQGTKPDALTRRHDYHPLEKGSSL FT TTAANPQNFQTLLRPGQYLGTATTGLDRLEISSPIKSLLKTGLETDESAKP FT FLDKANHPSEAHPYTRDDEGLLRYGESFYVPANNELRTLVTKECHDALTSG FT HPGRRKTIQLIRRHYWWPGLKGFVNHYIDSCDLCCRTKTRRHQPYGELKSL FT PIPPYPWSSVSMDLIEQLPPSHGYNTILVIVDRLTKMALFIPTTTSLNAEE FT LAQLYVTHVFSKHGIPTSIVSDRGSEFTSRFWRAFTQLLHIELELSTAFHP FT ETDGQTERVNQVLEQYLRLYTDYKQKEWAPLLPVAEFTYNNTPHSSTTMSP FT FFANKGYHPRASFTPDDNVPIFSPPARASITDLSKLHEHLKIEMSKAQESA FT ALQFDKHRAPLPEYTIGDKVWLSARNIKTKRPTKKLDHRYLGPYTIIARVS FT SHAYRLELPKSMRIHDVFHVQLLEKYIENEIPGRTQVAPSPIEVEGDLEYE FT VECILDHRFYRKRRQFLIKWLGYSAEHNSWEPETALENASEIVDQYKSTHR FT L" XX SQ Sequence 5165 BP; 1597 A; 1480 C; 938 G; 1150 T; 0 other; gtaaatattg atcttccaca cttgtggttg gttggtatag acttcaagta ccttttacct 
60 tcgacacata cgtataccct tatacaactt catcaacaac ttcatcaaca acctcagcca 120 cctataacct cataccacca cttcgtatac catatcctat atcatgtcag gtccatccac 180 tcgtagtggt cgagctgtgg aaaaggtcga aaaaggcaag gaagtagaac ataccctcga 240 tgacgaccac aatcccgcgg gttcattcaa caccaatcct caatccgatc catttaccgc 300 cgacaacaca tcaaacgaca cccaaaccca actcgagatg atgcgcaacc atattgcaag 360 acttgagcaa cagaaggaag agttgacgca taagttggaa gagagcaaag tcgaacgaca 420 gatgtttgat aatagcgaag ataatgaggg cgaaaacgaa caagaattgg atgaagaaga 480 cgaagaacct agatcaagcc gcgacctgtc agcgcaaaca ccatacccag aagttaaaag 540 ggaatacagc cgacaacgca cctcagtacc ctccagtaga cctcaagaac cgaaggtatc 600 ccaacccgag tactaccatg ggcagtatac caagctctca acctttatca ctcaagtgac 660 aatggtgatt accctccaac cttctcgttt ccctaccgag acctccaaag tcctatacgc 720 cggatctttc ctccgagata ccccattctt atggttccaa cccttcgtaa ccatcgatcc 780 ccagcccaag tttatgctgg acttcaagaa attttgtgcc gaattaagga agaacttcgg 840 agatccagac gaagaacaga cagcagaacg acaactaaac actgttcgtc agcaaggttc 900 tgtatcctca tacctcgcaa cctttatgcg ttatgccacc ttggttcagt ggaacgacga 960 agcgaagaag gcttgcttct acaagggctt gaaggatgac atcaaagacg aacttgccag 1020 actacccaaa gccaagtcat tcaagaacct ccaagacatg gcaatccgca tcgacagccg 1080 tcgatatgaa cgggtattag caaagcgaga ccagcaacca aaggcgcttt tcaacgccac 1140 ccgaagcgac tacacccgca cctcttacaa taacaatcac cccaacaact tcagacgttt 1200 ctctgcggcg aatagcatgc cgataaggtc taccactgcc aacaccacct tcaataaaga 1260 ggtgacgccg gcagtcaacc tgagggctgc atttgtccca agttcagcta ggataaccag 1320 acgtggacgt ctgactccag aagaatatca gaggcgaaag gatcataacc tctgcctcta 1380 ttgcgccgac aaaaaccatc aagtcgccaa gtgcccagtg gtcccctcgc aacaatccaa 1440 tactactctc ccttcaaaaa actagatatg ctcttgtctg ccaaagtcga agaaggtagc 1500 cgggaacaag agcgtactat caaaccggcc aatacagact cctgcgaata tctccaaact 1560 ctcgaagata ataacaaaaa caacaaaaac caactcacaa tcgactttct ctttcacaac 1620 aatgtttatc aagctttaat cgattccggt gcctctacaa acttcatcga caaaagattc 1680 gtccagacct ttaacctcaa aaccacgaaa atagaagatt cgatcccatt atacctattc 1740 aacgctgcgg gtcagcgaac 
tataattgaa gaagaagcca acatcctggt caacttccag 1800 aaaccattcg gacacacctt actccgactc ctcataaccg acatcggctc ctaccccatc 1860 gtcttaggta taacctggtt acaagagcac aatccgtcca tcagctggga aacactttcc 1920 atacacccac ctgtatcaca gacgacgagt gccaacttag ccatggtcat caccaatgac 1980 aaacctccaa aagaaaacac cgatgccgaa atagtaccta aagaatacca tcaatatcta 2040 gatgtattcg acaagaaaag cgccgataca ctcccagaac ataggtcttt cgaccaccat 2100 atccctctcg aagaaggaaa gaacccacct tttggtccca tatacaatct ctccgaaaca 2160 gaacttgaag ctctccgcga ataccttgat gagaatctta agaaaggttt tatccgaccg 2220 tccgaatcac cagccggagc acccatactc tttgtcaaaa agaaagacgg atcgcttagg 2280 atgtgtgtcg attaccgggg aatcaacaag atcaccatca agaatcgcta tcctctacca 2340 ttgatcgccg agctcctaga tcgactcaaa tcagccaaag tattcaccaa gatcgacctg 2400 cgaggagcct acaatttact tcgcattaag gcaggcgaag aatggaaaac agctttccgt 2460 actcgctatg ggcatttcga atatttggta atgccgtttg gcctcaccaa tgcccctgca 2520 tccttccaac atctcatgaa ccacaatttc cgcgacttgc tagacatatt tgttatcatc 2580 tacctcgacg acatcctcat ctacagccca gacttggaga ctcaccagtc acacgtcata 2640 caagtcctag atcgcctccg ccaaacccaa ttatatgcca aagcttcaaa gtgcgagttc 2700 catcaaacct cagtagagtt cctaggtttc gttgtcagcg atcaaggtct atcaatggac 2760 accaagaaag taaagtctat cacggaatgg ccgacacctc gcaatctccg tgatacccaa 2820 tccttccttg ggttctgtaa cttctaccga aggttcatca agaactactc tagtatcgcc 2880 aaacctctta tcgacttgac aaagaaggac ttaccctttg tatgggaaga acctcaacga 2940 acatctttcg aagcactcaa aaagagtttc acctctgttg atctcctacg tcattacgat 3000 ccgaccaagc aactcatcct tgaaaccgac gcctccgact atgccatcgc aggtatctta 3060 tcacatgaaa tcgacaagaa actcgaacca gttgctttct tctctcacaa aatgttgcct 3120 gccgagttaa actatcctat tcacgacaaa gaaatgttag caattgtttc agcattcaaa 3180 gaatggcgac attacttcga aggtgctaga gaaaccattc gtgtctacac cgaccacaga 3240 agcctggagt actttatgac taccaagcaa ctcaatcgac gacaggcgcg atggtctgaa 3300 ttcctagccg actttgactt caatatcatc taccgaccag gcgtacaagg cacaaagcct 3360 gacgcactca cccgaagaca tgattatcat ccactcgaga aaggctccag ccttactact 3420 gctgccaatc ctcagaattt ccagactctc 
cttcgccctg gacagtactt gggtactgcc 3480 acaaccggac tcgatcggtt ggaaatatct tcgcccatca agtcattgtt gaaaaccggt 3540 ctagaaaccg atgaatcagc aaaaccattc ttggacaaag ccaaccatcc ctccgaagct 3600 cacccatata ctcgagacga tgaaggactc ctcagatatg gcgaatcatt ctatgtccca 3660 gccaataacg agctacgcac cctcgtcacg aaagaatgcc atgatgcact cactagtggg 3720 catcccggac gacgcaagac tatccaactc atccgacgcc attactggtg gccaggccta 3780 aaaggcttcg tcaatcacta cattgattcc tgcgatcttt gttgcagaac taagacaaga 3840 cgtcatcagc cctatggcga actcaagtct ctacccattc ccccatatcc ctggtcatct 3900 gtatcgatgg acctcattga acaacttccc ccatcacacg gctacaacac catccttgtg 3960 atcgtagacc gactcaccaa gatggctctc tttatcccca caacgactag cctcaacgcc 4020 gaggaactcg cccaattata tgtcacccac gtcttctcca agcatgggat tccgaccagt 4080 attgtatcag atcgtggatc tgaattcaca tcccgctttt ggcgagcatt cacacaactc 4140 ctacacatcg agttagaact cagtacagct tttcacccag aaacagatgg acaaaccgaa 4200 cgagtgaacc aggtcttaga acaatatctg cgcctttata ccgattataa gcaaaaggaa 4260 tgggcaccgc tactcccagt tgcggaattc acttacaaca atacgcccca ttcgtccact 4320 accatgtccc ccttctttgc caacaaaggg taccatccca gggcatcgtt tacccccgat 4380 gacaacgttc ctattttcag cccacctgcc agagcctcca tcaccgactt gagcaagctc 4440 cacgaacacc tcaagataga aatgtccaaa gcacaagaga gtgcagcact acagtttgat 4500 aagcaccgtg ccccacttcc cgaatatact atcggcgaca aagtctggct atctgcccgt 4560 aacatcaaaa cgaaacgacc caccaagaaa ttagatcacc gctatctcgg tccctacacc 4620 attatcgcgc gcgtttcttc ccacgcgtat cgccttgagt tgccgaaatc aatgcgtatc 4680 cacgacgtct tccacgtcca attgcttgag aaatatattg agaatgagat cccagggcga 4740 acacaagtcg caccatcacc tatcgaagtc gaaggtgacc tagaatacga agtcgagtgc 4800 atcctcgatc atcgatttta ccgaaaacgc cgccaattcc ttatcaagtg gctcggctac 4860 agtgccgaac acaacagttg ggaacccgaa accgctctag aaaatgcttc agagattgtt 4920 gatcagtata agtcaacaca ccgattatag gaaaccaagt tcatagtaac agttcagaga 4980 aattattcca aacattgcga aaacattttt agttcataga cttagttcat aaatttttca 5040 aagacaaatt cttcacaacc agttcataga aatttattcc atagctccag caaattttag 5100 ttacaagtac acaacagaac aagccgcaga cggcatgacg 
gagcgaccgt cttcgaaggg 5160 ggggg 5165 // ID Mariner-2_AF repbase; DNA; FNG; 1851 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of Mariner DNA transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Mariner-2_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-1851 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-1851 RA Kapitonov V.V. and Jurka J.; RT "Mariner-2_AF, a family of nonautonomous Mariner DNA transposons RT in the Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 97-97 (2006). XX DR [2] (Consensus) XX CC It is a family of nonautonomous DNA transposon from the Mariner CC superfamily (Tc1 clade). The genome harbors 5 copies that are 97% CC identical to the consensus. The transposase encoding region is CC saturated by many stop codons. 
XX SQ Sequence 1851 BP; 605 A; 382 C; 368 G; 496 T; 0 other; acgtaatcca cggccgagcg gcgaatacga ctgaagtgcg aattttcccc gcaaccacca 60 aaatattcaa ctcgaccatt ttcaacgcgc cattttcttt acttttatat ataaatatgc 120 ctctatctaa ggaagcctgt atgcagatgg ctattactgc atggaaacag caaaaagtca 180 agttaaaact aaaggctgcc cagatattta gtgtccctga atcctccctt cgcaagcagc 240 tatctggagt caaaccacaa acagaaacac gtgcaaatag ccacaaatta acagctactg 300 aagaagagac tcttattaag caactattag atgcagataa gcaaggcttc ttaatttagc 360 cagaattcct gcgtggaata gcacagattt tgctctgtga atgtacacaa gatttaacag 420 cagtccttgg agtaaactga gcttattcct ttacaaaatg tcgccctgaa ctacgtacaa 480 gatataatta aaggattaca taccagaggg caaaacagga ggatcctaag gttatcagac 540 agtggtttga gactgtatgc aaagctattc aagaacacgg catacatgaa gatgatatct 600 ggaactttga cgaaactagc tttgcaatgg gactctgtat aatatctaag gttattactg 660 cagtagaata cagtgagaga cctcgtatag ttatccaagg gaaccgcgaa tgggttacaa 720 tcattaaata tatcagttct aaagggactt ctataccgcc agtggttatc ttaaagggaa 780 aagaacacca ggctgcttag tatcaagaac ctaagcttct tcaagactag ttaattataa 840 ctagtcagaa tggctggaca acagatgaga ttggcctcca ctggttaaaa gttgtgtttg 900 agccttattc taggcgattt ttaactggtg caaagcggtt gcttatcctt gatggccatt 960 ctagctattt aacaccagaa tttaatatat tctgcaagga gaatgcaatc atctgtctct 1020 gcatgccacc acatacttct catcttctcc aacccctaga tattggggtt ttcaggccgc 1080 ttaaacgctc atacagaaaa ctagtggaag gaatgatggt tgctggaaat aaccacatta 1140 ataaggaaga tttcctgcat ctatatccac cagcataaga taaggtgttt aactaagtaa 1200 atatatacaa tagctttaca ggagctggtc taaagccatt aaatgaagag caggtcctca 1260 gaaagatcac tttttaactt tatacaccaa caccactact tgctgaaggc tctatctctt 1320 ctgcctttca gactcctcag aatcctcgct agcttaatca caaggtccgc actatacaga 1380 gaagcatcta aaaacagaag ctatctagca gtctaatggc tcatattcag catctggaaa 1440 aggccgcaca gatagcaatg aatataaatc ttcttctcca agaagaaatc aaggtcctac 1500 gtgctgaaaa tgagcggaag gtaaagaaaa gggtaagaag gcgtgctata gtagggaatg 1560 atgtcctttt atctgtacaa gaaggctaaa attacgttca gcagcttgat acagaggtta 1620 atagccagat taatgagcct acacctatgt 
ctcgtcagcg cgctccgcca acctgtagtg 1680 gatgttggac tattagacat actagaagaa gttgccctaa taaatagcta tctattcaac 1740 tagtagatag tctaatcaat gacactggtg gttgaaaaac tttaatgata ggatgatttg 1800 cggcaaaaat tcgcacttcg gtcgtattcg ccgctcggcc gtggattacg t 1851 // ID Gypsy-22_MLP-I repbase; DNA; FNG; 5541 BP. XX AC AECX01000122; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_MLP_; KW Gypsy-22_MLP-LTR; Gypsy-22_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5541 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000122; Positions 190480 184940. XX CC Positions [4255-4734] - Integrase core CC 'AGCGG' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 2407..3648 FT /product="Gypsy-22_MLP-I_1p" FT /translation="MMQFEDIFPADIPAISDEAEAKGEFVDGSFPEKLQHP FT SSQVRHKIVLTNPNAVINERQYPYPPKHHGAWQKLLDQHLDAGRLRLSTSQ FT YASPSLIIPKKDPSELPRWVCDYRTLNSMTVMDRSPLPKVDKLVRLVATGK FT VFSILDQTNAFFQTQMREADIPLTAVKTPWGLYEWCVMPMGLTNAPATHQA FT LLEEALGDLINTVCVVYLDDIVVFSSNVEEHKRHVQLVLERLRAANLYCSR FT KKTKLFRREVKFLGHVISEEGVRPDEEKVEQIKSWTSPKSSKVVRKFLGTV FT QWMKKFVSGLEKYVGKLTLLTSSKLDPKQFKWGEAEEEAFKNIKNIMTSLP FT CLKNIDYESLDPLWLFTDASGTGLGAALLQGREWRTAHPIAYESRQMLVAE FT RNYPVHEQELLAVIHALQKW" FT CDS 4156..5139 FT /product="Gypsy-22_MLP-I_2p" FT /translation="MSKDMEARLKSCEVCQKTKARTSLPNGRMKTPVMPSA FT PLSDVAVDFIGPLPKINSYDMILTCTCRLLGFTKLIPVNQTNTAKKTASRF FT FGAWMGTLGAPRSVIGDRDKAWASKFWQSLMSKLNVSFHRSPAYHPQADGR FT SERTTKTIGQVLRTFTDKRQTRWLDALPSVELAINSAVNVATGMSPFELIF FT GRPARLFPTTPGNDDSPPTLDAWLRQREGSWAQARDNLWSSRVIQALHHNK FT RHLDVLINTDDWVLLDSGDWRGRHSGGVETLKERYKGPYKVLDTFNEGQSL FT RIDLPPSDKRHNVFNVSKLKLFVEQEELLLGSMDQK" FT CDS join(161..1249,1253..2377) FT /product="Gypsy-22_MLP-I_3p" FT /translation="MTPPVLTRSDSTELTSRVDDPESFIQKPTTLPPHFQQ FT RSIEEAGRYTHNKVLSASVPPRPVPASAKPYTPAIQELPLRASSLLGSFPT FT SPLSILQHEEKRLHHDQDIRSTQHLPSSPLQRVETMTNKADGQPSNPSSTD FT ELLRAMLVAQTARIKAANEAAKRHARLEEERIAAAREAEERIAGLEERLLN FT LQLNAPSPNQPSTTNGGESRGLDLSKFKSSDGPQFKGPYRDTKAFLRWFTS FT LKIFFMVKKVDKDKDRIILTGSYLLETNLQAFWAHNIDNFKSLSWTAFKEN FT LFRAALPSDWNDKLREKILTLRMTINEDFQFFCTRSRSIQLLMNHDQIEID FT DFTLATHMLLGMLPELKSTLLDGPLKKEDFDFLRFEERCDLIYEDLIMKKI FT INRRPLTTTAGRQTSNQSATHQSTGRPTPSSDSTEERKQKFYWKINSYLDS FT LGLCHFCKKQCGSEYKQCTGTFSKEKVTFPPGYEAPPKPPNYTPPCALSAN FT AAGRPTHPPAGRPPRSQVAAVTFPEYDSNTIAAYEEVDRELCEIAESGEEA FT SGPTTEGYVDQPTSQPTVILELTCNGRTFRALADAGSETNLISEKMVDELN FT LKRRKLFKPTIVGLALDSNDQPTTLTEFTTTSLTHLLSDSTFDRVYMKIGN FT LNGSFDMILGAPFLCRHQLAVLCAKKAVVREESGVLLLDYRHEMNLKKKFE FT EECNEKMASIAENHESDKWRDWEKDLRQKNEPCKTLSL" XX SQ Sequence 5541 BP; 1532 A; 1347 C; 1270 G; 1392 T; 0 other; cttttttttt ctcaaaccaa cactggaaat cgaattaccc aacttcagac tacttcagaa 60 gaaccttgtt 
cttgaacgta ccccggattt gcaagaatcc aaccttattc gcattcaatc 120 ctaaccttgt gaattgcctt cattcgattc catatcattt atgaccccgc cggttttgac 180 ccgcagcgac tctactgaat tgacatctcg agtggacgat ccagaaagtt tcatccaaaa 240 gccaaccacc ttacctccgc actttcaaca acgctcaatc gaagaagcgg gaaggtacac 300 ccacaataag gttctgtcgg ccagtgtccc tcccagacca gttccggctt cagctaaacc 360 atacactcca gcaatacagg aactgcctct acgtgcttct tccttactgg ggtcttttcc 420 tacatctcct ctgtctatct tacaacatga agaaaaacgg ttacaccacg atcaagatat 480 cagatccaca caacatcttc cttcttcccc gcttcagaga gtcgaaacta tgactaataa 540 agcggatggg caaccttcca acccgtcatc gactgatgaa ctgttgcgtg ccatgctagt 600 ggcacaaaca gcgaggatca aagcagccaa cgaagctgcc aaacgccacg ctaggttgga 660 agaagagcgt atagcggctg cgcgagaggc agaagagcgg attgctgggc ttgaagagcg 720 tctattgaac ctccagttga acgcaccctc cccaaaccag ccttctacaa ccaatggagg 780 ggagtcccga ggacttgacc tttcaaaatt caagtcatca gatggcccac agttcaaggg 840 cccttatcgc gataccaagg cgtttttaag atggtttact tcgctgaaga tattcttcat 900 ggtcaaaaag gttgataagg acaaggatcg aatcatactc accggctcgt acctcctcga 960 gaccaacctc caagcctttt gggctcacaa tatcgataac tttaagtcgt tatcgtggac 1020 tgctttcaaa gaaaacctat ttcgggcagc tctgccatcc gattggaacg acaaactgcg 1080 agagaagatt ctcactttac ggatgactat caatgaagat ttccaattct tctgcacccg 1140 atcccgctcc atccaattgc tcatgaacca tgaccagatt gaaattgatg atttcacctt 1200 agcaacccac atgttgttag gcatgcttcc cgaactgaag tcaaccctat gattagacgg 1260 gccactgaag aaggaagatt ttgatttcct gcgatttgag gagcgttgcg atttgattta 1320 tgaggacctg atcatgaaga agataatcaa ccgccgaccg ctgacaacaa ctgctggtag 1380 acaaacgagt aaccagagcg ccactcacca atccactggt cgacccacac catcctctga 1440 ttcgactgaa gagcgcaaac agaagtttta ctggaaaatc aactcgtacc tggattcctt 1500 aggcttatgc cacttctgca aaaagcagtg tggtagtgaa tacaagcagt gcacgggaac 1560 tttctctaaa gagaaggtta ccttcccccc cggctatgaa gcaccaccaa aacctcctaa 1620 ctatacccct ccttgcgccc tatcagccaa cgcagcgggt cgtcctaccc acccaccagc 1680 gggccggcca ccgcgttcac aagtagcggc agtgaccttc ccagagtatg actcaaacac 1740 aattgcagca tatgaagagg ttgaccgaga 
gctttgcgaa atagctgaga gcggagagga 1800 agcatcgggg cccacaactg aagggtacgt ggaccaacct acttcacaac ccaccgtcat 1860 cctggagcta acgtgtaatg ggagaacttt ccgggctttg gcggatgcag ggtctgagac 1920 aaatctcatt tcagaaaaga tggttgatga attaaatcta aagcgacgca agctattcaa 1980 gcctacgatt gtgggtctag cactggactc aaatgatcaa ccaacgaccc ttacagagtt 2040 cacaacgaca tcattaacac atttattgtc tgactctact tttgaccgcg tgtacatgaa 2100 gattggtaat cttaacggtt catttgatat gattctgggt gctccattct tgtgtcgtca 2160 tcaattagct gttttgtgtg caaagaaagc tgtagtgcgt gaagagtcag gtgtcttact 2220 gttggactat cgtcatgaaa tgaatttgaa gaaaaagttt gaagaggagt gcaatgagaa 2280 gatggcgtca atagctgaga atcatgagag tgataagtgg agagattggg agaaagatct 2340 gagacagaag aatgagccat gtaaaacact gagcctgtga cgtggaaggt ctttgaagaa 2400 tcactgatga tgcagtttga ggacatcttc ccagcagata taccggctat ttcggatgaa 2460 gctgaggcga agggcgagtt tgtggatggt tcattccctg aaaaattaca acatccgtca 2520 tcacaagtcc gtcataagat agtactgaca aatcctaatg ctgttatcaa tgagcgtcaa 2580 tatccatatc caccgaaaca tcatggtgct tggcaaaagt tattggatca acatctagac 2640 gcgggacggc ttcgactatc aacgagtcaa tatgcgtcac caagcctcat catcccaaag 2700 aaggaccctt cggaacttcc aaggtgggta tgcgattacc gcaccttgaa cagtatgacc 2760 gtcatggacc gttcaccgct cccaaaggtg gacaagttgg tcagactggt tgccacggga 2820 aaagtttttt ccatcttgga ccaaactaat gcttttttcc agactcaaat gagagaggcg 2880 gatattccac tgactgcggt gaagacccct tggggacttt acgaatggtg cgtgatgccc 2940 atgggattga caaacgcccc agccactcac caagctctac tggaagaggc gcttggagat 3000 ctcatcaaca ccgtttgtgt ggtttatctc gacgatatcg tggtattttc ttccaacgtt 3060 gaagaacaca agcgtcatgt tcaactcgta ttggagcgac taagggctgc taacttgtac 3120 tgcagcagaa agaaaacgaa actttttcgc cgggaggtca agttccttgg acatgtcatc 3180 tcagaagaag gggtgaggcc agacgaggag aaagttgaac aaatcaagtc atggacgtcg 3240 ccgaagtcat caaaagtggt gagaaagttt ttggggacag tacaatggat gaagaagttt 3300 gtctcagggt tggagaagta tgtaggtaaa ttgactctgt tgacgagcag caaactagat 3360 ccaaaacaat ttaaatgggg agaagctgaa gaagaagcgt ttaagaacat caagaatatc 3420 atgacttcgt tgccttgtct caagaacatc gactatgaat 
cccttgaccc gctgtggttg 3480 ttcacagacg ctagtgggac gggattaggc gccgcgttgc ttcagggtag agagtggagg 3540 acggcacacc caattgccta cgagtctcgc caaatgttgg tggcagaacg aaactaccca 3600 gtacatgagc aggagctact agcggtaatt cacgcacttc aaaaatggtg aatgatcctg 3660 ttgggtatga aggtcaacgt gatgagtgac caccactcac ttacgtattt actgaaacaa 3720 tgctccctta gcaggcgaca agccaggtgg ttggaacatt tagctgattt taacttagat 3780 tttcaatacg tgaaaggtga ggaaaactcg gtcgccgatg ccttatccag gaaagacgta 3840 gatgatcttc aggtgggtgt tgaacccgta gcggcgttgg cactatcgga gacaacaata 3900 tcagatgaat ttcgacagtc aatcctcgag ggctatgaag ttgacaagtt ctgccaaact 3960 gttaaatcca gcacaccatt aagagaagct gactgctacg ttgatgacgg actagtattc 4020 atagacggtc gacttctgat ccctaacaca acaggcattc gattacgatt gattgatgag 4080 gctcattgct gagtcggaca cctgggttat ctcaagactt tggcgaccct tcgcaccgac 4140 ttcttctggc cacggatgtc taaggacatg gaagcacgac tgaaatcttg cgaggtctgc 4200 cagaaaacca aagctcgcac ctcccttccg aacggccgga tgaaaacgcc ggtcatgccg 4260 agtgcaccgt tgtcagatgt tgcggtcgat ttcattggac ctttgccaaa aatcaattca 4320 tacgatatga ttttgacgtg tacatgccgc cttttgggat tcaccaaact gattccggtg 4380 aaccaaacca atactgccaa aaagacggca tctagattct tcggagcctg gatgggcact 4440 ttgggagcgc ctagatcagt cattggtgat cgtgataaag cctgggcatc caagttctgg 4500 caatcgctca tgagtaagct gaatgtttcc tttcaccgct ctccggctta ccacccccaa 4560 gcagatgggc ggagcgagcg tacgaccaag accatcggac aggtcctgag gaccttcaca 4620 gacaaacgtc agacacggtg gttggatgcc ttaccttcag tcgaattggc gattaatagc 4680 gcggttaatg tcgccacggg aatgtcgcca tttgaactca tttttggacg accagcgcgt 4740 ctgttcccca caacaccagg aaacgatgat tccccaccca cactcgacgc gtggcttcga 4800 caacgcgaag gaagctgggc tcaagctcgg gacaacttgt ggtcgagccg tgttattcaa 4860 gcattacatc acaataaacg ccacctggat gtcctcatca ataccgacga ttgggtactc 4920 ttagactcag gtgattggcg aggcaggcac tcgggaggcg tggaaacgct caaggaacgg 4980 tacaaagggc catacaaagt cctcgacact ttcaatgagg ggcaaagctt acggattgac 5040 ttacctccga gtgacaagcg acacaacgtc tttaatgtgt caaaactgaa actgtttgtg 5100 gaacaagagg agttgctgtt aggttcaatg gaccaaaagt aagttctctc 
cttagtatgc 5160 accgccgggc tctactacga gttttagtca aagaaataca ttaccttggc cacactgtga 5220 gcgtcaaatt tggtccaatt ccctttgcag accttggaca cctcaacgtc aagcgacacc 5280 ggacgatgat ctcaccggct tacacttagt cttcaagcta ctctgttttc ttttggttat 5340 attgtgtttc atttcttttc tctgtttcct tttttttctt ttgattattt caggttcctt 5400 ttcttttcaa ctactcaata gttctttttt tcctttaaga ccttcttttt ctctgattct 5460 caattttcaa ttttttttag tttcctttaa ttactcctca ttccactccc ttgctcaagt 5520 ctcctttata gggcgggaga a 5541 // ID Copia-25_MLP-I repbase; DNA; FNG; 4598 BP. XX AC AECX01001060; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-25_MLP_; KW Copia-25_MLP-LTR; Copia-25_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4598 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001060; Positions 57427 62024. XX CC Positions [1865-2380] - Integrase core CC 'GATCA' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(53..1552,1556..4564) FT /product="Copia-25_MLP-I_1p" FT /translation="MSDHGLVSTSAGNPPSSTHSQSEEEGNNTLLQITDSL FT LQSIRNPSNISPSSNLNNPPPSDSKMSAPLSQPVMQGTNKQLQAVYVNQAL FT SKICPEKPLSDENYFIWSSEVRNGLDSLYYIEYIETDDIKDEPGSIIHEVA FT RKFLTSWLLKQMDATNRSRFEPKITTYTTNGAKTESLPARLWKSVEDHHGS FT RTNETRLLLERSLNLIVQSPSTPLSKHLEFFQLAATKFKNAGGKITNDDLG FT QKLLISLNNAHFDDAKAIADQDITDYDKVVNALKKRMATVAMLRGNRPDRL FT IQHPPVQIMPEASAVSRNRFANKCTKKKCLGINHSPDLCFKKPGNEHMQRE FT WIAERIKLGQWKGEVPSNLPDASASTAVLDEPSIEDLEASFARMSPSASHV FT STVHLRDEVAKNAAENSIIALIDTAASHHMFNDRHSFSLYEDISNRHELLN FT LAGGTVTLPIVGKGSVSFKGPNGDIFKLHDCLHVPELKHSLICGTILIKED FT FITVEGDSFKISQAGKVAFVGHFMKNINLLQVKVSILTRSQPPQASFTDAI FT SLHRRLGHPNVRYMKMMLSEESVIGIGDLKSTVVPDSIDCNPCNLAKGHRI FT SHSYSRERSKSLLENIHIDLSGIIRTPALCGSIYFILFTDDYSSFRDVSGL FT SNKSSEAVFEKIRQYVALVERQCEAKVKNFTIDNGSEFINDLLVPWCEERG FT IYMRTTAAHTPEENGVAERGNRSVKEKSRALMIQANMPIRFWLYAVKASVY FT LLNRTVSSSLPKGKTPYQLKYGSKPDMSHVKTFGCLAYVLIRKALCTGGFG FT HVSRQAILLGHTNHNHNYEVFIIESNTIVLSHDVTFKEHIFPFQKLKPFNV FT SHLSVDDEDELLLTDPSTLAPDNHREQSDNELLPGGEGMHQETEPLVEQQI FT EGNTEDHLNNQLEDLVIDQDSGPRRSERLRQPVTRYTPSANFAFWNDSSSF FT TDEFDLFHYAFSVGPIVRLIHEPATYKLAMKAPDKAEWLTACKKEMSNMKR FT RGVWRLVPRPKDHPVVGSKWHLKIKMNTDNTINKRKARIVAKGYTQTYGVD FT YLDTFAPTGKPVSYRVIIVYATIFGMEVHSMDAVAAFLNGRLKELIYMEQP FT EGFEEGEDGEDLVCLLYQAIYGLKQSAREWNDEFKTKCRAVGFVQSVADEC FT VYIRVRGDNICIFYLHVDDMAITGNDIKAFKEEIKSLWEMEDLGLSTCVVG FT IQTVRKGPHHYAIGQEAYTLSILERFGMMDCKPALTPFPGGTKLRRSTPEE FT SAMFKRLNLPYRSGVGSIMYLAQCTRPDISYAVGCLSQHLENPNKHHWEAF FT QHVLRYLKGTASFCIHYKYDLTLPKELSSNNSWTLPTYFSDADWAGDKSTR FT RSTTGYVFMMAGGAISWRSRLQQTVAKSSTEAEFRAANEAGDEMIWLNVLL FT ESIGLKQQYPQFLNCDSMSAIDVSENVVLHGRTKAIEIHFYWLREKVKDGT FT FQLRHVESEDMLADVLTKPLHPGPFRSFRERIGLRLIE" XX SQ Sequence 4598 BP; 1429 A; 952 C; 983 G; 1234 T; 0 other; ttgtagcgag agatacatat ctcaaactca aagcgatcca tgaaatcatc tcatgagcga 60 tcacggacta gtatcaacat cagctggtaa cccaccatcg tcaactcact ctcaatccga 120 ggaagaaggc aacaatacat tactccagat aactgattct ctactccaat 
ctattagaaa 180 tccatccaac atttccccgt caagtaattt aaacaatcca cctccgtctg attcaaagat 240 gtctgctcca ttgagtcaac cagttatgca aggaacaaat aaacagcttc aagccgtata 300 cgtgaatcaa gctctaagta aaatttgtcc cgaaaaaccc ctatcggacg aaaattattt 360 catctggtca tccgaagtca gaaacggact ggactcattg tactacattg agtacataga 420 aactgatgat attaaagatg aacctggatc tataatccac gaagtcgcta ggaaatttct 480 tacatcctgg ttgcttaaac agatggatgc gacaaacaga tctagatttg aaccgaagat 540 aaccacctac acgactaacg gtgctaaaac tgaaagctta cctgctagac tttggaaatc 600 ggtcgaagac catcacggaa gccgtacgaa tgaaactcga ctactattgg aaagatcact 660 taacttaatt gtacaatctc catccacccc tttatctaaa catctagaat tttttcaatt 720 ggctgccaca aagtttaaga atgctggcgg gaagataacg aatgacgatt taggtcaaaa 780 gcttttgatt tccttgaata atgctcactt tgatgacgct aaggccatag ccgatcaaga 840 catcacggac tatgacaaag tggtgaatgc cttgaagaaa cgaatggcaa ctgttgcgat 900 gctgcggggt aataggccag atcgacttat acaacatcca cccgttcaga tcatgccgga 960 ggctagtgcg gtgtctagaa atcgttttgc aaacaaatgc actaagaaga aatgtttagg 1020 cattaatcac tcaccggatc tatgtttcaa aaagccaggt aatgaacata tgcaacgtga 1080 atggatagct gaaaggatca aactcggtca atggaaagga gaggttcctt ctaaccttcc 1140 cgatgcatct gcctcaaccg ctgttctgga cgaaccatcc attgaagatc ttgaagcttc 1200 gttcgctaga atgtcacctt cagccagcca tgtatcaacg gtgcatcttc gagacgaggt 1260 tgcgaaaaat gcggccgaga actcgattat tgctctgatc gacactgcag catctcatca 1320 catgtttaac gatagacact cattctcatt atacgaagat atatccaatc gtcacgaatt 1380 gctgaacttg gctggtggca cagtaacttt gccaattgta ggaaaaggat ctgtttcatt 1440 caaaggtcct aacggtgata tcttcaaatt gcatgactgt ttgcacgtgc cggaattaaa 1500 gcacagtttg atctgcggaa cgattttgat taaagaggac tttatcacgg tttgagaagg 1560 cgattctttc aagatctctc aagctgggaa agttgcattc gtcggacact tcatgaagaa 1620 catcaattta ttacaagtca aagttagcat attgactcga tctcaaccac ctcaagcctc 1680 tttcactgat gctatttcac tccaccgtcg tttaggccac cctaatgtta gatatatgaa 1740 aatgatgcta tcagaggaga gtgtgattgg tatcggtgac ttgaaatcaa cagttgttcc 1800 tgattctatt gactgcaatc catgtaacct tgcgaaaggt caccgaattt ctcattctta 1860 
ctcacgagag agaagtaaat ctttacttga aaatattcat atcgatctga gcggtatcat 1920 caggacaccg gccttatgtg gaagtatata tttcatactt ttcacggatg attactcaag 1980 ttttagggat gtgtcaggcc tttcaaataa atcatctgag gctgtgtttg agaagatccg 2040 tcaatacgtg gcactagttg aacgccaatg tgaagcaaag gtcaagaatt tcaccatcga 2100 caatggtagt gaatttatca atgatttact ggtgccatgg tgcgaagaac gaggcatcta 2160 tatgcggacc actgcagctc atacccctga ggagaacggg gtagctgagc gtggtaatcg 2220 ttctgtgaag gagaaatcac gcgcattgat gattcaggct aatatgccca ttagattctg 2280 gctttatgca gtcaaggctt cagtctacct tctcaaccgt actgtctctt catccctacc 2340 aaaagggaaa accccttatc aattgaagta tgggtcaaaa ccggatatgt ctcatgttaa 2400 gacatttgga tgtttagctt atgttctcat acgcaaagcc ctttgtactg ggggttttgg 2460 tcacgtttca cgacaagcta tactccttgg ccatactaat cataatcata actatgaagt 2520 tttcatcata gaatcaaaca ccatcgttct ctctcatgat gttaccttca aagaacatat 2580 ttttcctttt cagaaactca agcctttcaa tgtcagccat ctatcagttg acgatgagga 2640 cgaactttta ttaacggatc cttccacttt ggctccagac aatcaccgcg agcaaagtga 2700 caacgaatta ctccctggtg gtgaaggaat gcaccaggaa acagaacctt tggtagaaca 2760 gcaaatcgaa ggcaatactg aagaccatct taataatcaa cttgaggacc ttgtaattga 2820 tcaggactca ggaccacgaa gatcggaacg tcttaggcaa cctgtgactc gatatactcc 2880 atcggcaaac tttgcatttt ggaacgactc atcctctttc accgatgaat tcgatctatt 2940 ccactatgct ttctcagttg gtcctattgt tcgcctaatt cacgaaccag caacatacaa 3000 gttagccatg aaggcacccg ataaagctga atggctgaca gcctgtaaga aagaaatgag 3060 caacatgaag agacggggtg tatggcgtct agttcctcga cctaaagatc acccagtcgt 3120 cggcagcaag tggcacctta aaattaagat gaacacagac aacaccatca acaagagaaa 3180 agctcgaata gtggctaaag gttacacaca aacttacggt gtagattatc tagacacatt 3240 tgctcccact ggcaaaccgg tatcatatcg cgtaatcatt gtttatgcaa cgatctttgg 3300 tatggaagta cactcaatgg acgcggtggc cgctttcttg aacggcaggc ttaaggagct 3360 gatttacatg gaacaaccag aaggattcga ggaaggtgaa gacggagaag atctagtatg 3420 tttattgtat caggctattt atggattgaa gcaatctgca agagagtgga atgacgagtt 3480 taagaccaag tgcagagcag tcggatttgt tcaatcggta gctgatgagt gtgtatacat 3540 tagagtaaga 
ggagataaca tctgcatttt ctacttacat gttgatgata tggcaattac 3600 cggtaatgac atcaaagcat ttaaagaaga gatcaaatca ttatgggaga tggaggattt 3660 aggcttatct acatgcgtag tgggaataca aactgttcga aaaggtcctc atcactatgc 3720 catcggtcag gaagcgtaca ctctatcaat tctggaacga tttggtatga tggactgtaa 3780 accggccttg acaccattcc ctggaggcac caaactacga cgatcaacgc ctgaagaatc 3840 agctatgttt aaaagattga acttgcctta tagaagcggc gttggtagca tcatgtatct 3900 agcgcaatgc acaagaccgg acatttcata cgcggtaggg tgtttatctc aacatctgga 3960 gaaccccaat aaacaccatt gggaggcatt tcaacacgta ctccgttatc ttaaaggcac 4020 agcttctttc tgtattcatt acaaatatga cttaacacta ccaaaagaat tatccagcaa 4080 caatagctgg acattaccaa catatttctc tgacgctgat tgggccgggg ataagagtac 4140 tagaagatcg accactggct atgtgtttat gatggcagga ggagcaataa gttggaggag 4200 tcggttacag cagacggtag caaaatcctc aactgaagcg gaatttagag cggcaaacga 4260 agcaggtgat gaaatgatat ggctaaatgt gttattagaa tctattggat taaaacaaca 4320 atatcctcaa tttttaaact gtgatagcat gagtgctata gatgtgtctg agaatgtggt 4380 gctacatgga aggacaaagg ctatagaaat tcatttttat tggttgaggg agaaggttaa 4440 ggatggtact tttcaattga gacatgtgga gtcggaggat atgctggcgg atgttttaac 4500 aaaaccatta catccaggtc catttagaag ctttagagaa aggataggtc ttaggttaat 4560 tgaatgatga tatttcaaag ctgtcgattg aggggggg 4598 // ID Gypsy-1_AM-I repbase; DNA; FNG; 5547 BP. XX AC ACDU01000137; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Allomyces macrogynus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_AM_; KW Gypsy-1_AM-LTR; Gypsy-1_AM-I. XX OS Allomyces macrogynus OC Eukaryota; Fungi; Blastocladiomycota; Blastocladiomycetes; OC Blastocladiales; Blastocladiaceae; Allomyces. XX RN [1] RP 1-5547 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Allomyces macrogynus genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; ACDU01000137; Positions 20113 14567. 
XX CC Positions [4251-4733] - Integrase core CC 'ATCCA' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 1152..5447 FT /product="Gypsy-1_AM-I_2p" FT /translation="MQQLGLSFGGGIGADGFFQDQDKMQIDAMHGRHPRSG FT NGGGGGSNKQVCKCCSGTGNYTHGSNTSRDDRHCFKCGEVGHLRRNCPQKG FT TAKGDAKVAAVTHDQCKSGSACSTDLSQAANSVVVAPILAEEIEPEQAEKK FT ATVAVENEVQQDDPTADPETDKLLVGEEEKGQGSAEDVEVLAFGADQPYAY FT EAALAPGEGVRKGRTWSTKRKGERQRSKVQQAHDTIVMVDSGASHSVIRRT FT VVERVQARVEGDRWLNVSGFDGATQRVRREWAYLRITVAGVDTVVKCLVLD FT DVAWEVILGRPWLREHNPQLDWTMGALSLDGSTTCVQPVAGGPSHAKVDLI FT SAVDCAAAIRSDPSSVFGVLLSNVEVTNGTYTSDAIVQKLVEEYPTVFAKP FT TGMPPERDIDHRIELKEGARPVAQRAYQASPKELKELRMQLDELLRRGLIR FT PSKGSPFVSPVLFVPKKDGGVRVCVDYRALNKITVRDTYPLPRAETLFRML FT RGARYFSKLDLVSGYHQVRMHEDDVHKTAFVTRYGTYEWRVLSFGLCNAPS FT TFMRVMNQVFEPYLDQFVVVFLDDILVYSKTREDHERHVRQVLELLKKNEF FT HVKPSKCAFFQEKVEYLGHFVDQHGVHVDPAKVAAVRDWPVPKDQKDLHGF FT IGLANYYSRFVNGFAEIACPLTDLLADDVPYVWSAKTDTAFRALKDALSSA FT PVLVLPDEDGEWEITTDASGFAVGAILQQNGRPVEYFSRKLKPAERMYPVH FT DRELLAIVTSLGRWRSYVHGKPVRCNSDHKTLEYFATQRELNAQQVRWLAS FT LAKHDVEIKYQQGERMPHADALSRWPDLRPEGLLVADAPYMPLERTADGTL FT RVAAVPMYMDENGKFSVLRARHVRAKIGDAWIAAAQSAVEGEGATAAQHEI FT RPEVEPDLMRLVAAATAADRRTRAMLEGPPEGYEADSGILYKLVEVHGEVR FT RVPYIPKDNALRALWLTEAHDTPLAGHMGARRTLDLLRRWCYWRGQAVDVE FT YYVRTCVSCQANKPKVGKADGLLRPLTVPDRPWGRINLDLIGKLPDGEGGK FT DAIVTFVCALTKRLRVISATMTTDAEGLARLYVQHVLPHLGLPDCIVSDRD FT PRLASAFWEALWRRLGTRLAMSTAYHPQTDGQAEVANRTVQSMLRHFVNAQ FT RTNWPTYLPFVEFAYNNSRAEPTGKSPFELWGVVPRGPAEVMLRDNIPVPQ FT LERVAEQLATATRDARVSLDRSRRAMEVQANRHRKDIEYLVGDRVWLSTKN FT IAARTASGTELTSRKLLPKFIGPFNVEAVVGRGRACRLGLPARYKIHPTFH FT VDLLKPYHESDQERFPGRPGTENDDTDDEHGWAPAVADEPECVLREGTTGR FT RGDGETLLLIKWRQRSMAESTWERAADVCEDSDAVTKWRKYRAETSNVQVL FT AGVKVGKPSARVLRSLLQ" XX SQ Sequence 5547 BP; 1069 A; 1631 C; 1903 G; 944 T; 0 other; actggtagcg tcacaagttc gactcccttc aaggacgtac gggctggtca aggtcgggga 60 tggcgttcac acgccagcaa gttcccgcgg tctccctcaa gctgggtcct caagcttgag 120 cgtgggcgtt tggtggtttt tttttttttc 
cgaacgtgga cggtgttctc ggtctggtgg 180 cgctcgcggt tcttgttgcg ttgcgcctgg gtgtctgcag acgccggtct tgttccgact 240 ttttctgacc ttcctggcgg gagaaggggg gtgggcacgc ttgtccatcc gttttccgtt 300 tttggcgcgg tacctgtctc tcgtcacttt aacatggcga cagctgagga ccgtctcgcg 360 gccgtcgagg caatgatgcc ggcgattgcc cgcgacggga cgcaacttgt cgagcgcaca 420 acggccctcg agacgaacgt ggcggccgtc cagactggtc tggcgacgac caacgcaatg 480 ctcgagaaga tgatggggat gttacaggcg atgcaagctg ccacggctgg gcaggccacg 540 gtgatgccgg cggctgcaac tgcagcggca acagcatcac tgactgtgcc ggcgccgaaa 600 ccagtgcctg agacctcgca agcagcacca gtgttccagc gtgctgaaaa ggtcctggca 660 cccgagtttg acggcaagcg ctcggcgaca gaggttgcgg cctggtcggc tgatgctgaa 720 aagtggattc tcttcatgcg ggaccacaag cacagcgacc ccatgattgt gctgctgctc 780 aagcgtgcct tcacggccgc agcagagacg taccgtgtgc agcaggtcgt gggtgggttg 840 tggcccacca cgccggaggc gattgtgctc aacgtcaagg agcactttgc gccacgaggg 900 gctgacttca ccaaccgcaa gaagctgcgc gacctgcagt tccgggccag gtcgggcatg 960 acggcccacg tcattgagtt ccagggcctg gcacaggcgg tccactttgc atctgcgtcc 1020 gagctgctgt ttgacttcct caagtcggtc ccaaacagca tcgccatggc gctggtccag 1080 gtcgagggcc tgacgaagat ggccccggcc gactggccgc agtctacgaa aaggccattg 1140 aaatttggga aatgcagcag ttgggcctga gctttggcgg tggcatcggc gccgacgggt 1200 tcttccagga ccaggacaag atgcagattg acgccatgca tgggcgtcac ccacgcagtg 1260 gcaatggcgg tggtggtggc agcaacaagc aggtttgcaa gtgctgctcg ggcacgggca 1320 actacaccca tggcagcaac acgtcgcgcg acgaccgcca ttgcttcaag tgcggcgagg 1380 tggggcactt gcgccgcaac tgcccgcaga agggcacggc caagggagat gccaaggtgg 1440 ccgctgtcac gcacgaccaa tgtaaatctg gttctgcgtg ctcaactgat ctttcccagg 1500 cagccaactc ggtggtcgtt gctcccattc ttgctgagga aattgaacct gagcaagctg 1560 aaaagaaggc aacggttgca gtggaaaatg aagtccaaca agacgacccc actgccgacc 1620 ctgagacgga caaactgctg gtgggggaag aggaaaaggg gcaagggagc gccgaagacg 1680 tggaagtcct ggcgtttggc gctgaccaac catacgcgta cgaagcggct cttgcgcccg 1740 gggaaggggt gcgcaagggc cgcacctggt cgaccaaacg gaagggcgag cgccagcgca 1800 gcaaagtgca gcaggcgcac gacaccatcg tcatggtcga ctcaggcgcg 
tcacacagcg 1860 taatacgacg tactgtggtg gagcgcgtgc aagcgcgagt cgagggcgac aggtggctga 1920 acgtgtcggg gtttgatggg gccacgcagc gagtgcgccg cgagtgggca tacctgcgga 1980 tcacagtcgc aggcgtcgac acggtcgtga aatgcttggt gctcgacgat gtcgcttggg 2040 aggtgatctt gggtcgccca tggctgcgcg agcacaaccc acagctggac tggacgatgg 2100 gcgccttgtc gcttgacggc agcacgacct gcgtgcagcc ggtggcgggc gggccgtcgc 2160 acgccaaggt ggacttgatc tcggcggtgg actgcgccgc ggccatacgc agcgacccgt 2220 cgagcgtgtt cggggtcctg ctgagcaatg tcgaggtcac caacgggacc tacacgtcgg 2280 acgcgattgt gcagaagtta gtagaggagt atccaaccgt gttcgccaag ccgacgggca 2340 tgccgcccga gcgcgacatt gaccaccgca ttgagctcaa ggagggcgcg cggccagtcg 2400 cacagcgcgc ctaccaggcg tcgccgaaag agctcaagga actgcgcatg cagctggacg 2460 agctactgcg ccgcggcctg atccggccga gcaagggctc gccatttgtg tcgccggtgc 2520 tgttcgtgcc gaaaaaggac ggcggcgtcc gcgtgtgcgt cgattaccgc gcactcaaca 2580 agatcacggt gcgcgacacg tacccgttgc cgcgggccga gacgctgttt cgcatgttgc 2640 gcggtgcacg ctacttttcg aagctcgatt tggtcagtgg gtaccaccag gtgcgcatgc 2700 acgaggacga cgtgcacaag acggcgttcg tgacgcgcta cggcacgtat gagtggcgcg 2760 tgctcagctt tggcctgtgc aacgcgccgt cgacattcat gcgcgtcatg aaccaagtgt 2820 tcgagccgta cctggaccag ttcgtggtcg tgttcttgga cgacatcctc gtgtactcga 2880 agacgagaga ggaccacgag cgccacgtgc gccaggtgct cgagctgctg aaaaagaacg 2940 agttccacgt caaaccgagc aagtgtgcat ttttccagga gaaggtcgag tacctgggcc 3000 actttgtcga ccagcacggt gtccacgtcg acccggccaa ggtggccgcg gttcgcgact 3060 ggccagtgcc aaaggaccag aaggacctac atgggttcat tgggctcgcg aattactact 3120 cgcggtttgt gaacgggttc gccgagattg cgtgcccact gactgacttg ctcgcggatg 3180 acgtaccata cgtatggtcg gccaagaccg acacggcatt ccgtgcgctc aaggatgcgc 3240 tgtcctcggc gccggtgctg gtcttgccag acgaggacgg cgagtgggag atcacaaccg 3300 acgcgagcgg gtttgcggtc ggcgcgatcc tgcagcagaa cgggcggccg gtcgagtact 3360 ttagtcgcaa gctcaaacca gctgagcgca tgtacccagt gcatgaccgc gagctgctgg 3420 ccattgtgac ctcacttggc cgctggcggt cgtatgtaca tgggaaacca gtccggtgca 3480 actcggacca caagacgctc gagtattttg cgacgcagcg cgagctaaat gcgcagcagg 
3540 tacgttggtt agcgtcgctg gccaaacacg acgtggaaat caagtaccag cagggcgagc 3600 gcatgccgca cgcggacgcg ctctcgcggt ggccggatct gcggccggag ggtttgctgg 3660 tcgcggatgc accttacatg ccactcgagc gcacggcaga cgggaccttg cgcgtcgcgg 3720 ccgtgcccat gtacatggac gagaacggaa aattttccgt gctgcgcgca cggcacgtgc 3780 gggcgaaaat cggggacgcg tggatcgcag cagcgcaaag cgcagtcgag ggcgaggggg 3840 caacggcagc gcagcacgag atccgcccag aagtggagcc cgacctcatg cggttagtcg 3900 cggcggccac ggccgcggat cggcgcacgc gcgcaatgct cgagggcccg ccggaagggt 3960 acgaggccga cagcgggatc ctgtacaagc tggtcgaggt ccatggagag gtacgccggg 4020 tgccctatat tccaaaggac aatgcgctgc gcgcgctttg gctcaccgag gcccacgaca 4080 cgccgctcgc cggacacatg ggcgcacggc gcacgctcga cctcctgcgc cggtggtgct 4140 actggcgcgg gcaggccgtc gacgtcgagt actacgtgcg gacgtgcgtg agctgccagg 4200 ccaacaaacc caaggtcggc aaggctgacg gcctgctacg gccactcacc gtacctgatc 4260 gaccatgggg ccgcatcaac ctcgacttga ttggcaagct gcccgacggg gagggcggga 4320 aggacgcgat cgtcacgttc gtctgcgcat tgacgaagag gctccgcgtt atttcggcga 4380 ccatgaccac ggacgcagag gggctcgcgc gactgtacgt acagcacgtc ttgccccatc 4440 tcgggctgcc cgactgcatc gtgagcgacc gcgatccgcg gttggcatcg gcattctggg 4500 aagcgttgtg gcggcggttg ggcacgcggt tagccatgtc tacggcgtac catccgcaga 4560 cggacgggca ggcggaggtg gcgaaccgca cggtgcagtc gatgctgcgg cactttgtca 4620 acgcgcagcg caccaactgg ccgacgtacc tgccgtttgt cgagttcgcg tacaacaact 4680 cgcgcgccga gcccacgggc aagtcgccgt tcgagttgtg gggcgtggtg ccacgcggac 4740 cagccgaggt catgctgcgc gacaacattc cggtgccgca gctcgagcgc gtcgcagaac 4800 aactagcaac ggcgacgcgg gacgctcgcg ttagtctgga tcgatcgcgg cgggccatgg 4860 aagtgcaggc caaccggcac aggaaggaca tcgagtatct tgtgggcgac cgcgtctggc 4920 tatcgaccaa gaacattgca gcgcgcacgg catcaggcac cgagctcacg tcgaggaagc 4980 tgctgcccaa gttcattggc ccgttcaatg tcgaggccgt ggtcggccgc ggtcgcgcgt 5040 gccggctcgg actgccggcg cggtacaaga tccacccgac tttccacgtc gatctgttga 5100 agccgtacca cgagtcggac caggagcggt tccctgggcg gccaggcacc gagaacgacg 5160 acacggacga cgagcacggc tgggcgccgg cagtcgcgga cgagcccgag tgcgtgctgc 5220 
gcgagggcac gacgggccgg cgcggcgacg gcgagacgtt gttgctgatc aaatggcggc 5280 agcggtccat ggccgagtcg acatgggagc gcgccgccga cgtgtgtgag gactcggacg 5340 cggtgaccaa gtggcgcaag tatcgcgccg agacgtcgaa cgtgcaggtg ctggcgggcg 5400 tcaaggtcgg caagccgtcg gcgagggtgc tgaggtcgct gttgcagtag acgacgctga 5460 cgtgaaggat caagggcacg acggacacca agacgcgcgc gtcgctcaag tggatgcgcg 5520 ccgcgcctcg aagaggaggg agagagt 5547 // ID TCN760 repbase; DNA; FNG; 5176 BP. XX AC AF542532; AF542532.1; GI:24754014; VERSION; XX DT 08-JUL-2003 (Rel. 8.06, Created) DT 08-JUL-2003 (Rel. 8.06, Last updated, Version 1) XX DE Cryptococcus neoformans var. neoformans strain MMRL760 transposon DE Tcn760, complete sequence. XX KW Tcn760. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-5176 RA Lengeler B.K., Fox S.D., Fraser A.J., Allen A., Forrester K., RA Dietrich S.F. and Heitman J.; RT "Mating-type locus of Cryptococcus neoformans: a step in the RT evolution of sex chromosomes."; RL Eukaryotic Cell 1(5), 704-718 (2002). XX RN [2] RP 1-5176 RA Lengeler B.K., Fox S.D., Fraser A.J., Allen A., Forrester K., RA Dietrich S.F. and Heitman J.; RT "Direct Submission to GenBank."; RL Direct Submission to Genbank (04-SEP-2002)Molecular Genetics and RL Microbiology, Duke University Medical Center, 289 CARL Building, RL Durham, NC 27710, USA. XX DR Genbank; AF542532; Positions 1 5176. 
XX SQ Sequence 5176 BP; 1399 A; 1142 C; 1313 G; 1322 T; 0 other; gcccttatcc gcatgctcgt ctatccaaaa tctaacatga tattttgggc ataatacgtc 60 acaggcagcc aagttatggc gattgctgtt tgttgtttgg ttcgaggggc cataaaactc 120 gggggcacgt caccgccagg gagaggattt ggaggacggt gacggttgga tgagtgtgac 180 tgaggaggta cagttgaaga aggcgggtct ccagggtgtt tagggacagg ggatggaaag 240 aggagtcgtc gggataggtg cgttttagcg gaataaaggc agttaaagca gtttatagtg 300 cagtagaagg tccaggaatc cgtggcatga gaagatacgt ataacagaag aagaaacgta 360 aaggaagttc tagaaacgga aggttattcc acgaacgacg acgaataggg aagctcacgg 420 tgcgattttt tttctaatcg atttttataa aaatcggcga tttagctgtg agtggcccag 480 gtgacgttaa taatttcaaa ccgatttctg gacctcggag ttaggtaaat ctccgtctac 540 aaagagatga ccagaggttc aaatctccag aaaaagcgag aagttccatg tcatgacatc 600 aggcaaaaat cgatagaaaa tcgatgtcaa aaggtcgatt tttgttaata atcctaatta 660 caaaaaaatc gattaaaaaa aaatcgacct tttgacatcg attttctatc gatttttgcc 720 tgatgtcatg acatggaact tcttgtgtct tgcatctttt tctggagatt tgaacctctc 780 cactacgtac gcctactatg aggtcatctc tttcggtctg tgctcatatg tttcaccatc 840 tatatatccc atttgcatgc atttatattc cttaccacaa cctctggaca gcaaagtgca 900 tcatgcaggg ctcgcaagac tcacaagact cccaaactgt agtaggcaag aaagacaatc 960 ccttcaagtg tcctacactc ccgtgagtag aatggtttga aaggcgcctc gaatcaatcg 1020 agatacatat ggaaactcac ggtgggcaat ttactaggtg ggatagggaa ttagagactc 1080 gaatgctcga gatcatcaaa gacgatgatg aaatgcgcaa agtcctgttc cccaggacag 1140 gagacaagac cggtgcgact agcaaagctg ccaagtgtga accgattggt cgtgcactgt 1200 ttgccggtaa agaaggattc gaatttctcg acgatgattg tcagatgact ccctctatac 1260 tcaagggcaa ggtgaaggac ctcagacaat gtgtttggct gaagctaaag aggtgagtgg 1320 tcaagctggt ttggattatg gaaccattat tttgatatcc catgttgacc gcttcgaatt 1380 ataaattggt tttgtaatta ggatgaagga cctttttcgg gccaatgaaa agaaaatggg 1440 tgcaactggg gctggtctct ctaacgaaca cgagatcaat cctgctaatg ataagttgct 1500 cggttcttgg cgtaagtttt tgtgcatact tcttcaagtc gcttgggaag agctaataca 1560 agactcggta gaggaggtaa agcagaaatg tccttggttc tacacgatga aggaccttct 1620 tccggagaaa gataaccctt tggaaggctt 
tcaagcagat gggaccacta ggccccgtac 1680 agatataatc aacgattgca agtgtactcc gttatttcgt ttatatgttt gggttctaat 1740 gtcgatatgt agctatacgt cgtcatcctc gcccagagcc tatcggccag gaggccagtg 1800 acactgtcaa tataggggat gaagccagcc ctgcatcgga tggcgacgat gaagacgctt 1860 ttctttcttt gccttcgacg aatccaggtg tcccagctag ccattctcct tccttgcgcg 1920 aactccttcc ttccgctaat ccccgagatc agtcacctga ctctgagact aggccttctc 1980 cagtccccgc tgctgcaact caccctcctt caacatccgt ggtacaaaag gcgaaggccg 2040 cagaccctag acggagtaat atccgtcaaa tggcttacga acatgtacat gacatgaaaa 2100 aaagtaagga gggtgagagg aaccaagtaa gctaccaatc atgtcttttt caatcaccat 2160 gctaattctc ttcaatgctt gctcaatcag aacgtaagta aagccacgta cgacaagctc 2220 agcaaatgtc gtccaacgct tgccagagcc atcgcacttg taagtgttgt gttttccttg 2280 aaacatatgt cgactgatac tttgtcttta gaatcgggag cagcgggaag agcacttccg 2340 gcaaacgttg gctgtacaga aggagatcgc cgaaagagat gaagaatacc gggagaggcg 2400 agaccaggcg gagcggaagc ggaagcggca aaaggaagaa gtgtacgaag aaatcctcaa 2460 gaggcggctt gagttggaag agaaaaggga ggctcgtctt caagagcagt gggagtctga 2520 gaagagaagg tcgggtttta acgcccgagg tccaggaagc gaatagctgg acctgttgta 2580 catatatctc actccttttt tccttccccg aatgaaccat atcgctgtgt tcggtcttca 2640 agtaacgttc atgtgtcatt aatattttcg cgagatctag cagtccactt gcttctttac 2700 gttccttccc cgatcctaac gtctcaagcg ctcctccaag agctcatacc tgcgaaacat 2760 ccctttgcac ctctcctccg ctcggccagc atcattagag atgtaagggc ggtacaattc 2820 cccctctgct gccagacctc cgtgcgactg gagccttgag accatctcct cgggcacgag 2880 ccttcttgct tcctcctccc tctcacttct gaggcgagac cttccttgcc tatcctcctc 2940 aagcatgcgc tggccctcta ggtggaatac atcctggtca aagtcccaag ttctgtcttg 3000 agactcgtcg tcatatgcga acatatgcaa gacaagggac gactggacgt aaagctgcag 3060 aagttccagc tggcgaaagg atcggatctg gtacctcaca cccttcgcaa actggaaacg 3120 gcctttgtaa aagccgatcg catgctcaga ggagatgcgg cacctcgaca agtaggtgtt 3180 gaacctgctc aagtcgctgt cacgagaggt ttcgctctca gagaaaggaa taatgatatg 3240 ccgacacaag ggaaagcctt ggtctgccca gcaccattcg ccaggtgaaa ggacttcatt 3300 gggttcgcgg gcaagtcgag aactctatgt attaaagggt 
caaggttaca agaaggataa 3360 aggccataac aataatcgcc aatgactggc tacttactgc aaaacaggag gcatcattcc 3420 tactgccgac gaaaccgcta ttgaaatcga caatcctcct gtccgccgtg ttaaagatct 3480 ggaaggtgtt atcgttagct atgcataata tatcggcatg ataaagactg gctaggaggt 3540 tgatggaata gttttttttc ctatcgaaga agctctcccc ttggatagtg ggacggcatg 3600 caaggggaat gagagtgcca tccaccacca gccacccatc cctccattcc ggcccaccct 3660 tttcttccat cttcctcttt gcatcttctc tctccgcaga tcctttagcg ggtaggccga 3720 tggcattcat taaggcgggt gtatccaaaa cagccttcag tacccgcgca gtgctgcgga 3780 caacgtggcc ctgggagaca ccaaaggttt tggagatctg cgcgagcgag gatgcgttgc 3840 ccgaatggcc cattcgatag aggaatagga tgagctgctg atttggggga gcctggggaa 3900 cgttgcttcc gctctggaag acaggatgac cttcgatatg actgagaatg cgggtgaatg 3960 tccagggata tacgcgaaca tcatcgcgga agatttccgg ccttgaatgg taccagtgat 4020 gaaggagtag atcgaagcgg tccgcctgct tcgtaatggg ctctttctct gccacaaacc 4080 tcactgacgt cagatgctcg agataagaat taacagaaga atacagtcca tctacgttct 4140 ctcccatatc ttcgtcgacc agatggataa aggacaagtc ggcgggtgaa ggcgaagagt 4200 taggattggg agagacggga agcttcgcca tcaatatgtc caagtcattt gtttcctcta 4260 cgatttcgga gagaggggaa gaggaaggca agggactgcc ggaggggagt cgaattgaac 4320 ggaggtgagc aatatcttcg gctgcatcga tgtctgcgct ttcttcttgt atgaggaaat 4380 cgattgcttc gatgcatagg agttctcgat tgctagtgcg aggcatggct gatggggata 4440 atgattgctt cgttatagta ttgtgttaga atcgatggtc cggaggatgt agtgcaattt 4500 agtgataaga tcggaagcag cggaaagggt tgcatatagc ttgtagacgg agatttacct 4560 aactccgagg tccagaaatc ggtttgaaat tattaacgtc acctgggtca ctcacagcta 4620 aatcgccgat ttttataaaa atcgattaga aaaaaaatcg caccgtgagc ttccctaatg 4680 aaaccggggc ggcggcgata tccgccatgc atatgaaaaa aaaaaatcca tgccttcagg 4740 cacggtcacg attttttttg gcgacgtttt tcatgtccat ggttcgatgc atgcacgtct 4800 gaaaagggta tagacatgta gagacgattt tcgtctacgt aacggtggcg gttgaatggg 4860 aggttgtggg ggtcacggat gagaacgaag gcgaaaagtg aggaaatttt caaaggtgac 4920 acggggaata ttaatatata aagataagtt agattctgtt tgtatatttt gtctccctga 4980 aaccgaggga ttgatctgct ggatggtaga tatgatgctt ttggaccact 
gtttctttcg 5040 ttcttatttg tctaaatgga ctactttata tttatacatg atgtcaatac catctaaaca 5100 agtcccatac gcttcgttac catctgtccg acgatatgct tcgacctagt cttgcgatat 5160 gatgcgaata ccagct 5176 // ID MOLLY_SN repbase; DNA; FNG; 1862 BP. XX AC AJ488502; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 03-JUL-2007 (Rel. 12.08, Last updated, Version 2) XX DE Stagonospora nodorum DNA transposon MOLLY_SN. XX KW Mariner/Tc1; DNA transposon; Transposable Element; transposase; KW target site duplication; ORFm2; MOLLY_SN. XX NM MOLLY_SN. XX OS Phaeosphaeria nodorum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Phaeosphaeriaceae; Phaeosphaeria. XX RN [1] RP 1-1862 RA Rawson M.J.; RT "Transposable elements in the phytopathogenic fungus Stagonospora RT nodorum."; RL Thesis (2000) Department of Biosciences, University of RL Birmingham, Birmingham, United Kingdom. XX DR Genbank; AJ488502; Positions 3 1864. XX CC TA target site duplication. 64bp terminal inverted repeats. 
XX SQ Sequence 1862 BP; 478 A; 473 C; 506 G; 405 T; 0 other; acgtacctca cgggttggcc ggacacacgg tttggccgga cacttttgcc aagcccccac 60 caaattctac ctctcaacgt gatgcctcaa caacaacacc agatagaccc ttctagcgaa 120 cgtcatatac agactgccct tcaagctctt aaacaagacg cgacactgtc cttgcgacgc 180 gctgcagcta tctataacgt ctctcgagca acactaagcg atcgacgcgc tggacggcct 240 tcacaagcag attgctggcc taaaacaaag aatctaacta agactgagga ggacgtagtt 300 gttaagcata tacttgagct ggttacgcgt ggatttcctc ctaggctcgc agctgtggct 360 gatatggcta attccctgcg cgctgagcgc ggtctgggcc aagttggctc aaactggccc 420 agtacgttcg tcaaacgccg ccctgagctc caaacgaagt ttaatcgcaa atacgactac 480 aagagagccc tctgcgagga tcctgaggtt atacgagact ggttccggct tgtagagaac 540 atgaaggcga agcacggtat ccttgatggc gacatgtaca actttgacga gtctggcttt 600 atgatgggcc agatctcaac tggagcagtc gttacagctt cagagcgacg aggacggccg 660 aagacagtgc aacagggcaa tcgagagtgg acgacggtca tccagggcgt caacgcaaca 720 gggtgggcca ttccaccctt catcatcttc aagggccgcc accacctctc agcttggtat 780 aaggaggagg atctacctca taattgggtt attgcagtct ctaagaacgg ctggacaaca 840 aatgagctcg gtctgcagtg gttaaagcac tttgatgagc atacaaagag gagggttaca 900 ggcgcttatc ggctgcttat tatcgacggc catgagagcc acgactcgct tgaattccag 960 caatactgca aggataacaa gattatcact ctctgcatgc ctcctcactc gtcgcacctc 1020 ctgcagcctc ttgatgtggg ttgttttgcc tctttaaaga aggcgtacag acgccaagcc 1080 gaagagctca tgcgcaaccg gatcacgcac atcacgaaac ttgagttcct accgtgcttt 1140 aagcgcgcct ttgacgcagc aattactcct agtaatatcc aaggagggtt tcgaggcgct 1200 ggattggtcc catttgaccc agagcgggtc atattagccc ttgacgtccg catccgtacc 1260 ccaccgttgc ccaccgtcga agactgtccc tggcagtcgc aaactccaag taataccctt 1320 gaattaggat cgcaatcgac gcttgtaaag gcaaggattc agaggcatat agatagctca 1380 ccaacgtcta tggtggaggc ctttgagaag gtctcaaaag gggcagcgat tattgcgcac 1440 aagctagtgt tggcgcagaa ggagattgct gagcttcgag cagctaataa ggccgccacg 1500 cgacgtaaat cgcacaaaag aaagcgtgta caggaagaag ggaccttgac ggtcgaggac 1560 ggtcttcgac ggacgactct aaaggagttt ggtgcgcgta gtgatgggaa gaaggcgaag 1620 aagcaggttc gcgctggtgc aggcgagccc 
tcccaaaggc ggtgtggacg gtgcaatgag 1680 actgggcata atgcgcgtac gtgtaagaaa acagtagaag tagactctga atgatattgc 1740 atcttgcttt gtactataca gggccaaagt gggttgtttt gcgccagaat agggtggttt 1800 tggtgggggc ttggcaaaag tgtccggcca aaccgtgtgt ccggccaacc cgtgaggtac 1860 gt 1862 // ID Copia-1_PPM-I repbase; DNA; FNG; 4361 BP. XX AC ABWF01000054; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_PPM_; KW Copia-1_PPM-LTR; Copia-1_PPM-I. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-4361 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01000054; Positions 14524 10164. XX CC Positions [1683-2183] - Integrase core CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 189..3863 FT /product="Copia-1_PPM-I_1p" FT /translation="MSHHSSSPSFPKLNASNYKSWAGDMAAWLRSAGLWRL FT VSGAYLKPEAKDPKAPTEAEATLLEKWEVQEAKAAGWLYLMVEADQKVHFS FT NIDSDPVAMWAALKGVHQRPHAAGRWNAWEALLSIRKSDSESLQAVINRVE FT DAMQQCKNLRPDKYTLDDLDSELTVMAMILALPREDFALLRTQLLNGDLKK FT ATVIQAFVTEDAQRRHDSTIASQSALAVTSSSSGPCDFCGASGHSQPQCYS FT FQKAQKQAKERRATRGKGKAATAQEKPAEPAQESAANASALSSHSLEPSSP FT STPLQLDADFTWNADTGATSHMTPHRHWLRNYKPLRLPVKLADNKVVYSAG FT VGSVVFRPRVKGAFVQSVEFSRVLHVPDLRSNLLSCLHLARNKGFSITISS FT DAMAFYQSGTLRFVTPIDHNNAAYLDGEVEPLVEASARIAATLPLDLNLWH FT RRLAHNNYADVQKLVRDELVTGLTLSSKQQPDPVCEPCLAGKMRANPFPPS FT SHVSTMPLELIHSDVHGPLPVCTHSGYRYWITFIDDYSRFRVVILLKAKSE FT AFAAFKRFKAYAENQLGVKIKALQDDKGGEYMFSEFIAFTDSCGIVRRHTV FT RNRPQQNGVAERANGVMSDTATAMLAESSLPPSFWGEAVAAMVHVWNRLPT FT AVRPGTTPYELWHKGKPSVAHLRVWGCTAYVHIQKDQRKGLSPHMQKCVFI FT GYPDGYKGWKFYCPDTRKVVISERAEFDERFFPGLRRTSASLSTPIVDPPT FT SEPVPLFVPALDQGGDNDSVPAAIPQALIAPIPAAAPLPVAPVPVQAPAPP FT APIVPPAPQAAHQPPPEADDLNLPIAIRRGRRHAPVPAPAPKREDSPDPLD FT LFAEEADFVEVQFAGLAAGADPHNFREAMRSPDAKLWQTACNDEILSLIAN FT HTWDAVPLPAGKKAIGCGWVFKVKHNADGSIERYKARLVAKGYSQRPGFDY FT TEVFAPTSRPAALRLVLALTAIEDLELHSVDISSAFLQGDLDEEIYMQQPE FT GFAQGPPGTVLRLGRPLYGLKQAGRMWNKKLHQVLTSMGFKRLESDRSVYI FT YLRDEVRIIIPVHVDDLTLASTSRAAIDKTVEELSQHFKLRNLGPTTFLLG FT IEITRDRKKRTMFLSQRQYILDMLEQYGLTSCKAVLTPMEPGLRLSKQQAC FT QTDEDPSCIWPSPLALTSSMLSPALHASTPVLDLLTGPQSSTSFAICRAPR FT TTSSSTGLTTWVHPL" XX SQ Sequence 4361 BP; 875 A; 1389 C; 1106 G; 991 T; 0 other; actgctccct gcgtcgcctt ttgataggtt atgagccccg cgctctagca gtcaactaca 60 cacactacta gcgcgccaca gagtcaatct cttcgatttt cggaccactc cttgtcccta 120 cactctcaca tctttccctc ttgccacaat ctgcgctttc atcctctgcc actctgcttg 180 tgccacaaat gtcgcaccac tcgtcctctc cctccttccc caagctcaat gcctcgaact 240 acaagtcctg ggctggggac atggctgcct ggctgcgctc tgcaggcctg tggcgccttg 300 tctctggtgc ctacctcaag ccagaagcca aggaccccaa agcccccact gaagcggagg 360 ccacactact ggagaagtgg gaggtgcagg aggcaaaggc tgctgggtgg ttgtacttga 420 tggttgaagc ggaccaaaag 
gtgcattttt ccaacattga cagtgaccct gttgcaatgt 480 gggcggccct caagggtgtc caccagcgtc cccatgctgc tggacgttgg aatgcctggg 540 aggcgctgtt gtccatcagg aagagcgatt ctgagtcgct tcaagcggtt atcaaccgcg 600 ttgaagacgc catgcagcaa tgcaagaatt tgcgccctga caagtacacc ctggacgacc 660 ttgacagtga actcactgtc atggctatga tcctggccct ccctcgcgag gattttgccc 720 ttctgcgcac ccagctgctt aatggcgacc tcaagaaggc cacagtcatc caggcctttg 780 tcactgagga tgcccagcgc cgccatgact ccaccattgc ctcccagtct gcccttgcag 840 tcacctcgtc gtcctctggc ccctgcgact tttgtggcgc atctggccac tcccagccac 900 agtgctactc attccagaag gcccagaagc aagccaagga gcgtcgcgcc accaggggca 960 aaggcaaggc tgccactgcg caggagaagc ctgcagaacc tgcccaggaa tccgctgcaa 1020 atgcaagtgc cctctcctca cactcccttg agccctcttc gccctctaca ccgcttcagc 1080 ttgatgctga cttcacctgg aatgcagaca caggcgccac ttctcatatg acgcctcatc 1140 gccactggct gcgcaattac aagccactgc ggttgcctgt gaagttggct gacaacaagg 1200 ttgtctattc tgctggtgtt gggtcggtcg tcttcaggcc tagggtgaag ggagcatttg 1260 tgcagtctgt ggagttttca agggtgctgc atgtgcctga tctgcgaagc aatctcctgt 1320 cctgtcttca tcttgcgcgc aataaagggt tttcaatcac catttcgtcc gatgccatgg 1380 ccttctatca gtctggcaca ctgcgctttg tcacaccaat tgaccacaac aatgctgcct 1440 accttgatgg cgaagttgag cctcttgttg aggcctcagc acgcattgca gccaccctgc 1500 cactcgacct caacctctgg catcgtcgtc ttgctcacaa taactatgca gacgtccaga 1560 agcttgtcag agacgaactg gtcactggcc taactctcag ctcgaagcag cagcctgatc 1620 ctgtctgtga gccatgccta gcaggcaaaa tgcgagcaaa ccccttccct ccttcatccc 1680 atgtgtccac catgcctctg gagctcatcc acagcgacgt ccatggcccc ctgcctgtct 1740 gcactcactc tggctatcgc tactggatca ccttcatcga tgactactcc agattcagag 1800 ttgtcatcct gctcaaggcc aaaagtgaag cctttgctgc gttcaagcgc ttcaaggcct 1860 atgctgagaa ccagctgggg gtgaagatca aggcgctcca ggacgacaag ggaggggagt 1920 acatgtttag tgagttcatc gcattcacag actcctgtgg cattgtgcgt cgccacactg 1980 tgcgcaacag gccacagcag aatggggttg cagagcgcgc caatggtgtc atgtctgaca 2040 ctgccactgc catgcttgca gagtcctcct tgcctccgtc cttctggggc gaggctgtgg 2100 ctgccatggt ccatgtctgg aatcgtctgc 
ccaccgctgt gcgccctggc accacaccct 2160 atgagctctg gcacaagggc aagcccagtg ttgcgcatct gcgtgtgtgg ggctgcactg 2220 cttatgtgca catccagaag gaccagcgca agggtctgtc gcctcacatg cagaaatgcg 2280 tgttcattgg ctacccagac ggctacaagg gctggaagtt ctactgccct gacacaagga 2340 aggtggtgat ctctgaacgc gccgagtttg atgagcggtt cttccctggt ctgaggcgca 2400 cttctgcctc gctttccact cccattgtcg accctccgac ctctgagcct gtccctctct 2460 ttgtacctgc actggatcaa gggggggaca acgactctgt gcctgctgcc attcctcagg 2520 cccttattgc gccaattcct gcagcggcac ctcttccagt tgctccagta cctgtccaag 2580 ctcctgcacc tccagcaccc attgtgcctc ctgcgccaca agctgcgcac caaccccctc 2640 ctgaagcaga cgacctcaat ctgcccattg ccatccgcag gggaagacga catgcaccag 2700 tccctgctcc tgccccaaag cgcgaagact ctcctgatcc ccttgacctg ttcgctgagg 2760 aagcagattt cgtcgaagtg cagtttgctg ggcttgcagc tggtgctgac cctcacaatt 2820 ttcgcgaggc catgcgctcc cctgatgcca aactgtggca gacagcctgc aacgatgaga 2880 tactgtctct gattgcgaac cacacctggg atgcggtccc tctgccagcc ggaaagaagg 2940 ccattggatg cggctgggtg ttcaaggtga agcacaatgc agacggctcc attgagcggt 3000 acaaggctcg acttgtggcc aagggctact cacagcggcc tggctttgac tacaccgagg 3060 tttttgcgcc cacctctcgc cctgctgccc tgcgccttgt ccttgccctc accgccattg 3120 aggatctgga gctgcactca gtcgacatct cctccgcatt tttgcagggc gacttggatg 3180 aagagatcta tatgcagcaa cctgaaggct ttgcccaagg tccccctggc acagtcctgc 3240 gtctaggccg tcctctctat ggcctcaagc aggctggccg catgtggaac aagaagctgc 3300 accaggtgct cacctccatg ggcttcaagc ggctcgagtc tgaccgcagt gtgtacatct 3360 atctgcgaga tgaagtgcgc attatcatcc ctgtgcatgt ggatgacctc actctggcct 3420 ccacctcacg cgctgccatt gacaagacag tcgaggagct ctctcagcac ttcaagctgc 3480 gcaaccttgg tcccaccaca ttcctgctgg gcattgagat caccagggac cgcaaaaagc 3540 gcaccatgtt cctgtcgcag cggcagtaca tcctggacat gctggagcaa tatggcctca 3600 ccagctgcaa agcagtgttg accccaatgg agcctggcct tcgcctttcc aagcagcaag 3660 cctgtcagac cgatgaggac ccctcatgta tctggccatc accactcgcc ctgacatcca 3720 gtatgctgtc tcctgccttg cacgcttcaa ctcctgtcct ggacctgctc actgggccac 3780 agtcaagcac ctctttcgct atctgcaggg caccaaggac 
cacaagctcg tctacaggcc 3840 tgacgacctg ggtgcaccct ttgtgacctt cacagactct gctcatggtg actgtccgga 3900 ctctggacgc tccaccagtg gctacctggt caaggttggc tcaggagcga tcagctggtc 3960 cagcaaactc cagtccattg tcgcactttc gactacagag gctgagtatg ttgctgcagt 4020 ggatgctggc aaggagatac tctggatgcg caatctcatg gccgagtttg gctacacagt 4080 ggacgctccc tcgcccttgc aaatggacaa ccagtcgagc atcaatgtgt ccaagaatcc 4140 tgagcatcat gggcgcatga agcacctcga cctgcgcact ttctggctca gagacactgt 4200 ggaagctggg cgcatcaagc cctgctacat ccccactgca gacatgcttg cagactgtct 4260 caccaagcca ttgtctgccc ccaaggtcgc attctgtcgc actgggatgg gtattgaggt 4320 gtcctagtct ccctgtggga cttggcagat cagggggggt a 4361 // ID Gypsy-10_LBS-I repbase; DNA; FNG; 10823 BP. XX AC ABFE01000310; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_LBS_; KW Gypsy-10_LBS-LTR; Gypsy-10_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-10823 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000310; Positions 3491 14313. XX CC Positions [4957-5418] - Reverse transcriptase CC Positions [6961-7440] - Integrase core CC 'CCTTG' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 637..8079 FT /product="Gypsy-10_LBS-I_1p" FT /translation="MSSTIESASTQTFRPGADGRYEGSPMARLMGQYRSSS FT DDDISTCQRAANFSSKNYLLKNEIVEVRGVQRSFTLLPSLMRDVTQLVNEL FT QIYLERTSALIPERTSYFKVDPRDTFLSILRESSDASQIQAAWLGLANRLT FT SAQENLLKYEIQYRRPLSGENVAMPTSPISTDVGIYEAMDDLEDVDARMRY FT LYDNVPHLQGQVKNPRMLGDGTPWSEIFSFPSVQTNSPEKGSSKLAPIDES FT TIDERESRNDPLKSKGKRRITDEFTSPPASPRLLNVGYGTPFKSSSQFFTR FT PGGIPLPPAEIQAQQNILVGLGLPRTPAFESIPVVNETRQLPRANPQPQSR FT PSNPFEGRDLPPHMSQTRRDEADYVSTPADRQNNSSNAARNRYRLPPVPEE FT TRSNGGSSQPSENNYRGNNARGRSNHPGGDPGGDDDDDSEGDNNPHRGNGP FT SSRDPRSNGPSGNHPRGGGGGGDSGGGGGGNDPNGGNHQANNPNQPQGNIP FT YGNLVATIRNELKQDQLPVWDGNKDTAIEYFWKVQQLAALEGDIPVALGYW FT LWKSLKENSRIWMWFTTLPFAEQTKMRTHYLHYLKGIKDNYLGRTWQIGMN FT RKYENQSFRQEGYERESPPAFIVRRIMFTRMLVASDDGGPTEVYLVMQKAP FT ISWGPILNLETIRSTSLLYSRATDHELALVHAAKYESSNVVTADNLLYTLR FT KLGISTDRNRPMERSARLASSKDSSSEPGEDVIHEAFLGQLSREECTQEIT FT SDPEVLREAFQVLKKRQRPPPKGGYPYAKNDHVTTKMGRLPPSPCKVCGSD FT NHWDKECPDWSFYEAKQLKSAYRIETNEIEDLEEYYSSVYSILVTERMTLE FT NKQKNVSDFHEAVLQGEETSFSRERKSVESNRGRKQTVFMEEIEDEAWLEY FT KAKDKATTCLMHEVGHEEDEPLVKEAHSVHQTKQPRQKQSYESRPAEEKAS FT HDDIPIPSARPNEVKDPPGEPKDSPNATSSLPPPSKEKLFKIPKARSRPEG FT MSAIGVSVLSTRGFVGGLNNVETDLRLDSCADITLISHEFYEQLTSRPAIK FT QGMRMRLWQLTDKDSQLKGFVRIPIYMTTVEGEILETEAEAYVVPNMTVPI FT LLGEDYQQSYELSVTRNVENGTHISFGRHDHRIRATPVERTKDFGRLRQSA FT YMVGQYVRRQFHRRNKGKRHRRKVKFGIDEKTVRAAADYRLKPHESKPIQV FT EGQLGEDREWLVQKNLLANANDTFFAVPNVLISAAHPWVPIANPTDHPRFI FT RKGEIIGTLVDPATFFDTPKSREELKKFEDSAEIIRTVISLQMDQDSPASN FT PTEEGEEQEEYGPKTAAMPDPTIYSSSQLEELIDVGNLPDHLKEAAWEMLR FT KRIKAFGFDGRLGNLPAKVHIRTVDGQVPIAVPMYHASPQKREIIDEQLNT FT WFEQGVIEPSRSPWSAPVVIAYRNGKPRFFVDYRKLNAATIPDEFPIPRQS FT DILASLSGAQVLSSLDALSGFTQLELAEEDIEKTAFRTHRGLFQFKRLPFG FT LRNGPSIFQRVMQGILSPYLWIFCLVYIDDIVIYSKSYEEHLVHLDKVLEA FT IEKAGITLSPKKCHLFYGSILLLGHKVSRLGLSTHTEKVEAIMKLERPKKI FT SQLQAFLGMIVYFSAFIPYYASICAPLFQLLRKGCKWSWGAEQEHAFQSAK FT DALQSSPVLGHPMEGLPYRLYTDASDEALGCSLQQVQPIKVGDLRGTRTYD FT 
RLKKAYEGGLSPPQLVTALSNKCMDHEFSSEWAPVFDETIVRVERVVAYWS FT RTFKNAETRYSTTEREALAAKEGLVKFQPFIEGEDVLLITDHSALQWARTY FT ENANRRLAAWGAIFSAYAPKLEIIHRAGRVHSNVDPLSRLPRAPPSHTSPL FT ESTEPSIRAKESLDERQDVTSPAVKMAAFSFAAWSIEDCLEEPREVLINVR FT SRNKRFVETQEDHSTTNPIEETTAEDSGALDTLATTAEYWGAVNPPPTIHL FT SMSEDTKEQWRRAYKDDPMFKSIVKDDKFRYDKVQPGMRFLVDHDGMIFFS FT NENYQPRLCVPSTLRNFILREAHENPLESAHAGPERLWQSLSSRFYWKRMK FT VDIVRYCRSCDVCQKTKTPNFSRFGMLIPNPIPSRPYQSISMDFIVNLPWS FT EGFNAIYVVVDRLSKHASFIPTTTGLDSEGFASLFFKHIVCRFGLPESIVT FT DRDPRWTSDFWLSVAKALQTKMSLSSSHHPQHDGQTEVVNKLLTTMMRAFI FT AGKKDQWALWLHLLEFAYNSAIHSSIGTTPFHLLLGFHPRTPLDFLGTKRS FT DDIDSRALGPEAITFLENFAMHRESARSAIAKAQDQQVRSFNKGRKPIPVL FT KKGDKVLVNPHALEWIESKGEGKKLTQRWIGPFEIMQRINPNVYRLRMSNL FT YPGLPVFNYQHLRKYEPSPTEFGDRTSLPETRTKKPEQEEYEVEKIIAERR FT TKKGLEYLVRWEGYSPLYDTWEPKRALTNAPEVVAKWLKSHDVKTLL" FT CDS 8252..10822 FT /product="Gypsy-10_LBS-I_2p" FT /translation="MECLYTDGDGKVPISGRAEMVAGQWIALNASAGRIPS FT TTVFEEALTDRAFLPPDLAARPDAWTWTKRPTWYADRYHWDAWNVIDYFAE FT ENEAVPWYFGEGSTPWTGDDGKFRFDPILRAKAEDSLTKLWLCIESITSQP FT PFVSGTPHPTKFNYLLLSAAWDSAQGARSLAEDARGRVLEYLGFINWWSSS FT VSGWDDVLQRWMVDYIGGFKLRDLKKRGVFVDLPRRWHVLNIGHLLAENVP FT VYYFWQEDNDDYPCFARLSPTILQAYHDACDTLDRSEVSEGDMMGFQDDID FT AIKQYDEFFQLRRTPDHTSSPSFSDIPPNATVYICDFEGWSARLVIDDSLI FT EDYAGRYHFCIETDESDSYVTIWRWKPRRLDMGGSQRAGCWGPGSTTEACR FT GDREIRELFKAVHAPIGKARFDECGRITLVSRLGDVYEDNDISSVEADALD FT GTRLLPRPHWVPASHPPLLVPTPASPLNVASKWIQSMVAPSSHSGSRASSA FT ARLSHSSGERDLRTRSASPDRGRTASRDQLPEGRAAYIDDLRSLGSQYATP FT VSAWVSKKPLEWNVDYLDIRFLLVPDKKAQACLRYWAACSRDVSTMAEVLL FT KAITHGIPFSIGVKVEDFGRFKPEEVSDTDRVVGKPTCVMEPPFAYTAPGA FT LKAYYMSRVNDIIRRPHARILVGLGGPEAWLGRKWGGPELVAQFMDGPSPD FT VYLHRRGYIDSDDENPMFLYTDEMSPQEIDVIFGCIRSDIDRDRSLYPSRD FT ILDDGCFFWTGEWDDRMENMFADLTKEILQGSAKFRTPGMWNDYFRRLNRS FT HRGSKDRLNQLVPATLNKLYAKLLDGFSLDWHKRRLIDIELPEEYRPRRIG FT NRGA" XX SQ Sequence 10823 BP; 2998 A; 2718 C; 2626 G; 2481 T; 0 other; aaggtggaca ctgtgggaat caacgacagc ctggccggcg tgaacactac aacggaagag 60 gaactcagag cggaaataaa agagaaccga tcagaaggat 
ccagacgtag gaaagctatc 120 aggggtgctc ccccttcttc gctaagtcaa cccattcttc ctgatcgatc tcgatcttcc 180 tctccgtcca aatctgctag gactccctcc aacgtaccaa ctagaagttc gtcagcatca 240 agatctagaa tttcgcccgc tacaagctcc tcacgacctt tcttcaactc ggtattctcg 300 ttcgacgaat cccgagacta cgcatcttcg tccagaggaa aaccctcgtt aataggtcga 360 ggacgagcca cgagttatag acacggactt ctgcctagat catcacctac cactaccact 420 cctagttccc gaagctttca gcctcaactc ccctcgcagg aatctcaata tacttcgaat 480 tcctaccgag cctttgaatt caaaacaccg gctcctattt ctccaattac ggagatcagc 540 gcgcaaggag aaatttcctc ctttatttct caacagtccg accaacccct ttcctcgaac 600 tcggaatccc tcaccgctga ggacgatcca accaacatga gttctactat agagtctgct 660 tcaacccaaa cgttccgtcc gggagcggat ggaagatacg agggatctcc catggcgagg 720 ctcatgggtc aatatcggtc ctctagcgac gacgatattt ctacttgtca gcgagcggct 780 aacttctcct caaagaacta cctgctaaaa aacgaaatcg tggaagtgag gggcgtgcaa 840 aggagtttca ctttacttcc ctcactaatg agagatgtta cccagctagt aaacgagtta 900 cagatttact tagaaaggac ttcagccctc attcctgaac gtacctctta ttttaaagtg 960 gatcctaggg atactttctt atccatactt agggaatcat cggatgccag ccaaattcaa 1020 gccgcttggt tgggattagc gaaccgactg acttcggccc aagaaaacct tctaaagtac 1080 gagatacagt accgacgacc cttatctggg gaaaacgtcg ctatgcctac atctcctatt 1140 tctacggacg tgggcattta cgaagctatg gatgacttgg aggatgtaga tgctagaatg 1200 agatatcttt acgacaacgt accgcatctt caaggccagg taaaaaatcc ccgaatgtta 1260 ggagatggca ccccctggag cgaaatattt tccttcccga gcgtgcagac aaatagcccg 1320 gaaaaagggt catcaaaact agctccaatc gacgagagta caatcgacga acgggagtcg 1380 aggaatgacc ccctgaaatc caaaggaaag agacgaataa cggatgaatt tactagtcct 1440 cctgcttcgc cccgattact caacgtcgga tacggcactc ctttcaagtc gagttcacaa 1500 ttttttacga gaccaggagg gatccctttg cccccagcgg aaattcaagc gcagcagaac 1560 atcttagtgg gcttgggttt accgcgtact cctgctttcg agagcattcc tgtcgtcaac 1620 gaaactcgtc aacttccaag agctaacccc cagcctcaat ctcgaccttc taatcctttc 1680 gaaggtagag acttacctcc acatatgagt caaactcgtc gagatgaagc tgattatgtc 1740 tcaactcctg ccgatcgcca aaacaacagc tccaacgcag cacgtaatcg ataccgtctc 1800 
ccaccggttc cggaagaaac gagatctaac ggaggatcct cgcaaccttc ggagaataac 1860 taccgaggaa acaacgctag agggcgtagc aatcatcccg gaggagatcc tggaggagac 1920 gacgacgacg atagtgaagg agacaataac ccgcacaggg gtaacggccc ttccagtaga 1980 gatcctcgca gcaacggtcc ttccgggaat catccccgcg gaggcggcgg aggaggcgac 2040 tctggaggag gaggaggtgg caatgatcct aatggaggga atcatcaagc gaataaccca 2100 aatcagcctc agggtaacat accttatgga aatttagtag ctaccattcg caatgaactc 2160 aaacaagacc aacttccagt ctgggacgga aataaagaca ctgctatcga gtatttctgg 2220 aaagtgcagc aattagccgc attagaaggc gacatacctg tcgctctcgg atattggttg 2280 tggaagagtt taaaagagaa ctcccgaatt tggatgtggt tcaccacttt gcccttcgct 2340 gaacagacaa aaatgagaac tcattaccta cattatttga agggaataaa agacaactac 2400 ctaggacgaa cttggcaaat aggtatgaac aggaagtacg aaaaccagtc cttcagacaa 2460 gagggatacg aaagggaatc tccccccgcc ttcatagttc gtcggataat gttcacgcgc 2520 atgctagttg cttcagatga tggggggcct acagaggtgt atttggttat gcagaaagct 2580 ccgatatcct ggggccctat cttaaatctc gaaacgattc gatcgacttc gttattatat 2640 tcaagagcga cagatcacga actcgcctta gttcacgcgg ctaagtatga atcgtcgaat 2700 gtcgtaaccg cagacaatct gttgtacact cttcggaaac tgggaatctc gaccgatcga 2760 aatcgtccta tggaaagatc ggccagatta gcttcgagca aagactcgag ttctgaacct 2820 ggagaagatg ttattcacga agcctttcta gggcagttga gtagagaaga atgtacgcag 2880 gagattacgt cggatccgga agtcttgagg gaagcctttc aggttttaaa gaagaggcaa 2940 cgaccacccc ccaaaggcgg ttacccatac gcgaaaaatg accacgtaac cactaaaatg 3000 ggccggttgc cgccttcacc ttgcaaggtc tgcggcagtg ataatcactg ggacaaagaa 3060 tgtccggatt ggtcattcta cgaagcgaag caattaaaga gtgcttaccg aattgaaact 3120 aatgaaatcg aagacttgga ggaatattac agtagcgttt actccatttt ggtgacggag 3180 cgaatgactc tcgagaacaa gcaaaagaac gtttcggatt ttcatgaggc agttctacaa 3240 ggagaagaga catctttctc aagagaacgt aagtccgttg aaagtaatcg agggaggaag 3300 caaaccgtct ttatggagga aatagaggat gaggcgtggc tggaatacaa agcgaaagac 3360 aaggcaacta cttgtctgat gcatgaagta ggacatgaag aagacgaacc gttagtgaag 3420 gaagcacact ccgttcacca gactaagcaa cctcgacaaa agcagtcgta cgaatctcgt 3480 cccgctgaag 
aaaaagccag tcacgatgat atccccatac cttcagccag accaaatgaa 3540 gtcaaagacc ctccaggaga acccaaggat tcccctaacg cgacctcctc gcttccccct 3600 ccgtcgaaag agaagttgtt caagatacct aaagctagat ctcgacctga agggatgtca 3660 gcgataggag tttcggtcct ttcgacgaga ggcttcgtgg gaggtctgaa taatgtcgaa 3720 acagacttac ggctggactc ttgtgcagat atcacgttga tctcacacga attctacgaa 3780 caattaacgt cgagacctgc aataaagcaa ggaatgagaa tgcgtttgtg gcaattgaca 3840 gacaaggatt cacaattgaa gggcttcgtg cgtatcccta tatatatgac cacggtggaa 3900 ggcgaaatct tagaaactga agcggaagcg tacgtagttc cgaacatgac agtaccaatc 3960 cttttgggcg aagactacca acaatcttac gaattaagcg ttactcgcaa cgtggagaac 4020 gggactcaca tttcctttgg tagacacgac catcgaattc gagctactcc cgtagaaaga 4080 acaaaggact ttggtcgtct tagacagagc gcctacatgg tcggacaata cgtgagacgt 4140 caatttcacc gaaggaacaa agggaagaga catcggcgca aagtgaagtt cggcattgac 4200 gagaagaccg tgagggctgc agcggactat cgattgaaac cccatgaaag caaacctatc 4260 caggtggaag gtcagttagg ggaggatcgc gaatggttgg tccaaaagaa cttacttgcc 4320 aacgcaaacg atacgttttt cgcagttcct aatgtcctca tctctgcagc gcacccttgg 4380 gtaccaatag cgaatccgac cgatcaccca cggttcatta ggaaagggga gattatcggc 4440 acgctagtcg acccagcgac tttcttcgac accccgaaat ctcgagaaga actgaagaaa 4500 tttgaagact ctgcagaaat catcagaaca gttatatctc tacaaatgga tcaagactcc 4560 cctgcttcaa atcctacgga agagggcgaa gaacaagagg aatatggacc taaaaccgct 4620 gcgatgcctg atcccaccat ctactcgtcg agtcaattag aagagttgat agacgtgggt 4680 aatcttcctg atcatttgaa ggaggccgca tgggagatgc tgagaaaacg cattaaagcg 4740 tttggcttcg atggccgatt gggaaatttg ccagcaaaag tccacatacg aacggtggat 4800 ggtcaagttc caattgcagt acccatgtat catgcctcgc cgcagaaaag ggaaataatc 4860 gacgaacagc tgaacacctg gtttgagcaa ggggtaatcg agccatctag aagtccctgg 4920 agtgcgcccg tagtcatcgc ttatcgtaat ggtaaaccca gattttttgt ggactatcgc 4980 aaattaaacg ctgccacgat acccgatgaa tttcccatcc cacgacaatc agatattcta 5040 gcatctttat caggggctca agttctatct tctctagacg ctctctcagg attcacgcaa 5100 ctggagcttg cagaagaaga tatcgagaag actgctttcc ggacacaccg tggactcttt 5160 cagttcaaac gtctaccttt 
cggattacgc aacgggcctt cgatctttca gagagtgatg 5220 caaggaatac tatctcctta tctctggata ttctgtctcg tgtacatcga cgacattgtt 5280 atctattcca aatcatacga ggaacacttg gtacatttag ataaagtttt ggaagccatt 5340 gaaaaggcgg gaataacact gtccccgaag aagtgtcacc ttttttatgg ctccattctt 5400 cttctcggtc ataaggtatc gcgattaggg ctctcaactc atacagagaa ggtggaagcc 5460 atcatgaaac tcgaacgacc taagaagatt tcgcaactac aagccttctt gggaatgata 5520 gtatacttct cggccttcat accctactat gcatccatat gtgctccctt gtttcagctt 5580 ctacggaaag gatgcaaatg gtcctggggg gcggaacagg aacatgcatt ccaatcagct 5640 aaagacgcct tacaatcaag tcctgtatta ggtcatccta tggaaggact cccttaccgg 5700 ctctatacag acgcttcgga cgaagcgtta ggatgctccc tgcagcaggt tcagcccata 5760 aaggtgggag acttgagagg aactcgcact tacgatcgtc tcaagaaagc atatgaaggg 5820 ggcctgtctc cgccgcaact agtaaccgct ctgagcaaca aatgcatgga tcatgagttc 5880 tccagtgagt gggctccagt tttcgacgaa accatagtcc gagtggaaag ggtggtagca 5940 tattggtcgc gaacgtttaa aaacgccgaa actcggtatt caacgaccga aagggaggct 6000 ctagcggcta aagagggatt agtaaaattc cagcccttca ttgagggcga agacgtcctc 6060 ctcattactg accattcggc gctacaatgg gcgcgcacct atgagaatgc caaccgacga 6120 ttagcggctt ggggggctat tttctcggca tacgcaccga aactagagat tattcatcgc 6180 gccggcaggg ttcactctaa tgtggatcct ctatccagat tgccgagagc accgcctagc 6240 catacttctc cgttggaatc tacggagccc tctattcgag ccaaagaatc actggacgaa 6300 aggcaagatg taacgagtcc agcggtaaag atggcggcgt tctcgtttgc agcctggtca 6360 atcgaggatt gcctagaaga accgagagaa gtcttgatca acgtgaggtc taggaataaa 6420 agatttgttg agactcagga ggaccattcc acgacaaatc ccatcgagga gactacagcc 6480 gaagattccg gggcactgga cacgctagca acaacggcag aatactgggg ggccgtcaat 6540 cctccaccta cgattcatct gtccatgagc gaggatacca aggagcaatg gagaagggct 6600 tacaaggatg accctatgtt caaatcaatt gtcaaggatg acaaattccg ctacgacaag 6660 gtacaacctg gcatgcgatt cttagttgat catgacggca tgatattttt cagtaacgag 6720 aactaccagc cacgattatg cgtaccctct accttgagga atttcattct tcgggaagct 6780 cacgaaaatc ccctggaatc ggcgcatgcg ggtcctgaac ggctatggca atcattgagc 6840 tcaagattct attggaagag aatgaaagtg 
gatatagtga gatactgccg atcatgcgac 6900 gtttgtcaga aaacgaagac acccaacttc tctaggtttg gaatgctcat acctaaccca 6960 atccctagtc ggccttacca gtcaatctca atggacttta tcgttaactt accctggtcc 7020 gaagggttca acgccatcta tgtggtagtc gatcgtttgt cgaaacacgc ctctttcata 7080 ccaacaacta caggcttgga ttcggaagga ttcgcgtcgt tatttttcaa gcatattgtt 7140 tgtcgattcg gccttccgga gagcatcgtt acggacagag atccgagatg gacttctgac 7200 ttctggttaa gcgtagcgaa agcccttcag acgaaaatga gtttatcatc ttctcatcac 7260 ccccaacacg acggtcagac tgaagtggtt aataagctac tgaccaccat gatgcgtgct 7320 ttcatagctg gaaaaaagga tcaatgggcg ttgtggctac atttattgga atttgcgtac 7380 aacagtgcca tccattcgtc cattgggact actccattcc acctccttct cggttttcac 7440 cctcgaactc ctttagactt tttagggact aagcgttcgg atgatatcga tagccgagca 7500 ttgggtccag aagcaataac ctttttggaa aacttcgcca tgcatagaga gagcgcgaga 7560 agcgcgatag cgaaagcgca ggaccagcaa gtcagatcat tcaacaaggg aaggaaacct 7620 atacccgtcc tgaagaaagg ggacaaagta ctagttaacc ctcacgccct agaatggatt 7680 gagtccaagg gagagggaaa gaagcttacc caacgatgga ttggaccttt cgagattatg 7740 cagcgtatca atcccaacgt ttaccgctta cggatgagca acttataccc aggtctgcca 7800 gtgttcaact atcagcactt acggaagtac gaaccttctc ctacggagtt cggagatagg 7860 acatctctgc cggaaactag aacaaaaaaa ccagaacaag aggagtatga ggtcgaaaag 7920 attatagcgg aaagacgcac caaaaagggt ctcgagtatc ttgtacgttg ggagggatac 7980 agtcccctct acgacacttg ggagcccaaa cgagccttga cgaatgcacc tgaagtggtc 8040 gcaaaatggc tgaaaagcca tgacgtcaag acactattgt aagaatagtt cggagttaac 8100 ttatctcgtc cgatcggact gatccgatgg acgactacca tatgccttcc tctttgtctt 8160 tttctccttc ttctttctct tctttctaca ctcaatctcg gacactctac cttcgacatc 8220 gactcagtca cacacccaga cgtcagccat catggaatgc ttatacaccg acggagacgg 8280 caaagttcca atttcgggtc gggcggagat ggtagcagga caatggattg ccttgaacgc 8340 ttcagcaggc agaattccct ccacaactgt attcgaggaa gctctaacag atcgcgcgtt 8400 ccttcccccc gatttagccg ctcgaccaga cgcttggact tggaccaaac gacctacgtg 8460 gtatgcagac aggtatcatt gggatgcatg gaacgtcatc gattatttcg cagaggaaaa 8520 cgaagccgta ccttggtatt tcggagaggg cagtacccca 
tggacgggcg acgacggtaa 8580 atttcgtttt gaccccatcc tacgtgctaa agcggaagat tcactgacga agttatggct 8640 ttgcatcgaa tctatcacat cgcagccgcc atttgtttcc ggcacgcctc acccaacaaa 8700 attcaattat ctactgctgt ccgcagcttg ggactcagct cagggggcta gatcgttagc 8760 tgaagatgcg cggggacgag ttttggaata cctgggtttt atcaactggt ggtcatcttc 8820 agtatccgga tgggacgacg tcctacaacg atggatggtc gattacattg gtggtttcaa 8880 actacgcgat ctcaagaaaa gaggagtctt cgtcgacctg ccgaggcgct ggcatgttct 8940 gaacataggg catctactcg cagagaacgt tcctgtctat tacttctggc aggaagataa 9000 cgacgactac ccctgcttcg cgcggctgtc tccaactata ctgcaagcct accacgacgc 9060 ctgcgacact ctggacaggt cggaggtctc ggagggtgac atgatgggct tccaagacga 9120 catcgatgcc atcaagcaat acgatgagtt ttttcaacta cgaagaacgc ccgatcacac 9180 ctcttcccct tctttctccg acatccctcc caacgcgaca gtatacattt gcgatttcga 9240 ggggtggtca gcgagacttg tgatcgacga ttccctgatc gaggattacg cggggaggta 9300 tcatttctgc atcgagacgg atgagtcaga cagctacgtc accatctggc gctggaaacc 9360 acggcggcta gatatgggag gaagtcagcg cgcaggctgc tggggcccag gatctactac 9420 agaggcatgt cggggagatc gagaaatccg cgagcttttc aaagcagtcc acgctcccat 9480 tggcaaagcg cgttttgacg agtgcggtag aatcacccta gtcagtcgcc tgggggatgt 9540 ctacgaagac aacgacataa gctccgtaga agcagacgcg cttgatggga ctcgtcttct 9600 ccctcgccca cattgggtcc cagcttccca cccaccgctc ttggtgccaa ctccggcttc 9660 cccacttaat gtcgcttcga aatggataca atccatggtc gccccttcta gtcattcagg 9720 ctccagggct tcttcagcag ccagactctc acatagttcc ggcgagcgag atcttcgtac 9780 acgttctgca tctcccgacc gcgggcggac tgcgtctagg gaccaattac ccgaaggacg 9840 agcagcctac atcgacgacc tgcgcagttt gggcagtcag tacgcaacgc cagtcagcgc 9900 gtgggttagc aagaaacccc tggagtggaa tgtcgactac ctcgacatca gattcctgct 9960 ggtacccgat aagaaagctc aggcttgtct ccgttactgg gcggcttgtt ctcgcgacgt 10020 ttccacaatg gcggaagtgc ttctcaaagc tatcactcac ggaattccct ttagcatcgg 10080 cgtcaaagtt gaagattttg gcaggttcaa acccgaggaa gtctccgaca cagaccgtgt 10140 agtagggaaa cccacttgcg tgatggaacc gccgtttgcc tacaccgccc cgggggctct 10200 gaaggcctac tacatgagcc gcgtcaacga tattatccga 
cggcctcacg cgaggatctt 10260 ggtgggacta ggtggcccag aagcttggct tggccgcaaa tggggcggcc cagagctagt 10320 tgctcaattt atggacggac catctccgga cgtctactta cacaggcgag gctatatcga 10380 ctcggacgac gaaaacccta tgtttcttta cactgatgaa atgagtcctc aggagattga 10440 tgtcatcttc ggctgcatcc gcagtgatat tgacagagac cgctccttat atccttcgcg 10500 ggacatcctg gatgacggtt gttttttctg gaccggagag tgggacgatc gcatggaaaa 10560 tatgttcgcc gaccttacaa aggaaatcct gcagggttcc gcaaagttca gaaccccagg 10620 aatgtggaat gattatttcc gtcgtttgaa tcgcagtcac cgaggttcga aggatcgcct 10680 caaccagcta gttccggcaa cgttgaacaa gttgtatgcc aaactgctgg atggtttttc 10740 gttggactgg cacaagcgac gcttgatcga catcgaattg ccagaggaat accgacctcg 10800 tcggatcgga aatagggggg ctc 10823 // ID Gypsy-7_RO-LTR repbase; DNA; FNG; 424 BP. XX AC AACW02000098; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_RO_; KW Gypsy-7_RO-I; Gypsy-7_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-424 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000098; Positions 171317 171740. 
XX SQ Sequence 424 BP; 163 A; 70 C; 44 G; 147 T; 0 other; tgtcgtatcc caacatactt taaacatgat tcatattcaa tttataagaa atcagagtaa 60 tgaatgcttt acaaagattt attacagtct tttctctatt gttcggtcat aactaattcc 120 actcctttaa ctcaatgcaa gcttttaata taactcctga tgcaatgcaa cataaatgaa 180 agatcagatc tacttaagaa ataattataa aacagtccac ttgcataacc aggaacagat 240 cctcacatta aaacttattg acactagatc tacagttata attaattaaa gtatcttgaa 300 aaggatttaa ttaaacctcc atgaaaatag tataaatacc agacattata atattaaata 360 aagatataac ttttttcagt ctcgttgaat tttatagtct tttttattct tacgagataa 420 taca 424 // ID Gypsy-27_LBS-LTR repbase; DNA; FNG; 371 BP. XX AC ABFE01002852; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_LBS_; KW Gypsy-27_LBS-I; Gypsy-27_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-371 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01002852; Positions 81440 81810. XX SQ Sequence 371 BP; 91 A; 119 C; 71 G; 90 T; 0 other; tgtaagaacg tagaatacaa cctccatctt gcgtgacata gtagcaccct aattaccttt 60 gtttgtcaac tcgtatatag tacatcgcat acaaacaaat acaacttcac ttgaaaccac 120 tactaccttt acatgtggat cgggtcacac acgattagtt cggctaggcg acctatcgtc 180 tcccctccgg gataccagag ccccgtgctg ggtgactgca ctcctgtccc aggatcacag 240 ggaactccac actcgactcc gatatccgga gtcacctctc tctctctctc tcgagtacct 300 cgatggctca cccccgtggt cccgcaacct cggactcatc tagaagtcgt gcacggacaa 360 ccggtttcac a 371 // ID Gypsy-54_MLP-LTR repbase; DNA; FNG; 1630 BP. XX AC AECX01001611; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-54_MLP_; KW Gypsy-54_MLP-I; Gypsy-54_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-1630 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001611; Positions 30973 29344. XX SQ Sequence 1630 BP; 554 A; 206 C; 307 G; 563 T; 0 other; tgtggtgatc aatttatggg atcaattaaa aaattcacaa ttaccagttt accagtttat 60 tagcaggatt cagcaggaaa gaaaattcga aatggaggtt tttgatcaat tgaagctctt 120 gtaggcttat gtattggtca ttaggtcaac attacatcat tttttatttc acattatata 180 ttaagatcat tatagtttac ttacaaattt attacaaaaa gaaagaaaga aaaacataag 240 aaaaattcga aattataggt ataaaggttg gaaaatggtt tttaggttag taaaaataaa 300 gttatgggta ttatgaggtt agaatatgaa agaacaacat tatatgatac tatgacaatt 360 ttaatcttca aggttgataa gtagaaacag aagttattag ggaaaaggca ctttttcgaa 420 gttggtaaac aagaggtaca cgtgtagttt ggttgtgtaa tagtggtagt gatggaattt 480 aaggttgcca agaggatttt ccgggatttt taaggctata gatgatgatg tcatattaca 540 tgagcaagaa aaagatagta gtagctgtga aaagtgaaga aaatgggtga aattcgtgga 600 aaatggaggt tttcagctaa ttacatatgt caaatgacgt cagaataggt tggtagatgg 660 ctcattcaac gcagaagagt ggattacata atgtacaaac tttaaatcat tagtaatgga 720 ggtgtgtgta aaaagttata aggcaaataa agaattggtg tgaatctcga atttacagtc 780 acaagtcata tgtaacacta aaaaccttaa cttcacaact caatatctta ctcatcactc 840 atcagaagta ggtcatgtag gtgtcagaaa tgatcagtaa cacgtgggct acaagatgca 900 ctttagtaat caaaaattgg agcacatctg aaaaaattcg taatctcaaa aattggtttt 960 cagctagttt acaaggaaaa aatcacctgt attcatcata agaagtgctg tacatcatat 1020 tattgcgtca ttgatcactt ttagctgtga aaattacatc atttattagt aacaaaacca 1080 ttacattgcg tcatcatagg gtttttaggg aattttgggc ggatttgatt ggttttgagg 1140 
ctgttgacca ttgtaaaagt tctagtacat tgttatatat actttgagag ggttttggga 1200 ctgttgttcc cctttttttc tttatcattt acgccattat tattttgttt atcatcataa 1260 acttatttta atagtacctg ttcacttaaa tacgcccttt attacagtta taaagagaaa 1320 tttactttag ttttatagtc ataaagactt attgggaata atatcaatca gatattgctt 1380 attttttttc agttatcaat tgggaataac actatagcag tgttgatagt aatacaaagt 1440 gtgcttttga agttaaaatt acgtgtgaag ttatatttct ctggtgcctt atcagtactc 1500 aatacaacag agctctttct gattgtgttg ttttagaaga gcagtcctta gacaacttct 1560 actagtcatt ttattagtag attatcctct tccagctctg tataaccctt gttgggttat 1620 aagttctaca 1630 // ID Copia-1_MLP-LTR repbase; DNA; FNG; 817 BP. XX AC AECX01002023; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_MLP_; KW Copia-1_MLP-I; Copia-1_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-817 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002023; Positions 11746 12562. 
XX SQ Sequence 817 BP; 240 A; 138 C; 131 G; 308 T; 0 other; tgattagatt caacggaatc tcatatctcg aaatttacta agtatggaac aagtgtcaac 60 tcatgatatg attgcagatg tattaactaa acctctgatc gaggttccag ctgggatggc 120 gtccaggtca ataatatctt cagatgcatc attatctgaa actgttgtgt aattgaaatc 180 atacccagtg agaggtcaat gatatattga agtttactac gtcaaatgat caattatgta 240 atgggtaatg gaatggaatt ggcaatggtt aaatgcgtca atatcttctc atacgttttg 300 acctaagctc ctactgatgt tttctatgat ttaacttctc tctctcttta tgatttatgt 360 tttctctctt tcttgatttg ttctcgtatc attttctctc atttcattta ttctcaattt 420 ctctatttta cccatcaaag atatctagga aaacaacaac tactagatca ccacgacaca 480 tagtctcgtt gatagtaact gactgcagga ggagtctgct acctatctat tgttttcagg 540 taattagatt ctttgatggt tgctaagaaa agaatataaa acatatgtat taatgtttct 600 tattgtagac atagtaggga tcatccaata ttcctagaag tcacttacac tcaccgagtt 660 cttactagtg ttgatcttat attaggtatt gatcattgtc tacatgtctc ttttaaagtc 720 tgttttattt cttttactaa cgtcttatat tagttgtttg aaatctatat atgtgattgt 780 tgctgcaata aagacataaa gatctcaaaa gctttca 817 // ID Gypsy-5_RO-I repbase; DNA; FNG; 4824 BP. XX AC AACW02000062; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_RO_; KW Gypsy-5_RO-LTR; Gypsy-5_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-4824 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000062; Positions 100897 105720. XX CC Positions [2109-2651] - Reverse transcriptase CC Positions [3759-4256] - Integrase core CC 'CTAT' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 54..4718 FT /product="Gypsy-5_RO-I_1p" FT /translation="MSNGSLSQQQRIFEIEERVHELLNPELKEKFISYRQV FT TTFPEDQRKQLSRLQFTYIDREAEKLATEFTLNNSNQLSTSEIKFKLNKMD FT SNNVNMQDLAQLIATAVASAINNKPENNQNSVRIPIPSTYSGERSAAVINL FT WIQEVERYLSFYSVHPNRWIAYAVTLLRGRSQKWWNHITQKQEEPQTWEKF FT KHDLEYAFKPSYSEQAARDRLANIKQTSSITEYADAFQDILLDLPRVSDDE FT ALDRFVRGLKDKARIHVLTREPRSLEEAIRYSISYDSAQQAGIVLPQQVHE FT QFQNDPMDLSALMHQVNAIIRSNNNNNRYNQQDRRNTRFNKKNIICHWCSK FT PGHVVAECRTRLREVREFEQSKLKQSRFKGNQRSFGQNQVYHADLIEMDTS FT NNRTITKQTSDSSFFDSVNKDSLIDLSPYSFDFSNDRNFLMNATSRNSPLP FT TYEVFIQGQPYYALIDSGASANYIHPKALKNADAFRSINNQAVETANGEQT FT MITGEATCTMKIEGEGKNFIDTFKAFVFESKFDIILGNEWLKRIKPKPDWF FT ESAWSITLPDLSTVIMKPNDFKSKVEAKQENINTIVSAKQLNRLFKKNQIE FT ECYLIHASISESGVTYLNHINETDINWATEFSKEFPDVFKGHISGLPPVRD FT TQEIIVTKPDAVPVSRPPYKMSPLELTELRKQLDELLQKGLIEPCASEWSN FT PVLFVRKPNGDLRMCCDYRMLNKVTLKQKIQLPRIDECLERLHKAKYFTYL FT DLTNGFHQQRLSETDSIKTAISTRYGQFCWKVVPFGLSNSGPAFQKMMNSI FT LADYIDKFVMVYLDDILIFSTGDEEQHKKHVRLVLKKLDEAKLIINKKKCR FT FNRKELTFLGFNISAEGIKPAPEKVKAVFDWPTPTNVQQVRQFIGLAQHYR FT RFISGFAGIAAPLTNLTKGSGPKRRAITWTEDCQKSFDLIKRKLSSAPVLM FT TPDMTKPFRIECDASDFAIGAVLLQEEENGVWKPLAFESKKLSQAERNYPA FT QERELLSILHALRTWRCFIEGNEYQVFTDHLPLKYLRSQNKPTPRLVRWLS FT EIELYDPDILYKPGTENHVPDLLSRRDGVEAKPEKESMQPRYLYHISAAVK FT SLSILDQDPIQDWPLHYLSSPERWPNKIKNELDKLKHHFVIRDNQVYRKEK FT IPKNEDYVELKFIPFVRRADMVDNFHVGYGHSGQTNVYSLMKSRVWWPKMQ FT NDINAWISKCPQCQLAGPADKIKHHAPMQPLEVPPAFSRWHLDFIGELPST FT KNGNRWILMAVDYTTNWPIARALQNATGEEIVKFIYEEIVLRFGCPNEILT FT DRGANFMSKVVKQYISRIKTKHNLTSAFHPRSNGKCERLNQLFKKMLTKYV FT NGNVHSWDDYMDAALFACRIRKHATTGFSPFFLTYGVEPRIPGDHHRPFMN FT EFIEQDQEIMSQDVMVHLRKLREARYTAEDRLRKQAEIDKSRWDSLLKDKI FT QIFEIGDYVLLRHESKKGLEFNWMGPYKVIKRNLDFNTYQIQEINGKTYQS FT WVHTDRLHPVKYVGTSIDKSWYIPRMARIDTDAKLNNQH" XX SQ Sequence 4824 BP; 1736 A; 841 C; 896 G; 1351 T; 0 other; attggtagca tctgttgtag tcaaacaatt tacaacagat ataatataag tttatgtcaa 60 acggcagttt atcacaacag caacgaatat ttgaaataga agaaagagta cacgaattac 120 ttaatcctga acttaaagaa 
aaatttatat cctatcgtca agttacgaca tttccagaag 180 atcaaagaaa acaattaagt agattacaat tcacatatat agacagagaa gcagagaaat 240 tagcaactga atttacactc aacaatagta atcaattatc gacaagcgaa ataaaattca 300 aattaaacaa gatggattcc aacaacgtaa atatgcagga tctagcacag ttgatcgcta 360 ctgctgttgc atcagcaata aacaataaac cagaaaacaa tcagaatagt gtccgtattc 420 ccattccttc tacatacagt ggtgaaagaa gtgctgcagt aattaactta tggatacaag 480 aagtcgagcg ttatctcagt ttctacagtg tacatccaaa tcgttggatt gcatatgcag 540 taactctact tagaggaaga tcacaaaaat ggtggaatca cataactcag aaacaagaag 600 aaccacaaac atgggagaag tttaaacatg atttagagta cgccttcaaa ccttcatatt 660 ctgaacaagc tgcaagagat agattagcaa atatcaagca aacctcatcc ataacagaat 720 atgctgatgc atttcaggat atactattgg atttaccaag agtttctgat gatgaagcat 780 tagaccgatt tgttagagga ttaaaggata aagcaaggat acatgttttg acaagagaac 840 cccgatcact tgaagaagct atcaggtatt cgatatcata cgatagtgca caacaagctg 900 gaatagttct tcctcaacaa gtacacgaac aatttcaaaa tgatccaatg gatttatctg 960 ctttaatgca tcaagtaaat gcaataataa gatccaacaa taataacaat cgttataatc 1020 aacaagacag aagaaatact agattcaaca agaaaaacat catttgtcac tggtgtagca 1080 aacctggaca tgttgtagca gaatgtagaa caagattaag agaagtcaga gagtttgagc 1140 aaagtaaatt gaaacaaagt cgattcaaag gtaaccagag atcatttggt caaaatcaag 1200 tgtaccatgc ggacctgatt gaaatggata cgagtaataa tcgtactata accaaacaaa 1260 ctagtgattc gtcctttttt gatagtgtca ataaagattc tttaattgat ctttcccctt 1320 actcctttga cttttctaat gatcgaaact ttctgatgaa tgcaacttct cgtaactctc 1380 cactgcctac ttatgaagtt ttcattcaag gacaaccgta ctatgcatta atcgatagtg 1440 gtgccagtgc caattatatt catcctaagg ccctcaagaa tgcagatgcc tttagatcaa 1500 tcaacaatca agcggtagaa actgcaaatg gggaacaaac gatgataact ggtgaagcta 1560 catgtacaat gaaaatcgaa ggggagggta agaactttat tgacacattc aaggcgtttg 1620 tatttgaatc aaagtttgat atcatccttg gcaatgaatg gctaaaaaga ataaaaccta 1680 aaccagattg gtttgaaagt gcttggtcaa ttactttacc tgatctctca acggtgataa 1740 tgaaacccaa tgacttcaaa agtaaagttg aagcgaaaca agaaaatatc aatacgatcg 1800 tctcagccaa acaactaaat aggctattca agaaaaatca 
gatagaagag tgctacttga 1860 tacatgcttc aattagtgaa tcaggagtaa catatttgaa tcatataaat gaaactgaca 1920 taaattgggc tacagagttt agtaaggaat tccctgatgt atttaaaggt catatctctg 1980 gtctgccgcc agttagggac actcaagaaa ttatagtcac taaacctgat gctgtacctg 2040 tttcaagacc tccatacaaa atgtcgcctt tggaactaac tgaactacgc aaacaattgg 2100 atgaattact acaaaaagga ttaattgaac cttgtgcttc agaatggtca aatccagtac 2160 tattcgttag aaaaccaaat ggtgatttgc gtatgtgctg tgattacaga atgttgaata 2220 aagtgacgct caagcaaaag attcaattgc ctcgcataga tgaatgctta gaacgattac 2280 acaaggcaaa atatttcact tatttggact taacgaatgg ttttcatcag cagcgactat 2340 cggaaacgga ttcaattaag actgctatca gcacaagata tggtcaattc tgttggaaag 2400 tagttccttt tggtctatct aatagtggac ctgcttttca aaaaatgatg aattctattt 2460 tggctgatta tattgacaag tttgtgatgg tttacctgga tgatattctg atattttcga 2520 caggagatga agagcaacac aaaaagcatg ttagactagt attaaaaaaa cttgatgaag 2580 ccaaattgat tatcaataag aagaaatgcc gtttcaacag aaaagagcta acttttcttg 2640 gttttaatat tagcgctgaa ggtatcaagc ctgctccaga aaaggttaaa gctgtttttg 2700 attggcctac tcctactaat gtacagcaag tcagacaatt tattggtctt gcacagcatt 2760 atcgaagatt tatctctggt ttcgctggta ttgctgctcc tctaacaaat cttacaaagg 2820 gatctgggcc taaacgtcgt gctataactt ggaccgaaga ttgtcagaaa agctttgacc 2880 taattaaaag gaagctatcc agtgcgcctg tattgatgac gcctgacatg accaaacctt 2940 ttagaattga atgtgatgcc agtgactttg ctattggtgc tgtcttatta caagaagaag 3000 aaaacggtgt atggaaacct ctggcttttg aatccaagaa actatcccaa gctgaacgaa 3060 actatcctgc ccaagaaaga gaattgctca gcattttaca tgcattacga acatggagat 3120 gttttattga aggtaatgaa tatcaagttt tcacagatca tctaccttta aaatatttgc 3180 gatcgcaaaa taagcccaca ccaagattgg ttcgctggct cagtgaaatc gaactctatg 3240 atcctgatat attgtacaag ccgggtactg aaaatcatgt acctgattta ctgtcaagaa 3300 gagatggtgt agaagctaag ccggagaaag aatctatgca gcctcgatat ttatatcaca 3360 tttctgcagc tgtcaagtcc ttatctattc ttgatcaaga tcctatccaa gattggccat 3420 tacactatct gtcttctcca gaaagatggc ctaacaagat taaaaatgaa ctggacaaat 3480 tgaaacacca ttttgtaatc agggataatc aagtttaccg caaagaaaaa 
attccaaaga 3540 atgaagatta cgttgaactc aaatttattc cttttgtaag acgtgctgat atggtagata 3600 actttcatgt aggatatggc cattctggtc aaaccaatgt ctatagttta atgaaatcaa 3660 gagtctggtg gccaaagatg caaaatgata ttaacgcctg gatatctaaa tgccctcaat 3720 gtcaacttgc cggacctgct gataaaatta aacatcatgc tccaatgcaa ccattggaag 3780 tgcctcctgc tttttcaaga tggcatcttg atttcattgg cgaattaccg tcaacaaaaa 3840 atggtaacag atggattctt atggctgttg actatactac gaattggcct attgctcgag 3900 cactgcaaaa tgctactgga gaggaaatag tcaagtttat atatgaagaa atagttttgc 3960 gtttcggatg tcctaatgaa attctaacgg atcgaggggc aaacttcatg tctaaagttg 4020 tcaagcaata catttctaga atcaagacca aacataattt gaccagtgcc tttcatccca 4080 gatctaatgg caaatgtgaa agacttaacc aattgttcaa gaagatgctg acaaaatatg 4140 ttaatggtaa tgttcatagc tgggatgatt atatggatgc tgcacttttt gcttgtagga 4200 taagaaaaca tgcgactact ggattcagtc cgttcttttt aacttatgga gtggagcctc 4260 gaattcctgg tgatcatcat cgtcctttta tgaatgaatt tattgaacaa gatcaagaaa 4320 tcatgtcaca agatgttatg gtgcacttac gtaaattacg agaagcaaga tacacagctg 4380 aagacagatt aaggaaacaa gctgaaattg acaagtctcg ctgggattct ttgctcaagg 4440 acaagataca aatatttgaa attggtgatt acgtattgtt aaggcatgaa agcaagaaag 4500 gtctagaatt caattggatg ggtccatata aagtcatcaa acgcaaccta gactttaaca 4560 catatcaaat tcaagaaatt aatggaaaaa catatcaatc ttgggttcac acagatcgtc 4620 ttcatccagt caaatatgtt ggtacatcaa ttgacaagtc atggtatatt ccacgaatgg 4680 ctagaattga tacggatgcg aaattgaata atcaacacta aaaaaaaaaa aaaaaaaaaa 4740 aaaaaaaaaa aaaaaaaaaa aaaaaaaact ttcactttgg agtcttgttg ctaagggtct 4800 atcaacactt tgaaaagggg gtat 4824 // ID Gypsy-2_PCR-I repbase; DNA; FNG; 8796 BP. XX AC AADS01000313; XX DT 30-JAN-2011 (Rel. 16.02, Created) DT 30-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Phanerochaete chrysosporium genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_PCR_; KW Gypsy-2_PCR-LTR; Gypsy-2_PCR-I. 
XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-8796 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Phanerochaete chrysosporium RT genome."; RL Direct Submission to RU (30-JAN-2011). XX DR Genome; AADS01000313; Positions 25644 34439. XX CC Positions [5491-5997] - Integrase core CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 641..2875 FT /product="Gypsy-2_PCR-I_4p" FT /translation="MAQHGFQMPFKGERAAPEFDPREPRSLIRFFDDLEEC FT LRRANIVDDETKKKSALRYVSFREADAWQELDQYATGTYSAWKAEVLKLYP FT GADDSRKFTRSQLEKLVDDRRPLGFKTIGDWSDFYREFRTTAQWLIKQQKL FT PTHERDRILARVFEADFKARVDTRLLIKFPDVHPEDGYSMEQYNEAVNHLL FT YGTAATPSVPSAAIPSHVDSTVKTEEGLVDLLTRVFTRAQERSTSAAPSGS FT APRPSVQRSEASTGLDCHYCAQVGHCMSACPAAEEDICKGLIKRNAENRIV FT LPSGAFPPRSLEGRCMRDRILEWHRQNPNNVAVGSLSYSQPVATSQPTGSM FT LYSIVDSPSARQGSIPASAYTLSADDQERIAELEREIFAIRNGRPAPGGIR FT RSPRNHAGPIPTTRAEPATDERTSEPARVEQPRPAEPTPPPAAAQVPEHPF FT AQARDATYAPPAERNVGAPPPPPKKGDAKEPAYRTVAPIENPATADKVLSQ FT VLKSPTVQITPEDLLAISQPLRDRLRAQITPKRVATEKATEAATFLGSVDE FT GDGSPALPSVRELLRQESQQGLPRGVIRVPDPYEAYLRRLGPGETPRPLTV FT ARKSHALRAIHAVVANREEVECIIDGGCQVVSMSEAVCHALGLAYDPTIVL FT HMQSANGNLDHSLGLARDVPFRIGELTLYLQVHVIREAAYDILLGRPLDVL FT TESVIRNYRNEDQTLKMHCPNTGVIVTVPTFPRGKPRFRMPSQGF" FT CDS 5194..6579 FT /product="Gypsy-2_PCR-I_3p" FT /translation="MEPREADSFVRYALQFFVRDGKLWRRRLSGAHQLVVA FT PERRLRLLRDAHDGACHRGVFATTALLSERFWWPYLAFDVKWYVKTCHICQ FT TRKHQHHLIPPTVAIPASLFTRVYADTMHLPPSASRRFIVQARCSLTTFPE FT WVGLRRETGKLIGDWIYESLLSRWGILSEIVTDNGTPFIKAIEYLAKTRKI FT VHIRISGYNSRANDIVERAHYDVRQALYKAADGDPSRWYHAAHSVFWSERI FT TARKRLGCSPYFAVTGTHPVMPFDLAEATYLRPPPESVATTADLIAQRAIA FT MQKRSEDLARLRDRVYAARLRAARIFERDHAAVIRDYDFQRGDLVLIRNTR FT IEKSLNRKMRPRYFGPLIVVARNRGRAYILCELDGAVMDRPVVAFRVAPYF FT ARKSIPLPDGFEDITPERLRVMRDDTSLGDDDDGALTRSEPGEGSNDDEDA FT RSVSEASADDGEDLPED" FT CDS join(3055..3507,3511..4857) FT /product="Gypsy-2_PCR-I_1p" FT 
/translation="MLPLYDVLSELSDSALATLGDDPSSLRATFAAKKKYK FT PVHLKTRPVKTQLPSQYRIVRHPMPEALAELPKIDYANIPEFVPTGWYSAE FT RMRDTDTLHSGDFLLPEERKLLHHFMTLHQDAFAWDDSERECFKEEFFPPV FT EIPVVPHTPWVENIPIPPSIYDEVCEVIKKKIAAGVYEPSNATYRSKWFPV FT AKKFTKALRIVHSLEPLNAVTIQHSGVPPVPDYLTETFAGRACGGILDLYV FT GYDNRPLAETSRDLTTFQSPFGLLRLTTLPMGWTNSVPIFHDDVTFILQDE FT IPEYTIPYIDDVPIRGPATRYETPEGDYERIPENPGIRRFVREHFEVLNRV FT CQRMKYAGGTYSGKKSILIAAETVVLGYRCTYDGRLPEVDRVQVIHDWGPC FT KTLSELRAFLGTVGVLRIWIRNFAKRAHHLVKLTRKGATFEWGPEQQAAMD FT DLKEAACHCDALRPIDYKSPAAIVVGVDTSYLAVGYLLSQCDPEVPRVRYF FT ARFGSITLNDRESQFSQPKLEIYGVYRTLKELKPWLIGVRGWILEVDAQHI FT KGMLRNPDIAPSAAINRWIMYILNFHFKLVHVPGVKHGADGPSRRPAQPGD FT PE" XX SQ Sequence 8796 BP; 1783 A; 2986 C; 2455 G; 1572 T; 0 other; ttttggctgc cccagtgagg cctacgtcaa aattccgaag caaccgaagc ctcgaatcac 60 cacgaacagt agtcgcccga ttctcgagca aaaagacaac cccgacgaag tcctctacag 120 gaaacgcagg ctcagtacga taccagcgaa cgggccccga gcagcgtccg agccagtttc 180 ccgtctacaa cgaagagtat tccccggagc cttcgacacc ggatcacccg acactacctt 240 cgcaacacct tctacgagtc gcaccgtacc gtacccagcc gagcctcggc gacgcgctcc 300 cgctaagctc ggcagcaatc gtttcgaccc tagttgcctc gaagacaccc accccgacac 360 cgcgttttac gtccccgacg gtcccggaga cagtgacgcc gagtccgaga ccgagatcct 420 agcgcatctg cacgaacgcg aacccaccga gaacctcgga ggcggcgatc agcaaccgcc 480 agcgaaccca ccgaatccac cacaacacaa cgcgcctgcc gagccggcag ctcctgtcat 540 ccctcataac aaccacccac atctccccgc gaacccgccg aaccaacctc cgaaccctcc 600 caatccgcca ccgctacccc cagtacattt cggtccggcc atggcacaac acgggttcca 660 gatgcctttc aagggcgaac gagcggcgcc cgagttcgac ccgagagagc ctcggtctct 720 gattcggttc ttcgacgacc tcgaggaatg ccttcgccgc gccaacatcg tcgacgacga 780 aacaaagaag aagagcgctc tacgctatgt ttcgttccgc gaagccgacg cttggcaaga 840 gctcgaccag tatgcgaccg ggacctactc ggcgtggaaa gccgaggtcc tgaagctgta 900 tcccggggcc gacgattcgc gcaagttcac acgttcgcaa ctcgagaaac tcgtcgacga 960 ccgaaggccc ctcggcttca aaaccatcgg cgactggtca gatttctatc gcgagttccg 1020 caccaccgcg cagtggctga tcaagcagca gaagctcccg acgcacgaac gcgaccgcat 1080 
tctcgcccga gtcttcgagg ccgacttcaa agcccgagtc gacacccgct tgttaatcaa 1140 attccccgac gtgcatcccg aggacggcta ctcgatggag cagtataacg aagcggtaaa 1200 ccatctcttg tacggtactg cggcaacccc ctcggtcccc tcagcagcaa tcccttcgca 1260 cgtcgactct acggtcaaga ccgaagaagg actcgtggac ctgctcaccc gagtcttcac 1320 gcgagcacaa gaacgaagta cctcggcggc gccttcgggc tcggcccctc ggccttcggt 1380 acagcgctcg gaagcgtcca cgggcctcga ctgccactac tgtgcgcagg tggggcattg 1440 catgagtgcc tgcccagcgg cagaagaaga catctgcaaa ggcctcatca aacgcaacgc 1500 cgagaaccgc atcgttctac cgagtggcgc attcccgccg cggtcgctgg aaggacggtg 1560 tatgcgcgac cgtatcctag agtggcaccg ccagaacccg aacaacgtcg cggtcggttc 1620 gctctcgtac tcgcagccgg tagccacgtc tcaacccacg ggatcgatgc tatactcgat 1680 cgtcgactcg cccagcgcgc ggcaagggtc tatccccgcc tcggcttata cgctctcggc 1740 cgacgaccag gagcgtatcg cggagttgga gcgcgagatc ttcgcgattc gcaacggccg 1800 acccgccccc ggaggtattc gccgtagccc gcgcaatcat gccggaccta taccgactac 1860 tcgcgccgaa cctgcgaccg acgaaagaac atccgagcca gctcgggtag agcagccccg 1920 acctgctgag ccgacaccac cgccagccgc ggcacaagtc cccgaacacc cgttcgcgca 1980 ggctcgggac gcgacctatg caccaccggc ggagcgaaac gttggcgcac cgccgccgcc 2040 tccgaagaag ggcgatgcga aagagcccgc gtaccgtacc gtcgccccga tcgagaatcc 2100 cgccactgcc gataaagtgc tctcgcaagt cctgaagtcg ccgaccgtac agatcacgcc 2160 ggaagacctg ctcgcaatct cgcagccgct acgcgaccgc ctccgtgcac agatcacccc 2220 gaagcgagtc gctaccgaga aagccaccga ggcagcgacg ttcctcggct cggtcgacga 2280 gggagatggg tcgcccgcat tgccctcggt gcgcgagcta cttcggcagg aatcgcagca 2340 aggcctaccg cgaggcgtca tccgagtccc cgacccgtac gaggcgtatc tgcggcgact 2400 tggaccaggc gaaactccga ggcccttaac ggtcgcccgc aaatcgcatg ccctgcgagc 2460 gatccacgcg gtcgtagcga accgagagga ggttgagtgt atcatcgatg gcggttgtca 2520 agtggtctca atgtcggagg ccgtctgtca cgcccttggc cttgcgtacg acccgacgat 2580 tgtcctacac atgcagagcg cgaacgggaa cctcgaccac tcgctcggtc tcgctcgcga 2640 cgtaccattc cgcatcggcg aactcaccct ttacctccaa gtccacgtta ttcgcgaagc 2700 ggcctacgac attctcctcg gtcggcccct cgacgtactc acggaatcgg tcatccgcaa 2760 ctatcgcaac 
gaagaccaga ccctcaagat gcactgcccg aacaccggtg tcatcgttac 2820 agtgccgacc ttcccgcgag gcaagccccg attccgaatg ccgagccagg gtttttagtg 2880 gccgaggatt cgagcgcacc tcgaggaggt cctaaaaatg ccggcggtac acaagacaaa 2940 ccagccgaag aagccgaacg cgaagaagaa cgagctgcca tcgcaagttt cctttctatc 3000 attaatacgc ctgtagccga agaccctgcc gtagaagtct ctcaatatct agctatgctt 3060 cccctttacg acgtgttgtc cgaactgtcc gactccgccc tcgcaactct cggcgacgat 3120 cctagctcgc tccgcgcaac cttcgctgcc aagaagaagt acaagccggt gcatcttaag 3180 acgcgtcccg tcaagacaca actaccctcg cagtatcgca tagtgcgaca cccgatgccc 3240 gaggccctcg cggagttgcc gaagatcgac tacgcgaata tccccgagtt cgtaccaacc 3300 ggatggtatt cggccgagcg catgcgcgat accgacacgt tgcacagtgg cgactttctc 3360 ctgcccgaag agcgcaagct cttgcaccac ttcatgaccc tccaccaaga cgcgttcgcc 3420 tgggacgatt ctgagcggga atgcttcaag gaggagttct ttccgccggt cgagattccg 3480 gtagtgccgc atacgccctg ggtggaatga aatataccga taccgccgag catctacgac 3540 gaagtctgcg aggtcatcaa gaagaagatc gcggctggcg tttacgagcc gtcgaacgct 3600 acgtaccgat cgaaatggtt cccagtcgcg aagaagttca cgaaggccct tcgcatcgtc 3660 cattcgctgg aacctctcaa cgcggtcacc atacagcact cgggcgtgcc ccctgtcccc 3720 gactacctga ccgagacctt cgccggaagg gcgtgcggcg gaatcctcga cctctacgtc 3780 ggctacgaca accgccctct cgccgagacg tcgcgagatc tcaccacgtt ccagtcgccc 3840 ttcggcctac ttcggctcac gacgctaccc atgggctgga cgaactcggt cccgatattc 3900 catgacgatg tcacgtttat cctacaagac gagattcccg aatataccat cccctatatc 3960 gacgacgtcc ctatccgagg tcctgcgacg cggtacgaga cacccgaagg cgactacgag 4020 cgcatacccg agaacccggg cattcgccgg ttcgtccgcg agcacttcga agtgctcaac 4080 cgcgtctgcc aacgcatgaa gtacgccggc ggcacgtact cgggcaagaa gtcgatctta 4140 atcgcggccg aaacggtggt actcggctat cgctgtactt acgacggaag gctacccgaa 4200 gtcgatcgcg tacaggtcat tcacgattgg ggtccctgca agacactctc cgagctaagg 4260 gcgttcctcg gcaccgtcgg cgtgcttcgc atctggattc gcaacttcgc gaagagagcg 4320 caccacctgg tgaagcttac tcggaaaggt gcgactttcg aatggggtcc cgaacaacaa 4380 gccgcgatgg acgacctgaa agaggcggcg tgtcactgtg acgcccttcg gcccatcgac 4440 tacaagtctc cggcagctat 
cgtagtcggc gtcgatactt cgtatctcgc agtcggctac 4500 ctattgtcgc agtgcgaccc cgaggtccca cgggttcggt acttcgcgcg attcgggtcc 4560 atcacgctga acgaccgtga gtcgcaattc tcgcagccga agctggagat ctacggtgtc 4620 tatcgcacgc tcaaggagct caagccttgg ctcattggcg tacgcggttg gatactagag 4680 gttgacgcac agcacatcaa gggcatgctg cgcaaccccg acatcgcacc ctcggcagca 4740 atcaaccggt ggatcatgta catcctgaac tttcacttca agctcgtaca cgttcccggc 4800 gtaaagcacg gcgccgacgg cccatcgcga cgacctgcgc aaccaggcga tccagagtga 4860 cccgaagatg acgacgacga tttcggcgac ccgaactttt cgttcattca ttttatcaac 4920 ccgattcgcc ctattcgcgc accgacaccg ataactcttc ataccgaact gtctctcgcc 4980 tcggtcttcg cgttggccga cgatggtacc gcgcccaagc acgaccccga gattcaagac 5040 gctaggcccg agctacctga agatgtcgca accgcgctgg atccacccga cctggtgtac 5100 gccgatattc ctcggtcccc cgccgctcgc gccgaggacg cgaagctcga gttggttcgc 5160 cagttcctcg agacgctggt gcggccagac gggatggagc ctcgcgaggc cgactcgttt 5220 gtgcggtacg cgctgcagtt cttcgtacgg gacggcaagc tgtggcgtcg ccgtctgtcg 5280 ggggcccatc aactggtcgt cgcccccgaa cgtcgacttc ggcttctgcg cgacgctcac 5340 gacggtgcct gccaccgagg agtcttcgcg acgaccgcgc tcttgtcgga gcgtttctgg 5400 tggccctacc tcgcgtttga cgtcaaatgg tacgtgaaga cgtgccatat ctgccagaca 5460 cggaagcacc agcaccacct gataccgccg acggtcgcga taccggcctc gctctttact 5520 cgggtctacg ccgacaccat gcaccttccg ccctcggctt cgcgtcgctt tatcgttcag 5580 gctcgctgct cgcttaccac gtttcccgag tgggttggtc ttcgccgcga aacagggaag 5640 ctcatcggcg actggattta cgagtccctt ctctcgcgct ggggtattct gtccgaaatc 5700 gtcaccgaca acggcacacc attcatcaag gctatcgaat acctcgcgaa gacccgcaag 5760 atcgttcaca ttcgcatcag cggatacaat tcgcgcgcga atgacatcgt tgagcgagcc 5820 cattacgacg ttcggcaagc actgtacaag gcagccgacg gtgacccctc gcgctggtat 5880 cacgccgcgc attcggtatt ttggtccgaa cgtatcacgg cgagaaaacg cctgggctgc 5940 tccccatatt tcgctgtcac cggtactcac ccggtcatgc cgttcgatct cgcggaggcg 6000 acataccttc gcccgccgcc cgagtctgtc gcgaccacgg ccgaccttat tgctcagcgg 6060 gccatcgcga tgcagaagcg aagcgaagac ctcgcgagat tgcgcgaccg tgtgtacgct 6120 gcccggcttc gtgcggcacg gattttcgaa 
cgcgaccacg ctgcggtcat tcgcgactat 6180 gactttcagc gcggcgacct cgtcctcatt cgcaacacgc gtatcgagaa gtccctcaac 6240 cgcaagatgc gccctcggta tttcggacca ctcatcgtcg tcgctcgcaa tcgcggcaga 6300 gcctacatcc tatgcgagct cgatggcgca gttatggacc gtcctgtggt ggccttccga 6360 gtcgcacctt acttcgctcg caagtcgatt ccactccccg acggcttcga ggatatcacg 6420 cccgaacggc tgcgcgtcat gcgagacgac acttcgctcg gcgacgacga cgacggtgca 6480 ctcactcgct cggaacctgg cgagggctct aacgacgacg aggatgctcg ctccgtctct 6540 gaggcttcgg ccgatgatgg agaggacctc cccgaagatt aattcacccc cccttcacgc 6600 gtcgcgtcgc atccgcgtcg ccctcgcgca cggcctatga tgacgtactg aggctatacc 6660 ccccttcttc cttttctcgt tcacttccca ttcacgtttc cctgcgccct ttcttcgctg 6720 gccccgaggg cgggcccttt ttttcggttg ggaggagaga gcaagccgac aaccaaacac 6780 tgcgcaaacc tcgactattt ataacaaaaa acaacatata gacatacaag tacaactaac 6840 gacgaccacg cgaacgcctc tcaccgagct cggcccttac ccttcgcact cgactcggca 6900 ggacctccag gaggcgtacg gcgccgctcg gcactatcgt cacgacgctc gcgaacgccc 6960 ggcaagtcgt caagaatggc ctggagccct gaacgaaggc tagtcagcag ctcgatacgc 7020 gcggataaca tcgcgatctc gctggccaac gcgtcggcct gggtgcgcac gatgtgttgc 7080 gcgacgtgct cggggggcgg gcgaagctgc agctcgcgca gcacagacga gtcgaccaac 7140 gccggcgaat gcccgaagta cccggacggc ggcggcatcg tgctggaaga cgcagccgac 7200 gaaggacccg gaaccgaggt cgcggcacga gctcgagccg aggccaccgg aacggtacga 7260 ggtcgctctg cgacacaagt caatacgaag cgcagcaccg agattcgacg acactcacta 7320 gtgttggcgg ccttgtcagg gctcaggcga ccctgtttct gcgaagtttg gctggacgcc 7380 actgaggcag cgtccgcgac gggctcggcc tggctttgcg acgcgcccga ctcggcaccg 7440 ctggcctccg gcgaaggtgg gaaggcgcgg ttgccgggac atttcacctt tcgctccaag 7500 caccgcgcgc accgacgttt cccggcctgg aaaacgcact cttcgtcgcg gctgatgcag 7560 aaccggcact ggcccacacg tcagctacct cgcgtccgaa aacgaggaac gggacgtact 7620 cggtcgaccc cgaagaattc ggcattttcg gtcgcgggca cggattgacc acggcgcgcc 7680 cgacgaggga ctggcttgcg acgaaccgaa cttagctccg aacgcgacct cgcgccgtcg 7740 ccaccctgcg gtgtcggctg gtcgggttcg ttctcggaca cctgcgaagt gggctctgtc 7800 gaaggcggca ccgccgactc gcttccctcc gcgtcgtctg 
cttggctaag ctcgtccacg 7860 tccatgcgag gagatcccgg cgcgttcgat gggttgggcg agggcgagga cgtccgcgag 7920 cgcatccgag cgtcagaatc gcgaagcgca ggcacgatgg tcccgctgcc cggggaggac 7980 gttggacccg cggcaacggt agcgggcgtg gcaggtagct gccgctcggt gtcttcgtcg 8040 ccgtacaaca cagcgagaag gcgatcctcg gggattggcc cgaggtctgc gagcctacgc 8100 gctgcggcct cgacggcgtc gctattggca tacccctccg cttcgaaccg acgacgtaac 8160 gtctcggcct cggccagcac cgcgcgagac gctaccgaac tcaggtccgg cgtaaggaac 8220 tcgcgcgcga ggtccgctgt gtcgctgctg tcggcgagct gatagagccg aagccccatc 8280 ttcaactgga agtcgtcgag ctcctctgcg agaccaggca cggccgcgac ggcggacggc 8340 tcttcggact aacatcgctc agtcaataca agtcggcaga cattcgcgct acgcaccgag 8400 acgctccggg cttgctccaa ccaggctcgg ttggccgcga cgttgtcaag gtcgattccc 8460 tcgagctgcg tgagcagccg gcggcgaagt cgagttcggg ccccgacgcg ctgctgcgct 8520 cggtcgctcc gggcgggcga aggctcagtc gtggggcggt gctgctggct catttcgagg 8580 gagatgcgtt gcaatagcgt acggttcccc ctcaccatcc tttaatccac gggacacgcg 8640 cactcggccc ctcttccacg tagcagatca ctcggcgatc cccgaagaag cgatgacggg 8700 ctacattctc gtacgcgggt ccgttcactc cattccccgc tcacgcactc cattcgctgg 8760 cccgaggatg ggccttcttt tcggttggga ggagaa 8796 // ID TCN2-I repbase; DNA; FNG; 5488 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 21-APR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR retrotransposon - internal consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW integrase; reverse transcriptase; TCN2-I; internal portion. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-5488 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-5488 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. 
et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-5488 RA Gentles A. and Jurka J.; RT "C. neoformans LTR retrotransposon TCN2."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX CC 966 bp LTR deposited as TCN2-LTR. Probable 5 bp TSD. XX FH Key Location/Qualifiers FT CDS 64..5020 FT /product="ORF1p_TCN2" FT /translation="MAFDSQGQPIIAYDEVTSRPIIGHHPTGVPIFGTFTD FT DHLSSAGSGENLASPTTSNAVPSVPDLPDEKMVNYATPVHDTTGAIYRKTP FT GLTTAFSGPAVPEIDELKSIVAGLLRSTQALADQATTGHDRHVPPSTRGSR FT PDKISPPRLGTHSSADPFALQRHLLALETYFRDALLHWSPGPAEHAWKISV FT ANTSISQSGGRFTTWLELHGKHAETWDAWQVSLKSHILARGWESKLRHRFM FT QLRCLDTTPAAFDAFFHLVISYQGILHDFEDPLSDVDVNRRLLDGVDALVT FT HRTKAALRARGKTVLSATSADLXEIMMDVIDDAMLEARFLQVPTRPRHPPT FT VANVIAPAPAPPLIAKLAPAPSTTTKGHTAQQLDWLDPAKRLPLGDAGRSA FT RAYLQSINACFSCRVVGHHRLICPTRPPSTPPNASASVPVANLVSLADDDE FT SDHHGVFAVDPVTDTLVQDASSALAGSVPLIMVNCRFKADGDTVPALVDCG FT AGINVVDRAYAEQQGWQGRPIIPVGTKMADNRAGPVVDQEYVVDVIIGDTT FT YNATPFYAMALGPRYRLILGLQFCRQHRLFDGAERLNHLLNAGGSSYTSLV FT QLQLNSITPVESPTVSTERHSHSDAILREFADILPANISDVSHYPPICSST FT SQVRHRINILPDAMPVARAGFRVPLAWRDTLRQEIEKHRTAGRLRPSSSPW FT AAPAFLIKKENGKFRFLCDFRGLNSVTVKDRTPVPNIDDILQRAARGKVFA FT KLDLTDAFFQTLMHEPDIEKTAISTPWGLYEWVVMPQGACNSPATQQRRLN FT EALRNLISVCCEAYVDDIIIWGATDSDLAKNXRAVLTALRNSGFVCSPSKS FT KFFVDSVSFLGHVISPNHIGPDPKKVEALRAWPSPGCVKDLRSFLGLLQYL FT RKFIPHIATKTSVLTALLPPNKTAEKAYESRKRQLAKGLPAERLESLSWVW FT KWTTSAQXAFEALKEMVARITGLSPLSHEAILAGQTNLYLFTDASXTGLGA FT WLGTGXSPDNAQPIAYDSRSLTAAERNYPVHEKELCAIIHALKEWRPLLLG FT VPVHVMTDHATLKWFFQQPNLSERQKRWLLVLADYDLQISHIPGATNVIAD FT AFSRLRNSDAHVNALTMMVLSPNATFLDEVAXGYGQDPVMXIWREVDRCPP FT GVRTTEVNGARGVRTVLTXEDRLCIPEVLTXREQCLQECHDAMGHFGVEKT FT LELXRCKYFWDGMASDVKDFVSTCPACQTSKATTTXPPGLLHSLPVPPAKF FT SDIGIDFVGPLPQSHSFDYLIVITDRLTGWVALIPTVTTLTSSAFAQLYYD FT HWVSKYGVPQSIVSDRDKLFTAASWRRLNSLLGTKLKMSTAYHPQTDGISE FT RSNKTVXQILRTWTDDQGRNWAANLQRVAFAMNNTIRRSTHHTPAELVFGK FT 
RLSLTPPLLPSTSATDPSLAQPTASEWDLAAQRMALEEGIARDELLLAKHR FT QSVQANKHRRPDPVYXPGDKVYLNTAEFRHEYKTATNRSAKFMPRWEGPFT FT ILXAFPEQSLYELDVPVTSTQSTPRRHVSRLKPYRESEQYHQHAVPRLLDR FT PAVSSPRILQILEDRTLTPKGNHPKVYQVRARLAGEGPKARWLSRDTAQDY FT AGWQEAWEDFIGEDELHLDVLDVTTDVPQMFTQGX" XX SQ Sequence 5488 BP; 1185 A; 1878 C; 1205 G; 1184 T; 36 other; cttttttttc aaactcatta cccaaatcaa ttcacaccac actccaccac accaacatac 60 atcatggcgt tcgattctca gggccaaccc atcattgcgt acgacgaggt gacttcccgt 120 cccatcatcg ggcatcaccc cactggcgtt cccatcttcg gcacattcac ggacgaccac 180 ctatcgtccg cgggctccgg agaaaatttg gcttcgccca caacttcaaa cgcggtgcct 240 tcggtgcctg acctacccga cgaaaagatg gtaaactacg caacccctgt acacgatacc 300 acgggtgcaa tctaccgaaa gaccccgggc ctcaccactg ccttttccgg cccggccgtt 360 cctgagatcg acgagcttaa atctattgtt gccggactcc tncgttccac ccaagcgcta 420 gccgatcaag ccaccaccgg ccacgaccgt cacgtccccc cctccacacg cgggtcccgc 480 ccggacaaga tctccccgcc gcgacttggg actcactcct ccgccgaccc gttcgcnctc 540 cagcgacacc tcctcgcntt ggagacctat ttccgtgacg ccctcctaca ctggagcccc 600 ggcccggctg agcacgcgtg gaagatttct gtggcaaata cctcgatctc ccaatcgggc 660 ggccggttca ccacgtggct cgagctccat ggcaaacacg ccgaaacttg ggatgcatgg 720 caagtctccc ttaagtctca cattctcgcc cgcggctggg agtccaaact ccgccaccga 780 ttcatgcagc tccgctgctt ggacaccaca cccgccgcgt tcgatgcatt ctttcacctt 840 gttatcagtt accaaggcat tcttcatgac ttcgaggacc ccctcagcga cgtggacgtc 900 aaccgccgnc tattggacgg tgtcgatgcc ttggttactc accgcaccaa agctgctctc 960 cgcgcacgtg gcaaaacggt gctctccgca acgtcggccg acctatanga gatcatgatg 1020 gacgtcatcg atgacgccat gcttgaggcg cgtttccttc aggtgccgac acgcccccgc 1080 cacccaccaa ccgtcgccaa cgtcatcgct cccgcccccg cgcctccact cattgccaaa 1140 ctcgcccccg ccccctccac caccacgaag ggccacacng cacaacagct cgactggctc 1200 gaccccgcca aacggctccc cctcggtgac gccggccgat ctgcccgcgc ctatcttcaa 1260 agcatcaacg cgtgtttctc atgccgcgtt gtcggccatc accgccttat ctgcccaacc 1320 cgccccccct ccaccccgcc caacgcctcc gcgtccgtgc cggtcgctaa cctcgtctcc 1380 cttgccgacg acgacgagtc cgaccaccac ggcgttttcg ctgtcgaccc 
tgtcacagac 1440 actctagtac aggatgcctc gtctgcgctc gctgggtcag tacctctcat catggtcaat 1500 tgccgtttca aggctgacgg agacactgtc ccagcactcg ttgattgcgg cgctggcatc 1560 aacgtcgtcg accgggcgta cgcagagcaa cagggatggc aaggacggcc gattataccg 1620 gtggggacca aaatggcaga caatcgggcg ggtccagtcg tagaccagga gtatgtagtg 1680 gatgtaatca ttggtgacac tacctacaac gctaccccat tctacgccat ggcccttggt 1740 ccacgatacc gccttatcct cggattacag ttctgtcgtc aacaccgcct atttgatggg 1800 gcggagcgtt taaatcacct cctcaatgca ggggggtcat cctatacatc gcttgtgcaa 1860 ctacaactca actccatcac accagtcgaa tccccgaccg taagcactga acgccactcc 1920 cactccgacg ccatcctccg tgaatttgcc gacatccttc cagccaatat ctctgacgtn 1980 tcccactacc cgcccatttg ttcgtccacc tcccaagtcc gccaccgaat aaacattctt 2040 cctgatgcga tgcctgtcgc tcgagctgga tttcgagtac cgttagcgtg gcgcgacacc 2100 cttcgacaag aaatcgagaa acaccgtacc gcaggccgcc tccgtccatc cagttcccct 2160 tgggccgccc ctgctttcct cattaagaaa gaaaatggca aattccggtt cctctgcgat 2220 tttcgcggcc tcaacagtgt cacggttaaa gatcgcaccc cggttcccaa cattgacgac 2280 attctccaac gcgccgcccg tggcaaggtt ttcgccaaac tcgaccttac cgatgcattt 2340 tttcagacgc tcatgcacga gcccgatatc gagaaaacgg caatcagcac tccctggggt 2400 ttatacgaat gggttgtgat gccgcaaggc gcgtgcaact cgccggcaac acaacaacgc 2460 cgcctcaacg aggctttacg taacctcatc agcgtttgtt gtgaagctta tgtcgatgat 2520 atcatcattt ggggcgcgac cgactctgac ttagcgaaaa atatncgcgc ggttctcacg 2580 gctttacgta acagcgggtt tgtttgctcg cctagcaagt cgaaattttt cgtcgactca 2640 gtatccttcc tgggccacgt aatctccccc aatcacattg ggccagatcc gaagaaagtc 2700 gaagcactac gcgcatggcc atctcctggt tgtgtgaaag acctccgatc ttttcttggc 2760 cttctccagt atttacgcaa attcatccca cacatcgcca ccaagacgtc cgttctcacg 2820 gctcttctcc ctccgaacaa gacagcagag aaagcgtatg aatcccgtaa acgtcaactn 2880 gctaagggcc tcccagcnga gcgattagaa tcactgagtt gggtatggaa gtggacaacg 2940 tcggcgcaag angcgtttga ggcgctgaag gaaatggtgg cacgtatcac aggtctgtcc 3000 cccctttccc atgaagctat cctcgcaggt caaaccaatc tctacctttt caccgacgca 3060 agcaanaccg gcctcggcgc ctggttgggc acgggtntat cccccgacaa cgctcaacct 
3120 atcgcctacg attcccgctc tctcaccgcc gccgaacgaa attatccggt acacgaaaaa 3180 gagttatgcg ccatcatcca cgccctcaaa gagtggcggc ctctacttct cggcgtcccg 3240 gtgcacgtca tgacggacca tgcgactctc aagtggttct ttcaacaacc aaatctgtcc 3300 gaacgtcaga agcggtggct actagtactc gccgattacg acctccagat ttcccatatt 3360 ccaggggcca ctaatgtcat cgccgacgct ttctcccggc tccgcaactc cgacgcccac 3420 gtcaacgccc tcaccatgat ggttctctca ccaaacgcaa ctttcctgga tgaagtngct 3480 gangggtatg ggcaggaccc ggtaatgagn atttggaggg aagtagaccg ctgccctccg 3540 ggtgtccgca ctaccgaagt caacggagca cggggggtca ggacggtgct gacatangag 3600 gaccggctct gcatccccga agtactnacc ttncgagaac agtgcctaca ggaatgccac 3660 gatgcgatgg gccatttcgg ggtggagaaa acacttgaac tantgcgttg taagtacttc 3720 tgggatggta tggctagtga cgtaaaggac tttgtcagca cttgcccagc ctgtcagaca 3780 tccaaagcta ccaccacnaa ncctcccgga ctactacact cattaccagt tcctcccgcc 3840 aaattctccg acataggcat agacttcgtg gggccactac cgcaatcaca cagcttcgac 3900 tatctcatcg tcattaccga tcgcctcacc ggctgggtcg ctctcatacc aacagtcacg 3960 acgctcacgt cctccgcttt cgctcaactc tactacgacc actgggtttc taaatatggg 4020 gtaccacaat cnatcgtctc agaccgcgat aagttattca ctgctgcgtc atggcgtcgg 4080 ttgaattcnc tcctgggcac taagctaaag atgtccacag cataccaccc ccagaccgat 4140 ggtatatcag aacgatcnaa caagacagtc atncagatcc tgcgnacctg gactgacgac 4200 caaggccgaa attgggcagc naacctacag cgggtcgcct tcgcaatgaa caacaccatc 4260 cgacgctcaa cccaccacac ccccgccgag ctcgttttcg ggaaacgcct gtcactcact 4320 ccgccgctnc tcccctcaac atcagctacg gacccgtccc tcgctcaacc tacggcttcc 4380 gaatgggatc tcgctgccca acgcatggcc ctcgaagagg gcatcgctcg tgacgaactg 4440 cttctcgcta agcatcggca aagtgttcaa gccaacaaac atcgtcggcc ggaccctgtn 4500 taccncccgg gagacaaagt ctacttgaac acagctgagt tccgtcacga atataagacn 4560 gccactaacc gttctgcgaa gttcatgccc cgctgggaag gccccttcac catcctcnag 4620 gcctttcccg agcaatcact ctatgaattg gatgtccccg tcacctcaac acagtcgacg 4680 cctcgccgcc atgtttcgcg cctcaagcca taccgggagt ccgaacagta tcaccagcac 4740 gcggttcctc gcctactcga ccgcccggct gtttcctcgc cacgcatcct ccaaatcctt 4800 
gaagaccgca ccctcacccc caagggaaat catccaaagg tctatcaagt gcgcgcccgc 4860 ctcgccggtg aaggccccaa ggctcggtgg ctcagtcgcg atacngcgca ggactatgca 4920 ggttggcagg aggcttggga agactttatt ggcgaagatg agctacatct cgacgtactg 4980 gacgttacta ctgatgttcc tcagatgttc acccagggtt aacggtcccg acatggccac 5040 actccgtcgt gagctgtata caccctccct aacacctttt ttttcctcat ctacccctca 5100 gaccaggctg acaccttcgc acagatgcca aaccaccccc cccccccccc catcaagtcc 5160 ccagcccacc cctgtccagg ttttcaattc cgttcccccc ntttttttcc ttcgtgccgt 5220 gttttctcat tttttttctc gtccgtttat ttctccttca gtttcctccc ccccttcctc 5280 acgggcctga agacagacaa gaggttcccc caccccccat tttttttatc cccagacaaa 5340 gaatctgaaa agaagccaaa aggccaggga gttttttcgt ggtgggggtt tccctcagtc 5400 aacatgtttc atggattgtc tctcagcctg gcccctcatt gaagcccagc ttctacagtt 5460 ttctagctca tttctttctc cctcnttg 5488 // ID Gypsy-62_MLP-LTR repbase; DNA; FNG; 429 BP. XX AC AECX01001306; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-62_MLP_; KW Gypsy-62_MLP-I; Gypsy-62_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-429 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001306; Positions 36558 36130. 
XX SQ Sequence 429 BP; 115 A; 82 C; 66 G; 166 T; 0 other; tgtcatgatc ttgtagttgg ttcgaaaacc tgtatgacaa ttaccgtctt tcagtaacca 60 tttagcgttc ttttgagaaa caacttgtaa cattagtagt tgatgcatgg acatatggga 120 tattattact aatcaggaac aaggcactat tgttagacaa ctcgtctttg ttactatcta 180 tatatatctt gtatctctta caaacttgtt gtgtttcttt tctcatcaga ataattatat 240 ttcttacttc ccttttcatc gttcctatta aggaaaacta ttgccgtttg ctttatttgc 300 tttctccgat ttgagacgtt tgctttctga aaagactccg cattgagctt ttcgaaataa 360 accttattct ccgatttgag acgcagtagt aaagcttgtt gagttttacc tacaacccta 420 atcatcaca 429 // ID Gypsy-74_MLP-I repbase; DNA; FNG; 9437 BP. XX AC AECX01000970; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-74_MLP_; KW Gypsy-74_MLP-LTR; Gypsy-74_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-9437 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000970; Positions 48724 58160. XX CC Positions [3144-3647] - Reverse transcriptase CC Positions [4818-5297] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 513..2549 FT /product="Gypsy-74_MLP-I_1p" FT /translation="MEDHTGFIQIPAEDLCLIRAQLKQFGNELHQREQHHA FT REIADLRQQNQNVFQQQTNQINHLENLLNSRAQTPAPPKPDAFRAEPRKFT FT GYGDNAHLWFSELESNFTMHRYPESSWGEMLGLYLDEESNMFWHEVRKTQG FT GTITDYQDFKLHFLNHYNFDMMVEEILEKVKICYYKGDINDYILRFRRLMA FT YLPLDVVTFETRKFNFIDKLPAHFREKINNKGDIRTNKDMELIYSAAREAE FT RTVKINNRSFKSQDYHKSSNSHHKNFDKKTFNSHRSSFRSPNLFGKPHTVT FT NSSGPMDLDSVDMAKVECYSCHAMGHMADKCPKGHKRKPFNNSRNRPNNSD FT RRRPNLLLMEHLPFFLSDLLSLDSFVSPFQDHELQEETPFITPFSDADGKV FT YRASQESSESLDGPTEDGQIDLEEYKNHYLEALRDIEPETLLNLNKEREAV FT ARVIKDANDFCCCRSCKAYGNEDSPPTTSSGISYFDLTPQDFNTSKKRTHL FT DSPKEQEPGCDHDVFSIDMKTENLNPEKKTKLELPVIDGYRYTTIRTPSPS FT NGPDPVELAERELVNVETWSHILNTTYTSWADRDDFDSVCTPDEENSSTAA FT VESRGSIMSDAHSNLTDYKDPGPQAINLPSLSSSLHDLSVVDIESSSLSEH FT NGLTRPILDDNPHLASHEEDGKTSLPIYPV" FT CDS 2943..5489 FT /product="Gypsy-74_MLP-I_2p" FT /translation="MDKDNYEKLLEDLARKTAPACWKEEIGTVNGKYKHHI FT DTGDAKPIKTHGRPHTPPEHATIKQFVEDSIAQGIIEPSDSPWSSPLLLVK FT KPDGSNRVCVDYRALNKVTKKNAYPLPRIDDAYLFLSGSKVFSTIDLKSGF FT WQVPMEGADKEKTAFTCRLGHFQWRVMPFGLCNAPATFQEMMNSILKEVID FT KFALVYLDDVIIYSKSEEEHKNHVKTVFKILQKNGLVVSAKKCKFGKSSLL FT FLGHIVDGDGIRTNPEKIQKIVEWPSTTNISHVRGFLNLCTYYKRFILKFS FT SVASPLYKLTEGSPRPGTSIVWGDTEQLAFDKLKASLSQTVPLQHPEPFKP FT FVLDTEASGTNIGAILQQDTNVKVPEGVSFDYNLYQKNLRNSNLRPIAFES FT RKLSKTEQNYSAQERELLAIVHGLKHFRGFIEGSPVLIRTDHESLKYFKTQ FT KHINRRLARFVDEIEFFNTQIIYRPGNDQLAADSLSRKPDTEFDTDPPETA FT ASLFAIHDHEGQYRELLQNKLQLQSGIPPSKVGNGDFFLDGDRMVKIDLDH FT PHREQLTVPTSRPEAERICYRIHEDLGHCNIKDVTSAVKRRYWFPNVENTV FT KEAIDLCEACQVHAPISKKQGMPLQFLPRGKPFRKWGMDFVGPLPKTPNGN FT QYIITAIDYGTGWAYAKALVSTSSDAAVSLVKELTLNHGVPEEIITDNGSE FT FISHNFKTYLKKNNIKQKNTSPYHPQTNGLDERFHGTLTNALRKFCSPYNQ FT NLWDNYLNTCLFAYRASFSHSMKASPFYMAYGAEARLPSEAVSIKFDNSLE FT NLELILRQRNIATDKLNTTRQELIDDLNQRAKERSVTQEESYTERNF" XX SQ Sequence 9437 BP; 2936 A; 2240 C; 1833 G; 2428 T; 0 other; aatatatatt tgtgtgtttt atttattttt ctttctcaat ttctctaact ttaatatcat 60 tataaacctg atatcgaaaa gggctcgttc tctaaatccg tgatctaact 
ttgtgtttct 120 caatatccct ttcgacagac aaccgtaatc gaatacccta taaccagaag caacaaacct 180 ccccacaatc cggattaaaa ccgtcgtaag cttttgaaat ctcctagcta gtcattccaa 240 atcctctcga cacccttccc ctcaattagt tagattaatt aaagaatccc ttctcttata 300 cacacagaac aaatcaccaa caccttataa ccgtttatcg aaacactttc ctcgtgcttt 360 ttagccattt gatttttaaa tcactcatca gtatcttttt gaagaaccca cgtacgtaaa 420 ccagactctg atcaagtcta aatattccta agaaccatct cttaaatttc cctttgcttc 480 taccctatac acgaaccaat aacccaacag aaatggaaga tcataccgga ttcatacaaa 540 tcccggccga agacctttgt cttatccgcg cccaactcaa acagtttggc aacgaactcc 600 accaacgcga acaacaccat gctcgcgaga tagccgacct tcgccaacaa aaccaaaatg 660 tttttcagca acaaactaac cagatcaacc acctcgaaaa cctccttaac agtcgagcac 720 agactccagc tccccccaaa cccgatgcct ttcgggctga accaagaaag tttaccgggt 780 atggagacaa cgctcacctc tggttttcag aattggaatc caattttacg atgcatcgat 840 atcccgaatc ctcctggggc gaaatgctcg gattgtattt ggacgaagaa agtaatatgt 900 tctggcatga agttcgtaaa actcaaggag gaaccattac tgattatcaa gacttcaaat 960 tgcatttcct gaaccattat aacttcgaca tgatggtaga agaaatatta gaaaaggtca 1020 aaatatgtta ctataagggc gacatcaacg attatatctt aaggttcaga agattaatgg 1080 cgtaccttcc cctagatgtt gtcactttcg agactcgcaa attcaatttc atcgacaaac 1140 tcccagcaca ctttagagaa aaaatcaata acaaaggtga catcagaact aacaaagata 1200 tggaactcat ctattcagct gcgagagaag ccgagagaac cgtcaaaatc aacaaccgaa 1260 gttttaaatc acaagattat cacaaatcct ctaactctca tcacaagaac ttcgacaaga 1320 aaaccttcaa ctctcatcgc agtagcttcc gatctccaaa cctctttggc aaacctcaca 1380 cggtcactaa tagctccgga ccaatggatt tagattcagt agatatggca aaggttgagt 1440 gttatagctg tcacgccatg ggacatatgg cagacaaatg tcccaaagga cacaagagga 1500 aacccttcaa caactcaaga aatcgcccta ataactctga tcgccggaga cctaacttat 1560 tactcatgga acatcttcct ttctttctgt cagaccttct ttctcttgac tcgtttgttt 1620 ctccgtttca agatcacgaa ctacaggaag aaaccccttt tattacccct ttctctgacg 1680 cagatggcaa agtctacaga gcgtcgcaag aatcatctga aagcctagat ggaccaactg 1740 aggatggtca aatcgatctg gaagaataca aaaatcacta cctcgaagca ctccgagata 1800 tcgaacccga 
gaccctgctc aacttaaaca aagaaagaga ggcagtagct agagtaatca 1860 aagacgcaaa tgacttttgc tgttgtaggt cttgtaaagc ctatggaaac gaggatagtc 1920 cacccaccac ctcaagcggt ataagctatt ttgatctgac gcctcaagat ttcaacactt 1980 ccaaaaaaag gactcacctc gacagtccga aagaacagga accaggctgt gaccacgacg 2040 tctttagtat agatatgaag actgagaacc tcaatcccga gaagaaaacc aaactggaac 2100 ttcccgtgat tgacggatat cgatacacta caattcggac accttctcca tcgaatggcc 2160 cggatcccgt agaactagcc gaacgtgaac tagttaacgt tgagacgtgg agccacatcc 2220 tcaacaccac ttacaccagt tgggccgaca gggacgactt tgattccgtc tgcaccccag 2280 acgaagaaaa ctcaagtacc gctgccgtag agtctcgtgg tagcatcatg agtgacgctc 2340 actcaaatct caccgactac aaagacccgg gaccacaggc catcaatctc ccttcccttt 2400 catcctcgtt acatgattta tcagttgttg acatcgagtc aagttcctta tctgaacaca 2460 acggattaac tcgaccgatc ctcgatgaca atccacatct cgcctcccac gaggaggacg 2520 ggaagacctc cctaccaatc tatcctgttt gattccaaaa tattcaggtg gacgccatca 2580 tagatagcgg cgcggcggcc aactacatct caagggaaca agtcgacaag atgaaagaac 2640 tgttcccaac caaaatcgac attttgcctg tcaatcaaca aggagtgcgg ttagctaacg 2700 gcgcgaaaga atcgtgtaat caagtggcac tattcgaagc aaaaattcaa gacaaacaaa 2760 gcatggacat ccatgaattc aatctctgag cttttgtctt acccttgccc aatatctccc 2820 ttattcttgg actaccctgg ttacgcgaac agaaacccct cgtaaatttc accactggag 2880 tttacatcgt caaaaatgga ttcagagtac gcccaagaca gtatgagcct aagttgttca 2940 ctatggacaa agacaactat gaaaagttat tagaagattt ggcaagaaag accgcaccag 3000 cctgttggaa agaagaaata ggaactgtta atggtaagta taaacatcat attgacactg 3060 gcgacgctaa gcccattaag actcacggac gacctcatac gcctccagaa cacgctacca 3120 tcaaacagtt tgttgaagat agtattgctc aaggtataat tgaaccctct gactcaccat 3180 ggagttcgcc gttattactc gtcaagaaac cggatgggtc taatagggta tgtgttgact 3240 atcgtgctct gaacaaagtc acaaaaaaga acgcttatcc cttaccgcgc atcgatgatg 3300 catatttatt cttatccgga tcaaaggtct tctccacgat tgatctaaaa tcaggcttct 3360 ggcaggtacc gatggaaggt gcggacaaag aaaaaacagc attcacatgt aggttgggcc 3420 actttcagtg gcgtgttatg ccctttggtc tttgtaacgc tccggcaact tttcaggaga 3480 tgatgaactc catcttgaaa 
gaagtaattg acaagttcgc acttgtttat ttagacgatg 3540 tgataattta ttcaaagagc gaagaagaac acaagaacca cgtgaaaact gtcttcaaaa 3600 tcctacaaaa gaatggcttg gtcgtttcag ccaaaaaatg taaattcgga aaatcctctc 3660 tactattcct gggacacatc gtggacggag acggtattag aacgaacccg gaaaagattc 3720 aaaaaatcgt tgagtggccc agtaccacta atatttccca cgtacgcggg ttcttgaacc 3780 tttgcactta ctataagcgc ttcattttaa agttctcctc tgtagcatcc ccattataca 3840 aattgactga aggatcccca cgacctggga cgtcaattgt ctggggggat acagagcaac 3900 ttgctttcga caaactcaag gcctctttat ctcaaactgt ccctcttcag catccagaac 3960 ccttcaagcc gttcgtactt gacaccgaag cctccggtac gaacatagga gcgatccttc 4020 aacaagatac caacgtaaaa gtccctgaag gagtaagttt cgattataac ctatatcaaa 4080 aaaacctgag gaattcgaat ctgagaccaa tcgcatttga atcacggaaa ctttccaaaa 4140 cggaacagaa ttattctgcc caagaacgag aactgctcgc catcgtacac ggattgaaac 4200 acttcagagg attcatagaa ggttcgcctg ttctaattcg gaccgatcat gaatcgctca 4260 aatatttcaa aacacagaag catattaacc gacgtctcgc tcgatttgta gacgagatcg 4320 agttttttaa cactcagatc atctatcgtc caggaaatga tcaattagcc gccgactctc 4380 tctcaagaaa accggacacc gaatttgata cagatcctcc agaaaccgcc gcctcacttt 4440 ttgcaatcca cgaccacgaa ggacaatacc gtgagctact acaaaataaa ctacaattac 4500 aatctgggat acctccttca aaagtgggaa atggagactt tttcttggac ggtgaccgca 4560 tggtaaaaat agacctcgat caccctcacc gagaacaact cacagtccca actagccggc 4620 ctgaagccga aaggatttgt taccgcatcc atgaagacct gggacattgc aacattaaag 4680 atgtcacgag cgcagtgaaa cgacgatact ggtttcctaa cgtcgaaaac acagtcaagg 4740 aggctataga cttgtgtgaa gcctgtcagg ttcatgcacc aatttcaaag aagcaaggta 4800 tgccactaca atttctaccc cgtggtaaac cttttaggaa gtggggaatg gacttcgttg 4860 gtccactacc caagactcct aacgggaacc aatatatcat caccgccata gattatggta 4920 cagggtgggc gtatgctaaa gctcttgttt ccacatcctc agacgcagca gtctcactcg 4980 tcaaggaact cactttgaac cacggtgtgc ccgaagaaat cataaccgac aacgggtcag 5040 aatttatatc ccataatttc aaaacttacc tcaaaaagaa caatatcaag caaaagaata 5100 catcacccta tcaccctcag acaaacgggt tggacgaaag gttccatggt actttaacca 5160 atgctctccg taagttttgt agtccttaca 
atcaaaactt gtgggacaac tatctaaata 5220 cttgtttatt cgcatatcgc gcatcttttt cgcactcaat gaaagcgtca cccttttata 5280 tggcctacgg ggcagaagct aggttgcctt cagaagctgt aagcattaaa ttcgacaact 5340 cactagagaa cctagaacta atacttagac aacgtaatat agcaaccgat aaacttaaca 5400 caacgcgaca agagctcatt gatgacctca atcaaagggc caaagaaagg tcggtgacgc 5460 aagaggaatc atatacagaa cgaaactttt gacctggaga cagagtcctc cgccaatttg 5520 aaggccgacc atccaagctc catcccaagt gggatggacc attcattatc caagaggcat 5580 ctcccgatgg tacgttcaca ctaatgactt ccaacggaca tgtgcttaaa gctaaggtga 5640 acgggtgtcg tttgaaaaaa ttcaaaggat ctaccaatga attttacttc gcatctcgca 5700 gattacacga acgagataca gtggctagaa aacgaagtaa tgggacatct ggtactatgc 5760 cgtgaagcat acgagactgt caagaacaga acctcgagcg aattcgtata tgagttctta 5820 actaacatgt catccctaaa tgatagtggc atgaaagaac tagtgcgacg aaacaaaggt 5880 aaatacaaag atgtaaacta ggaagtttac ggtcttaagg aggggatggt gtgattccta 5940 aataggctat taccgttgtt gtatgcttac gcgctttctg cgcttttctt ttcccttaga 6000 gcacccacta tgggtgcggt tacgaggtga tttagcgtta ccttgacgct aaaattcaga 6060 caggtccact atggacctgt tttacgcttg cggtctgtac cgctgcgtta aatcttgcac 6120 cccgcgtgct cagatctacg caaccctacg acatgattta tcgtaggata atctgtcaac 6180 aactgtagtt ttcaaaaccc caaactacaa aacatcacaa atcattcctt caaacctcat 6240 cctcaaactt caacacctga acttcaaagt tcgaatttca aaaaccttcc tcatcacact 6300 tcaacacaaa ttaatatgtc ggttcctcca gccccagcca cttcaacagc cacttctaaa 6360 cgtggcgaca aatggactga acctgagctt gagcaacttg caatttcttg ggttgtgact 6420 agtacacgag cagaatactc aaacaatatg tcaagtgatg ctttttttga agatgtttca 6480 gtgcatttca attctcactc ggtaaccaca cgctcaggga agatgtgcaa gggccagtga 6540 gtatcatttc aatttagaaa tcaaaacatt caaacttctg acttcttcta ccttccaaat 6600 tcagatggcg tgttttgaat ccagcaaccc aaaagttctc gggtatatat gctcgtctgg 6660 agcacgatcc accatctgga catcaccgga tacctggatc gtcgatgcaa tgaagatctt 6720 tcaggctgag accaattccc cgttcaaaca tctcgctgct aggcagaagc ttcggtacga 6780 acctaaatgg gatactcgtc tgatcaatca acgcgcgcat ccccctccag caccccctgc 6840 agatcttctt ccaccttcag atgcgattgg agaagattca 
actcaatcca cttcagccaa 6900 caatcatgct cggccagatg atagtggcga gcgtccaatt ggtggaaagg ctgcaaagaa 6960 gcagcgtctt gaaagcaaaa aacaagctaa tgatgaatct gaaagaatcg aggccattaa 7020 atcatttaca gcggtggcaa gccgacgagc agaagctagc gaagaatcca atcgaattgc 7080 aaacaattta atgacaagtg aacttaaaag taaagatctc gagattgctt gaggaaaaga 7140 agatgaatgt cccgatgaga tttcaaaaag aatactacga gggctgaaag aggaagttgc 7200 tcggcattat caatagctta tacataacat ttcatcatct cgtattcatt taaaattgat 7260 acaaatctta ttcaccaaac acaagttgat actcgtaaag taggtacaga ggttattcat 7320 caaacaaaag aaacatctca attcatcaaa caaaagaaac atctcaatcc gaatatctca 7380 aaaatcaatc gatttccttg tattcattta aaatacatat gtacaaatct aatatgtaaa 7440 gccaaatcgt tatagtaaaa gttttaaatg tatagcagtt gtttgcaaat gttgacatga 7500 tagcaaatgt aaaattgatt tctaataatt ctatttcatt cacccgcttc cttcagccag 7560 agatccattg tgagatcgcg ctgtagctct tgatgagtgt tggaatttct cagtaggaga 7620 gtattttgaa gatacgccat aagagtattt tgtactgctg gatctgccag tttcggtacg 7680 atcctggttt gagaagtgta actctcgatt ataggatcga gtgatcctcg gtcttgaata 7740 atcatattgt gtagtactag acagcacatc atagccgagt gcatatctac taaataccac 7800 atacgacatg acattcgagt gattgcgaaa caaatctgaa tgacaccaaa agccctttca 7860 atatccttcc tagctccctc ttgagctgaa gcaaagagcc tggccgctgg agcttggcga 7920 tctcgtggtg ctttgacgat ggtcggccag tcgggataga ttccatcacc taaataatag 7980 gcctgcttat agagatgtcc attgacggtg aaatggactt catcttcatt tcctagatga 8040 ttcaagaaga tttgaatctc agttcatgat tttggaataa acatgaactg gatttgaaat 8100 aaacttacct ttgaggaagt cagtgaaaag aaacaaccgg tttaatacgt tgatatcgtt 8160 caacgaaccc ggcgttccga agaaagcatg ccagatcgtg agatcttggg atgcaactgc 8220 ttcgaggatg accgtagggg tgtgttcttt tccttcgaac tgcccagccc atgcaagtgg 8280 gcagttcttc caagcccagt gcatgccatc aaggctaccc ttacaaccag gaagcccacg 8340 gatctcgtat tctttcatga ttcgagtgag ctcctcggga tttggaggtc aaagataatc 8400 gggaccgtat atttcattta tgtgtttcat gaatagcttc agacagagta tagcagtagt 8460 tttgccaata ccgacgtagt cgtccgtcgc atcagctgga attcccaggg tgagctgttt 8520 caaagcacac acaaccttct gatataccgt gagtcccatc actccacatg 
catcaggttt 8580 ttgaagccaa aactgatgat gtttctgcaa atctttgatg attctcaagc agagctcttt 8640 gctcattcga aattgacgtt cgaattttga tttgtatctg gcattcggtg cgaagtagtc 8700 attgtaaagt tgctgaccgc gagcaacacg gtctctatct atacgtatac gagtatgagg 8760 aaggggtggc ggtggtggga gctcattgaa gaggcagtcg aagaccaact ggtcatagta 8820 ctgctgctca ttgtcgtcgt ttgaagactt caaaagccgt tcgatttgtt ttttcaagga 8880 gctcggagca ggcatttggt gatttttttt ggatgaaatc tatgaaagat aaatcatttc 8940 atttcaaaat tagaccactg tggatctggt cgatccacag tgatttatct ttggtaaagc 9000 agacgttaca tcatcccata gtggaattca tgcgtgtaac aatttagttg ttacacgcat 9060 catcaaattt ggcgttcagg aacgcaaagt ctacgataaa tcacgcccac catagtgggt 9120 gctcttagag cacccactat ggttgcgtta actcggtgat ttagcgttac cctaacgcta 9180 aaataccaac aggtccacta tggacctgtt ttagacctgc ggtccgtacg gctgcgtcaa 9240 gtcttgcact tcgtgtgttc agatttacgc aagcctacga cttgatttat cgtaggaact 9300 agggaaagca gacgataaat catcccatag tggatttcac gcggttatca actttggtaa 9360 aacacgcatg atcaaacttc gcgctcagga acgcaaactt tacgataaaa cactcccacc 9420 atagtgggtg ctcttag 9437 // ID copia-2-I_AF repbase; DNA; FNG; 3517 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 08-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Internal portion of copia-2_AF LTR retrotransposon - a consensus DE sequence. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; COPIA superfamily; copia-2_AF; KW copia-2-LTR_AF; copia-2-I_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-3517 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-3517 RA Kapitonov V.V. 
and Jurka J.; RT "copia-2_AF, a family of copia LTR retrotransposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 53-53 (2006). XX DR [2] (Consensus) XX CC It is an internal portion of the Copia-2_AF LTR retrotransposon. CC Its ORF coding for a Copia-like polyprotein is corrupted by a few CC stop-codons. XX SQ Sequence 3517 BP; 1353 A; 553 C; 534 G; 1077 T; 0 other; ggttataagc ccggccttgc ttaaaggtaa tttattatac ctaggcctag cctaggcatc 60 tatactatta attatttatt aattaatatc tttccatacc tatatactag gcttatgcct 120 agccttgatt aattatcaat taatattctg agcctataga agttatattt aatatagcat 180 ctatagataa gattagccta ctaaagctac aaggctcaga taactacata acctagtcta 240 ttaggactaa ggctaccctt ataaagaaag atctatcctc tactatagag gaagatatct 300 ctagcaagaa aaatgataag gcactagtaa ttattaaact tctatataaa gatggacccc 360 ttcttcatat aaaggaaata gctagggcta aagaggcctg gataaggctt taagatcttt 420 ataatcctaa gggatttact actaaatatc ttatcttaaa ggatttcttt aataccacat 480 tggatgattt ctaatcaatg gaggaatacc ttaataaggt taaggcatta gtagatgatc 540 taaaaggaaa agatattatt ctgcctaacc aggttatcat agcctagatg ctctaggcaa 600 tgaatataaa gggtttatat ctaatataac acaggccctt aggaatgacc ctaaagccta 660 tactattaaa tccctatttt ctagccttgc agatgaggca caaggaaggg aaaatagtta 720 taataatagc cataagctac tatatactaa ggctaagggt aaatcctata aaagacaggc 780 cttaggcaga tactgctaac attgcaaact tactagtcat aatactagta actactattt 840 cttattccct aataaagcac ctaaagggtg gaaaaaggga aataaaaata ataagatgaa 900 ttaaaaggat ataaaggttt aaaagaacta aaagaattag actagactaa aggataagga 960 tgctctcaca gcagttctat ctagtctaga tcctaatgac ttatataatt cctcaaatgt 1020 aaatctttat ttaaatcctt aatcaaatac caattcaaat catgattcat ctaatagtca 1080 tgaatatgat tatttagatt aagatatgac taagagtcct gaaatatata cccttcaggc 1140 tacagaccct gatccattac taaatataaa tgatatccct gataaccagc tatttgatat 1200 agctggggat gaggtatatc tacctcttat aacccctaat actagggcag atactaatat 1260 taataatagt gaactagatc atctagttac taataatcac ttattacctt aagaaggatt 1320 aaaagtggat tttattatta atagtgctgc tactataaac accattcata 
gaatagacta 1380 tttctttaga tataaagaaa taaataaaat aatttcatgg ggcaaggcta aaagcctaat 1440 agctaaatat cagggtgata tccttataaa atacccttta gggtatataa atataataaa 1500 ggatgtttat tatatccctg aattagggat taatcttatt agtatggata gaatatctaa 1560 ggcattaggg ataactacta tctttactaa agataaagta tccctatata aaaataacca 1620 gtctattata gctggatata aaaataatag cctctatcat attctattta ctatccttta 1680 tcctaagaaa tatattaatg cctgcctaaa taacttagaa ctctctgaat ttcataaata 1740 gcatatgaga ttaggacata taaatgctat acccctagga atgctactta gcactatggg 1800 aatagctata tctagtcatg aactagaaga atataaaaga aataaatgtc ctacctgcat 1860 ccttgctaag gataaaaggt atattaataa agaatccttt aataaaaagg aatataatgt 1920 cctagaaagg atccatagtg atataggagg ccccttatca cccacctata ataattatag 1980 gtattatata accttcctag ataagaaatc taggtattta tggctattcc tactaaggca 2040 taaaaataag gctttttaga cctttttaaa ttatgctaaa aaggcagaga ataataataa 2100 taagaaaaga ataagagaat tcttttcaga taataggcat gaatatacta ataaatgctt 2160 tcaaaaggca ttgaataact atggtattat ttataatact acccctatct ataccaaaga 2220 gcctaatggc ctaatagaaa ggataaactc aactttactt agtaaagtta gatcccttct 2280 aataatggcg aatgctccta aatatctatg gggtgagaca ctactagcta gtgcctatct 2340 atataataga accccctata gtgccttagc attcaagaca ccttataagg tattttataa 2400 ggaaaagcca tatatctaga atattagatc atggggatct attacatact accgtagtaa 2460 tataaataag ctaagctatc ccctagaaag gagaaagcta ttattattag ctatagctaa 2520 tataatcact ataaactatg ggatctgaat aaaaataagg ctatatagtc tagggatgtt 2580 actatccttg aaaataagtt cttagaccct cagcctattc aaaaatcctt aatatctaag 2640 gatctaaatt aatatgatta tattaatata aaggctaata atagccttat aaaggagtct 2700 agctcagcta cccctaataa gaataataat tctattaggg agtctagaga agctagccct 2760 gattaaaata gactaataac taggtctaaa ataagggata tatcaactat acctaattat 2820 aggattgaaa ttcaaattcc ctagagaaat ataaatataa acttctctgt agtaggagaa 2880 gacagggccc tattcactat tagtaatacc atgctaagga attatataaa ttccttaaat 2940 aagaagggag aggtagcaga cttccttcta tctactacta ctataaataa gcctaatacc 3000 ttccctaagg caatgaatag ccctaattct aaggaatggc taagggcatg cctagaggaa 
3060 gttaataagc tagaaaagta acatacctat gatatagttg atcttcctaa ggggaaaact 3120 gccctaaggg gtagatgggt ttttaaggaa aaacctataa ataatactac tactataact 3180 agtagctata taactaatag taaaaaaact attagatata aagctagatg ggtgatccag 3240 ggtttctatc aaaggctagg aattaatttc ctagaaacct ttagcactac ctacagaact 3300 gaaacctagc acctaatcct aataatagct attaataaag gatagcatat catgtaatat 3360 aatattaaaa atacttttat gcatgctaat attaatgctg atatctacac cagctaggaa 3420 aagaaactag tgttttttcc tttttataaa gaaaggtttt tgcactagtg aaacgaatat 3480 ttagtctaaa tattcctata tttagcttaa ggggggg 3517 // ID Gypsy-31_MLP-LTR repbase; DNA; FNG; 586 BP. XX AC AECX01000183; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-31_MLP_; KW Gypsy-31_MLP-I; Gypsy-31_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-586 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000183; Positions 9766 9181. 
XX SQ Sequence 586 BP; 167 A; 146 C; 86 G; 187 T; 0 other; tgtagcaggg ctacagtctc gtgtatgacc taataccgaa agggactgtc ccagcctgta 60 tacagtatct gtaaccagga taactgtatc caaaccaggc tagttcctta agccatcttg 120 taatattgcc attgccacgg aatggcaatc cttgtaatac ggcctaggca gtaaactagc 180 ttgtaaccga tatgtctatt gtataaataa gagggttttc ttcgaactgt gaagaaaacc 240 ccagaccatt attcattctt atttctttta tacaacttgt acacgttatt acagatccca 300 attaaaaccg ttacaattat cttcttataa actctttatc agctacatta agcaataaac 360 ctttattcga actaccattc ttcttttcgt ctctatatcc tccatttgcc aaacttaccg 420 tttcatcgca taaagcccgt ctcgactaaa tcctcttagc ctcgacgcat acttcccctt 480 gaaacctttg acaactttgg attgtaccat ttccctatct tgcctgaaac ctagattact 540 taggtctaac aacaacttac gtgggttata ggtcaacccc gttaca 586 // ID Copia-6_LBS-LTR repbase; DNA; FNG; 228 BP. XX AC ABFE01001950; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_LBS_; KW Copia-6_LBS-I; Copia-6_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-228 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001950; Positions 31416 31643. XX SQ Sequence 228 BP; 44 A; 62 C; 43 G; 79 T; 0 other; tgacgaacgt caagctcttt ctgtacgttt cgttatctct cgtagcacgt gatccggtag 60 atgtaggaca attttcatct tcctggagtt cacctttact ctcgtgaact ccaggtgcgt 120 agtctacatt atatcttttt ctactactta caatatatac ctttactctc gtgaactcca 180 gtgcctgccc tcggccgttc cactcgcatt cggattgtgc gtgcgtca 228 // ID PYGGY_LTR repbase; DNA; FNG; 224 BP. XX AC AF533703; XX DT 03-JUN-2005 (Rel. 10.05, Created) DT 09-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE P. graminea LTR retrotransposon PYGGY - LTR sequence. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; gag; pol; PYGGY_LTR. XX OS Pyrenophora graminea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Pleosporaceae; Pyrenophora. XX RN [1] RP 1-224 RA Taylor E.J., Konstantinova P., Leigh F., Bates J.A. and Lee D.; RT "Gypsy-like retrotransposons in Pyrenophora: an abundant and RT informative class of molecular markers."; RL Genome 47(3), 519-525 (2004). XX RN [2] RP 1-224 RA Gentles A. and Jurka J.; RT "LTR portion of P. graminea LTR retrotransposon."; RL Direct Submission to Repbase Update (03-JUN-2005). XX DR Genbank; AF533703; Positions 1 224. XX SQ Sequence 224 BP; 56 A; 67 C; 54 G; 47 T; 0 other; tgtaatgggc tagagcccag taggtaaatc ctcgatggga tctcagtcac gtgcgtagac 60 gcacgctagg gccatacccc taactagcgg tcaaaggata acccgagctg cgcacagacg 120 cgcactctga gtgaccttcg agcgggatag cccatagctc tcttctttat cgaatacaac 180 tccagtatac ctgtgcaata gctccccgag cgtagcccct taca 224 // ID Gypsy-17_MLP-LTR repbase; DNA; FNG; 1034 BP. XX AC AECX01002074; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_MLP_; KW Gypsy-17_MLP-I; Gypsy-17_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-1034 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002074; Positions 37468 36435. 
XX SQ Sequence 1034 BP; 356 A; 237 C; 129 G; 312 T; 0 other; tgtcagctat gaggaactta gtcacactaa aattcccaaa taactaaacc acttaacctt 60 gacaataatt ataattacaa aaatctataa aaatatatat atcacatctc aacactcgag 120 tttcaagagt gaacaattac gtcacattac aactattcga aacactggcc aggatccaat 180 aataattgaa ccctgaccag gcgaactcaa ccccacaacc accagaccag ttaaaacact 240 acattaacta acctggaaga tcacaacgaa aacgatttca ggaaaaatca agaaccctta 300 attttaccaa accagcttag gcaaattaga agattattac gccacgttag aaacccagct 360 cctatataat ccatagattt ctcctaagaa agaagagacc acactaaaaa acataaccac 420 caattcctac ttgaattact caaagctctt atttgataaa cgaaactcgt tttactttta 480 acagaaacca agccgcatta aaacttaaat tttagacaat agcttttcta aagacttact 540 tcttcaaaaa ctagctctct gtctcacttt tatcgttcca tttcttttgt ttcttaaaag 600 aaaactccag ttacgtaaat cttcagaact gtatcttcat tcaatttttc ctgattttca 660 ccctatcgct tgaggacgaa ggatttaaat ctttacctat taggatctta tcagtaagac 720 ctagtggaag gcgtttagtt gatttagatc tctaaaatag ctaccttttt tggatcccaa 780 aactctacga ggagttgtcc aagtaagcga gtttacacct cacttaagat actttttcct 840 gattgattag aactactact aatcaacttc tgttcaattg aaacagaatt ccttacttgg 900 ctagaactta tagtagccca actgctacca caaagcccgt gcgtattcac cttatcatct 960 cagaatttgt ctttcgagtc acttttcatc ttgagcttat cccgttagtc tctacttgga 1020 atctcaagct gaca 1034 // ID LTR15_CN repbase; DNA; FNG; 515 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 30-MAR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR - consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW LTR15_CN. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-515 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). 
XX RN [2] RP 1-515 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-515 RA Gentles A. and Jurka J.; RT "C. neoformans LTR sequence LTR15."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX CC Average similarity to consensus 92%. XX SQ Sequence 515 BP; 163 A; 111 C; 121 G; 114 T; 6 other; tgtagtgcaa tgagccttcg gcctcataga ctaatgccac atcacagtcg gaagctaccg 60 ccggtgacag tgaggagaga tagagaagag agagagatgg nttgagatat atcggagggg 120 atcagggaga agnaggagag agaacnaagt agagcctgtg acagtaacta ccccttccat 180 gggtagctgc caaacagctt acagagattc naagtacata ggtacataac tgttagttac 240 gctcatggag cctatattgc tatattattt gatacacaca caacacacaa caactcgaca 300 tgagacccga gttcaattat accactacac aacctaacga cctatcgtca gtatcantac 360 taggtagatc gaccaccaac agccaaactt aggaaggctt tgtattccgt agaattnata 420 ccaaggagtc gatattctgt tatctgctgt gggtggcagt ggaacggaac caactggatc 480 gtttaaccaa taggagaccc gggtatttcc ctaca 515 // ID GYARLI1_LTR repbase; DNA; FNG; 276 BP. XX AC CR382131; XX DT 01-SEP-2005 (Rel. 10.08, Created) DT 01-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE GYARLI1: Gypsy-type element from Yarrowia lipolytica (LTR DE portion). XX KW Gypsy; LTR Retrotransposon; Transposable Element; GYARLI1_LTR. XX OS Yarrowia lipolytica OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Dipodascaceae; Yarrowia. XX RN [1] RA Kovalchuk A., Senam S., Mauersberger S. and Barth G.; RT "Tyl6, a novel Ty3/gypsy group retrotransposon from the genome of RT the dimorphic fungus Yarrowia lipolytica."; RL Direct Submission to EMBL/GenBank/DDBJ (25-JUN-2004). XX RN [2] RP 1-276 RA Jurka J.; RT "GYARLI1: Identification of LTRs."; RL Direct Submission to Repbase Update (01-SEP-2005). 
XX DR EMBL/GenBank/DDBJ; CR382131; Positions 1723614 1723889. XX CC LTRs are identical. Protein homology to Ty3/Gypsy was identified CC in ref. 1. XX SQ Sequence 276 BP; 80 A; 46 C; 80 G; 70 T; 0 other; tgtaatgatt cggagacact cataagatgt cagtgggagg cacgtgacat agagactagg 60 gttacgtgga aggcacgtgc ctgtgtgact gcgagtgccc aaataggaca tgcggggtac 120 tcgcctttgg aggtataaga aggattattg tatatgatta attgattagc ttattattga 180 agttcgacga gaccagtaga tccaggagcg cagtattgag actagcagcc ttggatagag 240 tattacgaca agagatagta cgcccgatct gttaca 276 // ID I-1_AF repbase; DNA; FNG; 5822 BP. XX AC . XX DT 02-MAR-2006 (Rel. 11.03, Created) DT 02-MAR-2006 (Rel. 11.03, Last updated, Version 1) XX DE A family of I non-LTR retrotransposons - a consensus sequence. XX KW I; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; I-1_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-5822 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-5822 RA Kapitonov V.V. and Jurka J.; RT "I-1_AF, a family of I non-LTR retrotransposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(3), 124-124 (2006). XX DR [2] (Consensus) XX CC This is a non-LTR retrotransposon from the I clade. I-1_AF CC elements are flanked by ~14-bp target site duplications. Several CC copies are inserted randomly. Several copies are inserted CC precisely at the same target site in Afut2-LTR. I-1_AF encodes CC the 497-aa I-1_AF1p DNA/RNA-binding protein (pos. 171-1660) and CC 1273-aa I-1_AF2p polyprotein (pos. 1661-5479) composed of the CC endonuclease (aa pos. 12-244), reverse transcriptase (aa pos. CC 490-750), and RNAse H (aa pos. 971-1115). 
XX FH Key Location/Qualifiers FT CDS 171..1660 FT /product="I-1_AF1p" FT /translation="MARHNNQPGGPLQAALQETTTSATARALEGQKIFSPI FT AAFLDDHRRQNAGLTPRQLGALTALSNDLANIAQQHFNAYISGVPLTNAPL FT PLTPAPAPAHGPASFPPSPPPSRPPSGLAQSTYATVTRSPPARNNAATKKT FT YSNTKTDRLATKLPPPDNRLFVRLPENHLAKSMDAFAIHTSLRSHLGTNGK FT LLKGVQSIKTGLALLPVSTEALPALEAQKEAIAAFFKDCQIERSSRWISYR FT VTNVPRKVGRLTGSQYSMIPVNPEILSSEITEAIGLNPVSITETATSAANP FT NTISSSWFINFPEDSKANLPVRLSLFGMITNAQLLVRKTKIVQCNRCWKWH FT NARSCARQPRCRLCGSTQHTEEGHSNRCSTQPPHTCPPRCLHCHGPHPADY FT EQCLLRPSKAGTRCTKAQQAEIRKTSSLELAKARLERQCCMQVPEPSQDQI FT MAMNSRPSPFPFRTATPPPQEPLDSPPVTARAVRFVSPKPQNSFEVLMNEQ FT X" FT CDS 1661..5479 FT /product="I-1_AF2p" FT /translation="MKYQNHPRKESIAILQLNVGRIPEAHEIALSQAYSNH FT IDIILIQEPYIYRDLTRKITKRHPSYECFSPTDSWEISGIPRVLTYIRRKN FT GIRASQLRPDSISQETLSDLLLLQISSCSGQSALIFNIYNAPAGAKRAGEA FT ARTLTSLLESYFSLPTLLAGDFNLHHHRWQPSLQHAPTTFAESFTEWLDRL FT GLLLTSEIDIPTHDKGNVLDLAFVSSSPTLIGASTRVAHHLDATSDHRPLL FT TNLPWEQGPVETPQRLRFDTLDHTRFLTLLASNIANVRSSAENEDELDYLA FT NGITAAIHSSYTASANKSIPQGGGQRWWNTTCKEALQDYRAGLCTQKDFRR FT VTRRAREQYWRDKISAVTESKEVFDISRWHKSTGSYRSPPLKDPLRPDSPP FT AVALQEKRDIISRNLLQNTAEAGDIPLDTPAVPSSSLPFPDVTMAQVEKSV FT LQAGNTTPGADELPTGIIKVAWPLIKDRVLALYQGCLRTGYHPKCFRHAVL FT AIIQKPNKSNWSSPRSYRPIALLSVLGKGLERLIARNMAWIAIQYRVLASQ FT QFGALPLRSATDLTTSLLHDAEQALNQRLTASLLTFDVKGAFDGVLPGRLI FT HRLRSQGWPDNLARWVASFVTGRVVQIRIDGEIGPATEILCGLPQGSPVSP FT ILFMLYLAPLFWLGVPKSRFGYADDGAILAISPSIEANCQSLSNSLQEALD FT WAATEGITFAPDKYELIHFSRRKADQDPSHTPSIVAGPVTVSENTTRPYLR FT WLGVLFDKKLSFKYHVRETTSKALTVANALRGLGNTVRGIKPYLMRQAVIA FT CVLRKAYFGAETWWPGRSRPCPRAGSISNQVQGQLDKIAKVIQTGARAILP FT IYRTTPLPALYRESGLLPAEIELDHIATSAAIRVRRLDPYHPLRRRAAKIA FT QTGWATSRFARRVLALPESEQINPLQYPPWLPREARADAQLRIKAPNGLSK FT EQAATNFQDFYSSLPNTDMKVFTDGSKLPNGMAGAGFALYQTGRLCLQSSF FT SLGLNKELFDAEAEAALAGIKAAMQYHTARFATNLWVCLDNLEVATRLLSP FT FAGSSQEIFETVRTLASTWLERERFPYTDGGSVRICWVPGHAKVPGNEAAD FT QAAKEGAALEPPISQKHSFASLKRQTKAAMISGLQKHWQAVAPQPYQDLCI FT TSSPRCPDELRLPRHLLGRILAARTGHGDFADYHERFNHEDAHLHCRCGAR FT 
KSPVHFFFCRIVKRRLPQPLGPPSEMIPFLLGTAKGAAKLAAWLSQTRFFD FT DICPRRPLPEHA" XX SQ Sequence 5822 BP; 1506 A; 1820 C; 1288 G; 1208 T; 0 other; gagctacgcg acggacgcac agccttctca cccaggaagg ccatactcta ttgaattctc 60 agatcaggct ctcaatccag gcgccctgcc taggataagg gacagctccc cctccgctca 120 ctgcctgaca cccgcccagc aaccaccacg cgtctcgacg cgtcaataac atggcacgac 180 ataacaacca acctggtggc cctctacagg ctgcactaca ggaaacaacc acctcagcca 240 ctgctagagc tctcgaggga cagaaaatct ttagtcccat agcagccttc cttgatgatc 300 accgacgcca gaatgctggc ctcacgcccc gccagctagg agcgctcacg gcccttagca 360 atgacctggc taatatagcc caacaacact tcaatgccta catcagcggc gtgcccttga 420 ccaatgcccc cctccccctc acccctgccc ctgcccctgc ccatggccct gcttccttcc 480 ccccctcacc tcccccttct cgtcccccct ccggtcttgc ccagtccaca tacgcgactg 540 tcaccagaag tcccccagct aggaacaatg ctgccacaaa gaaaacatac agtaatacta 600 agactgatag gttagctaca aagctgccac ctcctgacaa ccgactcttt gtgcgccttc 660 cggagaacca tcttgccaag agcatggatg catttgctat acatactagc ctgcgatccc 720 atctgggcac taatggaaag cttctcaagg gcgtccaatc tatcaaaact ggactcgctc 780 ttctgccagt ctctactgaa gccctcccag ccctagaggc ccaaaaggaa gccatagccg 840 ccttctttaa agactgccag atcgaacgga gttcccgctg gatctcttat cgagtgacca 900 atgtacccag gaaggttggc cgacttactg gcagccaata ctccatgatc cccgttaacc 960 ctgaaatctt gtcttcagag atcactgaag ctataggcct taacccagtc tctattactg 1020 agactgctac aagtgccgct aaccccaaca ctatctcgtc cagctggttt atcaacttcc 1080 cggaggacag caaggccaac cttccagtac gactctcctt gtttggcatg atcaccaatg 1140 ctcagctgct agtaagaaag acaaagattg tacagtgcaa ccgctgctgg aagtggcata 1200 acgctagatc ctgcgctcgc cagccccggt gtcgactttg cggctctacc cagcatactg 1260 aggagggcca tagcaataga tgcagtaccc aacctcccca cacctgccct ccaaggtgcc 1320 tacactgcca tggcccccac ccggctgact atgagcaatg cctcctgcga cctagcaagg 1380 caggcactcg atgcacaaag gcccagcaag ccgagattcg caagactagc tccctagagc 1440 ttgccaaggc acgcctagag agacagtgct gcatgcaagt cccagagcct tcccaggacc 1500 aaatcatggc tatgaacagc cgcccttcac ccttcccctt ccgcacggct acccccccgc 1560 cacaggagcc tttagactct cccccggtca 
cagccagagc agtccgcttc gtatctccta 1620 agccacaaaa cagctttgaa gttctgatga acgaacaaat atgaagtacc aaaatcaccc 1680 aaggaaggag tctatcgcta tactccagct gaacgttggc cgtataccag aagcccatga 1740 aattgccctc tcccaggcat actccaacca catcgacatt atccttatac aggagcccta 1800 catttacaga gacctgactc gaaagatcac aaaaaggcac ccatcttacg agtgcttctc 1860 cccaactgat tcttgggaga taagtggcat ccctcgagtt cttacctaca ttcgtcggaa 1920 aaatggcatc cgagcctccc aactccgccc tgactcaata agccaagaga ccctctcaga 1980 ccttcttcta ctccagatct cctcgtgctc aggacaatct gccttgatat ttaatatata 2040 taatgctcct gctggcgcaa aacgggcggg tgaagcagcg agaactctta cttccttgct 2100 ggagtcctac ttctctctgc ctaccctgct tgctggtgac ttcaacctac accaccacag 2160 atggcagccc tcactccagc acgcacccac taccttcgca gaatcattca cagaatggct 2220 tgacaggctt ggactactgc ttacctctga gattgatatc cctacacatg acaaaggcaa 2280 cgtcctggac cttgcctttg tctctagctc cccaaccctc ataggagcca gtaccagggt 2340 tgcacatcat ctagacgcca cctcggatca tcgcccactt cttacgaacc tgccgtggga 2400 acagggacct gtggaaactc cgcaaagact aagatttgac acactagacc acacccgctt 2460 cctcacactt ctcgcctcta atatagccaa cgtcaggagc tcagcagaga atgaagatga 2520 actagactat cttgcaaatg gaattactgc agccatccac agctcataca cagcctctgc 2580 aaacaagtca ataccccaag ggggaggcca acgatggtgg aataccacgt gcaaggaggc 2640 attacaagac taccgggcag gactctgcac acagaaagac ttccggcggg taacgcggcg 2700 agcccgagaa cagtattgga gagacaagat cagcgcggta accgagagca aggaggtgtt 2760 tgatatatca agatggcata agtcaacagg ctcctaccgc agccccccac ttaaggaccc 2820 tttgagaccg gacagccccc cagcggtagc cttacaagag aagcgggata tcatatcccg 2880 gaaccttctt caaaataccg ctgaagcagg agacatcccg ctggatacgc cagcagtccc 2940 atctagctca ctgccattcc ctgatgtaac catggcccaa gtcgagaagt cagtcctaca 3000 agcagggaat acaacccctg gggctgacga gctacccact ggtataatca aggtggcatg 3060 gcccctgatc aaagacagag tccttgcact ctatcagggt tgccttcgaa ctggctatca 3120 tccaaagtgc tttcgacacg cagttctagc aataatccaa aaaccaaata agtctaactg 3180 gtctagccct aggtcatata ggcctattgc cctcctgtca gtactgggca aaggcctcga 3240 gagactaatt gcccggaaca tggcatggat tgctatccag 
tacagagtcc tggctagtca 3300 acagtttggg gccctgcctc tccgctcagc aactgatctg actacatcct tgctccatga 3360 tgcagaacag gcccttaacc aaaggctaac agcctcactc ctcacctttg atgtaaaggg 3420 agcctttgat ggagtccttc ctgggagact tatccatcga ctgcgctcac agggctggcc 3480 tgataaccta gctcgctggg tagcctcctt tgtcacaggg cgagtggtac aaatccgcat 3540 tgacggtgaa ataggaccag caacagaaat actctgcggc ctgccacaag gttcaccagt 3600 atcccctatt ctatttatgc tctatctagc ccccttattc tggcttggag ttccaaagtc 3660 caggtttgga tacgcggacg atggagctat cctggcaatc tcaccgtcta ttgaggcgaa 3720 ctgccaaagc ctatccaact ccctacaaga agcccttgac tgggccgcta cggaggggat 3780 caccttcgcg ccagataaat atgaactgat ccatttctca cgccgcaagg ctgaccagga 3840 tccaagccac acaccaagca tcgtagcagg cccggtcaca gtctccgaga atacaactcg 3900 cccttacctg cgttggctgg gggtcctctt tgacaagaag ctcagcttca agtaccatgt 3960 cagggagaca acctcaaagg ctctgacagt tgcaaatgcc ctccgtggcc tcgggaacac 4020 agtccgggga atcaagccct atctcatgcg acaggcggtg atagcctgtg tgctccggaa 4080 agcctacttt ggcgcagaaa cctggtggcc aggtcgctcg cgcccctgcc cccgtgcagg 4140 ctctatctca aaccaggtcc aggggcaact tgataagata gctaaggtta tacagacagg 4200 tgctagagca attctcccta tctaccgcac aactcccttg cctgctttat accgggaatc 4260 agggctccta ccagcagaaa ttgagcttga tcatatagct acatcggccg cgatccgtgt 4320 tcgtcgccta gacccttacc atcccctgcg caggagggca gcaaaaattg cacagactgg 4380 ctgggcaact agccgttttg cacgccgagt actagcccta ccagaatcag agcagataaa 4440 cccgctgcaa tatccccctt ggctcccaag ggaggcacgg gcggatgctc aactgcgaat 4500 aaaggcacct aacggactat ccaaggagca ggcagctacc aacttccaag acttttactc 4560 ctctctccca aacactgata tgaaagtctt tacagacggg tctaaattgc ctaacggaat 4620 ggcaggtgct ggctttgctc tataccagac aggaagacta tgtctccaat catcgttttc 4680 tctcgggcta aataaagagc tttttgatgc cgaagcagaa gcagctctcg ctggtataaa 4740 agcagccatg cagtatcata ctgcacgctt tgctactaac ctctgggtct gcctagataa 4800 cctggaagtg gccactcgcc tgctctcgcc ctttgcaggc tcatcccaag aaattttcga 4860 gaccgtccga accctggcct cgacctggct tgagcgggag aggttcccct acactgacgg 4920 gggctccgta cgtatctgct gggtccctgg gcatgcgaaa gtccctggga 
atgaagcagc 4980 tgatcaagcc gcgaaagagg gagctgcctt agaaccccct atctcccaaa aacattcttt 5040 tgcctcgctt aagcgtcaaa caaaggctgc catgatctct ggcctacaga agcactggca 5100 agccgtggcc ccgcagccct accaagatct ttgcatcacc tcctccccca ggtgccctga 5160 tgagctacgc ctcccgcgcc accttctcgg gcggatccta gcagctcgta cagggcatgg 5220 tgattttgca gactatcatg agcggttcaa ccatgaagac gcccatcttc attgccggtg 5280 cggagccagg aagtctccag tgcacttctt cttctgccga atagttaaga gacgcctacc 5340 ccaaccccta gggcccccct ccgagatgat tcccttcctt ctgggaactg caaaaggtgc 5400 agcaaagctg gctgcctggc tatcacagac tcggttcttt gatgatatct gtccaagacg 5460 gcccctgcct gagcacgcct gagacaatac aatccttctt taaaaaaaaa aaaaaaaaaa 5520 aaaaaaaaaa aaaaaaaaga ggagggggga aagcagagac gcggagaatt ggcggatcat 5580 agtgcggaga tggatccatg cagctggcat cgcaatttaa cctactatag cagaccttcc 5640 ccatgactga ggcagctatg cgaacgcgcc aaagtctctt gtttgtacat agaactagca 5700 atagttagag acacccaggg ctcaccccct ggcggccttt gccccgattg gggatagtgg 5760 aaaacgtgca agtcacgagc cacggccgta aaatacatac acacacacac acacacacac 5820 ac 5822 // ID hAT-1_Cglob repbase; DNA; FNG; 3031 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW hAT; DNA transposon; Transposable Element; hAT-1_Cglob. XX OS Chaetomium globosum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Chaetomiaceae; OC Chaetomium. XX RN [1] RP 1-3031 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). 
XX DR [1] (Consensus) XX SQ Sequence 3031 BP; 747 A; 788 C; 812 G; 684 T; 0 other; cagcgctgct caaagatcag attgtagatc agatctatca atccagagca gtccagtcga 60 ttcagatcgg ccatttcttc agatcagtcg gatcagatga gcaccttaca aatctgataa 120 tctgattgtt atctatacca tacttttggc ccttggtcaa ggcaccgaat cccttcattt 180 tgagtgttaa ttatagagag gtggatgaat gtgtcgggta ttactttttc cgggccagaa 240 caagccctgt ctagcgctaa attgcggatt ccgcgcgtcg ctctgtccac tgtttgtacg 300 cgcgcgcgtg tgtgtgacga ccactgaggc tgatccgatt cgccagttaa agttaaccca 360 acctcgcaat gtcgactgct gcacagagca gctacgcagc ctctactgca gacaatgcca 420 acaacgttgc agcagtcttc ctcccccgcc ctcgatctcg actgcgcacc atcgcgaagc 480 agtacgacga agctgcgcag caaaccgtca cacataccga cgagccatac aagatcgttt 540 gggggagccg aactttcgtc attagcggtg ttctaaacaa gaagaagagc cgtggccgtc 600 gttcctggat tactcgtgag ggttggttcc taacggagct gacgacaaca ggaaaggtca 660 agcaggatgt ctggtgctgt cgacgctgcg atgcctacgg caagccccag tttttcagcg 720 ctcaatcaac ctcggcctcg cagggacacc ttctaaggct agcatagccc ttgccgttgt 780 tctaatcacg ttgctaacgg ttacaggtcc cacaacatcg ccgaggacag ccccagccgg 840 ccagacaacg agatgaccgt ccttgacatc cagcgagagg ctagcaaccg atcgacccct 900 tccagcattc cacaagcgaa gatcagtcgg gcccgggagc ttctagtagg ctatattgtc 960 gacggcgact tgccctttac tgctcttgag agcgtctaca tcaaggagct cttaaagcag 1020 ctagaccctg gattcgccca agagctccct cacagtagat cgacaattgg aagggatatc 1080 aaggaggttt ttgagtcgaa gatgatggcg gtcaaggctg acctgacgaa agccctcagt 1140 cagattcata ttagcttcga cctttggacc tctccaaaca acctagccat gatttcagtc 1200 tttgggcact tcatcaatca gaagaaaatg ccccagagca ggttgctagc cttccgtcgg 1260 cagctaggga agcattctgg gaattacatc gcgctcacga tccaggatat tctcaaggct 1320 tggggaattg ggaagcaggt gggtgtctgc gtcgccgaca atgcagggaa caacgacaca 1380 tgcctcaagg ccctatatcg atccctcgat cccaccataa ccgaccgaga tatcaaggcg 1440 cgtcgaatgc gatgcttcgg tcatgtcctc aatcttgttg ttcaggcttt cctttttggc 1500 caggatgcta cctgcttcga gagggatgct tatacccttt ccctttggag tgacgacgag 1560 gctgaactag cacattggag agcaaaggga cctgttggaa agcttcacaa tatcgtcaag 1620 ttcatcaggg 
cgtcgccgca acgatccgag gcattccgac agcatgctaa ggaagcccag 1680 acctctgacg actatctgtt gtcagaggag ccgacctggg accttggcct caagcaggac 1740 aacagcacgc gctggaactc aacttacctg atgatcgaga gggctgtcag gaagagggat 1800 gatatcaaca gctttatctt ggagcttgat cttgaatctg atggtgacaa gcggatccct 1860 gatgctgaca aactaaccac cgacgactgg aaggccctga tcgagatcaa gacgatctta 1920 gaaccactct acaaactgac gataaagacc cagggatggg ggcagtctgg aacaagtggt 1980 cgactttcag atgttctaat ggggatggaa tatgtattgg gacatcttga aggttacaag 2040 actctatacg acaaggattc cggtcttgaa gctgctcgag cagctgaagc ttcggcggat 2100 ctagcgcgta atcaaccaac cagtctatcg cagcttcgat cgacgcggcg gctgcgcttc 2160 aacgaagggg ccctgccttc tcatgcccgc gacgagtacg tcacgatgcc cgacagcgac 2220 ggtctgctta gactccaagc tagagagcgg gccagcattc gggccagcat caacaatgcc 2280 tggaagaaac ttgacgagta ctacacgaag cttgcggatt cccctttgtt cactgcagct 2340 atcattctga atccaaatct cggccttcgg tggctgcggc gtcggtggag ggatcccgaa 2400 caacatgagt ggctcgttgc tgcgaaagat ggtctcaagg aatactttga ccgctggtat 2460 tgcagctctg atgatcccca gcctcagggt cgatctgtct gtcgagattt aggccgggag 2520 gacgatgctt acgaggcctt cgtcaacagc ggagtcagtt ccgatgaaga tggtaacgag 2580 gacgagattg accggtacta caacatgaag gtcgggggca acgtcgatcc ggttgaatgg 2640 tggatcagta ataaggctca gttcccaaag ctgtctcaga tggcccttga cattctcgct 2700 attccagcca tggcagctga ctgcgagagg tccttcagca ttgctaggct tactctgagc 2760 tcccagaggc atgcaatgaa gtgggagact atcgagatgc ttcagatgct gaagaactgg 2820 cttaggaacg gcgatatcgt catcggagga gtcatgaagg gcagcaggcc ggattgggga 2880 tcgattaagg ggcagcaggg ttacgaatag ctgcatttgt atgtcagttc agtccagttc 2940 aatccagtcc agtccagtcc aagggtcgag ggaatcaatc cgatcagatc aggagggtct 3000 gtctgatctg actgatctga gagcagcgct g 3031 // ID Copia-22_MLP-I repbase; DNA; FNG; 4996 BP. XX AC AECX01000990; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. 
XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-22_MLP_; KW Copia-22_MLP-LTR; Copia-22_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4996 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000990; Positions 2138 7133. XX CC Positions [2427-2924] - Integrase core CC 'CTTTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 54..4838 FT /product="Copia-22_MLP-I_1p" FT /translation="MRTRRKDYGEAKPIAKRSYKKKEKVKDEVLLKSDIEL FT PTFIKKHPEVLENLKRHPGLRPITPSVYPPTSDSLYPPIPINRLHFIFRFS FT LLSLHSPPQPYFHNRRLNPLPTPLSQVPFFAPKGLLFEPTPRVTQYSTATR FT LEIRRIFQRSKTIKSQSTMSNAGPPPPAAPPGMRQPPPHQTHAASVEDYRS FT NKFQLSMPTYVKGSVDTLKVDGSNYAEWEFKIGRCVDRVTKTTDYLSKDGM FT KSSDPDGDAVVYSMIELTVPFEIQRRLSGSAKDAFRTIRALFFFPSRSGHM FT ATWKEVLEIKYDDVSDIGDYFRKIDEKISDLERSGFEWSKDAITGVLYQIG FT LGNAFTNVNNTLNARLRATPTNPITAREVEESIRAEKQIHNTDNITPAFSQ FT LDLNANAFNASPTMNRRGAGNTGSLYRAATPSRGRGVFVHSPQFQSRNLNS FT PLSRHLTSPPTRTYQLSPKFVPPAPGVVSKFHPSLYNQGICYVCHNTGHWS FT DTCPTKFPASPSQPSTSRSSTTASTSTPPRSFRLNLIDANGATFTVDATGV FT EEDEGPLPDGIWASEGTLTEMWDNGDEGVSDTGATHMVTGKLEYLTDVRML FT PRPIPVSVATKAPKAFVTHAGTMHIRSDDGKPIPIHNTFYSPLASSTLISI FT PQLVKSGGSWDAKDGKMKLKFPCGGVVTSDLNLNARRWTVPVMKASAFDCP FT KLSPPSRIVHPTPPSLSSLTTNATMIGAPPAVNEALKWHCRFGHTSMRNIQ FT RMIKHGVVSGLPEKIDKTPFTCLDCLKSKSLRSITLTSSGTRLQPLELIVS FT DVAGPYTPSANGARYMINFRDVATTYTESICVAKRDQVPQMFFEFVERMER FT QTGFKVKKLRSDGAGEYTSKAMNTWCAARGIINHQTNRYEHHQNGTAERSI FT RTISDMGRTMLHAANFGEDMWAYAHTAASHVHNRIPNAATGLKTPFELMFD FT RKPHLDYLRTFGEPAIVHIPCEIRRKLDVRGKKMLMVGYPKGKKGWTFWDP FT ETKVTTTVDSSLARFLSDGPPLPQPTPAPTSTETLSDVENSKGSLAYVLNA FT LRLGEFDGEHAVEEQDALALMVETQTSPYGIIPKSYTEAMRLPEAEKWKEA FT CLEELRRFTDMGVWEVSDPPEGQHLTDLKWVFTIKRDGRFKARLVAKGYSQ FT VEGVDFTETFAPTATFAALRVVLVIAAYYGLKVRGFDVIAAYLNSPMDHDV FT 
WVKSPPGFKLGRVMKLLKALYGTKQGARCWWKFMEEKLRALGFQPSQFDPS FT FYVLRRNGKMCLVWLHVDDGCVASDSDELLKEIEKELSSQIEIKWEKDLTH FT IVGVQIDRVATNHFILSQSDLANKIVRDNQDLLSSFSATTPVPPNLSLTTP FT TNELPLESNRYLSIVGAVSYLAVGTRPDLAFPVNFLARYSKSPQTEHWNAI FT KHLLRYVRDTANMGLDINPTKIKVRRAVETFVDANWGGEFARSTYGHVTRL FT YGVPIAWVSRRQGCVATSTCHAEFMAVGAACRDTIWLQSLIQDIIPRIGTP FT LILGDNKSSIHVSKDNAANKRTRHTEREFYYINEQLYKKKVDLEWIPGNEQ FT LADTFTKALGPLKFREARGLLGVCAKRK" XX SQ Sequence 4996 BP; 1329 A; 1192 C; 1169 G; 1306 T; 0 other; ggttatgagc ccgcactgcc actgcgtgag cgaattccga tagtcacagt ttcatgcgaa 60 caagaaggaa agattacggg gaagcgaaac caatagcaaa gcgatcatac aagaagaaag 120 agaaagttaa agacgaagta ttactgaaaa gcgatattga attaccgaca tttataaaga 180 aacatcctga agtattagaa aatctaaaac gtcatcccgg tttaagaccg attacccctt 240 cagtataccc cccaacttcc gattcattat atccccctat acctatcaac agattacatt 300 ttatttttcg cttctcactt ttatctcttc attcgccccc gcaaccttat ttccataatc 360 gccgacttaa ccctctgccc acacctctct cgcaagttcc tttcttcgct ccgaaaggtc 420 tactcttcga acctactcct cgagttacac aatattcgac tgctactcgt ttggagattc 480 gaaggatttt ccaacgcagc aagactatta aatcacaatc gacaatgtcg aatgccggac 540 cacctccccc tgctgctcca ccgggcatgc gtcaaccacc ccctcatcaa actcatgctg 600 cctctgtcga agactaccgt tccaacaaat tccaactgtc tatgcctact tacgttaaag 660 gctccgtcga tactcttaaa gtagacggtt caaattatgc cgaatgggag ttcaagattg 720 gtcgttgcgt ggatcgtgtg acaaaaacga ctgattatct ttcaaaagat gggatgaagt 780 cgtccgatcc cgatggtgat gctgttgttt actcgatgat cgaactcacg gtgcccttcg 840 aaattcaacg tcgcttgtct ggctcagcta aagatgcttt ccgtaccatc cgagctctgt 900 ttttctttcc cagtcgtagt ggtcatatgg ctacttggaa agaagtgttg gagattaaat 960 atgatgatgt atcggacatt ggtgactatt tccgcaagat cgacgagaag atatcggatc 1020 ttgaacgttc tggttttgaa tggtccaagg acgcaattac cggtgtacta tatcaaattg 1080 gactcggtaa tgctttcacc aacgtcaaca acaccttgaa cgctcgactc cgtgctactc 1140 caacgaaccc gatcactgct cgtgaggtgg aagaatcaat ccgtgccgag aaacaaattc 1200 acaataccga taacattact ccagccttca gtcaattaga tctcaacgcc aacgcattca 1260 acgcctcgcc taccatgaat agacgtggag ctggtaatac 
gggctcccta tatcgtgccg 1320 cgacaccgtc gagaggcaga ggtgtttttg tccactcacc tcaatttcaa tctcgcaatc 1380 ttaattctcc tctatctcgt catcttactt ctcctccaac tcgaacttat caactcagtc 1440 cgaaatttgt cccgccagcc cccggtgtcg tttccaaatt ccatccctca ctttacaatc 1500 agggtatatg ctacgtgtgc cacaatactg gacactggtc tgacacctgc cctaccaagt 1560 ttccggcgtc accctcacaa ccgtcaacaa gtcgatcatc tacgactgcc tcgacatcga 1620 caccgcctcg ttcgtttcgc ctcaatctga tcgatgcaaa cggcgcgacg ttcaccgtgg 1680 acgccacagg tgttgaggag gatgagggac cactacctga tgggatctgg gcgtcagagg 1740 gtaccctgac cgagatgtgg gacaatgggg atgaaggtgt gagtgacacc ggcgctaccc 1800 atatggtaac tggtaaactc gaatacctaa ctgatgtccg catgttacct agaccgatcc 1860 cggtatcggt ggcgacaaag gcacctaagg cgtttgttac ccacgcgggg acgatgcaca 1920 ttaggagtga tgatggcaag cctatcccga tccacaacac cttttattcg ccactggcca 1980 gcagcacact gatatctatt cctcaactag tgaaatcagg tggatcatgg gatgcgaagg 2040 atggcaagat gaaactcaag tttccgtgtg gtggtgttgt tacttctgat ctaaacttga 2100 atgctaggag gtggactgtt cctgtaatga aagcttcggc atttgattgt cccaagttat 2160 cgcctccgtc aaggatcgtg caccccaccc caccttccct atcctctctg acgaccaatg 2220 cgaccatgat tggtgctccg ccggctgtga acgaagccct gaaatggcat tgccgcttcg 2280 gtcacaccag tatgcgtaac atccaaagga tgattaagca tggtgtagtg tctggacttc 2340 ccgaaaagat cgacaagaca ccttttactt gcctggactg tctcaagagt aagagtttac 2400 gttcgataac tcttacgtcc tctggaacgc gtttacaacc actggaattg atcgttagtg 2460 atgttgccgg accttatacg ccttcagcaa acggtgcacg ctacatgatt aatttcaggg 2520 atgtagccac tacctacaca gagtccatat gtgtggctaa acgtgatcaa gttccccaga 2580 tgttcttcga atttgttgaa cgtatggaaa ggcaaaccgg cttcaaagtg aagaaactac 2640 gaagcgatgg agccggtgaa tatacctcca aggcgatgaa cacgtggtgt gcagcacgtg 2700 ggatcatcaa ccatcagaca aaccgttacg aacatcacca gaatggcaca gctgaaaggt 2760 ccatacgaac aatctcagat atgggccgca caatgcttca tgcagcgaac tttggggagg 2820 acatgtgggc gtatgcacac accgctgcgt cccatgtgca caaccgaatt cctaacgccg 2880 ccactgggct caagacccct tttgaattga tgttcgatcg gaaaccacac ctcgattact 2940 tacgtacatt tggtgaacct gcaattgtcc atatcccgtg tgagataaga 
cgcaaactag 3000 atgtacgggg taagaagatg cttatggttg gttatccgaa gggcaagaaa ggttggacct 3060 tctgggatcc tgaaaccaag gttacaacga cagttgattc ctctcttgct cggttcttat 3120 ctgatgggcc tccgctaccc caaccaacac cagcgccaac atcgacggag accctgtcgg 3180 acgtcgaaaa ctccaaagga agtttagctt atgttctaaa tgctttacgc cttggtgaat 3240 tcgatggcga acatgcagta gaggagcagg acgctctcgc tcttatggtc gaaactcaga 3300 catcgccgta tgggatcatt ccaaagtcct atacagaagc tatgcgttta cctgaagctg 3360 agaagtggaa ggaagcatgc ttggaagagt tacgacgctt tactgacatg ggcgtatggg 3420 aagtgagcga cccacctgaa ggtcaacatc ttactgatct gaaatgggtg tttaccatca 3480 aacgagacgg gcgcttcaaa gcacgactcg tggcaaaagg ttactcacaa gttgaagggg 3540 tagatttcac cgaaactttc gcaccgacgg cgacttttgc tgctcttcga gtggtattgg 3600 tcattgcagc ctattacggg ctgaaggtgc gtggttttga tgtgatagcg gcatatttga 3660 acagtccaat ggaccacgat gtttgggtga agtcaccgcc tggttttaaa cttggtcgtg 3720 tgatgaagct gttgaaagca ttgtatggta ctaagcaggg tgcgaggtgt tggtggaagt 3780 tcatggaaga aaagctacgt gctttaggtt ttcagccaag ccagtttgat ccaagcttct 3840 acgtgctgcg tcgcaatggc aagatgtgtt tggtttggtt gcatgtggat gatggatgtg 3900 ttgcgagtga ttcagatgag ttattgaagg agatagaaaa agaattgagt tcacagattg 3960 agatcaagtg ggagaaagac ctgactcaca ttgttggcgt gcaaattgac cgtgtggcta 4020 ctaatcactt catcctttca caatctgatt tggctaacaa gattgtaagg gataaccagg 4080 acctactctc gtcattctca gctacgaccc cggtgccacc caacctttca ctaacaacgc 4140 caacaaatga attaccgcta gaaagcaatc gctaccttag tatagtggga gcagttagct 4200 atttggcagt aggaacaaga cctgacttag catttccggt gaacttctta gctcgatact 4260 ccaaatcgcc tcagacagag cactggaatg cgatcaagca cctgttgcgc tacgttcgag 4320 acactgccaa catgggtctc gacatcaatc ctacgaaaat taaggttagg agagcggtgg 4380 aaacctttgt ggatgccaat tggggtggag agtttgcaag gtcgacgtat ggacacgtga 4440 cacgattata cggggtaccg atcgcgtggg tatctcgaag acaaggatgc gttgcaacat 4500 ccacgtgcca tgctgaattt atggcggttg gtgcagcatg tcgggacacg atttggttac 4560 aatctttaat tcaagatatc attccaagaa ttggaactcc tctgatccta ggtgacaaca 4620 aatcatcgat ccacgtatca aaagacaacg cagctaataa gcggacacga cacactgaac 
4680 gagaattcta ttacatcaat gaacagctgt acaagaagaa ggtggatctt gaatggatac 4740 cgggaaatga acaactggct gatactttca caaaggcctt gggaccattg aaattcagag 4800 aggctagagg attgttgggt gtgtgtgcga agaggaagtg agagattgtt gttgtttttt 4860 tttattattt tattttcctt ttattttttt ctagttttat ttagcagtta ctgcttattt 4920 gtttgagatg ttgtgagcgt tgttgtttct attttttctt ttctagagat aggctttgtc 4980 ttgccgtggg gggggg 4996 // ID Gypsy-19_LBS-I repbase; DNA; FNG; 5374 BP. XX AC ABFE01001248; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_LBS_; KW Gypsy-19_LBS-LTR; Gypsy-19_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-5374 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001248; Positions 8991 3618. XX CC Positions [4098-4592] - Integrase core CC 'AGGAG' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(246..2342,2346..5228) FT /product="Gypsy-19_LBS-I_1p" FT /translation="MLELPRFTRTTALAQQLPLLNGLTSTTRSRNNRPLSL FT HEPSSPLTLPSPSPLLLLSSDSSDSEEDAYIHKKPAYIHKKHHHTMSTLAT FT VEPTSAKAPVLTEGDVSPAVMMDFENAALDFFVSKSIPAEKQVTMIIPGIK FT DLRIQDWIAAERTRLVTLEFPAFMSEMRANYLHQDWEDQIRNKILTSTLTS FT SKTSFWNWSQQLLKLNCLLRGTTSFFDDTALRNHLEAHLDDELRARLKNSE FT ARKEKALKPWINSVRIIDEARAVENKRARELIEETIDRQAKHQNTKTDVLR FT GSSRRANTSQSNSTSANSSSSSFVKLPPLTDDERALLNEHDGCTKCCRFYC FT DHHSQSCPDGFPSGKNYKTLTVAFALAAKKAKAVAKHTAKPVAATAASIEE FT VDSDDDLSATVAVLPNSPGNYASDSDEDWDVSHRDVSNTSIRSKHLIWNCQ FT INSLTNDFPVKTRALIDNGAHLVLIRPELVERLGLKQYKLNKPELVDVAFS FT NGKKKKTELYFYVKLALSSLDSAWTSHVVKALVTPGLCLPIILGLPWLERN FT FIVTDHAARTCIDKRNSYDLLNPPDIVPPPPPKPRLREQIKITKADKKLAL FT AELMLVCHDRLKNKKLQPEHVKDFDVAGAVRDRIDVLVAQEQLATRETALK FT TEYKEIFEPIPHADELPRDIVAEIQIKNAEKTIKSRSYPSPRKYKQAWQII FT QQHLDAGRIRPSSSPCASPAFIVPKANPNVLPHWVNDYWQLNENTVTDSHP FT LPRIDDILNDCAKGRIWGTIDMTNSFFQTRMHPDHVHLTAVNTPLGLYEWL FT VMPMGLKNAPAIHQRRVTAALRTLIGKICHIYLDDIVIWSNSLEEHERNVR FT TVLEALRAARLYVNPDKTHLFCTEIDFLGHHISVRGIEADNKKVEKVLNWP FT VPKSATEVRGFLGLVRYIAAFLPSLADHTGVLSELTTKDSKRNFPCWAPRH FT QKAFNAIKQIVTSRDCLTTIDFSKMPEYKIFVTTDASDKCSGAVLSFGPSW FT ENARPVAFDSMTFKNAELNYPVHEKELLAIIRALKKWRVDLLGSPFFIYTD FT HKTLENFNVQKDLSRRQVRWMEFMSQFDAKVIYIKGEDNTVADALSRLPSP FT QSLTDSENTARHPYNFCDDDEGEATIASVTFPRLCGPWESATCLASREPIL FT PAIGATLEITADKSFLDAVRSGYADDAWCKTLPAAAVSWPELVFRDGLWYV FT GERLIIPRSNNLRETLFMLAHDVIGHFGFYKTYGSLRNAYYWPNMRRDLEQ FT GYVPSCPDCQRNKSSTIKPYGPLHPLPIPDRQGDSVAIDFIGPLPEDNGKN FT SIITFTDRLGSDIQLVPSQTDITAEDLVYLFFDRWYCENGLPSEIISDRDK FT LFVSRFWKALHKLTGVKLKLSTAYHPESDGASERTNKTVNQALRYHVEHNQ FT IGWVRALPKIRFDMMNTVNKSTGFTPFQLRMGRSPRVIPPLVPAKSSATVT FT DIDAWHVIRKLETDVLEAQDNLLKAKLSQARQANKNRTLTFPFTVGSRVRL FT STLHRRKEYKVKGEKRVAKFMPRYDGPYTITDVDEEHSTVTLDLPNSTNIF FT PTFHTSQVIPYIESDTEQFPSRHFEEPAPIITEDGNEEQYIDRILDARRRG FT RGYQYLVRWRGFGKEHDEWLPGSELEDCEALDIWLASRNGSS" XX SQ Sequence 5374 BP; 1462 A; 1384 C; 1156 G; 1372 T; 0 other; cttttttttg aaatatcgcc 
accgttttag attctgacat gtatttcgcg catataggca 60 cgcccttttg gaattcccac taggcgtttg cttgacgcgt tatcgcatca tctctgtacg 120 taaactgtcg tgcttgtcca ttgtgacatg cttttaactg tctgattagt ggagggttac 180 tcacctgtac cggtataccg ccgtcatccg tgaacccctt ttgggacgcg ctcacgccaa 240 ccgtaatgct ggaactgcct cgcttcaccc gtaccaccgc acttgctcaa caactccctc 300 tccttaacgg tttgaccagc actacgcgtt ctagaaataa ccgtcctcta tctcttcacg 360 aaccgtcttc acctttaaca ctcccatcac catctccact gctcctactc tcatccgact 420 cttcagactc tgaagaagac gcatatatcc acaaaaaacc cgcatatatc cacaaaaaac 480 atcatcacac aatgtccaca cttgcaacag tagagcccac cagcgccaag gcacctgtct 540 tgactgaagg ggatgtttcc ccagctgtga tgatggattt tgaaaatgcg gcgctcgact 600 tcttcgtctc gaagtccatt cctgctgaga aacaggtcac aatgataatc ccgggtatca 660 aagacctcag gattcaggat tggattgccg ctgaacgcac gcgacttgtc acgctcgagt 720 tccctgcctt catgtctgaa atgcgggcta actacttgca tcaggattgg gaggatcaga 780 tcaggaataa gatccttacc tctactctca cgtcgtcaaa gacttccttc tggaattggt 840 cccaacagct cctcaagttg aactgcctcc ttcggggaac cacctcattc ttcgacgaca 900 ccgctctccg taatcacctt gaggcccacc tagacgacga gctacgtgcg cgtctcaaaa 960 acagtgaggc gcgcaaggag aaggcactca aaccgtggat caactctgtc cgcatcatcg 1020 acgaggctcg agcagtcgaa aacaaaagag ctcgtgaact cattgaagag actatcgacc 1080 gccaggcgaa acatcagaac acgaagactg atgtcttgcg tggttcgtcc cgtcgggcaa 1140 acacgtccca atcaaactcg acgtctgcca actcttccag ctcttcgttc gtcaagcttc 1200 cccctctcac cgacgatgaa cgagcactcc tgaatgaaca cgatggctgc accaaatgtt 1260 gtcgtttcta ctgtgatcat cattctcaat catgccctga tggtttcccc tctggcaaga 1320 actacaaaac acttaccgtc gcattcgcgc tagctgccaa gaaggctaag gcggtagcta 1380 agcataccgc caaacccgtt gccgccaccg ccgcctcgat cgaggaggtc gactctgacg 1440 atgacctgtc tgccacagtg gcagttctcc caaactcgcc tggcaattac gcgtcggact 1500 ctgatgagga ctgggatgtg tcgcaccgtg atgtgagtaa cacgtctatc cgaagcaaac 1560 atttaatctg gaattgtcaa atcaacagtc tgacgaatga ctttccagtg aaaacgcgcg 1620 ctctcattga caacggtgca cacctagtgc tcatccgccc tgaactcgtc gaacgcctgg 1680 gactgaagca gtataaattg aacaaaccgg aattagtcga 
cgtcgccttt agcaatggaa 1740 agaagaagaa aaccgaactg tatttttatg ttaaactcgc actctcatca ttggactctg 1800 catggacttc tcatgtcgta aaagctcttg ttacacctgg actctgtttg ccaattattc 1860 tgggcctccc ttggctcgaa cggaatttta ttgttacgga tcacgctgca cggacatgta 1920 tagataaaag gaattcatat gacttgttga acccccctga catcgtaccg ccgccgcctc 1980 ccaaacctcg tctgcgtgag caaataaaga taacaaaagc agataagaaa ttggcacttg 2040 cagaactgat gttagtatgc catgatcgat taaagaataa aaaactacaa ccagaacacg 2100 tcaaagactt tgacgtcgcc ggtgctgtcc gtgatcgtat cgatgttctt gttgcacagg 2160 aacaactggc aacacgtgaa actgctttga aaacagaata caaggagatt ttcgaaccta 2220 ttcctcacgc cgacgaatta cctcgagata ttgtcgctga aattcaaatc aagaatgctg 2280 aaaagactat taaatcacgc tcgtatccgt caccgcggaa atacaaacaa gcctggcaga 2340 tttaaattca acaacatttg gacgcaggcc gcattaggcc ttcttcttct ccgtgtgctt 2400 cgcccgcatt tattgtgccg aaggcaaacc ccaacgtatt gccgcattgg gtaaatgatt 2460 actggcagct taatgaaaac actgtcacag atagtcaccc cttaccccgc atcgatgata 2520 tcctaaatga ttgtgcaaaa ggacggatct ggggtacaat tgatatgaca aacagttttt 2580 tccagacaag gatgcatccg gatcacgtac atttaacagc ggtaaacact cccttgggcc 2640 tctacgaatg gctggtaatg ccaatgggtt tgaaaaatgc ccctgccatc caccaacgtc 2700 gagtgactgc ggcgctgagg acgctgattg ggaaaatttg tcatatatat ttggatgaca 2760 tcgtaatatg gtcaaactct ctagaagaac atgaacgtaa cgtccggact gtgctggaag 2820 cactgcgtgc agcacgcttg tatgtaaacc ctgataaaac acatttgttt tgcaccgaga 2880 ttgatttcct ggggcaccac ataagcgtac gcggtatcga agctgataac aaaaaagttg 2940 aaaaggtcct caactggcct gtaccaaaat cagctacaga agttagaggt tttctgggcc 3000 ttgtccgtta catcgctgca tttttgccgt cgcttgctga tcatactggc gtactctccg 3060 aactgacgac gaaagattcc aaacggaact ttccatgttg ggcgcctaga catcaaaaag 3120 ctttcaacgc aataaagcaa attgtaacga gccgggattg tctgacaaca attgactttt 3180 ccaaaatgcc agagtacaaa atatttgtaa caacagacgc aagtgacaaa tgttccggcg 3240 cagtgctatc cttcggaccg tcttgggaaa atgcgcggcc agtagcattt gactcaatga 3300 cattcaaaaa cgctgaactg aattatccgg tacacgaaaa ggagttgctt gcaataatac 3360 gtgcactcaa aaaatggcgc gttgacttgt tagggtcgcc tttcttcatc 
tacacagatc 3420 ataaaacatt ggaaaatttt aacgttcaaa aagacctgtc acgacgacaa gtgcgctgga 3480 tggaatttat gtcccagttc gacgctaagg tcatctacat caagggagag gacaatacag 3540 ttgcagatgc cctttcccgc ttaccttctc cacaatctct gactgattct gaaaatactg 3600 cgcggcatcc ctacaacttc tgcgacgatg atgaaggcga agcaacaatt gcaagtgtaa 3660 ccttcccgcg cttgtgtggt ccatgggaat ccgcgacttg cttggcgtcg cgtgaaccta 3720 tacttccggc aatcggcgcc accttggaga taaccgcaga taaatccttt ttagatgctg 3780 tcagatcggg ttatgcagat gacgcgtggt gtaaaacact acctgctgct gccgtcagtt 3840 ggcctgagtt ggtgttccgt gatggcttat ggtatgttgg agagagattg atcataccaa 3900 ggtcaaacaa cctccgtgaa accctgttca tgctcgcaca cgatgtgatt ggccattttg 3960 gattttacaa aacatacggt tctcttagga atgcgtacta ttggcctaac atgcgtcgtg 4020 atctcgagca aggctatgtg ccttcttgtc ccgattgcca gcgcaacaaa tcatcgacga 4080 ttaaacctta tggccctctt cacccattac caatccctga tcgacagggt gactctgtag 4140 cgattgactt tattggtccg cttccagaag acaacgggaa aaactctatt atcaccttca 4200 ctgatcgtct gggcagtgat atacagttag taccgtctca gaccgatatt actgctgaag 4260 atctggtgta tctatttttt gatcgttggt attgtgaaaa tggcttgccg tctgaaatca 4320 tttccgaccg agacaaactc ttcgtctcga gattttggaa agccttgcac aaattgacgg 4380 gcgtgaaact gaaactgtca actgcatatc acccggaatc agacggcgcc agtgaacgca 4440 cgaacaaaac tgtaaaccaa gcgctgaggt atcacgttga gcacaaccaa ataggttggg 4500 ttcgtgccct accaaaaatt cgttttgaca tgatgaacac tgtaaataaa tcaacaggct 4560 ttacgccttt ccaactgcgc atggggcgaa gccctcgcgt tataccaccc ctcgttccgg 4620 caaaatcttc tgccactgta acagacatag acgcttggca tgttatccgt aaacttgaaa 4680 cggatgtgct ggaagctcaa gacaacttgc ttaaggcaaa actgtcacaa gcacgtcagg 4740 cgaataaaaa tcgcacattg accttcccct ttaccgtcgg gtcgcgtgta cgtctgtcga 4800 cgttacacag acgaaaggaa tataaagtga aaggggaaaa acgcgtagca aaattcatgc 4860 cgcgttacga tggcccttac acaatcactg atgtcgacga ggaacactca acagttaccc 4920 ttgatcttcc taattcgacg aatattttcc ctacattcca cacatcacaa gtgattccgt 4980 acatcgaatc agatacagaa caattcccat cccgtcattt tgaagaaccg gctcctataa 5040 tcactgagga tgggaatgaa gaacaatata tcgacagaat attggatgca cgccgtcgcg 
5100 gacgaggtta tcaataccta gtccgctggc gcggcttcgg taaagaacat gatgaatggc 5160 tacctggttc tgaactagaa gattgtgaag ctctcgacat atggctggcc tcgcggaatg 5220 gatcttcttg attttttaag tagatttctt tccaccttgc cagccggtag ctttttccca 5280 ttgggttttg atgcacccgg tgcttggatt tacttatatt attactaaca tttttctttt 5340 tttcctctct ttttcttttt aaagcagggg aggg 5374 // ID Gypsy-1_LWa-LTR repbase; DNA; FNG; 174 BP. XX AC AADM01000021; XX DT 12-MAR-2011 (Rel. 16.03, Created) DT 12-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Lachancea waltii genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_LWa_; KW Gypsy-1_LWa-I; Gypsy-1_LWa-LTR. XX OS Lachancea waltii OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Lachancea. XX RN [1] RP 1-174 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lachancea waltii genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AADM01000021; Positions 46008 45835. XX SQ Sequence 174 BP; 74 A; 28 C; 32 G; 40 T; 0 other; tgtcgtatga aaaacaagct atctgcttta ggaaggagaa ccaataccac aaagacggat 60 gtgacatata aatacggcta agctagatca gaaagtaact tacaagtagc tagagagaaa 120 ccaagtttaa tatatattgg tcaagtgatc aatacttaaa actacaatac gaca 174 // ID Gypsy-2_SPDB-I repbase; DNA; FNG; 5507 BP. XX AC ACOE01000170; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Spizellomyces punctatus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_SPDB_; KW Gypsy-2_SPDB-LTR; Gypsy-2_SPDB-I. XX OS Spizellomyces punctatus OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; OC Spizellomycetales; Spizellomycetaceae; Spizellomyces. XX RN [1] RP 1-5507 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Spizellomyces punctatus genome."; RL Direct Submission to RU (12-FEB-2011). 
XX DR Genome; ACOE01000170; Positions 110416 115922. XX CC Positions [2846-3301] - Reverse transcriptase CC Positions [4382-4879] - Integrase core CC 'ACATT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 648..1556 FT /product="Gypsy-2_SPDB-I_2p" FT /translation="MASQEEILFNTIAELDIDQVRQELYQAKLQQLNSTTP FT APAQYQPRVDKSLVKVLQSKLKPYKGNRNVEEIRTYLLRLEEYFQAAADLS FT PEGQLLVATTYLELHAEVWWQSHVKNHPVGSPLRIQSWDQFKRALQENFLP FT ANATRAARDQLARLKQQGSVKEYLNQFNTLCLQVDDLSEAEKLDKFIRGLK FT PRVQEQLELNPTSTINLATAAATADRIDQIQFSYQRRSQATSYYSPKTENS FT GVVPMEIDSISTNSGKPLSKLTEAEKTELRKKGACFRCRKPGHIAPLCPLK FT TKQQGNGKQTQ" FT CDS 1511..5488 FT /product="Gypsy-2_SPDB-I_1p" FT /translation="MPLEDQAAGKWQADSVEDRGSASTGEDIKNHHPDTHH FT LVHSPVIQPLDLLDPPETTPETLTIHGLYLEPQRTPASQRTPETQRIPESQ FT RTPETQKIPGPLRTPESQRTPETQRTPEPQRTPETQRTPEPQRTLEPQKTQ FT EPQRTPEPQTTQATYQESKIHVVEVAPYSLSTPKGPRIPQSHLLTVQGRIQ FT GIPVTILLDNGANQLYVSQSFVKKAQLVTHSLPGAEISMADDRKHQEIGFV FT KGLQYSVGTYTEQSDFIVAPIKFDLILGIPWFAWRNPSINYNKNTVTFTTS FT QYPRVRWFVKQASSSNKSQKSNLITMVSAKMLARTLKKRTATAFAVYLKNP FT QDTSTPPDTMDSKVTKLLENYQDVFAEELPPGLPPERSVDHRIELSDPNQT FT PWRPTYHLSSLELEELRKQLSELLEAGHIRPSKSPFGAPILFVKKKEGSLR FT MCVDYRALNKITIKNRYPLPRIDELMDRLNGAQVFTKLDLKSGYYQIRVHP FT EDIPKTAFRTRYGHFEFLVMSFGMTNAPATFMTLMNDVLRPFLDIFVIVYL FT DDILIYSKTPEDHLDHLKQVLQALRQNKLYANPSKCQFFKQEVEFLGYVVS FT HEGVKLDPKKVQTVQEWPTPQSVTEVRQFMGLVNYFHSFLKDLAEQAAPLT FT DLLRNDKPWTWSEPQEKAFQQLKDMVTKAPVLRPFDPALPTTVFPDASGYA FT VGAWLGQDDGKGMRPVAYHSRKMAPAETRYDTREQELLAIHDALRVWRPYL FT QGTPFKVISDHQSLKWLQNQPFLTRRQARWVMNFQSFDFEIGYGPGRLHTV FT ADILSRRPDLAPRCIKCKALVTAELNAVTHTTSTLQERLKTSTTEDPELQE FT LIQALDDPRRPEDQQQRLKYLSHQDGLLYFQDRIFVPSDPDLRLSLLQETH FT DTQYAGHFGITKTHDALARHYYWPRMYKDIKKYVKSCDSCQRSKPSQQPPA FT GLLQPLPVPKTRWSQISIDLIPDLPMTSSGHNAIFVIVDRLSKRGHFIPTT FT TKASAKDLALLFLKHIFKHHGLPEVIVSDRDTKFTSSFWSNLMQLLGCNTL FT LATARHQQTNGQAERTIKTLKIYLRAFLEYNMKNWEDLLPFAEFAYNNSIS FT ETTGFTPFYLENGQHPRTPATPPGESNDPEATDFHQYIQQLLQVAKDNMIL FT 
AQERQVQQANKSLSKAPSYPLGSKVWLSREGITPEEEAQRPAQKLLHPWLG FT PYRIKAKDEFQNYTLELPSTMKIHPVFHVRLLKPYLDPTEDFPTRPVPPEP FT EPIIQQGEEFWEVEKILDHKIRRKKNLYLIKWKGYPSSQNTWEPEENLLPH FT GREILKEFLAQKGA" XX SQ Sequence 5507 BP; 1645 A; 1610 C; 1138 G; 1114 T; 0 other; tggtagcacc accttagaca acaaccccag actcttagac cttctgtgag agatcagaag 60 ctaagatccc cagaccccca gaccctgccc tagaccttct gagtaatatc agaagaccaa 120 cagaccccag accccagacc ctgccctagt ccttctgagt aatatcagaa gacccctgcc 180 tcttatccca ggcccaactt atgcccagaa ccccccctca aggactccta cagaccagcc 240 cacaccaccc cataccacaa acctaccccc agcacaacct ccagaactac caccagacac 300 accacttggt tagcatcaga agctctgata aaacagatag agcttttcta ttgtgggatt 360 gtgaagaagc tgtagtgtcc ccctgtatcc tccccaccag ttgtacttgc aagtaacatt 420 gcatagtttg ggcctgtgac ccaccctgtc ttgactgcaa gtaacattgc atctcttggg 480 cctgtgaccc accctatctt gactgcaagt aacattgcat ctcttgggcc tgtgacccac 540 cctatcttga ctgcaagtaa cattgcatct cttgggcctg tgacccaccc tattgttatt 600 gttacaagtt atattgtaac tacccaagac tccatcccat taccaccatg gccagccagg 660 aagagattct cttcaacacc attgctgagc ttgacataga ccaggtcagg caggagcttt 720 atcaagcaaa gctccagcag ctgaactcca ccacaccagc ccctgctcag taccaaccca 780 gagtagataa atccctggta aaagtcctcc aaagcaagtt gaagccctac aaaggcaaca 840 ggaatgtcga agagattagg acatacctcc tcagacttga agagtacttc caggcagctg 900 cagatctatc accagaaggc caactgttag tagccaccac ctacttggaa ctccatgctg 960 aagtctggtg gcaaagccat gtcaagaacc acccagtagg atcccctctc agaatccagt 1020 cttgggacca gttcaagaga gcccttcaag agaacttcct cccagcaaat gccaccagag 1080 cagcaagaga ccagttggcc aggctcaaac aacaaggctc agtaaaagag tacctaaacc 1140 agttcaatac tttatgcctt caggtggatg acttgtcaga agctgaaaag ttagacaagt 1200 ttatcagagg gcttaaaccc agggttcagg aacaactgga gctcaacccc accagtacca 1260 tcaacttagc cacagctgca gcaacagctg ataggattga ccagatccag ttcagctacc 1320 agagaagaag ccaggccacc agctactact cccccaagac tgagaactct ggagtagtcc 1380 ctatggagat agatagtata tccaccaact ctgggaagcc tctctccaaa ctcactgagg 1440 ctgagaagac tgagctcaga aagaagggag cctgtttcag gtgcagaaaa ccaggccata 
1500 ttgcccccct atgccccttg aagaccaagc agcagggaaa tggcaagcag actcagtaga 1560 agatagaggt tctgcctcta ctggggaaga tatcaagaac caccaccctg acacccacca 1620 cctggtacac tctcctgtta ttcaaccctt ggacctattg gacccacctg aaaccacccc 1680 tgagacttta actatccatg gactttacct ggaaccccag agaaccccag catcccagag 1740 aaccccagaa acccagagaa tcccagaatc ccagagaacc ccagaaaccc agaaaatccc 1800 aggacccctg agaaccccag aatcccagag aaccccagaa acccagagaa ccccagaacc 1860 ccagagaacc ccagaaaccc agagaacccc agaaccccag agaaccctag aaccccagaa 1920 aacccaggaa ccccagagaa ccccagaacc ccagaccacc caggccacct accaagaaag 1980 caagatccat gtagtagagg tggctccata ctctttaagt acccccaaag ggccaaggat 2040 tccccagagt cacttgctga ctgtgcaagg tagaatccaa ggaatcccag taaccattct 2100 cctggataat ggggcaaacc aactctatgt gtcacagtcc tttgtcaaga aggcacaact 2160 ggtgactcat tctctccctg gagcagaaat ctctatggct gatgacagaa agcaccaaga 2220 aataggcttt gtcaaaggtc ttcagtactc agtaggaacc tacacagaac agtcagactt 2280 tatagtagcc ccaatcaagt ttgacttgat cttgggtata ccttggtttg cctggaggaa 2340 tcccagtatc aactacaaca agaacactgt gaccttcacc acttcccagt accccagagt 2400 cagatggttt gttaagcagg ccagctcttc caacaaatcc cagaagtcca atcttatcac 2460 catggtatca gcaaagatgt tggccagaac actgaagaag aggacagcaa cagcttttgc 2520 tgtgtatctc aagaaccctc aagataccag cacccctcca gacaccatgg attccaaagt 2580 caccaagctt cttgaaaact atcaagatgt ctttgctgaa gagcttcccc caggacttcc 2640 ccctgagaga tcagtagatc acagaattga actcagtgac cccaaccaga ccccctggag 2700 accaacctac cacctctctt ctttggagtt ggaagaactg aggaagcagc tttcagagct 2760 tctggaggca ggccacatca gaccatcgaa gagcccattt ggagcaccta tcctctttgt 2820 taagaagaaa gaaggttctt tgaggatgtg tgtggactac agggctctga ataaaatcac 2880 catcaagaac agatatcccc tccctaggat tgatgagctg atggataggc tgaatggagc 2940 ccaggtgttt accaagctag atctgaagtc aggctattac cagatcagag tccacccaga 3000 agacatcccc aagacagcct tcagaaccag atatggtcac tttgagttcc tggtcatgtc 3060 ctttggaatg accaatgccc cagccacttt catgactttg atgaatgatg tccttagacc 3120 cttcttggac atctttgtta ttgtctacct agatgacatc ctcatctact ccaagacccc 3180 
agaagatcac ctggaccacc tcaagcaggt tctgcaagcc ctcaggcaga acaagctgta 3240 tgccaacccc agcaagtgcc agttcttcaa gcaagaagta gagtttcttg gctatgtagt 3300 ctcacatgaa ggagtcaagc ttgatcccaa gaaagtgcag actgttcaag aatggcccac 3360 ccctcagagt gtcactgaag tcagacagtt catgggcctg gtgaactact tccatagctt 3420 tctgaaggac ctggcagagc aagctgcacc attgacagac ctgttgagaa atgacaagcc 3480 ttggacctgg tctgaacctc aggagaaagc ttttcagcag ctgaaggata tggtcaccaa 3540 agcccctgta ctcagaccct ttgacccagc cttgcctacc actgtcttcc ctgatgcctc 3600 aggctatgca gttggtgcat ggcttggaca ggatgatggg aaaggcatga gaccagttgc 3660 ctaccactcc aggaagatgg ccccagcaga gaccagatat gacaccagag aacaagagct 3720 gttagccatc catgatgccc tcagagtttg gaggccctac ctgcaaggaa cccccttcaa 3780 agttatctca gaccaccagt ccctcaagtg gctacagaac cagcccttct tgaccagaag 3840 acaagccaga tgggtcatga atttccaatc ttttgacttt gagattggct atggcccagg 3900 aagactccac acagtggcag acatcctgtc tagaagacca gacctggccc caagatgcat 3960 caagtgcaag gccttggtga cagcagagtt gaatgctgtg acccacacca ccagcaccct 4020 tcaagagaga ttgaagacca gtaccacaga agacccagaa cttcaagagc tcatccaagc 4080 tttggatgac cccaggagac cagaagacca gcaacaaagg ctcaagtacc tcagccatca 4140 agatgggtta ctctatttcc aggacaggat ctttgttcct agtgacccag acctaaggtt 4200 gagcctactc caagaaaccc atgacactca gtatgctgga cactttggca tcaccaagac 4260 ccatgatgcc ctagcaaggc attactactg gccaaggatg tacaaggaca ttaagaagta 4320 tgtcaagtct tgtgattcct gccagagaag caaacctagt cagcaaccac cagcaggctt 4380 gctgcagccc ctcccagtac ccaagaccag atggagccag atctccattg acctgatccc 4440 tgacctaccc atgacttcct ctggtcacaa tgccatcttt gtcattgttg acagattgtc 4500 aaagagaggc cacttcatcc ctaccaccac caaagcctca gccaaggacc tggccctgct 4560 cttcttgaag cacatcttca agcaccatgg cctcccagaa gtgattgtat cagacagaga 4620 caccaagttc accagttcat tctggtccaa cttgatgcaa ctcttaggct gcaacaccct 4680 cctagccact gcaagacacc agcagaccaa tggacaggca gagagaacca tcaagacttt 4740 gaagatctac ctgagagctt tcctggagta caacatgaag aattgggaag atcttctccc 4800 ctttgctgag tttgcctaca acaacagcat ctcagaaacc acaggcttca ccccattcta 4860 cttggagaat 
ggccagcatc ccaggacccc agcaacacca ccaggagaaa gcaatgatcc 4920 tgaagcaaca gacttccacc agtatattca gcaactgctt caagtagcca aggataacat 4980 gatactggcc caagaaagac aagttcagca agccaacaag tctctcagca aggccccaag 5040 ctacccccta ggctccaaag tttggctgtc tagagaaggt atcaccccag aagaagaagc 5100 ccagaggcca gcccagaaac ttcttcaccc ctggcttgga ccttacagaa tcaaggccaa 5160 ggatgaattt cagaattaca ccctggagct ccccagcacc atgaagattc atccagtgtt 5220 ccatgtgaga ctcctcaagc cttacctgga ccccacagag gacttcccta ccagaccagt 5280 accccctgag cctgagccaa ttatccagca aggagaagag ttttgggaag ttgagaagat 5340 tctggaccac aagatcagaa ggaagaagaa tctctacctc atcaagtgga aaggatatcc 5400 ctcctctcag aacacctggg aaccagaaga aaacctccta ccccatggca gagaaatctt 5460 gaaggagttc ctagcccaga agggagctta atcttaaagt gggggag 5507 // ID Copia-1_CDC-LTR repbase; DNA; FNG; 435 BP. XX AC NC_012867; XX DT 06-FEB-2011 (Rel. 16.02, Created) DT 06-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Candida dubliniensis genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_CDC_; KW Copia-1_CDC-I; Copia-1_CDC-LTR. XX OS Candida dubliniensis OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-435 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Candida dubliniensis genome."; RL Direct Submission to RU (06-FEB-2011). XX DR Genome; NC_012867; Positions 1376815 1377249. 
XX SQ Sequence 435 BP; 142 A; 78 C; 83 G; 132 T; 0 other; tgttgtgata taagtaatgc aagcacaaag acgaatcccc acagctgaaa gttattacac 60 tgaggggtta aactgaatat aataaattaa tatgttattg tgattcacgt gtgaaataga 120 atatatagga agagacttaa ctaacgggtg tgttaatcct cggcacttgt ccacaagagc 180 ggggattaat cataatgtga agagtgtgtt agaaagagga agggaattat aagtactgca 240 gaattccctt tagactagcc actaactgtt gaagaactaa cgttctaaat atattcagta 300 tagggtatca ccttaatatt ctatactata cttttacaac aatctacctt taactctgtc 360 aagtttcact atgtctactc ctactcctac tactcctgct acggctgctt ccatgggtgg 420 tgatggccaa ttaca 435 // ID Copia-67_MLP-I repbase; DNA; FNG; 4577 BP. XX AC . XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-67_MLP_; KW Copia-67_MLP-LTR; Copia-67_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4577 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR [1] (Consensus) XX CC Positions [1871-2380] - Integrase core CC LTRs are 99% similar to each other. CC Includes an insertion of a DNA3-1_MLP transposon (masked by x).
XX FH Key Location/Qualifiers FT CDS 87..1776 FT /product="Copia-67_MLP-I_1p" FT /translation="MSLNPDKFFAQTTSPIPPSEHSDDAGDIISVNIQSSL FT KPPSQSTLSVPPNPIMSTSIPPPVEMSSQQQAMILNQSLDRSIKKLTDETY FT MLWASFIRSRLKSMFLADYLSSDSIKLENSQFDDEVSRACITNWMLNNMDD FT INRARFEPKITIYREDGPDTENLPAKLWKAVNKHHSGISEELRLLLERALN FT LIVQSSNTPLLTHIKNFQTTVTKYKTSGGTKSDEDLGRKLLISLNNSYFQD FT AKEIAIAGIKDYDLVVADAVAMLTKNPIAPRHTTVVNHTAEASFASNSYPR FT NKNPNRCTKKKCTGINHNPDMCFKKPGNKHLQREWIEQRVKLGQWNGDPPK FT DLSQSSSAAPIFKSEPTIEELENAFNSMNPSASHVSVQLLSTASNHNNSNT FT VEVLIDTAASHHMFKDKHLFINYVDLRNSNDSLSLAGGKSTLHIHGKGEVH FT FLGPDGHPFELKDCLHIPDLKRSLIGGTILLKQDFITKKDNNRFLILKDDK FT CAFTGTLNDNINLLRSSVRIIHVKSQPEANLISANADERILDLHRRLGHPN FT ARYMKTMVLHNSLKGLX" FT CDS 1955..4543 FT /product="Copia-67_MLP-I_2p" FT /translation="MLFTDDFSRYRHVVGLNAKSADVVFTKIKQYITLVER FT QCNAKLKMFTLDNGYEFINDLLVPYCKDIGIYLRTTATYTPEENGVAERSN FT RTITEPARAIMLEANLPTRFWIYAIKASVYLKNRTISSSLPIGKTPFELWN FT GRQPDISHVRPIGCLCYVLIRKAIRGGKFQPVSRQAILLGHTEHNLNYEVF FT ILESNTIVTSHDVLFRDRVFPLKKLKPFDISQLSFNRDENPQISDPSIDDP FT PPGDIDDDVLYPGGEGMHQDNELDIIEPILEEPPLDPAVQPTPQDIPRRSS FT RERFPVQRYQPSASFAYWDDEGAFIDLHDPFPCAFAVGSMVRLINEPHNFK FT SAMKTSDQEKWKSACDKEIQNMKDRQVWHLVPRPADRPVVGSRWHFKVKLH FT PDGTINKYKARFVAKGYTQTYGIDYTDTFAPTGKPSSFRVLVAFATYYGYE FT IHSMDAIAAFLNSKLKEKIYMEQPEGYEEGNEDEDLVCDVDQALYGLKQSA FT RDWNDDFKKRCIKAGFKQSEADECVYIRRRLQDVCLFYLHVDNLAITGNKI FT KEFKEEISTFWPMEDQGLSTCVVGIQLVRAGPNHYIIGQEAMTRSLLERFG FT MTDCKTASTPFPGGTKLTKSTDDEARSFSLLNLPYNSGVGSLMYLSQCTRP FT DITYAVGCLSQHLNKPSLRHWEAFKHVLRYLSGTINHCIHYKNTSSTTISS FT NNGYTLPEHFADADWAGDKSTRRSTTGYVFLLCGGAVTWRSRLQQTVAKSS FT TEAEYRAANEAGDEMIWLARFMKSVGLPQATPYLLNCDSLSAIDLSKDAVL FT HGRTKAIEIHFHWLREQVNAGIIKLTHCKTEDMIADILTKPLHPGPFNDFR FT ERIGLKRVDE" XX SQ Sequence 4577 BP; 1363 A; 1127 C; 890 G; 1109 T; 88 other; ttagccttta tatcccttac ccgtgattca agactcttag acattcatat ggtagcggga 60 gtattgtata cgatccaaac catcgaatga gtcttaatcc ggataaattc ttcgctcaaa 120 caaccagccc catcccacca tctgaacatt cagacgacgc cggagatatc atatctgtca 180 acatacaatc 
atcacttaaa cctccgtcac aaagtacact atctgttcct ccaaacccaa 240 tcatgtccac ttccatacct ccaccggtcg agatgagctc ccaacagcaa gctatgattt 300 tgaatcaatc actcgacaga tctatcaaga agctaacgga cgaaacctat atgctatggg 360 cttccttcat acgatccaga ttgaaatcaa tgttcttagc tgattactta agttcagatt 420 ctatcaaact tgagaatagt caatttgacg acgaagttag cagagcatgc atcactaact 480 ggatgctaaa caacatggac gatattaaca gagccagatt tgaaccaaaa atcacgatct 540 acagggaaga tggacctgac accgaaaatc tacctgccaa gctgtggaaa gctgttaata 600 aacaccactc aggaatatca gaggaacttc gactgctatt agaacgtgct ttaaatctca 660 ttgttcaatc ttcaaacact cctcttctaa ctcacatcaa aaactttcaa accaccgtga 720 cgaagtacaa gacctcagga ggaactaaga gtgacgaaga cttaggccga aaattgctca 780 tatctttaaa caacagctac tttcaagacg ccaaagaaat cgccatagcc ggaatcaaag 840 actatgattt agtagtagcg gatgcggtgg caatgctcac caagaaccct attgctccta 900 gacacaccac tgttgtcaac cacactgctg aggcaagttt cgcatctaac tcctaccctc 960 gaaacaaaaa tcccaacaga tgtacgaaga aaaaatgcac cggcatcaat cacaaccctg 1020 acatgtgctt taaaaaacct gggaacaaac atcttcagcg tgaatggatc gaacaacggg 1080 taaaactagg ccaatggaac ggagatccgc ctaaagactt gtctcagtca tcttcagcgg 1140 ctccgatctt caagagtgaa ccaaccatcg aggaacttga gaatgccttc aattccatga 1200 atccatccgc cagtcacgta tctgtccaat tgctgtcaac cgccagcaat cacaacaact 1260 ccaatacggt agaagtttta attgataccg cggcatcaca ccatatgttt aaggacaaac 1320 atctcttcat caactacgtc gacttgagaa acagtaacga ctcacttagt ttagcaggtg 1380 gaaagtcaac actccatatc catggaaagg gcgaagtcca tttcttgggg cctgatggtc 1440 atcctttcga attgaaagat tgtttacaca tcccggattt aaagagaagc ctcatcggtg 1500 gaacaattct acttaaacaa gattttatca ccaagaagga caacaatcgc ttcttaatac 1560 tcaaagacga caagtgcgct tttaccggga ctttaaacga taacatcaat ctacttcgat 1620 cctcggtgag aataattcac gtcaagtcac aacccgaagc aaaccttatc agtgcaaacg 1680 ctgatgaacg tatccttgat ttgcatcgcc gtctaggtca ccccaacgcc cgatatatga 1740 aaactatggt gttacacaac agtcttaagg gtttaxxxxx xxxxxxxxxx xxxxxxxxxx 1800 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1860 xxxtcatgtc cgtagcttat ccctactaga 
gaacgttcat cttgatctga gcggcatcat 1920 acgtacaagc gctgtggacg gaagtctcta tttcatgcta ttcacggatg acttctcgcg 1980 atacagacat gtagtgggat tgaatgccaa atcggctgat gttgtgttta caaaaatcaa 2040 gcaatacatc actttggttg agcggcagtg caatgcaaaa ctcaaaatgt tcacccttga 2100 taacgggtat gaattcatca atgatctcct agttccgtat tgtaaggaca taggtatcta 2160 tctgaggaca acggccactt acacccctga agaaaatggt gtcgcagaac gttctaatcg 2220 gacaatcacc gaacctgctc gagcaatcat gctcgaggca aaccttccta ccaggttctg 2280 gatatacgcc atcaaggctt ctgtttacct aaagaaccgc actatctctt cttctctgcc 2340 tataggtaaa accccattcg agctatggaa tggtaggcaa ccggacatca gtcacgtcag 2400 acctattggt tgcctatgct atgtgcttat acgcaaggca atcagaggag gaaagttcca 2460 accagtctct cggcaggcca tactcttagg tcacacagaa cacaatctaa actacgaagt 2520 atttatactc gaatccaaca ctatcgtcac atctcatgac gtgctgttta gggacagagt 2580 atttcctcta aaaaagttga aaccttttga tatctcccaa ttatccttca atagggatga 2640 aaacccccaa atttcggatc cttccattga tgatcctcct cctggagaca tcgacgacga 2700 cgtcctatat cctggtggtg aaggaatgca ccaggacaat gagttagaca tcatcgaacc 2760 catcttagaa gaaccgccct tagacccagc agttcaaccc acccctcaag acatccctcg 2820 ccgctcctct cgagaacgct tccctgttca gcgatatcaa ccatctgcaa gcttcgcgta 2880 ctgggatgac gagggagctt tcatcgactt acacgatcct ttcccttgcg cctttgctgt 2940 tggctcaatg gtaaggttaa ttaatgaacc gcataatttc aaatccgcta tgaaaacatc 3000 agatcaagaa aagtggaagt ccgcctgcga caaagaaatc cagaacatga aagaccgcca 3060 agtgtggcat ctagtgcctc gccccgccga tcgccctgtt gtgggaagtc gttggcattt 3120 taaggtcaag cttcaccctg acggcactat caacaagtac aaagctcgtt tcgtagccaa 3180 aggttatact caaacgtatg gcattgatta caccgacact tttgcaccaa ccggaaaacc 3240 atcttccttc cgggtcctcg ttgctttcgc cacctactac ggctatgaaa tacattctat 3300 ggacgccatc gcggctttct taaacagtaa gttgaaagag aaaatttaca tggaacagcc 3360 ggaggggtat gaagaaggaa atgaagacga agacctcgta tgtgatgttg accaggcttt 3420 atatggctta aaacaatctg ccagagactg gaatgacgac tttaagaaaa gatgcatcaa 3480 agcaggtttc aagcagtctg aggcggatga atgcgtatat atcagaagac ggttacaaga 3540 tgtctgtctt ttctatcttc atgttgacaa tctagcaatc 
accggtaaca aaatcaagga 3600 attcaaagaa gaaatcagca ccttctggcc tatggaagat caaggactat caacatgtgt 3660 ggtgggaatt caactagtcc gcgctggccc taatcactat atcattggcc aggaggccat 3720 gacacgttca cttctcgaac gtttcgggat gacagactgc aagactgcat ctactccatt 3780 cccaggagga accaaactaa caaaatctac tgatgacgag gcccgatcct tcagcttact 3840 taacctccct tataacagtg gtgtgggaag ccttatgtac ttgtctcaat gcaccagacc 3900 agacattact tatgctgttg ggtgtctatc tcaacatctc aacaaaccat ccctacgtca 3960 ctgggaagct ttcaagcatg tcttacgcta tttgtcgggc acaatcaacc attgcatcca 4020 ttacaagaac acatcctcca ctactatctc tagcaataat ggttacactc taccagagca 4080 cttcgccgat gccgattggg caggcgacaa aagcacacga cgctctacta cgggatacgt 4140 tttcttactc tgtggtggcg ctgtaacctg gagaagtcga ttgcagcaaa ctgttgcaaa 4200 atcttcaacc gaggctgagt accgtgccgc taatgaagcg ggcgatgaaa tgatttggct 4260 cgctaggttc atgaaatcgg taggacttcc tcaagctacg ccatacctct taaactgtga 4320 cagccttagc gctatcgacc tgtccaaaga cgcggtgctt catggccgaa caaaagctat 4380 agaaatccat tttcactggt tacgagaaca agtcaacgcg ggcataatca agttaacaca 4440 ctgcaagacg gaagatatga tagccgacat cctaaccaag cccttgcacc ctggcccttt 4500 caacgacttt agggagcgta ttggcttaaa gcgtgtagat gaataggggt cttttaattg 4560 tgtcgattga agggggg 4577 // ID Gypsy-2_LBS-LTR repbase; DNA; FNG; 702 BP. XX AC ABFE01000017; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_LBS_; KW Gypsy-2_LBS-I; Gypsy-2_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-702 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000017; Positions 1712 1011. 
XX SQ Sequence 702 BP; 207 A; 141 C; 121 G; 233 T; 0 other; tgttagggta ccatctcaat acttgtcctt ttattctcat tttggctcat tttggcacat 60 taaacctttt gtctgtttct gtattacctg actggtattt actagttgta ctcacgtcca 120 ctattacata agtagttgta ttctgtgggt attgtacttg atgtgtcgac aaacacgttc 180 aaacacaaac tcctttcatg tgttgtacac ttgtttggtt tgcaagtcgt tgttaggatc 240 acaagtcttc gttaggatcg gaagtctaac caaacatatc tgaacaagag aaatgtttgt 300 aaacaagaag atggaatatt gtagcaagga taagaacaat ctagatactt ctcaaatact 360 attgtacttg tatttaaata gagtgaaatg ctagaagtaa ggtatcttac ttttaacccc 420 aactttacca gttccttgaa ctaaggtgtt tcttgagtga attgataagg ctacacgcta 480 tcaattaact cggaccaatc gttaaggctt cgcgctaacg atcgcagtcc tcaagcaatg 540 caaacaaagt acggttaaag actctcgacg tttctaggaa acccccctta ctctttacaa 600 ctcaatcata tcatatctga ctctacgtac aattactcta gtcttgttta ctagtgtttg 660 tattcacagg aatacaccaa cgtagagcgt cgaactctaa ca 702 // ID Copia-2_VA-LTR repbase; DNA; FNG; 213 BP. XX AC ABPE01003718; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Verticillium albo-atrum genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_VA_; KW Copia-2_VA-I; Copia-2_VA-LTR. XX OS Verticillium albo-atrum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetes incertae sedis; Phyllachorales; OC mitosporic Phyllachorales; Verticillium. XX RN [1] RP 1-213 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Verticillium albo-atrum genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; ABPE01003718; Positions 14313 14101. XX SQ Sequence 213 BP; 44 A; 54 C; 43 G; 72 T; 0 other; tgttgccgat caggcaaggt tctgttctgt gacccaggag tactttcatc ccaaatgtgg 60 agtcctcatt tgatgcattt gttctgtcgt tatcgccccg gtaagactcc ttaccgatgc 120 tggatctgac caattacctt cgtttggact tctggaagtc tccttttgta ccttaacaac 180 aatcaatcca ctttgttcaa ttccgtggtg aca 213 // ID Copia-31_MLP-I repbase; DNA; FNG; 4221 BP. 
XX AC AECX01003016; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-31_MLP_; KW Copia-31_MLP-LTR; Copia-31_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4221 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01003016; Positions 5190 970. XX CC Positions [1497-2012] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 87..4220 FT /product="Copia-31_MLP-I_1p" FT /translation="MTDSSNSARVHFPKLAKTNYVEWAGDVVAHLMTCNLE FT EFINMDSPPVPPVKVDGSDQSIQLAVFATKQKKAAGILFGCIDQDNRICIV FT AKDAVSDPIKIWKILKEHFQSSSDENQARAYLKWTDIIFTDLETYITDNQH FT ALAGLLAVDGIQHIHQKFIGETIVSKLLSSMDITKTLLRKDRPLTSEKVIN FT YLEAQLVSIKDEELHTNTIALAVRQPRALQRTTTQRPLTFGRPSRPYCSNG FT RHNPEAKGHTSTQCFQVNPGLQPAQTSSTQPRASVTVASDAFFEAQAFVLT FT TTEHKGILLDSGCSHHMVPDKDMFENYQPVSSNVTLANGANIRIAGKGTIT FT CNLADSEVILQALHVPDLGCSLISMGSLLMQGYTIHNHVDDILMTKVGVGT FT FVGHVRNQVIELDITIGTSTPISNPRANISISDYDILHRQGGHPESPRLKM FT MYKVGAPDNWNCEICKLSKGHRLPYPGHFPSAKLPLDVIHSDLSGKISVPC FT VGGGLYYFKLTDACTNYKHVFIMKLKSDTFSCFTRYKTLVETLHNKKIVSL FT VNDQGGEYMAKDFQELLKKDGTTQHLSAPYTPQQNPVSERGNRTTTEKART FT LLRQSNLPYNFWAEAVTTAVFLENVTPAKITQNKSAYELWFGRPFDYSRLK FT PFGCRAYVLIPKQFRRKFDNTSSKGILLGYQVGMKNYRIRLDDGRIVYSHD FT VTFDCTNFPGIEGDTERDNSDLDTLFKDDYIIPNISIDSPPSTSTPANVQT FT LEDTPHDTPTPSQPSSPVILPETTPSEIEDEVAREEDDDSGDDNEVVDNLT FT NHLKPGWDWQLRAEAPQNISSNIDSSNILPSGSRRRAAAVSVKSTPTLNNP FT TPKTYKQALLSNDKTEWDQAITTELLNMTRRKVWSVVNLPKDRHAIGTTWV FT FKKKMGAEGELLKYKARLCALGFLQYFGVDYNETYAPTGRLATFRSVCTIA FT AEEDLDVIQMDAVAAFLNGSPKEVIYIRIPKGYNVDNATADTVLQLNQALY FT GLKQAPKVWYDTLKAFFATIGLFPSPTDPSLFISSDPTWRCLVHVHVDDMV 
FT IASNNVERFRSAITEKFQMDENTDFKYILGMKVVRDCQTKTITLSQAQYIQ FT DLLEDYGMEECKPVGSPMTPNMYLIPGTTSEREEFLNLGVSYRRAVGKLMY FT LNNATQPDLSFVVSQLSQHLNNPSIVHWIAFKRVLRFLKGTQSLSLVLGGS FT GLNLNGFADADFAACPVTRRSTGGYVTRLGNSTVNRNSKKQDSVATSTTEA FT EYRSAYEGGQDIVWVNELIAGMGIKQEKAPSFKLDNQGAIALSKNQKFQRR FT TKHVDVKYHWLREMALQSKLKINYIPTNDMIADVMTKALTPGKHGNFCTLL FT GLHDVTKMGGNQDQE" XX SQ Sequence 4221 BP; 1303 A; 1070 C; 845 G; 1003 T; 0 other; aaactaccta aagctctttg gttatgagcc cagcgttcca cgaagctctt aaaaattttt 60 tcgatcaaat ctaaaaattc tttcaaatga ccgactcatc aaactcagca cgagttcatt 120 tccctaaact tgcgaaaacg aactacgtcg aatgggccgg agatgtcgtc gcccacttga 180 tgacgtgtaa cttagaagag ttcatcaaca tggactcgcc tcccgttcct cccgtaaaag 240 tcgatgggtc tgatcaatct atccaacttg cagtcttcgc caccaaacaa aagaaggcag 300 caggtatctt gttcggatgc attgatcagg acaacagaat ttgtattgtc gccaaggatg 360 cagtctcgga tcccatcaag atttggaaaa ttttgaaaga acactttcag tcttcatctg 420 atgaaaatca agcccgcgca tatctcaagt ggactgatat catcttcact gacctcgaaa 480 cgtacatcac tgacaatcaa catgctttag ccggtctctt agctgttgac gggattcaac 540 acatccatca aaaattcatc ggagagacca tcgtcagcaa actcctgtcc tccatggaca 600 tcaccaagac cttattacga aaagatcgtc cactaacctc cgaaaaggtt atcaactatc 660 tcgaagctca gcttgtatct attaaagatg aagaacttca caccaacacc attgcactcg 720 cagttcgaca accacgagct ttgcagcgca caaccaccca acgtcctctg acgttcggtc 780 gaccatcacg gccttattgc tccaacgggc gccacaatcc agaagcgaaa ggtcacactt 840 ccactcaatg cttccaggta aaccccggtc ttcaaccagc tcagaccagc tcaactcagc 900 ccagagcatc cgttaccgtc gccagtgacg ctttcttcga ggcccaagcc ttcgttctaa 960 ccacgactga acacaaaggc atcctcctag atagtggatg ttctcatcac atggtacctg 1020 acaaggacat gttcgaaaat tatcaaccag tctcgtcaaa cgtcactctt gcgaacgggg 1080 caaacattcg aatcgcagga aaaggaacta tcacctgcaa ccttgccgac agcgaagtaa 1140 ttcttcaagc cctacacgta ccagacttag gatgctcatt gatcagcatg ggcagcttgt 1200 tgatgcaagg ctacaccatc cacaatcacg tcgacgacat tctgatgacc aaagtcggag 1260 ttggaacctt cgttggacat gttcgaaatc aggtcattga gctcgacatc accataggca 1320 cgtcaactcc tatttctaat cctcgagcca 
acatttcaat ctctgactat gatattctcc 1380 acagacaggg aggtcatccc gagtctccac gccttaaaat gatgtataaa gttggagctc 1440 ctgacaactg gaattgtgaa atttgcaagc tgtcaaaagg tcaccgtctt ccctaccccg 1500 gtcactttcc ttctgccaaa cttcctctag acgtcatcca cagtgacctc agtgggaaaa 1560 tttctgtccc ttgtgttggt ggtggattgt attactttaa acttacggat gcatgtacaa 1620 attataaaca tgtttttatt atgaaattaa aatccgatac tttctcctgt ttcactcgat 1680 acaaaacact cgttgagaca cttcacaata aaaagattgt aagtctggta aatgaccagg 1740 gcggagaata catggctaag gactttcaag aattactcaa gaaagacgga acaacacaac 1800 atctgagtgc tccgtacaca ccccaacaaa acccagtctc tgaaagaggc aatcgtacaa 1860 caactgagaa ggcacgcacc ctacttcgac aatctaactt gccctacaac ttctgggctg 1920 aggctgtgac aacagccgtg tttcttgaga atgtaactcc agccaagata actcagaaca 1980 agtcagcgta cgagttatgg tttggccgtc cctttgatta ctccagactg aaaccgtttg 2040 gttgtcgagc ttacgttctt attcccaagc aattccgacg gaagttcgac aacacgtcct 2100 caaaaggcat cctcttaggc tatcaggtgg gaatgaaaaa ctaccggatc agactggacg 2160 acggtcggat cgtctattct cacgacgtaa cgtttgattg caccaatttt ccaggaattg 2220 aaggtgatac tgaacgagac aactctgatc tcgacactct cttcaaagac gactacatca 2280 ttccaaacat ctcaatagac tcacctccct ccacatccac gcctgctaat gttcaaactc 2340 tggaagatac accccatgat actcctactc cgtcacaacc atcctcacca gtcatcttac 2400 cagaaacgac tccatcagaa atcgaagatg aagttgctcg agaagaagac gacgattcgg 2460 gcgatgacaa tgaagtagtc gataacctca cgaaccactt aaaaccaggt tgggactggc 2520 agttgagagc tgaggcacca cagaacattt cgagcaacat tgactcatca aatattcttc 2580 catcaggttc acgtcgaaga gccgcagcag tatcagtcaa atccactccg acacttaaca 2640 atccaacacc gaaaacctac aaacaagcat tactatcaaa cgacaaaacg gaatgggatc 2700 aagccatcac gactgaacta ctcaacatga ctagaagaaa agtttggagt gttgtgaatc 2760 tacccaagga ccgacatgcg ataggcacca cgtgggtatt caaaaagaag atgggtgctg 2820 aaggggagtt gctaaagtat aaggcaaggc tatgtgcgtt aggctttctt cagtactttg 2880 gtgtggacta caatgagacc tacgcaccaa ctggtcgcct agccacgttc agatctgtct 2940 gcacaattgc cgcggaagag gacttggacg tgattcaaat ggacgcagtc gcggccttct 3000 taaatggaag tccaaaggaa gtcatttaca tcagaattcc 
taagggttat aacgtggaca 3060 acgcgactgc cgacacagta ctacagctca atcaagcctt gtacggcctc aaacaagcac 3120 ctaaggtttg gtatgatacc ttaaaagcct tcttcgccac cataggtctc ttcccatcac 3180 caacagatcc tagccttttc atctcatctg atccaacatg gcgatgtctc gtacatgtac 3240 atgttgatga catggtcatt gcgtcaaaca acgtcgaacg tttccgatca gcaatcactg 3300 aaaagtttca aatggacgaa aacacggact tcaaatacat tctcggaatg aaggtcgtca 3360 gggattgcca aaccaaaact atcacacttt cacaagctca gtacattcag gatcttcttg 3420 aagactacgg catggaagaa tgcaaaccgg taggttcacc aatgacgcca aacatgtatc 3480 tcattccagg cacgacttca gaacgtgaag agtttctcaa tctaggtgtc agctatcgac 3540 gagcagttgg taaattgatg tatcttaaca atgctacaca acctgacctt tctttcgtcg 3600 tctcgcaact atcccaacat cttaacaacc catccattgt acactggata gctttcaaac 3660 gagtgcttcg cttcctcaag ggtactcaat cactcagctt agtcttagga ggatctggtc 3720 taaatctcaa cggtttcgca gatgctgact tcgctgcatg tccagtaaca agaagatcca 3780 cgggaggtta cgtgacacga ttaggcaaca gcacagtcaa tcggaattcg aaaaagcaag 3840 attcagtggc aacttcaacg acggaggcag agtacaggtc agcgtacgaa gggggacagg 3900 atatagtatg ggtaaatgaa cttatagcag gtatggggat caaacaagaa aaagcaccat 3960 ctttcaaact ggacaaccaa ggagcaattg cattatcgaa aaaccaaaaa tttcaaagaa 4020 gaacgaaaca tgtagacgtg aaatatcatt ggttaagaga aatggcatta cagagcaaac 4080 tgaagatcaa ttacattcca acaaacgata tgattgcaga tgtaatgacg aaagcactca 4140 caccgggcaa acatggtaat ttctgcacac tgttaggctt acacgatgtt acgaagatgg 4200 gggggaatca ggatcaggaa g 4221 // ID Gypsy-85_MLP-LTR repbase; DNA; FNG; 206 BP. XX AC AECX01002117; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-85_MLP_; KW Gypsy-85_MLP-I; Gypsy-85_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-206 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01002117; Positions 95858 96063. XX SQ Sequence 206 BP; 62 A; 56 C; 31 G; 57 T; 0 other; tgttatgaac tggttctatt tcacttgagt gacatattca cattacatgt cacaagctcg 60 agaaccatct tatagttgta gatacattgc ctgtacagat tttccctcat ccgacaatcc 120 acataggaag actggtaata gatacaaatc ctcaccagaa ctcccttgac cctcgtcgcc 180 gaaaaccccc aagtctgtcc ataaca 206 // ID Gypsy-57_MLP-I repbase; DNA; FNG; 7509 BP. XX AC AECX01001694; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-57_MLP_; KW Gypsy-57_MLP-LTR; Gypsy-57_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-7509 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001694; Positions 20942 13434. XX CC Positions [6359-6871] - Integrase core CC 'AAATA' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 614..7486 FT /product="Gypsy-57_MLP-I_1p" FT /translation="MAPRKSSRVKDLEETVGRPNYSDSIRRSNSVGSQPRR FT PHGFRPPSRASTTGLASGAGQSSGTTDHEAARPQLPELAQRSDLEPQQHPQ FT YHDLSEIVEVNSRDLGDLDGGRVQESDEHAVGSIRGGAEVASENPQRLHRT FT GPACQTNTSPSGASILNGRTLENPNPTFEQRPVPETSSSRGRSPQHATNVV FT IQPSVLPSHERHQGLEEQGQESPKATSGEPQGTPTMSNTVLMTPSPFLSSH FT RIISEVVPPKLVVPVFTVPTRLNRRTNALEMRPKTMTEPKSVSLSENMREI FT EPFHAKTHAPAEAKKTEHVRKLEDTGIADAKTRASPFVLPGLLRNTSHDLS FT PISQDKNFSLSEKFKKLRLQTESLSAESIHSWKDHQEDLDTRFATMQKIME FT EKFNEGISKLNEKWDFCLNILNDEMSVNFEDIKETIIMFKCSCADVGAAWT FT KHQDEIERLLINQVVTNQQLIETHSHHLGELMCRLGEKFDEQAQMNLATNR FT NVSRIVSMLDKSTAKKDSEEQLPEPNGVFDKGPSLSNNPFAHELVTNHRRT FT MEGYDRDTTLPVQANNASPDADIVKQMRKDYLPAKDWPKFSGEGEYNHLEW FT IEWIDNNAEDSKMPDHLITCKLGIILTDSARAWYTNKRKAVGARSWPEWRE FT IIKEHFGTPLWKRKMSTAFDRDVFSWEHKDKPVAWLLLQRRRMEAAWPTLS FT VREQIDKILGLCDGDIEHAVQSRIREYSDFEAFVNIFEDVVTHTSIGRRFK FT KAENIKQVQLGNSKFRDYKNYSNKEYRNRDYRGGKDGKPENGEGKKPPFKT FT KDGKKFSFEKKINAVDAEPQISDEGELQGSRSQDSESTTEEEDVDFCVGNV FT DMFRIAEDLNISETESHTGNTLDVENTLTESAGITLQQLATNLRISENAIS FT TAPPIEIPRTILDVPETRTFLTVSVNGIRCKMLLDTGLRPSMASTRVLNVC FT WRTWAADTRSTAIENQDYDSEILGELALPIKIEHTWNPCLLMTKFVVTTDQ FT HIDYLVLGSRDLSEFGFTLHLGYNPRCYIGSLEREYELSPCYSSSEGTSEI FT ATITVEPDGNLDIAELQSQAAIPQEWDEASKQVSDARLMMSKPEKGKAHTL FT GDHCVTPALLENKSKVHVLLDSGAACSIVGKKYLSRILPGWEDRIMPPSRM FT TFRGCSTSLTPLGVIELPLTFPHRLGSIRIQPEFVVMDNATSNYLILGGEY FT LRLYGIDIVHSKEKYFTIGNENKKKKFLLNSLRENRIEAITQDTDKEFEKA FT YNLCNISPRLKASELNDLNKLVRSHKKAFAYGENPIGTIKGHEMTIKLTVD FT KPYPPVLKKQAYPASPRSRDEIDKHIDELLRLQIIRKVGADEEVDITTPVI FT IVWHNGKSRMCGDFRALNTFTVPDRYPMPRIHHCLTNLSEALYITTMDAMK FT SFHQNVVEKQSRKYLRIISHRGIHEFLKMPMGIKTAPSHFQRTMDTEFAAE FT LSEGWMIVYIDDIIVFSKTWEEHLVKLEWVFKRLIRMGLTISLNKTNFAFQ FT ELKALGHVVSGLWIAVDQNKVAAVLKLPIPQSVKEVQSFLGFANYYRSHLE FT GFAKVSGPLYKLLQKGVTFEMTEDRVASWKALKQKLTEAPFLLHADFKLPF FT KVYVDASFDGLGATLQQTQVINGKPVEGLISCISRKLKQSELNYGATQLEN FT LCLVWALEKFHYYLDGANFEVITDCTALKSLLNMKTPNRHMLRWQIAIQEY FT 
RSCMTITHREGRKHENADALSRMALNNDAQNPAWDPEDSTRDLHIMGISIC FT DLSDEFFDKVKEDYMLEPNTMKLIEILKQDHKNLPLSTTLDGKWKKSFDEG FT RFTLLSGVLYHREKHSCVMVITSTDMKTDILKICHDDIMAGHLCTDRTIDR FT VKQTAWWPEWRTLTEQYVASCDRCQKSNKATGKRLGLLQKIDEPTVPWEVI FT NMDFVSGLPPAGHDNVDCVLVVVDRFSKRCRFLPCHKDATAMDVAMLFWER FT IVCDSGLPRVIISDRDPKFTSDFWKGLFSLSGTKLAMSTAHHAQTDGLAER FT MIQSMQSLIRTYCGFGLEFKDPDGYTHDWKSLLPALEIAYNTSTHSTTGKS FT PFEIERGFNIRTPSKLLRPKDSTFHPTSVSFHHMLEKARQHATECIQASVE FT YNKSRWDKTHQEPSFQVGDLVLISTAHFTNLKGPRKLRDPFIGPYVIKALH FT GSNAVEVIPTGAIARKHPTFPVSLLKKYESSDKDKFPNREQPPEPELPVEE FT EPVGAIKSVIQEKLVRLNGRDTRLYLARFKGKHADEDKWLEAKDIPNSSAI FT LRKYRAHKRV" XX SQ Sequence 7509 BP; 2408 A; 1712 C; 1777 G; 1612 T; 0 other; attgggggcc tcatcgtgcg tagaaaccca agacacagaa agtttagtta ctactccctt 60 ttttaggttt tcaaaattcc gttacgaaac ttcaccaccg aagaaaggat agcggaactc 120 agagagatct ggaaccaaat cagagagcta cttaccgaac ttgaaaacac gctcgagtta 180 gttattttaa attttcgttc tttctttcaa caataaaaaa aaaaagtctg attaaaaact 240 accttttttt tttatcctaa aaaaaaaaaa cacatacttt ttttcgcttg ttgcctttcg 300 tttccttctc tcgccgttcc ttttgacgct gtgtactaag tgcacaagtc aacgcagcag 360 accccggtcc accaggaggg agaaataacc cccacctcat cgtcgacgag cgaccaaggc 420 aacgcagtcc agacccctac ctccacaccg agtactgggg agataccgat tactacagcc 480 acaggggtga ctagtaagta ctggttcagg accgttcttt tttttccgtt ttttcttcgt 540 caaaagatcg aagatagact ccttttagtt tgatactcta ccatttctaa tccaattacg 600 aaatcttctt accatggcac ccagaaagtc tagccgggtt aaagatcttg aagaaaccgt 660 cggacggcca aactacagtg actcaatccg tcgatccaac tccgttggca gccaaccaag 720 acgtcctcat ggattcagac cacccagcag agcatccacc accggacttg caagtggagc 780 aggacaaagc tcaggtacaa cagatcatga agctgcacga ccacaacttc cagaattggc 840 tcagcgctca gacctcgaac cccaacaaca cccgcagtat catgatctat ctgaaattgt 900 ggaagtcaac tcacgagacc ttggtgatct tgatggggga cgagttcaag aaagtgatga 960 acacgcagtg ggatccattc gtggaggagc agaagttgct agcgagaacc cacaacggct 1020 tcatcggacg ggaccagcat gtcaaacaaa cacctccccc tccggcgcaa gcatcctcaa 1080 cggccggacc ctcgagaacc caaacccaac gttcgaacaa cgccccgtac 
cagaaacctc 1140 ctcaagcaga ggaagaagcc cgcaacatgc aacaaatgtt gtcattcagc cgagcgtact 1200 accgagccat gaaaggcatc aagggctcga agaacaaggg caagaatcac ccaaagcaac 1260 atcaggagaa ccccaaggaa ccccaacaat gagtaataca gtgcttatga caccaagtcc 1320 ttttctttca tcgcaccgta taattagcga agttgtaccc ccaaaacttg ttgtccctgt 1380 atttaccgtt cctacacgcc tgaacagaag aacaaatgct ttagaaatga gacccaaaac 1440 gatgaccgag ccaaagagcg tgagccttag tgagaacatg agagaaattg agccttttca 1500 tgcgaaaacc cacgcacctg ccgaagccaa gaagacagag cacgtgagga aacttgagga 1560 tacagggata gcggacgcta aaacaagagc gtcccctttc gtgttaccag gccttctgag 1620 aaatacgtcg catgacctaa gtcccatttc acaagataaa aattttagcc ttagcgagaa 1680 atttaagaag ctaagactac aaacagaaag cttatcggca gagtccatcc actcatggaa 1740 agaccatcag gaagacctag atacaaggtt cgcaacaatg cagaaaatca tggaagaaaa 1800 gttcaacgaa ggtatatcca aattgaacga aaaatgggat ttttgtttga acatcctgaa 1860 cgacgagatg agtgtaaact ttgaggatat caaagaaacg ataatcatgt ttaaatgcag 1920 ctgtgcggat gtaggtgcgg catggaccaa acatcaagac gaaatcgaaa gattgctaat 1980 caaccaggtg gttacgaatc agcaactgat tgaaacgcat agccaccatt tgggggagct 2040 gatgtgtaga ttaggggaaa aattcgatga gcaagctcag atgaatctag ctaccaatag 2100 gaatgtgtcc aggatagtat cgatgctgga caaatcaacc gctaaaaaag acagcgaaga 2160 acagttaccg gaacctaacg gtgtgttcga taagggacca tcactaagta ataatccttt 2220 tgctcacgag ctagtaacaa accaccggcg cacaatggaa gggtatgaca gggacacaac 2280 actcccagtc caagcgaata acgcgtctcc ggatgcggat attgtgaagc aaatgaggaa 2340 ggattacctc ccggccaaag attggccaaa attttccgga gaaggagaat acaatcattt 2400 ggaatggatc gaatggatcg acaataacgc ggaagattca aaaatgccag atcaccttat 2460 cacgtgtaag ctaggtatca tcttgacaga ctcagcaaga gcttggtata ccaacaagcg 2520 gaaagcagta ggcgcacgta gttggcccga gtggcgcgag atcatcaaag agcattttgg 2580 gacaccccta tggaaaagga aaatgagtac agctttcgac cgagatgtgt tcagctggga 2640 acataaagac aagccggtag catggctcct actccagaga aggagaatgg aagcggcttg 2700 gcccacgctg agcgttagag aacaaattga taagatttta ggtctatgcg acggcgatat 2760 agaacacgca gttcagtcca ggatacgcga atattcggac ttcgaagcgt ttgtcaacat 
2820 ctttgaagat gtggtaacgc acacctccat cggccgtcgc tttaagaaag cggaaaacat 2880 caagcaggtc cagcttggca actcgaaatt cagagactat aagaattata gtaacaaaga 2940 gtatagaaac agagactacc gcggaggtaa ggacgggaaa ccagagaatg gagaaggcaa 3000 aaagccaccc tttaaaacaa aagacgggaa gaagttctca ttcgagaaaa agatcaacgc 3060 agtggatgcg gaaccgcaga tcagcgatga aggtgagcta caaggaagta ggtcacaaga 3120 ctccgagagc acgacggaag aggaggacgt tgacttctgt gtagggaacg tagacatgtt 3180 caggatagcc gaagatctga atatctcaga aaccgaaagc catacaggaa acacgttgga 3240 tgtagaaaat acacttacag aaagcgcggg gataacccta cagcagctcg ctaccaactt 3300 acggatctcg gaaaacgcga tttcaactgc tcctcctata gaaataccca gaacgatact 3360 ggatgtgcca gaaacaagaa cctttttgac ggtctccgtg aacggtatcc gatgtaagat 3420 gttgttggac acaggtctga gaccttcgat ggcctctacc agagtgctga acgtatgttg 3480 gcgaacatgg gcggcggaca cccgttcaac cgcgattgag aatcaggatt acgattcgga 3540 aatactggga gaactagccc ttcctattaa aatcgaacac acttggaacc cttgtctgct 3600 tatgactaaa tttgtagtga ccacggatca acatatagat tatctggtac taggaagcag 3660 agatttgagc gaattcggat ttacgcttca tctgggatat aacccacgat gttacattgg 3720 atcgcttgag agagaatatg aattatcgcc gtgttactcc tcttcagaag ggacaagcga 3780 aatagccact ataacagtgg aacccgatgg gaacctagac atcgctgagc ttcagtcaca 3840 agcagcaata ccccaggagt gggatgaagc gtctaagcag gtttcagatg cgagactgat 3900 gatgtcgaag ccagagaagg gcaaagccca tacattaggg gaccactgtg taacaccagc 3960 attgctagaa aacaagtcga aggtacatgt cctgctcgat agcggcgccg cgtgttccat 4020 tgtaggtaaa aaatacttgt cgagaatcct tccgggatgg gaagatagga tcatgcctcc 4080 gagcagaatg accttcagag gatgcagtac aagccttacg cctctgggag tgatagagtt 4140 accactcact tttccacaca gattgggatc cattcggatt caacctgagt tcgtggtgat 4200 ggacaatgca acatctaact atctaatcct aggaggcgag tatttgcgat tgtacgggat 4260 agatattgtc catagcaagg agaaatactt tactataggc aatgaaaaca agaaaaagaa 4320 atttcttctg aactccctta gagagaatcg tatagaagcg attacgcagg acacggataa 4380 agaattcgag aaagcgtaca acctatgcaa tatatcccca agattgaaag cgtcagagct 4440 aaacgacctg aataaactag tgagatccca taaaaaagcg tttgcatacg gcgagaaccc 4500 
aatagggaca atcaaaggtc atgagatgac aattaagctc acggtggata aaccataccc 4560 accagtgctg aaaaagcaag cttatcccgc gagtcccagg agcagggacg aaatcgataa 4620 acacattgac gaactactcc gcctgcagat tatacggaaa gtaggcgcgg atgaggaggt 4680 agatattaca acccccgtta tcattgtgtg gcataatgga aagtcacgca tgtgcggtga 4740 cttcagagcg ctgaacacct ttacggttcc agacaggtac ccgatgccta ggatccatca 4800 ctgcttgaca aaccttagcg aggcgctgta cattacgacg atggatgcga tgaaaagctt 4860 ccatcagaac gtggtggaga aacagagtag aaaatacctt aggattatct cgcatagggg 4920 aattcacgag ttcttgaaaa tgccaatggg aattaagaca gcgccatcac attttcaacg 4980 aactatggac acagaattcg cggccgagtt aagtgaagga tggatgatag tttacatcga 5040 cgatattata gttttttcca agacctggga agaacacttg gtcaaactag agtgggtctt 5100 taaacgacta atccggatgg gtttgacgat ctcgttgaat aagaccaact ttgccttcca 5160 agagctgaaa gcgttgggac atgtggtctc aggactttgg atcgcagtag atcagaataa 5220 agtcgcagca gtcctcaagc tgcccatacc acaatcggtg aaagaggtgc aatccttctt 5280 gggattcgcg aattactata ggtcgcactt ggaaggcttc gccaaagtaa gtggaccgtt 5340 gtacaaactc ttacaaaaag gtgttacttt tgagatgacc gaggatagag tagcttcatg 5400 gaaagcactg aaacaaaagc ttacagaagc acctttcttg ctccatgcag atttcaagct 5460 tccgtttaag gtctacgttg acgccagttt tgacgggctg ggagcaactc tacaacaaac 5520 ccaagtcatc aacggcaaac cagtagaggg actgattagt tgtatctcga ggaaattgaa 5580 acagtctgag ttgaactacg gagcgacaca acttgagaat ttatgcctag tctgggccct 5640 ggaaaaattc cattactacc ttgatggtgc aaatttcgaa gtaattacag attgtacggc 5700 cttgaaatcc ttgctaaata tgaaaacgcc taataggcac atgttgcgtt ggcaaatagc 5760 catccaagag tacaggtcgt gcatgaccat cacacaccga gaagggagga agcatgaaaa 5820 tgccgacgca cttagccgca tggcacttaa caacgacgcg caaaacccag catgggatcc 5880 tgaagacagt acgagagacc tccacattat ggggatcagc atttgcgatc tctcagacga 5940 attctttgat aaagtcaaag aagactacat gttggagcca aataccatga aacttatcga 6000 aattctgaag caagatcaca agaacttgcc cctgtcaaca acgttagacg gaaagtggaa 6060 aaaatccttt gatgaaggac gattcacctt actcagcggc gtactgtatc atcgggagaa 6120 acactcctgc gtgatggtga tcacatccac ggatatgaag acagacatat taaaaatttg 6180 ccacgacgat 
atcatggcag gacacctgtg taccgatagg accattgacc gggtcaagca 6240 aacggcctgg tggccggaat ggaggacgct gacggagcag tacgtggcat cttgcgacag 6300 atgccaaaaa tcgaataaag ctactgggaa aagactggga cttcttcaga aaatagacga 6360 acccacagtc ccctgggaag tcataaacat ggactttgta agcgggcttc ctccagcggg 6420 acacgacaat gtcgactgcg tgttagtcgt ggtggataga ttttcaaaga gatgcaggtt 6480 tttgccgtgc cataaagacg cgacagccat ggatgtagca atgctattct gggaacggat 6540 agtatgtgat tcgggtctcc ctagggtcat catcagtgac cgagacccta aattcacctc 6600 ggatttctgg aaaggtcttt ttagtctttc tgggacaaaa ttagcaatgt caaccgccca 6660 ccacgcgcaa acagacggac tcgcagagcg gatgatccaa agcatgcaat ccctgatccg 6720 aacgtattgc ggatttgggt tggaattcaa ggaccccgac ggctatacac atgactggaa 6780 gagtctctta ccagcactgg agatagcata taatactagc acgcacagca ccacagggaa 6840 atcgcccttc gagatcgagc gaggttttaa catccgaaca ccaagtaaac tactacgtcc 6900 aaaggactct accttccacc cgacatctgt tagctttcac cacatgcttg aaaaagcgcg 6960 acaacacgcg acagagtgca ttcaagcctc agtggagtac aacaaatcaa gatgggataa 7020 gacacaccaa gaaccaagct tccaagtagg agacttggta ctaatctcta cagcacactt 7080 cacgaacctc aaggggccaa ggaaactgag ggatccattt attggaccat atgtcataaa 7140 agcgctacac ggcagcaatg ctgtggaagt tattccgacc ggagcgattg cgcggaagca 7200 ccccactttc ccagtgtcgc tgttgaaaaa gtatgaatcg tctgacaaag ataagttccc 7260 taacagggaa caaccgccgg agccggaact acccgttgaa gaagagccgg tgggagctat 7320 taaatctgta atacaggaga aacttgtcag actgaacggc agagacactc gtttatacct 7380 agcaaggttc aaaggaaagc acgcggatga agacaaatgg ttagaagcaa aagacatacc 7440 taattcttca gccatactgc ggaagtatag agcgcataag agagtgtgac tctccttgtg 7500 gttggggag 7509 // ID Gypsy-43_MLP-LTR repbase; DNA; FNG; 159 BP. XX AC AECX01002299; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-43_MLP_; KW Gypsy-43_MLP-I; Gypsy-43_MLP-LTR. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-159 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002299; Positions 11248 11090. XX SQ Sequence 159 BP; 44 A; 40 C; 29 G; 46 T; 0 other; tgttatgatc caaacatgtc acgttatccg tgtagacatg tcacaggtcc agaggaacaa 60 ctcttgtccc ccatttgtac agtagagttt tcctcatacc gcaatctcag ttatacgtat 120 tggagaatct atcacgccgt gcatgagcca ttcataaca 159 // ID hAT-1_AN repbase; DNA; FNG; 4296 BP. XX AC . XX DT 03-JUL-2007 (Rel. 12.06, Created) DT 13-JUL-2007 (Rel. 12.06, Last updated, Version 1) XX DE DNA transposon, hAT family. XX KW hAT; DNA transposon; Transposable Element; hAT-1_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-4296 RA Galagan J.E. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-4296 RA Clutterbuck A.J., Kapitonov V.V. and Jurka J.; RT "Transposable Elements and Repeat-Induced Point Mutation in RT Aspergillus nidulans, A.fumigatus and A.oryzae."; RL Chapter in "The Aspergilli: genomics, medical applications, RL biotechnology, and research methods". Edited by Goldman GH and RL Osmani SA. Publication expected 2007. XX RN [3] RP 1-4296 RA Clutterbuck A.J.; RT "hAT-1_AN."; RL Direct Submission to Repbase Update (03-JUL-2007). XX DR [3] (Consensus) XX CC 19-bp terminal inverted repeats, 8-bp TSDs. 2 full-length copies CC in genome; in both, ORF is interrupted by RIP, but fragments bear CC similarity to maize Activator (Ac) element. Also 22 fragments. CC Consensus sequence incorporates the less RIP-affected version at CC each disputed site. 
Sequence has little similarity to the CC non-autonomous A. nidulans elements, hAT-N1_AN or hAT-N2_AN. XX SQ Sequence 4296 BP; 1360 A; 877 C; 803 G; 1256 T; 0 other; taggctgcgg cataaccgca accgcggcta tgagatcgat aaccatcacc gcaattgcgg 60 ttatggtcat gagatttgat aaccatcacc atcaccgcaa attcgaggtt aaccgcggtt 120 tcaccgcggt taacctcact gtggccgcac tagcataact cttgcttaat tatcagtttt 180 gagcgccctg catggtagtt atgaagctta caaatactta ttaccgtcca ttttaaacac 240 gcgttcgagt ttttccaata cagacatctt atttgaagac aattgcactg taaatatagg 300 aaattagaat gaaatataaa aaggagtctc tattaaaata gatatatgct atctttgaag 360 tgtatttcct aaatataacc atgttcttaa aacacaggat attgaacaaa tcaaatatat 420 attttatata tatactttgc ctctattttc tgctggtata cttcaggtaa ggctatagcc 480 acttaaattg ttgattaagc taatataaac agtatagtac tatatagtag tattagagca 540 tctagaagag ctagtaattt taatagtgat ctatatattt attttgagta acatgcattc 600 aaaaatgctg agattcttaa agggtactcg agaagtctct gctactcggt ggtcaggagt 660 ggcagaacca aggctcattc caggcaccgt gcccaggatg aggctttggc tggccctatg 720 tgccagctta tcctaagtat agtacagtgc tcagggtaat caggtcaggc accataatag 780 caatcaagta aggtcctgct gcggctttca tgcggctata gtaagattaa ccgtggttaa 840 cctcgccatc cctgcagtta acctcgccat ccccgcagtt aacctcgcca tccccgcgct 900 ggcttactca tatggctaat cagcatctaa tatacctcct aagcatgtca aattcgtgtc 960 atctgctacc atgaaggcct gatcagagaa aatacttgta agtctatctc tctatagcct 1020 cccttggtct ttttgttacc cctgatagag atgcttttct actggcaaac tttataatat 1080 ctcaattata tacctctttc tattcagact ctctctttga tgctgagaat atactactag 1140 atagtcttcc agatatatca gagctcccaa gcagccagcc aagtctctat aaggcaaatt 1200 ctagcttcct tcctccacca cctttaccag cctgcgcccc aaaagatcta gtacacatca 1260 ggcctctact ccagcaagtg taggttttat ataacacata ccctgatatg gaggactcct 1320 ggaagcaatt tgttaagtag tggcttataa ctggatatag caagcaatgt gaagggcagt 1380 caaatataca ttgggatggt aagaaaaaag ctacagtctg gcaaggattt gagcaagttg 1440 tacatgagca aagcaggcaa ccaaaggtta tgtgtaagga gtataaagca atacttgcat 1500 atctagctgc aaagtatact ggaatatctt ctctccaata ttattttgct aaaggaggtt 1560 
gtcgggtaca gaaggtagcc aagaagggag ttgatcagat attacaagaa atggtatatt 1620 tttaatagaa ggaaacagta tattattact aaccttcttt atagccctaa tggtccctga 1680 tatatttcaa taagcatctt cccaaccaaa agatacttaa tttgataaca actgcctgtc 1740 tcctattccg aattgttgaa tacccagcct tctatgacct cctccaaact gcccggcatg 1800 ctgaatcaca gcttaatatc ctgtctgcat gcagtattca acaccttctt gataataatg 1860 ttactgcaag ccagctaaat atgcttaata gacttccagc tgggagtagt ctattaattg 1920 tactagactg ctggatatcc ctcttctccc aagcatttat ggcagttact ggctacttct 1980 tagatcaaga ttggaattac tgtgaggtgc ttcttggatt tgaactacta gatagctcat 2040 attctggtac ttatctcagt aaaactgtga tcaaggtcct tcagcagcat aatatcatga 2100 agagagttct cttagttact actaacaata tattaaacaa caatacaatg gttgcaggta 2160 tccaggaagt tggctaatta cttgggcttg gtgaagatca gctcttctgt attccttgta 2220 taattcatat tatacaatta agtctcaggg aattgcttgg agagatgaaa gcaaacccgg 2280 tcaatgataa gattagatta atataggttg acatgccaca acaatcaaag cagtcaaaga 2340 atcaacaaca ccctgatatt ataaagactc taaagaaggt aagtaatact agctttcttc 2400 ccttttccta gcagaaaagt tataccttct gtttttaaac ttagttacta acagctatct 2460 attctcctta tagattcaag atctagctgt ctttattaat gccagtccgc agcgctggaa 2520 ggtatttctt gagctacaaa caacagagcc aagactggta ccaatccaag atatttatat 2580 ataatagaac tcaaccttcc taatgcttga tcaggctcaa aggattcagt cagatattga 2640 tcaatactgt gatacttatc accatactta atttaaactc aatccagagg agtggcgtca 2700 ggttgagtat cttttattac ttacaaagcc tttctttgac tttaccaacg tgctgtcaaa 2760 gataagagat gtgactatcc agcatatctt cagtatctat aataagctat tcaactatct 2820 tgatcaggct aagatgaggc ttaaatacaa agctgttccc tggaagaaga atatacttac 2880 agcaattcag gctgctaata caaagctccg gaagtattat actaaaacta ataatcagct 2940 atatagttca gtttatgcta ttgcaactat tctaacactg tcaaagaagc ttcggtactt 3000 tgataatgca gactagagag gccttgataa taataggagg ccagttaact tcataaagca 3060 ctatcaaaat atcctccaag caaggtttaa gctttattaa cagctgcatt caaaggaagc 3120 tgagcctatt aatatagaga ggatcttcta atcagcaggg gataagcttg aggaggtgta 3180 taattcacag actgctcttc aggctgaggt taatcagcca gataatgaaa ttgcccggta 3240 tcttgcaaag 
ggttagtatt atctaatact attctagact atatcatatc tttactaaca 3300 gttagatagg gcttactaag ggtaatcccc gcctcttctg gaaagagcat gaggaagaat 3360 atccagttct tgcttgcctt gcacaggata ttctttcagt accagcaagt agagctggta 3420 ttgaatgcct ctttaattgt gcttgtaaca tatgtcacta tcaccgcggg cagctaaaag 3480 agaatacaat ccaagacctg ataattcatc tcttttcaac caaatttgag cttaagcaca 3540 gtgagcttga tctaataaag gagcagctct ctattggtga ggctgcatta tctgatcagg 3600 cagctaaacc tgttcctata cttgctgagc ttgatccgat aagtaaagat gaggaggata 3660 aggaagaagg ggggaaagca gataatattg ctgattcaga ttcagatttt gattcagatg 3720 ctgtggcctc taagcaaata ttacctgcaa cccagcccac agtatgtatt aagcaggtac 3780 aggcccagaa taagcaggct cacagtcaga tcaaaactct gaatggtaca cttagtcaat 3840 tgcctcatgc atcactaggt gaggaagggc gaccagaaag gcatcggaag cagccaaagc 3900 taccagctgg gtttgagata gatagaacat aatattaata accaatatga tcagaaataa 3960 gaataagagt atgaaataat cagcataata gagtatgtac taaagcatca taagtaattg 4020 cttacttaaa gccctaaatt accaatctgc accacagtac atacatcctt gtatacaaat 4080 aattatacaa aaaacagcat aatctatgca ttctcataat cttggcaggt ttatattgtt 4140 tatactagtg aggttaaccg cagtgaccgc ggttataacc tagctataac cataacctca 4200 ccgcagagca aggttatcag aatctcatga ccatcacctc accgcgcggt tatggtgatg 4260 gttaaccgcg gctaaccgcg gttatgccgc agccta 4296 // ID Copia-67_MLP-LTR repbase; DNA; FNG; 843 BP. XX AC AECX01000620; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-67_MLP_; KW Copia-67_MLP-I; Copia-67_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-843 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000620; Positions 79260 80102. 
XX SQ Sequence 843 BP; 253 A; 164 C; 134 G; 292 T; 0 other; tgttgaaaac tacaatcgac gagaagagaa aacagtagtg taatcgaatc actacacaag 60 aatggaagcg acattccgca gaccagaaaa gacaaacagc aagcttcatt atcagacaag 120 ctacattatt agacaagata caagatcatc actaacattg ccccaaccac atttcatttc 180 aatttttcct ttcatatata ctattgtgtg tttcgagtac ttctcttttc cccatctttg 240 ggaagagaag tacgtacatt ttcatctttc tcactaaaaa tgaatgacaa gttcgattaa 300 catttgtaat tgatacgttt ctattcctac tctttatctt tcagaactcg tatctcttgc 360 gctagtccaa agatcagtta tcaagcttga gataacttta actaacaaaa tctaacttgt 420 tattctgaac ctgattatag gtacttatta ttatatctct atttctttct taactctatc 480 taactatttg ttcgatgtta taggtcagaa gacctcactt agatttcttg ttggttgtgt 540 agcgtgctta tcaaagccac tgtgcttttc attaggtgaa taaaatttat aatctttggg 600 aagagaaaac tcgtgtctct tgcgctagtc caaagatcag ttatcaagct tgagataact 660 ttaactaaca aaatctaact tgttattctg aacctgatta taggtcagaa gacctcactt 720 agatttcttg ttggttgtgt agcgtgctta tcaaagccac tgtgcttttc attaggtcag 780 aagacctcac ttagatttct tgtcggttgt gtagcgtgct tatcaaagcc actgtgcttt 840 tca 843 // ID PCretro3_I repbase; DNA; FNG; 4201 BP. XX AC DQ097840; XX DT 08-MAR-2006 (Rel. 11.02, Created) DT 08-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Phanerochaete chrysosporium RP-78 Ty1/copia LTR retrotransposon DE (an internal portion). XX KW Copia; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; PCretro3_LTR; internal portion; PCretro3_I. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-4201 RA Novikova O., Fursov M., Shutov O. and Blinov A.; RT "Divergent groups of LTR retrotransposons from Phanerochaete RT chrysosporium."; RL Direct submission to GenBank (2005). XX DR EMBL/GenBank/DDBJ; DQ097840; Positions 444 4644. 
XX FH Key Location/Qualifiers FT CDS 178..4191 FT /product="PCretro3_I_1p" FT /translation="MSDAPHTVRIEPLKGSENYFSWSIQEKDILTELDLLE FT YATGETTCPEGTAATAWKRKDRKAISAIRLRCSPAVAIHITSCETAKSAWD FT TLKNMYATQGTMAKVTLVRRLQKYEMSEGADMEDELRKLTEMRAEAAEAGG FT KITDEDFAFNILSALPPSWESFVAAQNNVEKSSDVIGRVLSENLRRQSKGD FT TTTALVARNGKPGKKPKKSKFLKGVFCHGCGKEGHLKNVCRSSNQGSGAGG FT NTSRAHLAESREASNDSYSFVATTLAYASTPGTECWLGDSASERHIVRDRA FT AFKNLTPTPGHTIVGVGSTAALGQGDVDVVFTTPKGKITATLRNCLWSPNL FT PHNLLALGRLTSAGMSFHGTGNLLHIKDRDRVIAVGHKMGQLYRMDMTSRS FT SSGHTPAPSTLAFAARNSARTWYEWHCALGHINATQLQEMYRKGLVEGMDV FT DTSSDPGFVCDACIQAKHSRAPFPEIASGTVDQVADLIYSDIWGPARVASL FT QGNVYAITFTDAKSRFVAVDFMKTRDAALDRFQKVEQLIERQLGRRVKVLH FT VDNAKEYTEGKFRAYAESRGIIIRTTAPYSPAQNGVAERLNRTLMERARAM FT LIARSLPKFLWQEAWAYACYLRNRTPTRALSGKTPYEALWGQKPDVRAARD FT FGTPCWVLVPENRRDKLAAKSERYIFTGISASSAGWRYYVPGLRQVLVSRD FT VIFERERESPSVPFTLEGESETAPEPSPAAGATPPPPTSQAPETSTSESVK FT ASPSTPNKPEDAAWAHFKSQLRNVTIRPTRQKTRIDYRALHETGTRIPKPP FT SEDDDSDPSPDQYHSAQAYFCYAAMDSDHPRSVDECKTRDDWPQWKAAMDA FT EMAQLEANGTWKKGELPPGRKAIGSKWVFAIKRHQDGSIDKYKARLVAQGF FT SQIAGQDYFDTFSPVVRQETFRVATALAATENLDSDALDIVGAYLHGPLEE FT EIYMRQAPGYDDGSGQVYVLIKALYGLKQAGRVWNHLLNHVLTSLMGWTRS FT EADPCLYFKHEGKLNMALVHVDDTALYGERSILDRFKADVAKHFAITTNGT FT LSSFVGLQVTRKNGAISILQTRYLETILERFGMQDCKPVSTPLDPGVKLEP FT TPEDQTPADVPYAAAIGSLMYAATGTRPDIAFAVQTLSQFTSRPSATHWTA FT VKHVFRYLKGTTDVGLTFARRADDDITGYSDADWAQAHDRRSVSGYAYLLA FT GGIVSWNSKKQPTVALSTMEAEYIALSHAAKEAVWLRRLLTELGFPPGAPT FT VLHTDNQAAISFAHDTQFHARSKHIDIRHHFIRERITDGDIKVIHCASADN FT IADMFTKALPRPKHRAALALANMATR" XX SQ Sequence 4201 BP; 899 A; 1518 C; 1125 G; 659 T; 0 other; ggttatgagc cccgcgctct aagtcgcccc aaacaccatc tcaccctttg ctacggtcaa 60 cgcggaccga atcttgctgc tgcagatcca tcgcggatca ctcctgtgac atcaccggac 120 gtcagatacg tcggacacac ctaggcgcca gatcgctgat ctcgccacgc gagcaagatg 180 tcggacgcac cgcacaccgt acgcatcgag cctctcaagg gctcagaaaa ctacttttct 240 tggtcgattc aggagaagga catcctcacc gagctcgacc tactcgagta cgccacgggg 300 gaaacaacgt gccccgaggg gaccgctgct accgcctgga 
agcggaaaga tcggaaggcg 360 atctccgcga tccgcctccg atgctctccc gccgtcgcca tacacatcac gtcttgcgag 420 accgcgaagt cagcgtggga cacgctgaaa aacatgtatg cgacccaggg caccatggct 480 aaggtcacgc tggtgaggag gctacagaag tacgagatga gtgagggggc ggacatggag 540 gacgagctga gaaagctcac cgagatgcgg gccgaggccg cagaagccgg cggcaagatc 600 acagacgaag acttcgcctt caacatcttg tccgcgcttc cccccagctg ggaaagcttc 660 gtcgccgctc agaacaacgt cgagaagtcg tccgacgtca tcggacgcgt cctgagcgaa 720 aaccttcgac ggcaatcgaa gggggacacg accaccgcgc tcgtcgcgcg caacggcaag 780 cccggcaaga agccgaagaa gtctaaattc ctgaaggggg tgttctgcca cggctgcggc 840 aaggagggac acctgaagaa cgtctgccgt tcctcgaacc agggatctgg cgcgggcggc 900 aacacgagcc gcgcccacct cgctgagtcg cgcgaagcct cgaacgactc gtactcattc 960 gtagcgacta ctctagccta cgcgagcacc ccaggcaccg agtgctggtt gggggacagc 1020 gcgtccgagc gccacatcgt gcgcgaccgc gccgctttca agaacctcac ccccactccc 1080 ggccacacca tcgtcggtgt cggcagcaca gccgcgctcg gacaaggcga cgtcgatgtc 1140 gtcttcacca cgccgaaggg gaaaatcacc gccaccctga ggaactgcct ctggtcgccg 1200 aacctcccac acaacctcct cgcgctaggc cggctcacct cggccggtat gtctttccac 1260 ggcaccggca acctccttca catcaaggac cgcgatcgcg ttatcgctgt cggtcacaag 1320 atgggccagc tctaccgcat ggacatgacg tcacgatcgt cctcgggaca tacaccggcg 1380 ccttccactc tcgcgttcgc ggcgcgcaac agcgcgcgca cctggtacga atggcactgc 1440 gcactcggac acatcaacgc gacgcaactc caggaaatgt accgcaaggg cctcgtcgaa 1500 ggcatggacg tcgacacgtc ctcggaccct ggattcgtct gcgacgcctg cattcaggcc 1560 aagcactcgc gcgctccgtt ccccgagatc gcatcaggca ccgtcgatca ggtcgccgat 1620 ctcatctact cggacatctg gggaccggca cgcgttgcgt cattgcaggg caatgtctac 1680 gcgatcacat tcaccgacgc gaagtcgcgt ttcgtcgcgg tcgatttcat gaagacgcgc 1740 gacgccgccc tcgaccgttt ccagaaggtc gaacaactca tcgagcggca gctaggccgc 1800 cgcgtcaagg tcctccacgt ggacaacgcg aaagaataca cggaggggaa attccgcgcc 1860 tacgccgaat cccgcggcat catcatccgc accacggcac cctattcccc cgctcagaac 1920 ggcgtcgcgg agcgtctcaa ccgcacgctg atggaacgcg cacgcgcgat gctgatcgcg 1980 cgctccctcc ccaagttcct gtggcaagaa gcgtgggcgt acgcttgcta ccttcggaac 
2040 cgcaccccca cccgcgcgct ctcaggcaag accccctacg aggctctctg gggacagaaa 2100 ccggatgtgc gcgctgcgcg cgatttcggc acaccctgct gggtgctcgt ccccgagaac 2160 cggcgcgaca aactcgctgc gaagagcgag cgatacatct tcaccgggat cagcgcgtct 2220 agcgccggct ggcgctacta cgtcccaggc ctccggcaag tcctggtctc gcgcgacgtc 2280 atctttgagc gcgagcggga gtctccaagc gtccccttca cgcttgaggg ggagagtgag 2340 accgctcccg agccatcccc cgccgcaggc gcgacaccgc cgccgccgac atcgcaagcg 2400 ccggaaactt ccacctcgga atcggtcaag gcgtcaccct cgacgccaaa caagcctgaa 2460 gacgccgctt gggcacactt caagtcacaa ctcaggaacg tcacgatacg ccccacacgc 2520 cagaagaccc gtatcgacta ccgcgcccta cacgaaaccg gcacacgcat cccgaagcca 2580 ccttcggagg acgacgactc cgatccgtca ccggatcagt accatagcgc ccaggcgtac 2640 ttctgctacg ccgctatgga ctcggaccac cctcgctcgg tcgacgaatg caagacccgc 2700 gacgactggc cgcaatggaa ggccgccatg gacgccgaga tggctcagct cgaagccaac 2760 ggcacctgga agaaggggga gctgcccccg ggccgcaaag ccatcggcag caaatgggtc 2820 tttgccatca aacgccacca ggacggctcg atcgacaagt acaaggctcg cctcgtcgca 2880 caaggctttt ctcaaatcgc cggacaagac tatttcgaca ccttctcacc tgtcgtccgc 2940 caggaaactt tccgcgtcgc cacagctctc gccgctaccg aaaacctcga ctccgacgca 3000 ctcgacatcg ttggcgccta tctccacgga ccgctcgagg aagagatcta catgcgccag 3060 gcacccggct acgacgacgg atcaggacaa gtctacgtcc tcatcaaagc cctctacgga 3120 ctcaagcagg caggccgcgt ctggaaccac ctgctcaacc acgtcctcac aagcctcatg 3180 ggctggacgc gctccgaggc cgacccttgc ctgtacttca agcacgaggg gaaactcaat 3240 atggccctcg ttcacgtcga cgacaccgcg ctctacggcg agcgctccat cctcgatcga 3300 ttcaaggccg acgtcgcgaa acacttcgcg atcaccacga acggcaccct cagctctttc 3360 gtcgggctac aagtcacgcg caagaacggc gcgatttcta ttctacaaac acgctacctc 3420 gagacgatcc tcgaacgttt cggcatgcaa gactgcaagc ccgtctcgac gccgctggac 3480 cctggcgtga aactcgagcc gacgcccgaa gatcagacgc ccgccgatgt accttacgcc 3540 gccgccatcg gatccctcat gtacgctgct accggcacgc gtcccgacat cgccttcgcc 3600 gtccagaccc tctctcagtt cacatcgcgc ccgtccgcca cgcactggac cgccgtcaaa 3660 cacgtcttcc gctacctcaa gggcacgacc gacgtcgggc tcactttcgc aagacgcgcc 3720 
gacgatgaca tcaccggcta ctcggacgcc gactgggctc aggcacacga ccgccgctca 3780 gtatccggct acgcctatct cctcgccggc ggcatcgtca gctggaactc caagaaacag 3840 ccgaccgtcg ccctctccac catggaggcc gagtacatcg ccctctccca cgccgccaag 3900 gaggccgtct ggctccgccg cctacttacc gaattaggct tcccgcccgg cgctcccaca 3960 gtccttcaca cggacaatca ggccgccatc agtttcgcgc acgacactca attccacgcg 4020 cgctcgaaac acatcgacat ccgccatcac ttcatccgcg agcggatcac cgacggcgac 4080 atcaaggtca tccattgcgc ctccgcagac aacatcgcgg acatgttcac caaggcgctc 4140 ccgcgaccca agcaccgcgc cgcgctcgcc ctcgccaaca tggccactcg ttgaggggga 4200 g 4201 // ID FOTYL repbase; DNA; FNG; 3001 BP. XX AC AJ745091; XX DT 27-SEP-2005 (Rel. 10.09, Created) DT 27-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE Yarrowia lipolytica transposon Fotyl. XX KW Mariner/Tc1; DNA transposon; Transposable Element; FOTYL. XX OS Yarrowia lipolytica OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Dipodascaceae; Yarrowia. XX RN [1] RP 1-3001 RA Casaregola S.; RT "Fotyl, a DNA transposon of the dimorphic yeast Yarrowia RT lipolytica."; RL Unpublished (2004). XX DR EMBL/GenBank/DDBJ; AJ745091; Positions 1 3001. 
XX FH Key Location/Qualifiers FT CDS 402..2678 FT /product="FOTYL_1p" FT /translation="MLNNQERRKRKQYPPESLYKACMEVIRTNSTHKAAGK FT KWNVPDSTVSWRIRLLKQHGSLQPDWIEKMRISYKELALKMNWAHSNNKKL FT SEAHERGLLDHVKFQHSIARPVKLKDISKIANYLLKANGNTETVGKNWASN FT FVTRHNLLEVKRPVSLEQARIDGTTHDKLEEFITTITSQTLEEVLPENRWN FT MDETGFQQGEGRGGQVVGLRGEPAIQSVTGRSATTTVIETVSATGKSTRAL FT VIFKGKTVQGQWFPKGSRAEDYKDMVFEFSDSGFTNNNTAWKWLTQVFIPD FT TMPKDKYGNDAPHIRRILYMDGHHSHTQKRFLLKCIEHNITTVLLPPHTSH FT ITQPLDVGVFSSMKHWYKDLAGNEGKVGNLTDAGPHVFLTVINEVREKALT FT ESNIASGWRASGLWPLDPEKIRSNPRLIRENGNKEPLGAVRQDYTSQDTVR FT AIFPINTPTPAVPDMPLQPSLEELENSIHDDTHDETYKCTSESESDGSGSS FT DSSDSDVSASDIIVNDGAMDASDYEFLAAVNEELLERQNNANQVDADETLG FT SQQAFHNFSSSHWHHDDCIEKIFCKETYPPMENPPTGGNALLTWAHSTHPA FT MRDFLIHGLKPFVEMIDRLVGDLYHMQEDATMRQHEIDIVRSKTKRIRVNR FT SAPDQVISSQDVFAAIGAREDRGNARRERSARSTGEEPTQEEPTQEEPTQE FT DSTRQQSTRQQSTRQRSTRRAGPFSKVPPSLFADSPTSDTSSGPQDDNCPS FT SDSLMLSD" XX SQ Sequence 3001 BP; 853 A; 752 C; 735 G; 661 T; 0 other; taggcgtagc ccaagcactt gataatcctc ccaagcccca agtggcctta tttgtgcgtc 60 tctgatgcac aaaattagca acctccaaac tttatttgac tcagggtggc cgatttgatt 120 ggctgaattt acagtgactg tctagagtag gggttttgta gggggacgga gaacatgatt 180 ctttccgtct gttggatagc aacaatcacg tctattgtag tatagtaagt gctggggagg 240 tttagttcaa aagaggttgg ccccaaagat acatgcacat gacccataga ccaagacacg 300 tcaattgtga ctgtgccaac ccgtatctcc aatagctatg ttctgatagc tactgtagct 360 atcaactcac atccactcag atcaccttac ctgaaacagc gatgctcaac aatcaagaga 420 ggcggaagag gaagcagtac ccaccagaga gcctctacaa agcctgcatg gaggtgatca 480 ggaccaactc aactcacaaa gcagcaggca aaaaatggaa cgttcctgat tctaccgtct 540 cgtggaggat ccgtctactg aaacaacatg gttctctaca gcccgactgg atcgagaaga 600 tgagaatttc ctacaaggaa ttggcgctta aaatgaattg ggctcatagc aacaataaaa 660 aattgtcaga ggctcacgag cggggtctgt tggaccacgt taaattccaa cactccattg 720 ccagacccgt caagctaaag gacatctcaa agattgccaa ctacctgctc aaagcgaacg 780 gcaacactga aaccgtgggc aagaactggg cttccaactt cgttactcgt cataatcttc 840 tcgaagtcaa gcggcccgtg tctttggaac aagccagaat 
cgatggcacg acacacgata 900 aactcgaaga attcattaca actatcacct cccagactct ggaggaggtc cttccagaga 960 accgctggaa catggacgaa actgggttcc aacaaggaga aggaagaggc ggacaagtgg 1020 ttggactcag gggtgaacct gccattcaat ctgtcactgg cagaagcgca actacaacgg 1080 ttattgaaac agtctcggca actggaaaaa gcacgagggc cttggtgatc ttcaagggaa 1140 aaaccgtgca gggtcaatgg tttccgaagg gatctcgtgc ggaagattac aaggacatgg 1200 tgttcgaatt ctcagattct ggcttcacga acaacaacac tgcctggaaa tggttgacgc 1260 aggtttttat tcctgacacg atgccgaaag acaaatatgg gaacgacgcc cctcatattc 1320 gtcgaatact ctatatggat gggcatcaca gccacacgca gaagaggttt ctcctgaaat 1380 gcatcgagca caacatcact acggttctct taccacccca caccagccac attacccagc 1440 cgcttgatgt gggtgtgttc agctccatga aacattggta caaagacttg gctggcaatg 1500 agggaaaggt gggaaacctc actgacgcag ggccgcacgt attcttgact gtgatcaacg 1560 aggtgcgtga gaaggctctt accgagagca acatagcgtc tggatggaga gccagcggac 1620 tgtggccact cgaccccgaa aaaattcgct ccaatccgag gctgattcgt gagaatggaa 1680 acaaggagcc gctcggagct gtgaggcaag attacacctc tcaagacacg gttagagcaa 1740 tctttccgat caacacacca actcccgctg tccccgatat gccattgcag ccaagcctgg 1800 aagagcttga aaattccatt catgacgaca cccatgatga aacgtacaag tgtaccagcg 1860 agtctgaaag tgacggcagt ggctctagtg attctagtga ctctgacgtt agtgcctctg 1920 acattattgt caacgatggt gccatggacg ccagcgatta cgagttctta gcagctgtga 1980 acgaggaact attggaacgt caaaacaacg ccaaccaggt tgacgctgac gaaacactcg 2040 gctcacaaca ggctttccac aacttttcca gctcgcactg gcatcacgac gattgtattg 2100 agaagatttt ctgcaaagaa acgtacccgc ccatggagaa tccacctaca ggtggaaacg 2160 cccttcttac atgggctcat agcacgcatc ccgcgatgcg agatttcctc attcacggct 2220 taaagccatt tgttgagatg atcgatagac tggtgggtga cttgtatcac atgcaagagg 2280 atgccactat gcgacagcac gaaatcgaca tcgtcagaag caaaacgaag agaatcagag 2340 taaaccgaag cgcacctgat caagtaattt cgagccagga cgttttcgca gccattggcg 2400 ccagggaaga ccgcggcaac gctaggcgag agcgatcagc acggtctacg ggggaggagc 2460 ccacacagga ggagcccaca caggaggagc ccacacagga ggactctaca cgacagcagt 2520 ctacacgaca gcagtccaca cgacagcggt ccacacgacg tgccggtccc 
ttttcgaagg 2580 tcccaccctc gctatttgct gacagtccca cctctgacac cagctcgggg ccacaagacg 2640 ataactgtcc cagttctgac tctcttatgc tgtccgactg agtgcctcca ttgtagccta 2700 gcgcgctagg cgtgttgagt ttcggccaat gtggtttttt agtaatttga gtttttcttt 2760 gtcaactacc catgtacaga tgctgcaaag cctcaatact cacaaaaaac ggggaaaatg 2820 agtgatttat cgtcgttgta aaattgggcc acgtcatgaa gggggttttc gtacttaatt 2880 aattagctat gactatcgcg agatgttcaa tttcaccgaa ataagattag gctatacatg 2940 actctctgaa ataaggccac ttggggcttg ggaggattat caagtgcttg ggctacgcct 3000 a 3001 // ID Gypsy-9_LBS-I repbase; DNA; FNG; 6589 BP. XX AC ABFE01000442; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_LBS_; KW Gypsy-9_LBS-LTR; Gypsy-9_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-6589 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000442; Positions 93929 100517. XX CC Positions [2853-3284] - Reverse transcriptase CC Positions [4510-4995] - Integrase core CC 'GTCCA' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1824..3659 FT /product="Gypsy-9_LBS-I_3p" FT /translation="MRVNTRIINGKKTVETKALIDSGAQGIFMDERFAKEH FT RLPLLRLDKEIQVSNVDESPNKNGPIRFYTRLPTKIDGKVNSTRFLISNLG FT KEDVILGLPWLDRINPEVNWTKKTIKIIPDRIKKPNIRQAVDREIQILKIE FT SERKRMKSKKAFADQLRKIPDKKKKSTISIEEVPDEDATPKFERLTDDEEA FT ILIAVNELPDQYENMPALVPNTEDDDEDETEGDLVTSYLQEETITVSEPKK FT EEPLNQPIQIDSDITIRAKTSISQSLAHKEETKEEKTFEELVPKEYHQFRS FT VFKKKASERFPESRQWDHRIDLKPDFVPKRSKLYPLGQKEEEEMNKFIDEN FT LKKGFIKPSNSPQASPFFFISKKDSKALRPCQDYRYLNDYTVKNAYPLPSI FT DDLLNKLQGATVFTKLDIQWGYNNVQIKQGDEWKGAFITKRGLFEPTVMFF FT GMTNSPATFQAMMNDYFADMIAQGWVLIYLDDILIFSKDPRHQHGRTEKVL FT KRLEEKDLFLKPVKCVFNAKEVEYLGFIIRPNEISMDPIKLAGIQDWIPPK FT NVKAVRSFLGFGNFYQRFIGNYAETAKPLNELTKKTKVFEWSQECQTAFDN FT LKKEVSRKTSASHP" FT CDS 4015..5568 FT /product="Gypsy-9_LBS-I_2p" FT /translation="MIQSDALSRRPDHVPDNDNDNENLVLLPDKIFINLIN FT VELAKMIESTTASDELVKNISNVLSTKGIPPIKSSLSDWKIEDGKLFYQNR FT CYIPDSNGIRKLIVQEFHDSPMTGHPGRDSTLEMVQRHYWWPRLRHFVYEY FT VAGCATCQQNKINTHPTQPPTQPIKSTATRPFQMISQDFISGLPKTKRGYN FT CIMVVVDHGLTKGVIFIPTNKELTALEAAELHFDHTFKRFGIPDDIISDWD FT PLFVSKTYRGLMKLCGIKQQISTAYHPETDRETERVNRELELYLRIFCKRI FT PEEWDKNLSIAEFSYNGRPHSVTKQTPFYLMFGCEPTGIPIAFSKMNVPAV FT ERRLSELLKERENARAAHELARQAQIKRSKKNSPPFSKGDLVWLDGRHLNK FT GHKFPKIDSLREGPFKIEEIMGPVTYKLRLPPQWKIHNVFHRKLLTPYRET FT DVHGKNFPEPPPDLVEGEEEYEVDSIRDHRKRGKGYHYLVEWKGYSDETWE FT PESNLKNASEILKSYKKRKNLQ" FT CDS 5643..6575 FT /product="Gypsy-9_LBS-I_4p" FT /translation="MYASLFNSESFLTATLFIECLVQKEEHVCTLIKTCHD FT DPTLSLNLHLILEAIATEKRLEGKLRYKDENVLSLHHMVTTAIAAAIRHGI FT LNTIRDCDKLPAVPDYIPSHFAGRGIIQYFEKFDAEIHAYGKKFPTSPHIT FT TYHPYQRTPTPRCSKCKRLGHIRKNCSDYQCRGCLWWGPGHTTPNCETKKK FT SDKAWEWEKNMKPAGYDTTTGTQMWKERIYNWTGIPILRNEDWNTPRKQPE FT FVDLTSPSPPSSPPFSPSMPPLAPPTPPSSPLYNPYSPTTHPSPLFDLEDL FT VSDFDSITSSSDNIGDIEV" XX SQ Sequence 6589 BP; 2334 A; 1638 C; 1239 G; 1378 T; 0 other; aaaggttcaa gcttatacca gaacagaagg cgcatcaccg ccctctaccc gaatcagaca 60 cactggacaa cgtaacaaga attatacgtc ttaccgaaga gtataacaac agaattgaaa 120 
acgttataag tctcatcaga gagtataacg aaagaatcaa agacatcatc atattgacag 180 aaggcgtgtc caacccacct cctacccaaa gaaatcatgg ctgctcgaag agagtcagac 240 cgttggacac tacgagcacc agcagaaagc tcagactcag aacccgaacc aagaaccgac 300 accttcacat ccggtgaaga acagtacgaa agcgcagacg agggtagcag tccccctgca 360 tcacagacac cagtaacagg aacatcgaca cctgtatcga caacagaaag atatagacga 420 ataacaggac caggaccatc atcgttcccc tttagcagac gaaccacacc cagaggatca 480 ccagcgatca ccagaacacc cagcagaagc gtacagccgc tattaggacc aatcatcccg 540 acgtcgacat ttacaccgat agatttagcc agtttaggta ttacacctac cgacgaaaga 600 accgaagcag ttgaacagtt caacacattc gcagtaaacc cataccccga gtcaccagca 660 gtacctggaa ctcccagaca gacagacaca ccagtagaac ccaagaaccc aacagaacca 720 agacaaagaa tggccaatgc tgctaacgcc aaaggagact gatgccgcac acccaaagat 780 tttgatggaa acgaagacaa gtacaagact tggcttagaa cagtacagac ctacatggag 840 gtcaacgatc acttgttcag caccgacaag cgaaagatcc agttcgccct cagttttatg 900 atcacaggca gagctgctga ctgggcagaa cactttacag acacccactt gaataccgat 960 ggagtattca ccacagcatc aacgtggaaa gaatttgtcc aattgttaaa tgaaacattt 1020 gacatcagaa agatgaaaga taaagcgaga gtagacctct caatccttaa acataaactg 1080 ggacaactcg aacagtacat ccttgacttt actgcacttg ccaatcgggc aggatatgtt 1140 ttaacaggaa acacagagaa tccggtcctc tctcagcttt ttctcgaaca cttaaatccc 1200 ttgttacggg aaaagatcga gacacagaaa gaaccacccg agacactcaa agtcatcata 1260 gaagatgcta ggaaattcga caagtcatat tataagaacc aagcatggaa gacgaaatta 1320 atgggatggc aaccaactag gagcatacct cgtcaaactc ctcgaccatc ctatactccc 1380 agatctcgag atccagacgc tatggatatc gacagattaa ccatcgaaga acgaaacgaa 1440 tatatgaaga aagggttatg tttccgatgc ggaaaaacag gacacatgtc tcgcgaccat 1500 gcgaccaatc ccgcccttaa taccggaaaa ggaaacacgt ccaaacccgc cactccatat 1560 aaggctatca tgccaccacc aaacaagaac gccgcacaga aagtccgtgc tatcatggca 1620 ggactcgatg acgaagaatt agaagaagca aaagttgctt ttatagaatc tttggataaa 1680 aatccaaacg aaccagccga agaatcagac gatgaagaca agggttttca gtaagaaggc 1740 tagcaacgac gtctagacct tctaaacaat tttccattaa agctcttaaa attcgaagtg 1800 tacctaatga cgactcaaaa 
tcaatgcgcg ttaacacaag aattattaat ggaaagaaaa 1860 ccgtcgaaac gaaagcgctt atcgatagtg gcgctcaagg aatctttatg gacgaacgat 1920 tcgcgaaaga acacagactc ccgcttttga gattggataa ggaaatccaa gtctcaaacg 1980 tagatgaaag tccgaacaag aatggaccta ttaggttcta cacccgactc ccaacaaaaa 2040 tcgatggaaa agtaaattcc acccgatttt taatttccaa cctaggaaaa gaagatgtaa 2100 tccttggatt accatggtta gacagaatta accccgaagt aaactggacg aagaagacca 2160 tcaaaattat cccagacagg ataaagaaac cgaacatacg acaagccgta gaccgagaaa 2220 ttcagattct aaagatagaa tccgaacgta aaaggatgaa gtcaaagaag gctttcgcag 2280 atcagctgcg aaaaatccct gacaaaaaaa agaaatcaac gatctctatt gaagaagtac 2340 ccgatgaaga cgcaacccca aaattcgaaa gattaacaga tgacgaagaa gctattctca 2400 tagccgtgaa tgaacttccc gatcagtacg aaaatatgcc ggccctagta cccaacacag 2460 aagatgatga tgaagacgaa acagaaggag atcttgtcac ctcatatctc caagaagaaa 2520 ccataacggt atccgaacca aagaaagaag aacctctaaa tcaacccatt cagattgatt 2580 cagatattac gatcagagct aagacatcta tatcgcagag tcttgcacac aaagaagaaa 2640 ctaaagaaga gaagactttc gaagagctag tcccaaaaga ataccaccag ttcagatcag 2700 tattcaaaaa gaaagcatca gaacgctttc ccgaatcaag acaatgggat cataggatcg 2760 atctcaaacc ggatttcgta ccgaaaagat cgaaactcta tccactagga cagaaagaag 2820 aagaagaaat gaacaagttc attgatgaaa acctaaagaa aggtttcatc aaaccatcaa 2880 attcacctca ggcttcacct ttctttttca tatcaaagaa agacagtaaa gcgctgagac 2940 cctgtcaaga ttatcgctat ctgaacgatt acacagtcaa gaacgcatat cccctcccat 3000 ctatcgacga tctcctaaac aaacttcaag gagctacagt cttcacgaag ttagatatcc 3060 aatggggata taacaacgtc caaattaaac agggtgatga atggaaaggt gccttcatca 3120 ctaaaagagg actattcgaa ccgacagtaa tgttctttgg aatgaccaat tcacccgcaa 3180 cattccaagc catgatgaat gactatttcg ccgatatgat cgcccaagga tgggttttga 3240 tctatctcga cgacattctc atattctcca aagaccccag acatcaacat ggacgaaccg 3300 agaaagttct aaaaagacta gaagaaaagg atttattcct taaaccagta aagtgcgttt 3360 tcaacgcaaa agaagtcgaa tacctgggtt tcatcataag gccgaacgaa atctccatgg 3420 accccataaa gctagctgga attcaagatt ggattccacc gaagaatgta aaagccgtca 3480 gatcattttt aggatttggc aacttctacc 
aaagatttat tggcaactat gcagaaactg 3540 ccaaacccct aaatgaactc actaaaaaga caaaagtttt cgaatggtca caagaatgtc 3600 aaacggcctt cgataactta aaaaaagaag tttctagaaa aaccagtgct agtcatccct 3660 gatccaacaa aaccgtttta tgttgaatca gatgcatcca aatgggctac aggagccgtt 3720 ctccgacaaa gagatacgaa cagagatttg aaaccatgcg gttacatttc tcacagtttt 3780 acccagacag aaagaaacta cgacatatac gacagagaac ttctaggcat catacgagcc 3840 ctagaaacgt ggagacactt cttagaagaa agtccacacc ccgtaacagt attctcagac 3900 cacaagaatt tgacatactt ccgaaagacg caaaaactga accgccgaca agctagatgg 3960 agtttatacc tatcaaggtt taatcttcaa ctcattcacg ttcccggatc aagaatgatc 4020 caatccgacg ctctatccag acgaccagat cacgtccctg ataacgacaa cgataatgaa 4080 aaccttgtct tacttccaga caagatattc attaatctta ttaatgttga actcgcaaag 4140 atgatcgagt caacaacagc ttcagacgaa ttagtaaaga acatatcgaa tgttctctct 4200 acgaaaggaa ttccgccgat taagtcaagc ctctccgatt ggaaaataga agatggaaaa 4260 cttttctatc aaaaccgatg ctacattccc gacagtaacg gaatacgaaa actcattgtg 4320 caagagtttc acgattcacc aatgaccgga caccccggaa gagacagcac attggaaatg 4380 gtgcaaagac attactggtg gccaagatta cgccacttcg tatatgaata tgttgcagga 4440 tgcgccacgt gtcaacagaa caagattaac acacatccca cgcaaccacc gacacaacca 4500 attaaatcca cagcaacacg accctttcag atgatatcgc aagattttat atcaggatta 4560 ccaaagacga aacgaggcta taactgcata atggtcgtgg tagaccatgg ccttacaaag 4620 ggggtaattt tcatcccaac caacaaagaa ttgaccgccc tagaagccgc tgaacttcat 4680 tttgatcata ccttcaaacg atttggaata ccagacgaca tcatatccga ctgggatccc 4740 ctttttgttt caaaaaccta tcgaggttta atgaaacttt gtggaatcaa acaacaaatc 4800 agtacagcat atcaccccga gacagacaga gaaacagaac gagtcaatcg agaactcgaa 4860 ctgtacctca gaatattctg caagagaatt ccagaagaat gggacaagaa tctctctatc 4920 gctgaatttt catacaatgg acgaccccat tctgttacaa aacaaacccc attctacctc 4980 atgtttggat gtgaacctac cggaatccca atagcattct caaagatgaa tgtccctgca 5040 gtcgaaagaa gattgtcaga actcctaaag gaaagagaga acgctcgagc cgcacatgaa 5100 ctggcacgac aagctcaaat taaacgatcc aaaaagaatt ctcctccatt cagtaaagga 5160 gacttagtat ggttagatgg gcgacacctt aacaaaggac 
acaagttccc aaagatagat 5220 tcattacgcg aaggaccctt caagattgaa gaaatcatgg ggccagtaac gtacaaactt 5280 cgacttccac cccagtggaa gatacacaac gttttccaca gaaaacttct taccccatac 5340 cgcgaaaccg acgttcacgg aaagaacttc cctgaacccc ctcctgacct tgtcgaagga 5400 gaggaagaat acgaagtaga ttcgatcaga gatcacagaa aacgaggcaa aggataccac 5460 tatctagtgg aatggaaagg atattccgat gagacgtggg aaccagaaag caatttgaag 5520 aatgcttccg agattctcaa aagctataag aagaggaaga atctccagta gaaccctcag 5580 acaatcgtct gaactcgcaa ttaagccaac caactttctc cgcaaccaac cctcctctaa 5640 acatgtacgc ctccctcttc aactctgaat ctttcctcac cgccaccctc tttattgaat 5700 gtcttgttca aaaagaagaa cacgtctgta cccttatcaa gacctgtcac gacgacccca 5760 ctctctccct caatctccac cttattttag aagctattgc caccgagaaa cgtctcgaag 5820 gcaaactcag atacaaggac gaaaacgtcc ttagtcttca tcatatggta acaactgcca 5880 tcgctgccgc catccgacat ggcatcttga acactattag agactgtgat aaacttcccg 5940 ccgtccccga ctacatcccg agccatttcg caggacgagg gatcatacag tactttgaga 6000 agttcgacgc agagatccac gcctacggga agaaattccc aacatcccct cacattacta 6060 cttaccatcc ttaccaacgc actcctactc ctcgctgctc aaagtgcaag agacttggac 6120 acattcgtaa gaattgtagc gactaccaat gtagaggatg cctctggtgg ggaccaggac 6180 ataccacccc caattgtgaa acaaagaaga aatccgacaa ggcatgggaa tgggagaaga 6240 atatgaaacc cgcaggatac gatacgacga caggaacgca gatgtggaag gaaagaatct 6300 ataattggac cggaataccg attctcagaa acgaagattg gaacaccccc aggaagcaac 6360 ccgagttcgt cgacttaact tccccttcac caccttcttc gccacctttt tcaccctcta 6420 tgccacctct cgcaccaccg accccaccat catctcctct atacaatccc tactctccta 6480 cgactcaccc gtctcctctc tttgaccttg aagacttggt cagtgacttc gactctatca 6540 cttcttcatc tgacaatatc ggtgatattg aagtttaaac ggggggtaa 6589 // ID Copia-3_PPM-I repbase; DNA; FNG; 4598 BP. XX AC ABWF01002028; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_PPM_; KW Copia-3_PPM-LTR; Copia-3_PPM-I. 
XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-4598 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01002028; Positions 72398 76995. XX CC Positions [1979-2494] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(80..3043,3047..4588) FT /product="Copia-3_PPM-I_1p" FT /translation="MPHFLRYRQTVTLARCAYSFASRTLRLRQGYEDSDLT FT DLPPYRSNSPLAHLARIDLDVLTFTSLVPSTSVPSAPVMSTTVAKEFSEVP FT KLSSDGSNYRIWLGRVERAAGACEAEELLVRAADASSAAELKLNKQMLNAI FT TGKFPDSLFKKYISVTEVHSVMSGLKMEFGMSTAASEAWTEAKLFSLRCTD FT ERKVRQHLDQLSELKDKLSEMNVKIEDRTYINAITTSIPRSFTPVVTAITT FT ATDIYNSTLGAGATRRIVTSAEIVKALRAEADSRAVLKVTDKNVTANAAFT FT NGRGGTQGRGRGRGGNRGRGANSSKSSTPRSEDDLTCFKCGGKGHRSPDCP FT SKKQFYQKAKKSEAASTSAGTSDNKTKTDAKPKEGATKGSSASAVVTAVES FT SAHFEEAWSACAAIPYEEAIVIDCSDLAAQEEVHIPDAYHAFAGIIESGHR FT VDIFDSGASRHMTPHLDRLSNFRTTAPHQIRAANSEVFYSHGVGDMLLHLP FT AENGGNHIRLKGVLHAPKMHATLISLGVIEAAGFAWIGYDGHLNICNKHND FT IVITVPREDHLYRIFYESTHASLATSAVPLSLFEIHKRLSHVNYGYLKAML FT RQNRLTGITLDPSHGEEVECKTCLVAKARRAPILPVRQSPLADKFGEHLHM FT DIWGPASVLTIDRCRYALTIVDDATRWLEMPLMRVKSEAFGKYVALEARLL FT TQYGVRVKTVQSDRGGEFLSTDMDAHLERAGTMRKLTVHDTPEHNGVAERT FT HLTTFNGVRAALAGSGLPKWLWGEALAYVVHVYNRTARRALQGRSPFEVRY FT GHAPDVSNLREWGCHVVVSVPTDSKLDARGREARWLGLDRTSNGHRIYWPK FT ERKISVERNVVFSSESPRVEGEHDFDIGPTSHVSDTASPPLKRLRHVPEPA FT RSPSLEPGEIREPVHPDIVTGKRKRTPSAKLRAIASGRAEGALTEITIDES FT EESEVLAYSVGAAASDSLGFDPRSVAEAKRRPDPKWQEAMRDEIKRLESRK FT AWVYVDPPPRQAGHNVVGSKWVFRMKHDARGEVTGYRTRLVAQGFTQVEGV FT DYFADDTYAAVCKLASIRVILSVAARNGWFTHQVDVKSAYLYGKLREDEKI FT YMRPPPDIELDGLTAGQVLLLLVALYGLHKSGRRWYVRLRKILEGFKLVRL FT EHDHAVFYRRHPDGEISIIFLHVDDMTLVCSLLANLLHLKEMIKAELEITD FT NGELHWLLGIEVKRNLEKHTIALSQRAYIDSIVARYGFSDAKPLAQPMDPH FT VRLSIDQCPVSTAEYAAMRDKPYLEALGALQYVSVATRPDITFAVGQLAQF FT GRNPGLAHWNALKRVYQYLKGTADVWLVLGGSEDNEIVGYSDADGMSTEGR FT 
RAISGYVFMLNGGAVSWSSKRQDLVTLSTTEAEYVALTHASKEAIWLRSFI FT KELFGDPEQPLPLRSDNQSAIALAKDDRFHARTKHIDIRFHFIRYAIAEGK FT ISLSYCPTEDMTADILTKALPSLKCKHFASSMGLSKV" XX SQ Sequence 4598 BP; 1028 A; 1292 C; 1189 G; 1089 T; 0 other; taacttacgt tgttaggtta tgagccccgc ctttggcggc tggggtgtct attagtactc 60 gcgtacatgc tggtattgta tgccgcattt tctacgctat cgacagaccg ttacactcgc 120 gcgctgcgca tattccttcg cgtcacgtac gctaaggctt cgacaaggtt acgaagactc 180 ggaccttaca gatctacctc cctatcgctc aaactcacca ctcgctcatc tcgctcgaat 240 cgacctcgac gttctcacat tcacttccct ggtcccctcg actagcgttc cgagcgcgcc 300 tgtcatgtcc acaactgttg ccaaagaatt ttccgaagtc ccgaagctgt cgtccgatgg 360 ctcgaactat cgcatatggc ttggccgcgt cgaacgcgct gccggtgcct gtgaagcaga 420 ggagttgctt gttcgcgcgg cggatgcttc ttcggctgca gaactaaagc tgaataagca 480 aatgctcaat gcgatcacag ggaagtttcc tgactcacta ttcaagaagt atatcagtgt 540 cacggaagtt cactccgtga tgtctggctt gaagatggag ttcgggatgt ccactgctgc 600 ttccgaagca tggacagagg cgaagctttt ctcgctacgc tgcactgatg aacggaaggt 660 tcgtcagcac ttggaccagc tgtccgagct caaggacaag ctctccgaga tgaatgtgaa 720 gatcgaggac agaacctaca ttaatgccat caccacatcg attccccgct ccttcactcc 780 tgtcgtcact gccatcacaa ctgctactga catttataac tcgactctgg gagctggcgc 840 cacgcgccgc attgtaacct ccgcagagat tgtcaaagcg ctccgcgctg aagcagattc 900 acgcgcggtc ctcaaggtca ctgacaagaa cgttacagct aatgctgcat tcaccaatgg 960 acgtggcggg acacaaggac gcggacgcgg acgcggcgga aatcgcggac gtggtgcgaa 1020 ctcgagtaag agcagcactc cccgttccga ggacgatttg acgtgcttca agtgtggtgg 1080 aaaggggcat cgttctccgg actgtccgtc gaagaaacag ttctatcaga aggcgaagaa 1140 gtcagaagct gcatctacca gtgcaggtac gagcgacaac aagacaaaga ctgatgcaaa 1200 gccgaaggag ggcgcaacaa aaggctcgtc tgcatcagct gttgtcacag ccgtcgaatc 1260 atcggcacac ttcgaagagg cctggagtgc ctgtgcagcg attccttatg aggaggcgat 1320 tgtcattgac tgctctgacc tagccgcaca ggaggaggtc catatcccgg acgcatacca 1380 tgcttttgct ggtattattg aatccggaca tcgtgtcgat attttcgact ctggcgcatc 1440 acggcacatg acaccgcacc ttgatcgtct ctcgaatttc cgcaccaccg ctccgcatca 1500 gattcgcgct gcaaattctg 
aagtatttta ctcacatggt gtcggcgaca tgctgctgca 1560 cctacctgcg gagaatggcg gcaatcacat ccgccttaag ggtgttcttc atgcacccaa 1620 gatgcacgca acccttatct cgttgggtgt tatcgaagca gctggctttg cgtggattgg 1680 ctatgatggc cacctgaata tttgcaataa gcataacgac attgtcataa cagttccgcg 1740 ggaggatcac ctgtatcgga tcttctacga gtcgactcat gcttcacttg ctacttcggc 1800 agtaccactc tcactctttg aaatccacaa aaggctcagt cacgtcaact atggctacct 1860 taaagctatg ctacgtcaga atcggcttac cggtatcaca ctcgatccat cccatggcga 1920 agaggtcgaa tgcaagactt gccttgttgc gaaggcccgt cgcgcaccca tattgcctgt 1980 gcgccagtct ccgcttgccg ataaatttgg cgaacatctt cacatggata tctgggggcc 2040 cgcatctgta ttgaccatcg accgctgcag gtatgccttg acgattgtcg atgacgcgac 2100 ccgatggcta gagatgccgc ttatgcgtgt caagtccgag gcgtttggga agtacgtggc 2160 tcttgaggcg cggctgctga cgcagtatgg tgtccgcgtg aagactgtcc aatccgatcg 2220 aggcggcgag tttctttcta ctgatatgga cgcccacctc gaacgtgctg gtaccatgcg 2280 taagctgacg gttcatgata cacctgagca caatggtgtt gctgaacgta cgcacctcac 2340 gacctttaac ggtgtgcgcg ctgcacttgc tggctccggt cttcccaaat ggctgtgggg 2400 cgaagctctc gcctacgtag tccatgtata caaccggaca gctcgccgcg cactgcaggg 2460 tcgttcgcca ttcgaagtgc gctacggtca tgccccagat gtgtccaatt tgcgggaatg 2520 ggggtgccat gttgttgtct ctgttccgac agactcgaag ttggatgctc gtggacgtga 2580 ggctcgctgg cttggcctgg atcggacgtc aaatgggcac cgtatctatt ggcctaagga 2640 gcggaaaatt tctgtcgagc ggaatgtcgt tttcagctcg gagtctccgc gcgtcgaggg 2700 ggagcacgat tttgacattg gacctacctc acatgtttcc gacactgctt cacctcctct 2760 caagcgtttg cgccatgttc ctgaacctgc gcgctcgcca tcacttgagc ctggcgagat 2820 ccgtgaacct gtgcaccccg acatcgtcac aggcaagcgc aagcgcacac cgagtgctaa 2880 gcttcgtgct attgcgtctg gacgagctga gggtgcactc actgagatta ccattgatga 2940 gagcgaggaa tccgaggtcc tcgcatactc tgtgggtgct gctgcatctg actcccttgg 3000 ctttgatcct cgctctgttg cggaagcaaa gcgtcgtccc gactgaccca agtggcagga 3060 ggcaatgaga gacgagatca agaggttgga gtcaaggaaa gcgtgggtat acgtggaccc 3120 gccgccgcgt caagccggac acaacgtggt tggctcgaaa tgggtttttc gaatgaaaca 3180 tgacgcccgt ggggaagtca ctggctaccg 
cacacgctta gtcgctcaag gcttcacgca 3240 agtcgaaggt gtcgattatt ttgctgatga cacctatgct gcagtctgca aactggcatc 3300 aattcgagtc attctctctg ttgcagctcg caatggttgg ttcacgcacc aagtggatgt 3360 caagagtgca tacctttatg gtaagctgcg cgaagacgag aagatatata tgcgccctcc 3420 tcccgacatc gagctcgacg gcctgacagc tggtcaagtc ttactgcttc tggtcgcgct 3480 ctatggactc cataaatccg gacgacgctg gtacgttcgt ctgcgcaaga ttcttgaagg 3540 gttcaaactc gtacggcttg aacatgatca cgccgttttc tatcggcgtc atcccgacgg 3600 tgaaatatcc atcatattcc tccacgttga cgacatgaca cttgtttgct cattgctcgc 3660 taatctactc catctcaagg agatgatcaa ggccgaactc gaaattacgg acaacggcga 3720 gctccattgg ctcctcggca ttgaagtcaa acgcaacctc gagaaacaca caatcgcact 3780 ctcacagcgt gcgtacattg actctatcgt tgcacgctac ggtttctcgg atgcaaaacc 3840 cctcgcacag cccatggatc cgcatgtccg gctctccatc gatcaatgcc ccgtctccac 3900 agctgaatac gctgcaatgc gagacaagcc atatctcgaa gctttggggg cactacagta 3960 tgtttctgta gccactcgtc ccgatatcac atttgcagtt gggcaactcg ctcaattcgg 4020 tcgtaatccc ggactcgcgc attggaatgc gctgaagcgt gtttatcaat acctcaaggg 4080 cactgccgat gtctggcttg tgctaggcgg ctcggaggac aacgaaatcg ttggctactc 4140 tgatgccgac ggcatgagca cagaagggcg tcgcgcaatc tcgggctacg tcttcatgct 4200 caacggaggc gctgtctctt ggtcatcaaa gcgacaggat ctcgtcacgc tgtcgaccac 4260 tgaggccgaa tatgttgcgc tgactcacgc tagcaaagaa gccatctggc ttcgctcttt 4320 catcaaagaa ctgttcggcg accctgaaca accccttcct cttcgctccg acaatcaatc 4380 tgctatcgca ctcgccaagg acgaccgctt tcatgcacgc accaagcaca tcgacatacg 4440 atttcatttt atccgctatg ccatcgctga aggcaagatc tcactgtcct actgcccgac 4500 agaagacatg actgccgata tcttgaccaa agcgctgccg tcgctgaaat gcaaacactt 4560 tgcttcgtcc atgggactct cgaaggtttg agggggag 4598 // ID Gypsy-41_MLP-I repbase; DNA; FNG; 5886 BP. XX AC AECX01002345; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-41_MLP_; KW Gypsy-41_MLP-LTR; Gypsy-41_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5886 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002345; Positions 10091 15976. XX CC Positions [4559-5038] - Integrase core CC 'CAGCT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 140..5473 FT /product="Gypsy-41_MLP-I_1p" FT /translation="MSDRRNLPANYSDNPEALLCRPRITENLISPLYSPPA FT PKPPRYNWYPEVPETPASEYRELTASQAIDSDQSRDTHPKPTTPVFPESAP FT PVDHSTSSDREIVQFFFGRLLDSKTTPRKSVTDAESTPRPPGAYFDTPHVP FT VSRMMAETSNMAMKGKTPLTKISEEDQIEYFKQQLKEASEKNEVLERRCER FT IDALESLVHELAQDREDRKAPVGDNSLQSSRSNNPFAQYMENGPVSPSPIV FT VHQVQPPREQGPPPINFSSRPTARTMPAPPAYDSESGQSLPLPPTAIPWTY FT PWVQSTPAVRWVAPEVASGVLPPASPRSPGLPPPPSAPIIQSQEEGNDSSD FT AHSDDSSSTVTASKIRLAEVPKYKGPYGRPAELFHWQWLTEQYFAVKDLNN FT DKERLKLLGSLLIDQQTSAWYQSARSELEGKTWSEVMDLIALGTLPEDWIA FT DIEQQLRKLQIKSGESFDTYIERAQALYRTISRFSQITPKELAQYICWGLP FT SIFDSSIRKDCLLMEDNFNMIIFISRAKATWRVLMDSGLVKEGRPRTSTTN FT PFVNHQPLGTAPSATGTGFRSTPRNPEERAENAWRYRQWMKHLGLCMSCKK FT KCNNPSCTETSSIFISVPPPDQFNAGPRPSRIDNRPPPGAPAQKPAGRPAM FT NNTATTRVAAVSELPDLSTAELERYAEADRVLVACLEEEKQERCVSSGNRE FT PSIIVDLVCNGVLIRDLFDSACETNLMDERLLKKARIPRRPLVKPVTVGLA FT LESEGAPKQLTHFAFANLDSKDPAIQFGATFFKVASLNGGYDIICGTPFLK FT LHKIDISIHRRCITHTKSGLVIYDRDESRMKRMKELRCNELNKIPNTELFA FT CVMENVRKVESARKLSEKEEMMFEEFKDLFPAELPAVAEGEDDFFPPTEQH FT ESSTVCHKIVLTDPNAVINGKQYGYPLKHREAWRRLIDQHLKAGRIRRSSS FT QYGSPSLIIPKKDPAADPRWVCDYRELNKVTVKDRSPLPNPDEAVRLVATG FT KVFSVVDQINSFFQTRMRKEDIPLTAVKTPWGMFEWVVMPMGLCNGPATHQ FT ARCEEALGELIGRICVVYIDDIVIFSDSVEEHEAHLREVLTRLKRANLYCS FT SKKSKLFRRKISFLGHEISEEGIRADPDQVNKVTSWKSPKSAKGLNSSLGT FT VQWLKKFVDGLGQYVATLAPLTSSKRKPSDFCWGAAEESAFENVKRLITTL FT 
PVLKKIDYDSDEAVWLFSDASGHGLGAALFQGKDWETSSPIAYESRTMSPA FT ERNYPVHEQELLAVINALQKWKLLLLGLKINVMSDHHSLVHLLKQRNLSRR FT QARWLETLADFDLDFKYLKGEHNSVADALSRKEAIAAVEIKARLDDDTRTS FT ILLGYKTDPFCIKLASALPLREDSVRVDDLMYLDGRLVIPSFRRLQQDLIS FT RAHESLGHLGSAKTLEQLRKEFFWVGMAKDVATFISACDSCQRNKAQTTLL FT SGRLQATDVPHAPLEDISLDFVGPFPKVNGYDMILSCTCRLTGFVRLVPVS FT QKDTAERTARRVFASWLSIFGAPSSMIGDRDKIWISRFWQHLNARLNVKVK FT LSTSYHPQTDGRSERSNKTLGQILRHLTASRHSKWLESLPTVEFAINSATN FT SATGVSPFEFVFGRQPRLFPTNLASQIRESDAHRWLEKRRADWAVWRDKLW FT ASQVDQALQYNKRRKQGVTLEEGDLVLVDSQDRQQVVGGKGRMVSKLRARF FT DGPYKIKTVINEGRNYELELPEGDKTHPVFHISKLKPYRVEEDIESSTGLA FT SKSKFPS" XX SQ Sequence 5886 BP; 1588 A; 1308 C; 1463 G; 1527 T; 0 other; ctttttttaa ctatatcttt tcaactttaa accttattag tcttggaatc cctgctgtat 60 aatcactccg actgagtcat tgaactctta ttgctgctgc cgctgttcgc tttattctgt 120 tttgctgttc tgatctcgaa tgtccgatcg aagaaacctg ccggctaatt actctgataa 180 tcccgaagca cttctttgcc gacctcgaat tactgaaaac ttgatctcac ctctgtatag 240 tccccccgct ccaaaacccc cgcgttataa ttggtacccg gaggttccgg aaactcctgc 300 ttctgaatat cgagaattga cagcaagtca agctattgat tctgatcaat ctagagatac 360 ccatccaaag ccgacaactc ctgtttttcc cgaatcagct cctcctgtcg atcattcaac 420 gtcgagtgat cgagaaatcg ttcagttttt ttttgggagg ttactcgaca gcaaaacaac 480 acctcggaaa tctgtaactg acgctgagtc tacaccacga cctccgggag catacttcga 540 cacacctcac gtaccagtca gccgtatgat ggcggaaacg tcaaatatgg ctatgaaggg 600 caagactccg ttaacgaaaa tatcggaaga agatcaaatc gagtacttta aacaacaact 660 gaaggaggca agtgagaaaa acgaggtttt ggaacgacgg tgtgagagaa ttgatgcgtt 720 ggaatcactc gttcatgagt tggctcaaga ccgagaagat aggaaggccc cagttggaga 780 caatagctta cagtcttcgc gttccaacaa cccgttcgct caatacatgg aaaacggtcc 840 ggtgtctcca tccccgatag tcgttcatca agtacaacca cctagggaac aaggaccccc 900 tccgatcaac ttctcttcac gacctactgc cagaaccatg ccagcaccac cggcgtatga 960 ttcagaatcg ggccaatctc tgccgttgcc gccaacggcc ataccgtgga cgtatccttg 1020 ggtacaatcg acgccagctg tccgttgggt cgctccggaa gttgcatctg gagtgcttcc 1080 tcctgcatca ccgagatcgc cgggactgcc accacctccc tcagctccca 
ttattcagag 1140 tcaagaggaa ggaaatgact catcagacgc tcacagtgac gactctagtt caacggtgac 1200 ggcaagcaaa attcgattag ctgaggttcc taaatacaaa ggaccgtatg gcagaccagc 1260 cgagttgttt cattggcagt ggcttacgga acaatacttt gcagttaaag acctgaacaa 1320 tgacaaggaa cggctgaaat tattaggttc actattaatt gatcagcaaa cttcggcctg 1380 gtatcagtca gctagatcag agttagaagg aaaaacgtgg tcagaagtga tggacttgat 1440 tgcattgggt accttaccag aagactggat tgcggatatc gaacagcaac tacgaaagct 1500 tcaaatcaaa tcaggcgaat cttttgatac ctacatcgaa cgggctcaag ctctgtatcg 1560 aacaatctca cgtttttccc aaattactcc gaaggaattg gcccaataca tctgctgggg 1620 attaccgtca atctttgata gttctataag aaaagattgc ttgctgatgg aggacaactt 1680 caatatgatc atcttcatct cacgagccaa ggcgacctgg agggtcttga tggatagtgg 1740 acttgttaag gaaggtcgtc ctcgtacctc gactaccaac ccttttgtca accatcaacc 1800 tttgggaacg gcgccgagcg ctacaggaac ggggttcaga tcaacaccgc ggaaccctga 1860 ggaacgtgcg gagaacgcct ggaggtatag acagtggatg aagcacttag gtttatgcat 1920 gagctgtaaa aagaagtgta acaacccatc gtgtactgaa acttcgagca ttttcattag 1980 cgtccctcca cctgatcaat tcaacgccgg accacgaccc tcacgtattg ataatcgacc 2040 accaccagga gcgcctgcac aaaaaccagc gggtcgtcca gctatgaaca acactgcgac 2100 gactcgagtg gcagctgtct cagaactgcc tgacttatct acggcagaac tcgaacgata 2160 tgcggaggcc gacagagtgc tagtagcatg tttggaagaa gagaaacaag agaggtgcgt 2220 gtcttctggc aacagagaac catctattat tgtggatctc gtgtgtaatg gggtactcat 2280 tcgggacttg tttgattcag cctgcgaaac caatttaatg gacgaacgac tattgaagaa 2340 agcacgaata ccgcggcgtc ctctggtgaa acccgtaact gtgggtttag cacttgaatc 2400 tgaaggtgca cctaagcagc ttacccactt tgcttttgcc aatttggact caaaggatcc 2460 ggccattcaa tttggagcaa ctttcttcaa ggtggcgagt ttgaatggag gttatgatat 2520 catatgtggt actcccttct tgaaattaca caagattgat atttctattc atcgccgttg 2580 tattacacat accaagagcg ggctggttat ttatgatcga gatgagtcaa gaatgaaaag 2640 aatgaaagag ttgagatgca atgaattgaa caaaatacca aatacagaac tgttcgcgtg 2700 tgtgatggag aatgtgagga aagtagagag tgcgaggaag ctgtcggaga aggaggagat 2760 gatgtttgaa gaatttaagg acttgttccc tgctgaattg cccgcggtag cggaagggga 
2820 ggatgatttt ttccctccca ctgagcaaca cgaatcatca acggtatgcc ataagattgt 2880 gctaacggat cctaatgcgg ttatcaacgg gaaacagtat ggctatccgt tgaagcatcg 2940 tgaagcatgg cgtaggctga tagaccagca tttgaaagca ggccgtatac gtcgatccag 3000 cagtcaatac gggtctcctt ctctcatcat tccgaagaaa gacccggcgg ctgacccacg 3060 atgggtgtgt gattaccggg aattgaacaa agtgactgtc aaagaccggt caccgttacc 3120 caaccctgat gaagcggtac gtttggttgc gacgggaaag gtgttttcgg tagttgacca 3180 gatcaacagt tttttccaaa cacgtatgcg gaaggaggat atcccattga cggcggttaa 3240 aacgccttgg gggatgtttg agtgggtagt gatgccgatg ggactgtgca acgggccggc 3300 gactcatcaa gcacgctgcg aggaagcctt gggagaactg attgggagga tctgtgttgt 3360 gtacatcgac gatatagtta ttttttctga ttccgtagaa gagcatgagg cgcatctaag 3420 ggaggtcttg acaagattga aacgcgctaa tctttactgc tcttcgaaga aaagtaagct 3480 gtttcgtcga aagatttcct ttttgggtca tgagatcagt gaagagggaa tccgcgcaga 3540 cccggatcag gtcaacaagg tcacttcctg gaaaagtccc aaatccgcga agggactcaa 3600 ttcatctcta ggaacggtgc aatggctcaa aaagtttgtc gatgggctag gtcaatatgt 3660 agccaccctc gcgccgctga cgagttcaaa acggaagcct tcagattttt gctggggagc 3720 agcagaggag tcggcgttcg agaatgttaa acgtttgatc accaccttac cagttctcaa 3780 aaagatagat tatgattcag atgaagcagt gtggctcttt tcggatgcga gtggtcatgg 3840 attgggtgct gcgctttttc agggtaagga ttgggaaact tcctctccga tagcttatga 3900 gagtcgaaca atgtctccgg cggagcgtaa ctatccagta catgagcaag agctacttgc 3960 tgtgataaac gcgttgcaaa agtggaaatt attgctgttg gggttaaaaa ttaatgtcat 4020 gtctgaccac catagcttag tacacctatt gaaacagcgg aatctcagtc ggcggcaggc 4080 taggtggtta gaaactcttg ctgattttga tttggacttt aaatatttaa agggagaaca 4140 caattcagta gcagatgcgt tgtcacgaaa agaagcaatt gcggcggtgg aaatcaaagc 4200 caggctggat gacgacactc gtacatcaat ccttctgggt tacaagacag accccttttg 4260 cataaagttg gcatcagcac tgccgcttcg tgaagacagt gtgcgagtag acgacctcat 4320 gtacctcgac ggccgcctag tcattccatc attccggagg cttcaacaag acctaatatc 4380 aagggctcac gaatctctgg gacatctagg gagcgcaaag actttagagc aattacgtaa 4440 ggagtttttc tgggtgggaa tggctaagga cgtggcaaca ttcatctcgg cttgtgacag 4500 
ctgtcagcga aacaaagcgc aaacgactct gctgtctgga cgtcttcaag caacggatgt 4560 accccatgca ccattagaag acatcagcct ggactttgtg gggccattcc ccaaagtgaa 4620 cggttacgac atgatattat cttgtacgtg tcggttgaca ggtttcgtga ggcttgttcc 4680 ggtatctcaa aaagacacgg cagagagaac ggcacgacgg gtctttgctt cgtggctgtc 4740 aatttttgga gccccttcct caatgattgg ggatcgcgac aagatatgga tctcgcgatt 4800 ctggcaacat ctcaatgctc ggctcaacgt gaaggttaaa ctatccactt cataccatcc 4860 gcaaacggac ggccgaagtg aaaggtcgaa caagactttg ggtcaaatcc ttcggcacct 4920 gacagcgtct cgacatagca agtggctgga gtcgctgcct acagtagaat tcgcaatcaa 4980 ttcagcaaca aactcggcga caggtgttag tccttttgaa ttcgtctttg gcaggcagcc 5040 tcgtctgttt cctactaact tggcatccca aatacgagag agtgatgctc atcgttggct 5100 ggaaaagaga agagctgatt gggcggtttg gagggacaaa ctgtgggcaa gtcaagtaga 5160 tcaagcatta caatataaca agcggaggaa acaaggagtt acattagagg aaggtgatct 5220 ggttttggtg gatagtcaag atagacaaca ggtggttggg ggtaaaggtc gtatggtgtc 5280 gaagttgagg gcgaggtttg acgggcccta caagatcaag acggtcatca atgagggtcg 5340 aaactacgaa ctcgaactgc cagaaggcga caaaacccac ccggtcttcc atatttcaaa 5400 gctcaagccg tacagagtcg aggaggatat tgagtctagt accggactag cgtccaaaag 5460 taagtttccc tcctagtatg tcaccgccac taccttctgc accttttgta aaagcaattc 5520 cacggccact ctgtgagcac atattcggac actttcttta caattagggc tcgggacgag 5580 ggaagacttt tcgataacga caataacgat ggctctgttt cacgtgttgc ttgattcttt 5640 tttttctctg gttgtgatgg tttttgatga gggcaatgaa tggtttttct ttctacttct 5700 ttttttttct tctttctgtt tctgaattac ttatctttta ggggcgagtt atgccagttt 5760 ttttcttctt ctttcttttt tttgatttct tttctagttg atggggtgga atactttttc 5820 tttttatttc ttgatttaga ggtggatttt tttttcggat ggtcaatttt ttttagaggg 5880 ggaggg 5886 // ID hATw-1_CCo repbase; DNA; FNG; 3620 BP. XX AC NW_001884902.1; XX DT 12-JAN-2009 (Rel. 14.02, Created) DT 12-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hATw-type DNA transposon family from Coprinopsis cinerea. XX KW hAT; DNA transposon; Transposable Element; hATw; 7-bp TSD; KW hATw-1_CCo. 
XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-3620 RA Bao W. and Jurka J.; RT "hATw-type DNA transposons from Coprinopsis cinerea."; RL Repbase Reports 9(2), 649-649 (2009). XX DR [1] (Consensus) XX CC TSD is 7-bp long. XX FH Key Location/Qualifiers FT CDS 508..3441 FT /product="hATw-1_CCo_1p" FT /translation="MPKDRKKPRAGRSKKQKQQLLQAQAARRLSHQKNDAR FT TKRAAGNVLRAKKKELEKELEATRRESERWKQRYWNERRRTIRMGKAHQQL FT KATHKVVMAELRRMRGVITTMEKEFMEARKEADEEMGALKQRIRALERSRK FT ELVHGQTVVRKRMARLAAAKRNYQRKLTELVKSKPSVFNMVKRGVYTKQAR FT SLARYLASTGTAEAKVGKTIKRIAEAMGLQVDRVMSKHTVQRSILEAGVAA FT DIQMAFEMAKACSECISFFVSKVLATYQVSELCYSSDSTSHKHIEYESRFI FT ALQVVDYERPDQEPQWVLRSLGIGTSVDHRSETQVQGLKERLADLADIFNR FT SPLAKREGIRFSADNFAYKLIGTSGDHANDQKKSHKVLHEWRMDVLCYRMG FT KEALMKMDPVLSAAILLPLKWRQLQTIGQSVWEAMTEEEKGEKDDEVIREF FT GKAIFENLPTNDQAKLSRFIRTGCCMHKDLNCFKWGDKSMQEMWKAKGLVP FT PCLLANKENVSILEGCVGEEPVTADQKRAMDASKRGGSKTVELGGMICKNK FT DSKKGQQDTYRFWMVEHLGFELPYPDVSNTRYGSHGAAAATVLTQLPAFRQ FT FINHVHDTKDRPGYTNIEQNFARALDDIPTITEIAVLALYHVAVSQPFMRY FT VRQEKNLLKLEPFFNRMDTFLASVATSPTVWSTPGHPYGSVFLDSNTPDTF FT SAEVLNVVHQLAPTLPHLDDSISAFTTGAREAFVERFSDEFREGNGIDSLT FT EAERDDLFFASTNDANEGALGSWRLAQRRRVSETLHKFNASFTAELNGTEF FT FIEHMLTEEADEIYLRKEARLRDEDGLQRQLKEAQVEADRKKAEENRLKVA FT RREEKEKEKANAILATGQNLILDDSAIEKLTNDGLNLQLDYHRDLEKSITS FT IPDTDRIPLKSHMKLKSDRIRNLKLAVSRYRSRQTISQPPVEPPTIPSASE FT TGPVTSEHDSSYEYDYYDDFYT*" XX SQ Sequence 3620 BP; 1047 A; 828 C; 956 G; 789 T; 0 other; tggggagggg cagcccaggc gaaatctgat gatgtttgca cctgtttgtc gctctctata 60 cggccgggat ttgggagaaa atggacattt tgggttgtcc gaggcctggg gccttctaaa 120 ttgagtaccg actggatcag aagcactgcc tctcggagcc cttcaaagcc aaattctgag 180 aacgccgatg gtccacgaga tttaccttca atccagtact taaaaagtga aatacaaaaa 240 aataaattga agagctcctt tgaaatgtag aatctcgtct acagacaata tctaagatga 300 gtagacctac aggttgagga gaaatacggg tgctcaggtg cgaccgtaaa cacgctcatt 360 tgattaccta 
atgaacgcga caatcaagat ttacgaccgc gcccgatggc tcgtaaaagc 420 aactcgggcc tcgaaacaag gtataactcg ttcattctgt aacagaatct taaaggacta 480 actaattaac atgtatagat cttatttatg ccaaaggatc gaaaaaaacc gcgggcgggg 540 cggtccaaaa agcaaaaaca acaattgctt caagcccaag ctgctcgcag gttgtcacac 600 caaaagaatg acgcgagaac gaaaagggcc gctggaaacg tgttgagagc gaagaagaaa 660 gagctagaga aagagttgga ggcaaccagg agggaatcag agcgatggaa gcagagatat 720 tggaatgaga ggaggaggac aatccggatg ggaaaggctc accagcaact gaaggcaact 780 cacaaagtgg tcatggcaga gttaaggcga atgcgaggag tgataacaac gatggagaag 840 gagtttatgg aggcaaggaa agaggcggat gaagagatgg gtgcattgaa acagaggatc 900 cgggccttgg agagaagcag gaaggaactg gtacatgggc agactgtggt gcggaagagg 960 atggcgagac tggcagcagc aaagcgaaat tatcaaagga aactcactga acttgtcaag 1020 tcgaagcctt ctgtgttcaa catggtgaag aggggtgttt atacgaagca ggcccggtcg 1080 cttgctcgct atcttgcatc gactggaaca gccgaagcga aggttgggaa gacgataaag 1140 aggattgcag aggcgatggg cttgcaggtg gatcgtgtta tgagcaaaca cacggtccag 1200 agaagcattc ttgaagctgg agtcgctgca gacatccaaa tggcattcga aatggcaaag 1260 gcttgcagtg agtgcatttc tttctttgtt tcgaaggtat tggcgactta ccaggtgtca 1320 gaactatgct atagctcaga ttcaacctct cacaagcata ttgagtatga atcgcgtttt 1380 atagcgctcc aagtcgtcga ttacgaacgc ccagaccagg aaccgcaatg ggtgctgcgc 1440 agcctcggaa ttggcacatc cgtggaccac aggagcgaaa cacaagtcca agggcttaag 1500 gaacggcttg ccgaccttgc agacatcttc aaccggagcc cgctcgctaa acgcgaagga 1560 attcgattca gcgccgacaa ttttgcatac aagttgattg gtacgagtgg ggaccatgcg 1620 aacgaccaga agaagagcca caaagtcctc catgagtgga ggatggacgt gctgtgttac 1680 cggatgggta aggaggcgtt gatgaagatg gacccagtac tttcagctgc aattctgctc 1740 cctctcaaat ggcgtcagct tcagacgatt gggcaatcgg tttgggaggc aatgacagag 1800 gaggagaagg gagagaaaga cgacgaagtg attagggagt ttgggaaggc cattttcgag 1860 aatctaccga ccaacgacca agccaagctc tcgcgcttta tccgaactgg ctgctgcatg 1920 cacaaggacc ttaattgctt caaatggggc gataaatcca tgcaggagat gtggaaagcg 1980 aagggactcg tacctccatg cctacttgcc aacaaagaga atgtgtcgat ccttgaaggt 2040 tgcgtgggcg aagagccagt cacagcagac 
caaaagcggg ccatggatgc gtcgaaacga 2100 gggggaagca agacagtgga gctgggagga atgatttgca aaaataaaga ctcgaagaaa 2160 ggtcagcaag atacgtaccg gttttggatg gttgaacacc tcggcttcga acttccctac 2220 cccgatgtca gcaatacacg atacggaagc catggtgctg ctgcggcgac tgttttaacc 2280 caactgccag ccttccgaca gttcatcaat cacgtccatg ataccaaaga ccgccccggt 2340 tacaccaata tcgagcagaa ttttgctcgt gctctagacg acattccgac gatcacagag 2400 attgccgtcc tcgcactata ccatgtggct gtttcccagc cattcatgcg ttatgttcga 2460 caagagaaga accttctcaa actcgagcca ttcttcaacc ggatggacac gtttctggca 2520 tcggttgcga cttcccccac agtttggtct actccgggtc acccttacgg ctctgttttc 2580 cttgacagca acacccctga caccttctcc gctgaggttc tcaacgtcgt gcatcaatta 2640 gcgcccactt taccgcattt ggacgactca atctccgcat ttacgacagg tgccagggag 2700 gcatttgtcg agcggttttc tgatgagttc cgggagggaa atggaattga cagccttacc 2760 gaagctgaac gtgacgacct attctttgct tcaacaaacg atgccaacga gggagcccta 2820 ggttcgtgga gattagcaca gaggcgccgg gtgtctgaaa ctctgcacaa gttcaatgct 2880 tcctttactg cagagttgaa cggcacagag ttttttatcg aacacatgct cacagaagaa 2940 gcagacgaga tttatctcag gaaggaggcg aggctacgag atgaggatgg attgcaacgg 3000 cagttgaagg aggcacaggt ggaggcagac cgaaagaaag cagaggaaaa tcgattgaag 3060 gttgcgagaa gggaagagaa ggaaaaagaa aaagctaatg ctattctcgc tacaggacag 3120 aaccttattc ttgatgacag cgccatcgag aagctcacaa acgacggtct taatctacaa 3180 ttagattatc atcgtgatct ggagaagagc atcacatcta ttccagatac cgaccgaatt 3240 cccctcaaat cccatatgaa gctcaagtcc gaccgaattc gcaatctcaa actcgctgtt 3300 tctcgttaca ggtctcgcca aactatctct cagcctccag tcgagcctcc aactattccc 3360 agtgcatccg aaactggacc agtcacctca gaacatgatt cctcctatga atatgattac 3420 tacgacgact tttatacata gaatatgtta gcagctattt gtactctatc taaccttaat 3480 tagaatactt ctctacgcct ttataacacg attcggtctg gtttcgcaga ggagagctgg 3540 acaaagttac gatttggcac ttttacggtg gctgtgcact tattaagcag ccgttttgtc 3600 tcgcgcgctg cccctcccca 3620 // ID Gypsy-122_MLP-I repbase; DNA; FNG; 5800 BP. XX AC AECX01000903; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-122_MLP_; KW Gypsy-122_MLP-LTR; Gypsy-122_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5800 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000903; Positions 4373 10172. XX CC Positions [4579-5058] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 292..2376 FT /product="Gypsy-122_MLP-I_1p" FT /translation="MSGEHPVNSRSTRRNPGPALPSVDNPERIIFPTRGHH FT RSHSNPATNFTPPRPVSAPPVPSFVEPNSLLSRQWDLFVNPPNPHGHPVIT FT ANWPPGPPSISIDQVSEDSQELTRTILPRPLARRPRCSIPGAYDPSASFIS FT AGDITARMDPQPSTSMAEGQPSTLAPGYDVSEELARVKLANEVIRDDNEGL FT RNEMREIRLLLGELLRRERPVVPEAVEEVISPEATASTQQPRMAPVARSVN FT SQHTPQLFTSTPAAGNRVHLFNVPNSLPSVEDTTSAPAQPVVDLSKFRVSD FT WPQYKGKFGEVAAFRTWQYQMEITFRVKSINRPEDRFRILPLVLSTDPAAS FT WCRRSERNFVGRSWDEVMTEMQSVVLPVGWEDVAKEKLRELTMKQNESSTA FT YCSRARLIQEEIGVEECSDETLAHAVVGGTIGTFKAWVKMERIVKNSIDPV FT TKRFSFPVFEERMGAIWVLAQEIDNRNIPKARAATAQTASGSTASVSAPSN FT RTPYDRTSAVSTSARPALSSEETLARNVRFGAYMRSIGMCPRCKTPCDKWM FT GGCEAKPNSAFFSVPMEFPRAPPYPPPKAQTGPPRSSAPVTRPPPRRVDVA FT SVEEAQPRVDIAAIGEFPDLGRADLAAYEQMITQLNNSGSKEELVESAEYD FT KSNSFEVGKTLIMTWFVNGVPLRILIDTGAGRSLISSQSVDRLKLV" FT CDS 3043..5451 FT /product="Gypsy-122_MLP-I_2p" FT /translation="MIIPKKDPAALPRLICDYRVLNKHTIKDRSPLPNPDE FT AVRLVGSGKIYSSLDQINAFFQTRMEEKDIPLTAIKTPWGLYEWVVMPMGL FT TNAPATQQRRCEEALGDLVNKVCVVYIDDIVVFSQTVEEHEEHLRQVLQRL FT RAANLYCGIKKTQLFRRQIKFLGHVISEDGISADEEKVEKVANWRSPKNPK FT QLKEFLGTVQWMKKFIDGLSRYASQLTPLTSSKRKPEDFNWTDQEEAAFSN FT IKRMITTLPVLRNLDYDSTDPVWLFTDASGQGLGAALFQGKDWETAWPIAY FT DGRTMTAAERNYPVHEQELLAVIHALNRWRLLLLGLKVNVMTDHHSLTYLL FT 
SQRNLSRRQARWLEVLSQFDLNFKYLQGADNSVADAISRIDTAAVTESRPI FT LSDITVREIAAGYALDPFCSKLAKVLPLRDNCAWKGELMIMDDRIVVPANS FT DLRNNLFLSAHQAVGHLGSLKTYQRLRREFFWPGMAGDVDKWVGLCDDCQR FT HKARTTLLPGRAQTTNLPQRPMSSIALDFVGPFPKVAGYDMLLSCTCRLTG FT FVRLIPSNQKDTAERTARRLYSAWLSVFGAPDEMVGDRDKLWVSRFWQELH FT RLLGVTVKLTTAYHPQGDGRAERTNKTLGQLLRFSTQGRQGKWLESLPTVE FT YAINSAVNAATGVSPMKFVLGREPRLFPIVSSVEGVSQDVERWVSERQGDW FT LKWRDKLWVSRVEQAVQYNKRRGGDLDLKVGELVLVDSANRSAQVGGKVAK FT LRARFDGPYKVLQILNEGRDFKLQLPRGDTTHDVFHVSKLKKYRTDEPVEA FT C" XX SQ Sequence 5800 BP; 1466 A; 1261 C; 1479 G; 1594 T; 0 other; gtctggctac ggccttagag atctgagaac cccagtgaag gtcgagatat caacgacctt 60 acactttttt gaacaatccc agtgattcga atccaaacgg cgaagatcat tggctcgaat 120 ctattcaatc ggacgtggct caatatcaat acaattcaaa aacttaaaac ttcaattcat 180 tcaaattttt ttttcacaat tcaattattc ctgtttgagt ttgctttagt gctacttccg 240 gtttcctgtc acacctgtca cctatacccc gcgttacctg tttgctctcc catgtctggg 300 gagcatcctg tcaattcccg atcaactcgt cgcaatccag gaccagcctt gccatctgtc 360 gacaacccgg agaggattat ctttcctact cgaggtcatc accggtcgca ttcaaaccct 420 gctactaatt tcacaccacc cagacctgta tccgctcctc ctgtaccaag ttttgtcgaa 480 cctaactcat tgctttcgcg acagtgggat ttattcgtta acccaccaaa tccccacggt 540 catccagtca taacggccaa ttggccacca ggtcctccat ctatctcgat tgatcaggtt 600 tcagaagatt ctcaggaatt gactcgtacg attttgccgc gaccgctcgc tcgtcgacct 660 aggtgcagta taccaggtgc ttacgaccct agtgcatctt tcattagcgc tggggatatc 720 actgcacgta tggacccgca accctcgact tccatggcgg agggtcaacc ttcgacactg 780 gcgcctggtt acgatgtcag cgaagaattg gcgcgtgtca agttggccaa tgaagtgatt 840 cgtgacgata atgagggtct gcgtaacgag atgcgggaga ttcgacttct gcttggggag 900 ttgcttcgac gtgaacgacc agttgtaccg gaggctgtgg aagaggtcat atcgcctgag 960 gccacagcat ctactcaaca accaagaatg gcaccggttg ctaggagtgt taattctcaa 1020 catacacccc agttgttcac ctcgacaccg gctgcgggga atcgagttca cctttttaac 1080 gtcccgaact ccctgccctc ggttgaagac accacttctg ctccagctca gccagttgtt 1140 gacttatcca agtttcgtgt gtcggactgg ccgcaatata aaggaaagtt cggtgaggtc 1200 gctgcttttc gaacttggca ataccagatg 
gaaattacgt ttcgagtcaa atcaatcaat 1260 cgacccgaag atcgttttcg cattcttccg cttgtgcttt caacggatcc agcggcttcc 1320 tggtgtcgac gttcagagag gaatttcgtg gggcgatctt gggatgaggt gatgactgag 1380 atgcaaagtg tagtgttgcc tgttggttgg gaagatgttg ctaaagagaa acttcgtgaa 1440 ctcaccatga aacagaacga gtccagtact gcttactgta gccgagcgag attgatccaa 1500 gaagagattg gtgtggagga gtgctctgat gaaactttag cccacgccgt tgtgggaggc 1560 acgattggta cattcaaggc ttgggttaag atggagcgca ttgtaaagaa cagtattgat 1620 ccggtcacca aacgcttctc tttcccggtt ttcgaggaac gcatgggcgc tatctgggtt 1680 ctggcgcaag agatcgataa ccgtaatatc ccgaaagctc gagctgcgac ggcacagact 1740 gcgagtggat cgactgcttc tgtatcagcc ccgtcaaacc gtacgcctta tgatcgaact 1800 tcagctgttt cgacatcagc ccgtccggca ctttcatcgg aagagacact tgctcgtaac 1860 gttcgttttg gggcgtacat gaggtccatt gggatgtgcc ctcgctgcaa gactccctgt 1920 gataaatgga tgggaggttg cgaagccaaa cccaactcgg ccttcttttc agttcctatg 1980 gaattcccac gggcaccgcc atatccaccg ccgaaagccc aaactggtcc ccctcgatca 2040 tcagcacctg ttactagacc acctcctcga cgagttgacg tggcttctgt tgaggaagct 2100 caacctcgag tggacattgc agctataggg gagtttccgg atttgggtcg tgctgatttg 2160 gcggcgtacg agcaaatgat tacccaatta aataactcgg gttccaagga ggaattggtc 2220 gaatctgctg agtacgacaa atccaattct tttgaagtcg gaaaaacact tatcatgact 2280 tggtttgtga atggtgttcc attacgcatc ttgattgata ctggagcagg aaggagccta 2340 atttccagcc aatcggttga tcgccttaaa cttgtttgac gaccccttcc ggtacctata 2400 caggtgcgtc cagcaatcca gtctgaaccc gataactttg tcctcaagga atttaccttt 2460 gctaatgtca aatctcccga acctgctttc acctttggcg cgacagcttt caagatcgca 2520 ccgctggggg gaaattatga cgtgattctc ggggcacctt tcctttctaa acatcattta 2580 gatgtgtcat tgtcacaacg tctactcaga agtgcaaaga atgcttatga atttcgggaa 2640 cagagtttga ttgatgagac gagagagttg aacgagctga ggagtaagcg tgaagcgctt 2700 gttaagactg ttatggagaa tttgaacaag gtgcaagaag tgaatgaatt ctcgcttcgt 2760 gaggtggcga tgttaaaaga atttgaagat ttatttcccg aggatttacc ggatgtgagc 2820 gatgagagtg ttgatgagga ggatgagaaa ttccctgaga agttgcagga tgtgtcgtgg 2880 aagactcgac accgcatacg tctgacggac cctgatgtac 
aaatcaatga aaaacaatat 2940 ggttagccta ggaaacattt ggacgcgtgg agcaagctga taactcaaca tgtcaaagct 3000 gggagattga gaaagtcgtg tagtccatac gcttcacctt ccatgataat tccaaagaag 3060 gatccggctg ctttaccacg tttgatatgt gattatagag tcttgaacaa acacacaatc 3120 aaggatagga gtccactgcc taatcctgat gaagctgtcc ggttggttgg ttcaggaaaa 3180 atatattcgt ccttagatca aatcaatgcg ttctttcaaa cccggatgga agagaaagac 3240 ataccgttaa cagcaattaa gactccttgg ggtttgtacg aatgggttgt tatgccgatg 3300 ggcttgacaa atgccccggc gactcagcag cgacggtgtg aggaggcatt aggtgatctt 3360 gtgaataaag tgtgtgttgt gtatattgat gacattgtgg ttttctcaca aactgtggag 3420 gagcatgaag agcatttacg tcaagtttta caaagactgc gagcagctaa cttgtattgt 3480 ggtattaaaa agactcaatt atttagaaga caaatcaagt tccttggaca tgtgattagt 3540 gaagatggca taagtgcgga tgaggaaaaa gttgagaagg tggctaactg gcgcagtcct 3600 aagaatccca agcagttgaa agagttcctt ggcacggtcc aatggatgaa gaagtttatt 3660 gatggactgt cacggtatgc aagtcaatta actcctttaa ccagttctaa acgaaagcca 3720 gaggatttta attggaccga ccaagaagaa gctgcattta gtaatatcaa gcgcatgatt 3780 acaaccttac cagtcttacg taatctggat tatgattcaa ctgacccggt ctggctattt 3840 actgacgcaa gtggtcaagg tcttggagca gccttgtttc aaggaaaaga ctgggagact 3900 gcctggccaa tagcttacga tggacgcacc atgacggcag cggaacggaa ttatccggtt 3960 catgaacaag aactgcttgc tgtaattcac gcactgaaca ggtggaggtt gttgctttta 4020 ggtctcaaag tgaatgtcat gaccgatcac cattctttga cttatttact atcacaacgt 4080 aatttgagcc gaagacaggc aaggtggttg gaagtcttgt ctcaatttga tttgaatttc 4140 aaatatttac agggggctga caattcagtg gctgatgcta tatctcgtat tgatacggct 4200 gcggtgacag aatctcgacc gatattgtct gacataacag ttcgtgagat tgcagcaggg 4260 tacgcattgg acccattctg cagcaaattg gctaaagttc tgcccttacg agataattgt 4320 gcctggaagg gagaactcat gattatggat gatcggattg ttgtaccggc taattcagac 4380 ttacgcaaca acttatttct atctgctcat caagctgtgg gtcacttggg gagtttgaag 4440 acgtatcaac gattgagacg ggagtttttt tggccgggta tggcgggaga cgtagataag 4500 tgggtagggt tgtgtgatga ttgtcagcgg cataaggctc gtacaacact cttaccggga 4560 cgggctcaaa ctacaaatct tccccaacgg ccaatgagta gcattgcatt 
ggacttcgtt 4620 ggcccgtttc caaaggtcgc tgggtacgac atgcttttgt cttgtacgtg tcgacttacg 4680 ggttttgttc gattgatccc ttctaatcag aaggatactg ctgaaagaac ggctcgccgg 4740 ttatattccg cttggttgtc cgtctttggg gcaccggacg agatggttgg ggaccgcgac 4800 aaactttggg tctcacgatt ctggcaggaa cttcaccgat tgctgggtgt tactgtgaaa 4860 ctcacaacag cttatcaccc acaaggcgac ggacgggcag agcgtaccaa taaaaccttg 4920 ggtcaacttt tacgattctc tactcaaggt cgacagggta aatggttaga atccttacca 4980 actgtcgagt acgcaattaa ctcggcagtc aacgctgcca caggggtctc gcctatgaag 5040 ttcgtgttgg gacgggagcc acggttgttc ccgatagtgt catcggttga aggagtgagt 5100 caagatgtag agcgttgggt gtcagagcgt cagggtgatt ggttgaagtg gagagacaaa 5160 ttgtgggtat cacgggtgga gcaggcagta cagtataaca agcgaagggg tggtgatctg 5220 gatttgaaag taggtgaatt agttttggtt gacagtgcca atcggtctgc gcaagtcggg 5280 ggcaaggtag caaaattacg ggctagattt gatgggccat acaaggtttt gcagatactt 5340 aatgagggac gggacttcaa acttcaacta ccgagaggcg acacaacgca cgatgttttc 5400 catgtctcga aactgaagaa gtataggacg gatgagccag tggaggcttg ttaaggggtg 5460 agggactacc ttgtgcaagt aagttccctc ctttgtatgc accgccgcgg ggtttctacc 5520 tctccttttt tcgcaacaac atcttggcca cgcctgtgag cacgaaatct tctgtttcgc 5580 tctttctcag ggtaccgcgg cgcaatatca ctggacgaag ctgttggtgg gatgaggatt 5640 gttttgtttt gtttcttttt tctattttct cttttacttt tttcttttag tttctcaaat 5700 aaatcaattt tttggttata tttccttttt tgtttttgta ttttggtttt ctgggtgtta 5760 gggggaagtt atgagatatt ttatttttag ttgggggggg 5800 // ID TSE5_I repbase; DNA; FNG; 4546 BP. XX AC AJ439554; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Saccharomyces exiguus retrotransposon TSE5_I, internal region. XX KW LTR Retrotransposon; Transposable Element; RNaseH; TSE5_I; gag; KW integrase; internal region; pol; protease; reverse transcriptase; KW internal portion. XX OS Kazachstania exigua OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Kazachstania. 
XX RN [1] RP 1-4546 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439554; Positions 371 4916. XX SQ Sequence 4546 BP; 1269 A; 1018 C; 823 G; 1436 T; 0 other; ggttatgagc cctgtagccc aaagggaaaa tcctttgtcc attaaactgg agagctccaa 60 agactttaag aaatggttaa atcgtttaag aaatcatatg tttcatgtat ccaatgattt 120 ggctgcttat gcagaaactg gtaacttgtc tcaagcatct aatgaatata atttgtctga 180 acaagtcaga agtgaagcta ttaacatttt aagctttgca ttaattgaat tattgaaaat 240 atcaactcaa ggtcgtcctc atgacgctat tttcaaggaa acggatgaaa atccatacgt 300 tactggtaaa gaaattttag acatgattat taagaagtat gatatttcta attttcgtca 360 agatgtccaa atttggttac caatccattc tggtatggtt gctgagcatg tcgttcaaat 420 ggataattct tccgatcgga acctaacctt ttgtgctact gacacatcca cttctggtat 480 ttccaaagac gaatggattc tagacagtgg ttgtactgtt catgtttctc acaacaggga 540 attgtttact tcatttaaaa gttctaccaa ttccacactt agaggtgttg gtggtgagac 600 ccatatttat ggctatggtg atattaaaat tggcaacgtt acattaaaag acgtagccta 660 cgctccagac ttaccgttca atttattgtc tgtccaaaag gcaattacca cctcagatta 720 tgcgttatta ttcgatgttg atcaccatgt gaagattctt gatcaaaagg gttacatcag 780 agcattagca tctctgaaaa ataatttgta tgtcgtccac tttggtactg atactaagca 840 tatcgctatg ggtgcttatg atgttgaaat tacatcatct ttgactggtt ctcatgatcc 900 tttgttagct tctgaaaaac cagttactcc agaaactagc attttacatg ctcgactagg 960 ccatccaggt attaatagtt ataatcagtt agcatccatt gttcatgctc caaagattaa 1020 gccggcatca gttactgtct gtccaacttg ctcattatca aagggtacaa ttaataaagg 1080 tgtaatctca ccagctgagt acactagccc tttgcaacta cttcaagttg atttatgtgg 1140 tcctttcagg tataaaaact acaaatctga aaaatacttc atgacaatta gagatgcata 1200 ctccagatat tactcggtag ttcacctagt taaaaaatcc gatgctgccc aagaactgat 1260 tgattggatt caagaggttg agaattattt tttctcacgt ggtggttata aggtcggtgc 1320 tatccgtaca gataatggtg gtgaatttat gggtcaaagt cttcatgaat tttttaagaa 1380 gaagggtatt caacatcagt tgactgttcc 
tcactcaagt ttccaaaacg gagcagtcga 1440 acgtgctcat cgatcaattg aagagaagac tcgttgttta ctggtcggtg gtcgtgttcc 1500 tccatcattg tggacagaag ctgtcaatac tgctgtatat ttattgaata gattacctat 1560 taccaataaa aaaggctcta ttccgttttg cttatggaca ggttctgaac cgtctgctct 1620 aaagcttgat aaccttcgtg tctttggctg tgctgcgtat gctaccttag actcttccct 1680 tcgtgatggt aaatttgcac caacttctat ttcaggtgta tttgtcgggt atgattccaa 1740 ccgtaaggca tatcgtatct ttcacccagc gtctaagaaa atctttgcaa gttgtcaagt 1800 caagtttgat gaatttgttt ttccacttga acataccaaa gatacagtta gtacccactc 1860 ttttgctacc tcaactattg gcggtgcgcc aacatatcct tcatccaact ccggctatcg 1920 ttacattgaa tctgaaaacg aagtttcaga ttctagtcat ggcgatcaat catcatctgt 1980 taattcttcg ttctccgaat ccaataatat tgctcctgat gatgctgact atgaagagga 2040 acctgattta tccatgcgtt catcatctca ttcttctgat gccgctacct cgaaccttca 2100 agttgttcct attaccgctg caccaccgtc atctcctgac tctactgaac tagccaaaac 2160 ctccaacgca gctgtgcgtt cctccgactc ctattctagc tttccatcct ctgagcctta 2220 ttattctgct ccaccttcta atcaatcaac tgcctccatt tccgaatctc acctagtcag 2280 ccattccgat attccttctt gtgaatcgga gaatcatgat atggtgcttg ccgaccccct 2340 cgtggattct gtaacacaac aactccaaca cagtactaat caacttaaat cgactcacga 2400 tgatcttatg agtcttcaga aaatgaattc tgaactaact gcacaaaatt caagattacg 2460 taaccatgcc caccatctgg ctcagttagt tcctagtcct accgctcgta ataaacgtac 2520 tcatgctagt accgatttgt taccaccaat tgtcccaaat gaaattcttg tcgattcccg 2580 tccgaccaaa cagagcagta cccatgttct cggtactgat aaaattgcac gtcttccaac 2640 tgcaatggtt actccagatg accacaccta tcagacaatt cctattcaat ctactgatgt 2700 tgctcctatt aatcatgttc ctgttaaatc ggcatgggca gaccgcatcc gtcattatga 2760 atccaaacaa aactggtctc cgaccggtca ttctttgaca aaccttccaa ttgcagccgt 2820 cgatcaacta tatagcacag accgggctat tgttgttcgt aatgatcaaa accaactcac 2880 acagcatgtt ctttctccgg ataatgcttc tttaccctct tcagacattg atatcgaaga 2940 agtcgaaact cacgttggta tgactgctgt tattagtggc cactctacta ttactccatc 3000 actgatcgca ccaatgacta tgcaacaagc tctacgaggt aaaaatgcca aagcttggcg 3060 cagtgccgct gaacatgaac tggcagcctt taaaaaccat 
catacatacg aactcattcc 3120 acctcctgat gatgttcgtg ttcttggatc tcgttgggtt ctgaccgtaa aaggtacaca 3180 cacggctaag gctcgtcttg ttgctcaggg tcatagacaa attcagggtt tggattacac 3240 tgagacattt gctcccgttg ttcgttatga ctctgttcgt atcttcttag ccttagctgc 3300 ctgccatcgt cttcaggtcc atcaaatgga tgttgatact gcctttttaa attctccaat 3360 ggacgaacct gtatacgttc gtcaaccacc agcttttctt gatgctcaac accccgattg 3420 ggtgtggaaa ctttctggtg ctatgtatgg tttgaagcaa gctcctatgt tatggaataa 3480 acacatgaac ggcactttgt gtaaaaaggg ttttgaacaa catgctggcg aacacggctt 3540 gtatttcaaa aatactgctt ctggtattgt cattgttgct ctttacgttg atgatctact 3600 tattgcgggt cccaatgatg ctactatttc ttctgttaaa catgacttat ccactgtgta 3660 ctccatgaaa gatcttggtc cagtaagcaa gtttttgggt ctgaatgttc atcaaacccc 3720 cgctcatatc tcattaagtt tggaggatta cattgtgaaa gctgccaatg ctagttctat 3780 ctgtctcagc aaaccgagat atattcccat ctcccctacc acagatttgt ttgacactaa 3840 ctcttcactc ttgtcaaata ttaccccata tcaaagtatt gttggtcaac tactatttgc 3900 tgccaatact ggccgtcctg atattgcaca tgccgtctct ttgctttctc gttttcttaa 3960 ggctcccact gaacttcatc ttgagactgc tcaacgtgtt cttcaatatc tttatactac 4020 ccgtcatgct tctttgatgt atcaaatggg tgctcctgtt caaatgaata tttattctga 4080 tgcatcttat ggtgcaaagg aggatatccc atatgctaca cgggggtata tcacacaact 4140 agccgggggt actattacat ggtgctccaa gaaaattaag tccactatca cgttgtcctc 4200 aactgaagct gagtatatcg cagcaagtga ggctgttgcc gaaatgcaat ggttaattaa 4260 catgatgaat catatgggta ttaaagtgaa gacgccaaac ttatgggtcg ataacatccc 4320 tgcaattcat attgctgaaa acccggttca ccacaataga atgaaacatg tcgctatcaa 4380 ggttcaccat gtgagaaaag ctattgccga tggtcaaata aatattggcc atattaatac 4440 caaagagcaa ttagctgata ttaccacaaa aacattatcc aaagcgctat tcattcatct 4500 acgtgataag attatcttta aagcaagtat ttgacttacg ggggta 4546 // ID TFO1_FO repbase; DNA; FNG; 2763 BP. XX AC AB008746; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Fusarium oxysporum Ac-type DNA transposon TFO1_FO. 
XX KW hAT; DNA transposon; Transposable Element; TFO1_FO; hAT family; KW target site duplications. XX OS Fusarium oxysporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; OC mitosporic Hypocreales; Fusarium; OC Fusarium oxysporum species complex. XX RN [1] RP 1-2763 RA Okuda M., Ikeda K., Namiki F., Nishi K. and Tsuge T.; RT "Tfo1: an Ac-like transposon from the plant pathogenic fungus RT Fusarium oxysporum."; RL Mol. Gen. Genet 258(6), 599-607 (1998). XX DR Genbank; AB008746; Positions 3549 6311. XX CC 15bp terminal inverted repeats. 8bp target site duplications. XX SQ Sequence 2763 BP; 742 A; 709 C; 672 G; 640 T; 0 other; cagtgtgtcc atcaatcaag ttacttggct tggttggttg tgattggttg attacgtaac 60 aaccaaccaa agtaagggct cctgttcaac caagcaaagc aacctttttc ctttactcgg 120 ctctacttga tttactcgac agaattgccg aatgaggcat agcgcgatga ctttgacata 180 tctacgcgct ctgattggct aggtattccg catcccacct gcgaaaattc tcgccgagtt 240 tcttaagaga aggtacctga cgattgcgcc aagctcgaag ccaaatatgt ctgcacattc 300 attctttcgg ccgcatagaa ggatcaggaa gtccactcca cctaggactc cctctgagcc 360 tctcggttct accctgccca gtacgcctac ctccgtgtca actccaaccg atatactggt 420 acttgatgac ggcgcatcta caccaaccat tgccgacgaa agacagattc caacgagctc 480 tcaatgtgtt ttccctatta actgggacaa gattcgctat gacgggaagc ctgtcccatc 540 aattcgatac cgtcagcctc ataagcgcac cctcaactcc aagctccaac cgtcagcaat 600 ctacaaacat ggtgctcagc tcacaaccga tggggataac aagtactggc tctgcaagta 660 ctgccatata cggggacacc accatactgc actattttcc agtgagagca ctaccagcgt 720 aatataccac ctaaagcatc aacataaact ggaagacttc ggataccagg cagcactgtc 780 aaaccccttc agcatggcga aggggacaac caaccccaca tcatatctcg gaggccggcg 840 gttagtgttt aacgacttgc aattcaagga cgactttatc gactgggtta tcgacctcga 900 tctcaccttc cgtcaagtaa cccatcaacg cactcatgag atctttacca accatctgga 960 agacattggc aagattctgc caaagagccc atccgccctc agtaactgga tcaaggagaa 1020 atgggtcggt gatgctggaa ggcgtgtttg gctgatggaa aagctacacg ttgctacaag 1080 caagatccat atctcggtgg atgcctggac atctgaagaa 
ggaacgaact atctagcagt 1140 tgttgctcac ttcttggacg aaagccataa gctgcaaaca gctcttctgg acctcccacc 1200 tctgaaagga ccccactccg gtgagaatct tgcgaaggcg ttatccaaag tcattgactt 1260 ttacgatatc tccaccgtca ttggcttctt catgatggac aacgctggca ataatgacac 1320 atgcattcag gagctggcaa agcaataccc ggcgatcaaa ccgcagagtc gccttcggtg 1380 tgttggtcat atgttgaatc tcatcgtcaa agctctgctc tttgggcagg gtgttagcaa 1440 gatggaacaa cagctgcgtg gcgcatccga cgaggagcga tttgagatat ggaggaaaca 1500 gagctttatc ggaaagctcc acaacttctg tgtgtgggtc aacaggagtg accagcgacg 1560 ggagctgctg aaacagtacg tgttgacggc gtacgacgaa ggcagcatcg agtgcctcta 1620 cactagagtc ctggttgatg gcggcatccg ctggaactct gcatatgcga tgattgaaag 1680 agctcttaaa ctccgccacg ccattgatct cttcttcctc aactacaacc atatcggcaa 1740 ggaatatgat atctcacagg atatgcttac tccgcaggat tgggttgatc tggaacactt 1800 cctcggtatc ttaaagccgt tcaaagatct aacaaaacgc atggaaggcc gggcaaataa 1860 agcgggcctt gaaggatccc acggctccct ttacgaaacg atcgagtcgc tagacgttct 1920 cttcaaggag cttcaagagg ctggaaagtt tgcagataat caccccgagg tggtgtccac 1980 atactattcc tatgctatcg atgctgctcg cgtcaagctt gaagagtact ttggccttac 2040 agatgccacg cctgcatacc gctgtgcggt tgccctgcac cctgcgaaca aatttacata 2100 cttcgagctg gaatggagtc acaacaagca gtggatcagt gaggcgaaga gagtggtacg 2160 ggaggtgtac gcacagtacg aggaagcagc tgccaaaacg ggcacgatag gagcacaacc 2220 acaggaggag gtaattgatg acaatgacgt ggcgctggat cctctccagc aggcccgtaa 2280 gcgtcgtcag cgacttgctg ccactgcggc gtcaggctca cgaggaaata agcgcatcaa 2340 actcacctct gagctcgatg agtttatggc cagggctaat agggctgatg tggaagttga 2400 ggatcctctg gagtggtggg tttgccatgc gtctgactac cctatcctct ctaagatggc 2460 ctttgacttg ttcagttgcc cagcaatgag cgctgagtgt gagcgagtat tctcacagac 2520 aaagaaggtc attaccgatg aacggaatcg cctcaagtca gataccgtag cggcacttga 2580 gtgccaaaag cacctgcttc ggaccggcat gttaccatag aaattgcccg attgggaata 2640 gtagtcaagt aatatcaacc aagtaaggac ttcttgtaag caaccaagta aagtaaaggg 2700 ctctagtcaa ccaagccgag taagggggtc aaattttggt ttacttggtt gatggacaca 2760 ctg 2763 // ID Gypsy-33_MLP-I repbase; DNA; FNG; 7828 
BP. XX AC AECX01000195; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-33_MLP_; KW Gypsy-33_MLP-LTR; Gypsy-33_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-7828 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000195; Positions 112409 120236. XX CC Positions [6628-7107] - Integrase core CC 'ATACT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 4024..7731 FT /product="Gypsy-33_MLP-I_1p" FT /translation="MIDWKTKTIHIDLITASTEAESSDLTPPSTYPELEPL FT RDAKNCDEGMCITHNALTSPQSESVIDLFTHSPKEKTGKLYSLLNLQPNTD FT ATTTNQTQDNTHKEPHQDHEIAAAISASSSPQQNPESPKMEPTGHARNCDE FT GASIVSNTFKPPQCEFDLPRLFNPREAAGKPIFPLNSSIPLDIAAAKTSWS FT TSARLAADEKKKTPNQPVEELVPSCYHRHLNMFKKSQAQHLPPRRKYDFKV FT ELIEGAQPQASRIIPLSPAESDVLKEMINTGLANGTIRRTTSPWAAPVLFT FT GKKDGNLRPCFDYRKLNALTVKNKYPLPLTMDLVDSLLDADEYTKLDLRNA FT YGNLRVYEGDEEKLAFICKEGQFAPLTMPFGPTGVPGYFQYFMQDILLGHI FT GKDTVVFLDDIMIYTKKGTTHEGVVNKILDVLTKHQLWLKPEKCEFSKKEV FT EYLGLLISKNKIRMDPTKVKAVSDWPAPRNVSELQRFIGFANFYRRFIDQF FT SRTTRPLHNLTRDKTPFIWDTTCELAFNKLKTVFTSAPILKIADPYKPFVL FT ECDCSDFALGAVLSQKCEVDGEIHPVAYLSRSLAQAERNYEIFNKELLAIV FT ASFKEWRHYLEGNPNRLEVIVYTDHRNLETFMTTKALTRRQARWAETLGCF FT DFQIKFRPGRQATKPDALSRRPDLAPTGKEKLTFGQMIRPENIGPETFTAD FT LACFESFFDNEDIELDNADHWFEVDILGVEDTEPQDPILNDTEIIDLIKKA FT NQKSPRLQELMEAVQNPLSTKLKQAVKKYEVKDGLLYNQGRIEVPEDEDIK FT RHILKSRHDSLLAGHPGRAKTLGLIRRSFFWPSMKAYVNRYVDGCDSCLRT FT KTSTKKPLGTLEPLPIPAGPWTDISYDLITSLPLSNNNDSILTVVDRLTKM FT SHFIPCRETMSAGELADLMIKFVWKLHGTPKTITSDRGSIFISQITQELDK FT RLGIRLHPSTAYHPRTDGQTEIVNKVIEQYLRHFICYRQDDWDSLLPIAEF FT 
SYNNRDHTSTGVSPFKANYGFEPNFSGIPSSEQCLPAVEERLKLLKEVQEE FT LTECLEVAQENMRTQFDKDVRPTPDWNIGDQVWLNSKNISTTRPSPKLDHK FT WLGPFNIIKKISRSAYELTLPLSMKGVHPVFHVSLLRKHSQDTINGRHQAE FT PQPIKINDEEEWEVYNILDCRKRYNKREYLVSWKGFSAENNSWEPEVNLKN FT SKELLDQFNTKFPNAANRHKRTQRKK" XX SQ Sequence 7828 BP; 2516 A; 1871 C; 1490 G; 1951 T; 0 other; tattgtcgga tctttcatcg acgagcgtta cagaattcac aaacggacag aaccagaata 60 ccagaatcga attttagaac taaactagaa ctaagacgat ctaactccac gaataccagc 120 aaatcaaccc cgcataagat tgataagatt agaacttgat atagatacac tcagaacctc 180 acagaacctc gactcggata aacttgaaac cttaaaccta gaatttccag attgatcatc 240 ccagaatctc ttatttagaa ctcactccac cgaaacacct tataccttat agaacaccac 300 aaccacgtct ccctcataca gaatccccac agaagacgac gtcgaatccg aattgaacga 360 ctcgtacatc agcgtgaacc aatcttcctc gaactctgtt gatcctacct ctaccccaca 420 tcctcctgca ttagaatcag tagccatgga agccattcaa cgccagctca atgaacttca 480 gaactccctc gctgaagagt gtcaactgcg agaacaagca gaagcgagag gtagacaagc 540 ggaggaacgt ctagaagcaa tgatagcggg ccgtaacacc aaccctgatg tgaccaatcc 600 cgcgacgtct agcccatctg gagctcatcc cgaaattaca aacatcctga agggacccaa 660 agtttcagtg cctgataagt ttaacggtgc tagaggcgga ccagcagagg tgttcgccag 720 tcaggtacag ctgtacatga tggctcaccc ttatctattt caaaacaccc gctgtagaac 780 tcctacacag tgtagcgtag gctgagccgg agagggtaag aatcaccact cgtgagagcg 840 gatgaaacta cgggtgcggt gaagctgccc tcaacctttt gttgcagttc gaaaagaacc 900 agagcaatct aacaattttc tatcttcacg aatttttccc gtcaaacttt gtttatcccg 960 tagatggatg cgcaaggtct ggcggagtct ggcgatgtac agtacacgcc tacacaacgt 1020 ctatagccgg agttatgcaa atttcgattt ttcactactt tcggattttg agtccgtagt 1080 gtacttttac aattggagag gggggagatg gttttgtcgg cggcggggat tttccagaaa 1140 tttggtactc ggaccccaaa tttaacaatt tataattgtt tcgaataatt gacaaagtcg 1200 aatttccaga aagggatttt tcaaaattgg cgaatttgtt gaaaggatgg gattcgaacc 1260 ggtgctcagg gttggccccg agttgtttga gctttcacaa acagtgacac agataacaca 1320 aaataaaaaa aataataata aaaataaaga ccaacgtctt cacaccagct cagcaaccga 1380 tcaagacttc aatatcaata acactaacgc tctggactca gcaagtttca agaacacaat 1440 
agatcactct caagactctc aatagcctta tttaaactca ataaaactct tcaagacact 1500 ttatctgatc ttaataacac acacaagaca ctcgagattg aaaaaagaaa ctcttcaaga 1560 ccttttaaat tgatcactgg aactcgatca caaaacagaa cacacacacg aacacgatga 1620 tcaaatcaaa acatatctga taggatgata ggatccttat tccattcgct agaaaccccc 1680 ctccttatat actcacaacc ttcgcttcaa ccgttgcttt cacatcattt gccccaacca 1740 atctatttaa gatttccccc gatcgcaccc ggcatcacat catcaactca ttcccctcct 1800 ttctctaatc acatcacagt aagtttcctt taatttttct ccactttcct cttctctttc 1860 attcaatact aatatctgct tcatcttccc cacttattca atcaatgcca acctctaatc 1920 tccttctcca cctcgatctc atcgcatcat cctcatggtc atcttggctt tgcttgtttg 1980 aattttgatt ctggactgtg ttttaaatca atcaatcaat ttcgattttg gatcctttga 2040 gtgaattcaa ttttgatttt cttcttagct tccttctttt aataactttc ctgatactga 2100 catcattact ttacttcttg ttgttacttt tagttttgaa tttcgatcta gagcttcaat 2160 ttcaactacc acccccaatc ttaactcaac actctaacct tcaagtaagt cttttatcac 2220 atcttctcca ccttctcatc ttcagaatca caccctagat ctattgaact gattcaattt 2280 cgatctcaac tccttttcaa ctacaacctt catcaatcaa cctcttcaat ttcgatctca 2340 gccttctatt ttggtatgtc attcctgttt ctttctaatc ttaaatcttg actttggacc 2400 ctaatcgtga tcatgaataa tactgacacc ttcattttaa ttggactgta ggtattcttt 2460 actttattga atattaaagg actattttcg agttatgtct tcttttgtgt attttgtttt 2520 gtttgcttat atgtagactt tctgttatga agaaatgagt aataaaataa taaaagtacc 2580 agatttgaat ctaatagacc ataatcaaaa taatgtaagc ctgatcagtc tgagatgagt 2640 aatgaaactt gagccttaat tttgaataat tctgtagctc atgattgtga acttgatcta 2700 atgaattgat atcaaaaaca agataatata aatgagtgag cacttgaaga caaaacccaa 2760 aatataagat tgatatagtt ttttcttttt ttttctgtta aataacttta aatccttgat 2820 caaatgaata tcaacaagta atgttaaaaa aaatgatcat tgattttgat ttccacaccg 2880 ctcgaaagtc gtattcgccc tctcttacct taccgggact gcgagtgcgt gggcccaacc 2940 gcttaccgct gagctcttcg acgccgaaac cgctcatact gtcacctacg agcggtttgt 3000 acaaaacttc aaagcaatgt tcttcgacac tgagaagaaa tccaaagccg aaaaggcaat 3060 cagatccctc actcagaagt ccacagttgc tgcatacact cacgagttta atctgtacgc 3120 tacaagcact 
ggctgggaaa cttcaaccct gatcagccag tacaaacaag gattgaagcg 3180 tgatatccga gtagccatgg tccttgtcca agaagagttc aaatccatca aacaaatctc 3240 gaatctcgct atcaaactag acaacaagat tcatggatcc gctgatacat ctgttgcacc 3300 accaaaccct gtccgcgacc ctaacgctat ggacatctcg gcgtcatcca cgagacttac 3360 tgacgacgaa aaagccagac gattacgtac tggaacctgc tttaggtgta atgtacaggg 3420 acacatctcc actttttgcc ctaatcgaaa aaatgacaga tctggaaaag gaaagggagg 3480 atacaaagca agagttgccg aattagaagt gaaattagct gagcttagca gtagaagtga 3540 agagaggagt gaaagtggag aaggaagcag tagagcagac atgtcaaaaa atggaggcgc 3600 tcaagcctga aggaagtgcc tagcttgagc aaaggggtta tcatgaatac aatagatttg 3660 ggagctagta cagttgtaaa atgcaataca aatgatccac acttatttca ctgtgttcca 3720 ctatctttgt cccatgaccc caaagccact ccatttaaga acccccgtgc ttgactcttg 3780 attgactcgg gtgccactca caacgtgttg ggagaagcat ttgccttcca agcaaactta 3840 ctaccatacg tgacgccaac ttcacgagcc gtcactggat tcaacggatc agcaactagc 3900 tcaacccacg aaatcaatct gtctatcaac aatgacaagc acgacaccca attcatcatc 3960 attgacctca agaacaccta cgatggtatc ctaggcattc cctggataca agataattac 4020 catatgattg actggaagac aaagacaata cacattgatt taatcactgc ctccaccgaa 4080 gcggagtcgt ccgatctgac accaccctcg acgtatcccg agttggagcc tctgagggac 4140 gctaagaatt gtgacgaggg gatgtgcatt acacataatg cgttaacatc cccgcagagt 4200 gagtctgtta tcgatctttt cacccatagc cctaaagaaa agactggcaa gctttattcc 4260 ctcctgaatt tacagcccaa taccgacgct accacaacga accagacaca agacaacaca 4320 cacaaagaac cgcaccaaga ccacgagatt gcggctgcta tatcagcctc gtcgagtccg 4380 caacagaacc ccgaaagccc aaaaatggag cccacagggc acgctaggaa ctgtgacgag 4440 ggggcgagta ttgtttctaa tacgtttaag cccccgcaat gtgagttcga cttgcctaga 4500 ctctttaacc cacgtgaagc agctggcaag cccatttttc ccttgaattc tagcataccc 4560 cttgacattg cagccgccaa gacatcctgg tcgacttcgg cgcgattggc agctgacgaa 4620 aagaagaaaa ctccaaacca accggtcgag gagttagtac ctagctgtta ccatcgacat 4680 ctcaatatgt tcaagaagtc gcaagcacaa cacctcccgc caagacgaaa gtatgatttc 4740 aaagttgaat tgattgaagg cgcacagcca caagctagcc gcataatacc cttatcaccg 4800 gcagaaagcg acgtcctcaa 
ggaaatgatc aacaccggat tagctaatgg cacaatacgc 4860 cgtacaacat ctccttgggc cgcccctgtt ctattcaccg gcaagaaaga cgggaacttg 4920 agaccgtgct ttgactaccg gaagctcaac gcattgactg tgaaaaataa gtatccgctg 4980 cctctcacca tggatctggt ggacagcctc ctcgatgccg acgaatatac caaactcgat 5040 ctacgcaacg catatggaaa cttacgcgtt tacgaaggcg atgaggaaaa gctggctttc 5100 atctgtaagg aaggtcaatt cgcacccctg acaatgccat ttggaccaac aggagtgcct 5160 ggatatttcc agtacttcat gcaggatatc ttattgggtc atattggaaa ggatactgtc 5220 gtatttttag atgacataat gatttacacc aagaaaggaa caacacatga aggagttgtc 5280 aataagattc ttgacgtttt aactaagcat caattatggc ttaaaccgga aaaatgcgaa 5340 ttctcaaaga aagaagtaga atacctcggg cttcttatat caaagaataa aatcagaatg 5400 gatcccacta aagtcaaagc cgtatctgac tggccagcac cccgcaacgt atctgaacta 5460 caaagattca ttggattcgc taacttctat agaagattta ttgaccaatt ctcacgcacg 5520 acacgacccc ttcacaatct taccagagac aaaactcctt tcatatggga taccacctgc 5580 gaacttgcgt tcaataaact gaagaccgtg ttcacgtcag ccccgatttt gaaaattgct 5640 gatccataca agccatttgt attggaatgt gactgctcag atttcgcatt gggagcggtg 5700 ttatcacaga agtgcgaagt ggatggcgaa atacacccag tagcctactt atcacgatct 5760 ttagcacaag cagaacgcaa ctatgaaatc ttcaataagg aattattagc gattgtcgcg 5820 tccttcaaag agtggagaca ctaccttgaa ggtaatccaa accggttgga ggttattgtt 5880 tacactgatc atagaaatct tgaaacattt atgaccacga aagctttgac tcgacgacag 5940 gccagatggg cagagacact tgggtgtttc gactttcaaa taaaattcag acccggtcgc 6000 caagccacca aaccagacgc cctttccaga cgccccgacc tagccccaac tggcaaagag 6060 aagctgacct ttggacaaat gatcagacct gaaaatatag gaccagaaac tttcaccgca 6120 gatctcgcgt gtttcgaatc attctttgat aatgaagaca ttgaacttga taatgctgat 6180 cactggtttg aggttgatat tttaggagtt gaggacaccg agccacaaga tccaatactc 6240 aacgacaccg aaatcattga ccttatcaag aaagcaaacc agaaaagccc aagattacaa 6300 gaacttatgg aggcagtaca gaaccctcta tcgacaaaat tgaaacaagc ggtaaagaaa 6360 tatgaagtga aggatggact attgtacaat caaggacgaa tcgaagtacc tgaagacgaa 6420 gacatcaaga gacatatact gaaaagcaga cacgacagcc ttctagcggg acacccaggt 6480 agagctaaaa ccttaggcct aatccgtagg 
agtttctttt ggccatccat gaaagcttac 6540 gttaacagat acgtggatgg ttgtgattca tgcttacgaa ctaagacaag cacaaagaaa 6600 ccgctaggaa ctctagaacc cctgccaata cctgcagggc cgtggacaga cataagttat 6660 gatttaatca caagtctacc cctatcaaat aacaacgaca gcatattgac agtggttgac 6720 agactaacaa aaatgagtca ctttataccc tgcagagaga caatgtcagc aggtgaactc 6780 gccgatttga tgatcaagtt cgtatggaag ctccacggaa cgcccaagac catcacatca 6840 gaccggggaa gcatattcat ttcgcaaata actcaagaac tcgacaaacg actgggtata 6900 cgacttcacc cttcaacagc ataccaccct cgtacggacg gacaaactga aattgtgaac 6960 aaggtcatag agcagtatct tcggcacttc atctgctaca gacaggacga ctgggacagc 7020 ttacttccaa tagccgaatt ctcttataac aatagagatc acacgtcgac aggagtatca 7080 ccgttcaaag ctaattacgg gttcgaacca aacttcagtg gtataccctc cagcgaacaa 7140 tgcctacctg cggtagagga aagattgaaa ttactgaaag aagtccaaga agagctgact 7200 gaatgcttag aggttgcaca ggaaaatatg agaactcaat ttgacaaaga tgtaagacca 7260 acacctgact ggaacatagg cgatcaagtg tggctcaaca gcaagaacat ttcaacgacg 7320 aggccaagcc caaaactgga ccacaaatgg ctgggccctt tcaatatcat taagaaaata 7380 tcacgatcag cttatgaact gacattacct ctctccatga aaggagtgca tccagttttc 7440 cacgtgtcat tattacggaa acattctcaa gataccatca acggaagaca tcaagctgaa 7500 cctcaaccaa tcaaaattaa cgacgaagaa gaatgggaag tctacaacat acttgattgc 7560 cgcaaaagat acaacaaacg agaatactta gtcagttgga aaggatttag cgcggaaaac 7620 aattcatggg agccagaagt caatttaaag aacagcaagg aattattaga ccaatttaac 7680 acaaaatttc ccaacgcagc aaacagacac aaaaggacac agagaaaaaa gtgagagggt 7740 aagctttttc cctacggggt tttttaatgc tacccgggga ggtacgcaga gctttcaaga 7800 gggagcttgg gcgtaaaagg ggggatac 7828 // ID Copia-1_CCO-I repbase; DNA; FNG; 4488 BP. XX AC AACS02000001; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_CCO_; KW Copia-1_CCO-LTR; Copia-1_CCO-I. 
XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-4488 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000001; Positions 4069514 4074001. XX CC Positions [1588-2124] - Integrase core CC 'GCAGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 124..4470 FT /product="Copia-1_CCO-I_1p" FT /translation="MSDQHSGTATYRLEPLNATNWMPWKRRMLAVLREHDL FT DKYVTGSAKTKPTLTPEGVSPAVTAADIEKWTTGEAKARTRLELAIGDSEM FT VHILGAETAGDIWDRLSQIKETKGRLGVLATRRALYRAQAVDGFDMVEHIS FT KLRGYQSELALMENIVSDEDFAMIIITSLPESWDGFTTSYLGSQSNAATLS FT AASIIPLLIDEDRRRRTRDGGSSNALQARFGGKESGGSGGRGGGSKATKEC FT FNCKKKGHFKKDCWAKGGGMEGKGPQGRGRNRAHQATDSNVDINQAVSVAY FT MARLSDAPAPSKLVWYLDSGTTSHICTQRDAFTHYTPLHNSTIQGIGPTAA FT IVAGMGTVRLKFVLDGKTSIHEIRDVLHVPDAPNCLLSLSRFDDAGGSVNF FT ANGMCNLLDKDGRLVGKGRKTDRLYQLDARAELKQNRTFAATRRARTWDYW FT HRAFGHVGMSTLEKMHSSGAVTGFEVDESSVPSPTCVPCVEAKFSHLPHPH FT KASGRSDTAGEGYHCDVWGPIKTESIWHNKYYISFTDDATRYVDVKFMKSK FT DAAFDLIAKHVASVETRFGKPPKWFRFDNGKELVNERTKELCALKGIEIHT FT TAPYSPAQNGVAERLNRTLLELVRANLIAKSVPAFLWDEAVSHAVYIRNRV FT LVSALPDKTPYEAYHSKKPDVRHLREFGSDVWVLDESPGRSKLDPKANRMV FT FVGFIDGSRSIRYYDAARRSVKVSRNFSFNDNLDDAWMESLGSGLEEETLI FT DTPVTPVSKSVEVDTRSGNSETTDQEAILRYTPTPAPQSPRSTSNSPGKKS FT VPPVVDDNITPVPTPGYEGRTLRQRNTDIDYKLANNPDARKPADRNWKQRT FT KALKTARSTKDTNSSLEASDSDEYAFLARRVTIEEVEDVDAPRSSIPIPFT FT SLPPTPQSPPGLKSSPSPLARNIPIPSNVHEALSGAYAKEWEEAIKEELSS FT LQKMGTWDLVDLPAGRNAIGNKWVFALKEDDHGRVVRWKARLVAQGFSQKP FT GIDFNVHETFAGVMRLDTFRTNTAMAAVNGWRMFQLDVKSAYLNGRLDEEL FT YMAQPPGFEDGSGRVCRLNRALYGLKQAGNVWNREFTSTMSELGFTQLRSD FT SCCFIRSSNGATTILLVWVDDIIGYTDSDEEEERLVKELSAKFEITHVGRP FT QILLGMKIAQDDTSGAITLSQSHYIDSMLARFGLTNANTVSTPLDPNINLD FT YDQGPPPAADRRGSSLYATAIGSLLYAAMGTRPDITYAVHRLAQFTSHPEP FT 
KHWTAVKRVFRYLKGTKDFALTYGGLDFDLTPDIDIFCDADWASNADRKSI FT SGYVCNIAGGAVAWSSKKQTTVALSTAEAEYVAATHAARQVLWFRSLFSEL FT GFPLPSTSTIFTDNQAAIAIAHHPEFHARTKHIDISLQFLRDHVKTGTLDT FT VYINTHDNLADIFTKALPRPAHEEFVYRIGLLPLG" XX SQ Sequence 4488 BP; 1052 A; 1343 C; 1119 G; 974 T; 0 other; ggttatgagc cccgcatagc tctgacctac ggaacgatta ctgtcaacaa aactctgctt 60 tactcttaac tctcgaactt ttctctcttt tcttttcgtg atatcgatat caacgccgcc 120 accatgtctg atcaacattc tggcactgct acctatcgtc tcgagcctct gaacgcgacg 180 aactggatgc cttggaagcg ccgcatgcta gctgtgcttc gggagcacga cttggacaag 240 tatgtcaccg gctccgccaa aaccaagcct accctcactc cggagggtgt ctcacctgct 300 gttaccgccg ccgacatcga gaaatggacg actggcgaag caaaagcacg aactcgtctc 360 gagctcgcta ttggcgatag cgagatggtc cacatcctcg gagctgagac tgctggggac 420 atctgggacc gtttgtcaca gatcaaggag acgaaaggcc gactcggagt tctagcgact 480 cggcgagcac tgtatcgagc ccaagctgtc gatggcttcg acatggtcga gcacatttcg 540 aagcttcgag gctaccaatc ggaactggcc ctcatggaga acatcgtctc ggatgaagac 600 ttcgccatga tcatcatcac ctcactgccc gaaagctggg acggatttac gacctcatac 660 cttggatccc agagcaatgc agcgactctg agcgctgcca gcattatccc actcctcata 720 gacgaagatc gacgacgtcg cactcgtgat ggcggatcct cgaacgcatt gcaggctaga 780 tttggtggga aggagtcggg aggatcagga gggaggggtg gtggatcaaa ggcgacgaag 840 gagtgtttta actgcaagaa gaaggggcat tttaagaagg actgctgggc gaagggtggt 900 ggaatggagg ggaagggacc acagggtaga gggaggaata gggcgcacca ggcaacggat 960 tccaacgtcg acatcaacca ggcggtatct gtagcctaca tggctcgtct cagcgatgca 1020 ccagcgccct ccaagctcgt ttggtacctc gactctggaa caacctccca catctgcaca 1080 caacgcgatg ctttcaccca ctacacacct ctccacaact ctaccatcca gggcattggc 1140 ccaactgctg ctatcgtcgc tggcatgggt actgtccgtc tcaagttcgt tctcgatgga 1200 aagacctcta tccacgagat tcgagacgtt ctgcatgtcc cagatgcacc caactgcctc 1260 ctatcactca gccgcttcga cgacgcagga ggatcggtca actttgcgaa tggcatgtgc 1320 aacttgttgg acaaggatgg gagattggtt gggaagggaa ggaagacaga tcggttgtac 1380 cagttggatg ctagagccga gttgaagcag aaccgtacct ttgcggcaac tcgtcgggct 1440 cgcacctggg actactggca ccgcgctttt gggcatgtgg 
gcatgtctac cctcgagaag 1500 atgcactctt cgggcgctgt aaccggtttc gaagttgacg aatcgtctgt tccttccccc 1560 acttgcgtac cctgcgttga ggccaagttc tcgcacctcc cacaccctca caaggcttct 1620 ggacgatccg acacggctgg cgagggttat cactgcgacg tctggggtcc aatcaaaacc 1680 gaatcgattt ggcacaacaa gtactacatt tccttcacag acgacgccac ccgctacgtc 1740 gacgtcaagt tcatgaagtc caaggatgca gccttcgatt taatcgccaa acacgtcgcc 1800 agcgtcgaaa ctcggtttgg gaagcctccc aagtggttcc gcttcgataa tgggaaggaa 1860 ctcgtcaatg aacgcaccaa ggagctctgt gcattgaagg gaatcgagat ccacaccacg 1920 gccccctact cccctgccca aaacggtgtt gctgaacggc tcaaccgcac tttgttggag 1980 ttggtgcggg caaacctgat tgccaagtca gtccctgcct tcctttggga cgaggctgtc 2040 tcgcatgctg tctacatccg aaaccgtgtc ctggtcagcg cactccccga taaaacccct 2100 tacgaggcat accacagcaa gaaacccgat gttcgccacc ttcgggagtt tgggagtgac 2160 gtttgggtct tagacgagtc acctgggcgt tcgaagcttg atcccaaggc caaccgcatg 2220 gtctttgtcg gatttattga cggatctcgg tcaatccgtt attatgacgc ggccaggagg 2280 agtgtgaagg tctcaaggaa cttctccttc aacgacaacc tcgatgacgc ctggatggag 2340 tctctcggat ctggtctcga ggaggagaca cttatcgaca cccccgttac ccccgtatcc 2400 aaatctgtcg aagtcgatac cagatccggt aactccgaaa ccacggatca ggaggcaatt 2460 ctccggtaca cacccacccc agctccccaa tctccgcgtt ctacttcgaa ttctccggga 2520 aagaagtccg ttccacctgt tgtcgacgat aatatcactc cagtacccac acctggctac 2580 gaagggcgaa cactgcgtca acgtaacacg gatatcgact acaagcttgc caacaaccct 2640 gatgctcgaa agccggccga taggaattgg aaacaacgca ccaaggccct gaagaccgct 2700 cgcagtacga aggacacgaa ctcttcactc gaggcatccg attcggacga gtatgccttc 2760 ctagcacgac gagtgaccat tgaagaggtt gaagacgtcg atgcacctcg aagctctatc 2820 cctattccct tcacttcctt accacccact ccccagtccc ctcctggtct taaatcgtcg 2880 ccttcacccc ttgcgcgcaa catacctatt ccttcgaacg ttcacgaagc tctcagtggc 2940 gcatatgcga aggagtggga ggaggcgatc aaggaggagt tgtcttcatt gcagaagatg 3000 ggtacatggg atctggtcga cttacctgct gggaggaacg ctattggcaa caaatgggtg 3060 tttgccttga aggaggacga ccatggccgt gtagttcggt ggaaggctcg actagtcgcc 3120 cagggcttct cccagaagcc tggaatcgat ttcaacgtcc atgaaacctt 
cgcaggcgta 3180 atgcgactcg acaccttccg aaccaacact gctatggctg ccgttaacgg atggaggatg 3240 ttccaactcg atgtcaaaag cgcctacctc aacggacgac tcgacgagga gctttacatg 3300 gcgcagcctc ctgggtttga agacggttca ggccgagtct gtcgcctcaa tcgagcgctt 3360 tacgggctga aacaagccgg caacgtctgg aatcgcgaat tcacctccac catgtccgaa 3420 ctcggtttca cccagctcag gagcgactcc tgctgcttca tacggtcttc taacggcgct 3480 acaacgattc tattggtctg ggtcgacgac atcatcggtt acacggactc tgacgaagag 3540 gaggagcgcc tggtgaagga actctctgca aaattcgaga ttacgcacgt cggacgacca 3600 cagatccttc ttgggatgaa gatcgcccaa gacgacacct ctggcgccat aaccctctcc 3660 caatcgcact acatcgactc catgctcgct cgtttcggtc ttaccaacgc caacaccgtc 3720 tccaccccac tcgaccccaa tatcaacctc gactacgacc aggggccgcc tcctgctgct 3780 gataggcgtg gtagcagcct ctatgcgacc gccatcggtt ccctgctgta tgcagcgatg 3840 ggaacacgtc cagacatcac ctatgccgtc catcgactcg cccagttcac cagtcacccc 3900 gaaccgaagc attggacggc cgtcaagcga gtttttcgtt accttaaggg taccaaagac 3960 tttgcactca cctacggcgg actcgacttc gatctaaccc ccgacatcga catcttctgt 4020 gatgcagact gggcctctaa tgctgaccgc aagtctatca gcggctacgt ctgcaacatt 4080 gcaggaggag ccgtcgcttg gagctcgaag aagcaaacca ccgttgctct ttcgactgcc 4140 gaagccgagt acgtcgcagc tactcatgct gcgaggcagg tcctctggtt ccgatctctc 4200 ttttcagagc tcggatttcc ccttccgtcg acttctacga tcttcaccga caaccaagcc 4260 gccatcgcga tcgctcacca tcccgaattc catgcacgca ccaaacatat cgacatctcc 4320 ctccaattcc ttcgcgatca cgtcaaaacc ggcaccctcg acactgtcta tatcaatact 4380 cacgacaatc tcgctgatat cttcaccaaa gccttacctc gccctgctca tgaagaattc 4440 gtctatcgta ttggtctctt accccttgga tgaacgaaca aggaggag 4488 // ID Copia-55_MLP-LTR repbase; DNA; FNG; 489 BP. XX AC AECX01000203; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-55_MLP_; KW Copia-55_MLP-I; Copia-55_MLP-LTR. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-489 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000203; Positions 36737 37225. XX SQ Sequence 489 BP; 116 A; 81 C; 102 G; 190 T; 0 other; tgttgtgtca agaccttcaa gcgttaagta aatagatcaa tgttgtcaca aaacgttcaa 60 cggtcgtatg tacaagtaga acattgtcga ctcgatcagt ttagagctaa tgatacatga 120 aaggggtaaa tgtgtgtttg gttgtcagtt gattggaatg taagcttagc gaagtagtgc 180 atgaatgcag gggtggtcac gcgtctttct tttctctttt tgtttgtcta atttttgtct 240 tatatgtgtg ttcgctcgtg ttgtcccctt tagtttttct ttttcctcac cgtctgggaa 300 aagaaattct aaagttttat ttctttgtca aaccttgtca ggtttgctat cttcttaaaa 360 tctcttcaag tctagtcagc tcgctgtcga gagatttata ttgtttttct ttgttaaagc 420 ctttaacaca gatatcatca acgtcgagtt gtatctccgt tgtattcttt tatggtagcg 480 agagtatca 489 // ID Gypsy-1_MLP-I repbase; DNA; FNG; 5635 BP. XX AC AECX01000978; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_MLP_; KW Gypsy-1_MLP-LTR; Gypsy-1_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5635 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000978; Positions 5929 295. XX CC Positions [4437-4916] - Integrase core CC 'CAGGG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 292..1449 FT /product="Gypsy-1_MLP-I_1p" FT /translation="MDPNSITTSLASILDQLTKLSTRLDEESEQRRIAQEQ FT LEKESVLRQETQNQLNNLVQQINAGNQSQAQQQTPPAPAAQQTAQSPTQPP FT PAEPITAADVKQLLGPSRGPKVGTPDKFDGSKGDKAEVFVNQVGLYLLANA FT NLFPDDKAKIIFTLSYLTGEANQWAAPIFKQLLHPNDEEELTFKAFGAAFE FT ATFFDSDRQNRAQRDLRSLHQTGSVADYTSKFMSLAARTGWGEVEHISHYR FT IGLKQEIRVNMILKSFESVSAITAYALAVDNELHPGLQRQSTSRAVAPTVA FT RDPDAMDLSSAQVEYSRSGVSREEMLRRTEKGLCFKCGKGRHRAADCGRKD FT EKGNWVRSGGGGGHRIAELEAVIAELKNGKGRADESKNGEARD" FT CDS 1452..5537 FT /product="Gypsy-1_MLP-I_2p" FT /translation="MDVPLPGVQDDVVGIGAVKCMKRNANDPRFFLSLPLS FT HPHLCATSRPIALCLLDCGATHEAISKKFIHNHNLPTIPLTNPCSVSAFDG FT QVRTLHEEAHLNIDSDITPTTFIVTQLKDSYDTLLGMPWFKKHGHQIDWQH FT GVFHSTDGPINIAAADAVSSRPTTPLEEPTRDARQCDEGVCVYDTLTPPRC FT ESGIPTLQNPPETAGKLYCPIEQVTNSILLQEQDTTTTQDLIATEETVSFN FT PKTPVETQDEATRDARKSEEGVRVCDTLVPPQCEFATSITNVILETAGKSV FT PFLEFSDPDLCAAKTSWSTAAKIAVKEASNQPKKTVEELVPTRYHKYINMF FT RKSTSQTLPPRRKYDFRVDLVPGATPQAGRIIPLSPAENEALDTMINEGLA FT NGTIRRTTSPWAAPVLFTGKKDGNLRPCFDYRRLNALTVKNRYPLPLTMDL FT IDSLLDAERYTKLDLRNTYGNIRVAEGDEDKLAFICKNGQFALLTMPFGPT FT GAPGCFQYFIQDILLGRIGKDAAAFLDDIMVYSKSDTNHEDAVEEILQILD FT GQSLWLKPEKCEFSKSEVSYLGLVISKNRIQMDPAKVQAVTDWPAPKTTSD FT VLRFIGFSNFYRRFIDQFSKKARPLHDLTRKDTPFTWTPERQAAFDELKTA FT FTTAPVLKIADPYQPFILECDCSDFALGAVLSQRSTKDGEIHPVAYLSRSL FT IPDERNYKIFDKELLALVASFKEWRQYLEGNPHRLDVTVYTDHKNLENFMT FT TKQLTRRQARWAETLACFGFVIKFRPGRHSTQPDALSRRPDLAPSSEEKLT FT FGQLLRPSNITDDTFAELDSFEAQFQEDIEVEHSDATKWFELDVLGVEAND FT NSDNTTEVCNLQWSDTHIVDKIRELTPQDHELTELISAVTSPISLKLKGAV FT GEYEVVDGVLYHRGKAVVPSNEELRSGILKTYHDSKLAGHPGRAKTLSSVR FT RNYVWTGQKAYVNRYVDGCTSCQRIKPSLMKPFGALEPLPIPAGPWTDISY FT DLITGLPLSNGKDSILTVIDRLTKMAHFLPCKETDGAEALADLMLKEVWKL FT HGTPKTIISDRGSIFVSKITSQLNKRLGISLHPSTAWHPRTDGQSEIANKV FT VEQYLRHFVTYHQDDWEALLPPAEFAYNNSTHTSTGVSPFKANYGYDLSLG FT AIPSEDQCIPAVEERIKRLTEVQMELHECLLGAQEAMKNQFDKGIRDTPLW FT NVGQRVWLSSKNILTTRPSPKLDHRWLGPFRIVKRVSHSAYKLELPLTMRG FT VHPVFHVSLLRKFEPDTIKTRTTTQPEPVVVEGETEWEVNEILDCRRRYNK FT 
LQYLVNWKEFGKEEDSWEPASNMTHAQELIKAFHKRFPDAASKHKRHRRK" XX SQ Sequence 5635 BP; 1705 A; 1376 C; 1295 G; 1259 T; 0 other; tattgcagta tctatacctc aacgcagtcg acacgagttc gcagaacgga ttacgtattg 60 aatcatcaga tcggaactat aaagttagaa tttagaatca aagagaacaa gcagaagaag 120 atcatcagat taaaacttta ttttcattga ttagaaactt agaaagaagg tcgacaccgt 180 tgcattcgaa caacggcaaa gtttacaacc ccagacgaac aaggttcgga acactcctca 240 tcctcatacg ctacagtacc gaaccaagac gatccaaacc ccacaagcac tatggatcct 300 aattcaatca caactagcct ggcttcgatc ttagaccaac tcactaaatt aagcacaagg 360 ttggatgaag aatcagaaca acgcagaatc gcacaagaac aactagaaaa agaatcagtg 420 ttacgtcagg aaactcaaaa tcagcttaat aatctcgtac aacagatcaa tgcgggtaat 480 caatcccaag ctcaacaaca gacgccacca gctcctgcag cgcaacaaac cgctcaaagc 540 cccacgcaac cacctcccgc tgaacctatc acagcagcgg acgtcaagca actcttggga 600 ccgtcccgag gcccaaaggt gggtacgccc gacaagttcg acggctccaa aggcgacaag 660 gccgaagtgt tcgtcaacca agtcggattg tacctcttag ccaacgccaa cctcttccca 720 gacgataagg cgaaaatcat ctttaccctg tcttacttaa ccggcgaggc gaaccaatgg 780 gcagcaccta tctttaagca acttcttcac ccaaacgacg aggaagaatt gacgttcaaa 840 gcctttggtg ccgccttcga ggctaccttc ttcgactccg atcgccagaa cagagcccaa 900 cgagatcttc gaagcttaca tcaaacgggg tcagtagcgg attatacatc taagttcatg 960 tctttagcgg ctcgcactgg atggggtgaa gtagaacaca tcagtcatta tcggattgga 1020 cttaaacaag aaattagagt caacatgata ttaaaatctt tcgaatcagt atctgctatc 1080 actgcgtatg ctctagccgt ggacaacgaa ctacaccctg gactacaacg ccaatcaact 1140 tctcgtgctg tcgcccctac cgttgcaaga gacccagacg ccatggacct ttcaagtgca 1200 caagtcgaat actctagatc aggtgtttca agagaggaga tgctgaggag aacggagaag 1260 ggtttgtgtt tcaaatgcgg gaaagggaga catcgtgctg ctgactgcgg gagaaaagat 1320 gaaaagggga attgggttag aagtggagga ggaggaggtc acagaattgc tgaattagag 1380 gctgttatag ctgagctgaa gaatggaaaa ggaagagcgg atgaatcaaa aaatggcgag 1440 gctcgggatt gatggatgtg cctctcccgg gcgttcaaga tgatgtagtg ggtatagggg 1500 ctgttaaatg tatgaaacgc aatgcaaacg acccaagatt ttttctttca ttacccttgt 1560 cccaccctca tctttgtgcc acatctagac ctattgctct 
atgtcttctc gactgtggag 1620 ctactcatga agcgatcagc aagaagttca ttcacaatca taaccttccc acgatccctt 1680 tgaccaaccc gtgttccgtt agcgcgtttg atggacaagt aaggaccctt cacgaagaag 1740 ctcacttaaa catcgactcg gacatcacac ccacgacatt cattgtcacc caactgaaag 1800 actcttacga cacacttctt ggtatgcctt ggttcaagaa acacggccac caaattgatt 1860 ggcaacacgg tgtatttcac tcgacagatg gaccaatcaa catcgcggct gctgatgcag 1920 tttcgtcacg accgacaaca cccttggagg agcccacaag ggacgctagg caatgtgacg 1980 agggggtatg tgtctacgac acgctaacgc ccccgcgatg tgagtccggt attcctactc 2040 tacaaaatcc tcctgagaca gctggcaagc tttattgtcc catagaacag gttacaaact 2100 cgatactact gcaagaacaa gacacaacca cgactcagga cctcattgcg actgaagaaa 2160 cagtctcttt caatccgaaa acacccgtgg aaacccaaga tgaggccaca agggacgcta 2220 ggaaaagtga agagggggtg cgtgtctgtg acacactagt acccccgcaa tgtgagtttg 2280 ctacttctat caccaatgtc atccttgaaa cagctggcaa gagtgtgccc ttcctagaat 2340 tctctgatcc agatctatgc gcggcaaaaa catcttggtc gacggcagcc aaaatagcgg 2400 tgaaagaggc aagcaaccaa cctaagaaga cggtcgagga actggttcct acgaggtatc 2460 acaaatacat caacatgttc cggaaatcta catctcaaac tctccctccg agacggaagt 2520 acgacttcag agtggatcta gtaccaggcg cgacacctca agctggaaga atcatacccc 2580 tatccccagc agagaatgaa gcactggata caatgatcaa tgaaggattg gccaacggca 2640 caatacgacg tacgacatcc ccatgggcgg ccccagttct gtttaccggg aaaaaagacg 2700 gcaatttacg gccttgcttt gactaccgac gcctgaatgc acttaccgtc aagaacaggt 2760 acccgttacc gttgacaatg gatctcattg acagcctctt agacgcagaa aggtacacca 2820 agttagactt acgaaacacg tacggtaata tccgggtggc ggaaggcgac gaagacaaat 2880 tggccttcat atgcaagaat ggtcagttcg cgctgttaac aatgcccttt gggccgactg 2940 gtgcaccggg atgcttccag tatttcatac aggacatctt actgggacgt attgggaaag 3000 acgccgcagc atttttagac gacatcatgg tctattccaa gagtgatact aatcatgagg 3060 acgcggtaga agaaattcta caaatccttg atggtcaatc cctgtggctt aaacccgaaa 3120 agtgcgaatt ttccaagtcg gaagtgagct acttaggact cgtaatttca aagaacagaa 3180 ttcagatgga ccccgctaag gttcaagcgg taacggactg gccagcacca aagacgacct 3240 cagatgtcct tcgatttatt gggttctcga atttctatcg acgattcatt 
gatcaatttt 3300 ccaagaaagc cagacccctc catgatctca cacgaaagga tactcccttt acatggactc 3360 cggaaagaca ggctgccttt gacgagctca agacggcgtt cacaaccgcg ccagttctga 3420 agatagccga cccctatcag ccgtttatac tggagtgcga ctgctcagat ttcgcactgg 3480 gtgcagttct atcgcaacga agtacaaagg atggcgaaat ccaccctgtc gcatatttat 3540 cacggtctct aatacctgat gagaggaatt acaaaatctt cgataaggag ttacttgcct 3600 tagtcgcatc tttcaaggaa tggaggcagt atttggaggg caaccctcac cgtcttgatg 3660 tcaccgtgta caccgaccac aagaacctag aaaacttcat gaccacaaag cagcttactc 3720 gaagacaagc cagatgggcc gaaactttgg catgttttgg ctttgtcatt aaatttcgac 3780 ccggccgaca ttcaactcag cctgacgccc tatcacgacg accagacttg gccccatcat 3840 cagaggagaa gctaaccttt gggcaactgt tgcgaccatc aaacatcact gacgacacgt 3900 ttgctgaact cgactcattt gaagcgcagt ttcaggagga catagaggtc gagcactcag 3960 acgcaactaa gtggtttgaa ttagatgtct taggtgtaga ggctaacgat aattctgaca 4020 ataccacaga ggtatgcaat ttacaatggt cggatacaca catagtcgat aaaattaggg 4080 aactcacgcc tcaagatcat gaattgactg aactgatatc cgccgttacc agccctatat 4140 ccttgaaact caaaggagca gtgggtgaat acgaagtggt cgacggcgta ctgtatcaca 4200 gaggtaaagc agtagtcccc tccaatgagg aactgcgttc gggcatcctc aagacgtacc 4260 atgacagtaa gctagcggga catccaggca gagccaagac cctcagttcg gtgagacgta 4320 actacgtttg gacgggacag aaggcttatg ttaaccgcta cgtcgatggt tgcacatctt 4380 gtcaacgtat caagccttca ctgatgaaac cctttggagc tttagaacca ttacctatcc 4440 cagccggacc ctggaccgac atcagttatg atttgatcac agggttgcct ctatcaaacg 4500 gcaaggatag tatcctcact gttattgacc gtctgacaaa aatggctcac tttttaccat 4560 gcaaagagac agacggcgcg gaggctttag ctgatttaat gcttaaggaa gtttggaaac 4620 ttcacggcac ccccaaaacg atcatttcgg acaggggtag tatatttgta tcaaaaatca 4680 cctcacagtt gaataagaga ctaggaatat ctctgcaccc ttcaacagcc tggcaccctc 4740 ggacggatgg ccaatcagaa attgctaaca aagttgtaga gcaatactta cgccactttg 4800 ttacttatca tcaggacgac tgggaggcct tactaccacc ggccgagttc gcttacaaca 4860 atagcaccca cacatctact ggtgtatcac ccttcaaggc aaattacgga tacgacttaa 4920 gtttaggtgc aattccgtcg gaagatcagt gcataccggc ggtagaagaa cggatcaaaa 
4980 gacttacaga agtacagatg gaattacatg agtgtttgtt aggtgctcaa gaagcaatga 5040 agaatcaatt cgataaagga atcagggaca caccattgtg gaacgttgga cagcgtgtct 5100 ggctgagcag caagaatatc ttgacgacaa gacccagccc caaattggac catcgttggc 5160 tggggccctt tcgtatcgtt aaaagagttt ctcattctgc ctataaatta gagctaccac 5220 tgactatgag gggagtacac cccgtctttc acgtctctct gttgaggaaa ttcgaaccag 5280 ataccatcaa gactcgaaca accactcaac ctgaaccagt agtggtggaa ggggaaactg 5340 aatgggaggt aaatgaaata ttagactgtc ggcgaagata caataaactt caatatttgg 5400 ttaattggaa agagtttggc aaagaagaag actcttggga accggcttca aatatgactc 5460 atgcacaaga attgattaag gcttttcaca agaggtttcc cgacgcagca agtaaacaca 5520 aaaggcatag gagaaagtga gagcgggtca agctttttcc cacgtggttt ttaatgctga 5580 cccagggaca gagcgcagag cagcaagagg gacttgggcg ttaaaggggg gataa 5635 // ID Copia-10_MLP-LTR repbase; DNA; FNG; 268 BP. XX AC AECX01000965; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-10_MLP_; KW Copia-10_MLP-I; Copia-10_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-268 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000965; Positions 281333 281066. XX SQ Sequence 268 BP; 76 A; 53 C; 44 G; 95 T; 0 other; tgaacgtcac gtagtgacgt catgctgagg agcaggacgc gcatactgtc atatgcatta 60 tagaagagtt tctcttttct aacatcttat gtaatcattg agttcgagtc agtaatatat 120 tagaacaagt gttgtacaca tctagttgtt cttttcctta tcgaactcat tatcagaaac 180 ctttctcaat ctcttttctt agattgtggt ttctttcgat caagcaccag tcaaacataa 240 gactgatact agatccttaa tctattca 268 // ID Mariner-7_AN repbase; DNA; FNG; 1891 BP. XX AC . XX DT 09-JAN-2004 (Rel. 
9, Created) DT 09-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE DNA transposon. Mariner superfamily. Pogo clade - a consensus DE sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW mariner superfamily; Mariner-7_AN; Pogo subclade; transposase. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-1891 RA Kapitonov V.V. and Jurka J.; RT "Mariner-7_AN, a family of DNA transposons in the Aspergillus RT nidulans genome."; RL Repbase Reports 3(12), 215-215 (2003). XX DR [1] (Consensus) XX CC DNA transposon. Mariner superfamily. Pogo subclade. CC TA target site duplications; 60-bp TIRs. XX SQ Sequence 1891 BP; 566 A; 433 C; 426 G; 466 T; 0 other; acgtaaatcc cgggcggtag gtgcgtcccg ggcggaaggt agttttctcg tccaccccaa 60 cgcgtttatc aacctcaact ttcaacaacc atcatgccac caaaagcgcg taaaacaaag 120 cgagatttga ttgagcaaga gggcaggatc caatgcgcga ttcaagacat taaaaataga 180 aaatttcaaa aaattgcgcc cgcagcgcgt gcatacaaaa ttcatcccaa tacacttcga 240 gggagacttc atggccgcca atctcaagca gaactccgca accaccagca taggctatcc 300 ctacatcaag aagaggtctt gataggatgg atagtattac ttgacattca tggagcagct 360 cccaggccct cgcgcgtacg tgagatggca caacttatcc tggatgaatc ctcaacctta 420 tctcgaccga tcagaaagaa ctgggtaaca gagtttacaa aaaggcgccc tgaaatcaaa 480 accaggtttg cttggaagat caatcatcag agagcacttt gtaaagatcc caagataatt 540 cgcccatttt tcaatgagat acagaggatt aaagttgagt atgggatatc agatgatgat 600 atctacaact ttgatgaaac tggctttgct atgggcctaa ttgcaacaac aaaagtggta 660 tctcgagcag aaatgccagg caaaccatgg cttatacagc cgggggatcg caagtgggtt 720 accaccatta aatgcatcaa ttcaactgga tggtcagttc tatcaaccat tatctttaag 780 ggaaagcgct atagagaggg atggtttgag gaactctcta ttccacatgc ctggaggatt 840 gaggttagta ataatagatg gactacagat ataattgggc tttgctggct tcaaaaatgc 900 tttattccag ctatacagag gtggcaaagg ggggagtata tactccttat tctggacagc 960 catagaagcc acttgacccc ggcctttgac actatataca aggataataa cattatccct 1020 gtctgcatgc 
ctccttattc atctcacctc ctgcaacccc tggatgtggg ctgttttggc 1080 cccttgaaga gggtatacag atccctgatt aagcagaagg cacgcctagg atacaaccat 1140 gttgacaagc ttgatttctt aaaggcttat tcagaagcct ataagaaagt ctttataata 1200 gagaacattc aaagcagatt cagggcaact gggttacatc ctttctcacc tgctgcagta 1260 ctggataagc tgcagttaag accattgact cctacacccc cccccaagca gaggtactgc 1320 ttcaatcccc tcctctcaac tctgtacgcc ttatacagtc cgtcaggtgt atcgaaaagc 1380 ttcatcagtc aaaaagcttc taaaagaggg ctctaggagt ccttcaagcc cctcaaaaca 1440 ggcgctggat gaatttgtaa agggctgtga ggtggctatt tacaatgctg ggttgctggc 1500 acaggaaaac aaggatctct gcttatttgt agcagataac atggcaaaaa agagttgttc 1560 taggcgtcta ataactccta cagatggact ctcatttaaa gaagccaggg accttatttc 1620 gtcgagaaat aatgaattac aagctggtgg ggggggttca agctccagta cccttccaac 1680 ttcggagaga cttaggcgcg cccccccaag gtgtacaaat tgcggagtac aaggccataa 1740 aagaacaagc tgtcaggttc ccaatcatcc ttagtttatt tagtttggat aagaattgat 1800 cgagttattg aaatcgaaag tttgtatagc agtggggtgg atgagaaaac taccttccgc 1860 ccgggacgca cctaccgccc gggatttacg t 1891 // ID Gypsy-48_MLP-LTR repbase; DNA; FNG; 186 BP. XX AC AECX01001259; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-48_MLP_; KW Gypsy-48_MLP-I; Gypsy-48_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-186 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001259; Positions 39767 39952. 
XX SQ Sequence 186 BP; 47 A; 56 C; 31 G; 52 T; 0 other; tgttgtgatc ttatatcacg gattactgag gtgtcacagc atgtgacaga tgagactctc 60 aagcacactc tagatcagag cttttactag attgatccct tcctcatgct acaataatca 120 tatcacctta ttggcaccct ctccctctgt tcctcccgtc cacgacccca gaagccggtc 180 acaaca 186 // ID Gypsy-4_PPM-I repbase; DNA; FNG; 6894 BP. XX AC ABWF01004646; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_PPM_; KW Gypsy-4_PPM-LTR; Gypsy-4_PPM-I. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-6894 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01004646; Positions 18703 11810. XX CC Positions [5137-5631] - Integrase core CC 'CAAGT' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 863..2107 FT /product="Gypsy-4_PPM-I_2p" FT /translation="MSNPWEDPNVPRQSPSSPDELMRTPNRSPQRPQSRRS FT PPRLAQPQPSYPQVPVNPRPAPPVPPPNALAIALSQIATLLQNQQQGGGRK FT PVVNKPKDFDGNKDEYEKWKMEMRLFLADHQINDDNRRTNIIISYIRGPKV FT DAFIRILYNTNCPGGYWQISSAELWGILDDHYVDASLREKAQQKIEYVRQG FT NRSADDYIVEFEDLASQAGYNLGDEHVVRLLKRGTSQNVIDQVYMSGNIPT FT TYVEWCTRIRGIDRNWRIRQEEKALWSRYHGNSNASGSKPATQPARQAAPV FT APQRTPGVIPVAGVSYQPRRDGTGTLYQGGGQPMQIDRACIKCGAKKTEKG FT TCGDMWHLPNRAAGPQPQRVRELAVETGVEAGTENPTTVNLDNEEQGLNYL FT RSLYASRPELFTKAGFRFGNN" FT CDS 2134..6621 FT /product="Gypsy-4_PPM-I_1p" FT /translation="MNRFSPLSDSVDLIEHNDVSPTDLHTRDTLSINTYKA FT EASHNERGSAITVLNSSTKNTKTSRKSSSASKPKTTLKIYLDDPRAIAPKR FT QTEGSVGYDLYAAEDLFIAAHSRTLVDTGIRMEMPKGVYGRIAPRSGLAVK FT GLDVGAGVIDPDYTGRLKVLLINNSDKEFHVEYSMRIAQLILECAEVPEID FT IAQGTIQDTTRGTGGFGSTGEKLRGNVVVKGVGRGRQMELPIAVVIPETEQ FT VIDTRALLDSGSTGSCIDRNFVVRHGIEVHKFENPIKVYNADGSANSGGRI FT TDYVELVVGTGKHKEKRQFLVTELQSAEVFLGFDWIQYHNPTIDWQRKLLK FT FDRCPESCSVFSENLVEHEDRIFMMDSNAWLRTVHERKEYLRANATDIAIA FT EGKYKAKKMEDVVPEHYRDYADVFSEDTFNALPDRKPWDHAIELLPGAKPY FT CGKVYPMTLNEQKALDEFLEENLRTGRIKPSKSPWGAPFFFVKKKDGKLRP FT VQDYRKLNEMTKKNKYPLPLISELLNKLKNCRYFSKLDIRWGYNNVRMKKG FT DEEKAAFITNRGLYEPLVMFFGLTNSPATFQMMMNDLFYEMIMEGKVIVYL FT DDILIFSDDLQTHREDVRRVLDILRLNKLTCKPEKCEFEKSEVEYLGHLIS FT YGTIRMDPGKVAAVAEWPEPTCKKDLQSFLGFANFYRRFVKDFARIATPLN FT RLTGNVEWEFGAGCKEAFGRLKDTLTNAPVLGIPTDNDPFRVEVDASKYAV FT GAVLSQKQNDVWHPICFLSHSLSPAERNYEIYDRELLAIIVALEEWRHFLL FT SAAHTVEIWTDHKNLEYFRKPQRINPRQARWVSTLQEYDFILTHKAGKTNI FT VADPLSRRPDLDIGDGHLEDVTVLKAEYFNAITVPSADDIIKRAKLSIPDY FT PESVVRLLDKDPHVTTEDGLIYRRGRLVIPTNRSLIGDVIAAHHDAPSAGH FT PGIEKTMELVDRSYWWPTRKKDIHDYVIGCEACQKAKAHRQARAAPLNPNP FT VPENNWEFISVDLITHLPEVHGHNSILNFVDLKSKDYISVATSDTLSSEGF FT ANLYLKHVFSKHGLSRRIFSDRGPQFVSNFIRDVYKKLGITGNPSTAYHPQ FT TDGQTERVNQELETYLRIFVSYRQDDWPDWLPMAEFAHRNRKHSSTGFSPF FT YLTHGHDAFTGVETRKEMKNESAAQWAEKMAQLQKEASQALELSKESMKKY FT YDRKKKDAREYKEGDLIWVEGTNIRTEKPNKKMDDKRYGPFRIEKKIGASA FT 
FKLKLPMSWKGIHPVFNESLLYPYVEPTFPNQDSSHTIPPLVDQPDAVDEI FT LDSRERKNGLQYLVHWKGKPRSENTWEPRSGLLRTSKDLLDRFHKDHPNAP FT KPPSIRIPTRVRFSETVEVAEGPDDQEWEKWGNKYQRWQERKKLHSLLLHA FT NLEPDAEHVILPTRLNIGHVWFCDSQHRVRYIAVVNSPSPQGNNFAYPIQS FT ILKIRENIVCTHPEARLYEHSNNLPLNVALW" XX SQ Sequence 6894 BP; 1973 A; 1741 C; 1713 G; 1467 T; 0 other; ttaactgaag actactctac tccgaactcg ctcgaccggt attaccgcgt ctttcaaacc 60 cttgtattcg agagtgacca ccctccggta ctaaacccgg gccacatagg gcaaagaact 120 ctctaaatac aagtctccac atccgatcct atccgagcca ccccaccgct cactctctac 180 atccgcacat caaacgacca agacaacgca gtcttgcaga gccggacact aggcttaagg 240 tgtagccatt actggcactt ttctttcgaa cgcccaccgc aggctcttcg acaagggtag 300 atagctacgt acgcgcccgt cacttcgaga cgaaccttcg ggagatttac cttcagtggt 360 agtctccaag gggccgaacc ccctccgtga ttaggcaacc cactgcataa tagtcaaaag 420 cagcacgggg ctcacggcga tcgcgaatcg aagtgaatac cgcggagcct agataagtaa 480 cagtaggcta gcgtagcaat actttgtccc tgcgagttac ccctattggc taagaggaag 540 acttagcggt aggaacaggg aatctttttc accatccgga gataacggct gtgcgggctt 600 ggatagcctt tttgggttga tcctgtttgg tgacagggga ccaggcagcg cgtggatttg 660 atagacccag aaagcggtct ggttttggaa atttgggttc cggggtcagg aaatccccga 720 ataggatcgc gaggtcggaa attcctcggc agaatcaggg agcagagcaa ttgaggcggc 780 acgctccacc gcagcctctg tataggcaat tgggctccac ggtcggaaaa tccgtgaata 840 gaaccacgtc tttcatcaca tcatgtctaa cccatgggag gatccaaatg tgcctcgtca 900 aagcccatca tcccctgacg agctcatgcg gactccaaac cgctcaccgc aacggccgca 960 gtcacggcgg tcaccaccga ggttggcaca gcctcaacca tcctacccgc aggtcccggt 1020 aaacccgagg ccagcaccac ccgtaccccc accgaatgca ctcgccatcg cccttagcca 1080 aatcgcaacc ttgttgcaga atcaacaaca agggggaggt cgaaaacccg tggtaaacaa 1140 accaaaggat ttcgacggca ataaggatga gtacgagaag tggaagatgg agatgcggct 1200 gtttttggcc gaccaccaga tcaacgacga taacaggaga accaacatta tcatcagcta 1260 tatccgcggg ccgaaggtag acgccttcat aagaattcta tacaatacga attgccccgg 1320 cggatattgg cagatctcca gcgcggagct ttgggggatc ctggacgacc actacgtcga 1380 tgctagccta agagaaaagg ctcagcagaa aatcgagtac gtacgtcagg gaaatcggtc 
1440 ggccgacgac tacatcgtcg aatttgagga tctggcaagt caggccggat acaacttggg 1500 tgacgaacac gtggttaggt tgctaaaacg aggcaccagc cagaacgtga tcgatcaagt 1560 ctacatgagt gggaacattc ccaccactta tgtagaatgg tgcacccgta tccgcggcat 1620 tgaccgaaac tggcgaattc gccaggaaga aaaagctttg tggagccgct accacggcaa 1680 ctcaaacgct tctggaagca agccggcaac acaaccggct cgtcaagctg ctccagtagc 1740 accacaaaga actcctgggg tcataccagt ggccggggtc tcataccagc cgagaaggga 1800 tggaacgggt actttgtacc aaggaggtgg acagccgatg cagatcgatc gggcctgtat 1860 caaatgcgga gcgaagaaaa ctgaaaaggg gacatgcgga gacatgtggc atctgccaaa 1920 ccgggcggca gggcctcagc cacaacgagt acgtgagctg gctgtcgaaa cgggggtaga 1980 ggcggggact gaaaacccca caacagtcaa cctggacaat gaggaacagg gactaaatta 2040 tttgcgctcc ctctatgcat cgaggcccga gctcttcacg aaggcgggtt ttcggttcgg 2100 caacaactga atgcgccagt tgcactgcca attatgaaca gattttctcc tttatcggac 2160 agcgtggatc tcattgagca caatgatgta tcccctacgg acttacatac acgtgatacg 2220 ctatctatta atacatacaa agcggaggcg tctcacaatg agcgcggcag tgctataact 2280 gtgcttaatt cctccaccaa aaatacaaaa acatcgagaa aatcatcgag cgcatcgaaa 2340 ccaaaaacca cgttaaaaat ctatcttgac gatccaagag ccatagctcc gaaacgacaa 2400 acggagggct ccgtcggata tgatctttac gcggcagaag atttatttat agctgcacat 2460 tctcgcacgc tagtagacac gggaatccga atggagatgc ctaaaggcgt atacggtcga 2520 atcgccccac gatctggact cgcagtcaag ggtctagacg taggcgctgg tgtaattgat 2580 cctgattaca ctggacgact taaggttctc ctcatcaaca attcggacaa ggaattccat 2640 gtcgaataca gcatgagaat tgcgcagttg attctggagt gtgctgaggt tccggaaata 2700 gacattgccc aagggacgat tcaggacact acacgaggta ccggtggctt tggaagcact 2760 ggtgagaagt tgcgaggaaa tgtagttgtg aagggagtag gtcgtggtcg acagatggag 2820 ctgcctatcg cggtcgtcat tccggaaacg gaacaggtga tcgacacgcg agcactatta 2880 gactcaggaa gcacgggatc gtgcattgat aggaattttg tggtacgaca tggtattgaa 2940 gtacacaaat tcgaaaaccc tatcaaagtc tataatgcag atgggtcggc gaactctggg 3000 ggaaggatca cggactatgt cgaactcgta gtaggcactg ggaaacacaa ggaaaagcgt 3060 caattccttg tgactgaatt acagagtgca gaagtattct tgggttttga ttggattcag 3120 
taccacaatc caaccatcga ttggcagcgg aagctgttaa agttcgatcg atgcccagaa 3180 tcatgctctg ttttcagcga aaacctggtc gaacacgagg atcgcatatt tatgatggat 3240 agcaacgcat ggcttagaac cgtccatgaa cgaaaagagt accttcgcgc caacgctaca 3300 gacatcgcga tagcggaggg aaaatacaaa gctaaaaaga tggaggatgt tgttccggaa 3360 cattaccgag attacgccga cgtgttctcc gaagacacgt tcaatgctct tcctgatcgg 3420 aagccatggg accacgcgat agaactcctg ccgggagcta aaccttattg cggaaaagtc 3480 tatcccatga ctctgaacga gcagaaggct ctagatgaat ttctagaaga aaatctccga 3540 accggacgca ttaaaccctc gaagtcacca tggggggctc cattcttctt tgtaaagaaa 3600 aaggacggaa agttgcggcc tgtacaggac tatcgtaaat taaacgagat gacaaagaaa 3660 aacaaatacc ctttgccact catctcagaa ctattgaata agttaaaaaa ctgtcgttat 3720 ttctcgaaac tggacatcag atggggatac aacaacgttc gtatgaaaaa gggtgacgaa 3780 gaaaaagccg cattcatcac gaatcgcggg ctttacgaac ccttggtcat gttctttggc 3840 ctcacgaact cccctgccac cttccaaatg atgatgaacg acttattcta tgagatgata 3900 atggaaggga aggtcatcgt ttatttagat gatatcttga tattctctga cgatttgcaa 3960 acacatcgtg aggacgttcg gcgtgtatta gacatactgc gcctaaacaa actcacgtgt 4020 aagccggaaa aatgcgaatt tgaaaagtca gaagtcgaat atctcggaca tcttatttca 4080 tatggtacga tacggatgga ccctgggaaa gtggcagcgg tcgcggaatg gcccgagccc 4140 acgtgcaaga aggacttgca gagtttcctg ggatttgcga acttttaccg tagatttgtg 4200 aaggacttcg cccgaattgc gacgcctttg aaccgcttga ctggcaatgt tgagtgggag 4260 tttggagccg gatgcaagga agcatttggt aggctcaaag acactcttac gaacgctcca 4320 gttctcggta ttcccacaga caatgatccg tttagagtag aagtcgacgc gtcgaaatac 4380 gctgtcggcg ctgtactttc acagaaacag aacgatgttt ggcatcctat ctgtttctta 4440 tctcatagtc tatcaccggc tgaacggaac tatgaaattt acgatagaga actcttagct 4500 atcatcgtag ctttagaaga gtggcgacat ttcttactta gtgctgcgca tacggtagaa 4560 atctggacgg accacaaaaa cctagaatac ttcaggaagc cgcagcggat caacccgcgc 4620 caggcgcgct gggtctcgac gctccaggag tacgacttta ttttaaccca caaggcggga 4680 aaaacaaaca tcgtggcaga tccattgtct cgcaggccag acctcgacat tggtgacggt 4740 cacttagagg atgtcactgt tttaaaagcc gaatatttta acgcgatcac ggtgccgagc 4800 gccgatgata 
taattaaacg cgctaaatta tcaatacccg actaccccga atcagtagtt 4860 cggctattgg acaaggaccc tcacgtgacg accgaggacg gtctaatcta tagaaggggg 4920 agacttgtta tacccacgaa ccgatctcta atcggtgacg tcattgctgc gcatcatgac 4980 gcgcccagtg cgggtcatcc cggcattgaa aaaaccatgg aattagtaga tcgctcgtac 5040 tggtggccca ccaggaaaaa ggacattcac gattacgtta ttggatgcga agcgtgccag 5100 aaggcaaagg cccatagaca agcacgtgcc gctccactaa accctaatcc tgtgccggaa 5160 aacaactggg agttcatatc agttgatctg atcactcacc ttccggaagt ccacggacac 5220 aactctattc ttaatttcgt cgatttgaaa tcgaaggact atatttcggt agccacttcc 5280 gacacgttgt cttcagaagg cttcgcaaac ctctatctga agcacgtttt ctcgaagcac 5340 ggactatccc gccggatctt ttccgatcgc ggaccacaat tcgtttctaa ttttatacgg 5400 gacgtttaca agaagctcgg gatcacggga aatcctagta ccgcgtatca cccacagacg 5460 gacgggcaga cggaacgtgt caatcaggaa ctcgagacgt atcttcggat cttcgtctcg 5520 taccgacaag atgattggcc ggactggctc ccaatggcgg aattcgctca tcgaaaccgc 5580 aaacattcgt ctaccggctt ctcgcccttc tatttaacac acggacatga tgccttcaca 5640 ggcgtagaga cgaggaagga gatgaagaac gagtcggcag cacagtgggc tgagaaaatg 5700 gctcaattgc aaaaagaagc tagccaggcg ctagaactca gcaaggagtc aatgaaaaaa 5760 tactacgacc gcaagaagaa ggatgcgcgg gagtataaag aaggagatct gatatgggtc 5820 gaagggacca atatcagaac agaaaagcca aataaaaaga tggacgataa gcgatacggt 5880 cctttccgta tcgagaaaaa gataggcgca agcgccttta aattaaagtt accaatgtct 5940 tggaagggga ttcaccccgt gttcaacgag tcactccttt acccttacgt tgaacccaca 6000 ttccctaatc aggactcttc tcatacgata ccgccgcttg tagaccaacc ggacgctgta 6060 gatgagattc tggactctcg agaacgtaaa aatggactac agtacctagt acattggaag 6120 ggcaagcccc gttccgaaaa cacctgggaa ccacgtagcg gactattacg tacatcgaag 6180 gaccttttag atagattcca caaggaccat ccgaacgcgc cgaaaccccc gagtattcgt 6240 atacccacgc gagtacgctt ctcggaaacg gtggaagtag cggagggacc ggatgatcaa 6300 gaatgggaaa aatggggtaa caaataccaa agatggcaag agcgcaagaa attgcactct 6360 ttactattac acgcaaatct tgaaccagac gccgagcacg tgatcctacc aacgcgctta 6420 aatatcggcc acgtctggtt ctgcgactcc caacaccgag ttcgctatat cgcggtcgta 6480 aacagtccat ctccccaagg 
caataacttc gcctatccca tccaatccat cctcaaaatc 6540 cgcgaaaaca tcgtatgtac acatccggaa gcgcgcctat acgaacattc gaataactta 6600 cctctcaacg tcgcactttg gtagatgtcc agcatcaccc tatgggtttt cttcgggatc 6660 gtcgccttct gcggcacggc cggagtaatt atagtactgg ctaggtggta tcgaccacaa 6720 cgcgggacgg agcatatcga actccaaccc caccccgaac gcggctatac gtcgctcgtc 6780 atcaatcggg acgtcgaacc ccaactaccg ccgcctgtct atatccaaaa cccgagcgag 6840 tggactctcc aaaacccaca acgcgaggac gcgtcgtcct aaagaggggg gtaa 6894 // ID CACTA-1_PB repbase; DNA; FNG; 4894 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW EnSpm; DNA transposon; Transposable Element; CACTA-1_PB. XX OS Phycomyces blakesleeanus OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Phycomyces. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). 
XX DR [1] (Consensus) XX SQ Sequence 4894 BP; 1436 A; 939 C; 970 G; 1549 T; 0 other; cactacggga aaaaaaagtg gggttttatg gctgctttat ggcggtttga caaggcggct 60 ttgtggcggt tttttgccaa atatccactt tgcccccctg gagtttaaat acatattaca 120 gatctccgct ttctgattct cctttgttgg ctcttctatt ggacagcgtt tcattcttta 180 ttgtataaag tatattatat aatcgattac ttttattggg tactctctta aatgtcattt 240 attactttaa tattttagct tagaaaggaa ttctattgga cactagtaat ttttttttaa 300 tgttactttt atgaaaacag ttttttattg gataatatac atgtaaatgt ttgatcattt 360 taataaatac aaatttttat tggttattac aatagaagat catttttatt aatcattttt 420 attggttgtt atgacagaaa aaaaatcata ttgttcattt tactggttat tatgacagaa 480 aaataaataa tattgtttat tttattggct gttaaagcat aaaaaaatct atagatacat 540 tttattggct atttactatt taatattcta ggttttcata gtaaaatatc ctattcaatt 600 ccactacacc ggattactta cgagactgta atcttttatg gcggtccctc cagagatgtg 660 caactcttgc agttgatgcc cagtttccga gacatctagt gaagtctata ccagaatcct 720 tcattatatt tgcgtgcctg gcacacattt ctttcttaag tgctactggt atagaccccc 780 agctaggatt aggaccaaga gcgatcactg aaggatgggc agccatatca tcacatataa 840 gctttgtaca catattgagt gtggcctttt cgttacttgt tacttcaatg ccaaggtctt 900 cacttatcat cttgaagata tgctttaaag ttatttcacg gaatatgctc accggtttgc 960 ggatggtatt agcactagac tattaaatga catattaata tatattcaac tcgttttatt 1020 ttttgtctta caatggcaga gacagaaggc gaagtgtgtt ggttggaaat gtttctattc 1080 tccagttggc tctggagagc agtaatctcc aaattcaaat cttcgatgct cgtcttcatt 1140 tcgagcttca actcttcgat ctgtgccttc atttcaagtt tggccttttc ttggcctgcc 1200 ttcaaagata gtatctcatt gtacatggcg tgtagcaaag agatggaatt gttatctgtg 1260 ttattcatgt tgagagtttg gtttacctga gaaaaaggat tgttgtgaaa gaaagataaa 1320 ggccgaaaaa attttctatt gtctattatt agtttcatat aaatgagttt aacctattgg 1380 tccagccaat attattattg ggctactgta gttctaataa cagctattac gcaatgtttt 1440 agcttctgtt cctgagttta tatttgtatc atctatgtca ttgcctactt gtagtcatta 1500 agtctataca aatctcaacg ctattaagta taaaggcata ttatttaaaa ggaattatca 1560 atattcatac ctttattctt aagtgtttat ttccaaccta aaaaatgtct tcaaattcaa 1620 tccttgatag 
ttaccaatgc aatcaatgca aagagcgcca caccaattta aaaaaagcca 1680 agagttgtag agctcaatgc tttaagaacc gtcatagaag acataatgat atccagactt 1740 cacagaccac gcctgttcct ggacaagtca gtgttgtttt gaacactgtc tcgaacgaca 1800 ctatcgagta agactacaaa tcacatacat actattagaa agtgtaatta attatattgt 1860 taccttcata aacaagcaga gaacgtgctg atgcaatcga agatcagatc atggatacac 1920 tcaacagcga agataacgat gatccaatta tgaatatctt cagtaatgat gataatgatg 1980 agtctatggt aagatatata tatatataga tatacatgtc ttattatata atgctaatta 2040 ttatttacct tgttaaagta tgatgcggaa cttggcaacg acatggatat cattgaaaac 2100 gaaacatctc ctttggtttt tgacttcagt cagcctgcac ctacccctga caaagacgat 2160 gcaaagaacc ttgagtttct caagatcatt aaggattttg gtatctcccg caatgcccat 2220 gaaatgatag tcaagcattt caacagcatt ttggaaacgt cgacttgtat tacatacaga 2280 gcatgtactc cccatcttgg caaaaagctt ttgaagcgtt tctcgggtgt tgaagagaca 2340 gtccatgata tctgtcagag gggatgtatg ctctttacca gtccatccca gactgagtgc 2400 tccaactgtg gacaaagccg gtacaaaaca agacgtggag aaaccgaggg tggtgatctg 2460 gttgctgctg ccacgatgat acagcttcca ttggcgagac aactggctct tgcgttagcc 2520 aacgagaata caagagcaga tatgcattac cgccataatc atgagccaag ctcggatgga 2580 agtaaaaccg atgtgtttga tggccaggtt taccagcaag caaaacatct gttttccgga 2640 aaagatgaca ttgcaatttc gttgtctgtt gatggattca cgccacacaa tgttcctggt 2700 tctgtaacga tcctccatgc caccattctt aacctgaatc ccatggtccg gtatgagaga 2760 agcagaatgc tccagatcgc aatgatccca ggtccaggtg cacctgccaa tttctggtcg 2820 ttcatggaac caacgatgaa agagctcctg gtgttggaga gcgaagggat ggtcgtcaag 2880 acaccaaacg agaccattcg tgccaaggta catgtcttaa tggtcactgg tgacattccc 2940 gccttagcaa agttggcatg tcattctgga catatgagca aagacggttg ccgtatctgt 3000 catgttgtcg gccaatgtcc caagcacgga caatacttca gaactttgcc cagcaccaat 3060 atccgtacgc tggaaagttt ccagaatttt tcccaggcca gtgcatccag ccgcaaaggg 3120 ttaaacggac aatcccccct ggcaacattg aaggtattct ctggaccatt gttttttgcc 3180 ctcgatgaga tgcatggact ttgccatgga ataagcaagc aggtctgggg tttggtttct 3240 ggtacctacg gaacagacca ttgttttgct ctttcttctg gtgttcggaa ggagatcggt 3300 acagcaatgt acaaaacaag 
aaataccatc cctacatcct tccatggcga ttggagggat 3360 gtatacaaga accccgggtc gttcaaggcc gtcgattggg cagactttct tctgttcgtc 3420 gtccctacgc tggtggcaga gcgtattgga gatgcaactg cccggaacgc gttacttggc 3480 ttggttcaag catgtaacct actcatgagc tgggagttat cagcagagga acaaacctct 3540 atcaaaaggt atattttagt aaaagctatg tgttattcta ttatttattc taactatttt 3600 tcttcagtaa acttgaaata tggaacatgt acctggaatc gctgcttaca agtggaaaaa 3660 tcaaaatcaa tattttcact ataaaccagc atcttcttca gcactatcct ctcatgattg 3720 acgcatacgg cccacctcgt gcatatagcg ccagatccgt ggaacgggca atcggtgaat 3780 actcaagggc tatcaagagt aattctgcta taaatgttaa cgctggcaat atcatgcttg 3840 gcttagcaca aatacgacaa gcagaggccg gagccactgt catgattaca gaagcaagaa 3900 cagcacgaca cttacaatat gaagattcca ctgctggttg gccgttgaca gatgagggtg 3960 agcgtgttgg tgctgggtct gatattgagt tctgggggcc tttgaggaat agaacgatcc 4020 gagatagttt tgaaggcatt tcttgtcttt cgaaacttct tgaagacttc tacgaatcaa 4080 agggggaaga gtgtagtatg attgaagcag ctatacaaac tagccgcaag gcatttgtca 4140 atggttgtgt gattgactct gcactcgacc aaaattgtgt aagggaggca cataacatca 4200 gattacagat tcaggttgat gaaaaccgca acataaattc tgcatactcc ccggtttaca 4260 aggatttctt cggaaaagtt gttgtcttct ttgagcacaa gctcaacaac aagagatggc 4320 cgcttgctct tgtagagatt gcggcagtcc gtttggtaaa tggtatacca gttgttaaca 4380 atgggcaaat gaagccaaag gtagttcacc tggcagatgt caaagaattg gttggtttgg 4440 tgaagtcgga cgcgactata aatacaacaa caacaacaac aacaacaaca acaacataca 4500 tagtgtggcc agagcttaac cgcggcccaa aattgtcact tggttctctt gcagacctat 4560 aatttgtttt ttttttgtaa tacatttttt ttttcaaatt gtcttaacat attttatttt 4620 ctcagcagaa agtttagata ttttcttctc tttttcttct gacttttaat agtcatcaaa 4680 tgcaatctac atttactaaa aatgtaaagt ctgtagttaa agcttacttt tgtctatcag 4740 gagatttaat tttaggtggt caatttgctt tttgtcaacc ttttcaaaaa acagagattt 4800 tctagtttca aattggattt ttagtgattt tttaattatt ataggccgaa aacatagtca 4860 caaaagcaaa aaaccgccat tttttcccgt agtg 4894 // ID copia-1-LTR_AN repbase; DNA; FNG; 279 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 
8.11, Last updated, Version 1) XX DE Long terminal repeat of copia-1_AN LTR retrotransposon - a DE consensus sequence. XX KW Copia; LTR Retrotransposon; Transposable Element; KW COPIA superfamily; copia-1-I_AN; copia-1-LTR_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-279 RA Kapitonov V.V. and Jurka J.; RT "copia-1_AN, a family of copia LTR retrotransposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(11), 198-198 (2003). XX DR [1] (Consensus) XX CC It is a long terminal repeat of Copia-1_AN LTR retrotransposon. CC It is characterized by 5-bp TSDs. XX SQ Sequence 279 BP; 75 A; 60 C; 54 G; 90 T; 0 other; tgtcaaccgg aggaggttcc tccttactaa tcaatgttca gcacctcgtt cgtttgcctg 60 tctgtttgat gctttacacc gtttgtatgg cagggatatc tgtttaatgc cttatactgt 120 ttgtatggca aggatatctc ctgcataagc gagaaccttc ttttttgaag ccttggaaaa 180 ggtataaatt gcctatacag atcgtagtta ggagatcaag accttcaatg aaatcgttca 240 ctacggatat tgattacata accaccccaa taatccaca 279 // ID Gypsy-11_CCO-LTR repbase; DNA; FNG; 142 BP. XX AC AACS02000004; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_CCO_; KW Gypsy-11_CCO-I; Gypsy-11_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-142 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000004; Positions 759109 759250. 
XX SQ Sequence 142 BP; 47 A; 37 C; 31 G; 27 T; 0 other; tgggcactac tccaactcac ttggactaga tcacacttag acgaacgaga tacacggaca 60 gacgattaga ttagatcgga ttacctaatc cctagactac agcgagcacg tcgacaccta 120 gagtaggggg aacataccat ca 142 // ID FOLYT1 repbase; DNA; FNG; 2615 BP. XX AC AF057141; XX DT 28-APR-2000 (Rel. 5.03, Created) DT 28-APR-2000 (Rel. 5.03, Last updated, Version 1) XX DE FOLYT1 is a DNA transposon. XX KW DNA transposon; Transposable Element; FOLYT1; TIR; transposase. XX OS Fusarium oxysporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; OC mitosporic Hypocreales; Fusarium; OC Fusarium oxysporum species complex. XX RN [1] RP 1-2615 RA Hera C.; RT "FOLYT1."; RL Direct Submission to Genbank (03-APR-1998)Genetica, Universidad RL de Cordoba, Avda San Alberto Magno, Cordoba 14071, Spain. XX DR GenBank; AF057141; Positions 1 2615. XX CC FOLYT1 has 9-bp terminal inverted repeats; regions 421-1347 and CC 1503-2456 encode a DNA transposase. 
XX SQ Sequence 2615 BP; 698 A; 634 C; 587 G; 696 T; 0 other; tagagatgga tttaatgctg catcattaca taaaatgcgc tttttgcatt aaatatttaa 60 agcaagacgc gctttgtatg cattatttcc tgtcgcactg agcgtgctta aagggttaga 120 gtgaatatga tgagcagcag caaatgtgtt gattgatgtc ttaagttttc tcttgtgcgg 180 ggtattccat cccgcacagt tacttaaagc atatagctag attactaatg cctaccctgc 240 gacatttcct cttctatctt aggcttagcg ccttctataa taggtttagt gcataggaac 300 aatataagtc cttttccgac ctccttttct ccacagcctt caaccgtctg tcgatgtatc 360 cccctgaaac ttctcaattc gactccccac taccttcaga ccctccgtgt gtcttctaca 420 atgcccagag cccctggaag gccagccgat caccaggtcc accgagaatt tatcacgctt 480 gcctccgacg ctagttcggg tttcaagaac tctagagtca agtgcaagca ctgcggccac 540 gagataacca agggcactac tcgtcagaag aagcacttac ttcgccattg tcccaattac 600 aaaaggcagc aacagtctca gcagcctcaa cttactcatc actttcctgt agtggacaag 660 acctttaagc agatgcttga tgagcttgct gctatagcaa tttttgctga tggccgtcca 720 tttaacctct ttgagtccaa acgaatgcgg attctgttga gcaagctaaa cactgcctgg 780 caaccaccgt ctcgtcgccg agtacagcgg ctgctagccc cgacttattc tcagtaccac 840 aatcaagtcc aggctatcct cgaccaagca gaacatatca acgtaatctt tgacgcctca 900 gacaacatca caagtcatcg aattatcaat atctcgatac aggtggcaaa tggtacggcg 960 ttctactgga agacgtttga tacaggacag attcaacaca cggctgagca ctacatcgat 1020 cttttgtatc ctgagctgga gataatttgc aagggcaact tcttgcgaat taactcgttt 1080 tgcacagata cggatagcgt gatgaggaaa gctcatgtgc atttggcagc aagaaaggaa 1140 ttccaacatt gcttcttttc cctctgcgat tcacatgggt tacagctact catcaaggac 1200 atcctagagc tgccgttctt tgaggaggca ttcaaaaacg ctacgttaat tgtcaccttc 1260 ttcaagaagt cgaaactgca gttggctcga ttgagagaag cacaaaaggc tgcttggggc 1320 caccataagg cgtttttatc tgcgtgggtc tttcagtttt tctgactcct agctatagct 1380 gactgctcgc tttagtgtaa tcacacgctg gggcagtcag ttcaatgctt tacagtcagt 1440 tttacgctgt aaagagcctc tccaagcata cgctcgccgt cctgacgtaa gggcagagct 1500 agcctccggc tctctcgagc ttccttccaa aggtgctgga gtctgtcaat aaccctcact 1560 tctggatacg cctagagact gttttggcca taattaagcc tgtcaacagt cgtcagcgtg 1620 cctcagaagc cgatcgggct cacatcggcc 
atgtgatccc tcgttggctg gagattaaag 1680 cagaatggaa agcacttgac gagtctcagc aacatcaaga cgtgaatttc agcgagctgt 1740 attccgtatg gttaaaccgt atggataaac agacatatga tattcattat gcgggatttg 1800 cattacggcc tgacacagtc gggactaagc ttgaagaaca gctgatgatg aaagtgctgc 1860 agttctttaa gtcagcagtc aatcctgctg actacatcaa tattgttcga gaatttaacc 1920 acttccgagc acaatcgggc ggccggtttg ctgcaggagg cttagtctat tcaaaagaat 1980 ggactccatt agatgcttgg atgcttcttg ataaccaagg cagcaagctg gctgcacttg 2040 cagtccggat ctttggaacc atagccaatt cagttccctc agagagatca ttctcggcag 2100 ttaacttcct ccacagcaag gcacgcaaca gactcacacc agccaacgct gacaagttgg 2160 cgttcatcta catgaacgaa cgggtgctag agaggataac gcagtctcag aatcaacctc 2220 ttgatcatcg caatgatcat tctacggtgg tcagctggga agatttggca gaagacggtt 2280 ggcttacact cgaagatacg tatatggaaa ttcattgcgc atcaaatctg gaagttgacg 2340 ctgttgtcgg cgagttcact caccaacccg cctccgacgg agaagagaca gtggtggaca 2400 cacttgaggt cgaagatggg tcagagagcg aaggagcggg ggaaaaattg acctaatttt 2460 gcttggttgt ttgtatgata tggcccatgg gcagcttcat aaatgggtcg taatattgca 2520 ataaaaagcg cgctttttat taggaataaa atgctgcttc gttataataa aaagcagctt 2580 aatacacatg tgtattatgc acgtgcccat cccta 2615 // ID Copia-4_CCO-I repbase; DNA; FNG; 5889 BP. XX AC AACS02000009; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_CCO_; KW Copia-4_CCO-LTR; Copia-4_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-5889 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000009; Positions 806226 812114. 
XX CC Positions [3074-3604] - Integrase core CC 'CTTTG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 252..1763 FT /product="Copia-4_CCO-I_2p" FT /translation="MSQHHSPPHSPVQNPGENFGSDGENQHQAIFSGYDGT FT ISWVLPAPVFPAPMPMPEQPVAMPPPMGFRPSTAPPSMVPSSFNHVLPPVQ FT GHVPNSYTPSFGPTLPPSYNFGGSRRTFGAPGAFQPHLSNFGQELYSMNQS FT FGQALSQNGLPSSSAENSYRTPLLSPLHVDRELPYHNASPYTFRQNAIRGN FT PSYCNLGAPPFLPAASGMIGQTNAVNPAILDPTFMANGILPRHSTPTRDPH FT LQYTHQRHSPSTPTEQMRPPHPPNQPMFPNHYQFPLPPQGYYQHPPFVYDM FT NTITGLSKGLPSTESIPKLTGRENFRAWFEALESAALSANVYPHIASEPSE FT GAPYDPISIPTYPPSVAPDSPPHVLAFYQQWWQKDNVASHLVTAKLSDTIR FT LQIPTKSLNRRMPAREILREVQKIYGVVDYTSALVLVSEITNLTCNGEKKL FT SVDSYISTWRSKFNTLIQSSYNPDGRVLAQNFLLGFPSDDIDWKIWAARHM FT HFLETDSP" FT CDS 2027..4468 FT /product="Copia-4_CCO-I_1p" FT /translation="MIKRSLLGWAIITPPDPPPPLPSSSSTSGAKAAPSSN FT TSIASVAFVPPTVETSMSSSTENMTIVDWYDLDVVSLVDRFDDDDVSYDPS FT SGLSASTATHDPSLALVAPSQDLRGSVFPQEYNCLLDSGCSSHIFNDRTLF FT WTLDSLFRREVGTANMGSFTIEGRGDVAIQIYHSGKPIVLRFPDCYYYPNT FT PIHLLSVGAFNEKGLDCLFRADQSSTITFPRDHQSLPGLSVSISHWNRLYL FT LYFNFLLPTHTISSESLVALSPPPLYSHLCLVSIPFEDAYAHEAAFRATLT FT AQRPVPLTVHLWHRRCAHVSLEAIDMMVKGTYVEGMVLDERKGDPPTHPCP FT TCVVAKAKQQPYPHNRHRASAVLELLHLDVVGPFSPSRKKKLYFFTILDDY FT SNYGTTYLVSTRPEAIKHFKDYMSKWERQFDRKVLSVRFDGAGEFIHTLRD FT FCTSIGIRSQPTAAYAHPQNGKIERYNQTNEDGMNALLADSGLPPFFWGDA FT VLAHQYVRNRVPTSTLPPNITPFERFFGHKPDISNLRVFGCLCFPYVYKEK FT RRKGDLPRYEAIFVGYEEDRVGWLVVTPQGKENFVRDCVFMENVRGRLHSS FT KSLVPLSPNSITSSDSSMLPPMSVSTTEVVPSIRAKDVIDQREERLKTLRP FT RLPNRAALVTDDAIHSLMALSFLDSSFCEWSFSSLENFHIEQCSLLASSID FT RARYSRPKTYTRPPKNLAEALRREDWDAWFAAMQREVHSLRVDHKAVTAVP FT LPPGEKAIPVRWVYDYKYNPDGSIIRGKEKARLVAQGFRQGPGEYACVVRP FT KAIPYLLIRRLW" XX SQ Sequence 5889 BP; 1285 A; 1754 C; 1245 G; 1605 T; 0 other; ggttatgagc cccggcgact aactataaca gtttgccacc gagccagatc gctagttcta 60 gcggaaccga gctgtttcta gaacgatttt ttcggagacg ttggagaata gggtcgcggg 120 agaagatttt tttgacggca gctatacagg tgcgcctatc atctcgggtg cagcagtttt 180 tgtccgttat 
ttctttctct ttttctgttt cttacgcgta aaccttgtct attttttctt 240 ttatctctac tatgagtcaa caccattccc ctcctcattc cccagtgcaa aaccctggag 300 agaattttgg tagcgatggt gaaaaccaac atcaggccat tttttcgggc tatgatggca 360 ccatttcatg ggttcttccg gccccggtgt ttccagcacc tatgccgatg cccgagcaac 420 cagtggctat gccacctcct atgggatttc gcccttctac ggccccaccg tccatggttc 480 cttcatcttt caatcatgta ttacccccag ttcaaggcca cgtgccgaat tcgtatactc 540 cttcttttgg tccaaccctt ccaccgtcat acaatttcgg tggtagtaga cgcacatttg 600 gtgctccagg cgcttttcag ccacatttat ctaattttgg acaagaactt tattcgatga 660 atcaatcttt tggtcaggct ctctcacaga atggtttacc ttcaagttct gccgagaatt 720 cgtatagaac tcctcttctt tcaccattac atgtagatag agaactacct tatcataacg 780 cttccccata cacctttcgg cagaatgcca ttcggggaaa tccctcctat tgtaatctag 840 gtgctccacc ttttttacct gctgcatctg ggatgattgg tcaaactaac gctgttaatc 900 ctgctattct cgatcccacg tttatggcca acggaattct cccgagacac tctactccca 960 ctcgagatcc acatctacag tacacacacc aacgacattc accgtctacg cctaccgaac 1020 agatgcgacc acctcaccca ccgaaccaac ccatgttccc aaatcactat cagtttccgt 1080 taccgcctca gggctactat caacacccac cgttcgtata tgatatgaac accatcacgg 1140 gtttgagcaa gggcttaccg agtacggaat ctatacccaa gcttacgggt cgggaaaact 1200 tccgtgcctg gttcgaggcc ttggaatcgg ctgcgttatc ggcgaatgtc tatccccata 1260 tcgcttctga accttctgaa ggagcaccat atgaccctat atccatcccc acctaccccc 1320 catcggtcgc gccggactca cctcctcatg ttttggcttt ctaccaacag tggtggcaaa 1380 aagacaatgt cgcctcccat ctcgttacgg caaagctctc ggatacaatt cgcttacaga 1440 ttccgacgaa gagtttgaac cgtcgaatgc ccgcgcggga gattcttcgg gaggtccaga 1500 agatctacgg tgtggtggac tacacgtcag cacttgttct tgtctccgag atcaccaacc 1560 ttacctgcaa cggggagaag aaactgtcgg tcgattcgta tatttcgacc tggcgttcga 1620 aattcaacac actcattcag tccagctaca accctgacgg ccgtgtcctc gcccagaatt 1680 tcctgcttgg tttcccctcc gacgatatcg actggaagat atgggcagca cgtcacatgc 1740 acttcctcga gacggactcc ccctgagggc atccttgccc gtattccctt cctcctccaa 1800 tcagcgacgg ataatcttct cgcaatcgac aacacgacta agcttcagaa gccctcttca 1860 agctcaactt cgcgtcgtca acaaaagtct 
accgtatcgt cggcgaatcc gaaggcgaag 1920 gttatttgct ctcactgcaa gcgttcgggt catgaggtgt ctacctgttg ggctgaggga 1980 ggacccatgt ttgggaaacg agaagaggtg ttgaagagca agcgagatga taaaaaggag 2040 tctgctagga tgggcaatta tcacaccccc ggatcctcca ccccctcttc catcctcctc 2100 ttcgacatct ggagctaaag ctgccccttc ctctaacact tctatcgcct ctgtcgcgtt 2160 tgtaccaccc actgtagaaa cctcgatgtc atcatctacc gagaatatga ctattgttga 2220 ctggtacgac ctggatgtgg tgtccttagt ggatcgtttt gacgatgacg atgtctctta 2280 tgatcctagc tctggattat ctgcctctac ggctacacac gacccctccc tcgcactcgt 2340 tgccccgagt caggatctac gtggatcagt ttttccccag gagtacaact gtcttctcga 2400 ctccggttgt tcatcacata tattcaacga ccgcactctt ttctggacac tcgactcctt 2460 gttccgccgg gaggttggta cggccaacat gggctcattc acaattgagg gccgaggcga 2520 tgttgcgata cagatctacc attccggtaa acctatcgta cttcgattcc cggattgtta 2580 ctactatccc aacaccccta tccaccttct ctctgttggc gctttcaacg agaaaggctt 2640 ggactgcctc ttccgagctg accaaagctc gacgataact ttccctcgtg accaccaatc 2700 ccttccaggt ttatctgttt ccatctcaca ttggaacaga ctgtacctac tatacttcaa 2760 cttcctgctt cccacccata cgatcagcag cgaatcactc gtcgctctgt ctccacctcc 2820 cctctactcc catctctgtc tcgtctctat cccgttcgag gatgcttatg cgcacgaggc 2880 cgcgtttcgc gccactctca cggcgcagcg ccccgtccct ctaacggttc atctttggca 2940 ccgccgctgt gcgcatgtca gtctagaggc gatagatatg atggttaaag ggacgtacgt 3000 tgagggtatg gtcctggacg agaggaaggg agatccaccc acccatccat gcccgacttg 3060 tgttgtggcg aaggctaagc aacaacctta cccccataat cgacatcgtg cctcagccgt 3120 actcgaactc ctccatctcg acgtcgtggg tccattctct ccatcacgga aaaagaagtt 3180 gtacttcttt actatcctcg acgactactc caactacgga accacttacc tcgtctcgac 3240 tcgtccggaa gcaatcaaac acttcaagga ctacatgtcg aagtgggaac gtcaatttga 3300 caggaaggtt ctttctgttc gatttgatgg tgctggggaa tttattcata cgcttcggga 3360 tttctgcacg tcgattggta ttcgctccca acccacagct gcttacgcgc accctcagaa 3420 cggaaagata gaacggtata accaaaccaa cgaagacggt atgaacgcgc ttctagcaga 3480 ctccggtctt cctcccttct tctggggcga cgccgttctc gcacaccaat atgtccgcaa 3540 cagggtacca acctcgactc ttccccccaa cataacgcca 
tttgaacgat tcttcggcca 3600 taaacccgac atttcgaacc tgagggtctt tgggtgtctc tgcttccctt atgtgtacaa 3660 ggagaagcga aggaaaggag atcttccccg ttatgaggcg atcttcgttg ggtatgaaga 3720 agaccgtgtc ggatggttgg ttgttacccc ccagggcaag gagaacttcg tccgagactg 3780 tgtctttatg gagaacgttc gcggacgcct tcattcgtcg aagagtctag ttcccctctc 3840 tcccaactcc atcacctcta gtgactcgtc tatgcttccc cctatgtccg tctccaccac 3900 ggaggttgta ccttcgatca gggctaagga cgttatcgac caacgtgagg agaggctcaa 3960 gactcttcga cctcgactac ccaaccgcgc agccctggtg accgacgacg ctattcacag 4020 cttgatggcc ttatcgtttc ttgactcttc cttttgcgaa tggtcattct cctcactcga 4080 aaactttcac atcgaacaat gctcccttct tgcgtcctct atcgaccgtg cacggtactc 4140 tcgtccgaag acatacactc gtccccccaa aaatcttgcc gaggcgctgc ggagggaaga 4200 ctgggatgcc tggttcgccg cgatgcaacg agaagtccac agtctccgtg tcgatcacaa 4260 ggctgttacc gcggtccctc ttccgcctgg tgagaaggct attcctgtcc gatgggttta 4320 cgactacaag tacaaccccg acggctccat catacgaggg aaggagaagg cgaggttggt 4380 tgcccaaggt ttccgtcagg gaccaggtga atatgcttgt gtcgttcgac caaaagctat 4440 tccttacctc cttatcaggc gattatggtg acacttatgc accagttgcc cgcctctcga 4500 gtattcgctc gatcttggcc tgggcctgct tcaaggattt ggaagtgaaa tctttcgata 4560 tcaagaccgc ctttttgcac gctaggcttt cgaaacccat atacatcaaa cagatcccac 4620 attttcccga atctgattct tcccatgttt cttcgtcttc atgtcgctct ttatggcctc 4680 cgccaatccg cctatgagtg gtaccattta ctcaactcgg ttatcacctc tattggcctt 4740 gttcgtctcg atatggacca cgccgtctgg gtcggccgct ggtcttcccc accccctgac 4800 tccaatctca ccatgccctc tgacggctca ccccttacaa ttattgtccc tgttcatgtc 4860 gacgacggtc tcgccgcaac aaactccact cccctctacc actggtttat cgacaaactc 4920 cgggagcgta tcaatgttgt tgatcttggc gatgtttctc tctacttggg catccgcatt 4980 atgcgtgatc gtaccaagcg tcgaatgtgg ttgtcccagg agtcttatat cgaggaggta 5040 ctcgatgact tcgggttgaa gacctgtaat tccccccctt gtccctatca gccgcccctt 5100 gaacgaagta aaggacgatc ccacctcaac gaccaatctt aaccccgatg aatgccgaac 5160 tatgtatcaa cgcatagtcg gtatcattac ctaccttgct gtctgtactc gtccagacct 5220 gtcctacgcg gctatggcac tcggacagta caacgcttcg ccaaccccaa 
ctcttgttgt 5280 catcgctcgt ggtgttcttc gctatctacg caagacatcc tcactcacac tcgcgtattc 5340 cccattctcc gatccgaaaa ccgacaaacc atatcctcta ggttccccat cgactgcggg 5400 gtattcggat gcggactggg ctaccgattc atcggacaga aagagcgtct ctgggtattg 5460 ttattttctc aaccactgcc ttatctcgtg gtcggcttcg aagcaaaaat gtgtctctct 5520 ttcttctaca gaaagtgagt actacgcact agctcatgcc atgcgtgagg gcttgtggct 5580 tcggcccctg ctctccgcta tcggtattga atatcccatc cattttccac ttattcggcg 5640 acaaccaatc aactatcacg atgtcccaat ctctctgtgt caacacccga tccaaacata 5700 tcgatgttcg ttaccatttt gttcgtgata atattcgttc cggtcttttc gatttgaatt 5760 gggttcctac ttcggctatg atttctgata ttttcacgaa accgctttct tctactcttt 5820 ttgaaactca tcgtgatgct cttcatcttg taccttgctc tatctctgcg taggcgtctt 5880 gagggggtg 5889 // ID Gypsy-1_CDC-LTR repbase; DNA; FNG; 516 BP. XX AC NC_012866; XX DT 06-FEB-2011 (Rel. 16.02, Created) DT 06-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Candida dubliniensis genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_CDC_; KW Gypsy-1_CDC-I; Gypsy-1_CDC-LTR. XX OS Candida dubliniensis OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-516 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Candida dubliniensis genome."; RL Direct Submission to RU (06-FEB-2011). XX DR Genome; NC_012866; Positions 428433 428948. 
XX SQ Sequence 516 BP; 188 A; 79 C; 93 G; 156 T; 0 other; tgtcgtgtag tggatagtaa tcttttaaat atgctatatt tggtactatc acacgacaga 60 gaatcatttg aaagaataat tacgtttgtt ggtaaatcgc acataaagaa gagaattgtt 120 attagagaga aagtgttgaa ttagttaaaa ggatttaagt gtgagtgtgt gagtgaatag 180 aataattggt acacacacgc acgaaaatgt cgaattctat ttaagtaaag aaaatctaga 240 aaattaatta tttagttaga atggacctca tgcaagctta ggctgagatc caggaatata 300 caaatacata caatccacat atattacact cgtacggaat agaggtatcc cgaagactta 360 cacttagtca ctgtgtccat tattacacat acatccaata gcattgattt actactttta 420 gccaaccacc agaaactgcg tgagtattca cattgcacgt gaatcagtac cgtaaatatc 480 attgtcattt aaattaaata gcgctacatt gtgaca 516 // ID I-3_AO repbase; DNA; FNG; 4137 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE A family of I non-LTR retrotransposons. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-3_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-4137 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-4137 RA Kapitonov V.V. and Jurka J.; RT "I-3_AO, a family of I non-LTR retrotransposons Aspergillus RT oryzae genome."; RL Repbase Reports 6(1), 11-11 (2006). XX DR [2] (Consensus) XX CC It is a 5' truncated copy of a non-LTR retrotransposon that CC belongs to the I clade. Only one copy is present in the genome. CC ORF2 is corrupted by many mutations. 
XX SQ Sequence 4137 BP; 1130 A; 1255 C; 1071 G; 681 T; 0 other; ttattctacc tgcggagtca aaccctacgg accgagttgt tcgatcccag tggacgtctg 60 acgcagtgct tgcaatgcca acgttatgga cacgtccagc gcggctgcac cttctccatt 120 cgacgcctgt actgcggtga acaacaccgg aaaggtgact gtccccaaac caaggtaacc 180 cccgatcgtt ttacatgcgc cacctgcaaa ggccctaatc ttgcctacga aaagatgtgc 240 ccgacacgac tccaagcggc tgccgaagcc gactatacca gaaagcacaa gccgaaatac 300 tttcccgaac cagcgcggcc ggtgacgtcc ctgcctattg gacctccgcc gctgacatcc 360 aatcccagcg gggactctga gcagccggag gaacccgaac ccgaacccgc cgaatccagc 420 gcggagacgc ggaagaaagc caagaaggtg gtggagacca gtctcacaac cacaccaacg 480 ccactgccgg aaggcgcaca gcgaaatctg cgtcgacgag cgatcaccta cgcccacgag 540 gtctgcctgg tcaactcggc cacctcggac cactcagaaa gcaccgagcc ggaatcagac 600 cagagagaac cgtcagggac gatctcccaa gccgacagct ccgagcagtc cgaagaatcc 660 aatccgtccg ggcgatcgga cacagatgcg ccgtcagccc cgaagaactc ggagttagac 720 agagatgcca acttagagat cccagaccga gaggaggtcc agcaatgctc ggccttggac 780 acggacacac gacccgaaca gcaaacgccc atttccccgt ttagcagccc gaagaaacga 840 cgatcaacgc gcctccagga ggaaccaatc aaacgccgaa aaaccacagg cccgagagaa 900 ataccccagc caacgcctga acaatggcga aatccgagca ggctggaccc ccaggcggaa 960 agccgaacat gcaggtcgta tgtcggcatt gggactgacc ggcaattgaa gaccactgac 1020 cagcgtaaaa cacaccagac gaaatgtcca caaggggcaa ggggagggga cggggccgac 1080 cgacgcgtcc ccagctccgc ccccgccccc gcccgactcc ggtccaccag gaacgctacg 1140 cgtcttaccg gggcgtacgc accacctctg cagggaccga gcgggctcga gtttacggca 1200 caggcaaaag cgaaactgct gtcccagatg ttcttcccgc caccaccacc cgtaaacctc 1260 gacgatatca ggaactacaa atacccggaa ccgctgcaga atcccggcat tacccagagc 1320 gaaatgctac agtcgatcaa cagaccacgg gaggacaaag acccttgccc agacgatatc 1380 accgaccggt tcctccaagt cacggcggac ctggtcgcac tgattctgac agaaatctac 1440 aacgaatcgc tgcggcaagt ccattgcccg acacacttcc ggaaagcacg taccgttgcc 1500 ctgcgcaaac cgggcagaga cggctactcg aaaccgaaat cataccggcc tatcgcgctc 1560 ctaaacacaa tcgctaaggt gctcgaaggg attatggccc tacggctatc gtacttcgca 1620 gaagaatata aacttctccc agacaatcac 
tttcgcggac ggaagggaca gggcaccgag 1680 acagcacttc acagtgatgg aaatcatcag cagagcatgg aaacgaggcc taaccgcgtc 1740 agcactgctc ctagacaatt taggagcctt cgacaacgtc tcacaccaac agctcctgca 1800 caacctccgc aaacgacgaa tcccatccgt aatcgttaat tggatggcct catttctcag 1860 gagcgctaca cgacactaga acttccagag gtctcctgtc cagaatcgat gattgacacc 1920 ggaatccccc gaagatcgcc ggtgtccccg attctctacc tgttctacca cgcggatatg 1980 attaacgcag atgaagaaac gaaaacatcg gatacattga cgacgctaca atggtcgcag 2040 taggcccggc accggcggaa aactgctgaa ctctacggaa ggcctttctc aatacctgcg 2100 aaccgtggtc acggacacac gccttggagt agtgctagac agacagttcg actttcagac 2160 acatctacag cagatcgata ccaaatgcag tcaggggcta cgggcaatca cattgttggg 2220 acgttcgaaa tggggtccgt ccctggagga caagcggcag gtatacaacg cgtgcgtaac 2280 acccgtggca ctctacgggg cttccgtttg gcagcaccca ggggccccca agaagaaagg 2340 gatgcacgac aggcagctgc gatcactggt cgctattcaa cgacgggccg gacacagcat 2400 ctcaggcggc ttcaaggtcg tcagcaggga cgcattcgac gcagagctcc acctactacc 2460 cataacccaa cggctccgaa gaaaccgcct gaccgcgctg accagaattg ccgcaacccc 2520 cgcataccaa cgaatcctta ggcagcgcgc tttcagccag aaccaagctc tacattccgc 2580 gctggaggtg gccaaagaga gctccaccta cgggctgaca tcgacgcctt gcggatggag 2640 atcacaaccg catttatcgt gccccgtggt ggaaactacc ccatgtggag atagcaaaat 2700 ggaaaacaga agcgctaaca cgccacggca tccactgtat cgcctatcct gtggtgccaa 2760 aagtctacac tgacggctca ggacaccacg gcgagaccgg ggccgcgcat tttgtctgga 2820 cccaagaacc tactgacagc cgtacttagg gccaggccga cacgccacgg ttccggtcgc 2880 agaactggca gggcttgtcc tggcacttcg aatggccaca gaagacccat aatctcaggg 2940 tccggtggaa gtatacatag ataatgaaag ttccctgaat atgctctgaa accctaaata 3000 ggggtctggt caacatctaa tcaaacaaat cgcatgcctt cttggggaaa ggcgccaaca 3060 agggtggcca acgaccttct agtggattcc ggctcacgta ggggttccgg gaaacgaact 3120 cgcagatgaa atggctgaat gggcaacggg gtggcgccct gcactcggca caagtgacgg 3180 tccgcgcgcg aaggggatcc cagggctgga cctggcaccc caacaatcta cggcagaccg 3240 gtggattcga accaaattta gacgaatggg aagataactg gcaaggcagg ggcccaaggc 3300 tcccggcaag gagccacacg acctaaaacc gacgatccga 
agcaaggacc ggataatcta 3360 cagcggtatg acaccgagtg agcgcagcgt gttatttcag acgagaagca acaaaatagg 3420 ccccaacggg tagctagcaa gggccgcaag gatcaaacgc gcagatactg atcgataccc 3480 gcggtgccgc agtgcacccg aaacccggca ccatctacta atcacatgcc cagcgtacag 3540 gaagctgatg atcaagaaat ggtttgactc ccaatcccgt ccacagaatc tcaaagaagc 3600 tctcaacaac ctgaagtaca cgctgcgtac agctcggttt ctgctacaaa cagcactact 3660 acggctatac ggagaatacg acgacgacga cgacaatgac agtggggatg atgacgttgt 3720 ggacagcagt gattcggaca ccagtggcag cgggttaacc tgatctgcgg tagggtgagg 3780 ccacggaata ggggaagcag ccggctataa tcaccgcttg ctcaaatccc tttggtatcc 3840 gccatcgatc tgtgtatgac aatatctaca tataagatct acgcccaatt acagacacaa 3900 ctgtcgatgc gagctacttg gttaagctgg ggatggaaca aggaaacggg tgcaggggaa 3960 gggaactaaa atattttgaa accgcacgaa ggccgcgagt cttccatcat ctagctagaa 4020 taaccctgta catacctaag ttagtcctcc ggctggccaa cttgccgtcg acgacccgag 4080 aagagcccta aaaggggctg cattcgggta taatgcattt ctccctctct ctctgtc 4137 // ID Gypsy-104_MLP-LTR repbase; DNA; FNG; 601 BP. XX AC AECX01000554; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-104_MLP_; KW Gypsy-104_MLP-I; Gypsy-104_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-601 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000554; Positions 41847 41247. 
XX SQ Sequence 601 BP; 177 A; 144 C; 104 G; 176 T; 0 other; tggctaacaa gcaaagggga agagacggca cctgatttta caaaagccaa tttcctgtta 60 gattttctaa acaagagaaa gactcttttt caattccaaa gctcctaaga aattcccgac 120 gaatagtctt aatagcttta aaactttgag aaaaccattt aaatcagcgt caaacctatt 180 tactacattt ccaagaaatt gttattcata gagaactcac tcctagttaa atcttgttgg 240 tcagttgttt ctcttcttct ccactggtca aggacccagg ttctgtacat tcgtgtcgac 300 agaatccctg cagatttata tctcaggctg ctcttaagct tatagcttag agaataacta 360 aggctcaggc tcgacctgcc accacctgta aaccacagtg ggcctatagt ttaccaagtc 420 tccccttaag gaacgaatag cactctcagt tacctatata actgtctttc ttgaatattc 480 caataggata atttgcttat tccccgtagc cagcggaata gaccgtcagg atctcagcac 540 gagttgactt actagacctt ctcgtacgta ctagttatag ccgccctgac aagtggtccc 600 a 601 // ID Gypsy-10_RO-I repbase; DNA; FNG; 5559 BP. XX AC AACW02000074; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_RO_; KW Gypsy-10_RO-LTR; Gypsy-10_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-5559 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000074; Positions 351281 356839. XX CC Positions [4567-5052] - Integrase core CC 'CTTTTG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1341..2948 FT /product="Gypsy-10_RO-I_4p" FT /translation="MQQRQQQPNQQYMQQLRQNASIPAPQQLHNSMAEPIV FT VDPPTPKNIKAKKKRTPARRLEVDVPSVDIWEILKGKNADVSCAQLLAMNK FT VIAKDLVDGIRGMHGRKANVRQVNMARPAPDVLTLDSYDYSDDEGEDDMFD FT EDGDDDDAQASYCSFGDEASSCIANTGSSVEESEFDGDSGDDADTEFEYSY FT DYREMSKSEPLTVKVIIHDKELSAIVDTGAAISVMSEALVKKLDLKTNDDT FT VSIQLLDGTNSKPGGVVPNVPVRIGGKLRTEHFAIQKGRKDELLILGMTWL FT KNYGIIPDPESGTVTVPYGRRVDYKGAVVREAGQVVLTTQRETGDAVEGQR FT DRWVSRPVYTLSLAEGVHSAPEPMAVSNCYDGESSLSGSDDEGEKWEICDM FT TDTPKEVEQIVMRNKSCFVEVSGLGRVVGVEHQIRLKTDEPIRCKPYRLTW FT EEEKVLKEELEKLLDQGLIEKSDGLYASPILFVQKKDGSKRLCIDYRKVNA FT ITVKDAYPLPFIDELLDAVGGAKVFSTLDAASGYLASSDG" FT CDS 2824..3900 FT /product="Gypsy-10_RO-I_2p" FT /translation="MLSLSRMLIHCLLLMNCSMRWVVPKYSVRWMLLVGIW FT QVAMAKDSVDKTGFVTKFGTFKWKVLPFGLTTAPSTYQRMMVNILGELIGD FT CVYVFIDDIIIFSETVEQHVHDLQRVFDKCEAAGLKLKGAKCRFGVSGVEY FT LGHQITKDGLLPTEKNVKKIMDMPTPTTSDEVRSFLGLVGYYRRFIVEFAD FT TAHPLTSLIKKGAVFDWNQECEAAFDSLKNSLVTPPLLDYPDRDQVQILTT FT DASSKGLGAILSQSPDGTGENEKVIGYASRTVRGPEVRYPPTHLEALGVIW FT AVQHFRHYLAGRRFILYTDHSALQFIFNNPKPAPKLARWAAAMMEYDFDTR FT YRRGEENPADALSRLV" FT CDS 4297..5508 FT /product="Gypsy-10_RO-I_3p" FT /translation="MSRSMVYYTRNKLMGNPGVEILNTGNIFEVVKQLHNE FT GHFGVLNTWRRLRIQYDAPGLYDFVKEFVAGCEACQKRAKRQRIKTVQASP FT IPTPSAPFFMVGCDAIGPVVESKKGNKNLLVAVDYLTRWPVVAAVPDITAE FT TVSWFLFHCVVKDFGVPSYILTDRGSNFLSDHVAFFLKRMGCRHLTTTAYR FT PQTNGLCERMNQTVVQALSKIIATSEDNRDWDEYLDETILAVRTMPNETTK FT FTPSMLLFGYEMRTPSNWPAPRQDYVQGEVVLEVESRIRVITGLMNEFRAE FT AKERSGLMQQRRKKKYDEKVFFRRYELGDKVLYRDKMKDNKFSYCWIGPFM FT VVRVNVHGTYWLDGAGGKKVMGAVNGDELKPWKDHVRLKPDVAVSGAYEQY FT RRFYEERRTI" XX SQ Sequence 5559 BP; 1824 A; 896 C; 1406 G; 1433 T; 0 other; gtttggtgga gaagctgggc gtgcacataa acaaatcctc caattcgtac tataaaaaaa 60 aaaatttttt tacaaggaaa aaactttttc aaggaccgat aaggtttgac agcgtcaagg 120 agaaggaaca aaataaaaaa aaaaaaaaag agaaaacagg agagattaac acaagaagat 180 ccatccatca aaaaaaattt tatggttaaa gaaatcgcaa gcgaaaacgg tgatatgttc 240 aataacttca ctacaatcaa 
gtcttttatg gctagacctg agccatttta tggtgtgaaa 300 ggcgaaaacc ctggttcttg gctgagatcc attgatagaa taaggaaagg aacaaaggcc 360 aatgatatgg acattttgtt aattgtggga acattattga aaggtaatgc aggaatctgg 420 tgggattcca tcgaggacta tgtaaccacg tggggtgaat ttaaaaccaa gttcttggga 480 aattacataa acgaagaaac aaaaatgcag tggaggaagg agctaaagaa caaaaaacaa 540 tacgagaatg agtccattga cgaactggtg acttaccaac tagacatgtt tcaacggctt 600 ggttgggacg atgaaagtga caaaatcgaa gtgttcatct ctgctttaag acgagaatac 660 gcttacgaag tggaaagagc tcgtcctttg acttgggaag ctgcagtaaa ggacgcaaaa 720 caattagaat ccttgagttt gaaatacaat ggtataaaaa ataaaaaaga aaacattgca 780 cccactaagg actttgggag tgaatcaatg agcagcattc ctgttaatca aagcagcaat 840 gctagtcagg atgttgtctc ttcgctgtca tctgaagtga aaagtctaac ccaacaattg 900 aatgcattga aaatctatgc tactccaaat agaaatatga atgggcaagc aagtttcact 960 tgttatggtt gtggccaagt tggtcataaa tcgttttact gtcccaatcg tatagaaaat 1020 aataataatg tatcgagttc gggaaaagac tcgagacagc agtactagag gagtctgctg 1080 tcgaagagaa tcatcaaagc tcctctacag gtaaaaattt taaaaaagaa aatgtaaact 1140 atgggaagaa acaagaaagc catggaagaa aaataaatgt tgtgcaaggc gtgcaagtat 1200 ggaacaatag aggcgcggta agaactcgtt ctgagttgtc ctctgataag ggtaagcaga 1260 cagctaccca aaagcgacaa aggaaagaac agcaacctgt accaatacaa cagccagtac 1320 aggactacca acaaatatta atgcaacaaa ggcaacaaca accaaaccaa caatatatgc 1380 agcaactacg acaaaatgcg tcaattcctg caccgcagca actgcataac tcaatggctg 1440 aaccaatagt tgttgatccg cctactccaa aaaatataaa agctaaaaag aaaagaacac 1500 ccgctagaag attagaagtg gatgtaccca gcgttgacat ttgggaaatt ttaaaaggca 1560 agaatgctga tgtcagttgt gcccaattgc tagcgatgaa caaggttata gctaaagatc 1620 ttgttgatgg gataagggga atgcatggtc ggaaggcgaa cgtgcgacag gtcaacatgg 1680 caaggcctgc cccagatgta ctaacgctag actcctatga ttatagtgac gatgaaggtg 1740 aagatgacat gtttgatgaa gatggtgatg acgatgatgc tcaagcatca tattgttctt 1800 ttggtgatga agcctcctct tgtattgcca ataccggttc tagtgtggaa gaaagcgaat 1860 tcgatggtga tagtggtgac gatgctgaca ccgaatttga atattcctat gattacagag 1920 agatgagtaa aagcgaaccg ctcactgtta aggtgatcat 
acatgacaag gaactttcgg 1980 caatagtcga cactggtgca gcaatctctg tgatgagtga agccttggta aagaagttgg 2040 atttgaagac caatgatgat acggtgtcca tacaattgtt agatggtact aacagtaagc 2100 ctggtggggt agtgcccaat gttccggttc gtataggtgg taagttgcgg acagaacatt 2160 ttgccataca aaaaggaaga aaggatgagc tgcttatcct gggtatgaca tggttgaaaa 2220 attatggtat catacctgat cctgaaagtg gtacggtgac ggtgccctat ggtaggaggg 2280 tagactacaa aggtgcagtg gtgagggagg caggacaggt ggtcttgact acccaacgtg 2340 agactggtga tgcggtggag ggtcaacgag acaggtgggt ctctagacct gtgtacactt 2400 tgagtctggc agaaggggta cacagtgcac cagaacctat ggctgtcagc aactgctacg 2460 atggtgaaag ttcactctct gggtctgatg atgagggtga gaaatgggaa atatgtgaca 2520 tgactgatac accgaaggaa gtagaacaga tagtcatgag gaacaaaagc tgcttcgttg 2580 aggtgtctgg gttgggtcgt gttgttggtg tagaacacca aatccgcttg aagactgatg 2640 aaccaatccg ttgcaagcca tatcgactga catgggaaga ggaaaaggtt ttgaaagaag 2700 agttggaaaa gctactggat caaggattaa tcgagaaatc cgatggtctg tatgcttcac 2760 ctattctgtt tgtacaaaag aaggatggga gtaaaaggtt gtgtatagac tatcgaaagg 2820 ttaatgctat cactgtcaag gatgcttatc cattgccttt tattgatgaa ttgctcgatg 2880 cggtgggtgg tgccaaagta ttcagtacgt tggatgctgc tagtgggtat ttggcaagta 2940 gcgatggcta aagactctgt ggacaagact ggatttgtca ccaaatttgg tactttcaag 3000 tggaaggtct taccttttgg tctcacaaca gctcctagta cgtatcaacg tatgatggtg 3060 aacattttgg gtgagttgat cggtgattgt gtgtatgtct tcattgatga tattatcatt 3120 ttttctgaga cggtggaaca acatgtgcat gatctgcaaa gagtatttga caagtgtgaa 3180 gctgctggcc tgaagttgaa gggtgctaaa tgtcgttttg gtgtgtcagg tgtggagtac 3240 ttgggtcatc aaattactaa agatggattg ttaccaacgg agaagaacgt caaaaaaatt 3300 atggatatgc ctacccctac tacaagtgat gaagtccgtt cgtttttggg tctggttggg 3360 tactatagaa ggttcatagt cgaatttgct gacacggctc atccgttgac atcgttgatc 3420 aagaaaggtg ctgtatttga ttggaaccag gaatgtgaag ctgcattcga ttcgctaaag 3480 aatagtttag taacaccgcc tttattggat taccccgatc gtgatcaagt acaaattttg 3540 actacagatg ctagtagtaa aggtttgggc gctattttat cgcaatcgcc tgatggcact 3600 ggtgaaaatg aaaaagtaat cggttatgcg tcaagaactg tacgaggtcc 
agaggtgaga 3660 tacccaccga cccatttaga agcactaggt gtcatatggg cagtacaaca tttccgacat 3720 tatttggctg ggagacgttt catattgtat acggaccatt ctgctctgca atttatattc 3780 aataatccga aacctgctcc aaaactagca agatgggctg ctgccatgat ggaatatgat 3840 tttgatacca ggtatagaag aggtgaagag aatcctgcgg acgctttatc gcgtttagta 3900 tgattttaat caaaaaaaaa aaaaaagatc acaggaggtt gggtgggtag tgacaaaaaa 3960 gaataaaaaa aaaaaaaaaa aagtggtaaa agaataaaac aaaaaaacaa aaaaaaagag 4020 agtatgacga tagtgagaca aaaaaaaata tattagaatt taattatcag agggtgggag 4080 gtcggcatgt aacaacaaca aaaaaaaaaa atttattatt attattattt gtattattat 4140 cgcgtgtatc tacaatatca agaagaatta ctactatgga actatcgtta tatattgcaa 4200 tcaacaatta ccttgtctca cgtaagtacc ctgaaggtga tggtcatgac ctaaacccga 4260 aggcgaaacg gcgtataaga gaccaatcaa gcaaatatgt cgcgatcgat ggtatattat 4320 acaagaaaca aactgatggg aaatcctgga gtggagattt tgaacactgg taatatattc 4380 gaagtggtga aacaattgca caacgaagga cattttggtg tgctgaatac ctggagaaga 4440 ttaagaatcc aatacgatgc tcctggattg tatgactttg tgaaagaatt tgttgctgga 4500 tgtgaagcat gccagaaaag agctaaacgt caacgtatca agactgttca agccagccca 4560 atccctaccc caagtgctcc attttttatg gtcggttgtg atgctatagg acctgtggtg 4620 gaatcgaaga agggtaacaa gaacctcttg gtggcggtgg actatttgac tagatggcct 4680 gtggtagctg ctgtgccaga tataactgct gaaacggtgt cttggttttt gttccattgt 4740 gtggtgaaag attttggtgt acctagttac attttgacag atagaggttc gaatttttta 4800 tctgatcatg tagcgttttt cttgaagcgt atgggctgta gacatttgac aacaacggcg 4860 tatcgaccgc agacgaatgg tttgtgtgag aggatgaatc aaacggtggt gcaagcattg 4920 tcaaagatta tagctactag tgaagataat cgggattggg atgagtacct agacgaaacg 4980 atactcgcag tgaggacaat gcccaatgag acgaccaagt ttactccttc gatgctcctt 5040 tttggatatg agatgaggac tccaagtaat tggccggctc caaggcagga ttatgtgcaa 5100 ggtgaagtgg tgctagaggt agaaagtcgc atccgtgtga ttactgggtt gatgaatgaa 5160 ttccgtgccg aggctaaaga gagaagtggt ttgatgcaac aaaggaggaa aaagaaatac 5220 gatgagaagg tattctttcg aagatacgag cttggggata aagtgttgta tagagacaag 5280 atgaaagaca acaagttttc atattgttgg attggcccgt ttatggtggt gcgtgtcaat 
5340 gttcatggaa cttactggtt ggatggtgct ggagggaaga aagtgatggg tgcagtcaat 5400 ggtgatgaat tgaaaccctg gaaggatcat gtaagattga aacctgatgt ggcggttagt 5460 ggtgcgtatg aacagtaccg aaggttttat gaagaacgaa gaacgatcta atgatggtat 5520 gtcgtcttca accctactca aggctggaag gggggcatg 5559 // ID Copia-42_MLP-I repbase; DNA; FNG; 4987 BP. XX AC AECX01001249; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-42_MLP_; KW Copia-42_MLP-LTR; Copia-42_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4987 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001249; Positions 65138 70124. XX CC Positions [2284-2808] - Integrase core CC 'CAGCT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1306..4968 FT /product="Copia-42_MLP-I_1p" FT /translation="MNKQDEKKQYKRRNYQHQANHTEALSSIRGVKIVRPP FT IASAHAMTFLIDFQPDDSTMTQAINVPDVIPVVDISSAFIQVPVPHDLVYP FT VQPLVYPVQHLSVLDLHSERSAQGSEFVDFDASASTAKTDDGIWALNDTGA FT SHHMFNDLNFFEASTIKPLQDSNKRLKLAGGDASLAVHSTGSVKLKAGDGT FT IFTLADCLYVPDLSRNLIAGGALLKKGVVTIINSSDPECFSLVMGQCALFN FT GAFSGNLMLVSLEPVSSSVCHQQPVSQSHSCNLQHQRLGHIDHRYLDEMVK FT HGSLDGLCTGSNSIPSSTSSCPVCIKSKSKKLPFSGSRPRCSSFLQNVHVD FT LSGINRVTGLHNESYYILFTDDFSSFRHIFPLSNKSKESVFNIFLKYIALA FT ERQTGRRVIQFTMDQGSEFVNAIMEEYCENTGIVFHFTASYTPEQNGVSER FT SNQTVTTRARSFMIQSGVPQSFWYEACATAVFIMNRTVTSALPTGKTPFEY FT WYFRKPSVGHLRVFGCQAHVLIRKGVREGKFTARTTEGVLIGFQDDNFNYR FT IFDLETRKIVLSHNVTFSESTFPYLSTSTTPSSTSSPLCFDPDVAPPLNVE FT SRNRFQVLTNLNEDSEEDDAFPLPQSTQTMSDPKEGLPSTVPLITPTPAVP FT SRRSDRTKAPVKYTGMGGHAYLENCTFVSAFSIFSEVFSASTIDVPVPKSY FT KFAVNGPEAESWKMACQKEFDSLVNKEVWELVEPPTDKNILRGFWLFRKKF FT HVNGTVSKYKARFVVMGNMQQEGIDFHKTFSPTGKPSSLRLIIAFAALQGW FT EIHQMDAVTAFLNGNLEEEVYMYQPEGFVIGSKRKVCRLQKSLYGLRQSPK FT VWYDDVVDFLISIGFEQCPLDPCIYTRATSNTFTAVYVHVDDMAITGNDIS FT TFKSQISKKWEMEDLGVATTVVGIQIIRQAPHCYTITQSALSRAVLERFNM FT LDLKTASTPFPAGLKLYRASEDEASAFKLEGLNYRSAVGSLMYLAQCTRPD FT LAYAVGVLSQHLDSPARKHWDAVIHVFRYLLGTINLGISYDGEQSSSTAIS FT GLKSFNFPLSHCDADWAGDKESRRSTTGYIFQLAGGPISWKSRLQPTVALS FT STEVEYRAITEAGQELVWLRGMLARFGFEDPNPTVLRSDNMGAIHLTTKSI FT FHARTKHIEIQYHWIREVVKQGALIVKHCPTNDMIADLLTKPLGRQQFYYL FT RQKLGLKHISP" XX SQ Sequence 4987 BP; 1368 A; 1089 C; 1034 G; 1496 T; 0 other; tggtagcggg agtctcgata ttaatcgact ctatacacga tccgactatc tttgtcatta 60 acacttcatg caatcggaag acaaatcaat aaatccaacg gacactgaat ctgtttcatc 120 tgatacttcc tctgacgcct ctgttaaatc caataagact atcaaacata cttctggttt 180 aaaacccctc gttctaccct ctaatctacc gttatctcca tctaaatctt ataaatcatc 240 ttcggcaatc atgtctttag acaactccac caatgtcgaa tcctgaaccc tcctacaaat 300 gtttattcaa cgaacaacaa atctacttag caagttcaat atcaagaatg atttaaacga 360 tgaaaattac aattattggc atcttgtcat tcatgaatca attgaatctt taggttatga 420 atcatactta gacattaagg 
accacgtaga tacatccctt tcggaggaga aacataagaa 480 aattaggttt ctacttacga cttggatact taataaatgt gatggtgtca acggtgaacg 540 agcacgtgat agtttaactg tcagagatcc tgttactaag gatatgtcta tcacttacga 600 tccttatatg ttatggaaat ttcttaaaga gtatcattcc agtatctcgg aggccaagtt 660 gaagaatgta gaatcagctt tattaaacat gaaacaactc agatcagata acatgaaggt 720 tcacattgat aagttctcag ctttacttcg tgattattgg aagttcaaag gagatatgtc 780 cgatggacaa gtcgctggta ccttaattaa gagtctgaaa cccggatatg agatcactgt 840 gaatatcatt tatcgagtta tccagcctct tacttttgaa aaggtgaaga atgaattgtt 900 aatggctgag gaggaacagg aatatgtaac tccttcttta actcagtcca gtcatcttgt 960 atccaactat cacgccaatc ccagtcaact agttaagtgt acagaggata gatgcaccgc 1020 caacatagat gcctcggatt gaaaaatccg aggcatcatc agatggaaac cctcggacag 1080 gtcttgtccg aggatttgat tcgaggtatt ttttgacgca aaataatacc tcggatcaaa 1140 acctcggaaa gattttgtcc gaggatttga gtccgatgat aactcggatt tttcaatccg 1200 aggcatcttt gttggctgtg atgtgttggt aatacttatt ctaatcccca caagcccgaa 1260 caatgtttca agaagccagc taattttgct aaaagagatg aatggatgaa caaacaagac 1320 gagaagaaac aatacaaacg aagaaactat cagcaccaag ctaaccacac tgaagctttg 1380 agttctattc gaggagtgaa gattgttaga cctcctattg catctgctca tgcaatgact 1440 tttctcattg atttccaacc cgatgattcc actatgactc aagctattaa tgttccagat 1500 gttatcccgg tggtagacat atcttcagcc tttatacaag ttcctgttcc acatgattta 1560 gtctatcccg ttcaaccttt ggtctatcct gttcaacatc tttcggtcct cgatcttcat 1620 tcagaacgat ctgctcaagg atccgaattt gtggacttcg atgcctccgc ttctacagct 1680 aaaaccgatg atggtatctg ggccttgaat gacacaggag cctcgcatca catgttcaat 1740 gatttaaatt tctttgaagc ttcgactatt aaacctctac aagattctaa caaacgactc 1800 aaattggcgg gtggtgatgc ttcactggct gttcactcga ctgggtccgt aaaactcaaa 1860 gcgggtgatg gaactatatt caccttagca gattgcttat atgttcctga tttaagtcgt 1920 aatctcattg ctggaggtgc cttgttgaaa aaaggagtag tcactatcat taattcttct 1980 gatcccgaat gcttcagtct agtgatgggt caatgtgcgc tttttaatgg cgcgttttct 2040 ggaaatctca tgctggtctc acttgaacct gtgagttctt ctgtctgtca tcaacaaccg 2100 gtttctcaat ctcactcgtg taatttgcaa 
catcaacgac taggtcacat agatcacagg 2160 tacctggatg aaatggtgaa acacgggagt ctggatgggt tgtgtactgg atctaattct 2220 attccaagtt ccacttcatc ttgtcctgtt tgtatcaaat ctaaatccaa gaagcttccg 2280 ttttctggtt cacgaccccg ctgttcttca tttcttcaga atgttcacgt agatttaagc 2340 ggtataaatc gtgttactgg actacacaac gagtcttatt atatcttgtt taccgacgat 2400 ttttcttcct tccgtcacat ttttccattg agcaataaat caaaagagtc tgtgtttaat 2460 atttttctca aatacattgc gttggctgaa agacaaaccg gtcgtcgtgt cattcaattc 2520 actatggacc aaggcagtga gttcgtcaat gctataatgg aagagtactg tgagaatacg 2580 gggattgtat tccatttcac tgctagctat acgccggaac agaacggagt ttcggaaaga 2640 agcaatcaaa ctgtgacgac tagagctagg tctttcatga tacaatcagg tgttcctcaa 2700 agtttttggt acgaggcatg cgctacggca gtttttatta tgaatcgtac agtcacttca 2760 gctttaccaa ctggtaaaac gccttttgaa tactggtact ttcgaaaacc tagtgtcggt 2820 catctgaggg tatttggatg tcaggctcat gtactgatca ggaagggggt cagggaagga 2880 aagttcactg cgcgaactac agaaggtgtt ttgatcggat ttcaagatga caattttaac 2940 tatcggatct ttgatttgga aactcgtaag attgttctta gtcacaacgt aacgttctca 3000 gagtcgactt ttccttatct ttcaacgtcg actactccga gttctacttc ttcaccgctg 3060 tgtttcgacc cagatgtagc accaccatta aatgttgaat cacggaatcg gtttcaagtt 3120 ttaacaaatc tgaacgagga cagtgaggag gatgatgcat tccctttacc tcagtcaaca 3180 caaactatgt cagatcccaa ggagggtctg ccttctactg tacctctcat aactcctact 3240 cctgcggttc catctcgccg ttcggatcga acaaaagcac ctgtaaaata cactggcatg 3300 ggtggtcatg catatcttga aaactgcact ttcgtatcgg ctttttcaat tttttcagaa 3360 gtcttctcgg catccaccat tgatgttccg gttcctaagt cgtacaaatt cgctgtcaac 3420 ggtcctgaag cggaatcttg gaagatggca tgtcagaaag aattcgattc cttagttaat 3480 aaggaagtat gggaacttgt tgagccccct actgacaaga atatcttgcg tggcttctgg 3540 ttattccgaa agaagtttca cgtcaatggt acggtgtcca agtacaaagc tcgttttgtt 3600 gtcatgggaa atatgcaaca ggaaggtatt gactttcaca agacattctc acctaccggg 3660 aagccttcat cactacgact tatcattgct ttcgctgcat tacaaggctg ggagatccac 3720 cagatggatg ctgttaccgc attcctcaat ggaaatctgg aagaagaagt ttatatgtac 3780 cagccagagg gctttgtcat tggttcaaaa cgtaaagttt 
gtcgcttgca aaaatcttta 3840 tacggcttac gtcaatctcc taaggtttgg tatgatgatg tcgtcgactt cttgatcagt 3900 attggttttg agcagtgtcc acttgatcca tgcatctata cacgagctac gtccaacacc 3960 ttcacggctg tctatgttca cgtcgatgac atggctatta ccggaaatga catctccaca 4020 ttcaagtctc aaattagtaa gaaatgggag atggaggatt tgggcgttgc tacaaccgtc 4080 gttggtattc agattatacg acaggcccct cattgctaca cgatcactca gtctgcgctt 4140 tctcgagctg ttcttgagcg attcaatatg ctggatttga aaacagcgtc cactcctttt 4200 cctgctggac ttaagctcta tcgtgctagt gaggacgaag cttccgcttt caaacttgaa 4260 gggctcaact accgaagcgc tgtgggttcc ttgatgtacc tggcccagtg tacacggccg 4320 gatttggctt acgctgtagg cgttctttct caacacctgg attcacctgc acggaaacac 4380 tgggatgctg tgattcacgt gtttcgttac ctccttggca caatcaactt ggggatttcc 4440 tatgatggtg aacaatcttc ttcgacagcg atcagtggtc tcaagagttt caattttcct 4500 ctttctcact gtgacgctga ttgggctggt gacaaggagt ctcgcaggtc tactacgggc 4560 tacatttttc agctggccgg agggcctatc tcttggaagt cgaggcttca gcctacggtg 4620 gccttatcat ccactgaagt ggagtatcga gctatcaccg aggcaggaca agagctggta 4680 tggttacgag gaatgcttgc tcgctttggc tttgaagacc ctaatcctac ggtattgaga 4740 agtgacaaca tgggcgcaat tcatttaaca accaaatcaa tctttcacgc gcgcacgaaa 4800 catattgaga ttcagtatca ttggatcagg gaagtggtta aacagggcgc tcttattgtc 4860 aaacattgtc ccactaatga catgatagca gatctcctca cgaaaccatt gggacgccaa 4920 caattctatt atttgcgtca aaagctgggg ctcaagcaca tttctccctg aagatcttga 4980 gggggcg 4987 // ID Gypsy-6_LBS-I repbase; DNA; FNG; 7202 BP. XX AC ABFE01000274; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_LBS_; KW Gypsy-6_LBS-LTR; Gypsy-6_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-7202 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000274; Positions 10200 2999. XX CC Positions [5803-6294] - Integrase core CC 'CAGC' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 2826..3854 FT /product="Gypsy-6_LBS-I_4p" FT /translation="MRTDAYPKPGIQVDQDMSMPTLGRALHSLMGTYSFEP FT SSFISELEPPETYDFILQKIAHCSPRTSRPLLTPWTKRALVYNSNPLQMRT FT KYKTVDKKVRPVPSYMPDPAGQTFLPVLIPPLPILLLNPPLLTQFCPSKRL FT TEDRLQKILMAVPKDFLQPREIDLFVYVLQKRELALAFEDSERGTFSDKYF FT PDYEIPVIEHIPWVQAPIRVPKAIEETVRQMLLEQKAAGKYEYSTASYCSR FT IFAVGKPKGGIRLVTDVQELNKVTVHDAGLPPRTKFCQTRHIWPGRFVFRL FT RWPKVRCRLQTSHNFQFSHWSSPFLCAPSRGNEFTARVPALYYAHFTRRDP FT " FT CDS 3625..7116 FT /product="Gypsy-6_LBS-I_3p" FT /translation="MMQAYLRGLSFVRHVIYGLADLFSGYDGRKLGVASRP FT LTTFNSLIGPHRSCVLPQGATNSLPEFQRCTTHTLQEEIPKYGGVFVDDVG FT LKGPTSNYDNEEVAPGIRCFVFEYATILDRFLARFIEAGITASSKKLVLAT FT PRLHIVGTIVSKDGWHLEHRLVTKILNWGPLTSVTDVRSFLGTAGVGRKWI FT KGFSLIAKPLMQLTRIAVQREFHFSSEAESAQKELKRLISTAPVLVKLDYD FT AVKLMTRLDPLRRTSEHGLVIVGIDSCQNGTGWILFQMVEKEKHPVIFGSC FT TFNDTESRYSQAKLELYGVFRAIKDLRHRVWGIHFRIEIDAKFLIEMVKQP FT DLPNAPMTRWISYIVLFDYVMNHVPVTAHAGVDGLSRRRHTPEDTDDEDAE FT EYLDKFLGSSSYHVNSVSSLTNFLSTGSLNAYRSTRLDNNFFKDLLLSMQR FT TPQTPFASFRTTSIAEDLSFPQVVDPTPTLAAELQHLKKSNFNHGKKDLSN FT ASLIKRSLLTITDNFSYTGREFEHRRVSVSEVVECELAGEVFTLEVQKYPR FT AFMSSLKKGASQPTMTDQSALPGIPDLSLRTDNRFNYEDVDPLENVTCATH FT SYGAKDQDSPEMWMEIIAYLKTDTLPERCEDLKLRKSFICRSKGFFLHDED FT RLWKIDPQGKPPRLVVIDVERRSSLISEAHNNVGHRGRDATYKTLSERYFW FT PNMYDEIAYFIRSCNVCQLRSKARPKVAFSPTWNSGILRRFDLDTIHMPDG FT FGGMKYLLQATDPAMSWVEARTARKPNSETWAKFLYKEVYSRFGCVLLCLV FT DGGSEFKGAVDILFKQYGIVVIVSSPYHPEGNGHSERSHQTLVNSILCACG FT KDTHRWPLYVHARLWAMRCSTSRVTGYPPYFLLYGQRPFFAFDIADRTWDT FT LDWHTVASTEDLLAMRMQQILRWDKKLVLAMEQQKRVRQRAVDDFNSKHEH FT HLSSGNFLLGTWVLLHETWLDTQMGNKGALRWTGPYIIHRKVQDTTYQLRE FT LDGTVMRGSVAANRLKVFYYREEHQTVRTVEPAEYALHVAANTSSSSHASI FT 
VIGTLNQSLLTTPAFPVSIKVGIAVLPENRSLSCLPAFAPSAFTSQHLHHG FT YYPNIAELDPMDNNPVQYVRYTASSSVFQGHIHETLLEDSNIRDLESLALD FT ALPLR" FT CDS join(691..1806,1810..2826) FT /product="Gypsy-6_LBS-I_1p" FT /translation="MDKYYRDFTALSSDLTPSKMLENDVNLCFYRGIPTSL FT RTRIKKRIPAANLKTSSPPTTEFLLRLLRAEFDEEDLDAKTAYVGLSLDSD FT SDSSSSDSDEDIDKVLITKKKKKPSKKVAFEKTVPAVPIVEPVGFSPVDRL FT TKQMEDLRLAHAEFLRSINITPNPNLTNQQILREARCFFCDKTTHRLGLKI FT CPEVEACIKEGLVAYTPLGRLARSDRSELPRVFGSDGGVAKVLREQHTMSS FT NLKGKAREASRDLPPHMANYAGLLFDGEDVLSSEVFNASPSSAVPEWRAPL FT SSTLAVTCSQKDKETHFDPIKRPEKRQTKAKSFPKSKESQNKTPVPTLENL FT RLANVPVASTPQAFNLRPPKVNTKDAFKNRATPSKTKDVEMKDGETKAKST FT PAYHFTSDIQEMYDLDKIVHEKVNKTIVQLELGELLAISAFMQKSISNMTK FT TRREYKSKPVVVNVVEVLEEAEWDEENAMLTELAGGYDSDDEDFYNTLPTA FT ESFSNAGFIESRVGLEFDESRESKENILIRYASAVKIHLSPQPLFAMVTGR FT FKGKFAGIDAVFMIDTGSELNLIAQEFYDRTNLAIDLDGTRWSLKGINRRP FT VPLGGCVRDTEIKISGRRFDHHIFVSREGTGKQEIILGQPWLQWYSASIQY FT TRQGAMNMRIWQDGDGDKTDGSPQGPSILIPLCTPGTPRNAAMLNLDHRTR FT IEEVDDVDAGK" XX SQ Sequence 7202 BP; 1930 A; 1685 C; 1603 G; 1984 T; 0 other; gtggtgactg agacaggggt tctggttttc cgttctgcac ctttggctat agtcgagaaa 60 atattggtac atcctcataa tttgaaaagc gtgctgatta ctaggttttc aggcgcatat 120 atcctctttt ccttgccaca acacgaccaa aacccctcca ctttcgaaac tttgaaatct 180 ctcgaagtag aagaaccata catttggctg catatacgct aagcgttcct ctttgatctt 240 ttcttttgct tcgattttct ttctttttct cttattttga tgagtcctgt ggacaaaata 300 acgtctatgg attctatggc tttgtccgtg tccgaaatga gtcttaacgc tactctaggc 360 gttcctatgc ctatgcctgg aacgccaggc gctcccaaat tcaaagggaa aaatgtgttg 420 gattttttgg attctctcga gcaacacgct gatagtgcta gagtttcgca tttgcttctt 480 cccggatatg ttttaagata ttgtcatatg aaagttcgga tggtcattgg ggcgtctaaa 540 cttttggctg gagatgactg ggttgctacg agagtttatt tgacggatct ttatggatct 600 aacgactcca ttcctcccaa ttctcctgat cacttgtgac actggtgcgc tagtcatgga 660 gagtcaggca ttatcgcgtc ccgcaaagac atggataaat attatcggga tttcaccgcg 720 ttatcttcag atttgacacc cagtaaaatg ctagaaaacg acgttaacct ttgtttttac 780 agagggattc cgacgtctct acgtactcga atcaaaaagc gcataccggc 
tgcaaacctc 840 aagacttctt cgcctcccac taccgaattt cttcttcgtc ttttgcgagc cgagtttgat 900 gaggaagatc tggacgcaaa aaccgcttac gttggcctta gtctagattc agattctgac 960 tcgagctcga gtgattcaga tgaagatatc gacaaggtgc ttatcacgaa gaaaaagaag 1020 aaaccctcga aaaaggtagc tttcgaaaaa acggtcccag ctgtgccgat agtcgaacca 1080 gttggattca gccctgtaga tcgacttacc aagcaaatgg aggatttgag gttggcgcac 1140 gctgagtttc tacgttccat taatattact ccgaacccaa atctaaccaa tcaacaaatc 1200 ctgagagaag ctaggtgttt cttttgcgac aagacaacac atcgtctcgg tttgaaaatt 1260 tgtcccgaag tcgaggcttg tattaaagaa ggattggtgg cttatactcc cctcggtaga 1320 cttgcgcggt ccgacagatc tgagcttcct cgagtttttg gaagtgacgg cggagttgcg 1380 aaagttttgc gggaacaaca cactatgtcc agtaatttaa aaggcaaggc cagggaggcc 1440 tctagagacc tgcctcccca tatggccaac tatgccggtc tattatttga tggtgaagac 1500 gtgctttctt cggaagtatt taacgcttcg ccttcttctg cggttcctga atggcgagca 1560 cctctttcat caacactggc tgttacttgt tctcaaaagg acaaagaaac tcattttgac 1620 cctatcaagc gtcctgaaaa acgtcaaacc aaagctaagt cttttccaaa gtctaaggag 1680 agtcaaaata aaactcctgt acctacccta gaaaacctta ggctggcgaa cgttcccgtg 1740 gcttcaacgc ctcaagcttt taatttacga ccccccaaag tcaataccaa ggacgctttc 1800 aagaattgaa gagcgacgcc atcaaaaacc aaagatgtgg agatgaagga tggggaaact 1860 aaagctaagt ctactccagc ttatcatttc acgtctgaca ttcaagagat gtacgatctg 1920 gataaaatag tgcatgaaaa ggtcaacaag acgatcgtgc aactcgaact aggggaactt 1980 ctcgcgattt cagcgttcat gcagaagtca atcagtaata tgacgaagac gcgaagagaa 2040 tacaagtcca aaccagtggt agtgaatgtt gtagaggttc tagaagaggc ggaatgggat 2100 gaagaaaatg cgatgctgac ggagctcgct ggtggatatg actcggacga tgaggacttt 2160 tacaataccc ttcctacggc tgaatctttt tcgaatgcag gttttatcga atcaagagta 2220 ggtcttgagt tcgacgagtc cagggaatcg aaggaaaaca tcctgatccg atatgcgtcc 2280 gctgtcaaga ttcatttgtc gcctcagccg ctctttgcga tggtaacagg acgtttcaag 2340 ggcaaattcg ccggaataga tgcggttttt atgattgaca caggatccga actaaatttg 2400 atagcacagg aattctatga caggacaaat ttagctatcg acctcgacgg aacacgatgg 2460 tccctcaaag ggataaacag acgaccagtg cctttaggcg gatgtgtacg cgacacggaa 
2520 atcaagattt caggacgacg tttcgaccat cacatcttcg tcagtaggga aggcacaggc 2580 aaacaagaaa taatactagg acagccttgg ctccagtggt actctgcgtc aattcaatac 2640 actcgtcaag gcgcaatgaa catgcgcatt tggcaagacg gcgatggtga caaaactgat 2700 ggttcaccgc agggtccttc cattctaatc ccactttgta cccctggcac acctcgtaat 2760 gcagccatgc tcaacctcga ccatcgcaca agaatagaag aagtagacga cgtcgacgcg 2820 ggaaaatgag gacggatgcg tatcccaagc ctggaattca agttgaccag gacatgtcca 2880 tgccgactct tgggcgggcg ctgcattctt taatggggac ttatagtttt gaaccttctt 2940 cttttatttc tgagttagaa cccccagaga cttatgattt cattttgcag aaaattgcgc 3000 attgctctcc tcgaacttct cgccctttat tgaccccttg gaccaagcgc gctctcgttt 3060 acaattcaaa tcctcttcag atgaggacaa agtataagac agttgacaag aaagtccgac 3120 ctgtccctag ttacatgcca gatccagcag gtcaaacatt tcttcctgta ttaattcctc 3180 cgctccctat tcttctgtta aatcctccgc ttctcacaca attttgtcct tcaaagcgtc 3240 tcacagaaga ccgtcttcag aagatcctga tggcagttcc caaggacttc ttgcaaccta 3300 gagaaatcga cctttttgta tacgtactgc agaagcggga acttgcctta gcttttgaag 3360 actctgaacg cggtactttt tctgacaaat attttccaga ttatgaaatt cctgttatcg 3420 aacacatccc ttgggtgcaa gcacctattc gcgtgccgaa agctatagaa gaaacagttc 3480 gacagatgtt gctagagcaa aaagcagccg ggaaatatga gtattctaca gcatcttatt 3540 gctcacgcat tttcgctgta ggaaagccaa aaggggggat tcgcctcgtc acagacgtac 3600 aagaacttaa caaagtaacg gtgcatgatg caggcttacc tccgaggact aagttttgtc 3660 agacacgtca tatatggcct ggcagatttg ttttcaggtt acgatggccg aaagttaggt 3720 gtcgcctcca gacctctcac aactttcaat tctctcattg gtcctcaccg ttcttgtgtg 3780 ctccctcaag gggcaacgaa ttcactgccc gagttccagc gttgtactac gcacacttta 3840 caagaagaga tccctaaata cggcggagtt ttcgttgacg atgtcggatt aaagggaccg 3900 acttcaaatt acgacaatga agaagtcgcg ccaggtatca gatgtttcgt ttttgaatac 3960 gcaactattt tagatcgctt cctggcacgc tttattgaag cagggataac tgcctccagc 4020 aagaagctcg ttcttgccac gcctcgtctg catatcgtcg ggactatcgt ttcaaaggat 4080 ggatggcact tagaacacag attggtaact aaaattttaa actggggacc tctaacgagt 4140 gttacagatg tcagatcgtt tcttgggact gcaggggtag gccgaaaatg gataaaggga 4200 
ttctctctta tcgcaaaacc tctgatgcag ctgacacgta tagctgtcca acgagaattc 4260 catttttcct cggaggctga gtccgcgcaa aaagaactca aacgtcttat ctccacggct 4320 ccagttctgg taaaacttga ctatgatgcc gtgaagctga tgacgcgttt agatcctttg 4380 cgtcgtactt cagaacatgg actggtcatt gtgggcattg attcgtgtca aaacggcact 4440 gggtggattt tatttcaaat ggttgagaaa gaaaagcatc cagtgatttt tggatcttgc 4500 acgttcaatg atactgaatc aaggtactct caagcaaagc tagaactcta cggagtgttc 4560 cgagcaatta aagaccttcg acatcgtgta tgggggattc attttaggat cgaaatcgac 4620 gcaaaattcc tgatcgaaat ggtgaagcaa cccgatctcc ctaacgcgcc aatgacgaga 4680 tggatatctt atattgtgtt attcgactac gtaatgaatc atgtacctgt gacggcacat 4740 gcgggtgtag atgggctatc tcggagaaga catactcccg aggatacaga tgacgaagac 4800 gcagaagagt atttggataa atttttgggc tcttcctcgt atcacgttaa ttctgtctcc 4860 tcattgacga actttttgtc gacaggatct ttgaacgcct accgctctac gcgcttggac 4920 aataactttt tcaaggatct tttattatcg atgcaacgca caccacaaac tccttttgcc 4980 tcttttcgaa ccacttcaat tgctgaagac ctttctttcc cacaggtggt agacccaacg 5040 ccgactttag cagcagagct acaacacttg aaaaagtcga acttcaatca tggaaagaaa 5100 gacttgagca acgcttcgct cataaagcgt tctcttctca cgatcactga taatttttct 5160 tatacaggaa gagaattcga gcacagaaga gtgtccgtgt cagaggtagt ggagtgtgag 5220 ctcgcaggcg aagtttttac tttggaggtg cagaaatacc ctcgagcttt tatgtcgtct 5280 ttaaagaaag gtgcgtctca acctacgatg actgatcaat ctgctcttcc aggaatcccg 5340 gacctgtctc tacgcactga caacaggttc aattacgaag acgtcgaccc tctagaaaat 5400 gtcacatgcg ccactcactc ctatggggcg aaggatcaag attctccgga aatgtggatg 5460 gagatcatag catatctgaa aaccgacacc ttacctgaac gttgtgaaga cttgaagcta 5520 cgaaaatcgt tcatttgtcg gtccaaggga ttctttttgc acgacgaaga tcgactttgg 5580 aaaatcgacc ctcagggaaa gccacctcgt cttgttgtga tagacgtcga acgtcgttcc 5640 tcacttatat cagaagccca taacaatgta ggtcatcgag gacgtgacgc tacttacaaa 5700 actctttcag aacggtattt ttggccgaac atgtacgatg aaatagctta cttcatcagg 5760 tcatgcaatg tctgccaact tcgctctaag gctcgtccaa aagtagcatt tagtcctact 5820 tggaactcag ggatcttgcg acgcttcgac ctagacacca ttcacatgcc agatggattt 5880 ggtgggatga 
aatatctgct tcaagcgacc gatccggcga tgtcatgggt agaagcgcgc 5940 accgcacgaa aaccaaactc ggaaacttgg gcgaagtttc tctacaagga agtatactca 6000 cgtttcggtt gtgttctcct ttgtctagtt gatggaggat cagagtttaa aggcgcagtt 6060 gatattcttt tcaaacagta tggaatcgtt gttatagttt cgtcgcccta ccatcctgaa 6120 ggaaacggac actcggaacg ctcgcaccaa accctagtca actcaatcct ttgtgcttgt 6180 ggaaaggaca cacatcgttg gccgctttac gtgcatgcca gactctgggc gatgcgatgc 6240 tctacttcta gagtgactgg gtatcctcct tatttcctac tttatggtca acgtcctttc 6300 tttgcatttg atattgcaga tagaacttgg gatacgctcg actggcatac cgtggcatct 6360 actgaagacc ttcttgcaat gcgtatgcag caaattcttc gttgggacaa gaagctcgtg 6420 ctagctatgg aacaacagaa gcgagtgcgt caacgggctg ttgatgactt taacagtaag 6480 cacgaacatc atctgtcatc tgggaatttt cttcttggga cttgggtatt attgcatgag 6540 acgtggttgg acactcagat gggaaacaag ggagcactta gatggacggg accctatatc 6600 attcacagga aagttcaaga cacgacttat cagttacgag agcttgacgg aactgttatg 6660 cgtgggtcgg ttgcggctaa tcgtttaaag gttttctatt accgagaaga acatcagacg 6720 gtcagaacag tagaacccgc ggaatatgct cttcacgtcg ccgcaaatac atcttcgtct 6780 tcacacgctt ccatcgtcat tgggactctc aaccagtcgt tgctcaccac tcctgctttc 6840 ccagtttcta tcaaagtcgg gatcgccgtt cttcctgaaa atcgttctct ttcgtgcctt 6900 ccagcttttg caccgtcggc tttcacgtct cagcaccttc accatggcta ttatcccaat 6960 atcgcagagc tcgatcccat ggacaacaac cccgttcaat acgttcgcta caccgcttca 7020 tcaagtgttt ttcaaggtca tattcatgaa actctccttg aagattcaaa cattcgtgat 7080 ctcgagtctt tggctctcga cgctctccct cttcgctaag tcatttgttt gcctttttat 7140 ttcccctttt tttgtatgca caacgaaacg atgggggcat cgtttttaaa acttttcttt 7200 at 7202 // ID Gypsy-102_MLP-LTR repbase; DNA; FNG; 170 BP. XX AC AECX01000530; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-102_MLP_; KW Gypsy-102_MLP-I; Gypsy-102_MLP-LTR. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-170 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000530; Positions 187577 187746. XX SQ Sequence 170 BP; 44 A; 49 C; 29 G; 48 T; 0 other; tgttatgagc catagcctta tacttagaca atatgcttgt accagaacca ccgtatactt 60 gggatcctgg tggacagact tctgtctttc ctcacgttgc aatccatatc tactaggaag 120 gggaccttac tacactcttg catctcacca actcacgccc agtcctaaca 170 // ID TSU4-LTR_SB repbase; DNA; FNG; 321 BP. XX AC AJ439550; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 03-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE S. bayanus LTR retrotransposon, LTR sequence. XX KW Copia; LTR Retrotransposon; Transposable Element; copia-type; KW TSU4-LTR_SB. XX OS Saccharomyces bayanus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. XX RN [1] RA Neuveglise C.; RT "Genomic evolution of LTR-retrotransposons in hemiascomycetous RT yeasts."; RL Unpublished (2002). XX RN [2] RP 1-321 RA Gentles A. and Jurka J.; RT "Yeast LTR retrotransposon."; RL Direct Submission to Repbase Update (16-MAY-2005). XX DR GenBank; AJ439550; Positions 1 321. XX CC Internal sequence deposited as TSU4-I_SB. XX SQ Sequence 321 BP; 118 A; 50 C; 41 G; 112 T; 0 other; tgttggaata aagtgatctt ggacataaca ttcttatgtc aaaacaggcg atctcaacat 60 tcctaatatg atattaatta tgttgttctg aaacaaacat acttaatgaa gacgagttta 120 aagacctatt aattgatcaa gaattagtat ataagaagat gatttatacc tgagaaccta 180 ttatcaatgt tctgaattat acttgaacca ttgggtttca tataaccaat cagcgtgcgt 240 tttatatacc tctcttatat aataagaaag aactgcttat tcttaattat tactacctac 300 taaacttact aattatcaac a 321 // ID MAGGY_LTR repbase; DNA; FNG; 253 BP. XX AC L35053; XX DT 30-MAY-2000 (Rel. 5.04, Created) DT 19-SEP-2005 (Rel. 
10.1, Last updated, Version 2) XX DE MAGGY_LTR is a long terminal repeat from MAGGY, a gypsy-like LTR DE retrotransposon. XX KW Gypsy; LTR Retrotransposon; Transposable Element; MAGGY_I; KW MAGGY_LTR; endonuclease; gag gene; homologue; pol gene; protease; KW reverse transcriptase; FOSBURY. XX NM MAGGY_LTR. XX OS Magnaporthe grisea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-253 RA Farman L.M., Tosa Y., Nitta N. and Leong A.S.; RT "MAGGY, a retrotransposon in the genome of the rice blast fungus, RT Magnaporthe grisea."; RL Unpublished (1994). XX DR GenBank; L35053; Positions 1 253. XX SQ Sequence 253 BP; 51 A; 87 C; 59 G; 56 T; 0 other; tgtcacagac ctgaaggaca gccactgggc tgtcgcttag tcatgacccc tgtcacgtga 60 aggcgcgagg gccgagcgcc tcgcgcgcgt atatctacgc gaaatctctg cagtctccga 120 tatgtagctc ctcgagctac gtcttcgtca acaaccacct gctaccctgt acctggaaga 180 ctagatagcc aatatactct gtcctgttac ctgatcgccc gcacctacct gtccgccctg 240 cctgcccgtg aca 253 // ID Mariner-4B_AF repbase; DNA; FNG; 2019 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 08-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A subfamily of nonautonomous Mariner DNA transposons - a DE consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW mariner; Interspersed repeat; Mariner-4_AF; Mariner-4B_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-2019 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-2019 RA Kapitonov V.V. 
and Jurka J.; RT "Mariner-4_AF, a family of Mariner DNA transposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 100-100 (2006). XX DR [2] (Consensus) XX CC It is a nonautonomous subfamily of Mariner-4_AF from the Mariner CC superfamily (Tc1 clade). The Mariner-4B_AF and Mariner-4_AF CC consensus sequences are 84% identical to each other. The 16% CC divergence is due to 327 mismatches (all are transitions) induced CC by RIP. The transposase coding region of Mariner-4_AF was CC completely destroyed by RIP in Mariner-4B_AF. XX SQ Sequence 2019 BP; 749 A; 307 C; 307 G; 656 T; 0 other; tctgtgacta aggctccccc tcactgctct gcattgaggg tgagccacag tgaaactact 60 aaccacacta aatccactat tttttgatat tttgactatt taaagctatt ctaactacct 120 aatatgccta aatcttctaa aattaataaa tcttacctcc ttaaggcctg caaggctgct 180 caggcctaaa aaaagctaaa tatctctaag attatatata aatatggtat tccttataca 240 accctatata attatattaa gaagtatata catcctcaac tagctaataa actagttaat 300 aaagccccta agggatacta ggaggaggcc ttaatctagt agatagtcta tatatataat 360 tagaatatac tagtaacgcc taagctacta gaagagtatg taaattaggc actttaatat 420 acaggggaat ctaggtaggt tagtaagata taagcatatt actttaagaa ataacttcta 480 gaatacctta atctaggcct agtgaagtaa aagataaaga aatctaagta tatccaggct 540 aaggatgcag gtttactaat atattagtat aattagcttg caggggtggt taaaaaggat 600 atactagtat agttagtata taactttaat aaatatagct ttcagcctgg cgaaggcaaa 660 tctaggaagg taattagttt aaaaggttta aaagtaccta atcttactaa atctaaaaga 720 ggagagaata ttatagctat taaatatata gctgtagata gatagcagat agatctctag 780 tttatcttta aaggtaagtt cctaataatt cttccctttc tttaaatact aatctatact 840 tacttcctaa aggtaacaga atctttatgg aatcttagtt taataagagt aaggccttat 900 tactagatat tataatagca atatctccta atagctagat atcagatgaa ctagctattt 960 aatagcttta aagctttatt aatataacaa acaagtgtat aaagaagggg gagaaatgga 1020 tacttatatt taatagctat agctcttatc ttactattaa attcttataa ctttataaag 1080 ataatagtat tattcccttt agattccttc cttatataat atatctttgc tagcctttag 1140 atagtaagct attcttaagc tataagtaat acttctatta tataaataat 
aagctatctt 1200 actagactag taagcctata gggaagttag aattcttata tataattaga ccagtacaag 1260 agaaggcttt taactaaagg attatctata aggcctttaa agattgtaga atctagctag 1320 ttaatagtaa aatagctaat aatcttacta tcttactata ggaaggaatt ctagatatct 1380 atacacctaa tcttaataag ataacttcct ctatactacc ctctcagctg ctatcttagc 1440 cactatcttt atctagtatt aatatcttac ctctgtggat aatttaggcc cttaagaaga 1500 actagacaaa gctatctaag tatatagatc tacttatacc aaagctgcag caaaatctta 1560 aatagatatt taaatataac taaatcacta ctaagcacct agctatagta aataaaacta 1620 ttagttaaat aagggccgcg caggcctctc tatagtatta atatactaag cagtaggtta 1680 agctacttag ctagtctagt atattaacat tatataatac aaattaatta attactttaa 1740 ggaaggctaa agatactgct atgtaagaga gatatttata aatataatag gagaaagtac 1800 atagtaagcc cccaccacta gcatctatat aagagaatat agtattaaat agattagcaa 1860 aggcagtaga tgagaatagt aatttttttt atatagataa cccctagtat ggcatcaaaa 1920 tcatgatttc tatcgaaaaa tggtggattc agtgtggttg gtggtttcac tatggctcac 1980 cctcaacgca gagcggtgag ggggagcctt agtcacaga 2019 // ID Gypsy-1_CBW-I repbase; DNA; FNG; 4782 BP. XX AC CP000289; XX DT 15-JAN-2011 (Rel. 16.02, Created) DT 15-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Cryptococcus bacillisporus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_CBW_; KW Gypsy-1_CBW-LTR; Gypsy-1_CBW-I. XX OS Cryptococcus gattii OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-4782 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Cryptococcus bacillisporus RT genome."; RL Direct Submission to RU (15-JAN-2011). XX DR Genome; CP000289; Positions 710799 715580. XX CC Positions [3580-4068] - Integrase core CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 858..2426 FT /product="Gypsy-1_CBW-I_2p" FT /translation="MGANTLLNKLLRHTEHVEWQLTAFSGILPTRSSQTRP FT AVAAVTNAPSTKIDFSKWYDPAICLPKGLLGRQVRQHLTEKGKCWLCRKEG FT HKSPECPKRNDPAAVNIVKTYDGDQQAYETEEAAGLAEVLALNITESVSPL FT LIKCRFHVDGPSFLTLFDTGASVSLIDPSLVDMNQLRVMTAPRMRQVALAG FT GIVGPQLRQMVEEDIWIGDAKYMCAAFVMPLGKQYQAILGLNFILTHGFLT FT GATFLEQVVPHTTHPFIVSSLATDPHQEELRKALLSKYSAIFPDDIGDVAN FT YPPICSSDSKVRHHINLTPGAVPFRSAGYRSPHMWRRQLAEEIQKHRAAGR FT LRPSSSPWAAPAFLIKKENGKFRFICDYRGLNKVTQKDATPVPNVDDILHR FT AARGSIFAKIDLSDAFFQTLMHEPDIEKTAITTEFGLFEWVVMPQGACNSP FT ATQQRRLNEALRGLIGEVCEAYVDDIIVWAKDAEELNTRLEAVLERLQKAG FT LVCSPTKSEFFCDEVLRSRDFCKPHLS" FT CDS 3265..4782 FT /product="Gypsy-1_CBW-I_1p" FT /translation="MGRWLKEDDLVPGVQKEELTVGNQVETILRWEGRLCV FT PNTTKLREAFIRQCHDPVGHFGVEKTLEMTRRNYFWPGMKDDVNEFVKSCP FT ECQISKSLTVKPAGRLHSLPVPQTKFLDIGIDFIGPLPVSRGYDQLIVITD FT RLTGYVVLIPTNTTMNSSELARLLYDHWLSKFGCPRSIVSDRGSVFQSTLW FT KHLMRHIGAKSQLSTAYHPQTDGISERSNKSIIESLRTITDIRGRMWADNI FT QRIAFALNNHVRASTNRAPAELVFGRRLTHVPPLVAETPDKIAEALKFLVP FT TEEEWQAAARRMELEEGEARDNLLMAKHRQAVQANRHRRADPQFKVGDKVL FT LDTRHIRQEYKATHTRGNSTKFIPRYDGPYTIVQAWPEQSLYELDLPPRTN FT DTARRHVSLLKPYVTSDRFHSRTPFISPQLQSPPRPTVLQVLDVKKRSKPD FT EEVKILIAGKPGAGQWVKRSEASDWEGFADAWETFDGPDELLLDAVEMTEL FT PPRLSSRGGR" XX SQ Sequence 4782 BP; 1253 A; 1330 C; 1119 G; 1080 T; 0 other; ctttttttca gacgtctctt cctggcatgg aacaagacct atccgacttc atcgtcaaac 60 ttaataccag tgaaaaccaa acccgaattc gacaaggaac tcccatgacc tcctcaggag 120 aagaaaacat cgcggcggag ctcgcaagag ccctacaaga gaacgaagca ttgaaagcag 180 aaatgattag gaaggacaag tgggcggaag atctagctcg acaattggag attctgggag 240 gaaacttaga gggggacaaa gcaaaggttg aaaactcaag atccatggga aaaggtacgg 300 aagctaatct cgaagctgaa gttacctcga caagtagtga aaccctatcc cgctacgaag 360 ccctcccgac gtacaaactt aatctcccca ccctcccgcc cacaaaagat cctctaatca 420 tccaaaacca tctcgacaag ctggctgtcc aattcaaagg cctggccaat ggccggaagt 480 atgacccgat cctcctggaa aggcacaaaa ttcacctcgc agcccaaact ctctcggctc 540 cagaacatat cgcttatgcc 
aaaaccgtga aaaatcaact taccttccag gagtgggcag 600 ccgggttcaa agaagcagtc ctcccatatg gttggattac aacggccgag cgaaacatgg 660 ccgccctcgc cccactcgca cagaacctcg ccaccatccc ccgctttgtc gacaaagtcc 720 gaagttatgt cgcccttctg gaggatgctg attccgcctt gcctgatacc tacgtcgctg 780 ccttcgtccg acagaatatg caccccgctg ttcgtgctga catggagaga gatcacactg 840 agaaagagct ttgatcgatg ggcgccaaca ccctcctcaa caagttgtta cgacataccg 900 agcatgtgga gtggcagctt accgctttct ctggcatcct ccccacccga tctagtcaga 960 ctcgtcctgc tgtcgccgcc gtaaccaacg ccccgtccac caaaatcgac ttttccaagt 1020 ggtacgaccc cgcaatctgt ctcccgaaag gtctcctagg tcgccaagtt cgtcagcatc 1080 tcacggaaaa aggaaagtgc tggctctgca ggaaagaagg gcacaagtca cctgagtgtc 1140 ccaagcgaaa tgaccctgcc gccgtaaata tcgtcaagac ctacgatggc gatcagcaag 1200 cttatgagac agaagaagca gcaggacttg ccgaagtact agcgctcaac ataaccgagt 1260 cagtctctcc tcttctcatc aaatgtcggt tccatgttga cggtccgtct ttcctcaccc 1320 tattcgacac cggcgcatcc gtctctctca tagatccctc tcttgttgac atgaatcagt 1380 tgcgggtgat gactgctccg agaatgcggc aagtagcatt agcaggcggt attgtggggc 1440 cacagcttcg tcagatggtt gaagaagata tatggatagg tgatgctaaa tatatgtgtg 1500 ccgcctttgt tatgccgttg ggtaaacaat atcaagcaat ccttggttta aatttcattc 1560 tcacccacgg gtttctcacc ggcgcaacgt ttctcgaaca agttgtgccg cataccactc 1620 acccattcat cgtatcttct ttggcaaccg acccgcatca ggaagaactg cgtaaagcac 1680 tccttagcaa atacagcgca atctttcctg atgacattgg ggatgtagca aattacccac 1740 ccatttgcag ctcggattcg aaagtacgac accacataaa tctcacgccg ggcgctgttc 1800 ccttccgttc agctggttat cggtctccgc acatgtggcg ccgtcaactt gccgaagaga 1860 tccaaaagca ccgagcagcg ggtcgattgc gtccctcaag ttcgccgtgg gcagctccgg 1920 cgttcttgat aaaaaaggaa aatgggaaat tccgttttat ttgcgattat cgcgggctga 1980 ataaggtgac tcaaaaggat gccacgccgg taccgaatgt cgacgacatc cttcatcgag 2040 cagctcgtgg tagcatattc gccaagatcg acttatccga cgccttcttc caaactctca 2100 tgcatgagcc ggatatcgag aaaactgcaa tcaccaccga gttcggcctg tttgagtggg 2160 tggtgatgcc gcaaggcgcc tgtaattcgc ccgccactca acaacgtcgc ttaaatgaag 2220 ccttgcgagg tttaattgga gaggtttgtg 
aggcgtatgt ggatgacatc attgtatggg 2280 cgaaagatgc ggaggagtta aacacccgac tagaagcagt gctagaacga ttacagaaag 2340 cagggctagt ctgctcccca acgaagtcag agttcttctg cgatgaagtt cttaggtcac 2400 gtgatttctg caaaccgcat ctgtcctgat ccagtcaagg tccgcaccat acgagaatgg 2460 cctaagccac agtcccctcg ggagctccga tccttcttgg ggctgttaca gtaccttcgg 2520 aaatttattc caggcattgc ccaacatacc cgtcccctca cagctcttct tcctcctaat 2580 gtagcagcag aaaaagcatg gattgctcat cagcgtgtct tgtctcaagg tcagcatcct 2640 aaaaccgttc ttccgtgggt ttggagatgg acgacagaag cctctgcagc ctttcaaata 2700 ctcaaacaga aggttgcgga cattagtgga ctttgaccat tggattatgc aaccgcactt 2760 tctggtaaag ctccaattta tctctttacc gatgccagca aatacggtac cggcgcttgg 2820 ctcggacaag gaccaactcc agaagatgca tatcccgtgg catacgattc tcgtgggcta 2880 tccccagccg aacagaacta ccctactcac gaaaaagagc tgctcgccgt tgtccgcgcc 2940 ctcaagctct ggcgaccctt acttctcgat gtgcccgtcc atatccaaac cgaccatttc 3000 acgcttaaat ggttcctgca gcagcgcgac ctttccgaac gacaaaaaca ttggctaggg 3060 atcctctctc gctttgacct gcggatcgac catatcgctg gtgtaaacaa cttcatagct 3120 gatgccctgt caagactcgg cggtgcggat cccgaagatg atgggataga gatacaggag 3180 gttagcgtgg cggttctcgg attgcttgga caggatgttg ggctgttgaa gcgagtggcg 3240 caaggataca gcacggatga agtgatggga cgttggttaa aggaagatga tctagttccg 3300 ggcgttcaga aggaagaact tacggtagga aatcaggtgg aaaccatatt acgttgggaa 3360 ggtcgtcttt gtgtcccaaa tacaaccaag ctccgcgaag ctttcattcg tcagtgccat 3420 gatccagtgg ggcatttcgg ggtggagaaa acattggaaa tgaccaggcg aaactatttc 3480 tggccaggga tgaaggacga cgtcaacgaa tttgtcaaat cttgtccaga atgccaaatt 3540 tccaaaagtc ttacagtcaa accagcaggt cgtctccact ctctcccagt ccctcaaaca 3600 aaattcctcg acatcggcat tgattttatc ggtcctctac cagtctcgcg tggctatgat 3660 cagcttatcg tcattacaga tcggctcact ggctatgttg tcctcattcc caccaacacg 3720 acaatgaact cctccgaact tgcccgcctc ctttacgatc attggctctc aaaattcggt 3780 tgccccaggt caatcgtctc agaccgaggt agtgtcttcc aatcgacact ttggaaacat 3840 ctcatgcgcc atataggtgc caaatcccaa ctatccactg cctaccatcc tcaaaccgac 3900 ggaatctctg aacgttccaa taagtcaata attgaatccc 
ttcgtaccat cacagacatc 3960 cgcggccgca tgtgggccga taacatccaa cgcattgcct ttgctctcaa caatcatgtt 4020 cgagcttcca caaatcgcgc accagctgaa ctcgtgtttg ggcgtcgtct tacccatgta 4080 cccccccttg tggctgaaac ccccgacaaa atcgctgaag ccttgaagtt cttggtccca 4140 acagaagaag aatggcaagc ggctgcccgg cgcatggagt tggaagaagg agaagctaga 4200 gataatttgt taatggcgaa acatcgacaa gcggtacaag caaatagaca tcggcgggca 4260 gatcctcaat tcaaagttgg tgataaggtc ctcctcgaca cacgacacat ccgacaggaa 4320 tataaagcca ctcacactcg cggtaactcg accaaattca tcccccgata tgacggcccc 4380 tatactatcg ttcaggcttg gccagaacaa tccctgtatg agctcgacct acctccacgc 4440 accaatgaca ctgcccgccg ccatgtctcc ttactcaagc catatgtcac ctccgaccgg 4500 ttccattctc gtaccccctt catctccccc cagctccaat caccaccccg ccccacagtt 4560 ctacaggtgt tggatgtgaa aaaacgaagc aaaccggatg aagaggtcaa gatattgata 4620 gcggggaagc caggtgccgg ccaatgggtg aaaagaagtg aagccagcga ttgggaagga 4680 tttgcggacg cgtgggaaac atttgatgga ccggacgaat tgttattgga tgcggtggaa 4740 atgacggagt tacccccacg gctttcctca agggggggaa gg 4782 // ID Gypsy-25_MLP-I repbase; DNA; FNG; 5361 BP. XX AC AECX01000928; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_MLP_; KW Gypsy-25_MLP-LTR; Gypsy-25_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5361 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000928; Positions 66703 72063. XX CC Positions [4238-4717] - Integrase core CC 'AGAAG' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(128..3817,3821..5116) FT /product="Gypsy-25_MLP-I_1p" FT /translation="MSPPPDSRLTRSSQLPLTDRVDDPKTFLRQPPTFHQR FT STEDASAYSSDKVLFATSSRFTPATAKPYHPPSTPAQPVPGDFPQTPALLH FT RRLPQLLKEEGRLRIEHRATALPFHTSPLKSTETMTYAAGDTSNSKMTKSS FT SQGDDFPPDSSTNDLLRLILMAQNAGIQAANESARRIARLEEERIAAARES FT AEKIAALEERLLTFQLNSASSSQSPPAPEPADRGIDLTRFKSSDGPTFKGP FT YRDTEAFLKWFTALKIFFLVKKIDKDADRIILTGSYLLETNLQAFWSHNVN FT TFKEGLWKDFKDRLFRSALPSDWSDKLREKILLLRMSTSKDFKAYSTRARS FT TQSLLNHDELVLQDYKLAEHLILGMLPELKSTIRIHVPIKEEDFDYNHFEE FT RCDQFYDDLVVKKVIQKRHITSTSARPSTFNNTRPPPEKSTNEDFYWKISA FT YLDSAGLCHYCKKQCGSEYRQCTGTLSRERIVFPPEFKAPPKPANYVPPRA FT HLASTSNAAGRPVNPPAGRPVKNTVAAATFPEHNTTTIAAYEEIDRELREI FT ANGNDELIVPDETGYVNQPVSQPVVILQLNCNGAPIRALEDAGSETNLISE FT DFIDRLKLKRRKLVKPTIMGLALEAGGQPPMFTEFTTATLKHPSSGFCFDR FT TYLKIGKLGSSYDIILGAPFLARHQLSVSCSQKAVISEITDARLLDYRHEE FT EMKEEFKVSIREKIALTEERVESEKWNSWEKSLQEKNNPLLTRESESWKSM FT EESFLIKFQDLFPVDIPAISDEAEAKGEFKDGSFPESLQDPSSKVRHKIIL FT TNENAVINERQYPYPAKHMAAWHKLVDQHLAAGRLRRSFSQYALPSLIIPK FT KDPLELPRWVCDYRTLNSMTVRDRSPLPNVDELVRLVAKGKVFSILDQTNA FT FFQTWMREADIPLTAVKTPWGLYEWCVMPMGLTNAPATHQARLKEALGELI FT NTVCVVYLDDIVIFSDSVEEHKKHVELVLKRLRVANLYCSSKKTKLFRQEV FT KFLGHVISADGMRADEEKVEQIRQWESPKSGKGVRKFLGTVQWMKKFISGL FT EKYVGKLTPLTSSKLKKRDFRWGEAEERAFQNIKRIMTTLPCLKTVDYDST FT DPLWLFSDASGTGLGAALFQGKEWKTAHPIAYESRQMSAAERNYPVHEQEL FT LAVIHALQKWRMLLLGMKVNVMSDHHSLTYLLKQRSLSRRQARWLEQLADF FT DLQFQYVKGEDNSVADALSRDVDDLQTGVETIAALALSQTRISEEFHRKVT FT AGYESDKFCLAVRDGTPLREDCYIDNGLIFIEDRLLIPDSENLRIQLIDEA FT HRRVGHLGLLKTVATLRSDFFWPKMSKDVEARLKSCETCQKTKSRTTLPHG FT KMKTPHMPTEPLSDIAIDFIGPLPKINTYDMLLTCTCCLTGFTRLIPTNQS FT DTAEKTASRFFTAWMGSLGAPKSIISDRDEAWSSKFWRLLVERLNISFHRS FT LAYHPQADGRSERTNKTVGQILRTFTEKRQTRWLESLPAVEFAINTAINVA FT TGHSPFEVVFGRQARLFPSTMTTNESPPSLDVWLQQREGTWAQVRDNLWSS FT RVLQALQHNKRHQDLTLNTGDWALLDSGDWRGQHSGGVDKLKERYKGPYKI FT LETFNTGQSVRLDLPTGDKRHPVFNISKLKAFVEDSQLRSSDQK" XX SQ Sequence 5361 BP; 1496 A; 1413 C; 1195 G; 1257 T; 0 other; cttttttttt ctaataccaa 
tcccgagtaa acaaattgaa atcgtatgaa cagactttta 60 cgtcttatct ccgcacccgg attcacctga atcaaacctt atcgttgttc aaaccccaca 120 ttcatacatg agccctcccc cggattcacg tttaacccgt agttcacagc taccactcac 180 agatcgagtg gacgatccca aaacttttct tcgccaacct ccaacttttc accaacgttc 240 taccgaggac gctagtgcct attcttcgga caaagttctc ttcgctactt catctcggtt 300 cacccccgcc accgctaaac cgtatcatcc tccatcgaca ccggcacaac cagtccccgg 360 tgattttccc cagaccccag ctctacttca tcgtcgccta cctcaacttc taaaggaaga 420 agggagactc cgaattgagc atcgcgctac ggccctaccg tttcacacct cacccttgaa 480 gtctaccgag acgatgactt acgcagctgg agataccagc aactcaaaga tgaccaaatc 540 atcatcacaa ggcgacgact tccctcctga ctcatccacc aacgatcttc tacgtttgat 600 cctgatggca caaaacgccg ggattcaagc agccaatgaa tccgctcgga ggattgctag 660 acttgaggag gagcgaattg ctgcggctcg cgaatcggcc gagaagatcg ctgcattaga 720 agaacgcctc ctaacgttcc agctgaactc agcatcctct tctcaatcac ctcctgctcc 780 tgaaccagcg gaccgaggta ttgatctcac caggttcaag tcctcggatg gacctacttt 840 caaaggacct taccgtgaca cggaagcctt tctcaaatgg ttcacggcac tgaagatctt 900 cttccttgta aagaagatcg acaaagacgc ggataggatc atcctcacgg gatcctacct 960 gcttgagacc aaccttcaag ctttttggtc acacaacgtc aacactttca aggagggatt 1020 gtggaaagat ttcaaagatc gattgtttcg atcagcgtta ccatcagact ggagtgacaa 1080 gcttcgagag aagattctcc ttcttcgcat gtccactagc aaagacttca aggcctatag 1140 caccagagct cgatcaaccc aatcactcct gaaccacgac gaattggtgc tccaagacta 1200 taaacttgct gaacatctca tcctcggcat gttaccagag ttgaaatcca ctattcgcat 1260 ccacgtgcca atcaaagaag aagactttga ttacaatcac tttgaagagc ggtgtgatca 1320 attttatgac gatctggttg tcaagaaagt gatccaaaaa cgccacatta cctccacttc 1380 tgcccgtcct tcgacgttca acaacactcg cccccctccc gagaaatcaa ctaatgagga 1440 cttctactgg aagatcagcg catatctcga ctcggcaggt ctctgtcact actgcaagaa 1500 gcagtgcggc agtgaatacc gacaatgcac tgggacgctt tccagggaac gaatcgtatt 1560 cccacctgaa ttcaaagccc cgccaaaacc tgccaactat gttccaccaa gagctcattt 1620 agctagcacg tcaaacgccg ctggcaggcc tgtcaaccca ccagctggac gcccggtcaa 1680 gaacacagtt gcggcagcca cctttccaga acacaatacc 
accactatcg cggcttatga 1740 agagattgac cgagagctgc gtgaaatcgc caacggaaat gacgaactca tcgtgcctga 1800 tgaaacaggg tacgtcaacc aacctgtctc tcagcccgtg gtgatcttgc agctgaactg 1860 taatggggca cctattcggg cactggagga cgcaggttct gagacaaacc ttatctcgga 1920 ggacttcatt gatcgattga aattgaaacg tcggaagcta gtcaaaccaa caatcatggg 1980 actagcactc gaagctggag gacaaccccc aatgttcacc gaattcacga cagccacatt 2040 gaaacacccc tcctccggtt tttgctttga tcgaacttat ctcaagattg gaaaattggg 2100 atcatcttat gacatcattc ttggagcccc ctttcttgct cgtcaccagt tatcagtatc 2160 ttgttcacaa aaagctgtaa ttagtgaaat cacagatgcc agacttttag attatcgtca 2220 tgaagaggaa atgaaagagg aattcaaagt ctctattcgt gagaagattg cgttgactga 2280 agagagagtt gagagtgaga aatggaactc gtgggagaag tccctacaag agaagaacaa 2340 tcccttgttg accagagaaa gtgagtcgtg gaagagcatg gaagagtctt tccttatcaa 2400 atttcaagac ctcttccccg tggacatccc agctatatca gatgaagcgg aagcaaaagg 2460 agaattcaaa gacggctcat tccctgaaag cttacaggac ccatcatcaa aagtcaggca 2520 taagattatc ctcaccaacg agaatgcagt gattaatgag cgccagtacc catacccagc 2580 caaacacatg gcagcgtggc ataaacttgt ggatcaacat cttgcagcag gccgcctgcg 2640 acgctcattc agccaatacg ccttgcccag cctcattatt ccaaagaagg accctttgga 2700 acttccaagg tgggtatgtg attaccgcac cttgaatagt atgaccgtca gggaccgttc 2760 acctcttcct aatgtggatg agttggtcag actagtggct aagggcaagg ttttttcgat 2820 tcttgatcaa acaaacgcct tcttccagac ctggatgcgg gaggctgaca ttccattgac 2880 agctgtgaag acaccatggg gcctttacga atggtgcgta atgcccatgg gactcacaaa 2940 cgctccggct actcaccaag cgagactcaa agaagcattg ggagaattga tcaatacggt 3000 gtgtgtggtc tatttagacg acattgtcat attttcagac tcagttgaag aacacaagaa 3060 gcatgtggaa ttggtgctga aaagattacg tgtggccaat ctttattgta gtagtaagaa 3120 aacaaagttg ttcaggcaag aagtcaagtt cctaggtcat gtaatctcag ctgacggaat 3180 gcgagcagat gaagagaaag tggaacaaat tcgtcaatgg gagtcaccaa agtccgggaa 3240 aggagtccga aagtttcttg gcaccgtgca atggatgaag aaattcatct caggcctcga 3300 aaaatatgtc ggcaagctca ctcctctaac aagcagcaaa cttaagaaga gggattttag 3360 gtggggcgaa gctgaagaac gagcatttca aaacattaaa cgaatcatga 
ccaccctgcc 3420 ctgcctgaaa accgttgact acgactccac cgaccccctt tggttatttt ccgatgctag 3480 cggcacaggt ttgggcgcag cgctttttca gggtaaggag tggaaaacgg cacatcccat 3540 tgcatacgaa tcccgtcaaa tgtccgcagc agaacgtaac tacccggtgc atgaacagga 3600 gcttctcgca gttattcatg cacttcagaa gtggcgaatg ctcttattag gcatgaaggt 3660 aaatgtcatg agcgaccatc attcgctgac ttaccttttg aaacaacgtt cactcagtcg 3720 acgacaggcc cgatggctcg aacaacttgc ggactttgac ttacaattcc agtatgtcaa 3780 gggcgaggat aattcggtgg ccgacgcgct gtccaggtaa gatgtcgacg atctacagac 3840 aggagttgag accatcgcag cgttagcctt atctcaaacc agaatctctg aagaattcca 3900 caggaaagtg accgctggtt acgagtccga caagttctgc ttggcggtca gagacggcac 3960 accactccga gaggattgct atatcgacaa tggattgatt tttatcgaag acagactgct 4020 gattccggat tctgaaaact tgagaattca acttatcgac gaagcacata gaagagtggg 4080 ccacctgggt ttactgaaga ctgtggctac cctacgatcg gatttcttct ggcccaagat 4140 gagcaaggac gttgaggcgc gactgaagtc gtgcgagacg tgccagaaaa cgaaatcccg 4200 cacaactctt cctcacggaa aaatgaagac tccccacatg ccgacggaac ccctttctga 4260 catcgcaatc gactttatcg gccccctccc aaagattaac acttacgata tgctactcac 4320 ctgtacttgt tgcctcactg gtttcactcg ccttatcccg acgaaccaaa gtgacacggc 4380 cgagaagacg gcatctcgtt tcttcaccgc ttggatgggc tccttaggag cacccaagtc 4440 catcatcagc gaccgggatg aagcgtggtc ttctaaattc tggagattgt tagtcgaacg 4500 acttaacatc tctttccatc ggtcattggc ataccatcca caggcggatg ggagaagtga 4560 aaggactaat aaaacagtcg gtcagatact ccggaccttc acagaaaaac gtcaaactcg 4620 ttggctggaa tcactgccgg ctgtcgaatt tgcaatcaac accgcgatca acgtcgccac 4680 aggtcattcc ccattcgagg tagtttttgg acgccaggca agactctttc cctccaccat 4740 gacaactaat gaatcaccac cctcactgga cgtgtggctt caacaacgag aagggacgtg 4800 ggcacaggtt cgggacaact tgtggtcaag ccgggtactt caggcactcc aacacaataa 4860 aagacatcag gacttgacgt tgaatacggg agactgggcc ttactggact cgggagattg 4920 gcgtggccaa cattcaggtg gcgtcgataa gctcaaagaa cggtacaaag gcccgtacaa 4980 gattttggag acattcaaca ctggtcaaag cgtccggttg gacctcccca ccggcgacaa 5040 gcgacatcct gttttcaaca tctcaaagct taaggcattc gtggaggact ctcagctgag 
5100 gtcaagtgac caaaagtaag tcccatcctt tgtatgcacc gccgggcacc tctacgccgt 5160 tgtttcgaaa actttacctt ggccacactg tgagcgtccc ttgggttcca aaaaaaaaaa 5220 ttcctttctt ctcttaaaga aaaattggtg acccctgcgt gggtatagtg gcggttcttc 5280 cttttttttc tttcttttct tgactcaatc accttgcgca ttctcgcacc ttggcattaa 5340 gccttttata ggaggggaga a 5361 // ID Gypsy-11_MLP-I repbase; DNA; FNG; 6196 BP. XX AC AECX01001578; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_MLP_; KW Gypsy-11_MLP-LTR; Gypsy-11_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6196 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001578; Positions 47186 40991. XX CC Positions [3192-3617] - Reverse transcriptase CC Positions [4821-5300] - Integrase core CC 'CTTG' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 367..1998 FT /product="Gypsy-11_MLP-I_1p" FT /translation="MKSLLSRRNLQNQIPWTKPTESFKFTQAQYDALSPEQ FT ADYFKTLRRFKLGKLFKIPAFAQTLLGRNRDGFFKGEQPDSPTNYKSSLNS FT ESLPLTPIGSKAPLMPGAPSKSNLVTPASVNEDDHSKGKGPMEVTTNQPNT FT RVYPLTGYQGRIREPIDSSSSGAENEDVHMHDVKPQVKITLTDPNEIKNFR FT QWQASRDADRVRLTQRVESVTLSEDTITAKPTAPVVSSNLRPLRLVSVRDS FT VEPGVCSENRAANHTTSRYSQGPEDGIPRDKDYHKICKTLTVRKFEKFDGT FT SSFDAHLWFANLSRALMLLNVDESVWHLVAFQFLEGAALTELNQVIGSALQ FT PRSFEDLENFLIDSFPSSLSLITMQDRFDNFKFRPNEKVSDAVARFRILQN FT NAAHLGFHYQESTIFLSKLPPALRRYCQSEIERDARAGKPMNFTQLVLCAT FT DRDNSFRSEKNSVNTVSTPPAASSNNNGRRNNKRKSSGAVDQTNTICYNCN FT KKGHRFGTLDNPICTVKPSPRTINYFKRIKGNPPSSNAVASGSGESKA" FT CDS 2340..6053 FT /product="Gypsy-11_MLP-I_2p" FT /translation="MAGMIQDRLLDRLATIESDSGLPRVDFVLTLNNVSIR FT ILLDTGAKQSYISQVYVDERKIVPQRNPSQPVVYGVWGKPYKCNHLADLEI FT DFGIVSINHELQVAPLASYDVILGMDWILHYAKSTDWVTGTWVLCDSKGNE FT QSFRPAAISSPLKSLHLIEGEGHLEIDDAPCTRSQFRRFCRNKNVESWIAC FT PKETILQIGESDGPREEIPSTLPKISTTEPKLVQCATGIVAKFKQLFDDIT FT EAPKAEQVVQHLIDTGDSKPIAQAVCRMLPLLLSELQKKLESLEKNGFIQP FT STSPWSSPVLFAKNASGKLRFCVDYRAINAVTKRDRHPLPLIQDCFDQLHG FT AVWFSKFDLQQGFHQMKISPDHVPKTAFSTKYGHYEWLVMPFGLVNAPSTF FT QRMMSNVLRQFLDKFVQVYMDDILIYSKSDEEHEEHVQLVWEALAQQDLKV FT SGAKSELFADEIQFVGHMVSSSGIRPMQDKLDAIQAWPRPSNVTEVRSFLG FT LTGYYRRFVKNFSKIASPLHGLTEGNVTKKSKVLWTAVHEAACELLKKALL FT EPPILISPDPNKPYIIETDASDFTVGAVLLQVGKDGREHPVAYDSHKLGQA FT QRNYPAQERELFAIIRSWRKWRNYVEGAVQDTIVRTDHVTLTYLHKQALPT FT RRLLHWIEEFGEMTIKVEYKKGSTNIVPDALSRQSDHELLVIHDCKSGLRD FT PTDWPLLIPYIRTLRELPKWVTPGVMETAIRSSHEFEFDPKEETLIWIGNQ FT QERSPFIPFYQRAELMDVLHRRYGHRGRDGTLSLLRDRGWWPGRYKDVETY FT CKFCPECQIYDAPDKSQETAKQMPLPPVDPFERWAVDFISLPESVDGYKWI FT LTMIDHGTGWPLAIPMKTATSANVAEALVRDLIQVYGVPSEILSDRGKNFL FT SKEMTAFYEGFHVHKLNTSGYHPRSNGKCERFNGLLEKALFKANTSKDPAR FT WPEYLAEALFAVWVNKSTVTAWSPFELLYGVSPRLIGDPAKLRPRELDSDP FT SCQEVRLEKLREARKQASEALVQRAATNKARFDGQFEPDEATGKARSKVVA FT YNIGDKVKLCNEAHTKGEPHWFGPFEIFDSLGLNVYTLSDHRHSLFPHPIG FT GNRLKPALVREKDLGKAWALPPRLIQEITKEDLQVSKILKMKAKRLAKAQE FT 
VAVPTRIRLVGRFAPGAGNVTAETPVKHPLSPPTPGQLADLPPVLSEPKAS FT QVEISSVKTPDLSKPALRRSTRIRQGPR" XX SQ Sequence 6196 BP; 1704 A; 1452 C; 1338 G; 1702 T; 0 other; ctggtagcga gagcttttct ttttcttctt gaaattgata atctgacgtt ctcttctgtt 60 ttataaactt tttgataatt tttcccatcg gaaaaatctc tatagttttc gttctttgaa 120 acctatcttt ccaactaaca actgttctac tctggaagtt attgaaagaa aaagttaaat 180 tttacgaaat ccttttataa aacaagtgag tccatcttct tagctttaga aaatatatta 240 gtacgaagaa aagagaaata gtttacttat tacagttgat ctgtggttag actatataag 300 aaatgtccaa cccatcttca gttcaaggtc aaaaccctgc cgaagtccag atcccactta 360 ggcaacatga aatcgctgct ctcaaggcga aacttacaaa accagatacc ttggacgaaa 420 ccgacggaat cttttaaatt tactcaagct cagtatgacg ccctcagccc agagcaagcg 480 gactacttta aaacgctcag aagattcaag ttgggaaagt tgttcaagat tcccgctttc 540 gctcaaactc ttttgggtcg caaccgggac ggattcttca aaggagagca acctgattca 600 cccaccaact acaagagtag tttaaattcc gaatctctcc ctttgactcc tatcgggtcc 660 aaagctccgt taatgcctgg agctccctca aaatcgaatc tggttactcc ggcctcggta 720 aacgaagacg atcattccaa aggaaaagga cctatggaag taactaccaa tcaacctaac 780 actcgagttt atccactcac tggttatcag ggtcgcatcc gcgagcctat cgactcttcc 840 tcctccggcg ccgagaatga agatgttcac atgcatgacg tcaaaccgca agtaaaaatc 900 actctcaccg accctaacga aatcaaaaac tttcgccaat ggcaggcatc tagggatgct 960 gatagagttc gtttgactca gcgcgttgaa agtgtcaccc tctcggagga tactatcacc 1020 gccaaaccca ccgctcccgt cgtctcttcc aacttacggc cccttcgatt agtctctgtc 1080 agagattccg tcgaaccggg tgtttgtagt gaaaatcgag ctgcgaacca taccacctcc 1140 aggtattctc aaggaccaga agacggaatt ccgcgggata aggactacca caagatctgc 1200 aagaccttaa ctgtccgaaa atttgagaaa ttcgacggaa ccagttcgtt cgacgctcat 1260 ctttggtttg ctaacttatc gcgagccctt atgctcctca atgtcgatga atcagtttgg 1320 catttagttg ccttccaatt ccttgaagga gccgctttga ccgaattaaa tcaagtaatt 1380 ggctcggctc ttcaacctcg aagttttgaa gatttggaga attttcttat tgattccttc 1440 cctagctccc tgtccctcat taccatgcaa gaccggttcg ataacttcaa gttccgtcct 1500 aatgagaagg tttcggatgc tgttgctcgt ttcaggattc ttcaaaacaa tgcagctcat 1560 ttaggtttcc 
actaccaaga aagtactatc ttcctcagca agttaccccc tgcccttcga 1620 cgttattgcc aaagcgaaat tgagagagat gctagggctg gaaagcctat gaatttcact 1680 caattagttt tgtgcgccac cgaccgcgac aactctttcc gatccgaaaa gaactcggtc 1740 aataccgtgt cgactccacc agctgcttct tcgaacaaca atggtcgaag aaacaacaaa 1800 cgcaagtcct ccggtgcagt cgatcaaacc aacaccatct gttacaattg taacaagaaa 1860 ggccatcgat ttgggacctt ggataatccc atttgcaccg tcaaaccgtc tccccgcact 1920 ataaattatt ttaaaagaat taaaggtaat cctccttctt ctaatgctgt tgccagtggc 1980 tccggagagt caaaagctta ggcgatacat ctttagctcc tcctgtattg cctgatctgt 2040 gtgtgatctc aacctctact gtttctactg ttgtcagcaa tgaatctgtt aatttggttg 2100 attttattat tgagcccaag gtccctgcga ccttatctga ggagaaaccc tccaatcagc 2160 ttgtgttaca agctaattct tcagaagccc cttgtgtcct ctcaaaagtc gagaaattgc 2220 ctctgattaa agctcctagg ttgtacaact acctaccaag cgctccctgt aaggagcatt 2280 gttgccaaac agttgccgaa tcatctgcat tctcacaggc cagtgagccc agtggtaaga 2340 tggctggaat gatccaggat cgcctcctgg atcgtttagc aacaattgaa agcgactctg 2400 gtctgcccag agttgatttt gttctgacac tgaacaatgt ctccatcaga atattattgg 2460 atactggtgc gaaacaatcc tacatttcac aggtttatgt agatgaaagg aaaattgttc 2520 ctcaacggaa tccttcgcaa cccgttgttt atggggtttg gggaaaaccg tacaaatgca 2580 atcatttggc tgatctagag attgactttg gtattgtttc gatcaaccat gagctacaag 2640 tagctccgct cgcgtcatac gacgtgatat tagggatgga ttggattctt cattatgcaa 2700 agtccaccga ttgggtcact ggtacttggg tattatgtga ttcgaaagga aatgagcaat 2760 cttttcgtcc tgctgcaatc tcaagtcctt tgaagtctct tcaccttatt gaaggagaag 2820 gacatcttga gattgatgac gctccctgca ctaggtcaca atttagacgg ttctgcagga 2880 acaagaatgt cgaatcatgg atagcttgcc ctaaggagac tatcctacag attggcgagt 2940 cagatggacc tcgagaagag atccctagta ctctcccaaa gatctctacc accgaaccca 3000 agttagtcca atgcgcgacg ggtattgttg ctaaatttaa gcaactattt gatgatatta 3060 ctgaggcacc aaaagcggag caggtagtac aacatttaat agatacaggt gacagtaagc 3120 caatagccca ggcagtttgt agaatgttgc ctctgttatt atcagaactt caaaagaagt 3180 tagaatcctt ggaaaagaac ggatttatac aaccttccac ctccccttgg tcctcacccg 3240 tcctttttgc taagaacgca 
tctgggaaac taagattttg cgtggactac cgggctataa 3300 atgctgttac caagagggat agacaccctc ttcctcttat ccaagattgt tttgaccaac 3360 tacatggtgc agtttggttc tctaaatttg atcttcaaca gggatttcat caaatgaaga 3420 tatcccctga tcacgttcct aagacggctt ttagcaccaa atacggtcat tacgaatggt 3480 tggtaatgcc gtttggtcta gtaaacgccc ctagcacgtt ccaacgtatg atgagcaatg 3540 tgttacgaca gttcttagat aaatttgtac aagtatatat ggatgacata cttatctact 3600 ccaaatccga cgaagagcat gaggaacatg ttcagttggt gtgggaagct ctagctcagc 3660 aagacttaaa agtcagcggt gctaaatctg agttattcgc ggatgaaatt cagtttgtag 3720 gccatatggt gtcgtcgtct ggtattcgtc ctatgcaaga caagcttgat gctatccaag 3780 catggcctcg gccctctaat gtcactgagg ttcgatcctt cttgggcctc acgggatatt 3840 atagacgctt cgttaaaaat ttttcgaaga tagcgtcacc gttgcacggc cttacagaag 3900 gaaatgtgac aaagaaatcc aaagtcctgt ggactgcggt acatgaggca gcgtgcgaac 3960 ttctcaagaa ggctctgtta gaacctccca tcttgataag tccagatcca aataaacctt 4020 atatcattga aaccgacgcg agcgatttca cagtaggtgc cgtcttactt caagttggca 4080 aagacgggcg agaacatccc gtcgcttatg actcacataa gttgggtcaa gctcaacgaa 4140 attaccctgc ccaagaaagg gaattgttcg caattatccg ttcttggcgc aagtggagga 4200 attacgttga gggtgcggta caagatacta ttgtacgaac cgaccatgtc actttgacat 4260 atttgcataa acaggccctc cccactagac gactacttca ttggatcgaa gagtttgggg 4320 agatgacaat caaagttgaa tataaaaaag gatcaactaa cattgtccct gatgctctta 4380 gccgccaaag cgatcatgaa ctcttggtaa ttcatgattg caaatcggga cttcgtgacc 4440 ctacggattg gccgctactc attccttata tcaggacact acgcgagtta cccaaatggg 4500 ttactccggg agtaatggag acagctattc gcagttctca cgaatttgag ttcgatccta 4560 aagaggagac tcttatttgg attggtaatc aacaggaaag aagccctttt attccgtttt 4620 accaaagggc cgaacttatg gacgttcttc atcgtaggta tggccatcga ggaagagatg 4680 ggactctctc tctccttcga gatcgaggat ggtggcctgg acgttacaaa gatgttgaaa 4740 cttattgcaa attttgtccg gaatgtcaaa tatacgacgc tcccgataaa agccaagaaa 4800 cggctaagca aatgcctcta ccacctgttg acccgtttga aaggtgggcg gtagatttca 4860 tatctctccc agagagtgtg gatggctaca aatggatcct taccatgatt gatcatggta 4920 caggatggcc gttggcgatt cctatgaaaa 
cagccacttc ggctaatgtg gcggaagctt 4980 tggttagaga tttgatacaa gtttatgggg tgccttcaga aattctgtcg gatcgcggaa 5040 agaactttct ttccaaagaa atgacggcct tctacgaagg atttcatgtc cacaagttaa 5100 acacttcggg atatcaccct cgttctaacg ggaaatgcga gcgtttcaat ggattgttag 5160 aaaaagcctt gtttaaagcc aatacttcca aagacccggc tagatggccg gagtatctcg 5220 ccgaagcctt gtttgccgtt tgggtgaaca agagtaccgt cactgcttgg tctccttttg 5280 agcttctcta cggagtcagc cctcgactta ttggcgatcc agctaagttg aggcctcggg 5340 aattggacag tgatccctcc tgtcaggagg ttagactgga gaaacttcgt gaggcacgga 5400 agcaggccag tgaggccttg gttcaacgtg ccgcgacgaa caaagctagg ttcgatggcc 5460 aattcgaacc agatgaggcg acgggtaaag ctcgttcaaa agttgtggcc tataacattg 5520 gcgataaagt aaagctttgt aacgaagccc acactaaggg agaacctcat tggttcgggc 5580 cgttcgagat ctttgattct cttggtctta atgtctacac ccttagcgat catcgccatt 5640 cgttatttcc tcatcccatt gggggaaatc ggttaaaacc tgccctggtt cgcgaaaagg 5700 acttgggcaa agcgtgggct cttcctccaa ggttgataca agaaatcacc aaagaggatc 5760 tacaagtttc gaagattctt aagatgaagg cgaaacggtt agcaaaagct caagaggttg 5820 ccgtccctac ccgcatccgt ttggtgggga gattcgcacc tggtgctggc aatgtcacag 5880 ctgaaacacc tgtcaaacac cctctgtcac cacccactcc cggccaactt gcagatttac 5940 ctccagtttt gagtgaaccc aaagcttctc aagtcgagat ttcgtccgtc aaaacacccg 6000 acttatcaaa accagcactg cgccgttcaa caagaattcg tcaaggacct cgttaggagt 6060 ctgttttcag aaaatttttt ctctcgtttt cattgcactg ttacggttat caattttcct 6120 ttccttctct gtaggtttac ttctgttaca tagactgtgc gattcaggag tattgcttgt 6180 ataagagggg gatagt 6196 // ID Gypsy-99_MLP-LTR repbase; DNA; FNG; 138 BP. XX AC AECX01000442; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-99_MLP_; KW Gypsy-99_MLP-I; Gypsy-99_MLP-LTR. 
XX
OS   Melampsora larici-populina
OC   Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina;
OC   Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora.
XX
RN   [1]
RP   1-138
RA   Jurka J. and Kohany O.;
RT   "LTR retrotransposons from the Melampsora larici-populina
RT   genome.";
RL   Direct Submission to RU (21-APR-2011).
XX
DR   Genome; AECX01000442; Positions 22121 21984.
XX
SQ   Sequence 138 BP; 41 A; 38 C; 18 G; 41 T; 0 other;
     tgtacccttt ctcttcatgg tacaattcaa tcatacatcg ttataaggta taagcagcac 60
     aagacccaat ctcctatagt ccctcatcct aagatccaat agactagtct ctgtacgtaa 120
     ccgagtctgc tcatatca 138
//
ID   Gypsy-4_LENY-I repbase; DNA; FNG; 4763 BP.
XX
AC   AAPO01000014;
XX
DT   12-FEB-2011 (Rel. 16.02, Created)
DT   12-FEB-2011 (Rel. 16.02, Last updated, Version -1)
XX
DE   LTR retrotransposon from the Lodderomyces elongisporus genome:
DE   internal portion.
XX
KW   Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_LENY_;
KW   Gypsy-4_LENY-LTR; Gypsy-4_LENY-I.
XX
OS   Lodderomyces elongisporus
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina;
OC   Saccharomycetes; Saccharomycetales; Debaryomycetaceae;
OC   Lodderomyces.
XX
RN   [1]
RP   1-4763
RA   Jurka J. and Kohany O.;
RT   "LTR retrotransposons from the Lodderomyces elongisporus
RT   genome.";
RL   Direct Submission to RU (12-FEB-2011).
XX
DR   Genome; AAPO01000014; Positions 545392 540630.
XX
CC   Positions [3608-4099] - Integrase core
CC   'TAATT' target site duplication
CC   LTRs are 98% similar to each other.
XX FH Key Location/Qualifiers FT CDS 1292..4744 FT /product="Gypsy-4_LENY-I_1p" FT /translation="MTLPSVKGRECMALIDSGSGADLVLAEFVGGKIDSLP FT QDTCLCDVVVAFGASRQVTKRVLLEFSINGIHFSRWFMVVPGFSKDMVLGL FT GFVGEHENLISFKKRSFAGVSKENPLGLVDSEEFVEAVENAAEVGVFTLKT FT EEVGEELPRDPNGLFPDLIKDYKDIFLQELKETPVSRGKWDHRINVIPYVT FT APSGKQYPLGKPEFAELKSQVKKMLDAGLIRQLGVGESDFNSPVLFVRKKD FT GSYRMCVDYRLLNLTVVQKQFQMPVVEQLIKEVSGYRYYSTLDMTQSFHQI FT RLDDETSHVTAFTAFGKKFAYNVLPFGFTNAPAILQETVSQLIQEIPGCVN FT YIDDIIVYSNSIEDHRKSLQMLFEAFRRNKFFFKGSKCELGVSRVTFLGHE FT VGMKGTRIPEAQLKAIAALKTPDNPKDMRSALGFFNYFRHYIENFSQVAAP FT LYEFATKRKVQFTKVHEEAFEKLRRKLLKSELLIKVNYDLKPVSFQLEVDA FT SHHAIGAVLTQVHNQEVVGVIEMVSRSLTVAERNYPIRQKELLLIVFAVKK FT FRHYILGYETVVYTDHQSLETLFASNTRPESERIIRWLESLQECNLKVRYR FT SGEENVLADVLSRLVDAGKVEVDDLDATVNEVIESGETVTQVLRQRNVVDQ FT IKASYDTDAYLSQILKLLQDTDRARNPIPPELKSAIKKYSLRDGVLYYNTQ FT GGSKPVVGKSVAMLVLDKVHSFGHFGITKCYFAIQPYLFVPKLLEVVTDYV FT NSCDHCQRAKAIHGSVGGLMLPSDVPKDVFSTIHLDFVTGIPTTPEGYDCI FT LVVVCALTKYCVCIPTRKKSKVIQTAKLLIDEVFSVFGVPDVIKSDKDIQF FT MNSIMKYVFEYYQIDFRTTSTNNPSTNGQVEALNKVVIQTIKSFCHREHAL FT WSHYLKVVQFAINTNYAPAIRMTPYQAVFGRLPRDRTGILDINMQHSSMSA FT ELLVRRAEAVHAQIKDTMSLSQDVMARKANRGKNPVRFEVGDMILLHREAY FT WRPGKYRKLTDVYHGPFRLVKKINDNAFEVDLPSMNKKDRVINVKYFRKYI FT EQRGVFKEPPVNLVEAEARVKELSAILGYDAENFEFDVTWVGCRPGHGSTV FT PIAWFYQKVPKGLRDTLIANAIDFLGDRFKDADADVEDPEEEV" XX SQ Sequence 4763 BP; 1299 A; 787 C; 1284 G; 1393 T; 0 other; tctggtagcg tcacttctat aatattctta atatcaccat gaattcagac cggatggata 60 tcgaacctcg tcatttcagc gttcctatgg aagccatgga ggcgttgatt cgtgaccagg 120 ctaccatgtt tgccaagatg attgagacaa tgaatgcaga taggtttaag gctggtacag 180 agtcgaaccc cgcggccttg ttcaatccgc gtaatcccgc gaagttgaag aagtttattt 240 ccgaggtcga gacgaaggct caatttcacg acaaatatcg ggataataag gtggcttatg 300 ctcaaatatt catggtgggt agtatagttg atgattgggt gaagaagaat ccaggtgttt 360 ggcagcttag ctggaatgag ttccggacca agctttatac ctcattctta gatccatctt 420 acgtcgataa actcgaagtt aagttggagg ctttgaagca gaagggtacg gtggaaaagt 480 acgtcgaaga gtttacggca cttaaatcac 
aattgcctga aggtcttagg tctgaaagat 540 cctataagcg tgtttttgtt atgggattga agagtgctat tagatctccg ttgatgggta 600 agttgccaaa tgacgatgta tctcttaacg agattatgaa tgatgccctt ttgcaggatg 660 ctggtgtgca agttggctgg aaggatggtt ctaaagagaa cgctgataga gccaacactg 720 ggagagtata tgaagaccca gatgcgatgg aattggatgc tgtttctgtt aggaaactca 780 gaaagctgga aaagcagttg ttaaaggcga ataatggctg ttttaattgt cgtaagactg 840 ggcatattgc gaagttctgc ccattgcgtg gaaagacctc gggtaaatct tcaggagact 900 caaaaaacta gttaagtctc tagagctggg ggacgttaaa ccaaacagct tggcactgga 960 gtccggtaga cgtaccgtga catttgatca agtttcggtt cggcgtccgg aggaggtgcg 1020 cgcggagttg ggaagtgtgg taacgcgtga cgatgctaag aggaatgaac tcgtacaaga 1080 ggtaccgagt ttgagggctg attgtaactt ggatgagtta gatgaagctg cacggctgac 1140 ggacgtcagt ggatcccaag gtcgggatgg tgactcgcaa gagttgaaag agcttgagct 1200 ctttgaggtt tcagtgagga aggggtttga tccggagaag gtcattccgc cacctcccat 1260 tgaaaatagt aagttttatg ttccggtggt aatgacactg ccttcggtta aaggcaggga 1320 atgcatggca ttgatagatt ccggttcggg cgcagacttg gtgctggctg agtttgtggg 1380 agggaagatc gatagtctcc ctcaggatac ttgcttatgc gacgtggtag ttgcattcgg 1440 tgcaagtagg caagtgacca agagagtgtt gcttgagttt agtattaacg gtatacattt 1500 cagccgttgg tttatggttg tcccagggtt ttccaaagat atggtattgg gactaggatt 1560 cgtcggtgaa cacgagaatt tgatctcatt caaaaagaga tcctttgccg gagttagcaa 1620 agagaaccct cttgggttgg ttgacagcga ggaattcgtt gaggcagttg aaaatgctgc 1680 ggaagttggt gtgttcacat taaagacgga ggaagttggt gaggagttgc cgagggatcc 1740 aaatggtttg tttcccgacc tcatcaagga ttataaggac atatttcttc aagagttgaa 1800 ggaaacgccg gtgtctaggg ggaaatggga tcatcggatt aatgtgattc catacgtaac 1860 ggccccgagt ggaaagcagt atccactagg taaaccagag tttgcggagc ttaagtctca 1920 ggtcaagaag atgcttgatg ctggtttgat tagacagttg ggcgtgggag aatctgactt 1980 caattcccct gtgttgtttg tccgtaagaa agatggttct taccggatgt gtgtggatta 2040 tcggttattg aatttaacgg tagttcagaa acaatttcag atgccagtgg ttgaacagtt 2100 gattaaggag gtgtcaggtt atcggtatta ttccactttg gatatgacac agagtttcca 2160 tcagatccgt ttggacgatg agactagtca tgttaccgct 
tttacagcat tcggcaagaa 2220 gtttgcttat aatgtattgc catttgggtt taccaacgct ccagctattt tgcaggaaac 2280 ggtttctcag ttaattcagg agatacctgg ttgtgtaaac tatattgatg acattattgt 2340 ctactctaat tctatagagg accacagaaa gtcattgcag atgttgtttg aagcgttcag 2400 aagaaacaaa ttcttcttta aaggttccaa gtgcgaattg ggcgtatcga gggtgacctt 2460 tttaggtcat gaagtgggta tgaagggaac acgaattcct gaagctcagt tgaaagctat 2520 tgctgctctt aagacaccag acaatccaaa ggacatgaga agtgcgttgg gtttctttaa 2580 ttatttccgc cactacattg aaaatttttc acaagtcgct gctccgttat atgagttcgc 2640 taccaaacgt aaagttcagt ttactaaggt gcacgaggaa gcgtttgaaa aattgagaag 2700 gaaattgttg aaatctgagt tgttaatcaa ggttaattat gacctaaaac cagtgagttt 2760 ccagttggaa gttgacgctt cacatcatgc gattggtgca gttcttacac aggttcataa 2820 tcaagaggta gtcggggtca ttgagatggt ttcccgttcc ctaacggttg ctgaacggaa 2880 ttatcctatt aggcagaagg agttgttact gattgtgttt gctgttaaga agttcaggca 2940 ttatatttta ggctacgaaa cggtggtgta tacagatcac caaagcttag aaacgctatt 3000 tgcttcgaat actcgtccag agtcggaacg aattattcgt tggctagaat cgttgcaaga 3060 gtgtaatctc aaggttcggt atcgaagtgg agaagaaaac gttcttgctg atgtgttatc 3120 gcgtttggta gatgctggta aggttgaggt tgacgactta gatgctacag tcaacgaggt 3180 aatagaatcg ggagaaactg ttacccaggt attacgacaa cgaaatgttg tggatcagat 3240 aaaggcttcg tacgacacgg atgcttattt atcacagatt ttgaagttac ttcaagatac 3300 ggatcgcgca aggaatccaa ttcccccgga attaaagagt gcgatcaaga aatattcttt 3360 gagagatggt gtgttatatt acaatacgca gggtggatca aagccagttg tgggtaagtc 3420 ggtagcaatg ttggtattgg ataaggtgca ctcgtttggg cattttggaa taaccaaatg 3480 ttattttgca atacagccat atttatttgt tcctaagttg cttgaggttg tgacggatta 3540 tgtgaactca tgtgatcatt gtcagcgagc taaggcgatt cacggttcag ttggcggttt 3600 aatgttgcct agtgatgtac ctaaggatgt gttcagtact atccacttag actttgtgac 3660 aggtatccct acaacacctg aaggatatga ttgcattttg gtggttgtgt gcgcgttgac 3720 gaagtattgt gtgtgcatcc cgacgaggaa aaagtcaaag gttattcaaa ctgccaagtt 3780 attgattgat gaggtatttt cagtgttcgg tgttccagat gtgatcaagt cagataagga 3840 tattcagttt atgaatagta tcatgaaata tgtgttcgag tattatcaga 
ttgatttcag 3900 gactacaagt actaacaacc catctacgaa tggtcaggtt gaggctttaa ataaggtggt 3960 tattcagaca attaagagtt tttgtcatcg ggaacatgcc ttgtggtctc attacttgaa 4020 ggttgtgcag tttgctatta acacgaatta tgctcctgct atacgcatga caccttatca 4080 ggctgtgttt ggacgtcttc caagagatcg tacgggtata ttggatatca atatgcagca 4140 cagtagtatg tcggctgaac tgttagttcg gcgtgctgag gcggttcacg ctcagattaa 4200 agatactatg tctttgagtc aggacgtaat ggcccgcaag gctaacaggg ggaagaatcc 4260 tgttcgcttt gaagttggtg atatgatttt gcttcataga gaagcgtatt ggcgaccagg 4320 aaaatatagg aagttgacgg atgtttatca tggcccgttt aggttggtta agaagatcaa 4380 tgacaatgct tttgaagttg acctacctag catgaacaag aaggatcggg taattaatgt 4440 caaatacttc cgtaaataca ttgaacagcg tggtgtgttc aaggagccgc cagttaattt 4500 ggttgaagct gaggccagag taaaggaatt gagtgcaata cttggttacg atgcagagaa 4560 ttttgaattt gatgttacat gggttggttg caggcctgga catggttcta cagttcctat 4620 tgcttggttt tatcagaaag tgccaaaagg gttaagagac acgttgattg ctaatgccat 4680 agacttcctt ggagatcgct ttaaggatgc tgatgctgac gttgaggatc cggaagagga 4740 ggtttgattt aacaaggggg tga 4763 // ID Gypsy-79_MLP-LTR repbase; DNA; FNG; 151 BP. XX AC AECX01001086; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-79_MLP_; KW Gypsy-79_MLP-I; Gypsy-79_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-151 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001086; Positions 13486 13336. 
XX SQ Sequence 151 BP; 40 A; 39 C; 25 G; 47 T; 0 other; tgtaaagagc ctaatagtga accaatatgc ttatacttgt attagacctt tagagtactg 60 ggagaagccc atcacacgct tctccctcac gttgcaatct catcatagtg cttccagttg 120 cttgtgttct ctctctctca agaccataac a 151 // ID Copia-34_MLP-LTR repbase; DNA; FNG; 813 BP. XX AC AECX01001687; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-34_MLP_; KW Copia-34_MLP-I; Copia-34_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-813 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001687; Positions 55889 55077. XX SQ Sequence 813 BP; 213 A; 143 C; 154 G; 303 T; 0 other; tgttggagta aattgtggat tggtcaaaac caatgaaggt cagggagtag tattcaggtt 60 aacttaagct agtgtagtga ggatggtaag aaatgcactg tgaacaaatt agaatcaaac 120 aagtcaattt agaatacgtg gtcaagaggt tgagttgttg tgtagaagga ttgaccgcat 180 tgggattggt gaactaatca cgttatggaa acattagaag aaattaccgg attgggactg 240 gtagtttatt tagaatgtgg acgtagtgtt tctgattctc aaaattgcca tttgttgttt 300 ttacttataa ccaagcctct cctcacattt agatcttttt ccctcattaa ggaaaaagat 360 tctaaacgtg agttattgcc aatttctttt tctcttcttt actatcttac taaaaactca 420 cgtttctctt ctacctcata catcttcact ctactagttc ttatctataa tcctccttgt 480 ctttcaggtc tgtccgagtt tagaactcgt gtcttttcat tgcctcagtg tgtcttgtca 540 acacgaactg ataagttctt gttaaacctt ttaaaaggtg tgttgtatgg ggcttatttt 600 ctttttctct tttgctgatt aaggaaaaag attctaaact tcttatctat aatcctcctt 660 gtctttcagg tctgtccgag tttagaactc gtgtcttttc attgcctcat gagtgtgtct 720 tgtcaacacg aactgataag ttcttgttaa accttttaaa aggttcctgt cgtagatcta 780 cagcgttgag cttgtgatcc tcattttcta aca 813 // ID 
Gypsy-14_CCO-I repbase; DNA; FNG; 6536 BP. XX AC AACS02000009; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_CCO_; KW Gypsy-14_CCO-LTR; Gypsy-14_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-6536 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000009; Positions 2251920 2258455. XX CC 'TCACC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 1812..3458 FT /product="Gypsy-14_CCO-I_4p" FT /translation="MHLEGETESGKIIVDTEGMIDSGAGGKFIDQNFARNK FT RLPLQLLPTPLKVYNVDGTLNKQGTIRYSTTMNVTIHGHTQPITFYMTGLG FT KQKVIMGFPWLEEANPIIDWRKGTLEWRQSPPSTPPITPTKPYTQIFKTTV FT EEIPDTEPQNLDPLPDWYPTTSPMIPKDHFDTFETYVNELMEGNFHWDNDE FT GSLLTDDLVISFINGELSDETEDIWINTTLTHSQAFELKYNQREEIKDIRQ FT HVPPEYHEYLSVFDEKAASRFPESRPWDHRIDLKPGFEPRSSKVYPLTPEE FT EKLVKEFIDDNLAKGYIRPSDSPQASGFFFVPKKDGKKRPCQDYRYLNEWT FT VKNAYPLPLVTDLMDKLKGAKYFTKLDVLKGYNNIRIRDGDQWKAAFKTKF FT GLHEPTVMFFGMCNSPATFQAMMDHIFSLQIGGNFVIIYMDDILIFSKTKE FT EHERHTKEVLRILRDNDLYVKPEKCEFTKEKIAYLGFIIERDRISMDPAKV FT KGLQDWPAPQTIKQVRSFLGFGNFYRKFIRKYSDIARPLNELPQEEPSIQV FT DR" FT CDS 3151..4284 FT /product="Gypsy-14_CCO-I_2p" FT /translation="MNDTPRKSFGSSAIMISTSNPRNVNSPRKRLLTSDSS FT SNETESPWTQQRLKDSKTGLHHKPSSKSDPSSASETSTGSSYGNTPTSLDR FT STNSLKKNQAFKWTDECQEAFDTLKKRFCEEPVLMMPDMSRPFQIESDASK FT YASGAVLTQMDSNGNRHPVAFLSKSFSPAERNYEIYDRELLGIIRALDEWR FT HYIQGSPHETLVYSDHLNLTYFRSPQKLKPRQARWQLKLSESAIKLSHLPG FT HKMILSDALSRRPDLCPEEDHDNENITLLPEELFVNLADLKTLTVGLVDIA FT LQEEIAKCQNFDTEATDALKIILSDGPKAMKDDLQDWTVEERDGFRLLFYK FT GRNYIPQDLDLRRKILHRYHDSPNGRTPRRTRNLQ" XX SQ 
Sequence 6536 BP; 1973 A; 2021 C; 1317 G; 1225 T; 0 other; gaagaactag ctcctacaca tcgtcactcg tacttacggg tgacgacccc atcgaccaca 60 cagaattccc acatactcgg catctgatcc cccggaacga ctccctacac caccgccatt 120 tgtcgaccca gtagccaccc taattgcctc aacacccgga tcccagtcaa ccacacccag 180 aggacgtctc cgagggctta ccccaagctc aaccagacca tcaacaccat ctttactagc 240 ggttaaccaa tcgctccctg aagaagaaac cgaagaagaa caagaagact cggaagacaa 300 gacagacaac accgactgca gcaacttcgt acaccagttt aactgtgaac actacgacac 360 caacgaagaa gactggttta actgcgccca acaaccttgc cccaactacc cggaagaccc 420 aagacatcga ccaagcaaac aagaaccctc agaagtttct tttgaagaag agccttctgc 480 agaattccaa gacgcccaag aagaacccag cactgagcca cagcatcaag aggaaggacc 540 cttgattcca gaacgtgaag aggaagaacc ctttacgcca atcacccttg aacaatcagt 600 caccatggtc agcccagcca acccagacaa cggaacccga gacgtcgtaa tgagcgacga 660 aaccaaccag aagatccccg aaaggcgaat caatccgcca tcccctttca ccggcaaaaa 720 ggaagaccgt caacaccttc ctcaaagacg tgaaacttta cttgaagggt caacaagcac 780 atctacaccg atgacgacag caaaatcacc tatctccttt cattcctcaa aggagacgaa 840 gcctccgctt ggaaagaagc atggctacgt tccaaggaaa ccgattccga tatcgaattg 900 ggcacctgga aggactttgt cgccatcttc aaacaggcgt tcaccccgat caacgaacaa 960 gaagacgcct tacagcgctt gaagactatg acccaaggga agaagacggc tgaagaacat 1020 gtagccgact tcaagattgt cctatcaaaa accaaccttc tctccaaaga ccacgccaac 1080 aaaaccgaag atcaaagagt tgcagacgct ggagacactg tagcccgaga ctacttccaa 1140 gccagtctga acgatccgct tcgaaagaag ctcttcgacc ttgagaaacc ccctcgaacc 1200 ttcaaggagt ggacagagaa ggccattgaa aaggacaaca actaccgtcg ctactttgcc 1260 ctcaccaaca aggcctcgac caaaacttcg acctcaacca ggcagtggaa cttcagtcgc 1320 cccgctcctg ccaaggaccc tatggccatg gacatcgaca ccatcatctc caaagtcaag 1380 ggcttagacc agtctgcagg gaccgcttta gagcaacaca tcagggcaat cggccttgga 1440 aagctctctc cccaagagag agccgacctg atcaagagag gaggatgctt ccgttgccga 1500 aaggacggac atcgctcctg ggaatgcccc aataacacca ccaacaatcg ctctgcaggg 1560 accagcaacc agaacaccag tagccgtcaa aactggtctt tccgcccaaa gaatggcaag 1620 gatctctaca ccaacatccg ggctgcaatc 
caagaagcag gagaggaagt cgttgaaaga 1680 attttacaag gaactcgaaa ccaaccccag ggtttttaga atggagacct gcatcgacgt 1740 cagtgtctcc ctatttatcg attcattctg ccttagtagc ccgaaagaac aataaatgct 1800 tatttattcc aatgcatctt gagggcgaaa cggagagtgg aaagataatc gtcgacaccg 1860 aaggaatgat agattctgga gctggaggga aattcatcga ccaaaatttc gcccgaaaca 1920 agagactccc gctccaactc cttccaaccc ctctgaaagt ctacaacgta gacggaaccc 1980 tgaataaaca aggaaccatc cgatactcga ccaccatgaa cgtcactatc cacggacaca 2040 cacaacccat taccttctac atgaccggcc ttggaaaaca gaaggtcatc atgggttttc 2100 cctggcttga ggaagccaac cccatcatcg actggaggaa gggtaccctc gagtggcgac 2160 agagccctcc gtcaacaccg ccgatcactc ctaccaagcc atacacccag atcttcaaga 2220 ccactgtcga agaaatcccg gacaccgaac ctcagaactt agacccacta cctgattggt 2280 acccaaccac ctctcctatg atacccaagg atcactttga caccttcgaa acctacgtca 2340 atgagctgat ggaagggaac ttccattggg ataatgacga gggatctctc ttgaccgacg 2400 acctagtgat ctcattcatc aatggagaac tttctgacga aacggaagac atctggatca 2460 acaccaccct cacccactcg caggccttcg agctcaagta caaccagcga gaagaaatca 2520 aggacatcag gcaacatgta cctccggaat atcacgaata cctctctgtc ttcgacgaga 2580 aagcggcctc ccgattcccc gaatctcgtc cctgggacca caggatagat ctgaaaccag 2640 gattcgaacc aaggtcatcc aaggtctacc cgctcacacc ggaggaagag aaacttgtca 2700 aggaattcat cgatgacaac ctggccaaag gatacattcg accgtcggac tctccacaag 2760 catctggatt cttcttcgtc cccaagaaag acggaaagaa gcgcccatgt caagactatc 2820 gctacctgaa tgaatggacc gtcaagaacg cctatcccct tcccctggtt acagacctta 2880 tggacaaact gaaaggagcc aaatacttca ccaagctcga tgtcctgaaa ggttacaaca 2940 atatccgtat acgcgatgga gatcagtgga aagcagcatt caaaacgaaa tttggcttac 3000 acgaacctac ggtcatgttc ttcggcatgt gcaactcccc agcaaccttt caagcaatga 3060 tggaccacat attctctttg cagattggag ggaatttcgt catcatctac atggacgaca 3120 ttcttatctt ctccaagacc aaggaggaac atgaacgaca caccaaggaa gtccttcgga 3180 tcctccgcga taatgatctc tacgtcaaac ccgagaaatg tgaattcacc aaggaaaaga 3240 ttgcttacct cggattcatc atcgaacgag accgaatctc catggaccca gcaaaggtta 3300 aaggactcca agactggcct gcaccacaaa ccatcaagca 
agtcagatcc ttcctcggct 3360 tcggaaactt ctaccggaag ttcatacgga aatactccga catcgctcga ccgctcaacg 3420 aactccctca agaagaacca agcattcaag tggaccgatg aatgccaaga ggcctttgac 3480 accctgaaga aacgcttctg tgaagaacct gttctcatga tgcccgacat gtccagacca 3540 ttccagatcg aatctgacgc ctccaaatac gcctccggag cagtcttgac acagatggac 3600 tccaatggta accgccaccc agtcgccttc ttgtccaagt ccttctcccc tgccgaacgg 3660 aattacgaga tctacgacag agaactcctt ggaatcattc gagcacttga cgagtggcgc 3720 cactacatcc aaggaagccc ccatgaaact ctggtctatt ccgatcacct caacctcaca 3780 tatttccgat ctccgcagaa gctcaaacct cgacaagcac gttggcaatt gaaactgtca 3840 gagagcgcca tcaagttgtc acatctaccc ggacacaaga tgatcctatc ggatgccctg 3900 tcccgacgcc ccgacctgtg ccctgaagaa gaccacgata acgaaaacat caccctgcta 3960 ccggaagaac tattcgtcaa cctcgccgac ttgaaaaccc tcaccgtagg actggttgac 4020 attgctctac aagaagagat cgccaaatgc cagaatttcg acacggaagc caccgacgcc 4080 ctgaagatca tcttatccga tggaccaaaa gccatgaagg acgatctaca agattggaca 4140 gtagaagaac gtgatggatt ccgactccta ttctacaagg ggagaaacta tatccctcag 4200 gacctagacc ttcgacgcaa gatcctacat cgataccacg attcgcctaa cggcaggaca 4260 cccaggagaa ctagaaacct tcaataacgt cagccaacac tattggtggc caggaatcag 4320 aacatttgtt aagaactacg tcaaaggatg tggtgtatgt cagcaattca agatcaaccg 4380 taacccatca aaacccgctc tcatgcccat ccccaccggc caaatcactc cgaccattct 4440 cccagatctc agccgacttc attaccgacc tccctgaatc taacggatac aactctatct 4500 tgtccgtggt ggaccacggg tttacaaagg gggtaatttt gatcccatgc accaaagaga 4560 tcacggcaga gcagaccgca caactccttg tcaacaacct cttcaaacgc ctttgtctcc 4620 cagagaagat gatctccgac cgcggccccc agtttgctgc tcaggttttc cgacacttcc 4680 tccaacttct cgagatggaa tccgcactgt ccaccgctca ccatccccaa actgatggca 4740 ccacggaacg attcaatcaa gaaatcgaga cctggttgtc aatctattgt cttgcctttc 4800 cacaagactg gtccgaacac cttggcatcc tagaatttac ccacaacaat cgaagacact 4860 ccgatcgacc atatacccca ttcgaactca tgatgggaat agcacccaaa gcgatcccaa 4920 cctccttcga atacactcaa ttcccatctg cagaacaacg catcaaagct ctcgatcgaa 4980 taagacatga agcccttgcc gcacacacac tggcccagca aaggatggct 
caaaggatca 5040 agtccacctt caaacccttc aagaaaggcg aaaaggtgtg gttagaatcc aaaaacctca 5100 acctgggata ccacaagaag atcaaagcca aacgagaagg acccttcgag atcttagatg 5160 tgttaggacc gaccacctat cggctcaata tccccgatca ctggaaacga gtccgagttc 5220 atgacatgtt ccacgccacc ctcctcacgc catacgtcga gaatgaagtc cacggcccca 5280 acttccccag accacctccg gaagtggacg acatttacga aattgagcgc attctgaaac 5340 accgcaaaac ccgaagcgga ctggagtacc ttgtgctttg gaaaggttac tccgaagaag 5400 aagcaacgtg ggaagatgaa gacaacctca tgaccggagc aagcaagatc ctgacacaat 5460 acaagacgac tcacctgaaa gaaaaaccaa agaaccgcaa acaacgtcgg aaataaaagg 5520 tccaaccagt cagactcact cccccagcaa accacccatc tgctacttca cctcacaaat 5580 ccacttgccc ctcttcacta ctctcacacc tacttccctt tcgaaatgac caacatgatc 5640 atcgactccg gtgccgcctt caccttcgaa ttccccgtcc ccctccaccg aatccgtcac 5700 tgaaaccacc gccgtcgaaa catcaaccac tgaaaacccc cctgccaacg aacccactga 5760 cccctacaaa gggggtaact ccactttcat gacctgcgct gccgaactca tgccacctgc 5820 tcgccgaccc ccgcgattgg actaaccctc aagagatgtg caggatatgc gcatcacatc 5880 gtcctagacc gcctccgaga tctcgaccgc cacatggtcg agattcacag agccattctt 5940 cgcattcccc ctcaagaccg gcgcgtaggc tcccaaaccc gagtcactct catcaacctc 6000 cgccttttcc tttgggaaga gttccaagag aaatgcgaag gattcgtcta caaagccctt 6060 cacaaagcca tcgtccccgc cctcatcgac cttcgtgatc tttcggcacg caatgactgg 6120 acaggcagag aagaatacgc ctggatcacc tacactcccc cttacgagaa ttccccctac 6180 cactcgcctg ccagccgtct caagaccctg ttacaactgc aagcgtgacg ggcacgtcat 6240 ccgcgagtgc gggtttgaaa aggaacgacg aagggcccat catcgcaacc gctaccaacg 6300 atcgaaagcc gcccacgatt atgcaaggac gactcaagaa gccatctcta ggaggtccgc 6360 cgggatcaag attcccctca agttccgaca agccgatcaa ggaaagaagg tccgctttga 6420 cgaaaccccc gcaggaaaca acagttggga cgacagtctg atccccgact ggtcgacctc 6480 tggccccgac gctagcagct gggattgaag ggacttcaat catacaaagg gggtaa 6536 // ID Copia-57_MLP-LTR repbase; DNA; FNG; 536 BP. XX AC AECX01000342; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-57_MLP_; KW Copia-57_MLP-I; Copia-57_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-536 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000342; Positions 121830 121295. XX SQ Sequence 536 BP; 126 A; 120 C; 77 G; 213 T; 0 other; tgaagtataa gacttacata gttatttggt tatctaatct aagattcttt cctttctgat 60 tcttttctca tatcatatca atgtcatcat ttcttactaa acctatgata tcattgctgt 120 gcgtatgatg taatgctgtg ttatgctgtg acctttttgt tggtataaat tggtgttgac 180 ttcgtctact tgttttcctt ttcttcaccg atttctacct gtacttcgag caggtaacta 240 ctgctctcat cggtcataga gactatcttc tctaactcta gctaacttgg cattctatca 300 ggtcattgtc tttttatcgc tgttatgttg tttatttgct ttaatacaaa tcctaaccaa 360 tttctacctt tacttcgagc aggtaactac tgctctcatc ggtcatagag actatcttct 420 ctaaccctag ctgacttagc attctatcag gtaactactg ctctcatcgg tcatagagac 480 tatcttctct gactctagct aacttagcat tctatcagct tcataaccaa ctcgca 536 // ID Gypsy-1_PPM-I repbase; DNA; FNG; 6816 BP. XX AC ABWF01007905; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_PPM_; KW Gypsy-1_PPM-LTR; Gypsy-1_PPM-I. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-6816 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01007905; Positions 18338 25153. 
XX CC Positions [2906-3466] - Reverse transcriptase CC Positions [4565-5047] - Integrase core CC 'GCTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 471..1484 FT /product="Gypsy-1_PPM-I_3p" FT /translation="MATFTQEDIDQHIAVTLAAYQSQQSTANRPLRLDIPA FT PEPFSGKAEDLRRFIQCVLSYFVATNNTRLSNEAKIAFTVALMRKDLGKTW FT ADVYYEKSAEGVQVYSTWANFVATLEEVFPEHGTRIKAHQILMKLPERQRD FT RKTALSLGNYVTRFEQLASKAQLKDAEVNGINRVENDYHTLHANFIKGLPK FT ELYFALTTRVARDRPNTMKAWYNEVRNADAAKQGALVVTDTRDYGEPMDID FT AAAVASTFASTSGGRKWELGAVLNEADRKLHRDGNLCFYCHIKGHSAKDCR FT KKAATRQGGGRPNQGGSGKDDFRARIKTLSADEKRELYEELTMEDF" FT CDS join(1949..2872,2876..5635) FT /product="Gypsy-1_PPM-I_1p" FT /translation="MDLYQRRVAWLELKKAVTTPLPEPTEDEFIVISTSEH FT ATLPERKTSSAAGYDISAAHQLIIPARRRGLVGTDLIMAIPEGHYRRITPR FT SGLALKEIDVTAGVIDSDYRGKVSVLLVNNSDLPFTVKAGDRVAQLILERI FT STPSVTLTDHTDKTEHGEDGFGSTGIASIKEQAEDIDDDEFIISYLSGTLM FT ITKYEKQESLAVRDVPLKERLPKLSSGYKRLHNGMIWKVHHDIRRVTTSTE FT LVASNAQGKTEKTLPQMIPSYLHNLLPIFDKGTAARLPDHTEYDHEINLKE FT GFTDMKQQVYPLNPQQKELDKFLEENLSKGYIRPSKSPLASPFFFVGKKDG FT SLRPCQDYRLLNENTVKDRYPLPLISDLLDKLKHAKVFTKMDLHAGYNNVQ FT IKKGDEWKAAFIIPGTDGKPPQLYEPTVMFFGLCNSPATFQRMMNTIFADM FT LQEGWLCIYMDDILIFSQDKAEHEERTRHVIERLRQHDLYLKPEKCAFDVT FT EVEFLGMIIRPGEIAMDPVKLAGIADWPVPSNLRQVRAFVGFGNFYHRFIK FT GFSDKAQPLVALTRKDVAWTWGSDQQKAFETLKKAFLEEPVLRMPDPDRPF FT TIETDASKFATGAVLRQRNDNGDWHPCGYLSQSFNAAERNYDIYDRELYAV FT IRALDEWKHYLISSPYPVTILSDHKNLTYWKQAKDMNRRQARWASKLTEFD FT FHLVHVPGTQMVQSDALSRRPDHDDGSNDNTDVVMLPPEVWVAAIDIDTRD FT TILSFQDDPIMKEGLTAARTGGTPHTGTDWSIDDDLLLYQDRVYVPPDESL FT RKRLVSHYHDLPSAGHPGVQKTLLGLRRDYWWPSMAQFVAHYVNGCGICQQ FT MKVNTHPARPPITPIRPRANAKPFSTFSCDYITDLPPSNGFDSILVVVDHD FT LMKGVIFIPCNKTIDALGTADAMYREVYKWFGLPDKIISDRGPQFASKVFQ FT ELCKMTGIQLAMSTAFHPQTDGETERVNQELEIYLCCFTANDPNKWADHLH FT TAQFAHNSREHSTTKMSPFKLMMGYEPRGIPTTHPGSNTPQAAYRLKDLQN FT IREEAHAAMEMAQNILAKRVKRKGGSFRKGQLVWLDSRNLKLPYPSRKLAP FT RREGPFKVLDKIRTVAYKLDLPSTWQVHPVFHKWLLSPYSETEIHGPNYTR FT 
PPPDLIKGEDNYEVKAIIGHRENKRMRRREYRVRWKGYSTADDSWEPEANL FT TGTADDVLTEYKKRKNLS" XX SQ Sequence 6816 BP; 1892 A; 2087 C; 1604 G; 1233 T; 0 other; aaaggtcgaa ctttctaagc tacgttccta gttttaccta gtcgaacatc tcatccacca 60 ttacatcgaa caacgcagtc aatccggttc ctttgggact accagccctc gcaggatcac 120 tgctcctcca gtacgaccga gcagatcgct gctttttcac ctccacaaaa agggctccca 180 tctacccttg aggcggcacc tggtgtcgtg caaccggtcc aaacccacag gtcgtcgccc 240 atcaagaact tgttaactac taccagagac acccacccgc tcacccagaa gatgtattca 300 ccatcctacg aatcgacgtc gaacccatac aaacagcaga aagcgcacag tcacccacca 360 gcgaacaacc actcgaactc cctgaagttc agtacatccc catcgagatc ccagacaccg 420 aactaccacc agctccaccc gcaccaacca acgcaccggt tgaagtccct atggcaacat 480 tcactcaaga agacatcgat cagcacatcg ctgtcaccct cgcagcatac caatctcagc 540 agtcaacggc caaccgaccc cttcgcctcg acatccccgc tcccgaaccc ttcagcggaa 600 aggcagaaga cctcagacgc ttcatccaat gcgtcctctc ctacttcgtc gccaccaaca 660 acacccgact tagcaatgaa gcaaagatcg ccttcactgt agcgctgatg aggaaggatc 720 tggggaagac atgggcggac gtgtactacg agaagtcggc agagggagtc caggtctatt 780 ccacatgggc aaatttcgtt gccaccctag aagaggtgtt ccctgagcac ggaacgcgaa 840 ttaaggcaca tcaaatcctg atgaaactgc ctgaacgaca gagggatagg aagacggcac 900 tctccctcgg taactacgtc acccgcttcg agcagctggc atcgaaggct cagctcaagg 960 acgccgaggt caatggcatc aaccgcgttg agaacgacta ccacactctc cacgccaact 1020 tcatcaaagg actcccaaag gagctgtact ttgctctcac aacaagagtc gctagggatc 1080 gacccaacac catgaaggcc tggtacaatg aagttcgaaa cgccgacgca gccaagcagg 1140 gggctctcgt cgtcacagac accagggact atggcgaacc aatggatatc gacgccgctg 1200 ccgtcgcatc aaccttcgcc tccacatcgg gaggaaggaa atgggaacta ggagccgttc 1260 tcaatgaggc tgatcggaag ctccacaggg acgggaacct gtgcttctac tgccacatca 1320 agggtcacag tgccaaggac tgccgcaaga aagcagccac acgacaaggg ggtgggaggc 1380 cgaaccaggg aggatctggg aaggacgact tccgcgccag aatcaagacg ctctcagctg 1440 acgagaagcg ggagctgtat gaagaactta cgatggagga tttttgaagg ggcagggtga 1500 gccgacgcaa ccactgccca tatccgtttc actgtcccgt gtagttctag tagtccgaaa 1560 tgaacacgct ctgcacattc 
cgattactgt gggacaaaaa gacgtcgagg tgcgcgcttt 1620 catcgatagc ggcgcaacag gaatcttcat cagcccgaga ttcgtccaag agcatcgact 1680 tcagacagaa cctcttcccc gacccatccc catcttcaac gtggatggaa ctctgaacaa 1740 agaagcactc atcacgcgaa gagctctctt atcttacaag ataagcggag aaccccgatt 1800 tgtcaaagcc tacgtcgcca gcatcggcaa ggaagacatc atcttcggcc acacttggct 1860 caagctcgaa aaccccaaga tagactggaa aaccggacga gttgagctaa accaccggtg 1920 gacgaaagac gagaccaggg cctggaagat ggacctgtac caaaggcgag tggcttggct 1980 agaactgaag aaggccgtaa ctaccccact cccagaacca acggaggacg aattcatcgt 2040 catctccacc tcggagcacg caacccttcc cgaacgaaag actagcagcg cagcaggata 2100 tgacatctct gctgcgcatc aactcatcat ccctgcccga cgacgaggcc tggtcggcac 2160 cgatcttatc atggctatcc ccgaaggaca ctacagaaga atcacacctc gatctggttt 2220 agcactaaag gaaatcgatg tcactgcagg agtaatcgac tccgattacc gaggcaaggt 2280 ctcagtcctg ctggtcaaca actctgacct cccattcacc gtcaaagccg gcgaccgagt 2340 tgctcagctc atccttgaaa ggataagcac cccatcagta acactaaccg accacactga 2400 caaaacggaa cacggggagg atgggtttgg gtcaacggga attgcgtcca tcaaagaaca 2460 agcggaggac atcgacgacg atgagttcat catttcctac ttatcgggaa cactgatgat 2520 caccaagtac gagaaacagg aatccctagc agtaagggat gttccgctca aggaacgact 2580 ccccaaactc tcatccggct acaaacgact ccacaacggc atgatctgga aggtccacca 2640 cgacattagg cgagtcacaa catccaccga acttgtggcc tcgaacgccc aaggcaagac 2700 agaaaaaacg ctaccacaga tgataccgtc ctacctacac aatctcctcc ccatctttga 2760 caaaggcact gctgcgcgac ttccagacca caccgaatat gaccacgaaa tcaacctcaa 2820 ggagggcttc acggacatga agcaacaggt ctatccgctc aacccgcagc aatgaaagga 2880 actcgacaag ttcctcgagg agaacctgag taaagggtac attcgccctt cgaaatcccc 2940 attggcgagt ccattcttct ttgttggtaa aaaggacggt tcactacgtc cctgtcaaga 3000 ctaccggctg ctcaatgaga acacggtcaa ggatcgatac ccgctcccac ttatatcgga 3060 cctcctcgac aagttgaagc acgccaaggt gttcacgaag atggacctcc atgccggata 3120 caacaacgtc caaatcaaga aaggggacga atggaaggca gcgttcatca tacctggaac 3180 tgatggcaaa ccgccgcagc tgtacgaacc aacggtgatg ttctttgggc tgtgtaattc 3240 ccccgctacg ttccagcgga tgatgaacac 
tatattcgca gacatgcttc aagaaggctg 3300 gctttgcatc tacatggacg acatcctgat cttttcccag gacaaggcag aacatgagga 3360 acggacacga cacgttatcg aacgacttcg acaacacgac ctctatctca aaccggagaa 3420 atgcgccttt gacgtcactg aagtcgaatt cctcggcatg atcatccgtc caggagaaat 3480 tgcaatggac cctgtcaaac tagctggcat tgcagattgg ccggtgccgt ccaacctcag 3540 gcaagtacgc gccttcgttg gatttggaaa cttttaccac cggttcatca aaggcttctc 3600 ggacaaagcg caaccgctgg ttgctctcac ccgaaaggat gtcgcctgga cctggggatc 3660 tgaccaacag aaagccttcg agaccctcaa gaaggcattc ctagaagaac cagtactacg 3720 catgccggat cccgatcgac ccttcactat cgaaactgac gcctccaagt ttgcaaccgg 3780 agcagtacta cgccagcgca atgacaacgg cgactggcat ccttgcggat atctctctca 3840 aagcttcaat gcagcagaga gaaactacga catctacgac agggagttat atgcggtcat 3900 ccgcgcacta gatgagtgga agcactacct cattagcagc ccatacccag taacgattct 3960 cagcgaccac aagaacctca catattggaa acaagccaaa gacatgaacc gacgacaagc 4020 acgctgggct agtaaactga ccgagtttga cttccacctg gtacacgttc ctggtaccca 4080 aatggtgcag agcgacgctc tctcccgccg acccgaccat gacgatggct ctaatgacaa 4140 caccgacgtc gttatgctcc caccggaagt atgggtcgct gccatcgaca tcgacacccg 4200 cgacaccatc ctttcttttc aggacgaccc aataatgaag gaaggcctca ctgcggcgcg 4260 cactggagga acaccacaca ctggaacaga ctggtccatc gatgatgacc tcctcttata 4320 ccaggatcgg gtctatgtac ccccagatga gtctcttcgc aaacgtctcg tctcccacta 4380 tcacgacctc ccctccgcag gacatcccgg tgttcagaaa accctcctcg gcttgcgccg 4440 tgactactgg tggccgagca tggcccagtt tgttgctcac tatgttaatg gctgtggcat 4500 ctgccagcaa atgaaggtta acacgcatcc agcccgaccg cccatcaccc ccattcggcc 4560 ccgtgcaaat gccaagccct tcagcacatt ctcgtgcgat tacatcaccg acttgccacc 4620 gtccaatggc tttgattcca ttttggtcgt agtggaccac gaccttatga agggggtaat 4680 tttcatccca tgcaacaaaa caattgatgc tcttggaacg gcagacgcaa tgtaccgaga 4740 ggtctacaag tggtttggac tcccagacaa gatcatcagc gaccgaggac cgcaatttgc 4800 atccaaagtc ttccaggaac tttgcaaaat gactgggatt caattggcca tgagtacagc 4860 attccaccca cagaccgacg gcgagacaga acgagtgaac caagagctgg agatctacct 4920 ctgctgcttc actgccaatg atccaaataa gtgggctgac 
catcttcaca ccgcgcaatt 4980 tgctcacaac tcccgcgagc attccacgac aaagatgtcc ccattcaaac tcatgatggg 5040 atacgaacct cgtggaatac ccaccacaca ccccggatca aacacgccgc aagcagctta 5100 tcgcctcaag gacttgcaaa acatccgtga ggaagcacat gcggcaatgg agatggccca 5160 gaacatcctg gccaagagag taaaaaggaa ggggggttcc ttccgtaaag gtcagctggt 5220 atggctagac tcccggaact tgaagctgcc gtacccgtcc agaaaactag ccccgaggcg 5280 agaagggccg ttcaaagtcc tagacaagat cagaacggtc gcatacaagc tagacctccc 5340 tagcacctgg caagtccacc ccgttttcca caaatggctt ctatctccct actcagagac 5400 ggagatccat ggtccaaact acacgaggcc accccctgac cttatcaaag gcgaagataa 5460 ctacgaagtc aaagccatca tcggacaccg agagaacaag aggatgcggc gccgcgaata 5520 tcgcgtcagg tggaagggtt actcaaccgc ggatgactca tgggaacccg aagcaaacct 5580 gacagggact gcggacgacg tgctaacaga gtataagaaa aggaagaact tgagctgagt 5640 tgacagcagc tcaacccagt cgctacaccc ccactcgtac caacatgtcc tccaccctcc 5700 cgttcctcga ccagttcaac gccccctcaa ccgagggcgg aaagaggatt ttgatctaca 5760 cccctaagca cacccatgtc ggcaacagca ctttgctgac actacttctc agtaacccca 5820 ccgacgtctt caataaactc aaagcccaca accccgaggc aaccaatgcc accgaccgcg 5880 cagcactaga agcgtacctc tccgcccacc atgaatacga caaggctgtc aaggcagccg 5940 acaaggccat cgaccaccac aagcgactcc tacgtcaaca ggatgaccgt gtcctcaccg 6000 agctcattca actcgacaac ctgaaagttg cccatcgctt tcaaccgctc ctaccgcgca 6060 gcatccgggc acgacacaac aaattcatcc cacgcaccat ccccaatgcg tacctacccc 6120 tacccgcacc cctaccgacg tctgccttca ggcggccccc gatcccatcc cctttcctcc 6180 aagcaacgcc gcggagcact accatcccgg ctgactggca acccaaccct ggttggaccc 6240 caaagggaag ctgtagacga tgcggatcgt ctcgacactg ggtacgggac tgtccggaca 6300 cacgatgcgc aagatgcgga aaagaagccc caggacacct ggagcgagag tgcagaacca 6360 ggccgatgaa gcgacatgta tccgcaccac ctgaagagcc tgcatgacgc gtgggggtgg 6420 tcgtcgacaa cgtgttcgtt gaaggaatca tcaatgaggc aaaggagagg aaggaaagag 6480 agaggcagac gaaagcggta cccatccctc caccgcgcag tgccaaccct gaaccccagg 6540 acagtcccat cgcgggatcc tcacgccccc gccccgatac acccgtcgtc ttccgcaaag 6600 tcaaccccga ttggacaccc gatactactc aatggacgtg ggacagctcc 
tggccgcgcc 6660 aaaagcacct ctctggcgag gaatggaaga atgtggggag gaacgcacgc aatgagtggt 6720 tcgatgaaca ggaggatgac ggcgtagact gggagttgta tggagatggt gagcagtaag 6780 tagttgaatg ggacttcgac tctggcaggg gggtaa 6816 // ID Copia-39_MLP-I repbase; DNA; FNG; 5264 BP. XX AC AECX01001583; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-39_MLP_; KW Copia-39_MLP-LTR; Copia-39_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5264 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001583; Positions 6848 1585. XX CC Positions [1775-2275] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 8..2542 FT /product="Copia-39_MLP-I_1p" FT /translation="MSFRVPDDSDISFESAQSEPTISNNQSSSSDNQLIIA FT NTYATTIEPLEPETMNPEMETQVAGNQQQAASGVNNSQNLQQILNQQSITN FT APQTLHQTITNVFTITGSENTDSWIVRTKNCQVKLASLAALTADGSNFDVW FT DADIQEYLETMTGCVDYLHPAAIPSVDGWREDIARGINGLFHWTVDKHIAM FT RLKDHSPVPSARYAWLKSQFSGQTFAGRMSLLNEFTSIKYDPSSTTLDAFV FT NSASTLRRKLERTGMSIPEDVFAGILMNSVPKDFPNVSNTFEALIINDPRA FT TVNSSAMVRAIGAADVAHRHRNISVADGLSASIAKSPKARVLRCYYCKKTG FT HTKPDCPALKTKDKKRVSPIASTSDARTVEPQFADIDFSGQVDIRHIMVSQ FT LDTDITSSTIIFNTGATHHVFNDRSFFISIQNITPIPVRMANNNSSSFITG FT VGRVRICNVTNSDEFLEVNNVYFCENLRHGLLSGIQLEEDGITFKNSSKGI FT ELWKDNSRVLTAIKSGRKWVLKALSHPVETAAVIGDFMLWHRRLGHVNDRT FT LLKLIREKSCVGLPDKLTKTVACEDCAISKSTKTSTIGPSLISYDGPLQLV FT VADLCGPFPVKTISGSEYSLEIRDIYSTYMRTYLLKSKSDAANQIKIFIAE FT AERRTGKKVIHWRTDGGGEFVNNSLKSYFAEKGIKIQTSLPYFHEQNGSIE FT RANRTVQGGMRVLLRDSGLPRKFWGFAIITATDLHNRTPNSNTGAKTPIEL FT MFGFAPQVDHLRIFGSWAFVHVPVEKRRKLDDRAVKCRFVGYLEASKGWRF FT WNPVSEEFIDSAHARWLDEKTTDKIIPEPPVSHHDESSLNHIL" FT CDS 3734..5230 FT /product="Copia-39_MLP-I_2p" FT /translation="MHSEYATEWKEACAKELEMLIRLKVWEEVPLPPGQKA FT VSSKWVFAEKTDSHGNITKRKSRFVVRGFTQREGLDFSETFAPTAKFTSLM FT IIFSITVFHGWLIRGFDVVAAYPHSPIDETIYVKPAEGVPTSSPNLYLLLR FT KALYGTKQAARCWWKHFSSILAGMGCKYCINDQSLYVLHYKNNIALLWIHV FT DDGVICGSSIDIINFIHESLLKTFQITWNDNLEQIVGIKIDYKPDGIFLSQ FT PVLTASVLKSTGFTTLKTSTPMVANLQLDTLDKDATGINASSFLSILGSLS FT YLAIGTRPDISFSVNYLARFSSKPGDKHWLALKHLLRYISGTKHDGIYFRK FT RNEDMDLVTYCDANWGGEFSRSTHGFAVFLFGNIISWASRRQSCVATSTCH FT AEYMALGVASRESIWIQNLLEDIFKKTFITTLKCDNTAAIKVAQDLHLTKR FT SHHVAREFHYVNEQIFDGNLKITWIDGPNQKADILTKSLANVIFHIHKHNI FT CMS" XX SQ Sequence 5264 BP; 1515 A; 1038 C; 1090 G; 1621 T; 0 other; ataggttatg agctttaggg ttccagacga ctcagatatc tcctttgaat cagcgcaatc 60 cgaacctacc atctcaaaca atcaatcttc ttcttctgac aatcaactaa tcatcgcaaa 120 tacttacgcc actaccatcg aaccacttga acctgaaaca atgaatcccg agatggaaac 180 tcaagttgct ggaaatcaac aacaggctgc ttctggagtc aacaattccc agaatctgca 240 gcagatttta aatcaacaat cgattaccaa 
tgcaccacaa actctccatc aaacgattac 300 gaatgttttt actattaccg gtagtgaaaa caccgattca tggatcgtca ggactaaaaa 360 ctgccaagtc aagctcgcta gtcttgctgc attgactgca gatggttcga actttgacgt 420 atgggatgcc gacatccagg aatacctcga aactatgacc ggatgtgtcg attatcttca 480 tccggcagct attccttccg tcgatggttg gcgcgaagac atcgctaggg gaatcaacgg 540 actttttcac tggactgttg acaagcacat cgccatgcgt ctaaaggacc attcacctgt 600 gccttctgct cgttacgctt ggctcaaaag tcaattctcc ggccaaacct ttgccggaag 660 aatgtccttg ttaaatgaat ttacatccat caagtacgat ccatcttcta ctaccttgga 720 tgcgttcgtt aactcagcct ctactcttcg tcgcaaactc gaaagaaccg gcatgtctat 780 ccctgaagac gttttcgctg gtattctgat gaatagcgtt cctaaggatt ttcctaacgt 840 ttcaaacact ttcgaagctt taatcatcaa cgatccgcgt gctactgtca actcttcagc 900 tatggtcaga gccattggag ctgctgatgt ggctcatcgc catagaaata tctcggttgc 960 ggacggactc agcgcttcga ttgcgaaatc tcctaaggct agggtgttgc gttgttacta 1020 ctgtaagaag accggccata cgaaaccgga ttgtcctgcg ctcaagacta aggacaagaa 1080 gcgtgtttcc cctatcgctt ctacttcgga tgctcggact gttgaacctc agttcgcgga 1140 catcgatttc tcagggcaag tggatatccg tcacatcatg gtttcgcaac tggacactga 1200 tatcacttcc tctaccatca ttttcaacac cggcgcgact catcacgtgt ttaatgaccg 1260 ctcttttttc atttctattc aaaacattac ccctatacca gttagaatgg ccaataacaa 1320 ttcttcttca tttattactg gtgttggtcg tgttagaata tgcaatgtaa caaactctga 1380 tgagttttta gaagtcaaca acgtctattt ttgtgagaat ttgcgtcatg gtctactctc 1440 tggtattcaa ctggaggagg atggtattac ttttaaaaat tcttcaaaag gtattgaact 1500 ctggaaggat aattcaagag ttttaactgc tatcaaatca ggaaggaaat gggttttgaa 1560 ggctttatcc catcctgttg agacagctgc tgttatcggt gattttatgc tctggcatcg 1620 ccgtctgggt catgtaaatg acagaacctt attgaagttg attcgtgaaa aatcttgtgt 1680 tggattgcct gataagctga ctaaaactgt agcgtgtgaa gactgtgcaa tatccaaatc 1740 cacgaagact tcaacaatcg gtccctcttt aatttcttat gacggacctc ttcaacttgt 1800 agtagctgat ttgtgtggac cttttcccgt caaaactata tccggatctg agtattctct 1860 tgagatcagg gatatctact ccacgtatat gcgaacttat ctattgaaat cgaaaagtga 1920 tgcagctaat cagattaaaa tctttatcgc agaagctgaa cgacgtaccg 
gaaagaaagt 1980 cattcactgg agaacagacg gtggaggtga atttgtcaat aactctttga aatcgtattt 2040 tgctgagaag ggcatcaaga ttcaaacttc tcttccttac ttccatgaac agaacggctc 2100 gatagaacgt gcaaatcgaa cggttcaagg aggtatgcgt gtgttgcttc gagattcagg 2160 acttcctagg aaattctggg gttttgcgat tatcactgca actgatcttc acaatcggac 2220 accgaattca aacactggag caaaaactcc catcgagctc atgtttggtt ttgcaccaca 2280 ggttgatcat ctacgaattt tcggtagttg ggcttttgtc catgttccgg tcgaaaaacg 2340 tcgtaaactt gatgatcgtg cagttaagtg tcggtttgta ggatatctgg aggcaagcaa 2400 ggggtggaga ttttggaatc cggtcagtga agagtttatc gactcagccc acgcacgatg 2460 gcttgatgag aagacaaccg ataaaatcat ccctgaacct cccgtctctc atcatgatga 2520 atcttctttg aatcatatat tgtgagggtt gggtactcta cagaggactc aaggctcact 2580 aagtgggtac tggatagtag caaatgtttc aatagtctat ttttatattt caattatatg 2640 gtaattacaa cagaaatata agaaagatag agaaagaggg aaaaaggggg ggaactggtc 2700 tgtttggata aacccagaca gactacagtt aaatgactcc tgaaaaggag gtttttaatt 2760 ataataaata aatgttataa ttaatgaagg atggatccca aatgtccttc ctttaatctt 2820 ctaattaaca gattaaagag tgaaataaag gtcactacac tggttgttct taaagtcacc 2880 aacttatggt tgttcttaag gtcaccagcc tgaaagtaat tatatgtagg tcactagcct 2940 tgggttgttg gaaggtcacc aacctaaaag agttcctact tgggggaagt agaggatcca 3000 gaaagggtga aggtattact tacttttaat caaggaactg attaaagtaa tgaagttatc 3060 actagtctta tatttataag actcttgatg ttacttcagt agaactttaa ttgttcttaa 3120 aagtagtgtg taataattgc ttggggtttt ctgaaggctg agagaggagg ggtttatata 3180 ctctcctctc taaacctaac tgtagtgata ttttgttatt atcactacat tggttcaggg 3240 ttagtcatta acatatagta atgattgtta ctagtgatgc ttagttacat atacatgtta 3300 tgtgtgatat gtaatataag catttaactc ttgacactga gttatgaggt gtaaatggtg 3360 atttattacg tattacatgt tgttttacag gttcatgtat atgtaatgaa ggttttggta 3420 cattgtatta catgttacat acagtttata ttcagtatgt agcatgtaat tgatttatac 3480 atagggtttt gattgttaca ggttagtttt gggtacatgt atatgtactc agggctgaca 3540 tatattaact tcaaacaaca ctatccaaac tcatgaagaa cttcgtactc tatttgaatc 3600 tctttcaatt acttattcac ttgaagatcg cctttttacg gacactgtta ggcaacagga 
3660 tttgaacatt gatttattgc atgctaaggc cgctggattt gctcaacgat tacccagaag 3720 ctatcaagaa gccatgcata gcgaatatgc aactgaatgg aaggaagcgt gtgctaaaga 3780 gcttgagatg ttgataagat tgaaggtatg ggaagaagtt cctctaccac cgggacagaa 3840 ggctgtatcg agcaaatggg tgtttgctga gaaaacggac tcacatggta acatcacaaa 3900 acgcaaatct cgattcgttg tcaggggttt cactcagcgt gaaggactgg atttctcaga 3960 aacattcgca ccaacggcta aatttacatc cctcatgatc atcttctcca tcacggtttt 4020 tcatggatgg ctcattcggg gttttgacgt tgtggcagct tatccacata gtccgattga 4080 cgaaactatc tacgtcaaac cagctgaagg tgtaccgaca tcatctccca atctttatct 4140 cttacttcga aaagctttat atggtactaa gcaagcagct cgctgctggt ggaaacattt 4200 ctcttcaatc ttagcaggta tgggatgcaa atactgtatc aatgatcaga gtttatacgt 4260 tctacattac aaaaacaaca ttgcactatt atggattcat gttgatgatg gggttatttg 4320 cggctcttcg attgatatca tcaattttat tcacgaatct ttattgaaaa ctttccaaat 4380 cacttggaac gacaacttgg aacagatagt tgggatcaaa atcgactaca aacccgacgg 4440 tatcttcttg tcccaaccag tcttaaccgc cagtgttctc aaatcgactg gtttcaccac 4500 tttgaaaaca tctactccta tggttgcaaa tctccaattg gacactctgg ataaagatgc 4560 tacaggtatt aacgcatcat cgtttctatc aattttaggc tctttgtctt acctggctat 4620 aggtactcgt ccggacattt cattttctgt caattattta gctcgatttt cttcaaaacc 4680 gggggacaag cattggcttg ctttgaaaca tttgcttagg tatattagtg gtacaaaaca 4740 tgatggaatt tattttagaa aaaggaatga agatatggat ttggtcactt attgtgatgc 4800 gaattggggg ggtgagtttt ctaggtcaac tcatggtttc gctgtttttc tttttggtaa 4860 tattatttca tgggcctcga ggaggcagag ttgtgtagca acttctacct gtcatgcaga 4920 gtatatggct ttgggggtgg cttcaagaga atcaatatgg attcaaaact tattagaaga 4980 cattttcaag aaaactttca ttacgacttt gaaatgcgat aatacggccg ctatcaaggt 5040 tgctcaagat ttacatttga ccaaacgatc tcatcacgtt gcgcgagaat tccattacgt 5100 caacgaacaa atatttgatg ggaaccttaa gatcacctgg attgatggac ccaatcaaaa 5160 ggccgatatt cttacaaagt cacttgctaa tgtaattttt catattcata aacacaacat 5220 ctgtatgtca tagtagtgta aacctctaaa ggttgagggg gggg 5264 // ID Mariner4_AO repbase; DNA; FNG; 1892 BP. XX AC . XX DT 24-JAN-2006 (Rel. 
11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE It is a family of nonautonomous Mariner DNA transposons- a DE consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Mariner4_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-1892 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-1892 RA Kapitonov V.V. and Jurka J.; RT "Mariner4_AO, a family of Mariner DNA transposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 33-33 (2006). XX DR [2] (Consensus) XX CC This nonautonomous Mariner DNA transposon is 66% identical to CC Mariner3_AO. The TPAse-encoding CDS was completely destroyed by CC RIP-induced mutations. Given that 110-bp TIRs are well preserved, CC this family did multiply after massive changes by RIP. 
XX SQ Sequence 1892 BP; 738 A; 159 C; 180 G; 716 T; 99 other; gccgtctctt aagtcagcag acccacccgc ccttcggcgt gttggtctgc ctcatctcaa 60 cgcaaaatcc tatatcaacg haaaatttaa agttatttag agattttatc tactactata 120 atacctgatt tctattctaa tattgagaaa taaatcttag atatctgtat agctatttaa 180 agctaaaaaa aaccaaatat ttttgcgtta gcgtattaat ttaatatttt ttattattaa 240 ctctaagtat atatttaaga ccgagcgact taaagtatat attctatctt aactaaaact 300 tttaataata tataaaaaaa ggcattaatt tattagattt attagtttaa taatctatat 360 tatttttcta tagctagaat aattaagtag agtactaact agatattata ataaaatata 420 attaacggac tattttatat agttaataaa aattagatat attattttat taaacatctt 480 ttaagcgagt tttagcttat ttaataaaaa ctaaaagata agaattactt taatataaag 540 aaaataaata ttctttaata ttagtataat tatctagagt ctattattaa atagatttct 600 tttaaaaata tatataattt taataaaatc ggtttctaac taaaatagag aaaattttaa 660 aaaattataa ctataatatc tatacgagtt gtaaaagaaa atctatttaa aaaaataaaa 720 aaattaatct ttataataga gtatattata gtaaatagtt ttattttttt tctatatttt 780 atatttaaag gtgtatatta tttagagaga tagtataata tagatatttt ttataaatat 840 taaatagctt tatcttctaa gaactatatt ttaaataaga ttagtcttna ctagatttaa 900 tanttntatt attatataaa gtrcygtatc tytaaaaata agatttaatt attacttttt 960 aataghtata agtcttatht tatctataaa ttthtttagt tttatagatt atatcttant 1020 atttttcttt ttatataata tatyttrtat artctyttaa taaataatct ttctagrtct 1080 ataaatactt ctattataaa yrraataata agcttatttt aararacgct ragatarata 1140 ataaragtaa tttyttaaaa tagatttatt ctatttrtat agagactttt aaataatata 1200 taatctagta tatytttaaa aaataaaata tatatttttt aaatttarag ytarttttta 1260 aatctttaaa taagrcttta gaattagyty taaagcttta aataattata atacyattay 1320 taytattttt aaryttatyr ctattatcta taatataaga tytttgttra agtattarta 1380 aagyctarag ctttattaat artartctag aryttaatta aagctttata caacgtctaa 1440 accatatatt ctaragyttr ctagaaayta ytaaattagy ygcttarctt aaraataatc 1500 tataayaaya yctyyratat taaaagtctt aaaatagatr gaaatyayag agaagggtct 1560 rgtataatag actaytaayt atatataata taaaacgcta tattaytaat tatatagara 1620 tagaaaract atagratctt aratagatta 
gaaaaryggr aactttagaa tataataaat 1680 yactataata raaagatata grgratctgy ctagtatara gactagttaa atagataara 1740 aaggttctag attacyyttt taratarata ctcaaggcga tattrtatag raatctgaat 1800 aactttaaat tttgygttga catgggattt tgygttragg traggcarac caacacgccg 1860 aaggcgggtr ggkctgctga mtaagagacg gc 1892 // ID TY1B repbase; DNA; FNG; 50 BP. XX AC M24990; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 11-NOV-1996 (Rel. 1.1, Last updated, Version 1) XX DE S.cerevisiae Ty1 transposable element D15 DNA, segment 2. XX KW Copia; LTR Retrotransposon; Transposable Element; TY1B; KW Ty1 transposon; mobile element. XX OS Saccharomyces cerevisiae OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. XX RN [1] RP 1-50 RA Eibel H., Gafner J., Stotz A. and Philippsen P.; RT "Characterization of the yeast mobile element Ty1."; RL Cold Spring Harb. Symp. Quant. Biol 45, 609-617 (1981). XX DR GenBank; M24990; Positions 1 50. XX SQ Sequence 50 BP; 9 A; 5 C; 9 G; 27 T; 0 other; ttttccatag tatggaggct tttttttttc tgtgaatgta cgtatatttt 50 // ID Gypsy-15_MLP-I repbase; DNA; FNG; 6320 BP. XX AC AECX01001312; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_MLP_; KW Gypsy-15_MLP-LTR; Gypsy-15_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6320 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001312; Positions 14012 7693. XX CC Positions [4574-5053] - Integrase core CC 'CTGAC' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 668..5782 FT /product="Gypsy-15_MLP-I_1p" FT /translation="MSREEDSETIRALRSQVENLELRTAELDESRRETAEL FT QKLVRDLLAERSQARSPGIAAVTSEVDRSEADPPSGNNTFSRYVHGTPTSP FT SPLPRNQSGNQPAKVQQPPAETNRQGRTTTSYVPLQRVSWAEEIPEPQPTT FT RSQPTQSTPANFSSRPTYPSPLTTVYSPAIPPQPARPDRFEDTGQINNPTI FT VVPNTTTADRVKLSDLPKFSGKFGSPADLFHWRSLIEETFEIKNLVDDREK FT LKLLGSLLNNEEMSAWYQSNKGELKQQTWDGAMQMMAMGTLPYAWLTDTEN FT ALRRLEIKPGETFDLYVSRAQALHRLIKPFGQITDHHLAQHITWGAPQLVK FT DMIEREGHLHVIPFSFPKFKDTADGIRRFLINSRLLSDPSSKTKSSHATAN FT TNSTAPNPARNPPPARQRSEDERADNTWRYHEYLRRNGICSTCKEKCNNQN FT CTKKNMRFLSVPANFNPGPRPRRPGLPNNGPTPPGAPTQRPAGRPPVAIPT FT ARVAAIENFPDILSEDIAAYEEADRFQSTQIEEHADGQEGCVPSKSVAPII FT LELTCNGKPVRALIDSGAGTNLLSDCMASELRVPRRPLVSPVEVRLAIPTE FT GSPLVLREFAMANLKCENPSLRFGAVFFKLSPLSPSYDMILGSPFLSKFQL FT DVSLHRRCVLHTSSGKILYEKDMQSEMNRVLCAIQNLDKICKIQDMSEREL FT KIFEDFKDLFPAELPPVEEGEAPDETFPDGLPDVNRQVRHKIILTDPNVVI FT NERQYGYPRRHLDSWNKLIQQHLAAGRIRRSQSQYGSPSMIIPKKDPAELP FT RWVCDYRTLNKYTVKDRAPLPNVDEAVRLVATGKVWSVVDQTNSFFQDRMR FT EEDIPLTAVKTPWGLYEWVVMPMGLTNGPATHQARNEEILGELVGEICVVY FT IDDIVIFFQSVEEHEDHLRQVLERLRKAKLYCSIKKSKLFRREIKFLGHQI FT SEAGICPDDEKVEKISKWTSPTNSKQLLKFLGTVQWMKKFINGLSHYVGTL FT SPLTSTSLKGKPFQWGKAEESAFNNIKRLITTLPVLKNLDYDSNEPLWLFS FT DASGNGLGAALFQGLQWETASPIAYESRTMSPAEKNYPVHEQELLAVINAL FT HKWRLLLLGMRVNVMSDHHSLTTLLTQRNLSRRQARWLETLSEFDLDFKYL FT RGQDNSVADALSRREDVSVCEVRAGLREEELRQIQEGYARDPFCIKLRKVL FT PLREDCILKDGLMYLDGRLVIPSHGSLRKDFINQAHQALGHLGTLKTLTRL FT RPTFFWPGMANEVERQLKSCDSCQRNKARTTATAGRLQTSAVPSYPMQSIG FT IDFIGPFPKVQSFDMILSCTCRLTGFVRLIPASQRDTAERSASRLFAAWSS FT IFGLPESIIGDRDKAWTSRFWQRLHELLGVRIQLSTSYHPQADGRSERSNK FT TIGQILRHLAADKHGKWYQSLPSAEFAINSAVNVSTGVSPFHFVYGRIPRL FT FPIQSPSVDDNEEVTHWIERRDAEWATWRDRLWHNRVEQALHYNRRRREGL FT PFSVGDWVMVDSADRQQVVGGKGRPTGKLRARYDGPYEVQEVLNGGRDFKL FT DLPGNDSTHPIFHISKLKIYHSKSEPHDSIAAVSTVSLRRKTTPGRPMEEP FT LGQATKRNEGVCVKPDTLTPPQCECDTSLSFYNCDAVGKLCPLLNNSAHSH FT NDITIGAVARVTKDCSLINDFVTAENSLKEPGIGSL" XX SQ Sequence 6320 BP; 1778 A; 1573 C; 1457 G; 1512 T; 0 other; 
ctttttttaa cttgatcact tcgaaagtta aagttacacc accaatattt ttttctcact 60 atctcatttc atcgtatctg ctctgctttt cggaattgtc ggtcaatctg ctcagtcacc 120 ccacctgcca ccgtatctct cacggttttc tgctctgtca ccttctgcta cacctactct 180 gacttcgacg ccggaacaat cacccgaact cacgatctta ctagcgtatc ggatgtctgg 240 accgtcaacg ctcaaacctg acaactattt agacgatccc gagaccttac tcagacaagc 300 aagaaaagga aagaaagtgg actctagtct agaagagact tagagtccac ttgccggacg 360 acctccaaat ttttcctgga gacccaaagc tgttgcgtca ggatcaccgg ctacactgac 420 ggatcgtgtc gtttacgctt cgtcgtatta tcaaaaaagc ccagatcagc ctcaaacacg 480 tgaattctca ccaatcgaca tctcaccatc cgcacccgag aagacggtaa ttgacctgca 540 cttcaaaaaa ttatttagtc agtctcacac accgcatccg cctggatacc tcccgcaaga 600 cacaccactg caaaaaccac ctgctacaac cgcaacagga acccttacat cggatcccac 660 caacactatg tcgagggagg aagattctga aactatacgg gctctacgat ctcaagtcga 720 gaacttagaa ctgcgcacgg ccgaactgga cgaatcacga cgtgagacgg cagagctaca 780 aaagctggta cgagatctcc tagcggaacg aagtcaagct cgatcacccg gaattgctgc 840 agtgacgtct gaggtcgatc gctcagaagc agacccaccc agcggtaaca acactttcag 900 tcgttatgta catggtaccc caacgtctcc gtctccactg ccacgaaacc aatcaggaaa 960 tcagccggca aaggtacagc agccaccagc cgaaaccaac cgacaggggc gtaccaccac 1020 gagctacgta cctctgcaac gagtcagctg ggctgaggag atcccagagc cccagccgac 1080 tactcgctct cagcctaccc agtcaacacc tgcgaatttc agttcgagac cgacctaccc 1140 atcgccactg accactgtgt actcaccggc tataccacct cagcctgcga ggccagatcg 1200 cttcgaagac accggccaaa tcaacaatcc aacaatcgtc gtacccaaca ccaccacggc 1260 cgatcgtgtc aagctgagtg acttaccaaa gttcagcggg aagttcggta gccccgcgga 1320 cctgtttcac tggcgtagcc ttatcgaaga gacgttcgag atcaaaaacc tcgtggatga 1380 tcgggagaag ctgaagttgt tgggttcctt actgaacaac gaggaaatgt cagcctggta 1440 tcaatcgaac aagggagaat tgaagcagca aacctgggat ggagccatgc aaatgatggc 1500 catgggtacc ttaccttacg catggctgac ggataccgag aacgccttac gaagactcga 1560 gattaaaccg ggagaaacat tcgatcttta tgtatctcgt gctcaagctt tacaccgact 1620 gatcaaaccg ttcggtcaga tcacagacca ccatctcgct caacacatta cctggggtgc 1680 accccagtta gtcaaagaca 
tgatcgagag agaggggcac ttacatgtga tacctttctc 1740 tttcccgaaa ttcaaagata cggcggacgg catccgtcgc tttctgatca acagtcgatt 1800 actatcggat ccgtcgtcga aaactaagtc aagtcacgct acggccaata ctaactctac 1860 agcaccaaat cccgcccgta atcctcctcc agctagacaa cgatcagagg atgaacgggc 1920 agacaacacc tggcgatacc acgagtatct tcgacgcaac ggaatttgct cgacctgcaa 1980 agagaagtgc aataaccaga attgtacgaa gaagaacatg cgttttcttt cagtaccagc 2040 caatttcaac cctggcccta gacctcgtcg tccaggccta cccaataacg gacccacacc 2100 tccaggagca cctactcaac gccccgcagg aaggccaccg gtagcaatac ctacagcacg 2160 agtagcagca attgaaaatt tccctgacat cttatctgaa gacatcgctg catatgaaga 2220 agcggatcga ttccaatcaa ctcaaatcga agaacacgca gatggacagg aagggtgcgt 2280 accgtcgaaa tctgtcgctc caatcatctt agagctgaca tgtaatggga aacccgtgcg 2340 agctctcata gattcgggag cagggacgaa tctactgtcc gattgcatgg cgagtgaatt 2400 acgagtacct cgacggccat tggtctcacc tgtagaggta cgcctggcaa tccctacaga 2460 agggagccct cttgtcttgc gagagtttgc gatggctaat ctaaaatgcg aaaatcccag 2520 cttgcgtttt ggagcggttt tcttcaagtt atcaccgtta agcccatcgt acgacatgat 2580 tttaggttca ccattcttat caaagtttca attagatgta tccctccatc ggcgttgcgt 2640 gttacacaca tcgagtggca agattttgta tgagaaagac atgcaatcag aaatgaatag 2700 agttttatgt gctatccaaa atcttgacaa gatttgcaag atacaagata tgagtgagcg 2760 tgaattgaag atctttgaag attttaagga ccttttcccg gcagagcttc ctccagtgga 2820 ggagggtgaa gcaccggatg aaacattccc ggatggctta ccggatgtga acagacaagt 2880 tcgtcataaa atcatactta cggatccaaa tgttgtgatc aacgagaggc agtatggcta 2940 tcctcgaaga catttggatt cctggaataa attgattcag caacacttag ctgcaggccg 3000 cattcgaaga tctcagagcc agtacggctc accgtcgatg ataatcccca agaaggaccc 3060 agcagaatta cctcgctggg tctgtgatta tcgtactctg aataaataca cagtcaagga 3120 tcgggctcct ctacctaacg tggacgaagc agtccgctta gtggcaacag gaaaagtatg 3180 gtctgtggtc gatcaaacca attccttctt tcaggaccgg atgcgcgagg aagacatccc 3240 gctgacggcg gtgaaaacac cttggggtct gtacgaatgg gtcgtgatgc ccatggggtt 3300 gacaaatggg ccggctacgc accaagctcg aaatgaagaa atcctaggcg aattggttgg 3360 ggaaatctgt gtggtgtata ttgacgatat 
agtcatattt tttcagagcg ttgaagaaca 3420 cgaagatcac ttaagacaag ttttggaacg gctaagaaaa gcaaaactat actgctcaat 3480 caaaaagagt aaactgttca gacgagaaat caagttcctg ggacatcaaa tcagcgaggc 3540 aggtatatgc ccagatgatg agaaggtgga gaagatttca aaatggacat cgccaaccaa 3600 ctcaaaacaa ttgcttaagt ttttaggtac agtgcagtgg atgaagaaat tcatcaatgg 3660 attgtcacat tatgtgggaa ctttgtcacc attgacaagc acatctctga agggcaagcc 3720 ttttcaatgg ggtaaagctg aagaatctgc ttttaacaac atcaagagac tgatcacgac 3780 tttacctgta ctaaagaatc tcgactatga ctcaaatgag ccgctttggc tattttctga 3840 cgctagcggt aacggactcg gtgctgctct tttccaaggc ctgcaatggg agacagcttc 3900 acccattgcg tatgaaagcc ggaccatgag ccctgcggag aaaaactacc cagttcacga 3960 acaggaactc ttagcagtta tcaacgcact ccataaatgg agattactgc ttctgggaat 4020 gagggtgaac gtaatgtcgg accaccattc cctgacgact cttctgacgc agcggaacct 4080 cagccgacga caagcgcgat ggctggaaac cttatcagaa tttgacctgg acttcaagta 4140 tctacgaggt caagataatt ctgtggcgga tgcgctatcg aggcgtgaag atgtaagtgt 4200 ttgtgaagta cgagcaggat tgagggagga agagctgcga cagattcaag agggttacgc 4260 acgagatcct ttttgcatca agctccgtaa agtactgccc ttacgagaag actgcatcct 4320 caaagacggt ctcatgtacc ttgacggtcg tttggtaatc ccctctcacg gaagcttgcg 4380 caaggatttc atcaatcagg ctcatcaggc attaggacat ctgggaacgt taaagaccct 4440 cacacgactc agacccacct ttttctggcc tggtatggct aatgaagttg aaagacaact 4500 caagtcttgc gattcttgtc agcgcaataa ggctcgaacc acagctactg caggacgact 4560 gcaaacttca gcggtaccta gttatccaat gcaatcaatc ggcatcgatt tcattggacc 4620 tttcccaaag gtccaaagtt ttgacatgat tttatcatgc acatgtaggc tgacaggttt 4680 tgtgcgtctc atcccagcgt cgcaacgaga cacggcagaa cgctcagcaa gccggttgtt 4740 tgcggcatgg tcctcaatct ttgggttacc tgaatcgatc attggagacc gcgacaaagc 4800 gtggacctcc cgattctggc aacgcttaca tgaactgtta ggtgttcgga ttcaactgtc 4860 aacgtcctac cacccgcagg ctgatgggcg cagcgagcgc tcgaacaaga ccataggtca 4920 gatcctacgg catctcgcag ctgacaagca tgggaaatgg tatcagtcgt tgccttcggc 4980 cgaattcgcc attaactccg ccgtgaacgt gtcaactggt gtctcgccgt ttcacttcgt 5040 gtacggaaga ataccacgat tattcccaat tcaaagccca 
tcggtagacg acaatgaaga 5100 agtgacgcac tggattgaac gtcgtgatgc ggaatgggcg acatggcgag atagactttg 5160 gcacaaccga gtagaacaag cgttacacta taatcgaaga aggcgtgaag ggttgccatt 5220 tagtgtaggt gactgggtaa tggtggatag cgctgatcga caacaggtgg taggaggtaa 5280 gggacgtcca acgggcaaat tgcgagctcg ttacgacgga ccttacgagg ttcaagaggt 5340 gttgaatggg gggcgtgatt tcaaattaga ccttcctgga aatgactcga cacacccgat 5400 ttttcacatc tcaaagctta agatctacca cagcaaatct gaaccacatg actccattgc 5460 agccgtgagc acggtttcgt tgcgtcggaa aacaacccca ggacgtccga tggaggagcc 5520 tttggggcaa gctacgaaac gtaacgaggg ggtgtgtgtc aagcctgata cgctaacacc 5580 cccgcaatgt gagtgcgata cttcattatc tttctacaat tgtgacgcag ttggcaagct 5640 ttgtcctctc ctaaataaca gtgcacatag ccataacgac ataaccatcg gagcggtagc 5700 acgagtaacc aaggattgca gtcttataaa tgactttgtc acggccgaaa acagcctcaa 5760 ggaaccaggg attggaagcc tatgaggcaa gctaggacgt ttgacaaggg ggtgtgcact 5820 agattagtgc gttaacaccc ccgcaatgtg agtttgccat tggttcctat ttcatttcta 5880 tgtcacgaag aggctggcaa gcaaaatcat ctcctgcatt acagatttta cgaatacacg 5940 ataacaagtc agaggcttta caaggcctga tacacaagtt tttcaagatg accttttcct 6000 caaaaatttc atataattac ttcaagactc aagattcaca tcaagactca gaagaaaaaa 6060 aaaattatca ttattcacca aatctctcaa tatattcatc agtttctttt ttccttttct 6120 cttttttttt cttactttga tatcttttct gtttttgatt ttacttttgc ttctaattat 6180 caaaacaatt ttgctttcac tgggacgagt tatggtcaat tttttggggc gtgtcatgcc 6240 attttttttc tctctttctg ttttttttag ttgggctgtg tggttgccta gaaggaggct 6300 tgattttaag gaggggaggg 6320 // ID Copia-2_CDC-I repbase; DNA; FNG; 4623 BP. XX AC NC_012866; XX DT 06-FEB-2011 (Rel. 16.02, Created) DT 06-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Candida dubliniensis genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_CDC_; KW Copia-2_CDC-LTR; Copia-2_CDC-I. XX OS Candida dubliniensis OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. 
XX RN [1] RP 1-4623 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Candida dubliniensis genome."; RL Direct Submission to RU (06-FEB-2011). XX DR Genome; NC_012866; Positions 644417 639795. XX CC 'CAGC' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 2140..4590 FT /product="Copia-2_CDC-I_1p" FT /translation="MNSHIVVENESSIPDDDEYQTAIENNYITDTDTESES FT EHEENLPPDEFNTNGPSQIAVESVESNKTFENYHNRSESIEPKINKELSPD FT FNQTLEDINESMNHINEENLDEFDKSIMDDQNLDDGIESLHKELVISNNKI FT DYVEDLVNKRKRDNHLLPANKYIKNNDKTPGIIVSQNKSQAQILDELKPPK FT DKYSNAPFKSKRSKKGFKPPYVTRSGRKVQAPARYLNAIINKIDYNDPNWR FT RSMDEEIEKFKLKNVFEVVKIPPGVKPITTRWVHTYKEKDLKNKSFKSRCV FT VHGQKMVEDTHYNPYKISSPVVDLVSIRLLTIIAVEKGLSMHLLDISSAYL FT NADLEDNREIFIYPPKCYEVGKNHCWKLNKSLYGMRQSGYNWYNHFRRILI FT NNGMTQTLHNDGIFAKQFENGDVLYLTIYVDDVFIIGNTAKVIEDFVTMLE FT NHFELTYFGETTEYLGINFNRNGDTFTLDQKPFLEKLVDNFNILDSYGKNI FT PVIPNDINVVRKLRKSDEINDFIKIERPHHLINAITKTTYIEPDDDEKRIP FT DFESSPEAELLDAKGIKLYQSAVGLLLWATMNTRPDLSFTANVLGSKCSNP FT DTNDWKKLIYCLRYVKNHLDFKLEYKKGRLLKLENDFIIEVFSDASFAPEL FT DRRSITGMAIYVNGNLVNWATKRQKIITHSSAACEMLALNYSVLKAFDLRN FT TIRDIGLKVKNIHIHEDNKAVITILQNDNFHPHRPIDICYKFLRQKLNDGY FT FSISYVESKDNLADSFTKPLGMRKLNEHTRRIIKREDYDGETTLIVDVRKI FT KEIKQNNETNHHMEF" XX SQ Sequence 4623 BP; 1596 A; 741 C; 839 G; 1447 T; 0 other; acgccgacta tagaatcagt aatcgcgttg gttgtaacca tatcacgttg gtcgtcagct 60 acacctaaaa tcgttgtttg tggtacggaa tcatagtcat cgctaagtgt ttttccacga 120 acatcattgc ttaataggga ctctggagga ttaccctctc cagaaattaa aggttggtgt 180 agggttctgg tagtgctttt taccacttca ctactaggtg gctgatcaat tttattaatc 240 cctaaatttg aatgtcctag atccgtgttg tcaggtacga ttaattctac tggtcgacta 300 tcgtgaggag gttcatcaag aaaagcgtca ttaatgttat caaatgatgt tgttgatgtc 360 atatcattaa tttcatctaa atctgtattt tcttggcggg gctgatcaag tggaaagttg 420 atcaatggac taaggtaggt gtcggtaaat ttaattttgg aattttgttt catgtattgt 480 atataatcat tgataacact catggaggcg agtattttaa tattgcctgt aatgattact 540 ggataaccct tggttgacac taaaactttg aaagtggttg 
aatcttgagt ataaccaata 600 aatgcaccat atactaccat gggaaaagat ttttcttttg caaacggcaa tttcaatgta 660 gctgcctctt ggtatgaggt tagcttgata gcaacatcga caccaaattg cacataagta 720 atcttataac tagataaatt gtaataacgt tcaaatggag tttgagtctg attcttagca 780 accgtgtgat ttctaatgtg aaccagatat tcagctatat aatcaaataa aaacaatata 840 ttagcatcaa aattcaatac cgttttataa atgtttgaaa tgattggcct atttgtactt 900 tcggcaacac cattgagggc tggggtccca ttgggaataa catgatgttc aatacccatt 960 ggttcaagat catctttatt tggtagctca gctgcattgt cacttctaac actagttatg 1020 cggtgattaa atctgttgtt ccaaattttc aatttctcca taattcttgt tttgattcct 1080 ttgacatctg agacgattat ttcagtatag ctactgaact gatctataac agtacttata 1140 tattttttta gcacagtcgt acaaaacggt cccactgtgt cgatatgtag cctttggaga 1200 ctaggcacga cactggtttg tgggtgtttg ttacgactgg ccaattttcc gtgggcagca 1260 gcacataatg gacaatttaa aatggcagat tcagctgtct ttgaaccaac tgggttaatt 1320 tttcccgttt tctttaataa ttccaatttt tctaatgaaa ggtgattaat aagaaggtgg 1380 taataatata aatcttgaac catgtatttg tcagcaactt cctggggaat attgtggata 1440 ccaatataat ttaatttcaa tttactttct ctccccttca atgaggtaac taagtcttct 1500 ccgactagtt ttttctccat tgatggttag gctggtgttg tatttgatat atcaagtttt 1560 gccattgctt ctgtcaaaat cttgctttgc tttgaatctt tattcatcaa gtattttccg 1620 gaaactggtc cagagtataa gtcatctttt ttgcagtact ttgcaacaac tattggtgca 1680 tgggcattgt gtaatacgaa catgtgataa tcggaaatca atattgagaa acccaaatcc 1740 tcgaactgtt taatggatat caaatttaat ttgatttcgg ggatatataa cacttccttt 1800 aactttatat ctaagccatt aactgtaatc aataaagttc cttcagcttg tgcatatgaa 1860 gtttcacctt ctgcggttgt cacttcaata tttgagtcct taacttccga taatagtgtg 1920 atatcgttaa caactgtcaa taaatttgta aacaatttcc tcgaaatcat caggcttact 1980 tgaaatcatg ttcaaatatt caaaaccatc acctttgtag gtagcattct tgacctccat 2040 acaaatcttt caaacccacc acaaccacaa atcacttctt caggggagaa agaaataaat 2100 gaagaagaag taaatgatgc tttggtagaa gcacaacaaa tgaattcaca tatagttgtt 2160 gaaaatgaat catctatacc agacgatgac gaatatcaaa cagcaattga aaataactat 2220 ataactgata ctgatactga atctgaatct gaacatgaag aaaatctacc 
accagatgaa 2280 tttaacacaa atggtccatc acaaattgct gttgaatcag tggaatcaaa taaaactttt 2340 gagaactatc acaatagaag tgagagcatt gaacccaaaa ttaataaaga attgtcaccc 2400 gattttaatc aaacattaga agacatcaat gaatcaatga atcatattaa tgaagaaaat 2460 cttgatgagt ttgacaaatc aataatggat gaccagaatt tggatgatgg catagaatct 2520 ttgcataaag aattagtcat aagtaataat aagattgatt atgttgaaga tttggtaaac 2580 aaaagaaaac gggataacca tctgcttcca gcaaataaat atattaagaa taatgataaa 2640 accccaggaa taatagtgag tcagaataaa tctcaagccc agattttaga tgaattaaaa 2700 ccacccaaag ataaatacag taatgctcca tttaagagta aacgtagtaa aaaaggtttc 2760 aaaccacctt atgtgacaag gtccggtaga aaagttcaag caccagctag atatttaaat 2820 gcaatcatta ataaaattga ttataatgat ccaaattgga gacgtagtat ggatgaagaa 2880 attgaaaaat tcaagctgaa aaatgtattt gaggtagtca agattccacc aggagtaaaa 2940 cccatcacta cccggtgggt ccatacgtat aaagaaaaag atttaaaaaa taaatcattc 3000 aagtcgcgtt gcgttgtgca cggacaaaaa atggttgagg atactcatta taatccatat 3060 aaaattagta gtccagtggt tgatttggtg tctataaggt tgctaactat aattgctgtt 3120 gaaaagggtt taagtatgca cctgttggat ataagttctg cttatttgaa cgcggactta 3180 gaagacaaca gggaaatatt tatataccca ccaaaatgct atgaagttgg taaaaaccat 3240 tgttggaagc ttaacaaatc tctttatggg atgcgtcaaa gcggatacaa ttggtacaac 3300 cacttccgca gaattttaat aaacaatgga atgacccaaa cattacataa tgatggtatt 3360 tttgctaaac aattcgaaaa tggtgatgta ttatatttaa caatctatgt cgatgatgta 3420 tttattattg gtaacactgc taaggtaatt gaagattttg ttacaatgtt ggaaaaccat 3480 tttgaactca cctattttgg agagacaact gaatatctag gaattaactt taaccgaaat 3540 ggagatacat tcacattgga tcaaaaacct ttcttggaga aactagttga caatttcaat 3600 atattagaca gttatggaaa gaatattcct gtcattccga atgatattaa cgtggttaga 3660 aaattaagga aatctgacga aattaatgat tttataaaaa ttgaaagacc acaccactta 3720 attaatgcca tcacaaaaac aacatacatt gaacctgatg atgatgaaaa aaggattccg 3780 gattttgaaa gttcaccaga agcagaatta ttggacgcca aagggataaa attatatcaa 3840 tctgcagtcg gtctgttgct ttgggcaacc atgaacacca gacctgactt gtcgttcact 3900 gctaatgtat tgggatcgaa atgttcaaat ccagatacga atgattggaa aaaattaata 
3960 tattgtttga gatatgtaaa gaatcatcta gatttcaaat tggaatacaa aaagggcaga 4020 ttactgaaac tggaaaatga ttttatcatt gaggtatttt cagatgcttc gtttgcacca 4080 gagttagata gacggtctat cacaggtatg gctatctatg ttaatggaaa tttggtaaac 4140 tgggcaacga aacgtcaaaa aattataaca catagttcag cagcatgtga aatgcttgca 4200 ttaaattatt cagtattaaa ggcatttgat ttaagaaaca ctattcggga tatcggattg 4260 aaagtgaaga acatccatat ccatgaagat aataaagctg ttatcacaat cttgcagaat 4320 gataatttcc acccccaccg accaatcgat atttgttata aatttcttag gcaaaagtta 4380 aacgatggtt acttctcaat ttcatatgtg gaatctaaag acaatttggc agattcattc 4440 acaaaaccat tagggatgag gaaactaaat gaacacacga gaaggatcat taaaagggaa 4500 gattatgatg gtgagacaac gctgattgtc gatgtcagga agataaaaga gattaagcaa 4560 aataatgaaa ctaaccacca catggaattt taattgtttg ctaaatcagg ggagtgttcg 4620 ctg 4623 // ID Copia-3_MLP-I repbase; DNA; FNG; 4997 BP. XX AC AECX01001156; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_MLP_; KW Copia-3_MLP-LTR; Copia-3_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4997 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001156; Positions 139185 144181. XX CC Positions [1801-2049] - Integrase core CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 226..2049 FT /product="Copia-3_MLP-I_1p" FT /translation="MNPPDATDVKPKSQSSKSLPSNFKNTIPFPTKTIPTM FT TSDLKEFSTSSYSSVVKLSTGTFNDWKLKLTTLLSGQRLSKYILKNIPVPT FT DETELEDHETNSARALAAIHATIDSENFQVIRTCTNPKDAFDKLCEYHDDA FT GGLSTAHLFSDLMALRMQSDEDLNEHVNKFRKIYNDLLSNLASIPDFKISE FT PFITIILIKSLPSEFTPLVQSLLSNFKELTLARLFSLLKIEATRSASANKS FT DSALAARRNNESKPRSYRRKDLSNQPSNGLRCSLGHAGHTDQNCRTRRFRA FT FLEHEKKVAASASPSGNSSNVSQVSQHQTEPVEDVSYWDSAFSASTSFSTP FT IICDTGATSHMFSDISLISDLRPCRPVRISVASKDGVIWARSKGNVRFGSL FT TLRNVLYSPKLTGNLVSVGSLCDNGYRASFDKSIGTIVDSHGNEILQMRRN FT PSTNRLWTPIIPNDSISALFSYTDPEKLAMTWHRRLGHLHPDGVIHFLRRQ FT KLLPISKNNFVSCDACSMGKLTQSPATSPFHRSPKRLNLVHSDLLGPITPV FT SKSGFKYVLSFIDDHSRYASVYLLKSKDQTFSAFKQYKALMEKKCGEDPQT FT QVRSRRRVYL" FT CDS 2117..4972 FT /product="Copia-3_MLP-I_2p" FT /translation="MANSVSERYFRTLIGKTRAQLLESGLPLSLWGELVKY FT CCIQINCSPSAALQNQSPIEVLESLLPGHVHPFDVDRLKPFGCLCFAVDRH FT RKSKVAPVGKRFIFVGLEDGARAARLWDKEAGRIFITGDVLYREDVFPALH FT PTFSPDVAKELIIPSLTDQLVRTKGQDSTVVPDETMTEVIESTGSGPAKDL FT GWRSPIVIEEESDRSVSPPQFCPKVLIPLSESIHAPRVESNQPRVSTSPNP FT SPKSLSPIRSLPNQSEVQRLPSELGLQQYRSFSPPSSHSSATPIESPVSSA FT KSPEMHRSNSSSDLSSISNDVIEVVQSSPRGAESVRSSPKAGPSTLPKSPI FT SSPSPDRCIVPSVSQPPAVSSPPKTPPVQTTPSAISPPVEKQTEQIVQRPA FT TPPAPRRSLRTQKEPERYGFTKKTAKVARVIPNSFGLSATTGSDPDSPTFK FT QAMSGPDRNAWRQAMQQEFDSLTQHGVGKLVDRPPEANVLGGMWVFNRKRD FT EHNRVVKYKARWVVLGNHQIKGLDYNDTYASVGKLDSLRILLALSVKRRWR FT KRQFDVVTAFLNGNMKDLVFACQVRGFEHPTQPNRVWQLIKSLYGTKQAAR FT RWQQHFGATTSEFHLNPTDSDTAVYVLKCALGILILHLHVDDSMVFYDNDE FT LFFKFQAFIDSKYKLKWTEKPTLYLGIKLEIADDQSSIKISQSHYIESVLE FT RFSMVNCNSAKTPLPQRTTLLPGTNDEIEEAKDIPYQEMVGCLQWISAVTR FT PDISYAVSQLSKFNSAWTVSHWTAAKHLLRYLKGTQTLGITYSGGISEPLA FT YSDSDFSQCPTTRKSVTGYIVTVAGGAVSWKSQRQSIVALSTSEAEYVAAT FT ECSKHMAWVRAFYFDIIQQLEHPTIFHIDNTSAIFTASGEGLKSRSKHIDR FT RHHYIRQMIQSNELQVKYIPSDQMLADFLTKPLGPSAITHALKLNNLY" XX SQ Sequence 4997 BP; 1367 A; 1264 C; 984 G; 1382 T; 0 other; ggtgagtaac tcgtggttgt gggattcaga ttagattgaa tttattaagt ttgaatcata 60 ctgatccctg ctctgaaagg 
tttcacgaag attgtaatag gttatgagcc cagcgtaatt 120 cgtctgcggc aacctatcga tccatcacaa tctttcaaac ttaataatta tctaactttg 180 ttattccccg atcatgatgt tttactctgc ataatctcag actgaatgaa tccaccggac 240 gccacagacg tcaaacccaa atcccaatct tcgaaatctc ttccttccaa ttttaaaaac 300 accattccct tccccaccaa aacaattcca acgatgacgt ctgatctaaa agagttctca 360 acttcctctt actcatcagt ggtgaaactg tccaccggaa cgttcaacga ttggaaactc 420 aaactcacta ctctgttgag tggacaacgt ctgtccaaat acatactcaa gaacattcct 480 gtaccaactg atgaaactga actcgaagac catgagacca attcagctcg tgcgttggct 540 gctatacatg caacgatcga cagtgagaac ttccaggtca ttcgaacttg cactaatccg 600 aaggatgctt ttgacaaact ctgtgaatac cacgacgatg ctggaggact atccactgct 660 catctcttct ccgacttgat ggctctccga atgcaatccg acgaagatct taatgaacac 720 gtcaacaaat tcagaaagat ttataatgat ttactcagca atctagcctc catccctgat 780 ttcaagattt ctgaaccctt catcactata attctaatca agtctcttcc ttctgaattc 840 actccgcttg ttcaaagtct tctttcaaac ttcaaagaat tgacactagc aagactcttc 900 tcactactca agatcgaagc taccaggagt gcatcagcca acaaatcaga ttctgctctg 960 gcagcccgac gtaacaacga gtccaaaccg agatcataca ggaggaagga tctgtcaaat 1020 caaccatcca acggtcttcg atgttctcta ggacatgcag gacacactga ccagaactgt 1080 aggactagaa ggttccgagc tttcctggaa catgagaaga aagtcgcagc ctctgcctct 1140 ccatcaggta attcgtccaa cgtctctcaa gtctctcaac atcaaactga gccggtcgag 1200 gatgtgtctt actgggactc tgctttctct gcgtcgacct ccttcagtac gccaattatc 1260 tgtgatactg gagcgacaag tcatatgttt tcggatatct ctctaatttc tgatttacgt 1320 ccttgtcgtc ctgttaggat aagtgttgca tcgaaagacg gggttatctg ggcgaggagt 1380 aaggggaatg taagatttgg ctctctaact cttcgaaacg ttctttactc gccaaaattg 1440 actggaaacc ttgtgtcggt tggttcactt tgtgataatg gctaccgtgc ttcgttcgac 1500 aaatctattg gtaccatcgt agactcacac ggtaacgaga tccttcagat gcgacgcaac 1560 ccctcaacaa atcgtctatg gactccaatc atacccaatg actcaatctc cgctcttttc 1620 tcgtataccg atccagaaaa acttgcaatg acatggcatc gacggttggg tcatctacat 1680 ccggatgggg tcattcattt tctccgtcgt caaaaactac ttcccatatc taaaaacaat 1740 tttgtatctt gtgatgcctg ttccatggga aagctcactc 
aatctcctgc cacttctcct 1800 tttcatcgat cgccaaagcg tctcaatctt gtacatagcg atctcctggg tcctataact 1860 cctgtatcga aatctggttt caaatatgtc ctcagtttta tcgatgatca ctctcgatac 1920 gcgtctgtgt accttctgaa atctaaagac caaacttttt ctgctttcaa acaatacaaa 1980 gccctgatgg aaaagaaatg cggagaagat cctcaaactc aagtccgatc gaggaggaga 2040 gtatacctct aatgaattta tttcttttct caaagaagaa ggtattgagg aggagaaagg 2100 tccagctcat agacctatgg cgaactctgt gtcagaacga tatttccgta cactgatagg 2160 aaagactcga gctcagctgc ttgagagtgg tcttccactg tctctctggg gggagttggt 2220 caagtactgt tgtatacaga tcaactgttc tccctcggcg gcacttcaga atcaatcacc 2280 tattgaagtg ctagaatctc ttttacctgg tcacgttcac cctttcgacg ttgatcgctt 2340 aaagcccttt ggatgtcttt gctttgctgt ggatcgtcat cgcaagtcca aagtcgcccc 2400 ggttggaaaa cggttcattt tcgttggtct cgaggacggc gctcgtgcag cacgcctttg 2460 ggacaaagag gctggccgaa ttttcattac cggagatgtc ctatatcgcg aggacgtctt 2520 tcctgcctta catccaactt tttccccgga tgttgctaaa gaactcatta tccctagcct 2580 cactgatcaa ttagtgagga ccaaaggtca agactcaaca gttgttccag atgaaactat 2640 gactgaggtc atcgaatcta ctggctctgg tcctgcgaag gaccttggtt ggcggtctcc 2700 catcgttatc gaggaggaat cagaccgatc cgtttcacca ccacaatttt gtcctaaggt 2760 actcattcct ctatctgagt caatacatgc gccacgtgtc gaatccaatc aaccacgggt 2820 gtccacatct ccgaatccat cacccaaatc actttcaccg attcgttctc tgccaaatca 2880 gtcggaagtt cagagacttc cgtctgagtt gggtcttcaa caataccggt cattttcacc 2940 cccgtcatca cactcttcgg ctacaccaat tgaatcaccg gtttcgtcag ccaagagtcc 3000 ggaaatgcac aggtcaaact catcttcaga tctttcgtca atctcgaatg atgtcataga 3060 ggttgtccaa tcctctccaa ggggtgctga gtctgttcga tcatctccga aggccggacc 3120 cagtactttg cccaagtcgc cgatttcaag cccgtctcct gatcgatgca ttgtccccag 3180 tgtatctcaa ccgccggcag tttcatctcc gcccaagaca ccccctgtac aaacaacgcc 3240 atcagccata tctccacctg tcgagaaaca aaccgagcaa attgtacaac gacctgctac 3300 cccgcctgct ccgagaagat ctttacgaac tcaaaaagag cctgagcggt atggcttcac 3360 caagaaaaca gccaaggttg cacgagtcat accaaacagc tttggtctgt ctgccacgac 3420 tggcagtgat ccggatagcc cgacttttaa acaagccatg tctggtcctg 
acagaaatgc 3480 ttggcgacag gcgatgcagc aagaattcga ttctcttact caacatggag tcggtaaact 3540 tgtggatcga ccgccagagg cgaatgtact cggggggatg tgggtattca accgtaaacg 3600 agatgaacac aatcgagtcg tcaaatacaa ggctaggtgg gtagtcctcg gcaatcatca 3660 gataaaaggt ttagattaca acgacacata cgcgtccgtt ggcaagctcg attccttacg 3720 tatcctacta gccttatccg tgaagagacg ttggcgaaaa cgtcaatttg atgttgttac 3780 tgctttcctg aatggcaaca tgaaagattt agtatttgcg tgtcaagttc gtggtttcga 3840 acatccaact cagcctaatc gcgtttggca actcatcaaa tcgctctatg gaaccaagca 3900 agctgctcgt cggtggcaac agcatttcgg ggcaacaacg tcggagtttc atcttaaccc 3960 aactgattcc gacaccgctg tatacgtact caagtgtgct ttgggtatcc tcattctaca 4020 ccttcacgtt gatgattcaa tggttttcta tgacaacgat gaactatttt tcaaattcca 4080 agccttcatt gactcgaaat acaagttgaa atggaccgag aaacccacgt tatatctcgg 4140 tatcaagttg gagatcgctg acgatcaatc ctccatcaag atttctcaat cacattacat 4200 tgaatcagtc cttgaacgct tttccatggt aaactgcaat tcagccaaaa ctccgttacc 4260 tcaacgaaca actctgcttc caggtactaa cgacgaaatc gaggaagcaa aagacatacc 4320 ttatcaagaa atggttgggt gtcttcagtg gatttcggcg gtaactagac ccgatatttc 4380 ttatgctgtt tctcaattat ccaagttcaa ctcggcatgg actgtatcac attggaccgc 4440 agcaaaacac ctactccgat atctaaaagg aactcaaact ttaggcatta cgtactctgg 4500 ggggatttct gaacctttgg cttattctga ttctgatttt tcacagtgtc caactactcg 4560 aaagtccgta accggttaca ttgttacagt agcgggtggt gccgttagtt ggaagtctca 4620 acgacagtct attgttgctt tatccacttc agaagccgaa tacgtggctg ctactgaatg 4680 ttccaaacac atggcttggg tgcgagcatt ttatttcgat atcatacaac aactagaaca 4740 tcctacaatc tttcatatcg ataacacttc agctatattc acagcttcag gtgaaggatt 4800 aaaatcaaga tcaaaacaca ttgaccgacg acaccattat attcgtcaaa tgattcaatc 4860 aaatgaactt caagttaaat acattccttc agatcaaatg ttagctgact tcctcaccaa 4920 gcctctggga ccttcggcta taactcatgc tttaaaattg aacaacttat attgaaattt 4980 gtttggaaat gggggga 4997 // ID Copia-1_LENY-LTR repbase; DNA; FNG; 479 BP. XX AC AAPO01000012; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 
16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Lodderomyces elongisporus genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_LENY_; KW Copia-1_LENY-I; Copia-1_LENY-LTR. XX OS Lodderomyces elongisporus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Lodderomyces. XX RN [1] RP 1-479 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lodderomyces elongisporus RT genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; AAPO01000012; Positions 67362 66884. XX SQ Sequence 479 BP; 193 A; 68 C; 77 G; 141 T; 0 other; tgttgaagtt aattactgaa acaaaataat caaagtgacg agcatatcat tactaagtaa 60 ggatgtgtga gtgccaagac tattttagtt acccaacaat taattgtggg gtaactataa 120 taataataag ttgtgaccca acaaatacat cataaggtaa cttgatgttg taaccccaca 180 agttgagttg tggggtaaga gtaataatat ttaatgtgtt gaattgagga atcattcaac 240 tatataagga gatagaaata tcttcttatg attgcagttt acagtttaca gtttaaagaa 300 gtttaacaac caagaattta cagacaaaac aattaacaat taacaattaa caacaactat 360 tatatttaga taaaacagac tttaataata caaacaatga gtagtgaaat ggaacgtgag 420 gctcctgaaa ctactgtcac caatgaatta acacatgctc gtaatattac ctctcaaca 479 // ID Gypsy-6_LENY-LTR repbase; DNA; FNG; 430 BP. XX AC AAPO01000005; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Lodderomyces elongisporus genome: DE long terminal repeat. XX KW LTR Retrotransposon; Transposable Element; Gypsy-6_LENY_; KW Gypsy-6_LENY-I; Gypsy-6_LENY-LTR. XX OS Lodderomyces elongisporus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Lodderomyces. XX RN [1] RP 1-430 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lodderomyces elongisporus RT genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; AAPO01000005; Positions 10197 10626. 
XX SQ Sequence 430 BP; 130 A; 104 C; 75 G; 121 T; 0 other; tgtcacatac agggcatcta cgaggtgatt aagctcgcta ccaacccata ggtagctcct 60 ttactaacct cttgtagaac acctgaagga ccgtaccatc ttgtcatccc gagtttatcc 120 gcctctgggc atcagactag cgatctgcat atacgctagc caattagaga gccggtactt 180 tcccgagata gacattgtgc tacgtcttct gaggttgttg agaagagctt aaatagccac 240 caagatccca gttagataga cctcctcgat atatacttct agaggacaac tttagagtag 300 atagctttag atcaactaac tctcagtata ataacgttct aatgttatat ctttagttca 360 cacacaaaac taagtaatat cactattact gttcagacat aagcataccg aacactactt 420 atctctgaca 430 // ID Copia-54_MLP-I repbase; DNA; FNG; 4521 BP. XX AC AECX01000274; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-54_MLP_; KW Copia-54_MLP-LTR; Copia-54_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4521 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000274; Positions 56044 51524. XX CC Positions [1729-2229] - Integrase core CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(661..3432,3436..4497) FT /product="Copia-54_MLP-I_1p" FT /translation="MTTVKKALNDLSATGVKTDEEMISYFILHLLPDDLYN FT LKTMITHGAELMDQKLTVDSVLNNIQQYVNNKKVNINSFQSPSSAYTAQRN FT SSASSRQPSYPICSNGKHNPDTAHSADRCHQRFPRLKNKNKRSPANGAVNT FT AAVMSTFSAQEDFALPPVPPAITSFLLAAVQKKWDKVLNLLDSGASVTMFR FT DQSAFKTYSECSEDVYLADGNSIQAVGKGTVIVKGIDNVELSFTNVLHIPS FT LSSNLISLSQLFSKGFELMNSGRDAFEIKKAKQSVLSGVISNGIFVLNLSH FT GKSLALSSTKSILTDEATLLHRRLGHLNHRYMKMLFNSSVSSLSPCSTCTL FT SKHHCLPFSGKLPRPTAPLEVIHSDLSGRISPESIGGGRYYFKLTDGFSKY FT KHVYILKHKSETFSIFVQFKKLVEKQLGVPIKRLVNDNGGEYISEEFLTYL FT KNEGITSDFSAPYTPQQNPISERGNRTTTERARCMLIDTNLPKKFWGEAVS FT TAVYLENQCPEASIGYKTPHELWTGKIPSVDHLHIFGCAAYPLIPKQFRSS FT KFSPVSKKCILLGYQERLHNYCLWDYGSRKIIYSHDVIFNESDFTHSSLSS FT INTDYDVLTEDIPSLESSNSDPVMSDQEPPVPIMNNLSSPVQVSSSLDSIP FT SHGFTPAGSPPIEYTHIEDGDPITTSSSSDSTNVSSESENVDVVRNDSSIL FT IPPIPESSRWSYQPISQSAPRDISSSINPNNILSTRRRQAHLTRANPSTTP FT KTYKQAMRGDLANEWVEAMKVELDNMERRGVWSVCELPEGCTAVGTVWVYK FT TKTDVDGAFVKFKARLCAQGFSQVANVDYHETFAPTGCKTAFRVLLAVAGV FT NNFDVHGMDAIAAFLNGIPGETIYLRIPEGLHIDGATDKTVLKLNRALYGL FT KQSPRCWYKQLTDFFISIGFSSKADSCLFISSDDSEPCYVAVHVDDMLITG FT SKKGIASFKSAISKKFEMEDLGEATSILGMKISRNRHLKTISLSQEHYVET FT LLDYYGMSQCSPVSTPMIPNTCLSPASAESIAAFKASGLNYCQSVGSVNYL FT AQCCRPDLAFTCSQLAQHLENPGNEHWAAFHRLLRYLRGTSKYNLTYGSSS FT PAEGSYAHAKTENFADADWAGCINTRRSTTGYLFLIFGGPISWKSKKQPVV FT SLSTTEAEYKAVVEAGQELMWLQTLFTDLKVVTDPMIPLFNDNQGCISLAS FT NPIFRARTKHIEIQYHWIREKIEDKTFLLSYKPTAEMTADIFTKSLPRVLH FT NRCCVSMGLTSACLQIS" XX SQ Sequence 4521 BP; 1245 A; 1070 C; 842 G; 1364 T; 0 other; gcgcctatgt gaaatccgaa atccattaat cttcattctt taccattctc atgagtaacg 60 ataacttcga agatcttccc aacactccag catcacttcc agactttgag attcaactag 120 aatctgaatc aaacaactca gaatcacttc attcgtccag atcctctacg cccactggac 180 tcaattcaaa ctctacgttc aatcctagaa cccttattcc tttaaacaat ctagtaacac 240 ctactaccaa tagcaacatg tccaacctat tcaacgataa ctccaactcc aactccgagt 300 ctatttcgag tgttcctatt ttaactgaag acaacttctc tacctggcat cttaaagtcg 360 aatctttcat cattatgaaa 
aatcttgatg gaattattgg aggtagagaa gctgttccat 420 ctgatgatgc tcttagagct acttacgacc ttcgttgcaa gtatttagtg ggatttcttc 480 gtttaaaaac tcagtgatag gattgctgaa ttattaatca acgactccaa cagaagaaat 540 cctattgaat tatggaaaac cataataggt cacttcgcct ctactaaagc tcgaaatcgt 600 ggaagagtat tttctcaact ctttacgtta tcttgtgatg gatccgatct tcaaggcttt 660 atgaccaccg tcaagaaagc tctcaacgat ctgtcagcca ctggtgtcaa gaccgatgaa 720 gaaatgattt cttacttcat tttacatcta ctccctgacg atttgtacaa cttaaagacc 780 atgattactc atggagctga actaatggat caaaaactca ctgttgactc tgtgttgaac 840 aacattcaac aatacgtgaa caacaagaag gttaacatca attcgtttca atctccgtca 900 tctgcatata ctgctcagcg aaattcgtcc gcttcttctc gacaaccgtc ttatcctatt 960 tgttcgaatg gcaagcataa tcccgatacg gctcattctg ctgatagatg tcaccaacgt 1020 tttcctcgtt tgaagaataa gaacaaacga tctcctgcga acggtgctgt caacaccgct 1080 gcggttatgt ctactttctc tgctcaagaa gacttcgctt tgcctcctgt acctcccgcc 1140 atcacttcct ttctattggc tgctgtccaa aagaaatggg ataaggtcct gaacttatta 1200 gatagtggtg cctcagtcac catgtttaga gatcaatctg ctttcaagac ctattctgaa 1260 tgctcggaag atgtttatct agccgatggt aattctattc aagctgtagg aaaagggact 1320 gttattgtaa aaggaattga caatgtcgaa ctctcgttta caaatgtttt acacattcct 1380 tccttatcca gcaatctgat tagtctttct caactatttt ctaaaggatt cgaactcatg 1440 aattctggaa gagatgcttt tgaaatcaag aaggccaaac aatcagttct tagcggcgtc 1500 atctctaatg gcatttttgt cttgaatctc tctcatggca agtctctcgc tctatctagt 1560 acgaagtcca tcttgactga cgaagcaaca cttttgcaca gacgcctagg tcatctaaat 1620 catagataca tgaaaatgtt gtttaattcc tctgtatcct ctctttctcc atgctccacg 1680 tgtactctta gcaagcatca ttgtctgcct ttttcgggta aacttcctcg cccgactgct 1740 cctctagaag ttatccatag tgacttgagc ggtcgaatct ctcctgaatc cattggaggt 1800 ggacgttatt actttaaact tacagatgga ttttctaaat acaaacatgt atatattctt 1860 aaacataaat cagaaacttt ttcaatcttt gtgcaattta agaagcttgt cgaaaaacag 1920 ctaggagtac ctatcaagag attagtgaat gataacggtg gggaatacat ttcagaggaa 1980 ttcctaactt acttgaaaaa tgagggtatc acttcggact tctccgctcc atacaccccc 2040 caacaaaacc ccatctctga acgaggcaat cgcactacca 
ccgaaagggc tcggtgcatg 2100 ctcatcgaca cgaatctacc caagaaattt tggggtgaag ccgtatctac cgcggtttac 2160 cttgaaaatc aatgtcctga agctagcatt ggatataaga ccccacatga gctttggact 2220 ggaaagattc catctgttga tcacctacat atctttggct gtgcagcgta tcctttgatc 2280 cccaagcagt ttcgttcttc taagttcagc cctgtttcca agaagtgtat ccttttggga 2340 tatcaagaac gattacacaa ctattgtctc tgggattacg gttccagaaa aataatttat 2400 tcccatgatg tcatcttcaa cgaatcggac tttactcatt catcactctc cagtataaat 2460 accgactacg atgtcctaac tgaggacatt cctagtctcg agtcatctaa ttctgatcca 2520 gtcatgtctg atcaagaacc tcctgttccg atcatgaata atctatcatc gcctgtacaa 2580 gtttcttcct cgcttgattc cataccatca catggtttta ctcctgccgg cagtcctccg 2640 attgaataca cccatatcga ggacggtgat ccaataacta ctagttctag tagtgattct 2700 acgaatgtgt cttcagagtc tgaaaacgtt gacgttgtaa gaaatgactc ctctatttta 2760 atcccaccaa tccctgaaag ctctcgttgg tcctatcagc caatctcgca gtctgcccct 2820 cgtgacatct cctcttctat aaatcccaac aatatcctca gcactcgtcg tcgacaagct 2880 catttaacca gagctaaccc atcaaccact cctaaaactt acaaacaggc tatgcgtggt 2940 gatcttgcga acgaatgggt cgaagccatg aaagtcgagc ttgacaacat ggagcgacgt 3000 ggtgtttgga gcgtgtgcga gctgcctgaa ggttgcactg ctgtcggaac cgtttgggtt 3060 tacaaaacga agacagacgt agatggagcc tttgttaaat tcaaagcgcg actttgtgca 3120 caaggattct ctcaagttgc aaatgtcgac tatcacgaaa catttgctcc caccggttgt 3180 aagactgctt ttcgtgtatt acttgctgta gctggggtta acaactttga tgttcatggc 3240 atggacgcca tcgcagcctt tctcaacggc attcctggtg aaactattta cttacgaatt 3300 cctgaaggtc tacatattga tggagcgaca gacaaaacgg ttcttaagct gaatcgggca 3360 ctctacggtc ttaagcaatc gccaaggtgc tggtataaac aattgacaga ttttttcatc 3420 tccattggtt tttgatcatc taaagccgac tcctgtcttt ttatcagctc cgatgattca 3480 gaaccttgtt acgttgcggt acatgtcgac gatatgttaa tcaccggtag caagaaaggt 3540 atcgcctcct tcaaatcggc tatctctaag aagtttgaaa tggaggatct tggtgaggcc 3600 acttctatct tagggatgaa aatctcgagg aatcgacacc taaaaaccat ttctctttct 3660 caagaacatt acgttgagac tcttctggac tactacggaa tgtctcaatg ctcccctgtt 3720 tctactccta tgattccgaa cacttgtctt tctccggcaa gcgcggaatc 
cattgctgcc 3780 tttaaggcat ccggcctcaa ttattgtcag tctgttggtt cggtgaacta ccttgctcaa 3840 tgctgcagac ccgatctagc gtttacttgt tctcaactag ctcaacacct ggagaatcct 3900 ggtaatgaac attgggctgc ttttcatcga cttctccgtt atcttagagg tacctcgaag 3960 tataacttga cgtacggctc tagttctcca gcagaaggtt cctatgctca tgccaagact 4020 gaaaactttg ctgacgctga ttgggccggc tgcatcaaca cacgtcgatc tactaccggg 4080 tacttatttc taatctttgg aggtccaatt agctggaaat ctaaaaaaca accggttgtt 4140 tctctttcta ctacagaagc ggaatacaag gcagtcgtgg aggcgggtca ggagctcatg 4200 tggttgcaga ctctgttcac ggatctgaaa gttgtcaccg atccaatgat tcctttattc 4260 aatgacaatc aaggttgcat ttcattagct tctaatccga tctttcgtgc tcgaacgaaa 4320 cacatcgaaa ttcagtatca ttggattcga gagaagatag aagataaaac gtttttactt 4380 tcatataaac ctacagcaga aatgactgct gatattttca ctaagtcttt acccagagtt 4440 ttgcacaata ggtgctgtgt tagtatgggt ttgacttctg catgtttgca aatctcatag 4500 atatcgaagt agggggggta t 4521 // ID FOXY repbase; DNA; FNG; 485 BP. XX AC . XX DT 03-JUN-2005 (Rel. 10.05, Created) DT 03-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE F. oxysporum putative SINE element - a consensus. XX KW short interspersed element; FOXY. XX OS Fusarium oxysporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; OC mitosporic Hypocreales; Fusarium; OC Fusarium oxysporum species complex. XX RN [1] RA Mes J.J., Haring M.A. and Cornelissen B.J.; RT "Foxy: an active family of short interspersed nuclear elements RT from Fusarium oxysporum."; RL Mol Gen Genet 263(2), 271-280 (2000). XX RN [2] RP 1-485 RA Gentles A. and Jurka J.; RT "F. oxysporum putative SINE."; RL Direct Submission to Repbase Update (01-JUN-2005). 
XX DR [2] (Consensus) XX SQ Sequence 485 BP; 117 A; 133 C; 125 G; 102 T; 8 other; ctgggtggct gtaacgaagc caaatctcct actggagcaa aagacgtgct gagctagcta 60 ggtacaggat accccgcttc cttcaccatc cagcatacca atggcacaaa aggygctgta 120 tttctcaagc ccgtcacccg tcagcgggtt cgggggcaat ctrtygtcga actttttcca 180 ccgaccagaa ttctctcacg catcgctaca tctgctgacg cagtccacaa tctccagtaa 240 ccaagttgta gctcgtagtc gactgtcgyg gcggcaggga aaaacctcgg gaaggaatca 300 gaggccgacg gtgcggcttg ccttcaaagg agaaaaaccc atcgcaatcg gcgtcgttga 360 tcggraggat gtcgctacsr atgccmgggg catcgtcctc cgatatctaa ggcgttctgg 420 ttctgaggtg ccgccattag atccgacaaa ggtgttgtca gagaaggcga actctccaaa 480 attcc 485 // ID Gypsy-1_LENY-I repbase; DNA; FNG; 7340 BP. XX AC AAPO01000065; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Lodderomyces elongisporus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_LENY_; KW Gypsy-1_LENY-LTR; Gypsy-1_LENY-I. XX OS Lodderomyces elongisporus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Lodderomyces. XX RN [1] RP 1-7340 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lodderomyces elongisporus RT genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; AAPO01000065; Positions 80046 72707. XX CC Positions [6295-6795] - Integrase core CC 'CTTCC' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1352..2725 FT /product="Gypsy-1_LENY-I_3p" FT /translation="MYAKKDNQAKNTTGPTVPSLQSIETTSTSAASTATTD FT TTKSTKPDGKDGKDGKEGEGGKEDQVSSLERRVSELTKLVGELSISMKTPT FT QTSQSPTQVSTTDEPGTQERDRSIDVTLLLSGISPKANAENHVMLGSPATY FT KGRKGSTENSNDGIEINGQEEACKLINEIRKEIVDNNNVENLSYIQNPSEK FT IERAFIAVHSTLATKVNKTPNDEMVLNRYDILKFPGPKSKDMEGFFHWFKR FT LIWYKHVNCIPDYLLRDDIISAAVLIHNQGLENVVRAACKKPDEIEYGPIK FT YQQIFLMWQRKNTQDASVDLVGEVNRLMKLGDDPMSTMITILIEIENYLSY FT NKLGDLPIITWVRLFKTLYDNQGHIMKQIFNQMDSRVKLRKLNDEGKLEDS FT YLNGELIRQCLLTIEASEYQELLGKKLRASYDDIWQATNATFNQKISMISF FT AKPNSKVRKRHKK" FT CDS join(2977..3804,3808..7155) FT /product="Gypsy-1_LENY-I_1p" FT /translation="MEANAVMTRSATKGGKQIATTRSGAYDPLKKTHDALK FT NSKLSKDLQKQAKVEKKNNHLFNKDENPQEKEDVDPNQSIRFIDEDLPMDE FT GGDDPDYVYEEPTKADLPFNPTTTSMEKELDKLAQEDKPDPEEEMKNFLES FT EPTIKDADRTLVDNQTKEEVSKKPENLDDEVEDDGIYDPIIGKFNATEHQP FT LDQDDEMEDLEEDLSKQNIGLLRRKRTSKYDITDKQSYYNQLIRKNITSSI FT PLGDILGICPDFRKFLHDSTKGTLVAREGPVKSSQLGSEAFEINCSNGKVS FT EELIYIPAQIQHRDILLKYDTALQVSLINSKTLEGLKVRWSTLLPPIKVIG FT VNRDEATLTKGVTIEILIHFVPLMARLLVHDATPVGQILLGLPFQTDHKLS FT VGYNDEDKRELRFKSAGIQYKFPVIHTENNNYQVGQYPIVHTHEVTVFGNL FT QTKLRSAFKDSIANKDDIEYFIQLCETVTDVFYEKGGDPGRLKPEVHPPVQ FT IKLRDKESFWRCKSIPLGIKRPYAVEILTDMLKNGQLEYSNAAYRNPWFLI FT PKKDGRYRMLIDLRELNKHVELEGGHPQSTDELTSELSGRLYNTLIDVQNA FT YFQVPLDPLTNDVTSFNSPLGLLKYAVLPQGYINLVSEFSSILQKILAPVA FT KDVLCFIDDIAICGPKVQDHNDQLMRTHLDKVYKVFELLSKAGLKINPEKL FT RVAVLDCNFLGYRITPAGKTIITSLVDALLNYPLPTTQKKMESFLGLVNYY FT RQLIVGYAELTTPLYNLVQEAKNHPKHQLVWDEKSKKHYQQIIRVLTSSPV FT LQPLDLHQIVTIHTDASLESWGGVLQNTDENGVTRLVLCYSGKFHASEKNY FT TIFEKELFSIYHTLYAIHPLLVGYTGVIYIYCDNKALVHVLDKPLENSHFV FT NRAYKWLNLIRSFNYHIIHIDGTCNVIADALSRCELESFQAENFAVKEAFR FT NFRESLDPTITLEANAVSRILNPNNSYKGIDLTAIRDYLSTSVIPNIYNNA FT QHHKKFVNRALEFYISNGVLFKRGKYGIISRQVVTTSDELHKIFISAHDKR FT GHLGIESCFNLLNCYLFVPNLYRRLKKYITSCVQCQKYGPVTRQRDPLYLN FT IPTGLFHTIVCDCVKIHGSVIVVARDEFLGWAEACILPELSGDAVADFIYT FT SFITRIGTFHQFKSDNGTEFVNQVLKRLLQVHNIKAIYSIPYHPQGNGMIE FT 
ASHKGIIRFMRLLPPNANLQHALDTALWVDRTTVRRRTGFTPQYLVYGFEG FT HSPLTALLRYTPKSTDYTETELFHFRFRQLYYKNQLIDSALDTQRLDRERQ FT KTNFDARYDTTVTLHAGDLVLVTDGDSKLPGKLDQRWAGPYKIRKILSRTY FT YLKSLSGIHILRKYTREMLKPFTKRDSTI" XX SQ Sequence 7340 BP; 2656 A; 1353 C; 1395 G; 1936 T; 0 other; atttggtggt ccctagcagg aaggttttta ccttccgaac aattctatac attatgccta 60 agccaacacc aactgaaagg atatccagca atgattagtg tcatagtgcg aaacagacaa 120 actgaggagt aacattgtcg cgaaaatgga atcgactcag tacaatatag acagagtacc 180 ctatctgaag gaacaagggg gatgtgttaa aactacacga atgtggagga ggactcccat 240 tgggtggaag tcccataaaa ttcaacgtgc actaaattcc atggaagagt ttaacttccg 300 ctaggtgaga gtcctaggaa taaaaattaa tttactaaat tccatggaag agtttaactt 360 ccgctaggtg agagtcctag gaataaaaat taatttacta aagtccatgg aagagtttaa 420 cttccgctag gtgagagtcc taggaataaa aattaattta ctaaagtcca tggaagagtt 480 taacttccgc taggtgagag tcctaggaat aaaaattaat ttactaaatt ccatggaaga 540 gtttaacttc cgctaggtga gagtcctagg aataaaatca tacgactaca aaattccatg 600 gaagagttaa acttccgcta ggtgagagtc ctaggaataa aatcatacca ctacaaaatt 660 ccatggaaga gttaacttcc gctaggtgag agtcctagaa tcgaaatcag acaatcaaaa 720 ttaaagagaa cagcaactac tactatgttg taatggtgta tcatacctaa atgtcggtga 780 aggtcggaat aagaaatttc aactaatcaa tcaccaagaa atgaacaata ggctaccgaa 840 aattaattaa ataaatttac cccacactaa tgtttcaaac actaatgaag gaagtcacgc 900 acagaacaag ttgtcaggga gacgatagga tgaacttgaa actcgtctaa ccttccaaga 960 tgacggtaaa ataaatactg aaattcacct atatgcaagg gggaacgcac atgacgtgat 1020 cgtctaacaa cctgaaactg acgagaatgt gtttacaagt gggtaaacgg aaacaataaa 1080 gtggagaata atatagtaga agagacctaa ctacgaaagg gtgaaaagaa agctatatgt 1140 tttccgatcc acagatatca agaatatagg aaaaccttct gaaaagaaaa tattcctttt 1200 tgaaaaatta taaagtcata acaatttctt tcttcaaaca cgtttcaaga ttagaagaaa 1260 ctacagaaaa acctgaacaa aaactaaaac tcacaaaagc aataccaaat aaaataaaca 1320 aaaaaaaaaa aaatttaatt aaaatctcaa catgtacgca aagaaagaca accaagctaa 1380 aaacactact ggcccaacag tcccatcact tcaaagtatc gaaacaacct ctacctctgc 1440 tgctagcacc gctaccactg atactaccaa gtcaactaaa 
ccagatggta aagatggtaa 1500 agatggtaaa gaaggtgaag gaggtaagga agatcaagtg agcagcttag aaaggagagt 1560 ttcagagtta acaaagttgg ttggtgagct aagcattagt atgaaaacac ctacccagac 1620 ctctcaaagt ccaactcaag ttagtacaac tgacgagcct ggaactcaag agagggacag 1680 atcaatcgac gtaacgctgt tactatctgg tatatctccc aaagcaaatg ctgaaaatca 1740 tgtcatgtta ggttctcctg ctacctataa gggtaggaaa ggcagtacag aaaattcgaa 1800 tgatggtatt gaaatcaatg gacaagaaga agcttgtaag cttatcaatg aaattagaaa 1860 agaaattgtt gataataaca atgttgagaa cttatcttac attcaaaacc cgagtgaaaa 1920 gattgagaga gcttttatcg cggtccatag cacattagca acaaaggtta ataaaacacc 1980 taatgatgag atggtattaa acagatatga tattctgaag ttccctggac ccaaatcaaa 2040 agacatggag ggtttctttc actggttcaa aagactcata tggtataagc atgtgaattg 2100 tattcctgac tatttattga gggatgatat aatctcagca gctgttctga ttcataatca 2160 agggttggaa aacgttgtac gtgctgcgtg taagaaacca gatgaaatcg aatatggacc 2220 tatcaagtat caacagattt tcctgatgtg gcaaagaaaa aacacccaag atgcttctgt 2280 tgatttagtc ggtgaagtaa ataggttgat gaaactggga gatgatccta tgagtaccat 2340 gatcactatt ctgatcgaaa ttgaaaacta cttatcctac aataaattag gtgatttgcc 2400 tattattact tgggtccgat tattcaaaac cctttatgac aaccaaggac atattatgaa 2460 gcaaatattc aatcagatgg atagcagagt caaattgaga aaattaaatg acgaaggaaa 2520 attagaggat tcatacctca atggagaact tatccgccaa tgcctcttaa cgattgaagc 2580 aagtgagtat caggaattac taggaaaaaa gttacgcgca tcgtatgatg acatatggca 2640 agctaccaac gctacattca accaaaagat atcaatgata agttttgcta aacctaactc 2700 caaagtaaga aaaagacaca aaaaataaat gcgatttttg tggaaaacag ggtcacatca 2760 tgaccatttg ttacaaagct caaaatgcag tgaagaacaa ggagataatc aagaaggatg 2820 gaaaactctg catgatggat ggtaaagagt tgcaaatagc tgctggtgaa actattatgc 2880 tgaagtatcc tgagttgttc aagaactttg cgaggcaacg cacgcaaacc acaaacaaaa 2940 cctaaagaag ctaatgtgtt aaacagtgaa tcatcgatgg aagcgaatgc tgtaatgacc 3000 agaagcgcca ctaaaggtgg aaaacagata gctacaacac ggtctggtgc ttatgatcct 3060 ttgaagaaaa cacacgatgc tttaaaaaac tcaaaactta gtaaagattt acaaaaacaa 3120 gcaaaagttg aaaaaaagaa taatcatcta ttcaacaaag acgaaaatcc 
acaagaaaaa 3180 gaagatgtgg atccaaacca atctataaga tttattgatg aggatctccc aatggacgaa 3240 ggtggggatg atccggatta tgtttacgaa gaaccaacta aggcagatct accatttaac 3300 ccaactacca cttccatgga gaaagaattg gataagttgg cacaagaaga caaaccagat 3360 ccagaggaag agatgaagaa tttccttgaa tcagaaccta ccatcaaaga tgctgacaga 3420 acattggttg acaaccaaac aaaagaagaa gtatcgaaga aaccagaaaa tttggatgat 3480 gaagtcgaag atgatggtat ttacgatcca ataattggga aatttaatgc tactgaacat 3540 caacccttgg atcaggatga tgaaatggaa gatttggaag aggatctgtc aaaacagaat 3600 attggacttt tacgccgtaa gagaacaagt aaatatgaca ttactgacaa acaatcatac 3660 tacaaccaat taatcagaaa gaacattacg agtagtatcc cattaggtga tatcttggga 3720 atctgtccag attttaggaa gttccttcat gattccacga aaggtacatt ggttgctaga 3780 gaaggtcctg tgaagtcatc acaatgacta ggatctgagg cttttgaaat aaactgctca 3840 aacggaaagg tctcagaaga acttatatac ataccagcac aaattcaaca tcgtgatatc 3900 ttattaaagt acgatacagc tctgcaagtc agtttaatca attcaaagac acttgaaggt 3960 ttaaaagtta gatggtctac actactgcca cccattaaag ttataggagt aaatcgagat 4020 gaagctacct taacaaaagg agtaacaatt gaaattttaa tccactttgt cccattgatg 4080 gcacgattgt tagtgcatga tgctactcct gtcgggcaga tattattagg tctaccattt 4140 caaactgatc ataaactaag tgttggttat aatgatgaag ataaaagaga attacggttc 4200 aaatcagctg gtatacaata caaatttcct gttattcaca cagaaaacaa caactatcag 4260 gtagggcagt atccgatagt acacactcat gaagtcacag tatttggtaa tctacaaaca 4320 aagttgagat ctgcatttaa agactcaatc gccaataaag acgacataga atactttatc 4380 caactatgtg agactgtcac tgacgtattt tatgagaaag gtggcgaccc tggtcggctc 4440 aagcctgagg tccacccacc agtgcaaatt aaacttagag ataaggaatc cttttggaga 4500 tgcaagtcaa tcccattagg aataaaaaga ccttatgcag tcgaaatctt aactgatatg 4560 ttgaagaatg gtcaattaga atatagtaat gctgcttacc gcaatccatg gttcctaatc 4620 cccaaaaagg atggacggta taggatgctc atagatttaa gagaactcaa caaacatgtt 4680 gaattagaag gcggacatcc acaatctact gacgaattga cttctgagtt aagcggacgg 4740 ctgtataaca cacttataga cgttcaaaac gcgtactttc aagtacccct tgatcctttg 4800 acaaatgatg taaccagttt taatagtcct ttaggcttat tgaaatatgc agtactacca 
4860 caaggttata tcaatctggt gagcgaattc agttccatac tacagaaaat actcgcacct 4920 gttgcaaaag acgttttatg tttcattgat gacattgcaa tatgcggacc aaaggtccaa 4980 gaccacaatg atcaactgat gcgaactcac ttagacaaag tttacaaggt atttgaactc 5040 ctatcaaagg ctggtttgaa gatcaacccg gaaaagttga gagtggcagt actggattgt 5100 aactttttag ggtatcgtat aacacctgct ggtaagacta ttattactag tctggttgat 5160 gctttattaa attatccatt acccactact caaaagaaaa tggagagctt tctcggcttg 5220 gtaaattact acagacaact tatagtagga tatgctgagt taactactcc actatataat 5280 cttgtacaag aagccaagaa tcacccaaaa caccaacttg tatgggatga gaagtcgaaa 5340 aaacattacc aacaaatcat tagagttttg actagcagtc cagttctcca acccctagat 5400 ttacatcaga ttgttacaat ccatacggat gcttccttag aatcctgggg tggagtttta 5460 caaaacactg atgaaaatgg tgtcacccga ttggtactgt gttattctgg aaagtttcat 5520 gcctctgaga aaaactatac aatctttgaa aaagaacttt tcagtatcta tcatacattg 5580 tacgcaatac accctttgct agttggttac actggagtca tatatatcta ttgtgataat 5640 aaggcgttag tacatgtact tgacaaacca ctcgaaaatt ctcactttgt gaatcgtgcc 5700 tacaaatggt taaatctaat acgttcattc aactaccaca tcatacacat cgatggtacc 5760 tgtaacgtta ttgcagatgc acttagtaga tgtgaactgg aatcatttca agctgaaaat 5820 tttgctgtta aagaagcctt ccgaaatttc cgagaaagcc tagatcctac tataacactc 5880 gaagctaacg cggtatctag gatattaaat cccaacaata gttataaagg tattgacctt 5940 actgctatcc gtgattattt aagtacaagt gtcattccaa atatttacaa caacgcacag 6000 caccacaaga aattcgtgaa tcgagcattg gaattctaca tatcgaacgg ggtcttattc 6060 aaacgtggta aatatggtat catttcacga caagtcgtta ctacatctga cgaattacac 6120 aaaatattca tttctgcaca tgacaaacga ggacaccttg gtatagaatc atgtttcaac 6180 ctcttaaatt gctacttatt tgtaccaaac ttatatcgca gattgaaaaa atatattacc 6240 tcatgtgtac aatgtcaaaa atatggacct gttactcgtc aacgtgatcc cctatatctc 6300 aacattccaa ctggcctctt tcacactata gtgtgtgatt gtgtcaagat tcatggttca 6360 gtgatagtgg ttgcccgtga tgaattcctg ggatgggcag aagcttgtat tttacctgaa 6420 ctctccggag atgcggtagc ggatttcatt tatacatcat ttattactcg gattggcaca 6480 ttccatcagt tcaagtcaga taacggtaca gaatttgtga atcaagtcct taaacgtcta 6540 
cttcaagtcc ataatattaa agcgatatat tcaataccct accatccaca aggaaatggc 6600 atgatcgaag catcacataa aggaataatt agatttatgc gactgctccc tcccaatgcg 6660 aaccttcaac atgccttgga cacagctttg tgggtagatc ggacaactgt acgtagacgg 6720 actgggttta cacctcaata cttagtgtac gggttcgaag ggcatagtcc ccttactgca 6780 ttacttcgct atacaccaaa gagcaccgat tatacagaaa ctgaattgtt tcattttcgt 6840 ttccgccaac tatactacaa gaaccaactg atcgattctg cattagatac acaacgtctt 6900 gacagagaac gacaaaagac aaatttcgat gcacgttatg atacaaccgt aacgttgcat 6960 gcaggtgatt tagtgttagt gacagatggc gattctaaac tccctggtaa gcttgatcaa 7020 cgatgggctg gtccttacaa aataaggaag attctgagtc gtacatatta cttaaaaagt 7080 cttagtggaa tccacattct gcggaagtat actcgagaaa tgttgaaacc atttaccaag 7140 agagactcga ctatttgatg ctatggtctc ttaatcacaa aagagaatat ataaattttt 7200 aacttccact tccaaggggg ggatggccca tatacatcaa aatcaagaat tacaaatttt 7260 ttgaaaataa caagtcatta taacttttaa cacaatattc gaatgaattt cttcttcatc 7320 tagtgttaaa ggggggaaga 7340 // ID Gypsy-115_MLP-LTR repbase; DNA; FNG; 115 BP. XX AC AECX01000719; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-115_MLP_; KW Gypsy-115_MLP-I; Gypsy-115_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-115 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000719; Positions 115238 115124. XX SQ Sequence 115 BP; 30 A; 26 C; 22 G; 37 T; 0 other; tggttttcct attgatttga aataataaaa gtcacttggg ttctagaaac cgtcccgctg 60 tccttgacag cttcccgttg caattcagtg aaggtttgag atcccaaacc ttaca 115 // ID Gypsy-7_MLP-I repbase; DNA; FNG; 5959 BP. XX AC AECX01001703; XX DT 22-APR-2011 (Rel. 
16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_MLP_; KW Gypsy-7_MLP-LTR; Gypsy-7_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5959 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001703; Positions 69707 75665. XX CC Positions [3215-3670] - Reverse transcriptase CC Positions [4766-5245] - Integrase core CC 'AAAGA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 362..5497 FT /product="Gypsy-7_MLP-I_1p" FT /translation="MLYVFIFGRVCGGSSRFFGYTQKLSTLGYFRDVIRGL FT SSSIRSLISWSLFQWILKGFFVQLILRHFGLQLICLTVMASRRSTVSGDGP FT VALSQDEQSFADRYVVPTLCALQTTLEERLDRSSLDTITLLRGEIAALQDE FT ISQLSTSFSSLSGVFQDCVRISDFDHRVGHRISDALSLMRAEAGATTATLR FT QESKAAHDDVVSQLNDQLDASRASLQISRVEPPPGTLSFDGEARHVDWFIT FT SIRDSVISNDSCFVDDSRRIRWVARHFKPNTSPQEWWISLLRENSREHKVP FT FNHNSSNLPFIIPNLLSLDAFFNHMADHFGDKYSSMTILKELQSLKQGRLS FT LVHFNAKFDSLAYQLTLDNIILRDYYQKALDPDILRRSIQHPDWVNVVTLK FT DLQSLAVLASKQVTDLGMVPSFSNQSNSQSRPASRPPPPPPRPAVIVPFDP FT TAMQVDVNAYQANPPSNPRPSTHETPTRCGIGFYRRLCQRNTRCYRCLKAF FT DGSHRGPDGRTFSCTNPGASSAEMDAFVLECQNAAPRSSQPISAVSSQAPT FT APAPNRVQRSTMNNPRLGVPRSTSQLSFHPSASHQPLPRTNIGFDQAPPQL FT RGMLSAFPPSADVASQGHVASSSGQPAHHVHDVSAMFHKFHNLNIPDSDDA FT SEFLSSQPLFGYVDNEVHLTEHVSAIDFKGEEGVDSRLILSVLLSCGTKFY FT PARALIDSGASRSFLDRGFVLRHHLKTKPRPIPISCVTFDGASSKDGLVTC FT EWDGHLFLQGLHEEHAFKSNVLLNVTTLGGYDVILGLDWLRIHDGWVGGTI FT PSLRLDKPIDVMGKEVLGKCPPLPFSPIFSSTSPVKNRFQNSSATLSSTPS FT TLSPPPFVIPPQFEKFANVFKKQAASLLPPHREGFDMAINLKPGAVPTFGG FT SYLLGSQEQVELKTYIDEQLAQGNIRPSSSSAASPIFFVKVPGKKNRPCVD FT YWSLNLITVRDSFPIPLMGDLLTRITGCRYFTKLDLKSAFNFIRIKEGDEW FT 
KTAFRTPWGLYECMVMPFGLANGPACFQRFIQSVLSEFLGVSCFVYIDDIL FT IFSKTYEDHTLHVSQVLQCLQDSNLVVSPDKCLFYATEVTFLGFVISVEGL FT KMDPKKLSTITDWPYPTNSKQLYCFLGFTNFYRRFIYHFSSISAPLTVLTR FT KGVNVVDGLTSSECISAFHHLLKAFTTAPLLSHFDFDRPRALQVDCSGFAM FT AAILSQPDSEGNLLPVSFFSRKLTPSERNWQVHDQELGVIVESFGEWRAWL FT TGTEQPVTVFSDHANLRYFMTSQKISPRQARWASYLTQFNFQILHTPGKIN FT PADPLSRRPDYEDFSSESSRMTLLKPYMIKEGVTVCSIDTTFSIPSNETRE FT LLTKSYDGVSEFVKPPPLFSFRGGLWWFRDRLYVPKPLRLQILSSFHDDPS FT MGHPGISHMLSMITRTFGWKSIREDVINFVKSCESCQRVKRSTQAQQGTLI FT PLAIPDWPWSMIGMDFIVKLPISAGFDSILVITDHFSKGAHFIPCKESMSA FT PQLASLFISQFFRYHGYPDKIVSDRGATFVSTFWKAVQMQLRIHPAPSTAF FT HPQTNGQTERTNQALESYLRHFITYRQEDWCEWLPMAEFCFNNTPSSSTKL FT SPFFSWQGFHPRANSFTELLKVPHADQFVKLLEATQLNLLVSLKHAKEQQA FT KYYNSHKRLGQEYSKGDLVWLSRQNITSTRPSRKLDYRRIGPF" XX SQ Sequence 5959 BP; 1388 A; 1292 C; 1261 G; 2018 T; 0 other; gtttgattca atcattatca attttttcgc aagtctcatt tttattgact ttcttttgcc 60 tagtctttgt tatttccttt ggggtttttt cgtttatttt ccatgtgctc tgtgaatcat 120 cgttacattt gtttatattg catattattt ttttcatctg tatatgttat attatctctg 180 tagaattttt ttcgcatgtt ggtttattgg tctatcattt ttcttctttt aagcttttac 240 gttttgacat agatacactt ttttccgttt ttggttaggc agtcgtgttg tgttttatta 300 tttctatgtt ttatatggtg tagtttcatg tttgttatgt tttctaggtt cctgtttttt 360 gatgttatat gttttcatat ttggtcgcgt atgtggtgga tcatctcggt tttttggtta 420 tactcaaaag ctttccacgt tggggtattt tcgtgacgtc attcgtggtt tatcgtcttc 480 aatccgttct cttatctctt ggtcgttatt tcaatggatt cttaaaggct tcttcgttca 540 attgatactt cgacattttg gtcttcaatt gatttgtctc acggtcatgg catctcgacg 600 ttccactgtt tcaggtgatg ggcctgttgc gctctcgcag gatgaacaat cttttgcgga 660 tcgttatgta gtacctactc tttgtgctct tcaaacaact ctggaagaac gtcttgaccg 720 ttcctcattg gatactatca ctctgttaag aggagaaatc gctgctcttc aagacgagat 780 atctcagctt tctacttctt tctcgtcgtt atcaggggtg tttcaagatt gtgtgcggat 840 cagtgatttt gatcaccgtg tgggtcatcg tatctcggat gcgctatctt tgatgcgtgc 900 ggaggctggt gccactacag ccacgcttcg tcaagaatcg aaggcagctc atgatgatgt 960 agtttctcag ctgaacgacc agttggatgc ttctcgtgct tctcttcaaa 
tatctcgtgt 1020 tgaaccacct cctggtaccc tttcttttga tggtgaagcg cgtcatgtcg attggtttat 1080 cacttctatt agagattccg ttatttccaa tgactcttgc tttgtggatg attctcgacg 1140 catcagatgg gtagctaggc attttaagcc gaacacctca cctcaagagt ggtggatcag 1200 tcttcttcgc gaaaactcta gagaacataa ggtccctttt aatcataact cgtctaatct 1260 tccttttatc attccgaatc ttttgtcatt ggatgctttt ttcaaccaca tggcggatca 1320 ctttggtgac aaatactctt ctatgacgat ccttaaggaa cttcagtctt tgaagcaagg 1380 aaggctttca ctggtacact ttaacgccaa gttcgattca ttagcttatc aactcactct 1440 tgataatatc attttaaggg attactatca aaaagcgtta gatcctgata ttttaagacg 1500 ttcgattcaa catccggact gggttaatgt ggtcactctt aaggatcttc aatcattggc 1560 tgtgctggct tcaaaacagg tgacggattt ggggatggtt ccttcttttt ccaatcagtc 1620 caactctcag tctaggccag ccagtcgtcc tcctccacct ccgccgcgac ctgcagtcat 1680 tgttcccttt gaccccactg ctatgcaagt agatgtcaac gcatatcaag ccaatccacc 1740 ttctaatcct cgaccttcga ctcatgagac acccacacga tgtggcattg gattctatcg 1800 acgactgtgc caacgtaaca ctcggtgtta taggtgtttg aaggcattcg acggatcaca 1860 tcgcggaccg gatggcagaa ccttttcttg tactaatcca ggcgcatcgt cggcagagat 1920 ggacgctttt gttttagaat gccagaatgc ggctcccagg tcttcccaac cgatctcggc 1980 ggtatcatct caagcaccga cagctcctgc tccgaatagg gttcaacgtt cgactatgaa 2040 caacccgcga ttaggggttc ctcgttcaac ttctcaactt tcatttcatc cttcagcgtc 2100 acatcagcct cttcctcgaa cgaacatcgg ttttgaccag gccccgcctc agttgcgcgg 2160 tatgctgtcg gctttccctc cttctgcaga cgttgcttca caagggcacg tcgcctcttc 2220 aagtggtcag cctgcacatc atgttcatga tgtatccgct atgtttcaca aattccacaa 2280 tctcaatatt cctgattctg atgatgcctc agaatttttg tcttctcaac ctttgtttgg 2340 ttatgtggac aacgaagttc atttgacaga gcatgtttcg gcgattgatt ttaaaggcga 2400 ggaaggagtt gattcacgtc ttattttatc agtgttgtta agctgtggca ctaagttcta 2460 tccagctaga gctttgattg actcaggcgc ttcccgtagt ttcttggacc gaggatttgt 2520 tctccgacat catctgaaga cgaagccccg gccaattcct atctcctgtg tgacttttga 2580 tggtgcaagc agtaaagatg gattggttac ttgtgaatgg gatggtcatc tttttttgca 2640 aggtttacat gaagaacatg cgttcaagtc caatgttttg ctcaacgtaa cgacgctagg 
2700 cggttatgac gtcattttag gcttggattg gttgcgaatc catgatggat gggttggtgg 2760 tacgattcct tcgctacgtc tggataaacc tatcgatgtg atgggaaaag aagtgcttgg 2820 taaatgtcca ccactccctt tttcacctat tttttcctcg acttctcctg tgaaaaatcg 2880 atttcagaac tcttcggcga ctttatcttc cactccttct actctttcac ctccaccttt 2940 tgtcattccg cctcagtttg agaaattcgc taatgttttc aaaaaacagg cggcatctct 3000 gttaccaccc catcgagaag ggtttgacat ggccattaat ctgaagccgg gggcggtgcc 3060 gacatttggt gggtcttatc tgttgggaag ccaagaacag gttgaactga agacttacat 3120 cgatgagcaa ttggcacagg gcaatatccg accttcctct tcttcggcgg cgtcacctat 3180 tttttttgtc aaggttcctg gaaagaagaa tcgaccttgt gtcgattatt ggtctttaaa 3240 tttgatcact gttcgagata gttttcccat tccgcttatg ggggatctat taacaaggat 3300 caccggttgt agatatttca ctaaactcga ccttaaatca gctttcaatt ttatacgtat 3360 caaagaaggg gatgaatgga aaacggcttt taggacgccg tgggggttgt atgagtgtat 3420 ggttatgcct tttggtcttg caaatggccc agcttgtttt caacggttta ttcagtctgt 3480 tttatctgag ttcttgggtg tatcatgctt tgtttacatc gatgacatcc taattttttc 3540 caagacttat gaagatcata cactccatgt ttctcaagtt ttgcaatgtc ttcaagattc 3600 caatttggtt gtttctcctg ataagtgtct cttctatgca acagaagtaa cctttttagg 3660 ttttgttatt tcagttgaag gattaaaaat ggaccctaag aaactgagca caataactga 3720 ctggccttat cccactaatt ccaagcagct ctattgtttt ttaggcttta ccaattttta 3780 taggcgtttc atttatcatt tctcttctat atctgctccg ttgacagtgt tgacacggaa 3840 aggagttaat gtcgtggatg gcttaacatc gtctgagtgt atttctgcgt ttcatcattt 3900 attgaaagct tttactactg ctccccttct ctctcatttt gattttgata gacctagggc 3960 attacaggtg gactgctcgg gctttgctat ggctgctatt ctttcccaac cagattctga 4020 aggcaatttg cttccggtat ccttcttttc tcgaaaactg actccttctg agcgaaactg 4080 gcaagtgcat gaccaggagc ttggagtgat tgttgagtct tttggggagt ggcgcgcgtg 4140 gttgactggg actgaacaac cggttactgt cttttcggat cacgcaaacc ttcgatattt 4200 catgacttca cagaagattt cacctcgaca agcaagatgg gcatcttatt tgactcagtt 4260 taattttcag attcttcata caccgggtaa aattaatccc gcggatcctc tctctcggcg 4320 accggactat gaagattttt cttcggagtc ttccaggatg accctcctca aaccatatat 4380 
gattaaagag ggggtcactg tgtgttctat agacaccacg ttttctattc cgtccaatga 4440 gacgagggaa cttctcacca aatcttatga cggtgtttcg gagtttgtta agccacctcc 4500 tttattttct tttcgtggtg ggctgtggtg gtttagagac aggttatatg tacccaaacc 4560 tctgagacta caaattcttt catcttttca tgatgatcct tcaatgggac atcctggcat 4620 atcacatatg cttagtatga tcactcgaac atttggatgg aagtctattc gggaagacgt 4680 catcaatttt gtgaaaagct gcgagtcgtg tcaacgggtg aagagatcga cccaggctca 4740 gcagggaacc ttgattcctt tggctatacc agattggcca tggagcatga ttggtatgga 4800 cttcatcgtc aagttgccga tttcagcagg ttttgactct atattagtca ttactgacca 4860 cttttcaaaa ggggcccact ttataccatg taaggagtcg atgagcgctc cccagcttgc 4920 ttcccttttt atatctcagt tttttagata ccatggctac cctgacaaaa tagtctcgga 4980 taggggtgcg acgttcgtct cgaccttttg gaaggcggtt caaatgcaat tgcggattca 5040 tcctgcacct tcaacagcgt ttcatccgca aaccaatggt cagacggaaa gaaccaatca 5100 agctttggag tcttatctac gtcactttat tacttatcga caggaggatt ggtgtgaatg 5160 gttgccaatg gcagagttct gctttaacaa cactccttct tcgtcgacta agctttctcc 5220 atttttttct tggcaaggtt ttcacccgag agctaacagt tttacggaac ttttgaaagt 5280 acctcatgcg gaccaattcg tcaaactctt ggaggcaact caattaaatc tgttggtttc 5340 tctgaaacac gcgaaagagc aacaagctaa gtactataat tctcataaaa gattgggtca 5400 ggagtactct aaaggtgact tggtttggtt gtctagacag aacatcacaa gtacgaggcc 5460 ttctagaaaa ttggattata gacgaattgg gccgttttga gtggaagaaa tggttgggaa 5520 gaatgcggtg aaattaaatt tggggcgttc atactctcgt cttcaccctg tttttaacat 5580 ttctttgatc tcaccttatt ataatccatc tatgggagga cggcctgcag ctactcacga 5640 acttgtgtct tctccgcagg taacaccaat tagggattgg cgtcaggtct cggcaattgt 5700 ggattataga aagaaaggga atgcggcacc ggaatatctg attagatggg cagggagacc 5760 gataacagat gatacttgga ttcaactacc tgatatttca tcagatttag atctgtttat 5820 tctcactttt catcatcgat atccccaatt tccagctccg aaaaatatct cttgggagcg 5880 ttatgaatca ggttatcaag ctgtgttacc attttaaaga aaaataagaa aaattttgac 5940 cctggacgga aattattat 5959 // ID FTF1_FO repbase; DNA; FNG; 1251 BP. XX AC AF395082; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 02-JUL-2010 (Rel. 
15.08, Last updated, Version 2) XX DE Fusarium oxysporum fused Fot1:Tfo1 transposable element FTF1_FO. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW target site duplications; hAT family; FTF1_FO. XX NM FTF1_FO. XX OS Fusarium oxysporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; OC mitosporic Hypocreales; Fusarium; OC Fusarium oxysporum species complex. XX RN [1] RP 1-1251 RA Horman R.S., Garcia-Pedrajas M. and Bainbridge W.B.; RT "Direct submission."; RL Direct Submission to Genbank (25-JUN-2001). XX RN [2] RP 1-1251 RA Jimenez-Diaz M.R.; RT "Direct submission."; RL Direct Submission to Genbank (24-JAN-2002). XX RN [3] RP 1-1251 RA Jimenez-Gasco M.M.; RT "Direct submission."; RL Direct Submission to Genbank (24-JAN-2002). XX DR Genbank; AF395082; Positions 1 1251. XX CC Contains Fot1-like and Tfo1-like transposons separated by CC an 8 bp target site duplication in the choline kinase gene. XX SQ Sequence 1251 BP; 325 A; 313 C; 288 G; 325 T; 0 other; tcactacgac gcagggtgac attatggaag gggcggttgt taatggcaac gtacatcgtg 60 atcgatacag gggagcccta acgcagtacg agtcaaaccc cgaccgtaca tatacgtgga 120 ctcccattat ttattgtaat ctcggcgctc ggggtcgctt tgaaccccct gatcatattc 180 aaggccaaaa gtatccaaga acaatggttc aagaaggatt tcctagctaa gcatcctggc 240 tggcatgttc accttttcgg aaaacgggtg gacaagcaat gatattgctg tcgagtggct 300 agggaaggtg tttttacccc aaacgcaacc tgaggatcca gccatggtcg cctacttatc 360 gtcgacggtc acggcagtca tacctcggac gaatttatga ctatgtgtta cctaaacaac 420 gttcacttac tctttttccc tgctcatact tcccatgtcc tccagccctt ggatctcggc 480 tgcttttcca gtttaaaaac gcctatcgta ggttaattgg ggagcatacg gcttttgaca 540 gataccaaca aaggttggaa aggcaaactt tcctcgaatt ttatgcgaaa gctcgagaaa 600 ttggtcttcg aaaggaaaac ttcaatctgg gtggaaagcg agaggcaaca gaaggagagc 660 ctgaatatgt tagtgagccc cattgaagtg atcaggcgtt tgaatcttac catagaatcg 720 agaagagacc tgcttagcat tcacatccag aatgaagtat cctctgtagt aaccctcttg 780 ccactgaagc tcagggttct tctggactct ggcacgagca 
gccttgccgg ctgcaggctc 840 gatagggcta ccaacaccat tagagctgac agctgttcag tgtgtccatc aatcaagtta 900 cttggcttgg ttggttgtga ttggttgatt acgtaacaac caaccaaagt aagggctcct 960 gttcaaccaa gtaagcaacc tttttccatt actcggctct acttgattta ctcgacagaa 1020 ttgccgaatg aggcatagcg cgatgacttt gacatatcta cgcgctctga ttggctaggt 1080 attcctcatc ccacctgcga aaattctcgc cgagtttctt aagagaaggt acctgacgat 1140 tgcgccaagc tcgaagccaa atatgtctgc acattcattc tttcggccgc atagaaggac 1200 caggaagtcc actccaccta ggactccctc tgagcctctc gctttccacc c 1251 // ID Gypsy-12_RO-I repbase; DNA; FNG; 6618 BP. XX AC AACW02000311; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_RO_; KW Gypsy-12_RO-LTR; Gypsy-12_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-6618 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000311; Positions 21248 14631. XX CC 'CTAT' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 133..1221 FT /product="Gypsy-12_RO-I_1p" FT /translation="MASLQSGQTVLATVNNQSAIQGFRPRLIEFYGYEGED FT FRHFQETLDSYLAITNTHSDARKLIVLKSQLRRAAKVYFERTILKEQPDIG FT YEQAIEKLKKHYITPELIQTYELEFNEMFQGEQEHPQIFLARLREAADLAD FT IDSEEVIESRFRAGLLREIKQFCIQCSSRTFKDWTVHAEGWWNANRPRKIA FT IVDNPFLPRNVNNALIYNDDNTYYNHASIYKHNIDLIDTDERMLQYIPVGN FT ARMNERMDKSNVAHNIITGPNQLTTMDVVNDINHSPSYKHQTNRHKESTSV FT HKVDQQDIVTLIQETIRNELNKQQQSYYPQSNRNNQRNYRNNYYNRREEPS FT NYRRYPPRDDTNNQAQNSKN" FT CDS 1224..4277 FT /product="Gypsy-12_RO-I_2p" FT /translation="MGSIAFDNQKNGQSNTHQYTKPINKPTSQHHLNALIT FT EGEYDPYNHELCAAVRPDRPPEVISSRVAPYSKGKASSKRKEPEKPTVTKR FT VTTRRHMEEINQPPIQKITTDQNMDMDTELPIEIKDNQPTKKKIIKRRKPE FT ILYDIAADVLDKPANITVRDLITTVPSLRRQLTTVCRPNQQKRTISPDKTT FT IAVIEDNEDYSTTAVYSKVSIGSKRIRALVDCGAAKTCMSKALADALNLEI FT DASSESVFTLGNGTKQPALGIIYDVPIQVKENMIIPCTVEVLPSCPSHFII FT GNNWLNRSKARIDFNTSSLKVTYKNKNAELEISFLRKNEKLPQVSSYKQTY FT KNPVSTTNSHLVKQVHFEDETIENSSEEGSENETEEESSSLNESEDEENDE FT KSLLVLENDGQEEMVINHLKDICYITAGRNGLYISANSSKKLVIDKAKTQN FT KNIKYIFEITNAKLKYVYGCFDSSSNLIMNRKTIEIHLYNRTREHILLCPF FT EEIGILEEIDLQEETNVKAYDVLSNPELFVMDLEESPSINKKEKEPLESEL FT YTKLEVGNLDKAVEKKLRILLKKYYSIFDWNNDTIGNTSIITHKIIIEQDT FT QPISHRPYRLSPIEAKYLEQEIDKYCKLGVISPSNSPWAAPVILVKKKNGE FT YRMVIDYRKLNAVTKKDSYPLPRIDDLLDTLGKASIFSALDMRAGFHQVPL FT EEESKELTAFTTKYGVYHYNTLPMGLVNSPATFQRLIDLCFRPLINKCLVA FT YIDDLNIYSRNNDDHLQHLKQVFNCIKIANLKLNPEKCFFFKNHLKFLGYI FT ITKEGIQTDPSKIQKIIDYPIPQTVTQIRGFLGLASYYRRFIKNFAAIARP FT LHDQTKTLKKIPWTDKTTQSFLTLKGLLTTAPVLSRPDFSKPFILVTDASK FT LGLGCILTQLDADGKEHPVIYASRSLKSSEVNYAATKLECLAVVWAVKMFR FT PYLLGEKFTIMTDHSALTGLLKAKNPTGILARWIAILSEYEYEIKYRPGRV FT NESADFLSRLGY" FT CDS 4446..5786 FT /product="Gypsy-12_RO-I_3p" FT /translation="MNSDKIKYLKIYLQEQNLPSGIETKMQKYIKQNAKRY FT TIYNGILYRYNTENGIIRKVLSKQEAEEMLYAYHQHPLGGHLAYNNTLHKI FT ASRYYWENMAQDILNYVQKCHRCQVYGKKRLNEELYPVPVSPKPFDRIAID FT VKHVQASRAGHRYIVAAIDYLTKYVEAKPLRLQTASEIAIFLYEEVISRHG FT CPTLIVTDNGKPFVSGLVRAVCHNFSIIHKTTTPYNPQSNGLIERFNRTLG FT 
QILQKRSTEEKEDWHLYLPAALFAYRSIKQATTKQSPFFLMYGYEPKTPFD FT NNHRIIGFKTPSFDATLMHRTIHQIRNLNLVREQASQSIKQTQVAQKKAIE FT NKILEEKKELKPPFRLGDIVLLYKDYMATSWSGKLQDKWEGPFIIHSILGK FT GTYHIKSFKSDDNRIRRVHGNRLKIYAIPKVQWSTDNARRVPNPIELDQDE FT CF" XX SQ Sequence 6618 BP; 2614 A; 1199 C; 1120 G; 1685 T; 0 other; tttggtggtc actacgaggg gaaaaatcaa gcaaatttat taatcaaacg tttattatca 60 accaacactt actacacaaa caaaaagaag aaacatatat atcaacaata cttacaacat 120 caagactata caatggcctc attacaatca ggacaaaccg tgttggctac tgtcaataat 180 caaagcgcca ttcaaggctt cagaccccgt cttattgaat tctacggata tgaaggcgaa 240 gattttagac attttcaaga aacccttgat tcttacttgg caattacaaa tacccatagc 300 gatgcccgga aattgatcgt tctcaaatcc cagttgcgtc gtgctgccaa ggtttatttc 360 gaaagaacaa ttcttaaaga acagcctgat atcggatacg agcaagcaat agaaaaatta 420 aaaaaacatt atattactcc tgagctcata caaacttatg aactggagtt caatgaaatg 480 ttccaaggtg agcaggaaca tccgcaaatc ttccttgcac gtcttaggga agcagcagat 540 ctagcagaca tagatagcga agaagttatt gaaagtagat ttcgtgcggg gctcctcaga 600 gaaataaaac aattttgtat tcaatgcagt tctcgcacat tcaaagactg gacagttcat 660 gcagaaggtt ggtggaatgc caacagacca agaaaaattg ccatagtgga caaccctttc 720 cttcccagaa atgttaataa tgcgttaata tacaatgacg acaatacata ctataatcat 780 gcttcgatct ataaacataa tatagatctt attgataccg atgaacgcat gcttcaatat 840 attcctgttg gtaatgcgcg tatgaacgaa aggatggata aatcaaacgt tgctcataac 900 attatcactg gtcctaacca gttgacaact atggatgtcg tcaatgatat caatcactct 960 ccttcataca agcatcaaac caatagacat aaagaaagta ctagcgttca taaggttgat 1020 caacaagaca ttgtaactct cattcaagaa acaattagaa atgaactcaa caaacaacaa 1080 caatcatatt atccacagtc aaataggaat aaccaacgca actaccgtaa caattattat 1140 aatcgtaggg aagagccttc aaattacaga cgttatcctc ctagagatga cacaaataat 1200 caagcccaaa actcaaaaaa ctaatggggt cgattgcttt tgacaatcaa aagaatggtc 1260 aatcgaacac ccatcaatat actaaaccca ttaacaagcc aacttcacaa caccatctta 1320 atgccttaat cactgaagga gaatatgatc catataacca tgaattgtgc gctgctgtta 1380 gacctgacag acctccagaa gtgatttcat caagggtagc accttatagt aaagggaaag 1440 cctcatcaaa aagaaaagaa 
cctgaaaagc ccacagttac gaaaagagta accactcgaa 1500 gacatatgga agaaataaat caaccaccta tacaaaaaat caccacagat caaaacatgg 1560 acatggatac tgaattaccg atagaaatta aagacaatca acctacaaag aaaaaaataa 1620 ttaaacgaag aaaacctgaa atattgtatg atatagctgc cgacgtgttg gataaaccag 1680 caaatatcac cgtccgagat ctaataacaa ccgtgccgtc attaagaaga caacttacca 1740 cggtatgcag acccaatcaa caaaaaagaa caataagtcc cgataaaact acaattgcag 1800 taattgaaga taatgaagac tacagtacaa ccgctgtata ttctaaagtg agcataggat 1860 cgaaaagaat tagggcgtta gtagactgtg gtgcagcaaa gacctgtatg tcaaaagctc 1920 tcgcagatgc tttaaattta gaaatagatg cttcatcgga aagtgtgttc acgttaggaa 1980 atgggactaa acagccagca ttgggaataa tctatgatgt accaattcaa gtaaaggaaa 2040 atatgatcat tccatgtaca gtagaagtac ttcctagctg tcctagtcat tttatcattg 2100 gaaataattg gcttaatcgc tctaaagcaa gaattgattt taatacttca tcattgaaag 2160 taacatataa gaacaagaat gcagaattgg aaatatcatt tttacgtaaa aatgagaaac 2220 taccacaagt ctctagttac aaacaaactt acaagaatcc agtcagcact acaaactcac 2280 atcttgtaaa acaagtgcat tttgaagatg aaactataga aaattcttct gaagaaggtt 2340 ctgaaaatga aacagaagaa gaaagttcaa gtttaaatga gtcagaagat gaagaaaatg 2400 atgaaaaatc gttactagta ttagagaatg atggccaaga ggaaatggtt attaaccatt 2460 taaaagatat ctgttatatt acagcaggaa gaaatggact gtatatttca gcaaattcat 2520 caaaaaagct ggtaatagat aaagcaaaaa cccaaaacaa aaatatcaaa tatatctttg 2580 aaatcacaaa tgcaaaattg aaatatgtat atggatgttt tgattcatct tcaaatttaa 2640 taatgaacag aaagacaata gaaatacatc tttataatcg tacacgagaa catatactcc 2700 tttgcccctt tgaagaaatt ggaatattgg aggaaataga tttacaagaa gaaacaaacg 2760 ttaaagctta tgatgtctta tcaaatccag aattattcgt gatggattta gaagaatccc 2820 ccagtataaa taaaaaagaa aaggagcctt tggaatcaga attatacacc aaattagaag 2880 ttggaaacct ggataaagct gttgagaaga aattaagaat attactgaaa aagtattata 2940 gcatatttga ctggaataat gatacgatcg gaaacacatc aataattaca cataaaatca 3000 ttattgaaca agacactcaa cctattagcc atcgaccata tcgattgagc ccaattgaag 3060 ctaaatactt agaacaagaa attgacaaat attgtaaact cggtgtgata tcaccttcaa 3120 atagtccttg ggcggccccg gttatcttag 
taaagaaaaa gaatggagaa taccggatgg 3180 ttatcgatta tagaaagctt aatgctgtta ccaaaaaaga ctcgtatcct ttgcctagaa 3240 tagacgatct cttagataca ttgggaaaag cgtcaatatt ttcagcctta gacatgcgag 3300 caggttttca ccaggtgcct ttggaagaag aaagcaaaga acttactgct tttacgacga 3360 aatatggtgt ataccattac aatactttac caatgggact ggtcaattca ccagccacgt 3420 tccaaagact tattgactta tgttttcggc cattaattaa caaatgtttg gtcgcatata 3480 tagatgacct taacatatat tcaagaaata acgatgacca tttacaacat ctgaaacaag 3540 tttttaattg catcaagata gcaaatctga agttaaatcc tgaaaagtgt ttctttttta 3600 agaatcatct caaattcctt ggatacatca ttacaaaaga aggaatccaa accgatccca 3660 gcaaaataca gaaaataata gattatccta ttcctcaaac tgtgactcaa atacgaggat 3720 ttcttgggtt agcctcttat tacagaagat ttataaagaa ttttgcggca atagcacgac 3780 cgctacatga ccaaaccaag acattgaaga aaataccatg gactgataag acaacacaat 3840 cctttttaac actcaaagga ttacttacaa cagcaccagt attatctagg ccagatttca 3900 gtaagccatt tatcttggta acggatgcat cgaaattagg cttaggttgt attctcacac 3960 agttggatgc agatggaaaa gaacatccag ttatctatgc cagtcgaagc ctgaaatcca 4020 gcgaagtaaa ttatgctgca accaaattag aatgtttggc tgtagtatgg gcagtaaaaa 4080 tgtttagacc atatttgctt ggagagaagt ttaccattat gaccgatcac tctgcattaa 4140 ctggacttct aaaagccaaa aatcccactg gaatattagc acgttggata gccatcttat 4200 cagaatatga atacgagata aagtataggc ctggtcgcgt caatgaaagt gctgatttct 4260 tatcacgtct tggatattaa acatatctat aacaatatta ctcattaaat tttaactgaa 4320 ctacatggag gaagggaggg ggtagttgac caataataat aaataaaaaa aaaaaaaaaa 4380 aaacaaacaa acaaacaaat atactataaa tatacaaatt tatacaatcc aacttgaaaa 4440 caacaatgaa ttccgataaa atcaaatatt tgaagatata cctgcaagaa caaaatctgc 4500 ccagtggcat agaaacaaag atgcaaaaat acattaaaca aaacgcaaaa agatatacca 4560 tttataatgg tatattatat agatataata cggaaaatgg tataatacga aaagttttat 4620 caaaacaaga agctgaagag atgttatatg cataccatca acatcctttg ggaggacact 4680 tggcatataa taacacatta cacaagattg catccaggta ttactgggaa aacatggctc 4740 aagacatttt aaattatgta cagaaatgtc atcgatgtca agtctatgga aaaaagcgat 4800 tgaatgaaga gttatatcca gttccagtat cacccaaacc 
ctttgatcga atagcaatag 4860 acgtaaagca tgtgcaagct tctcgagcag gacacagata tatcgttgca gccatagatt 4920 acctgaccaa gtatgtagag gcaaaaccac ttagattaca aactgcctca gagatagcca 4980 tatttctata tgaagaagtt attagtagac atggctgtcc tactcttata gtcacagata 5040 atggaaaacc ttttgtaagc ggattagtgc gtgctgtttg tcataacttt tccattatac 5100 ataagacgac tactccttac aatccacaaa gtaatgggct aatagaaaga tttaacagaa 5160 cacttggtca aatacttcaa aagagatcca cagaagaaaa agaagactgg catctgtatt 5220 tacctgctgc attatttgca tatagatcta tcaaacaagc tacaacaaaa caaagcccat 5280 tcttcttaat gtatgggtat gaaccgaaaa caccatttga taacaatcat agaattatcg 5340 gctttaaaac accaagcttt gatgccactt tgatgcatag aacaattcac caaataagga 5400 acctgaattt ggtgcgcgaa caagcatcac aatctatcaa acagacacaa gtggcccaaa 5460 agaaagctat cgaaaataag attcttgaag aaaagaagga acttaaaccc ccattcaggt 5520 taggtgacat cgtattatta tacaaggatt atatggctac gtcgtggtca ggaaagcttc 5580 aagacaaatg ggaaggaccc tttatcatac atagtattct aggaaagggg acctatcata 5640 ttaaaagctt caaatcagat gacaatagaa tcagaagagt tcatggaaat aggcttaaaa 5700 tatatgcaat cccaaaagtg cagtggtcta cagataatgc aagacgcgtt cctaatccca 5760 tagaattaga tcaagacgaa tgtttctaat aaataacaac aaaaaaaaaa aaaactatat 5820 aatataccaa gaaaaatcaa aaaaataata atttaacaat aatacaaacc atcgaaatac 5880 ctcaaaaaca aagttgtggc aatgaattca gaaaacaaaa ccatcttcgt cgatttaact 5940 gatgaaaaag aagatgtctt gcctgtacaa atggacataa ccattcaaat gcacttccgg 6000 gatatgtatc aatcctttgg aaaggagctt acaagggagg ctcttaatac atacttctct 6060 actcaagaaa aagaacaagg caatccctat aagatacggt atgccactga agagagcgat 6120 actgatcgcc aaggatatat tgattggata atgcaaagat ttcaaaaaat ggacgaagaa 6180 atagatctca aatcttcttc tcccatgtct gtaaaaactc aaaacatggc tgacttctgg 6240 gatgacttat tagaacaatg ccgtaaccaa tacgaattca atcaagaatt tgcaaaccat 6300 tatcaggatc agttcatgaa aaacttcaaa aggtatgaag agctattgcg agaggaaatc 6360 gagataaaca agcaaaagag aaccataatc agaatatttg ctcataatgt tgccgaatgg 6420 gcacatctca agtttaccag accaacaaat gaaaagaaaa gtcaacatat gtatgttact 6480 acgaaaaaca atcaactcag atcaattcta ccaaaccata ttataagccg 
ctcaatgatg 6540 agaaaactca gggaacaaga ctcttggtgt aagaatgaaa aaatgttcga ggacgaacat 6600 catatcggtg ggggcttc 6618 // ID Gypsy-3_TMe-LTR repbase; DNA; FNG; 1272 BP. XX AC CABJ01003414; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_TMe_; KW Gypsy-3_TMe-I; Gypsy-3_TMe-LTR. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-1272 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01003414; Positions 137400 136129. XX SQ Sequence 1272 BP; 392 A; 293 C; 277 G; 310 T; 0 other; tgttaggaat accgagatcc gatacctagg aactacccta ggtagggtaa cctctcccga 60 aatcgaaatc tcaggtctcc aacacccccc tccggccccc ttacggggcc cgcacgtcat 120 catacatatc gatctaagtt gcgtcctgtg agtattatac cacagaatcg aaggtgttaa 180 aataacaccg tatacgtcac cgccgtccgt ccgaccggat cctgttcgca ggatatataa 240 gaggggacct tcggctaggt ctgggaagta taacggagtg aatccgtcaa ctacccagaa 300 ggttactagg cttcaagaac gagatatctt tggcataggc cgtaagtaac gggacgtttt 360 cgccaaaaag atgatgatga atagtaaagg atcttaatga aagaccaata cattcacatc 420 taccctacga taaccagaag tgtctggaac tccggagaat actaccatcg aacccaatga 480 agttcgagga gtgttacaac taggttctgg tacatactgc agccgaaggg gtgactccca 540 aaccggtaac agtaaccaga atcctattac ccacggcaac caagggttac caaccgaggg 600 attatctacc aatcctatgg aaaggcttcc tgcaggcaag taggaagtcc tccggaaaag 660 acctattggt tgggtatgga gttatgaaga gttgaagtta gatgacctag aaagttggta 720 ataacaacta tcttcgcaga tgcaacccgc agacgaaact gatcatttcg tcgacaaggg 780 agacgattgg agccaacaca gttcgacttg agaattgggc catttgtcta aaaagactct 840 caccatcgga taattactat tggaaacagc atgacttcca aatagtcata ttataattat 900 ccaagagggt gtgacgattt tcagggtctc caggtgttca caccttggaa gaaaacccga 960 
cgttgcgagt tatattggaa aagggtacat ggttgtatgt tagtcagatg tagtctgcga 1020 acactccgac gcggaatata aatatcgaag tgcctcccaa gctttagata tcatcaagca 1080 atggagaaga gaccttaaca atataatatt gttcttacga ttttacttta ctttacccta 1140 ctttacttca ctcttagtag tcataaacga aacgagtgaa gtccgagcct tcctagacaa 1200 tctccgtcta taaaaacgaa tctcgatcag cctagacaac tctgaagtct atcaggttgt 1260 ctccgcctaa ca 1272 // ID Gypsy-1_CCO-LTR repbase; DNA; FNG; 195 BP. XX AC AACS02000001; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_CCO_; KW Gypsy-1_CCO-I; Gypsy-1_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000001; Positions 96252 96058. XX SQ Sequence 195 BP; 49 A; 54 C; 38 G; 54 T; 0 other; tgtaaggacg cgccttacac gatcacgtgt ggctctctat ctcagcacac agaagaccga 60 cacatatctc caacttacta tcatctcgct gtacgcgtac tctacggaca agctctatag 120 actcgattca tacgcacgtt gaacgaactg tgagtgtttt ctgttctcct gtttccgtag 180 gagaacagat ttcca 195 // ID Gypsy-110_MLP-I repbase; DNA; FNG; 5748 BP. XX AC AECX01000612; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-110_MLP_; KW Gypsy-110_MLP-LTR; Gypsy-110_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5748 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000612; Positions 16948 11201. XX CC Positions [4548-5027] - Integrase core CC 'CATGA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1644..2756 FT /product="Gypsy-110_MLP-I_1p" FT /translation="MPWLRENGHRINWKDGSITPKDSPTNAVMATAAVTSP FT IPKTTPDDVAYQEGQARNLGEGALDDDHIRKIQPPQCEHDTVVTILPVTDD FT KLDCPLNYSEQPEMTTTDVYLPNNDQMPKPEIATAAVPALPNPKTTLDDPV FT SVGGQARKFGEGALNDDHIDEMKPPQCEHDTVPSIVPETDDKIGCLLNYFD FT QTATTTFNTTNDIAVAAITASPDPETALDESASVEGQARKFGEGALDDDHI FT KKIQPPQCEHETVPTISPVTDDKLFCPLNSSAKICSSTTSWNVSARLAAEA FT SKDKKERTAEELVPTRYHRYINMFRKTRAMTLPPHRRYNFCVDLIPGATPQ FT AGKIIPLSPAEEVALNAMIDEGLEKGTI" FT CDS 3297..5648 FT /product="Gypsy-110_MLP-I_2p" FT /translation="MDPLKVSAVKDWPAPKNVTQVQRFLGFANFYRRFISN FT FSKITRPLHELTQDDVRFEWTSKRDQAFETLKEAFTTAPVLKIADPYKAFV FT LECDCSDYALGAVLSQEDEDGILHPVAFLSRSLVQSERNYEIFDKELLAVV FT ASFKEWQHYLEGNPNRLEVVVYTDHKNLETFMSNKQLTRRQARWAELLGCF FT DFHIRFRPGKQSTKPDALSRRPDLEPSSEDKWLFGSMLKPNNLSEASFLAE FT LDCIEAWFGDESIAHDEVEDWFEEDIKQEYPPELIEVDAIDRTPGANGPMW FT TDDVILRQVREASKQDPRILELIKQVEKGGSQVPAGLTVTDQVLYRNGIIE FT IPNDQRVKLEILQSRHDSLLAGHPGRAKTLSLVQRQYRWPSMKAYVNQYVD FT GCDSCIRVKSTTSSPFGSLEPLPIPAGPWTDISYDLIPDLPTSNGKNCILT FT VVDRLTKMGHFIPCTTEMNSEELATLMLANVWKLHGAPKTIVSDQGSVFIS FT KITESLNKQIGIELHPSTAFHPRSDGQTEIVNKAIEQYLRHFVSYRQDDWE FT GLLPLAEFAYNNSTHTSTGVSPFRANTGYDLNLGRIHSNEQCVPVVERRLK FT MIEDVQEELKENLKRAQVAMKHQFDQGVRPTPEWEVGNEVWLSSRHISTTR FT PSAKLDHRWLGPFRIVKKVSRSAYQLALPATMGRVHPVFHVSVLRKHTPDS FT IEGRVQDPPDPIEIEGEEEWEVEDVLDKQKRRGKDEYLISWKGFGRNEDSW FT EPSTNLTNAAELIDQFNLRFPRASEDHRRTRRV" XX SQ Sequence 5748 BP; 1656 A; 1391 C; 1459 G; 1242 T; 0 other; ctttgtagca tctttatcaa ggcaaaccaa gaatcaggag atcttaagaa gaagaagata 60 aattaaaaat ttcagaagaa agcataagaa gaaatttttt tttttaagaa aaaattgttt 120 actcaaggac atcaagttaa ttgaagaaag tactataaag aaaattactc gtagcaccct 180 ctcacacccc 
agaccaatcc catcacgcca acctttaaca ttcccaacga caactctgtc 240 accccatcca aacattcctc taaccaattg tcctacgcct ctatggacga cacaaccgaa 300 gccgctggtg ttgacctcca cgccatttct tgacagctgg ccgagttaga agcgaagttg 360 aagaccgaga ccgaacgccg agttcaagcc gaagccgcgc aagctcaaag atcggaaccc 420 ccagccaaaa cacctaagat ggcgaccccc aataaatttg acggaactcg agggtcgaaa 480 gccgaggcgt tcgctagtca ggtcggactc tacatcatca ccaacgcagc ccttttccca 540 aatgacgtgt cgaaggtcac cttcgccctt tcttatctta cgggcgaggc catcaagtgg 600 gctcagccat tcttacaccg tgtgctgaac cctactgaag gtacagaagt tacctacaat 660 ggctttgtcc gggctttcga attggtctac ttcgactctg accgccagaa acgggctgag 720 gcggcgctcc gggtgctgaa acaaaccaag acggctgcgg attacacagt cagcttcaat 780 caactagccc cggcaaccca atgggaactg cctaccttga ttagccacta ttgacaaggc 840 ctgaaacgtg aggtgcggtt agcaatgatt cgggagaagt tcgaggatct ggagagtatc 900 atggcactcg cctgcgcaat cgataacgac atccgcggca agtatgcacc ccctgctagc 960 atcagtcgcc cagtcgatcc cgatgccatg gacatctcca gcactcggtt tgacatttcg 1020 tccaccgagt ataatcgacg taatgaagag aggttgtgct atagttgtgg tgaatcaggt 1080 catttggcta ggtggtgtgc aagtagaaag gggaggaaga aagggaaggg gaaggagaca 1140 aggaaggtag ctgagttaga ggccaaaatc gctgcattgg aaggtaaatt gggggagggt 1200 agtagtagat cagatatgtc aaaaaatgga gacgcttaga ggtgacggac gtgccacccc 1260 taagccagag cgggagggaa tttttagaag tagctgagaa tgatagctgt gcaaagaaaa 1320 gcttacatga tcctcgtatt ttcacaacta tttccctttc agagtccccc cgcgccacgt 1380 cccaacatga tcccgatccc aaaccaagag tgagagctcg tgcccttgtc gactgtggtt 1440 cgacgcacaa agtgcttgga actaccttcg cggacaaggc tggaatcccc gtaacggaac 1500 ttgccacggc aggtgacgtc tatggatttg atggccagcc tcgtagcgtc gcacacgacg 1560 cggaactgtt cgtcgacaat gcggacaacc catcccggtt ccttgtcacc aagattaagg 1620 attcctacga tgtcatccta ggaatgccat ggctgcgtga gaatgggcac cgaatcaatt 1680 ggaaagacgg aagcatcaca ccgaaggact caccgaccaa tgccgtgatg gctaccgccg 1740 ccgtcacgtc gcctatcccg aaaacaaccc ctgacgatgt cgcgtaccag gaggggcaag 1800 ctaggaactt aggcgagggg gctcttgatg atgaccatat cagaaagata cagcccccgc 1860 aatgtgagca tgataccgtt gttaccattc 
ttcccgtcac agatgacaag ctggattgcc 1920 ccctgaatta ttctgaacag cctgaaatga caacaaccga cgtataccta ccgaacaacg 1980 accaaatgcc taaacccgag atagccactg ccgctgtacc agcgttgcct aatccgaaaa 2040 caaccctcga cgatcctgtg tcagtaggag ggcaagctag gaaatttggc gagggggctc 2100 tcaacgatga ccatatcgac gagatgaagc ccccgcaatg tgagcatgat actgtacctt 2160 caattgtccc cgaaacagat gacaagatag gttgcctcct gaattacttt gaccagactg 2220 ctacaacgac cttcaacacg acgaatgaca tcgccgttgc cgctataact gcatcgcctg 2280 atccggaaac agcccttgac gagtctgcgt ctgtggaagg gcaagctagg aaatttggcg 2340 agggggctct tgatgatgac catatcaaaa agatacagcc cccgcaatgt gagcatgaaa 2400 ccgtacctac tatctcccct gtcacagatg acaagctgtt ttgccctctg aatagctccg 2460 caaaaatctg ttcgagcacg acatcatgga atgtatcagc acgactcgca gcggaagcat 2520 ccaaagataa gaaagagcgt acggctgaag aattggtccc cacccggtac catcggtata 2580 tcaatatgtt tcgaaaaact agagcaatga ccctaccacc ccacagacgg tacaactttt 2640 gtgtagactt gattccaggg gctacacctc aggctggtaa aatcatcccg ctgtcccctg 2700 cggaggaggt tgcgttaaat gccatgattg acgagggtct ggagaaagga acgatctgac 2760 gtacacgttc accttgggca gccccagtcc tcttcactgg gaagaaggat ggagctctcc 2820 aaccgtgctt cgactatcgt aaactgaacg ctgtgacggt gaaaaattgt taccccctac 2880 cgctcacgat ggagttagtc gatagtttga ggaacgcgga gaggtataca tcccttgaca 2940 tgcggaacgg gtataataac ctgcgtgtga gggagggaga cgaatcaaag ttagcatttg 3000 tgtgtaaaag aggacagttt gaacccctgg tgatgccgtt tggaccgaca ggggctcctg 3060 gctatttcca atttttcatc tctgatattt tccgcgacaa gatcggaaag gacctggcag 3120 cttacctgga tgacttgttg atctataccc ctgagggtgt tgatcacgaa ctggtagtgg 3180 aggaagtcct ttgagtattg gaatcacatt caatctggct taaaccggag aaatgcaagt 3240 tttcccgtaa ggagatcgac tatctagggc tgttaatatc aaagaaccgc gtacgcatgg 3300 atccactgaa agtgtcggcg gtgaaagact ggccagcacc gaagaatgtg acccaagttc 3360 aacgtttcct gggttttgca aacttctacc gacgattcat tagtaatttc tctaagatta 3420 cacgcccact ccacgaactg actcaggatg atgtacgttt tgaatggacg tcaaagagag 3480 accaggcgtt cgaaacactg aaagaagcct tcaccacggc accggtgttg aagattgccg 3540 acccatacaa agcctttgtc cttgagtgtg actgctctga 
ctatgcactg ggggcagtct 3600 tgtcacagga agacgaagac gggatcctcc atccagtggc tttcctatcg cggtcgttag 3660 tccagtcgga aaggaactac gaaattttcg acaaagaact gttagcggta gtggcgtcat 3720 tcaaagaatg gcaacattac ttggagggta acccgaatcg attagaagtt gtcgtctata 3780 ctgaccataa gaacctcgag accttcatga gtaacaagca actcacgagg cgtcaggcga 3840 ggtgggcaga gctgttgggt tgttttgact tccatatccg gttccgacct gggaagcagt 3900 ccacaaaacc ggatgccttg tcacgccggc cggacctgga accttcttca gaagacaaat 3960 ggttatttgg gtcaatgctt aaacctaata atctatcaga agcttcattc cttgcagaac 4020 ttgattgtat cgaggcgtgg tttggcgacg agagtattgc acatgacgaa gtggaagatt 4080 ggtttgaaga agacattaaa caagaatacc cacctgaact gattgaagtc gatgcaattg 4140 acagaacccc aggagccaac ggacctatgt ggacagacga tgtcattcta cggcaggtac 4200 gagaagcatc caagcaagac cctcgcatac tggaattgat caaacaagta gagaaagggg 4260 ggagtcaggt acccgcggga ctcaccgtaa cggatcaagt gttgtaccgc aatggaataa 4320 tcgaaatccc caacgatcaa cgggtgaagc tcgaaatact ccaaagccga catgatagtc 4380 tgttagctgg acatccaggc agggcaaaaa cgcttagcct cgtccaacgt caatacagat 4440 ggccttcgat gaaagcttac gtgaaccagt atgttgatgg atgtgattct tgcatccgtg 4500 taaaatcgac gacctctagc ccgtttggat cactcgaacc acttccaatc ccggcgggac 4560 cctggacgga tatcagctat gatctcatcc ctgacctacc gacctcaaat ggaaagaatt 4620 gtatacttac cgttgtcgac cgattgacta aaatgggtca tttcattccc tgtacgactg 4680 aaatgaattc agaagagttg gcaaccctca tgttagcgaa cgtttggaag ctacacggag 4740 ccccaaagac gattgtgtca gatcaaggta gtgtatttat atcaaaaatc actgaatcgc 4800 tgaataaaca gattgggatc gaacttcacc cctcgacagc tttccaccca cggtcagatg 4860 gtcagacgga gatcgtgaat aaagctatag agcaatattt aagacacttc gttagctacc 4920 gacaagatga ttgggaaggc ctgttaccac tggcggaatt cgcgtataat aacagtactc 4980 acacatcgac gggggtgtcg ccattcaggg ccaacacggg atatgacctg aacctaggac 5040 gaattcacag caacgagcaa tgcgttccag tagtcgaaag gcgactgaag atgatagaag 5100 acgtccagga ggaactgaaa gaaaatctga agagggcgca agttgcgatg aagcaccaat 5160 tcgaccaagg agtacgaccg acgccagagt gggaggttgg aaatgaggtg tggctaagca 5220 gtagacacat ttcaaccaca cgacctagcg ccaagcttga ccaccgctgg 
ctaggaccgt 5280 ttaggatagt caagaaagta tcaaggtcag cttaccaatt agcattaccg gctacaatgg 5340 gacgtgtaca cccggtattt cacgtgtcgg tattaagaaa gcacacacca gattccattg 5400 aaggtcgagt acaggatccg ccagacccaa tcgagattga aggagaagag gagtgggagg 5460 tagaggacgt cctagacaaa cagaagagaa gaggaaaaga tgagtacctt atcagttgga 5520 aggggtttgg gagaaacgag gattcttggg agccaagtac taacctgacg aatgcagcag 5580 aattgatcga ccagttcaat ttaaggtttc cgagggcgtc agaagaccac cgaaggacaa 5640 gacgggtgta atgaggggta aggttttttc cctacggggt tttttaatac cgccccgggg 5700 aggaggcagg gccgcgaaca gggagcccgg gcctaaaagg ggggatgg 5748 // ID Copia-28_MLP-LTR repbase; DNA; FNG; 229 BP. XX AC AECX01003121; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-28_MLP_; KW Copia-28_MLP-I; Copia-28_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-229 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01003121; Positions 603 831. XX SQ Sequence 229 BP; 59 A; 49 C; 37 G; 84 T; 0 other; tgtggtcaca tgatcacaca actgcggcac agccatctct catgctggag ccaacaaaac 60 tctttcaact tgtatccgaa gttccacatc acttgtggaa ctcagattgt tatttctcat 120 acttatgcgt tatcgaatga tttgttattc actttaggta tttacatcac ttgtggaact 180 cagattgtta tttctcatac ttatgcgtta tcgaatgatt tgttattca 229 // ID TKL1_LTR repbase; DNA; FNG; 393 BP. XX AC AJ439548; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Kluyveromyces lactis retrotransposon TKL1_LTR, long terminal DE repeat. 
XX KW Copia; LTR Retrotransposon; Transposable Element; KW Long terminal repeat; RNaseH; TKL1_LTR; gag; integrase; pol; KW protease; retrotransposon; reverse transcriptase. XX OS Kluyveromyces lactis OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Kluyveromyces. XX RN [1] RP 1-393 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439548; Positions 1 393. XX SQ Sequence 393 BP; 131 A; 82 C; 67 G; 113 T; 0 other; tgttgttcct ggcgttgtag gagtacttct aaggcgccag ttattgaacc aatgctaaat 60 aagatgctga agctgagacg aactaggatc aaggctgcag cgtttggtga ttggaagccc 120 aatctgaaaa gtatataagt agatggtttc ttcatccaat ttcccatctg gatttgacga 180 catattaatt actgttctac attaacttat atgtcagatg taagaaacca aggagaacac 240 cgtaagtaac ttacttaact agtcgatcac caatttaatt atccatttat ctcattaaca 300 actacaagtc aaaatggcat ccaattacgt taatactact aaccctctag gccaagcccc 360 tactaccaaa catgatgaat atccagtcaa aca 393 // ID Gypsy-29_MLP-I repbase; DNA; FNG; 5894 BP. XX AC AECX01001225; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-29_MLP_; KW Gypsy-29_MLP-LTR; Gypsy-29_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5894 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001225; Positions 204325 198432. XX CC 'CTTGT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 362..1429 FT /product="Gypsy-29_MLP-I_2p" FT /translation="MNPEASSDAMAEIQRQLTELQTSLAEERLLRHQAEAR FT IRQAEERINAMQSGNTANTNATSATTPAQAEPQAVPKGPKVATPDKFAGTR FT GGPAEVFASQIQLYMMAHPYLFPDDRSKVVFSLSYLMGTASAWAQPLTAEL FT FNHETAHTVTFERFVCNFKAMFFDTEKKSKAEKALRSLTQKSTVAAYTHEF FT NLHASNTGWETPTLISQYEQGLKRNIRVAMVLVQEEFKSIEQISNLAIKLD FT NKIHGAADTSTTMTTPARDPNAMDISSNYTRLSADERARRLRTGNCFRCNA FT HGHVSIDCPNRQPNRNSKGKGSYKARLAELEVKLAAMGGKDESGGDQSEVP FT SRAELSKNGGAQA" FT CDS 2534..5794 FT /product="Gypsy-29_MLP-I_3p" FT /translation="MPPQCEFKSLNPETVIEATGKQEHPLNYSTDAYIDAA FT KTSWSTSAKLAADEKKKVPTRPVKELVPSRYHRHIHMFMKSKAQHLPPRRK FT YDFKVDLVDGAQPQASRIIPLSPAENEALEEMINTGLANGTIRRTTSPWAA FT PVLFTGKKDGNLRPCFDYRKLNALTVKNKYPLPLTMDLVDSLLDADKYTKL FT DLRNAYGNLRVAEGYEDILAFICKAGQFAPLTMPFGPTGAPGYFQYFMQDI FT LLGHIGKDTAAFLDDIMLYTKKGTDHEAVVDGILKILDKHQLWLKPEKCKF FT SKSEVEYLGLIISKNKIKMDPTKVKAVSEWPAPRNVTELQRFIGFSNFYQR FT FIDNFSKTTRPLHNLTRNKTPYIWDNECNKAFEGLKNAFTSAPILKIADPY FT QPFVLECDCSDYALGAVLSQRCEDVGEIHPVAYLSRSLVQAERNYEIFDKE FT LLAIVAAFKEWRHYLEGNPNRLEVIVYTYHCNLETFMTTKQLTRRQARWAE FT TLGCFDFQIRFRPGRQATKPDALSRRPDLAPKEGEKLTFGQLLRPENITPD FT TFKVELATVESFFENEDIELEDAEHWFKVDVLGTADPPSADEILNDKDIIE FT LIREATRKNDKLNELIEAVKNPISARVRQATSKYTFQDGLLYNQGRIEVPD FT DDHIKFQILRSRHDSLVAGHPGRAKTLGLVRRCFTWPSLKAYVNKYVDSCD FT SCLRSKATTLKPFGALEPLPIPAGPWTDISYDLITGLPKSNEQDSILTVVD FT RLTKMSHFLPCRETMSANELANLMIRSVWKLHGTPKTITSDRGSIFISQIT FT QELDKRLGIRLHPSTAYHPRTDGQTEIVNKAIKQYLRHFVCYKQDNWDELL FT PIAEFSYNNRDHASTGVSPFKANYGFEPSFGGIPSSNQCLPVVEDRLKVLH FT EVQQELTEFLNVAQEEMKYQFDKGVRPTPDWNVGDQAWLNNKNISTTRPCP FT KLDHRWLGPFNIIKEISRSAYKLTLPLSMKGVHPVFHVSLLQKHTQDNIKE FT RKQTEPSPITINDEEEWEVSEVLDCRKRYNKKEYLISWKGFSADHNSWEPE FT INLTNSQDLLDQFNRKFPDAIERHKRTRRK" XX SQ Sequence 5894 BP; 1848 A; 1549 C; 1269 G; 1228 T; 0 other; tattgtcgta tctcatccaa caacgggcgt tgaagaacta gactcaagat tgaaacttcg 60 atcatcatcg aagaaccttt aaaattagat tgatacagac cagaacctta tttgaattaa 120 cccagatcta ccaacaaacc ggatacactg actcacaatc aaagattgaa ccccgaacga 180 
accttattga agaattagaa gaaaaccgaa ccgtattata atctaagttt tagctacaac 240 gtctccttca taccgaatcc ccgccgacga cgttggatca gacagtgatt caacacatac 300 tgccctcgac tctacaccac tcatctcaga ccttcccact ctactcctgc tgactaccgg 360 catgaatccc gaagccagct cagacgcaat ggcagagatc caacgccaac tcaccgagct 420 ccaaacttcg ttggcggaag aacgattact acgccatcaa gctgaagctc gaatccgcca 480 agcggaagag cgtataaatg caatgcagag tggcaatact gccaacacta atgccacctc 540 ggctaccacc cctgcccagg ctgaaccaca agctgtccca aagggaccta aggttgcaac 600 gccggataaa ttcgccggca ctcgtggagg tccagcagag gtgttcgcca gtcagatcca 660 actgtacatg atggcacacc cgtacctctt tccggatgac cgtagcaagg tcgtgttctc 720 actatcttac ctgatgggta ctgcaagcgc ttgggcccag ccactcacgg cagaactctt 780 taaccacgag accgcccaca ccgttacttt cgaacgattt gtgtgtaact ttaaggctat 840 gtttttcgac actgagaaga agtcaaaggc cgaaaaggca cttcgatcac tgactcaaaa 900 gtccaccgtt gccgcgtaca cgcacgagtt taacttgcat gcctctaata ccggatggga 960 gaccccaacg ttgatcagtc aatacgaaca aggactcaaa cgcaacatca gagtagccat 1020 ggtactggta caagaagaat tcaaatccat tgagcaaatt tcaaatctgg ccatcaagct 1080 tgacaacaaa attcacggag ctgctgacac ctcaacgacc atgacgaccc cagcacgcga 1140 cccgaatgct atggatatat cctccaacta caccagacta tccgccgacg aacgtgcgcg 1200 acgattacgc actggaaatt gtttcagatg taatgcgcat ggccatgtat ccatagactg 1260 cccgaaccgc caacccaacc gtaattcaaa agggaaggga agctataagg caagattggc 1320 cgaattagag gtgaaattag cagctatggg tgggaaagac gaatcaggag gagatcagag 1380 tgaagtacct agtagagctg aattgtcgaa aaatggaggc gctcaagcct gaaggtcgtg 1440 cctagcttga gcaacggagg tgttgtgaat tcagtagact taggggcaaa taacattgta 1500 acgtgcaatc taaatgatcc gcgcatcttt ttactcgcct cactatccct gtcccacaca 1560 ccccgagcca caccatctga gaaccccccc acttgtttct taattgattc gggcgccaca 1620 cataacgtgt tgagtgaaga attcgccttg aagagaggat tattacctta cgcaacaggc 1680 aactcaagag ctattactgg attcgatgga tcaaagacca gttcatccca cgacatcgat 1740 ctcacactcg accaagacat ctatttcaca cattttatca ttgtccacct caagaacacc 1800 tatgatggaa tcctaggaat cccatggata cgagccaacc accacaagat tgattgggca 1860 cgcagtaccg tgcacaacac 
cgaactcatg tctgcctctg ccgtcgcaga gtcgtctaag 1920 ccgtcaacac cctcaataat cccggacgtg gactggttgg gggacgctag gaacaacgac 1980 gaggggatgt gcataacgtc cccgcagggt gagtccggta ttcctttaga ttccccaaag 2040 tctgtagaag agatcagcaa gcagttttct cctatactac accagaccag aaccacgaga 2100 cccaacgacc ccagcacaac acacgaatcc cagaacatca ccaccgcggc tgttacctca 2160 gccttgccaa gcccgcacca cacccttgga agcctgaatg agcccacagg gcacgctagg 2220 agctttggcg agggggcttg caattctacg aatgcataca tgcccccgca atgtgagttc 2280 gacacagccc tatctgatcc ttgtagcgaa acagctggca agcgtagttg ttccctgaat 2340 tatagatcag agaccacgca cgacaacacc ggacaatcac aattacaacc accttcacca 2400 gatcatccaa gcattgcgac tgatactcca gtctcgtcaa atccgcaaca cacccctgaa 2460 agcccaaatt aaggagccca cagggcacgc taggaaattt gacgaggggg cgctatgttc 2520 tttgaataca gcaatgcccc cgcaatgtga gttcaaatcc cttaatcctg aaaccgtaat 2580 tgaagcaact ggcaagcaag aacatccctt gaattacagc acagacgcct acatcgacgc 2640 agccaaaaca tcttggtcaa catcggcaaa gctagcggct gacgagaaga agaaagtacc 2700 tacaagaccg gtcaaggaac tggttccatc ccgctatcac aggcatattc acatgtttat 2760 gaagtccaag gcacaacacc tacccccgag acgcaaatac gactttaaag tagaccttgt 2820 agatggagcc caaccccaag ccagccgtat catcccacta tcacctgccg agaacgaagc 2880 ccttgaagag atgatcaaca ctggcctagc aaacgggacg atacgccgaa ctacttcacc 2940 atgggcagcg cctgttttat tcaccggcaa gaaagatggc aacctccgcc cgtgctttga 3000 ctaccgaaag ctcaacgcgc ttaccgtcaa gaacaaatac cccctccctc taaccatgga 3060 tctcgtggat agcctcttag acgcagacaa gtacaccaaa ctagacctcc gaaacgctta 3120 cggaaacttg agagtagcag aaggatatga agacatccta gccttcatat gcaaagctgg 3180 tcagttcgca ccactaacca tgccatttgg gccaacaggg gcgccggggt actttcagta 3240 ctttatgcaa gacatcctgc tgggccacat aggaaaggac acggctgctt ttttggacga 3300 catcatgcta tatacaaaaa agggaactga ccacgaggcc gtagtcgatg gcattctcaa 3360 gatcctggac aaacaccaat tatggctcaa accagaaaag tgcaaatttt ccaaatctga 3420 ggtcgagtac ttaggcttaa tcatatcaaa gaacaaaatc aagatggacc ccactaaggt 3480 caaggcagtt tcagaatggc ctgcgccaag aaacgtgacg gaattacaaa gattcattgg 3540 cttctcgaat ttttaccaaa gattcatcga 
caatttctcc aagacaacac gcccattaca 3600 caacctcaca cgtaataaga caccgtacat ctgggacaac gagtgtaaca aagctttcga 3660 aggactgaag aacgcattca catcggcccc catcctgaag atagcagatc cataccaacc 3720 ctttgtactc gaatgcgact gctctgacta cgcattagga gcagtgctgt cacagagatg 3780 tgaggatgtc ggtgaaatac acccagtagc ttacctttcg cgatctctag tgcaggcaga 3840 gcgcaactac gagatattcg acaaagagct cctagcaatt gtagcggcat tcaaggaatg 3900 gcgccactac ctagaaggga acccgaatcg cctggaagtc atagtctata cgtaccactg 3960 caacttagag acttttatga ccaccaagca actcacacga cgccaagctc ggtgggcaga 4020 aacactggga tgttttgact tccagatcag gttccgccca ggtcgacaag ctacaaaacc 4080 agacgcactg tctcgacgac cggatctagc gcctaaagaa ggagagaaac tgacttttgg 4140 tcagttgcta cgtccagaaa acataacacc ggacacgttt aaagtcgaac ttgccaccgt 4200 agaatctttt ttcgagaatg aggatattga acttgaggat gcagaacact ggttcaaagt 4260 cgacgtgcta ggcaccgcag acccaccatc agcagacgag atcttgaacg acaaagacat 4320 tattgaactt atcagagaag caaccaggaa gaacgataaa ctgaatgaat tgattgaagc 4380 cgtcaagaat cctatctcag cacgagtacg acaagcgaca tcaaagtata catttcaaga 4440 tgggttattg tacaaccaag gcaggatcga agtgccggac gacgaccaca tcaaatttca 4500 aatactacga agccgtcatg acagccttgt tgccggacac ccaggtagag ctaagacgct 4560 gggattagta cgtaggtgtt tcacatggcc atctctgaag gcatatgtca acaagtacgt 4620 ggatagctgt gactcgtgtc tcagatccaa agcaacgact ttgaagccat ttggtgcatt 4680 agaaccttta cctatcccgg caggaccctg gaccgacatt agttatgacc tgatcacggg 4740 attaccaaaa tcaaacgagc aagatagcat cttaactgta gtcgacagac taacaaaaat 4800 gagccatttc ctcccatgcc gcgaaaccat gtcagcaaac gaattagcca atctgatgat 4860 ccgatcagtg tggaaactac acggtacgcc taaaactatc acatctgacc gtggaagcat 4920 tttcatttca cagatcacac aagaactcga caagagacta gggataaggc tacatccatc 4980 aacagcttac cacccaagga cggacgggca gaccgaaatc gtgaataagg ccatcaagca 5040 gtacctcaga cactttgtct gttataaaca ggacaactgg gacgaattat tacccattgc 5100 tgaattctca tacaacaacc gagatcacgc gtcgactggc gtctcacctt tcaaggctaa 5160 ttacggtttt gaacccagct tcggaggaat accttccagt aaccaatgct taccagttgt 5220 agaagacagg ttgaaagtat tacacgaagt gcagcaagaa 
ctgaccgaat ttttgaacgt 5280 ggcacaggaa gagatgaagt atcaattcga taaaggagtg cgaccaaccc cagactggaa 5340 cgttggcgac caagcctggc tcaacaataa aaatatctcg accacacgac catgccccaa 5400 actggaccac agatggctag gcccttttaa tatcattaaa gaaatatccc gatccgctta 5460 taaactgacc ttacctttgt ctatgaaagg agtacatcca gtgttccacg tttcactttt 5520 acaaaaacac actcaggaca acatcaagga aaggaagcaa acggaaccat cacccatcac 5580 aattaatgat gaagaagaat gggaggtgtc agaggtatta gactgccgga aaagatacaa 5640 caagaaagaa tatttgatat catggaaggg atttagtgca gatcataact cttgggaacc 5700 cgaaatcaat ttaacgaata gtcaagattt attagatcaa tttaacagaa aatttccgga 5760 tgcaattgaa agacacaaga ggacacggag gaagtgagag aggacaagct ttttccccac 5820 ggggtttttt aacgctgtcc gtggagagtg cgcagaactt gcaagaggga gtttgggcgt 5880 aaaaaggggg atac 5894 // ID TY1A repbase; DNA; FNG; 163 BP. XX AC M24989; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 11-NOV-1996 (Rel. 1.1, Last updated, Version 1) XX DE S.cerevisiae Ty1 transposable element B10 DNA, segment 3. XX KW Copia; LTR Retrotransposon; Transposable Element; TY1A; KW Ty1 transposon; mobile element. XX OS Saccharomyces cerevisiae OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. XX RN [1] RP 1-163 RA Eibel H., Gafner J., Stotz A. and Philippsen P.; RT "Characterization of the yeast mobile element Ty1."; RL Cold Spring Harb. Symp. Quant. Biol 45, 609-617 (1981). XX DR GenBank; M24989; Positions 1 163. XX SQ Sequence 163 BP; 46 A; 26 C; 26 G; 65 T; 0 other; cgtcgcggag atttttttgc tactgttgcg gttacatata acttactttt tgcatatatc 60 gtaagttatc agacaccatc tggctcactg aaagtttatt tctatgtgct ttctgaaacg 120 tatgtatata tgagattcaa atacaatatt taaatatttt cga 163 // ID I-1_AN repbase; DNA; FNG; 5425 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 09-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Non-LTR retrotransposon from the I clade. 
XX KW Non-LTR Retrotransposon; Transposable Element; I clade; I-1_AN; KW RNaseH; endonuclease; reverse transcriptase; zinc knuckle. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-5425 RA Kapitonov V.V. and Jurka J.; RT "I-1_AN, a non-LTR retrotransposon from Aspergillus nidulans."; RL Repbase Reports 3(12), 207-207 (2003). XX DR [1] (Consensus) XX CC Non-LTR retrotransposon. I superfamily. CC I-1_AN is a non-LTR retrotransposon from the I clade. CC This family was active during roughly the last one million CC years. CC Some copies are less than 1% divergent from the consensus CC sequence. CC I-1_AN encodes two proteins: I-1_AN-1p (pos. 103-1583) CC and I-1_AN-2p (pos. 1581-5141). I-1_AN-1p is a putative CC DNA/RNA-binding CC protein similar to the ORF1 proteins encoded by non-LTR CC retrotransposons CC from the I superfamily. The protein includes the zinc CC knuckle-like CC motif (pos. 360-425). CC I-1_AN-2p includes endonuclease, reverse transcriptase and RNaseH CC domains. 
XX FH Key Location/Qualifiers FT CDS 103..1581 FT /product="I-1_AN-1p" FT /translation="MEVDISPPGGTRPATPLLGENSDPPSGPTTPTPLPRN FT SLKRRALFSPQKTPTAAPVPVSHTPQAPSICEQVGMVADDQLALLHDWKLA FT MTSLAKALDLTVSSLQGRPRDLARELAARFVTLAKQDSPQQISQMPVVAPP FT QPPRQMEQPNHPPTPEASKSPLNRQTSQPTTWASLTAPRTGQGNWQTIAPE FT HHMQAKQTAQRRLKQSNKTDHRIFLRLPASSSLRAIGPHGIRVTLAGKVPD FT GITQVQVISTGYAITTTEQGKAFLLSEKAASLAGDGYFEIPTEYHQVIVSR FT IPKQLWSLDGWIDTTIADISMEAERITGIKPLMAKLSKHPVERDSITAVIA FT FPKKLQHPLQLFGLSGLSRPTRPKQRPLQCTRCHRFYDTRACRSSERCISC FT GSSKQEHNCRVQCINCCGPHAADFQKYPARPHIQRNTITRLSKDALAAICK FT AGRLAFQQEQKKAEESSKQQTDNTHTTNQPTRQLTQELLNQTLTSPEL" FT CDS 1581..5138 FT /product="I-1_AN-2p" FT /translation="MKILQANIGRGGAVYDLLLSFEADIILVQEPWTNTAK FT HLTKTHPQYQLFSPPTRWTARPRTLTYVQRDLPAHSLPEPISPDITTIYTA FT GLTIINVYRPPNNPVAPAGAGSTPSTLSTLLGYAPPENTILAGDFNTRHPF FT WQPDTESHAVTPGATGLLDWLDAHELELRLEPGTPTRGPNTLDLVFSNLPL FT RALVEDHLKTPSDHATIGIILEQEEPPPIYKLGSTNWEKARALASPPDPTL FT PIDLLAKQLVQISQLAIQGASRYNTRRLPRTPWWTPELTDILHQTRQQQNP FT DYKQLRKAIVQAKAEYWKQRIEQATAPIDAFKLAKWIQYPDQLAAPPLNIQ FT GAQVTTPQGKADAFLNHLLEKGALLPNQTEEGPPNKPLGLLHLPTKEHCWA FT ALCAPPPSAPGEDGLATTAWRELWPVLGDTITQLYYRCMEEGCFPLSLKSA FT KVIMLPKPGKRGYTQLNAWRPISLLSTLGKGLERLLAQQIAVRAIQADVLA FT PCHFRALPGRSAIDLVQVLVHRVEEAFQQGKDASLLLLDVKGAFDAVIHQQ FT LLSHLRLQGWHKGLLQLLKDWLTGRSVSVHIKEGTATAPIKGRLPQGSPLS FT PILFLLYAARIVSTLEGSFCYADDMGILLTGNTLEESSQQLVEAYKQITAL FT GTETGLPFSIEKTEIQYFSRKQQQHLPTVTLPGIGEITPSLYTQWLGVLLD FT TKLTFKAHINLVFSRGKRLAQHLKRLSNTQHSCPVASMQAAVIQYILPTAL FT YRAEVFYTGKRQKGVVNSLLSLFCTAALAIIPAYKTTPTAALLREADLPDP FT EALLNSILQRAAVRYMSLDTKHPIAQIAAETTAGRPKTRLKRILQLLLSPL FT PERAIIELPLPPLCMLPTDNKGYSPAPLQISVYSDGSRTSQGAGYGYAIYF FT GPILVTKGHGPAGPRTEVYDAEIMGAVEGLRAALGQPCVGYSTQLVILLDN FT LAAASLLASYRPTPHRHGLSETFSQLAAQWMESPSILTMQRKPLQVRWIPG FT HSGIAGNELADKLAKLGSSIYSPDIPPSPAYLQREAKQWLRTETYTAYANK FT APETYKALNIRPHTKESRSREHKLPWWVLGRLVAARTGHGDFTAYHQRFNH FT SDYLESCSCGRTKTPVHFFFCPYTRKRWKDRWRCIRDGPSKTIDWLLSTAA FT GAEEFSRIVQESSFFKDICPNWARRSA" XX SQ Sequence 5425 BP; 1503 A; 1711 C; 
1158 G; 1053 T; 0 other; gaatggcagg catcacagcg acacgacgag gggaaaacag cacccctact gctctaataa 60 ctgatctctt acacttggct cattgctcat cccctccacc ccatggaggt ggatatctcc 120 cccccaggcg gaacccgtcc ggcgactccg ctcctgggtg aaaactctga ccccccctca 180 ggacctacca ccccgacccc cctaccccgg aactccctga agagaagggc cttattctcc 240 ccgcagaaga ctcccactgc agctccagtc cctgtatccc atacgccgca agccccgtcg 300 atctgcgaac aggtcggcat ggtagcagac gaccagctag cccttcttca tgattggaaa 360 ctagccatga cctcccttgc caaagctcta gatctaactg tctcctccct acagggccgc 420 ccaagagacc tggcccggga gcttgcagcc agattcgtca cccttgcaaa gcaggactcc 480 cctcagcaga tttctcagat gcctgtggtt gcacccccac agccacccag acagatggaa 540 cagccaaacc atcctcccac tcctgaagct tccaaaagcc ccctgaacag gcaaacctcg 600 cagcctacaa cctgggcatc cctaacagct ccaagaactg gccaggggaa ctggcaaact 660 attgcccctg aacaccatat gcaagccaag caaacagcac aacgaaggct gaagcagtca 720 aacaagactg accaccgcat cttcctccgc ctcccggcct cctccagcct ccgagctatt 780 ggaccacatg gcatccgggt cacccttgct gggaaggttc cggacgggat cacacaggta 840 caagtaatat caaccgggta tgcaatcact acaacagaac agggcaaggc tttcctacta 900 tcagagaagg ctgcaagcct agctggggat gggtactttg aaattccaac agagtaccac 960 caggttattg tctcccggat cccaaaacaa ctctggtccc tagatggatg gatagataca 1020 acaattgcag acattagcat ggaagcagag cgcattactg gcattaagcc tctcatggcc 1080 aagctctcaa aacacccagt agagagggac tctatcacag cagtcatagc ctttccaaaa 1140 aagctacaac accccttgca actctttggc ctgtccggcc tatcaaggcc cacccgcccc 1200 aagcaaaggc ctttgcaatg cacccgatgc caccgcttct atgatacacg agcctgccgc 1260 tccagcgaac gctgtatctc ctgcggatcc tcaaaacagg aacataactg ccgtgtgcag 1320 tgtatcaact gctgcggccc gcatgcagca gacttccaaa aatacccagc cagaccccat 1380 atccagagga acactattac ccgcctctca aaagatgctc tagctgctat ctgcaaggca 1440 ggccggcttg ccttccaaca ggagcagaag aaagcagaag aaagctctaa acaacaaaca 1500 gataataccc atactacaaa ccagcctaca agacagctca cccaggagct cttaaaccaa 1560 accctgacct cccctgaact atgaaaatac tacaagctaa tataggaagg gggggcgctg 1620 tatatgacct gctactctcc tttgaagcag atattattct tgtccaagaa ccttggacaa 1680 
atacagcaaa gcacctaacc aagacccacc cacaatatca gctgttcagt cccccgaccc 1740 gatggactgc cagacccagg actctaacat atgtacaaag ggatctccca gcccattccc 1800 tcccggaacc aatctcacca gacatcacca caatctacac ggcaggcctt actattatca 1860 atgtctaccg cccccctaat aacccagttg cccctgctgg tgctggctca acaccctcta 1920 cactttccac actcctagga tatgcacccc cagagaacac catcctagca ggagacttca 1980 atacccggca cccattctgg cagccagata ctgagtctca tgctgtcaca cctggcgcaa 2040 caggattatt agactggctt gatgcccatg agctggaact tcgcctcgag ccaggcaccc 2100 ccacccgtgg accaaacacc ctagaccttg tcttctctaa cctaccacta agggccctag 2160 tagaagacca tctaaagact ccaagtgacc atgcaacaat tggaataata ctggaacaag 2220 aagagccccc gcctatatac aagcttggat ctaccaactg ggagaaagcc agagccctgg 2280 caagcccgcc tgacccaacc ctaccaattg acctactagc caaacaactg gtccagatat 2340 cccagcttgc aatacaaggc gcatcaagat acaatactcg cagactcccc aggaccccat 2400 ggtggactcc agaactaaca gacatactac accaaacaag acagcaacaa aaccccgact 2460 ataaacagct ccggaaggcc attgtacagg caaaggctga atactggaag cagcgaattg 2520 aacaagccac agcacctata gatgcattca aacttgctaa atggatacaa tatccagacc 2580 agcttgctgc tcctcccctg aatatacaag gggcacaggt tactacccca cagggcaagg 2640 cagacgcctt ccttaatcac ctcttagaga agggggccct gcttccaaat cagacagaag 2700 agggaccccc aaacaagccc ctgggcttac tacacctgcc aacaaaagag cactgctggg 2760 ctgctctctg tgccccaccc ccgtctgccc ccggggagga cggacttgcc actactgctt 2820 ggagggagct ctggcctgta ctaggggata caatcacaca actatactac aggtgtatgg 2880 aggaaggctg ctttccactg agcctgaagt cagcaaaggt aataatgtta ccaaaaccag 2940 gaaagagggg ctatacccaa ctcaatgcct ggcggccaat tagcctcctc tctaccctag 3000 gtaaaggcct agagcgcctc ctagcacagc agatagctgt aagagcaatt caggcagatg 3060 tgctagcccc ctgccacttc agggccctgc caggacgctc tgccattgac ctggtccagg 3120 ttcttgttca cagggtagag gaggcctttc aacagggaaa agatgcttca ctactcctac 3180 tagatgtaaa aggggcattt gacgctgtaa tacaccaaca gctcctttct cacttacgcc 3240 tgcaaggatg gcataaaggc ttactccagc tacttaagga ctggcttact ggccgctctg 3300 tatctgttca tatcaaagaa ggcactgcca cagcaccaat taaaggcaga ctcccccagg 3360 gatcccccct 
atccccaata ctcttcctgc tatatgcagc aagaatagtc tctaccttag 3420 agggctcctt ctgctatgca gatgatatgg gcatattatt aactgggaat accctggaag 3480 agagctcaca acaactggta gaggcctaca agcaaattac tgctctaggg acagagacag 3540 gcctcccttt ctcaatagag aaaacagaga tacaatactt ctctagaaag cagcagcagc 3600 atctccccac agttactcta cctggtatag gggagattac accatcccta tatacacagt 3660 ggttaggagt tcttctggat acaaagctta cttttaaagc ccatattaat ttggtcttta 3720 gccgcgggaa acgactcgcc cagcacctaa agagacttag caatacccag cacagctgcc 3780 cagtggcctc catgcaggca gcagttatac agtatattct tccaacagct ctgtacaggg 3840 cagaagtctt ctatacaggc aaacgacaaa aaggggtagt taactccctg ctttctctct 3900 tctgcacagc agccctggct attatcccag cctacaagac cacccctact gcagcactcc 3960 tccgcgaagc agacctacca gacccagaag ctctactcaa cagcatcctc cagagggcag 4020 cagtgagata catgagcctt gatactaaac acccaattgc ccaaatagcc gcagagacta 4080 ccgcgggcag gcccaaaacc aggcttaaaa ggatcctaca gctcctcctc agccccctgc 4140 cagagcgcgc tataatagag ctgcctctcc ctccattatg catgctccca acagacaaca 4200 aaggctacag ccctgcccct ttacagattt cagtgtactc agacggctca cggaccagcc 4260 agggggcagg gtatggctat gcaatctact ttggccctat cctggtaacc aagggacatg 4320 gccccgcggg ccccaggaca gaagtctatg atgcagaaat catgggtgct gtggaaggcc 4380 tacgcgcagc cctgggacaa ccatgcgttg gctactccac ccagctagtt atcctcctag 4440 ataacctagc tgcagcctcc ctgctagcaa gctataggcc aacccctcac agacatggtc 4500 tgtcagagac ctttagccaa ctagccgccc agtggatgga aagcccttca atcctaacca 4560 tgcaacggaa gccccttcag gtccgctgga ttccaggcca ctctggaatt gctgggaatg 4620 agctggcaga caagctcgct aagctagggt cttctatata cagccccgac atccccccct 4680 ccccagcata cctacaacgg gaggcaaaac agtggctccg tacagagaca tatacagcat 4740 atgctaataa ggcgcctgaa acctacaaag ccctgaatat cagaccccat acaaaagaaa 4800 gccgctcccg cgagcacaag ctgccctggt gggtacttgg ccgactcgtc gccgcccgta 4860 caggccacgg agactttacg gcataccacc agcgcttcaa ccactcagac tacctggaga 4920 gctgctcttg tggcaggacc aagaccccag tgcacttctt cttctgccca tataccagaa 4980 agcgctggaa agatagatgg agatgtataa gggacggccc gtcaaaaaca atagactggc 5040 tcttaagtac agctgccggg 
gctgaagaat tcagccgcat cgtgcaagaa tcatccttct 5100 tcaaggatat atgcccgaac tgggcccgcc ggagcgcttg atagtgcgac agtccacaca 5160 tctacctgga taaagggtac ggcccctccc cccaatctat aggtagtcaa aacgggcatc 5220 tgccctcgaa gacctggcca gggcagcgcc gggtgcttct tccgctcatt tccaacatat 5280 attgtccata gttgctgctt caaacctgta tctagctagt tcctaggcag ttctgtttag 5340 gtagcacgtc cagatgcccc ctgggaggcc gcagatcacg tgggccccgt gatccgccga 5400 gtgacgttaa ataataaaac caaac 5425 // ID Gypsy-15_LBS-LTR repbase; DNA; FNG; 656 BP. XX AC ABFE01001144; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_LBS_; KW Gypsy-15_LBS-I; Gypsy-15_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-656 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001144; Positions 7854 8509. 
XX SQ Sequence 656 BP; 184 A; 126 C; 141 G; 205 T; 0 other; tgtttaaagg ttcagtccat aggaccacaa aaaaccggtg gaccgaactg aactggacca 60 tggtccagtc tttttttcag ttgcagttgc cccaatttgg ggttggtccg gttgccggtt 120 gcctcatttc gaaaatcttt taaaaccgtt caaagaccgg ttttaatcgg ttgcaaccgg 180 ttaaacggtc acgtgcgtta catagccttg taaccacatt tattgccttt tattgggatt 240 ttgtttcatc aaaaacggtt aagaattgag aaagatataa ccaaatccat attttaccag 300 agtctctagg agatgtaatt cagtttcagt catagctatg tcaaatgttc attaaatctc 360 ttgaaaattg atagaatgac gggtatttat atgctttact attttatata acccgtttat 420 ttttcagagt aataatggtt gagatacaaa tggtttatat tggtataata aattatacgc 480 aaaaaaaccg gtctggatcg gtctgaaccg gtatcatgga ccgcaaaaga ccggtcctag 540 gtggttcggt tcggttccct caatatctgg gtcggtcctg gaccggttgc ggtccacggt 600 tgcgcgtttt cggggcaaaa aaccggactg aactgaacct ttaaacacta agtaca 656 // ID Copia-39_MLP-LTR repbase; DNA; FNG; 383 BP. XX AC AECX01001583; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-39_MLP_; KW Copia-39_MLP-I; Copia-39_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-383 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001583; Positions 1584 1202. 
XX SQ Sequence 383 BP; 82 A; 81 C; 74 G; 146 T; 0 other; tgttacggct gaggtgacca ggcaccgtag gtgacaaggc aatcagccta gtctcacccg 60 ttttcggttt agcccaggtg gaagatccag gaatgttgga tcgggctggt ttcatctttt 120 tgtgtttcgc gtcattgttt gttccgccta gctgaaactt gtttcatata aatagttctt 180 ctttcttcct aaacttgtgt tggtctctct attgtcacaa acttctttgt ttcggtagaa 240 tcatcactaa cactagttga actaggtaga ttcttatttc ttctattttt gtcttttatg 300 aaattgtcac aaacttcttt gtttcggtag aatcatcact aacactagtt gaactagttt 360 acgtttcatc tgctctcgtc cca 383 // ID Gypsy-1-LTR_AF repbase; DNA; FNG; 671 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Long terminal repeat of the Gypsy-1_AF LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-1_AF; Gypsy-1-I_AF; KW Gypsy-1-LTR_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-671 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-671 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-1_AF, a family of gypsy LTR retrotransposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 61-61 (2006). XX DR [2] (Consensus) XX CC This is a long terminal repeat of the Gypsy-1_AF LTR CC retrotransposon. Gypsy superfamily. It is characterized by 5-bp CC target-site duplications. 
XX SQ Sequence 671 BP; 214 A; 139 C; 125 G; 193 T; 0 other; tgttacgaca ccgacacagc cgcgctagcg caaggcggat gtgacaggca gaaacccaaa 60 gatcaatgat caccagtcaa tcccggagtg atcagggcac gatccggaga aatgcttaat 120 ctagataggt tcccgcagaa atacagttca gtagttgctt actgaatatt aatgcttaca 180 taataaagca cactattaag gagcacctca cgtgataaat ggtatatgga tggtatgcta 240 agacaacatt atgaaggtat tacaataata taccaccacc atacccagga tataccaccg 300 aatatactac ctggaggaac tatgggatat aaggacactc attcacgctg tatctagaac 360 tcttagttta gttccttata tttaatatta attatttact taagtacaag atcactagtc 420 tgattcgtca gctactctat actattaagt atatatactt aataggcctt gcttctatct 480 ctactattat tattcttatg ttacctttag ttatctttaa cccttagtta ggtcattata 540 tcactgtgtt atcaggctat ctagttaata ggcctcaggc aagtattaga tatagtccag 600 ggactatatc gccccgttcc caaagtagta gactagcaga gaacagcaga gacggatccc 660 agggtgtgac a 671 // ID hATw-1_LB repbase; DNA; FNG; 4918 BP. XX AC . XX DT 12-JAN-2009 (Rel. 14.02, Created) DT 12-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE hATw-type DNA transposon family from Laccaria bicolor: consensus. XX KW hAT; DNA transposon; Transposable Element; hATw; 7-bp TSD; KW hATw-1_LB. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-4918 RA Bao W. and Jurka J.; RT "hATw-type DNA transposons from Laccaria bicolor."; RL Repbase Reports 9(2), 650-650 (2009). XX DR [1] (Consensus) XX CC TSD is 7-bp long. 
XX FH Key Location/Qualifiers FT CDS 1172..4216 FT /product="hATw-1_LB_1p" FT /translation="MAPLASRKRQAAVANLKAQEVLNEKRAVLASETAKDD FT LWISLQAAKSRIEELENLLADRDTECCRLESELDKANQKLQMHKDNSVLWQ FT EKHERTYHELRMQRQTSKRGQQKLTQLQNQVEILKTAEKETSKQLLRGSRE FT SHKAIALLQKQNNTLHNELSMSMAKWTSQLEKSHAKLAKSTSDLTTLRNKA FT SKLRKAVKCGKKQKEQSIASVKKKILDQRSVHYLMQKGVFTEETRNVVRLL FT VKAGCSRNLVGEVILAVLKSAGITGVGSISRTSVSRFLHEGYFAAQIQLGY FT EMKKAESMTFSADGTSHRSINYNSRHVHLIAEDYTSPEGSSKQQVTRTFGI FT QSSKDGSSEEAIADWENTLKKIIDLYNNSPLGKRSGGLLKFIELLIKLAGM FT STDHCAKEKKDARLLETLKAWAVDQHLGEEKMLEMTLEEVRDYFKKAEEEM FT IRKAGGVNKWNKLSDIKKAERKAKMIEEAVAELGKEEFDNLSDEEKHIFRL FT FIWAGCGCHKDLNTIRGGYLAVAAWWIENGLEEERPVLLANRDNDPVVQER FT ATALEKGDTLTPAQERAFHKSTRGAIKTAEIAGAIFNNKDDKRGHHDIFRY FT WWWEHVGVPFTFPDTSNNRFQSYCNAAAALILYGDEFKDFLESLRINKQNP FT TLNHMELNLWKALHCTSTVTELAVLAIYAEAVSYPYMKAIRTSDAQKKNML FT DLGPFHSRVYDHMQKIIANPDILIGKDLDISESYKTATLDGEKWQNPAVVK FT KIFDLIPTLPHFHNLLIAFFKGAAQTWERFTSEFAPGGLIDEATAEERELA FT HMPATNDENEGLLGSFRRLMHYQPQLTLLSCNALIMFFRNNTQAFMETKFT FT EEDYQYIHKLGREANGEEKKRRNELTDFRDQRQAKKTARKEVREQNAKATA FT ERIAQLELVFDKEKVPGLKGTSLKDQLRLFKSAGAPNLKQGPMPTKVADIR FT KALVDAIDLHTNGAWKLIQDEESEGENINLSEEEEDEEDEEDEEDEEEGWE FT DIEGYESE*" XX SQ Sequence 4918 BP; 1493 A; 998 C; 1076 G; 1351 T; 0 other; tggtgttgca caatcctcgg taagtgtgat gaccagtgac atgccgtgac atggtgtcac 60 aggcccaata tcttgagaca ttgagtttgt actttccaca gcacctattc aatatatttt 120 ttgcaatggg tttgattgaa ttaagtgaat gaatccactg ggagagtttt ccctggagaa 180 attgcattta ttgctccaaa aatgcgaaat cagtctgttt tccctgccaa atgaggcagt 240 ctgaagagat tggactataa cttttcaaca acagcacgta tgaacatgat ttgcagacgg 300 attttactta gaattcaaat tccaatcgat ctacggatcc agaattttta tttgtggcac 360 ctgtgccgag atacccttga aaatcagcac ttttacatgg ttaaaggcta gtaattaaaa 420 atatggaccc gtaggattgt agccacatag taatcgatga tttcgtctct gcaaacctaa 480 gcccaaatcc tatctgttgt gaagctaata gctcgaggag gcaaccttgg gccaataggt 540 gctgcacagc cactgatcca attccagaat gggtttgcac aacatgtggc atctgcatgg 600 caccctcatg catgcaggaa tgttgctgga tttcttggag 
acagcttctt tacacttgac 660 caccacctca acagcctgta gacccaaaac acccccattt ttttgtgctt ttgcctgctt 720 ttttccatca aaaagcatcc aagtccatac tacaagtgag ggtttgtgct tgagcttcag 780 aaatgcatat acatcttatt attgttttgc atctaggatc tattcagtaa ttgaacatct 840 gctttaggta tagatatagg cagatcattc aattcaagca tataactata tatttttgca 900 ggtattgtta gaacttagta ccacccttgt cacctctctc ttcccctggg tttcattttt 960 tgcttgtctc tctttctctg tggttgttct ctcagttatt ctcatctcta gtactactct 1020 tgtcaggaga ttctttgcct ttctctctct gggcttcatt cttgtttgtc tctcttgctc 1080 tgtgcttgtt ctctgagtct cggtaattct catctcaggt gatttctgca tcttggccac 1140 tttgtagtga caacgtctct ggcttcacat tatggcacca ttagcatcac gaaaacggca 1200 ggcagcagta gctaacttga aagcacagga ggttctcaat gaaaagagag ctgtattggc 1260 ttcagagact gcaaaagatg atctctggat tagcttacag gctgcaaagt cacgaattga 1320 ggagctggag aatctgcttg ctgacagaga cacagaatgc tgcagacttg agtcagagct 1380 tgataaggct aatcagaaac tgcaaatgca taaggacaac tctgtgcttt ggcaggaaaa 1440 gcatgagaga acatatcatg agcttcgtat gcagcgccaa acctcaaaaa gagggcaaca 1500 aaaattgacc caattacaga atcaggtaga gatcctgaag actgcagaga aagagacttc 1560 taagcagctt ttgagaggct ctcgtgagtc acataaggct attgcattac tacagaagca 1620 gaataacact ctccataatg agctctctat gtctatggct aagtggacct cacagcttga 1680 gaaatcccat gccaagcttg ctaaatcaac ctctgatctg acaacattac gaaacaaagc 1740 ttccaagttg cgcaaggcag tcaaatgtgg taaaaaacaa aaggagcaat caatagcctc 1800 agtcaagaag aagatcttag accaacgatc agttcactat ttaatgcaaa agggtgtctt 1860 cactgaggaa acacgcaatg ttgtccgttt acttgtcaag gctggctgct cacgaaacct 1920 tgttggtgag gttattttgg ctgtcctcaa atctgcagga ataacaggtg ttggcagtat 1980 cagccgcact tctgtttctc gatttcttca tgagggatat tttgctgcac aaattcagct 2040 tggatatgag atgaaaaaag cagaaagtat gactttcagt gcagatggta caagccatcg 2100 tagcatcaac tacaattccc gccatgttca tcttattgct gaggattaca cttcaccaga 2160 gggcagttca aagcaacaag taacccgaac ttttggaatt caatcatcaa aagatggatc 2220 tagtgaagaa gctattgcag actgggagaa tactctcaag aaaattattg acctctacaa 2280 taacagtcct cttggaaagc gctcaggtgg acttctcaaa tttattgaac 
ttttgatcaa 2340 acttgcagga atgagcactg atcactgtgc aaaggaaaaa aaagatgccc gattgcttga 2400 aactctgaaa gcttgggctg ttgatcaaca tcttggagaa gaaaaaatgt tagaaatgac 2460 attagaggag gttcgtgact atttcaagaa agcagaagag gaaatgatta gaaaagctgg 2520 tggggtaaat aagtggaaca aactgtctga catcaagaag gctgagagga aggccaaaat 2580 gattgaagag gcagttgctg agttgggaaa agaagagttt gataatctct ctgatgagga 2640 aaaacatatt ttccggctct ttatctgggc tggctgtggt tgccacaagg atctgaacac 2700 tattcgggga gggtatcttg cagtggcagc ttggtggatt gagaatgggc ttgaggaaga 2760 acgccctgtc cttcttgcca atcgtgataa tgatcctgtg gttcaagagc gagccactgc 2820 tcttgagaaa ggtgacaccc taacaccagc tcaagaaaga gcttttcaca aatcaacacg 2880 tggtgcaatt aaaactgcag aaattgcagg tgcaatcttc aacaataaag acgataaaag 2940 gggccaccat gatatctttc gttattggtg gtgggaacat gtgggagttc catttacatt 3000 tcccgataca tccaacaata gattccagtc atattgtaat gctgctgcag cccttattct 3060 ctatggagat gaattcaagg attttcttga gagtctacgc atcaacaagc aaaatccaac 3120 actcaatcac atggaattaa atctctggaa ggctctccac tgcacttcaa cagtgactga 3180 gcttgctgtt cttgcaattt atgctgaggc tgtttcatac ccatatatga aagcaatccg 3240 tacttctgat gcccaaaaaa agaacatgct tgaccttggt ccttttcatt ctcgtgtcta 3300 tgatcatatg cagaaaatca ttgcaaatcc tgacattctc attggaaaag acctagacat 3360 ttctgaatca tacaaaacag ctactctaga tggtgaaaag tggcagaatc ctgcagttgt 3420 aaagaagata tttgacctca tccctacact ccctcacttc cacaatcttt tgattgcatt 3480 tttcaaagga gcagcacaaa catgggaacg tttcacatca gaatttgcac ctggtggcct 3540 aattgatgaa gcaactgcag aggaaaggga gcttgcacat atgcctgcaa caaatgatga 3600 aaatgaaggt ctgctagggt ctttccgacg cctcatgcac tatcagcctc agcttacact 3660 tctcagttgc aatgccttga taatgttttt ccgcaataac acacaagcat ttatggaaac 3720 aaagtttact gaggaagact atcagtatat acataaactt ggacgagagg caaatgggga 3780 ggagaagaaa aggcgcaatg aacttactga tttccgtgac caacgacaag caaagaaaac 3840 tgcacgcaag gaggttcggg agcaaaatgc aaaggcaact gcagaacgga ttgcacagtt 3900 ggaacttgtt tttgacaagg aaaaggtccc aggactcaag ggcacgtccc tcaaggacca 3960 attgagactc ttcaagagtg caggtgctcc aaatctcaag caagggccaa tgccaaccaa 
4020 agttgctgat atccgcaaag cacttgtaga tgctattgat ctgcatacaa atggtgcttg 4080 gaagctcatc caagatgagg agagtgaggg tgagaatatt aatctgtcag aggaagagga 4140 ggatgaggag gatgaggagg atgaggagga tgaggaggag ggttgggagg atatagaggg 4200 ctatgagagt gaatgagatg ttgccagaaa actgcatttt ctttggatat ttttcatgtt 4260 tagcattatt acaacacttt gaatgtataa agtaggctat tatacaacta ttctacaaaa 4320 gaaaaactaa gggaagtgat cacaactcat tctggaattg gatcagtggc tgtgcagcac 4380 ctattggccc aaggttgcct cctcgagcta tgagcttcac aacagatagg atttgggctt 4440 aggtttgcag agacgaaatc atcgattact atgtggctac aatcctacgg gtccatattt 4500 ttaattacta gcctttaacc atgtaaaagt gctgattttc aagggtatct cggcacaggt 4560 gccacaaata aaaattctgg atccgtagat cgattggaat ttgaattcta agtaaaatcc 4620 gtctgcaaat catgttcata cgtgctgttg ttgaaaagtt atagtccaat ctcttcagac 4680 tgcctcattt ggcagggaaa acagactgat ttcgcatttt tggagcaata aatgcaattt 4740 ctccagggaa aactctccca gtggattcat tcacttaatt caatcaaacc cattgcaaaa 4800 aatatattga ataggtgctg tggaaagtac aaactcaatg tctcaagata ttgggcctgt 4860 gacaccatgt cacggcatgt cactggtcat cacacttacc gaggattgtg caacacca 4918 // ID MAGGY_I repbase; DNA; FNG; 5132 BP. XX AC L35053; XX DT 30-MAY-2000 (Rel. 5.04, Created) DT 30-MAY-2000 (Rel. 5.04, Last updated, Version 1) XX DE MAGGY_I is an internal portion of MAGGY, a gypsy-like LTR DE retrotransposon. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; MAGGY_I; MAGGY_LTR; gag; pol; protease; KW reverse transcriptase; internal portion. XX OS Magnaporthe grisea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-5132 RA Farman L.M., Tosa Y., Nitta N. and Leong A.S.; RT "MAGGY, a retrotransposon in the genome of the rice blast fungus, RT Magnaporthe grisea."; RL Unpublished (1994). XX DR GenBank; L35053; Positions 254 5385. 
XX SQ Sequence 5132 BP; 1348 A; 1635 C; 1218 G; 931 T; 0 other; ttacgtagct ccttcattag gtgcccgcga tgcctgcgtc actgcaaccc ccgatcttaa 60 ccgattcgac cgaggaactg atcgaacacc tacaaagaca ccctgaacaa tgggtgtctt 120 acatctccga ttcctaccaa aagctgcaac tattgcacga tagcaacgtc cgcctgaact 180 cctgcctaga aatgcaggaa gccgagaccc aacgcgcccg cgccgaaacg gaccgtgtcc 240 gcgacaaggc gcaggccgac attctggcca tggccatgga aaaggcctcc gccataacct 300 cccgggatgc cgctttcgcc gaattagaga aaacacggtc cgaactgaag gaagttcgga 360 ccgtaacgct acccacggta cacataacca cccccgcccc aactaccaac acgcctgttg 420 aaccccttat ggttactcct atgggaacga ctccgccgcc tgcttccgaa catcccgcct 480 ccgcccgcct ttctgaacga cttccagacc ccgataaatt tacgggcgcc cgctccgacc 540 tccgccgctt cgccacccaa atccggggga agatgacctc aaacaaagac cgcttcccca 600 accccgaatc acgcctgatt tatatagccg gtcgactgtc cggtaaagcg tacaacctga 660 tcctgcccaa aatggtcgga ggcacacccc aattcggaga ttatacggat ctcctccaat 720 acctagagga ggcattcggg gaccccgacc acgtccaaaa cgcacaaaac aaactatatg 780 ccctgaagca gcggaacgta gatttcgccg agtatctgtc ggagttccaa cgcctgtctt 840 tggaaggaga aatgccggag gatgccctcc cccctctttt attccaggga atatcggaag 900 agctccagga catgctcctc cacaaccccg ccccatctcg ccaataccac gaattcaccc 960 gacacctgca aagcttggat aaccgctacc gccagcacca gcagtataaa aatagacaga 1020 cacgtacccc tcgggccgca gcgccccccg cgcgcgcggc tccccgaacc caacccgaca 1080 taccgcgggc cgcccccaag ccaaacgcgg agctcccgct taacgaccct atggacctga 1140 gcagccaacg ccgccacaac cgcaaggaaa acatgctatg ttaccgttgt ggatcccaag 1200 aacatttcgt tgccaaatgc ccggaaccgg atacccgccg cacgcggctg caccaggccg 1260 gaatcgaacg atccaggtcc aggtccaggt ccccgcgcca aatacaaacc aaactcccta 1320 aaaacccccg ccgcggctcg gacgccagcc gctccagcag ccgaagttcg aaaaacggag 1380 cccgcctggg ataagtcgcc cccaggccgc caaggccgcg catagaccgc cggcgatccg 1440 catctctgcc gccgccgttc acgggttccc cgttgaagag gaatataacc aacgatccga 1500 cctgatgatc ctgcctgccg aactgacagt ggggggccaa aacctcccaa cctatgccct 1560 caccgattgc ggtgcggaag gaaaatgctt cctcgaccaa ggatgggccg aagaacgcca 1620 actacaaatg tatccgctgc gaaacccatt 
cgacatcgaa gtattcgacg gaaggaccgc 1680 cgaaagtggg aagtgcaccc attatgtccg aggacaattg agaatcaagg accacatcca 1740 gaaaaacgcg ctgttcttcg tcacacagct agcccactac cccattgtgc taggaatgcc 1800 ttggctaaag cagcacgatc caacgattgg attcgcgtca cacgttatta cgtttgacag 1860 cgactattgc cgccgacact gcaatatgcc cgacaaaccg gagaaggtta aagctttaca 1920 cgccgtacca aaaaaaagcc gcccccagta ccaggctgac cgaccgccgt ccctccgcga 1980 aatggatatc gcccctatct ccctgcaagc cgcgagcatg tacgcccgcc gccgatcctg 2040 ccgactgtac gccgtaacct tgaaacagat tgatgaggtc ctagccgccg acccccaaaa 2100 taggaacggg ccgaccctgc ccgaaacaat ccgggaattc gcggacgtat tttccccgca 2160 agaagccgag aagctgcccc cacaccgacc gtccgaccac catatcccgc ttatagaagg 2220 aaaaacgccg ccgttcggcc ccctgtacgc catgtcccgc gaagagctga tagcccttaa 2280 ggaatggctg accgcagagt tgaaaaaagg gtttatcaga cccagttcgt catccgtcgc 2340 ctcgcccgtt ttgttcgtga agaagcaggg aggagggttg aggttttgcg tggattaccg 2400 cgcgctgaat aacattaccg taaaagaccg ttacccgctg ccgctagtcc gagagaccct 2460 gaacaacctg gccggcatga aattcttctc gaaaatcgat atcgtttccg cttttaacaa 2520 tattcggatt aaaaaggggg aagaatacct gacggcattc cgcacgagat tcggcttata 2580 cgagagccta gtcatgcctt tcgggctaac gggagccccc gcgacgttcc agagatatat 2640 aaacgactcc ctgcgcgaat acttagacgt attttgtaca gcctacctgg acgacatttt 2700 gatttatagc cgcacccgaa cagaacacga agaacatttg aaactcgtac tggaagccct 2760 gaggaaagcc gggctatacg ccaacgccgc gaagtgcgaa ttcttcgtga cggaaaccaa 2820 gttcctgggc ctgttggtcg gcgtcgaagg cgtaaaaatg gaccctgaaa aaatcaccgc 2880 tgtcctggac tggcaaacac ccaaaaagct tactgacgta caagcatttt tggggtttgg 2940 caacttttac cggcgcttta tccgagattt ctcaaagatt gtagcccctt tgacgagatt 3000 gaccaagaaa gacgtcgcat tcgaatggaa tagcgcctgc gaaaccgcct tccgactgct 3060 gaagagaaag tttaccgaag cccccgtact ggcgcacttc gattgggaaa aggatgtgat 3120 cctggagacc gacgcttccg attatgtttc tgcgggaata ttgtcacaat acggcgacga 3180 cggaatactc cgccccgtcg cgttcttttc gaaaaaacac acagctaccg aatgcaatta 3240 cgagatttac gacaaagaat tgctagccat tatccgctgt tttgaagaat ggcgcccgga 3300 actcgaaggg acgtcgtccc cggtccaaat aattacggac 
caccgcaacc tggaatattt 3360 cacgacgacg aaaatgctca accgccgaca agcccgatgg gccgaattcc tgtcaagatt 3420 taatttccgt atcacctacc gacccgggaa acaaggagcg aagcccgacg cactaaccag 3480 gaggtcagag gatatgcctg aggaggggga tgaacgactt aagcaccaaa cacaagtcgt 3540 actgaaagaa cacaatttac ctgcccgccc cacccggtta caacccataa tccgccaaaa 3600 caaaccccta ttgccaagac acctgataga gctactagac gccggatacg aatccgaccc 3660 gataacgcaa tctgccctag aagcgttgag gacaggcgcc gaccgccacc ccaagctgca 3720 attggccgaa tgcgaagaac gatccggcta tttgtattac cgcaatagac tgtacgtacc 3780 cgattcgaat aatctgaaag ccgagatcct gcgccgctgc cacgactccc ccgtcgccgg 3840 tcaccccggc aaagcaaaaa cctacgacct gctgtcccga gaatattact ggcccggaat 3900 gctacattac gtatcattat gggtaaagaa atgccagacc tgccgccgaa tcaacccgtc 3960 ccgcgaaggc caccagggcc tcctgcgacc cctgcccact cctgaacgct catggcaaca 4020 cctgtcgatg gattttatca cacatttgcc gcaaagcaac ggccacgacg ccatcctagt 4080 ggtcgtagac cgcctgacta agatgagaca cttcgtccct tgtaaaggga cctgcaatgc 4140 cgaggacacc gccaacctgt acctacacca cgtatggaaa ctacacgggt tgcccctgac 4200 gatagtttcg gatagaggca cccaattcgt gtcgaagttt tggaaacacc tgacaacccg 4260 tttgaaaatc gacagcctgc tatccactgc tcaccacccc gagactgacg gacagaccga 4320 gcgttttaac gcgtccctgg aacaatattt gagagcttat gtggcctacc tgcaagatga 4380 ttgggaaagt tggcttcccc tcgccgaatt cacagccaac tcccacaagt ccgagaccac 4440 cggaacctcc ccgttttacg cgacttacgg attccatccc cgcatgggat tcgagccggt 4500 acccctgaac cagcctttgc ccgcccagcg ggatgccgaa aagctagccg cccgaatgga 4560 agccattctg gaacaagccc gcgccgaaat gaccgccgcc caagcccgtt acgaagaaca 4620 ggcgaaccga caccgtaccc cggcccgccg gcttaccgtc ggccaatatg tgtggttaga 4680 cgcacggaac atccagaccg cccgcccgca gaagaaactg gattggaaga acttgggacc 4740 cttccgaatt tctgaagtga ttagcccgta cgcctaccgt ttggacctgc cgtcgtctat 4800 gagagtacac cctgttttca atataaaccg actaaaacct gctgacgttg agcccctgcc 4860 gggccaacaa cctgcaccgc caccccattt ggaagtggaa ggtgagagag aatacgaagt 4920 cgaagagatc ctcgactcct tttgggaaac ccgcggccga ggaggccgcc ggctgaaata 4980 tatcgtccgt tgggctggat atagcgaacc cacgactgaa cctgccgatt 
acctggaaaa 5040
     cgctgctcaa ttggtaaaga atttccaccg tcgatacccc cataagcccg ggccccggcc 5100
     gtgacggagc tcggcctgga aggggggaat ac 5132
//
ID   Gypsy-6_CCO-I repbase; DNA; FNG; 11284 BP.
XX
AC   AACS02000011;
XX
DT   29-JAN-2011 (Rel. 16.02, Created)
DT   29-JAN-2011 (Rel. 16.02, Last updated, Version -1)
XX
DE   LTR retrotransposon from the mushroom Coprinopsis cinerea genome:
DE   internal portion.
XX
KW   Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_CCO_;
KW   Gypsy-6_CCO-LTR; Gypsy-6_CCO-I.
XX
OS   Coprinopsis cinerea
OC   Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina;
OC   Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae;
OC   Coprinopsis.
XX
RN   [1]
RP   1-11284
RA   Jurka J. and Kohany O.;
RT   "LTR retrotransposons from the mushroom Coprinopsis cinerea
RT   genome.";
RL   Direct Submission to RU (29-JAN-2011).
XX
DR   Genome; AACS02000011; Positions 45095 56378.
XX
CC   Positions [7141-7602] - Reverse transcriptase
CC   Positions [9070-9552] - Integrase core
CC   'CAACG' target site duplication
CC   LTRs are 100% similar to each other.
XX FH Key Location/Qualifiers FT CDS 3051..5003 FT /product="Gypsy-6_CCO-I_2p" FT /translation="MSEPTTGKRSSKSGSKKSSGTKGDSDSQAPPGSPAKT FT EPKSTPASLSGSNQDDEFDLDLKPMTDLKTNEPVSNPKPDSQRNEPSGDLP FT IIPLGLVGQPLQAPKNWADHLVPILSPFIGSYSRNAPQIKDVISSTEYASR FT IEDGLTLGEEYYRGSLLRVSHSRHPTRSEIRTSFREEYDMLNDFLTNVFQC FT PTESGNGYCIDRSKLVRLNTRWLQLRNLFTDELLMTGLNCSAPRWGPYGSA FT TDIWSANDFEVLAICYRHEVESFLSRILDARKEHETIVSNAGDLTRLGIKQ FT ELTGEMGEGRKEGASLKSPFTFKPSRELTSTPREEGATRNVALETPKSMRQ FT PPSTTPKTDDPFGKKSRITYDTDTAEGRTIPPNFRTRMMSEDPFFRGAPAQ FT TSSSRKFQELFGSPTDDRLSNTGSKNPFRRNAYRNHSEDEDHRNEPEHEGE FT GDNPPPSINRFNGRSGGGQPGDSSDSSSNDERRPNRDERRDPFRRARREGR FT EFHRRSDVEDRFVQSTRSVPEPQFDTKLKVDIIPSWDGSPETLVRWITKVN FT RLSAKSKTVHQQLGNLVPGRFTGSAETWYFSLPLRTREGYEQNWTTLRNGI FT ANYYMNRSFIEKQKRRADRAHYRQAATPGRRPVSTSSGRSSYWSSLTTIRI FT AR" FT CDS 5575..10296 FT /product="Gypsy-6_CCO-I_1p" FT /translation="MSVITNPFFKPDNPYHLQALNRKARAAINRKIKRKLF FT REVSLSSFLASGKVDDKERLIELLPVSPKPPGCAFLGSKATEVKTRLGSLD FT ESEVSTTIDTGSDITLISSETILRMKSRPKIRTGQRIKLVQVTGNAVITGY FT VTLKVYFETEEGPVVITVEAYVVRGMSVPFLIGNDFSEQYSLSVLRSNGAT FT TLILGDSGRKISVTDNPVFALKDCQGQTCKVTVDPQVSQNNSKARAAKRRR FT VRLKAAAAEVHTRVTVVIPAGTTKRVPIVIHKQFTLQSVEPFLERGAVYRD FT SGKEDLLFGWTDTLINKDSPSVLLSNFSSEPIVVEEATYLGKLEDATLCLE FT RECDLSEKEKKQRICHSRLISSLVNQELEAQASSLLNQTIWKDPTHNEDVL FT TDETLSRPVEGGPKTAEVPPLDVPSREFLEVVDLAPELSAQERQRLIEVLW FT RNKDAFALDGKLGNYEEEVTVPMKEGSKPVSLPPFPMSPANRAVIDEQMDK FT WMKLDVIEPSKSPWAAPVFIVYRNAKPRMVIDLRKLNESVVPDEHPIPRQE FT EILQSLQGCKYLSSLDALAGFTQLSIHEDDREKLAFRTHRGLFQFKRMPFG FT YRNGPAVFQRVMQNVLSPYLWLFTLVYIDDIVIYSKTFDEHLAHVDLVLKA FT IMESKLTLSPDKCHFGYGSILLLGQKVSRLGLSTHKEKVQSILDLETPKNV FT KTLQTFLGMMVYFSSYIPFYSWIAHPLFQLLKKGTKWEWKAEHQNAFELCK FT EVLTEAPVRAHAMPGRPYRVYSDACDFGLAAILQQVQPVEIRDLRGTRTYD FT RLRKAFDKGEKVPVLAQTISKDVKDVPEDVWASDFESTTVHIERVIAYWSR FT VLQSAERNYSPTEREALALKEGLVKFQVYLEGEKVLAITDHAALQWSRTFQ FT NVNRRLLSWGLVFSAFPNMTIVHRPGRVHSNVDPISRLRRRIPDQLSPNVT FT ENKGLTLNESDGPMKDMYEALCPKLEEKVLSLMTQCLETEELEDVSQSSEP FT 
IRLTVSTSKGEEINSRLERAARNHSLLISVDPTELTRWKEAYSTDSHFTKI FT LNHLQENDPVTNQRYAQSDEGLLYFNDNLGNNRLCVPSSLVNEIMSEIHDR FT KTEGAHGGYFKTYNRVGSTYYWPGMSRTIKRWVETCDVCQKTKPRRHGPVG FT HLQSIPIPTRPFETLTMDFIPDLPTTPNGFNNVFVVVDKLTKYGFFIPTTT FT TLDDEDSARLLTNEVLLKYGLPRQIISDRDPRWTSGFWKEICRLLGVKRAL FT TTAYHPQSDGQTEIMNQYLEIAIRAYITPNRTNWDELLKHLAFSYNTSVHT FT ATGFSPFYLLHGFHPATESSLMHAKEGIERPGESLSATDERAQGFAEAFAA FT ERHRAQEALALAQLSQQRDYNKGRLAKEFEVGDLVVINRDSLELGRTVKGR FT GKKLDPKYEGPFEILEKISAVAYRLRMPVSYGIHPVLNIAHLEKYEKSPTE FT LGPRTKLRKLRADFDQLEEVEVERIVDERTRKVGSRQVKEYRVRYAGFPPD FT HDEWMRERGLKNAPQVLRDWEEERRRMKNKIADQTKNEKRLKGRRGGGKRA FT SVQPSST" XX SQ Sequence 11284 BP; 3150 A; 3012 C; 2682 G; 2440 T; 0 other; ttggtggaca aatccggaac cttacgtcag tttacggaga tacgtcccaa cgcgtgttca 60 tcgcgcccat cacactcgac ccaccacgat ttcgcgagtg accgttcgat cagtattatt 120 cgatctgcac cgccacgact gggcatggga acagcggcgg ccacaagccc cgcacatcgt 180 cctccgggct tgccctcacc gctcctcccc gtcgcttacg gtgggactag agatggacga 240 tgagcggggg ggaagacccc accccaaccg cggatcaaag gggtatgtat ggcggtgttt 300 gtctgggcca tcaggcacca ctgtagccac tctcctccta agtaaggtct tcgacgttcg 360 tcgaacaagg tacgcatagt ccgtgatgaa gattgaacta cagtagatat gtattttgaa 420 tatagcaata ttatccaacc tcatagaact cccgaattca cgcagtgcag gtcttggttg 480 taagtgcctg ccctaaagtg ccgcagttaa gcgcccattg cgcatgaaaa tcactcatgc 540 tgttccatgg ctctcgtgct tatgaggcat gctacgcctc aataccaaga aaactgacat 600 tatcatgggt ggtgtgccaa cttgaagtgg cacttaccag agtggctcag gccaattgaa 660 cagaattccg agtggccaat ggtggctaat taaccatttc ttcctcgagt gcgagcactg 720 aggatgtccg ctcctttaga acctcgaaga gatgacgagg tagatattca gtcagagata 780 tatcagctag ctcagatgct ttctgcacgt cccaactcat ccgaaccgat cttggcacca 840 gtccaaatcc gccactcgtt aatttctgtc ctcgaatttc tccagcaatc actacctcag 900 tctactttac acccagccat atcaggcact attgcagagc cagactctcg tatatcaact 960 acagtccctc cctcaatctc tactgtcaat caacaatcaa ccagctcgat acttgaatcc 1020 actaatcacc aaccttcact cactccggag ccctcggaca ttctgaacaa cgtcaaaatt 1080 aatcgtaaga cgaccatcgg ggttgtctac aagtttgacc agccgggagt 
cttgattgaa 1140 tacccaaaga catctcctga aagggtcggg tacttactgc gaatgaatcc cctcaagtgg 1200 gacaatccga cccataattt tgcgtactct ctcggatacc caatgtcgtg gagatcgacg 1260 aagtcgattc aggacaacaa ccccttgctc accgatggga atgggaatca tgtccgcttc 1320 tatgtctcgc atacaacctg taagtcatag tatatcgcat ttccattgat tcttgaacct 1380 ccatctcaca tgttattgtt caactttttt cctaggtcaa ggaatcaaga tatgtcctta 1440 ttccgacaga gaaaagctct cacagccgca cagagaggct tctcgcgagg taatttctta 1500 aattatcccg ggtggacaaa tcaataacca gaccttatcg ttaagctttt gaaggctcgg 1560 ttggaagaaa ctcgctcatt tttgggtggc accagcaccg tagacaccaa aggtcttcgt 1620 tcagagctca tacaccgaac acttgcctac tgccatacac tccgaacaca tggatgcttt 1680 atgcccagtg ccacccagtc tgatatcatg aacatagaat ctgcttcgat cacaaccgat 1740 gatttccgct cccaagttga aaaggggagg cgtggtcacc tgcctcgtcc gacctgtggg 1800 ggagacttgg tgtttgatca ggactcaaaa gggaacttct atgtacggta tgtacccaaa 1860 taatacttga aatcgcatta actctaataa attttaatac agatgccagt tctatgcccc 1920 tgatcaagac cgtgatcatt tcgtcgatca cagtccttcg aacggactgt atgatctcgg 1980 ttatcttcga gccctctttg acaatgatac agaggctatt gaagagatgg aagacgactt 2040 gtttatatct gacatgtcgt tcgcaggatt tgatcactgt aacttcattc agaactgctc 2100 aagcgttaga gttaacaaag attgaataga aacacctaca gggagtaccc agataatcca 2160 gagccatccc agtggactat ggcttcgctg ttcagactat ggatagtgcg ggtgagggga 2220 cggtaaacag gaacattctt gagttcgtca tcgggcttgt gcgtgggggc aaaattcaag 2280 cgagaaaaga cagtaggcat tgttgtgaga acgagagact gaggtgagaa taggtggacg 2340 tcgaaggtga agtggaacga gagacgagag tgggcggaaa tggaaacgag tggctatggt 2400 gaggaaagga ttctgaagac gacgtgtcca ctcgagacca tttaaatcca ctggaccttg 2460 agtgccactc cgatactttt ttccctttcc tttgttctgc ttgcacaatc ggactcggac 2520 cttcagctca cgaggcccat gtctctaagc aatggttgtt tttgaaaagg ttctatgtga 2580 cctttgaaca accgttaagt tcaaacctca gagcctagcc ccggccccca gggtgtcttc 2640 ccctcatggg cgcttatttg gaacccattg atgcgatatg atataccgac gtgctggttg 2700 tattcctaca ctgataattc tagaccgaga gcgaagcgac ggtgatttgc atcagtatag 2760 tcggtcgctg ctctggtcct ctggatcaaa gagcgggcta ggcgtccttg tcctctcccg 
2820 cgcccagccg tgcctgaatc gaataatact gatcgtagta tgactcacga tttcgccaat 2880 caaccctttc accacctcgc ctatcacctc tttcatcctc tgaaagagaa cccaccatca 2940 ccaactatca aatcctgttc ccttgttccc ggtaatctag cgagcatcca ccctttcctc 3000 tcttcggcaa cttcctggca aacatctcct acactcagta cttcagcttc atgtccgaac 3060 cgaccacggg taaacggtcc tcgaagtccg gttccaagaa gagctcggga accaagggtg 3120 attccgactc ccaagctcct cctggtagcc ccgctaagac ggaaccaaag tctaccccgg 3180 catcactgtc tggttcgaat caggacgacg aattcgatct tgaccttaag cctatgacgg 3240 atcttaagac aaacgagcca gtctccaacc ccaaacccga ctcgcagcga aatgaaccga 3300 gcggtgattt gccaatcatc ccactcggat tggtaggtca accgctacag gcgcctaaaa 3360 actgggctga tcatttggtt ccaatcttgt ctcccttcat cggaagctac agtcgtaacg 3420 cacctcaaat caaagatgtc atcagttcta ccgagtatgc ctcgcgaatc gaggacggct 3480 taactttagg agaagagtac taccggggaa gtctactccg ggtaagtcat tcgcgacatc 3540 ctacacgatc ggaaatacgt acgagtttca gagaggagta cgacatgctc aacgactttc 3600 tgaccaacgt tttccaatgc cctacggaat cagggaatgg atattgtatc gatcggagta 3660 agttggtccg tttgaacacc cgttggcttc aactgagaaa tctcttcacc gatgaactgc 3720 tgatgacagg tttgaactgt tcagcacctc gttggggacc ttatgggtct gcgacggaca 3780 tctggagtgc gaacgacttc gaagtgctag cgatctgcta tcgacacgaa gtcgagtcat 3840 tcctttctcg gatcctggat gcccgaaagg aacatgagac catcgtatca aacgctggtg 3900 acctcactcg actcgggatt aagcaggagc ttacgggcga aatgggcgaa ggaaggaaag 3960 agggtgcgtc tctgaagtct ccctttacct tcaaaccatc gcgagagctc actagcactc 4020 ccagagagga aggagctacc aggaacgtcg ctctcgaaac accaaaatcg atgcgacaac 4080 cgccttcgac gactccgaaa accgacgacc cgtttggtaa gaagtcacgt atcacttacg 4140 acacagacac tgccgaagga agaaccattc ctcctaactt caggaccaga atgatgagcg 4200 aagacccctt cttcagaggg gcaccagctc agacttccag ttcgcgtaag ttccaggagc 4260 tctttggttc acctactgac gatcgacttt cgaataccgg gagtaagaac ccgtttaggc 4320 gaaacgctta tcgaaaccat tccgaggacg aagaccatcg aaacgaacct gaacacgaag 4380 gagaaggcga caatccgcct ccgtcgatca atcgtttcaa tggaaggagt ggcggagggc 4440 aaccaggaga tagcagcgac agtagcagca acgacgaaag acgcccgaac cgagacgaac 4500 
gaagagaccc gttcaggaga gcccgaagag aagggcgtga attccacaga cgatcggatg 4560 tcgaagacag gttcgtacaa tcaacgcgat cggttccaga gccgcagttt gacaccaaac 4620 tcaaggtaga cattatccca tcatgggatg ggtctccaga gactctggtg cgctggatca 4680 ccaaggtaaa tcgtctttcc gcgaaatcga agacggttca tcagcagcta ggcaatttag 4740 tcccagggcg ctttacaggg agcgccgaaa cgtggtactt cagtcttccg cttcgaaccc 4800 gcgaagggta cgagcagaac tggacgactc tgcgaaatgg gattgctaac tactacatga 4860 acaggtcttt catcgaaaag cagaaacgac gagctgatcg agctcattat cgacaagcgg 4920 ctacaccagg gagacgccca gtgagtactt catcaggaag gtcgagttat tggagttctc 4980 ttacaactat tcggatagcg agatgatcac tgagatcatg aacggagcgc cgaccatctg 5040 gacaagcgtt ctcacccctc acctattcaa taagttggtt gaattccaac acactcttcg 5100 attccacgaa gaccacctac tggctctgaa cgcacgagta acgaggtacg acgaacccct 5160 accttggaaa cgagaccacg gctcacgaaa tcctttcgaa tctttcaaga gcgttcgaac 5220 gaaccttatc ggagcgagca acaagctgcc agcaccacca ttccctaagg acgacaacaa 5280 cgtctcaaag aggaagaccc cggaaagttt aggcgtgcgg ccttgcaggc actgcggaag 5340 cgcaaagcac tgggacaacg agtgtaagta cgcacgacga gcccagaaga tcgcacgaac 5400 taacttagct atggcagaag aagaggacga actcgctaac caagaatacg acgacgcgta 5460 ttttggattg gatagcgaag atgaagggga tcaaacgggt ttttaaggga ccctccggtt 5520 gacgagctag aaccgggggg tggatctctt caagctcttg tagcgacgaa gtccatgtcc 5580 gtaatcacga acccattttt taaacccgac aatccttatc accttcaagc gcttaaccga 5640 aaggctcgcg cagctattaa ccgaaagatc aagcgtaagc tcttcaggga agtctcgtta 5700 tcgtccttcc tagctagtgg gaaggtagac gataaggaac gattaatcga actcttacca 5760 gtttctccta agcctccagg gtgtgcattt ctagggtcga aggctaccga agtcaaaaca 5820 cgcttgggct cattggacga atcggaagta tcgactacta ttgacacagg atccgatatt 5880 accctcatat cgagtgaaac catcctcaga atgaagagca gacccaaaat acgtacagga 5940 cagcgaatca aacttgtcca agtgactggt aacgccgtca taaccgggta cgtaactctg 6000 aaggtttact tcgaaaccga ggaaggacca gtggtcataa cagtggaagc gtatgtcgtt 6060 cgaggaatgt cagtcccttt cctcataggg aacgacttca gcgagcaata ctcgctttcg 6120 gtacttcgat cgaacggagc gacgactctc atcctaggcg attcgggaag gaagatctcg 6180 gttaccgata 
accccgtgtt cgctctcaag gactgtcagg gacaaacctg caaggtaacc 6240 gtcgatcctc aagtctcgca gaacaattcc aaggcaagag cagcaaagcg acgaagggtt 6300 cgattgaaag cagcagcagc agaagtccat actcgcgtaa cagtggtcat ccctgcagga 6360 accaccaagc gagtccctat cgtaatccat aagcagttca ctctgcaaag cgtcgaaccg 6420 ttcctggagc ggggagccgt ctatcgtgat tcagggaagg aagatcttct ctttggatgg 6480 acggacacct tgataaacaa ggatagtccc agtgttctgc tgtcgaactt ctcatccgag 6540 ccaatcgtcg ttgaagaagc aacatacctt gggaaattgg aagacgctac tctctgcctg 6600 gaaagggaat gtgatttatc cgaaaaggaa aagaagcaac gaatttgtca ttcgcgtctc 6660 atttccagcc tggtgaatca ggaactggag gctcaagctt caagcctgct taaccaaact 6720 atctggaaag atcccactca caacgaggat gtccttacgg atgaaacact atcgcgacca 6780 gtcgaaggag gaccgaagac agcagaagtt cctccgctag acgttccatc aagggaattc 6840 ttggaagtcg tggacctagc gcccgagcta agcgctcaag agcgccaacg tctgatcgaa 6900 gtattgtgga gaaacaagga cgcgtttgct ctggacggta aactgggaaa ttacgaggaa 6960 gaagtgacag ttcctatgaa ggaaggatca aaaccggtgt ccctacctcc tttccccatg 7020 tctccagcta accgcgcagt catcgacgaa caaatggata aatggatgaa acttgacgtc 7080 atcgaaccat caaagagtcc ttgggcagcg ccggtattca tcgtttatcg gaacgctaaa 7140 ccgcgcatgg ttatcgacct aagaaagctc aacgagagcg tagtccccga cgagcatcct 7200 atcccgaggc aggaggaaat tcttcagtcc ttgcaaggct gtaagtactt gtcttcgctc 7260 gacgccttgg caggatttac tcaactcagc atccatgaag acgacaggga gaaactcgcg 7320 tttcgaacgc atcgaggact gttccagttc aaacggatgc cctttgggta tcggaacggg 7380 cctgccgtat tccaaagagt gatgcagaat gtgctatccc cttatctttg gctgttcacc 7440 ctggtctata ttgacgatat cgttatctac tcaaagacgt tcgacgagca cttagcgcat 7500 gttgatctgg ttttgaaggc aatcatggag tcgaaactca cgctgtcacc ggataaatgc 7560 cacttcgggt atggatctat cctactactg ggtcagaaag tctctagact cggactgtcg 7620 acgcataagg agaaggttca gagcatcttg gacctggaaa caccaaagaa tgtgaaaacc 7680 ttgcaaacct tcttgggcat gatggtttac ttctcctctt acatcccgtt ttactcgtgg 7740 atcgcacacc ctttatttca gctgttgaaa aaaggaacaa agtgggaatg gaaggccgag 7800 catcaaaacg cgttcgagct ctgcaaggaa gtactcaccg aagcacccgt gcgcgctcac 7860 gccatgcctg gaagacccta 
ccgagtctac tcagatgctt gtgacttcgg attggcggcc 7920 atccttcaac aggtacagcc ggtcgagata agagatttac gagggacaag gacgtacgac 7980 cgattgagga aagcgttcga caaaggagaa aaggttccag ttttggcgca aaccatatcc 8040 aaagacgtca aggatgtgcc cgaagatgtc tgggccagcg acttcgagag caccaccgtc 8100 catatcgaac gagtaatcgc ctattggtca agggttctac aatcggcaga aaggaactac 8160 tctccgacgg agagagaagc tctagcactg aaagaagggc tggtgaagtt ccaagtatat 8220 ctggagggag aaaaggtact cgccatcaca gatcacgccg ccctccagtg gagtcgaacc 8280 ttccaaaacg taaatcgtcg gctcttgagt tggggattgg tgttttcggc cttcccaaac 8340 atgacgatcg tccatcgacc aggaagggtg cactcgaacg ttgatccaat atcccgactt 8400 cgcaggcgca tacccgatca attgagcccg aacgttacag agaacaaagg actcacgttg 8460 aacgagtcgg atgggccaat gaaagacatg tatgaagcgt tatgtccgaa actcgaggag 8520 aaagtcctct ctctgatgac ccaatgctta gaaacagaag agctcgagga cgtatcccag 8580 agctccgaac ccattcgact cactgttagc acttcaaaag gggaagaaat caactccaga 8640 ctcgaacggg cagctcggaa tcactcactt ctcatttccg tggaccctac agagcttaca 8700 cgatggaagg aggcgtactc aacggattct catttcacca aaatcctgaa tcatcttcaa 8760 gagaacgacc cagtcaccaa ccagaggtat gctcaaagtg atgaggggct actctacttc 8820 aacgataatc tcggaaacaa tcgactgtgc gtcccaagct cgcttgtcaa cgaaatcatg 8880 agcgaaattc acgatagaaa gaccgaaggg gctcacggag gatacttcaa gacctacaac 8940 cgtgttggaa gcacttacta ctggcccgga atgtcacgaa caatcaaacg atgggtggag 9000 acgtgcgatg tttgtcagaa gacgaagcct agacgacacg gacctgtcgg gcacctccaa 9060 tccatcccta tccctactcg gcccttcgaa actcttacta tggatttcat tccagatcta 9120 ccaacaacac ccaacggctt caacaacgta tttgtagtcg tcgataaact caccaaatat 9180 ggtttcttca tcccaaccac aacaactctc gacgacgaag atagcgctcg actcctaacc 9240 aacgaagtcc tcctcaaata cggtctacca aggcaaatca tttccgacag agatccccga 9300 tggactagcg gattttggaa agaaatatgc cgattactgg gcgttaaacg ggcgcttact 9360 actgcctacc atcctcaaag cgacggacaa accgagatca tgaaccaata tctcgagatc 9420 gcgatccgag cttacatcac gccaaatcga accaactggg atgaactact caaacaccta 9480 gccttctcat acaacacctc cgttcacaca gctacagggt tctccccttt ctacttactc 9540 catggattcc atcctgcaac cgaaagctcg 
cttatgcacg ccaaggaagg aattgaaaga 9600 ccaggggaat cactcagcgc caccgacgaa cgagctcagg gattcgcgga agcttttgca 9660 gcagaaaggc atcgcgctca ggaggcctta gcactcgctc aactgtctca acaacgggat 9720 tataacaagg gaagacttgc taaggaattc gaagtgggag atctggtagt gatcaaccgc 9780 gactcgctcg agctcggaag aacagtaaag ggcagaggga aaaagttgga ccctaaatac 9840 gaaggaccct ttgaaatcct ggaaaagata agtgccgtcg cttaccgatt acggatgccg 9900 gtgtcctatg gaatccatcc cgtacttaac atcgcccacc tggagaagta cgaaaagtcg 9960 ccgacggaac tagggccaag aacgaaattg cgtaaactcc gagcagactt tgaccaactg 10020 gaagaagtcg aagtggaaag aatcgtggac gaacgaacac gcaaagtagg aagccgacaa 10080 gtcaaagagt atagagtccg gtacgcaggg ttcccgccgg atcacgatga gtggatgaga 10140 gaacgtggtc tgaaaaacgc gccccaagtc ctgagagatt gggaagaaga gcgacgaaga 10200 atgaagaaca agatagccga ccagacaaag aacgagaaac gtttaaaagg aagacgagga 10260 ggaggaaaga gagcatcagt ccagccgtct tcgacctgat ctagctgcct tatcaacgct 10320 caaagccctc tcgctataac cctatctccc tcacccttcc ttattcgcat tccccaactc 10380 gaatcatcat gcccgccatc actcaatacg accccgctct tccccttttc caaccaatcg 10440 gcagaaccaa caacaacatt accattaacc ctattgcatg ccctgtctac cgcaaactgt 10500 cacacgatga tttccttctc gccgtcgacg ccaacgggac agaaactgca tgtattcgac 10560 aggagaccgc tgtctcttta tggacgactc tcaaggacat cgggcttgca atcaacaagg 10620 ccggaaaccc tggacgaaac gaagcggttc accttctcaa tcagtactac gaaatcccgc 10680 actacaaatt cgaccgactc ggattacagc gcccgctcat tctcgatctc gtccgaccca 10740 accttcataa catcttcaac ccgctttacg cttggggacc caaggagtac ctgaccaaga 10800 cctaccaggc tctgcagatc gtcgggaggg tagcccttcg ttggatcgac gacgccaaga 10860 aggtctgcca agaactcgga cccgattgcg gatgggactg ggaggaggga aacgagatcg 10920 aggaggaagc gccacccatg cccattcaag cccaccccct taatccctac aacgacctca 10980 ccactcggga cccacgggta cgaccctacc cagcgcgcca agaaatccga aactcgacca 11040 accgcgatct gattcttcac ccaaccgcct cctcagctgc ccgtgctaac atgcagatcg 11100 tcgtgcatcc caaccgatcc aacgccgccc gcgccaataa cagtgtcacc atttatcgac 11160 gcggcaacag aatggcaaga gccagcgctt actcgaccga cggatcagag tacgtcattc 11220 gcaacgggaa acagaccaac 
accaacttct gaggtgaatc aatgctcgga aagtgggggg 11280 ggta 11284 // ID Gypsy-11_LBS-I repbase; DNA; FNG; 10695 BP. XX AC ABFE01000651; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_LBS_; KW Gypsy-11_LBS-LTR; Gypsy-11_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-10695 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000651; Positions 139382 128688. XX CC Positions [4667-5227] - Reverse transcriptase CC Positions [6629-7108] - Integrase core CC 'CACTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 410..7729 FT /product="Gypsy-11_LBS-I_1p" FT /translation="MSSKSSSPATARPDDLPQLGANSSPTDHAVPSEDAPT FT LRRGLERGYALSPMYSLILEYNREQGANFSTAAEVSWILQQPSLKPSDIVF FT KKDHYDSHSIVSVGKPRLQRILRAVTELQTFLERMANLIEERDNIFRIDPE FT DTMTTALQGCESRSQLEVAYAILHKRLLMAQQTVSKYEAQYQNTEIPLSPV FT STLPELYEDFNALDDVDSRMRFMLQRIPHHQHHLTSSAQAAVYQGHSWDVI FT HPTPPLPSDSQDEPRASSSSAPFEADTPLRSDAEGKKKVEWSEVSPWREGS FT SSMELGRDKEEGLEPSFGFQTPFKAGTKFFDASGGSSASAYFSTPVAGPMT FT DITVGLATPSLTQFADNVRGKVSNVNLARLASSKDTTPPNPIHSSNPMPFQ FT RAPPPDDDPSGGGGGGGGGGGGGGNGFPIPSHSNASYPGGQPFPGGGGSPG FT GGGGGGGGPPFPGNQTGPPAPYGNIPASIKTELKVEQLPEWDGNHWTAIDY FT FWEVQQLAHLGGWIPEALGYWLWFRLKDKSPVKSWFITLPIAYQTYMRSHY FT LKFLKGIKDGYLGHRWQLRMNNYYNSQLFRERGHERESPSEFIVRRIVYTR FT MLLSISGGPLEVFYIMRKAPISWGPILLISSIKDSSELYSRVTEHEEALLE FT AYRVSRGGQAPSIDNIVSQLRQMGLISEKDNKTDNKTFPSHQSYQRRANLV FT EYATTKQNDPPDQTTLTTEEAIIPPVLDDLSYLTSNQHILREAYQVLKQRQ FT RPPPAGGYPFSKNDHVTTKMGRLPPSPCKCCGSANHWDKECPDWNTYLERA FT KRSANLVEIWPEDESEKPYAAAYAVLLNERLAGDVVNQPLLADSLTQQGFK FT 
AASSLPQVSEEETSKTRVGDSERTPRTTMEEIEDEDWLAHVAKPKSSDFLM FT EEVSPTNEAFVSQSKKEKEGVEPNFRSPPTQDETETSSKASDVPGPPIPDI FT RIKLKKRRFAPAGASAVGVSVVAVQGWVGSTRNKPIDIRMDSCADVTLISQ FT EYLESLQDRPPCQNGLKMNLWQLTDKDASLQGYVRIPIFLQSAEGVLLETE FT AEAYVVPDMTVPILLGEDYHLNYELIVAHKVDFRSVINFAGVPYSVPARGV FT SRTRDFERMRQSACAAASFIKAKFHKRNKAKKARRSKSFGIEKRTVRAAED FT YRLRPDECRRIRVDGHFEEDKTWLVEKNLLAAADDSTFAVPNVLISASDPW FT VPISNPSPHPKTIRKGDIVGYLTDPQEYFDTPQTPEDLEKLTKMTEALAAI FT ISISANDPPQDPSTQEKPEKQSESRQEQDDNKDDEQEAYGPKTAELPDPME FT YPSARMREFLDVGSLPEHLQEKAWEMLERRKKAFGFDGRLGHHPSKVHIRT FT VDGQVPIAVPMYGASPAKRAVMDEQLNKWYEQDVIEPSKSPWSAPVVIAYR FT NGKPRFCVDYRKLNAATIPDEFPIPRQSEILSSLSGAQVLSSLDALSGFTQ FT LEMDEEDVEKTAFRTHRGLFQFKRMPFGLRNGPSIFQRVMQGILAPYLWIF FT CLVYIDDIVIYSKTYEDHISHLDQVLEAIEKAGITLSPVKCHLFYSSILLL FT GHKVSRLGLSTHTEKVRAILELNRPAKLSQLQTFLGMAVYFSAFIPYYADR FT CYPLFQLLRKGAKWNWTAECENAFNSIKSALQESPVLGHPTEGLPYRLYTD FT ASDEALGCALQQVQPIKIKDLEGTRLYDQLRKAHAAGKKPPKLVVQLPSAF FT DDNIVTEDWSESFEDTVVYIERVIAYWSRTFKSAETRYSTTEREALGAKEG FT LVKFQPFIEGEKITLITDHAALQWAKTYENSNRRLAAWGTVFSAYAPHLSI FT VHRPGRKHSNVDPLSRLYRAPPPQDSPVKDDAISLEMNPVHIDFSANPSMG FT KAVFMAFSIADCLEEDKEAQLNTRSSKRKKEVLPLKPTVTTTPKTDTTEEQ FT SSEYWKATNPPPNLLVHLEKGMLQAWVESYLKDPHLAKIWNDPKTMVDQWT FT PGHRFFRNEEGLMFFRDADYQPRLCVPLLQRRLILEEAHEQAFEGAHQGPE FT KLWQKLSEKFYWKRMKADLLKFVQSCDVCQKIKAPNFNKYGYLIPNPIPSR FT PYQSIAMDFIVNLPWSDGFNAIHVTVDRLTKHGTFTPTTTGLNAEDFGALF FT VRKIVCRFGLPESIICDRDPRWTSDFWKGVAKFLRTKMSLSSSHHPQHDGQ FT TEIVNRFLEVMLRAFVSGNKEAWALWLPLLEWAYNASIHSSTGSTPNFLMF FT GFEPRTPMDFLQPKDTEKNVTRRSDSEEWLAKLQMIRDSARQAIAHAQHHQ FT ARSHNKGRRALELLVGDKVLVNPHSLEWIESKGEGAKLTPRWIGPFEVTQK FT INPNVYRLRMGDNYPGSPVINIQHLKKYVEDETYGDRTTLPESFVRRPESE FT EFEVEKIVGHRRIGKKAALKYLIRWAHYGPQFDTWGTAADLKNSPILLKEY FT RAKNNL" FT CDS 8066..10666 FT /product="Gypsy-11_LBS-I_2p" FT /translation="MDLTIHHLNPIASTTIRVDRDAIWVGLNSDTALIPDA FT AIIFRNLPDPGWSVDDFSDSGQVLPIDTVRSPHWYRDDEQWAAWTPTSFLL FT SERPWYDQLETAVPVEERLDGWSMAEEQRQICSGDLIRTQACVRSIVEFDQ FT RFPPHAKVPPQYPTERLAKVYATKKLVQINAAKAKCSVLQALAFMAWWTTI FT 
MPNWEVELNDTAAETISRLLATTKGKRGIICDLERDWSVINIPLYIQHDIP FT VFYLWDFDVRADPRFSRLNPALNLTYWAVRQGTTLTLHPDIAEDDLNRVAR FT DAVKLDHYFQEVFTYQSAADPPILSSYTPFVIDFVGWKRRPINRREETTES FT LAKFYYYDVFDDNEDYDHKVIVFWRWRKREPRDDYLRRQYKTSLPGEEYAG FT LIRELYKSSYAPKPGVMYDEDTGLRVTKARSLNTSPSLLERMGGSLVKGRL FT SLQDRLSDDASQTDMSTTQSSLSDEDFTAVRITEVPDTLYHPRAINSPAAW FT IRHNEALLENARRTTTARRTAQGIEASPYRRSQSPTQISEPIHSSHERPEV FT MFRRLLKDESAKITYTSSTWFAPHFAWNPEYLEVAYLFIPDVESEARLRYW FT ANCWDTVGTIRRLLTIAIEHGIRFYLALPPDSVRRFRPIIIDSLDRSSASF FT IYNVGFQEPPLSPADNAATFCATYLARMNDLLRRPHARAFIAEGGQLSWIA FT RRWTGLRLVEEFMSGPSIQTTVHSRGFYDSASEDASYLAHDIVSEQEKDLL FT LGYCPGTNGCLGRWLFPPADIFNGGFELWTGEWNAALDHIYRRLADDIARG FT KAKLRTREKWKCWIRNNDRGQRRPAYLPSTADFQNVMEGISIAGLKPTWHK FT EPLDDITLPERRLD" XX SQ Sequence 10695 BP; 2920 A; 2887 C; 2549 G; 2339 T; 0 other; aaggtggaca ctgtgggaac ggttgattcc tctagccgac gatccaacct cattgattca 60 tctattcatt tcaataccac tatcgcttta aaacgctgac accctgtcga cctatcgtcc 120 tacgaaacca actccccctg tggttgctag ggggggtaaa cccgcttttt cttcaacgtt 180 cggagcttct agaaattcta gcaaagccga tccgacgtcg actactactg aggatacaac 240 ccctgtcccc ggctcatcca attcaccgtg gaggaggacc gcttcttctc aggctccatc 300 ttcctcgtcg acaaaaaaca aaacaactcc tctgacacga gagctacccc ctcaccaatc 360 atctctgatt ccgaggccga aactaacgtc tcactcccct acgccaagaa tgtcttcgaa 420 gagttcttct ccggccacgg ctaggcctga cgatctgccg caattgggtg ccaactcttc 480 gccgaccgac cacgctgtac cttcagaaga cgcgccaacc ttacgtcgag gattggagag 540 agggtacgcc ctttctccca tgtacagcct tattctcgag tacaatcgag agcaaggggc 600 aaacttttct acggccgcgg aggtctcctg gatccttcaa caaccttctc tgaaaccgag 660 cgatatcgtc ttcaagaaag accactacga ttcccactct atcgtttcgg ttgggaaacc 720 tcgacttcaa cgtatactta gggcggttac cgagctgcaa acgttcctcg agcggatggc 780 taatctcatc gaagaacggg acaacatatt ccggatcgac ccagaggata cgatgacgac 840 ggctctacaa ggctgcgaga gtcgatcgca actagaagtg gcctacgcca tccttcacaa 900 acgtctcttg atggcgcaac agacggtcag caagtatgag gctcaatacc agaatactga 960 gatcccgcta tcaccggtct cgactttacc tgaattatac gaggacttca acgcactgga 1020 cgacgtggac 
agccgtatga ggttcatgct tcaaaggata cctcaccatc aacaccatct 1080 tacttcctca gcgcaagcag ccgtctatca gggtcattcg tgggatgtaa tacaccctac 1140 tccaccgttg ccatcggatt cgcaagacga gcctcgagca tcatcatcct ccgcgccttt 1200 cgaggccgac acaccattac gatccgatgc agaaggcaaa aagaaggtgg aatggagcga 1260 ggtctcccct tggcgcgagg gatcatcaag catggaacta ggaagagaca aggaagaagg 1320 gctagagcct tcctttggct ttcaaacccc cttcaaggcg ggcaccaagt tttttgacgc 1380 ctcaggtggc tccagcgctt cagcctactt ttcgacgcca gtagcagggc caatgacaga 1440 catcactgtg gggctcgcga caccctctct aacgcaattc gctgacaacg taagggggaa 1500 agtctcgaac gtcaacctag ctcgtctagc ttcttcaaaa gacaccactc cgcccaaccc 1560 tattcactca agtaacccca tgccctttca aagagcgccc ccgccggatg acgatccttc 1620 cggcggagga ggtggtggag gaggtggtgg tggaggcgga ggcaacggct tccctatacc 1680 gtctcacagc aacgcttcct accccggagg gcaacccttc ccaggaggag gcggctctcc 1740 tggaggaggc ggaggaggcg ggggaggacc accattcccc gggaaccaaa caggaccgcc 1800 cgctccctac ggaaatatac ctgcgtccat caagacggaa ttgaaagtcg agcaattacc 1860 agaatgggat gggaatcact ggacagccat agattacttc tgggaagttc agcaactagc 1920 tcatttagga ggctggatcc ccgaagcatt gggttactgg ctgtggttcc gtttgaagga 1980 taaatcaccc gtcaagtcat ggtttattac tttaccaata gcttatcaaa cctatatgcg 2040 ttcgcattat ctcaaatttc tcaagggaat taaggacgga tacctgggac atcgttggca 2100 actaagaatg aacaactact acaactccca acttttccgt gaaagaggcc atgaaaggga 2160 gagcccttca gaattcattg tcagacgtat agtctatacg cgaatgctcc tatccataag 2220 cggagggcca cttgaagtct tttatatcat gaggaaggct cccataagct ggggacctat 2280 cctgctcata agcagcataa aagactcaag cgagctctac tcgcgggtca cggagcacga 2340 agaagcgctt ctagaagcgt atcgcgtatc gagaggaggg caagctccct cgattgacaa 2400 tatagtctcc caactgaggc aaatgggtct gatttctgaa aaggataaca aaaccgataa 2460 caaaaccttc ccttctcatc agtcctatca acgacgcgcc aatctagtag agtacgcgac 2520 tacaaagcaa aatgaccccc ctgaccagac taccctcaca acggaggaag ccataatccc 2580 ccctgtcctt gacgatctct cctacctcac ctcaaaccaa cacatccttc gtgaggcgta 2640 ccaggtctta aagcaacgcc agagaccgcc accagcagga ggatacccat tctccaagaa 2700 cgaccacgtt acaacgaaga 
tgggcagatt gccgccgtca ccgtgcaaat gctgtggtag 2760 tgcgaatcat tgggataagg aatgtcccga ttggaacact tacttggaaa gagccaaacg 2820 atccgctaac ttggtggaga tttggcccga ggacgagtct gagaaaccct acgccgccgc 2880 ctacgccgtc ttgctcaatg agaggttagc gggagatgta gtcaatcagc ccttgttagc 2940 agactctcta acacagcagg gttttaaggc ggcatcgtct cttcctcagg tatctgagga 3000 ggaaacgagt aagaccaggg taggggactc cgaaaggact ccacgcacca caatggagga 3060 gatcgaagac gaagattggc tagcacatgt agccaaaccg aaatcttcgg acttcttgat 3120 ggaggaagtg tccccgacga atgaagcttt cgtttctcag tcaaagaaag aaaaagaagg 3180 agtagaacct aacttcagat cccctcccac tcaagatgag actgagacgt cttcaaaagc 3240 ctcggacgtt ccggggcccc ccattcccga tatcagaatc aaattaaaaa agagacgttt 3300 cgctccagcg ggagcgtcag cggtaggagt gtcggtagta gcagtccaag gatgggttgg 3360 ttctactaga aacaagccta tcgatatcag aatggattca tgcgcggacg tcaccttaat 3420 ctctcaagaa tacctagaaa gcctacaaga ccggcctccc tgtcagaacg gactgaagat 3480 gaatttatgg caacttacag ataaggacgc gtcccttcaa ggctatgtcc ggatcccgat 3540 atttctgcaa tctgcggagg gagttcttct cgaaacggaa gctgaagcct atgtagttcc 3600 ggacatgacg gttcctattc tgttaggaga agactaccat ttgaattacg aactcatcgt 3660 ggcccacaaa gtagatttcc gctcggtcat taacttcgca ggagttccct attcggtccc 3720 agcccgagga gtcagcagaa ccagagactt tgagagaatg cgacagagcg catgcgcagc 3780 ggctagcttc atcaaagcaa agtttcacaa gcggaataag gccaaaaaag cgaggaggag 3840 taaaagtttc ggcattgaga aaagaaccgt tcgagccgca gaggactacc gcctccgtcc 3900 agacgaatgc cgtcggatca gggtagatgg tcactttgaa gaagacaaga cttggctggt 3960 tgagaagaac cttctagctg ccgcggacga ttcgacattt gccgtgccta acgtactcat 4020 ttctgcatca gatccctggg tcccaatttc gaatccgtca cctcatccta agacgatcag 4080 aaaaggcgac atagtagggt acttaactga cccacaggag tatttcgaca ctccacagac 4140 gcccgaagat ctggagaagc taacaaaaat gacggaagcc ttagcagcca tcatatcgat 4200 ttccgcaaac gatccccctc aggatccttc gacacaagag aaacctgaga agcaatcaga 4260 gtctcgtcaa gaacaggatg ataataagga tgacgaacag gaagcctatg gtccaaaaac 4320 tgccgagctc ccagatccga tggagtaccc ctccgcgaga atgcgagagt tcctagacgt 4380 tgggtctctt cccgagcatt tacaagaaaa 
agcttgggag atgttagaaa gacgcaagaa 4440 agcgtttggt ttcgacggcc gcttaggtca tcatccgagc aaagtccata ttaggaccgt 4500 ggatggacaa gtccccatag ccgttcctat gtacggggca tcgccggcaa agagagcggt 4560 catggacgag cagttgaata aatggtatga acaagacgtg atagaaccat ctaagagtcc 4620 atggagcgcc cccgtcgtaa ttgcttaccg taacggcaaa ccacgattct gcgtggatta 4680 cagaaaatta aacgctgcta ccattccaga cgagttccct attccccgtc agtccgaaat 4740 cttatcttcc ctctctggag cacaggtcct atcttcatta gatgcgctat ccggcttcac 4800 gcaattagaa atggatgaag aagatgtgga aaaaacggct tttaggactc atcgcgggct 4860 cttccaattc aaacgaatgc ctttcggttt gagaaacggg ccctctatct tccaaagggt 4920 catgcagggc attctagccc cgtacttatg gattttttgc ttagtctaca tcgacgatat 4980 agtaatttac tccaagacat atgaagatca tatcagtcat ctcgatcaag ttcttgaagc 5040 tatagaaaag gcgggtatca ctctatcccc cgtcaaatgt catctctttt attcttctat 5100 ccttctactg gggcataaag tgtcccgact ggggttatcc acgcacaccg agaaggttcg 5160 cgccatcctg gagctaaacc gacccgcaaa gctatcacaa cttcaaacct ttttagggat 5220 ggcagtctat ttctcggctt ttatcccgta ctacgcagat cggtgttatc cgttgttcca 5280 gctgctgagg aagggcgcca agtggaattg gacggccgag tgcgagaacg cttttaattc 5340 gattaaaagc gctctccaag aatccccagt cttgggacat cctacggagg gtttgcctta 5400 tcgtttatat acagacgcgt cggatgaagc actaggatgt gcattacagc aagtccaacc 5460 tatcaaaatc aaggacttag aaggaacaag actttacgat cagctcagga aggctcacgc 5520 agcagggaag aaaccgccga aacttgtcgt gcaactacct tctgcttttg acgataacat 5580 agtgacagaa gactggtcgg aatcctttga agatacagtc gtttatatcg aaagggtgat 5640 cgcatattgg tctcgtacct ttaagtcggc cgaaacgcga tactcaacaa cagaacgcga 5700 agctctgggg gctaaagagg ggctagtcaa attccagccc ttcatcgaag gcgagaaaat 5760 cacgttaata actgaccatg cagcacttca atgggcgaaa acttacgaaa actcgaatcg 5820 gagattggcc gcctggggaa cagtattctc cgcctacgcg ccgcatctat ccatcgtcca 5880 tcgtccgggt agaaagcatt cgaatgtcga cccgctgtcc cgtttgtacc gcgcccctcc 5940 acctcaagat tccccagtca aggacgacgc aatttcattg gaaatgaacc cagtacacat 6000 tgatttcagc gcgaatccgt ccatgggaaa agccgtgttc atggctttca gtattgctga 6060 ctgtttggag gaggataaag aggcgcaact caatacacgc 
agttcgaagc gtaaaaaaga 6120 ggtcttacct ctaaaaccaa cagttacaac aactcccaag acggacacga cggaagagca 6180 atcgagcgag tattggaaag cgacaaatcc acctcccaat ttgctagtac acctagagaa 6240 aggaatgtta caagcctggg tagaaagtta tttgaaggat ccacacctcg cgaagatctg 6300 gaatgatccc aagacgatgg tcgatcaatg gactccgggt catcgtttct tcagaaacga 6360 ggaaggactg atgttcttta gagacgccga ctaccagccg aggctatgcg tacctctact 6420 tcagaggagg ttgattctcg aggaagctca cgaacaagca ttcgagggcg cccaccaggg 6480 gccggagaaa ctgtggcaga aactaagcga gaaattctac tggaaaagaa tgaaagccga 6540 cttactcaaa ttcgttcaga gttgtgacgt atgccagaag atcaaagctc ctaatttcaa 6600 caagtatgga tatctcatcc ccaatcctat tccaagcagg ccgtatcaat cgatagcgat 6660 ggacttcata gtaaacctcc cctggtcaga cggcttcaat gcgattcacg tcacggtgga 6720 tcgactaacc aagcatggta ccttcacgcc aactactact ggtttgaacg ccgaagattt 6780 cggtgcactg ttcgttagaa aaatcgtctg tcgctttggc ctcccggaaa gcattatatg 6840 tgacagagac cctaggtgga cttcagactt ctggaaagga gtagcgaagt ttttacgaac 6900 caagatgtcc ctttcctctt ctcaccatcc acaacatgac gggcagaccg agatcgtaaa 6960 tcgcttcttg gaagtgatgc tgagggcgtt cgtctctggc aacaaggaag cctgggcctt 7020 atggctgccg ctcttggaat gggcatataa tgccagtatc catagctcaa cgggatctac 7080 ccccaacttc ttgatgttcg gtttcgaacc acggacgcca atggacttcc tacagccgaa 7140 ggatacggaa aagaatgtaa cgaggcggtc ggactcagaa gagtggctgg caaagctgca 7200 aatgataaga gacagcgcta gacaagctat agcgcacgca caacaccatc aagctcgtag 7260 ccacaataaa gggcgaagag cgttggaatt actggtagga gacaaagtcc tggtgaaccc 7320 gcattcctta gaatggatag aatcgaaggg tgaaggtgcc aaactcaccc cacgctggat 7380 tggtcccttt gaagtgaccc agaagatcaa tcctaacgtc taccggctcc ggatgggaga 7440 caactacccg ggctcacctg tgatcaacat ccaacacctc aagaaatacg tagaggacga 7500 aacttacggc gatcggacga ctttacccga gtctttcgtg cgacgaccag aatcagagga 7560 gttcgaagtc gaaaaaatag tcggtcaccg aaggatcggg aaaaaggccg ctctgaagta 7620 tttgatacga tgggcgcact acggcccgca attcgacact tggggcacag ccgcagatct 7680 caagaattct ccgattctgt tgaaggaata ccgtgccaag aacaaccttt gaaaactagt 7740 tctactgcga attatgcaat ccccctttca cattttcact ctttcgttct 
gattgtcagg 7800 tacttctcgt atattttttt tttctttctt tatatttctt ttcttttttt tcaattggag 7860 cagttagaga cgcatggcag gcacactgac tatccatctc atcaatcaga tacacgtgtc 7920 gttacaaaac tcgacgcctc ccctccttcc ctagatttcc cctcatcttt cctgcccctc 7980 accttcattc gttgggactg tccacgagtg aaagttagga gaacaactaa tctcgcagag 8040 gactgtcttg cttgatttcg ttgccatgga tctcacaatc catcatctca atccgatcgc 8100 ttccacgacc atcagagtcg accgagacgc tatctgggta ggtctcaact cagacacagc 8160 cctgatccct gatgcggcaa taatctttag gaatttacct gatccagggt ggagcgtcga 8220 cgatttttcc gattcaggcc aagttttgcc tatcgacaca gtcagaagcc cacactggta 8280 cagggacgac gagcaatggg cggcgtggac gccaacctca ttccttctta gtgaacgtcc 8340 ttggtacgac cagctcgaaa cggctgtccc agtcgaagaa cgactcgatg gatggtccat 8400 ggcagaggag caacgccaga tctgcagcgg ggacctcatc cgtacccaag cctgcgtacg 8460 cagcatagtg gaattcgatc aacgattccc cccccacgcg aaagtcccgc ctcagtaccc 8520 aaccgagcga ttagccaaag tctacgcaac caagaaattg gttcaaatca acgcggcgaa 8580 ggcgaaatgc tccgttttac aggctctcgc ctttatggca tggtggacaa caattatgcc 8640 gaactgggag gttgagctca atgacacagc ggcggagaca attagtaggc tcttagcgac 8700 aacgaagggc aagcgcggaa ttatctgcga tctcgagagg gactggtccg tcatcaacat 8760 ccccctgtac attcaacacg acataccagt cttctacctt tgggatttcg acgttagagc 8820 ggacccacgt ttcagcagac tcaacccagc cctgaaccta acttactggg ccgtcagaca 8880 agggaccacg ttaacccttc acccggacat tgcagaagac gatctcaaca gagttgctcg 8940 tgacgcagta aagttggacc attatttcca agaagtcttc acgtatcagt cagctgccga 9000 cccacccatc ctttcatcct acacaccctt cgtcatcgac ttcgtaggat ggaaacgcag 9060 accgatcaac cgtcgggaag agacaaccga atctttagcg aaattttact attacgatgt 9120 cttcgacgat aacgaggact acgaccacaa agtgattgtt ttttggagat ggaggaagcg 9180 agaaccgagg gacgattatc ttcgacgcca atacaagacg agcttaccag gggaagaata 9240 tgcagggttg atcagggagt tatacaagtc ttcctacgcg ccaaaacccg gagtaatgta 9300 tgacgaagat acgggcttgc gcgttaccaa ggcccgatcg ctcaatacct caccatctct 9360 tctggaacga atggggggct ctctcgtcaa gggcaggtta tcccttcaag accgtttatc 9420 cgacgacgcg tcgcagactg acatgtccac cacacagagt tctctatccg acgaagactt 
9480 cacggcagtt cgcatcaccg aagtccctga caccctatac caccctcgag caattaacag 9540 tccagcagca tggatccgtc ataacgaggc cctgctcgag aacgctcgcc gaaccaccac 9600 agcacggcgg acagcccaag gtatcgaagc atcaccttac cgtcgctccc agtcgcccac 9660 acaaatatct gaaccgattc actcatctca tgagcgccca gaagtcatgt tccgaaggtt 9720 actgaaagac gaatcagcga agataaccta caccagcagc acatggttcg cccctcactt 9780 cgcgtggaac cctgaatacc tggaggtggc atatctcttc atcccggacg tagagagcga 9840 ggctcgactt cgctactggg ctaactgctg ggatacggtc ggcacaattc gcagactcct 9900 caccatagcc atcgagcatg gtattcgctt ctacctagct ctaccgccag actctgtcag 9960 acgattccgc cccatcataa tcgacagttt ggaccgaagc tcggcatcgt tcatttacaa 10020 cgtaggcttc caggagcctc cactctcccc agccgacaac gcagcgacat tctgcgcgac 10080 gtatctagct cggatgaacg accttctacg ccgcccccat gcgcgcgctt tcatcgcgga 10140 aggagggcag ctgagctgga tagcaagaag atggacaggc ttgcgacttg tcgaggaatt 10200 tatgtctgga ccctccatcc aaaccaccgt tcatagcaga ggattttacg actcagcgtc 10260 cgaagacgct tcctaccttg cacacgatat cgtttcagag caggaaaagg atttattgtt 10320 aggctactgt ccaggcacga acggctgcct aggccgatgg ctcttccccc cagctgatat 10380 cttcaatggc ggtttcgaac tgtggacagg ggaatggaac gccgcgctcg atcacatcta 10440 ccgaagactc gctgacgaca tcgcgagagg caaggccaaa cttcgcacac gagaaaaatg 10500 gaaatgttgg atcaggaaca acgaccgagg ccaacgccga cctgcttacc taccatccac 10560 tgcagatttt cagaatgtaa tggaaggcat ctcgatcgcg ggattgaagc ccacttggca 10620 caaagaaccg ttggacgaca tcaccctccc cgagaggcga ttggattaat cgggaactgg 10680 aaagtggggg ggctc 10695 // ID Copia-52_MLP-LTR repbase; DNA; FNG; 751 BP. XX AC AECX01000262; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-52_MLP_; KW Copia-52_MLP-I; Copia-52_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. 
XX RN [1] RP 1-751 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000262; Positions 16498 17248. XX SQ Sequence 751 BP; 175 A; 143 C; 123 G; 310 T; 0 other; tgtcgaaata caatcaagac acttaccctg accgttgtat gacggacgta tggtagcggt 60 ttttggagtt ggatttagtt ttctttcgtt tttgttatca gagatgtttt agtgtttgtg 120 atctttgtag acgagtcaag aaaagtgtct tgtacatctt tcccaataac gttgttgttc 180 atctttccca atattttatt ttgatctttt tcacttctat gtatagcaag gctcggttct 240 caatctccct ctctttcctc atcttttgga aagagagtga gtaccttttt gcctttttca 300 ctatacacgt tatcttacta atatatctat atgtgttaga aatccttaag atcatcatct 360 tcgagctttt gagttcttta taaaagtttt ctcatcactt atattttcct cctgctatct 420 aggtacattt tatttctctc atacgctctt ttattatgtc ctgtcttctc atgtctctta 480 tttcttttct acttgtcttt aggttagttc cttagaggtc tgcatacctc gagctggaga 540 agagtgctta cctttagcac tggtttatta ggttagttaa caagtaataa ataatatctt 600 ttggaaagag aaaatcctta agatcatcat cttcgagctt tcgagttctt tataaaagtt 660 ttctcatcac ttatattttc ctcctgctat ctaggttagt tccttagagg tctgcatacc 720 tcgagctgga gaagagtgct tacctttagc a 751 // ID MGR583 repbase; DNA; FNG; 5977 BP. XX AC AF018033; XX DT 11-AUG-1999 (Rel. 4.07, Created) DT 11-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE MGR583 is a non-LTR retrotransposon MGL. XX KW Non-LTR Retrotransposon; Transposable Element; AF018033; LINE; KW MGR583; ORF1; ORF2; reverse transcriptase. XX OS Magnaporthe grisea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-5977 RA Meyn III A.M., Farrall L., Valent B., Chumley G.F. RA and Orbach J.M.; RT "Magnaporthe grisea repeated DNA element MGR583 is a member of RT the LINE-1 class of polyA retrotransposons."; RL Unpublished. XX RN [2] RP 1-5977 RA Meyn A.M., Farrall L., Valent B., Chumley G.F. 
and Orbach J.M.; RT "MGR583."; RL Direct Submission to Genbank (08-AUG-1997)Plant Pathology, RL University of Arizona, P.O. Box 210036, Tucson, AZ 85721-0036, RL USA. XX DR GenBank; AF018033; Positions 1 5977. XX SQ Sequence 5977 BP; 1389 A; 1801 C; 1743 G; 1044 T; 0 other; caagcatcct ccaccattca tgggtcagac ccacctttgt tagcgattca cacccaaaca 60 caccaataag cttaacagct gaagaatttt cgccctccgc gcggtcgtca tggcctcacc 120 agaggtctca aaccgccgcg cgcgcccgca gagcacgccg cacggtttgc cgcagggcca 180 ggccgagtcg ggacacgtat ctctcgacac gcgcaccgct aacgcgcctc agccggctgg 240 ccgcggaagg tactccaccg tcccggccct ggccatcaaa acactgcagc aggagttgga 300 aaacgtggaa acagtcggca gctggctcga agaggtcttt gcagccaagc tcgaaagttt 360 acaggccccg gaggacgccg cgcttaggag aattgttctt gaacttgacg acgtgatcag 420 cggctggttc ctcagtcgca ccctccctga aaaagagcgt tcgcgctccc cgtccggttc 480 gccgaccagt tccagcgaaa ggcgcagcat tttaattacg aggacgggca cggcgtcgtc 540 ggcctctaaa ggcagaggca cgcccactaa caaatcggtc acatgggccg agcgagccgc 600 cacgccacca gcggcggccc caacgcgaat tttaaccccg gccacgcgct cgtacccgat 660 acgtacgacg cccggcaatg cacctgacag ctccgcaact ccctcccaat cgggccggga 720 ccgctcccaa gccccgaaac gtttgacaga acagccgtcc aaccgcatcc tgatccgcgt 780 aaatgacaaa ttgcggatgg tgacgaggga gccatacgcg gtaacgacca aattggccga 840 attggtctcg gtgccacaca gtgaaatccc aaacgccaag cgtaccccca cgggctgggg 900 tgttacggtg gcctctgagg aaacacggca aaaactcctc aaccccagcg cggtcaaggc 960 cattatggac gcattggacg cgcgggaagt cactgtcccg acaacctggc acacatacgc 1020 cgtctctgga gtgccccgaa cgctcaattg cctggacgga agaaaggacg tacatgaggt 1080 aatccaggag gagattaccg tggcagcagg ccagcagcca gtcaactggc atatttccaa 1140 acatggttac gacccacata cggctgaggg cacgtggata gtgtccttcc ttcaacccgt 1200 aaagcccttt cagctttttg gagtgtcgaa acgtgcgcgg aaggttgaaa aatcacgcaa 1260 aattacacac caccacgacg gttgccaggg atattgcgaa atacgtcgct gcgtacggca 1320 agcgcgttgc ggcatatgtg gtgaagcaaa acatgcgaca gaggaagagc cgtgcaaggc 1380 agcccccaaa tgcgttaatt gccacggcca tttcccatcc ggacatgaga attgccctgc 1440 ccgacctttg gtggtaaatg ggaagcttga 
acgaccttca cagcgtaaac ttaaggggat 1500 tcgcagaatc ggacgtgcca accgtgccgc gattatcaac gaccgggccg aaacggcccg 1560 ccgagagcca gcacctgcaa tcccggtcct aagcacctca tcgcagcatt cgcaaacctc 1620 gagctcccga tcgagcattt caaacaaaag agcgcggcca tgcgaggcgg caggggcgga 1680 catccgtaac ttcctgcagg ttgaacagga acgcgtgcac gacgaaatcg tttgggcagc 1740 cgaaatggag gaggacgcag cggctgccga accttgcccc gctgatggga tagacttgga 1800 agcaacccgt cgattaatat gacgaaatac tcgctcggca atatgcagcg ggaatgcctc 1860 gacgtagttt gggccaacgt aggtaaaagg atgggtgtgc atctgagcct attggagttg 1920 tgccaccaga gaaaggtgga cctggttaac gttcaagagc catggtgcgg gctaaatacg 1980 actacacaaa cgcacccggg ctatgatgtt tttgcgccag tggacgaatg gcacgccaac 2040 acttacgaag ccatgactgg gctccggccc cgggtgctca cctacgttaa gaagggggca 2100 gccctaaggg ccgcacaacg acggctaccg cagggccaaa atacgcggga catactctgg 2160 ctcgaaatta acggaatatt gtttgttaac gtatataggg ctccgggcac tgaagccgcg 2220 ctggaaatgg tatgcaatac cgtgccaaat ggacccacgg tgctaggcgg cgattttaac 2280 gtagcggctg ctgcatacca gccgggccgc gcaaacgcac gcggagggga ccaactgacg 2340 gcatgggcgc aagcccaggg gatgagtttt acaggaaata tcggcgtgcc cacccaccgc 2400 gacgggggtt tactggatat ggtcttttcc aacctcccca gcacaataac tgtggttgac 2460 agcagtcttt acacaggctc cgaccacgaa tccctttata ccacgctgcg gacgcgcggc 2520 gccccccgcc cggaggcgat caacgtttcg gttagggacg accggttacc aaagttcgcg 2580 gagctacttg cttttggcat gcaagacatg ccggacccgg gatccgcggc cgatgcgtac 2640 gcattggacg cgtgggtagc tgaatttaca actttatggg agcaagtgac gtgggtcgtg 2700 ggtacgccgg cgggaaaaag ggacagctcg gctccctggt ggaccgaaaa ctgccagcgc 2760 gtctggtccg aattccagcg ggttaaacgg agcgctgtaa atagggcaga cgcctctgcc 2820 gaggaaaaag cctatacgaa aggggtgcgg gccgcgaagc gggagtactg gaggcaccgg 2880 atcgaccaac tgcgcgacga taaggatttg tggaacatgg tgggctggct gggagccgga 2940 ccacgcctcc ggtccccgcc gctggttatt aacggcgagc aagtgagcga gcctttagca 3000 aaagcggaag cactgcaacg ggaggtgctt ggccgatttt cggcggagga tgacctgcag 3060 ggagacccct tggaggcctg gtcgccggac gaggctaaaa tcccttggga cacccaggtt 3120 agtgcggaag aggcggagcg ctcctgcatt ggggttacga 
gcacgtcccc gggcatcgac 3180 ggaataaccg tccggctact aaaagccgga tgggcttcgt tggccgagcc ggtgcgcagg 3240 ctctaccaac gttgcctgga actcggacat ttcccagcgc cttggaagaa ggccgaagtc 3300 gcgatgctac cgaaaacggg gaagaaagac cggagcagcg ttcgatcctg gcggcctata 3360 gccctcctct cctgcatcgg aaaaggcctc gagaggctcg ttgcacgccg gatagcttgg 3420 gccatacacg acaacgggct ccttagctgc acccacggcg gtgccctacc caaacgatcg 3480 gcgacggatc tggtttgtgc cctcgcccac gatatcgagc aggctttagc tcgcggggag 3540 gaagttaccc ttgtcgcatg cgacgtccaa ggcgccttcg acgccctcct ccatgggcga 3600 ttaatacgga aaatgcggag cctcgggttt agtaaaatgc tcctcaggtt cgtgattaac 3660 tttttaagcg gccgccaggc ccgagtccgg ctggagggta cgaccacggg ctttaggcgg 3720 cttgggtgcg gcaccccgca agggtctccc ctttccccga tcctatatat gctatatttg 3780 gcgtatctcg tcaaaaacgg tacgaagtgg cggttggcat acgcagacga cgtgcttaca 3840 tggaaatcgt caccctcgtt ggaggaaaac gtacgttggc tggaagataa actccgggat 3900 atgcacgaaa ttgcggcgga agagaagatc cattttgcag cggaaaagac agaggtgatc 3960 catatcacta agaaaaggca cggtcgcaac ccggaaatcc ggattaatgg tagaacggtt 4020 accccggtcc aactaccggg cggtcgacgc ggacaaagcg cctccggggc cgagcgctac 4080 ccgggcatgc gttggcttgg tttttggttt agccgacggc tagacgggcg ccgccacgtg 4140 gccgaaaggg ctgccaaagc aatggcggtc gctgcccacc tcaagagctt tggggcagta 4200 agatacggac cgcctgcggc tgcacttcgc aaggcagcgg tggcatgcgt gggttcctcg 4260 gccacctacg cagccgaggc atggtacaac ccagcgcaca agcaaagagg actcctcaaa 4320 gcattgaaca aaccgctggt tttagcggca cgggcaatcc tcccggcgta taaaaccacc 4380 ccctcgtcca ctgttctcag ggacgcagga ctaccctcgg cccgcgtcgc gctggcctac 4440 acccgcctga aatacggcgc ccgactaagg ctcgcggaca aggggcaccc ccttgtcagc 4500 cgcctgcgtg aaacacctcg tgcccgcaac tcgggccatt cggccacgac cctgcaaacg 4560 gctgcacaac ttctcccgcg gatccggaga ctagagttgc gcgcacctcg caatgccccg 4620 gattctcgca ctgaccccac cggaggagtc ccgaaagagg aagcggcccg ccgctttatc 4680 gaatggctgg acatggtatc acctgacgat atcgtggtat atacggatgg ttcggaaaaa 4740 cacgaaaaca actgcgtcca aatagggtac ggatgggccg cttttagggc gggcctggaa 4800 tttgccgcac gtgccgcatc tattacgccg gaaagccacg ttttcgacgc 
cgaggcgatt 4860 ggcgccctaa aagggctaca ggcggcagcc aaggcccagc caggcgcccg gatctggatt 4920 tgtgtggaca gcacctcggt tatttggggt cttagaggcg acgcgccgcg ttcgtcccaa 4980 tgggcctttc tggagttcca taaccttgtc gatctactcc gaaaacaaag taccgaggtt 5040 cgggtccgtt ggtgccctgg gcaccaggga atcccgggaa acgaccgggc tgacgagctg 5100 gccaaggccg gctccgccgg accgccggac ccagacccga gggctcagca aaccacgtat 5160 agcggtgccg gcacggtcct cagagccatt ctttcgaata tagagaagga ctggtggcgt 5220 aaagaactct gtgaacggtc ccccgcatat agggaatgga aattccaata cacaccgaga 5280 aaggagcccg aggaactgcg tttgccaaga cccctactgg gccattattt ggccatgagg 5340 accggccacg gcgatttcaa agcctaccat gaccgtttca accaccagga tgcaaacacc 5400 tcgtgtgcct ggtgctggaa gcggacctcc cctgaacacc cggtgcactg ccgcttttcg 5460 cgggcggtgt ggagaaactg gccgtggcct gacaacgacc ggccggtcgg gccgccagac 5520 cgcgcccaac gccgcaaatt cttccagaca agcctcgggc aaccgaaaag ctttgaggcg 5580 ttttcgatag ccaccaacta cttcagcgcc cgccccagag cggcccgcca gcgccccgcg 5640 cgccacgaac gcactttacg cctagggaca ccgatcgtaa acgactcgaa ctctgacgag 5700 gaatagactt actgcctttt cacccacact tcacggcaca agccgctaga cgaaccctcc 5760 gtgcacctta ggcacggggt cgcgtcagtg aactaacccc ctaagcagga ccgggcccga 5820 acccggtcag gcacgatccg cctctgccct ccttgttttc cccctgtgta aataaagaag 5880 atagaacgcg cgccgagata cccctcggga ggttgctaac ggccggctaa caagccgggc 5940 cgagcccggc gttaactaat actactacta ctactac 5977 // ID Gypsy-20_MLP-LTR repbase; DNA; FNG; 338 BP. XX AC AECX01000932; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_MLP_; KW Gypsy-20_MLP-I; Gypsy-20_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-338 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000932; Positions 6222 6559. XX SQ Sequence 338 BP; 73 A; 73 C; 78 G; 114 T; 0 other; tgtaaggagt atacacgaca agtagtcatg ggtttgggat catgggatta gatagacgcc 60 tccttacctt tctcttccag cgcccggctt cggaaagtcc tcactcctca gagagcttat 120 tgggtgactt tttggagcta ggtgagatta tcatttctct ttatcatttc acttatttct 180 tatgtttaca ttgttgctga ctcctcagag agcttattgg gtgacttttt ggagctagat 240 cgcaatttat accgaagttc caagtccgtt tgctctctcg agaagtcgtg ccggacgtgt 300 ttccggacgc tcagtgaagg ttctttaaga accttaca 338 // ID Copia-18_MLP-LTR repbase; DNA; FNG; 910 BP. XX AC AECX01001037; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-18_MLP_; KW Copia-18_MLP-I; Copia-18_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-910 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001037; Positions 70117 71026. 
XX SQ Sequence 910 BP; 196 A; 199 C; 182 G; 333 T; 0 other; tgttattgaa tgatttcgat ctccacttca agtccttcac gtgatcagtg tagcagttgt 60 agaagtagat ggagctagga ttaagtgtgt gctgcgtgta aagatggtag ggaattgtgt 120 caagggggta gttaggaagt gtgagaagtg catccaaatc gggaaggtat catgggtgtg 180 tgtgttggaa agtgatgaat gaggttaaca ttacttccct tttgagagta gcctgttcct 240 ttttagtttc tatcttttat gaatgtgttt gaaccccgtt gtcagaattt ttcacttgtg 300 ttcgatttgt cttatattcc cgctgcctct cgagctctct cttccttcct taccgttagg 360 aattgaaagc tcaagagtaa gtcgcggtgt tcctctataa tcactctatt agctgttttc 420 ctggatagta ctcatcaaat ctctctttct cgatatctcg aatagttccg tttttctttt 480 ctcctgcatc actgcaggta aggcgttgat tcatccgcca ctgtcccttt tcactattcg 540 agttatcaca tagagctcat accttacacc tgtctgtcca ggtaattagt acttcatatt 600 gttttcagta gtttttctat ttttgctttg cctgaccgtt aggaattgaa agctcaagat 660 tccgtttttc ttttctcctg catcactgca ggtaaggcgt tgattcatcc gccactgtcc 720 cttttcacta ttcgagttat cacatagagc tcatacctta cacctgtctg tccaggtaag 780 gcgttgattc atccgccact gtcccttttc actattcgag ttatcacata gagctcatac 840 cttacacctg tctgtccagg tttcatcata gctctattgc gttgattaga tcccgcattg 900 aattctttca 910 // ID Gypsy-89_MLP-LTR repbase; DNA; FNG; 153 BP. XX AC AECX01000270; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-89_MLP_; KW Gypsy-89_MLP-I; Gypsy-89_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-153 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000270; Positions 50859 51011. 
XX SQ Sequence 153 BP; 42 A; 40 C; 25 G; 46 T; 0 other; tgttataacc ttatactggg ttatgcttgt attgagtaca gtgtgaccac gccatctcag 60 cgcacgccca aacctgtgtt gtacttcatg ctacaatcta agttataata tcagaactca 120 tctggcctct gatacccatc caagctcata aca 153 // ID Gypsy-3_LENY-I repbase; DNA; FNG; 5029 BP. XX AC AAPO01000087; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Lodderomyces elongisporus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_LENY_; KW Gypsy-3_LENY-LTR; Gypsy-3_LENY-I. XX OS Lodderomyces elongisporus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Lodderomyces. XX RN [1] RP 1-5029 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lodderomyces elongisporus RT genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; AAPO01000087; Positions 13542 8514. XX CC Positions [3824-4330] - Integrase core CC 'CTTAC' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 17..5020 FT /product="Gypsy-3_LENY-I_1p" FT /translation="MVKTRATQVAIRFQRNVAFTAFIATLSGLVGSLLYQH FT DPEHFAVVRQWSLFIFTTLQIWKQQLIVPDWWYNSTYTTQAVVTSLRLPWY FT RTHYTTITYQYTTRSFQEFHLDDLLGTELDTFIKSVKLDPAAPHWLITAFN FT IIKITITYCSLLFHILSGWALPIITFILLTTSITFFRMVWSILTWPIRASQ FT PKRVTNHNTTPTANTPVTSIRTEITHGSNSPTRIVIETLPELVAQIHGRAQ FT IPAGRQPWHARLITFCWMLIRWIFLFPFQLTRWLRESRARALTRDYRDYQL FT VDLDFISREALANSLFPQIRLVRFDPKRLYGAIIRASENRNWNDRNTFTSR FT RSKFLRMTTLWQFQTEFIKMGLSNDVYNTLSEQARQTALEFKTPVLDYQET FT LDLLRELDQIPYLHKHFNSVMPALLRTAFSKPAIHNVILTLYDTTQSYWTL FT MSLLDSNWYDLAVNVPERPSTRPYNKHHLRNGRPRLVTPNIVSTTTETTTP FT SSIIPKMHSTNYSQTGYVAAALANQAKHSFFVPILLQSADVPETMGYLDQG FT STFSFISPALVSSLGLVTSPRAQRINLAAFGQGARTVTAPTVDVDFNIKDH FT DTPFSFTFVVHETRIAPILLGADWWEKYELLCVVGKRLEINGQTIPTIQGE FT EHQLIFSTDDLDPPPDPTPPLFAHLIQQYKSKAPTLTLPPARSTDYVINIR FT KDAIIKPNRPHPKYAVNAEQAIINNTRELLQQGLIIPSKARHYSIPSAVPK FT KNGTYRIVYDFRAVNALTIPINSTWNNFRELLPGITGATCFSTIDLSSAYH FT QLRIAPLTQPWTAFWTPIGVFEFLVLPFGLTDAPEVFGRYMKQLFMDMPFV FT RTYLDDILVFSENEKVHYSHLKKIFERLVQHQHTINWEKSVFAQPEIQWVG FT HLLTKDGVLPTDTTKGQIQLIAIPRDKDEVRQFLGHVGYIQDHIANYSHRA FT APLTDLLTKHTQFNWSQKCQSAFNDLKAAVLETTSLQFFDPKQPIAVFTDA FT SHIAMAAILMQPDQAGGGNWRPIEYRSRKFTGGEVNWDIFLKEFAAVKMAF FT EKFRLYLEGTHFSLFVDQLPIKQVFEAFGRGKIHGDARIQRWMSKLLCYNF FT DVFHINGNENFIADHMSRNPAHFQSSTAISIIGAVNVLLVPDEWFHQFPPA FT YADDPFFANVYDLLQQGQTNNEATQNYSLDTTTTLLYYHDKLCIPHALLQQ FT FLMTQHDSLEGGHCGITNFFNKVDPYYYFPKMRLVIRQYVDLCLQCQRHKP FT PNRAPNGLLQSFPDPKGRWTDINVDIISGYPEVEFNGRLVNAILTIVCRFS FT RRVHFYPISTKFSTLDFIHVFQHLYMPLHGIPTIITTDRGTQFTSELFQNY FT MSSLKITSNIAVTSHQQSNGLAETKNKQIERYLRLYIDQNPEWPQLLLIGE FT YVMNSTPKSALDHKSPFEMDLSYIPRSPATILFPTHAGAQDNAAIDITEKL FT ERLTTAARHAHKATFQRLKDNYDRHHRDIEFNVGDAVLVRTPLLKSHPDAA FT NHGQRRKFLSEFTGPFTIAAKSDNGVNYTLEMPQSYKGHSDFHVSQLRLLK FT SLPPDAITAPQPEVGFKVYKDGSQLVEIDAIVDHKPRKTGFLLLCRHTPTS FT KHPDGECVYYSASSLRQTARDLLLAYACEKSLPALVAPKDRHLVTPPQTT" XX SQ Sequence 5029 BP; 1352 A; 1488 C; 880 G; 1309 T; 0 other; tggaggtcct aaccagatgg ttaaaacccg 
agctactcaa gttgcgatac gttttcaacg 60 taacgtagct ttcacagcct tcattgccac tctctccggt ctcgttggat cccttcttta 120 ccaacacgac ccagaacact ttgcagtcgt tcgccaatgg tcactattca ttttcacgac 180 tttacaaatc tggaaacaac aactcattgt tccagactgg tggtacaact ccacctacac 240 cacacaagct gtggttactt ctcttaggct tccttggtac aggactcact acaccaccat 300 tacgtaccaa tacaccactc gttcctttca ggaattccac cttgacgatt tgttaggcac 360 tgaacttgac accttcatca aaagtgtcaa acttgaccct gctgctcctc actggctcat 420 caccgccttc aatattatca agattacgat tacctactgt tccctgttgt tccacatact 480 ttccggatgg gcgttaccta tcattacgtt catcttactt accacttcca ttaccttttt 540 cagaatggtt tggtcaatcc ttacttggcc aattagagca tcccaaccca aacgggtcac 600 taatcacaat acgaccccaa cagccaatac acctgtcacg tcaatacgta ctgaaatcac 660 ccacgggtcc aactcaccga ctcgcatagt aatcgaaact ctccccgagt tagtcgcgca 720 aatacacgga agagctcaaa tcccagcagg cagacaaccc tggcatgcac gcttgatcac 780 cttctgttgg atgctcattc gttggatctt cctattcccg tttcagctaa ctcggtggct 840 tcgtgagtct cgtgcacgag cactcactag agactacaga gattatcaac ttgttgacct 900 tgacttcatt tcccgtgaag ctcttgctaa ttcacttttt ccacaaattc ggttggttcg 960 cttcgaccct aagcggttat acggtgccat catcagagca agcgagaacc ggaactggaa 1020 cgaccgtaac acattcacat ctcgcagaag taaattcctt aggatgacta ccttatggca 1080 attccaaact gagttcataa aaatgggcct ctctaatgat gtctacaaca ccctcagtga 1140 acaagctaga caaacagctc ttgagttcaa aactccagtt ctggattatc aggaaacctt 1200 agaccttctc cgcgagcttg atcaaatccc ctaccttcat aaacatttca attcagttat 1260 gccagcctta cttagaacag ctttctccaa gccagcaata cacaatgtta tactgacact 1320 ctacgacacc acacaatcct actggacgtt aatgagttta cttgattcaa attggtatga 1380 ccttgctgtt aatgttcctg agagaccttc cactaggccg tacaacaaac accacttacg 1440 aaatggtcgc cctcgactcg tcacacccaa cattgtgtca actacgacag agacaaccac 1500 tcctagcagt attattccaa agatgcacag tacaaattac tcacagacgg gttatgttgc 1560 tgctgcctta gctaaccaag ccaaacactc cttctttgtg cccatccttt tgcaatccgc 1620 tgacgttcca gagacaatgg gctaccttga ccaaggatca acattctcct tcatcagccc 1680 agcacttgtg tcttcccttg gacttgtcac ctcaccacga gcacaacgta 
ttaatctcgc 1740 agctttcggt caaggcgccc gcacagtcac tgctcctacc gtcgacgtcg acttcaacat 1800 caaagaccac gatacaccat ttagcttcac cttcgttgtt cacgaaacca ggatcgctcc 1860 tattcttctt ggagcggatt ggtgggaaaa gtacgaactt ctttgtgttg ttggtaagag 1920 attagagata aatggtcaaa ccatccccac cattcaaggg gaggaacacc aactcatctt 1980 cagtaccgac gatttagacc caccacctga tccgacacca ccactttttg cgcatttgat 2040 tcaacagtat aagagtaaag cgcccacgct cacactccct ccggcgcgca gtactgatta 2100 cgtcattaac attcgaaaag atgcgattat taagcccaac agaccacacc cgaagtacgc 2160 agttaatgct gaacaagcca tcatcaacaa cactcgcgag cttctccaac aagggttaat 2220 tatcccatcg aaagcacgtc actactccat cccttcagcg gtccccaaaa agaatggtac 2280 ataccgtatc gtatatgatt ttcgagcagt caatgcattg acgatcccaa taaattccac 2340 ttggaacaat tttcgcgaat tattgcctgg catcaccggc gccacctgct tcagcaccat 2400 tgacttgtct agtgcctatc accaattgcg cattgcgcca ctgactcaac cttggactgc 2460 cttttggacc ccaattggtg tctttgaatt cttggttctt ccctttggac tcaccgacgc 2520 acctgaggtt tttggccgct acatgaagca actttttatg gacatgcctt ttgtccgaac 2580 ctaccttgat gacattcttg ttttctccga aaacgagaaa gtccactata gtcatctaaa 2640 gaagatcttt gaacgacttg ttcaacatca acacaccatc aattgggaaa aatctgtttt 2700 tgcgcaacct gagattcaat gggtcggtca ccttcttacc aaagacgggg tcctgcctac 2760 tgacaccacc aaaggccaaa ttcagctgat tgcaattccc agagacaaag atgaagtccg 2820 tcaattcctc ggacacgtag gttacattca agatcacatt gccaattata gccatcgcgc 2880 tgccccgctc acggatctct taaccaaaca cacccaattc aattggtcac agaaatgcca 2940 atctgccttt aacgacctca aagctgctgt tttagaaacc accagtcttc aattttttga 3000 ccccaaacaa cccattgcgg tattcaccga cgcttcccac atagctatgg ctgccattct 3060 catgcaacct gaccaagctg gtggcggaaa ttggcgcccc atcgaatacc ggtcccgcaa 3120 attcactggg ggggaagtta actgggacat tttcttgaaa gaatttgctg ctgtcaaaat 3180 ggcctttgaa aagttccgcc tttacctcga aggcacccac ttctcacttt ttgtggatca 3240 gcttcccatc aaacaagttt tcgaagcctt tggtcgtggt aaaatacatg gggatgctcg 3300 tatccaacgc tggatgtcca agctcctttg ttacaacttt gacgttttcc acatcaacgg 3360 caacgagaac tttattgcgg atcacatgag ccgcaaccca gcacactttc agtcatctac 
3420 cgccatttcc atcattggcg ccgtcaacgt acttctggtt ccagatgaat ggttccacca 3480 attccctccg gcctacgcgg acgatccatt ttttgccaac gtgtacgacc ttctccaaca 3540 aggacaaacc aataatgagg ctactcaaaa ttactctctt gacacaacaa ccacgttgct 3600 ttactaccat gacaaactct gcatcccaca tgctttgctt caacagtttc tcatgaccca 3660 acacgactcc cttgagggtg gtcactgcgg catcactaac ttcttcaata aggtagaccc 3720 ttactactat tttcccaaga tgcgtctcgt aatccgacaa tacgtcgacc tgtgtctaca 3780 atgccaacgc cacaaacctc ccaaccgggc acccaatggt ttgctccagt ccttccctga 3840 tccaaaggga agatggaccg acatcaacgt tgacatcatt agtggttacc ccgaagttga 3900 attcaacgga cgtttggtaa atgcgatttt aacaatcgta tgtagattct cccgcagagt 3960 ccatttttac cccatatcca cgaagttctc aactctggac ttcatccatg tttttcaaca 4020 cctttacatg cctcttcatg gcatacccac catcatcacc accgatcgag gtactcaatt 4080 tactagcgaa cttttccaaa actacatgag ttccctcaaa atcacttcca atatcgcagt 4140 aaccagccac caacaaagca atggtctcgc agagaccaag aacaaacaaa tcgagcgcta 4200 cttacgcctc tatattgacc agaaccccga atggccacaa cttttactga ttggagaata 4260 tgttatgaac tccaccccta agtccgccct tgaccacaag tccccatttg agatggacct 4320 ttcctacatc cctcgctccc cggccactat tctttttcct acccacgcgg gtgcacaaga 4380 caatgctgct attgacatca ctgagaagtt ggaacgtctc accacagctg ctcgacacgc 4440 tcacaaagca acgttccagc gcctaaagga caactacgac agacaccacc gtgacatcga 4500 gttcaacgtt ggtgacgctg ttctagttcg tacccctttg cttaagtctc accctgatgc 4560 tgccaaccat ggccaacgcc gaaagttttt gtctgaattc actggcccat tcactatagc 4620 tgcaaaatct gacaatggtg tcaactacac tttggaaatg cctcaatcct acaaaggtca 4680 ttccgatttc cacgtctccc agcttagact cctcaagtcc ctcccacctg atgctatcac 4740 cgcccctcaa cccgaagtgg gcttcaaagt ctacaaggac ggatctcaac ttgtcgaaat 4800 cgacgctatc gtggaccaca aaccccggaa aacaggtttc ttactccttt gtcgtcacac 4860 acctacctcc aaacacccag atggcgaatg cgtctactac tccgcctcat cacttagaca 4920 aaccgctcgc gatcttttat tagcttatgc gtgcgaaaag tccttgcctg cccttgtggc 4980 acccaaggac cgccatttag taactcctcc tcaaacaacc tagggggga 5029 // ID Gypsy-95_MLP-I repbase; DNA; FNG; 6379 BP. 
XX AC AECX01000488; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-95_MLP_; KW Gypsy-95_MLP-LTR; Gypsy-95_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6379 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000488; Positions 134891 141269. XX CC Positions [4780-5268] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 281..1951 FT /product="Gypsy-95_MLP-I_2p" FT /translation="MSGKSVPQSSGRRAGTHISLSDDDVRRIILELPDVVS FT TTWDPNYGVWDKPTVENFESWDEVYFQTLKEHKEKRLLTIPPCVERLLGIN FT SDGERPPDVESPTTYKNPRLPTDGSIGQTPAALPANSLPPAPSKKSSSLID FT KKQEAPVDKGKMRSNPTAGPSYPLTGYVPCDQQSASASSVEPMDDILNDQD FT YQRQESKPLITDPIEDPNEVAAFRQWKAAAESERNKLSRRIESVTLSDSTI FT LPNPAPTPAQPPIRQIAVRDSVGPTVRPGTQPGFTTSTRYSQPPEAGISRD FT KDYHKICKTLTIRKFEKFDGTSSYDAPLWFANLAKDLMLLNVDEAVWHYIG FT FQFLDGAALTELRQVIGSPNCPSTYSELETFVINVFPSSLSLINNFKFRPN FT EKVCDAVARFRVLQNDAIHLGFEYQESTVFLSKLPPYLRRYVQSEIERDSR FT AGRPMTFAQVTLCAIDRDNQLRSEKDHVNTVSTPEGNSSSNNRRNNKRKNP FT ESGKTDVVCYNCGKKGHCFGTLVNQLCPEKASQRTLNYFKKVKSNGSSANA FT VASGSGESKA" FT CDS 2314..6039 FT /product="Gypsy-95_MLP-I_1p" FT /translation="MITDSSRDHLAAIETDSGMPRVDFVLTLDKVSTRILI FT DTGAKQSYISQNYVDNRKLITRRNPLRPVVYGVWGKSQKCNHLAELDINFG FT IVSISHELRVAPLASFDVILGMDWIGTYAVSTDWVSGTWVLRDYTGTEQSF FT RPAAISSPVETLHLIEGEGHLESCDEAPSTRSQMRRFMRKKNVESWLVYGS FT DFIHSVDEVEGPREEPPTVLPKISTDDPALRKVASEIVEKYRDLFDEISSA FT SKKERTVQHLIDTGESKPVAQAVRRMSPLLLTELQRKLEVLEKNGFIQPST FT SPWSSPVLFAKNAAGKLRFCVDYRAVNAITKRNRHPLPLIQDCFDQLHGST FT RYSKFDLQQGFHQMRISPDHVPKTAFSTRYGHYKWLVMPFGLVNAPSTFQR FT 
MMSDILRPYLDKFVQVYMDDILVYSKNDQDHAEHGQLVLEALALHELKVSG FT SKSELFADEIQFVGHMVSAAGVRPMKDKLEAIQAWPRPATVTDVRSFLGLT FT GYYRRFVKGFSKIASPLHELTGGNVTKKSKVVWAPKHEASFVTLKKALMEP FT PILINPDPNKAYIIETDSSDFAVGAVLLQVGSDGKEHPIAFDSTKLNAAQR FT NYPAQERELLAIIHAWRKWRNYVEGAVCDTIVRTDHVTLTYLSKQALPTRR FT LLRWIEEFGEMTIKVEYKSGPTNVVPDALSRRVDHEILLIQDCQDGLRDAS FT DWPLLIPYILSSKELPSWVTAAVMEMAIRGAREFEYNAEEETLLWIGNSSE FT RSPFIPFYQRAELVDLLHRKYGHRGRDGTLSLLKDRGWWPGRYKDIESYCK FT SCPECQIYEAPDKNQETACQIPLAPVDPFERWAVDFISLPESKDGYKWILT FT MIDHGTGWPLAIPMKTATSANVAEALLRDLIQVYGVPSEILSDRGKNFLSK FT EMTLFLKSCHTRKINTSGYHPRTNGKCERFNGILEKALFKSNSSKDPKRWP FT EYLAEALFAVRVNKSTVTGWLPFKLLYGVSPRFLGDPATLRAPVVDNDPSL FT QETRLEKLWLSRNKASELSVERAAVNKARFDKQFKADDVTGKVASRLVTYA FT IGDKVKLRNEAHTKGEPHWYGPFEILNSLGKNVYLLMDHRQSLFPHPIGGN FT RLKPVKIREKYLSVAWALPPRLLQDIAKEDLKVSVELKKKAKRLAKTQEKA FT GPTRIKLVGRFAPENASDQGLTASPAQTQPISPSDPVLPPLSVPPVELPEE FT TLTSPLVIDPPLAPQLPNPVQPRRSNRIRDKK" XX SQ Sequence 6379 BP; 1735 A; 1507 C; 1452 G; 1685 T; 0 other; ctggtagcga gagcttctct tttggataaa attgatctga caacttcttg aaatttttat 60 tttttgacat attcaaggca gtttttttaa caggtttaaa aaggattatc atctttattt 120 tttccgtcga aaactccctg tttagaagat ctggtcataa gaataaattt ttttacgagt 180 gagtctatat tcgagctaag taggaaagag ctgagaagct aagttattga caatttaaca 240 attgattctt cgtgtttagt aaaaacatta gattaaagtc atgtccggaa aatcagttcc 300 tcaatcctcc ggacgccgcg cgggtactca tatatccctt agcgatgacg atgttcgacg 360 aattatcctc gaactacccg atgtcgtatc taccacgtgg gacccaaatt atggtgtttg 420 ggacaagcct acggtggaga acttcgaaag ctgggatgag gtgtacttcc aaactctgaa 480 ggaacataag gagaaacgac tcctcactat tcctccctgc gttgagcggc ttctcggcat 540 taactctgac ggggaacgcc ccccagatgt cgaatctccg accacatata aaaatccacg 600 gctacctacc gatggatcga ttggtcagac cccggctgcc ctgccagcca actctcttcc 660 tccggcccct tcgaagaaga gctcctcttt gattgacaag aagcaagaag ctcctgtcga 720 caaaggcaag atgcgttcta atcctacggc cggaccatct tacccgctta ccggttacgt 780 cccttgtgat cagcaatcag cttcggcatc ttcggtcgaa ccaatggacg acatcttgaa 840 cgatcaagat taccaacgtc aggaatcaaa 
gccattgatt actgacccta tcgaagaccc 900 gaacgaggtc gctgccttcc gacaatggaa agctgcggct gaatccgaaa gaaataaact 960 ctctcgtcgg atagagagcg ttactctttc ggactcgact atcctcccaa accctgctcc 1020 gacgccggct caaccgccca ttcgtcagat tgcggtaagg gactcggtgg gtcctacggt 1080 gcgacctggg acccaacctg gtttcaccac ttcgacccga tactcgcaac cacctgaagc 1140 tgggatttct cgtgataagg actatcacaa aatctgcaag actctcacca tccgaaaatt 1200 tgagaaattt gatggcacca gctcctacga tgctcctctt tggttcgcaa atttggccaa 1260 agacctcatg ctcctcaacg tcgacgaggc agtatggcat tacatcggat tccagttttt 1320 ggatggtgcc gctctgaccg aactacgaca agtcattgga tcccctaact gtcccagtac 1380 ttattcagaa ttagaaacct ttgtcataaa cgtgtttcct agttcattgt cccttatcaa 1440 caacttcaaa ttcagaccaa atgaaaaagt ctgtgatgct gtagcgcgct tccgagtcct 1500 tcaaaacgac gcgatacact taggttttga atatcaagaa tcgacggtct ttctcagcaa 1560 gctgcctcca tatttaagaa gatacgtcca aagcgaaatt gaacgtgact caagagccgg 1620 aagacctatg accttcgctc aagtcacact ttgcgctatt gacagggata atcaactacg 1680 atccgaaaag gatcatgtca atactgtgtc tactcctgaa ggaaattcat ccagcaacaa 1740 tcggaggaac aacaagagga agaatccaga aagtggaaag accgacgtag tttgttataa 1800 ttgtgggaag aaaggccatt gttttggaac gttggttaat caattgtgtc ctgagaaagc 1860 atcccaacgc actttgaatt acttcaagaa ggtcaagagt aatggttcat ctgctaatgc 1920 tgttgctagt ggctctgggg agtcaaaagc ataggcgata caactctagc tccttttgta 1980 ttgcctgatc tgtgtgtaac atcatctatt gctagtaatc atactgtcat taccgctgct 2040 agtgatcaat cttcggatta tcttgatgtt tattgtgagc ccgaagttag aaaaacttca 2100 agtgagaata ttctttcatc taagccacct aaagaggcta tccattcgaa ggctaagagt 2160 gtttcatctc aaggcgagaa acagctttcg aagatcgctc ctagtcttta tgactactta 2220 ccgagcgctc cttgtaagga gaggtgctgc ccaacagtag ttaattctac cgcattctca 2280 ccggccagtg agcccggtgt atcaactcct ggaatgatta cggatagctc ccgtgaccat 2340 ttggcagcca ttgaaactga ctctggcatg cctagagtcg attttgtcct gacactggac 2400 aaagtatcaa ccagaatatt aatcgatact ggtgccaagc aatcgtatat ttcacagaac 2460 tatgtggata atagaaagct tattacacga agaaatcctc ttcgccctgt agtttatggg 2520 gtatggggta aatctcagaa gtgtaaccac ctcgctgaat 
tagacattaa cttcggtata 2580 gtgtcaatca gtcatgagct tcgtgtagct ccgttggcgt cctttgacgt aatcctgggt 2640 atggattgga tcggcaccta tgccgtctca acggattggg tctctgggac ttgggtcctg 2700 cgtgactata caggtactga gcagtcattt cgaccagcgg ccatctctag cccggtcgag 2760 accttgcatc ttatagaagg tgaagggcat cttgaaagtt gtgacgaggc tcctagtact 2820 agaagccaaa tgcggcgatt catgcgtaag aaaaacgtag aatcgtggtt ggtttatggt 2880 tccgatttca tccacagtgt agatgaagtc gagggaccta gggaagagcc tcctacggtc 2940 cttcccaaaa tttctaccga tgatccggca ctcagaaaag ttgcttctga gattgtggag 3000 aaatatcggg atctgtttga cgaaatttcg tccgcctcca aaaaggagag aactgttcaa 3060 catcttattg atactgggga gtcaaaacca gtcgcacaag ctgttagaag aatgtccccg 3120 ttacttctta cggaacttca gagaaaactg gaggttctcg agaagaacgg gttcatacaa 3180 ccctctacct caccttggtc ctcaccggtg ctatttgcta aaaatgccgc cggtaagctg 3240 aggttttgtg tagattatcg tgctgtcaac gccataacca aacgcaatag acatccttta 3300 cctttaatcc aagattgttt tgatcagctt catggctcta cacgctactc caagttcgat 3360 ctccaacaag ggtttcacca gatgagaatc tcacctgatc atgtacctaa aactgccttt 3420 agtactaggt acggccacta caaatggctg gtaatgcctt tcgggttagt taacgcgcca 3480 agtacattcc aaagaatgat gagcgatatc ttacgtccgt acttggataa gtttgtccaa 3540 gtatacatgg atgatatcct agtatattct aagaacgatc aagaccatgc tgaacatggg 3600 cagctggttt tggaagcgtt ggcacttcat gaactgaaag ttagtggttc taagtccgag 3660 ctttttgccg acgagattca gttcgtcggc cacatggtgt ctgcagcggg tgtccgtccc 3720 atgaaggata aactcgaagc catacaggct tggcctcgcc ctgctaccgt gaccgatgtt 3780 agatcctttc tgggcctaac tggttactat agacgttttg tgaaagggtt ctccaagata 3840 gcgtctcctc ttcatgagtt gacaggagga aacgttacta agaagtccaa agttgtatgg 3900 gctccgaagc acgaagcgtc ctttgttact ttgaagaagg ctcttatgga gccccctatc 3960 ttgatcaatc ctgatcctaa taaggcctat attattgaga cggactctag cgactttgca 4020 gtcggagccg ttctcctcca ggttggttcc gatgggaagg aacatcctat cgcattcgac 4080 tctacaaagt tgaatgctgc ccaacgtaat tatccagctc aggagaggga gctactagcc 4140 atcatccacg catggaggaa atggcggaat tacgttgaag gcgcagtttg tgacactatt 4200 gtaaggacag accacgtaac ccttacgtac ctatccaaac aagcgctgcc 
aactcgacgt 4260 cttcttcgat ggatcgaaga attcggagaa atgacaatca aagtggaata caaatcagga 4320 cccacgaatg tcgtacctga cgctttgagt cgtcgagttg atcatgaaat tctgctcata 4380 caggattgcc aagacggttt gagggatgcc tcagactggc cattactgat cccatacatt 4440 ttgagctcta aagaattgcc atcttgggtc actgcggcag ttatggaaat ggctatccga 4500 ggcgctcgag aattcgagta taatgctgag gaagaaactc tgttatggat cggaaacagc 4560 tccgagagga gcccgttcat tcctttctac cagcgagcag agctggttga cctcctgcac 4620 cgaaagtacg gtcacagggg gagagatggg accttatcct tgttaaaaga caggggatgg 4680 tggcctggcc gttataaaga catagaatcc tattgtaaat cttgcccgga gtgccagatt 4740 tatgaggcac ctgataagaa tcaggaaact gcttgtcaaa taccacttgc tcccgttgac 4800 ccttttgaac ggtgggcagt agatttcatt tcactgcctg agagcaagga tggatacaag 4860 tggatcctta ccatgattga tcacggtaca ggttggccac ttgcgatccc gatgaaaaca 4920 gctacttctg ctaatgtggc agaagctctg ttgagagatc taatacaggt ctatggtgtt 4980 ccctccgaga tactttcgga tcgagggaag aatttccttt caaaggagat gactttgttc 5040 ttgaaaagtt gtcacacgag aaagataaat acttcagggt accaccctcg cacaaatggg 5100 aagtgcgagc ggtttaacgg tatcctggaa aaagcgttat tcaaatccaa ctcttctaaa 5160 gatccgaagc gttggccgga atatttggcg gaagctttat ttgccgttag ggtaaataag 5220 agtaccgtca ctggttggtt gcccttcaag ctgctctatg gagttagtcc tcgttttctg 5280 ggcgacccag ccacgcttag ggctcctgtg gtggacaacg atccaagtct gcaggagact 5340 cgtttggaaa aactttggct ttccaggaat aaagcttctg agctatctgt cgaacgagcg 5400 gcagtaaata aagcacgttt cgacaaacaa ttcaaagctg atgacgtgac ggggaaggtt 5460 gcatcccgtc ttgtcactta tgcaatagga gacaaagtaa agctccgtaa tgaggcccat 5520 acgaagggtg aacctcattg gtacggaccg tttgagatcc tcaattctct cggtaagaac 5580 gtatatttat taatggacca tagacagtcc ctttttccgc atcccatcgg gggtaacagg 5640 ttgaaacctg ttaagattcg agaaaaatat ctcagcgttg cctgggccct tcctcctcgg 5700 cttcttcaag acatcgccaa agaagatttg aaagtttcgg tggaattgaa aaagaaagcg 5760 aagagactgg ctaaaactca agagaaagca ggccctactc gtatcaagtt ggtcggacga 5820 ttcgctccgg aaaacgcgtc tgatcagggc cttaccgcgt ctcctgctca aactcagcct 5880 atctcgcctt cggaccctgt cctccctcct ctttcggttc cccccgtcga attaccagag 
5940 gaaactctta cttcaccact ggttattgat ccacctcttg cacctcaatt accaaatccc 6000 gtccaacctc gaaggtccaa cagaattcgt gacaagaaat aaattaccgg gctcctacat 6060 ctcggattac cctctagtcg tttgtctggt catcatacct ggtcgggtat ccgcccctgc 6120 gattcaggag tatcgcttgt ttgggaaggg ggtggttgta gcagggccga aggccctgta 6180 ctatctacat taacataaat ggctctaacc taactaccgt aggaatcagc ccttataaaa 6240 gggctagtct agtccacaaa catcgtatta tcaactagac cctataaagg aacattccat 6300 ttataactag gcacaagcct tcctcaattg tacctagttg aaagaaaagc gccataagcc 6360 ttttttgaag atgtaagac 6379 // ID Copia-49_MLP-LTR repbase; DNA; FNG; 552 BP. XX AC AECX01002337; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-49_MLP_; KW Copia-49_MLP-I; Copia-49_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-552 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01002337; Positions 26581 26030. 
XX SQ Sequence 552 BP; 175 A; 95 C; 79 G; 203 T; 0 other; tgttgaagta tccaatcgac aaagtgaaga agactacaac ttactggatt atacataagc 60 aatccggaag aaaatgaaat atgaatatca tttacaaatg agaaggatac atcatgatac 120 gtcatcatta atttccccca aagtgttaag accaattact acatatagct gttatctctg 180 aaatcgaatt acttctcttt tccccatctt tgggaaagag aagtaagtaa tttcctatta 240 accttagtcc tttctatttc tctctcttga tgttatttgt tattgacaaa gataatgtat 300 gttgtttcta tcaaaaatta gtaaagcgct cttatagact caaagatcag ttatcttgca 360 ttcaataact tatactaatc agatcatcaa tactctgtac caacttatag gtactaattg 420 ttatttcttt ctattcttca taagtcttta tcttcattta tctgattagc tattattgtt 480 ataggtcaga ggtctttcta tcagattgtt gtttcgaggt tgtgcttatc agagcagtag 540 tgcatactac ca 552 // ID Gypsy-1_AM-LTR repbase; DNA; FNG; 164 BP. XX AC ACDU01000137; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Allomyces macrogynus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_AM_; KW Gypsy-1_AM-I; Gypsy-1_AM-LTR. XX OS Allomyces macrogynus OC Eukaryota; Fungi; Blastocladiomycota; Blastocladiomycetes; OC Blastocladiales; Blastocladiaceae; Allomyces. XX RN [1] RP 1-164 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Allomyces macrogynus genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; ACDU01000137; Positions 14566 14403. XX SQ Sequence 164 BP; 28 A; 57 C; 41 G; 38 T; 0 other; tgttggcccg aggccaagcg tcccccgcct actcagccac accgggatgg tcgtggcagg 60 tgggcaggga tcgccttccc tatgagtagc agtccgtcag ctttaatata ttccttacgc 120 attcctcctt ggattctacg acccccccca gtccggtcgc atca 164 // ID CACTA-2_Mlaricis repbase; DNA; FNG; 8208 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW EnSpm; DNA transposon; Transposable Element; CACTA-2_Mlaricis. 
XX OS Melampsora laricis OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX CC incomplete_on_the_left. XX SQ Sequence 8208 BP; 2335 A; 1854 C; 1570 G; 2449 T; 0 other; aaaaaactga tatgaagtac tctcaattgt ttgtcaagaa ttggggatgc aatctgaagt 60 tcatgcaatt caaaaaaatc tcaagtataa caagatggct ttagagaaaa gatataactt 120 tggagatacc tttagagaat ataagaaaca gattgtaata gcacaaccat acaataatac 180 actattaaat ttgtaaatct gagatcgaag tgtcattggt aggtgaagca tcatcttgct 240 ccgccaaatc agattcagaa tgaacatcat aataatgttc aagggacttt ccaggtccta 300 ataactcact cagcgctcgc gcttgtcccc ttattaaggt gtcagcgatg tccaactcgt 360 gctgtaggtc aattgcatct acttgaagat catcaacttt tgtttcagga atcacaatct 420 tctgcattgc ttcatctagc ttggaagaat tttgagcagc ggcgtcggct agaataaaca 480 ggagtcagct tgaagagttg ggatccgcta cagaagtgaa aaacacatac cattcgcttg 540 ttgttgttgg agatcttgga aaagattcat actggccttt ttctccatct cttcaagtga 600 actttcaaga tgggctaatt gttgttcaaa atttcgcact ggatggatga tttgatgaga 660 aatacagttg catagtattg aacaacgtga ttgtgagctt acctttctct gctcgttgtt 720 gctcctcttg gaagatatca actccgcttt tcttgttcaa gggagaaaga ttttggttca 780 tgttgatagc tggaatatcg cggccgatca ctggacatgt tttgtgagat ttcagctcag 840 caccttgcaa caaaggtaga aaaacgagag aaggagacaa ttacctagat tccccacttc 900 aaaaacttca gtatcagaaa agttaataag atccatatcg tctgagacat gagctgggtt 960 tgatgccttg acgtcggtga cttgttcttg taaattcaat gtattccagc cttggtgtcc 1020 ttgtgagtga ttcggatcaa tgttgaaaaa tatgttaacc gggctgtttt ctcttttgat 1080 aggagaaact ttcgccacgc tcctatttgt tgaagtcgga atacgtgaag cgccactttg 1140 atcatttata tgaggcaaaa agagggattg tttgagcccc tcaagttcct ttttttcatc 1200 ctcattgctg agacttaaga ttgatgaaat tcgaggtgac gaaatggacg cgccctccat 1260 tcgattggaa acaacctttt ttgaccgttg acgaacggta ctagtgctgt 
ttgccgcctt 1320 ggtgcggctt gagggctgct acagaatatc agaatcaatg agagttggaa ggtaatttga 1380 tcaaaggttt gatgctcact ttcttctttt tcgaaatctt ggatgtggat gtgccacaaa 1440 cctcaggctc cggctttgct tccattccta attcaatcaa caagtgtaat cagaatgtat 1500 atacactatt ttgacgtaaa agagggaagt gacccaccaa ttcctatgcg cttatgtgga 1560 taagaaaact ccaaatcttg agatgcagag cgcttgatca ctaacggctc ttttttctcc 1620 aaaggccttg aggttatagg agaagtaact cgaagatctt caagagtagg atagatgttg 1680 atcattccgg aagcaaccgc tggggcatct gaggtcggat cggggaaaac tgtcaagaac 1740 ccgcatgagc gtcctcagta tgtataaatc ataaaatcac aaccatatga ggagatgaaa 1800 aaggagataa agatttgaac tgacattcag gttgtttctc tcgaacaatg actgctacct 1860 cgcctgttct gatatttgat cttatgggag tcattggatg atgaatgcgc aaccagtcaa 1920 gcatagtttg tttaccaaca gatgggttac tggggtcgag aaaagtttct aaagtacctc 1980 tagcactagg ttcggacata tttatgagaa agttgattta agattcaaag aaaagagatt 2040 tcctgttgga aaataaagta tttgaggaga agagatgttt cgaaaataaa tgttgaggat 2100 gtactctaaa gtaagatgga gaaagaacaa gcctgctaga gagctaactt gatgtgacgg 2160 gatgagtcca tcaagatacg ttgatactta ttttctttca aaggagaact tcttcgtgta 2220 ttcggagtta agtcgtgacc ccaatactac agttgcacca tgagttgttt gcggcatcct 2280 tcatgactca gaagccacag acaaagaacc tttcatgttt ttaatttatg atcgacttag 2340 aaatttccat gaggtagtcc cgcatttctc ctttcggcaa ttacctgagg tacatgacaa 2400 ccaaacaatg gttcagggtc acagatcagt atctagatgt tgccgttcaa tgtgttacct 2460 ggacccaatt ttcgatggtt caaacaatgg caaacaattg atggtaatct cttacacacg 2520 tatatatgct tgccatccat ccatgtataa ccacagaagg gggagacatg atttatagaa 2580 aggcctacgt tcttcctcaa cactgatttt cccttttcca tcgacttgtt ctttggttgt 2640 tggcttcttt aactgtcctt tacaattgtt cgtatcagta gctcatatcg cctgtatcat 2700 caaattcatt gttgattgat gtttgtttgc ttgtttcttt cagaatcatt tgtatcagta 2760 gctcacatcg cctttaccat caaagtcgtt agtcattatt atctcctttc accttgtctg 2820 gtgattgata caattgaaag tgtgttatgc cgtcttcctt tttctactta catacatgct 2880 gatcagttgt ttggaaatct tcatgatctt cgcttttatg gatagaactt cagactcgcc 2940 gtcatgccac caagaacacg taacccctct ttggcaaatg gatcacctct cagtccgcct 
3000 ggagaaggtc caaagcctgg tcaaacatct cctctcgcat cggcccaacg tttccaacgt 3060 ccacctccag aagacccagt ggctgccaag gttcaatctc tttcggcaga ggcgcggatc 3120 ttccttacct atgactttga atctgttagc cgcaagcctt tgaaattaca ctactataac 3180 atcatatacc ttttcaatcc tgctaccaaa agctgacctg gtgacaagaa agatgtactc 3240 aagaaggcct ttctcgacga agtgcgcccc ctcctcaaac catatctctt acctcctcct 3300 tctatgccta tggagacgga ttcaacagac ctcgattttg acccactgag acgcaaaact 3360 actcgcccga tgctcgtgaa agcaattgca agcaagtcgc ccaaaaccga tgtatcgccc 3420 ggtgccacta tcgatgatgt tctaattcta tacaagcgtt acgtcgaccc tcaacttaaa 3480 cttcctgcaa atgattgctt cacaaaaaga cctcgcactg tccctatcaa cagactcaag 3540 agcgaaagta tggaggacct tcttcttgct ttgcgatacc ttgcgccgcg agtgtttgta 3600 cgttccttgg ccatgaacaa gagctgcctt atggacctct atatcaagtt tgtccacaac 3660 aagactcccg atagtcccct tatccttggg taccattaca cccttttgga tttatcacca 3720 tcggatcttg taggagagtc tatggaagaa gactctgtcg gagaagcaaa gtgtaagatt 3780 cagatttctt catacgtctt ctggtaaatt tgcatttagt attctacaga agaactgatg 3840 ggattcatta tgaacttctc aattattcac cgaatcattt gtatttgtag aaaacaaaga 3900 atgtcaccgt agaaaacgaa gtgaagaatg tcatatgatt tacagtcatc aaacaagcac 3960 cgatttcatc agaagttctt tagttttcat acactcatga cgatcccaga attaaagatg 4020 tcttgtattt ttaatcaacc tttttttgtg ctcactcaca ctgctcttcc acccttccca 4080 aaatcggact agacccttca cttcaacgta ctcctattta agatattcac ccatcatatc 4140 tatttttgaa tgtctgatga ttgattatat acattaaaac cccccaaaaa aaatttcttt 4200 gatttctttc tttataaagg tctcatatac gttaaaaccc ccaaaaaatc gtgtctccgt 4260 tgtatcatca tccgtgaaat cgatcttttc ttgattgctg ttcaccaatg cctccgcgac 4320 aacatcaaac caaacaagcc cattgagttg tgtgtacatg tgttgcttac gaatgttctc 4380 agcaagaata tcttgatgcc aacggaacct gccatcctgg ggttgaggtt ttgcctgaga 4440 cacgtgctgc tcaccaacgc gccgattttc gaaacagatt accaaaatcc ccgcgtacac 4500 caggtcactc aaacgaatca ttgatcaatt tggctctgga caacctgctt tcacccttga 4560 gacaattgcg tatcacctca tcaccttcta ctccgcgcaa tgatcaacat ctagacgatg 4620 aacttactca cggaagatca tcaatacatc attcaagtcg cccagatgac gaaatttcaa 4680 
atcaagtggc cgagatagag tcaccaaacc tccctgcaac cgattctctt cccaaatcac 4740 agtccaagtc caccaaaaaa tgctctgagg cagccttggc aaaatcttca ggtcaatcgg 4800 tgtttgattg tggtcagtat cttttcagta tatcctgtta gtcgactttc tcatgattga 4860 ctgtactgat ttgttacatg ttcgctgttt actctttcta gatcgttttc atggatccga 4920 cttgaaatca gcaaatccac tcattcttca tacggctctc actgcttcca ttcttgacat 4980 atttggacaa tcctcaaatt caatcaccaa atggatcctt gatatccaga ccatcacgat 5040 caagctgtct acaacacatg gaatcttacc tacaaagtca ccgatccgtc agattcttcc 5100 tgaagaagct gccactctta agaaaattcc tagctcagct acaacaacct tccgatggtt 5160 aaaacttgat ccagtcctaa aatatctgaa ctgctgctcc tcgtgctttg cgatgtatcc 5220 cgagaacagc gcaccatctc gatgtcatca tcgtattgca aatatcccgg gtggaccttc 5280 tgattccgtc gatccaaaca aaaatacctg ttctcctcca tcagaagaca atgagccaga 5340 attctcagat tcaatatgtg gagagcctct attcaaatat gtacgtggtg tacagaagcc 5400 tgctcgccgt tatgcatttc agagcctctc ggattggatt tcacgtctcc tctctcgacc 5460 tgaaattgaa aaagcgctag aggtttcagc atcagaatcc tcaaaaccat ttgattcaac 5520 caccaatatt cacgatatct atcagtctcg actatggaag gagttttgtg gtccagatgg 5580 gagccagttt accagtaata gcagtaactt aagctttgct ctttttgtgg atgcaattaa 5640 cccctttggt aataaacaaa gtgggcatca cacttccatc acatttgtaa ttcttgtttg 5700 tctaagtctc cctcccaatc ttcgacatca acccgagaac gtctttgtag tggggatcgc 5760 tcccggacct cgcgaaccct ccctcgagca aatgaattgg atcttacgac cgcttgtcac 5820 cgaacttcag gtcttatggt caacaggatt actcatctct caaactcacg agtaccagga 5880 tggccgtctt atcagagcag cattgttggt ttttgttgca gatattccat ctcttcgtcg 5940 ctgccttgga ttccccagcg ctactgcgac ttttttttgt tctttctgtc tcctcaagaa 6000 aagcgaaatc aacaacttcg atcaagattt gtgggaacct cgtacgtggt gtcaacatca 6060 caaatgggca tgtgaggccc gtgatgctaa gacagttaag gagcgcaaga agatattcaa 6120 aacacacgga gtgcgctact cagtgcttat tgaattagat tattggaaca tcatcgatta 6180 tcacgttgta gattcaatgc acaacttact tctaggtcta tcttcttggc atgttcgacg 6240 tttttgggca atgaaggatc ttcaaaatga cgaagagaag ttaccaccaa ttagtactgt 6300 cgagcttctg aagctcgttg cggagcattc agaaccttta cccgattggc ccaatccacc 6360 cgcactcgat 
tctgaaagag accccaaagg actggatcaa aatcttaacg atatcgagtt 6420 ttcaaatgat acctcatcga gcaatcagga cttcaatccc tttgatgatg ctggatggaa 6480 gggagaatgg aatcctccac cccttgaaga gatcatcttt gatgctgaag tccttcgtca 6540 catcaactca atacttccaa agattcatac gccaacttgg atcaaacgtc ccattcctgt 6600 tttaggcaaa gcatcatttg ggaagctgaa agcggacaaa tggagaagct tgatcacctt 6660 gcagcttcca ttggttttaa tcccgatgtg gtcaggcaaa gatcatatca aaacctcact 6720 cctcaagaat ttcatccact tagtctcagt tgtgaactta ggactcaaac gtgtaatcaa 6780 ctctactcac atcgaacgtt accgttatca cattcgaaaa tatctggaag gttctgtttt 6840 actttttcag cactgcaagc ttgctcccaa ccatcacatt tctgtccacc ttgccgattg 6900 tcttgaaaaa ttcggaccag tacgtgcgtg gtggtcattt ccctatgagc ggttaatggg 6960 aaagatcctc aaggcagaac acaacaatca tataagtaag caaggcttga aatcgttgtt 7020 cttttttcat aattgaactt cactgaccac ttggtctcta catatttaac agctgagctt 7080 gaaataacgt ttctgaataa attttgccgt gcagcaaacc ttttggcttt gatcaaagat 7140 gataaactgc ctgagacact aaaaccatat acttcacgac ttcaagccct atataaccca 7200 cccttgcgaa ggccacgtcg accatccaac tcgaagcttt cccctctttc aaatgatgtc 7260 ctcaacctac tcgtcgatta tctgaacacc acaaagaaag aatcatgcgt atggcgtcgc 7320 ccagatgatt gggccttact gagcaagagt gactcaatag gttactctcc tgtagccgct 7380 cgtgctcaat tctataaaca ggtcgaacac accgacggat tgatttctac tttcactcaa 7440 aatccagata attgctgtgt ctatttcaag gactcaaaag gttctgattg ctttgctcga 7500 gtatattcaa tatttttgca tagtaggact tctctacaat ctggtgtttc tactgatata 7560 tggctccatg tacaatgctt tccccattta ccaaccacac atcataatcc ttttgactta 7620 actgaccagt cagaagtaca atctgcattg agattatgga gtcctaccga gcagaaatta 7680 atcaagctaa atgaggtgat tgcacagtgt gtatggatca tgttcaaacc gggggagatt 7740 aatcagaatg tgaatgtatc tacgataggc ttgattatct taaaacactg attgtatata 7800 ctcttctgaa ttcaagctat ctcatttttt ctgtttatga acttgatcaa cactttggaa 7860 tggatttggc ttgacccaat gtaagatgaa ccattcatca agatctcagc atctctagaa 7920 atgagaagta catgcatata aaggaaagtg taagagtaaa aaaatgtggt tgcaccacct 7980 gctagtgaga gccactatat caatttgtca aggtaactta tgtaagcctg ttacatccgg 8040 taccagttgc ccagcatcta 
tgtcctattg actgcatggc ttgtcattga acataccata 8100 gcacgctgat gtttgttttc tgatcacaag agttcagaaa atgatttgaa tttttcaaag 8160 ccaaagactt ttgtaaactc catggtatat tagtcatatg gcgtagtg 8208 // ID Mariner2N1_AO repbase; DNA; FNG; 1883 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE It is a family of nonautonomous Mariner DNA transposons- a DE consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Mariner2N1_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-1883 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-1883 RA Kapitonov V.V. and Jurka J.; RT "Mariner2N1_AO, a family of nonautonomous Mariner DNA transposons RT in the Aspergillus oryzae genome."; RL Repbase Reports 6(1), 30-30 (2006). XX DR [2] (Consensus) XX CC Mariner2N1_AO copies are 93% identical to the consensus, which is CC 77% identical to the Mariner2_AO. The TPase-encoding CDS is CC destroyed by RIP (30 stop codons). 
XX SQ Sequence 1883 BP; 699 A; 311 C; 250 G; 621 T; 2 other; acgtagccgg taagcggtcc ggtcatgtaa gcactccggt caccccacgc gtttttccgc 60 gccttctaga attgaagcta ttcaactacc agctattcaa tcaaaaatac ctaaaagctc 120 taaagatcga gtggagcagg aagacagaat hctattagct atttcagctt taaataaaca 180 aaaaatagct agtattcgaa aagctgcatc tatttttaat ataccttatt ctacttttca 240 agaataatta aatagtcata cttttcaagc tgaactacgc gtcaaaagct ataaattaat 300 ttaaaataag gaggattcct tagtataata gattctattg ctagatcaat atagagcagc 360 tcctcggtat acacacatat gagaaatagc taatattcta cttactaagc gtggtaattc 420 tatctctact actatagatt aaaaatagat atataatcta gttcagcgta gagataagct 480 aaaatctcgc ttctcttatc gctataacta tcagcgcgct aaatacaagg atcctaagct 540 tctatataaa tagtttaagc gtgtatagat tactataata tagtatagda ttcaaccaaa 600 caatatttat aactttaata aaattagctt tacaatagat ttaatatcta ctactaaagt 660 agttatttaa gctaaattaa ctggtagacc ttttcttcta taactagaaa attaggaata 720 gattacttct attaagtata tcggctctag aggatctctt ctaccttatt ttatttttaa 780 aggtaaagtc tatattaagg gctagtataa gatggatcta ctactagact agcatataga 840 aataagtata aatagctaga ctatcaataa aatagatctt cgttaactac agcagctatt 900 tatatctttt actactagcc atatagttag tcaatatcat cttttaatac tcaatagcca 960 cgatagttat ttaatacctt agttcaataa tatatatagt taaaataata ttatacctct 1020 ctatatatct atacacttat tttatctact ttaaccgctt aatatagact actttagccc 1080 tctaaaacgc gcgtatagat aactaattaa aaataaaata agattagact ttaactatat 1140 taataaactt aattttttta aagccttttc ttaggcaaga gcttagatat atactactag 1200 taatatctac agcggtttct tagctactaa tcttatttct tttaattcta agcgcgtact 1260 atcttagctg aatatttagc taaaagtaat atctctagat agtagaccta gtagtagatc 1320 tactaattta gtatctaaaa tatcttataa tctaaagtag ctataaaaat agaaaactat 1380 atttaaaaag ctacttagag cttatataaa gagccctaac tcgcctacta agatagtaat 1440 aaagcagctt tttaaagact ataaataagc tttaaataaa gctactatta taaagtagaa 1500 agctaagaaa ctacgcgccg tatataaaag aatatttaaa aaaaaagcgc tctactagac 1560 agctttctat aaaatcagat actttagttt aaaaagctta aaagcttata caaaacagga 1620 attctactaa tacagctata tctactaaga 
tagtagatat agactctata gtagaaaacc 1680 agcgcgtacg cgccccacta aagtattcta attatagtat tctagatcat aaaattactt 1740 attatcctaa tcgttagact atttaaattt tctatagaaa tacagatttt ttggtgtttt 1800 gaatagcttc aattctagaa ggcgcggaaa aacgcgtggg gtgaccggag tgcttacata 1860 accggaccgc ttaccggcta cgt 1883 // ID TF2_I repbase; DNA; FNG; 4220 BP. XX AC L10324; XX DT 07-FEB-1997 (Rel. 2.01, Created) DT 30-AUG-2005 (Rel. 10.09, Last updated, Version 2) XX DE Internal part of TF2 retrotransposon, protease, reverse DE transcriptase, RNAse H, integrase gene, complete cds. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; RNase H; KW TF2; TF2 retrotransposon; integrase; protease; KW reverse transcriptase; internal portion; TF2_I. XX NM TF2. XX OS Schizosaccharomyces pombe OC Eukaryota; Fungi; Dikarya; Ascomycota; Taphrinomycotina; OC Schizosaccharomycetes; Schizosaccharomycetales; OC Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-4220 RA Weaver C.D., Shpakovski V.G., Caputo E., Levin L.H. RA and Boeke D.J.; RT "Sequence analysis of closely related retrotransposon families RT from fission yeast."; RL Gene 131(1), 135-139 (1993). XX RN [2] RP 1-4220 RA Boeke D.J.; RT "TF2."; RL Direct Submission to Genbank (04-FEB-1993)Jef D. Boeke, Mol. RL Biol. Genetics, Johns Hopkins University, Baltimore, MD 21205. XX DR GenBank; L10324; Positions 349 4568. XX CC LTRs of TF2 are named as LTRTF2. CC TF2 contains a single ORF encoding a protein with regions CC similar to protease, reverse transcriptase, RNase H (RH) and CC integrase from other retrotransposons and retroviruses. CC CDS 167..4168. 
XX SQ Sequence 4220 BP; 1629 A; 766 C; 667 G; 1158 T; 0 other; aataactgaa ctcttgtgat ctacaattaa ctccttttag gaaaaaggaa ttattgaata 60 aatatatatc ataaatatat atatttctct ccttctgggt tcaaaggaga aggaactttg 120 gaaggactgt tatccctttt gaaatctccc aaagggaaac atatacaatg tcctacgcaa 180 attatcgtta tatgaaagca agagcaaaac gatggagacc agagaatttg gatggaattc 240 aaacatcaga cgaacattta ataaaccttt ttgcaaaaat attatcgaag catgtaccag 300 agatagggaa attcgatcca aataaggatg ttgaaagtta catttcaaaa cttgatcaac 360 actttactga atacccttca ttattcccaa atgagcatac taaaagacag tatacattga 420 atcacctaga agaattagag caacaattcg ctgaacgcat gttttctgag aatggaagtc 480 ttacatggca agaattactc agacaaacag ggaaagtaca aggatccaac aaaggtgatc 540 gtttaactaa aacatttgaa ggttttagaa atcaattgga caaagttcaa tttataagga 600 aactcatgtc aaaagcaaat gttgatgatt tccatactcg cttgtttata ttatggatgc 660 tgccatattc cttaaggaaa ttaaaggaaa gaaattactg gaaatcagaa atcagtgaaa 720 tttatgactt tttagaggac aaaagaacag cctcgtatgg taaaactcac aagcgttttc 780 aaccgcaaaa taaaaatcta ggaaaagagt ccctttcaaa gaaaaataac accactaata 840 gcagaaacct gaggaagaca aatgtttcga gaatagaata ctcatctaac aaattcctaa 900 atcatactag gaaacgttac gaaatggtat tacaagctga acttccagac ttcaagtgct 960 caataccctg tctaatcgat acgggcgctc aagcaaatat tataacagaa gaaactgttc 1020 gagcacataa actgcctacc agaccctggt caaaaagtgt gatatatggt ggagtttatc 1080 caaataagat taatcgcaaa acaataaaac ttaacataag tctaaatgga atatcaatca 1140 aaacagaatt cttggttgta aagaaatttt cgcatccagc tgctatctcc ttcacaacat 1200 tatatgacaa taacattgaa atatctagca gtaaacacac gctctctcaa atgaacaaag 1260 tttcaaatat tgtcaaggaa cctgagttac cagatatcta taaagaattc aaagacatta 1320 ctgcagaaac caatacggaa aagctaccaa agccaataaa agggttagaa tttgaagttg 1380 aactaactca agaaaactac agattaccta tcagaaatta cccgctacca ccgggaaaaa 1440 tgcaagctat gaatgatgaa attaatcaag gattaaaaag tggaattata cgagaatcta 1500 aagccattaa cgcctgtcca gtaatgttcg ttccgaaaaa ggaaggcacc ttgagaatgg 1560 tggttgacta caaaccttta aataagtatg tcaaacccaa tatatatccg ttaccactta 1620 ttgaacaatt acttgctaaa atacaaggtt 
ctacaatttt tactaaactt gacctcaaaa 1680 gtgcctatca cttgatacga gtaagaaaag gagatgaaca taaacttgct tttcgctgtc 1740 ctcgtggagt ttttgaatat ctagtaatgc cttatggcat atctacagct ccagcacatt 1800 ttcaatactt tatcaataca atacttggtg aagccaaaga atcacatgta gtatgttata 1860 tggatgatat tttaattcat tcaaaatcgg aatctgaaca tgtaaaacat gttaaagacg 1920 ttctacagaa attgaaaaat gcgaacttaa ttatcaatca agcaaaatgt gaatttcacc 1980 aatcacaagt aaaatttata gggtatcaca tttcggaaaa aggatttacg ccttgtcaag 2040 aaaatataga caaagtctta caatggaagc aacctaagaa tcgtaaagaa ttacgacaat 2100 ttctaggttc tgtcaattat cttaggaaat tcattccaaa gacatcacaa ttaacacatc 2160 cactcaataa tcttttgaaa aaggatgtac gctggaaatg gacaccaaca caaacccaag 2220 cgatagaaaa cattaaacaa tgtttagttt ctcctccggt gctacgacac tttgatttca 2280 gtaaaaagat tctactggaa actgatgctt cagatgtcgc tgtaggagcc gtattgtctc 2340 aaaaacatga tgatgataaa tactatcctg ttggatacta ttcagcaaag atgtctaaag 2400 cacaattaaa ttatagcgta tcggacaaag aaatgcttgc aatcattaag tctctcaaac 2460 attggagaca ctatttagaa tccactatcg aacctttcaa aattttaaca gaccatcgaa 2520 acttaattgg tcgcattact aacgaatccg agcctgaaaa caaacgttta gctcgttggc 2580 aattattttt acaagacttc aactttgaaa ttaactacag acctggatca gcaaatcaca 2640 tagctgatgc cttatccaga attgttgacg aaacagaacc aattccaaaa gattcagaag 2700 acaatagtat caactttgtt aatcaaatct cgataaccga tgattttaaa aaccaagtgg 2760 ttacagaata tacgaatgat acaaaattgt tgaatttact aaacaatgaa gacaaacgag 2820 tggaagagaa tatccaactc aaagatggct tactaattaa cagtaaagac caaatcttat 2880 tacctaatga tactcagctg actaggacaa ttattaaaaa gtatcatgaa gaaggtaaat 2940 tgattcatcc aggcattgaa cttcttacaa acattatatt acgtagattt acgtggaaag 3000 gaataagaaa acaaatacaa gaatatgtac agaactgcca tacatgtcaa ataaacaaat 3060 ctaggaatca taaaccttat ggacctttac aaccaattcc cccatcagaa agaccttggg 3120 aatctttatc aatggatttt attacagctt taccagaatc atctggttat aatgcacttt 3180 tcgtggtagt tgaccgattt tcaaaaatgg caatcttagt accttgtacg aaatccatta 3240 cagcagagca aacagctcga atgtttgatc aacgagttat tgcttatttc ggcaatccaa 3300 aagaaatcat tgcagataat gatcatattt ttacttccca 
aacgtggaaa gatttcgcac 3360 ataaatataa tttcgttatg aaattttcgt taccatacag accacaaact gatggacaaa 3420 ctgagcgtac aaaccaaact gtggagaaat tactaagatg tgtatgtagc acacatccaa 3480 atacatgggt agatcatata tccctagtgc aacaatctta caacaatgcg atacattcag 3540 caactcaaat gacacctttt gagatagtac atcgctattc accagcttta tcacctttag 3600 agttacctag ctttagtgac aaaactgacg aaaactctca ggaaacgatc caagtatttc 3660 aaacagttaa agaacacttg aatacaaaca acataaagat gaaaaagtat ttcgatatga 3720 aaatacaaga aattgaagaa tttcaacctg gagacctagt tatggtcaaa agaacgaaaa 3780 caggttttct tcataaatcc aataaattag cacctagttt tgcaggaccg ttctatgtgt 3840 tacagaagtc gggtccaaac aactatgaat tggatcttcc agattcaatc aagcacatgt 3900 tttcatctac ttttcatgtt tctcacctag aaaagtatcg acataattca gaactcaatt 3960 acgctaccat tgatgagtct gatattggaa caattcttca tatcctagaa cataaaaaca 4020 gagaacaagt actctactta aatgtcaagt acatttcgaa tctaaatccg agtactatta 4080 tgtcaggatg gactacatta gctacagcgc tacaagcgga caaagcaatt gtcaatgatt 4140 atattaaaaa caataatcta aatatctgag aacatatgac ttatcctcag atttacatag 4200 aaaatcttgg ggagggcaat 4220 // ID Gypsy-8_CCO-I repbase; DNA; FNG; 5876 BP. XX AC AACS02000003; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_CCO_; KW Gypsy-8_CCO-LTR; Gypsy-8_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-5876 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000003; Positions 846578 840703. XX CC Positions [3010-3516] - Reverse transcriptase CC Positions [4675-5169] - Integrase core CC 'GCCGC' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 127..1389 FT /product="Gypsy-8_CCO-I_1p" FT /translation="MSDQANTQQAAPDPHATLDWLVGQMEQSNARAEQLQQ FT NYNHLTAQLATITALLQNQAVPPATPPNEPPIPLPVPPAPVATATSSAPPK FT VSPPADFDGDRTKGRSFIAQCNIYLSVCGSQFRDDQARILWALSFMKGGRA FT TRFATRVMKLIGLGRTTTDHLGFDLNDWKSFVALLVKDFCTRFDTETARVK FT LENAAHYHQGKRSVEDYLDTFKDLVVDAGYTEGAVIVMKFRAGLDPAIESQ FT VANLARELRPKDDSPEDWYDRAVEAEQSRLATKVFRSSTVTVASRPSPAPP FT TTAPGCSFFPTPFSRAPTALPAAPPAPPRLPPPSIPSPGNPVPMDIDATRR FT TRPLPAGHCYRCGQPGHVKAQCPRAYDVRYMTTEEIDEAMQQRALAQDVSE FT SQARQEAVQAAVVEGSIEEEDFVSGDE" FT CDS 3046..5862 FT /product="Gypsy-8_CCO-I_2p" FT /translation="MASPVFFIKKKDGSLRLVQDYRVLNSMTIKNRYPLPL FT ISELVNRLRGARWFTKLDVRWGYNNVRIKEGDEWKAAFRTNRGLYEPLVMF FT FGLTNSPATFQTMMNDIFHDLIMEGVVCIYLDDILIFTKTLDEHRRITRLV FT LDRLRRHKLFLKPEKCEFEKQKIEYLGLIISEDHVEMDPVKIAGVAQWPVP FT KSRKEVQSFLGFVNFYRRFIANFSHHARPLFDLTKKDCAWSWGPAEQGAFE FT KLKAAVTSAPILVSPQDDLPFRVEADSSDFATGAVLSQQSKEDGKWHPVAF FT YSKSLSEVERNYNVHDKEMLAIMRALEEWRHFLEGARHPFEIWTDHKNLQY FT FTTAKKLNRRQARWSLSLARFDFRLHHRPGQSMGKPDALSRRADHGDGSGD FT NEGITLLKPEFFAIRALEGVQLVGEERDILRAIRRFNAQGLHEESVAKAAR FT ELQSSSAKSVRSSEWSVQDHLLCYCGKIYVPNDPELRRRIVAQHHDTKVAG FT HAGRWKTLELVSRNYWWPQMSRYIGQYCRTCDMCMRTKIQRRRPVGELHPL FT PIPESRWDTVSVDFIVELPDSHGHDAIMNVVDSVSKRAHFLPTNTTVTALG FT AARLYLQHVWKHHGLPRQVVSDRGTQFVAEFTKELYRLLGIKLASSTAYHP FT QTDGQTERVNMELEQYLRVFVNERQDDWDELLPLAEFQYNNHVHSSTQQTP FT FMLDTGRHPRMGFEPHQEPSRLETVNEFKDRMEKALEEAKAALAKAKDDMA FT RYYNRRRDPTPVFKPGDKVYLDASDITTTRPSKKLAHRQLGPFKVEAAVGS FT HAYRLKLPPSMQRLHPVFPVVKLTPAVPDPIPGRRARPPPPPVIVDDQEEY FT EVEEILDSRIHRNKLQFLVKWKGYGYEENSWEPEENVNSPVLVARFYRQNP FT GAPRRIRTLLFEGMRWRRRTDELWRPVPVHRGAAP" XX SQ Sequence 5876 BP; 1217 A; 1885 C; 1540 G; 1234 T; 0 other; gtacaaaagc ctctacaacc aagagaccgg agcagcactg tctgtatcgc tgacgacctt 60 caccactcat tccacgattc accttcgctg ctcccctctc tccctcttgc gactctctca 120 ccaatcatga gcgaccaagc caacacccag caagcagcac ccgaccccca tgcaacgctg 180 gactggctgg tgggccagat ggagcagagc aatgctcgag ctgaacaact tcagcagaac 240 tacaaccacc tcactgctca 
gcttgctact atcacagcct tgctgcagaa ccaggcggta 300 ccccccgcca cacctccaaa cgaaccccct attccccttc cggttcctcc cgctcccgtc 360 gctacggcaa cctcatccgc tccccccaag gtctcgcctc ctgccgactt cgacggtgac 420 cgcaccaaag gacgcagctt cattgcccag tgcaacatct acctctccgt ctgtgggtcg 480 cagttccgag acgaccaggc tcggatcctg tgggcattgt ccttcatgaa gggtggccgg 540 gccacccgct tcgccactcg tgtcatgaag ctcattggtc tgggccgcac caccactgac 600 cacctcgggt tcgatctgaa cgactggaag agttttgtcg ccttgctcgt caaggacttc 660 tgcacccgct tcgacaccga gaccgccagg gtcaagctgg agaacgccgc ccattatcac 720 cagggcaagc gttctgtcga ggactacctg gacaccttca aggaccttgt cgttgatgct 780 ggctacaccg agggtgccgt cattgtcatg aagtttcgtg ccggcctaga ccctgccatc 840 gagtcccagg tcgccaacct cgctcgcgag cttcgcccca aggacgacag ccccgaagac 900 tggtatgacc gtgccgttga agcagaacag agccgccttg cgaccaaggt cttccgttcc 960 agcacggtca cagttgcctc cagacctagc ccagccccac ctaccactgc tcctggctgt 1020 tccttcttcc ccactccttt ctctcgtgcg ccgacagctc tccctgccgc tcctcctgcc 1080 ccgcctcgtc tccctcctcc atccatcccc tctcctggca atccagtccc aatggacatt 1140 gatgccaccc gtcgcacccg ccctctgcct gctggccact gttaccgttg tggccagcca 1200 ggtcacgtca aggcgcaatg cccccgagcc tacgacgtgc ggtacatgac aaccgaggag 1260 attgacgagg ccatgcagca gcgcgccctg gctcaggacg tttccgagtc acaggcccgc 1320 caagaggctg tacaggctgc cgtcgttgaa gggtccattg aagaggagga ttttgtgtcg 1380 ggcgacgagt gatcagtgtg ccctcgtcgc tcgtaggtag tcgttacgcc tcgttgcctg 1440 tagaatcgtg ttccgatgaa tccacttccg tttctactcg caacccgcta ccagtcacac 1500 agcctgccct tcagacccct gcccctcgtc ccatgcgcat ccgtcttgct cgttgggagc 1560 gtcgtcttcc ccgtcgctac atcattgctg ctactccaag caccaactcc cttcgcctcc 1620 ccatctctat cgagacgctt gacacccatg agcgtcgctt gcttaaggcc ctgcttgact 1680 gtggtgctac tggcctcttc atcgaccggg actttgttcg tgccaatcgt ttgactgaga 1740 ggcacctaca agtcccgatt ccggtcttca acgtcgatgg cacccctaac gaggctggta 1800 gtatcacgtc cgtggtggag ctgatcctac gcttcaagga tcatgctgaa cgggcctact 1860 ttgctgtcac cgggttgggc aatcagcaga tcatcctcgg gtactcctgg cttagggagc 1920 acaacccgga ggttgattgg cagaccggtg aggtgaagat 
gagcaggtgc ccagccaagt 1980 gtgctacgtg tcgcgacgag ctcaaggccg aaaagcgcga agcccgtgcg gcagtacgag 2040 ccctggaagc atgctcccaa ggccccttcc cctcagtcac catggaggag gtggacccgg 2100 atgacgagga gcctcccgag ctgtcggagt tgtgccacat cgccctcgag gatgacgacg 2160 atgaccctct gatggaagaa ggtgatcggc tcctcatgac cacccttccc ccagagtccg 2220 agttcattgc tgccaccacg accacgtctc agcgccttgc ggaagcccac gctgccaact 2280 ccgaagccca gtcattccgc gatgccgtcc ccaaccacct ccacgacttc gaggatgtct 2340 tctcgaagta aggctggacg cgggtgcacc tgtacgggta cccgcaccct tgacacaggt 2400 acccatgccc ccaaaatcca tacctgcacc ctcacaggta tttccgaaca ggtacctgtt 2460 acctgcgggt cctacccatg ggtacctgtt gggcacaggt cctacaggta cccaaaggta 2520 aacctgtagg tacctgctaa aaatggatca gttagaagtg aattccctcc attatccatg 2580 gccatggacc ttaattaata agtgaattca ttgtctaata tgatatacag agattaggtt 2640 catttctgag cggttttgaa gcattattac agtggacatg gtcaccaaca ggctggcagg 2700 tacccaccca caggtacctg tgaggctaaa agctatacct gcacccttgc gggtattttg 2760 gccacgggtt acctggaccc acgggtcaca tacccatggg gtttcttggc gggtcgggtc 2820 acgggtacct gtgtacgacc tgcacccgcg tccagcctta tctcgaaggc ctcctttgat 2880 gtccttccgg agcgcaagcc ttgggaccat gctatcgagc ttgaaccagg ctccaagccc 2940 tcgagctgca aggtttaccc actggcgcta gacgagcaga aggagctgga tgctttccta 3000 caggagaact tggctactgg tcgcattcgc ccctccaagt cgcccatggc ttcgccggtc 3060 ttcttcatca agaagaagga tggttccctc cgactggtcc aggactaccg agtgctgaat 3120 agcatgacca tcaagaaccg ctaccccctc cccctcatct ctgaactcgt taatcgcctc 3180 cgtggtgccc gctggttcac caagctggat gttcgttggg gctacaacaa tgtccgcatc 3240 aaggagggtg atgagtggaa ggcggcgttc cgtaccaatc gcggtctcta tgaaccactg 3300 gtcatgttct ttgggctcac aaatagtccc gccacattcc agacgatgat gaacgacatc 3360 ttccatgacc ttattatgga aggtgttgtc tgcatctacc tagacgacat tctcatcttc 3420 accaagacac tcgatgaaca ccgacggatc actcgcctgg tcttggaccg gctgcgtcgg 3480 cacaagctct tcctcaagcc tgagaagtgc gagttcgaga agcagaagat tgagtaccta 3540 ggtctcatca tctcggagga ccatgtggag atggacccgg tcaagattgc aggtgtcgcc 3600 cagtggcctg ttcccaagtc tcgcaaggaa gtccagagct tccttgggtt 
tgtcaacttc 3660 taccgccgct tcatcgccaa cttctctcac cacgctcgcc ccttgttcga cctcaccaag 3720 aaggactgtg catggtcctg gggtcccgca gaacagggtg cctttgagaa gttgaaggcg 3780 gcggtcacct ccgcgcccat cctggtgtcc ccccaggacg acctcccctt ccgggtcgag 3840 gccgactcct cggactttgc cacgggagct gttctctccc aacagtccaa ggaggatggg 3900 aagtggcatc cggttgcgtt ctactccaag agcctcagcg aggtagagcg caactataat 3960 gtccatgaca aagagatgtt ggcaatcatg cgggcgctcg aggagtggcg acacttcctt 4020 gagggagcac gacacccctt cgagatctgg accgaccata agaacctgca gtacttcacc 4080 accgccaaga agctcaaccg ccggcaggcc cgctggtctc tctcccttgc ccgttttgac 4140 ttccgcttgc atcatcgccc aggtcagtcg atgggcaagc ccgatgcctt gtcacgacga 4200 gcagaccatg gtgatggctc tggggacaac gaggggatta ctctcctgaa accagagttc 4260 tttgccatcc gggccttgga aggagtccag ttggtgggtg aagagcgcga catcctgcgt 4320 gccatccgtc ggttcaacgc tcaaggtctg cacgaagagt cagtggctaa ggctgctcgc 4380 gagcttcaga gttcgtctgc caagtcggtc cggtcttcgg aatggtctgt ccaggaccac 4440 ctcctgtgtt actgtggcaa gatctacgtc cccaacgacc ctgagctccg aagacgtatt 4500 gtggcgcagc atcatgacac caaggtggca ggccacgctg gtcgttggaa gacgttggaa 4560 cttgtgtctc ggaactactg gtggccccag atgtcccgct acattggtca gtactgccgg 4620 acgtgcgaca tgtgcatgcg aaccaagatc cagagacgcc gcccagtagg cgaactccac 4680 cctctcccca tcccggagtc tcgttgggac accgtcagtg tggactttat cgtcgagctt 4740 cccgactccc atggtcatga cgccatcatg aatgtggtgg actcggtcag caagcgtgcc 4800 cacttcctcc ccaccaacac cacggtcacc gcgcttggtg ctgctcggct ctaccttcag 4860 catgtctgga agcaccatgg gctgccacgt caggtggtat ctgatcgggg aacacagttc 4920 gtggcggagt tcaccaagga gctgtaccgg ttactaggga tcaagcttgc ctcctcaacc 4980 gcttaccacc cccagaccga tggacagacg gagcgggtga acatggagtt ggagcagtat 5040 ctccgggtgt ttgtgaacga acgacaggac gattgggacg aactcctccc cttggcggag 5100 ttccagtaca acaaccatgt ccactcatcc acccagcaga caccattcat gctggatact 5160 ggtcgtcacc cgaggatggg cttcgaaccg caccaggaac cctcacgact ggagacggtt 5220 aacgagttca aggaccggat ggagaaggca ctggaggagg ccaaggcagc cctcgccaag 5280 gcaaaggacg acatggctcg gtactacaac cgccgaaggg accccactcc ggtcttcaag 
5340 cctggggaca aggtgtactt ggatgcgagt gatattacca ccacacgccc ctcaaagaaa 5400 ctggcgcacc gccaacttgg gcctttcaag gtggaggccg cagttggatc acacgcctat 5460 cgcctgaagc ttcccccctc gatgcaacgt ctccaccctg ttttccctgt cgtcaaactc 5520 actcctgcag tcccagaccc catccctggt cgaagagctc gcccaccccc gcctccagtc 5580 atcgtggatg accaggagga gtacgaggtg gaagagatct tggacagccg tattcaccgg 5640 aacaagctcc agttcctggt gaagtggaaa gggtatggct acgaggagaa cagctgggaa 5700 ccagaggaga acgtcaactc ccccgttctg gtagctcggt tctaccgcca gaatccgggt 5760 gcccctcggc gtatccgtac ccttctgttc gagggtatgc gttggagaag gaggacagac 5820 gagttgtggc ggccagtccc ggtgcatcgg ggcgctgcac cttgaagggg gggtaa 5876 // ID Gypsy-3_PCR-LTR repbase; DNA; FNG; 1005 BP. XX AC AADS01000631; XX DT 30-JAN-2011 (Rel. 16.02, Created) DT 30-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Phanerochaete chrysosporium genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_PCR_; KW Gypsy-3_PCR-I; Gypsy-3_PCR-LTR. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-1005 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Phanerochaete chrysosporium RT genome."; RL Direct Submission to RU (30-JAN-2011). XX DR Genome; AADS01000631; Positions 3858 2854. 
XX SQ Sequence 1005 BP; 328 A; 265 C; 204 G; 208 T; 0 other; tgtcctgcgt caaacttgtt tgacacaaga acataaggaa aatgctgaaa gggtacagcc 60 tacccgcaaa tgcctactta gtatattcca gccaagacag aaagggaaga ctttctatgg 120 gattatacag gcaccaaggc actggaatca ctgcacaaac accccccctg atcaccccaa 180 atgatcgagc acgatcaact gcactagtct cgatcacaag gccaccgatc acgatcaagc 240 gatcgaatac tcaagcctca atcgcacagc cccctgaaac gcgatcagag aatcaagggt 300 gcacctcaga agctgaaaag cgactcatcg ccaagcagta tgagtcagaa aagaaggaaa 360 ttagggtatg ggatgactaa aggctcaatc agcacgacat atcattagct gattgattat 420 ggaactactt ttctctgcga caaattggaa gagcatggga gctcagccaa tcaggttgca 480 tgatcatgta ccccataagg gaatttcaaa tatacctaca gcagtatata tacagcagta 540 ggtagcccac ttaataccaa tccaaatttc cccaaagcag agctctcaaa gtgtcagagc 600 tgggattggc ttcctgagca aacccgttgc tcagtattca gaagtacgta tcagaggcgc 660 agaaagccta ctgacaagag ttatttgagt acccagaagg accatacctg gagaagagtt 720 caaggacttc aactggaagt caaagccata ataactgcag agcaataagc atcgaggcta 780 ttacttgagt tatttccccc catcccaaca tcctgcatat tctcaagact ctgaggataa 840 gccttgtcac tcccacacac taccttacaa cccccaacga atgactggag accctctgat 900 ccaagtcgcg atccgagaat ccaggcggtt tagcaagaac gttgaccatg gccttaccac 960 ccaacctcaa cacttcctaa tcaccgcaaa agaagcgtcg ttaca 1005 // ID Gypsy-50_MLP-LTR repbase; DNA; FNG; 190 BP. XX AC AECX01001249; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-50_MLP_; KW Gypsy-50_MLP-I; Gypsy-50_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-190 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001249; Positions 71571 71760. 
XX SQ Sequence 190 BP; 47 A; 63 C; 28 G; 52 T; 0 other; tgttatgacc ctatcacggg tcactgaggt gtcacaactc gtgacatatg ccaccctcaa 60 tctcttacta gagcagcgct ccacatgtat atgcttttct ctcacgctac aataatcata 120 catcgtagtc atcacctttc cttctctctg tccccatcca cgtcaaggcc ccctgaaaca 180 ggccataaca 190 // ID Gypsy1-I_AO repbase; DNA; FNG; 6115 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 24-JAN-2006 (Rel. 11.01, Last updated, Version 1) XX DE An internal portion of the Gypsy1_AO LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Gypsy1_AO; Gypsy1-LTR_AO; Gypsy1-I_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-6115 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-6115 RA Kapitonov V.V. and Jurka J.; RT "Gypsy1_AO, a family of Gypsy LTR retrotransposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 1-1 (2006). XX DR [2] (Consensus) XX CC This is an internal portion of the Gypsy1_AO LTR retrotransposon. CC Its long terminal repeat is Gypsy1-LTR_AO. ORF coding for the CC Gypsy polyprotein is severely corrupted by many stop codons, CC which are likely to have been introduced by RIP.
XX SQ Sequence 6115 BP; 2279 A; 947 C; 940 G; 1949 T; 0 other; attctaattc ttcttttcaa ctacctaagg atatcgttgg tattattatc ccgatcatct 60 aacggagata tttctttatt tttaagaatc cctaacgaaa acatgccttc tcttacaaaa 120 atagttccag aaactattcc cgaattaaat acccaagaaa gcgaaaccct ccaaagcctg 180 agagatacta ttcaacgact ggagaatcga ctataaagta aagaattttt ataaaatcgc 240 cgacgatcca gttccgcttt gtcaatggat ggctgtactg ccaaactctt taagaatatt 300 cccgagttca ccctcgactt tagactgcaa caaagacaga aatggattct taatttagaa 360 tactccttca aggaagcgaa aagacgccta aaaagagata atcaggaaat cattgcagca 420 ctttcttata tatctccaat ttgccgacag caatggtacc gacacctcgt agaaaagaaa 480 agagattaac aaagaaatgc cgaggaatcg tggcaatact ttatagaata gaccctctca 540 ttgatcagga attcggctat cttatagtcc gatattataa gccagcttca aagagctcgt 600 caataaaaag accaagatcc gagagagttt tatatctacc tcgactcctt aaagcaatat 660 ttcccaaggc aattagaaaa agaaagagcg cttactttct ttataaaact cctcccaaaa 720 ttattaaaat atatacaaga acactatata aagctcctag atcaacgaaa caaaatggtg 780 acccttacta cacatcatta gaatcttcta tatagagaat aaaaacggaa acgtgacgct 840 tcctcaaagg aaagcaaaaa taaagacgat aaattaaata aagataagga actacaaaac 900 acaaagaaaa attcgaaaga aaactctaag aaatctaaca aaaaccctac cgataataaa 960 gagaatcttt tataatatta tatatataat agtgattctt attttactaa ttattgtcta 1020 aaaaagaaaa aagctatagt ctaatcggct aaagtagaag aaatggattt agagttgaaa 1080 aataatttag agtcggaata gtctctacta aactagtttc taatctctat atagaaactg 1140 ttacacgttt ctatatttct atttaatata aaaataatat taaaatgttc gctaatctag 1200 atagctatat taaaattaat attattagct attaatttat taaaaatttt tattttaaaa 1260 agatttaatt tatagttcta ataattaaaa tagttaacta tattactatc tcgacttatg 1320 gtatatagaa gatatctctt attattatta atttttaaaa gactatctag agctttacta 1380 gactatatat agtaattgat agagactctc gccttaagaa aagtctaatc tttttattaa 1440 taataactat aagtaattta agaatctact tattactata gaattataaa tagtgattta 1500 aatttttatt aattaaatta atattagttt ataaatttgc gaaattatgt tgaaatcttg 1560 tctatatctt tgccgtggta aaactattta aagaagtttg gttactagat aacttaagaa 1620 gttctgtttt ataatatatt agtaaaatcc 
tactagaatt aaaagaatat aaaaatattt 1680 tcttaaagta aaaatctaaa attatgttgt tccggaaaat aatagattat ataattaaat 1740 ttattaatga taaaatatcg ctgtataatt ctatttattc tcttttctag cgagagcttt 1800 aagtattaag ataatatatt aacgaaaatc tagagtatag ttggattaga ctcttaaaga 1860 gttctgtaga agcccctatt ttatttatct ttaaaaagaa tagtggtcta aaactatata 1920 tcaattatcg gggtcttaat aaaataacga taaagaatcg ctatttatta tttttaatat 1980 tagaaatttt agatcgtctg gcaggcgtaa aattctttat taaaattaat attcaagacg 2040 tatattatcg aatccatcta caagaagaag atgaatagaa gatagcgttt tatacaagat 2100 ataaccactt taagtattta atagtatttt ttggtctcac gaatatacta gctacttttt 2160 aaagctatct atatatagta ttatacaata ttctaaatat ctgttatata gtatatctag 2220 ataatatctt aatcttttta ctagatcgag agagctatat aaggtatatt aagtagatat 2280 tagatcgact gtaaaaagct gatttatata ggaaattgtc gaaatgtact ttctaccaaa 2340 aataggttga atttctagga tatattatat cgcaagatgg tatctctatg gattcttaac 2400 aggtgaatga cattatgttg tgggaagaat ctaaaagcta ctacaacgta taaagttttc 2460 tgggatttta taacttttat cgacgattta tctaaaatta ctctcgaatt accttgcccc 2520 ttattttctt aataaaagga tcgaaaaacg gataaaaatc tggactagtt aaatttactc 2580 ttacggaaaa attagtattc taatatttta ttactgcctt ttagtcagcg tctcttttat 2640 actatttcaa cccttaataa tctatctaaa tcgaaatcga tactttaaat aaagggataa 2700 taggaattat gtcccagcta gataaaaata gagtctatta tcttatagcg atctagttat 2760 aaaaatttag tggagtagaa ttaaattata gcacccccga tcaagaatta tacgctatcg 2820 tctatttgtt taagtattgg cgataatatc ttaaaggttt ggtatatact attaaagttc 2880 tcactaacta tttaaattta tagatcttta taaagtaaac taaactaaac gggcgttaag 2940 cgcgatgact tatatttcta attcttttta attttataat caaacattgg actgggaaat 3000 caaacccggc tgatgggctc tcttggaaat tagagaaaat cttaaaacgt actctagata 3060 tagagttaat gttaccgttt atatagcgat ttgctagtgt agaattctta taagttaagg 3120 atttgttgta aaagaaaacc tagtttaagc ctggcgagac gtttttatag aagagttcta 3180 agtttaatac ccgcatagca ataagcttta agaattataa gattaatata gaaatagctg 3240 actgggctat ctagaaagag ttaaattagg atcgattcgt ttctaggtct taagtacaat 3300 gtacttatat ctctaagaaa gtgtataccg ctgaggtcta 
agaagatcta agagatctta 3360 taaagcggat acagtctaaa aacctagaga tccaatagta aaaggccgct gtagagtaaa 3420 ggcttataaa aaataagggt tggagcgttg tctttgatag actagttaga tttaaggatt 3480 gactatatat ctttttgagg gagaatttat ggtagatttt aattaaccta tatcataata 3540 atcctcttgc aggctatttt aagaaaaatt atacagaaat tctgcttaaa cgcaagtttt 3600 attggacaaa tcttcaacga gatattgcag actacgtggc gggatgccca gtatattaag 3660 aagtagtagc tctaaagcat cgcttctata gaattttaga attgctatca attctatcta 3720 gatcttttgt agaactatca atagacttta taatagagtt actagaaacg atatttaaaa 3780 ataaaatcgt taatttaata taaatagtag ttaattaatt ttctaaatag tcactattct 3840 tccctgtttt aataacaatc aatgcggtaa aacttgttaa attattttat aattatatag 3900 aattacaatt tagactatca aatagtattg tatctaatag agaactaatc tttactagta 3960 aattctagtt aaatctttac tattttagct atataaaact taggttatca acagcgttct 4020 atctacaaat tgatagttaa acagaatata taaactaaat attagaatat tattttaaat 4080 attttaccaa taaagagtaa ataaattggc taaatctact cttaacagcg gaattcgtat 4140 gcaataatac ttaaaatata actactgggc tatcgctatt tcaagtccta ctcggctatt 4200 cgccaaattt ccaattacgt accgaggacg gtactttttc ggaggagatc ccggcagtac 4260 aacaccgcat tgaaaaatta acgaaaattc gtgaaaattt agcggaacac tggcaaaacg 4320 cgatagaatc ttagaagaaa cattttgata aacattatca agcaaaatta ttcaagcgag 4380 gtgatcttgt acggttatta atccggaata ttaaattaaa agtcccggct cggaaactag 4440 caccgaaata tattagacta ttttaggtac tagacgctat aggaaagcaa gcatatcggc 4500 ttagcttatt taaaaaatat aattaaattt ataatgtatt ctatatctcg ttactggaac 4560 tatggacgct acgttatcaa aacaatgccg atccattgcc aatacctaac cttgataatg 4620 ataataaata gaagatagaa gaagttaaag aaaagtaaat cttcaaggaa gaaacccaat 4680 atttagtaaa atggatagga tagccgttag aatataatta atagatttct aaagaagata 4740 tagcgaacgt aaaaaaaaaa tctataattt taaataaaaa agttaataat atacttaaag 4800 gttaacgcga cgttgacgtg gctatagtaa atacttaata agtaaataat aattatagaa 4860 agacgcgtcg attctacacg cgctagaaaa ctttattatt aattaaaata taaaaagtcc 4920 accgatctat tctagaatta ctatcgaagt agaaagataa gggcgccgaa atattaatta 4980 attattatat taaatcgctg taaaaataat agggaaaata ttgaatatag 
cgtcctaata 5040 ttcccgctaa tccccctcta catccttcta attaaaatat tcacgaacta ctcgctttac 5100 atgacgggta aagttctaaa tagctatcta ccaagatata taaccagcgt cgaattcttt 5160 gagccggagt tgacaatctt ccatgtactg tctacgagcc ctcacatcat ctcgaggagg 5220 cagtggaccc aactgtcgtt tgagggcatc ggtacgacgg cttatactct cacgatagtt 5280 ggcaataatt cgctatagaa ttgttaaaga agaagtcctg aattggctcg ttaatacaca 5340 ccccaagctt ttcagccttt ttagtagtta aacgaaattc cttttaatga gccaactcca 5400 ccccatcgaa agcgtggtat aacgcgatac aagcttgagc gaccaattcc cgtatctcta 5460 agccaaacat aacttgttct ttattcttat tccagaaaat ggtctccgcc catttcagta 5520 tacgatgcaa gtcatgtgca tcgcctagta tatcttaaga agtcttcacg agttagtaac 5580 gagtagtacg ccaggcaaag aagtatatct agttacaatt agtatttcgg ccactgtatt 5640 gtctgcacaa tactgacgct gcactattga atttacaact aattgtaaac aattcgtctt 5700 ctccgcgctc cctgtccttg acagcagttc ttaaacaatg taggcactac aaaatagagg 5760 gaacgactgg ggcaccgacg accctctcac ggcgaggtat aatgatatca ataaaaaaac 5820 gtaaagaagt aactgggttc gaaatacagg aaaagaaaag acttacgaca cgcagaagag 5880 atgttgtaga ataaaaaaag ataagatgca gaaatagtgg agagaagctc atctaatatc 5940 tcgtcaatat taacatcgtt gtcgtagaca actattattt actcaacaac gattcacgtg 6000 gggtcgatgt tcctatacag aacacccaac gcgctcctaa ggcgtcgacc acttaacgcg 6060 aacaacgcgt cgatcgctgt ctggggatag tcagcctagg aaaagggagc gtaat 6115 // ID Copia-8_MLP-LTR repbase; DNA; FNG; 210 BP. XX AC AECX01000970; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-8_MLP_; KW Copia-8_MLP-I; Copia-8_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-210 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). 
XX DR Genome; AECX01000970; Positions 124390 124181. XX SQ Sequence 210 BP; 58 A; 32 C; 27 G; 93 T; 0 other; tgttggagtg tgtgtgtttc aaaagtcttg tttacataaa tgtattatgt ataaatgtat 60 ttctcttcat cattaaatta catcatactt ctcaatgtgg aatgttacgt cacttaatca 120 atgatgtcat ttaattaaga aattcttctt gttttcatat ataacctgag tctcaaatca 180 cagtctgatg tttctctttt tcttttctca 210 // ID Gypsy-19_RO-LTR repbase; DNA; FNG; 348 BP. XX AC AACW02000006; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_RO_; KW Gypsy-19_RO-I; Gypsy-19_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-348 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000006; Positions 241704 241357. XX SQ Sequence 348 BP; 121 A; 60 C; 33 G; 134 T; 0 other; tgtcgtattt accccattaa atacttatta ttcgtatgag attactcaaa tcttatataa 60 aaaataacaa gtactgaata cataatatta tatattacat attctctaag actcatttac 120 aatattaacc atatacaata tttatattat tcaagttcaa tactaaaatt gaccttgagc 180 acacttcggt ccctgaataa ttttatttcc tatataaagg cgtagtttga ttaaatgggt 240 tctctctctc gtcttgaaat ctaataaaga ggttacatag agactcattc actagttttc 300 aagtactatc tccttttaat tctcaaattt ttatctgcaa atatctca 348 // ID Copia-1_GDe-LTR repbase; DNA; FNG; 354 BP. XX AC AEFC01001314; XX DT 12-MAR-2011 (Rel. 16.03, Created) DT 12-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Geomyces destructans genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_GDe_; KW Copia-1_GDe-I; Copia-1_GDe-LTR. 
XX OS Geomyces destructans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Leotiomycetes incertae sedis; Myxotrichaceae; OC mitosporic Myxotrichaceae; Geomyces. XX RN [1] RP 1-354 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Geomyces destructans genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AEFC01001314; Positions 4962 5315. XX SQ Sequence 354 BP; 80 A; 82 C; 83 G; 109 T; 0 other; tgttggggga aacggacaat cccatgggaa tacggtagtc ttttgcgccc aaaatggcac 60 ttttctccat gggactgtct agttccccag tcgttcagag tactgagctg aggtttcagc 120 tctgtgtatc actttggttt gggtgtagga acccaaaccc tcagggtttt gggggtttcc 180 cccaggccta caaatagcgt acgtagaccc gtagtttagt gagtaattca attgattata 240 ttattctatt gttgaaacct ctgttctgct gcccgtaaga gattcattac agtgaccgga 300 gattaccaca cgattctgtg tgtgtttctc cgtaccccgc aattacttcc aaca 354 // ID Copia-2_TMe-LTR repbase; DNA; FNG; 424 BP. XX AC CABJ01001613; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_TMe_; KW Copia-2_TMe-I; Copia-2_TMe-LTR. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-424 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01001613; Positions 445265 445688. 
XX SQ Sequence 424 BP; 95 A; 100 C; 62 G; 167 T; 0 other; tgttggaata ttgtccctga gtttcgattt catacctact attggtttcc tatatgatct 60 tcacgaactt tctgttatac cttacttgat gtctgaaact gttatcaccc tgttgtctcc 120 gtactgtttc ctctctatct ccttgcagtg aaaatttctc tcttgtttct tttctgtttc 180 cttgttgttt caaagggaca agactttagt ccccttaccg gagatgaaaa ttttcatcca 240 cacaggcacc ggacaagatg tagatcatct tgggttaaaa gggccttgtt atctacttag 300 aaatgatatg cgaattctga cctttcaatt gtgcacctaa gaaactctcc tgctggttct 360 tatcctttat tcctccgtct atacttttct ctcctcacat tccaaacaat tttattttac 420 ttca 424 // ID I-1_AO repbase; DNA; FNG; 4325 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE A family of I non-LTR retrotransposons - a consensus sequence. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-1_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-4325 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-4325 RA Kapitonov V.V. and Jurka J.; RT "I-1_AO, a family of I non-LTR retrotransposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 9-9 (2006). XX DR [2] (Consensus) XX CC This is a family of non-LTR retrotransposons that belong to the I CC clade. I-1_AO is 5' truncated, ORF2 that is composed of the CC endonuclease, reverse transcriptase and RNase H domains is CC relatively well preserved (just a few stop codons). The 3' CC terminus is composed of the TTGA microsatellite. Usually, I-1_AO CC elements are severely truncated at their 5' ends, and they are CC flanked by 10-15-bp target site duplications. 
XX SQ Sequence 4325 BP; 1059 A; 1300 C; 1225 G; 741 T; 0 other; tgcccatcag cccccggagg agcagaagtt tggcttcaag gagtttactc cgatcggggc 60 gaacgagcgc ccggtcacag cgacaccacc ggcgcaccgg cggcacacat gcgaccatcc 120 aattgccccc actggaaccc ctcaggatcc ggtccgcacg caaggttgtt cccgtcgtga 180 ccaccccaaa gggcggagac aggcctgcgg catacggccc tcgggtgggg ggaccagacc 240 ctgcggacac gccgacgaac ggcggctacc gcagggcggc tcccccctgg aatctgcttt 300 tcgatgatgg ccagcggcca acgatccaca gaccccaacg gtgccggcga ccaaacggcc 360 accgggcttc ggtcacccca gaccgcacca ccgagacgtt tcacggcctc ggcgggtata 420 agcgcggctg gggtgggcgg gcgtccccgt ttcctcacgc gaggcccgac gagcgggtac 480 ctttggcgac gaggcacagc gagacgacca cggccggtga ggccacgacg acaaacgacg 540 acaccgacga cactagccag gaccgacaga tccggacaga ccaaaaggcc ccccagctgc 600 tcgaattcgc ggactccagg ctcctggacc tgtggctcga gctggggaca ataacccaag 660 accggaacaa ccaccggtcc acgattgacc ttgtcttcgg ggctcagagc cttgcagatc 720 aacacatcgc ctgcgaggtg gcccccaagg tccacgcgga ctctaaccac ctactaatcc 780 gtatgattct ttacctcgcg cctcacgcat accaaccacc gaaacgacgc cagtagaaga 840 ccatagaggc agccaagctg cgtaaattta tggccagcaa cctcaatata tattctcact 900 gggacgtgct aaaaatgaat ctctcggcgg cctcgattga cgcggcggtc gacttcctta 960 tggaggtcgt gcagcgcgcg atccagcacg cggtcccgtg ggcccgtccc agcgagtggg 1020 ccaagccaga cttcacaccg gaatgcaaac gtgcggtcaa gataactaga aaactccgca 1080 ggatctatat gcgtcaccgt ctaccctcag actggaccgc ctacgtcaag gcccgcaacc 1140 gaaagggccg gatcatcaac cggtccctac ggaggggctt ccgccgctgg gtctcggagg 1200 cgatcgatca aggcacgcac gggatctggc gcgtggctaa gtgggcccgt aaccgtgggg 1260 gccgggccgc caatatgatc ccaaccttaa atggacctca cgggccggcc gacaccaccg 1320 aggccaaggc cgaggtcctg cgagaatcgt tctttcccga gccaccgccg gccgacctat 1380 ccgatatcgc gagacgcacc cagccgccgc agatcgagtt cccagaggtc acgaaagagg 1440 aggtcgccaa ggccatccgg cgggcaccgc cagacaaggc ccctgggccg gacgcggtcc 1500 cgaacaagat ctggcacgag ctgtgcaagg tcccggtctt tccagagcgg gctacggctc 1560 tgttcaacgc gagtaacaaa acagggcaca atccgagaca cttccagacc tctaccacgg 1620 tggccctacg caagggcgga ccgcgtgact 
accgcaagcc aaaatcatac cgaccagttg 1680 ccctgctcaa tacttttggt aagattctcg agtcgatcat agcaacacgc attgcctggg 1740 ccttggagga gcacaaatta ctcccacaga cccatctcgg aggcagaaag gggatctcga 1800 cggaccacgt catccagctc atcctcgata atatatatcg tgcatgggga cagggtaaaa 1860 aggtaagtat gatcctgctt gacgtctctg gtgcctttga caacgtgtcc cacgcccggc 1920 tgctcttcaa cctccgccag ctgaagctgg gccacttcgc ggactggctg cagtcctttc 1980 taaccggcag aacgacccgg atctcgctcg caggggagct cagcgcggag ttcccgaccc 2040 cgacgggcat cccgcagggc tcgccgctgt ccccgattct atacctgatc tacaacaccc 2100 cgctgatcca ggatctccat gtccggcggc cccagggcgg ctcaaccacg gccttcggct 2160 ggatcgacga cgcgtgtacc ctggccgtgt ccgacacctt cgcagaaaac gtcgaaacgc 2220 tgaatgcggc tctgtcccgg gccggctgat gggctagtcg acatgcatct aaatttgcac 2280 cggacaagtt cgagctcatc cattttacta acccacggga gacggaaacg ccgccccagt 2340 ccccgggcct cccaccggac catccggacc agatatggga ggtgccactg ccaccagcgg 2400 gacacgacca gatggagatc atctttacgg acacgataat caagccaaca gaaacggcca 2460 agtacttggg ggtatggttg gacaagaccc tgtcgttctc catccatcga accaaggccc 2520 tggccaaggc gcacgggacg ctggcggccc tcaaggggat cgcaggatcc acgtgggggg 2580 cccctctgcg tgccatgcgc cggatctatc aggcggtgat cgttccccag ctattctacg 2640 cggccgcggc ctggtacagt cctaagggtg gccagatcgt ggcctcgatc aaccagaaga 2700 tgctggcgga gttcactcag attcagaagc aggcagcgct gctgatcagc ggggccttta 2760 gaggcacctc cgcagcggct cttaacgtag aattatatat attaccggta cacctccagc 2820 tgcagcagat cattgaagaa acggcggtca gaatccggac gggaccagag ctggcctgcc 2880 cagagtcggt ccttagaccg cgcacggtac aagaacgccg gcgcagcggc tggacaccga 2940 tggaggcact gagtcggaaa gggggccccc tatggcccct gggcaagaag gagtgggaaa 3000 cccgcaagcc atatattctg gcaccatggg aaccgccggt cacaaccgtg attgacagtc 3060 acgaagccgc actaatctac cacagacatt actgcgccag gcgagaaggg atcgcagtgt 3120 acacagacgg gagcggtcta aacggccggg ttggagctag tacagtctgc ctctcgcagg 3180 gctggaagag gaactgtacg ctgggaacag aagaggagtc aacggtctac gcaggggagc 3240 tgaccgggat ccggatggcc ctgcacaggc tacggaggga aaccagaccg gccacggtct 3300 ttgtggacag ccaggccgcg atccaggcaa ttcagaaccc 
gcggagaccg tcgggccaat 3360 atatattaga ccagatctac tatattatac ggagatataa catgcagaac cgggtccaga 3420 tccactggat ccctgcacac atcggagtgc cagggaacga ggcagccgat gaagccgcac 3480 gagaaggagc tacacgagaa ggcacacagc aaactggcga agctatttgc ctagctgcag 3540 cggccaaacg acagatacgc cgctctatta aagatagatg gatacgggag tggaagaccg 3600 agaaaacagg ccccacaaca taccgattgg tggaggttcc aaacaagaag atactggatc 3660 tctataagaa tctatcaaaa tcgtacgcat cgattattat tcagatgcgt acccagagaa 3720 acggactacg gcactttcta cataaaatta aagccgtgga ctcagaccaa tacctctatg 3780 cgttgggatc ccagacagcg cggcatattt tactacaatg tccattatac gccgaactca 3840 ggggtagaat gattggcaag ctggacccgg gggtccagaa aagattagat tacaacggga 3900 tcatgtccca tccgcaggcg atccgctacg tcgccgaatt catgcaccaa acagaattac 3960 tcagtcaatt tagagacgtg gagcaaactg gtcactacta aagacggaca gatatagaag 4020 acgatgagta catagtcaca ccaggagggg gccggtcagc cccttctata cacatatcag 4080 tcttcacata ccaggagcac tagaaggaac taggaacgat ggatggagga ggtagaaaag 4140 gcagctgcca ggggagtggc aggcagtttt agttgtgttt aaaagctttt tgattggact 4200 atcttccgtg ggatatacgg cgtttacggt tagattctag gctctcaagc acctgcatag 4260 ctagacgttc agtagttagg ctgcatcagg ttaatagaat cttgattgat tgattgattg 4320 attga 4325 // ID Gypsy-108_MLP-I repbase; DNA; FNG; 5175 BP. XX AC AECX01000606; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-108_MLP_; KW Gypsy-108_MLP-LTR; Gypsy-108_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5175 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000606; Positions 94883 100057. XX CC 'CTTGT' target site duplication CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 2308..4347 FT /product="Gypsy-108_MLP-I_1p" FT /translation="MERVASPVQALRGSQSSWDRSPRQEIDLLQLGLQEQE FT GMEYTDQHVSLPQSLDRSRDRQVTSSGYPKLASTLAQPKQLSRVHSTREGS FT SSKSIQQAHSIASLPTQRASSISHIGLGRGSASSNRYGEVLSKERRTKQPQ FT SSSLNSNEARLVSLFTKVLEDDLVGPWTDVLSKHLDQLSGETQKDAEVIRY FT YQDCLATINKNCKELYKRVCPEIKDLRKIMIDLLEHLEMQQSSSVQENESR FT HSELLDLLRLQNVSLDNQSIRELGEVLSNEIERKIDMRFGQIFATKKGIKI FT PDNNNIDNIIERKLEEKLDSQFKKFFDQISEKMESGISNTDLVEQNIQSQK FT ILHSRFHILEQYLKEMNEDGKIQREKILTEVLIIKNSVKSISKGERLIVNS FT PVNMDSGRTSLPKRDIEGNSEQAGTAQEYNVQQMLSGIKEQIHALGQSNHK FT QTLPATQDSESAEYQANWKKDLRKEYPSPKDWPTFDGEGEYNHQEFIDWVD FT RMSSKLRMPDELIMAKLGIVFQGVARQWFIDFAKEDDLQTWAEWKVAIQER FT FGNQAWKNRMQKMYLSDRFVLRTNIDCVKWASKQKQRVEAFQPNITQEELV FT EKIIWQLPGTIRIWVSSKMREPARWTDFLLVFEDICRNMLDSRNNQQNNTG FT RMFAPRQDEPTAASGSKDRPVTNNLEQRPEQ" XX SQ Sequence 5175 BP; 1681 A; 957 C; 1052 G; 1485 T; 0 other; acttgggggc ctcatcagtg ttttgttcct gaatcatact aggagtaaga cccttttcat 60 ctataacact tttattttat ttgcatttgt tgaaagacaa tcaagactga tgtcaggagt 120 gctgtatcaa gtgtgaaagg tggaataggt gtcatggttc cttctcttat atttatcttg 180 tctaagagtg atcatacgta taatgtataa tgctttgcta tttgccgttg tatttcccaa 240 ccaacattag cgctgatctt tcccaacgct ttgtgtagaa gtgaccgaga tatatatgtt 300 gttttgtctg cttgttctaa tctctctttc ctcatatatt ggaaagagag taagtttttc 360 tactaccttt tctcttcctc cttgtattag cactaatgtt actacatcaa agattcctat 420 agatcactcg agccgtatct tataagctca tcacctttat tctcatctta tatttcctcc 480 gtaaaactta ggtcagttta cgaggtctgc atacctgttg ttggagaaga gtgctgaacg 540 tcaagcacta tcttattagg tcagtttacc aggtctgcat acctgttgtt ggagaagagt 600 gctgaacgtc gaggactatc ttattaggta cccgtgtctc tagccattta cagcatttgt 660 ttttgtttct gcgtataaga gttaaataaa aaatctacat ctatttcatc ctctttcaca 720 cattagtagt attgcaattt actactcaaa agccattttt actttttgtt catttaagat 780 cacacttcca cctgcctagt caactgattg agatcacatt tctttttcag tacaattcac 840 atttttgttc taataaaaat agtacaattc aaaagtacaa ttcaaatttc cctcaatttc 900 aaattcaatt tagaacagtc acctcattgt cttattttat aatttgaaat 
aattacacag 960 cacattttta ttcataccag aattgaattt tggaatttca aattgtattg aactacattg 1020 aaatttctaa ttgtattgaa ttacattgaa atttgaattt gtattgaact acattgaaat 1080 ttctaattgt attgaactag attgaaattt gaatttgtat tgaactacat tgaaatttcc 1140 atttgtattt aactacattg aaatttcaaa ttgtattgaa ttactagcac ttacacagtg 1200 agaacagaac atcaccagct tgtgatattc aattttaatg agttgagtga tgcttgaagt 1260 caaaattaca tagatatata tacaattaca tataaacgtc aacatactac aatagtaata 1320 gtatagaccc ttgaattttg ttgtgtagag ttactgtgtg gagaatagca agcgtgagcg 1380 cacgtgatac atacttgtaa caattggtgt gtgaccgcac acgttattgt caagtagtac 1440 agacatcaac tccgagtacc gttatagcac gtgattacac gtgatattag catacacaag 1500 tgttacacag ctgacagggt gaactgtcaa gtctgtatca accaccccat caaattcacc 1560 aaagtaatac tctttgagtt tatccgactc gataccaagc acaaacttcc caacttaccc 1620 ttatgtgaca ttcttgtcaa tatttgtcat caagaaacca acaacatcca tctgcttgaa 1680 ccatcaaaga agaagttaat aacagcaaca cagaatactc gtagtaagtt aaggaatcaa 1740 cttactattt attctattgt tctagaagtt ttgaatctca ttattggaca ctgatttatc 1800 tgtgttactt tatttcaaag tgacggttac gtggatttta gggacttttt tgaaaggacc 1860 ctatcagatt gatcaaagat ataaaaacca actgtttttg tccgttaaac ttttcctcca 1920 cctcaaagcg ccatctacgc ctactcaaac ttctcaagta agtcatctta accactttct 1980 tggtattcaa gtcgttctaa gtattttatt ttattttatt ttatatttta ttgtcatgtc 2040 ttcacgacct tcaacgtgta gtgaatcacg ccaaagagca gcaacagcca ccccgcctcc 2100 gatggtgatc gagcacgacc aaccaattgg tccagcgaac ccattggaac caggggttgg 2160 ggcagagcac cagccagagg tagggcttat tgccctttaa cctccactcc tcttcgctca 2220 cttgctgata caaccattgt gggtcctata ccagcacctg cacgtgctag tgcaacctca 2280 agcggtccac ttggaaatag ttttccaatg gaacgcgttg caagcccagt acaagcgctg 2340 agagggagcc agagctcttg ggatcgatca cccagacagg agatcgatct gctccagctc 2400 ggtcttcaag agcaggaagg tatggaatat actgaccaac acgtttcatt accacaaagc 2460 cttgatcgtt caagagatag gcaagtgact tcaagcgggt acccaaagtt ggcgtccaca 2520 ctggctcaac ccaaacaact ttcgcgagtt cactcaacaa gagaaggaag ctcaagcaag 2580 tcaattcaac aagcgcactc aattgcatca cttccaacgc aacgtgcatc gtcaatctca 
2640 catattggac tcggacgcgg tagtgcaagc tcaaatcgct atggggaagt tcttagcaaa 2700 gagagacgga ccaagcaacc acaatcctca tcattgaact ccaatgaagc taggttagta 2760 agtttattta ctaaagttct tgaggacgat ttagtgggcc catggactga tgtattgagt 2820 aagcatctag atcaactatc cggtgagaca caaaaggatg ctgaggtaat aaggtactat 2880 caggattgtc tagctactat caataagaat tgtaaagagt tgtataagcg tgtatgccct 2940 gagataaaag atctcaggaa aatcatgatt gatcttttag aacacttaga aatgcaacaa 3000 agttcttctg tccaagaaaa tgagagtaga cacagtgagc ttttagattt gttgagactt 3060 cagaatgtga gcctagacaa ccagtctata agagaacttg gtgaagtatt gagtaatgaa 3120 attgaaagaa aaattgatat gagatttggt cagatttttg cgacaaagaa aggaatcaag 3180 attcctgata ataataacat tgataatatc atagaaagaa aacttgaaga gaagctggat 3240 tctcagttta agaagttttt tgaccagatt agtgagaaga tggaatcggg catcagtaac 3300 actgatttgg ttgagcaaaa tatccaatca cagaagattt tacactcaag atttcacatt 3360 ctggaacaat atctgaagga aatgaatgaa gacgggaaga ttcagcggga aaaaattctc 3420 acagaagtgt tgatcattaa aaacagtgtc aagtcaattt caaaaggaga gagattgatt 3480 gtcaacagcc cggtaaacat ggactcaggc aggacaagcc tcccgaaaag agacatagaa 3540 ggcaatagtg aacaagcagg gactgctcaa gaatataatg tgcaacaaat gcttagtggt 3600 atcaaagaac agattcacgc gcttgggcag agtaatcata aacaaacttt gccggctaca 3660 caagacagtg aatcagcaga gtatcaagcc aactggaaga aagacctgcg gaaagaatac 3720 ccgtcaccta aagattggcc aacatttgat ggtgaagggg aatacaatca ccaagagttt 3780 attgactggg tagatagaat gtccagtaag ctgcgcatgc ctgatgagct catcatggct 3840 aaattaggga ttgtctttca aggcgtggcc aggcaatggt tcatagactt tgcaaaggaa 3900 gatgatttgc agacatgggc agaatggaag gtagccattc aagaaaggtt tgggaaccaa 3960 gcttggaaga accgtatgca gaaaatgtat ttgtctgata ggtttgtact taggacaaat 4020 attgattgtg tcaaatgggc aagtaagcag aaacaaagag ttgaagcttt tcaacctaat 4080 attacgcaag aagaacttgt tgagaaaatc atatggcaac taccaggcac catacgcata 4140 tgggtcagta gtaaaatgcg ggaaccagca agatggacag atttcttact tgtatttgaa 4200 gacatatgta gaaacatgtt ggatagtagg aataatcaac aaaacaacac agggcggatg 4260 tttgcgccaa gacaagatga gccaactgct gcttcaggta gtaaggacag acctgtaaca 4320 
aacaatttag aacaacggcc ggaacaataa agtgtgtcat ggttgtggaa gtacagatcc 4380 aacacatgta tggaaaacct gtaaaggtaa aagtcaggct gtgaaccagg ttgagatgga 4440 agaaacaatt gaccaagaca tagagccagt cagtatggaa ttcattgaat atgcagaatc 4500 caaagagggg gatatatttg gtgcaaaaga agaagtgctg gttgacgtca ttgagagcaa 4560 attttgcaaa gatgtggaca tagagcaact ccaggcatat gctgatgcac aaaatctagg 4620 gcatactgaa gatgctgtac agcatattac caacgcaaaa ctgctgaaat caaggccatc 4680 tacaggaaaa gcacatacac taggatttca tagtcttacc catgtaattg ttgaaggaac 4740 tgatgcagag atgctacttg acagtggagc atcttgctca gtggtgggga gttcatattt 4800 gacaggtatt gtccccgact ggagacttca attgatgcct tgtgatgata atatgaagtt 4860 ttcaggatgt ggaggaagtt tgttcccgtt gggtgttatc aacttgaaaa gtgtatttcc 4920 ccataaacaa ggaggtgtga ggatacacgt ggaatttgtg gtaatggata attctcatac 4980 aaagtacttt atattagggg ataattacct aggtgcatat ggtatagaca tttttcacag 5040 ccaagaaaaa tactttacaa taggtaatga tgtgaagaaa aagaaattta cattaccact 5100 tcagagacca atactatcta gccttagaga gagaatctac agcagtagaa gggaaccttc 5160 tctcgtggtg gggag 5175 // ID Gypsy-90_MLP-I repbase; DNA; FNG; 16542 BP. XX AC AECX01000233; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-90_MLP_; KW Gypsy-90_MLP-LTR; Gypsy-90_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-16542 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000233; Positions 24836 8295. XX CC Positions [12172-12675] - Integrase core CC 'CCGT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 6701..9613 FT /product="Gypsy-90_MLP-I_3p" FT /translation="MKASRDEKVRRRELRSEGVYHSDNSEDDGDAQQVQDD FT LLGKGVGNELKSDEEDQIDEEFKSRKMKEEIIENEIKKKLKSKKTRRVVYE FT EISSEDEDQKKPVKIQPVDKSLSFKSGGDMERFLRDFEDAALIDNASDRDK FT CLQVKFFIKDDEMKTIIEAMAGYRTQRWRLLKNEMNELWGAGTQPLYTDED FT LYKLCDDLAKAGGISTNDAYVKFENKFKTMLIYLKTSGQIDSSDSRSMARY FT YFKAFSSHIQHQIKILMDSEGQLKKFGHHYRMVPLSMLENYVSRVMRMHMA FT FGDLTKNEEERSSFKKENVTSVMESFRAQRQASLGISGGSGGTSNLLDQLK FT KKLEMVEKENENLKKGGATGGRTYGSSGNGNSTRGNYGGNRENNYSRNNYG FT NRNENDEFKRNEGGNRSVYMCWYCDREGHSIHDCRYVKQDVDDNLVKFDGK FT SFFLPNGDRIPWDQKPYRKLVIENSTKVTENKVEVEKDEDKGKEEVKSSCG FT TLSQWNVPSISSQRRVIFESDAARRKVDPLIKKRTSPRFANLDQDNQDQIN FT QSTSRNDEGRKEEIIITDEQAEEFRRENSERFSTPGWDGVVENSRDVPASS FT EPQITPKKILKRGTRIEDLSTDSDLEILPGSYPNTPDQRHKVLRDISIEDG FT RTSRKPTYISPLKNVKKEEPRSKSPGIVSSKKNGTLEGLIDKCKEMKAPQL FT TMEELINLVPDLVDGLKVKGGNVGNGFSKKVNLSSSGSIRRSNEIIKEEDI FT GNMDQMDLENLDFGKLKSIWGRQKDWLYSCPLGFLNVELGDHEITTRCLID FT SGSQINVMNVDFAYSMGLESHVKIKMSLRGIVNNEAELVGIVENVLLLLGN FT HVQGKVHFFLTTGDAPVILGRPFLVDFEANCQFSEKYGERISMVDDRGMVA FT RFSTCDGKASEFQRSIPGMDWPPQVYEHEDARGNQSSRRHRYANGGKKVWK FT LVREDADEGQRHLKDL" FT CDS join(9979..11604,11608..13203) FT /product="Gypsy-90_MLP-I_1p" FT /translation="MKEQELVCKDFKEGGFIVGATKYKPVAKKIRPVNEPM FT PQYLNPPLQRPSLSRDPYETPVLKTPPEFIETEKVTEERLKMVNFGPPGWL FT SSEEMKLILHVIVLREKSIAFNESERGVLRHEYGLPYIIPVVDHEPWQKKV FT IPIPKAKRHEYIELVRQRLRTGLYEQSTSSYSSPVFCVIKHDGKLRVVHDL FT QELNKVTIKDAGVPPAPEEFVEAFAGRSCYGLGDIMGGYDERELAVESRPL FT TTFETPLGRFQLTRLPQGATNSVAVYQAQMMWILQDEIPEHAGVFIDDGGI FT MGPRDDYDNEVLFENEGIRRFIYEYAVTLERILFRIEEAGLTVSGKKFAAC FT VPELEIVGHVVGFHGRTISVKKRNKIQTWPVPKDPGQIRGFLGVCVYVRMF FT IEGFSELSSPLRRLTRKGVDWDWDSLCQDVFEQLKEIVGREITLKSITYGE FT GAGKLKLAVDSSYIAAGAVLTQEDKIKKDRPVLYESVTFTEVESRYSQPKL FT ELCGVAKILKKLQVHLWGQHFELQVDAKALIQMINTPDLPNAPMTRVFFIQ FT LFSFDLVHKAGKTFTMPDGLSRRPQDSDLDSDAEEFNEEKKLIDVANNDYD FT FGIYCGDLIINEDLIDGDVRWEQVGFWKHLVNYLENLIRPEDISDEEFKSI FT KRKSDKFYLDNGRLMRRQNPMAQVVVTNVEGQDWILEQLHEGLGQRGVEET FT 
YRRIVLRFWWPSLKKSVREWVQSCEACQKRSSLTPKELGHATGEATLFGRI FT SMDAVHIKAGQHKYLVVARDDLSGWVEAVPLVNLKADKVAEFLEKEWIFRY FT GAIKMVTVDGGGEFKDELVKAVESCGAKLRIVTPYYPQAAGMVERGHKPIK FT DTLVKMCSTNPSAWRKTLPMVLFADRILTKRTTGISPYEMVFGQRAVLPVD FT VEAGTFLGVNWEEVHTRAELLEARTEQLLRREEMMDNAYSKMMRVREESVR FT YWDKKNAHKLRKSPLEVGDMVLVYNASLESQWGKLFENRWNGPYKVKEQLH FT MGSFVLEELDGTELRRRYAASHVKKFYARGTNELEEGTDEEDLNEQQSGED FT FVDEDSRWEESDEEFDV" XX SQ Sequence 16542 BP; 5449 A; 2256 C; 3454 G; 5383 T; 0 other; tctggtccct caactcttga tcttacctct tccctttttg gtgtttgctc ttccaactga 60 gtcatctagg ggttcttttg gttatttcaa gcttatctgt aggatattaa tttcgtggta 120 gtacttggat attggtctac ttcaaggttt tattatttag aagtactagg atattggtct 180 tctcagtttt tgatttattg gtcttcaaga ttcaagattt caaattcaag ttattaatta 240 ttattcttag ttgattgttg ttcactttat catcaagaat ttcgtggtta ttgattcatt 300 cattcattaa cttcatttat aggtttcatt tgatagaact acttggatat aagtattcta 360 gaattgattt gggtaatcaa attcaattct tcacatcaag tttgatttat ttgatttcaa 420 gtttaattat tcacttcaag attaattaat tttgacttca agagttactt ttcattcaga 480 gtcttattca tttcaagatt cagtttgaca tcaagaatta tttaaatcaa gtttcaattg 540 attacttcaa ggttcatcat tgaattcaac atttatttat ttttgatttg aagatttatt 600 attcaagatc aagaatttta tttacttcaa gtggtttatt ttacttcaag aacaagaatc 660 ttattcattt caagattcat ttgacttcaa gttttgttta ttcacatcaa aattattttt 720 agtttaagtt gtatttgatt cacttcaaga cttattcact tcaagactta ttcacttcaa 780 gacttattca cttcaagact tattcacttc aagatctgaa tttatttcaa gatttatttt 840 atttcaagat ttattcagag tctgattcat tggaagaatt ttttacttca agttcaagat 900 ttattttagt ttttatttac atcaagtttc aattatttac atcaagaaat taattgactt 960 caagttttaa ttaattttta tttcaagttt caaattattc aattcaagtt tttaattcac 1020 ttcaagaatt attatcaaat tcaagttgtt aatttattca agttatcatc aagaattttt 1080 tgtgaatcaa tcattcatta actgttaatt cattgattca tttcatttta ttcaaatggc 1140 taatttattt gataatgaat ggaaggagga agacatcaaa gctagctcaa gtggttttga 1200 tgctttgaat aaaggaaaag gaaaagaatg atcagaaaat tatcaagttg cagaagacaa 1260 cattgaaagt ttgatggatg gtttaactga gaaagaagtc attcaagttt 
tacagaaact 1320 agcgaattgg cggcgggcac cgcggctaga tgattgagct aatcagtgat tatcactttc 1380 ccggtgcttg ttagtttgct ttcagctggg gctttcagtt tacgtctcag tgtccatgaa 1440 catatgttga ccctgtgata tattgttttg tgtagcacgc tctcagagag agtacacata 1500 tcttaagatc tgttgtgcaa ttcttgccac cttgcatatg tatgataagt ttgcctgcaa 1560 atctcatgga gttttgtacg gtatcttcta aaataccaaa tgatgtcact ttactatcct 1620 ttcttttgat tgcccaaggg ttgcttgggt cagtcgtcat ctccacccca cagtcttcaa 1680 caatgttttc ttccaattca acaactgtat tacatacaat ccaaaaaaac tgcattacaa 1740 atgaatctta tttctattcc ttccattttt ggttgtcagg tgttcttatg tctttgcagt 1800 ctctaacctt ttgatcacta tttttgttgt gaatccactg ctctttggtc ccaatcatag 1860 aaccagcaca ggccatggtg ttccatctat tgcaattagt tattcttcac cagccttttc 1920 attctgattg agttatttga tgtgtcttat tacttcagct acttttccag gcctcagttc 1980 caaccatatt ttcttgatgc atccacatct cactaatcac ggaaatactt tgccttcact 2040 acattcaaca ttatagtgaa cattatgtac ttttttattc ccatctacct attcaatgta 2100 aaagaattca acggagaaaa aactcgatgt tgacttgaac actgtatcaa aggctttaac 2160 aagataaaaa gatattagat ctcttgacag aaagttgaat aagctcgaag agatcaatga 2220 aataaagcaa acctgacaag gtttgataaa aagtaaaaag aaaactttag aattcttttc 2280 cccaaacggt gaggtataaa ttaatatgaa aaacatcaca gatactaaaa cacaccttta 2340 acaagataaa aagatattag atctctcgac agcgagttga ctaagcttga agagatcaat 2400 gaaacaaagc aaacctgaca aggtttgata aaaagtaaaa acaaaactta aacagtagaa 2460 gaaatgtgta aagtagtgta gatatgaata aggctgagat cagtatctgt agattgagta 2520 aaaagagaga aagaataaaa acagttacct ttagaattct tttccccaaa cggtgaggaa 2580 gaagaaaact aaagggaaca agacagagcg cacatataag agaaaaagaa atgagtaaca 2640 accagacgcg acttcaaata caacacccct gcattaaacc accttgctta cgcactacac 2700 aaacagtcac ccctttgtcc aacaatcatt ctagcaataa taaccagtaa gcttagttct 2760 tgatgttaac acgattgatg ttaagctaat ttgatgttaa cacaaccatt gataacgttt 2820 gaaggtcttg acaaacattt attccaatac aatgtctctc actatttcag tcaatattct 2880 tggcttccat tccagcagga tctttgactt aagattcata aaatagagca ctttttgcgc 2940 attcgattgc tcaagtattt gagcttggac ttattatgat ttcagactta ttgagatcca 
3000 cctgatatca acacacatat tggaaaaggt cctgtgtagg taaatcatca gtttatatgt 3060 ttctttccta tcatctagag ttgtaattgg gagcctgttt gatcaaaggt ctttgactat 3120 gctttccacc atgtaaatat ccttgtagca cgtcgacgtg cacacacaaa atcctttcca 3180 tagaccctgt aatgacccat caaagaggtg ccacaacaag cgcatattcc ccttatcacc 3240 tgggaggttg aaagtcagtt gacccctgct gacaacaaag gattctgcaa atcctttgat 3300 caaatcacaa agcacaaagg gtgctcaaca ataatctcag tctaaatccg accaaaagac 3360 tttgagattt gtttcaaaac tctctgtgtg tgcacgttga cgtgctacac aaatatttac 3420 atggtattgg ttgtcaagga ggattgtgat ttgtcaaatg tgaactgatg cggtgttcag 3480 agccattgca agttccatct ggattacaga tattctgctt gatagatacc ataggttctt 3540 gctgttccta ggtgttcagc caaagattta ttagtattct ccctccatac agccgctgct 3600 gttgtagaaa tcctacactg tgtggcatag gctgagccgg agaggggatg agattccacc 3660 aatgaagtac agatgaaact tgagtgtggt gaagctgccc tcatcctatg aaattgaagt 3720 ttgaagaaaa aaacatagca actctaacta cttttgatgt tcaaaaatat aacttgtgac 3780 aatattctat atctatcatt atctgtctgg aattcattgg tgagttactc aaatctgttg 3840 ttgtgaatga cttgtgaaaa ttggcctact tatacttaca ataaaaaagt gtaggtactg 3900 gtaacttgat ttgattggtg tcctttgttg acaactgttg cagaaggtgg aagtttggtt 3960 ccagcgtaaa actgtgtgat tgtttctttt gctcctaagt gctaggtgag taatctcaaa 4020 cctttctgat ctgtgaattt gcaataattg agtatgttgc tttatcaaag agacactggt 4080 atgtattggt ctaaatactg attgcacttt caattgtttc cttaatccca tcacaaaaat 4140 tgaactatat gtcaactcac caaatcaggt aagaaatgaa gtgcacaaac ttctgctgag 4200 tggaatttta cctcccactg ccacataaga acacatagtg atcttcaaca atggctcttg 4260 aatacctgag tgcatcacag tgtggaaact aattcctatg atcatgtctt aatcttgtat 4320 catattcttt tatatcacac tgtagatcaa ctcacaacag aaaaagaaaa gatacataaa 4380 ttgtaccctt caattcttat caaggcatgt ggactcaaga aaaagactca aaatcagcat 4440 ctcaatctgg attatctata tgattttttg catgtcattc tcaatgaaca tcaatcttac 4500 attcaaacag tatagtatat acatacatag aagttaaaat aaacatgaca catacaagaa 4560 agataaataa gtaaaatgca cacatacctt tgagaaatgg gggtcaaaat gaaaaagtca 4620 acattgatgt gatgcaaata caaacataat ttaaagatta aaaaccctca cataatgaat 4680 
taagcaagac caaacaatca atctacaagg aaacaagccc caatattgaa atcttcaagg 4740 tatcagtatg gcaaacaaat gtgatgtgaa ggaaagaaat ggaagttctt acatattgaa 4800 tggatgaact atgatattaa aacatagctc aaaatctcat tctagaataa aaaataagta 4860 aagtcaggtg gagagtgggg aaagaaagga aagagttagt atgattcaga agaaggaatc 4920 aaataagaca taaattaaat cacattacat accttcatca taggccaaga tcaaaataat 4980 aaacctcata aagcgtttga ttttgggaat ctaatattca aacatgattg agagatgaag 5040 gtggaaatcg aaattgaatt ggatgtttat gaagattgta gtcgaaagag gctgagatcg 5100 aaatgattaa ttaaatatga agtttgattt tgaggttcat agctaaaaga aagaaagaga 5160 gttgggtcag tagaattaat tacttacttg aaaagaacaa gatggagggt caagatcgaa 5220 attgaatcca tcaatctatt catcagatac caccagaatc gaaatttaaa catcaaaagc 5280 tatgatggga aagaatagaa gagaaaagaa caaataaact tacctatgta agtgattggg 5340 ttccaaaatc gaaaatgaaa tcagtgatct attcatcggt cattcaaaat caaattttga 5400 accaacaagg ccaagatgaa gttgaagaag tggggaaaag aagagctatc agtatgtata 5460 actaccaaac agaaaagtac ggagagtgga gaagaaagga ttagaaaatc tacgttgatg 5520 tgatttgaaa gagagtccga ttggagatga aaatgaaatt tgggaaggaa gggattgtga 5580 atattccatc atgaccaatc tttcttcgtt cgtctgtttg gctgtttgtc tgtctcgtta 5640 gttgtgtttg tgtcaaagtg tatcatatct ttttctttat ttggagatcc atcgatggga 5700 tcaccgtagt ttttcttata ttttttcttt tgattgtttt ttggagaaga aagtcctaca 5760 attgttaaat caaatgaaat agttcttttt agttagattt ctttggtaac agagacaagg 5820 agaataaaga ctgtaccttt gaatttattt cattgttctt tagtgtgttc ttgtcgtggt 5880 atcttgtgtg gaatattgtg attgtgtttt ttgttgctga gcctcgtcgt aaatgtttga 5940 ctctgtatac ggtccaggac gattttcatc tcggtctcta cttccatcac gtgtgttagt 6000 tgtgtctacc cacgccgctt gcttttccat tgaaacgcca agcaccccaa acaaaggcta 6060 gtgtttgtgt acaaaacact tacggtgatg tcggtgatga gtaagattca taccaaccat 6120 accaaccggg tggcgccaat cacccagttt tcagtgttga aggatgatgg tgcccaatca 6180 gtattggttt gttttcgggc atactcctgg caatcagctt ctaatatttt cgctttccaa 6240 gaatgaagtc gccatgtgtg gtaatggagg atttgtgtag attgagtcaa tgggaaatca 6300 gcggaaatat tgtcaaatcc ggtaacggtt gtgaaaagac acacatatga aaaatgactg 6360 gtaatggagg 
tgaaacaggg acagatgacc gtcaatggag atttcaaggt ttgggttgta 6420 tggctctcat ggtcattgat gtattgtttg ttgtctcgtg ccgtcaagga ctgttcaaac 6480 gagggttgca attgtaagga tgaggtgggc ctcaaaagag gggttgatga gaaagggtgt 6540 gagtgatgga agtgcgcaaa tgccgaattc tgtaggaaag gggaatgaga ctgaggcggt 6600 gggatcttag gtatgtttac cgttaggcct tagggggttt aatccagttt cgccgagaaa 6660 catggttgag cctttagttt tatgattaaa gcaagacaag atgaaggctt caagagatga 6720 gaaagttcgt agaagagaat taagaagtga aggtgtttat cacagtgaca attcagaaga 6780 tgatggtgat gctcagcaag ttcaagatga tttattgggt aaaggtgttg gaaatgaatt 6840 gaaaagtgat gaagaagatc aaattgatga agaattcaaa tcaagaaaaa tgaaggaaga 6900 aattattgaa aatgaaatta agaagaagtt gaaatcaaag aagacaagaa gagttgttta 6960 tgaagaaatc agtagtgaag atgaagatca gaagaaacca gtcaagattc aaccagttga 7020 taaaagctta tcatttaaat caggtggtga tatggaaaga tttttaagag attttgaaga 7080 tgcagctttg attgacaatg caagtgatag agacaagtgt cttcaagtca agttttttat 7140 taaagatgat gaaatgaaga ctattattga agctatggca ggttacagga ctcaaagatg 7200 gaggttattg aagaatgaaa tgaatgaatt atggggtgcg ggtactcaac ctttgtatac 7260 ggatgaggac ttgtataaat tgtgtgatga tttggctaaa gctggaggta tttcaacaaa 7320 tgatgcttac gtcaagtttg aaaataaatt taagaccatg ttaatttatt tgaagacttc 7380 aggtcagatt gattctagtg attcaagatc aatggcaaga tattacttca aggctttttc 7440 atctcatatt caacatcaaa tcaagatctt aatggatagt gaaggtcaat tgaagaagtt 7500 tggtcatcat tatagaatgg ttcctttaag tatgttggaa aattatgtta gtagagtaat 7560 gcggatgcat atggcttttg gagatttaac aaagaatgaa gaagaaagat cttcatttaa 7620 gaaggagaat gttacttcag tgatggaaag ttttagagct caaagacaag cttcattggg 7680 tatttcaggt ggttcaggcg gtacaagtaa cttattggat caattgaaga agaaattaga 7740 aatggttgag aaagaaaatg aaaatttgaa gaaaggtggt gctacaggtg gaagaactta 7800 tggtagcagc ggtaatggta attctactcg tgggaattat ggtggtaata gagaaaataa 7860 ttactcaaga aacaattatg gaaacaggaa tgaaaatgat gaatttaaaa gaaatgaagg 7920 tggtaatagg tccgtttata tgtgttggta ttgtgataga gaaggtcatt caattcatga 7980 ttgcagatat gtaaagcagg atgtggatga taatttggtg aagtttgatg gtaaatcttt 8040 ctttcttccg aatggagaca 
gaattccttg ggatcagaag ccttatagaa aattagtaat 8100 tgaaaattca acaaaggtta ctgaaaacaa agtcgaagtt gagaaggatg aagataaagg 8160 aaaggaagag gttaaatcaa gctgtggaac tttgagtcaa tggaacgttc cttcaatttc 8220 aagtcaaaga agagttattt ttgaatcaga tgcggcaagg aggaaagtgg atcctttgat 8280 caagaagagg acttcaccac gttttgctaa tttggatcaa gataatcaag atcaaattaa 8340 tcaatcaact tcaagaaatg atgaaggaag aaaggaggaa attattatta ctgatgaaca 8400 agctgaggaa tttagacgtg agaattcaga aaggttttca acacctggtt gggatggagt 8460 tgtggaaaat tcaagagatg tacctgcttc aagtgaacct caaattactc ccaagaaaat 8520 tttgaagaga ggtactagaa ttgaagattt aagtactgat tcagacttgg aaattttacc 8580 tggaagttat ccaaatactc cagatcaaag acataaggtt ttgagggaca tttcaattga 8640 agacggaaga acttcaagaa aacctactta tatttcacct ttaaagaatg tcaagaaaga 8700 agaaccaaga tcaaaatcac ctggtattgt ttcaagtaag aagaatggca ctttggaagg 8760 tttaattgat aagtgtaaag aaatgaaggc gcctcaactt acaatggagg aattaattaa 8820 tttagtacct gatttagttg atggtttgaa agttaaaggt ggtaatgttg gaaatggttt 8880 cagtaagaag gttaatttaa gttcaagtgg tagtattagg cgtagtaatg aaattattaa 8940 ggaggaagac attggtaata tggatcagat ggatttggaa aacttggatt ttggtaaatt 9000 aaaatcaatt tggggtagac agaaagattg gttgtactca tgtcctttgg gttttctcaa 9060 tgttgaactt ggagatcatg aaattactac aagatgtctt attgattcag gttctcaaat 9120 taatgtcatg aatgtagatt ttgcgtatag tatgggtttg gaatctcatg ttaaaatcaa 9180 aatgtcatta cgtggtattg ttaacaatga agctgaatta gttggtattg ttgaaaatgt 9240 tctgctttta cttggtaatc atgttcaagg aaaagttcac ttctttttaa ctaccggtga 9300 tgcaccagtt attttaggac gtccattctt agttgatttt gaagcaaatt gtcagttttc 9360 agagaagtat ggtgaaagaa tttcaatggt agatgataga ggaatggttg caaggttttc 9420 aacttgtgat ggtaaagcta gtgaatttca aaggagtatt ccgggaatgg actggcctcc 9480 tcaagtttat gaacatgaag acgcaagagg aaatcaatct tcaagaagac atagatacgc 9540 aaatggtgga aagaaagttt ggaaattagt tagagaagat gctgatgaag gtcaaaggca 9600 tttaaaagat ttgtaggtga atctttacat ggggaaattg tcaaggtgca gcaaaacacc 9660 cctggttcat ctaattattt gcatcacaat aattttcatt gtacttttaa ttatttttct 9720 gacttatctt atattcaaaa tcaaaacttt 
tcttttgact cttattcaat ttcatacttt 9780 tctcaattta attttatttc aaatgatgat gagtttaata ttttttatgg tggtaaatgt 9840 ttgaatagcg gtaaatgtaa agcaagaaga aggggttcaa gacgtggaaa gagaagatta 9900 ttacggttca agattgaaga tggtttattt gaagaagaag aattgatttc attatcttat 9960 ttggcaagtg gattgggtat gaaggaacaa gaattggtat gtaaggactt taaagaaggt 10020 ggatttattg ttggagcaac taagtacaaa ccagtggcta agaagatcag gccagtgaat 10080 gaaccaatgc ctcaatatct caatcctcca cttcaaagac catctctgtc gagggatcct 10140 tatgaaactc ctgttctcaa gactcctcct gaatttattg aaactgaaaa agttactgaa 10200 gaaagactta agatggttaa ttttggtccg cctggttggt taagtagtga agaaatgaaa 10260 ttaattttac atgttattgt cttaagagaa aaatcaattg catttaatga aagtgaaaga 10320 ggtgttttaa gacatgaata tggattacct tacattattc ctgtggttga tcatgagcct 10380 tggcagaaga aagttattcc aattcctaaa gctaaaagac atgaatatat tgaattagtc 10440 agacaaagat tgagaactgg tttatacgag caaagtactt caagttattc aagcccagta 10500 ttttgtgtaa ttaagcatga tggaaaactt agagtagttc atgatttaca agagttaaat 10560 aaagttacaa ttaaggatgc aggtgtacca ccggctccag aagaatttgt tgaagctttt 10620 gcaggaaggt catgttatgg acttggtgat attatgggtg gttatgatga gagggaattg 10680 gcggttgagt caagaccttt gactactttt gagactcctt taggtagatt tcaattgact 10740 agacttcctc aaggtgcaac aaattcagtt gcagtttatc aagctcaaat gatgtggatt 10800 ttacaagatg agattcctga acatgcgggt gtatttattg atgatggtgg aattatgggt 10860 ccaagagatg attatgacaa tgaagtttta tttgaaaatg aaggaatcag aagatttatt 10920 tatgagtatg cggttacatt ggaaagaatt ttattcagaa ttgaggaagc tggacttacg 10980 gtttcgggaa agaagtttgc tgcatgtgta cctgaattag aaattgtagg acatgtagtt 11040 ggttttcatg ggcgcactat ttcagttaag aagaggaaca aaattcaaac ttggccagta 11100 ccaaaagatc caggacaaat tagaggcttt ttaggtgtat gtgtgtatgt aagaatgttt 11160 attgaaggtt tctctgagtt atcttctcct ttaagaagat taacaaggaa gggagttgat 11220 tgggattggg atagtttatg tcaagatgta tttgaacaat tgaaggaaat tgtaggtaga 11280 gaaattactt tgaagagtat tacttatggt gaaggtgcgg gtaaattaaa attagcagtt 11340 gattcaagtt atattgcggc tggtgcggta ttaactcaag aagataaaat taagaaagat 11400 agaccagtgt 
tgtatgaatc tgttacattt actgaagttg aatcaaggta ctcacaacca 11460 aaattagaat tatgtggagt tgcaaagatt ttgaagaaat tacaagttca tttatggggt 11520 caacattttg agttacaagt agatgctaaa gctttgattc aaatgattaa tactcctgat 11580 ttaccaaatg ctcctatgac aagatgagtt ttctttattc aattattttc ttttgacttg 11640 gttcataaag caggtaagac ttttactatg ccagatggtt tatcacgcag acctcaagat 11700 agtgatttag atagtgatgc tgaggaattt aatgaagaaa agaaacttat tgatgttgca 11760 aacaatgatt atgattttgg aatttattgt ggagatttaa ttatcaatga agatttaatt 11820 gatggtgacg taagatggga acaagttgga ttctggaaac acttagtcaa ttatttagaa 11880 aatttaatca gaccagaaga tattagtgat gaagaattta aatcaattaa gaggaagagt 11940 gataagtttt atttggataa tggaaggtta atgcggcggc aaaatcctat ggctcaggtt 12000 gtagttacaa atgttgaagg tcaagattgg attttagaac aattacatga aggtttaggt 12060 cagagaggag ttgaagaaac ttacagaaga atagtactta ggttttggtg gccaagtttg 12120 aagaagtctg tgcgggaatg ggttcaaagt tgtgaagctt gtcaaaagcg gagttcttta 12180 acacctaaag aattgggtca tgctacaggt gaagctactt tatttggtag aattagtatg 12240 gatgctgtac atattaaagc tggacaacat aaatacttag tggttgctag agatgattta 12300 tcaggatggg ttgaagcggt acctttggta aatttaaagg cggataaggt tgctgaattt 12360 ttagaaaaag aatggatatt taggtatggt gcaatcaaaa tggttacagt tgatggaggt 12420 ggtgaattta aagatgaatt agtcaaggcg gttgaaagtt gtggagcaaa acttagaatt 12480 gttacacctt attatccaca ggcagctgga atggttgaga gaggtcacaa gccaatcaag 12540 gataccttag tcaaaatgtg tagtacaaat ccaagtgctt ggcggaagac tttaccaatg 12600 gttttatttg cagatagaat tttaacaaaa agaactacag gaatttcacc ttatgagatg 12660 gtttttggtc aaagagcggt cttacctgtt gatgttgagg ctggaacttt cttaggagtt 12720 aattgggaag aagtacatac aagagctgaa ttattagaag caagaactga acaattatta 12780 agaagagaag aaatgatgga taatgcttac agtaaaatga tgagagttag agaagaaagt 12840 gttaggtatt gggataagaa aaatgcgcac aaattaagga aaagtccttt ggaagttggt 12900 gatatggttt tagtttataa tgcgtcttta gaaagtcaat ggggtaaatt atttgaaaac 12960 agatggaatg gtccttataa agtcaaagaa caattacata tggggtcttt cgttttggaa 13020 gaattagatg gtactgaatt acggcgaagg tatgcggctt ctcatgtcaa gaaattttat 
13080 gcaagaggaa caaatgaatt agaagagggt acagatgaag aagatttaaa tgaacaacaa 13140 agtggtgaag attttgtgga tgaagattca agatgggaag aaagtgatga ggagtttgat 13200 gtttaaatta aaattacaaa agtaaataaa aatataatat agattgcggt ttctgttgga 13260 cttatttttt agatttttta tggttatgtt tcttttgagt tatgtttctt tatatttttc 13320 ttttcctttt tatgttatgg tttctttatt gaatcacggc agtgggttac ttcagaggcg 13380 gatttcttta ttttaaaatt cagttgcggg ttaattttcg attttcttag attatggtta 13440 tgtttctttc atttcatttt atttttctta ttcattcaat caattattca atcaagaggg 13500 tattaaaaaa atcatattag tgaagaggga acaaaaccca aagtagtggc gggttcttca 13560 cttatcaatt tattttatca gttttttttt atagtcttaa ttttttcctt ttggttattt 13620 gggttttttt tataaattta gttggcaggg tttccaattt tttatttttt attcttacaa 13680 aggttcttca aaatttacgg gtgctaaaat tacttcaaca aaattattga agttaaaatt 13740 attttaacaa aagtcataca agttaaatgg gtcatacaaa atttacaata caaaaattta 13800 ttagcggcag gttcaaatta atcatattca gcgtgaaggt tttatcaaat caatggttat 13860 gtttcatttt attatttcat tttttatttc acttacttta attatcattc aaggcggtca 13920 tgggttcatt atttaatcaa catcatagag gcgggtttca attaattctc aaaacttcaa 13980 gttacggtta tatttcattc atatagagtc tttggttatg tttctttggt tatgttccaa 14040 ttttttctta aattacactt ttttttattt tgggcgcgga gtttacaatc aaaattaaac 14100 atcaaaatta taaatgaatc aattttaggg gtatcaaatc aatggtgaag agggaacaac 14160 aaaacatagc tgtggcgggt tcttcacctc tttcttattt tcttattttc attttctcgt 14220 gtttttcatt ttttttcttt cattttattt tggggcacaa ctactttaca agttaatcat 14280 tcacatcagg gcgcaataca tcacataaat tacattcaac ggcagcgggt acaattattt 14340 acatgtcagg cagaattatt catacaaggt tacgagtttc aagttcaaaa cttattggtt 14400 atgtttcttt tatcttattt ttcttatgtt tcatttatgg tcttgggaaa ggtcctcact 14460 gaaagatgtg agaggcgtgg agggaagttt ggaagatgtg gaaaaaggac ccttactcat 14520 attttacgta aggggtaaaa ttttataatc aatttataat ttcaagtata ataagttgta 14580 gttacaggta caacaaacaa taggttttaa ttatattatt tataagaaac cttttaactc 14640 ttacagattt ttaatcttac atattcaaat ttacactttt agaataggag ctttttttta 14700 ggcatactta atcataattt atcataatta ttctcaattc 
tcatctggct caactcatca 14760 aattctcaca atacatcaca tcaatttatt agaggaggtg taggcaaaaa cccattggtt 14820 ttgggggagt ggtcaagttc aaatgagaac attcaacata cttgatcttg tatttcaaga 14880 gtcaatcaga taacaaaagt aaaacagaac agatatttat taaaggtcaa ctgtattgaa 14940 attcagaaac aaagataggt ctaggtctaa ataatcttga gttgtttgat aagagctcta 15000 ggttgaacta gaaaatagaa aagataaaag ataaaagttt atcttgaaga gtgggttgat 15060 ggttagactc gatctttccc ttttgaggaa gatccagtgg aggaaccttc agaatcgttc 15120 ctacaggtaa aatatacaca aataaatcga ttgtcagtac gaatacataa aataaattca 15180 attcaactag aattattaga agcaaaaaga agaaaaataa aagtaaatgt aatttacatt 15240 tctaatgcag gaacgatgct tggagaactt cgtttttcca accctttttg acctatgaag 15300 tggagtttcc tgagagtctt agcgcggtca tcgatttctt cctcattaac ggcaacatga 15360 cctttaaagt gtttgaagac ctcggaatac atggaagaaa tgcaagattc gatttcgtcg 15420 gaagggtaca caggttcttt gttgagaatt accttaggac caagcaaatt tttgtatcca 15480 tcaactgaca aaggagtgct aaagataaat ttgggaccaa cttcaatcgg aggtggtaaa 15540 actggaggtt ggtgcgattt gtcgttagta cgagctgcag gtgtattaga attggcaggg 15600 ggtgaaatat ctaaaatggg gtcattggcg tagtgttttg atcggttatg attgattttg 15660 agcttagaag ggtctggaag cgagctgttg agtcggttac gtttagggct aggagaattg 15720 gctactggcg gagattgaaa tggacctagg ttgactagtt gaggagtctg agcccgaact 15780 ggagctgggg tgacagaagg tacgatgaca actgggacag tggtgatagt gtgaggtgat 15840 cgagcatgag ggggagtcat taagttgtcg tccttggaat cattgtcacg atgtgttttc 15900 ttgggagtag aggggatgcg cttgaaatta ggagaagagg ctatgggatc aggagggcga 15960 tcgaacgttt tgttccgttt cttgttggat cgtctagcag ccatttgctt aagctcggct 16020 tcgcgttgtt tgcgtggtgg gatgggaact ccgtccatat ccattatagc taatatcaaa 16080 taatgaatga gttgaaaaga atcagtacaa gtgaacaatt tatgagttag agctttgagg 16140 gttgggagag attaatagaa agatatggaa gagggagatg agaagtgaag tgggaaactt 16200 gcagttttca ataatgtagt aatgagggta ttctctaatt cgttcggtat cgttggagga 16260 gaacgtcgct tgaaaggctt caagtctttt aaatgaatcg gtgtccatcg agtagttgtc 16320 ccatgcgaat agatgtaacg aagtcgttgg gaaaactgag ttcataagcc cgttatcgta 16380 catcgtagac tttagaataa 
gactgttaat caaatgtctt gaaggagttt ttgttttgga 16440 tatcttgaaa gtgctactta ctggatttgg tcttgacata taggttaagt ttggtttggc 16500 aggatcgaag cgatggggac gttgctgggt tttgggggag gc 16542 // ID GYPSY1_MG repbase; DNA; FNG; 5233 BP. XX AC M77661; XX DT 26-APR-2005 (Rel. 10.04, Created) DT 26-APR-2005 (Rel. 10.04, Last updated, Version 1) XX DE M. grisea gypsy-like LTR retrotransposon (Grasshopper), DE incomplete sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; RNase H; KW reverse transcriptase; integrase; GYPSY1_MG. XX OS Magnaporthe grisea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-5233 RA Dobinson K.F., Harris R.E. and Hamer J.E.; RT "Grasshopper, a long terminal repeat (LTR) retroelement in the RT phytopathogenic fungus Magnaporthe grisea."; RL Mol Plant Microbe Interact 6(1), 114-126. XX DR Genbank; M77661; Positions 1 5233. XX CC 5' LTR from positions 1-198. 3' LTR not present. 
XX SQ Sequence 5233 BP; 1203 A; 1481 C; 1580 G; 969 T; 0 other; tgttacggat tcgtgctgat tgcacgtatc cccgtatata gatagcagcg gggaccacgt 60 gataagttgc attgtatata agggaggaag ggtcgccaag accttttccg cacccctttc 120 ttctcctttt ccttagcgat aataataact cctttcgggt acccaaccgt tgtatgatcg 180 ttggcctacc ctataacatt tattatcgct cattattctt ctttttcctt ttccacggat 240 aatgagttcg ccacccaggg aggaagccgc cgcccaggtg ccgggcgccg cgcttcagga 300 attatggcgg acaatcgcgg atttacaagg gagagtacaa gctttgcaaa caggggcccc 360 gacggtacca gctatcgcgg aggcattaca agccacggca ctgccgaagc gcaaaccact 420 tcgagacccg ccactgtacg acggcgtccc ggcctcgttc acagcctggc gatgcgctat 480 ggaatataaa ctccgccgcg acgcggattt tataggcgac caccgcgacc agtacgaata 540 cctgtgggca gggctcgaga cgtccgtcca gaaggtggtc cgctcttact acgaggtagg 600 cggcagagac ggcgcgtacc gctacacaga tttcctggac tatttggaac gcacttacga 660 cgatccccac aaacgagcgc aagccctggc cgaactcgag acgttgaaga tgaagccggg 720 ccaatcgttc gcccaattta ttgcgatttt cgagagaacc ctggctacgg ccggaggatt 780 agcctgggcg gacgaggtcc gtaccaactt cctgcgtttc cgcgtgtccc ccaggattag 840 ggaggcgtgc gtcggacgag gcatgggaga cggcacctat ttaggagcgg tggccattta 900 ccgccaggtc gcccaggacc ttgaggcgat cgaattggac agacgtttcg gtcctcaccg 960 cgcgggcgcc gccacggccc ccaggccgcc gaaggacgaa gatacaccta tgacgggcgt 1020 ggccgcaatg ggttccaggc ccaatggggg ggcgagggga cgccgtagac ccggacaaac 1080 ccagccttcg gacaccaaca gaagggacac gcgtccacgc gcccaatggg tccccagcga 1140 cgaatatcag cggagacgcg aaacaggggc gtgcctgcgc tgcggtaatt ccggccatca 1200 agtggcggat tgcacctacg cagcggcgct gcgcccttcc acggtggtgg ccgcgactac 1260 gacggagacc cctggagagg gaaacgagta gccctcgaca cgagactcag tcgagggcgg 1320 aggcgcaagg cgcaaccgaa gcaagggcag acagggggcc aggaagcgag catggtgaaa 1380 tcggtctaat ccgacgttct atggacgggg cgcctttcct tatccctgcc ttgttagata 1440 attcacagtt tgttaatgca caagtggatt cgggttgcga gtgttacgcg gcaatgagcg 1500 acaaatgcgc gactcggtta aggatcgaga ggataccttt gccccaagcg cgccacgtgg 1560 gtacggcggt agggcgcgcc cagccgatga ttcgagagtt ggccaagtgc gaaatggatg 1620 ttgacggatg ggttacacca atgctttttt 
atatcgtacc ggggttggcc cgggacgtaa 1680 tattaggatt gccctggatg acccaccgac ggatatcact cgacgcggcg cggaaaaagt 1740 tagtcgtggg cgcggcaagg ggcatgctgg tagacgagtc gtccacacgc cctgtttcca 1800 ccaaacccac ggtggtcatt ggcaacgttt ttctggccgc gtgccggcga gcgaagaggc 1860 acgacggcga acagatggaa tttgcgtcga catccctccg ggaaatgacc agtattttgc 1920 aattaacggc cacatacgag caggcgttac cacaagcgac gcttcccccc gagctggcga 1980 aattcgcgga tctgttcgac aagacgaaag cgtcagggtt gccgccgcac cggggccacc 2040 ttgaccacca cattcgccta caaaaggacg aatccgggaa gaccccggcc ctcccatggg 2100 gccgccttta ccatatgcca cgagaacaat tgttggagtt acgacgtcag atagtggata 2160 tgatggacaa gggctggatt agggccagct cctcgtcggc ggccgccccg gtcctgatgg 2220 tgcgcaaagc gtcggggggt tggaggttgt gcgtggacta tagggccctg aatagcatta 2280 ctatgcagga ccgttacccc ctgccgttaa ttaaagaaac aatacgttct ttaacggggg 2340 cgcggtggtt taccaaagtg gacgtgcgcg cggcattcca caaattacga atagcggagg 2400 gcgatgaaca ccttacagcg ttcagaactc gcttcggatt attcgaatgg ctggtgtgcc 2460 cgttcggact cgccggggcg cccgcgacct ttcaacgtta cgtcaacggc gtgttaggcg 2520 atacgttggg ggactatgca tcagcctatt tggacgacat cctcatttat tcttcaggct 2580 ccaaatccga ccattggtcg aaagtgaccc gggtgctgga caagttggcg gcagcagggc 2640 ttaatttgga cctcgataaa agcgcgttcg cggtgaaaga ggtcaagtac ctagggttta 2700 ttgtcaaggc gggggaagga gtccaggccg accccgagaa gataaaggca attcgagatt 2760 gggaggcccc gacgagactc cggggacttc gcgggttttt gggtttcgca aatttctacc 2820 gcgattttat agacggttac tcaacattaa cggccccgct gttagcgctt acgaaaaagg 2880 ggaccccatt tcgatggacg gaagagctcg aaggggcgtt cgaagcgttg aaacacgcgt 2940 tcctccaagc accgatactc gcccagtggg acgacgcgaa ggatacgagg atggaaactg 3000 actgttccgg agcggcgtta gggggctgcc tttcgcaaaa ggggacggat gggctatggc 3060 gcccggtcgc gttccattcg gccaaattga ccgacgctca gagaaattat accatccatg 3120 acaaagagct gctcgcggtt atagcatgcc tgaaagcatg ggacgcggaa ctccgcagcg 3180 tccgtcggcc ctttttaata ttaacggacc acaaagcgct ggagtatttt tccaaaccca 3240 gagaagtctc ggagagacaa atgcgttggg cagagacact ttcgaaattc aattacaatt 3300 tacgcttccg cccgggccgg ctagcggggg tccccgacgc 
gttatctagg agagaacagg 3360 acgaatgcac cacaccacga ttaacgaccg tgttacgccc ggccagacca cgaaccaacc 3420 tagcgccggc gaatactata cccactgcaa ccgccgcggc tcccccgccg ggatcgcagg 3480 ttttcgcaca ggcacatttg gcccggcttt gggacgaagc attgtcgaaa gacgacctgt 3540 accaaatacg tttggacgca gtgcagggag acgaacggcg cttcccgcca gaggccgaga 3600 cgaaggcgca ggtggcagat tgcgcggtga accggctcgg ggcgcttcaa tacagggggc 3660 ggttgtggtt accgaactgg gaaccattaa ccacggcggt gttgcaacgg acccacgaat 3720 cgccgatggt gggccattcc ggcagggacg gtacgttcgc tatactcgcc agggactacc 3780 attgggacgg tatggcggag cacgtgaggc gtttcgtacg caactgtgat atatgtcgac 3840 ggacgaagcc ctcgcgccgg gcacgccagg gcctcctcca accactaccc ataccggaca 3900 gattttggaa acaaatttcg atcgacttta tgaccgacct acccgggaac ggggaggtga 3960 cccctcggta tttaatggtg ataacggaca gactttccaa atacgtacag ctagaggcga 4020 tgcactcgat gaaagcggag gattgcgcgg cacggttcct gtcctcctgg tggcgattcc 4080 ggggtttccc cagtcagatt atttccgata gaggctccga ttgggtgggg ggtttctgga 4140 cagagctgtg caggcaaacg ggggtggaac agctactttc aacctcttac cacccagaaa 4200 ccgacggggg cacggagcgg gctaaccaag aagtccagca ataccttagg gcttatatag 4260 ctttcgacca aggggattgg cccgaccact tgggggcagc acagctggcc ctcaacaacc 4320 ggaattcttc agtcacggga acaagcccga acaaactatt gcttggattc gatatcgagg 4380 cggtaccgaa tgccgcccct ccctccaaag caccggcctc gagccccaaa gccagggcca 4440 ctcggttttt ggaacactta cgagagggct ccgaattggc ccaggccgcc attgcgtaca 4500 accaacaacg ccaagaggcc ggggcgaacg agtcacgacg cccggctgaa cgattccggg 4560 tcggggacga agtgttcctc aacctacgca atatacgtac gaaccgcccg tgcaggaagc 4620 tcgattatat ttacgggaaa taccgagtcg tggccgtgcc gaccccactt acggtcacgt 4680 tggacgtccc gcgaggaatc catcccacct tccacgtaga attggtggaa cgagccgcca 4740 gcgacccatt accgtctcaa attcgaaccg actcccggcc cgaccccgaa cttcaaccga 4800 cagaggaaac cggcgaaccc gaggaagtct gggccgtgga ggcgatcctg gccgcgaaga 4860 accgcagggg gaggggaggc ggccgtcagg tattggtcaa atggcagggg tacgataacc 4920 ccacgtggga gccgctggag ctaatgacgg acacccgggc cctggacgag ttcgaggccc 4980 gctggggagg cgtccacacc aacgacgggc cgacctccag acgtcgccga 
ccgaccacga 5040 ccgccaccca gagattcccc acaccctcgg gcccggaagg ggaggagccc accgccgcgg 5100 gccagcgacg acgcagcgcg agattgttac gcgtttcaca catgtgcacc tcggaaggag 5160 gagggatggg caagacgctt gcgcgtaacc ggacgttgca agggcgttaa aagccccggg 5220 gccagttgaa ttc 5233 // ID LTR13_CN repbase; DNA; FNG; 383 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 14-APR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR - consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW LTR13_CN. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-383 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-383 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-383 RA Gentles A. and Jurka J.; RT "C. neoformans LTR sequence LTR13."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX CC Average similarity to consensus is 88%. XX SQ Sequence 383 BP; 134 A; 65 C; 120 G; 63 T; 1 other; tgtaaggacg aggaaataga gaaggaagga agagaacaat aagcagggga agtagtttcg 60 gaggrccgag gagagagaag aagaagagag gaagagaagg aggaccgtgc gaggaggcgt 120 tagggaagag agaggtccga gcggaggata cagggaaagt gtgagaaaga tgagtcagct 180 cgtagagctg agaaagatat aaaaggcgta gcctttagag tagataaaat gcatagccca 240 agcagcttag tcttctcgaa ctctccagtc taccgaacca cttcagagct tctagtacaa 300 ttgaatacaa acgaccgagg acaacgaaac tattgtagct taaggtgctt tccggcctct 360 acgaggacag tggcaccgct aca 383 // ID Gypsy-1-LTR_ACa repbase; DNA; FNG; 554 BP. XX AC . XX DT 26-FEB-2009 (Rel. 
14.02, Created) DT 26-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE An LTR portion of the Gypsy LTR retrotransposon from Ajellomyces DE capsulatus - consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-1-LTR_ACa. XX OS Ajellomyces capsulatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Onygenales; Ajellomycetaceae; OC Ajellomyces. XX RN [1] RP 1-554 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Ajellomyces capsulatus."; RL Repbase Reports 9(2), 359-359 (2009). XX DR [1] (Consensus) XX SQ Sequence 554 BP; 127 A; 153 C; 123 G; 149 T; 2 other; tgtcacaccc ttggttgtgc caaggccccg tgacggctcc acacggccca tcgyttcccc 60 tcagataatc ccatctatcc catracgtca cgcaggtgtc cgaaccggtt tccaaccacc 120 ctatgacatc attcggatgc aatagaagag gatccgattg atgagatatg agaatcctgt 180 ttctatacgg ggtctggatt ccttctggca gatcaatgca gcgccactgc ggagttgctt 240 cggctaccct cgcttataga gtgaggggag gatggctgcc cgttgtctat atagaatagc 300 tagatgctgg ctctctattc caggagccag cgattcgtct tttatctttt gacaataaga 360 taccagagtt accactgtcc tcatacaata ttcatagtgt gccgagtacc tctggtagtt 420 cagtatttgt atagtattct acgagttcgc ggttcaagcg agataatcaa gcccacccat 480 aaggcttccc attatatacc atcctcctct gtgccgctgc cgagtagcaa tcgcctcggc 540 gccgtcacat taca 554 // ID YETI-LTR_PA repbase; DNA; FNG; 354 BP. XX AC AJ272171; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 03-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE P. anserina gypsy-like LTR retrotransposon, LTR sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW gypsy-like LTR retrotransposon; LTR; YETI-LTR_PA. XX OS Podospora anserina OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; OC Lasiosphaeriaceae; Podospora. XX RN [1] RA Hamann A., Feller F. and Osiewacz H.D.; RT "Yeti--a degenerate gypsy-like LTR retrotransposon in the RT filamentous ascomycete Podospora anserina."; RL Curr. 
Genet 38(3), 132-140 (2000). XX RN [2] RP 1-354 RA Gentles A. and Jurka J.; RT "P. anserina LTR."; RL Direct Submission to Repbase Update (16-MAY-2005). XX DR GenBank; AJ272171; Positions 41 394. XX SQ Sequence 354 BP; 101 A; 91 C; 87 G; 75 T; 0 other; tgtcacacgg caaaagccac ccacctgtaa ccacaaccaa caacactgtc gataccaccg 60 caaggagggg taactgcaag ttgtgagcct ctcgttcaaa gtgaggcaga caatcagagg 120 catggcggtt atcaaaggaa gcccactggc ccagtaggat attgaaggac ttggcgatgg 180 cctgaggcca acaaaggcac gggccacgcg agggagggta aaatacgcgt ttggaaacac 240 caagactgta tctcttcttc ttcttttcgg tatcacgctc ttcgaggcca ttgaagatta 300 ttagcctgaa ttgaaccatg agttccaagc cctgatatta cagcccttgt taca 354 // ID Gypsy-117_MLP-LTR repbase; DNA; FNG; 351 BP. XX AC AECX01000853; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-117_MLP_; KW Gypsy-117_MLP-I; Gypsy-117_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-351 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000853; Positions 254937 254587. XX SQ Sequence 351 BP; 81 A; 76 C; 79 G; 115 T; 0 other; tgtaagggtt acacttagac ataggcggta tggttaagag aagcatgggt ttatatgggt 60 ttcttatata tcttgttgtc agcgcccggc ttcaaaggat cctcactcct cagagagcta 120 accttgggat tctttggagc taggtgagtt ttcagttatt ttcatatctc tctggttcct 180 tttcaattac ttgtactatc ctgactcctc agagagctaa ccttgggatt ctttggagct 240 agttcacaat ataaccgaag ttctaagtta attgttccca ccgtgctctc tcgagaagtc 300 gcgccggact aagtttccgg acttcccagt gaaggtctga gagaccttac a 351 // ID Gypsy-83_MLP-LTR repbase; DNA; FNG; 179 BP. XX AC AECX01001005; XX DT 22-APR-2011 (Rel. 
16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-83_MLP_; KW Gypsy-83_MLP-I; Gypsy-83_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-179 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001005; Positions 31209 31387. XX SQ Sequence 179 BP; 54 A; 49 C; 26 G; 50 T; 0 other; tgttactagt ccgtctgtga aaggtctaac acatacatat cacagattga acactatatt 60 gtatacacac cactagctag atgttgttcc ttttccttca tccgacaatc tcgtaatagg 120 actagacagt atcaggattc ctcactccag tcccaaagac ccaaccctga gccttaaca 179 // ID Gypsy-98_MLP-I repbase; DNA; FNG; 5704 BP. XX AC AECX01000493; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-98_MLP_; KW Gypsy-98_MLP-LTR; Gypsy-98_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5704 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000493; Positions 37955 32252. XX CC Positions [4504-4983] - Integrase core CC 'TGATC' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(289..4035,4039..5673) FT /product="Gypsy-98_MLP-I_1p" FT /translation="MSINLSNDESMNETPWSDSVESMNEDTSATPRQTPRG FT SPSPQIPPHQMRQRRRSFSTETSASGVSTTSTASEIQSLTEGLRSLQETVA FT NSYSQMSVITDRLDRLSMNSPMYAQFSQPQPRFEHNQPRFGNDTTQPPRTR FT MRQNYQARAATQSYANSRPSSYFSAQAAPSAQAAPVPPAANDSRRQATSPA FT PPLNHPGAPNPARPAAQAPEINLQPAEAQQRRATPAASNRGVGGAPPPNPP FT PRDLFSDERDESEATVVYSDPAPIRGLYFPGEPTELRDFLMEIREAMRSVD FT YRFLNDVRGEQRRINWVAQRFKTRDARGVTTSSTALAWFRGLLATNAHSQG FT IWSEYADLKAFDYLIGELSSLEAFYGAMIDEFRDVNSVRNANEALQNLKQR FT NQPLPDFNSAFRITAANTSLSIDSQMELYRTNLNPAISSIAVFIPGWVACT FT TLDEKMRIGSTAAAMANECSQIAGHPFNTRNMHYRAPPPQAAVRQAPVVCV FT PVPIDPNSMQIDAVGTKEYSSDEKRVWSAVKKICWGQFWCFNCSGPFSTSH FT RTSRDCPHPEASYEERLKFIKQHRPSTDVAPVQVYQPIAAASSSRVIDAPL FT GRNWADAMSQEANDALQEMTDEWIDHELELQGIDMALNAVRIAPIPSFSSR FT FCVPLNLVLGSSSVTVRALIDTGAMDSFVHKRVVRSHELFTNSLPTPLTCS FT GFDGTPGGDLVTHSWLGIGHLTNGLEVSDSLVLDLKVNNIGCYDVILGLPW FT LDKNRAVIHCSSAGRGVEIGALTLMCDNEDIDDSPERNEVRQFVPKCFLDF FT EDVFLPQVTSLPPHRKHDVVINLKEGCEPPTSRAYDLSAADEAELKAWVDD FT QLKKGFIRLSSSPASAPAFLAKSAGRKNRPCIDYRGLNKVTVRDSYPIPLV FT KSLLSRVRGCKRFAKIDLKAAFNLLRIAKGHKWKTAFRAPGGLYESLVLPF FT GLANGPAVFQRFIQYVLHEYLDVFCFVYLDDILIFSKDDKEHEDHIRKILL FT KLRENNLQASPTKCEFFKEEVVFLGFVVLTTGLKMDPAKLATIKEWLYPSN FT LAELRRFLGFTNFYRRFIARFSAEVARLTDLTKNGVNVEQGLSQPDTRSCF FT NSLIAAFTSAPFLKHFDFDQRRVLQVDASAYAFSAILSQPGEGGKLVPVCY FT YSKKLTPAEALWQTHDQELGAIVAAFKEWRSWLMGLNEQIVVFSDHANLRY FT FMEGRSLTPRQSRWAAFLSAFNFVILHTPGRLNPADPASRPDFTTVTAPEA FT LVLFKSVSGVPIDAIHLSSDGVGYDVSFAEPTCEALSEIKSSYQSPNFPTV FT PKDEGLVFIDNAWWFKGRIFVPSHLREGILKLYHSKPLSGHWGVAKTLDLL FT SRTFGWRNMRLDVLSFTRKCHSCQKVNRDLRPRQGEMIALPIPDRPWSTIG FT VDFIVKLPKSHDFDSIMVVVDHLSKFTHFIPAKETWSAVELADAFVENVFR FT LHGLPDKIVSDRGAVFVSAFWTAVSKRLQISPSPSTAFHPQTDGQVERLNA FT VLEDYLRHFVEENQSDWKSWLALAEFANNNSISSSTGFSPFFANYGYHPRF FT NSVTHASSVPKANDFVGHMQRIQSQLQVTLAEAKERQARFYNKGKRITVCY FT KPGDLVWLSRKFIKTRRPSQKLDYRRIGPFAVVRMVGKNAVQLSLPREYAR FT LHPVFNVALVMPVVGITDDVLDFSCLPEVPINRGIGFESDEARNVAQWLSI FT 
SYVLGHRKVEGVHQYLLRSEESGLDDSWVPLHQVSRGLDVFIQAYHELNTE FT EEKPHWEHFEDTSRPDLGYVAGA" XX SQ Sequence 5704 BP; 1364 A; 1279 C; 1374 G; 1687 T; 0 other; ttttgtttcg atcttactat cactcttaca tcagttcacc tgatttcaat aaaacaagcg 60 tcatatcaga tcatctgata tcaactcaat aaaaacttcc tcatcgtgct ggtagcacaa 120 taatttcgaa ctaagtagtt caaaatttaa aagactcgga tgaattgaaa aattcaaaac 180 aagtcgcgtt caacatattt ctcttttgat aaaatattcc gaaatttttt gcattttacc 240 tcttcttttc tttttctccc taactcttca agtcgagttc tttccagaat gtccataaat 300 ttatccaacg acgaatccat gaatgagacc ccttggtcgg acagtgtgga atcaatgaat 360 gaagacacct ctgcgactcc gcgtcaaact ccacgcggct ctccttcacc tcagatccct 420 cctcatcaga tgcgtcagcg tcgtcgttcg ttttcgactg aaactagtgc ttcaggggtg 480 tcaactacca gcactgcttc tgaaattcaa agccttacgg aaggtcttcg ttcattacag 540 gaaactgttg ccaattctta ctcgcaaatg tccgttatta ctgaccgtct agatcgtctt 600 tcaatgaact ctccaatgta cgcacagttt agccagcctc agcccagatt cgagcataac 660 cagccgcgtt ttggaaacga tactactcag ccccctagaa ctcgcatgcg gcaaaattat 720 caagctcgtg ccgcgacgca aagttatgct aactctcgcc catcttctta tttttcggct 780 caagcggccc cttcagctca agccgcgcca gttcctcctg cagctaacga cagtcgcaga 840 caagcgactt cgcctgctcc tcctcttaat cacccgggtg ctccgaatcc agctagacca 900 gcagctcaag ccccggaaat taaccttcag cctgcggagg ctcaacagcg tcgtgctact 960 ccggcagctt ctaatcgagg cgttggggga gctccccctc ctaacccacc tcctcgtgat 1020 ctgttcagcg acgaacggga cgagtccgaa gctacggtgg tttattctga cccagctcct 1080 attcgtggtc tttattttcc aggtgaacca acggaacttc gcgacttcct gatggaaatc 1140 cgggaagcta tgaggtcggt cgattatagg ttcttgaacg acgttagagg tgaacaacgt 1200 cgtatcaatt gggtggcaca gcgcttcaaa actcgtgatg ctcgcggtgt cacaacttct 1260 tcaactgctt tggcctggtt tcgagggttg ctggcgacta acgctcactc gcagggaatt 1320 tggagcgaat atgcggattt gaaggctttt gattatctta tcggtgaact ttctagtttg 1380 gaagcttttt atggtgctat gattgatgag ttccgggatg tcaactcggt tcgaaatgct 1440 aacgaagctt tgcagaacct caagcagcgc aaccagccgt tgccagactt caattctgcg 1500 tttcgaatca ctgcagcgaa tactagcctc agcattgact ctcagatgga gttgtatcga 1560 actaacctta atccagctat 
tagcagtatt gcggttttta tcccaggatg ggttgcctgt 1620 acgactcttg atgagaagat gcgcataggt tctacggcgg ctgctatggc aaacgaatgc 1680 tctcaaatag caggccatcc gttcaatacg agaaacatgc actatcgtgc tcctccccct 1740 caagctgctg tgcgacaagc tcctgttgtt tgtgttcccg ttcctattga tccaaattct 1800 atgcaaattg atgcggtggg taccaaggag tactcctcgg atgagaagag agtgtggtct 1860 gccgtaaaga agatatgttg gggacaattc tggtgtttca attgttcagg tcctttttcg 1920 acttcccatc gtactagtcg cgactgtcct catccagagg cttcttatga agaacgtctc 1980 aagtttatta agcagcatcg cccctcaacc gatgtggcac cagttcaagt ttaccaacct 2040 attgcggctg cttcgtcatc tcgtgtgatc gatgctcctt tgggcagaaa ttgggctgac 2100 gcgatgagtc aagaggcaaa tgacgcatta caggagatga cggatgagtg gattgatcat 2160 gaattggagt tacaaggcat agacatggca ttgaacgcgg tacgtattgc gccgattcct 2220 tctttttctt ctcgcttttg tgttcctttg aatctggttt tgggatcctc gagtgttacg 2280 gtccgtgctt tgatagacac tggtgccatg gactcttttg tccataagcg tgttgttcgc 2340 agtcatgaat tgttcactaa ttccttacct acacctttga catgttcggg ttttgatggt 2400 acaccaggag gcgacttagt cactcattcg tggttaggga ttggacattt gacgaatgga 2460 ttggaagttt ctgatagctt ggttttagat ttgaaagtga acaatattgg ttgttacgac 2520 gtgattttag gtcttccatg gcttgataag aaccgtgctg ttattcactg tagctccgcc 2580 ggaagaggtg ttgagattgg tgctttaact cttatgtgtg ataatgaaga tatagatgat 2640 tctcctgaac gaaatgaagt tcgccaattt gttccgaaat gctttcttga ttttgaggat 2700 gtttttctgc cgcaagtcac ttccttacct ccgcatcgta agcatgacgt cgttataaat 2760 ttgaaggaag gatgtgaacc gccaaccagc cgcgcttatg acctgtccgc tgcggatgag 2820 gctgagctta aagcttgggt cgacgatcag ttgaagaagg gttttatccg cttatcttct 2880 tcccctgctt ccgctccggc ttttctcgcg aaaagtgctg gacgcaagaa tcgtccttgt 2940 attgactatc gaggtcttaa taaagtcaca gtgagggaca gttatcccat tcctctcgtt 3000 aaatcactct tgagccgagt tcggggatgc aagaggtttg ctaagatcga tctgaaggca 3060 gcctttaact tgcttcgaat agctaaaggt cacaaatgga agactgcttt tcgagcgccg 3120 ggtgggttat atgaatccct agtgttgcca tttggtcttg cgaatggacc ggcagttttt 3180 cagcgcttca ttcaatatgt gctgcatgaa taccttgacg ttttttgttt tgtttacttg 3240 gacgacattt taatcttttc aaaagacgac 
aaggagcacg aagatcatat tcgcaagatc 3300 ctccttaagc tgcgcgaaaa taacctgcaa gcttccccca ctaagtgtga gttcttcaaa 3360 gaggaggttg tattcttggg ttttgtagtt ttgactacag gtcttaagat ggatccagcg 3420 aaattggcaa ccatcaagga atggctgtat ccttcaaacc tggcggaatt acgaaggttt 3480 ttgggtttta ctaatttcta tcgacgcttc attgcgcgtt tttctgctga agtggcgcgt 3540 ttgactgacc tgactaagaa tggggttaat gtcgagcaag ggctgagtca gccggacact 3600 cgttcttgtt tcaactcttt gatcgcagcc ttcacttctg ctccgttctt gaagcatttc 3660 gattttgatc aacggcgagt attgcaggtg gatgcaagcg catatgcttt ctcagctatc 3720 ctttctcagc ctggcgaagg tggcaagtta gttccggttt gctactattc aaagaaattg 3780 acaccggcgg aggctctttg gcaaacacac gaccaggaat taggtgccat cgtggcggct 3840 ttcaaggaat ggaggtcgtg gcttatgggt ttgaatgaac agatcgttgt attttctgac 3900 catgctaacc ttaggtactt tatggaaggg cgttctttga ccccacgcca atcgcgttgg 3960 gcagcctttt tatctgcgtt caacttcgtc attctgcaca cgcctggtag actgaaccca 4020 gctgatccag cttcatgaag gccggatttc accactgtaa ccgctccgga ggcattggtt 4080 ttgtttaaat cagtctcagg ggtacctatc gacgcgattc acttatcatc ggacggcgtc 4140 ggttacgatg tatcttttgc ggaaccaacc tgcgaggctt tgtcggaaat caagtcaagc 4200 tatcagtccc ccaattttcc aactgtgcct aaggacgaag gtttggtctt catcgacaat 4260 gcatggtggt ttaagggtag gattttcgta ccttcccatt tgagagaagg gattttaaag 4320 ctctaccatt ctaagccttt gtcagggcat tggggtgtgg ctaaaacttt ggatttatta 4380 agcaggactt ttggttggag gaatatgagg ttggacgttt tatcgtttac taggaagtgt 4440 cacagctgtc agaaggtgaa cagagatttg cgcccacgtc agggtgaaat gattgctcta 4500 ccaataccgg atcggccttg gtctacgatc ggtgtggact tcattgtgaa attacctaag 4560 tctcatgatt tcgattccat tatggttgtg gtcgaccatc tgtccaagtt cacgcatttc 4620 attcctgcaa aggagacttg gtcagccgtg gagctcgctg atgcttttgt cgagaacgtt 4680 tttcggctgc acgggctgcc agataaaata gtttcggaca gaggagccgt gtttgtcagt 4740 gccttctgga cagcggtaag taaacgtcta cagatttcgc cctccccctc tacggcgttc 4800 catcctcaga cggatgggca ggtagagcga ctgaatgcag tgcttgagga ttacttgagg 4860 cattttgtcg aagaaaacca atcggattgg aagagctggc ttgcattagc ggagtttgct 4920 aataataact caatatcatc ttcgacaggt ttttctccat 
tttttgcaaa ttatggttat 4980 catcctcgat tcaattctgt tactcatgcg tccagtgttc ccaaagcaaa tgactttgtg 5040 ggtcatatgc aacgcattca atctcaattg caggttactt tggcggaggc aaaggagagg 5100 caagcgcggt tttacaacaa aggtaagcga attactgtct gctacaaacc aggtgattta 5160 gtatggttgt cacgcaagtt catcaagaca aggaggcctt ctcagaaact ggattacaga 5220 cggatcggtc catttgctgt tgtacggatg gtgggcaaga atgctgtaca gctcagtttg 5280 cctcgcgaat atgcgcgatt gcatcccgtt ttcaacgtcg cactggtcat gccagtggtg 5340 ggaataactg atgatgtttt agactttagt tgtttgcctg aagtgcctat taacaggggt 5400 ataggttttg aatccgatga agcaaggaac gtggcacaat ggctgtcgat ttcttacgta 5460 ttaggtcata ggaaagtgga aggagttcac cagtacttgt tgagatcaga agagagtgga 5520 ttagatgata gctgggttcc tttgcatcaa gtatcaaggg gtttggacgt gttcattcag 5580 gcgtaccatg agttaaatac ggaggaggag aagccgcatt gggagcattt cgaggacact 5640 tcaagaccag atttaggata tgttgcgggt gcttaatgga taggcgtata tggaggatta 5700 cttt 5704 // ID Copia-3_TMe-I repbase; DNA; FNG; 4843 BP. XX AC CABJ01002243; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_TMe_; KW Copia-3_TMe-LTR; Copia-3_TMe-I. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-4843 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01002243; Positions 20595 15753. XX CC Positions [1787-2296] - Integrase core CC 'TCTTG' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1538..2545 FT /product="Copia-3_TMe-I_1p" FT /translation="MMPINDSCKREDVTNTHIEIKPPQQDLQLWHEQLGHM FT NMSDLRRLKSLSTGLTIKDHGSPLAVCPACLEGKQQRSFNRKNKSAHVSEK FT LGLIHSDSCGPFPVPSMASARYFILYVDDCTRMVWCYFLKQKSALEVLEMF FT KNFKVLIKKHSGKSILRFRCDNGRGEYDNNYFQEYLRSEGITYEPSAPYTQ FT HQNGVSECMIHIIMERTRTILLKSKLDVNFWAEAANTSVYLHNHCPTKALD FT GQTPYEAWNGTKPQLQHLRRFGCDAYVHVPAERRKKFDPKSRLCIHLGYIH FT NTTKLWRVWYSRTRQVIHVAMLSLMKQALVDMFNQRPAAHSVLC" FT CDS join(2570..3529,3533..4666) FT /product="Copia-3_TMe-I_2p" FT /translation="MIPPTHLACYRKVYQTIHSSISDREILLRLTIPLVKN FT FMEVIEERIGDADAPTPVINADCFPIPTSESTARSISINGMAPEEVEPMIS FT HNMAPDGHQHLALRKSSRARKPSFWLRDSITFAARASVYEEPQTYQEALEQ FT QSYRHWENVIREEFASHAENCTWELAELPLGKHDISCKWVFKLKTNTDRSI FT QYKARLVIHGFEQVPSIDFQETFAPVAKFVTIRVLLALATQYDWEIEQMDM FT KTAFLHPALKEEVFMTIPEGYLDYSDMLTLVGEYPVLRLRKALYGLKQAPR FT AWYDNINHFISSIGLLQSSEDHSLYFSDIIVILYVDDLSLFAKVMQVIEMM FT KDKLSTTYHMTDLSPITQFLGLQVTRNRPLKIINLHQFSYTQSIPKRFQMS FT DCKGISTPMEPNFNLPSCLDDSDIHNQSSYQSKIGGIMYAMLGYRPELAYT FT ISSLSKHNARPTSLHHNALQCVFRYLQKSKAIGIRYQGTENNVESFPKLIS FT HTDSDWAGDKDDRKSTGGYVIMLCQGAISWKTRKQDVVAMSSTEAEYIALT FT EAAKEVVWLRRLLIELESRHIAYVIPNITAEHFHGLHKQWESLDTAGKTPE FT QFPRNSWISTISQTIYADNQGAMKLADNPQLHSRTKHIDIRYHFIRTLLER FT DEVTISYIPTAEMTADMLTEALTREKHERHTKSMGMIDLADEIEGKEFL" XX SQ Sequence 4843 BP; 1439 A; 1083 C; 983 G; 1338 T; 0 other; ggttatgagc ccggattaac actttccaag ctgatatcct catataccaa aagaccctga 60 atatcctttt tttttatttc ttattctctt tatactatct atctcatatc ctgctctaaa 120 aattattgtc gaatccggat atcccaaaag cgcaatgcca atcgacacat ccaacaacga 180 gtatacggag aagagatttc gatgccctct cttgactgaa aagaactttc ccaccaggga 240 acgtagtgtt aagatgcgat tgatcgcaga aaattgctgg gagatagtta tcggggaaga 300 gaagatccca gatccccctg tacttgctga tggatcctct agagcaatgg aatcagcaca 360 tgctacagca ttaaaagagt ttaaatcaca attgggtgac tttactcgac gttccggcaa 420 agccgcttcc ataattaact ctaccctttc cccgagtatt gagttctatg tcaaagacac 480 tatcaatcca aaggagatgt gggagattct tagaaataag cttactttag tagataactg 540 gagtctccag 
tgtaccctaa agcgagactt ctataagctc agctatgatg ggaaggaatc 600 catcaccaca tatatcaatg gcctccgtat cttccaacaa caacttcagg gaaccaacaa 660 tgagatttca aatgatgaac tcgttaatcg gattataacc tctcttccag ccagttggga 720 acagcgtatt attacccttg atgattgacg agatttgacc cttgacgacc ttgaacgaac 780 tctccatagc cattaagcca aaattgccaa tactaccacg caggctacca aggctttcgt 840 agtaacaaaa ggattccagc gtggtcgtgg acgaagccat ggaggacgag taagaagcaa 900 tcgaatgaac aaccgcatta gaccagatcg gaatcctacc acatactggt actgcctaaa 960 ggtcggacat tctcaaaacg attgttttat caaaaagaaa gcggatgagg ccagaagaga 1020 ccgaacgagg agaccactac caaagcgatt ggatttctgt ggagacagtg ctgatgctgc 1080 ctccgccaat gctaccacac atgccctaat aatgaagcga atatcagtaa aacacttctc 1140 tgagacctgg ttaatcgatt cgggtactac cgatcatatg tgtcctgatc gcatggattt 1200 ttcctctctg tgacgattga atacaccgat ccgtatagtt cttggtgatg ataccttgtt 1260 gcatgcctat ggggctggtt ctatctatct aagtccccaa atccttctta ccaatgtttt 1320 atatgttcct gatttggaaa ttaaattact ttctgttagt gcgatgactc gatcaaaatg 1380 ttaggtaatc tttgacgagt ctggttgtca tatcctgaaa gatggttcca agattctctc 1440 agcctccgaa tgcggtaacc tttttaaggt taatctacca gaagtctact atagtttagc 1500 aaccgtatac gaaggctcca atcagaacag gagtaccatg atgccaataa atgatagctg 1560 taaaagagaa gacgtcacca acacccacat tgaaattaag cctccccaac aggacctaca 1620 actttggcat gaacaattag gtcacatgaa tatgtccgat ttacgacgac tcaaatccct 1680 atcgactggg ttgacaatta aggatcatgg aagtccattg gcagtttgcc ccgcttgcct 1740 ggaaggaaaa caacagcgct cctttaaccg caagaacaaa tctgcacatg taagcgagaa 1800 gctaggactc attcactctg actcttgtgg ccctttccca gtaccatcga tggccagcgc 1860 acgatacttc attctttatg ttgacgattg tactcgcatg gtatggtgtt acttcttgaa 1920 acagaagtca gcattggagg tattggagat gttcaagaac ttcaaggttc ttattaaaaa 1980 gcattctgga aagtctatcc ttcgtttccg ctgtgacaat ggtagaggtg aatacgataa 2040 caattacttt caggagtatc tacgttctga aggaattacc tacgaacctt ctgcacctta 2100 cacacagcac caaaatggag tgagtgagtg tatgattcat attataatgg aacgaactcg 2160 gactatcctt ctcaaatcca aacttgatgt taacttttgg gctgaggctg ctaacacctc 2220 agtatacctc cacaaccatt 
gtccaactaa agcactagat gggcaaacac catatgaagc 2280 atggaacggt accaagccac aacttcaaca tctgagacga tttggatgtg atgcctatgt 2340 ccacgttccg gcagagcgac gaaagaagtt tgacccgaag tcacgcctgt gtattcatct 2400 aggatatata cacaacacta caaagctttg gcgagtctgg tatagccgaa cacgccaagt 2460 cattcatgtc gcgatgttgt ctttgatgaa gcaagcttta gtggacatgt tcaaccaaag 2520 gccagcagcc cactcagtac tctgttgatt gatggagtaa tagacttaga tgatacctcc 2580 tacacatctg gcttgttacc gcaaagtata ccagacgata cacagttcaa tctcagatcg 2640 cgaaatcctc ctgaggttga cgatacctct agtgaaaaat tttatggagg ttatagagga 2700 gaggatcggt gatgccgatg cgcctactcc tgtgattaat gccgattgct tcccgatacc 2760 caccagtgag agtacggcaa ggtccatttc cataaatggt atggcaccag aggaagtgga 2820 gcccatgatt tctcacaaca tggctcctga tggccatcaa catctagctt taagaaagtc 2880 ctcacgagct aggaagcctt ctttttggtt acgggatagt atcacgtttg ctgcgcgggc 2940 aagtgtttac gaggaacccc aaacatatca agaagcattg gaacagcaat cctatcgcca 3000 ttgggagaat gtgattaggg aagaatttgc atctcatgct gagaattgca catgggagct 3060 tgctgagcta cctttaggga agcatgatat cagttgtaag tgggttttta aactgaaaac 3120 aaatacagat cgctctatac aatacaaggc gcgccttgtt atccatggat ttgaacaagt 3180 tcccagtatc gactttcagg aaacttttgc tcctgttgca aagttcgtta ccattcgtgt 3240 actgttagca ctggctactc agtacgattg ggagattgag caaatggaca tgaaaactgc 3300 ctttctacat cccgcactaa aagaggaagt attcatgact attcctgaag ggtaccttga 3360 ctactcggat atgctgacgt tagtgggaga atacccagta ctgcgtctcc gaaaagcact 3420 ctatggctta aaacaagctc cgcgggcatg gtatgataat atcaaccact ttatctctag 3480 tataggctta ctgcagtcaa gtgaagacca tagcctttat ttctctgatt aaattattgt 3540 cattctctat gttgatgatc tgtcgctctt tgctaaagtc atgcaagtta tagaaatgat 3600 gaaagataaa ctttctacaa cttatcatat gacagattta agtcctatta cacagttttt 3660 aggcttgcag gttactcgaa atcgtcctct caagattatt aaccttcatc aattctccta 3720 tacacagtca atccccaaac gctttcagat gtcagactgt aaaggaatat cgactccaat 3780 ggaaccgaac ttcaatcttc cctcctgcct tgacgattcc gatatacata accagtctag 3840 ctatcagtca aagattggtg gtatcatgta tgctatgctt ggatatcgac cagaattggc 3900 atatacgata tcctccctca gcaaacataa 
tgcacgacca acctctctac accacaatgc 3960 acttcaatgt gtatttcgat atttacaaaa gtccaaggca attggtatcc ggtatcaagg 4020 aacggaaaac aatgttgaat cctttccaaa attgatctcc catacagact cggactgggc 4080 cggcgacaag gatgatcgca agtctactgg tggctatgtc attatgcttt gtcaaggcgc 4140 aatatcttgg aagactcgta aacaagatgt ggtggcaatg tctagtacag aggcagagta 4200 catagcacta actgaggctg caaaagaagt ggtatggtta cgccgcctac taattgaact 4260 agaatctcga cacattgcct atgtcatacc gaatattaca gcagagcatt tccatggtct 4320 tcataagcag tgggagtctc ttgacactgc cggaaaaact cctgaacaat ttcctcgcaa 4380 ttcctggatt tctacgatat cacaaacaat ctacgctgat aaccaaggag cgatgaaact 4440 cgctgacaat ccgcaattgc attctcgcac caagcatata gatatccgat accactttat 4500 tcgaaccttg cttgagcgag atgaggtcac tatttcttat ataccaacgg cagaaatgac 4560 cgccgatatg cttaccgagg cacttacgag agagaaacat gaaaggcata cgaagagcat 4620 gggtatgata gatcttgcag acgagataga aggaaaggaa tttctttagc atgtataccc 4680 aatactcttt ttccctgata cgatatttta tttttcctat ccaggttttt ctatcgatca 4740 ggtttttaat gagattgtgg tcttgattct tgtctttatt tcattctttt attcttatac 4800 cgtttttttt ttaaaaaaca tctctactac ggggaagtgg gag 4843 // ID TY2_LTR repbase; DNA; FNG; 332 BP. XX AC U20162; XX DT 22-AUG-2005 (Rel. 10.08, Created) DT 22-AUG-2005 (Rel. 10.08, Last updated, Version 1) XX DE TY2 LTR-retrotransposon from yeast (LTR). XX KW LTR Retrotransposon; Transposable Element; TY2_LTR. XX OS Saccharomyces cerevisiae OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. XX RN [1] RP 1-332 RA Dujon B., Albermann K., Aldea M., Alexandraki D., Ansorge W., RA Arino J., Benes V., Bohn C., Bolotin-Fukuhara M. et al.; RT "The nucleotide sequence of Saccharomyces cerevisiae chromosome RT XV."; RL Nature 387(6632 Suppl), 98-102 (1997). XX DR Genbank; U20162; Positions 4392 4723. 
XX SQ Sequence 332 BP; 128 A; 48 C; 48 G; 108 T; 0 other; tgttggaata aaaatcaact atcatctact aactagtatt tacgttacta gtatattatc 60 atatacggtg ttagaagatg acgcaaatga tgagaaatag tcatctaaat tagtggaagc 120 tgaaacgcaa ggattgataa tgtaatagga tcaatgaata ttaacatata aaatgatgat 180 aataatattt atagaattgt gtagaattgc agattccctt ttatggattc ctaaatcctc 240 gaggagaact tctagtatat ctacatacct aatattattg ccttattaaa aatggaatcc 300 caacaattac atcaaaatcc acattctctt ca 332 // ID Copia-27_MLP-LTR repbase; DNA; FNG; 743 BP. XX AC AECX01002580; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-27_MLP_; KW Copia-27_MLP-I; Copia-27_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-743 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002580; Positions 6508 5766. 
XX SQ Sequence 743 BP; 169 A; 130 C; 119 G; 325 T; 0 other; tgttggattt aattttcttc aagtgataag tgtacggtca agaacttcaa gttactgtgt 60 gaaagtcagt atacaaatta gttgttcaat tatcacatat caaatttagt attacaacgt 120 gtgtgtcttt attttatcga ggtcgaacaa gatagttaaa gatcatagta gtgtacattc 180 tttgttcaat tgttctgttt tcgtgctctg ttgtttctcg gaagctgatt agattagttc 240 tcattgttca attaactaca ccttgatttt ctcgttgcac cttgtttttc ttcttataac 300 cttgtccctc tttcctttag aatctttttc cttatcgaag aaattgattc taaaggtgag 360 gttttctttt cttttctttt tcctatcaat acactgatta aaattctcta tcatacaaat 420 attgttcttt ctcttttgat tctacatgtt agtctttatt atcttcttcg ttcaggtctg 480 ttgagtcttt ccagttgtcc atcttagtct gtcagccgtg tctcgtcaac acgtagctga 540 agtatttctg ttacctttta aaaggtttgt tattgtgttt gtgattgttt catcttatcg 600 aagaaattga ttctaaagtc tttattatct tctttgttca ggtctgttga gtctttccag 660 tcgtccatct tagtctgtca gccgtgtctc gtcaacacgt agctgaagta tttctgttac 720 cttttaaaag gtatttgtct aca 743 // ID PIF_Harbinger-1_AllMac repbase; DNA; FNG; 3715 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW PIF_Harbinger-1_AllMac. XX OS Allomyces macrogynus OC Eukaryota; Fungi; Blastocladiomycota; Blastocladiomycetes; OC Blastocladiales; Blastocladiaceae; Allomyces. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). 
XX DR [1] (Consensus) XX SQ Sequence 3715 BP; 793 A; 1054 C; 1028 G; 840 T; 0 other; agaccacttt cgtttacccc acctgggcct gttattgtat ccacgtgatc cgttgtcacc 60 atctgtccac agcacaagcc gcatcgcgtc catcccatct ccctcgaaac actctccaat 120 caccagaatg ccgagtcggt gctttgttga tggctgccgc cgcactgtcg gcaaggaatc 180 tggcgtcagc cgctttgaag tggaccccaa tcacgagcgt atcctgacct accccacagc 240 tatgagcact acattcgtga ccccctgact ggtttgcacg tgtgtcaagc atgcacgact 300 gctgccgttg cggaacctgc gtcgcgtgat cggccatatg gccgcgcggt tgtgtctgtg 360 acgcgcttgc aatccatgag cccattgcca tcacaacctg cgcctgcaca aaatgtccga 420 attcttcgaa tccccatgtg ccgatttgta ttcagcgact aactccgacc gtttagctgc 480 ccacgccgtc cacgacacgg gcagcgcgtg tgagacgtga accagttgca gcacctgcat 540 caccattacc agctgcatgt tgcagccatg ggcgtggcgg cctgtccaat gtcattacac 600 agcaaaactc agcacttttt gccgaggtgg gccagtgcaa gaaagtgatt gatgcccaag 660 ctgctgagat tttggcaatg tgcacgacaa ttgaccgctg tgagctcgag atttgcagtc 720 tcaaacaggt catcgagatt cagaagcagc agcaactcgg tgccctgtca tgggagcatg 780 tcaaggtcga tgatatggcc gtccagctct acacggcgtt cctgtgtgcc acggtctttg 840 attgcttttt tgaggccctt gggcctgcta ccttgtacct caacctctgc ctgccatgcc 900 agaaacgtgc aagtgcgtct gccacacctg gtgcaggggc tcaacccaca tcgacgaatc 960 tggagcctga cgcaatgtgc gcactgacca acttggaaac tggtgtgttg cacacactga 1020 ccaacttgga acctggcgtg atgcgcgcac cgcacggatc cggtacacaa gaggatgtag 1080 atgatgtgtc aacggacgat gacgccatga tggaccacaa ggatgggctt gtgctcgaca 1140 acaaggatga agagagtgat gaggaccatg gtgagggtgg ggcggctggt agtgtgcttg 1200 acggcgatga aactgaagtc gaatcaattg atggtgctga aatcgaggat gagcaggttg 1260 gcagcaatca acaggtgaga cccaccccgg ccccttgctg ctgttgcggc tgccgccaaa 1320 aactcaagcc caaggaccag atgctgctca ccttgatgcg gctcaagctt ggtctcgagc 1380 tcagggacct cacattccgc tttggcatct ctcctgcgac cgtgtcgcat gtcttcagga 1440 cttgggtcaa gttcttggcc aagcagctga gcgtgattga tcggtggctg tccaggtccg 1500 cgattgaagg cgcaatgcca gccggattcc gcgagcttta cccgtcgacc cgcatcatca 1560 ttgatgtgac caagctcaag atccagaagc tgttgagcct caccctgcaa gtgaccacct 1620 acccgtgcta 
caagtcatca aacacggcca agttcttgat tgccatcgtg ccaaatggca 1680 tgattaccta tgtctcgaaa gggtacccag gttgcatttc agactaccag atcatggccc 1740 tttgcaaaga cctggtggag aagcttgaca gtggcgacat ggtcatggct gactgtggct 1800 tcaaggtcaa ggagatactc cacctggtcg gcacgcagct gaacgtgccg ccaggcacca 1860 agaaggatgc gcagatgtcg tgtgtgcagc tcaccaagac tagggcagtc gcatcgcttt 1920 gcattcatgt agaacgtgca attggttgtg tccgtgagtt tggtattttg caacgcgttt 1980 tcctgctcac tttggtccca cacatctccg atatcttcaa tgtttgctgc cttctcacga 2040 atttccgaac tgcccccatc actgacacgt gaatataatt ggatcaccac ccatggttcg 2100 tcagaaatac aacccacccg ttgtcaccca caatctcaat cggcgtcacc aaagcaattg 2160 atctacgaca ggaaatacgg cccggccgtc gtcttgcagc ccttgggatc cagcatctcg 2220 atcaactgga gcaccaatgc gattgcctca cgcgatggcg cctgctgaag acaggggggc 2280 cgcgggtccc agtcaggatc aatacgcacc catccgcgcc gctggcgccc atcgtgctgc 2340 ttgccaagcc tctggaacgg catctcgtca atcggcttgg cgaggtcctt ggccgaatat 2400 ggcgtgcacg tggacggacg accccaggcc tgagcgattg atgttgagca aaaaacttcc 2460 tcccccatgt cctcatacgc cttacgcgcc tgcaccaacc accacgcaag gccgcatgcg 2520 tgggagcaag tgcccatgat gccagcttgg caggtgcagt aaccgatcag gatctggatc 2580 gggttcacag tgaacttcaa ataggccaca tacgagtccg acagcatcga cgcgcccacg 2640 ttggccttga tcaggaactg cccatcctca ctgagcactg tcatgtggac agtcccttcc 2700 caaatgcggc catcgactga gttggcatac ccgcgcttga atgcgcgtga cccgctgcgt 2760 gccatgttgt tggagatggt cagcgattca gaggacgcgt ggttgaggag gtgctcaatg 2820 atgtcattgc ctgacatgcc ctcaagcagc aacacaatgg cgagcaaatc agcgccctgc 2880 ccaacgtcct tgtctgtgtt cttgcactat tttgcagccg tcagtccagc cgtcagtacg 2940 cgcatttttg tgcaacaact gtcagcatca gtacacacca gttcatagaa cgcgcggatc 3000 tcgggcggtc ccttcttcac agctttgggc accttaacaa cgccatcagc ctcagtctct 3060 tcatccgacc cggaatctga gtcggccgat tccactgcat tggcgtgcgg gtagttcaag 3120 tgcgcccata tgcgattgac aaggtcggcc ttcttgctgg acgtgggcca ctttgtcgcg 3180 cgcaggtacg ctttgaggtc gttgacttga tacttcttcg cgaatgcctc cttgtccggc 3240 gtgctgtggt ccaggacgac cggccgggga tctttttgac gcgtgctgcg cgcctggtct 3300 ggcacgtcga catctatgtg 
cgtgttctct tggtttgtgt tggggtccat gctgggtgtt 3360 catggagggc caatgatggt cgtcttgcag tccgagtcgc gagcctgact gtggggggct 3420 cctggctgtt tcataatgaa gctgttgctg tggatcaagt acagaggcat gtttattggt 3480 gacaacctga ccccgccaca gctgtggtgc atggaggggg cgcagatctt gaggcataac 3540 gtgccaactg tcgcgaaatg tgattggcgg gggcactgac cagtggaagc tggatgggga 3600 ccacacgatg cgagctgagc gggacgtgcg agacgtggtg ttgaagaggg aggaggcagg 3660 caacggacca cgtggataca ataacagatc caggtggggt aaacgaaagt ggtct 3715 // ID copia-2-I_AN repbase; DNA; FNG; 4788 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE Internal portion of copia-2_AN LTR retrotransposon - a consensus DE sequence. XX KW Copia; LTR Retrotransposon; Transposable Element; KW COPIA superfamily; copia-2-I_AN; copia-2-LTR_AN; KW internal portion. XX NM copia-2-I_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-4788 RA Kapitonov V.V. and Jurka J.; RT "copia-2_AN, a family of copia LTR retrotransposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(11), 199-199 (2003). XX DR [1] (Consensus) XX CC The 1412-aa Copia2_AN-Ip polyprotein is encoded by ORF1 CC (pos. 187-4422). 
XX FH Key Location/Qualifiers FT CDS 187..4455 FT /product="copia2_AN-Ip" FT /translation="MPPKASRGTKSNDVPFQMTTRHRTYEPETPVSADNQI FT SNDSTDEDQPDEPTTMEDMRAGLLRQLRDELREELREEMAQQLRSEIRHEI FT RAEQQHQSPTQQLHQEINQNASYGRNTDIQQRDLIYCEKQDIRKLAKSKFG FT TQASILLNGRSNYTAWRDSMLMDTYMIEAKDILDETKPPDGSNEIDIARWE FT TKNEILHTRILQSTARHVRQTISWKGSTLASELWARITSTYGLSMAEERLM FT TVKALLDINPQGNYPAMVRDFQRIAAKIKEMKLSLDDVIHDIFICSLGQWQ FT QNFVRTKLDEFYSCGRGPIKNLDIDTFADQLVARSSSSNNKYIPQEPQEFK FT LEPRYRIILTEPKDSSRTKRDGQDQKPARTKTLCQACGKGYHKPDDCWTLH FT PEKAPKRHGNQASGTSNDNKNQQPTGQELVLRNHPQANSIAIVPAKEPEND FT EEPQWLLDTAAAFHISNKYHVFINLRGHKAYINDAGGRTHQIIGIGTALVY FT GVEIPDVRYAPTTTADLLSFSQLDDQDFDVSTQGTVNKKHFYITSPTGASL FT DAFKEQNTCLYQIKPVAYAIQPILHAKDTQKETNNIIPTATMEEWHQHLSH FT VHLQAILKMAQQKIIKIKGPKTLAFCDICRQAKERRKSTKEPASRATKILV FT RIYIDIAGGGAILDCKDKQAPPGIRNIRYFLLITDDATQYQWVYTLRTRDE FT AIPTFQGWLEHIKNQGYSPPAFVRSDREFLTEHVKKLCQTYGLIWEPTAAD FT SPWQDGVSERGIQTVLQYTRAMLYDSGLPRWLWPQALQTAVYHMNRLPTRV FT PLYNDRRPMAPTSDPEIQPCAHFTPYSAWTNGDTDIKHLVKFGSPAWMHLH FT GASKYAGKPTSKIDPKAKKVHVVGYQGRHIYVIWDPEINQLRDTSDISIKE FT EFNPPQSKPYEAAKAPEAAKALETVKPGETAEAAEPKTNEDLLIQDISDIT FT SREELYRPAKGFAIQKSVESPLPEPKSYNEAIKGPESAQWHAAMQEEINTL FT KQKQCWDLIRKSDMPAGARAIPGRWVYKKKLNPDNSIRYKAQWVIRGNLLD FT KSEFEGATYAPVVDPITSRILFGVSAQKGWHIIQADAVLAFLNAKLKGQPI FT YMHQPLGFAEGEPGTLVCLLRQSLYGLTPSARLWYDDLRAYLESIGFKVSP FT HDPGLFVHVTEKLYITTHVDDFMIVGEKAQNAVQALESLKSRFEIKEAPEF FT KRYLGMNIKTTPTGIHLSQEDQIDDIINSFRLHNAHPTKSPLDPGTVIDDA FT PDPKINIKEYQHGTGSLQYLATKTRPDISRAACFLAEFNTAPTAKCWAALI FT HIIKYLKGTRSLGIRYQHDPAANIKPPKAFSDSDWGGPHTKARRSVGRYVF FT KLAGGPIAWQSKRQTCVATSSNEAEYIAASEASREAYWIYSPQTSKDRQKF FT IKIQRLAKL" XX SQ Sequence 4788 BP; 1531 A; 1205 C; 993 G; 1059 T; 0 other; ggtcatgagc cctctacgct gaaggaatac gatttgtttg ctagatatag atacgcagaa 60 tcgattcttg acgctctgag caacttgcta tacaaaattt tagacacccc gcacctgtac 120 gagggggggt agcaacagtt ttggggatta ccccagaatc aggatcataa ggcacctttc 180 gctaggatgc cgccaaaagc ctcacgaggc accaagtcaa acgacgtacc gtttcaaatg 240 acaacaaggc atcgtacgta 
cgagccggaa acaccagtat ctgctgacaa ccagatatcg 300 aacgattcga cggacgaaga ccaaccggac gaaccaacca caatggaaga tatgcgagca 360 ggattactcc gtcaactacg agatgaacta cgagaagaac tacgtgagga gatggctcag 420 caattgcgca gcgagatcag gcatgagata cgcgcagagc aacaacatca gtcccctacg 480 cagcaactac accaggaaat caatcagaat gcctcctatg gtcgaaatac agacattcaa 540 caacgagacc ttatatattg cgaaaaacaa gacattcgga aattggccaa atcaaaattt 600 ggtacccaag catcaatact gcttaatggt cgatcaaact atacagcatg gcgcgattct 660 atgcttatgg atacctatat gattgaagca aaggacatcc tcgatgaaac caagccaccc 720 gatggcagca atgaaatcga catcgcccgc tgggaaacga agaatgaaat tttgcataca 780 aggattctcc agtcaacggc aaggcacgta cgacaaacaa tcagttggaa aggctctaca 840 cttgcatccg agctatgggc tagaataaca tcaacatatg gcctatcaat ggccgaggag 900 cgccttatga ctgtcaaagc cctgcttgat atcaacccac aaggcaatta cccagcgatg 960 gtacgggatt ttcaaagaat agctgcaaag attaaagaaa tgaaactatc cctggatgat 1020 gttatccatg acatttttat ctgttctcta ggccaatggc agcagaactt cgtacgtaca 1080 aaactagacg agttctattc ctgcggccga ggaccaatca aaaacctaga tattgacacc 1140 tttgcggatc aattggttgc tcgatcatca tcttccaaca acaaatatat cccccaggaa 1200 ccacaagaat tcaaactcga gcccagatat cggataatcc ttacagaacc aaaggactct 1260 tcccggacga agcgcgacgg tcaagaccaa aagccggcac gtacgaaaac cctctgtcaa 1320 gcttgtggca aaggatatca taagcccgat gattgttgga cattgcatcc cgaaaaggcg 1380 cctaaacgcc atggaaacca agcatctgga acctccaatg ataataagaa ccaacaacca 1440 acaggacagg agcttgttct acggaaccat ccgcaagcca actcgattgc aatcgtacca 1500 gccaaagagc cggaaaatga tgaggagccc caatggctct tggatactgc tgcagccttc 1560 catatatcca ataaatatca tgttttcatc aatctccgag gccacaaagc atatataaat 1620 gacgccggtg gtcgtacgca tcagattatt ggaatcggaa ccgcattagt ttatggggta 1680 gagattccag acgtccggta tgcgccaaca acaacagcag acctactgtc attcagccaa 1740 ttggatgacc aggattttga tgtatccacg caaggcactg tcaacaagaa gcacttctat 1800 attacatcac ccacaggagc ttctcttgat gccttcaaag agcaaaatac atgcctatat 1860 caaatcaaac cagttgcata cgcaatacag ccaatcctac acgcaaagga cacccaaaaa 1920 gaaacaaata acataatacc tacagcaact atggaggaat 
ggcaccagca cctatcccat 1980 gtccatttac aagccatatt aaagatggca caacagaaaa tcatcaaaat caaagggcca 2040 aaaaccttgg ctttctgcga catctgtcga caggctaagg agaggagaaa gagcaccaag 2100 gagccagcct cacgcgccac aaagatcctg gtgcgaatct atattgatat tgcaggaggg 2160 ggagcaatat tggactgcaa ggataagcaa gcccctcccg gcatcagaaa tattcgatat 2220 ttcttgttga ttactgatga tgcaacccaa tatcaatggg tttataccct tcgaaccaga 2280 gatgaggcta ttcccacctt ccagggatgg cttgagcata tcaaaaacca aggatacagc 2340 ccaccagctt tcgtacgaag tgatcgcgaa tttctaaccg aacacgtcaa gaagctctgc 2400 caaacctatg gcctaatttg ggagccaacc gctgcagact ccccatggca agatggcgtc 2460 agtgagcgcg gaatacagac ggttttacaa tatacaaggg caatgttata tgactccgga 2520 ttaccacgat ggctatggcc acaggctcta caaacagctg tctatcacat gaaccggtta 2580 cctacaagag ttcccttgta caatgatcga cggcctatgg caccaaccag cgatccggaa 2640 atccagccat gtgcccattt tacgccctat tccgcctgga ccaatggtga caccgatatc 2700 aagcatcttg tcaaatttgg gtcacctgct tggatgcacc tacatggagc ttcaaaatat 2760 gctggcaagc caaccagtaa gattgaccca aaggccaaga aggtccatgt tgttgggtat 2820 caaggccgcc atatctatgt tatttgggac cctgaaatca accaactacg cgataccagt 2880 gatatctcca tcaaggagga gtttaacccg ccacagagca agccgtacga agctgccaaa 2940 gctcccgaag ctgctaaagc tctcgaaact gtcaaacctg gcgaaactgc cgaagctgca 3000 gaaccaaaaa ccaatgaaga ccttcttata caggatatat cagatattac ctccagagaa 3060 gagctatacc gacctgccaa aggatttgct atccaaaaat ctgtcgaatc acctctacct 3120 gagcccaaat cgtacaatga ggctatcaaa ggccctgaat cagctcaatg gcatgcagca 3180 atgcaagaag agattaatac cttgaaacag aaacaatgtt gggatttgat ccgaaaatct 3240 gacatgcctg ctggcgcacg cgccatacca ggaagatggg tttataaaaa gaagcttaat 3300 cctgacaact ccatccgtta caaagcacaa tgggtaatca gggggaatct tcttgataaa 3360 tcagaatttg aaggagcaac gtacgcccca gttgttgatc ctattacttc acgaatcttg 3420 tttggtgtca gcgcccaaaa aggctggcat attattcagg cagatgcagt tctagcattt 3480 ctaaatgcaa agttaaaagg ccaaccgatt tatatgcatc aaccacttgg attcgctgaa 3540 ggggagccag ggaccctagt ttgcctgctc cgccaatcac tatatggcct aaccccgtcc 3600 gcgcgcctat ggtacgatga tctacgagca tatcttgaat ctataggatt 
taaggtttcc 3660 ccgcatgacc caggcctatt cgtacatgtt acagaaaagc tctatatcac cacccatgtt 3720 gatgacttca tgattgttgg tgaaaaggcc caaaacgccg tacaagcgct cgaaagcttg 3780 aaatcccgat ttgaaattaa agaagcgccg gaattcaagc gatatctagg catgaatatc 3840 aaaacaacgc ctacaggcat ccacctgtca caggaggatc aaattgatga cattatcaac 3900 tcttttaggc ttcataatgc ccatcctacc aaatcacccc ttgatcctgg aacagttatc 3960 gatgatgctc cagatccaaa aatcaatatc aaagaatacc agcacggtac tggcagcttg 4020 cagtatctag ctacaaaaac taggccagat atcagccgag ccgcctgctt tcttgctgaa 4080 tttaatacag cacctacagc caaatgctgg gcagctctta tacatattat caaataccta 4140 aaaggcaccc gcagtctagg aatcagatat caacatgatc ctgcagccaa tatcaagccc 4200 cctaaagcct ttagtgattc tgattggggt ggacctcata caaaagcacg tcggtcagtt 4260 ggcagatatg ttttcaaact tgccggagga ccaattgctt ggcaatcaaa gcgccaaacc 4320 tgcgtagcaa ccagctccaa tgaagctgaa tatattgctg catccgaagc ctcgcgcgaa 4380 gcctattgga tatactctcc tcaaacttca aaggacagac aaaagttcat caaaatccag 4440 agactcgcga aattatgaag gatcttcgat tatttgatga ccagcatgca cctggtatcc 4500 tattatatat ggacaacaaa ggagctattg atcttacaat gtccaacata caaaccaaaa 4560 gatcaaagca tattgacatc cgctaccatt acacccgtga tatggtcgac caaggcatca 4620 tccatatcaa gcagatccct actgccgaaa tggttgcaga tggctgtacg aagcctctgg 4680 gatctgaagc tcactcccat ttcattcgtt tattaggtct ccacaacgat gattgatatt 4740 ttcatgtaat taggcccgct ggtgatgaca atctcgctcg aggggggg 4788 // ID Copia-5_TMe-LTR repbase; DNA; FNG; 545 BP. XX AC CABJ01001229; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_TMe_; KW Copia-5_TMe-I; Copia-5_TMe-LTR. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-545 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). 
XX DR Genome; CABJ01001229; Positions 189262 188718. XX SQ Sequence 545 BP; 173 A; 84 C; 157 G; 131 T; 0 other; tgttagaata ttcgcgaagg ttccttgtcg ttaccacgac cccaaccacg gatcgcctac 60 gacaatggat gttttgctaa gagaagttat gtttgggaaa gtgttacgga agatgatagt 120 tgaagcaatg attgtttgta ctagacgcta gttgcagcgg agatgtttgc aggtttgaga 180 aatcacacta gggggtgttc aatgtttgga actaagaggt ggcacaagga gaatgtcaca 240 ctacaacgat tgggaaacaa acgatggttc agctcaggtt aaggattgca ggcgaatgaa 300 tgagaagcgt gtaagaggat gagacgtgcg aaacggattg agggatgata tgtcttgatg 360 gatgttcggg agtgtggtgt ggaaggaaaa agagcatgag aaaaagggaa gaaaaagagt 420 ctcacgaaga ggaggaagac tttgttccat agttagaaat cgtgtatata tagtacgctg 480 ctggcagcac agaaaagaat aaggaccgca agacattctc tcctctaccc atcatagcct 540 attca 545 // ID Gypsy-5_LENY-LTR repbase; DNA; FNG; 1758 BP. XX AC AAPO01000113; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Lodderomyces elongisporus genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_LENY_; KW Gypsy-5_LENY-I; Gypsy-5_LENY-LTR. XX OS Lodderomyces elongisporus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Lodderomyces. XX RN [1] RP 1-1758 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lodderomyces elongisporus RT genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; AAPO01000113; Positions 40371 42128. 
XX SQ Sequence 1758 BP; 591 A; 325 C; 348 G; 494 T; 0 other; tgtcacagtg cgacacacat tggacgctct gtgccctcgt ttccgctaag tgaatatttg 60 gagaatattg cacgacatag gaaacttgaa acttcaaaca agatgagttt atataaatag 120 gaacgagaag tggtaaaatt aacttatcac tattatatag aaatatcggt aaaatataga 180 ttttatacgg tcaactacat tgaacacaaa tcactaggtg aacgtgcaga gtgtatactc 240 tcacgattct ccttactagt gactttgtat gagacaattt actctcttaa cttatgagca 300 ttgagacctt tataggtgcg atcattgaca attacactta gttggaatat agcctatcaa 360 gtcacgaact caaaccgctg gtatcggtga gctaccagct caaaccgatg ctcagcagtc 420 gtacacttgt tacttgtgac agtgctatgc gacagaatgg cgaccagagt gaggacctca 480 aatccttgca aatcatgttg tttgaatgat ttgatatggt tacctaatat ttcaagtaat 540 tgatacattg gttgaggcta ctgttaagca gtattgtcga ccaggatatt gaatgatcat 600 gacatttcaa taccttggca tttcaacatt tggtcaatat catttatgag aaggacctgt 660 ggaagagtaa atctcctatc cacgaattaa tagcttcact attgtgaact acagtttaaa 720 ggaggagcta ctcctcattg gagaactatg cgagtttagt ccaatggaac ataatgcatc 780 gtagtcatcg ggacgtatcc gagactacac cagaaaatat caaatagcgt atactgtcaa 840 cactttatga tggaatattc acccatcagt tgacaaaggg cgaaggttgt agacgttaga 900 cagagtgcaa ctatagaccg aatagagtcg tgtctgctga cttggctata gtttgaaggc 960 tgtacgtgtg atagacatgg cgttgaaacg agtaccgaag attgcacggg tataggtggc 1020 taccctcaat tggtgcaaag atcgaggaaa gtatttttag gataatgacc acatccctaa 1080 tttgggttta tatcattttt ttaaaaaaga cttattacat tctagaaggt aaataagcac 1140 cgttaaagca gcgtttaaga aatcatcatt actgtctacg gactcatctg agtatgatgg 1200 tagcagacta aacaaactac acgggttatc ggtgaggtag aattacacca gataaaaacg 1260 gcggagatgg taactaccga gggaaagact aaattgcatc tcgcaagaaa gactatatag 1320 catcttgaga gaaagacaga attgcatccc gaaagaagca accaattatt attatgaaac 1380 tacagtgttt caataggtcg ttcaatcatc actttattag agatactgac ttagcgcctg 1440 ctatatagga ggagtcaaat ccgaggcact catttattgt cgacaaaatt tttaagatga 1500 ggccaggata cacgactatg cgctggtgaa aaattttcac cctactcaac aaaaattgtt 1560 gtccacgacc tcttactcgc gtgacgagaa ctataagtat tcccgagatt gattgaacga 1620 aacccttgat aatttggaac aaagggttta 
tctaccataa aaacatttca ccaaaaagtt 1680 taacaccaag tatatcacat attacattca agcaaaatat tattccaaaa acttgaaaaa 1740 cttctactac aaatgtca 1758 // ID Gypsy-1_MVPL-LTR repbase; DNA; FNG; 234 BP. XX AC AEIJ01000645; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Microbotryum violaceum genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_MVPL_; KW Gypsy-1_MVPL-I; Gypsy-1_MVPL-LTR. XX OS Microbotryum violaceum OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Microbotryales; Microbotryaceae; OC Microbotryum. XX RN [1] RP 1-234 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Microbotryum violaceum genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AEIJ01000645; Positions 9557 9790. XX SQ Sequence 234 BP; 53 A; 66 C; 59 G; 56 T; 0 other; tgtaaggcgc gcgccttact acgggccaca gggattagaa atagagatcg cgcgcaccta 60 gcttcttcag ttccctcccc attgggctga caccagaagt taggtgagta ctggtgttcc 120 atatctcagc actctcgtac atcgggctga caccagaagc tagttgcaat ccatataact 180 tgcacctgtc agctaggtcg ccagtgaggc tcgagagcga tctcgtgcct ttca 234 // ID Copia-59_MLP-LTR repbase; DNA; FNG; 775 BP. XX AC AECX01000446; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-59_MLP_; KW Copia-59_MLP-I; Copia-59_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-775 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000446; Positions 40052 40826. 
XX SQ Sequence 775 BP; 190 A; 131 C; 143 G; 311 T; 0 other; tgtgataact taccatcttg attctatgtg ctctttcggg tagattggtc aaggccttct 60 ggtgttactt agacgactta attgatgaga agaatgttac gaaaagaaga tatgaaattt 120 cgaattgagt attaaatttg tgaagagagt tacgaatgag gaaaagaatt gttatgagag 180 actaggtgtg ctgaatgtgg ttaaatgatg tgtgcattgg gaataagtag agtgtgactt 240 gagatttctt ctttatttga ttatttgtta attgttgcgc ttttccgtgt ttctacgtat 300 aaatagagat gttttacttc actcagattc ttttcctcac tgaaaagaat ctgtaagtgt 360 tttctactat acaaatccta tcaacacaac tctaacattt ctaacatact ttatacgtct 420 ttagatttta attttactta taaacttctc ttcgcgtcac tttcacgtat cctgttgcct 480 gcgcacaggt ttgttatttg atttcatttc attcttgctc ttttatcttt ttcttaacta 540 actgttgtgt cgcgattagg tctgtctttt agcttctcaa agctcttttg tgtggcttgt 600 tgccagtact taccttgacc tgattacccc aggtatgatt agtattcaat gactgaaaag 660 aatctatttt aattttactt ataaacttct ctttgcgtca ctttcacgta tcctgttgcc 720 tgcgcacagg tctgtctttt agcttctcga agctgttttg tgtggcttgt tgcca 775 // ID Tad1-1_EP repbase; DNA; FNG; 5856 BP. XX AC CACN01001643; XX DT 19-APR-2011 (Rel. 16.04, Created) DT 19-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE Tad1 Non-LTR retrotransposon. XX KW Tad1; Non-LTR Retrotransposon; Transposable Element; Tad1-1_EP. XX OS Erysiphe pisi OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Erysiphales; Erysiphaceae; Erysiphe. XX RN [1] RP 1-5856 RA Jurka J.; RT "Non-LTR retrotransposons from barley the Erysiphe pisi genome."; RL Direct Submission to Repbase Update (22-MAR-2011). XX DR EMBL/GenBank/DDBJ; CACN01001643; Positions 5856 1. 
XX FH Key Location/Qualifiers FT CDS 176..1366 FT /product="Tad1-1_EP_1p" FT /translation="MAQVAASKVLNASKPEEWTNVSRKQYRNHGAHKSFPS FT PIKSVEEPKRRIIFTRQKGISSNPATVCQDILHAINMKLLGMKAPSHLRLT FT KLRYNERGNLTGLTSSQTTAEAMVTLFREELTQTALRFDPHIQDVAANQQW FT ISLKAHGVELGRYYPAGGLNQIKEEVAAGPSALELPFTPRWVSSPERLTEM FT ARNGVKRHSTIRFTVRTLAEADRVMKQGLHFGGRFHKVERFIPIGPDTICS FT TCCHWGHTTYGCPTPDKVRCAICAEAHLTEHHKCPISMCKTGTGKFCSKHG FT TYKCANCGGSHTARSPSCPDQRQAIAIARSGREEWREREKEHDIRRSKLEK FT EEDNISEYSENSEMDIEEVTQGQTQKDLETERNVDPFESSQDDQEQTSSLN FT SCD" FT CDS 1360..5427 FT /product="Tad1-1_EP_2p" FT /translation="MRLIQLNCQHNYAVCQATFQVGVEIEADFICLQEPYV FT GIRGMSHPAYDFVMGNAGETRQQRVAFGIKKDIRANVIVETRSDLVDHPYI FT QIIDIWELDLCGNKSRKTRIINVYDNWVGDGYQWKGDKDEKRRAIEDISWD FT KVIEKRTLLVGDFNAHSPYWNSLCRRRIRADQLESIIDRHKLLVNNEMTTP FT TRPKQTSGCSIIDLTMSTPDIGYLPAWTIDPEYATPSDHELITFDIENLNN FT HAKQTQTYSEVTGWALKEITTEQEKEAKREWGIRTENRSIVDDNSSITDLD FT AEAQWITDTLTKIFDQHFKQLRVCARSKRWWSNDISESRSEFKAARRRFQR FT GQTSLEEYKKLRNAYYRKIRKARDKCFEDWIQGGEEIYEPLPADSSPESSQ FT TFRKRREEENERCWTALRYTKEWTTQLTPSLKSENGQVASTIEEKEKMILE FT TLFPCPPDDYQEVLIDSAGNSHLGVTMEVVHRAILDQSVRKAPGPDRLNFR FT AIRLLWDWDSERITALIRQCIRQGHHPHVWRIAKGILLRKPNKADYSQVKS FT YRIISLLNCLGKVAEKVAADIIATWCEKRDVLHQGQMGCRKQRSCIDAVAR FT VVAGVEEAWNRGNIAALLLMDVKGAFDHVSCNSLLRRMHKMGADGQMIRWV FT ESFLTDRRMQFVIDGKCRKEVQIRTGVPQGSPVSPILFTIYLCGVFDSVEQ FT GVDGCTATSFADDCGFVVEAATVPDLIGKIQLAGEKASDWGSENFLQFDQS FT KTEAVAFTRRRKGKTELFNATIKVKNHSFKFNKEATRWLGIWLDTTLSFQA FT HKNVYLQKARKAEGRLRSITFSKGLAPGLVRKIQIAVVQSVALYGAELWWR FT DQKTWEQEHQKLINRQARAITGVFKSTPIGITVKEAGLRPAISLLNNRQRR FT YTQRLLGLPISNDTRKILPETLRDGDAHAQPGEQDATGWDWLSDCKAKQLS FT HRLANSLVKGTDFDSTFGIECTEIINECQFPGKISVLARSEEALKMANEHQ FT DSLGELSFWTDGSKLENQRVGSGVAWQSENGKWNTRKIYLGTNKEVFDAEL FT YGVDQALEIAQKAGRPLARTSSQSTIQSLSKLQTVFIWLDSQAAISRIRHV FT EPSPGQWLVRRIHQRVRELREQGIGVQINWVPGHSGVEGNERADIAAKQAA FT LRGRQCQERFASLSHITRLVTERKWKECQIWFQLKHRSRSQAVKDTYNMQI FT GKRGINKVASHSWKILAARYFQLKSGHAWTGSFLGRIKNRETNKCTECSEA FT PPQTVRHLMLDCRRWRRERDEMWKQIEISGSQIRPRRTKIKTLFGDEQATA FT 
AILQFLKNTTVGKRNMNSTAEEWRETLGIEDLDADEELGESENE" XX SQ Sequence 5856 BP; 2015 A; 1129 C; 1398 G; 1314 T; 0 other; cttgagacag aatggcaatt ccacacagga tcctaaaaaa aatccaccag ttacatcgca 60 gcaaacaaaa gaagcggaaa ttggttatga cctaccaacg cgcccaccag cgtctagaac 120 tgtaaaggat gtccctaaat tgacgcagcc tgcggtgccg aataaaattc ccacgatggc 180 tcaggtcgca gcatcaaagg tactaaatgc atcgaagcct gaagaatgga cgaatgtctc 240 ccggaaacaa tataggaatc atggcgcaca taaatcgttt ccgagcccaa tcaagagtgt 300 agaagagcca aaacgacgga ttatatttac cagacaaaaa ggcatctcta gcaatcctgc 360 aactgtatgc caagacatct tacatgcaat caacatgaaa ttattaggaa tgaaagcacc 420 ttcgcacctt cgtctcacaa aactaaggta taatgaaagg ggaaatctaa cgggactcac 480 atccagccaa acaacggctg aagccatggt aacactcttt agagaagagc taacacaaac 540 agcactgagg tttgacccac acatccagga cgtggcagct aatcaacaat ggattagtct 600 gaaggctcat ggagtagagc taggacgcta ttatccagct ggaggactaa atcaaattaa 660 agaagaagtg gctgctggtc catcagcact cgagctgcct tttacaccaa ggtgggttag 720 ttcacctgaa cgattgactg aaatggctag aaatggtgtg aaaagacact ctactattag 780 attcacagtc cgaacattag ctgaggcgga tcgagttatg aaacaaggat tgcactttgg 840 aggacgtttt cacaaagtag aaagatttat acctattgga ccagatacta tatgctccac 900 atgctgtcat tggggacaca ctacttatgg ctgccccacg ccagataaag ttcgctgcgc 960 aatttgtgcg gaggcgcact tgacagagca ccataagtgc ccaatatcaa tgtgtaagac 1020 aggaactggt aaattttgct caaaacacgg cacatacaaa tgcgccaact gtggagggag 1080 tcacacggca cgctctccta gttgtccaga tcaaagacaa gcaattgcta tagcgcgttc 1140 gggtagggaa gaatggagag agagagaaaa ggaacatgac atacgaagat ctaaattaga 1200 aaaggaagaa gataacatca gtgaatatag tgagaacagt gagatggata ttgaagaagt 1260 aactcaggga caaacacaga aagatttgga aactgaaaga aatgttgatc cattcgaatc 1320 aagtcaagat gatcaggaac aaactagcag cctcaactca tgcgattgat acaactaaat 1380 tgtcaacaca actacgccgt gtgtcaagcc actttccaag tgggagtcga aatagaagcc 1440 gattttatat gtttacaaga accatacgta ggaataagag gtatgtctca cccagcctat 1500 gatttcgtca tgggtaatgc tggcgagaca cgacaacaga gagtcgcttt tggcattaaa 1560 aaagacataa gagcaaatgt gatagtggag acacgatcag acctagttga 
ccacccctac 1620 atccagataa tagatatatg ggaattggac ttgtgtggca ataaaagtag gaagacaagg 1680 attataaacg tatatgataa ctgggttgga gatggatatc aatggaaagg agacaaggat 1740 gagaagagac gagcaattga ggacatttct tgggacaagg taattgaaaa acgcactcta 1800 ctggtaggcg acttcaacgc acatagtcca tattggaact cactatgccg aagacgaatc 1860 cgggcagacc agttagaaag tataattgac cggcataaac tgctagttaa caatgaaatg 1920 actacaccca ctagaccaaa gcagacatcg ggatgctcta tcatcgatct aacaatgtca 1980 acgcccgata taggatattt accagcgtgg acaattgatc cggaatacgc caccccctcc 2040 gaccacgaac tgataacttt tgacatagaa aatctgaata atcacgcaaa gcaaactcaa 2100 acgtattcag aagtcactgg atgggcatta aaagaaatca cgactgaaca agaaaaagaa 2160 gctaagagag agtggggtat tagaacagaa aataggtcta tagtagacga taacagcagt 2220 attacagatc tggatgcgga ggcccagtgg ataacagata cactcacaaa gatattcgat 2280 caacatttca agcaactaag agtatgtgcc agatcaaaaa gatggtggtc caatgatatc 2340 tccgaatcta gatctgagtt caaggcagct agacgtagat ttcagcgcgg acagacttct 2400 ctagaagaat acaaaaaatt aaggaacgca tattatcgaa aaatcagaaa agcccgagac 2460 aagtgctttg aggattggat tcaaggtgga gaagaaatat atgaacccct tccggctgat 2520 tcaagtccag agtcttcgca aaccttccga aagaggcgag aagaggagaa tgagagatgt 2580 tggacggccc tacggtacac aaaagagtgg accacacaac ttactccttc tttaaagagt 2640 gaaaatggcc aagtagcatc cacaattgaa gagaaagaaa aaatgatact tgaaactcta 2700 tttccttgtc cgcctgacga ttatcaggaa gtactaattg acagtgcggg aaattcacat 2760 ttgggtgtca caatggaggt agtacataga gcgatactgg accaatcagt acggaaagca 2820 ccagggccag atagacttaa cttcagagct atacgcttat tatgggactg ggactcagaa 2880 cgaatcactg cattaataag acagtgcata cggcagggac atcatccgca cgtttggcga 2940 atagccaaag gaattcttct tcgaaaacca aacaaagcag attattcgca agtcaagtca 3000 tatagaatca ttagcttatt aaattgtcta ggaaaagtag ccgagaaagt ggcagcggat 3060 attatcgcaa cctggtgtga gaaaagagac gttcttcacc agggacaaat gggttgtcgg 3120 aaacagcgaa gctgcatcga tgctgttgcc agagttgttg caggagtgga agaagcctgg 3180 aacagaggaa atatagcagc tctattactc atggatgtca agggggcatt cgatcatgta 3240 agctgcaata gcttactacg acgaatgcac aagatgggtg ctgatggcca aatgataaga 
3300 tgggtagagt cgttcttaac agatcgaaga atgcaatttg tgattgacgg taaatgcagg 3360 aaggaggttc agataagaac aggagttcca caaggatcac ctgtctcacc aatactattc 3420 actatatatc tgtgcggcgt gttcgatagt gttgaacagg gagtagatgg atgtacagca 3480 acttctttcg ccgacgattg tggctttgtt gtggaggcag caacagtgcc agatctaatt 3540 ggtaaaattc agttagcggg ggaaaaggcc agtgattggg gctccgaaaa tttcttacag 3600 tttgaccaaa gtaagacaga ggcggtggca ttcactagaa gacgcaaggg gaaaaccgaa 3660 ctctttaacg ccaccatcaa agtcaagaat cactccttta aattcaataa ggaagctacg 3720 cgatggctag gaatatggtt ggacacgacg ctaagctttc aggcccacaa gaatgtatat 3780 ctccaaaaag cgcgtaaagc agaaggtcga ttgcgatcta ttacatttag taagggacta 3840 gctccaggcc tggttcgaaa gattcaaata gcggtagtac aatctgtagc gttatatgga 3900 gccgaattat ggtggagaga ccaaaagaca tgggagcaag agcaccaaaa actcataaat 3960 agacaagcac gagctataac gggggtattt aaatcaactc caataggcat aacagttaag 4020 gaagcaggac tcagaccagc tatctcacta cttaataata gacaaaggag atatacgcag 4080 cgtttattag gattacctat aagtaacgat actcgaaaaa ttctcccaga aacgctacga 4140 gatggtgacg ctcatgctca accgggtgaa caagatgcaa ctggatggga ctggctctct 4200 gactgcaaag caaagcaatt aagccatcga ctagctaact cactagtgaa gggcacagat 4260 tttgatagca catttgggat agaatgtaca gagataatca atgaatgtca atttcctgga 4320 aagatttcgg ttctagcaag aagtgaagaa gctttgaaaa tggcgaacga acaccaagat 4380 agtttaggag aactatcttt ctggacagat ggatcaaaat tagaaaatca gcgcgttgga 4440 tcaggggttg cctggcaatc agaaaacgga aagtggaata cacgcaagat ttacttaggg 4500 acaaataaag aggtatttga tgcagagttg tacggagtag accaggcttt agaaattgcc 4560 caaaaagctg gaagaccatt agcgagaaca tcatcccaat cgacgattca aagtctcagc 4620 aagctccaga ctgtatttat ttggttggat tcacaagctg ctatatcacg aatacggcat 4680 gtagaaccaa gccctggcca atggctggta cgcagaattc atcaaagggt tagagaatta 4740 agagagcaag gaattggggt ccagattaat tgggttcctg gccactcagg agtagaaggt 4800 aatgaacgag cggatatagc ggctaagcaa gcagcgcttc gtggtagaca atgtcaggaa 4860 cgcttcgcat cattatcaca tataaccaga ctagtcacag agagaaaatg gaaagaatgc 4920 cagatatggt ttcagctgaa acatagaagt cggagtcaag ccgtcaagga cacatataat 4980 
atgcaaattg gcaagcgtgg tataaacaaa gtggcttccc actcctggaa aatattagcc 5040 gctcgatact ttcagctaaa gtctggacac gcgtggacag gatccttttt gggacgaatc 5100 aagaataggg aaaccaataa atgtactgaa tgctccgagg caccacccca aacagtccgc 5160 cacttgatgt tggattgtcg gagatggcgc cgcgaaagag atgagatgtg gaagcagatt 5220 gaaataagcg gaagccaaat ccgccctagg agaactaaaa taaagactct cttcggagac 5280 gaacaggcca cagcagcaat cctacaattc ctgaaaaata cgacagttgg gaaaaggaat 5340 atgaattcaa cagcggaaga gtggcgggaa accctaggca tagaagatct cgacgccgat 5400 gaggagctag gagaaagtga gaatgagtga gtagaggtaa gctcagcggt cccgaaaaga 5460 gtatatttcc tttttgtttt ttttctcctc ctgtatgcga tgactttcct tttatctgta 5520 tatgataccg tgtagtacta gtttcttttc cttttttttt tgttttttac attcatattt 5580 gagccatgct agttagaagt aatgaaaaat tgaggccgga agtgatgagg gtatggtgct 5640 aggcaatagc cttcaatact gaaacccaaa tgcccctctc atttgtggaa atgatgggcg 5700 aggaatataa ggatgtgtta ggcacagaat agacttagtt atagaaatcg taaattgggc 5760 ggtaattgta gagtttatta taagggagga atctgtagat tagtgtggca tatccattaa 5820 ggattagaac ccaaattgat tacaaaaaaa aaaaaa 5856 // ID GYMAG1_I repbase; DNA; FNG; 5681 BP. XX AC AACU01000830; XX DT 01-SEP-2005 (Rel. 10.09, Created) DT 01-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE GYMAG1: Gypsy-type LTR retroelement from Magnaporthe grisea DE (internal portion). XX KW Gypsy; LTR Retrotransposon; Transposable Element; GYMAG1_LTR; KW internal portion; GYMAG1_I. XX OS Magnaporthe oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RA Dean R.A., Talbot N.J., Ebbole D.J., Farman M.L., Mitchell T.K., RA Orbach M.J., Thon M., Kulkarni R., Xu J.R. et al.; RT "The genome sequence of the rice blast fungus Magnaporthe RT grisea."; RL Nature 434(7036), 980-986 (2005). XX RN [2] RP 1-5681 RA Jurka J.; RT "GYMAG1: Gypsy-type LTR retrotransposon from the rice blast RT fungus Magnaporthe grisea."; RL Repbase Reports 5(9), 242-242 (2005). 
XX DR EMBL/GenBank/DDBJ; AACU01000830; Positions 10814 16494. XX CC LTRs are identical and there are two distinct ORFs. CC This appears to be a recent insertion. XX FH Key Location/Qualifiers FT CDS 133..1680 FT /product="GYMAG1_I_1p" FT /translation="MSPRKNYGPPIRQSQRVTNANSAKPDNLASGTATPNT FT EGSDQNGEITVAPVSPEPVEPAPTSANTEPEEPVEPQYDTEQDQPVDSIEV FT DGSYEYESVSEHPGEARSPPHQPSAKTSDGRNEIPKFPSRAPSPAAQMRSE FT MANESENTTGQATITQSQLQDLLATIAQLKDQLATQNNSTSNGARGVTPFN FT SAFGGTDFKPVGFAAHDAFRPYGNGQISVNPGYDRKARKTGVDPGSFEGKD FT DTFDTWIIQVADMMEEDESTYKTERSRMAALFSLTKGPPNDLLRSRYASKE FT NPFSGVAEMVATLAAVYHDDNQGTKARRELAEMMFDPADKTMDIPCFLLAK FT LTLLADRANILKSERKTLLYEHIPAKLNTQLFDDAKNPDISYESFVNKVAN FT AALANQRAYEENLKRKQAKKGRQSPEPRRRRSRDYEHRREVKTATPTPVKE FT VVTLSDKERSALLDAGLCFLCKKAGHQSRQCPDRKIIANMLQALDHVDSYS FT QPDNSTRNSSSTSSSDSENC" FT CDS 1713..5627 FT /product="GYMAG1_I_2p" FT /translation="MPQIAKIDRSPKLNTFLAPIQLQDKGRGLNAQALCDT FT GADVYLSIRPSLARKVAQRLELPIKKLPKPLDFGDFNGKVTARATEYLRLT FT LQIDGRQFPRQKFILLETAHDVFIGQEWWTKHRVGLFPATRSFVWPNDLPA FT MAQYSPAIVVQRSNPPLDLEAQADAVRRDRKMEQEDRRIQVRKILRSPKRQ FT HENQAQIAEINAFPTIPKTVSNKGLPASICALLNDPRQDRWKRSPLPTEPI FT PLVPVNDTPTTSLNAVSALQWNKTHDGRPIPFPADEDPEHIAQVRAKLPKA FT LAHLEGFFSKKASTILPPSRPGFDVVLELEKPLEGRPARYSTPFAMMELEK FT ETIDELLRIDFIERTMEETAASTLFVPKPQSKEQRFCVDYRWVNKFIKGRQ FT VLAPDVAGTLSKCGKARRMTKIDIIRAFNRLLMDPKSRYLTAFKTRQGTFR FT WKVLPFGLKVGPAWFQAFINAQLNELLDAFASAYADDVLIYTEDKSEQVHF FT EQTEEVIYRLHKAGLQGDIKKSSFGVFEIEYLGLLLEIGKGIRIDPKKVEA FT ITSWQWDDVTSVSAVRSFLGLCNFVRTFCHHASEQAEPLTRLLKKGVPFER FT GPEQKSAFEALKQLVITAPVMSFFKPGMPVRMDTDASGRATAGVVWQQQDD FT GSWKPIGYSSKTMSPAEQNYPIQDQELLAVINTLKDFEPALLGTKFCVFTD FT HQALIYWSTKKLLSARQIRWADYLANFDITFKYRPGKDNVAADALSRKTID FT NPTVKARAVLDRTIALIPPEKIDSRPGEPLPTISNLEPSAPQGADLVALIL FT EENLKQNLGHHDNLLTVPETTQDGKIFLRTALIREAHEPKIFGHGGQNKTL FT SRLKKDYWWPDRNRDVKRYIKNCRECQRNKVRHDKTPGLLHPLEIPRRCWQ FT HVMVDGKDMPKDKEGYNYVWVFICRLTKLLATLPGKKTDTAETLAMRYYQT FT VYRWKGVPDLWLSDNAGPFISEFLNTLNELTGTKHKHGSARHPQSQGSVEI FT 
TNAELDQKMRFYVDKYQTNWRRHIPAFDFGHNSSVHASIGMAPTEAETGAR FT PRDPLSLPLPEEDLETGQQQKALEMVRQARQAQELARNNGLGAQIEQQRQA FT NKKRRPVDFTVGDAVYVSKKGFSTEAPTTKLDSQNAGPWTILEEKGHSFIL FT DTPAWYKGSKLFHASRLRKAATDPLPQQYQKPEPPVEINGEPEWEVEQVLA FT SRLFGRKKTLQYQVSWVGLDPDETWYEARDLKNSPVLLDTFHREYPDAAGP FT PVNLQQWIRSAAEDVFAEDGPEDNVAEHDAKKTRERRKAPRRHT" XX SQ Sequence 5681 BP; 1584 A; 1589 C; 1377 G; 1131 T; 0 other; taattctaaa ctcgctataa tttgctggtg aagcaattaa aatatatcgt tcaatccctt 60 tattcaatcg ttcaccacgt cggctgtgca cgacggtaaa ctaggccaca gaagacacca 120 ttcgcgaaat taatgtcgcc gcgtaaaaat tacggcccac cgattcgcca atcccaacgc 180 gtgacaaacg caaactcagc aaagccagat aatttggctt caggcactgc tactcccaac 240 accgaaggtt cggatcaaaa cggagagatt acagttgccc cagtgagccc agagccagta 300 gagcccgctc ctaccagcgc caacacagag cctgaagagc ccgttgagcc ccaatacgac 360 acagagcaag atcagccagt cgacagcatc gaagttgacg gatcgtacga gtacgagtca 420 gtttctgagc acccaggaga ggcgcgaagc cctccccacc aaccttctgc caagacgtca 480 gacggcagga acgaaattcc gaaatttcca agcagagcgc cctcgcctgc cgcccaaatg 540 agaagcgaaa tggcaaacga aagcgaaaac acaaccggcc aggccaccat cactcaatcg 600 caattgcaag acctcttggc tactattgcc caactgaagg accagttggc cacccaaaac 660 aactccacca gtaatggagc ccgtggggtt acacccttta acagcgcctt cggtgggacc 720 gattttaaac ctgttggatt tgccgcccat gatgccttta ggccctacgg aaatggccaa 780 attagcgtta accccgggta tgatagaaaa gcccgcaaga ccggcgttga ccccggatct 840 ttcgagggta aggacgacac cttcgacacg tggatcatcc aagtcgccga tatgatggaa 900 gaagatgaaa gcacttacaa gaccgaacgt agccgtatgg cagccctttt ttcgttaaca 960 aaaggaccgc ctaacgacct attgcgtagc cgttacgcct cgaaagaaaa cccttttagc 1020 ggtgttgccg agatggtcgc aactttagct gcggtgtacc acgacgataa ccaaggcact 1080 aaagcacgcc gagaactggc tgaaatgatg tttgacccag cagacaaaac tatggacata 1140 ccatgttttc tgttagcaaa gttaacactg ttggcggacc gcgccaacat tctcaaatca 1200 gagcgtaaaa ccctactata cgaacatatt ccggcaaaat tgaacaccca gcttttcgac 1260 gacgctaaaa atccggatat atcgtacgag tctttcgtga acaaagtcgc caacgctgca 1320 ctggctaatc agcgcgccta cgaggaaaac cttaaacgga aacaggctaa aaaggggcgc 
1380 caatcgcccg agcctcgtcg tcgccgttcc cgcgattacg aacacaggag agaggttaaa 1440 acagcgactc ccacacccgt taaggaagta gtcactcttt ccgacaagga aaggagcgct 1500 cttttggacg caggtctttg ttttctatgt aagaaggccg gccaccaatc ccgtcaatgc 1560 cccgatcgca aaattatcgc caatatgtta caggctctgg accatgtgga ttcgtattcg 1620 cagcctgaca attcgactcg caactcgtcg tcgacttcct cgtctgattc ggaaaactgc 1680 taaggctgcc ggaagtctcc tcggcagtct gcatgcctca gatagctaaa atcgatagat 1740 cacccaaatt aaacaccttc cttgctccca ttcaactgca agacaaaggt cgtggtctca 1800 acgcccaagc tctctgtgac accggagcag atgtgtactt atccattcga ccgagcctgg 1860 cgagaaaggt cgcccaacgt ttagagctac ctatcaaaaa gctccccaaa cctttggact 1920 ttggagattt taacgggaag gttaccgcaa gagccaccga ataccttcgg ctcactttgc 1980 aaatcgacgg acgacaattc ccacggcaga aattcatcct ccttgaaacg gcccacgatg 2040 tgtttatcgg acaagaatgg tggacgaaac atcgagtagg cttattccca gctacccgca 2100 gcttcgtttg gcccaacgac ctgcccgcca tggcccaata ctcacccgca atcgttgtac 2160 aacgttcgaa tccgcctctg gaccttgagg cccaagctga cgcagtccgc cgggaccgca 2220 agatggaaca agaagaccgt cgcatccagg tccgcaagat acttcgaagc cctaaacgcc 2280 aacacgaaaa ccaggctcag atcgcagaaa ttaacgcctt cccgactatc ccaaagaccg 2340 tttccaacaa aggcctgcca gctagtattt gtgccctcct taacgaccca cgtcaagatc 2400 gatggaagcg atccccgtta cccacggagc ctataccgct ggtaccagta aacgatacgc 2460 ctacgaccag cctcaatgcg gtatcagccc tgcagtggaa caagactcac gatggacgtc 2520 ccattccatt ccctgcagac gaagacccgg agcacatcgc gcaagtacga gccaagttac 2580 ccaaagcact tgcccacctt gagggtttct tttccaaaaa ggcgtcgacg attctcccgc 2640 ctagccgacc cggcttcgac gtggttttgg aactcgagaa acctttggag gggaggccag 2700 cccgctattc gacccctttc gcgatgatgg agttggaaaa ggaaaccatc gacgaacttc 2760 tgaggatcga tttcattgaa cggacaatgg aggagaccgc agcttcgact ctgttcgttc 2820 ctaaaccgca atcgaaggag caacgttttt gtgtggatta ccgatgggta aacaaattca 2880 tcaaaggacg acaagtactt gcccccgacg tggccgggac gttgagcaag tgtggaaaag 2940 cccggaggat gaccaagatc gacattattc gagcttttaa cagattgcta atggatccga 3000 agtcacggta tttaacggcg ttcaaaacgc gacaaggcac atttcggtgg aaggtcctac 3060 
cctttggact gaaggtcgga cccgcctggt tccaagcttt tatcaacgct cagcttaatg 3120 aactgctgga cgctttcgcg agcgcctacg cagacgacgt gttgatttac accgaggata 3180 aatcggagca agtccatttt gagcaaaccg aggaggttat ttacaggctg cacaaagccg 3240 gactccaagg cgatatcaaa aagtcaagtt tcggcgtttt cgaaatcgag tacctgggtt 3300 tacttctaga gatcggtaaa ggtatccgca tcgaccccaa aaaggtcgaa gccatcacca 3360 gctggcaatg ggacgacgtc acctccgttt cggcagtacg atccttccta ggcttatgca 3420 atttcgttcg gacgttctgc catcacgcca gcgaacaagc agaacctctg acccgattat 3480 tgaaaaaggg agtcccgttc gaaagaggcc ccgagcagaa atctgccttt gaagcgctga 3540 aacagctggt gatcaccgcc cccgtgatga gtttcttcaa acccggtatg cctgtcagga 3600 tggacaccga cgccagtggc agggccaccg ccggtgtcgt gtggcagcaa caggacgatg 3660 gcagctggaa gccaatcggc tattcgtcca aaaccatgtc ccctgctgaa caaaactatc 3720 cgattcagga ccaggagcta ttagctgtga ttaatacgct aaaggacttc gaaccagccc 3780 tgttgggtac caagttctgt gttttcaccg atcaccaggc cctaatttat tggtcgacca 3840 agaaactttt gtccgctcgc cagatacgat gggctgacta cctggccaac ttcgacatca 3900 cctttaaata ccgtcccggc aaggataatg tggcagccga tgccctttcg cgcaaaacta 3960 tcgacaaccc tacggttaag gcccgagctg tgttagaccg caccattgca ctaatccctc 4020 cagaaaagat cgactcccga cccggcgaac ctcttccaac cattagcaac ttggaaccct 4080 ccgcaccgca gggagctgac ctggtggccc ttatcctgga agaaaacctg aaacagaacc 4140 taggtcacca cgacaatttg ttaacggtac ccgagactac ccaggatggc aaaatcttct 4200 taagaacggc tctcatccgc gaagctcatg agcccaagat cttcggccat ggaggccaga 4260 ataaaaccct cagccgcctg aaaaaggact attggtggcc cgaccgaaat cgagacgtca 4320 aacgctacat taagaattgt cgcgaatgtc aacgcaacaa ggtacgacat gataagacgc 4380 ctgggctgtt gcacccactg gagatcccta gacggtgttg gcagcacgta atggtagacg 4440 gcaaagatat gcccaaggat aaagaaggct ataattacgt ctgggtcttc atctgccgat 4500 tgaccaaact tttggccacc ctaccgggaa aaaagacgga caccgcggag accttggcca 4560 tgcggtatta tcagaccgtg taccgttgga aaggcgtccc cgatttatgg ctttccgata 4620 acgccggacc gtttatatcc gagttcttaa atactctaaa cgagttgaca ggaacaaagc 4680 ataaacatgg aagcgcccgc caccctcaaa gtcagggcag cgtcgaaatt accaacgctg 4740 aattggatca 
gaaaatgcgc ttttatgtag acaaatacca aacgaattgg cgacggcata 4800 tccccgcttt cgacttcggc cacaactcgt ccgtccacgc ctctattggc atggcaccga 4860 ccgaagcaga aaccggagct cgccctagag acccgctctc tttaccttta cccgaagaag 4920 acctggagac cggtcagcaa caaaaggcgc tggaaatggt ccgccaagcc cggcaagctc 4980 aggaactggc ccgcaacaac ggactcggag cgcaaataga acaacagagg caggctaata 5040 agaagcggcg accggtcgat tttacggtgg gagatgcggt ttacgttagc aaaaagggtt 5100 tttccaccga agctcctacg accaagttgg actcgcaaaa tgcaggacca tggacgattc 5160 tcgaggaaaa gggccacagc ttcattctcg acacccctgc ctggtataaa ggtagtaagc 5220 tgttccacgc cagccgactg cgcaaggcag caacggatcc attacctcag cagtatcaaa 5280 agccggaacc accggtcgaa attaacggag aaccggagtg ggaggtggag caagtgttgg 5340 cctcacgtct attcggacga aagaagacgt tgcaatatca ggtatcgtgg gtcggactcg 5400 accctgatga gacatggtat gaggcccgcg acttgaaaaa ttcaccagta ctgctggata 5460 cctttcacag ggagtacccg gacgcagcag gacctccggt aaatttgcaa cagtggatca 5520 ggagcgcagc agaggatgtt tttgcggagg acggacccga ggacaatgtt gccgagcacg 5580 atgcgaaaaa gacacgagag cggaggaagg ccccgaggag acacacgtga caaactcaag 5640 actctgaccg cctacgtcga tcaggcctta gaggggggta a 5681 // ID Gypsy-16_RO-I repbase; DNA; FNG; 4829 BP. XX AC AACW02000194; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_RO_; KW Gypsy-16_RO-LTR; Gypsy-16_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-4829 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000194; Positions 321832 326660. XX CC 'TGATC' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(46..1374,1378..4806) FT /product="Gypsy-16_RO-I_1p" FT /translation="MSENTLGEIADGTSNPTDSPSTVNPSVQLGTMASKYA FT HSDITPMEEDTPTPVTPKPEDVALKSIGMIKDMKKQALVLFAAYMNSNQSS FT PDSPETLQALSLYREFEQKIVAAQEAHKSMISLFDDKQDSNGSEILGSVRL FT VVPNDLPVLQLRGEALWKKKSDPFDSAYDFCNTFETVLHAHGLSLNSNWER FT LLPLCMNPEQVSWSREALMNKRYTWKEVRPIVLDHFDTPYRKFLLMVEVGS FT MRQAPYESNREYSNRFQKMRREAGMEDGTQLAVTYFASLKPSVKTVAQVAI FT SSHLGAKLPSSINQIIDLVLASGEDSAFSIKNPHKRNRGSEEELVSRVSFI FT KTNKVSPGTSKVANGFKSNNISTNKGITKPKPCIYCGKEWSKGHRCEEFKE FT AKKSSSNAKANEGFSLSRVNRMAIRSNNDEKSNDDEDVTNGHLNRMALDKL FT KNKDITVTRDFKLTKDNSISFPILVNNIKAMTILDTGANFSSINKKFCLEH FT NFPIVPSKEKDVIKLADSKSTITRIGTIELNIRCNEKNYTHNFEVMNLTND FT HDMSIGTDFMTKLGIGLTGLPYKWDTDEVDNIQAAQSKYSDNSELLEEVDK FT EMSELEDCPAGTPEQFQSAINDIKPLIQLNQGIPRGSFCTIPESVVSIETP FT ENVTSYRSPYPIALVYHQVVQDQINEWLENGVIKRAPANTEWNSALTVVKK FT TNAKGEITGYRVCHDPRHINALLSSIDRMPLPKISELFEELKGASVYSTLD FT LKSAFNSLRLREDHAHKLAFSWNNVQYVPIGTPFGLKHVSSVMQRTMSIAL FT ENMPFARCFIDDIVVASTSIEEHKLHLKAVINKLTKVNLKLNPDKCKFFQN FT KINLLGFRISPKGVSLDVRKVANVQDFPVPKTGKDIMRYCGLINYFRPLIP FT KASTLLAPLDALRNEKSLEKIWNSKHQTCFDNLKKVLLEGTILAYPDLNRP FT FCVATDASNVGVGVVLYQVIDGKMKYISMMAKSLSKSERNYSATKRELLAV FT VYALKKFHKYLWGNHFTLYTDHKALTYLHTQKIANVMLINWLDTLLKYDFN FT VVHLPGIDNILPDTLSRLYEIENPVNELGGDNARLLNRTAIKLPSINDGEY FT ITPADSEERKELLLKEHLKGHFGSDAIYHALKRKGIYWNNLKEEAVELVKS FT CVPCQQFNIIKKGYNPLRPITATLPGDSWGIDLAGPMKTSLNGNNYLLIMV FT DIATRYCILKPIPDKQSMTIVKTLIDVFSNYGFPRVLQSDNGGEFVNELMQ FT LLAENAGYDHRLISSYHPRANGVSERWVQSAVKAIKKQVEGAKADWDLYVP FT STQLYLNSKYNERTKTPPFTLMFGRNPNDFEDFSQEKNEATKEELNTELLK FT NIKKMTEIVFPAIYERTKIITNKQKEKFDASHKLVSIPENSYVMVTVNYKT FT NKLDAPYEGPYKVKRITQGGSYVLEDEKGDLLPKNYPPSALKMISQDEVIS FT SNKFYQVEAILAHKKAKGKYIYKCRWKGYDESEDTWEPASHFADLKFITEY FT WQRIGIVPEDIKTKYKILDKKEKSNNTNKESNPQSSKRKISIDLKETSYSS FT TTIDEKNKSQGRRKRSKRY" XX SQ Sequence 4829 BP; 1698 A; 847 C; 866 G; 1418 T; 0 other; ttttactttt tgaattttgc tctttatttg aaatttaaaa tacgtatgtc tgaaaacact 60 ctcggtgaaa ttgctgatgg tactagcaat cctactgatt 
ctccatctac tgtgaatcct 120 tctgtccaac tgggtactat ggccagtaaa tatgctcact ctgatattac tcctatggaa 180 gaagacactc ctactcctgt aacgcctaaa cctgaagatg tagctcttaa aagcattggt 240 atgatcaagg atatgaaaaa acaagctttg gttctgtttg ctgcgtacat gaattcaaac 300 caatctagtc ctgatagtcc agaaacattg caagctttgt cgttgtatag agaatttgaa 360 cagaagatag tggctgcaca agaagcccat aaatcaatga tctctctttt tgatgataag 420 caagactcaa atggctctga aattcttggt tcggtgcgtt tagtagtccc aaatgattta 480 cctgtgttac aattacgtgg tgaggcactg tggaagaaaa agtcagaccc ctttgattct 540 gcttatgact tctgcaatac gtttgaaacg gtactacatg ctcatggttt gtcacttaat 600 tcaaattggg aacgcttatt acctttatgc atgaacccag aacaagtctc ttggtcccgt 660 gaagccctaa tgaacaagcg ttacacctgg aaggaagtcc gtcctattgt tcttgaccac 720 tttgatactc cgtatcgaaa gtttttgcta atggttgaag ttggctctat gcgtcaagct 780 ccttatgaaa gtaataggga atattcgaat cggtttcaaa agatgagaag agaagcgggt 840 atggaagatg gaacccaact agctgtaaca tactttgcat cacttaaacc ttctgtcaaa 900 accgtggctc aagtggctat atcatcccat cttggtgcta aactccctag ctcaattaat 960 cagataattg acttggtcct tgcttcgggt gaagattctg ctttttcgat caaaaatcca 1020 cacaaacgaa atagaggatc tgaagaagaa ctagtctctc gtgtgtcttt cattaagact 1080 aataaggttt cccctggaac tagtaaagtc gctaatgggt tcaagtcaaa taatataagc 1140 actaataaag gcatcactaa gcctaaaccg tgtatctact gtggtaaaga atggagtaaa 1200 ggtcatcgtt gtgaagagtt caaagaagca aagaagtcct caagtaatgc caaggcaaat 1260 gaaggatttt ctctctcccg tgtcaatcgt atggcaattc gctccaataa tgatgaaaaa 1320 tcaaatgatg atgaagatgt aactaacggt catttaaacc gtatggctct tgattgaaaa 1380 ttaaagaata aagacataac tgttactcgt gattttaaat tgaccaaaga taattctata 1440 agctttccta tccttgtaaa taatataaaa gcaatgacta tcttagacac aggtgcaaat 1500 ttttcgtcaa ttaataaaaa gttttgtcta gaacataatt ttcctattgt tccatctaag 1560 gaaaaagatg tcataaagtt ggctgattct aaatcgacaa ttacacgaat tggtacaatt 1620 gaattaaaca tcagatgcaa tgaaaaaaat tatactcata acttcgaagt aatgaattta 1680 acaaatgatc atgatatgag tataggtaca gatttcatga ctaaacttgg tattggtctt 1740 actggtttac catataaatg ggatactgat gaagtagaca atattcaagc tgcacaatct 1800 
aaatatagtg acaatagtga gctcttagaa gaagtagata aggaaatgtc agaacttgaa 1860 gactgccctg caggaactcc tgaacaattt caatctgcta taaatgatat taaaccattg 1920 attcaattaa atcaaggcat acctagaggt tctttttgta caatcccaga atctgttgtc 1980 tcaattgaaa ctccagaaaa cgtaacatct tatcgaagtc cttacccgat cgctttagtg 2040 tatcatcaag tagtacagga tcagattaat gaatggctgg aaaatggtgt cataaaacgt 2100 gctcctgcta acactgaatg gaattcggct ctaactgtag taaagaagac aaatgccaag 2160 ggagaaataa ctggttaccg tgtatgccat gaccctagac atattaatgc tcttttatct 2220 tctattgacc gaatgccttt accaaaaata tctgaactct ttgaagaact aaaaggagct 2280 tcagtatatt caactctgga tcttaaatcc gcatttaact ctttaaggct gcgagaggat 2340 catgctcata aattagcttt ctcatggaat aatgtacaat acgtgccaat tggtactcca 2400 tttggtttga aacacgtatc tagtgtgatg caaagaacaa tgagtatagc acttgagaat 2460 atgccttttg ctagatgctt tatcgatgat atagtagtcg catctacttc aattgaagaa 2520 cataaactac atttaaaagc agtaataaat aagctgacca aagtaaatct caaattaaat 2580 cctgataaat gtaaattctt tcaaaataag attaatttac ttggttttag aatatcgcct 2640 aaaggtgtat ctcttgacgt tcgtaaagtt gctaatgtac aagatttccc tgtacctaaa 2700 acaggaaagg atatcatgag atattgtggt ctcataaatt acttccgacc tcttatccct 2760 aaagcgtcca cactcttggc tccgctagat gcattgagga atgaaaaatc acttgaaaaa 2820 atatggaaca gtaaacatca aacatgcttt gataatctta aaaaggtact tttggaagga 2880 accattttag cctatcccga tttaaatcga cctttctgtg ttgccactga tgcttctaac 2940 gttggtgtgg gtgtcgttct atatcaagtg attgatggaa aaatgaaata tatttccatg 3000 atggcgaaaa gtctctctaa aagtgaacgc aactattcag ctactaagcg tgaattattg 3060 gcagtagtct atgcattaaa aaagttccat aaatatctct ggggtaatca cttcactcta 3120 tatacggatc ataaagcatt aacctatctt cacacgcaaa aaatagcaaa cgtaatgctg 3180 ataaattggc tggatacttt attaaagtat gattttaacg tcgttcatct gccaggaata 3240 gataacatct tacctgatac actttctcgt ctctatgaaa ttgagaatcc tgtcaatgaa 3300 ctgggagggg ataatgctcg tctgcttaac agaacagcta taaaattacc tagtattaat 3360 gacggtgaat acataacgcc agcggattca gaagaacgta aagaacttct tcttaaagag 3420 catctaaaag gacattttgg ttctgatgca atctatcatg ctttaaaacg taaaggtatc 3480 tactggaata 
atctaaaaga agaagcagta gaattagtaa aaagttgtgt accctgtcaa 3540 caatttaaca taatcaaaaa aggttataac cctttaagac ctataacagc aacattacct 3600 ggtgatagtt ggggtatcga cttagctggt cccatgaaaa cttccttgaa tggcaacaat 3660 tacttactga ttatggttga cattgctact cgttactgca tattgaaacc tatccctgat 3720 aaacagtcta tgacgattgt aaagacatta atcgatgtat tctctaatta tggttttcct 3780 cgtgtactcc aatctgataa tggtggtgaa tttgtaaatg agcttatgca attacttgct 3840 gaaaatgctg gttatgatca tagacttata tctagttacc atcccagagc taatggtgta 3900 agtgaacgtt gggtgcaaag tgctgtaaaa gccataaaga aacaagtcga aggtgctaaa 3960 gcagactggg atttatatgt tccttctaca caactctacc taaatagtaa atataatgaa 4020 cgtacaaaga cgccaccctt cactttaatg ttcggtcgta atccaaatga ctttgaagat 4080 ttcagtcagg aaaaaaatga agcaactaag gaagaattaa atacagaact tttaaaaaat 4140 ataaaaaaga tgacagaaat tgtatttcct gctatatacg aaagaacaaa aataataaca 4200 aacaaacaaa aggaaaaatt tgatgcgtca cataaacttg tatcaatccc cgaaaatagt 4260 tatgtaatgg taacagtaaa ttataaaaca aataagcttg atgctccata tgagggacct 4320 tataaagtta aaaggatcac ccaaggtgga tcatatgtgc ttgaagatga aaaaggagat 4380 ttattaccta aaaactaccc accatccgca ctcaagatga tatcacagga tgaagtaata 4440 tcctcaaata aattctatca agttgaagcc atcttagcac ataaaaaagc taaaggaaaa 4500 tatatttata agtgtagatg gaaaggttat gatgagagtg aagatacctg ggaacctgca 4560 tctcactttg cagaccttaa attcataact gaatattggc aaagaattgg tattgtacct 4620 gaagacatta aaacaaaata taaaattctt gataaaaaag aaaaatcaaa caatacaaat 4680 aaggaatcta atcctcagtc aagtaaacgt aaaatatcaa tagacttaaa agaaaccagc 4740 tactcttcca caacaattga tgagaagaat aagagccaag gtagacgtaa aagatctaaa 4800 cggtattaat catacctggg aggggatta 4829 // ID Gypsy-24_MLP-LTR repbase; DNA; FNG; 319 BP. XX AC AECX01000965; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_MLP_; KW Gypsy-24_MLP-I; Gypsy-24_MLP-LTR. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-319 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000965; Positions 210357 210675. XX SQ Sequence 319 BP; 73 A; 77 C; 65 G; 104 T; 0 other; tgtaagggtt acatacaggt acaagccatt atggctgtag acaggttaga ttgtatagtc 60 atcacgcacc tagctccaca aagatccttt tcttcaccga atacttcggt tggagctagg 120 tgagcacctc atttcactat ttgtatttca gttctccttt cccatctctc atcttatctt 180 aggagctagg agctgattca ccaaatactt cggttggagc tagtcttgtt ttggaataga 240 ttgaagggtt ttagattacc cttggagcct tgctcccgtc ccaccttgcc cagtgaagac 300 ttcccttcgg agtcttaca 319 // ID Gypsy-9_RO-LTR repbase; DNA; FNG; 254 BP. XX AC AACW02000090; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_RO_; KW Gypsy-9_RO-I; Gypsy-9_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-254 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000090; Positions 149889 150142. XX SQ Sequence 254 BP; 94 A; 37 C; 37 G; 86 T; 0 other; tgtatgtata tctaccaaaa tagactggtg aatgaggagc aagtgcttat attaaattat 60 ataaagatct gcataataga atataacatt tagccatctg cataaggctg tgggcatttc 120 gccatccgca tagtacatat attaatatct acactgtgaa aagtatataa actctggaca 180 tatattatta aataaaatca tatataattt tgggtcctat tgaaatctcc attcttattt 240 ttgtgaacaa aaca 254 // ID Gypsy-2_MLP-LTR repbase; DNA; FNG; 1325 BP. XX AC AECX01001632; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_MLP_; KW Gypsy-2_MLP-I; Gypsy-2_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-1325 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001632; Positions 134205 135529. XX SQ Sequence 1325 BP; 507 A; 255 C; 224 G; 339 T; 0 other; tgtcagctct cgagctctta cacataagaa ccagaagagc tacaataatt ataaaaattc 60 atatatcatt taacatttag attacactta taatatatac aacttataga cctagtttcg 120 aaggtcaaga aacttctaag tattgaaaac accaaagagc taataaacaa tacaaaataa 180 aatatacaaa tcacaaacat acccgttttg aagaaacaaa caaaaacatc ttatagactt 240 agtatcaaaa gtcatgaata tacaaaaacc aaagtaagag ttagaacact cacaggagag 300 catatacaaa aaccaaagta agagttagaa cactcacagg agagcatatg aaattaatgt 360 atagctagat aagagtaccc aaagaggaac actaggccgg accagaactt acattcaaag 420 ataagccaaa gaagaatggt accagaaagt ccgtgattag tgaataagga atgaggagac 480 caaagactag aactcctgaa cagacgagta aaatcaacaa tattaggtaa aatattggta 540 gaatgccaag aaacaaacgc tgagaactat ttgatcaata taaggaaaat acgtaaatac 600 atacgtagta ttcctcgact tagtaagacg gcggtgacga gatgctgagc tagcttggaa 660 ataagcagac caaagctaga gtcagaattc tgagaagagg atgactgaga aaagaaggta 720 taaatactgg acgactaatc aaaaataaga agggactcac ctgattttac aaacccaaaa 780 ttctatctag attacaccat tatctaatta gaatactctt cttttttcga tcactagcta 840 ataagaaatt cccaaagaat agtcacatta gttccctttt gaaatcagtt atagaatcag 900 agtcatacct tgctattcta tatagttata acaaaaccac cgagttgtcc tattcgttaa 960 acataattaa cggttagtct ttctcttcta ctcttctggt caaggaccca ggtctcgcaa 1020 ctatattgtt ggcgggatcc ctgtagattc aaagtctcag gctgctctta actctcgagt 1080 tagagataac tacggctcag gcgtgacctg ccaacttctg ttaccttaca gtgggcctgt 1140 
agtacaagac ttacttgtta agtataattt tggaatagca attactaatc ataagctttc 1200 tatgtattcc ctcctagaaa gataactcta catattcccc gtacccagcg gaaaaacggt 1260 caggatcttc tcacgacact gttgactaga ccaattcgtg agtcataaca gtagccaccc 1320 tgaca 1325 // ID I-5_AO repbase; DNA; FNG; 5486 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE A family of I non-LTR retrotransposons - a consensus sequence. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-5_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-5486 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-5486 RA Kapitonov V.V. and Jurka J.; RT "I-5_AO, a family of I non-LTR retrotransposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 13-13 (2006). XX DR [2] (Consensus) XX CC It is a family of I non-LTR retrotransposons. It contains well CC preserved ORF1 and ORF2 coding for the gag/RING finger-like and CC endonuclease/RT/RNase H proteins. 
XX FH Key Location/Qualifiers FT CDS 81..1409 FT /product="I-5_AO_1p" FT /translation="MSGRAETRTPDPPDRDEQENSPPRLVRPKRTTRPPAH FT YAQEQEIETEQRNTRSQRKKKNQGKPVAQDKAATSDDSSTEREDSDTSKLV FT KEIVKLRREIRRRDELYKEELQRVKEEFGAALTEFRHELLANRPPTPQAHP FT ESCAQSGHEEILREIQSLRVAVNPSGSPSYADVARTPPTSQPSNIRTLSSW FT NTTPTTFTDTLYCTIDTSKMADTESERPSAGPIRTAVETEIRTMENYTNWR FT CRAVTVDPKNTNRIRIACRDEAEHQLVKKVAETKVGAGARVLRDELYPIKV FT DSVNRTAVLDENGDIRVGAAAAFGEENETTVAKIAWLSRKENAKAYGSMVV FT YLTKGSDARRLLADGFFHAGGESGVTSTFEHRPRPIQCYNCQEIGHKAFQC FT KSTQKCARCAAEGHHHSCCNQSVPKCIPCGGPHESYSKNCRKLYPSHHE" FT CDS 1405..5169 FT /product="I-5_AO_2p" FT /translation="MNKTLRVIQLNVRKQGAVHESLMNDEETQNTVALAIQ FT EPQARRIQGRLLTTPMGHHKWTKMVPSTWREGRWAVRSMLWINKEVEAEQV FT PIESPDLTAAVIRLPERLIFMASVYVEGGNASALDDACNHLLDAITKVRRD FT TGVVVEILIMGDFNRHDQLWGGDDVSLGRQGEADPIIDLMNECALSSLLRR FT GTKTWHGGGHSGDCESTIDLVLASENLADSVIKCAILGTEHGSDHCAIETV FT FDAPWSLPKHQGRLLLKNAPWKEINTRIANTLAATPSEGTVQQKTDRLMSA FT VSEAVHALTPKSKPSSHAKRWWTADLTQLRQIHTYWRNHARSERRAGRKVP FT YLETMAQGAAKQYHDAIRQQKKKHWNQFLADNDNIWKAERYLKSGEDAAFG FT KIPQLLRADGTTTTDHKEQAEELLAKFFPPLPDNIDDEGTRPQRAPVEMPA FT ITMEEIERQLMAAKSWKAPGEDGMPAIVWKMTWPTVKYRVLDLFQASLEGG FT TLPRQWRHAKIIPLKKPNKENYTIAKSWRPISLLATLGKVLESVVAERISH FT AVETHGLLPTSHFGARKQRSAEQALVLLQEQIYAAWRGRRVLSLISFDVKG FT AYNGVCKERLLQRMKARGIPEDLLRWVEAFCSERTATIQINGQLSEVHSLP FT QAGLPQGSPLSPILFLFFNADLVQRQIDSQGGAIAFVDDFTAWVTGPTAQS FT NREGIEGIIKEALHWERRSGATFEAEKTAIIHFTPKTSKLDREPFTIKGQA FT VEPKDHVKILGVLMDTSLKYKEHIARAASKGLEAVMELRRLRGLSPSTARQ FT LFTSTVTPVVDYASNVWMHAFKNKATGPINRVQRVGAQAIVGTFLTVATSV FT AEAEAHIATAQHRFWRRAVKMWTDLHTLPDTNPLRRNTARIKKFRRFHRSP FT LYQVADALKNIEMETLETINPFTLAPWEARMQTDGEAMPDPQAIPGGSIQI FT AISSSARNGFVGFGVAIEKQPPQYRKLKLKTFSVTLGARSEQNPFSAELAA FT IAHTLNRLVGLKGFRFRLLTSNKATALTIQNPRQQSGQEFVCQMYKLINRL FT RRKGNHIKILWVPASEDNKLLGLAKEQARAATHEDAIPQAQVSRMKSTTLN FT LARSQAATTKALPEDVGRHIKRVDAALPGKHTRQLYDGLSWKEATVLAQLR FT TGMARLNGYLYRINVAQTDQCACGQARETVEHFLFRCRKWTTQRIALLQCT FT RTHRGNLSLCLGGKSPSNDQQWVPNLEAVRASIRFAMTTGRLDAV" XX SQ Sequence 5486 BP; 
1568 A; 1440 C; 1495 G; 983 T; 0 other; tagtgtagta gatattggta ccagttacag aactgtacta ctactggttt ggaacagaaa 60 acaacttgga ggaggaaaaa atgtcgggac gggccgagac gcgaacccct gacccgcccg 120 accgcgacga acaggaaaac tcaccaccgc gattggtgag gccgaaacgc acgacaagac 180 caccggccca ctatgctcaa gagcaagaga tcgagactga gcagagaaat acgcgctccc 240 agcggaagaa aaagaaccag ggtaagccag tcgcccaaga taaagcggcg acttcggacg 300 actcctcgac agaaagagag gactcagaca cctccaagct tgtgaaagag attgtcaaac 360 tcagaagaga aattagacga cgagatgaat tatacaagga ggaactccag agagtcaaag 420 aagaatttgg cgctgccctc acagagtttc gacatgaatt actggcgaat cgacccccga 480 caccgcaagc ccaccccgag tcatgcgctc agagcggcca cgaggagatc cttcgcgaaa 540 tccaatcctt acgcgttgca gtcaatcctt cgggatcccc gtcctatgca gatgttgccc 600 gtactccccc caccagccaa ccgagcaata tacggactct ctcatcgtgg aacacgacac 660 cgactacctt caccgacacg ctatattgca cgatagacac ctcgaagatg gcagatactg 720 aaagtgagag accatcagca ggtccaatta gaacggcagt tgagaccgag atccggacaa 780 tggaaaacta cacgaactgg cggtgccgcg ctgtcacggt ggacccaaaa aacaccaacc 840 ggatcagaat tgcttgccga gatgaggctg aacaccagct ggttaagaag gtggcggaaa 900 ccaaggtcgg tgcgggagcc cgagtgctcc gtgatgaact ttacccgatc aaagtcgaca 960 gcgtcaacag aacggcagtg cttgatgaaa atggtgatat ccgagtagga gctgcggcag 1020 cctttggtga ggagaacgaa accaccgtcg ccaagattgc atggctgagc agaaaggaga 1080 atgcgaaggc ctatgggtca atggtcgttt acctgaccaa aggtagtgat gcacggaggc 1140 tcctggccga cgggttcttt cacgctgggg gagaatccgg cgtgaccagc acctttgaac 1200 accgaccacg accgatacaa tgctacaact gtcaagagat tggacataag gcattccaat 1260 gtaaaagtac tcaaaagtgt gcgagatgtg ctgcggaggg tcaccaccac agttgctgca 1320 accaatcggt tccaaagtgc attccatgcg gaggccctca cgaatcgtat agcaagaact 1380 gtcggaagct ctatccatca catcatgaat aagactcttc gagtcattca actgaacgtg 1440 agaaaacagg gtgcagtgca cgagagctta atgaatgatg aagagactca gaacacggtg 1500 gcgttagcga tccaggaacc ccaagcgcgg aggattcaag gccggctctt gaccaccccg 1560 atgggacacc ataaatggac gaagatggtc ccctctacct ggagggaagg cagatgggca 1620 gtacgaagca tgctctggat taacaaagaa gtcgaggcgg aacaagtacc 
gatagagtcc 1680 ccggacctca cggcagcggt tatcaggctt cccgagcgac tgatattcat ggcatcagtt 1740 tacgttgaag ggggcaatgc ctcagctttg gatgacgcat gtaaccatct actcgacgcg 1800 atcacgaaag ttcggcggga tacgggtgtg gtggtggaga ttttgattat gggagacttc 1860 aaccggcatg accaactctg gggaggagac gacgtgtctt tgggaagaca aggtgaagcg 1920 gatccaatca tcgacctcat gaatgaatgt gctctcagca gtctcctcag acgaggcaca 1980 aagacatggc atggcggagg acacagcggg gactgcgagt cgactatcga tctggtcctg 2040 gcttcggaaa acctggcaga ctccgtaatt aagtgtgcta tattagggac ggagcacggc 2100 tcggaccact gtgctattga gaccgtattc gatgccccct ggtcgctccc aaagcatcag 2160 ggacgacttc tgctgaaaaa cgctccatgg aaagagatca ataccaggat agcaaacact 2220 cttgccgcca ctccgtcgga gggtacggta cagcagaaaa ccgaccgact catgtcggcg 2280 gtctcggaag cggtgcatgc gctaacacca aagtcaaaac catcatcaca cgcgaagcgg 2340 tggtggactg ctgatttgac gcagctccgt caaatacaca catactggag aaatcacgcc 2400 cggtccgagc gacgagcggg gcgaaaagta ccctacctag aaacaatggc ccaaggcgcg 2460 gcgaagcaat accacgatgc catccggcaa caaaagaaaa agcactggaa tcaatttctt 2520 gccgacaatg ataacatatg gaaagccgag agatacctga agtcgggaga agacgcagcg 2580 ttcgggaaga tcccgcagct cctcagagcg gatgggacca ccaccaccga tcacaaggag 2640 caggcagaag aattgctggc caaattcttc ccgcctctgc cggacaacat tgacgatgaa 2700 gggactcgac cacaaagagc gccagtcgag atgcctgcca ttacgatgga ggagatcgaa 2760 cgacagctga tggcggcaaa gtcttggaag gcaccgggcg aagacggtat gccagcaata 2820 gtctggaaga tgacttggcc cacggtcaag tacagagtcc tggatctctt ccaggcgtcg 2880 ctggaaggag gcacgttgcc aagacaatgg agacatgcga agatcatacc actcaaaaag 2940 ccgaacaaag agaactacac cattgccaaa tcatggagac cgatttcgct gctcgcgacg 3000 ctgggcaagg tactggaatc cgtggtggcg gaaaggatct cacacgcggt agagactcac 3060 ggtttgctcc cgaccagcca cttcggcgcc cgaaagcagc ggtccgcgga gcaagcactc 3120 gtactcctgc aggaacaaat ctacgcggcg tggcgcggcc gacgggtctt gagtttgatc 3180 agcttcgatg tcaagggagc ctacaacgga gtgtgcaaag aacggttgct tcaaaggatg 3240 aaagcgcgag gcataccgga ggatctgctc cggtgggtgg aggcgttctg ctcggaacga 3300 acggcgacta tccaaatcaa tgggcaattg tctgaggtcc acagcctgcc acaggctgga 
3360 ctgccgcagg gctcgccgct atccccaatc ctgttcctct tcttcaacgc agacctcgta 3420 cagcgacaaa tcgacagcca gggaggtgcg attgctttcg tggatgattt cactgcatgg 3480 gtgacggggc caactgcaca gagcaaccgg gaaggtattg agggaatcat taaagaagcg 3540 ctccactggg aaagacgaag tggggcaact tttgaagcag agaaaacagc catcatacac 3600 ttcaccccca agacttccaa attggaccga gaacccttca ccatcaaggg acaggctgtt 3660 gaacccaaag accatgtcaa gattctgggc gtcctgatgg acacaagtct caaatataag 3720 gaacacatcg caagggcagc gtctaagggc cttgaggcag taatggagct tcgacgactg 3780 agaggtctct ccccatcaac agcgcggcag ctattcacat cgacagtgac ccctgtagtg 3840 gactacgcct ctaatgtctg gatgcatgca ttcaaaaata aggccacggg accgatcaac 3900 cgagtccaaa gggtaggagc gcaagcgatt gtcggaacgt ttctcaccgt tgcaactagt 3960 gtagcggagg cggaggctca cattgccacg gcgcaacacc gtttttggag acgggccgtg 4020 aaaatgtgga cggacctcca cacgctccca gataccaatc ctctccgcag aaacacagct 4080 aggatcaaga aattcagaag atttcaccgc tcgccgctgt accaagtagc ggacgcgctg 4140 aaaaacatcg agatggaaac actggagact attaacccgt ttacattagc accatgggag 4200 gcacgtatgc agaccgatgg cgaagccatg ccggacccac aggctatacc gggtggatca 4260 atacagatcg cgatcagcag ctcggcacga aacgggtttg taggatttgg ggtggcgatc 4320 gagaaacagc cacctcaata tcgaaaattg aaactgaaga ccttctccgt aacattgggc 4380 gcaagatcag agcagaatcc gttctccgca gagctagcgg ctatagcaca tacgttgaac 4440 aggctggtgg gactgaaagg cttcaggttc aggttgctca caagcaacaa agcgacggcc 4500 ctcacgatac agaaccctcg acaacagtca ggccaggagt tcgtctgcca gatgtacaag 4560 ctcataaaca gactgcggag aaaaggaaac catatcaaga ttctttgggt ccctgccagc 4620 gaagacaaca aactgctggg cctggctaaa gagcaggcca gggccgcgac ccacgaggac 4680 gctatcccac aggcacaggt ctccagaatg aaatcaacaa cgttgaatct cgcacgatcc 4740 caagcagcca ctacgaaagc cttgccagag gacgtaggaa gacacatcaa acgagtagat 4800 gcggcactcc caggaaaaca cacccgacag ctgtatgatg ggctatcctg gaaagaagcg 4860 accgtgctgg ctcaactccg gaccggcatg gcaagattaa atgggtatct ctaccggatc 4920 aatgtcgcgc agacagacca gtgtgcatgc ggacaagcaa gagaaaccgt ggagcatttc 4980 ctcttccgat gtcggaagtg gaccacgcaa cggatcgcac tgctacaatg tactcgtact 5040 
caccggggca atctctccct ctgcctgggc gggaaatcgc cttccaatga tcagcaatgg 5100 gtgccaaatt tggaggcagt acgagcctca atacgatttg ccatgaccac gggccgactc 5160 gacgccgtct aaccgcgtcg accaagacaa tgacactctc ccctaactaa cctacaccat 5220 ccaccttgtg ccacagatgg tagacgcgct tcaactgccg aggataggag gttgaacact 5280 tgaagaagcg ccttcaagcc agtctatcaa caaaattact aacccacagg gattcctacg 5340 atattgacag ctgagaggac atatactgaa aggatctatg tatgggcttc aaggagatac 5400 ggtagatgta cattagaacc tgtgttgata gatcagtgat agttgccctt tcggctgagc 5460 cgccgggcgt taatgaattg tgtgtg 5486 // ID copia-2-LTR_AF repbase; DNA; FNG; 407 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Long terminal repeat of copia-2_AF LTR retrotransposon - a DE consensus sequence. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; COPIA superfamily; copia-2_AF; copia-2-I_AF; KW copia-2-LTR_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-407 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-407 RA Kapitonov V.V. and Jurka J.; RT "copia-2_AF, a family of copia LTR retrotransposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 54-54 (2006). XX DR [2] (Consensus) XX CC It is a long terminal repeat of the Copia-2_AF LTR CC retrotransposon. It is characterized by 5-bp TSDs. 
XX SQ Sequence 407 BP; 150 A; 51 C; 81 G; 124 T; 1 other; tgtgaaagaa aataaatagt atatactaaa tagtatatga atgacaaagg aaatttcacg 60 aatatagagg ggaatagatg gcttgcctag gtataaagga agaataggaa gaataatagt 120 aggagaaaaa tctaaggata tcttagatta atgtcttaat tccttaatta attattttct 180 agctataggg tgatttyaag ggtgtattta tagatgattc cttctagcat aggccattcg 240 aatattaaat gatctcatag ccgggtgatc tcattaatta gatcaccatt tgctaatggt 300 tgaaatcgat gatgattgat gatgatctat aaccataagg ttagaaccat agcggatcaa 360 cccatgggta tgatcaacga tcaactatag catggttttc accaaca 407 // ID TCN3-LTR repbase; DNA; FNG; 502 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 30-MAR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR retrotransposon - LTR consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW TCN3-LTR. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-502 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-502 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-502 RA Gentles A. and Jurka J.; RT "C. neoformans LTR retrotransposon TCN3."; RL Direct Submission to Repbase Update (15-MAR-2005). 
XX DR [3] (Consensus) XX SQ Sequence 502 BP; 123 A; 144 C; 94 G; 141 T; 0 other; tgtaagtgca gagcccttac caccactcgt tctcaccata tggtaggatt tattcctatg 60 cattggtcga tgttggtaag tgagatttga tgccgaaacg cgtctttcgg acttcgccga 120 aacgcggcat cgagctctgt cccgccttta tcctatgcat gatattttat tctcatgcat 180 aggaccggca attgctgccg agacgcggcg gcggacttcg ccggatctct gccccttttt 240 tcgatggact tctgtttgtt tctgcataca tccatacata tatatatata tagatctcat 300 acatagttat agttttctct ttctctaagc gaaactcatg catatcaacc attgctagtt 360 tcgcatccag tagaccacta taactataaa caaacaccga acgcttcctc gaacgcatcc 420 gaacgccctc tagacctata ccacgaacgc ccatccttct cgaacgccaa ccgaacgcag 480 tgaggcaccc ccagtgccta ca 502 // ID Copia-1-I_CCi repbase; DNA; FNG; 5065 BP. XX AC . XX DT 21-JAN-2010 (Rel. 15.04, Created) DT 21-JAN-2010 (Rel. 15.04, Last updated, Version 1) XX DE Internal portion of a Copia-1_CCi LTR retrotransposon - DE consensus. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_CCi; KW Copia-1-LTR_CCi; Copia-1-I_CCi. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-5065 RA Kapitonov V.V. and Jurka J.; RT "Families of self-primed copia LTR retrotransposons from diatom RT and fungus."; RL Repbase Reports 10(4), 550-550 (2010). XX DR [1] (Consensus) XX CC This family is characterized by 5-bp TSDs and a short palindrome CC at the 5' terminus of the internal portion that likely serves as CC the RT primer. This family belongs to an ancient group of CC self-primed copia LTR retrotransposons. CC This sequence was derived from sequence data produced by the CC Broad Institute http://www.broadinstitute.org/ in collaboration CC with the Coprinus research community. XX FH Key Location/Qualifiers FT CDS 300..5033 FT /product="Copia-1-I_CCi_1p" FT /note="Polyprotein composed of the integrase and FT RT domains."
FT /translation="MSSTTVTTDNPLSATTISQWKRVCINKALEFAAYDIL FT TGEETAPTGSGQMKARQHYQERRSKMAGYLRSTLDHAQTETLLGDVDILDV FT PTIWTTLLTAYEPKDSGSRTSAIQELMTLRKQEDETYADYGARCVAAGQQL FT INRLPSGPNYVKETTASGSFFTSQATGATPVTAETVVTGSTFDKGYSAVDL FT VNDFAAAMMTVGLIAEEDKVLRHTLAHLKSTTASDILEHLRRADTLDKNTA FT LAEASAASALAASKKAAKKPKAKFNCSVHGPNKSHNDKDCRAQQQKAQIAE FT EDDETPPAIVKAQMAQVAHLASPPRIRSLNTNADESWNTDSGATSHMTPHR FT RWIRNMVPCRIPVELANKHVVWAVGKGSVVFSPVVDGKPAESVIFQNVLYV FT PALQNNLFSILTVVTKSRMRVVIEGNSLEFSRDNKVLFTASIKGTIGTLNG FT TTLAYNEQAYIARVDKHTLHCRLGHIGKGRLDTLINKELATGILVKPNTEV FT HNVCEHCQDGKQHRDPFPTHSNNRSPELLGRIHSDLHGPLTRTTAGHKYWI FT TFIDDYSRYKRVYLLKSKDEAFEKFKIFVAEVERKHGRKVKELRDDKGGEY FT IGKEFNAWCEKLGITRQHTVKGTPQQNGVSERFNRTAAEGVVAMLCQANLP FT SSFWSLALLYYVDILNMTPSASVSNTTSYEVWEGRKPDLSMIRTFGCRAYV FT NIQRKDRRNLDSHTTRCIFVGFEPGYKGWKCFDPQSKKFIISRDIVFDETL FT FPGLSTKSTTVTVKPVGIRDIWPYDDDDEWDTAPHGPGAPPDPPAPPAGFR FT STGDAPSTTTHESDSDSDDDDTPPYTPKPRVKKEHSGTDTPQVKEELVTPS FT TPTAKPPVARHAPYHREEEASPTPQGRSQPPVPPSSPEPLVENPAPLPAPN FT FGQGRATKPKPPPQPSQNPRPIRSTARVTDYYTLAGHSRRGTAPAPRRTAE FT ENGHPPNQDQGGDQVVDQGGAREDNLPPITADDINEEDEPDFHSAFKATPV FT LDSILHVYGMSGEYLTMAEACEMALEMVLDKVKEKALGAATRPEDSPRNWK FT EAMARPDRDKWISAARAEIDALIANGTWELVQLPQGRKAIGSRWVFLIKRK FT SDGTIDRYKARLVAKGFAQAPGIDYDQVFAPTSRLATIRAILAQAAMNGEF FT IESIDISNAYLNGEIEDEYEVYMVQPEGFEVPNQNGGRWVCRLKKGLYGLK FT QSGRLWYEKLAEELERLGFKQLKSDPSVYIWELDGIRVTLPVFVDDITIIS FT KDQKKIQWVKDALAKVFKLKDLGPTSYLLGIKVDYDREKRTLQLSQKQYIQ FT DMLVRFRLDAANTVKTPMNPSVRLSKDQCPKTEEEREEMRNIPYMNAVGAL FT MYLAIGTRPDIAFAVAKLAQFNSNPGMTHWTAVKHVFRYLKGTMDLKLTYR FT MDNNPQLASELFHTFSDADYAGCLDTRRSTSGFVIKMGTGAISWSAKKQAT FT VANSSTEAEYVSASSAGREIIWLRNLLREIGFSLDKPSPMMVDNQSAIKVL FT RNPEHHSKMKHIDIKMHWIRQEIKRREIEIHFCPTKDMVADILTKPLHRED FT VERHRIALGLV" XX SQ Sequence 5065 BP; 1437 A; 1478 C; 1164 G; 986 T; 0 other; ggttatgagc cccgggcata actgatttat cagtattact gctcgcgccg cctgacaact 60 cacgacgatc tgttcgatca cctcgcctac ggctccccct ttgctctctc atcgtccaag 120 tgtgccactt gaccttgaga ccaaccctgc ctcgtgtctt cccctttcct gtgcctacaa 180 
ccagtcttgg gagctcccag gactcgttaa gcccacttgc cacctcgtgc ctcgtcgttg 240 ctgacttggg agctcccaag tcaaccaatt gacgcatacc cttacctcca cactccgtta 300 tgtcgtccac caccgtcacc accgacaacc cactgtcggc cacgaccatt tcacaatgga 360 agcgggtctg catcaacaaa gctctcgagt tcgccgccta tgatatcctc acgggcgaag 420 aaacggcgcc aactggcagc gggcagatga aggctcgcca gcactatcag gagcgtcgtt 480 ccaagatggc cggttacctc cgctccacac tcgaccatgc gcagacggag accctccttg 540 gtgatgtcga catcctagac gtaccaacaa tttggaccac tctccttact gcatacgagc 600 ccaaggactc tggatcaagg acttccgcga tccaagaact catgacactc cgaaagcagg 660 aagacgagac gtacgcggat tatggtgcac gatgcgtcgc tgctggacaa cagctcatca 720 atcgcctccc ttctggtccc aattacgtca aggaaaccac agcatccggc tccttcttca 780 cgtcacaagc gaccggcgca acaccagtga cagcggagac tgtggtcacc ggttccacct 840 tcgacaaggg atacagtgct gttgacctcg tcaacgactt tgctgccgct atgatgaccg 900 ttggccttat agctgaagag gacaaggttc ttcgtcacac cctcgcccac ctcaagagca 960 ctacggccag cgacattctt gagcaccttc gccgagcaga cactctcgac aagaacactg 1020 cccttgcaga agcttcagct gcatctgccc tcgcggcctc taaaaaggcg gccaagaaac 1080 cgaaggccaa gttcaactgc tcggtccatg gccccaacaa gtcccataac gacaaggact 1140 gcagggctca gcagcagaag gctcaaatcg ctgaagagga tgacgaaaca cctcccgcta 1200 tcgtcaaggc ccagatggcc caagtcgcac acctcgcaag tccccctcgt atccgctcgc 1260 ttaacaccaa tgctgacgaa tcttggaaca cagactcagg agctacctct catatgacac 1320 cacaccgcag gtggatccga aatatggtcc catgcagaat tcccgtcgag ctcgccaaca 1380 aacacgtagt gtgggcagtc gggaagggaa gtgttgtatt ctccccggtt gtagatggca 1440 aacccgccga gtctgttata ttccagaatg tcctttatgt accagcattg caaaataacc 1500 tcttttctat cctcaccgtt gttacaaaga gtcgaatgcg agtggttatt gagggaaaca 1560 gtctcgagtt ttcaagagac aacaaggtct tattcacagc gagcatcaag ggcactattg 1620 gcacattaaa tggcacaact ctggcatata acgagcaggc atacatcgca cgagtcgata 1680 agcacaccct gcactgtcga cttggacaca taggcaaagg gagacttgac actctcatca 1740 acaaggagct ggctacaggc attctcgtca aaccaaacac cgaagtccat aacgtgtgcg 1800 agcactgtca ggacgggaaa caacatcggg atcccttccc aactcactcc aataaccgtt 1860 cccctgaact actcggacgc 
atccacagtg acctccatgg ccctctcacc cgcacaactg 1920 ccggacacaa gtactggatt acattcattg atgactactc acgttataaa cgagtgtacc 1980 tcttgaagag caaagacgaa gccttcgaga agttcaagat atttgtggcc gaagttgaga 2040 ggaaacacgg acgaaaggtc aaggaactcc gagacgacaa agggggtgaa tacattggaa 2100 aggaattcaa tgcgtggtgc gaaaaacttg gaatcactcg ccaacacacc gtaaaaggca 2160 caccacaaca gaacggtgtc agtgaacgat tcaaccgaac cgcagctgaa ggagtggtag 2220 ccatgctttg ccaggcaaat ctcccatcat cattctggtc actcgccctt ctctactacg 2280 tcgacatcct caatatgact ccatcggcat cggtctccaa cacgacttcc tacgaagtat 2340 gggaaggtcg gaaaccggac ttgtccatga tcagaacatt cggatgtaga gcatatgtca 2400 acatccagcg aaaggataga cggaacctcg actcccatac gacccgttgc atttttgtcg 2460 gttttgaacc aggatacaaa ggatggaaat gctttgaccc acaatcaaag aaattcatca 2520 tctctcgaga tatcgtcttc gatgagaccc tttttcctgg cctatccaca aagtcaacca 2580 cggttacggt caaaccagtc ggcatccgag acatctggcc atatgatgat gacgatgaat 2640 gggatacagc accgcatggt ccgggagcac cacctgatcc tccagctccc ccagcaggct 2700 ttaggagtac gggggacgct ccgagcacaa ccacgcacga gtcagactct gattccgatg 2760 atgacgacac tcctccatac acgccaaaac ctcgggtcaa gaaagaacac tctggaacag 2820 acactccaca agtgaaagag gaattagtga ccccttctac cccaacggca aaacctccag 2880 tcgcacgcca cgcaccttat catcgagaag aggaagcctc acccacacca caaggccgat 2940 cgcaaccacc tgtcccacca tcgtcaccag aaccactggt agagaatccc gctccactcc 3000 ctgcaccgaa ctttggacag ggaagagcaa ccaagccaaa accacctcca cagccatccc 3060 aaaaccctcg tccgatacgt tccacggcaa gagtcacgga ttactacacc ttagctggac 3120 actcacgcag gggcactgcg cctgccccaa gacgaacagc agaggaaaat ggtcatccac 3180 cgaaccagga ccaagggggt gatcaggtag tagatcaagg gggtgcaaga gaggataatc 3240 taccacctat cacagccgac gacattaatg aggaggacga acccgacttc cactcagctt 3300 tcaaagcaac accagtcctc gactccatcc tgcacgtcta cgggatgagt ggagagtacc 3360 tcacgatggc tgaagcatgt gaaatggcac tcgaaatggt tctcgacaag gtgaaggaaa 3420 aagcactcgg agcggccaca cgaccagagg actcgccacg caactggaag gaagcaatgg 3480 cccgaccaga cagagacaag tggatcagtg ctgctcgagc ggaaattgat gccctcattg 3540 ccaacggcac ctgggaactc gtccaactcc 
cacaaggacg caaagcaatt ggatcacgct 3600 gggtcttcct gatcaagcga aagtctgatg gcactattga ccggtacaaa gcacgcctcg 3660 tcgctaaagg atttgcccaa gcaccaggaa tcgactatga ccaagtcttc gctccaacct 3720 cccgccttgc caccatcaga gcaatcctcg cacaagctgc catgaacggt gaattcatcg 3780 aatccatcga tatctccaat gcatacctca atggagaaat tgaagatgaa tacgaagtat 3840 acatggtcca gccagaggga ttcgaagtcc ccaaccaaaa tgggggacga tgggtttgcc 3900 gcctcaagaa gggactgtat gggctgaagc agagtggtcg actctggtac gagaaactgg 3960 cagaggaact cgaacgactc ggattcaaac aactgaagtc cgatcccagt gtctacattt 4020 gggaattaga cggcatccga gtcaccctgc ccgtcttcgt cgacgacatc acaatcatct 4080 cgaaagacca gaagaagata caatgggtca aggacgcctt ggcaaaggtg ttcaaactga 4140 aagaccttgg accaacatcc taccttctgg gaattaaggt tgattacgat cgagagaaac 4200 ggacactgca actatcacag aagcagtaca tccaggacat gctcgtccgt ttccgcctcg 4260 atgcggctaa cacggtcaaa acgccgatga acccaagcgt ccgcctaagc aaagatcagt 4320 gtcccaagac agaagaggaa cgagaagaga tgcggaacat tccttacatg aacgctgtgg 4380 gagcactcat gtatctcgca atcggcactc gacctgacat tgctttcgct gtggcaaagc 4440 tagcacaatt caattcaaac cccggaatga cacattggac agccgtcaaa cacgtcttcc 4500 gctacctcaa gggcaccatg gatctcaaac tcacatatcg catggacaac aacccccaac 4560 tcgcctctga actcttccac acattcagtg atgccgatta tgcaggatgc ctcgacacga 4620 gacgatcaac aagtggattc gtcatcaaga tgggaacagg agccatcagc tggtctgcca 4680 agaagcaagc aactgtggcc aactcttcaa ctgaagcgga atatgtttct gcatcatcag 4740 cgggacgaga aattatatgg ctacggaact tgctgaggga aattgggttc tcactcgata 4800 aaccttcacc gatgatggta gacaaccagt cagcaattaa agtccttcga aacccagagc 4860 atcacagcaa aatgaagcac atcgacatca agatgcactg gatcagacaa gagatcaagc 4920 gcagggagat cgagatccat ttctgcccaa cgaaagacat ggtcgccgac attctaacaa 4980 aaccactcca tcgagaagat gttgaacgtc atcgaatcgc actgggactt gtgtaattat 5040 ctagtgacat tacattgagg gggtg 5065 // ID Gypsy-76_MLP-LTR repbase; DNA; FNG; 185 BP. XX AC AECX01001117; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-76_MLP_; KW Gypsy-76_MLP-I; Gypsy-76_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-185 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001117; Positions 65045 64861. XX SQ Sequence 185 BP; 47 A; 54 C; 31 G; 53 T; 0 other; tgttacgatc ctacatgtag cactcgtgac agatacatga catgtcactg agagtatcta 60 ttcactcgtt gtattacgta gttgctttcc ttctcatccg acaatcgata tacaaacctg 120 attgcttggt cctgaactcc ttcattccca cgacaagccc aaacccccct gagctgggcc 180 taaca 185 // ID Gypsy-67_MLP-I repbase; DNA; FNG; 6272 BP. XX AC AECX01001283; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-67_MLP_; KW Gypsy-67_MLP-LTR; Gypsy-67_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6272 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001283; Positions 29212 22941. XX CC Positions [3415-3870] - Reverse transcriptase CC Positions [4993-5472] - Integrase core CC 'TCTAG' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(970..2052,2056..6168) FT /product="Gypsy-67_MLP-I_1p" FT /translation="MPNYTYPGPVWPGGPPSDLPPFAPPRYLVLHVPQFAV FT PPPHFVPHPNPQFAQNQQHAQNASAPPHVVPQFNPYPQAPPFVAPQQTTPQ FT FQPPPPQFQFPQNPDFAAPPQMQPKFGHYHQQPQPAQQQRSTPTARPVSRA FT PTVELVTDARAPKICDNLCYYGDAKGLQQFLVEIHDELDQIVWKDDKAKIN FT WIARHFNSINSLNLSSTQIWFMGLLQKNAFRQGFLNPYGNLKALPYDLPEL FT LNLDNFLDELIYKFGDKHADKTCREELEACKQGKMSIIDYNSKFEMLSPHV FT KKTPEDKILLYVEGLHPSIQMEAARVAGWVNETDLSRKQAMAVEAADILDL FT RSKVAQSHPHLKVPGEVYHPNPHPLTRPAHHQVQHQASSNRNVNGPVPMDI FT DVNEVSLQRDGSNPFPAIRQICNTERLCYDCLKVYDDEHKRLRGLQGRRSC FT PNPPARMEDKLKLLRSSVNPAPEQPTQISAIDLDDAEYAAYTALPTATIDE FT TSRFVESFWENLSTPSYPSVAPHSYQPTVQEVRVDAIRVQADLDNPRRFLI FT PLRIVSDNLPISIMALVDTGASDSFIDLGFADLHGLNLNKKFVPQRVSGFD FT GASSTSVTKEWCGVMDVVDVDGVNSKFKAKLGVTKLGNGHDVILGLPWMME FT NEVTLLMSKKGRWLEIGGSVVSACLVEDEVVDVSLVSCEPKPSIISFSSPL FT DTTAIIPDPISSNSFLPPPDKSLFSNLPKSCEKYLHVFSPQESVLPPHRLF FT DIAIDLKPGCEPPFGGLYNLAPNEQIELKTYLNDQLSKGFIRPSKSPAAAP FT IFFVKVPGKKNIPCVDYCGLNKITKRDSYPIPVMSWLLNQLRGCKHFAKID FT LKAAFNLLRVAEGDECKTAFRTPWGLFEYTVMPFGLANAPAVFQRFIQWVL FT REYLDVFCFVYLDDILIFSKNAEDHAQHIEKVLSKLSEHKLTASPEKCQFF FT ATEVVFLGFVISTRGISMDPSKLETIADWPFPQDLSDLQCFLGFSNFYRRF FT IANFSGIVGPLTALTAKTADATLGLRTDKARRAFDQLRRLFSAAPFLLHFD FT FDLPHVVQVDASGYAYSGILSQKSDQGDLRPVAYFSKKLTEAERRWQIHDQ FT ELGAIVACFHEWRAWLMGTDKPVAVLSDHANLRYFMTSQALTPRQARWATF FT LGEFNFEILHTPGKSNPADPASRRSDFVCGKQDSSKMVLLGLREIKEAGIS FT AVHIRNPGSFDVSAYMPVSDTLRDRIVNAYHSDDLIAGIHPAFLYFLDGLW FT WWRDRVYVPLSLREEVLKQIHESSTGGHWGSMKTLDLLTRSFGWPNLRKDV FT LLYLEKCTSCQQFKVDRRPPQGQLIPLPIPDRPWATIGVDFIVKLPLLDGF FT DSVMVVVDHFSKAAHFIPAKESWTADNLAAAFVAQVFRFHGLPDTIVADRG FT TTLVSGTWKSVLRLLRVSPAPSTAFHPQTDGQVERLNALLEDYLQHFVAEG FT QDDWSGWLPLAEFSYNNSASSSTNMSPFFTLQGYHPCFNSLTGSSGRPNAD FT KFVGHIQGIQQQLVDNLTRAKEDQARYYNKDRRVEKSYEKGDLVWLSRKHL FT KTKRPNNKLDVRRLGPFRVMRMIGRNAAELDLPPELGRLHPVFNVSLLMPF FT VGKTQVFSSAERTRWGSLLIWEELHIKTVLDYRLNSLGIHEYLLRFNDASA FT LEDQWLPLSQLSPTIDASLERFHCLNTTVGPGPSQYVWIARAKRRVDDEER FT EKSLALPDGFLL" XX SQ Sequence 
6272 BP; 1629 A; 1383 C; 1362 G; 1898 T; 0 other; tttttattcc gatctttttt tactgagcgt tatttaaacg ttatcgataa atttttttta 60 aaaatttttc ttttgaattg atcacttaga tcctaatatt ttgttctcgt tttttctctt 120 ttttgactct tacttttcgc cagatcactt agatcatttt ttgtttagaa aactttaata 180 attatttttc gtttaaattc tcattgcttt ccttttttta gtttatgttt agcgtttttt 240 cacatctttc attttctcac gccttcgttt acctatatat acatcctgct cactattcga 300 cggaagattt attacattct ttcatcgatt tacctttttt tcagcctcca attgaagatc 360 cgatccagtt actggcaagt tttcgtcagc ttcctttctt cgagaaccca gacgaaaatc 420 cattttcact tccaacagca aattcaccga taacctctca agctctcttc caacagtcat 480 tggcctatta tttggattta gaggtttaca aagaactgta tctggaccgc cgtctttatt 540 actcagatcc cattatcacg cctccgacct ctctggattc agactcaatc gattcggact 600 cattatcaga tctagaatca ttccctcaag tatgaatcct tggagagaag gacctgaaag 660 cccaacggtt tcttcagtct ttggaagaga catgcctcct catcaaggaa ctagcagtga 720 gctctcggat cctcatgcta tcagaattga ttccccccaa attatcgcaa cacccgtttt 780 ccacattctc aatctcctgt ctctcgacta gctcaaggtt tgagtaattt ggatttcgct 840 actaatcgaa ggaatttgaa tcaaccagga cagcaggtac ctcgcggccc ccatccagca 900 gcccaaaatg agccaggttg gcgtttatta gcgcaaagat atcaacgaca gtatggttat 960 ggagcacaaa tgccaaacta cacctaccct ggtccagtct ggcccggtgg accaccatct 1020 gatcttccac cttttgctcc accacgatat ttggtcctgc acgtccctca atttgcagtg 1080 ccaccccctc atttcgtacc acatccaaac cctcaattcg ctcagaatca gcagcacgcc 1140 caaaatgcgt cggcacctcc gcatgtggtt cctcagttta atccgtatcc acaagcaccc 1200 ccttttgtag cccctcaaca aaccacccct cagtttcaac ctcctcctcc acaatttcaa 1260 tttcctcaga acccagattt tgctgctccg cctcagatgc aacccaaatt cggtcattat 1320 catcaacaac cacaaccagc tcaacaacag cgttcaactc ctacagcaag gcctgtttcc 1380 agagctccta ctgtagaatt ggtgacggat gccagggcgc ccaagatttg tgataatctt 1440 tgctattatg gagacgctaa aggtttgcag cagtttttgg tggaaatcca tgatgaattg 1500 gatcagattg tgtggaagga cgacaaagcc aagatcaatt ggattgctcg acattttaac 1560 tcaataaatt cattgaattt gtcaagcacg caaatttggt ttatgggttt attacagaag 1620 aatgcgtttc gtcaaggttt cttaaatccg tatggtaatc 
tcaaagcctt accttatgat 1680 ctaccagaat tattgaatct tgacaatttc ttggatgaac tgatttataa gtttggtgat 1740 aagcatgctg acaagacttg tcgtgaagag ttagaagcct gcaaacaggg taaaatgtcg 1800 atcatcgact ataattcaaa attcgaaatg ttgtcacctc atgtcaagaa aactccggaa 1860 gacaagattt tattatatgt tgaaggcctt cacccgagta ttcagatgga agcagcaaga 1920 gtggcgggtt gggttaatga aacagactta tcacgaaaac aagctatggc ggtagaagcg 1980 gctgatatct tagatctacg ttccaaagta gctcagagcc atccgcattt aaaagtccca 2040 ggtgaagttt attgacatcc taatcctcat cccttaacaa gaccggctca tcatcaagtt 2100 caacaccaag cttcgtcgaa tcgcaacgtt aatggtccgg ttcccatgga catcgatgtt 2160 aacgaagtat ctcttcaacg ggatggctct aacccttttc cagctattcg tcaaatctgc 2220 aacactgaga ggttatgtta tgattgtttg aaggtttacg acgatgaaca caagaggctc 2280 agaggtttgc aagggagaag atcttgtcct aatccacctg ctcgcatgga agataagttg 2340 aagcttttac ggtcatctgt caatccagcc cccgaacaac cgactcaaat ttcagctatt 2400 gatttagacg atgcagaata tgccgcttat accgcccttc caactgcgac aatcgacgaa 2460 acttctcgat ttgtcgaatc attctgggag aacctatcaa ctccaagtta tccctcagtt 2520 gctcctcatt cgtaccagcc cacagtacaa gaggttcgag tagatgcgat tcgagttcag 2580 gcggatttgg ataacccacg tcggtttcta attccattac gtattgtctc tgataatctt 2640 cccatttcaa ttatggcact cgtggatact ggtgctagtg acagttttat agatctgggt 2700 ttcgctgatt tacatggttt gaatcttaat aaaaaatttg ttccacagag agtttcgggt 2760 tttgatggtg cttcaagtac gagtgtgaca aaggagtggt gtggggtgat ggatgtagta 2820 gatgtggatg gggtcaactc aaaattcaag gctaaattag gtgtgacaaa gttagggaat 2880 ggtcacgacg taattttagg tttaccttgg atgatggaga atgaggttac tttgttgatg 2940 agcaagaagg gtagatggct ggagattgga gggagtgttg tgtcggcatg tttggtggaa 3000 gatgaagtgg tcgatgtatc tttagtctct tgtgagccta agccttcaat catttccttt 3060 tcttcccctt tagataccac agcaatcatc ccggatccaa tttcttcaaa ttcttttctt 3120 cctcccccgg acaagtccct tttttctaat cttccaaaga gttgcgaaaa gtacttacat 3180 gttttctctc cgcaggaatc tgtattacct ccccatagat tgtttgatat tgcaattgat 3240 cttaaaccgg gttgtgaacc tccatttgga gggttgtata accttgcgcc caatgaacag 3300 atcgagctga agacctatct taatgatcag ttaagtaaag gcttcattcg 
gccttctaag 3360 tcccccgctg ctgccccaat tttctttgtg aaagtcccag gtaagaaaaa tataccttgt 3420 gtcgattatt gcggactgaa caaaataacc aagcgtgaca gctatcctat cccagtcatg 3480 tcttggttgt tgaatcaatt aagaggttgc aaacattttg cgaaaattga tttgaaagcc 3540 gcattcaatt tacttcgagt agcagaaggt gacgagtgta agactgcatt tagaactcct 3600 tggggtttgt ttgagtatac agtgatgcct tttggacttg ctaatgctcc tgcagttttc 3660 cagcgattca ttcagtgggt acttagggag tatttggacg ttttttgctt tgtttatctg 3720 gatgacatac tgatcttttc gaagaatgca gaagatcatg cgcaacatat cgaaaaggtc 3780 ttgtcgaaat tatcggaaca taaattaaca gcttcaccgg aaaaatgtca attttttgcc 3840 accgaggttg tgtttttagg ttttgtaatt tcaacaaggg ggatcagtat ggacccgtca 3900 aaacttgaga ccatagctga ttggcctttc cctcaggatt tgtcagattt gcagtgcttt 3960 ttaggttttt ctaactttta tcggcgtttc attgcaaatt tctccggtat agtaggacca 4020 cttactgcac ttacggcgaa aacggctgat gcgacgttag gactaagaac cgacaaagct 4080 agaagggcat ttgaccagct acgtcgtttg ttttctgcag ccccatttct tctccacttt 4140 gactttgatt tgccacacgt agtgcaggta gacgcttcgg gatatgccta ttctgggata 4200 ttgtcgcaga agtcggatca aggggatttg cggcctgtcg cgtatttttc gaagaagttg 4260 acggaggcgg aacgtcgttg gcagatccat gatcaggaac taggagcaat agtagcctgt 4320 tttcatgagt ggcgagcttg gctaatggga actgacaaac ctgtagctgt tctgtcagat 4380 catgctaact tacgttactt catgacttcg caggcgctga ctccccgaca ggcacgttgg 4440 gcgacattct taggcgaatt taacttcgag attctgcata cgcctggaaa gtcgaatcca 4500 gcggatccag cttcacgcag atctgatttt gtttgtggga agcaagattc ttcgaaaatg 4560 gttttgttag ggctacggga gatcaaggaa gcgggtatca gtgcagtcca cataagaaat 4620 ccggggagtt ttgatgtttc cgcttatatg cctgtctcag atacactacg agatagaatt 4680 gtgaatgctt accattcgga cgatttgatt gcaggaattc atccagcctt cctatatttt 4740 ttggatggtt tgtggtggtg gcgagaccgg gtttacgtcc cactttcttt gcgggaggag 4800 gtcttaaaac aaattcacga gtcctctacg ggtggccatt gggggagtat gaaaacattg 4860 gacttgctga cgcgctcttt cggttggccg aatttgcgga aagatgtgct cctttatctc 4920 gaaaagtgta ccagctgtca gcaattcaaa gtggatcgtc gccccccaca aggacaactt 4980 attccgctgc cgatacctga ccgaccgtgg gcaacaattg gtgtcgactt tattgtcaaa 
5040 ctgccacttt tggatggttt tgactctgtc atggtggtgg tcgaccactt ttccaaagct 5100 gctcatttta ttccagcaaa agagagctgg acggctgata atttggctgc tgcctttgta 5160 gcacaggttt ttcgatttca cgggttacct gataccatcg ttgcagatcg aggcaccaca 5220 ctagtctctg ggacttggaa gagtgtcttg cgacttctcc gagtatcgcc ggcaccgtcg 5280 acggcttttc atccccaaac ggacggacag gtggagcggt tgaatgcctt gttggaagac 5340 tatctacaac atttcgtggc tgaaggtcag gacgattggt ctggttggct accactagca 5400 gaattttcat ataataactc agcttctagt tcaacaaata tgtccccatt cttcacactc 5460 caaggatatc acccttgttt taattctttg accggctctt caggacgacc taatgctgac 5520 aaatttgtgg gtcatataca gggaatccaa caacaactcg tagataatct cactcgtgca 5580 aaagaagatc aagcacgtta ttataacaaa gatcgacggg tggagaagtc ctatgagaaa 5640 ggggatttgg tgtggctctc aaggaaacat ctcaaaacga aacgacctaa taacaagctt 5700 gatgtgcgaa gattgggtcc atttcgagta atgcgaatga tagggaggaa tgcagctgag 5760 ttagatttac ctccagaatt gggacgcctt cacccggtat ttaatgtttc ccttttaatg 5820 ccttttgtag ggaagacaca ggttttctca tcggcggaga ggacccgttg gggatcactt 5880 ctgatttggg aagaattaca catcaagaca gttttggatt accggttgaa ctcgttagga 5940 attcatgaat atcttttgcg gttcaatgat gcttcggcgt tggaggatca gtggctacct 6000 ttgtcacaat tgtcaccgac aattgacgca tctctggaac gatttcattg ccttaacact 6060 acagtaggcc caggacctag tcaatatgtg tggattgcaa gagctaagcg ccgtgtcgat 6120 gatgaagaac gtgaaaagtc attggctttg ccggatggat ttttgttgtg atttgccaac 6180 ccctgtgtgc ctaaacaagc cactcaagac ttgggtatgt gaaggcggtc gcagaaattt 6240 ttgccatcaa tggtataagg gaggtcataa tt 6272 // ID Copia-22_MLP-LTR repbase; DNA; FNG; 292 BP. XX AC AECX01000990; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-22_MLP_; KW Copia-22_MLP-I; Copia-22_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. 
XX RN [1] RP 1-292 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000990; Positions 1846 2137. XX SQ Sequence 292 BP; 68 A; 73 C; 77 G; 74 T; 0 other; tgttatggcc ggaggccata ggcggttagg ttagtgaacg gggccacgtc tacgacaaag 60 cccggggctt aggtttgggg atcttataca attagttgta ccactcctcg atatcccggc 120 tcttcaccca ttggagccgg aacatcgagc gaggaacaga gtgctcactt gagactctgt 180 cactacaggt tagcctattc ttctctctat gttgtagtgt ttgcaatcca acccattgga 240 gccggaacat cgagcgagga acagagtgct cacttgagac tctgtcacta ca 292 // ID Gypsy-7_RO-I repbase; DNA; FNG; 5149 BP. XX AC AACW02000098; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_RO_; KW Gypsy-7_RO-LTR; Gypsy-7_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-5149 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000098; Positions 171741 176889. XX CC Positions [3723-4199] - Integrase core CC 'CAAAT' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(48..1553,1557..4823) FT /product="Gypsy-7_RO-I_1p" FT /translation="MAAQNNLPMEINTINNAQDVSPMVPADIRSSSSQEAT FT EPVLPYQNDADMMETADDSLLGIGNTDSLTILRQQLQEVSQRFARGVSNHL FT PMDQLESLRNEASHIANCIKYLLEVQTYCDPPKIETTQQSINQSSNRFDHF FT IPNDLPTWQWVGNVWRTDVEVHDSVEDLLDTFALIVSSNGLAIDQHWMRLV FT PIKMNRDQRSWFNEVLKGRSLNWDEARSIIINTYATQDVAQTLNSIDQLLT FT IKMQQNESIEAFTDRFQRIRRAAKWQDDIRTAALFKRALPLALYKEVSRSL FT LNLPLNQQDSVCKVSAKARTVISSNICNDEAMAAGVTKHNITTQSPANPIS FT FVSGADKSMHNPKNASGPSKQGQPQFKPKSKCLVHGNGNHTTEQCKVLKRI FT ATANTATTATTATNVTPQASNTKKNCYKCSANVPWSPDHAATCSKDKVKRF FT NGPTKAFRSARFSGSHNRSKHLPAAVNLNDQEHSHGSMDIDLSSLPTNYDC FT KEKTSKNEVKTNHSLYVPITIENIDTFALVDSGASFSSIDIAFANKNNINI FT NNNVSGSIIFATNDNKSKRFGTTSKALSLIYGDNDNTLIHTSHIFEVLPLS FT FDTDAVIGLDLMPKLNILITNLAVKHPNKKPAAKEEIDDTPEPNNAPYGTT FT DQQINFMAAVKPDLDQNAQIPKNSFCTVPESIIHLDTEKNKTAYRSPYRIP FT LKLMPVMRECVDTWLKDEVIEYAPPSSEWNSPLTLAPKKDLHGNLTGYRPC FT LDPRLLNSILISNDKQPIPKIDEIFDQLQGSTIFTTLDLRQAFHRFQIYKP FT DRVKTTFTFQGQQYMFKGCPFGLKHIASRYQRVINSILADVPYALAFVDDI FT IIFSKSYEEHITHVQNVIKKLTNVNLILNVDKCHFAQTSVYLLGFCVDAKG FT SQLDPRKVTNALSWEKPSTGKDIQRFLGLVNYFRKYLPNLSEVTAPLDKLR FT FEGKLTSKIWDQEQDDAFTKIKQLLASAPVLYHPDLNEPFYVATDASNYSI FT GAVLYQIIDKQIRHLGFMARALSTSEKNYSTTKRELLAIVFALKKFHPFLW FT GNPFTLYTDHKALTYIHTQPVANSMMIQWLDTILDYDFKIIHRPGIQNVLP FT DMLSRLFESERTLVGDKNQHKTWITPHIANSKNTNMTTRMMMSDDLVTPAP FT EERQELLLKTHLEGHRGSQAMVTTLHSEGIHWTNLKQDALDCINSCPDCQK FT FNVAKHGYHPLSSIYADAPWDHICIDTAGPMTTSIQGNNYILVVVDVFTRF FT CVLKAMPDKSSHTIALALRNILSLFGRPKIIQSDNGTEYVNDIIRKYTEIS FT GIDHRLITAYHPRSNGIAERWVGKTKNIIYKRLQGKNDDWDLYLDSTQEAL FT NNTPTALHDTRPFSLMFARRPNQFKDYSDVSIPTNYNETRKQFNGHITKFN FT NTVLPAIREKIKTSQTAAENKFNKSHRIIKEIPSGSQVMIKNINRTAKTDP FT LYIGNYTIVRKNQGGSYILVDGTGALLPRNIPPSHIKVISEEQSEVYDVEA FT VIDHKGTPGNWLYLVRWKGYDAKDDTWEPEKHFHDNRPILKYWNRRNGQHN FT VEPNSSEKRPRKTHDNSSRKRARN" XX SQ Sequence 5149 BP; 1730 A; 1137 C; 846 G; 1436 T; 0 other; tttttttcaa cgaattctaa attttattgc tctatcaatt ttctaaaatg gccgctcaaa 60 ataacttacc tatggaaatc aacaccatca 
acaacgctca agatgttagc cctatggttc 120 ctgctgatat ccgttcttca tcaagtcaag aagcaactga accggtttta ccttaccaaa 180 acgatgctga catgatggag acagctgatg atagtctcct gggtattggt aacactgatt 240 ccttgactat acttcgccag caacttcaag aagtatcaca aaggtttgct agaggcgtat 300 caaaccattt acctatggat caactggaaa gccttcgcaa tgaagcatca cacattgcaa 360 actgtatcaa gtaccttttg gaagttcaaa cctattgcga ccctcccaag attgaaacaa 420 ctcaacaatc cattaaccag tcgtcaaacc gctttgatca tttcattccc aatgatcttc 480 ctacatggca atgggttggt aatgtgtgga gaacggacgt agaggttcat gattcagtag 540 aagatctcct ggacacattt gcattaattg taagttccaa cggacttgca atcgatcaac 600 actggatgcg cttggttcca atcaaaatga atagagacca aagatcctgg tttaatgagg 660 tgctaaaagg tcgaagcttg aactgggatg aggctaggtc tatcatcatc aatacgtacg 720 ccacccaaga tgtcgctcag acgctaaaca gtattgatca actcttgacc atcaaaatgc 780 agcaaaacga gtcgatcgag gcttttactg accgattcca aagaatcaga cgcgcagcaa 840 aatggcaaga tgatattaga actgctgcct tgttcaaacg tgctttgcct ctcgccttat 900 ataaggaagt gtcccgctcc ttgctaaact tacctttgaa tcagcaagac tctgtttgta 960 aggtgtctgc caaggctcgt actgtgatct cctcaaatat ttgcaatgat gaagccatgg 1020 ctgctggtgt tacaaaacat aacatcacaa ctcaatcgcc tgcgaaccca atttcgtttg 1080 tgagtggtgc tgacaagtcg atgcacaatc ccaaaaatgc tagtggtcca agcaagcaag 1140 gccaacctca attcaagccc aaatcaaaat gtcttgttca tggtaatggt aatcatacca 1200 ctgaacagtg taaagtcctc aaaaggattg ccactgccaa taccgccact actgccacta 1260 ctgccaccaa cgtcactcca caagcatcaa atacaaagaa aaattgttac aaatgttcgg 1320 ccaacgtacc atggagtcct gaccatgcag ctacatgcag caaagataaa gtcaagaggt 1380 tcaatggacc aactaaagct ttccgttctg ctcgtttctc tggttctcac aatcgaagta 1440 aacatttacc tgccgctgtc aacttgaatg atcaagagca cagccacggc agtatggaca 1500 tagatcttag tagcttgcct acaaactatg actgtaagga aaaaactagc aaatagaatg 1560 aagtgaaaac taaccattct ttgtatgtac ctatcactat agaaaacatc gacacctttg 1620 cccttgtgga ctctggtgct tcttttagct cgatagatat tgcattcgct aataaaaata 1680 acattaatat aaataataac gtttctggtt ctattatttt tgctacaaac gataataaga 1740 gtaaacgttt tggtacaact tctaaagctt taagcttaat ttatggtgat 
aacgataata 1800 ctctcattca cacatcacat attttcgaag ttcttccact ttcctttgat acagatgctg 1860 tcattggttt agatttgatg cctaaattaa atattttaat aacaaattta gctgtaaaac 1920 atccaaacaa aaaacctgct gcaaaagaag aaatcgatga tactcctgaa ccaaacaatg 1980 ccccttatgg taccactgat caacaaatta attttatggc tgcagtcaaa cctgatcttg 2040 atcaaaatgc acagatacca aagaatagtt tttgcacagt acctgaatca attattcatc 2100 ttgatacaga aaagaacaaa acagcctatc gatcacctta ccgtattcca ctcaaactga 2160 tgccagtaat gcgagaatgt gtcgatacat ggttaaaaga tgaagtaatt gaatatgctc 2220 caccaagttc agaatggaat tctcctctca ctttggcacc caaaaaggat ttacatggaa 2280 atctcacagg ttatagacca tgtctagatc ctcgtctgct gaattctatt ttgatatcta 2340 atgataaaca acctatacct aaaattgatg aaatttttga tcaacttcaa ggttcaacta 2400 ttttcactac tttagactta cgacaagctt ttcatcgttt ccaaatatac aaacctgatc 2460 gtgtcaaaac aaccttcacc tttcaaggtc aacaatacat gttcaaaggt tgcccatttg 2520 gtctcaagca tattgcttca cgttatcaaa gagttatcaa ttctatccta gctgatgttc 2580 cttatgctct tgcatttgtg gatgatatta tcattttttc taaatcttat gaagaacata 2640 ttactcatgt acaaaatgtc atcaagaagc tcaccaatgt taacttaatc ctgaatgttg 2700 acaaatgtca ctttgctcaa acatctgttt acctacttgg tttttgtgtt gatgccaaag 2760 gatctcaact cgatccacgc aaggttacaa acgcactttc ttgggaaaag ccatccacag 2820 ggaaagacat tcaacgcttc ttagggttgg taaactactt taggaaatac ttgcccaatc 2880 tatccgaagt tacagctcca ttggataaat tgcgttttga aggcaagctc accagcaaaa 2940 tttgggatca agaacaagac gacgcattca caaaaattaa acagctactt gcttctgctc 3000 cagtcttata tcatcctgat ctcaatgagc cattctatgt tgcaacagat gccagtaatt 3060 attccattgg tgctgttctc taccagatca ttgataagca aatccgtcat cttggtttca 3120 tggctcgtgc tttgtctaca tctgaaaaaa attactcaac tactaaacga gaactacttg 3180 ccattgtttt tgctctcaag aagtttcatc catttttatg gggtaatcct ttcactcttt 3240 acacggatca caaagccctt acttacatcc acactcaacc tgtcgccaat tctatgatga 3300 tacaatggtt agacacaata ttggattatg acttcaaaat catacacaga cctggcattc 3360 aaaacgtact accagatatg ttatcgcgtc ttttcgaatc cgaaagaacg ctggtagggg 3420 ataaaaacca acataaaacc tggatcacac ctcacatagc taattccaaa aacacaaata 
3480 tgactactcg catgatgatg tctgatgatc ttgtaacccc agcaccagaa gaacggcaag 3540 aattattact caaaacccat cttgaaggac acagaggctc acaagcaatg gttaccactc 3600 tacacagtga gggaatccat tggacaaact taaagcaaga tgcattagat tgtatcaaca 3660 gctgccctga ttgtcaaaag tttaatgttg ccaaacatgg atatcatcca ttatcgtcaa 3720 tctatgctga tgcaccctgg gatcatatct gtatcgatac tgccggacct atgactacat 3780 caattcaagg aaacaattac attctagttg tagtagacgt attcacaaga ttttgcgtac 3840 tcaaagcaat gcctgataaa tcttcacata ctatcgcact cgcattaaga aacatactat 3900 cacttttcgg tcgtccaaag attatccaaa gcgacaatgg aactgaatat gtcaatgata 3960 tcatccgtaa atacactgaa atttctggta tcgatcatcg tcttataaca gcatatcacc 4020 ctcgaagcaa tggtattgca gaacgttggg ttggaaagac aaagaatata atatataaaa 4080 gacttcaagg caagaacgat gactgggact tataccttga cagtacccaa gaagctctaa 4140 ataacacgcc aactgcctta cacgacacaa gaccattttc actgatgttt gctagacgtc 4200 caaatcaatt caaggactat agtgacgtgt caattcctac aaactataat gaaacaagaa 4260 aacagtttaa tggacacatt acgaaattta ataacactgt tttaccagct attcgcgaga 4320 aaatcaaaac ttctcaaact gctgccgaaa acaagtttaa caaatctcat cgaattatca 4380 aggaaatacc atctggtagt caagtcatga tcaagaatat aaaccgaact gcaaaaacag 4440 atccactcta cattggaaat tatacaattg tcagaaagaa ccaaggagga agctatatcc 4500 tagttgatgg tactggcgca cttttacctc gaaatattcc accttcgcat atcaaagtaa 4560 tatcagaaga acaatctgag gtatatgatg tcgaagccgt aattgaccac aaaggcactc 4620 ctggtaactg gttatacttg gtacgttgga aaggctatga tgcgaaagat gatacttggg 4680 aacctgaaaa acactttcac gacaacagac cgatactaaa atactggaat cgtcgaaatg 4740 gacaacataa tgttgaacct aattcaagtg aaaaaagacc aagaaaaact cacgacaatt 4800 caagccgtaa aagagcaaga aattaaccgt gactgcaatt catcttatta cgtattatct 4860 caacgcctta cgctaccttt caataattca aaaacttcca ataatccaat ctacaatgtc 4920 aatttacatt caatccacag ttcattcaca agtagaatcg tcaacataaa aaacctttca 4980 catgttgcta cctatacatt cttataacat gcacttacga attacattat tgccataaca 5040 tattgctttg ctcaatatta caaaacagag aacttcactt acactcgtct tcacaattga 5100 acactcatga atctaaatcc agaagtagtt atgacctgga agggggcaa 5149 // ID 
Gypsy-5_LBS-LTR repbase; DNA; FNG; 165 BP. XX AC ABFE01000277; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_LBS_; KW Gypsy-5_LBS-I; Gypsy-5_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-165 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000277; Positions 127002 126838. XX SQ Sequence 165 BP; 38 A; 48 C; 20 G; 59 T; 0 other; tgtaaggaca cctatttgct catatttaca cacattgaca cacgcatcca gctataagta 60 gatccattct agctttcgtc ttcttccttc ctttttaccc tcactttacc tcttatacac 120 gatttggaac tgttcactcc tcgtgtgatc cagtcccgta ttaca 165 // ID Gypsy-5_CCO-I repbase; DNA; FNG; 8959 BP. XX AC AACS02000002; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_CCO_; KW Gypsy-5_CCO-LTR; Gypsy-5_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-8959 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000002; Positions 184205 193163. XX CC Positions [4216-4677] - Reverse transcriptase CC Positions [6223-6714] - Integrase core CC 'GGATG' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 472..8919 FT /product="Gypsy-5_CCO-I_1p" FT /translation="MRAPSLRARATPRISQKPRFEREALGPHSPLSKSVQI FT EIAQSGLSRAQRDQIEKRRERVNNLGARSRDPTPGPSKDKGKGVDVRNWGD FT VVRTTTDERELDPEVQKALLDQAIGNVLEAERVAGIVHPYETEDDIEQATN FT PDTKGTNSKDKGKKDISESKKRKNKERREKSREKSRARKKKAATPLSTSMG FT DLIDKVTKNVDRGSDDDGSNDSDNDSDDNLNPAGKIDPKSTLGATFKKLDK FT LEKEESKKTKKKSANKGKKKGKKSKKDKRKRGRSRTRKNGSRNNDGDDEPS FT DSSDSSDSSSEESEGNCRSSDDGSGSSSSSDDTSGDSDSDDGRRRRKARNV FT KSVTPPDYKGQQNVPHFQRYMTLATNRVEDLGRRLPSNRRVGYVAQYLKGR FT AWDWYHREIAYREERMTLSEFFRRLYNWVFPATWKQEQRDRLDAIRQGGQT FT TREFYGRFRELCDTVGLTHKGEMVQRFFRALRPEFREAVFTRRLDPDTISM FT RKLIKECEYVELARSVRQYHTHQNGQDGGKRRRDADGNRRDRGERRDKRQR FT QDRSRDRREDRKSKKTNATPGFNSNQTKFKRDNERREKRKVLTEEKKAKLR FT KENRCFHCEETGHMASRCPKRNAVASSSRGPPGISNSNIEIELDMAHEDDG FT TTSGSDGDDEVSNLSLHLNSAALTDNLPPSSTTSLTTIEDRRSERTDLVKE FT ADELVSVLECLDDDELIDLPWKTRVPGWEDEHAERQALIPMGDPLLRKAEV FT LLKAGVPYPGDPSSWETYDGGRFYVYRTNKGKSYAIMDQLRDCEDTIEWDL FT LRDPEFNLANWYSNRLNGTSLDEFEAPRQALGDVVGQYLVDVLERNGPYPF FT DHVWDPDYQPVPRGERRFHFEFEDGIWDVEDGGYWIFDKELGFKTFIPLRD FT TRSPSNRIVRQYTSRLYDAQNRFFDFFSINPLEENILALLDRTPDEEEAEA FT IDRLREAEEEVLWNHRIRLDYSDGYEHVIELFGKQVEAGTYPAIQRNSSRR FT KELGRSVPKPVVVQALVNGKPVRALIDTGSLGDFMSTTLAEQLKVKRRELE FT CPVNVQLAVSGSRTKVNYDTLCHFKYQEIDEERRFDIINLSNYDLILGTPF FT LFQHQVLIGFNPFRVVVGTNESLPMEGESVRKLAVSHLSPELTDIERIRAE FT LLEYARPICKSAAETPLPPLRKINHRIPLIDSNKSYPRRRSRCPEALKAQW FT DAKRVAYIKTGRWVPTTSYNTVPMLFITKPGKNGDPPRLRTVVDLRERNAN FT TKKMNCPLPDMDGILRRISRAKYRTIIDGQDAYEQIRIEVSDVPNSAMMTP FT EGAVLSQVMQQGDCNAVSTFQTIMTDLFSEHIGKFVEVYLDDIIIFSDTLE FT DHVRHFKTVIDLLRKESFYLSERKINILPEEMKVLGRVVDRDGIRMDPDKV FT DALAKWKTPTNRELLRGFLGAAGYLADDIDRVRIPMGVLTTLTSDNVPFRW FT THTHQRAFEEIKDLACRFKDHHRVPLNYSPDAPSINMVTDACITGVAGVIS FT QGDDWKDAKVAAFYSAKLNPAQTNYPVHELELLAGVETMLRNRDLLQGVHF FT KWYTDHKGLIYLLKQEKLSRRQVRWMEKISDFDFEVIYVPGTENILSDALS FT RIYSNDAPGTVRSPSEYVQFDDSADDSLFFGGIAPKSTPAPVYVGKEALSV FT VGPGAKRRASPRKKPDPPLPADTGRPETTKEWAARMAATRNFVLRGPRVRR FT 
EGGMSEQNENEPPKSTPAKEKLVIRIPARRKSAEAEAKSQSRQEPAVPTDQ FT AVPNVGENVLLRAPEANPQGIDIPRAIKGKYVHCPFFKILIENPKHYKNFE FT VEQDLIYMRLEDGGRVLCIPSGAFYAGRSVREIVISEAHGLLAHLGPKRTL FT EYLKQHVWWKEMVRDVTLFCESCPTCKRSKPSNHKPYGLLNPLQIPGAPWE FT SIGIDFVGPLPVSKDRDAEYDAITVIIDRFSGMVHLVPSRQDYKAKEVAEL FT IFSEVYKHHGLPRSIVSDRDKWFTSVFWEHLHKLTGVKLKMSSAYHPQTDG FT ATERANRTITQLIRQCIGPKQRDWVARLPGIEFAINLARSETTGYSPFFLN FT NGRLPRTFIWDWASKEEFPGVRTFAHKLKIAVMGAHDAIIGARVKQTRDAN FT RKRIVSPFKANDLVYLSTENITFPKGLTRKFLPKFIGPYKIVEDFGNNSYR FT IELPARMKQRGVSDVFHASKLRIHVPNDDRLFPGRVDNQIWEYDDEEFETE FT WAVSRITNHAGAKSSAKFEILWESGDKTWLPYSRVADLVALQDYLDALGVE FT KISDLPEKAVPARMDEQIELGYLAGMELELAPSYPPVSSPPCSTSVSPVSM FT TSRPVNRPVTLFPGISRDLDAASRFTSAPFRNRYFTHVAYDTHTREVVTRV FT PLGTERRFAVDHFISCVHHAIYLASITTPDPSVPAPAGYEEVARSFNDSTY FT CRAKFPVVESGAPFVHPSCRPIKEFDFDFSLVRLNPLGVHGSALSMLGLVN FT NNQLDIARIRTMYERNDRMLDFALEALTTGRSPFSNSRNAFHNEAFQNRRF FT QNRNRARNSRPYDDGPSNRGGRGRPIFRGGARGGPSLGRNGTFRHVWRRRE FT QLDSASSSDYTLPAYRTDESSAYTSPRSANAYSNLPVVPPAVRSTTTSDPR FT VTENRRLLSELPTGGSPPSSDGSSADPSTASDPDQAEDPNQGVEERQQSPS FT WDDDTPMDDDDVSVIVASEDSINYEPASPNAGSGTVTPPWPAGTAPTDFAA FT SPPGSPMAEDTDLDAEGEPDDSATEDAGLAARQAREREMLDKFHQKLVDHV FT NKDLVITPETTDAVDTPTDTASTST" XX SQ Sequence 8959 BP; 2354 A; 2460 C; 2314 G; 1831 T; 0 other; tatttttgag tgaacaagct ctcttctcta ctctctactc tctctctctc ttctcgacgc 60 cgactcgaat cctggaagca acggtctgat caaccgacgc cacccgactc gaagacgcaa 120 taaacaacca ccaaaaaccg gtcctcgaca cccacaagat gtctcggacc accaccacca 180 ccaccaacac cacctaccgt gctcgacttc gctcgggtac ggtgctcggg gctactggcc 240 cagtcccgag gtcgctgcca ggtacgtacg gcgacaaccc tgttgaagac tcaccttttg 300 ctagcgtggc ttcgggggtc ccaaccgcga gcggtggcgg gacccagtcg ttggaaaatt 360 tgcctgaggg tgtttccaac gctgcctcca gcggccgagc cgctgctgga gaggccgaaa 420 ccgcgaacaa cgcgggtgta gttagtgtag ataccggcgg caatacgaac gatgcgggcg 480 ccgagtctga gggctcgagc gactccgcgg atatcccaga aaccccggtt cgaaagggag 540 gctctaggcc cccactcacc tctgagcaaa tccgtgcaaa ttgagatcgc ccaatccgga 600 ttaagtcgag ctcagcgtga ccagatcgag aagcggcgcg 
agcgtgtgaa caatttaggc 660 gcccgctctc gtgaccccac ccccggcccc tctaaggaca agggtaaagg cgtagacgta 720 cgcaattggg gtgacgtcgt cagaactacg actgacgaac gtgaactgga tcctgaagtc 780 cagaaagcct tattggacca agctattggc aatgttttag aggctgaaag ggtggctgga 840 atagttcatc catatgaaac tgaggatgat attgaacagg ctactaaccc tgatacaaag 900 ggaactaact ctaaggataa aggtaagaaa gatatttcgg agtccaagaa gagaaagaac 960 aaagaacgtc gtgaaaaaag tcgtgagaag agccgcgccc gaaagaagaa ggcggccacc 1020 cctctctcca ctagcatggg agatctaatc gacaaagtta ccaaaaacgt cgacagaggc 1080 tccgatgacg acggctcgaa tgattctgat aacgactccg acgataatct taaccctgcg 1140 ggtaaaatag accccaagag cactttgggt gctaccttca agaagctaga taaacttgag 1200 aaggaagaat ctaaaaagac gaagaagaag tcagccaata agggaaagaa aaagggcaag 1260 aagagtaaga aggataaaag gaaacgaggt agaagtcgaa ctcgtaagaa tggatctcgc 1320 aataacgatg gggacgacga accttcagac tcgtccgaca gctcggatag ttccagcgaa 1380 gagtccgagg ggaactgcag gagttcggac gacggttccg gctctagctc ttcgagcgac 1440 gacacttcgg gcgacagcga tagcgatgac ggacgtcgtc gacgaaaggc taggaacgtc 1500 aagtcagtaa ctccccctga ttacaaggga cagcaaaacg taccccactt tcaacgctac 1560 atgacgttgg ccactaaccg agtcgaagac ttgggccgac gtcttccttc caatcgccga 1620 gttggatacg tggcccaata cctcaagggg cgcgcttggg actggtacca ccgcgagatc 1680 gcttaccgtg aggagcgtat gaccttgtct gaatttttcc gaaggctcta caactgggtc 1740 ttcccggcta cctggaaaca ggaacagcgt gaccgtttgg acgcgatcag gcaaggtggc 1800 caaaccactc gtgagtttta tggacgattc agggagttgt gcgatacggt tggacttacc 1860 cacaagggtg agatggtcca gcgcttcttc cgcgccctta gacccgaatt ccgcgaagcg 1920 gtgttcacgc gccgcctcga tcccgacacc atctcgatgc gcaagctgat taaggaatgt 1980 gagtatgtgg agctcgcgcg ttcagtcagg caataccaca ctcaccaaaa tggccaagat 2040 gggggaaagc gccgacggga tgccgatggg aaccgtcgag acaggggcga gcgccgagat 2100 aagcgccagc gccaagaccg atctcgcgat cgtcgcgaag accgcaagtc aaagaagaca 2160 aacgcgactc ctggttttaa ttcaaaccag actaagttca aacgcgataa cgaacgcaga 2220 gagaaacgca aggtgcttac cgaagaaaag aaggccaaat tgaggaaaga aaatcggtgc 2280 ttccactgtg aggagaccgg tcatatggcc agtcgttgcc ccaagcgcaa 
cgcggtagcc 2340 tcgtcttcac gtggaccacc tggaatttca aattccaata tcgagatcga attggatatg 2400 gcgcacgaag acgacggcac tacctcgggc agtgatggtg acgacgaggt gagtaaccta 2460 tctcttcatc tgaattcagc agcacttacc gataaccttc cccccagctc gactacctct 2520 ctgacgacga ttgaagatcg tcgctcggaa cgtacggatc tggtgaagga ggcggatgaa 2580 ttagttagcg tgctcgaatg cttagatgac gatgaattga ttgacttacc ttggaagacc 2640 cgcgtgcctg gttgggaaga tgagcacgcc gagcgtcaag cactgatccc aatgggggac 2700 ccccttctga gaaaggccga ggtactcctt aaggcaggag tcccctatcc tggggaccct 2760 tccagttggg aaacctacga cggagggcga ttctatgtgt accgcacgaa caagggtaag 2820 tcctatgcaa ttatggatca actgagggac tgtgaggata caatcgagtg ggacttactg 2880 cgagatcccg aattcaactt agcgaattgg tattcaaata ggctgaatgg tacgagcctg 2940 gatgaattcg aagcgcccag acaagcgctt ggggacgtgg taggtcagta tctggtcgat 3000 gtacttgaga gaaatggacc ctacccgttt gaccatgtct gggaccccga ttaccaaccg 3060 gtcccgaggg gcgaacgtcg tttccatttc gagttcgagg atggaatctg ggacgtcgag 3120 gacggcggtt attggatctt tgataaagaa ttaggcttca agaccttcat accactgagg 3180 gatacacgct caccgtcaaa tcgaatcgtt aggcagtaca ccagccgact atacgatgcg 3240 cagaaccgtt ttttcgattt cttttccatc aacccattgg aggagaatat actcgccttg 3300 ctggatcgaa ctcctgatga ggaggaagca gaggctatcg atagattgag ggaagcggag 3360 gaggaggtcc tttggaacca tcgaattcgt ttggactact cagacgggta cgaacacgtc 3420 attgaactgt ttggtaaaca ggtggaggct ggtacttacc ctgccattca aaggaactcc 3480 tctcgccgca aggaactcgg acggagtgtg ccaaaaccag tagtagtaca ggcgctcgtt 3540 aacggcaaac ccgtccgagc gctcatcgac accggttccc tcggtgactt tatgtcgaca 3600 acactcgccg agcagcttaa agttaagcgc cgggagctag aatgccctgt aaacgttcaa 3660 ttagcagtct caggctcaag aactaaggtg aactacgata ccttgtgtca tttcaagtac 3720 caggagatcg acgaggagag gcgattcgac ataatcaacc tctccaacta tgatctcata 3780 ctgggcactc ccttcctgtt ccagcaccaa gtcctgatcg gcttcaaccc cttcagagtc 3840 gtcgtaggga cgaacgaatc gctacctatg gagggcgagt cggtcagaaa acttgccgtg 3900 agtcacctta gcccagaact caccgatatc gagaggattc gagcagaatt gctcgagtac 3960 gcaaggccga tctgcaaaag cgcggccgaa acgcctctac cacctctgag gaagatcaac 
4020 catcgtatcc cactcatcga ttctaataaa tcttaccctc gtcggcgttc gcgctgccca 4080 gaagctctga aggcgcagtg ggacgcaaaa cgcgtcgcgt acatcaagac tgggcgctgg 4140 gtaccaacga ccagttacaa cacggtacct atgcttttca ttaccaagcc gggtaagaac 4200 ggtgacccac ctagactccg aacggttgta gatcttcgtg agcgcaacgc aaatacgaag 4260 aagatgaact gccctcttcc ggacatggac ggaatccttc gccgaatatc aagggcaaaa 4320 tataggacca ttatcgacgg tcaagacgcg tatgaacaga ttagaatcga ggtaagtgac 4380 gtaccgaatt cagcgatgat gacgcccgaa ggcgctgtgc taagccaagt aatgcagcaa 4440 ggagactgca atgcagtctc caccttccaa acgatcatga cggacctgtt ttcggaacac 4500 attggaaagt tcgtcgaagt gtacctcgac gatataataa tcttctcaga tacattagaa 4560 gatcatgtcc gacactttaa aacggtaatc gacttacttc ggaaagaatc gttctacctg 4620 agtgaacgaa agatcaacat cttgccagag gaaatgaagg tcctcgggcg agtcgtcgac 4680 cgtgatggaa ttcgcatgga ccccgacaag gtcgatgcgc ttgcgaagtg gaaaacgccc 4740 actaaccgcg agttgttgcg cgggtttcta ggcgctgcag ggtacctggc ggacgacatc 4800 gatcgcgtta gaatccctat gggtgtgcta acgactctaa ccagtgataa cgtaccattt 4860 cgatggactc atacgcacca gcgcgcgttc gaggaaatta aagatctggc ctgccgtttt 4920 aaggatcatc atcgagtgcc tttgaattat agtcccgacg caccttcaat taatatggtg 4980 actgatgctt gcatcacagg tgtagccggc gtcatcagtc aaggtgatga ctggaaggac 5040 gccaaggtgg cagcctttta ctctgcgaaa ttaaaccccg cgcaaacaaa ctaccctgtt 5100 catgagttgg aactattagc gggtgtggaa acgatgttac gaaatcgtga cttacttcaa 5160 ggtgtccact tcaagtggta cacggatcat aagggcttaa tctatttgct gaaacaggag 5220 aagctgtcca gaagacaagt caggtggatg gaaaagatat ccgactttga ctttgaagtg 5280 atttacgtac caggtaccga aaatatccta tcggatgcgc tgtcgcgcat ctattctaac 5340 gacgcgcctg gaactgttcg ctcaccgtcg gaatacgtac aattcgacga ctcggctgac 5400 gactcactct tctttggtgg aatcgcgcca aagtctacac ctgcccctgt ctacgttgga 5460 aaggaagcgt tgtcagttgt cggaccgggc gccaagcgga gggcgagccc gagaaagaaa 5520 cctgacccac ctttacctgc tgacaccggt cgaccagaaa cgaccaagga gtgggcggcc 5580 agaatggcag ctacgaggaa cttcgtactc agaggcccca gagtacgaag ggagggcggg 5640 atgtcagagc aaaatgaaaa cgaaccccct aagtcaacgc cagccaagga aaagcttgtc 5700 
attagaattc ccgcgcgaag gaaaagtgcc gaagcggagg caaaaagtca atcacgccaa 5760 gaaccggcgg tacctaccga tcaagcagtc ccgaacgttg gtgagaatgt tctactgaga 5820 gccccagagg caaaccctca agggatcgac ataccgcggg caatcaaagg aaagtatgtg 5880 cactgtccat tcttcaagat acttattgaa aacccaaaac actacaagaa ctttgaagtg 5940 gaacaagact taatatacat gaggcttgag gatggagggc gggtgctatg catcccctcc 6000 ggtgcctttt acgccggtag aagcgtacga gaaatagtaa tctcagaagc ccatgggcta 6060 ttagcccacc tcgggcccaa gcgaacgctt gagtatctca aacaacatgt ctggtggaag 6120 gaaatggtgc gcgatgtaac tctattctgc gaatcgtgcc ctacatgcaa aagaagcaag 6180 cctagcaacc ataagcctta cggcctcctg aatcctttgc aaataccagg cgcaccatgg 6240 gaatccatcg ggattgactt tgtcggacct ctcccggtct cgaaggaccg agatgctgaa 6300 tacgacgcga ttactgtgat tattgacagg ttctcgggca tggtacatct ggttcccagt 6360 aggcaagatt acaaggccaa agaagtagct gaactgatat tctcggaagt ctataaacac 6420 cacgggttac ctagatcaat agtcagtgat cgtgacaagt ggtttacatc tgtcttttgg 6480 gaacacctac acaaactaac cggggtcaag ttgaaaatgt ccagcgctta ccacccacaa 6540 acggacggcg ccaccgaacg cgcaaatcgg accataaccc agttgatcag gcagtgtatt 6600 ggcccgaaac aacgggactg ggtagcgaga ttacctggca tcgaattcgc gattaatctc 6660 gcccgatcgg aaaccacggg gtactcacct ttcttcttga ataacggccg gctacccaga 6720 acgttcatct gggattgggc gagtaaggaa gagttccccg gagtgagaac attcgcgcat 6780 aagcttaaaa tcgccgttat gggcgcgcac gatgctatca ttggtgcgcg agttaaacag 6840 acccgcgatg ccaacagaaa gcgaattgtc agtccattca aagccaacga cttagtttat 6900 ctgtcaacag agaacataac attcccgaag ggacttaccc gaaaattcct acccaagttc 6960 atcggaccgt acaagatcgt cgaagatttc ggcaacaact cctaccgaat agagttacct 7020 gctagaatga agcagcgcgg agtgtcagat gtattccacg cgtcaaaact aaggatacac 7080 gtaccgaacg acgaccgcct atttcccggg cgggtggata accagatttg ggagtacgac 7140 gacgaggaat tcgaaacaga gtgggccgta agccgcatca caaatcacgc gggcgctaag 7200 tccagcgcca agttcgagat cctgtgggag tcaggcgaca aaacctggct cccttacagt 7260 agggttgctg acttggtcgc actgcaggac tacctggacg cgctgggcgt cgaaaaaatt 7320 tcggacttac ctgaaaaagc tgtacctgcg cgcatggacg aacagattga gctggggtat 7380 ttagctggta 
tggaattgga actcgcacct tcctacccac ctgtgtcgag ccccccctgc 7440 tccacttccg tttcccctgt cagcatgact tcccgtcctg tcaaccgtcc tgtcaccctg 7500 ttccctggca tatcccgcga tctcgacgcc gcctcgagat tcaccagtgc ccccttccgt 7560 aaccgatact tcactcacgt cgcttacgac acccacactc gagaagtggt cacccgagtt 7620 cccctgggca ccgaacgcag gtttgccgtc gaccatttta tctcctgcgt ccaccatgcg 7680 atctacctcg catctatcac cactcccgac ccctcggttc cagcacccgc cggctacgag 7740 gaggtagcgc gatcatttaa cgactccacc tattgtcgag ccaagttccc ggttgtcgaa 7800 tctggtgcgc ctttcgtcca tccctcctgc cgacccatca aagagttcga cttcgacttc 7860 tccttggttc gactcaaccc cctcggcgtt cacggaagcg cccttagcat gttggggtta 7920 gttaataaca atcagctcga cattgcgcgc atccgaacta tgtacgagcg caacgatcga 7980 atgctcgact tcgctttgga ggcacttacc acaggccgtt cccctttttc gaactcacgt 8040 aacgccttcc ataacgaggc cttccaaaac cgacgtttcc aaaaccgtaa ccgcgcccgc 8100 aatagccgcc cttacgatga cggccccagt aaccgtggtg gccgtggccg acccatcttc 8160 cgtggcggtg cccgaggtgg tccttccctg ggaaggaacg gcaccttcag gcacgtctgg 8220 cgtcgaaggg agcagctcga ctcggcctcc tcgagcgact atacgcttcc cgcctatcgg 8280 accgatgagt ccagcgccta cacttcccct cgatcagcca acgcgtacag caacttaccc 8340 gtggtccccc cggcggttcg ctccaccact accagtgacc ctcgcgtcac cgagaaccga 8400 cgactcctct cggagttacc cactggaggc tctcccccca gctccgacgg ctcgtccgcc 8460 gacccttcga ccgcctctga ccccgatcaa gcggaggacc cgaaccaagg cgttgaagag 8520 cgacaacagt ccccttcgtg ggatgacgac acacccatgg atgacgatga cgtctccgtt 8580 atcgtcgcgt ccgaagactc catcaactac gagccagcca gtcccaacgc cggctctgga 8640 accgtcactc ccccttggcc cgccggtacc gcgcccaccg acttcgccgc ctcccctcct 8700 ggttccccta tggctgaaga taccgatttg gacgccgaag gagaacccga cgactcggct 8760 accgaagacg ccggtctcgc ggctcgccag gctcgcgaac gcgaaatgct cgataagttc 8820 caccagaagt tagttgatca tgttaacaaa gatttggtta ttaccccaga gactactgat 8880 gctgtagaca ctcctaccga taccgcttcg actagtactt aacttttcca gagtctcttc 8940 gatactctgg gagggcgta 8959 // ID Gypsy-32_MLP-LTR repbase; DNA; FNG; 178 BP. XX AC AECX01000965; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-32_MLP_; KW Gypsy-32_MLP-I; Gypsy-32_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-178 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000965; Positions 154042 154219. XX SQ Sequence 178 BP; 56 A; 40 C; 31 G; 51 T; 0 other; tgttacaatc caaatgtaac ggacttgtga aagagatcac acatgtcaca agtgcttcta 60 gttataatcc ttgtattacg taacttgtat gaatcctcat ccgacaatct cgttaatcaa 120 taggaaagga ttcaagagtt ccttcatctc ctgagccccg tgacagcagg tcataaca 178 // ID Copia-5_LBS-LTR repbase; DNA; FNG; 305 BP. XX AC ABFE01001868; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_LBS_; KW Copia-5_LBS-I; Copia-5_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-305 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001868; Positions 17425 17121. 
XX SQ Sequence 305 BP; 84 A; 71 C; 55 G; 95 T; 0 other; tgttggcata aaaccacgtg acctcatacg tcacaattat gacatgtgct gtgtgtatta 60 cgcagcactt acacatcata tgctgtacat tgttctacag cattccatat gattctatgt 120 caaatatgca tacatagtag cacggaccat ctctgtagag tatataagga cgtctactat 180 gtcctaatac attaagtctt ttacttctag ctagtcttgc ttagcttcgc ctctacaata 240 acacccaagg taagggctat ggtagtgtta gctacctctg cgctgcgcat cctagatgcg 300 ataca 305 // ID Copia-33_MLP-LTR repbase; DNA; FNG; 831 BP. XX AC AECX01001335; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-33_MLP_; KW Copia-33_MLP-I; Copia-33_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-831 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001335; Positions 41591 42421. 
XX SQ Sequence 831 BP; 203 A; 168 C; 139 G; 321 T; 0 other; tgttaaaata caatcaagac taacgctaat tatttagttg agcgtgtagt gtggtgtcag 60 gaagtgtaac aggttatcaa ttctcgtctc ttattagctt ttgtcattgt gtgatatgtt 120 ttcctatatc tccccaacca acgtttgcgc acatcaatcc caatgttatc ttgaagttgt 180 tttcttatat atatcttttc tcgttttcgt cttactctct tccccatctg ttggaaagag 240 agtaagtacc tcgttacctt ttctctcgcg atcgtagtat actaactact caatactgta 300 gattccagta gatcagcttt acatcgaagc ttttgccttt tgcttatctt tcattctcat 360 ctttactttt cctccgaaaa ctttaggtac ttatacaaac agttatcaca tcatctattt 420 tctctctgtt tacgtttttc tactttttgt tgtttatgtt tttaggttag ttttataccc 480 aagtctgcac acttgaggtt gtggagaagg tgcttacgat tagcacatta tcatcaggtt 540 agctagaaat ataatcatat ctgttggaaa gagaattcca gtagatcagc tttacatcga 600 agcttttgcc ttttgcttat ctttcattct catctttact tttcctccga aaactttagg 660 ttagttttat acccaagtct gcacacttga ggttgtggag aaggtgctta cgattagcac 720 attatcatca ggttagtttt atacccaagt ctgcacactt gaggttgtgg agaaggtgct 780 tacgattagc acattatcat caggtctaca cggtgtcctc agccatttac a 831 // ID Gypsy-63_MLP-LTR repbase; DNA; FNG; 159 BP. XX AC AECX01001331; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-63_MLP_; KW Gypsy-63_MLP-I; Gypsy-63_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-159 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001331; Positions 16900 16742. 
XX SQ Sequence 159 BP; 39 A; 46 C; 26 G; 48 T; 0 other; tgtaacaatc cttataagta gtcaccatgc ttgtactgta actcccacgc tcccgtggtc 60 atatggtgct cgagagtcca tacaggtctc tctcttcatg ttgcaatctc attatcaagt 120 agtcatcacc ttgcccagtt cctcagcacg atcataaca 159 // ID Gypsy-49_MLP-LTR repbase; DNA; FNG; 650 BP. XX AC AECX01002280; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-49_MLP_; KW Gypsy-49_MLP-I; Gypsy-49_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-650 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002280; Positions 81436 82085. XX SQ Sequence 650 BP; 248 A; 155 C; 114 G; 133 T; 0 other; tgtcagcctc caagctcagg ctagctgaag ccaaagagac tacaacttga ataaaaacta 60 ataaatagat aaaattaaaa tatatatatt tctcctttca tcatactaca tcgtgtgtct 120 agtttcaaag acaacaaata cttgaaaaca aaactatgag aacctaagtc aaaggtaaca 180 acaaaaatcc tcacaagaca cctcgtgtct agttccaaga ggaacaatcc tgagaaacca 240 agattaacac tcctagaggt aaaactaaaa ggaacccact aagaagatac catacgaggc 300 cagcttgagg atcagaacac agtaacgcaa cccgaaggac aaaacgaaac gaatttagcg 360 tttcaggctt gagctcaacc acgagttgac caagcaagga tttactgagt tcccaatctg 420 gccaaaccac gggttagcca gaagagtctc ccctttggga actatcaata acaacgactg 480 agaaccacaa gtcagccagc cacctggaaa aggcatataa agagagatgg tctcctgagt 540 tcgaaaggag gacactcatc tacgacgacc agaagcaact ataagctttt ttctcgcagt 600 ccagccatac ttcctaaaaa aggtatatct ccactcacta taaaagccca 650 // ID copia-3-I_AN repbase; DNA; FNG; 5309 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 
8.11, Last updated, Version 1) XX DE Internal portion of copia-3_AN LTR retrotransposon- a consensus DE sequence. XX KW Copia; LTR Retrotransposon; Transposable Element; KW COPIA superfamily; copia-3-I_AN; copia-3-LTR_AN; KW internal portion. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-5309 RA Kapitonov V.V. and Jurka J.; RT "copia-3_AN, a family of copia LTR retrotransposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(11), 201-201 (2003). XX DR [1] (Consensus) XX CC LTR retrotransposon; Copia superfamily. Internal portion. CC The LTR is deposited in Repbase as Copia-3-LTR_AN. CC It is a relatively old LTR retrotransposon; its internal portion CC encodes remnants of a Copia-like polyprotein (numerous stop codons). CC The polyprotein is most similar to polyproteins from some Copia CC elements in the Arabidopsis thaliana genome. XX SQ Sequence 5309 BP; 1937 A; 916 C; 1082 G; 1374 T; 0 other; ttggttatga gcctgggctg ctatgtctag cctaaccagc aggcttagta ccttggggga 60 ttatataaag cctaattact ttgcccaatc cagcaggaga ttgaaactgt acgcgatgtg 120 ttgatccttg atcttgtgaa ggtggccagg cctgtacaac ctgcctatct ctccctgatt 180 gagctattaa accttgaacc tagaactgaa cacgacagat gctaccagct cgaagacaag 240 aacaagaccc atagactggg agatatatat taagctgaga gagtattaaa gacctagcag 300 gaccaagtta gaacataccc agggcatttg atgaactggt agagacaatt aatcagacca 360 agacagtgcc aggggcgagt gagacagttg agagaactga gccggacgag agaatctgaa 420 gtactagggg cattgagata gatcctagca agcagctgca agcagaattg gcagctgagg 480 accctgatta gcagctacaa ggagaacaag tatctgggat actaggtata gaagaattga 540 tatccaggat gtcaggagaa tttgagattc ctgcctacca ggcagtgaca ctacagaaga 600 ttattatatt taatattaaa aagctagact agatgaatat aagtagttgg aaagcccaat 660 ataagatctt cctggagata caaggttgct ggagtgtggt agaatatata tataagtggc 720 atggaaatgt gacaagggtc agaaagctgt tagaggacct aggatagaga gtattagatg 780 taacagctaa gttatatatc ctccagaata
taaaggtaga agataaggct tccatacaga 840 gtttaaaaac gtccagagat atgtgggcct tcttaataga gaaatataag caaagaactc 900 aggttaatat taccaatgca attcaaaagg taacatacta gcagatagat ctaaagatga 960 gccttgagga ggcaatgcaa cagctggatc aatattatgc agagctagag gatattagca 1020 ataggaagat aaagtttgat aatataatta ttcttatctt cttcctggat agattgctat 1080 taggatatga ttctataaag tttttacttc tggcataaga gaatcttacc tgcaggatag 1140 tcctatcaca gctctaatag caagaaagca taataattac tgctaaagag aataagacta 1200 tctaagaatc tataagctga gcaaagcaga tgctatgttt taattatagc aagcaaggct 1260 actttgcaag agattgtaca gctctaaaga agaaccaaga gcagagcagt tcccaggagg 1320 accctcaggg aaggagtcac cagaagggcc acagcaaggc cagaggatat agaaatagca 1380 gagataagag ccaaagagga cataccagct gccagaaagg aaaagcctgt acagtaaata 1440 ataaaataga tatagacata gatagtgatt caagtactag cagcaaaggt tctaccagac 1500 atagcatata tcaagtatat aaataaacag cctactgagc agctgagaat acctactaag 1560 ttgaattaag agagcatatc agtaattaat ttaaatactt gcttgaaaag aataagaaag 1620 catacagatt aaaggggagc aataatccta ttattaatag tagagcaata agtacctgta 1680 gcagcaagat taatttattt aagtctctag atcagagata tagaggcagc ttgggaatag 1740 ctagtaaatc tattaaaatt actggtagag gaacaataag gatcctatta agcagtagaa 1800 aagtggctag gatctggaat gtattgtaca tgcccaggat gatatagact ctcctattaa 1860 cctaattatt gcaagataaa ggtatctgga ataagcatgt aaagaagaag tactaattct 1920 tcaagaggag aggaaagatc ctagcaagag gatataatat tggctgtaca agttacctag 1980 gataggtaaa aagccagaat gcattggcta aagggtttgg gaagcctaaa aataataaat 2040 atacaagact tataaagcag attaattagg agttgcttta ttaatatcta ggtcatccag 2100 gaaaggccag atttagatag ataataaaga aattaggact cttacctagt aaagaggtaa 2160 ttaagcagct aaagacctgt aagacatgta tacaggtaaa gagtatcaag aaatagaatc 2220 atgccaaagt gccaagggca tcaagaccct taaagagagt ttatatggat ttttagggtc 2280 tatacagcaa agcaaaaact ctagagagat actatctctc cctgacagac aactgtacga 2340 gattttcatg gatatatctg acaaaggatc aagaggctgc cacagtgaaa gccactctag 2400 agcaatggct ggctctagcc gagcgcgaga aaggtgtcaa gttgcttatt atccagactg 2460 ataatgcaag agaattcaag gctctagagc cataggcctt 
gaagaaaggc atccagatca 2520 agtttactga gcctgataca cctcagcaga acagtatggc agaaaggctg aattaatatc 2580 tcttagagat gaccagggca atccttatta atacaaatat tccaaagaag tactggctat 2640 acacaatcag aatagccaat tatctctaaa atcaagtagt cagggtgcaa ggtactaaga 2700 aaaccccttt tgaaatatag ataggacatc ctcctgatat atcaaagttc caaattcctt 2760 tctcaagagt ctggttttat aagaagacaa atgacaagct ggagccaaga gctattaaag 2820 gtatatttat aggatataag tcaagccaga atcattatat aatcatggcc aagcaggatt 2880 ataagatcta ttaagttata aatcctatat tcctggaaaa caagcaaggc ttcattagca 2940 aagaaccagg agtttgagat cttggggaag aacctctatt ttaaaggata tttagagttc 3000 ctgaagtaag cttaggaact aggggaggta ttacagaggc tctgggagcc agtaatataa 3060 gcaataaagg tagcagtata gatactgcaa gccctgaagg tgctgggggc accagaggtg 3120 ctggtaatat taagaataat attagtgtta agaataaagt tgaagctgaa gacagcaata 3180 taactagcca aagtggtcag aattcaggat tgaccaatca gaggctagaa gtagctatcc 3240 caacatatag gacaccaagc ttggacaaga gccaggaaga gcctataccc aagcctttga 3300 aaacaacttc ataactgtca ttatctccat tgtcaaaccc tatcctgaca gaaccctcta 3360 agatatgtag accaagccaa aattaaaagc caatacaggc ggcaattaag tccaagcaga 3420 cagaggctat atataggcag aagccatgag cctagagata cagagaagag agggaagtac 3480 taaaagatcc ttctctgcgc ctagctgttg aacagcagta aattaataaa gtgactaatc 3540 tagctatagc cctggagctt catcttgctg attataatac tttcaaagcc aaggttaata 3600 agatctatgg gcagatccct attccaaaga cctaccagga ggcaataaat gaccctatat 3660 acaaagccaa gtagaaggaa gcaatcaagc ttaagctgaa taacctgatc caattcagca 3720 cctagagata tattagaaga cctaaagatt aactagtagt atcaataaaa taggttttta 3780 atattaaata tagagctaat agctgagttg accagtttaa ggcaaggcta gttgccagag 3840 gcttttccta atacaaagga ttagacttta aagatatatt tgctctagtt atctagctgg 3900 agagcctcag gatcctattt accttagtaa cagtccatgg cctcaaagct tactttctta 3960 atactataaa tacctatgtc agatcaaaac ttgacaagca gatctttata gagatcccag 4020 agggagttaa tcctaaatcc tataatctaa ataatatata taagatccta caatccttgt 4080 acagacttta ataatctata tacctttgga accagaaggt taagaagttt gttatattaa 4140 ttaggtttca acaaagcact gcagaccctg gagtttttat taataactaa 
ggagttatta 4200 tagctcttta tatggataat atcttgattt ttggcaaaga taataaggat attaagtcta 4260 ccaaagatca gctaaagagc ttctatccta taaaggacct agggctggca caaaaggtat 4320 tagggatcta gattatatag ataaagaact ttatctgact ggaccaggag ctttatacct 4380 aatctatcct agaagagttt agaataacta aatcaatatc tcaagacaca ctgcttgatc 4440 ctagtaccaa tctggataat taattatcta ggaagctacc ttgagacctg tataataagt 4500 ttaggaagat tattaaacag cttacctacc tagctggcag aactaggcca gatatctagt 4560 tctctataaa ctgactaagc taatatctta cagatccctg agaggtccat cttagagctt 4620 caagatatct cctttactat atcaaaggca ctgttatata tagaataacc tacagtgcaa 4680 aggggagtac agataccaag accctgatag gatattcaga tttattatat aggaattcca 4740 caaagcagag atcgaccagt atatatatct ttatgctagc taatagacca gttagttggt 4800 atagccagaa gcagcctatt actgctatgt caataactaa agcagaatat attatagctg 4860 cagaggcagc aaagcaggct atctagatca gatactttct agcagctata tcaaagcatc 4920 ctaagcagcc aacccaactg ggaattaaca atcaaggtac tttgatgcta tcatccaacc 4980 cagttaatca tctacacagc aagcatatct gcatacaata tcatgccatc taggacttca 5040 tcgagtatgg agatatcaag ccaatctata tccctacctc agaaatggtg gcagatggtc 5100 taacaaaggc aataaaggct gataacttta agaaagcact tcaattgctg cagctgaagt 5160 caaaataagg ttctataata taatatcaag atcctcagat taataagata aaggcattca 5220 acaccccttt attgtttctg ttttatttcc ttttttagaa gcttgtatag cttcattcta 5280 tttatttcag ggtattgaat gaaggggag 5309 // ID Gypsy-14_LBS-LTR repbase; DNA; FNG; 479 BP. XX AC ABFE01000655; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_LBS_; KW Gypsy-14_LBS-I; Gypsy-14_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-479 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000655; Positions 72564 72086. XX SQ Sequence 479 BP; 115 A; 146 C; 95 G; 123 T; 0 other; tgtagcatgc tagtctcgtg agtactcact acccatactc tatatccttt ccgctcccct 60 acgtcgcgcc actaccagtc tgttaccata ctttacttca tttgatcacg ttacgcttac 120 gtcattacgt tgtattttac acctcatgta gtacatgagg cgaataatct ctttcaccat 180 actatcgctc acgaagctcg actcgaagcc ctactaacgg ctcctactcg aactctaggc 240 gatacgtaag agaaggagtc aacgagacct actcgatcta agcattctcc aacgggacgg 300 tccgtggaga tcggagcacg ggtactgtga actgtcacgt tccacggctc ctagcttgag 360 atcgattaga tcgaacgacc tactccgagg gtcaatacac ctctgtccgc gttgtccgct 420 agacactcca acagccgtcc cactgcccta gtgacaccag cgtatagcag tgttccaca 479 // ID Mariner-4_AN repbase; DNA; FNG; 1850 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 19-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE DNA transposon. Mariner superfamily. Tc1 clade. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW mariner superfamily; Mariner-4_AN; Tc1 clade; transposase. XX NM Mariner-4_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-1850 RA Kapitonov V.V. and Jurka J.; RT "Mariner-4_AN, a family of nonautonomous DNA transposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(12), 212-212 (2003). XX DR [1] (Consensus) XX CC DNA transposon. Mariner superfamily. Tc1 clade. CC The consensus sequence was built based on multiple alignment of CC several copies 99% identical to the consensus. CC Mariner-4_AN elements are characterized by TA target site CC duplications and 36-bp TIRs. CC This family is closely related to Mariner-4_AN. The consensus CC sequences are 68% identical to each other.
XX FH Key Location/Qualifiers FT CDS join(277..1149,1205..1678) FT /product="Mariner-4_ANp" FT /note="transposase" FT /translation="MPAPHPNELRVQVLSYWALGIQPPDIAKMLQINVRTI FT RDMIQKGQDRGYNPAQCMRVKLEYVEDGKRSGRPKEISEATDMAVLASVKQ FT DRNGREKSSEILAFEAGISHSSVLQILHKHGFTIVKPSWKPGLTEAAKATR FT LRFCLDHQHWTLEDWKAVIFTDETSVILGHRRGSVRVWRTARDVHDPTCIR FT RRWKGSSDFLVWGCFTYDKKGPLHIFEPETAYQHKKSEVEIAALNAELEPI FT LREEWEIETRLKRLHLRGVPGRVPTWRFTEKTGKLVRKSKGGVDWWRYQQE FT ILLPHLIPFAKACKIERPDTKVLEDGAPAHKHHAQRRIYSIHEIEKIFDWP FT GNSPDLNAIEPCWMWMKKRTTSRGAPRDKKTGKTAWIKAWNELPQEKIQGW FT IERLIRHIQEVIQHDGGNEYKEGRTDHDARSWKGRRIKGQLSARQDLSPDP FT WEDL" XX SQ Sequence 1850 BP; 496 A; 414 C; 446 G; 494 T; 0 other; cggaggtgtg caccaaaagt catcgccgcg ctagtaattt atgatgccaa atcctgtaat 60 tccattacca gtcgcaagct gataggttga tgatatatcg atcttctaag caaacatcaa 120 cataacctct tctcattttt gaagctatcc agatcgcaaa ttgacagatc aaatcagttt 180 ttatggtgtt ttgaagcgct ttttctgcta tttttgcctc cctgcctcca attctgcctc 240 cgaactgtac acgccttcct gcctgcgatc cttacaatgc cagcgcctca tccaaacgag 300 cttcgagttc aagtcctctc ttactgggcc ttagggattc agccacccga tatagccaag 360 atgcttcaga tcaacgtccg tacaatacgg gatatgatcc agaagggcca agatcgtggc 420 tacaatcctg ctcagtgcat gagggttaag cttgaatatg tggaagatgg caagcgctct 480 ggccgtccga aggagatttc tgaagctaca gatatggcag ttcttgcatc tgtcaagcag 540 gataggaatg gacgtgagaa atcttctgaa atccttgcct ttgaagcagg tatatctcat 600 tcttctgttt tacaaatcct ccacaagcat ggctttacaa ttgttaaacc ttcttggaaa 660 cctggtctaa ctgaagctgc aaaggctact cgtcttaggt tctgcttgga tcaccaacac 720 tggactctgg aggactggaa ggctgttata tttaccgatg agacttctgt tatccttggc 780 caccgtcgag gctccgtacg agtttggagg actgctagag atgttcatga tccaacatgt 840 atccgaaggc gctggaaggg atcatctgac ttcttggttt ggggatgctt tacatatgat 900 aaaaagggac ctctgcatat ctttgaacca gagactgctt accaacacaa gaagtcagag 960 gtagagatag cagctctaaa tgcagagctg gagcctattc tcagggagga atgggagata 1020 gagacgaggc taaagcgtct gcatcttcgt ggggtccctg ggcgtgttcc tacatggaga 1080 tttactgaga aaactgggaa gcttgtacga aagagcaaag gaggagttga 
ctggtggaga 1140 taccagcagg tattctctaa tagctttatt ctcctatatt gggctaacca gcttttctca 1200 ataggaaatc cttcttccac atcttattcc ttttgccaag gcctgcaaaa tcgaacgtcc 1260 agatacaaag gtattagagg atggggcccc tgctcataaa caccatgctc agcgccgcat 1320 ttacagcatc catgaaattg agaagatctt tgactggcct ggcaattcgc ctgatcttaa 1380 tgcaattgag ccatgctgga tgtggatgaa gaagcgtaca acttcacgtg gagcacctag 1440 ggacaaaaag actggaaaga cagcttggat aaaggcatgg aatgagctcc ctcaggagaa 1500 gatacagggc tggattgaga ggcttataag gcatatccag gaggttatac aacatgatgg 1560 gggtaatgag tacaaggagg gccgtacaga tcatgatgct cgaagctgga agggcaggcg 1620 gatcaaaggt cagctttctg cgcgtcaaga tctatctcca gatccttggg aggacctctg 1680 aatagcttca tatctgctag ctttgacatt ttattgtatt ctccttctct aatatcacca 1740 cagctgtatg ccgaacattg gttgctatgg tattttctta caagaagccg gtacagtggc 1800 agtactagcg tcaatactag cgcggcgaag acttttggtg cacacctccg 1850 // ID Gypsy-5_CCO-LTR repbase; DNA; FNG; 269 BP. XX AC AACS02000002; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_CCO_; KW Gypsy-5_CCO-I; Gypsy-5_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-269 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000002; Positions 183936 184204. 
XX SQ Sequence 269 BP; 56 A; 77 C; 43 G; 93 T; 0 other; tgtagggatt tgtttccctt ggcttcctaa ttcgtcttat cacttactga tctcgtttgt 60 tatcttatct atgtcactcc ttccccattc ctcattcccc tatgattccc cgtatgttcc 120 cagttagctt atgtaactta ctacctaggt atatatagag tagttctcat ctgtacgtag 180 gtagttctcc ttccaacacg aagcttacac cactcaagcc cagtgtccac tgtgagaatt 240 ccgtgtgccg agcccaacga ctcgtcaca 269 // ID SKIPPY_LTR repbase; DNA; FNG; 429 BP. XX AC L34658; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 08-AUG-2007 (Rel. 12.07, Last updated, Version 2) XX DE Fusarium oxysporum retrotransposon Skippy; gag polyprotein, pol DE polyprotein; LTR. XX KW LTR Retrotransposon; Transposable Element; LTR; KW reverse transcriptase; gag; integrase; retrotransposon; RNaseH; KW pol gene; SKIPPY; SKIPPY_LTR; SKIPPY_I. XX NM SKIPPY_LTR. XX OS Fusarium oxysporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; OC mitosporic Hypocreales; Fusarium; OC Fusarium oxysporum species complex. XX RN [1] RP 1-429 RA Anaya N. and Roncero I.M.; RT "Skippy, a retrotransposon from the fungal plant pathogen RT Fusarium oxysporum."; RL Mol. Gen. Genet 249(6), 637-647 (1995). XX DR GenBank; L34658; Positions 191 619. XX CC Target-site duplications of 5 bp were found. XX SQ Sequence 429 BP; 130 A; 128 C; 72 G; 99 T; 0 other; tgttacgacc ctagcggtta catcacgtgt acctcattat ttttctccaa tcacaggaac 60 ggacccacca cctcacattg gaccgacctt ccaatacttt cggtccaaag ataagataag 120 gaccgacgac tctcgttcaa cggtcctaca gaccgatgca taaacaatat cggtcctgag 180 gggaccgacg ccagcacact atcggtcctc acgaagacct ctagtatata tacactcctt 240 tagcagtaga tagaacttag ctcaccagct atcaataaaa cttctttaca ttgatcaagc 300 ctaacacgca ttgaatctac cattaagaga cgtgacactt cgacaagctc agcatcacgc 360 cctacgccat ctgccaacaa taaacctagc acggcttagt ttatccacag ttgggaacca 420 accgtcaca 429 // ID Gypsy-12_LBS-I repbase; DNA; FNG; 3868 BP. XX AC ABFE01000693; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 
16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_LBS_; KW Gypsy-12_LBS-LTR; Gypsy-12_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-3868 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000693; Positions 21199 17332. XX CC Positions [2741-3001] - Integrase core CC 'TCAGC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS join(1838..2737,2741..3853) FT /product="Gypsy-12_LBS-I_1p" FT /translation="MCAWLMGTAQPISVNCDHKNLEYFMTSRTLNRCQACW FT AMFLSDFDFTLSWASGSQNPADGPSRRPDFMPKKGDSVSLQQNQAILSSYH FT TQHVIPSEIPPSPILIAATSSLTIDNSELLDRFRLAYQEDTTWRESLLHGD FT TSFTTTNDMVFHNGRVYVPPPLRLDIMHQHHDAVLGGHPGHTITLNKVLRS FT YSWPGLYTFVCRYVAACDTCNCTKIPRHKPYGLLKPLDIPSRPWKSISMDF FT IIKLPISHGYDSILVVCDRLTRATHFIPCNETMTASKLAWLFIDRIFRYHG FT LPDSIISDGSLFVSNFWKRLTTHLSIDLRHSTAYHPRTDGLTEHTNQTLET FT YICAYCSYQQDDWVDYLPLSEFVFNSAENTSTKQTPFFANIGFHPTFTPQL FT ADISTVPAADELAQHLDRIHSELKAELELAQERQRKLFNKKVLPSPVYQPD FT QLVWLLRRNVCTTRPSLKLDHRRLGPFPIIHAIGNDTYLLKLASYLSCLHP FT VFHTSLLEPYSDPSEFHTHTEPEPFQLAENPIETISDISDVLDCRKTGHRY FT DYLVCWKDLSMDDNSWVPLQELPTSSNELLECFHRRNPRAPRPHSLILNQI FT ALSPSDFDFDIPPDSDSVLSPISIPNSSAHKRPKSPPLTRSNLRSIYTPPS FT STTLSTGRESCPHPRYSTNVSS" XX SQ Sequence 3868 BP; 846 A; 1379 C; 600 G; 1043 T; 0 other; gtttgattag accaacaccg attgttctcg tgtttgctcc ctgctcgtca tcctcttcct 60 ttttatcctc actgccaacc atggacgaca tccagcagat tccacctacg ccctctacct 120 ctacttcttc atccggcaac aacaacatca tacccatgga gattgattcc atctgccggg 180 gtcccttatc cgctgaagaa aaagctcgcc gcaagaaaga aggtccttgc ttgtactgta 240 gtcaggttaa gcacttcact gctgactgct ctaacaacaa gaaagcttcc cctttgggaa 300 aagcttgaat ggtagctcct tgaacaaggt 
ccttgggcta ccgaaatctg atgttaggac 360 ccgtccgccc actacagtac tgcccaccca catccgtcac tccaacggca agtggtatcc 420 ctgctccgat ctccccattc cctgctacat ccttgcctcc tcccctgact cttctgatct 480 ccattgcctt cgtttcccaa accgcaaacc tatccacacc actgctctca tcgattccag 540 tgcctcacaa ttatgtattt ctgaatgttt cacctcacgt cactcttttc ctcgaagcgc 600 taaggaaact cccattccca tcctcgccgt cgacaaccgc ccaatcgcct ctggcctcgt 660 aacccacgat atcattgcta acatcgacat caaagaacac tctgaaacct tcctcctcgc 720 cgtcgtctca gtccacttcc ccatcatcct aggccttgat tggcttcaca ttcataaccc 780 cgcaattgac tggactgacc cctgcatatc actttcttgt tgtaatctga accctatacg 840 gccagtgcgt attcggctca aaggcttcgg tcttgttgga gccacccccc ttgaagcccc 900 tcgcgcttat tcagtttcgt ccgtaggact aggcctcagt cttcgtcctt gccttgcctt 960 gccaagttta tcccctcaac gtccagtacg catctggccc aaaggcttcg gtcttgttgg 1020 tgccaccccc cttgtagccc ctcgggctta ttctgtttca tccgttggac taggcctcgg 1080 tctacgtcct tgccttgcct tgcctttgcc ctctgcagca gccaccccac cagcttctcc 1140 tcctctttca aaatcctcat tcctcagttt cttgtccaag tccagcaacg gctctggtcg 1200 tgaccccctc gccaatcctt tgacatcacc ccctccgaaa atatccacct gttcaccagc 1260 acatttcact aaatatgcca aaaaccagca tgtcagaata cacaacttcc accctgtagg 1320 ctctcccatt tacattgcag caacctcttc ctcccttgcc gatgatatct ccaaaccccc 1380 tcctcttcct gacattcctg gcttgccgga caagtacaag gcctggtcca acaccgtctt 1440 ttcccccact cctcgttccg cacacgatat ccaagtattc cttgggtttt gcaacttcta 1500 ttgccgcttt atcaaccact atgcttccgt cgcgatccct gtcaactgcc tcacacacaa 1560 aaacaccttc atttggtctt ctgacgccaa ctctgctttc gaaaagctca aaaacacctt 1620 tctctcttac cccatccttc gtcactatga tccctccaaa cctgccaccc tctctatcga 1680 tgcctctgac ttcgccctct ctggtattct ccaacaacct gacccctccg gctctcttcg 1740 tccagtcgcc tatttctcac gcaaattttc ccctgcagag atcaactacg atatccacaa 1800 caaggaattg ctcgtgatca tcgagtcttt ctgtgacatg tgtgcctggt taatgggtac 1860 agcacaacca atctctgtta attgcgacca caagaacctt gaatatttta tgacttctcg 1920 taccctcaac cgctgtcaag cctgttgggc catgttcctt tcagattttg acttcacgct 1980 ctcctgggct tctggctccc aaaaccctgc tgatggcccc tcccgtcgtc 
ctgacttcat 2040 gcctaaaaag ggagatagcg tctctttaca acaaaatcaa gccattcttt cgtcatacca 2100 cactcagcac gttatccctt ccgaaatccc tccttccccc atcctcattg ccgccacctc 2160 atcactcacc attgacaatt ccgagcttct cgaccgtttt cggctcgctt atcaagaaga 2220 caccacatgg cgcgaatccc tcctccatgg agacaccagc ttcaccacca ccaatgacat 2280 ggttttccat aacggccgcg tttacgtccc tccccctctt cgcctcgaca tcatgcacca 2340 acaccacgac gcagtccttg gaggccatcc aggacatacc atcactctca acaaggtcct 2400 tcgcagttac tcatggcctg gcctctacac ctttgtctgc cgctacgtcg cagcttgtga 2460 tacttgcaac tgcacgaaga tccctcgtca caagccctat ggactcctca agcccctcga 2520 catccccagc cgtccttgga aatcaatttc catggacttc atcattaaac tccccatctc 2580 ccatggatac gattccatcc tcgtagtttg cgaccgcctc acacgtgcca ctcacttcat 2640 cccttgcaat gaaaccatga ctgcctccaa actcgcctgg cttttcatcg atcgaatctt 2700 ccgctatcac ggtttacctg actccatcat ctcagattga ggttccctct tcgtctccaa 2760 cttctggaag cgtctgacca cccacctttc cattgacctt cgacactcca ctgcttacca 2820 ccctcgcacc gacggactca cggaacacac caaccaaaca ttagaaactt acatctgtgc 2880 ctattgttct tatcaacagg atgactgggt cgactatctg ccgctttccg agttcgtttt 2940 caatagtgcc gagaacacct caaccaaaca aacccccttc tttgccaaca ttggcttcca 3000 cccaacgttc acgcctcaac tcgccgatat ctccactgtc cccgccgctg acgaactcgc 3060 ccaacacctc gaccgcatcc actcagaact aaaagcagaa cttgagcttg ctcaggaacg 3120 ccaacgcaag ttattcaaca agaaagtcct cccttcccct gtctaccaac cagatcaact 3180 ggtttggcta ctccgtcgca atgtctgcac cactcgaccc tcactcaagc ttgaccatcg 3240 ccgccttgga cccttcccga tcattcatgc tatcggcaac gacacctatc tcctcaaact 3300 cgcctcttat ctctcctgtc ttcaccctgt atttcacacc tcactcctcg aaccatattc 3360 cgacccatct gagtttcaca ctcacacaga acctgagcct ttccaactcg ccgaaaaccc 3420 catagaaaca atctctgata tttccgatgt actcgactgc cgcaagactg gtcatcgtta 3480 tgactacctg gtctgctgga aggatctttc catggacgac aactcctggg ttcctcttca 3540 ggaacttccc acatcatcaa atgaactcct cgagtgcttc catcgtcgca accctcgtgc 3600 tcctcgccct cactctctca ttctcaatca aattgccctc tctccctccg atttcgactt 3660 tgatattcct cccgactctg attccgttct gtcccccata tccattccca attcctctgc 
3720 tcacaaacgt ccaaagtcgc ctcctttaac tcgttccaac ttacgttcta tttacactcc 3780 tccttcttct acaacattgt ctactggtcg cgaatcctgt cctcaccctc gctactccac 3840 caatgtctcc tcgtaaaaag ggagataa 3868 // ID Gypsy-1_TMe-I repbase; DNA; FNG; 6163 BP. XX AC CABJ01002980; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_TMe_; KW Gypsy-1_TMe-LTR; Gypsy-1_TMe-I. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-6163 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01002980; Positions 22275 16113. XX CC Positions [4317-4796] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 1806..3230 FT /product="Gypsy-1_TMe-I_1p" FT /translation="MRQDKRVRIQRLDQEAKVPTKGSKGAAGHDLYDNEGI FT KIPANRQATIAMGIAIGLPEGTYGRIAPRSGLAVKHQLMTMAGVIDVNYTG FT EIKVVLANLAQEDYQVQKGDRIAQLIIEKINKEDLHEVDQLEETIRGNKGF FT GSTDQNQLDTTKRIEIKEITAQAFGRYYQRGDTTGILKWTYKDEKVVLAKI FT NISTELAIQDKKYQSKKRWKELVPKEYHQFTNLFQEEEATELPRRRPGIDH FT EIQLDKDEKAGKTKEIPYKKLYPLGEAELEELRKFIEQNLKRGWIRDSFVS FT GGSPILFVKKKDGKMRLCVDYRALNSVTKKDRYPLPLIGEALDRLRSAKYF FT TKLDIKDKYHNIRIKAGDQWKTTFASRYGMFEYLVMPFGLTNAPASFQRWI FT NKVLNKYLDICCIVYLDDVLIYSENIQQYQKDISNIMAAIQGSGMKIKPTK FT CEFHKEETEYLGFIINRSGVKVDPVKTAAI" FT CDS 3564..5264 FT /product="Gypsy-1_TMe-I_2p" FT /translation="MTPAECNYDVANKELLPIVQALKEWRRYVSNNDHPVR FT VLTDHKNLVPFMTTKELNNRQIRWMEHLSKYNFKIEYRPGKESGKPDALTR FT IASDKPQTKEDERIQQKTRILLPKTYFEAMEVIQFTSKEERGIENANKKDK FT TIQAIREVLEEGKKEMKGIALGLCQWKDGNMWYDGKQWIPEKEEIRTDIIR FT QHHDTRTAGHGGTAKTTEFITRSFYWPGLREDIKRYVKNCDICQRTKTAKH FT VPYGLLKSNEVPEQPWEIITMDFITDLPTLENCDSILVVVDRLTKMSHFIP FT 
IRKDINTRDFIQTFFREVFRLHGLPKNIITDRGTLFTSGLWKKVIGEMGIQ FT NRMSTAFHPQTDGQTEWVNAILEQYLRAYVNYQQDDWYKLLSMAEFVYNNG FT KQETIGTTPFFANYGQHPQSQTTYHTINDVIPDKTDMGKLHETLQYEMSQA FT QQRQKEQYDKHRRPDPNIRPGDMVWLTPRNIHTTRPSKKLDYKKIGPFRIL FT EKVGSQAYKLDLPSSMRIHNTFQISLLELHNDNKFPSQTLIPPPPIEPDGE FT PEYELDEIVDSRFFHGHLQY" XX SQ Sequence 6163 BP; 2259 A; 1380 C; 1431 G; 1093 T; 0 other; ttacaaagcg tctcctagta cgacgataaa cgatacaaga cggcatcctt caacccagac 60 agggatggag atattgacat cgacgatcaa aatcttaatc taggaggatt cagcgacgcc 120 ctagatcacg tctcagcaga accctcgaga gcagcaagcc cagtaacccc agagaaagga 180 tcaccagtcg atagctctcc tgctggaatg ccacgacccg agagaatgtc gggatccgca 240 gtacctgggc caccggcccc agccgctccg gcaccaggaa ccagcatact tggaatgact 300 gaaaatcagt ttcagggatt tcttgcagcc acgcgaaata cagggcaagg aaatgtccaa 360 caaccaaaag caaaggagct tgacacatat gaaggagccg aagataagct acgaatattc 420 ttagcgaaat gcgagggata ttttgcagtt aagggataca gagatcacca tgatgaaatt 480 aagattccat atgtggaatc tctactcaga ggactcgcgg gacaatagat catgcccttc 540 caagatggac ggaaacagag ggaatgggtc acgtacgaag agttcaagga gacactcttt 600 gaacaatttg gagaccccaa cgccaaagaa acagcaagag gaaagctaga aaaactccag 660 caaggaaaac aaatgttctc agcatactgg aacgaatgtc gactgttgga atacgaggcc 720 gagtttgatg atgcgactct ctacatcatc ctcctgagaa attgcagtga gagactccga 780 gaggcaaggg gaatctctga cctgcgctat acgacagccg aacgatttgc aagatgggca 840 atctacaaag aaagcaaatt aaacatagtt cggagcaact tgaaaaatac tatagcacat 900 agagtggttc aatcatccag gaatgtggac gggaccttca aaccgcaagc acctcgagga 960 gaaagtcgag cattaccacc aggcgaaccg atggaattag acaagtcaag agcagggcct 1020 gctctgaatc tgtctaagga ataatacact agaaggatga atgggaaact atgcctaagg 1080 tgtggaaaag catggcatcg gatacgagaa tgcagaagcc gactggacaa cggataccag 1140 ggaaacagag ctaatggaaa taatcgtttc caaaatcagt aaagaagcaa catccgcgaa 1200 atgatagcag aggattcacc ggcggaaaac gacgagggtc cccagtaaac agtggtctgg 1260 gagacccaag atgtggacct ggaaaaccca tcgaattagc gaactactac acggcactac 1320 acacgcaaag aggagaagaa gatggagatc atgtgatgat caaaacaaag atatacaaca 1380 gagaaagcga agaaataaca 
atatatgcaa tgcgcgactc gggagccact gaagacttca 1440 tcgacaaaga aatatgtgac aaatacggcc tccacaccaa acaagcgaag aaagcaagac 1500 aggtctactt agcggatgga aaaccaagcg ctatgggtcc tataacccat atggcgacag 1560 tacctatgta cattggcagc tatagggaaa cggcaacctt tcaagtggca gatctgaaga 1620 accatgagat agtcctaggg atgccatggt tacggaagca taatccgacg atcgattgga 1680 acgagaataa gatgaccttt acaagcgaac tatgtgcgac acaatgtctc gacagttcac 1740 ctgtggtcta ctcaataccg atgtgagaag cagaggaaga aactttgcat gtaaagtttg 1800 ccgagatgcg gcaagacaag agggtccgga tacaaagact ggaccaagag gcaaaagtgc 1860 ctaccaaggg ttccaaaggg gcagcaggac acgacctata tgacaacgaa ggaatcaaga 1920 taccggccaa caggcaggca acgatagcga tgggaatagc cattggactc ccagaaggca 1980 cctatggacg aatagcaccc cgcagtggtc tagcagtcaa acaccaactc atgaccatgg 2040 caggggtaat cgacgtcaac tatactggtg aaatcaaagt ggtcctggcc aacctagctc 2100 aagaggacta ccaagtccag aaaggcgatc gaatcgcaca gttgataatc gaaaaaatca 2160 acaaagagga tctgcatgaa gtagatcaac tggaagaaac cataagagga aacaagggat 2220 ttggaagtac ggaccaaaac cagctggata caacaaaaag aattgagatc aaagagatca 2280 cagcacaagc ctttggacga tattatcaaa gaggagatac aacaggaatt ctcaaatgga 2340 catacaaaga cgaaaaagtg gttctagcaa aaattaacat tagcacagag ctagcaatcc 2400 aggacaagaa ataccaaagc aaaaaacgat ggaaagaatt ggttccaaaa gaatatcacc 2460 aattcacaaa tctattccaa gaagaggaag ctacggagct accacgtcga agaccaggaa 2520 tcgaccacga aattcagctg gacaaagacg aaaaagcggg aaagacaaaa gagattccct 2580 ataagaagtt atacccccta ggggaagcag agctcgaaga acttaggaag ttcatcgagc 2640 aaaacctaaa gagaggatgg atccgagact ctttcgtgtc aggaggatcc ccaatactat 2700 tcgtcaaaaa gaaagacgga aaaatgagac tatgcgtgga ctacagagcc ctcaattcgg 2760 ttaccaagaa ggaccgatat ccactaccct tgattgggga agcactggac agactacgat 2820 cagcaaaata cttcaccaaa ctcgacatca aagacaaata ccacaatata cgcatcaagg 2880 caggagacca atggaagact acattcgcct caagatacgg aatgttcgaa tacttggtta 2940 tgccctttgg gctaaccaac gcaccggctt cattccagag atggatcaac aaggtactta 3000 ataagtacct cgacatatgt tgcatcgtat acttggacga tgtactcata tactcggaaa 3060 acatacaaca ataccaaaaa gatattagca 
atatcatggc ggcaatacaa ggatcaggaa 3120 tgaagatcaa accaacaaaa tgcgaattcc ataaggaaga aacagaatac cttggattta 3180 ttatcaatcg aagtggggtc aaggtcgacc ctgttaaaac agcagccata tgagattggg 3240 ccaaaccaaa caacaaaaca gatgtccaat gctttatggg attctgcaac ttttatcgaa 3300 ggttcataga ggggtttagc agacttgcaa aaccactcta taacctaacg aaaaaagaca 3360 agaaatggga atggggagaa acggaacaaa atgcatatga cgcaatacga acgcatctaa 3420 catcagcacc catattggtc catttcaacc caaaccgcag gacaatgatt gaaatggacg 3480 cctgaaaata tgtatgctca ggcatactat cacaacaatg cgacgatgga aaatggaggc 3540 cagtgtccta tcgatccaag acaatgaccc ccgcagagtg caactacgat gttgcaaaca 3600 aagaactatt accaatagtt caagcattaa aagaatggag aagatacgtt tcgaacaacg 3660 accatccagt ccgagttcta accgaccaca agaacctggt gccgttcatg actacgaaag 3720 aactgaacaa ccgccagata agatggatgg aacacctcag caaatacaac ttcaaaatcg 3780 aataccgacc aggcaaagaa agtggaaagc cagatgctct caccaggatt gcaagcgaca 3840 aaccacaaac gaaagaagac gaaaggatcc aacaaaaaac acggatccta ttaccaaaaa 3900 cctactttga ggcaatggag gttatccaat tcacatcaaa ggaagaaaga ggaattgaga 3960 acgcaaataa gaaagacaaa acaatccagg caataagaga agtcttagaa gaaggaaaaa 4020 aggagatgaa aggaattgca ctgggattat gccaatggaa ggacggaaat atgtggtacg 4080 atggaaaaca atggatccca gaaaaagagg agataaggac agacatcata cgacaacacc 4140 acgatacccg aacggcagga cacggaggaa cggctaaaac aacagaattc atcacgaggt 4200 cattctattg gccaggactg cgcgaagata tcaaacgata tgtcaaaaac tgtgacatat 4260 gccagaggac aaaaacggct aaacacgtac cttatggatt actaaaatca aacgaagtcc 4320 ccgaacaacc atgggaaata attactatgg atttcatcac ggacctacca acattggaaa 4380 attgcgatag catcttggtc gtggtggatc gcctgaccaa aatgagtcat ttcattccga 4440 ttagaaaaga catcaatacg agggatttta tacaaacatt cttccgagag gtctttcgac 4500 ttcatggcct cccgaagaat atcattaccg atcgaggaac actattcaca tcaggactat 4560 ggaagaaagt aattggagaa atggggatac aaaacagaat gagcacagcg ttccacccac 4620 aaacagatgg acaaacggaa tgggtaaatg cgatacttga acaatattta agggcctatg 4680 tcaactatca gcaagatgat tggtacaagc tgttatctat ggccgaattc gtatacaaca 4740 atggaaagca agaaacaatt ggcacgacac ccttcttcgc 
aaattacgga caacatccac 4800 aaagtcagac cacatatcat acgattaatg atgtaatacc ggataaaacc gatatgggga 4860 aactacacga aacattacaa tacgaaatga gtcaagcaca acaacgacag aaggaacaat 4920 acgataagca ccgaagacca gatccgaaca tacgaccagg agatatggta tggctaacac 4980 cacgcaatat tcatacaacg cgaccatcaa agaaattgga ctacaagaaa ataggaccat 5040 tccggatact ggaaaaagtg ggcagccaag cttataaact cgaccttccc tcctcaatgc 5100 gaatacacaa tacattccaa atctcactcc tcgaactaca caacgacaac aagtttcctt 5160 cacaaacact tattcctccc cctccgatcg aacctgatgg agaaccagaa tacgagctcg 5220 acgaaatagt cgattcccgc ttcttccacg gtcacctcca atactgagcc aagtggacag 5280 gatattctcc tgaatacgat aaaacttggt acttcgcaca caactttgaa aacgcacaag 5340 atgccatctc aaccttccac cagcgatacc caggcaagcc cggcccaggt gacgatcatg 5400 ttcaaccaag acaaaggaaa gggaaagact cctacgccat tgcagcaaca tgagggctgg 5460 ggcgatagcc cccgccaaac cacacccact agtcccgaat actcattcat ggggccaacc 5520 aatgaagaac tcaaggaaag ggaaagacaa gcacagcata acgccatgtt atggacggta 5580 tgctacgatg acggttgcca aacacacttg ggcgataatg aaggcttggg atggttcccg 5640 agggaatgtc gaaagcgtga cccatgccca gaccacgtcc tgaagcattg gcaacaatgc 5700 ttcagaaatg ggtgcaggat gcacagggca caaaaaatat ctgcgggatt ctacccgcag 5760 aaggatggaa gccaaaaaga gttggaagga gaagatgcca gaaaggcatg ggccacgagg 5820 acgcggtcca aggagggagg ggcccaaaat tcctcggcag acacggaacc ggatgtcagg 5880 agcctacaca ggcagatcta gggactccga gaacaactgg gcccagccga ggccacaatt 5940 tcggcaaacg acaaaacgat cgcggagctt agacagctga acgacaatca gcggcaaacg 6000 atcgagatgg tgaatctgac actggaatcg gtaagggtca gatggaggat ggcagacgaa 6060 aagttccaca agctgagaag gaccatgcgc ttggcaggac gagaactatt gaaacaggga 6120 gaaaaagcct gatgggaatc aggcttagga ggaggggagc taa 6163 // ID Gypsy-14_RO-LTR repbase; DNA; FNG; 832 BP. XX AC AACW02000082; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_RO_; KW Gypsy-14_RO-I; Gypsy-14_RO-LTR. 
XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-832 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000082; Positions 32404 33235. XX SQ Sequence 832 BP; 258 A; 170 C; 128 G; 276 T; 0 other; tgtcatgttt tgacatgtca ataagataac agtcttatca ttctaagcta aatctttaca 60 aatgagtttg aataagaata gatcctgata attaccaact gccctgaaca gccaagtttc 120 actttcatct caacagttga ttatttacat cagtttataa aatttttctt atacatgtca 180 aatcatatcg gtcgtaacta tctcttgtca tttaatctca ggtcttgctc aaactaatat 240 aaatagcctt ggaatctttc tttaataaaa gtaattcgac tttagaacta ttgttcggtc 300 tcgttaattc ttctttttca tattttaaat acttaaaaac acttcatttt ttttaacgaa 360 ttctgaactt ttgtttaaaa ttacaaaata tggctgacaa cgatcaatcc actcaagttt 420 cctcccttaa taagaaaggc aaccttaatc aacaagaaga agagacatcc agggtcagtg 480 aagacataga aatggctatc gttgaagaag ctgcttctac cacttcaaat ggttttgaat 540 taaaagaaga tccatctatc gatgctgtta tgggtctgag ggttacattg acagacctta 600 gacaacagat cgctcgtgct gtggtaactg gagctcctca agaagagcta accaagctcc 660 aagaacaggc agttaatata aagaactgca tccagttctt agatgatgct caagctttct 720 gtattactcc tcctacaccc gtaggtaatg ctgttttcag tcctggcttc tcaaatactg 780 cttttaacac tcggtcggct catattatcc cacctgatct tcctgtctgg ca 832 // ID Copia-3_CCO-LTR repbase; DNA; FNG; 393 BP. XX AC AACS02000011; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_CCO_; KW Copia-3_CCO-I; Copia-3_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-393 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000011; Positions 1323379 1322987. XX SQ Sequence 393 BP; 106 A; 93 C; 70 G; 124 T; 0 other; tgttgaactt atcttatctt atctgaagat tatatcgtag attagattag attagtgaga 60 cttggtccag aacactctag aacattctag tgtagaacat tctggaacgt cacaggagta 120 ggtgctaata cttgcaccat attgcgtcat tatcgctcat tcttactcat gatgactcat 180 tcgcttactc acccgtcaca ccttccatat ttatgtagta tacgtcacat ctgtcacgct 240 tcctgtaaac tataaatacc ggccctccct gggcctggat agatgataga gttcgtcaca 300 aactcttgta gtagccacta gttagcctca cacaactctc tacctggtaa gtcttagagt 360 agaccacctt agctagatag tgtttgctcg aca 393 // ID Gypsy-3_MLP-LTR repbase; DNA; FNG; 273 BP. XX AC AECX01001703; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_MLP_; KW Gypsy-3_MLP-I; Gypsy-3_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-273 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001703; Positions 143434 143162. XX SQ Sequence 273 BP; 57 A; 60 C; 50 G; 106 T; 0 other; tgtaaggtgt tacacactta ctacattaca tatcacacgt gtgacagggg tatacattga 60 ctcagtacat tagaactcta gctgttatca aattcccttt tcctcatcta ttcttttcgt 120 tgtttgtcgt tagtggtctt tcattgtacc aggttgttct tgatctgatg ttttcatgag 180 ctaggcctag ctttccgcaa tctagattgg ttatagatac tcttggctct cgccatcgtg 240 cctttccagt gttgttccta ttaggaactt cca 273 // ID LTRYL1 repbase; DNA; FNG; 273 BP. XX AC AJ439559; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 
7.1, Last updated, Version 1) XX DE Yarrowia lipolytica Ty3/gypsy-type retrotransposon LTRYL1, long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTRYL1; KW Long terminal repeat; Ty3/GYPSY superfamily; retrotransposon. XX OS Yarrowia lipolytica OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Dipodascaceae; Yarrowia. XX RN [1] RP 1-273 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR GenBank; AJ439559; Positions 1 273. XX SQ Sequence 273 BP; 83 A; 49 C; 60 G; 81 T; 0 other; tgttgaaggt aatacccggt ggggtaatac ctggctaatt cctcacataa tgatgagatg 60 tcaccacata gtaataagag gaatcagctg ggtcgtagaa gagatgtcca tcctgtagat 120 agttttatcc actgatctag aatctctttt ctggtgaatg agcaagcaca caccagaagg 180 gtgtgtgcta ctatcggaac gtcgtgtccc tagaagtggg ttgtaagttg ttaattaact 240 atatcacatc aataatactt ataatcttca ata 273 // ID PCretro5_LTR repbase; DNA; FNG; 419 BP. XX AC DQ097839; XX DT 08-MAR-2006 (Rel. 11.02, Created) DT 08-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Phanerochaete chrysosporium RP-78 Ty1/copia LTR retrotransposon DE (LTR portion). XX KW Copia; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; PCretro5_I; PCretro5_LTR. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-419 RA Novikova O., Fursov M., Shutov O. and Blinov A.; RT "Divergent groups of LTR retrotransposons from Phanerochaete RT chrysosporium."; RL Direct submission to GenBank (2005). XX DR EMBL/GenBank/DDBJ; DQ097839; Positions 43 461. 
XX SQ Sequence 419 BP; 91 A; 128 C; 95 G; 105 T; 0 other; tgtcgaagac tcgcgccgcg ctttgcgcaa gcgcccaccg ccgccacttg cgtggcgcgg 60 aagcaaaact acctcggccg tcaagccaca acgtcggact ttccgtagat agtatccagt 120 agtattagct aatcttttcc gtagtataaa aagccccccc tctcaatttt cgcccgctca 180 gagacgtctt cacccttctc tgagcgtatg tactgtttcc cccgtagata ctaggctcgt 240 actgaccatt atattagtct agactgagta tctacggtta taggtacata atagagacgt 300 cttcaccctt ctctgagctc tagactgagt atctacggtt ataggcctcg ctgcgcacgc 360 tgaccggagt cagcggcccc ctgtctgcgc agaacccgtc cagagaccgg ttctgatag 419 // ID SINE3-1_AO repbase; DNA; FNG; 206 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE SINE3 nonautonomous non-LTR retrotransposon - a consensus DE sequence. XX KW SINE3/5S; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; 5S rRNA; SINE3-1_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-206 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-206 RA Kapitonov V.V. and Jurka J.; RT "SINE3-1_AO, a family of 5S rRNA-derived nonautonomous non-LTR RT retrotransposons in the Aspergillus oryzae genome."; RL Repbase Reports 6(1), 45-45 (2006). XX DR [2] (Consensus) XX CC SINE3-1_AO is a family of SINE3-like retrotransposons in the A. CC oryzae genome. The 63-bp 5' terminus was derived from 5S rRNA CC (pos. 3-65, 98% identity to the A. oryzae 5S rRNA) and includes CC the polIII internal promoter. There are several families of CC SINE3-1_AO-like elements in the A. nidulans, A. fumigatus, and A. CC oryzae genomes. 
XX SQ Sequence 206 BP; 50 A; 37 C; 69 G; 45 T; 5 other; agtacatacg accatagggt gtggagaaca gggcttcccg tccgctcagc cgtacttaag 60 ccaactatta aacgcggcca attatcaatg tgcccttagc tctnaaagag cgrtggcttc 120 grtnttaggg tgagtgcggg tgatttgggt gagaabtctg gagaccgggg tgagagaggt 180 gagaactggg tgagagtggg tgagaa 206 // ID LTR2_AO repbase; DNA; FNG; 549 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE A solo LTR of an unknown LTR retrotransposon - a consensus DE sequence. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW LTR2_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-549 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-549 RA Kapitonov V.V. and Jurka J.; RT "LTR2_AO, a family of LTR retrotransposons in the Aspergillus RT oryzae genome."; RL Repbase Reports 6(1), 23-23 (2006). XX DR [2] (Consensus) XX CC It is a solo long terminal repeat of an unknown LTR CC retrotransposon. It is characterized by 5-bp TSDs. 
XX SQ Sequence 549 BP; 187 A; 79 C; 118 G; 164 T; 1 other; tgtcagctgc cctaatccta aagcggaata gtcttagggc tagctaggat gaatagaaaa 60 gttgaataga atgaatgaat tgaatgacta aatcacataa gaaagctagg gttgactgag 120 tagtaaccct aggtcaggca ggcaggtagg gcttaggcaa tctaagccct agattagata 180 ataaaggcag gcaactaggg ctaaggcagt caaagcccta ggtcagrtat gaatagaaga 240 gatcaaaaga atgaatcaat atctggcagt cagatagttg gatgatgatc tgcttgaaaa 300 aattgattca atttagcagg ggattgagtg gctttttata ttgaattggg ctcgcttaga 360 cttactaaat taggaattcg cctaatattt cctaatttag tactcgccta atatttccta 420 attaggatgt aattgcttaa tattgtctag ttaggatttt ttcgtctaat ttctcctaga 480 taggtaatat tctagaaaga ttctagaata ttcaagaaaa tttcctaaat aggaaatata 540 tctctgaca 549 // ID Copia-63_MLP-I repbase; DNA; FNG; 4567 BP. XX AC AECX01000530; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-63_MLP_; KW Copia-63_MLP-LTR; Copia-63_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4567 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000530; Positions 200942 196376. XX CC Positions [1693-2202] - Integrase core CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(1618..2823,2827..4089) FT /product="Copia-63_MLP-I_1p" FT /translation="MNEIDVQKCPVCVKSKMACHSFKSRAEYRADSVGELI FT HSDVCSFKVPSREGFKYFVTFINDFSKFTIIYPLKLKSNTSSCFKVFCSKF FT ELRISTTIKKLRTDNGGEYMSYEFKNYLSESGIEHFPGPPHSPQLNGVSER FT TNRTICNHIRCSVTSSGLSKSFWVDALRYLAHSLNSIPCYTPLGFFSPSSL FT ISVSPIDPTRCHPFGCEVYYKVPEANRKKLDPKFIRSIFLFYLPDGNSYVV FT WDTTNSMPVKSCDVIFNVNVFPSLEKLSPKPPSSHDAHIPWPSQSVPRPSA FT RYRRLSVSIHNPARRPLSPSAFKDLPSMPQIPSPASLPSLSDHCPSPPQTP FT PPLPPRPTPSVSVIPCRRPSTPPTRKPSKSKIPRTQLSNQPARVPSADLPT FT SPDQPPVPRSGRNRAAPDRLGNFVKSACPAGEDVSDTPKTWKQLLRSPNKS FT KWLKAADDEYLSLVGMSTWKLVPRPDKRKIIQSKWVFKVKRRVDKTILKLK FT ARLVAMGYSQVEGVDYNEVFAPTTRLETLRLVLSMIASRKWSGRQVNIKTA FT FLNGHLDEPVYMTQPPGYEDPNFPDWVCEVTHSIYGLKQSPRQWNLKLHKV FT LILLGLTQSRHDPTLYFELVDGKVKGLVTVHVDDLSVVGPDSFVNDFIAKL FT SSHFEIGSNEELHHFLSLNITRDLSKNLCFINQAHYIKELHEKFLPTSHIK FT VTTPTDSSFKDLLPKQPNDQSSPGHYQSLIGALMWVAQCTRPDISFAVSRL FT SQFLRDPLEAHWFAALRVLNYLITTSHLSLTLGGEASISGYSDSDWAEDRH FT DRRSTTGYTYQWGSGPILWRS" XX SQ Sequence 4567 BP; 1275 A; 1214 C; 811 G; 1267 T; 0 other; agaggttatg agcccatcag ctaccactta aatctgtctc gctcaataca aatctcaaac 60 tttaatcatc ccattcaatc cattctctct atcttctcca ctcttcttcg tttcggaaag 120 gttcgaagtt gcatgaagat aatttctcat ccaatggcgt catcagctac agatagccag 180 aaggaaaact gtcttaccgg tatcacttct ctccgccctc caagtgagga ctcaaactac 240 catgactggg agttcaaagt cgaactcgcc ctggatgcag tcaaactggc ttacgtcctc 300 aaaccaatcg ctgtcaaaga cagattgaac aattggaaag aagacaacac caaagcctgc 360 accctcatct cccgttccat cgaagacgga aatcttaagt atataaaacc tcatcgttgc 420 aacgccgccg gcatgtgggc tgccctcaga gtagctcatg aagactctgc tacaggaggt 480 cgtatgcatc tactccataa actcattacg actcgaatgg aagtcaccga tatagatgca 540 cacattgact cccttcacaa gatatacaaa cgactcgacc acctcatcac acctgacagt 600 ccttgcactg tagatgatat ctatactacc tccatcctaa ccgcccttcc ccttgattgg 660 ctacctgtca taactcctct catgcaacga gaggctgtga attccgccag ggttatcaga 720 gccctcaagt aagaggcaac tcagcaaaag acttcttctg atctcaacag tccctcagat 780 cttgtcgctg cacgagctta 
tgctaacaac cggttcacta ccaacaacaa tcgatccggt 840 caaggtcctc gccgaaatac gaaattttgt actcactgtg aacataccaa ccacgatgtg 900 gagaactgct ggatttgtca aggcacatct cgaggctcat cgtcatctaa cagaggagga 960 agacactctt tgagcagtca tccaaatcgt caaaacgcaa aggccggcaa aacttctgtc 1020 ttgactcttg aattcagcaa agatgaagat gattcgaatg acaacacaac tgaagtgact 1080 caagccaaat cagcaaaagt tatcacagcc aaccatgtct ctacaatgga ttggaagatc 1140 gacttggggt gttctctcac tatgacgccc aacaagtcag ctttgacaca cttggaagca 1200 agtagaaaaa ctatccaact tgccgacgcc ttgtccatca gctcaactca ctcaggaaga 1260 attcatctac ctatgacttc caacttacat catcaccgtt ctttacttgt ccccggtctg 1320 caagaaccct tattatctgt ctccgctctc tgtgatgatg gttttaaagt tgtcttcgac 1380 aacaccaagt gctcttttta ttgatcgaac aatcaggacc tgtcgaaccc cgttggggta 1440 ggatattgac gtggaaactt gtactatctt ccatcaaagg tagattcatc tcactccacc 1500 tcgacctcaa caactaaaac taatcaaact cttcccaact ggcacaacaa tctaggccat 1560 ataggtctga agcccctcaa atcatttctc aagaagcatg aaattgctcc aactgttatg 1620 aatgaaattg atgtacagaa atgtcctgtc tgtgtaaaat ccaagatggc ttgtcattct 1680 ttcaagtcca gagcagagta tcgtgcggat tcagttggag aattgatcca ctctgatgtg 1740 tgcagtttca aagtgccttc tcgagaaggt ttcaaatatt ttgtaacctt tatcaatgat 1800 ttttctaaat ttacaatcat ctatcctttg aaattgaaaa gcaatacttc ttcttgtttt 1860 aaagtgtttt gttccaagtt tgaacttcgg atctcaacta ccattaagaa actccggaca 1920 gacaatggag gagaatatat gtcatatgaa ttcaagaatt acttatcaga atctggaata 1980 gaacatttcc caggtccgcc tcactcccct cagctgaatg gtgtctcaga aagaacaaac 2040 cgcaccatct gtaatcatat caggtgttct gtaacaagtt caggactatc caagtctttt 2100 tgggttgatg cactaagata cttagcgcac tctctcaact ccatcccgtg ctatactccc 2160 ttgggattct tctctccatc atctctaatc tcagtttctc ctattgatcc tacaagatgc 2220 catcctttcg gttgtgaagt gtactataag gtgccagaag caaatcgtaa gaagttggat 2280 cccaaattta ttaggtccat ctttcttttc tatctcccag atggaaacag ctacgtggtg 2340 tgggatacaa ccaactcaat gcctgttaaa tcttgtgatg ttatcttcaa tgttaatgtt 2400 tttccatctc tagagaaatt atccccaaaa cctccgtctt ctcatgatgc tcacatacca 2460 tggccttcac aaagcgtacc tcgtccgtca 
gctaggtacc gtcgtctctc tgtttctata 2520 cataatcccg cgagacgccc tctttctccc tccgctttca aagatcttcc atctatgcct 2580 caaattccct ctccagcttc tttaccaagt ttgtcagacc attgtccatc tcccccccaa 2640 actcctcccc ctctaccccc tcgtccaact ccgtctgtat cggtcattcc ctgcagacgt 2700 ccctcaactc ctccaactcg caaaccgtca aaatcaaaaa taccgcgaac tcaactgtca 2760 aatcaacctg caagagtacc atcagcagac ctaccgacct caccagatca acctcccgta 2820 ccttgacgtt ctggaagaaa tagagctgct ccagatcgtt tgggtaattt tgtcaaatcg 2880 gcctgtccag caggagaaga tgtctcagat actcctaaaa cttggaaaca actactccgc 2940 tcaccaaata aatcaaaatg gttgaaggct gctgatgacg aatatttgtc tctagtaggt 3000 atgtcaacct ggaaactcgt gcctcgaccg gataaaagaa aaatcataca atcaaagtgg 3060 gtcttcaagg tgaaaagaag agtagataaa actattctca aactcaaagc tcgcttagtg 3120 gccatgggct actctcaagt tgaaggtgtc gattacaatg aagttttcgc cccaaccact 3180 cgactcgaaa cactgcgtct agttctatca atgatagctt ccaggaaatg gtcaggaagg 3240 caagtcaaca tcaagaccgc ttttctcaac ggccatttgg atgagcctgt gtacatgact 3300 caacctcccg gttatgaaga ccccaacttt ccagattggg tatgcgaggt gacacattca 3360 atctatggct tgaaacaatc acccaggcag tggaacctca aactccacaa agttctcatc 3420 ctactcggtc ttactcaatc acgccatgat cccactctat actttgaact cgttgacggc 3480 aaagttaaag gtctagtgac ggttcatgtg gacgacctat cagtggttgg accggatagt 3540 tttgtcaacg actttatcgc caaactctca tctcactttg aaatcggttc gaatgaagaa 3600 cttcatcatt tcttatccct aaacatcact cgtgatctct ccaagaatct atgtttcatc 3660 aatcaagcgc attacatcaa agaactacat gaaaaattcc taccaacttc gcatatcaaa 3720 gtcacaactc caactgattc ctcgttcaag gatcttcttc ccaagcaacc gaatgatcaa 3780 tcatctccag gccactacca aagcctaatc ggtgctttga tgtgggtcgc tcaatgcacg 3840 agacccgata tctcatttgc tgtcagcaga ttatctcagt tcctgagaga tccgttggag 3900 gctcattggt ttgctgctct tcgtgttctc aactacctca tcaccacatc tcatctctct 3960 ctcaccttag gaggagaagc atcaatttcc ggttactctg actcagactg ggcagaggat 4020 cgccacgacc gacgatccac aacgggctac acctatcaat ggggaagtgg gccaatcttg 4080 tggagatctt gaaagcaagc tactgtatct ctatccagca ccaaggtgga atacaaagcg 4140 ctatctgact catgtcgtga aggcttatgg ctcggtaacc 
tactatccaa actcggtgtt 4200 cgccctccag ctgcaatccc acttcatgtc gataatgaag gtgcggaagc actcgccaag 4260 aaccctcagc atcactcacg cactaaacac atacacactt gctatcactt tgtgcgtgag 4320 tgcgtatcca ataaatcaat ctcagttctc cacgtggcgt ccaaggatat gttagcagat 4380 atgctaacta agccccttgg ctgtgtctta cttcaatctc atcgtgaaag atttggcatc 4440 atctagactt tctatttcaa tttttctatt ctttcttatt ttttcttatt ctcaaacttt 4500 gttgtctctc ttaatttttt tttcttatct gtttcttcta gtgcttcatg ttgtgcgcaa 4560 ggggggg 4567 // ID Gypsy-45_MLP-I repbase; DNA; FNG; 6362 BP. XX AC AECX01001170; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-45_MLP_; KW Gypsy-45_MLP-LTR; Gypsy-45_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6362 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001170; Positions 140943 134582. XX CC Positions [5163-5621] - Integrase core CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 372..1469 FT /product="Gypsy-45_MLP-I_2p" FT /translation="MEPQTDPTLTEVLRQLNHLSSQFAEVKASLADETQKR FT QEAEIRLQQYETNNMQASPTPAQPNPASVQPVYVNQTVQAQRQPKMSTPDK FT FDGSKGSKAKVFMNQIGLYMQLNDHLFANDQAKVAFALSYMSGKASIWGQS FT LTDQLLDPELSESVSWSKFVVSFKATFFDSERVSRAEKEIRELKQKGSVAD FT YWIRFSELSLVIKWPQSILMSHFEQGLKREIAVYMIKDEFNDAEEMARHAI FT KLDNKLNRRAHETTYSSTASVPNPTSAIDPDAMDCSAYKLNISNEEYKRRG FT STRACYKCGKTGHYIADCFVGKRGRMWNSSFKNDSSESKLKARIAELEGQL FT GKSSSVVESGSKAESSKNGDARE" FT CDS 1538..4441 FT /product="Gypsy-45_MLP-I_1p" FT /translation="MKDTRIIAKLSLHDPQTATTKQVRALVDSGATHEAVS FT RSLVREFNLFTTPLQESRSVVGFSGVESRVNDVGEYYINYGKHLTTFIVTE FT LHDKYDIILGMPWIKKNHQIINWEDSKLEDITHAIATVETVSSVPKKSSNV FT YPMEPERQARNCNEGVELTNSLTPPQCEFARSPSPPLPEAAGKLSPLLEFI FT DHKAPTNPGQIPKDVDHIAASSVPKTTSMDQGIPKGYARKLEEGVEFLSSS FT MPPQCEFDNSLSSCIKEATGKRSSLLNLKSQSLGLNVRRSLPTPKRLYAKE FT FDVIKAPQLSIAAAKTSWNVSAKLAAENAKSLPEKTAKELVPECYHEYLGM FT FEKSNTNVLPPHRQYDFRVDLLPNVVPQAGRVIPLSPKEDEVLHEMLNKGL FT ANGTICRTTSPWAAPVLFTGKKDGNLRPCFDYRRLNALTVKNKYPLPLTME FT LIDSLLDADQFTSLDMRNGYNNLRVREGDEAKLAFICKAGQFEPLTMPFGP FT TGAPGFFQFFIQDILKAHIGRDVAAYQDDILIYTKPGEDPEAKVKEILNIL FT KEQNVWLKPEKCKFSKKEVSYLGLIISKNQIKMDATKVEAVKNWPVPRNVN FT EVQIFLGFANFYRRFISHFSKIAKPLHELSQKEIKFEWTEPRQEAFDKLKI FT AFTSAPVLKIANPYRPFILECDCSDFALGAVLSQVSEDNEELHPVAYLSRS FT LIQAERNYEIFDKELLAVVSAFKEWRQYLEGNPNRLNVIVYTDHKNLQSLM FT TTKELTRRQARWAEVLGSFDFKIRFRPGKQSTKPDALSRRPDLAPNKDEKL FT NFGQMLKPHNLPTDAFIDEFDALESWFIQENETAITHTGNELIEIMNLEEN FT EEVESSVWSDTEIIEEVKRVSGEDPRVNEIIQLYKDMPNSKHLNEYSMTNG FT LLYFRERIVVPNNTNVKFQILKSRHDSLIVGHPGRSRTLALVKRTFQGTRH FT LVKASKRAKSFWF" XX SQ Sequence 6362 BP; 2171 A; 1303 C; 1407 G; 1481 T; 0 other; aggtcctaat atattgcaac gtctcattta ggatcaactg tctacaacaa gtttaagtaa 60 ccaagaaaga agaaaaagaa taaatcgaaa agcaaagaag aagaagattt aaaagttaat 120 taaagttcaa gatctgtact gaagtcaaag tttaaaatag tttatctccg ctctataaga 180 aagcctgaat ctcaaggata atcacattac gtcaagtagt ttagaaacac ccgcaagtaa 240 agttacaatc attccaccac tgaactaaag tcacaacgcc 
aaatttcatc gtacctagtt 300 cttcgacgtc tgagtcgagc aaccattcaa cgaaaatcag cgaggtcttc cacgaactcg 360 aagagaacgt gatggaacca caaacagacc ccactttaac tgaagtcttg cgacaattga 420 atcatttgtc aagtcaattt gctgaagtca aggctagcct cgccgacgaa acacagaaac 480 gtcaagaagc tgaaattcgc ctgcagcagt atgaaactaa taacatgcaa gcctcaccga 540 ctcccgctca accgaacccc gcctccgtgc aaccagtgta tgtgaatcag acggtgcaag 600 ctcaacgcca accgaaaatg tcaacccctg acaagttcga tggaagcaaa ggatcgaaag 660 ccaaagtctt catgaaccag attgggctgt atatgcaact gaatgatcat ctgttcgcca 720 acgatcaggc caaagttgca ttcgcacttt cgtacatgtc aggaaaagct agcatctggg 780 gtcaaagtct aacggaccaa cttctagacc cagagctatc ggaatcggtg tcatggagta 840 aatttgttgt gtcgtttaaa gctacgtttt ttgattcgga aagagtgtca agagcggaaa 900 aggagatacg agagttgaaa cagaaaggat ctgttgcgga ttactggatc cgattctcag 960 aactgtcgct tgtaatcaaa tggcctcaaa gtatattaat gtcacatttc gagcaaggtt 1020 tgaaacgaga gatcgccgtc tacatgatca aagatgaatt taatgatgct gaagagatgg 1080 cgagacacgc aatcaagtta gataacaagt taaaccgacg agctcatgaa acaacatatt 1140 catcgactgc atcagtacca aacccgactt ctgcgatcga cccagacgca atggattgtt 1200 ctgcctacaa attgaacata tccaatgaag agtacaaacg taggggaagt acgagagctt 1260 gctacaagtg tggaaagaca ggtcattaca tagctgattg ttttgttggt aaacgtggga 1320 ggatgtggaa ctcaagtttt aaaaatgatt caagtgaaag caaactaaaa gcgaggatag 1380 ctgagctcga aggacaatta gggaagagta gtagtgttgt agagagtggt agcaaggctg 1440 aatcttcaaa aaatggcgac gctcgggagt gattgttgtg cctcccccga gcaaagaaca 1500 agtgggttta gaattaggag acattagtaa ccttgaaatg aaagatacca gaatcattgc 1560 taaattgtcc cttcacgacc cacaaactgc cacaaccaaa caagtgcgag ccctagttga 1620 tagtggtgcc actcacgaag cggtgagtag aagccttgta agagaattta accttttcac 1680 gaccccactg caagaatcaa gaagtgtagt gggatttagc ggtgtagaat caagagtaaa 1740 tgatgtagga gagtactaca ttaattacgg aaagcacctg acaacattca tagtcaccga 1800 attacacgac aagtacgata taatacttgg tatgccgtgg atcaagaaaa atcatcaaat 1860 catcaactgg gaagacagca aattagaaga catcactcac gctattgcaa ctgttgaaac 1920 agtgtcgtca gttccgaaaa aatcctcgaa tgtctaccct atggagcctg agaggcaagc 
1980 taggaactgt aacgaggggg tggagttaac aaactcatta acacccccgc aatgtgagtt 2040 tgctagatcc ccaagtcctc ctttacctga agcagctggc aagctttctc cccttctaga 2100 attcatagat cacaaggcac caacgaaccc aggccaaatc ccaaaagacg ttgatcacat 2160 tgcagcatct tcagttccaa aaacaacctc catggaccaa gggataccta aggggtacgc 2220 taggaaactt gaagaggggg tagagttttt aagctcatca atgcccccgc aatgtgagtt 2280 tgataattcc ctttcatcat gtattaaaga agcaactggc aagcgctcct ctcttctgaa 2340 tttaaagtca cagtctcttg gattaaatgt acgaagatca ctcccgactc ctaaacggtt 2400 atacgcgaag gaattcgacg taatcaaggc tcctcaactg agtattgcag ctgcgaaaac 2460 atcgtggaat gtctctgcta aactagcagc tgagaacgcg aagagcttac cggagaagac 2520 ggcaaaggag ctagtaccgg aatgctacca cgagtacttg ggaatgtttg aaaagagcaa 2580 cacaaatgtc ttaccaccac accgtcaata tgattttaga gttgatctcc ttccaaacgt 2640 ggttcctcag gctggaagag ttatacctct ctcaccgaag gaagatgaag tacttcatga 2700 gatgttaaac aaaggactgg cgaatgggac aatctgcaga accacatcac cttgggcggc 2760 cccggtatta ttcacaggaa agaaagatgg caacttgagg ccctgttttg attaccgacg 2820 attgaatgct ttaacagtca aaaataagta cccgttgcca cttaccatgg agctgattga 2880 cagtttatta gatgcagatc agttcaccag tttagacatg aggaatggat ataacaactt 2940 acgggtgcga gaaggcgacg aggctaagct ggcctttatc tgcaaagctg gacaatttga 3000 acctttaaca atgccatttg gtccaactgg cgccccggga tttttccagt tcttcattca 3060 ggatatcctc aaagctcata taggcaggga tgtggcggcg tatcaagatg atatcttgat 3120 ttacacgaaa ccgggagagg atcctgaggc gaaagtcaag gaaattttaa atatcttaaa 3180 ggagcaaaat gtatggctta aacctgaaaa atgcaagttt tccaaaaaag aagtctctta 3240 cctcggttta atcatatcaa agaatcaaat taagatggat gccaccaagg ttgaagcagt 3300 aaagaactgg ccggtaccgc ggaatgtcaa tgaagtacaa atctttctcg gttttgcaaa 3360 cttctatcgc aggtttatat cacatttctc taaaatagca aaaccattac atgaactctc 3420 acaaaaagaa atcaaatttg aatggactga acctagacag gaagcttttg acaaactcaa 3480 aattgcattt acctctgctc cggtactgaa aatagccaac ccgtatcgac cattcatact 3540 ggaatgcgac tgctctgact ttgcgctagg agcagtcctc tcgcaggtgt cagaagacaa 3600 cgaagagctt catccagtgg cgtatttatc gagatctctc attcaggcag aacgaaatta 3660 
tgagattttt gacaaagagc tattagcggt cgtaagtgcc ttcaaggagt ggcgacagta 3720 tttagaagga aatcctaacc ggttgaacgt gatcgtgtac acggaccaca agaatttaca 3780 atccctgatg accacgaaag agttaacacg acgccaggca agatgggcag aagtacttgg 3840 aagcttcgac ttcaaaatta ggttccgccc aggaaagcaa tctacaaagc cggacgcatt 3900 gtcacgacga cccgacttag cgccaaacaa agacgagaaa ctgaatttcg ggcaaatgct 3960 gaagccacac aacttgccaa ctgacgcatt cattgatgag tttgacgcac tcgagtcgtg 4020 gttcatacaa gagaatgaaa ctgctatcac tcacacggga aatgagctaa ttgaaatcat 4080 gaatttagag gagaatgagg aagttgagtc atcagtatgg agcgacaccg aaattataga 4140 agaagtaaaa agagtatcag gagaagaccc aagagtgaat gaaatcatac aactctacaa 4200 agacatgccc aactcgaaac acctgaatga gtattcgatg acaaatggcc tactctactt 4260 tcgtgaaaga atcgtggtac ctaacaatac taacgtaaaa tttcaaattc tgaaatcaag 4320 acacgatagc ctgattgtag gtcatccagg aagatcccga actctagccc tggtcaaacg 4380 gaccttccaa ggaactcgac acctggtaaa agccagtaaa cgggctaagt ccttctggtt 4440 ctgaacacca aggacaacaa tactacacac ctggtaaaag ccagtaaacg ggctaagtcc 4500 ttctggttct gaacaccaag gacaacaata ctacacacct attaaaggtg aaagggacaa 4560 gattcaatac gcctaatttt cctcagtgga ctgatattat cattaatcat aacttaaaca 4620 acagacctgg taaaagccag taaacgggct aagtccttct ggttctgaac accaaggaca 4680 acaatactac acacctatta aaggtgaaag ggacaagatt caatctatac aaaaggaggt 4740 aatagaaata agagcaattg atgaataagg ttgatatgat tgagttgaat ataatgttag 4800 agaaaagaac tcacacgcct aattttcctc agtggactga tgaggaaaaa gaggcgcgcg 4860 cggagtggcg atcagtatga actaagttga gtgatacaat gatgatacag ctaaatttgt 4920 aattaggttt gagtttttgt attggtggat ttgtataggt gatcatattg tatcggtgac 4980 tgtatggttt tagtgaggta tgtgtttgtc gtgattgaaa taaatgatat atcttaacaa 5040 aacggacctt ccactggccg tcaatgaagg catatgtcaa taagtacgtt gatggatgtc 5100 aatctttcca acgagtgaaa tcaagaacta ccaaaccgtt aggaaccttg caaccgctgc 5160 caatccctcg aggaccgtgg acagatattt gttatgattt aataacggac ctgccagaat 5220 cagagggaag tgactccatc ttaactgtag tagataggct gacaaaaatg gcacatttca 5280 tcagttgcaa gaaatctatg acgtcggacg agttggcaac attaatgatc cggaacgtgt 5340 ggaaactcca 
tggcactccc cagactgtca cgtcagatag gggaagtgtg ttcatttcaa 5400 caatcacgaa acagatcaat aagaaactag gaattaagac acaagcatca accgcgtatc 5460 accctcagac tgacggacaa tctgaaataa cgaacaaagc tgtggagcaa tacattcgac 5520 attttgtgaa ttatatgcag gacaactggg tggcgttgct tccaacagcc gaattttctt 5580 ataataacaa ctttcatgtc tctataggaa tgtcaccgtt ttgagctaac tacggatttg 5640 atgcaaactt cgcaggtaca gcgtcagttg agcagtgctt acccactgtg gaagaaaggt 5700 tcgagcaaat caaagaagtc caggaggagt tgaaatgcgc aatggaggaa gcacaagaaa 5760 caatgaaact tcagttcgac aaaaaggtga attcaacacc gaagtggaaa gtaaacgaaa 5820 aagtatggct gaacagtaag cacatatcaa caacacgccc aacagccaaa ttttctcacc 5880 gatggatagg acctttccct atagaaagcc aagtgtctac taacgcctat agattgaaac 5940 tgcctgagtc aatgaagaag attcatccag tgtttcatgt ttgcctattg aggaaagcca 6000 acaaaagtca aattgaaaat caactacaat caccacaacc accggtaata attcaaaatg 6060 aggaagaata cgaagtgcaa gaagtgttag acaaaaggag aagagggagg aaagtggaat 6120 atttaattaa ttggaaaggt tacacaccgg aacatgatac atgggaaccc aaatcaggac 6180 tgaagaatgc gcaagaactg attgaggaat ttaacagcaa atacccatca agagaattga 6240 aacacaagag gacacggaga gtaaagtgag gacgaggctt tttccctaag gggtttttta 6300 atgccagtcc agggaagatg tcagacccag caagaggggg ttgggacata aagggggagt 6360 gg 6362 // ID Copia-2_MLP-LTR repbase; DNA; FNG; 311 BP. XX AC AECX01002036; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_MLP_; KW Copia-2_MLP-I; Copia-2_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-311 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002036; Positions 7716 7406. 
XX SQ Sequence 311 BP; 70 A; 59 C; 47 G; 135 T; 0 other; tgttagttgt tgactattat caaatagttc tggatatgtt ttgatgtttt acgtgtgtat 60 gatcatcctg cgtgatgaat cctttctcta tgcttatgtc atcctgcgtg atgaatttga 120 tatgtttctc tttatttgat aaatctccaa ctgtgtgcgc ataactctct catctatctg 180 ttttcaaatt gtcttttcct ataaagccta ctatgagttg caccaagctt tgatcttcat 240 ctcattcaaa gcttttctct tttattcaca aactaactta ttgtcgtgtt tgtgaatctg 300 tgcattactc a 311 // ID Coprina_Pc2 repbase; DNA; FNG; 4143 BP. XX AC . XX DT 29-APR-2007 (Rel. 12.04, Created) DT 18-MAY-2007 (Rel. 12.04, Last updated, Version 1) XX DE Coprina_Pc2 is a Penelope-like retroelement. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Penelope-like elements; reverse transcriptase; KW Coprina_Pc2. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-4143 RA Arkhipova I.R.; RT "Distribution and phylogeny of Penelope-like elements in RT eukaryotes."; RL Syst. Biol. 55(6), 875-885 (2006). XX RN [2] RP 1-4143 RA Gladyshev E.A. and Arkhipova I.R.; RT "Telomere-associated endonuclease-deficient Penelope-like RT retroelements in diverse eukaryotes."; RL Proc Natl Acad Sci U S A 104(22), 9352-9357 (2007). XX DR [2] (Consensus) XX CC Coprina_Pc2 is a Penelope-like retroelement from the white rot CC fungus, Phanerochaete chrysosporium. Its single ORF contains CC homology to reverse transcriptases. No associated endonuclease CC has been found. Most copies are associated with telomeres and CC are 5' truncated by addition of reverse-complement P. CC chrysosporium telomeric repeats, (TAAACCC)n. XX FH Key Location/Qualifiers FT CDS 27..3662 FT /product="Coprina_Pc2_1p" FT /note="reverse transcriptase."
FT /translation="MSSRATSLPPAFAGLGLGESRRGPMASLDAVLSRAGS FT PVPEGRAAALHTSGADSLADARGSSALYRVLENCTSLNEILAVFPPAYRDC FT VRGTLSDLNRWARVYAANWLAQQRAARHAAAGTFPSALNSIKVPTFQLSQE FT WSKEDLARGLNTSAAQAVQKAKQACFDAYKFALEQEEHWLHEKLDHTRRVR FT EMVAAINVAWENHVKPFALIPKDDAYLDPDSMLVEDGSHPDTRDKWIAPAS FT AKGEHNRAIRLCQLAFGRVVQIRLAEVREEERKRLAKQEVKAAADVALGAP FT VNQQTVQRLVAAQVKAALKARGSSSGSRPRGVPGGKPSIKGKAKTAPAPPR FT KSKSLFSAGVRSPLTFSKRRNIRGSGQGTTSLRLGRKDLQPFQFQAPTPTS FT GSRRPATPGQKQRQGAVERVLRDPRWRYEHPSSYPDELLTLPLRTQIQILH FT TRVPLELVEAARFRAGVHVQRGVEVPWEIACTLSSGWKYLLPVTWNFSGVF FT QGWEQFEQSVRWQAHFATNPPPVRFYDHDYDTGRRSERAPPPAPIHIEAGL FT AAGRAFLNRLSGEAEQVARRKRREELANLVPVKSALRYLKEHSLVLLGTDK FT NLGVAVCQAKWYKEQCQRYIDSDKAIIEIDRETAEELVRRLSRKIGDVIAS FT PKFTDSERRYYAQLFDWLGSKRPEAHGNRVYIPKFHGIPKIHKDPWKIRPI FT VPGFNVVAAPAAKFISKMLKKYIAAAPYVLTSSKTMAQELLQLPKFGTEPY FT WLLTGDIVAFYPNVPLVDASVVCQQYVFNDPDVALPLRVLFGELIGLVNWG FT MVYEFDGRYYEQVDGLVMGLACSPDIANLYAAHHESPLVPQVEGLVYYRRY FT IDDAFAIVAAPSKEEALRRMRIVQFGKCTIEWECSGTSTAFLDMHVMIDKF FT SRKVEFQPFRKSMNHHERLPWASAHPFDVKKGTFVSEMSRLAALSSKVSYY FT QDALAGLKSLYLARGYPLAVVSSWLRSNGTRMWLNRYTSEAREARGRTAVW FT VVKTSFNPALEALRVNELWETVKEQWRNVPPGTRTAFDISKTDFLDRRMLI FT SRKRTTNFGDLIRRVCREVLQHDEMTDEHDVVDPELVDQLKRLARTARGVM FT KRISVMAVYPLPSASVSLRGYKPLLDSAHGLSGGTLLTDFASRHRHGYLLG FT ELTDTLHAGLLVLAFVFTFISVIRRFGSLGREPPDFRLGIMCVRSVVLQGT FT P*" XX SQ Sequence 4143 BP; 897 A; 1243 C; 1139 G; 863 T; 1 other; taaaccctaa caaaggagta agcaccatgt cctctcgcgc cacttcgctg ccccccgcct 60 tcgcaggact cggcctcggc gaatctcgtc gtgggcccat ggcgtctctc gacgccgttc 120 tctcacgagc tggcagcccc gtccccgaag gacgtgcggc agccctccac acatcgggcg 180 cggattccct cgccgatgcg cgcggttcat ccgcgctgta tcgagtgctc gaaaactgca 240 cgagtctcaa cgagattcta gcagttttcc cgccagcgta cagggactgc gtccggggga 300 ccctcagcga cctcaatcgc tgggcgcgtg tttacgccgc caactggctg gcgcaacaac 360 gtgcggcgcg gcatgccgcc gccggcacgt tcccctcggc ccttaactcg attaaggtgc 420 ccaccttcca gctctctcag gagtggtcga aggaagacct cgcccggggg ctcaatacct 480 ccgccgcgca ggccgtccaa aaggccaaac aggcctgctt 
cgacgcgtac aaattcgcgt 540 tggagcagga ggaacactgg ctccacgaga agctggacca cacccgtcgg gtgcgcgaga 600 tggtcgcggc catcaacgtc gcgtgggaaa accacgtgaa gccgtttgcc ttgataccca 660 aggacgacgc gtacttggat cccgactcca tgctcgtcga agacggctcg caccccgaca 720 cgcgcgacaa atggattgcc cctgcgtcag cgaaggggga acacaaccgc gcaattcgcc 780 tctgccaact cgccttcggg cgggttgtcc aaatccgcct ggcagaggtt cgcgaggaag 840 aacggaagcg gctcgcgaag caggaagtaa aagccgcggc tgacgttgcc ttgggcgcac 900 ccgttaacca gcaaacggtg cagcgcctcg tcgccgctca agtcaaggca gcgctgaaag 960 cgcgcgggtc gagctccggc tcacgtcctc gaggcgtacc cggaggcaaa ccgtccatca 1020 agggcaaggc caagaccgcc cccgccccgc cgcgcaaaag taagtccctg ttcagcgcag 1080 gtgtccggtc accactcacg ttttccaaac gtagaaacat ccgggggtca gggcaaggga 1140 ccacgtccct tcggctgggt cgaaaagacc tccagccctt ccaattccaa gcgccgacgc 1200 caacgtcggg aagccgccgc ccagcgacgc cagggcaaaa gcaaaggcaa ggcgcagtag 1260 agcgcgtcct tcgggaccct cggtggcggt acgaacaccc atcatcgtac ccggacgaat 1320 tgctgacgtt acctctccga acacagatcc agattctgca cacgcgtgtg ccgcttgagc 1380 tcgttgaggc cgcgagattc cgcgctgggg tgcacgttca gagaggggtg gaagtgccgt 1440 gggaaattgc atgtacacta tcgtcaggct ggaaatattt attacctgta acttggaatt 1500 tttctggagt gttccaaggg tgggagcagt tcgaacaatc tgtgcgttgg caggcgcact 1560 ttgctacgaa tcccccaccg gtacggttct acgaccacga ctatgacaca gggcgacgtt 1620 cggagcgtgc tccaccgccc gcacccattc atattgaggc aggcttagct gcaggacggg 1680 cctttttaaa ccgcttgagc ggtgaggccg aacaagtagc acgccgtaaa cgtcgcgagg 1740 aactagccaa cctcgtgccg gtgaagtctg cactccgtta cctgaaagag cattccctcg 1800 tgctcctcgg aacggacaag aatctcgggg ttgctgtctg tcaggctaag tggtacaaag 1860 aacaatgcca gcggtatatc gactcggaca aggctatcat tgagatagac cgcgaaaccg 1920 ccgaagaact tgtgcgtcga ctctctcgga agatcgggga cgtaattgca tcaccgaaat 1980 tcacggactc ggagagacgg tactatgcac agctgttcga ttggctcggc tctaaacgac 2040 ccgaggcgca cggcaatagg gtatacatcc cgaaattcca tggtattcct aaaattcata 2100 aggatccgtg gaaaatacga cccatcgtgc ccgggtttaa tgtagtcgct gcccctgcag 2160 caaagtttat tagcaaaatg cttaagaagt acattgcggc cgccccatac 
gtcttaacct 2220 cctcgaagac gatggcgcaa gaacttcttc aactgccaaa gttcgggact gagccctact 2280 ggttgttaac tggtgatatt gtggcctttt accccaacgt cccgctcgtc gatgcgagcg 2340 tcgtttgtca acaatacgtt ttcaacgacc cggacgttgc cttgccattg cgggttctct 2400 tcggggaact tatcggcttg gtgaactggg gaatggttta cgaatttgac ggacgctact 2460 atgagcaggt agacggcttg gtaatgggtt tggcgtgtag ccctgatatt gccaacctat 2520 acgctgccca ccacgagtcc ccactcgtgc ctcaagttga gggtctagtg tactatcgac 2580 ggtacataga tgatgcgttc gccatagtcg ctgcgccttc aaaggaggag gctttgagac 2640 gcatgcgcat cgtgcaattt ggcaaatgca cgatagagtg ggagtgttcc ggracgtcga 2700 ccgcgttcct cgacatgcat gtcatgatcg ataagtttag caggaaggtc gagtttcagc 2760 ccttccgtaa gtctatgaac caccacgaac gccttccttg ggcatccgca caccccttcg 2820 acgtcaagaa gggcactttc gtaagcgaga tgtcgaggct agccgccctt tcgtccaagg 2880 tgtcctatta ccaggatgcg ctcgcaggcc ttaaatccct gtacctggcg cgcggctacc 2940 ctttggctgt tgtttcgagt tggcttcgct ccaacggcac ccgcatgtgg cttaatcgct 3000 acacgtctga agcgcgagag gcgcgcggga gaacggcggt gtgggtggtg aagacctcgt 3060 tcaatccagc cctggaggcg ttgagagtca acgagttgtg ggaaacggtt aaggaacagt 3120 ggcgcaacgt acctccgggt acgcgcactg ctttcgacat ttcgaagacg gacttcctcg 3180 accgtcgcat gctcatttca cggaaacgga ctaccaactt tggtgatctg atccgccgag 3240 tgtgccgcga agtactgcaa cacgacgaga tgactgatga gcacgatgtc gtcgaccctg 3300 agctggtgga ccagctgaag cgacttgcga gaactgctcg aggagttatg aaacggattt 3360 cggtgatggc cgtgtaccct ctcccgtctg catctgtttc tttacgcggg tataaaccgc 3420 tcctggactc ggcgcacggg ctgtccgggg ggactcttct tacggatttt gctagccggc 3480 atcggcatgg ataccttctc ggtgagttaa cggatacttt gcatgcggga cttttggttc 3540 tggctttcgt gttcaccttt atttcggtca tccgtcggtt tggctctctg ggccgtgaac 3600 caccggattt tcgacttggg ataatgtgtg tcaggagtgt tgtccttcaa gggaccccct 3660 aacccgaaag cctaaacccc aagccctaaa ccccaaacct aaaccctaac cctaaaccct 3720 aaccctaagt ccagggcggg taggttgggc gctaagtttg gacgcgctaa gtatcctaag 3780 cgcccgagcc acgtgccggc gctcagtttc agccacgtgg tggtcacgtg atcacggcgc 3840 ctatcactcg gccgatcgga tccttcggga ccgaacccgg accttgtgac caggcttcga 
3900 tcatgatcat caaactctcc ctcgcaactg tacagctgta cagtagctga tagagtttga 3960 gccctaggcg tatataacta gggactctga gtgctgatga ggcatcgccg aaaggttcag 4020 cgcggccccc caaaggcccc accttcactt tcgtcccttg cctccctcca cctctttcac 4080 ccttactaaa ccctaaaccc taaaccccta aaccctaaac cctaaaccct aaaccctaaa 4140 ccc 4143 // ID Copia-12_MLP-LTR repbase; DNA; FNG; 247 BP. XX AC AECX01002368; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-12_MLP_; KW Copia-12_MLP-I; Copia-12_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-247 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002368; Positions 13318 13072. XX SQ Sequence 247 BP; 57 A; 51 C; 44 G; 95 T; 0 other; tgttagacat taacttacgt gctagctact gtactagtgt tatgtgctgg atgcctatct 60 aaaccttgtg aaagccatgt gacattcctg agtatatcta gtaaagcttg tagtggtgcg 120 gttctattct atttcctcac ctttcctatt tgcttgtact gatcctttaa ctcaggttag 180 ttatcaaagt aatgaattac ctttcctatt tgcttgtact gatcctttga ctcagcactg 240 tgcatca 247 // ID hAT-N1_AN repbase; DNA; FNG; 426 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 09-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Nonautonomous DNA transposon. Putative classification: hAT DE superfamily. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; hAT superfamily; hAT-N1_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-426 RA Kapitonov V.V. 
and Jurka J.; RT "hAT-N1_AN, a family of nonautonomous DNA transposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(12), 218-218 (2003). XX DR [1] (Consensus) XX CC Nonautonomous DNA transposon. Putative classification: hAT CC superfamily. CC 15-bp TIRs, 8-bp TSDs. Despite their young age, many elements are CC not flanked by the TSDs. XX SQ Sequence 426 BP; 127 A; 89 C; 94 G; 116 T; 0 other; tagacttgtt aaaccacggg ttggggcggg ttttcaggcc tagctgatcc gcccacgcgg 60 gttttggggt gggttacctt cacagtaaac cgcccatggg tttagcaaat aattctaacc 120 caacctaaat aacccaaaat aacccagtta tgcatatcat tactctaata agcagtgatc 180 tacatagtta ataaaatact gtatttaaat actgtattat aaactatcta agtaagaaaa 240 tataatctaa atacagtaat atacctattc agatatcttg gcaacccagc gggttgctcc 300 gccgggcttt ggggcagcca aaaatatcca aaacccaatg gataattaga aggtctaacc 360 caacccattt cttggcgggt cggggcgggt tggggcgggt ttcgtgggtt gggtttaaca 420 agtcta 426 // ID Copia-4_MVPL-I repbase; DNA; FNG; 4164 BP. XX AC AEIJ01000810; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Microbotryum violaceum genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_MVPL_; KW Copia-4_MVPL-LTR; Copia-4_MVPL-I. XX OS Microbotryum violaceum OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Microbotryales; Microbotryaceae; OC Microbotryum. XX RN [1] RP 1-4164 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Microbotryum violaceum genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AEIJ01000810; Positions 5967 10130. XX CC Positions [1545-1880] - Integrase core CC 'AGAAG' target site duplication CC LTRs are 99% similar to each other.
XX FH Key Location/Qualifiers FT CDS 5..955 FT /product="Copia-4_MVPL-I_1p" FT /translation="MSSTTTTNVSNEGDVKITKLGKDNYELWSIRVDAFLE FT GRGYSSVLQHGADRPSSMDVQAWKTLHGIDPSLEEGKVPTLITKWDRSARA FT IIISCLDDSNLKMVKSKSLSAKNIWEKLKQHHEGNADPFKIRTLLYEISNA FT LYDEDTNLGEFLQGITDKVDQLEDLGQKFKDEAIVAFMLQALPPSYEMLKQ FT AIKLSDKCSVDYAVNRLTDEYSERKLTGKYNKLLKNAGALAVREGTQVKRD FT PNCICRGCKGTGHYQIDPECPKYDPNAKRGRNKNHSNKKKKQHKHSKGSAD FT SEESASFALVFKSRGEPFASHQRRE" FT CDS join(1257..1880,1884..3095) FT /product="Copia-4_MVPL-I_2p" FT /translation="MTIKIKGLTVAKTLNGTGYTLDFTRIVEDQAHVATRA FT TVGAPLMEWHRRYGHIAVSSLKEIVKSGAVDGLVLTDEKVHDCEPCISSKC FT RVSKFSESVTHVTDVLQRVFMDIGLVAEDQVDFQGRKAYLAIVDQYSTAKW FT TFPLKTKSAEEVTRVWIAFRQGVEKMTGRKIKRVRTDNGNEFTNKLIGTDS FT QSQGILHETSAPYTPQQNTVEPFNGSLMAIVRAVLAASKLSWKYWSYAMEY FT ATFVANRVLHSKLDGKTAYEVFYGKKPKVSHFRPFGSTVFAQVPKSKRSKL FT EPTSVRGTFIGYDNEYNCRVLFDSDSEHKVVITRDIAVLNLQPEEVRAPEV FT TTLLEGPDEDVGPVLEGNDAGDELQDEVAVEQIAPARPNRPRWEYRDPAVR FT GRNPGRFEEIDAGTEIHQRTRGQRCIAEQQGLFVTDIPELDPEPPIMSGYV FT MISTDRPAVPSSYKEAMNNVEADKWKEAIQDELNAMDRHQVLADSDLPHGA FT RALGSKWVFARKENAQGEVIRYKARLVAQGFAQRSGIDYDETSAPVARSTT FT ILFLIAIAASQGLCLEQFDYDSAFLNGTMTEMVYMKYPKGWDRPQLGQVLR FT LVKSMYGTKQAPRE" XX SQ Sequence 4164 BP; 1045 A; 1095 C; 1117 G; 907 T; 0 other; ggttatgagt tcgacgacta ccaccaatgt atcgaatgaa ggggacgtca agatcaccaa 60 gctcggtaaa gataactacg agctgtggtc gatccgagtc gacgcgttcc tcgaaggtcg 120 tggctacagc agtgtacttc aacacggagc cgaccgacca tcgagtatgg atgtgcaagc 180 atggaagact ctccatggca tcgacccaag tctagaggag ggcaaggtcc cgactctaat 240 caccaagtgg gatcgtagtg cgcgcgcgat catcatctcg tgcctcgacg attcgaatct 300 caagatggtc aagagcaagt cgctctcagc caaaaacatc tgggagaagt tgaaacaaca 360 ccacgaaggg aacgccgatc cgttcaagat tcgcacgcta ctctacgaga tttcgaatgc 420 gttgtacgat gaggacacca acctcggcga gtttctccag ggcatcacgg acaaggtcga 480 tcaactcgaa gatctcggcc aaaagttcaa ggatgaagcc atcgtcgcgt ttatgcttca 540 agctctacct ccctcatacg agatgctcaa gcaagctata aagctcagcg ataagtgctc 600 ggttgactac gccgtcaatc gacttaccga cgagtacagt gaacgcaagc 
tcactggcaa 660 gtacaacaaa ttactcaaga acgcgggagc gttagcggtt cgcgaaggca cgcaagtcaa 720 gcgtgatccg aactgcattt gtcgtggttg caagggaacc ggtcattacc aaatcgatcc 780 ggaatgtccc aagtacgatc cgaacgccaa acgcggtcgt aacaagaacc acagcaacaa 840 gaagaagaag caacataagc attccaaggg atctgcggat tcggaggagt cggcttcttt 900 tgcgttggtg ttcaagtctc gaggagagcc attcgcttca catcaacgtc gcgaataggg 960 tgtcacagaa cgaccagagt gagatctgga tcctggacac cggtgcgagt cagcactacg 1020 tcggcaacgc aagtctgctt accgaccgtc aacatggaag tctgaacgtt caaaccgcaa 1080 gcggtcaggt agtacattgc gacacgtatg gtacggtacg gttcaagttg agaagcgggg 1140 cctcgctctc actttcgaac gtctaccatc tcccgggcgc tccggtgaac ctcgtcacga 1200 ctcgtgctct ataccgatca gaagctagtt gcggatggga agatggcagg gatgtcatga 1260 cgatcaagat caaagggctt accgtagcaa agacgctgaa cggtaccggt tacaccctcg 1320 actttacgcg cattgtcgag gaccaagctc atgttgcaac tcgtgcaaca gtgggagcgc 1380 ctttgatgga atggcatcgt cgctacggcc acatcgccgt gagctcgctc aaggagatcg 1440 tcaagtcagg cgctgtagat ggactagttc tcaccgacga gaaggtccac gactgcgagc 1500 cctgcatcag ttccaagtgt cgcgtatcga agttctccga gtcggttact catgtcaccg 1560 acgtccttca acgcgtattc atggacatcg gactcgtcgc agaagatcaa gtcgattttc 1620 aaggtcgaaa ggcgtacctg gctattgtag accagtactc gactgcgaag tggacatttc 1680 cgttgaagac caagtcggcc gaagaagtca ctcgcgtatg gatcgcattt cgacaaggtg 1740 tcgagaagat gactggtcgc aaaatcaagc gcgttcgcac ggacaatggc aacgagttca 1800 ccaacaagct catcggcaca gatagccaaa gtcaaggaat tctacatgag acctcagcac 1860 cgtacacccc acaacagaat tgaacggtcg aacccttcaa tggctcgttg atggcgatcg 1920 ttcgtgcggt ccttgcagcc tccaagctgt cctggaagta ctggtcttat gcgatggagt 1980 acgccacgtt cgtcgcgaat cgggtcctac actccaagtt ggatggcaag acggcttacg 2040 aagttttcta cggaaagaag cccaaggtct cgcactttcg tccgtttgga tcgaccgtct 2100 tcgcgcaagt ccccaagtcg aaacgctcta agctcgagcc gacttcagtc aggggaacgt 2160 tcatcgggta tgacaacgag tacaactgtc gcgtgctttt cgactccgat tcggaacaca 2220 aggtggtcat cacacgcgac attgcggtct tgaacctaca accagaggag gtccgtgcac 2280 cggaggtcac gactttgctc gaggggccag atgaagatgt gggtcccgtc ttggaaggca 2340 
acgacgccgg agacgaactt caggatgagg ttgctgtaga gcagatagcc cctgctcgtc 2400 cgaaccgtcc tcgttgggaa tacagagatc ccgcggttag gggtcgaaac ccaggcaggt 2460 ttgaggagat cgacgcaggc accgagattc atcagcgtac tcgtggccaa agatgcatcg 2520 cggaacaaca aggtctcttt gtcacagata tacccgaact tgatcccgaa ccacccatca 2580 tgtctggcta tgtgatgatc agcacggatc gaccggccgt acccagcagc tacaaagaag 2640 cgatgaataa tgtcgaagcg gacaagtgga aggaagctat ccaggacgaa ctcaatgcga 2700 tggatcgtca tcaagtcctt gcagactcgg atcttcccca cggtgctcgc gctctcgggt 2760 cgaagtgggt ttttgctcgt aaggagaacg ctcaaggaga agtcattcgt tacaaggctc 2820 gattggtcgc gcaaggtttt gcacagcgat ctggcattga ctacgatgag acttccgctc 2880 ctgttgctcg ctctacgacg attctctttc tcatcgccat cgctgcatcc caaggactgt 2940 gcctcgaaca attcgattac gactcagcgt ttctaaacgg aacgatgacg gagatggtct 3000 acatgaagta tcccaagggt tgggatcgtc ctcagcttgg ccaagtcctt cgtcttgtca 3060 aatcaatgta tggaacgaag caagctcctc gagaatgaaa ctcggcagtc aacaaactca 3120 tggtcgcttg tggttacttg caatctgacg ccgattcatg tctttacgtc aagcgtgtcg 3180 aggagacgtt catctacatt actctttacg tcgatgatgg actcgctgcc tcgaacgatc 3240 agacgtttct gaattcggag atctaagctt tcaacaaggt gtaccaactc aagcgacttg 3300 gtcctgtgaa ggtgtttctc ggtctcgaat tcctgcgctc gtccaagttc atccagtcca 3360 agtacatcag gagccttgtc gctacgtatg gcggagatca cggatcgaag cacccagcga 3420 aggtcccgat gaagcccagg ttgaacatgg aacactcggc ggagctgttc gacgacattg 3480 cactctacca gtcagcagtg ggagctttgc agtatgctgc gcatcgtgct cgtcccgaca 3540 tcgtcacttc ggttcgtgca gcagcttcca aggtttcagc tcctacccag gccgactgga 3600 tcgcggtgaa gaggatcatc cgctacttgc aaggtaccat tgactgggga ctcaagtaca 3660 acctcgaggg ttcgactgtg ttcaaactct actcagatgc ctcgtgggga gacgacatgt 3720 tgactggcaa gtcgatagga gcgtttgtct cgatcatggc aggcgcagca atttcttggc 3780 agagtaagca gcaatcgatg gttgcgactt cgactactga ggccgagatc ttggctgctt 3840 ccgcgacagc aaaggaggcc atgtggcttc gacgcctggc tgcggatctc aagattcaac 3900 agcctggatc tacacttctc tgggaagaca atcaggcggt gatcgcgatc gcacagaacc 3960 cggctcatca tggtaggacg aagcactata gtgtgcatca cttctacatt cgcgagcgag 4020 ttacgactgg 
cgacatcgaa tcaagtattg caagacagga gccatgactg cggatcttct 4080 caccaagccg ctcgctcgca acttgttcga acttcatcgt gatggattgg ggatggtatc 4140 ccttggagcc ttgaccagtg ggag 4164 // ID MuDR-1_FO repbase; DNA; FNG; 2928 BP. XX AC . XX DT 02-JUL-2010 (Rel. 15.11, Created) DT 02-JUL-2010 (Rel. 15.11, Last updated, Version 3) XX DE Mutator-like transposon, partial consensus. XX KW MuDR; DNA transposon; Transposable Element; MuDR-1_FO. XX OS Fusarium oxysporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; OC mitosporic Hypocreales; Fusarium; OC Fusarium oxysporum species complex. XX RN [1] RP 1-2928 RA Jurka J.; RT "DNA transposons from Fusarium oxysporum."; RL Repbase Reports 10(11), 1846-1846 (2010). XX DR [1] (Consensus) XX CC Copies ~100% identical. XX FH Key Location/Qualifiers FT CDS 164..2707 FT /product="MuDR-1_FO_1p" FT /translation="MNPVTATRQFSDDCLPPEREYGSREALHAAINAWAAP FT RGYAFVTGKSKKTESGRRIVFFSCDRGGAPPKASGVRQRSTTTRRTGCQFS FT VLAKEALDKTTWRLTHRPGSEFAHHNHEPSTSMSAHPVHRQLSNADRSTIN FT NLANAGVAPKEIRSYLRQNTESHATQQDIYNCIAQGKRDQKKGQSTIQALA FT NELEAEGFWSRIRFDEDGRVTAVLFAHPESLTYLKSYPDILILDCTYKTNK FT YRMPLLDIVGVDACQRSFCIAFAFLSGEEEKDYIWALDRLRSMYEACSARL FT PSVILTDRCLACMNAVSHCFPAAVSLLCLWHANKAVLRYCQPSFMRHNDAA FT QKPQGHQEWKDFYGRWHELVASANEETFEDRLQQLKERYASAHAREVAYII FT ETWLDLYKTKLVKAWVDRYLHFENVVTSRGEGIHQLIKVYLDTSQLDLFEA FT WRAIKLAILNQVAELRANQAKQQIRTPIELSGSLYSIIRGWVSHEALRKVE FT AQRKRLQQDRLPACTGVFSATLGLPCAHTIEPLLQQAQPLQLHHFHTHWHL FT QREGNPQLLIEPRRQFDQVPATSTLPKTSTQREPCAFEIVEQASQTRANPK FT CSRCHTQGHRMSSKACPLRYAHLVSPSASTAPPNTLPPSTTPSSAQQPVPD FT PSLCSILPQQQVAPPAALCDQEMVPSPPDIIALPSSSADVPPPPDMPPSVP FT ASGPDIRYDDPRAIYDRYIAARMAWYNKLPRGSLKTNQEYRKAMGLPQRYD FT KTSYSWCLDYKQMGKRCTSTIPGREWTKEEMMAYLDWTRVEDERVERQVAQ FT EMGDNPLANRRTGMKEIWKRVEQDMIDQQALYSNDKLAEDCIIVTA" XX SQ Sequence 2928 BP; 787 A; 783 C; 725 G; 633 T; 0 other; caataggtgc gcacctggat gcgcacgtaa gtgtcacgtg ttgcgcaccc ggtgcgtctg 60 taagggagtt atgtaagccc aacacccttg tagctacccc tttcccgttg 
ataaagagcc 120 tcacttcgcc gcgcaacgat ctactcaaat catctctcct agaatgaatc cagttacagc 180 tactcgtcaa ttttctgatg actgcctgcc tcctgaacgc gagtatggct ctcgggaagc 240 cctacacgcc gcgatcaatg catgggcagc tcccaggggc tatgcgtttg tcactgggaa 300 gtcgaagaaa acagagagcg gcagacgaat tgttttcttc agctgtgacc gtggaggagc 360 acctccaaag gcctcgggcg tacggcaacg gtcaactaca acacggcgta cagggtgtca 420 attctcagtc cttgcgaagg aggccttgga taagacaacg tggcgcctta cacaccgtcc 480 tggcagcgag tttgcccatc acaaccacga gccaagcaca agcatgtctg cacatccagt 540 ccatcgtcaa ctatccaatg cagataggtc aactattaac aaccttgcaa acgctggtgt 600 agcaccgaaa gagatcaggt cctacctacg tcagaacaca gagtctcatg ccacccagca 660 agacatctac aactgcattg cacaaggcaa acgagaccag aagaagggcc agagcacaat 720 ccaagctctt gctaatgagc ttgaggctga gggtttctgg agtcgaatac gcttcgacga 780 ggatggtcgg gttacagctg tgttgtttgc ccacccagag tcgctaacat accttaagtc 840 atacccggat atacttatat tggactgcac atataagaca aacaagtata ggatgcctct 900 tctcgatatc gtcggtgttg atgcctgtca acgatcattc tgcatcgcct tcgccttcct 960 cagcggcgag gaggagaagg actacatctg ggcgttagat cggctacgtt caatgtacga 1020 agcctgtagc gcaaggctgc catctgtgat ccttacagac cgctgtctgg cctgcatgaa 1080 tgcggtatct cattgtttcc cggctgcagt atcgcttcta tgcctttggc atgccaacaa 1140 ggcagtcctg cgttactgcc agccaagctt tatgcgtcat aacgatgcgg ctcaaaagcc 1200 tcaaggccac caagagtgga aggacttcta cggaagatgg catgagctcg tggcatcggc 1260 aaatgaggag acatttgaag acaggcttca gcagctcaaa gagcgctatg cttcagctca 1320 cgcccgggag gtcgcctaca tcatcgaaac atggcttgac ctctacaaaa caaagctcgt 1380 gaaagcttgg gttgatcggt atcttcactt tgagaatgtg gttacatctc gaggcgaggg 1440 tattcatcag ctcatcaagg tgtaccttga tacttcccag ctagatctct ttgaggcctg 1500 gagggctatc aagctggcga tacttaacca agtagctgag ctccgagcaa accaggcgaa 1560 gcagcaaatc cgaacaccca tagaactctc agggagtcta tacagcatca tacgaggctg 1620 ggtatctcac gaggcattgc ggaaggttga ggcgcagaga aagcggctcc aacaggacag 1680 actccctgcc tgcacaggag tcttctcagc aacgcttggg ctcccttgcg ctcacacaat 1740 tgagcctctt ctacaacaag cccagccgct tcaactacac cacttccata cgcattggca 1800 tcttcaacga 
gaaggaaacc ctcaattgct catagagccc cgccgtcagt tcgatcaggt 1860 gccagctacg tcaacattgc cgaagactag cactcagcgc gagccatgtg catttgagat 1920 cgtcgaacaa gcatcacaga cgagagccaa ccccaaatgt tcaagatgtc atacacaggg 1980 ccacaggatg agctcaaagg catgcccatt gcgatatgcg catcttgtat ctccctcagc 2040 ttctacagcg cccccaaata cgctgccgcc atcgactaca ccaagttcgg ctcaacagcc 2100 agttcccgat ccatccttat gctcaatttt gccacagcag caggtagcgc cgccagcggc 2160 actatgtgat caagagatgg tgccgtcccc gccagatata atagcactgc catcctcatc 2220 agcggatgta cctccaccac cagatatgcc gccatcggta ccagcgtctg ggccagacat 2280 acgctatgat gaccctcgtg ctatatatga tcgatatatt gcagcacgga tggcatggta 2340 taacaagctg cctcgaggta gcttaaagac taaccaggaa tatcgaaagg caatgggcct 2400 ccctcaacgc tacgataaga caagttacag ctggtgtttg gattataagc agatgggtaa 2460 gcgatgcaca tcaacgatac ctggacggga gtggactaag gaggagatga tggcgtactt 2520 ggattggacc agggtagaag atgaacgtgt ggagaggcaa gtcgcccagg agatgggcga 2580 caaccctctg gcgaacaggc gaacaggcat gaaggagatt tggaagaggg ttgagcagga 2640 catgatcgac caacaagctc tgtactcgaa cgataaactg gctgaagact gtatcatagt 2700 tactgcttag attattatta acgcctacat tacttatatc ccaattatgg caccgtgaga 2760 aaaaatctgt atagcagagc agttcgtcat ctagagatta gaagcaatac aataaaatac 2820 aataggctgg ctggcataca agggtgttgg gcttacataa ctcccttaca gacgcaccgg 2880 gtgcgcaaca cgtgacactt acgtgcgcat ccaggtgcgc acctattg 2928 // ID Gypsy-121_MLP-LTR repbase; DNA; FNG; 165 BP. XX AC AECX01000855; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-121_MLP_; KW Gypsy-121_MLP-I; Gypsy-121_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-165 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000855; Positions 3136 3300. XX SQ Sequence 165 BP; 34 A; 51 C; 27 G; 53 T; 0 other; tgtaataagc ctaatagata gcttatgctt atactgtgta gcagcccttc catactacta 60 gctgcttgac cacgttgtac tgctgctctc tcacgttgca atccagttat cagcctatag 120 tcatctttgc ctcctcgtcc tcatctcgcc gcctcggtcc taaca 165 // ID Gypsy-13_MLP-I repbase; DNA; FNG; 5656 BP. XX AC AECX01002076; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_MLP_; KW Gypsy-13_MLP-LTR; Gypsy-13_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5656 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002076; Positions 9560 3905. XX CC Positions [4474-4953] - Integrase core CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 344..1453 FT /product="Gypsy-13_MLP-I_2p" FT /translation="MEPQADPTMIEVLRQLNQLSAQFNKVKASLAVETQKR FT LEAEQRLNQFETSQTTNMSTKGVPGNTNPAPTPVQPVYVNQTVQSQRQPKM FT STPDKFDGSKGAKAEVFMNQLGLYMQLNNHLFANDQAKVAFALSYTSGKAS FT IWGQSLTDQLLDPTLSESVTWNKFIDSFKSTFFDSERVSKAEKEMRMLKQK FT GTVSDYWIKFSELSIVIKWPQDILMSQFEQGLKREISVYMIKEEFVTAEEM FT AKYAIKLDNKINRHSENSHTLAASTSTPTPSVDPDAMDCSAYKLNISNEEY FT KRRGTAKACYKCGKTGHFIADCFVGRRGKTWNSNYRGNSVESKLKARIAEL FT ESQLGDGSSVVESVSKSDLSKNGDARE" FT CDS 1948..5583 FT /product="Gypsy-13_MLP-I_1p" FT /translation="MVSKEPERHARNRDEGVELEISSSTPPQCEYDLIQSP FT TVSEAGHKLYSPVELSDTPNSTQQTSTRPNPNIAVSSSPATPPIGPLSKME FT EARNFEEGVESNKIDSETPPQCEFATVLKPVIESTVCNQKCLLNNRHVPGL FT TTRTHGLWNTTRKLRTPMPTKSYASIAAAKTSWNFSARLAAEKVKDQPEKT FT AAELVPSVYHEFLEMFEKSKSNVLPPHRPYDFRVDLVPGATPQAGRIIPLS FT PKENDALNEMIEKGLSNGTIRRTTSPWAAPVLFTGKKDGNLRPCFDYRRLN FT VVTIKNKYPLPLTMELIDRLLDADQFTSLDMRNGYNNLRVREGDEAKLAFL FT CKAGQFEPLTMPFGPTGAPGFFQYFIHDILKAHIGKDVAAYQDDILIYTKP FT GEDHESKVKEILKILQDQNVWLKPEKCKFSQKEVSYLGLIISCNQIRMDES FT KVKAVKEWPAPQNLSEVQMFLGFANFYRRFISHFSEIAKPLHELSQKSIPF FT NWTDAQREAFERLKEAFTSAPVLKIVDPYRAFIIECDCSDFALGAVLSQVS FT KDDEELHPVAFLSRSLLQAEQNYEIFDKELLAVVTAFKEWRQYLEGNPHQL FT SVIVYTDHKNLQSLMTTKELTRRQARWVEVLGSFDFEIRFRPGRQSTKPDA FT LSRRPDLAPSKEEKLTFGRLLKPENLPVDAFIDELDSPEAWIEDEEVEEFT FT STEIELMAGEDEELIWGDMEILNEIRRYSKTDQRISEIGRMLSEMPKSKHL FT MEYSMIDGLLYKNERVEVPDVASLKAHILRSRHDSKIAGHPGRSRTLALVK FT RSYCWPSMKAYINKYVDGCESCQRVKARTSKPFGSLQPLPIPYGPWTDICY FT DLITDLPESNGHDSILTVVDRLTKMAHFIACQKSMTSDELATLMIQNVWRI FT HGTPKTITSDRGSIFISRITKEMNKRLGITTQASTAYHPQTDGQSEITNKA FT VEQYLRHFTSYMQNDWNSLLPMAEFSYNNNSHVSIGMSPFKANYGYDVAFT FT GLPTEDQHLPIVEDCIKQIQEVQQELRGAMEEAQHEMKIQFNKKVLKSPDW FT EEGEQVWLNSKHISTTRPSAKFGHRWLGPFKIEKKITSNAYKLTLPLSMKT FT VHPVFHVSLLRRYQKSKIPDQRNEEPPPVMIQDKEEFEVAEILDKRRRGNK FT IEYLVSWKGYSTEHDSWEPKEGVQNANELVKEFESKYGERMKTNNRKRRMV FT RG" XX SQ Sequence 5656 BP; 1901 A; 1183 C; 1244 G; 1328 T; 0 other; acgtcgatat caaggtcaac agtctacgtc aaaattcaag tatcaagaaa 
taagaagaaa 60 gttaattaga aagttaagaa gaaaaaagtt taagataatt agatttctag atctgtatcg 120 acatcaaagt ttaaagtagt ttaattccgc aaatccaagc agacagaaag attcaaggtt 180 atacagatta cgtcaagtag ttctatacca gcaagtaaag tcagttcacc accgaagcat 240 attcaacacg ccaagtttca acgtaccaag ttcttcgacc tccgagtcga gtagcaattc 300 aaataatccc tgcgaggtct tccacaaact cgaagagaac gtgatggaac cacaagctga 360 ccccaccatg atcgaagtct taagacaact gaaccagctg tcggcccagt ttaacaaagt 420 taaagctagc ctcgctgtag aaactcaaaa acgcttagaa gccgagcaac gtttgaatca 480 gttcgaaacg tctcagacta ctaatatgtc taccaaaggt gttccaggaa acaccaaccc 540 tgctcccacg cctgttcaac ctgtctacgt aaatcaaacg gttcaatctc agcgacaacc 600 taaaatgtct acccccgaca agtttgatgg tagcaaagga gcaaaggctg aagtgttcat 660 gaatcaatta ggattgtaca tgcagcttaa caaccatctg tttgcgaatg atcaagccaa 720 agtggctttc gctttatcgt acacttctgg taaagccagc atatggggtc aaagcctgac 780 cgatcaactc ctagacccga ctctgtctga atctgtaacc tggaataagt tcatcgactc 840 atttaaatca accttcttcg attccgagcg agtttccaaa gctgagaaag agatgagaat 900 gttgaagcaa aaaggcaccg tatctgacta ttggattaag ttttcagagc tctctatagt 960 aatcaagtgg cctcaagaca tcctcatgtc ccaattcgag cagggtctga aacgagaaat 1020 atctgtgtat atgattaaag aagagtttgt gactgctgaa gagatggcaa agtatgctat 1080 caaacttgac aacaagataa atcgacacag tgaaaatagt cacacgctag cagcctctac 1140 ctctactccg acaccgtcag tagaccccga tgcgatggat tgctcggcct acaagttgaa 1200 tatctcaaat gaagagtaca agcgtagagg aaccgctaag gcatgttata aatgtggaaa 1260 gacaggtcac ttcattgcag attgttttgt tggtagacgt ggtaaaacgt ggaattcaaa 1320 ttataggggt aattcagttg aaagtaaatt gaaggctagg atagcggagc ttgaaagtca 1380 gttaggagat ggtagtagtg ttgtagaaag tgtcagcaag tcagatttgt caaaaaatgg 1440 cgatgctcgg gagtgaaagt tttgcctccc ccgagcaatc aatgtatgga aaaattgggt 1500 gatatcagta gcttagaaat gaaagatacc cgaattattg atttcattca cctaacagac 1560 ccatcccatg ccacaacaat agttgcccgt gccctcattg atagcggagc aacacatgaa 1620 gcaataagct gtgctttcgt gaacaaacac tcattgacaa cctccccgct cgaagagcca 1680 agaagagtaa ccggcttcag cgggcatacc tctcaaatca gtgaagtagg agacttcatc 1740 atcaacgaag 
acgattcagc aacaacattc atagtcactg aattgcgcga caaatacgac 1800 ttaatcttgg gaatgccatg gattaagaag aaccatgaac atattgattg gaagagaagt 1860 tgtttaaagc gccagagtgg cagcattgcg gtcattgatt cagttccgtc tggattagtg 1920 atttagtctg gtccgcaaac agcctcgatg gtatcaaagg agcctgaaag gcacgctagg 1980 aaccgtgacg agggggtgga gcttgaaatt agctcatcta cacccccgca atgtgagtac 2040 gatttgattc agagtccaac agtcagtgaa gcaggtcaca agctttattc ccctgtagaa 2100 ttatctgaca caccgaattc aactcaacaa acgagcacaa ggcctaatcc taacattgca 2160 gtatcttcaa gtccggcaac acctccgatt gggcccttga gcaaaatgga ggaagctagg 2220 aactttgaag agggggtgga gtcgaataaa atagactcag aaacaccccc gcaatgtgag 2280 tttgctactg tcttaaaacc cgttattgaa agtacagtgt gcaatcaaaa atgccttcta 2340 aataacagac acgtaccagg attaacgaca cgaactcatg gattatggaa tacgactcga 2400 aaattaagaa caccgatgcc tacgaaatct tatgctagta ttgcagcagc taagacatcg 2460 tggaacttct cagcaagact agcagctgag aaagtgaagg atcaacctga gaagaccgcg 2520 gccgaattag tccccagtgt gtatcacgag ttcctggaga tgttcgagaa gagcaagagt 2580 aatgtattac ctccgcatcg tccttatgac tttcgggtag acctagtacc aggtgcaacc 2640 cctcaagcag gaagaattat ccctctatca ccaaaagaaa acgacgctct gaacgaaatg 2700 attgaaaaag gactgtcaaa tggaacaata cgtcgaacca cctcaccatg ggcagcccca 2760 gtccttttca ctggcaaaaa ggatgggaac ttacgcccgt gcttcgacta ccgtcgactg 2820 aatgtagtaa caataaagaa caagtatcct ctcccattga cgatggaact aatagacagg 2880 cttctagatg cggatcaatt taccagtctt gatatgcgca acggctacaa caacttgcgc 2940 gtaagggaag gagatgaggc caagctagca ttcttgtgca aagcaggcca attcgaaccg 3000 ctcacaatgc cttttggacc aactggagca ccaggcttct ttcaatactt tattcatgac 3060 atactcaagg ctcacatagg aaaagatgta gctgcttacc aagatgatat actgatttat 3120 acaaaaccag gtgaagacca cgaaagtaaa gtgaaagaaa tcctcaaaat tctgcaagat 3180 caaaacgtat ggctcaagcc tgaaaagtgc aagttctctc aaaaagaggt atcttactta 3240 ggattgataa tatcatgtaa tcaaatacgt atggatgagt caaaggtcaa agccgttaaa 3300 gagtggccag ctccacaaaa cctatctgaa gtgcaaatgt tcttgggatt cgctaatttt 3360 tacagaagat ttatatccca tttctctgaa atagcaaagc cgctgcacga attgtctcaa 3420 aaatcaatcc ctttcaattg 
gacagacgct cagcgggaag cattcgagcg ccttaaagaa 3480 gctttcacat ctgctccggt gttgaagatc gtggatccgt acagggcttt cattattgaa 3540 tgcgactgtt ctgactttgc acttggtgca gttttatcac aggtgtcaaa agacgacgaa 3600 gaacttcatc cagtggcctt tttatcacga tccttacttc aagcagaaca aaactatgaa 3660 atttttgaca aggaattgtt ggcagtagtg acggctttca aagaatggcg ccaatatttg 3720 gaaggaaacc cgcatcagct gagtgttatt gtgtacactg accacaagaa cctccaatca 3780 ctcatgacca caaaagagct taccaggaga caggcaaggt gggttgaggt gttaggaagc 3840 ttcgattttg agattcgatt ccgtccggga agacaatcaa ccaagcccga cgcactctcg 3900 cgaagaccag acctagcgcc atcaaaagaa gaaaaactga cctttggacg actgctgaaa 3960 cctgaaaacc taccagttga tgcctttatc gacgagttag actcaccgga ggcatggatc 4020 gaggatgaag aagtggagga attcacaagt actgaaattg aattaatggc tggagaagat 4080 gaagagttaa tatggggtga tatggaaatc ttaaacgaaa tcagaagata ctcaaaaaca 4140 gatcagagaa tatcagaaat tggacgaatg ttaagtgaaa tgccaaaatc gaaacattta 4200 atggagtact cgatgattga tggattgctt tacaaaaatg aaagagtaga agtacccgat 4260 gtcgcaagcc tcaaagctca catcttacga tcaaggcacg acagcaaaat tgccggtcat 4320 ccaggaagat caagaacgct agccttagtc aagagatcct attgctggcc gtcgatgaaa 4380 gcctatatca acaaatatgt agatggctgc gaatcgtgtc aaagggttaa agcaaggact 4440 tctaaacctt ttgggagtct acaaccgctt ccaattcctt acgggccatg gactgacata 4500 tgttatgatc ttattactga ccttccagag tcaaatggac atgatagcat tctgactgta 4560 gtggatagat taacgaagat ggctcatttt atagcctgcc agaaatcgat gacgtcagat 4620 gaattagcaa cattgatgat tcaaaatgtt tggcgaattc atgggactcc aaaaaccatc 4680 acgtcggatc gtggtagcat cttcatttca agaatcacga aggaaatgaa taagagacta 4740 ggcattacca cccaggcgtc gacggcttac cacccgcaga ccgatgggca atctgagata 4800 acaaacaagg ccgtggagca gtatctgcga cacttcacat cgtacatgca gaacgactgg 4860 aattccctac tcccaatggc cgagttttct tataacaaca actctcatgt atcaattgga 4920 atgtcgccgt tcaaggctaa ctatggctat gatgttgcgt ttacaggatt gccgacagaa 4980 gatcaacacc taccaattgt tgaagattgt ataaagcaaa tacaagaagt acaacaagaa 5040 ctaagaggag caatggaaga agcacaacac gaaatgaaaa ttcaattcaa caaaaaggtt 5100 ctgaagtctc ctgattggga agaaggtgaa 
caagtctggt taaacagcaa gcatatatca 5160 acgaccagac catcagctaa gtttggccac agatggctag gccctttcaa aattgagaag 5220 aaaatcacta gtaatgctta caaactaact ttgccacttt cgatgaagac agttcacccg 5280 gtatttcacg tcagtttgct tagacgctat cagaaaagca aaatcccaga tcagcgaaat 5340 gaagaaccac cgccagtaat gattcaggat aaagaagagt ttgaagtagc ggaaatttta 5400 gacaaaagaa gaagaggaaa caaaattgaa taccttgtaa gttggaaggg gtatagcact 5460 gaacacgact catgggagcc taaagaagga gttcaaaatg caaatgaatt agtgaaggaa 5520 tttgaaagta aatatggaga aaggatgaaa actaacaaca ggaaaaggag aatggtgaga 5580 gggtgacgct ttttccccac aggggttttt tgatgcaaac ccgtggaaag atgtttcact 5640 cagcaagagg gagtgg 5656 // ID Gypsy-27_MLP-LTR repbase; DNA; FNG; 441 BP. XX AC AECX01000158; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_MLP_; KW Gypsy-27_MLP-I; Gypsy-27_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-441 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000158; Positions 22182 21742. XX SQ Sequence 441 BP; 117 A; 116 C; 54 G; 154 T; 0 other; tgtagcaggg ctacttattg ttagtctgta ttagaatatc ttgttacacg tttcatatac 60 atatgtctgt attaccactt gtactcgcca cataagccta ctcttaaatg gctcatacaa 120 ccttattgta ataccggcta tagcataacc ggccttctgc ctttatcgcc atctattgta 180 tttctcttat aaggaacact tcgagtcctt ctcttgtatt ccttagtcat ctagttatct 240 cctttctttc atcctcttct ggatagcaat acatcttaat atcaatcctt ctcgataaac 300 cccttataat ccttataagt cccttaatcg acccaattca ttaagatagc caagtcattc 360 acttaaccca taccaggttt cagacttcca ttccccatcc ggaatatcat aacgtaagcc 420 cctaataagg tgcccgttac a 441 // ID Gypsy-2_LBS-I repbase; DNA; FNG; 8611 BP. 
XX AC ABFE01000017; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_LBS_; KW Gypsy-2_LBS-LTR; Gypsy-2_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-8611 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000017; Positions 10323 1713. XX CC Positions [4339-4800] - Reverse transcriptase CC Positions [6268-6747] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 356..1972 FT /product="Gypsy-2_LBS-I_2p" FT /translation="MSSSNEGTKKSTRRTNNSQSAGMTSGVSNTLGFAPPI FT LQEASETNTFPDSVSDNARRIPEDVQGTPSNRQVENTFVDPAEQGEEDTRS FT DSSLSLPDFGELTELGSPYRFNELPVDRELFFERTIEFPSCKPFSNYGDSE FT KGLKKSEKVFSTGGRYFQALKLGEASLQRPIASSGGKTMSRNERKKFFSEE FT LGILDELLTNLYCFKDGNRAGYAIQKIILVSLNQRLRACRERAEEDLIMSG FT NGILSIPRWGLHGKADEFWNTNDFEILGACYRREVEDFLTYLVDHHDFPSA FT KKETASKSRTMSVTLPAESSGESSPTNLATPVITVPIFAPGEADSISRFQR FT PSTVSYTYQTPGNLNSSVFGRPTQNTSSKVFQEIFSADNSKKGQIGRVVTN FT SPPPVTSTEQSHRNPTGGAPGPGDSDGDDSSDERGNKPPRNPKVPRIPRKN FT PFDNTSGGNAASKFPAEPQFDNKLKGDAIPTWDGNPDSLKRWFLKLNSLSK FT RSGLVFEQLGTLVPTQLTGSAETWYYSQSADTRTRLETDWGTL" FT CDS 3019..6858 FT /product="Gypsy-2_LBS-I_1p" FT /translation="MIIDSGSDITLISQKTLDQMLKAPKIRMGQQIKLIQV FT TGKAIITGYVILNLIFKTPEGPVQLNIEAYVIKGMSAEFILGNDFADQYSI FT SVICNEGETTLQFGRSGRSIRVHNSLSTPFVDEDGHAFKVRVRSDITNRVL FT KSKAHRKRQVQRKQMKRRQAEQEVRSRYQVSIPPESSKLIQVCANLDHPTS FT TLLVERSFASNGNPEEVYGSTDTLIDREDSRIFVSNFSRTPITIPAGQVLG FT IGRNPASWLDRKGQFSESQKQDAEKRAHLIQTLLQNQSIPPKSHTQSEHTT FT LTGKCQLQEILDPSRLRYSGEDPLSEPPVEGGPKTAEVPGEVISESTLLKE FT LDISASLSQEQAQEIRRILIKHKEVFGLDGRLGSYAEEVKIPLVPNTKPIS FT 
VPPFHASPASREVIDKQMDSWLSLGVIEPSRSPWGAPVFIAYRNSKPRMVI FT DLRRLNERVIPDEFPLLRQEEILQSLEGSQYLSTLDALAGFTQLSIAPDDR FT EKLAFRCHRGLFQFRRMPFGYRNGPAVFQRVMQKILAPFLWIFALVYIDDI FT VIFSKSFEEHCQHVGTVLAAIESAEITLSPSKCHFGYQSIMLLGQKVSRLG FT LSTHKEKVDAILQLDNPKNVHDLQVFLGMMVYFSLYVPFYAWIAHPLFQLL FT KRENKWTWGKDEQAAYKLCKQVLTEAPVRAHAMLGIPYRIYSDACDFALAA FT ILQQVQPIKIRDLKGTKTYERLEKAYKKGESIPNLITHLDKEGSDVPSNGE FT WDVDFDNTTVHIEHVIAYWSCVLQAAERNYSPTEREALALKEGLIKFQPYL FT EGEKILAITDHTALTWSKTFQNVNRGLLTWGLVFSAFPNMKIVHRAGRVHS FT NVDLISRLRRRQPIQEGPTNMDSAFLTLKPTEDPLKNMFEELGPQFEEKLL FT KVASNFMMTELEVEKELTSIPVTLRVGDTEEIEMSQLTTRNYSILVGMNQD FT EISKWKRAYDKDPHFNSVLKAMREDKDDSIPFPQYHYSDNGLLYFEDSLGN FT TRMCVPKELRNKVMSENHDIISESAQSGYYKTYNRISGTYYWPRMSREIKS FT FVNTCDVCQKTKPRRHAPLGLLQPIPIPSQPFEVVMMDFIPELPTSNGCDN FT ILVIVDKLTKYAIFIPTTTKISEIETARMFFKHVIAKFGIPRQVISDHDTR FT WRGDFWKEVCRLMGMKRSLTTSYHPQVDGQTEVMNQGLEISIRTYIGPERD FT DWSDLLDVLALSYNTSPHTATGFSPAYLLRGYHPITGSTLLMSPRLIRRDD FT LQEIYIEHEALDEKALHMT" XX SQ Sequence 8611 BP; 2586 A; 1999 C; 1934 G; 2092 T; 0 other; ttggtggaca cattgggaac ttttcttagc aaacacgaga cctcttagcc cgtaggacgt 60 tccaattgag tcatcagtta tcggtcacct ctgttcccaa accgtcggaa cacgttcggt 120 tatcggctac gtttacctcc gaaccgcagg aacacattcc ccgattgcaa cgcccaccaa 180 ccaagtcaat ctgcacaaat cggtatcagc aaaacacgtt ttcaatctca agagatcctg 240 acgagacatg ttttcgatcg tcacgcccac agacctcgtc aatcattctc agtaaccatc 300 acatttaccc ttttccatcc gtcttatcca tcgaatttca aatcaagtct tcattatgtc 360 tagtagcaac gaaggcacga agaagtctac tcgcagaaca aacaactcgc aatcagctgg 420 tatgacaagt ggggtaagta acacattggg ttttgcgcca ccaattcttc aagaagcctc 480 agagacaaac actttcccag attcggtaag tgacaatgcg aggcgaatac cagaagacgt 540 tcaaggcaca ccaagcaacc gccaagtcga gaacaccttc gtagaccctg ccgaacaagg 600 agaagaggac acaaggtcag actccagtct gagtctacca gacttcggag agctgacgga 660 gttaggttct ccctacagat tcaacgagtt gccggtcgat cgggagttgt tctttgaacg 720 cacaatcgag tttcctagtt gtaagccgtt ttctaattac ggagattcgg agaaaggact 780 aaagaagtct gagaaagtat tctccaccgg aggacgttat tttcaagcgc taaaactggg 840 
agaagccagt ctacaacgtc ccatagccag ttccggtggg aagactatgt ctagaaacga 900 acgaaagaag tttttctcag aggaactggg tattctcgac gaacttttaa ctaatttgta 960 ttgttttaag gatggaaatc gcgccggtta tgctattcag aagattatac tggttagtct 1020 aaatcaacga ctacgggcct gcagagaacg cgcagaggaa gatttaatta tgtcgggaaa 1080 cggaattctt agtattccgc gatggggatt gcatggtaag gctgatgaat tctggaacac 1140 taacgatttt gaaattttag gagcttgtta tcgacgtgag gttgaggatt ttctgacata 1200 ccttgtcgat catcacgact ttccatccgc gaagaaagaa actgcaagta aatcgaggac 1260 tatgtccgtt actctacctg ctgaatcgtc tggtgaatcg tcacccacta atctggcaac 1320 tccagtaatt acagtgccca tcttcgctcc aggcgaagca gatagcattt cgagattcca 1380 gcgtccatcg acggtgagtt atacttacca aacgcctgga aatcttaact cctcagtatt 1440 tggacgaccc actcaaaata cttcgtcgaa agtgtttcaa gagatattca gcgctgataa 1500 ttcgaagaaa ggtcagatag ggcgagtagt gactaatagt cctccccccg taacctccac 1560 cgaacagtct cacagaaacc ctacaggagg agctccaggt ccaggagatt ctgatggaga 1620 cgacagcagc gatgaaagag gaaataagcc tcctaggaat cccaaggtcc ctcgtattcc 1680 gcgcaaaaat ccgttcgaca acacgtcagg agggaacgct gcaagtaagt ttccagcaga 1740 accgcaattt gacaataaac tgaaggggga cgctatacca acttgggacg ggaatcccga 1800 ttcgctgaag cgatggttcc taaaactgaa cagtttgtcg aaaaggtcag gattagtttt 1860 cgaacagttg ggaacattag taccaactca acttacagga agtgccgaaa cgtggtatta 1920 tagtcagagt gctgatacta gaacacggtt agagactgat tggggaactt tatgagcggc 1980 gatcggtgag tactatatga atcgtacatt cttagacaag cagaaggcac aagctaataa 2040 agcatcatat cgcgacgcag gaaatccgaa ggagttgcct agtgagtatg ttattcacaa 2100 actagaactt cttcaatttg tctataacta tactgacaat gaattgatta atgagattat 2160 ggaaggcgcg ccttcctact ggacaccaat tgtaacggca cacttatatc aaacactgga 2220 ggagttccag ttggccgtca agttccatga agattccctt atgagagccg gtaacgatgt 2280 tctcaacact accaggaatg aattttactc cagagacgca cccccgaaaa gcccattcaa 2340 tccatttcgc aaccaacggg cgaacgttaa tctagtcggg tggactcaag cagtgtctaa 2400 gcctcagttc cctaaggacg actcgaatat ttcaccgcga gggacacctg aagagaaggg 2460 agcacgaccg tgtcgtcact gcggtagcag caaacactgg gatcgcgatt gcaagtacgc 2520 acgaaagggt 
gaaagatccg cacgagtaaa taaggtgacg ttccagcaag atgaaagaga 2580 agctcaagat aagtatgatg acttatatta tgaaacgttt agtgatgacg agagcgttga 2640 agaaagttca gatcggtcgg attttcacaa agcctctcag tgaaagactc gcattcgctg 2700 aggggggagt ttcatagcaa aacctaccac attaaaacta cttgggagga taatctaaag 2760 aatccctatt ctacagctcc tcgtttgtcg tcgagttatg ttgtacgaag aaaaccacca 2820 gaaattaaca ggaagactaa acggaagtta gcacacgaaa tcagtacagc gtcgttcgct 2880 acacacgcgc aggataatag tggtgagctt atagaactcc gaaaacactt gtcacgacca 2940 ttagggtgtt cgtttctagg ggctaaagct acggaagcct gcgtgtcaat agatgatgtg 3000 aacgccaaac ctgtttccat gattattgac tccggatcgg atatcacgtt gatatcacag 3060 aaaaccctcg atcagatgtt gaaggcaccg aagatccgta tgggccaaca gatcaaattg 3120 atccaagtaa ccggtaaggc tatcatcaca ggatacgtca ttctcaatct aatcttcaaa 3180 acgcctgaag ggccggttca actaaacatt gaagcatacg tcatcaaagg aatgtctgca 3240 gaattcattt taggtaatga ttttgctgac cagtactcga tatcagtaat atgcaacgaa 3300 ggagaaacta cactacagtt tggaaggtca ggaagatcga tcagagttca taattctctg 3360 agcacgccct tcgtcgatga ggacggacac gcttttaagg taagagtccg ttcagatatc 3420 acaaatcgag tactaaagtc caaggctcat cggaaaagac aagttcagag gaagcaaatg 3480 aaacgccgac aagccgagca agaagttcgt tcaagatatc aggttagcat accgcccgag 3540 tcatccaagt tgatacaagt atgtgctaat cttgaccatc ctacatctac cttgttggta 3600 gaaagatctt ttgcgtcaaa cggaaatcca gaggaagtat atggcagtac ggatacgctt 3660 atagaccgcg aagacagccg aatattcgta tcaaatttct caaggacccc gattacaatt 3720 ccggcaggac aggttctagg gataggaagg aatcctgcat cgtggctgga cagaaagggt 3780 cagttttctg aatcgcagaa gcaggacgcc gaaaagcgag ctcacttaat tcagacttta 3840 cttcagaacc aatccattcc gccgaagtca catacacaat ccgaacacac gacactcacc 3900 ggtaaatgtc aattacaaga aattctagac ccttcaaggt tacgatactc tggagaggat 3960 cctctatcag agccaccagt agagggcgga ccaaagacag cagaggtacc aggggaagtt 4020 atttctgaaa gcacgttatt gaaggagttg gatatttcgg ctagtctctc tcaggaacag 4080 gctcaggaaa ttcgtcgtat tttgattaaa cataaggaag tttttggact agacggacgt 4140 ttgggtagtt acgcagaaga agtaaaaatt ccgctagttc caaataccaa gcccatatcc 4200 gtcccgcctt ttcacgcatc 
tccggcaagt cgagaagtta ttgataagca gatggactcc 4260 tggttgagtt taggagttat tgaaccatct cgaagcccat ggggcgcacc tgtcttcatc 4320 gcgtatcgga acagtaaacc gcgaatggtc atcgatctca gacggttaaa tgaaagggtt 4380 ataccggatg aattccctct tctgagacag gaagagattt tacaatcctt ggaaggaagt 4440 caatacttat cgactctcga cgctctcgca ggattcaccc aacttagcat cgctccggac 4500 gaccgcgaaa aactggcgtt ccgttgtcac aggggactgt ttcagtttag aagaatgccc 4560 tttggctata ggaatggtcc ggccgtcttt cagcgagtga tgcagaaaat tcttgctccg 4620 tttctgtgga ttttcgcgct agtttatatt gacgatatcg tgattttttc caaatctttc 4680 gaagaacatt gtcagcacgt agggacagta ctggcagcaa tcgaatccgc tgaaatcacg 4740 ctttctcctt ctaagtgtca tttcggctat cagtcaatta tgctattagg gcagaaagtt 4800 tctcggttag ggttatccac acataaggaa aaagtggatg caatcctaca acttgacaac 4860 cctaaaaacg tacatgatct acaggtattc ttgggtatga tggtctactt ctcattgtac 4920 gtcccgttct acgcatggat agcgcacccc ctatttcaac ttttgaaaag ggaaaataag 4980 tggacatggg gaaaagacga acaggcagct tacaagttgt gtaaacaggt ccttacagaa 5040 gccccagtaa gggctcacgc aatgctggga attccgtacc gaatctattc agacgcttgc 5100 gattttgcgt tagccgcaat cctacaacaa gtccaaccaa taaagataag ggatttgaaa 5160 ggaactaaaa cttacgagcg attggaaaag gcctacaaga aaggcgaatc gataccaaac 5220 ctgattactc acttagataa ggaaggatca gacgtccctt cgaacggaga atgggatgta 5280 gattttgata acacaactgt ccacattgaa cacgtaatcg cgtattggtc gtgtgtgttg 5340 caagcggccg aaagaaacta ttctcctact gagagagaag cattagcatt gaaggaagga 5400 ttaataaaat ttcaaccata tcttgaagga gaaaaaatcc tcgccatcac tgatcacacc 5460 gctttaacgt ggagtaagac atttcagaac gtcaataggg ggttacttac gtggggttta 5520 gtattctcag ccttcccaaa tatgaaaatt gtacaccgag ctggtcgagt acactcaaac 5580 gtcgatctga tttctcggct gaggagacga caaccaattc aggaagggcc tacaaatatg 5640 gactcagcat tcctaaccct gaagcctaca gaagacccac tgaaaaacat gtttgaggag 5700 ttaggtccac aatttgagga aaaattgttg aaagtagcgt cgaatttcat gatgacagaa 5760 ttagaagttg agaaggaatt aacgtccata cctgttacgt taagagtggg agatacagaa 5820 gaaatcgaga tgtcccaact aaccacacgg aattattcaa tcttggtagg aatgaatcag 5880 gacgaaatat ctaaatggaa aagggcatac 
gacaaagacc ctcattttaa ctcagtcttg 5940 aaggctatga gagaggacaa agatgacagc ataccttttc cgcaatatca ttactccgat 6000 aatggactcc tatattttga ggacagtctg ggcaatacga ggatgtgtgt accgaaggag 6060 ctaaggaaca aggtcatgtc cgagaaccat gacattattt cggagtccgc tcagagcgga 6120 tactataaga catacaacag gataagcggc acatattatt ggcctaggat gtctagagaa 6180 attaaaagtt ttgtgaatac gtgcgatgtc tgtcagaaga caaagccgag aagacacgcg 6240 cctctgggat tactccaacc tatacccatt ccatctcagc cgtttgaagt ggtgatgatg 6300 gattttatcc ctgaacttcc gacatcaaac ggatgcgata acatactagt tattgtcgat 6360 aaactcacca aatatgcaat cttcatccct acaacaacaa aaatcagcga aatcgaaaca 6420 gctagaatgt tcttcaaaca tgttatagct aaatttggaa tacctcggca ggttatttca 6480 gatcacgaca cacgttggcg cggagatttt tggaaagagg tatgtcgatt aatggggatg 6540 aagcgatccc tgacgacatc gtatcacccg caagtggatg ggcagactga agtgatgaat 6600 cagggtcttg agatctcgat ccgcacctac atcggccctg agcgggatga ttggagcgat 6660 cttttggacg tcctagcgtt atcttacaat acttcacctc acaccgctac aggcttcagc 6720 ccagcttatc tcttacgagg atatcatccc ataactggat cgacactact catgagccca 6780 cgactgataa ggagggacga cttacaagaa atttacatcg aacatgaggc gcttgatgag 6840 aaggcgttac acatgaccta agcatttgag gctgagcaca ggaaggcaca agatgcctta 6900 cttctgggtc aggtattcca aagaaaggct tataacaagg atcgactgac ctgggaattt 6960 caggaaggtg acaaagtcgt caccaatcga aaacatctag gtcttcttag gaatgaaaaa 7020 ggatgagggg acaagcttct caccaaatac gaagggccat tcgaaatcat ccagaagata 7080 agttcagtct catatcgtct acgcatgcct gcgtctttcg gtatgcaccc agtacttaac 7140 attgaacatt tagaaagata ccacgactcc cctaaggaat ttggtgagag acccaaaata 7200 aagatgaatc gaatggactt cgaagaatta cctgaatatc aggtggaccg catcgtggcc 7260 gaatcatggc ataaaggcag gaatggcagg cgtataccca tttatcgggt acactataca 7320 gggtatggtc ctgaagccga cacttgggag ccacgacaga atttgaaaaa cgccccagtc 7380 gtgttgcagg aatggattaa caacaaggcg tcccgatcta ggaaatctaa acaactaaaa 7440 tgaatgagaa aaaatctatt taagccgtgc gtctatcact tttctctatt gcaatatttc 7500 attccctctc tccctcaaag acattgtttc ttctcttaaa aacacctcaa ctcttctttc 7560 tacgatgcag accattgaaa cctacactcc gtctctcccc 
aaccttcacc tcgttccttt 7620 tcaatcagcg aacaccgccg tcaacaacgt ccatcctctc aactaccgtc tcacagccga 7680 cgacttcatc acccttacct tgcccagcgg acaactcttg tacgcgcttg atccggctac 7740 tggcacctcc ctgtggtggg cgttacgacg tgtcgcacga gttataacac gcgcagccac 7800 gcactccatt ccctttgatc catctttacc cattgcgcaa cgtgcataca tcccccagtt 7860 tcaatttgac cgcatcgcga tgcagtgacc cgccatcctg tcaaccttga accctgactg 7920 catgaaccct ctttacgggt atacgtcaca ggaaaatatc caggcgtggt ttacggattt 7980 ccaaagcatg gggcggcagg cagtacagtg gatccgcacc gcacgccttc agtcggcctg 8040 cttagggtta cactcaggct ggaactggga ggtggcgtcc caggacgcct tgagcttaga 8100 ttatcctaac cgatacctcc tcaaggggcc cgagggtaca caaccaactt tcagaattat 8160 ttttctctca ttcattgaaa caataggtca tattgatcct gtttcatacg ctctcccttc 8220 cgcctacagt ccatctgtcg ccgattggtc aagcgacacc tccagcatca gctccaattc 8280 tgacgaagac gtcaacatgg aggatgccga gcttggtaag agccaaagat ttcaattagg 8340 atttacattg agatacttgt caggtaatag gtctctcgtc ctgcacccac tgctctcagc 8400 atatgccaca cgataccggg attacgacgt gtttagcaag gtacctttga ttgacaatgg 8460 gtacatcgat tctcgcaagg ccggcgatct cactccctat gtttctgcat tatcccccat 8520 cgaatctacc acgttccgaa ttctttcact tccaatttac gatgattacg acaactaggg 8580 cgtattcgaa cttcaaggac aggggggggt a 8611 // ID Gypsy-4_AM-I repbase; DNA; FNG; 5006 BP. XX AC ACDU01002060; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Allomyces macrogynus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_AM_; KW Gypsy-4_AM-LTR; Gypsy-4_AM-I. XX OS Allomyces macrogynus OC Eukaryota; Fungi; Blastocladiomycota; Blastocladiomycetes; OC Blastocladiales; Blastocladiaceae; Allomyces. XX RN [1] RP 1-5006 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Allomyces macrogynus genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; ACDU01002060; Positions 1831 6836. 
XX CC Positions [2228-2767] - Reverse transcriptase CC Positions [3881-4381] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 896..3568 FT /product="Gypsy-4_AM-I_1p" FT /translation="MSDAEYFRLLRGRVNSQAREELARALGQVQKAGDKRE FT HLTRQEIVQALLEAHSIGHTTELDDPMDIDLAALKPQLTYRKNVGNQRTRF FT GSPHQQSDRTPSSNFDRNNIVDSYRCLYHMTTSHSNLQCLLQGSNPRCGRY FT DSCPADLLTRIKSRGKKVEVATAGIEDGVSKVVDNVLMQIETDDSECSDDA FT TPKRISLGSARTRSVFLHTAFLGQQQAQLHAKTVCGAGANFLNSSILKRTG FT ESKTVDVQVDCILADGQTSALTKGVWLNVQVGKFISCKFFYLLNGADFQAI FT LGMTWLANMNPDIDWTTGVMKNRLGDEYSRPIPGTTPGCNISIAAVSMNGF FT RRAMRNADGPVFAIELRAATSSTDFAPDINHLSPSDRTIIEPILHKYKDVL FT NGLPKGFQPPDRGPLNFQIQLAPNFEYPRPRIPHLSPLDHEALKKEIASLL FT QRGAIKPSDAPFASPILFVKKKNSDLCMCVDFRTINAATIVDLYLIPHSDE FT LRSRVAKAKIFTVIDLASGYHQLCIHPDSQVWTTFTCPLGNFAFTVMPFRL FT AGAGSAFSHMMRSVLGHLGDEFIAFYFDDVVIFSNTMDKYANHLDAVFATL FT HQQRLYATPKKASYANDHVHFLGHVIRHGEIRPDLKKIKSIMDWPLLKTRN FT DVQQFRGLAQYLSAHVPHDSSIVAPLSDLLTNGTPRDAFEAKPAVAAFAEI FT KRLIADSVALTLIDPGLLFEIYCDASSFGLGFALVQNGKVLSFNSHKLKPT FT ERNYPAHDFELLAVCNSLKEYGYLVEGRKFTIFADNQALSHFLNKKHDLTQ FT AQHAALDLIQSFMPFTIKHIAGAKNSIADALSCRPDYVADPANPPEPLTLP FT LKRVKLSEIITAHVSADEVLDWIRAAYDQDEWCQAILM" XX SQ Sequence 5006 BP; 1152 A; 1512 C; 1166 G; 1176 T; 0 other; ttttggtagc gtctgtccgt gacgactcct cgcgcacggc cgctcagtcc gcacctcaaa 60 gccttcacgg ctccatcacc ttcactacac cctcgcgctt tcctctgcat ccacgacgaa 120 tctctcgcct tcacagcaca ttctctctcg atactcgatt cgcacacgta ccaacacaac 180 aatgccgccc aaggcttctt cttcttccca acaatttctc gttctcgtgc cacgttacgc 240 aggcacgact gtctctggat acaacgcgtt catcgacgaa cgctctggcg caactgccac 300 cacctcaatt tcgtcagtcc aactcgatgc caccaacgat acccatgtcg catttgacaa 360 taatgctgtc aaggaaaaga gctattggca agtctcgcat gatggaacgc tcaccaagcc 420 gcacgaatcc attgcaaatg tggcagcgca gtctatctgg atgagcatgg gtgccactgc 480 cactcatgtt gctgctgaag ccgctgcagc tgctgtcgcc agtcaagatc acccaatctc 540 ggtccaacag gccatggaca agccgaagtt tgatggctct cgtgatattg gtgctctgga 600 cacattccag gtcaagcttg aactctacct cgacggttgt 
ggctggaccg aacaagacaa 660 gatctgattt gttattggcc aactttttgg cgaggcgatt gcgttttggc ggcacacggc 720 gcaacctatt ccgataacgc actggactga ggtcatcaac acgctccgca cttactttgt 780 gcgcttgacg attggcaccg actcgtacaa ggcgctcgaa cggttgtgac agggcaatac 840 gcccccattc gaattcttga aacaatttga cgagcatgcc agccgtgtca gcaacatgtc 900 cgacgccgag tacttccgcc tattgcgtgg acgcgttaac tctcaagccc gcgaagaact 960 ggcccgcgct ctcggccagg tccagaaggc gggcgacaag cgtgaacact tgacgcgcca 1020 ggagatcgtc caggcattgc tcgaagctca ctcgattggc cacaccacgg agctggacga 1080 ccccatggac attgacctgg cggctctgaa gccgcagctg acataccgca agaacgttgg 1140 caaccaacgt actcgattcg gttcaccgca ccagcagtct gaccgcacac caagcagcaa 1200 ctttgaccgc aacaacattg tcgactcgta cagatgcctg taccatatga ccacgtcgca 1260 cagcaacttg cagtgtctct tgcaaggcag taaccctcgc tgtggtcgct atgacagttg 1320 cccggcggac ctactcaccc gcatcaagtc gcgtgggaag aaggtcgagg tcgcaactgc 1380 tggcattgaa gatggcgtgt ccaaggttgt tgacaatgtt ctgatgcaga tcgaaactga 1440 cgattctgaa tgctctgatg acgcaacacc aaagcgcatt tcacttggct cagcacgaac 1500 gcgctcggta tttttgcaca cggcctttct tgggcaacag caagctcaac ttcacgccaa 1560 gactgtttgt ggtgctggtg ccaactttct caactcgtcg atcctgaaac gcactggcga 1620 atccaagact gttgacgttc aagtcgactg cattcttgcg gatggccaga ccagtgctct 1680 gaccaagggc gtttggctca atgtgcaagt tggaaagttc atctcgtgca aattctttta 1740 cctgctcaat ggcgctgatt tccaggccat cctgggcatg acttggcttg ccaacatgaa 1800 cccggacatt gattggacaa ctggcgtcat gaaaaatcga ctcggcgacg aatactctcg 1860 tccgatacct ggcacaacgc cagggtgcaa catcagcatt gctgctgtct ccatgaacgg 1920 tttccgccgt gctatgcgca acgctgatgg cccagtcttt gcgatcgagc ttcgagctgc 1980 cacctcaagc accgactttg cacctgacat caaccatctc tcaccatccg accgcacaat 2040 catcgaaccg atcctgcaca agtacaagga tgtcttgaac ggtttgccca aaggcttcca 2100 gccgcctgat cgtggcccgc tcaatttcca gatccaactt gcgccgaact ttgaataccc 2160 acggcctcgc atccctcatc tatcgccatt ggatcacgaa gcactgaaga aggagattgc 2220 atcgctcttg cagcgtgggg ccatcaaacc aagcgacgcg ccattcgctt caccaattct 2280 gtttgtcaag aagaagaaca gcgacttgtg catgtgcgtc gattttcgga 
ccatcaatgc 2340 agccacaatt gtcgatctgt acttgatccc tcattccgac gagttgcgct ctcgcgttgc 2400 aaaagcgaaa attttcactg tgattgacct tgcttccggc taccaccaac tgtgcattca 2460 cccagactcg caagtctgga ccacgttcac ctgcccgctt ggcaactttg cgttcactgt 2520 catgccattc aggctcgcag gcgctggatc cgctttttcg catatgatgc gctccgttct 2580 tggccacttg ggcgacgagt tcattgcctt ctactttgac gacgttgtga tcttcagcaa 2640 cacgatggac aagtacgcaa accacttgga tgctgtcttt gctactttgc accaacaacg 2700 actctacgcg acacccaaaa aggcttccta cgccaacgac catgtgcatt tcctgggcca 2760 cgtcattcgc catggcgaaa tccgcccgga cctgaagaaa atcaagtcga ttatggattg 2820 gccgctgctc aagactcgca acgatgtcca acaatttcgc gggttggccc agtacctctc 2880 tgcgcacgtt ccccacgact cgagcatcgt cgcgccactc tctgacttgc tgaccaacgg 2940 aacgccaagg gatgccttcg aggctaagcc agcagtcgct gcatttgctg aaatcaagag 3000 gctgattgct gactcggtgg cactgacgct gattgaccca ggcctgctgt tcgaaattta 3060 ttgcgatgca agctcgtttg gtcttggctt tgctttggtg cagaatggca aggtcttgtc 3120 gttcaattca cacaaactaa agccgactga gcgcaattat cccgctcatg actttgagct 3180 tttggctgtt tgcaattcgc tcaaagagta tgggtacctg gtcgagggtc gtaaattcac 3240 gatctttgct gacaaccaag cactgtcgca tttcctcaac aagaaacacg atctgactca 3300 agctcaacat gctgctctgg atctgatcca gtcgttcatg ccatttacga tcaagcacat 3360 tgctggggcc aagaactcaa tcgctgatgc actctcctgc cgacctgact atgtggcgga 3420 ccctgcgaat cccccagagc cactgacctt gcccctcaag cgtgtcaaac tatcggaaat 3480 cattactgct catgtttcgg ctgatgaggt cttggattgg attcgtgcag cttacgacca 3540 agacgaatgg tgccaggcca tcttgatgta gcttgcgcaa ccccttggtc cccacacgtc 3600 gcatacgcac cacaacttta ttcacctcga aggccttctg tacctcaaga atcattttgg 3660 ccagcttcga ctcgtggtcc cttacgctga cgacttgcac gcgcaacttc tcgccgaaca 3720 tcatcacaat actggcgttg cacatcatgg tattgcgcgg acgtactatc tcgctgtgca 3780 aggactctac tggcctggcc tctaccacaa cgtcgagttg ctagtcaaga agtgcacgac 3840 ctgtgccaag ttttgaaccc ataaccagcc gccccaacaa gtacagccgc ttcccattcc 3900 aactgggctg ttctcggaaa tttcaatgga cttcatcaag gggctcctga tgttgcgcac 3960 aggcaacaac caggtcctca ccattgttga ccgtttcacc aagtacgctg tcttcgtgcc 
4020 atgccgcgac acaatcacgg ctgaggatgc cacggaaatt ctgttcaagg agtgggtcaa 4080 gtacttttcg cttcccacgt ccattatctc cgattgcaac gttctcttcc gtgcccaggt 4140 cttcatgtac ttgtggaaac gtcttggcac gaagctcaag ccgacaacag cttatcgccc 4200 acagagcaat ggccaggctg aagtgaccaa ccgcaagttc tctcgcatct tgaaaacagt 4260 gctgcacggt cttgacggca tgcggtggga ggacatggtt ccgattgtgc agttgacgta 4320 caactctgcc tgccatcacg ctctcagcat gtcgccactc caagcagctg ttggcacact 4380 acctcgactt ccaccaacca cgtttatggc cacttgtctc gatccactga tgttcagcga 4440 tctggctcca tgacttctcc gtattcatga tgccatcttg gactcgcagc agcgcatgct 4500 tcaacgcaat cctcgaaacc aagcagctgt ttggcaacca agaatcggcg atttcgtcat 4560 gctcaatgca cgcaacctga acgccaatac tgctggactt ccagtcacag ccaagctccg 4620 gccatcgtat attggaccgt tcaaggtcat tgcacaacgc tcgcctgtga gcttccagct 4680 cgaactaccg gatcgtatgg ttgccaaccg tgttcacgat gtcttccacg ccgaccactt 4740 aaaacaggca cctttgggtg gtgcgccagc acaaggccat gatcacactg atgaaagcga 4800 tacctctgac tctgccgaca aacctgatga tggcactggc gagtatgaca agattgcgct 4860 gccacctgat catgtcactg ctgctcctga acctgcttcc ctgaacgaga accgcaagtt 4920 gcgccggttg cttggacaag cacccgaaat ccaaaaacgt ttcaatccgg aaccgcgtcg 4980 acatcgctag ctggaggggg gagaaa 5006 // ID Gypsy-1_LWa-I repbase; DNA; FNG; 3195 BP. XX AC AADM01000021; XX DT 12-MAR-2011 (Rel. 16.03, Created) DT 12-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Lachancea waltii genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_LWa_; KW Gypsy-1_LWa-LTR; Gypsy-1_LWa-I. XX OS Lachancea waltii OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Lachancea. XX RN [1] RP 1-3195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lachancea waltii genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AADM01000021; Positions 49203 46009. XX CC Positions [2075-2572] - Integrase core CC 'AATCT' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 497..3193 FT /product="Gypsy-1_LWa-I_1p" FT /translation="MCVDYRKLNKVTVKDPFPLPRIEVLLTKVGEAQWFST FT LDLHSGYHQIPLRTTDRHKTAFVTPSGKYEYRVMPFGLVNAPSTFARYMAD FT LFRALPYVCVYLDDILIFSDSQNQHFEHLDEVLGRLKKENLIAKLKKCHFL FT QKRVEFLGYVIGHNEIRPIQEKCQAIASFPRPRTKKDAQRFLGVINYYRKF FT IPQCSQWSRPLIEFISDKQKWGTDQTKAFIKLKSCLMSKPLLRPIKQDAQY FT RLTTDASKDGLGAVLEEVSGKRMIGVVGYFSKSLQGSQKNYPAGELELLAI FT VEALRHFKYMLHGKHFVLRTDHISLLTMRGQKEPHRRCARWMDELSEFEFT FT MEYLPGEANNVANAISRSSPRLDMEGSEQKGLGVQRAAAVEVAQCDPGQWL FT ASWQADALGAAALLYLGIIQDDRVSSQERDAFEKYKKKFKLSEQFRKAYSY FT HDDRLWYKERVVVPKDKREDICNIYHDHLLFGAHFGCEVTFNKIAEKYYWP FT SLWRATKDYVASCMQCQIMKSHQPQRQGLRKPSEPPEGRWTEIAIDFLTGL FT PTTKKGYDMIMVVIDKFTKRSHFVACEKKVTGEMAMIDLLFRYVFAYHGFP FT KTITSDRDGRFVSVAYQELANKLGIKLKMSSSNHPQTNGQAERCIQVLNQL FT LRIYTKEYRQEWDIMLPQVEFAYNSTFNIAIGMAPFEADIGFCPNEPTIKP FT NSIMNPQAMDHHEFTTRLQALKNRVQEKLIDNGVSMEISKNQNRRMLKLNV FT GEYAFLHRDAYFKGGRYIKIRPIYMGSFQVVKKINDNAYELDLPSMKKNHR FT VINVQWLKRYIPRRETYSRTPPNTQKEKQERAGQITEIVGYDETQQIYYCK FT MAEVDPMITVEYNIQEFRAIPDHLRSELLTTYQETMAAQGSLEERE" XX SQ Sequence 3195 BP; 1172 A; 577 C; 692 G; 754 T; 0 other; attggtagcg ccgctgtttg acgacttcta gagtaatgag ttcagagaga gaatcaaaaa 60 aaggagaacc gacaccacat ttcgatggaa ccaatgacga tgatcagttg aataggttct 120 taaagaaatt aaaggtatgg ttccatttac atggtacaaa agagaaggat aaagaaagag 180 aactcgcact aaagttaacg aacatgaaaa gaactcattc aaaaaattac cacaatggtt 240 acagacaaaa tacaaaggaa cagtctgcaa tgaacttaaa ggaaaaaagg aacacaaaag 300 cagcatagaa cacacaattg atgtcttacc agaagcgatt ttacctagac aacagcctta 360 taggttgacc ccgaaacaag aacaaatcgc acagagcttg gtagaagaat tgttacacaa 420 aggatttgct tcaccttcta agtcgccatg cagctcacct atagtactag tcaagaagaa 480 ggatggttct tataggatgt gtgtggatta cagaaaactg aataaggtaa cagttaaaga 540 tccttttcct ttaccacgta ttgaggttct gttaactaaa gttggcgaag ctcaatggtt 600 ctcaacttta gacttgcaca gtgggtatca tcaaatacca ctaagaacta ctgacagaca 660 caagacagcc tttgtgacac cttcaggaaa atacgagtac agagtaatgc cgtttggatt 720 agtaaacgct cccagtacgt tcgccagata 
tatggcagac ttgtttcgag ccctaccata 780 tgtctgcgtg tacctggatg acatattaat cttctcggac tcccaaaatc aacactttga 840 acatctggat gaagtcttgg gtaggttaaa gaaggagaat ctaatagcta aactcaaaaa 900 atgccatttt ttacaaaaac gagttgagtt cctaggatat gtaatcggac acaacgagat 960 aagaccaatc caagagaaat gtcaagctat tgcatctttt ccacgaccaa gaactaagaa 1020 agatgctcaa cgatttttag gagttataaa ttactatcga aagtttattc cacaatgttc 1080 acagtggtca aggccactaa tagaatttat tagtgacaaa cagaaatggg gtacggatca 1140 aacaaaagct ttcataaaac tgaagagttg tttaatgagt aaaccattac tgaggccgat 1200 taagcaagat gcacaatatc gactcaccac cgatgcctca aaagacggcc taggagctgt 1260 tttagaggaa gtatcgggta aaagaatgat tggagtagta ggatattttt cgaagtcact 1320 gcagggctct cagaaaaatt atccggccgg agagctagaa ctacttgcaa ttgttgaagc 1380 gttaagacac ttcaagtata tgctccatgg gaagcatttt gtactgagga cggatcacat 1440 aagtctcttg accatgagag gtcagaaaga accacatcgc aggtgtgcaa ggtggatgga 1500 tgagctgagt gaattcgagt tcacaatgga atacttaccg ggcgaagcga ataatgttgc 1560 gaatgctata tcgaggtcta gcccaagact agatatggaa ggaagcgaac agaaaggact 1620 gggagtccag agagctgccg cggttgaagt tgctcagtgt gatccggggc aatggttagc 1680 gagttggcaa gcggacgcgc ttggcgcagc tgcgctattg tacttaggca taatccaaga 1740 tgatagagta tcatcccagg agagagatgc attcgaaaag tacaagaaaa aattcaagct 1800 ttcagaacag ttcaggaaag catatagcta tcatgacgat aggctatggt acaaggaaag 1860 agtagttgta ccgaaagata agagagaaga tatatgcaac atatatcatg accacctctt 1920 atttggagca cattttggat gtgaagtcac atttaacaag atagcagaaa agtattattg 1980 gcctagcttg tggagggcaa ctaaagatta cgtggcaagc tgcatgcaat gtcaaatcat 2040 gaaaagccat caaccacaaa ggcaaggtct gagaaaacca tcagagcctc cggaaggcag 2100 atggacagag atagctatcg actttttaac aggtttacca acaactaaaa aaggatatga 2160 catgataatg gtggtaatag ataagttcac aaaacgttct cattttgttg cttgtgaaaa 2220 gaaagttact ggagaaatgg ctatgataga tttgttattc agatacgtgt ttgcatatca 2280 cggattccca aaaacaatta caagtgacag ggatggccgg ttcgtaagtg ttgcatacca 2340 ggaactagca aataaattgg gaataaagtt gaaaatgtca tctagtaatc acccacaaac 2400 caatgggcag gcagaaagat gtatccaagt tttaaaccaa 
ttattacgaa tctatacaaa 2460 agaataccga caggaatggg atattatgtt accccaagtc gaatttgctt acaacagtac 2520 cttcaacatc gcaattggaa tggcaccatt cgaagcagat attggatttt gcccaaacga 2580 acccacaata aaacctaata gcattatgaa tccacaagca atggaccatc atgaatttac 2640 aacgaggtta caagctctaa aaaacagagt acaagaaaag ttaatcgaca acggcgtatc 2700 catggaaata tctaagaacc aaaacaggag aatgcttaaa ctcaacgtag gagaatatgc 2760 tttcttacac agagacgcgt atttcaaagg aggccgttat attaaaatca gaccaatata 2820 catgggatcc tttcaagttg taaaaaagat aaatgataat gcgtatgaac ttgatctacc 2880 ttcgatgaaa aagaaccata gagtaataaa tgtacaatgg ttgaagaggt atataccacg 2940 aagagaaaca tattctcgaa caccaccaaa cacacagaag gaaaaacaag agagggcagg 3000 tcaaataact gaaattgtcg gttacgacga aacacagcaa atatactatt gcaaaatggc 3060 agaagtagat cctatgatta cagtggaata taatatccag gaatttaggg caatacctga 3120 ccacttaaga tcagaattgc taacaacata tcaagaaacc atggcggctc aaggctcgct 3180 agaagagagg gaaga 3195 // ID Copia-1_LBS-I repbase; DNA; FNG; 3009 BP. XX AC ABFE01000677; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_LBS_; KW Copia-1_LBS-LTR; Copia-1_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-3009 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000677; Positions 85583 88591. XX CC Positions [79-609] - Integrase core CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 16..2751 FT /product="Copia-1_LBS-I_1p" FT /translation="MRIDLSQTPPKCDSCIRGKQGRTPVPKMRQGERLNHR FT LGIIYVDLTGPEAVKSASGNLYVMNIVDDNSSHPWTFCLKLKSDVLPTLQT FT WARRAEAESSEKIGIIRIDGGELDSDAMALWCDANGYTLQTTAPYTSAHNG FT CAECMHLMIMNRMRAMRASTPQVPPNRWDEFAMTAGYLLARTPTRMLGKTP FT YEVWHGRKPDLSHLREIRSCAFALILKNNPKIYERSFECILVGYSPNSKAY FT QLYHHTTHRLFESFHVKFIEQKDDIPRPLYPGHVIDLPSTDPPGNPSDASQ FT TTPIVSSSPDVPASSSVSHSPSLSSSSPSISVSSSPKHTIISDEEELINDA FT QGQVWTVPGNDDEVPVPVHDDPTNVGPIGDIGNVPRRSARTLAPMAKAAEI FT LGIKHLPHVAQAITESHEAGRRLKEQRAQAKFERRQLVLDQRASLTNIGPP FT STSNPTIPPAVPTDAVPDDSLPQVPPLDPESDFVAFCEAYATELASPLINP FT RNPDKPTFRQAMKSPDADKWTFGIQDELKSLKDMGVYVLVPCSDVPSGHKI FT LHGKWVLNLKQDKVGAPVRHKARYVVLGYEQIFGQDYADTTSPTARMESVR FT LLLNIVAAKDWDLQQIDVKTAFLYGLLPPDKAQYLEQPEGFAEPGKEDWVW FT CLQSGLYGMKQSGRIWNKTMHKAMLGWGFKRLHADPCVYYRVSSIGTVLST FT VHVDDFLITSSTPEASQAFKEELKSLWTISDLGEASFCVGIAISHNRIDQT FT ISISQTALIDRIIQQFGQTDADPISTPMDPSVAKSLTRPSPSDPPLSATDS FT YDLGRIPYRSLVGSLMYLAVGTRPDISFAVARLCQFLDCYRRAHWNAALHV FT VQYLKGTWLLASTLSGDPDLELVGFSDSSYADCPDVNDIFMKALPRPDFMR FT LRPFLGLQ" XX SQ Sequence 3009 BP; 684 A; 830 C; 620 G; 875 T; 0 other; taggatattc taggtatgcg catagatctc tctcaaacac ctcccaaatg tgacagttgc 60 atacgcggca aacagggtcg aaccccagta ccgaaaatgc gccagggaga gagattgaat 120 cacaggttgg ggatcattta tgtggatttg acaggacctg aggcagtgaa gtctgctagc 180 ggaaacctat acgttatgaa catcgtcgac gacaactcta gtcacccatg gacattctgt 240 cttaaattaa aatctgacgt gttacctact ctccaaactt gggctcgtcg agctgaagct 300 gagagcagtg agaaaatcgg tatcattcgt attgatggtg gtgaactaga ctccgacgcg 360 atggcacttt ggtgtgatgc taatggttat accctacaga ccactgcacc ttacacatct 420 gcccataatg gatgtgccga atgcatgcac ctcatgataa tgaacaggat gcgtgctatg 480 cgcgcttcaa caccacaagt ccctcctaat cgttgggacg aatttgccat gactgcaggt 540 tatttattgg cccgtacgcc tacccggatg cttgggaaaa caccctatga agtatggcat 600 ggcaggaaac cagacctatc acacctccgt gaaatcagat cttgcgcatt tgcccttatc 660 ctcaaaaaca acccaaagat ctacgagcgt tcctttgaat gcattcttgt tggttactcc 720 cccaattcca 
aagcatatca actttaccat cacacgacac accgactttt tgaatcgttc 780 catgttaaat tcattgaaca gaaggacgat atccctcgcc ctctttatcc tggtcatgtc 840 attgatcttc catctacaga tccccctggt aacccttctg atgcatcaca aactacccca 900 atagtttcct cttctccaga tgtgcccgct tcttcctctg tatctcattc tccctctctt 960 tcttcttctt ctccttctat ttctgtttct tcttcgccca agcacaccat tatttcggat 1020 gaggaggagc tcattaacga tgctcagggc caagtctgga ctgtacctgg caatgatgat 1080 gaggtaccag tccctgtaca tgatgaccct actaatgttg gtcccatagg tgacattggc 1140 aatgtaccac gccgatctgc ccggacgctg gctcctatgg ctaaggcagc tgaaattcta 1200 ggcatcaagc acttacccca tgttgcgcaa gccattacag aatctcatga agctggtcgc 1260 cgcctcaagg aacaacgtgc acaggctaag ttcgaacggc gtcaactggt cctggatcaa 1320 cgagcttcat tgactaacat tggccctcct tccacctcta atcctaccat tccccctgct 1380 gttcctactg atgctgtccc tgatgactca ctaccgcagg tacctcctct ggacccagaa 1440 tcggatttcg tggctttctg tgaagcttac gcgaccgaat tggcatcccc cttgatcaac 1500 cctcgcaatc ctgacaagcc taccttccgt caggcaatga aatccccaga tgctgacaaa 1560 tggacctttg gtattcaaga tgaactgaag agtctcaaag atatgggtgt ctatgtactt 1620 gttccttgtt ctgacgttcc ctcaggtcat aagatcctac atggcaaatg ggttctcaac 1680 ctcaagcaag ataaagttgg cgcccctgta cggcacaagg cgcgatatgt tgttttaggc 1740 tatgagcaga tttttggtca agattatgcc gatacaacct ctcctacagc ccgcatggaa 1800 tctgttcgcc tccttctcaa cattgtggcc gccaaggact gggatcttca gcaaatcgac 1860 gtcaagaccg ctttccttta tggtctatta cctcctgaca aagcgcaata cttggaacaa 1920 cctgaaggct ttgccgaacc tggcaaggaa gactgggtat ggtgcctaca gagtggactc 1980 tacggcatga agcagagtgg ccgtatctgg aacaaaacaa tgcataaagc tatgttaggt 2040 tggggcttta agcgactaca tgctgatccc tgcgtctatt atagagtctc ttccattgga 2100 acggtccttt ccactgtcca tgttgacgac ttccttatta cctccagtac acctgaagca 2160 tcccaagctt tcaaggagga gctcaagtct ctctggacaa tttcggatct tggtgaagcg 2220 tccttttgtg ttgggattgc catctctcac aatcgcatcg accagaccat ttctatttct 2280 caaaccgctt taattgaccg catcatccaa cagtttggcc agactgatgc ggacccaatt 2340 tctacaccta tggatccttc cgttgcgaag agccttactc ggccctctcc ttcagatcct 2400 ccgctgtcgg caactgactc 
atatgatcta ggccgtattc cttatcgctc cctcgttggc 2460 tctctcatgt atttggccgt tggcacccgc cctgacattt cttttgctgt cgcccgcctt 2520 tgccaatttc ttgattgcta ccgccgtgca cattggaatg ctgccctaca tgttgtccag 2580 taccttaagg gtacttggct tcttgcttcg acactcagtg gtgaccccga tttggagctc 2640 gttggctttt ctgattcttc ttatgctgat tgccctgatg tcaacgacat ctttatgaag 2700 gccttacctc gtcccgattt tatgcgcctc cgtcccttcc taggcctcca gtgatgcttg 2760 tgcgaggagg agaattttct ggttggcgtc tcgatttcaa tctccttcat ttaggtagat 2820 ttcatttcga ccatgccagc cggtagcttt ttcccactgg gttttgatgc acccggtgct 2880 tggattctac ccatacattt attctatcca tcatcctatc ctatcctcca ttcatctctc 2940 gttttttttt ctttgctttc ttttttttct tcttttcttt gtcctcccga tctacgctta 3000 gaggaggag 3009 // ID Gypsy-3-LTR_AF repbase; DNA; FNG; 266 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Long terminal repeat of the Gypsy-3_AF LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-3_AF; Gypsy-3-I_AF; KW Gypsy-3-LTR_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-266 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-266 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-3_AF, a family of gypsy LTR retrotransposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 65-65 (2006). XX DR [2] (Consensus) XX CC This is a long terminal repeat of the Gypsy-3_AF LTR CC retrotransposon. Gypsy superfamily. It is characterized by 5-bp CC target-site duplications. 
XX SQ Sequence 266 BP; 53 A; 70 C; 48 G; 95 T; 0 other; tgtgacagcc ctctcagagg ttgctcacat tctgccgtac cagactgttg accgccatct 60 tgaagtgatt ccatattgag agattcagtt ggggattttc cgtttgcttt gtacaaccaa 120 cagcaccaag atgtctcacg ctcaagctgt acatagattc cctctctttc cttccctctt 180 tctgtttctt cttcttcttc tttctcaagc ttagtattcg catccaatgc taccgtatgt 240 ttcttacgaa tgagctgtgt gtgata 266 // ID ABR1 repbase; DNA; FNG; 314 BP. XX AC AJ238113; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Agaricus bisporus hAT-type non-autonomous class II DNA transposon DE ABR1. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; ABR1; KW class II; hAT-type. XX OS Agaricus bisporus OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Agaricaceae; OC Agaricus. XX RN [1] RP 1-314 RA Sonnenberg S.A., Baars J.J., Mikosch S.T., Schaap J.P. RA and Van Griensven J.L.; RT "Abr1, a transposon-like element in the genome of the cultivated RT mushroom Agaricus bisporus (Lange) Imbach."; RL Appl. Environ. Microbiol 65(8), 3347-3353 (1999). XX DR Genbank; AJ238113; Positions 1 314. XX SQ Sequence 314 BP; 87 A; 74 C; 67 G; 86 T; 0 other; ctcagtaaaa agcgattttc aggcattttc gggtttctca tgccctgaac ataggctgaa 60 aattgtcctc tttctagaac aatgcgagaa catgatcagc caacgttcag gcaagttgct 120 ttaaatgtgg aacatttgtc gggccaagct caggttgtgt tcaggcaacg ttcaaggtgt 180 aagaacctca actctcaccg accttcaggg tgccaatcga ccctaaacca gccctaaatc 240 agcctgcatt gtcccgaatt acgttgaatt gacctggatt agcctgaaag atcctgagaa 300 tattttttac tgag 314 // ID GYARLI1_I repbase; DNA; FNG; 4556 BP. XX AC CR382131; CAG34127; XX DT 01-SEP-2005 (Rel. 10.08, Created) DT 01-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE GYARLI1: Gypsy-type element from Yarrowia lipolytica (internal DE portion). XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; GYARLI1_LTR; GYARLI1_I; internal portion. 
XX OS Yarrowia lipolytica OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Dipodascaceae; Yarrowia. XX RN [1] RP 1-4556 RA Kovalchuk A., Senam S., Mauersberger S. and Barth G.; RT "Tyl6, a novel Ty3/gypsy group retrotransposon from the genome of RT the dimorphic fungus Yarrowia lipolytica."; RL Direct Submission to EMBL/GenBank/DDBJ (25-JUN-2004). XX RN [2] RP 1-4556 RA Jurka J.; RT "GYARLI1: Identification of LTRs."; RL Direct Submission to Repbase Update (01-SEP-2005). XX DR EMBL/GenBank/DDBJ; CR382131; Positions 1723890 1728445. XX CC LTRs are identical. Protein homology to Ty3/Gypsy was identified CC in CC ref. 1. XX FH Key Location/Qualifiers FT CDS 81..836 FT /product="GYARLI1_I_1p" FT /translation="MTGLAEEWWFDSGRAALGADARDWKKFTQALESRFTP FT STYAMNMEEDWSNLQCGPNESSYSYETRFRAARHHLSREAAPEEAWLRLTS FT GMKDMWRREIMKQGPILQNNVEETLRHMHIVAGAEYFPGAHAQTPAAAPVA FT TPTSTSQYGPGPMEVDNAVLLKAMMESQEASQKRMMEQQTHMVELMANAIG FT RANNNGSQNGRNNRGGRGNYYNQRTSRECYTCGKIGHLARDCRHRERVQEL FT IQQDQGNVRQQ" FT CDS 839..4543 FT /product="GYARLI1_I_2p" FT /translation="MTETITAEPLEDLESVSVPPAEYDIINKSNFVEDTTG FT EPEDAGVEVHLEEEQTEEPMQLEEEQVGEQVEEPLIAARVEVVDGKRRRAL FT IKGDRPPISIPYCYNGTTVQALLDCGASSCFISRNLAEKLGLKMTPCKPRQ FT VQSVHSMETTNYTVEVPVELGKWGCDVFAYVLPQMVGQELLLGMPFFEEYH FT EAVDFKARTFTPDGYEVPAWPANESTTWDKHGHIKSCSLEKATQLAEHHGA FT QLFLYMVREKPEGEEHEPDVDTREVLEEYADVIVDKMPMELPPKRSVEHTI FT DSDETARPPARASYRLTRFEWAEVDKQVNDLLERGIIRPSKSPYSAPLVIV FT KKKGGELRICTDYRALNELTTKDRFPLPRIDDILDCLDGADTFSKFDLLSG FT YWQVLVKESDVHKTAFSTRSGHYEYLVMPFGLCNAPATFQRLMNDALRPFL FT NKTVCVYLDDIIVFSRNREDHKRHVREVLDALRAQKFYAKKSKCELFRKKM FT GFLGHVVSAAGVEPDPEKVKVVEEWVPPNTPKGLLSFLGLTGYYRRFIEDY FT AKIAAPLTDAATLSPTDFKWTEACQVAFEQMKAKLVSNEVMIIPTMEDTFK FT VSTDACDIAMGGVLQQWSPKDQEFRPVAYESTKFKKHEMNYPTREKEFYAI FT IHALRKWRHYLLGRPFLIETDHQSLSYFTSQTHPPSGRLSRWLDFLAEYDF FT EIKYVPGKDNDAADGLSRMLAQTAMVFEPDDSLLDIIKQGYESDEYFKDVF FT KVLATEPVVIPKEMHNHARHFRYDKNTGLLYFASVYKGEGERLCVPRGKAR FT 
KMLMKEAHDAPLAGHYGYFKSYERLARAYYWPRMIDHMRNHTRSCLICQTT FT KARRAPPQGLLKQLPVPTGNWQEITMDFIGGIPTTHRGHNNIWVTVDRMSK FT MVHLIPCKTSTDGEELADMYIDRIVRYHGVPRSIVSDRDKLFTAKLWQTLQ FT TRLGTELKFSTVNHPQTDGQSERVNTELIRQMKQHFVTDKNWDLWLPVIEF FT AMNSAKHSSTGYAPFEAVYGYIPDGPTYASTRELTKVHHQMDAWMDKLRAI FT SNSMHDRLIEHQRVQENRVNQHRVPVAFQINDQVLVHRKAFFDKAKYAKMY FT DVYFGPFPIEKKIDTNVYKVQLPYDSTRHKNINVQHLKKFIPRPEYDINPP FT STEYSQECSLHQITSLVGIDDDRYFVTWEDCDPSIASSISKEMFHRIPKDK FT RDSLLDQWNQFIKTPAESEDYVDIS" XX SQ Sequence 4556 BP; 1237 A; 1171 C; 1251 G; 897 T; 0 other; cgtggtagcg atgcatcctc aacaaactag aacacttcga agttcccgaa cagtttcgag 60 tcggtgtggc cgtcgtggcc atgaccggcc ttgcagagga atggtggttc gacagtggta 120 gggccgctct gggcgcagac gccagagatt ggaagaaatt cacccaggcc ctcgagagcc 180 gattcactcc ctcgacttac gccatgaaca tggaggagga ctggagcaat ctgcaatgtg 240 ggcccaacga gtcatcctat tcctatgaga cacgcttccg agcggcacga caccacctgt 300 ccagagaggc cgccccagag gaggcctggt tacggctgac cagtggtatg aaggacatgt 360 ggcgacgtga gatcatgaag cagggcccca tactgcagaa caacgtcgag gagaccctgc 420 gtcatatgca cattgtggca ggcgcagaat acttcccagg agctcacgcc cagacacccg 480 cggccgcacc tgtggccaca cctacatcca ccagccagta cggtcctgga ccgatggagg 540 tggacaacgc ggtgctgctc aaggctatga tggaatcgca ggaggccagc cagaagcgaa 600 tgatggaaca gcagacccat atggttgaac tgatggcaaa cgccattgga cgagctaaca 660 acaacggctc ccaaaacggc cgcaacaacc gtggtggccg tggtaactac tacaaccagc 720 gaacctctcg tgagtgctac acctgcggca agattggcca cctggctcgt gactgccgac 780 atcgtgaacg tgttcaggaa ctgattcaac aggaccaggg aaatgtgcga cagcagtaat 840 gaccgagacc attactgctg agccactcga agatctcgaa tcggtgagtg tgccacctgc 900 ggagtatgat ataatcaata aatccaactt tgtagaggac actaccggag aaccggagga 960 tgcaggggta gaggtgcatc tggaggaaga gcaaacggag gaaccaatgc agctggagga 1020 agaacaagtg ggggaacaag tggaggaacc actgattgct gcacgagttg aggtggtaga 1080 tggtaagcgg cgacgagcct tgatcaaggg cgaccgacca ccgatctcga tcccgtactg 1140 ctacaacggt accacagtgc aggcgttgtt ggactgtggc gcgagctcgt gcttcattag 1200 caggaacctg gctgagaaat taggcctgaa aatgacgccc tgcaaaccac gacaagtcca 
1260 gtccgtgcat agcatggaga caaccaacta tactgtggaa gtaccggttg agctgggtaa 1320 gtggggttgc gacgtgtttg cgtacgtctt accccagatg gtcggacaag agctactatt 1380 aggcatgccc ttcttcgaag agtatcatga agccgtggat ttcaaagctc gaacgttcac 1440 acccgacggg tacgaggtac cggcatggcc agctaacgag tcaactacgt gggacaaaca 1500 cggccacatc aagtcgtgct cactagagaa agctacgcag ctggccgaac accacggtgc 1560 gcagctgttc ctctacatgg tacgggagaa gccagagggg gaggagcatg agccggatgt 1620 cgatacccga gaggtgctgg aggagtatgc agatgtgatt gttgacaaaa tgccaatgga 1680 gctacctccg aaacggagcg tagaacatac catcgactcc gatgagacgg cacgacctcc 1740 agccagggca tcgtatcgac ttacccgatt cgaatgggct gaggtcgaca agcaggtaaa 1800 cgacttgttg gagcgaggaa tcatccgacc atcgaagtca ccttacagcg caccgttagt 1860 gattgtaaaa aagaaaggag gtgaacttcg tatctgcaca gactatcgcg ccctcaacga 1920 gcttaccacc aaggatcgat ttccgctgcc ccgtatcgat gacattctgg attgccttga 1980 cggcgccgac accttctcca agttcgatct tttgtcgggc tactggcagg tgttggtaaa 2040 agagtctgat gtacacaaga cggccttctc gactcgatcg ggacattatg agtacctggt 2100 aatgccgttc ggtctatgca acgctcccgc caccttccag agattgatga atgacgccct 2160 acgaccattc cttaacaaga cagtctgtgt gtatcttgac gacatcatcg tgttcagccg 2220 aaaccgagag gaccacaaac gacacgtgag ggaagtcttg gacgccctga gagcacagaa 2280 gttctatgcc aagaagtcga aatgtgagtt attcaggaag aagatgggat tcctgggcca 2340 cgtggtgtct gcggcaggag tggagccaga ccctgagaaa gtgaaggttg tggaagagtg 2400 ggtacctccc aacacgccga aaggcctctt gagctttcta ggactgactg ggtattatcg 2460 acggttcatc gaagactacg ctaaaatcgc cgccccactc acagacgctg ccacattgtc 2520 gccaactgac tttaaatgga ctgaggcatg ccaggtggcg tttgaacaga tgaaggcgaa 2580 gctagtatcg aacgaggtca tgatcatacc tacaatggag gacacgttca aggtgagtac 2640 ggacgcgtgc gatattgcga tgggcggagt actacagcag tggagtccca aggaccaaga 2700 gttccgacct gtggcatatg agtccacgaa gttcaagaaa cacgagatga actacccgac 2760 gcgtgagaag gaattctatg ctatcatcca tgccctacgg aagtggcgac actacctact 2820 tggccgaccg ttcttgattg agacagatca tcagtctctg agttacttta cgtcccagac 2880 acaccctccc agtggacgac tgagtcgatg gctcgacttc ctagcggaat acgactttga 2940 
gatcaagtac gtgccgggca aagacaacga cgcagccgac gggttgtctc gtatgttggc 3000 acagacagcg atggtgttcg agccggacga ctcgttgcta gacatcatca agcagggcta 3060 tgagtcggac gagtacttta aggacgtctt caaggtactg gccacggaac cggtggtcat 3120 ccccaaagag atgcacaacc acgcccgcca tttccgatac gacaagaaca caggactgct 3180 gtatttcgcc tccgtttaca agggggaggg agagcgactc tgcgtgccaa gaggtaaggc 3240 cagaaaaatg ctgatgaagg aggctcatga cgcacccctt gctggtcact acggatactt 3300 caaaagctat gaaaggctgg cgagggctta ctactggcca cgaatgatag accacatgcg 3360 caaccacacc agatcctgtc tgatctgcca gaccacgaag gccagacgag caccacccca 3420 gggcttgctg aagcagctgc cggtaccgac gggaaattgg caggagatca caatggactt 3480 cattggaggc atcccaacga cccatcgagg ccacaacaat atctgggtca cggttgatag 3540 gatgtcaaag atggtacatc tgattccttg caagacgagc acggatggag aagagttggc 3600 ggacatgtac attgaccgca ttgttcgata tcacggagta cctcgttcga tcgtgtcgga 3660 cagggacaag ctattcacag caaagctatg gcagacgtta cagacacgtc taggcacaga 3720 gttgaagttt tccacagtca atcaccccca gactgatgga cagtcagaac gagtgaacac 3780 cgagctgatt cgacagatga agcaacactt cgtcacggac aaaaactggg acctctggtt 3840 acctgtcatc gagtttgcga tgaattctgc aaagcattcg tccaccggat acgcaccctt 3900 cgaggcggtg tatggataca tcccagacgg accaacgtat gcctcgactc gagagctcac 3960 caaggtacac catcaaatgg acgcttggat ggacaaacta cgggctattt ccaattcgat 4020 gcacgaccga ttaatcgagc accaacgggt ccaggagaac agggtgaatc agcatcgagt 4080 gccagtcgcg tttcagatta acgaccaagt cttggtacat cggaaggcgt tctttgacaa 4140 ggccaagtac gcaaagatgt atgacgtcta ctttggacca tttcccatcg agaagaagat 4200 tgacaccaat gtctacaagg tgcagctacc atacgactcg actcgacaca agaatatcaa 4260 tgtacaacac ttgaagaagt tcatacctcg acctgagtat gacatcaacc cacccagtac 4320 ggagtactct caagaatgca gcctgcatca gatcaccagc ctggtgggta ttgatgacga 4380 ccgttacttc gtaacttggg aagattgtga cccctctatt gcctcttcaa tttcgaaaga 4440 gatgttccac cgcatcccta aagacaaacg ggattcacta ttggatcaat ggaaccagtt 4500 cattaagacc cctgcggaaa gcgaggacta tgtggacatc tcctaggggg ggagag 4556 // ID TCN1-LTR repbase; DNA; FNG; 528 BP. XX AC . XX DT 30-MAR-2005 (Rel. 
10.03, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version 3) XX DE C. neoformans LTR retrotransposon - LTR consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; TCN1-LTR. XX NM TCN1-LTR. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-528 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-528 RA Gentles A. and Jurka J.; RT "C. neoformans non-LTR retrotransposon TCN1."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [2] (Consensus) XX CC Internal segment is TCN1-I. XX SQ Sequence 528 BP; 155 A; 132 C; 109 G; 132 T; 0 other; tgactgtcac gagcctgcac aaagaagtgt cacgagcctg cacaaagaag aagagataag 60 atatcatagg agggagttac gtaataggaa ggcaagatcc gattcagata agaaggtcca 120 gctggacctc cagttggacc tccgagaaaa gaactggacc tccagctgga cctccagctg 180 gacctcaaag gatataaata gatttctgta gaagtatata aaggggagct ttgtcctcga 240 tcttgatctt cttcccctcg aagatcaata tttcataagt tactttgcga aaggtcctcg 300 agctcccagt gctcactctc gtcttacctt gcaccacctt cgacataaac ttactccact 360 agtatataag cttgcctgat cttcccaagc gaatatacct ccgccctata ccatacgcca 420 acagacaagt gtcaagaggt atatcataca tatatctcat tgcttagtta aactacggct 480 ttctatctcg tctccaaggc cggaccttgt tggggacgag tcgtaaca 528 // ID Copia-35_MLP-I repbase; DNA; FNG; 4256 BP. XX AC AECX01001650; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-35_MLP_; KW Copia-35_MLP-LTR; Copia-35_MLP-I. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4256 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001650; Positions 85755 81500. XX CC Positions [1516-2031] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 82..4230 FT /product="Copia-35_MLP-I_1p" FT /translation="MNHDQPTPSSPTSTSISTSISPYISPEMSNHNSATKA FT KDISPLKHDWRRWSPLMLAHFMEWDLDAIVDGTEEEPAETASEAIKINYIK FT RRKKAAGFISRKLSPENRALVINAANIKDPKAIWDALVKLYASTKARNRAR FT ILRKFLNLKCTDNSLESFLTEYRRITHEMTEVSFKIDDDILAHMLLFKLSP FT RYHSTRDLLAHTAETADSLLTLDQVFEHLQQLVLDYQPGSAPPPAALIAEQ FT RAVSYERCLNGTHNPLTSHTEAKCFQLHPELKPNRNNQSSNRGAATATISG FT TVLTTYVFNATLTGKPVLDSGASQSMFSLRSSFASYRPFHATINVANGQQI FT NAVGIGTVTGSHCGKPVSISDCLHVPELQTNLVSMVALVRKGCTFNFSQDS FT SFNVLVDSNVVLTGDTKTGVMEINLDLGKSHQLSLTASTKVNPAILHRRLG FT HPGRIPFEKAFPGIHYPDSCEPCILSKMHRLPFQSHLPNACDRLSVIHSDL FT SGLISPTSLGGGKYYFKITDQKSNFKFVYILQSKCQTFSQFVQFKSLVENQ FT TNLKIKTMVNDNGGKYTSKQFSEFLKHEGIQMNFTAPFTPQQNPIAESGNR FT TTTERARALLKQSNLPLVFWAEAVSTTVYLENLTPIVRDNYVTPYENWFGK FT KPTYSHLRVFGCLCYVHIGKERRSSKFSDVAKRGVFLGYQGTMHNYRIYLL FT DDHRVIYSHDVIFNKEVFPFSDPTVLASFSDKFGSHDNVVLEELNSEDNFE FT IPLAQSPIHPLASDEDLSVSTSIPISSSSPQRDSPDPPTPTPTPPPTPTRE FT SSEPILPDELPNTRQDTQPSRQEESSTNRPSYSMVPATVPAPKAITSSIDP FT ANILTSRRRANLANHLTSEPHSYRDAMKRPDSAEWKQAIEKELNALTEMGV FT FTEVELPAGAHALGTTWAFRKKTDQNNVIIKHKARLCAQGFSQIPGLDYNE FT TYAPTGRAASMRLALSICGIDDLEVRLMDAVGAFLNGIPEEVLYIKIPQGY FT TPKLTGKNIVLLLNRSLYGLKQSPRCWYNMVKSFFLSIKFSPSKSDPCLFI FT SDDPDWRCFVHIHVDDMLVMGKNTERFSQLIQTRFKMEDLGECSFYLGMRL FT ERNCEARTITLTQDKYILGMLEEYGMNDCHSVTTPMIPGTYLLPASDEEHS FT AFLATGLNYNRAVGLLNYLVLCTCPDLAFTAGQLAQHLKKPGQEHWNAFKR FT VLRYLQGTYQDGLVLGGGSVELKVYADSDYAGCPATRRSTSGYISKLGNGC FT VSWRSRKQPSVSTSSTQAEYRAAYEAAQETVFLRRILGDLGYLQTGGTTFL FT 
CDNQSSLALQKNPLFKDRSKHFAVHLHWIRKQVEAGIITPTYIPTKEMLAD FT ICTKSLPRPQHEYLKNLIKA" XX SQ Sequence 4256 BP; 1145 A; 1079 C; 800 G; 1232 T; 0 other; gatcacctct gaattgccta cattattagg ttatgagccc agccgactca tcttaaatca 60 gatttttagc gctaccactt tatgaaccac gatcagccta cgccatcgtc acccacatcc 120 acatctatca gcacctctat ttcaccttac atttctcctg aaatgtcgaa tcacaattcc 180 gctaccaaag ctaaagacat ttcgccttta aaacacgact ggcggcgatg gtccccactc 240 atgctcgccc acttcatgga atgggacctc gatgccatcg tcgacggtac cgaagaagaa 300 cctgccgaga ccgccagtga agcgatcaag atcaattaca tcaaacgtcg taagaaagct 360 gctggtttta tctcgcgaaa actaagtcca gagaatcgag ctcttgttat caacgccgct 420 aacatcaaag atcccaaagc aatctgggat gcgttggtta agttgtatgc ttccaccaaa 480 gcgcgcaatc gtgctcgaat tcttagaaag tttctcaatc tcaaatgtac cgacaactct 540 ttagaatcct ttctcactga atatcgtcgt atcactcatg agatgactga agtctcgttc 600 aaaatcgacg acgacatctt agcacacatg cttcttttta aactttctcc tcgttatcat 660 tccacgcgtg atctacttgc tcataccgcc gagaccgccg actctcttct aactcttgat 720 caagtctttg aacaccttca acaattagtg ttagactatc aacctggttc agccccacct 780 cctgctgcgt tgatcgctga acaacgtgcc gttagctatg agaggtgttt gaatggtact 840 cacaatccgc ttacttcaca cactgaagcc aaatgttttc aacttcatcc cgagcttaaa 900 cccaatcgga acaatcaatc ttccaatcgc ggtgcggcca ctgccaccat cagtggtacg 960 gttctcacta cctatgtttt taatgctact ctcaccggta aacccgtcct tgactctggt 1020 gcctcacaat caatgttcag cctacgatct agctttgcta gttatcgacc tttccacgcg 1080 acaatcaatg ttgccaacgg acagcaaatc aatgccgtcg gaattggaac cgttaccggc 1140 tctcactgcg gcaaacccgt atctatttca gactgtcttc acgtcccaga acttcaaact 1200 aatctggtca gcatggttgc gcttgtgcga aaaggttgca cattcaattt ttctcaagat 1260 tctagtttca acgtccttgt tgactctaac gtggttctta ctggggacac caaaactgga 1320 gttatggaga ttaatttaga tttaggcaag tcacatcaat tatcactcac cgcatctaca 1380 aaagttaatc cagctattct ccacaggcgt ctaggacatc ctggtcgaat tccgtttgaa 1440 aaagcttttc ctggaattca ttatcctgac tcttgtgaac cctgtatctt gtcaaaaatg 1500 caccgtcttc ctttccaaag tcatcttccc aatgcctgtg atcgtctttc tgtaatacat 1560 agcgatctga gtggtttaat 
atctcctact tctctcggcg gtggcaaata ctatttcaaa 1620 attaccgacc aaaagtccaa ttttaaattt gtttatatcc ttcaatcaaa atgtcaaact 1680 ttctctcaat ttgttcaatt caaatccctt gtggaaaatc aaaccaatct caaaatcaaa 1740 accatggtta atgacaacgg aggcaagtat acttctaagc agttttctga gtttctcaaa 1800 catgaaggta tccagatgaa ttttactgct cccttcactc ctcaacaaaa tccaattgct 1860 gagagcggta atcggacaac tactgaaaga gctagggcgc tactcaagca gtccaactta 1920 cctttagtct tttgggccga agcagtgtct actacagtct atctcgaaaa tctcacgcca 1980 attgttagag ataactatgt cacaccttat gaaaactggt ttggtaagaa acccacttac 2040 tctcaccttc gtgtttttgg ctgtctttgc tatgtacaca tcggcaaaga acgacgttct 2100 agcaaatttt cagacgtcgc taaacgtggt gtttttcttg gctatcaagg caccatgcat 2160 aactatcgta tctatcttct tgatgatcat cgtgtgattt acagtcatga tgtcattttc 2220 aacaaagagg tattcccatt ctctgatccc accgttcttg cttctttttc tgataaattt 2280 ggaagtcatg acaatgtggt cttggaagaa cttaactcag aagacaactt tgaaattccg 2340 cttgctcagt cgccaattca tcctcttgct tctgatgagg acttatccgt ttcaacttcc 2400 attcctattt catcttcatc tccccaacgc gattctcctg atccacctac acccacacca 2460 actcctccac ctaccccgac tagggagtcg tcagaaccta tcttgcctga tgaacttccg 2520 aacacccgtc aagacactca gccatccagg caggaagaat cttcgaccaa tcgaccttcc 2580 tactccatgg ttccagctac tgttccagct cccaaagcaa tcactagttc aattgatcca 2640 gccaacatac tcacctcaag acgaagagcc aatttagcta atcatctcac ctctgaacct 2700 cactcttatc gtgatgctat gaaacgtcct gattcagctg agtggaaaca agctatcgag 2760 aaagaactaa atgctctgac tgagatgggt gttttcaccg aggtggaact accagctggt 2820 gcccacgctt taggaacgac ctgggctttt cgtaagaaaa ctgatcaaaa caacgtcatc 2880 attaagcaca aagcccgact ctgcgcgcaa ggattctctc agattcccgg tctcgattat 2940 aacgagacat atgcgccgac cggtagagct gcaagtatga ggttagcatt gagtatctgt 3000 ggtatcgatg atcttgaagt gcgtcttatg gacgccgtcg gtgcttttct caacggtata 3060 ccggaagaag ttctttacat caaaatccca cagggctata ctcctaagct cactggtaag 3120 aacattgtgc tattacttaa tcgctctctt tacggtctca aacagtcgcc tcgctgctgg 3180 tacaacatgg tcaaatcgtt ctttttatca atcaaattct caccgtccaa gtccgacccc 3240 tgtttgttta tctcagatga tcctgactgg 
cgctgcttcg tgcacattca cgtcgacgac 3300 atgttggtca tgggtaagaa cactgaacga ttttctcagc tcattcagac tcgattcaag 3360 atggaagatt taggagaatg ttcattttat ttaggcatgc gtctagaaag gaactgtgag 3420 gctcgtacta tcactctcac tcaagataag tacattctcg ggatgcttga agagtatgga 3480 atgaacgatt gtcactcagt taccactcca atgattcctg gtacatacct cttgcctgca 3540 tctgatgaag aacactccgc atttctcgct actggtctta actacaatcg agcagttggt 3600 ctcttgaact acttagtctt gtgcacttgt cctgacttag ctttcaccgc tggtcaactt 3660 gcgcagcatc tcaagaaacc cggccaggag cactggaatg ctttcaagcg tgttcttcgg 3720 tatcttcaag gaacgtatca agatggttta gtattaggag gtggatcggt ggaattgaag 3780 gtttatgctg actcggatta tgctggctgt cctgctacta ggcggtctac ttcagggtat 3840 atatctaagt tgggaaatgg ttgtgtaagt tggagatcca ggaagcagcc atcagtatcg 3900 acgtcatcca ctcaagcaga atatcgagcc gcatatgaag ctgcacaaga aactgttttc 3960 ttacgaagga tcttaggtga cttagggtat ttacaaactg gtggaaccac tttcttgtgc 4020 gataatcaga gctcattggc gctgcagaaa aatcctttat ttaaagatcg gtcaaaacat 4080 tttgctgttc atcttcattg gattcgcaaa caggttgaag ctggcattat tactcctact 4140 tacattccga ctaaggaaat gctagcagac atttgcacca aatctcttcc taggcctcaa 4200 catgaatatc tcaaaaatct gattaaagct taggatgttt tacgattgag ggcggg 4256 // ID Copia-8_LBS-I repbase; DNA; FNG; 4443 BP. XX AC ABFE01002315; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-8_LBS_; KW Copia-8_LBS-LTR; Copia-8_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-4443 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01002315; Positions 19813 15371. 
XX CC Positions [1759-2298] - Integrase core CC 'AACGA' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS join(88..2418,2422..3501) FT /product="Copia-8_LBS-I_1p" FT /translation="MSGSHNISIPFSLPEDQRCRGYENFASWKTLMIAHGK FT PRGYLKYWENKIVVPQEILDAENTSTDPKTSPSGSTTDKPPGPTPAHSITP FT SELEYELCESASMSSILINLVDIAGSGVDSNGKLHEAWTLLAKQYGGASDR FT ARNMQERALANCKFEEGMKVAGENGHIEKMRALRKAANDASAEITNIRFIT FT KLLDSFPKSWDPIISNLYDKEDLSEVIMKLTSHGERLANRASETTKTTNNT FT FDSVKALEATVHALQAEVKTLRTRPGGSSNPNKSHLVCTNPSCGKTGHLIP FT DCFQMGGGKQGQYPPWWKGKRTISPIANLASSSSTIDGTINTGGHFALSAT FT FNDDVAKLLEENESVLQKVALAASQVLPSISSCSFADSGYTTHFFKSRDVF FT SMYKPLERMAGQSSKEGASFSILGGGDIEIKVVFNNVEHTLTFRDALHAPD FT ITANLLSISKMDIAGWHAVFGDQRVRFYKGKSEIFDGILKNGLYLVNGSFS FT LAIPTALTAQSLRSPTDLATWHRRFAHFGVTRVKQALKLVDGLEVTSEDAI FT GQCEDCILGNQKRRPYDDDVQIKNRILRLTNIDIWGPSRVKSHGGSLYGMK FT FHDSGSAHRKTFFLANRQDETTLTALKTYRSESEKVTEKLMVFIRVDNAPE FT FKGKLWAEYFRETGLIMLPTPPYSSSSNGTAERSIGVTTGAVRIMLLDAGL FT SAKWWAEAWAFADYVENRLPSSRHPGVIPEEGWTGKRQDVGHIRVWGCIAY FT VHIPFEKGGSKLAARGQKGRLIGIEGGVYRILIPETGQIIRSRNVKFEEGL FT GHRTLTTEREYFHQDNGDTDYDFLVEPIVTEKPITVPAVPGNPIIPDTLNH FT PQTRPRIVYPPASRQSARIAAKNNPPPAETDDDASIPILEDPDEDDDSTHT FT ALNAEFPPEPQNRFVPVTFDDAFNLSRRHLWFPAMEREIAQWDERGVVTAV FT PHPPGVKTIKGKWVYDLKVDGLGSLIRRRARGVVKGFSQRLGENYYESFAA FT VARYESVRMLFALIASLKLHFWLIDFVGAYLNSKPQGTNYLEIPQGFENHY FT KIPNIDTVLIMNFTIYGTMDGANNWFRLLNKTFTELGHRQSRADPCIRIQH FT TTEGYTISSTYTDDVSAGSSSVDAMDRAK" XX SQ Sequence 4443 BP; 1234 A; 1170 C; 1004 G; 1035 T; 0 other; ggttatgggc cccgcacgct cctaaattat cagaagatta tcggttacac acatatcccc 60 tcgactcaat aaaaaaggct tattaccatg tctggaagtc acaatatttc gattcctttc 120 tccttgcccg aggatcagcg ctgccgtgga tatgaaaact tcgcgagctg gaaaacactt 180 atgatcgctc atgggaaacc ccgaggatac ctaaaatact gggagaacaa aatcgtcgtc 240 ccccaggaga tactggacgc agaaaatacg tctactgatc caaagacatc accctccggc 300 tctacaaccg ataaaccacc aggtcctact ccagcccact ccataactcc atcagaactc 360 gaatacgaac tctgcgagag tgcctcgatg tcgtcgatct tgatcaatct 
ggtagatatt 420 gctggatccg gcgttgactc aaacgggaag ttgcatgagg cctggaccct actagccaag 480 cagtatggtg gcgcaagcga tagggctagg aatatgcagg agagagcgtt agcaaactgc 540 aagtttgaag agggaatgaa agttgcagga gagaacggac atatcgagaa aatgcgcgca 600 ctacgcaagg ccgctaatga cgccagtgct gaaatcacca acatacgctt catcacgaaa 660 ctcctcgact ccttccccaa gtcttgggac cctatcatat cgaatcttta cgataaagag 720 gatttgagtg aagtcatcat gaaactcacc tcacacgggg aacgtctcgc caatcgcgca 780 tccgaaacca ccaaaacaac taacaacacg ttcgattccg tcaaagccct ggaagcaact 840 gttcatgcac tccaggctga ggtgaaaacc ctccgaacga ggcccggagg atcatcgaac 900 ccaaataagt ctcacctcgt atgcaccaac ccttcatgtg gaaagactgg acacttgatc 960 cccgactgtt tccaaatggg aggcgggaag caaggacaat accccccatg gtggaaaggg 1020 aagcgaacca tttctcctat cgcaaactta gcatcctcgt cctcgacaat tgatggtaca 1080 attaatactg gtggacactt cgccttatcg gctacgttca atgacgatgt cgctaaactc 1140 ctggaggaaa atgaatcagt ccttcagaag gtcgccttag ccgctagcca agtcttacct 1200 tccatttcct catgttcatt tgcggattca gggtatacta cacacttctt caaaagtcga 1260 gacgtattct caatgtataa acctttggag aggatggccg gacagtcctc taaggaaggc 1320 gcgagtttct cgatactggg gggaggcgac attgagatca aggtggtgtt caataatgtg 1380 gaacatacct taacctttcg cgatgctctt catgcacctg acatcaccgc caatctcttg 1440 tcaattagca aaatggacat tgctgggtgg catgctgtgt tcggagacca gagagttcgt 1500 ttctacaaag ggaaatcaga gatctttgat ggaatcctga agaatggact gtaccttgtc 1560 aatggatcct tctcactggc tataccgact gccctcacag cccaatccct tcggagtcct 1620 actgacctcg ctacctggca tcgccgattc gctcatttcg gagtcactcg tgtgaaacaa 1680 gcattaaagc ttgttgacgg cctcgaagtt acttctgagg acgctattgg acaatgtgaa 1740 gactgtatct tggggaatca gaaacgtcga ccatatgacg acgatgtcca aatcaaaaat 1800 aggatcttaa gactcaccaa catcgacatc tgggggccat ctcgtgtcaa atctcatggt 1860 ggatcgttat acggaatgaa attccatgat agtggatccg ctcaccgtaa aacattcttc 1920 ctcgccaacc gccaagacga gacaacctta actgccttga aaacctatcg aagcgaaagt 1980 gaaaaggtca ctgaaaagct aatggttttc atcagagtcg ataacgcacc ggaattcaaa 2040 ggcaagctat gggccgaata ctttcgtgag actggactca tcatgctacc aacccccccc 2100 
tattcctcct cctcgaatgg tactgcggaa cgctctattg gagtgacgac tggcgccgta 2160 cgaatcatgc tcttggacgc tggattgtcg gcaaaatggt gggccgaggc atgggcattc 2220 gcggattatg tggagaatag gttgccgtcc agtcgacacc caggtgtgat ccctgaagag 2280 ggatggacag gaaagagaca agatgtggga cacataaggg tctggggttg tatcgcatat 2340 gtgcacatcc cctttgagaa gggcggaagc aaattagccg cacgggggca aaagggcaga 2400 ttgatcggaa ttgagggctg aggcgtttac aggatcttga tcccggagac cggccaaatc 2460 atccgttccc gaaacgtcaa atttgaggaa ggtcttggac accgtaccct cacgactgag 2520 cgggagtact tccaccagga caacggcgat actgactacg acttccttgt agaacccata 2580 gtcactgaaa aacctatcac agtccccgct gttcctggca acccaatcat tccggatact 2640 ctcaaccatc ctcaaacacg tcccagaatt gtctatcccc ctgcatcacg tcaatcagca 2700 agaattgctg cgaaaaataa ccctccacca gctgaaactg acgatgacgc ttcaattcca 2760 attttggaag atcctgatga agatgacgac tccacacaca ccgcgctaaa tgccgaattc 2820 cccccggaac cacaaaatcg attcgtcccg gtgacatttg acgacgcgtt caatttgtcc 2880 cgacgtcatc tatggttccc agccatggag agagaaattg ctcaatggga tgagcgtggc 2940 gtcgtaacgg cagttcctca tccccctggt gtaaaaacca tcaaaggaaa gtgggtctat 3000 gacttgaagg tcgatggcct agggagcctc attagacgtc gtgcccgtgg ggtagtaaaa 3060 ggtttttctc agaggcttgg ggagaattac tatgagtcct ttgccgcagt ggccaggtac 3120 gaatcagtcc gaatgttgtt tgcgctcatt gcgagtttga agctccattt ttggcttatc 3180 gacttcgttg gggcgtacct taactccaaa ccccagggaa ccaattacct cgaaattccc 3240 cagggcttcg aaaatcatta caaaatcccc aatatcgaca ctgttcttat catgaacttt 3300 actatatacg gcaccatgga cggcgccaac aactggtttc gtctactcaa caaaacattc 3360 accgaactcg gtcaccgcca gtctcgagca gacccttgca tacgcattca acatactact 3420 gaagggtata ccatctcgtc gacttacacc gatgacgtat ctgcaggatc ttcttcagtt 3480 gacgcaatgg atcgagcaaa atgagaactc gcggcaaaat tcgaaatcac cgatttaggc 3540 actcctaaca aagtacttgg catgtcgatc gtaaaccatg cctctggcga catatctatc 3600 catcaaaaac cgcttatcac gaaggcttta gttacattcg gaatggagaa cgctaatcca 3660 aagtataccc ctctccctcc gggcgttacc ctcgtcgaat ctcagcctat gcccatccct 3720 cctcacgact ccgaattcat gcgcgataag gattatagaa gcgcattagg gatgctaaac 3780 cacgttgcga 
atggtacaca gcccgacatc gcattctctg tcatggtact gatgcgctac 3840 gcatctgatc ctcgccctat gcattggaag cttgttcaac acctcctcgc ttacctgaaa 3900 acaaccagca acctcgttat cacataccga aaggacgggg ttatcaaacc cttcggctat 3960 tccgactctt cctatgcgga tgaatccgac agccgaaaat catctgctgg gttcgttttt 4020 atatcctctg gaggcccagt tagctggaaa gcaaaaactc aaaggtgtgt cagcacctcg 4080 actggagaag ctgaatatgt gggcattttc gaggctggga aacaagcaaa atggataagt 4140 gcttggttct tcgaagttga ccaattcttc gatctcccca tcaccgtcta ctgcgataac 4200 gacgccgccg tcgctctcac caaaaacttt ggaggtcata ccaaaataaa gcacgtcgat 4260 gtcaagaccc actggatccg tgaggccgtc aaccttgaag acatcatcgt tgtcccgatt 4320 gatacaactc aaaacgtcgc tgatatcttt acaaaatcac tttctcgacc aacgctggaa 4380 tacctcatca aacttatggg aatggaatac atcgatattt aggagctacg gtttcagggg 4440 gag 4443 // ID Gypsy-107_MLP-LTR repbase; DNA; FNG; 405 BP. XX AC AECX01000643; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-107_MLP_; KW Gypsy-107_MLP-I; Gypsy-107_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-405 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000643; Positions 953 1357. 
XX SQ Sequence 405 BP; 95 A; 96 C; 55 G; 159 T; 0 other; tgtaataact catgtcactc attattacag aagttcatta ctacttaaca tacttagact 60 taagacctaa atcttctata tctcttgttc ctagacgcca cctcaacctg actccttaag 120 gatctttgac tgatcacttg tcattgtctc tttggagtca ggtacgtctt tatttgtatt 180 tttcttttaa tctatctttt cactactcaa ggatctttga ctgatcactt gtcattgtct 240 ctttggagtc agttttcgta tttataaata taatacttcg gttctagaaa actcttccac 300 cactttgtgg tgttgattgc cctcgcgcct tagaagttat acaataactt ctccttcctt 360 ccgccttgcc cttttccttc agtgaggaca ttcgaagtcc ttaca 405 // ID Cnl1 repbase; DNA; FNG; 3376 BP. XX AC . XX DT 04-JUN-2009 (Rel. 14.06, Created) DT 04-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE C. neoformans non-LTR retrotransposon - consensus. XX KW CRE; Non-LTR Retrotransposon; Transposable Element; KW non-LTR retrotransposons; Cnl1. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-3376 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 70..3246 FT /product="Cnl1_1p" FT /note="contains the reverse transcriptase and FT restriction enzyme-like endonuclease domains." 
FT /translation="MSLQRAKNARGDPGRCNLCSADYRDLKDHLNKQHSTH FT FFVPSDLRGSSLVACPRCGTPCSAGTGLSRHQSRYCGLTAPRIRRNRVGNS FT TNTSRCPPSNTAASPIVPSPSPERPSPPQPAEVVASLEPLSEAEEVLEVAQ FT VDAETVDTLEGTRRAPESVPRSAEEGSTRVRELNMTAPEEEHRGEEESSHT FT NPTAPAGLENAVSSTLGPSPGTLPSLLPSQECANERFLYLAHLPVRSKPLP FT NNLVTDFMDAAERCALAYIAQPSDSTLLAFLALPKVGLTQALAPEQPLRPS FT TFLKQFPHIPWPEQPPARRPPSNIRPDTTKQVIKLVENGRLGAAERVLEED FT ASVAELDQGVIDQLITKHPKGPSCPFGNAVGPTPGKAPDIDTIQKALDSFK FT PDTAPGVSGWSVPLLKTAAKREPVKQFLQLLCAAIANNTAPGRSMLRTSRL FT IPLKKDDGSIRPIAVGELIYRLCAKALIISHFQPDFLLPFQLGVKSIGGVE FT PIVRLTERVLEGSAGAEFSFLASLDASNAFNRVDRAEMAAAVKTHAPTLWR FT TCKWAYGDSSDLVCGDKILQSSQGVRQGDPFGPLFFSITLRPTLNALSQSL FT GPSTQALAYLDDIYLFSNDSQVLSKTTQFLADKQHIIKLNEKKCKLISFDE FT IRQEGFKMLGTMVGGKEKRAEFLEGRIRKEMAKVGKLKDLPHQHALLLLRF FT CIQQNLRHLQRSLRSDDLVDLWERLDTMLWEEVKRMRMRQREDTAEEEALG FT RSLTKLPARLGGLGLLSFKDVAPLAYRSAAEASDTLLDNLGLLSSPEEPPT FT PIPQRTRCAELWESQQEAILHNLGDTERKRLTENASRLGRSWLSVIPYLQP FT LRLSNVEIASGLHDRTLVGSSIPVCRFCGSDSPLGHDELCRARNPWTQRRH FT NAINRVIYQHLKQIQGATVEIEPHTLSGQRRNDLRVRGSSALAFTDYDLKV FT YSLGDRDARSTVTPCAPNGKLADFCLDRCVNWLDKVGQVVSKNAPKVTGGV FT FKPIILSTGGLMSRSTADEWKDWRDAMPVGGFEKMEKRIGVELVKARARTL FT VL" XX SQ Sequence 3376 BP; 777 A; 1037 C; 846 G; 716 T; 0 other; ccctcttaat accccataac acataacaac cccctaatca acgttctctg caccttaaac 60 accaccaaca tgtccctgca gagggccaaa aacgcccgtg gagatcctgg tcggtgcaac 120 ctatgctctg ccgactatag ggacctcaaa gatcatctca ataaacaaca ttccacccat 180 ttcttcgtcc cctccgacct ccgtggctct tccctagtcg cttgccctcg ctgcggcacc 240 ccctgctcag ctggcactgg tttatctcgt caccagagcc ggtattgcgg tctcaccgct 300 cctcgaatcc gccgaaatcg cgtgggaaac tcaacaaaca catctcgctg ccctccctcc 360 aatactgcag cttcacccat cgttccttcg ccttccccag aacgcccaag cccccctcag 420 cctgctgaag ttgttgccag tctcgaacca ttgtctgaag ccgaggaggt gctggaggtc 480 gcccaggttg atgccgagac tgttgacacg ctggaaggga cccggagagc tccggaatcc 540 gttccgagat ctgccgagga aggtagcacg cgagttaggg agctaaacat gacagcgccg 600 gaggaggagc atcgtgggga ggaggagagt agtcatacca acccaactgc cccagcaggg 660 
ctcgagaacg cggtgagctc aacgctgggg ccttcccctg ggacgttgcc ttccttactt 720 ccgtcccaag agtgtgctaa cgaaagattc ctgtaccttg cgcacctgcc tgttcggagc 780 aagcctctgc ccaacaacct agttaccgac ttcatggacg ccgctgagcg ttgtgctctt 840 gcctacattg cacaaccctc ggactctaca ctgctggcat ttctcgccct tccaaaggtc 900 ggcctcaccc aggcgctcgc tccagaacag cccctcaggc cgtcaacctt ccttaagcag 960 ttcccgcata tcccctggcc agaacagcca cccgctcgtc gtcctcccag caatattcgt 1020 ccagacacca ccaaacaagt catcaaactc gttgagaatg ggcgcctagg tgcggcagag 1080 agggtgttgg aggaggatgc ttcagtagcc gaactcgatc aaggggtcat cgaccagctc 1140 atcaccaagc accccaaagg gccgtcttgt ccattcggca atgcagtggg tccaactcct 1200 ggtaaagctc ccgacatcga caccatccaa aaggccctcg actccttcaa gcccgacaca 1260 gcacccggcg ttagtggctg gtcagtccct ctcttgaaga cggctgccaa gagggagccg 1320 gtcaagcagt ttctccaact cctctgcgcc gccatcgcca acaacaccgc ccctggtcgc 1380 tctatgctcc gcacttctcg tctcatcccc ttgaagaagg acgatggctc tatccgacct 1440 atcgctgttg gtgaacttat ctatcggctg tgtgcgaaag ctctcatcat ctcgcatttc 1500 caacccgact tcctcctccc gttccagctc ggggtcaagt caatcggtgg tgtagagccg 1560 atcgtgaggc tgacagagag agtcttggag ggttctgccg gcgctgagtt ctccttttta 1620 gcctcgctcg atgcttctaa cgctttcaac cgtgtagata gggccgagat ggcagcagcg 1680 gtcaagaccc atgcgccgac gctttggagg acatgcaaat gggcctatgg cgactcgtcc 1740 gaccttgtgt gtggtgacaa aatccttcaa tcctctcaag gtgttcgaca gggtgacccc 1800 tttggccctc tcttcttctc gatcaccctc cgaccaacct tgaatgccct cagtcaatcg 1860 ctaggtccgt ctacgcaagc actcgcttac ctcgatgaca tctacctctt ctcaaacgac 1920 tcgcaagtcc tcagcaaaac tacccaattc ctcgccgaca agcagcacat catcaagctc 1980 aatgaaaaga aatgcaagtt aatcagcttc gatgagatca ggcaggaggg cttcaagatg 2040 ctagggacga tggtaggagg taaggagaag cgagcggagt ttctggaagg caggattcgg 2100 aaggaaatgg caaaggtggg caagctcaag gatcttccac atcaacacgc gctccttcta 2160 ttacgtttct gcattcagca aaatctacga cacctgcaga gaagtctgcg ctcggacgac 2220 cttgtagacc tatgggagag gctggacacg atgctatggg aggaggtgaa aaggatgagg 2280 atgaggcagc gggaggatac ggcggaagag gaggctctag ggagatcgtt gacgaagcta 2340 ccagcgcgac 
tgggcggact aggtctactt tccttcaaag atgtagcccc ccttgcttac 2400 cgctcggcag ccgaggcctc cgacactctc ctcgataacc taggtctcct ttcttcgcca 2460 gaggaacctc caactccgat cccccaacga actcgatgcg cagaactctg ggaatcgcaa 2520 caggaagcca tcctacataa cctcggcgac actgaacgca agcgactcac cgagaatgcc 2580 tccagactcg gccgaagttg gttatcagtt atcccttacc ttcaacccct gcgcctttcc 2640 aatgtcgaga ttgcctctgg tctccatgac cgcaccctgg tcggctcctc gatccctgtc 2700 tgtcgcttct gtgggtcgga ctcacctttg ggtcacgacg agctttgccg cgcccgcaac 2760 ccctggaccc agcgccggca caatgccatc aaccgcgtca tttatcaaca cctcaaacaa 2820 attcaaggtg ccacggttga gattgagccc cacacgctgt cgggacaaag gagaaacgac 2880 cttcgggtca gaggttccag cgctctggcc ttcactgact acgacctgaa ggtttactcc 2940 ctcggggacc gagacgcgag aagcaccgtc acaccctgcg cccccaacgg caagctggcc 3000 gacttctgct tggaccggtg cgtgaactgg ctcgacaagg tgggtcaggt cgtctctaag 3060 aacgctccga aggtcactgg tggggtcttt aaaccaatca tcctttccac tggtggcttg 3120 atgagcagga gcacagcaga cgaatggaag gactggaggg acgcgatgcc ggtggggggg 3180 ttcgagaaaa tggagaaacg gattggtgtc gagttagtaa aggcaagggc gaggacgctg 3240 gtcttatgag gaagaggagg ttggattatt ttttcttttc tttaataagt tgtttattta 3300 agtagtttct ttcattcggg caacccacac gacaacccaa taaattaaac aacgaaaaat 3360 gcaacctcta taaccc 3376 // ID LTR-1_AN repbase; DNA; FNG; 300 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 8.11, Last updated, Version 1) XX DE Long terminal repeat of a LTR retrotransposon - a consensus DE sequence. XX KW LTR Retrotransposon; Transposable Element; LTR-1_AN; solo LTR. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-300 RA Kapitonov V.V. and Jurka J.; RT "LTR-1_AN, a family of solo long terminal repeats in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(11), 192-192 (2003). XX DR [1] (Consensus) XX CC LTR retrotransposon. Solo LTR. CC 40 copies less than 1% divergent from the consensus; 5-bp TSDs. 
XX SQ Sequence 300 BP; 77 A; 65 C; 75 G; 83 T; 0 other; tgtcacaggc tatggcctgg atcttggttg tcggccatgc cctcaacctg gttctaaata 60 aggtttctgc aacatcagtg tacagcttcg gaaattgcgg cctcgaagct taggaaagga 120 gatccgtcct cataactttg gaaaagggat ccgtcggcat acaggtccgg gaagtcagaa 180 aggttgataa agggaggagg aagatatctg cgcttctatc ttttgtttct ttctctaagc 240 ttgtgatact cgtttataca ggacagccag ttgaaaataa tactgcctac gcccgttaca 300 // ID I-2_AO repbase; DNA; FNG; 2678 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE A family of I non-LTR retrotransposons. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-2_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-2678 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-2678 RA Kapitonov V.V. and Jurka J.; RT "I-2_AO, a family of I non-LTR retrotransposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 10-10 (2006). XX DR [2] (Consensus) XX CC This is a family of non-LTR retrotransposons that belong to the I CC clade. I-2_AO is 5' truncated, ORF2 is highly damaged by many CC mutations. I-2_AO is 64% identical to I-1_AO. The I-2_AO family CC is represented just by one copy. Most likely, I-2_AO was exposed CC to RIP-like heavy mutations (while I-1_AO is GC-rich, 58%, the CC I-2_AO GC content is only 32%). 
XX SQ Sequence 2678 BP; 1061 A; 472 C; 377 G; 768 T; 0 other; gtgacaacca cttatctaaa ttataccagc cagttattct acttaatact cttaataaaa 60 tcctagaatc aaccatagct acgcaaatta tatagatttt aaagaaatat aaattacttc 120 taaaaactta tctaaaaaag caaaaaagta tctctattaa ctatactatc taacttatac 180 tcgattatat atattaagta tagagataaa atagaaaaat aaatatagtt ctactagata 240 tatccagagc cttcaacaat atatcccaca ctcaactatt atttaatctc cgccagctaa 300 aactaggcca ctttacggac tggctacaat tctttctaac taattactca acctaaatct 360 ctctagcaga gaaacttagt ttaaagttct caacttcaac agatatcctg taaggcttat 420 tcttattacc aattctctat ctaatctaca atacttctct aatccaagat ctctatataa 480 aacaattaca ggggggctcg accgagatat atagatagat taataatatc tacgttctag 540 cctcatccaa atcatatata gaaaatatta aaatactaaa aacagccctg gtctaagtag 600 accaatagac tactcaacac gcttctaagt ttatactaga aaaatttaag cttatttatt 660 ttattaattc tagaagatta aaagcgaaag caataccgcc tctatctcca gatctatctc 720 cagacgaccc cgacagtatt taaaaaatac tacacccgcc aaccggccat aactagatag 780 atattcccta cgcagacaca actattaaac taatagaaat agctaaatat ctagaaatct 840 agcttaacaa gatcttattc tttactatac actatatcaa agcactagct aaagcacatg 900 gcacgctagc ggccctttaa gagatcgcta gatttatata aaatatcccc ctgcgtacta 960 tacgccggat ctatcaagta gtagtaattt ctcagctatt ctacagagcc atagtctagt 1020 ttaatccacg cagcggccag gtagtagctt taataaacca gaaaatactc acagaattta 1080 tataaatcta gaaacaggcc gcattactga tcagtaatat atttaaagac actataacta 1140 tagctttaaa tattaaacta tatatattat cggtatatct ctagctatag tagatcatcg 1200 agaaaacagt agttagaatc tggaccagac cggaactagc ctgttctaaa tcaatattaa 1260 aattatatac agcgcggaaa agacgctgca gagaataaat attaatagaa atacttaatt 1320 agaaagagag cccgctgtag tctctcgaag agaagaaata gaaaacccgc cagctatata 1380 ttatagctcc gtgggaatcc tctttaatag taataattaa tagccacgag gccgtatttt 1440 aattttataa aaaatattgc gttaggcgat agagaatcgc agtatatata gacaggagcg 1500 gtctaaacgg ctagattaaa gtaagtatag tctatctctt gtagagctgg aagtgaaact 1560 gtacactaga tatagaaaag aagtcaacag tctatataaa gaagctaatt agaatctaga 1620 tagctctata cagagtatag aaagaaacta 
gattagctat aatctttata aataattaaa 1680 ccgcgattca ggcgatctac aatccctaaa aattcttaaa gcaatatata ttaggcgaga 1740 tctactatat tatacaaaga tataatatat agagccaggt ctagatctac tagatccctg 1800 tatatatcgg cgtgccagag aacaagacaa tcaataaaac cacacgcgag aatacctaga 1860 aaatagaaga ggctatttat cttaccacga tagctaaaca gtagatacac taccgtatta 1920 aaaataaata gactagagaa tagaaaatag agaaaatagg ctatataata cacaaactag 1980 tcaagatcct aaataaaaga atactaaata tttataaagg cctatctaag ccgtacgtat 2040 cggttattat ttagatatag atatagagaa acggtttaaa atattttctt tttaaaatca 2100 agatatctaa ctcgaactaa tactactata gacagggatc ctagacatca cagtatattt 2160 tattttaata tccacttttt actaacccta gaaaggctat actagataag ctagatctta 2220 ggatttaaag aaagatagac tataatagga ttatatccca cctatagata atatactaca 2280 tcgctaaatt tatatattaa ataaaattac ttaattaatt taaaaacgta gagtaaacta 2340 gtcactataa aaaaaagata aatataaaag acaatacgta aacaactata ctaaaaagaa 2400 ctagattaaa cccttctagt acacgcacca acacctacct actaaaagat attaaaagaa 2460 gctagagaat ataggatagt agagactgag aaagatagat cagggagact gtcaggtagt 2520 agataatttt agttgtgata aaagctttcg attacttcct tatgggatac ggcattttat 2580 gattagattc taggctccca agcacctgca tagctagacg ttcaatagtt aggctcgaac 2640 aggttaatag aatcttgatt gattgattga ttgattga 2678 // ID MULE-2_Cglob repbase; DNA; FNG; 2499 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW MuDR; DNA transposon; Transposable Element; MULE-2_Cglob. XX OS Chaetomium globosum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Chaetomiaceae; OC Chaetomium. XX RN [1] RP 1-2499 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). 
XX DR [1] (Consensus) XX SQ Sequence 2499 BP; 565 A; 751 C; 679 G; 504 T; 0 other; ggggtccatt aaagagcccg caaattttat tagagggacc gctgcaagtg ggtctaagca 60 ccgaccacag tatcgatttc cgactgcatc cacgcattcg gtaccatgtc aaatcgtacc 120 ttttttcacg gcaaatcgag ctttttatct ttcaatcaac aacagcaacg gccggctgcc 180 tgcaccacca ccgccatgga tttctcggcc tcctacggct ctctcaagga ggcagaagcc 240 ggcctgaagg cgcaggccca cgccttgggc tttgatctag ccgtcaaaga gcgattccct 300 cgcggcgctg cggccgagga ggtgacccgg gtcaactacc ggtgcgccaa aggccggtcg 360 cacgcaccaa agaacgacga agttattcac accaccaaga gacgcaaaac gagctcccag 420 atgacggctt gcggttacaa gatcaacctc aagcgtgtgc ccggggccgg ctagaagctg 480 gatccagtgc gcacgcgtaa cggcaccgat cttgagcaca accacgacct cctcgatcca 540 tcgaccttct cctccttcag gcagcgtagc cttgcctcct acaaggccaa gatagttaca 600 gactggcgag caggtacgcg gccatcgcag atcatggcca atttgcgcga ggccggagac 660 gccgtggagt tcagctacca ggacctcgcc aacctgttgc atgcatgccg ccgggaagag 720 ctcaatggtc gtacgccgat ccagtggctc tacgaggtaa gcatcatgcg gctatccgat 780 ttcatggttt tcacggctaa ttaaaagggt atccaatagc aactcgatga cgagaccgag 840 tacttctacc gagatctacg cgacgagagt ggccgagtgc agtgcctatt tattgcgcct 900 agaagtgcgg tccctctctt ccgtaccgcc cccgacgtta tcgtcgccga ctgtacctac 960 aagactaacc gctttggcct ccctctcctt aacttttgtg gtatccaggc cttacgcaag 1020 tcgttctcga tcgcggctgt attcatcaat gcagagaagg aggaacagta tacgtgggcg 1080 ttgcaggcgc tgcgagagtt cttgaccgaa gaggacctcc ctctcccaaa gctgatcgtc 1140 accgaccgag agttggcctt aatcaacgcg cttaaacgcc atgaagcctt cacgttggtc 1200 cctcgactcc tctgcaggtg gcacgttaat atgaacgtgc tagctaaagg taagcggttc 1260 ttcccaccag ctacccgcct tcccgacggc tcgatcaagc ggaacgagag gtttacggca 1320 taccttaaag attggaacgc catacttacg tcggataccg aggagctctt cgagtcacgt 1380 atacagagct tcaaaagtgg gagataccct ataggagcgg tcaattacgc cgtgaaaacg 1440 tggctcgacc catacaaaga gctcctcgtc gatgcgtggg tcaacaaaat attgcacttc 1500 ggcaaccgca ctacctcgat cgttgagtca ctgcacgcgg gcatgaagag gtttatcagc 1560 agcgccggcg gcgatctagc cacagtcttc cggaagctga aggcatattg gcgcaatcag 1620 gcagcggata 
ttgccctcgc acgcaaccag gcgatgaaca aggtgccgtt cgggctcagc 1680 gacctactct acggcgacgt caaaagtgca gtagtgcctc atgccctacg cgcatgcgaa 1740 aaagaggtcg ccgctatcga gaagcagcct cgtgcaggcc gttgggatct aggaccgccg 1800 gagccttgca cttgcagtat cacgacctct cacggcctgc cctgccgcca cgcccttttc 1860 ctctgcctcc gaaactccga gccgctcagc atcgcccaat ttgatccgta ttggcgctgg 1920 gaccgcaccg cgataccttc actctcggca tcaccttcac gcgcccagag accgcttgat 1980 cccgctgtcg tgcgaggtaa aggtcgcccg cgcgggttgg tggccacaga caagtccacc 2040 aagcgactgc ctagcgccca cgagatcacg gaggccgagg agcgacgaga ggcactgccg 2100 ccgccttcca cggcaccagc gcgcctgaac aagccggctg taccggcgga tgatccctac 2160 gaggccggca ccgcgatgcc aaggcggtcc gggcagtgga tggctcggct cgagtctgtt 2220 gatgatcagg aggccgagtt tgatctgatt cctactaatt tcgagggtaa attcgatgaa 2280 aaggtggccg aggccgaggc gacggctcgg caggctacac aggatgccca ggaatcgacg 2340 gcgttctgat tatcacgaga tttggccgtt ggtgttgttt aacatagcgt cgagatcgga 2400 cgtccatttg gtagctgtac ttaggaaagt cgaacgagca ctgtggaccc acttgcagcg 2460 gtccctctaa taaaatttgc gggctcttta atggacccc 2499 // ID Gypsy-9_RO-I repbase; DNA; FNG; 5251 BP. XX AC AACW02000090; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_RO_; KW Gypsy-9_RO-LTR; Gypsy-9_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-5251 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000090; Positions 150143 155393. XX CC Positions [3747-4223] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 39..4925 FT /product="Gypsy-9_RO-I_1p" FT /translation="MSNNSAANLVSHLPIQNSNGTALEDVTMNYDGDSQSS FT ASPVLDDGQSSLLPDSNASLSSSAHHMKGIMSVLVEEIKSLTVELVTNPSN FT KSAQELDNIRHLLTQKARDLTALKSSHALIVSMIEDPVVSRTNPTTADQQR FT FFVPPNLPVFQWPGSTFDSNMPVLSDVKACLRKFENILKCHGLSLDQHWFR FT LILPCLSNDQQIWLEEIVGASRTPFNWAKIKEVFIGHYGSTMADEKTVCTS FT ELLGIYMYPKESIEAYIDRFNSLARRSGIIDKLVLTNKFVAGLPKELNQVV FT NVAICGASPEKKMCLNTIAAISRDLYNKLFRSSASSLSFGHSRRDSAAPAK FT ENLGVQGRTNMKKKYCNFHKTHGNHSTEECKGAKKAAAIKTARAKNQCFKC FT GVSPWSKAHVCKTGSAVGERAMRAMSVVDDASCKMDNLRLEENSRASGADL FT AGTSGTPSATTMAFATDSVPSSSSLGPMPFAKDQMEIDSEIDRLSKECKYN FT PQNFKSVPSNCIIVPITIEGVRTRAFVDSGSTFSCISPSFASSLGLKVSPA FT SGSITLGSKDSFANRIGTINKVNIVYNNIITSHNFEIFEFNSITPICIGFD FT LMPKLKIYMSGLASDWDSNNHPVIPDPIDPECIKPNQSPVGTDYERKIFFD FT TIQLNLDANAKIPLTSSCEIPEAVVNLDTVPGQKAYRAPYPIANAHIPTVQ FT KQVDDWLRDDVITIAPPNTEFNSPLLVVPKKGPNGEYGEEIRPCLDIRKLN FT SILTNIDRFPLPLISELHQKMGNAKFFTTLDLKQAFHRFPIRKEDQPKTSF FT TFNGIQYMFKKAPFGLAPVSALVQRTLTNLFADLPYVTIFIDDLTVFTDKD FT TNHHAECLNEVIKRLNKANLILNTKKSHYLQSCVTILGFTISSTGLSLDPA FT KVSNIHNWPIPRTGRDIQSFLGYANYFRASIPNFSKLTAPLDKLRTMSSIE FT KIWNDTHLKAFKRIQTALSNAPVLSSPNLNYPFYVGTDASDFSIGGLLYQI FT IDNTIYYISFVSRSLSKSERGYSTTKRELLAIVYCFKKFYKWLYGSHFFLY FT TDHQSLTYLHSQVNPNKMMLNWYESIFEMDFTVTHCKGIDNVICDALSRLF FT IDHLDNDLEGGKSANIIKSKKNNKKGKNKKISHLSKTDVSAEVAQKELLLR FT KIDVADFLTPPPEERHEILLKIHLNGHFGYKSIVDEVHSRQLHWNNLKDDA FT LDIVKNCPKCRLFNIGKHGYHPPRSILPDAPGDHFAVDLGSFNVTSAQGNN FT FFLVLVDLFSRFTVLRPLRDKSAITVAKELVDIFCLIGFPKIIQSDNGKEF FT IAEVIEQMIKYSGIEHRLSNPYNPLGNSVNERYVGLAKQIIVKRLDGVKNE FT WDLYLPSVQYAMNCKFARLHYSRPFCVMFNRQPNELIDYSNIQPILHNQTI FT DPQVLEDKLKDVMDIIIPGLREKISETQRNDNTKFMRKNRIIHDQYPLKSE FT VMIKNVNRTDKTSERYEGPYTIHGYTKNGSYILMDKTGALLPRNIPTSHIV FT LISSDNVEPDPNDKQWELQAIVDHRQGKHGYEYRVRWKGYPPSEDTWEPRD FT SFQSNVPILEYRARREADGNPIISDEATTNSKRKAHGNDEQNHSPKRRKNT FT SRRSKTRKSRN" XX SQ Sequence 5251 BP; 1572 A; 1014 C; 969 G; 1696 T; 0 other; ttttttttca atacacttaa actttgattt tatttaacat gtccaacaac tctgctgcta 60 acttggtttc 
tcacttacct attcaaaact ccaacggtac tgctctcgaa gatgttacca 120 tgaactacga tggtgattcc cagtcttctg cttctcctgt tctagatgat ggtcagtctt 180 ccttgttgcc ggatagtaac gctagcttgt ctagctcagc tcaccacatg aaaggcatca 240 tgtctgttct tgttgaggaa attaaatcct taacggtgga gctagttacc aatcctagca 300 acaagtctgc tcaggaactt gacaacatta ggcatctgct aacccagaag gcaagagatc 360 taactgcctt gaagagtagt catgctctga ttgtctcaat gatcgaagat cctgtggtat 420 ctcgtacaaa ccctaccact gctgatcaac aaaggttttt tgttccacct aaccttcctg 480 tattccaatg gcctggttcg acttttgatt ctaatatgcc agtactatct gatgtcaaag 540 cttgtctgag aaagtttgaa aacattctca aatgtcatgg tctcagcttg gatcaacatt 600 ggttccgctt gattttgcct tgtttatcga acgatcaaca aatatggttg gaagaaattg 660 tcggtgcttc tcgtactcct ttcaactggg caaagatcaa ggaagtcttc attggtcact 720 acggaagtac tatggctgat gaaaagactg tttgcacttc tgaacttttg ggtatttaca 780 tgtatcctaa agaatccatc gaagcctata ttgataggtt caactctctt gctcgccgtt 840 cgggcatcat tgataagttg gtgttgacta acaaatttgt ggctggtctt cccaaagaac 900 ttaatcaggt ggtgaacgtg gctatttgtg gtgcatcacc agaaaagaaa atgtgtttaa 960 acacaatcgc cgccatatca agagatttat ataacaaatt gttccgctcg tctgcttctt 1020 ctttgtcttt tggacattcg cgtcgtgatt ctgctgctcc tgcaaaggaa aaccttggtg 1080 ttcagggcag aaccaacatg aagaagaagt actgtaactt ccacaaaacc catggtaatc 1140 attctacaga agagtgcaaa ggtgcaaaaa aggccgccgc cataaaaact gctcgagcca 1200 aaaaccagtg ctttaaatgt ggtgtatctc cttggtctaa agcacacgtg tgcaaaactg 1260 gttctgctgt aggtgaaagg gccatgcggg ctatgtctgt ggttgatgat gcgtcctgca 1320 agatggacaa tctcaggctg gaagagaatt ctcgtgcttc tggtgctgat ttggctggta 1380 cctctggtac tccttctgcc actactatgg catttgctac tgatagtgtt ccttcttcgt 1440 cgtccttggg tcctatgccc tttgctaagg accagatgga aattgatagc gaaatcgatc 1500 gcctatcaaa ggaatgtaag tacaacccac aaaatttcaa aagtgtacct tctaattgta 1560 ttattgttcc aataactata gagggcgtta gaactcgtgc tttcgtggac tctgggtcta 1620 ctttttcttg tatctcacct tctttcgctt cttctcttgg tttgaaagtt tctcctgctt 1680 ctggttctat caccttggga agtaaagatt cttttgcaaa tcgtattggc actattaata 1740 aagttaatat tgtttacaat aatatcatta 
cgtctcataa ttttgaaatt tttgaattta 1800 attctatcac tcctatttgt atcggttttg atttaatgcc taaattaaaa atttatatgt 1860 caggacttgc gtctgattgg gattctaata atcatccggt tatacctgat cctattgacc 1920 ctgaatgcat taagccaaat caatctccgg ttggcactga ttacgaacga aaaatattct 1980 ttgatacgat tcagttgaat ctagatgcta atgctaagat tcccttgaca tctagttgtg 2040 aaatccctga agccgttgta aatttggata ctgtgcctgg tcaaaaagca tacagagcac 2100 catatcccat agccaacgca catattccaa ctgttcaaaa gcaagttgat gattggttac 2160 gtgacgatgt tattacaatt gcacctccta acactgaatt caattctccc cttcttgttg 2220 tccctaagaa aggtcctaac ggtgaatatg gtgaagagat tcggccatgc cttgacatta 2280 gaaagctgaa ttctattctc acaaatattg atcgttttcc tctaccattg atttctgaac 2340 tacatcagaa aatgggaaat gccaaatttt ttactacttt ggatttgaaa caagcctttc 2400 atagatttcc tatccgtaaa gaagatcaac caaagacttc gtttacattt aatggtatac 2460 aatatatgtt caagaaagcg ccttttggtc ttgctccagt tagtgcttta gttcaacgta 2520 ctttgactaa tctttttgct gatctgccat atgttacgat atttattgat gatttaactg 2580 tctttactga taaagataca aaccatcatg cagaatgtct caatgaggtt ataaaacgtt 2640 tgaataaggc aaatctgatt ttgaacacta agaaatcaca ttatttacag agctgcgtta 2700 cgattcttgg ttttaccatc agttccactg gccttagttt agatcctgcc aaagtttcta 2760 atattcataa ttggcctatt ccacgtactg gtcgtgatat ccaatctttt cttgggtatg 2820 ccaattactt ccgtgcaagt atccctaatt tctccaaatt gactgctcca ttagacaaac 2880 ttcgtactat gagttcaata gagaaaattt ggaatgatac tcatcttaaa gcattcaaaa 2940 gaatccaaac cgctttgtcg aatgcacctg tgttaagttc gccaaactta aactatccgt 3000 tttatgttgg cactgatgca agtgattttt ctatcggtgg attgctttat caaataatcg 3060 ataacaccat ttattatatt agctttgttt ctcgatcatt gtctaaatcg gaacgaggtt 3120 actccactac caaacgtgaa ctactggcaa ttgtatattg tttcaaaaag ttttacaagt 3180 ggttgtatgg ctctcatttc ttcctttata ctgatcatca gagtttaaca tatttacatt 3240 ctcaagtgaa tcccaacaaa atgatgttaa actggtatga atctattttt gaaatggatt 3300 ttaccgtcac tcattgtaaa ggtatagaca acgttatttg tgacgcttta agcagattgt 3360 ttattgacca tttagacaac gatctggagg gaggtaaatc tgccaacata ataaaatcta 3420 agaaaaataa taagaaagga aagaacaaga aaatttccca 
cctttctaag acggacgtaa 3480 gtgctgaggt cgcacaaaaa gaattactac ttcgtaagat cgatgtcgct gatttcctta 3540 ctccaccacc tgaagaacgt catgagatat tacttaaaat acatctcaat ggacatttcg 3600 gctataaaag tattgtagac gaggttcata gtaggcaatt acattggaat aatctcaagg 3660 atgatgcctt agatattgtt aaaaattgtc ctaaatgccg attgtttaat attggtaaac 3720 atggatatca tccacctcga agcatacttc ctgatgctcc tggagatcac tttgcggttg 3780 atctcggatc attcaatgtc acctctgctc aaggcaataa tttcttcctt gtactggtag 3840 atcttttttc aaggtttaca gtattaagac ctttgcgtga taaaagtgcg attacagttg 3900 ccaaagaact tgtagatatt ttttgtctaa tcggctttcc aaaaatcata caaagtgata 3960 atggtaaaga attcattgct gaagtcatag aacaaatgat taaatattcg ggtattgaac 4020 acagattatc caatccctac aatccacttg gtaatagtgt gaatgaaaga tacgtaggtc 4080 ttgcaaaaca aattattgta aagcgattgg atggtgttaa gaacgaatgg gacctttact 4140 tgcctagtgt acagtatgct atgaattgca aatttgcaag attgcattat tcgcgtccat 4200 tctgtgtcat gtttaataga cagcccaatg aactgattga ttatagtaat attcaaccta 4260 tattacataa ccaaaccatt gacccgcagg ttcttgaaga caaactgaaa gatgttatgg 4320 atataattat tcctggtctt cgtgaaaaga ttagtgaaac acaaaggaat gacaatacca 4380 aattcatgag aaagaatcgt atcatacatg atcaatatcc tcttaagagc gaagtaatga 4440 ttaaaaacgt caatcgtaca gataaaacga gtgaacggta tgaaggtccg tatactattc 4500 atggctatac caagaatggt agttacatac tcatggataa aactggtgct ctcttaccga 4560 gaaatattcc tacttcacat atagtactca tatcatcaga taatgttgaa cctgatccca 4620 atgataagca atgggaacta caagcaattg tggaccatag acagggtaag catggctatg 4680 agtatagagt caggtggaag ggttatcctc ctagtgaaga tacttgggaa cctcgtgaca 4740 gtttccaaag taatgttcct atcctagaat atcgtgcacg tcgtgaggcc gatggaaacc 4800 ctataattag cgatgaggct actacaaata gtaagcgaaa agcacatggt aatgatgaac 4860 aaaatcattc acctaaaaga aggaaaaaca cttcacgtcg atcaaagact agaaagtctc 4920 gcaattgatt tcttttgtcc cgtatatttt tacctcaaag ggttttttag ggttttcttc 4980 gaaatcaatt acgcgattgg acaactataa gctttttcat tttttgctta tacaatagac 5040 atttttttat ttttttttca agttattgtt cacaggataa tcacatctga ttccatatat 5100 tgctataaca ttatcttcgc acgcactctt tttcgcatgc tttgcaagct 
ggaggaggcg 5160 catgttgtgc tccaaagata atcattgtat agcaatcaag cattattaac gattgatagt 5220 gtgagtatat ggaaatgtag ttttattaat a 5251 // ID Gypsy-68_MLP-LTR repbase; DNA; FNG; 188 BP. XX AC AECX01001167; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-68_MLP_; KW Gypsy-68_MLP-I; Gypsy-68_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-188 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001167; Positions 15138 14951. XX SQ Sequence 188 BP; 63 A; 51 C; 30 G; 44 T; 0 other; tgttataact attacacacg tgaaaggcca gataaatgtc acagatagaa gacttacgtt 60 tgtacaaacc cccatgttgt atctctctct tttccttatg caacaatcat catagatcag 120 cgtgaagata gcagaccaag accaccctcc actcccaaca gacaccaccc tgagagaggg 180 tcataaca 188 // ID Gypsy-93_MLP-LTR repbase; DNA; FNG; 171 BP. XX AC AECX01000333; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-93_MLP_; KW Gypsy-93_MLP-I; Gypsy-93_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-171 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000333; Positions 98010 97840. 
XX SQ Sequence 171 BP; 42 A; 47 C; 27 G; 55 T; 0 other; tgttacgacc catgtagcgc tacgagtact tagacactta tgtacttagc agttcttatt 60 atacttggct cagcgcacag atagtgcttc ctcacttagc aatctaatca tcagtaccag 120 agtaccttct gagccttctc ttgcatcctt cagtcctctc aagtcatatc a 171 // ID Gypsy-17_MLP-I repbase; DNA; FNG; 7437 BP. XX AC AECX01002074; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_MLP_; KW Gypsy-17_MLP-LTR; Gypsy-17_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-7437 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002074; Positions 44905 37469. XX CC Positions [6275-6787] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 947..7405 FT /product="Gypsy-17_MLP-I_1p" FT /translation="MPTTRSNASRSSSVAGSEHSVGVTTRLRSSHISAEPP FT INIDPSPGTNSRSSSYLAAARSELRSVGESPRSKRRESIQTHDHHGQRPAS FT QIDQTDGLPTSARSHERMESSRMGLDHIRKEDSGNEGQPSQQQAQTVLKTF FT RECSRSGSSGSIRSSEELHESLERMILPLDRELLAPFHSNLSDSISPLVQS FT LFKLEKEIKTIKNCVRTSRATNERIENIDGMLGGLSENINSQIVSLNKVSD FT DIEKAIKITDSAAVQINSWLETMRLKMNSKYEQTEDLICKGHIAFHEANQK FT IRSTMISDMHGLFENQMTDIKSLWATHKQQPISIENETTPISITNEFMQLQ FT NLVIKHREDQLEEQGLLRSQNEQIIGLLADLVLVKDKASPMLPLAETVETK FT NLRDLPPHMNKAEESSKAGEPIASSTPHNAEYTAEPSPSARDIAPLAFKNE FT SIEDIIQQKSEYTNNAASDELILYKALKNEVPASRNWPKFSGEGEYNHVDF FT ISWIDRVKEDMHVPDALITSRVSQAMTGLAQLWFLEKIKEGKLSWEEWKEA FT INAKFGTRKWRRDMQSAFDKDHFSTKNQEPVKWLVVQRKRLEAAQPGIDLE FT DTIDKILEQCPGDLDHAVRSRMLTLSDFVEFTTTFKEVVNRTTIGKNTSQR FT IARNLDWKTQATSGTTTPDPKATTKPTTVRIPGKCDTCGSTEPKHDYRSCR FT RKGKAINMVEQEDITPIDSDHEDVTFYSDAESLSDNDEIVTMIQDGHETLD FT IACIDEIQIQAPKFLRRKTCDTRLGCPWLTIICNTWEFNILVDTGASSSMI FT TPCILNRIWPSWKFDMKPTPPKRYWTPTGKLEAIGEVRLPIQFVHPTASCV FT MMADFVIMNNTTTPQIILGSDMMQLYGMKVSYSDTIEITFEDSEAIFQPKN FT ESEIQGKKFPGEHILAIESNATELDISEVQAETELPQEFSKNLKNHIGDAR FT QLKTRPSAGKAHMIGQHSITTVLINNQEVSLLLDSGASCSIVGARYLSTII FT GDWKNRMLPKSSLTFKGCGEALTPLGVIEFPIIFPHTEGSIRIRAEFVIIE FT NATPTYFILGDDFLSLYGIDIMHTREKYFTIGNDNKKKKFALQSTKRILAV FT QTPHPKDTTTNNILRRRIVSGDNLTNDQKEQIHSIVDKYHTTFGLGEKKIG FT TIRKYEVEISLTVEKPYPPILRKAAYPASPRNRVELETHIDELVKLGILRK FT VQQNEDVEITTPCVVAWHGGKSRVCGDFRALNAYTVPDRYPMPRIEQVITN FT LGKSVYITTMDLMKGFHQNVVSKDSRKFLRIILHLGIFEYQRMPFGIKNAP FT AFFQKMMDTEFYEELRRGNVIVYIDDIIIFHESWEEHVKTLTSILTKLENL FT GATISLDKCKFGFHEVKALGHIVSGLTLAIDQNKVAAVLHRQLPKTVKDVQ FT SFLGFSSYYRMYLENYAKVTFPLYYLLKKGVDFSITDERAKAWNNVKNLLT FT SAPCLLQPDFGKPFILYVDASFSGLGAALHQKQVVKEKLVNGPVCFISRQL FT KESEKRYGAPQLECLALVWALNKLHYYLDGIYFEVITDCQAIKSLMNTKTP FT TRHMLRWQLAIQEYKSYMTITHRPGILHSNADALSRMAMPNDENNPAWEAE FT DTDRDVLIMGISLCELSDEFFENVENSYLKNPNTSTLLKIFSTEIKDVAME FT STLETQWKEPFQEGKFSLISGILYHREKHTNVLVICDRADIEKILAVCHDD FT 
FMSGHLSEDRVVQRVSSTAWWPDWKNEVHDYVESCERCQKANRQTGKRFGL FT LQRIEEPKFPWEVINMDFVTGLPPAGNDNVNSVLVTVDRFSRRTRFLPCHK FT EIDAMGTALLFWQTIITDCGLPRIIISDRDPKFTSEFWKGLTKLMGTTLAM FT STSYHPQTDGLAEHMIQSLEDMLRRYCAFGLTFKDRDGYTHDWKTLLPALE FT LAYNTSIHSTTNKTPFELERGYNPRTPKDMVKDRDINIHPTALSFHDMLSK FT ARTYAKECIDEAVKYNKNRWDKTHKEPEFQIGDKVLISTINFNNIQGPRKF FT KDSFVGPFTVVKLHGPNAVEVMLTGDFARKHPTFPVSLLKHFKETDRAKFP FT NRKSPDEIIPFEEDTPKTVHKVLEHKRIKLSGKDVRLYLVRYKGRGADSDE FT WLSEDKIANSQQVLRKYRAEKKNSQ" XX SQ Sequence 7437 BP; 2656 A; 1603 C; 1501 G; 1677 T; 0 other; attgggggcc tcctctcact ttcagatttt ttttttttac gaaaaaacat ctttgatccc 60 gaaccttttt ttactttatc atcaaatcta caataagcca aaaaacccat aaaaaacatt 120 tttttttact aaagatatct ttctatctca tctaaaagct tcgtcaggaa tcatttattc 180 ttcaaatttt aaccagttat ctttagactt gttcggagaa ctcccttatt tccaatttga 240 aaacaagaat ccgtacaccg acaaaagcta caatcagtgc tttggaatac acgcttgaaa 300 tcccgacaac gcctggttca gattttcaaa tctcctcgac gattcatcca gtgaactttt 360 tctaagcgac ctaatcttaa cagacatcta agaaagaaga cagttttttt tttttttttt 420 taacctacgg actagcagta agtattacaa atcctgtcct atagcatacc tctctagatc 480 aactcataac cagtctactg taactgactc ataaaactat cttagtgtcg aacgtacaaa 540 gtttaccagc gtaccgtagt accttcagcg gaagagaaga tagccttact ccccaaagcc 600 aaatcgaagg cgaaaagagt ttagaatcca ccagccctct caccaacgaa agctttgaag 660 aagatatagg aacgaccaaa gaccagcaag gacagactac tccaaagcag acttcagaca 720 cacaagccac tgccatcatt attgacagtc cagttaacaa aactcgcaac cgacctccta 780 acattgatct cagtgttagc ccagcggcac cagctacgat tagcctattt gagagattca 840 cgtcaatacc tggttactcc agcccgagaa atagaatacc ccgttagtaa attttctttt 900 tgagaagtac tgcatccaga ccgtataaac ttacaagagt gaattcatgc ctaccacaag 960 aagcaacgcc tcgcggtctt ccagcgtcgc aggatccgaa cactctgtcg gcgttacgac 1020 tagactccgc tccagccata tcagtgcaga acctccaatc aacatcgacc catctcccgg 1080 aaccaacagt cgaagttcga gctatcttgc agcagcacga agcgaattac ggtctgtggg 1140 tgaaagcccg agaagcaaaa gacgcgaaag cattcaaaca cacgaccatc atggacaaag 1200 acctgcatcg caaattgatc aaacagatgg gctaccaacc tctgctagat 
cacacgaaag 1260 gatggaatcc agtagaatgg gattggacca cattcgaaaa gaagatagtg gaaacgaggg 1320 acagccgtcg caacaacaag cgcaaaccgt cctcaagacc ttcagggagt gtagccgtag 1380 cggatcttca ggaagtatta ggtctagtga agagctacac gagtcgttag aacgcatgat 1440 actcccctta gatagggaac tgctggcccc atttcactca aatttgtctg attcaatttc 1500 tcctcttgta caaagtttgt ttaaattaga aaaagaaata aaaaccataa aaaattgtgt 1560 ccgtacatcc agagccacaa atgagagaat tgagaacatc gatgggatgt tgggaggact 1620 ctccgagaat attaattctc agatcgtgtc actaaacaaa gtaagtgacg atattgagaa 1680 agctatcaag ataacagaca gtgcagcggt ccagataaat tcctggttag aaaccatgag 1740 attgaaaatg aacagtaagt atgaacagac ggaagacctg atatgcaaag gacatattgc 1800 gttccatgaa gcaaatcaaa aaattcgttc aaccatgata tcagatatgc atggactttt 1860 cgagaaccag atgacagata taaaaagtct gtgggcaaca cacaaacaac aaccaatcag 1920 tattgagaac gaaaccaccc caatctccat aacaaacgaa ttcatgcagc tacagaactt 1980 agtaattaaa cacagagaag accagctaga agagcaaggg ttactacgat ctcaaaatga 2040 gcagatcata ggacttttag cggatctagt tctcgtgaaa gataaggcat cccctatgtt 2100 acctttagca gaaacagtgg agactaagaa cctcagagac ttaccgcccc acatgaacaa 2160 agcagaagaa tcatcgaaag caggagaacc aattgcttca tcaaccccac acaatgctga 2220 gtacaccgca gaaccttcac cttcagctag agatatagct ccgttagcat ttaaaaacga 2280 gtctatagaa gacattatcc aacagaagtc agaatacact aacaacgcag catcagatga 2340 gcttatactc tataaagcac ttaagaacga agtaccagct agcaggaact ggccaaagtt 2400 tagtggcgaa ggtgagtaca accacgttga ttttatcagt tggatagaca gagtgaaaga 2460 agatatgcac gtacctgacg cattgatcac atcgagagtc agccaagcaa tgacagggct 2520 agcacagttg tggtttctcg agaaaataaa agaaggcaaa ttgtcttggg aagaatggaa 2580 ggaagcaata aacgcgaaat ttggcactag gaagtggaga agagacatgc aatcagcttt 2640 cgacaaggac catttttcta caaaaaacca agaaccagtg aaatggttag ttgttcaaag 2700 gaagaggttg gaagcagcac aaccggggat tgatcttgag gataccattg ataagattct 2760 ggaacaatgc ccaggcgatc tagaccacgc agtcaggagt agaatgttga ctctatcaga 2820 ctttgtcgaa tttacaacca cgttcaaaga agtggttaac agaacgacaa taggtaaaaa 2880 cactagtcag cgtattgcca ggaacttgga ctggaagact caggcaacaa gcggcactac 
2940 gacacctgac ccaaaagcga caaccaaacc aacaacggtc cgtataccag ggaagtgcga 3000 cacatgcggt agcacagaac ccaaacatga ctataggtcg tgcagaagga aaggcaaagc 3060 cattaacatg gttgaacaag aggatattac gcccatagat tcagatcacg aagatgtaac 3120 gttctatagc gatgcagaga gcctttcaga caacgacgaa atagtgacaa tgatacaaga 3180 tggacacgaa acactagaca tagcttgtat agacgagata cagattcaag cacccaagtt 3240 cctacgcaga aagacttgtg acacaagact cgggtgtcct tggttaacaa ttatctgtaa 3300 tacttgggaa ttcaacatcc tcgtagatac aggagcgtca agctcaatga taacaccatg 3360 tatcctaaac aggatatggc catcatggaa atttgatatg aaacccacac cgcccaaacg 3420 atactggaca ccaacaggaa aattggaagc aataggggag gtgagactac ccatacaatt 3480 tgtccaccca acagcgtcat gtgtgatgat ggcagacttt gtgattatga acaacacaac 3540 aaccccacaa attatcctag ggtctgacat gatgcaatta tatggaatga aagtttctta 3600 ttcagacaca atcgaaatta cctttgaaga ctctgaagca atttttcaac cgaaaaatga 3660 atctgagatc cagggtaaga agttcccagg ggagcacata ctagcaatag agtctaatgc 3720 tacagaatta gatatatcag aggttcaagc agaaacggaa ttgccacaag aattttcaaa 3780 aaatttgaaa aatcacattg gagacgcaag acaactgaag acaagacctt cggctggtaa 3840 agcgcacatg ataggtcaac atagcattac gaccgtgcta ataaacaatc aagaggtaag 3900 cctgctacta gatagtggcg cgtcgtgttc gatagtagga gcacggtacc tatctacaat 3960 aataggagac tggaagaaca gaatgttacc aaaaagcagt ctgaccttca aaggatgtgg 4020 agaagcgcta actccgctag gagtcataga atttccaatc atattcccgc atacagaggg 4080 atccatacgt attcgcgcgg aattcgtaat aatagagaac gcgaccccta catattttat 4140 actaggcgac gattttctat ccctatatgg tatagacatt atgcataccc gcgaaaagta 4200 ctttactata ggtaatgaca acaaaaagaa gaagttcgca ctacagagca caaagcgaat 4260 tctggcagtc caaactccgc atccaaaaga caccacaaca aataacatat taagaagaag 4320 gattgtttca ggcgacaacc tgacgaatga ccagaaagaa cagatacatt caatagtgga 4380 taaataccac actacttttg gtctgggcga aaagaaaata ggtactatac gaaaatatga 4440 ggtagaaata tctctgacag tagagaaacc atatcccccg atcttaagaa aagccgcata 4500 tccagctagc ccaagaaacc gggtggaatt agaaactcat atagacgaat tggttaaatt 4560 aggcatcctc aggaaagtcc agcagaacga ggacgtcgag attaccacac catgcgtggt 4620 
ggcatggcac ggaggtaagt ctagagtatg tggagatttc agggcgctga atgcctacac 4680 agtacctgac agatatccca tgccacggat tgaacaagta attacaaatt taggaaaatc 4740 agtctatatt acaacaatgg acttgatgaa agggttccac caaaatgtag tatcaaaaga 4800 cagcagaaaa ttcctgagaa tcatcttaca tttaggaata ttcgagtatc agagaatgcc 4860 ttttggcatc aaaaatgccc cagcattttt tcaaaagatg atggacactg aattctacga 4920 agaactacgc agaggaaatg tcatcgtata catagatgat attattatat ttcacgagtc 4980 atgggaagaa cacgtcaaaa cactaacatc aatactaaca aaattagaaa acttaggagc 5040 cacgatctca ctagacaaat gcaaatttgg atttcacgaa gttaaggcac taggacatat 5100 agtgtcgggt cttactctag caatagatca gaataaggtc gcagcagtac tccacaggca 5160 gttacccaaa acagttaaag acgtacaaag tttcttaggg ttttccagct actacagaat 5220 gtacctagaa aattatgcca aagtcacgtt cccgctttac tatttactta agaaaggagt 5280 agacttctcg attacagatg agagagctaa agcatggaat aatgtgaaaa acttgctcac 5340 atcagcgcca tgcttattgc aaccagattt tggtaagcca tttatcctat acgtcgacgc 5400 aagcttttca ggacttggag cggccctaca ccaaaaacaa gtagtcaaag agaaacttgt 5460 taacggaccg gtatgtttca tttctcgaca acttaaagag agtgagaaaa gatacggagc 5520 accccaacta gaatgcctag ctctagtatg ggctctgaat aaacttcact attacctgga 5580 cgggatttac ttcgaagtca ttacagactg ccaagcaatc aaatcgctaa tgaatacaaa 5640 aactccgacc agacacatgc tacgatggca attagcaatt caagagtata agtcgtacat 5700 gacaattaca catcgaccag gtatactaca cagtaacgct gacgcattaa gcagaatggc 5760 aatgcccaac gacgagaaca acccagcttg ggaagccgaa gacacagaca gagacgtact 5820 gatcatgggt atcagtctat gcgaactatc cgacgaattc ttcgaaaacg tagagaatag 5880 ttatttgaaa aacccaaaca cttctacact tttaaaaata ttcagcacag aaataaaaga 5940 cgtagccatg gaaagtactt tagaaacgca atggaaagaa ccattccagg aaggaaagtt 6000 ctccctaatc tcaggaatac tctatcatag agaaaaacac acgaacgtgc tagtcatatg 6060 cgacagagca gatatcgaga aaattctggc agtttgtcac gacgatttca tgtctggaca 6120 tctgagcgaa gacagagtag ttcagagagt aagctccaca gcttggtggc cggactggaa 6180 aaacgaagta cacgattacg tagagtcatg tgaaagatgt caaaaggcta acagacagac 6240 ggggaagcga tttgggctac ttcagagaat agaagaacca aaattcccgt gggaggtaat 6300 caacatggac 
tttgtcacag gactacctcc cgcaggcaat gataatgtaa atagtgtact 6360 agtaacagta gacagatttt cgcgtcgaac acggttttta ccatgtcaca aggaaatcga 6420 cgctatggga acagcgttat tattctggca aaccatcatt accgactgtg ggcttcccag 6480 aatcattata agtgacaggg acccaaaatt cacctctgaa ttctggaaag gactcacaaa 6540 gttgatggga acgacgctag caatgtccac ctcataccac cctcagactg atggtttagc 6600 ggagcatatg atacaaagcc tcgaagacat gctcagacga tactgcgctt ttggattgac 6660 atttaaagat agagatggct acactcacga ctggaaaaca ttgttaccag cgttagaatt 6720 agcatacaat actagtatcc atagcactac aaataaaacg ccgtttgaat tagaaagagg 6780 ctacaacccc cggacaccaa aagatatggt caaagacagg gatattaata ttcatccgac 6840 agcattaagc ttccatgaca tgctctcaaa agctagaaca tacgcaaaag aatgtataga 6900 cgaagcagtc aagtacaaca agaatcgttg ggacaaaaca cacaaagaac cagagtttca 6960 aataggagac aaagtactaa tatcgactat caactttaat aacatacaag ggccaaggaa 7020 atttaaagac tcctttgtag gtccttttac agtagtaaaa ttacacggcc caaacgcagt 7080 tgaagtaatg ctaaccggag actttgccag gaaacaccct acattcccag tgtccctttt 7140 gaaacacttc aaagaaacag atcgggcgaa atttcctaat agaaagtcac cggacgaaat 7200 cattccattc gaagaagaca cacctaaaac cgtccacaaa gttttagaac ataaaagaat 7260 taaactctcc ggcaaagacg tgagattata tctagtaaga tataaaggta gaggcgctga 7320 tagcgatgaa tggctatctg aagataaaat agcgaactcg cagcaagtac tgagaaagta 7380 tagagcagag aagaaaaact ctcaatgaat ttttcttctc tcccccaccc cggagag 7437 // ID Copia-1_BDJ-I repbase; DNA; FNG; 2614 BP. XX AC AATT01000134; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Batrachochytrium dendrobatidis DE genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_BDJ_; KW Copia-1_BDJ-LTR; Copia-1_BDJ-I. XX OS Batrachochytrium dendrobatidis OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; Chytridiales; OC Chytridiales incertae sedis; Batrachochytrium. XX RN [1] RP 1-2614 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Batrachochytrium dendrobatidis RT genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; AATT01000134; Positions 108484 105871. XX CC 'AACAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 5..2614 FT /product="Copia-1_BDJ-I_1p" FT /translation="MSPAGSHAVKITDGLYIPKLQNAEVALHSSFGQRKPP FT RAYYNNQSHFKGTDNKGNYSNDSNQDVHYNNQPHLRGFTNGDHSNDSNQNV FT QNSNNSSYWPNGKKKILCRNCGKLGSHLAKDCRARSSNNSQSNSNYASTSC FT ANLNANNDFSFHNTTTSPKTDYCSISRNSVNEIILDSGATTHMVCNKNWLS FT NTVPLQNHSVKGSMKTATNAECKGSLQLKVWNGKASRVITINNVLYVPGFE FT VNLISPGKLAVEFGCWTKLNKQGAVLYTKDNVEVLRAHKVGTLDIIKCEIL FT IPNKQIHASVRKSSLSGIKKPKANIKVVAESTQFISNEAINASSPRIQELS FT QPDSEVEDTPLKSKTQGPYYYYKRQPDFDYMNSSLVSANISNSTFPSNWKQ FT AMTAPDANLWKIAADKETESMAPLFTHSYNLGGGITPSVPIVIDNVVSPET FT PEIPYRASTLTGGESKSGRRQVPIVVQTARNTARMTVADRGTNIKTSHGRI FT TQEGLLGRAISKNKAIKTSLNKSSLAGYRKPNRKSRSKSQGSRKEVQERPV FT DVMNRDRATMAAPIKSTVTVEYPGAISETADASEPSIQAAVEQRMVENLPT FT NTSQEEHYLKNVLKRFYMLDCKPVATPIESGTNLTASLPTEPVTDAPYREA FT VGALMYAMVATRPDLGAAIGQVSRFMHHPNDTHWTAVKRILRYVKYSLNYS FT LTLGGDNTTLVGYCDADWAGDVDSRKSTSGYTCFLGNSCISWRSTKQTSVA FT ISTMEAEYAAAVTATQELLFLRNMLNELGFTQERATILYSDSQSAIANTEN FT QAPNHATTKHMDVKLKFLRDQVSQKNILVMYIRTQDQIADIMTKGLPRIAF FT AQFSDLLGLRAVQGEEG" XX SQ Sequence 2614 BP; 878 A; 606 C; 529 G; 601 T; 0 other; ggttatgagc ccagccggaa gccacgcagt aaagatcaca gacggtctat acataccaaa 60 actacaaaac gccgaagtcg cgttacacag ctcattcgga caaagaaagc caccaagagc 120 atactacaat aatcagtcgc atttcaaggg cactgataac aaaggaaatt actcgaacga 180 ttctaatcag gatgttcact acaataatca gccacatctc aggggcttta ccaacggaga 240 tcactcgaac gattctaatc agaatgttca aaactccaat aattcatctt actggccaaa 300 cggcaagaag aagatattgt gtcgcaattg tggaaaactt ggatctcatt tagcaaaaga 360 ctgtcgagct cggtcctcaa ataacagtca atcaaacagt aactacgcat ctacgtcatg 420 cgctaattta aacgctaata atgatttctc gtttcacaat accacaactt caccaaaaac 480 tgactactgc agtatcagca ggaacagtgt caacgagatc atccttgact cgggtgctac 540 cactcatatg 
gtatgcaaca agaactggct gtctaataca gtaccgttgc aaaatcacag 600 tgtcaaaggt tcgatgaaaa cggctactaa tgccgaatgc aaagggtcgc ttcaactcaa 660 agtctggaat ggaaaagcca gtcgcgttat tactatcaat aacgtgctat atgttcctgg 720 attcgaggtc aatctaatct ctccaggaaa actagctgtt gagttcggat gctggactaa 780 gctcaataaa caaggagctg tcctctatac caaggacaat gtcgaggtct tacgtgcaca 840 caaagtcggc acactagaca ttatcaaatg cgaaattctc attccgaaca aacaaattca 900 cgcctccgta cgtaaatcaa gcctaagcgg tatcaagaaa ccaaaggcaa atataaaagt 960 cgttgcagag tctacgcaat ttatttctaa tgaagcaatc aacgcttcaa gtccccggat 1020 tcaggaacta tctcaacccg attccgaagt ggaagacaca ccacttaaaa gtaaaactca 1080 gggaccatat tattattaca aaaggcaacc agactttgat tacatgaact caagtctcgt 1140 ttctgccaat atatctaact caacatttcc atcgaattgg aaacaagcca tgactgcccc 1200 ggatgcaaat ctctggaaga tcgcagcaga taaagaaaca gaatcaatgg caccattatt 1260 tactcatagc tacaatctag gcggtggtat tacaccgtct gtacctattg tgatagataa 1320 tgtggtaagt ccagaaacac cagaaattcc atatcgcgcg agtacgctaa ctggagggga 1380 atcaaaatca ggtcggcgac aagtaccaat agtagtacag actgcaagaa atacagcaag 1440 aatgactgta gcagatcgcg ggaccaatat taaaacaagc catggacgca taactcagga 1500 ggggttatta gggcgtgcca tcagcaaaaa taaggctata aagacatcgc taaataagtc 1560 gagtctggcc ggttatagga aaccgaaccg taaatcgaga tcgaagtctc aaggttcgag 1620 gaaagaagtt caggaacgtc cagttgacgt tatgaacagg gacagagcca cgatggctgc 1680 tcccataaaa tcgacggtca cagttgaata tccaggagcg atatcagaga ctgctgacgc 1740 gtctgagccg agcattcaag ctgcggtgga acaaagaatg gttgagaatt taccaaccaa 1800 tacaagtcaa gaagaacact atctgaaaaa tgttctcaaa agattttaca tgttggactg 1860 caaaccagtt gcaacaccaa tagaatctgg caccaatttg actgcatctc tcccaacgga 1920 accagttacc gacgctcctt atcgcgaagc agttggagct ttgatgtacg caatggttgc 1980 tactcgacca gatctcggcg ctgccatcgg tcaagtttcc agatttatgc accatccaaa 2040 cgatacacat tggactgctg ttaaacgtat actaagatac gtaaaatact cacttaatta 2100 ttctcttacg ctaggagggg ataacaccac tctggtagga tactgtgacg cagactgggc 2160 aggagatgta gactctcgga aaagcacctc cggatacact tgctttctag gcaatagctg 2220 tatatcatgg agatctacta 
aacagacttc tgtagctatc tcaactatgg aagctgaata 2280 tgcagccgct gtcactgcca ctcaagaatt attatttcta cgcaacatgc ttaatgaact 2340 cggattcact caagaaagag ctacgatact ctactccgat tcacaatcag caattgccaa 2400 cacggaaaat caagcaccca atcatgctac aactaagcat atggacgtca agcttaaatt 2460 cctacgcgac caagtctcgc aaaagaatat actagtcatg tatattagaa ctcaagatca 2520 gatcgcggac atcatgacaa aaggtcttcc aaggatcgcc tttgcccaat tctcagatct 2580 acttggactg agagccgttc aaggcgagga gggg 2614 // ID Copia-1_PPM-LTR repbase; DNA; FNG; 344 BP. XX AC ABWF01000054; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_PPM_; KW Copia-1_PPM-I; Copia-1_PPM-LTR. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-344 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01000054; Positions 10163 9820. XX SQ Sequence 344 BP; 60 A; 103 C; 80 G; 101 T; 0 other; tgtcacaatg tgctctgcca cagtcccccg gtcggcctag gacgatcccc cggtggcggg 60 agccacaaca ccatgtgacc acatggttgt atcaattgcg gagcaatcgg accggagcag 120 agctccggcc acacttcttt tgcttccatt gttcctgtgg ccaagtggcc cttgttctcc 180 ttttggactc tattgttcat tcttctattg ttctgccgtg ggaatgcttt cccctgctca 240 tgtatataaa gggctgctgt agcgacctgt aatatcactt ctgtgaccca cttcgacttc 300 agccccactt gtggcattca gtccttcagt gctagtcctg tgca 344 // ID Gypsy-42_MLP-I repbase; DNA; FNG; 6026 BP. XX AC AECX01001151; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-42_MLP_; KW Gypsy-42_MLP-LTR; Gypsy-42_MLP-I. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6026 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001151; Positions 129311 123286. XX CC Positions [5101-5304] - Integrase core CC 'CAAAC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 452..1510 FT /product="Gypsy-42_MLP-I_3p" FT /translation="MEEIQRQILELQNSLANERSLREQAEANHRRAEERLI FT AIESQQARQSSTANAQSATSPGHTPAATSVPAPYSNLAPVPKGPKVAVPDK FT FNGIQGGPAEVFASQLQLYMMAHHYLFPDDRSKVVFALSYLTGPASAWAQP FT LTMELLNPETEHLVTFERFVQNFKAMYFDTEKKAKAEKALRSLSQKTTVAA FT YTHEFNIHSHNTGWEVPTLISQYEQGLKQDIRVAMVLVQDDFASIEQISNL FT TIKLDNKLHGVADTSTSFNAPARDPNAMDVSASFTRLSDKEKAKRLRTGSC FT FRCNKQGHRANECPNGRGDRGRKRDNYGAKIAELEMKLAALGDRGEGSSRA FT DDSKNGGAQA" FT CDS 1900..2865 FT /product="Gypsy-42_MLP-I_1p" FT /translation="MPWIRDNHHRIDWNSGTISDSSPISASALTESSYPTT FT PLPNPVVEVVREARRSDEGMCVSNNALPSPQSKSYHLYTYNPEEETCKHLH FT SLQNLPQNHVNTDNDNCTNTLGCVATVPTVSSNPHHTPEDPDAELTGQARD FT FDEGARVRNDTFTPPQCEFASSTIPKHLETACKLSFPRNSQSHEPTDVTNA FT ELTDILAYKTDKNNLNSPTPHESIATVLTVSSSPHHTPKDPDATLLGKTRD FT FDKGAHVRDDTFTPLQCEFALDPITKFVEAAGKPLCPLKFDSQISVDAART FT SWSTSARLAADKKKLIPKKTIEELVPVAYH" FT CDS 3589..5097 FT /product="Gypsy-42_MLP-I_2p" FT /translation="MDPTKVAAVSDWPAPKTVTELQRFIGFANFYRRFIDH FT FSKTTRPLHDLTRDKTPCVWDHKCDEAFNKLKTAFTSAPVLKIADPYKPFV FT LECDCSDFALGAVLSQMCEDDGQLHPVAYLSRSFVQAERNYEIFDKELLAI FT VASFKEWRHYLEGNPNRLDVIVYTDHRNLESFMTTKQLTRRQARWAEILGC FT FDFQIRFRPGKQAARPDALSRRPDLAPPNEEKLTFGRLLRPENITSETFTA FT NIDSVESFFQDEEITLENPEHWFEIDVLGVSEAKTETEDRGPNDTKIIDLI FT RQSTSNCSRLQEIINIIDNPISTKIKKSTSHYRITDGVLYNNGRIEVPDDN FT KIKHEILKSRHDSLLAGHPGRAKTLALVRRCFNWPSQKAYVNRYVDACDSC FT LRTKSSTKRPFGTLEPLPIPAGPWTDVSYDLITSLPKSKGHDSILTVVDRL FT TKMAHFIPCNETMKANELADLMIQNVWKLHGTPKTITSDRGSVFISQITTE FT LDKRLGI" 
XX SQ Sequence 6026 BP; 1892 A; 1475 C; 1289 G; 1370 T; 0 other; tattgtcaga tcatcaacag acaagcctcg aggcatcaag cacagattag actagtatcg 60 aatttcagaa cgcagattta gagatagacc atcagaaacc ccaccaatca ccagcgattt 120 agatttaaaa cttaccgaag catcggattt agaattgaga acacattaga aaccttgata 180 tcagattcaa acttaatcga agactcctgc aagttaaact tcaagaatag atcagccgat 240 atattagagt ttagaataga tcagaatctt caaagttaga ttaaaactac cactttgaaa 300 accctacaag acatcaatca cgtctccttc gtacagagcc ccacaagacg acgacgatct 360 cgacagtgac tccgagccaa ccttcgtcga cgttcctacc gaacaatcca cagaactgca 420 ggtcgctcat ccccgcattc ctgattcaac tatggaagag attcaacggc agatcctcga 480 actccagaac tcattggcga atgaaaggtc cctccgtgaa caagcagagg ctaatcatcg 540 ccgggccgaa gaacgcttaa tcgccattga gtcgcaacaa gctagacaga gttctacggc 600 taacgcgcaa tctgcgacga gccccggcca cactcctgct gctacaagcg ttccggctcc 660 ttattctaac ctagcaccag tcccaaaggg acctaaggta gcggtacctg acaaattcaa 720 tggaattcaa ggaggacccg ccgaagtttt tgcgagccag ttacaactct atatgatggc 780 gcatcattac ttattcccag atgatcgaag taaagtagtg tttgcgttgt cttacctcac 840 gggacccgca agcgcttggg cacaacccct cacgatggag ctactaaatc cagagacgga 900 acacctcgtc acctttgagc gctttgttca gaacttcaaa gccatgtatt tcgatacaga 960 gaagaaagcc aaggccgaaa aggcgctcag atctctatct cagaaaacta ccgtggcggc 1020 ttacacccat gagtttaata tacactctca caacaccggc tgggaagttc ccacgttaat 1080 aagccagtat gaacaaggct tgaaacaaga catcagggtt gccatggtct tagttcaaga 1140 cgacttcgct tcaatcgagc aaatatcaaa cctcaccatt aaactcgaca ataaattaca 1200 tggagttgcg gatacatcta catctttcaa tgctcctgca cgcgacccta acgccatgga 1260 cgtttcggcc tctttcaccc gactatccga caaagagaaa gccaaacgct tgcgtacggg 1320 ttcctgtttc cgttgtaata aacaaggtca tcgagccaac gagtgtccta atggaagagg 1380 tgatcgtggg aggaagagag acaattatgg tgctaagatt gctgagttag aaatgaagtt 1440 ggctgcatta ggagatagag gagagggatc tagtcgagcg gatgattcaa aaaatggagg 1500 cgctcaagcc tgaaggttgt gcctagcttg agcaaagggg gatcagtaga aactttagaa 1560 ttgggagcaa gtgctgttgt aatttgcaat aatgacgatc cacgtctgtt tttgccagtt 1620 tctctttcat tgtcccaagt cccctgagcc 
acaccatata gaatacctcc agctcgcctt 1680 ttgattgact ccggcgcaac tcataatgtg ctgggagaag catttgccaa tgcagctggt 1740 ctgcttcgtt acgcaattaa cacctcaaga gacatctcag gttttaacgg atcccaatca 1800 acctcgtcat acgaaatcga cctcaacatc gaccacgact ctcaagaatc aagatttatt 1860 atcacaccca tcaagaatac ttatgacgga atcctcggca tgccatggat ccgagacaac 1920 caccaccgga ttgactggaa cagcggtacc atcagcgatt ccagcccgat ttctgcctct 1980 gctttgacag agtcgtctta cccgacaaca cccttaccaa atcctgtcgt ggaagtcgtg 2040 agggaagcta ggcgtagtga cgaggggatg tgcgttagta ataatgcgtt accatccccg 2100 cagagtaagt cctatcactt gtacacctac aatcccgaag aagagacttg caagcatcta 2160 cattccctac agaacctacc acagaaccac gtcaacacgg acaacgacaa ttgcacgaat 2220 accctaggct gcgttgcgac tgtcccgacg gtctcgtcaa acccgcacca caccccagag 2280 gatcctgacg cggaacttac tgggcaagct agggactttg acgagggggc gcgtgtcaga 2340 aatgatacat tcacgccccc gcaatgtgag tttgcttcga gtactatccc caaacatctc 2400 gaaacagctt gcaagctttc ttttcccagg aattcccagt cacacgagcc caccgacgtt 2460 accaacgccg aattgaccga tatccttgca tacaagacgg acaagaacaa cttgaactca 2520 cctacaccac atgaaagcat tgcgactgtt ctcacagtct cgtcgagccc gcaccacacc 2580 cctaaggatc ctgacgcgac acttttgggg aaaactaggg actttgacaa gggggcgcat 2640 gtcagagatg atacattcac gcccctgcaa tgtgagtttg ctctagaccc aatcaccaaa 2700 tttgttgaag cagctggcaa gcccctatgc cccttgaaat ttgattccca gattagtgtg 2760 gacgcagcac ggacgtcttg gtcaacatct gcccgattag ccgccgacaa gaagaaacta 2820 atacccaaaa agacaattga agaactggtt ccagtggctt accattgata tcttcatatg 2880 ttctccaagg caaaggctca aggactacca ccccgacgac agtacaattt caaagtagag 2940 ctcattaaag gcgctcaacc tcaagccagc cgtataatcc cattatcacc tgctgaaaac 3000 aacgcgctgg aagaaatggt gaagacagga ctggctaaca gaactattag gcgtacaact 3060 tcaccttggg ctgcgccggt gctgttcacc gggaaaaaag acggtaactt gaggccttgt 3120 ttcgactacc gcaaacttaa cgcggttacg gtcaagaaca agtaccccct accgctaacc 3180 atggatttgg tcgatagctt actggatgct gaagagttta caaaactaga catgcggaat 3240 gcatacggca atcttcgagt tgctgaaggc aacgaagaca agctggcatt tatatgcaaa 3300 ttgggacaat tcgctcctct gacaatgccc tttggcccaa 
ccggcgcccc agggtatttc 3360 cagtatttta ttcaagatat actgatgacc catattggga aggacgtagc tgcctttttg 3420 gatgacacca tgatctacac gaaaaaaggc gctcaccacg tgtccgttgt taatgaagtt 3480 ctggaaatat tcagcaaaca ccatttatgg ctaaagccag agaaatgtga attctcaaag 3540 acagaagtgg aatacttggg ccttctaatc tcaaagaaca aaatttaaat ggacccaacc 3600 aaggttgcag cagtatcaga ttggccagcc cccaaaaccg tgacagaact ccagcgattt 3660 ataggttttg caaactttta taggcgattt attgatcact tttctaaaac cactcgacct 3720 ttgcacgacc ttacaagaga caaaacacca tgtgtatggg atcataagtg cgacgaagct 3780 ttcaacaaac tcaaaaccgc cttcacgtcg gcaccagtac taaagattgc cgacccgtat 3840 aagcctttcg tattagagtg cgattgttca gacttcgcat tgggcgcggt gctatcacaa 3900 atgtgtgaag atgatggaca actgcacccg gtggcgtatc tatcacgctc attcgtccaa 3960 gcagaacgga attacgaaat ttttgacaaa gaattattgg caatagttgc atccttcaag 4020 gagtggagac actacctaga agggaacccc aaccgcttag atgtcatcgt ttacacagac 4080 caccggaact tggaatcttt catgaccacc aaacagctca cacgacgcca agctagatgg 4140 gccgaaatct tgggttgttt cgacttccag atacgatttc gacctgggaa acaagccgct 4200 agacctgacg cattgtcaag aagacccgac ctagcacccc ctaacgaaga aaagcttaca 4260 tttggacggc tcttacgacc agagaacatc acatctgaaa cattcacggc taacattgat 4320 agtgttgaat cattctttca agacgaagag attacgctcg aaaaccctga gcattggttc 4380 gagatcgacg tccttggagt atcagaagct aagaccgaga ctgaagacag agggccgaac 4440 gacaccaaga tcatagattt aattaggcag agtacctcaa attgctcacg actacaagaa 4500 attatcaaca ttatcgacaa cccgatatcc acaaagatca agaagtcaac atctcattat 4560 agaataacgg acggtgtgct gtacaacaac ggtaggattg aggtaccaga tgacaacaag 4620 atcaaacacg aaattctcaa gagcagacac gatagcttac tagctggcca ccccggcaga 4680 gctaaaaccc tcgcacttgt caggagatgc ttcaactggc cttcacaaaa agcgtatgtc 4740 aataggtatg ttgacgcgtg tgattcctgc ttacgtacca aatcaagcac aaagaggcca 4800 tttgggacat tagagccgct tcccataccc gcaggtccgt ggacagatgt cagctacgat 4860 ttgatcacca gtttaccgaa atcaaaaggc cacgacagta ttttaacagt ggttgatcga 4920 ttaacaaaga tggctcattt catcccttgt aatgagacaa tgaaggcaaa cgaactcgct 4980 gatttaatga tccaaaatgt atggaaactg catgggacac caaaaaccat 
tacatcggac 5040 agaggtagcg tgtttatatc tcaaatcacc accgaattag acaaaagact tggtatttga 5100 ttacacccgt caaccgctta ccacccaagg actgatggac aatcagaaat tgttaataag 5160 gtcattgaac agtatttgcg acactttgta ggctaccgac aagacgattg ggccgactta 5220 cttcctatag gcgaattttc ttataacaac aaagaccacg catcgacggg agtttcgcct 5280 tttaaagcaa actacggatt cgaaccgact tttggtggag tcccttccag tgcacaatgc 5340 gtacccatag tggaggaaag aatcaataca ttgaaagaag tacaagctga attaactgaa 5400 tgtttagaaa ttgcacagga tattatgaag aaacaattcg acaagaccgt tagacccacg 5460 ccagaatgga agattggaga ccaagtctgg ttagatagca agaacatttc cacaacacga 5520 cccagcccta agctagatca taagtggcta ggccctttca atattgtaaa gaaaatatca 5580 cgttctgctt atacactaac gttgcatcta tccatgaaag gcgtccatcc agtatttcac 5640 gtgtcactac ttaagaaaca cacacccgac agcatcaaag aaagaaggca acaagaacca 5700 agtccaataa taattgaaga taatgaagaa tatgaagtaa atgaaatatt agattgtaga 5760 agaaaattta acaaattaga gtatttagtt agttggaaag gattcagcgc agaagaaaac 5820 tcgtgggaac cagtactcaa tttgaagaac agtatgaaat tagtcaatga gtttaatgaa 5880 aagtttccgg acgcatcttc aagacacaag aggacacgga gaaaaaggtg agagggcaag 5940 ctttttcccc acagggtttt ttaatgctgc ccagggaagg aacgcagagc ttacaagaga 6000 gagcttgggc gtaaaagggg gtatag 6026 // ID Harbinger2-2_TMe repbase; DNA; FNG; 3156 BP. XX AC . XX DT 13-AUG-2010 (Rel. 15.09, Created) DT 13-AUG-2010 (Rel. 15.09, Last updated, Version 1) XX DE A family of autonomous Harbinger transposons - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW Interspersed repeat; Harbinger2; Harbinger2-2_TMe. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-3156 RA Kapitonov V.V. and Jurka J.; RT "Harbinger2, a novel clade of Harbinger transposons in protozoan, RT fungi, choanoflagellate, and metazoans."; RL Repbase Reports 10(9), 1224-1224 (2010). XX DR [1] (Consensus) XX CC Harbinger2-2_TMe belongs to a novel clade, Harbinger2, of CC Harbinger DNA transposons. 
This clade includes transposons CC present in protozoan (brown alga), fungi, choanoflagellate, and CC metazoans. CC Harbinger2-2_TMe is a consensus sequence of a family of CC autonomous Harbinger transposons that were active in the Tuber CC melanosporum genome recently. The consensus was derived from 5 CC copies ~96% identical to it. The genome contains only four CC full-size copies of Harbinger2-2_TMe; they are flanked by the TNA CC target site duplications. This transposon codes for the 387-aa CC TPase and 382-aa unclassified protein. XX FH Key Location/Qualifiers FT CDS 243..1403 FT /product="Harbinger2-2_TMe_1p" FT /note="Harbinger TPase." FT /translation="MNLNIPQVTTILLAAVAADLNTRKILNSIILHQRRKS FT KTVQVLLSGPTRVLKNYSQKRKIPYQQFDWSIEDQTDDWCIEFLRFNKAQI FT IELANLLRIPDTFRYRYRANPVTALAVTLYRLSYPSRLKQSISIFGHERGW FT LSTIFNNVCSHLYQYFAGKLHWDSEFLSPQWLEKYCTAIANKGEPTGLIWG FT FIDGTQRPICRPESETADQALFYSGYKKQHTMQFQAITTPDGLIASLSGPW FT EGRLGDWAIWVNSGIGKVLSQYGRNVFGEQLYVYGDAAYSVETGVIGAFSA FT PAGSVLSQEKAVFNAYMAKQRMSVEWGFGKVLQYFTFLGFKYGLKVGLSPI FT ATWYFTGVLLTNCHTCYYGSNTADTFGCSPPSIKEYLCTKLSKE" FT CDS 2565..1420 FT /product="Harbinger2-2_TMe_2p" FT /translation="MKVSEQVPQFSESEFVPATTYVLPPSSIHPTFPPILS FT THSGFSMPFTPSTISITSATPTSSTLSTTSTFPILSQHFMQEPLIQGTTQS FT LPGTNKPRTILKDDDSQARLVQLCCNHFHLYIKGKGKFYDFMRVKYRELYG FT IDVNVKGFIQRREAMRRSQLRDERGKSGVARADTDYNQALDKWIELYDDHE FT RLIKEQKAEKGSKVEAEKREAIQTREDMIYGFAKKWKQRSDELEGLVGLED FT HSRGTATGEDTDPASGKYKNLSINYCYCYKTTNLILIDSDITIIKPFTSKR FT QITSQILSSMKEGDQETLRSLRDMQAEQMDRWEHIIGNVVGVSSESEWSNE FT SSQPIKRLEERIQRIEEQSGEVKEKLDQVLSLMASKFSS" XX SQ Sequence 3156 BP; 906 A; 657 C; 643 G; 950 T; 0 other; agggaggttc tgcaccaagg tgcgcaccac gtttccacag caatggggtg gcaaaagtgg 60 ctggtgatgg ttttttttat aatgcacact aatttttgca ccatgtacgt gcagaggtga 120 tgcagatgtg tttacatagt aaacagaggt tttcttagtc aggatacggt aataaaggca 180 tagcttatac aacaacaggt aagtgtgtag agacaaacat cacccatgat aaatatccta 240 cgatgaactt gaacatccca caagtcacaa ctatcctcct tgctgctgtg gcagcagatc 300 ttaatacaag aaaaatctta 
aattcaatca tcctccacca aagacgaaag tctaaaactg 360 ttcaagtctt actttccggt cctacaagag tccttaaaaa ctattcacaa aaaagaaaaa 420 taccatatca gcagtttgat tggagtattg aagaccaaac tgatgattgg tgtattgagt 480 ttttaaggtt caataaagct caaattatag agcttgcaaa cctgttacga ataccagata 540 ccttccgtta ccgctatcgt gccaaccccg ttactgccct cgccgttact ctctacagat 600 tgtcttatcc gagtcgcctg aagcaaagca ttagcatttt tggtcacgag cgcgggtggc 660 tttccaccat atttaacaat gtatgttctc acctatacca atattttgcc ggaaagcttc 720 actgggattc agagttccta tcgcctcaat ggttagaaaa atattgcact gcaattgcaa 780 ataaagggga accaactggc ctgatttggg ggtttattga tggaacccaa cggccaattt 840 gccgacctga atcagaaacc gctgatcaag cactatttta cagtggctac aaaaaacaac 900 atacgatgca gtttcaggca attaccactc cagatgggtt aattgcaagc ctatctgggc 960 catgggaagg acggttaggt gattgggcaa tatgggttaa ttcaggaatt ggaaaagtac 1020 tgagtcagta tggaaggaat gtatttgggg aacagcttta tgtttatggt gatgctgcat 1080 actcagttga aactggggtc attggtgcat tttcagcacc agctggcagt gttttaagtc 1140 aggaaaaagc agtttttaat gcctatatgg caaagcaaag aatgtctgtt gagtggggct 1200 tcggaaaggt tttacaatat tttacatttt tgggttttaa gtatggttta aaagtaggac 1260 tttcaccaat tgcaacatgg tatttcacag gagttttgtt gacaaattgt catacatgtt 1320 attatggttc aaataccgca gatacttttg gctgttctcc tcctagtatt aaagagtatc 1380 tttgtactaa attatcaaaa gaataagtat tatattctaa ctactaaact ttgatgccat 1440 taaagaaaga acctgatcaa gtttttcctt aacttctcct gactgctcct caatccgttg 1500 aattctctct tccagtcttt ttattggctg tgacgactca tttgaccatt ccgattctga 1560 acttactcca actacattcc cgattatatg ctcccagcgg tccatctgct ctgcttgcat 1620 atcccttagt gaacgtaaag tttcttggtc accttccttc atcgagctga gtatctgact 1680 tgttatttgt cgcttcgagg taaagggttt tattatagta atatcactat ctattaatat 1740 taaattagtg gttttataac aataacagta gtttatggaa agatttttat acttaccgga 1800 ggcagggtca gtatcctctc cggtagcggt ccccctactg tgatcctcta atcctaccag 1860 cccctccaac tcatcagatc tttgcttcca ctttttggca aagccgtata tcatatcttc 1920 tcttgtctga atcgcttctc ttttttcagc ttctactttt gatccttttt ctgccttttg 1980 ctccttgatc aaccgttcat ggtcatcata aagctcaatc 
cacttgtcca atgcttgatt 2040 atagtcggta tcagcacgtg caacaccact tttccccctc tcatcccgaa gttgagatcg 2100 tctcattgcc tcccttcttt gaataaagcc tttgacatta acatcaatgc catacaactc 2160 tctatatttg actctcataa agtcataaaa ctttcccttg ccctttatat acagatgaaa 2220 atggttacaa caaagttgga ccaagcgggc ttgagagtca tcatccttca gaatagtacg 2280 aggtttattg gtacctggta gtgattgggt agttccttgg attaaaggtt cttgcataaa 2340 gtgttgggat agaatgggga acgtggatgt tgtagaaagt gtggaggatg tcggagttgc 2400 agaggtgatg gaaattgtgg aaggcgtgaa gggcatggaa aatccagaat gggtggagag 2460 tatggggggg aacgtaggat gaatggacga tggagggagt acgtaggtag ttgctggaac 2520 aaactcagac tcggaaaatt gcggtacttg ttctgagact ttcatagttt cctagtctca 2580 taatttactg ttagtaattg ttactctcag taaatgtatt ataagagaca agttcctaat 2640 acaatacatt tttccgagaa ctttgctcaa tctcgtccaa tttcctacca gcatgggact 2700 cattcatctt gctcatcata gtaccgccag cttaagagaa tgctctataa ctccttaagg 2760 tcagtaaaac ttttaactca attcctaacc agtggaagga ttcaactata cccagccgga 2820 tccgagtcgc gacacaaatc ctcctagcct ccaggtagga actctctgaa atccagcgtg 2880 cgtacaagct gacagaaaca ccaaggtctc aaaaaaaaat aagaatagga tagacaacca 2940 aaatgagttt taagttgata gtaacgttga tgttagaata agggtgggtt gagcgggtga 3000 gcaggagaca catgataaac aaactgaagt cacatgtttt gtttcgtcat tctcacgatt 3060 gaactgcgca acaagcagaa cccccctaga attgatgcac catttccgca catgcaccac 3120 cctgttgtcg tgcgcacccc aatgcagaac ctccct 3156 // ID Gypsy-81_MLP-I repbase; DNA; FNG; 5727 BP. XX AC AECX01001084; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-81_MLP_; KW Gypsy-81_MLP-LTR; Gypsy-81_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5727 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001084; Positions 48185 53911. XX CC Positions [3882-4307] - Reverse transcriptase CC Positions [4875-5090] - Integrase core CC 'CAGG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1263..5504 FT /product="Gypsy-81_MLP-I_1p" FT /translation="MEENLGFVSIPAEDLRLIRAQLEQVNIDTQTNAQNHA FT REIAKLRNQHHLIYQQQSNQIHQLEANQHQHTNRASDPNKPDPFRAEPRKF FT GGPGDNANLWISELESNFKMHKYPESAWGEMLGSYLDEETHMFWHGIRTKQ FT GGTITDYQDFKTQFLEHYNFDLMVEENLEKLKVCYFKQNDLHDYILRFRKI FT MAHMPDDEMNFKQRKFFFLDKLPEFYRDKINNREELRKKQDMELVYAAARE FT AERSAKINHRGYKNWDTKSPSTSKNHEKKPQFNPSHRHNHHYRPSTSFGKP FT HAVTNGSGPMDLDLADLANVECYSCHGKGHRANACPRNKNKSSNSHKQRPN FT FSGDRRKPNLLLINHDPTLPIQSTSNSLDSFIPPFQDFELQEQAEFTTPFF FT DSEGRMLGASRESSESLDGPTEDGRESLADYQKPYLEKLENLDSKALLEMQ FT EAREAVAKLMEDEVKSGFRCCKSCKAYGNEDSPPTVSKGIDYFSLDGLRID FT QTDLKRSLSTTEENESEILPAHDPEGKNDLYYSNLEDLINKDTSMKRFKLD FT CSLIDGYQYTRAPFNSPVAEIDRLDLAEKELECVEGWNHILSSTYTSQAEY FT DDFDSVCTPDEEDSSVKAVEHPSDIEFEITSDMTDYEDLKARAVDNPPEPL FT DLSTVELAAQQGPGKTSLPIYPCQWRDMPLDAIMDTGAAGNYISLEKVKTM FT LKLNPRKIKIDKVSTQGVRLANGAQEKCSETATFQAEIYHKDTSETFNFTI FT SAYILPLPNISLILGLPWHREHKPAIDYDTGTYTVKKHSGTFDIQPKDCEP FT KLFTIDNNNFGQGLEREAKVIAPDCFSDELIIKDRKHKHYIDTGDSKPIKT FT HGRPHTPAEHEIINQFIKEGLEQGIIERTDSPWSSPLLLVKKQDGSTRVCV FT DYRALNKVTRKNAYPLPRIDDAYQFLSGSNVFSTIDLKSGFWQIPMAPEDK FT EKTGFTCRQGHFQWKVMPFGLCNAPATFQEMMNGVLKEVIDKFVLVYLDDI FT IVYSKNKEEHIEHIKTVFEILQREGLVVSVRKCQWGKPSLLFLGHIVDGNG FT IRTNPDKIAKIVEWPIPSNISQVRGFLNLCTYYKRFILKFSTIASPLYKLT FT EGSPKPGTAIKWGKEENSSFEKLKEALSKTVPLQHPTPFHPFVLDTDASGT FT NIGAVLQQDSSFEIPKVGNFDYNNYQKQLKNNNLRPIAFESRKLSKIKHVR FT TSPYHPQANGLIERFHGTLMNSVRKCCSPYNQDRWDDYLNSCLFAYRASYS FT HSMKASPYFMAYGEEARLQSEAVSRTFDSSLENLELIHRQRNITVHKLRNK FT REDLIKLLNDRAGERYAQTEESYTERNLRPGDRVLRCFEGRPSKLHPKWDG FT PFIIRDAFPNGTFELMTSNGHVLQAKTNGSRLKKFKGTTDDFYFASQRLHD FT " XX SQ Sequence 5727 BP; 
1863 A; 1341 C; 1010 G; 1513 T; 0 other; aatatatatt tgtgtgttct tatttctttc ctctctaact ttaaatcatc accatacctg 60 atatcgaaga aagcttggtt attaatctgt gttctgtttc tctccaactc tgtgttcctt 120 tcgtactccg ttcacagata tcctcaccgg aaaccttacc cagaagcaac aaaccctcac 180 acccagatta acaaccccgt cgtaagcctt agaattccct agctagttat tccaatcctt 240 tccgacactt tcttttgatt agttagaact cattaaagaa atcttcccta tcccttacac 300 acagaaaacc taccattaac actacgccaa attgaaaatt taccgctgag tttagaaaat 360 tgagaagaca tagaaaatta tggaaaatct gggaggggtc agaaaaatat gaaaaaatat 420 tggttgggcc ctttggtgca ttccaatcat ctggcatatt ttttcatatt tttacaacca 480 caatgcaaat aatgtcagaa acccaatttt ggtccaaaac tggcaggtag agagacagtg 540 accttgcaac accaacaaaa tcatcaattt ttctccaaca catgtaatgt cttgacattc 600 tgaattccac attctggatc tacagtttgt gtgcacaaaa aagctgtgtg tctccaataa 660 tgggtgagat atatcaagag acgtacacag cagaaaccat atataagcat agaagtcata 720 actacaatgg cttcagcacc ttaataaacc ttgtagagac atcatgcggc ttccataaat 780 tgcaagtgtt cagatgattg aataaaaata ctcatatttt tgtgcttgga ggtttttgcc 840 caaatattgc tagttttgga ccaaatttga gtttctgaca taatatcccc tgtgagcatt 900 ttggtttggg gagacttcaa aaatcatggc aaaaatatgc aaaaatatgc aaaatgaggg 960 gacagcaccc atgaattgaa tcattatttt ttcaaatttt tctgacctcc cctgtattgg 1020 tgatattttt ctacagtttc aattttctga actcagcggt aaattagtga tctggcgtag 1080 tgtaacaccg ttaacccaaa aaagcttttt accaaccttt atatttcgat ttttgaatca 1140 ttgaacattc tttttgaaga attaacgtac gtactcctta gactctgatc aagtcttatt 1200 taaagaatac ttttaataac caaccccttt ccgattcttt acgaacagaa tctcattcaa 1260 tcatggaaga aaatctagga ttcgtctcaa tcccagctga agacctacgc cttatccgag 1320 ctcagcttga gcaagtcaac atcgatactc aaaccaacgc tcaaaaccat gctcgcgaga 1380 ttgccaaact tcgcaatcaa caccacctaa tttaccaaca acagtctaat cagatccacc 1440 aattagaagc caaccagcat caacacacca atcgtgctag tgacccgaat aaacctgatc 1500 cctttcgagc cgaacccaga aagtttggcg gtcctggcga taacgcgaat ttatggatct 1560 cggaattgga gtccaatttt aagatgcata agtacccaga gtccgcctgg ggagaaatgt 1620 tgggttctta tttagatgaa gaaactcaca tgttctggca tggcatccga 
actaagcaag 1680 gtggaaccat taccgattac caagacttca aaactcaatt cctcgaacac tacaacttcg 1740 acctcatggt tgaagaaaat ttagagaaac tgaaagtttg ttacttcaag caaaatgatc 1800 tccatgatta catcttaagg ttccgcaaga tcatggctca catgccagat gacgaaatga 1860 acttcaaaca acgcaagttc tttttcctcg acaagctccc cgaattctac cgagacaaaa 1920 tcaacaacag agaagaatta cggaagaagc aagatatgga acttgtctat gccgctgcaa 1980 gagaagcaga gcgaagcgcg aaaatcaatc atcgcggcta caagaactgg gacacgaagt 2040 ctccctcgac ctcaaaaaat catgagaaga aaccccagtt caacccttcc catcggcaca 2100 accaccacta ccgtccttca actagcttcg gtaaacctca cgccgttaca aacggctctg 2160 gcccgatgga tctagacttg gccgatctcg ccaatgtcga gtgttactcc tgtcacggta 2220 aaggacatcg cgctaacgcg tgtccccgca acaaaaacaa gtcgtctaac tctcataaac 2280 aacgacccaa cttctcagga gatagacgca agccaaactt attattgatc aatcatgatc 2340 ctactctccc tattcaatca acttctaatt ctcttgattc attcattcca ccctttcaag 2400 attttgaatt acaagaacag gctgaattca ccactccttt ctttgattct gaaggacgaa 2460 tgctcggcgc ttccagagaa tcttcagaga gtcttgacgg acccactgaa gacggtcgcg 2520 aatccttagc cgactaccag aagccctact tggagaaact cgaaaactta gactccaagg 2580 ccctgttgga aatgcaagaa gctcgtgaag cagtcgccaa gttaatggag gatgaagtca 2640 agagtggatt tcggtgctgt aagtcctgca aagcctatgg aaatgaagat agccctccta 2700 ccgtatctaa aggcatcgat tatttctctt tagatggact tcgcatcgat caaaccgatc 2760 tcaaaagatc tctttcaact actgaagaaa acgagtctga gatcttaccc gctcacgatc 2820 cagaaggaaa aaacgatctc tattattcca acctcgaaga cctcatcaac aaggacacta 2880 gtatgaagag gttcaaattg gattgttccc tcattgacgg atatcaatac actagagccc 2940 cctttaattc tcccgtcgct gaaattgatc gcctagattt agctgaaaaa gagttggagt 3000 gtgtagaagg atggaaccat atcctcagtt ccacctacac tagccaagcc gaatacgacg 3060 acttcgactc agtttgcaca cccgacgagg aagattcaag tgtcaaagcc gtcgagcacc 3120 cttccgacat cgaatttgaa atcacctctg atatgactga ttatgaagat ctcaaggctc 3180 gcgctgttga caatcctccg gaaccacttg acttaagtac agtagaactc gctgcccagc 3240 aaggcccagg taaaacttcc cttccaattt acccttgcca atggcgtgac atgcccttgg 3300 acgcaattat ggatactgga gcagctggaa attatatttc cttggagaaa gtcaagacta 
3360 tgttgaaatt aaacccgaga aaaattaaga ttgataaagt ttctactcaa ggtgttcgac 3420 tagcaaacgg agctcaagaa aaatgttcag aaactgccac ctttcaagca gaaatttatc 3480 ataaggatac ttcagaaact tttaatttta cgatctcagc ttatattctt cctttaccca 3540 atatctcgct gatcttaggt ctcccttggc accgagaaca caagccagcc attgattacg 3600 acactggcac ttacacagtc aagaaacact caggtacctt cgacattcaa cctaaagact 3660 gtgaaccaaa gctattcact attgacaaca acaactttgg tcaaggttta gaacgagaag 3720 caaaagtgat tgctcccgac tgtttttcag atgaattaat tatcaaagat cgtaagcata 3780 aacattacat tgatactgga gatagtaaac ctattaaaac tcacggtcgc cctcatactc 3840 cagcagaaca tgaaattatc aatcaattca ttaaagaagg attagaacaa ggcatcattg 3900 aacgcacaga ctcaccctgg agttcaccct tattgctcgt aaaaaaacaa gatggatcaa 3960 caagagtctg tgttgactat cgagcactca acaaggtaac aagaaagaat gcctacccac 4020 tccctcgtat cgacgacgct tatcaattct tatctggttc taacgtcttc tcaactattg 4080 atttgaaatc cggattttgg cagattccca tggccccgga agacaaggaa aaaacaggtt 4140 ttacctgtcg acaaggacac tttcaatgga aagtaatgcc ttttggatta tgtaatgctc 4200 ccgctacttt ccaagaaatg atgaacggtg ttctaaaaga agttattgac aaatttgtgc 4260 tggtttattt agatgacatc attgtttact caaaaaataa agaagaacat atcgagcata 4320 tcaaaactgt ctttgaaatt ctccaaagag aaggattagt tgtgtccgtg aggaagtgtc 4380 aatggggaaa accttcctta ctattccttg gacacattgt tgatggtaat ggcatcagaa 4440 ctaatcctga caaaatcgct aaaattgttg agtggccgat tccctctaat atctctcaag 4500 tgcgaggatt tcttaatcta tgtacttact ataaacgctt cattctcaag ttttcaacca 4560 tcgcctctcc tttgtacaaa ctcaccgaag gatcacccaa accaggaacg gcaattaagt 4620 gggggaagga agagaactca tcattcgaaa aacttaaaga agcgttatca aaaacagttc 4680 cattacaaca cccaaccccg tttcacccct tcgtactaga cactgacgca tctggaacta 4740 atatcggcgc agtgctacaa caagactcaa gttttgaaat tcctaaggta ggtaacttcg 4800 actataataa ttatcaaaaa caacttaaaa acaataatct tcgtcctatt gcctttgaat 4860 cacgcaagtt atcgaaaatc aaacatgtgc gaacatcacc ctatcaccca caagctaatg 4920 gcttgattga aagattccac ggtaccctca tgaacagcgt ccgtaagtgt tgtagtccct 4980 ataaccaaga tcgatgggat gactacttaa actcttgtct cttcgcttat cgcgcttctt 5040 
attctcactc aatgaaagcc tccccttact tcatggctta tggagaagaa gcccgcctac 5100 aatcagaagc tgtaagtcgc acatttgata gttctttaga gaacctcgaa ctgatccaca 5160 gacaacgaaa catcacagtt cacaaactca gaaataaacg agaagacctc attaagcttc 5220 tcaatgatag agctggagaa cgatacgctc aaacagaaga atcttacaca gaacgtaatc 5280 tcagaccagg tgatcgagtc ctccgatgtt ttgaaggaag accctccaaa ctccatccca 5340 aatgggatgg accatttatt atccgagatg cattcccaaa tggtacgttt gaattaatga 5400 cctcaaacgg ccatgtccta caggctaaaa ctaatggttc ccgactaaaa aaattcaaag 5460 gcacaacgga tgacttctat ttcgcatctc aacgattaca cgattgagac cagcgagcta 5520 gaagacgatc ttgttaaata catcgaccta tccagcagcc tgttcaacat catcaacgaa 5580 gatgtcgcac cagaaatcct tgataaattg tttgccaaac tccaatcatt aaaatatctt 5640 attttgaaag aaaaaggcag acgataccaa ctcaaaggaa aggcaaaatg gtaaactagg 5700 aagtttacgg tcttacagag gggatga 5727 // ID Gypsy-58_MLP-LTR repbase; DNA; FNG; 274 BP. XX AC AECX01001648; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-58_MLP_; KW Gypsy-58_MLP-I; Gypsy-58_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-274 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001648; Positions 42931 43204. XX SQ Sequence 274 BP; 69 A; 66 C; 36 G; 103 T; 0 other; tgtaagacct acggtcatta cacatcaccc aatacaaata cttaatactt aacacttaag 60 cttaagcctt gttttcattt acttataaag ttgttcacgc gtccctctga tttctggttc 120 cttagagata tcgggactaa tctcttgccc tttatcattc ggaaccaggt aggacttctt 180 tctctatatt tctctacatt tctcttttat catttagtat agccttcctt agagatatcg 240 ggactaatct cttgcccttt atcattcgga acca 274 // ID Gypsy-6_CCO-LTR repbase; DNA; FNG; 624 BP. 
XX AC AACS02000011; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_CCO_; KW Gypsy-6_CCO-I; Gypsy-6_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-624 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000011; Positions 44471 45094. XX SQ Sequence 624 BP; 166 A; 153 C; 100 G; 205 T; 0 other; tgttgtagta tgtatcaatg aagtaacaac ttaccttcgt gactcttact ttcctcccat 60 cttatctaca ttcctgtttc ttacccttat aaacaaaccc atatctcagt gacgacttca 120 cgactttgta catacgcttg actttatcac aatgtttgtt actatttatc ctagaaggcc 180 tcgccttcat cttgactgtg tttgttctta tttaacttac acggtttcac cttcgacacc 240 ttgactttcc cagcgccctc tggaagcgct cggaaagtat gagaacatcg tagtgagttc 300 tgtatcactc taggtagaaa ccataacaat ataatagatc gcccttcgac taaaggacga 360 cacctttttc taacttcaaa cttgacttgc ccttgagtaa gaataccgta cggcttcgcg 420 ctacggattt gagtttgaga atagaaatcg attacctcag catttacttc gcttcgtgaa 480 actacgctca gtgagactct tgattacaac tacgactatt cattattgta tcacaactgt 540 aagctggtaa ggttctattg aattaagttg agttgtctta ctgtcttcac cccgaagaca 600 ccaacgtttc gacacctctc aaca 624 // ID Copia-1_TMe-I repbase; DNA; FNG; 4903 BP. XX AC CABJ01002173; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_TMe_; KW Copia-1_TMe-LTR; Copia-1_TMe-I. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. 
XX RN [1] RP 1-4903 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01002173; Positions 67556 62654. XX CC Positions [1831-2343] - Integrase core CC 'ATTCT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 106..4683 FT /product="Copia-1_TMe-I_1p" FT /translation="MSLSKFSAAKDDSVYRCKYGEATRLSQLTYPQWSQDL FT QYLLQGAGALEITLGHEVAPPANQHARLLDFNRRESLAVTFIYNSCGPEAK FT AFLRRIPRSPAAMWTALATEFDTAASRAGREGLIREFNRIKCSDYSSVSAY FT ITALMDFKDVLALTDQAITDATFISRLTSSLPDPYDTTIQLLHQQANLSVM FT DYITSIRQYEATLRIKAPEASTSNVAATSGSALYSRVSSTHGGNSGFRRSG FT HGLRSRGRSRGRSGYRNSWSRSNGSVGGHADDRSSSVTCWHCGRHGHTRRD FT CNSRQRGRQAMDAHQKSKRHDSADKNSGSSQASAMPATADPVLVSVQALLT FT DVVSLSSNELTVSDTRQAEISNWLIDSGATHHLCRNRDALFSYRAFKPNAT FT IPIHMGNNSSINAIGVGTVRILFKNNQSHEVLALDVQALYAPNLRSSLLSV FT GQLSAHRRITFTEDTCMISRLDGGSQIPLGQLRNLVWELAGPGRAVVTGAI FT KSPALFLLSRQAHALLANATNISTATDVSIPITLWHQRLAHLNFRAVSSIT FT GLPIGDIPPCHTCIEAKHQRTFVRLPVARTARPFELIHSDLCGPIGTPSWS FT GSRYLILYIDDYTRWAYGYFLRLKESIEITRIFQEFQARVETAFPNWPISR FT FRCDNGKGEYDNSLFRGILRVGGILFEPAPPYCQHKNGVAERMIRTIIGKG FT RAILLDSNLPDAMWAEAVETALYLHSRSPSTSLGGRSPYEMLHGIKPDLGH FT LRRFGCSASRLIPKEQRNGKFGPRSRECIMIGYVHDTAKIWRLWDPEFRAI FT VRSSDVVFDESRTPGDVKHGPGRDVLKEFVPLVNGVEDDDEIGSVAAVDAK FT AEELKLGAAKTPMTRGEGSDVPVNPVPPPNKILVPETGENGDRRVVRKESS FT RAAMIVEGEKKVRVEHDNLQIGQTTLDLESSSSGPTLPVPKRIKVGPTVQE FT PEKAGPMSLGPERLLKRAQESNQMLRRSLRRRQPVQAIAAEAKNVSELRPG FT DDPESYADAVSHTAWRSAMKQEFNSLIENDTWEVVSHDDIREAKIIGCKWV FT FRVKHNSDRSIRYKARLVIKGYEQTEFGETYAPVARLTTFRILIALAAQLG FT WKIHQMDVITAFLNPVVNDLVFMALPEGIEWLHPGASPGGVCRLKKALYGL FT KEAPRLWFSDINRFLQSQGFIPSSGDANLYISSSHKVILLLYVDDILLASS FT NPSAIATVKQLLVSRYKMTDLGLSRQFLGIDIEQLPEKIRIGQRHFVESVL FT RRFGMSDCNGIWTPLEVRPSLEFKPLVPEDQQLYQSLVGSIMYLMLGTRPD FT LAFTISVLSKFSASAGSEHLAQAKRVLRYLKQTRDIKLCYRRSETPLDRSL FT KGFSDSDWAGDLGDRKSTGGFVFLLSECVVAWKAKKQTIVALSTTEAEYIA FT ASEAGREAVWLRRLIGDLMDTISPTDSESFSSPTIIYTDSTGGLAQISNVR 
FT HHERTKHIDIKYHYVRDIVEQGTVLFRHISTHDMSADILTKPLARDLHWRH FT MERIGMETVV" XX SQ Sequence 4903 BP; 1154 A; 1297 C; 1195 G; 1257 T; 0 other; ggttatgagc ctattgtagc cttccatctg caatttattc ctcgcgcttt atattaaatc 60 ttctatctat ctgtctacct atctttctat acttcaacct tctcgatgtc tctttcgaag 120 ttctcggcag ccaaagacga ttctgtttac cgttgtaagt acggtgaagc taccaggttg 180 agtcagttaa cctacccgca gtggtcccag gaccttcagt accttctgca gggggccggg 240 gcccttgaaa tcacccttgg tcacgaagtt gccccgccag ccaaccagca tgctcgcctc 300 ctcgatttca atcgccgtga aagccttgca gtcactttca tttataactc gtgcggtcct 360 gaggcgaaag cgttccttcg gcgtattcct cgctcgccag cggcgatgtg gacggcgctt 420 gctactgagt tcgatactgc cgcctctcgt gcggggagag agggtcttat ccgtgagttc 480 aatcgtatta agtgcagtga ctactcatct gttagtgcct atatcaccgc cttgatggat 540 ttcaaggatg ttcttgcctt gaccgatcag gccattaccg acgctacgtt catctctcgt 600 ttgacctcct ctttgccgga tccttatgac accacgattc agctattgca ccagcaggca 660 aatctttcgg tcatggacta tattacctct atccggcagt atgaggccac cctccggatc 720 aaggccccag aggcctctac ctcaaacgtt gctgctactt ctggttctgc tctgtactcg 780 agagtcagtt cgacccacgg cggtaattcc gggttccgta ggagtggtca cggcctccgc 840 agtcgtggtc gcagtcgtgg tcgcagtggc tatcgtaact cctggtccag atccaatggc 900 tccgttggcg gtcatgcgga tgatcgttcc tcgtccgtta cctgctggca ttgcggtcgc 960 catgggcata cgcgacgtga ctgtaattct cgtcagaggg ggagacaggc aatggatgca 1020 catcagaaga gtaaaagaca tgacagtgct gacaagaatt ccgggagctc acaagcttca 1080 gccatgccag ctactgctga cccagtcctc gtctctgttc aagcccttct gaccgatgta 1140 gtttctcttt catcgaacga gttaacggta tcagataccc ggcaagccga aatctcgaac 1200 tggttgattg attcaggtgc aacccaccac ctttgtcgga atcgcgatgc attgttttca 1260 taccgagcct ttaagcccaa cgctactatt cccatccata tgggcaacaa ctcgtcgatc 1320 aatgccatag gcgtaggcac tgtacgaatc ctcttcaaga acaatcaaag ccatgaagta 1380 ctggccctag acgttcaggc tctctatgcc cccaatcttc ggtcatctct cctgtcagtt 1440 ggccagctgt ctgcccaccg tcgtattacc ttcaccgaag atacctgcat gatatcaaga 1500 cttgacggag gttcccaaat acctctcggt cagcttagaa atttggtctg ggaactcgcg 1560 ggtcctggaa gagcggtcgt caccggtgcc 
atcaaatccc cggcgctctt ccttctatct 1620 agacaggctc atgctctcct tgcgaatgcc accaacatct ccactgccac tgatgtgagt 1680 atcccaatta ccctgtggca tcaacgactg gcccatctca attttcgagc tgtttcctcc 1740 atcaccggtc tccccatcgg cgatattcca ccctgccata catgcatcga agcaaagcat 1800 cagcgcacct tcgtccgact tcctgtcgcc agaactgcaa gaccctttga gctcattcac 1860 tccgatctct gtggcccgat tgggaccccg tcctggtccg gatcccgtta tctcatcctc 1920 tacatcgatg attatacgcg ctgggcatat ggctattttc tgcggttgaa ggaatccatt 1980 gagattacga gaatctttca agaatttcag gcgcgggtcg aaacagcttt tccgaactgg 2040 ccgatttcgc gattcagatg cgataacggc aagggggaat atgacaactc cctcttccgt 2100 ggtatcctgc gtgttggtgg aatccttttc gaacctgcac cgccttactg tcagcacaag 2160 aacggcgtag cggaaagaat gattcgaaca ataattggga aggggagagc aattctcctg 2220 gactcgaact tacctgatgc aatgtgggcc gaagcagttg agacggcact ctatcttcac 2280 agccgttctc catcgacttc acttgggggt cggtcaccgt acgaaatgct gcacggcatc 2340 aaaccagacc tcggccacct tcgtcgcttc ggttgctcag ctagtagact tattccgaag 2400 gagcaacgga atggtaaatt cggtccacgt tctcgcgaat gcatcatgat cggttatgtg 2460 catgataccg caaagatttg gcggctatgg gaccccgaat tccgtgcaat tgttcgttca 2520 tccgatgttg tctttgatga atctcgtacc cctggtgatg tgaagcacgg tccaggtcgt 2580 gacgtgctca aagaatttgt tcctcttgtg aatggggtag aggatgatga tgagattgga 2640 agcgtggctg ccgttgacgc taaggcagaa gaacttaagc tgggagctgc caagaccccg 2700 atgaccaggg gtgagggcag tgacgtacct gtcaacccgg tcccaccacc aaataaaatt 2760 ctcgtccctg agactgggga gaatggggat cggagggtcg ttagaaaaga aagcagccga 2820 gctgctatga ttgtcgaggg ggagaagaaa gtcagagtgg agcacgacaa tctccaaatc 2880 ggccaaacaa cactcgatct cgagagctcc tctagtggcc cgacgttacc ggtgccgaaa 2940 agaattaaag ttggtccgac ggtacaggaa ccagagaaag ctggcccaat gtcactggga 3000 cctgaaaggc ttctgaagcg cgcgcaagaa tcaaatcaaa tgcttcgtcg atctcttcga 3060 cgccgtcagc ctgtccaagc aattgctgct gaagccaaga atgttagcga actccgtccg 3120 ggtgacgacc cggagagtta cgcagatgct gtctcccata cagcgtggcg gagtgccatg 3180 aaacaagagt tcaactctct tattgaaaat gacacctggg aggttgtcag tcacgatgat 3240 attcgtgagg caaagatcat tggatgcaaa tgggtgttca 
gagttaagca taactctgac 3300 cgttctatcc gctataaagc ccgtctcgtt attaaaggat atgagcaaac ggaatttggt 3360 gagacttacg ctccagttgc ccggctgaca acattcagaa tcctaatcgc cctcgctgcc 3420 caactcggct ggaaaatcca tcaaatggat gtcatcaccg cctttctgaa tcccgtagtc 3480 aacgatctgg tctttatggc gttgcccgaa ggcatagagt ggttgcaccc cggggcatct 3540 cctggcggag tgtgccggtt aaagaaggcc ctatatgggc ttaaagaagc tccccgcctt 3600 tggtttagcg atatcaatcg cttccttcag tcgcaaggtt ttatcccatc ctcaggtgat 3660 gctaatctct acatctcttc ctcgcacaaa gtcatactgc tgctatacgt agacgatatc 3720 cttttagcgt caagtaaccc ttcggccatt gcaacggtga agcaacttct tgtaagccga 3780 tacaagatga cagatcttgg attgtcacgc caattcctag gcatcgatat cgaacagctc 3840 ccggaaaaga ttcgtatcgg tcaacgacac ttcgtagaat cagtcctacg acggttcggc 3900 atgagcgatt gtaacggcat ttggacgccg ctcgaagttc gaccttctct ggaattcaaa 3960 ccacttgtgc ctgaggatca gcaactctat caatccttgg ttgggagtat tatgtaccta 4020 atgttaggaa cacgacctga tctagcgttc acgatatcgg tcctctcaaa attctccgct 4080 tccgccgggt cggaacacct tgcgcaggcc aagcgtgttc taagatattt gaagcagaca 4140 agagatataa agctctgcta taggagaagc gagaccccgc ttgacaggtc tctcaaggga 4200 ttctcggact cggattgggc aggggattta ggtgatcgga agtcgacagg tggatttgtt 4260 tttctactct cagagtgtgt cgtagcctgg aaggctaaga agcaaacaat tgtcgccctt 4320 tcaactaccg aggcagagta catagctgcg tcagaagctg gaagagaagc tgtttggctt 4380 cgtcgtctaa tcggagatct catggataca atctccccca ccgactcaga atcgttctcc 4440 tctccgacta tcatctacac cgacagcact ggcggacttg cgcaaatctc gaacgtgcgg 4500 catcacgagc gaacgaaaca catagacata aaataccact acgtccgcga tatcgtcgag 4560 cagggaacag tcttgtttcg ccacatctca acacacgata tgtccgcgga catacttacg 4620 aagcctctcg caagagacct ccactggcgt catatggaaa ggatcgggat ggagactgta 4680 gtctaggaag tacctttttg gggttagata tcaggcctgt ttgtttgaaa tgtttttcct 4740 tttgttatta actgttctaa ggaagagctg agagtatacc tttccccctt gagacactta 4800 tggatagggg gttttaccgg tgctcgatga tgttcaaaga ctttccttta ttgtttattt 4860 ttatttctgt tttccttctc acggtatggc agaagagggg gag 4903 // ID REALAA_I repbase; DNA; FNG; 5609 BP. 
XX AC AB025309; XX DT 03-JUN-2005 (Rel. 10.05, Created) DT 09-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE Alternaria alternata LTR-retrotransposon REAL, internal sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; gag; pol; LTR-retrotransposon; REALAA_I; KW internal portion. XX OS Alternaria alternata OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Pleosporaceae; mitosporic Pleosporaceae; Alternaria; OC Alternaria alternata group. XX RN [1] RP 1-5609 RA Kaneko I., Tanaka A. and Tsuge T.; RT "REAL, an LTR retrotransposon from the plant pathogenic fungus RT Alternaria alternata."; RL Mol Gen Genet 263(4), 625-634 (2000). XX RN [2] RP 1-5609 RA Gentles A. and Jurka J.; RT "A. alternata LTR retrotransposon REAL, internal portion."; RL Direct Submission to Repbase Update (03-JUN-2005). XX DR Genbank; AB025309; Positions 219 5827. XX FH Key Location/Qualifiers FT CDS 1870..5596 FT /product="REAL_AA_1p" FT /translation="MDRKTHSEWSQFQEKHMRTPPITVDLIINNVANAQAL FT VDSGCLCYSLVNKKFAYRHRLERFQIPVRIIEGVNGKLSEINEVARFSFKL FT HGHEETAYAYVMDLSFGEDIYLGRGWMNHNNVSVAPAKKSIFIHSTGVRVR FT STEGISKQKIRQVNAAAFSALIHRHRRAPNSVQIFAASIADIDKALQPKKR FT VDVRALLPEQYQEYYGLFDPKAAEKLPPHRGPGVDHSIELELKDSQPPWGP FT LYSMSRGELLVLRKELTSLLEKGFIRVSSSPAAAPVLFAKKPGGGLRLCID FT YRALNAITKKDRYPLPLIRETLNNLSKAKWFTKLDVIAAFHKIRVAEGDEW FT KTAFRTRFGLFEWLVTPFGMANSPSTFQRYINWTLREFLDDFCSAYLDDVL FT IYTDGSLKQHQEHVRKVLRKLQDAGLQVDIKKCEFEVKSTKYLGFIIKAGK FT GISMDPAKVAAIREWEAPQTVKGVRSFLGFANFYRKFIKNFSQLAAPLTRL FT TGDVSFRWGPEEQSAFDKLKEIFVSEPTLASFDPERETVLETDSSGFAVGG FT VLSQYGDDGVLRPCAYFSRKNNAHECNYEIHDKELLAVVRCLEEWDSELRS FT VERFKVITDHKNLEYFMKPRMLNERQIRWSLLLGRYNMELLYRPGKQNVRA FT DALSRREQDLPVGADDERLQKRFVQILKPTNSYCEELEEEDLVGAILVMAT FT RMTTVPPTKQGSVQPSTEETTAEQNELEKLWQEAVSRDRVYQGAKKTIKAQ FT ERRFPLSLGLKCSTEDCSVEGNLLLYRGRKWVPDSEKLRTQIISGAHDSLA FT TGHPGREVTYKILARDYFWPGMTQTIRRYVRNCSTCGRSKSWREGKQGLLK FT 
PLPIPAQIWKEISMDFVEGLPESEGMTNLMVITDRLSKGTIFVPLPNIKTD FT TVVQKFIERVVAYHWLPDAITSDRGRQFVSVLWTKLCELLKINRRLSTAYH FT PQTDGATERMNSVWETYIRSFTNWAQNDWALLCPMAQIAINGRTATSTSMS FT PFFLQHGYEVNPLQIEPEVGSRASNQTEGLSDVQKAQVIASKLQQAIELAQ FT ASMAESQQEQERQANKTRREAQNFRVGDKVWLKLDQQYSTGRSSKKLDWKN FT AKYTVIRVVDSHSVELDTPPGPHPVFHVDRLKLASTDPLPSQDQDDSQPQP FT LQVDGEDEWEIEDIVAEEVRRRGRGRKLYYEVKWKGFHQTTLEPAELLKDA FT EAVDRWEAFTEAERDGEGRLPEGFRRGDVVSPX" XX SQ Sequence 5609 BP; 1607 A; 1393 C; 1547 G; 1062 T; 0 other; ttttgtattc gatcccacta ggcagtaaca ctgtctgctg aaggctcttc agcacgtacc 60 ttagttaagg cgcgtaagtg gtgttgttta ggcacgtaag gcccgcacaa caacttaaca 120 agcgatcgca agaagcgcgt taattaagca actagccttg tagaaggcag tcaaaggaaa 180 agccctacta gaatcctaca ggtggattgt aggtaccgtt actagcgtcc ctctgagcag 240 gaatcgcagc aacggtgaga agtcctatcg cgtcttggaa ggccgagccg taggcacagt 300 cgccgcgtga gaaactatct gaaggctatt cggcctcgac gacaggtagc tataagcggg 360 cactgcgacg agaagtcctg ttgattctcc accgaggagg tgcaggccgt gcgactagtc 420 ttcaatcgga agacaaatac cacgagcgtg acggtgagac gagagttacc tagggacgca 480 aaggatagca gccgattgct gctcagggaa gctcacctga cgagcatacg aacaagccct 540 atcagtcctc accgacgagg agtaggccac agtccgggac tgtgcgagac gctcaaggag 600 cgaacagaag ccctgtacga tggcagacaa cgttccgaac ttgaacgtag aagagctggc 660 ggcttacgcc accactaaga gccaggcagg agtctcagac gaagagatcc tgcgcgaaat 720 cacacagatg ctgagagagg acggatggaa ggaggcagcg attggaccaa caatcgaagc 780 ggttgttaaa cgatcatcag tcgtgggagg gaaaggctcg ccaagacgag gcgactcacc 840 aaaggctaat caggacgcca tggcccagat gatgcaacag atgatgcgac tacttagccc 900 gatggctagt cgtctagaag cgctcgagcg gagccaagct gaaggacgct cagaatcgtc 960 cttaggagcc aacaccaccc cgccactatc gacaccagtc gccgagccag ctgtccgcaa 1020 gaacaagttc cctgacccag agcgcttcga cgggacaaga ggtaactacc caggctggaa 1080 gtttgagtgt gaagggaagc tagagtacga ttgcgctatg ttcccaacag aggacgcaag 1140 agtcagatac gtgcttagcc gaacaaagga caaagcaaac caagtccttc tcccttgggt 1200 actcgagcat aaggaacgaa cagtcgtcgg cctgtgggag catatggata cccacttcca 1260 ggacgtacac cagcagcagc gtgcgctaaa 
taagcttcgg cacctaaagc aaggacggag 1320 accaatacgc gattacgttt cagaatttaa ccagcttcgc gtagaatctc gtcaacagtt 1380 tagtccagta gtagctcggg agatgttcag cgagggcctt agagaagagc tccagaagct 1440 gatgctccac actccaaaga acggaagcct gaaggagtac atggacaagg caatcgaact 1500 atctgacgac ctgtaccgga tccagctaca tgggcggaac caaaggagca cggctcatga 1560 cggcgcgcag aaccacccga gagccgttca gcgagaagct agcccagagg ctatggattg 1620 ggaaccttca aaagttagcc aggcacgaga atctagagtt aaaacgaagc gtgctccgct 1680 cacgtgctac agctgcggca agccgggaca catcgctagg gactgccaga gcaccacccg 1740 ggtcaggaga gcgaaggccg tgcctaacaa gaacagacga gagcttgaga aggagcttga 1800 ggacctgtca agcagcgact cggaaaaaga ggagctctga gccagagtct cggcttagag 1860 cctctgaaga tggatcggaa aacacatagc gaatggagcc agtttcaaga aaagcatatg 1920 cgcaccccac caattactgt tgacctaatt ataaacaacg tcgcgaacgc tcaggcgctc 1980 gtcgacagcg gttgcctatg ctactcactg gttaataaga aattcgccta ccgccaccgt 2040 ctagagcgct ttcagatccc tgttcgaatc attgagggtg taaatgggaa gctgtcagaa 2100 attaacgaag tcgctcgatt ctcgtttaaa ctgcacgggc acgaggagac tgcctatgcg 2160 tacgtgatgg atctatcctt cggagaggat atttatctag gaagaggctg gatgaaccac 2220 aataacgtgt cggttgcacc agcaaagaag agtattttta tccactcgac gggcgttcga 2280 gtgagatcaa cagaagggat ctctaaacaa aagatccgac aagtaaacgc agccgccttt 2340 tcagcgctta ttcaccgcca tagaagagcc ccgaacagcg ttcagatctt tgccgcatca 2400 atcgcggata ttgataaagc cttacagcct aagaaacggg tagacgtgag agcactgctc 2460 cctgagcagt atcaagagta ttacggtctc ttcgacccta aggcggcaga gaaactccca 2520 ccacaccgag gacctggtgt tgaccacagt atcgaattgg aactgaagga cagtcagcca 2580 ccctggggac cactgtacag catgtcccgt ggcgagctgc tagttctgcg aaaggaactt 2640 acgtctctgc tcgagaaggg atttatacgc gtaagcagct cacctgcagc agcaccagtg 2700 ctatttgcta agaagcctgg gggagggctc cgattatgta ttgactaccg cgcgctgaac 2760 gcgattacta agaaggaccg gtaccctctc ccattgatcc gagagacgct caacaacctg 2820 agcaaggcta agtggtttac taagctagat gtgatcgcag ccttccataa gattcgtgtc 2880 gctgaaggcg acgaatggaa aacagctttc cgaacacggt tcggactgtt cgaatggcta 2940 gtcaccccct ttggtatggc caactctcca agtaccttcc 
agcgatatat caactggacc 3000 ttgcgggagt tcttagacga cttctgttcg gcgtacttag acgacgtgct aatctacact 3060 gatgggagcc taaaacagca ccaagagcac gtccgaaaag tgctcaggaa gctacaagac 3120 gcagggttac aggtcgatat caagaagtgc gagtttgaag ttaaatcgac caaatactta 3180 gggtttatta ttaaagcagg caagggaatt agcatggatc cagctaaagt cgccgcgatt 3240 cgcgaatggg aggctcccca gacagttaag ggcgtaagat cgttcttagg gtttgcaaac 3300 ttctaccgga aatttattaa gaacttctcg cagctagccg cgccattaac aaggctcacg 3360 ggcgatgtgt cgttccgctg gggtcctgag gagcaatcag cattcgacaa actaaaggaa 3420 atctttgtgt ctgaaccgac gctagcgtca ttcgatccag aacgcgaaac agtgcttgaa 3480 accgactctt ctggatttgc ggttggcgga gtactgtcac agtacggcga cgatggggta 3540 ctacggccgt gcgcgtactt ctcacggaag aataacgccc acgagtgcaa ctacgagatc 3600 cacgacaagg agctactagc tgttgtacgt tgtctggaag aatgggactc tgagctacgg 3660 tcggtagaga ggttcaaggt gataacagat cacaagaacc ttgagtactt tatgaagcca 3720 agaatgctga acgagcgtca gataagatgg tcgctgctcc tgggtcgtta caacatggag 3780 ctgttgtatc ggccaggtaa gcaaaacgtg agagcagacg cgctttcacg acgagaacaa 3840 gacttgcccg taggggctga cgacgaacga ttacagaaac gtttcgtcca aatacttaag 3900 cccacgaact cctactgcga ggagttagag gaagaagact tagtgggggc tatcttggtc 3960 atggctacga gaatgaccac tgttccccct actaagcaag gatcggtaca gccgtctact 4020 gaagaaacga cggcagaaca gaacgagttg gagaagcttt ggcaagaagc agtgtcgaga 4080 gacagagtct accagggcgc gaagaagacg attaaggcac aagagaggcg attcccactt 4140 agcctaggac ttaagtgctc aaccgaagac tgttcagtcg agggcaacct actactctat 4200 cgaggacgta agtgggtgcc cgacagcgaa aaactacgga ctcagatcat tagtggcgcg 4260 cacgactcct tggccaccgg ccacccagga cgcgaggtaa cgtacaagat actcgcgaga 4320 gactacttct ggcccggtat gacgcagacg atccgaagat acgtcaggaa ctgcagcacc 4380 tgcgggagga gcaaatcgtg gagagaaggc aagcaaggcc ttttgaagcc ccttccgata 4440 ccagctcaga tctggaaaga gatctcgatg gacttcgtgg aaggcctacc agagagcgaa 4500 gggatgacga acctgatggt gatcacggat cgactaagca agggcaccat ttttgtgccg 4560 ctccctaaca tcaagacaga cacagtagtc cagaagttta ttgaacgagt ggttgcctac 4620 cactggctgc ccgacgcgat cacgtcagat cgcggccgcc aatttgttag 
tgtcttgtgg 4680 acgaagctct gcgagcttct gaagattaat cggagattat caacagcata tcatccgcag 4740 actgatggcg caacggaaag aatgaacagc gtatgggaaa cctacattcg atccttcacg 4800 aactgggctc agaacgactg ggcgttgcta tgcccaatgg ctcaaatcgc tatcaatggc 4860 cgtaccgcaa cgtcaacaag catgtcacca ttctttctgc aacatggata cgaggtaaac 4920 cccctacaga ttgagcctga ggtcggttcg agggcgtcaa atcaaaccga agggctctca 4980 gacgtgcaaa aggcacaagt aatcgcgtcg aaactccaac aagcaataga gctagcgcag 5040 gcaagcatgg ccgagtctca gcaagaacaa gaaagacagg cgaacaagac tcggagagaa 5100 gcacagaact tccgggtagg ggacaaagta tggctaaagc tcgaccagca gtatagtaca 5160 ggtcgaagct ctaagaagct tgactggaaa aatgccaagt acactgtaat ccgagtagtt 5220 gatagccact cagtcgagct agacaccccg cctggcccgc acccagtgtt ccacgtagat 5280 cgactgaagc tcgccagcac cgacccacta ccaagtcaag accaggatga ctcgcaacca 5340 cagccactgc aggtggatgg agaagacgaa tgggagatag aagacattgt agcagaagag 5400 gtacgccgcc gcgggagagg gcgtaagcta tactacgagg tgaagtggaa agggttccac 5460 caaacaacgc ttgaaccagc ggagctactc aaggatgctg aggcagtaga tcgttgggaa 5520 gcatttacag aggccgaacg agacggcgag ggtcgcctcc cagaggggtt tcggagaggt 5580 gatgtagtat caccataagg aggggggta 5609 // ID Copia-4_TMe-LTR repbase; DNA; FNG; 217 BP. XX AC CABJ01002876; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_TMe_; KW Copia-4_TMe-I; Copia-4_TMe-LTR. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-217 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01002876; Positions 8207 7991. 
XX SQ Sequence 217 BP; 70 A; 47 C; 50 G; 50 T; 0 other; tgatggtata ataatcccag tagtagtatg ttgcactgga gaaacatatt agggcacgca 60 tggaggtata gggcatagca gggatttagg gctcacatgt acgagcagag ttttcactat 120 aaagtgaacc agtagctagc aacctctata gaaatgatga accagaacat accttcctag 180 gaccacatct gcgcacgatc atagccgacc tccatca 217 // ID Mariner-4_AF repbase; DNA; FNG; 2019 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of Mariner DNA transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-4_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-2019 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-2019 RA Kapitonov V.V. and Jurka J.; RT "Mariner-4_AF, a family of Mariner DNA transposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 101-101 (2006). XX DR [2] (Consensus) XX CC It is a family of DNA transposons from the Mariner superfamily CC (Tc1 clade). The genome harbors 15 copies that are 99.8% CC identical to the consensus. It encodes a 493-aa Mariner-3_AFp CC transposase (five exons at pos. 124-793, 853-1143, 1255-1326, CC 1366-1803, 1897-1907). 
XX FH Key Location/Qualifiers FT CDS join(124..793,853..1143,1255..1326,1897..1904) FT /product="Mariner-4_AFp" FT /translation="MPKSSKINESYLLEACEAAQAQKKPNISKIAREYGVP FT YATLRDRVKKHVHPRLANKPVNRALKGYQEEALIQWIVCMRDRNMPVTPKL FT LEEYANQALRRAGESRQVSKMWAYRFEKRLPEHLNLGPAKQKIKESKRIQA FT EDAGLLTHWYNQLAGVVKKDTPARLVYNFDECGFQPGEGKSRKVISSKGSK FT VPDLAESERGENITAIECVAADGWQMDPWFIFKGNGIFMESWFNESEALPP FT DTTIATSPNGWISDELAVQWLQSFINATNERTKKGEKRILIFDGHGSHLTV FT EFLQLCEDNGVIPFGFLPHTTHLCQPLDVREKAFNQRIIREAFKDRGIWPV FT NNNP" XX SQ Sequence 2019 BP; 586 A; 467 C; 470 G; 496 T; 0 other; tctgtgacta aggctccccc tcaccgctcc gcgttgaggg tgagccacag cgaaaccacc 60 aaccacaccg aatccaccat ttttcgatat tttgactatt tgaagctatt ccaaccaccc 120 aacatgccta aatcttctaa aattaatgaa tcttacctcc tcgaggcctg cgaggccgct 180 caggcccaaa aaaagccaaa tatctccaag attgcgcgtg aatatggtgt tccttatgcg 240 accctacgtg atcgcgtcaa gaagcatgtg catcctcgac tagccaacaa accagttaat 300 agagccctta agggatacca ggaggaggcc ttaatccagt ggatagtctg tatgcgtgat 360 cggaatatgc cagtgacgcc taagctacta gaagagtatg caaatcaggc acttcgacgc 420 gcaggggaat ctaggcaggt tagcaagatg tgggcatatc gctttgagaa acgacttcca 480 gaacacctta atctaggccc agcgaagcaa aagataaagg aatccaagcg tatccaggct 540 gaggatgcag gtttactgac acattggtat aatcagcttg caggggtggt taaaaaggat 600 acaccagcgc ggttggtata caactttgat gaatgcggct tccagcctgg cgaaggcaaa 660 tctaggaagg taatcagttc aaaaggttca aaagtgcctg atcttgccga atccgaaaga 720 ggagagaata ttacagctat tgaatgtgta gctgcagatg gatggcagat ggatccctgg 780 tttatcttta aaggtaagtt cctaatgatt cttcccttcc tttaaatact aatctatgct 840 cgcttcctaa aggcaacgga atcttcatgg aatcttggtt caatgagagc gaggccttac 900 caccagatac tacgatagcg acgtctccta atggctggat atcagatgaa ctagccgttc 960 aatggcttca aagctttatc aatgcaacaa acgagcgtac aaagaagggg gagaaacgga 1020 tacttatatt tgatggccat ggctctcatc tcactgtcga attcttgcaa ctttgcgaag 1080 ataatggcgt tattcccttt ggattccttc ctcatacgac acatctttgc cagcctttgg 1140 atggtaagcc attcttaagc tataagcaac actttcgtcg tatgaataat gagctatctt 1200 actgggctgg tgagcctgta 
gggaagtcag aattcttaca catgattgga ccagtacgag 1260 agaaggcttt caaccaaagg atcatccgtg aggccttcaa agatcgtgga atctggccag 1320 tcaatagtaa gatagctgac gatcttgcta tcttactatg ggaaggaatt ccggatatct 1380 acgcgcctga tcttgataag atgactccct ctacgccacc ctctcagccg ccatctcggc 1440 cgccatcttc atctagtatt gatatctcac ctccgcggac gattcaggcc cttaagaaga 1500 accaggcaaa gctatctaag catgcagatc tgcttacacc aaagctgcag cgaaatcttg 1560 aacggatatt tgaacataac cgaatcgccg ctgagcacct ggctatagca aatgaaacta 1620 ttggtcgaat aagggccgcg caggcccccc tacggcgtca atatactaag cggcaggtta 1680 agccgcttag ccagtctggt atattgacat tacgtgatgc aaatcgatca attgcttcaa 1740 ggaaggccaa agatgctgcc gcgcaagaga gacgtttaca aacgcaatgg gagaaagtgc 1800 atggtaagcc cccaccacta gcatctacat aagagaatat ggtatcaaat ggattagcaa 1860 aggcagtaga tgagaatagt aatttttttt atatagataa cccctagtat ggcatcaaaa 1920 tcatgatttc tatcgaaaaa tggtggattc ggtgtggttg gtggtttcgc tgtggctcac 1980 cctcaacgcg gagcggtgag ggggagcctt agtcacaga 2019 // ID Gypsy-115_MLP-I repbase; DNA; FNG; 5579 BP. XX AC AECX01000719; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-115_MLP_; KW Gypsy-115_MLP-LTR; Gypsy-115_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5579 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000719; Positions 120817 115239. XX CC Positions [4124-4603] - Integrase core CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 192..2795 FT /product="Gypsy-115_MLP-I_2p" FT /translation="MNTRQSRSQTGQTGPLPDPESIISAARKASRIAKQDI FT QTTPDPDNTTVPFVTDPPPTQPPDLPITPSRPRTDTLAPFKTPLPYLQTFR FT RKEPPLEMSNPGSSTPDDKSMPDLVRLVLESQLASNKRMERLEDVVLKLAE FT NQTSTPLDAATLVSRDEGSIDLTKFRTSDGPVFKGPYHDVEAFLTWFSSLK FT TFYCTKGVVMYEDRIILIGSYLAEPNAHAFYEGGLKKFIKGSWHAFIKILF FT AVCLPNDWMDKLYEKAQHLQMSAYEDFKTYSTRACTIQNLINFDDVKLDDY FT QLAKFVKFGMLEELKAATKLWGYTKADKEFTYHLFESQVETLYKSLTASHI FT ISKKSRTVAQPSHFASTRAITSNAPVRNSAPRLSNEEFAWKIDSYRDLLGI FT CHFCKGYCGSEYRACKGPYVKQRVNFPPGYVAPPKPANYVPPRARSSPPAT FT TQPAGRATQPPAGRSSTSYAKAAAAEEFPELDQASVAALQALDKELEQEEG FT EGCVTDSKTPRVILEFMCNGKRMRALADPGAEINFLTDHAARTLKLNQRQL FT VRPTQLGLAVSTDSPPPLLTHFTIANLVDETSGRKFDRTYFKIGEVGGEFD FT MILGTPFFKIFQFSVSVNQRAIICERSGMRIQDFRVLNQMREREQCENLMR FT VSEPKKVTREKWQDEVSKVEEVSPKLKASQLFMMERGEKLEKEMIEEFSDL FT FPVDIPAVSDEAEEKGLFIDGSFPEKLQKESSKIRHKIVLKDPDAVINEKQ FT YRYPLKHLKVWQCLTDQHLAAGRIRCSTSQYASPSLIIPKKDPNKLPRWVC FT DYRILNSLTVRDRASLPNVDKLVRLVASGKFFSIIDLTNAFFQTRMREADI FT PLMAVHLPVVCTNGA" FT CDS 2798..5020 FT /product="Gypsy-115_MLP-I_1p" FT /translation="MPMGLTNAPSTHQGQLEEALGDLLNTICVVYLDDIVV FT FSNSEEEHIKNTHQVLQKLREANLYCSRKKTKLFRSEIKFLGHWISAEGIR FT VDDEKVSQVLNWKTPKSARGIKKFLGTVQWMKKFIWGLQNYVNKLTPLTSS FT KLDPSRFRWGKGEDEAFNNIKKLMTSLPHLKNIDFDSEDPLWVFTDASGSG FT LGAALFQGKEWKLASPIAYESRQMTPAERNYPVHEQELLEVIHALNKWQML FT LLGMKVNVMSDHHSLTYLMRQKTLSRRQARWIKFLADYDVEFKYIKGEENT FT VADALLRKEIEEDEASPEGIDCIATLIEAGPRLSPSVKTRIQAGYADNKFY FT GAVTSVLPLREECIISDNLLFIDGRLYIPGGGGLRLELIEEAHTRLGHLGY FT LKTVTDLRRDFFWPKMAKQIENFVQSCDVCQRIKTPTSAPAGRMLTPTIPT FT TPLDCLAMDFVGPLPKVNNYDMLLTITCQLSGFTRLIPCGQTDTAEKTASR FT LFSGWVGTFGAPVSIISDRDKTWTSNFWKALMKKLGPSFHMSTAFHPQADG FT RSERTNRTVGQILRTFATKRQGKWLEALPAVEYAINGAVNISTGKSPFDLV FT LGRPQRKLTTHASSDGDPLAMTKWLSMREDSWADTKDALWKSQIQQALQHN FT KRVKQTEPLTTDSWVLLNSADWRGKHTGGVHKLKEKFEGPYRVLKSFNHGQ FT DVELDLPEGDKRHPVFHISKVKPYVEREEEEGAPLGTGLQK" XX SQ Sequence 5579 BP; 1637 A; 1310 C; 1277 G; 1355 T; 0 other; gttttttttt ctcaaacgcg aaacatccaa 
aacgccgacg acgatcactg gcgcagctgt 60 atgacactac cccgtacacc tcgcgttcaa ttttagtcaa tccaaatcaa ctcaactaca 120 ccttcttggt tctccaccaa ttggagaacc cccggcctac ggtcgtttga attttcttcc 180 tttttctttg catgaatact agacaatcaa gaagccagac aggtcaaaca ggacctttgc 240 cagaccctga atcgatcatc agtgcagctc gaaaagccag tcgtattgca aagcaagaca 300 tccagacaac gccagaccca gacaatacaa cagtgccatt cgtaacagac cctccgccaa 360 ctcaaccgcc agacctcccg ataacaccat ctagacctcg aactgatact cttgcgccgt 420 tcaaaacacc actgccgtac ttacaaactt ttagacggaa agaaccgcct ctcgagatga 480 gtaatccggg ttcttcgaca ccagacgaca aatctatgcc ggatctggtc cgactggtcc 540 tagaatctca actagcgagc aacaagcgta tggagaggtt agaagacgtt gtcttgaagc 600 tagccgaaaa tcagactagt accccgttgg acgcagcgac cttagtgtca agggatgaag 660 gcagcatcga tctcaccaaa ttcagaacat ccgacggtcc agtttttaaa ggaccttatc 720 atgatgttga ggcgttcctc acctggttct cctccctcaa gacattctat tgtacgaagg 780 gagtggtcat gtatgaagac cgaatcatct taatcggtag ttatctggcc gagccaaacg 840 cccatgcctt ttacgaaggt ggtttgaaaa agtttatcaa aggatcttgg catgcgttta 900 tcaagatact cttcgcagtg tgcttaccca acgactggat ggacaagctg tatgaaaaag 960 cacaacatct acaaatgtca gcctatgaag acttcaaaac ctatagcacg agggcttgca 1020 ctattcaaaa cttaatcaat tttgacgacg ttaagctcga tgattatcag ttggccaagt 1080 ttgtcaaatt tggcatgtta gaagaactca aggcagcaac aaaactttgg gggtacacta 1140 aagccgacaa agagttcact taccacctat ttgaaagcca agtcgaaacc ttgtacaaat 1200 cgctcacggc gtcgcatatc atcagtaaga agtcccgcac cgtggcacaa ccaagccatt 1260 ttgcgtcaac tcgagcaatc acgtccaatg ccccagttag gaactcagct ccgagactaa 1320 gcaacgaaga gtttgcgtgg aagatcgatt cctatcgcga tcttttaggt atttgtcact 1380 tttgtaaggg atactgtggg agcgagtatc gggcctgcaa aggaccgtat gtcaagcaaa 1440 gagtcaactt cccgccagga tatgtagcac cacctaagcc agcgaattac gtacctccaa 1500 gagcgcgatc ttcaccgcca gcaacaactc aaccggcagg acgagctacc cagcctccag 1560 caggccgatc atctacatct tacgccaagg cagcagcggc tgaagagttc cccgaactag 1620 atcaagcttc cgttgccgcc ttacaagcct tggacaaaga gttggagcag gaggaaggag 1680 aagggtgcgt caccgattcc aagactcctc gtgtcattct agaattcatg 
tgtaatggca 1740 aacgaatgcg tgctctggct gacccggggg ctgaaatcaa cttcttaacc gaccatgcag 1800 cccgtacgct taagctcaat caacgccaac tggtccggcc aacacaactg ggcctggcag 1860 tctcgacaga ctctccacca ccactattga cacacttcac cattgcgaac cttgtcgatg 1920 aaacgtcagg gaggaaattc gaccgaactt acttcaagat aggggaggtg ggaggagaat 1980 tcgacatgat tctgggcact ccgtttttta aaatctttca attttctgtg tctgtcaatc 2040 aaagagcgat tatatgtgaa cggagtggaa tgagaataca agatttcaga gttttaaacc 2100 aaatgagaga gagagaacag tgtgagaact tgatgagagt tagtgagccg aagaaagtaa 2160 cacgagagaa gtggcaagat gaagtgagta aagtagaaga ggtgtcacca aaattgaagg 2220 ccagtcagtt gttcatgatg gagagaggtg aaaaattaga gaaggagatg attgaggagt 2280 tcagtgacct ttttcctgtg gacataccag cagtatcgga tgaagcggaa gagaaaggct 2340 tattcattga tggctcattc ccggagaagt tgcagaagga gagctcaaaa attaggcata 2400 agatagtgct gaaagacccg gatgcagtta tcaacgaaaa acaatatcga tatcccttga 2460 agcatttgaa ggtgtggcaa tgtcttacag atcagcattt agcagccgga cgaatcagat 2520 gctctaccag tcaatatgca tcaccgagtc tgattatacc aaagaaggac cctaacaaac 2580 ttccaagatg ggtatgcgac taccgcatct tgaacagtct gactgtgagg gaccgtgctt 2640 cgttgcctaa cgtggacaag ttggtcagac tggtagccag tgggaagttt ttttcaatca 2700 tcgaccttac gaatgctttc ttccaaacaa ggatgagaga agcagatatc ccacttatgg 2760 cggtccacct cccagtggtt tgtacgaatg gtgcgtgatg cctatgggct taacaaatgc 2820 acccagcaca caccaaggac aactcgaaga agcactagga gatttgctaa acacaatttg 2880 tgtcgtttac ttggacgata ttgttgtgtt ttctaactcc gaagaagaac atatcaagaa 2940 cacacatcaa gtacttcaga aattaagaga agcaaactta tactgcagcc gaaagaagac 3000 gaagctattc aggagtgaaa tcaagttctt aggtcattgg atatcagctg aaggaatcag 3060 agtagatgat gagaaagttt cccaagttct caattggaag acgccaaagt cagcaagagg 3120 catcaagaaa tttctcggga cagttcagtg gatgaagaag tttatttggg gacttcagaa 3180 ttacgttaac aagcttacac cattgacaag cagtaaactg gaccccagca gatttcgctg 3240 ggggaaagga gaagacgagg catttaacaa catcaagaaa ctcatgacgt ctctacctca 3300 cctgaagaac atcgactttg actctgaaga ccccctgtgg gtattcacgg acgcgagtgg 3360 ttcaggatta ggtgctgctt tatttcaagg gaaggagtgg aaactagctt caccaattgc 
3420 ttacgagtca cgtcaaatga cacctgccga acgcaactac cctgtgcacg agcaggagct 3480 cttagaagtc atccatgcgc tgaataagtg gcaaatgctc ctattaggca tgaaagtcaa 3540 tgttatgagt gatcaccact ctttgacgta tctcatgcga caaaagactc tgagtagacg 3600 tcaggcgagg tggatcaaat ttttagcgga ttatgatgtg gagttcaagt atattaaggg 3660 agaggagaac acggtagctg atgcgttatt gcggaaggag attgaggaag atgaagcatc 3720 cccggaaggt attgattgca tcgcgactct aattgaagca ggtccgaggt tatcaccgag 3780 tgtgaagaca cggattcaag cagggtacgc ggacaacaag ttctatggag cggtgacctc 3840 agtactacct ttgagagaag aatgtattat atcagacaat ctgctgttca tcgatggacg 3900 actttacatt ccaggaggag gaggactgcg actggaactc attgaagaag ctcatacaag 3960 gctaggacac ttaggatacc tcaaaacagt caccgattta cggcgtgatt tcttttggcc 4020 aaagatggcg aagcagattg agaattttgt acaatcttgt gacgtctgtc aacgaatcaa 4080 gacacctacc tcggcgccgg cgggtcgaat gttgacacca accataccta ccactcctct 4140 cgactgttta gcgatggact ttgtaggtcc tttacccaaa gtcaacaatt acgacatgct 4200 cctcaccatc acctgtcaac tgtccggatt cacgagactt atcccttgtg gacagacaga 4260 cacggcggag aaaacagcat cccgactctt ctcgggttgg gtaggtacat ttggtgcgcc 4320 ggtctcaatc ataagtgacc gagacaaaac ttggacctca aatttctgga aggcccttat 4380 gaagaagctg ggccccagtt ttcacatgtc tacagcattc catcctcaag ctgatggccg 4440 gagtgaacga acgaaccgta ccgttggtca aatcctacgc accttcgcga caaagcgtca 4500 aggcaagtgg cttgaggctc tcccggcagt cgaatacgcc atcaacggag ccgtcaacat 4560 ctccacgggg aaatctccgt ttgacctggt acttggtagg ccccaaagga agctgacaac 4620 acacgcatca tctgatgggg accctctggc gatgactaaa tggctcagca tgagagaaga 4680 ctcgtgggcc gacacgaagg acgccttgtg gaaaagccaa attcaacaag ctctgcaaca 4740 taacaagcga gttaaacaaa ctgaaccgct aaccactgac tcgtgggtgc ttctaaattc 4800 ggcggactgg cgaggcaaac atacaggagg agttcataag ctgaaagaaa agtttgaagg 4860 accatatcgc gtgctcaaga gttttaatca tggccaagat gttgaactcg atctcccaga 4920 aggcgacaaa cgacatcccg tttttcacat ctcaaaagtg aagccttacg tcgagcgaga 4980 agaggaggag ggtgcaccgt tagggacagg attacaaaag taagtcaccg ccgggtctct 5040 actaccccaa aagttgtcaa aacataccca ggccactatg tgagcacaca ttcttcgact 5100 
gtctcccttt agatctcata cacagacaac agttgcacac tctcagggta cctggacaac 5160 aaatctcgtg gttttctttt ttttcttttc tgtctttctt ttctattttt tttctttctt 5220 gctttctttt ttcaattttg cttctttctt tcaatttttt acttcttgtg gtacctcaag 5280 gccggttgaa gcttcctttt ataggagggg agactgtaag acctatggtc attacacata 5340 cgtacaaata ctaaagtact taataattaa tacttaagtc ttagagactc ctttatatat 5400 ctttgttcaa ttccacgcta tggttctgta gtgatcaggg actgacttcc caccccatat 5460 cctacggaac caggtaagcg tatatcttat ttctttcttt cttatgattc ttgcgtttat 5520 atttgtagag tgatcaggga ctgacttccc accccatatc ctacggaacc agattctat 5579 // ID Gypsy-3_RO-LTR repbase; DNA; FNG; 463 BP. XX AC AACW02000296; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_RO_; KW Gypsy-3_RO-I; Gypsy-3_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-463 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000296; Positions 57165 57627. XX SQ Sequence 463 BP; 140 A; 72 C; 92 G; 159 T; 0 other; tgatatgtgg ctaaatccgt ctcttcgtcc tgtatttaat cacgtgtttt gcggctcgga 60 ttgttgttgt gcggaccggg taatattttc ttattgtttg gtgcttttgt ggctgtattg 120 ggaaatagtg tcaagttaca cagatcgccc ctttagctca gacgcgaaag atattagagt 180 tcccgtaaga caagttagat atgaagattc tatggtgtct aaatatatac aagatcgaga 240 actcaattgt tctagaagaa caagaagtaa cggaatgtca aaaagagttt tattataaat 300 agtgttgatg tgataataat aaacgagata cagtcttctt cccttttttt ttatatactt 360 actttacgct tttaaaagtc ttactgccta aagaggttcc catactatca attgaaagtc 420 gtattattca aggtaaacat agttggggaa aacacacata tca 463 // ID Gypsy-10_CCO-LTR repbase; DNA; FNG; 458 BP. XX AC AACS02000004; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 
16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_CCO_; KW Gypsy-10_CCO-I; Gypsy-10_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-458 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000004; Positions 1645489 1645032. XX SQ Sequence 458 BP; 91 A; 153 C; 65 G; 149 T; 0 other; tgttacgaac tcccctattt cttcctttgt ttccctctac gcactctacc tttccgaacc 60 ttccggactt ccgttctctc cgtagttccg ttaccttcgc ttcttcgatt ctcccgcatc 120 tccgaacctt tcgtacttcc gtttctctcg tattccgtac tcatcgatac tccttacttg 180 tcgctttcac accttctttc gctctcttcc aaacctcttg atagctcgcc aactctacta 240 taaatactgt ctctctgtag tagatattcc tcagcctgtt atcgagaatc tttcctctag 300 aacttctatc gctctcagct tgatcctctc tgcctgcgat agcgacaggc cacttgaact 360 cgactccaga accagaactt cgatccagaa ctctagaact ccagaatccc agaacgactg 420 ttggcaacga taagtcctcg ttgcccgaag tcgtaaca 458 // ID Gypsy-19_RO-I repbase; DNA; FNG; 5590 BP. XX AC AACW02000006; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_RO_; KW Gypsy-19_RO-LTR; Gypsy-19_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-5590 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000006; Positions 247294 241705. 
XX CC Positions [4135-4614] - Integrase core CC 'CCTGC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 10..5484 FT /product="Gypsy-19_RO-I_1p" FT /translation="MNSSINQSPSKNVEKDSADEVAKSLDRMDLETLESRA FT SNSSIHPRDDPSIETDVEMDEATNSSPAVPSSDPPNYLERLMVQKTMFFDL FT IEKLNNNKASFILENKLTLLEENEKQLSYAVSALNDVNALIASQFELRNHE FT LKGSGSKNSSPKDRLIPSREIPRFNVNPTASALYQLTLQEGDKNVNPSNEP FT SLDMFIRDFQRKFLDYDVSIEDHWLHYLEVSFEKSDSDTDHDWFERFIKRR FT CMDKKTALNWDQAKEVLKQRFDLASQTTPEMWFKNLINFRQERCETLTQAM FT DRYRLFSLGAKVNMHDNTFLIGHFVSRLHTVKFQEMVQATITRNLPSLVSA FT SSNTNGESVYSTRNLPIPLPKEWNVLESILIKEMANLESALLNILKDKKKE FT ENTKKLGEHDDNANTKKRKMVHQSEHINQKVSKSSVEFQQEIADLKKQGVC FT TFCKTAKYSPGHYSSCSVRLEYVKRKHGKQIVSNSSKGNDKTVSNLELSNS FT ITSISQTNKSPKNRVTDLSLSPSNNESTSQSYELNESSDDEFEFLYKHYDR FT NIICSNKNINANFDEEYNKNVFAIRNNKVGEDDSLFESDNVAYSPVTPITL FT NNVNTYGIIDTGARVSVMNKKFAIDNNIKFHSVPGNLVLANGDQIPRMRSV FT DYVNVEYDNIDYIIKHKFDIIDDHASSYNCKILIGTDLLPKLSIHLMNVAV FT KHKSTEKELDDSILDKAYKPNISRAGTEKEQKVFDMAIKPYIEANKKLNKN FT SLCNIKEAVLHLPTPKDYVANIKQYPIAYTLQPKVMDIINSWLDEGIIIPA FT TPSGWNLPMTVTHKKNLDGTKTDKIRLVLDPRMLNKVLPVDNHQLPLINDI FT FNSMSGAVIFSTLDLKSAFNQFPVNKDDQIKTTFTAPNNLQYMYRGAPFGI FT STISQLFSRIMLTLFKDLPYVKCFVDDICIFSSSIHAHFLHVKKVLQILTD FT ANLKINFEKTYLAKSAVYLLGYSISASGKQIDARKLTNIFDWPRPTTQKQV FT QSFLGFVNYFRQHTPNAALLMAPLDALRCHDEKVKGPFEWTKEHQMHFDSI FT KHILSSELMLSHPDLSKPFCIATDSSDYSTGCCLYQEFEVTKPNGEISKIK FT RYIGFMSRSLSRSEKRYSVTMRELLGVVYALTQFHKFIWGTRFTLYTDHKA FT LCYIHSQKNANSMLIKWLDVILDYNFSVIHVPGLENVLPDKLSRLYPPKDT FT FEHESDIKKGKRLSSYRQNHAISETINTKKKHRAYSKYNAAINIISVIQET FT TTSDAQKDIIQVKENTDIRDIDYNINDQYLFYVQSAQPLYSDYLTPPLAEH FT QQLLIDAHNKIGHYGAEQMVKRLHNDGIHWPNLIGDCIKFIKQCKDCMKHN FT IEKRGFHPLRSIYSYYPGDHYAIDLGGPMHTTSVYNNNYFMVIIDVCTRFC FT ILRALPDKRSDTILRSLIDVFSTMGFPTKLQSDNGTEFKNSLSKDLADAMG FT YDHRFITPLHPSANGISERTVQSVKKLLAKATHGVGNDWDLYLPSIQLAMN FT NRISKRLNSTPFSLMFARKMNEPYGFRSDKDKLKEVKDKPPMSHEELMKRI FT DYMTDIVFPAIVNKTKAQIELEQAKFNDSHRLVDYAPGSHVMVRIPNKSGQ FT 
LAPAYEGPYTVVRKNKGNAYILRDETGVLMPRAYTSVELKLISNEEVIELD FT DEGNEIINFEIEAVINHRGPPKNREYLVRWKNYSSEWDEWLSADKFNDPNT FT LRNYWKNLGKKYIPPKNAPITNSPSSSELLKSTPSGTISTAMKHFSSDDEN FT NSNLVDKKSTIKSNVGSSARPVKRTRRNKPPTNNACDTPSTATRSSKRLKS FT LHK" XX SQ Sequence 5590 BP; 1929 A; 1026 C; 898 G; 1737 T; 0 other; tttttttgaa tgaattcttc tattaatcaa tccccttcta aaaacgttga aaaggattca 60 gctgacgaag tagctaagtc cttggataga atggatcttg aaactctcga aagtagagca 120 tcaaactcat ccatccatcc tcgtgatgat ccatccatcg aaactgatgt tgagatggat 180 gaagctacta actcttcccc tgctgttcct tcttcagacc ctcctaatta tttggagagg 240 ttgatggttc aaaaaaccat gttctttgat ttaattgaaa aattaaacaa taataaagcg 300 agttttattt tagaaaataa gctaacttta cttgaagaaa acgagaagca attgagttat 360 gctgtatcag ctttaaatga tgttaacgct cttattgctt cccagtttga attacgtaat 420 catgagttga agggcagtgg atctaaaaat tcatcaccca aggatagact aatcccaagc 480 cgtgaaatcc caaggttcaa tgtcaaccct actgctagtg ctctgtatca gctcactctt 540 caggaaggtg ataaaaacgt caatccaagt aatgaaccat ctttggatat gtttattaga 600 gatttccaaa gaaagttctt ggattatgat gtatctatag aggatcattg gttgcactac 660 ctggaagttt cgtttgaaaa gtctgacagt gataccgacc atgattggtt cgaacgattc 720 atcaaaagac gttgtatgga caagaaaacc gctttgaatt gggatcaggc aaaagaagtc 780 ttgaaacaac gcttcgatct tgcttcacaa actacaccag aaatgtggtt taaaaacctt 840 atcaatttca ggcaagaacg ttgtgaaact ttgacccaag caatggatcg atatcgtctt 900 ttctctcttg gtgcaaaggt caatatgcat gataacactt tcttgattgg acatttcgtt 960 tcgaggcttc acactgtcaa gtttcaagaa atggtccaag ctactattac aagaaatctt 1020 ccctctcttg tctctgctag ctccaatacc aatggagaga gtgtttattc aactcgcaac 1080 ttgcctattc ctctgcctaa ggaatggaat gttcttgagt caattctcat taaagaaatg 1140 gctaacttgg aatctgcttt gttgaacatc cttaaggata agaaaaagga agaaaatacc 1200 aaaaaacttg gtgaacatga tgataacgcc aatactaaaa aaagaaaaat ggtacatcaa 1260 agtgaacata taaaccaaaa agtttccaaa tcgtctgttg aatttcagca agaaattgca 1320 gatttaaaga agcaaggtgt atgcacattt tgtaagactg ccaagtactc tcctggtcat 1380 tactcatctt gttctgtaag acttgaatat gtcaaaagaa aacatggcaa acaaattgta 1440 agtaattctt ctaaaggaaa 
tgacaaaact gttagcaatt tagagttatc caactcgatc 1500 actagcatta gtcaaacaaa taaatctcct aaaaacagag ttactgattt atctttatct 1560 cctagtaata atgaatctac ctcgcaatct tatgaattaa atgaatcatc cgatgacgaa 1620 tttgaatttt tatataaaca ttatgacaga aatataatct gttcaaataa aaatataaat 1680 gctaattttg atgaagagta taacaaaaat gtctttgcca ttcgtaataa taaagttgga 1740 gaagatgata gtctattcga atcggataat gtggcttatt cgcctgttac tccaatcacg 1800 cttaataatg tgaacactta tggtatcata gatactggtg cacgtgtttc tgtcatgaat 1860 aaaaaattcg caatagacaa taatattaaa tttcattcag ttcctggtaa cctagtttta 1920 gcaaatggcg accaaattcc cagaatgaga tctgttgact atgtgaatgt cgaatacgac 1980 aacattgact acattattaa acataagttt gatattattg acgatcatgc ttcctcttac 2040 aactgtaaaa ttcttattgg aacagatttg cttcctaaac tttccattca tttaatgaat 2100 gtagcagtca aacataaaag tactgaaaaa gaattagacg actctattct agataaagct 2160 tataaaccaa atatttcacg agccggtaca gaaaaagaac aaaaagtttt tgatatggct 2220 attaaaccat acatcgaagc aaacaaaaaa ttaaataaga attctttatg taatattaaa 2280 gaggctgttt tacatctccc aacgccaaaa gattatgttg ccaatataaa acaatatcca 2340 atagcatata ctttacagcc taaagtaatg gatataataa atagttggtt agatgaaggt 2400 attatcattc cagcaactcc atctggatgg aatctgccta tgacagtaac tcataaaaaa 2460 aatctagacg gcactaagac tgataaaatt cgattagtgt tggatcctag aatgttaaat 2520 aaggtcttac cagttgataa tcatcaactt ccattaatta acgatatatt caattctatg 2580 tctggtgcag tcatttttag cacgttagat ctaaaatctg catttaatca gtttcctgta 2640 aataaggatg atcaaattaa aactactttt actgcgccca ataatttaca atatatgtat 2700 agaggtgctc cttttggaat atctacaata agtcagctat tctctcgtat aatgttaaca 2760 ttatttaaag atcttcctta tgtaaaatgt tttgtggatg atatctgtat attcagttca 2820 tctatacatg cacattttct tcatgttaaa aaagtgctgc aaatattaac tgatgctaat 2880 ttaaaaatta attttgaaaa gacctatttg gcaaaatcag ccgtttattt attgggatat 2940 tctatttctg cctcaggaaa gcagattgat gcccgaaaat taacaaatat attcgattgg 3000 cctcgtccta ctactcaaaa gcaagtgcaa agttttcttg gatttgtcaa ctattttaga 3060 caacatactc ctaacgctgc acttctcatg gctcccttag atgctttgcg ttgtcatgat 3120 gaaaaagtta aaggtccttt cgaatggaca 
aaagagcatc aaatgcactt tgatagtatc 3180 aagcacattt tgtcttcaga attgatgtta tcccatcctg atctgtcaaa accattttgt 3240 attgctactg attcatcaga ttacagtact ggttgttgct tatatcaaga gttcgaagtg 3300 actaaaccta atggcgaaat atctaaaatc aaaagatata ttggtttcat gtcccgttcc 3360 ctgtcaagaa gcgaaaaacg ttatagtgtt acaatgcgtg aattgcttgg tgtagtttat 3420 gccttaaccc aatttcataa attcatatgg ggcacacgat ttactttata cactgaccat 3480 aaagctttat gctatattca ttcccaaaag aatgccaata gtatgcttat taaatggctt 3540 gacgtcattc ttgactataa ctttagtgtc attcatgttc ccggcttgga aaatgtctta 3600 cctgacaaac tatccagatt atatccgcca aaggatacgt tcgagcatga aagtgatatt 3660 aaaaaaggca aaagattatc ctcatataga cagaatcatg ctatttctga gacaattaat 3720 acaaagaaaa aacatcgagc ttattctaaa tacaatgctg ctataaacat tatttcagta 3780 attcaagaaa ctacaacttc tgatgcacaa aaggacatca tacaagttaa agaaaatact 3840 gatataagag atattgatta taacattaat gatcaatact tgttttatgt acaatctgca 3900 cagccattat attcggatta tcttacacct cctttggctg aacaccaaca gttacttata 3960 gatgcccata ataaaattgg tcactatggt gctgaacaaa tggtcaaacg tctccataat 4020 gacggtatcc attggcctaa cttgataggt gattgcatta aattcattaa gcaatgcaaa 4080 gattgtatga agcataatat agaaaaaaga ggctttcatc ctctacgtag catttatagc 4140 tactatcctg gtgatcatta tgctattgat cttggtggtc ctatgcatac tacctccgtt 4200 tacaataata attattttat ggttattatc gatgtttgca cacgcttttg tatacttcgt 4260 gctctacctg ataaaagatc agatacaatt ttacgttctc ttattgacgt gttctcaact 4320 atgggttttc ctacaaaact ccaaagcgat aatggaacag agtttaagaa ctcgctctcc 4380 aaagatttag ctgatgctat gggctatgat catcgattca tcactcctct acacccatct 4440 gctaatggaa ttagtgaacg tactgtacaa tcagtcaaaa aattactggc taaagccaca 4500 catggtgttg gcaatgattg ggatttatat cttccatcta ttcaacttgc tatgaataat 4560 cgtatatcca aacggttaaa ttcaacacct ttctctctta tgttcgcacg taaaatgaac 4620 gaaccttatg gttttcgatc tgacaaagat aagctcaaag aagttaaaga taaaccacca 4680 atgtctcatg aagagctcat gaaacgcatt gattatatga ctgatattgt attcccagct 4740 atcgttaata aaaccaaagc acaaatcgaa cttgaacaag caaagttcaa tgattctcat 4800 aggcttgttg attatgcacc tggctctcat gtcatggttc 
gtattccaaa taaatctggt 4860 caactcgctc ctgcttacga aggaccatat actgtcgtac gtaaaaataa aggaaatgcc 4920 tacatcttac gtgacgaaac tggtgtcctt atgcctcgtg cctacacttc cgttgagctg 4980 aaattaatct ctaatgaaga agttatcgaa ctggatgatg aaggtaatga aatcataaat 5040 ttcgaaatcg aagcagttat aaaccatcgt ggccctccaa aaaaccgtga gtaccttgta 5100 cgttggaaaa actatagcag tgagtgggat gaatggctat ctgcagataa gtttaatgac 5160 cctaacacac tacgcaatta ttggaaaaat cttggtaaaa aatatattcc tccaaaaaat 5220 gctccaatta ccaattctcc atcttcatct gaactactaa aatccactcc atctggtact 5280 ataagtactg ctatgaaaca tttctcttct gatgatgaaa ataattctaa tttggttgat 5340 aaaaaatcca caattaaatc taatgttggc tcttcagcac ggcctgttaa aagaactcgt 5400 cgtaataaac ctccaactaa caacgcttgt gatactcctt ccacagccac tcgttccagt 5460 aaacgactta aatctcttca taagtaaatt acttttcatt tacaaaaaaa aaaaaaaaaa 5520 aaaaaaaaaa aaaaaaacca aaatagtcac gagattttca ttaaggtaat ctcatactgg 5580 tcgggggcta 5590 // ID Copia-1_AB-LTR repbase; DNA; FNG; 269 BP. XX AC GU129696; XX DT 02-DEC-2009 (Rel. 14.11, Created) DT 02-DEC-2009 (Rel. 14.11, Last updated, Version 1) XX DE LTR Copia retrotransposon - long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; KW LTR Rtrotransposon; Tab1_Sc2; Full Length; Copia-1_AB; KW Copia-1_AB-I; Copia-1_AB-LTR. XX OS Agaricus bisporus OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Agaricaceae; OC Agaricus. XX RN [1] RP 1-269 RA Sonnenberg A.S.M.; RT "Annotation Repetative elements in Agaricus bisporus. Copia RT group."; RL Direct Submission to Repbase Update (04-NOV-2009). XX DR EMBL/GenBank/DDBJ; GU129696; Positions 8 276. XX CC Full length copy of LTR copia transposable element. CC One ORF. Conserved region of Gag (Zn2HC), protease, integrase, RT CC and RNHase identified, as well as putative (-)PBS and (+)PPT CC primer binding sites. CC Target side duplication of 5 bp. 
XX SQ Sequence 269 BP; 72 A; 51 C; 56 G; 90 T; 0 other; tattggatgg gcaaactcgt gttgagtata ggagtggcag gaagtatgag aggtatgacg 60 aaaatttcct tttatattcg ttccacttgt agtatacaaa cgaaacgcaa aacacatgta 120 cacctcagtt ctcgtccccg agactgaggt aggcccgata tttcttctcc gtagttttat 180 ctaattgtac ttgtattagt gttatattct ttcaggtata gtacgaatac acagttctcg 240 tccccgagac tgagtgttat attctttca 269 // ID Gypsy-2_PPM-I repbase; DNA; FNG; 8609 BP. XX AC ABWF01003175; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_PPM_; KW Gypsy-2_PPM-LTR; Gypsy-2_PPM-I. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-8609 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01003175; Positions 14648 6040. XX CC Positions [5508-6017] - Integrase core CC 'GCGG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1090..2898 FT /product="Gypsy-2_PPM-I_1p" FT /translation="MARDPTASQAPRSLRRRPITMAEIHAAATFILHGTSS FT TPTTAANQTIASTSNTSTTVPPGMIKTEDISMIIESLSRTITTLIQPTTHA FT THNHAPAPRQQAAVHVHENTGAEQTCHYCGNRGCRVGTCEFAEIDIRDGKC FT KRNTEGKIVLPNGSFCPRTIPGLTIRDRIYEWHRRNPAVPAAPTMLFEIDD FT RSTVQTFTLNTSGRIEALERELLQLRKRREVFDGVEILQRKKPTTTAIPRS FT AEASGSGTSKGVAAPPSTSTSTAPPPTIPAAAPASSSSPPTQSTSQPIATS FT APPAPPVHPFANARDATYAPPNVRNFATPPKPSNDKGKEPAYKTIVPVIQP FT KLAEEIFQRSMKSQFVTLTPEELLSIAPDVRTKYRDAVTPKRVSTEPVASA FT HIVEIGADEVMTVNQLSCSGATLEPGATIVPDPYETYLKHIPHGEHPAEFT FT VARDSNAIRSIIALIDNKEQIECIVDPGSQIVAMSEEVCLGLNLLFDPTIQ FT LNMQSANGEVDRSLGLIRNVPFRIGEIVLYLQAHVIRNAVYDILLGRPFDV FT LTQSVVKNFADENQTITILCPNTGETVTIPTYARGRSRSRQHSLGCRKSHF FT LASRN" FT CDS join(3507..4484,4488..6515) FT /product="Gypsy-2_PPM-I_4p" FT /translation="MCLHNEAFAWTDDERGCFKPEYFPPVDFPVVPHTPWV FT QKNIPIPPGIYNEVCAVIKRKIAAGVYEPSNSSYRSRWFCVVKKDGKSLRL FT IHSLEPLNAVTIQHSGVPPTPKYLAEQFGGRPCGGMLDLYVGYDERLIAKS FT SRDLTTFQTPYGAMRLVTLPMGWTNSVPIFHDDVTYILQPEIPHVTVPYVD FT DVPVKGPESDYDDERLPSNRGIRRFVWEHFENMNRVVTRMRYAGGTFSGVK FT SVLIAREIMVVGHRCTPQGRLPDDSRVAAIRNWGPCATLSDIRAFLGTIGV FT VRIFIRNFAHRADALVRLTRKDVEFEFGLEQITAEDLKDALLSSPALRAIN FT YESPSPVILAVDTSFIAVGYHLCQCDEANMRVRYYNRFGSITLNDRERRFS FT QPKLEIYGLFRALRSLRLYVIGVRNLIVEVDARYIKGMLKNPDIAPSASIN FT RWIVSILTFHFTLVHIPGTMHGPDGLSRRTAQPGDVIEGEVDEEFDDWIDQ FT MHSFVHQIQPLPPVPSLAIFTNDRAEDGSDFNMEEDSYELVPRSEQAQLAD FT AKLDDVLEFHRTLAKPDNISESVYEGFIKYCMQFFLDNDTLWRKDSHGAHK FT LVVKPGDRLRILRECHDRTAHRGIYATRAFVNERFWWPFHYADIAWFVRSC FT HICQSQQVRQILIPPTVAVPAPIFAKIHIDTMHMPASGGYKYLVQGRCSLT FT TYPEFRMLRKETAKALADWIFEDILCRWGALSEIVTDNGTAFIKAADYLSK FT KYHINHIRISGYNSRANGLVERPHFSTRQALFKTADGDEKKWSQAAYFVFW FT SERITTRRRMGCSPYFAVTGTPPLIPLDIVEATYLQPPLTSILSTTDLIAR FT RAIALQKRVEQVTELHSKVYEARRTAAIRFEKEHEHSIQDFDFKRGALVLI FT RNTKIEKSLNRKMRPRYLGPLIVVSRNKGGAYIVCELDGTVLDRPIAAFRV FT IPYFARKSIPIPEDLEDVSTERLRELELSNSLGDDDDEVEEAEERDE" XX SQ Sequence 8609 BP; 1951 A; 2771 C; 2092 G; 1794 T; 1 other; tttctggagc ccaccgcgag 
gggatctcag gcaagaattc gccgctttcg ggtttccgaa 60 aacatctcca gtactcaccc gctcgcaagc gcgtgaggct gctagtcgca gcgccgccga 120 aaacctcgac agctcctctc gaacgcactc gactccctct ccaacaattc ccggtaactt 180 cgaccgcgac gaagaagacg agatagacca agaactccag gacgacttcg acgaagaacc 240 gatcccttcg accgccgaag aacgcacttt gtcccccgag cttctaggcc tcactacttc 300 cgactacgcc acttcgactc ctgatctttt cgaccaatcc ggttcttcgc ccgaacccga 360 ggatcccatc ccatctactt cgaatctcgt acttccgact ccgtcttctt ttcgcgccca 420 cgctcagccg cccatcgcat cctcttcgcg actttcagtc atacctacat ccgacctggc 480 acctccgcca ccattggcac cttcgaacgc agcttcgaac tccaaccccg catcgcccgc 540 acctacgaat ccttcgacta ctaccgcatc ctcttcgagc ccagctccta ctaacactac 600 gaacatgagt cagaacacga acacaccgtt gatgcccccg tgcggtcatt cgacggcacc 660 gagcttcgac ccatcggaag tccgttcgct tcggcgttac ttccaggacc tcgaggcgtt 720 gttcacccag tgtcagatta ctgatgacac ggccaagaaa cagtgggccg ttcggtaccc 780 ttcgatcgac gtcgccgact tatgggaaac catcgagtcc ttcatcgacg taaccaagag 840 ctacaacgac tggaaggccg acgtacgagc actctatcct ggtgccgacg atacaagaaa 900 gtggtcgctg gcagacatgg atcagctcat cggagaacgc gctcgtatcg ggattcacaa 960 tgcggcagat ctgggctgct attaccgtga cttcatggcc attacgaaac atctcattgc 1020 acagaaccga ctctccccta tcgaacaaag tcgcgcattt cttcgaggat tccagccagc 1080 gttgctcaca tggctcgaga cccgactgca tctcaagcac cccgatcact acgccgacga 1140 cccatcacca tggcagaaat tcatgctgcg gctacgttca ttttacacgg tacgtcgagc 1200 acacctacga cagcggcgaa ccaaactatc gcttcgacat cgaacacttc gacaacggtg 1260 ccccctggga tgatcaaaac cgaagacatc tcgatgatca tcgaaagcct gtcgaggacg 1320 atcacaacac tcattcagcc gacgacgcac gctacacaca accatgcccc cgcaccgaga 1380 caacaggctg ctgtccacgt ccacgagaac accggagctg aacagacgtg ccactactgc 1440 ggcaatcgcg gctgcagggt aggcacctgt gagtttgcgg agatcgacat tcgggacggc 1500 aagtgcaaac ggaacaccga aggcaagatc gttcttccaa acggaagttt ctgcccccgc 1560 actatccctg gtctcacaat acgagatcga atctacgagt ggcatagaag aaatcccgca 1620 gtaccagctg ctccgacaat gctcttcgag atcgacgatc gctcgactgt gcaaacgttc 1680 acgctcaaca ccagtggcag gatcgaggcg ctcgaacgag 
aactccttca gcttcggaag 1740 cgaagggagg tcttcgacgg cgttgagatt ctacagcgga agaagcctac gacgacagcc 1800 atcccgagga gcgcggaagc ctctggatct ggtacatcga aaggagtggc ggcacccccg 1860 agcacttcga caagcacggc cccgcctccg acgataccag cagcagcacc cgcgtcgtct 1920 tcttccccgc ctacgcaatc cacgtctcaa cctattgcta catccgctcc cccagcacca 1980 ccagtacacc cctttgcaaa cgctcgcgat gcaacctacg ccccgccgaa cgttcggaac 2040 ttcgcaactc cgccgaagcc ctcgaacgac aagggcaagg aaccagcgta caagaccatc 2100 gtcccggtta tccagccgaa gctcgccgaa gaaatcttcc agcgttcgat gaagtcgcag 2160 tttgtcacgc tgaccccaga agagttgctg tcgatcgcgc ccgacgttcg aaccaagtac 2220 cgcgacgccg tcacccctaa gcgagtctcg acggaacccg tcgcgtcggc ccacatcgta 2280 gaaatcggcg ctgacgaggt catgaccgtc aaccagctct cgtgttcggg tgcaacgttg 2340 gaacccggtg ctacaatcgt ccccgacccc tatgaaacat atctgaagca catccctcac 2400 ggcgagcacc ccgcagaatt caccgtcgcc cgcgattcga acgcgattcg ttcgatcatc 2460 gcgttgatcg acaacaaaga gcagatcgaa tgcatcgtcg acccaggttc gcagatcgtc 2520 gccatgtcgg aagaggtctg tctgggcctc aacctcctct tcgatcctac tatccagctg 2580 aacatgcaat cggcgaacgg cgaggtggac cgatcgctgg gactcattcg aaatgttccc 2640 ttccgcatcg gtgagattgt attgtatctc caggctcacg tcattcgaaa cgcagtgtac 2700 gacatcctcc tcggccgacc cttcgacgtg ctcacgcaga gcgtcgtcaa gaatttcgcc 2760 gacgagaacc agaccattac gatcctctgc ccgaacaccg gcgaaaccgt cacgattccg 2820 acgtacgcgc gaggaagatc gcgaagccgg cagcactctc tcgggtgtcg caaatcgcat 2880 tttctagctt cgaggaattg attccccatg atcaaggaga agccgcgcta gtcatcgact 2940 acgatggggt aaaaccaagt atttccttcg tctctcctat tgattcctcg actccgaaca 3000 ctgtttcgtc actgtatctc tccgcatgca cctctgtttc atcatatttg cagtctcttt 3060 acgcttcgga caatacagct tctaatgccg ctccgtcgaa cgcttcgacg tgggcttcga 3120 ctcttctatt ttcagctccg ttcgcttcga ctccgccagc tcccgttccg gcgaagtcca 3180 tctcacagtt ttccaactcg attctcccct ttccttcgac gacatcgact tctacccctt 3240 caacgcacaa gatactcgcc gcgacgaaga aaaagtacaa gcctgtcgct ctcaaaaccc 3300 gacccgtcct tggtgccgtc cccaaacaat ttcgaatact tcgagacatc aagggcgacc 3360 ctctatccat catgcctaag ctttcgaccg accccccacc gttcaaacct 
accggtcggt 3420 acaccctcga acgctttgaa acgaccgaga agcttcacgg cggcgacttc ctacttcccg 3480 acgaacgttg agtcctccac cacttcatgt gcctgcacaa cgaagccttc gcatggaccg 3540 acgacgagcg cggatgcttc aagcccgagt acttcccccc cgtcgatttc cccgttgttc 3600 ctcacacgcc ctgggtgcag aagaatatcc ccattccgcc cgggatctac aacgaagtct 3660 gcgcggtgat caagcgaaag atcgccgccg gagtctacga accatcgaac tcgtcgtacc 3720 gatcgcggtg gttctgtgtc gtcaagaagg acggtaaatc tcttcgactc atccattccc 3780 tcgagcccct caatgcagtt acgatccaac attccggtgt accgccgacg cccaaatacc 3840 tcgcagaaca attcggagga agaccatgcg gcggcatgct cgacctctac gtcggctacg 3900 acgaacgact catcgccaaa tcttcgagag atctaacgac gttccagaca ccctacggag 3960 ccatgcgcct agtaacactc ccgatgggtt ggacgaactc ggtcccgatc ttccacgacg 4020 acgtcaccta catcctccag cccgagatcc cgcacgttac ggtgccctac gtcgacgatg 4080 ttcccgtcaa aggtcccgaa tcagactacg acgacgaacg gctcccttcg aaccgcggca 4140 ttcgacgctt cgtatgggag catttcgaga acatgaaccg cgtagttact cgaatgcgct 4200 acgccggagg taccttttct ggagtcaaga gcgtcctcat cgctcgagaa atcatggtgg 4260 tgggacatcg ctgcactcct caaggacgcc tccccgacga ctctcgcgtc gctgcgatac 4320 gcaactgggg tccatgtgct actctatccg acattcgagc gttcctcgga accatcggag 4380 tcgttcgtat cttcatccgt aatttcgctc accgcgccga cgcgctcgta cgcctcacga 4440 ggaaggacgt cgagttcgag ttcggcctgg agcaaatcac ggcgtaggaa gacctcaagg 4500 atgcgctact ctcatccccg gcactccgag ccatcaatta tgagtccccc tctcccgtca 4560 ttctcgcagt cgacacgtcg ttcatcgccg tcggatatca cctctgccag tgcgacgaag 4620 ccaacatgcg agtccgttac tacaacaggt tcggctcgat cacgctcaat gatcgcgaac 4680 ggaggttttc acagccgaag ctcgagatct acggcctatt ccgcgcatta cgatcccttc 4740 gactctacgt gatcggcgtt cggaacctca tcgtcgaagt cgatgctcga tacatcaagg 4800 gtatgctgaa gaatccggac atagccccca gcgcgagtat caaccggtgg atcgtctcga 4860 ttctcacctt ccatttcacg ctagtccaca tccccggtac aatgcacggc cccgacggtc 4920 tctctcgacg caccgctcaa cccggtgatg tcatcgaagg cgaggtcgac gaagagttcg 4980 acgattggat cgaccagatg cactctttcg ttcaccagat tcagcccctc ccgcctgtcc 5040 cttcacttgc catcttcacc aatgacagag ccgaggacgg ctcagatttc aatatggagg 
5100 aggatagcta cgagctagtg cctcgatcgg agcaagcaca actcgccgac gcgaagctcg 5160 acgacgtgct cgaatttcat cgaacgttag caaagcccga caacatctcc gaaagcgttt 5220 acgaaggatt catcaagtac tgcatgcaat tcttcctcga caacgacacc ctctggcgca 5280 aggacagtca cggagcacac aagctagtcg tcaagcccgg cgatcgcctt cgaatacttc 5340 gagaatgcca cgaccgcacc gcgcatcgag gaatctacgc aacccgagcc ttcgtcaatg 5400 agcgattctg gtggccgttc cactacgccg acattgcttg gtttgttcga tcgtgccata 5460 tctgccagtc ccaacaagtg cgtcaaatcc tcattcctcc gaccgtcgca gtaccagccc 5520 ccatcttcgc gaaaatccac atcgacacaa tgcacatgcc ggcctcgggc ggatacaagt 5580 acttggtcca aggtagatgc tcgctcacga cttacccgga gtttcgaatg cttcgaaaag 5640 agacagcgaa ggccctcgcc gactggatct tcgaagacat cctctgtcga tggggcgcgc 5700 tcagcgagat tgtcactgac aacggtacgg cgttcatcaa ggcggccgat tacctttcga 5760 agaaatacca catcaatcat attcgaatca gtgggtacaa ttcgagggcg aacggtctcg 5820 tcgaacgtcc gcacttcagc acgcgacaag ccttgtttaa gacagccgac ggagatgaga 5880 agaaatggtc ccaagctgcg tattttgtat tttggtccga gcgcatcacg actcgacgac 5940 gcatgggctg ctcaccctat ttcgctgtta cgggcacccc tccgctcatt cctctcgaca 6000 tcgtcgaagc tacgtatctg caaccaccac tgacttcgat tctttcgacg actgatctca 6060 tcgcccgtag agccattgcg ctccagaaac gcgttgagca agtcaccgaa ctccactcga 6120 aggtctatga agcacgtcga actgcagcaa ttcgcttcga gaaagagcac gagcactcaa 6180 tccaggattt cgacttcaag cgcggagccc tagtgctcat acgcaacacg aagatcgaga 6240 agtccctcaa ccgcaagatg cgcccgcgat atctaggccc cctcatcgtt gtctcgcgca 6300 acaagggcgg cgcgtacatc gtgtgcgaac tcgacggtac agtcctcgat cgacccatcg 6360 cagcattccg cgtcattcca tacttcgcgc gcaaatccat tccaatcccc gaagatctcg 6420 aagatgtctc gaccgagcgt cttcgagaac tcgaactatc gaattctcta ggtgacgacg 6480 atgatgaagt cgaagaagcc gaagaacgcg acgaatagag gttcttccct tttgttactt 6540 gtacaacatg atacgtactt tcttctctcc attcctgctc tcacgttaca cgatatgctt 6600 gattttctcc tatgttctag acctgccctt gtacgagtat tttcgattca actcgatagt 6660 cgcgacatcg ttcgacgttc tacgaacgct tcgattcgac cgttgcgcgg aggctcacct 6720 tagacgaaga cctcctaggc tgtgatgaca aactgagcct atattttcat taagtccatt 6780 
ttccctcgct acgaacattt ctatttcacc ttctatgttc ttcgatcaca ttacgacttg 6840 ctttgaacac acttattttc atttcctttt ttagttcccg ctaggtttcg tctaggccaa 6900 aggccacatt ttccattcga ccatctacgc gatattctac aacacgagta caactacaat 6960 acaaaacgac ttgcgcctag tcatcgtcga cagccccgcg cacgaacccg attcctccct 7020 ttgcaagcgc ggccttctcc atcttttgga ggtcgaggtc catgtcatgg aggagcgctc 7080 gacggcgagc gatgaaggcg cgctcctcgc ggatgacgcg aatgatcccc caagattcgt 7140 cgtcgaggtc gagcgcatcg aggtcgtcct gttcaaggcc aaggcggccg gccccatcga 7200 gaggcagtcg tagcaggggg cgcggaggct cgtcgactcg acgcttcttc gatccactcg 7260 agtccgcggc cgaaggcaca ggagaggtcg gccgaactac taagacgtta gcaaaggtta 7320 ggacacccag agggaacaac ttactgcgct ttttcgtggt tgatcccccg ccgcgcgttt 7380 tctttgtctt gccaagcaca gacaccttgt cgaagtagca gccatgctgg aactgcgcgc 7440 acctctcgca ggtcgtggcg ttctcgaccc acacgcactc gacaggggcc gagcggtagc 7500 cggcgcaaca atcacactat cgaggagtta gcatcaaaca gcttcgacga aggntcgagc 7560 ttacattctg cccggcctga cgcttcgcaa gcgcaccggc tatcgacctc cgtcatcgcg 7620 aacggacgtt cgagtgtccg aggagtcttc tacgaaagag agattagcga tcgaattacg 7680 acagaaaaga aaacttacga ccgaagggtc atccgacaac tcggtgacct cgtcgtcgac 7740 ccttgcacgc cctttcccct tgtcgagacc ggcaggcgcg gagtcagcga ggagtccctc 7800 ggcggccgtg atcctcacca actcctcgtc ggcggcctct tgtgcgcgac actcgtcttc 7860 ggcgacctgc ctgagacggt cctcttcatc ctttcgacaa cgttcgtctt cgaggcgacg 7920 gtcctcggcg gccttcgcac gcttcgcagc ctcttcgacg tgtcgtgctt cctcctcggc 7980 gaggcgcttc tgctcttcga cgcgtcgttc gacctcggcg tcgatggcgg tcgcgagctc 8040 gggccagtca ccgcgtgccc actcgaccca cgactccgac gcccgctcgg cgagctgttt 8100 gacggcggcg gtgacgcgca gctccgtctc ctcgtcgcgg atggcgccca gggccttgtg 8160 ataggcgagc tgttcgaaaa caaatggtta gttgggttat gcagattacg ggcgacttac 8220 gacgagcggc gacaaatctt gcgcccactc ttcgaccaca tcgagcgtgg cgtcggcggt 8280 cggcagagcc tcgaggacca cgagcagagc ggagaggcgg cggttgacga gcgaaggcgt 8340 cgatgcaggg gtggcggagc gagaggacat cgtggaagca gagggtgtgc gaggaaggac 8400 ggcgaggcag cgacgaagga acgatgggct ggggaaaatg aggctaagtc taccctgctt 8460 ataacccctt 
ttcgaccagc gtttagcgcc ggccacgtgg cgttcgaaac cgacacgggt 8520 tgtcatgccg cttcgatccg ttcacgacca tcgcgcttct cgcatcacgt ggaggaccga 8580 gggcggtcca aatttcaata aggaggaga 8609 // ID TSE1_LTR repbase; DNA; FNG; 424 BP. XX AC AJ439547; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Saccharomyces exiguus retrotransposon TSE1_LTR, long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Long terminal repeat; RNaseH; TSE1_LTR; gag; integrase; pol; KW protease; retrotransposon; reverse transcriptase. XX OS Kazachstania exigua OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Kazachstania. XX RN [1] RP 1-424 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439547; Positions 1 424. XX SQ Sequence 424 BP; 178 A; 51 C; 54 G; 141 T; 0 other; tgacggaaat tatgtagtaa ggttacatct aaaatgttta acatggtctc aaaggattag 60 actagtcatt tgtgatgagt cattatttga tgagtcactt tgtgatgtgc cattatttga 120 tgcgtcataa ttagatgaaa aattattaag aaataaaaat tgtcactcaa taaattaatt 180 gatggaaaat aaaaatcaaa acatatataa acacgataaa atatctggac tgatagtttg 240 gaattattct actcgttata tattatagat attatataaa aaagaaaaca cgacaaatta 300 tagtaataaa taataattta ctttaaatta acaaaaatac cttcagtact aatacaaact 360 cactttatat aatatctgaa atgaataacg atcaagttaa taccaactat tctgcttcag 420 ctca 424 // ID Gypsy-49_MLP-I repbase; DNA; FNG; 9847 BP. XX AC . XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-49_MLP_; KW Gypsy-49_MLP-LTR; Gypsy-49_MLP-I. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-9847 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR [1] (Consensus) XX CC Positions [8669-9181] - Integrase core CC LTRs are 96% similar to each other. CC Includes an insertion of DNA3-2_MLP (masked by x). XX FH Key Location/Qualifiers FT CDS 4309..6345 FT /product="Gypsy-49_MLP-I_3p" FT /translation="MTSANKAEAARGIRGRITLPIKLEHVTKPVFLTSEFV FT IVNDNERKYPSLGCRDLNEFKFMIYLGTERFCEVRTTYRTFKFPFVNKRTD FT ESEMLPQGQEISTITMEPNKEMDIAELQSKASVPQIWEENKDTSNQVSDAR FT LMMSQPEKRRAHTIGQHCITSALIKNKDKIDILLDSGAACSVVGTKYLDRC FT YPDWRDEMLPPSNMTFRGCSSSLKAVGVIELPIVFPHRLGSIRIRLEFVIM FT DNATSNYFILGGEYLRLYRIDIVHSKEKYFTIGNENKKKKFLLGPSHIQNV FT ENITQTEDKSFEKAFGECNISPRLKSSEAHSLKAVISKYRKAFAFGDTPIG FT TAIGHQLKIKLTIEKPYPPILKKQAYPASPRSRDEIEKHIDELTRYSILCK FT VGADEEVDVTTPVIIAWHNGKSRMCGDFRALNTFTTPDRYPMPRITHCLTN FT LGAALYITTMDMMKGFHQNEVEGFSRKFLRIITHKGIHKYLKMPFGIKGAP FT AHFQRMMDSVFAEELSEGWMIIYIDDIIVFSKTWEEHLQRLTTVFEKIIKL FT GMTISLQKSNFAFQELKALGHVVWGLWIAIDQNKVAAVLQKPIPQSVKEVQ FT SFLGFANYYRSHLEGFAKVSGPLYKLLQKGVTFEMTKERVDSWNNLKKKLT FT EAPFLIHPALQTLFGFPKWAGKNDWEYDLS" FT CDS 8030..9799 FT /product="Gypsy-49_MLP-I_4p" FT /translation="MKTPNRHMLRWQIAIQEYRSCMTITHREGRKHENADA FT LSRMALANNSDNPAWDPEDSNRDLPIMGISISDMSKEFFEQIKEDCLSEPN FT TVKLLELLKQDCKALNLSSTLEGLWKKSFDEGRFTLLSGILYHREKHACVM FT VITTAEMKTSILKICHDDPMSGHLSLDRTLDQVKQTAWWPQWRTTTEDYTS FT TCDRCQKANKATGKQFGLMQKIEEPKIPWDTINMDFVSGLPPGGHDNVDCV FT LVVVDRFSKRCRFLLCHKDATAMDIALLFWERIITDSGLPRVIISDRDPKF FT TSEFWRSLHKLCGTSLAMSTAYHPQTDGLAERAIGNLSELIRRYCAYGLEF FT KDKDGYTHDWKSLLPALEIAFNSSVHSTTGKTPFEVERGYNVRTPRTLIQR FT GDSTFHPTAVSFNNMLSKARKHAMECIRAAAEYNKQRWDKSHREPEFQVGD FT LVLILTINFTNLKGPRKMIDAFVGPYVVQALHGKNAVEVILTGSIARKHPV FT FPVSLLKKYKKSDAAEFPNRQIPPEEDLPVHDEPIGAIKKIINEKMVRIQG FT 
KETRLYLARIKGGNADEDKWLEVKDIPNSTALLRKFRAEKRSL" FT CDS join(1081..2070,2074..3840) FT /product="Gypsy-49_MLP-I_1p" FT /translation="MPPRKSDRVRAIKEKLGQPSYSETSRRCSSVGPTSAR FT PNARRSSIRSATTDHNVGGRETPNVPTREKLHQLEKSSRNEPEQYPGNHDL FT SQLVEEQSRQLDDAPGRGGSPEEVPGMESVLRRTQVQSENQPWLHRQESEP FT STSIAARAGTSAVQLVSRTNPQEPSKSSERALSETSSRRGGIGQHAVDAVL FT QPSLLPGDQRDQQIPEEQRKVSLSYNPKPTGTVQPVEKTFISTPSPFLFSQ FT RVAYENNNDVFCSAVMMPTPVSKTSVLVQKVAHPTTSPLVQNDVTPIPFRQ FT VVSPKVHALKTISTPMTENDKIERALENKNLESISQENETKRIETSVPMPE FT NSKIQETSEHLSTKKSVNSLPPITNSKVEGPTKRQDSYSVGRTVSLVEKEE FT LRHKKIKDDLPEPDNSVMKTKSIEEILRETENRISPFLLAGPVNTMPQKMS FT PLSPDKEFSLNNKFSALKIHSETTSPKSHISVPDKEFRIHETLKTVQSHIE FT TKLEETIKTLNSKWDNSVELLNEEMNMNFEEIRHSILHTNNNIDISAIAIV FT LDDHQTNMESNMQLLLHSHKEHSEVYLKHLGELMCVLGEKLDSAVQRTENI FT HRNVNRLISLTERTHNQNHSGTYSHRDHMPIANEPEILSNNPFAHEFPAMQ FT QPMFRGVDINAPTRTAMDNPKLTVDDRRSNESVGQALVKQLQKDYPPVRDW FT PKFSGEDEYNHLEWIEWIDNTQQDTGMPDSSITCKLAIIMTHSARAWYTSK FT RTTEGPSSWARWKELIKDTYGTPVWKRKMSTAFDRDTFRAEHREKPLAWLL FT LQRRRMVAAWPFLTTSEQIDKILSLCHGDIEHAVQSIIKDHSNYEMFMAIF FT EEVVTHTSIGKHKTSYPKRFTNEAIATSEYKQKDDRKEQSSRDYRDRTPGT FT NYKPRDKPFFKKDSD" XX SQ Sequence 9847 BP; 3094 A; 2107 C; 1948 G; 2075 T; 623 other; gttactacaa acgcaaagta caaagagtca cttctgatct tacctagact gtaagtaaga 60 gaccaggttc actacctatt cctttttccc tcttttcctt tgactgtccc tgtttcaaga 120 cagtaaaact ccatcttggt caagtgccat tgttcttaca acgtaaagtt gataagatcc 180 ctgaggcttc aaaacccagg ctactggttg attccatctt cgagaaaata tccccggctc 240 agaacctgac cagttccgtc tgttaaccaa cagtgggccg tggaaccttc tgtcacgttt 300 atatttacgt ataccgtggt tagtaccctt cgaaacaagt tactgatata aaggaaattt 360 ctacctttat aatagatact attcatccta cgtatccaat aggattcttt gtcaggttcc 420 tcagttaaat attgatattt aactactaga gtaagtgcca ccctgacaat tgggggcctc 480 attgagtgtt agtagaccca agaactaaca aaataaaaag aaaacgttta gttttcaaag 540 tagatatttt cacatttcct tcttcttaga aatcataaat cgttttcaaa atttcccaat 600 caaaggaact gctcaccttg aagcacgatc tcctagatct ttccttgcaa atcggaaaac 660 tgatagaaaa agtcatacaa cttgagattg 
ctgaaagaaa acataagtaa gcttacatcc 720 acccatccca aagagagtgt agtaattgac aaatacgcgc catccttgaa gtgaacgcac 780 cagagaaggt cattcaacct ccgaaggcgc agaagaacca gcccccgaac caagcagtag 840 gaaaggttcg agagcaaatc ctaacaatct tgccgaagat ccagctaaaa acgtcgtcta 900 ccgtgccgtt acctaatcag tctatacttc atacaaccac cacctcgatt accgagacaa 960 cgactactgg gtcggagaag attggtacca cggatcaccc tggtacatta cctttcattc 1020 ctttagtttc cttgttcttt atttctcttt ccgagaagat aaattgacaa gcttaacgga 1080 atgcctccta gaaaatctga tcgagtccga gctatcaaag agaaactcgg acaaccctca 1140 tactcagaaa cctccaggag atgtagctca gtcggtccca cctcagcacg acctaatgca 1200 agaagatcaa gcattagatc agctaccacc gaccataacg tcggaggaag agagactcct 1260 aatgtcccta cacgagaaaa acttcatcaa ctggagaaaa gctcaagaaa cgaacccgaa 1320 caatacccgg gcaatcacga tctatctcag cttgtggaag aacagtcacg acagcttgac 1380 gatgctcctg gacgaggagg aagtcctgaa gaggtccctg ggatggaatc cgtactccga 1440 agaactcaag ttcaaagcga gaaccaacca tggcttcata ggcaagaatc cgagccctca 1500 accagcatcg cagcacgtgc cggaaccagc gcagtccaac tcgtcagcag gaccaatccg 1560 caagaaccat caaaaagctc agaaagagcc ctatcagaaa ccagctcacg tagaggagga 1620 atcggccaac atgcagtcga tgctgtcctt cagccgagcc tactaccggg cgatcaaagg 1680 gatcaacaaa tcccagaaga gcaaaggaaa gtctccctta gttacaaccc aaaaccaacc 1740 gggaccgtcc aaccagtaga aaaaaccttc atctctactc ccagtccgtt tctcttttcc 1800 caacgcgtgg cttatgaaaa taataatgat gttttttgtt ctgctgtcat gatgcctacg 1860 cctgtgagca aaacgagtgt cctagttcaa aaagttgccc accccactac ctcaccttta 1920 gtacaaaatg atgttactcc cattcctttt agacaagttg tgtctcctaa agttcatgcc 1980 ctcaagacaa tttctactcc tatgacagag aatgacaaaa ttgagagagc gctcgaaaac 2040 aaaaacttgg agagtatctc tcaagaaaat tgagagacta aaaggattga aacctctgta 2100 cccatgcctg agaatagtaa gatccaggag acctcagaac atctaagcac taagaaaagt 2160 gttaactccc tccctcctat cacaaacagt aaggttgaag gacctaccaa aagacaagac 2220 tcttacagtg tcggaagaac cgtttcgtta gtcgagaaag aggagttaag acataaaaaa 2280 ataaaagacg atcttcctga gcctgataac tcagtaatga agacaaagag cattgaagaa 2340 atactccgtg aaacggaaaa caggatttca ccattccttt 
tagccggacc agtcaatact 2400 atgccacaaa aaatgagtcc tctgtccccg gacaaagagt ttagtcttaa caataagttc 2460 tcagcattaa aaatacacag tgagacaact tccccaaagt cacatatatc cgttccagac 2520 aaggaattca ggatacacga aacgttgaaa accgtgcaat ctcatataga gactaagctg 2580 gaagaaacca taaagacact gaattcgaag tgggacaata gcgtagaact cctaaatgag 2640 gagatgaata tgaactttga agagattagg cacagtatcc tccacacaaa caataatata 2700 gacatcagtg caatcgcgat agtactagat gaccaccaga cgaacatgga aagcaatatg 2760 caacttctac tacactcgca caaggaacac agcgaagttt acctcaaaca cctgggcgaa 2820 ttgatgtgcg tgttaggaga gaagctagac tcagcagttc agagaacaga aaacatacac 2880 cgtaatgtaa accgtctgat atcactgaca gagcgtacac acaaccaaaa ccatagcggt 2940 acctactcac atagagacca tatgccaata gcaaacgagc cggaaatact gagcaataat 3000 ccatttgctc atgaattccc agcgatgcaa cagccgatgt ttaggggagt agatattaac 3060 gcaccaacgc gcacagcaat ggacaaccca aagttgaccg tagacgatag aagatccaac 3120 gagtcggtgg gacaagcctt ggttaaacaa ttacaaaagg attatccgcc tgtgagggac 3180 tggcctaaat ttagtgggga agacgaatat aaccatttag aatggatcga atggatagac 3240 aacacccagc aagatactgg catgccagat tcgtcgatca cctgcaagtt agccatcatc 3300 atgactcact cggcgagagc atggtacacg agtaaacgaa cgactgaagg accaagcagc 3360 tgggcgaggt ggaaagaatt gataaaggac acatacggta ccccagtatg gaagaggaaa 3420 atgtccacag ccttcgacag ggatactttc agagcagaac accgagagaa acctctcgcg 3480 tggctcctac tgcagaggag gcgaatggtc gcagcatggc cgtttctgac gacctcagag 3540 cagattgata agattctgag tctatgccac ggcgatatag agcatgcggt tcaatcgata 3600 atcaaagacc attcaaacta cgagatgttt atggccattt tcgaggaggt agttacccac 3660 acatccatcg gtaagcataa aacaagttat ccgaagcgtt tcactaatga ggcgatagca 3720 acctctgagt acaagcagaa ggatgatcgc aaagaacaga gctctagaga ctatagagac 3780 aggactcccg gcacgaacta taagcctagg gacaaaccct tctttaagaa ggatagtgat 3840 tgaagaccac gattggagaa gaaagcggtt aacactgtcg aactagacga tagagagact 3900 gatgttgaaa ccagaatcga ggacgaagac cagtcagaag ctgggtcgga aacgacagac 3960 gatgaacagg acatatgtat cggtaacatc gagatgacca accaagtgca agggaacggt 4020 caggcagaca aagactacaa tacagattct acaggaagtg taatagaagg 
tgataaccta 4080 tcgctggagg aactcgcaca caatctacat atagcagaat tcgcacacgt agagagcaac 4140 tatccccctt ttatcgaaag acttgaaatg ggaacgatag acccaagttt aacactctcc 4200 atatcgatta acggcatcct ggactcactg gtcctagata cgaacaccat agaatcaatc 4260 atgccactat cacacttgca aatctactgc ccttgatggg aaacggacat gacatcagcc 4320 aacaaagcag aagcggctag aggcattaga ggtagaatta ccctacctat taaattagaa 4380 cacgtcacaa agccagtctt tttaacgtca gaattcgtca tcgtgaatga caacgaaagg 4440 aaatacccca gtctgggatg ccgagacttg aacgaattta agttcatgat ttacctcgga 4500 acagagaggt tctgcgaagt aagaacgaca tacagaacct tcaaatttcc atttgtgaat 4560 aaaaggacgg acgagtcaga gatgcttcca caggggcaag aaatatctac gataaccatg 4620 gaacccaaca aagaaatgga tatcgcagag ctacagtcaa aagcgtcagt accccagata 4680 tgggaagaga acaaggacac atcaaaccaa gtatcggatg caaggctcat gatgagtcaa 4740 ccagagaaaa gaagagctca cacgatagga cagcactgta ttacatcagc gcttatcaaa 4800 aacaaagata aaatagatat cctgctagac agcggagcgg catgctcagt tgtgggaacc 4860 aaatatcttg acagatgcta cccagattgg agagacgaga tgctacctcc cagcaacatg 4920 acctttcgag gatgtagcag ctctttgaaa gcagtaggcg tcattgaatt gcctatagtc 4980 tttccacata ggcttggttc gataagaatc agactagaat tcgtcatcat ggataacgca 5040 acctcaaatt attttatatt aggaggagag tatctgcgct tgtacagaat agacatagta 5100 catagcaagg agaagtactt taccataggg aatgagaaca agaaaaagaa gtttctgcta 5160 ggtcccagtc acatccaaaa cgtggagaac atcacgcaga ccgaagacaa atccttcgag 5220 aaggcatttg gggaatgtaa catttcaccc aggctcaaat cttccgaagc acattcgcta 5280 aaagcggtga tatctaaata ccgcaaagct tttgcctttg gagacacacc aataggcaca 5340 gccataggtc atcagctgaa aatcaaattg actatagaaa aaccgtaccc accaatcctc 5400 aaaaagcaag cgtacccggc gagcccgcgt agcagggatg aaatagaaaa gcacattgat 5460 gagctaacta gatacagtat cctatgcaaa gtgggagcgg atgaggaagt cgatgtcaca 5520 acaccggtca tcattgcgtg gcataacggc aaatctcgca tgtgcggaga ttttagggca 5580 ctaaatacgt tcaccactcc ggacaggtac cccatgccga gaattaccca ttgcctaaca 5640 aatctaggtg cggcactata catcaccaca atggatatga tgaagggatt ccaccaaaat 5700 gaagtagaag ggtttagcag aaaattcctg cgcattatca cacacaaagg gatacacaag 
5760 tacctgaaaa tgccgtttgg aatcaaaggc gcgcccgcac atttccagcg gatgatggac 5820 tccgtgtttg ctgaggaact tagcgaagga tggatgatta tatacattga cgacattatt 5880 gtattctcga aaacatggga ggagcatctt caaagactta ctactgtttt cgagaaaatc 5940 atcaaactag gtatgactat ttcactacaa aagtcgaact ttgcctttca ggaactgaaa 6000 gcgctgggac atgtcgtttg gggcctatgg atagctatag atcaaaataa agtagcagcg 6060 gtgctacaaa aacctatccc acaatcagtg aaagaggtgc aatcgttcct tggattcgct 6120 aactactata gatcccactt agaaggattt gcaaaagtaa gtggcccatt gtacaaatta 6180 ctgcaaaaag gagtcacctt cgaaatgaca aaagaaaggg tagattcttg gaacaacctt 6240 aaaaagaaat tgacggaggc tcctttctta atacacccag xxxxxxxxxx xxxxxxxxxx 6300 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 6360 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 6420 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 6480 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 6540 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 6600 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 6660 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 6720 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 6780 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 6840 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 6900 xxxcagattt caaactaccc ttcaaagtgt acgtggacgc gagttttgat ggcctaggag 6960 ctaccttaca acaagtacaa atagtggaag acaaaccaac agaaggactg atcagctgca 7020 tttcacgaca actgaaacaa tcagaactaa actacggtgc tatgcaactg gaaaacctat 7080 gcctagtatg ggctttagaa aaaggagcgc actatttggc acccgctttc gaaaatcccc 7140 gacgcccgct tttatcggta ctttcccgcg cgttcaagcc aaatttggcc gcagccgcag 7200 caattatgag aaaccgggca ttagcctgaa accggtttgc cacatctcac ctctctcact 7260 cagcattcct cctaacttga agacgacgta ccaatatcaa atcggaccac ggtttgtcag 7320 actaatgaat cttttaagaa agtcactcct acagatcttg aacaaccggt gatgcaacct 7380 tctcatctcc aattctgagg atgaaaatct attgaaccaa acatttatct ttttcgaact 7440 
cctattttga ctgattacag ttgaagttgt atcttatgac tgtcttatca ttatcatatt 7500 caacctcatc atatttcaat cgatcccgag tcaaatgatt gttgtgaatg tctcttattt 7560 tttattctgg ttgatcttga ttagaatgat tgtttgaact tgtatctttt tggccttggt 7620 tgtttctgag tacaccagtc attcatttta tttgaacttt tatgactgtt aaccacctca 7680 cgattacatt gcaatcgggg tctttgaaaa agcaaattaa gaaattggat ttcaaaaact 7740 atcgacaatg agctagaact atgggcaagt atgcttgtca taaaccatgg tctaactcgg 7800 catgtgtata cgattaaaaa ccgggttgaa accgggctac ggaaagatgt aaaatttgaa 7860 aaccgggcat tgtgttttca aaaccgggcg tcgtgtttcc aaaaccgggc gtcgtgtttt 7920 caaaaccggg cgccaaatag tgcgatcctt agaaaaattc cattattacc tggacggatc 7980 ctttttcgaa gtaataacag actgcacagc actcaaatca ctccttaaca tgaaaacacc 8040 aaataggcac atgctaagat ggcaaatagc catccaggag tatcgatcct gcatgacaat 8100 cacgcatcgc gagggtagaa aacacgaaaa cgcagacgcg ctgagtcgta tggcattagc 8160 taataatagc gataacccag catgggatcc agaagatagt aaccgggacc ttccaatcat 8220 gggcattagc atatcggata tgtccaagga attttttgag caaataaagg aggattgctt 8280 atcagaacct aacacagtta aattactaga gttactcaaa caagactgta aagcgctgaa 8340 cctctcatcc accttagagg gactgtggaa aaaatccttt gacgaaggaa gattcacgct 8400 cctcagtggg atcctctacc atcgagaaaa acacgcatgt gtgatggtga tcacaacagc 8460 cgaaatgaaa acgagtattc ttaaaatatg tcacgacgac ccaatgtcag gtcatctatc 8520 attggacagg acactggacc aagttaagca gacggcgtgg tggccacaat ggcgcacaac 8580 gacagaagat tacaccagta catgtgacag gtgccaaaag gcgaataaag caacaggaaa 8640 acaatttggt ttgatgcaaa aaattgaaga acccaaaatc ccctgggata caataaacat 8700 ggactttgtg tccgggttac ctccgggagg ccacgataat gtagattgcg tgttagtagt 8760 cgtcgacagg ttctcaaaga gatgtagatt cctgctgtgc cacaaggacg caacagctat 8820 ggacatagca ttactctttt gggaaaggat cataaccgac tcaggacttc caagggtaat 8880 catcagcgac agagacccca agtttacgtc agaattctgg cgaagcctgc ataaactgtg 8940 cggaacctcc ctggccatgt ctaccgcata ccacccacaa acagatgggc tagcggaacg 9000 agcaattggt aatctatctg aactcatcag aagatattgc gcgtatggcc tggaattcaa 9060 agacaaggat gggtataccc acgattggaa aagcttgtta cccgctttag agattgcttt 9120 taattcaagc 
gttcatagca ccacaggtaa aacacctttc gaagtggaaa ggggctataa 9180 cgtcagaacg ccgagaactt tgatccagag gggagattca accttccacc cgacggcggt 9240 aagctttaac aatatgctta gcaaggcaag aaaacacgca atggaatgca taagagccgc 9300 cgcagaatac aacaaacaac ggtgggacaa atcacacaga gagccagaat ttcaagtagg 9360 agacctagtt ctcatattga caatcaattt caccaacttg aaaggcccca ggaaaatgat 9420 cgatgcattt gtaggtccat acgtcgttca agctctccac ggcaagaatg cggtggaagt 9480 tatactaaca ggaagcattg ccaggaaaca cccggtcttc ccagtctccc tgctcaaaaa 9540 atacaagaaa tcagacgcag cagaattccc taataggcag atcccgccag aagaagacct 9600 accagtgcac gacgagccaa tcggtgctat taaaaagatt atcaatgaaa aaatggtacg 9660 catccaggga aaagagacac gtctctacct cgcaagaatc aagggcggta acgcagacga 9720 agataaatgg ttagaagtta aagatatccc aaattcgaca gcattgttaa gaaaattcag 9780 agcggagaag agatccctct gaaccattga gtcacggttc agggtacgtc tccttgtggt 9840 tggggaa 9847 // ID PCretro3_LTR repbase; DNA; FNG; 404 BP. XX AC DQ097840; XX DT 08-MAR-2006 (Rel. 11.02, Created) DT 08-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Phanerochaete chrysosporium RP-78 Ty1/copia LTR retrotransposon DE (LTR portion). XX KW Copia; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; PCretro3_LTR. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-404 RA Novikova O., Fursov M., Shutov O. and Blinov A.; RT "Divergent groups of LTR retrotransposons from Phanerochaete RT chrysosporium."; RL Direct subission to Genbank (2005). XX DR EMBL/GenBank/DDBJ; DQ097840; Positions 4645 5048. 
XX SQ Sequence 404 BP; 101 A; 119 C; 82 G; 102 T; 0 other; tgttgaaagt cgccccgcca gggcgcactc aaaaccgtcg tgcagacgga agtcaaaagc 60 tccggcgcgc ccggagggct gaagcgccga gcgcccatca aaagatgcgc cgtcggcgta 120 tccttacaag gtctcgcggc gggacaagtc tcgccgcctt ccccattcct cttttccgct 180 ctgagtaatc gttagattag attccgtagt tatcaatccc cagcgctccc ttatcaatcc 240 tctagattag attccgcctt tttcagagcg agcatttttc atttgattat tacctacttt 300 agatacacgg aagtaaatct aagtagacta taaaactcgt agaataatcc ccaaaaatat 360 tatctcgctg ttctcctcca tcccagagtc tgcgcatcac aaca 404 // ID Gypsy-35_MLP-I repbase; DNA; FNG; 5809 BP. XX AC AECX01000146; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-35_MLP_; KW Gypsy-35_MLP-LTR; Gypsy-35_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5809 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000146; Positions 12635 6827. XX CC Positions [4699-5178] - Integrase core CC 'CCTG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 2635..5637 FT /product="Gypsy-35_MLP-I_1p" FT /translation="MKIRAYCLSLPHIDLILGLPWFQEHKPVPNFSNGTYS FT ISTQPSNLDIRPSKKSKDSDVNTIEKISPQEEVRRLAEQTAPECFAEDITV FT GANTTCTHTIDTADARPVKTHGRPHSPPEHAVINQFVEDGLKQGIIEKSCS FT PWSFPIVLVKKADGSTRVCIDYRKLNNVTIKNAYPLPRIDEVYQFLSKANW FT FTTMDLKSGFWQVLMDPQDKIKTAFTCRAGHFQWKVMPFGLCNAPATFQSM FT MNDILAPIIDRYAMVYLDDVIIYSKTPEDHKRHIKEVLTLLKDHKLTLSPK FT KCNWAQNKLLFLGHIVDGDGIRTNPDKISKILEWPVPTTVTHVQGFLNLCT FT YYKRFIKDFAKIASPIYKLTEGSLKPGTTITWGDEQHKAFTALKTVLTGTV FT TLPHPIPFYPFVLDTDASKTCIGSVFQQDTEDKFDKENFTLEQYSKEVKNR FT NLRPVAFESKKLSKTEQRYSTQERELLAIVHSLKHFRGLIAGSPILVRTDH FT ESLKHFQTQQYTNPRLARFLDDIESYNVHIVYRPGKHQLAADAMSRKPDAP FT SDVEPPEIAEPLYVIHNDNSPELDFERLREYYDLLDHGKNPSDVGSGNFNI FT IRRQMVRYNPDEPNTTPKAVLTTKEEAELVCIAVHINLGHRNYKDVIISMK FT ENYWFPNMNKFIEETIKDCRECQIHAAHSDKENLPIQVIERGKPFRKWGMD FT FVGPLPKTAHGNEYILTAVDYGTGWAYAVPLKKTSAEAVVDLVKQLCLAHG FT HPDEITTDNGSEFHSHVFQVYLRESKIKHLHTTPYHPQGNGLVERFHGTLI FT GAIKKFCAPYNQGKWDQYVNKALFAYRAAHSNSLKASPFEMTYGIEARFPP FT IFSLGDNPILVNKKDIKLTQELRRIDLERFSKQRQERIDQLNERARQRLQD FT SEENFVERHIKPGDLVWRKFEGRASKLHPRWDGPFIVRDSDPNGTYQLMTS FT NGHILQLRVNGSRLKPYMGKNIDEFFFASQQLHDRDAAAKHGQTSN" FT CDS join(1044..1328,1332..2597) FT /product="Gypsy-35_MLP-I_2p" FT /translation="MEVLYQAARDCERNNKLTRSGLNSTHHSNHHKSPNHH FT RNNFKNHNNYGKPHATTATTSMSAPMDLDNIDISKVECFKCHKFGHRAREC FT RSGNKTNRPNLTLLDLTSNSESQDNPEENNGPLDLAVVDVPPEDHPVSSVE FT SPYNILGKLEPDPRKELNEYAKTYIRDEIQKKSWDESVTELRKWHKDYKQT FT LVDTTELGDKRYRIAKEAEWTFMRLANRDHYNFADTAKSMEHETNWNLTQK FT KKMYNLNEYRERILKALDKMQSSPEYPIDVQSEKEVFEEKYQGWKKHLLEK FT YADKPRCCESCRLFGCEDSPEPETNNIERTNPAPEVPAEVLILPSHDPNGK FT NALFFTDLENIINEDQDVIMDNTLIDGFSFIKTPFNSPLIGAQTIISPDKE FT LESVEAWNKIYSLCKKRSNEHLDEKVEDEEPVYLDLNLVDDLNHVTGSSNL FT PLYTFFTSSNSPLRTILDTGAAANYISKDVALEILKQDHKAKVIDEPKRQV FT RLANGSTEDCGRKLEFFILWS" XX SQ Sequence 5809 BP; 1968 A; 1278 C; 1104 G; 1459 T; 0 other; aataattatt tgtgtgtttt ttttaaatcg ttacaagtct ttataaagtc taacaaatct 60 tttcctatct cttatttttc gaaagaccga ttacaagttt 
ccaatagatc ccacagaaac 120 ccttccttcg aaaaataacc ccagagatta gattccttag tcttatcgtt tcaaaacaaa 180 gaaaacactt tttatcaaac gtgaataaga ttcctatttt ctttctcacg tatcgtatta 240 gtctttttct ttggacctca gtcctcttta attcctgatc agtcttaaga acgactcacg 300 gttgttaacc cttcttgtaa cagaaacacc aagaacccaa aatgtcatca ccaatgatga 360 acagtacgga cggttgaatt cccgcccaac aagaagatag ggatcgagtg gacctctcgt 420 tccaacaaca agctcaaaga ctcgcagaat tagagaatgc ggtcggacaa caatcgcatc 480 aacatgagcg aattaccaac gaactggtaa gtgcactaaa cactataaga gaattacaaa 540 ctgaaatata tagcttaaaa aatgtacgta caaaccgaca ggagcctaga attgaggcct 600 taaaagtgga accacgcaag ttcacaggat atggcgataa tcccgacctt tggattaccg 660 agattgagac caatttcgcg tctcaaaact acccagtcac ccgctggacc gaactcatca 720 tcaacttctt agatgaagac gcccgatact tttggcatga actcgttaag aagagtgatg 780 gacaaatccc atcatggatt gtctttaaaa ccaagttctt tgaaaagtat aattactccc 840 tcgtattata tgaagttcga caacaactga aagcccttta ttacaaatct gatataaatg 900 attatatcct taggtttcgg aagttggccg tcaagatccc cgacgagaaa ctacctttct 960 ttgaaagacg tttcttattt caagataaat taccggctag ttatcaacag gacctaaata 1020 aagttgattt aactaacgag gatatggagg tactttacca agctgctcga gattgtgaaa 1080 gaaataacaa actgactcga agtggtttaa attctactca tcactcaaac catcacaaat 1140 cacccaacca ccatcgaaac aacttcaaga accataataa ttatggtaaa ccccatgcga 1200 ccaccgccac tacatccatg agcgctccta tggatctgga taacatagac atatctaagg 1260 tagagtgttt taaatgtcat aaatttggtc atcgcgctcg agaatgccga tcaggaaaca 1320 aaactaattg aagacccaat ctaaccttgt tggatctcac ctccaattcc gaatcacagg 1380 acaacccaga agagaacaac ggacccttag acctcgcggt tgtagatgtt ccacctgaag 1440 accatcctgt atcttcagta gaatctcctt acaatattct gggaaaactc gaacctgatc 1500 ctcggaaaga actgaacgaa tacgccaaaa cgtatatccg tgatgagatc cagaagaaaa 1560 gttgggacga atcagtgact gagttaagga aatggcataa ggattataaa caaactttag 1620 tcgacaccac agagctaggt gataaaagat atcgtattgc aaaagaagcc gaatggacct 1680 tcatgaggct tgcgaatcgt gatcattaca actttgctga tactgccaaa tccatggaac 1740 atgaaacaaa ttggaatcta actcagaaga aaaaaatgta taacttaaat gaatatcgag 1800 
agaggatcct aaaggccctg gataaaatgc agtcctctcc cgaatacccg atagatgttc 1860 aaagtgaaaa agaggttttt gaagagaaat accaaggttg gaagaaacac ctcctagaaa 1920 aatacgccga caaaccaaga tgttgtgagt cttgtcgcct atttgggtgc gaagatagcc 1980 cggaacccga aactaataac attgaacgca ctaatccagc tccagaggta ccagctgagg 2040 tcctcattct tccttcacat gatcctaacg gaaagaatgc tctgttcttt acggacctag 2100 aaaacatcat aaatgaagac caagatgtta taatggacaa taccttaatt gatggatttt 2160 ctttcattaa aactcctttt aactctcctc ttataggcgc acaaactatt atctctcccg 2220 acaaagaatt ggaaagtgta gaggcttgga ataagatata ttctttgtgt aagaagagat 2280 caaatgaaca cttggacgag aaggttgaag atgaagaacc agtatattta gatctaaatc 2340 tggtagatga cttaaaccat gtaactggtt cgtcaaattt accattatat acttttttta 2400 ccagctctaa ctcgccacta cgtaccatat tagatactgg ggcggccgca aattatatct 2460 caaaagatgt cgctttagaa atattgaaac aggaccacaa ggccaaagtc atcgacgaac 2520 caaagagaca agttcgatta gccaacgggt caacggagga ctgtggtcgt aagttagaat 2580 tcttcatact gtggtcgtaa gttagaattc ttcatatcca tcgcagggtt tcaaatgaaa 2640 atcagagcat attgtctgag tctcccgcat attgacttaa tacttggatt accatggttt 2700 caagaacata aaccggttcc taatttctca aacgggacgt attcaatttc aacacaacct 2760 tccaatctgg acatccgacc tagtaaaaag tcaaaagact cagatgtcaa caccattgaa 2820 aaaatatctc ctcaggaaga agttcgaaga ctggccgaac aaacggcacc tgaatgtttt 2880 gctgaggaca tcacggttgg cgccaatacc acctgcacac atacaattga cacggccgac 2940 gcgcgacctg tcaaaactca cggaaggcca cactcaccgc ctgaacacgc agtgattaat 3000 caatttgtag aggacggatt gaagcaaggt attattgaga aatcatgttc gccttggtca 3060 ttccccatag tgcttgtaaa gaaagctgat gggtccaccc gtgtctgcat cgattatcga 3120 aaacttaaca atgttacaat caaaaacgcg tacccacttc cacgaataga tgaagtatac 3180 caattcctta gtaaagcgaa ctggtttaca actatggact tgaagagcgg tttttggcaa 3240 gtactgatgg atccccagga caaaattaaa acagcattta cttgccgagc cggccatttc 3300 caatggaagg taatgccttt tggtctctgt aatgctcccg caaccttcca gtccatgatg 3360 aatgacattc ttgctcccat catagacagg tacgctatgg tttacttaga tgatgtcatc 3420 atatattcca aaactccaga agatcataaa agacatatta aggaagtctt gactttattg 3480 aaagaccata 
aactgactct ttctccaaaa aaatgcaatt gggctcaaaa taaactctta 3540 tttttaggac acatagtaga tggtgacgga atcagaacaa atccggacaa gatatcgaaa 3600 atcttagagt ggccagtacc aacaaccgtc acacacgtac aaggtttttt gaacctatgc 3660 acctattaca agaggtttat aaaagacttt gcaaaaatcg catcgccaat ctacaaactg 3720 acagaaggtt ccttgaaacc tgggacgacc atcacctggg gagacgagca acataaagca 3780 tttacagctt tgaaaacagt tctaaccggc acagtcactt taccacaccc gattcctttt 3840 tatccgtttg tcttagacac agacgcatcc aagacttgca tcgggtcagt atttcaacaa 3900 gatacagaag acaagtttga taaagaaaat tttaccctag aacaatacag caaagaggta 3960 aaaaatcgaa atctcagacc agtagcattc gaatcgaaaa agctatcaaa aactgaacaa 4020 cgttactcga ctcaggaacg agaacttcta gctatcgtcc attctttgaa gcatttccga 4080 ggtttgatcg caggttcacc aatcttggtt cgaacagatc acgaaagcct caaacatttt 4140 cagacacaac aatacacaaa ccccagattg gcacgattcc tggacgacat tgaaagctac 4200 aatgtccaca ttgtatacag accaggtaag catcaacttg ctgccgatgc tatgtctagg 4260 aaacctgacg caccctcaga cgttgaacct cctgaaatag cagaacctct gtatgttatt 4320 cacaacgata acagccccga attagacttc gaaagattgc gagagtacta cgacttgttg 4380 gaccatggaa agaatccttc cgatgtaggc tccggcaact tcaacataat taggcgtcaa 4440 atggtacgtt acaatcccga cgaaccaaat acaacaccaa aagcagtcct cacgacaaag 4500 gaggaagccg agttagtatg tatagctgta catatcaatt tgggacatag aaactacaaa 4560 gacgttataa tcagtatgaa agaaaattat tggtttccaa atatgaataa attcattgaa 4620 gaaacaataa aagattgtag agagtgtcaa attcacgcag ctcactcgga taaagaaaat 4680 ttaccgatcc aggttatcga aaggggaaaa ccattccgca agtggggaat ggacttcgta 4740 ggccctttac caaaaacggc ccatggaaat gaatacatct taacggccgt agactacggc 4800 acgggctggg cttacgccgt tcctctcaag aaaacgtcag cagaagcggt ggtcgacctc 4860 gtgaaacaac tctgtcttgc ccacggacac ccagatgaga ttacaaccga taatggtagt 4920 gagtttcatt cccacgtttt tcaagtctat ttgcgagaga gtaaaattaa acacctacat 4980 actactccct accatccaca aggcaacggg ctggtagaac ggttccacgg aacattaatt 5040 ggtgctatca agaaattttg tgcaccttat aaccaaggaa aatgggacca atacgtgaat 5100 aaagctcttt ttgcttatag ggcagcccat tccaactcac tgaaagcgtc gcccttcgag 5160 atgacatacg gcattgaggc 
acgttttcca ccaatctttt ctttgggaga taaccctatt 5220 ctcgtcaaca aaaaggacat taaattaacc caagaattgc gacgtataga tctcgaaaga 5280 ttctcaaaac aaagacaaga gcgtatcgat caactcaatg aaagagctag acagcgacta 5340 caagactcag aagagaactt tgtcgaacgg catattaaac ccggagacct ggtatggaga 5400 aagtttgaag gaagagcttc aaagctacat ccgcggtggg acgggccgtt catcgttcga 5460 gactctgatc ctaacggtac gtatcaattg atgacatcca atggtcatat tttacaacta 5520 cgcgttaatg gatcaagact gaagccttac atgggaaaga atattgatga atttttcttt 5580 gcgtctcaac aactccatga ccgtgatgca gcagccaaac acggtcaaac atctaattag 5640 ccaattacgg ttcttatcaa aactgactat tggaaatgaa gatctagata tactacattc 5700 cctcaaacat gaactagaat tcaccttgtc cgaagtacaa caggaaatag acgtggtgga 5760 ggataccttt ggctcaaacg aggatgtttg aatcttaagt gggggatgg 5809 // ID Gypsy-7_LBS-I repbase; DNA; FNG; 8339 BP. XX AC ABFE01000288; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_LBS_; KW Gypsy-7_LBS-LTR; Gypsy-7_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-8339 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000288; Positions 43340 35002. XX CC Positions [4198-4659] - Reverse transcriptase CC Positions [6127-6324] - Integrase core CC 'TCCAC' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 2947..6324 FT /product="Gypsy-7_LBS-I_1p" FT /translation="MAKPPKVRTGQRIRLVQVTGNAIITGYVTLDVYFHTK FT DGPVLIKVEAYVVKGMSAPLILGNDFADQYSISLLREEGESTLIFGKSGRS FT TKVHNSIMTSLLDEEGHAFKVLVRPDITSRALKGKLHRRSQKIKRRINQRT FT KDNYVRIASSVQIAPESTKLVRVQGNFCKSVDHLLVEKKLVTTGGPESIYG FT CANTLINRKSSFVYISNFSKKQVNIPIGQILSQGHDPSTWLDKEEQFTKKE FT INAISGHANLLRTVINSEGTSIDKNPFVKMTRSEVDALQDSSRHDYSSDDV FT LAEPPLEGGPKTAEIPGDTTSSAELLKEIDISPDLTKEQADDLKEILRRHE FT GAFGLEGKLGHYETEVDIPLLPNTKPISIPPYQALPANREVIDKQMDKWME FT LGVIKPSRSPWGAPVFIAYRNSKPRMVIDLRRLNEKVVADEFPLPRQDEIL FT QSLEGSQYLTTLDALAGFTQLSIRKEDREKLAFRSHRGLFQFKRMPFGYRN FT GPAVFQRVMQGILAPFLWIFALVYIDDIVIFSKSFEDHLVHIELVLKAIEE FT AKITLSPGKCHFGYQSLMLLGQKVSRLGLSTHKEKVDAILQLENPKNVHDL FT QMFLGMMVYFSSYIPFYAWIIHPLFQLLKRGSKWKWDEDEQNAYDVCKQVL FT TQAPVRAHPMPGLPYRIYSDACDFALAAILQQVQPIKVKDLRGTKTYEVLE FT RAFKAGEPIPDLATHLVKEDSDVMPQGAWDVEFENTTVYVERVITYWSRVL FT QSAERNYSPTEREALALKEGLIKFQPYLEGEKILAITDHAALTWSKTFQNV FT NRRLLTWGLVYSAFPNTKIVHRAGRVHSNVDPISRLRRRVPPQSSPLDVQF FT DPLKLKPAEDPLRSMFDELGPKFEEKMLTVATHFAESELQLESSNKRLDVT FT IALEGDKEIVVPYETSQSYSTTVQIGEQEIKRWKDAYARDSHFNLVNRSNE FT ANEGVNITYPQYHFSREGLVYFLDSTGNSRLCVPKDLRAEIMQEAHNTITE FT AAHGGYFKTYNRISATYYWPRMSREIKIFVNTCDVCQKIKPRRHAPVGLLQ FT SIPVPSQPFEVISMDFIPELLTSNGFDNILVIVDKLTKYAIIVPTTTKVTE FT VETARLFFKHVISRFGIP" FT CDS join(194..1159,1163..2536) FT /product="Gypsy-7_LBS-I_2p" FT /translation="MDTRKSGSSGRKTNNSQSTGMSKSSTPIPPSGTTNTS FT RGKETALDTSTQNVFTPGTPDIPEDLDMTEVPFFNTVQGPDQETPIPSMVE FT PEHPDIEDLTELGTPFAFEDLPDRGSYREIFIEFPGCMPFSSFGETVTDTG FT FDRSEKIFALAGRHRQALQLGEAALRRPIYNSYNKVVTRKGRKDLFKEEMD FT TLKELLSRLYYFNDESNVAGFALQRIILVNLNHQLKARRNEAEEDFIISGE FT GVPTLPRWGLNSKADEFWSANDFEILGACFRREVENFLAYLAEHHDFSKAK FT KGNKHEQRTTIITKPSHKKVLISTPTITTVDYQPTFVNYGADSISAHLSRN FT KGHANRFAANRNTSVFGHPAQNSSSHTFKELFGVRQGEDAIESQNGDDDSV FT HSGSHKSGTGIYNQQGQRRSGGDPGDPDGSDSDDGGSNGPRRGPKRTAVPD FT KSRRNPFETTPEGGNSTTVKAPPEPQFDTKLKMDTIPTWDGNPENLRRWLL FT KINSLAKRSTIVFKQLGTLVPTRLTGSAEIWYYSQSVDTRDRIEQDWSTLR FT 
AAIGEYYMNRAFLDRQKARANRASYRDMGNGRETPSEYVIRKLELLQFVYN FT YTERELINEIMEGAPSFWATVVTPHLLLDLEQFQLSVKFHEDSLLRLGGGE FT NSLNRQVNSNYSKDNSQRNPYNPFRNARVNLVGWTKAASNPQFPKDDANVS FT PRGTPEEKGARPCRHCGSGKHWDKDCRHARKGEKRARVNAVTTTAEEDRAE FT EEYDNIYYERFSDEEDIEDNPDFPKPSQP" XX SQ Sequence 8339 BP; 2539 A; 2001 C; 1933 G; 1866 T; 0 other; ttggtggaca cattgggaaa tccttgccta cgattcgtgt gaacatctgc acgatcggat 60 ctcaagatct taactaatac aagtacttcc acaaccggcg gaacacgttg cacccacaca 120 acgcatccta aacgaaaccc tcacgcgaga tcaatccgta caagatttgc ttctaaggta 180 ccgacatagc cccatggata caaggaaatc aggtagctcc ggtaggaaaa ctaacaactc 240 gcagtctaca ggaatgtcga agagttcgac ccccattccc ccttctggaa cgacgaatac 300 gtcccgcgga aaggagaccg cgttagacac cagcacgcaa aacgtattca caccgggaac 360 gccagacatc ccagaagact tggatatgac cgaagtaccg ttcttcaaca cagtacaagg 420 gccagatcaa gaaacgccta ttccaagtat ggtcgaacct gagcaccctg atatcgagga 480 tctcaccgaa ctagggacac catttgcatt cgaggactta cctgatcggg gttcatacag 540 ggagatattc atcgagtttc ctggttgtat gcctttctcc tcgtttggtg aaaccgttac 600 tgacacggga tttgacagat ctgagaagat cttcgcctta gcaggtcgac atcgtcaggc 660 cctgcaatta ggggaagctg cactcagacg acccatctac aattcctata ataaggtcgt 720 cacgcgaaaa gggcggaagg atttgttcaa ggaagaaatg gacaccttga aggagttatt 780 aagtcgtttg tattacttca atgacgaatc taacgtggcc ggtttcgcgt tacaaaggat 840 catcctagta aatttgaatc atcagctaaa ggcccgacgg aatgaagccg aggaagactt 900 cataattagc ggagaaggcg ttcctacatt acctagatgg ggtctaaaca gtaaagccga 960 cgaattttgg tcagcaaacg attttgaaat tttaggagct tgcttccgtc gagaggtcga 1020 aaacttttta gcttacctag cagaacatca cgacttttcc aaggcaaaga agggtaacaa 1080 acacgaacag cgaacgacca tcatcacgaa accttctcat aaaaaggtgt taatcagtac 1140 tccaactatc acgaccgtat gagactatca acctacattc gtgaattatg gcgctgatag 1200 catttccgct cacctcagcc ggaacaaggg tcacgcaaat cgtttcgcgg caaatagaaa 1260 cacttcagta ttcggccatc ccgctcaaaa tagttcctca cacacgttca aggaattatt 1320 tggcgtcagg caaggtgagg acgcgatcga atcccagaat ggtgatgacg attccgttca 1380 ttcaggatct cacaaatctg gaaccggaat ttacaatcag caaggacagc gtcgaagtgg 
1440 aggggatcca ggagaccctg acggcagcga cagcgacgac ggaggaagca atggaccgcg 1500 ccgaggacca aaaagaacgg cggtaccaga caagtcacga aggaatccat ttgaaactac 1560 gccggaagga gggaattcca ctaccgtcaa ggcccctcca gaaccacagt tcgacaccaa 1620 gctcaagatg gacaccattc cgacgtggga tggtaacccg gagaacttaa gacgatggct 1680 cttgaagatc aatagccttg ctaaacgatc taccatcgtc ttcaaacaac taggtacatt 1740 agtacccacg cggcttacag ggtcagcaga gatttggtac tacagtcaaa gcgtcgacac 1800 gcgcgaccga atcgagcaag attggagcac cttacgcgca gcaatcgggg agtattacat 1860 gaaccgagca ttcctcgaca ggcagaaagc gcgcgccaac cgcgcttcct atcgcgatat 1920 gggtaatgga agggagaccc cgagtgagta tgttattcgt aaactggaac tcttacagtt 1980 tgtgtataat tacacagaga gagaactaat caacgaaatc atggagggag caccctcatt 2040 ctgggcgaca gtagtcaccc ctcacctact attggacctc gagcaattcc agttatccgt 2100 caaatttcac gaggactctc tcttacgact agggggaggg gaaaattctt tgaaccggca 2160 agttaattct aactattcta aggataattc ccaacgaaat ccttacaatc cgtttaggaa 2220 cgcgcgcgtc aatctggtgg gatggaccaa agcggcgtcg aacccgcaat ttcctaaaga 2280 tgacgccaat gtttcaccac gcggaacacc cgaagaaaaa ggcgctaggc cttgccgcca 2340 ttgtggtagt gggaaacact gggacaagga ttgcaggcac gctagaaagg gtgagaaacg 2400 cgcaagggtt aacgcggtca ccacaactgc ggaggaagat cgagcggagg aagaatacga 2460 caacatctat tacgaacgat tcagtgacga ggaagacata gaagataacc cggattttcc 2520 gaagccctct cagccataag ggaagcatcg agactgaggg gggtaagtgt tgttaagtct 2580 tctcgagttc tttcagaatc cgaatcctgg cagaagaact tgctaaatcc gtattctagt 2640 tctccgtctt tgaactcctt atttgttaaa aaaaattcgc caccttccat caatcgtaaa 2700 actaggcgga agctctccaa ggaaatcaac agtatttcat tccatacgcg aacgcaaaat 2760 agtgagatag aaagcaccac tatcgaactt aagaaacatt tggctagacc acccggatgt 2820 tcctttttag gagcacgagc tacagaaacc cctgtcagcc ttagcgaggc aggaaatcaa 2880 ccgataccaa tcatcgtgga ttcaggatca gatatcactc tgatctccca gaagacactc 2940 gacgagatgg caaaaccacc taaggtgagg actggtcaac gcattagact ggtgcaagta 3000 accgggaacg caattatcac cggatacgtt acattagacg tctacttcca cactaaggat 3060 ggacctgtcc ttataaaagt agaagcctat gtcgtaaagg gtatgtcagc tccgcttatt 3120 
ctcggaaacg actttgcaga ccagtattca atctcgcttc tcagggagga aggagaaagc 3180 actttgatat tcggaaaatc agggcgatct acgaaagtgc ataattctat catgacaagt 3240 ctactcgacg aagaaggaca tgcattcaag gtacttgtcc gacctgatat cacgtccaga 3300 gcactcaaag ggaaacttca tagaagatcc cagaaaatca agcgaaggat aaaccagaga 3360 acaaaggata actacgtgcg aatcgcgtcc tccgtgcaaa tagcaccgga atctacgaag 3420 ttagtaaggg tacagggcaa cttttgtaaa agcgtcgatc acttacttgt agaaaagaaa 3480 ttagtgacaa cgggtggacc agagagcatt tacggatgcg caaacacgct gataaacagg 3540 aagtcgtcct tcgtgtacat ctcgaacttc tcgaagaaac aggttaatat tcctatagga 3600 caaatattga gtcagggaca cgacccttcg acatggctcg ataaggaaga acaatttacc 3660 aaaaaggaaa ttaacgctat cagcggtcac gccaaccttt taaggacagt tatcaactcg 3720 gaaggaacct ccatcgacaa gaaccctttc gtaaaaatga ccaggagcga agttgatgcc 3780 ttacaggact cttcgcgtca cgattatagc tcggatgacg ttctagcgga accacctttg 3840 gaaggagggc ctaaaacggc cgaaatacca ggggatacga catcgtctgc cgaattactc 3900 aaggagatcg acatttcgcc tgatctgacc aaggaacaag cagacgatct aaaagaaatt 3960 ctcagaagac atgaaggagc attcggattg gaaggaaaat taggacatta cgaaacagaa 4020 gtcgatatcc ctcttctccc taatactaaa ccaatatcta ttccaccgta tcaagccttg 4080 ccagccaacc gggaggtaat cgacaagcaa atggacaaat ggatggagct aggagtgatc 4140 aagccctcca ggagtccgtg gggagccccc gtattcatcg cgtaccgaaa cagtaaacct 4200 agaatggtca tcgacttaag aagattaaat gaaaaggtag tggctgacga atttccactt 4260 cctcggcagg acgaaattct tcaatcactt gaaggaagcc aataccttac tacgcttgac 4320 gcgctcgctg gatttacaca actgagcatc agaaaggagg atcgagaaaa gctggccttt 4380 cgaagtcaca ggggattatt ccagtttaaa aggatgccgt ttgggtatcg aaacgggcct 4440 gccgtattcc aaagggtaat gcaggggatt ttggcgccgt tcctctggat tttcgcactt 4500 gtgtacatcg acgatatagt aatattctcc aagtcgtttg aggatcatct agttcatata 4560 gaattagtat tgaaggctat tgaggaggcg aagataacct tgtcacctgg aaaatgtcat 4620 ttcgggtatc aatccttaat gttacttgga cagaaagtct cccgactcgg gttatctacg 4680 cataaggaaa aagtcgacgc tattcttcag ttagaaaatc cgaaaaacgt acatgactta 4740 caaatgttcc tggggatgat ggtctacttt tcctcatata taccgttcta cgcctggatc 4800 attcacccac 
tgtttcagct gctgaagcgc ggtagtaagt ggaaatggga tgaggatgaa 4860 caaaacgcgt atgatgtatg caaacaggta ttaactcaag cgccggtgcg ggcacatcca 4920 atgccaggac taccgtatcg aatctactca gacgcatgcg acttcgctct tgcagcaatt 4980 ctgcagcaag ttcaacctat taaggttaaa gatctgcgtg gcacgaaaac ttatgaagta 5040 ttggagcgcg ccttcaaagc cggagaaccg atacccgatc ttgcaactca tctagtcaag 5100 gaggattctg atgtgatgcc acaaggcgcg tgggacgtag agtttgaaaa tacgacggta 5160 tatgttgaga gagtgatcac atattggtcc agggttttac aatccgcaga aagaaactac 5220 tcccctaccg aaagggaagc attagcactg aaggaaggac taattaagtt ccaaccatac 5280 ctcgagggag aaaagattct tgcgataacc gatcatgcgg cgttaacgtg gagcaagacc 5340 tttcaaaacg ttaatcggcg actactaacc tggggcctag tctactcagc gtttcctaac 5400 acgaagatcg tccatcgagc cggaagggtt cattccaacg tcgatccaat ctcccgacta 5460 cgacgccgcg taccacctca gagtagtcct ttggatgttc aattcgatcc gttgaagcta 5520 aaaccggcag aagacccgtt gcgaagcatg tttgacgagc ttggccccaa gttcgaagaa 5580 aaaatgttga cagttgcgac ccacttcgcc gaaagcgaac ttcagcttga aagtagtaac 5640 aaacgtcttg acgtaacgat agccctggaa ggtgataagg aaatcgtagt tccgtatgag 5700 acgtctcagt cgtattctac cactgttcaa ataggtgaac aggaaattaa aaggtggaaa 5760 gacgcatacg cacgggactc gcattttaat ctagtaaacc gaagtaacga ggcaaatgaa 5820 ggagtaaaca tcacataccc tcaatatcac ttctccagag aaggacttgt ttacttcctg 5880 gattccaccg ggaactcaag actttgcgta ccaaaggacc ttcgcgccga aattatgcag 5940 gaagctcata atactattac cgaagcggcg cacggaggtt attttaaaac ctacaacagg 6000 attagcgcta cttactattg gcccagaatg tcgagagaaa ttaagatttt cgtcaacacc 6060 tgtgacgtct gtcagaaaat aaagcccaga cggcatgctc cagttggctt acttcagtcc 6120 attccagtcc catcgcaacc tttcgaagtc atcagcatgg attttatccc tgaattgctg 6180 acttcaaacg gatttgacaa tatactagtg atcgttgaca aactcaccaa atacgccatt 6240 atagtcccca ctacaacaaa ggtaacagaa gtagaaacag ctagactgtt cttcaagcac 6300 gtcatttcta gattcggaat tccatgacaa atcatatcgg atcgagatac cagatggcgc 6360 ggagatttct ggaaggaaat ttgccgattg atgggcatga agcggtccct gacgacatca 6420 tatcatccac agtccgacgg acaaacggaa atcatgaatc agggtttgga aatctcgatc 6480 cgcgcatata ttggccctga 
ccgagatgac tggagtgaaa tgctcgacgc tttgttgctc 6540 tcttacaatt cgtcaataca tactgctact ggattcagtc cagcttacct cttatgaggg 6600 ttccaactga ttaccagtgg ttgcatcgta agtcagtctc caagcgtcga ccgaacggga 6660 atttcaaatt caggaagcga tgattgagag acgtcacaca acaaagcgct tgatctagtc 6720 gaaggctttg tcgcagaaag gtcgagagct agggacgctc tcctcctggg gcaagtattt 6780 caaaagaaat cctacaacaa aggaagatta aatttggaat tcaacgaagg agacaaggta 6840 gtgataaatc gacaaaacct tggcttgtta aaggaagaga aaggaagagg aaataaactc 6900 ttagccaggt acgaaggacc ctttgaaatt atgaagaaga tcagtgctgt agcatattgt 6960 ctccgcatgc cagcatctta tgggatgcac ccagtgttaa acatcgctca cctggaaagg 7020 tatcaagaat cacctgacga gtttggggat cgtccacagc tcaagacgaa taggtcagat 7080 ttcgacgccc taccagaata tgaaggggaa aggatagtag ctgaacgaac acgaaaaggg 7140 aagaatggaa gaaaaatccc gatctaccgg ttaagatata caaattacgg acctgagggt 7200 gatacatggg agactagaca aaatctcaag aacgcacctg atgtactact cgaatgggaa 7260 aaaatcaagg cgctccaaaa gaagaactcc cggacaacga gttaggcgta tgcggatgcc 7320 tacttaagga ggacgtgttt tttcggtaaa gactcaacat tcagccaatt ctccctcgat 7380 aaatcctttc taccaaattc cctcagcgtt caattcagtt atcatgcaat cccttgagct 7440 ctataacccc aactccgcaa acttgcatct cacttcgtcc gcagccaccc actccttctc 7500 cgaaaacctt aatccactca actaccagct tacgtccgac gacttcatta cgctcgtcca 7560 gccggacggt caacttttgt atatgatcga tccgtcaact gcgacctcat tgttctgggc 7620 gctacaccgt ttagcagcat acatcgtccg ggcagcatca cacgaccttt acagtgaccc 7680 taccattccg ttcgcctcca gaggctatat tccacaaatc tatttcgata gtatcaacat 7740 gcaacgtcca gacattctta ccatacaaga acatcattcc attaaccctc tgtacggcta 7800 cactactcgc gaaaacctta cggcttggta tcgcaacttc cagatgatgg gtcggctcgc 7860 cgtccaatgg atccaggtgg cacgccgaca aacacttcaa ctaggagaaa caggatggaa 7920 ctgggacgta cggcgcagtg ctcgcgtcga gtacccggat cgccgtctca ttacgggaca 7980 cattacggag aacgacagcg ataactcctt tagtgacacc acctctgagg agttggacga 8040 tggtttaggt ggatggaaag atggggatac cttcgcagga ggattagcag gaggagacac 8100 tcgcctagta ttgcacccaa ccgagtccgc ctatgctaaa aggtacccaa acttcgaccc 8160 ctttgcagga cgacccctca tcgacaatgg 
gtacgttgac aacttcaaca attgctgctg 8220 cgcgatgacc gatcacccca ccgcaagttg tgctcgcttc gaccatatga acaccctttt 8280 tgcggcctct agcgagaaca tggaggagtg ataggatctc gaggtcaggg gggggggta 8339 // ID TKL1_I repbase; DNA; FNG; 4634 BP. XX AC AJ439548; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Kluyveromyces lactis retrotransposon TKL1_I, internal region. XX KW LTR Retrotransposon; Transposable Element; RNaseH; TKL1_I; gag; KW integrase; internal region; pol; protease; reverse transcriptase; KW internal portion. XX OS Kluyveromyces lactis OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Kluyveromyces. XX RN [1] RP 1-4634 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439548; Positions 394 5027. XX SQ Sequence 4634 BP; 1653 A; 1035 C; 913 G; 1033 T; 0 other; agaaaatgtt taaaagatga agaagaggaa tacattgcca acatccacat gacatgtgtt 60 cccaacacca gctatcccga atggttcaaa aagtggctgg agaataacat ggagttcata 120 gattgtttct tagaagctgt ggccatcata tcagaagaaa atgataaacc aaccttactg 180 cgcgaaattt caacattatc tctacgcgac agagaatcga tcgagaaatt tgcacagaga 240 gcaaccagac cctatcaaag agcagagaga gcaaaaatcc ccgagttaga tgagttatta 300 atacaacaag tcctgcaggc actccctagt gaatgtagtg cgacggaatt gatttttagc 360 acaaaagaag acaaaagctt cgattccctt atgaagcttc tattatctaa acaatggggc 420 aagaaagacg ccaagagacc agaacaaaat aacaacaaac accaagggaa acaatggaat 480 aacaattcta atgaaagaaa gcaatacaac aaccgttata acaacaataa ccaatatgct 540 tcaaataacg aaaatagcaa agaaacggta caacacgttt catcaaagca agcttctgag 600 gaccttaggc ttgacaccac ggactcagaa tactaaaatc accaggaacc ccgccaggga 660 cctgatgatc gactccggag caaccatctc tgtagtccat gataaaagtt tacttcacaa 720 cttcaatcca tcaactgatc aacaactttt tgatactcaa gaaaatcaaa tcagagtaga 780 aggtgagggt aacctaattc tgaaattcaa 
gaaagacaag gtaaaggtaa gagccatata 840 tacgacagat atgagcatga atgtcatcag tgacgcacac ttgaaaaatg cgggtatatt 900 cagagacaat cgtcaaccgt ttttgatctc taggacagga aagcgcattg ctaaattata 960 tgaactagga agtctctgct ggattccata ctgccacatc tcaaaaccac atgaacaaac 1020 catttcggcg atatcgatca gagatgcacc aaacagattc tctctggcaa acgtccatcg 1080 ctggttcggc catataaatg tcaaatatat tagagaatct attagaaaag gccatatcca 1140 aggtttgaag gaagacgacg tagattggac cggttatagc tctttccaat gccaacaatg 1200 cttggaaagt aaagccaaga gaaataatca ctacgtcaac tccaggatgg actacactaa 1260 agagtactat ccattcgagt atctccacac ggatttgttt gggccaataa gatgcagatc 1320 cagctatccg cctcagtatt tcattgcttt cactgatgag atcacaaggt tcagatggac 1380 atatccactt tactcaaaaa cagctgaaga agtcgtggac aagttcaaag agattgtcat 1440 gcaattaaag tcgcaatttg gcacaagagt cagaactatc cagatggaca gaggtagtga 1500 atttaccaac aatatgacaa gagcatattc aaagaaagag gaatcttgat cagatacacc 1560 accactgctg actctaaagc tcacggtcta gctgaaagac aacattacac tatgttgaac 1620 gactgcagaa ctcatctgca acacgctaat ttaccaccga agttgtggta ccatgcaata 1680 gtgttctcaa acactgtgcg caataccttc gtcaatagac atactggaac ttctccaaga 1740 aataaagcca gtatggcagg cctctcgttc aaagacgtgc ttcctttcgg acaaccagta 1800 attgctcata tacacgatcc acagtcgaaa ttagattctc gtggtattct aggctatgca 1860 ttgcatccat ctaccgaatc gtacgggtat atcatctatg ttccagaaga aaacaagatc 1920 attgacacga gaaactatgt gatactcaag cacccaccag gagaaattgt ttcccaggac 1980 gaaatagagc agatgatcga gcgtatcgaa aacgaagacg cggctaatat ggaaaatgtg 2040 gaatttaatg aagaaccaaa ttatgcagga tccgcacagc ctatgcctta caatacagac 2100 tatattgcag ataatatcaa cgaaaacttt gagaccatca ctcaagaaat ggcagatttt 2160 ggtctcgaat acgggggtga tacattatca ccagttaccc caagtgacgc tggagaccca 2220 ttcactccaa gtgacgctgg agactcattc actccaagtg acgctggaga ccaattcacc 2280 cctgctccca cagacaataa cagaagagag acaacttctt ctccagaacc gattggagaa 2340 aatagagacc caatctctcc acctgactca aaccaggacg gtatgaatga caatataaat 2400 gagtcaccag agccttacag agctaattca accgcaagct ctgctcaacc tgaagaggaa 2460 caccatgctt ccccaaccaa cctagacttt tcgccaataa 
tcgacggtct agacacgaat 2520 acaaacttac aggaagaacc acagcttccg ccccaacaac ccagcacgga gcaagaacag 2580 ttgccaaccg ctgcagacag tgtccctgac acaactaatc ccaaccccag tgaagatgac 2640 gatactataa acgataagct gaaatccatt cctatcttta atggaaacaa gtacaaaggc 2700 ccgagaacta ccacaaacag tttcgagcct tcgtacgggg gtggaaccta cactgaaaac 2760 tacaatcatc ccacactgga agaagtattt gagtcaattg aacgagatcc cttcaatttg 2820 acatcacaaa aacgaccacg gtcttcaaac acctaccgcg aatcggatca agacagctac 2880 gatagcgggg gtgactttga tgaagaacaa gaacaagagt gctcttcgga caactcacca 2940 gctcgtccac caaacaagag gatccgacgt gtgaactacg ttcatgcaat tgaacacaca 3000 aaagggatca tgcaaatgaa tacttctatc aactactccg aagcaatatc tcgaaacaga 3060 aatgagaacg agaaattagc attccagaat gcatacaaga aggagatagc tcagctaacc 3120 aagatgcaca cctggagcga ggagttgata gatgctacct tgattcccaa acaaagaatt 3180 ataaactcaa tgttcatttt caccactaaa agagacaatt ccaagaaatg cagactagtt 3240 gcacgaggag atcaacagtc attggaaacc tacgatcctg accaaaaggc tactacggta 3300 caccatctcg ctctaatgac cgttctcgcc atggcattag accataatct gaccgccttc 3360 caactcgaca tttcatcggc atatctctat gcagatctca aggaagaact atatatcaga 3420 gcaccacccc acatgaatgc aagaaacaag gtcctgaaac taaataagtc cttgtacggg 3480 ttgaaacaaa gtggagctaa ttggtgcgct ctgatcaaac agtttctcgt tgaggaatgc 3540 ggactgatcg aagacagatt ctggaagtgc gtgttcacag ccaaagagcc attgaaactc 3600 gtcgtgtgtc tatttgtaga cgacatgttg gtagtcggca gaaacgatga acacatcatt 3660 gaattcatct ccaaactgtc caaaaggttt gacaccaaag tcgtaaacga cggtaatcat 3720 agggaagaag atggtgttaa tgaatatgac attctcggag ttgaattaga atacaagaaa 3780 ggagaatata tgaaattcgg aatgcagaaa tcactggaag ataaactccc acatttagga 3840 gtaccgctat actcccatcc tagacagaga aaggttccag gatgtcctgg agactatatt 3900 tcttcaggag aggacctcac tctcgatgat caggattaca aggataaagt gaagcatttg 3960 cagagactag taggacttgc ttcctatgta ggacacaaat tcagatttga aatcctatac 4020 tatgtgaaca tcttggcaca acaccaatta tatcccagtg ccaaggtttt agatcgtgct 4080 gcacaattgt gccagtactt gtgggataca agagacaaaa agttgatatg gaagtattcc 4140 ggaccaaaaa ataacgaaat tactgccatc tcagatgcag ccttcgccgg 
aaacaatgat 4200 ttcaagtcac aatctggagc gctatatctc tggaatggca agacaattgc tgcaaaatca 4260 tccaagatta aactaacatg tgtctcatcc accgaggcag aaatttactc aataagcaac 4320 tgcgtaacgt acttaagagg aatagaaata ctggtggata agttgctaaa cacaaaatcc 4380 ataataaaac taaagactga cagtcaacca gcaatggcaa taataaagag gaaagaagat 4440 gcagagtttc tcaagaaaca cactggtagt agggcaatga ggataagaga cgagtgcaac 4500 gaacttggac tagtcctaga atatatccca accaaagaga ataccgctga tatcttaacc 4560 aaacccttga gtatgaaact attcaaactt ttaacagagg actggataca atagctttct 4620 cctagtcggg ggtg 4634 // ID Copia-1_UM-LTR repbase; DNA; FNG; 412 BP. XX AC AACP01000055; XX DT 28-JUN-2010 (Rel. 15.06, Created) DT 28-JUN-2010 (Rel. 15.06, Last updated, Version 1) XX DE LTR retrotransposon from the Ustilago maydis genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_UM_; KW Copia-1_UM-LTR; Copia-1_UM-I. XX OS Ustilago maydis OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina; OC Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Ustilago. XX RN [1] RP 1-412 RA Kamper J., Kahmann R., Bolker M., Ma L.J., Brefort T., RA Saville B.J., Banuett F., Kronstad J.W., Gold S.E. et al.; RT "Insights from the genome of the biotrophic fungal plant pathogen RT Ustilago maydis."; RL Nature 444(7115), 97-101 (2006). XX RN [2] RP 1-412 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Ustilago maydis genome."; RL Repbase Reports 10(6), 841-841 (2010). XX DR Genome; AACP01000055; Positions 3608 4019. 
XX SQ Sequence 412 BP; 125 A; 107 C; 90 G; 90 T; 0 other; tgaagataca atggtcgacc aagaccattt cggcatagtc cctgcgcgat caaacggaaa 60 ttgcgtgcga ttacccaggg ttactcaacc gaaattaacc ggggttactc aaccgaaatt 120 aaccggggtt actcaactaa atgtgagcag agatgagata gccctcgcct actctgcaag 180 agcctaccac cgaaaggcac acaagccacg tcccgcacag agcgaagacg ggaaggaaaa 240 gaggagatgg atgtcccgag tcgcaggacg agcagtttca taaagacata aatatgctca 300 ctttgtatgt agtctcgagt ccagaaaatc taagcgatct tttcaccatt ttcttccctt 360 catcaaaact gccgagaacc atcacgggtt tttcccccct aattagttct ca 412 // ID Gypsy-101_MLP-LTR repbase; DNA; FNG; 1407 BP. XX AC AECX01000545; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-101_MLP_; KW Gypsy-101_MLP-I; Gypsy-101_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-1407 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000545; Positions 39991 38585. 
XX SQ Sequence 1407 BP; 508 A; 274 C; 215 G; 410 T; 0 other; tgtcagtcct cgagtgtctt gtaagataaa gaacaccaga agaacttcac aagataacat 60 aagtataatc actaaactct cagtctatat taaaatgata aaacaataaa aatatatata 120 cataatttaa aaagtagatt tatctacaac cacaccttag gaatctagtc tcaaagattc 180 aacgaaaaca acttatagga cctagtttca aaggtctagt atgaagcaga agtaatataa 240 aacaaataac cagcataatt tgaaatcact catctacacc ttcactatgc taagtaagaa 300 tattgacccg gaaggagaag aagtatgaag ttatgacaat aactaaaagc aacttaattg 360 aaatacgtag gaaccactgg tcaacgctaa caaagaaaga gtagccaaga gaaccggccg 420 gaaaacaaat gacaagttct ataagaagtg atcacaacaa agactagcca agtacgtaaa 480 taaactagca aagagaaaat ccgtcattac gtatataaaa acttacctaa ggagtagatg 540 tttactgctc aaggaatata tcatcgatag aagaagatgt caaaggtacg aagtactata 600 agaagtattg aagtagtacg catacaaatt tacataatga atgtaacctg aaagctataa 660 gaagccctct tcttagctga ctagaagaaa gggcatcgaa ttttacacat ccaatttctt 720 aagaacgtta gtattccttt tctcgaagtc cagcgtaata aaccatttta aaatacctta 780 ttgcttagaa atgatatttt aaaatccttt atccgttcca cttcttaaaa taaccatact 840 agctatccta acaagttgac ttcgtaaaat aaacgtatct accttcaaat ctgttttctc 900 tcgaaatcgc cagtatactg atcctagatt attcttgaat cattattacc tctgtcttaa 960 atacttagtc acagtttcca gttacctatt tgatcacata ctttccaacg gttaatcatc 1020 agttctccca agactactta ttgcttacta gatctaaagt aagttcctcc tatctggtca 1080 atatcctcag ttctcataac gtgaagttat acgagatccc tgtctattta gttagcaggc 1140 aactctaagc ttattgcttc agagtaaaaa ctcgggtcta ggattgacca tcctctactg 1200 tctctctcag taggcccgtg gttttagtcg tgaagagctc ttattatttg atatcttcag 1260 aagtattaac gtacaactta ttgtttgtaa tatatagttc acattagcag tacctattcc 1320 acgtacccag gggaatagag ccttagtcag gaattaacta cgtaagttct agaccttact 1380 agtgcccttt aggtagccac cctgaca 1407 // ID Mariner5_AO repbase; DNA; FNG; 1270 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE A family of Mariner/Tc1 DNA transposons- a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner5_AO. 
XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-1270 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-1270 RA Kapitonov V.V. and Jurka J.; RT "Mariner5_AO, a family of Mariner DNA transposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 34-34 (2006). XX DR [2] (Consensus) XX CC It is a family of Mariner DNA transposons (Tc1 clade). CC Mariner5_AO elements are characterized by TA target site CC duplications and 26-bp TIRs. They encode a 349-aa Mariner5_AOp CC transposase. XX FH Key Location/Qualifiers FT CDS 117..1163 FT /product="Mariner5_AOp" FT /translation="MAPRKELDPQIRLRICELHSIGWGPTKIYRQHPEIPL FT STIKTTIRRESIRVNNTTRARSGRPRKLTEQQRDHIYDLVQSEPHIRHIDL FT LTEVDHVVKRRSIQYLLSEMGCRKWKQLNRPEIKPIHAAKRLAWARRYEHY FT TAEDWARVKWSDEYMVERGIGVRPTWTFLRPRDQLKNHDIHAKPCGKGIKQ FT MFWAAFGEDIRTGLVPLDSDPESAQGGVTAAIILALYRAFLPDLLQEGDIF FT MHDGASIHRTYIVREGLEEMGIEVMEWPPYSPDLNPIENLWALMKEEIYKL FT YPELLTAPNTASIRELLTKAAQEAWHSIENRVLVRLSTTMPHRVQAVIEAD FT GWYTKY" XX SQ Sequence 1270 BP; 359 A; 281 C; 287 G; 343 T; 0 other; cagtgggatg caaaaagttt gaaaccagcc gcttttaccg ccctacatac cgccttaggt 60 caacccaacc gaattcaggc gcctgaacct atacacgtcg atatacgcgt tgaattatgg 120 ctcctagaaa ggaactagat cctcagattc gcttacgtat atgtgagctg catagtattg 180 gatggggtcc gactaaaatc tatcggcagc accctgaaat acctctttct actattaaaa 240 ctactatacg tcgagagtca attcgggtta ataatactac gcgcgcgcgt tctggccgcc 300 ctcgaaagct tacagaacaa caacgtgatc atatatatga tttagttcaa tcagaacctc 360 atatacgaca tattgattta cttacagagg tagatcatgt ggtcaaacgc cgctctattc 420 agtatttact gagcgaaatg ggttgccgca aatggaagca gttaaatcgg cctgaaatta 480 agcctatcca cgcagccaaa 
cgccttgcct gggctaggcg ctacgaacac tatactgcag 540 aggattgggc gcgcgttaaa tggagcgatg agtacatggt tgaacgaggt attggggtac 600 gtccaacctg gacctttcta cgtccccgag atcagcttaa aaatcacgat atacatgcta 660 agccctgtgg aaaaggtata aagcagatgt tctgggctgc ctttggagaa gatatacgca 720 caggccttgt cccactagat agtgaccctg aatcggctca gggaggtgtt actgcagcta 780 ttattcttgc tttatatcgt gcctttctac cagatctatt acaagaaggt gatattttta 840 tgcatgatgg agctagtata caccgtacgt atatcgtacg tgaaggcctt gaggaaatgg 900 gtatagaggt gatggaatgg ccaccttatt cgcctgatct gaatcctatt gaaaatctat 960 gggcgttgat gaaagaagaa atctataagc tctatccaga gcttctaacc gctcccaaca 1020 ctgcttctat acgtgagctt ttaactaagg cagctcagga agcctggcat tctatagaaa 1080 atcgtgtttt agtacgcctt tctactacta tgcctcaccg tgttcaggca gttattgagg 1140 cagatggctg gtatactaaa tactagtaat ggttaaattt gcgaggtttc tgcctggcag 1200 aatgacctaa ttaggaaggc ggctagatta ggcaataaaa ccacggtttc aaactttttg 1260 catcccactg 1270 // ID Gypsy-3_CCO-LTR repbase; DNA; FNG; 495 BP. XX AC AACS02000001; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_CCO_; KW Gypsy-3_CCO-I; Gypsy-3_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-495 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000001; Positions 81538 82032. 
XX SQ Sequence 495 BP; 73 A; 152 C; 77 G; 193 T; 0 other; tgtcgtgaac agcgggtacg agcccctatt tttccattat cttatcttat cactttcatc 60 tcccttatcg ttccctcctt tattctcatt tctaccttta ttctcgatac ggactctttc 120 tctcatggaa cactgcctcg atgtaccctc tgccctattt cttattgttc ttccttctcg 180 cactgcctcg tgactccctt ctattgtctt ctccttgttc ttcgcgcagc cccgcgcagc 240 cttgtgaccc tctttattgt tcttctgtgt ttcaccaccc tttcgtcatt ttgactcatc 300 gcggctatgt agactgtaga ataggtatat aagctaggta gattcctctg tatttcctca 360 gcttgaccct cttcgtcaat acatcgcttc agccttcatc tgcatttgtc gctcatcgct 420 cttatcttac tcgccgctct tatctgctcc gcatcactgg tctttggttt ggtttagtgt 480 tcgctcttcg cgaca 495 // ID Gypsy-109_MLP-I repbase; DNA; FNG; 5650 BP. XX AC AECX01000596; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-109_MLP_; KW Gypsy-109_MLP-LTR; Gypsy-109_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5650 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000596; Positions 44170 49819. XX CC Positions [4449-4928] - Integrase core CC 'GGGGGG' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 409..1386 FT /product="Gypsy-109_MLP-I_1p" FT /translation="MDARLLEETRRREEAEQGRAEAEQRLAAMQQATSTAQ FT PQSRVEPAVINNFGPVKTPKVATPDKFDGTRGSKAEVFVNQVGLYILMNPS FT QFPDDKTKIGWTLSYMTGKGGEWAKPITQKLLDNSEGAISTWEAFLKSFQS FT TFYDSQRVVKAEKAIRDLKQTGTVLAYSLKFKEFAFIVDWPEHVLISQFEQ FT GLKSEIRVQMVRDVFVTMDDIIELSIKIDNVLHKRGDDGLGEVKVAPVVDP FT DAMDCSAFRFRISEDEYKKRRENNLCFKCGRSGHIARVCGTGNLGRNWKGK FT WRGNEKVNTSEVKLEEGSKETGRADEAKNGVAQE" FT CDS 1437..5552 FT /product="Gypsy-109_MLP-I_2p" FT /translation="MGAIELEIDLIEMKDNRLFATVPIYDPNLETTYFARA FT LFDTGATHDVLNQAFVLKNGLTTTKLPQPKPVTGFNGSRSSITHIGEYVLD FT IDGNENLTPFLISKLKDSVDCIIGINWISKNYEKLDWKNRTLKTNLTSIAA FT TEVVSSRPKTILEKTRKEPLGQARIRDEGVCISNDTLAPPQCECAITLPTQ FT LCEAAGKLDHPLISRHPTNTTKSTTGTKKERLLTVAAAEPASSKPKTIPDV FT TREETLGQARNFDEGVCISNDTLAPPQHECDSPLIPLRQEAASKRFHPLQL FT GHSRSRVSLTRNNTRSYLPTRSILKTPQLDAARASWNVSAKLAAERTESKP FT SLSAAELVPEVYHEYLPMFEKSCSKVLPPRRPYDFCVDLVPGATPQASMVI FT PLSPAENEVLKEMIEEGLTAGTIRRTTSPWAAPVLFTGKKDGRLRPCFDYR FT KLNALTVKNKYPLPLTMELVDSLLNADRYTSLDMRNGYNNLRVREGDKAKL FT AFICKEGQFEPLTMPFGPTGAPGYFQYFVQDIFRNRIGRDMAVFLDDILIY FT TKPGEDHEKAVKDVLDTLREQNIWLKPEKCKSSQKELTYLGLKLSHNKIAM FT DEDKVKAIAEWPTPKNVSEVQQFVGFANFYRRFIEDFSRIARPLHNLTQKK FT VAFEWTADQQKAFDSLKQSFIKSPVLKIADPYKQFVLECDCSDFALGAVLS FT QVSKDNGELHPVAFLSRSLIQAERNYEIFDKELLAVVASFKEWRHYLEGNP FT HRLNVIVYTDHKNLESLMTTKELTRRQARWAETLACFDFEIRFRPGKRSTK FT PDALSRRPDHKPEDGCKLTVGQLLKPANLPADAFIHEVEMVGKDEDNIAYF FT IDSEEEEVYFMDNGLIEDDDDDERVWDDSVILNEVRNKLYGDEKLKKLIAA FT CKAQEDGSWEDYVFTGGLLYFKGMVEVPSDKTLRRRIVESRHDSILAGHPG FT RMKTLSLVKRNYHWPSMKAFINAYVDGCHSCQRVKSRSSKPFGALQPLPVP FT AGPWTDVCYDLITDLPESEGKNCILTVIDRLTKMCHFVACNTTMSSEELAK FT LMVKNVWKYHGTPKTITSDRGNIFISELTKELNQQLGIRTQSSTAYHPQTD FT GQSEIANKAVEQYLRHFVGYKQDNWCSLLDLAEFAYNNSPHTSTGISPFKA FT NYGYDLTYSRIPSSEQCIPAVEEMPYQLKEIQDELKVSLRLAQQTMKEQYD FT KHKGESPDWAVGSRVWLDARHISTTRPSAKFLHRWLGPFVISARVSTNAYR FT LILPESLSRVHPVFSVGLLQPYNESSIKGQQQQPPMPITVDSQIEYEVEEI FT LDKRRKGKQIEYLVSWKGYGPEDDTWEPSNGLKNAKELVDQFNKKYPNAEK FT GYKRTRQVK" 
XX SQ Sequence 5650 BP; 1823 A; 1158 C; 1326 G; 1343 T; 0 other; tattgtagca tctacaacct tcagacgtca gagcaagtga agtaaaagaa gaagaagaaa 60 gatcaaaaga ataattatcc aggaagtttt ataattaaaa gttaaactca gaagaagaag 120 taaataaaag tttaaatcaa ttcaaagcaa ctcaccgatc taaatccaca cattcccaca 180 cctttaatca aaccttatct cgatcgcctc aattgtaact ccgtaacctc tcgaaaccgt 240 cacgacgccg aactttacaa ccccggaaag cccttctagc tccgcatctt ctagtcaacc 300 ccacacgtac aattcagctg tatccgaaga attagagttg gacataatgg aagactcaac 360 tacggctcca ccacccgatg cactagccca aatacttaat cgcttagcat ggatgcccga 420 ctattggagg agactaggag aagggaggag gccgaacaag gtagagcgga ggctgaacaa 480 cgcctagcgg caatgcagca agctacgagt acggcacaac cacagagtcg cgtagaacct 540 gcggtcatca acaattttgg cccagtgaaa actcctaaag tagcaacccc cgacaagttt 600 gatggcactc gaggtagcaa ggcggaggtt tttgtaaacc aagttggtct ctacatactc 660 atgaatccct cccaattccc cgatgataaa acaaagattg ggtggactct gtcgtatatg 720 acaggcaaag gtggagaatg ggctaagcca attactcaaa aattgctcga caactcagag 780 ggtgcgattt cgacgtggga ggcgttctta aaatctttcc aatctacttt ctacgattcg 840 caacgagtag tgaaggctga gaaggcgatt cgagatctca agcaaactgg aacggtgctc 900 gcttactcac ttaaatttaa agaattcgct ttcattgttg attggccaga acatgtattg 960 attagtcagt ttgaacaggg tttgaaatct gaaattcgag ttcaaatggt acgcgatgtt 1020 tttgtgacaa tggacgacat tatagaacta tccatcaaga ttgataacgt gcttcataaa 1080 cgtggtgatg acggattagg agaagtgaaa gtagcacccg tagtggatcc tgatgctatg 1140 gactgttcgg cttttcgttt tagaatttca gaggatgaat acaagaagag aagagaaaat 1200 aatctttgtt ttaaatgtgg aagaagcggt catatagcaa gagtgtgtgg aaccggtaat 1260 ttaggaagga attggaaggg gaagtggaga ggaaatgaaa aggttaacac atcagaagtg 1320 aaattagagg aaggtagcaa ggagactggt agagcagatg aggcaaaaaa tggcgtagct 1380 caagagtgat agttgtcccc tcttcgagct ccttaggtga tgagtcatta atagacatgg 1440 gtgcaataga attagaaatt gatttgattg aaatgaaaga taatagatta tttgctactg 1500 tgcccattta tgacccaaac ctagagacaa cttatttcgc ccgtgctttg ttcgacacag 1560 gagccacaca cgatgttttg aatcaagctt ttgtgttgaa gaacggcctt accacaacca 1620 aactccctca acctaaaccc gtgactggtt 
tcaatgggtc acggtcgtca attacgcata 1680 ttggtgaata tgtgctggat attgacggaa acgaaaacct tacaccattc ctaatctcaa 1740 aactcaagga ttcagtcgac tgtattattg gtataaactg gatcagcaag aactatgaaa 1800 aattagattg gaagaaccga accttgaaga caaacctgac ttccattgcg gctacagagg 1860 tagtctcgtc tcgaccaaaa acaatcctgg agaagactcg aaaggaacct ttgggacaag 1920 ctaggattcg tgacgagggg gtgtgtatct caaatgatac gctagcaccc ccgcaatgtg 1980 agtgtgctat cacccttcct acccaattgt gtgaagcagc tggcaagctg gatcatcctc 2040 tgattagcag gcaccccacg aatacgacga aaagcacaac tggtacaaag aaagagagac 2100 tactcactgt tgcggctgct gagccagcgt cgtcaaaacc aaaaacaatc ccggatgtga 2160 ctcgagagga gaccctggga caagctagga attttgacga gggggtgtgt atctcaaatg 2220 atacattagc acccccgcaa catgagtgtg atagcccttt gattccatta cgtcaagaag 2280 cagctagcaa gcgttttcat cccctacagt taggtcacag ccgtagccga gtcagcctaa 2340 cgaggaacaa caccaggtct tacttaccaa cacgatcaat attgaagact cctcagctgg 2400 acgcagctcg cgcgtcgtgg aatgtatcag caaaactggc agcagaacgc acggaatcaa 2460 aaccatcttt atcagcagca gaactggtac cagaagttta tcacgaatat ttacctatgt 2520 ttgaaaaatc ctgttcaaaa gtcctaccac cacgaagacc ctacgatttt tgtgtggacc 2580 tggtaccggg cgctacacct caggctagta tggttattcc attatcaccg gcggagaacg 2640 aggttctgaa ggagatgatc gaagagggac tgactgcagg tactatacgg cggacgacct 2700 caccttgggc agccccagta ctattcacgg gaaaaaaaga cgggagatta cgaccctgct 2760 tcgactatag aaaactgaac gcacttacgg taaaaaataa atatccatta cctttaacaa 2820 tggaacttgt ggatagttta ctgaatgccg atcggtatac atcactggac atgaggaatg 2880 gctacaacaa tttgcgcgta cgggaagggg acaaagctaa gttagcgttt atctgtaaag 2940 aaggacaatt tgaacccctc actatgcctt tcggacctac gggggcccct ggctacttcc 3000 agtattttgt acaagacatc ttcaggaatc ggattgggcg ggatatggca gtgttcctgg 3060 atgatattct gatttatact aagcccggag aagatcatga gaaggcagtg aaagacgtac 3120 tagatacact acgtgaacag aatatttggc taaagccaga gaagtgtaaa tcctctcaaa 3180 aggaacttac atacctaggc ttaaaattat cacacaacaa gattgccatg gacgaagaca 3240 aagtgaaggc aatagctgaa tggccaacac caaaaaatgt cagcgaagtt caacagttcg 3300 taggattcgc aaacttttac cgaaggttta ttgaagactt 
ttcaaggata gcacgaccct 3360 tacacaacct aactcaaaag aaagtagcat tcgaatggac ggctgaccaa cagaaggctt 3420 ttgattcact gaaacaatct tttatcaaat caccggtatt gaaaatcgcc gacccgtaca 3480 aacaattcgt cttagaatgt gattgctctg acttcgcgtt aggagcggtt ctgtcgcaag 3540 tttccaaaga caatggcgaa ttacacccgg tagctttctt gtcaaggtca ctgattcagg 3600 ctgagcgaaa ttacgagatc tttgataaag agctgctggc ggtagttgcc tccttcaagg 3660 aatggcggca ttacctagag ggaaatccac atagattaaa tgttatcgtg tatactgatc 3720 acaaaaattt ggaatctttg atgacaacga aagaacttac acgtcgtcaa gcaaggtggg 3780 ccgagacgtt agcttgtttc gacttcgaaa tcaggtttcg accggggaaa agatcaacaa 3840 aacctgatgc tttgtcacga cgacctgacc acaaacctga ggatggatgc aaattgacgg 3900 ttggtcagtt attgaaacca gcaaaccttc cagcggatgc ttttatccac gaagtcgaga 3960 tggtaggaaa ggacgaggac aatatagcgt atttcattga ttcagaagag gaagaagtgt 4020 actttatgga taatggactg attgaagacg atgatgatga cgagagagtg tgggatgata 4080 gtgtaatttt gaatgaagta agaaataaat tatacggtga tgaaaaattg aagaagttga 4140 tagctgcttg taaagcacaa gaagatggat catgggagga ctatgtgttt acgggtggat 4200 tattgtactt caagggaatg gtcgaggttc caagtgacaa gactctgcga cgacgtattg 4260 tcgaatcacg gcatgacagt atcctagcag gccacccggg tcgaatgaag actttaagtt 4320 tagtcaaaag gaactaccac tggccatcta tgaaagcctt tatcaacgcc tacgtcgacg 4380 gatgccattc atgccaaagg gtcaaatcaa gatcgtcaaa gccctttgga gcattacaac 4440 cattgccagt acctgcaggg ccatggaccg acgtttgcta tgacctcatt acggacttgc 4500 cagaatctga aggaaagaac tgtatactga cagtaatcga tagattgacc aaaatgtgtc 4560 attttgttgc ttgtaatacg actatgtctt cggaggaact agctaaattg atggtgaaga 4620 atgtgtggaa atatcacggg actcccaaga caattacgtc agatcgggga aacattttta 4680 tatctgagct gacaaaggaa ctaaatcaac aattgggtat tcgaactcaa tcatcaacag 4740 cttatcatcc gcaaacagac gggcaatcag agatcgcgaa taaagctgtg gaacagtacc 4800 tacgtcattt tgtaggctac aagcaagaca actggtgcag tttattggat ttggccgaat 4860 tcgcatataa caatagtccc catacctcaa caggcatctc accgttcaag gcaaattatg 4920 gctatgactt gacgtattca agaatacctt caagtgagca gtgcataccg gcagtggagg 4980 aaatgccata tcaattgaaa gaaattcaag acgaattaaa agtatcatta 
cgactcgccc 5040 agcaaacaat gaaggagcaa tatgataagc ataaaggcga atctccggac tgggctgtag 5100 gttcaagagt ttggcttgac gcaaggcata tatcaaccac tagaccaagt gcaaagtttt 5160 tgcacagatg gttgggaccc tttgtcatct ctgctagagt atcaacaaac gcttatagac 5220 tgatactgcc tgagtctttg agccgagttc atccagtttt ttccgtaggt ttacttcaac 5280 cgtacaacga aagctccatc aaagggcagc aacaacaacc accgatgccc attacggtcg 5340 acagtcaaat tgagtacgaa gtagaagaga ttctcgacaa gaggagaaag gggaagcaaa 5400 ttgaatactt agtaagttgg aagggatatg gcccagaaga tgatacatgg gaaccctcaa 5460 atggtttgaa gaacgcgaaa gaattggtgg atcaatttaa taagaagtac cctaatgcag 5520 agaaaggata taaaaggaca cggcaagtaa agtgagaagg cgatgctttt tccccacagg 5580 ggttttttaa tgctagcctg gggaaggacg tcagctcaac aagagggagt gggacgtaga 5640 ggggaagtaa 5650 // ID Gypsy-18_LBS-LTR repbase; DNA; FNG; 382 BP. XX AC ABFE01001149; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_LBS_; KW Gypsy-18_LBS-I; Gypsy-18_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-382 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001149; Positions 724 343. 
XX SQ Sequence 382 BP; 99 A; 110 C; 79 G; 94 T; 0 other; tgtgatacgg aataggagaa tacgcgcaga ctcgcttaca taatcttgac ttcttgacga 60 ttattgtacg tagcaatctc tagacattaa accaccatca tcttgctacc aagctcgcgt 120 gatgagtcta aggtgtccac tgctggtaag accaagggtg agccccctta caatgctccc 180 tcctccacag gatacgagag cccttcctct gggttaacga gggtctgcct aggacttcta 240 ggcaacccta ggagacgagc gcgacctcaa tagtcgcgtt ctctccactt acttctattc 300 atcgatacta tactacaaca ttcaactacg gtactacgag acttcacgcg cacgaagtcc 360 ccgcacgcac tgaggtttca ca 382 // ID Copia-40_MLP-LTR repbase; DNA; FNG; 276 BP. XX AC AECX01001591; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-40_MLP_; KW Copia-40_MLP-I; Copia-40_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-276 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001591; Positions 48003 48278. XX SQ Sequence 276 BP; 63 A; 64 C; 46 G; 103 T; 0 other; tgttgggata gtcaagtgtc atacgaagtc ctgtctaaga ttacagtcct gcgtgacttt 60 tccaatgtcc tgtcaagtcc tgtgtgacta tctcctatgt tcttatttca tttcatactt 120 acataatatt tcatatttca tttcataact gtctgcttag caacttatgc gtcatgtgac 180 atctagcagt gtaatactta ctatataagc cagctggaac agcgcaggaa aggtgctctc 240 tcttcacctt tccttgcttc ttctcttgct gttcca 276 // ID TCA2_I repbase; DNA; FNG; 6134 BP. XX AC AACQ01000023; XX DT 04-AUG-2005 (Rel. 10.08, Created) DT 30-AUG-2005 (Rel. 10.08, Last updated, Version 1) XX DE Copia-like LTR retroelement from Candida albicans (internal DE portion). XX KW Copia; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; TCA_LTR; internal portion; TCA2_I. 
XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-6134 RA Jurka J.; RT "TCA2: A recently inserted Copia-like element from Candida."; RL Repbase Reports 5(8), 226-226 (2005). XX DR Genbank; AACQ01000023; Positions 13185 19318. XX CC LTRs are 100% identical. This appears to be a very recent CC insertion. XX FH Key Location/Qualifiers FT CDS 190..1104 FT /product="TCA2_I_1p" FT /translation="MAEFSDAELRKMMGTLSLLVQDSRREINHLHDKLENN FT SDSKYQSLETYINSKYADTIKSFEKLKYLDIDNSELVNTWIMCFNQVKRFH FT PQVFDAFMEAENEDEIGIEKIQYTPYTGKHLNDMIRIFYMKISELIERKVS FT PNVSREMNDGQPQFVPNLFKKVYEMIISKPDVSAAERIGKALFKLQSKLRE FT LERESAFLLCQHLMTNDHQHDDIILKFLVSGVSPWYLHLQIYMLSYKLGFS FT NLFLEIYAQHYELYKADPIYKLPDSMTLLNEIRSNRDYPKVVNAAKNTVQV FT NNVSSKNNKKKDE" FT CDS 1760..6100 FT /product="TCA2_I_2p" FT /translation="MTNKVERVTYVSIRNIKQEVADKYMIKDLYYYHLLIN FT HLSHEKLQLLVKRGVIKPVKSTSAESAILNCQICVAAHAKLASHNHTQQRE FT LERPLQRLHLDTAGPFTSNKTKSYLTTVIDQFSRYTEVIVSDTKAVKQSIL FT HRLRVWNNRFQFKIAEIRYDNALEYPSAEELEELGIYKHLLPNYSPMLNGT FT AEATNRPIVQGIYKVVLNFSCQVLILFPFIVEYAVHIRNHTPIKEFDGATP FT YERYYGLSKYVIPFFQFGTDVLIKCASVQEAISLKLPSSRDKAFPTVMFGA FT FLGYGSDSFTFRVLVSTKGYPVITTSNIRPIATMQVLNDYLAYISENSSIS FT YDDTFLSPLNHPMIRTNQHDRRGDNINVEYENRPNVPFEYHAEPPRTNSST FT GIIDRPDIRPRADPTWQRMPDANIHQETTTVQTPDHGELDTMINNEHQLPR FT SGEGNYPGQQVRTDIIGQFRDRGPTTLNTPIDLGVPDETDDISMTSENPID FT SPNSEMIISPSLPTNELEHQIDISSGEMSLLQTNMEADNELKTNEMVLYKS FT KNDGIIIQQQQFTENLSDENEEDSSTDEETLEDKKQQRLEYNISPNDEWIN FT NDVQNEDDTQVPHVKEPINYETQSRNETNMPRIEMGIIENLSDDGKNTPRE FT LRMVTYDNNKKIQKYQNSNIEILEPRNENKNHTFIESNLELLDNQEMFQED FT PQVEDIRLTTPKKDKSLSPDFNQTHNEIQLFMADINEDMLEEYDENINMNE FT VLADSTETLDKELDLDEESGRIEYIADRVRKKNRGTDGAPHGKYLKKNDKD FT FGSIKSQKKSDAQMDDEVGIAISKIRNFPFRLKDGRASFFPPYKTKFGRSV FT HPPKRYLNAIVKKIDYNQKEWRQSMEEEIEKFKANQVYTVEKTPKNVVPLK FT TMWVHTYKTNDLKNHNYKSRCVVMGNYMVENRDFDPHAISSPVVDLTSIRL FT LSAIAVENNLVMHQLDIASAYLNASLEDGRVIFVRPPRGFEVKPGYSWRLH FT 
KSVYGLRQSAHNWYSHFKNVLEANGLKQTLHNDGIFWKNYENGDVLYVSVY FT VDDVFMIATNEKIIKEFVAMLETYFQLQYFGEATEYLGIQFRKTPDGYTLD FT QIPFLEKLVATFNIQDSYGKDIPIIPKDINVVKQLRKSSQINDFVKLEKPQ FT KLINSVSKTKYIDDYEENEEHDFKESPQADPLSAKGIKLYQSAVGLLLWAS FT MNTRPDLAFSVNQLGAKCAHPDVDDWKRLMYCLRYIKKDMDFKLEYKRGRL FT NNKSKDFIIECFSDASFAPDLDRKSITGTSIFVNGNLVAWATKKQKIITHS FT SAACEMLALNYTILKAFDLRNTIEDLELKIRNLHVHEDNQAVITILKNDNF FT HPHRPIDICYKFLRQKLKDGFFSISYVESGDNLADSFTKALGRNKLIEHTK FT RIRERKDYDNNATLIVDVRTLEEIKINKKLVHH" XX SQ Sequence 6134 BP; 2275 A; 1030 C; 1127 G; 1701 T; 1 other; gattagaagc ttggtaaatc tttggttatt catcacgtct tgagaataat acaaagttta 60 atatagtatt ttcaaatttt ggaatacaaa agttgctaat tggtaaataa gttattgatt 120 tatttcataa atcttttttg gtatcatatt tcaaagagtt gcaattgaaa gctaaagaca 180 tccttataaa tggctgaatt tagcgatgct gagctcagaa agatgatggg tacactttca 240 ctcttggtac aagattccag gagagaaatt aaccacttgc atgataagtt ggagaacaat 300 agtgactcaa aatatcaatc tttagaaacg tacatcaact caaagtatgc agatactata 360 aaatcatttg aaaaattaaa atatttggac attgataatt cagagttggt taatacctgg 420 atcatgtgtt ttaatcaggt taaaaggttt caccctcagg tttttgatgc tttcatggag 480 gcagagaacg aggacgaaat tggaatcgaa aagatccaat atacgccata cacaggtaaa 540 cacttgaatg atatgatcag aatcttctac atgaagatat ccgaattaat agaaagaaaa 600 gttagtccaa atgtttctag agagatgaat gatggacagc cacaatttgt tccgaatttg 660 tttaaaaaag tttacgagat gattatttca aaaccagatg tttctgctgc tgaaagaatt 720 ggaaaagctc ttttcaagtt acaatctaaa ctgagagaac ttgaaagaga atcagcattt 780 ttgttatgtc aacatttaat gaccaatgac caccagcacg atgatattat tcttaaattt 840 ctcgttagcg gtgtctcacc atggtactta catctgcaaa tttacatgct gtcatataaa 900 cttggattct caaatttgtt tttagagatt tatgctcaac attatgaatt gtataaagca 960 gatcccattt acaaattgcc agatagtatg acattgttga atgaaataag atcaaataga 1020 gattatccta aagtggtaaa tgctgcaaaa aatacagtac aagtcaataa tgtttcatcc 1080 aagaacaata aaaagaagga tgaatgacaa caattagcca ataaaattga ggaagtagga 1140 cgttatagcg aaataaacgc aacatctaca tatcatgaaa ttggcgatac caacaaaaac 1200 aaagaacaat taatattgaa tttgaaaaat catacaaaat taagtgaaca 
aaagaagaaa 1260 acaaacctat tggtatatga tctgggagcc acagtatccg tggtgaatga taagacttta 1320 cttaacgaca ttaaagaatc aaatatcgaa attgcaactg ctgaagggga gacatctacg 1380 gcttatgctt taggtactct aaccatatct gtgaatggat tgaatgcgaa attagatggt 1440 gttctatact tgccatctat tcaattaaac ttaatatcta taaaacaatt tgaagattta 1500 tgctacgcaa ttttgatttc cgaaaattta atgtttctag ttcacagtga ccacgaacct 1560 acggtcattg cgaaatattc acctaaagat gacttatact caggcccaag atcgggaacc 1620 tttttttaaa agaattcata atgaccaaac ccattttttg cttgccnctg ctaaaaaact 1680 tttagaatca gagaccatat ttctggagaa tccctgaaaa atccaatgga ttgatcaaga 1740 aaaattagat ccgttgaaaa tgaccaataa agtagaaaga gttacctatg tcagcatacg 1800 caacatcaaa caagaagtgg cagacaaata tatgataaaa gatctttact actatcattt 1860 attaattaat cacctttcac atgaaaaact acaattatta gtaaaaaggg gagtgattaa 1920 accagtcaaa tctacttcgg ctgagtcggc cattttaaat tgtcagatat gtgttgcagc 1980 ccatgcaaaa ttagctagcc ataatcacac tcaacaacgg gaattggagc gaccattaca 2040 acgcctccat ttggataccg ccggaccatt tacctcaaat aaaactaaga gctatcttac 2100 aaccgtgatt gatcaatttt ccagatatac tgaagttatt gtatctgaca ccaaagcagt 2160 caaacaaagc atattgcata gacttagggt ctggaacaat agatttcagt ttaagatcgc 2220 ggagataaga tatgataatg cattggagta tccatcggct gaggagttag aggagttagg 2280 aatttataaa caccttctcc caaactactc tcctatgctt aacggtacag ctgaagcaac 2340 caaccgcccc attgtccaag gtatttataa ggtagtgtta aattttagtt gtcaagtatt 2400 aatacttttc ccatttatag tggagtatgc ggttcatatc cggaatcata cacctataaa 2460 agaatttgat ggtgctactc cttatgaacg ttactatggt ttatctaaat acgtcatacc 2520 attttttcag tttggaaccg acgttttgat aaaatgtgct agtgtacaag aagctatttc 2580 attaaaacta ccatcttcaa gagataaagc ttttcctaca gtgatgtttg gtgcttttct 2640 cggttacggc tcagattcct ttaccttcag agttttagtt tccacgaaag gatatccagt 2700 tattacaaca tcaaacatcc gtccaatagc gacgatgcaa gtactcaatg actatttggc 2760 atacatatcg gagaatagct caataagcta tgacgataca ttcttatcac ctttgaatca 2820 cccaatgatt cgcacaaacc aacatgatag acgtggagac aatataaatg tcgaatatga 2880 aaaccgtcca aatgtaccat ttgaatatca tgctgaacct cctcgtacaa attcatcgac 
2940 gggaattatc gatcgaccag atattagacc tagagctgat cccacctggc aacgtatgcc 3000 tgatgccaac atacatcagg aaacaacaac tgtacagact cctgatcatg gggagttaga 3060 taccatgatc aacaacgaac accaactacc acgatctggg gagggtaatt accccgggca 3120 acaggtgcgc accgatatta ttgggcaatt tcgagatcgc gggcctacca ctctaaacac 3180 tccgatcgat ctaggtgtac ccgatgaaac agacgatatt agtatgacat cagagaatcc 3240 aattgattcc ccaaattccg agatgatcat atccccatct ttacccacaa atgaattgga 3300 acatcaaatc gatatcagtt caggggagat gtcgttattg caaacgaata tggaagcaga 3360 taacgaattg aaaacaaatg aaatggtatt atacaaatca aaaaatgatg gtattatcat 3420 tcaacaacaa caattcactg aaaatttgtc agatgaaaat gaagaagatt catcaacaga 3480 tgaggaaaca ttggaagaca aaaaacaaca gcgattggaa tataatattt caccaaacga 3540 tgagtggata aataatgacg ttcagaacga agatgacaca caagtgccac atgttaagga 3600 accaatcaat tatgaaactc aaagtagaaa tgaaacaaac atgccacgaa ttgaaatggg 3660 cataatagaa aacttaagtg atgatggaaa gaatacacca cgtgaattac gtatggtcac 3720 ctacgataat aataaaaaaa ttcaaaagta ccaaaacagt aatatcgaga tcctggaacc 3780 cagaaacgaa aataaaaacc acacattcat tgaaagcaac ttagaattac ttgacaatca 3840 agaaatgttt caagaagatc ctcaagttga agatattcga ttgacaactc caaaaaagga 3900 caaatcgtta tcacctgatt tcaatcaaac ccataatgaa atacaactat tcatggcaga 3960 tatcaatgaa gatatgctag aagaatatga tgaaaatata aatatgaatg aagtgttagc 4020 tgactccacg gagacgttgg acaaagaatt agatttagat gaagaaagtg gaaggatcga 4080 atatattgct gatagagtta gaaaaaagaa cagaggtact gatggtgcgc cacacgggaa 4140 atatttaaag aaaaatgata aagattttgg ttcaataaaa agtcagaaaa aatctgacgc 4200 acaaatggat gatgaagttg gaattgctat ttcgaagatc agaaactttc catttagatt 4260 gaaggatgga cgagcaagtt tcttccctcc atataaaaca aaatttggaa gatcagtgca 4320 tccacctaaa agatatttaa atgccattgt taagaaaata gattacaatc aaaaagaatg 4380 gcgtcaaagt atggaagaag aaatcgaaaa atttaaggct aaccaagttt acaccgttga 4440 aaaaacacca aagaacgttg tcccattgaa aaccatgtgg gtacatactt acaaaaccaa 4500 tgacctcaaa aatcataatt acaaaagccg ttgcgtggta atgggaaact atatggtcga 4560 aaatcgtgat tttgatcccc atgccatctc ctccccggta gtagatctca caagtatacg 4620 
actattatct gccatagctg ttgaaaataa cttggttatg caccaattgg acatcgcctc 4680 agcttatttg aacgccagtt tggaggatgg aagagtaatc tttgtgagac caccgcgtgg 4740 ttttgaggtt aaacctggct atagttggcg tttacacaag tctgtgtacg gtcttaggca 4800 gagtgcccat aattggtact cacattttaa gaatgtgttg gaggcaaatg gtttaaaaca 4860 aacactacac aatgatggca ttttttggaa aaattatgaa aatggagatg tattatatgt 4920 gagtgtatat gtggatgatg tttttatgat agctactaat gaaaagatta ttaaagagtt 4980 tgttgctatg ctcgaaacat attttcagtt acaatatttt ggcgaggcta cggagtatct 5040 aggaatacag tttagaaaga cacctgatgg atacacgtta gaccaaatac cattcttgga 5100 aaaattggtc gctacattca atattcaaga tagttatgga aaagacattc ctattatccc 5160 aaaagatatt aatgtggtca agcaattgag aaagtcaagt cagataaatg atttcgttaa 5220 gttggagaaa cctcaaaagt taatcaattc ggtgtcaaaa accaagtaca ttgatgatta 5280 cgaagaaaat gaagaacatg attttaaaga atcaccacaa gctgatccct taagcgcaaa 5340 aggaataaaa ttataccaat ccgcagtggg actgttgtta tgggcttcga tgaacactag 5400 gccagactta gcttttagtg tgaatcagct cggagcaaag tgtgctcacc cagatgtcga 5460 cgactggaag agattgatgt attgcttacg gtatattaag aaagatatgg acttcaaact 5520 tgaatacaag agaggaagac tcaacaacaa gtcaaaagat tttattattg aatgtttctc 5580 agatgcctca tttgccccag atttggacag aaagtcaatc accgggacaa gcatttttgt 5640 aaatgggaat ctagttgcat gggcaacaaa gaaacaaaaa ataatcacac atagttcagc 5700 tgcttgcgaa atgcttgcat taaattatac catattgaag gcatttgatt tgagaaatac 5760 cattgaagat ctagagttaa aaataaggaa tttgcatgta catgaggata atcaagcggt 5820 cattacaatc ttaaagaatg ataatttcca cccacataga ccgattgata tatgttacaa 5880 atttctcaga caaaaattga aagatggatt tttttcaata tcatatgttg aatctggaga 5940 taatttagct gactcattca cgaaagcttt aggaagaaat aaattgattg aacataccaa 6000 aaggattaga gaaagaaagg attatgataa taatgctaca ctgatagtgg acgttaggac 6060 gctcgaagag attaagataa acaagaaatt ggtacatcat taattaattt agctgtttac 6120 ctgaatcagg ggag 6134 // ID Gypsy-112_MLP-I repbase; DNA; FNG; 5722 BP. XX AC AECX01000667; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-112_MLP_; KW Gypsy-112_MLP-LTR; Gypsy-112_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5722 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000667; Positions 6135 414. XX CC Positions [2802-3221] - Reverse transcriptase CC Positions [4506-4892] - Integrase core CC 'TTGTG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 424..1485 FT /product="Gypsy-112_MLP-I_1p" FT /translation="MNALEDIQRQLAEITASLAEERILRGQAEARFQQSEA FT CLAAIESQRQSTTSSTPAPTAPAPQAEAPVPKGPKVAAPDRFSGVRGEPAE FT IFASQVQLYMLAHPYLFPNDKTKVVFALSYLTGAASSWAQPLTKELFDEST FT SHLVTFKRFVTNFKAMYFDTEKKAKSERALQSLSQKTSVAAYTHEFKIHAT FT STGWEMPTLISQYEQGLKKEIRVAMVMAQDTFTSIEQIANFAIKIDSKLHG FT AAHTSFHTPSATTNVHDPNAMDLSAAYVRLSEDERAKRMRAGLCFRCNGHG FT HISSDCPDRKNQRRGKGKGGYHSKIAELEVKIAALSNREENQSSSLENSSR FT AETSKNGGAQE" FT CDS 1875..4892 FT /product="Gypsy-112_MLP-I_2p" FT /translation="MPWIKTHSDLIDWQHGTIKCANTHIATVSTVLYCPTK FT PSKGHEMDPMGDARSSYEGMCVSSDTLASPQCEFDTISHKPCSETDNNPIF FT PASVPENPNKDDINRKPLTNGLCTDMTDTDIATVPTVSSIPTQNPNDPKME FT PEGHTRNSDEGANILLEVLMPPQCESAIALSPIHSRTAGKPLHHLNSSYTH FT LDAAKTSWSTSARLAADEKKSAPVKTLEEMIPSYYHQHLHMFKKSNAQRLP FT PRRRYDFQVDLIHGAQPQASRIIPLSPAENAALEEMITTGLENGTIRRTTS FT PWAAPVLFTGKKDGNLCPCFDYRKLNALTTKNKYPLPLTMDLVDSLLDAED FT FTKLDLRNAYGNLRVAEGDEDKLAFICKQGQFAPLTMPFGPTGAPGYFQYF FT IQDILLGRIGKDVAAFLDDIMVYTKSGVNHEEAVDSVLAILAKHNLWLKPE FT KCEFSRKEVKYLGLIISKNKISMDPTKVKAVKDWPAPQNVSELQQFIGFAN FT FYRRFINQFSKTTRPLHNLTKLNTHYLWDEKCEEAFESLKNSFTSAPVLKI FT ADPYKAFTLECDCSDFALGAVLSQRCDEDGELHPVAYLSRSLVQAERNYKI FT 
FDKELLAIIASFKEWRHYLEGNPNRLDVIVYTDHRNLESFMTTKQLTRRQA FT RWAETLGCFDFKIKFRPGRHAAKPDALSRRPDLAPDDEDKLSFGQLLRQEN FT ITPDTFPVELAAVDGFFEDETIHLENADHWFEIDVLGVLEETPEDKTVDQP FT ISTDNEIINQIRVANKKCKRIQELMNIALNPISTKTKIATKNYTVKDGVLY FT NQGKIEVPDNTDIKYMIFKSRHDTLLAGHPGRAKTLSLVRRSFTWPSQKAY FT VNRYVDGCDSCLRVKSSTRKPFGTLEPLPIPAGPWTDISYNLITGLPLSNG FT YDSILTVVDRLTKMSHFLPCRESMTAEDLADLMTKNVWKLHGTPKTIISDR FT GSIFVSQITKELDKRLGIRLHPSTAFHPRTDGQSEIVNKAIEQYLRHFVQY FT " XX SQ Sequence 5722 BP; 1760 A; 1413 C; 1256 G; 1293 T; 0 other; tattgtcgga tctatagacg ggattcgagg actagacttt aattgattta gaacgaacaa 60 gactgatcga aaagaaccga actcgtagat ctagaattca ggaagattag aaaccttata 120 ttgaacttta tcattacccc ggattagatc atattgatca agacccgcag atacagcaag 180 aatcagaaca ccttcgaata tcataccaga taaccccgct caaactacga ttccccgatt 240 cccacgatta cccatcagct aaaaccttaa aagacctcac ccgtaccaaa ccttatcgcc 300 acaacgtctc ctacgtacag aaccccactt gacgacaccg aatctgagaa cgatacagcg 360 attcctttct tcgacgccga ttctgcatct tccagcgccg aatttaattg taccgagcct 420 gagatgaatg cccttgaaga catccaacgc cagctggcgg agatcactgc ttctttagca 480 gaagaacgca tcttacgcgg acaggctgaa gccagattcc agcagtccga ggcttgtctg 540 gctgcgatcg aatcccagag acaatctact acgagtagta ctcctgcacc tacagcccca 600 gctccacaag ctgaagcgcc tgtccctaag ggtcctaaag tggcggcgcc tgacagattc 660 agcggagttc gaggcgaacc ggccgagatc tttgcaagcc aagtccaact ctacatgttg 720 gcgcatcctt atctgttccc taacgataaa accaaggtgg tcttcgctct gtcgtacctc 780 actggcgctg ctagcagctg ggcccaaccc cttaccaaag aactgttcga cgaatccacg 840 tctcacctag ttaccttcaa gcgtttcgtc acgaacttta aggccatgta tttcgatacg 900 gagaaaaagg caaagtcgga acgtgcttta caaagcctgt ctcagaagac cagtgtagcc 960 gcgtacaccc acgagttcaa aatacatgcg acctctacgg gttgggaaat gcctaccttg 1020 ataagccagt acgagcaagg attaaagaaa gaaatccgag tggcaatggt gatggctcaa 1080 gacaccttca cgtctattga gcaaattgca aactttgcca tcaagatcga cagcaaactt 1140 cacggtgctg ctcacacgtc ttttcatacg ccaagcgcca ccactaacgt acacgacccg 1200 aacgctatgg acttatcagc tgcttatgta cgcctgtctg aggatgaacg cgcaaagcgc 1260 atgagagctg 
gattgtgctt taggtgtaat ggtcatggtc acatatcatc tgattgtcct 1320 gatagaaaaa atcaaagaag agggaagggg aaaggtggat accattctaa aatagctgaa 1380 ttagaggtga agatagcggc attgagcaat agagaggaga atcagagtag tagtttggag 1440 aatagtagta gagctgagac gtcaaaaaat ggaggagctc aggaatgaag gttgtgccaa 1500 tcctgagcca actgggggat tcaaggaatg ttagtgtggg ttctagtaga ttgtcaacat 1560 gcaatttaaa tgatccgcgt atctttttac acactttctt gtccacgtcc caaaaccccc 1620 gagccacaac aaacttctcc cacccagcta cgttcctcat cgactctgga gccacacatg 1680 atgtgttgag cgagtcattt gccactaact acggcctcct tgaaggagca gaccccaacg 1740 accaaatgat cacgggattt gacggctctg agagcagatc caccttcaag acacaccttt 1800 tcattgcatc cgacatttaa cccacccctt ttgtcattac acgccttaag gactcttacg 1860 acggcatcct ggggatgcct tggatcaaga ctcattccga cctgatcgac tggcagcacg 1920 gcaccatcaa atgcgccaac acccacattg caaccgtgtc gacggttttg tattgcccga 1980 ccaaaccctc aaaaggccat gaaatggacc ccatggggga cgctaggagc agttacgagg 2040 ggatgtgtgt cagttctgac acattagcat ccccgcaatg tgagttcgac accatttccc 2100 ataaaccttg tagtgaaaca gataacaacc cgattttccc cgctagtgtt ccagaaaacc 2160 ccaacaagga cgacatcaat cgaaaaccgc tcacaaatgg attatgtact gacatgactg 2220 acactgatat tgcgaccgta cccacggttt cgtccattcc aacccagaac cccaacgatc 2280 ccaagatgga gcctgagggg cacactagga acagtgacga gggggctaac atcttattag 2340 aggttttgat gcccccgcaa tgtgagtctg ctattgccct ttcacccatt cattctcgaa 2400 cagctggcaa gcctcttcat catctgaata gttcttacac acacctcgac gccgcgaaga 2460 catcatggtc cacgtccgct cgactagctg ctgacgagaa gaaatccgct ccagtgaaga 2520 ctcttgaaga aatgatcccg tcatattacc accaacactt gcacatgttc aagaaatcta 2580 acgcacaacg gttaccacca cgaagaagat acgattttca agtggacttg atacatgggg 2640 cgcagcctca ggcaagtcgc ataattcctc tatctccagc agaaaatgct gcgctagaag 2700 agatgatcac gacaggactt gagaacggga ctatccgtcg caccacgtct ccatgggctg 2760 cccctgtgct tttcacaggg aagaaagacg gcaatctgtg tccttgcttt gattatagga 2820 aactcaatgc cttaaccaca aaaaacaagt accccctacc cttaactatg gatctggtag 2880 acagcctcct agatgctgaa gacttcacaa aactcgacct gcggaatgcg tacggcaatc 2940 tacgagtggc ggaaggggac 
gaggacaaac ttgcattcat atgcaagcag ggccaattcg 3000 caccgttgac aatgcctttt ggcccaactg gagcgcccgg gtattttcaa tacttcatcc 3060 aagatatatt attgggacgt attggtaaag acgtggccgc gtttctagat gacatcatgg 3120 tctacacaaa atccggagtg aaccacgaag aggcagtaga tagtgtacta gccatcctcg 3180 ccaaacacaa cctatggctc aaaccagaaa aatgtgagtt ctcgaggaaa gaagtcaaat 3240 acttaggact catcatctcg aagaataaga tcagcatgga cccaacaaaa gtcaaagccg 3300 tcaaagactg gccagcccct caaaatgtgt cagaactaca acaattcata ggatttgcga 3360 acttctatcg acgttttata aatcaatttt ctaagactac acgaccattg cacaatctga 3420 ccaaacttaa tactcattac ttatgggatg aaaagtgcga agaggcattc gagagcctca 3480 agaattcttt cacatcagcc ccggttttga agatagctga tccgtataaa gccttcactc 3540 ttgagtgcga ctgttctgac ttcgcactag gtgctgtcct atctcaacgt tgcgacgagg 3600 acggtgagct acacccggtt gcgtacctat cacgatccct tgtccaggct gagcgtaact 3660 acaagatctt tgacaaagag ctactggcaa tcattgcatc tttcaaagaa tggcgtcact 3720 atctagaagg taaccctaat agattggatg tcattgtcta caccgaccat aggaacttgg 3780 aatcttttat gactacgaaa caactcacac gacgtcaagc tcggtgggct gagactttgg 3840 ggtgctttga cttcaagatt aaattccgac caggtcgcca tgcagcaaag ccagacgcac 3900 tatcacgtag acctgattta gcaccggacg acgaagataa actatcattt ggacaactct 3960 tgcggcagga gaatatcaca cctgacacct tccctgtcga gttagccgca gtcgacggat 4020 tctttgaaga tgagacaatt cacctcgaga acgcggatca ttggtttgaa attgatgtgc 4080 taggagttct agaagaaaca ccagaagaca agacagttga tcagcctata agcacagaca 4140 atgaaatcat aaaccagata agagtagcca acaagaagtg caagcgtatt caggaactga 4200 tgaatattgc attgaacccc atctcgacta aaacaaagat agcgacgaag aactacacgg 4260 taaaggatgg agtgctatac aaccaaggaa agatagaagt acccgataac actgacatta 4320 agtacatgat tttcaagagc aggcacgaca cacttttggc aggacacccg ggaagggcca 4380 aaacactgag ccttgtaaga agaagtttca cctggccatc gcagaaagcc tacgtcaaca 4440 gatacgtaga cggttgtgac tcttgtctgc gagtcaaatc aagcacgagg aaaccctttg 4500 gcacattgga acctctccct ataccggctg gcccttggac agacattagc tacaatctta 4560 ttactggtct acccttgtca aacggctatg acagtatcct cactgtagta gataggctaa 4620 ctaaaatgag ccatttcctt ccgtgtagag 
aatccatgac ggcagaagac cttgcggacc 4680 tcatgactaa gaacgtctgg aaactacacg ggacacccaa aacaatcatt tcggacagag 4740 gcagcatctt tgtgtcacag atcacgaagg agctggacaa gcgactggga atcaggttac 4800 acccatccac ggcgtttcat ccaagaacag acggtcaaag tgagattgtc aataaagcta 4860 ttgaacagta cttacgacat tttgtccaat actgacagga cgattgggag tcattactgc 4920 ctacagctga attcgcctat aacaacaggg atcacgagtc gacgggaata tcaccattta 4980 tggctaatta tgggtataac ccaatgttta ataaaatccc gtcagctgac caatgtgtac 5040 cgttggtaga aagaagactt caagcgctag acgatactca aagggaatta agccagtacc 5100 ttgagattgc tcaagaaaag atgaaggtcc aattcgacaa aagcgtcaga gacactccag 5160 attggcgcat aggagatcag gtgtggttga acaggaagaa catctcaact acacggccga 5220 gccctaagtt ggagtacaga tggctaggtc ccttctttat taatgaaaaa gtctcaaatt 5280 ctacttatag attggatctc cccttgtcga tgaaaggcat acatccagtg tttcacgtct 5340 cagtactacg aaaacacagc ccagactcaa taccacaaag acaacaacaa ccaatcaaac 5400 cgattatcat caacaatcaa gaagaatggg aggttaatga aatattagat tgcagaagaa 5460 aattcaataa attggaatat ctcgtcagtt ggaagaactt cagcacacaa gataactcat 5520 gggagccaga aagcaacctc aagaattgtg tggaattagt taaatcattc aattcaaaat 5580 ttccggaagc ggctaacagg tacaagagga ggaaacggag aaagtgagag ggctatgctt 5640 tttcccaccg ggttttttaa tgcagcccgg ggaaagaatg cagagtttgc aagaggaaac 5700 ttgggcatta aagggggaat ag 5722 // ID Gypsy-1_SPDB-I repbase; DNA; FNG; 5068 BP. XX AC ACOE01000078; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Spizellomyces punctatus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_SPDB_; KW Gypsy-1_SPDB-LTR; Gypsy-1_SPDB-I. XX OS Spizellomyces punctatus OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; OC Spizellomycetales; Spizellomycetaceae; Spizellomyces. XX RN [1] RP 1-5068 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Spizellomyces punctatus genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; ACOE01000078; Positions 77700 82767. 
XX CC Positions [2762-3277] - Integrase core CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 143..5008 FT /product="Gypsy-1_SPDB-I_1p" FT /translation="MKPRMMPVKRSPTRGHSPVIQTRSQGPAEDVEVPPAT FT RPRRSGTTSVEPEAPNLATMASEVAALKEDIARSTAAMQLATQNLTEQLHR FT LSECQTAADERLARALEALAHRPLPEQPLIARATRERSATPWETSDTPRER FT TEPSFSGPVPPIQVPAAPWASNTKVDSLKAKDVPKFTGKRNQDIEEWLNKV FT QLLQNISATSDARMVQLLPAMIAEKTPADAWFYTLEAQETCSWTWKRWKEE FT LLREFRGPLTESKCKTEYVHCHANPKEFRSFLAFIDHKTHLRRLAYGDRAN FT IECPTDQHFDLIVEQLPSDIQIMVKSCNITNLTKLKDFLRLYFGSDDTRKR FT DPCYSGAAEAQKKSQKTGTDKGPKQMAQHKTGNPKKSRDGPQKMVPDPNTE FT PPAPCKFCQKKHWNVFCWQNPNRKGFYAQFISRYAELVRPLQDAIQEGVRE FT YQDLKKGRADPKKGAEKLNSPSMLDMAVAPTITATGARFEVSQHPKSSSSD FT LPAVNLRQSSLADLPIDPFCVGRLARKRPDQGLRRFIAKREVSWTDSRNSA FT FAAAKQALTTAPVLAAPRKGFPYTLHSDASYTAFNVTLEQLQPDASKCPLC FT YHSRGTTTSEKKAPPVELEAGCLIWGLGKLRPYLEGLPFTVITDHRALLWL FT AKYKGENRKLLRWALDLSYWSEWMTIVHKPGRFHIVADTLSRLLGAETLLC FT STATQIWVSKCGRRRLRTTPAALAPSSTLADAVFRLALQPALIAAIRAGLT FT GQLHPIAQQACMLLWETSPDTYTTADARFLQDWQRADNLLWHRFSVSKDLW FT HLYVPPACTPKVRKLFHEGGGHFGFERTYARAAAQVWWPKMAQDFRKHCAG FT CETCQRSKRVPQLIPGLLKPIPPAPGRWTVITTDFVTKLPLTSRGYDAFWL FT IACKCSKRIHVWPLKSTATATQVVELFIERYVPLHGIPETIISDRDPKFTS FT RFWQEVCKNLKINQELTTAYHPQGDGQSERDNATVLQVLTTMVNTRLDDWD FT TFLLVAEFIINDTVHEATKMMRFKMDLGYEPHNPLADIVERLSETRNADVE FT ELCKHLRDLNILIRDLMIEAQDSQKRYYDLSRRHLEFRRGDKVLLSTKNLA FT GYESKIAPKWLGPVTVTDCNPLHDNYELKLPDKMRRLHPWFHISKLRPYQK FT PVDAETEERIVEAQMEDSDASKGEFEVEAIVDHRTRRGKHQFRIRWAGYPP FT HEDTWEPEEALTEASEALEEYWRSRAPTSALVEDSLVVSFAGDTLAYASDH FT QCFTIYPDNSSRDNHTLSPERKPVLRGLPHQSRTPHQASRSCLCASELSKS FT AGDDLKRTPLHTPPLFSSSTTYDLSPLQTYSALQTLSMDTETKAFVSNTMT FT TDIRGQQGVSILSTLEEGYPWSEVSDAFAAENSIERLGAAMATWVEVWIRE FT LFEDPNKPLEHYRGEDWVVDGASKPYATSQPLTMHIAEPYSRASDGSQAAV FT GIIADTKKIGKRLRDVPCTAWLDAGVPQHAAGNPKTARFTLWPDREAHDFA FT DIAIEAARAVLLKLANAAETTEEQLRPKHREHAYYHFRAVNLKRAAETFTA FT LYVTRANGSRTPGTPAPEATDQIWFCANVMGRATIIWTINEDSLEADDDYP FT VASW" XX SQ Sequence 5068 BP; 1365 A; 1514 C; 1196 G; 993 T; 0 other; 
tggtggagaa gcgcgacaaa ctgccttccc ctctcaaatt cagctgaaaa ccttattcgg 60 tcataccgaa actcttgcgc aaactgagat ctgctcaggt agctactcgg aaaccttttc 120 tcttgtttcg gaaacagact tcatgaaacc gagaatgatg ccggtcaaac gctctccaac 180 gcgaggccac tcacctgtga tccagactcg tagtcagggt cctgcggagg acgtcgaagt 240 ccctcccgca acacgcccaa ggcgttcagg gaccacttcc gtcgaaccgg aagcccctaa 300 cttggccacg atggcctctg aagtggcagc gttaaaagaa gacattgccc gctcgactgc 360 tgcaatgcag ttggcaactc aaaatcttac ggaacagctt cataggcttt ccgaatgtca 420 gacagcggcg gatgagcgcc tcgcacgagc cttagaagcc ctagcacatc gcccactgcc 480 tgaacaacct ctcatagcac gagctacaag agaacggtcc gcaaccccgt gggaaacgtc 540 ggacacacca agggagcgta ccgaaccaag tttttcaggt ccggtgcctc caatccaagt 600 gccagcggcc ccgtgggcca gcaacaccaa ggtcgattcc ttgaaagcaa aggacgtacc 660 caagtttacg ggcaaaagaa accaagacat tgaagaatgg ctcaacaaag tccaacttct 720 tcaaaacatt agcgcaacat cggatgccag gatggtacag ctcctaccag ccatgatagc 780 cgagaaaacc cctgccgacg cttggtttta cactctggaa gcccaggaaa cttgctcgtg 840 gacctggaaa aggtggaaag aagaactgct tcgagagttc cgaggtcccc taaccgagtc 900 caagtgcaaa accgaatatg tgcactgcca cgcgaacccc aaagaatttc gcagcttcct 960 cgcattcatt gaccacaaaa ctcatctccg gcgcctcgca tacggcgacc gggcaaacat 1020 agaatgtccc actgaccagc atttcgacct aatagtcgaa caattgccca gcgacatcca 1080 aatcatggtc aaatcctgta acatcacgaa cctgaccaaa ttaaaagact tcctgcgcct 1140 ttatttcggc tctgatgaca ccagaaaacg agacccttgc tacagcggtg ctgctgaagc 1200 tcagaagaag tcacaaaaga ccggaaccga caaaggccct aagcagatgg ctcaacataa 1260 aaccggaaac cccaaaaaga gtcgcgacgg ccctcagaag atggttccgg accccaacac 1320 ggaaccaccg gcaccctgca aattctgtca gaaaaagcat tggaatgttt tctgctggca 1380 gaacccgaac cgaaaagggt tctacgccca attcatttct cgctacgcgg aactcgtccg 1440 acccctgcaa gatgctatcc aagagggcgt tcgcgaatac caagacctta aaaagggtcg 1500 cgctgacccg aagaaaggcg ctgaaaagct caattcacca tccatgcttg acatggcggt 1560 ggcccctaca ataacagcca ccggagcccg atttgaggtt tcgcaacatc cgaaatctag 1620 ctcctcagac ctaccggcgg tcaatctacg ccaatcatcg ctggcggact tacccattga 1680 ccccttttgt gtcggacgcc 
ttgctcgcaa gcgccccgac caaggtctgc gcagatttat 1740 cgcgaagcgt gaagtttcat ggaccgattc ccgaaactct gcctttgcag ccgccaagca 1800 ggctctgaca acagctccag tgctcgcggc acccagaaaa gggttcccct acacccttca 1860 cagtgatgct tcctacacag cgtttaatgt cacgcttgaa cagttacagc cggatgcttc 1920 taaatgcccg ctgtgctacc attcccgagg caccaccacc tcagaaaaga aggccccccc 1980 cgtcgaactc gaggccggat gcctcatttg gggcctcgga aaactccgcc catacctcga 2040 aggcctgcct ttcacggtca ttactgatca ccgagctctt ttgtggctcg ctaaatacaa 2100 aggcgagaac cgtaagcttc tccgctgggc cctcgatctt agttactggt ccgaatggat 2160 gacaatcgtt cacaaaccag gacgttttca cattgtcgct gatacacttt cacggctact 2220 aggagcggaa accctactct gctccaccgc gacccaaata tgggtgtcga aatgcggaag 2280 acgacggctg cgcaccacac cagctgctct cgcaccttcc tcgacactgg cagatgccgt 2340 ctttcgtctg gccttacagc ctgcactcat tgccgccatc cgcgctggcc tcaccggcca 2400 gctccacccc atagcccaac aggcctgcat gctcctatgg gagaccagcc ccgacaccta 2460 cactacagcc gacgcgcgct ttcttcagga ctggcaacgt gcagataacc tgctgtggca 2520 ccgtttttca gtctccaaag acctttggca cctctacgtg ccccctgctt gcacccccaa 2580 ggtgcgcaag ctttttcacg aggggggagg acactttgga ttcgaaagaa cctatgctag 2640 agctgcagca caagtctggt ggccgaaaat ggcacaagat ttcagaaaac attgcgcagg 2700 atgtgaaacc tgccaacgct caaaaagggt accgcaactg attccggggt tgttgaaacc 2760 cataccacca gctccaggac gctggacagt catcacgact gacttcgtaa ccaagctccc 2820 acttacttcc agagggtacg atgccttttg gctaatagct tgtaaatgct caaagaggat 2880 ccacgtatgg cccctcaaga gcactgcgac agctactcag gtcgtggaac tatttataga 2940 acgctacgtc ccattgcacg gcatcccgga aactatcatc tcagaccgcg accccaaatt 3000 tacgagtcga ttttggcagg aggtctgcaa aaacctaaag attaatcaag agctcaccac 3060 agcctaccac cctcaaggtg atggacaatc agagagagac aacgcaacgg tcttgcaggt 3120 ccttaccaca atggtcaaca cccgattaga cgactgggac acctttcttc tggttgcaga 3180 attcattatc aatgatacgg tacacgaggc gacaaaaatg atgcggttca agatggacct 3240 aggctatgag ccccacaatc cactggcgga catcgttgaa aggctgtcag aaaccagaaa 3300 tgcagatgtg gaagagctct gtaaacatct cagagaccta aacatcctta tccgggatct 3360 gatgatcgag gcccaggact cgcagaaaag 
atattacgat ttatcgcgac gacatctgga 3420 gttccggaga ggagataagg tgcttctgag cacaaaaaac ctagcagggt atgagtccaa 3480 gatcgctccg aaatggctgg gaccagtcac ggtgaccgac tgcaaccctc tccacgacaa 3540 ctatgagcta aaactgccag acaagatgag acgacttcac ccatggtttc acatatcaaa 3600 attacgtccg taccagaagc ctgtcgacgc cgaaactgag gaaaggatcg tggaagcaca 3660 aatggaggat tccgacgctt caaaaggcga gttcgaagtg gaagcgatag tagaccatcg 3720 aaccaggcgc ggaaagcatc aatttcgaat acgttgggcc ggatacccac cccatgagga 3780 tacctgggaa ccagaggaag ccctaacaga ggcttccgaa gccttagaag agtactggag 3840 atccagagct cccactagcg cactagtaga ggactccttg gtagtcagct ttgcaggcga 3900 caccctggct tacgcttctg accaccaatg cttcacgatt tatccggaca atagctcacg 3960 tgataatcac accctctcac cagaaagaaa accagttttg cgcggcttac ctcaccagtc 4020 gagaacacca caccaagcct cccgatcatg cttatgtgca tctgagctgt caaaatcagc 4080 cggcgatgac cttaaaagga cgcccctgca cactccccct cttttttctt cttcaactac 4140 ctatgaccta tctcccttac agacctactc tgctttgcaa accttgtcta tggacaccga 4200 gacaaaagct ttcgtttcca acaccatgac caccgatatc cgaggacaac aaggagtcag 4260 catcctctct acccttgagg aaggctatcc ctggtcagaa gtaagcgacg ccttcgctgc 4320 cgagaactct atcgagcggc tcggagcagc tatggccacc tgggtcgaag tttggattcg 4380 tgagctgttc gaagacccaa acaaacccct cgaacactat cgtggtgaag attgggtggt 4440 tgacggggct tcgaagccgt acgccacgag ccagcccctc acgatgcata ttgcggagcc 4500 ctacagtagg gcttccgacg gcagtcaagc agctgtcggt atcatcgccg acacaaaaaa 4560 gattgggaag agattacggg acgtaccctg caccgcctgg cttgacgctg gggtccctca 4620 acatgcggcc ggaaacccca aaacggcccg cttcaccctc tggcccgatc gcgaggccca 4680 cgatttcgcg gacatagcca tcgaagccgc gagagcagtg ttgttaaagc tagccaacgc 4740 agctgaaacc acagaagagc agctccgtcc caaacatcgg gagcacgcct attaccactt 4800 tcgagcggtt aacttgaaaa gggccgcaga gacgttcaca gccctctacg taacccgagc 4860 caacggttcc aggacacctg gaaccccggc tccggaagcc actgaccaaa tttggttctg 4920 cgcgaacgtc atggggcgcg ctaccataat ctggacgatc aacgaagaca gccttgaagc 4980 tgacgacgac tatccagttg cgtcatggta gaagcgaggt tcgggcactc acagacgagt 5040 cgccgaacct tctttttcga agggggga 5068 // ID 
Gypsy-1_PCR-LTR repbase; DNA; FNG; 636 BP. XX AC AADS01000368; XX DT 30-JAN-2011 (Rel. 16.02, Created) DT 30-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Phanerochaete chrysosporium genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_PCR_; KW Gypsy-1_PCR-I; Gypsy-1_PCR-LTR. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-636 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Phanerochaete chrysosporium RT genome."; RL Direct Submission to RU (30-JAN-2011). XX DR Genome; AADS01000368; Positions 667 32. XX SQ Sequence 636 BP; 164 A; 118 C; 159 G; 195 T; 0 other; tgggataggc cagaggccca ggctttagac gacaaatacc atgacaaatg ctcaccaagc 60 attgtacttc aggcatgccc tggaaggtaa aagaagagag tagaatgcca tggaaagttc 120 tggaaaagct tggaaagcga taccctgctg gaagccaggg agatggaaag caccagaata 180 tgatggaatt tgagagaata gcgtagaata ctctagaata gcttagtcat acactaattt 240 cacatctcag gaaaggtata taaacccccc tgaatgtgcc aggaaataca aggcccacgt 300 tactagctct ttctttagta ctgtgatcac cttcacgtgt tttgtgttta cgttcaacgt 360 tgtttacggt gtcagcactc taagcttggt taaatcccaa gtagagtgtg gtgtctggta 420 ttctagtctg gctacacgtt gcccagtgtc gaatacctgt gctgttgtct tacgttgttc 480 ggtgttgtgt tctaagcttg gttaaatccc aagtagagca ctgttgttgt tgtttggtat 540 tacgctctaa gcttggttaa atcccaggta gagtgtagtt gttgttgttg ttgttgttgt 600 tgtttggtgt tatgctctag gcttggttaa atccca 636 // ID Gypsy-100_MLP-I repbase; DNA; FNG; 6449 BP. XX AC AECX01000471; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-100_MLP_; KW Gypsy-100_MLP-LTR; Gypsy-100_MLP-I. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6449 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000471; Positions 7601 1153. XX CC Positions [4550-5029] - Integrase core CC 'CTGTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS join(338..4453,4457..5614) FT /product="Gypsy-100_MLP-I_1p" FT /translation="MDPWNPRTGISDISSLFDMESVRPDPPRPESRSEVQR FT LQDQVHQLQIQLSNTQIQSLMSQAENLNQSHPQVSIDDLTHGVNGLNIQRD FT LPPHQQIPLDQIPLNHPNHPPPPQFFNPSFPLHNSPFNNVPPPPLFTQSFQ FT MPHVNVQVPQEHSRFAPPPIHIHQPQVFQPQPQYEPRQRTPMQHHAQYAQG FT PPMPPRPPIRTPLPPSAWSNSSQAIPTPRFRNIRFSGTASHLKAFFQDFYQ FT EMREHEVFFASKDDTYKINWILTLFEPRSTASNWFNSLLEQNAAESGVFDG FT FNDLKGLGYRLAPLSSFSSFLQELRIIFSDKNEDRTNRKNLDNCRQGSHTI FT IDYNSRFRSLVIHVSISPEDAILRYVKGLDPDIYAEAVRLQGWIGSKVLQE FT KMSLAVQAAEIVQELSELPPTHSSYIKKTHKPDSQNIHHFRPTQVQVNRQD FT QAVPMDVDAVNLGSTKKKPSIFFQIMDACRERKLCAKCLKNYDLQGSHLKG FT VCPNQNKTLQEKLNFLNIKPISQVQVEQPQEDSNEVDDFNINSIAFVNSSQ FT EQTLNVNAFIAEYYEEMNQVMYPEILQDHFDISSIRINTSQTANGKFIVPF FT VLSNHGAPIAVTALIDTGAMADLISQDLVDKHQIKTMARAPMRCFSFDGSA FT GASGVIDQEWCGTLLATSNSSSPVSLSAKLGVTSIHGQDIILGLPWMKSVQ FT ASAICHEKGVYLAIGGVVVAAMTDSSVSLVSCESSEVASVSVPSSFPTATV FT QDSSHSTSSQIYSKHQTDSPISLPSSIPSDKSLPLNLPLSCHKYSHVFFPQ FT DFVLPPHRSFDCAIDLKPDCEPPFGGLYNLAQSEQIELREYLNEMLEKGFI FT RPSSSSAAAPIFFVKVPGKKNRPCVDYSGLNKITKRDSYPIPVMSWLLNQL FT KGCTVYSKIDLKSAFNLIRIKKGDEWKTAFRTPWGLFEFTVMPFGLANAPA FT TFQRFIQWVLREFLDISCFVFIDDILIFSKSEEEHTRHIEQILSKLSEHQL FT TASAAKCEFFVKEVVFLGFVISTDGLRMDPKKLNTLADWPFPSDLADLRRF FT LGFSNFYRQFIPAFSQIVAPLTDLTTVNSDVKKGLTKAGARDAFNLLRRLF FT AESPFLLHFDFSRKRVIQVDSSGYAFSGILSQYSDQGELRPVAYCSKKLTP FT SERNWQVHDQELGAIIACFQEWRAWLLGSEEPVAVLSDHANLQYFMNSQEL FT TPRQARWASFLGQFHFTILHTPGKINPADPASRRSDLTDGKVSTRRVVLLG FT 
FRDLPGVSIEAVTLRKDAVMGQSNPSNFMPASSETQSWLQTLYSTDALIQG FT RRPAQLSYQHNLWWWRDRIYVPLEARVRVMREMHEHPSAGHWGVMKTLDLL FT SRTFSWPNVEELLIFLKSCISCQRVKVDRRPPQGELVPLPIPERPWSTIGV FT DFIVKLPLSNGFDSIMVVVDHMGKGAHFIPAKESWNAEELAVTFVANVFRY FT HGLPDVVVSDRGTTFVSKFWTAVLRLLNVVPAPSTAFHPQTDGQVERVNAL FT VEDYLRHFVANDQSDWSAWLPMAEFAYNNSPSASTGFSPFFVYTGSHPRFN FT SLVTTSGVPRADDFFQHMQSIHHSLKVNLEKSKASQACFYNGSRRISATYA FT AGDLVWLSRRNLKTLRPSNKLDVRRIGPFPVVRMVGKNAAELSLPPALKRL FT HPVFNVSLLSPFFGESFAPPPDLEEFSQEEHDLGTVAFIINYRMTDQGIHE FT YLLCGGDTSGLDDVWTPLTTISSALDPWLRRFHVLCQP" XX SQ Sequence 6449 BP; 1702 A; 1264 C; 1324 G; 2159 T; 0 other; ttttaatttc gatcctcaaa ttcacaagtc ataagatatt ttttgttaat ttttccttta 60 taattttatg tttgtacttg atttttctca aggttatcat tatcaaaatt taaataattc 120 atttatttca agtgaagaaa atttagatta ttttaaagag ttaccgtatt tcgattgttc 180 tgaagaatat tttattggtc agaatttcaa ttcccatctt atagggataa atttgatttg 240 tatagaattt attgaagctt ttcatttctc acgtcatcgt ttagaagaat tctcgcctga 300 tacatcagtt gattcggatt cagatattga agattgaatg gacccttgga accccagaac 360 tggaatttcc gatatttctt ctttattcga catggaaagt gttcgtccag acccaccgag 420 acctgaatca agatcagaag ttcaaaggtt gcaagatcaa gttcatcaac ttcaaattca 480 gttatcaaat actcaaattc aatcgttaat gagtcaagct gaaaatctta atcaaagtca 540 tcctcaagtt tcaattgatg atttaactca cggtgttaat ggtttgaata ttcaaaggga 600 tttacctcct catcaacaaa ttcctttgga tcaaattccc ttgaatcatc ctaatcatcc 660 tccgcctcct caatttttta atccttcttt tcctcttcat aattcacctt ttaataatgt 720 tccgcctcct cctttgttca ctcaatcgtt tcaaatgcca catgtcaatg ttcaagttcc 780 tcaagaacat tctagatttg ctccgcctcc tattcatatt caccagcctc aagtttttca 840 acctcagcca cagtatgaac caagacaacg aacgccaatg caacatcatg ctcagtatgc 900 gcaaggtcct ccaatgccac cgcgtccacc gattcgtact cctttacctc cgagtgcttg 960 gtcaaattcg tcacaagcta ttccgactcc tagattcagg aatatacgtt ttagtggaac 1020 ggcgtctcat ttgaaggctt tctttcaaga tttttatcaa gaaatgcgtg aacatgaagt 1080 tttttttgca agtaaggatg atacttacaa aattaattgg attttgactt tgtttgaacc 1140 gagatcaaca gcgagtaatt ggttcaattc attattggaa cagaatgcag cagaatctgg 
1200 tgtttttgat ggttttaacg atttgaaagg attaggatat agattggctc ccttgagtag 1260 tttttcttcg tttttgcaag aattacgtat catcttttcc gacaaaaatg aagacagaac 1320 gaatagaaag aatttagata attgtagaca aggttctcat actataattg attataattc 1380 tagattcaga tcgttggtta ttcatgttag tatttcgcca gaggatgcaa ttcttagata 1440 tgttaaaggt ttggatcccg acatttatgc tgaagctgtt cgtttacaag gatggattgg 1500 tagtaaggtt ttacaagaga agatgagttt agctgtacaa gcggcagaga tagttcaaga 1560 gttatcagaa ttacctccaa ctcatagttc gtacatcaag aaaactcaca aaccggattc 1620 tcaaaatatt catcatttta gaccaactca agttcaagta aatcgtcagg atcaagcagt 1680 tcccatggat gttgatgcag taaatttagg atcaactaag aagaaaccgt caattttttt 1740 tcaaattatg gatgcttgta gagaaaggaa gctttgtgcg aaatgtttga agaattatga 1800 tttgcaaggt tcgcatttga aaggagtttg tccgaatcaa aacaagactt tgcaagaaaa 1860 gcttaatttt ttgaacatta aaccgatttc tcaagttcaa gtcgagcagc cacaagaaga 1920 ttcaaatgaa gttgatgatt tcaacataaa ttcaatcgct ttcgttaatt catcgcaaga 1980 acaaaccttg aatgtcaatg ctttcatcgc tgaatattat gaagaaatga atcaagtgat 2040 gtatccggaa attttgcaag atcattttga catcagctct attagaatca ataccagtca 2100 aacagcaaat ggaaagttca ttgttccttt tgttttgtct aatcatggag ctcctattgc 2160 ggtcacagct ttgatagata cgggtgctat ggctgatctc attagtcagg atctggttga 2220 caagcatcaa attaaaacta tggcacgtgc tcctatgcgt tgctttagtt ttgatggatc 2280 ggcaggtgca agtggcgtaa ttgatcaaga atggtgcggt actttgttag cgacttcaaa 2340 ttcttcttca cctgtttcgt tatcggccaa gttgggtgtt acctcgattc atggtcaaga 2400 catcatctta ggtctccctt ggatgaagag tgtccaggct tcggctatct gtcatgagaa 2460 gggtgtttat ttggctattg gtggtgtcgt tgtggcagcc atgactgatt cgtctgtttc 2520 tttagtttct tgtgagtcaa gtgaagttgc atctgtttct gttccttctt cttttcccac 2580 agccacagta caagattctt cccattcaac ttcttctcaa atttattcga aacatcaaac 2640 ggactctcct atttctttgc catcttccat tccttcggac aagtctcttc ccttgaatct 2700 tccgttaagt tgccacaagt actcacatgt tttctttccg caggattttg tacttcctcc 2760 acatcgctcg tttgattgtg caattgattt aaaacccgat tgtgaacctc cttttggtgg 2820 actttataat cttgcccagt ctgaacaaat tgagttacgc gagtatctta atgagatgtt 2880 
agagaaaggt ttcataagac cttcttcttc ttcagctgca gcccctattt tttttgttaa 2940 ggttcctggt aaaaagaata gaccctgtgt tgattatagc ggtttgaaca aaataactaa 3000 gagagatagt tatccaattc cggtgatgtc gtggttgttg aaccagctga aaggttgtac 3060 tgtttattcc aagattgatt taaagtctgc ttttaatttg attagaataa agaagggcga 3120 cgaatggaaa acggcttttc gtactccttg gggattgttc gagttcaccg taatgccttt 3180 tggattagcg aatgcaccag ctacgtttca gcgtttcatt caatgggtct tacgtgaatt 3240 tttagatatc tcttgttttg tttttataga tgatattcta attttttcga aatcggaaga 3300 agagcatact cgtcacattg aacaaatttt gtcaaaatta tcggaacatc aattgacggc 3360 ttctgcagcg aaatgcgaat tttttgtgaa agaagtggtg tttttaggct ttgtcatttc 3420 aactgatggc ttgaggatgg atcctaagaa attgaacact ttggctgatt ggccatttcc 3480 ttctgactta gctgatttga gaagatttct aggcttttcg aatttttatc ggcaattcat 3540 accagctttt tctcaaatag ttgctccttt aaccgatttg acgactgtta attccgacgt 3600 gaaaaagggt ttgactaagg ctggagcaag agatgccttt aatctgttga gacgtctatt 3660 tgctgagtcg cctttccttc ttcattttga tttttcaagg aagcgtgtca ttcaagttga 3720 ttcttctggt tatgcctttt cgggtatttt gtctcaatac tctgatcaag gagaattgcg 3780 acctgttgct tattgctcga agaagctcac tccatcagaa aggaattggc aggtacatga 3840 ccaggagctt ggagcaatca ttgcgtgttt tcaagagtgg agagcttggc ttttgggctc 3900 tgaggaacca gtcgcggttt tgtctgatca tgcgaatttg caatacttta tgaattcgca 3960 agaattgact cctcgacagg cgcgttgggc ttctttcctg ggacagttcc attttactat 4020 tcttcataca cctgggaaga ttaatcctgc tgatcctgct tcgagacgtt cggatttgac 4080 tgacggcaag gtgtctactc gacgagttgt tcttttgggt tttcgtgatt taccaggagt 4140 ttcaatcgag gcagttacct tgcgcaagga tgcggtaatg ggtcaatcca atccgtcgaa 4200 tttcatgccg gcttcttctg agactcagtc ttggttacag actttgtatt cgactgatgc 4260 tttgatccag ggccgacgtc ctgcgcagtt atcttaccag cacaacttat ggtggtggag 4320 agatcgaatc tatgtgcctc tggaagcacg agttcgggtc atgagggaga tgcatgaaca 4380 tccttcggct ggtcattggg gagtaatgaa aactttggat ttgctttctc gtacctttag 4440 ttggcctaat gtctgagaag aattattgat ttttttgaaa tcctgtatta gttgccaacg 4500 tgttaaggtg gatcgtcgac cgccgcaggg tgaattggtt cctttaccta ttcctgagcg 4560 accatggtcg 
actattggtg tagattttat tgtgaagctt cctttgtcca acggtttcga 4620 ttctatcatg gtagtcgtgg atcatatggg gaagggagcg cattttattc cggcaaagga 4680 gtcgtggaat gcggaggaat tggcagttac atttgttgct aatgtgtttc gctatcatgg 4740 ccttccggat gttgtagtat cagacagagg aacgactttt gtttcaaaat tttggacggc 4800 agttcttcgt ttactgaatg tggttcctgc tccttctacg gcttttcacc cacagactga 4860 cgggcaagtt gaacgggtta acgcactggt tgaagactat ttgcgccatt ttgtagctaa 4920 cgaccagtct gattggtcgg cttggttgcc aatggcggaa tttgcctata ataattcgcc 4980 gtctgcgtca acgggattct ctccattttt cgtttatacg ggcagtcatc ctcgctttaa 5040 ttctcttgtt acgacttctg gggtaccgcg cgctgacgat ttttttcaac acatgcagtc 5100 catccatcac tcgttgaaag ttaatttgga aaagtcaaaa gcttctcagg catgttttta 5160 caatggttcg cgtcgtatct cggcgactta tgcagcaggg gatttggttt ggctctcaag 5220 acgtaattta aagactttac ggccttcaaa caagttggat gtgcgacgga ttggcccatt 5280 tcctgtggtc aggatggttg gcaaaaacgc tgctgagttg tctttgcctc ctgctttgaa 5340 gcgtttgcac cctgtcttta acgtgtcttt gttgtctcct ttctttggtg aatcttttgc 5400 gcctccacca gatttggagg agttttcaca ggaggaacat gatttgggaa cggtggcttt 5460 tatcatcaat tatcgcatga cggaccaagg cattcacgaa tatctgcttt gtggcgggga 5520 tacttctggt ttagatgatg tttggactcc cttgacaact atttcttctg cgttggatcc 5580 gtggttgcgt cggtttcatg ttttgtgtca gccctgagcc ctgtgtacag gaccacggac 5640 taacaaacac acatgtaaac acatgtaata catatgtaaa taatatgtaa atatatgaaa 5700 cttgtttaaa accataagaa ttacaactca tggctcagtt tcaagagtca cttaagtaac 5760 aagactttta acctcatggt aaaacctaac ccaaacctaa ctcaaaagaa aagcaaacct 5820 aagtgccaac cacaagttga cgcctaggtc tggaatatat aaacccctct tcttcccact 5880 ctggaagaag aagaacttca tatcattaca caactcatca tcacaactta cttgattctt 5940 caccaagaat attgagtaca gtgatattgt tgtactttga ttgtcatagt tcttttctgt 6000 ctttgttcag gtcattgtaa aacctcaagt agtgttctga gcttgtggga ccaatagcta 6060 acagaacaag cttggaattt acaaagggca cctcattaga gagtctaaag tactcagcgg 6120 cacctctgat tcttcagagt ctaaagtact gcaggcttgg tagttgtaag atcaacccaa 6180 gctcagagta ggatccttag aaggactact tagccatccc actggccaca aatcctctca 6240 ttagaggtca actaactgta 
gtcttcactt gggtgtaaca agtgaaggct tcatcgttga 6300 ctactcttta cttttcttag tgaagagtta gcccaacagt tgctgactcc tcacattttg 6360 tctcccggct tgggttcagg tcctcctcaa gaggtgtgga aagctcgttc agaaagtttt 6420 tagtttggcg tacatggtag atcatattt 6449 // ID Gypsy-2_SPDB-LTR repbase; DNA; FNG; 554 BP. XX AC ACOE01000170; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Spizellomyces punctatus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_SPDB_; KW Gypsy-2_SPDB-I; Gypsy-2_SPDB-LTR. XX OS Spizellomyces punctatus OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; OC Spizellomycetales; Spizellomycetaceae; Spizellomyces. XX RN [1] RP 1-554 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Spizellomyces punctatus genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; ACOE01000170; Positions 109862 110415. XX SQ Sequence 554 BP; 156 A; 95 C; 125 G; 178 T; 0 other; tgtgacagaa ttcttctgtc agtgctatgg ttatactggt tagatgtggt tccaaactag 60 tggtcacatg actttgctga tggtgtcagc attcatttga ccagaattag ataaataagc 120 agatttagcc atttctggct aaattagatg agtatcctga cacactagca tctgcccctt 180 ttaatcaagt ggcacagtgt atttaggatg acttctctca cttgtgattg gccataaatg 240 gtgcaatagg ctgacaaata gtttaggatg acatgtacaa gatctgattg gttgcaaatg 300 gtgcaatagg ctgacaaata gtttaggatg acatgtatga gttctgattg gttgcaaatg 360 gtgcaatagg ctgacaaata gtttaggatg acatgtatga gttctgattg ggtagttagg 420 atatagcatt tgtcagcccc ttttgtctat aaaaagagga cctctctgag gcccctcaaa 480 ggaaaaaaat aacaagtagt aaccaagtct tccctagttt tctgttgtct tcttgtccct 540 gctagcctgt taca 554 // ID NU_CA_LTR repbase; DNA; FNG; 277 BP. XX AC AF119344; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Candida albicans retrotransposon NU_CA_LTR, long terminal repeat. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW NU_CA_LTR; retrotransposon. 
XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-277 RA Goodwin J.T. and Poulter T.R.; RT "Multiple LTR-retrotransposon families in the asexual yeast RT Candida albicans."; RL Genome Res 10(2), 174-191 (2000). XX DR Genbank; AF119344; Positions 1 277. XX SQ Sequence 277 BP; 109 A; 31 C; 51 G; 86 T; 0 other; tgaggttcct tcttatatcc tttttggcaa gtaaatgtgt cgtgctttga tatattagaa 60 agacaatcca ttaatagatg aaatatatat tgatgatgaa aaaagtattg gttgttcaaa 120 atgaaagatc aatataaaaa ttcggagaga aacgtgatgt ttatagagta aaaaattgag 180 ctgataactt cgcaaccaat tctgaacaag catagtttgc aaatatgaat acatcctaga 240 aaaagtgtaa tctatgagga aatatgcagg atattca 277 // ID Gypsy-40_MLP-LTR repbase; DNA; FNG; 169 BP. XX AC AECX01001084; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-40_MLP_; KW Gypsy-40_MLP-I; Gypsy-40_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-169 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001084; Positions 35937 36105. XX SQ Sequence 169 BP; 40 A; 50 C; 28 G; 51 T; 0 other; tgttatgagc catatacaag gctatctaca ttatgcttgt actgagactc gtgctctctc 60 atctgctggc tcgaggatgg cctgtgcatc ctcacgctac aatctcaata ctagtactga 120 agtaccttta gcatcttcac attcccttcc accaccacga gtccttaca 169 // ID Gypsy-23_LBS-I repbase; DNA; FNG; 5346 BP. XX AC ABFE01002024; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_LBS_; KW Gypsy-23_LBS-LTR; Gypsy-23_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-5346 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01002024; Positions 1166 6511. XX CC Positions [4070-4564] - Integrase core CC 'AGTAC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 485..5200 FT /product="Gypsy-23_LBS-I_1p" FT /translation="MSTLASVEPTSGKAPVLTAGDLTPAVAMDFENAAQDF FT FVTKSVPLDKQVALILPGIKDIRIRDWITADRARITTLSFVEFIKELRANY FT LQSDWEDQIRNQILTSTLTSSHKSFWNWSQHLLSLNCLLRNTPSALDDIAL FT HNHLEAHLDDELKEKVKHSEARNDKVFKTWVAAVRVLDEARTTENKRQIDL FT IEGALQRQAKRQATDANALRGPSRRNNSAASTSTSTTSNRLAPLTEGERTL FT LNEHDGCTKCRRFYVGHRSHNCDLGFPTAKGYKTLTVADALTAKKAKATSK FT STTKAVSATISTVDSSDDEVTAAAAVLPNSPKVYASDSEEDADVSRCNVSA FT PLRVKHLFWNCQIHGLIDDFPVKTRALIDNGAHLVLIRPELAAELGLKKYR FT LREPEIVDVALKNSESNHRCELSKYVKLSFTSLDARWTSRKVKAIIAPGLC FT APVILGLPFLQHNSIVVDHADRSCIDKKSGYDLLNPPPCLPPPPPKPRLRE FT QIKNTKADKKLVLAELMLMCHDRQKSGKGIPEEVAPFNVAGAIRERVETLA FT AEEALQKREKGIKSEFKEIFEPIPHIDELPTGIVAEIHLKNAEKTIKTRTY FT PSPRKYKEAWQILIQQHLDAGRIRPSSSPFASPAFIVPKANPNVLPRWVND FT YRQLNENTVTDSHPLPRIDDILNDCAKGKIWATIDMTNSFFQTRMHPDHIP FT LTAVTTPLGLYEWLVMPMGLKNAPAIHQRRVTLALRQYIGKICHIYLDDIV FT IWSNTIEEHVSNVQTILQALHDARLYVNPDKTHLFCREIDFLGHHISARGI FT EADSKKADRILAWPQPKSVTDVRAFLGLVRYLAAFLPALAEHTGILTELTI FT KECEKAFPTWTDRYQTAFDSIKAIVTSRECLTTIDLSKLPEYKIFVTTDAS FT DKRSGAILSFGTTWENARPVAFDSMTFKGAELNYPVHEKELLAIIRALKRW FT RVDLLGSPFFIYTDHKTLENFVTQRDLSRRQARWMEFMSQFDAKIIYIKGE FT DNTVADALSRLPYSTSSQEAETSAQHPYNFCPDDESENMIASIFHCTTQGP FT RDAAKSLAHASDDMSSVNATLKISSDETFLQDIKAGYAEDSWCKTLPSAAL FT SLPTLQLRDDLWYIGNRLIIPRTGSLRETLFMLAHDTLGHFGFHKTYGSLR FT DAYYWPNMRRDLEEGYIKSCPECQRNKSSTTKPLGPLHPLPIPDQRGDSVA FT 
IDFIGPLPEDEGKNCIITFTDRLGSDIRIIATRTDITAEDLATLFFDEWYC FT ENGLPADIVSDRDKLFVSRFWKALHRLTGVKLKMSTAYHPETDGASERTNK FT TVNQALRFHVERNQLGWARALPRIRFDMMNTVNKSTGFSPFQLRMGRSPRI FT IPPLVPAKSNATVTDIDTWHVIRKLETDVLEAQDNLLKAKISQSTQSNKHR FT TLKFPFEIGSRVRLSTLHRRNNYKAKGEKRVAKFMPRYDGPYTIIDVDEDH FT STVTLDLPNSPNIFPVFHTSEILPYIESDTSLFPSRHLEEPRPIITEDGQE FT EYSIDKILDARRRGRGYQYLVRWSGYGAEHDKWLPGSELQDCEALDRWLES FT RVGSP" XX SQ Sequence 5346 BP; 1452 A; 1513 C; 1106 G; 1275 T; 0 other; ctttttttga agttacaccg cggtcgtatt attcacaata acgactgaac gcttttctcg 60 aaggaacccc tcgagatcgc tcttaccggc atcaacgcgc gcataatcct tgctgtatcc 120 ggcgcatata cgcgcatata tcatctccta ctgtacgtga cacgccgaac atacttgttc 180 ttgacgcgtc gtcctcacag tttacctgat agtggagggc aacccgcctt gaccgcaaac 240 cgaatcaagt acccgcgtgt cgatcctgga aaaccctcgc ttcacacacg catctgccct 300 tacgcaacag ctaccgatcc ttgaaggact gacgcccact ccgcatctcc gtaacaaccg 360 tcaaccacct caacacaacc gtcatcactc gtcttcaccg ttatccagtc ctccctcgtc 420 ctttgttcta tcaacagact catcagacga cgaaatccac gcatataccc aaccaaaact 480 gaaaatgtcg acccttgcgt cagttgaacc tactagcgga aaggctcctg tgctcactgc 540 gggagacctc acgccagcgg ttgcaatgga tttcgaaaat gcagcccagg acttttttgt 600 tacgaaatca gttcccctcg acaaacaagt cgcgctgatc cttccgggga ttaaggacat 660 acgtatccgc gactggatca cagcggaccg cgcacgtatc accactctct cttttgtcga 720 gttcatcaag gaactccgtg caaattacct ccagagtgac tgggaagatc agattcgcaa 780 ccaaatcctt acttccaccc tcacttcgtc ccacaaatca ttttggaatt ggtctcagca 840 ccttctctcg ctcaactgtc ttctccgtaa caccccatcc gcgctcgacg acatcgcttt 900 acacaatcat ctggaggctc acttagatga cgagttgaaa gagaaagtca agcatagcga 960 ggctcggaat gataaagtct ttaagacttg ggttgccgct gtccgcgttc tcgacgaagc 1020 acggactacc gagaacaaga gacaaatcga tctcatcgaa ggcgcccttc agcgtcaagc 1080 gaaacgccag gcaaccgatg caaatgcgct tcgtggcccg tcccgtcgca ataactctgc 1140 agcatctact tccacctcta ccacctcgaa tcgcttagct ccgcttaccg agggcgaaag 1200 gacccttctc aatgaacatg acggttgcac aaaatgccgt cgattctacg tgggacatcg 1260 ttcacataac tgcgatttgg gtttcccaac tgcgaaaggg tacaagaccc tcactgtagc 
1320 ggacgcgctc accgctaaga aggcaaaagc tacgtcgaag tcaaccacaa aagcagtttc 1380 cgcgaccatc tccacggtcg actccagtga cgacgaagtt acagcagcgg ctgccgtcct 1440 tcctaattct ccgaaggttt atgcttcgga ttcggaggaa gatgctgacg tttctcgctg 1500 taatgtgagt gcgcccctcc gtgttaaaca tctattctgg aactgtcaga ttcatggtct 1560 gatcgatgac tttccagtga aaacgagggc actcatcgac aacggcgcac acctcgtcct 1620 catccgtcca gaactcgctg ctgaactagg cctaaaaaaa tatcgtctcc gagaacctga 1680 aatagtcgac gttgccttga aaaactctga atcgaaccat agatgtgaac tctccaaata 1740 cgttaaactt tcgtttactt ccctagacgc cagatggacc tcacgaaaag ttaaagccat 1800 catcgccccg ggcctctgtg cccctgtaat acttggtctc cctttcctgc agcataattc 1860 aattgtggta gatcacgcag accgttcgtg tatagataaa aaatcaggat acgatctttt 1920 gaacccccct ccttgtcttc ctcctccacc gcccaaacct cgtttgcgag aacaaataaa 1980 aaatacaaag gcggataaga aattggtact ggcagaacta atgttgatgt gccatgatcg 2040 ccaaaaatct ggtaaaggaa ttcctgaaga agtagcgcct tttaatgttg caggcgcgat 2100 ccgcgaacgc gtcgaaacac tcgctgctga agaggccctt caaaagcgag aaaaaggaat 2160 taaatcagaa ttcaaagaaa tttttgaacc catcccacac atagatgaac tacctacagg 2220 aatcgtcgcc gaaatacatt tgaaaaatgc agaaaaaaca ataaaaacga gaacataccc 2280 ttcacctcgc aaatacaaag aggcatggca aattctcatt caacaacacc ttgatgccgg 2340 tcgtatacgc ccgtcatcat ctccattcgc ttcaccagca ttcattgtac ctaaagcgaa 2400 cccgaatgta ttgccgcggt gggtcaacga ctaccgtcaa ctaaatgaaa acacggtgac 2460 cgacagccac ccacttccgc gcatcgatga tatattaaac gattgcgcga aaggcaaaat 2520 ttgggcaact attgacatga cgaatagttt ttttcaaacc cgaatgcatc ctgatcacat 2580 tcctctaacc gccgtcacca ctccattagg tctatatgag tggctagtaa tgcccatggg 2640 gttgaaaaat gcaccagcga tacaccagcg acgtgtcacc ttggccctac gacaatacat 2700 tgggaaaatc tgccatatat accttgacga tatcgtaatc tggtctaaca ctattgaaga 2760 acatgtatct aacgtgcaga caatactcca agctctccat gatgcacgtc tgtacgttaa 2820 tcctgacaaa acacatttat tttgtcggga aatcgacttt ctcggtcatc atattagtgc 2880 acgcggaatc gaggcagact caaaaaaggc cgaccgtatt ctggcatggc cacaacctaa 2940 atcagtcaca gacgtccgcg ctttcctggg tcttgtacgt tacttggcgg ctttccttcc 3000 
tgcactggct gaacataccg gcattctaac ggagctcaca attaaagaat gcgagaaagc 3060 cttcccaact tggacggatc gctatcaaac ggcttttgac agcatcaaag caattgtaac 3120 cagtcgggaa tgcctgacaa caattgatct gtccaagtta ccagaataca agattttcgt 3180 caccaccgat gctagtgaca aacgttcggg ggccattcta tcctttggca ccacttggga 3240 aaatgcgcgc cctgtcgctt tcgattctat gacattcaaa ggggccgaac tgaactaccc 3300 tgtccacgaa aaggagctac ttgctatcat tcgtgccttg aagaggtggc gagtggactt 3360 acttgggtcc ccctttttca tctatactga tcataaaacc ttggaaaact tcgtgacaca 3420 aagggacctc tcacgccgcc aggcacggtg gatggaattt atgtcccaat ttgacgccaa 3480 aatcatctac atcaagggcg aggacaacac agtggctgac gcgttgtcgc gcttacccta 3540 ctcgacttcc tcacaagaag cagaaacctc tgcccaacat ccttacaact tctgtccgga 3600 tgatgaatcc gaaaacatga ttgcaagcat atttcattgc actacccagg gccctcgcga 3660 tgcagctaaa tcactcgcac atgcgtcaga cgatatgtct tccgtcaacg cgacccttaa 3720 aatttcgtct gatgaaacct tcttgcagga catcaaagca gggtacgcgg aagattcatg 3780 gtgtaaaacg ttaccctctg cagccctaag cctaccgaca ctccaacttc gcgacgacct 3840 ctggtatatt ggcaaccggc tgatcattcc acgcaccggg agcttgcgcg aaaccctatt 3900 catgctcgca cacgacactc tgggccactt cggttttcac aaaacctacg gatccctgcg 3960 cgacgcttat tattggccaa atatgcgacg tgatcttgaa gagggttaca tcaaatcctg 4020 tccggaatgt caacgcaaca aaagcagtac gacaaaaccc cttggtcctt tacatcccct 4080 tcccattcca gaccaacgcg gcgactctgt cgctattgac ttcatcggac cgttaccgga 4140 agatgaagga aaaaactgca taatcacatt cactgatcgt ttgggaagtg acatcaggat 4200 catcgcgaca cgcaccgata tcacagccga agatttagcc actcttttct ttgatgagtg 4260 gtattgcgag aacgggttac cagctgacat cgtctccgac agggataaac tcttcgtgtc 4320 gcggttctgg aaagcgttac accgtctcac aggtgttaaa ctgaagatgt caactgcgta 4380 ccacccggaa accgacggcg ccagtgaacg taccaataaa acagtaaatc aagctctccg 4440 ttttcacgtc gaacggaacc aactgggttg ggcacgcgcg ttgcctcgaa ttcgatttga 4500 catgatgaac accgtcaaca aatcgactgg cttttcgcct ttccaacttc gcatgggccg 4560 aagcccccga atcatccctc ctcttgtacc cgccaaatcg aatgctaccg tcacggacat 4620 tgacacatgg catgtaatac ggaaactgga gacggatgtt ctcgaagcgc aggacaactt 4680 gttgaaggct 
aagatttcac aatctactca atcgaataaa caccgcacac tgaagttccc 4740 gtttgaaata ggttctcgtg tgcgattatc aacattgcat cgacggaaca attacaaggc 4800 gaagggcgag aagcgcgtcg caaaatttat gcctcgctat gacggccctt acaccattat 4860 cgacgtcgac gaggaccact caacggtaac actcgacctc ccgaattctc ccaacatctt 4920 ccctgttttc cacacatccg aaatccttcc ctacattgaa tccgacacat ctctatttcc 4980 ttcccgccat cttgaagaac cccgccctat catcactgaa gacggacaag aagaatactc 5040 gatcgataaa atattggacg cgcgccgacg aggtcgtggc taccaatatc tcgttcgttg 5100 gtccggctac ggcgcagaac acgacaaatg gctcccaggc tctgaacttc aagattgcga 5160 ggctctcgac cgttggttag aatcgcgagt tggttcacct taattttagg tagctttctt 5220 tcgaccttgc cagcggtagc tttttcccac tgggttttga cgcacccgcg ctcggattgc 5280 acttacatct tcaattacta acatcttttt tttccgttcc tttttctttt ttaaagcggg 5340 ggaggg 5346 // ID CALTR1 repbase; DNA; FNG; 508 BP. XX AC AF069450; XX DT 30-MAY-2000 (Rel. 5.04, Created) DT 30-MAY-2000 (Rel. 5.04, Last updated, Version 1) XX DE Candida albicans retrotransposon long terminal repeat zeta, DE complete sequence. XX KW LTR Retrotransposon; Transposable Element; CALTR1; ZETA. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-508 RA Goodwin J.T. and Poulter T.R.; RT "Multiple LTR-retrotransposon families in the asexual yeast RT Candida albicans."; RL Genome Res 10(2), 174-191 (2000). XX RN [2] RP 1-508 RA Goodwin J.T.; RT "CALTR1."; RL Direct Submission to Genbank (31-MAY-1998)Department of RL Biochemistry, University of Otago, Cumberland Street, Dunedin, RL New Zealand. XX DR GenBank; AF069450; Positions 1 508. 
XX SQ Sequence 508 BP; 185 A; 63 C; 89 G; 171 T; 0 other; tgttgtcata tagctaatgc taattcttga ttagtgtgga aagcctaata aggttatatt 60 gtgcacaggt taactacctt aatatagtta ttgttaatac agttattgct gttgactact 120 attgttattg ttaaattaaa gtgttaggtt gagttaattg attagtgaaa accaactaac 180 taccgtatta aattagtgta ttaagattga ttcctattaa ggataaaaca gagagtgtgt 240 tagaaagaga aagggtggat tataaatatg tgtaaaatcc cctttagaga ctaatcacta 300 gaaatctatt gatggtttca tatatagaga ttaacgatta tatttataat ataagttggt 360 agttgctagt atatttgaaa gcactacagt atagtatgtc agaatcagat catttaaact 420 ctactaataa tacaggaaac actttcatta gtctagatca agccagtaca ataatggcag 480 atcaaactca aggagctaac ccacaaca 508 // ID Copia-50_MLP-LTR repbase; DNA; FNG; 112 BP. XX AC AECX01002655; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-50_MLP_; KW Copia-50_MLP-I; Copia-50_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-112 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01002655; Positions 416 527. XX SQ Sequence 112 BP; 26 A; 32 C; 14 G; 40 T; 0 other; tgtgtctcta cgttgctacc ttccgacagg tgcatacctt tcatttccat cactcttagt 60 gatatctagc catagcgctt tccctaaatc actaatacat gattccttat ca 112 // ID CALTR2 repbase; DNA; FNG; 531 BP. XX AC AF192278; XX DT 30-MAY-2000 (Rel. 5.04, Created) DT 30-MAY-2000 (Rel. 5.04, Last updated, Version 1) XX DE Candida albicans retrotransposon LTR kahu, complete sequence. XX KW LTR Retrotransposon; Transposable Element; CALTR2; KAHU. 
XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-531 RA Goodwin J.T. and Poulter T.R.; RT "Multiple LTR-retrotransposon families in the asexual yeast RT Candida albicans."; RL Genome Res 10(2), 174-191 (2000). XX RN [2] RP 1-531 RA Goodwin J.T.; RT "CALTR2."; RL Direct Submission to Genbank (04-OCT-1999)Department of RL Biochemistry, University of Otago, Cumberland Street, Dunedin, RL New Zealand. XX DR GenBank; AF192278; Positions 1 531. XX SQ Sequence 531 BP; 178 A; 83 C; 101 G; 169 T; 0 other; tgttgatatg gtaatgttca gttcgtaaga acacgacaaa tggaagttgc cagaaagatc 60 tagatgaata aagtggtagt gcagaaatgc acaaaatatg aattactgtt aacaaatgac 120 tgcgttaagc tactgtaaag tcattttact gtgaaagtcc aaagtgtcca tatgaaagtc 180 acatatgtgg tagaatatgc tagaagaatc atatactggg gaatgatcaa gagtggctgt 240 gtttcacggt tatactcgag tatatttgcc accgatgact ttatatggtg taaatctcac 300 tgactataaa cgtacaacct agttgtaaat gtactgtatt aaagtttact aaccgactat 360 atatagaggg cctacttcct tgaagttggc attagtttag gatttagata actaagttta 420 atatactgat taactgttct atataagatt catcatgtct actcaagatt ctactgcaaa 480 ttccacttct ggattatctc aaactgctga tcagaatgcg tcatttcaac a 531 // ID Gypsy-24_LBS-LTR repbase; DNA; FNG; 144 BP. XX AC ABFE01002141; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_LBS_; KW Gypsy-24_LBS-I; Gypsy-24_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-144 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01002141; Positions 23733 23590. 
XX SQ Sequence 144 BP; 46 A; 34 C; 36 G; 28 T; 0 other; tggaataggg aaaggagcat tcccacagga gagcacatga ccattagatt agtcatcagg 60 tagtgtacac gagcatgtag agagggcaat ccatcatcat tccaccaggc atctgcatcg 120 taggaagcac tagagcccat tcca 144 // ID TY repbase; DNA; FNG; 6536 BP. XX AC X02417; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 22-AUG-2005 (Rel. 10.09, Last updated, Version 2) XX DE Yeast Ty transposable element Ty-pY109 near tRNA-Lys1 gene. XX KW LTR Retrotransposon; Transposable Element; TY; direct repeat; KW enhancer; Inverted repeat; reverse transcriptase; KW unidentified reading frame. XX NM TY. XX OS Saccharomyces cerevisiae OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. XX RN [1] RP 1-6536 RA Hauber J., Nelboeck-Hochstetter P. and Feldmann H.; RT "Nucleotide sequence and characteristics of a Ty element from RT yeast."; RL Nucleic Acids Res 13(8), 2745-2758 (1985). XX DR GenBank; X02417; Positions 1 6536. 
XX SQ Sequence 6536 BP; 2331 A; 1398 C; 1046 G; 1759 T; 2 other; agatctctta ttctatataa gagaagtata gaatacagcc ttattagtaa tattaaacat 60 tgctcatatg attatcatga aagcatatta tgatttcaat gtcatgactc acatccgtat 120 tgttggaata aaaatccact atcgtctatc aactaatagt tatattatca atatattatc 180 atatacggtg ttaagatgat gacataagtt atgagaagct gtcatcgagg ttagaggaag 240 ctgaagtgca aggattgata atgtaatagg ataatgaaac atataaaacg gaatgaggaa 300 taatcgtaat attagtatgt agaaatatag attccatttg aggattccta tatcctcgag 360 gagaacttct agtatattct gtatacctaa tattatagcc tttatcaaca atggaatccc 420 aacaattatc taattaccca catatatctc atggtagcgc ctgtgcttcg gttacttcta 480 aggaagtcca cacaaatcaa gatccgttag acgtttcagc ttccaaaatt caagaatatg 540 ataaggcttc cactaaggct aactctcaac agacaacaac acctgcttca tcagctgttc 600 cagagaaccc ccatcatgcc tctcctcaac ctgcttcagt accacctcca cagaatgggc 660 cgtacccaca gcagtgcatg atgacccaaa accaagccaa tccatctggt tggtcatttt 720 acggacaccc atctatgatt ccgtatacac cttatcaaat gtcgcctatg tactttccac 780 ctgggccaca atcacagttt ccgcagtatc catcatcagt tggaacgcct ctgagcactc 840 catcacctga gtcaggtaat acatttactg attcatcctc agcggactct gatatgacat 900 ccactaaaaa atatgtcaga ccaccaccaa tgttaacctc acctaatgac tttccaaatt 960 gggttaaaac atacatcaaa tttttacaaa actcgaatct cggtggtatt attccgacag 1020 taaacggaaa acccgtacgt cagatcactg atgatgaact caccttcttg tataacactt 1080 ttcaaatatt tgctccctct caattcctac ctacctgggt caaagacatc ctatccgttg 1140 attatacgga tatcatgaaa attctttcca aaagtattga aaaaatgcaa tctgataccc 1200 aagaggcaaa cgacattgtg accctggcaa atttgcaata taatggcagt acacctgcag 1260 atgcatttga aacaaaagtc acaaacatta tcgacagact gaacaataat ggcattcata 1320 tcaataacaa ggtcgcatgc caattaatta tgagaggtct atctggcgaa tataaatttt 1380 tacgctacac acgtcatcga catctaaata tgacagtcgc tgaactgttc ttagatatcc 1440 atgctattta tgaagaacaa cagggatcga gaaacagcaa acctaattac aggagaaatc 1500 cgagtgatga gaagaatgat tctcgcagct atacgaatac aaccaaaccc aaagttatag 1560 ctcggaatcc tcaaaaaaca aataattcga aatcgaaaac agccagggct cacaatgtat 1620 ccacatctaa taactctccc agcacggaca 
acgattccat cagtaaatca actactgaac 1680 cgattcaatt gaacaataag cacgaccttc atcttaggcc agaaacttac tgaatctaca 1740 gtaaatcata ctaatcattc tgatgatgaa ctccctggac acctccttct cgattcagga 1800 gcatcacgaa cccttataag atctgctcat cacatacact cagcatcatc taatcctgac 1860 ataaacgtag ttgatgctca aaaaagaaat ataccaatta acgctattgg tgacctacaa 1920 tttcacttcc aggacaacac caaaacatca ataaaggtat tgcacactcc taacatagcc 1980 tatgacttac tcagtttgaa tgaattggct gcagtagata tcacagcatg ctttaccaaa 2040 aacgtcttag aacggtctga cggcactgta cttgcaccta tcgtaaaata tggagacttt 2100 tactgggtat ctaaaaagta cttgcttcca tcaaatatct ccgtacccac catcaataat 2160 gtccatacaa gtgaaagtac acgcaaatat ccttatcctt tcattcatcg aatgcttgca 2220 catgccaatg cacagacaat tcgatactca cttaaaaata acaccatcac gtattttaac 2280 gaatcagatg tcgactggtc tagtgctatt gactatcaat ggcctgattg tttaatcggc 2340 aaaagcacca aacacagaca tatcaaaggt tcacgactaa aataccaaaa ttcatacgaa 2400 ccctttcaat acctacatac tgacatattt ggtccagttc acaacctacc aaaaagtgca 2460 ccatcctatt tcatctcatt tactgatgag acaacaaaat tccgttgggt ttatccatta 2520 cacgaccgtc gcgaggactc tatcctcgat gtttttacta cgatactagc ttttattaaa 2580 aaccagtttc aggccagtgt cttggttata caatggaccg tggttctgag tatactaagc 2640 agaactctcc ataaattcct tgaaaaaaat ggtataactc catgctatac aaccacagcg 2700 gattcccgag cacatggagt cgctgaacgg ctaaaccgta ccttattaga tgaactgccg 2760 tactcaaact gcaaatgtag tggtttaccg aaccatttat ggttctctgc aatcgaattt 2820 tctactattg tgagaaattc actagcttca cctaaaagca aaaaatctgc aagacaacat 2880 gctggcttgg caggacttga tatcagtact ttgttacctt tcggtcaacc tgttatcgtc 2940 aatgatcaca accctaactc caaaatacat cctcgtggca tcccaggcta cgctctacat 3000 ccgtctcgaa actcttatgg atatatcatc tatcttccat ccttaaagaa gacagtagat 3060 acaactaact atgttattct tcagggcaag gaatccagat tagatcaatt caattacgac 3120 gcactcactt tcgatgaaga cttaaaccgt ttaactgctt catatcaatc gttcattgcg 3180 tcaaatgaga tccaacaatc cgatgatctt aacatagaat ctgaccatga cttccaatct 3240 gacatcgaac tacatcctga gcaaccgaga aatgtccttt caaaagctgt gagtccaacc 3300 gattccacac ctccgtcaac tcatactgaa gattcgaaac 
gtgtttctaa aaccaatatt 3360 cgcgcaccca gagaagttga ccccaacata tctgaatcta atattcttcc atcaaagaag 3420 agatctagcg gcccccaaat ttccaatatc gagagtaccg gttcgggtgg tatgcataaa 3480 ttaaatgttc ctttacttcg tcccatgtcc caatctaaca cacatgagtc gtcgcacgcc 3540 agtaaatcta aagatttcag acactcagac tcgtacagtg aaaatgagac taatcataca 3600 aagctaccaa tatccagtac gggtggtacc aacaacaaaa ctgttccgca gataagtgac 3660 caagagactg agaaaaggat tatacaccgt tcaccttcaa tcgatgcttc tccaccggaa 3720 aataattcat cgcacaatat tgttcctatc aaaacgccaa ctactgtttc tgaacagaat 3780 accgaggaat ctatcatcgc tgatctccca ctccctgatc tacctccaga atctcctacc 3840 gaattccctg acccatttaa agaactccca ccgataaatt ctcatcaaac taattccagt 3900 ttgggtggta ttggtgactc taatgcctat actactatca acagtaagaa aagatcatta 3960 gaagataatg aaactgaaat taaggtatca cgagacacat ggaatactaa gaatatgcgt 4020 agtttagaac ctccgagatc gaagaaacga attcacctga ttgcagctgt aaaagcagta 4080 aaatcaatca aaccaatacg gacaacctta agatacgatg aggcaatcac ctataataaa 4140 gatattaaag aaaaggaaaa atatatcgaa gcataccaca aagaagtcaa ccaactattg 4200 aaaatgaata cttgggacac tgacaaatat tatgacagaa aagaaataga ccctaaaaga 4260 gtaataaatt caatgtttat cttcaacagg aaacgtgacg gtactcataa agctagattt 4320 gttgcaagag gtgatattca gcatcctgac acttacgact caggcatgca atccaatacc 4380 gtacatcact atgcattaat gacatccctg tcacttgcat tagacaataa ctactatatt 4440 acacaattag acatatcttc ggcatatttg tatgcagaca tcaaagaaga attatacata 4500 agacctccac cacatttagg aatgaatgat aagttgatac gtttgaagaa atcactttat 4560 ggattgaaac aaagtgagcg aactgtacga aactatcaaa tcatacctga taaaacagtg 4620 tggtatggaa gaagtcgtgg atggtcatgc gtatttaaga atagtcaagt aacaatttgc 4680 ttattcgttg atgatatgat attgttcagc aaagacttaa atgcaaataa gaaaatcata 4740 acaacactca agaaacaata cgatacaaag ataataaatc tgggtgaaag tgataagcaa 4800 attcagtacg acatacttgg cttagaaatc aaatatcaaa gaggtaaata catgaaatta 4860 ggtatggaaa actcattaac tgagaaaata cccaaattaa acgtaccttt gaatccaaaa 4920 ggaagaaaac ttagcgctcc aggtcaacca ggtctttata tagaccagga tgaactagaa 4980 atagatgaag atgaatacaa agagaaggta catgaaatgc aaaagttgat 
tggtctagct 5040 tcatatgttg gatataaatt tagatttgac ttactatact acatcaacac acttgctcaa 5100 catatactat tcccctctag gcaagtttta gacatgacat atgagttaat acaattcatg 5160 tgggacacta gagataaaca actgatatgg cacaaaaaca aacctaccga gccagataat 5220 aaactagtcg caataagtga tgcttcatat ggtaaccaac catattacag tcacaaattg 5280 gcaacatata tttacttaat gggaaaggta attggaggaa agtccaccaa ggcttcatta 5340 acatgtactt caactacgga agcagaaata cacgcgataa gtgaatctgt cccattatta 5400 aataacctca gtcaccttgt gcaagaactt aacaagaaac caattactaa aggattacta 5460 accgacagta aatctacaat cagtataatt atatccaata atgaagagaa atttagaaac 5520 agattttttg gtactaaagc aatgagacta agagatgaag tatcaggaaa tcatctgcac 5580 gtatgctata tcgaaaccaa aaagaatatt gcagacgtaa tgaccaaacc tcttccgata 5640 aaaacattca aactattaac aaacaaatgg attcattaga tctattacat tatgggtggt 5700 atgttggaat aaaaatccac tatcgtctat caactaatag ttatattatc aatatattat 5760 catatacggt gttaagatga tgacataagt tatgagaagc tgtcatcgag gttagaggaa 5820 gctgaagtgc aaggattgat aatgtaatag gataatgaaa catataaaac ggaatgagga 5880 ataatcgtaa tattagtatg tagaaatata gattccattt gaggattcct atatcctcga 5940 ggagaacttc tagtatattc tgtataccta atattatagc ctttatcaac aatggaatcc 6000 caacaattat ctaattaccc acatatatct cacgtatatg tgcaaacacc aagttgatct 6060 atttttactt tcctggaaaa cgccatcgaa atagtcgcca aacaaatcga tgggctggga 6120 aaacgcgtca tcaattcaac cgaataagga aaactaagcc acttcacgcg gttccggata 6180 tttgtccagc ctctttttcc gaaaaaaaaa aaattaaata ataaaatgaa acggacagga 6240 attgaacctg caacccttcg attgcatctt attccgtgga tttccaagat ttaattggag 6300 tcgaagtcta ccattgagcc acgttcatct tgaannctgc cgaaattctg tgttgctata 6360 atggttgaat tagaatctct taaaatagct actcatactt cttcataact aatccattag 6420 tgaccatatg aagcaatcgg acgccacaca tcattgatgt ttcacgatgg agaatgataa 6480 cccactaagt ggcgattgtg ggcaaagtaa gttaaacacc tattgctcat atgatc 6536 // ID LTR-3a_AN repbase; DNA; FNG; 360 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 09-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Subfamily of LTR-3_AN long terminal repeats - a consensus DE sequence. 
XX KW LTR Retrotransposon; Transposable Element; LTR-3_AN; LTR-3a_AN; KW solo LTR. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-360 RA Kapitonov V.V. and Jurka J.; RT "LTR-3_AN, a family of solo long terminal repeats in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(12), 209-209 (2003). XX DR [1] (Consensus) XX CC LTR retrotransposon. Solo LTR. CC The consensus sequence is 75% identical to the LTR-3_AN CC consensus. CC LTR-3a_AN elements are ~76% identical to the LTR-3a_AN consensus. CC It is possible that LTR-3a_AN is a consensus sequence of ~20 CC subfamilies CC represented by single copies only. XX SQ Sequence 360 BP; 101 A; 90 C; 84 G; 82 T; 3 other; tgttaaggat ctacttcgtc aggatagatc agcctgttaa ccaggtaagg ataccaacgc 60 catgtaaagt taacgatagg agaacgaata gtcaatgtaa tgcaagacta aaccttgggg 120 atcccccaag accgggggat ctcccatggt cctaaatata tctatgggga tatacgcatg 180 taacctggca gcaacatcat gacgaagttc tatccgaaaa cggcggctra tcccgacggt 240 tccgccarcg tagcatctcc cagagaagga tccctaggtc tcgatcgtac ccgaggttcy 300 catggttgtt tgcctgaggc tcctgaccct tgatatcgaa tcctagcaga atccgtaaca 360 // ID CACTA-1_Roryzae repbase; DNA; FNG; 7739 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW CACTA-1_Roryzae. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). 
XX DR [1] (Consensus) XX SQ Sequence 7739 BP; 2495 A; 1352 C; 1266 G; 2626 T; 0 other; cactgcaccc aaaccctata ttctaaatta ttatacatcg ggtattttaa cagtgaaaca 60 tcgtgtaaaa taatacgaaa ttgaattttc agaatagaga tgtaattttt gtaccaaaaa 120 aaagctagaa aaaagtatat actatatctg taaagatata tatatgttta cactctgcat 180 actgaataaa atgaaattgt atttcttgta aaattttttt tgtctctttt gtactacatg 240 aatccatggc aacttttaaa ttgggaaaat gaaaaatttt caaaaccagt gtatgatttt 300 ttaatttcaa caaaggccaa taaagtatat ctattggatt gtaagctcgg acatataact 360 atttgagttg cgacgttctc ttatcaagca tactgaattt gctatactgc aaaaaaagaa 420 aaaagaaaag tatgcttagt ttatgtaccc aaaaaaaaag ataaaaaaaa ttgtttgttt 480 acactgaaaa gataaagaac aacatttctt aagttgcgaa agggaacata ggctcaatga 540 ttttatccaa cgaagagtaa aattaattga ttacaattgg aacaacagaa tatatcatag 600 gaagtaataa atgcaaatgt taaataagag ataaaataaa aatgaacaag gaaaaaagaa 660 aaaaaaaaga aatacaattt tccttttttt gaccacaagg aaaataactt ctaccacaga 720 tgataaggca atgtttaaac gactaaagct aagcagaaaa taaggaggaa tgggtttggt 780 agtcattttt atttttaact gcaaaaagaa acaataatta ttaacatcaa acgaatttac 840 atgttttgag attccttata gcagagacta taaaaaagag ctattgttaa agaaataaaa 900 tagtgcatct taagagattt agaaaaaaaa aaaaaagtaa tgttgaaatt agagaattgc 960 aatgtcggaa ttacttgcat gtgaaaaaaa gcaatgttgg aattacttgc atgttaaaaa 1020 aaaagaatgt taaaaatgac atttcaaacg aatacctact tttacaaatt gccttaagtt 1080 gaacaaagcc tacatacagg atccttgttt tattcataga gtatacaaag atttactttg 1140 ggttttagaa agcggtttta gaaattgcac tgaaattgga aatgtttttt ttcaaccaag 1200 aaaaaaaaat ctaactgaga tgtcaattag aaatgtgtta attcatttac ctttgtttgc 1260 taatttcaat ttcttattca atgtatctat gctaatttat gtgctttttg tcccaacttt 1320 aagaatatca tcactttgtc ttttttcttt ttctttttct ttttcttttt cttcttcttt 1380 tttttttttt ttttttgtaa agtagtgcag ttaagtaagc caatcaatat agtactgaat 1440 atgacgggaa aagattcaaa tcttttaaca gaaaagttga attgatttta ctcgatgaaa 1500 aatgtattat tatttagtcg caattgtatg atgtcgtaat atagcaacat caattgaaat 1560 ttgttaaaga ttcttattct ttgattttgg agttgtatat aaggagttca taaagcgaca 1620 tatcaatgca 
taaaaactgt tattgtattc atacaaaaaa ataagtactc atgaataaat 1680 ttaatatcat tgatttacag taaaaccata tctctataaa taaatatgag acttctgcaa 1740 taattacctt tgatctgttg ttcatatcgg tattttaaga atcacatcgg gacctgtttt 1800 taagaacggt acatcaatgt aatcctattt gataaatatt gttaaatcaa taaacaactc 1860 acatcatcat tgattgattc gtcaagaaga caacatgtca tatttatcca ttatctttcc 1920 tgaatcttcc cattctcttt gtatcattat tttaaccgat gaatagaaat agcggtaaaa 1980 atttacatat tcaaaaagta taaataacat caattgttgt tgtaaatcaa acagtatata 2040 attagtaacg caagttcttt attgtattta taatgtattc ttcaagccca tacaaattcc 2100 attgttactg cgcccaatgc tctggaggag aagcaggcaa gtaccaattt gttacgcaat 2160 ggactcttgc tcgccataga aagaaagaag ctgttaaagg tttgttactt acattaagat 2220 agtttttttg ctaacgcatc atataaacag gtgaatcaga aataattcca gaagttaaca 2280 ccaataatga agtgattcca tcaagtgaaa atgattctga gaataatggt gatgaagact 2340 tttctaattg gcaagtcatt gaaaatgatg aagatgatgc ctctagttgt atggatgttg 2400 tcgaattcaa tgtgcccaaa ggtaaaaacc ttctagaatt gttttaataa aatataaaat 2460 acttactttg tcaaacgttt agttaaactt gaaggattcc aaagtgatgt tgttgctttc 2520 atagcaattt tcttgacgta tttccagctg tttttcattt ctgaatatgc tgcggaaata 2580 gttatgaaat ttgctaatga tcttgttgtc ctcttacatt atgaaccaat cttccctacc 2640 aagttggaca ctcttcgaaa tgctgttgga attgactata cctacaaagg tatgtttcgc 2700 cttgttgtct gtcctgaatg ccataccctg taccagcctg aagaagtaca ctgtgattcg 2760 aagtgcacct tttctgaatt tcgcatcaca tgtaatgccc cactcttcaa gcctgctacg 2820 attggtgcaa gcaaaatgta cgccaacaag gtctctgcat tcaattccat caagtatgcc 2880 ttgactgtga tgttctctcg accaggcttt gaatcagcaa tagaagcatg gcgctatcgt 2940 actcgtcata acaacaccat gtatgatatc tatgatggtc aactttggaa tgaattcaaa 3000 gatagagatg gtaatttatt taccagtcaa gctagatctc ttctttttac gttgaatgtc 3060 gattggtttc agtcatccaa aagaaacgtg tactctgttg gtgctatcta cttgacaata 3120 aataaccttc caagatccat cagatacaaa aaagaaaata tcatcttggt ttgcgttatt 3180 cctggtccca aagaacccaa ggacacacaa atcaacaact atctacaagt tttggtaaat 3240 gatctgaagg aactttacta cgatgacttc tttacctgta cggctgaatc tcctaacgtg 3300 cctgtcccag ttcgtgctgc 
tctgttcttg attgcttgtg atattcctgc tagccgaaag 3360 gttagtgggt ttacttcatt caatgccact gtaccttgta acaagtgctc aacagcattt 3420 cctgctcgtt ataatactca tcttcaacgt gatttttctt ttggtcttga agatattgat 3480 gcttggtcac ttcgtaccaa tattgaaaac cgttatcatg caaaaaattg gaaagaagca 3540 accaccaaga gaaaaagagc cgacttggaa cagaagtttg gtactcggtt ctctgcctta 3600 catgatttgg aatatcttga tcttgttcgt tgtactgcca ttgatcctat gcatggtctt 3660 ttccttggaa cagcaaagaa aatggttaaa atatggcgta caactatttg tgaccttacc 3720 catgaattat acctgactga tgctgacttg aaagaaatgc aaaaagaagc aaatgatatt 3780 atattgccag cacaatacac gccaatcaat cagaaaattg ccagcgattt ctctgactta 3840 aaggctgatg aatggagaac ctggtgtctt gctctttctc ctcttttgct caagtctcgt 3900 cttccagtcc gtcataaaga aaactgggca aagtttgtac aggcctgtca tatcgtatgt 3960 cgtccatgca ttacccagaa tgaagccatg atagcccaca agctattcca tgagtttgtc 4020 cttggtgttg ctgaattgta tggtccagaa atggttactc caaatatgca tcttcatcta 4080 catttgaagg actccattca agactttggt ccaatctatg cattttggtt gtatggcttt 4140 gaaagactta atggtgacat taagaagatg acggtaaact acaaaacagc attcgaagtt 4200 acatacatga agaagttcct ttctgttgta cactatggtg actacatccg taacttacca 4260 caaggaatta aacagaatcc agtgatgatg tcttcctttg tctatttgtc accaacacct 4320 tctgattccc ttgttgcttc atctgtttca accagctttg atttcagctt atttgtcagt 4380 gctcctctac gtttgaatga taccttcact ggtgctgaaa ttctacctcc tgatacacaa 4440 gcttctgcta gacacaaagt tagtgttaat agattgactc aagcccatta tggttacctt 4500 cttgcctttt acagacttgt atatactgat cgaacattcg caagtgcgct tattccagaa 4560 gaccacgatg acttgtctac aatggtactt cccgatattt ctgttttcgc tgaaatcgaa 4620 attcttgggc aaacgtatcg atcaaaagca tcgcgtacta ctcgtggttg ctatattgaa 4680 gtcgcttgta atcctactat tcctggaaaa gaggctgaga tgcgaattgg tgaagttcag 4740 tattatttta gccatcagct tcaaatgaag aaaactataa ttccaaatgg tcgggtcttt 4800 gccccaaatg cttttgatga acacctcttt gcttttgtac gttggtataa cgctcctctt 4860 catccttttc gaggatttga atgtcttggt gccgcgtact accacaactc ttttcgacca 4920 gctggttcag actgtattct tccagtatcg cgtatcttta cttgtgttgc tatgaagcaa 4980 ggttatcctg ataatcatgt tgtcttttta 
cctctcccaa gaaaaaccat tggtttgtaa 5040 aaagtaaaaa aaataaaaaa ataaagtaaa ataaaaaaag taaaagtaaa aaaaaaattt 5100 aaaatttaaa aaatccaaac accgcatttc tttttatttt ttttttgttc attttataat 5160 tttttttaaa actggaagtc gtcttcatcc aactgcattt cagaatcaaa agcccaagcg 5220 atactctctt cagttatatt ttcttttctg acgaagtcag gaacaggcag cgaacattct 5280 cttcgaatgc gaggaagaga ctgaatgttg ctggttccag tcctttgtct cttggccctg 5340 tcacttgcta tctgatcatc caacagcttg aagaattcag tgcactaaat attggataat 5400 gagctgattg cgcttaaaaa gatattaatt aaacttactg ctggtgatct gtactttggt 5460 acacgaactt caaaataatc aatggttttt gtacacaaga cagacatgcg ttccttttcg 5520 gtaaaaggag ttgttctatc tggattttgc tccagaagaa cttcctctac cttatccatt 5580 gtatcgctat tatacatgtc ttcttcttca gacatgtact gctgttgaag caagtactcg 5640 cagttcggat agttgttgta aacgccttga tccttgtgct gctcatacat tgctttccgt 5700 gttttgattt tctaaaaaaa aaatcaatta taaaatagca atagccttta ttgtccttac 5760 gttgtttttt cgagtcgttt tagcagcctt tcttttcttt acgaactcgt cgacaacacc 5820 aacctcctct ccagccatac cagctttctt ggtataatac tgcctttcaa ggtatttccg 5880 caaatcgcgg ccattggtat gcttgtacgc ataatggcta ttgacatact cttgcaacag 5940 cttaagtact ttttgattct caacagactt tagaggcttg ctgagatcgt acttagatga 6000 cggaggtaaa tcattgtaat gattcctcac agcatcctaa aattaaagaa aaaaaaagta 6060 attattacac ttgtttaaat ttaaattatt aataacctta ccttaatcgc tgttgcaatt 6120 ggttttgcac ttttgctgga ttcttgttca agctcgctgt aggcttcttt gaccttggca 6180 aatgccactt gcaaggcacg cttttcttcc tgctcttttc ttaacatctc cttcaaagtg 6240 tcaatttcct ttctcatatc cagataaatg gagaaggaaa gagaagcttc atcacgtcca 6300 acatcttcag cttcacgcat tcgcttctgg acattggtca taggggggta accaggtcta 6360 ggataggcaa agctatcaag aataccatca cttgacacaa actcaggaac aacagcacgg 6420 gaaggaagag caattggctg cccagcagag ttagagccac gagcattgac aggagcaggg 6480 ttgaattcga taatacggtc agacataatg ttataaggta ctttgaagaa aaagtaaaga 6540 aacttaacgt tgaagttgaa cacaccttaa atactattgc gttcaattgt aaactcagtt 6600 ctatacaaat caaaaaatct ataaaaggat tgatgtctca gatcaatctt ttcttttctt 6660 tctttctttt ttttgatttg ttatcccagg taagttgttc 
ttttttttaa aaattttttt 6720 tttctaatgt actttattaa aaattcatag atatacttat ggctattata agttcttgag 6780 tcctttctgt atctgtttat gtggtacgta agtttctaaa ttttcattgt ctttattaac 6840 attcaaattt agttttctaa gcttgattct gaagctgttt gaactccttg atatcgtctt 6900 ctagttttta aagaggttta aggtaagaaa gtcgggtaca tcatgtttca ctgtaattaa 6960 tcttttattt tattcagttt caattctact ttcctactgc ctagtctact gccttcatct 7020 ttcttttgct tctgatctct ttgactctgc ttcattgcat ttgtttgttt tgcctgtttt 7080 gtctgctata ttgtattgtt tgctgcttct gaggtctagg taagtgcaat tttgaaaaaa 7140 aaaatttttg gtatctaccg tttaaaggtg ttgtttcatt attcccatcg atcccttgtt 7200 ccttttcccg cctttttgta tcatttttaa taaacaaatg aaccaataat atataatcaa 7260 tatgacgata aagccccaaa ataaacgaat aaaactaaat ttttaaccaa taataccgta 7320 ttttaagcct attttatttc ctaatataaa ccatccagag ttatatataa tcataataag 7380 ccaataagaa agtattgcga aaaggcaata agccagtaaa gaagtaatat tcaagagaaa 7440 tagacgtata aggaagtatt actaaaggca ataaaccaat aaaggagtag tatcaaagaa 7500 aaacgcaagc atgtaaggtc caatagaaag gtgttatgtt tgtgtatcat cgtagaaaga 7560 acaatgaagc ggcgcgtcac caaaagtaac acggtattcc acatacatat cttaacattt 7620 agagcaaaat acacgaagta tatattttat gaaatttgta aatatattga cgtaaactcg 7680 aaatatctga ttatattctg tcaaatatat tgttctatat atagcaatta tgtgcagtg 7739 // ID Gypsy-93_MLP-I repbase; DNA; FNG; 5716 BP. XX AC AECX01000333; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-93_MLP_; KW Gypsy-93_MLP-LTR; Gypsy-93_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5716 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000333; Positions 103726 98011. 
XX CC Positions [4520-4999] - Integrase core CC 'AAGAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 384..1514 FT /product="Gypsy-93_MLP-I_3p" FT /translation="MSNFDMDAVMNRMDEMAASLEEEKRLRMEAENRWAEL FT KLSVERNLNNNPSSIDIDKPVLPDTGNTTPHPAVIPTPILVQQAPAQHVKP FT PKIATPDKFEGTKGQKAEVFMNQISLYMQMNHAAFINEQAQVAFALSYLTG FT KASVWAQSLIDRLLDSTQMGQVTWSKFVDSFKAKFFDSERVAKAEREFCAL FT KQTKSVADYWIKFSELSLVIKWPENILVSHFEQGLKDEVALFMVKEEFTDV FT EEMSKFAIKLGNKLHKRTPDLLRYPAASTSNSPTATSVDPDAMDCSACRLN FT ITNDKYRRRGAVGACYRCGKTDHYIGDCTERNEKSRRGGRGNWRGRGYGGY FT GRFKHRVAELDGVKDEEKVEGRSEGSKNGDAREC" FT CDS join(1877..3544,3548..5620) FT /product="Gypsy-93_MLP-I_1p" FT /translation="MPWIKNNHHIIDWSNGKLLNESTFIAVAESTLLRPQQ FT AQMNRETETNGQARILNKGVELQCSLTPPQCEYNSLKTSDSKEKMSKQERL FT LENFIDSDRNETTDTPTATVEETVLPEPKNTTLDHKDMEPMRQARQIGKGV FT EILHGSSIKPPQSTCTSSFSKTHELAGKRFPFRLQSATRRTRPMSTMATKT FT MQDLRRSLQPTSAMIDAAKTSWNLSARIAADQTKNAPMKSAAELVPECYHE FT YLHMFEKSNSNILPPHRPYDFRVDPGAVPQAGKIIPLSPKETEVLNEMLEK FT GLTNGTIRRTTSPWAAPVLFTGKKDGNLRPCFDYRKLNAVTIKNKYPLPLT FT MELIDSLLNADEFTSLDMRNGYNNLQVREGDEAKLAFICKSGQFEPLVMPF FT GPTGAPGFFQYFIQDILKNHIGKDVAAYQDDILIYTGPGVDHKAVVKEVLD FT ILKKQNVWLKPEKCKFSQREISYLGLIISRNQIRMDESKVKAVKDWPTPKS FT VSETQTFLGFANFYRRFIDQFSKMARPLHELSQKGVEFHWNDERNRAFESL FT KSAFTTAPVRIADPYKPFILECDCSDYALGAVLSQVSDDDHELHPVAYLSR FT SLIQAERNYEIFDKELLAVVASFKEWRQYLEGNPNRLNVIVYTDHKNLQSL FT MTTKELTRRQARWAEILGSFDFEIRFRPGRQSTKPDALSRRPDLMPAEGTK FT LTFSQLLKPENLPDDAFVDKLELADMWFENEETEESHELMQDETEEQTTES FT RIIKDSELLKLIRSKSGEDAKVKELMNLCEDMPNSKLLKGYTLIDNVLYFK FT NKVVVPNDNSLKLLILRSRHDSKLAGHPGRMRTLALVKRAYHWPSMKAFVN FT NYVNGCGSCQRVKSRTEKPFGSLQPFPIPQGPWLDICYDLITDLPTSNGFD FT SILTVVDRLTKMAHFIACTKNMKSDELARLMVKEVWRLHGTPRTITSDRGN FT IFISRITRDFNRRLGITTQASMAFHPQTDGQSEITNKAVELFIRHFTSYKQ FT DNWYELLPFAEFSYNNNQHSAIGVSPFKANYGFDVNFTDVPASEQCLPLVE FT QRIEQIKDVQREIKDAMALTQEMMKSKYDEKIRPTPIWKKGKKVWLNGKHI FT STTRPTAKFAHRWLGPFSILSCVSKNAYKISLPKSMSKIHPVFHVNLLRRF FT 
EKSKIQGQDKIQPPPIILNDDEEYEVEEVLDKRKKGNNIEYLISWKGYGEE FT HDSWEPEEGVKNAQEMVINFNRKYPKAEDRYKRTRRK" XX SQ Sequence 5716 BP; 1982 A; 1131 C; 1257 G; 1346 T; 0 other; tattgctaag tctatacatc ttaggatcca agggaagaac tcaagtacag aagatcatca 60 aggaaattaa aaagaaagaa gaaaagaaga aagaagaaaa aattaaaatt aaaagaagat 120 tgaaagaaaa gaagaaattg aaaagagtat aaatcaaaga aagatagcga agagtaataa 180 gaatctagaa tcaaaagcaa ttcagtataa tttattgaat tcactctatc tccgacgtat 240 cgacgatctt gcgatacccc gcacgaaact ctaaacctta aaaccttatt ttaaacgcca 300 tcacgcctaa cttcacatta cccaacagtc cttcaagttc aggacgaaac gattctagta 360 aagaggaatc attccacgat acaatgtcaa attttgatat ggatgctgta atgaatagaa 420 tggatgaaat ggctgcgagt ttagaggagg agaaaagatt aagaatggag gctgaaaata 480 gatgggcaga acttaaattg agtgtcgaac ggaatctgaa caacaatccc agtagtattg 540 acatcgacaa accagttctg cctgatactg gcaatacaac acctcacccg gctgtaatcc 600 cgactccaat tctagtacaa caagccccgg ctcaacacgt gaaacctcca aagatcgcaa 660 ctccggataa attcgaagga acgaaaggcc agaaggcaga agtgttcatg aatcaaatca 720 gcctttacat gcagatgaat cacgctgcgt tcattaatga acaagcccag gtggctttcg 780 cattgtcgta tttaacgggg aaagctagtg tatgggctca atctctcatt gatcgtctgt 840 tagattcaac tcagatgggc caagttacat ggagtaaatt tgttgattcg tttaaagcta 900 aattcttcga ctcggagaga gttgctaaag ctgaaagaga gttctgtgca ttgaagcaaa 960 ctaagtcagt agcggattac tggatcaaat tttcggaatt atcgttagtt atcaaatggc 1020 ctgaaaacat attagtatct catttcgaac aaggcttaaa agatgaagtt gcattattca 1080 tggtaaagga agaattcacc gatgtagaag aaatgtccaa atttgcaatt aaactgggca 1140 ataaacttca caagcgtacc ccggacctac ttcgctatcc tgctgcttca acttccaatt 1200 ccccgactgc gacatctgtt gatcccgatg cgatggactg ttcagcctgc cgactgaaca 1260 taacaaatga caagtatagg cgtagaggag cagtgggagc gtgctacaga tgtggtaaaa 1320 cggatcacta tattggagat tgtactgaac gaaatgaaaa aagtagaaga ggaggaaggg 1380 ggaattggag aggaagaggt tatggagggt atggaagatt caaacataga gttgcagaat 1440 tagatggagt taaagatgaa gagaaagtag aaggaaggtc tgaggggtca aaaaatggcg 1500 atgctcggga gtgctagttg tgcctccccc gagcaaatcc aatttaattg tagactttag 1560 taacatcaat 
tcacttgaaa ttaaagacac gagaattata gaccttgtta ctattgttga 1620 tattaaaaat gccacaacga cccctgcaca agcactgatt gacagtggtg ctacacatga 1680 agccatcagt gaaagctttg tcaccaaaca ccaccttcac attgagcttt aatcacaagt 1740 tcgaaaagtg acaagtttca gtggtcatga atctgcgata actcacaccg gagattttca 1800 cgtcaactcc accaaagctc ctcccacgac gttcattgtt acccaactaa gagacaagta 1860 tgatttaatc ttaggaatgc cctggatcaa gaacaaccat cacatcattg attggagtaa 1920 tggcaaattg ctgaacgaaa gcaccttcat tgcagtcgcc gaatcgactt tgttaagacc 1980 gcaacaagcc caaatgaacc gagagacgga aacaaatggg caagctagga ttcttaacaa 2040 gggggtggag ctacaatgct cattaacacc cccgcaatgt gagtacaatt cgcttaagac 2100 ttcagattcc aaagaaaaaa tgagcaagca ggaacgcctt ctagaaaact ttatagacag 2160 cgataggaat gaaacaaccg atacaccaac tgccactgtt gaggaaacag tgttgcctga 2220 gccgaaaaac accacgttgg accataagga catggagccg atgaggcaag cgaggcaaat 2280 tggcaagggg gtagagatcc ttcatggaag ctcaattaaa cccccgcaga gtacgtgtac 2340 aagctccttt tcaaagactc atgaattagc tggcaagcgt tttccctttc gattacagtc 2400 agctaccagg agaactcgcc cgatgtcaac aatggcaact aagaccatgc aagatttacg 2460 ccgatctctc cagcccacaa gtgccatgat tgacgcagca aagacctcgt ggaacctatc 2520 agccagaatt gcagcagatc agacaaagaa tgcacctatg aagtcagcgg cggaactagt 2580 gccggaatgc tatcatgaat acctacacat gtttgaaaag agcaattcaa acattctccc 2640 gcctcaccgt ccttacgatt tcagagtaga tccaggagca gtacctcaag cgggaaagat 2700 cattccatta tcacctaaag aaactgaagt attaaatgag atgttagaga aaggactaac 2760 aaatggaacc attcgtagga caacatctcc gtgggcggcg ccggtactct tcacagggaa 2820 gaaagatggc aacctgaggc catgtttcga ctaccggaag ctgaatgcag tcaccatcaa 2880 gaataagtat ccattaccac taacaatgga acttattgat agtttactca atgccgacga 2940 gttcaccagt ctcgacatgc ggaacggtta caacaactta caagttagag agggtgatga 3000 agcaaaactt gccttcatat gcaagtcagg tcaatttgaa ccgctggtaa tgccttttgg 3060 accaacaggt gccccaggat tttttcagta ctttattcaa gatatattga aaaatcatat 3120 aggcaaagat gtagcagcat accaagatga tattctgatt tatactggac ctggagtaga 3180 tcataaggcg gtagtaaagg aggttttaga tatcctgaag aagcaaaatg tatggctgaa 3240 acctgaaaaa tgcaaatttt 
cacaaagaga aatatcgtat ttaggattaa ttatatcaag 3300 aaatcaaatt aggatggacg aaagcaaagt gaaagcagtt aaagattggc ccaccccaaa 3360 aagcgtatct gaaactcaaa ccttcctagg atttgctaac ttctatcgtc gatttataga 3420 ccaattttcg aaaatggcac gaccactgca tgaattatct cagaaaggag ttgagttcca 3480 ctggaatgac gaacggaata gagccttcga atctctgaag agtgcgttca caacggcacc 3540 agtgtgacgt atagctgatc catataaacc cttcatcttg gaatgtgatt gctcggacta 3600 tgcgctagga gcagtccttt cacaagtttc cgacgacgat catgaacttc acccagtagc 3660 atatctctct cgctcattga ttcaagcaga aagaaattac gagatatttg ataaggaact 3720 gctagctgtg gtagcctcgt ttaaagagtg gcgtcagtac ctcgaaggta acccaaatag 3780 gctcaacgta atagtttata ctgatcataa gaacctgcaa tccttgatga cgacaaaaga 3840 attgacgaga cgtcaggcca gatgggccga aatccttgga agcttcgatt tcgaaattcg 3900 atttcgacct ggaaggcaat caacgaagcc ggatgcactg tcgcggaggc cggatttgat 3960 gcctgctgaa ggaactaaac tcacattcag tcagctactc aaaccagaaa acctacccga 4020 tgacgccttt gtagataaac tggaattggc agacatgtgg tttgagaacg aggaaactga 4080 agaatcacat gaattgatgc aagacgaaac tgaagaacaa acgacagaat caagaattat 4140 caaagattca gaattactca aattaatacg aagcaaatcg ggagaggatg cgaaagttaa 4200 ggaacttatg aatctttgtg aagacatgcc gaattccaag ctcttgaaag gctatacgtt 4260 aatcgacaac gtcttgtatt tcaaaaataa ggttgtagta ccaaacgaca actcgctgaa 4320 attactgatc ttacgttcaa ggcatgatag taaacttgca gggcatccgg gccgaatgag 4380 gacactggcg ttggtcaaga gagcatacca ctggccatcg atgaaagcgt ttgtaaacaa 4440 ctatgtcaat ggatgtggct cgtgtcagag ggtaaagtcc cgaactgaga aaccgtttgg 4500 cagtttacaa cctttcccaa tccctcaagg accatggctt gacatatgtt acgatttgat 4560 aactgattta cccacctcaa atggctttga cagcatcctg acggtcgtag accgcctaac 4620 gaagatggcg cattttattg catgcaccaa gaatatgaaa tcagatgaac ttgcaagact 4680 gatggtcaaa gaagtatggc gcctacacgg aacaccaagg accatcacct ctgatagagg 4740 aaatatattc atttcacgca ttacgagaga tttcaacaga agacttggta tcacaactca 4800 ggcatcaatg gcgttccatc ctcaaactga tggacagtca gagatcacaa ataaagctgt 4860 cgaattgttt attaggcatt tcacgtcgta caaacaggac aattggtatg aactgctacc 4920 ctttgctgaa ttctcgtata acaacaatca 
acattcggca ataggagtat ctccattcaa 4980 agcaaactat ggctttgatg taaatttcac tgacgttcct gcgagtgagc agtgcttacc 5040 tttagtagaa caacgaatag aacaaatcaa ggatgttcaa agagaaatca aggatgcaat 5100 ggctttgacg caagaaatga tgaaatcaaa atacgacgaa aagatacgtc caactccgat 5160 atggaagaaa ggcaaaaagg tatggctcaa tggtaaacat atatcaacta ccagaccgac 5220 ggcaaaattt gcacataggt ggcttggacc cttttcaata ttgtcatgtg tgtcaaaaaa 5280 tgcttacaaa atcagcttac cgaaatcaat gagcaagata catccagttt ttcacgttaa 5340 tttactacga agatttgaga aaagcaagat tcaaggtcaa gataaaatac aaccaccacc 5400 tattatattg aatgatgatg aagaatatga agtagaagaa gtattagaca aaagaaagaa 5460 ggggaataat attgagtatt taataagttg gaagggatat ggggaagaac atgactcatg 5520 ggaaccggag gaaggagtga agaatgctca agaaatggta atcaatttta acaggaaata 5580 tcctaaggct gaagataggt ataaaaggac acggaggaag tagagggtga ggctttttcc 5640 caacgggttt tttaatgcca acccgtggat agatgctaac ctgcaagagg gggttgagac 5700 atggaagggg gagtgg 5716 // ID Gypsy-96_MLP-I repbase; DNA; FNG; 6154 BP. XX AC AECX01000490; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-96_MLP_; KW Gypsy-96_MLP-LTR; Gypsy-96_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6154 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000490; Positions 129723 135876. XX CC Positions [4736-5215] - Integrase core CC 'GAAAT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 2273..5638 FT /product="Gypsy-96_MLP-I_1p" FT /translation="MCIVDDTLTSPQGESATFIVPSFCESASKQVSIPKSQ FT IKQQDDNVTTETHSKCVQPTGLSYAAVVTASSIPNRSLVEPRVEPQGHIRK FT CDEGAAIILDTHQPPQCEFERVPSSISLEAAGQFLRPQNRHLDISATKASW FT STSARIAADEKSKVPVKTVEQMVPSCYHRHLHLFQKSRAQCLPPRRKYDFR FT VDLIPGAQPQAGRIIPLSPAEEAVLDEMVNTGLANGTIRRTTSPWAAPVLF FT TGKKDGNLRPCFDYRRLNALTVKNRYPLPLTMDLVDSLLDAEEFTKLDMRN FT AYGNLRVDEESEDILAFICKQGQFAPLTMPFGPTGAPGYFQFFIQDIMVGR FT IGKDTAAYLDDTMIYTKKGVHHEKAVDGILEIFDKHQLWLKPEKCEFSRSE FT VEYLGLIISKNKVRMDPAKVKAVRDWPAPKNTNELQRFIGFANFYRRFINQ FT FLKTTRPLHDLTKLNTHYEWNENCQRSFESLKTAFTSAPVLKIADPYRAFI FT LECDCSDFALGAILSQRSDDDGEIHPVAYLSRSLIQAKRNYEIFDKELLAI FT VASFKEWRHYLEGNPNRLEVIVYTDHRNLETFMSTKQLTRRQARWAETLGC FT FDFIIKFRPGRKSSKPDALSRRPDLKPPEDERLTFGQLIKPENIGPDTFPT FT ELASIDAFFMDESIDLEDAEHWFEVDVLGVSDADIDAIDDDILSDTTIIQR FT IREANAQDERITNLINSVTNPISSDMKKLSRSYKVKDGILYKRGKIEVPND FT DDIKFHIVKSRHDTLLAGHAGRNKTLSLTKQCFSWPSQKAYVNRYVDGCDS FT CLRTKSTTQKPFGTLQPLPVPAGPWTDISYDLITKLPVSNGYDSILTVVDR FT LTKMSHFIPCKETMKAEDLADLMIRNVWKLHGTPKSIVSDRGSIFISKITK FT ELDKRLGINLHPSTAFHPRTDGQSEIVNKAIETYLRHFVDYRQDNWESLLP FT TAEFAYNNRDHDSIGVSPFMANYGFNPIFNQVPSPEQCIPLVEERLKLLKE FT VQKELTVCLQLLQDTMKHQFDKHVRKNPRWNVGDEVWLDSKNITTTRPSPK FT LGHRWLGPFNITKAISDSTYALNLPISMKGIHNVFHVSLLRKHNPDTIHQR FT ARKESPAIEIEGEI" XX SQ Sequence 6154 BP; 1886 A; 1438 C; 1315 G; 1515 T; 0 other; tattgtcgca tcttctcaaa aacgtgaact gaggaaagat agaccaatcc gaaagaatca 60 aaattagaat cgattgactt tagaagattt agattagact tcaactacaa gattagaatt 120 tgacagaact tttaagaact tattagaatt gaactcacac ttgatcaaca acagaacatc 180 agatttaacg tttcagatcg actattcaga gtttagccac cacatcgcct tcatacagaa 240 cccccgcaga cggcgacgac gattcagacc ctgaaacatt cacaaacttc gtcgacgccg 300 ataccgctct cactgataca aatcttgcac tagcatctga atctactgcg atggaagaca 360 tacagcgtca attgaatgag ctccacacct cattgcaaga agaacgtcaa ctccgattat 420 aggctgaagc taggtctcat aacgccgaag ctcgcttatc cgctattgag tctagtcgtg 480 cgactcaaca aacctctcag actgtacctc cagtaacccc acaggctcca catgtatcat 
540 cggccgctaa ccctaaggga ccgaaggtct ccgtccctga taaattcaat ggggtgaggg 600 gcgctccagc cgaggttttt gcaagtcaaa ttcagctata catgttagcg catccctacc 660 tatttcagga cgatcgtagc aaagtggtct tcttgctttc ttatctgacc ggagccgcga 720 gtagctgggc tcagccattg actctcgagt tgtttgacaa tgctaccgct cacaacgtga 780 cttttgatcg tttcattact aattttcggg caatgtattt tgacactgaa aaaaagtcta 840 aggctgaaag ggcccttcga actctcaccc agaagtcgtc tgtggccgcc tacacccatg 900 aattcaatat tcatgcatca gcaacaggat gggaagttcc tacactcatt agtcactacg 960 aacagggtct gaaaaaagaa attagagtcg ctatggtgat ggtacaagat gagttcacta 1020 gcatagagca aattgcaaat ttagcgatca agatagacag caagatacat ggagtctctg 1080 atccatctct cgtcacattt catacaccgg accccaacgc tatggacatc tcctccgggt 1140 ttgttcgact gtctgatgaa gaaaagaacc gacatatgag agctggatta tgttttagat 1200 gcaatgaacg tgggcatagg gcaaatgaat gtcctggtag aaagtctgat agaggaagag 1260 ttggtggagg ctataaggct agagtcgcgg aattagagtt gaaggttgca gcgtttggaa 1320 acaaggagga tagcaagaat gatggggtta actcttctag tcgagcagaa tcgtcgaaaa 1380 atggcggtgc tcaggcctga gcgatgtgcc taacctgagc catgatgggg taatagctga 1440 aattgaactg ggcgctagta gaatcatcac atgcaatgca aatgatccac gtgtttttct 1500 caagtgttca ttatcaacgt cccaaaaacc ccgcgccaca tcaaccacat ctttttctac 1560 actttttctt gtcgactctg gtgccacgca cgatgtgcta agtgagacct ttgcgcatcg 1620 aacgggcctt attgaccgag cagtgcgcgc aacacggatt gtcaccggtt tcgacggctc 1680 acgtagccat gcatcttatg agacagattt atttattcac cacgactcct cccctaccca 1740 cttcattatt actcgtatca aggattcgta tgacggaata cttggtatcc cttggataag 1800 aagaaaccac caccttatcg actggataag tggcaaagtt cacttagatc acaccaccat 1860 tgcaactgca aatgcagttt cgtcaaatcc gcaaccaccc tcaccagccc aaggattgga 1920 gcccgcgagg gatgttagga acactgacga gggggccgct atcgtaattg atacgcatca 1980 gcccccgcaa tgtgagtacg atacgcctac gtctcaaatt tcttttgaaa cagttggcaa 2040 gctttgtctt tccccaaaat tacagactcc accgactcaa tcactcgaag taccgacacc 2100 tacaatcgac tgctccaaga acgttgatct tcatccaaat ctagaagttt ccaaggttga 2160 catatccctt gtaacacctg cggctgaaat tacagcctcg tctgatccgt caccaccctc 2220 gccaggtcat 
gaaattgagc ccgtgaggga agctaggcaa aatgacgagg ggatgtgtat 2280 tgttgatgat actttaacat ccccgcaggg tgagtccgcc acgtttattg tcccgtcctt 2340 ttgtgagagc gctagcaagc aagtatccat tccaaaatca cagattaaac aacaagacga 2400 taatgttacc accgagaccc actcaaagtg tgttcagcca accggattat cctacgcggc 2460 tgtagtgaca gcttcgtcca ttccaaacag atcccttgtc gagccaagag tggagccaca 2520 ggggcacatt aggaaatgtg acgagggggc cgctatcata cttgatacgc atcagccccc 2580 gcaatgtgag ttcgagagag tcccatcttc catttcacta gaagcagctg gccagttttt 2640 acgtccccaa aacagacatc tcgatatctc tgcgacgaaa gcttcctggt cgacttcagc 2700 acgtattgct gcggacgaaa agtcaaaagt acccgtcaag acggttgaac agatggtacc 2760 ctcttgctat caccgacatc ttcacttatt ccagaaatct agagcccaat gtcttccgcc 2820 ccgacgaaag tacgactttc gtgtggacct catacctggg gcgcaaccac aagccggtag 2880 aatcatacct ctatcacctg cagaagaagc ggttctagac gagatggtga acacaggact 2940 ggctaatgga actatcaggc gaacaacgtc cccgtgggcc gcaccggtgc tctttaccgg 3000 caagaaggac ggcaacttac gaccgtgttt tgactaccgg agattgaatg ctttgactgt 3060 aaaaaatagg taccctcttc ctctaaccat ggatttagtg gacagcttac ttgacgccga 3120 agaattcacc aaacttgaca tgcgaaatgc ttacggcaat ctgcgagtgg atgaagaatc 3180 tgaagacatc ctagcgttta tttgcaagca gggccagttc gcgcccttga ctatgccatt 3240 tggtcctact ggtgctcctg gatactttca attctttata caggacatca tggtaggacg 3300 aatcggcaaa gacacagctg cctacttaga cgacactatg atttatacta aaaaaggagt 3360 acaccacgaa aaagcagtag acggaatcct tgagattttc gacaaacacc agctttggct 3420 taaaccagaa aaatgcgaat tctctaggtc tgaggttgag tatcttggac ttattatatc 3480 taaaaataaa gtccgtatgg atccggctaa agtcaaggca gtaagagatt ggcctgctcc 3540 gaagaacacg aatgaacttc aacgatttat tgggttcgca aatttctacc gccgctttat 3600 aaaccaattc ttgaaaacga caagacctct tcatgatcta actaagctca acacacatta 3660 cgagtggaat gagaattgcc agagatcttt cgagagtttg aaaacggcat ttacatcggc 3720 acctgtactg aaaatagcag atccttaccg cgctttcatt ttggagtgtg actgctccga 3780 cttcgctcta ggagctatcc tttcacaaag gtcagatgat gacggtgaaa tccatccggt 3840 agcttacttg tcaaggtctt tgatccaggc caagagaaat tatgaaatct ttgacaagga 3900 actcctagca atcgttgcat 
ctttcaaaga atggcggcat tacctcgaag gtaatcccaa 3960 ccggcttgaa gtaattgtgt acactgacca ccgtaactta gagacgttta tgagcactaa 4020 acaactcaca cgccgacaag cacggtgggc ggaaacacta ggttgtttcg atttcatcat 4080 caaattccgg ccagggcgta aatccagcaa accagacgcg ttgtctcgac gccctgatct 4140 caagccacca gaggatgaac gactgacatt cggacaactg atcaaaccgg aaaacattgg 4200 ccccgatact ttcccaactg aattagctag cattgatgca ttctttatgg acgaatcaat 4260 tgacttggaa gacgccgaac attggttcga agtggatgtc ctaggtgtct cagatgctga 4320 tatcgacgct attgatgatg acattcttag cgacactact atcatacaac gtatacgaga 4380 agctaacgct caagatgaaa gaatcactaa tctgataaac tcagttacaa accccatctc 4440 atcggatatg aagaaactct ctagatctta caaagttaag gatggcattc tgtacaaacg 4500 tggaaaaatt gaggtgccaa atgacgatga tatcaaattt catattgtta aatctagaca 4560 tgacacttta ttagcaggtc atgcaggacg taataaaact ctgagtctta cgaaacaatg 4620 cttttcatgg ccctcgcaaa aagcctatgt taaccgctat gtggatggct gcgactcttg 4680 tcttcgcact aaatcaacca cacagaaacc ttttggaacc cttcaaccgc tacctgtacc 4740 ggctggacct tggacggaca tttcttatga cttaattacg aagctaccag tgtcaaatgg 4800 gtatgacagt atacttacgg tggtggatag actcaccaaa atgtcacatt ttattccatg 4860 caaggaaacg atgaaggcgg aggacttagc tgatttaatg attcgcaacg tgtggaagtt 4920 acacggcaca cctaaaagta ttgtgtcaga ccgcgggagt atattcatct ctaaaatcac 4980 taaggaactt gacaaacgac tgggcattaa cttacatcct tcaacagcgt ttcatccaag 5040 aacggatggc cagtctgaaa ttgtgaataa agcaattgaa acttacttac gacactttgt 5100 tgactacaga caggataact gggagtcttt attaccgaca gcagagttcg cttacaacaa 5160 tcgagatcac gactccatcg gggtatctcc gtttatggca aactacggtt ttaatcccat 5220 tttcaatcaa gtcccatcac cagagcagtg tatcccatta gttgaagaaa gattgaaatt 5280 acttaaagag gtacaaaaag agctaactgt ttgtctacaa ttattgcagg acaccatgaa 5340 acatcaattc gacaaacacg tacgaaaaaa tccacggtgg aacgtaggag atgaagtttg 5400 gctggactca aagaatataa cgacaacgag gccaagtccc aaattgggac acagatggct 5460 aggtcccttt aacataacta aagccatttc tgactctact tacgccctga atttacctat 5520 ttccatgaaa ggaatacaca atgtatttca cgtgtcttta ctaagaaaac acaatcctga 5580 cacaattcat caaagagcgc gtaaagaaag 
cccagcaatc gaaatcgaag gtgaaatata 5640 acgacaacga ggccaagtcc caaattggga cacagatggc taggtccctt taacataact 5700 aaagccattt ctgactctac ttacgccctg aatttaccta tttccatgaa aggaatacac 5760 aatgtatttc acgtgtcttt actaagaaaa cacaatcctg acacaattca tcaaagagcg 5820 cgtaaagaaa gcccggcaat cgaaatcgaa ggtgaagaag aatgggaggt gtcagcaata 5880 ttagactgta gaatgagaag aaataaaaaa gaatacttag taaattggac ggggtttaat 5940 tcaagtcacg actcatggga accggaagca aacctcaaga attgtaaaga tttattgaaa 6000 gaattcaaaa taagatttcc aaatgcagaa aggataaaga aggcacggag aagacggtga 6060 gagggctaag ctttttccca cgtggttttt taatgctgcc cggggaagca tgcagagctt 6120 gcaaaaggga gcttgggcat taaaaggggg ataa 6154 // ID Gypsy-1_BFB-LTR repbase; DNA; FNG; 270 BP. XX AC AAID01000422; XX DT 25-FEB-2011 (Rel. 16.02, Created) DT 25-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Botryotinia fuckeliana genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_BFB_; KW Gypsy-1_BFB-I; Gypsy-1_BFB-LTR. XX OS Botryotinia fuckeliana OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Helotiales; Sclerotiniaceae; Botryotinia. XX RN [1] RP 1-270 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Botryotinia fuckeliana genome."; RL Direct Submission to RU (25-FEB-2011). XX DR Genome; AAID01000422; Positions 670 939. XX SQ Sequence 270 BP; 62 A; 71 C; 47 G; 90 T; 0 other; tgagagtgaa aacacgtgac ttcagggtca cgtgctacca acgtttagta tttatagctt 60 ctcttctgta gacctaaaaa tatcatcgaa attttctacc ttctaaagtt gtctcctgaa 120 gaattgagtg cccagctcct accgtctccc tcacctcgct gtcgcgatgt agcgattctc 180 cccattttac tacgtgtagc cacttgctct atgcttggta tatgcgaagg cttttcttgt 240 gtgaccatta ccaccttcta cacttttaca 270 // ID Gypsy-19_MLP-LTR repbase; DNA; FNG; 392 BP. XX AC AECX01000924; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_MLP_; KW Gypsy-19_MLP-I; Gypsy-19_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-392 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000924; Positions 227971 228362. XX SQ Sequence 392 BP; 79 A; 113 C; 57 G; 143 T; 0 other; tgattccctg attcgaccgt tccctttgtt ctctttgtac gcgctttcgc gcttttctct 60 tttctctatt ttgctttcaa tgcttttcaa agtctctatt tttttattca tttcctttat 120 cgagacggac cttaccagat ccgtcccgat cacttacatc ttgtagacct ccactaattg 180 taaccggatc ctgcagtacc ggatccggcc ttttgcgtcc tccacctaac ttgtatcaca 240 gcgatcctgc tgttcttatt gtaccacaat atttccttat ctgtacttcg cccaaagatg 300 ctcgtctata taaagacctc gcatctccct agtaatgtaa acgctaagtt acttcccacc 360 tattttcgcc cccttgaaag tgagtggaat ca 392 // ID Gypsy-65_MLP-I repbase; DNA; FNG; 5477 BP. XX AC AECX01002583; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-65_MLP_; KW Gypsy-65_MLP-LTR; Gypsy-65_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5477 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002583; Positions 21388 15912. XX CC Positions [2782-3285] - Reverse transcriptase CC Positions [4645-4878] - Integrase core CC 'TAAA' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 283..1386 FT /product="Gypsy-65_MLP-I_1p" FT /translation="MPALTKSQRAALEAKVSKAKASDGFYTEEELDSMMQE FT QRDIIIKLSNASSVSSPLPASGNEDQKMEDGSVNPKPTGGSVTLSGEEYQS FT WVTHQAREEVKKSIVERKLLIGTCSDERLDEVTNSLEYLSDKMATVSLADD FT GNRLESSLAKAVKLMDNAGITPFDGSSKMDANIWLQRFKKSMMMRWIHPAI FT FHRLAMQYVSGIALTKVEALLGTDLCPANYSAFCDFLLREFPPTVTKSTIK FT EKLNDFKQLPGESASEYYDRLLVLAEEAKEVGYAIEMREEFVSGLNPALKS FT FVEAQLITAAKPNKVTFDTVALWSISKDNNYRRDKAREKVTQVAESSASGS FT GRNKKRKANAPVRVAMADKSCYNCG" FT CDS 1810..4641 FT /product="Gypsy-65_MLP-I_2p" FT /translation="MFVNSVPTGEVVANASEVPCEGQGELTVGSMKDAPPA FT SGPGLKLEGELNACNVEDLLALENNSAGHRVDFVLEAEGKAARFLLDSGGM FT RKYISLSFVKKWGLKTIPAQGSNFVSGAFGKAVECNLLCLVRFRLNSFDYE FT VPCRVVPLTQYDVVLGLEWINEFVVRTDWDKSLWVLKNGLGRTCDFYPDRV FT CAPRQREHLLQVAESSPEEATRSGFRRFCRQKDIEIVLCHPMEYVEAIKEV FT EGPREEVAAEMPSIDPKEEKLRDRIKALVDRFKDQFSEVAAVPQVERVINH FT LIDTEGKGPVSQPVRRMSPLLLDELKVKLKELQDKGMIRPSTSGWSSPVLF FT ARNASGKLRFCVDYRAVNELTKRDRHPLPLIQDCFDQLGGSMVYTKFDLQQ FT GFHQMKIAEGDIEKTAFGTRYGHYEWLVMPFGLVNAPSTFQRMMTDILREY FT IDDFVQVYLDDILIYSKNFEDHLIHVEKVLEALKTAQLKISGKKSVLFAEE FT MQFVGHVISKEGVRVMPEKVEAIRTWPRPNNVYDVRAFLGLAGYYRRFIKG FT FAKLAAPLHGLTAGAVKKKQPVEWLPFHEAAFESLKAALQCAPVLGTPDVS FT KPYIMETDSSDYATGAVLLQVGDDGLEHPVAFESSKLNSAQQNYPAQEKEL FT LGIMNAWRKWRVYLEGAVADTIVCTDHASLVYLKSQVLPSRRLHCWIDEFA FT EMTIRVEYKKGSTNIVPDALSRRSDLMVVQELGHALRDPSDWPLLIPYILG FT QRVLPDWVDATSKALALSSIDKFEYSKGEDELLYVHNERSPFVPFDHRGDV FT LDSVHEVSGHRGRDGTLALLKGRGWWPKRYADVEEYCKFCPQCQIFDNPKK FT GLENGLQHPLPSADPFERWGADFIQLPESKEGFKWVLSIIDHCTGWPIAVP FT LKEATSENIVDVCINEVFHQFGVPSEILTDRGLNFLSKDI" XX SQ Sequence 5477 BP; 1415 A; 1062 C; 1537 G; 1463 T; 0 other; actggtagcg agagttttta gaatccctct ttttgttctg ttctgtgtgg ataagatcat 60 taaaatcctt ttcttcgatt tcttctctat aatctttcgt ttgatttttt ttagaaactt 120 tcaaaacttt ccttagaacc tttcgtcgat aaatttttta atcttcctgc aaacgtcgga 180 aaaggagtga gttttggatc tgcaaacctc agtggttgaa ggagagactt actgactagt 240 cgtgttttgg ttgtgaaaca ttgtgtgtat aacgcagtta tcatgccagc attaaccaaa 300 
tctcagcgtg ccgctttaga agccaaggtg tcgaaggcaa aggccagtga tggtttttac 360 accgaagaag agcttgactc gatgatgcaa gaacaacgcg acatcattat caagctttcc 420 aacgcctctt ctgtcagttc cccgttgccg gcctcgggta atgaagatca aaagatggag 480 gatggatctg tcaaccccaa gccgacaggt ggttcggtca ctctctccgg tgaggagtac 540 caatcttggg taactcatca agcccgggaa gaggtgaaga agtccatagt agaacgtaag 600 ttactaattg gcacatgttc tgatgagagg ttggatgagg ttactaactc actggaatac 660 ctttcagata agatggccac cgtctctttg gcggatgatg gcaatcgttt agagtcgagc 720 cttgccaagg ctgtgaaact tatggataac gcgggtatca cgccatttga tggttcgtcc 780 aagatggatg cgaacatctg gctgcagcgt tttaagaagt ctatgatgat gcgctggatt 840 cacccggcaa tctttcaccg acttgcgatg cagtatgtgt cggggattgc tttgactaag 900 gttgaggctt tgttggggac cgatctttgt ccagccaact attcagcttt ttgtgacttc 960 ttacttcgcg agttcccccc gaccgttacg aagtccacga ttaaggagaa gctgaatgat 1020 ttcaagcaac ttcctggcga gtccgcctcg gagtattatg atcgactttt ggtgttggca 1080 gaagaagcta aggaggtcgg ttatgctatt gagatgcggg aggagtttgt ttctggattg 1140 aacccggctc ttaaatcttt cgttgaagct caattgatta ctgctgcgaa gccgaacaaa 1200 gtcacgttcg acactgtcgc cttgtggtcc atttccaagg acaacaatta tcgacgggac 1260 aaagctaggg agaaggttac tcaagttgct gagtcgtctg cttcaggctc tggtcgcaac 1320 aagaagagga aggctaacgc tccggtgcgg gtggcaatgg cggataaaag ttgttacaat 1380 tgtggctaaa ctggtcatat ctttggtaat atggctaacc ccaagtgtcc tgaacctgct 1440 actgagaaga cttaagttgt tctttgagaa gaagaaggcg gaaaaagcct aggcgatttt 1500 gatgtgactc tgtcggtttc gcctgaactt ggtgctatta gtaatgaaaa tctgatttct 1560 gctagtgaca cttttgagtc tgctcctttg gagcaccccg tgccttgtga gttgcctgag 1620 gaatgtgagt tgcctttgcc ttgtgagtca tgtgagttat gtgagtcttt gttgccatgt 1680 gagtgttcgg tacggaggct ttcgaaagtt tcgtcaccgg tttgtgacga aaagaaggaa 1740 gaatcaaata agcctgtttt ggttccgaag gaagatgtta agggactgag tttggttcca 1800 acacaaaaaa tgtttgtgaa ctcagtcccg actggcgagg ttgttgctaa cgcgtcagag 1860 gtcccctgtg aggggcaagg tgagctaact gtgggttcca tgaaggatgc tccaccggcc 1920 agtgggcctg gattgaagct tgagggagaa ctcaacgcat gcaacgttga ggacttgctt 1980 gccttggaaa acaactcggc 
tggacaccga gtagattttg tcttggaggc tgaaggaaaa 2040 gctgcgagat tcttgctgga ttcgggaggg atgagaaagt atatatcctt gtcgtttgtc 2100 aagaaatggg gattgaagac aatccctgcc caaggctcta acttcgtgag tggagccttt 2160 ggaaaagcag tagagtgtaa tcttctctgt ctagtaaggt ttagactgaa ctcgtttgat 2220 tacgaagtgc cctgtcgagt ggtgccattg actcagtatg acgtggtact tggtcttgaa 2280 tggattaatg agtttgtcgt taggaccgac tgggataaga gtttatgggt gttgaagaac 2340 gggttaggta gaacctgtga cttttaccct gacagagtct gtgctccccg ccagagggag 2400 catttactac aggtggcaga aagctccccg gaagaggcca cacggtcagg attcaggagg 2460 ttctgtaggc agaaggacat cgagattgtc ctatgtcatc cgatggagta tgtcgaagcc 2520 atcaaggagg tcgagggacc gagggaagag gtggcggcag aaatgccaag tattgatccc 2580 aaagaggaaa agctccggga tcggataaaa gctttggttg atcgtttcaa agatcaattt 2640 agtgaggttg ccgcggttcc ccaggtagaa agggtgatta atcacctgat tgacacggag 2700 ggaaaagggc cagtctccca gccggtcaga cgaatgtcgc cgttgctatt agatgagttg 2760 aaagtgaaac tcaaagagtt gcaagacaaa ggtatgatac gaccatccac ttctggatgg 2820 tcgtctccgg tgctctttgc aaggaatgcc agcggaaaac ttcgtttctg cgtggactat 2880 cgcgctgtca atgagttgac taagcgggac cggcatcccc tcccgttgat tcaagactgt 2940 ttcgatcaac tgggaggatc aatggtttat actaagttcg accttcagca aggttttcat 3000 cagatgaaaa ttgctgaagg agatatagaa aagactgcat ttgggacgag gtatggccac 3060 tatgagtggc tggtgatgcc gtttgggtta gtgaacgcac ccagtacgtt tcaaagaatg 3120 atgacagata ttctgcgtga atatatagat gattttgtac aagtgtattt agatgatatt 3180 ttaatatact ccaagaattt tgaggatcat ttgatccatg tcgaaaaagt actggaggct 3240 ttgaaaacgg cccagctgaa aattagtgga aagaagtcag tacttttcgc tgaggaaatg 3300 cagtttgtgg ggcatgtgat ctcaaaagaa ggcgtacgtg taatgcctga aaaagtggag 3360 gccatcagga cttggcctcg tccgaacaac gtgtatgatg tccgggcttt tcttgggcta 3420 gcggggtatt accgacggtt catcaagggg tttgctaagt tggcagcccc gttgcacggc 3480 ctgacggctg gggcagtgaa gaagaagcag cctgtggagt ggctgccgtt tcatgaagct 3540 gctttcgaga gtttgaaagc agcgctgcag tgtgctcctg tgctgggtac tccagatgtg 3600 agtaaaccgt atatcatgga aacagattcc agtgattatg ctacaggagc agttttgttg 3660 caagttggag atgatgggct ggagcatcca 
gtagcgtttg agtcttcaaa gctgaattcg 3720 gctcaacaaa actacccagc gcaagaaaag gaattgctgg ggatcatgaa tgcatggaga 3780 aaatggagag tatacttgga aggcgcggtg gcggatacga tcgtttgtac tgatcatgcc 3840 tctttagtat acttgaagag ccaggtgctg ccatctcggc ggttacattg ctggatagat 3900 gagtttgctg agatgacaat cagagtcgag tataagaaag gtagtacaaa cattgtacct 3960 gatgcgttga gtcgccgtag cgatttgatg gtggtacagg agctgggtca cgctttgagg 4020 gacccaagtg attggccgtt gctgatccct tatattttgg ggcaacgagt actacctgac 4080 tgggtagatg ccacaagcaa ggcgcttgct ctgtcatcta ttgataaatt tgagtattca 4140 aaaggggaag acgagttgct gtatgtgcat aatgaacgtt ccccgtttgt tcctttcgat 4200 catcgggggg atgttttgga tagcgtacat gaagtgtctg gtcatagggg acgggacggt 4260 acgttggctt tgcttaaggg ccgaggttgg tggccgaagc ggtatgcgga tgttgaagag 4320 tattgcaaat tttgtccaca atgtcagatc tttgacaatc caaagaaagg gttggaaaac 4380 ggtctgcaac accctcttcc aagtgctgac ccttttgaac ggtggggcgc ggatttcatc 4440 cagctaccgg aatcaaagga aggatttaag tgggtactat cgattattga ccactgtaca 4500 gggtggccaa tagcagtacc acttaaggaa gctacgtcgg agaacatcgt cgacgtatgc 4560 atcaatgagg tgtttcacca atttggtgtc ccctctgaaa tcctgacgga cagaggtctg 4620 aattttttgt caaaagatat ttgacgtttt tatcagagga cacatatccg gaagttgaat 4680 acttctgggt atcatctgcg gaccaatgga aaaactgagc gtttcaatgg tttgcttgag 4740 aaagcgttgt tcaagctaaa tggcaccggt gatgaaacac ggtggccgga gttcttgaga 4800 caggcagtgt tctctgttag gataaatcct aacacggtga ccggttggtc gccgtttgaa 4860 cttttgtatg gtgtgaagcc ccgtctcata ggggatcatg cgaagctacg accaccagta 4920 ctggcggatg aaggtgtaag tccggatgct gctacagcag ctaggcgcaa gcgtctttcg 4980 gaacttcaac tgaccagggc agtggctcgc gagaagcagc aagccagagc aatcgaaaat 5040 aagaaaaaat tcgatgctgg tgtaattcac agtaacagcc tgcgatctta ttccgtagga 5100 gaatgtgtta agttacggaa tgaaaccgca acgaaagggc agcccaaatg gcatgggcca 5160 ttcgaaattt ttgacagctt aggtcaaaat gtgtatcggt tggtggatcc aatgggttcg 5220 ttgtttcctc atttggtgaa tggtaatcgg ctgcagccgg caaaggtgcg ggatgcatcc 5280 ttgatctcgc cgtgggcttt gccgaaacaa ttgcaggaaa acaagaaaaa agtacaagag 5340 aaagttccaa tggaactagc tacagaggcg gtaaaaatga 
cgcgtgccca gaagaaggcg 5400 gcgcggtcca gaatccgaat cattggtagg tttggaaatt cgccgagtgt gacggctcaa 5460 ccctaggggg ggatggt 5477 // ID Mariner-6_AN repbase; DNA; FNG; 1863 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE DNA transposon. Mariner superfamily. Pogo clade - a consensus DE sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW mariner superfamily; Mariner-6_AN; Pogo clade; transposase. XX NM Mariner-6_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-1863 RA Kapitonov V.V. and Jurka J.; RT "Mariner-6_AN, a family of DNA transposons in the Aspergillus RT nidulans genome."; RL Repbase Reports 3(12), 214-214 (2003). XX DR [1] (Consensus) XX CC DNA transposon. Mariner superfamily. Pogo clade. CC TA target site duplications. 44-bp TIRs. CC The 456-aa transposase is encoded by 3 exons (pos. 95-920, CC 1080-1550, 1637-1710). 
XX FH Key Location/Qualifiers FT CDS join(95..920,1080..1550,1637..1710) FT /product="Mariner-6_ANp" FT /translation="MPRVRVSSSQNCHEKEGRLLLAVQAIKKKEITSIREA FT ARRFNVPESTLRTRLRGTTNRAESRANGHKLTEIEEEVLKQWILSLDLRGA FT APTKAHVREMANILLAKRGSTPIQTVGQKWVYNYTQRHPELESRLSRQYDC FT QRAKQENPKVIQAWFNTVRATIEQYGILPDDIYNFDETGFAMGLCAHQKVI FT TKSESCGRRPVLQPGNREWVTAIESISASGWALPPTLIFKGKQYNQAWFTG FT LPPDWRFEISTNGWTTNEISLRWLQKQFIPSTEHLLKRSYASLVDQKMRLG FT ISHIDKLDFLAAYPQARISTFKLDTIRNSFRAAGLVPLNPEPVLSKLSIQA FT RTPTPPGSRGSQASTFCPHTPANVDELLKQASLLRDFLKQRSKSPPSPSHN FT ALNQLIKGCQIAMQKGILLEQENRALRAENAIQRRKRARNTWSMRATSRRC FT TNTKGTGITYM*" XX SQ Sequence 1863 BP; 553 A; 454 C; 415 G; 441 T; 0 other; acgtggttgg taagcgagct gcgcatgtaa gcgagctgcg cacccaacca cttttttaca 60 tgctgacgcg gtcatctatc ttccaacagc caccatgccc cgagttcgcg ttagttcaag 120 ccaaaattgc catgagaagg aaggtcggct cctactggct gtacaggcta ttaaaaaaaa 180 ggagattaca tcaatacgcg aggcagcacg tcgcttcaat gtgcctgaat ctacactacg 240 tacgcgacta cgcgggacta caaatcgcgc cgaatctcgc gcaaatggcc ataaattgac 300 tgagattgaa gaggaagtgc ttaagcagtg gattctctct ttagatctac gcggagcagc 360 tcctacaaaa gctcatgtac gagaaatggc taatattctg cttgcaaagc gtggttccac 420 cccaatccag actgtcggcc agaaatgggt atataattat actcaacgcc acccggagct 480 tgagtctcgc ttgtcaaggc aatacgactg ccagcgagca aagcaagaga acccaaaggt 540 tattcaagca tggtttaaca ccgtacgagc cacaatcgaa caatacggga tcctaccgga 600 cgatatctac aactttgatg agactggctt tgcaatgggc ctttgtgcac atcagaaagt 660 gattaccaag tcagaatcat gtggccgaag accagttcta cagccaggaa accgtgaatg 720 ggttactgca attgagtcaa tcagtgcttc tggatgggca cttccaccaa cacttatctt 780 taagggcaag cagtataacc aagcatggtt tacaggcctt ccgcccgact ggcgatttga 840 aattagtaca aatggatgga caactaatga aattagcctt cgctggcttc agaagcaatt 900 tatcccgtca acagagcatc gtacgcgcgg aagatatcaa cttctagttc ttgatggcca 960 tggaagccat cttacaccag agtttgatca aatctgtaca gatcataata ttataccact 1020 ctgcatgccg gcacattcct cccatcttct acaaccactt gatattggat gttttgcagt 1080 tttgaagcgc tcgtacgcca gcttggttga tcagaaaatg cggcttggca 
tcagccatat 1140 tgacaaactt gatttccttg cagcctatcc acaagctcga atcagcacat ttaagctgga 1200 tacaatcaga aacagttttc gagcagcagg actagtgcca ttgaatcctg aaccagtgct 1260 ttcaaagctt agtattcagg ctcgtacgcc tacaccccct ggaagccgtg gcagccaggc 1320 aagcactttt tgcccacata caccagcaaa tgttgatgag cttctaaagc aagcttcttt 1380 actcagagat tttctcaaac agcgctcaaa aagtccacca tcaccgtccc ataatgccct 1440 aaaccagcta attaaaggct gtcaaattgc aatgcaaaag ggcatactat tggagcaaga 1500 gaatagggcg ctacgtgctg aaaatgctat acaaaggcga aagcgagctc gtacgcatag 1560 atggatagct catgataatg gtctgtctgt acaagaggct acagagctcg aggaagctca 1620 taatgcgtct tttcaggcaa tacctggtcc atgcgggcca ccagcagaag gtgcacaaac 1680 accaaaggca cgggcattac ctacatgtag tacctgccat agaattgggc atagaagaaa 1740 tgcttgtcca aataaataat aattaatata aaggcgttgt ggttgattaa aaggtcaaaa 1800 tatgggaaat ctgtatgcag gtgcgcagct cgcttacatg cgcagctcgc ttaccaacca 1860 cgt 1863 // ID Copia-16_MLP-LTR repbase; DNA; FNG; 145 BP. XX AC AECX01001151; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-16_MLP_; KW Copia-16_MLP-I; Copia-16_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-145 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001151; Positions 77207 77063. XX SQ Sequence 145 BP; 40 A; 9 C; 42 G; 54 T; 0 other; tgttacgata ttatcttgtg aagtgttcaa agataattaa tccatcttgt ggtttgtgta 60 gaggtgtagg ttgaggttgt gttagcagga atagtggttg cttaattagg aaggagatgt 120 ggttagtaga ttgagaattt aatca 145 // ID Gypsy-3_GDe-LTR repbase; DNA; FNG; 215 BP. XX AC AEFC01000450; XX DT 26-MAR-2011 (Rel. 16.03, Created) DT 26-MAR-2011 (Rel. 
16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Geomyces destructans genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_GDe_; KW Gypsy-3_GDe-I; Gypsy-3_GDe-LTR. XX OS Geomyces destructans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Leotiomycetes incertae sedis; Myxotrichaceae; OC mitosporic Myxotrichaceae; Geomyces. XX RN [1] RP 1-215 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Geomyces destructans genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AEFC01000450; Positions 214 428. XX SQ Sequence 215 BP; 54 A; 62 C; 51 G; 48 T; 0 other; tgtcacgaac cgaatcacgt gcgggcaata ctcacgtgcc cttcagtgcc acgacggagg 60 catgtcggtc cagacaggac acgtagaact atatagtagg aacgcggacc ttcggaggaa 120 ggtcagcgtt cagtcttcgt atcaatagac caggattcta cgtatattcc ttcccgtgac 180 ttcccactgc cctactcgcc aagaagtcct ttaca 215 // ID AFLAV_I repbase; DNA; FNG; 6839 BP. XX AC AY485786; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 04-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Aspergillus flavus LTR-retrotransposon AFLAV (internal portion). XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; AFLAV_LTR; internal portion; AFLAV_I. XX OS Aspergillus flavus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-6839 RA Okubara P.A., Tibbot B.K., Tarun A.S., McAlpin C.E. and Hua S.S.; RT "Partial retrotransposon-like DNA sequence in the genomic clone RT of Aspergillus flavus, pAF28."; RL Mycol Res 107(Pt 7), 841-846 (2003). XX RN [2] RP 1-6839 RA Hua S.-S.T., Tarun A.S., Pandey S.N. and Chang P.-K.; RT "AFLAV, a new Tf1/Sushi retrotransposon from Aspergillus RT flavus."; RL Direct Submission to EMBL/GenBank/DDBJ (21-NOV-2003).
XX DR EMBL/GenBank/DDBJ; AY485786; Positions 472 7310. XX CC LTRs differ by 1bp substitution. XX FH Key Location/Qualifiers FT CDS 250..969 FT /product="AFLAV_I_1p" FT /translation="MSSQSSSSKKTPVKSTPPAETDSESETTVKEQLKQMK FT SMITQLVNNAKEKNQEIENLKVQLGEAERIRNEQQDHIAQLDAQVGASAPK FT DAIGKVKLPKAEPFDGTRSKLQAFLTQMNMHIHANRKNLIDEADKVIFIST FT HLRGAAWNWFEPYIREYYEVVPDNWSNTTRELFTDSGNLRKHLERTFGDVD FT AEAVAERKLKHLYQRGSASTYAAEFQQIISRMDWNEKVYVSTFISGLKDM" FT CDS 1065..6242 FT /product="AFLAV_I_2p" FT /translation="MEKRDNEAWRKGSHRPKGQYKSNDQRERTGVKHNDPY FT GPKPMELDATEGQGQSKGISQKERERRRREKLCYNCGRAGHMSKDCRQKRN FT SQPANRKPQQMNATEDEAEPPKKVRFAQLNATAGNSEPHNKARGASVPPGS FT IVRLWMRSAGNRSHTAQCQHDLSAEEIERHRENEPNDDDWITLDWLTIHTN FT QTIWPRINQDWSNLQDATEQWYGQLNEHEIDELADNANNETNRLGRVNSES FT QYEAIMEPIRERVRYALTHRVDGPDNTDQYNEPVSSIDQSNRALVEIDPLN FT EEYGTQWIGHDPNMEGLLEVPETPENPQDEDENAHRRVMALIDTLQEVVSP FT RRRTPVSRPYQMQQVEVEPPRRQETLTNWTNNVIDEIVRNPRRFSRPLRML FT LEQCPHWNHECWDSNIENWDEHCQQCDKHPIVCEICGADRFEYYGELELIN FT PDRAKRGHEGTHHWLRECECCHYATEPMHNRYPWVVCFDDSCTHHRIWKQI FT ARFWPQNDANRRTLAATRQGRHITTIIVVNGKPARAMIDSGATNNFMSPRY FT RENMKIEGRQKENAEPLLGLDGQKLGTGQVSVETVPVTMAVGQHVESIAFD FT ITPLGNKYDVVLGISWLEDHNPTIDWKQRTLHLNNCHCPKGPCMGYGTRTL FT TSKCTGSGRIERRDQDTAKGNSAKNMIMAATRYSEKEWLAELMGWAPANEQ FT ERLEVMTLGSESEEEWHSSPETNQTSPKSNSDSWTLLDSQELAANSAEQPS FT LPKEYQGFRELFEQPRTNKLPEHGPHDHTIPIQEGKEVTCKRIYPMSEKES FT QALKEYIKDRLERKQFDHRKSPAGHGVLFVPKKGGELRLCIDYRPLNDITV FT KDRHPLPLITEIQDKIRGAKWFTKLDITDAYHRRRIAEGEEWKTAFRTKYG FT HYEYLVMPFGLTNAPASFQRFINEALGEILDVFVIAYLDDILIFSHNLEEH FT VQHVQTVLEKLRKAEVRLKLKKCEFHVQETEFLGHWISTEGIQAEEGKVKA FT IREWPEPTNLKELQQFTGLLNYYRKFIDRYAHKLAPLFDLLKKSKQWEWTN FT EHQSAFDKAKEAITTAPILAQHDPAKQTIIETDASDYAIGARMVQAEPDGK FT PRPIAFESRKLVQAELNYDIHDKELLAIVSAFKKWRVYLEGAQHQIIVKSD FT HKNLTYFTTTKELTRRQQDGLKRYHSMTLGSNTGKGSENGQADALSRRPDH FT EIKGKTIETAILKQHEDGSIGYNKQTLAAVTVEIKDPTRHLIAKANKRDEA FT LTQKLEASDDLFTKDEDGIVYYRNLIWVPQKLRNMIIQEHHDNPTRGHFGV FT EKTSEQIARNYYFPKYGQAVRKYIDKCETCIRDKPARHKPYGLMQSPDAPS FT 
KPWEWITIDFVGPLPESEGWDMITVITDRLTKYIHLVPSKSTLDAVHLAHL FT LVNHVFVHHGMPEKITSDRDKLFTSKFWQSLTDLMGIDQKLTTAYHPQGNG FT QTERTNQTIEQYLRHYVNYQQDDWANLLPTAQFAYNNAEHSTIGTTPFYAN FT HGYHAKVAGEPRNKQPVAEEAIETVEGLKSLHNQLSLDIKFFNHRAAMYYN FT RHHEKGPTFKKGEKVFLLRRNIKTKRPSSKLDHQKIGPFKIEEKIGNVNYR FT LKLPDSMKRIHPVFHISLLEPAPENAKIAENIELDKEGTEYEVEKILKDKR FT VNGKPHYLVKWKGYSTSENSWEPIENLRGCHQLVRQYHQQKGQNSPKRRGR FT SSSESD" XX SQ Sequence 6839 BP; 2329 A; 1649 C; 1594 G; 1267 T; 0 other; actgcatccc tgtgacgacg gcaatctagt atcgacggca gactagtgtc gacggaagtc 60 tcgtaacgac tcccgtcttc cctagaccgc aaccaacaac gacggaggtc aaagaacgac 120 ttcaggttag taacccttga aacgtggtcc gctaactggt ctcagacttg acgacgaagc 180 cccgggcgta acacattgat caccagtcta tacgagttga accacgtttg gacttaaaac 240 atcgacaaca tgtcttctca gtcatccagc agcaagaaga ctccggttaa gagcactcca 300 ccggcagaaa ctgattctga aagcgaaaca accgtgaaag aacagctgaa gcagatgaag 360 agcatgataa ctcagcttgt caacaacgcc aaagaaaaga atcaggaaat cgaaaatctc 420 aaagtacagc tcggagaagc tgaacggatc cgcaacgagc agcaggatca cattgctcaa 480 cttgatgctc aggttggggc atcagcacct aaggacgcaa tcggaaaagt gaaactgcca 540 aaagccgagc cttttgacgg aactcgctca aaattgcaag catttctgac gcagatgaat 600 atgcacattc acgcgaatag gaaaaacctc atcgacgaag ctgacaaggt catcttcata 660 tccactcact tacgtggagc agcatggaac tggttcgaac cgtatatccg agaatattac 720 gaagtcgtgc cggacaattg gtcgaacacc acgcgagaat tgttcaccga ctcaggaaat 780 ctgcgcaaac atctcgaacg gactttcgga gatgtcgatg ccgaagcggt agcagaacgc 840 aagctaaagc acctatatca acgaggtagt gcatcaacct atgcagcaga atttcagcag 900 atcatatcca gaatggactg gaacgaaaag gtctatgtgt caaccttcat cagtggtctc 960 aaggacatgt gaaggatgag tttgcacgaa ttgataggcc agcaacactt aacgaggcaa 1020 tcgactttgc cgttaaggtg gataatcgct accacgaacg actcatggaa aaacgggaca 1080 acgaagcctg gagaaagggc agtcaccggc cgaagggaca gtacaaatcg aacgatcagc 1140 gagaacgcac aggtgtcaag cacaacgacc cttacggacc gaaacccatg gaattggacg 1200 ccactgaggg acaaggccag tcgaaaggca tctcccaaaa ggaacgagag cgtcggagac 1260 gcgaaaaact ttgttataac tgcggtaggg caggacacat gtcgaaggac tgtcgacaga 
1320 aaagaaatag tcaaccagca aatcggaagc ctcagcagat gaatgctaca gaggacgaag 1380 cggaaccgcc aaagaaggta aggttcgcgc agctcaacgc cactgcggga aacagtgagc 1440 cgcacaacaa ggccagagga gcatctgtac caccgggatc aatagtccga ctatggatga 1500 gatcagcggg aaatcgaagc catacagctc aatgccaaca tgatttgtcg gcagaggaaa 1560 tagagcgaca cagagaaaac gagccaaatg acgacgactg gatcacacta gattggctta 1620 ctattcacac caatcagacc atatggccac gcatcaatca ggattggagt aacctacaag 1680 atgccacgga gcaatggtac ggtcaactta acgaacacga aattgacgaa ttagccgaca 1740 atgcaaacaa tgaaaccaac agactcggac gagtcaacag cgaatcacag tatgaggcca 1800 tcatggagcc aatcagagaa cgcgtacggt acgcgctaac ccatagggtg gacggccctg 1860 ataataccga tcaatacaac gagcctgttt caagcatcga ccaatcaaat cgagctttgg 1920 ttgaaatcga tcctttgaac gaagagtacg gtactcaatg gattggccat gatcccaaca 1980 tggagggact ccttgaggta cccgagacgc cagaaaatcc ccaagatgaa gacgaaaatg 2040 cacaccgacg agttatggca ctcatcgaca cactgcagga agtggtgtca ccaaggcgaa 2100 ggacgccagt atcgaggcca tatcaaatgc aacaggtcga ggttgagcca ccgagaaggc 2160 aagagaccct taccaattgg accaacaacg taatcgacga aatcgtacgg aatcctagaa 2220 gattctcacg accgcttagg atgttattag aacaatgccc gcattggaac cacgaatgct 2280 gggattcgaa catcgaaaac tgggatgagc attgtcagca gtgtgacaaa cacccaattg 2340 tatgcgaaat atgtggcgca gacagattcg agtattacgg cgaactcgag cttatcaacc 2400 cagacagagc gaaaagggga cacgaaggaa cgcatcattg gttaagagaa tgcgaatgtt 2460 gccactacgc aacggaacca atgcacaatc gttacccttg ggtagtgtgt tttgacgaca 2520 gctgcacgca tcaccgtatc tggaaacaga tagcacgttt ttggccacag aacgatgcaa 2580 accgaagaac gcttgctgcg accaggcaag ggagacacat caccacgatt atcgttgtca 2640 acggaaaacc agcgcgagca atgatagact caggcgcgac gaacaacttc atgtcaccaa 2700 gatatcgaga aaacatgaaa atcgaaggac ggcaaaagga aaacgccgaa cccttactcg 2760 gactagacgg ccagaaattg ggaactggtc aagtctcggt cgaaacagta cctgttacta 2820 tggctgtagg gcaacacgtc gaaagtatag cctttgacat cacgccttta ggaaacaagt 2880 acgatgtggt gttagggatc tcatggctcg aagatcataa cccaacgata gattggaagc 2940 aacggacgct tcatctgaac aattgccatt gcccaaaggg accatgtatg gggtatggta 3000 
cccgcaccct caccagcaag tgcactggta gtggaaggat cgaacggcgc gatcaggata 3060 ccgcgaaggg aaattccgcg aagaacatga ttatggcagc aacccgatat tcagaaaagg 3120 aatggttagc tgaactgatg ggatgggcac ccgctaacga acaggaacga ctagaagtca 3180 tgacgctagg aagcgaatcg gaagaggaat ggcactcttc accagaaacg aatcagacat 3240 cgccaaagtc aaacagcgac tcttggacac ttttggactc tcaggaatta gcggctaact 3300 ccgcggaaca gccaagcctg cccaaagagt atcagggatt ccgagaacta ttcgaacagc 3360 cacgaacgaa caagttacca gagcacggac cgcatgatca cactattcct attcaggaag 3420 gaaaggaagt aacatgcaaa cggatttacc caatgtcaga aaaagaatca caagctctga 3480 aggagtacat caaagacaga ctcgaaagga aacaattcga ccatcgaaaa agtccagcag 3540 gacatggtgt attattcgta cctaagaagg gaggagaatt acgactatgc atcgattatc 3600 gaccattgaa cgacattact gtcaaggaca gacacccact accgctcatt acagaaatac 3660 aagataagat aagaggagca aaatggttta cgaaactcga tattacagac gcataccacc 3720 gccgcagaat cgcggaaggc gaagaatgga aaactgcatt tcgaacaaaa tatggacatt 3780 acgaatactt ggttatgcct tttgggctca ccaacgcacc agcatcgttt cagagattca 3840 tcaatgaagc actaggagaa atcctcgatg tattcgtcat tgcataccta gacgacatcc 3900 taatcttctc gcacaacctt gaagaacacg ttcaacacgt ccagacagtt ttggaaaaac 3960 tacggaaagc ggaagtacga ttgaaattga aaaaatgcga attccatgtc caagaaaccg 4020 agtttttagg acactggata tccactgaag ggatacaagc agaggaagga aaggtgaaag 4080 ctatccgaga atggccagaa ccaaccaacc tcaaggaact gcaacagttc acgggattgc 4140 tgaactatta tcggaagttc attgaccgat acgcgcacaa gttagcacca ctctttgact 4200 tactcaaaaa atcgaagcaa tgggagtgga caaatgaaca ccaaagcgca ttcgacaaag 4260 cgaaagaagc aatcaccact gcaccaatct tggcacaaca cgatccagct aagcaaacca 4320 tcattgaaac cgacgcatct gactatgcga ttggcgcacg aatggtacaa gcggaaccag 4380 acggaaagcc acgaccaata gcattcgaat ctaggaaact agttcaagcg gaactgaact 4440 acgacatcca cgataaagaa ttgttggcta tagtgtcagc gttcaagaaa tggagagtct 4500 acctggaagg agcacaacac cagatcatcg tgaaatcaga tcataaaaat ctgacgtact 4560 tcacaaccac gaaagagctc acacgaagac agcaagatgg gctgaaacgc tatcacagta 4620 tgactttagg atcgaacact ggcaagggat cagaaaacgg tcaagccgac gccttgagcc 4680 gaagacctga 
tcacgagatc aaaggaaaaa caatcgagac tgcgatattg aaacagcatg 4740 aggacggatc gatcggatac aataagcaga cactcgcagc ggtaactgtg gaaatcaagg 4800 accctactcg acacctcatc gcaaaagcaa acaaaaggga cgaagcactc acacagaaac 4860 tcgaagcaag cgacgacctg ttcaccaaag acgaagatgg aatcgtgtac tatcgaaacc 4920 tcatttgggt tcctcagaaa ctacgaaaca tgattatcca ggaacaccat gacaacccga 4980 cacgaggaca ctttggagta gaaaaaacat cggaacagat cgcaaggaac tactatttcc 5040 cgaaatatgg ccaagcagtc aggaaataca tcgacaaatg cgaaacatgc atcagagaca 5100 agccagcaag acacaaacca tatggactta tgcaatcacc agacgcacct tcaaaacctt 5160 gggaatggat cacaatcgac tttgtgggac cactacccga atcggaagga tgggacatga 5220 taacggtgat aacggaccga ctcaccaagt acattcatct ggtaccaagc aaatcaacac 5280 tcgatgcagt gcacctcgca catctactcg tcaaccacgt ctttgttcat catggaatgc 5340 cagaaaagat cacatcggat cgagacaagt tgttcacatc aaagttttgg caatcactca 5400 cagacctaat ggggatagat cagaaattaa ctacggcata tcacccacaa ggaaatggtc 5460 agacggaaag aacaaaccag acgattgaac aatatttgcg acactacgtc aattatcaac 5520 aggatgattg ggcaaacctg ttaccaacag cacagttcgc atacaacaac gcagaacact 5580 ccacaatagg aacaacaccc ttttacgcaa atcacggata ccacgccaaa gtggcaggag 5640 aaccaagaaa taaacaacct gtcgccgaag aagcgatcga aacagtcgaa ggattgaaaa 5700 gcttgcacaa tcaattgtcc ttggacatca aattctttaa ccatcgcgcg gcaatgtact 5760 acaaccgaca ccatgaaaag ggacctacct ttaaaaaggg ggagaaagta ttcctgctcc 5820 gcagaaatat caagacaaaa cgaccaagtt caaaactcga ccatcagaaa attggaccat 5880 tcaaaatcga agaaaaaatt ggaaacgtca attatcgact aaagctaccg gactcgatga 5940 aaaggataca ccctgtattt catatctcat tattggaacc tgcaccagag aatgccaaaa 6000 tcgctgaaaa cattgaactc gataaagaag gaacggaata cgaagtcgaa aaaatactaa 6060 aagacaaacg agtcaatgga aaacctcact atctggtgaa atggaaaggt tacagtacct 6120 cagaaaactc atgggagcct atcgagaatt tgcgcggctg ccaccagctg gttcgacagt 6180 accaccagca gaagggtcaa aattcaccca aaaggagggg tcgctcatca tccgagtcag 6240 actaggatcg gacgacacgg cgtcaagaaa agcagcgtct gactcatcgg caaggcgctc 6300 cagctcacga agatcctcag cactgggagg atcctcctca tcaagttgct ccataagcaa 6360 ggagtcatgc tccaacatac 
gaccaccacg ttctttcaag aaacgctgtt gtttctgaag 6420 gcgaagaatt ttcgacatcg cctcttgcat cttttgggaa tgctccagcc agtttcgctg 6480 ggcctcttca agttcggaag caatcttctc ctggtcacga tgaagattct cccactcacg 6540 ttcactatga aactgtttta accatttccg accacgacga gtgcactccg cacaattctt 6600 gtttttggaa tccatcacac aggatttgtt actttgggca caatgatcgc aaggattatc 6660 gacaatggag ccgttggcca gaatcgcacg atactgaaga agtctgtttc gcttgcgcac 6720 ctcgttagac acgggcatga tgggaaagca tggcagaaag ttgagggagg gacctctcac 6780 ctgggcaagc ccgcaacttt atgctttcga ggacgaaagc ctgaaaagag gggggatga 6839 // ID Gypsy-69_MLP-LTR repbase; DNA; FNG; 346 BP. XX AC AECX01001171; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-69_MLP_; KW Gypsy-69_MLP-I; Gypsy-69_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-346 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001171; Positions 47444 47789. XX SQ Sequence 346 BP; 99 A; 67 C; 53 G; 127 T; 0 other; tgtcatgaac ctgtcacatc tcaggaactt taaattatct caaacaggaa cgttaagaat 60 acttcataat agttacttag ataatatgac gtatgtagga ttcataatca ggaacaaggt 120 tcttttgtct aacagttgtt ttgcggattc aaaattgtct tctctcatat atatactttg 180 tattttcaag aactgttgtg atccttttct tatcggaata attattatct ctctctctct 240 tttacgacgc tgcctagaag atgctcttct tctttgctgt gtgccaagat ctccgaaatt 300 tgagatcagc tatcccataa agaatcaccg ttaaaagctt ttgcca 346 // ID Gypsy-1_TMe-LTR repbase; DNA; FNG; 783 BP. XX AC CABJ01002980; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: long DE terminal repeat. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_TMe_; KW Gypsy-1_TMe-I; Gypsy-1_TMe-LTR. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-783 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01002980; Positions 16112 15330. XX SQ Sequence 783 BP; 214 A; 137 C; 193 G; 239 T; 0 other; tgtcacgaac gtggcttagc gagaaaaggg ttttcacgtg acacatggtt atgagttaca 60 ggtatgttat tctttcacgc atatgaccta ttcgggttat atgattagtt atgtgtggga 120 tttgtgatgc tttcgcacat ggctcaagta tgagtcatgg gacatgcgcc tgacgagtat 180 gccttacgag tgtgccttac gggtaagcct tgcgcatatg atttatgttt cgcacatgac 240 ctcattatga tgtcatgggt ttcgtgcata tgactcagtc gagtcacatg acttgattat 300 tgcgttatga ttggtttatg agtttcgcac atgacctcgc tatgaggtca tgggttttga 360 gcatgtgatc acttgggtca tatgactagt ttgttatgat ttagtttcgc acatgtgacc 420 actttaggtc acatgactaa aaggcgggaa gaacactata taaggaggcc gatgcggccg 480 aggaagatag gttcttcacg ctttgttcta agtacttttc ctatttatta ctttaacgaa 540 tataagatac ctttacgatc tttatcttta ttgcaacatt atcgttacgt taaggaaggc 600 gctttacgaa gaagagttag agaaagctta cgataggcgg tctcgataga agacaagggt 660 cttgtactat cggtaacgta ggaagaggcc tagagctatc ataagataaa ggaagaaaaa 720 gacaggaaga gaagtatcgt tccccgtcgc caagcactaa ctatccaaag agggatcgtt 780 aca 783 // ID Gypsy-4_RO-LTR repbase; DNA; FNG; 392 BP. XX AC AACW02000293; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_RO_; KW Gypsy-4_RO-I; Gypsy-4_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-392 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000293; Positions 214550 214159. XX SQ Sequence 392 BP; 145 A; 61 C; 47 G; 139 T; 0 other; tgtggtattt acacatgtaa taaagtaata atgtaagtta ctacgaacaa tggtcctcca 60 aattaggaat atcggacatt gtattacgcc acttcaaata aaatcccgat ccactccaga 120 atattctaat aagttcctgg atcaggatta ataattactt cttgataaga agatgttaaa 180 ctatgccaat catataaata actcattgtc taatatcgat taataatcaa tagtcatata 240 ctgtaaatat taatatttag ccgtgattag aaaagtataa ataccctcat atttttaact 300 tgaataaagg acgatctact ttgagtctca cttattttta tatttgttga ttgacttagt 360 actttttata ttataaaacc tcaaatatca ca 392 // ID TCN10-LTR repbase; DNA; FNG; 528 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 30-MAR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR - consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW TCN10-LTR. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-528 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-528 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-528 RA Gentles A. and Jurka J.; RT "C. neoformans LTR sequence TCN10-LTR."; RL Direct Submission to Repbase Update (15-MAR-2005). 
XX DR [3] (Consensus) XX SQ Sequence 528 BP; 140 A; 132 C; 112 G; 144 T; 0 other; tgtagtgcaa tgggcctacg gccatgtcac atcacttcga gcacccactt atcaacatga 60 cggaggttac cgcagggatc aagagcggag tagagccgtt gctgaagtgg gataccttat 120 tacttttact ttcttttact tagttcactc tgggtactta gtccattcta ggctagctat 180 ctatgatgtt ctccttagat ataatctatc ttcgtatatg cagccttata ggcagacgct 240 ttatagcgca agtacagcag gactgtatca gcaacaccag ccagctcgac atgagacccg 300 agcttaacta atcgttacca ctataccact ataccaatct atcaccagat acatagggta 360 cttccatcgt atgtactatg accaccacag ccaaaccttt gggaggctat attccctagc 420 agcctttaca gtcgtacagt ccgtcatcag ccttggatat catgggtgga ggccgacgga 480 acctaatgga agcatatccg taggagactt taggggtttt acgctaca 528 // ID Gypsy-4_LBS-LTR repbase; DNA; FNG; 144 BP. XX AC ABFE01000666; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_LBS_; KW Gypsy-4_LBS-I; Gypsy-4_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-144 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000666; Positions 265556 265699. XX SQ Sequence 144 BP; 39 A; 34 C; 45 G; 26 T; 0 other; tggaataggg aatggggcat tccccccgcg gaggcacgtg agcgttagat tagtcatgca 60 gtagtgtaga gagggaccag aagggcgaat agagcatcat tccaccaggc atctgcatcg 120 tcggaagcac tagagcccat tcca 144 // ID Gypsy-78_MLP-LTR repbase; DNA; FNG; 270 BP. XX AC AECX01001152; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-78_MLP_; KW Gypsy-78_MLP-I; Gypsy-78_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-270 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001152; Positions 19481 19750. XX SQ Sequence 270 BP; 58 A; 56 C; 55 G; 101 T; 0 other; tgtaagggag gagttagatg tcacaagatt acaagtgtat agtttcacac gtgtgttagt 60 tagagtcttt atagtcagtt gtagattctc ctccctcttg tttcttttcc cttgaggttc 120 cctttcctcc ttcttgattc tgtagtcttt tatattgatt atctggtatt taggccttat 180 tcaggaatac aaacgtggtg atagttctgt ctcagcttcg gccccttgag aaacagtccc 240 tagtgaaggt gcattctcac gcaccttaca 270 // ID Gypsy-27_LBS-I repbase; DNA; FNG; 8695 BP. XX AC ABFE01002852; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_LBS_; KW Gypsy-27_LBS-LTR; Gypsy-27_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-8695 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01002852; Positions 81811 90505. XX CC Positions [2134-2712] - Reverse transcriptase CC Positions [4267-4752] - Integrase core CC 'GCATG' target site duplication CC LTRs are 96% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 4168..5394 FT /product="Gypsy-27_LBS-I_3p" FT /translation="MSKDAETYTNSCDVCQKIKSDHRAKMGVLRLAHIPLR FT PFATVSLDMITGLPPSGEQGYTAILVIVDKLTKFAIIIPTHTTLSQEGFAK FT LFVERVVNVYGLPEVIISDRDRRWATTFWKSVVANYGSVMALSSAHHPQTD FT GQTEILNATIEQMLRAYVSSDKESWSSWLSVLAYSYNSSVHSSTRYSPNFL FT LMGYNPRTSTSAIIPEVDPALRPFLPSQSAEDFVEAIEIHRNMAKDAIALA FT QDRQAKAYDKKRRPVEELKVGDYALVNPHSLELVDVAGTGKKLVQRMIGLF FT EVVEKINPMVYRLRLPNTYSMHPVFNLDHLKKYVPSPDSFGECTELPSTRE FT LRASEEYEVEAILGHRLVGKKKANRRMFLVRWKDYGPADDSWVSEYDLRNS FT AQLKRDYLDSMKLLF" FT CDS join(1042..2034,2038..4038) FT /product="Gypsy-27_LBS-I_1p" FT /translation="MTDSPRLREGMRMKLYHLTGGAKVLGYIKTELYATAQ FT DGSIISFELEAYVVRNMNVPLLLGEDFQTTYELSVTRHASGQCEVLVGRSG FT HVISAASALNVDLGFEIRVAHTAKSFIRRKAVARAKNKGSAKKQAEVLSYE FT DVLIQPYSVRNVAVSAAFEGREDWIVEKVIIGTDNQNIMAAPTTWISAKCP FT YIPIANTSPHPRYIRTGEVVGYLMDPEATLDKPKDEDCYNKMVASAEVFKK FT TIVGTLKAQDLATTKNPAADTPGHDDCLDDNSSWGPKTTAVPKDPLVGDVD FT KLVNLGPDIPLEYQGRLTKVLRRNAAAFGVNGHLGRIEAVGIPLLPDTQPI FT SEPMYGASPAKREVIDKQMKTWFEAEVIEPLVSPWGFPVVISYRNGKPRLV FT IDYRKLNAKTIPDEFPIPRQSEIIQALSGAQVLSSFDALAGFTQLEMADDA FT KEKTAFRCHLGLWQFKRMPFGLRNGPSIFQRIMQGVLAPFLWLFALVHIDD FT IVVYSKSWEDHLEHLDRVLGAIAASRITLSPAKCFIGYSSILLLGQKVSRL FT GLSTHKEKVQAIMDLARPTSVSDLQKFLGMVVYFSTYIPFYSMIAAPLFDL FT LKKGVKWQWRAEQETAYEQAKEALVNAPVLGHPVANQAYRLYTDASDLALG FT ASLQQVQTIQVKDLKGTPCYARLESAWEAGKEVPQLIVKLHKEKHEVKQQD FT EWGATLDETTVHVERVIAYWSRTFKSAERNYSATEREALAAKEALVKFQPF FT IEGKEITLVTDHAALVWARVYENANRRLAAWGAVFAAYPGLTIVHRAGRIH FT SNVDPLSRLIQIPPHDSPLSDDVVPIEQDNRKHNIAQKAEDRIFRAAAPKA FT AFSAFWWEDAIDKHVSPVRTRRQLAAESEHESAETTTENTRNDTSDTSDEQ FT GEVLPFLNSNHWTYPAGIKPSDTTPDEEWSTRTQLLISVDSVIHKEFSKGY FT EEDKFFAPRYIKVQPNEKTIISASHFQRGQDDLLYFIDAGWRT" FT CDS join(5546..6727,6731..8629) FT /product="Gypsy-27_LBS-I_4p" FT /translation="MGPFDPTRNPQLSFPGKEWRAFITVSAAGQPRESPEH FT EPITNCWRSDNAPAFDTGSVHSNYIDKLIRVNQEVEKRMEALYKVYSDRFS FT MEQTHRALWSPAIRPVAPSSEYLDTLWHIKRFSSAVDWITDAQRGIKDKRA FT FVDYVSRLQASPFWMPDPNAIVGMADDRYLGVWLNGTDERLARWYLKEGVP FT 
CFVVREISHPECVQLTALETMIDFAAGTYASAIHWSINEYDSRALINGDVL FT LSDMTSFHYPGWMWPDHQDKIRSVSAKGSSEMQVTNYEPPPLDIILIARDR FT VSWVRPPPVKKAEQSRPGASSLERKKWIKFVEKHDPKGTFQEVGSKHVIDH FT HVHSRYDREKHWHIHFLRPLKAPEGCISEIEVFGQPCPAGIYLNFGGKRMQ FT PHWLYHSLEPKATDVGRKAPNPKPEDLPFLDRSKRPPPPSDDDNDSDGDNY FT PEYKGYVLPVQDPEKTLTEIRESATVNPAAQTLVSEPAALGANTRMEPYRT FT ETTTLPPTLPPSVSNHARSDDEVSLGDVPEPMGPQISAPVTDTDIGMSDPA FT TEDPLEFASSYLMLYGTPDSESFSTVRTLVTDVASRLNFTVRRLFRVNTER FT GHSFWFEMASVEQARRMRAYMHHRRESTFELLVAYANYDDYVKALMRNTHQ FT WPETTESNRSQVVVTPSPALPVAGTSTRRLNRRDHPPSRETRRRSPSRDRY FT RSSRRSPSISRRRSPARPTYPRRRHQLSPLPRSRYCSPDRRSRSPAFHKRN FT SSQEPATRPSVPRERSPPRPAASVHPEPLSVPPLPVFPGMPASLGISPNTL FT LPFGANIAFMWSPSGNTLSPVLLQGNTTVIPFPLPSATPTPSALLPWPVAA FT ALPTPMIPSTTRTSNSSSSSLALRIAAERETRPVTPPPAASSPTLMSRMTD FT HLSARLSDPTTPLNLSDRLSDPTRLTLADRLNVDDYMDVDRTTGSFPADFH FT AYRDQTTRDTTTSSQPSVTGEPVPAQGDSDSEVDEEGELGYKKTKRGRRSG FT HKIQGYQRRDEERKHRRQGRR" XX SQ Sequence 8695 BP; 2468 A; 2480 C; 2014 G; 1733 T; 0 other; atggtggggg caccgcggac ttcaccaacg tgaagtccct gagtgcctaa ctacggagga 60 aacgcaacag gtgagtacca tcaatgcaat ctgtcgacta acgacaatct ccagacttcc 120 cgctccctca aggcctcgaa tggacggaac gtcaaccgac aacccaacaa gaccctcgtt 180 agcacgcacc cccgctgctt ccaagggtaa caacatcaag gacgacctct ctgaaccttc 240 gttagcgcgc gcatccgctg ctttcgaagg tagcgacacg gatgacgatt cctcatcatc 300 cataacctta gtatcgtata cacgatcaac gagtagttca gtcaaccttt ccagaccaaa 360 cgatgcacca accgctcccg ttagaccatc tctcccaccc catacgagct atgcccaaca 420 acctcagcta tgcctgaact actcagtcaa agaagagccc cctccagaaa atggaatgtc 480 ctcaggaacg ctcatcaaat tcacgtagac tgggatccta gtgaagagga ggaagcggat 540 agactgtacc tagttatgct ggctgaaact aagacctcta tttccgctta cgagagtgag 600 aacctactta agaccttcgc ccgtgaaata cacagctcaa aaggcaacgt aagaggagta 660 ctcgccgcta caaagcgcga aactcccatt acagagactt acaaccatgc ccctcaatac 720 gaagcactcc cagaagccag cttacataag gtatacccca tgcatcgcaa caagagaaag 780 aataaaggga aatcaaggaa tccggacacg atcactgaaa gaaccaaagg aatccagaat 840 agaccgatca agcatatcta tgagtcgggg aaggtatttc 
ccgctattaa aggacgatca 900 ctaccagaag gaatgggctc attaggaact aaggccctac atatgcgggc taagatccac 960 gagctagact ctgacgacgt caaggcatga ctagactcgg gggctgacat caccctcatt 1020 tctgaagaat tctggaaatc catgacggac tcccctcgac tgagggaagg aatgaggatg 1080 aagctttacc acctgaccgg aggtgcaaag gtcctaggct acattaagac cgaactctac 1140 gctacagcgc aagatggttc gatcataagc tttgaactcg aagcgtatgt cgtaaggaac 1200 atgaacgtgc ccctcctgct cggagaagac ttccaaacca cctacgaatt gagcgtgacc 1260 agacacgcct cgggacagtg cgaagtactg gtaggaagat cagggcacgt catatccgca 1320 gcctctgccc ttaacgtgga tttaggcttc gaaatacgcg tagcccatac tgccaaatct 1380 ttcattagga gaaaagccgt agcacgtgct aagaacaaag gaagcgcgaa gaagcaagca 1440 gaagtcctgt cctatgaaga tgtcctaata cagccataca gcgtccgtaa cgttgccgtc 1500 agcgcagcct ttgaagggcg cgaagactgg atagtggaga aagtcatcat aggaacggac 1560 aatcagaata taatggccgc ccctactacg tggatatccg ccaagtgccc ctacatacca 1620 atagctaaca ccagccctca tcctcgatac attaggacag gagaagtagt aggatacttg 1680 atggaccccg aagctaccct agacaagcct aaggacgagg attgctataa caaaatggtg 1740 gcatcagcag aagtctttaa gaaaaccata gtaggaaccc tgaaagccca agatctggcc 1800 accacgaaga atccagctgc ggatacccct ggccacgacg actgtctcga tgacaactcc 1860 tcctggggcc ctaaaactac cgctgtaccc aaagaccctc tagtaggaga tgtagataaa 1920 ctagtcaatc taggacccga catcccgtta gaatatcaag gacgtctaac caaagtattg 1980 cgcagaaatg ctgcagcctt tggagtaaac ggccatctag gacgtatcga agcgtgagta 2040 ggaatcccgc tgcttccaga tactcagcct atatcggaac caatgtacgg cgcttcacca 2100 gcgaaaagag aagtcatcga caagcaaatg aaaacgtggt tcgaagccga agtcatcgaa 2160 ccgttggtta gcccctgggg cttcccagtc gtgatctctt atcgtaatgg caaaccgcgc 2220 cttgtcatag actatcgaaa gctgaacgct aaaaccatac cggacgaatt ccccattccg 2280 cgacaatcag agattataca ggccctctct ggagcccaag tactgtcctc tttcgacgcc 2340 ctagcaggat tcacacagct cgaaatggcg gacgacgcga aggaaaaaac ggctttcaga 2400 tgccatctgg ggctgtggca gttcaaaaga atgccattcg gtttaaggaa tggaccctct 2460 atatttcaga gaataatgca gggagtacta gctccgttcc tgtggctatt tgctctggtc 2520 catatcgatg atatagtagt ctattccaag agctgggagg atcatctaga 
acacctggac 2580 agagtacttg gagccatagc cgcctccagg ataactctat caccagccaa gtgcttcata 2640 gggtactcct ccatactact gctgggacag aaggtctcgc gcctgggatt gtccacccac 2700 aaggagaaag ttcaagctat aatggatctc gctcgtccta cgtcagtatc tgacttacag 2760 aaatttctag ggatggtcgt gtacttttcg acgtacatac cattctactc catgatcgca 2820 gcaccactgt tcgacttact taaaaagggc gtcaaatggc agtggcgcgc ggaacaggag 2880 accgcttacg aacaggccaa ggaagcccta gtgaacgctc cagtgctggg tcaccccgtg 2940 gctaaccaag cgtacagact atatacggac gcctcggacc tagcactagg tgcgagctta 3000 cagcaggtcc aaacgatcca agtaaaggac ctcaaaggca caccttgtta cgcgagacta 3060 gaatctgctt gggaagcagg taaagaagta ccgcagctca tagtaaaact gcataaagag 3120 aagcacgaag tcaagcaaca ggacgaatgg ggggctacgc tagacgaaac cacagttcac 3180 gtggaaagag tgatagccta ctggagccgc accttcaagt cagctgaaag gaactacagc 3240 gccaccgaac gagaagctct agctgctaag gaagctctcg tcaaattcca acctttcatt 3300 gaaggcaaag agatcacttt ggttacggat cacgcagccc tggtatgggc gcgagtttat 3360 gaaaacgcta accgacgact agcagcctgg ggggcagtat ttgctgccta tccgggactc 3420 acgatagtac atagagctgg tcgcatacac tccaacgtcg accctctctc tcgtctaata 3480 caaatacccc cacatgactc cccactcagc gacgacgtcg tacccatcga acaggacaac 3540 agaaaacata atatagctca gaaagctgag gatcgtatat tcagagctgc tgctccaaaa 3600 gccgctttca gtgcattctg gtgggaagac gccatagaca aacatgtctc ccccgtccgt 3660 acccgtcgac aactcgctgc cgaatctgag cacgaatccg cggaaacaac tacagagaat 3720 actagaaacg acacgtcgga cacttcagac gagcaagggg aagtactccc cttcctaaac 3780 tccaaccact ggacgtaccc cgctggaatt aaaccctctg acaccactcc agacgaagaa 3840 tggtcaactc ggacgcaact acttatatct gttgactccg tcattcataa ggaattctct 3900 aaaggttatg aagaagacaa gttcttcgct ccccgctaca tcaaagtcca acctaacgag 3960 aagaccatta tctcagcgag ccatttccaa agaggtcaag acgatttact ctacttcatc 4020 gacgcgggtt ggagaacatg actttgtgta cccaaaacta aagttaacta tgtgctccgc 4080 tggatacacg agtcccctta cgagagtgct cacgccggac cacactgctt catagcgcga 4140 ctccaagagc tattcttctg gccttccatg agcaaagatg ccgaaacgta cactaactcc 4200 tgtgacgtat gccagaagat caagtcagat catcgggcca aaatgggcgt gttaagacta 
4260 gcacacatac ccctacgtcc attcgccaca gtatcactgg acatgatcac tggattgcct 4320 ccctccggag aacagggcta cacggctata ctagtgatag tcgataaact caccaagttc 4380 gccatcataa tacccacgca tactacccta tcgcaagaag ggttcgccaa gctattcgtg 4440 gagagggtag ttaacgtata tggcctccca gaagttataa tctcggacag agatagacgc 4500 tgggctacca ccttctggaa atccgtcgta gccaactacg ggagcgtaat ggctctttcc 4560 tcagctcatc atccgcagac tgacgggcag acggagattt taaatgctac catcgagcaa 4620 atgctacgcg cttacgtctc ctcagataag gagagctggt cgagttggct cagcgttcta 4680 gcctactcgt ataatagcag cgtacattcc tcaacaagat actctccaaa cttcctactt 4740 atgggataca accctcgaac ctccactagc gcgataatac cagaagtaga tcccgcgctg 4800 cgacccttcc tgcccagcca atcggcagaa gactttgtag aggccataga aatacatcgt 4860 aacatggcca aggacgccat agctctagcc caagatcgac aagcgaaggc ctacgataag 4920 aagagacgac cggtagaaga actaaaagta ggggactacg ccttagtgaa cccccactct 4980 ctcgagctcg tagatgtcgc aggaactggc aaaaagctcg ttcaaagaat gataggtctg 5040 ttcgaagtcg ttgaaaaaat taatcctatg gtgtacagac tacgactacc caacacctac 5100 tccatgcatc cggtcttcaa cctggaccat ctcaagaaat acgtcccatc ccccgatagc 5160 tttggagaat gtacggaact accttcaacc cgagaactac gggcctcaga agaatacgaa 5220 gtggaagcaa tcctaggtca ccgtctagtg ggaaagaaaa aagccaatcg acgaatgttt 5280 ctagttagat ggaaggacta cggacccgcg gacgattcat gggtctctga gtatgatcta 5340 cgtaactcgg ctcaactaaa gcgggactac ctcgactcaa tgaaattgct gttctagaag 5400 cccgataagt catgacataa accctaacct tcgattaacg tcccccaatt cgttacaaaa 5460 cacgttttgc ccagggaaac gaccttcaaa cctaaacctc gcagttagct ccttccgaga 5520 gttcgaacga cgctatgacg gccacatggg accattcgac cccactagaa atccgcaact 5580 cagcttccca ggcaaggagt ggagagcctt cataactgta tccgctgcgg gtcaaccacg 5640 cgaaagcccc gaacacgagc ccatcacgaa ctgttggaga agcgacaatg ctcccgcctt 5700 tgacaccggc agcgttcata gcaactacat cgacaaatta attcgcgtga atcaggaagt 5760 cgagaagaga atggaagccc tatataaagt gtactctgat cgcttctcca tggagcagac 5820 acatagagct ctgtggagtc cagcaatacg gccagtggcc ccttcctcgg aatacctgga 5880 cactctttgg catatcaagc gattctcgtc tgctgtagac tggatcacgg acgcacaaag 5940 
agggatcaaa gacaaacgtg cattcgtgga ctatgtctcg cgcctacagg cctcgccttt 6000 ctggatgccc gatcctaacg ccatagtagg tatggcagac gaccgttacc ttggcgtatg 6060 gttgaatggg acagatgaac gactagcccg ctggtacctg aaggagggtg tcccctgctt 6120 cgtcgtaaga gaaatctcgc atcctgagtg cgttcaactc acagctttgg aaacgatgat 6180 agacttcgcg gcgggcacct acgcctccgc catacactgg agtatcaacg agtacgactc 6240 gcgagcactg ataaatggag atgtactcct gtcagacatg acttccttcc actaccctgg 6300 ctggatgtgg ccggaccatc aggacaaaat tagaagcgtg tcagcgaaag gatcgagcga 6360 gatgcaagta accaactacg agcctccccc gctggacatc atcctcatcg ctcgggacag 6420 agtgtcgtgg gtcagacccc ctccagtgaa gaaggccgaa cagagtcgac caggagcctc 6480 ttccctagaa cgcaagaaat ggatcaagtt cgtagaaaaa catgacccca aaggaacgtt 6540 tcaggaagtg ggttcaaagc acgtcattga tcatcacgtt cactctaggt acgataggga 6600 aaaacattgg catatacact ttctgaggcc gctgaaggca cctgaaggtt gcatatcaga 6660 gattgaggtc tttggacaac catgccccgc cggaatctac ctaaacttcg gaggaaagcg 6720 aatgcaatga ccccattggc tataccattc cctggaaccc aaggctacag atgttggtag 6780 aaaagctccc aatccaaaac cagaagacct ccccttcctg gaccggagta aacgtcctcc 6840 ccctccttca gacgacgaca acgacagcga tggcgacaat tatcccgagt acaagggcta 6900 tgtcctacct gtgcaagatc cagaaaaaac cctcactgaa atcagggaga gcgcgacagt 6960 caaccccgct gctcaaaccc tagtatcaga acccgctgcc ttaggcgcga acacgaggat 7020 ggaaccctac aggacagaga caactacctt accccctact cttccccctt ctgtatccaa 7080 ccacgcgcga tccgatgatg aagtctctct aggcgacgtc cccgaaccca tgggccctca 7140 aatctctgcc ccagttacgg acacagacat cgggatgagc gatccagcca cagaagaccc 7200 cctagagttc gccagctcat acttgatgct ttacggaact cctgattcag agagcttctc 7260 aacagtacgg accttagtaa cggatgttgc gagcaggctt aattttacag tgaggcgcct 7320 gttcagggta aatacagaga ggggccacag tttctggttt gaaatggctt cggtggaaca 7380 agcacgacga atgagggcct acatgcatca ccggcgagaa agtactttcg agctactagt 7440 cgcctacgcc aactacgatg actacgtcaa agcccttatg cggaacactc accaatggcc 7500 ggaaacaaca gaaagcaaca ggagtcaggt cgtcgtaact ccttcaccgg cattacccgt 7560 cgcagggaca tccacgcgca ggctgaaccg aagagatcac cccccttcgc gcgagacacg 7620 tcgaagatcc 
ccctccaggg atcgctatcg ttcaagtaga cgatccccct ccatttcacg 7680 ccgtagatct ccagcacgac ccacgtaccc caggagacgg catcagctct cgcccttgcc 7740 tcgctcgcgc tattgttccc ccgacagaag atccaggtca cccgcatttc ataaacgtaa 7800 ctcatctcaa gagcctgcga ctcgaccatc agtcccaaga gaaaggagcc cccctcgccc 7860 tgcggcatca gtgcaccccg agcctttgtc agttccccct ctgcccgtgt tcccaggcat 7920 gccagccagc ctagggatat cgcctaatac cctgctgccc tttggtgcga atatagcgtt 7980 catgtggtcc ccctcgggca acaccttgag cccagtcctc ctgcaaggca acacgacggt 8040 gattcccttc cctcttccat ctgcaacgcc cacccctagc gccctgctgc catggcccgt 8100 ggctgcagcc ctgcccaccc ctatgatccc aagcacaacg agaacctcga acagctcatc 8160 ctcgtcactg gccttgagaa tagccgccga aagggaaaca cgacccgtga cccctccccc 8220 tgccgcctca tctccaactc tcatgtcacg tatgaccgat cacctgtcag caaggttaag 8280 cgatccgacc acgcccctaa atctatccga ccgcctgtca gaccccacta gactgacatt 8340 ggcggatcgt ttgaacgtgg atgactacat ggatgttgac cgcactacag gatcattccc 8400 agcggacttt catgcttaca gggatcaaac gacgagggac actacgacgt caagccaacc 8460 ctcagtcacg ggagaacctg tacctgcaca gggggactcg gacagcgagg tggacgagga 8520 aggcgagcta ggctacaaga agactaagag aggtcgccgc agcggacaca agatacaagg 8580 ttaccaaaga cgcgacgagg aacggaaaca ccgaagacag ggccgacgct gacagcatct 8640 tgcacacctc cttctttctt tttcgtcggt gctccaatct cacggggggg ctttg 8695 // ID Mariner-3_AF repbase; DNA; FNG; 1852 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of Mariner DNA transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-3_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-1852 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). 
XX RN [2] RP 1-1852 RA Kapitonov V.V. and Jurka J.; RT "Mariner-3_AF, a family of Mariner DNA transposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 98-98 (2006). XX DR [2] (Consensus) XX CC It is a family of DNA transposons from the Mariner superfamily CC (Tc1 clade). The genome harbors 3 copies that are 99.6% identical CC to the consensus. It encodes a 351-aa Mariner-3_AFp transposase CC (two exons at pos. 625-1533 and 1582-1728). XX FH Key Location/Qualifiers FT CDS join(625..1533,1582..1725) FT /product="Mariner-3_AFp" FT /translation="MGLCTASKVITAVDRSERPRTVIQGNREWVTIIECVS FT SKGISIPPVVILKGKEHQAPWYQESNLPPDWRLTNSTNGWTTDEIGLKWLK FT EVFNPFSSLHSTGAKRLLILDGHSSHQTAEFDDFCKENAIICLCMPPHTSH FT LLQPLDVGVFGPLKRSYGKLVEGMMVAGNNHIDKEDFLYLYPPAREAVSNQ FT RNICNGFKGSGLRPLNKDQVLEKITFQLRTPTPPPLIEGSISSAFQTPQNP FT RQLDHKVRSLQRSLQKRKLSSSPVSHIQHLEKAAQMAMSMNLLLQQEIKAL FT RAENEWKLKKKEGRDRIQQLNEQVDEQVDKPIPEPRQRAPPRCSGCWTIGH FT TIRNCPSK" XX SQ Sequence 1852 BP; 541 A; 413 C; 423 G; 475 T; 0 other; acgtaatcaa ccaccgaacc gccgatacca ccgaaccgcc gcttcgccgc gaatatcatc 60 gaattcttgg agccatctaa tcaacacgtt atttctttac cactattatt aaaaatgcca 120 ctatccaaag agaatcgaat gcagatggcc atatcagcat ataaaaaggg gcaattcaaa 180 tcaaaagcag ccgctgctaa ggtctttggg gtgtctagag agacccttcg tgatcggctt 240 cgcggaatca aaccacgcgc agagacacgc gctaatagcc ataagttaac agctcttgaa 300 gaggaggccc ttgctaagcg tctattagat gctgataggc gtggcttttt aattcgaccg 360 cagttcctgc gtggaatggc acatattcta ctatgtgcac ggacaaatga tccaacttca 420 gtcattggag tcaactgggc atataagttt attaaacgcc atccagcact gcgtacaagg 480 tataatcgga ggatctcata ccagcgggca aagcaggaag atccaaagat tataaaacag 540 tagtttgagc ttgtgcatgc cactattcaa gagtatggta tccatgaaaa tgatatctgg 600 aactttgatg aaactggctt cgcaatggga ctctgtacag cctcaaaggt tattactgca 660 gtagatcgca gtgaaagacc tcgtacagtt atccagggaa accgcgaatg ggttacaatc 720 attgaatgcg tgagctcgaa gggaatttct ataccaccag tggttatctt gaagggaaaa 780 gaacaccagg ctccttggta tcaagaatca aatcttcctc cagattggag gcttaccaat 840 
agcaccaatg gctggacgac agatgagata ggccttaagt ggttaaagga ggtctttaat 900 cctttctcta gcctacactc aactggtgca aagcgattac tcattcttga tggccattca 960 agccatcaga ccgctgagtt tgatgatttt tgcaaagaga atgcaattat ctgtctatgt 1020 atgcctccac atacatccca tcttcttcaa cccctagatg ttggggtttt tgggccgctt 1080 aaacgctcat atggaaaact agtggagggg atgatggtgg ctgggaataa ccacattgat 1140 aaggaggatt ttctctacct ctaccctcct gctcgtgaag cagtatcaaa ccagaggaat 1200 atatgcaatg gctttaaagg atctggcctt agacctttga ataaagatca ggtccttgag 1260 aagatcacct ttcaactccg tacgccaaca ccaccacctc ttatagaagg atcaatctca 1320 tctgcttttc aaacacctca aaaccctcgc cagcttgatc acaaagttcg cagcttacag 1380 agaagcctgc agaagaggaa gctttctagc agtccagtct ctcatattca acatcttgaa 1440 aaggctgcgc agatggcaat gagtatgaat cttcttcttc aacaggaaat aaaggctcta 1500 cgggctgaga atgagtggaa actgaagaag aaggtaagga agcatgcttc tctagggaat 1560 gacctctttt tatctatcca ggaaggtcgt gaccgcattc agcagcttaa tgagcaggtt 1620 gatgagcagg ttgataagcc tataccagag cctcgtcagc gtgctccacc acgctgcagt 1680 ggctgctgga ctattggaca tacaatacgg aattgcccta gcaaatagct atatattcat 1740 atattagata attgatttag tagcgttggt gattagatgg ctccaagaat tcgatgatat 1800 tcgcggcgaa gcggcggttc ggtggtatcg gcggttcggt ggttgattac gt 1852 // ID Copia-20_MLP-I repbase; DNA; FNG; 4613 BP. XX AC AECX01002710; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-20_MLP_; KW Copia-20_MLP-LTR; Copia-20_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4613 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002710; Positions 16514 11902. 
XX CC Positions [1889-2401] - Integrase core CC 'AGACA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(83..3502,3506..4603) FT /product="Copia-20_MLP-I_1p" FT /translation="MPSGSPTHESSPTGTHRDLLPPASPADSNTSTTSSST FT AIPPLDDTVSDDFLLLQLLPSADMSTPAGTNTEDRIKIPELTDGNFPHWNK FT RLHFALQTRGLLKFLTEDSPPTAADGLALYRRQQGRVMELIVNSLNAANDS FT LIQITDTPKIAYEKLSKAHGSSGGVLAAAAICEIATAQLESGQSLSDFIIK FT IRTLHNQLSQYANEDKEIALSSKLLAIFLCNGLGKDFEYITAPFFADLSKL FT TVQSVMDRLVLETAKQSSSGQNTSAVSAFKSSSSPAVNNSAPNVNNSGSAS FT NPNSNYRRIGSGPNDLCHLPNHRGLSHTNRQCHGQKGKNKTGGHGSIPATN FT SLTEAEMVKRYLAIEAAHQAKKSSQAQTPQAPAPTYAVLTSNQFSALDLAE FT DFPDEGFAAQAYVATNKSIDHDYILADTAATRNICFNRSMFVQLKPVSPVK FT ITGISGDPQFAHHIGSIVIPGFSEEDHSTIDILIPDVLFVPTMTVNLLSLS FT QLCANGALFSGSGTSITVVGMTGRSYFICEKTSDERLWQCKVRLSNSVSAF FT SASAETWHYCLGHLNYDAIRRLASSGSIKISPRVSSSSINSCMTCQKSKIF FT CLPFKSHFPASSNVLNRIHSDVIGPFPVSLGGARYIVSFIDDCSRYASIFP FT IKLKSDVFECFVNFKNQVEVLFDKKIKFLHTDRGGEYQSANFLSFLTTHGI FT TLEQGPAETPEHNSVSERYNRSLIERVRCNLHHTSLPVKMWAEIALATAFT FT LNHSPHSYLNHESPLSIWNSFIPDTGLHGPDPSFLRTLGCSAIYLAPRISG FT KLGLKGREGVLLGYEKNAKAYRVWDLEHKKVIITQSVIFNETLFPFSTSTL FT TSDSPKFVILQDDGYDIPDVTAPNLSSLSPINSTNTSVPHIHSSATTQGLK FT PLLLSSISPRTSPSPITRRHHSITDVEPSSSTVFSNIRGSPSRTASTSTKD FT NVGRPVRTTSKPQHLGNFIGHAGTDLNDTPTYNQAMNAPDADEWKKAMKIE FT FDSLVQHNVGKLVPKPLDARVIGGMWVLKKKRDENGVVLKYKARWVCFGNR FT QIEGIDFNDTYSAVGKTDTFQLLVAIAAYLKCSIIQFDIITAFLHGIIKER FT VYIQQVKGFVEPGSEGMVWELGKSLYGTRQGADFSDHLCGVLLAFGFTTSK FT SDDCLFIFQRKSNFLYLHMHVDDGFLISNSDSLINDFKSHMLKSYELKWKI FT KPTLHLGMHLSYHNDGSIFINQTHYLQDILDRFGMDDLNPVKLPFPVGLRL FT QNGSPEDVQAAAYLPYQSLIGSLNWAAISTRPDIAYAVSQLSRFNSCYTFA FT HWNAAKHLIRYIKGSISQGILFKGSMPAELKGFGDADYANDPLDRRSVTGY FT LFTYGGSIISWRSRRQKSTALSTTEAEYMAILDCARHALWFKSLFNDLKLP FT VSSVSLSSAGQAIQLFNNNRGTVLLSKEPVINDRSRHIDVRYHFIRDHVRL FT RNITTAHVPTTSMPADFLTKPLSIEAFQRCCDQISVLECSS" XX SQ Sequence 4613 BP; 1285 A; 1118 C; 858 G; 1352 T; 0 other; ggttatgagc ctttaaaaac tcaagtaatc acgcggttta tacattacgt catcagctta 60 acagatcaat 
catcttgact cgatgccttc cggttctcct actcatgaat cgtcgccaac 120 tggcactcat cgagacctcc ttccgcctgc atctccggct gattccaaca cctcaactac 180 ctcgtcctcc actgccatac cgcctttgga cgacactgta tcagacgatt ttctcttgtt 240 gcaactttta ccttctgccg acatgtctac tcctgccggc acgaatactg aggatcgaat 300 caagatccct gaacttactg atggcaactt cccgcactgg aacaaaagac ttcactttgc 360 tttgcaaact cgcggccttc ttaaatttct tacggaagac tcacctccaa ccgctgctga 420 tggactggct ctctatcgcc gccaacaagg acgggtaatg gaactaatcg tcaattccct 480 caacgctgcg aatgattctt taattcaaat cactgacacg cctaaaatcg catatgagaa 540 actttcgaaa gcccatggta gcagcggtgg tgtgctcgct gcagcagcta tctgtgaaat 600 tgcgaccgct caactagagt cgggccaatc attatctgat tttatcatca agattcgaac 660 tcttcataac caactctctc aatacgcaaa cgaggacaag gaaatagctc tctcttccaa 720 attacttgct attttcttat gcaacggttt aggaaaggat ttcgagtata ttactgcacc 780 tttcttcgca gacctatcca aactcacggt acaatcagtg atggatcgac ttgttctgga 840 aacggccaag caatcttcct ctggtcaaaa cacttctgca gtttcggctt tcaaatcttc 900 ttcctctcct gctgttaaca actctgctcc taatgtcaac aattctggtt ccgcctccaa 960 tcccaactcc aactatcgtc gtattggttc tggccctaat gacttatgtc atcttcccaa 1020 tcatcgtggt ctatctcata caaaccgaca atgtcacggt cagaagggga agaacaagac 1080 aggtggacat ggaagtatcc cagcgactaa ctcactgacg gaggctgaga tggtaaagcg 1140 ttacttggca attgaggctg ctcatcaagc caaaaagtca tctcaagctc aaacccctca 1200 agctcctgct cctacatatg cggtattgac ctcgaatcaa ttcagtgctc tcgacttagc 1260 tgaagatttc ccagacgaag gatttgctgc tcaagcctac gttgcgacca acaaatcaat 1320 tgatcacgat tacatcctgg ccgacactgc tgctactcgg aacatatgct ttaataggtc 1380 aatgtttgtt caactcaaac cagtatcacc agtcaagatc acaggaatct ctggagaccc 1440 tcaattcgcc catcacatcg gatcaattgt cattcctgga ttctctgaag aagatcactc 1500 caccatcgac atcttaatac ctgatgttct atttgttcca accatgacgg ttaatttgct 1560 ctctttaagt caactatgcg ccaatggagc attgttttca ggatcaggaa ccagcattac 1620 tgttgttgga atgactggtc gtagctattt tatatgtgaa aagacgagtg atgaaagact 1680 ttggcaatgc aaggttcgcc tctcaaactc agtttctgct ttttctgctt cggctgaaac 1740 atggcattat tgcttagggc atttgaacta 
cgatgccata aggcgtcttg catcatcagg 1800 ctctatcaag atttctcccc gggtttcatc ttcttctata aattcttgta tgacatgtca 1860 gaaatcaaaa atcttttgtc ttcctttcaa gtctcatttc ccagcctcat caaatgtctt 1920 aaatagaatt cactccgacg tcattggtcc ttttcccgtt tctttgggtg gtgctcgtta 1980 tattgtatct ttcattgatg actgctcaag atatgcctct attttcccta tcaaactcaa 2040 gtcagatgtt ttcgaatgtt ttgtaaactt caagaatcaa gtagaagtgt tgttcgacaa 2100 gaaaatcaaa tttctacata cagatagggg aggtgaatat cagagcgcaa attttttatc 2160 ttttttaact acgcatggca ttactttgga gcaaggaccg gctgagaccc cagaacataa 2220 ttctgtctcg gagaggtata acagaagctt aatcgaaaga gtaagatgta atttacatca 2280 tacgtcactc ccggtaaaga tgtgggccga gatagcctta gccacagcct ttactctcaa 2340 ccactcacct cactcgtacc tcaaccacga atctccgctc tcaatttgga actccttcat 2400 tccagatacg ggtttacacg gtccagaccc ttctttctta cggacgctag gatgctcagc 2460 tatttattta gccccgagga tctcaggtaa attaggcttg aaagggagag aaggggtgct 2520 attgggttat gagaaaaacg ccaaagcgta tcgagtctgg gatttagaac ataaaaaggt 2580 catcattaca caatcagtca tctttaacga gactcttttt cctttttcta catctactct 2640 tacttctgat tcaccgaaat ttgttattct acaagacgat ggatatgata ttccagatgt 2700 tactgctcca aatctttctt ctttatcgcc gatcaattct acaaatacat ctgttcctca 2760 cattcatagt tctgctacca ctcaaggtct taaaccttta ttattatctt ctatatctcc 2820 tcgtacatcc ccttcaccaa tcactcgaag acatcattca atcaccgatg tcgaaccatc 2880 atcttccaca gtcttttcca acatcagagg ttctccaagc cgaaccgcat caacatcgac 2940 taaagacaat gtgggaagac ctgtcagaac aacaagtaag ccccaacacc ttggaaattt 3000 cataggacat gcaggtacag atctaaatga tacacctacc tataaccagg cgatgaatgc 3060 tcctgatgcc gacgaatgga aaaaggccat gaaaatcgaa ttcgactcac ttgttcagca 3120 caatgtcggt aaacttgttc ctaaaccatt agatgcgcgt gtgataggtg gaatgtgggt 3180 actgaagaag aagcgtgatg aaaacggggt ggtgttaaag tacaaagcaa gatgggtctg 3240 ttttggtaac agacaaatag aaggcattga tttcaatgat acgtactctg cggtaggcaa 3300 aacagacaca ttccaattac ttgtagccat agcagcttac ctcaaatgtt caatcattca 3360 atttgacatc attactgcct ttctgcatgg cataatcaag gaacgagtct acatacaaca 3420 agtgaaaggt ttcgttgaac ccggttcaga agggatggtg 
tgggagttag gtaaatctct 3480 atatggcact cgacaaggag catgagattt tagcgatcat ctttgtggtg ttttactggc 3540 tttcggattt actacatcaa aatcagatga ctgtcttttt atctttcaac gcaaatccaa 3600 tttcctctat cttcacatgc acgtagacga cggcttcctt atttcaaact ctgattccct 3660 tatcaacgat tttaaatctc acatgcttaa atcctatgaa ctgaagtgga aaataaagcc 3720 gactcttcac ctcggaatgc atttatcata tcataacgat ggaagcatct ttattaatca 3780 gactcattat cttcaagaca tccttgaccg attcggaatg gacgacctga atccagttaa 3840 actacctttt ccggttggtc tacgattgca aaatggatca ccagaagatg ttcaagctgc 3900 cgcttactta ccttatcaat ccttgattgg ctcattgaac tgggctgcaa ttagcactcg 3960 gcctgatatc gcttatgcgg ttagccaact ttctcgcttc aactcttgtt atacttttgc 4020 acactggaat gcggcgaaac acttaattcg ttacattaaa ggatctattt ctcaaggaat 4080 cttatttaaa ggaagtatgc cggcggagct taaaggattt ggtgatgctg attacgcgaa 4140 tgacccatta gacagaagat cggtaactgg ttatttattt acttatggag gatctatcat 4200 ctcatggaga agcaggcgac aaaagtctac agccctatct actaccgagg ctgagtacat 4260 ggccattttg gactgcgcca ggcacgcatt atggtttaaa tccttgttca acgatctcaa 4320 attacctgtt tcatctgttt ctctatcctc ggccggacaa gcgattcaac tctttaacaa 4380 caatcgaggg acagtcttac tgtcaaaaga accagttata aatgatcgct caaggcatat 4440 agatgttagg taccacttta ttcgagatca cgttcgactc cgaaacatca ccactgctca 4500 cgttcctaca acttccatgc ctgcggactt tctcaccaaa cccctatcaa ttgaagcttt 4560 tcaacgttgt tgtgaccaga tttctgtttt agaatgttcg agttaggggg gaa 4613 // ID Copia-42_MLP-LTR repbase; DNA; FNG; 977 BP. XX AC AECX01001249; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-42_MLP_; KW Copia-42_MLP-I; Copia-42_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-977 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001249; Positions 64161 65137. XX SQ Sequence 977 BP; 218 A; 210 C; 180 G; 369 T; 0 other; tgtcgtggta ttagtatacc acatttgcat ccttgttgtc tccattcctg agagaagaaa 60 tatgtgacaa ctttctattt acgttcaaga tgagtcaaga ccttcaggtc gatgagttag 120 gagttattat gttatgacaa ttggaaatga agagtgagaa ggatatggat tgggattaat 180 tgtacaattt ctttttcttt ttgtgactgt ttgggaatag gtcgcttgtg acctgagatt 240 tgagtagtat ggcccgagcg ttcactttgt tcaagttacg agttaagtta gatatataat 300 gaccacagtt gtaccaccca gtttctttcc tcatcggaaa gaaactgtaa gtggtctttc 360 tctttaaact attgcaactt gttttaccac atacttacat tttatcgtcg accttcagat 420 taatttttac ttacattctc cctatcaaat ctcacttgat cacctttttc ctgttgctaa 480 acgcacaggt ttgtctgttt tctcttttcc tctactattt catttttatt tctcgtacta 540 atgtgtgtgt gaattcaggt gtgttcttag ttctcagaag ctcgttgaag ctctgttttt 600 ggcttgcgcc gttgtactca cctacacctg atttccccgg gttagtattc atgctttaca 660 atgatcggaa agaaactatt aatttttact tacattctcc ctatcaaatc tcacttgatc 720 acctttttcc tgttgctaaa cgcacaggtg tgttcttagt tctcagaagc tcgttgaagc 780 tctgtttttg gcttgcgccg ttgtactcac ctacacctga tttccccagg tgtgttctta 840 gttctcagaa gctcgttgaa gctctgtttt tggcttgcgc cgttgtactc acctacacct 900 gatttcccca ggtggtcttt ttctgcacgg cttagccaac tgttagcttc aacgctctca 960 aacgtcactt cttcaca 977 // ID Gypsy-4_CCO-I repbase; DNA; FNG; 5825 BP. XX AC AACS02000012; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_CCO_; KW Gypsy-4_CCO-LTR; Gypsy-4_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-5825 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000012; Positions 1928631 1934455. XX CC Positions [2932-3477] - Reverse transcriptase CC Positions [4594-5073] - Integrase core CC 'CCCTC' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 737..1924 FT /product="Gypsy-4_CCO-I_2p" FT /translation="MTNAQELQDLRNQQTAQAQVLEQLLKQLTEKAGASRS FT PIKEPEPFKGQMNDARRFLRFFTNWASNQRQPLQDTTGKRNDRAWISSALS FT FMQGDAANWASRFLQQIVDHDAAEEKDKASKPWPFNGSWPEFTTQFNARFQ FT PADDKEAAQQELDNLAQGRSSVAQFAAKFQEVFARTGLSEEDGMSKFKRKL FT NRDDKLWLAMASMVKKPTTLQELVNTCISNEFVMRNAGVDSPRSSGPSTSS FT TPARDPYAMDIDATRVGPNGRSRDDYVAMMRGRCFGCGSRDHIKRDGDHTA FT LRCTHCQRMGHTAGVCQDRFMGFPTGRGLRSQPHRRVRATTMPFALEPDND FT FDVCSPRSSAPTSSIAASSSSPTPTPSIPPDVQRFAETMTKLTEGMAKVNQ FT DF" FT CDS 2104..5811 FT /product="Gypsy-4_CCO-I_1p" FT /translation="MIDCGATGLFLDLPFVKRHRITQHPLRHPIRLLNIDG FT TPNQAGSLTHFARLELTVDGVPGWYDFLITDLGGEDVILGLPWLRQLNPAI FT DWQKGTLQVPRRRSSVLVEELLDDDAVPPSRGAPANGAILEEIYDTDSIDS FT EPEPPSDASDSEPPDDGPPPLCKIRANRELRRLWVRSGLLEDQGDELWCAA FT GFTYSQQIAEKTQETRPEKSFEEMVPPQYRRHASVFSETESHRLPDHKPWD FT HAIELIPGAPATMRTKVYPMSQNEQEELNRFLDENLKKGYIRPSKSPLSSP FT VFFVKKKDGKLRFVQDYRRLNEITVKNRYPLPLVSDIINRLRGAKYFTKFD FT VRWGYNNIRIKEGDEWKAAFATNQGLFEPLVMFFGLTNSPATFQALMNAIF FT SDLIAAGKVAVYLDDILIFTVTLEEHRRVVHEVLDRLKRHDLYLRPEKCEF FT ERQEVEYLGLVIQEGEVRMDPAKVEAVREWPVPTNLRAVRGFLGFANFYRR FT FIKNFATIARPLNDLTKKDTPWSWGTPQQDAFDTLRRAFTSAPILTLWDPT FT LPTRIEVDASGFATGGALLQKHADGLWHPVAFRSASMQPAERNYEIYDREM FT LAIIEALKDWRHFLEGLPNPFEIVTDHANLAYWRSAQDLSRRQARWALYLS FT RFEFSLSHRPGKANTQADPLSRLESHQVHDGDDNQQQVVLKPEWFARLAMA FT RLLVNPLEDRIRRASSKESEVLEALERLKRTGPRRLVNGLAEWEEEGGLVY FT HRGKVYVPPDNDLRRDVVRQCHDDPTSGHPGVHGTLERLDRQYWWPTMRAF FT VKKYVEGCDVCARKKRTQHPQAMTTPLPVPGAPWETVGVDLITQLPEAHGY FT DAVIVFTDHYTKMIHALPCTSDVSTEGVADIFYREVFRLHGLPLQFVSDRG FT PQFASKVMRTLLRRLGITSSLTTAYHPQANGQTERANQEVEKYLRLYVSRR FT 
QDDWDQHLPMAEFVINSRVHSALDRSPFEVTYGYNPHFNIPVGKRSDLRPV FT DERLDRLIEARRDVEAALRLEKQQQKESYESGKRAAHSFKEGDFVWLSAKD FT IQLKVPTRKLGDLQLGPYKVIERLGDLDYRLELPPSLSRLHPVFHVDKLSP FT WKGNDVNGILPPPPEPVELEGELEYEVQEIVDSRWTRRGRGRRLEYLVNWK FT GYGSKDDTWEPEENLEHAPEKVKEFHSRHPSAPRRIAATLFASLPWKPLEN FT FTDAPPTDLEWELGRRLPMFIEDDEC" XX SQ Sequence 5825 BP; 1298 A; 1988 C; 1441 G; 1098 T; 0 other; ttaggtcaag ctacccctct gcgcaccgac aacgacgtac tgttgtccct gcttcgccgt 60 gttcaggttc gcggtctgat tcgccgatcg aatccaccgc cggaagaaag tcaagattcc 120 gtcgcccagc agtcgactcc tctgccggga tttctcgcat ttcgtctgga aggatccgta 180 ctcgcctacc cgcgacgcgc attacgcacg aagccctacc gctctagtca ccccctgtga 240 agttcagccg tgcccccacc gtctggaacg acccgtatca cacaccgttt tattccgact 300 tcgaaccgcc ggttacgtcg tcgagctctg tggtgaggaa aaccactcac tcgtcgctaa 360 cgagaatttt cgaccgtcgt tcaacgacac ccaacgccaa ctagccgcca cctcgcccct 420 tccccgcgtc tctcgcacct cctcactccg ctcgactccc gccggttcgc gcacagcgtc 480 acccgcgcgt acagtcgttc ccctcccctc cacccttccc ctcgacgccg tcgcctcctc 540 tagtgtccag aacacagagt cgagggcgcc gtcgaacacc tcttccccac ctacaaccga 600 caacgaacac gacgcctcgt ttgtcaccac cgaaggcgaa ccgctcaacg acaaccacag 660 cccaattaac gccacccctg cttccccgct tcaccccgtc tccatcccac ccgctggtac 720 cccctctgac gtcgaaatga ccaacgcaca ggaattacag gacctcagga accagcagac 780 cgcccaggcc caggtccttg agcaactact caaacaactc actgagaagg ccggcgcctc 840 tcgcagcccc atcaaggaac cggaaccgtt caaaggacag atgaacgatg cccgtcggtt 900 ccttagattc ttcaccaact gggcctctaa ccagaggcaa cctctccagg acaccaccgg 960 caagcgtaac gaccgggcct ggatttcttc cgccctctcg tttatgcagg gcgacgccgc 1020 caactgggcg tctcgcttcc tccaacagat tgtcgaccac gacgccgccg aagagaagga 1080 caaagcctct aagccttggc ccttcaatgg ttcttggcct gagttcacca cccagttcaa 1140 cgctcgcttc caaccggccg acgacaagga agcggcccaa caggaactcg acaacctcgc 1200 tcaaggtcgc tcgtctgtcg cccagtttgc cgccaagttc caagaagtct tcgcccgtac 1260 tggtctatcc gaagaggacg gcatgtccaa gttcaagcgc aagcttaacc gtgacgacaa 1320 actctggctc gccatggcct cgatggtcaa gaaacccacc accctccaag aactcgtcaa 1380 cacctgtatc 
tctaacgagt tcgtcatgcg caacgccggc gttgactctc cccgttcctc 1440 tggcccgtcg acctcgtcta cccccgcccg tgatccctat gccatggata tcgacgccac 1500 ccgagtcgga cctaacggac gctcgcgcga cgattacgtg gcaatgatgc gcggacgatg 1560 ctttggctgt ggatcgcgag atcacatcaa gcgggatggc gatcatacag ccctgaggtg 1620 tacccactgt caacgcatgg ggcataccgc cggtgtctgc caagatcgtt tcatgggttt 1680 ccccactggt aggggacttc gctctcagcc ccaccgccgg gttcgcgcca ctactatgcc 1740 cttcgcactt gaaccagaca acgacttcga cgtttgctcc cctcgctcgt ccgcccccac 1800 ctcgtccatc gccgcctcgt cttcgtcccc tacccctacc ccctcaattc cgccggatgt 1860 ccagcgcttc gcggaaacca tgaccaaact caccgagggg atggccaagg tcaaccagga 1920 tttttaggcc gggtgctcgc tcacgccgcg tcgcgtgcac cctttactcc gtactccatc 1980 tctgcttacg attcgatttg ttcctgtacg ataagtttga ctcgcgaaat ctcacctgat 2040 tcatcccatt ttcgtgtgaa cgtcaaactt cgaggcagga accgcagtac ggaagtagcg 2100 gccatgatag actgtggggc cactggcctg ttcttggacc tcccgttcgt gaaaaggcac 2160 cgtatcaccc aacaccccct acgccatccc atccgtttgc tcaacatcga cgggaccccc 2220 aaccaggcgg gaagtctcac ccacttcgct cgcctagagc ttaccgtcga cggcgtaccc 2280 gggtggtacg acttcctcat caccgatctg ggaggagagg acgtcatttt gggactgccg 2340 tggctacgac agcttaaccc cgcaatcgat tggcagaaag gaaccctcca agtaccgcgt 2400 cgcagatcca gcgtcctcgt cgaggagctt ctcgacgacg atgcagttcc cccctctcga 2460 ggggctccag cgaatggagc cattctcgaa gagatctacg acacggattc gatagatagc 2520 gagccagaac cgccgagtga tgccagcgac tctgagcccc cagacgacgg acctcccccc 2580 ctctgtaaaa tccgagcaaa ccgagagtta cgtcgactct gggtccgttc tggtctactg 2640 gaggaccaag gtgacgaact ttggtgcgcc gccgggttca cctactctca gcaaatcgcg 2700 gagaagaccc aggagacccg acccgagaaa tcattcgaag aaatggttcc cccacaatac 2760 cgtcgccacg cctccgtctt ctctgaaacg gagtcccaca gattgcccga ccacaagccg 2820 tgggaccacg ccatcgagtt aatccctggg gcccctgcca caatgcgcac caaggtctac 2880 ccgatgtccc agaacgaaca ggaggagcta aaccggttcc tcgacgagaa cctgaagaag 2940 ggatacatac ggccgtccaa atcccccttg tcttccccgg tcttcttcgt caaaaagaag 3000 gacgggaaac ttcggttcgt ccaagactac cgccggttga acgagatcac ggtcaagaat 3060 cgctaccccc tccccctcgt 
ctccgacatc atcaatcgac tgcgtggagc caagtacttc 3120 accaagttcg acgtacggtg gggctacaac aacatcagga tcaaggaagg agacgagtgg 3180 aaggcggcct tcgccaccaa ccagggcctc ttcgaacccc tggtcatgtt tttcggcctg 3240 acgaactccc ctgccacgtt ccaagcgctc atgaacgcca tcttctcgga tctcattgcc 3300 gctggtaagg tcgccgtcta cttggacgac atcctcatct tcactgtcac tctcgaagaa 3360 caccgccggg tagtccacga ggtcctagat cgactcaaga gacacgactt gtacctacga 3420 cctgagaagt gcgaattcga gcgtcaagag gtcgagtatc ttgggctggt tattcaagag 3480 ggagaggtgc gaatggatcc cgcgaaggtc gaagccgttc gcgagtggcc cgtccctacc 3540 aacctacgcg ctgtccgagg gttcctaggc tttgccaact tctaccgccg gtttatcaag 3600 aattttgcca ccatcgcccg tccacttaac gacctcacaa agaaggacac cccttggagt 3660 tgggggaccc cccaacaaga cgccttcgac accctccgcc gcgccttcac ctctgccccg 3720 atccttacgc tctgggaccc aaccctaccg acgcgaatcg aggtcgacgc atccggtttc 3780 gccaccggcg gtgccttgtt acagaagcac gccgacggac tctggcaccc cgtcgcgttc 3840 cggtccgcct ctatgcagcc cgcagagcgc aattacgaaa tctacgaccg cgagatgttg 3900 gccatcatcg aagccctcaa ggattggcgc cacttcctgg aaggattgcc taatccgttc 3960 gagatcgtaa cggatcacgc caacctcgcc tactggcgat ccgcgcagga ccttagccga 4020 cggcaagcgc gttgggcgct atacctgtcc cggttcgaat tctccctctc ccaccgtccc 4080 ggcaaagcca acactcaggc tgacccgctc tctcgactag agtcccacca ggtacacgat 4140 ggagacgaca accagcaaca ggtggtcctc aaacctgagt ggtttgcgag actcgcgatg 4200 gcgcgattat tggtcaaccc gctggaagat cgcatccgcc gggctagctc gaaggaatca 4260 gaagtgctcg aagctttaga gcgcctgaag agaactgggc cccgtcggtt ggtcaacggt 4320 ttagcagagt gggaagagga aggaggactc gtctatcacc gagggaaggt gtacgtcccg 4380 ccagacaacg accttcgacg cgacgtggtg agacaatgcc acgacgaccc gacttctgga 4440 caccccggcg tgcacggcac cctcgaacga ctggaccgac agtattggtg gcccacgatg 4500 cgcgcctttg tcaagaaata tgtcgaaggt tgcgatgtct gcgcccggaa gaaacgcacc 4560 cagcaccctc aggccatgac gaccccactc ccagtcccag gagccccctg ggagaccgtc 4620 ggcgtcgacc tcatcacaca gctgcccgag gcccacggct acgacgccgt cattgtgttc 4680 actgatcact acacgaagat gattcacgcc ctcccctgta cgtcggatgt ctcgacagaa 4740 ggtgtcgccg acatcttcta cagagaggtc 
ttccgtctcc acggcctccc cctccagttc 4800 gtcagcgaca gaggaccaca gttcgcctcg aaggtaatgc gcaccttact acgacgccta 4860 ggtatcacct ccagtctcac cacggcatac cacccccaag ccaacggcca aaccgaacgg 4920 gctaaccagg aagtggagaa gtacttacga ctctacgtaa gccgccggca ggacgactgg 4980 gaccaacatc tccccatggc tgagtttgtc atcaactccc gagttcattc tgcactcgat 5040 cgctccccgt ttgaagtcac ctacggctac aacccgcact tcaacatccc ggtcgggaaa 5100 cgctctgacc ttcgacccgt cgacgaacgc ctcgatcgcc ttatcgaggc tcgtcgggat 5160 gtcgaagccg cactgcgcct ggagaaacaa caacagaaag agtcttacga atctgggaaa 5220 cgtgcggcac actcgttcaa ggaaggcgat tttgtctggc tcagcgcaaa ggacatccaa 5280 ctcaaggtac cgactcgtaa acttggggac ctacaactgg gcccctacaa ggtgatcgaa 5340 cgacttggtg atttggacta ccgactcgaa ctgcccccat ctctatcccg actccacccc 5400 gttttccacg tcgacaagct ctccccatgg aagggcaacg acgtgaacgg gatcctccca 5460 ccaccgccgg aacccgtcga actggaagga gaattggagt acgaggtaca agagatagtc 5520 gacagtcgct ggactcgcag ggggcgaggg agacgactag aatacctggt caactggaag 5580 ggctacggct cgaaggacga cacctgggag cccgaggaga acctggagca cgcgcccgaa 5640 aaggtgaagg aatttcattc gcgacacccc tcagcacccc gccggatcgc ggcaacgttg 5700 ttcgccagcc taccctggaa acccctcgaa aatttcactg acgcccctcc cactgacctc 5760 gagtgggaac tcggacgacg cctaccgatg ttcatcgagg acgatgaatg ttagaagggg 5820 ggtaa 5825 // ID Gypsy-2_AM-I repbase; DNA; FNG; 7034 BP. XX AC ACDU01005031; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Allomyces macrogynus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_AM_; KW Gypsy-2_AM-LTR; Gypsy-2_AM-I. XX OS Allomyces macrogynus OC Eukaryota; Fungi; Blastocladiomycota; Blastocladiomycetes; OC Blastocladiales; Blastocladiaceae; Allomyces. XX RN [1] RP 1-7034 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Allomyces macrogynus genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; ACDU01005031; Positions 6097 13130. 
XX CC Positions [4293-4844] - Reverse transcriptase CC 'ACGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 213..4922 FT /product="Gypsy-2_AM-I_2p" FT /translation="MTPLNGDDLARIAAAFTANAYTTIIPDPNCNAVQAQT FT HDLDTTQTQYFDPSMQHPNYQTQDPIQFHDHRILVATAGSSANASGIPPLS FT TATTNTNANTVMTEGMTPIGATTTGLVPPVPTTTAANPAGHPKTSAPTMPL FT FNSKPDGPKSGAFAKTVGSHQHLTAEFYRPPMNVDDHLQAVSLLMEIAAAS FT DTTVTDQSKKAALVYSVEKALWPLLVSIPEWKVGSYTDFAARVRRELHLAG FT GTITTVDDLMGLGMSQPNGCRSARTLALWFDSYARVIDRLSDEYKASLLYS FT LADVRTQLELDQGLRVLGRPATYCDVHTTMLALDGTSSMRERAAMLNVNLG FT RMQAHARASDAPTATTVPPGVTRTVPPATMATPNARAVQPATQASIDELTA FT RLNELMIAVARQGGTRNVLAAPVGMSTQAPVTAPPANMPTYSPALPSVGIY FT TQGIDQGRAGPQVHSSPMPPYHGLPTRPFGYDYEGEPARSHEYNVPPRREY FT GYDGPRRDYGHDGPRRDYGYDGPRHANSYDGPRNGYSGGYSGAEPLCYYCS FT VRGHTMLTCQYVHEDLRHGLIMRNPYTRRLELPNGEQLRPSPGGFRMGVPR FT GHGPDYGHGEQGPGAHSPTGHGGAGIGPAVRQEERSRGHDPGPGAGAGERP FT DVRVAGVFVQRDFEDGMSRVADDRHASDDESAVVVAAVKRGQESAASGNER FT PRKVGRLVADKVVIEQPSWLTKAKANCKKQPPDPPKPPPGFTTGIAVPQPF FT PPMNPPVKPMLPVAAPALRPNNTPAPRARPDPMIIDLDEVLDDQPRWLPTA FT NKPAAATKTTTGPKAVKDANAATKPAPTARGIKGISAKTVISSKMVGNKTT FT PAYTYEYQLSLTEQDLVKEVAKLLSVATSPEKLLAMLTIQAGLRARFGSRR FT HVLKAINGVQVANIEIDDNDMDSSNIADDLVQIYTAADLDLLNSLRSELEE FT GKYANEHGQGYEACEADDNEEPFTEPAAMVAAASVGLPVGTLTTTSPAKWI FT RVRIGDLDTMALVDGGAEANIVDVTIARAAGMVVQSTTQRMVNADSTTRPF FT VGVSAGVFPIHIGPVKIGLHFHVQRDFDIGVLLGRPFLDAHACQEQNDGAG FT GCDYRVNFRNRKAITVPAVIKSGRGIFTSGQYIVDSSYLLAVESVSREAQA FT PALAPALLLPEPHHAKEPNADQVRAHDFEDHAAMEWLMAQPNVDEIVAVVF FT APQQVPSPDPEGSRVLVATMRKRVTNKVRPANVALPGGEAPQQGARRARAA FT VTDRWTTENIARIKVGDGLTETEVAALHGVLEKNQGALAFDESHLRMLEPH FT IELPVVIRTVPHAPWAQKPMKFSQQEWAAVTNIVRDRLHRGILEYSNGPYA FT NRYFLVRKKAGSWRFIQDVRQLNGIAVKDANMPPACDDLTEWLAGYPLLTL FT LDAMSGYDQVPLAMESRDLTAMWMPLGLVCMMRLPQGYTNSVAIFERAMSH FT VLVDVKGRFAENMVDDITIHAALRDRDESVGEDGTRKFVRLHLDALDRVLM FT ALDDAGLTVSVEKALFCAERAELLGTVVHRFGRELTPARVDKIVRWARPAT FT SAS" XX SQ Sequence 7034 BP; 1509 A; 2182 C; 2131 G; 1212 T; 0 other; ttggtggaga 
atccgggtag gcggtttggc aaaatacgtg gttggtgttg acgcctgtct 60 caaaattgtg attgcaggga aggagtagag tgcactgcgt gaccgaacca cgcgcgtgag 120 cccaacacgc aaatcgctcc attgatctcc tgcactcttt ctcgacaatt ctcgcacgcg 180 cgacctccaa ccgcagtggc aacaccatca ccatgacgcc cctcaacggc gacgacttgg 240 cacgcattgc cgcggcgttc acagccaacg catacaccac gatcattcca gaccccaact 300 gcaacgccgt ccaagcacag acccatgacc tcgacaccac ccaaacgcag tacttcgacc 360 cctcgatgca gcaccccaac taccagacac aagacccgat ccagtttcac gaccaccgca 420 ttctcgttgc aacggccggt tcatcggcca acgcatcggg catcccgcct ctcagcacgg 480 ccacaacaaa caccaatgcg aacacggtca tgaccgaggg catgaccccc attggggcga 540 cgacgaccgg cctggtcccg cccgtgccca caacaacagc ggccaatcca gcaggccacc 600 caaagactag cgcaccaacg atgccattgt tcaacagcaa gccggacggg cccaagtcgg 660 gcgcgttcgc caagacggtt ggatcgcacc agcacttgac cgctgagttc tacaggccgc 720 cgatgaacgt cgacgaccac ctgcaggccg tctcgctgct catggaaatc gcggcggcaa 780 gtgacacgac ggtcacggac cagtcgaaaa aggctgcatt ggtctactcg gtagagaaag 840 cattgtggcc actgctcgtc agcattccgg aatggaaagt cggctcgtac acggatttcg 900 ctgcccgggt gcgtcgcgaa ttgcacctcg ctggcggcac gatcacgacc gtcgacgacc 960 tgatgggtct ggggatgtcg cagccgaatg gatgccggtc agcacgtacc ttggcactct 1020 ggtttgactc gtatgctcgc gtgatcgaca ggttgtctga cgagtacaaa gcgtcgctgc 1080 tgtattcgct cgccgacgtg cgcactcagc tggaactcga tcagggcctg cgcgtgctcg 1140 ggagaccggc cacgtattgt gatgtgcata cgacgatgtt ggcactcgat gggacctcgt 1200 cgatgcgcga acgggccgcc atgttgaacg tgaacctcgg tcgaatgcag gcacatgctc 1260 gggcgagcga tgcacccacc gccaccaccg tgccgcccgg ggtcacacgg acggtaccgc 1320 ctgccaccat ggctacaccg aacgcgcgtg ccgtgcagcc ggccacacag gcatcaattg 1380 acgagttgac cgcacgcttg aacgagctca tgatcgcggt cgcacgccag ggagggacgc 1440 gcaatgtgct cgctgcgcct gtcggcatgt ccacacaggc accagtcacg gcgccaccag 1500 cgaacatgcc gacgtactcg ccagccctgc cgtcggttgg tatctacacg caagggatag 1560 atcagggacg tgcggggccc caagtgcact cgtcacccat gccgccgtac cacggactgc 1620 cgacgcgccc cttcggctac gactatgagg gagaaccggc tcgcagccac gagtacaacg 1680 tgccgccacg ccgtgagtat ggatacgatg 
gaccgcgccg cgactacggt catgacgggc 1740 cacgccgcga ctacggctac gatgggccgc gccacgctaa cagctacgac gggccacgca 1800 atggatacag cggggggtac tcgggcgctg agcccctgtg ctactattgc agcgtccgcg 1860 ggcacacgat gctaacgtgt cagtacgtcc acgaggactt gcgccacggc ttgatcatgc 1920 gcaacccgta cacacgtcgt ctcgagctac caaacggtga acagctgcgg ccgtcaccag 1980 gcgggttccg catgggcgtg ccacgcgggc acgggccgga ctacggccac ggcgagcagg 2040 gccccggcgc gcacagtccc accggccatg gcggagctgg cattgggccg gcggtcaggc 2100 aggaagaacg cagtcgcggg cacgatccgg gccctggtgc tggagctggc gagcggccgg 2160 acgttcgtgt tgctggcgtg ttcgtacagc gggattttga agacggcatg tcgcgtgttg 2220 ctgacgacag acatgcctca gacgacgagt cggcggtggt ggtcgcagcg gtgaagcgcg 2280 gccaagagag tgcagcgagt gggaatgagc ggccacgcaa agtcggtcgt ttggtcgcgg 2340 acaaggtcgt gatcgagcag ccgtcatggc tcaccaaagc caaggccaac tgcaagaagc 2400 agccgcccga cccgcccaag ccgccgcctg gcttcactac gggcattgcc gtacctcaac 2460 cgttcccgcc gatgaatccg cctgtcaagc ccatgctgcc agtcgctgca cccgctctgc 2520 gcccgaacaa cacgcccgca ccacgtgcgc gccccgaccc catgatcatc gacctcgacg 2580 aggtgctcga cgatcagccg cgctggctgc caacggccaa taagcccgct gccgccacca 2640 agacgaccac tggcccaaag gcggtcaagg acgccaacgc cgcgaccaag ccggcgccga 2700 ctgcccgtgg cattaagggc ataagcgcga agacagtcat ctcgtccaag atggtgggca 2760 acaagacgac gcctgcctac acctacgagt accaactgtc actgactgag caggacctcg 2820 tcaaggaggt cgctaagctg ctgtcagtgg ccacgtcgcc cgaaaagctg ctcgctatgc 2880 tgaccatcca agccggcctg cgcgcgcggt tcgggtcgcg ccgccacgta ctcaaggcta 2940 tcaacggcgt gcaggtcgcc aacattgaga ttgacgacaa cgacatggac agcagcaata 3000 tcgctgacga cttggtgcag atctacacgg ccgccgacct tgacctgctc aacagcctgc 3060 gctcggaact cgaagaaggc aagtacgcca acgaacacgg ccagggatac gaggcctgcg 3120 aggccgatga caacgaggag ccattcactg aacctgcggc catggtcgcg gctgcgtcag 3180 ttggcctgcc cgtcggaacg cttacgacca caagccccgc caaatggatc cgtgtgcgca 3240 tcggcgactt ggatacgatg gcgctggttg acggtggtgc cgaggccaac attgtcgacg 3300 tgaccattgc gcgtgccgcc ggcatggtcg tgcaatcaac gacgcagcgc atggtgaatg 3360 cggattcgac gacgcggccc ttcgtcggcg tgtcggctgg 
cgtgttcccc atccacatcg 3420 ggcccgtcaa gattgggttg cacttccacg tccaacgcga cttcgatatt ggggttctcc 3480 ttggtcggcc attcctcgat gcacacgcgt gccaggagca gaacgacggc gcgggcggct 3540 gcgactaccg cgtcaatttc cggaaccgca aggccatcac agtgccagcg gtcatcaaga 3600 gtggtcgtgg gatttttacg agtggccaat acattgtcga ttcctcgtac cttctcgcag 3660 tcgaatcagt cagtcgtgaa gcgcaagcgc cggccttggc tcctgctctt ttgctgcctg 3720 aacctcatca cgccaaggaa ccaaacgcgg accaggtcag ggcacacgac tttgaggacc 3780 acgcggccat ggaatggctc atggcccaac ccaacgtcga cgagatcgtc gcagtcgtgt 3840 tcgcgccaca gcaggtccca tcgcccgacc cagaaggaag cagggtgctg gttgcaacca 3900 tgcgcaagcg ggtcacgaac aaagtccggc cagcaaacgt cgcattgccg ggtggagaag 3960 ctccgcaaca aggtgcccgt cgtgcgcgcg cggctgtgac cgaccggtgg accactgaaa 4020 acattgcgcg catcaaagtc ggtgacggcc tgactgagac cgaggttgcg gcactgcacg 4080 gcgtgctgga aaagaaccag ggcgctctgg ccttcgatga gtcgcacctc aggatgctgg 4140 aaccacacat tgagttgccg gtcgtgatcc ggaccgtgcc acacgcgcca tgggcgcaaa 4200 agccgatgaa gttctcacaa caggagtggg ccgcagtcac caacattgta cgcgatcgct 4260 tgcaccgcgg catccttgag tactcgaatg gcccctatgc caaccggtac ttcctggttc 4320 ggaaaaaagc cggcagttgg cgcttcatcc aggatgtgcg ccagctgaat ggcatcgcgg 4380 tgaaggacgc gaacatgcca ccggcctgcg acgacctgac ggaatggctc gcgggctatc 4440 cgctcctcac cctcctcgat gctatgtccg ggtacgatca ggtaccgttg gcgatggaga 4500 gccgtgatct gactgcgatg tggatgccgt tggggctggt ctgcatgatg cgcttaccgc 4560 aaggttacac gaactcggtc gcgatttttg agcgcgctat gtcgcatgtc ctggtggacg 4620 tcaaaggccg gttcgcggag aacatggtcg acgacatcac gatccatgca gcattgcggg 4680 accgggacga gagcgtcggc gaggacggga cgcgaaagtt tgtgcgactt cacctggacg 4740 cactcgaccg ggtgcttatg gcactcgacg acgccggcct gaccgtgtca gtcgagaagg 4800 ccctgttctg cgccgaacgc gccgagttgc tgggcacagt cgtgcatcgt tttggtcgtg 4860 agctgacgcc tgcacgtgtt gacaagatcg tgcgctgggc acgcccagcg acatccgcga 4920 gctaaagggg gtcctcgccc tgagccagat ttgcagcgca ttcgtgcgca cctcgggaac 4980 gcgatcgagc cactgacgcc gctgacgcgc gggccaaaaa agaagcccga tgatcgtgtg 5040 gcatggaccg atgcggccga gcgatcattc cgactcgtca aggtgttatt 
ccaggacgca 5100 tgtgtgctag cgccgcccaa ctttgccgac gacgcgccac cattcattgt ctacacagac 5160 gcgggcgacc gcgcgctcgg cggcgtgctc atgcaggaac aagccgatgg tgagctgcga 5220 ccgatccggt tcatcagcaa aatcctcgac cgagcccagg tcaagtattc cacgccaaaa 5280 cgagagctcc ttgggatctt gtgcacggtc cagaaattcc ggccatacct ccatgggcga 5340 cgcgccgttt tgcgcaatga ttcccgcaca gtcattggga tgatcaatcg accaaccacg 5400 ctgccagacg cgaccacaac gcggtggatc gcgtatgtgg ccatgtacga cctggaaata 5460 gagcatgtcg gccgcagcaa aaatatcctg gctgacgcgc tgacgcagat tgaagcctat 5520 gagccccaat ttgattcgga cctgttcgac caggagcagg cggaccaaga ggtggagaag 5580 gcccttgatg acggcgcacc acgggtggcg cgagttgcgg ctgctcgggt tgtggccaag 5640 ggctcgacga cgacccaggt ggagagcatt ggtaacgacg gggactttga cctggccatg 5700 gaacgcaaga ccgctgagga tgcaattcag cagggccgat acgacgacga ccatgttgag 5760 gtcatacgca acctgctgtg gccgggcttc acaaagggag ccgtgcgcaa gacgtggctg 5820 caaaagaagc taggccgtta cttttggcaa catgggcacc tctggcgacg ccgacatctc 5880 aaagagccgg tcatcgtcgt tgacgataag gaacagcaac tacgggtctt gtacgaggtt 5940 catgacgagg gcgggcacaa aggccgtgac tcgacgttcg caaaagtcgc tgcgtgagca 6000 ttctggactg ggatgcatgg aatggtgatt gactacgtca agacgtgcga ggtgtgccaa 6060 gtgcatgatg caacccagga gcagcatgcg tatgggcgat tgccgattca cgggccaatg 6120 tcgtccatct cgattgatca catggccatg cctcttgcgg gcggcaagcg gcacattctc 6180 gacgcctgct gcacattcac aggattcctc gaagcggtcg cggtgccaga caccaaagcc 6240 caccgtgtag tcaagttttt gaacgacgtg ttctgtcgcc atggacctgc cgcagtagtg 6300 attgctgatc gcggcacggt gagtgcgcaa gacgtacagg attgcgtcaa acgatgggga 6360 ggcaagctca tcctgactgt tgcatacaac ccatgcggga atgcagtggt tgagcgagga 6420 caccacgctt ttgcttgatc actggccaag atttgtgaac agaacaagaa gaagtggacg 6480 gaggtgctgc cactcgcagt aatggccgac cgcacaacag tccggtgcag cacagggcac 6540 acaccgttct acctcgtgca tggccgcgag tgtacactcc ctgttgacct ccaactggct 6600 gagtgggccg ggctcgcaaa cgcctacccg gtggagaagg aagacctgtt gctcgtgcga 6660 ctgcaggcgc ttctggcatt cccaggtcag atcaagaaag cgctcgccaa ggtcgacaat 6720 caacgcactc aggatcgtga acggcacgct gacgcaatca acagccgacc atttgagtcc 
6780 ggcgacctgg tccttgtctg gaacaatgag tatgacaaga cattcacggc tgaacggaaa 6840 acggcacgca agtggctcgg gccattcatc gttctcaaga atcatggaag caaggttaca 6900 gttgcggaac tcgacggcgc caagctacag ctacccgtct cgatccaccg cgtgaaactc 6960 ttccacccgc gcgaatgggt cggccctgtg ttggctgagc gctcaggtcc ccgatcgctc 7020 atcgcggggg gtga 7034 // ID piggyBac-1_Mcir repbase; DNA; FNG; 2026 BP. XX AC . XX DT 12-MAY-2011 (Rel. 16.05, Created) DT 12-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-1_Mcir. XX OS Mucor circinelloides OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Mucor. XX RN [1] RP 1-2026 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Direct Submission to Repbase Update (12-MAY-2011)Proc Natl Acad RL Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 486..1838 FT /product="piggyBac-1_Mcir_1p" FT /translation="MLRAIVDNTNKYHDNSKAVGKNSRHWRPIDINELKVW FT IAIVIYQGLHKESCIEQYWNKSINAPVHNIKNEMPLYRFQQIKRFLHISND FT FEHAPSTKTNYFDKVEPLMSHIKQTSMKLYKPKSNVSIDEMMVQFSGRSAH FT TFKMKCKPIPEGFKIISLCDDGYTYTFLPTSRIVKSNVERIVGINYTGCQV FT LHLVKQLPFRNSAYNIYMDNYFTFSIGLFKHLLDLGIGACGTVRKSVMKSL FT LKLPDNPRLDWDTRSGAVKDGVLAFFWQDNGPVTMLSTIHEVTGANCDVIR FT PRRKPRETSTNGEKLKKVFGSSFVKDLPIPKIIDDYNHNMNGVDIADQLRS FT YYSTHQTSRRTWFPLFFWLLDTAIINSYIIFKSKGFQISQKDFRHNLLWEL FT IRSAKSNSQLRSCKEAGTIARNAIRTNANTTLSDERFSNKMHLPIHSKTRL FT VCKLCA" XX SQ Sequence 2026 BP; 678 A; 422 C; 369 G; 557 T; 0 other; cccttaaact gtaaagagga cagatctgtc ctcttagcag attttcagac tgtgtcttta 60 aaattttttc gtcaaaaagc cgtcaaatta aatcaaaata aagagaaaag ctcaaaacga 120 tcgaatatgt tattcttttc tttcaaatcc ttctcaacct cgagtaattg tctgataaag 180 atcacaaaat agaccaatct gactttcaat tcgaatatcc tgccctagaa agcaatcaac 240 atgaagagct ttcatcaatc attacttcac gtaagcagct tcaaaacgtt gaagaagaaa 300 tagaggaaac agcactacct 
agaaagaaaa ctcgctcaga tgccaaatct ttgccttcaa 360 ctccagtcta caatgctttc aaacataaaa agaaactgca taaagccagt gtcacgctcc 420 ctccaatctt tgacagaaat aacattactc ctgccgctgt atttggtcat tttttcactg 480 atagcatgct cagagccatt gtagataaca caaacaagta tcatgacaac agcaaagcag 540 tgggcaaaaa tagtcgacat tggaggccaa tcgacataaa cgaacttaaa gtctggatag 600 caatcgtgat ctaccaagga ctccacaaag aatcatgcat tgagcagtac tggaacaaaa 660 gcatcaacgc accagttcac aatatcaaaa atgaaatgcc actgtataga ttccagcaga 720 taaagcgctt tttacatatc agtaatgatt ttgaacatgc accaagcacc aaaacgaatt 780 acttcgacaa agtagagccc ttaatgtctc acatcaagca aacgtccatg aaactttaca 840 aaccgaagtc gaacgtatcc attgatgaga tgatggtaca gttttctggt agaagtgccc 900 acacattcaa gatgaaatgc aagccaatcc ctgaaggatt taaaatcatc tcactttgcg 960 acgatggata cacttacacg ttccttccta catctcgaat tgtcaagagt aatgttgaac 1020 gaatcgtcgg cataaactac actggttgtc aagtactcca tctagtaaag caattaccgt 1080 tccgaaattc tgcttacaac atttacatgg acaactactt tactttttct atcggcctgt 1140 tcaagcattt gttggatcta gggatcggcg catgtggtac tgtccgaaag tcagtgatga 1200 aaagtctgtt gaagcttcca gacaatccaa gacttgactg ggatacaaga tctggtgctg 1260 tgaaggatgg tgttctcgct ttcttctggc aagataatgg tcctgtcact atgctctcaa 1320 caatccacga ggttactggt gccaattgtg atgtgattag acctcgtcgt aagccaagag 1380 aaacaagcac aaatggcgag aagttgaaga aagtgtttgg tagtagcttt gtcaaggacc 1440 taccaattcc aaaaatcatt gatgattaca atcataatat gaacggtgtt gatatcgctg 1500 atcagcttcg ttcatactac agtacccatc agacgtctag aagaacttgg tttcctctct 1560 ttttttggtt gttggatacc gcaataatca actcgtacat cattttcaag tcgaaagggt 1620 ttcagatatc ccaaaaggac tttcgacata atcttctctg ggagctgata cgttcagcaa 1680 agtccaacag tcagttgaga agttgtaagg aagccggcac aattgctaga aacgccataa 1740 gaacgaatgc gaacacaact cttagtgatg agcgattctc aaacaagatg catcttccaa 1800 ttcatagtaa aactcgtttg gtgtgcaagt tatgtgctta gaagtctcaa caaaagccag 1860 ataatattaa gtgcagccct cccaaaagct acattttctg taaatgttgt gatgttgctt 1920 tatgtctcaa cagcacccgt aattacttcg tagaatacca tgaaaagaag aagtaataaa 1980 tatttttgag aataaatttt ttaaaatttg atttacagtt 
taaggg 2026 // ID Copia-1_VA-LTR repbase; DNA; FNG; 226 BP. XX AC ABPE01003719; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Verticillium albo-atrum genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_VA_; KW Copia-1_VA-I; Copia-1_VA-LTR. XX OS Verticillium albo-atrum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetes incertae sedis; Phyllachorales; OC mitosporic Phyllachorales; Verticillium. XX RN [1] RP 1-226 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Verticillium albo-atrum genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; ABPE01003719; Positions 25935 25710. XX SQ Sequence 226 BP; 63 A; 45 C; 60 G; 58 T; 0 other; tgtcggattg atgcttggat ccgcttttaa gcgtgtctga aatcgggttt attgaatatt 60 tctccattct gccgcagggc ggtgaaggag tggggttaag tttgtcactc gacagaacca 120 gtacaaaaca aggatcaacc aagttacaac caccaagagg tccttttaat gtgaggactc 180 ggccagaagt gacgaatacg atacgtgagg catcgagtat ctgaca 226 // ID TSU4-I_SB repbase; DNA; FNG; 5452 BP. XX AC AJ439550; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 03-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE S. bayanus LTR retrotransposon, internal sequence. XX KW Copia; LTR Retrotransposon; Transposable Element; copia-type; KW integrase; protease; pol; TSU4-I_SB. XX OS Saccharomyces bayanus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. XX RN [1] RA Neuveglise C.; RT "Genomic evolution of LTR-retrotransposons in hemiascomycetous RT yeasts."; RL Unpublished (2002). XX RN [2] RP 1-5452 RA Gentles A. and Jurka J.; RT "Yeast LTR retrotransposon."; RL Direct Submission to Repbase Update (16-MAY-2005). XX DR GenBank; AJ439550; Positions 322 5773. XX CC LTR deposited as TSU4-LTR_SB. 
XX SQ Sequence 5452 BP; 2269 A; 863 C; 898 G; 1422 T; 0 other; tggcgacccc agtgagggat gagacaagat atgttattga cgacaacata tctacgcagg 60 ttcaatcaaa agtcaaattt caaaacttaa gttttgatac ttccgttaaa aacaaacaaa 120 ggattcaaac tcaaattgat ttaaaaatgt atgaaagatt cttaaacaaa ttaaccacta 180 aaaagaataa aattgcaaat gaggctgaag aggatgatga tatttcaaat ctcgaccagt 240 taattgaagt cgatgaaaaa attgaaaaaa ctaataatat aattcttagg ttacaagaaa 300 agttagaatt gattgaattc aataaagatg ttaagaagtt acataaaaga aatgattcca 360 ctggaacata ttacttattc gacacaatta cctcaaaaaa taataaatat tatcctaagg 420 attggatatt caagtataag atgaataaaa ttggagatat tcctgttttt ttgaacaact 480 ttcaccaatt cattgaaaaa tatgaattcg ataatgtttt tgatcaacaa atacaaaaca 540 tcgatccccg tgaaaacgaa atcttatgta agattatcaa agaaggtttt gatgaaagtc 600 ctgatataat gaacattaat acagttgaca tctttagaat cattagtgaa ttaaagaaaa 660 aatacactcg tttttttggc agagatagaa ggttaaaggc ttgggaaaaa gtgttggtgg 720 atacaacttg taagaatact gaattattga taaatgaact tcaaaagtta atattaatgg 780 aaaaatggat tctttcaaag tgttgtcaag attgtccaaa tcttacacac attttagaag 840 aggctattct tggaacctta catgaatctg tgagaaatcc ggttaaacaa cgtttacaca 900 tgtactcaat tagtgaaaat gaaaaaactg aagaaattct gattaacatc gtaattgaaa 960 cagtaatgga cttgagtcca cctgactctc attatacaga aagaaattgt aagtactgta 1020 aatcggaatt acacagctct gtgaactgta gaaagaaagt aaacagagaa cttaggccta 1080 ctacatccag ctactcaaaa ggtaattatt cacaaggctc aaaccaaaaa gaatacacaa 1140 agactggcac aaaaccattc agaacttttg aaaagacaaa agaagaagaa caaaaagggt 1200 tcaaacaaaa ctctagtaat tattgatacc ggttcaggcg tcaatattac gaataataag 1260 aatttattac acgagtacga ggacaacaaa gaaaaagtaa aattcttcgg tattgggaaa 1320 gataactcag ttcctgttaa aggatcagga tacattaaaa taaaaagtaa cacaaatgat 1380 gattacttat taactcatta tgttcctgaa gaaaaaacta ccattatcag tggatatgat 1440 ttagccaaag aaaccgatct cgtattaaac caaaactact ccaccttgga aaacaaggac 1500 atgaacatta aaactcatgt aaaagatgga attattcacg taagaatgga tgacttaata 1560 gatcatcctg catatgatta taaaatcaat gcgatacaac ctacttcttc taaaaaaatt 1620 agactgaagc ccaaaattat aagcttaaaa 
gatgctcata aacgaatggg acatacagga 1680 gttcaacaaa ttgaaaactc tattaaacat agtcattacg aagaaagtat tgatttaatt 1740 aaagaaccaa atgaattttg gtgtgaaact tgtaaagttt caaaagccac gagaaggaac 1800 cattatgctg gatccatgaa tgaacacagt atcgatcatg aacctggttc atcgtggtgc 1860 atggatatct ttggtccagt atcaaattcg aacttggata caaaaagata catgcttatt 1920 atggttgata acaatacaag atattgtatc acctccacac attttaataa aaatgctgaa 1980 actatcttgg cacagatcaa gaagaatatt cagtatgtgg aaactcaatt cgacagaaaa 2040 gtcagagaaa tcaattcaga tcaaggaact gaatttacaa atgatcaaat cgcaaaatat 2100 tttgtttcaa aaggaattca tcatatattt tccgctacac aagatcatgc tgccaacgga 2160 agagcagaaa ggtacatcag aaccatcgtt actgatgcaa caactttgtt aaaacatagt 2220 aacttacgta ttaaattttg ggaatacgca gtaatatctg ctaccaatgt acgcaattgt 2280 ttagaaaaca aaaccacagg tcaactaccg ctaaaggcga tatctagtca acctgttaaa 2340 gtgagattca tgtccttctt accatttgga gaacaaggaa taatttggaa tcataatcac 2400 aataaactaa aaccatcagg acttagtgct ataatattat gcaaagatcc taatagtcac 2460 ggatacaaat tttttgtacc atctattaaa aaaattgtca cttccgataa ttacacaatt 2520 ccagactatg ctgtggatcc aatattaagg aacacacaga acatatacct agatgatcaa 2580 agcagatcag atactttcaa tgaagcagaa aacatagatg ctgtttcaag gttgtatgat 2640 tcactggaag attacgaaga tgatcataaa caagttacac tacttacaga cttgttcacc 2700 acagaagaac tagcccaaat cgaagctaat tctaaatatc catctcctag tgataatcta 2760 gaaggtaatt tagactacgt tttctctaac atagaagaat ctgacgaaga tgaatacgat 2820 catgtaacaa acatggatgt agattcagaa cttcaatcga aagaaaatat cactactgaa 2880 agtgaaacaa acgaaataaa taaaccaagt aatactgatg aggatgttta cgaagaaaat 2940 gtttatagaa ttcctacggc aatacaagaa aaccttgttg gaagccagaa aactataaac 3000 atcaataatg aggataatat tgctagcaga atgcaaaaga atatcagtgg aaatgaaata 3060 aactacaaag aattatcaga tgacgacagt gattgtagcc ttcatgattc tacaaatgac 3120 tcagttacga ttacaagcaa aaaggataat ttaacagatg ataaagattt acaatcacag 3180 caagaattat tcgaaaaagt tagtgatcca gaagtattac ctgaacacat gaaaattgaa 3240 aaagatgtgg agtctcaaaa ttcagataat gaaacctcac aaggcgtaca gtttcaacct 3300 gaatcaattg tcacatcatc aagtgacaat gatactcaaa 
atgatgacta ttcaactgac 3360 aaagaaagtc atcacctgcc cctggttgta aacgttatgg acaatactga ccaaacatat 3420 gataaaccaa acaaggaaaa aagtaaaaac aactctgata taagtatctc accgaaaggt 3480 aataatgaag agttagtaca attggtagat agtaacaaag ccgaaaaaca ggatgctaca 3540 ctagaatcat cagcgataac agatgaacca attgaaatag aaaacccagc agcaaacaaa 3600 gctggattct taaataaagc gttcaattct cttaataaga aaaggaagag acctattgaa 3660 aacaagactt cctttaatga cacagctaaa agagacaata aacgtcaaag aaaaaacata 3720 atcaaactac ttccggataa cacagaaaca agttctgcac cgagaataaa aaccatctac 3780 tacaatgaag ctatttcaag aaatgctgat ctcaaagaaa aacatgctta taaagaggca 3840 tacaggaaag aattacaaaa tctcaaggat atgaatgtat tcgacgtaga tgtcaaatac 3900 aacagatctg atgttcctag taatttaata attccgacaa atacaatatt ctcaaagaag 3960 agaaatggta tttataaagc aagaattgtt tgtagaggtg atatccagac accagacacg 4020 tataatgtta ttggaacaga atcactgaac cataatcaca ttaaaatatt tctgatgatt 4080 gcaaacaata gaaacatgtt catgaaaaca ttagatatta atcatgcgtt cctatatgcg 4140 aaattagaag aagacatata cattccacac ccacatgaca ggagatgtgt tgttaaacta 4200 aataaagcac tatatgggct taagcagagt cctaaagaat ggaacgacca tcttagacga 4260 tatttaaaca gtattggatt aaaagataac acttatactc caggactata ccaatccaag 4320 gataaaaaac tcatgatcgc agtttatgtt gatgactgtg tgattgcagc aagtgatgaa 4380 caaagattag atgatttcat aaacaattta gaaaatacct ttgagctaaa aatcaccggt 4440 actctaatag atgatatatt ggacacagac atcttaggga tggacttgat ctataacaaa 4500 aaacttggta cggtggattt aacgttaaaa tcattcatag aaaaaatggg tgagaagtat 4560 aaaaaggaat tggataaggt tagaaaaagt tcaattccac atacatcagt atacaagatt 4620 gatcccaaaa agggagaatt aaagatgacc gaaaaagaat ataaaaacgg tgtgttgaaa 4680 ttacaacaac tactaggaga acttaactat gttaggtaca agtgtagata cgatgttgaa 4740 tttgcggtca agaaagtagc tagattagta aactttcctc atgaacaggt attttacatg 4800 atttacaaaa tcatacaata tttagtacaa catacgaata tcggaataca ctatgataga 4860 gacctcaaca aagataagaa aatcactact atcactgatg catcagttgg aacagaatac 4920 gatgcgcaat caagaattgg agttataatt tggtatggaa agaacatttt caatgtatac 4980 tcaaataaaa gcaccaacaa gtgcgtatcc tcaaccgaag cggaacttca 
tgctatctat 5040 gaaggttatg cagactcaga aaccctgaaa gcaaccttaa cagaactcgg agaaggtaaa 5100 gataaagaaa ttacaatgat tacggattca aaaccagcta ttgaaggctt aaatcgtagt 5160 tacctacaac cgaaacagaa gttcacatgg ataaaaaccg aaataattaa agagaagcta 5220 aaagagaaga ttataaaact aataaaaatc aatggtaaag acaatatagc agatttactc 5280 acaaaaccag tatcaatttc tgactttgat agatttatta aagtattgaa taatcagata 5340 acaccacagg atattttggc ctcaacggac tattgataat tacattactt aattaagaac 5400 caaacgcaca acagatatct gttggagtac aataatatat ctttaagggg gc 5452 // ID Gypsy-9_MLP-I repbase; DNA; FNG; 5855 BP. XX AC AECX01002130; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_MLP_; KW Gypsy-9_MLP-LTR; Gypsy-9_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5855 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002130; Positions 20545 26399. XX CC Positions [4656-5135] - Integrase core CC 'GAGTT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 2361..5759 FT /product="Gypsy-9_MLP-I_2p" FT /translation="MGNARICDEGVCIPSNTLTPPQRECASVLIPKVKETA FT GNQTSFLKQVLSKNWKKSDSLWQPRHRSFLPNPMILQTPRLDAARASWNVS FT AKLAAEKTSESTKLSAADLVPKEYHDYLPMFEKSKSRVLPPHRPYDFCVDL FT VPDATPQANRVIPLSPAENEVLNKMIEEGLASGTIQRTTSPWAAPVLFTGK FT KDGKLRPCFDYRKLNALTVKNKYLLPLTMELIDSLLDADRYTALEMRNGYN FT NLRVREGDEAKLAFICKAGQFEPRTMSFGPTGAPGYFQYFVQDIFRDRIGR FT DMAAFLNDILIYTKPGEDHEKAVKAALDTLREQNVWLKPEKCKFAQEEITY FT LGLKLSPNKISMDSDKVKAVTDWPTPKNVSEVQQFVGFANFYQRFINNFSR FT IARPLHELTQHKVPFEWTDERDLAFKSLKEAFTTAPVLKIADPYKAFVLEC FT DCSDFALGAVLSQVSEDDGELHPVAFLSRSLIQAERNYEIFDKELLAVVSS FT FKEWRHYLEGNPHRLNVIVYTDHKNLESLMTTKELTRRQAQWAETLACFDF FT EIRFRPGRQSTKPDALSRRPDHKPEDGQKLTYGQILKPSNLPADAFIHELE FT VIDNWFEEEECDISYFLEDEEGVEEVKDENDEPIDDEDETVWDDSQLLNEI FT RKTLHKDSRLSDIISTCESHKEGRWDEYVYTGGLLYFKGIVEVPHDTELKR FT KIVKSRHDSPLAGHPGRMKTLNLVKRNYRWPSMKAFINAYVDGCHSCQRVK FT SRSTKPFGSLQPLPIPSGPWTDVCYDLITDLPVSNGMDCILTVIDRLTKMC FT HFIACKTTMSSEELAKAMIKYVWKYHGTPQSVTSDRGNVFISKLMKELNHQ FT LGIRTQSSTAYHPQTDGQSEIANKAVEQYLRHFVSYKQDNWTDLLDLAEFA FT YNNSPHTSTGISPFKANYGYNITYSRIPSSEQCIPAVEEMLAQLKEVQDEL FT RESLVLAQESMKAHYDKSKRDSPDWQVGSKVWLDARHISTTRPSAKFAHKW FT LGPFSILARVSKNAYKLNLPASMSRVHPVFSVGLLRPYEESTIAGQHQEPP FT PPIIVDEEEEFEVADILNKRKRGSKIEYLVSWKGYGPEDDTWEPEGSLGNA FT KELVDKFNSRYPEAEKNYKRTRRVK" XX SQ Sequence 5855 BP; 1907 A; 1217 C; 1353 G; 1378 T; 0 other; tattgaagca tctcttaaaa ttagaagtca gagatagaag gacgcaagaa gtcaagaagc 60 tgatcgaaga agttatcggt caaaagtcaa aagaagaaat taaattaagt ttaaaagata 120 attaaaaaaa aagtttaact ccgcaagaga agaagatttt ccatctcatt cacatctacc 180 ccacaccctg tcaaacctta tctgctacat tccaccacgc gaaacaattc aaaacgccta 240 acttcacgat atcccagttc tcggaaagcc ctacgtctag ccgatcatcc gctgaattcg 300 catcaattac aaatccggga tctgaaccgg gtatggccga tacaaccatg ccgacatctg 360 ggaatacaga ccctctagca cagatcatgg cgcggttgaa tttaatggac gctaggttat 420 tggaagagac acgtcggcga gaagaagccg aactcggccg acaacaagca gagcaacggt 480 tagacactac cagcataaag taattttccc aagtaaaaaa tgaatacatt 
tagagcaaaa 540 aatgattgaa aatgcagttt atcagcatta aatcggcatt aaatcagcat tattcggcat 600 taaatgagca tttttgatta aaaatgcatt aaatgtaatc attttttcct tgggaaaaat 660 accttttgct gggagtgaga agaaattcaa cgagccaacg ctaacgcgca agcatcagcg 720 ctgactatcc aacctgcacc agtacctagt actatcaatg tgaaagctcc gaaagtggca 780 actccggata aatttgatgg tactcgaggc agtaaagctg aaatttttgc aaatcaagtc 840 ggcctataca ttgttatgaa ccccttacag ttccctgacg atagaccaaa aatcggatgg 900 gctttatctt acatgaacgg caaaggtggc gagtgggcta aaccatggac tcagaaactg 960 cttaacggta aaacggacga ggtactcacg tgggatggat tctctgcggc attcgaagcc 1020 acattctttg attcagagcg tgtagcgaaa gctgaaacgg cgatacgagc tctacggcaa 1080 actaattcgg ttctcgcgta ttctctcaaa ttcaacgacc ttgctattgt tgtgaaatgg 1140 ccggattcaa ttcttatcac tcaattcaaa caagggctga agccggagat ccaagttcaa 1200 atagtacgcg acgtgttcac ttcattagat caaataacag agctggctat aaaaattgat 1260 aacatattac ataagcgtgg tgatgatctc aagttggagg tgaaggaaac ggtggtagac 1320 cccgatgcca tggactgttc agcatttaga ttcaatatat caaatgaaga atatcaacgt 1380 agatgggata aagagttatg tttcaaatgt gggaaaagtg gtcatagagc aagagaatgt 1440 ggtaggggaa atttaaatag gaagtggaga gggaagtgga aagatgggaa ggatgttaaa 1500 gttagttcaa ctgaagcaaa agaagaagaa ggtgtaaacg aattgagtaa agctgatgag 1560 tcaaaaaatg gcgttgctcg aggatgaagg ttgttccttc ctcgagctct ttaggggatg 1620 aagtagagat agacatgggt gcaattaagt ttaacattga tgtacttgaa atgaaagaca 1680 atcgcatctt tgcaaccgtg tctattcatg acccaacccg agagacaacc cactttgccc 1740 gagccatgtt tgactcaggc gcgactcatc atgttttgaa tgaagccttt gtgaaacgca 1800 acgacctgac cacgaaagaa cttccaaatg ccaaaccagt aaccggattc aacggagcac 1860 agtcctcaat cacgcacgta ggcaactttt gcattggaca ggaaggtaga agattcgaac 1920 cagcaacatt cctaatttca ccgctgaaag attcaattga ttgtattatt ggtattgatt 1980 ggatatgcaa gaatcatcaa ttgattgatt ggaaacgacg caccttacag cagacgacga 2040 ccagcattgc gactaccgaa gtagtctcgc caccaccgaa aacaatccca ggagaatctt 2100 gagaggagac tttgggacaa gctaggacta gtggcgaggg ggtgcgcatc ttaaacgata 2160 cactaacacc cccgcaatgt gagtgtgatg cagttttatt atcccgtact gaagaaacag 2220 
ctggcaatca ggatcctctc caaattaaca ggtctctaga aaaagaacgt accgaggcga 2280 acgtcaagca ggatgttgca gctgccgtgc cagtttcgtc caatccgaaa accaccccac 2340 ctgatctagg agaagaccac atggggaacg ctaggatctg tgacgagggg gtgtgtattc 2400 caagtaatac gctaacaccc ccgcaacgtg agtgtgctag tgttttaatt cccaaagtga 2460 aagaaacagc tggcaatcag acatctttcc taaagcaggt tttgtcaaag aactggaaga 2520 aatccgactc actgtggcaa ccaagacacc ggtcattcct gcctaatcct atgatcctac 2580 aaacaccgag gttggacgcg gcacgagcct catggaatgt ttctgcaaag ctagctgcgg 2640 agaagacatc tgagagtacc aagctgtcgg cagctgacct ggtccccaag gaataccacg 2700 attatctgcc tatgttcgag aagtctaaat caagagttct acctccgcac cggccgtacg 2760 atttctgtgt ggatctagta cctgatgcca cacctcaagc taaccgagtt atacctttat 2820 cacctgctga aaacgaagtc ctcaacaaaa tgatagagga aggactagct tcaggaacta 2880 tacaacgtac aacatcgccc tgggctgccc ccgtattgtt tactggaaag aaagacggaa 2940 aattacgtcc ctgctttgat tatagaaaat taaacgcact cacggtcaag aataaatacc 3000 tgcttccgtt gacaatggaa ttgattgata gcttactaga cgccgataga tatactgcac 3060 tcgaaatgcg aaatgggtat aataatctac gagtgaggga aggtgacgaa gcaaaattag 3120 cgtttatctg caaggccgga caatttgaac cacgtacaat gtcgtttggt ccaactggag 3180 ccccaggtta ctttcaatac tttgttcaag atatattccg agatcgaata ggtcgagaca 3240 tggctgcgtt cttaaacgac attctaattt ataccaaacc gggtgaagat catgagaaag 3300 cggtcaaagc tgcgttagat acattacgcg aacaaaacgt ctggctgaaa ccggagaagt 3360 gcaaattcgc tcaagaagaa attacatacc ttgggttgaa attgtcccct aataagatat 3420 ccatggacag cgacaaagtc aaggcggtaa ctgattggcc tacaccaaag aacgttagcg 3480 aggtacagca atttgtgggg tttgctaact tctaccaacg gtttatcaac aacttttcaa 3540 gaatagcccg gcctctacac gaactgactc aacataaagt accctttgaa tggaccgatg 3600 aacgagactt agcttttaaa tccctgaagg aagcctttac aactgcacca gttttgaaga 3660 tagccgatcc ttataaagct tttgtgttag aatgtgactg cagtgatttt gcgctgggag 3720 cggtgctatc tcaggtgtcg gaggacgacg gtgaactaca tccagtagcc ttcctatccc 3780 gttcattgat tcaggcggaa cgaaactacg aaatttttga taaagaatta ttagcggtcg 3840 tgtcatcatt taaggaatgg cgtcattact tagaaggaaa ccctcaccgc cttaatgtta 3900 tagtctatac 
ggaccataag aatttggagt cgttgatgac aacaaaggag ctcacacgtc 3960 gacaggcgca atgggccgaa acccttgctt gttttgattt cgagattcgc tttcgtccgg 4020 gaagacagtc gacaaaacca gacgctttgt ctagacgacc ggatcacaag ccagaagatg 4080 gacaaaagct gacgtacgga caaattttaa agccctcaaa cttacctgca gatgcattta 4140 ttcatgaact ggaagtgata gacaattggt ttgaagagga ggagtgtgac atttcgtatt 4200 tccttgagga tgaggaggga gtagaggaag tcaaggatga gaacgacgaa ccaatagatg 4260 atgaggacga aacggtatgg gatgatagcc aattattaaa tgaaatccgg aagacgttac 4320 ataaggacag cagattgtca gatatcattt caacatgtga gagtcataag gaaggaagat 4380 gggatgaata tgtttatacc ggtggattac tttatttcaa agggatcgta gaagtacctc 4440 acgacaccga gctgaagagg aagattgtca aatcgcgcca tgacagtcca ctagcaggac 4500 atccaggccg gatgaaaacc ttgaaccttg taaagcgtaa ctaccggtgg ccatcaatga 4560 aggcgtttat caatgcgtat gtggatggat gccactcatg tcaaagagtg aaatcaagat 4620 caacaaaacc atttggatca ctgcaaccgc taccgattcc atcaggacca tggacggatg 4680 tttgttatga tctaataaca gacttaccag tgtcaaatgg aatggactgc atactgaccg 4740 tgattgaccg actaaccaag atgtgccatt ttattgcttg caaaacaact atgtcatcag 4800 aagaattagc aaaagcaatg ataaaatatg tgtggaaata tcacggcact ccgcaatcag 4860 tcacatcgga tagaggaaac gtattcattt caaaattaat gaaggaattg aatcaccaac 4920 taggcattcg gacgcaatcc tcgacggcgt atcaccccca gacggatggc caatccgaaa 4980 tagccaacaa ggcagtggaa cagtatttac gccattttgt tagctataaa caagataact 5040 ggactgactt attagacctt gcagagtttg cttataataa tagtccccat acgtcgaccg 5100 gaatctcacc gtttaaggcc aactatggat acaacatcac ttattcacga attccatcga 5160 gtgaacagtg cataccagcg gtagaggaga tgttagctca attgaaggag gtccaggacg 5220 aattaagaga atcacttgta ttagcgcaag aatcgatgaa agctcattat gacaaaagta 5280 aacgtgactc accggactgg caagtagggt caaaggtatg gctcgacgcg cggcatatct 5340 caacaacaag accaagcgcc aaatttgcgc ataaatggct aggtcccttt tccattttag 5400 caagggtctc aaagaatgct tataaactga acctacctgc atctatgagc cgcgtccatc 5460 cagtcttctc tgtaggactc ctgcgacctt atgaagaaag cacaatagcg ggacaacatc 5520 aagaaccacc accaccaatt attgtggacg aggaagagga atttgaagta gcggatatac 5580 tcaataagag aaagagaggc 
tcaaaaatcg aataccttgt tagttggaaa gggtatggac 5640 cagaggatga tacttgggaa ccagagggta gcttaggaaa cgcaaaagaa ttagtggata 5700 agtttaatag cagataccca gaggcagaaa agaattataa aaggacacgg agggtaaagt 5760 gagggcgatg ctttttccct actggggttt tttaatgcta gcccggggga agacgtcggc 5820 ccaacaaagg agggcgcgac gtaaaggggg agtag 5855 // ID Gypsy-119_MLP-LTR repbase; DNA; FNG; 150 BP. XX AC AECX01000800; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-119_MLP_; KW Gypsy-119_MLP-I; Gypsy-119_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-150 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000800; Positions 120480 120331. XX SQ Sequence 150 BP; 40 A; 36 C; 22 G; 52 T; 0 other; tgttatgatc ccatataaac ataatgcttg tatcaagtag agctctctgg cattagactc 60 acgccgctct gctcttgttg tatcctcacg gttcaatctt agttataata catcaccatc 120 tctgatcttg tattactgaa gctcataaca 150 // ID Copia-46_MLP-I repbase; DNA; FNG; 5309 BP. XX AC AECX01001112; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-46_MLP_; KW Copia-46_MLP-LTR; Copia-46_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5309 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). 
XX DR Genome; AECX01001112; Positions 462668 457360. XX CC Positions [2543-3085] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1151..5275 FT /product="Copia-46_MLP-I_1p" FT /translation="MVTLNRVVNAPIVDALNRARFEPKITIYTDNGPETQD FT LPAKLWKTVTNYHSSRSEELRLLYERALNLITQAQNVPLLTHIQNFQNAVT FT KYKTAGGQMSDEDLGRKLLISLNNNHFQDAKEIAISGVKDFDLVVSELKKR FT LDAVSMLTRGSTRNHTTVHHAAEASAISHSNSRHKSSLKCTKKKCVGVNHT FT PDQCFKKPGNKHLQREWIEQRVRLGQWNGEPPKDLGNSSASAITFNEPTME FT QLENAFNSLNASASHVSVDSLSKLSLRNHPSDIEVLIDTAASHHMFKERNV FT FMNYVDMEEDNDFLNMAGGDATLKIHGRGDVKFIGPDGHNFDLHNCLYIPN FT LKRSLIGGTILLKNDFNFVAKKADGRFEITKDNNRAFEGVLNDDVKLLRSY FT VRPFSPDSNPKANLIMSPTDSNVLNLHRRLGHPNIRYLKSMVSQGSVKGLN FT LNVSAIPNNLPCDSCDLSKSHRIPHNKIHVRSSNLLENIHLDLSGIIRTSA FT VCGSVYFMLFTDDYSRFRHVVGLTTKSADAVFVKIKQYISLVKRQCNSKIK FT SITLDNGYEFINDTMVPYCKEVGIYLRTTATYTPEENGVAERSNRTITEPA FT RAMMLEANLPIRFWLYAIKAAVYLKNRTISSSLPDGITPFQLWYGRQPDVT FT HVKPLGCLCYVLIRKAIRHGKFNQVSRQAILLGHTEHNLNYEVFIFDTNSI FT VISHDVVFRENVFPFKKLKSYDISHLSFNEEENLLLSDPSVLEPQNEVQGD FT DVPIPGGEGMHQDEIEPIVLPLTDQEPEPILQTEDQPQPISTPPPRRSERE FT RRPVDKYNPSASYAYWDDEGAFTDLHDAFPCAFAVGSIVRLVQEPHNFKSA FT MNGPDKDQWKAACGKEMKNMEDRKVWRLVPRPSDQAVIGSKWHFKVKLNPD FT GSINKYKARIVAKGCTQTHGVDYDETFAPTGKPASFRTVVAFATYHGLPIH FT SMDAIAAFLNSGLKHRIYMEQPEGYEIVISGQDLVCELLQALYGLKQSARE FT WNDDFRTKCLKAGFVQSPADECVYIRRRSHDVCLFYLHVDDLAITGNNIDA FT FKKEIGGFWPMEDMGISTCVVGIQIARAGPNHYILGQEAMAKSLLERFGMM FT DTKPASTPFPGGTKLTKSTPDEARSFSLLNLPYRSGVGSLMYLSQCTRPDI FT TYAVGCLSQHLEKPLLRHWEAFKHILRYLKGTLNYCIHYQRNLPPPALPTI FT SSNNGFSLPEHFADSDWAGDKSTRRSTTGYIFMLCDGAISWRSRLQQTVAK FT SSTEAEYRAANEAGDEMIWLSRLLASIDLPQQTPYLLNSDSLSTIDISENA FT VMHGRTKAIEIHHHWLREKVKEGVIKLVYCASEDMLADILTKPLHPGPFND FT FRRRIGVKEIDG" XX SQ Sequence 5309 BP; 1591 A; 1064 C; 1156 G; 1498 T; 0 other; tatggtagcg agagatttat cttcaatcca ataatcaaat gagtcaagac caacaagaac 60 cggatccatt cttcgcacaa tcaacaagct tagcaccttc atctgagaca ttggaaaatt 120 ccgaagacga agcagaacag acaaccagac ttatatcaca actattaaca tcaccaacat 180 
ccctcaaccc aaattcatca tcagatccaa ttttagatcc tagtccatca cttgatccta 240 aacccaatcc tattactact atgtcagcac caatgattcc tagcgaacca agtcctcaac 300 atcaggcaat ggttttgggt caatctttag gaagatcatc aaaaatattg actgacaaaa 360 actatacact atggtcatcc ttcattcgcg gtggtttaaa atctgtgttc ttatccgaat 420 acttaatctc agatgaaatc aaattagaaa acagccaatt caccaatgaa gttagtcgca 480 catgcatcac aaactggatg ctcaacaaca tggatgatgt tagagcatcc actatgttga 540 gggtgtttta tcgtaaattt tgcggtcctg attgcaaaat ttaatcatgc gtgttgtagc 600 taagttgtta accgcatgaa atccactatg ggatgatgta acgtaggatt taactacgat 660 aatctgttct tagaacacta tacacgctat ggcaaagtcg agactcgaat taaagtatct 720 taccacacct cacccagcct atgctgatga ctaccttggt gaccagaaag aaccaaattg 780 atgccttcag gttttgacaa tcagcaaaga gctttctcac ctcttgtttg atcttggtga 840 catcgatcgc gcgagatgac ttcttttgaa caaagatgct ttttttagca aatccggagt 900 aaacagctac tatcaacaag ctgagggttc gagggaagag ggaaggtttt gagtgaaggt 960 ggtgagtgca gagctggatg ggagatagat cattggatgg ctgttgaccg tttggatgcg 1020 ccaaaagttt acgataaatc aagtcgtagg ctttcgtaga tttgaacaag cggactgcaa 1080 gacttgacgc ggtcgtacgg accgcatgtc taaaacaggt ccatagtgga cctgtctgga 1140 ttttagcgtt atggtaacgc taaatcgcgt cgttaacgca cccatagtgg atgctcttaa 1200 tcgggctaga tttgagccta agataaccat ctatactgac aacggacctg aaactcaaga 1260 cttaccggcg aaactatgga aaactgtgac taattaccat tctagtagat ctgaagagct 1320 tcgattgtta tatgaaagag cgcttaacct tattactcaa gctcagaatg ttccactttt 1380 gactcatatt caaaattttc aaaatgctgt taccaagtac aagacggcag gtggtcaaat 1440 gagtgatgaa gatttgggtc gaaagctatt aatatctctc aacaataatc attttcaaga 1500 tgccaaagaa atagcaatct ccggagtcaa agattttgat ctagtcgtat ctgaattaaa 1560 gaagaggttg gacgctgtat ctatgcttac tagaggttct actagaaatc ataccacggt 1620 gcatcatgct gcggaagcta gcgcaatttc acattcaaat tctcgtcata aatctagctt 1680 gaaatgtaca aagaagaaat gtgtcggcgt caatcatact ccagatcaat gttttaagaa 1740 gccgggaaac aaacacttac agcgtgaatg gattgaacaa cgcgtcaggc ttggtcaatg 1800 gaacggcgag cctcccaagg accttggtaa ttcttcggct tcggctatca ctttcaacga 1860 gccaactatg gaacaactcg 
agaatgcttt taattcacta aatgcgtcgg ctagtcatgt 1920 ctcagtcgat tcattatcta agttgtcgtt gcgaaatcat ccaagcgata ttgaagtttt 1980 gatcgatact gccgcgtctc atcatatgtt caaagaacgt aatgtcttta tgaactacgt 2040 tgatatggaa gaagataacg attttcttaa tatggccgga ggtgatgcta ctttaaaaat 2100 tcatggccga ggagacgtga agtttatagg tccagacggt cataactttg atttacataa 2160 ctgcttatac attcccaatc tcaaacgaag tttgattgga ggtaccatat tacttaagaa 2220 cgacttcaac tttgttgcta agaaggccga tggtcgtttt gaaattacta aggacaacaa 2280 tcgagctttc gaaggtgtat taaacgatga tgtcaagctg ttgaggtcat atgtgagacc 2340 gttttcgcct gattcaaatc ccaaggcaaa ccttatcatg tcaccaactg attcaaacgt 2400 cctgaactta caccggcgtt taggtcatcc caacatacga tacttgaaat cgatggtgtc 2460 gcaaggaagt gtgaaagggt tgaatttgaa tgtctcggct attcccaata atcttccatg 2520 tgattcctgt gatctgtcaa aatctcatcg tattcctcac aacaaaatcc atgtgcgcag 2580 ctccaatcta ttagaaaata ttcaccttga tcttagcgga attatacgaa caagtgctgt 2640 gtgtggcagt gtatatttca tgttgttcac ggacgactac tcaagatttc gtcatgttgt 2700 tggtttgact acaaaatcag ctgatgcagt gtttgtcaag attaagcaat atatctctct 2760 tgtcaaaaga cagtgtaatt ctaagattaa atccatcact cttgataatg gttatgagtt 2820 catcaatgat accatggtgc cgtactgtaa ggaggtgggc atatacctac gaacaactgc 2880 aacttatacg ccggaagaga atggggtggc tgaaaggtca aatcgtacta ttacggagcc 2940 tgctagagca atgatgcttg aggcgaatct tccaatccga ttttggctat atgccattaa 3000 ggctgcggta tacctgaaga acagaacaat atcttcatct ctgccagacg ggatcactcc 3060 tttccaatta tggtatggtc gtcagcccga tgtgactcat gttaaaccat tggggtgttt 3120 gtgctatgtt ttaatcagaa aagcaattcg acatggaaag ttcaatcaag tttctcgaca 3180 agctatatta cttggtcata ctgaacataa cctaaattat gaagtattta tttttgatac 3240 aaactcaata gttatttctc atgatgtcgt gtttcgggag aatgtctttc ccttcaagaa 3300 attaaagtca tatgatatat ctcacttatc atttaatgaa gaagaaaact tgttattatc 3360 ggatccttcc gttcttgaac ctcaaaatga ggtgcaaggc gacgacgtcc ctatacctgg 3420 tggtgaagga atgcaccagg atgaaattga acctattgtc ctacctctaa ccgaccaaga 3480 acccgagcct atattgcaaa ccgaagatca accacaaccc atatcaacac ctccccctcg 3540 acgttcagaa agagaaagga ggcctgttga 
taaatataat ccatccgcta gctatgcata 3600 ttgggatgat gagggcgctt tcactgactt gcatgatgcg tttccgtgtg cgttcgcggt 3660 tggatcgatt gttcgcttag ttcaggaacc ccacaatttt aaatcagcta tgaatgggcc 3720 ggacaaagat caatggaagg ccgcttgcgg caaagaaatg aagaatatgg aggacaggaa 3780 ggtatggcgt ctagtgccga gaccgtctga ccaagcggtg atagggagta aatggcattt 3840 caaagttaaa ctcaatcctg atggatcgat caacaaatac aaggctcgaa ttgtagcaaa 3900 agggtgcact caaactcatg gtgttgatta tgatgaaacg ttcgcaccta cggggaagcc 3960 ggcttctttt cgcactgtcg tcgcttttgc aacttatcac ggcttgccta tacactcgat 4020 ggatgctata gccgctttct tgaatagtgg actgaaacac aggatttata tggagcaacc 4080 tgaaggttat gagattgtga tcagcggaca agatttagta tgcgaattgc ttcaagctct 4140 ttacggttta aagcagtcag ccagagaatg gaatgacgat ttcaggacta aatgtttgaa 4200 ggctggtttc gtacaatcgc ctgctgatga atgtgtgtac atcaggagga gatcccatga 4260 tgtctgcttg ttttatttac acgttgatga cttggcaatt actggaaaca acattgatgc 4320 ttttaagaag gaaatcggcg gtttttggcc gatggaggac atggggattt ctacatgtgt 4380 tgtaggcatt caaattgctc gagctggtcc gaatcattat atcttgggtc aagaggctat 4440 ggcaaagtct ttgttggaac gctttggaat gatggacacc aagccagctt caaccccatt 4500 tccgggcggc actaaactta caaaatcaac acctgatgaa gccaggtcgt ttagcttatt 4560 gaatctccct tacagaagcg gtgttggtag tttgatgtat ctgtctcaat gcaccaggcc 4620 tgatatcaca tatgccgtgg gctgtctatc gcaacatctc gagaaaccat tgctgcgaca 4680 ttgggaagct tttaaacaca ttcttaggta tttaaaaggg actctaaatt actgcataca 4740 ctatcaaaga aatcttccgc cgcctgcttt accaacaatt tcaagcaaca acggtttttc 4800 cttgccggag cattttgctg actcggactg ggcaggtgat aaaagtacca gacggtcgac 4860 cactggctac atctttatgc tatgtgacgg cgctattagc tggcgaagca gattgcagca 4920 gacagttgcg aaatcctcaa cggaggctga atacagagcg gcaaatgagg ctggggatga 4980 aatgatatgg ttatctagat tactcgcatc aattgatctg cctcaacaaa ccccttattt 5040 attaaattcg gatagtttaa gtacaattga tatatcagaa aatgctgtca tgcatgggag 5100 aacaaaggct attgagatac atcatcattg gttgagagag aaagtgaagg aaggtgtgat 5160 taaactggtg tattgtgctt cggaggacat gttggcagat attttgacta agccacttca 5220 cccggggcct ttcaatgatt ttaggagacg tataggtgtt 
aaagaaatag atggatgagg 5280 atattttaat tgtgtcgatt gaggggggg 5309 // ID CcNgaro3 repbase; DNA; FNG; 6131 BP. XX AC BK001748; XX DT 14-DEC-2005 (Rel. 10.12, Created) DT 23-DEC-2005 (Rel. 10.12, Last updated, Version 1) XX DE Coprinopsis cinerea retrotransposon CcNgaro3, complete sequence. XX KW DIRS; LTR Retrotransposon; Transposable Element; CcNgaro3. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-6131 RA Goodwin T.J.D. and Poulter R.T.M.; RT "A new group of tyrosine recombinase-encoding retrotransposons."; RL Unpublished (2004). XX DR EMBL/GenBank/DDBJ; BK001748; Positions 1 6131. XX FH Key Location/Qualifiers FT CDS 109..1791 FT /product="CcNgaro3_1p" FT /translation="MRDGISHLLAVLERSPSSLLVPLSFSRLLWSLGSCIS FT TLNLHLSLSPNFRAPHSFSLVEKFSFDMADGGAEGSGGGGEGTGGAGGVAA FT AQDIRTRKTHADDRVSRLLSSMALHLKLQMNKRIEEENDQRTLKNLAAAQG FT NGSTSTLASEVTIPDDDIDDLRDMTAWEAVKGFCSRGFTITMSAIHRRDRE FT EEEERRRGMTLGEQDRGRSKKKRRYEEDSSDSDGDGMPQAKKALLYLDPAD FT EQNFAPGGIQAKSVPFVLLNTEIVEPVPLRWLTESNWRWISEHSSKLPTTT FT AKVFNKIGATVSLTVIDVQKVLTQHGMSSSQKTELAHLAEESPRNYGEFVA FT TKANMLLLQSKRQQPGEDHQVRWYEQHYDNYLNRPEISEEGLFQRWIAQER FT RDRLERRSYPTPYDKRVADLRWEVMLSSYRAEQDNRSFRNPTPPKGTTGPP FT SGSSGGRRASEFIARRKERNQAQQESSDCVVCGVTGHTAKRHDFARAPRFR FT DGREFYVKKVGGSLVQTNAPYKEVCITWNAVGPDARCSHNLDERLHVCSFC FT GDGQHHALSRTCRN" FT CDS 3843..5261 FT /product="CcNgaro3_3p" FT /note="putative tyrosine recombinase." 
FT /translation="MPVTHHNTRSPKKNHPRFREIIQKHRDRTLNAIRTSS FT KSPALSPNPPTDTKANPPPLPPTRPAARAPKSGSEILKSEYRPHVVAEDRI FT YAWDTPYARCHRSKLGLPPGLISSSFVTLMGALAPNTRSTYAAGILRFTQF FT CDKYGVSEEERMPASYALLTGFMSEWSGKKGDGTIRGWLAGVRAWHQFHHA FT PWFGDDSWVQLARSCVAKEGQAFRKPPRSPVSMEHLIALRRTLRLSDPFHA FT AVWAAATAAFFGCRRLAEIVVAVKSKFSPKYNVCRSCEPRFRTHRDGSSAV FT AFDIPWTKTTKERGALVVLTARTDKWARELGICPVKAMKNHLAVNHGLPLS FT ASLFAYRSGNSWKHLTRAQFLEHVMGIWRSAGLGHVSGHSFRIGGAVELLL FT AGVPPEIVASTGGWTSLAFLLYWRRVEEVLPMSTTRAYHESQLKNSLAAIF FT EKFRRDNNLPEYLSTTAQSLRDTDHASD" FT CDS join(2042..2404,2578..3822) FT /product="CcNgaro3_2p" FT /note="Reverse transcriptase." FT /translation="MPRLNSTVIIPNHPSCMLYKQAVDDYLLGERNAGRMS FT GPYTRERVEEILGGPFFASPLIVSVQTQAPGTPDKLRICRHLSKGTKVDSS FT VNSYIEKESFPTRFDTALRVADIVSTLSISDLVFRHRDDHTESDLPFQISA FT APPGTQACTLDIEKFHRTIPVVPPHKCWLVVQGDPGEFWIEHNVPFGCASA FT SSNSGMVANAGVDIIQASGAGPTMKYEDDLKNLRVPVAEGRIEDSGYTYDV FT PSGRVADILTYLGFPINREKGDGVYRPVVEFIGFLWDIPRKVVSLPERKRS FT KFLRRVRDFLDAFDGKRCSRRDVERIHGSMCHVSFVHIDGRSRLPSLSNFA FT ATFDGRDPKTAHYPPTSVVSDLKWWAGCLEMAPRERSIRNRGPPMDHRIFV FT DASTSWGIGIVIGERWAALRLREDWKVKGRDICWLETVAVEILVYLLDSLG FT YRDQHILIHSDNKGTVGSITKGRSRNYHINHSVRRLYDLVLAVGLTPTLEY FT IESEKNPADPLSRGLPGPPGKKLVTDIRLPPDIDKALYFL" XX SQ Sequence 6131 BP; 1349 A; 1867 C; 1598 G; 1317 T; 0 other; tgggtggatg tagtgcaagc acaacatccg tggacaagat gaggtcgagt gatccggagg 60 cggcgcacct gtttggtcag accaaacagc ctcgtcttcc tcgcgccgat gagagatggt 120 atctctcatc ttctcgccgt tctcgagagg tccccatcct ctctactcgt tcctctctcg 180 ttttcgcgtc tgctctggtc actaggctcc tgtattagca cccttaatct acacctgtct 240 ctttcaccca attttcgcgc cccacactcc ttttcactcg ttgaaaaatt ttccttcgat 300 atggcagacg gaggagctga aggtagtggc ggaggcggtg aaggcactgg cggtgctggc 360 ggcgtcgccg ccgcccagga tatccgcacg cgcaagactc acgccgatga cagagtgtcc 420 aggttgctat cttcgatggc gctacatctc aagctccaga tgaacaagcg gatcgaagag 480 gagaacgacc aacgtaccct caagaacctg gcggctgcgc agggtaatgg ctcgacatca 540 accctagcct ctgaagtcac cattccggac gacgacatcg acgacctcag agacatgacg 600 gcatgggagg 
cggtgaaggg cttctgctcg agaggcttca ccatcacgat gtcggccatt 660 cacaggaggg acagagagga agaagaggag aggcgtaggg ggatgactct tggggaacag 720 gatcggggaa ggagtaagaa gaagaggaga tacgaggagg actcatcgga ctccgacggc 780 gatggcatgc cccaggcaaa gaaggccttg ctctaccttg acccggccga cgagcagaac 840 tttgcgccag gcggcatcca ggcgaagagc gttccgttcg tcctcctcaa cactgagatc 900 gtcgaacccg tccctttgcg ctggctcacg gaatccaact ggaggtggat ctccgaacat 960 tccagcaagc tgcctaccac cacagccaag gtctttaaca agatcggtgc cactgtgtcc 1020 ttgacggtca tagacgtgca gaaggtgctg acgcaacacg gcatgtcatc atcccagaag 1080 accgaacttg cgcacctcgc cgaagagagc cctcgcaatt acggagaatt tgtggcaacg 1140 aaagcaaata tgctacttct ccaatccaag cgtcagcaac ccggcgaaga ccatcaggtc 1200 aggtggtacg agcagcacta cgacaactac ctcaaccggc cggagataag cgaggaaggt 1260 cttttccagc gttggattgc gcaggaacgc cgcgaccgcc tggagaggcg ctcgtaccca 1320 acgccatacg acaagagggt agcggatctg aggtgggagg tcatgctctc aagctacagg 1380 gcggagcagg acaaccggtc ctttcgtaac cccactcctc cgaagggcac caccggcccc 1440 cccagtggtt cttctggcgg cagaagggcg tccgagttca tcgcgaggag gaaggaacgc 1500 aaccaggcgc agcaggagtc ctccgactgc gtcgtatgtg gcgtcacagg gcacaccgcg 1560 aagcgccacg acttcgccag ggcgcctcgg tttcgggacg gaagagagtt ctacgtcaag 1620 aaggtcggag gaagcctcgt ccaaaccaac gccccctaca aggaggtgtg catcacctgg 1680 aacgcagtcg gcccagacgc ccgatgtagc cacaacctcg acgagaggtt gcacgtctgc 1740 agcttttgcg gggacggcca acaccacgcc ctctcccgca cctgccgaaa ctagcgcgga 1800 cgctacgttc gcgcaatttg agaggctggc ctctccccaa aagctttgtt attcagattt 1860 ctccaacacc atcgtccctc gatattgcag atcaaaccaa ccagaagaca tagagatttt 1920 caaccgtatt tgcacgcctt atgacgccaa tgctttcgac gaattgctca acaagtttga 1980 actcaccaac gactatccta atctcgttta taatctcaga accggcttcc ccattggcaa 2040 catgcctcga ttgaacagca ctgtcattat tcctaaccac ccctcctgca tgctatacaa 2100 gcaggccgtc gacgattacc ttcttggaga aaggaacgcg gggaggatgt cgggaccata 2160 cacgagggaa cgagtggagg agatcttagg cgggcctttc ttcgcatcgc cgctgatcgt 2220 ctccgttcag acccaagctc ccggcactcc agacaagctc aggatttgtc gtcacctgtc 2280 gaaggggact aaagttgact 
cttccgtcaa ctcgtacatc gagaaggaat cgtttccaac 2340 tcgatttgat acggccttac gggtcgccga tatagtgagt actctctcca tatcggatct 2400 cgtatagcca gggctggtgc cgcgggcacc agacgtctca ccggtacgcc ggtggggaga 2460 ccactcgcgg ggtggtctcg cctggcactc agatagaaag tgcctccacc tggcgcggcc 2520 aggatggaac gtctgggtgc accgcgggtg acaccctgca ctatcgagat ccgataattt 2580 cgacatcgag acgatcacac agaatctgac ttgccgtttc agatatctgc cgctcctcca 2640 gggacgcagg cctgtactct cgacatcgag aaatttcacc gcaccattcc cgttgttccc 2700 ccccacaagt gttggctcgt ggtccagggt gacccaggcg agttttggat agagcacaat 2760 gtaccgttcg ggtgtgccag tgccagttcc aactccggaa tggtcgccaa tgccggtgtc 2820 gacattatac aggcttccgg tgcgggcccc accatgaagt acgaggacga cctcaagaac 2880 ctgcgggtcc cagtcgccga gggacgcatc gaggactccg gttacaccta cgacgtccct 2940 tccggccgtg tggctgatat cctcacctac ctcgggttcc ccatcaacag ggagaaggga 3000 gatggtgtct acaggcccgt cgtggagttc atcggctttt tgtgggatat tccccggaag 3060 gtggtatcac ttcccgagcg caagaggtcg aagtttctca ggagagtccg ggacttcctg 3120 gacgccttcg atgggaagag atgttccaga cgcgacgtcg agcgtatcca tggctcaatg 3180 tgtcatgttt ccttcgtcca tatcgacggc cgctctcgtc ttccatctct ctccaacttt 3240 gctgccacct tcgacggccg cgaccccaaa actgctcact acccgcctac ctcggtggtt 3300 tcggacttga agtggtgggc aggctgcctg gagatggctc ctcgcgagcg ctcaatccgg 3360 aatcgcggac cccccatgga ccatcgcatt tttgttgatg cctctacatc atggggcatc 3420 ggtatcgtta tcggcgaacg ttgggcggcg ctacgtcttc gcgaggattg gaaggtgaag 3480 ggaagggata tctgctggct ggagacagtg gcagtcgaga ttctcgtata tctcctagac 3540 tcactgggct atcgggacca gcacatactt attcactcgg acaacaaggg caccgttggc 3600 tccattacca agggtcgtag caggaactac catatcaacc actcagtgcg acgcctctat 3660 gaccttgttc ttgctgtggg actcacccca acccttgagt acatcgagtc cgagaaaaac 3720 cccgctgacc cgttgtcacg aggactgccg ggaccgcctg ggaaaaagtt ggtcaccgat 3780 atccgcctcc cgcccgacat cgacaaggct ctttatttcc tctgaccacc aacatacctc 3840 ccatgcctgt tacccaccat aacacgcgct ctcccaaaaa gaaccaccca cgttttcgag 3900 aaattattca gaagcatcgt gaccgcacgt tgaacgctat ccgcacctct tcaaagtcgc 3960 ccgcccttag ccccaacccc ccaactgata 
caaaagcaaa cccaccacct ctacccccaa 4020 ctcgccctgc agccagggct cctaaaagtg gaagcgagat cctcaaaagc gagtaccgcc 4080 cgcacgtagt cgccgaagat agaatttacg catgggacac cccatacgcc cgctgtcacc 4140 gtagcaaact cggcctcccg cccggactta tttcatcatc ctttgtcaca ctcatgggag 4200 ccctcgctcc caataccaga tccacctacg ccgctgggat cctacgcttc acccaattct 4260 gcgacaagta tggcgtgtca gaggaggaga ggatgcccgc atcgtacgca ttgctcacgg 4320 gatttatgag cgaatggtca gggaagaagg gagacggcac catcagggga tggctcgctg 4380 gcgtgcgcgc ctggcaccag ttccatcatg ccccttggtt tggagacgac tcgtgggtgc 4440 aactagctcg ctcgtgcgta gccaaggagg gtcaggcttt caggaagcct ccgagatctc 4500 ccgtgtctat ggagcatctc attgctcttc gtcgcacttt acgtttgtca gatcctttcc 4560 acgctgcggt gtgggccgct gccactgctg ccttctttgg ctgccgccga ctcgctgaaa 4620 tcgtcgtggc cgtgaagtcc aagttctccc ccaagtacaa tgtctgtcgc tcatgtgaac 4680 cccgttttcg cacacaccga gatgggtcct cagccgtcgc cttcgacatc ccatggacta 4740 agaccacgaa ggaacgagga gcattggtcg tcctcactgc ccggacggac aagtgggcac 4800 gggagctcgg catctgccca gtcaaggcca tgaaaaatca ccttgcagtc aaccacggcc 4860 tcccattatc ggcatccctc ttcgcctaca ggtccggaaa ttcttggaaa catctcacaa 4920 gagctcagtt cctcgagcac gtcatgggga tctggcggag cgccggcctg ggacatgtat 4980 caggccacag ctttcgcatt gggggcgctg tcgagctgct cctcgccggc gttcctcccg 5040 agatcgtcgc ttccacaggc ggttggacct ccctggcctt cctgctctac tggagaaggg 5100 ttgaggaggt acttcctatg agtacgacta gagcatacca tgaatctcag ctcaagaact 5160 cgctcgccgc catctttgag aagttccgaa gggacaacaa cctgccggaa tacctctcca 5220 cgactgccca gtcattgcga gataccgatc atgcatcgga ttaacttatg accttcgcta 5280 atttcttggg acacataaca gcactgaaca atcacctttc atcataactt gcatcgctcg 5340 tcgctttcgt ttcgtcaccg tcaacatcat cgcattattc gcattattaa ctagactgta 5400 attttacctg catttcatta tcactgcctc gagatacaga ctttaattcg ttgcgtagcc 5460 aagccattgc atagccaagt atataatgtc accgtgtatt ctacactagt tatccagttc 5520 atcgacctcg tctctggttg ggtatgacca ccagacctcg cacccttgcc tggctgtcag 5580 ttgtgagact cagtctctga ctctctagtt ggagcttggc catccagcca cggttgctcc 5640 ggctcggtgg tcatctgggc gttccggcca gcaagctggc 
ccaggcaaca ccaaggaaac 5700 ctgattaggt ttacttggct tcacgcttcg tcacggctct cgagagaacg agcctatctt 5760 gttgggtgga tgtagtgcaa gcacaacatc cgtggacaag atgaggtcga gtgatccgga 5820 ggcggcgcac ctgtttggtc agaccaaaca gcctcgtctt cctcgcgccg atgagagatg 5880 gtatctctca tcttctcgcc gttctcgaga ggtccccatc ctctctactc gttcctctct 5940 cgttttcgcg tcccaccgct cccgcccacc accagcgcct acgcttggcc atccagccac 6000 ggttgctccg gctcggtggt catctgggcg ttccggccag caagctggcc caggcaacac 6060 caaggaaacc tgattaggtt tacttggctt cacgcttcgt cacggctctc gagagaacga 6120 gcctatcttg t 6131 // ID Copia-61_MLP-I repbase; DNA; FNG; 6029 BP. XX AC AECX01000544; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-61_MLP_; KW Copia-61_MLP-LTR; Copia-61_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6029 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000544; Positions 125370 119342. XX CC Positions [3365-3880] - Integrase core CC LTRs are 90% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(2879..4333,4337..6028) FT /product="Copia-61_MLP-I_1p" FT /translation="MAPDRDLFDNYQPVSSNVTLANGTNIRIAGKGTIMCN FT SANDEVVLNALHVPDLGCCLISMGALIMDGYRIENVVDNILMVKHGVGGFV FT GHVRNGVIELDITLGSSSLTDPPRVNISISDYDTLHRQGGHPESPRLKMMY FT NVSAPEAWHCETCKLSKGHCLPYSGCFPSSNLPLDVIHSDLSSKISVPSVG FT GGLYYFKLTDACTNYKHVFIMKSKSDTMSCFLQYKSLVETFHNKKIVSLVN FT DQGGGYRSKDFRTLLEKNGTTSYLSAPYTPQQNPVSERGNRTTTEKARTLL FT RQSNLPYNFWAEAVMTSVFLENITPTRKTNNKSAYEMWFGRRFDYTRLKPF FT GCRAYVLIPKQFRRKFDNTSIKGIFLGYQVGMKNYRVRLEDGRIVYSHDVT FT FDCNNFPGIEGETQSSETKISALFSEEDYVIPAIVPETPTQIATNSPISPT FT TPVDVTSRPTSPEVEINDPFAQDDDDESSDGDNIVEQLTDHKPGWDWELRA FT EAPKDISSTIDTSNILPSGSCRRAAAISSKSTPTPLNPTTKSYKQALMSND FT NSEWENAITNELLNMTRRKVWDVVTLPKGRKAIGTTWVFKKKMGAEGELLK FT YKARLCALGFLQYFGVDYNETYAPTGRLVTFRSLCTIAAEENLDVIQMDAV FT AAFLNGKPEETIYINIPKGYNVNNATAETVLQLNQALYGLKQAPKVWYDTL FT KAFLATIGLFPSEIDASLFISSDPSWRCLVHFHVDDMVIALNDVLRFKNAI FT TKQFQMDENTDFKYILGMKVVRDRKARTITLSQSQYIQDLLEDYGMEDSKP FT VGSPMQPNTYLVPGTLAEQEEFLKLGISYRQAVGTLMYLNNATRPDLAFVV FT SQLSQHLNRPSIHHWIAFKRVLRFLKGTQALSLVLGGLMIIINAFADADFA FT TCPITRRSTGGYVTRLGNSTVNWNSKKQDTVATSTTEAEYRSAYEGGQDVV FT WVKNLLKGMEIRQDEAPIFKLDNQGAIALSKNEKFKRRTKHIDVKYHWLRE FT LSTKKPIKIEYIPTNDMIADVMTKSLTPSKHGNFCRLLGLHNV" XX SQ Sequence 6029 BP; 1971 A; 1407 C; 1069 G; 1582 T; 0 other; accaaagtct tacttcataa gactgagttt gatcttaatc cttatcaggt gttagttact 60 atttagttat attgtgttct catatgttaa tcaaactcaa gttttctttc tgcgtcacaa 120 cttccttttt cagaaagttg ttttctttcg atcaaccaaa gtcttacttc ataagactga 180 gtttgatctt aatccttatc aggtaaagtt gttttctttc gatcaaccaa agtcttactt 240 cataagactg agtttgatct taatccttat cagggaaaga aataacttcg attacctcag 300 ttatttggtt atgagcccag cgctccaata gtcgcaaatc ttccaaaaat tttttttttt 360 gaatccttgc gaaatgaccg actcttcgaa ctcatctcgc attcatttcc caaaactcgc 420 caaaaccaat tatgtcaagt gggcagcaga tgtcaccacc catttgatga catgcaatct 480 ggaagagttc attaacattg accctccacc agttcctcct gtcaaaactc tcgatacaga 540 tgcatctgtc caaattgccg tattcttgac caagaaaaag aaggccgctg 
tagaactctt 600 tcagtacaca gctgaaagca gagccaagaa ggggatgaat gaagacacaa acaacaacac 660 acacaatcaa aagaagaata agtcttgaga tggtgaattg aaagacttca aacaaagaaa 720 gaattgccct tcttaaacag ggactttgat tactgatgaa ttaaatttaa gaccttcaag 780 tctttggatt aataaaaacc aggctaaaag aggacaataa cttcaaaaat agcacaatct 840 aggcgcactc aggcaccttc caaatgagca aatgagctta caactcaaca agaaattaat 900 taaaagatga ataagggtct tattattata aaattccaac tctcattagt tgtttacctg 960 aatgataata ataaaagata aaagcttatt ctaaaaatct agtaattatt tttgctaact 1020 aaaaggggaa aaaaaggtgg gaaaaagaca cttcactata tcaaaattga aacacacaca 1080 aaaaagaaga taagacaaaa acccccctca aactcgaaat tcaccttcat acagcttttt 1140 cacagcaaca ctctcaatga aaccccaaaa taaagtggat agacagattc agcgcacaca 1200 cattgacaat ctacaaactt tcaagctcac aagcagcact ataacttagc tgtttaccac 1260 aaaaactgaa aaatccgcca attcaccctt gaaacctgaa aaaagataag aaatagggac 1320 tgtctcaaaa atctgaatag acagaaagaa aggtctgctt ccgctctaca actttgctca 1380 gccaagctaa actctaactc tattggttgt acccttaaaa tgaaattcga ttttatgtga 1440 aaatactcga aattgaattt acccagttta ccacctgata ttaccactga tgatgatgta 1500 tatggtttat aattgatttt tcatgttcct atgcaaaaat ataaccctcc ccaacttatt 1560 tctaggctga taagatgctg tggtacccta aaatcaaagt tatatgattg aaaatagcac 1620 tttttatgat tttgaaagtc atttttgagt ttcaaaacca cccttttcat atgtaatgtt 1680 taatttacca atgctgttgt ttgataatcc catgaatcca tctttcaaag tgatccctgc 1740 ttgctttttt agtttaccac cccagtttac cactacccca gtttaccacc atctgtctga 1800 gttgttggat tttcgaaatt cagccttatt taaacccata cctcaaaacc atgattcctc 1860 tacacaacac gttaaaccca tacatatctt taaacataac tattaaaacc cataattttc 1920 gaaattccaa cttcagacct gtatactcta cataacccat aaaccaccca actacacata 1980 agttctaaac ccataatttc gacttttggc ttcattttac actacactcc ctacacataa 2040 ccattacact caaaattcaa catgtaacat ataaaaataa aatgaaatat gataaaataa 2100 aatgataaaa taataaaata aaaaaaataa aactaaaaat gaattgtgta tttgtctctt 2160 gaatactttt gattaccaca ccgccggtat cctatacggc tgcatcgacc aagacaatcg 2220 tatgcgagtt gtggccaagg aagccgtatc tgatccaatc aagatatgga tgttattgaa 2280 
agaacacttt caatcgtcgt ctgacgagaa ccaagctcga gcttacttaa aatggactga 2340 cattgtcttt acggatctcg aaacttacat caccgataat caacatgcaa tagcaggact 2400 tcttgcggtg gacggtctga aacacattca cgacaagttc attggcgaga ccatagtcag 2460 taatctcccg tcctccacgg acatcacgaa gaccctccta cgcaaggatc gaccgttgac 2520 tgccgataaa gtcatcacgt accttgaatc ccaactcatc accatgaggt acaaagaact 2580 tcacgagagc accgttgctc tagcagtcag accaccacga tcactccaac gatcagcaac 2640 tcaacgtccc ttgacgtttg gacgaccacc acgcccattt tgttccaacg gctgacataa 2700 cccagaagct aaaggtcata ccttacaaca gtgcttccaa gttaacccca acctacgtcc 2760 tacacaaccc aaaactgcgg ttgtcaccac cgacaactca ttctttgaat ctcaagcatt 2820 tgtcgtgacc tcatccgatc acgaatccat cctattggat agcggctgtt cacatcacat 2880 ggctccggat cgcgatctgt tcgacaacta ccagccagtt tcttctaacg taactctggc 2940 taatggcacc aacatccgca ttgctggcaa gggcaccata atgtgtaact ccgccaatga 3000 tgaagttgtt ctcaacgcac ttcatgtccc cgacttggga tgctgtctaa taagcatggg 3060 cgctctcatt atggatggct atcgcattga aaacgttgtc gacaacattc tgatggtcaa 3120 acatggcgtc ggaggttttg tgggtcacgt tcgcaacggt gtaatcgagc tcgacattac 3180 tctaggctcg tcatccctca ctgatcctcc acgtgtcaac atcagcattt ctgactatga 3240 tacacttcac agacagggag gtcatcccga atcccctcgc cttaagatga tgtacaatgt 3300 ctccgcacct gaagcctggc actgtgaaac ctgtaagctc tcaaagggcc attgtcttcc 3360 gtattccggc tgttttcctt cttcgaatct cccattagat gtcattcata gtgatctgag 3420 cagtaaaata tctgtacctt ctgttggagg aggcctttat tattttaagc tcacggacgc 3480 atgtacaaat tataaacacg tttttatcat gaagtctaag tccgacacaa tgtcttgttt 3540 cctgcaatac aaatcactag ttgaaacctt tcacaataag aaaattgtga gtttggtcaa 3600 cgaccaagga ggcgggtata ggtcaaaaga ttttcgaaca cttcttgaga aaaacggcac 3660 gacatcctac ctgagcgcac cctacacacc gcaacagaac ccagtctcgg aacgagggaa 3720 cagaaccaca acggagaaag caagaacttt actccgtcaa tcgaatctgc catacaactt 3780 ctgggcagag gcggttatga cttcagtctt cttagaaaac atcacgccaa ccaggaagac 3840 aaacaacaaa tcagcttacg agatgtggtt tggtcgaagg tttgattaca caagactgaa 3900 accttttggc tgccgtgcct atgttttgat tcccaaacag ttccgacgta aattcgacaa 3960 cacatcaatc 
aaaggcatct tcttaggata ccaagtcggt atgaagaatt atcgagtgag 4020 actcgaggac ggtcgcatcg tttactcaca tgatgtgaca ttcgactgca acaattttcc 4080 tgggatagaa ggtgagactc aatccagtga gaccaaaatc agtgctcttt tcagcgaaga 4140 ggactacgtg ataccagcca tcgttcctga aacacccact cagatagcca ccaactctcc 4200 catttcacct acaacaccag tagacgtcac atcacgtccc acctcaccag aagtcgaaat 4260 caatgatcct tttgcacaag acgacgacga cgaatcaagt gatggagaca acatcgtcga 4320 gcaactcaca gactgacaca aaccaggttg ggattgggaa ctacgagcag aagctcccaa 4380 ggacatatcc agcacgatcg acacatcaaa catcctacct tcagggtcat gtcgacgggc 4440 agccgcaatt tcatcaaaat ccacaccaac tcctctcaat cctacaacta agtcttacaa 4500 acaggcgctg atgtcaaatg ataattctga atgggaaaac gcaatcacca atgaattact 4560 caacatgacg agacgaaaag tgtgggatgt ggtcacccta cctaaaggac gaaaagctat 4620 tggcacgacc tgggtcttca agaagaagat gggtgctgaa ggggagttac ttaagtacaa 4680 ggccaggtta tgtgctctgg ggtttctcca atactttggt gttgactata atgaaaccta 4740 tgcgccaaca gggcgtttag taacttttcg atcactgtgt accatcgcag ctgaagaaaa 4800 tctggacgtc attcaaatgg acgcagtcgc agcgttctta aacggcaaac cagaagaaac 4860 aatttacatc aacattccaa aaggctacaa cgtcaataac gccacagctg aaacggtact 4920 tcaactcaat caagctctgt atggactgaa gcaagctcca aaggtctggt acgacacctt 4980 gaaagcgttc ctcgctacca tcggtctttt tccctccgaa atcgacgcaa gtctctttat 5040 ttcatctgat ccctcatggc gctgcttagt tcactttcat gtcgacgaca tggtgattgc 5100 tttgaacgac gtacttcgct tcaagaatgc gatcaccaaa caatttcaaa tggatgaaaa 5160 cacagatttc aagtatatcc taggaatgaa agttgttaga gatcgtaaag caagaacaat 5220 cactttatct caaagtcaat acatacaaga cctactagaa gattatggaa tggaagactc 5280 aaaacctgta ggatctccga tgcaaccaaa cacctacctg gtaccgggca cgctcgcgga 5340 acaagaagaa tttctcaaat taggaataag ttatcgacaa gctgttggaa cactcatgta 5400 cttaaacaac gccactcgac ccgacttagc ttttgtcgtg tctcaactat cacaacactt 5460 gaacagaccg tctatacatc actggatagc cttcaaaagg gtcctacgtt ttctcaaggg 5520 tacacaagcg ctaagtctgg ttcttggagg attaatgatc ataatcaacg cttttgcaga 5580 tgctgacttt gctacttgtc cgattacgag gagatctaca ggtggatatg tcacgagatt 5640 gggtaacagt acggtgaatt 
ggaattctaa aaagcaggat acagtagcca cttcaacgac 5700 tgaggcggaa tatcggtctg cgtatgaagg aggtcaggat gttgtttggg taaagaattt 5760 attgaaagga atggaaataa ggcaagatga agcaccaatt tttaaactgg acaatcaggg 5820 agcgatagcg ttatccaaaa atgaaaaatt taagagaaga actaagcaca tagacgtgaa 5880 ataccattgg ttacgggaac tgtcaactaa gaaaccaatc aagattgaat acatcccaac 5940 caatgacatg attgctgatg taatgaccaa gtcactaaca ccatcaaaac acggaaactt 6000 ctgtagacta ttaggcttac acaatgttt 6029 // ID Gypsy-74_MLP-LTR repbase; DNA; FNG; 314 BP. XX AC AECX01000970; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-74_MLP_; KW Gypsy-74_MLP-I; Gypsy-74_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-314 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000970; Positions 48410 48723. XX SQ Sequence 314 BP; 69 A; 83 C; 46 G; 116 T; 0 other; tgcttttcct cccttcctat ttttgttata ctcttttcct attttacttg tagaccagat 60 cttgctagat ccgtcccgac catatactgt atttccttgt ggaccgaatc ctacagtacc 120 agattcggtc atctctcatc ttgtatcact acgatcttcg atcgtctctt gttgtaccgc 180 cattccattg ttttccttat tcgtttctcg cagaaggatg tttgcctata taacctgctc 240 acatcctcat agcaatggag aggacaagtt acttcctact tactttcgta cccataaaag 300 tgagtagaat caca 314 // ID Copia-64_MLP-LTR repbase; DNA; FNG; 210 BP. XX AC AECX01000588; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-64_MLP_; KW Copia-64_MLP-I; Copia-64_MLP-LTR. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-210 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000588; Positions 44806 45015. XX SQ Sequence 210 BP; 67 A; 45 C; 15 G; 83 T; 0 other; tgtcacgtaa cagttcattt aattacaagt taacaatcta atgaccatta catatgttta 60 agatatttac tcaaaccttt tgtaccaaac tataacctaa ccatgacttg tatcacattt 120 tctatttcat accttgttgt tcgaaaactt tcttcttctt tcctcatcag tttaaaaagt 180 tttcttctca caacttctta tcataaaaca 210 // ID Gypsy-5_MLP-I repbase; DNA; FNG; 5657 BP. XX AC AECX01002110; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_MLP_; KW Gypsy-5_MLP-LTR; Gypsy-5_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5657 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002110; Positions 3491 9147. XX CC Positions [4460-4939] - Integrase core CC 'GTTAC' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(194..2869,2873..5329) FT /product="Gypsy-5_MLP-I_1p" FT /translation="MSNNPRHTRNNPVSELVSVPNPEGILRLTRSNPTLRS FT TEVHPPVSVANVTATSLFHREINSFLNPPDRSGIGVLYPSPSVWSPIASQL FT VASNTPSLVDSVELDQLSSLASPTLPQLRLPHPPRRSIPGAFDPCQSFAMT FT DVPAGNRSIENEGDHPVQPTGQPGVTGTEVTQEQVSIAEVQAKMRAEMNSL FT RALIQQMMSAQISTPAPATQPKLQEQSGITDLQATPVPHALGTRHVSSSAD FT PVFNQQVYSSTPLHQNPFHRPLVPPASQTPSVDQVPVSAIDPLRFRITDFP FT EYKGKYGDVAAYRIWRHKVERLFLVKGLVLDSDKFSVLPLLLSTNPAASWC FT RRSGDFTGHTWLSAMKEMESVILPADWLDKVKQQIRELAMRPNEHISAFCA FT RARVLQEAAGLEECSEEALAWAIVGGSTSLFRSIQQRDQIIKSSINPVSQK FT FSFAAFEQKACLAWDFALELEPRIQSRIPRSSNNLSATPTVLTSQNQRGSA FT PFRAALSPEEQAARNARFMAYMRSIGLCPRCKTSCLKWLGGCTADTNSAYY FT PVPPDFARSPPYPPPKSSNTVSKVAQNAAPRPAARRVDVAAVEETPSVDLA FT AVDSFPDLGREDELAYNQLIERLQAEPEDDLVDAAECVPSDSVSSCASELQ FT THRVSPLILEITVNGVKMRALADTGAGTNLMSEKVATKLKVVKRLLPRPVT FT VRPAIVSEPVPFTLKEFAFANLTCDRPTFTFGVTPFKLAPLGGSYDIILGT FT PFLSKHRLDVSVSRGLLRSSKNGYEFKAVVEKSEEEELKELHKRREVLLAT FT VFDNLAKVDKARDFSICEMKILREFQDLFPEDLPDVESVDDDAEFFPEKNQ FT DESSLTRHCIELTNPDVVINERSYGYPRKHLDVWAKLLNHLKAGRIRKSKS FT QYASPSMIIPKKDPKALPRLICDYRKLNKYTVKDRGPLPIVDECVRIVAQG FT RIYSLLDQINAFFQTLMREKDIPLTAIKTPWGLYEWVVMPMGLTNAPATQQ FT RRCEEALGDLVNRICVVYIDDIVVFSQTVEEHEEHLREVLRRLRAANLYCG FT LKKTQLFRRQIKFLGHEISEDGIRPDEDKVIKVANWKRPSTPKHVKEFLGT FT VQWLKKFVEGLQRYTGQLTPLTSNKKLNSFEWTEKEDAAFENIKRMITTLP FT VLRNIDYDSEEPIWLFTDASGHGLGAALFQGENWETANPIAYDERQMTAAE FT RNYPVHEQELLAVISALNKWKLLLMGLKVHVMTDHHSLTHLLTQRNLSRRQ FT ARWLETLSQFDLNFQYLQGADNSVADALSRVESAALTISTAGLDDSFVDQV FT KSGYEMDPFCVKLSKVLPLRNNCVLKDGLMYIDNRLVIPSVGTLCKDLVTS FT AHVAVGHLGAVKTAAILRQEFYWLGLYDDVEKHVSCCDGCQRFKARTTKII FT GQLQSTDLPRRPFSSIASDFVGPFPKVSSYDMVLLCTCRLTGFVQLIPVNQ FT RDTAEKTAQRLFQSWLSIFGAPDEMVGDRDKTWQSRFWRELHQMMGIEVKL FT TTAYHPQADGRSERTNKTFGQILRFSTQEKQGKWVEALPLAEYAINSAVNS FT ATGVSPMRFVLGIQPRLFPIPHVDAQVSEDVEVWLKTRESEWASWRDRLWA FT SRVDQAVQYNKRRGGELGAKVGDMVLVDSANRSQVVGGRVAKLRACYDGPY FT RVLQVLNEGRDFKLQLPEGDNTHDVFHGSKLKIYHTAEVEGT" XX SQ Sequence 5657 BP; 1467 A; 1143 C; 
1451 G; 1596 T; 0 other; ctttttttat cccccttgtt cgacgttcaa cgaatccgat tgttgggatt caatacatcc 60 ctcgagttca ccagtcactc gtctcggtct cgcaccctaa tcggtcgcca acgttttaag 120 ttacagattc aattgtcccc actcagtggc gttgtcgcct ttattttgtt tcaaccagtt 180 taaagtttgt ttgatgtcaa acaaccctcg tcatacacgt aataatcctg tttcagagtt 240 agtatcggtt cctaatccgg aaggcattct gcgactgact cgttcgaatc ctaccttgag 300 atccaccgag gtacacccac cagtttcagt ggcaaacgtt acggccacct cgttattcca 360 tcgtgaaatc aattcctttc ttaatccgcc tgatcgttcg gggattggag tcctatatcc 420 ctctccatct gtatggtcac caatagccag tcaactagtg gcttcgaaca caccatctct 480 cgtcgattct gtcgaactcg atcaattgtc tagtttagcg tctccaacat tacctcaact 540 gcggttacct catccccctc ggcgtagtat tcctggagcg tttgatccgt gtcagtcgtt 600 tgctatgact gacgtaccgg ccggcaacag gtccatcgaa aatgaaggtg accatcctgt 660 acaaccgacc ggtcagcctg gagtgacggg cacagaggtt actcaagagc aagtctcgat 720 tgctgaggtg caggccaaaa tgcgagctga gatgaacagt ctccgagctc ttattcaaca 780 aatgatgtca gctcaaattt ctacaccggc tccggcgact caaccaaagt tacaggagca 840 aagtggtatt acggacttgc aagctactcc tgttccgcat gctcttggta cccggcatgt 900 atcctcatct gcagatcctg tgttcaatca acaagtttac tcttcgacac ctttgcacca 960 aaacccgttt catcgaccac tagtaccccc ggcaagtcaa acaccgtcag tggatcaagt 1020 accggttagc gctattgatc cgcttcgttt tcgtatcacc gactttccag aatataaagg 1080 aaagtatggc gatgtagcag cctatcgtat atggcgtcat aaggtggaac ggcttttttt 1140 agtcaaaggc ttggtgctcg attccgataa attcagcgtg ttgccgttat tgttgagtac 1200 caacccagcg gcgtcttggt gtcgcagatc gggtgatttt actggtcata cttggttatc 1260 cgctatgaaa gaaatggaat ctgtcatttt gccagcggac tggttggata aggtcaaaca 1320 gcaaatccgc gaattggcga tgcgaccaaa tgagcacata agtgcgtttt gtgcaagggc 1380 tagggtgtta caggaggcgg caggtttaga agagtgttct gaggaggctt tggcatgggc 1440 tatagttgga ggttctactt cgttattcag gtctatacaa cagcgtgatc aaataatcaa 1500 atcgagtatc aacccggtct ctcagaagtt ttcctttgca gctttcgaac agaaggcatg 1560 cttggcgtgg gactttgcgt tagaacttga accacgtatt caaagtcgta tcccacggtc 1620 ttctaacaac ctttcggcta ctccaacagt tttgacgtcc caaaatcaaa gaggatcagc 1680 
accctttcga gcagctctat ctcctgagga gcaagccgcg cgtaatgcta ggtttatggc 1740 gtacatgaga tcgattggtc tatgtccacg gtgcaagacg tcatgtttga aatggttagg 1800 aggttgtacg gcggacacga actcggctta ttacccggtt ccaccagatt tcgcccgttc 1860 ccccccttat ccaccaccca agtcttccaa tacggtatct aaagtggctc agaatgcggc 1920 tcctagacct gcagcgcgac gggtagatgt agcggccgtg gaggaaacgc ctagtgtgga 1980 cttagcggca gtcgatagtt ttccagatct gggaagggag gacgagttag cctacaatca 2040 actcattgaa agattacagg cggagcctga ggatgacttg gtcgacgcag ccgagtgcgt 2100 accttctgat tctgtatctt cctgtgcgag tgaactccaa actcatcgag tgtctcctct 2160 gatccttgaa attacggtca acggtgtcaa aatgcgggca ttggcggata ctggtgcagg 2220 tacgaacttg atgtcggaga aggtagccac taaactgaag gtggtgaagc gactgttacc 2280 tcgaccagta actgtgcgtc cagcaatcgt ttccgagcct gtacccttca ctttgaagga 2340 gttcgctttt gctaatttga catgtgatcg ccccactttt acttttggtg ttactccttt 2400 caaattagct ccgcttggag gatcatatga tattatttta ggtactccat tcttatcaaa 2460 acatcggttg gatgtgtcag tcagtagagg tttgttgagg agttcaaaga acggttatga 2520 gtttaaggca gtagtggaaa agagtgaaga ggaggagttg aaggaattac ataaacgaag 2580 agaggtgttg ctggcaactg tttttgataa tttagctaaa gttgacaaag cccgtgattt 2640 ttctatttgt gaaatgaaga tattaagaga attccaggat ttgtttcctg aggacttacc 2700 agatgtggaa agtgtggatg atgatgcaga atttttccct gagaagaatc aggatgagtc 2760 gtctttgaca cgacattgta ttgagttgac gaacccggat gttgtcatta acgaacgaag 2820 ttatggttac ccgcgtaaac acttggatgt gtgggcaaaa ttattgaatt aacacttgaa 2880 ggcggggagg attaggaaat caaagagcca atatgcctca ccctccatga tcataccaaa 2940 aaaggaccct aaggcacttc ctcgtcttat atgtgactat cggaaattga ataaatatac 3000 tgttaaagat aggggtcctc tacctatagt ggatgaatgt gttcgtattg tagctcaggg 3060 aaggatctat tcattgttag accaaatcaa tgctttcttt cagacgctta tgcgtgagaa 3120 agatatacct ttaacggcta tcaaaacccc gtggggatta tatgaatggg ttgtcatgcc 3180 tatgggtttg acaaatgccc ctgctactca acaaagacgt tgcgaggaag cgttaggaga 3240 cttggtgaat cgtatttgtg tcgtgtatat tgatgacata gtagtctttt cgcagacagt 3300 tgaagaacat gaagagcatt taagggaggt gttgagaagg ttaagagcag caaatttgta 3360 ttgcggattg 
aagaaaaccc agcttttcag gagacagata aaatttctgg gtcatgagat 3420 tagtgaggat ggtattagac cggatgagga taaggttata aaggtggcaa attggaagag 3480 accctcaacc cctaaacatg tgaaggaatt tttaggaact gtgcagtggt taaagaagtt 3540 tgtggagggt ttgcagcggt atacaggcca actgacaccg ttaactagta ataagaagct 3600 taattcattt gaatggactg agaaagagga tgcagcattt gagaatatta aaagaatgat 3660 aactacgtta ccggtgcttc gaaatattga ttatgactcg gaggaaccaa tttggttgtt 3720 tacagatgca agtggacatg gtttaggtgc ggcgttgttt caaggggaga attgggagac 3780 tgccaatcca atagcttatg acgagagaca gatgacagca gcggagcgta attatccggt 3840 tcatgaacag gagctgttgg ctgttatcag tgctttgaac aaatggaagc ttctgctgat 3900 gggattgaag gtacatgtca tgaccgatca ccattctctg acccatttat taactcaacg 3960 gaacctcagt cgacgacagg ctcgttggtt ggagacactg tcccaattcg atctaaactt 4020 tcagtacctg caaggtgccg ataactcggt ggcggacgcg ttgtctagag tggaatcagc 4080 ggctttaacc atatcaactg caggccttga tgattctttt gtagatcagg tgaagagtgg 4140 ttatgaaatg gatccgtttt gtgtcaaact ttcgaaagtt ctgccactaa gaaacaattg 4200 tgttctgaag gatggattga tgtacatcga taaccgtctg gtcattcctt ctgtaggaac 4260 tctttgtaag gatttggtca cttcggcgca tgtagcggtg ggacacttgg gagcggttaa 4320 gacagcggct atattgaggc aggagttcta ctggttgggg ttgtatgatg atgtggagaa 4380 gcatgtgagt tgttgtgatg gatgtcaaag attcaaagct aggacgacca agatcattgg 4440 gcaattgcaa agtacagact tacccaggag gcctttcagt agtatcgcat cagactttgt 4500 aggaccgttc cccaaggtgt caagttacga catggtgtta ttgtgtactt gccgtttgac 4560 tggttttgtt cagctcatac cggtaaatca acgggacacc gctgaaaaaa ctgctcaacg 4620 attatttcag tcatggttgt caattttcgg cgcacctgat gaaatggtag gagataggga 4680 caaaacgtgg caatctaggt tctggagaga attgcatcag atgatgggta ttgaggtaaa 4740 attaactacg gcttaccatc ctcaagcaga tggacgatct gagcgaacaa acaaaacctt 4800 tggtcaaatt ctgcgctttt caacacagga gaagcaaggc aagtgggtgg aagctttacc 4860 tctggcggag tatgccatca actcggcagt caacagcgca actggtgtat ctccaatgag 4920 atttgtactt ggtatacaac ctagactgtt ccccatccca catgttgatg ctcaggtgag 4980 cgaggatgtt gaagtgtggt tgaagaccag agagagcgag tgggcgtcgt ggagagatag 5040 attgtgggca tcaagggtag 
atcaggctgt tcaatacaat aagcgtcgag gaggagagtt 5100 gggtgcgaag gttggtgata tggtattggt tgacagtgca aacaggtctc aggtggtagg 5160 tggaagagtc gctaaattac gggcatgtta cgatggaccg tacagggtgc ttcaagtatt 5220 gaatgagggt cgtgacttca aacttcagct gccagagggc gataatacgc acgatgtgtt 5280 tcatgggtca aagttgaaga tttatcacac ggcggaagtg gaagggacgt gattttggcg 5340 aggagctacc ttgtgcaagt aagtctcctc cttatcgtat gcaccgaccg tggggttcta 5400 cctcctttgt ttcaatcaaa aacatacctt ggccacgcct gtgagcacct ccattttggg 5460 tctctcttct gcaggacaag gacggttgtt gatattttcg gatggatacc tggttcagtt 5520 ggtggaatgg tagtggtttt atgttttcct tttttttctt ttcttttatt tttattttct 5580 tttggttttt gatggttgat tttaggaggg ataagaatat tttgcggagg ggtttgttgg 5640 ttttttggtg gggaggg 5657 // ID LTRTF2 repbase; DNA; FNG; 349 BP. XX AC L10324; XX DT 07-FEB-1997 (Rel. 2.01, Created) DT 07-FEB-1997 (Rel. 2.01, Last updated, Version 1) XX DE TF2 retrotransposon, LTR. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTRTF2; KW Long terminal repeat; retrotransposon. XX OS Schizosaccharomyces pombe OC Eukaryota; Fungi; Dikarya; Ascomycota; Taphrinomycotina; OC Schizosaccharomycetes; Schizosaccharomycetales; OC Schizosaccharomycetaceae; Schizosaccharomyces. XX RN [1] RP 1-349 RA Weaver C.D., Shpakovski V.G., Caputo E., Levin L.H. RA and Boeke D.J.; RT "Sequence analysis of closely related retrotransposon families RT from fission yeast."; RL Gene 131(1), 135-139 (1993). XX RN [2] RP 1-349 RA Boeke D.J.; RT "LTRTF2."; RL Direct Submission to Genbank (04-FEB-1993)Jef D. Boeke, Mol. RL Biol. Genetics, Johns Hopkins University, Baltimore, MD 21205. XX DR GenBank; L10324; Positions 1 349. XX CC LTR of TF2 retrotransposon. 
XX SQ Sequence 349 BP; 121 A; 68 C; 48 G; 112 T; 0 other; tgtcagcaat actacactac gctataatac actacgttga gtatcactat atgtcacatg 60 ttctaattat atatcgtacc atgtatgata cgatatggag attgatctta atgataatct 120 attaagatca atattatctg aatactataa atagagctac tgctgaacct cgttcctcag 180 ttcagttatg agctatatta gtgataggta acattataac ccagttaata caatacctat 240 actcagttgc tacttataca acctgtgtat cgtaatataa tagatcacaa ggaaaactca 300 ccgcagttct acgtatcctt aaatcagata ccaaactgcg tagcttaca 349 // ID Zorro repbase; DNA; FNG; 5706 BP. XX AC AF254443; XX DT 28-JUN-2005 (Rel. 10.06, Created) DT 03-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Candida albicans non-LTR retrotransposon Zorro 3. XX KW L1; Non-LTR Retrotransposon; Transposable Element; Zorro3; Zorro; KW L1-1_CA. XX NM L1-1_CA. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-5706 RA Goodwin T.J., Ormandy J.E. and Poulter R.T.; RT "L1-like non-LTR retrotransposons in the yeast Candida RT albicans."; RL Curr Genet 39(2), 83-91 (2001). XX DR Genbank; AF254443; Positions 1 5706. XX FH Key Location/Qualifiers FT CDS 3..1829 FT /product="Zorro_1p" FT /note="unknown." 
FT /translation="IMTTNGTNGFDFADKGDHPPRQSQENPPGNFQELPLD FT GGQIRKSSSNDGVSNNVEFNTNTNAHESWADYPESDQMDTDSSEYYTEDEN FT EEGKIPKGPYHQFQNANKFTYFTPETATTLVAPNNNTQNTTTEQDNMSKTY FT ASVAATFKTNLAKSQDKQEELRMSNFKVPQKAFDDLKRHNDAVFPEEVMTS FT DLSTIINKYGHEKVFDTIENTLITTAQNLEGYLSEYPENVMRKAFKYFTLK FT LSIEDKQTRDVCEEILGIREELMYIQRILDFNIMDKVMCFMPFTFEQMEGK FT LSSEKAKVRTIVENLELDFDGTVEHIKIDSAKRNRQLVAFKVPIHCRSLAE FT RYFRQIFLHKEDVIVQQYAPIIEKVQLGRKGKTEENNHQLLTEYVCYFIVD FT LFSGQVPKRRIKIDNTKNKPSSYEQCTSRVLSNCKYCEYCRSMTHTKYHCP FT LIKPCTNCGVKGHKTTSCKKALSTNTLVLKRPETKPNFPILPPKVVNKTTS FT DDSQRKGSIAKTTSHTKGDNTEKMVTPSAQNENTTLQESIPVTITGHREAP FT TTPPVADTTAKKTMTTGVLEMDTSKHTISSNDEASPRKKLNVKEKPSNPEK FT DPGLNLPPNLQ" FT CDS 2007..5453 FT /product="Zorro_2p" FT /note="endonuclease and reverse transcriptase FT domains." FT /translation="MKRNESYINSLTIGSKNIGSHQSTDFKKLLDIFLKLI FT GEHLMIDIWFIQEIRFVSQEQFNYINKILKQHNAQLRMHHFEDLTGFLIHS FT PHAKLKFKIRDNDNHHTTHFEGRISILDITLITNEDITLINNYLHSGNMDA FT QMTLLKAFIKYIANLKKHTNHNIIYGGDYNHIMLLDDVQLPLDQTRYIISK FT KELEIIQLMSNFYKKWKLQDAFQIRNNLQPTNFHSNKSVKKRLDRIYIDSR FT IRRKLRNCRILEEFKQISTHKIIAMSFQIQKEPILKVGNPRYLIPQWMSQD FT ENIIKDLNNNEQSSLTPFSNWNGIINRIKEKVIFYEKYQRYIRAYIPHAGN FT FPDEKMLKFRFRPSFNFSIITEMQTESGNTVQDTEIMINLATKFYQDLFLV FT EDRHLESFTFVEQFDKKIDDTDKVLLEKAFIEENVYDHLLMINKKTAVGTD FT GISYQNLIELWPSLGESLIRAGNNILKYGTLPQQMSEVIITLIPKKLKTPI FT IENFRPISVISCAVRLLSSVIEKQLNPVLAKVIEKTQTGFLKERSISNSIY FT LLDMVLTRYQTSKTADAESAGFINLDFRKAFDSVHHDFILKVLQQVGFGPK FT ATNFLMAITAKQKAKVSINNIEGPCFPLKRGVRQGNPISPLIFILILETFL FT ARLSKEIEGIGVVNEVSLVAYTAYADDVIIFFKNKNDQERIQQLLEDFGRE FT SGLYLNNNKTEVCYFNDIPEISFLPYVSKKLQLEKLTYLGVPMKKADEEFD FT PWTLFVLNLNAQIRMTPILDLPYQLIMKLMNIFIFSKLYYRDLHSPILTTA FT VSSIITTVQQRLPLYFKLQRLQTPNHLGGFGLMNPNHQVKGRRGKQIYLLY FT TQEDDLIIKFMRTKIQDILDNIAKDYYTMPTEPDKLIVYPWYLFGMGVSCH FT LSLQFRYLKERVYENLTKLEISWFEAWFQLVHYTGPPVETPIIIMLIEDYA FT NLIIQPQTESIKLMSPNFEEQSLSEQTFHHTSRKLCPQAPIIPEGWGKHFD FT TIKSLTISDWTEFWKNMNKVQKQSVGSLQDYHLFILGYYSHYPFYPNYKKL FT EIHFCQLCNTGTDSIVHHIFECGETMELWLRHFNSQRTPQFIIGNKHLHKR FT 
DLYALNEYIKEVVQKVKRRRGSPILNQGERENVDAGTNVLV" XX SQ Sequence 5706 BP; 2130 A; 1002 C; 961 G; 1613 T; 0 other; ttattatgac tacgaacggt actaatggtt ttgattttgc tgataaaggt gatcatccgc 60 ctcggcagtc tcaagagaat ccccctggga attttcaaga gctgccgtta gatggtggtc 120 aaattcggaa atcatcgtcg aacgatggtg tttccaacaa cgtcgaattc aatacaaata 180 caaatgcgca cgagtcgtgg gctgactacc ctgaatcaga ccagatggat actgattcat 240 ctgaatacta tacagaagat gaaaacgaag aaggaaagat ccccaaaggt ccataccatc 300 aatttcaaaa tgctaataaa tttacttact ttacaccaga aactgcaaca acattagttg 360 ctcctaacaa caatacacaa aacactacaa ctgaacaaga caatatgtcc aaaacttatg 420 cctctgtggc agctacattc aaaacaaact tggccaagag ccaagataaa caagaggaac 480 ttagaatgtc caattttaag gtccctcaaa aagcctttga cgacttaaaa agacacaatg 540 atgcggtatt tcctgaagag gtcatgacat ctgatctctc aaccattatt aacaaatatg 600 gacatgaaaa ggttttcgac accattgaga acacgctaat tactactgcg caaaacttag 660 aaggatattt gagtgaatat cctgagaatg ttatgagaaa ggcattcaaa tatttcacat 720 tgaaactctc tattgaggat aaacaaacaa gagacgtctg tgaagaaata ttagggataa 780 gagaggaatt gatgtacatc caacgaattt tagattttaa tataatggac aaagtaatgt 840 gcttcatgcc atttacgttt gaacaaatgg aaggtaaact ttcatcagaa aaggcgaaag 900 taagaaccat agtggaaaat ctcgaattgg atttcgatgg tacagtggaa catatcaaaa 960 ttgatagtgc caaacgaaac agacaattgg tggcctttaa agtgccaata cactgtcgta 1020 gtttagcaga aaggtatttc cgacaaatat ttttacataa agaagatgtt atagtacaac 1080 aatatgcgcc tattattgag aaggttcagc ttgggagaaa gggcaaaact gaagaaaata 1140 accatcaact tttaacagaa tatgtttgtt atttcatagt tgatctcttt tcgggacaag 1200 ttccaaaaag aagaatcaaa atcgataata caaagaataa gccctcctca tatgaacaat 1260 gtacatcaag agttttatct aattgcaaat actgtgaata ctgcagatct atgacccata 1320 ctaaatatca ttgtcctttg ataaaaccat gcaccaactg tggtgtaaaa ggacataaga 1380 caacgagctg taaaaaagct ctgtctacca acactctggt tctaaaaaga ccagaaacca 1440 aaccaaactt cccaatttta ccacctaagg tggttaacaa aactacgagt gatgattctc 1500 aaagaaaggg aagtatagca aaaactactt ctcatacaaa aggagataat acagaaaaga 1560 tggttacacc atctgctcaa aatgaaaata ctaccttaca ggaatcaata 
cctgtgacaa 1620 ttacgggtca cagggaagct ccaacaacac ctcctgtggc tgatacaaca gccaaaaaaa 1680 caatgactac tggcgtcctg gaaatggata ccagtaaaca cacaatatcc tcaaacgacg 1740 aagcgtcacc gaggaaaaaa ttaaatgtga aggaaaagcc cagcaatcct gagaaggatc 1800 cagggctaaa cctgccaccg aacctccagt aggatgagaa ctcctacatg tttgcttttc 1860 cttgttaaag attgattttc ccgttattta taatactaaa acaaaaactg ataaaaacac 1920 tagaaaaaaa aaaaaaaaaa aaaagagaaa gacaacaaat tcatattcat acatacatac 1980 ttacacccat tatttaatta ctgtgaatga agagaaatga atcctatatt aacagtttaa 2040 caataggctc caaaaacatt ggcagccatc aatccactga tttcaagaaa ttacttgata 2100 tattccttaa actaataggt gaacacctga tgattgacat ttggtttatc caagaaatca 2160 gatttgtatc tcaggaacaa tttaattata tcaacaaaat attaaaacaa cacaatgcac 2220 agcttagaat gcatcatttt gaagatctaa cgggttttct tattcattcg ccacatgcta 2280 aactaaaatt taaaataaga gacaatgaca atcatcacac aacacatttt gaaggtcgta 2340 ttagcatctt ggatattacg ttaattacaa atgaagacat aactttgatt aataactatt 2400 tacatagtgg aaatatggat gcacagatga ctctactaaa agcattcatt aaatatattg 2460 caaacttgaa aaaacacacc aaccataata taatttatgg tggtgattat aatcacatta 2520 tgctgctaga tgatgtacaa ctaccactgg accaaactcg ttatattatc tccaaaaaag 2580 agctggaaat tattcagcta atgtcaaatt tttataaaaa atggaaactt caagatgctt 2640 tccagatacg caacaacctt caacccacaa attttcattc taacaaatct gtaaagaaaa 2700 ggttagatag aatatatatt gactccagga ttagaagaaa acttcgaaac tgccgaattc 2760 tagaagagtt taagcaaatc tcgactcaca aaattattgc gatgtccttc caaattcaaa 2820 aagaacctat cctaaaggtt ggtaacccac gttacctgat acctcaatgg atgagtcaag 2880 atgaaaatat aattaaagat ctaaataaca atgaacaaag tagtctcact cctttttcta 2940 attggaatgg aattattaac cggataaaag aaaaggtgat attttatgaa aaataccaac 3000 gatatatcag agcatacatt ccacatgccg gaaattttcc tgatgaaaag atgcttaaat 3060 tcagatttag gccctcgttt aacttttcca ttattacaga aatgcagact gaatctggta 3120 atacagttca agacacagaa ataatgataa atttagcgac aaaattctac caagatttgt 3180 tcttggtaga agataggcat ttggaatcat ttacctttgt ggaacagttt gacaaaaaga 3240 tagacgatac tgataaagtt ttgttggaaa aagctttcat tgaagaaaat gtatatgacc 
3300 atttactaat gatcaataag aaaactgcag tgggaactga tggcatttct taccaaaatc 3360 tcattgaatt atggcccagt ttaggtgaaa gcctcataag agcaggaaat aatattttga 3420 aatatggaac tttgccacaa caaatgagtg aagtgattat tacattaata cccaagaaac 3480 tgaagacacc tataattgaa aactttcgac ctatcagtgt tatcagctgt gccgtacgat 3540 tgttatcatc agtaatagaa aaacaactaa atccagtctt ggcaaaggtt attgagaaaa 3600 ctcaaacagg cttccttaaa gaaagatcca ttagtaatag catatacctc ctcgatatgg 3660 ttcttacaag atatcagact tctaaaactg cagatgcaga atcagcaggg tttattaatc 3720 ttgacttcag aaaagctttt gattcagtac atcatgattt tatattgaaa gttttacaac 3780 aagtaggatt cgggcctaag gcgacgaatt tcttgatggc aataactgca aagcaaaaag 3840 caaaagtcag tattaataat attgaaggcc catgcttccc acttaaacgt ggggttagac 3900 aaggcaatcc tatttcacct ttaatcttta ttttaatctt agagaccttt ctcgctagat 3960 tgagtaagga aattgaggga ataggagtgg tgaatgaagt ttcgctggtt gcatacactg 4020 catatgcaga tgatgtgatc atctttttca agaacaagaa tgatcaagaa agaattcaac 4080 aacttttaga agactttgga agggaatcag gcctttatct taacaataat aaaacagagg 4140 tttgttattt caatgacatt ccggagattt cctttttacc atatgtttca aagaagctac 4200 aattagaaaa attaacatat ctaggagtac cgatgaaaaa agcagatgaa gagttcgatc 4260 cttggacact gtttgttcta aacctcaacg ctcaaattcg aatgacaccg atactggatc 4320 ttccatatca gttgattatg aaattgatga atatctttat attttctaaa ttatactata 4380 gagatttaca ttcaccgatt ctgacgacag cagtgtcgag tattataaca acagttcaac 4440 aaagacttcc attatacttc aagcttcaga gattgcaaac ccctaaccat ttgggtggtt 4500 tcggtttgat gaaccccaat catcaagtaa aaggtcgaag aggtaaacag atatatcttt 4560 tatacacaca agaagatgac ttgataatta aatttatgag aactaaaatt caagatattc 4620 ttgataatat tgcaaaggat tattatacaa tgccaacgga accagataaa ttgatagttt 4680 atccatggta tctttttggg atgggtgtat cctgtcatct gtctctacaa tttagatact 4740 tgaaagaacg agtatatgag aatcttacaa agctggaaat ctcgtggttt gaagcttggt 4800 ttcaattggt ccactacaca ggccctcctg tggaaactcc aattataatc atgctgatag 4860 aagactatgc aaatttaatt atacaacctc aaacggaaag cattaaatta atgtctccga 4920 attttgaaga acaatcttta tctgagcaaa cttttcatca cacttcaagg aaattatgtc 4980 
ctcaagctcc gattattcca gaaggttggg gcaaacattt tgatacaatt aaaagtctaa 5040 ccatttcaga ttggactgaa ttttggaaga atatgaacaa agtccaaaaa cagagcgttg 5100 gatctttaca agattatcat ctttttatcc ttggttatta ttcacattat cctttttatc 5160 ctaattataa aaaactggag atacactttt gccaattgtg taatacgggt actgattcga 5220 tagtacatca tatttttgaa tgtggtgaga ctatggaatt atggctgaga cattttaata 5280 gtcagagaac tccacaattt attattggaa acaaacatct acataagaga gatttatatg 5340 ccttaaacga gtacatcaag gaagtggttc aaaaggtgaa acgacgaaga ggttcaccaa 5400 ttttgaatca gggagaaagg gaaaatgtgg acgctggaac aaatgtactc gtttagacat 5460 aacaacaaca ctgcttaatt ttataggaag attgcttata caatgcctcc aagcgttgtc 5520 aataataaac cacacaccac atatcataca cgatggtttt taagatattc tcactgagta 5580 tttctttcca tgaaaatggc ctcaaaaggt tttccatctt gaacttatta aaataaatga 5640 ttgtaacccc ctcgtatgtt tatagttata tacctgtata taaggactaa atatatgttg 5700 agaaag 5706 // ID Gypsy-85_MLP-I repbase; DNA; FNG; 5716 BP. XX AC AECX01002117; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-85_MLP_; KW Gypsy-85_MLP-LTR; Gypsy-85_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5716 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01002117; Positions 96064 101779. XX CC Positions [4500-4994] - Integrase core CC 'TCTCC' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 412..1491 FT /product="Gypsy-85_MLP-I_1p" FT /translation="MSDTNMTSLEDIQRKLAELSTSLAEERLMRGQAEARY FT QQSEARLAALESRGPSAPAAATQPPTAPQPPPAEPVLKGPKVSTPDKFNGA FT RGEPAEIFASQVQLYMLAHPYLFPDDRSKVVFSLSYLTGAASSWAQPLTKE FT LFDPATAHLVTFERFVTNFKAMYFDTEKKSKAERALRNLSQKTTVAAYTHE FT FNIHATSTGWETPTLISQYEQGLKKEIRVAMVMTQEDFTSIEQITNLAIKI FT DSKLHGATNPTFHTPSTSAGISDPNAMDLSAAYVRLSEEERARRMRAGLCF FT RCNGQGHISSACPDRRASRKGKGRGGYQSKIAGLEIKVAELSNQRVNHTSS FT LEGVGRAETSKNGEAQD" FT CDS 1557..5618 FT /product="Gypsy-85_MLP-I_2p" FT /translation="MNLCNTNDPRIFLHTTLSISQIPRATPSQTFPATFLI FT DSGATHDVLSESFAYHAGLIAGADRHDQLITGFDGSESRSSFKTHLFIKSD FT PLPTPFIITRLKDSYDGILGMPWIKKNFHLIDWSKSMVHPAETHVATTSLV FT SYWPKTPLSGHSLDPMRDARHNYEGMCVNSDTLASPQCEFESPVNPNHLET FT DDNPLSPDSSPLITITPTTNNENHEHITTQDPDTSIAAVVNTASSVPPQNP FT DDPKEEPQGHARNSDEGANVFLDISTPPQCEFITDSNPCSTRTAGKPVSPL FT NNTSIQVTAAKTSWSTSARLAADKKKLEPVKALEEIVPSRYHRHLHMFRKS FT NAQRLPPRRRYDFRVELIPGAQPQASRIIPLSPAEDAALEELVNSGLANGT FT IRRTTSPWAAPVLFTGKKDGNLRPCFDYRKLNTLTVKNKYPLPLTMDLVDS FT LLDAEDFTKLDMRNAYGNLRVAEGDEDKLAFICKQGQFAPLTMPFGPTGAP FT GYFQYFIQDILLGRIGKDTAAFLDDIMIYTKPKVDHEGAVDGVLDVLSKHT FT LWLKPEKCEFSKKEIEYLGLIISKNKIRMDPTKVKAVKEWPAPQSVSELQR FT FIGFANFYRRFIDQFSRTTRPLHNLTKLNAHYVWDEKCEKAFESLKTSFTT FT APVLKIADPYKPFTLECDCSDFALGAVLSQRSEDDGELHPVAYLSRSLVQA FT ERNYEIFDKELLAIVASFKEWRHYLEGNPNRLDVIVYTDHRNLESFMTTKQ FT LTRRQARWAETLGCFDFVIKFRPGSKATKPDALSRRPDLAPTNEEKLTFGQ FT LLRPENIGPDTFPVELASVEGFFEDEVVALDNSEHWFEVDVLGIAETDTKA FT PDIPTDDQIIQRIRDASKECDRIGELMNATLNPISAKTKAALKHYKVQDGI FT LYNQGRIEVPDVDDIKYSILRSRHDSLLAGHPGRAKTLGLVRRSFIWPSQK FT AYVNRYVDGCDSCLRVKSTTQKPFGTLEPLPIPAGLWTDISYDLITGLPAS FT NGYDSILTVVDRLTKMSHFIACTESMSAEDLADLMMKNVWKLHGTPKTIIS FT DRGSIFISQITEEIDKRLGIRLHPSTAFHPRTDGQSEIVNKVIEQYLRHFV FT KYRQDDWESLLPTAEFSYNNKDHESTGISPFKANYGFNPSFNKVPSAEQCV FT PAVEKRMQTLAEVQSELNQCLELTQEKMKEQFDKRVQQTPRWEVGDQVWLN FT GRNISTTRPSPKLENRWLGPFFIIQKISNSTYKLNLPLSMKSVHPVFHVSV FT LRKHNPDTIHQRKQPPVEPIIIQDEEEWEVNEILDCRRRYNKKEYFVSWKG FT 
FGKENNSWEPEINLKNSQDLVNEFNLKYPDATNKYKRRRRKK" XX SQ Sequence 5716 BP; 1746 A; 1441 C; 1255 G; 1274 T; 0 other; tattgtcgga tctatagacg agactcaagg actatagata gatcaccaag aagaaacaag 60 tagaattgac ttaaatcgaa agaaaaccga acctcaagat tcagaatact agcgaatctc 120 gacctcatta gaaaacctta tatagatctc agaaattcaa agttgaaaag acccaagaaa 180 ctattcgaaa tcgaaaacct tattaactaa tctgcataca ctgaaaccga atccaccaga 240 accttatctt acttactaaa aactctgtcg atcccgactg tgatccctgg actagcaaac 300 gcaaccacgt ctccatctta ccgtgctcct ttagaggagg aggactctga tagtgaaact 360 ccgatccctt ttctcgacgc cgattctagc cttaccggta gcgagatcga tatgtctgac 420 accaacatga cctctctcga agacatacaa cgcaagcttg cggagctttc cacctcattg 480 gctgaagaac gcctgatgcg aggacaagct gaagccagat accagcagtc cgaagctcgt 540 ttagcggcgt tggaatcccg aggcccatct gcacctgctg cagccactca accacccact 600 gcgccacagc caccaccagc tgaaccggtc ctgaagggcc ccaaagtctc gacgcccgat 660 aaattcaacg gcgctcgagg cgaacccgca gagatctttg cgagtcaagt ccagctctat 720 atgctggcac acccctacct cttccccgac gaccgaagta aggtggtttt ctcgctgtct 780 tacctcacgg gagcagccag cagttgggcg caacccttga caaaggagtt attcgacccc 840 gctaccgccc acctcgtaac ctttgaacgc ttcgttacaa attttaaagc tatgtatttc 900 gatactgaga agaaatcgaa ggcggaacgc gcgctgcgaa acttatcgca gaagaccacc 960 gtggccgcgt atacccatga atttaacatt cacgcaactt ctactggctg ggaaactccg 1020 acgctcatta gtcaatacga gcaaggtttg aaaaaggaga ttagagtggc aatggttatg 1080 acgcaagagg atttcacgtc aattgaacaa atcactaatt tagctatcaa gatagacagt 1140 aaattacacg gcgcgaccaa cccaaccttc catacgccaa gcacgagtgc aggaatctca 1200 gaccccaatg ccatggactt atcagccgcc tacgttagat tgtccgaaga ggaacgagca 1260 cgacgtatgc gagcgggttt atgcttccgc tgcaatggcc agggtcacat ctccagcgca 1320 tgtcccgatc gtagagctag taggaaaggg aagggtcgag gtggttatca atcaaaaata 1380 gctggattgg aaattaaagt cgcagagttg agtaatcaga gggtgaatca taccagtagt 1440 ttagaaggag taggacgagc tgaaacttca aaaaatggag aagctcagga ttgaaggttg 1500 tgcccatcct gagccacctg gagaatttgt catcaattaa tgtgggatct agtagaatga 1560 acttgtgcaa tacaaacgac ccgcgcattt ttttacacac cacattatcc 
atatcccaaa 1620 ttccccgagc cacaccgtcc caaacatttc ccgctacgtt cctcattgat tcaggtgcta 1680 cgcacgatgt gctcagtgaa tcatttgcct accacgctgg cttgatagcc ggtgctgatc 1740 gacatgacca attgatcact ggattcgacg gctctgagag ccgatcttcc ttcaagaccc 1800 acctttttat caaatccgac ccattaccga ccccattcat cattacaaga ctcaaagact 1860 cgtacgacgg catacttggc atgccatgga tcaagaaaaa ctttcacctg attgactgga 1920 gtaaaagtat ggtacaccca gccgagactc acgttgcaac cacttcattg gtttcgtatt 1980 ggccgaaaac acccttatct ggccactcgt tggaccccat gagggacgct aggcacaatt 2040 acgaggggat gtgtgtcaat tctgacacgt tagcatcccc gcaatgtgag ttcgagagtc 2100 cagttaatcc taatcacctt gaaacagatg acaaccctct atctcccgat agttcccccc 2160 ttatcacgat tacacccacc acgaacaacg aaaaccacga acacatcacg acacaagacc 2220 cggatacgag cattgcggcc gtagtgaata cggcctcgtc cgtcccgcca cagaaccccg 2280 acgatcccaa ggaggagcct caggggcacg caaggaacag tgacgagggg gctaatgtct 2340 ttttagatat atcaacgccc ccgcaatgtg agtttattac tgattccaac ccatgttcaa 2400 cccgaacagc tggcaagcct gtttctccct tgaataacac atcaatccag gtcacagcag 2460 ccaagacgtc ctggtcaacc tcagcacgac tcgccgccga caaaaagaaa ctcgaacccg 2520 tcaaagctct agaggaaatt gtcccgtcta ggtatcatag gcacttacac atgtttagga 2580 agtccaatgc ccaacgccta cccccgagac gccgctacga ctttcgtgtc gaactgatac 2640 caggagctca accgcaggct agcaggataa ttccattatc accagccgag gacgctgctt 2700 tggaggaatt ggtgaattca ggcttagcca acgggactat ccgtcgaacc acttcgccat 2760 gggctgcccc tgtgttattc acagggaaga aggacggcaa tctgcgccct tgttttgatt 2820 accgtaagct caacacccta accgtcaaga acaaataccc actaccgttg actatggact 2880 tagtcgacag tttgctggat gccgaagact ttaccaagct tgatatgcgc aatgcgtacg 2940 gtaatcttcg agtagcagag ggtgatgagg ataaactggc ctttatatgc aaacaaggcc 3000 agtttgcacc tctgactatg ccttttgggc caacaggcgc ccctggttat ttccaatact 3060 ttattcaaga tatcttattg ggacgtatag gtaaagacac cgcggccttc ttggatgaca 3120 ttatgattta tacaaagcct aaagtagatc acgaaggcgc tgttgatggc gtacttgacg 3180 tcctatccaa gcatacactc tggctgaaac ccgaaaaatg cgagttctcg aagaaggaaa 3240 ttgaatacct tggccttatt atctctaaga ataaaatcag aatggacccg acgaaagtga 
3300 aagccgtcaa agaatggcca gcaccccaat cagtatctga actccaacgc ttcattggat 3360 tcgccaattt ttaccgacgt ttcatcgacc aattttcaag aaccacaaga ccacttcata 3420 atctgaccaa gctcaacgcg cactatgtct gggatgaaaa atgcgaaaaa gcctttgaga 3480 gcctgaagac ctccttcacc accgcacccg tactgaagat agccgatcca tacaagccgt 3540 ttactctgga atgtgactgc tccgattttg cattaggagc agtgctatca caacgcagcg 3600 aggacgatgg agaactccac ccagtagctt acctgtccag atcactagtc caagctgaaa 3660 ggaactacga gatctttgat aaggaactac tggcgattgt cgcgtctttc aaagagtggc 3720 gccactacct ggagggaaac cccaaccgcc tagatgtgat agtctacacg gaccatcgta 3780 acctggagtc tttcatgaca accaagcagc ttacccgcag acaggctcga tgggcggaga 3840 cactgggctg tttcgatttc gtgattaaat tccgacccgg cagtaaagct accaaaccag 3900 atgccttatc tagaagaccg gacctggcgc ctactaacga ggaaaaattg acctttggcc 3960 agttgttaag acccgaaaac atcggcccag acacgttccc ggtcgagctt gctagcgtag 4020 aggggttctt cgaagatgaa gtggttgcac tcgacaattc tgagcactgg tttgaggtgg 4080 atgtattggg catcgctgaa actgacacta aggcaccgga cattcctacg gatgatcaaa 4140 ttatacaacg aatcagagac gcaagcaaag agtgtgacag gataggggaa ctgatgaacg 4200 caaccttgaa cccgatatca gccaagacga aagcagcttt gaagcattac aaagtacaag 4260 atggtatcct ctacaaccaa ggtagaatag aggtacctga cgtcgacgac atcaaatact 4320 caatactaag aagcagacac gactcattgc ttgcgggcca cccaggtcga gcaaagactc 4380 tcggattggt caggagaagc ttcatctggc catctcagaa agcgtacgtc aacaggtacg 4440 tggacggttg tgactcttgt ctaagggtta aatcaaccac acagaaaccc tttgggactc 4500 tggagccatt accgataccg gctggcctgt ggacggatat ctcatatgac cttatcaccg 4560 gcctccctgc atcaaacggc tacgacagca tattaaccgt ggtggataga ttaacgaaaa 4620 tgagtcattt cattgcttgt actgaatcaa tgtcggctga agacctagca gatctcatga 4680 tgaagaacgt atggaagctt catggaactc ccaagacgat aatctcagat aggggtagta 4740 ttttcatttc acagatcacg gaagaaattg acaaacgcct aggtatacgc cttcacccgt 4800 caaccgcttt ccacccacgt acagacggtc aaagcgagat cgtcaataaa gtaatcgaac 4860 aatatttacg acatttcgtt aaatatcgac aagacgattg ggaaagtcta ctgcccacag 4920 ctgagttctc gtataacaac aaagaccatg aatccacggg catttcaccg tttaaagcta 4980 
actatggatt taacccatca ttcaacaaag taccatcagc tgaacagtgt gtgccagcag 5040 tagagaagag gatgcaaaca ctcgccgaag tgcaatctga attaaatcaa tgtttagaat 5100 taactcaaga gaagatgaaa gaacaatttg acaaaagagt acaacaaaca ccaagatggg 5160 aagtaggaga ccaagtctgg ttgaacggac gaaacatttc gacaactagg ccaagcccaa 5220 aattagaaaa tagatggcta ggtcctttct tcattatcca aaagatatca aattctactt 5280 ataaattgaa tctccctctt tccatgaaaa gcgtccatcc tgtatttcat gtctctgtac 5340 tcaggaaaca taacccggat acaatacatc aacgaaaaca acccccagtc gaaccaatca 5400 taatacagga tgaagaagaa tgggaagtca acgagatact agattgtaga agaaggtaca 5460 acaagaagga atactttgtg agttggaaag gatttggcaa ggaaaacaat tcatgggaac 5520 ccgaaatcaa cctcaagaat agtcaagatt tagttaatga gtttaattta aaatatcccg 5580 acgcaacaaa taagtacaaa aggagaaggc ggaagaagtg agagggctat gctttttccc 5640 actgggtttt ttaacgcagc ccgtggaaag aatgcagagc ttgcaagagg aagcttgggc 5700 attaaagggg ggataa 5716 // ID Gypsy-38_MLP-LTR repbase; DNA; FNG; 189 BP. XX AC AECX01001140; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-38_MLP_; KW Gypsy-38_MLP-I; Gypsy-38_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-189 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001140; Positions 240522 240710. XX SQ Sequence 189 BP; 53 A; 47 C; 33 G; 56 T; 0 other; tgttatgatc ctataacata tgtaacaacg agtgaaagac gggttagaca tgtcacggat 60 agagattgag tccctagttg tacacattgc tattgtacaa gtttctcttc tcttcatccg 120 acaaactact ttatagactt gtcaccagat ccttgaccct cgtcccagac cccagtactg 180 gccttaaca 189 // ID Copia-25_MLP-LTR repbase; DNA; FNG; 779 BP. XX AC AECX01001060; XX DT 22-APR-2011 (Rel. 
16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-25_MLP_; KW Copia-25_MLP-I; Copia-25_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-779 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001060; Positions 56648 57426. XX SQ Sequence 779 BP; 237 A; 150 C; 116 G; 276 T; 0 other; tgttggtgtt caatcgactc aggatgagaa gatgaagaat agaagcaagg agttacaatt 60 tcctaattac atgttacgaa agctatgacg aaatgaaaac acttactatg acgtcaatgt 120 atctgaaagc ctttataaga tagttacttt tcatttgtaa cgttttctct ttcctcatct 180 ttggaaagag aaacgtacgt atcccatctt agaacctttt catattatgt atccaactaa 240 ctgttataac attgtagcta tctttcactt agatcactga gagctcacca taacatcgtg 300 tgctctactt tatcaaatct atctacatta taggtattta cttttgttat caataataga 360 agagtcgaga cttatctttc atatttatat ttgtatcttt ctcttatttg ttgtaggtca 420 gagttctttc ctataacagg agaacttcta gtgatcctta tctaacttat agcgcatatc 480 aggtaattaa taatctttgg aaagagaaac ctatctttca cttagatcac tgagagctca 540 ccataacatc gtgtgctcta ctttatcaaa tctatctaca ttataggtca gagttctttc 600 ctataacagg agaacttcta gtgatcctta tctaacttat agcgcatatc aggtcagagt 660 tctttcctat aacaggagaa cttctagtga tccttatcta acttatagcg catatcagtt 720 cctctttgtc acaggaacat ttatcacaga ccgtattcca agtcacgtag acattttca 779 // ID MarinerN-2_AO repbase; DNA; FNG; 484 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE It is a family of nonautonomous Mariner DNA transposons- a DE consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW MarinerN-2_AO. 
XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-484 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-484 RA Kapitonov V.V. and Jurka J.; RT "MarinerN-2_AO, a family of nonautonomous Mariner DNA transposons RT in the Aspergillus oryzae genome."; RL Repbase Reports 6(1), 41-41 (2006). XX DR [2] (Consensus) XX CC This nonautonomous family of Mariner DNA transposons is CC characterized by TA target-site duplications and 23-bp TIRs. XX SQ Sequence 484 BP; 179 A; 95 C; 55 G; 153 T; 2 other; cagtaaaacc tctatataag caactccgat ataggcaata tacccgatat aagcaatcta 60 ccaggctgtc tcctatcgtt tcccatacaa aattgcagaa aacctcgata cagggaactt 120 tcagtaatct gctcagtctc ccatacaaaa ttccttataa cgaggtatag tgggtctaac 180 taaccaatta gaaaatttac taaaataata taaagtcacg tgtaaatttc tcgggatact 240 aatttagtct accactacta cctaacaata acaaaaatat atataattac aaaaattctt 300 ttactctacc ttactctcag tctttatact aaagaaatct actaagaata ctatataata 360 stacattatt atactatatc tatattttag aaggacttat atagtgattt taatataagc 420 aatacctcga tataagcaat ttaggcagtc agctcaawcc cttgcttata tcgaggtttt 480 actg 484 // ID Gypsy-12_RO-LTR repbase; DNA; FNG; 479 BP. XX AC AACW02000311; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_RO_; KW Gypsy-12_RO-I; Gypsy-12_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-479 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000311; Positions 14630 14152. XX SQ Sequence 479 BP; 193 A; 84 C; 45 G; 157 T; 0 other; tgtaaggatg cagattataa attttgaccc gaaggtacat aaaaattaat gcaaggcaca 60 aaaattaaaa ggtgcattaa aatcaaggca tgctttaaat tttaaagcat gttataaaat 120 tcataaaaac tataaaaaca attaattcac ttacttttga caaacttact tctgacatac 180 tttacttttg attattttta aataaatatc attcaaagca aatctctttt attactatta 240 ctattttact aaacatcaac aacaagtcga gtcccttgaa gccaaatatc ttcattgtat 300 cactgttaaa gctcatatca agctactatt acctttatat caagactgat ctactttaaa 360 acaagatctg gtggtcactc ctatcaagaa aatactcaaa tataacttta cttcatattc 420 aaaactactt taaacccttt ataaatacac aaggcctaaa tagaaaacaa gctattaca 479 // ID TSE3_LTR repbase; DNA; FNG; 947 BP. XX AC AJ439555; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Saccharomyces exiguus retrotransposon TSE3_LTR, long terminal DE repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Long terminal repeat; RNaseH; TSE3_LTR; gag; integrase; pol; KW protease; retrotransposon; reverse transcriptase. XX OS Kazachstania exigua OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Kazachstania. XX RN [1] RP 1-947 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439555; Positions 1 947. 
XX SQ Sequence 947 BP; 347 A; 206 C; 156 G; 238 T; 0 other; tgtaaccaag atggcctttt ctggtaacca aaaaatgacc ttttggtgac acaaaacctc 60 catatcacaa ttactcaatt aaatggcatc tctatgagag ctatgacatg aataagagtc 120 actagcctca aatggcatag tgtgacactt tattccatca ctttaaaccg tctcctcagc 180 aattaggaac gtcaggagca tatcacccag aaacagccgt agaacagtaa atcgacctgc 240 ttgatgggac atgtgacatt ccaccgtgca ctttaggaaa tcacataggt aactgcctgg 300 gatataaata ttaaatccga acacccgaac caatcagacg aagcacaacc acttcggtga 360 caagggacga tgagttgcac aacctgcaca agctcactta gctaacaggg caacgggagt 420 ttgtccacac tggacagtaa tgacaaagaa gaagacattc ctcgttcgaa cgtattctaa 480 tgaatacaca tcgaaggatt ggagaacatt ctgcattcag aaattctcta cattttctat 540 ataagaataa tacaaacttt aagtttgttt atttcctgtt atctttcaga agaaagaacg 600 aactaaggtt aaggaagaga actaagactt attaagaaga tcaatcgata atcaacgtta 660 acttaatcta ttaagcataa cgaagaatta caaaaaagca caaaaggatt aataactctg 720 tttaactatt actagctatc taagacctct acgcaagtta gaatcgaaga acctagaatt 780 aaaatagcat aagcaagacg tacactctct tcttaccctc cttacttgga acaagaaata 840 ttcaacctta ctagttcgtt cagaaagaca tctcaagaac gactcagttt gaataaccta 900 cggacgcatt ccaaccctaa tcacaacaaa acacctgagt cgttaca 947 // ID Gypsy-8_LBS-LTR repbase; DNA; FNG; 695 BP. XX AC ABFE01000288; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_LBS_; KW Gypsy-8_LBS-I; Gypsy-8_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-695 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000288; Positions 192782 193476. 
XX SQ Sequence 695 BP; 211 A; 147 C; 116 G; 221 T; 0 other; tgttatagta cctctatttg tcccctatcg ttccattact ctaaggttca ctcagatcat 60 ttacattgta ttttctattt gtgattctca aatagcgcgt aggacatcta gttgtagtgc 120 gttaatatac ctcgaatcca gttcttgaat gtgttagcga cgtgtttgca gtataggcga 180 ttgacaaata gaacctaaca aacaaacatg tccaagcgcg tcgaacagag acgaacaagg 240 acggaacatt cttaaacatt cttttaagtt cttctaggtt ctggaactaa gtactttagg 300 tatataacat gtcagtaaag ctacagaaag gatactcact ttttacccca catttgtctg 360 aacttaagat cttgaataaa agacttagtt agctgtttta agtacttgat aaagcatttg 420 agctatcaca tacttgatcc gaactgttaa cgtctgagga ctaacatttt agatttacag 480 cgttctttgt caacaattca cgacttgttt cacgacgtta ctcgttaccg aatcactcat 540 caatcacgct aagaactcac gtgatctagg gaatacttat cacacttgac tcttcaacgc 600 gtcgactcag ctatacgtac gttttgattc aactactaca actgacattt gtattcatag 660 gaatacacca acgtacgagt cagcgaactc taaca 695 // ID Gypsy-62_MLP-I repbase; DNA; FNG; 6584 BP. XX AC AECX01001306; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-62_MLP_; KW Gypsy-62_MLP-LTR; Gypsy-62_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6584 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001306; Positions 43142 36559. XX CC Positions [4801-5151] - Integrase core CC 'GATTA' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 656..3829 FT /product="Gypsy-62_MLP-I_2p" FT /translation="MNPWREGRDSPAISSIFGRDLPPHQVPGSELSDPHAI FT LLDSPRPHLINQFNRLPHSQSPVSRLAQGLNNLDFRTNSISFAHQPGHQNA FT QIPQQHFTARMTGAAPAFHPQVFQQPLSFQPNHNHQQYPNQQSVPPIHNQF FT VPNHFVPPPQAQYVHPQQQFQTGPIPPTHAYQVPLQQPIPQTTNQTAPLQR FT PPSRMPNSYITDNRAPKICDNLRFFGDNKGLKQFLVEIHDELDQITLHDDK FT MKINWIARHFTSPTSTPSSMQIWFMGLLERNAFQQGVQSLYGNLKSLDYVL FT PELASLAIFLDELIYKFGDKNSDKTAREELDACKQGKLSIIDYNAKFEQLS FT LHVKKSEEDKILQYVEGLHPSIQLEATRIAGWVRETDLLCKQAMALEAADI FT LYLRSKVAHNHPHLRTPGESYRHPQHHQNPINQQTNGPVPMDINVNAVSLE FT RSGSNPFPVIRRICNEKNLCYDCLKPYDDEHKCLRSLKGKQSCPNPYVCVK FT EKLKVLCTSMKLPSDGRHTSPTQISAMQLEELEYAALPTEVIEETGRLVEA FT YWESLGSPAYPLPSSSQSQPMSQDVQVDAIRIQSDVNRLRRFLIPLSIISQ FT NLPVAIMALVDTGASDSFIDAEFADLHHLNLNKKFIPQNVLGFDGGPAKRV FT TKEWCGLLSMNDVEGKETRLKAKLGVTKLGGGNDVILGLPWMEENGATLWM FT NKNRRWLGIGEKVLSAVVVEEELVDLAVVYCEQDSLPISFSSSKAPTASTF FT SQETPSITLNPTLAKFFPLNLPKSCTRYLHVFSPQVSVLPPHRSFDIAIEL FT KPGCKPPFGGLYNLSPDEQSELKTYLNDQLSKGFVRPSKSPAAAPIFFVKV FT PGKKNRPCVDYRGLNKITKRDSYLIPVMSWLLNQLRGCKFFAKIDLKVAFN FT LLRVAKGDKWKTAFRTPWGLFEYTVMPFGLANAPAVFQRFIQWVLREYLDV FT FCFVYLDDILIFSKNADEHELHIEQVLAKLSEHKLTASPEKCQFFAKEVIF FT LGFVISTEGISMDPAKLKTIADWPFPQELTDLQRFLGFSNFYKQTFPESWV FT P" FT CDS 4153..5151 FT /product="Gypsy-62_MLP-I_1p" FT /translation="MGTKVPVTVLSDHANLRYFMTSQMLTPRQARWASFLG FT EFNFEILHTPGKSNPTDPALRRSDFVCGKQDSARVILLGLREVKDIGVNAI FT YISHPGHFNISSYMPVADELMVTIKHSYHADDLIAGIHPSFLHFLDGLWWW FT RDRLYVPLALRISIIKQIHKSTIGGHWGTLKTLGLLTRSFGWPNARRDVLD FT FIKGCSSCQQVKVDHRAPQGQLVPLPIPDRPWSTIGVDFIVKLPISDTFDS FT VMVVVDHFSKVAHFVPAKESWSAEELAKAFVAQVFRFHGLPDAIVSDRGTT FT LVSSFWTSVLKLLQISPAPSTAFHPQTDGQVERVRASPPRT" XX SQ Sequence 6584 BP; 1812 A; 1448 C; 1394 G; 1930 T; 0 other; tttttattct gatcttcatt ctgatcggat aatttttttt ggaaagaagt tttttatgtc 60 attggaaatc atcgaccttt ctcaggctta catattcacc aacggaccgg atctcaattt 120 cgctgctgat actgtaaaat tctcgagcga agatttatta tcgtcattgg aagaattacc 180 ttattttaat tgtccattgg aatccccatt ttttaatatc tcatcgaccg 
aattactcga 240 taatttcaaa cttctaccct acttttatta tcctcatcac aacccctttt caaatcaact 300 catccctcac cctctcaccg ttgacgcgtt gaatcattat ctatctcttg aaagactaga 360 atacgagaga ctttacttcg aacgactcaa ccgttatcga gaagttaatc gaactcctgg 420 aacttcattg gattcattac cagtattcgc gccgatccca acaccacttt cgcccactct 480 ttccccaaga aatagtgttt caccatctca atctctggta gatcgatttc aagaattagc 540 agtcacagaa tcaactccct tcagaacttt ccttaggaat cttttagaaa gaggccttct 600 tatcgattca agctattttt tggagttttt cggtttcacg gattaattga taaagatgaa 660 tccttggcgc gaaggtagag acagtcctgc tatttcttcc atttttggta gagatctccc 720 accgcatcaa gtgccaggca gtgagctatc ggacccccac gctattttac ttgattcccc 780 tcgtcctcat ctcatcaatc agttcaatcg tcttccccat tctcagtccc ctgtttctcg 840 actagctcaa ggattgaaca acttggattt caggactaac agtatatctt ttgcacatca 900 accaggtcat cagaatgctc agatcccgca acagcatttt accgcacgaa tgacaggtgc 960 agccccagcc tttcacccac aagtgttcca acaaccactg tctttccaac cgaatcacaa 1020 tcaccagcaa taccctaatc aacaatctgt tcccccaatt cacaatcaat tcgtccccaa 1080 tcattttgtc ccaccgcctc aggctcagta tgttcaccct cagcaacaat ttcaaaccgg 1140 ccccatacca cctactcatg cttatcaagt tcccttgcaa caaccgatcc ctcaaacaac 1200 caatcagacc gctcccttgc aacgaccacc ttcaaggatg cccaattcgt acatcacaga 1260 caacagagcg ccaaagatct gtgataactt gaggtttttt ggagataata aaggattgaa 1320 gcagttcttg gtggaaattc atgacgagct agatcaaatc actctacatg atgataaaat 1380 gaagataaat tggattgccc gtcacttcac ttcgcctact tcaactcctt caagtatgca 1440 aatttggttt atgggattac tcgaacgtaa tgcatttcaa caaggtgtac aaagtttgta 1500 tggtaatttg aaaagcttag attacgtact tccggaactg gcatcattgg caattttttt 1560 ggatgaattg atctacaaat tcggtgacaa gaactcggac aagacggcta gagaagaatt 1620 ggacgcttgt aaacaaggaa agctttcgat catcgactac aatgcaaagt ttgagcaatt 1680 gagtttgcat gtgaagaaat cggaagaaga caagatcctt cagtatgtag aaggattgca 1740 tcccagtatt caattggaag ctacgaggat tgcaggttgg gtacgggaaa ctgatttact 1800 ttgcaagcag gcaatggctc tcgaagcggc ggacatactg tacttacgat cgaaagtcgc 1860 tcacaatcat cctcatctgc gaacacctgg agaaagttat cgccatcctc aacaccatca 1920 
gaaccctatt aatcaacaga ctaatggacc tgttccgatg gacatcaatg tgaatgcggt 1980 atccttggag agaagtggtt ccaatccgtt cccggtgatt cgacgtatct gcaatgagaa 2040 gaacttatgt tacgattgtt tgaagcccta tgatgatgaa cacaaatgtt tacgttcttt 2100 gaaaggtaaa cagtcgtgtc ctaatcctta tgtgtgtgtc aaagagaaat tgaaggtgct 2160 ctgtacttca atgaaattac catcagatgg acgacatacc tctccgactc aaatctcagc 2220 aatgcaatta gaagaattgg aatatgcagc tttaccgaca gaggtcattg aagaaactgg 2280 acgactagtt gaagcttatt gggaaagttt aggatcacca gcttaccctc ttccttcaag 2340 ctctcaaagt caaccaatga gtcaagatgt ccaagtagat gcgatccgca ttcaatccga 2400 tgtgaacaga ctgcgtcggt tcttaattcc tctcagtatc atttctcaaa atcttcccgt 2460 agcaatcatg gcacttgttg acactggagc aagtgacagt tttattgatg cggaatttgc 2520 tgacttacat catttgaatc tgaacaagaa gttcattcca cagaacgttt tgggttttga 2580 tggaggtccg gcgaagagag tgacgaaaga gtggtgtggt ttgctgtcta tgaatgacgt 2640 tgaaggcaaa gaaacaaggt tgaaggcgaa attaggagtg actaagctag gtggaggaaa 2700 tgacgttatt ttaggactac cttggatgga agagaatggt gcgacgctgt ggatgaataa 2760 gaacagaagg tggctaggta ttggagagaa agtgttgtcc gcagttgtag ttgaagaaga 2820 gttggtagat ttagcagtag tttattgtga gcaagattct ttacccattt ctttctcctc 2880 ttccaaagct ccgacagcat cgactttttc ccaagaaacc ccttcaatca ccctgaatcc 2940 cactctagcc aagttttttc ccttaaatct tccaaaaagt tgcactcgat acttacatgt 3000 tttttcccct caggtatctg tattaccccc ccacagatcg tttgatatcg caattgaatt 3060 aaaacccggt tgcaaacctc cctttggggg attatataat ctgtctcctg atgagcaaag 3120 tgagctcaaa acctacctta atgatcagtt aagtaaaggc tttgttcgtc cttcgaaatc 3180 tcccgctgcc gcacccatat tttttgtgaa agttcccggt aagaaaaaca gaccctgcgt 3240 tgattaccgt ggtttaaaca aaatcacaaa gcgagatagt tacctaatcc cggtgatgtc 3300 ctggttgcta aatcagttga gaggctgtaa gttttttgcc aagatcgatc tgaaggtggc 3360 tttcaaccta ttacgtgtag caaaaggtga caaatggaag acggctttcc gaactccttg 3420 gggtctgttc gaatataccg tgatgccatt tggcttggct aatgcacctg cagtcttcca 3480 acgtttcata cagtgggtac ttcgagagta tctggacgtt ttctgttttg tttatctgga 3540 tgacattctt attttttcta aaaatgctga cgaacatgaa ttacatattg agcaggtatt 3600 agcaaagttg 
tcggaacaca aactaactgc ttctcctgag aagtgccaat tttttgcaaa 3660 ggaagtgata tttttaggat ttgtgatctc gaccgaaggc ataagcatgg atccagcaaa 3720 attgaaaact atagcggact ggccatttcc acaggagctc actgatctac aacgtttttt 3780 aggattctcc aatttttata agcaaacttt tccagagtcg tgggtccctt gacaagtctg 3840 actgcaaaga cggcggatgc aaggaaaggt ttactgctga aatcatcgag agattcattt 3900 gacgagctgc gtaaaatctt ttcctcggcg ccttttctgt tacactttga tttcgatttg 3960 cctcgtgtgt tacaggtcga tgcctccggc tatgcgtatt cgggaatctt gtctcagaaa 4020 tcgaaaacgg gagaactcag accagtggcg tatttctcga agaaactaac ggaagcagaa 4080 cgtagatggc agatccatga ccaagaactg ggagccattg tagcttgttt tcatgattgg 4140 cgcgcttggc tcatgggaac aaaggtacca gtcacggttt tgtcagatca tgctaattta 4200 cgttatttca tgacctctca aatgctgaca cctagacaag caagatgggc atcattttta 4260 ggagaattta atttcgagat tttgcatact cctggaaagt cgaatcccac tgatccagca 4320 ttgagacgct cggactttgt ttgtggaaaa caggattctg ctagggttat tctgctggga 4380 ctgcgagaag tcaaggatat aggcgtaaat gcaatctaca tcagccatcc gggacatttc 4440 aatatttcct cttatatgcc tgtcgcagat gaactgatgg ttaccatcaa acattcttat 4500 cacgcagatg acctgattgc agggattcat ccttcatttt tgcactttct ggacggatta 4560 tggtggtgga gagatcggtt atatgttcca ttggctttga ggatatcaat cattaagcag 4620 attcacaaat caacgatcgg cggtcactgg ggtaccttga agactttggg tttactgaca 4680 cgttcttttg gctggccaaa tgctcgaaga gacgttctgg atttcatcaa aggctgtagc 4740 agctgtcaac aggttaaagt tgatcatcgg gcacctcaag gacagcttgt tcctttacct 4800 attcctgaca gaccatggtc aactatcggt gttgatttta ttgtgaaatt gcccatttcc 4860 gacacttttg attcagtcat ggtcgtggtg gatcattttt ctaaagtggc gcattttgtg 4920 ccggctaagg agagctggtc tgctgaagaa ctggcaaaag cgtttgttgc acaggtgttt 4980 aggttccacg gtttgcctga tgccatagtc tccgatcgag gaactacttt ggtatcttcg 5040 ttctggacga gcgttttgaa attactacag atctctccgg cgccgtcgac ggcctttcat 5100 ccacagactg atggtcaggt tgagcgggtt agagcatctc caccgcggac ctgatctgta 5160 aaccgaatac agagccaaaa aaagcttgct ggaaaaactt tgacagctgt attttatatc 5220 cagcttcgtt ccatccaacg cgcaaatcgg tataaatttt tgagtcgttt tgagctgcct 5280 aagttgccaa accctcaccc 
aatcgagcca ccaaatagcc atcaattggt tggacaacaa 5340 gaaaatagga cgcatgtggt gactttatca ccacatgtgc caaatgtacg tccaatactg 5400 ggacaataca gtttgctgca aatattcggt tggtacttca tatttcgagt actgaccgtg 5460 tgaaacaggg tttgctggat attgaggctc acacggatgg tgagctgtta cgtttccaga 5520 ccgcgctgga tggtcgattt gtgacctaga cctgcaaacg taaaatacac cttacgtgta 5580 tgtctgaaac gttacgtttg caggtctgcg ttggagatgc tcttaatgcc ttattgaagg 5640 attatctacg tcactttgta ttgaacaatc aagacaactg ggctttgtta ttaccactag 5700 ctgagttctc ctataataat tcagtttcca gttcgacaaa attgtcaccc ttctttgctt 5760 tgcaaggtta tcatcctagg ttcaattctt tgacaggttt gtcaggacgc ccaaaagcgg 5820 acggttttgt ggaacacatt caaagagtac aggagacttt ggcggagaat ctgactcaag 5880 caaaggaatc tcaagcttgt ttctataaca aagacagacg gattgaagtt gcttacaacg 5940 tgggagactt agtttggttg tcacgacgtt tcttgaagac gaaacgacag aacagcaaat 6000 tggacttcag acgtttggga ccctttccaa tagtacgaat ggtgggacgc aacacagctg 6060 agttggattt accctcttct ttacgacaat tacacccggt gtttaatgta tccttattga 6120 tgccatttgt aggacagtca aaggagccgt caatggctga gaagacatgc tgggataatt 6180 tattgattcg cgaagaacag cagatcaaga cgattttgga ttatcgtcga cattcgtcag 6240 gcatgcatga atacctcata cgaatgaatg acgcctctcc tttggacgac cagtggttac 6300 ctctgtctca attacctttc actttggaca actatctcga gcgctttcat aggatgtcac 6360 cctcactggg ccctggaccg gatgttggag tttggatagc gcgttcacgt caacgtgttg 6420 atcctgacta tctggataat acaagtaatc tacagtaagg catatttttt tcgtttgaag 6480 acgtgccaac ccctgtgtgc ctaaacaagc cactcaagac gtgggtatgt gaaggctgtc 6540 ttggaaaatg tttttttatg tctggtagat ggaaggtcat aatt 6584 // ID Copia-3_MVPL-LTR repbase; DNA; FNG; 164 BP. XX AC AEIJ01000198; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Microbotryum violaceum genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_MVPL_; KW Copia-3_MVPL-I; Copia-3_MVPL-LTR. 
XX OS Microbotryum violaceum OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Microbotryales; Microbotryaceae; OC Microbotryum. XX RN [1] RP 1-164 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Microbotryum violaceum genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AEIJ01000198; Positions 1698 1535. XX SQ Sequence 164 BP; 30 A; 50 C; 28 G; 56 T; 0 other; tgttgaagta aattactctt gccttaaatc cgcactacgc gcgtctccac tttatattag 60 tcgcgcgcgc aatttgcgtc tttgtctcct cttcttcact tgattacgaa ctcattggct 120 cgtccctgcc gccaactcgt tcagcacatc gctgttcagt ttca 164 // ID Gypsy-17_LBS-LTR repbase; DNA; FNG; 595 BP. XX AC ABFE01001078; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_LBS_; KW Gypsy-17_LBS-I; Gypsy-17_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-595 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001078; Positions 1317 723. 
XX SQ Sequence 595 BP; 137 A; 159 C; 112 G; 187 T; 0 other; tgtgaggggt cgatactaag gcattcccct ctctcgttat atttcctatt caacgatgtt 60 tgtatccttc tctagcgcaa ttcctcttct acaatcttca gagtcacgca atcacgctta 120 gtcatgttta gtcacgttta gtataactac cttgtaaaca gaaaagaata ttacttcttc 180 ttacaccata ttcttttcct atcttgttgc acgagcatcg tactcacgac tcgcctcaag 240 cctgttgaag attcgcgttc gaagagatca aggtcagttc ctatagaact aactttagga 300 gctcgtgtct cctgtgaaag ctaacgaagc tttcttgtat tttcccattt ctccgaaggg 360 acttgttcac tcccttctgc ggctttccga gaaggttgag acatccgcgc tcgtggttat 420 ccagggcata agcggactct cgcgtgatca ggcttcttct ttctctctag attcagacct 480 gcatttaggg tcatcacacc attactcaac ccttacgttc gacgttcgcc agacttactc 540 gccgttggta agaatcccac tcagcgacta aatcccctta gaacagggtt gcgca 595 // ID Copia-31_MLP-LTR repbase; DNA; FNG; 502 BP. XX AC AECX01003016; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-31_MLP_; KW Copia-31_MLP-I; Copia-31_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-502 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01003016; Positions 969 468. 
XX SQ Sequence 502 BP; 114 A; 124 C; 77 G; 187 T; 0 other; tgatcactta ggcgcattac gcgtatagga cgcgacatgt tcagatgcgc atttagatat 60 gtttctcttc tcactacgtg acaccttatt gagtccgtta ggttcatcta tatagaacaa 120 ctctcttact tcatgttgtt ctcttctctc tatcgaactc aaagctttca ttgtgttaca 180 atctccttta cagattgtcc tttctttcga tcaaccagtg tcttccctca gtaagaccct 240 atctgatctt aatccgcttc aggtgttagt tattgtttta gttgtttgtg ttctcctttg 300 attaatcgaa ctcaaagctt tcattgtgtt acaatctcct ttacagattg tcctttcttt 360 cgatcaacca gtgtcttccc tcagtaagac cctatctgat cttaatccgc ttcaggtatt 420 gtcctttctt tcgatcaacc agtgtcttcc ctcagtaaga ccctatctga tcttaatccg 480 cttcagggaa agaaaacaga ca 502 // ID Gypsy-1_GDe-I repbase; DNA; FNG; 7365 BP. XX AC AEFC01000931; XX DT 12-MAR-2011 (Rel. 16.03, Created) DT 12-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Geomyces destructans genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_GDe_; KW Gypsy-1_GDe-LTR; Gypsy-1_GDe-I. XX OS Geomyces destructans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Leotiomycetes incertae sedis; Myxotrichaceae; OC mitosporic Myxotrichaceae; Geomyces. XX RN [1] RP 1-7365 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Geomyces destructans genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AEFC01000931; Positions 4050 11414. XX CC Positions [4034-4525] - Integrase core CC 'CTATT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 47..1261 FT /product="Gypsy-1_GDe-I_1p" FT /translation="MATPLLCDGRTIRRGHDRSCSQNQDATQTLRGDDEPG FT PHSGSSQLEDEGHAQRQLSASEDLHGDRDGSPDGSELSTLTRVPSDWGDRV FT DEARREADHLECEVRAARARKRAAVESRIAELEVQEQRKHALQRELRELRK FT EWTPTRDRKRRASNGHDGYDQGRASLEIGSVAGESAASHPRGPHRAPKFRD FT LATFQGKSLREAQVFAAGADRRFRIDAGSQYPMDQNKIDYCVLAFGPTPAA FT KWERHERREGLGNTTWVEFKEWIMDSIIDPSNRTFDAITSYNEAKQGETQS FT AEDFAAHLDTLELELRIEDEELRKNTLYAKLREEVRREILHRDDIPRTQQG FT MLALATRIENTHCLTIGGLTGGDPTPRRGWETARTLGRGPCDHRGLRGLRN FT GALRALGEALL" FT CDS 5709..6647 FT /product="Gypsy-1_GDe-I_4p" FT /translation="MVTGAARSSTQQGAARTTAQQTVTGLPPLLPLPLPPT FT RSESSHATLQSVSNQRSRRSRMGGGRGKTYLHLAEDHQRLLRVAEEGGAHL FT RRRRRHRDSHANLRREGGSSSHDAGAEANADATHGSAEVRLQRQQVPDGGG FT DGRRDLPKRALANAPQKKDPKGKPKQKTKTRTGPETHRDALVLKMSAARAL FT RRGGGEDTRELWRLRQILRAAQAAHTKQRAVGVPALVRGRVAVRGRGRSPT FT GHGALLRGLGALPLAGGAARGNIFLLGGLLDRRRRGNDGRDRSGRRSGGGG FT GSYGGGRTRDGRETARFCQTC" FT CDS join(2486..3742,3746..5209) FT /product="Gypsy-1_GDe-I_2p" FT /translation="MPFGLSNAPATFQAYINQALIGLVDVTCVVYLDDILI FT FSENPEHHEGAVQEVLNRLRTHRLYANLKKCVFTMDSVEFLGFMVGPQGIT FT MDPARVATVNEWPPPRSVREVQVFLGFTNFYRRFIEGYSRVAGPITNLLKT FT TGVEKGRKFHMTLDAMTAFDVLKAWFTTAPMLRHYEPELPTQVETDASGYA FT ASGILSQLFGAGPEAKWHPVAFFSRKFSPVELRYDTHDKELLAIVLAMDHW FT GQYLRGVTRSVRVRTDHNNLRYFMTKRSLNGRQARWAESLARYDFYIEYRP FT GKANPADGPSRRPDYQPLDDPTLDQHLPGLHFRLCSGPETAADSPPPLIAV FT GRQRGTNQLSAKLAVSLEQPSLTAEAAPLRRQDTNAIVAGERPYSPSGATV FT REALLELQSRDDFVTQRAAAPHSMWPREAALWTTGEDGLLRRKGAIYVPDS FT PPMRAELLATHHDDPHAGHFGFARTQELLQRKYYWVGMRRDVKSHLRTCDV FT CQRTKTKRHAPYGELAPFQPPTTPWQEITMDFIVGLPPSRFRGRVYDSILV FT VVDRLTKMARYIPVNATIDAPELAEVFANTIFKDYGTPTGITSDWGPQFTS FT AFWGSFMFYLQIRRRLSTAFRPQTNGQTERQNQMLEQYLRCYCDYSQGNWA FT SKLALAEFSYNNSVHESTGHTPFYLLYGYHPAIEVTAAEPADGAGMAASRV FT EELQREREELEATLRRATEAHKKHYDRKHTPMRFRVGDQVMLAAKNIRQLR FT PNRKLSDKYLGPFRVTEAIGSHGQSYRLTLPKGYRIHDVFHVSLLEPYHAR FT TGTIPPEATQTQDQDEWEVQSIQAHKETPKGRRYLVRWKGYSPAEDTWEPP FT DNLANAPDVLQAYHEAAPQPIRTRGRRKRQRQTTDPQNPNTRDGGGEETHA FT 
RDP" XX SQ Sequence 7365 BP; 1733 A; 2303 C; 2315 G; 1014 T; 0 other; ttctaatctc tcttaataac gggaggcagc ccgctacgac ccggttatgg caacgcccct 60 actttgcgat gggcgcacca ttcgaagggg ccacgaccgt tcgtgctccc aaaaccagga 120 cgctacgcaa accctccgag gtgacgacga accggggccg cactcagggt cgagccagtt 180 agaagacgag gggcacgccc aacgccaact tagcgccagt gaggacctcc atggcgaccg 240 ggatggaagc cctgacggat cggaactctc caccctaact agggtcccgt ccgactgggg 300 tgaccgggtg gacgaggccc gccgcgaggc cgaccatcta gaatgcgagg tgcgtgccgc 360 ccgggcccgg aagcgagccg ctgtggaatc gcggatcgca gaactggagg tccaggagca 420 acgaaagcac gcactccaac gggagctgcg cgaactacgg aaggagtgga cgccgacgcg 480 ggaccggaag cggcgcgcaa gcaacggcca cgacggctac gaccaaggga gagcttccct 540 cgagataggg agcgttgctg gcgagtcggc ggcgagccac ccccgaggcc cccacagagc 600 gccgaagttc cgcgacctag caaccttcca ggggaagagc ctgcgggagg cgcaggtctt 660 cgccgctggg gcagaccgcc ggttccgtat cgatgcgggg tcacagtacc ccatggacca 720 gaacaagatc gactactgcg tcctcgcctt cggaccgacg ccggcagcaa agtgggagcg 780 ccacgagcgc cgcgaggggc tagggaacac cacctgggtg gaattcaagg agtggataat 840 ggattccatc atcgacccct cgaaccgcac gttcgacgcc ataacgtcgt acaatgaggc 900 gaagcaaggc gagacccaat cggcggaaga tttcgcggcc cacctcgaca ctctcgagct 960 cgagctccgg atcgaagacg aggagctccg gaaaaacacc ctctatgcaa aactgcggga 1020 ggaggtgcgg cgcgagatcc tgcaccgcga cgacataccc cgtacgcagc aaggcatgct 1080 agctctagcg acccgcatag agaacacgca ctgcctcacg ataggaggcc taaccggagg 1140 cgatccgaca ccccggcgag ggtgggagac cgcgaggaca ttagggcggg gcccctgcga 1200 ccatcgcggc ctacggggac tgaggaacgg ggccctgagg gccctaggag aggccctgct 1260 atgagcccaa atcgcacccc agcgggcgcg aagggagatc gccgcggcaa cgggtgccac 1320 ggctgcggct cggtggagca ccggctggcc cattgccctg aggctatatg ctacaattgc 1380 agcaagaagg ggcacatttc cccggattgc ccgagcctga caggaaaagg cgagtcccgg 1440 cagtgatatc cactgcccta aacccccggg accaaggcgg ggtgccacca acgcggagac 1500 ggctagtgat cgcggtccaa gtcactgccg cggatgggtc cacccgaacg gagtgagccc 1560 tgatcgacag cggggcagag gagaactgcg tccgccagtc tctagtcgta gagtgcggat 1620 gggagtctag caaaggggcc 
gacacggggt tgtcgaccct tgaggggagg gaggtatgga 1680 cctatggggt ccaccaccta cccatcggcg ccacggatgg ggcggggagg tcgttaacga 1740 ccggccaccg gttcgtagcc tgcgccttcg acgggttaga tgtgaacctc atcctagggt 1800 acccttggat agctgacgtt gaccctacaa tcagttacaa ggacggggcg tgggagttcc 1860 cgacacggca gggggccgtc cgggaggtag gacctgaaca gttctacgag gatgtccagg 1920 aggcgggggg agcctacgga gtcctcaccc aatgggcagt aagggggcag cgtatcggtg 1980 tcgtcaccgc cgatacggcc ctggcggtcc tccctaacca ataccaggac taggctgacg 2040 tcttcgacgc agcgaaggct ggggtattgc ccgagcacca cccaatggag cataaaatcg 2100 aggtggaagg gggcaaggaa cccccctgga gcccggtata cccccttggc gaaccggagc 2160 tcgaggcgct acgagagtat ctcgattcgg gcctcaagaa ggggtggatc cgaaggtcca 2220 ttagccagcg ggggccccaa tcctgtttgt cccaaagaaa gacgggagcc tacgactctg 2280 tgtggattat agggggctga acgcagttac ggttcggaat cggaccccgc taccgctcat 2340 cagcgagacc ctcgaccggc tacgccgatc aaaggtcttc accaaactag acctcaagga 2400 cgcacatcac cgaatacgta ttcgcggggg ggacgagtgg aagacggcgt ttcggacccg 2460 gtacggccac ttcgaatacc tcgtcatgcc gtttggacta tcgaatgcac cagcgacctt 2520 ccaggcgtac ataaaccagg ccctcatagg cctagtagac gtgacctgcg tcgtctacct 2580 agacgacatc ctgatcttct ccgaaaaccc agagcaccat gagggggcgg tacaggaggt 2640 cctgaaccgg ctacggaccc accggctata tgccaacctc aagaagtgtg tgttcaccat 2700 ggacagcgtg gaattcctcg gcttcatggt agggccccag ggcatcacca tggaccccgc 2760 ccgggtcgcc accgttaacg agtggccgcc gccgaggtcg gttagggagg tccaggtgtt 2820 cctagggttc accaacttct accgccgatt catcgagggc tactctaggg tagctggccc 2880 aatcactaac ctcctaaaga ctactggggt cgaaaagggg cgcaagttcc acatgacgct 2940 agacgccatg acagcgttcg acgtgcttaa ggcctggttc accaccgccc cgatgctgcg 3000 acactacgag cccgaactgc cgacgcaggt agaaacggac gcctcggggt acgccgcatc 3060 gggtatcctg tcgcagctat tcggggccgg accggaggcg aagtggcacc cagtcgcttt 3120 cttctccagg aagttctccc ctgtggagct caggtacgac actcacgaca aggaactcct 3180 ggccatcgtc ctggctatgg accactgggg ccagtacctc cggggcgtca cgaggtcggt 3240 acgggtacgg actgaccaca acaacctccg atacttcatg accaagagga gcctaaacgg 3300 acggcaagca cgctgggcgg agtcgctcgc 
ccgctacgat ttctacatcg aataccgccc 3360 cggcaaggcc aaccctgcgg acggaccgtc gcggagaccg gactaccagc ccctcgatga 3420 ccctaccctc gaccaacact tacctggcct acacttccgt ctatgtagcg gccccgagac 3480 ggctgcggac tcgccgccac ctctaatcgc ggtaggccgc caacgaggaa cgaaccaact 3540 aagcgcgaag ctagcagtca gcttggaaca acctagtctg accgcggagg cggcgccgtt 3600 acggcgccag gacacgaacg ccatcgtcgc gggagaaaga ccctactcgc cctcaggggc 3660 gacggtaaga gaggctctcc tagaactgca gtcccgggat gacttcgtca cccaacgggc 3720 ggcagccccg cactcgatgt ggtgaccccg agaggccgcc ttgtggacga caggggagga 3780 cggcttactc cgaaggaagg gcgccatata cgtccccgat tcgccaccga tgcgggcgga 3840 actactagcc acgcaccacg acgaccctca cgctggccac ttcggtttcg cccgcaccca 3900 ggagctgcta caacggaaat actactgggt ggggatgagg agggacgtga agagtcacct 3960 ccggacgtgc gacgtctgcc aacgcacgaa gactaaaagg cacgcgccat acggcgagct 4020 ggcgcctttc caacccccga ccaccccctg gcaggagatc acgatggact ttatagtggg 4080 gttaccgccg agccggttcc gaggccgagt ctatgactca atcctcgtcg tcgtggatcg 4140 cctcacgaag atggcacgtt atatacccgt caacgctacc attgacgccc cagaacttgc 4200 ggaagtcttc gcaaacacta tcttcaagga ctacgggacc ccgacgggaa tcacctccga 4260 ctggggcccc cagttcacga gtgcgttttg gggaagcttt atgttctacc tccaaatacg 4320 gaggcggctg agcaccgctt tccggccaca aaccaacggc cagacggagc gacagaacca 4380 gatgctagag cagtaccttc gctgctactg cgattactca cagggtaact gggctagcaa 4440 actagcgctc gccgagttct cctacaacaa ctcggtgcac gagtcaacgg gacacacccc 4500 gttctaccta ctatacggct accaccccgc catcgaggtc accgcggcag agccggccga 4560 cggtgccggc atggcggcgt cccgggtgga agaactccaa cgggaacgag aggaactaga 4620 ggccaccctt cgtcgagcca cggaggccca caaaaagcac tacgaccgga aacacacgcc 4680 gatgcggttc cgggtcggtg accaggtgat gctcgcagcg aaaaacatcc gccaactccg 4740 ccctaatcgg aagctctccg acaagtatct aggcccgttt agggtcacag aagccatcgg 4800 atcgcacggg cagtcgtatc gcctcacgct cccaaagggg taccgaatcc acgacgtgtt 4860 ccacgtgtcc ctgctagaac cgtaccatgc ccgaactggc accataccgc ctgaggcaac 4920 tcaaacccag gaccaagatg aatgggaagt acagtccatt caggcacaca aggagacccc 4980 taaggggcgc cgatacctcg tccgatggaa ggggtattcc 
cccgcggagg acacgtggga 5040 accccccgac aacctagcta acgcgccaga cgtcctacaa gcgtaccacg aagccgcccc 5100 gcaaccgatc cggacaaggg gccggcgcaa acggcaacgt cagaccaccg acccccagaa 5160 cccaaacaca cgggacgggg gcggggaaga gacgcacgcc agagacccct agaaccccaa 5220 cgcacggagg gcggccgggg ggtggcagcg accctcgcga ctcaaggggg ggaaccgcca 5280 ggacgcacca aaccaccccg agtaacggga ggacggggcc gcccggccaa agggcagcga 5340 ccaaaccccc ccgagttatg ggaaggggag ggcgacgaga ccacctggtt cgaatcggct 5400 acgtaccggt accagtccga ccccggccaa cggggggagg aacacgacgc gaccaaagtc 5460 accaaccacc agcagccaga acgaggcgca ggcacaggca caggagcgca acgggacgaa 5520 gacagatcac gggggcaaga acactcgtat tataagcccg aaggccaaac taaactacgc 5580 cgaaggcggc agagacaaca aacacggggg cccgtaggca agccacgcac aaaacacccc 5640 aggacaccca caacgcccta gaccacgcca ccgcccagaa aacaaacgcc ctaggagcag 5700 gcgcagcgat ggtgacaggg gcagcccgat cgtccaccca gcaaggggca gcacgaacga 5760 ccgcccagca gaccgtcacc ggcctacccc cgcttcttcc ccttcccctt cctcccaccc 5820 gcagcgagag cagccatgct acgcttcaaa gtgttagcaa ccagaggagc cgaaggtcaa 5880 ggatgggagg aggaaggggg aaaacgtacc tgcatctggc ggaggaccac cagcgtctcc 5940 tccgagttgc ggaggagggc ggcgctcatc tccgccgcag aaggcgccac agagacagcc 6000 acgcgaacct ccgcagggag ggtggcagca gcagccacga tgcgggcgcg gaggccaacg 6060 cggacgccac ccacggcagc gccgaagtcc gactgcagcg ccagcaggtc ccggacggag 6120 gcggcgacgg caggagggat ctgcccaaac gagcgttagc taacgctccg cagaaaaagg 6180 acccaaaagg gaaaccaaaa cagaagacca agacaagaac cggaccggag actcaccggg 6240 acgcacttgt ccttaagatg agtgcagcgc gtgcacttcg tcgaggaggc ggggaagaca 6300 cacgagaact ctggcgactt cgccaaatac ttcgcgcagc gcaggcagca cacacgaagc 6360 agcgcgcggt gggggtcccc gccctcgtcc gaggacgagt cgccgtcaga ggacgaggac 6420 gaagtcccac tggccacgga gccctcctca ggggactcgg cgcgcttcct cttgcgggag 6480 gagcggcgcg gggcaacatc ttcctcctcg gaggtctcct cgaccggaga cgtcggggaa 6540 acgatgggcg agacaggagc ggcaggcgta gtggcggagg aggcgggagt tacggcgggg 6600 gtcggaccag ggacggacgg gaaaccgcca gattctgcca aacatgttag ccacggccca 6660 agaacccatc acctaccaca aagaagccta ggaaacacga cgagaaacca 
aaagacggaa 6720 ggaaaggact caccagagcg cacgggggcg cagaaggaga aacggggggc cgggacaggg 6780 cggggaggcg acccgaacgc cacccagcca gatggagctc cagaacgtta ggaaacgccc 6840 aaacgaggaa acgaaaaagg ggcactcact catggtggcc gtgtggccgg cgacggaacg 6900 gttctcttgc gccttaaggg cgccgacggt ttcagtgctc tcttgagagc cggacatggt 6960 ggtggtggcg ggggcagcgg tggagttaga tgaattggaa ggttgaggga ggagggggag 7020 gaaggtgttt tgaggccgct tactcctttg tgtggagggt aggtcacgtg cgcggggaca 7080 acacgcgcct accgcgcgcc ttggaacaaa ggtaggtagg ccggttctgg cctagtgcca 7140 tagcgtcaaa ggcgcgccca attccggcct taaagcaggc cttggaataa aggaaacgga 7200 ggaacggagt agctaggtca cgtgacgcta agttcggctt cacgtctatt aacgagcctc 7260 ccccgaaccg acggcaggcc gctaatgacc aacgctgttc caccaagggc tgccactgaa 7320 gcaatggcat gtccgtaggg ccccggactc tagagggggg agtgt 7365 // ID Copia-3_CCO-I repbase; DNA; FNG; 5347 BP. XX AC AACS02000011; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_CCO_; KW Copia-3_CCO-LTR; Copia-3_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-5347 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000011; Positions 1328726 1323380. XX CC Positions [2443-2940] - Integrase core CC 'TCGGT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 520..2316 FT /product="Copia-3_CCO-I_1p" FT /translation="MTPAIRLTIQSNQLARTIWTKLEESYGTPGASVVFAD FT FRKAIAFKLSENENPVPQMDDLRSVFDRLHANGVRMPDFFQAMILINAIPQ FT KWDSIAVLLMQNKKVKELKLADVREAIMAEYERRTQGSVHANKMSAIKRKK FT GNPNFKQQKQQQSSSNSPAQGQQGQPSNDKKKSKRGKSSGKKKQHHHHHHT FT HMVDADSDSDSDDSSQIASAVVLGSHIISAHHHTPTPTTTSVASFTSSGIK FT VNKVPVNLPAQVYTPATAEKTNDPRTHPVYPDLAQAQEICDRLHVPRTAKN FT LRPLMDFPVKKSLEDRLESPPPAKRARSESPLFTDSDDDDAVSLGMDTDDE FT PDFKADTEIEDTAMELDTGHIDEQRCVSSRRLNEQLADLTQRSILIDAAVL FT RDDLYMDLCIALNYLARQYMSISSFFASCSECHSPFHGYDVWMMDSGCSMH FT VTSDETDFASYTPLRNSPEIRTADKKATLRWAGVGSVFIKHMVKDSHGKYQ FT EVRTRLYPVYHVPNLHGRLLSMGEFLQQRGHYVHGDNKSIGIYKPDQSQAI FT VCRPAIPGHTIFWVKTRITSRKESSGKTALDGLFRRLPNHASTLWAPLKGS FT A" XX SQ Sequence 5347 BP; 1357 A; 1431 C; 1239 G; 1320 T; 0 other; ggttatgagc cccgttaggc tttagctttg cgcttgtttg gctgcattgc taaatctggt 60 ttgcgctagt ctctggctgc atccagaatc tagtttgcgc ttttttggct gcatctagat 120 gctctaaggc ttactacttg tggtcgtcat gatctgcgct atttttggct gcgatcaggg 180 gactaaggag cgctattttt ggctgcacac ttgtccttct attctctcac gagaacgcca 240 tgctgatgat cggtctcttt gacagatctc aacaaccatg tcctccgccc tcactcaact 300 tgttccgatt cttgatggca ccaactatcg agaatggaag ccgctcgcca tcgcttatct 360 ccaatttgct ggcgtgtggg atgttgttga tgggaccaat ccggagcctt ctgatcttcc 420 ccctgatgcc aaggaggaag acaaagaagc tcgccggaag gagtcaaagg gcgtggctcg 480 agaagaatat gactgctaaa ggtgcccctc accttacgca tgacacctgc tattcgcttg 540 accatccagt caaaccaatt ggctcgaacc atttggacga agttggagga aagttacggt 600 actccagggg catccgtggt ctttgccgat ttccggaagg ctatcgcttt caagctttcg 660 gaaaatgaaa atccggtacc acaaatggat gatctccgat cagtctttga tcgacttcac 720 gccaatggtg tcagaatgcc tgacttcttc caagccatga tcctgatcaa cgcaattccc 780 cagaaatggg atagcattgc ggttctgttg atgcagaaca aaaaggtcaa ggaactcaaa 840 ttggctgatg tgcgtgaagc catcatggcg gagtatgaac gtcgaaccca gggttccgtt 900 cacgccaaca agatgtcagc gatcaagcgg aagaagggta accccaactt caagcagcag 960 aagcagcagc agtcgtcttc taactcgcct gcccaaggac agcaaggaca gccttctaac 1020 
gacaagaaga agtccaagcg tgggaagtcg tccggaaaga agaagcaaca ccatcaccat 1080 catcacactc acatggttga tgctgattct gattctgatt ccgatgactc ctcccaaatt 1140 gcttccgctg tcgtccttgg ctcccacatc atctcggctc accatcacac tcctactccc 1200 actactacct ccgtcgcctc gtttacctcg tccggcatca aagttaataa agtccccgtc 1260 aatcttccgg ctcaagttta cacccccgct accgccgaga agaccaacga tccccggact 1320 catccagttt atccggactt ggcacaggct caggagatct gcgatcggct tcacgttcct 1380 cgcactgcca aaaatctccg acccctcatg gatttccccg ttaagaagtc actcgaagat 1440 cgtctcgaat cacctcctcc tgccaaacgt gctcgttccg agtctcctct ctttacagac 1500 tccgacgatg acgatgcagt ctcacttggc atggataccg atgacgaacc cgacttcaag 1560 gcggacacgg agattgagga cactgccatg gaactcgaca ccggacacat cgacgaacaa 1620 aggtgtgtgt catcccgtcg gctgaacgaa caacttgctg atctcacaca gcgctctatt 1680 ctaatcgatg ctgctgtatt acgtgatgat ctgtatatgg acttatgtat tgctttgaat 1740 tacctcgctc ggcaatatat gtctatttcc tcattcttcg cgagttgttc tgaatgccac 1800 tctccattcc atgggtatga cgtctggatg atggactccg gttgttctat gcacgtcact 1860 agtgatgaga ccgatttcgc atcctatact ccattaagaa attcgccaga aattcgcaca 1920 gctgataaaa aggctactct tcgttgggct ggagttggat ccgtcttcat caaacacatg 1980 gtgaaagatt ctcatgggaa atatcaagaa gtcaggactc gtctatatcc tgtctatcat 2040 gtccccaacc tacatggacg actactctct atgggagagt ttcttcaaca acgtggacac 2100 tacgttcacg gtgataacaa atcgatcggc atctataagc ccgatcaatc tcaagctatc 2160 gtttgccgtc cggctatacc cggtcacacc attttctggg ttaagactcg gatcacttcc 2220 cgtaaagagt ctagcggcaa aacagctctc gatggtttat tccgtcgact accaaaccat 2280 gcatcgacgc tttgggcacc cctcaaagga agtgcttgag aaagcgaagg gacataccaa 2340 gggtttccca cagaatgtct ctgtacccga aaagatgtac tccctgcaag ggatgtcatg 2400 agggaaagat gccatcaaaa tcgttcccca ccatcaactt ctcgtgcgaa gaagcctttt 2460 gagaagattc actctgatct taagagcttc ccggtagaat cctatcaccg gtacaagtac 2520 ttcatcgctt tcttcgatga ctatacctcc cacggttgga tcgtccttct ccgaaagaag 2580 gatgatgcca tcaaggcctt acgcgacttc gttgctatgg tcaagaccca atttaacgcc 2640 tccatcaaag agtggatgtc ggatggaggg ggagaattta agtctgaaga attcgataac 2700 gcactgaagg 
aacttggcat caaaattcta cagagcgtcc cgcgtcagcc ccaacaaaac 2760 ggcagggctg agcgttttat caggacaatt atggataagg cccaggcact tcgctttgac 2820 gcctgtcttc ctgagtcctg gtgggaattc tcggtgtcac atgccgtgca tctctacaat 2880 cgcacgcctg tcagacgtct taagtggaaa actccgtatg agctatttac acaagaaggt 2940 ccctgacatc tcgcacttac gtgtctttgg ttgtggcgct tacgtctacc ttcccgaaga 3000 agtccgaaag aacaaactat cacccaaatc tgaacttatg gtgttcttag gatatcctga 3060 gggaatgaag ggttatctct tcatgcgctt acacaacaac agtctcttcc gaggcgctac 3120 ggctgtcttt gatgagactt acttcccgaa gtgtcctggt gcacaacctc aacgaggtgg 3180 cgttcgtgcc ggcgacgaac cgcctccacc atcggatcgc ggagatgaca acatttctcc 3240 tgaaggaggt gacgatgatg actttgtcca tcgtcccaac tcccaaaatg atacccttcc 3300 ccaaagggat caagggggag atcgtgacga cgatcaacct caagatccgc cgccacctgc 3360 cctgggagat caaccagagg aaccaccggc tccggaacca caaccggacc aacctcgaag 3420 atctggccgt gagcggagaa ccgttactcg tccaggctcc atctatgggg ataaaacccc 3480 ttcacaaatt gatgctcaat cgcagcgtga ttggagaaag actgttggag ataccgagcg 3540 aaggcctgcc ggttacccac aagtctagaa atgcaagatg aggttcccca gaccaaatcc 3600 tgctgataat ccaaatcgcg atagtggcac tgctgatgag cgtggagacc tagagaaacg 3660 attggcccga ttggtcaggg aagggggagt agagtttatc acatacttgc tttctaaggc 3720 agtttcgccc gataaacccc ttcagaagtc gccacgtgaa tacacattcc gtgacatcca 3780 aaaacttccg cctaaacagc gtgagaaatg gttagctgct tgcaagctgg agattgaggc 3840 actccagaag cgcggcgtct atatcctcgt cactctgcct gctggtcgta aagcgatcaa 3900 atgccgatgg gtgttcgacg tcaaaccaga cggtcgactt agagccagac ttgtcgccaa 3960 gggtttctcc caagttgaag gtgttgactt caacgaaatc ttctctcccg tggttcgtta 4020 tgaatcagtt cgcacgatgc ttgccgtcgc tgcactcgaa ggctggtatg tcaccgccgt 4080 cgatgtcaag aatgcctttc tctatggcaa acttgaagag gagatctata tggaacaacc 4140 cgaacggctt caaggtacct ggtcaggagc ataaagtgtt ccggctgctc cgagctatct 4200 acggtctcaa acaagcttcc aactcctggt acaaggaact tgtagagtct gccaaacttc 4260 ttggctttag acgactttcg accgataccg gaatctttat tttccaggag aaagatggca 4320 acttcgtcat catgatcgca tacgtcgatg acattctgtt catgggaccc tccaagcgac 4380 ttgtcaatct aaagaaagac 
gagttcaagc aaaaatggga atgccgtgat cttggcgaac 4440 caaccgaatt cctccgaatg cggatcaaac gtcaaggaaa aaccatacgc cttgatcaga 4500 aagactacct caaaacggtc ttagaacgct tcggaatgat gggactgccg agtggctcga 4560 actcccatgg ttgagggata caaacctctt ccgaatactg gtcccgtgga tcaccaactt 4620 cgacaacgtt tccagtctgt gattggatcg ctgctatacc ttatgttagg cactcgtccc 4680 gacattgcgt atgctgtcac caagctatcg cagtttgcag caaatccgtc caaagagcac 4740 ctcgacaagg cgctctacat ctgccgatac ctccgaggga ctatggacta cgcactggtc 4800 ttagatggtt cttcctcaca aggactcatc gcctactcgg attcggactt tgccgctgat 4860 cccatcaaac gcaggtccgt gactggattc atcgtcaagt tggctaatgc cgttatctgt 4920 tggacttccc atgcccagaa gaccatcgct acttcgtcta cagaagccga atatatggcg 4980 ctttctgatt gtagccgtca ggtcatgtgg ctaaagcata tgttcaacga attaggcatg 5040 cccatcgaca aagtgccgat ctgcacggat aacaatggag ctatcttcat tggctcaaac 5100 cctatccagg aacgtcgaat caaacacatt gatgttcgct atcactacat ccgcgaacgt 5160 gttgaagatg gagacgtcga gattcttcgc gtcgacacta atgacaatcc tgctgacatg 5220 tttaccaaac ctctcggtca tgtcaagttt gagcatttcc gtgaacaact cggacttgag 5280 tttggaaggg cttaagttct tgagtacgct cagcgcgcta ttcatagctg cacttagcga 5340 ggggggg 5347 // ID Copia-11_MLP-I repbase; DNA; FNG; 4796 BP. XX AC AECX01001249; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-11_MLP_; KW Copia-11_MLP-LTR; Copia-11_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4796 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001249; Positions 54427 59222. XX CC 'CTTTT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(306..1547,1551..4670) FT /product="Copia-11_MLP-I_1p" FT /translation="MASLGSLDDRSNKLSGITALQPPGENSNYLDWEFALE FT LYFVDAKLTYVLKETDVKARPSTWEDDNARVCTLISRAVEDVNYQYLKPHQ FT NNAAAMWKALRLAHEDSTSGGRMIVLNSLVTSKMGEMDLDTHLQSMNRLFE FT KLASLITEDQPLTADDIYTTSLINSLPDDWAHVVTPLMQQKTTDSVSVIRA FT LKNKQNRRNASGHVSVSKAFKSHSSSTPSYKTASLDQCQTNRKLKNAKFCS FT FCKRTNHDVNHCWVLDKILEERKSMTTPSGSKVSSHSRPQPQSSKAGKTSV FT VSLGFSESESENESPAQAKFALVDVARISRLKEWNIDSGCSSTMTPNSSSL FT SGLTPATRSIHLADNSVIYATHSGTAQLGISGSIGHKSLFVPDLQEPLLSV FT SSLCDDGFSVVFTKDGCRIHSTDNLSDSLPVGNGVRRGNLYYLPEKVDTSP FT SISCFSSKADISLFDWHCRLGHIGLCPLKAVLKTHDVKPSILNEIEVQQCD FT ICVKSKMSRMSFHSRSPYRSIHPGDLIHSDVASFETPSREGFRYFVTFIDD FT YSKYTVIYPLKLKSEVFMCFKNFCAVFCNLTSFNVKSLRTDNGGEYLGKEM FT QAYIIDKGITHNPGPPHSPELNGVSERSNRTIGNHLRCSLVTSGMPKTFWA FT DALRYLTHTLNSIPCYTPAGFHSPNSILSLPDPSISTLFPFGCQAYYKVPE FT ANRKKLDQKAQAAVLLSYLSDGNGYRLWDLRHRKVVKSRDVLFCHDIFPYS FT STISCSPPSIEADIPWPSDYVSSPSPSLTEGQRRALRERRLSNSIHAPRCR FT PIRTPAPTPVVTPAVSPISSPKAITISSDSPGSSPRINSPSSSSEDAPQER FT MPLPAEPSIPTVPKIIPETPSSQAEPPTLSPSPLPMEESSSQPSVPDNSTK FT PVKASIKKSVKQPGPPTRRSGRSTKPPDRFGNWAGLVTDETASIDTPRTWK FT QLLCSPNKSKWLKAADSEFSTLIGMGTWKLVPRPAKRKIIKSKWVFKVKKN FT IDGSLDKLKARLVAMGYSQVHGIDYSEVFSPTTRMETLQLVFSLLACKKLV FT GRQIDIKSAFLNSLLTESIYMTQPQGYEDPEHPDWVCELQRSLYGLKQSPR FT LWNAELHRILITLNLTQSKYDPTLYFRLVDGKLVGALTAHVDDLAIVGEDS FT FVQEIINAISKKFEILSNKELTHFLSLDISRDVPARKVFISQSHYIRELMN FT RFLPSDSISVKTPTDSAFKDLTPRAVGEEPSPGHYLSLVGALLWVAQCTRP FT DVSFAVNRLSQYLRDPSASHWAAGLRVLRYLSWTVDLRLCLGGSATFSGYS FT DSDWAEDRHDRRSTTGYTFRFGHGPISWKSRKQTTEAEYKAMSDSCREAMW FT FKHLRFELNIRTPSAIPLHVDNEGAEALAKNPQHHSRTKHIHARFHFVREC FT VKNEDVTILHVSSKDMLADMLTKPLPRVLLERHRAMFGLV" XX SQ Sequence 4796 BP; 1232 A; 1240 C; 925 G; 1399 T; 0 other; ggttatgaga cccatctcta ccaactgcta gatctcgatt caagtttttt ttaaaatctt 60 taatctgtaa attgcgccac actataaggg tccctgggta cacagtacac cagtgcattt 120 ttttagctct gtttctcttc tccacaacct aaatcatctt ctctcgaata ctccaataca 180 aatctcttac acaccacaac ctcctcatat ccacgctgga 
ttttcctatt ttatacacaa 240 ccacacgtta tccaccgtct tcattcgatc tctatccctc tttatctcgt tatactactt 300 caatcatggc ctcgttaggc tctttggatg atcgttctaa caaactatcc ggtatcactg 360 ctctgcaacc acctggcgag aactcgaatt atctcgactg ggagttcgct cttgagctct 420 actttgtgga tgctaagttg acttatgttc tcaaagagac ggatgtgaaa gctaggccgt 480 cgacatggga ggatgacaat gctcgtgtct gtaccctcat cagtcgcgcg gtagaagatg 540 tgaattatca atatcttaag ccgcatcaaa acaacgccgc cgccatgtgg aaggccttgc 600 gccttgcaca tgaagactca acctcgggtg gtagaatgat tgtactcaac tctctcgtca 660 cctccaaaat gggagaaatg gacttagata cacatctgca atcgatgaat cggttgtttg 720 aaaaactcgc atctcttatc acggaagatc agcctcttac tgccgatgat atttacacga 780 cctctcttat caactcgttg cccgatgatt gggcgcacgt ggtaactccg ctaatgcagc 840 agaagactac cgattccgtt tccgtcattc gtgcactcaa aaacaaacaa aatcgacgaa 900 atgcatcagg acacgtatct gtttcaaagg ctttcaaatc tcactcatcc tcgactccat 960 cctacaaaac cgcatcatta gatcagtgtc agaccaaccg caaacttaag aacgctaaat 1020 tctgctcctt ttgtaaacga acgaatcatg atgttaatca ctgctgggtg ttggacaaga 1080 ttctggagga gcgtaaatct atgacgactc cgtcaggctc taaagtctct tctcactctc 1140 gacctcagcc tcaatcgtca aaagccggta aaacctctgt tgtctcgcta ggttttagtg 1200 aatctgaatc agaaaacgaa agccctgcac aagctaagtt tgcattggtt gatgttgcga 1260 ggatatcccg tctcaaagag tggaatatcg attctgggtg ttcatcaacc atgacaccaa 1320 actcttcatc tctctcggga cttactcctg ctacgaggtc tatccacttg gcagacaact 1380 cggtaattta tgctacccac tctggcacgg ctcaactcgg catatcaggt tcaatcggtc 1440 acaaatcact ctttgtcccc gatctgcaag aaccattact ttctgtttcg tctctctgcg 1500 atgatggttt ttcggtggta tttaccaaag acggatgccg cattcattga tcgaccgata 1560 atctctcgga ctccttacca gtaggaaacg gtgttagaag aggaaaccta tactatcttc 1620 ctgaaaaagt tgacacctct ccctctatct catgcttctc aagtaaagct gacattagct 1680 tatttgattg gcactgtcga ctaggccata taggtctttg tccattgaaa gccgtcctga 1740 agactcatga tgtaaaaccg tcaattttga atgaaatcga ggttcaacag tgcgatattt 1800 gcgttaaaag taaaatgtcg cgaatgagct ttcactctag atctccttat cgctcaattc 1860 atcccgggga tctaattcac tctgatgtag ctagttttga aacaccatct cgtgaggggt 
1920 tcaggtactt tgttactttt atcgatgact attccaagta cactgtcatc tatcctctca 1980 aactcaagtc ggaagtcttt atgtgtttca agaatttctg tgctgttttc tgtaatctca 2040 cttcattcaa tgttaagtcc ttacgtaccg ataacggtgg cgagtatctt ggtaaagaaa 2100 tgcaagccta catcatcgac aaaggcatta cacacaatcc tggccctcca cattctccgg 2160 aactcaacgg tgtctctgag cgttcaaaca gaactatagg gaatcatctg cgatgttcac 2220 ttgttacgtc aggaatgcca aagacttttt gggctgatgc tcttcgttac ctcacacaca 2280 ctcttaactc cataccgtgt tatacgccgg ctggattcca ctctccaaat tcaatcctat 2340 ctctacctga cccatctatc tccaccctgt ttccttttgg gtgtcaagct tattacaaag 2400 ttcctgaagc aaacaggaag aaactagacc aaaaagctca agctgctgta ctattgtcat 2460 atctgtcaga tgggaacggg tacagattgt gggatctcag acatcgaaaa gttgtcaaat 2520 cgagagatgt cctgttttgt catgatatct ttccttactc ctcaactatt tcttgctctc 2580 ccccgtccat tgaagctgat attccctggc cgagtgatta cgtgtcttct ccctctccat 2640 ctctaacgga aggacagcgt cgcgcgcttc gcgaaagacg tctttccaac tctatacacg 2700 ctcctcgttg cagacctatt cgaacacctg ctcctacgcc tgttgtaaca cctgctgttt 2760 caccaatatc atctcctaaa gcaataacta tctcgtcaga ttcacctggt agctcacccc 2820 ggataaacag tccatcatcc tcctcagaag atgcaccaca agaacgcatg ccacttcctg 2880 ctgagccctc aataccaact gtgcccaaaa taattccgga aacaccctcg tctcaagcag 2940 aacctcctac tttgtcgccc tcgcctcttc caatggaaga atcttccagt cagccatctg 3000 tcccagacaa ctcaaccaag cctgttaagg catctatcaa gaagtcagtc aaacaacctg 3060 gtcctcctac tcgtcgttct ggtcgttcaa cgaaacctcc cgatcgattt ggcaactggg 3120 caggtttagt aacagatgaa actgcatcta ttgatactcc tcgaacgtgg aaacagttac 3180 tttgctctcc caacaagtcg aagtggttaa aagctgctga cagtgaattc tccactctca 3240 tcggaatggg tacgtggaaa cttgtaccac gtcccgccaa gcggaaaatc attaagtcta 3300 agtgggtgtt taaagtcaag aaaaacattg atggatctct tgacaagctc aaagctcgat 3360 tagtcgctat gggctactct caggttcacg gaatagatta ctctgaagtg ttctccccga 3420 ctactcggat ggagactctt caactcgttt tctctcttct cgcatgcaag aagttggtag 3480 gcaggcagat tgatataaag tcggcatttc tcaattctct tctcacagaa tcaatataca 3540 tgacccaacc gcaggggtat gaagatcctg agcatcctga ttgggtttgt gaactccaac 3600 
gctctctgta cggcctgaaa cagtcaccta ggctgtggaa tgcggaactg cacaggattc 3660 tcatcactct caatctcacg caatccaaat acgatcctac tctctatttt cgtttggttg 3720 atggcaagct tgtgggagct cttactgctc atgtcgatga tctggcgatt gtcggtgaag 3780 actcatttgt gcaagaaatc atcaatgcca tctccaagaa gtttgaaata ttgtcaaaca 3840 aagaactaac tcatttcctc tctctcgata tcagtcgtga tgttcctgca cgaaaggttt 3900 ttatctctca gtcacactac atacgagaac tcatgaacag attcttaccg tctgactcta 3960 tctcggtcaa aacacctact gattccgctt tcaaagatct cactcctcga gctgtagggg 4020 aagaaccctc acctggtcac tatttgagtc tggttggtgc tctactttgg gttgcacaat 4080 gcaccagacc agatgtctca ttcgccgtca accgcttatc tcaatatctc cgtgacccat 4140 cagcctctca ttgggctgca ggtctgcgtg ttcttcgata cctctcgtgg acggtcgatt 4200 tacgactctg tctgggtgga tctgctactt tctcaggcta ctcggattcg gattgggctg 4260 aggatcgaca cgatcgtcgg tctactaccg gttacacctt tcgttttggg catggtccaa 4320 tctcgtggaa atcacggaaa cagaccaccg aagccgagta caaagcaatg tctgactcgt 4380 gtagggaagc tatgtggttt aaacatcttc ggtttgaact aaacataaga accccatcag 4440 caatacctct tcatgttgac aacgaaggcg cggaggctct tgcgaaaaat cctcagcacc 4500 actcgaggac taaacatatt cacgctcgat tccactttgt ccgagagtgt gtaaagaatg 4560 aggatgtgac tatattacat gtatcctcta aggacatgtt ggcagacatg cttaccaagc 4620 ctctaccacg tgtcttgctc gagcgtcaca gagccatgtt tggtctggtc taattttgtc 4680 tctttctttt tccttttttt tttttgtttt tttttctttt ctttaaactc tctttctctt 4740 tctttatgct atcttatgat atttgtttga tgaagctagc tggacgcaag gggggg 4796 // ID GORPI repbase; DNA; FNG; 2406 BP. XX AC AJ864642; XX DT 02-SEP-2005 (Rel. 10.08, Created) DT 04-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Partial putative gypsy-like retrotransposon (partial sequence). XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW internal portion; GORPI. XX OS Orpinomyces sp. OUS1 OC Eukaryota; Fungi; Neocallimastigomycota; Neocallimastigomycetes; OC Neocallimastigales; Neocallimastigaceae; Orpinomyces. XX RN [1] RP 1-2406 RA Nicholson M.J., Theodorou M.K. 
and Brookman J.L.; RT "Molecular analysis of the anaerobic rumen fungus Orpinomyces - RT insights into an AT-rich genome."; RL Microbiology 151(Pt 1), 121-133 (2005). XX DR EMBL/GenBank/DDBJ; AJ864642; Positions 1 2406. XX FH Key Location/Qualifiers FT CDS 1..2403 FT /product="GORPI_I_1p" FT /translation="EFCIGNLQLNGINGILGRDWLTKHNPYINYQVNKIFF FT IGRYCGSHCPSARKNKFFNQKPEVTASMINPENSNDETNYDTIIPESISED FT ELFEEDICAAMLPGSTINEEQLNKEDIINKYYYDLKIVFEKKNAEKLPPHR FT EYDISIDLIPGGQLYFGPIYSLTTTELKTLKEYIKENLEKKFIRKSKSPAG FT APVLFVKKHDGSLRLCVDYRRLNAITIRNSYPIPRINDLIESFKDSKIFTR FT LDLRSAYNLVRVKAGHEYLTAFRTPIGHYEYLVMPFGLRNAPSVFQRFIQD FT VLNDVIGSYVQVYLDDIIIYSKTISEHVKHVRFVLKLLIDNGLYAKLEKCD FT FHVAETTFLGFTVSINGLTMDQNKVKSVVEWPTPKNLKELQSFLGLCNFYR FT KFIKNFAKIMEPLRALLKKENNFNWNSEAEDAFKKLKASFTTGEVLIFPDP FT EKEFVVETDASDFAVGCVLSQIGSADNWLQPAAVTLRDREDQAGKAMTQSY FT DEELLAIITRFTMYGGIHLEGAKYPVQVITDHKNLLYFKKPQHLNLRLIRW FT SLFLSKFDFRIYYRPGAKGGKPDALSRRPDYKTELPPNINSVIKTDSFCCA FT INDNIQSLIEAQNADKYCKEVRSKISNKSNEIKSSLFSLVDGVLHFQKRII FT VPSSLKARILKTFHDAPTSGHQGVDRTYEKLKRYYWWPNMKKDIYNYVLSC FT DTCCRSKMRRHKPYGKIQPLPIPTKPWEIIGVDYIVYLPTSQGCTCIMVVS FT DHLTKMIHLVPCSDVPTADLTAKLLLYNVFRYHGFPKIIVSDHGSQFSSE" XX SQ Sequence 2406 BP; 814 A; 452 C; 412 G; 728 T; 0 other; gaattctgta ttggaaacct tcaattaaat ggtatcaacg gaatcttagg aagagattgg 60 ttgacaaagc ataatccata tatcaattac caagtaaata aaatattctt tattggaagg 120 tactgtggat ctcactgtcc atccgcaaga aagaacaaat ttttcaatca aaaaccagaa 180 gtaacagcat ccatgattaa tccagaaaac tcaaatgacg aaaccaatta tgatacaata 240 attcctgaat ctatctctga agatgaactc tttgaggaag acatttgtgc agcaatgtta 300 cctggatcaa ctatcaatga agaacaactt aacaaagaag atattataaa caagtattat 360 tatgatctaa aaatcgtatt cgaaaagaaa aatgcagaga aacttccacc tcaccgagaa 420 tatgatatct ccattgatct cattcctgga ggtcaattat actttggacc aatatactct 480 ttaactacta ctgaattaaa aaccttgaaa gagtatatta aagaaaattt agaaaagaaa 540 ttcatccgca aatctaaatc accagcgggt gcacctgttt tattcgtaaa aaaacatgat 600 ggttctttaa gactctgtgt ggattatcgt cgccttaatg ctattacaat ccgtaatagc 660 
tatccaattc caagaataaa cgatttaata gaatcgttta aagactcgaa aattttcact 720 aggttggatt tgcgttcagc ttataacctt gtaagagtta aagctggaca tgaatactta 780 acagctttca ggactccaat tggtcattac gaatatcttg taatgccttt tggattaaga 840 aatgcccctt cggtatttca aagatttata caagatgtac taaatgatgt tataggttct 900 tatgttcaag tataccttga tgacatcatt atctattcca aaactatttc agagcatgta 960 aaacatgttc gctttgtttt aaaattactt attgataatg gcttatatgc aaaattagaa 1020 aagtgcgact ttcatgttgc agaaactact tttctaggct tcaccgtatc tattaatggt 1080 cttacaatgg accaaaacaa ggtcaagtcg gtcgtggaat ggccaactcc taaaaattta 1140 aaggaacttc aaagtttcct tgggctttgt aatttttatc gcaagttcat taagaacttt 1200 gctaaaataa tggaacctct tcgcgctctc cttaaaaagg aaaataattt taactggaac 1260 tcggaagctg aagatgcctt caaaaaacta aaagcatcat ttacaactgg tgaggtactc 1320 atctttccag atcctgagaa agaatttgtt gtggagaccg atgccagcga ctttgctgtt 1380 ggctgtgtac tctcacaaat tggtagtgcg gataattggt tacagccagc ggccgttaca 1440 ttacgagatc gcgaagatca agccggaaag gcaatgacgc agtcatacga tgaagaactt 1500 cttgccataa ttacccgctt tacgatgtac ggaggcatac atctagaagg tgccaaatat 1560 ccagtccaag taataaccga tcataagaat ttattatatt tcaagaagcc ccaacactta 1620 aatctacgac taatccgttg gagcttattc ctttccaaat ttgattttcg gatatattat 1680 aggccaggag caaaaggtgg taaacctgat gcattatcca gaaggcctga ttataaaact 1740 gaacttccac ctaatattaa ttcagttatt aaaactgatt ctttttgttg tgctataaat 1800 gataatattc aaagtcttat tgaagcacaa aatgcagata aatactgcaa ggaagttcga 1860 tccaaaatca gtaataagtc gaacgaaatt aaaagttcac tttttagcct cgtcgatggc 1920 gttttacatt ttcaaaaacg catcatagta ccatcctctc ttaaggcaag aattttaaaa 1980 acatttcatg acgcaccaac tagtgggcac cagggggttg ataggaccta tgaaaaatta 2040 aaaaggtact actggtggcc taatatgaaa aaagatattt ataattacgt tttatcgtgc 2100 gatacctgtt gtcgaagcaa aatgcgcagg cataaaccct acggtaaaat acagccgctt 2160 cccattccta caaaaccctg ggagattata ggggtcgact atattgtcta cctgccaact 2220 tcgcaaggtt gcacttgtat aatggttgtg tccgatcatt taactaaaat gatccatctt 2280 gtgccttgtt ccgatgtccc aactgccgat cttaccgcta agctcctttt atataatgtt 2340 tttcgatacc 
atggttttcc aaaaatcata gtatctgatc atggttctca gttctcctcc 2400 gaattc 2406 // ID Copia-1_AB-I repbase; DNA; FNG; 4276 BP. XX AC GU129696; XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 1) XX DE LTR Copia retrotransposon - internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; KW LTR Rtrotransposon; Tab1_Sc2; Full Length; Copia-1_AB; KW Copia-1_AB-I. XX OS Agaricus bisporus OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Agaricaceae; OC Agaricus. XX RN [1] RP 1-4276 RA Sonnenberg A.S.M.; RT "Annotation Repetative elements in Agaricus bisporus. Copia RT group."; RL Direct Submission to Repbase Update (04-NOV-2009). XX DR EMBL/GenBank/DDBJ; GU129696; Positions 277 4552. XX CC Full length copy of LTR copia transposable element. CC One ORF. Conserved region of Gag (Zn2HC), protease, integrase, RT CC and RNHase identified, as well as putative (-)PBS and (+)PPT CC primer binding sites. CC Target site duplication of 5 bp.
XX FH Key Location/Qualifiers FT CDS 67..4266 FT /product="Copia-1_AB-I_1p" FT /translation="MSSTAVMNDVLPPTVPALDVSGTNWAIFELRFRMVIQ FT GKGLWGHFDGSTPRPTPPRAITPASHPSTPVLATSPSTAAAAVQVPIPSTP FT TPTVAEIDAWDRNENIALSLLAQRIPDSTLVVVSAQTTVKLMWDKIVRDYT FT YKSAFSQANLRQDFMSSCCPSGGDVRLFLNELRAKKAELLAIGVHISDDEY FT RSAIIQSLPRWLSTYASNQLSAARLHTSLHNTIDPDMLIVMICDEWDRTRR FT FAKKGQKSEGNDALAVEEEGKGKKKGKGKGKAKDGERKKGPCWFCQGEHLK FT KDCAEWKKKQEEGSSKKDQSKSSANVAEEDDDESFAVDIDKGGKVSETIAR FT VEVFDSGSSRHISPYRDMFTSLQMIQPCALRTANQQCLNAIGKGEIKLDLP FT NGDSRSQLHLKEALYAPEAGYTLISIGRLDTDGFSTTFRDNKCIIRDSNGA FT RVAEIPRNEKGLYKLVKSSCDEVNVAVETLTVDALHRRLGHISSVAARKLV FT TSGLVSGLKLSGDESNSITCDSCSYAKATRLPIAKVCEGERALKVGEEVHT FT DVWGPSRVATKKGRRYYVTFTDDYSRWTHIEFLSNKSDVFEAYKQFEAWCE FT TQFNSRIKVLHSDRGGEYTSEEFQKYLKSRGTQTKLTVHDTPQHNGVAERR FT NRTIVERVRALLHASCLPKSLWAEAAAHIVWLMNRTSTKAVQGMTPFEALY FT GRKPRLGNVQEWGDEVWVHQAGGDKLGARAKKGKWLGYDTESNGSRILFPD FT TGTIKIERNFRFIKDQTNLQLEGEYIPTPEVPASSTPAISSPELHESTTTP FT VSPSVGSTPTQRESSPAPIQQPDSPDQVPVVRRSQRTRQPSQKAREILEGK FT GITVVEELDWEEVHVLVTEMEIMEALEPRTWKEATQRTDWPLWKKAMEEEL FT ATLQAAGTWELVDCPLGINIVGSKWVFKAKKDAAGNIVRYKARLVAQGYSQ FT IPGVDYFDTFAPVARLSSIRTVLAIATARNLEIHQIDVKGAYLNGILNDDE FT TVYMRQPPGFHDTTHPRYVCHLKKTLYGLKQSGRRWYQRLCEILIDNLGYS FT RCDVDHGVFFRVIQDDLIIILVHVDDCTLVATKLELIRELKERMNEFVEVT FT DLGEIHWLLGIEIRRNREEGKLYMSQRSYIDSCLRRYGFEDAKPVSIPMDP FT SIHLSTNQSPNSTTEIARMARIPYQEAVGSLMYAAIATRPDIAFAIQVLSK FT FSKNPGEKHWEAVKRVFRYLKGTRELWLTFGGQDDTLKGFADADGNMAEDR FT HATSGFAFIINGGAVSWSAKRQEIVTLSTTESEYVAATHAAKETLWLRSLI FT SQVFNITLPTTRLFSDNQSAIALTKDHQFHSRTKHIDIRYHFIRWIVEEGK FT IRLVYCPTEDMVADTLTKALPSPKIKHFACELGLTTV" XX SQ Sequence 4276 BP; 1226 A; 939 C; 1020 G; 1091 T; 0 other; ggttatgggc cccgccccat tgtgagacat catttataaa agtgctgcaa gcgaaagaac 60 tacacaatgt caagtacagc agttatgaac gacgtattgc cacccaccgt gcctgcattg 120 gatgtttccg gaacgaattg ggcaatattt gaattgcgtt tccggatggt tattcaaggg 180 aagggcttgt ggggtcattt cgacggttca actcctcgtc ctacaccacc gcgagctatc 240 actccggcgt cgcacccatc tactccagtc ctagctactt 
ctccttcaac cgcagctgcg 300 gctgtacaag ttcctattcc ttcgactcct acaccaacag tggccgaaat tgatgcctgg 360 gatcgtaacg aaaatattgc gctatcgctt ctagctcagc gaattcctga ctccacactt 420 gtagtggttt ctgcccaaac tactgtcaaa ctgatgtggg acaaaatagt tcgagactat 480 acgtacaaaa gtgctttctc tcaagcaaac cttcgccaag acttcatgtc ttcatgttgc 540 ccaagcggag gtgacgtacg cttgttcttg aatgagttac gagctaaaaa ggcggagcta 600 ctcgctattg gtgtccacat tagtgatgat gagtatcgaa gtgcaataat tcaatcccta 660 cctcgttggt tgtcgacata tgcttcaaat caactctcag cagctcgtct ccatacttct 720 cttcacaaca ctattgatcc tgatatgctc atcgttatga tttgcgacga atgggaccgt 780 acccgacgtt tcgccaagaa aggtcaaaaa tctgaaggga atgatgcatt agccgttgaa 840 gaggaaggaa aaggaaagaa gaaagggaaa gggaaaggga aagctaaaga tggcgagagg 900 aagaaaggac catgttggtt ttgccaaggc gagcatttga aaaaggattg tgcggaatgg 960 aagaagaaac aagaggaagg gtctagtaag aaagaccaat ctaaatcaag tgcaaatgta 1020 gctgaagaag atgatgacga gtcttttgca gtagacatag ataaaggggg aaaggtttcc 1080 gagaccatag cccgagtcga agttttcgat tccggatcct ctcgacacat ttctccttat 1140 cgagatatgt ttacttcgct tcaaatgatc caaccctgtg cgttgcgaac tgcaaatcag 1200 caatgcctaa acgcaatcgg gaaaggagaa atcaaactag acttgcccaa tggtgattca 1260 cgatcccaac tacatcttaa agaagcactg tatgcacctg aagctggtta tactctcatt 1320 tctattggtc gtctggacac cgacggtttt tccacgacgt tcagggataa taaatgcata 1380 attcgagact caaatggtgc tcgtgtcgcc gaaattccta gaaatgagaa ggggctgtat 1440 aaattggtca aatccagctg cgatgaagta aacgtagcag ttgaaaccct taccgttgat 1500 gcactacatc gtcgtcttgg tcatatatca tctgtggccg ctcgaaaact tgtaacgagt 1560 ggactagtat ctggactaaa gttgagcgga gatgaatcta actcaattac ctgtgattca 1620 tgttcatatg ccaaagctac acgcttgccg atagcgaaag tttgtgaagg tgaaagagcg 1680 ttgaaagttg gcgaggaagt tcatacggat gtatggggac catccagagt tgcgacaaag 1740 aaaggacgac gttactacgt tacgttcacg gacgactact cgaggtggac tcacattgag 1800 ttcttatcca acaaatcgga tgtttttgaa gcatacaagc agtttgaagc ctggtgcgag 1860 actcagttca attctcgtat caaagttctc cattcagatc gaggagggga atatacgtct 1920 gaagaatttc agaagtatct caaatcgcga ggaactcaaa cgaaacttac agttcatgat 
1980 acgcctcaac ataacggagt agcagaacgt cgtaatcgaa ccatcgttga acgtgttcga 2040 gcattgttac atgctagttg tcttccaaag agcctatggg ctgaagctgc cgcccacata 2100 gtatggttga tgaatagaac cagtactaaa gctgttcaag gtatgacacc tttcgaagct 2160 ttgtatggcc gtaaacctcg actcggaaac gttcaggagt ggggtgatga agtttgggtt 2220 caccaagctg gaggggacaa gttgggagca cgagcaaaaa aggggaaatg gttgggatat 2280 gataccgaga gtaacggctc acgaattttg ttccctgata ctgggacaat caaaattgag 2340 cgtaactttc gctttatcaa agatcagact aacctacagc tcgaggggga gtatatcccg 2400 actcctgaag tacccgcgtc gtccacaccg gctatttcat caccagaatt gcatgaatct 2460 accaccactc cagtttcacc aagcgttggt agtaccccca ctcaacgcga gtcgtcaccg 2520 gcaccaattc aacaacctga ttcaccggat caagtaccag ttgttagacg gtcacaaaga 2580 acacggcagc cgagccagaa agctcgcgaa attcttgaag gaaaaggcat caccgttgta 2640 gaagaattgg actgggagga agttcatgta ttagtaacag agatggaaat aatggaggct 2700 ttagaacctc gtacctggaa agaggccaca cagaggactg actggccgtt gtggaagaaa 2760 gcgatggagg aagaattagc cacgttgcag gctgctggca cctgggaact agttgattgt 2820 ccgcttggca tcaacatcgt gggatcaaag tgggtcttca aagcgaagaa ggatgcggca 2880 ggtaacattg tccgctataa agctcgttta gtggcccaag gttattcaca gatcccagga 2940 gtcgactatt tcgacacttt tgcgccagta gcgcgacttt cgtcaatccg aaccgtactc 3000 gctattgcta ctgcccggaa tcttgaaatc catcaaattg atgtcaaagg ggcgtatctt 3060 aatggtatat taaatgatga cgaaactgtt tacatgcgtc aacctcccgg tttccacgac 3120 accactcatc cccgttatgt ctgtcatctc aaaaagacac tttatggtct taaacagtcc 3180 ggaagacgtt ggtatcaacg cctttgtgaa attcttatcg acaatctagg ctattctcgc 3240 tgtgatgtcg atcacggtgt tttcttccgt gtcattcaag atgatttgat cataatcctt 3300 gttcatgtgg acgactgtac tctcgtggca acaaaactcg agcttattag agagttgaaa 3360 gagaggatga atgagtttgt agaggtgact gatctcggag agattcactg gcttctaggg 3420 attgaaattc gacgaaatcg agaggaaggt aaattgtata tgtctcaaag gtcgtatatt 3480 gattcgtgtc ttcggcgata cggtttcgag gatgctaaac cagtcagtat ccctatggat 3540 ccatccattc atctttctac caaccaatca ccgaattcta ctacggaaat tgctcgcatg 3600 gcacgtattc cctaccaaga ggctgtaggt tcgcttatgt atgcagcaat tgccactcga 3660 
cctgatattg ccttcgcaat acaagtcctt tcgaaattct cgaagaatcc tggagagaag 3720 cattgggagg ctgtaaagag agttttccgg tatctaaaag gtacgagaga gttgtggttg 3780 acttttggag gtcaagacga tactctaaag ggttttgctg atgcagatgg gaatatggca 3840 gaggataggc atgcgacttc gggtttcgcc tttatcataa acggaggtgc tgtttcatgg 3900 agtgcgaaac gacaagaaat cgtcaccttg tcgactactg aatccgaata cgtggccgct 3960 acacatgccg ccaaagaaac tctatggctc cgttctctca tttcccaagt attcaatatc 4020 actctcccca ctactcgtct attctcggac aatcaatcgg ctatcgcgct gacaaaagat 4080 catcaatttc atagtcgaac aaaacatatt gacatacgtt atcatttcat acgatggata 4140 gtcgaagaag ggaaaatccg gcttgtctat tgtccaacag aggatatggt tgcagacaca 4200 ttgacgaaag ctctaccatc tccaaagatt aagcatttcg catgtgagct agggctcacc 4260 acggtttgag ggggag 4276 // ID Copia-43_MLP-I repbase; DNA; FNG; 6262 BP. XX AC AECX01001209; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-43_MLP_; KW Copia-43_MLP-LTR; Copia-43_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6262 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001209; Positions 11783 5522. XX CC Positions [1899-2411] - Integrase core CC LTRs are 93% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(48..1514,1518..4652) FT /product="Copia-43_MLP-I_1p" FT /translation="MSSDQDKFFAQTTESIPGSDTSSSSREGDETVRNITS FT SLKPSTSANPHPPGPYMSKSVQNPKTVEPASSISTMSLPSNQISQRQQAIN FT LSQALNKLSRKILTEESFPVWLSYIRGELNSLFLADYLKSDELKTEDAQLS FT DEVCRACITSWMLGTMDDVNRSRFEPKITNYTDDGPETQNLPAKLWSAVNS FT HHLSRSEELCLLLEQSLNLIVQSANLPLLTHIENFQNAVTKYKTAGGKMSD FT EDLGRKLLMSLNNNYFQDARDLAIKGVSEYEKVVTELKKRLDAVAMITRAP FT QVSRQPTLVHQSAKASAVSNSSYRNKFQNKCTKRQCTGTNHTPDQCFKKPG FT NEHLQREWIEQRVKMGQWSGEVPKDLNVTSASATVFHEEPSIQQLEEAFNG FT LNPTASHVSIKSLSSNQSGTPGITRVLIDTAASHHMFKEKHVFANYQDVLN FT KGDSLYMAGGKETLRIHGRGDVHFVGPDGHLFKLQNCLHIPDLKSLIGGTI FT LLKNDFITVKEGIQFKISKDGNCAFEGHLHDQVHLLELTVRHLSRDEINPK FT ANHISSISEADITRLHQRLGHPNIQYLRTMVRNESVQGMNISLSHIPDVFH FT CNSCDLAKGHKIPHNKIHVRSQHLLDNVHLDLSGIIRTNSVCGSNYFMLFT FT DDYSRYRHVVGLSTKSANVVFNKIKQYISLVERQCDRKLKMITLDNGYEFI FT NDTMVPYCKEAGIYLRTTATYTPEENGVAERSNRTITEPARAMMIQANLPV FT RFWLYSIKAAVYLKNRTISSSIPAGTTPFELWNGRKPDVGHVRPIGCLCYV FT LIRKPLRNGKFDQVSRQAVLLGHTDHNLNYEVFIFDSNTIVISHNVVFRDN FT NFPFKKLKSFDVSHLSFDDDAPLLLSDPSAVEPLQGEQGNDAEEPGGEGMH FT QAYDNINTNLLVDSQHIPDTDDQVIAPEDDQIHQPIESEPVVEDEPEDDVP FT RRSGRARVPTDFYRPSASFAYWEDDGAFMNFHEYGYYAFAVGPVVRLINEP FT HNFKSAMNGPDVEKWKEACEKELENTKEKKVWRLVPRPANHPVVGSKWHFK FT VKLNPDGSINQHKARIVAKGYTQTYGLDYNQTFAPTGKPASFRAVVAFATY FT HGFEIHSMDAVAALLNSRLKEKIYMEQPEGYEMKLPDIDDDDLVCELLQAL FT YGLKQSAREWNDDFKEKCIKAGFKQSEADECVYIRCGGNDIIVFYLHVDDL FT AITGNAIKAFEEEMSGFWKMEDLGISTCVMGIQIVQASKHHYIIGQEAMSW FT SLLERFGMMDCKPASTPFPGGLKLTKATDDEAKAFALLNFPYNSGVGSLMY FT LSQCTRPDIAYAVGCLSQHLQRPALRHWEAFKHVLRYLQGTLSYCIHYNNE FT NLSTDVISNNAYSLPEHFADADWAGDKSTRRSTTGYVFMLCGGVISWRSRL FT QQTVAKLLTEAEYRAANEAGDEMIWLALFLKSIGFHQSVPYILNCDSLSAM FT DLADNAVLHGRTKAIEIQMHWKRDKVKDGTLKLVHCNSEDMIADLLMKPLH FT PGKFNHFRRLIGMRSVDE" XX SQ Sequence 6262 BP; 1964 A; 1170 C; 1314 G; 1814 T; 0 other; ttcttatggt agcgagagtc tcatctaaat tgatcgaatc aaactgaatg agttcggatc 60 aagataagtt cttcgcacaa accaccgaat caatcccagg cagtgataca tcttcatcct 120 caagagaagg agacgaaaca 
gttagaaata tcacatcttc tctcaaacca tctacatccg 180 caaatcctca tccacctgga ccatatatgt ccaaatcagt ccaaaatcct aagacggtag 240 aacccgcttc cagtatttca accatgtcat tacctagcaa tcaaataagt cagcgacaac 300 aagctatcaa cttaagccaa gctctgaaca aattatcaag aaagatctta acggaagaaa 360 gtttccccgt atggttgtca tacataagag gcgaattgaa ttcactgttc ctggctgatt 420 acttaaagtc tgatgaattg aaaactgaag acgctcagct aagtgacgag gtatgcaggg 480 cttgtattac ctcttggatg ttgggaacta tggatgacgt gaatagatct cgattcgagc 540 caaaaattac gaattacaca gacgatggcc cggaaactca gaacctacca gccaaattat 600 ggagtgcggt taattcacac catttgagcc gatcagagga gctttgttta cttctcgaac 660 aatctttgaa cttaatagtc caatcggcaa acttaccgtt attaactcac attgaaaatt 720 ttcaaaatgc ggttactaaa tataaaacag ctggagggaa gatgtctgat gaggatctcg 780 gcagaaagct tcttatgtct ttgaataaca attactttca ggatgcaaga gaccttgcga 840 tcaaaggtgt tagtgaatat gaaaaggtgg taactgaatt aaagaagaga ttagacgctg 900 tggctatgat cactagagcc cctcaagtat ctcgtcaacc taccttggtc catcaatctg 960 ccaaagctag tgcggtgtcg aattcatctt atcgtaacaa atttcaaaac aaatgcacca 1020 agagacaatg caccggtacg aatcataccc ctgatcaatg tttcaaaaaa cctggtaatg 1080 aacacttgca acgcgagtgg attgagcaac gggttaagat gggtcaatgg tctggagaag 1140 tacctaagga tttgaatgtc acatctgcgt cagctactgt attccacgaa gaaccttcca 1200 tccagcaact tgaagaagct ttcaacggtt tgaaccctac ggccagccat gtctcaatca 1260 aatctttgtc tagtaatcag tctggtactc ctggtataac tcgagtttta atcgacacgg 1320 cagcatctca ccatatgttt aaggagaagc atgtctttgc taactatcaa gatgtgttaa 1380 ataaaggaga ttcattatat atggcaggag gcaaagaaac cctcagaatt catggccggg 1440 gtgacgttca ttttgttggt cctgatggtc acttatttaa gttacaaaat tgtctacaca 1500 tacctgattt aaaatgaagc cttattggag gcactatcct gctcaagaac gactttatta 1560 ctgtcaagga aggcatccaa ttcaaaatca gcaaagatgg aaactgtgca ttcgaaggcc 1620 acttacacga tcaagtccac ttgctagagt tgacggtgag acacctctcc cgagatgaaa 1680 tcaatcccaa agcaaatcac atatcttcaa tctcagaagc tgatattact cgtctgcatc 1740 aacgtctagg tcatcccaac atccaatacc tgaggaccat ggtcaggaat gaaagtgtgc 1800 aagggatgaa tataagtctt tctcatatcc cagacgtatt 
tcattgtaat tcatgtgatt 1860 tggcaaaagg acataaaatc cctcataata aaattcatgt tagaagtcaa catttattgg 1920 acaatgtcca tttagacttg agcggtatca ttcgtacaaa ttctgtatgt gggagtaatt 1980 atttcatgct gttcacagat gattattcaa ggtacagaca tgttgttggt ctatcaacaa 2040 aatccgccaa tgtagtgttc aataaaatca aacagtatat ctcgttagtt gagagacaat 2100 gcgacagaaa attgaaaatg attaccctgg ataacggcta tgaatttatc aatgatacca 2160 tggttccgta ttgtaaggaa gcaggtatct acttacgaac taccgcaact tatactcctg 2220 aggagaacgg agtggcagag aggtccaata ggacaatcac agaacctgct agagcaatga 2280 tgattcaggc caacctccct gttcgtttct ggttgtattc aatcaaggca gctgtctatc 2340 tcaagaacag gaccatatca tcgtcaatac cagccggaac gactccgttt gaactatgga 2400 acggtagaaa gccagatgtc ggccatgtga gacctatagg atgtctgtgc tacgttctta 2460 ttcgtaaacc tctcagaaat ggaaagtttg atcaagtttc gcgacaagca gtattgttag 2520 gtcatacaga tcataatttg aattatgaag tattcatttt tgattccaat actattgtta 2580 tatcacataa cgtggtgttt cgagacaata actttccgtt caagaaactc aaatcttttg 2640 atgtatcaca tctatccttt gatgatgatg ctccgttatt attatcggat ccttccgcag 2700 ttgaacctct acaaggggaa cagggcaacg acgctgaaga acctggtggt gaaggaatgc 2760 accaggccta tgacaatatc aacaccaatt tacttgtcga ttcacaacat atacctgata 2820 cagatgatca agtgattgca cctgaagacg accaaatcca tcaacctatt gaatctgagc 2880 cggttgtcga agacgagcct gaagatgatg tacctagaag atcaggacga gcaagggtgc 2940 ctacagactt ttatcggcct tcggcaagtt ttgcatactg ggaagacgat ggtgcattta 3000 tgaattttca tgagtatggt tactatgctt tcgcggttgg tccagttgtt aggcttatta 3060 atgaacccca taattttaag tcggctatga acggtccgga tgttgaaaaa tggaaggagg 3120 cctgtgagaa ggagttagaa aacacgaagg agaagaaagt gtggcgtttg gttcctaggc 3180 cagcaaatca tcctgtggtc ggtagtaagt ggcactttaa agtaaagctg aatccagatg 3240 ggtcgatcaa ccaacataag gcaagaattg ttgcgaaggg gtacactcaa acctacggat 3300 tggattataa tcagaccttt gcaccaactg ggaagcctgc ttcttttcga gctgtggttg 3360 cattcgcgac ttatcatggg tttgagatac attcaatgga cgccgtagcg gctttgttaa 3420 atagtagatt gaaagagaag atatacatgg agcagccaga agggtacgag atgaaattac 3480 ctgatataga tgatgatgat ctagtgtgtg aattattaca ggcattatac 
gggttgaaac 3540 aatcagcgag agaatggaat gacgatttta aagagaagtg tatcaaggct ggattcaagc 3600 aatcagaagc ggatgaatgt gtttacatca gatgtggagg aaatgatatt attgtgttct 3660 atttacatgt agatgatcta gctattactg ggaatgcaat caaggcattt gaagaagaaa 3720 tgagtggttt ttggaagatg gaggatttgg gtatctctac ttgtgtaatg ggcattcaaa 3780 tagttcaagc aagtaaacat cattatatca ttggacagga agctatgtct tggtcgttac 3840 tggagcgttt tggtatgatg gattgtaaac cagcatcaac cccatttccg ggaggattga 3900 aactcacaaa agcaacggac gatgaagcta aagcgtttgc attactcaat tttccctaca 3960 atagcggggt gggtagtctc atgtatttgt cccagtgtac ccgaccggat atcgcgtatg 4020 cagtaggatg cctatctcaa catcttcaac gaccagctct gcgtcattgg gaagctttta 4080 aacatgtgct tcgatatctg caagggactt tatcctactg tatccattac aacaatgaaa 4140 atttatcaac tgacgtcatt agtaataacg cttactcatt accagaacac tttgcggatg 4200 cggactgggc tggagacaag agtactagga ggtcaacaac ggggtacgtt ttcatgttat 4260 gtggtggagt cataagttgg cgtagtaggc tgcaacagac ggtagcaaaa ttgttgactg 4320 aagcagagta tagagcggca aatgaagcgg gagatgaaat gatctggttg gctctattct 4380 taaaatctat aggttttcat caatcagtgc catacatatt aaattgtgat agtttgagcg 4440 ctatggattt agctgataac gctgtattgc atggaaggac gaaagctata gagatacaga 4500 tgcattggaa acgagataaa gtcaaggatg gaacattgaa attggtgcat tgtaattcag 4560 aagacatgat tgcagattta ctgatgaagc cattacaccc gggtaaattc aatcatttta 4620 gaaggttaat aggtatgaga tcagtggatg aataggtgtt tcaattatgt cgattgaggg 4680 ggtgtgttga agtgtgcaat cgacgaatga tgagatgacg tgttaaagaa aggaggatta 4740 attcacataa accgttgagg aaaatgtttg ctaaaattca ccccaatcga attcaggata 4800 atccatatta tacttataac aatccatgaa tccaaattta cttctctttc ctcatctttg 4860 gaaagagaag tgagtacatt tcattatttg atttcttatt tcttacttca tttaaactat 4920 caagattaat gattggttgt tgttactgtt tgttctaatt aagatctttt actcttgagc 4980 ccaaagatca gttatcgaat catagataac ttatactaat acattatctg taccaatcat 5040 ataggtacaa atctcatagt cttatagttt ctctatgttc atgtttctaa taaatttgtg 5100 tgtgttgttt tagggtacgt ttggtagcga ttattttcgg cgattatttt gaataatagc 5160 gattattttg caaaataatt gctgttaaaa aaagacgttt gggaccactt acggtaccca 
5220 taaataaact ccttccccgt cccaaaagga gtttaaaacg ggactttgga tcaacgccaa 5280 aagggagttt atccgtgcaa taaactccct tttgtacttc acaaaatttc atctcgaaag 5340 acttcaaccc tcctcaatcg atccaatatg tattcacttc ataagatgct tgttctttca 5400 ctcaatcaac atcttcaaca ctacaccatc atactcaaga ctcgcaagcc aatccttgct 5460 tctactcctt gctttctgtg taaaaagttg tcttttgtga ttgtacaaat gaaatcttga 5520 aaaccaattt tgattgaagg aacgagttat cgattgttca atttccagtc aattttgcaa 5580 tttcaatcaa tcggtcaatt ttaaatcaat ttttcaaaac caatcaatca ttcaacttca 5640 ttctatcaat ctcaatcaat tgttgaactt caaaaaatct caaaacaaat cactcttgaa 5700 agatgagata aatagggagt ggtggagagt gttgaatttt gtgggacatg tttctgttat 5760 cgcaagcgtg tttgggcaaa agattgttgg atgggaacaa aggttgtatc atatgatcat 5820 tacttgtatc ggacatcgcc agaggatatc catatgggtg gatgcgaagt ggtagaatgt 5880 gaagtgagcc gagctgagct gagaaacctt caaagagagg tgactacata tagatacgtc 5940 tggttgggat tttcaaccaa acgtgatttc ctgcaatgta ccctaaacgg agtttatttg 6000 atgaaaaatc aatttgaaca ataaactcat ttgccccggg gcaatgtacc tggggtcaaa 6060 cggagttttc aattcgtaat cgcaaccaaa cgtaccctta ggtcagagct gtctttgtag 6120 attttgtgct gggtattggt tctcatcgtg ctattgtgca cactcttagg tgaatagaat 6180 tatctttgga aagagagatc ttttactctt gagcccaaag atcagttatc gaatcataga 6240 taacttatac taatacatta tc 6262 // ID Gypsy-10_CCO-I repbase; DNA; FNG; 10727 BP. XX AC AACS02000004; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_CCO_; KW Gypsy-10_CCO-LTR; Gypsy-10_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-10727 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000004; Positions 1656216 1645490. 
XX CC Positions [2332-2847] - Reverse transcriptase CC Positions [4018-4500] - Integrase core CC 'CACGT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 521..1492 FT /product="Gypsy-10_CCO-I_3p" FT /translation="MAGNNNSNTTKNDSKSDNGKVIPKRFNGNRYHYQAFR FT DAVDLFFITDDKHDTDQKKIAFVLALLDEGEARIWRTNYLRQCRKNGKLDL FT GNFDELLKKLDDTFKQLEEEDEALFALNNMKQRPNERAEQTITRFREQASL FT AGLDLTKNDRIAIDYLKDVLDPGLVDKVSLDVREPDTFEEWVKLAIKYDRV FT YRRNKLLKSLGKRGNTPNFRKTLSAFARSTRNNSERDPDAMDIDAISTDER FT TRMMKTGSCFYCREQGHIAKECPKKRKDRNSSTSNDTKKKTPREAAKYIRR FT LLAQYSPEEEAEILEAAGDTLDEDEDDDQEDF" FT CDS join(1579..2994,2998..5076) FT /product="Gypsy-10_CCO-I_1p" FT /translation="MRAPITLIGEKTGKTVDTKVLIDSAAGGTFISTRFAK FT RRNIPLRPLPRPLKVNNVDGTPNKEGAITHFVDSLARLHGQTFRIRWYATE FT IAHQDLILGLPWLRRANPIIDWKKGTLDWQDKIDPDKIENHLPQTNFSTPN FT TTTVAVMCLLEDETDEEGETPVVWINAKTTTAQLLAAQEQEKKEKKTIDDI FT LPEPYKKYRHLFEDRPETALPPHRKWDHKIDLKPGFVPKVFKTYQLTPAED FT QELQKFLEENLRLGRIRPSQSPMGSPLFFVAKKNGKLRPCQDYRYLNDWTI FT PNAYPLPRTDELMDRLQGSRYFTKLDIKWGYHNIRIREGDEWKAAFKTNRG FT LYEPTVMLFGKRNSPATFQNMMNDILDEDDGNPKPLKTEGYMDDLMPHGRT FT REECRENTLRTLAKLDKYDLPLNLEKCIFEATEVEYLGLIVRHNELSMDPT FT KVEGLANWPTPKNVKQVRQFLGFGNFYRFIRDYSKIAKPLNDLTRKNNKFE FT WTSATEEAFQELKKRFTSYPVLRMPDFTKPFQVEADASKYASGAVLTQMDD FT DGIRHPVCFMSKTFNDAERNYTIFDRELLAIIRALTEWRHYLQGSPFPLTI FT FSDHKNLIYWQDPRKITPRQARWRVFLEEFSPYEIRNTPGKQMIQSDNLSR FT RPDLCQEDEEPELVTMLPKEVFVNLINLIDLDIQERILSSDQNDDSANQAL FT QTLLEEAPSDLRQDLSDWTLETKDGRRMLFYQGKAYVPNDSDLRREIVRKH FT HDSPTIGHPGELGTFNAVKEHYWWPGMRRFIKSYVEGCTECQQYKINRHPT FT KPTLEPIAAPTSSRPFSQISMDFITDLPPSKGYDSILVVVDHGLTKGVVLE FT PCHKTITAEETAQLFLNRVFSRFGLPDKFISDRGPQFAARVFRDLCKLLKI FT DSALSTTFHPQTDGGTERVNQEIEAYLSIYCTMNPETWADHLPILEFTHNS FT RPHADRALSPFELIMGIKPQALPEAFDRTDYPSNEERIKSLDKSRTEALAA FT HELARSRMIERSRRQWKPFRKGQKVWLEAKNLKLPYQSKKIAPKRLGPFEI FT TDEVGSRAYRLALPTQWRIHNVFHASLLSPFKETDEHGTAFSEPPPDIIND FT EEEWEVEAIIAHRRRGRGYQYLTHFKGYPSSEDCWLPERNFENAQEILNAY FT KQRHDL" XX SQ Sequence 10727 BP; 3025 A; 3090 C; 
2446 G; 2166 T; 0 other; gaataacagg ctcctccatc gccgccttcc tcgaaagcgc ggccccctct cttcaaccac 60 ggattcagac gacgatctcc gaactccttc gacccctccc tacggctgct acaaccagcc 120 acacacattc tactccgaca agaagttcga cgtccaccta aacccagtcg gcaagtggat 180 tcactgcacc cgacaaccct gcccacatcg tctatccaaa caacccgaag agttccccag 240 ttggggagtc ccctgtgaca ccgtggaaca aactgaattc gatcccgaca tcgtagagga 300 atccctccag aaagccctcg aatctcgaac cgaacgaagg aacaaacaga aagaagaaag 360 aagacttcaa aagaagcatc agaaagctac gaagtcacca aggtcgaggg gccctactga 420 aatcgaacga accctatcgg accaactgaa gaatctgaac atcgaccaac cgactccatc 480 gaagaaccaa gaagaccctg aaaccgactc taataccgac atggccggaa acaataacag 540 caacaccacc aagaacgact cgaagagcga caatggcaaa gtcatcccca agaggttcaa 600 cggaaaccgc taccactacc aggcattccg cgatgccgtc gatctcttct tcatcactga 660 cgacaaacac gacaccgacc agaagaagat cgccttcgtc cttgccctac tcgacgaagg 720 agaagctcgc atttggcgaa ccaactacct cagacaatgc cgcaagaacg gaaaacttga 780 cctcggaaat ttcgacgaac tactcaagaa actcgatgat actttcaaac aacttgaaga 840 agaagatgaa gccctctttg ctcttaacaa catgaagcaa cgacccaatg aacgcgctga 900 acagacgatc actcgattca gagaacaagc ctccctcgct ggtctcgatc tcaccaagaa 960 cgatcgcatc gctatcgact acctcaaaga cgtcctggac cctggtctcg tcgacaaagt 1020 ctccctcgac gttagagaac ccgatacatt cgaagaatgg gttaaactcg ccatcaaata 1080 tgaccgggtc tatcgacgca acaaacttct caagtcgctt ggcaaacgag gaaatacacc 1140 caacttccgc aagaccctca gcgccttcgc cagatccacc agaaacaact ccgaacgcga 1200 ccccgatgcc atggacattg atgccatttc caccgatgaa cgaactcgaa tgatgaagac 1260 cggcagctgt ttctactgcc gtgaacaagg ccacatcgcc aaggaatgcc ccaagaaacg 1320 caaagatcga aactcatcca cctcgaacga caccaagaag aagacccccc gagaagccgc 1380 caaatacatc cgaagactcc tcgctcaata ctccccagaa gaagaagccg aaatcctcga 1440 agccgctgga gacaccctcg atgaagacga agacgacgac caggaggatt tttgaatcgg 1500 ggagcggttc agtcaacgta cgcctcccct cagcttgata tctactctgt actagtcacc 1560 gagatcaatg accacactat gcgcgcccct attactctca ttggtgaaaa gacaggaaaa 1620 accgttgaca ctaaagttct tattgacagc gcagcaggag gaacctttat tagcacaaga 1680 
ttcgcgaaac gacgaaatat tcccctccgt ccacttcctc gaccactcaa agtcaacaac 1740 gttgacggga ctccaaacaa ggaaggagca atcactcatt ttgtcgactc attggctcga 1800 ttacacggtc aaacattccg gattcgatgg tatgcaacag agatcgccca ccaagatcta 1860 atcctaggcc tcccatggct ccgtcgtgca aaccccatca tcgactggaa gaagggaaca 1920 ctcgactggc aagacaagat tgaccccgat aaaatcgaaa atcaccttcc tcaaacaaac 1980 ttttccaccc caaacacaac cactgtcgcc gtaatgtgcc tcctcgaaga tgaaaccgac 2040 gaagaaggcg aaactcctgt tgtttggatc aacgctaaaa ccactactgc tcaactcctg 2100 gccgcacaag aacaagaaaa gaaagaaaag aagactatcg acgacatcct tccagaacca 2160 tacaagaagt atcgacacct attcgaagat cgcccagaaa cagcccttcc tcctcacaga 2220 aaatgggatc acaagatcga cctcaaaccc ggatttgttc ccaaagtatt caagacttat 2280 cagctcaccc cagccgaaga ccaagaacta cagaaattcc tcgaagaaaa cctccgactc 2340 ggacggatcc gcccatcaca atctccgatg ggatcacctc ttttcttcgt tgccaagaag 2400 aacggaaaac tacgcccctg ccaagactac cgctacctca acgactggac catccctaac 2460 gcctatccac tacctcgaac tgatgaactc atggatcgac tccaaggatc ccgatacttt 2520 accaaactgg acatcaaatg ggggtatcat aacatccgga tccgagaagg agacgagtgg 2580 aaagccgcct tcaagacaaa tcgaggactc tacgaaccga ctgtcatgct tttcggaaaa 2640 cgcaactcac cagcaacctt ccagaacatg atgaacgata tcctcgacga agacgacgga 2700 aaccccaaac ctctcaagac agaaggatat atggacgacc tcatgcccca cggaagaacc 2760 agagaagaat gccgagaaaa cacattaaga accctcgcca aactcgacaa atacgacctc 2820 cccctcaacc tcgaaaaatg catatttgaa gccaccgaag tcgaatacct gggtctgatt 2880 gtccgacata acgaactcag tatggatccc acgaaggtgg aaggcctagc caactggcca 2940 acacctaaaa acgtcaaaca agtccgacaa ttccttggtt tcgggaactt ctattgacga 3000 ttcatccgag actactccaa gatcgctaaa ccactcaacg accttacacg aaagaacaac 3060 aaattcgaat ggacaagtgc taccgaagaa gccttccagg aactcaagaa gcgattcaca 3120 tcttaccccg ttctccgcat gcccgacttt accaaaccct tccaagttga agccgacgcc 3180 tcgaaatacg cttccggtgc tgtcctcaca caaatggatg acgacggaat ccgacacccg 3240 gtctgtttca tgtccaagac cttcaacgac gccgaacgga actacaccat cttcgatcgc 3300 gaattactgg ccatcattcg agcccttacc gagtggagac actacctcca agggtctccc 3360 ttcccattaa 
ccatcttctc cgatcacaaa aaccttatct attggcaaga cccgcggaag 3420 attacccccc gacaagctcg ctggcgcgta ttcctcgaag aattctcccc ttacgaaata 3480 cggaacaccc ctggaaaaca aatgatccaa tctgataacc tctctcgacg acccgacctc 3540 tgccaagaag atgaagaacc cgaactcgtc accatgttac ctaaagaagt cttcgtcaat 3600 ctcatcaacc tcatcgacct cgacatccag gaaagaatcc tatcctccga tcaaaacgac 3660 gacagtgcaa atcaagcttt acaaaccctc ctcgaagaag ccccctcgga tctccgacaa 3720 gatctctccg attggacttt ggagacgaaa gatggccgac gaatgctctt ctatcaaggg 3780 aaagcttacg ttcccaacga ctccgacctt cgacgagaaa tcgtcagaaa acaccatgat 3840 tcccccacga ttggacaccc tggtgaactc ggcactttca acgcagtcaa agaacactac 3900 tggtggcccg gaatgcgacg attcatcaaa tcttatgtcg aaggatgcac cgaatgccaa 3960 cagtacaaga tcaatcgtca tcctactaaa cccaccttgg aacctatcgc cgccccaact 4020 tcttcccgac cattttccca aatatcgatg gactttatca ccgatctccc accatcaaaa 4080 ggatatgact ccatcctggt cgtggtggac cacggcctta cgaagggggt agtccttgaa 4140 ccttgccata agactattac cgccgaagaa actgcccaac tcttccttaa tcgagtcttt 4200 tcccgattcg gattaccaga caaattcatc tccgatcgcg gaccgcaatt cgcagctcga 4260 gtcttccgcg acctttgcaa gcttctcaag atcgactcag ctctctccac cacctttcac 4320 ccgcaaactg acgggggaac cgaacgagtc aaccaagaaa tcgaagcata cctatccatt 4380 tactgcacca tgaaccccga aacttgggcc gatcacctcc ccatcctcga attcacccac 4440 aattctcgac cccacgctga ccgagccttg tcaccattcg aacttatcat gggaatcaaa 4500 ccacaagctc ttcccgaagc gttcgataga acagactatc cttcgaacga agagcgcatt 4560 aaatccctcg acaagtctcg aacagaagcc cttgccgccc atgaactcgc ccggtctcgt 4620 atgattgaac gatcccgacg acaatggaaa ccgttcagaa agggacaaaa ggtctggctc 4680 gaagcgaaaa acctcaaact cccttaccaa tcaaagaaga tcgctcccaa acgactgggc 4740 ccgttcgaaa tcaccgatga agtcggctcc cgagcgtatc gccttgccct accaacccaa 4800 tggcgcatcc acaacgtttt ccatgccagt cttctctcac ccttcaaaga aaccgacgaa 4860 cacggaactg ccttctccga acccccaccg gacattatca atgacgaaga agaatgggaa 4920 gtagaagcga tcatcgcgca tcgccgacga ggacggggtt accaatatct cacccacttc 4980 aaaggctacc catcaagcga agattgctgg ttaccagagc gaaacttcga aaacgcacag 5040 gaaatactca acgcctacaa 
acaacgacac gacctataaa acctttaccc agactcccca 5100 tccgacagca tgtacaacca caccaactac tacaaacgct tccaagtcta catcggccga 5160 ccagaaactg tccaagaagc ttccgaccgc tacgataaag aactcaaaca ctcccgcgaa 5220 cactattact tctgtcgcgc gtccttctgc atctaccact ttcgctatta atcccctctt 5280 catcccttcc atgacggaac gtccctctaa caattcgctc ctccctccat ctatccgcaa 5340 ataccttgaa gaatcagaag aacaaatcca agccatcaag accgtcaacg atatcaagga 5400 agtcgtcggt gccgacgccg ccctcctcct tgttggagaa cgtacccctg ccatctgtcg 5460 cctatacgcc acccttgtcc aggaacacga acgcatcatc aaagagaagt ttcaccttat 5520 cggacattac caagccctcg acgcatcagt caccaaattc ttcgaagcca cccagaaggt 5580 cctagacgga ctcaacgctc aagaccagat cctcgaacaa tctcgccaac gggttgaagc 5640 agagctcctt cgtgacctcg acgaactcaa gaatgccatt ggacaagccc aactcatttc 5700 tgctttcgat cgtcgacgag tccatctcgc ttcccccgta ccttcttcct ctggcaatca 5760 cggatcagtt tccaccatgg ctcccggact aaggctgtac acgggtgcac ccgcacgggt 5820 acccggagcc gccaagcagg tacccgcagc cccaaaagcg acacctacac ccttgaaggt 5880 aaagcttcgc ggatcacctg gacccacagg tcaggtaccc tcgggtacgg tgggtacctt 5940 cctgtcacag gtatacaagg gtacctccag gtatacccgg aggtacctgc aggcatagat 6000 caacaatgac caacgaaaat taatttcaca accaaagaaa tacagattta tggtatagac 6060 acagtacaaa tcaatcaacc agctcgatga cagtagaagt gtcacctgca tccatgcgaa 6120 ccttcttccg ctggggacga cggcggacct tctccttaaa gatctcgagg acagtgtcaa 6180 gagggaccaa cgactgtgag gcccaagacc ccaatacagt agcagcacgg gtagagctgt 6240 ctaaaagaga atggcggagc ttcgacactg taagcccacc tcgggagaac gcacgctcga 6300 tatcagtaga cgcagctgga atcaaggtca atacttattg accacagatg ggagagacaa 6360 catacccggt atagtgagaa agtccaaagc catacgtgca agtggatgac cagccgcctg 6420 catctgagac cagtacttga gaggatcaat gaccccagga atcggagggg aagccatcca 6480 ttcctcgaga gggtcaagag atgcagaggt caccgaacgc gagtggacca aagtgaaatg 6540 cttgcgcgtt accgcaaagc gctacaacaa gattgtcatt agtgatactc cactaacgac 6600 aatgatcgca aacttacacc atccttatca ttgcccgcag tagccgacga cgatgacacg 6660 ggtgtggagg tcgaagatgc gggggtgata gcagacttgt agtatgttac ccactcatct 6720 cgtgcaatgg ccttggccgc cacaatccac 
tccggttccc attttgctgt catgaaatag 6780 ttcgttttgt aacccggatg gagaactgtg aagatgtcag ccaagaggta actacagact 6840 gagcaaacac ttactcatag caatacgata cacgatgctc ttgtcagtct tcgaataata 6900 cttgttcagc attttgagac cacgtaatgc ggcatggcgg acagcgggat ggcgctcagt 6960 gttatcgatg aattcgtcaa acattgaggt gagcgagtcg ataaagggga tgacttcgtg 7020 gatcagcggg atgctcgact tggagacctc ctttgtcgcg tagagaatgc cctacagtca 7080 agaattaatt cgcatcagtt gaggatggga atcaaaacaa atacatacat cgagaagggg 7140 agcaagctgc tccaagactt tccactcatc gggagacagt cggtactgct tgaggcgagc 7200 ccctcgagaa cggttgtgct ctggcatgag acagaggcgg tccagagcag cacggagctc 7260 caaggcacgg tcgagtaact ccgcagtact gttccatcgg gttggaacat cacggaccat 7320 cttctttgca aagacaccga cgcgctcaca gcaggccttg aggtcttcag agatagtggg 7380 gctatgaaca atcttgatgg cgagcttccg gagctgatga cttgtgtcag tacactgttc 7440 cggccatgca tcaacaagag catacctttg tgatactact ctgcgcaacc tttgcgtcgg 7500 cagcagagag aggatcaagg cgcgagacaa aatctgcctc aatctccctc acaaggtccg 7560 aaacctccct ctggtcactc tcaatgaggc cctcgggtac ttcgaactcg ccgtcatcgt 7620 cgtcatcatc gtcactgaca tcgtcgtcgt catcaccatc ttggtcctca cttgggggta 7680 cctcgacatc agggtgtgcg gtggactgac tgtcttcttc agctgggttg tcaacctcag 7740 gatcaacatc gtcgttgggg ccacgcttcg ccttcgcaaa gggtgacatg acagactgag 7800 ttgacaaagt tagcatagcg taaaagtgaa gagttctaac agaccttgac aaccaggtta 7860 aggatgtggg caaagcaccg cacctggacc tcaggaccac ggaagctcgg catgagctcc 7920 ttcagttctt caagcattgt atcgttgttc gaagcgttgt caccggtgaa cgcgaggatc 7980 tgcggacgtg agttgtggtg gatcatatca atgagatatg acagtacata ccttatcggc 8040 aatcccaaac tcatgcagac attccatcaa cttggacgca agatattgtc cagtgtgagc 8100 cttcttcagc ttgatgaagt cgagaagggt cgcgttaatc gaggcatcct tcaagcgatg 8160 gaccgtgaca ccaaggaacg agatgacatt cggactggtc catccatcag cggcaagatt 8220 aagcctgccc gggtaatcct acggtgagtg tcagcattca tagatggggg agaatcgttg 8280 ccagcgcacc tgaagactct tcttgagtgc ctcgcgtgta atggtaaaga tctcccggac 8340 gtctcgggat accatgttac gtgaaggcgt ttcgacgagg gtgttgagct ccttgaagat 8400 ctgcaaaagt tcaggatctt caacaatgga gaacggccga 
cgttgctttg caacccaaat 8460 ggcgaggagg acccgatgcc ggagcttgtt gaacttgtgg cccgctgtga aggtggccat 8520 tgcctgcacc tcagaggagt ctgctgggac acatttgtcg acgtggcgct tgaggttacc 8580 cgtgctctcg tcgtggcgag cacgagtgat ggagacactg ggatgactga taggtagcca 8640 tgagttagtc aatgacaggt gctcagtgtc aaaaaacaga ctcacgcctt gcaggtaaaa 8700 atgtaccaga ccttgtcacc ctcacgtttg atggtcggcg agtcgaaatg gtcgtagcat 8760 cttgaagtcc aagcgtttcg ctgcgccgct aaaatcaatg gtcaacacgc ggtgccccct 8820 cagacaatgg tgagaaacat accaaggacc tcttcgtcag tgcgagtctc agtcttgtac 8880 ctggctgcga accgagacaa gcgtgccgca tccttcgaag ctgtggaagg gtttaccggc 8940 gcggtcgtgg gtgttgatga tgcagatgcc atatcctggg tgggcggggg ttcagagtca 9000 gagccagact ggctggcatc ggagacagac tcgtcgctgt caccaacagc gccaaggtca 9060 ggcataatgc cagcatccac agcgtctcgc gctcgcttgg taagccgtcg tcgtcgtttg 9120 gggcgagtgt tggtcatggg agagaggagg gcgaatagtt gggcgctagg cgtcgggctt 9180 tggagccgtg aatggtgggt taaagaatct ccagaatttt cgagtcctgc tgtcgtcgaa 9240 gacacctctg agcaacgtga taaaggaaca aaataggagg cactcaccct cagccaatcg 9300 tcgacgaatt cgacgataaa gagatagccg tagccgacca aaagacggaa accgtcacgc 9360 agttgaaccg gtcgcagcat tggtggtggt ctcgactctc atcgatcgat gacaggaggc 9420 gcaatgcaaa gagcagggag ctggggagag gggaactgtg gaatcgttga gcaagcatga 9480 cttgttgata aagggctgcc acgtcgccag aatgaaggag actcgttgtg ggcgctgttt 9540 tgtgtaaacg acgcccatag acagtgccca cagcggcatt ctgcgagcgg aatgacgtct 9600 gacgtgaatg cttggcatct aattcacgtc accaacgggt atcgtcggcc actgactttt 9660 gccggcacct ggatgttcaa tgagccgggt atggctgcaa ggcgcgataa gggggtttgg 9720 gatgcgcgtt tggggggaat gataagagcc agcacccagt gaggcagtcc gtcaatgtgc 9780 acaagcgccc aaatgcaaga caacctgcca ttgacggcac catccgattc gggcaatccc 9840 acggctgcca accggcctaa ttcggattta tcgacgagaa ttccttcggt tgtcatcgac 9900 aatgctgccg agcctgtgta agtcgaccat cctttctgtt ttccagatgt gtgcttagta 9960 cactcatgtc attccgaaat gtagtgagcg catgttctca aacagtgtcc ccgccacgcg 10020 ctcgcgatgt caaagccacc aagagctttt gcaattagtg agtgtctaaa tgccatcaca 10080 gcctgagcat tgacttatct ccgaatgtag aaagcgttca 
ttttgtcgtt ccatgatgat 10140 gtttcggagc ccgagaataa atcttagtag ttagagtgta tgcacggtta aaattaacat 10200 tttttctgcg cgggtacctg caggtatgac ctgcatacat gaaagccaca cctgcacctg 10260 cgcgggtttt ttggtgagga tcacccgcac ctgcgggtcc gtttacctgc aggtttcttg 10320 gcggctccag gtccaggtac ccgggtgcaa cccgcacccg cgtacagcct tatcccggac 10380 cggtcgccag aagcaacgtt ccttctgagc ccgacggctt ggttcgacct ttgtcgaaga 10440 aagaactacg caagagggat ggtcataggt gcaacaaatg tggcgaaacc ggacactggc 10500 aatgcgacca ttactcctac aaatgcaccc tatgcggcaa gaacgctccc ggacacaact 10560 ctggttccag gtggtgtccc gattaccgac caacgccccg caacacttca cccgatcagt 10620 atgacgcact cttggacgac ggagactatg acgactactt atggggagac gaaggagaac 10680 acaatctgaa cacctgatgc ggagcatcag gcatacgaag ggggtac 10727 // ID LTR-5_AN repbase; DNA; FNG; 282 BP. XX AC . XX DT 03-JUL-2007 (Rel. 12.06, Created) DT 13-JUL-2007 (Rel. 12.06, Last updated, Version 1) XX DE Solo long terminal repeat of LTR-retrotransposon. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; LTR; KW LTR-retrotransposon; LTR-5_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-282 RA Galagan J.E. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-282 RA Clutterbuck A.J., Kapitonov V.V. and Jurka J.; RT "Transposable Elements and Repeat-Induced Point Mutation in RT Aspergillus nidulans, A. fumigatus and A. oryzae."; RL Chapter in "The Aspergilli: genomics, medical applications, RL biotechnology, and research methods" Edited by GH Goldman and SA RL Osmani. Publication expected 2007.. XX RN [3] RP 1-282 RA Clutterbuck A.J.; RT "LTR-5_AN."; RL Direct Submission to Repbase Update (03-JUL-2007). 
XX DR [3] (Consensus) XX CC 5-bp TSD, 14 full length copies in the Broad genome sequence, CC with 82-98% identity to the consensus, plus 4 fragments. XX SQ Sequence 282 BP; 76 A; 67 C; 72 G; 67 T; 0 other; tgtcacgagc tagagtcgtt gatactaaca ggtagtatta gactagaaga attcgctaga 60 gaattaaaac acagtgaccg tcgagcttcc tatatgtgtc acggaccgac gttggtgcca 120 aacttgattc gtcattccgg aagtcagcaa caatcatacg agaactggcc tgtacagggc 180 ttatcccgga ccagagattc aatcggcagt tggttcctag tggaccggct cgtcgcaatt 240 cgggcaatgc tcgaatcaag acttcgggcg tagctcgtca ca 282 // ID Copia-40_MLP-I repbase; DNA; FNG; 4751 BP. XX AC . XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-40_MLP_; KW Copia-40_MLP-LTR; Copia-40_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4751 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR [1] (Consensus) XX CC Positions [1848-2345] - Integrase core CC LTRs are 94% similar to each other. CC The original sequence (Accession No AECX01001591; positions CC 48279 54167), included an EnSpm-like sequence near the 3' end CC (deleted). Due to the reconstruction it is listed here as CC "consensus.". 
XX FH Key Location/Qualifiers FT CDS join(1302..1844,1848..3845) FT /product="Copia-40_MLP-I_1p" FT /translation="MTPHKDDLFSFEKTNKSVQLADSSSIPATHSGLTVLS FT LNDGNHQHQALLVPDLQEPLLSVSSLCDDGFVVIFKKESCKFYHLDQVNFT FT EAPIGNGYRKGNLYYLPSKVASSNSNSTSLKETNLSLLDWHNKLGHIGLKP FT LKTLLKAHNVAPRVMNQIDVQQCRVCVESKMARRSFKSRAGYASTVGELIH FT SDVCSFEVPSREGFKYFVTFMDDYSKFTIIYPMKSKRDTFSCFKSFRFKFE FT LRLLSPIKKLRSDNGGEYVSKEFASFLSKAGIEHNPGPPHSPELNGVSERM FT NRTICNHIRCSLSSSGLPKSFWVDALRYLAHSLNSIPCYTPACFLSPCSLV FT SIPHVSLSKSHPFGCEVYYKIPEANRRKLDPKGSRATFLSYLSDGNGYNVW FT DATKSKLVKTCDVIFNDTIFPSLSKSSTPSPQSAPAEIPWPVCEKPLPSLK FT HRRLSVSIHNPARRPITPSMFDNAPSTPVSLFPVTKPLPFLSPLSDLSTSE FT FPPLPSNPSTSPPAVTPPPSRPSPTVSVVPSRRQSTQPRRNPPKTKVPCVV FT NSPVSDTNLQPIIPEASLSPPAQPPAPQSSPVLPPRRSARNRTQPDRLGNL FT VKSASSSDQQVDDTPKTWKQLLRSPNKDQWLKAADDEFSSLIGMGTWKLVP FT RPQKQKIIRLKWVFKIKRRVENTVLKLKARLVAMGYSQIEGIDYDEVFAPT FT TRLETLRLVLSLMASKKWKGRQVDIKTAFLNGHLDEPVYMSQPPGYEDPLH FT PDWVCEVTQSIYGLKQSPRQWNRELHGVLIALGLTQSKHDPTLYFRLEKGK FT LAGLITVHVDDLSVIGPDSFVSSFISNLSSHFQISSNTELHHFLS" XX SQ Sequence 4751 BP; 1261 A; 1250 C; 834 G; 1406 T; 0 other; ggtaggcaat agtttcatct ttcactcgtt atcttcatta tttgcaataa caaagatcta 60 aacctttcct tgcttcttct cttgctgttc cagcttgctt caaacctaag aggttatgag 120 cccgtcgtta cctcctgcag tacagatctg aaactctgcg aatacattct aaatctataa 180 tcaactacat agtaacttta atcttcagtc atactcctct tctctctatt attctctatt 240 cttcgatttg gaaaggtccg aagaattttt tttatatcat gtctgtctcc acggatgact 300 caaaagaaat ccgagtcacc ggtataacct tcctcaagcc accaggcgac aaatccaact 360 acctcgactg ggaattcaag gttgaactgg cccttgaggc agttaaactc tcctatgttt 420 tatcaccgat tgctgtcaaa gatcactcca acaactggag tgatgacaat acgaaagcct 480 gtgctctcat ttctagagct gtcgaggatg gaaatctgaa attcatcaaa cctcatcgac 540 gtgacactgc cggtatgtgg tccgccctgc gactggctca cgaagattca accttgggag 600 gtcaaatgca tctcttgcat agactaatca ccacttgtat ggaaggtgat gatgtggaaa 660 ctcacatcaa tgctctccac cgcacttttg agaaacttga ttctctcatt accgatgctg 720 ctccccttac cgtcgatgaa atcttctcga cttctatcat gacctctctt ccatccgact 780 
ggcttcctgt tatcacacct ttaatgcaac aaactacagt ggttaattcg gccactgtta 840 ttcgcgctat ttgaaatgga gcgacacgaa gaaagacttc gtccaatctc ctgtcacctt 900 ctgatgttgt cgctgatcga gcaaatgcga ctccacgttc atcttctaca gctcctccat 960 cttcatcgaa gcctcgtaac accaaatttt gtacccactg tgaaagatca aatcatgagg 1020 tacaaacctg ctggatccgt caagcggaac gagaaaatcg atcctcttct tcctcacgag 1080 gtggatccag ctcaaaccga ggaggacgcc actcttcatc gaatcgatca tctcgtcctc 1140 ctaccaaagc cggaagaacc tctgtagtaa ctctagactt cagtgacgat gagtcttatg 1200 tagtcgatga tgagatcatt cactctcatt ctgctaaagt catcaatgtg aactcggcat 1260 ctccgactga ctgcaacatc aactcgggtt gttctcttac gatgactcca cacaaagacg 1320 acttgttcag ttttgaaaag accaacaaat ctgttcaact tgcggattca tcatcaatcc 1380 cagccactca ctcgggcttg actgttttat ctctcaatga tggaaatcat caacatcaag 1440 ctctattagt gcctgatctt caagaacctt tattatcagt ttcatcttta tgtgacgatg 1500 gatttgttgt catcttcaag aaagaatcat gcaagttcta tcacttggat caagtcaatt 1560 tcactgaagc tcctatcgga aatggctaca gaaaaggaaa cctttactac cttccttcca 1620 aggtagcttc ttctaactca aactcgactt cactcaaaga aactaattta tctcttctcg 1680 attggcacaa caaacttggt catataggtc tgaagccctt aaagacactc ctcaaagctc 1740 acaacgtcgc tccacgagtt atgaatcaaa ttgatgtcca gcaatgtaga gtatgtgttg 1800 agtcaaagat ggctaggcgt tcttttaagt caagagctgg ttattgagcg tctacagttg 1860 gggaactgat tcactcagac gtttgtagtt ttgaagttcc ttctcgagaa gggtttaaat 1920 attttgtaac ctttatggat gattactcta aattcactat tatctatcct atgaaatcaa 1980 agagagatac attttcttgt tttaaatctt ttcgtttcaa gtttgaactt cgcctgttgt 2040 ctccaatcaa gaaactcaga tcagataacg gaggagagta tgtctcaaaa gagtttgcgt 2100 ctttcctatc caaagctggt atagaacaca atccaggacc tccacactcc cctgaactca 2160 acggtgtatc tgaacgtatg aacagaacca tctgcaatca cataaggtgc tcattatcca 2220 gctcaggact tcctaaatct ttctgggtag atgcccttag atacctcgct cattctctca 2280 attcaatccc ttgttatacc cctgcgtgtt ttctctcgcc atgctcttta gtatctattc 2340 cccatgtcag cttatccaag tctcatccct ttggctgcga ggtctactac aaaatccctg 2400 aggctaaccg taggaagtta gatcctaaag gaagtagagc cacgtttctt tcctacttat 2460 cagacggaaa 
tggatataat gtgtgggatg ctactaaatc aaagctagtt aaaacatgtg 2520 atgtcatttt taatgacaca atttttcctt ctctctctaa atcgtcaact ccctctcctc 2580 agtctgctcc ggcagagata ccatggcctg tttgtgaaaa accgctcccc tctcttaaac 2640 accgacgtct atctgtttct attcacaacc ctgccaggag acctatcacc ccctctatgt 2700 ttgataatgc accctctacc ccagtttctc tttttcctgt taccaaacca ttgccttttc 2760 tctcacctct ttctgattta tctacgtctg aatttccacc gctaccatca aacccatcta 2820 cttctccacc ggcagtcact cctccacctt ctcgcccgtc acctactgtc agcgttgtac 2880 caagcagaag acaatcaact caacctcgaa gaaatcctcc gaaaacgaaa gtaccttgtg 2940 ttgtcaactc tccagtgtct gacacaaact tgcagccgat cattcctgaa gcttctctat 3000 cgcctcctgc tcagccacct gcacctcaat cttctccagt cctgcctcct cgacgctctg 3060 cgaggaaccg aactcaacca gatcgacttg ggaacttggt taaatcggca tcatcttccg 3120 accaacaggt ggatgacacg ccgaaaacgt ggaaacaact cctacgctca cctaataagg 3180 atcaatggct aaaagctgcc gacgatgagt tctcttcgct catagggatg ggtacctgga 3240 agttagttcc cagaccacaa aaacaaaaaa tcatccgttt gaaatgggtg tttaagatca 3300 aaagacgtgt tgaaaatact gtgctgaagc tcaaggcccg tctcgtcgcg atgggctatt 3360 ctcaaatcga gggtatcgac tatgatgaag tctttgcacc aaccaccaga ctcgaaactc 3420 ttcgtttggt tctctcgctg atggcttcca agaaatggaa aggccgtcag gtggatatca 3480 agacagcatt tctcaacggc cacctggacg aacctgtcta catgagccag cctccaggtt 3540 atgaagaccc tctccaccct gactgggtgt gtgaagtaac acaatccatc tacggtctca 3600 aacaatcacc gcgtcaatgg aaccgagaac ttcatggagt cctcattgct cttggtctca 3660 ctcaatctaa acatgacccg actctatatt ttcgacttga gaaaggcaaa cttgctggtc 3720 tcataacagt tcatgtagat gatctctctg taattggtcc tgactctttt gtttcttctt 3780 ttatttctaa tctctcttct catttccaaa tttcatccaa tactgaactt catcattttc 3840 tctcttaata tttgtcgtga tcttactgaa aatctagtgt acatcaatca agctcattat 3900 atcaaagagc ttggtgataa atttctttct tcttcttcta ttcattcctc tgttgtcact 3960 cctaccgatg cgttcttcaa agatttcctc cctcgtcaaa cagatgagca accttcctct 4020 ggacactacc aaagtctcat cggcgctctg ttatgggttg cacaatgcac tcgtgctgac 4080 atatcatttg cagttagccg cttgtcacaa tttcttcgag acccctcaca agcacactgg 4140 ctcgctggtt tacgtgtcct 
caaatacctc gtatcaactt ctcatctttt tctaacactc 4200 ggaggagacc cgttgatctc aggatattcc gactcagact gggctgaaga ttgtcatgac 4260 caaaaatcaa ccactggata ttcctaccga tggggaagcg gtcctatctc gtggagatct 4320 tgaaagcaag caacagtctc gctgtctagt acagaggcag agtacaaagc tctctccaac 4380 tcgtgccgtg aagaactttg gcttcgtaac ctcctttccg aacttcatct ccgaccacct 4440 aacacaatac cactgcaagt caacaatgaa ggagccgagg cactagccaa gaatccccaa 4500 catcactctt gcacaaagca cattcacaca cgttatcact ttgtgagaga atgtgtggcc 4560 aacaactcaa tctctatttg acatgtttcc accaaagaca tgctagcgga tatgctcacc 4620 aaaccactag gccgtgttct tctacaatct catagagagc gttttggcat agtgtaattt 4680 ttttttattt ttcctttttt cttctacaat tttttttctt ctctctgaga tgtgttgcac 4740 gcaagggggg g 4751 // ID Gypsy-13_LBS-LTR repbase; DNA; FNG; 540 BP. XX AC ABFE01000616; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_LBS_; KW Gypsy-13_LBS-I; Gypsy-13_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-540 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000616; Positions 198156 198695. 
XX SQ Sequence 540 BP; 134 A; 146 C; 101 G; 159 T; 0 other; tgtgttattc cccattcaac ggcgtcatat tcttcataca tcacttccag agagtcacct 60 tcagtcacgg ttagtcacat agtcacactc ggtatagtca ctccataagc acatgtaaaa 120 acttacttct tactactatc aactcttttg ccaacttgac gtttgagcat cgtactcacg 180 actcatctca agtcctgtta gaagaaacat cttaggaaga caaaggtcag ttctattcag 240 aataacttgg gagctcgtgt ctcccgttag cgttaacaac tctcttcgtt ttccccaacg 300 atagagattt ggttatactt cctacggctt tccgaggaag cagaactgtc tatgctcgtg 360 gctatccagg gcatagtaga cgttcgcgta accaggtctt ctttccgttc gttagatatt 420 agaccgtatt ctagggtctt cacacctaac cgctctacgc gagtcattat gctcgtcagg 480 actcgattgt cgctgtctca accccacaac tcggcttaac ccctcgaaca gggtagcgca 540 // ID Gypsy-104_MLP-I repbase; DNA; FNG; 8539 BP. XX AC AECX01000554; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-104_MLP_; KW Gypsy-104_MLP-LTR; Gypsy-104_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-8539 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000554; Positions 50386 41848. XX CC Positions [6476-6988] - Integrase core CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 2063..3658 FT /product="Gypsy-104_MLP-I_1p" FT /translation="MIEIHSTHLSEMMCRLGEKLDQQAHQIALLHMNVTRT FT LSKMNEKADREDSMQQETLGNNPFADYGSVRNNIQHEDDNVMSQPRTNKPT FT AEERETEVREERSATLEAEERKALQKGQLQAKEWPKFSGEGEYNHLEWLEW FT IDNTQEDTCLPDALIICKLGVVMTHSAQSWYTVKRRNCKNKSWQEWKKMIM FT DHFGTPTWKRKMAMAFDRDTFKWENRDKPTAWLLLQRRRMDAAWPFLTTRE FT QIDKILGLCNGDIEHAVQSRIRDHSDFESFMIIFEEVITDTSIGKNLLRSK FT EYRNNHFSSKEKTLSSREYKTTEHPKPQESGKYRDYKQNRPMGSGPAKVSF FT KDDNRINRFEKKSINAVENEEDDGQSGEERTERSEEQEEESESSDDDDLGL FT CIGNIDMTNTHQSYEDLQPVLITTEEEAPVQAPTLSITELAENLRVAEQTT FT NHISPPPFIERLIFNNSESRTFIAISMSGVVSNMLLNTGIEPSVASTKVLK FT KYWPTWQADTKTMDENNTYANDSEVLGEIGRNISTD" FT CDS 3693..5615 FT /product="Gypsy-104_MLP-I_4p" FT /translation="MVKFIVLINDDLDHLILGSRDWREFSFTLYLGGRTQC FT KIRTEDGTFRFPVIPQKIWDEPRRRRAQSHIATITVESEEGVDIAELQSQA FT SIPQQWEDNAKRSLQVSDARLLMSKPEKGRAHTIGHHCITSALLSGGQKVE FT VLLDSGAACSIVGTDYLNRFIKNWEDLMLPPSNMTFKGCSNRLRALRVIEL FT PMIFPHHLGPVRIQPEFVIMDNATSDYFILGGEFLRLYGFDIVHSKEKYFT FT IGNENKKKKFLLGSVREQRIATISAKVDEVFEKAFDECNISPRLKSTELHS FT LKAVARKYKQAFAYGENPIGTIKGHKLKLRLNIDKPYPPILKKQAYPASPR FT SREEIEKHIGDLVKHGILRKVGADEEVEVTTPVIIACHNGKSRMCGDSRAL FT NTFTVPDRYPMPRISHCLTNLGKALYITTMDVMKGFHQNEVDDLAKYLLRI FT ISHMGIHEYQKMPFGIKGAPAHFQRMMDTEFARELSEGWLIIYIDDIIVFS FT KTWEEHLTKLEQVFNKLVKMGMTISLQKTNFAFEELKALGHVVSGLWIAVD FT QNKVAAVLKKPIPQSVKEVQQFLGFANYYRSHLEGFAKASGPLYKLLQKGV FT TFEMTADRVASWNILKKKLTEAPFLLHADYCLLKYMWMRASTV" FT CDS 5519..7618 FT /product="Gypsy-104_MLP-I_2p" FT /translation="MEYLEEETNRSSLFAPRRLLPFKVYVDASFDGLGATL FT QQIQIVDGKPFEGLISCISQKLKQSELNYGATQLENLCLVWALEKFHYYLD FT GAFFEVITDCTALKSLLNMKTPNRHMLRWQIAIQEYRSCMTITHREGRKHE FT NADALSRMALENDIENPAWDPDDISRDLPIMGITITDLSQEFYDKVMNDYN FT EEPNTVKLLTIMNQGCKALELSSSLDKEWKKSFDEGRFTLLSGILYHRERH FT SCAMVITSQRLKTDILKVCHDDVMSGHLSCERTMDRIKQIAWWPGWRTLTE FT EYVRSCDRCQKANRATGKRLGLLQHIEEPKIPWEVINMDFVTGLPPAGHDN FT VDCVLVVVDRFSKRCRFLACHKSATAMDIAMLFWERIVSDSGLPRVIISDR FT DPKFTSEFWKGLFSLAGTSLAMSTAYHPQTDGLAERMIESLSDLIRRYCAF FT 
GLEFKDNNGYTHDWKSLLPALEIAYNTSTHSTTGKSPFEVERGFNIRTPSK FT MIKPKDSTFHPTAMDFHNMLEKARKHAGDCILAAKEYNKQRWDKTHKEPTF FT KVGDLVLISTVNFTNLQGPRKMKDSFIGPYVIKALHGTNTVEVILTGKVAR FT KHPTFPISLLKKYEKSDEEKFPNRTEIEQRDPPMEEEPIGAIKKIISDKYV FT KINGRSTRLYLARFKGKDADKDKWLEAQDIPDSDRLLRKFRVERRENNTQA FT " XX SQ Sequence 8539 BP; 3006 A; 1721 C; 1846 G; 1966 T; 0 other; tcgagcgtta ataaccagag actaacaaaa gcaaaaacgt gtttttagtt tgaaatttcc 60 tttttctttt actttcttag tcagaaaaac aaataaaaac ctacaaaaaa tataccaaat 120 tttcttcttt tctttttttt tactcatccg gaaaatttct ttccagttga tagatcagtg 180 tttcttggat ctatttagag aactacctta cttttctttc gttcacgaaa accctttcaa 240 cgacgagatc tataatccct gctttggctt tcacgttcta agtcctagat tgttgtacat 300 ccttcaaaat ttcggaagca acgaatccag caatagagga gcttctagca atcaataaaa 360 ccttaggaaa tcttatctac acattcgtgc gaattaagat actcaaagaa ccaccaccag 420 caaatctata cagagacctg tggagactag gtaaaaagct aaaaatttta gctataaaac 480 tagacgccgg agagttagta gaacttatat tttttctctc atcctagaca tctggaaaga 540 agtttagcgt accaataaca ccatccttga tcacccagtg accaagataa cccagcccaa 600 tccggttcca acgaagggac atcaggcagt agacaaaatc ccatcagcct cgccagtagt 660 ccagatcccg agcaaccagc cagcccagag tcgtacgacg ggtacgagta agctagcttt 720 catttttttt tcttttgtaa attcgaaaga aaaacaataa gctgagatca atactactta 780 tagcccactt gacgacgaat acctcgtcta taggtcaaac tggtaaataa cacaccttat 840 atttcctgtg tttgttttcc ttttcactaa cattgacaat gccacctaga aaatccgatc 900 gagtcagaac aatcgagagt cagtccggca gacccaacta tagtgatacc acccgaagag 960 caagctcagt cagctcagtc ccagggcgac gcaatgcagg aggacctacc ggcagagcag 1020 ttaccaccga cggtccccca agagcaacag cgactactga tctccctcca cgagcacaat 1080 ttcaagaatt ggaaatcagc ccaagactcc aaccccaaca atacccgtca gatcgagatc 1140 tttctcggat tgtggaagat gtcgcacgac gccctgacga cgatcctagg gaaggaggaa 1200 gtccagaaag ggagcttagg ctgggatccg tatgtcaaag agaagaagat gttagcgaaa 1260 acgagcaacg ggtttatagg actgaaccag aaccctcacg gatcaaccaa ccccaacaga 1320 ccttatcaat ccccagatca gatcgagact caagagccgg ctcaatcgac ttcatcagca 1380 ggcccaatga ggaacgacaa ccacagacag 
accaaggaac cctacaagag accaccgcaa 1440 gcggaggaag acgcccagaa catgcactcc atgctagcat tcagcaaggc gtactatcga 1500 gcggtgaaag ggatcaacaa gtccaaaggg aaaggacatc agaagaaccc gaaagaccaa 1560 tcttctcaca agtgaaaagt atttttacgt ctaccccaag tccgtttcta tcctctaaac 1620 gtatcttatg tgatcagaga ccagttatag atcatcatgt aactagtatt cctatgagac 1680 aagtagaaga tataagagtt gttcaaccaa ctacgggttg taaccacata ggttaccaga 1740 ctgaaaacta tgattgtaat aatattgaac cgtttgagag ttcctcattg accagagagc 1800 aaagtgagcc agtgaaaaat acactgaaga atgagcctac acctgagatg atgagcgaac 1860 gttgggcaat tgacaataat aaagtaacaa gtaatactga tattcgaaaa tatcagctcc 1920 ataagacaac tcataaatca gagacaaatg cagtaaagca aaacgaacac gagaagattt 1980 attgcgaaaa attccagcag gaaaatgcgg agaatgcgat tgtgagagaa tcatcgaggc 2040 gcaaggacga gcatgcaaag agatgataga aattcatagt acacatctta gcgaaatgat 2100 gtgcaggttg ggtgaaaagt tggaccaaca agcacaccaa atagccttgc tccatatgaa 2160 tgtgacgcga acactatcaa agatgaacga aaaagccgat agagaagata gtatgcaaca 2220 agagacgtta gggaataacc ctttcgcgga ttacggatcg gttcgaaata atatacaaca 2280 cgaagatgat aacgtcatga gccaaccaag aacaaataaa ccaacggctg aggaacgaga 2340 gacagaagtt agagaagagc gatctgccac tctagaagcc gaagaaagaa aagcactgca 2400 gaaaggacaa cttcaagcca aagagtggcc gaaatttagc ggtgaaggag aatataacca 2460 cctggaatgg ttagagtgga tagataacac acaggaagat acgtgccttc cggatgcgtt 2520 gatcatctgt aagttaggag tcgtaatgac gcattcagca caaagttggt atacagtgaa 2580 gagaagaaac tgcaagaaca aatcttggca ggaatggaag aagatgatca tggatcactt 2640 cggtacgccc acttggaaac ggaaaatggc tatggcattc gatcgagaca ccttcaagtg 2700 ggaaaataga gacaaaccga ctgcatggct ccttctacaa cgtcgaagga tggacgccgc 2760 ttggccgttt ctgacaacca gagaacagat tgacaaaatt ctaggattgt gtaacggcga 2820 tatagaacat gctgttcaat cgcgaattag agatcactca gacttcgaaa gttttatgat 2880 tatctttgaa gaagtaataa cggatacttc gatagggaag aacctcctaa ggtcgaaaga 2940 gtatcgtaat aatcactttt ctagcaagga aaagacacta agctctagag agtataagac 3000 aaccgaacat ccgaaaccac aagagtccgg gaaatataga gattataaac agaatagacc 3060 tatgggaagc ggcccagcga aagtatcgtt taaagacgac 
aatcgaatca atagattcga 3120 gaaaaaatcg attaatgcgg tagaaaacga agaggatgac ggtcagtctg gcgaggaacg 3180 gaccgagaga tctgaagaac aagaagaaga atccgaatct tcagatgatg atgaccttgg 3240 actatgtata gggaacattg atatgacgaa tacgcatcaa agctacgaag acttacaacc 3300 agtattgatc accacagagg aggaagcacc agtacaggct ccaaccttat cgataaccga 3360 gctagctgaa aacttgagag tagctgagca gacgacaaat catattagtc caccaccatt 3420 tatagaacgt ttaattttca acaattcaga aagcagaact tttatcgcaa ttagcatgtc 3480 gggcgttgtt agtaatatgc tcttgaatac aggaattgag ccatcagtag cgtcgacaaa 3540 agtactaaaa aagtactggc caacgtggca agcggataca aaaacgatgg acgagaataa 3600 cacttacgcc aacgattcag aagtattagg agaaataggg agaaatatct ctaccgatta 3660 aattggaaca cacgatcaaa ccttgcttcc tgatggttaa atttatcgtg ctgatcaacg 3720 atgatttaga tcatctgata ctaggcagta gagactggcg agaattcagc ttcaccctat 3780 acctaggtgg cagaacgcag tgcaagataa ggacagaaga tggaactttt agatttccag 3840 tgatacccca aaagatatgg gacgaaccac gaagaagacg agctcagagt catattgcga 3900 cgatcactgt ggaatcggaa gaaggtgtgg atattgcaga attacaatct caagcatcaa 3960 taccccaaca atgggaggat aatgcgaagc gttctttgca agtatcggac gcaagactac 4020 taatgtctaa accagagaag ggcagagcgc atacgatagg gcaccactgc atcacttcag 4080 ctttgctaag tggaggtcag aaagtggaag tgctgctcga cagtggagca gcgtgctcga 4140 tagtcggaac agattatctg aacagattca tcaagaactg ggaagatctg atgctgccac 4200 caagtaatat gacttttaaa ggatgtagca acagactacg agcactgcga gttatagaac 4260 ttcctatgat atttcctcac catctgggac cagtaagaat acaaccggaa tttgtaataa 4320 tggacaatgc tacatcagat tattttatac taggaggaga atttcttcgt ttgtacggat 4380 ttgacatagt tcatagtaaa gagaaatatt tcactatagg aaacgaaaat aaaaagaaaa 4440 aattccttct aggttcagta agagaacaga gaatagcaac aatatccgcg aaggtagatg 4500 aagtctttga gaaagccttc gatgaatgta acatttcccc tagactgaaa tctacagaac 4560 tgcattcact gaaagcagta gcgagaaaat ataaacaagc ctttgcgtat ggcgaaaacc 4620 ctatagggac aattaaaggt cacaaactta agttgagact gaatattgat aaaccatacc 4680 ctccgatatt gaaaaagcag gcgtacccag ctagcccacg tagccgcgaa gagattgaga 4740 aacacatagg cgatctagtt aagcacggca tcctacgaaa agttggagca 
gacgaagaag 4800 tagaagtaac aactccagtc atcatagctt gtcataacgg gaaatcccgg atgtgcggag 4860 attctcgcgc acttaacacc tttacagtgc ctgacaggta cccgatgcca aggatcagtc 4920 attgcctgac gaaccttggt aaagccttat acattacgac tatggacgtg atgaaaggat 4980 tccatcagaa cgaagttgat gatctggcca aatatttgct aagaatcata tcgcacatgg 5040 ggatccatga atatcaaaaa atgccatttg gtattaaggg cgccccagcg cactttcaac 5100 gtatgatgga tacggaattt gcgagggaac taagtgaggg atggcttatc atctacatcg 5160 atgatataat cgtgttctca aaaacttggg aagaacattt gacaaaattg gaacaagttt 5220 tcaataaact tgtaaagatg ggtatgacaa tatccttaca aaaaactaac ttcgccttcg 5280 aagaactgaa agcgctagga cacgtagttt caggattatg gatagctgta gatcagaaca 5340 aagtagcggc ggtgctcaag aagccaattc cacagtcagt caaagaagtt cagcaatttc 5400 taggctttgc gaattattac aggtcccact tggaaggatt tgccaaggcc agcgggccac 5460 tatacaaact actccagaag ggagtaacat ttgaaatgac agctgataga gtggcgtcat 5520 ggaatatctt gaagaagaaa ctaacagaag ctcccttttt gctccacgca gattactgcc 5580 ttttaaagta tatgtggatg cgagcttcga cggtctagga gccacattgc aacagatcca 5640 gattgtggac gggaaaccct ttgaagggct aattagttgt atctcacaaa aattgaaaca 5700 atctgaattg aattatggag ccactcagct cgagaatcta tgtctggtat gggctctgga 5760 gaaatttcat tattatctcg acggagcatt cttcgaggta ataacagatt gtacagcact 5820 gaagtctctt ctgaatatga agacgcctaa ccggcatatg ctgcgctggc aaatagccat 5880 acaagagtat cgctcatgta tgacaataac ccatagagag ggacgtaaac atgaaaatgc 5940 ggatgccttg agtagaatgg cgttagagaa tgacattgag aacccagcat gggacccaga 6000 tgatattagc agagacctgc cgattatggg catcactata acggacttat cgcaagaatt 6060 ttatgataaa gtcatgaacg actataatga agaaccaaat acagtaaaat tgttgactat 6120 catgaatcag ggttgcaaag cattagaact ttcatcctct ttggataaag aatggaagaa 6180 gtcatttgac gaaggtcgat tcacattact cagtgggatc ctgtaccaca gagaaagaca 6240 ctcatgcgca atggtcataa cttcgcaaag gctgaagacg gatatactta aagtatgtca 6300 tgatgacgtc atgtcaggtc acttatcttg tgaaaggact atggatagaa tcaaacaaat 6360 agcctggtgg ccaggttgga gaacactaac agaggaatac gtgagatcct gtgatagatg 6420 tcagaaagcc aatcgagcta cagggaaaag gctaggcctt cttcaacata ttgaagaacc 
6480 caaaatacca tgggaagtca tcaacatgga tttcgtgaca ggacttccgc cagccggaca 6540 tgacaacgtg gactgcgtat tagtcgtagt tgacaggttt tcgaaacgtt gcagatttct 6600 ggcatgccac aaatcagcaa ccgccatgga catagcgatg ctgttctggg aacgaatagt 6660 ctcagattcg ggattaccaa gggtgatcat tagtgataga gaccctaagt ttacttccga 6720 attctggaag ggtttattca gccttgcagg aacatcgtta gcaatgtcga ctgcgtatca 6780 cccgcagaca gatggactag cggaacgcat gatcgaaagc ctcagtgacc ttatcaggag 6840 gtattgtgcg tttggacttg agttcaagga caataacggc tatacgcacg actggaaaag 6900 cttactgcca gccttagaga tagcatacaa taccagtact catagcacca cagggaaatc 6960 accatttgag gtagaaaggg gatttaatat taggactccg tcaaagatga tcaaaccgaa 7020 ggattcaacc ttccacccga cagcaatgga ttttcacaat atgctggaaa aagcgaggaa 7080 acatgcaggt gactgtattc tggctgcgaa agaatataat aagcaacgat gggacaagac 7140 ccacaaagaa ccgactttca aagtaggcga cttagtacta atatccacag tcaattttac 7200 caacttacaa ggaccacgaa aaatgaagga ctcattcata ggtccctacg tgataaaagc 7260 actacatggg accaacacgg tagaagttat cctgacaggc aaagtggcaa gaaagcatcc 7320 gacctttcca atttcattac tgaagaagta cgaaaagtca gacgaggaga aattccctaa 7380 tagaacagaa atagaacaaa gagatccacc tatggaagaa gaacccatag gggctataaa 7440 aaagattatc agtgataaat acgtgaaaat caacgggagg agcactagac tttatctagc 7500 acgctttaaa ggaaaagacg cagacaaaga taagtggcta gaagcccagg atataccaga 7560 ctcagataga ctccttagga aattcagagt cgagagacgc gagaataata ctcaggcctg 7620 aggttggggc gtcattgcat tccacccctg aggttgggga atgtcagctc tcgagctctt 7680 acacataaga gccaaaagac ctacaagaaa caaaatatta attaatcatt ttttttgtaa 7740 cataattata caacttattg acctagtttc aaaggtcctc tcagaaagat acgccaggaa 7800 aacaactaaa agaaatataa acataaccgt ctacaaagac acagacacta ccagcataaa 7860 gtattttccc caggaaaaaa tgaatacatt aaatgcatta ttcaatcaaa aatgctcagt 7920 taatgctgaa taatgctgat ttaatgctga tttaatgctg atctaatgca ttttccatca 7980 tttcttgctc taaatgtatt aaatctttcc gtggggaaat tactttatgc tggcagtgag 8040 acagcttatt gacttagtat cgaaagtcta aaaaaattat gtaatataga aatgtaaagc 8100 acacaattat caaaagaaca atgaaaccag tctagtctct ggaaaagaag actaggccgg 8160 
agaaatcaag aaaaggaagg aactgccaag gaaagattcc ttgaagcagc atataaaata 8220 acaaatagaa taatagaaaa gaagtatctc tcaagctagt tagaggttaa atagagatca 8280 cgaaagatac taatctatct ggagaactga tcatgggaga tataagatta ctgaaatacg 8340 atgaccaagc tgccctttac gtatcattcc acttcgtaat tgatattacg taaggtcctt 8400 tacgtatcct cccactttgt aattgatatt acgaaaggag ctaacggatc agctcggagt 8460 taattgaacc taagccaaga gaacacgtat tctattatca atgataagat agacaaggag 8520 tgctaaaagg tataagtac 8539 // ID TCN5-LTR repbase; DNA; FNG; 591 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 30-MAR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR retrotransposon - LTR consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW TCN5-LTR. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-591 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-591 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-591 RA Gentles A. and Jurka J.; RT "C. neoformans LTR retrotransposon TCN5."; RL Direct Submission to Repbase Update (15-MAR-2005). 
XX DR [3] (Consensus) XX SQ Sequence 591 BP; 127 A; 153 C; 111 G; 200 T; 0 other; tgtaaggcat gcatgccaaa cacggtcatt atgatcccaa ggtctgtttc acttcataca 60 tagatttgct gccgcttttc tcttattttg attccgttta ttttgcgcgc tcctcacagg 120 cctcaaagga atttgcagca aggaaaagcc tctcttcttt atctttagaa agagagccct 180 tttcctttgg ttgtccgcga cctgctgtag ttcttttgag tgcttgtgag tcccacttcc 240 tacttggaat tgtggtttta tgctaactgg gcggacttta gtaattatag cttcgttgac 300 actctgccca ggtatgcttg taacttcttc tttatcttta gaaagagagc ccttttcctt 360 tggttgtccg cgacctgctg tagttctttt gagtgcttta attatagctt cgttgacact 420 ctgcccagtg atcaacaaac atatgcaaac tcatagaccg cttctgcgcc cggacaatcc 480 aactacctct ttccttgcca ccgactagac aacagtcgtg ttccagccgt ctctctttca 540 aatcaaatca agccgaactc tttcagtgaa catcccccct gggttgttac a 591 // ID Copia-7_MLP-I repbase; DNA; FNG; 4530 BP. XX AC AECX01000123; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-7_MLP_; KW Copia-7_MLP-LTR; Copia-7_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4530 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000123; Positions 27105 22576. XX CC Positions [1867-2391] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 214..4509 FT /product="Copia-7_MLP-I_1p" FT /translation="MPQESSDSRTFGEPIQKFSQIMTNALSKYKVSDDLTD FT DNYIEWSQSLMEVFRSLELHYYVKVKDHVNPNLTAEEVEKTRFNITTYILG FT RLDSSNKLRVRNKLTDPLDPTELIYDPYECWTYLKDFHHRISEDKLETITR FT SLYSCQISRSDTLTGFVDKFENLIRDFYRLKGELSDAQSARMLLGAIPSLT FT VETKEYIHNTVVPLTRDGVGRYLRKFEEHHGWTTSAVREVHSVTVKSSKKE FT CTPDECFGPHMAKNCWSKPENADKRSAFLAKIRGKSSTGSSNSTSQPQATS FT TIRGVKNVYDSNVNSASASMAFLSLNVEFVEIETSNKSTPEEIYSAPSEFE FT EDPDVSASVSAVLSTSSDWALHDTGATHHMFKDVKYFDQSSLIKIEDASKR FT LKLAGGDISLAVHSRGIAKLLAGDDTVFELKNSLYVPELSRNLVAGGLLKK FT QGVREIFHDSKNFSLVINDLAVFNGFISDNNLMFILLQPVSLGASSSVSAS FT ITEISSLLQHRRLGHVSSKYLKLMAKLDSVEEFSYSDETLNCDTCSLSKNT FT KLSFNKSRPRARNFLENVHVDLSGIIRTTGLNNENYFVLFCDDYSLYQHIY FT PLNGKTKEEVYDVFMAYLAVAERQTGCSLKQFTLDRGSEFLNNLLGQKLKE FT LGITLHLTSGHTPEENGVSERGMRIVNTRARSMMLGSSLPIRFWYYACSTA FT VFLTNRCVTASLESGKTPFEMWYFRKPSVQHLRVFGCQAFGLIRKELRDSK FT FSPVSSEGVLVGFHQDNFNYQIFDLSTKKVYTTHHATFNEDIFPFSEVPAI FT SSNLNSTLDRRTVKVQFFDEDSESEDIPEDANPVEESLSRLDDQRTNLDVS FT SSVPPAPRRSTRTSKKQHNSLKGMCSIAVFECDFVTGSFFAAFPPEANLGD FT LHPIPDPKSYKRAMESSESDLWRAACDKEFKSLKDKDVWVLVDRPSDKNVI FT RGMWIFRKKQLVSGGVKYKARFVAMGNTQIPGQDYGETFAPTGKPVSLRIL FT IAFAAIMGWEVHQMDAVTAFLNGDLDEELYVEVPEGYRTKSSAGKVWRMKK FT ALYCFKQSPKIWQDDVEEFLIEIGFVQCEIDHCIYIRAKNDLFTAVYVHVD FT DLAITGNDISTFKIEISAKWEMEDLGIAHTVVGIQIDRIDKFTYSMSQQQY FT ALTVLSRFGMVDAKPAITPLAPNVKILKSTDEEAHEFSLTKLPYRSVVGSL FT MYLAQCTRPDLAHSVGVLSQHLERPGKLHWEAAMHVLRYLSHTVNIGIVFS FT GISTNHVEGQRSFECPLSHCDADWAGDANTRRSTTGYVFVLAGGPISWRSR FT LQPTVALSSTEAEYCAITEAGQELLWLRNMMARFGFTDPNPTVLQSDNLGA FT IHLTSKSIFHARTKHIEIHYHWIREVVKKGDMVIKHCPTHLMVADLLTKQL FT PKEQFSTLRKSLGLRFLA" XX SQ Sequence 4530 BP; 1266 A; 991 C; 954 G; 1319 T; 0 other; ttttatggta gcaagagtaa aagatcttat cgaccaattt tcgcttgtat gagttctctt 60 gaaaacgtcg acaacactac cgacgagacc ctcgataact ctaccgtaac cgaaggctct 120 agctcttcaa attctaccaa ttccttttga acggttcatc aatctaattc tcttcgtgat 180 cctaattcaa cttcgcatca ccaatcttca gccatgcctc aagaatctag tgactccaga 240 acttttggtg 
aacctatcca aaagttctca cagatcatga cgaacgcttt gagcaagtac 300 aaagtgtctg atgatctcac cgatgacaac tacatcgagt ggagtcagtc tttgatggag 360 gtcttccgct ctctggaact ccattactat gtcaaagtga aggatcacgt aaatcccaac 420 ttgaccgctg aagaggttga aaagacacgt ttcaacataa ctacctacat acttggtcga 480 ctcgattcat ctaataaact cagagttcga aacaagctga ctgatccgtt ggatcctacc 540 gagttaattt atgatcctta tgagtgctgg acttacttaa aagattttca tcatcgaatt 600 tcagaagata aacttgaaac aattactcga tctttgtact cctgtcaaat ttccaggtct 660 gataccctta ccggcttcgt tgacaaattt gagaatctca ttcgagattt ctatcgccta 720 aagggcgaat tgtcagacgc tcagtccgct agaatgcttc taggggctat tccctcactc 780 actgttgaaa ccaaggagta cattcacaat acggtcgtac ccttaactcg agatggagtt 840 ggaagatacc tcaggaaatt tgaagaacac cacggttgga caacctcggc tgtccgtgaa 900 gttcactctg tcaccgttaa atcatcaaag aaagaatgca ctcctgatga gtgttttggt 960 cctcatatgg ccaaaaattg ttggtcgaaa cctgaaaacg ccgacaaacg ctcagccttt 1020 ctggcgaaaa tccgaggaaa atcgtctact ggtagttcaa attcaacctc ccaacctcaa 1080 gctacttcta ccattagagg agtgaagaat gtatatgact caaatgttaa ttcagcctca 1140 gccagtatgg cctttctttc attaaacgtt gaattcgtgg agattgaaac atcgaacaaa 1200 tcaactcctg aagaaatcta ttccgcaccc tccgagtttg aggaagatcc cgacgtcagc 1260 gcctctgtat ctgcagtgtt gtctacttcc tcagactggg ctctccatga tactggagca 1320 acccatcaca tgttcaagga tgtgaaatac ttcgaccaat cctctctgat taagattgag 1380 gatgcaagta agcgtttgaa attggctggt ggtgatatct cgttagctgt tcatagtcga 1440 ggtattgcta agcttctggc tggggatgat acggttttcg aactcaagaa ttcactttat 1500 gtaccggaac tttcaagaaa tttagtggct ggaggactgc tcaagaaaca aggagttagg 1560 gaaatctttc acgacagcaa gaatttttct ctggttatca atgatctggc agtcttcaac 1620 ggtttcatct ccgacaataa cctaatgttc atcctactcc aacctgtgag tctaggtgct 1680 tcatcatcag tttcagcttc gattactgaa atatcttcat tgcttcaaca ccgacgatta 1740 gggcatgtaa gctctaaata cctcaaactg atggcaaagt tggatagtgt ggaggaattc 1800 tcttactcag atgaaacttt aaactgtgat acttgttctc tttccaaaaa taccaaatta 1860 tccttcaaca aaagtagacc ccgtgctcgt aatttcctag aaaacgtcca tgtagatctt 1920 agcggaatta ttagaaccac tggtctgaac 
aatgaaaatt attttgtttt attctgtgat 1980 gattactctt tgtatcaaca catttacccg ttaaatggta aaacgaagga agaagtttac 2040 gacgttttta tggcttatct tgccgttgcc gagaggcaga ctggctgctc actgaagcag 2100 ttcacactgg atcgtggtag tgaattctta aacaatctac tcggtcagaa acttaaagaa 2160 ctagggatca ctttgcacct tacttctggt catacccctg aggaaaatgg cgtctctgaa 2220 cgtggtatgc gcatagtcaa taccagggcc cgttcaatga tgcttggctc atctttacct 2280 atccgttttt ggtattatgc atgcagtact gctgtgtttt tgacaaatcg ttgtgtgact 2340 gcttctctcg agagtggtaa aacccctttc gaaatgtggt attttcgtaa accctcagtt 2400 caacatcttc gtgtgtttgg gtgtcaagcc tttggtctta tcagaaaaga attgcgggac 2460 tcaaagttct ccccggttag ctcagaaggg gtgttagttg gcttccatca agacaatttt 2520 aactatcaaa tctttgatct gtcgactaag aaagtttaca caactcatca tgctacattc 2580 aacgaagaca ttttcccgtt cagtgaagtt cctgcgattt cttctaactt aaattcaact 2640 ctggatcgac gtacagtgaa ggttcagttc tttgatgaag acagcgaatc tgaggatatc 2700 cctgaggatg ctaatccagt cgaggagtca ttaagtagac tagatgatca aaggacaaat 2760 cttgatgtct catcgtctgt tccaccagca cctcggcgtt caactagaac ctccaagaaa 2820 caacataact cactcaaagg aatgtgctct atagcggttt ttgagtgcga ttttgtgacc 2880 ggttccttct tcgctgcttt tcctccggaa gcaaatcttg gagacctcca tcccattcct 2940 gacccaaagt cttataagcg tgcgatggaa tcctccgaaa gtgatctctg gagagcggca 3000 tgtgataaag agttcaaatc tctaaaagat aaggatgtct gggttttagt ggatcgtcct 3060 tctgacaaga atgtaattcg gggaatgtgg atttttagga agaaacagtt agtttcaggg 3120 ggtgtgaagt ataaggcacg ctttgtagct atggggaaca cacagattcc tggtcaagac 3180 tacggcgaga cctttgctcc caccgggaaa ccagtctcac ttcgtatctt gatagctttc 3240 gccgctatca tgggctggga agtgcatcaa atggatgccg tcactgcgtt cttaaatggc 3300 gatcttgatg aagaacttta tgtcgaagta cctgaaggct atcgtaccaa atcatccgct 3360 ggaaaggtct ggagaatgaa gaaagctctt tattgtttca agcagtcccc caagatctgg 3420 caggatgatg ttgaagagtt tctcatagaa attggctttg tacaatgtga aattgatcat 3480 tgtatctaca tacgtgcaaa gaacgactta ttcaccgccg tatacgtaca tgtcgacgac 3540 ttagccatta cgggaaatga catcagcact ttcaaaattg aaatctcggc aaaatgggag 3600 atggaggatt tgggaattgc tcacactgta gtgggtattc 
agattgatag aatcgacaaa 3660 ttcacttact ccatgtctca acaacagtat gccttgactg ttctcagcag atttggaatg 3720 gttgatgcca agcctgccat tacacctttg gcaccgaacg tgaagatcct gaaatcaact 3780 gacgaagaag ctcatgagtt ttccttgact aagttacctt acaggagtgt ggtaggatca 3840 ctgatgtact tagctcagtg tacgcgtcct gacttagctc actccgtcgg ggttctctct 3900 caacatcttg aacgccctgg aaaacttcac tgggaagcgg cgatgcatgt tttacgttat 3960 ctcagtcaca ctgtcaacat tggtatcgtc ttttcaggca tttcgacaaa tcatgtggaa 4020 ggacaaagga gttttgaatg ccctctctct cactgtgacg cggactgggc cggtgatgct 4080 aatactcgtc gctctacgac gggttatgtc tttgttctcg ccggtggacc aatctcctgg 4140 agaagcagac ttcaaccaac agttgctttg tcgtcaactg aggctgagta ttgtgcaatt 4200 actgaagccg gtcaagaatt actctggtta cgaaatatga tggctaggtt tggattcact 4260 gatccaaatc ccacagttct ccagagtgat aacttaggtg ctattcattt gacttcaaaa 4320 tcaatattcc acgcaagaac aaaacatatt gaaattcatt atcactggat acgtgaggtg 4380 gttaaaaagg gtgacatggt tatcaaacat tgcccgaccc atcttatggt tgctgatctc 4440 ttaacaaaac aattaccaaa ggagcagttt tcaacactca ggaagagtct aggtttaagg 4500 tttttagcgt aaatgctctt tgagggggtg 4530 // ID Gypsy-32_MLP-I repbase; DNA; FNG; 5928 BP. XX AC AECX01000965; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-32_MLP_; KW Gypsy-32_MLP-LTR; Gypsy-32_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5928 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000965; Positions 154220 160147. XX CC 'CTATA' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 377..1450 FT /product="Gypsy-32_MLP-I_1p" FT /translation="MMEDIQRQLNELSDSLTNERTLRERAEARSIAAEARL FT AAIESSRSATHNAPPPNPTVTMQSPAEIVPTPKGPKVSTPDKFSGARGAPA FT EIFATQVQLYMLAQHHLFPDDRSKVVFTLSYLTGAASSWAQPMTLELLDDN FT TSHLVTFERFITNFKAMYFNTEKKSKAERALRALTQKGSVAAYAHEFNLYA FT TATGWEVPTLISHFEQGLKKEIRVAMVMVQEDFTSIEQIANLAIKLDSKIH FT GATSTSATFHVPTDPNAMDISSNFVRLSEEERAKRLSTGSCFHCAGQGHRV FT SECPVKRNGGSNMKRRGGLTGQGGFRHKIAELEAKIAAMGGESSGGEDRQG FT EGSRADKSKNGAAQA" FT CDS join(2045..3784,3788..5056) FT /product="Gypsy-32_MLP-I_2p" FT /translation="MLFLIPLMLLNVLALHHKLQIISPAILQTQTHEDEYH FT IAAVTKASSIPKHNPKDPMEMPQGQARIRDEGMSIIDDTRTSPQCEFDLIH FT HQIPIEIAGKLYSNPKSQTTPHQPIPNDVIQTSSITEQDTIIATDLPVLSV FT PHHNPKDPVEMPKGQARKCDEGMCIFNDTLASPRCESIISSDFTACDTAGQ FT HDSSLNTTVIDVDASKTSWSTSARLAADLKLQEPVKTVEELVPSYYHRHLR FT MFRKSDSQCLPPRRKYDFKVDLIPGAQPQSSRIIPLSPAEDAALDKMIREG FT LANGTIRRTTSPWAAPVLFTGKKDGNLRPCFDYRKLNSLTVKNKYPLPLTM FT DLVDSLLDAEKFTKLDLRNAYGNLRVAEGYEDILAFICKQGQFAPLTMPFG FT PTGAVGHFQFFIQDILIGRIGNDTAAYLDDVMIYTKAKVHHEEAVDGILDV FT LSNQNLWLKPEKCEFSKDEIEYLGLIISKNKVKMDPTKVKAVREWPAPRNV FT NELQRFIGFSNFYRRFIDHFSHTTRPLHNLTKLKTPYVWDTECEKAFETLK FT TAFTSAPVLKITDPYKAFILECDCSDFAIGAVLQVCDEDNFLHPVAYLSRS FT LALAERNYEIFDKELLAIVASFKEWRHYLEGNPNRLEVIVYTNHQNLETFM FT TTKQLTRRQARWAETLGCFDFIIKFRPGRNAAKPDALSRRPDLAPKEEDKL FT TFGQLIKPENLVEDSFLAEVDAFDCFFNDETVELDNAKHWFEVDVLGITDP FT ITEITEEDQITTDDEIINLVRQANKQDERINELMNAKMNPISSKIKMAVND FT YQIQDGVLYNKGRIEIPNDDHIKYLIVRSRHDSLLAGHPGRAKTLGLVRQS FT FIWPSLKAYVNRYVDGCDSCLRVKSTTQKPFGTLEPLPIPAGPWTDITYDL FT ITKLPMSNGYDSILTVVDRLTKMSHFIPCKESMTANELADVMIRNVWKLHG FT TPKTIVSDRGTIFVSQITRELDKRLGIQLHTPVDRFSSTNRWSKRDRQ" XX SQ Sequence 5928 BP; 1789 A; 1341 C; 1305 G; 1493 T; 0 other; tattgttgga tctcgtatca ggaatcaagg aacaagaaat tgatatatta gaattagatt 60 gaaactaaat cacttagaac caaaccgaat ccttgggata ctcatcgaaa ccttatcgaa 120 ttgaattgaa actatagttg aacctactgc atcactagat tagataatca gaattcttca 180 cttgcccaca gaccaccaca cttagaacta gaccaccaca cttagaacta gaactcaact 240 
gaaatacgcc accacgtcac cctcgtacag tctccccgca atcggcgaag acgagtctga 300 ctctgaaact ttcctcgata ctgaaactca tctagcaatt gaagtaccta ctcgtctagc 360 aatcgaagga ccgagcatga tggaggacat tcagagacaa ctgaatgaac tttcggattc 420 tctcacgaac gaacgcacgt tacgagaaag agctgaagcc aggtctattg cggctgaagc 480 tcgactcgcc gcgattgaat catctcgcag tgcaactcac aatgctcctc caccaaaccc 540 tactgtcaca atgcagtcgc ctgctgagat cgtgcctact cctaaagggc ctaaagtatc 600 gactccagat aagtttagtg gtgccagagg tgctccagcg gagattttcg ctactcaggt 660 gcaattgtat atgctcgcac aacatcacct atttcctgat gatcgaagta aggtcgtatt 720 caccttatcg taccttaccg gagctgctag cagctgggct caacctatga ccctcgaact 780 ccttgacgat aacacgtctc acttggttac cttcgagcgt tttatcacca attttaaggc 840 aatgtacttc aacaccgaga agaagtctaa agcggagcgc gcgttaaggg ccttaactca 900 gaaaggctct gtagctgcat atgcacatga attcaatctg tacgcgacag ctactggatg 960 ggaagtcccc actctcatca gccatttcga acaaggtctg aagaaagaaa tccgtgtggc 1020 aatggtcatg gtccaggagg atttcaccag catcgagcaa attgctaatc tggcgatcaa 1080 gcttgacagc aagattcacg gcgctacaag cacatccgcc accttccatg tacctacaga 1140 cccaaatgct atggatattt catctaactt cgtacgctta tcagaggaag aacgtgctaa 1200 acgtttaagt accgggtctt gttttcattg tgcaggccaa ggccaccgtg tgtctgagtg 1260 tccggtgaag cgaaatggtg gtagtaatat gaagagaaga ggtggtttaa caggtcaagg 1320 tggttttaga cacaagattg cagagctgga ggcgaagata gctgcgatgg gaggtgaaag 1380 ttcagggggt gaggatagac agggagaagg aagtagggct gataaatcga aaaatggcgc 1440 tgctcaggct tgaagggtgt gcctatcctg agctcaagag ggggattaga tgtcatagaa 1500 cttggtgcta gtgaaattca aacctgcaat tcaaatgatc caaggctttt tttacgtgca 1560 actctatcat tgtcccaagt ccctcgcgcc acaccacatt ttccttttca agctctattc 1620 ctaattgact caggtgctac gcataacgtc ctaagtgagt ctattgccgc taaggcacaa 1680 ctacttcagt atgcctaccg atccaccaga gtagtgactg gctttgatgg atccgtgagc 1740 caatcctcct acgaaatcga cttacacatt caaaatgaac ctacccacac ccgatttatc 1800 atcacaaaaa tcaagaattc ctacaacggt attttaggaa tcccttggat taaagaacat 1860 agccacctcg tcgattggaa agaaggcgtg attattgatt cacacgttgc ggctgttact 1920 acagcgttgt ctcgtccgcc 
accaccctta gaggacccga cgttggagcc cataagggac 1980 gctaggattg ttgacaaggg gatgtgtatc agtgatgata cattagcatc cccgcaacgt 2040 gcgtatgctc tttctgatac ctctcatgct gttgaatgtg cttgctttgc atcataaatt 2100 acagatcatc tcaccggcca tcttacagac tcaaacacat gaagacgaat atcacattgc 2160 agccgtaacg aaggcttcgt ccattccgaa acacaacccc aaggatccaa tggagatgcc 2220 acaggggcaa gctaggatac gtgacgaggg gatgagtatc attgatgata ctagaacatc 2280 cccgcaatgt gagttcgact tgattcatca ccaaattccc attgaaatag ctggcaagct 2340 ttattccaat cctaaatcac agacaacacc acatcaaccg attccgaacg acgtcattca 2400 gacttcatcg atcactgaac aggacactat catcgcgact gatttaccag tcttgtcagt 2460 tccgcaccat aaccctaagg atcctgtaga gatgccgaaa gggcaagcta ggaaatgtga 2520 cgaggggatg tgtatcttta atgatacgtt agcatccccg cgatgtgagt caattatttc 2580 ttcagatttt actgcatgtg acacagctgg ccagcatgat tcttccctga atactaccgt 2640 tatagatgtc gacgcttcta agacatcgtg gtccacatca gctagactgg cggcggattt 2700 gaaacttcag gagccggtga agactgtcga ggaactcgtg ccatcctatt accatcgcca 2760 tcttcgaatg ttccggaaat ctgactctca gtgcctacct cctaggcgta aatacgattt 2820 taaggtagac ctcatccctg gagcgcagcc tcaatcaagc cgcatcatac ctttatcacc 2880 agcagaagat gctgcattag ataagatgat tcgagaggga cttgcgaatg gcaccatccg 2940 tcgaaccacg tcgccatggg ctgcccccgt gttattcacc gggaagaagg atggcaacct 3000 ccgcccatgt tttgattata gaaaattgaa ttcactcacg gttaagaata aatacccact 3060 tcctttaact atggatttgg tagatagctt gttggacgct gaaaagttca ctaaactcga 3120 ccttcgtaac gcctacggta atcttcgtgt agccgaggga tatgaggaca tattggcttt 3180 catctgtaag caaggacagt ttgctccact tacaatgcca ttcggaccta ctggcgcagt 3240 cggtcatttt cagttcttta ttcaagatat cttaattggc aggattggga acgacacagc 3300 ggcttacctt gatgacgtga tgatttatac aaaggcaaaa gtccatcacg aagaagctgt 3360 agatgggata ctggatgtct taagtaatca aaatttatgg ctcaaaccgg agaagtgtga 3420 gttttctaag gacgaaattg aatacttagg acttatcatt tcgaagaaca aggttaagat 3480 ggaccccacc aaggtgaagg cggttaggga atggccagca cctcgtaacg taaatgaatt 3540 acaacggttc ataggattct cgaactttta tcgaaggttt attgatcact tttcccatac 3600 gacccgtcca ctacacaatc tcaccaaatt 
gaagacaccg tatgtctggg acaccgagtg 3660 cgaaaaagcc ttcgaaacct tgaaaaccgc ttttacgtca gctccagttc ttaaaatcac 3720 ggacccgtat aaggcattca tccttgaatg tgattgttcg gactttgcga ttggtgcggt 3780 gctctagcaa gtgtgtgatg aagataactt tctacatcca gtagcgtatt tgtcaaggtc 3840 cctggcgctg gcggaacgaa actacgagat ctttgacaag gagctactgg ctatagtagc 3900 ttccttcaaa gagtggcgtc attatctcga gggtaatcct aacagattag aagttattgt 3960 ctacaccaac caccaaaacc tggagacgtt tatgacaact aaacagctta ctagacgtca 4020 ggctaggtgg gcggagactt taggctgttt tgacttcatt atcaagttca gacctggtcg 4080 aaatgcggca aaacccgacg ctctatcccg aagaccagat cttgctccta aagaagaaga 4140 caagttgaca ttcggccaat taattaaacc cgaaaatttg gtggaagact catttttggc 4200 tgaagtagat gcttttgatt gttttttcaa cgatgagacc gtagagcttg ataacgccaa 4260 acattggttt gaggttgatg tgttaggaat tacagacccc atcactgaaa ttacagaaga 4320 ggatcaaatt acgacggacg atgaaatcat taatttagta cgacaagcaa acaaacaaga 4380 tgaacgtatc aatgaattaa tgaacgcaaa aatgaacccg atatcttcaa agatcaagat 4440 ggctgttaat gactatcaaa ttcaagatgg tgtattatac aacaaaggac gcattgaaat 4500 tccaaacgat gatcacatca agtacctcat tgtacgtagc agacatgatt cattactggc 4560 tggtcaccct ggaagagcaa agactctcgg attagtccgg caaagtttca tttggccttc 4620 attgaaggca tacgtcaata gatatgttga cggttgcgac tcctgtctta gagtcaaatc 4680 aaccacgcag aaaccatttg gcacccttga accgctccca attccagctg gtccatggac 4740 cgacatcact tatgatctca ttacgaaact gccaatgtcg aacggatatg acagtatttt 4800 aactgtagtg gaccgcctga cgaaaatgtc tcattttatt ccatgtaaag agtctatgac 4860 cgctaacgaa ttggccgatg ttatgatcag gaacgtctgg aagcttcacg gcacgccgaa 4920 aacaatagtt tccgacagag gcacgatttt tgtgtcacaa atcacacgcg agttggataa 4980 acgtcttggt atacaactac atacacccgt cgaccgcttt tcatccacga accgatggtc 5040 aaagcgagat cgtcaataag tgcattgaac aataccttag acactttgtt caataccgtc 5100 aggacgactg ggaggcgttg ttgccgacag ccgaattcgc gtataacaac agagatcacg 5160 aatcaacggg catgtcgcct tttatggcaa attacggtta caatcccgtg ttcaataaag 5220 ttccatcctc tgaacagtgt atacctgtgg tggagactag gctaaagatg attgaagatg 5280 tgcaaaaaga gctgactagt tgtttggaat ccgcgcagga 
atcaatgaaa acccaatttg 5340 accgacatgt tgggaagaca cctgactggc aaatagggga ccaggtgtgg ctcaacggga 5400 ggaatatttc tacgacgaga cctagtccaa aactcgaaca tagatggcta ggtccatttc 5460 ctataattga gaaagtatca aattcagctt acaaattgat tcttcctgat tcaatgaaag 5520 ggatacatcc ggtattgcat gtgtcattat tacgcaaaca ccaagtagat tcaattaaag 5580 aaagaagacc aacatcaccg ccaccaatta tcattaatgg agaaaatgaa tgggaggtag 5640 aagaaatact cgacagcagg acacggcgta agaagaaaga atatttaatt aaatggaaag 5700 gttatcagtc aaatcacaat tcttgggaac cggaaggcaa tttgattaac agtaagcaac 5760 ttttaaatga ttttattgca agatttccaa atacgtcaac aacaagaatc aagaggaaaa 5820 ggagaaagtg agagagggca tagctttttc ccacagggtt ttttaatgct gcccagggaa 5880 agaacacagg acttgcaaga gggagtttgt gcgttaaatg ggggataa 5928 // ID Gypsy4-I_AO repbase; DNA; FNG; 6649 BP. XX AC . XX DT 25-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE An internal portion of the Gypsy4_AO LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4_AO; KW Gypsy4-LTR_AO; Gypsy4-I_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-6649 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-6649 RA Kapitonov V.V. and Jurka J.; RT "Gypsy4_AO, a family of Gypsy LTR retrotransposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 5-5 (2006). XX DR [2] (Consensus) XX CC This is an internal portion of the Gypsy4_AO LTR retrotransposon. CC Its long terminal repeat is Gypsy4-LTR_AO. The only ORF encodes a CC 1999-aa Gypsy4_AOp polyprotein composed of the gag, zinc knuckle, CC protease, RT, integrase and chromo domains. 
XX FH Key Location/Qualifiers FT CDS 58..6054 FT /product="Gypsy4-I_AOp" FT /translation="MSSQSSSSKKTPVKNTPPAETDSESETTVKEQLKQMK FT NMITQLVNNAKEKNQEIENLKVQLGEAERIRSEQQDHIAQLDAQVGASAPK FT DAIGKVKLPKAEPFDGTRSKLQAFLTQMNMHIHANRKNLIDEADKVIFIST FT HLRGAAWNWFEPYIREYYEVVPDNWSNTTRELFTDSGNLRKHLERTFGDVD FT AEAVAERKLKQLYQRGSASTYAAEFQQIISRMDWNEKVYVSTFISGLKGHV FT KDEFARIDRPATLNEAIDFAVKVDNRYHERLMEKRDNEAWRKGSHRPKGQY FT KSNDQRERTGAKHNDPYGLKPMELDATEGQSQSRGISQKERERRKREKLCY FT NCGKAGHMSKDCRQKRNSHQSNRKPQQMNATEDEAEPPRKARFAQLNANAE FT ADKPHDQARGAYCTTGIKSPIMDEIGEEIEAIQLNANMDLSARETERYREN FT EPNDDDWITLDWLTTHTNQTIWPRIDQDWSNLQEATEQWYGQLNEHEIDEL FT ADHANDETDRLGQVNSESQYEAIMEPIRRRVRYALTHRVDGPDNTDQYNEP FT VSSIDQSNRVLVEIDPLNEEYGTQWIGHDPNMEGLLEVPETPENPQDEDEN FT AHRRVMALIDTLQEVVSPRRRAPASRPYQTQQIEVEPPRRQETLTNWTNNV FT IDEIIRNPRRFSRPLRMLLEQCPHWNHECWDSNIENWDEHCQQCNKHPIVC FT EICGADRFEYYGELELINPDRAKRGHEGTHHWLRECECCHYATEPMHNRYP FT WVVCFDDSCTHHRIWKQIARFWPQNDANRGTLAATRQGRHITTTIAINGKP FT ARAMIDSGATNNFMSPRYRENMKIEGRQKENVEPLLGLDGQKLGTGQVSVE FT TVPVTMAVGQHVESIAFDVTPLGNKYDVVLGISWLEDHNPTIDWKQRTLHL FT NNCHCPKGPCMGYGTRTLTSKCTGSGRIERRDQNTAKGNSAKNMIMAATRY FT SEKEWLAELMGWAPTNEQERLEVMTLESGSEEEWHSSPETNQTSSESDSDS FT WTLLDSQELAANSAEQPSLPKEYQGFRELFEQPRTNKLPEHGPHDHTIPIQ FT EGKEVTCKRIYPMSERESQALKEYIKDRLEKKQIRPSKSPAGHGVLFVPKK FT GGELRLCIDYRPLNDITVKDRHPLPLITEIQDKIRGAKWFTKLDITDAYHR FT LRIAEGEEWKTAFRTKYGHYEYLVMPFGLTNAPASFQRFINEALGEILDVF FT VIAYLDDILIFSHSLEEHVQHVQTVLEKLQRAEVRLKLKKCEFHVQETEFL FT GHWISTEGIQAEEGKVKAIREWPEPTNLKELQQFIGLLNYYRKFIDRYAHR FT LAPLFDLLKKSKQWEWTNEHQSAFDKAKEAITTAPILAQHDPAKQTIIETD FT ASDYAIGARMVQAGPDGKLRPIAFESRKLVQAELNYDIHDKELLAIVSAFK FT KWRVYLEGAQHQIIVKSDHKNLTYFTTTKELTRRQARWAETLSQYDFRIEH FT CKGSENGQADALSRRPDHEIKGKTIETAILKQHEDGSIGYNKQTLAAVTVE FT IKDPTRHLIAKANKKDEALTQKLEASDDLFTKDEDGIVYYRNLIWVPQKLR FT NMIIQEHHDNPTRGHFGVEKTSEQIARNYYFPNMAKQVRKYIDKCETCIRD FT KPARHKPYGLMQSPDAPSRPWEWITIDFVGPLPESEGWDMITVITDRLTKY FT IHLVPSKSTLDAVHLAHLLVNHVFVHHGMPKKITSDRDKLFTSKFWQSLTD FT LMGIDQKLTTAYHPQGNGQTERTNQTIEQYLRHYVNYQQDDWANLLPTAQF 
FT AYNNAEHSTIGTTPFYANHGYHAKVAGEPRNKQPVAEEAIETVEGLKSLHN FT QLSLDIKFFNHRAAMYYNRHHEKGPTFKKGEKVFLLRRNIKTKRPSSKLDH FT QKIGPFRVEEQIGNVNYRLKLPDSMKKIHPVFHISLLEPAPENAKIAENIE FT LDEEGTEYEVEKILKHKRVNGKPHYLVKWKGYSTSENSWEPIENLTGCHQL FT VRQYHQRKDQNSPRRKGHPSAESS" XX SQ Sequence 6649 BP; 2281 A; 1597 C; 1544 G; 1227 T; 0 other; cattgatcac cagtctatac gagttgaacc acgtttggac ttaaaacatc gacaaacatg 60 tcttctcagt catccagcag caagaagact ccggtcaaga acactccacc ggcagaaact 120 gattccgaga gcgaaacgac cgtgaaagaa cagctgaagc aaatgaagaa catgattact 180 cagcttgtca acaacgccaa ggagaagaat caggaaatcg aaaatctcaa agtacagctc 240 ggagaagctg aacgaatccg aagtgagcaa caggatcaca ttgctcaact tgatgctcag 300 gttggggcat cagcacctaa ggacgcaatc ggaaaagtga aactgccaaa agccgagcct 360 tttgacggaa ctcgctcaaa attgcaagca tttctgacgc agatgaatat gcacattcac 420 gcgaatagga aaaacctcat cgacgaagct gacaaggtca tcttcatatc cactcactta 480 cgtggagcag catggaactg gttcgaaccg tatatccgag aatattacga agtcgtgcca 540 gacaattggt caaacaccac aagagagttg ttcaccgact cgggaaatct gcgcaaacat 600 cttgaacgaa ctttcgggga tgtcgatgcc gaagcggtag cagaacgcaa actaaagcag 660 ctatatcagc gaggcagtgc atcaacctat gcggcagaat ttcagcagat catatccagg 720 atggactgga acgaaaaggt ctatgtgtca accttcatca gcggtctcaa aggacacgta 780 aaggatgagt tcgcacggat cgatagacca gcaacactta acgaggcaat cgactttgcc 840 gttaaggtgg ataatcgcta ccacgaacga ctcatggaaa aacgggataa cgaagcctgg 900 agaaaaggca gtcaccgacc gaagggacag tacaaatcga acgatcagcg agaacgcaca 960 ggcgccaagc acaacgaccc ttacggattg aaacccatgg agttagacgc cactgaggga 1020 cagagccaat cgagaggcat ctcccaaaag gaacgagagc gtcgaaaacg cgaaaaactt 1080 tgttacaact gcggaaaagc aggacacatg tcgaaggact gtcgacagaa aagaaatagt 1140 catcagtcaa atcgaaaacc tcagcaaatg aatgctacgg aagacgaagc agaaccacca 1200 aggaaggcaa ggttcgcgca actcaacgcc aatgcggaag ccgataagcc acacgaccag 1260 gctaggggag cttactgcac caccggaatc aagagtccaa tcatggatga gatcggtgaa 1320 gaaatcgaag ccatacagct caatgccaac atggatttgt cggcaagaga gacagaacga 1380 tatcgagaaa acgaaccaaa tgacgacgac tggatcacac tggattggct taccactcat 
1440 accaatcaga ccatatggcc acgcatcgac caggattgga gtaacctaca ggaggccaca 1500 gaacaatggt acggtcaact taacgaacac gaaattgacg aattagccga ccatgcaaac 1560 gatgaaaccg acagacttgg acaagtcaac agcgaatcac agtatgaggc tatcatggaa 1620 ccaatcagaa gacgcgtaag gtacgcgctg actcataggg tggacggccc tgataatacc 1680 gatcaataca acgagcctgt ttcaagcatc gaccaatcaa atcgagtttt ggttgaaatc 1740 gatcccttga acgaagagta cggtactcaa tggattggcc atgatcccaa catggaggga 1800 ctccttgagg tacccgagac gccagaaaat ccccaagatg aagacgagaa tgcacaccga 1860 cgagttatgg cactcatcga cacactgcag gaagtggtgt caccaaggcg aagggcgcca 1920 gcatcgaggc catatcaaac gcagcaaatc gaggttgagc caccgcgaag gcaagagacc 1980 cttaccaatt ggaccaacaa cgtaatcgac gaaattatac ggaatcctag gagattctca 2040 cggccactta ggatgttatt ggagcaatgc ccacattgga atcacgaatg ctgggactcg 2100 aacatcgaaa attgggacga gcattgtcag cagtgtaaca agcatccaat cgtatgcgaa 2160 atatgtggcg cagacagatt cgagtattac ggggaactcg aactcatcaa cccagataga 2220 gcaaaaaggg gacacgaagg aacgcatcat tggttgaggg aatgcgaatg ttgccattac 2280 gcgacagaac caatgcataa tcgttaccct tgggtagtgt gctttgatga cagctgcaca 2340 caccaccgta tctggaaaca gatagcacgt ttttggccac agaacgatgc aaatcgagga 2400 acacttgccg cgaccaggca aggaagacac atcaccacga ccatcgctat caacggaaaa 2460 ccagcgcgag cgatgataga ctcaggcgca acgaacaact tcatgtcacc aagatatcga 2520 gaaaacatga agatcgaagg acgacaaaag gagaacgtcg aacctttact cggactagac 2580 ggccagaaac tgggaactgg tcaagtctcg gtcgaaacag tacctgttac tatggctgta 2640 gggcaacacg tcgaaagtat agcctttgac gtcacgcctt tagggaacaa gtacgatgtg 2700 gtgttaggga tctcatggct cgaagatcat aacccaacga tagattggaa gcaacggacg 2760 ctccatctga acaattgcca ttgcccaaag ggaccatgta tggggtatgg aacccgcacc 2820 ctcaccagca agtgcactgg tagtggaagg atcgaacggc gcgatcagaa taccgcgaag 2880 ggaaattccg cgaagaacat gattatggca gcgacccgat attcagaaaa ggaatggtta 2940 gctgaactga tgggatgggc acccactaac gagcaggaac gactagaagt catgacgcta 3000 gaaagcggat cggaagagga atggcactct tcaccagaaa caaatcagac atcgtcagag 3060 tcagacagcg actcttggac acttttggac tctcaggaat tagcggctaa ctccgcggag 3120 
cagccaagcc tgcccaaaga gtatcaggga ttccgagaac tattcgaaca gccacgaacg 3180 aacaagttac cagagcacgg accgcacgat cacactatcc ctattcagga agggaaggag 3240 gtaacatgca aacgaattta cccaatgtcg gaaagagaat cacaagctct gaaggaatac 3300 atcaaagaca gactcgaaaa gaaacaaatt cgaccatcga aaagtccagc aggacacggt 3360 gtattattcg tacccaagaa gggaggggaa ttacgactat gcattgacta ccgaccactg 3420 aatgacatta cggtcaagga cagacaccca ttaccgctca ttacagaaat acaagataag 3480 ataagaggag caaaatggtt tacgaaactc gatattacag atgcatacca ccgcctcaga 3540 atcgcggaag gcgaagaatg gaaaactgca tttcgaacaa aatatggaca ttacgaatac 3600 ttggttatgc cttttgggct caccaacgca ccagcatcgt ttcagaggtt catcaatgaa 3660 gcactaggag aaatcctcga tgtattcgtc atcgcatatc tagacgacat cctaatcttc 3720 tcgcacagcc tcgaagaaca cgttcaacac gtccaaacag ttttggagaa gctacagaga 3780 gcagaagtac gactcaaatt gaaaaaatgt gaattccacg tccaagaaac cgagttttta 3840 ggacactgga tatccaccga gggaatacaa gcggaggaag gaaaggtgaa agctatccga 3900 gaatggccag aaccaaccaa cctcaaggaa ctgcaacagt tcatcggatt gctgaactat 3960 tatcggaagt tcatcgatcg atacgcgcat aggctagcac cactctttga cttactcaag 4020 aaatcaaagc aatgggaatg gacaaacgag caccaaagcg cattcgacaa agcgaaagaa 4080 gcaatcacca ctgcaccaat cttggcacag catgatccag ctaagcagac catcattgaa 4140 accgacgcat ctgactatgc gattggcgca cgaatggtac aagcgggacc agacggaaag 4200 ctacgaccaa tagcattcga atctaggaaa ctagttcaag cggaactgaa ctacgacatc 4260 cacgacaagg aattgttggc tatagtgtca gcgttcaaaa aatggagggt ttacctagaa 4320 ggagcacaac accagatcat tgtgaaatcg gatcataaaa acctgacgta cttcacaacc 4380 acgaaggagc tcacacgaag acaagcgaga tgggctgaaa cactatcaca atatgacttt 4440 aggattgaac actgcaaggg atcagaaaac ggtcaagccg atgccttgag ccgaaggcct 4500 gatcatgaga tcaaaggaaa aacaatcgag actgcaatat tgaaacagca tgaggacgga 4560 tcgatcggat ataataagca gacactcgca gcagtaactg tggaaatcaa ggaccctact 4620 cgacacctca tcgcgaaagc aaacaaaaaa gacgaagcac tcacacagaa actcgaagca 4680 agcgacgacc tgttcaccaa agacgaagat ggaatcgtat actaccgaaa cctcatttgg 4740 gttcctcaga aactacggaa catgattatt caggaacacc atgacaaccc gacacgagga 4800 cactttggag 
tcgagaaaac atcggaacag atcgcaagga actactattt cccaaacatg 4860 gcaaaacaag tcaggaaata tatcgacaaa tgcgaaacat gcatcagaga caagccagca 4920 agacacaaac catatggact tatgcaatca ccagacgcac cttcgagacc ctgggaatgg 4980 atcacaatcg actttgtggg accactacct gaatcggaag gatgggacat gataacggtg 5040 ataacagacc gactcaccaa atacattcat ttggtaccaa gcaaatcaac actcgatgca 5100 gtgcacctcg cacacttact cgtcaatcac gtctttgttc atcatggaat gccaaaaaag 5160 atcacatcgg atcgagacaa gttattcaca tcgaagtttt ggcaatcact cacagaccta 5220 atggggatag atcagaagtt aaccacggca tatcacccac aaggaaatgg tcagacggaa 5280 agaacaaacc agacgatcga acaatatttg cgacactacg tcaattatca acaggatgat 5340 tgggcaaacc tgttaccaac agcacagttc gcatacaata acgcggaaca ttcgacaata 5400 ggaacaacac ccttttacgc aaatcacggg taccacgcta aagtggcagg tgagccaaga 5460 aataaacaac ctgtcgcaga agaagcgatc gaaacagtcg aaggattgaa aagcttgcac 5520 aatcaattat ccttggacat caagttcttt aaccatcgcg cggcaatgta ctacaaccga 5580 caccacgaaa agggacctac ctttaagaag ggggagaaag tattcctact ccgcagaaat 5640 atcaaaacga agagaccaag ttcgaaactc gaccatcaga aaatcggacc attcagagtc 5700 gaagaacaaa ttggcaacgt caactatcga ttaaaactac cagactcgat gaaaaagata 5760 caccccgtgt ttcatatctc gttattggaa cctgcaccag aaaatgccaa aatcgctgaa 5820 aacatcgaac tcgacgaaga aggaacggaa tacgaagtcg aaaaaatact aaaacataag 5880 cgagtcaatg ggaaaccgca ctacctggtg aagtggaagg gttacagtac ctcagaaaac 5940 tcatgggagc ctatcgagaa tttgacgggc tgccaccagc tggttcgaca gtaccaccag 6000 cggaaggatc aaaattcacc cagaaggaag ggtcatccat cagctgagtc aagctaggat 6060 ctaacccgga aaccgcggct agttgaacat caccacttga cgcactagcc ccggaacagg 6120 aaaccgcagt ttcctgatgg tgcaacttct cctgcgcctc acgctctttt tgctcttgac 6180 gctcaagttc ctcgagttcc tcaatttctt taatatcacg cgcaatgaaa tcacctgcac 6240 gtttttgcaa taaccgtttt tgttttcgca aacgaagaag cttcgccaaa ataacctcct 6300 cctcctcctc aatagtattc tgggctttca cgagccgttt ccactcagca tcagaaaagg 6360 aaggttcgga cattttgcaa gaggcaccgt tcgctttcac acactcgcta caacgaccag 6420 aattttcgga tttaatacaa gaacgatgaa ggcggataca acgttcgcaa gggaataccg 6480 cctgaaaccc ctcgtcttcg 
attcgctggt aaagcttgga tttacggaca gaactggaag 6540 gcatggtcag gacatgggta aaagaaaaag ttgaggatgg gcatctctca cctgggcaag 6600 accgcaactt tatgctttcg aggacgaaag cctgaaagag gggggatga 6649 // ID Gypsy-7_MLP-LTR repbase; DNA; FNG; 255 BP. XX AC AECX01001703; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_MLP_; KW Gypsy-7_MLP-I; Gypsy-7_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-255 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001703; Positions 69452 69706. XX SQ Sequence 255 BP; 84 A; 32 C; 40 G; 99 T; 0 other; tgtaatggat tgaaatactt ttatcaaata ttgttttaga ttctgttgtt atggaagaaa 60 ggataacaaa atacaaattt gaatagttac gaataatgtt gttagatgtt acatatggtt 120 gttttgtata aatatgtttg tgtatactta gttatgtttc cttttcctta tcaaatcaat 180 aaagtagttt tcctcactca taaaacataa gtagacaaga catggtcttg tctactgtac 240 acctgagccc ttaca 255 // ID Gypsy-29_MLP-LTR repbase; DNA; FNG; 198 BP. XX AC AECX01001225; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-29_MLP_; KW Gypsy-29_MLP-I; Gypsy-29_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-198 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). 
XX DR Genome; AECX01001225; Positions 198431 198234. XX SQ Sequence 198 BP; 59 A; 56 C; 32 G; 51 T; 0 other; tgttatgatc caattgacat gtaactggag tcgtgaaagg tctcacaata tgtcacagat 60 tagagaacaa ccactttgta catccacgtt gtatgactgc tctctttttc ctcatacgac 120 aatctacata ccagaccaga ctattggcag aacagaacct caacccccgt cccaatcccc 180 ccagtgacgg tcttaaca 198 // ID Gypsy-11_CCO-I repbase; DNA; FNG; 5984 BP. XX AC AACS02000004; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_CCO_; KW Gypsy-11_CCO-LTR; Gypsy-11_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-5984 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000004; Positions 759251 765234. XX CC Positions [4761-5279] - Integrase core CC 'CCTT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 532..1806 FT /product="Gypsy-11_CCO-I_1p" FT /translation="MQGPGGISMFHGVETKDDANSKDFLKKLRNFFRTSGI FT ATDTDKIECFADHLATNSPAERWFDDPKTGKDTWADVQSGFSARFPVPRVI FT ERTNEDIERELLGKRLKPGELGKRWERRGGEVWSHEAFADELLALAKEARI FT EETSTYIWKVREEFPASLRERVGTGYPTWVVFCAALKAVDGRVLAEVMERE FT REREEKIAAMERAQQAAAAETSRQIAALTAQLNALTTSSTGNIHRRPASGP FT QTTTRAPVALGSQNPSAGPTVSNDKWERLPEVVTEEMRRRTRELLELYPQQ FT PNTPEGIREWKNQAVRWVTANPGERRSTGLTGFPLSPKARPVCSGECFKCG FT QGGDDDPRGAHLGTACPRPRTEWIPRAEGSWRAWCARVLGRGGAQRAVAVQ FT MVEVMEDGDDDDLAFLGPDLAAHVKEAQGKA" FT CDS 1902..5984 FT /product="Gypsy-11_CCO-I_2p" FT /translation="MFQMNTIVDVYTVNSQVREAPFQHPVNLRKGRCNIVV FT TAMGTADGGALIGAMCSRRYRELPMELRRATKSGRWMRMADGGLVPSEGVW FT SGWLEVEGIAISVRLEIFDSGGCWDVLLGKPILRQLRAVHDYAKDIVAVKA FT REGDSPTLLESKASTKASATVGDDDAPVETSPHIPPRRVDNVIEEATDNHR FT QHLPDSHTNGTPVHTVLDEGTNIFTRHTEPFKPERVKRVVEEVRLGKDLTE FT EQRAAVRELIAEFADCFALSLREVNAVPGAVHKLNVPEGASFSTKVGARKR FT TPAQRQFMDKKIDEMLEAGVIRPIHPSEVRCAAQTVLAPKERDDGLPLAEL FT QHMVNDQCVSHGMEPIEDLPPRPEPRTPSGEGKPASSWRICHNFREINKVT FT EIAPMPQGEIRMKQQNLSGHRYIHVFDFAAGFYAVTVHEDSQPFITFYVEG FT RGYFAYVRMPFGVTGGPSSFAHLTASKFHDLVAEGTCELFVDDGGAASDTF FT EEGMTKLRRILERVRREKLSLSAGKMRVFMTEAVFAGAVVGPGGVTPDVTK FT LTAMVNWPTPKDAAHLEGFLGLTGYFRDLIPGYALTEKPLRDLLRRVGVPK FT GTRKAQYQRLMKGYKLEEEWREEHTKAFLTLKARLVSKPVLCAPRFDGTPF FT IMTTDGCKDAFAGVLTQRVKVTLPGGKSKTKLNPIGFVSKRTSPAEENYKP FT FLLEFAALKFSLDKFSDIIWGYPIEIETDCQALRDVLTNDKLNATHARWRD FT GVMAYDIREVRHVPGAVNIADGLSRQYEGVPRGDDEDSAWSVDPDCQRAAG FT VEDAVWGVEEVEDDQLNQLRTRFKEETYFSDILDALYDIKPSRKMRLRDVQ FT RARHRAKNYMVDEGKLWFVGGGTKIRAKARRECVTREEAVALAREEHERGG FT HWHRDGIKIALLDRIHRPGLDELIIQGIRTCPRCKNFGGSQLNALLQPITR FT RHPFELLVGDYLKMPVGKGGYHTVGLYLDTFSQHVWGYMFKTHGSAKTTVK FT SLKDIFQTFAPTEMFMTDGGSHFDNEEVRAVCSEWGAKTDVVAAYSPWVNG FT LVEGTNKLLLYVLARLCAPEVGEDGWQEMTWDDLPRTWTDHFEHAIRILNW FT RILPALKFSPKELLLGLVVNTVATPLEASASILPPTDVDTHMAYAAQQRLD FT GYSETVHHAIRRKAAFDRKVERSKDGPITFEEGQLVQVHRSDLFKTLAAER FT KLKPMWSAPKRVTKRLTNSYELEELDGTPIPGRFHARRLRLFEPREGTELW FT 
KEQQERSRAARGAVGGGAGETGEANGEPRVGEERDGEDGDRDGEEEEESLE FT ESRSGDGEETECGRESEGEDEGLDSVAGRLVARRRGVIHSSGRQAQSSQ" XX SQ Sequence 5984 BP; 1522 A; 1429 C; 2096 G; 937 T; 0 other; ccttccattc gacgacctat ccaccctctc catagtggtg acgagagcgg gatcgaccga 60 cgccgaacac cgacagcagc aagcaacaac ggagaagcag ccacggagcc ctgagcagag 120 ccagaacgcc gcagccacgt acccacctcg tgcgagaccc tggcacccca ttggaagtta 180 caaacgcgag tgaaaggcaa gtttggcccc aaagcccaac cagaaccagc gcacgcaccc 240 gaggaatcag tgccagcgaa ccccaacaca cagcccgaag tgtcatcgtc accggcagga 300 tatagcctag cttcttcact ccggtcttcc ccaatcagca gcaacttcag cctctcagga 360 gccgcgacac cagagagctt cgacctcgta gcactgtccg agacgacgtc gagccttcgg 420 ttcccgatat taccaacagt gcaggtgaag accgagggcg gaggagtaga ggggagtact 480 ggaaacagga cggaagaagc ggaacgagga aggatacagc agcccaggag gatgcaggga 540 ccgggtggga ttagcatgtt ccacggagtc gagacgaagg acgacgccaa cagcaaggac 600 ttcctcaaga aactaaggaa tttctttcga acgtcgggga tcgccacgga tacagacaaa 660 atcgaatgct tcgcggacca cctggccacg aattcaccag cggagcgatg gttcgacgac 720 ccaaagaccg ggaaggacac gtgggcggac gttcagagcg ggttttcggc gagattcccg 780 gtgccgaggg tgatcgagag gaccaacgag gacatcgaga gggagctgct ggggaagagg 840 ttgaagccgg gcgaactggg gaagaggtgg gagaggcgag gcggggaggt gtggtctcac 900 gaggccttcg ctgacgagtt gctggcgttg gcgaaggagg cgaggatcga ggagacaagc 960 acctacatat ggaaggtgcg ggaggagttc ccagcgagtc tgagggagag ggtgggcaca 1020 gggtacccga catgggtagt attctgcgca gcgttgaagg cggtggatgg gagggtgctc 1080 gcggaggtga tggagaggga gagggagagg gaggagaaga tcgcagcgat ggagagggca 1140 cagcaggcag cagcagcgga gacgagccgc cagatcgcag cgctgacggc tcaactcaac 1200 gcgttgacga catcgagcac cggcaacatc catcgacgac cagcgagcgg cccacagacg 1260 acgacacgcg caccggtggc gctggggagt cagaacccga gcgccggacc gactgtcagc 1320 aacgacaagt gggaaaggct gccagaggtg gtgactgagg agatgcgccg cagaacgcga 1380 gagctcctcg agctctaccc gcaacagccg aacacaccag agggcattcg agagtggaag 1440 aatcaggcgg ttcgttgggt cacggcgaac ccaggggaga ggaggtcaac gggactcacg 1500 gggttcccgt tgagcccgaa ggcgaggcca gtgtgctcgg gcgaatgctt caagtgtgga 1560 
cagggaggcg acgacgaccc gaggggtgcg catctgggca cggcatgccc gagaccgagg 1620 acggagtgga taccgagagc agaggggtca tggagagcgt ggtgcgcgag ggtgttgggt 1680 aggggtggag ctcagagggc ggtggcggtt caaatggtgg aggtgatgga ggatggagac 1740 gacgacgatc tcgccttctt aggcccggac ctggcagcac acgtgaagga ggcacaggga 1800 aaagcgtgag ggccgtccgc aacacgaacg acgaacggac ggcccgaggc actcgagtcg 1860 acgaggcctt gggaagcccc cctattccac gacatgtatc tatgttccaa atgaatacaa 1920 ttgtcgatgt gtatacggta aattcccaag tgagagaggc accattccaa cacccagtca 1980 acctacggaa gggcaggtgt aacatagtcg tgacggcgat gggaacagcg gacggcggcg 2040 cattgattgg agcaatgtgc tcgaggaggt atcgagagtt gccgatggag ctgaggaggg 2100 ccacgaagtc ggggaggtgg atgaggatgg ccgatggcgg gttggtgcca tccgaaggag 2160 tttggagtgg gtggctggag gtagagggga ttgcgatctc agtaaggctg gagattttcg 2220 acagtggagg atgttgggat gtgttgctgg gaaagcccat actgcgacag ctgagggcag 2280 tgcacgacta cgccaaggac atagtcgcgg tgaaagcgag ggaaggagat tcaccgacac 2340 tattggaatc caaggcgtca accaaagcta gtgcaactgt tggggatgat gatgcgccgg 2400 tagagaccag cccacacatc cccccaaggc gagtggacaa tgttatcgag gaggctacag 2460 ataaccacag acaacactta ccagactctc atacaaatgg aacaccagtg cacacagtcc 2520 tggatgaagg caccaacatc tttacgagac acacagagcc gttcaagccg gagagggtga 2580 agagagtcgt ggaagaggtt aggttgggga aggacttgac ggaggagcag agggcggcgg 2640 tgagggagct gatagcggag tttgcagact gcttcgccct gtcgctaagg gaagtcaatg 2700 ccgtgccagg ggcggtccac aagctcaacg taccggaggg tgcgtcgttc tcgacgaagg 2760 tgggagcacg gaaaagaacg ccagcacaac gccagttcat ggacaagaag atcgacgaga 2820 tgttggaggc gggggtcatc agaccaatac acccaagcga agtgcgctgc gcggcccaga 2880 cggtcctagc accgaaggag agggacgacg gactaccgct cgcggagctg caacacatgg 2940 tcaacgacca atgtgtttca catgggatgg agccaataga ggacctaccg ccgagaccag 3000 aaccgaggac accgagcggg gaagggaagc cagcgtcgtc atggaggata tgccacaatt 3060 ttcgggagat aaacaaggtg acggagatag cgcccatgcc ccaaggggaa atccggatga 3120 aacaacagaa cctctcaggg catcgataca tacacgtctt cgactttgcg gcagggttct 3180 acgccgtcac cgttcacgaa gattcacagc ctttcatcac gttctacgtg gaaggcaggg 3240 ggtactttgc 
atatgtgcgg atgccattcg gggtaacagg aggaccatcc tcgttcgcgc 3300 acctgacagc gtccaagttc catgacctag tggcagaggg tacatgcgag ttgttcgtag 3360 acgatggagg tgcagcatcc gataccttcg aggaggggat gaccaagctg aggaggattt 3420 tggagagagt gaggagggag aagctgtcac tatcggcggg gaagatgcgg gtgttcatga 3480 cggaggcagt gtttgcggga gcagttgtgg gacctggggg agtcacaccg gacgtcacca 3540 agcttacagc aatggtgaac tggccaacgc cgaaggacgc agcccatctg gaggggttcc 3600 tgggtctcac gggttacttc agggacctga taccggggta cgcgctcacc gagaaaccat 3660 tgcgggacct gttaaggagg gtaggagtgc caaaaggaac aaggaaggcc cagtaccagc 3720 gactcatgaa gggctacaaa ctggaggagg agtggcggga ggagcatacg aaggcgttcc 3780 tgacgctcaa agcaaggctg gtatccaaac cggtcttgtg cgcaccaaga tttgatggaa 3840 caccattcat catgacgacg gacgggtgta aggatgcgtt tgcgggggta ctgacacaga 3900 gagtcaaggt gacactgccg ggagggaaat cgaagacgaa gctgaacccc atcgggttcg 3960 tgtcgaagcg tacatccccg gcagaggaga actacaaacc tttcctgctc gagttcgcag 4020 cgctcaagtt ttccctggac aagttctcag acattatatg gggctacccc atcgaaatcg 4080 aaacagattg tcaggcgcta cgggacgtac taacaaacga caaactaaac gctacacatg 4140 cgcgatggag ggatggcgtg atggcgtacg atatcagaga ggtgcgacat gtgccggggg 4200 cagtgaacat tgcggacggg ctaagcaggc agtacgaagg ggtgcccagg ggggacgatg 4260 aggatagcgc gtggtcagtt gacccggatt gccagagggc ggcgggagtg gaggatgcag 4320 tgtggggggt agaggaagtg gaggatgatc agctaaacca gctccggacg aggttcaagg 4380 aggagaccta cttctcggac atcctggacg ctctatacga catcaaaccc tcgagaaaga 4440 tgcggttgag ggacgtgcag agggcgagac acagagcgaa gaactacatg gtggatgaag 4500 ggaagttgtg gtttgtgggc ggaggaacaa agattcgggc gaaggcgagg cgggagtgcg 4560 tcacgaggga ggaggcggtg gcgctggcga gggaggagca tgagagaggg ggacactggc 4620 atagggacgg catcaagatc gcactactgg accgcatcca ccgaccgggc ctcgacgagt 4680 tgatcattca aggaatacgc acctgcccac gatgcaaaaa ctttggagga tcacaactca 4740 acgccctcct acaaccgata acgaggcggc acccgttcga gctcctagtg ggcgactatt 4800 tgaaaatgcc cgttgggaaa ggcgggtacc acacggtagg actctacctc gacacctttt 4860 cccagcatgt gtgggggtac atgttcaaaa cccacggcag cgcgaagacg acggtcaagt 4920 cgctgaagga catcttccag 
accttcgcac cgacggagat gttcatgacg gacgggggca 4980 gccacttcga caacgaggag gtgagggcag tgtgcagcga gtggggagcg aagaccgatg 5040 tggtggcagc gtactcgccg tgggtgaacg ggctagtgga ggggactaac aagttgctgc 5100 tatacgtgtt agcgaggttg tgtgcaccgg aggttgggga agatgggtgg caggaaatga 5160 cgtgggacga cttacctcga acctggacag atcacttcga gcacgccata cgtatactca 5220 attggcgaat actcccggcg ctgaagttct caccgaagga gttgttgttg gggttggttg 5280 tcaacaccgt ggcaacccca ctcgaagcca gtgcctcgat tttgccacct acggacgtcg 5340 atacccacat ggcgtatgcc gcccaacagc gtctcgacgg gtattcggaa acggtccacc 5400 acgccatcag aaggaaggcc gccttcgatc gcaaggtcga gcggtcgaaa gacggaccaa 5460 tcacgttcga agagggccag ctggtgcaag ttcaccgctc ggatctgttc aagacgttgg 5520 ccgcggagcg gaagctgaag cccatgtggt cagcgccgaa acgagtgacc aaacggctta 5580 ccaactcata cgagctcgag gagctcgatg gcacgccgat tccggggagg ttccatgcca 5640 ggaggctcag gctcttcgaa ccgcgagagg gaacggagtt gtggaaggag cagcaggaga 5700 ggagtagagc cgcgagggga gccgtgggag ggggtgcggg tgagacagga gaggccaacg 5760 gggagcctag ggtaggggaa gagagagacg gagaggacgg agaccgagat ggggaggagg 5820 aggaggagag cttggaggag agcaggagtg gagacggaga ggaaacggag tgtgggcgag 5880 agagtgaggg ggaggacgag ggtttggact cggtggcagg tcgtttggtt gcaaggaggc 5940 gtggggttat ccattcttct gggcggcagg cgcagtcgtc tcag 5984 // ID Gypsy-11_LBS-LTR repbase; DNA; FNG; 496 BP. XX AC ABFE01000651; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_LBS_; KW Gypsy-11_LBS-I; Gypsy-11_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-496 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000651; Positions 128687 128192. 
XX SQ Sequence 496 BP; 114 A; 142 C; 93 G; 147 T; 0 other; tgtagtatgc tagtattgtg agtaacccct agcttctact ctctatcctt tccactcccc 60 tacatgcaac tatcattcgt tattatatgt ttacttcact tgtgttacga tgtttacttc 120 acttgtgtta cgcttacgtc atctttgtac atatacctca tgtagcccat gaggcaatcg 180 atatctttca ccatactatt gcccacacag ctcgactcga aagccctact aacggcttca 240 actcggactc taggcggtac gtacgaaaga gacgttccat ccactataat cgattagagc 300 cttctccaac gggactgtcc gtggagatcg gagcacgggt actgtgaact gtcacgttcc 360 acggctccca gcgtctaatc gatcacattg cacgacctac ttcgagggtc ttcacacctt 420 tggttgcgtt gtccgttaga cgttccaaca gccgtcccac taccttagcg gcactgacga 480 tcaatagtgt tccaca 496 // ID TCN6-I repbase; DNA; FNG; 4354 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 30-MAR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR retrotransposon - internal consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW reverse transcriptase; TCN6-I; internal portion. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-4354 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-4354 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-4354 RA Gentles A. and Jurka J.; RT "C. neoformans LTR retrotransposon TCN6."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX CC 275 bp LTR deposited as TCN6-LTR. 99% average similarity to CC consensus. 
XX FH Key Location/Qualifiers FT CDS 256..4354 FT /product="ORF1p_TCN6" FT /translation="SSESYTHIMSNNTDNNTSYQGVILTGPQNYAEWELSI FT KTSLILKDLEIGTPVSLDSMSSKEDKEKYKRSRQAFALLIKSLSPEVQASL FT PANIRSVETADSSALWEELKSQYSAAVGARQAQLLQQMWISPVIEGEDPNK FT RMAEIRSAHAQINSSGENLSDRMLAYAMTLALPESFTTVKQTLWLREPLTS FT SAVQAAVQAEWTRRSSEEVAMANRVQESHTQGNKSNRRAKPREWTEKWCSI FT HKVPTHNTRDCSLRKFSNNHSQSGQARVGTGESAKAEELASTFHAVAATTT FT CFHDNSFIVDSGASHHMVSNKGLLHNYGPATIVRSVKIGNGTILPVAGQGT FT MTIGATTLQQVLHVPQLQCNLLAVNKVPSGFHWAFSSSKGELFDQSNQVCL FT SAPFKDGAYSLQVDPSGHAYPVQLADTLSEWHHKLGHLGIEKVVRLAKEGR FT LGQNNQLKNANAGDTKDFYCEACIKGKTGRLPSPPEPNIRASRPLELLHID FT IWGPAPVASKGGMRYFLTVYDDYSHRISLTLMRMKSETLQSFKNFVNHAET FT QTGHKVQSIRSDRGGEFTSAAFQRYIREKGIEHVMVPPDAHAQNGRVERAH FT LTILNGVRTLLVETGLPASFWGEAAKYIAFSRNCSFDSNNGIPFERWYGKK FT LLFTQLHAFGEKIFFRDYTNTNKLKPRYRQGRFMGYASDSSTSSYRILDDE FT DKKIKVSRDTILPKKLDIPHSKMPARGSILRLEEWDDNKNRDTSTHAQDPL FT AEIPDRIIPAPMEVNPQDEVRPRAVPAEHQSPMELLRRLARYDTGIQQQPE FT ASPSSNASEETNSDVSSIDPLALTDNPETWGTIATALNAMAINTPNTYREA FT IQSGEGEQWHQAMMEELEKMEKYNVWKVVDRAPGQRVLKARWVYTRKIDGT FT TGKPAAYKARWVAKGFSQKAGIDYNEVFSAVAHKDSIRVFLSLVNHLDMEC FT DQVDIKAAFLNGDLEETIYLEAPEGSDIPANKILLLNKSLYGLRQSPRCFN FT KALDQWLKSQGLKPTRADPCFYVRRCEGELLMLSVHVDDQLIACNSRKTLD FT EFKQALNSKFECSDSGPAGYFLGINIYRDRPQKKLYLSQEHYMESLLDRFD FT MSNCNPAKTPLPSGFKPIPATDQEHELAQHRPYPKLVGSILYAATVTRPDL FT AHAASVLSRFISKWNESHWLAAKHCLRYIRGTSDLSLTFDAHSSKRVALGY FT VDADWGGDLDTRRSTTGYVFKIYGGVVGWKSKRQPTVALSTTEAEYMASAD FT AAKQAIWLRLLLDDIGLGLGDQPLQLLNDNAGAIALSKNPVNHEKSKHIDM FT RHHFIREKVEDKAISLAHVPSVENIADLLTKSLPAEAFVKLCHLLGMQRLD FT QGGX" XX SQ Sequence 4354 BP; 1254 A; 1055 C; 1036 G; 1009 T; 0 other; ggttatgagc cttgctgctt aacagcacta gactctagca aacacatcag tcaaccaaca 60 caaatcctgc aggtcactat tgcattagta ccacaggtac tgcttgatac caacaagtgg 120 tgagaaaatc tggtgccccc cagtggcaag tcactgcgcc tagtgcacaa gcaaacttgc 180 cttgcgttag ctaagtacaa cctggacgtc tctttatctc tattacaaag aacacgactc 240 tcacttctcc tatagagctc tgaaagctac acacacatca tgtccaacaa tacagacaac 300 aacacatcct 
accagggggt catcttaact ggcccccaga actatgctga atgggagcta 360 tctatcaaaa catctctcat cctcaaggac ctggagattg gtactccagt cagccttgac 420 agcatgtcct ccaaagagga caaggagaag tacaagaggt ccaggcaagc ttttgccctt 480 cttatcaaat cactctcacc agaagtccag gcatccctcc ctgccaacat acgatctgta 540 gagacagcag acagctcagc actctgggag gagctcaaat cccagtactc agcagcagtg 600 ggcgcaagac aggctcaact tttgcaacaa atgtggatat cccctgtcat tgaaggagaa 660 gacccaaaca agcgcatggc agaaatcaga tcagcccatg ctcagatcaa cagcagtggt 720 gagaacctct cggaccgaat gcttgcctat gccatgaccc tagctcttcc agaatcattc 780 accactgtca agcaaactct ctggctcagg gaaccactta cctcttcggc tgtgcaggca 840 gcagtccaag ctgagtggac aaggaggtcc agtgaggagg tagcaatggc taacagggtg 900 caggagagtc acacccaagg caacaaaagc aatagaaggg ccaagccaag ggaatggact 960 gagaaatggt gcagtatcca taaagtgcct actcacaata ccagagactg ctctctcagg 1020 aagttcagca ataaccacag tcagtcaggt caggctaggg tgggtaccgg tgagtctgct 1080 aaggcagagg agctggcatc taccttccat gctgtagcag caactacaac ctgtttccat 1140 gacaactcct tcatagtgga ctcaggggct tcacaccaca tggttagcaa caaaggctta 1200 cttcacaact atgggccagc aacaattgtc agaagtgtaa agattggtaa tggcactatc 1260 cttccagtcg ctggccaggg aactatgacc attggagcta ccactcttca gcaagtcctc 1320 catgttcccc aactccagtg caaccttctc gctgttaaca aggtcccatc tggtttccac 1380 tgggcattca gcagcagtaa aggtgaactt tttgatcaat ctaaccaggt ctgcttgtct 1440 gccccattca aagatggtgc ctactccctg caggtagatc catctggcca tgcctatcct 1500 gttcaactgg ctgatactct ctcagaatgg catcataagc tagggcattt ggggattgag 1560 aaagtggtta ggcttgcaaa agaagggaga cttggtcaga ataatcaatt gaagaatgca 1620 aatgctggtg ataccaaaga tttctattgt gaagcgtgta ttaaggggaa aactgggcga 1680 cttccttcac cacctgaacc caacatcaga gcttccaggc cacttgaact attgcatatc 1740 gacatctggg gaccagcacc agtcgcttcc aagggaggta tgcgatactt cctgactgtc 1800 tatgatgatt acagccaccg gatttccctc accttgatga gaatgaagtc tgagacactg 1860 cagagtttca aaaactttgt caatcatgct gaaactcaga ctggtcataa ggttcaatct 1920 attcggtctg ataggggcgg agaatttaca tcagcagcat tccagaggta tatcagggag 1980 aaagggattg agcatgtcat ggtgccacct 
gatgctcatg cacagaatgg gagagtggag 2040 agggcgcatc ttactatcct caatggtgtg agaactttgc ttgtggagac tggactgcca 2100 gctagcttct ggggtgaggc tgctaaatac attgccttct ctcgcaactg ctcctttgac 2160 tccaacaatg gaattccctt tgaacgctgg tatggcaaga aactgttgtt cactcaactt 2220 cacgcatttg gagaaaagat tttctttagg gattatacca acaccaacaa attgaagcct 2280 cgctatcgac aaggaaggtt tatggggtat gccagtgata gcagcacttc cagttatcga 2340 attctggatg atgaagacaa gaagatcaag gtgtcaaggg ataccatttt gccaaagaag 2400 ctggacattc ctcattccaa gatgccagca agggggagta ttcttaggct ggaggagtgg 2460 gatgacaaca agaacagaga cacatcaaca catgctcaag atccattggc agaaatacct 2520 gacagaatca tccctgcccc catggaagtg aatcctcagg atgaagtaag acctagagca 2580 gttccagctg agcatcaatc acccatggag cttcttcgta ggctggcaag gtatgacact 2640 ggaatacagc aacaacctga ggcatcacca tcatccaatg catcagaaga gacaaacagt 2700 gatgtttcat ccattgatcc tcttgccctg actgacaatc cagagacctg gggtactatc 2760 gcaacagctc tgaatgctat ggcaatcaac acacccaata cctacagaga agctattcag 2820 tcaggcgagg gggagcagtg gcatcaggct atgatggaag aattggaaaa gatggagaag 2880 tataatgtgt ggaaggtggt ggatagggca ccaggacaga gggtcctgaa ggcaagatgg 2940 gtttatacca ggaagattga tgggacaaca gggaaaccag cagcatacaa ggcaagatgg 3000 gttgccaaag gattttctca aaaggctgga attgattaca atgaggtctt ctctgctgtt 3060 gcccacaaag actcgattag ggtattcctc tctctggtca accatcttga catggaatgt 3120 gaccaggtgg atatcaaggc agcctttctc aatggcgatc tcgaagaaac catctatctt 3180 gaagcaccag aaggaagcga catcccagca aacaagatcc tactcttgaa caagtcactc 3240 tatgggttgc gacagtcacc aagatgtttc aacaaagcac ttgatcagtg gttaaagtct 3300 caagggctga agccaaccag agctgacccc tgcttctatg ttcgacgttg tgagggggaa 3360 ttgctcatgc tctctgtcca tgtggatgat cagctcattg cctgcaattc aagaaagaca 3420 ttggacgaat tcaagcaggc acttaacagc aaatttgagt gctctgactc tgggcctgct 3480 ggctacttcc tgggcatcaa tatataccgg gacaggccac agaagaaact ctatctctcc 3540 caggagcatt atatggagtc cctgcttgac cgctttgaca tgtccaattg caatccagca 3600 aagacgcctc ttccttcagg cttcaaaccg attccagcaa cagatcaaga gcatgagcta 3660 gcacaacatc gaccatatcc aaagctagtg ggctctattc 
tctatgcagc aacagtgaca 3720 cggccagatc tagcccatgc agcaagtgtt ctgtcaagat tcatcagcaa atggaatgag 3780 tcccattggc ttgcagcgaa acattgtctg aggtatatca gaggcacttc agatctttca 3840 ttgacctttg atgcccactc aagcaagcga gttgcactgg gatacgttga tgcagattgg 3900 ggaggagacc ttgacacaag gaggtccacc actggttatg ttttcaagat atatggaggt 3960 gtagttggct ggaagtcaaa gaggcaacct actgtggctc tctcaaccac agaggccgaa 4020 tacatggcct ccgctgatgc agcaaagcag gctatttggc taaggctgct cttggatgat 4080 ataggacttg gcctgggtga tcagccactc cagctgctca atgacaatgc tggggccatt 4140 gccttgtcaa agaatccagt gaaccatgaa aagtctaaac acattgatat gaggcatcat 4200 ttcattcgag agaaggttga ggataaggct atctcacttg cccatgtccc atcagtggaa 4260 aacattgctg atcttctgac aaagagcctt ccagctgaag cttttgtcaa gttgtgccat 4320 ctccttggaa tgcagaggtt ggatcaaggg ggag 4354 // ID Copia-12_MLP-I repbase; DNA; FNG; 4384 BP. XX AC AECX01002368; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-12_MLP_; KW Copia-12_MLP-LTR; Copia-12_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4384 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002368; Positions 17702 13319. XX CC Positions [1813-2313] - Integrase core CC 'AAATC' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 430..4254 FT /product="Copia-12_MLP-I_1p" FT /translation="MSGTTPSSPPLTYESDKSKICATLARYVMPANLVILR FT KHRGDPRGMWKALKEAHEGNTAGSCMYWLEKLVTFKMETNNVNTELDRILS FT ISERLNSIVTTQRPLTVDEILSMVVCLMLPASFKPTVTPLLQRDEITSTQL FT ITAIREEVTITSISQSTVDSSISIESVNKASESKPETTKSKTCNYCRERGH FT LISECPKKLTRSQKYNPALDEMKQEIEKLKAQLEKSNKAEKASVANSIPPR FT GDSDENVPSVVDDDSDFSYAIQNISEKVPSSGTIVVDSGASSHMLPSSDFV FT YESKSNHTTVQLADGSSITSTSKGCFNPGFSDGSLHQSLIIPKLKEPLMSV FT SKLSDAGIVSVFNNKECVFYQSPTITGNIVGRAPRRGGLYYKSYVRTSNNS FT LEQCKNAKTDLLYWHRRFNHPNIQTLKRKLKLAGIEVPNSSIKSCSECRIC FT IQGKMRRRNFHSCSEYKSKKPFSVIHSDVSEYSTVGRDGCKFFVFFLDDYS FT KFARSFPIRHKSDTFNCFKKFINEVESIPGVKVHELRSDNGGEYTSNKFAS FT FCSDRGIQQTMGPAHCPQLNGVSERWNRTIKEKIRCSLIESNFPVTFWPYA FT LSYCTESYNHLPTCTNEKFTSPVTLSGFPERKPEDFHSFGCEVWYHVPDTK FT SKLESRARQGVFVAYLKNNRGYYIFDVSNQNLIKSTSAKFFESNFPGISIL FT SSSPPLNSQPQTLPWPELESERPHPVSLPKIQPNISNRLPIRQLPTRTRRP FT PDRYGEYGNVACEPDPKSYKQAKKSLNWEAWRKAAVSEFESLVGKETWRLV FT PRPACRRIIRCKWVFKTKRNVDKSVDKLKARLVALGFSQIKGIDFHEVFSP FT TSRQESLRVLITIMAQKGWKARSVDIKTAFLNGDLDEHIYMEQPEGFIDPD FT RPEWVCQILRSLYGLKQSPRAWNKTLHEFLISLGLVQSSHDPSLYLRKNND FT QLSGLIVVHVDDLSITGTDEFISSTCKSLHGRFEISKDEPLSHFLSLQINR FT ESQTTASISQSHYIEDLAEKFLPSDFKPCKTPTTDAFKLLVPSTEDEEFQQ FT PYSSLIGGLLWVAQCSRPDISFPVNRLSQFLQKPNHSHWESALRILAYLNS FT TKDKKLLLGGKTLDAVAYSDADWAEDRHKRKSTTGYVFKLGIGAISWRSRK FT QKTTFLSSTEAEYMAMSDLCREARWLFYMLGELGVIKDKAITVCCDNEGAE FT ALAKNPSHHSRTKHIHTRYHFVRGCVKDGVVNLCHVLSADMCADMFTKALP FT KVLLEKHRNALNII" XX SQ Sequence 4384 BP; 1298 A; 997 C; 801 G; 1288 T; 0 other; ggttatgagc cccagtcgct ctccgatcta acaccaatca tcaagttttt ttttcttcga 60 atcacttctg tcttggaaag gtccagaagc tcgaactcta ttgattcacc gctaaaacaa 120 tacaatcgat tgatttaaaa ctttaaactt tatctcatct acaatcttca acataaaagt 180 taactttatt taatctgcat tcaacaaatg gatccatctt cttctcctgc tgctggtaac 240 tcagcctcag ccggaactgg tgaaaaacaa aaagatttct cccctccaac tcaactgact 300 caataccgcc gggatgatct tcacaccgaa ggtattgaaa agttaactgc tccgggagtt 360 gggtcaaact ggttagactg gtctttcatg atggatacat 
gtattagtgc ataggtgtat 420 gcttatgtaa tgagtggtac aactccctct tctccacctt tgacgtatga atccgacaag 480 tctaagatat gtgcgactct tgcacgttat gtaatgcctg caaatcttgt gattcttcgc 540 aaacatcgtg gagatcctcg aggaatgtgg aaggcgttga aggaagcaca tgaaggtaac 600 acagctggtt cttgtatgta ctggttggaa aaactcgtta ccttcaagat ggaaaccaac 660 aatgtcaaca ctgaacttga cagaatcctc tcaatctccg aacgactgaa ttcaattgta 720 accactcaac gcccacttac cgtagatgag atcttgtcta tggttgtatg tctaatgctt 780 cccgcttctt tcaaacctac tgtcacaccc cttcttcaac gcgacgaaat cacatcgact 840 caattgatca ctgcaatccg agaagaagtg acaataacct caatctcaca gtctacagtt 900 gattcgtcaa tatctattga atcagtaaac aaagcctcag aatcgaagcc cgagactact 960 aagtcgaaga cgtgcaacta ttgtcgcgaa cgaggacacc ttatctctga atgtcctaag 1020 aaacttacac ggtctcagaa gtataatcca gctctagatg aaatgaaaca agaaattgag 1080 aaattaaaag ctcaactaga aaaatcgaac aaagctgaaa aagcttctgt cgctaattct 1140 atccctcctc gtggtgactc tgatgaaaat gttccatctg tagtagatga tgattccgat 1200 ttctcatatg ccattcaaaa catttctgag aaagttccgt cttcgggcac tatcgtagtt 1260 gattcaggag cctcctccca catgttacca tcctctgact tcgtgtatga atctaaatcc 1320 aaccatacaa ctgtccaatt agccgacggt tcatctatca cttcgacatc aaaaggttgt 1380 ttcaaccctg gtttttctga tggtagtctt catcagtctc ttatcatccc taaactcaaa 1440 gaaccactca tgtcagtttc aaagctatcc gatgctggaa ttgtatctgt gttcaacaac 1500 aaagaatgtg ttttctacca gtcgccaact atcaccggca atattgtggg tcgtgctcct 1560 cgacgcggtg gtttgtatta caaatcatat gtgcgtacct ccaacaactc attggaacaa 1620 tgcaaaaatg ctaaaaccga cctgctttac tggcacaggc ggttcaatca tccgaacatc 1680 caaactctga aacgaaaact caaattggct ggtattgaag tacctaattc atcaattaaa 1740 tcctgcagtg aatgtcgtat atgtatacaa ggaaaaatgc gcagaaggaa ctttcattct 1800 tgttcagaat acaaatcaaa gaagccattt tctgtcattc attcagatgt ttcagaatat 1860 tctactgtgg gacgtgatgg atgtaaattt tttgtgttct tccttgatga ttattccaag 1920 tttgctcgta gttttccaat ccgtcataag tctgatacct tcaactgctt taagaaattc 1980 ataaatgaag ttgagtccat tccgggtgtt aaagtgcatg agctacgatc ggataatgga 2040 ggagagtata catcaaacaa attcgcatct ttttgctctg atcgtggaat tcaacagacc 
2100 atgggtcctg ctcactgccc tcaactgaac ggagtctctg aacgttggaa tcgcaccatc 2160 aaagaaaaga tcagatgctc actcattgaa tctaatttcc ctgtcacctt ctggccttat 2220 gcactgtcgt attgcaccga gtcatacaat cacctaccta cgtgcacaaa tgagaaattt 2280 acgtctcctg tgacactgag tgggttccca gaacgcaaac cggaggactt ccactcgttt 2340 ggctgtgaag tttggtacca tgttcctgat acgaagtcaa aactagaatc ccgagcccgt 2400 caaggagtgt ttgttgcata tctgaaaaac aatcgaggat attatatatt tgatgtatca 2460 aatcaaaact tgatcaaatc tacatctgca aaattctttg aatcaaattt tcctggtatc 2520 tcaatattat cttcctcccc tccactaaat agtcaacccc aaactcttcc ctggccagaa 2580 ctggagtctg agcgacctca tcctgtttcc ctacccaaaa tccaacccaa tatctccaat 2640 cggcttccta tacgtcaact tccaactcgt actcgtcgac ctcctgatcg gtatggtgaa 2700 tatgggaatg ttgcttgtga acccgaccct aagtcctaca agcaagcaaa gaaatcactc 2760 aattgggagg catggagaaa agctgcggta tcagagtttg agtcacttgt tggaaaggag 2820 acatggcgtc tggttcctcg tcctgcttgt cgtcgcatta tacgatgcaa atgggtgttc 2880 aagactaaac gaaatgttga caagtcagtt gacaaactaa aagctcgtct tgttgctcta 2940 gggttctccc aaattaaggg aatagatttt catgaagttt tttcacctac aagccgtcaa 3000 gagtctctcc gtgtcctaat aaccatcatg gcacaaaaag gttggaaggc caggagtgtg 3060 gacatcaaga ctgctttcct gaacggtgat cttgatgaac atatctatat ggagcaacct 3120 gagggattta ttgaccctga tcgaccggaa tgggtttgtc aaatcttaag atctctttat 3180 ggtcttaaac aatctccaag agcgtggaac aagacacttc atgaattcct aatctcattg 3240 ggtttagttc aatcatctca tgacccatcc ctatatctac gaaaaaacaa tgatcaactt 3300 tcaggcctaa ttgtagtaca tgttgatgat ctatcaatca ccggaaccga tgaatttatc 3360 tcctctacat gcaaatctct tcatggacga tttgaaatct ccaaggatga gcctctttct 3420 catttcctat cacttcaaat taatcgtgaa tctcaaacaa ctgcatctat ctctcaatca 3480 cactacattg aagacctggc tgaaaaattc cttccatctg atttcaaacc gtgcaaaact 3540 cccaccaccg atgccttcaa actacttgtt ccatctactg aggatgaaga gtttcaacaa 3600 ccgtactcca gccttattgg aggtcttctg tgggttgctc aatgttccag accagacatc 3660 tcatttccag taaaccgtct ctcccaattt ctacaaaagc caaaccattc tcactgggaa 3720 tccgcacttc gaattctggc ttacttaaac agcacaaaag ataagaaact tcttcttggt 3780 
ggaaaaactt tagatgctgt tgcttactca gatgcggact gggcagaaga tcgccacaaa 3840 cgcaaatcaa caacaggtta cgtgttcaag ttaggtattg gagctatatc atggagatct 3900 cgcaagcaga aaacaacgtt tctatccagt accgaagccg aatatatggc catgtcagac 3960 ttgtgtcgtg aagctcgttg gcttttttat atgttgggtg aattgggtgt gattaaggat 4020 aaagcaatta ctgtatgctg tgataacgaa ggagcagagg cgctagcaaa aaacccctct 4080 catcattcaa gaacaaaaca catacataca agatatcatt ttgtaagagg atgtgttaaa 4140 gacggagtag taaatttatg ccatgtcttg tctgctgata tgtgtgcaga tatgttcact 4200 aaagcactac caaaggtact attagaaaag catcgaaatg ctctaaatat aatttgatca 4260 tagttcattt ctttctttct ttctttaaaa tatatttcat tatattattc aattattttt 4320 gcttctcatg attctttttt tttcttttct tttcggtcta atctagtcat cagcaagggg 4380 gggg 4384 // ID Gypsy-5_MLP-LTR repbase; DNA; FNG; 372 BP. XX AC AECX01002110; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_MLP_; KW Gypsy-5_MLP-I; Gypsy-5_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-372 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002110; Positions 3119 3490. XX SQ Sequence 372 BP; 90 A; 68 C; 72 G; 142 T; 0 other; tgtaaggggt tggaagtatg acatacagta catgggttta cattatgggt tataataggt 60 agatagatta tttagagtct tccgccccct tacaagttgt tttcttttct cctttgtgga 120 tcttacttat cttactcttt tatcatctat tacgttatac tggttttctg atcactgaag 180 tgctggacta ggtatggtgg atcttactta tcttactctt ttatcatcta ttacgttata 240 ctggttttct gatcactgaa gtgctggact agataattga atcaagggga ttagataaat 300 ccctcgctct ttctgagctt gtcccattac tcgaaactct tagtgaaggt ggagagtaca 360 ctccacctta ca 372 // ID Harbinger2-1_TSt repbase; DNA; FNG; 2567 BP. 
XX AC . XX DT 13-AUG-2010 (Rel. 15.09, Created) DT 13-AUG-2010 (Rel. 15.09, Last updated, Version 1) XX DE A family of autonomous Harbinger transposons - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW Interspersed repeat; Harbinger2; Harbinger2-1_TSt. XX OS Talaromyces stipitatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Talaromyces. XX RN [1] RP 1-2567 RA Kapitonov V.V. and Jurka J.; RT "Harbinger2, a novel clade of Harbinger transposons in protozoan, RT fungi, choanoflagellate, and metazoans."; RL Repbase Reports 10(9), 1220-1220 (2010). XX DR [1] (Consensus) XX CC Harbinger2-1_TSt belongs to a novel clade, Harbinger2, of CC Harbinger DNA transposons. This clade includes transposons CC present in protozoan (brown alga), fungi, choanoflagellate, and CC metazoans. CC Harbinger2-1_TSt is a consensus sequence of a family of CC autonomous Harbinger transposons that were active in the CC Talaromyces stipitatus genome very recently. The consensus was CC derived from 3 copies ~99.9% identical to it. They are flanked by CC the TWA target site duplications. This transposon codes for the CC 268-aa TPase and 317-aa unclassified protein. XX FH Key Location/Qualifiers FT CDS 529..1332 FT /product="Harbinger2-1_TSt_1p" FT /note="Harbinger TPase." 
FT /translation="MALAILLARLSYPRRVRDLEIFFGRSFGYISTIFNDV FT LQHLYRRYKRLLEWHPLLTQERCKAYAQILEAQGAIPRGWGCIDGTFRATC FT RPSKNQRIAYSGYKKRHGFKYQGIITPDGMVLSLIGPFEGKIADSNIFQIS FT RTDERLEAIAGNEKQLLLYGDIGYQGCPFVLTPFPKIGELSARQIEINDRI FT SAHRISVEQIFGKVQNQWHGNALYTNMQPIKQAVAAYYAVSILLTNIASCL FT RENLVSKRYGIETPSLEEYLGVLDNEP" FT CDS 2285..1335 FT /product="Harbinger2-2_TSt_1p" FT /translation="MDSQISTSNESQLAANIAAAALQATEYIIITSSPPPP FT VRSQPTTDTQGTTDSQATSYNSTDDSDSNEASISQNTASQVKNNKRGRSLK FT DEELSLLFKCALDLKLDYKPKRKYWEAVEDRFVRLIGHSYSWKSCKSQIER FT LSKKRRLYLAQYVTGREAEATSELDELIDQWNDFIDGYEKDEAEKLAEKNK FT YKENSQLVLAYRDQLVSTGLQKSKKQAPEELKPSEDDTGSDNRKVPEYRKT FT ATASRPSKKRSIQEAIFALVDVLEEDRQASKKPESVAKKSELERLSGDVKE FT LQNQYKNLDEKLDKLISMIGEKRTG" XX SQ Sequence 2567 BP; 698 A; 554 C; 519 G; 795 T; 1 other; agtggtgttc cgcagttttg tcgaaccgaa ctcgattctg ccggacaaaa ccgattacgg 60 ttagcgtatc ccaaaataat ctagcgctgt actatgccta aaataatatt aagtaaacct 120 ccggaagctc aaatccatat caatttacat ggcgtatata aagattatag attagaagtt 180 tatattttga caccmtcacc gtttagctat ggcgctcgtg ttatcaagaa gacagaaaat 240 acttattatt ttgattatac tgctaagttt actggctgag gacgaacatg ctggtagatt 300 accccttcga gaatcctatt atacaccatt catattcgat attgatatgc ttagcgatac 360 gtgcctacga gagaatatac ggtttggata tgcctaaatc taaaaatatt acagtagcta 420 actccgcgtt ctattattag atttgaccgg caagaattct gtcaaattct tccttatttt 480 gagcttcata caatttcata tactggacga agaacgccgt cgccctctat ggctcttgca 540 atattgcttg cacgcctgtc ttaccctcga cgagtacgag atcttgagat tttttttggg 600 cgctcatttg gctatataag taccattttt aatgacgtct tacaacacct ttatcgtcgg 660 tacaaaaggc tattagagtg gcatccttta cttactcaag aacgctgtaa agcgtacgcc 720 caaatccttg aagctcaggg cgctattcct aggggctggg gttgtattga tgggactttt 780 agggcgacat gtcggccatc taaaaatcaa cgtattgcct attcaggata taagaaacgc 840 catggcttca aatatcaagg aatcatcaca ccagatggca tggttctctc actaattggg 900 ccctttgagg gtaagattgc agattcgaat atattccaaa tatcaagaac agacgaacga 960 cttgaggcta tagctggcaa tgaaaaacag ctacttttat atggagatat tggatatcag 1020 ggatgtcctt ttgtcctaac 
accatttccg aagattgggg aattatctgc acgtcaaatc 1080 gagataaacg ataggataag cgctcaccga atatctgtag aacagatatt tgggaaagtt 1140 caaaaccagt ggcatggaaa tgcgctttat accaatatgc agccaataaa acaagccgtt 1200 gctgcttatt atgctgtatc tattttactc actaatatag cctcatgttt acgtgagaat 1260 cttgtgagta aaagatatgg aattgagaca ccttcgttag aggaatatct tggtgttctg 1320 gataatgagc cctaacccgt acgcttttcc cctatcattg agattaattt atcgagcttc 1380 tcatcaagat ttttatactg attttgaagc tctttgacat ccccagaaag ccgttccaat 1440 tcgctttttt ttgcaaccga ttcaggcttt tttgacgctt gtctatcctc ttctaatacg 1500 tcaactaagg cgaatatcgc ttcctggata gaccgctttt ttgacggccg cgaggctgta 1560 gcagtcttac gatattctgg tactttccta ttatcagacc cagtatcatc ctcagacggc 1620 ttaagctcct ctggcgcttg ctttttggat ttttgcaagc ctgttgagac cagttggtct 1680 cgatatgcaa gcaccaactg cgaattttcc ttatatttgt ttttttctgc caacttctca 1740 gcctcatctt tctcatatcc gtcaataaag tcgttccatt gatcgataag ctcgtccaac 1800 tcagatgtag cctccgcctc tctaccagta acgtattgag ccaggtataa gcgccgtttc 1860 ttagagagcc tctcgatctg agatttgcac gatttccacg aataagaatg tccaataagg 1920 cgcacaaatc tatcctcaac agcttcccaa tattttctct ttggtttata atcaagtttg 1980 aggtccaacg cgcatttaaa cagcagagaa agctcttcat ctttgagcga tctccctcgc 2040 ttgttattct tcacttgaga tgccgtattt tgactaatgg aagcctcatt cgaatccgag 2100 tcgtcagtac tgttatatga agtggcttgc gaatcggtgg taccctgggt atcggttgta 2160 ggctgcgaac ggactggcgg cggtggtgag gaggttatta taatatattc ggtggcctgc 2220 agcgctgcag ctgctatatt agcagccaat tggctctcat tcgaggttga gatttgcgaa 2280 tccatacata tactttttga aattgcttta tggatattgt gctagtaaag atatgtatgt 2340 aactaacatt taaattaatt ccaagcttac gtaccccaac aggccagagg tttagtcaca 2400 tgacttagcg tcttggcgaa agttcgacaa aatgtcgaac agttcgacat tttgtcgaac 2460 ttttgccaag aaaattatgt aatcagtgtt caccgcttgc tcctttgaga gggctgggcg 2520 ccaagccgag ttcgacaaaa gttcgacacc cagtgcggaa caccact 2567 // ID LTR-2_AN repbase; DNA; FNG; 179 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 
8.11, Last updated, Version 1) XX DE Long terminal repeat of a LTR retrotransposon - a consensus DE sequence. XX KW LTR Retrotransposon; Transposable Element; LTR-2_AN; solo LTR. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-179 RA Kapitonov V.V. and Jurka J.; RT "LTR-2_AN, a family of solo long terminal repeats in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(11), 193-193 (2003). XX DR [1] (Consensus) XX CC LTR retrotransposon. Solo LTR. CC 5-bp TSD, several subfamilies, ~100 copies. XX SQ Sequence 179 BP; 54 A; 49 C; 39 G; 37 T; 0 other; tgttgccata tcccaaagcc tggctcgcag gatgatggcc tgtggcccgg acgatagccc 60 gtgacccagg cggtcggccg tcgggcagaa taataaatga tatcactagc gatagcttca 120 gaacaacaca acaatcaatc atacttcata catcaatcga actttgcaca attgcaaca 179 // ID Gypsy-2-I_AF repbase; DNA; FNG; 7487 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Internal portion of the Gypsy-2_AF LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-2_AF; KW Gypsy-2-LTR_AF; Gypsy-2-I_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-7487 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-7487 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-2_AF, a family of gypsy LTR retrotransposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 62-62 (2006). 
XX DR [2] (Consensus) XX CC This is a an internal portion of the Gypsy-2_AF LTR CC retrotransposon. Gypsy superfamily. ORFs coding for a Gypsy-like CC polyprotein are corrupted by numerous stop-codons. XX SQ Sequence 7487 BP; 2857 A; 1065 C; 1180 G; 2385 T; 0 other; tttagtgacc agtgccgggc ctaaataaat cctatatttc ctttattatt atttctatat 60 tagtcctgtg agtgtgagta ttaagtagct acaaatcctt agagtaacta ggggctagat 120 tcacagagag ttaacttagt agatttcctc tttaaactta ttctatttca gattttcttt 180 cctttaatta gaattattag attcatctcc tatattcttt ataagaaagg ctaggaagca 240 taaagcacct tagtagttac taagatagac ctatttttat ctatctactt agggtagcgg 300 cctgtgtagt tagctcctag gagtaatatt agtaagcctt aggcagtatt atattaccta 360 ccttctattt taggattagc cctagagtgt acctattcct tctactagta actagaacag 420 attagggaag gtatacctaa taatactcta cttaattagc tataagaggt agttagagta 480 ttctaaacct tattacatac ctctatatac agtaaattag gtaatctatt attatctata 540 gttaaggaga aacctttaca gctaactata aattatttct atattataat attaaactaa 600 cctaccttct taggtaaacc tggggaagat atagatcttt atattaacta gtattagttt 660 atataggcta gtataagctt agaccctaag gaaaagaagc aggcaatagc tactatactt 720 tttataggcc taagagaagc tactcttaga tttagctata ccttacctaa gacagagagg 780 aaggactagg agaaattagc ctaatatctc taagcaaggt tcctagaata agagctaagt 840 aattagatat aggatattat tataaggctt tcagagttat agtagggaat aaaaagtcta 900 agggggtata ttaataaagt ctaagagctc acaatgtcta ttctagataa ttagggttaa 960 tctatcagtt taatattcgc ctagggacta ttagatccta taatataaaa agttctaaat 1020 acttatatat aagctcagaa agtaatatct aagatagtta aaattaaaga actaatttat 1080 atagcaagag tatataaaga caaggtaact attattaagg aggtgactat atctagatcc 1140 taagataaga tatttacaga cacccttaat aattaaagct aactaattag gactcttact 1200 aaatagcttt taaggattaa tctatcctag ggttagtcct agtagttatc tataagttag 1260 taacagcagt agtggcagaa tataggacag agagtatact ttaagtatag gtaaataggc 1320 tatattatac taatctatat aaatcctcta ttacctacag aggaataata atagattaga 1380 taagaaacta taggttaagg ctagggttaa ctctagttat agggataata tagatagcct 1440 tagataaatt acagtacttt agagataaat agactaattt 
agccttaagg aggagtattt 1500 agactagagt ctaggtatta agtagaccta actactagta taacttcagc agctattata 1560 ggagtagaat taaataatat tatatatatc tagtttatag atagcgaaag taaagaagta 1620 gctaaataga atactattaa tagagttaat atagtgcttc tcagctccct atccctaata 1680 gatatctaca tcctataatt aagtaatgtg ctagtggaac tatagtaagt ctacgctact 1740 agagataaag atataggtag taagtaggcc agggtagata ctataagtaa tagttaactt 1800 aagatactaa atatagggaa gactaagaac ccaaaggaaa agagacacct aaagggatta 1860 gatgggaatc tacttaattt aaagaatata ataaatagta taagagtaga actaagtctt 1920 atagaactcc ttaatatggc actagcagcc tgtatagagt ttataaaact tataaggcta 1980 aatcctacta ataagataaa aaaggctcta aagagagtaa gaatttatta tcttaagcag 2040 ctagactagc ggaataaagt atataataat tatttctaac tataggatcc tagtttaaca 2100 gaactcttct acactactac agatgtataa gctataacta ctaataataa gttataggct 2160 gtgatgtgta aggcagtgct agtagatagg ggattagaga gtaatttagt tactagatat 2220 attattagtg ccttaaaagt aagaattatt ctagtagata tttattatta ggtagtaaca 2280 ggctataact tctagttaag tactatagtc tagttatagt taactattaa aggggttacc 2340 taaattatta cagtaatagt tatagacagg gatctaggat actctatcct actaggttac 2400 tactggatat tatctattag attattagga aattataaat atagaatata tactatcaaa 2460 gggttagata gaataaagga agttataagg acttaatagg ctactgttat agaagagata 2520 aaaagagcta aggtctagag ttaagtaatt agataagtta aaattaaaac ctacaagcct 2580 attattcctc tatgtacttt aaattattat ccttaggcct tagactaaga taaattaagt 2640 actaatatag aaggcaagta taagctattc taagttatct aggaagtaat agaagaggac 2700 tgctataagg ctaactcctt agactataat tagggaaact agtagtagca cctatattac 2760 cctctacaga ggctacaggt tattaataga tctagtaaag agggcactat aacaggagca 2820 cctgttatta ataagggaag gaatagtata tattaaacct actccctaaa aacacagaaa 2880 cagaaataat atataaaaat aatatataga atactatcta agtattagat actacagcac 2940 taatacctac ctatttatat cctctaccta actaaatatc cctcttaatg actacagacc 3000 ttagagatat taaagataag atagagaata taggatacct agggttccta ataactttaa 3060 gacctcagga gtaccagtgg gctataataa tagattataa ctaggcataa gcttagataa 3120 aggaaaatac ctaattatta ggactactag ttactaagaa ggagtagcta 
gtagtagcag 3180 atctacttta tacttagaag gatcttttta ttaaaagggt tactaacata ctagctataa 3240 gacttattaa atattaaatc cctacctatc tataggctac cctacaagca gcaaaactcc 3300 tactctatat aaaagaggaa aaaagttagt aagtagagaa cctactaaag atagttaata 3360 taggcattat tattaagtat atatctctat agagtgcacg tactaagttc ctaaggaaga 3420 ccttaggaaa actatagata gtatataatt ttatactaat aaatacagct actataaaga 3480 taaattatct actatagaga atagagccag ttattataaa tctcttaaaa aagtaataga 3540 aggtattctt taaagtagat atagctaata gatattaggc agtactgctg gctattaagt 3600 atttatttaa aatagggttt aacttaatcc taggctaatt ctattattta tatataggac 3660 aggggcttac aggagctcta gagacttact ctaaactaaa gaatctagta ataggagtaa 3720 tacctaaact attatctaaa gctacactta cattacttct aggagtgggt tttaaatact 3780 ttatagataa taacgctgct ataactaaga atataaataa tatagtttcc ttcctacact 3840 agtattactt cccctgacta gcttaggtag gattaaccct taatcctact aagagtatct 3900 tctttactaa taaaattaaa attcttagtt actaatatac ctataataga cttcagctgt 3960 ctataataaa acttaaagca ctctaggtat agctagaact aataaataag gaagaattaa 4020 taaggtttat ttatctgctg ctattcttaa aggtatatat cctagagagg gtagattata 4080 tagctatcct taagaaggct cttaaatata tagggaaggg caagactaag taacttaagt 4140 tatttaaata gggtaaggaa caacagaaag tgttctagat attaaagaga tatcttctag 4200 aggtaaagct ctctagagaa gatcctaagt tataatatca ccttagcact aatacttcta 4260 ataggggcct aggtggagta ctcttctaga taacagaata tctagttaga actaaattat 4320 tataaaaaac ttatctatat aaagtcctag taatatatct gtcctttata ctttcagacc 4380 ctaaaaccta ttatactata atagaaaagg aagtactagc agtccttagg ggtttaaagg 4440 aaataaggta gcttatatta ggatcaccct atctagttat tatctatact aattatactg 4500 tagttaagtc tattatagag gagtatttag aagtaataga ttaattagct tagtagtatt 4560 actaactgca ggaataccaa gttaactata tttatatact aggaaagctt taagtagtgg 4620 ctaatagact atcctagatt ccttactaga aattaactac cctaggaact aaagaggaca 4680 gcttcctatt acttttattt ataaatatag aagattatta actacagcct agtataactc 4740 ctaaggaaac ctataaagaa ttatacctgc tatacctata ggatacctag tataagcaga 4800 tagttaaaga actcctaaaa ggaaggatta acaagtaatt tagcctgatt aatattaagg 
4860 gagaatatct cttagtatat tataaagcta ataggaaata gttataatat atactataat 4920 cttagctcta aggagtctta tatctactct ataatataca tagatatttt atagcaagga 4980 tttccctagg gtaagtaatt agttaatact actggctatg ctgttactag accttagtag 5040 agtactacag gagttatcta taatattaag tagttagtct taaacctcta aagggggacc 5100 ctagggcagt aatctcccta gagctgttat aattatttac tatagactat attagactaa 5160 ttatactaat atcctagatg ggagctaggt acattttagt aggagtagat tactttttaa 5220 aatatatatt tatacacact ataactacag ccactatata agtattagtc tatttcctta 5280 aatataatat aggaaaatat tttagattac tataatacct ctacatagat aacagaaggc 5340 actttactag agaaggattt actacctact taagaaataa tagggttaag tatataacca 5400 cccctatcac agccctatag tctattagtt ttatagaata gatagtctat ataattatag 5460 gctaattaag gataattagc tccctggatt ccttgtaact cctagattag gatagtaatt 5520 tactattagt aattaaggca attaatatat actatattaa ggaccatgga ttttccctag 5580 ctaagattct atttagattc tctctaaggc atacaggcgc actaaggata gaggacctta 5640 ttaacctaga aggaatagaa gttagggtag agatagattt atagttagta attaaaagca 5700 aggcagcacg tgtataggaa gcctaagatt taataactaa gttaagggta atacttctac 5760 tactaacatt gacacgcctt actaaaggag atctagttct attataggat actatactta 5820 agaaagataa gggcaggaag ttagatctat actagagagg gcccttccta ttaaagaagc 5880 ttatatatta taagtactta gtagttctag aagatcttat tacaaagctc taagtaggat 5940 gctattatat taactatctt aaatacttta ttaccaggtc tagtaatact ataaaggaag 6000 taaaagagct acagtaataa ctcctagata gataataaga agctaaatat ttaataaagg 6060 aagctagggg gcaacagaat ctactagaag ccatacctat taaactagaa gagaccttaa 6120 tatagaaggg tgggattatc aacctatagg caggttaata ccctatttaa ccctagccta 6180 gtatttacta gtaaatagaa atcttctact attatttata gatttctaag attatcaccc 6240 cctaacttac agtgcaccta tagcttaatt actcttacct tacctctaaa cttctatact 6300 attaagattt atcttaaagt attatatatt aagcattatt aacctactct atttattcct 6360 tatttcctag taattagtta ttattttatt attaacacag ggataaatgg ctactttaaa 6420 gggaaaggaa aagttagagg tggccctaga atatagtata gaggtgactt tagatagaac 6480 tatatttaca gaactcctac agacccctaa gccctttagt atagatattt aagaggatat 6540 
agaataagag attaaaagat tagaatccct tacagcggaa tattataaat taaacttaaa 6600 taaaagattc taggtttagt ttagacagct tattacttag tataattagc tctacattcc 6660 ccctttagct acagagatag atatttaatc taataaccta ggaatacttc tctctagcac 6720 cacaggctaa gaatctagta ttaatcttat ctatactaag gtaactatta aagcagaaga 6780 aagacagcaa agtctaataa agataacaga gcctatagat aataaactag acctaaagat 6840 cctttctaat aataatattc ttctgcttac ctaacaacta ctaaaggtaa gaaaatatta 6900 cttatactac tagaactaga atagaaagat atacaagcct tactttacta ttatctttaa 6960 acacactgta aaaggggaac ttatcttact agaaaagtta ttagcatcta atattaagaa 7020 ctagagggtt aggctatagc aatttaatta ccctaaattc tataatgcct aaacctaact 7080 acccctaacg ccttctaatt agctagagta tactaaatta gaggtgcttt atctctaaga 7140 gataatatat aatagcaact ccttatacag agaagaggta ataaaggtta agcagaaata 7200 taaacaggct atataggcaa aagttaagga agttattata ttaaagcaat agattaggga 7260 acttaaggta aagttagagg tattatagca gataatcact aggcaagagc atattattaa 7320 tactatgaca caggttactg ccaccaccac ccctatccct aaccttagag acactacact 7380 attctacata cctatattaa agggatctaa ttaatcacag acgtctaatt taattaaata 7440 aattttctta tctaataata gacttaaaaa ttagagatag gggggca 7487 // ID copia-5-LTR_AN repbase; DNA; FNG; 227 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 09-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE LTR retrotransposon. copia superfamily. XX KW Copia; LTR Retrotransposon; Transposable Element; KW COPIA superfamily; copia-5-I_AN; copia-5-LTR_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-227 RA Kapitonov V.V. and Jurka J.; RT "copia-5_AN, a family of copia LTR retrotransposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(12), 217-217 (2003). XX DR [1] (Consensus) XX CC LTR retrotransposon. Copia superfamily. Solo LTR. CC 5-bp TSDs, 99% divergence from the consensus sequence. CC Its 3'-half is 85% identical to the 3'-half of Copia3_AN-LTR. 
XX SQ Sequence 227 BP; 54 A; 56 C; 48 G; 69 T; 0 other; tgatacgaat atatcatgtt gagcctcgtg tcacgtgcca cgtgatctgt atggcatgat 60 ctctggcccc tggcctgatc cctcgaggag ttgtacattg aagaagtgac aactatcgtc 120 atcaagatag agaatcaatc gtcatatggc atcctatcct tagtaggcta cgttcgtgta 180 gttgatgcag cttccttcta ccctttcccc tttgagcatt cataaca 227 // ID Copia-48_MLP-I repbase; DNA; FNG; 4670 BP. XX AC AECX01001003; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-48_MLP_; KW Copia-48_MLP-LTR; Copia-48_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4670 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001003; Positions 5003 334. XX CC Positions [1825-2325] - Integrase core CC 'AAAAGA' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(1312..3201,3205..4668) FT /product="Copia-48_MLP-I_1p" FT /translation="MFKSRDAFKTYNSCNVDISIGEEGRIIKAVGKGTVTI FT KSKYGQYNINDAFHVPSLPHNLFSLTPFWRKGVQLVYCNSNMFVLKKNDTI FT ILDGNIINKLLILNIEIIKSNQFNNALHTSAIWHKRLGHPCEENLKRAAEC FT TEGMNSKITPLPFCEPCALGKSTRNRFPGTIPKPVSPLDVIHSDLSGKITP FT SSVGGANYYVKFIDGFSSYWWIYLLNNKNDFLHVFKQFKCLVENQFGRKIK FT ILVSDGGGEYINHEMDNLCKSSGIIHQVTAPHTPQQNPISERGNRTTVEKA FT RTMIAQSSLPTSFWGEAVITATYLENRTPSSSANFRTPFELIYKKIPNVNH FT IKIFGCAAYKHIPPANRHGKFNNRVEKCILLGFTEGTKNYRLIRISDRKLI FT SSHDVIFNEEEFPSLGNRANQVNEIIMNEESEIIDDVPDEPVENHQDDPEP FT DQPNDDVHIQADEPAEENQINPQVKDEPVVEDENNPINRENPFRIGIHNPA FT MADAIREEQNERENPQPNVCRSNRLRGIPAPNVSVMIAQASALIASIGNEI FT ESSMPYTPLDNMVDPASYSKAKLASDWLKWQSAVHDEFNSLIQNKVFRPCK FT SVPINRKLIGTRLILNKKLDGRYKARCVAQGFQKEGIDYNETFSPTGRIAT FT LRALGAITALDDLEFRRADFVTAFLNSSLEKGEEVYIKPPEGFIEYIMSLP FT HTSVVYKLFKILIDDPNACLEVIQSLYGLKQSARNWYTMVSDWFVAHDFHI FT SSADPCLFIHIDKGHRRSYVYVWVDDMAIAGKEVDWILDELKKDFKIKDLG FT NVDHLLGMKVTRNRAERQIMFSQEHYVDALLQQHGMEDCKPIGTPMQSNEM FT PSTATDEDREAFITSGYDYRAAIRSLNYLSQCTRPDMAHVVSLLSQFLEKP FT GMTHWQCFKRTLRYLRGSKHLSLVYGKLISHTDMSFLQDSSKAPSPSTPTG FT FSDSNWAGCPLTRRSTSGYVFLLNGAALTWRSKKQPTVALSSTEAEYRGFL FT DAGQEGVWIRRLMSDLGFENLCSTTLYGDNQGSIALAKNPVFHARTKHVEI FT HFHWIREKVKDQTFNIVYCPTEHMIADIMTKSLDRSKFVQFRTSLGLLPPS FT KIPVMVARGG" XX SQ Sequence 4670 BP; 1469 A; 982 C; 903 G; 1316 T; 0 other; tctttttcat tctcaccttt tattaactat tgttgttctc aggttcgagt gctttccatt 60 acactctcac cagtattcct gaacgtgctc tgatctgagc ttcctataaa aggtgtgcat 120 tagtaatata actatctgaa ctttttactc tatcatccaa gtccatttcc tcacgatcat 180 tatccttgac cgttcgaata ggttcgagtg ctttccatta cactctcacc agtattcctg 240 aacgtgctct gatctgagct tcctataaaa ggtttgagtg ctttccatta cactctcacc 300 agtattcctg aacgtgctct gatctgagct tcctataaaa gtttgacaca ttatcagtac 360 caggatacca taaagctaat cgatatgtcg ctcgatttca ataccggaaa cacctcatct 420 tcatctcgta tgcttctaac cacaaagaat tatctcacgt gggttacgat gatcgagtgc 480 aagcttgatc gtttaggact gcttgaaata gcactcggga cggaaccgat 
cactagaact 540 ccaggagtcg aacctaccga cttgatgaag gagtcgcacc gtctcaagag caaaagagcc 600 tacaacgtca ttgttgatta cctaaacgaa gacaacctag ccattgtcag tagcaacgag 660 gaagtcggaa aagctagaaa tgcgttcatg ttatggactc ttctccgcga gaaatacatc 720 ggtgaaaaca tgcttaatcg tacaaccctt ttcactaaat tcgccaacat cgtattcaca 780 gacgttgaga cctttgttaa agagattcga tcgtgtgcta atgagttgag gagatccggt 840 ttcaagatgg gtgaagagat gaaggtcatc atggccctat caaaacttcc gtcagagtat 900 gaatcctttg tcagagttat gactcatggt tttccggatc agaatcttga tttcatgtta 960 caccgactgg aacaagatca actccaaagt gcgactagcg gagtcgaggc aaattgtgct 1020 tcagcgaatc acaccaaggc taattcatct cgtagcgaca ttacttgtac tcactgcaag 1080 aagtctggcc acgaagaaac ggggtgttgg aagaaacaca aacatctagc tcctaaccga 1140 aaccgttttt gatctcggaa acctgccgat gaacaagagg tccctaccgt tgcaagctat 1200 gcaggcgtgg acgattctat ccctgggtca gtcccacttc ctcgaacctc ggtcttcgtc 1260 tgttcaacta gtcctgaatc aaatcttctc gattcaggtg cttctgatcc gatgttcaaa 1320 tctagagatg ctttcaaaac atacaactct tgcaatgttg acatcagtat aggagaagaa 1380 ggtcgaatca tcaaagctgt tggaaaaggc actgttacca ttaaatctaa atatggtcaa 1440 tacaatatca atgacgcctt ccatgttcct tctttacctc acaacctctt ctcacttact 1500 cctttttgga ggaagggtgt ccaacttgtc tattgtaata gcaatatgtt cgtgctaaaa 1560 aagaatgata ctattatact tgatggaaat atcattaata agcttttaat cttgaacatt 1620 gaaattatca aatcaaatca atttaacaat gcacttcaca catcagccat ctggcataaa 1680 cgattaggac atccctgtga ggaaaatctc aagagagccg ctgagtgtac agaaggaatg 1740 aattcaaaga ttactcctct tcctttctgt gaaccatgtg cgttaggaaa atctaccaga 1800 aatagattcc ctggaacaat tccaaaacca gtctcacctc tcgatgttat tcactctgat 1860 ctaagtggaa agattacacc atcatctgta ggaggagcaa attattatgt taaattcatt 1920 gatggatttt cttcttattg gtggatctac cttcttaata acaaaaacga ttttcttcat 1980 gttttcaagc aattcaagtg tctagttgaa aatcaatttg gtagaaagat aaagatctta 2040 gtcagtgatg gtggaggaga atacattaat catgaaatgg ataatttatg taaaagctca 2100 ggcatcattc atcaagttac tgcaccacat acccctcagc aaaacccaat ttctgaaaga 2160 ggtaatcgta ctacagtaga gaaagcaaga actatgatag ctcaatcatc acttccaacc 2220 
tcattctggg gagaagcagt cataactgct acatatcttg agaacagaac tccatcgtca 2280 tccgctaatt tcagaacacc gtttgaactc atctacaaga agattccaaa tgttaatcac 2340 atcaagatct ttggatgtgc agcttacaaa cacataccgc cagcaaaccg tcatgggaaa 2400 ttcaacaata gagttgagaa gtgtattttg ctgggtttta ctgaaggaac taaaaattat 2460 cgattaatcc gaatttctga tagaaagctt atttcatctc atgacgtcat ctttaatgaa 2520 gaagagtttc cctcacttgg aaaccgagct aatcaagtaa atgaaattat catgaatgaa 2580 gagtcggaaa taattgacga tgtaccagat gaacctgttg aaaaccatca agatgaccct 2640 gaaccagatc aaccaaatga tgatgttcat atccaagcag atgaacctgc tgaagaaaat 2700 caaatcaatc ctcaagtcaa agatgaaccg gttgttgagg atgaaaataa cccaatcaat 2760 cgagaaaatc ctttccgaat cggtattcac aatcccgcta tggctgacgc aatccgtgaa 2820 gaacaaaatg aaagagaaaa ccctcaacca aacgtctgta gatcaaatcg tctgagaggt 2880 attcctgcgc caaatgtatc ggtcatgata gcacaagcct cagccctaat agcgtcaatc 2940 ggaaatgaaa ttgaatcatc aatgccatat actcctctag ataacatggt tgacccagca 3000 agctattcaa aagcgaaact agcatctgat tggttaaaat ggcaatcggc ggtacatgat 3060 gaatttaatt ctttaattca gaataaagta ttccgtccat gcaaatcagt tccaattaat 3120 cgaaaactta ttggtactcg tcttatttta aacaagaaat tagacgggag atacaaagca 3180 agatgtgttg ctcaaggttt ttgacaaaag gaaggaatag actacaatga aactttttca 3240 ccgactggac ggatagctac gctacgagct ttgggggcaa ttaccgcact tgatgaccta 3300 gaattcagac gagccgattt cgttactgct ttcctcaatt catcgcttga gaagggagaa 3360 gaagtgtaca tcaaaccacc tgaaggattc attgaatata tcatgtcatt accacacaca 3420 tcagtagtat acaagctttt caagatccta attgacgacc cgaacgcttg cctagaagtc 3480 atccaatctc tctatggcct gaagcaatca gctaggaact ggtatactat ggtctcagat 3540 tggtttgtag cacatgattt tcatattagt tcagcagacc cttgtctttt catacatatc 3600 gataaaggcc ataggagatc ttatgtatat gtttgggtag atgatatggc gattgccgga 3660 aaagaggtgg attggatttt ggatgagttg aaaaaggatt ttaagatcaa agatttgggt 3720 aatgttgatc atctattagg gatgaaagta actaggaata gagcggaaag gcaaattatg 3780 ttttcacaag agcactatgt cgatgccctt ctacaacagc acggaatgga agattgcaaa 3840 cccatcggaa cgcctatgca gtctaacgaa atgccgagta cagcaaccga tgaagaccgt 3900 gaagcattta 
ttacaagcgg atatgattat agagcagcca tcagatcact taattattta 3960 tcccagtgta ctagacctga tatggctcat gttgtaagtc tattatccca atttcttgag 4020 aaacctggaa tgacgcactg gcaatgcttc aaacgaaccc tacgatatct acgcggatca 4080 aaacatctat cactagtcta tggtaaatta atctctcata ccgacatgtc ttttctacaa 4140 gattcatcta aagcaccatc cccaagcact ccgactggtt tctctgattc aaattgggca 4200 ggatgccctc taactagacg atcaacatcc gggtatgtct tcttgttaaa tggtgctgcg 4260 ttaacttgga ggagcaagaa acaaccaaca gttgcgttat cgtcaaccga ggctgagtac 4320 cgtggtttct tagatgctgg tcaagaagga gtttggatcc gtcgacttat gagtgattta 4380 ggttttgaaa acttgtgttc aacaactctt tatggagata accaaggatc aattgcatta 4440 gctaagaatc ctgtcttcca cgctcgcacg aagcatgtag aaattcattt tcattggatc 4500 cgggagaaag tcaaggatca aacatttaat atagtatatt gtcctactga acatatgatc 4560 gctgatatta tgacaaaatc acttgacaga tcaaagtttg ttcaatttcg tactagtctt 4620 ggtctactgc caccgtcaaa gatcccggta atggtggcga gggggggttt 4670 // ID PCAL_LTR repbase; DNA; FNG; 280 BP. XX AC AF007776; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Candida albicans Ty1/copia-type retrotransposon PCAL_LTR, long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Long terminal repeat; PCAL_LTR; Ty1/COPIA superfamily; KW retrotransposon. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-280 RA Matthews D.G., Goodwin J.T., Butler I.M., Berryman A.T. RA and Poulter T.R.; RT "pCal, a highly unusual Ty1/copia retrotransposon from the RT pathogenic yeast Candida albicans."; RL J. Bacteriol 179(22), 7118-7128 (1997). XX DR Genbank; AF007776; Positions 1 280. 
XX SQ Sequence 280 BP; 102 A; 42 C; 50 G; 86 T; 0 other; tgttggtttg tgcactattt tgtgtcagaa actgatcaat gaaaatgatg gttattatga 60 gaatggaaaa tttttccatc acacatcagg tgatgacaga actaaactat attgtgtagt 120 ataaataagg gtatgaaata ccaacatccc agaatatcaa cgagatagaa gggaggagtt 180 tcaatatata tcttgtgaat aataacttcg ttctaattca ctatacacaa ctagacgtgt 240 acacgctcaa tctcaggtaa agaaagttta tattccatca 280 // ID Copia-4_CCO-LTR repbase; DNA; FNG; 300 BP. XX AC AACS02000009; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_CCO_; KW Copia-4_CCO-I; Copia-4_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-300 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000009; Positions 805926 806225. XX SQ Sequence 300 BP; 55 A; 88 C; 42 G; 115 T; 0 other; tgtcgaccag atccctagcc tattacgtct atgtacctta tctcattgta cactccgcct 60 acttctttct ctcacgggct catacgatac aaaatctttt tcgccttctt tttcttctct 120 tttcctcagt ctctgttcat ccagattctg aggtatgttt tttcctgtat tttgttcgtg 180 tctacagcta cttatctgta cctatagcac tcgtcttgtc cactaggtac agctcccaat 240 tctagtctct gttcatccag attctgagca ctcgtcttgt ccactagtac ctagccgaca 300 // ID Gypsy-4_LBS-I repbase; DNA; FNG; 6432 BP. XX AC ABFE01000666; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_LBS_; KW Gypsy-4_LBS-LTR; Gypsy-4_LBS-I. 
XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-6432 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000666; Positions 265700 272131. XX CC Positions [5214-5732] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 505..1743 FT /product="Gypsy-4_LBS-I_1p" FT /translation="MSAPVTLRSFSGETDDDVQPGEFLKTFRRFTTSACIT FT DDDLIITSFGDHLKYGSPADDWFAELTSTKQTWKEVETAFLQRFPPVEKAK FT RTETELERELCELRLKTEDLGKKEKYAGEEVYTHVIFAEKALSLAKQAKIS FT TGSNSIWKVRDELPDIIRQKVKETYPTWTEFCAAIKRVEMAHIRDGVKKYQ FT KEKEEKEKTEAAIASLQRLHLQGQRRLPAAPTSSVSNVSQTQQSSTLGGQR FT NNTPQSNSTNTNAYMGPSSGQGNLPRPAPSPITEADRSALIRSLALYPMQP FT NTEAGIAAWHAQLREWKAKNGETAEVTVNTGFPLQPGGEAPGSGECYRCGR FT GGHRRIDMRCTTDQINNKERTFRTICGRILRAAPTSRVNFVSDADDEFSWL FT NHQAFRPTPSPGNGEGPPA" FT CDS 1833..6431 FT /product="Gypsy-4_LBS-I_2p" FT /translation="MRHNSSLIDLYTVGDGDKTWDTLRQGEMPFVHWLLVH FT GPQGEKVRVKAVFDGGAMVGAMCTSFFKKVQHRLLGQTTPSNRRLRVANGT FT IVPSHAVWTGILELGGVRAKAEFEIFDSGGGWEFLFGKPLLRCFKVLHDFD FT ADTVTIRAAHGPVVLCNSVSQCAPKTPLGVSLTLSVEQRENSVGGSSGVKP FT PPRQVLHHEVLDVEVQNDESDCILGGTNDTSTVTGGYIVNEDESIVQEEEE FT WLKEAEECERESIPQESHEGVQATDQKRDEQGSERCTDQGGGNIPPSREVQ FT YQSAASAGASETDDFLVAVPNSAMKASESAMPAENAPICHIKPLQVEREDL FT SGGSDKPPSRGVPTFPVDQHKTTPADTPCLVLPVANASDVQPEDTIFTRQT FT EPFLQARVEKILELVQIGDDIAADQREEVKSLIAEFADCFALSLSEVNLIP FT GAVHKLDIPENTSFRTKIPQCSFNPDQRAFMEAKVDEMLKGGIIRPIHPRE FT VKCVAPSVLAQKAHENTGLSSDELKHKVNDECVKHGLPTAFDLPPRPPPAE FT NGPTTTSPKKWRLCQDFGEINKVTPVAPVPQGDIRAKQLRLSGHRYVHIFD FT FAAGFYGIAVHPDSQPYITFYLEGRGHFAYERMPFGVTGGPSEFGYVVGDR FT MHDLIADGTCENFVDNGGSAADSFEEGITKLRRILERVRRERLSLSPAKFQ FT VFKTEAVFAGARVGPGGVSPDSAKLTAVVNWKIPEDASHLEGFLGLTAYFR FT DLVKGYAALEKPLRDLLCAVDIPNGTKKSAYQRIMKAHKLQPHWTAEHTAT FT FVNLKARLVSEPVLTAPQFDGTHFILTTDACKDAFAGVLSQKIKTTLPGGK FT 
DVTRLHPIGFASKRTSPSEEKYKPFLLEFAALKYSFDKFSDIVYGYPVEVE FT TDCQALRDILLSDKLSATHARWRDGVLAHNIVDVRHIPGKINIADGVSRQY FT EGMDKVPGDGSEWTVTPDWEEMTGLVHDLYHIADLPDLTTLKERFKNEPLY FT LDVIDAIVGLSSEGTTIRERKRAQHRKTQYMLEDGKLWFIGGGSGARARAR FT RECISKVEAVEQAKREHEQGGHWHRDSVKLALLDRYHSPKLDESIIKGIMD FT CARCKNFGSTHLHSLLQPITRRHPFELLVGDYLSLPVGKGGYHTAGIYLDT FT CSQHVWGYKFKTHGTATTTNRSLDDIFHNFAPPETFMADGGKHFKNKEVAE FT NCERWGTKLHTVAAYSPWVNGLVEGTNKLLLYVLARLCAPEVGEDGWQAIT FT WDKLPASWPDHFDKAIRILNWRILPALKFSPKEILLGLVVNTSKTPFEVSC FT SFLTPSDVDSHMTYAAQQRLDGYAEAVHHAVQRKTAFDRRLKASKTGVIEF FT EKGQLVQVYDNKLASTLRTERKLAPMWSSPHRVAERLLNSYKLETLDGTPL FT DGLYHARRLRSFVPREGTLLAAEQKELEEALASEASNVATPRVEAQVETDQ FT EETEESETEDSESQDVEGESSEGDREEIGAGFFYDEDEEVTQEEDEETGIG FT ARVAARRRGRLHNGGGQ" XX SQ Sequence 6432 BP; 1757 A; 1695 C; 1724 G; 1256 T; 0 other; cctggtgatg agtgcgtgat tcgtactttt acacgcgcat atctctcgtc ttaaaacgtc 60 cgtcctcata tcaccgtctt aaatccaccg tcccacgtca tacgtcacac cgtcaacgcc 120 accacgtcca acgtccagcg tctaacggct ttccatctcc gaaatcaact cggccgcaaa 180 cgtgtttcca acgtttcacc aagctcatcg tgtaccccgc acccataaag caagctggca 240 cccaacaata cctggataac tcagccaaga gtcaacggca gattcgtggc tagagacaga 300 gccgctacca ttacaccctc gtctaataca acctcgggta catccacgcc atctactgaa 360 agttcgctca cgctttcgga catactagag cctgcctctc cgttcgagga tctctttgga 420 tcaacgtcaa gtcctcaact aggagcccca caacaaaccg cacagctcag ccccgttaaa 480 ccaccgcagg accttccacc cattatgtct gccccagtca ccctacgcag cttcagcggc 540 gaaacagacg acgatgtcca gccaggcgag tttctgaaga ccttccgccg cttcacaacg 600 tccgcctgca tcacggacga cgacctcatc atcacatcct tcggagacca cctcaagtac 660 gggtcccctg cagacgactg gttcgcagag ctaaccagca cgaagcagac gtggaaagag 720 gtagaaacag cgtttttgca acgctttccg cccgtcgaga aggctaaaag gacggaaacc 780 gagttagaga gggagttgtg tgaattaagg ttgaagacag aggatttggg gaagaaggag 840 aaatacgcag gcgaagaagt ctacacccac gtcatattcg cagaaaaggc tctcagcctt 900 gcaaaacaag ccaagatcag cacggggtcg aattcgatct ggaaggtcag agacgagcta 960 cccgacatta tacgccagaa ggtcaaggaa acgtacccaa cctggaccga attttgtgca 1020 gcgatcaaaa 
gggtagaaat ggctcatata cgagatgggg tgaagaagta ccagaaggag 1080 aaggaggaga aggagaaaac ggaggcggct atagcaagcc tacaacgctt acacctgcag 1140 ggacaacgtc gcctcccagc cgcacccaca tcctcagtat caaacgtcag tcaaacccag 1200 cagtcatcaa ccttgggagg tcaacgtaac aacacgccac aatccaattc aaccaacacg 1260 aatgcataca tgggcccgtc aagtggccaa ggcaacctgc cccgcccagc tccctcgcca 1320 atcactgaag ctgatcgaag cgcgttaatc cgcagcttgg ccctctaccc catgcaaccc 1380 aacaccgaag caggcatcgc agcgtggcac gcccaactca gggaatggaa ggcgaagaat 1440 ggcgaaactg ctgaggttac agtaaacacg gggtttccac tacaaccagg gggtgaggcc 1500 cctggttcag gggagtgcta cagatgcgga aggggaggac atcgccgcat cgacatgcgc 1560 tgcacgactg accaaatcaa caacaaggag cgcacgttcc gcacgatatg tggccgcatc 1620 cttcgagctg cgcctacctc tcgagtcaac ttcgtcagtg atgcagacga tgaattcagc 1680 tggctcaacc atcaagcctt cagacccacg ccgagcccgg gaaatggaga agggccgcct 1740 gcataggagt tgagtgggcg gccccgactt tagtagggcc tacagaggtc gaccgcccca 1800 gcagtcagaa cagtctcaat gatgtatcta ccatgcgcca taattcatca ctcattgatt 1860 tgtacactgt tggggacgga gataagacat gggatacatt aagacaagga gaaatgccct 1920 ttgtgcattg gctgctcgtg catggaccac aaggcgaaaa agttagagtc aaggcggtgt 1980 ttgatggagg agcgatggtg ggcgctatgt gtacatcgtt cttcaagaag gtccagcaca 2040 gacttctagg ccaaaccaca ccatccaaca gacgtttgcg tgtggcgaat ggcaccatcg 2100 taccctcaca cgcagtatgg actgggatcc tggaattggg aggagttcgg gcgaaggcag 2160 agtttgaaat cttcgatagt ggaggcggct gggaattctt attcgggaaa ccgctactac 2220 gttgcttcaa ggtgttgcat gactttgatg cggacacggt caccattcgc gctgctcatg 2280 gaccagtcgt attatgcaac agcgtcagcc aatgtgcccc gaagactcct ttaggcgtta 2340 gcctcacact cagtgtggaa cagcgggaaa actcagtagg gggctcttca ggcgtgaaac 2400 cccctccgag gcaagttttg catcacgagg tcttagacgt tgaagttcag aatgacgagt 2460 cggattgcat tttaggtggt actaacgaca cgtcaactgt aacaggtggg tacatcgtaa 2520 acgaagatga aagtatagtg caggaagaag aagagtggtt gaaggaggct gaggaatgcg 2580 agagagagag tataccacag gagagccatg aaggagtaca agccacggac cagaagcggg 2640 atgagcaggg atcagagcgt tgtacagacc aggggggagg taacataccc ccctcgaggg 2700 aagtacaata tcaaagtgct 
gcttcagcag gggcgagcga aactgacgat ttccttgttg 2760 cagtccctaa cagcgccatg aaagcctcag aaagtgccat gccagccgag aatgccccta 2820 tatgccacat aaaaccgtta caggtggagc gagaagatct tagtgggggg agtgacaaac 2880 ccccctcgag gggagtacct accttcccag ttgatcaaca taagaccacc ccagctgaca 2940 cgccatgcct tgtgttacca gttgccaatg catcagatgt ccagccagaa gacacgatct 3000 tcacacgcca gactgagccc ttcctacaag cccgtgtaga gaagatattg gagcttgtgc 3060 agataggtga cgacatcgca gcagaccaac gcgaggaagt caagtcactc atcgctgaat 3120 tcgctgactg cttcgcctta tcactcagcg aggtcaatct cattccaggc gcagtgcaca 3180 agctcgacat acccgagaac acatcattcc gcacaaagat tccacaatgc tcgttcaacc 3240 cagaccaacg ggcctttatg gaggcaaagg ttgacgaaat gctgaagggt ggtataatac 3300 gtcctataca cccaagggag gtcaaatgcg tcgcgccctc ggtgttggcc cagaaggcgc 3360 atgagaacac aggcctgtcc tcagacgaac tcaagcacaa ggttaacgat gaatgcgtga 3420 agcacggtct gccaactgcg ttcgacctac cacctcgccc acctcctgct gaaaatggac 3480 ccacaaccac gtctccaaag aaatggcgcc tgtgtcagga cttcggtgaa atcaacaaag 3540 ttacgcccgt ggcaccagtt ccccaaggag acatacgtgc taagcagcta cggttatcgg 3600 gccacagata tgtccacatt ttcgactttg cagcgggttt ctatgggatt gcggtccacc 3660 cagactccca gccatatatc acgttctatc tagaagggcg tggacacttt gcttatgaac 3720 gcatgccatt cggcgtcact ggaggcccct cagagtttgg atatgtagta ggcgaccgaa 3780 tgcacgacct catagcagat ggaacttgcg agaacttcgt tgacaacggt ggatcggcag 3840 cggattcctt cgaggagggt ataacgaaac tgcggcgaat attggaacgt gtgcgtaggg 3900 aacgcttgtc tctctccccg gccaagttcc aggtgttcaa gacagaggct gtctttgctg 3960 gagcgcgtgt tggaccggga ggcgtcagcc cagactcagc aaaactcaca gctgtggtaa 4020 actggaagat accggaagac gcttcgcatt tagaaggatt cttgggtctt acagcgtatt 4080 tcagagacct ggtaaaaggc tatgcggccc ttgagaaacc tctccgggat ctactatgtg 4140 cggtggatat tccgaatggc accaagaagt cagcgtatca gcgaataatg aaggcccaca 4200 agttgcagcc acactggaca gcagagcata cagcaacctt tgtgaaccta aaggcacgac 4260 tggtctcaga accggtcctg acggcccctc aatttgatgg cacacatttc attctcacca 4320 ctgatgcatg caaagacgca ttcgccggag ttttgtcaca gaaaatcaaa acaacccttc 4380 caggaggcaa ggatgtaact cgtctacacc 
caatcggctt cgcatccaag cgaacgtccc 4440 cgtccgagga gaaatataaa cctttccttc tcgagttcgc cgcgctcaag tattcattcg 4500 acaagttttc agacatcgtg tatggatacc ccgtggaggt ggagaccgat tgtcaggccc 4560 tgcgcgacat cctacttagt gataagctga gcgcgaccca tgcacgatgg agggatggag 4620 tactggcaca caacattgtg gatgttcggc atattcctgg gaagatcaat atcgcagatg 4680 gcgtcagtag gcagtacgaa ggcatggaca aggttccggg cgatggcagt gaatggacag 4740 tcacaccgga ctgggaggag atgactggcc ttgttcacga cctttaccac atagccgacc 4800 ttcctgacct aacgactttg aaagaaaggt tcaagaacga acctctttac ctcgatgtca 4860 ttgacgccat cgtgggcctt tcatctgaag gcacgacgat acgggagcga aaacgtgcac 4920 agcataggaa gacccaatac atgcttgagg acggtaaact ctggttcatt ggaggcggta 4980 gtggtgcaag ggcacgagcc aggagggaat gcatttcgaa ggtggaggcg gttgagcagg 5040 caaaacgaga gcatgaacaa ggcgggcact ggcacaggga ctcagtgaaa ctggcactgc 5100 tggaccgata ccacagccca aagctggatg agtcaatcat caaaggcatc atggactgcg 5160 cacgttgcaa gaatttcgga agtacacacc tacattccct tcttcagccg ataaccagac 5220 gtcatccgtt tgagctgctg gtgggggact acctctcatt gccggtagga aaaggcggtt 5280 accacactgc tggaatttac ctagatacgt gctcgcaaca tgtttggggt tataagttca 5340 aaacccacgg tactgcaaca acgaccaacc gatctcttga cgacatattc cacaacttcg 5400 cacctccgga aacgtttatg gcagatggag gcaagcactt caagaacaag gaagtggctg 5460 agaactgtga gcgctggggg acaaagctgc acacagttgc agcgtattca ccttgggtca 5520 atggacttgt tgagggtact aacaagcttc tactctacgt cttggctagg ttatgcgctc 5580 ccgaagtagg tgaagatggc tggcaagcaa tcacctggga caagctgcca gcatcatggc 5640 cggaccactt cgacaaagcg atacgcatac tcaactggcg gatactccct gctctcaaat 5700 ttagcccgaa ggaaatactg ttgggcttgg tagtgaacac gtcaaagaca cccttcgagg 5760 ttagctgctc cttcttaacc ccatctgatg ttgattcaca catgacctac gcggcgcaac 5820 aacgtcttga tgggtacgcg gaggccgttc accacgcagt gcaaaggaaa actgcgttcg 5880 accgcaggct caaggcttca aagacgggag tcatcgaatt cgagaaaggt cagctagtac 5940 aggtatacga caacaaacta gcatccaccc tcaggaccga acgcaaacta gcgcccatgt 6000 ggtcttcccc acaccgagtt gctgagcgcc tcctaaactc gtacaagctg gaaactttgg 6060 acggcacccc tcttgacggc ctgtaccacg caagacgcct 
gcgaagtttc gtaccaagag 6120 agggcacatt gttagcggca gagcagaagg agctagagga agcactggcc tcagaggcat 6180 cgaacgttgc aaccccaaga gtggaggcgc aggtggagac agatcaggag gaaacggaag 6240 agtctgagac ggaggattcg gagtcgcaag acgtggaggg ggagagtagt gaaggtgatc 6300 gggaggagat tggagccggg tttttctatg acgaagatga ggaggtgaca caggaggagg 6360 atgaagagac aggcataggc gctagagtgg cggcgaggag acgaggacgt ctccataatg 6420 gaggggggca ga 6432 // ID TDH5_I repbase; DNA; FNG; 4435 BP. XX AC AJ439552; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Debaryomyces hansenii retrotransposon TDH5_I, internal region. XX KW LTR Retrotransposon; Transposable Element; RNaseH; TDH5_I; gag; KW integrase; internal region; pol; protease; reverse transcriptase; KW internal portion. XX OS Debaryomyces hansenii OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Debaryomyces. XX RN [1] RP 1-4435 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439552; Positions 458 4892. 
XX SQ Sequence 4435 BP; 1186 A; 947 C; 987 G; 1315 T; 0 other; ggttatgagc cactttttgg actttgatgt tacggagcag gataagctcc gtggtattga 60 taactacaaa ctttgggctg ctatggtaaa gtccagactg ctggaccttg actgccactt 120 agatctgtat ctaagtaatg tggatattcc tggtgttagt gagtttactg acactgaacg 180 gaatgtgatt agacttaagt ttgatctgat ccttgacaag cttttgaaaa agtgcattac 240 gtctgctgtt tatgcgacgt actgtactaa acgtggttta cgtggactta atttatggca 300 tgctctattc cgcgactttg gaactatttc tgttaagcag catttagagc ttcattccac 360 gttaacccgt cagctttttg attctactgt atcgcttgct gctaagattc aacttcttga 420 agatgctgac ctggaccttc ttcctttgga tgaaggccag cggattctct tctttcacag 480 ttgtgttaat aacagcactg ttaagaacgt tattattcgt cttgacgaat acgttccctc 540 ccgtcttaat tggcttgcat tgaaagacgg gatctctgag attcttcacg agacaagttc 600 ccccgtgtcc gatgagtcta gtatggctct tgtggtcctg cagcgaaacc ctcgaagaat 660 tatgatcaag tctaagttga cttgtttcaa atgtggtggc attggccaca aactgaatgt 720 ctgtccctct cgaagtgatg acgagtattg gaatctgcaa ccaccgaagc cgacgaataa 780 gcccaattat tttgtgacat atactgagga gaaacccagc gtctctccca ccaatgctgt 840 atgtgggctt agttacttgg tccccaaacc tagtcagcac actgctgcct atgctgacac 900 cggtcaccag acgtcttctg cctcggtatc tcatggttgt gcagctgcta ctgatagcca 960 tagtgatgtc ttccttcttg atactggtgc gtctgttcat attacgcatc agaaggacct 1020 acttcatgat tttaatcctg ataactctgg taccattgct ggacttgacc ctgctaccac 1080 gttcaagatc cttggtactg gaacattgaa atttacgtta cctgatggtt ctatactccc 1140 cgttgaggat gtgcagtatg ttccttcctg cggtcgtaat ttaatctcgg ttagccgtgc 1200 atctaaagga ggttcaacat tccaccttct acccgatggt atcgttgacc ttcgtttagg 1260 ccttcaggtt gctgctctta cgaaggatga tctttatcaa ttttcactcc catgcttgcc 1320 tgctcataat gtatcagctg gcactgcgtt tactgctaca atggatgccc attctcgtct 1380 tggtcatcca agtcgtaagg ttgcgcaacg cattggcacc ttatgtcctt cgtatgctgc 1440 tgcgttgaag tcggaatcga aggattccct ttgtgagtcg tgtattcgtg cgaaggctac 1500 gcacgctctc cctaaaacgt cgaccacttc cggtcgcact gtgaagactc cgttggagct 1560 tgtccactct gatgtctgtg ggccgttttc ccagccgagt ctaacccaag acttatatta 1620 tgttgtcttt gttgatgatt acactcacta 
tatggcagtg tatcctgtta aacaaaaatc 1680 tgatgtatat gagtgtgcgc gctcatattt tcttcagtct gaacgattct ttcataatcg 1740 tggtggctac aagccggtta cttttcgcac cgataatggt ggtgagtata tgtcgtccca 1800 attgcagcaa tttcttaaag ctcaagggat cactcaccaa accactgttg cgtacaatag 1860 tcatcagaat ggcgtcagtg aacgagccat tcgtacgatc aatgaaaaat gtcgtgccat 1920 gatgtttcat gcatccacgc ccctgtgttt ttgggccgaa gctgtagcgt gtgctactta 1980 tcttcttaat agattaccgt cgactgctat tgataagcaa tatccctatc aacggtggta 2040 taagagtcat gcccagtggg accatttgcg cccatttgga tgcatggcgt atgctcttat 2100 tccacaacaa ttgagatcgt cgaagttgtc tcctcggtct attcgaggtg ttatgcttgg 2160 ttatgcccaa acccaacatg catatcgtat tttcgatctc gactctggca aagttgctgt 2220 tagtaataat gtgaagtttg acgaatttgt gtttccgttt caaactatga ctaacattcc 2280 ccgggatatt gcctctatag gctcgctgtc gtcgtcgctg tcttcgtttt catccattcc 2340 cgggattcga gctactgctg agctccccgg tactcttatg tcgccacctg tgctggatgt 2400 cgacatgtct gatattgaat ctgatttttc gtcccatgat tctatggtcc ctcaggatct 2460 tgcgtctcct gtcgatgggc catctatgga tatcgtgggc ccaccttctc gtggttcgct 2520 gtcgtcctct cctggtatta ccaccactcc tagagtggtt gaaccgttgt cccctctcgt 2580 aacgcaagag gtgtctcgtc ctcgtatcga gtacatttct gactcagaag cttcggttcc 2640 aggggaagac tacgctactt ctttctatga acgtcatcac gataactttg ataagtctga 2700 tgatgaggat tatgttgatg aacctgtgag gagaagaaga gctattacta tgcctgatag 2760 actgcgtaga agatcagaag tatcgtcgga tggagaaact gaaccttcta agaaactgaa 2820 atatgaaaca ctgcatactg aatcagactc ctcgcttgtt gtcagaggat ctggttattt 2880 catgtcacac gcctatgttg ttagtaccaa gaaagatggt gtaccagtaa catacaaaca 2940 ggcaatgctg agtgacgaag ctgaaaaatg gaaaatcgcc atggatagtg agatgagtgc 3000 tcactactca aataatacat gggatttagt actgttgcct aaggatagaa aagcaatagg 3060 caacagatgg gtatttacca aaaaagatga tggtagatac aaagctagac tagtagcaca 3120 gggcttcagt caggttccag gggaagacta cttagatacc ttcagtcctg tgatcagata 3180 tgaatctgtg aagctacttc tagcattcag tgctgttgat aatagagttg tccatcagat 3240 ggatgtagat accgcatttt tgaacggtac tgttgaggaa acactctata tgaaacagcc 3300 tgttggattt attaatgaga atgaaccaga gaaagtctgt 
aagttgaaca aatcgttgta 3360 tggattgaaa caagcaccga tatgttggaa tactacaatt tcagattttc ttgctgaaca 3420 taggttcaac cgaattgaca ctgagttagg catatacgtt cgagggaaca taattattgg 3480 cctctatgtt gatgatatcc ttatatcagg aaaggatatg aaggaaattg aagatgtaaa 3540 gaaaatgctc gcactgaagt ttaaaatgaa agacttaggg gttgcgaaga agtttctagg 3600 aattaatatt gcacaggata ccaatggtat caagatatgt ctctatgact acattggaaa 3660 ggtcttgcag gactttgaaa tgactgatgc aaacacggtg acaactccga cattggccgg 3720 agaggactta cacaaggaca acacagaaga atgtgatgcc accaggtacc gctctctagt 3780 aggaaagcta ttatttgcat ctactactgt tcgaacagat attgcgtacg ctgttggaat 3840 acttagtaga cacttagcca aaccctctga gatgcatatg aaatgtgcca aacatgtcct 3900 cagatatctt aaaggaaccc aagatatcgg ccaccattat actggagaca gctcattgga 3960 tatatattgt gatagtgact gggctagtga caagagcgac aggaaatcaa ttacaggata 4020 cattgttaga tatggaggag ctcctatctc gtggaaaagc aagaaacaga ctacagtggc 4080 aatgtcgacc actgaagctg aatatctcgc attgggtgag gctaccaaag aagcgctatg 4140 gatcataatg ttgtttgacg agatgcgtgt gccattacaa ttacccatct cgatccacga 4200 agataataac tcatgtatac tccttgcaga gcaccctgta tttcacctgc gcactaagca 4260 tattgacata cgccatcatt tcataagaga gcatataata aagaaacaaa ttaaactttg 4320 tcagattagc acccatacac aaattgcgga catgcttaca aaaggattaa ataagattaa 4380 gttccaggac ttgagaagtc ttgctggaat gacaagaatt aggattaagg ggaag 4435 // ID Gypsy-20_MLP-I repbase; DNA; FNG; 6141 BP. XX AC AECX01000932; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_MLP_; KW Gypsy-20_MLP-LTR; Gypsy-20_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6141 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). 
XX DR Genome; AECX01000932; Positions 6560 12700. XX CC Positions [4500-4979] - Integrase core CC 'TCATT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(654..4307,4311..5765) FT /product="Gypsy-20_MLP-I_1p" FT /translation="MSHEQDTIDSLRQQVESLKLQNESLAQARQETLTLQN FT LVRDLLAKKNTVASAAQAAGANPNTLGSDSATQPHPYIQYLNQSPVSPTPL FT PRDSNRNATVTGLAAQAVTPTAGPSPGKTYQAEATREHAPAEAPQVPTTPI FT SPRDRATLPIPYAQTPYLHYYPQQLPETPAAFVSVPMTADRVKLSDLPKFT FT GKFGRPADLFHWQSLVEETFDIKNISDDRERMKLLGSLLNNEEMSAWYQSN FT RDTIRQQSWKEAMNMMAMGTLPHAWLADTEESLRRLKMQTSESFDAYVVRA FT QALHRLVKRFGHVTDRNLAEYITWGTPRLFKDMITREKHLTAEPFSFPLFK FT DAADGIWCFLTNARLLPETHGKVKQPTLTGSSNSPASIPPRPANQNRVRSD FT DEQADNAWRYHEYLRQSGICAVCKEKCNNTNCTRRSSKFLSVPSTFNPGPR FT PTRSATASTPAPAPGTPTQRPAGRPPAASTSSRVAAVDTFPAMLAEDIAAY FT EEADQFKEGQVEETVTEDERCVPSTASTPIILELTCNGKTVRALIDSGAGT FT NLLSENMARNLQVCQRPLVAPVEVRLAIPTEGTPLVLREFAIANLKSEHPS FT LRFGAVFFKLAPLGPNYDMILGSPFLSKFKLDISLHRRAVLHTPSGKILYE FT KEMQSELRRVLCALENLGKINKAQDLCEREISVLRDYEDIFPAELPPVEQG FT DEPDETFPDGLPDPARQVRHKIILTNPDVVINEKQYGYPRKYLDSWNKLLN FT QHVAAGRLRRSHSQYGSPSMIIPKKDPNQLPRWVCDYRTLNKYTVKDRSPL FT PNVDEAVRLVATGKVWSVVDQTNSFFQDRMREEDIPLTAVKTPWGLYEWTV FT MPMGLTNGPATHQARNEEILDELVGKICVVYIDDIVIFSQTVEEHEVHLHQ FT VLDRLRAAKLYCSVKKSKLFRQNIKFLGHEISEAGVAPDDEKIEKISKWVS FT PSNSKQLLKFLGTVQWMKKFIDGLSHYVGTLTPLTSTSLKGKPFQWGEAEE FT RAFNNIKRLITTLPVLRNLDYDSNEPLWLFTDASGSGLGAALFQGADWDTA FT SPIAYESRTMNSAEKNYPVHEQELLAVINALNNWRLLLLGMRVNVMSDHHS FT LTTLLTQRNLSRRQARWLETLSQFDLDFKYIKGQDNSVADALSRRDDVALC FT EVRGKLADTELQAIQAGYNNDSFCVKLRKVLPLQEDCVLQDDLMYLDGRLV FT VPQYGDLCDFISQAHNALGHLGTLKTLARLRTTFFWPGMAKEVENQLRTCD FT SCQRNKARTTSTTGKLQSSEVPISPMQSIAIDFVGPFPIVSGYDMLLSCTC FT RLTGFVRLIPASQRDTAERSATRLYTAWSSIFGLPESIIGDRYKAWISRFW FT QRLHALLGVKINLSTSYHPQADGRSERSNKTIGQILRHFAAEKHGKWLQAL FT PAAEFAINSAVNVSTGVSPFQFVYGRTPRLFPIKNPAEEEDNGVKHWIERR FT QAEWATWRDNLWSSRVNQAFQYNRRRREGEPFVTGDWVMVDSSNRQQIVSG FT KFRPTSKLRARYDGPYEVLEVLNEGRNYRLQLSPNNLTYPIFHISKLKLYH FT 
AATEDREEADIATMSLVSSQPEPTPEGSREEPLGQARNCDEGVCIQVDALT FT PPRCESFILLRPLLKEIVGKQSFLQNNSDEDSVELTKQGTLVKESLVAATE FT LVLSIPKTTPTGPGMEPLGQARLIDKGVCFTMRR" XX SQ Sequence 6141 BP; 1775 A; 1449 C; 1383 G; 1534 T; 0 other; ctttttttga cttgctcact tcaaaaataa attaaatatc aacaataaaa aaactattta 60 caattttttt cctctatttt tttgcaaact ttctgtcact ctgtcttatc tgtcgcctgt 120 caatcagttc atcagtctcg gatttccacc tgtcaaacaa actctgtcaa ttcagccaaa 180 ccttaccttc ggttataacc agacaatcac acacgagtca aatgagcggg aaaccggata 240 actttcttga caatcctgac agtttacttt gagaggctcg aaaagctaaa caaaaacaaa 300 aagaatcggc tgagtcacca ccggcgctgc ctccaacaaa cttttcttgg agacctgaaa 360 cattcgtatc ggggtctcca gctccgactg accgcatttt atacgcttca agccatcgac 420 ctagtcctgc agcggtctcc gcagttcact tcacaccccc agttatatca ccatcagcct 480 cggaacgcac tgtcattgac cttcactata agaggttgtt cgaacaagga cgacttgatc 540 agacacccca tccgccaggg cattttccaa tcaacacacc agcaggacaa cagtcggaag 600 caccggacgt cacgttgaca ggaccaacgt tgacgtgtga ctcttcaaga atcatgtctc 660 acgagcagga caccattgat tcgttacgtc agcaggtgga aagcttaaaa cttcaaaatg 720 aatcgttagc tcaagctcgt caagagactt tgacgttaca aaacttagtt cgagatcttc 780 tagccaaaaa gaacacagtg gccagtgctg ctcaagccgc aggtgcgaac cccaacacct 840 tgggtagcga ttcagcaacg cagccacacc cttacatcca atacctcaac cagagcccgg 900 tcagccctac accgttacca cgagactcaa atcgaaatgc gacggtgacg ggacttgcag 960 cacaagcagt gacgccgaca gctggtccat cgcctggcaa aacttatcaa gccgaagcaa 1020 cgcgcgaaca cgcaccagct gaggcacctc aagtgccaac gacgcctatc tcaccacggg 1080 atagggccac gttaccaatt ccctacgctc aaactcctta cctgcactat tatccgcaac 1140 agcttcctga aacgccagca gcctttgttt cggtaccaat gacggcagat cgagtgaagt 1200 tgagtgattt accgaagttc accggtaaat tcggtcgacc ggccgatctc tttcactggc 1260 agagtctcgt agaagaaaca tttgacatca agaacatctc ggatgatcgg gaacgtatga 1320 aactactcgg atcccttctc aacaacgagg aaatgtcagc ctggtaccaa tccaaccgag 1380 atactatacg tcagcaatcg tggaaggagg ctatgaatat gatggctatg ggaactctcc 1440 ctcacgcgtg gttagcagac acggaagagt ccttgcgacg actcaagatg caaacctctg 1500 agtcttttga cgcttacgtt 
gttcgggcac aggcattaca tcgtttggtg aaacgatttg 1560 gccatgtaac ggatcggaat ctagccgagt atatcacatg gggcactcct aggttattca 1620 aggacatgat aacacgcgag aaacacctca cggccgaacc gttctccttt ccgcttttca 1680 aagacgctgc tgatggaatc tggtgctttc tcacaaatgc cagactctta cctgagacgc 1740 atggaaaggt taagcaacct actctaacgg ggagctcaaa ctcacccgct tcaataccac 1800 ctcgacctgc aaatcaaaat cgagtgcgat cagacgacga gcaagcagat aacgcctggc 1860 gttatcacga gtatctcaga caatcgggga tctgtgctgt atgtaaagaa aagtgtaaca 1920 acacaaattg cactcgaagg agctcaaagt ttctctcagt cccatctacc ttcaatccgg 1980 gacccaggcc tactcgatca gctactgcta gcaccccagc tccggcgcct ggaactccta 2040 ctcaaagacc ggctgggcga cctccggctg cttcgacctc gtctcgtgtt gccgcagttg 2100 atacctttcc agcgatgttg gcggaggata tcgcggctta cgaggaagca gatcaattca 2160 aggagggaca ggtggaggag acggttaccg aagacgaaag gtgcgtacca tcaacagcat 2220 ctacccctat catcctggag ctgacctgta acggaaaaac ggtacgagcg ctcatagatt 2280 cgggagctgg aaccaaccta ttgtccgaga acatggcgcg taatcttcaa gtgtgtcaac 2340 gaccgttggt agcacccgtt gaggtacgct tagcaatccc aacggaagga acccctctag 2400 tgttgcgtga attcgcaata gctaacttga agtctgaaca tcccagtctt cgatttggag 2460 cggttttttt taagctggca ccgttaggcc cgaattacga catgatactg gggtctcctt 2520 ttctctcgaa attcaaattg gatatttctt tacatcgtcg tgctgtctta catactccca 2580 gtgggaaaat cttgtatgaa aaagaaatgc aatctgagtt aagacgagtt ttgtgtgcac 2640 tcgagaacct tggaaagatc aacaaggcgc aagatttgtg tgaacgggaa atatccgtac 2700 tgcgagatta tgaagacatt ttcccggctg aactaccacc tgtcgaacag ggtgatgagc 2760 cggatgaaac ttttcctgat ggtttaccag acccagcaag acaagtcaga cacaagatta 2820 ttctgacaaa tccagatgtt gtcataaatg aaaaacaata tggatatcca cgcaagtatc 2880 tcgactcgtg gaataaacta ctgaatcagc atgtagcagc aggaaggctt agaagatccc 2940 acagtcaata tgggtctcct tccatgatca ttccaaaaaa ggacccaaat caattaccca 3000 ggtgggtttg tgactaccgt actttgaaca agtacacggt gaaggatcgt tctcctcttc 3060 caaatgtaga tgaagcagta agattagtgg caacaggaaa ggtatggtcg gtggtagacc 3120 aaacaaattc attttttcaa gacagaatgc gagaggagga tattccgctg acggcagtga 3180 agactccttg gggattgtat gaatggaccg 
tgatgcctat ggggttaaca aatggcccgg 3240 caacacatca agcgcggaat gaagaaatct tagacgaatt agtggggaag atttgtgtag 3300 tttatattga tgatattgtt atcttctcgc aaactgttga agaacacgaa gttcatctac 3360 atcaagtgct tgataggttg cgtgcagcaa aactctactg ttcagttaag aaaagtaaac 3420 tttttcgtca aaatatcaaa ttcctgggac atgagattag tgaggctggt gttgccccag 3480 acgatgagaa gatagaaaaa atctcgaaat gggtctctcc ttcaaattca aaacagttac 3540 tgaaattcct tggaactgtg caatggatga agaagttcat tgacggactt tctcactacg 3600 taggcacctt gacacctcta actagcacgt ctttgaaagg taaacctttc caatggggtg 3660 aagcagaaga acgtgcgttc aacaacatca agcggttaat caccactctt ccggtactac 3720 gaaatttaga ctatgattcg aatgaaccac tttggctatt tactgacgcc agtggcagcg 3780 gtctgggagc agccctgttt caaggagcag attgggacac cgcgtcgcca atagcatatg 3840 agagccggac catgaactca gcagaaaaga actaccctgt tcacgaacaa gagcttttgg 3900 cagttatcaa cgcgctgaat aactggaggt tacttctgtt gggtatgagg gtgaatgtta 3960 tgtccgatca ccactcactc acaactctct tgacacaacg caacctcagc cggcggcagg 4020 caagatggct agaaactcta tctcagtttg atctcgactt caaatacata aaaggtcaag 4080 ataactcagt tgcagatgcg ctttcccgac gtgatgatgt tgcgttatgt gaagtgcgtg 4140 ggaagctggc cgataccgaa cttcaagcaa tccaggctgg ttacaacaat gacagttttt 4200 gtgtcaaact tcgcaaagta ctgcctttgc aagaagattg tgtcttgcaa gatgatttga 4260 tgtacctaga tggacgctta gtagtacctc aatatggtga tctttgctga gacttcatct 4320 cacaagcaca taatgcgtta ggacacctag gtactttaaa gactttagca cgcctgagaa 4380 ctacattctt ttggccagga atggcgaaag aagtcgaaaa tcaactaaga acatgcgatt 4440 cgtgtcagcg taataaggcg cggacaacgt caacaaccgg caaattgcag tcttcagaag 4500 ttcctatcag cccaatgcaa tcaattgcga ttgactttgt tggcccattc cccatagtat 4560 ccgggtatga tatgctcttg tcttgtactt gtagattaac aggatttgtc cgtttgatcc 4620 cagcctcgca gcgtgataca gccgaacgct cagcgactcg tttatacaca gcttggtcat 4680 ccatttttgg cctaccagaa tctatcattg gggatcgtta caaagcctgg atatctcggt 4740 tttggcagcg gttgcatgca ttactaggtg ttaaaatcaa tttatcaaca tcttatcacc 4800 ctcaggcaga cggaagaagc gagagatcga ataaaacaat tggtcagatt cttcggcact 4860 ttgccgctga aaagcacgga aagtggcttc aagctttacc 
agcagctgag ttcgctataa 4920 actccgctgt taatgtctca acgggagtgt ctccttttca attcgtgtac ggccgaacac 4980 ctcgactttt cccaataaaa aatccagcag aggaagaaga caatggtgta aaacattgga 5040 tagaaaggag gcaagcggaa tgggcgacat ggagggacaa tttatggtca agccgggtta 5100 atcaagcctt tcaatataac cgtcgacgaa gggaaggaga accttttgtg actggtgatt 5160 gggtgatggt tgacagcagc aataggcagc agatcgtaag tggaaaattt agacctacca 5220 gcaaacttag agcacgttat gatgggccat atgaggtctt ggaagtgctg aacgagggtc 5280 gcaattaccg acttcaacta tcaccaaaca atctcacata ccccatcttt cacatctcaa 5340 agctgaagct ttatcatgcc gcgacggaag atagagaaga agcggacatc gcgaccatgt 5400 ctttggtctc gtcgcaaccg gaaccaaccc cagaaggatc tcgagaggag cccctggggc 5460 aagctaggaa ttgtgacgag ggggtgtgca ttcaagttga tgcgctaaca cccccgcgat 5520 gtgagtcttt tattcttctc aggccccttc ttaaagaaat tgttggcaag cagtcttttc 5580 tccaaaataa cagtgacgag gatagtgtcg aattgacaaa acaaggtaca ttagtcaagg 5640 aatcactggt tgcagccacc gagttggttt tgtcaattcc gaaaacaacc ccgaccggtc 5700 ctggaatgga gccattgggg caagctaggt taattgacaa gggggtgtgc ttcactatga 5760 ggcgctagca cccccgcaat gtgagtttgc taccgtttcc tttcaatcaa ttttgcaaaa 5820 cagctggcaa gcaatattac ctcctataat acagatatca cgaagcgaat gctattacga 5880 cgcaagtgga attcatggat tcaacttcaa gacttcaaga ctagtttaaa ttactgcaag 5940 atctcaagat aatgatgaat taatacaaat gaattaagac ttagatggat atttttgcac 6000 agtttttctt acttctttct tttctttttt tctgctttct ttgggacgag ttatggtcaa 6060 aaaattttgg ggcgagttat gccaattttt ttttctcttt attaaaaaca gaagctacaa 6120 ttttttttta gaaagggggg g 6141 // ID Gypsy-4_PPM-LTR repbase; DNA; FNG; 260 BP. XX AC ABWF01004646; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_PPM_; KW Gypsy-4_PPM-I; Gypsy-4_PPM-LTR. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-260 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01004646; Positions 11809 11550. XX SQ Sequence 260 BP; 60 A; 60 C; 45 G; 95 T; 0 other; tgttacggtt ctttcatttc ggacgcatta tttcggacca cggaacctta tctctattta 60 gttaccatat atgtcttttg ttttctcatg actatctcgg accttaccta tgtcacagat 120 atgctcgtcc ctcttcacat atgttctctc atgtaatatg tagttactca tgcggtttgt 180 acaatatata agcaggctga agctgagggc aatcctcagt cttcagttgg cactttcaac 240 gctaaaatct agtcgtgtca 260 // ID SKIPPY_I repbase; DNA; FNG; 6989 BP. XX AC L34658; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 08-AUG-2007 (Rel. 12.07, Last updated, Version 2) XX DE Fusarium oxysporum retrotransposon Skippy; gag polyprotein, pol DE polyprotein; LTR. XX KW LTR Retrotransposon; Transposable Element; LTR; KW reverse transcriptase; gag; integrase; retrotransposon; RNaseH; KW pol gene; SKIPPY; SKIPPY_LTR; SKIPPY_I; internal portion. XX NM SKIPPY. XX OS Fusarium oxysporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; OC mitosporic Hypocreales; Fusarium; OC Fusarium oxysporum species complex. XX RN [1] RP 1-6989 RA Anaya N. and Roncero I.M.; RT "Skippy, a retrotransposon from the fungal plant pathogen RT Fusarium oxysporum."; RL Mol. Gen. Genet 249(6), 637-647 (1995). XX DR GenBank; L34658; Positions 620 7608. XX CC Target-site duplications of 5 bp were found. CC The second ORF, 3888 bp in length, has homology to the protease, CC reverse transcriptase. RNase H and integrase domains of CC retroelement CC pol genes in that order. CC Sequence comparisons and the order of the predicted proteins from CC skippy CC indicate that the element is closely related to the Gypsy family CC of CC LTR-retrotransposons. XX FH Key Location/Qualifiers FT CDS 55..2613 FT /product="SKIPPY_I_1p" FT /note="gag." 
FT /translation="MSANVNPVQQQNAAANLQGHQPAAPANPAPVNRPPTD FT QNMDDADDSSQSSDDSEVERLREQLGNVTNEMNEMRQMLEEFTALQHQQNQ FT SNNNTQQEMYNLASAANNGRDPGEVLKPSPPEYFDGTPSKLPTFLTQSRAF FT ITYYPNQFRNDSAKVMYMAGRLIKTAAQWFQPIMNDYMTNPPYKLQPRTAL FT LFGENGRHEMEEALKMAFGTIDEKGQAERKIKTLKQTGSASTLGVEFLQLA FT SKLPWDQDVLMSFFFDALKEQVQQELWEKDRPRTLVEYINMAVKIDDRQFA FT WRTRNSRGNKGRQDNKPRYHANQGRTRQTDTSYGTEAGPMAIGMTKRDKSK FT VTCYNCGKKGHYERECKNPVKTNQKYRPVPEGKKINMVKKNEEPQMAIKTI FT NMTRKDGYDMTQAKYINNFDPTLDVHDPFLSKEETLEKYYSKLEPEQRPVL FT GHTAPTTEVLYSRSKEEKREKRNNREQKKRQQAKKDKTIELEWVPVPEHNQ FT QPTKGKSIFMVRKGKEIAPKTDNEPSTKEVNQEIDNIPVKVTSRRPNRHTN FT DPNYLESCRVQRESDARKLRNKNEKTIWNTPIVPQYDEFNQRFWTNKHQKY FT TELARERAITEQDDDIYIRAYTKEMLEDPTTPEERMQAEKDLRRYPNHPNH FT KQISWVSCTRHYCSRHQQDKIANDCFPVQVPEHQEQKPYLIIDTVGYRVTK FT KYRGSQVAKLEAHTETRERALAHIQNSRHVSQWRQRVQQEVEIAEDETLDQ FT ELSQINQDITRWEGELSQDEQDILRQLDKQQLQERLIQTEEEKGRPLTEGE FT KADIILDHEVGKMEAAYQQKLATQNECPDDINCTKSECSMDHPWGKGTRHL FT " FT CDS 2454..6338 FT /product="SKIPPY_I_2p" FT /note="partial pol, protease, reverse FT transcriptase,RNaseH, integrase." 
FT /translation="RRKSRHHPRPRGRKDGSCLSTKISHPERMSRRHQLHQ FT ERMQHGSPMGKRHAPSVTERQIGEKSEAKLPKKALRMATKGRQRTCLELKV FT RIKGKWLSALVDSGADMNFISPTTVNELRLPWKDKNDPYTVHDGQGETYLY FT ENGNITREIDHLKVFVNGKNQGIDFDIIPVWRYDLVLGYPWLLRYNPQFNW FT RTGQVDCEDHPSDDESDSGYDTRSRTSTEESSEDRSGKVPPIPKGTRHKYH FT KGKVKCIRRTIASLKGQFKQLDQDIKQMKQIAAKESDERLKNIPPEYRIYE FT KLFQEELDTKLPQHTDYDIEIVLKDGKNPKFFPIYNLSQDELGTLREWIND FT MIRKGYIRPSKSSAGFPVMFVPKPNSNKLRLVVDYRQLNEITEKDRTSLPL FT ITELKDRLFGKKWFTALDLKSAYNLIRIKEADEWKTAFRTKYGLFEYLVMP FT FGLTNAPAVFQRMITNVLREYLDIFVVCYLDDILIFSDTEEEHTEHVHKVL FT KALQDANMLVEPTKSHFHQSQVTYLGHEISHNEIRMDRRKIAAVAEWKVPT FT SVKETQSFLGFANYYRRFIKDFSKTAIPLTEITKKDKQFQWNDKAQEAFEK FT LKSAITSEPVLVMFDPDRQVELETDASDFALGGQIGQRDDNGVLHPIAFYS FT HKMHGAELNYPIYDKEFLAIVNCFKEFRHYLRGSKHPVKVFTDHKNIAYFA FT TTQELNRRQLRYAEYLCEFDFTIAHCKGTDNGRADAISRRPDFDTGTVKTK FT EQLLETNSKGEYQFTQPVKTIALTRKGITEEEKQRYIGFKKYISMQQLKEH FT HRDMHEKPEEASASHLSWYFGSGRDERIRQIMDKCPTCKQKPQGWRYPIQS FT LTDQDEQERVEQEFIYEIHAHPLHGHQGVTKTMKRLQELGYRHFKKGQVEK FT VIKQCDLCAKTKAQRHKPYGQLQPLPVAQRPWDSITMDFITKLPLSEEPST FT GIFYDSIMVIVDRLTKFSYYLPYREATDAEELSYVFYRHIVSIHGLPTEIL FT SDRGPTFAATFWQSLMARLGLNHRLTTAFRPQVDGQTERMNQVLEQYLRCY FT INYEQNDWVEKLPIAQLAYNTAYNESTKLTPAYANFGFTPNAYHNARPEKS FT INPAAIIKSEDMQDLHEYLKTELEFVRKRMKNYYDPKRLKGPTFSEGDMVY FT LATKNIKTDRPSHKLDYKFIGPYKVLQKISENNYKLDLPPKVRLHPIFHVS FT LLESAADTIQVKTGNEPREISGPEVYEAEAIRDTRKINGQREYLIKWKNYP FT ENENTWEPPKHLVNAQRLLKDFHQRARKKERRPK" XX SQ Sequence 6989 BP; 2546 A; 1801 C; 1395 G; 1247 T; 0 other; cgttgatagc tcttgttcag accgtaccgc gtaacgctga gcaatacaaa cgcaatgtct 60 gcgaacgtca accccgtcca acagcagaac gcagctgcca accttcaggg tcaccagcct 120 gcagcacctg ctaaccctgc accggtcaat cgcccgccaa ccgatcagaa catggacgac 180 gcagacgact cttcccagag ttccgacgac tcggaggtcg aacgacttcg cgagcagctc 240 gggaacgtca ccaatgagat gaacgaaatg cgacagatgc tggaagaatt caccgctctg 300 cagcaccagc agaatcagag caacaacaac acccagcagg agatgtacaa cctggcctcg 360 gcagccaaca atgggagaga cccaggagaa gtcctcaagc ctagcccgcc tgagtacttc 420 gacggaaccc cgagcaagct 
acccacgttc cttactcaaa gccgagcctt catcacgtac 480 tatccgaatc agtttcgcaa cgattccgcc aaagtaatgt acatggcagg aagactcatc 540 aaaactgcag cccaatggtt tcaacccatc atgaatgact acatgacaaa cccaccttac 600 aaactacaac cacgaaccgc tttactcttc ggagaaaacg gacgtcacga gatggaagaa 660 gcactcaaaa tggcgttcgg aaccattgac gaaaaaggcc aagcagaaag gaaaatcaaa 720 acacttaagc aaaccggatc agcatctacc ctaggagttg agtttttgca actcgcaagc 780 aagctaccct gggaccaaga cgtattgatg tcatttttct tcgacgcact caaagaacaa 840 gtccaacaag aattatggga gaaagatcga cccaggacat tggtcgaata catcaatatg 900 gcagtcaaaa tcgatgaccg acaattcgca tggagaactc gtaactcacg aggaaacaag 960 ggacgacaag acaataaacc tagataccat gccaaccaag ggagaactcg acaaactgac 1020 acgtcctacg gcaccgaagc aggaccaatg gctataggaa tgactaaacg agacaagtcc 1080 aaagtcactt gctacaactg tggcaagaaa ggacactacg aacgagaatg taagaacccc 1140 gtcaaaacta accaaaaata cagacctgtt ccagaaggta agaaaatcaa catggtcaag 1200 aaaaacgaag aacctcagat ggcaatcaaa accatcaaca tgactcgaaa agacggatac 1260 gatatgacac aagcaaagta catcaacaac ttcgatccaa ccctcgatgt acacgacccg 1320 ttccttagca aagaagaaac cctcgaaaaa tactattcca aactggaacc agaacaacga 1380 cctgttctag gacatacggc cccaaccaca gaagttctct attcacgaag caaggaagaa 1440 aaaagagaga aaagaaacaa ccgcgaacag aagaaacggc aacaggccaa aaaagacaaa 1500 acaattgagt tagaatgggt accagtaccc gagcacaacc aacaacccac caaaggaaaa 1560 tccattttta tggtcagaaa gggaaaggaa attgccccta aaacggacaa tgaacctagt 1620 accaaggagg tcaaccagga aatcgacaac attcctgtca aagtcacctc acgacgacca 1680 aaccgacata ccaatgatcc caattacttg gaatcctgca gagttcaaag ggagtccgac 1740 gccaggaaac tcaggaataa gaacgagaaa acaatctgga acacccctat cgtaccacaa 1800 tacgatgaat tcaaccaacg tttctggacc aacaaacacc agaaatatac ggaactcgcc 1860 agggaacgag ccatcacgga acaagatgac gacatctaca tccgagcgta cacaaaagaa 1920 atgttggaag accctacaac tccagaagaa cgtatgcaag cggaaaagga cctaaggagg 1980 taccctaatc acccgaacca caagcagata tcatgggtat catgcacacg acactactgc 2040 tcgagacacc agcaagataa gatagcaaac gattgcttcc cagtacaagt tccggaacat 2100 caagaacaga aaccctacct gataatcgac 
acagttgggt accgagttac aaagaaatac 2160 aggggatctc aagtagcaaa actagaagct cacacagaaa cacgagaacg agctctagca 2220 catattcaaa attctagaca cgtctctcag tggagacaac gggtacagca ggaagtcgag 2280 atcgccgaag acgaaacact cgaccaagaa ctcagtcaaa tcaatcagga catcacccga 2340 tgggaaggag aactatccca agatgaacaa gatatccttc gacagctaga caagcaacag 2400 ttgcaagaac gattgataca aactgaagaa gaaaaaggaa gacccctcac tgaaggagaa 2460 aaagcagaca tcatcctaga ccacgaggta ggaaagatgg aagctgctta tcaacaaaaa 2520 ttagccaccc agaacgaatg tcccgacgac atcaactgca ccaagagcga atgcagcatg 2580 gatcacccat ggggaaaagg cacgcgccat ctgtaacgga gcgacagata ggcgagaaat 2640 cagaggctaa actacctaag aaagctctga gaatggcaac gaaaggacgc caacgaacct 2700 gcctagaact aaaagtcaga atcaagggca agtggctaag tgcacttgta gatagcggag 2760 ccgacatgaa cttcatttcc cctacaacgg ttaatgaact aaggctacca tggaaggaca 2820 agaacgaccc atatacagta catgacggac aaggagaaac ctatctttac gaaaacggaa 2880 atatcaccag agagattgat cacctcaagg tattcgttaa cggaaaaaac caaggtatcg 2940 acttcgacat catcccagta tggagatacg atctagtact cggataccca tggctattac 3000 gctacaaccc acaattcaac tggagaactg gccaggtgga ctgcgaagac cacccttcag 3060 atgatgaaag cgattcaggc tacgacacga gatctcgaac ttcgactgaa gagagcagcg 3120 aagacaggag cggcaaagta ccgcccattc caaaaggaac acgacacaag taccacaaag 3180 gcaaggttaa atgcatcaga cgaacgatag catctctcaa aggacagttc aaacaactgg 3240 accaagacat caagcaaatg aaacagattg ccgcgaagga atctgacgaa cgactcaaga 3300 acataccacc ggagtatcgt atctatgaga aattgtttca agaggaacta gacacaaagt 3360 taccccaaca cacggactac gatattgaaa tcgtactaaa ggatggaaag aacccgaaat 3420 tcttccccat ctataatcta tctcaagatg agctaggaac gctcagagaa tggataaacg 3480 acatgatacg gaaaggatac attagaccat caaaatcctc agcaggattc ccggtcatgt 3540 tcgtaccaaa acccaattcg aacaaactac gacttgtagt ggattaccga caacttaacg 3600 aaatcacgga aaaggacaga acatcgttac cactcatcac cgaactcaag gaccgtctat 3660 tcggcaaaaa atggtttacc gccctcgatc tcaaatctgc ttacaatctc attcgaatca 3720 aagaagccga cgaatggaaa acagcctttc gaacgaaata cggactattc gaatacttgg 3780 ttatgccttt tggattaacc aacgcacccg cagtattcca 
acgcatgatc acgaacgtac 3840 ttcgagaata ccttgacatc tttgtagtct gctacctaga cgatatcctc atcttctctg 3900 acacagaaga agagcacaca gaacacgtcc acaaggtttt gaaagcactg caagacgcca 3960 acatgctcgt cgaacccacc aaaagtcact ttcatcagtc tcaagtaacg tacttaggac 4020 atgaaatctc gcacaacgaa atcaggatgg acagacgaaa gatagcagca gtcgctgaat 4080 ggaaagtccc aacatcggta aaagaaaccc aatccttctt aggattcgcc aactactacc 4140 gacgattcat caaagacttc agtaaaaccg ctatcccact cacggaaatc accaaaaagg 4200 ataaacaatt ccaatggaac gacaaggcac aagaagcttt cgagaagctt aaatcagcca 4260 tcacaagcga acccgtcctt gttatgtttg accccgaccg gcaagttgaa ctcgagaccg 4320 atgcttccga cttcgccttg ggaggacaaa taggacaacg agacgacaac ggtgtactac 4380 accctattgc tttctactct cataagatgc acggagcaga actcaactac cccatctatg 4440 acaaagaatt cctagctatc gtcaactgct ttaaggaatt caggcactac ctcagaggaa 4500 gcaaacaccc agtgaaagtg ttcaccgatc acaaaaacat agcttacttt gcaacgacgc 4560 aagagctaaa ccgacgacaa ttacgctacg ccgaatacct atgtgaattc gacttcacca 4620 tagcgcactg taaaggaaca gacaatggac gcgcagacgc cattagtcga cgaccagact 4680 ttgacactgg aactgtcaag accaaggaac aactcttgga aaccaactct aagggagaat 4740 atcagttcac acaacctgtc aaaaccatcg ccttaacccg caaaggaata acagaagaag 4800 agaaacaacg ctacatcgga ttcaagaagt acatctcgat gcaacaacta aaagaacacc 4860 atcgagacat gcacgagaaa cccgaagaag ccagcgcaag ccatctttct tggtatttcg 4920 gatcaggaag agatgaacga atcagacaaa tcatggacaa atgcccaacc tgcaaacaaa 4980 aaccacaggg atggagatac ccgatacaat ccctcacaga ccaagatgaa caagaaagag 5040 tggaacaaga atttatctac gaaatccacg cacatccatt acatggacac caaggagtca 5100 cgaaaacaat gaaacgactt caagaactag gctaccgcca tttcaaaaag ggacaagttg 5160 aaaaagtcat aaaacaatgc gatttgtgtg caaaaacaaa ggcgcaaaga cacaaaccct 5220 atggacaact acagccctta ccagtagcac aacgaccttg ggactcgatc acaatggact 5280 tcataaccaa actacctttg tcagaagaac cctcaaccgg aatcttctat gacagtatca 5340 tggttattgt tgaccgactc acaaagttct cttactacct accctacaga gaagcaacag 5400 acgcagaaga actttcttat gttttttaca gacacattgt tagcatacac gggttaccta 5460 cggaaatcct ttcggataga ggacctacgt ttgcagcaac attttggcaa 
tcacttatgg 5520 cacgacttgg actgaaccac cgactcacga cagcgtttcg accgcaagtc gatggacaaa 5580 ccgaacgaat gaaccaagta cttgaacagt acttacggtg ttacatcaac tacgaacaga 5640 acgattgggt tgagaaacta ccaatagccc aactagccta caacaccgcc tacaacgaaa 5700 gtacgaagct tacaccagcc tacgcaaatt tcggtttcac accaaacgca taccacaatg 5760 cacgaccaga gaagtcaatc aatccagccg ccattatcaa aagcgaagac atgcaggatc 5820 tacatgaata tctgaagaca gaactcgaat ttgttagaaa acgcatgaaa aattactatg 5880 acccaaaacg gttgaaggga ccaacctttt cggagggaga catggtctac ctagccacaa 5940 aaaacatcaa aacagaccga ccaagtcaca aattggacta caaattcatc ggaccttaca 6000 aggtactaca aaaaatctca gaaaacaact acaaactgga cctaccaccg aaagttaggc 6060 tccatccgat ctttcacgtt tcattactcg aatcagcagc ggacacaata caagtcaaaa 6120 caggcaacga accacgagaa atctcaggac cagaagtcta tgaagcagag gctatcaggg 6180 acacccgcaa aataaacggt caaagagaat acttgataaa atggaaaaac tacccagaaa 6240 atgaaaatac atgggaacca ccaaaacacc tcgttaatgc ccagcgactt ctgaaggatt 6300 ttcatcaacg ggctcgaaaa aaggaacgtc gcccaaaata gaattccaat cgatcatgtc 6360 cacggcaccc agggactgag cattagaaat caccacggac tcgtcctgga gctcaccaga 6420 ctcctcctgg gattgcatcc cacgagcaaa caactcatca cccttcttct tcaacaaacg 6480 cttctgactg cgaatacgag ccaatttcgc catagcagct tcaagagcct tttcagcttg 6540 ctcttcctcc ttctccaact taacgcgagc tttcatgttc ttttccacta ttcgaagaaa 6600 agaacgtcag aacaacatca gaaacagaaa acaaaaaaaa aaaaaaacgg aacatacagt 6660 aggaagcaac acccgaacca tcgcacacca ccttggcctc agtacaattc tggcaccgat 6720 gagattgatc gcctttgacc tgacaagaaa ggcccttgcg gaaacaacgg gaacacggca 6780 tgatctcgtc accaaacctg acaatgtact cagaaagttt gacggcctct aacgttttcg 6840 gatgaggttt gcttttggaa acacgacctg acattttgtc agcattgaaa ataaaaaaac 6900 aagactaggt ttgactcacg accaacggat ggaagaaacg acggttatgc aggtcttgag 6960 gacaagaccc aacggagggg agagatcgt 6989 // ID PIF_Harbinger-2_PleOst repbase; DNA; FNG; 2831 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. 
XX KW Harbinger; DNA transposon; Transposable Element; KW PIF_Harbinger-2_PleOst. XX OS Pleurotus ostreatus OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Pleurotaceae; OC Pleurotus. XX RN [1] RP 1-2831 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 2831 BP; 634 A; 702 C; 732 G; 763 T; 0 other; agagagtttg agaaagccac tcaaggcggc ttttgccgcg tgagcccgcc accagcacag 60 ccaccctccg acctcgcctc cgtgccattc atcgacatcc cttcgttact gttaagcact 120 attctctccg tttatcacac attcgacgct ttactgcact cgccatatgg ctaccacttt 180 cttggccagc cggattgcct tcaaccgtcg cttgctggag gttttagcgg aagcggatga 240 tttggtagag gagcagactg cgggacggat gcggccagga gggcgggtga gattggagag 300 tcttggggat gatgtttggt tgcaacaatt ccggtagact cgttcccagt ccctgtgttg 360 tagtttgaga attcagtggt atcaaatagc ttcacccgta gcgaaatcaa tcagcttgta 420 atgcacctag acctaccgga atctgtccga tgtccagata gtggagctgc ggaggaccga 480 gttactgctc tttgcatgct tctacgacgt ctcgcgtatc ccatccgtct cgttgacgtt 540 gaactcatct ttggatggga gcggtctcgg ttctcgagga tcacacgaat tacggcactc 600 attctatggc atcgatggaa gcatcttttg cgtttcgatc ccgtgcgatt atcccctgaa 660 aaactcgaga cccttgctgc atttgtggca ctcaagggtg caccgctgga cgttgtcgtt 720 gcatttcttg atggaaccct tcgcaagaat gctcgccctg tccggaacca gcgaatggtg 780 ttcaacggat ggaaacggat tcattgcttg aagtatcatc ttctcctttc gcctgatggc 840 atcgccatcc atgtttatgg cccagttgaa ggacgaaggc atgatgaaac tgtctacaag 900 gagagtggcc tagcgaacct tttgcgtacc cattttcata cacgagatgg gcgacttctc 960 ttcatttatg gggacagtgg gtatagctgc aaggggcagg ttctctcacc ttacaaaggg 1020 tcagttatta cagatgagca gcgcatgtgg aacaatgcta tgagcaaggt ccgtgaaccg 1080 gtagagtgga tatttgggga ggtggtgaag cagtttgcgt tcctcgactt cagccacaac 1140 ttcaagctcc ttctgcaacc atgtgggctc tattatctca tcgctatcct cttctgcaac 1200 gcacacacca ttctacaccg accacaaacc cctcaatact tccattgtcc gccacctact 1260 
ttgacagaat acttccacgg cgagcctgtt gatgacgaag aactggatgc ctggtgtatg 1320 tcaacaccat ggggggagtg tgaggtacca gagggtgtta ctggtgactc agaggatccc 1380 agtcaatcag cgcagcagga gatggagtga atatatacat atcatacaat aagaacagaa 1440 taatatactg taattattta tcctttgata cattgacggc caacgtgagt atagcagttg 1500 tcaatgcatt gagctggtct ttctgctgac gctcacgctc catctcttct gcttgtcgct 1560 caatatcatg tttttgttgg gcttccatca atgacgagat gccatgcaag atgccgatcg 1620 tctctttatg caggttagtg agctcttcgt gacgaagctg gtccaaggta cggagttcct 1680 gcagagattt ctggtcagac gatatctgct ccaatatcag tgcggcggca tggctgtggc 1740 gacgtttccg cggagggaat tcaaattcaa agttctcctt ctctgccgta gagaaggaga 1800 gcggacggcg tctatgagta tcgtcagcaa aacgaatcag ctatacaaca ggcgagctta 1860 cttaagtact cgttgagctt gtttttcgcg cacacttgct cccggaagag ccgatacgtc 1920 caccagagac tcgcgatgaa caagcccctt catagatgct tcgcgtagct cggaggatgc 1980 ttgagactca acagacctct tcttttttat ctgtgcagat tgttcatctt tgatactgct 2040 gtgtacatcg tacagctgag caaggccggt cagcagctat agagttttgt tagaaaattc 2100 aatccgccta aacggaggta tccctcactt ccacatgcgc gtcaacttcc tcgttcgtgc 2160 cggtagactg taacgatcga gtctcatcgc gctggagcaa caatcagtat gtgtatctga 2220 gttcacaaac ctgatcatac cttatgggat tgaattagct tctcaaaccg ctttcggcag 2280 gcatacgcag ttctgtcgat gacagactga ggcccctcaa gcgcggtgtg ctgccgaaga 2340 gtctcggcca gcgtttccca tgccttggtg accattttcg tgttgggttc atcgaatggg 2400 cgcacgttgt ccacttcttt gatgaggcgc cggtcttgcc agggaagcca ccgcggcgac 2460 cgtgcatttt tggactgtct ttggctagtg gtatcttcag gtgtttggct actatcgtca 2520 ggcagacttc cttgctcgca atcccatgca ggggagggct ggtgttgtac tgggtcagca 2580 actgatgtcg aggaatcaat tgggatgtca gatggtttta ttggcagttt gatccggatt 2640 ttcagaggtg aggagtcttg atgaatggtc ttttgtcgtt tagccctgac catggagaga 2700 aatacgcaag taaacgctca attatcgtga gttacgacgc gccgcgctca aagccgctgg 2760 gtggctatct cacgcggaaa aagccgccgg tggctgacta agcagcctct ggtggctttc 2820 gcaaactctc t 2831 // ID Copia-3_MVPL-I repbase; DNA; FNG; 4138 BP. XX AC AEIJ01000198; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 
16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Microbotryum violaceum genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_MVPL_; KW Copia-3_MVPL-LTR; Copia-3_MVPL-I. XX OS Microbotryum violaceum OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Microbotryales; Microbotryaceae; OC Microbotryum. XX RN [1] RP 1-4138 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Microbotryum violaceum genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AEIJ01000198; Positions 5836 1699. XX CC Positions [1526-2026] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 50..2983 FT /product="Copia-3_MVPL-I_1p" FT /translation="MSTPLMVTGGTTPIVLVKPRDVDPSCHLRNAGGWSQW FT IDDVRGALGGDYWNCTLATPVQSPNTRTDLDHANFQAVALLMRSVSPDIRK FT HIKDGYDHTPAEVLQMIKKAVGTGTVADILIRLQALLDVGPPPTGMDAFAK FT YQATTIELFNTLSGHELTFEQIVSFRLVHFGKDLYREYYTQTLRQGLDRLP FT SPTEVLQSLHTQADQTPAPKPVAMVASPIATADKSTAPPFSPCHHCGGPHW FT NHKCPDNKAGSKGKKTKAPSAPSASLVTAPDISTLVDGPVLGYLTATMSTL FT ARPSIILESGATHHIVNNCACFTSFRKCKPAPLESITGEVVNSIEGVGSAI FT VRSFDGSIGQLNGALFVPTAPVNLFSASRADIGGNDIRLTRAKATVSIADT FT VCHTGELKNGLYHMQASLVDVAALVHQAGAFPRALMAVPMSVLHGRFAHLH FT PRALRQLISTEAVSGVSMEVDPNEDLKCDTCQAGKITHHPFPAVASNCSAQ FT PLEQVHMDLLAFDGAVSLGGARYALVIVNDHSQYLWAIPMSHKSDTFAAFK FT SWLAKVERSTSRKVLAVRSDNGGEFMLNKFSRFLEEQGITRQLSIPDTPQQ FT NGVAKRANRSITEGVRSMLHQLGLPHALWAKALATYIYVKNRSPHSANSGT FT TPHTHWYSSKPSAGHLRVFGCRAWKAATTEPRSKLNPRGIPLVFVGYDLES FT KGYRLLDPNTRQVFKSRSVTFFEDNFPARATGIRAPPLPAAGDDNSGVVII FT PPEHVDPPAALRFDAPGPAWQPPAGPRVCNPPAQYGALASLRSTPSAFAFS FT LGNEVVALVANLAEAGINVSKADHPLSEPEDPFTFPTSDPSTWKEAMRHPH FT AEGWKAGAIEEFRLMKDDFKVFSVVDLASVPRAATILPSRHVFRTKRDKAG FT KMVSLKSRIIARGCAQQAGDFDETFAPTAKFTSICVLVAHAASRGHHIIQA FT DVDKAYLHGVLEEEIYMRVPTGI" XX SQ Sequence 4138 BP; 872 A; 1286 C; 1065 G; 915 T; 0 other; ataggttacg agcccaaccg agcggccagc caattcgagt catcaagaca tgagcactcc 60 tctcatggtc acagggggaa caacccctat 
cgtcctcgtc aaaccacgcg atgttgaccc 120 atcatgccat ctgcgaaacg ctgggggctg gtcgcaatgg atcgacgacg tgcgaggcgc 180 ccttggcggc gactattgga actgcaccct cgcgacgccc gtccaatcac ccaacacgcg 240 cacggatctc gatcacgcca atttccaagc ggtcgcgctg ctgatgcgtt ctgtctcacc 300 ggacatccga aaacacataa aggatggtta cgaccacact cctgcggaag tcttacagat 360 gatcaagaag gcggtcggta ccggcacggt ggctgacatt ctcatccgtc tccaagcgtt 420 gctcgacgtg ggaccccctc ccaccggcat ggacgccttc gcaaagtacc aagcgacgac 480 cattgagcta tttaacacgc tctcgggaca cgagcttact ttcgagcaga ttgtgtcgtt 540 ccgccttgtc cacttcggca aggatctgta ccgtgaatac tacacgcaaa cacttcggca 600 aggattggat aggctaccga gcccgacgga ggtcctccag tcgctccaca cgcaggcgga 660 tcagacgcca gctcccaaac ccgttgccat ggtggcaagc ccgattgcga cagcggacaa 720 gtctaccgcg cctcctttct ccccgtgcca ccactgcgga ggtccgcact ggaatcacaa 780 gtgcccagac aacaaggctg ggtccaaggg gaagaagacg aaggctcctt cagcgccttc 840 cgcctcgctc gtcactgcgc ctgatatctc gacgctggtt gacggtcctg tgcttggcta 900 cctcacagct acgatgtcca ccttggccag gccctccatc atcctggaaa gcggggcaac 960 gcaccacatt gtcaacaatt gcgcatgttt cacgtccttc cgcaagtgca agccagctcc 1020 gctcgaaagc atcactggtg aagttgtcaa ctccattgaa ggcgtcggct ccgcaattgt 1080 tcgttccttc gacggcagca ttggccaact caacggcgcc ttgttcgtgc ccaccgctcc 1140 ggtcaacctg ttctccgcca gtcgtgccga catcgggggt aacgacatcc gtctgactcg 1200 ggccaaggca acggtctcca ttgctgacac ggtgtgtcac actggcgagc tcaagaacgg 1260 actttaccac atgcaggcct cgctcgtcga cgtcgctgct cttgttcatc aagcgggtgc 1320 gttcccgcgt gcgctcatgg ccgttcccat gtcggtcttg cacggacgct tcgctcacct 1380 ccacccacgc gccctccgac agttgatttc gactgaagcc gtgtcgggag tcagcatgga 1440 ggtagatcca aacgaggatt tgaaatgtga cacttgccaa gctggcaaaa ttacgcatca 1500 cccgttccct gctgtcgcat cgaattgctc tgctcagccg ctcgaacaag tccacatgga 1560 cctccttgcg tttgacggtg ccgtgtcatt agggggtgcg agatacgctc ttgtcatcgt 1620 gaacgaccat tctcagtatt tgtgggcaat tccgatgtca cacaagtccg acacctttgc 1680 ggccttcaaa tcctggctag cgaaggtgga gcgctccacg tcacgcaaag tccttgctgt 1740 ccgctcggat aatgggggcg aattcatgtt gaacaagttc tccagatttc 
tggaggagca 1800 aggcattact cgtcagctct ctatccctga cacgcctcaa cagaatggag tcgccaagcg 1860 tgcaaatcgt tcgatcaccg agggtgtccg ctcaatgctc caccaattgg gcctacctca 1920 cgctttgtgg gccaaggcat tggcaacgta catctacgtc aaaaatcgct cgccgcactc 1980 tgcaaactcg ggcaccacgc ctcacacaca ttggtacagt tccaaacctt cagctggcca 2040 tctacgcgtg tttggctgtc gagcatggaa ggccgctacc accgagccgc gcagcaagct 2100 caatccgcga ggcatcccac ttgtctttgt tggctatgac ctcgagtcca aggggtacag 2160 actgctggac cccaacacaa gacaggtctt caagtcccgc agtgtcacat tctttgagga 2220 caacttccct gctcgagcaa cgggtatccg agcgcccccg cttccggccg caggggacga 2280 caacagcggg gtcgtcatta ttccccccga gcacgttgac ccaccggctg ccttgcgttt 2340 cgacgcccct ggcccggcct ggcagccacc cgcaggccca cgtgtgtgca atccgccggc 2400 tcagtatgga gcccttgcat ccttgcgctc gacgccttct gcctttgcat tctcactcgg 2460 caacgaggtt gttgcgctgg ttgccaatct tgcggaggcc ggcatcaatg tctcaaaggc 2520 ggaccatcct ctctccgagc ccgaagatcc tttcacgttt cccaccagcg atcccagtac 2580 ttggaaggaa gccatgcgcc accctcatgc ggaaggttgg aaagctggtg ccattgagga 2640 gttccgcttg atgaaggatg acttcaaggt gttcagcgtg gttgaccttg cctctgttcc 2700 cagggcagcc actatcttgc cgagtcgcca cgtgtttcgg accaaacgcg ataaggcggg 2760 caaaatggtg tcgttaaaga gccgcatcat agctcgggga tgcgcacagc aggcaggtga 2820 ttttgacgag acgtttgcac caacggcaaa gttcacctcc atttgcgttc tcgttgcgca 2880 cgcagcctcc aggggccacc acatcataca agccgacgtg gacaaggcct acctccacgg 2940 cgtcctggag gaagagatct acatgcgcgt ccccactggc atctaaggct atgacggcaa 3000 gtgcctccgt cttcaccgtt ccatctacgg gctcaagcaa gccggtcgag tctggaacaa 3060 taaaatcaac accactctag cgaatcttgg ctaccaccgt cttgcctgcg atgaatgcat 3120 ttataggcgc gccaacgttt tgggtgatca ctacattgcg ctctacgtag acgacctcct 3180 cttctttggt cccgaccttg gggagattga ttgagtcctg gatcaactca acacccttta 3240 tggcgtaaaa tgtctcggcc cggctaactg gattctaggg gttcaggttg tacggcatga 3300 tgatggcggg atcaccctcc tgcagcgcca ataccttgtt gacgtgttgg cttgtttcgg 3360 catgtccagt tgcaatccct gcaaatcgcc aatggaggcc aacctccagt tgttgcccaa 3420 gatcgacccc gatgaagccg acaactcgac atatcgttct atgatcggct ccctcatgta 
3480 cgccgttgtt gccacccaac ccaatcttgc acacgctgtg gggtatctct cctggtttgt 3540 tggaaaagcg ggggcagcgc atctcgaggc cgtcaaacac gtcctttggt acatcaaagg 3600 ctcgctcgat ctgggcatcc attacatggc taacaacaaa cccttgctgg gatacgaggg 3660 gtattctgac tcggactggg gatccgacgt gaacacgttg aggtcaacaa tgggctactt 3720 gttcaaactg gctggtggca caatattgtg gtcatcccgt ctccaatccc gagtggtgtg 3780 ttcatctacg gaagccaagt atctgggtct ttcgcacgct gccaaggagg cagtcttcct 3840 ttgctcgctc ctcaccaagc tcggccttga cacatcgttg ccccttcgtc tgcttggaga 3900 caatcaaggc gctattgcgc tcacccagaa cccggtcttc catgcttgca ccaaacacct 3960 tcgcatgttg gagcactttg ttcgggagca cgttcgaaat ggagagatct tggtcaccta 4020 catccctacc catgacatgg tggccaatat cttcaccaag gcgctgcctc aagcaatttt 4080 ccagcgtcat tgcgcgctat tggccttcgg caaatttcag gccaagagca aggggggg 4138 // ID Copia-1_MPA-I repbase; DNA; FNG; 6501 BP. XX AC ADBL01000478; XX DT 26-MAR-2011 (Rel. 16.03, Created) DT 26-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Magnaporthe poae genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_MPA_; KW Copia-1_MPA-LTR; Copia-1_MPA-I. XX OS Magnaporthe poae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-6501 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Magnaporthe poae genome."; RL Direct Submission to RU (26-MAR-2011). XX DR Genome; ADBL01000478; Positions 2112 8612. XX CC Positions [3776-4093] - Integrase core CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1035..3848 FT /product="Copia-1_MPA-I_2p" FT /translation="MGRKKRKASKSNRSYFDMAEGFSMAMDENLADQVQRQ FT GTEISNLTNQVDLLQQQNQKILALLQALQPKEENPPGGTLDRGKGKAVEDV FT KLESSAVPAGGPATGGPAPGGSPAGGPNNAEGDSGPPKEIWAFTNPAIKEM FT SKSVYSFPEAHKLRGAQNYQEWRHALVIQFTAVGLADFLINPMLANSVSAA FT DQATILLLLKNSCSAETIHTITWETSPVNALKALQDSYSLTPEIQRDALYR FT EFHALDFSKYKGSISEFNSQFTSLLIRLRNCSVEISAIDIKNQYLKALEGS FT FPQWAERLRSTIRSAIALGQSTEGINLQYLMADLLAETNNPISTAAKQAAS FT HRAQRQNKKPKDSAEGAKEKNKAGKKGKNSDPKDKAAGQKGEPSEDKPKEK FT DFKGKKSKGKKEKDGNESPDNKKGGKSTAFVAVNYDLITGSTTESATEQIS FT GTNYAKDSGIVDLDALDLSNSSESDSESGSDSDANTVDITHCKCCDHAKSN FT TRDSKYKGMSPVLYDTGSTDHIFNSMESFTAFSKNCNVVIRTGGGLVYPLG FT VGTVKLPVRCSEKPGDFEEITLKEALYIPSFDVNIISGLRHYRAGGALYNQ FT KLFNAERHCWGILNVKDHGFFLQCKGYAYPTVRHRQNRYCSFYGLAQRITI FT DLKNTPPADREKYVRFDTDSEENTDSSDEQPLKEPPKGKAILNRKKKAVIN FT SDMPKLSAESPEDSRVVEPCELGRPLGKSPEKPIPEDITLPKRGTSHARDP FT LPESPPVGKVQAAPFINITPKSLQKSADGWVSAEGDVIKGCDEETALKALL FT WHRRLGHIGLTLLKKTAKITDGLPNFDKIQGLQCVICAKTTAVRRTGKGPL FT PSPGNVLDSIEGDTVVLSPTPHNKKPVILLLVDRKSRFRWVFQLPNKQGPT FT VMAAIKSFFRALKATYGRTLLVFSSTGVKKSIRR" FT CDS 3947..5788 FT /product="Copia-1_MPA-I_1p" FT /translation="MDRLRATINAAGLPHYLWCYVLSAVVELVNATAVTNR FT ETTPYQEFYDEVEPGPRHRPDLGHYRVIGTHAEALIPLEHRAKSKKLATRT FT EPVRLLAALSKSTVLVYAPARRAIYKTSTIKILEGVPVKNLVPEGDMILEG FT VSIENPPEGSASSAPNNAPRAPDSDSESDMEPDFQRYDSPAVPDPDYRLPE FT PVIPTVRPPEKPIVMAPPTPKIQEIVDEIPVLQDLLGGKTPNPDAMEVDEL FT VQYICYKATQTIKRKKPTRDGEPNSYKEALKSPYKKEWLTALGLEFEQLLR FT AGTFNFLPQSALPKGRKLITCRPVFKLKKDNQNRPVKYKARLVAKGFLQIE FT GLDYFETFASTSIPPTWRILLAFAAAKNWEIEQIDFIGAFLNSDLDVDIYL FT SIPEGFSEWLATATTDIKQLANNLGFKPSEKQVILLKKALYGLKQGPREWQ FT NKLISLLKEEGYNQLISDPAVFFNAKIKHFIITFVDDCLIIGPDMRYISAL FT KRSLHKVYALENRGPAAYFLGVQIIRDRPKRQLWIHQSQYIDEMLKTFGLD FT ASRSISIPLQPGVLKDTKGQPVTSAETKLLQRIIGTVMYLMLLTRPDISFA FT VSMDFTPFNKGHQYAY" XX SQ Sequence 6501 BP; 1926 A; 1425 C; 1511 G; 1639 T; 0 other; ggttatgagc ccggggagat cctcggattg acgtgctgta gccagtgcaa ttggcgctgt 60 ggcttacagc ggttgcagag 
gttaaccaaa ccctagcgca tttgctatag ggatcttgga 120 aggcctgcac tagtgcgaag tgaaccctag gggttgtttg tgacctgctg attgcaaagg 180 acggattgat tgagacgcga cgtgggaata gcttcagcga tcgtgcagcg cttggaccaa 240 ctgggccacg tgcgcacgcc ccacaacgcg actcagcagc ccccccctgt aactgcagcg 300 ctacgagtaa tcctttaggg ggagaagtgc actagtactg caaccaaata gtggctggtt 360 aggtgcttgc acctgctagt aggcgctgag gagcgctaat acctttggct ggatctttta 420 gcctgggtgc gtttgcatcc aacaagtcta gctgcccctg ggtggcctaa aaataggctt 480 gtgctatagc gctgtcgagc gctatacaag tacgaaaagt agcgctgtcg agcgctatat 540 aaatacgaaa cgtaccatat ttgtgggggt aacgttgcgt agactggtat gcagccttgt 600 aggagataca cctcggtcaa gtcagtttat aacctaccct atatgtactg ccctcacggc 660 atattcacta gagcggttgg ggccaaggcc taccaactta accggtgttt gcagtgcttt 720 agcacctaaa agaacgctga atagctgcat tttagggtct tgttaaagta cgtggtattg 780 gaaatccctt tgttatttgt aacgttaggg tcaactagta atagttgtta aggcattgca 840 atcttttacg agtcggtgga ttttcccagg cgaaaaagat ttataagcct tataaaagcc 900 ctaaagatta gcaaccttta ggccagctgt aaaaaattca gctgttaggc ggttcatcct 960 gaaaattggc acctttcagg gcaattaacc aaggttaatt gcaataagtc ccagggggtg 1020 gcagccctgg gcgaatggga cggaaaaagc gtaaagcttc taaatcaaat agaagctatt 1080 tcgatatggc agaggggttc agtatggcaa tggacgaaaa tctcgctgat caggtgcaaa 1140 gacagggtac tgaaattagt aaccttacta atcaagtaga ccttcttcag cagcaaaacc 1200 aaaagatttt agcgctttta caggccttac aacctaaaga ggaaaaccct cctggaggga 1260 ctcttgatag gggcaagggt aaggctgtag aagatgttaa gcttgaaagt agtgctgtac 1320 ccgctggagg acctgccaca ggaggtcctg cacctggggg atcacccgct ggggggccta 1380 ataatgctga aggggattca ggacccccta aagaaatttg ggcctttaca aacccggcga 1440 tcaaagagat gtccaaatcc gtatattctt ttccagaagc ccataaactg cgaggggccc 1500 aaaattatca agaatggagg catgcgttag tgattcagtt cactgccgta ggcctggcag 1560 atttcctaat aaaccctatg ctagcaaata gcgtatctgc agcggatcag gccactatac 1620 tgttgctact aaagaacagt tgtagtgcag agactattca cactattacg tgggagacgt 1680 cgcctgtaaa tgcccttaaa gccttacagg atagctattc gttaacgcct gaaatccaac 1740 gagatgcact atatcgggaa ttccacgcgc tggattttag 
caaatataag gggtcaatta 1800 gtgaattcaa ctcccagttt acctcccttt tgataaggct ccgtaattgc agtgttgaaa 1860 ttagtgctat tgatattaaa aatcaatacc ttaaagccct ggaagggtcc ttcccccaat 1920 gggctgaaag gcttcggagt actatccgta gtgctattgc cttagggcaa tctaccgaag 1980 gcattaattt acagtacttg atggctgatt tgcttgccga aacaaacaac cctatatcta 2040 ctgcagccaa acaggccgct agccataggg cccaaaggca aaataaaaag ccaaaagata 2100 gtgctgaggg cgctaaagaa aagaataaag ctggtaaaaa gggtaaaaac agtgacccta 2160 aggataaggc cgcaggccaa aaaggagagc cttccgaaga taagcctaaa gaaaaagact 2220 ttaagggcaa aaaatctaag ggtaaaaagg aaaaagacgg taacgaatcc cctgacaaca 2280 aaaaaggcgg caaaagcaca gcgtttgttg cggtaaatta cgacctaatt acaggtagta 2340 ctaccgaatc tgccacagag caaatttcag gtactaatta cgccaaagat agcggaatag 2400 ttgatttgga cgctttggat ctatcaaatt cgtccgaatc ggattcagag tcaggtagcg 2460 actccgatgc aaatacagtt gatataacgc actgtaaatg ctgtgatcac gctaaaagta 2520 ataccaggga tagtaaatat aagggcatgt cccctgtatt gtacgatacc ggtagtaccg 2580 atcacatctt taattccatg gaatctttta cagcgttttc aaaaaattgt aacgttgtta 2640 tccgtaccgg tggaggcctt gtataccccc taggcgtcgg tactgttaag ttacctgtcc 2700 gatgctctga aaaacccggc gatttcgagg aaataacgct aaaagaggcg ttatatattc 2760 ctagctttga tgttaatatc attagcgggt tacgccatta cagggctgga ggtgccctgt 2820 ataatcaaaa actctttaac gctgaacgtc attgttgggg catattgaac gtaaaagatc 2880 acggtttttt cctccaatgc aaaggctacg cctaccctac tgtccgccat cgccaaaata 2940 ggtactgtag tttttacggc ttagcacagc gtataactat agacctaaaa aatactcctc 3000 cagccgatag ggagaagtac gttcgttttg acactgattc ggaagaaaat acggacagtt 3060 cagacgaaca accccttaag gaacccccta aagggaaggc tattttaaac cgtaagaaaa 3120 aagcggttat aaatagcgat atgcctaagc ttagtgcaga aagccctgaa gactcccgtg 3180 tagttgagcc ctgtgagctc ggcaggccac taggcaagtc ccctgaaaag cctattccag 3240 aggacattac tttaccaaaa agggggactt cccacgcaag agatccgttg ccagagtctc 3300 ccccagtagg taaggtacag gcggcccctt ttattaatat tacccctaaa agcctccaaa 3360 aatcagcaga tggctgggta agcgctgaag gtgatgtaat aaaagggtgt gacgaggaaa 3420 cggccctaaa agcgttgctt tggcaccgtc gattaggaca tataggccta 
acgctgctaa 3480 aaaagaccgc taaaatcacg gacggtctcc ctaatttcga caagatacag gggttgcaat 3540 gcgttatttg tgccaaaact acagcggtgc gcagaaccgg aaaaggcccg ttaccctccc 3600 cgggtaacgt tctggattca atagaaggtg atacagttgt actatcccct accccgcaca 3660 ataaaaaacc cgtaattcta ctgcttgtgg atcgcaaatc acgattccgg tgggtttttc 3720 aactaccaaa taagcagggc cctacagtaa tggccgcaat aaagtccttt tttagggctt 3780 tgaaggctac ctacgggcgt accctactcg ttttttcttc gacgggggta aagaagtcaa 3840 tacggcgctg aaggaatggc tagaacgcaa aggcacagcg tttagcacgt cgaatcccta 3900 taaccacggc caaaatggcc tatccgaacg ctctattagg gtagttatgg acaggcttag 3960 ggctacaatt aatgcggcag gcctaccgca ctatttgtgg tgctatgtcc ttagcgctgt 4020 ggtcgaatta gtaaacgcta ctgcagttac taatagggaa acgaccccct atcaagagtt 4080 ctacgacgaa gtagagccag gccctcgtca caggcccgat ttgggccatt atagggttat 4140 tggtacccat gcagaggctt tgatacccct ggagcacagg gctaaatcaa agaaattagc 4200 tacaagaaca gagcctgtta ggctcctcgc agccctgagc aaatccactg ttcttgtata 4260 tgcacctgcc cgaagggcta tatataaaac ctctactatt aagatattag agggggtgcc 4320 tgttaaaaat cttgttccag agggggatat gattttagag ggggtatcta tcgagaaccc 4380 ccctgaagga agcgcttcta gcgcccctaa taatgctcct agggcccctg atagcgattc 4440 cgaaagcgat atggaacctg attttcaacg ctatgattct cctgcggtgc cagaccctga 4500 ttataggctc cccgagcctg taatcccgac tgttaggcct ccagaaaagc ctatagttat 4560 ggcgcctccg acgcctaaaa tacaggaaat agtggatgaa atacccgtct tacaggacct 4620 tttgggaggc aaaacgccta acccagatgc aatggaagtc gacgagctag tgcagtatat 4680 atgctataaa gcaacgcaaa ctatcaaaag aaaaaagcct acaagagacg gtgaaccaaa 4740 ctcttataaa gaggccctaa aaagccccta taaaaaagag tggctaaccg cattagggct 4800 tgaattcgag caacttttac gcgctggaac ctttaatttt ctaccgcaat ctgcactacc 4860 aaaagggcgt aagcttatta cctgtcgtcc ggtatttaag cttaaaaagg ataaccaaaa 4920 tcgccctgta aaatacaaag cgagactggt agcaaaaggt tttttgcaaa tagagggcct 4980 ggattatttt gaaacctttg cttctactag tataccgcct acctggcgta tattattagc 5040 gtttgctgcc gccaaaaatt gggaaataga gcaaatcgat ttcattgggg catttcttaa 5100 tagtgaccta gacgtagata tttaccttag tattccggaa gggttttcag aatggttagc 
5160 tactgcaacc actgatataa agcaattagc caacaatcta ggctttaaac cttccgagaa 5220 acaggtaata ctattgaaaa aggcgctata tggtttgaag cagggacctc gtgaatggca 5280 gaataagcta atatccctac taaaagagga aggatataat cagctaattt ctgatccagc 5340 ggtctttttc aacgctaaaa taaagcattt tattattacc tttgtggacg attgtttaat 5400 aattgggcct gatatgcgtt atattagtgc cctaaaaagg tcgttacaca aggtatatgc 5460 gttagaaaat cgtgggcctg cagcctattt tttaggcgtt caaatcataa gggataggcc 5520 taaaaggcag ctttggatcc accaaagtca atatatcgac gaaatgctaa aaactttcgg 5580 gttggatgct tctcgatcta ttagtatccc tttacaaccg ggcgtgctaa aagatactaa 5640 aggacagcct gttacgtccg ctgaaacaaa attgttacag aggataatag gtaccgttat 5700 gtacttgatg ctgttaaccc gtccggatat cagtttcgct gtttcaatgg atttcacgcc 5760 atttaacaaa ggccaccaat acgcatatta atgccgctaa aaacctatta aggtacttta 5820 gcttgcacgc gtaatttagc tattcggtac agctataacg ctaaaaatac ggggccttac 5880 gaacccgttg caagtactag gttttagtga cagcgatttt gcgggtgata tggactcttc 5940 caaatcaacc tatgggtatt tatttaccct attagggggt cctatctgtt ggaagtgcaa 6000 aagggctagt actatagccc ttagtacggt cgaggcagaa actgatgcct taacggaaac 6060 cattagggag gcaaaatggt taaggtacct ttttatagag ctacaaaccc ctataaaagg 6120 tcctatagct gtttttggtg ataaccaagg ttctatttcc aatgcccata accccaacct 6180 acattcgcga actaagcata cgttgctaaa attccatttt gttagggaag aggtaggggc 6240 tggaactatt acaatagaat acttggacac taaaagcatg cccgcggacg gcctaacaaa 6300 gccgctaact cctccaaagc ataaagcctt tttggggctt atggggctcc ggcctaaagc 6360 ctagtttttg cttattgctt aacggttagc gcttataata ctggttaacg tattataaac 6420 cccctatcag cactgttatt ttcttttccc gactagatag cttttcctat ttttttgtat 6480 aaaatactca aagagagggg g 6501 // ID Copia-2_CDC-LTR repbase; DNA; FNG; 170 BP. XX AC NC_012866; XX DT 06-FEB-2011 (Rel. 16.02, Created) DT 06-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Candida dubliniensis genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_CDC_; KW Copia-2_CDC-I; Copia-2_CDC-LTR. 
XX OS Candida dubliniensis OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-170 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Candida dubliniensis genome."; RL Direct Submission to RU (06-FEB-2011). XX DR Genome; NC_012866; Positions 639794 639625. XX SQ Sequence 170 BP; 55 A; 25 C; 33 G; 57 T; 0 other; tggagtaaaa tagtgagcat tttacacgac aattctagca gaaatcttct gacgttgtca 60 acgacgagca gtgaaaaatt ttcaccagaa aacttttctt tgatactttt tctttttaga 120 aattggaggt ttaagaggtc agtttggaac attctacgtt aaagattgca 170 // ID Gypsy-1_CDC-I repbase; DNA; FNG; 5365 BP. XX AC NC_012866; XX DT 06-FEB-2011 (Rel. 16.02, Created) DT 06-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Candida dubliniensis genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_CDC_; KW Gypsy-1_CDC-LTR; Gypsy-1_CDC-I. XX OS Candida dubliniensis OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-5365 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Candida dubliniensis genome."; RL Direct Submission to RU (06-FEB-2011). XX DR Genome; NC_012866; Positions 428949 434313. XX CC 'TTATT' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1209..5258 FT /product="Gypsy-1_CDC-I_1p" FT /translation="MLQLLQQMERMNERVGLIVDDLEAVKLKQEKLDKTAK FT TQMMNDAIGQSTEAEGINTFQDAAEMPVMNLNPELSTSREMLGLDHTVDIT FT ANTYQTEIQKWKSYELNMDDILECPLIKHPNKNAEMAALKYLNTIRFKENK FT SNYEKEVLAYNRSEKLPKYDSINSFGDLAIYLNLVLDLKVKYIIPDYNLRT FT HIKAACNKVDDEDFKKMVNYLIKPTKQASSPIRYNTILRSIQMKLPRTDKS FT DLVSTIKRTIESMEDATIILETITVQLDEYNEKEDLRDISVGDWIRIWKSI FT AYAMPDDYKEVYMNMVDIRLANALRKIGNRNNTITDLEDHTQYNYNELWKG FT FKNTLYELAPLTSFQAIAKKKNDISQNQRMVFKSEGRLHNFKMFNDEVVIQ FT DNEYEIKETIRDLLKEPLSQSMLTDKQKDYFIEKLQDIKDVFQTPTGEPGR FT LKHFPPVRIHANDHEPWRLKTIPLGEKKSIAVDILEKMIAEDQLEFAPTSA FT YRNPWFLLRKPSGGFRFLIDLRVLNSFVELEAGHPKDVREIIRNISGKKYL FT TTLDISNAYFQLELDERDRDITAFVTPIGVLRFKVMPQGFKNSVSYFTNVL FT TKILRSVSKFTESFLDDIAILGPDASQDSDDDTDMIRHLDDVVATLLLLHE FT HGLKINAAKLQIAMKEADFLGYRVNEQGATLLRKRVSAFEEFPIPNTLTKL FT ERFLGMTNYYRHLIPAYSEIASPLHKLVTATRKEQKKSLTLNEKELKHFEY FT LKKCLVLEPVVTSLNKEDEVMLFTDASSLSWAGVLESKNTDGNVVVVDCVS FT GSFNTTQGNYTIYEKELAAICFSLEKLEMHLLNYDKVIKIYCDNKAVVTLL FT NGSFTNGHLMNRVAKWLLFLRNYNIEIAHIDGKSNIVADCLSRIEDPTSTP FT ILPLNLTEDMDYLKSKMEIKTNFAQITDDDPKYGRFSLQGIQNYLTTVQIP FT QIYNTNNKIRKGFLSKAQEFYLDDGKLYKRGAKGHFARQVIMDKSELERIL FT RMTHEERGHMKLQNLFNYLNLLFFIPNLYKILQDYIASCHTCQTFDGHSIG FT RDPLYINLPGGLFDKIVCDSVFIDDCWLVIARDEFSNWAEATVMPTLDGSK FT VADFIYKDIICRFGQFRILKSDNGSEWKNQFMKRLLDHYNIQLGFLIQHHP FT QGNGLVERNHVGLVNFLKKLPKHLNWQDYVDAALRVDRNTIKSTTGMSPHY FT LVYGYCGHSDLSLLYANPPENKNYTKEDLFKFRFKQLQYREVQYNSAYKTT FT EIQRLRTKAYFDAKFEIDTPVEIGDMVLIWDRPHKNPMGAKMKKMQPKWSG FT PFKVKSKGDRIYRLEDLDGTELSRPFAREMMKLYIKRK" XX SQ Sequence 5365 BP; 1882 A; 937 C; 1089 G; 1457 T; 0 other; ggatatatac attttggtgg tccctagcag gacaaacaga tcttagaagc accatgccta 60 aactggaaat taattcgata tttggtcaaa aaaaaatgta atagggaact tcacctcgag 120 gaatgagatt gattactgat tgtgaaatgg ttagctagct attcaagcat ccgtaggcag 180 tcggatgacg ccttcgaaaa caaaggagat aagttcaaag gactctgggt tgacgttggt 240 ggtgatgaga cagtactggc caaacactgt tttcgatagt tatactacac cacccgggta 300 tgggtaaatg tctacagaat aattgagaat 
ggagcctgag cagagttgcg ccctagacat 360 ccgagatcct cacgtattaa taggtaattg agactgacgc gtaacgtgga aaaggactat 420 cccaatatat taataaatcg gacctgagac tctgcgacca tgctaataga cagtactggc 480 gggacgagga cgtacttcat aataaatcgg acctaagact ctgcgaccat gctaatagac 540 agtactggcg ggacgaggac gtacttcata ataaatcggg cctaagactc tgcgaccatg 600 ctaatagaca gtactggcgg gacgaggacg tactacaaat aaatcgggcc taagactctg 660 cgaccatgct aatagacagt actggcggga cgaggacgta cttcataaga aatcggacct 720 aagatgccgc gatctgatta catacaatag atccggaaga ggacgtacac aaaaaggtgt 780 aattgaaaaa attttcattc acatcataca aagagatgag caataaaaag gaattagaat 840 ggttctaatt ggtcaacaag ctgaaacgac ctgtataaag aagaactagt actgaattaa 900 tggaatttga tgaaagttct tccaagtttg aaaaattgcg aaacgcgaga aaaataaaat 960 aaatggaagc tctcatctaa ggataaggaa gaactcagaa gaagaaacgt acttaccaaa 1020 acaagaaaaa aggagaacag aaatcactat gtctgctaac aaaactaacg aaagatcaag 1080 gccatctgga ttatcagcca aatatccggc acctgacttt aatcaaaaaa gagagagttg 1140 ctaaggatgc tgcagatgaa gaggttgagg ttgatgctaa ggttcaagtt gcgaaaaagg 1200 atgacatcat gttacaactt ttacaacaga tggaacgcat gaacgaacgt gttggtttga 1260 ttgtcgatga cttagaggca gtcaagttga aacaagagaa attggacaaa acggctaaga 1320 ctcagatgat gaatgacgct attggacaaa gtactgaagc tgaaggtatt aatacctttc 1380 aggatgctgc tgaaatgcct gtgatgaact tgaacccgga attgtccact tctagagaaa 1440 tgttgggatt ggaccacacg gtcgatatca ctgcaaacac ctaccaaacg gaaatccaga 1500 agtggaaatc atacgaactg aacatggatg acatattgga gtgcccactc atcaagcacc 1560 ctaacaaaaa tgcggaaatg gccgccttaa aatacctcaa tactattcga tttaaggaga 1620 ataagtctaa ctatgagaag gaagttcttg cctacaatag aagcgagaaa ttacccaagt 1680 acgatagtat taactccttc ggcgacttgg ccatttattt aaatttggtt cttgacttga 1740 aagttaagta tatcattcca gattataatt tacgtaccca cattaaggct gcgtgtaata 1800 aggtagatga tgaggatttc aagaagatgg ttaactattt gatcaaacct acgaaacagg 1860 cctcgagtcc aattagatac aataccatct taagaagtat ccaaatgaaa cttcctcgta 1920 ctgataagag tgacttagtc agtacaatta aacgtactat tgaatccatg gaagatgcta 1980 ctattatttt agaaactatc acagttcaac ttgatgaata taatgaaaaa 
gaggatctga 2040 gagatatctc cgtgggagac tggattcgca tttggaaaag cattgcttat gcaatgcctg 2100 atgactacaa ggaagtttat atgaatatgg tagacatacg cctggcaaac gcattaagaa 2160 aaattggtaa tcgtaataac actattacgg acttagaaga tcatactcaa tataattaca 2220 atgagttgtg gaaggggttc aaaaatactc tttacgaatt agccccactc acgagttttc 2280 aagctatcgc taaaaagaag aatgatatct cccaaaacca aagaatggta ttcaaaagtg 2340 aaggtagact tcataatttc aagatgttta atgacgaagt agtgattcag gacaatgaat 2400 atgagataaa agaaactata agagacttat taaaagaacc gttatctcaa tcgatgctca 2460 ctgataagca gaaagattat tttattgaaa aattgcaaga cattaaagat gtattccaaa 2520 cgcctacagg tgaaccagga agattgaagc atttcccacc ggtccgtata catgctaatg 2580 atcatgaacc gtggagacta aaaactattc cccttgggga gaagaagtca attgctgtgg 2640 atatcttaga gaagatgata gcagaagatc aacttgaatt tgctccaact tcagcatata 2700 gaaacccatg gtttttatta cgtaaaccat ctggtggttt cagattctta atagacctac 2760 gggtcctcaa tagtttcgtt gaattagagg caggtcatcc taaagatgtc agagaaatta 2820 ttcggaatat tagtggcaaa aaatatttaa ctacacttga tatttcgaat gcttattttc 2880 aattggaact tgacgaacgt gatcgtgaca tcaccgcgtt tgtaacacca atcggagtgc 2940 taagatttaa ggtgatgcca caaggtttta agaatagtgt tagttatttt actaatgtgc 3000 ttactaaaat tttaagaagt gtctcaaaat tcactgaatc attcttagat gatattgcaa 3060 tattaggtcc tgatgcttct caagattcag atgacgatac agatatgata cgacatttag 3120 atgatgttgt tgccactttg ctgcttctac atgaacatgg attaaaaatt aatgctgcaa 3180 aacttcaaat agctatgaaa gaagctgatt tcttaggcta tcgggtaaac gagcagggag 3240 ctacattatt acgaaagaga gtttctgctt ttgaagaatt ccctattcct aatacattaa 3300 caaaattgga gagatttttg ggaatgacaa attactatcg ccacttgatc ccagcttaca 3360 gcgagattgc atctccgttg cataaattgg tcacggcgac aagaaaggaa caaaagaaat 3420 ctcttacatt aaatgagaaa gaactcaaac attttgaata tttaaaaaag tgcttggtac 3480 tggaacctgt cgtgacatcc ttgaataaag aagatgaggt tatgctcttc acagatgcca 3540 gctccttgag ttgggctggc gtcctcgaat ctaaaaacac tgatggaaat gtggttgtgg 3600 ttgattgtgt ttcgggatcc tttaatacta cccaagggaa ttatactata tatgaaaagg 3660 aacttgctgc tatatgcttt tcgttagaaa aacttgaaat gcatctatta aattatgaca 
3720 aagtcataaa aatctactgc gataacaaag cggtagtaac cctcttgaat ggtagtttca 3780 ctaacggtca cctgatgaat agagtggcca aatggctact gtttttacga aactataaca 3840 ttgaaattgc ccatatcgat ggtaaatcca atatagttgc agattgtcta tcgagaatcg 3900 aagaccctac atctacaccc atactacctt taaatttaac agaagatatg gactatttga 3960 agtccaaaat ggaaattaag acgaattttg cacaaattac tgatgacgat cccaagtatg 4020 gacgcttcag tctacaggga atacaaaatt atttaactac agttcaaatt ccccaaattt 4080 acaacactaa caacaaaata cggaaaggat tcctttctaa agcacaagaa ttttacctcg 4140 acgatggaaa attgtacaaa cgtggagcta aaggacattt tgcccgccaa gttatcatgg 4200 acaagtccga attagagcgt attttacgca tgacacatga agaaaggggt catatgaaac 4260 tccagaactt attcaactac ttgaatttat tattctttat tccgaattta tacaaaatct 4320 tacaagatta cattgctagt tgccatacat gtcagacatt tgatggacat agtataggcc 4380 gagatccgtt atatatcaac cttcccggag gtctttttga taaaattgtt tgcgattctg 4440 tttttattga cgattgttgg ttagttatcg cccgggacga gttttcaaac tgggcggagg 4500 caaccgttat gcctactttg gatggctcca aagttgctga ttttatttac aaggacataa 4560 tatgccgttt tggacaattc agaatcctaa aatcagataa cggttctgaa tggaaaaatc 4620 aattcatgaa gagattgtta gaccactata atattcaact gggattcctg atacaacacc 4680 acccacaagg taatggtcta gtggaaagaa atcatgtcgg attggtaaac tttttaaaga 4740 aattaccaaa gcatttgaat tggcaagact acgttgatgc tgctttgaga gttgatcgga 4800 atacaatcaa atcgaccaca ggtatgagcc cgcactatct agtgtacggt tactgtggtc 4860 attccgacct ctctttgtta tatgctaatc ctccagagaa taaaaattac actaaggaag 4920 acctattcaa gttcaggttt aaacaactcc aatatcgtga agtccaatat aattctgctt 4980 ataaaacaac ggaaatccaa cgattgagaa ctaaagcata tttcgatgcc aaatttgaaa 5040 ttgatacacc tgttgaaata ggtgatatgg ttctcatctg ggaccgacca cataaaaatc 5100 caatgggtgc taagatgaag aaaatgcaac ccaaatggtc cggacctttc aaggtgaagt 5160 caaaaggtga tcgaatatac agacttgaag atttagatgg aacagaatta tcgagaccat 5220 ttgcaagaga gatgatgaag ttgtatatca aaaggaagta attattcgag ttgatgatga 5280 ttctccaata gatgttaatt tgattaatga tggtaattta aaatgaaaat ccaaggcttt 5340 acatttcaaa aatcaggcgg gaacc 5365 // ID TSK1_I repbase; DNA; FNG; 5250 BP. 
XX AC AF492702; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Saccharomyces kluyveri retrotransposon TSK1_I, internal region. XX KW LTR Retrotransposon; Transposable Element; RNaseH; TSK1_I; gag; KW integrase; internal region; pol; protease; reverse transcriptase; KW internal portion. XX OS Lachancea kluyveri OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Lachancea. XX RN [1] RP 1-5250 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AF492702; Positions 323 5572. XX SQ Sequence 5250 BP; 1854 A; 1210 C; 891 G; 1295 T; 0 other; tggtagcgcc tgtgcttcgg ttacttctaa agaagtccac acgaatcaag atccattaga 60 cgtctcagct tccaaattcg acgaatatga gaacaattcc accaaggcta gttctcaacc 120 tgaagaaaca cctgtgtcat cagctgttcc cgagaacgct catcatgcct ctcctcacac 180 tgctcaagca ccattaccgc agaatgggcc atactcacag cagtgcatga tgaccccaaa 240 ccaagccaat ccatctggct ggtcagtata cggacaccca tatatgatgc cgtacacacc 300 ttatcaaatg tcgcccatgt actatccacc tgggtcacaa caacagtatc cacagtatac 360 atcaggtgtt ggcacgccat tgagcactcc atcacctgag tccagcaata cacctaccgg 420 accaccatca gcaaaatcta atatgacacc cgttaacaaa tctgtcagac caccaccgtt 480 tttaacctca tcgagtgaat tcctaatttg ggttagaaat tacatcaagt ttttacaaaa 540 ttccaatctt ggtgatatta ttccgaaaac caacggaaaa gctacgcgtc agatgacata 600 tgacgagcac actttcttgt acaacacttt tcaaacgttt gctccatctc aattcctacc 660 tacttgggta aaagacatct tatctgttga ttacacggat ataatgaaga ttctcactaa 720 aagcatggaa aaaatgcaat ccgacaatca agaggtaaac gactatatta cgctcgctaa 780 cctgcaatac gatggtagta tccctgcaga tatgtttgaa acacaagtaa tcaatactat 840 tgacagactt cgcgacagcg gtcttcatat caacgacaag ctagcgtgca aattaattag 900 gagaggcctc tctggtgaat atagattctt acgatacgca cgtcatcgcc atctaaacat 960 gacagttact gaactattca tggacattca tgccatatat 
gaagaacaac aggaaacgag 1020 atacaataag cctacatact cgaagaatcc tagtgataag aggaatactt ctcgagcctt 1080 tacaaataca aataaagcca aaactgtaac tcgaaaccct caaaaaacaa gtaactcaaa 1140 atcaagaaca gtcaagacta ataacgtgtc tacatcccac aactcttttg acgaaaataa 1200 tgattcgatc aacgaatcac ctgatcaaac aatatacttg aacaatcagt acgaccttca 1260 tcttaggcca gaaacatact gagtccaaag taaaccacac ggatcattca aatgatgaac 1320 ttcctggaca ccttctcatt gattcaggtg catcacaaac ccttatcaga tccgctcatc 1380 acatacactc agcgtcacct aatattgaca taagcgcaat tgatgctcaa aaaaaggaaa 1440 taccgatcaa cgccattggt aatctccaat tccaattcga cgacaatact aagacatcaa 1500 taacagtatt acacactccg aacatagccc atgatctgct cagtttgagt gagctagcta 1560 aacaaaatat cacagcctgt tttaccaaaa ataccttaga gcgatctgac ggcaccgtac 1620 ttgcccatat cgttcgacat ggtgactttt actggctatc caaaaaatac ttgattccct 1680 cgaacctatc tgtaccaaca gtcaataata tcggtactga taaaggctca cacaagtatc 1740 catatccttt gattcatcga atgcttggac acgccaatgc tcaaaccatt cggaattctc 1800 tgaaaaataa ctcgatcacg tatctaaagg aatcagatgt cgactggtct aaagcgacta 1860 cttaccaatg cccggactgt ttaatcggca aaagcaccaa acacagacat gtcaagggat 1920 cacgagtaaa ataccaaaac tcgtatgaac ttttccaata cctacatacc gacatatttg 1980 gacctgttca caacctgcca aaaagtgcac catcctactt cataacattc accgatgaga 2040 aaaccagatt tcaatgggtt tatccattac acgatcgtcg tgaacaatcc attctcgagg 2100 tttttaccac aatactagcc ttcattgaaa gacaattcaa ggccagtgta ttggttatcc 2160 aaatggaccg tggatctgaa tacaccaaca aaactctcca caaattcctt cttaaaaagg 2220 gtatcactgc atgctataca accacagcag actctcgagc tcatggtgta gctgaacgat 2280 tgaatcgtac tttactagac gattgtcgca cacagcttca atgcagtggg ttaccaaacc 2340 atttatggtt ttccgcagtt gaattttcaa ccatagtcag gaactcttta gtttcaccca 2400 aaaacgagaa atctgcacga caacacgcgg gtttagcagg actcgacatc agcaccctac 2460 taccgtttgg tcaacctgtg gtagtcaata atcacaatcc tgattcgaaa atacatcccc 2520 gtggcatccc tggctacgct ctgcatccat cccggaactc ttatggatat attatctatc 2580 ttccatcttt aaagaagaca gtggacacta ccaattacgt tattctgcaa gacaagcaat 2640 ccagattaga ccaattcaat tacgacgcac tcacctttga tgaagacata 
agtcgactaa 2700 ctgcttcata caaatcgttt atcgattcaa atgaaataga acagtcatgc gaacttcaca 2760 tggattctga tcataatttt caatctgaga tagaccatgg cactgaaccc cggagagatg 2820 acagtcccac agacctgatt gcaactggtt cgctatcaac ttctactcaa ccaacgataa 2880 tggaacccgt acctactact aatgttcgtg cacccaaaga agttgacccg aatatatccg 2940 aatccaatat ccttccgtca aagaagaggt ataacgtgcc caatattcct gatccagaca 3000 gtacctgttc gggtggtacg aatagaccag atgtatctat tgttatccct gcgcacgact 3060 ctgtcacaca acagtcgaat aaatacaata attacgataa agtgagtgat ccaggtccct 3120 acagcaatac tgataccggt ttcacgaacg ttacaacacc ctgtttgggt ggtaccaaca 3180 acaagaatat tccagatctc aacgatcaga ctatcgagaa aagatcgata agctgcccaa 3240 ttcctctaga tgattcttca cagggaagtg agataccaaa ctataacact cctattgata 3300 caccagacac ctactctgaa catgacgtca aggagtatat taccgaagat cttccgcttc 3360 ccgaattacc tccagaatct cctacctgtt tccccgatgc atttaaagaa attccaccga 3420 tcaactctcg tcaaactaat tccagtttgg gtggtatggg taaacctaat atctatgcta 3480 ctatcgatag taagaaaaga tcattagaag ataatgaaac tgaaattaag gtatcacgag 3540 acacatggaa tacaaagagc atgcgtagtc tagaacctcc aagatcaaag aaacgtatac 3600 acctgatcgc agctgtaaaa gcagcaaaat cgatcaaacc gatacgaaca accttgagat 3660 atgatgaggc aatcacatat aataagaata tcgaggaaaa ggaaaaatac attcaagcat 3720 accataaaga agtaaatcag ttactaaaga tgaatacatg ggacacagac agatactatg 3780 acaaaaatga aatagacacc aaaaaggtga taaactcaat gttcatcttt aacaagaaac 3840 gcgatggaac acataaagct agatttgttg caagaggaga cattcaacat cctgatacgt 3900 acgactcaga aatgcaatct aatactgtac atcattatgc attgatgaca tccctgtcac 3960 ttgcattaga caatgaccac tacgtcacac aactagacat atcttcggcg tacttatacg 4020 cagatatcaa agaagaatta tacataagac ctccgccgca tttgggtatg aacaacaagt 4080 taatacggtt gagaaaatcg ctttacggct tgaaacaaag tggtgcaaac tggtatgaaa 4140 ctatcaagtc atatctggta gacaaatgtg gtatggaaga agtacgtgga tggtcatgcg 4200 tgtttaaaaa cagtcaagta acaatctgtt tattcgttga tgacatgata ctgttcagca 4260 aagacttgaa ctcaaacaaa aggatcatag aaaaactaaa gagacaatat gacactaaga 4320 tcataaacct aggtgatagt gatgatgaaa tccaatatga catcctggga ctagaaatca 
4380 aataccagag aggcaagtac atgaaactag gtatggaaaa ttcattaact gagaaattac 4440 caaacctaaa cgtacctttg aacccgaaag gtaggaaact tgccgctcca gggcaacccg 4500 gtctttacat agaccaagaa gagttagaac tggaagaaga tgattacaaa atgaaggtgc 4560 acgaaatgca aaagctaatt ggcctggcat cgtacgttgg atacaagttc agattcgacc 4620 tactgtacta catcaacact cttgcacaac atatattatt tccgtccaaa caagttctag 4680 acatgacata tgaattaata caattcatat gggataccag agatcgacaa ctaatatggt 4740 acaaaaacag atcttcgaaa ccagaaaaca aattagttgt aataagcgat gcttcgtacg 4800 gaaatcaacc gtattataaa tcccagattg gtaacatatt cctactcaat ggaaaagtca 4860 ttggaggaaa atcaacaaaa gcctcactaa cgtgtacttc aaccacagaa gcagaaatac 4920 acgcagtcag tgaagctgtt cctttactaa acaatctaag ttatctcgta caggaactag 4980 ataagaaacc aatcatcaaa tgtttactta ctgacagcag atctacaatc agtatagtta 5040 catccaaaaa tgaagagaaa tttaggaaca gattcttcgg cactaaagca atgagactca 5100 gagatgaagt atcggacaat aatctgtact tatgctacat tgagaccaag aagaacattg 5160 ctgatatatt gacaaaacct cttccgataa aaacgtttaa gttgttaaca aacaaatgga 5220 ttcattagat ctattacatt atgggtggta 5250 // ID Gypsy-2_CCO-LTR repbase; DNA; FNG; 333 BP. XX AC AACS02000013; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_CCO_; KW Gypsy-2_CCO-I; Gypsy-2_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-333 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000013; Positions 130773 130441. 
XX SQ Sequence 333 BP; 92 A; 84 C; 64 G; 93 T; 0 other; tgtcatgaat tagacctttc cttcctttcc cgccgagact tgtagctagt aacataggat 60 gtcatgtgac tcagacacgt gatgagtcat gagaccatac cccgccagct gaacaaaaca 120 ctgcgcgcgc acagaccatg gaccaaacaa gtacatacaa ctcaagtagt atatatagag 180 tagacttgta agcgtagtta cctcagtctt tagctggtct attcctagct acaacactct 240 agtcttcact gaatcctatc gctttttcgc caagtctttt ctagtgcagc tattgattgc 300 ttgaacgacc gatcaagtag ttagtcgctg aca 333 // ID Copia-2_PPM-LTR repbase; DNA; FNG; 369 BP. XX AC ABWF01005750; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_PPM_; KW Copia-2_PPM-I; Copia-2_PPM-LTR. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-369 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01005750; Positions 42376 42008. XX SQ Sequence 369 BP; 83 A; 91 C; 83 G; 112 T; 0 other; tgacgttcaa gctctgccgc aggcggatag aaaggccttc cgcacagacg ctcgttcgta 60 cgagatgtct ggcggtgatg cgcctgtggc gcagtccgga cattgtttgg acgttgtttg 120 gctcactata cagttcctta ttttcgtacc gtagaactct tgctctcgcg cgtaggctga 180 cttcagttca tcccttatgg cctactgaag tcagtgagta cctttagaca tatcaacagt 240 atcaatatac taatgtatcg tagatatact atttgtacag gtgcatacct atctgagttc 300 gtcccttacg gcctactgaa atatactatt tgtacagtac agcgctctcg tcccacgtta 360 ggtttaaca 369 // ID Gypsy-4_MLP-I repbase; DNA; FNG; 5584 BP. XX AC AECX01002004; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_MLP_; KW Gypsy-4_MLP-LTR; Gypsy-4_MLP-I. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5584 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002004; Positions 54903 60486. XX CC Positions [4388-4867] - Integrase core CC 'CATCG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 238..1392 FT /product="Gypsy-4_MLP-I_1p" FT /translation="MDALNANIASILSQLTNLNARLDEETARHRETQKKLD FT EESAHRLSTQEQLDTLINQLNNSSSSNTTPAAQHTTNPALNPAISQPTLPV FT TSETQPPLTASDVKILMKSNRSPKVGTPDKFDGSKGDAAETFVNQVGLYLL FT ANEESFPNDKTKIVFALSYLTGEANQWAAPYFKRLLHPNDEDELTFQAFAK FT AFEATYFDSDRQNRAQRDLRALQQNSTVADYTTRFMALAARTGWGETEHIS FT HYKIGLKQEIRVNMILKTFTSLTDITAFAIAINNELHPGITRLTPRNLPQI FT TTTTKDPDAMDLSTTKVWVSKEEMSRRAEKRLCYKCGKGRHRAADCGRKDE FT RGNWVRSGGNNFKVAELEAKITELESKGKSRAEESKNGDAWD" FT CDS 2450..5488 FT /product="Gypsy-4_MLP-I_2p" FT /translation="MFSKTKAQTLPPRRKYDFQVKLIPGATPQAGKIIPLS FT PAENEVLEKMVNEGLANGTIRRTTSPWAAPVLFTGKKDGNLRPCFDYRRLN FT ALTVKNRYPLPLTMELVDSLRGASKYTKLDIRNAYGNLRVKEGDEDVLAFI FT CKAGQFAPLTMPFGPTGAPGCFQYFIQDILLGHIGKDTAAFIDDIMIYTKD FT DVNHEDTVEDILKILSKYSLWLKPEKCEFSRSEVEYLGLIISKNQIRMDPA FT KVKAVADWPAPKNTSEVLRFLGFANFYRRFIEQFSRTARPLHDLTCRETPF FT SWTKERQESFDKLKTSFTTAPVLRIADPYQPFILECDCSDLALGAVLSQKA FT EDGEVHPVAYLSRSLIPAERNYEIFDKELLALVASFKEWRQYLEGNPHRLD FT VIVYTDHKNLESFMTTKQLTRRQARWAETLGCFDFVIKFRPGRSATQPDAL FT SRRPDLAPSSEEKLTVGQLLRPSNITEDTFAELDAFDAQFANEEEVEHVKA FT NEWFQCDIAGTKTDPENNLLPSDTELVQKIRELNVKDEKIQEIMNALINPI FT SSKIKEALNVYSVADGILYQDGRVVVPENDKIRAEILKTYHDSKLAGHPGR FT AKTLSLVKRNYTWTGQKAFVNRYVEGCISCQRVKPMLMKPFGSLEPLPIPA FT GPWTDISYDLITGLPTSNGKNSILTVIDRLTKMAHFLPCNDTDGAEQLADL FT MMKEVWKIHGTPKTIVSDRGLIFISKITSQLNERLGIKLQPSTAFHPRSDG FT QSEIANKVVEQYLRHFVSYHQDDWEPLLAPAEFAYNNNTHTSTGVSPFKAN FT YGYDLTLGPIPSEDQCVPAVEERLQRLAEVQLELQSCIEGAQEAMKHQFNK FT 
GIRDTPAWKVGEKVWLHSRNISTTRPSPKLDHRWLGPFSIVKRVSPSTYQL FT KLPLTMKGVHPVFHVSVLRKMEPDTIKSRQIDELPPIEIKGEEEWEVEAIL FT DSRKRYKTLEYLVSWKGYGKEHDLWEPYSNLTNAKEMVRDFDKRFPMAATK FT YKRSRRK" XX SQ Sequence 5584 BP; 1876 A; 1272 C; 1251 G; 1185 T; 0 other; tattgcagtg tctcaccaat cagcggtgga caagaatcga aactcgaaag aagaataaag 60 aaaggaatac gaatcaatta aattagaaac tcggaattag aaaggatcta ccaaaagttt 120 attttgacga agatcggaac aacggcgaac tttaccgctc ccaccgaaga acaaggttca 180 gaagaatcaa cagaagctgt atcggcgcaa gaatcaatct tcgacgaact cgacgagatg 240 gatgcattaa atgctaatat agctagtata ctatctcaac ttacaaatct gaacgcaaga 300 ttagatgagg agacagcccg tcacagggag actcaaaaga agttagatga agaaagtgcc 360 catagattat cgacgcaaga acaattagac accctaatca atcagcttaa taatagcagc 420 agttcaaaca ctaccccggc agctcagcac accaccaacc ccgcattaaa ccccgcaatt 480 tcccagccaa cgttacccgt aactagtgaa acccaaccgc ccctcacagc atccgatgtg 540 aagatactta tgaaatccaa ccgtagccca aaagtcggaa caccggataa gttcgatggc 600 tcaaaaggcg atgcagcgga aacttttgtg aatcaagtag gactgtacct tctagctaac 660 gaggaatcgt tcccgaatga caagacgaag atcgtattcg cactgtctta cctgacgggg 720 gaagcaaacc aatgggcggc accgtacttc aaacgcctac tacaccctaa cgatgaagac 780 gaacttacgt ttcaggcctt tgcaaaagct tttgaagcca cctacttcga ttccgaccgt 840 cagaaccgcg cgcaacgaga tctacgagca ctgcaacaga attcaacggt tgctgattat 900 accacgagat ttatggcact agcggctaga acgggctggg gtgaaacaga acatatcagt 960 cactacaaga tcgggttgaa acaagaaatc agagtaaaca tgattctcaa gacattcaca 1020 tcactgacag acatcactgc ttttgctatc gccatcaaca acgaacttca ccccggcatt 1080 acacgtttga caccccgtaa tctacctcaa atcacgacta ctaccaagga ccctgacgcg 1140 atggacctat caaccacgaa ggtatgggta tctaaggagg agatgtctag gagagctgag 1200 aagaggttat gttacaagtg tggcaaggga aggcatagag cagcagactg tggtagaaag 1260 gatgaacgtg gaaactgggt gagaagtggg ggtaacaact tcaaggtagc tgaattagaa 1320 gctaaaatca ctgaattaga aagtaaaggg aagagtagag cggaggaatc aaaaaatggc 1380 gatgcttggg actgaaggat gtgcctatcc cgagctctga ggaggaggtt attggtattg 1440 gagcagttac ttatttgaaa agaaatgcaa gcgatcctcg ctttttttta tcattaccat 1500 
tgtcccatac cagcccctct ctctgtgcca catcaaaacc atttgcctta tgtttactgg 1560 actgtggagc tacgcacgaa gcagtcagcg acaggtttgt cagaaaattc aatctcaaga 1620 cctcaaagct caatgtacca caaacggtaa gcgcctttga cggcaagtcc aagcaattga 1680 ctgaagaagc tcatctattc atcgaccaag acacaaaccc cacgcacttt atcgttactc 1740 aattgaagga caactacgac gcactattag gaatgccgtg gtttcgaaag catggtcaca 1800 agattgattg gtcaaaaggc acgtttgaca attcacctat ggaattcatt gcaaccgtcg 1860 aagcggtttc gtcacaaccg gaaaacccct tggagcccgc aagggaagct aggtctgttg 1920 acgagggggt atgtactgaa gctgaagtca gcaacagtac actaataccc ccgcaatgtg 1980 agtcatcttt ttcattacac aaagatcagt ttagaacggc tagcaacgct tcatctctct 2040 tagaacagta ttcccaaaac cgactagcac caacaacaat taaggatgac gaaccacgaa 2100 atgcagctgt cccgtcagct tcttccattc cgaaaacatc ctccgaaccc aaggaggcca 2160 agaggaaagc aaggaatagt gaagaggggg tatgtactga agctgaagtc agcaacagta 2220 cactacacta atacccccgc aatgtgagtc tgctactgga atttcatccc tagtgtctga 2280 aacagctcgc aagcgtgatt ctctccaaat tagtagtcaa ctaccagcca tatcagcagc 2340 gaaagcctct tggtcaacag cagcaagaat tgcggtcgaa gagaagaaga aaacagtcca 2400 agaaccagtc gagaaactag tcccaacaca ataccacaga tacctaggaa tgttcagcaa 2460 gaccaaggct cagacacttc caccgagacg caagtatgat tttcaggtaa aattaatccc 2520 aggagctaca ccgcaagcag gcaaaatcat cccgctttcc ccagccgaaa atgaagttct 2580 ggagaaaatg gtgaacgaag gcttagcaaa cggcacaatt agacggacga catcaccgtg 2640 ggcggctcca gtactattca cgggtaaaaa agatggcaac ctcaggcctt gttttgacta 2700 ccgtcgactg aacgctctga cagtaaagaa caggtaccca ctaccattaa ccatggagtt 2760 agtggatagt ctgagaggag catcaaagta cacaaagctg gacatcagaa acgcgtatgg 2820 aaatctgagg gtaaaagagg gtgatgaaga cgtgttggcg ttcatatgca aagcaggtca 2880 attcgcccct cttaccatgc cattcggacc tacaggcgcc ccaggttgtt ttcaatactt 2940 tatccaggac attctcttag gacatatagg aaaagacacc gccgcattca tcgacgacat 3000 tatgatctat acaaaagacg atgtgaatca cgaagacact gtagaagaca tcttgaaaat 3060 attaagtaaa tattctttat ggcttaaacc ggaaaagtgt gaattctctc gttcagaagt 3120 cgaatacctg ggtctcatca tctcaaagaa tcaaatacga atggacccag caaaggtcaa 3180 agcagtcgca 
gactggcctg ctccaaagaa tacgtccgag gtattgcggt tcttaggatt 3240 cgccaacttt tatcggaggt tcatcgagca attctcaaga acagcaagac cattgcacga 3300 cctcacgtgt agagaaacac ctttttcatg gacaaaggag agacaggagt catttgacaa 3360 actaaaaact tcattcacta ccgcaccagt ccttcggata gctgacccct accaaccttt 3420 catcctagag tgcgattgct cagatttggc cctaggagca gtactgtcgc agaaggctga 3480 agacggcgaa gttcatccgg tagcatacct atcacgatca ctcataccag cagagcgcaa 3540 ttacgaaatc tttgataagg agctattagc cttggtagca tcattcaaag agtggcgcca 3600 atatttggaa gggaaccccc atagacttga tgtaatagta tatactgatc ataaaaacct 3660 ggaaagtttc atgacgacca agcaactgac acgtagacaa gctaggtggg cagaaacact 3720 tgggtgtttt gactttgtaa tcaaattcag gccgggaagg agcgcaactc aaccggacgc 3780 actgtctcga agaccggact tggcgccatc atcagaagaa aaactgactg tcggacaact 3840 cttacgacct agcaacatca cagaggatac ctttgcagaa ttagacgcgt tcgacgccca 3900 atttgcaaac gaggaggaag tggaacacgt gaaagctaat gaatggttcc aatgtgatat 3960 tgcaggcacc aaaactgatc cagagaacaa ccttttacca tcggacacgg aactagtaca 4020 aaagataaga gagcttaatg tcaaggacga aaaaattcaa gaaatcatga atgcattaat 4080 aaacccgata tcatccaaaa tcaaggaggc tctgaacgtg tactcagtgg cagacggtat 4140 cctttatcag gacggtaggg tggtggttcc ggaaaacgac aaaatcagag cagaaatact 4200 gaaaacttat cacgacagta aactggcagg ccacccagga agagctaaaa cccttagcct 4260 cgtcaaacgc aattacacct ggacaggaca aaaagcattc gtaaacagat atgtggaggg 4320 gtgtatttca tgtcaaagag tgaaaccaat gctaatgaaa cctttcggat cgctagagcc 4380 attaccaatc cctgcaggac cctggacaga cattagttac gatctcatca cgggattacc 4440 aacgtcgaac gggaagaata gtatcttaac agtcatcgac cgattaacca aaatggcgca 4500 tttcctacca tgtaacgaca ctgatggagc ggaacagcta gcggatctaa tgatgaaaga 4560 agtatggaag atacacggaa cgccaaagac catagtatcg gatagaggtt tgatattcat 4620 ctctaagatc acatctcaat taaacgaaag acttggcatc aaattacaac cgtcaacagc 4680 gtttcacccc cgatcagatg gtcaatcaga aatcgccaat aaggtagtgg aacaatatct 4740 gcggcacttt gtatcatatc atcaagatga ttgggaacca ttattggccc cggcggagtt 4800 tgcgtataac aacaacactc acacatcaac aggagtatcc ccatttaaag ctaactacgg 4860 gtacgattta actctaggac 
caatcccatc cgaggaccag tgcgtgccgg cagttgagga 4920 acgattacaa cgcttagcag aggtacaact tgaactacaa tcatgtatag aaggcgctca 4980 agaagcaatg aaacatcaat tcaataaggg aatacgagat acaccggcgt ggaaagtagg 5040 agagaaagtt tggctccaca gcagaaacat atcgacgaca aggccaagcc ccaagttgga 5100 ccataggtgg ctaggaccct ttagtatagt caaaagagtt tccccttcta cgtaccaatt 5160 aaagttacca ttgacaatga aaggagttca ccccgtgttt cacgtttcag tattacgaaa 5220 aatggaaccc gatacaatca agtcaagaca aatagacgag ttgccaccaa ttgaaattaa 5280 aggagaagaa gagtgggagg tcgaagcaat actggacagc agaaaaagat acaagacact 5340 ggagtacctt gtaagctgga aaggatacgg taaagaacat gacttgtggg aaccatattc 5400 aaatctaacc aacgcaaaag aaatggtaag agattttgat aagagatttc caatggcggc 5460 tacaaaatac aaaaggtcaa ggagaaagtg agcgggtcaa gctttttccc actgggtttt 5520 ttaatgctga cccagggatg tatgcagggc tgcaagagga gtctgggcat aaagaggggg 5580 atag 5584 // ID Gypsy-72_MLP-I repbase; DNA; FNG; 5650 BP. XX AC AECX01001226; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-72_MLP_; KW Gypsy-72_MLP-LTR; Gypsy-72_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5650 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001226; Positions 63413 57764. XX CC Positions [4452-4931] - Integrase core CC 'CAAGA' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(1806..4304,4308..5552) FT /product="Gypsy-72_MLP-I_1p" FT /translation="MPWIKKNYQLIDWKNSKFKTEELTIAAANAVSFKPKQ FT ASRSPEELPKRHARNFNEGVESTCSFTPPQCEFDLTLSSKIEEAAGKPAHL FT LENLPEIESIRKEEPAAVITTVSSKPTTTSKTQPGVEPERDTRQSDEGVKS FT SNGFLKPPQSTCTPLISNRRKEQVNKHSHSILQMNFKKLRGSSPRSNIILD FT QIRRSQQPSPTMIHALKTLWNLSAKIAADKAKETPPKSAKELVPECYHEYL FT HMFEKANSDVLPPHRPYDFRVDLVPGATPQAGRVIPLSPKESEVLNEMITK FT GLANGTIRRTTSPWAAPVLFTEKKDGNLRPCFDYRKLNAVTVNNKYPLPLT FT MELVDSLLNADKFTSLDMRNGYNNLRVREGDEAKLAFICKAGQFEPLTMPF FT GPTGAPGFFQFFIQDILKNHIGRNVAAYQDVILIYTGPGEDHKTVVKEVLN FT ILKAQNVWLKPEKCKFSKKEISYLGLIISKNQIKMDASKVKAVKDWPVPKN FT LSDTQTFLGFANFYRRFINQFSKVARPLNELSKKDVEFTWNEERNQAFEAL FT KTAFTTAPVLKIADPYKPFVLECDCSDYALGAVLSQVSNDDGELHPIAYLS FT RSLIQAERNYEIFDKELLAVVASFKEWRQYLEGNPHRLKVIVYTDHKNLQS FT LMTTKELTRRQARWAETLGTFDFEISFQPGKQSTKPDALLRRPDFAPAEGT FT KLTFGQLLKPENLPADAFLNELEIAEMWFENDDLFEESEEDRLNEEEQMEP FT NRQILRDSDLIELIKEKAGGNEHIKELIRLCEEMPNSKHLQGYQVENGILY FT FKNKIVVPCDTDLKVQILRSRHDSKLAGHPGMRTLALIKRAYHWPSMKAFV FT NKYVDGCQSCQRVKARTEKPFGSLQPLPIPEGPWLDICYDLITDLPESGGY FT DSILTVVDRLTKMAHFIACKKSMTSKELADLMIREVWRLHGTPRTITSDRG FT NIFISRITKDFHKQLGIKTQSSTAYHPQTDGQSEITNKAVELFIQHFTSYK FT QDDWFDLLPFAEFSYNNNEHVAIGISPFKANYGFDVNFTDVPASTQCVPLV FT ENRLEQIKMVQRELKDTMNLTQEEMKKQHDRKVNNTPSWEKGDKVWLNSKH FT MSMSRPTVKFSHRWMGPFVISARVSTNAYRLELPKFMSKIHPVFHVNLLRK FT YDPSSIKGQEATPAPAVIINENEEFEVNEVLDKRKKGKNVEYLINWKGYGE FT EHDSWEPSSNLQNAQELVNEFNQKYPQAEVRYRRSRRK" XX SQ Sequence 5650 BP; 1968 A; 1119 C; 1236 G; 1327 T; 0 other; tattgctaag tcttagaaac actcaggatc caagagaaga aagaaaacga agaaagaaga 60 aatttaaagt tgcaaagttt aaagacaaga agaagaaaaa aaaaagaaag aaacacgaag 120 aataagaaga tttaaagctt aaaagaaaga aaaagaaggt ttcaaaaaaa gaaaacagta 180 gatcaaagtc aattaaagta gtgacaagtt cagattcgag atctccccgc atcatacagc 240 aagcctgcat caaactttga attgatacac aaaccttatt ccctcgaatc ccctcgatta 300 accacccaac acgccggaat tcggtttacc cgttagtcct agtagttcag gacctagtga 360 aaggtttgaa gacgtagaag atactataat ggaggttaat gtagctgaat 
tgatgaaaag 420 aatggacgag atgaatgtga aattagaaga agaaagcagg ttaagacaag aagcagaaag 480 aagatggttt gagttgaaag aaagtgttga aaagaataat tccaatgcac cattacaacc 540 tacaaccgct ccaactccaa acccgaatca actacaatcc caagtcaaac ctcccaaaat 600 cgcgacaccc aacaagtttg atggagcaaa aggccaaaag gccgaagtat ttgtgaacca 660 gataagctta tatatgcaga tgaatgcctc aagtttcata aatgagcaag ctcaagttgc 720 ttttgcatta tcttacatgg atgggaaggc tagtctatgg ggtcaatctt taactgatca 780 acttctggac tcggaaagaa tgcgactggt aacttggaag aagttcatag aatccttcaa 840 agccacattc tttgatacag aaagaatcac caaggccgag agagagatac gagcgttacg 900 tcaaactcga tcggttaccg attactggat aaagttctcc gagttgtctc taattgttaa 960 atggcccgat actgtactca tctctcaatt caagcagggg ttaaaaggag aaattagggt 1020 tcatatggtt agagatgttt ttgaggaagt agaggagatg gctaaattag caatcaagat 1080 cgataatgaa gttaacgaac gtaactcaga attgactcac cagacttgaa cctctgcatc 1140 aattcctata acatcatcga cttctcttga ccccgatgcg atggactgct cagcttatag 1200 gttgaacatc acgagtgagg agtataagca aagaggtgca attggagctt gttatcattg 1260 tggtaaagtt gatcattaca ttggagattg tcctgataaa agaagacgaa gtaatagggg 1320 aggttggcga ggaagaggta gaggaggatt tagatcaaag tttgctgagg ttgatagtgt 1380 aaaggatgag ttgaagagtg aaggaaggtc tgaagaatca aaaaatggag atgctcggga 1440 gtgaaggttg tgcctccccc gagcggtaat gatttatcat cagaaaatag tataattaag 1500 cacttgaaat aaaagataca agaattattg aaagcattac cattttcaat gttaaaagtg 1560 ccacaaccat aaacgcgaga gccctgatca acagcggagc tactcatgaa gctatcagta 1620 agaccttcgt agtaaaaaat caacttcaaa ccaagccttt aacccaagtc agaagtgtta 1680 cgggcttcag tggacatgaa tccaaaatca cgcacactgg tgactttcat gtgaactcct 1740 gtgatacacc accaactacc tttattgtaa ccgacctaag agacaagtat gacattattt 1800 taggcatgcc atggatcaag aagaactatc aattgattga ttggaagaac agcaaattca 1860 agactgaaga acttaccatt gcggctgcca atgcagtgtc gttcaaaccg aaacaagcct 1920 cgaggagtcc tgaggaactg ccaaagaggc acgctaggaa tttcaacgag ggggtggagt 1980 ctacgtgctc attcacaccc ccgcaatgtg agttcgattt gactttatca tccaagattg 2040 aagaagcagc tggcaagcca gcacaccttc tagaaaattt accagaaatc gagtcgataa 2100 
ggaaagaaga acctgcagct gtgataacca cagtttcgtc caagccaaca accacctcga 2160 agacccaacc tggagtggag cctgagaggg atactaggca aagtgacgag ggggttaagt 2220 cgagcaacgg ctttctaaaa cccccgcaga gtacgtgtac acctttaatt tccaacagac 2280 gtaaagaaca agtgaacaag cattctcact ccatattaca gatgaacttc aagaaactgc 2340 gcgggagctc accacgatca aacatcattc tggatcaaat tcgacgatcc cagcaaccaa 2400 gcccaacaat gatacatgca ctcaagactt tgtggaatct ctcagcaaag atcgctgcgg 2460 ataaagctaa ggagactccg ccgaaatcag caaaggaact tgtgcctgaa tgctatcatg 2520 agtatttaca tatgttcgag aaggcaaatt cagatgtgtt acctcctcac aggccatacg 2580 acttccgggt agaccttgtt ccaggagcaa ctcctcaggc cggacgagta attcccctat 2640 cgccaaaaga aagtgaagtc ttaaatgaaa tgataactaa gggtttggca aatggcacta 2700 taagacgcac tacatctcct tgggcggctc cagttctatt caccgaaaaa aaggacggca 2760 atctaagacc ttgctttgat tatagaaagc tgaatgctgt aactgttaat aacaaatacc 2820 ctcttccctt aacaatggaa ctagttgata gcttactcaa tgctgacaaa ttcaccagct 2880 tggacatgcg gaacggatat aacaatcttc gggtacgaga aggagacgaa gcaaaattag 2940 cattcatctg taaggctgga caatttgaac cacttacaat gccttttggt ccaacaggtg 3000 ctccgggctt ctttcaattt tttatacaag atatactgaa gaatcacata ggacgcaatg 3060 ttgcggcata tcaggatgtt atactcattt atactggccc tggagaagat cataagacag 3120 tggtcaaaga agtattgaac atcctgaagg ctcaaaatgt ttggcttaaa cccgaaaagt 3180 gcaagttctc aaagaaagaa atctcttact taggtttaat catttcaaaa aatcagatta 3240 aaatggatgc aagcaaggtg aaagcggtca aggattggcc agttccaaag aacctatccg 3300 acactcagac ctttttagga ttcgccaatt tttacagaag atttataaat caattttcca 3360 aagttgcacg accgttgaat gaattgtcca agaaagatgt tgaatttaca tggaatgagg 3420 aacgaaatca agcgtttgaa gcactcaaga cggccttcac cacggcaccg gtactcaaaa 3480 tagcagaccc atacaaacca tttgtattgg aatgcgactg ctcggattac gcattaggag 3540 ctgttctctc acaagtatct aatgacgatg gcgaacttca ccccattgct tacttatcac 3600 gatccttgat tcaagcagag cgcaactatg agatcttcga taaagaactc ttggcggtag 3660 tggcgtcctt caaggaatgg cgtcagtacc ttgaaggtaa cccacaccgg ctcaaggtta 3720 tagtgtacac tgatcacaaa aacttgcaat ccctcatgac aaccaaggaa cttacaagac 3780 gacaggcgcg 
ctgggccgaa actctgggca cgttcgactt tgagataagt tttcaacccg 3840 gtaaacagtc aactaagcca gatgcattgt tgcgcaggcc agactttgca cctgctgaag 3900 gaaccaaact gacatttgga cagttattaa aacccgaaaa tttaccggct gacgcattct 3960 tgaatgagct agaaattgct gaaatgtggt ttgaaaatga tgatctattt gaagagagtg 4020 aagaagatag actaaacgaa gaagagcaaa tggaaccaaa ccgtcaaata ttacgagatt 4080 cagatttaat cgaactaatc aaagagaaag caggaggcaa tgaacacatc aaggaattga 4140 tcaggctgtg cgaagaaatg ccaaactcga aacatctaca aggataccaa gtggagaatg 4200 gaattttata tttcaaaaac aaaatcgttg taccatgtga tactgatcta aaggttcaaa 4260 tcttgcgttc tagacatgac agcaagctgg ctgggcaccc aggatgaatg agaacgctag 4320 ctctgattaa acgagcctat cactggccgt cgatgaaggc ctttgtcaac aaatatgtgg 4380 acggttgtca atcatgccaa agagtgaagg ctcgaaccga gaaaccgttt ggatcattgc 4440 aaccattgcc tattcctgaa ggaccgtggc tcgatatctg ttatgacttg attacggact 4500 tgccggaatc aggagggtat gatagtattc tcaccgtggt cgaccgacta acaaaaatgg 4560 ctcatttcat agcctgcaag aaaagcatga cctcaaaaga actagctgac ttgatgattc 4620 gagaagtttg gagactacat ggcaccccaa gaacgataac gtcggatcga ggaaacattt 4680 tcatttccag aataacaaaa gacttccata aacaattggg gatcaaaacg cagtcttcaa 4740 ccgcttatca tccgcaaact gacgggcagt cggaaatcac aaacaaggcg gtggaactgt 4800 ttattcaaca cttcacctct tataagcaag atgactggtt tgacctgcta ccatttgcag 4860 agttttcata taacaataat gaacatgttg caataggaat atcaccattc aaggctaact 4920 atgggtttga tgtgaatttc acagatgttc ctgcgagcac gcaatgtgta ccactagtag 4980 aaaaccgatt ggagcagatc aagatggtgc aaagagaatt aaaggatacg atgaacttaa 5040 cgcaagaaga gatgaagaaa caacatgacc gaaaagtaaa caacacaccg tcgtgggaga 5100 agggagacaa agtttggctg aatagcaaac acatgtcaat gtcaagaccc actgttaagt 5160 tttcacatcg ttggatggga ccatttgtaa tttccgcacg agtttcaact aacgcttata 5220 gacttgaact acccaaattc atgagcaaaa tacatcctgt ctttcatgtt aacttattac 5280 ggaagtatga tccaagctca attaaaggac aagaagcaac cccggcacct gcagtaataa 5340 tcaatgaaaa tgaagaattc gaagttaatg aagtattgga taagagaaag aaaggcaaga 5400 atgtagaata tttaattaac tggaaagggt acggagaaga acacgattcg tgggaaccgt 5460 caagcaacct tcaaaatgct 
caagaactag tgaatgaatt taatcaaaaa tatcctcaag 5520 ctgaagtaag atatagaagg tcaaggagaa agtagagagg gtgaggcttt ttcccaatgg 5580 gttttttaat gccaacccgt ggaaagatgc tggcctgcaa gaggaggtcg agacattaaa 5640 gggggagtgg 5650 // ID Copia-57_MLP-I repbase; DNA; FNG; 4270 BP. XX AC AECX01000342; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-57_MLP_; KW Copia-57_MLP-LTR; Copia-57_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4270 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000342; Positions 126100 121831. XX CC Positions [1528-2028] - Integrase core CC LTRs are 92% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(79..2163,2167..3150) FT /product="Copia-57_MLP-I_1p" FT /translation="MGHHDSGTKHSTIEQLNDDNFSQWRVNMYGWMLEHSL FT GIFVSTPHPTIPNQRNRAFGVIIQRLDQASNARFLNEENTLNPNLLWDAIL FT EHYQSTDAASQSRVFTTFLRISFSTLSQFIKDIRQGMKALVDSDVTNVIGD FT VLLAEIIVHKLPESLSTFKELQYSKRPLTTSAVLVALDNHHADDRENNQPD FT VAAAMSSRRSNAVNRNVASSNRNNNNNRRPICSNGTHNPLTKHHQDNCWTL FT FPDKRPAHLRQQASQANLPVVNSTRYVDDNVFSVQNASAAYVHATTSAYRS FT RALAVNTVPNNFAKLLDSGCSDHMTSNIEDFSTYVPKWSTVSLADGSIIQI FT IGEGLVHASSNGSITSFHAYHVPKVNGTLLSLGKLMLEGCSLHSKEDSFVV FT KKDDSPILYGSICDGVLELDLNLGKLALPTIIASSARLLTSYDTLHRLAGH FT TNPDRLRILFHCSPPPEWTCEACILSKGHRLPYKMSLPVSTTPLQFVHGDL FT SGKIAVTSLGGASYYFKLTDACTSYKYIYPLKLKSETFARFLEFKSEVETF FT HERKIISLVNDRGGEYMSRDFLKCLKENGTTMHLSASYTPQQNSVAERGNQ FT TTSEKARALLKQADLPASYWAEAVTTAVFLENVTPMEKHGNKTPYKLWHQR FT PFDYSCLHPFGCRCYVLIPKHLRDGKFGDTTAKGILLGFQIGMHNFVQREN FT GSIIYSHDVTFNDDVFPGAGKSHDSSFVINDEEESFQIDTHSNSDLDQTPP FT VSPHVPTTPFNSPELNPPDDSRALICCQRTPKEAPIAGYNGDGDLVMWHPL FT PSPPLPSVPTKHSWDWQLASVPAIQDISSSINESNILPEGSKRHRIQAARK FT IQRHRVMTARRILRARSARLANIPFPTSLKEALLREDASDWLNAIQAEMKG FT MEDLKVWDIVNLPEGDHAIGTTWVFEKKFGAEGKLIKYKARLCAQGFSQRP FT EDFGDTFAPTGRLLSLRALMDIAAVQDLDVHVMDVKLAFLNGVPKETIYLR FT IPKGTNSQELLRSQSSN" XX SQ Sequence 4270 BP; 1223 A; 883 C; 803 G; 1361 T; 0 other; attggttatg agcccagtgc ttacagattt ttaaactcaa ccttttaaat caacatttaa 60 atcgaccgat acgttgtcat gggtcatcat gacagtggaa ctaaacattc caccatcgag 120 cagttaaacg atgataattt ttcgcaatgg cgtgttaata tgtatggctg gatgttggag 180 cattctcttg gaatttttgt ttccactcct catccaacta ttcccaatca acgtaatcgt 240 gcttttgggg tcattattca acgtctagat caagctagta atgctaggtt tttaaatgaa 300 gagaacactt tgaatcctaa tcttttatgg gatgctattc ttgagcatta tcaatcaact 360 gatgcggcta gtcaaagtcg tgtttttaca acttttcttc gaatctcttt ctctacttta 420 tctcaattca taaaagatat aagacaaggg atgaaagctc ttgttgattc tgatgttacg 480 aatgtcattg gtgatgtgct attagctgaa atcatagttc ataaacttcc tgaatcttta 540 tctacgttca aagaacttca atactctaaa agacctttaa ccacttctgc tgtccttgtt 600 gctcttgata 
atcaccacgc cgacgatcgc gaaaacaatc aacctgatgt tgctgctgct 660 atgtcttctc gtcgttcaaa tgctgttaat cgtaatgttg cttcctctaa tcgcaataac 720 aataacaaca gacgacctat ttgttcaaac ggtactcaca atcctcttac caaacatcat 780 caagacaatt gttggactct ttttcccgac aaacgaccag cgcatcttcg tcaacaagct 840 tctcaagcaa accttcctgt cgttaattct actcgatatg ttgatgacaa cgttttctcc 900 gttcaaaacg cttctgccgc ttacgttcat gctactactt cagcttatcg ttcacgtgct 960 ttagctgtca ataccgttcc gaataacttc gctaaacttc ttgacagcgg ctgttccgac 1020 cacatgactt caaacattga agatttctct acttatgttc ctaaatggtc aactgtttct 1080 ctcgcagatg gaagtatcat tcaaattatt ggtgaaggac ttgttcatgc atcaagcaat 1140 ggttcaatca cttcttttca tgcttatcac gttcctaaag tcaatggtac cttacttagt 1200 ctcggtaaat taatgcttga aggttgctcc ttacatagca aagaggactc ttttgttgtt 1260 aaaaaagacg attctcccat tttatatgga tctatttgtg acggtgtact cgagctcgat 1320 ctcaatctcg gtaagcttgc tttaccaacc atcatagcct catctgctcg attgctgaca 1380 tcttacgaca cgcttcatag acttgccggt catacaaacc ctgatcgtct cagaatcttg 1440 ttccattgtt ctccccctcc tgaatggacc tgtgaagctt gtatcttgtc aaaaggccat 1500 cgattgccct ataaaatgtc tcttcctgtt tcaactactc ctcttcagtt tgtacacggc 1560 gaccttagtg gcaaaatcgc cgtaacttct ctaggtggtg cctcttacta ttttaaactt 1620 accgatgcat gtacttctta caaatatatt tatcccctca aattgaaatc tgaaactttc 1680 gcacgttttc ttgaatttaa aagtgaagtt gaaacgtttc atgagagaaa aattatttca 1740 ttagtaaatg atagaggagg cgagtacatg tcacgtgatt ttcttaagtg tcttaaggag 1800 aatgggacga ccatgcatct ttccgcttct tatactccgc aacaaaactc cgttgcagag 1860 cgaggcaatc aaactacgtc tgaaaaggcc agagctcttt taaaacaagc cgatcttcca 1920 gcttcatatt gggctgaagc tgttacaact gcagttttcc tagagaatgt cactccaatg 1980 gagaaacatg gcaacaaaac tccatacaaa ttatggcatc aaaggccttt tgattattct 2040 tgtttacatc cttttggttg tcgatgttac gttttgatac ccaaacatct acgtgatgga 2100 aaattcggag atacaactgc gaaaggtatc ctcttaggtt ttcaaatcgg catgcataac 2160 ttttgagtac aacgggaaaa tggctctata atttatagtc atgatgtgac ctttaatgac 2220 gatgtttttc ctggtgctgg taaatctcat gattcttctt ttgtaatcaa tgatgaagaa 2280 gaatcttttc aaatcgatac 
tcattcaaat tcagatcttg atcaaactcc tcctgtttca 2340 cctcatgttc cgactactcc ctttaattct cctgaattaa atcctcctga tgatagtcga 2400 gcactcattt gttgtcaaag aactccaaag gaggcgccca tcgcaggtta caatggcgat 2460 ggtgatctgg ttatgtggca tcctctacca tcaccaccgc ttccctcggt tcctaccaag 2520 cattcctggg attggcaact tgcatccgtt cctgctattc aagacatttc cagcagcatt 2580 aacgaatcca acattcttcc cgaaggttca aaacggcacc gtattcaagc agctcgaaaa 2640 attcaacgac atcgagtcat gactgcacgt cgaattctgc gtgcgcgttc cgctcgcttg 2700 gctaatatcc cttttccaac gagtttaaaa gaggcattat tgcgtgagga tgcatcagat 2760 tggttgaatg caattcaagc tgaaatgaaa ggtatggaag accttaaagt gtgggatatt 2820 gtcaacttac ctgaaggtga tcatgctatt ggtacaacct gggtgtttga aaagaaattt 2880 ggagcagaag gcaaattaat taaatataaa gcgcgtctat gtgctcaagg gttctcacaa 2940 agaccagagg acttcggtga tacttttgca ccaaccggtc gacttctctc tttacgtgca 3000 cttatggaca ttgcagctgt tcaagatctc gacgttcatg tcatggacgt gaaactggct 3060 ttcctcaatg gtgttccaaa agaaacaatt tacttacgta ttcctaaagg taccaattcc 3120 caggaacttc tgagaagtca gtcctccaat taaataaatc aatttatggt ttaaaacaat 3180 cgccttgttg ttggcatgat gttattaaag cattcttcat tagcatcaat cttaaacaga 3240 ctccttctga tccttgtgtg tttgtgtcca atgacccaaa ttggattgtc tatgttcact 3300 tacatgttga tgatatgacg atagctagca acaatgtaca atgttttaaa gatttaatta 3360 atgagaaatt tagcatggaa gatttaggtg aagctaaatt gatacttggt atgaaagtta 3420 caagagatcg caaagccaag actatatctt tatctcaacc tcaatatatt ggcaacttac 3480 ttgaagaata tgatatgact gaagctaatg ccgttggtag tccaatgctt gataatactc 3540 accttgtacc cggtactgaa ggaagtcgca aagaatttat tgaatcaggt gaagattatc 3600 gacatgctgt gggaatgctc atgtacttat cccttgcaac tcgtcctgat ttggcttttg 3660 tagtttcaca attatctcag cattttgaaa ggccggatat ggttcattgg atggaattta 3720 agaggatctt acgttatcag aagggtactc aacattttgg tatagtttta ggtggtgata 3780 atttaaattt acaaacgtgg agtgattctg attttgctgg ttgtccttat acaagacgct 3840 caactactgg tatgattacg caagttggat ctgggtgtgt taattggtga gctaagaaac 3900 aagaaggggt agcaggttcc tcaactgaag cagagtatcg gtcggtgtat gaaggtggtc 3960 aagatttaag atggtttact caattgatga 
aggatattaa tcaaccatta ccagtcattc 4020 cgttattact agctgacaat cagggtacta ttggtttatc aaaaaatgct caatttcaaa 4080 acagaaccag acatgtcgac gtgaaatatc attggattcg cgagcatgta gatagagagc 4140 atttcaaaat taagtatgtt ccaaccgctg atatgctagt gatatgttca cgaaatcttt 4200 accactgtac aaacacaaag atatctgtag aagaattatt ctggtagatt tagagaaccg 4260 gggagggaat 4270 // ID Gypsy-4_TMe-I repbase; DNA; FNG; 6501 BP. XX AC CABJ01003198; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_TMe_; KW Gypsy-4_TMe-LTR; Gypsy-4_TMe-I. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-6501 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01003198; Positions 29936 36436. XX CC Positions [5337-5828] - Integrase core CC 'GTAC' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 289..1788 FT /product="Gypsy-4_TMe-I_1p" FT /translation="MSNFVKITPFNGGTTDFSESVDEYLDDIETAALSWDL FT SIAPGTTKATDRLKIRLFCQNLERDGDAWHWWYYVLPEASKKSYESIVKEF FT RERYGLRASEASSLFAVQNEMLSLSQVESEHIHDYVHRVEKLSGKVPKEMD FT SLFAIAFIKGMNDQEHRQWITFDLKNDTNFSFGKALSVVKFSFQEIGEPDP FT FRPQGKVNESDTMTLYKTLNVAQVNVLKKTDVPVQSIPLEVAQPMMIQEQF FT NTFMAAYEASVGRGSRMSSSQLLNRRLNTRITCFNCGQQGHYSNTCNRPPI FT PSNQQQQIREDIRRERELQEQDYRPQERYVPAPASGANTVAISPSSILPRP FT KQPALLVVAPIPVTCVRSCSVARNDLGKACAMLAKIPAVRTIFENALVEKR FT ARVEDDDEGSVYERGASKVPRRTQDQGENHPLRRSVRTTNNPRAGRYSDEM FT ENNLNEVEKMVEQARMAENEAIHVRRGEPAIPSAVKSIKAVHDFENGFQAF FT ERTD" FT CDS 1679..2587 FT /product="Gypsy-4_TMe-I_2p" FT /translation="MKPYMYGEENQQYHPLLKVSKPSMISRMGSRPLNVPI FT KWMEGQEPFTIGNALDGSNMNLSITLPQLLDCSPRLRRDLAELLRSSIPRV FT RKKKDAPDHNPVTRNVLHSSSKDPRKEIISEAASSTDENIECLYIEAWSGN FT YRIPDVLVDAGAMLDLISTSLVDELGLHRYPVSGLGMHLADDHLVILKHYV FT WVDILVAGVLTHIKAYEVAVCETYQLLLSRRWLKRVRAVEYHDNQILFIEG FT SDRTQRKVPGISMEKSQRRMNKIDTGSVIDVDDEEAEEAIETLLNELDHWE FT EGDEMFPISEN" FT CDS 5334..6395 FT /product="Gypsy-4_TMe-I_3p" FT /translation="MLQFQPMDMIGMDFVGPINPPCKATGNLYILIVIDYF FT SRFLWAVGVQKADQISTMKALLDHVIPSFGWPLSVYTDNGSHFTGAMISKM FT WTDHSVHQFTSAISHPQSVGLSECYVQMLMGRIRLCCIADASSRCWGLHIR FT DAVLSINTRYIRIHGYTPAEILLSFNPSVTRKSDEGLSNWLKRTQPAVPDL FT PITSEERINSYLDQRDEHGIEAGRKLAQAQEKINPRMTAGYKQPQPGDLVL FT LHDFQLAKDKGRKLEHRWSTPRILERVSKSGVSAHVRQLHDPPGMMKRFHF FT DDLLLYIPRTENYPMPEMSHVVTRGQGHGVEYVRGAMGAIEGVLTLGQRAF FT DISDIGGCLRQ" XX SQ Sequence 6501 BP; 1909 A; 1202 C; 1679 G; 1711 T; 0 other; ttggtgctct gaaacggatt tttttggtta ctacaccaga gaaggccttg tctcattgta 60 ttattgattg ttgacttttg ttgttggttc gaaatttatt tctaaattct ggaaataaat 120 acctcagcga attttagtcg ggctattgac tatagagagc agcttctgaa acgttaccga 180 gagcatactg cttggttttc attaccagaa tctcctcctc ttttcacagc tttagcccta 240 cccactccga atatacaggt ttgtaagccg aagttgcttc cgggcagaat gtcaaacttc 300 gttaaaatta ctccatttaa tggaggtact acggacttta gtgaaagtgt ggacgaatac 360 ctagatgata tcgagactgc tgcactatca 
tgggatctga gcattgcacc aggaactacc 420 aaggctactg atagattgaa gatcagattg ttttgtcaga acttggagcg agatggagat 480 gcctggcatt ggtggtatta tgttcttccg gaagcgtcaa agaagagcta tgagagcata 540 gttaaagagt ttagagaacg atacggatta cgggcatcag aggcatcatc tttatttgcc 600 gtacaaaatg agatgctttc actttctcaa gtcgagagcg aacatatcca cgattatgtt 660 catcgagttg agaagctctc tggcaaggtc cctaaagaga tggattctct ttttgccatt 720 gcgtttatta aagggatgaa tgaccaggaa cacagacagt ggatcacttt tgatcttaag 780 aatgatacaa atttctcttt tggaaaagcc ttatcagtag taaagttttc ctttcaagaa 840 attggagagc cagatccttt ccgccctcag ggaaaagtaa atgagtctga tacaatgaca 900 ctttataaga ctctgaatgt ggctcaggtg aatgtactta agaagacaga tgttccagtt 960 cagagcattc cattggaggt agctcagcca atgatgatac aagagcagtt caatacattt 1020 atggctgcct acgaagcaag tgtcgggcgg ggatcgcgga tgagttctag tcaactatta 1080 aaccgacgtt tgaatactcg aattacttgc tttaattgtg ggcagcaagg acactattca 1140 aatacttgca atagacctcc aattccttcc aatcaacagc agcaaatacg agaagatatc 1200 agacgtgaga gagaacttca ggaacaagat taccgaccgc aagaacgata cgtacctgct 1260 ccagcctctg gagcgaacac agttgctatc tccccaagtt cgattcttcc ccgtccgaag 1320 cagcctgccc tgctggttgt tgctcctata cctgtgactt gtgttcgttc ttgttctgta 1380 gccagaaatg atcttggtaa agcttgtgct atgttagcca aaattccagc tgtgcgaact 1440 attttcgaga atgctttggt ggaaaagaga gctagagtag aggatgatga tgaagggagt 1500 gtatacgaac gtggagcgtc caaagttccc agacggactc aggatcaagg agaaaaccat 1560 cctttgagaa gatctgtcag aactacgaac aatcctcggg caggaagata tagtgatgag 1620 atggaaaata atttgaatga agtagaaaag atggtagaac aagcaagaat ggcagagaat 1680 gaagccatac atgtacggag aggagaacca gcaataccat ccgctgttaa aagtatcaaa 1740 gccgtccatg atttcgagaa tgggttccag gcctttgaac gtaccgatta agtggatgga 1800 aggccaagag ccttttacaa ttggtaatgc tctcgatgga tccaatatga acttgagtat 1860 tacacttcct caacttcttg actgttctcc tcgtctccgt cgtgacttgg cagaactgtt 1920 acgttcttct attccgcgag taaggaagaa aaaggatgcg cctgaccaca atcctgttac 1980 caggaatgtg ttgcattcgt caagcaagga tccaagaaag gagatcatta gtgaagctgc 2040 atccagtaca gatgagaata ttgaatgtct gtatatagag gcttggagtg 
gcaattatcg 2100 tatacctgat gttcttgttg atgctggagc catgcttgac ttgatctcaa ctagcctagt 2160 tgatgagtta gggttacacc gttatcctgt gagcggattg ggaatgcatc tggcggatga 2220 tcacttggtg atattgaagc attatgtttg ggttgacatt ctagttgctg gagtgctcac 2280 tcatatcaag gcatatgagg ttgcggtatg cgaaacctac cagcttctcc tttcgagaag 2340 gtggcttaag cgtgttcggg cagtggagta tcacgataat cagatactat ttatcgaggg 2400 tagcgatcga acacagcgta aggttcccgg catttctatg gaaaagtcac agaggagaat 2460 gaacaagatt gatacaggat ctgtgataga cgtggatgat gaggaagcgg aagaggcgat 2520 tgaaacattg ttaaatgaac tcgaccattg ggaggaagga gacgaaatgt ttccgatttc 2580 ggaaaactaa aagcatcgct ctggcatcag ggtaaggaac gatgcttata tgaccctggg 2640 gcgaggaaaa acaatttgga tgaaaggatg aattactttt tggttggcgg agtgaggtgg 2700 cattttgtgg aaaggaggat gaaagggtgt caagaatatt ttctggggaa gaagcggaag 2760 ggaagaagag agtaggaatc agagtagtgc agggtattgg aaggaagaaa gaggagaagt 2820 gtaaaagaag aactgagcgt gagggaaaga gagggatagg aaagactcca ggagttttgc 2880 acacagctgc attaactcgg aacaaggcac agctatttcc gacaaatcgt aggccagtca 2940 gccatccaat cattactgag aaagaagtta ctcagccaga aatagatgag tggtttgtcg 3000 gaacaagtat tcacctggga gagcaactga ctattggaga acaaaagaaa gccaagcgga 3060 tgctgtatac gtggaaggat gtttttgaga ctgatcttct aagaatttga agaactgacc 3120 tgattcaaca tgcgatcatt ttaacccccg aagcgaaacc ctaccgagca aaaattccat 3180 tatacacgga ggaggagata gcgttttgtc accgtttgtt gccaaaaatg gaagaagccg 3240 gtttaatctt tcgctgtgat agtgagtggg gagcacgaac gaaatttcct tgaaagctac 3300 gagcagagtc attacccaag gaggcgagat tgcgaatggt gcacaatttt ataccgctga 3360 atcgagttac agaaaagtca caatatccat gtcctaggat tgagcagatt gtctatactg 3420 ttttgaagaa aggaaagcgc ttctttttca ctaccgatgc tgcaaactcc tactgggcaa 3480 ttccagttag agctggagat gagacaaaac tcggatttgt tacaccttat aggatgtact 3540 gttataatgt tatgggacaa ggacttacag gtggaactca tacatatagt cggttcagag 3600 acctggtatt tggtgctatt cccgaaggat acgaagacag ctcaggtcgt agagtggtgt 3660 tggaaggaat ggaatcgctg atcagggatc aaggcgaagt ggcatttgac ggaatgattg 3720 acgatagcta tggaagtgct actagcttct acgtaatgta tcgatttctc catacaaagt 
3780 tttttccaag atgtgtctga gggccgatat acttgaagga ctccaagtct catttctttt 3840 gtgactctct ggattttatt ggatttagtg cgggccccaa tggtcttcgc ccgtctctgc 3900 gcaaaagaga ggcaattctg gaatggccta ttcctagcac cttcgctgat gtagaagcct 3960 tttgttattt gacaccattc ctacgacggt ttattcctgg aagagcggag ctagtctgaa 4020 taatgaagta cgggattgac gataaaacct tggacaggaa acatacggat gcagcgcgac 4080 gacgaaaaga gatcgaaaag acctttcagt ggacagaaga gaaaaatgta gctttccagg 4140 cgataaaaca ggctatcgca aataatgcta tggcatcacc agatcctaag atccaatacc 4200 atttggctgt agatgctagt aagcgtggag ttgggggggt actatttcag ctggagggta 4260 tcatggccgg aacggaagct acgaattccc agtcccacca ggcagctgag aggattgtta 4320 tgttcatatc atttaaactt gcagatacag agacccggta ctcaaactct gagagagaag 4380 cactggcggt tattcgttgt ttggcagaag tccgatggat ggtgatggcc tcatcctatc 4440 cagttcttgt gtatacagat catgaagcct taaggacctt attgaccgga ttggataatg 4500 atactcactg tagaatagct aagtggcagg aaaagctagg ggaatatgag tttcgcttga 4560 ttcatcgacc ggctacaaca cattttatcg gcatcgcaga cagactttcc cgactaccaa 4620 ctcgtttgat gtggagacat acggctgagg atagtgaagg tttacggcca gcaattcata 4680 ttattgtgcc tgtgaagggt ctagctacca atgttccggt tacttcagaa attccaaagc 4740 tgttgcgtga gtgtggggag ttctggaaga tggggagttg tgggagtcag gaggtaaaga 4800 ggagaaggag agaccggaat tggatgaata aaggagtaag cgtagtacag gaagcggtag 4860 ccgaagagaa tgtcgaaact gatggatgat tgaaggaggc tgcaagggat atgaggagaa 4920 gacgatggaa gatttggcta gaatcaaaga tgtatggggc tgtcgtgaag gaaagactag 4980 atgaactgga ctttgagatt ataggagctg gagaggtgag attgggaaga aatgagagaa 5040 ggatgttaga gaggttgatg tgaaagtttg tattagtgga tgggcaagaa cctaagcttt 5100 tctttcggga aaagaatggg caattggcaa gttgtgtatt ggagaaggac gtgaaaaggg 5160 tgcttggaaa tttacatgaa ggtcatggac actttgcgac tagtgttact atcggccacg 5220 cacatggaaa ggtctactgg ccatctcgct caaaggatat agcacaatgg gttgcctctt 5280 gtgaagcctg tcagcgggtt accaaaattc agaaagccgg agttatacga cctatgctac 5340 agtttcagcc catggacatg attgggatgg attttgtggg acctattaat ccaccatgca 5400 aagcaacggg aaatctttat attcttatcg taattgacta cttttctcga tttctttggg 5460 
ctgttggagt acagaaggca gaccagatct caactatgaa agccttatta gatcatgtta 5520 ttccgagttt tggatggccg ctatccgttt acactgataa tggaagccat tttactggag 5580 cgatgatttc caaaatgtgg acagatcaca gtgtccatca gtttacttcc gcgatatcac 5640 atccccaatc agtgggactt agcgaatgct acgttcaaat gcttatggga cgtatcagac 5700 tttgctgtat tgctgatgct tcttctcgct gctggggatt acacatcaga gatgccgttc 5760 tcagtatcaa cactcggtac attcgcatac atggatacac tccggcggaa attcttctca 5820 gcttcaaccc ttcagtaacg cggaagtcag atgaaggctt gtccaattgg ctaaaacgaa 5880 cccaacctgc ggtaccagat ctaccgatca cctccgaaga acgaataaac tcatatctgg 5940 atcagcgaga tgaacacggg attgaggctg gaaggaagtt agctcaagca caggaaaaga 6000 tcaatccacg gatgacggcc ggatacaagc aaccacagcc aggagatcta gttcttcttc 6060 atgattttca actggcaaaa gacaaaggac gcaaactgga acatagatgg tcaacaccga 6120 gaatcttgga aagggtttca aagtcgggag tcagtgcaca tgtccggcaa ttacatgacc 6180 cacctggtat gatgaaacgg tttcactttg acgatcttct tctatatata ccgcggacag 6240 aaaactatcc tatgccagag atgtcgcatg tggtgactcg tggtcaggga catggagtag 6300 aatatgtacg aggagccatg ggggctattg agggagttct gacattaggg caaagagcgt 6360 ttgatatttc tgatattgga ggttgtctta ggcaatagcg caagcttttg gcagcagtat 6420 tgttttcttt ctggtttcat acaaattgtt taagaggaga ttagacatct gggtcgatgt 6480 cattgcaaaa atcggtgttt c 6501 // ID Gypsy-98_MLP-LTR repbase; DNA; FNG; 313 BP. XX AC AECX01000493; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-98_MLP_; KW Gypsy-98_MLP-I; Gypsy-98_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-313 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000493; Positions 32251 31939. 
XX
SQ   Sequence 313 BP; 88 A; 46 C; 51 G; 128 T; 0 other;
     tgtgatgaac gagttgcact tcgggttgtt atgcacatgt cgcgttctct gattagtaaa        60
     ggactatgat gtcataggat tacatgatct ttcttttctt attctattta tatcagctgt       120
     gaagttggtt attgtttctc ttttctttat cgaaacaatt tgttattatt gtcaccattt       180
     gttataagaa atacttttca tctgagattc ttaaatacaa cattaaggct taggaatact       240
     tttcatttga gattccggat tcttataatt gcgtaattat aaagaattct taaatacaag       300
     ccaggtcatt aca                                                          313
//
ID   Copia-28_MLP-I repbase; DNA; FNG; 5266 BP.
XX
AC   AECX01003121;
XX
DT   22-APR-2011 (Rel. 16.04, Created)
DT   22-APR-2011 (Rel. 16.04, Last updated, Version -1)
XX
DE   LTR retrotransposon from the Melampsora larici-populina genome:
DE   internal portion.
XX
KW   Copia; LTR Retrotransposon; Transposable Element; Copia-28_MLP_;
KW   Copia-28_MLP-LTR; Copia-28_MLP-I.
XX
OS   Melampsora larici-populina
OC   Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina;
OC   Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora.
XX
RN   [1]
RP   1-5266
RA   Jurka J. and Kohany O.;
RT   "LTR retrotransposons from the Melampsora larici-populina
RT   genome.";
RL   Direct Submission to RU (20-APR-2011).
XX
DR   Genome; AECX01003121; Positions 832 6097.
XX
CC   Positions [2530-3033] - Integrase core
CC   LTRs are 100% similar to each other.
XX FH Key Location/Qualifiers FT CDS 979..4995 FT /product="Copia-28_MLP-I_1p" FT /translation="MQALQRHFSRGGRTNQFALFSRLIHHQLDLNETDIIT FT HMSNIDAIISELESTGFTWTSDSIKGLLYQLRMPAEMTKEINKDLDNRYDD FT KKPNFKLDDVKNAIQIHLAREKTASETISINSLSSSIEAFSFKTPQRMQSQ FT NRQNTPNRFTTPLNNNHRSTQFQYAYRNTPMNADLKARWRRGPETITHKND FT ARLASEDKPLSIPSSAHPNIKAGIIQCFFCGQWGHTYRDNNFKACKTFTSG FT GDRTGAAYNDWRKVENGLYYSIDVLYPRTYKTPSVNINSIVPDQSPSSHLV FT HASAMSFNSEGIILPDNESDVPTEYLTEYLCDGGATDAVSNNRSLLTNYQP FT LSTPIPVHTAADDSNAVIVGKGRLPIVTEDGSTAMIDDVYYCPKATTTIIS FT PGALIEKGAKMSLDEKNNYKIKLKSRKVIYAVHKNRRWFINSRRRSNATGN FT TTSPSFGMRNTSNRSFITIMGTIVLVTYQMRRIKRLFKHSASYGLPPLRPC FT HEVACEDCLKCKSTRNRVLGSTNREPKLLDVIVTDVAGPFTPCMTGEKLMV FT TFRDVATGYSEIEIIKQKSEVPQKLMNIIKKWERQTGIKVKTVRSDRGGEY FT IGGTLEKWLKDEGINHEFSNPYEPEQNGNAERLNRTLGDMARTLLSYSRLP FT NHFWNYAYLTSAYLYNRLPNTLTDDKTPYELSHGKKPNLDIIRTFGSIAYV FT HINTNQRAPGKLEDRGRRCTMIGYAEGGKGWLFYDPASKTIIVSAVAKFPY FT EEIIKTSNEIPKKTAVTTSKSLNKIMNEDLPDKGQLKHILNALKLGDFTNE FT LLIDKQDVAAKHALDGNDYLKLLKPPTSYAEAMRSKEADRWKEACDAEMAM FT MDKMGVWKVVDKPSDLVPIALKWVFAYKKVDDEGNPTKFKSRLVAKGYNQR FT EGIDYTETFAPTATFAGLRIMLTIAAHQNWPVHSFDITSAYLHSAIDSPIY FT FSLPTGYMCEARKKNKILEAIKALYGTKQGARCGWKHFDSILRELGFTSSQ FT YDQSLYFYRRKDETVIIWLHSDDGGVTGTSEKLLLEIAEKLKERLLIKWET FT TLDQIVGVNVKREEDGSFTLSQPGLTKKIIKSFLPDERTAKTPMNVQKIPC FT SPNEEEERVDTERYLSAIGSLNYLSVATRPDLTYTVNYLARFSSDPRRQHW FT QAIEHVMRYLNTTGVKSLKIKPIKTKVETPIHTYVDANWGGEGARSSHGFI FT TYFLNCPIAWTSKRQTCVASSTCHAEYMALGTACRDAIWLRNLIEDLTGQG FT NVVNMHCDNTSAIHVSKDNSSNKRTRHTDREFYYINEQIYKGRIFLHWIDT FT KSQRADILTKPLGPTLHDQGLKHLRLK" XX SQ Sequence 5266 BP; 1771 A; 1112 C; 978 G; 1405 T; 0 other; ctttaggtta tgagcccgcg ttaccgatcc taacgaacgg ccgacctacg aatgaaaacg 60 agaagtagga attacgctac gaatccacgt tctccaccta aaagacgcaa aagaatatcg 120 aagaagaaaa caaaaactga acctaatctc gatttcccat tattacaaga cttacctcca 180 ttaccaccat ctccactatt cacacctctt gctgatcttc ccccactacc accgtctcca 240 ccttatacat ctgacttaca tccatcaacc ccacaaaaaa ggattgaaga tgttgtatta 300 acgacaacat ccaagcctgt atctcatttt cctgatctat 
ctacgttacc gccacttcca 360 aactctccta tcagtcctct ttctcttgaa cttaatacgc tatacgatac gaaccttcct 420 gaactctcct taacttctgc tcaccctatc gaacatacgt ccctacccat tgaaagttta 480 tctccgactt cgatcggaac tatatcggac ttagcctccg gatttgctct tctgaatttt 540 gattcaattg cgaacgattc acacgttacg agaacagtcg catcttaccc tacgccacta 600 cctgaaactc ccttctattg ctctaatacg atatttgaac ctgatccagt acgccctgtg 660 aaaccctccg acatgtcgac acctatgact cccgatcaat tcgcccttga cgaaatcaat 720 caatctatca agaacgttgg aaacaacatc aattttccac atcttgagaa gaatggatcg 780 aatttcgttg actggaagaa agacaccgta cgagcgatga aagctatgat ttgaatcaac 840 aattactggg acacccctca acctttaatt actttcattg atacttcacg agataaacta 900 gccaatttgg tcatatcgaa tactattcat cacgatttaa agaacgtaac tgatccgagc 960 aacaacgctc atgaagccat gcaagcttta caacgtcact tcagccgcgg tggtcgcacc 1020 aaccaatttg ccctttttag ccgtctgatt catcaccaac ttgacctgaa cgagactgat 1080 atcattactc acatgtccaa tatcgatgcc attatttctg agttagagtc tactggattt 1140 acttggacta gtgactcgat taaaggttta ttgtaccaac tacgtatgcc tgctgaaatg 1200 acaaaggaaa tcaacaaaga tttagataac agatatgatg acaagaaacc taatttcaaa 1260 cttgatgatg taaaaaatgc aattcaaatt cacttagcca gagaaaaaac cgcttcagaa 1320 actatatcga tcaacagttt atcttcatca atcgaagctt tttcctttaa aacccctcaa 1380 cgtatgcaat ctcaaaatcg gcaaaacacg cccaatagat ttacaacgcc actaaacaac 1440 aatcacagat caactcaatt ccaatatgct tatcgaaata ctccaatgaa tgccgatctc 1500 aaagctcgct ggagacgtgg acctgaaact attacccata agaacgacgc gagactagca 1560 agtgaggaca aaccattgtc gattccctct tcggctcacc caaacatcaa agcaggtatc 1620 attcaatgtt tcttctgtgg gcaatgggga catacgtacc gagacaacaa cttcaaagcg 1680 tgcaagacat ttacgagcgg aggtgacagg actggtgctg cttataatga ttggagaaaa 1740 gttgaaaatg gcttatatta tagcatagat gtactttatc caagaacata caaaactcca 1800 tcggtcaata tcaactcaat tgtaccagac caatctcctt catctcacct cgttcacgcc 1860 tcagctatgt ctttcaactc cgaaggcatc atcttacccg acaacgagag tgacgtgcca 1920 actgagtact tgactgaata tttatgtgat ggtggagcaa cggatgctgt aagcaataac 1980 cgttccttgc ttaccaatta tcaacctctt tctaccccaa ttcctgtaca cacagcagct 
2040 gatgattcca atgctgtgat tgttgggaaa ggtagattac caatcgtaac tgaagatggc 2100 agcacagcga tgattgatga tgtttattac tgtccaaagg caactacgac catcatatca 2160 cctggagcat taattgaaaa gggagcaaaa atgtcattgg atgagaagaa taactacaaa 2220 attaaattga aatcaaggaa ggttatctat gcagttcata agaacagaag gtggttcatc 2280 aattcacgta ggagatcaaa tgctactggt aatactacat ctccctcctt cggtatgcgc 2340 aatacaagca accgcagttt tatcacgatt atgggcacaa tcgttttggt cacgtatcaa 2400 atgagacgta ttaagcgact atttaagcat agtgcatcgt atggtttgcc accattgaga 2460 ccatgtcatg aggtagcttg tgaggattgt cttaaatgca agagcacacg caacagagtt 2520 ttaggatcca cgaatcgtga acccaaatta cttgatgtta ttgtcactga tgtggcagga 2580 cctttcactc catgtatgac aggagaaaag ctaatggtta cttttcgaga cgtagcaaca 2640 gggtactcgg aaatcgaaat catcaaacag aaatcagaag tacctcaaaa actaatgaat 2700 atcatcaaga agtgggaacg ccagacggga atcaaagtca agacagtacg atcggatcga 2760 ggaggagaat acattggagg aacattagaa aaatggctca aggatgaagg aattaatcat 2820 gaattttcca atccatacga acctgagcag aatgggaacg ctgaaagatt aaacagaact 2880 ttaggagaca tggcgagaac tctattatct tatagtcgcc taccaaatca cttctggaat 2940 tacgcttatc ttacatctgc ttacttatat aatagactcc ctaacacatt aaccgatgac 3000 aaaacacctt atgaattgtc ccatggcaaa aaaccaaatc tcgacatcat taggaccttt 3060 ggctcaattg cttatgtaca catcaatacg aatcagagag cccctggtaa attagaagac 3120 agaggacgac gatgtacaat gataggttat gctgaaggag gaaagggttg gttattttat 3180 gacccagcct caaagacaat cattgtatcg gcagtagcga aatttccata tgaagagata 3240 atcaaaacta gtaatgaaat acctaagaag accgctgtaa cgacatcaaa atccttgaac 3300 aaaattatga atgaagactt accagataaa ggacaactga aacacatact aaatgcgttg 3360 aaacttggtg actttacaaa tgaattgttg attgataagc aagacgtagc ggcaaagcac 3420 gcgttagatg gaaatgatta cctcaaattg ttaaaaccac caacatcata cgctgaagca 3480 atgagatcga aagaagcaga tagatggaag gaggcatgtg atgcagaaat ggccatgatg 3540 gataagatgg gagtgtggaa agtggtggat aaaccgtctg atttagtacc tattgcgctt 3600 aaatgggttt ttgcttataa gaaggttgat gatgaaggaa atcctacaaa attcaaatca 3660 agattagtgg cgaaaggtta taatcaacgt gaaggaatcg attacactga gacttttgca 3720 
ccaacagcta cgtttgcagg tctcagaatc atgttaacca tcgctgcaca tcaaaactgg 3780 cctgttcatt catttgatat tacatcagct tacttacaca gcgcaattga ctcacctatt 3840 tatttctcac ttccgactgg ttatatgtgt gaagctcgaa agaagaataa aattttagaa 3900 gcaatcaaag ctctatacgg gacgaaacag ggtgctagat gcgggtggaa acattttgat 3960 tcaatattac gggagttagg tttcacatca agtcaatatg atcaatcatt atatttttat 4020 cgaagaaaag atgaaacagt tattatatgg cttcatagtg atgatggagg agtaaccgga 4080 acaagcgaaa aacttctact agagattgct gaaaaattaa aagaaagatt attaattaaa 4140 tgggagacaa ctttagatca aattgtagga gttaatgtaa agagagaaga agatggaagt 4200 tttactttat cacaacctgg attgacgaag aagataatca aatccttctt acctgatgaa 4260 aggacagcaa aaactccgat gaatgttcag aaaattccat gttcaccgaa tgaagaagaa 4320 gagagagtag atactgaacg ctacttatct gcgattggaa gcttaaacta tttgtccgta 4380 gcaacgcgac cagatctcac gtacaccgta aactacttag cacgattctc atcagacccc 4440 aggagacaac actggcaagc aattgaacat gtaatgagat atttgaacac cacaggcgtt 4500 aaaagcttaa aaatcaaacc aatcaaaacg aaagtcgaaa cccctatcca cacgtacgta 4560 gacgctaatt ggggtggaga aggagctagg tcgtcccacg gattcatcac ttactttctt 4620 aactgcccaa ttgcatggac ctctaaacgt caaacatgtg ttgcttcatc aacgtgtcat 4680 gcagaataca tggcattggg aaccgcctgt agagatgcta tctggttgag aaatttaatt 4740 gaagatttaa ctggacaagg caacgttgtg aatatgcatt gtgataacac ctcggcaatt 4800 catgtatcaa aagataattc atccaacaaa agaaccagac atacagatag agaattttat 4860 tacatcaatg aacaaatata taagggaaga atatttttac attggataga tacgaaatca 4920 caacgagcag acattctcac caaaccactt ggaccaactt tacatgatca aggcttgaaa 4980 cacttaagac tcaaatgaaa acctttattt tcataatctc atgttttttt gttatttttt 5040 tctgattttc ttttatgatg cttgttaaat gattggtatg tgaaggtact gtttctggca 5100 aatagaaata atcctagact tggtcataga ccttgtaccg ggcaacagat tccagacaac 5160 ggacctgatt ataactctca tcttgtttct tatcatgttt ttcttattac gctatgtttt 5220 ctttcactat catatggatg tctgtgactt gtcgtggggg gggtgt 5266 // ID Copia-60_MLP-LTR repbase; DNA; FNG; 195 BP. XX AC AECX01000558; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-60_MLP_; KW Copia-60_MLP-I; Copia-60_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000558; Positions 142486 142292. XX SQ Sequence 195 BP; 44 A; 35 C; 52 G; 64 T; 0 other; tgacttaaac ctttaggtta gtctgtcagg tagtttaaag atgtgaggtg tcaggagact 60 cacctctagg attcgtctat tgggtatcag gttagtctgt caggtagttt aaagacgtga 120 ggtgtcagga gactcacctc taggattcgt ctattgggta tcaggtcatc atcagacgcg 180 ctttgtcgct tttca 195 // ID BOTY_LTR repbase; DNA; FNG; 562 BP. XX AC . XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 17-APR-2011 (Rel. 7.1, Last updated, Version 2) XX DE Botryotinia fuckeliana gypsy-type retrotransposon BOTY_LTR, long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Long terminal repeat; Gypsy superfamily; retrotransposon; KW BOTY_LTR. XX NM BOTY_LTR. XX OS Botryotinia fuckeliana OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Helotiales; Sclerotiniaceae; Botryotinia. XX RN [1] RP 1-562 RA Diolez A., Marches F., Fortini D. and Brygoo Y.; RT "Boty, a long-terminal-repeat retroelement in the phytopathogenic RT fungus Botrytis cinerea."; RL Appl. Environ. Microbiol 61(1), 103-108 (1995). XX RN [2] RP 1-562 RA Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (17-APR-2011). 
XX DR [2] (Consensus) XX SQ Sequence 562 BP; 176 A; 123 C; 122 G; 141 T; 0 other; tgttacgacg gattagtaac aggctgtaga atcaccaacg tataggctat aatggtatta 60 taggcctcag tgattcagct gcagtatacc gggggacact aggcacccaa ggaaagcctc 120 aggcatgtat atagtattag tcataggata tcctagaacg taggatacag ttcctaggac 180 aataggtcct aggaaacacc gaacataact ttgcaaactt ttcgcgaagt tatattagta 240 atatcccagg ggattagccc caggataaaa cgataagcta ggacactgga agtcacggga 300 caagtgtcac gtgaccacaa agccgactcg atttcccgac tcgacttgtc gattcgactt 360 caggacgaag gtctatataa gggaatgggt ttcattataa tgtagagctt cgtgctcaag 420 aacaatcatt agtttcatta ctatagttac gagaattgca accagttaca accttattga 480 attcctactt gaagtctagt ctaaaccacc tcgagagatc tctagacact tccacgtgac 540 cctagaggca gctcccgtaa ca 562 // ID PiggyBac-1_ParBra repbase; DNA; FNG; 2369 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW piggyBac; DNA transposon; Transposable Element; KW PiggyBac-1_ParBra. XX OS Paracoccidioides brasiliensis OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Onygenales; OC mitosporic Onygenales; Paracoccidioides. XX RN [1] RP 1-2369 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). 
XX DR [1] (Consensus) XX SQ Sequence 2369 BP; 759 A; 515 C; 406 G; 689 T; 0 other; cccctcgtcc gtaagttctt gctcaattag cgtgatttca attagtgtat ggaattttgc 60 accggcgcca ttcagtaacc aataagatct catataattt tacaaggatg atcattggtc 120 atttaaatcc acggattcgt gatttctcat caagctgaca gacgtgtatg gaattcaacg 180 tgatttttaa gctgccaacc tgcatgcctc tcagttcctt ctcttcatga aatccaccca 240 ccaacgtgga atccaccaaa aaacctatca aacccagtgg taatacttca tataaaaatt 300 aaatcttatg ggtcagctaa ttgatttatt ctccaggtca tacaaaaaca tctaagaaat 360 gaaaacaagc aattgaggat atcattgcag agattccatc cttggagtca ataaaattcc 420 atccaatgca gcctcttttg caagattcag ttctcaatct tcctgtcaat gttgatattg 480 atagtcccta tgctctattt accttatttt tctctgagga aagctttcaa aatatcagca 540 agtccactaa tctctatgca aaattgaaaa gagatgatgg agacactgat aatgaagctc 600 cggagctctc tgaggcagaa gttgatatag aatcaacttc attattatct gaaaatcaag 660 aatcttcaac tacaacctct actaccttca aaccttttca ccaatggtcc ttcaaaaata 720 ccaatgcagc agaaatgaag gttttatagg ccttcttctc tatatgggag ctcatagatt 780 ctgccgaact gatctatatt ggagaaataa tctggaacaa ggccctattc attccataca 840 aaaatatatg agttgcaccc gtttcgagca aataaagcgg tatattcata tatcaaatcc 900 ccgtgaggca cgccggccag aaactcaaaa taaggattgg tggtataagg tagagccttt 960 agccacagaa ttccatactg cctgccgaaa atattacaca cctggttcaa agatttcagt 1020 ggatgagatc atgattaaat gctttggacg aagtcaacac acctataaaa tgcctaataa 1080 accaattcca cgaggttata agatatttgg cctagcagaa catggatata tatggacttt 1140 ttcctggtct agccgtcggc agggtataat gcctatgtat caattcccag gaattacaag 1200 aacaggatct atggtgatga atcttcttca acgacttcct acagtcccta tattgactac 1260 tgagccgtca tctactgagc tgtctactga gcctgtcatc aaatcaccca cagagccgcc 1320 ttctgagttc accatcaaat cctctataag caatgaaatc cctgccatcc agcaagtaac 1380 tccttactca gtttaccttg acaactactt tacctccatt gctttattca aattactacg 1440 tgagaaggaa tatggtggtt gtggaactac tcaaccaaat caggctccat ctcttctatc 1500 tgaacttaga gagcatacag cagctattcc atggaatact ctgcatgcca ttgagaatga 1560 gaatgttctc tgtcttgcct ggcaagataa taatatggta tgggcactga ctacaattca 1620 ttcattggac 
acctttattg agtgtactcg aaagcggcct ggagagctct caacaaatgc 1680 taaagttgta tggaagatct ttgaaggtca gccacaaaag agattgaaaa ttcctgccgt 1740 tatagatgat tataatcata atatgaatgg agtggaccta gcaaatcagt accgcgcagc 1800 atatactagc catagaatca tctatcgaac ctggcttcct attttctact ggtttattga 1860 ttccgcggca gttaatgcct atatgcttca atatatatat ataagaagca gcagggtgtt 1920 ccaaaaaagg acctgccttc tcatataaac tttcatgaac ggctttacca acaactcttt 1980 gaattcacac ctaagattca tgactatctt ccacctcaac gattgaatcc tgatctaaat 2040 catcaacgga ttgctttacc aaagcaatca gtttgtgcct ggtgtcagta taaatgaaag 2100 ctaggtcagc aacaaaataa acagacacca agatcatatt ctggttgctc agcatgtcaa 2160 aatatgcctc tctgtttgaa aactcaatgt tgggaggaat tccatgagat tactcagtct 2220 agtagtaatg acgcgaatcc ttaattagtg ctttataaat taccttaata cacactaata 2280 ccaaagttac cttaattatt caatgagaaa ctgtataata tacactaatt gaaatcacgc 2340 taattgagca ataacttacg gacgagggg 2369 // ID Mariner-6_AF repbase; DNA; FNG; 1995 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of Mariner DNA transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Mariner-6_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-1995 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-1995 RA Kapitonov V.V. and Jurka J.; RT "Mariner-6_AF, a family of nonautonomous Mariner DNA transposons RT in the Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 103-103 (2006). XX DR [2] (Consensus) XX CC It is a family of nonautonomous DNA transposon from the Mariner CC superfamily (Tc1 clade). 
The genome harbors 5 copies that are 96% CC identical to the consensus. The transposase encoding region is CC saturated by many stop codons occurred due to RIP. XX SQ Sequence 1995 BP; 736 A; 298 C; 322 G; 639 T; 0 other; tccgtgacta gcccacccac actgcttccc gctgtgggtg agctacactg taatcactaa 60 tttcactgta atcactaatt ttcgatattt taacctttta aagctattcc caccacctaa 120 catgcctaaa tctattaaat ttaataaatc tgacctcctt aaggcctgcg aagccgctca 180 ggcctaaaat aaactaaata tctctaagat tacgcgtgaa tatggtgttc cttatttaat 240 actatataat tatattaaaa agggcagata ggcttataca gcttagaaac tagtaaataa 300 agtacttaat aggtactagg aggaagcctt aatatagtag atagtctaga tataagatta 360 taatatacta gtaataccta agctactaga agagtttata aattagttac tttaatatac 420 tagtaaagct agacaggtta gtagggtata ggtatattac tttaaaaaat aactcctaga 480 ataccttaat ctaggccctg tgaagcaaaa gataaaggaa ttaaagtata ttaaggctaa 540 ggatgctggt ttactagtaa attagtataa ttagcttact aatatagtta aagatatact 600 actataatta gtatataact ttaataaata tagcttctga cctagcaaag gcaaggtaag 660 gaatataatt agattaaaag gttcttgcct taatcttgct gaatctaaga aggataagaa 720 tataataact attaaatata ttactgtaga tagttagtag atagatctat agtttatctt 780 taaaggcaag ctcctactct tttagactat tcttttctaa cctttcttac ttcctaaagg 840 taataggatc tttatagaat gttagtttaa taagagcaag gccctactac taaatataat 900 aatagctatg caagctaatg gctagatatt agatgaacta gcctattaat agctttaaag 960 ctttattaag gcaataaata agtgtataaa gagaggagag aaataaatac ttatatttaa 1020 tagtcatggc tcctatctta ctattaattt cttatagata tataaagata atagggttat 1080 tccctttaga ttccttcctt atataatata cctttactag ccactagata gcaagctatt 1140 cttaagctat aagtaatact tctaatatat aaataataag ctatcttact aggctagtaa 1200 gccagtaggg aagttagaat tcttataggt aattagacct atataggaga aagcctttaa 1260 ctaataaatt atctataagg cctttaaaga tcatagtatc tagcctgtta atagtagtaa 1320 gatagttaat aatcttacta tctaggcatg ggaataaatt ctagatgtct acgcgcctga 1380 tcttaataca tgccttagag ggacaccctc tctactacct atctccttat ctagtatgga 1440 tatcacccct ctaaggataa ttcaggccct taagaagaat taggcaaagc tatctaagta 1500 taaagatctg 
cttatactaa agctatagca gaatcttaaa tagatattta aatataatta 1560 aattactgct aagcatctgg ctatagtaaa taaaataatt aattaaatta gggctgtata 1620 agccccccta tagcactaat atactaagta atatattaag ctacttagtt aggatggtat 1680 actaaaagta cataatataa attaattaat tactttaagg aaggctaaag atactgctat 1740 ataagagagg tgtttataaa ggcagtagga gaaagtatat agtaaacccc caccactagc 1800 acctatataa gagaatctag tattaaatag atcagcaggg gcagcagatg aaaatagtga 1860 tgtttttttc ttagatagtc agctaatatg ttgagaatag cttcaaaata tcgaaaatta 1920 gtgattacag tgaaattagt gattatggtg tagctcaccc acagcgggaa gcggtgtggg 1980 tgggctagtc acgga 1995 // ID MARY1_TM repbase; DNA; FNG; 10419 BP. XX AC ABO27513; XX DT 29-MAR-2005 (Rel. 10.03, Created) DT 29-MAR-2005 (Rel. 10.03, Last updated, Version 1) XX DE Tricholoma matsutake retrotransposon, partial sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy retrotransposon; gag-pol; reverse transcriptase; MARY1_TM. XX OS Tricholoma matsutake OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Tricholoma. XX RN [1] RP 1-10419 RA Murata H., Babasaki K. and Yamada A.; RT "Highly polymorphic DNA markers to specify strains of the RT ectomycorrhizal basidiomycete Tricholoma matsutake based on RT sigmamarY1, the long terminal repeat of gypsy-type retroelement RT marY1."; RL Mycorrhiza 15(3), 179-186 (2005). XX DR Genbank; ABO27513; Positions 1 10419. 
XX SQ Sequence 10419 BP; 2615 A; 2621 C; 2452 G; 2731 T; 0 other; gtcgacctgc aggtcaacgg atctctgcta ctgttgtggg gaacctggac acagggctgg 60 agtgtgtccc cattgtcagg atatctgcat gctcaccccc gagggatgag acaaactcct 120 gcaagatttg ttggcgcttg tagatgttga ggaagcaaag gatcaagttg cagaggagat 180 taagttggaa gttgtagagg agcaggattt tcctattcgc agcgagtgaa tcgcacgccc 240 tcactgcctg tccataacca ctttgaagtc ttgtcagttg aatccctgga gaatgaatcc 300 atttctgtga ctgctcccct ctctgaccct tcgaagttgg ctaaagacgt acctgctctc 360 cccatctctc cttctactcc acgtcccaaa attaaatcat gggagcgtaa actaccacgc 420 aagtatattg tggcatccac tccctctgca cattccttaa acctagccat caaactgcag 480 acaaccgaca ctggacagat actcggtgtc tctgcactcc tggactctgg tgtgactggt 540 ctcttcattg attcccatct ggtacaacaa catcgcctaa atactcgctc actctcacgt 600 ccgatacctg tatataacat ggatggatcc ccaaatgaag ctggtgccat ctgagaggtt 660 gcagacctgg ttctccgtta caaagatcat agtgagtgtg ccctttttgc tgtcactcaa 720 ttaggaaagc aaaaggtgat tttgggatat ccatggcttc gagaccataa cccagaggtc 780 aactggcata ccggggaggt taccatgagc caatgtccat cccgctgtcg gacgtgtttc 840 acagaagcca ggcaagaaca ttgtggacag aaggcagcaa tacgatgtac tcagaagtgt 900 catgctggac ccctgcctat acctgaagtg gaattggaag gggttcctga tctaatgccg 960 gactcagatg atgatgatga ggacccagag gaagtggagg agggtgattg gatcttctac 1020 actgcgattc actcatcgca tgaggtgtgt gccacatcga acatatccca acacctcacc 1080 gaggccttcc acaaaaacaa tgctgctaaa tcctctacta actccttact ccctcatctc 1140 catgagtttg aagacgtgtt ctccaaggag tcattcgact ctcttcctgc acgcaagcca 1200 tgggatcaca cgatcgaact cacccctggc gcaacccctt catcctgcaa ggtctatcct 1260 ctctctccgg aagaacaatg gcagcttgac cagttcatcg acgagcacct ggcctctgga 1320 tgtattcaac catcaaagtc acccatggcc tctccctttt tctttgtcaa gaacaaggat 1380 ggctcacttc aactggttca agactacaga aggctcaaca acatcacggt caaaaattgc 1440 tacccactac ccctcatctc caaactcata aaccagctcc atggagccca atacttcacc 1500 aagctcgatg tccattgggg ctataataat gtgaggataa aggagggaga tgaatggaag 1560 gcagcctttc gtaccaaccg tggtcttttt gaaccacttg tcatgttctt tggccttact 1620 aatagccctg ccacctttca gaccatgatg 
aacaacatct tccatgacct catcctcgaa 1680 ggcattgtct gcatctatct tgatgacatc ctcatcttca ctcgcatggt tgaagagcac 1740 catcgcatca tgcgcctcat cctagagcga cttcgatggt acaaactcta ccttcgtcag 1800 gacaaatgtg agttcgaaag gaccaagatc gagtaccttg gcctgattat ctcggagggt 1860 caagtggaga tgaatcctat caaggtaaat ggggtcacag cttggctgac acccacgaac 1920 aagaaggagg tgcagtcttt cctgggcttc atcaatttct atcggtgttt tatcaaggat 1980 ttctcacatc atgcacgacc actgtttgac ttgacaggaa aggaggcatg gaggtgggat 2040 aatgcacaac aggaggcatt cgaaaagctt aaggagatgg tcacttccgc tccgatcctt 2100 acctttgctg acgatttgcg tccttttcgt gtagaagctg acagttctga ctttgcgatg 2160 ggcgccgtgt tgtctcagca gtctctggag gaccagaagt ggcatcctgt tgtgttctac 2220 tccaaaagcc ttagtgctgt agagcgaaac tacgagatcc atgacaagga gatgttggca 2280 ataagcatgc acttgaggag tggcgacact tcctcgaggg tgcccaacac aaggtggaaa 2340 tttgaacgga tcacaagaat cttgagtact tcatgacggc caagaaattg aaccatcgtc 2400 aagctcgttg gtccctgtat ctatctagat tcgacttctc cttgcaccac catccgggtt 2460 gcagcatggg gaagaccgat gccctatccc aacatgcaga ccatggggat ggcagcagtg 2520 acaaccaaaa cattgtgctg ttgcagccag acctgttcac cattcgggca ctggaaggca 2580 tcgcacccca gggcaaggag caggacatcc tacgtgacat tcatcgtgca aacatagcag 2640 tttacatgag gatgttgtca cccgagctgt taaagaactg aggaagggat cgaccctttc 2700 cttacgtgca gcagagtggg ctgaaaagga ggacttgttg tatttcaggg attgcatcta 2760 tgttcccaat gacccagacc ttcgccggca catcatatcc caacaccatg actcacagat 2820 tgctggccac cccggatgct ggaagacctt ggagctctat tggtggccac aaatgtcatg 2880 gctcattggt cagtattgtc gcacttgtga cctttgctta tgcaccaagg ttcctcatcg 2940 caagcctatt ggtgagctac accctcttcc tgttccagag agtcgttggg atgtcatcag 3000 cattgacttt gtggtggaat tgccagagtc aaatgggttt gatgtggtga tgtgcacagt 3060 tgactcagtg ggaaaacacg ctcacttcat tcccactcac accactgtca gtgccctagg 3120 tgctgctcga ctctacctcc accacgtgtg gaaattacat ggacttcctg gtgccttcct 3180 ctcggatcat ggccctcagt tctggggagg ttacaatgag ccagtgtcca tcccgctgtc 3240 acacttgttt catggaagcc agacaagaac actggggaca aagggcctca gtacaacaca 3300 ctcagaagtg tcatgccaga cccctgccca tgcctgaggt 
ggagttagag ggggttcctg 3360 atctgatgcc ggactcagat gataatgatg aggagccaga ggaagtggaa gaaggtgatc 3420 agatcttcta cactgcaatt tgtctatcac atgaggtgtg tgatacattg aacatatccc 3480 aatgcctcgc tgaggccttc cacaaaaaca atgctgctaa attctctatt gattccttgc 3540 ttcctcatct ccatgagttt ggggatgtgt tctccaagga atcatttgac tctctccctg 3600 cgtgcaagcc atgggatcat gtgatcaaac tcatccccag tgcaacccct tcatcctgca 3660 agtctatcct ctctctcctg aagaacagtg acagcttcga tgagcacctg gcctctggat 3720 gtattcgacc attgaagtca cccatggcct ctcccttttt ctttgtcaag aagaaggatg 3780 gctcacttca accagttcaa gactacagaa ggctcaatga catcatggtc aacaatcact 3840 acccattacc tctcattagg gcaaggcagg ggttacccct gggtgtcaaa tctcataccc 3900 ttacccctac ccctgaacac cctaccctta ggggtaaggg tatggaaaca tataagggtt 3960 gccaagggta tgggagggtt tgtcaagggt acaagggttt tgcacaaaat caacatattt 4020 agagcaaaat tcatttgaaa agtgttgata actattgtat ttactattaa tatcacatat 4080 taatacctct tattaacata tgaaagccat tatcatgttg gacaatccca aaatctgaat 4140 ttaattgatg aaaatcttga atatgagtct ttgtagctga aaaaatcaaa tgatattctt 4200 acataagcat acccctggca tacctctgta cccttagtaa gggtacaggg gtatgcaagg 4260 gtatgaactt ccaaacccct acccttaccc ctcataccct tagccccgaa aacacaaggg 4320 tataccccta cccctgcctt gccttacctc tcatctctga acttgtgaat gagctccatg 4380 gaggccaata ctttaccaag ctcaatgtcc attggggtta caataatgtg aggataaagg 4440 agggagatga atggaaggca gccttttgta ccaaccgtgg tctttttgaa ccacttgtca 4500 tgttctttgg ccttactaat agctctgcca cctttcagac catgatgaac gacatcttcc 4560 acgacctcat ccttgaaggc gttgtctgca tctatctcga tgacatcctc atcttcactc 4620 gcatggttga agagcaccat cacatcacac gccttgtcct agagtgactt tgacagtaca 4680 aactctacct tcgtcaggac aaatgcgagt ttgaaaggac caagattgag tacctcggcc 4740 tgattatctc ggggggtcaa gtggagatgg atcctatcaa ggtaaacggg gtcacagctt 4800 ggctgacacc cacgaacaag aaggaggtgc agtctttcct gggcttcatc aatttctatc 4860 agcattttat caaggatttc tcacatcatg cacgaccact gttcgacttg acaggaaagg 4920 aggcatggag gtgggataat gcacaacagg aggcctttga aaagcttaag gagatggtca 4980 cttctgctcc gatccttacc tttgctgatg atttgcgtcc tttttgtgta 
gaagccaaca 5040 gttctgactt tgcgacgggc accgtgttgt ctcagcagtc tccggaggac cagaagtggc 5100 atcctgttgc gttctactcc aaaagcctta gtgctgtaga gcgaaactat gagatccatg 5160 acaaggagat gttggcgata atgcgtgtgc ttgaggagtg gtgacacttc ctcgagggtg 5220 cccaacacaa ggtggaaatt tggacagatc acaagaatct cgagtacttc atgacagcca 5280 agaaattgaa ccatcgtcaa gctcgttggt ccctgtatct atctagattc gacttctcct 5340 tgcaccactg tccgggttgc agcatgggga agactgatgc cctatcccga tgtgcagacc 5400 atggggatgg cagcggtgac aaccaaaaca ttgtgctgtt gcagccagac ctattcacca 5460 ttcgggcact ggaaggcatc gcaccccagg gcaaggagcg ggacatccta cgtgatattc 5520 atcgcacaaa ccgtagcggt ttacatgagg atgttgtcgc ccgagctgtt aaagaactga 5580 ggaagggatc gaccctttcc ttacgtgcag cagagtgggc tgaaaaggag gacttgttgt 5640 atttcaggga tcgcatctat attcccaatg acccagacct tcgccagcgc atcatatccc 5700 aacaccatga ctcacagatt gctggccacc ccggatgctg gaagaccttg gagctcacct 5760 cacggaacta ttggtggcca caaatgtcat ggctcattgg tcagtattgt tgcacttgtg 5820 accgttgctt atgcaccaag gttcctcatc gcaagcctat tggtgagcta caccctcttc 5880 ctgttccgga gagtcgttgg gatgtcgtca gtgtcgactt tgtggtggaa ttgccagagt 5940 caaatgggtt tgatgcggtg atgtgcacag ttgactcagt gggaaaatgt gctcacttca 6000 ttcccactca taccactgtc agtgccctag gtgccactcg actctacctc caccacgtgt 6060 ggaaattaca tggacttcct ggtgccttcc tctcggatcg tggccctcag ttcatggcag 6120 agttcactcg tgagctgtac cgactcctgg ggatcaaact cctcacttca actgcttatc 6180 acccacaaac tgatggccag acagagcaag tcaaccagga actggagcag tacatctgcc 6240 tctttgtcaa tgaatgccag gacaattggg atgatctcct tcctttggcc aaattcggat 6300 acaacaacca tgtccatgct tccactcaac agaccccttt cctcttagac actggacgac 6360 atccacacat gggatttgaa ccttgccaaa ctccctcaca cattgagacg gtgaatgagt 6420 tcacagaacg catgagggat agcttggagg aggcaagggc agccttagca aaggcgaagg 6480 atgacatggc gcaattttac aaccaatgcc actccccaac ccctcagtac aaggttggtg 6540 atcgagttta tcttgactcc agtgacatct ccactacacg tccatccaag aaactcgctc 6600 atcgcttctt gggacctttc cccattgtca agtgcattgg aacgcatgcc tatcgtctat 6660 gcctcccagc ctccatgtcc agactccatc ccgtcttcca tgtcgtcaaa ttgcttccag 
6720 ctcccccaga tctgttcccg ggacaccatt cacacccacc accaccacct acagtgatcg 6780 agggagaacc ccaatatgaa gtcgagtcaa tcttggacag ccatttgcgt cgaggaaaac 6840 tccagtattt ggtacattgg aagggttatg ggtatgagga gaactcatgg gtcgaagagt 6900 ctgacgtaaa tgccccttgg ttaatcaagg aattccaccg acggcacact gctgcaccgc 6960 atcgcattca agtcacagac tttggtcata tgtgatttcg aatgcatcgg ggcgatgcat 7020 cttagagagg gggtgatgta aggggactag gggggtgccc tgtgtgtttt tcctattttt 7080 tatgtttgtc ctatattttt ccccacctca tttccttcta ttcttcattt tctatgattt 7140 ttccggattt tattactaat tttcccttgc tatcatgttt gtacatgttg acacttttat 7200 atttagaggc ggagtacccc ttccgacaca taccccttgg tatgatgtca ttcggtggcc 7260 ttgatccgcc tatttccgcc tttttctatt tttaggttat atttgatttt tgtcatcttc 7320 acatatactg ggcatcagca agggtatcct tagagttttc tcttcggatt tccctttgct 7380 aggtgtgtcc ccaatgcctg tattatgtag tataatttta acttcagatt tccctttgct 7440 agaggcttag gttgccctta tatcagccct ctctcgtggc ccccttcttg tcccccctca 7500 ctgttcccca ttggtctgta tgcacccccc ttgctcattg ttgttccact gcggtccacc 7560 ccatgagcag ttgctcatga ggctgggggc aggtgttagg gtgacaccca gtatctgcta 7620 gctaagacca tgcaggttct ctctacaaga agggggcggg atccacgctc gtgatggggg 7680 aattgatggt tttgatgcga ggggttatta ataggtgagt ttgttgagat aagcctttca 7740 caccgaaaac ataggtggaa tacctaagag aacacaactg gggtccccta ggcatatatt 7800 tagggtttat atactatgtt acacatattg tacattgttg ttaccgatgc acttgtacat 7860 gcgcatacat tgtgatgcgt gcgaagtaaa attctcatgc tatcaaagtg tagcatcatt 7920 tggcatcatg tagtgtaatt tgacataata aaataaaatg aaattttatt aggtaatggg 7980 gtggtttcct aacagtgggt ggtgcatcgt tgtctcctgt ccctgcacct gggtgttgtt 8040 ccacccatgt tccaccccac aagcagttgc tcgtgaggct gggggcgggt ggtgcactgt 8100 tgtctcctgt ccctgtgcct gggcatcatt tcacccacat tccaccccac aagcagttgc 8160 tcatgaggct gggggtgggt ggcttgtcat catcattgtt ccacccatga tccaccccat 8220 gagcagttcc tcatgaggct ggggggtggg tggtgcactg ttgtctcctg tccctgtgcc 8280 tgggcatcgt tccacccacg ttccacccca tgagcagttg ctcatgaggc tgggggtggg 8340 tggcttgtca tcatttgtcc ccatcatccc ccgtttgaat ttctgtagtg tttgaaggtc 8400 
tggtccatag gaccgggaaa agaccgaaga ccaaaccaaa ctgacacatg gtctggtcca 8460 ttttttggtt gtggttgctt actttcaagc tcagtccagt tgccagttgc cttgttttga 8520 aaatatttaa aaaccatcac aaaaccagtt gcaatcggtt gcaaccggtt ttctcgcaat 8580 atatacaatg tactatgtac ctgcttagaa gtaagaaatt ataactaaca taagtactaa 8640 tcataactat ctgtcataca gttacactac attacactac aattacacgc agtgggacaa 8700 gtcagggtag ttatttggtg ggtacttata taccctgtgg ccgtgcacag accatgtggt 8760 catgtgctct tgaaaaaaag aaaagtggca ccattgccag atgactccca cgacaacaaa 8820 caactcccac aactcccatc ccatccgtcc accaccacca tcaacacatg cacaccacct 8880 gccccaaaac caccaccaat gaatgaatgc aaagcagaca gcccaaacga cgaccagcat 8940 tgtttgggcc gtcgggtatg atttttttgt ttaatcttat ttcaattact aataattctt 9000 tttgtagtgt ctcttttgca aacgacattg cctagacatg acgcaacaca cctgggccct 9060 cactgtaccc atgccctatg tcaccgctgc gagcctctgc tcacaggggt ggcaacgggt 9120 cctgtttctg gacaatgaca acgacacaac aacgtggcac caacacgccg ccgcgctgac 9180 catgtcacag caccaccagc ccgcctgtga gccactgctc acagggtaga tggaggtgct 9240 gacaaccatg gggggacaac aaacaagaac gcgaacaggg aggatgacag ggagaagaca 9300 acaacaacag caaaaacagg agcggtggtg acagcggagg agacgacaaa caccagcatg 9360 gggacaacaa cacggggatg atccggacaa caacacgagg acgacaaatg acaatgcagc 9420 ggacaatgaa tgatggggct gaccataacg aatacccggg acaacataca atgccccgct 9480 ctcactgcaa acgtgagtct gtgggttctt tttcttttct ttcttttgtt gtgatggatg 9540 caccttacag gtgttgttcg ctcttttttt ttctttcttt ctattagcaa caccccactg 9600 ctgcgagcac ctgcttgcag ggtgtgttgt aattagttgt atttaatata tagtaaaatt 9660 aaaatatgta ttgaaaattg ctggaaaaac cggtggcaac tggtttttcg agggcaacca 9720 aaccgactag gactggtcct agtcagttcg gttgcagttg ccctaagttg tgcaccagtt 9780 gcaaccggtt gcggttgccg gttgcccatc ttggggtcga aaaaccggac caaactggac 9840 cttaaaacac tagatttctg tctttttttt ttgggagggg ggggggaggg gtttgggcac 9900 atcagtgcaa tgtggcgcat gtacgggggt tggtgggtgc ttactaggca ggtatccccc 9960 ttctggggtc tcctggcatc cctcggcacc ctcctagcct gtgtcgacag cctcacatcc 10020 catttgaaca gggaggaggg gggattgggt gggcgtgggc atgggttgcg ccttttttgt 10080 
tgtggcaggt catccccagt agtatacaac ctaaaaagca ataaaataca attagttcat 10140 tgataaaaac aaaaagaaaa aagaaaaaaa aaacttaact atggcccaaa caacagttgc 10200 catcatttgg gcccacgttc catccagtcg gctcgtttaa aggtggtggt gatggtggga 10260 gtgggcatgg gaacagtggt ggcatgcgtg catgcagtgg tggtggtggt agagtggagg 10320 ggagcaaggg aggtggtggc atcgatggtg gtggtgagtg atgagctggc aacatgttga 10380 tgatgtggct gtgtgctttg tcatacaaag ccaaacagt 10419 // ID Gypsy-107_MLP-I repbase; DNA; FNG; 5657 BP. XX AC AECX01000643; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-107_MLP_; KW Gypsy-107_MLP-LTR; Gypsy-107_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5657 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000643; Positions 1358 7014. XX CC Positions [4361-4840] - Integrase core CC 'CGGGG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 224..5227 FT /product="Gypsy-107_MLP-I_1p" FT /translation="MQRAENDDTRRTTRAAGNTLVNQVFDPDAILRAPRPH FT LRNPGAGNYRGGYYVAPSSASLPSNSAPLAATPVGSLSPSSTPRSSLEHPP FT LTPHIPGNFPTTPFHHNLRSNPTSHEAPTFIEPTHPTHTIAEPVPSSSDEM FT HGNSESTASAPPTNEELQREILLYQRDGLRAANEAAQRMAKLEEELLTLRL FT ENQTQQESKPSVDQRSTSIDLTKFKTSDGPAFKGPFRQIEPFLRWAATLKI FT FFFVKGVTTDHDRIALVGTFLVEPNLQAFWSNGFDSFIQGSWEDFKDHLFR FT AALPADWQDRLTEQLQHLQMGEFEDFRSYSTRGRSIQSLVNYDVICVDDLE FT FARLLCFGMVPVLKSEVRMWKLLRVAEFDYNDFEADCDDLYASLKDKGLLA FT RKPNSTVARPPSTQTRSNFSSRVSTTDFIWKIHSFLDSKGLCHYCKRQCGS FT EHGACTGTPDRSRIVFPPNYVAPPKPANYKPPRAKLSSNQAGQPTHPPAGR FT PSRTSVAATSVINTDNNPFTAPTFPELDSASVAAFNAIDELVREDFQSADD FT DSDDQVAVTGEGSSKEYVPQPNNRPTVIIELTCRGTTIRALADAGSETNLM FT SNRLADLLKLHRRKLVKPTIVGLALDSSGARPVLTDFVVESLSHTDSGTIF FT DRTCMKLGDLGSSYDVILGAPFLARHQLAISCSRNCVLSETSSLKIFDYRL FT VLKLKLDLEDKLKKERSDTENRVERERAQSEAITWKNWVAGLSHQNEPERS FT AQWVKWEKELLLDYQDIFPIDIPAVSDEAEERGDFTDGSFPEKMQDPSSKV FT RHRIVLTDPNAVINEKQYPYPRKHMDAWRKLVNQHLAAGRIRRSTSQYASP FT SLIIPKKDPSELPRWVCDYRTLNSLTVRDRSPLPNVDELVRLVAGGKVFSI FT LDQTNAFFQTRMREADIPLTAVKTPWGLVEWVVMPMGLTNAPATHQSRLEE FT ALGDLLNVVCVVYLDDIVVFSNSVEEHRLHTRLVMDRLRKANLYCSSKKTK FT LFRKEIKFLGHYISAEGVRPDGEKVEKILNWPSPKSPRGVKKFLGTVQWMK FT KFIRGLEKYVGSLTPLTSSKLDAKNFKWGDLEETAFNNIKKIMTSLPCLKN FT IDYESKDPLWLFTDASGLGLGAALFQGAAWETANPVAYESRQMSSAERNYP FT VHEQELLAVIHALQKWRMLLLGMKINVMSDHHSLTYLLKQRTLSRRQARWL FT EHLADFDLDFKYVKGPDNSVADALSRKDGAEPDEFNVGIETISALILTQPT FT ISKDIRSSILSGYSSDPFCQDLRKVLPLRDDSAEVDGLLFIDGRLVIPNDK FT SLRANLILEAHRRLGHLGFKKTISNLRKDFFWPKMAKETEAFVQGCETCQK FT TKARTTLPNGHMKTPHFPFQPVTDIAIDFVGPLPKINNYDMLLTVTCRLSG FT FTRLIPTNQKDTAEKTASRLFASWHSIFGAPSSIISDRDKAWTSKFWKALM FT IRTNTGFHMSAAFHPQADGRSERTNKTVGQILRSFTAKRQTKWLESLPSVE FT FAINGAVNVATGFPPYELVFGGKNRLFPTKPALDNQPTSVETWIKQREEVW FT AQARDQLWTSRVQQAVQHNKRHRDVQLTEGSWVLLDSGDWRGRHSGGVDKL FT KERFEGPYKVIQVFNGGQSCELDLPVGDKRHRTFNVSKLKVFVGDDEWAV" XX SQ Sequence 5657 BP; 1505 A; 1508 C; 1257 G; 1387 T; 0 other; cttttttttt ctcgaacctt tcgaagtatt 
ctggataatc gaccaccttc taaccttgtc 60 gattcctccc aacgttgtcg attctattca atcgatacac attattatat cttacgaacc 120 ttacattaca gtcagactca atccttcaag ccttccagat ctcttactac ggttaaaacc 180 tttcatacac acctgcagat ctataattat ccttcatcat cgaatgcaga gagcggagaa 240 cgacgataca cgacgtacta cgagagcagc ggggaatact cttgtcaatc aagtattcga 300 tcccgacgca attctcagag ctcctaggcc acaccttagg aatccgggag ctggaaacta 360 tcgtggaggc tattacgttg ccccatcttc tgcttctcta ccctctaact ccgcccctct 420 tgcagctacg ccagtagggt cattatcgcc atcatcgacc cctcgttctt ctcttgaaca 480 tcctccctta accccgcata tcccgggaaa ctttcctacc acaccctttc accacaatct 540 tcgctcaaac cctacctcac acgaggcgcc gacctttatc gaacctactc atcctacgca 600 cacgattgct gagccggtac cttcaagctc agacgaaatg cacgggaaca gcgagtctac 660 agcctcagca cctccgacaa acgaggaact gcagcgagaa atcttactat accaacgcga 720 cggcctacga gccgccaacg aagcagcgca gcgaatggcg aaactagaag aggagctctt 780 gacgctacgg ttagagaacc agacccaaca ggaatctaag cccagtgtcg accaacgatc 840 aacaagcatt gatcttacga aattcaaaac ttccgacggg ccagccttca aaggtccctt 900 cagacaaatc gaaccctttc tacggtgggc agcgactctg aagatttttt tcttcgtgaa 960 aggggttact accgatcacg atcgcatcgc tctagtgggg acttttcttg tggaacccaa 1020 cttacaagct ttttggtcaa atggtttcga ctccttcatc cagggatctt gggaagactt 1080 taaagatcat ctattccgag cagcattacc ggcagattgg caagatcgtc ttaccgaaca 1140 acttcaacac ctacaaatgg gtgaattcga ggatttccga tcctacagta cgcgaggacg 1200 ctccatccag tccctggtga attacgacgt gatctgcgtt gatgatcttg agttcgctag 1260 gctcttgtgc tttggaatgg ttcctgttct gaaaagcgaa gtccgaatgt ggaagttact 1320 acgagtggcc gagttcgatt acaatgactt tgaggcagac tgcgacgacc tgtatgctag 1380 tctgaaagac aaggggttac tggcacgcaa accaaacagt accgtagcac gccctccgtc 1440 gactcaaact cgatccaact tctcgtcccg agtctctacc acggatttca tatggaagat 1500 tcattccttt ctcgactcta aaggcctgtg tcactattgc aaacgtcaat gtggcagcga 1560 acacggggct tgcactggaa ctccggaccg ttcgagaatc gttttcccac cgaactacgt 1620 agccccacca aagccggcta actacaaacc accgcgagcc aaactttcgt ctaaccaggc 1680 aggccaacca acccaccccc cagccggtcg accctcacgc acttcagttg 
ccgcaacctc 1740 agtaatcaac acagacaata accccttcac ggcacccacc ttcccagaat tagactcagc 1800 cagtgtcgca gcattcaacg ctatcgacga actcgtccgt gaagatttcc aatctgctga 1860 tgacgactcg gatgatcagg tagctgtgac aggtgaaggt tcatcgaaag agtacgtacc 1920 acaacctaat aacagaccaa ctgtgatcat cgagttgaca tgcaggggga caaccattcg 1980 cgcactagca gacgcgggat ccgaaacgaa tctcatgtca aaccgtttag ccgatcttct 2040 taagcttcac cgacgcaaac ttgtcaagcc aacaatcgta ggtctcgccc ttgattcgtc 2100 tggagcaaga cctgtactga cggatttcgt tgtggaaagt ttatcacaca cggattcagg 2160 aactatcttc gaccgcacat gtatgaaatt aggagacttg ggttcatcat atgatgtgat 2220 tctaggcgca ccattcttgg ctagacatca acttgctatt tcatgctcca gaaactgtgt 2280 acttagtgaa acatcctcct taaaaatttt tgattatcgt ctagtgctga aattgaaatt 2340 agatctcgaa gataaattga agaaggagcg ttccgacaca gagaatagag tcgagagaga 2400 gcgcgcgcag agtgaagcca tcacgtggaa gaactgggta gctggcctgt cccatcaaaa 2460 tgaacctgaa agaagcgcac aatgggtgaa gtgggagaaa gaattactcc tggattatca 2520 agacattttc cctattgaca tcccagcggt ttcagatgaa gcggaagagc gtggggattt 2580 cactgatggt tccttccctg aaaagatgca ggacccgtca tcaaaggtcc gacaccgcat 2640 tgttttaacc gaccccaacg ccgtcatcaa tgagaaacaa tatccttacc cgagaaaaca 2700 tatggacgct tggcgaaaac ttgtcaatca acacttagca gcaggaagaa tacgaagatc 2760 tacgagtcaa tatgcctcac cgagccttat catccccaag aaggacccta gtgaacttcc 2820 aaggtgggta tgcgattacc gcaccttgaa cagcctgacc gtaagggaca gatctccgtt 2880 gccaaacgtt gatgagttgg tcaggctggt ggctggtggt aaagtttttt caatacttga 2940 ccagaccaat gcattttttc aaaccagaat gcgcgaagct gacatcccgt tgacggcagt 3000 taagactccg tggggattag tcgaatgggt tgtgatgcct atgggcctta cgaacgcacc 3060 agctacacac cagagtagac tggaagaagc attaggggat ctactgaacg ttgtctgcgt 3120 ggtttatctc gacgacattg tagttttctc caactctgtc gaagaacatc ggttacacac 3180 gcgccttgtc atggatcgcc ttcggaaggc caatctgtac tgcagcagca agaaaaccaa 3240 gctcttccgg aaggaaatca aattcctcgg acactacatc tctgcggaag gggtacgacc 3300 agacggcgaa aaggtagaaa agatcctcaa ctggccatcc ccgaaatcac cgcgaggagt 3360 gaaaaaattc ctaggcacgg tccaatggat gaagaaattt atccggggtc ttgagaagta 
3420 cgtgggatcc ttaactccac taaccagtag taagcttgat gctaaaaact tcaaatgggg 3480 agacttggag gagacagcct ttaacaacat caagaagatc atgacctctc taccttgttt 3540 gaagaacatc gactacgaat ccaaggatcc tctgtggttg ttcacggacg cgagtggatt 3600 gggattaggt gcagctttat ttcaaggagc agcgtgggaa acggctaatc cggttgccta 3660 cgagtcacgg cagatgtcta gcgcagaacg aaattatcct gtacacgaac aggaacttct 3720 ggccgtgata cacgctttgc aaaaatggcg tatgctccta ttgggaatga agattaatgt 3780 catgagtgac caccactccc tgacgtatct tctgaaacag cggaccctca gtagacggca 3840 agctcgatgg ctagaacacc tggccgattt tgacctcgac ttcaaatatg tgaagggtcc 3900 tgacaactca gtagccgatg cactttctcg caaagacggc gccgagccgg atgagttcaa 3960 cgtcggtata gagacgatct cagccctcat cctgacccaa ccaactatct ctaaggacat 4020 ccgatcctca atccttagcg gttactcttc cgaccccttc tgtcaagatc tacgaaaagt 4080 tctaccccta cgagacgata gtgctgaggt cgacggtctt cttttcatcg acggacggct 4140 agtgatccca aacgacaaaa gccttcgagc caatctcatc cttgaagcac accgccgact 4200 aggccacctg ggcttcaaga agacgatcag taatcttagg aaggacttct tctggccaaa 4260 gatggccaaa gagacggagg cctttgtgca aggctgcgaa acctgccaga agacaaaagc 4320 tcgcactaca ttaccgaatg gacacatgaa gaccccacat ttcccgtttc agccagtcac 4380 cgacatcgcc attgatttcg taggacccct gccgaagatc aataactacg atatgttgct 4440 cacagtcaca tgcaggctct caggtttcac gcgcttgatt ccaaccaacc aaaaagacac 4500 tgccgagaaa acagcctctc gacttttcgc atcgtggcac agcatctttg gagctccttc 4560 gtcgatcatt agcgaccgcg ataaggcttg gacttcgaag ttctggaaag cattgatgat 4620 tcggacaaac acgggcttcc atatgtcagc tgcattccat cctcaagcag acggacggag 4680 cgagcgtaca aacaagacgg tcggccagat tctacgctcc tttaccgcga aacggcagac 4740 gaagtggctg gaatctttgc catccgtcga atttgcgata aacggggccg ttaacgtcgc 4800 tacaggtttt cccccgtacg agctcgtctt tggtggtaag aaccgcttgt tccctaccaa 4860 acccgctctc gacaaccaac ctacctcagt tgagacctgg atcaagcaac gggaagaggt 4920 gtgggctcaa gctcgtgacc aactctggac gagcagggtc cagcaggcag ttcaacataa 4980 caagcgccac cgagacgtcc agcttaccga gggatcctgg gtcctccttg actcaggcga 5040 ttggcgaggt cggcactctg gtggtgtcga taagctgaaa gagcggttcg agggtcccta 5100 
caaggtgatt caagtcttca acggcgggca aagctgcgag ttggacctac ctgtcggcga 5160 taagcgacac aggaccttca acgtctcaaa acttaaggtc ttcgttggag acgacgagtg 5220 ggcggtttag taagagtgtc aggggtaaaa gtaagtcccc tccactcgta tgcaccgccg 5280 aggtagtcta ctaccatcaa tttgaaaaca aaaccatggc cacactgtga gcgaaaaacc 5340 accgcctgct ttaccttaca gacacggcgg gccacgacgg gacaccttca cggttttgac 5400 tttcctctat ccttctttct tactcattct tttctcagtt tctctttctg ttttattttc 5460 attttctgtt tcaattttct ccttttctgt ttcaatttta tcattttagt ttctcgaaac 5520 ctttttctct ctctgtttct cataatggtt ctggtttagc ttacacttaa ttctcttggc 5580 ttgtctcttt ctttttctct tgtttttttt cttcttcttt tttttttttc tctttttctt 5640 tttataggag gggagaa 5657 // ID Gypsy-122_MLP-LTR repbase; DNA; FNG; 296 BP. XX AC AECX01000903; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-122_MLP_; KW Gypsy-122_MLP-I; Gypsy-122_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-296 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000903; Positions 4077 4372. XX SQ Sequence 296 BP; 53 A; 52 C; 61 G; 130 T; 0 other; tgtaaggaag gcgtatgggg atagattaac aggatttatg ggattacatg gtgttacagt 60 gttagtagtg tagagttatt taggagcgcg tcttctcttc tttcttctct ttttctcttg 120 aggacttctt tacctccttc tcttttctct tagttctttt attgatagta cggtgttttc 180 aggtatgact tcccccagga ggacttcttt acctccttct cttttctctt agttctttta 240 ttgatagtac ggtgttttca ggcttcttct aggaatacta ttttggtgat tgttca 296 // ID Gypsy-110_MLP-LTR repbase; DNA; FNG; 199 BP. XX AC AECX01000612; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-110_MLP_; KW Gypsy-110_MLP-I; Gypsy-110_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-199 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000612; Positions 11200 11002. XX SQ Sequence 199 BP; 48 A; 65 C; 40 G; 46 T; 0 other; tgttatgatc ccaagtcacg gatcaagaga ggggatgtca caactcgtga cacttgggcg 60 ctgagccggc tcactataga cacgtgcgca cttgtacact gtttccctct ttcctcatgc 120 tacaatccat atcatacgca cgatcaccag cactctccct ttgccccaag ccgtgaaccc 180 ccaagactgg gtcctaaca 199 // ID Gypsy-24_MLP-I repbase; DNA; FNG; 5725 BP. XX AC AECX01000965; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_MLP_; KW Gypsy-24_MLP-LTR; Gypsy-24_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5725 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000965; Positions 210676 216400. XX CC Positions [4562-4879] - Integrase core CC 'GCTAA' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 485..4558 FT /product="Gypsy-24_MLP-I_1p" FT /translation="MESAAMSKGKGPESEYEEDKDSIIARLQAEVNTLRTD FT RDRIMALEASMSRLISSQQPTNSAGSGSNTATPSHHWFSRFRGSPGTPTPA FT GSRSSPTESKQTTQAMKPTAPFTVIVEEPNAPRLSQFNQSGYEASSATPMA FT ADYRPATPRARPEIDPHDELPANVMATQTTNAPTTTPREAPVDVKVRASDI FT KLSDAPKFTGPVEDPAALFNWRRLVEQFFKLKRLDDVEERLIILGTIVVEP FT RAGNWLRRMENDLISLSWNEIMHALATETLPTGWLYDTEKAIRQLKLKANE FT DFKTYANRARDLYSLIEVESSISIKNLAEYVVWGAPDVFQRWVEDRKLLKV FT GNFRWFEFIAQGSDIWLLLQSSNLLPRQNPTHGNANPQVSSWRSNHSGPPA FT ASNDQRTVEQKADYAWHFHQYLRHRGICSACRTQCGNPTCQGPIKGPYISL FT PPLSVFDPGPRPRRQPPIAANPTQSRPNVPAGAPTARPAGRPPTAVAVRSM FT SEQPALKHAPEMTQEEMRRYEEADKILIACMEEESSGCVEEKARTKSIILQ FT LLINGTRMRALIDSAAEANLMSETALKKAQIPRRRLVTPTEVSLAISSKDT FT PFVIKEFCFANVSSADPNLRFGSTFFKIAPLGETYDVILGTPFLEKYHLDV FT SLHNRSVTHVPSHMVLLEESVKKEIENEMTKAMVSCVIQNLERVQTNRDLT FT MREERVLSEFSELFPEELPAVKDEEEDDFIPPEGQNASSKITHKIVLTNPD FT AVINEKQYNYPRKYLAAWGKLVAQHLKAGRIRWSTSQYASPSMIIPKKDPT FT ALPRWVCDYRTLNKYTVKDRAPLPNVDEAVRLVSTGKIFSIVDQINSFFQT FT RMREEDIPLTAVKTPFGLFEWTVMAMGLTNGPATHQGRVEEALGDIIGNYC FT VVFIDDVVVYSETVEEHEVHLREVLRRLQEAKLYCSPKKSKLFRTSINCLG FT HEISGEGVCPDDAKVEKISKWKTPGNQKQLLKFLGTVQFLKKFIDGLSHYV FT GTVSPLTSTKLKNSPFQWGEKEDAAFENIKRIITTLPVLKQINYDSDDPLW FT LFTDASGHGLGAALFQGAKWDMSSPIAYESRTMSPAERNYPVHEQELLAVV FT NALQKWKLMLLGMKINVMSDHHSLTHLLTQRNLSRRQARWLETLSQFDLNF FT NYIKGDENTVADALSRIDEVAAIQVEAALDEETMADIKAGYATDPFCLKVD FT KTLPLREGTKWREGLMYIDGQLVIPDTNGLREKLIEKTHTALGHLGSLKTL FT TQLWNEFFWPEMTRTVNMHVASCDKCQRTKARTNLLLGHLQATEVPRHPME FT DISLDFIGPFPKFRGYDMILSCTCRLTGFVRAIPTNQTDTAEKTAQ" XX SQ Sequence 5725 BP; 1762 A; 1364 C; 1294 G; 1305 T; 0 other; cttttttaat ctacccaaat tcaaacattc cagtagtatt caaatcgaaa gtcaagcaaa 60 tctatttttt tctttctttc aaaatctgcg aagttttttt ttcaagacaa gcctgagata 120 ctcaacttag atcgaccact tgataggatt cgcactcccc gcgaacttta tcagcgatcc 180 agaaactctc atacgctcac aaagacaaag acaacttgcc accgaaagtc cacctgccgc 240 accagcaccg ccaagataca attataagcc tgacgcacct ttagacactc ctgaagccaa 300 ctacaccttg acagccagta 
cgaccgcgac accaatcgta acatcgacac ctgagacact 360 accagacccc tttgacacca ctatcactcc tgacgctaga atcgtgcgcc aatacttcaa 420 ctctacattg actgcggaga ccccgactct accaggtagt tttatcttca caccgcaacc 480 cgcgatggaa agcgcggcga tgtcgaaagg caaaggacca gagtccgaat acgaagagga 540 caaggatagc ataattgcac gacttcaagc tgaggttaat accttgagga cggaccggga 600 cagaatcatg gctctagaag cgtcgatgtc acgattgatc tcaagccaac agccaacgaa 660 ctcggcagga tcaggatcta ataccgcgac accatctcat cattggtttt cacggtttcg 720 tggcagtccg ggaacgccaa ctccagccgg tagtagaagt tcgcctactg aatcaaagca 780 gacgacacaa gcaatgaaac ctacggcacc cttcacagtc atcgtcgaag aacctaacgc 840 accacgattg agtcaattca accaatcagg gtacgaagct tctagtgcga cacccatggc 900 cgcggattat cgacctgcaa caccgcgggc aagaccggaa atcgatcctc acgacgaact 960 tcctgctaat gtaatggcaa cccagacgac taacgccccg accaccacac cacgagaggc 1020 acctgtagac gtgaaggtac gagctagtga tatcaagctg tcagacgcac ctaagttcac 1080 cggacccgtc gaagaccccg ctgccctttt taactggcga agattagtag aacaattttt 1140 taagctgaaa cgtttagacg acgtcgaaga aagattgatc attctgggaa caatcgtggt 1200 ggaaccgcga gcagggaact ggcttagacg tatggaaaac gacctcatct ccctatcatg 1260 gaacgaaatc atgcatgcac tggcaacgga aactcttcca actggatggc tttacgacac 1320 cgagaaagct atccgtcaac tgaaactcaa agcaaacgaa gatttcaaaa cgtacgccaa 1380 ccgagcccgc gatttgtact cgttgattga agtagagagt tcaatttcaa tcaaaaactt 1440 ggcagagtac gtcgtgtggg gagcaccgga cgttttccaa cgttgggttg aggacagaaa 1500 attattaaaa gttggcaatt ttagatggtt tgaattcata gcgcaaggtt cagacatttg 1560 gctgttatta cagtcaagca atttattacc tcgtcaaaat ccgacccatg ggaacgcaaa 1620 cccacaagtg tcttcgtggc gatcaaacca ctcaggacca ccggcagctt ctaacgatca 1680 aagaacagtg gagcagaaag cagactacgc ttggcacttt catcaatacc tacgccaccg 1740 gggaatttgc tctgcgtgcc gtacccaatg tgggaatccg acatgtcaag gtccaatcaa 1800 aggaccatac atatccttac ctcccttgag tgtcttcgat ccgggtccgc gaccaagacg 1860 acaacctcca attgccgcga atccgacaca aagcagaccg aacgtcccag ctggtgcacc 1920 aactgcgagg ccagcaggtc gaccaccaac tgcagtggct gtcagatcaa tgtccgaaca 1980 accagcactt aagcacgcgc ctgaaatgac tcaagaggaa 
atgagacgtt atgaagaggc 2040 agacaagatt ctaattgcat gcatggagga ggaaagctca gggtgcgtgg aagagaaagc 2100 ccgtactaag tcaatcatct tgcagttact aatcaatggg actcgcatgc gtgctttgat 2160 tgattcagca gctgaagcaa acctcatgtc ggaaacagct ctaaagaagg cgcagatacc 2220 gcgtaggaga ctggtgacac ccactgaggt gagcttggca atttcatcaa aggacactcc 2280 atttgtgatc aaggaattct gctttgctaa tgtcagctca gcagatccga accttcgatt 2340 tggctcgacg tttttcaaaa ttgcaccatt gggggagact tacgacgtaa tcctcggaac 2400 accatttctg gaaaagtacc atctggatgt ttcacttcac aaccgttccg tcacacatgt 2460 acctagtcat atggtattgc tagaagagtc tgtcaagaaa gaaattgaaa atgaaatgac 2520 aaaagcgatg gtttcatgtg ttatccaaaa tcttgaacgt gttcaaacta atcgtgactt 2580 gacaatgaga gaggaacgtg tgttgagtga attttcagag ttattccctg aagaactacc 2640 tgctgtgaaa gacgaagagg aagatgactt cattccgcct gaagggcaaa atgcttcctc 2700 caagattacc cacaagattg tgctgacaaa tccagacgca gtaataaatg aaaagcagta 2760 caattacccc aggaagtatt tggcagcatg ggggaagttg gttgctcaac atttgaaagc 2820 gggacgcatt cgttggtcaa ccagtcaata cgcatcacca tcgatgatca tcccaaagaa 2880 agaccccacg gccttaccac gatgggtctg cgattacagg actctgaaca agtatacagt 2940 aaaagataga gcacccttac caaacgtgga tgaagccgta agacttgtaa gcacggggaa 3000 gatcttctca attgtggatc aaataaattc ttttttccaa acaagaatga gggaagaaga 3060 cataccactg acagctgtta aaacaccgtt tggactcttc gaatggactg taatggcgat 3120 gggattgaca aatggaccgg ctacacacca aggtagagtg gaggaagcgc tgggagatat 3180 catagggaat tattgtgtgg tttttattga tgatgtagta gtatattcag agacagttga 3240 agaacatgag gtacatctta gggaggtatt gagaaggctt caagaggcta aattgtattg 3300 ctcaccaaaa aagagtaagc tattccggac tagcatcaat tgtcttggtc atgaaatcag 3360 tggtgaaggg gtgtgcccag atgacgctaa agtagagaaa atatcaaaat ggaaaacacc 3420 agggaatcag aagcaactgt tgaaattctt aggcacggtt caattcctca agaaatttat 3480 tgatggattg tctcattatg tagggacggt atcgccattg acaagcacaa aactcaagaa 3540 ttctccgttt cagtggggtg agaaagaaga cgccgcattc gaaaacatca aacgtataat 3600 caccacgtta ccggtattga agcagatcaa ctatgactcg gacgacccgc tctggttatt 3660 taccgacgcc agtggacatg gacttggcgc agcgctattt caaggtgcaa 
aatgggatat 3720 gtcgtcgcca atagcatacg agagccgcac aatgtcacct gcagaacgga attacccagt 3780 acacgaacag gaattactag cagtagtcaa cgcgttacaa aaatggaaac taatgttatt 3840 aggcatgaag attaacgtca tgtcagatca ccattcgttg acgcacctct taactcaacg 3900 caacctcagt cgtcggcaag cacgatggct ggagaccttg tcccaatttg atctcaattt 3960 caattacatc aaaggcgacg aaaacaccgt ggcagacgct ctatcaagaa tcgacgaagt 4020 agcagcgatt caagtagaag cagcgctcga tgaagagaca atggcggaca tcaaagccgg 4080 atacgcaaca gacccctttt gcctgaaggt cgacaaaaca ctcccattgc gtgaaggtac 4140 caaatggcga gaagggctga tgtacattga cggacaactg gtaattccag atacaaacgg 4200 cttacgggag aagctcattg aaaaaaccca cacagcatta ggtcatttag gaagcctcaa 4260 gaccttgact caactttgga atgaattctt ctggcctgag atgacgagga cggtcaacat 4320 gcatgtagca tcttgtgaca agtgccaacg aacaaaagca agaacgaatc ttctgttggg 4380 tcatcttcaa gcaacagaag taccaaggca tcccatggag gacatctcgc tcgacttcat 4440 aggacctttc cctaaatttc gagggtacga catgattctg tcttgtactt gtaggctaac 4500 ggggttcgta cgtgctatac caacaaacca aacagacacc gctgagaaaa cagcccaatg 4560 attgttcaat gcatggcttt caatttttgg agccccaaag accatgattg gggatcgaga 4620 caaggcttgg acttctcgat tctggcaaga actcaactca ctgatgcaag tggatgtgaa 4680 attgacgaca gcttaccatc cacaagcgga cgggcgaagc gaaatctcca acaagaccat 4740 cgtccagatc ctacgtcagc tggtcgaaga tcgtcatgga aagtggctag aagcgttacc 4800 agctgttgaa tacgccatca acagcgccat gaacgtctca acaggagtat cgccctttga 4860 atttgttttt ggacgaaaac cacgcttatt ccccattcct ggccagacag caaccgtggg 4920 accggatgtg gcgagctgga tcgagcaacg acaaagtaca tgggcccagt atagggacaa 4980 gctctggtct agccgaatta cccaagctgt acattacaac aattgccgga accagggtgc 5040 cattttgaat gcgggggact gggtttttga tcgacagcaa ggaccgccaa cagattgttg 5100 gagggaaagg caaaccaacc tcaaaactac gaccccggtt tgacggacca tacattatca 5160 ccgagagtct gaacgacggg cgaaacttcc aacttcgact cgacgaaggc gacaaatcac 5220 atcccatttt ccacgtctca aagcttaagc ggtaccgttg gagggaggag gagcgaggaa 5280 caggggacca aaagtaagtt cctcccaaat tgtatgcacc gccggactac tacgtcaaaa 5340 ttttgtcaaa acattacctc ggccacttgt gtgagcgcat cgcatacgtc cggttatcca 
5400 ttttcagatg caactcacga cgaaaagaca aaagaagact caagatcaag actcgacttc 5460 gcagtacaag atcaagcatt cataaattca taattcaatt caacataaag tttacaaatg 5520 acatcaagat atcaagatta cgatttcatg cctttactta aatttttttt cttttctgct 5580 caaatgtttc tgtttttttt tctttctttt cttgcgttgg gacaagttat ggtcaccttt 5640 tgaggggcga gttatgccac attttacaaa aaattccttt agatactagg ttgttaatca 5700 cgtttttttt ttagaagggg aggga 5725 // ID Gypsy-9_CCO-LTR repbase; DNA; FNG; 1104 BP. XX AC AACS02000011; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_CCO_; KW Gypsy-9_CCO-I; Gypsy-9_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-1104 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000011; Positions 166250 167353. 
XX SQ Sequence 1104 BP; 194 A; 349 C; 197 G; 364 T; 0 other; tgtcagggct cgtgcactta gcagaactct gatatacact cgcgccttcc ctagactcgg 60 cttagacttc gacacttcga cttcacatcc cgacgtctcg actttcgctt ccagccatac 120 tcgaaccctc gacttcccca acgtcttcag ccgcaccgcc acctagactc cggtcgtgtc 180 aatgtctact gatctagact tcaccggatc aggcaagacg cgcgactcgg ctccttccgt 240 gatctcggat cctccggaag tctagtggac tttaggcgcg cgcttggtat ttcgaagcct 300 tcgactctgt attgttcttt ggaagtatgt tcccttttta gacgtattcg ttctttgttg 360 ttgtccgaag ttccctcgta cgtcatatct cgactttcct gatttatcga tgttattatc 420 tttcttttct tcggcttatc tttcctcatc acatgttctt tgctcaagac ttgtcttcga 480 cctttctact tcatgtcttt gctctagact ttcttatcat gttgtgattg gttcgtagat 540 agcttcttcg actctcttgt gttgttcttg gattgattcg ccttcatgtc gtttgtattt 600 ctcattcata catcttagtc tcgccttcat gtttctccat cgtttctccg cactttctct 660 tgtcgaagat tctctagacg tgtcctctat aagtactgta tggttttgta gtagatacct 720 catccggaat ctcgacgagg actctccatc ctcccttctc catcgactag gatctccatc 780 ctcccttctg catcgactag gactcaatcc cctctcctcc atcgactagg actttcccct 840 catcgaatcc tctgtccgaa ttcctaagtc taacttaact ctctccacgt cttcgaccgc 900 tgagccttta gctccagtcg tctctctacg tcgcgctctc gccgcaaggc gtccgtccgg 960 tccgtcggcc tagccagcca taggaacctt gacgaagggt acacgttctt ataacgtgag 1020 accaaggcta acctccgacc ccgaagtctc ctctccatct cagacttcgc ctcactcctc 1080 ttcactcacg cccatctcct gaca 1104 // ID TY4 repbase; DNA; FNG; 6270 BP. XX AC S50671; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 20-SEP-2007 (Rel. 12.1, Last updated, Version 2) XX DE Gag homolog, TY4B=protease, integrase, reverse transcriptase, and DE RNase H domain containing protein {retrotransposon Ty4} DE [Saccharomyces cerevisiae=yeast, C836, Transposon, 6270 nt]. XX KW Copia; LTR Retrotransposon; Transposable Element; retrotransposon; KW TY4. XX NM TY4. XX OS Saccharomyces cerevisiae OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. 
XX RN [1] RP 1-6270 RA Stucka R., Schwarzlose C., Lochmuller H., Hacker U. RA and Feldmann H.; RT "Molecular analysis of the yeast Ty4 element: homology with Ty1, RT copia, and plant retrotransposons."; RL Gene 122(1), 119-128 (1992). XX DR GenBank; S50671; Positions 1 6270. XX SQ Sequence 6270 BP; 2509 A; 983 C; 1090 G; 1688 T; 0 other; tgttggaacg agagtaatta atagtgacat gagttgctat ggtaacaatc taatgcttac 60 atcgtatatt aatgtacaac tcgtatacgt ttaagtgtga ttgcgcctat tgcagaagga 120 atgttaaacg agaagctcag acaatactga agctgtgtta aagacctatt agttgaacat 180 gttatggtag gtacatatat gaggaatatg agtcgtcaca tcaatgtata gtaactaccg 240 gaatcactat tatattggtc ataattaata tgaccaatcg gcgtgtgttt tatatacctc 300 tcttatttag tataagaaga tcagtactca cttcttcatt aatactaatt tttaacctct 360 aattatcaac atggcgaccc cagtgagggg tgaaacaaga aatgttattg acgacaacat 420 ttctgcgcgg attcaatcga aagtcaaaac aaatgatact gtcagacaga cgccattaag 480 aaaagtttct attaaagatg aacaggtgag acaatatcaa agaaatttaa ataggtttaa 540 aaccatacta aatggtttaa aggcagaaga ggaaaaactt tctgaggctg atgatattca 600 gatgctagct gaaaaattat taaaactcgg agaaaccatt gacaaggttg agaataggat 660 tgtggatcta gttgaaaaga tacaattatt ggaaacaaac gagaacaata atatattaca 720 tgaacatata gatgctacag ggacttacta tttattcgat acgttaactt caaccaacaa 780 aagattctac cctaaggatt gtgtttttga ttataggact aataatgtcg agaacattcc 840 tattctctta aacaatttta aaaaattcat caagaaatat caatttgatg atgtctttga 900 aaatgatatc atagaaatcg atcctcgtga aaatgaaatc ttgtgcaaga taatcaaaga 960 aggactcggt gaaagtttag atatcatgaa cacaaataca actgacattt ttaggataat 1020 cgatggttaa aaaaacaaat atagaagttt gcatggtaga gatgtcagaa ttagagcctg 1080 ggaaaaggtt ttggttgata caacatgtag aaattccgca ttgttaatga ataaacttca 1140 aaagttggta ctaatggaaa aatggatttt ttctaaatgc tgccaagatt gtcctaatct 1200 aaaggattac ctacaagaag ctatcatggg aaccttacat gaatccttaa gaaattctgt 1260 gaaacaacgt ttgtacaaca ttccacatga cgtaggaatt gatcacgaag aatttctaat 1320 caatactgtt attgaaacag taattgattt gagcccaatt gcagacgatc aaatagaaaa 1380 tagctgcatg tattgcaaat ctgttttcca ttgctcaatt 
aactgcaaaa agaaaccaaa 1440 tagggaactt aggcctgact cgaccaattt ctcaaaaacc tattatctac aaggtgcaca 1500 gagacaacaa ccacttaagt ccagtgcaaa acgaacaaaa gtcttggaac aagacacaaa 1560 aaaggtcgaa caaagtgtac aacagcaaaa aactggtaat tattgatacc ggttccggcg 1620 taaacattac caatgacaaa accttactgc ataattacga agacagtaat cgcagtacac 1680 gattttttgg tattgggaaa aacagttcag tgtctgttaa agggtatggc tatataaaaa 1740 tcaagaatgg tcacaacaat actgacaata agtgtctatt aacttactat gtaccggaag 1800 aagaatccac tataatcagc tgttatgact tagccaagac aaccaaaatg gttttaagtc 1860 gaaaatatac cagattggga aacaaaatca taaaaattaa aaccaagata gttaatggtg 1920 tcattcacgt aaaaatgaac gagttaattg aacgcttcct ctccgatgat tcaaaaataa 1980 atgcaataaa acctacttct tctcctggat ttaaactaaa taaaaggtct attaccttgg 2040 aagatgctca taaaagaatg ggccatacag gaattcaaca aattgaaaat tccataaaac 2100 ataatcatta tgaagaatcc cttgacttaa tcaaagaacc aaatgaattt tggtgtcaaa 2160 cctgtaaaat ctctaaagcc acgaaacgaa atcattatac cgggtctatg aataatcata 2220 gtactgatca tgaaccaggc tcatcatggt gcatggatat atttggccct gtatcaagtt 2280 caaacgcgga cactaaaagg tacatgctta ttatggtgga taacaacacg agatattgca 2340 tgacctccac acacttcaat aagaatgctg aaactatttt agctcaagtt agaaagaata 2400 ttcagtacgt ggaaacacaa tttgacagga aagtcagaga aattaattca gacagaggta 2460 ctgaattcac aaatgatcag atagaagaat attttatttc aaaaggaata catcacatac 2520 ttacttctac acaagatcat gctgctaacg gaagagcaga aagatacata agaacaataa 2580 taactgatgc aacaacactc ctaagacaaa gtaacttaag agtaaaattt tgggaatacg 2640 cagtaacttc tgctaccaat ataagaaatt gcctggaaca caaaagtaca ggtaaactac 2700 cattgaaggc aatctcacgt caacctgtga cagtgagatt aatgtcattc ttaccatttg 2760 gcgaaaaagg aataatttgg aatcataatc acaaaaaatt gaaaccatct ggacttcctt 2820 ctataattct atgcaaagat ccaaatagtt atggatacaa attctttata ccatccaaaa 2880 ataaaattgt cacatctgat aattatacaa ttcccaacta tacaatggac ggtagagtaa 2940 gaaatactca gaatattaac aagagtcatc aattcagttc acataatgat gatgaagaag 3000 atcaaatcga aacggtcaca aacttatgtg aagctttgga aaactacgaa gatgataata 3060 aaccaattac tcgcctggaa gatttgttca cagaggaaga gttatctcaa 
atagactcaa 3120 acgcaaaata cccatctcct agtaataacc tagaagggga cttggattac gtattttctg 3180 atgttgagga atctggagat tatgacgttg aatctgaact ttcaacgaca aataattcaa 3240 tctcaactga taaaaacaaa attttgtcaa acaaggattt taattcagaa cttgcatcga 3300 ctgaaatatc catcagtgga atcgataaga aaggattaat aaatacaagt catattgatg 3360 aagataagta tgatgaaaaa gtacacagaa ttccatcgat tatacaagag aaactggtag 3420 gaagtaaaaa tactattaaa atcaatgacg aaaacaaaat ctccgacaga attcgtagta 3480 aaaacattgg gagtatttta aacactggac tcagtagatg tgtagatatc accgatgaat 3540 ctattactaa caaagatgag tcaatgcaca acgcaaaacc cgaactaatt caggagcagt 3600 taaaaaaaac aaatcatgaa acttcgtttc ctaaagaagg gagcattgga acaaatgtaa 3660 aattccgaaa tacaaacaat gagatttctt taaaaacagg cgatacgagt ttaccaataa 3720 aaactttaga aagcattaac aatcaccata gtaatgatta ttccacaaac aaagttgaaa 3780 agtttgagaa ggaaaatcat catccgcccc cgattgagga cattgtggat atgagtgatc 3840 aaactgatat ggaatcaaac tgtcaggatg gtaataactt aaaagaatta aaagtcaccg 3900 ataaaaatgt accaactgac aatggaacaa atgtgtcacc aaggttggaa caaaatattg 3960 aacgatctgg atcaccagta caaacagtta ataaaagtgc cttcttaaac aaagaattca 4020 gttctttgaa catgaaaaga aaacggaaaa gacacgataa aaacaatagt ctaacaagct 4080 atgaattaga aagagataag aagcgttcaa aaaagaatcg agtgaaatta attccagata 4140 atatggaaac agtttcagca ccaaaaattc gagccatata ttataatgaa gctatttcaa 4200 aaaatcctga cctcaaagaa aaacatgaat acaaacaggc atatcataaa gaattacaga 4260 atttaaaaga tatgaaggta tttgatgtcg atgtgaagta cagtagatca gaaattcctg 4320 ataatttaat agtacccacc aacacgatat tcacaaagaa aagaaatggg atttataagg 4380 ctaggatagt ctgcagaggt gatactcagt caccagacac ttacagtgta ataactacag 4440 aatctttaaa tcacaatcat attaagatat tcttaatgat gcaaacaaca gaaatatgtt 4500 tatggaccct ggatatcaat catgcattcc tatatgctaa attggaagaa gaaatataca 4560 tcccacatcc gctgatagga gatgtgtacg tcaagctaaa taaggcgtta tatggtctaa 4620 aacagagtcc taaagaatgg aatgatcatc taagacaata cttgaatgga attggactga 4680 aagataactc ttatactccg ggattatacc aaaccgagga taaaaatcta atgattgcag 4740 tctatgttga tgactgcgta attgcggcaa gcaatgaaca gagattggat gaattcataa 
4800 acaaattgaa aagtaatttt gaactgaaaa ttacaggaac attaatagac gatgtactcg 4860 atacagatat attaggaatg gatctagtat acaacaaaag acttggtact atcgatttaa 4920 cattaaaatc attcataaat agaatggata aaaaatacaa cgaggaattg aaaaagatta 4980 gaaaaagttc aattccgcat atgtcaactt ataaaataga tcctaagaaa gacgtactgc 5040 aaatgtcaga agaagagttt agacaaggtg ttctaaagct acaacaatta ctaggtgaac 5100 taaactatgt cagacacaaa tgcagatacg acattgaatt tgctgttaag aaagtggcta 5160 gactagtaaa ttacccacat gaaagagtct tttatatgat ttacaaaata atccagtact 5220 tggttcggta taaagatatt ggaatacact atgaccgaga ctgtaataaa gacaaaaagg 5280 ttattgctat aactgatgca tcagttggat cagaatatga tgctcaatca aggattggag 5340 ttatattatg gtacggtatg aatattttta atgtttattc taacaagagc acaaacagat 5400 gtgtatcatc aacagaagca gagcttcatg ccatttatga aggctatcga gactcagaaa 5460 cgttgaaggt aacattaaag gagctaggag aaggagacaa taatgacatt gtcatgatca 5520 ctgtgaaggt aacattaaag gagctaggag aaggagacaa taatgacatt gtcatgatca 5580 ctgactcaaa gccagccatt caaggattaa atcgcagcta tcaacaacca aaagagaaat 5640 tcacttggat aaaaactgaa ataataaaag aaaaattaaa gagaagtata actgttaaaa 5700 ttaccggcaa aggtaatatt gctgatttac taacaaacca gtatcagcat ctgattttaa 5760 aagatttata caagtattaa aaaataaaat aacatcacag gatattttgg cctcaacaga 5820 ctattgataa ttaattaatg aagttctaaa cacacaatga atatctgttg aagtacaata 5880 atatatcttt aagggagcat gttggaacga gagtaattaa tagtgacatg agttgctatg 5940 gtaacaatct aatgcttaca tcgtatatta atgtacaact cgtatacgtt taagtgtgat 6000 tgcgcctatt gcagaaggaa tgttaaacga gaagctcaga caatactgaa gctgtgttaa 6060 agacctatta gttgaacatg ttatggtagg tacatatatg aggaatatga gtcgtcacat 6120 caatgtatag taactaccgg aatcactatt atattggtca taattaatat gaccaatcgg 6180 cgtgtgtttt atatacctct cttatttagt ataagaagat cagtactcac ttcttcatta 6240 atactaattt ttaacctcta attatcaaca 6270 // ID Gypsy-56_MLP-LTR repbase; DNA; FNG; 154 BP. XX AC AECX01002490; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-56_MLP_; KW Gypsy-56_MLP-I; Gypsy-56_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-154 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002490; Positions 15730 15883. XX SQ Sequence 154 BP; 39 A; 36 C; 23 G; 56 T; 0 other; tgtaatgacc tcactaaagg gatatgctta tactgtacat ccacacgtct catattatcc 60 tgtgctcagt tgtacttctt aatcctcatg ttgcaatcta gttattaagg catctatacc 120 ttgttgcgct attctctcaa gtcaggttct aaca 154 // ID PCC2 repbase; DNA; FNG; 2657 BP. XX AC X98835; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Phanerochaete chrysosporium DNA transposon PCC2. XX KW DNA transposon; Transposable Element; PCC2. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RA Garcia B., Raices M., Delgado J. and Pettersson G.; RT "Characterization of a gene encoding a CDH from Phanerochaete RT chrysosporium."; RL Curr. Genet.. XX RN [2] RP 1-2657 RA Raices M.; RT "Direct submission."; RL Direct Submission to Genbank (21-JUN-1996). XX DR Genbank; X98835; Positions 1 2657. 
XX SQ Sequence 2657 BP; 639 A; 650 C; 799 G; 569 T; 0 other; ggtcatcagc aaaaaaaaga ccgctagatc tagctaggcc agcgatgacc aggctaccag 60 agcaggtcag cgctttcagg ctgggattgt agttgccagc tcagacatgc gatcgataag 120 cgtgaccatg cccatcagtc tgacaccata gaccaggctc gccatcatgg taagctagat 180 caggcatggc cagggctggt aatgcctggc aagcggtggc agttccgggc gcacttttgg 240 gttcgttcgg ctgctactcg ggctcatccc atatatttta gacaataaaa tcgatttatt 300 catctggata cagtacagta cacaagtggc tgggtgacaa atgcagaata gaacagacgg 360 ctggctgtac cacatacttg tgccacccac atgctagccg tgcatgacgc tgatcatgct 420 agacggatgc tatacgctag gcactgccga tagggttcgt tgaaagctga gcagcatata 480 ggatacagtg acgaacctgt caaaggaaaa gactgctgcg cctgcaagtg gacatcagta 540 caaaagggtc agaaagcaga gagctgatca cgtacatgcg cacaacccta catcccaggg 600 tatcctgtgt agcacagcag gtcagatgag taacaggtaa atgtcgacca ccgtctgcca 660 aatattctta cctttgctcg ctgtcggaag tctgcgcatg gtccgggatc agcagccgca 720 taaagacgtt ctcggacact gtgtgcagca cggtctgcgt ggctgacagc atgccgtgag 780 cgtgggagag gtgggaaaga tggggagggc atgacgggaa gtggggcgca ccttgagtca 840 ggtcatcgtg tgcccggcag ccagacgtcc tcgttctgcc atgggatgcg acgcccgaag 900 catgtgacgc cgacgacgaa cagggcgatc ttccggccaa ggcgtgtgcc aatgtccggg 960 gagagagatg gacgcaagtg agcgcacgat cgggaaacag gtgcaggcgc gcgcaccagg 1020 agggtgatgt cgacgtggat ctcggcacgc ttgccccaga cgtgagtgaa gaggtcttga 1080 agaatgaggc gcgtctcgtc ccacacgagc ttgttgttcc gctgcgcacg accgctcaga 1140 aatgcagggt cgacagggga cggcacgagg gcatacctca gagaaagcgg gcgcggcgat 1200 cttgcgggac ggcttccagt gctcgccctc gctagcgacg atgttgccgc cgaagaaggt 1260 gaggatcttg tactgttcta tcgatttcgg aaagcgcgaa ccatgtgtcg tatttcctgt 1320 aatgaagatc agtaggagta tgcagcgggg agggggaatt gaaccatgat gacgtctgcg 1380 tctgcgacat agaaattggc gttgctccca tgaacagcaa cctgatgggg ccttttggtc 1440 agcggacgag cggtctggca ggagataacg cgctcggcag ccacactgcg gggcatcggg 1500 tatccgttgt tctcgcagtt gtgtgcaatg gttgcaagcg tgtgcagggc gggtcgatat 1560 atgcgtggaa ttgatgttgt aatttcataa tttcatacac gcgtaccgta actctgcaga 1620 acaaagtcga gagcaccctg ttagcagggc 
aaacagcaat aggaatctga tggcgtacct 1680 tagagttgcc ccataaatgc gctgtcacct gcaccagccc gcgcaacaaa gaacaaggca 1740 ttcctcgttg cgcatgatgt ttatcctgaa gctcggcttc ggagttccac gcgatgggga 1800 ggcggcttga caaggcaggg gcaatgtagg ggtggggaac atggtggtat gaggcagaga 1860 gcagggcaga ggggacagag gatttatatg cgttggagat ggtctttact tctcgggcac 1920 ttcgacgtca tgtctccttc cgtcttgggc atgacgcgtc agtgtcacag gttttgaagt 1980 cctggtaagt aaaaacgtca caggtaagtc accatgtgac cgggtaagcc gtcaggttgt 2040 aagggttggc ttttctgcta tgcactagca tctcgagcaa gtaacatggg cagaacagcc 2100 gcaaaccttt gcttgggcga gatcgttgac cacccgatac acttaagtaa ggtgtgtttg 2160 tcacattgta gaggtatgca gatacatgct atgtagatgt cagtgaggcc gacgaggggc 2220 ctgctgtgtc agcgttcgca tgtggcacat gccactgata tatgcgattg cggaggctgc 2280 cgtgcgcatg ccacagaacc atgaaacgtt ttaagctgcc tttgcaaccc cgtttcacaa 2340 tgtaagtagt gcccgacgat tgagcgtgcc ccggacacgg tgccaaagtg aggagctggc 2400 ccatgtgcgc ttaggaagta agacgacggt gaactgtcga ttactgctca taaaaactca 2460 tacagatatt aaatcccatc catgacctta ccaggacgtt tcatgccatg cagcatcagg 2520 atgacaagcc atggccaggc aataccaggc gataccagac tggcgcgcct ttgccattgc 2580 aggtcagcgc tagtctgact ggcgccgcct tgccagccct accgtaccag atctgcgacc 2640 catttttttg ctgatgg 2657 // ID Mariner-8_AN repbase; DNA; FNG; 1902 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 09-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE DNA transposon. Mariner superfamily. Pogo subclade. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW mariner superfamily; Mariner-8_AN; Pogo subclade; transposase. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-1902 RA Kapitonov V.V. and Jurka J.; RT "Mariner-8_AN, a family of DNA transposons in the Aspergillus RT nidulans genome."; RL Repbase Reports 3(12), 216-216 (2003). XX DR [1] (Consensus) XX CC DNA transposon. Mariner superfamily. Pogo subclade. CC TA target site duplications; 52-bp TIR. 
CC The mariner/pogo transposase-coding region is corrupted by CC mutations. XX SQ Sequence 1902 BP; 598 A; 397 C; 410 G; 483 T; 14 other; acgtatttcc cgggtgatcc gtagttcccg ggtgatccgt gaaactccca aaatttcgaa 60 cgcgtcaacc agcctcagcc tcyatcttca aacaactatc atgccacgaa argcgcgtaa 120 aacgcgccag gaactgagag agcaagaggg taggattgaa trtgctataa rcaatttaaa 180 aaaanngaaa aatttgcama gttcaggaag ctagccgcat ttataatgtg cctccttcaa 240 ccctacgtga tcggatgaag ggccaccaat ctcagccaga actctgtaat cagaaccaca 300 ggctatctct gcttcaggag gaagctttaa tagcttggat agtatccctg gatatacatg 360 gtgccgcccc taggcccttc caggtacaag aaatggcgca aataatcttg gatgctgcaa 420 tatcaactcc atctctacct attagaaaaa actgggttac agagtttacc aagcagcgcc 480 cggagatcaa aactaggttt gtacaaaaga ttaattgcca aagagcactg tatgaagatc 540 ctaggattat tagccaatag ttcaatgagc tgcagaaaac taaagatcag tagggtattc 600 aggataagga tatctacaac tttgataaaa ctggctttgc tatgggactt attgcaacaa 660 taaaggtggt ttccagagca gaaatgcctg gtaaaccatg gcttatacag ccagggagtt 720 gggaataggt taccactatc aaatacatca atactagggg gtggccaata ccctccacca 780 ttatctttaa aggaaaagtc tatatagagg gatggtttga tgaatgcatg atccctggca 840 gttagaggat taaaataagc accaatagat agactataga tataattggc ctttgctggc 900 ttcaaaatat cttcattcca gcgacaaata agcgtacaag ggggggatat tggctgctta 960 ttctagatgg gcatggaagc yacctaatgc ctgagtttga ccrtacatgy aaggmgaata 1020 atawtatacc tctttrcatg cctgccyatt catctcacct cctacagcct ttggatgttg 1080 gttgtttcgg acctttaaag aaggcatatg gcaagcttat tgaggagaaa gggcgcctgg 1140 gatataacta tattaacaag ctagatttct tgaaagctta cctagcagct taccaggaag 1200 tctttactac agagaatatt taaagcagat tcaaagcaac tgggatactt ccttctaatc 1260 ccaaggcagt gcttgaaaag ttaaatatca gcctgggtac tccaaccccc cctccaagcc 1320 atgggggtgc ttcaatccct tcttctcagc ttggtacgcc ttatactgtg cgccatgtac 1380 atcaaaaagt ctcttcagtt aaaaagctgc ttgagagaag gtctaaaagt cccccaactc 1440 ccaccaaaaa agtcctagat gagttggtaa aagggtgtga gttggctatc tataatgcca 1500 acttactagc aagggaaaac tgtgatctct gctcagccat agagaataac agacagaaaa 1560 agagttgttc taaataccag atgactccta 
cagaaggtct ttcatttcag gaagccaggg 1620 acctaatttc gtcaagaaat aacgaaatag aggcaagatg ggggggtgct ggcggaggtg 1680 cgcctcaaac tttaggtata cctaaacata ctctaccaac atgttcagaa tataatattc 1740 agggccataa gaggaccagc tgtcctactc attatggaat ttagtttatt ttaatttgaa 1800 tcacttttgg ttgatgtaca gtacttcaaa gttgagcaag cataggtttt gaagggaaaa 1860 tcacggatca cccgggaact acggatcacc cgggaattac gt 1902 // ID LTRTCA1 repbase; DNA; FNG; 408 BP. XX AC M94628; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 25-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE Candida albicans retrotransposon Tca1 alpha long terminal DE repetitive element, 3' end. XX KW LTR Retrotransposon; Transposable Element; KW Alpha repetitive sequence; LTRTCA1; insertion site; KW Long terminal repeat (LTR); retrotransposon Tca1. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-408 RA Chen y.J. and Fonzi A.W.; RT "A temperature-regulated, retrotransposon-like element from RT Candida albicans."; RL J. Bacteriol 174(17), 5624-5632 (1992). XX DR GenBank; M94628; Positions 1 408. XX SQ Sequence 408 BP; 123 A; 71 C; 87 G; 127 T; 0 other; gaatcaggga gtgttcgcta tagagagatt tcctagccgg aatgcacgac aatcctgaga 60 cggaagtcga tcgtcgatgc ccatggtgcg tggtgaaaaa ttttcttaga aaatttgttc 120 tttccttcaa ctgcttttaa gaaagggagg ttcaagtggt ttaagtacga cggtcacaaa 180 gattgcggct tatgaggccc gaactgagtt gaaatacaaa atcaagatat aattatatac 240 cttacttgtc tatattgttt tataatacat tcttcagata tttaaatttc tgtgtatcat 300 cctataaaac agagatacat tcagtacatt tagtatactg agtgaactgg tacctgtgac 360 attcaagata actgtttcgc gcacgctggc agacgaacat tggtctgt 408 // ID Gypsy-33_MLP-LTR repbase; DNA; FNG; 189 BP. XX AC AECX01000195; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-33_MLP_; KW Gypsy-33_MLP-I; Gypsy-33_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-189 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000195; Positions 112220 112408. XX SQ Sequence 189 BP; 55 A; 55 C; 31 G; 48 T; 0 other; tgttatgatc tcatctacat aacaggcggt gaaaggccca ttggcacatg tcacagatta 60 gaccaacttg tactcgccac gttgtacact ttcctctttt ctccttatcc gacaatctgc 120 ataacaagat aagaacctgg tagagatccc aaaaccctcg tccccagaac ccttgagcca 180 gtcataaca 189 // ID Copia-5_LBS-I repbase; DNA; FNG; 5641 BP. XX AC ABFE01001868; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_LBS_; KW Copia-5_LBS-LTR; Copia-5_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-5641 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001868; Positions 23066 17426. XX CC Positions [2817-3314] - Integrase core CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 333..5585 FT /product="Copia-5_LBS-I_1p" FT /translation="MSTNKSTSLSVNDIPHFNGRNFQGWSDKMIGIFMIAK FT VYDVVKGDLVKPANSEKPPAIRPPAIITDDTDVITTTKLNALWTQFQVQMN FT QSNYLLGIWERKNSAWTDANSQAMGIFNRALDIGIWDQVKAKNAKETWDWL FT KEKYAKSSHLEIMEHFRFMKNQNIDLSDPNPQLAKFMHHYQALPLNMISSA FT MAAIILLSNLPLSNNPNVSNVYQNLLESSFKDDIATSLVLEDVMTQIRDVW FT QARFLTLASHDQPRKGTTYDKGKAPANQPSQQKQQNQVQRNTAIKGKGPTP FT QYSQQQSAESGSSQQKKKRPFRRGGKGKSANNHNHVAGASDTFNSDFILAS FT SAIHIADTPALPAPTAHTVASFSAAGPSIRHERTTGPWKNPGRFTPPYPHV FT QRSRDLMSRLGVKPTIQTSKEFEGIASLEEVTDGSFGAPTYQWGAYKLTDP FT IRAPTPEMAPLLERMQVDELSQEEGDTVSLGDEEEPIPQGSDLFGDNEDVG FT ASIPQGMKLDLGSGESDYGDDGPETLCGPIESAGAAGPSKHVWNMGIPQTK FT RWYDDDEAYDGPTENESVPFCSHKGDSLTDPSTASISVDIGRSLDKYLAYV FT NENHCKFVLDCSDCKGKNSRSKHIEWVVDSGASVHFSEDESDFSELNLFSE FT KERPFAQTANGAAAIHGTGTVFVKTWIDNPKSPEKKSTTISRLHPVFYMPG FT IGIRLLSMGLLLKGNMHVKGDERTLRFIDAQSGKVKIVAVTRIFTDTIYWI FT NSEILTGSELIAHKSMHLDDFDLWHRRLGHPGQQVFEKFESSTRNFPKSIE FT IPKNPPVCEGCAKGKMHSRSFPDNSARATRPFQRIHSDLKEFAVKSYHHHK FT YYISFLDDNSSHSWITLLKKKSDSKVATQQFIAMVKNQYKAIISEWMSDNG FT GEYVDQNYVKLLKDEGIMIQRSVPAQPQMNGRAERFNRTIDEKAESMRHQA FT CLSDNWWEFCVLHANYLYNRTPMRRLDWKTPKGYLEKEDPDISHLRILGCG FT AYVFIHKDLRANKLSPKSEMMTFLGYRDGRESNLMFMRPPNNVIFTAATAL FT FDERLFPRCAKKSKVPPVTQIQEPGEPEIVIETESVPDGNSGAPFAPPHDI FT FIPPRDNESTHDDVEPHSPPGSPPPQPRNAPGGAQRGNEPPRRSERERKGT FT RKDGNVYPPGTSTDTDRRRKLPDANSATSVPNSGQSSKPAVGDPGHAKLAT FT EGGALWEYLLSKAIPHGELPDPMNVRAWTSKDIGKLPAEDQREWQNAQFEE FT LEALKKRKVYELADLPPGRKAIKNRWVFDLKSDGRKKARLVAKGCSQVEGL FT DFDEIFSPVVRFESVRTILALAALEKWKVEGLDVKSAFLYGELDEELYMEQ FT PQGFKIPGKEHKVLRLLKAIYGLKQAARAWWHELDKSLKELGFTRLYADAG FT IFVAKHADGTMVIILAYVDDIIVTGPNTTLVASKKKLFMDKWECRDLGECK FT EFLRMRIDYRDGKTYLDQVPYLEKILKRFGMADAKAAQTPLPTGYKPEPFD FT GTATAALRSQYQSVIGSLLYLMLGTRPDLAFAVTQMAKFAQNPSEEHLNQT FT RHIMRYLVGTRKYALVYDGRSDGGLYAYCDSSYGDDRSDADRRRRSTQGYH FT FTLANACVKWHSKTQTLISTSSTMAEYIALSDCARDCAWYKILFSELGKPM FT PYVPIYGDSHGAIFNAQNPVTQKGIKHIEIRYHYIREQIEKGAVKVFAVPT FT SENVADMFTKNLGPTLFLKHRKELGIEFYSL" XX SQ Sequence 
5641 BP; 1550 A; 1447 C; 1324 G; 1320 T; 0 other; tgcctgttaa taggttatgg gccccgatgt cgaacgacgt gcgctacggc tgcacgagtt 60 ctaacatgcc ttagccttaa caacctttgg gcttgattgt gcgccctctg ggctgcacat 120 cgcctacaac agttcgctta attgctgaac gttgtcaaga ttgttcgcct cacggctgaa 180 cgtcaacgct gttgctgtct ggacattgac tctaaaggag tcgggttcat gtcaagcagg 240 atgcgcagtg cgccttgtag gctgcacgct ctttattaaa catccatgct gacaacaccc 300 aagtttttat cttacagata ttttctgaca atatgtccac caacaaatcc acttccctct 360 ctgtcaacga catccctcac ttcaatggtc gtaacttcca gggttggtct gacaaaatga 420 tcggcatctt tatgattgcc aaggtctatg acgtcgtcaa aggcgacctt gtcaaacccg 480 ccaactctga gaaaccacct gccatcagac cacctgccat catcactgac gatactgacg 540 tcattaccac cacgaagctc aacgctctgt ggactcagtt tcaggttcaa atgaaccagt 600 ctaactatct gcttggcatc tgggaacgga agaattccgc ctggaccgat gcgaattccc 660 aagctatggg aatttttaac cgagccctag atatcgggat ctgggatcaa gtcaaggcga 720 aaaacgccaa agagacttgg gactggctga aagagaagta cgccaagagt tcccatctgg 780 agatcatgga gcactttcgc ttcatgaaaa atcagaacat tgatctctcc gatcccaacc 840 ctcagctcgc caaatttatg catcactacc aggcacttcc cctcaacatg atcagttccg 900 caatggccgc catcattctc ctttccaacc tcccgctgtc caacaatcca aacgttagta 960 acgtttatca gaatctcctt gaatcttcct tcaaggatga cattgccacc tctctcgtcc 1020 ttgaggacgt tatgactcag atacgcgacg tctggcaagc tcgcttcctc actctcgcat 1080 ctcacgatca gccccgcaag gggactacgt acgataaagg aaaagctccg gctaatcagc 1140 cgtcgcagca gaagcagcag aatcaggtac agcggaatac cgctatcaag ggcaaaggcc 1200 ctacgcctca atattctcag cagcagagcg ccgaatccgg cagctcgcag cagaagaaaa 1260 agagaccctt ccgacgcgga ggaaaaggaa agagcgcaaa caaccacaat catgtggctg 1320 gagcctctga tactttcaac tccgatttca tcctcgcatc atcggcaatt catattgccg 1380 atacaccagc attaccagcc cctacggccc acaccgtagc ttctttcagt gctgccggtc 1440 cttcgatccg gcacgaaaga acaacagggc cgtggaagaa ccctgggagg tttactcctc 1500 cttaccccca tgtgcagcga tcccgcgatc ttatgagtcg ccttggggta aaacccacta 1560 tccaaacttc aaaagagttt gagggaatcg cttccttgga ggaagtaacc gacggatcct 1620 ttggagcccc cacatatcag tggggtgcat acaaacttac 
ggatccaatc agggctccca 1680 cgccagaaat ggcaccattg ctggagagaa tgcaggtaga cgaattatcg caggaagaag 1740 gcgatacggt atccctcgga gacgaggagg aaccgattcc tcagggatcg gatctatttg 1800 gagataacga agatgttggc gcctccatcc ctcaggggat gaagctggat ctcggtagcg 1860 gcgagtctga ctacggagat gacggtccgg agaccctttg tggtccgatt gaatcagctg 1920 gcgcagcagg accgtcgaag catgtttgga acatgggtat accccagacc aaacgatggt 1980 acgacgatga cgaggcctac gatggtccga ctgagaatga gtcagtcccc ttttgttcac 2040 ataaaggcga ttcactaact gatccctcta cagcttctat tagtgtagac ataggtagat 2100 cgttagataa gtatctagcc tatgtgaatg aaaaccattg caagttcgtt ctggattgtt 2160 cagactgcaa gggaaagaac tctaggtcga aacacatcga gtgggttgtt gattcaggag 2220 cctctgtcca cttctcagaa gacgagtctg atttctcgga gttaaacctc ttttccgaga 2280 aagaaagacc attcgcacaa actgcgaatg gtgccgctgc catacacggc acaggcacag 2340 tctttgtcaa aacgtggata gacaacccta agtccccaga aaaaaagtcg accactatat 2400 cccgcctaca tccggtattc tatatgccag gaatagggat ccgtcttcta tctatgggat 2460 tgcttctgaa aggcaacatg cacgtcaagg gagatgagcg aacccttcga ttcattgatg 2520 ctcaatctgg aaaggtgaaa atagtagccg ttaccagaat cttcaccgat acgatctact 2580 ggatcaattc ggagatcctc actggaagtg agttgatcgc gcacaagtcc atgcatctag 2640 atgatttcga tctctggcat cgacgacttg gacatcctgg tcaacaggtg tttgagaaat 2700 tcgaatcgag tacacggaat tttccgaaat cgattgagat ccctaaaaat ccgccggtgt 2760 gtgaaggctg tgcaaaagga aagatgcact cccgctcttt tcccgataat tctgcacgcg 2820 ccacacgtcc ctttcaacgg attcattctg atctcaagga gtttgcggtg aagtcttacc 2880 atcaccacaa gtattacata tcgttcctcg atgataactc ctctcactcc tggatcacac 2940 ttttgaaaaa gaaaagtgat tccaaagtgg caacccagca gtttatcgct atggttaaaa 3000 accagtacaa agcgatcatc agcgaatgga tgtccgataa cggaggagag tacgttgatc 3060 agaactatgt caaactcctt aaggatgaag ggatcatgat ccaacggagt gttccggctc 3120 agccgcaaat gaatggcaga gctgagcgct tcaatcgtac tatcgatgag aaagcggagt 3180 caatgcgcca tcaggcttgt ctatctgaca attggtggga gttctgcgta ttacatgcga 3240 actacctgta caaccgcacc ccaatgcggc gccttgattg gaaaactccg aagggctatc 3300 ttgaaaaaga agaccctgat atctcacacc tacgtatcct tggatgtggt 
gcttacgtct 3360 tcattcacaa agatctccga gccaacaagt taagtcccaa atcagaaatg atgacttttc 3420 ttggctatcg tgatggtcgt gaatcgaatt tgatgttcat gcgaccacca aacaacgtca 3480 tctttactgc tgccacggca ctgtttgatg aacgtttgtt tccaagatgt gcaaaaaaga 3540 gtaaagtgcc acctgtcaca caaattcagg aacccggaga acccgagatc gtgattgaga 3600 cagaaagcgt gcctgacggt aattccggtg cacctttcgc tccaccccat gatatattca 3660 ttccgccaag agacaacgag agcactcatg acgatgttga gcctcatagt cctcctggat 3720 cgccgccacc gcaacctcga aatgctccag ggggagcaca gcgtggtaat gagccgcctc 3780 gccgctctga gcgagaaagg aaaggaacca gaaaggacgg gaacgtatac ccgcctggca 3840 cctctacaga caccgacaga cgtcgtaaat tgcctgacgc aaacagcgca acgtccgttc 3900 ctaattcagg acagtctagc aagcctgctg taggtgaccc tggtcatgct aaattagcaa 3960 cggaaggggg agcattgtgg gaatatttgc tttccaaggc aattcctcac ggtgaattac 4020 cagatccgat gaatgtccgt gcctggactt cgaaggacat cggcaaatta ccagccgaag 4080 atcaaaggga atggcagaac gctcagtttg aggagctgga ggccctcaag aaacgcaaag 4140 tttacgagct tgctgaccta cctcctggcc gaaaggctat caaaaatcgc tgggtattcg 4200 atttaaagtc cgatggacga aaaaaggccc gcttagtcgc taaaggctgc tcacaagttg 4260 agggtctaga cttcgacgag attttctcac cggttgttcg gtttgagtcg gtccgcacca 4320 ttctcgcatt agcagcccta gagaaatgga aagtcgaagg ccttgatgtt aagtccgctt 4380 ttctctatgg agagctggac gaagaactct atatggagca accacaaggg ttcaaaattc 4440 ctgggaagga acacaaggtg cttcgattgc tgaaggctat ctatggtcta aagcaagccg 4500 cacgtgcttg gtggcatgaa ctagataaga gcctaaaaga gctcggcttt actcgcctat 4560 atgccgatgc tggcattttc gtcgcaaaac atgctgacgg aaccatggtc atcatactcg 4620 cgtatgtgga tgacattatt gtcaccggac caaacactac cctagtcgcg tccaagaaga 4680 aactctttat ggacaaatgg gaatgccgtg acttaggaga atgcaaggag ttcctacgta 4740 tgaggataga ttatcgagat gggaagacct accttgatca ggttccctac cttgagaaaa 4800 ttctcaagcg ctttggtatg gccgatgcca aggctgcaca aactccgctt cccaccggct 4860 ataaacctga accgtttgat ggaactgcta cggcagctct ccggtctcag taccaatcgg 4920 tgatcggatc cttgctatat ttaatgctcg gaacacgtcc cgacctagcc tttgctgtta 4980 cgcaaatggc aaagttcgct caaaatccat ctgaggaaca cctcaatcaa accaggcata 
5040 tcatgcgtta ccttgtcggt acgcgcaaat acgccttggt ctatgatgga aggagtgatg 5100 gagggctata cgcctattgt gactcatcat atggagatga cagatcagat gcagatcgca 5160 gacgtcgatc aacgcaggga tatcatttca ccctagcgaa tgcatgcgtc aagtggcact 5220 ctaagacaca gacgctgatc tccacgtcct ccaccatggc agagtacata gcactctctg 5280 actgtgcacg tgattgtgca tggtacaaga ttctattctc cgaactcgga aagcccatgc 5340 cttacgtccc tatctatggt gacagtcatg gagcgatctt caatgctcag aatcccgtaa 5400 cacaaaaagg gattaagcat atcgagatcc gctatcatta tatccgagaa cagatcgaga 5460 aaggtgccgt aaaagtattt gccgttccca catcagagaa cgttgcagat atgttcacca 5520 agaatctggg accaacgctc ttcttgaaac atagaaagga gcttggtata gagttttatt 5580 ctctatagtc ccgtacccac agacaagtaa gtgcgctagc gctgcactac ttgaggggga 5640 g 5641 // ID GYMAG2_LTR repbase; DNA; FNG; 193 BP. XX AC AACU01001656; XX DT 04-SEP-2005 (Rel. 10.09, Created) DT 04-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE GYMAG2: Gypsy-type LTR retroelement from Magnaporthe grisea (LTR DE portion). XX KW Gypsy; LTR Retrotransposon; Transposable Element; GYMAG2_LTR. XX OS Magnaporthe oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-193 RA Jurka J.; RT "GYMAG2: Gypsy-type LTR retrotransposon from the rice blast RT fungus Magnaporthe grisea."; RL Repbase Reports 5(9), 245-245 (2005). XX DR EMBL/GenBank/DDBJ; AACU01001656; Positions 6391 6583. XX CC LTRs differ by 1 bp indel. This appears to be a recent insertion. XX SQ Sequence 193 BP; 43 A; 59 C; 40 G; 51 T; 0 other; tgtgacattt ggaccatgag atcaccagac cctaaccctg accctggacc aacgacccac 60 cagggtgata cctttatcga cgtcgcgtca agctcagaac tttgtttgtt tcctttcctc 120 tcctcagagt agaatcttcg tcgataggcg ccaatagact agcttccgtg ctatgcttac 180 cctggccgtg aca 193 // ID PIF_Harbinger-1_PleOst repbase; DNA; FNG; 3005 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. 
XX KW Harbinger; DNA transposon; Transposable Element; KW PIF_Harbinger-1_PleOst. XX OS Pleurotus ostreatus OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Pleurotaceae; OC Pleurotus. XX RN [1] RP 1-3005 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 3005 BP; 841 A; 718 C; 709 G; 737 T; 0 other; aggggagctc aaagttggcg cataagaatc cgcgtacgca tatgcgcgca tacggccgag 60 gtgccgtcga ttatgtaatg gtatttgagt gtcacatatc cgagtagcgc ccacgtagcg 120 ctgaacgcag catatcactt ccgttgatct acactgatct actgcaatca tgccccgcgc 180 aaggatccac gacaaactca ggcatgcata tctcattcat atgatgatgc gcaacacgat 240 ctctccatcc gatgaccttg cagaggcaat ccaacggctt caacaagaca tggagatact 300 caatgccatc aaaaacactc gctaccttca cacacgaaca catgttccta agcagggtaa 360 cctccatgtt ctctgggagt atgcatccga ccctgctcag catgaccact tcactcaaat 420 ggtccgcatt tcaccaactt ctttctctgt acttctacat ttgattgaag accatctggt 480 tttccacaac aattcgaaca acgcacagaa gcccgttgaa gtgcagcttg ccactacact 540 ctttcgtatg ggtcggtacg gaaatgcagc atctctcgag gatatagccc gcattactgg 600 atcctcggag ggggatatcg tgaactgcac aaagcgttgc tttactgcca ttgagtcact 660 tcatgactta tttgttcgac cattgacgcc tgcagagaag gaggtggaga agcagtggat 720 tgatcagcat ctggggttca agggtttgtg gcgagagggc tgggtaatgt atgatggaac 780 cattattgtc ttgttcgaga aaccaggcta tgaaggagat gcctattaca ctcggaaagg 840 aaactatgga ttaaatgcac aggtatgaga tcatttcatc attccttcag ttattgctga 900 tatttccatt agattgggaa cattccttcc aatctgcgga ttgttgatta ctcccatggg 960 cataccggtt cagcccacga ctcgcttgct tttgagggga cagcagcaca caaatatccc 1020 aactggctct ttcaaggaga ggagttcgct tggggagatt ctgcatatac ggtgaattct 1080 cgaaccattt ctgttcacaa aaaaccggca tcagacgacc ccaacaaccg gttgttcgac 1140 agagcacttt caaatctccg agtccgctct gagcattgta ttggagcact caaaggacgg 1200 ttccagtgct tacggggcct tcgtgcaaaa atacacaacc cacatgaaca tgccttggca 1260 
tgtcaatgga taacaatttg catcatcttg cacaacatgg tcattgacgt cgagggcaca 1320 agtggagcaa acttctttgg cctatcacac tctgaggttg aagaagatca tgatactgga 1380 gatcaacttg aggatgccaa tgaacaggac agcggagttg taaagcgacg tcaattagtg 1440 gcagaaattg ttgcatttgc agcaacaaga gcacattaaa tagaatgtat tgtagttgaa 1500 acgaatgtag tgccagtggc aaaatgaatg taactactag ttggtatctg aattctcatc 1560 atcagaaaac attcccatct ttttcaaccg ccgcctatac tgtgccggag tcattacacc 1620 cagctctagc tctttcagca tcagcttgcg gatcttgaca ttccggtcag catcagcacg 1680 gtcagcaagc tgtttgattt gctgcctgag aagaaggtga attcacagtt ataaagtata 1740 gatcaaggaa aaaacataca tctgaacatc aataattgac tcttcaaaag agcgcttctt 1800 gggaacaacc tggatagagg ccttggcctt gttaagtgcc tcatcactaa gatgggacac 1860 cttgggacct ttcctcaact gagatggagt gggtggaggt ggagtatcat cgtctgcaga 1920 agcctgagat gatgagggag tgttggtagc gtcagctcca aagaaacgcc cagggggatg 1980 gacaacggga gctccaggtg caggaggatc atcatccaag ggaatgagag gtctaaacag 2040 gtcagcagtt ggggagaaca gttcccgctc catggcgagt agaaccggat caatggggat 2100 attgtcacgg ttcaaatcct ccgcaagagc accaggatga gggtcctgca tataaaccac 2160 cgaacgaccc cgtggtccta ttcctgtggt gacaacagga gggactacat tcgggcgtga 2220 agcccaaaga cgatggagag caggaaagta ggggagatcc ttgcaaattt gctctttata 2280 ctattttaga acggtaaaaa aatcaaaaaa gatcacgtac cccagatgtt acgagcctca 2340 gggatagtgt catggtcagg accctctggt gagatgtaaa atggcatata gacctcgata 2400 ggtccatctg caacctcctg agaggcatca tcaccagtca ggccatcccc agtttgcttg 2460 aggcgcttcg ccatgatgat gtacttctta cgaaggctaa acctctttta taagtatggg 2520 ccaaaaggaa ataaatgtaa catggacgca cgcttcaact ttacttttca cccggtctcc 2580 catagccttt gcatacagct tagactcctc agggaacagc ttcgcagcaa tggctttata 2640 aaccatgatc ttccgctccc ctgaggtgtt ctaatttaat caaatgctca gcaaaatgaa 2700 tgtgaaaagc gaagaaaagc acacctcatc tttctctttt ttcccaaaga gaaccttggc 2760 attgtgacgc tcctccattt ctccaatcag cgcccaagtc cgatcatcag tccaatcaat 2820 tggcgcgatg tccgacttcc gaactcgagg tggcatgcta agagacgaga ggggaaaaga 2880 agagacacga caacgcattt acgatgtgac gaaagtgtga catttgatta cataatcgac 2940 ggcacctcgg 
ccgtatgcgc gcatatgcgt acgcggattc ttatgcgcca actttgagct 3000 cccct 3005 // ID Copia-33_MLP-I repbase; DNA; FNG; 4585 BP. XX AC AECX01001335; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-33_MLP_; KW Copia-33_MLP-LTR; Copia-33_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4585 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001335; Positions 42422 47006. XX CC 'AGGCT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 44..4561 FT /product="Copia-33_MLP-I_1p" FT /translation="MSDEGITVPIDQLILASPAHSLSSSEQDFQSPSESES FT LLDQTFIQSLLPTMSTSIPPNAGTTTVTTINPSAFGSRRYQMASSMVNTLS FT KVTITEKLNDTNYLNWSSEILTGIKSLSFTSFLLSDKDEPTDDADLQLFNE FT VVRESLMTWMVSKLDQPNRNRFQPRITVETAIGSDFESAPSKLWILIRDHH FT SPRTSAHQMLLRRLLFNLAQKDSSLPSHMDEFNRLYTAYMAAGGKIGNVEL FT GQQLLMSLNSDWTKVAEDIADSDTFEYDDVVKALKVRITNRAMLHPALNLP FT SLSVPQANAASPMRRRQGFVQTRCSPTKCLSLTHSEADCIRNPRNINKYNA FT WVESKKARGEWVDRSPNSPSNRSSSSKSVTASASLVIPDPELPSLSSLQAA FT LERSEFSASASAVYIVDPKASIENEHFGIVDSGTTHHMFKSEKYFNKDTFT FT DLSSSNEQVGLAGGSSTLQIKGRGDVTICGPTGDLTTLTNCLFVPSLKQNL FT IAGGRLFFEGWITDRREDGSFVIQRNGVTALAGTIHPNSMLLQLSAVRTIP FT SEFSTASAITDKTTESLHLLHHKLGHPNFTYLKQMVKGDVITGLPLSLSKV FT VLPESLPCNSCDLSKAHRQPHSDTRTRALQPLDNIHIDLSGIMRTPALCRS FT HYFILFTDDHTSYRHITGLKSKEKDEVHVAIHTYLSLVERQCDRKVKCLTL FT DGGGEFLNDVLLPYCQREGIYLRVTAAYTPEENGVSERSMRTVVCKGRAML FT IEANLPIRFWMESVKSAVFLNNRTTTTTLPKNKTPYEMWYHRKPDVTHIKP FT FGCLTYVLIRKPDRESKYGPTSEQGVLVGHTEHNRNYRVFMLNTSKIEITH FT DASFREDVYPFTRLPAFDISHLTTNQEEVSLLELNPNPTPILTPNDDEENQ FT 
QLVVIEQQPAVIEPQAVIDPNPPPALPRRSERVSRPVDRFVPNSNLAFWED FT DCLISKGDVCAYAFAAESTVRLVSEPNSYKQAMKSPSRDEWIKACAKEMDN FT MSRKGVWKLVDRPKDAPVVGSRWHFKVKHNPDGSVRKHKSRIVAKGFTQTF FT GVDYEQTYAPTGKPASYRILVAIAAYFGWDIHSMDAVAAFLNSALKETVYM FT EQPEGYVLPGDEDKVCLLLQALYGLKQSAHEWNEEFRIKLIKAGFKQAAGD FT ECVYIRQRSKSDIIIFYLHVDDMAITGPISSIILFKQEVSNFWEMDDLGVA FT TCVVGIQTMRLSQHHYAIHQRAMTESLLMRFELSECKPVSTPVQGGLKLLK FT STPEEAANFATLNLPYRSGVGSLMYISQCTRPDIAYAVGVLSQHLDTPCQR FT HWDAFRHVLKYLRGTLHLGIHYHSQDNQLFRMQSSWNVPMTNVDSDWAGCK FT NSRRSTTGYLTTLFGGAISWRSRLQQTVALSSTEAEYRATTEAGQEVQWLR FT NLLRDVGFQWNGPVSINCDNLGAIDLSSNAVNHGRTKHIDIEHHWIREQVK FT QNKIILNYCKSEDMTADLLTKPLHPGPFWEHMKSVGLKKCP" XX SQ Sequence 4585 BP; 1335 A; 1074 C; 967 G; 1209 T; 0 other; tggtagcggg agtgtcaaat cgatccgaca aatcacttga tttatgagcg acgaaggaat 60 caccgtacca atagaccaat taatactagc ttcaccagct cactctttgt catcttccga 120 acaagatttt caatcgccat ctgaatcgga gtcacttctc gatcagacat ttatccaaag 180 tctcctgcca acgatgtcga cttcaattcc acctaacgcc ggtactacta ccgttactac 240 catcaaccca tcagctttcg gtagtagaag gtatcaaatg gcctcatcaa tggtcaatac 300 tttatcaaaa gtaacgatca ctgaaaagct aaatgacacc aactatctga attggtcttc 360 agaaattctg accggcatta aatctctatc attcacatcc tttttattat ccgataaaga 420 cgaaccgacc gatgatgccg atcttcaatt gtttaatgaa gttgtaagag agtcactcat 480 gacttggatg gtatcaaaac tagatcagcc taacagaaac cgatttcaac ctcgtattac 540 cgttgaaact gctattggtt ctgatttcga atcggctcca agcaaacttt ggattctaat 600 tcgagatcat cactcgccta gaacgtcagc tcaccagatg ttgttgcgac gactcctttt 660 taaccttgct caaaaggact cttcattacc cagtcacatg gatgaattta atcggttgta 720 cactgcttac atggctgcag gtggaaagat tggaaatgtt gaactcggac aacaattatt 780 aatgtcactc aattcagatt ggacgaaggt agcagaagat atcgctgatt ctgacacctt 840 tgaatacgat gacgtcgtga aagcacttaa agtcagaatt acaaatagag ctatgctcca 900 cccagctttg aaccttcctt ctctgtctgt cccccaggca aatgcagcca gtcctatgcg 960 aagacgccaa ggattcgtcc aaactcgatg ctcaccaacc aaatgtttat cattgacaca 1020 ttctgaagct gactgtatta gaaacccaag aaacataaat aaatacaatg cttgggtgga 1080 atcaaagaaa gccagaggag 
aatgggtaga tagatcacca aactcacctt ctaatcgatc 1140 atcttcatct aagagtgtca cagcgtcagc ctctctagtc atacctgatc cagaattacc 1200 gtccttatcg tcacttcaag ctgcactgga aagatctgaa ttttcagcaa gtgcaagcgc 1260 cgtatatata gttgatccta aagcatcaat tgaaaatgaa cattttggca ttgttgatag 1320 tggcactacg caccacatgt ttaagtcgga gaaatacttc aacaaggata cttttaccga 1380 cttatcatca tcaaacgaac aagtaggatt ggctggtggc tcatcgactc tccaaatcaa 1440 agggaggggt gatgttacta tttgcggtcc aactggtgac cttaccactc tcaccaattg 1500 cttatttgta ccttctctta agcaaaatct tatcgctgga ggacgactat tctttgaagg 1560 ttggataact gatcgacgag aagacggttc ctttgtgatt caacgtaatg gagtcacggc 1620 cttggcggga acaattcacc ctaattcaat gctactccaa ctcagtgctg tgcgtactat 1680 accatctgaa ttttcaactg cttcagctat aaccgacaaa acaactgaat cattacattt 1740 acttcaccac aaattaggcc acccgaattt tacttatcta aagcagatgg tgaaaggaga 1800 tgtcattacg ggtttacctt tatctctctc aaaagttgtt ttacctgaat ccctcccatg 1860 caattcatgt gacttgtcaa aagcccatcg tcaaccacac tctgatactc gaacaagagc 1920 tttgcaacct ttggataata ttcacattga tctaagtgga attatgcgaa ccccggcttt 1980 gtgcagaagc cattatttca ttcttttcac agatgatcac acctcttatc gccacatcac 2040 cggacttaag tcaaaagaga aagatgaagt ccatgtcgcc attcatacgt atctaagcct 2100 cgttgaacgt caatgtgaca gaaaggttaa atgtctaact cttgatggcg ggggtgaatt 2160 tttgaatgac gtccttttac cttactgtca acgtgaaggc atatacctca gagtgacagc 2220 agcatacacc ccagaagaga atggcgtgtc tgagcggtca atgaggactg ttgtatgcaa 2280 aggacgggca atgctaattg aagcaaattt acctattagg ttttggatgg agtcggtgaa 2340 gagtgcagtc ttcttaaata accgaaccac cactacaacc cttccgaaga ataaaacgcc 2400 ttatgaaatg tggtatcatc gaaaacctga tgttacccat atcaaaccct tcggttgtct 2460 gacttatgtt ttaattcgca agcctgaccg tgaaagcaag tacggaccta catcggagca 2520 aggcgtttta gtgggtcaca ctgaacacaa tcgaaattat cgcgtattca tgctcaacac 2580 atcgaaaatt gagattacgc atgacgcttc tttccgcgaa gatgtctatc ctttcactcg 2640 ccttcctgct tttgatatct ctcacctcac cactaatcaa gaagaagtgt ctttgcttga 2700 gcttaacccc aatccaactc ctattctcac tccgaacgac gacgaagaaa atcagcaact 2760 agtagtcatc gaacaacaac ctgcagtgat 
cgaacctcaa gctgttattg atccaaatcc 2820 cccacctgcc ctccctcgtc gctcggaaag agtcagcagg ccagttgatc ggtttgtgcc 2880 caactcaaac ttggcattct gggaggacga ctgcctgatc agcaagggcg atgtttgtgc 2940 ttacgctttc gcagccgagt cgaccgtccg attagtttca gagcctaaca gctataagca 3000 agccatgaag agtccaagtc gagatgaatg gatcaaagcc tgtgcaaagg aaatggacaa 3060 tatgtcgagg aagggggtgt ggaaactggt ggaccggcca aaagacgcac ctgtggtcgg 3120 atcacgatgg cacttcaagg tgaagcacaa ccctgacgga tctgtaagaa aacataaatc 3180 aagaattgtt gcgaagggtt tcacgcagac gtttggtgtt gattatgagc aaacgtatgc 3240 tcctactggt aagcctgcgt cttaccgtat tttggttgca atcgcagcct attttggttg 3300 ggacatccac tctatggacg ctgtagcagc gttcctcaac tcagctttaa aagagaccgt 3360 atatatggaa cagcccgaag gttacgtttt acctggggac gaggataagg tctgtttact 3420 ccttcaagcc ttatatggat tgaaacagtc ggcgcatgag tggaatgaag agttccgaat 3480 caaactcatc aaagcaggat tcaagcaggc tgcaggcgac gagtgcgttt acatccggca 3540 acggtctaag tctgacatca ttattttcta cctccacgtc gacgatatgg caatcaccgg 3600 tccaatctca agtatcatct tattcaagca agaagtcagc aacttttggg aaatggacga 3660 cctcggggtg gcaacatgcg ttgttggaat tcaaactatg cgcctatcac agcatcatta 3720 tgccatccac cagcgtgcga tgactgagtc gctgcttatg cgcttcgaac tatctgaatg 3780 caaaccggtt tctactcctg tgcagggagg cctaaaactt ctgaaatcta cgcctgaaga 3840 agcggccaac tttgcaacgt tgaacctgcc ttacagatca ggtgttggaa gcctaatgta 3900 catctcgcag tgtacaaggc cagatatcgc atatgcggta ggagtcctat ctcagcacct 3960 cgacacacct tgtcaacgcc actgggacgc attcagacac gtcctgaaat atctccgggg 4020 aacgctccat ctcggtatac attatcactc tcaagacaat caacttttta gaatgcagtc 4080 cagctggaac gtaccaatga caaatgtaga ttccgactgg gcagggtgta agaactcacg 4140 gcgttcaaca accggctatc tcacaacttt gttcggtggt gctatttctt ggcgctcacg 4200 tctacaacag acggtagctt tgtcgtctac tgaagcggaa taccgggcca caactgaggc 4260 cgggcaggag gtacaatggt tgcgaaattt attaagggat gtaggttttc aatggaacgg 4320 accggtgagc atcaactgtg acaacttagg agctattgat ctttcttcaa acgcggttaa 4380 tcatggaaga acgaagcata ttgatataga acatcattgg ataagggagc aagtgaaaca 4440 aaataaaatc attttaaatt actgcaagtc agaagatatg 
acggcggact tacttaccaa 4500 gccgttgcat ccaggcccgt tttgggagca tatgaaaagt gtaggtctca agaaatgtcc 4560 ttagcgtgtc ttgattgagg gggtg 4585 // ID Gypsy-94_MLP-LTR repbase; DNA; FNG; 170 BP. XX AC AECX01000333; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-94_MLP_; KW Gypsy-94_MLP-I; Gypsy-94_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-170 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000333; Positions 17373 17204. XX SQ Sequence 170 BP; 41 A; 43 C; 25 G; 61 T; 0 other; tgttatgagc cattagttat aaggctttac catgtatgct tatactgtat tgtactctag 60 ttatcctttc tctcgagtag aactacagtt cctcacgtag caatctatta ccttacctag 120 aggatctttt acgctcactc tcttgcatct cagcaagcca gctcctatca 170 // ID Gypsy-63_MLP-I repbase; DNA; FNG; 5587 BP. XX AC AECX01001331; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-63_MLP_; KW Gypsy-63_MLP-LTR; Gypsy-63_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5587 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001331; Positions 22487 16901. 
XX CC Positions [2770-3189] - Reverse transcriptase CC Positions [4432-4911] - Integrase core CC 'AAAGT' target site duplication CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS join(2185..3339,3343..5493) FT /product="Gypsy-63_MLP-I_1p" FT /translation="MDHVGGPERTARNTEMGVKLSSLSKPPPSKSKLSPQI FT SVPRIASQHASIPVKTLRTGSMLRHQYGQQTTTRPTEAMIDAAKTSWNLLA FT KLAVEASKGKEEQSATELVPNCYHDYLDMFEKSKSNILPPRRPYDFRVDLV FT PGATPQAGRIIPLSPKEAAVLDEMIEKGLSNGTLRRTTSPWAAPVLFTGKK FT DGNLRPCFDYRKINSLTIKNKYPLPLTMELIDSLLDADQFTSLDMRNGYNN FT LRVREGDEAKLAFICKQGQFEPLTMPFGPTGAPGFFQFFIQDILKSHIGKD FT VAAYQDDILIYTKPGVDHEKVVKEILDILKQQNVWLKPEKCKFFQKEISYL FT GLIISRNQIKMDETKVKAVKDWPAPKNLSEVLTFLGFSNFYRFIHHFSKIA FT RPLHELSQEGIKFEWTDERNTAFENLKQAFTTAPVLTIADPYKPFVLECDC FT SDFVLGAVLSQVSSIDNELHPVAFLSRSLIKAEQNYEIFDKELLAVISAFK FT EWRQYLEGNPNRLNVIVYTDHKNLQSLMTTKELTRRQARWAKILGSFDFEI FT RFRPGEDSTKPDALSRRPDLKPDDHEKLSFGRLLKPENLPNDAFIDSLDVF FT DSWVEEDTIWIDDLYLGSSSSEVWSDKKILTEIRRTTAQDHQLMEIVNICV FT EMPDSKLIKGYSVIDGVLYFCEKVVVPGVEEIKLQILRSRHDSLLAGHLGR FT MRTLMLIKRNFHWASMKAYVNKYVDGCQSCQRVKTRPVKPFGSLQPLPIPA FT GPWVDICYDLITDLPESNGLDSILTVVERFTKMAHFVACKKTMNSEELADL FT MLKEVWRLHGTPRTITSDRGNISISKLTKEMNNRLGIKTQSSTVYHPQTDG FT QSEITNKAVEQYLRHFTSYKQDNWTNLLPMAEFSYNNNLHVSIGMSPFRAD FT YGFDVSFTGTPTQDQCLPAIKERFSQLNDVHEELKLAMKEAQDVMTDKFNR FT KVQDSPLWQAGQEVWLRSKHISTTRPTAKFSNRWLGPYKVKQQISTNAYEL FT ILPKEMQEIHPVFHVNLLREFKKSEFQGQDNGPPPPVMIEGEEEYEVNEIL FT HKRKRRSKIQYLVSWKGYGPNHDSWEPENSLGNAQEMVIRTRRRK" XX SQ Sequence 5587 BP; 1912 A; 1126 C; 1201 G; 1348 T; 0 other; tattgcaaca tctacctatc taactgaact ggagcaacta aagtaacaag caagaaagat 60 tagaagaaag ttaaaagaag aacgaacaaa gattatataa gaagtttttt tgattcaaat 120 caaagttttc agatctagat aaagtttaaa agtttaatta tcagaaggtt attaatacaa 180 cttggattca cctgcaactc tgtcaatcta aacccgcact ctactactga ttcaactacc 240 accacgccta acttcaaaat cccacacgac tcagcctctt cagaggaagg agaagtgttt 300 accaaaactg tagaagacat gtctaaagtc aatttagaag aaatcatgaa acaactccag 360 gatctcaata ccagacttgc ggaggagact agtttaaggc 
aagcggtgga aagagaattg 420 aagcagatga aggaagagaa gcaacaggat tatccaatga caaaccctat gagtcaaccg 480 gaagttccat tattccaaac tgctgcgcca cctcagatcc cggtctacgt gcaacaaacc 540 caaccgcgcc ctcccaaagt tgccatgccc gataaatacg atggccctag aggtcaaaaa 600 gctgagattt ttatgaatca gttaggagtc tatatgcaat tgaactcaag ttcgtttgca 660 aacgagcaat ccaaggtagc gttcgcgttg tcatacacga ctggcaaggc catgtatggg 720 gtcaacactt aatggagcag attctggata gtgaaagagc gttgttggta acttggcaaa 780 agttcgtcga ttcatttaaa gtgacatttt atgatagtga acgagtatca aaagctgaga 840 aggaaattag agcgctaaag caaactaaga cggtagcaga ttattggata catttctctg 900 agctttcgtt aattgtgaaa tggtctgagg atatactcaa gtctcaattt gaacaggggt 960 taaagtccga agtaactatt cacatggtgc gcaatgaatt caacacggtg gaagaaatgg 1020 ccaaacttgc aattaaatta gacaacaaaa tccacaaacg tagcccggaa agtttgaaca 1080 cgatgccagc ctcagcaccg accccctcag cgactgcaat tgatcccgat gccatggatt 1140 gctcggccta caaagtcaat gtatcaacag aagagtacaa ttgagagggg cagctggagc 1200 atgttatgaa tgcgggaaga caggtcattt tataggaagt tgtccaagac aaagaggagc 1260 aaggagaggt tacttcaggg gatctaatta tcaaggaaga acaagttatt aaggaggatc 1320 aaataactca aggattagtg aattagaaac tcaattgaaa gcacatttag atgaaataga 1380 tgaaaagtta ggaagagggg ataaagagag gaaagaagac agtagagctg aaagttcaaa 1440 aaatggagaa actcagggtt gaaagttgtg ccttcctcaa gtgtgattaa tttaggatct 1500 gaagagaatg ttattgatag tcttcatgag atgaatgata cccgtattat tgacatcatc 1560 aaactctatg atcccaaatc tgacacaacc aaacttgccc gtgccctaat tgatagtggt 1620 gccactcatg aggctatcag caagaaattc atcaatgaga cacagttcaa gaccttacca 1680 ttagctcagc gccgaagtgt tacaggcttc agcggtcatg tgtccatagt cactcacact 1740 ggagactact gtgtgaatga caatcaaact gagacagcct tcttcattac ggatctacgt 1800 gataagtacg atgtaatcct aggaatgccg tggatacgac aaaaccatca actagtgaac 1860 tgggctcaag gatgcataca agaacctatg gatcccaaaa ttgcaactgc ttcagcagtt 1920 tcgtccgtgc cgacaacagc cttgatggac cacgttagga ggccttcagg gcatgctagg 1980 acaagtgacg agggggtgtg agtttttgat aactcactaa cacccccgca atgtgagttc 2040 aattttacaa agactcaccc tattgaagaa tcagctggca atcatcttcc ccttctagaa 
2100 ttctctgatg aaaacaggac tacgatcatc gatgcaaagg actccttact tggcaaatct 2160 ctattgaatc cgaaaaccac cttgatggac cacgttggag ggcctgaaag gacagctagg 2220 aacacagaga tgggggtaaa gcttagtagc ttgtcaaaac ccccgccaag taagtcaaaa 2280 ctttcaccac aaatttctgt acctagaatt gctagccagc atgcttctat tcctgtgaaa 2340 acactacgaa caggctcaat gttacgacat caatatggcc agcaaaccac gacaagaccg 2400 acagaagcaa tgatagatgc tgcaaaaact tcatggaact tattggctaa gctagcggtt 2460 gaagcgtcaa agggaaaaga agaacaaagt gcgactgaat tagtacccaa ctgctatcat 2520 gattatcttg acatgtttga aaaatcaaaa tcaaatatac tacctccaag acggccttat 2580 gacttcagag ttgatttggt acccggtgcc actcctcaag ctggacgaat tataccacta 2640 tcaccaaaag aagcagcagt attagatgaa atgatagaaa aaggattgtc aaatggtact 2700 ctacggcgaa caacttcacc atgggccgct ccagtcttgt tcaccggaaa gaaggatggc 2760 aatcttcgac cctgctttga ttaccggaaa ataaactcac tcactatcaa aaacaaatat 2820 ccgctcccac tgactatgga gttaatagat agtttattag atgcagacca attcacgagc 2880 ttagacatga ggaatggtta caacaactta cgtgtgcgtg aaggggatga ggcgaaatta 2940 gccttcattt gcaaacaggg gcaattcgag ccattgacaa tgccctttgg accaactgga 3000 gcaccgggct tctttcaatt ttttattcag gacattctca aaagccacat tgggaaggac 3060 gtagcagctt atcaagatga tattttaatc tatacaaaac ctggagtaga tcatgaaaaa 3120 gttgtaaaag aaatattgga tatcctgaaa cagcaaaatg tatggctcaa gcctgagaaa 3180 tgcaagttct ttcagaagga aattagctac ctgggtctga taatctcacg caatcagatc 3240 aagatggatg aaactaaagt gaaagcagtc aaggactggc cggcaccaaa aaatttgtct 3300 gaagtattga cgtttttggg tttctctaac ttctatcgtt gattcatcca ccacttttcc 3360 aaaattgcca gacctttaca tgaactatcg caggaaggca tcaaatttga atggactgac 3420 gaaagaaaca ctgcatttga aaatctcaaa caagccttta cgacagcacc ggtattgaca 3480 atagcagatc cgtataaacc ttttgtactt gaatgtgact gttctgactt tgtgttgggc 3540 gcagtattgt ctcaggtgtc ttctatagat aacgagttgc accccgtggc ctttctatca 3600 cgctcattga tcaaggccga acaaaattat gaaatatttg acaaggagct actagcagtg 3660 atttcagcat tcaaagagtg gcgtcaatat ttagaaggaa acccgaatag attaaatgta 3720 atagtttata ctgaccacaa aaacctgcaa tcgctaatga caacaaaaga actgacaaga 3780 
agacaggcca gatgggccaa aattcttggt agcttcgact ttgagataag attcagacca 3840 ggcgaagatt caacaaaacc cgatgcacta tcaagacgac ccgacttgaa acctgatgat 3900 catgaaaaac tctcctttgg acgtcttctc aaacctgaaa acctaccaaa tgatgcattt 3960 attgactcac ttgacgtttt tgattcgtgg gtagaggaag ataccatatg gatagatgac 4020 ttatacttag gcagttcaag cagtgaagtt tggagcgaca agaagatatt aactgaaatt 4080 agacgaacta cagctcaaga ccaccaactg atggaaattg tgaacatatg tgtagaaatg 4140 cccgattcga aactgatcaa gggttactca gtcattgacg gtgtcttata cttttgtgaa 4200 aaggtagtag tacccggcgt agaagaaatc aaattgcaga tattacggtc aaggcacgac 4260 agcctcttag ctggtcacct ggggagaatg aggacgttaa tgctgattaa acggaatttt 4320 cactgggcat ctatgaaggc atacgttaat aaatacgtag acgggtgcca atcatgccag 4380 agagtgaaaa cgaggccggt gaagcccttt gggagtctcc aaccccttcc aatacctgct 4440 ggaccatggg tagatatatg ttacgacctc atcactgacc tacctgaatc caacgggtta 4500 gacagcattt taacggttgt tgaaagattc accaaaatgg ctcattttgt ggcttgtaag 4560 aaaacaatga actcagaaga gttagcagat ctgatgttga aggaggtatg gagattacat 4620 ggaaccccga gaacaatcac ctcagataga ggtaacatat ctatttcaaa gctaacaaag 4680 gaaatgaaca atcgactagg catcaaaact caatcctcaa ctgtgtatca tcctcagaca 4740 gacgggcagt ccgaaatcac aaacaaagct gttgagcagt acttacgaca cttcacttca 4800 tacaaacaag acaattggac taacctgctt ccaatggcag agttttctta taacaacaac 4860 ttgcatgtat caattggcat gtcaccgttc cgtgctgact atggattcga cgtaagcttt 4920 acaggaaccc cgactcaaga ccaatgttta ccagctatca aagaaagatt tagtcaatta 4980 aatgatgtcc acgaagaact caaactagca atgaaagaag ctcaagatgt catgacagac 5040 aaattcaaca gaaaagttca ggattccccg ttatggcaag caggacagga ggtgtggctt 5100 agaagcaagc atatctcaac aacgagaccc accgctaagt tttccaatag atggctagga 5160 ccttacaaag ttaaacaaca gatctcaact aatgcttatg aattaatatt accaaaagaa 5220 atgcaggaaa ttcacccagt gttccatgtc aacctgctac gggaatttaa gaagagcgaa 5280 tttcaaggcc aagataatgg accaccacca ccggttatga ttgaaggtga agaggaatat 5340 gaagtcaatg aaatcttgca caagaggaaa agaagaagta aaattcaata tttagttagt 5400 tggaaaggtt atggtccaaa tcacgactcg tgggaaccag aaaattcatt aggaaatgcg 5460 caggagatgg 
taatcaggac acggagaagg aagtgagggt aaagcttttc ccacagggat 5520 tttaatgcta acccgtggaa agatatctaa cccatcaaag ggggttgaga tataagaggg 5580 ggagtgg 5587 // ID Gypsy-67_MLP-LTR repbase; DNA; FNG; 361 BP. XX AC AECX01001283; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-67_MLP_; KW Gypsy-67_MLP-I; Gypsy-67_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-361 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001283; Positions 22940 22580. XX SQ Sequence 361 BP; 102 A; 69 C; 59 G; 131 T; 0 other; tgtcatgaac catccgcagc gtggtagtct gtttcacaac tgttaaaaga atagggctaa 60 tattttaata gctaagttga aagaacacga cgtatgtata agatttgtaa tcaggaacaa 120 ggaacgcgaa taatggttgt ttattcaatt ttaggttttc atttctcata tatatacttt 180 atgttttcta caactttgtt tgctcttttt cttatcggaa taatcattta tcttaccttt 240 ctcttcgttt gacgtctccg ttctgagaca aaaccgttcc cttataagaa ttgcttcgtg 300 ccgtaccaaa agactgctcc aatctgagtg tctaagtttt actacatccc agagcattac 360 a 361 // ID LTR-3_AN repbase; DNA; FNG; 371 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 09-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Long terminal repeat of a LTR retrotransposon - a consensus DE sequence. XX KW LTR Retrotransposon; Transposable Element; LTR-3_AN; LTR-3a_AN; KW solo LTR. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-371 RA Kapitonov V.V. 
and Jurka J.; RT "LTR-3_AN, a family of solo long terminal repeats in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(12), 208-208 (2003). XX DR [1] (Consensus) XX CC LTR retrotransposon. Solo LTR. CC 5-bp TSD. 99% identity to the consensus. XX SQ Sequence 371 BP; 110 A; 85 C; 89 G; 87 T; 0 other; tgttaaggat ctactttgtc agggcagatc agcctgtcaa ccaggtaagg atgccaatgc 60 cacgtaaggc caacaacaag gaaacgaata gtaatgtatt gcaagactaa accttgggga 120 tccttcagct gggggatctc ccgtagtcct aaatacattt gtggaaagac atggatgtaa 180 cccaacagta agaccatgac gagcgataaa gtcataccaa actttatctg taaacggcgg 240 ccaagtccgg tggtttcgat gatgcaacat gggaaaggat ttccggatct ctaacgccta 300 tcttagagct taatcgatcg tttgcctgag gctcccaatc cttgatatcg gggcccagca 360 gatccgtaac a 371 // ID Gypsy-4-I_AF repbase; DNA; FNG; 6940 BP. XX AC . XX DT 03-MAR-2006 (Rel. 11.03, Created) DT 12-APR-2006 (Rel. 11.03, Last updated, Version 1) XX DE Internal portion of the Gypsy-4_AF LTR retrotransposon. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-4_AF; KW Gypsy-4-LTR_AF; Gypsy-4-I_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-6940 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-6940 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-4_AF, a family of gypsy LTR retrotransposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(3), 123-123 (2006). XX DR [2] (Consensus) XX CC This is an internal portion of the Gypsy-4_AF LTR CC retrotransposon. One copy of Gypsy-4-I_AF is flanked by identical CC LTRs (contig 1.12). 
Its internal sequence, reported here, encodes CC a well preserved polyprotein composed of reverse transcriptase, CC endonuclease and chromodomain (only a few stop codons). There are CC 4 other copies of Gypsy-4-I_AF present in the genome that are CC 77-84% identical to Gypsy-4-I_AF. Their ORFs encoding the CC corresponding polyproteins are severely damaged by many CC stop-codons due to transitions. The transition/transversion ratio CC is amazingly high, most likely because of RIP (quick mutations of CC CpG to CpA and TpG, CpA to TpA, and TpG to TpA). For example, CC Gypsy-4-I_AF and its cosmid1.16 copy are 77% identical. Among the CC corresponding 1582 mismatches, 1575 are transitions (only 2 CC gaps). The corresponding transition/transversion ratio is 225!. XX SQ Sequence 6940 BP; 1778 A; 2006 C; 1501 G; 1655 T; 0 other; acctggtagc atgtccatgt tcgttccaga gggtattcct cttgggttta catcagctct 60 ttagttagca tgtttaatca ttactgactg ccgctgaacc ttggggattc cgaggcctat 120 tagcgattac ccaatccctc cgactgaccg atcaacattg tgggacccgt catccaattg 180 cgtcattccc accccgatta cagcagctct agggcatagc cctaaccgca ctgttgctta 240 ccaagccctc caccctacca cagcctgtcc aacccctcaa cctccactct tctcctgcta 300 ttccgtactc caccaaccat catcttgtct taccttcatt ggattaaata ccctccatct 360 cgcgtcccac cgtttaccgc tggtcctttg accaatgact ggtcgggccc tgcgccttgc 420 tgagcttgga catcagctta gtgttttcca tgatcagtcc gaaagcctat tcgctcagca 480 gttcgacaca gagacaacct gtcttgcctg atatgtccag ttcactgaac cccttcctgc 540 atcatcgaat ctccatctat acccacctga ttggaacatt gaccgctcta cactaaaccg 600 agccacccgg gagttcctac gtttagcccg tccgttactc aagacagggg aataccagcg 660 ctggtcaagc gtgcactgca ataccgacga accacaaccc gaacgtccat tgggtagaac 720 gcagtctaga tcccgttctg tttccacatc ctctaacaaa cagtctaact tgcaacccga 780 acacgagctg acgactgttg aacccgaacc taccatcgaa tgtacaccgt tcgatcgttc 840 cgcggcaact ccacagggac ctcccgctga actagatgat ccacccacag tccgacctat 900 acatcgttcc cgctcgcggt ctgcttcgct tggccgaacc gtaaggcagc tgttccctca 960 agaagagccg cctatcgttg aaccgcaagc gcctgtcatt 
gaaccatccc gtttaccgcg 1020 aacctgtcct acccctctcg tcattccacg aaaccgattg ttcgacatga gtgcccaaca 1080 gagcagttcg aaccgaacat cagttaaccc cagtccgtca gtacaggacc taattaatct 1140 aatccaggca aaccaacaga ctttgaaccg tcagctggat gctaaccaag aagctatggc 1200 aaggcggttt gagcaatttg aggcctcagt taacgaacgg attgatgctg ctaatgtgaa 1260 tcaacgtcat gagcaacctc aacagtctca gcaacagcac cttccgccgc ctaacggacc 1320 acagctgaac aacccaccgg cgagtaacct tctattaccc actatgacaa atggacactg 1380 gaaaaccgaa gagcttggtt acttctggcc tgatatgcca gaagatgcta gcatcaagca 1440 gtacaacaca agcctgttct ataagaatgt gaatgtgttt accgaccgtg tgaaagatgt 1500 aattaattac aaaggtgaac cccttgtcaa agcaaactta caagcgagtc ttcgaggcgc 1560 cgcgttggac tggttcacca acgaactcac tgagttggaa aaacggtccc ttcgtgctct 1620 tctgttagac caaggatggt tacctgagct ggtaaaacgt tttaggccac gtgctgctga 1680 tgctctggtc aagttacaat ccctttctta tggatacaac gatgttagga atgacaggac 1740 ccctcgcgcc tttgcacaag atgttattcg tcatgcccgc gctgcccaag ttaatgatat 1800 attcaaccag atcacaatgg tttaaacccg gcttgagcca tatctccggc gtgacattcc 1860 cgaacctacc cctactacca cattgaccca gttcctcgag cagttggagt cccgctccgg 1920 catctggaag gacattgcca atggctacca gaaggaccgc aacaagggca cccagcctta 1980 tggtaaaccc aagcaccagc aatcccaaag gacccagggc aaatccccac aggaacgtca 2040 agctgaccaa cccagaactg accgaccgta ctatccatcg aacggttacg ggtatggccg 2100 atcctactat ccacaatacc aataccaacc acgtctgaat gcgtccccat tcgcccagcg 2160 aatgcagcag taccagtacg gtcagcagcc tcccgcgtcg aaccagaatc cctcgtacca 2220 aagtccggcg caaccgcagg cgaccgctca gccgcaagga aattggcgaa ctaattaacc 2280 tctaaatagg accgctgcga actacaaccg tcctatgtgg aatcacaacg gtaggcagtc 2340 atggaaccag ggtccttctc gcgttggccc cccaagagag cgagcctatg ctgctaacgt 2400 acatgtcgaa ccgactgtcg aacatccatc cccttcacca cggtacgatg aaccggaata 2460 ttcaccatat gttgaatcaa acgaacggta tgaatcaaac gaaccgtacg accagcacga 2520 ccagcccgat gatgaggacc cgggatatca agcattcctc gatcatccgg cgttcgaacg 2580 cggagatttc cctgaatcta aggatgaacc atgcgcattt gcagtcgatg ttggacctgt 2640 ggtcacgacc tgcaatggat gtggggctac attcccgtca cggaatcgtc 
tgcataccca 2700 tctttcggaa tatccacgca cacaggaacc aatccctgtt acccctacta ctaatgaccc 2760 aactgctgtc cgtatcattg aatctactca caagccaacc gggctcgttg gaatccgttc 2820 ctggcgttgc gctaccgcga agatcggaat taacagcaca tcagaatacc atgagatatg 2880 cctcgataca ggctgctcct ctaccattgg aaatacagag ttcatcgaag cattacctag 2940 cgtaaccatc accgagctga cggacggcat cacggtttca ggcatcggat cccgtcatcg 3000 atcaaaccgt tctgctatgc taacactatg gttccctgga ttgatgaatg atggtggttc 3060 tgcgaacgca ttcgctaaga ttaccatccg ttgccatttg gtcgatggcc tgaagcctaa 3120 gctacttatc ggaaccgatg tcatctgcgg tgaaggattc atgcttaatt ttgaacgtgg 3180 cattgcaaca attggttctt gcagtcgatt aaccttccca atcatcgcgc aagctaaacc 3240 gcgtcggatc actcgtgcag tagtatatgc aaagcaaagg tctcttctac cagcatattc 3300 agtcagtaaa ctggcagtac gtttgaagac tgatttaccg gctaaccgcg acttcatctt 3360 tgacccgctg gacggatcaa ccttgaacgg agccaccata tacgcgcata tggttgacca 3420 tgtgttctcc ttcatcgaag tccgcaacga aacacccact ccaattacgg tcactcgtca 3480 cgctcgcata gggaccatct ctgaagccga cttcgtcact gcctaccaag tcagcgagga 3540 cgctattcca ttagctaagc tgttggagtc cgaaccattg gaattcgagt ctaccaagtt 3600 gagtcaccct tcctctagat accgcgaaca cgtcgaacgt caactcgaac gtgcttacca 3660 actgacggct gaacccgaat ccccactgac cgtttcgccc gaccatgaca accagacagt 3720 cttacctaat ggtatcactg tatatggcaa aggccccgaa actgaccgac tcgccgaagt 3780 acttataagc ttcaatgtct gaggcgacga tggctccaca gcccgtattc ccgaggaaga 3840 atggatggag gttcccttga aggaaggatg ggaaaaccgt cttcctaaac cacatgtcta 3900 tcgcgtcagt cccaaagatc gtgaatgcat tgatcgcacg tttgaccctc tacgtgaagc 3960 tggcaaactt agtcctgcta ccggtcatac cccttcggcg tatccagtct tcgttgtatg 4020 gaaaaccgtc actaatcaga acagccaaac caaagaaaag ggacgtgtgg tcattgactt 4080 acgtggcgtt aataaggaag tcgttccaga tctataccct atcccgatac aagaagacat 4140 cattaatatg gttcgtggtt gccgttacat tacggtcatc aatgcctgtc ggttcttcta 4200 ccagtggccg gtaaagcgat cgcatcgaaa ccgtcttgct gtcgtcagcc atcgcggcca 4260 ggagatattc aatgttgcaa ttatgggatt tatcaattct gttccatatg tccaacgaca 4320 aatggatcga ctcctgaatg accttgagtt cgccagaacc tacgtcgatg atatcatcat 
4380 tgcctctatg acattcgacg agcatctgaa tcacctcagc actgtcctac agcgcttgca 4440 agacatcggt atccgattgg aaccaaccaa agcattcgtt ggattcccaa gtgttcagct 4500 attaggacaa cgcgtagacg ccctaggtct gtccactcct gaagaaaagc ttgctgccat 4560 tcgtaccttg gaattcccgc gcaacctcaa gcaactggaa cactatctag gtcttactgg 4620 atgcttccgc cactatattg acaagtatgc ccacatcatc aagcctttgc aggaacgcaa 4680 gacgcgcatg cttaagggta gccctctaaa aggacctgag cgccgtgcat tcgccactgg 4740 caaggcaatc aacgctccta ctgattgtga gactgaagca ttccgtcagt tgcaatccgc 4800 atttgacagt ccgcttttcc gtgcccattt cgaccccctt aggcgcttgt acgccgacct 4860 cgacgcatca tatgaaggtt ttggtgttat ggcatatcat attcagattg acgaccacca 4920 cactaatctt tcgattccgc ccgcgcgtac ggttattcaa cccatcttgt tccttagtcg 4980 aacgcttacc agcgctgaat cgcggtactg gctaaccgag ttggaagtgt cctgtcttgt 5040 atgggctctc cggaagctcc gctacatgat tgaatcatcc cgtcaaccaa cagtcgtcta 5100 taccgaccat gcttctactg ttggtatttc cacgcaaacc tctatgaaca ctgtcgcact 5160 tgaatggtta aacctacggt tgattcgtgc atctcaatat attcagcagt ttagactgca 5220 ggtgtttcac cggcctggta aatcaaatat agtcgcggat gcgctgtccc gtctcacaac 5280 aaaacaaaac aagaacatta agtataacga acctgatctc gactctattg atgcttactt 5340 taccgaccat gggtacactg cgtcgtccat ccaactatcc acccagctca agaaacgcat 5400 aatagaaggg tactgcgatg atccacggac cacacgcatc attgaggtcc tctgcaataa 5460 ccgaacgtcc gacttaccta ccgtcctacc ctaccaattg gatgacgacg gactcctcta 5520 catgacgaaa catcgcttaa ttggcgaggg aatcactgag gaaccttgga tctacattcc 5580 gcgtcccctt gcgaaggaca tgttccggtt gatccatgac gaacggaatc atcaaggcat 5640 tgataagtgc ctggcttcac ttgatgggtt tacgctgtac caaggccgcc gtatgctacg 5700 tcagtacatt aaacactgtc cggtttgttt gcagaacaag atccgccatc acaagccata 5760 tggaagcctc caacccctgc aggtcccgcc tgctccgttt gagattatta ctatggattt 5820 catcgtgggc ttacctgacg acaatggcta tgatcaactg ttggtcgtcg ttgacaaatt 5880 cagtaaacgc gttggtctca ttcctggtaa atcaacgtgg accgctcagg aatggggatc 5940 tagcgtcttg cggtatttcc aggaacatga ttggggaatg ccctgtttct ttatctctga 6000 ttgtgattcc atcttcatga gcaagttctg gaaaggctat ttcaccgcgc tgaaagcacg 6060 
ctggctgtac tcagcagcat tccacccaca gactgatggt cagactgaac gtgttatcca 6120 agtcattgaa gtcatgttgc gccattccta cactaccgct gaacgacctg atctgttccg 6180 ctggacgatg gatctaccga gtatcatctc cacaatcaat gggtcaccga atgaatctac 6240 aaaggccact ccgcaccgac tgctcttcgg tattgacctc cgccaaccat ggcagctact 6300 gaagcaattc gttaagcaag acttctcagt atgacttgat gctgaagaat ccatgaagta 6360 cgcatccata cgcatgaaag agatctacga tcggaatcat aaaccaatcg aattccgcgt 6420 tggtgatcag gtctatgtcc gactgcaccg tggctattcc ctaccaacta agcgagccaa 6480 tcgtaagctc caattgcaga atgctggacc gttccgcgta ttggaacgcg ttggaagact 6540 cgcctaccgt atcgaactac cctctacatg gaagatccat ccggttttgt ccgtcgccca 6600 ccttgaacct gcgccggcca cccccgatcc gttccaccgt gagttaccga agcctcccgc 6660 ggtcgttgac gccgaggtct accccggtga ggatgacata tacgaagtcg aacgcttgct 6720 ggacaagcgt accgttcagc gtggacgcaa acgtacccct tatgtggaat accttgtgcg 6780 gtggaaggga tacggaccgg aagacgacca atgggttcgt aaagatgatc ttcaaggttc 6840 tctcgaactc attgaggcat tcgaacgtaa ccgtcctatg taacgccagc aggctggttt 6900 tacagcaacg aggacgttgc ttttcgttgg ccccccataa 6940 // ID VADER repbase; DNA; FNG; 437 BP. XX AC U37228; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 11-NOV-1996 (Rel. 1.1, Last updated, Version 1) XX DE transposon Vader. XX KW VADER; target site duplication; terminal inverted repeats; KW transposon. XX OS Aspergillus awamori OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-437 RA Amutan M., Nyyssonen E., Stubbs J., Diaz-Torres R.M. RA and Dunn-Coleman N.; RT "Identification and cloning of a mobile transposon from RT Aspergillus niger var. awamori."; RL Curr. Genet 29(5), 468-473 (1996). XX RN [2] RP 1-437 RA Amutan M., Nyyssonen E., Stubbs J., Diaz-Torres R.M. RA and Dunn-Coleman N.; RT "VADER."; RL Direct Submission to Genbank (28-SEP-1995)Eini Nyyssonen, RL Microbiology, Genencor International, Inc., 180 Kimball Way, RL South San Francisco, CA 94080, USA. 
XX DR GenBank; U37228; Positions 1 437. XX CC The Vader element is present in approximately 15 copies in both CC A. niger var. awamori and A. niger. CC Insertion of the Vader element caused a 2-bp duplication (TA) CC of the target sequence. CC The Vader element is flanked by a 44-bp inverted repeat. XX SQ Sequence 437 BP; 150 A; 65 C; 72 G; 150 T; 0 other; acgtaatcaa cggtcgaacg ggccacacgg tcaggcgggc catcctgaaa tcccatataa 60 aagatgtctt ggggattcta ttatatatca accagtacta cttctatgaa gctctaactt 120 tgtagatagt tatatatata agaataagta ttccatgaat ttttcagatt ttagaatttt 180 tactttgata atgaaaccag attcttatat aaaacatata aatacagata ttgtaatatg 240 ataagtccat aagtaaaagt atattcattt ttagaaggta tatagatatt atttatatta 300 tttaaaatct atatagaaga aatctaattc ttctagacct ggatggtaga gatatattat 360 gtttaaaaag atatcttttg tatagtatta ccagatggcc cgcctgaccg tgtggcccgt 420 ccgaccgttg attacgt 437 // ID Gypsy-8_MLP-I repbase; DNA; FNG; 5639 BP. XX AC AECX01002120; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_MLP_; KW Gypsy-8_MLP-LTR; Gypsy-8_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5639 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002120; Positions 173212 167574. XX CC Positions [4440-4919] - Integrase core CC 'CAGTC' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 331..1389 FT /product="Gypsy-8_MLP-I_1p" FT /translation="MEEIQRQLTELQNSLAGERRLREQAEARSRQAEERLA FT AIESSRNTTSHTFTNAGNAAPPTVPQHETTPNPKGPKVATPDKFDGNRGGP FT AEVFASQVQLYMLAHPHLFPTDRTKVVFALSYLTGTASSWAQPMMTELLDD FT STAHLVTFDRFVRNFKAMYFDTEKKSKAEKALRSLTQKSTVAAYTYEFNLH FT AANTGWEVPTLISQFEQGLKRDIRVAMVLVQDEFTSVEQIANLAIKLDNKI FT HGTADTSTTVSTPARDPNAMDISSSFTRLSDEERSHRLRTGACFKCAARGH FT RANACPGRTGDRRGRGNGGYSSRIADLEVKLAALNDSRNEVKDRAEGSSRS FT DVSKNGGAQA" FT CDS 1416..5540 FT /product="Gypsy-8_MLP-I_2p" FT /translation="MDLGASIALGTSTVVTCNTSDPRLFLSVTFSLTQNPR FT ATPSYYPSARLLIDSGATHNVLGESFARKADILHLGVSTSRDITGFNGSRS FT RSSHELDLYLERDTQPTNFIITDLKNTYDGILGIPWIRANSHRIDWSSGII FT TTANVFAASTVVESSGPSKPSKDPKLEHPRDARYRDEGMCVTDNTLTSPQS FT ESFNLLPQSDSPEGVGKHDSLLQQLESSTTNNEYNLTEGTITHSNLLTPDI FT KTQTEGDCIAADDPASSTPKHNPEGPLEEPTGHARQHDEGACIVKSTHKPP FT QCEFDSSHLNPSPVSAGKPFLSLNNSPTEISSLKASWSTSACLAADEKKNI FT PTKSVEELVPTAYHRYIRMFQKSNAQCLPPRRKYDFKVELVEGAQPQASRI FT IPLSPAETQTLDKMINTGLANGTIRRTTSPWAAPVLFTGKKDGNLRPCFDY FT RKLNAVTVKNKYPLPLTMDLVDSLLDADKYTKLDLRNAYGNLRVAEGDEDK FT LAFICRAGQFAPLTMPFGPTGAPGYFQYFMQDILLQHIGKDAAAFLDDIMI FT YTKKGVRHENVVLQILEILDKHQLWLKPEKCEFSKSEVEYLGLVISRNKIK FT MDPTKVRAVSEWPAPRNVTELQRFIGFANFHRRFISHFSQTTRPLHNLTRH FT KTPYLWDDECKKSFEELKNAFTSAPVLKIANPYKPFVLECDCSNFAIGAVL FT SQISKDDNELHPVAYLSRSLAQAERNYEIFDKELLAIVASFKEWRHYLEGN FT PNRLDVIVYTDHHNLESFMTTKQLTRRQARWAETLGCFDFQIKFRPGQQAA FT KPDALSRRPDLAPTHDEKLTFGQLLRPENITPDTFQPEVASIESCFENEEI FT ELPDAEHWFEVDVLGVNNETPTDPILNDEDIIGLIRKTTNDDPRLKELIQA FT ISNPISTAIRQKTANYRVHNGIVYNQGRIEVPNNNEIKRQILRSRHDSLLA FT GHPGRAKTLSLVKRSFTWPSQKAYTNKYVDGCDSCLRNKSSTQKPFGSLEP FT LPVPAGPWVDISYDLITGLPLSNGKDSILTVVDRLTKMSHFIPCNESMTAE FT TLADLMVKFVWKLHGTPRTITSDRGSIFISQITKELDKRLGIRLQPSTAYH FT PRTDGQSEIVNKVVEQYLRHFVNYRQDNWETLLPIAEFSYNNKDHASTGVS FT PFKANYGYNPSFSGIPSAEQCIPSVEKRLTELINVQSELTECLKTAQEEMK FT IQFDKGIRSTPDWKIGDQVWLNNKNLSTTRPSPKLDHKWMGPFNISKKISR FT SAYELTLPTSMRGIHPVFHVSLLRKHETDGIEGRQESEPQPTIMEGNEEWE FT 
VLEILDCRKRYRKKEYLVSWKGFGTEHNSWEPEENLSNSKDLVNEFNIKFP FT NAASKYKRTRRK" XX SQ Sequence 5639 BP; 1784 A; 1378 C; 1217 G; 1260 T; 0 other; tattgtcgta tctacaacaa gcgggcatca acagatcaga agaactagaa tcagaaactt 60 taagaattga aattgaactc gaataagatt agaaacaaca agaaggatag attagatcat 120 atttagaaaa tcattgaact gaaactttat acttgatatt gaacgcacca ctgaagatta 180 acaccgaact acctccgcag aactctatcg atcacgtctc cctcatacag atctcccttg 240 ctcgacgacg acgatccgga tagtgatcgt gaggaatcct ttgtcgacac caaccaagca 300 ccacctgtca ctgaacttac ggtcgatgag atggaggaaa tccagcgtca gttaactgaa 360 ctccagaatt cattagccgg agaacgtcgc cttcgagaac aagccgaagc ccgcagtaga 420 caggcggaag agcgattagc cgctatcgag tcttctcgta acactacttc gcataccttc 480 actaatgcgg ggaatgcagc acctcctaca gtgccacaac acgagacgac cccaaatccc 540 aaagggccta aggtcgccac gcccgacaag tttgatggca atcgaggtgg cccagctgaa 600 gtctttgcca gccaagtcca actgtatatg ttggcacacc ctcatctatt ccccaccgac 660 cgtactaagg tcgtattcgc gctatcatac ctcacgggta ctgcgagttc atgggcccaa 720 ccgatgatga cagagttact tgacgactcc actgcccatt tagtcacgtt tgaccggttc 780 gtgcgtaact ttaaagcgat gtatttcgac accgaaaaga agtcgaaggc ggagaaggca 840 ttacgatccc ttacccagaa atccactgtc gcagcgtaca cctatgagtt taacttacac 900 gctgctaaca caggctggga agtgcctaca ctgatcagcc agtttgaaca aggcctgaaa 960 cgagacataa gagtagctat ggtactagtt caagatgagt tcacttcagt agagcaaata 1020 gccaacctcg cgatcaaact ggacaacaaa atccacggta cagcagacac ttcgaccact 1080 gtatcaacac cagctcgaga ccccaatgcg atggacatct catcttcgtt tactcgacta 1140 tccgatgaag aacgatctca tcgtttacgt actggtgctt gtttcaaatg tgctgcacga 1200 ggccaccgtg ccaatgcttg tcctggtaga acgggtgatc gtagaggaag aggaaatggg 1260 ggttacagta gtcgcattgc tgatttagag gtgaagctag ctgctttgaa tgatagtagg 1320 aatgaagtga aggatcgggc agagggttct agtcgtagtg atgtttcaaa aaatggaggc 1380 gctcaagctt gacggttgtg cctagcttga gcagtatgga tttgggtgct tcaatagctc 1440 taggaactag cactgttgta acttgcaata cgagcgaccc acgcttattc ctctcagtta 1500 ccttttccct gacccaaaat ccacgcgcca caccatcgta ttacccatca gcccgcctcc 1560 tgattgattc aggtgccacc cacaatgtgt 
tgggagaatc atttgcaaga aaagccgata 1620 tcttacacct aggagtgagc acttcacgcg acatcacagg attcaatggt tcaagatcaa 1680 gatcttcaca tgaactggac ctctacctcg aacgcgacac ccaacccacc aacttcatca 1740 tcaccgacct taagaatacg tatgacggta tcctaggcat tccatggatc cgcgcgaaca 1800 gtcacaggat tgactggagt agtggtatca tcaccacagc taatgttttt gctgcctcca 1860 cagtagtgga gtcgtctgga ccatcaaaac cctccaagga tcccaagttg gaacacccga 1920 gggacgctag gtaccgtgac gaggggatgt gtgttactga taacacgtta acatccccgc 1980 agagtgagtc ttttaatcta ttacctcaat cagattcacc cgaaggtgtt ggcaagcatg 2040 attctcttct acaacagcta gaatccagta cgacaaacaa cgaatacaac ctcaccgaag 2100 gcacgattac acactctaac ctgttaacgc ccgacatcaa gactcaaact gaaggagact 2160 gcattgcggc tgatgatcca gcctcgtcta cgccgaaaca taaccctgaa ggtccactgg 2220 aggagcccac agggcacgca aggcaacatg acgagggggc gtgtattgta aaaagtacac 2280 ataagccccc gcaatgtgag ttcgactcgt ctcatctcaa cccatccccc gtatcagctg 2340 gcaagccttt tctttcccta aataacagtc ccaccgaaat cagctccttg aaagcctcgt 2400 ggtcgacttc tgcttgtcta gcagccgatg aaaaaaagaa catacccact aaatcagtgg 2460 aagagctagt accaacagct tatcatcggt atatccgcat gtttcagaaa tccaatgctc 2520 aatgcctacc acctcgaaga aaatatgact tcaaagtaga gctggtagaa ggagcgcaac 2580 ctcaagctag caggataata ccattatccc cagctgaaac tcagacttta gacaaaatga 2640 taaacactgg cctggcaaat ggtactatca gacgtacaac ctcaccctgg gcagctccgg 2700 tgctcttcac cgggaaaaaa gatggaaatc taagaccctg cttcgactac cgtaaactga 2760 acgccgtgac ggttaagaac aaatatccgc tgcccttaac catggatctg gtggacagtc 2820 tcctcgacgc tgacaagtac accaaactgg acttacggaa tgcatacggt aacctacgag 2880 tagctgaagg agacgaggat aaactcgcct tcatctgcag agcaggtcag tttgcccccc 2940 tgacaatgcc ttttggacca acgggagctc ctggatactt ccagtatttc atgcaagaca 3000 tactgctcca acatattgga aaagacgcgg ctgccttttt agacgatatt atgatctata 3060 caaaaaaggg ggtgcggcac gaaaatgttg tattacagat actagagatc cttgataaac 3120 accaactatg gcttaaacca gagaagtgcg aattctcaaa atcagaagtc gagtacttag 3180 gcctggtcat atcacgcaac aagatcaaga tggatcctac taaagtaaga gccgtatccg 3240 agtggccagc cccaagaaac gtcaccgagc tccaacgatt 
cattggattt gcaaattttc 3300 atcgaagatt catcagccat ttctcacaga caaccagacc cctgcataat ctgacacgcc 3360 ataaaacgcc ttacttatgg gatgacgagt gcaagaaatc atttgaagaa ttgaagaatg 3420 ctttcacgtc agcaccggtt ttgaagattg caaatccgta caaacctttt gtactggaat 3480 gtgactgttc caattttgca attggcgccg tactatccca aatcagcaag gacgacaatg 3540 aactacatcc agttgcatac ttatccagat cattggcaca ggcggaacgg aattatgaaa 3600 tatttgataa ggaactcctt gctattgtgg cgtcttttaa agagtggagg cactacttag 3660 aaggaaaccc gaatagattg gacgtcatag tatatacgga ccaccacaat cttgaatcat 3720 tcatgactac caaacaatta acccgacgcc aagctagatg ggccgagact ctaggatgct 3780 ttgattttca aattaaattc cgcccaggac aacaagccgc taaaccagat gcactgtcaa 3840 gaagaccaga cctagcgcca actcatgacg aaaaattaac ttttggtcaa ctgcttagac 3900 cggaaaacat cacacccgac acttttcaac ccgaggttgc aagtattgaa tcttgttttg 3960 agaacgagga gattgaacta ccagatgcag agcattggtt cgaagtagat gtcttgggag 4020 taaacaatga gacaccaacc gaccccatcc tcaacgacga agatatcata ggcctaatca 4080 gaaagactac taatgatgat ccaagactga aggaacttat acaagcaatt tccaatccca 4140 tatctacagc aatacgtcaa aaaacggcca actatagagt acataatgga atagtgtata 4200 atcaagggcg aatcgaagtg ccaaacaaca acgagatcaa acgccaaatc ctacgcagtc 4260 gccacgatag tctactagcg ggacacccgg gcagggcgaa gacacttagt ttagtgaaga 4320 gaagcttcac gtggccaagc cagaaagcgt acacaaacaa atatgtagat ggatgtgatt 4380 cttgcctacg aaacaaatca agtactcaaa agccatttgg atctctcgag cctctaccag 4440 taccagcggg accatgggtc gacatcagct acgatttgat taccggactg ccactttcca 4500 acggaaaaga cagtatactc acagtagtcg atcgcctcac gaaaatgagc cattttattc 4560 catgcaacga atccatgaca gcagaaacat tagcagactt aatggtgaaa ttcgtatgga 4620 aattacatgg cacccccaga actatcacat cggatagagg gagcatcttc atttctcaga 4680 tcacaaagga actggataag agactaggca ttcgattaca accttccaca gcataccacc 4740 cgcgcaccga tggtcagtcc gaaattgtca acaaagttgt agagcagtac cttcgacact 4800 ttgtcaacta ccgacaggac aactgggaga ctctcttacc catcgccgaa ttctcgtaca 4860 ataacaagga ccacgcttcg acaggagtct cgccgttcaa agcaaactat gggtataatc 4920 ctagtttcag tggtatcccc tccgcagaac aatgtatacc ttcagtagag 
aaaagactga 4980 cagaactaat caatgtacaa tcggaactga ctgagtgttt gaagacagca caggaagaga 5040 tgaaaattca gttcgacaag ggcatccgct caacccccga ttggaagatt ggcgaccagg 5100 tttggctcaa caacaagaac ctttcgacga caaggcctag tcctaaatta gatcataagt 5160 ggatgggtcc ttttaacatc tccaagaaaa tatcaagatc cgcatatgag ctgactttac 5220 ctacgtcaat gaggggtata cacccagtat tccatgtctc tctacttaga aaacacgaaa 5280 cagacggtat tgaaggtcgt caggaatccg aaccgcagcc aacaatcatg gaaggaaatg 5340 aagaatggga agtattagaa atattagact gtagaaaaag gtataggaag aaggaatatt 5400 tagtcagctg gaaagggttt ggaactgaac acaattcatg ggaacctgag gaaaatttat 5460 caaatagcaa agatttagtg aatgaattca acatcaaatt cccaaatgca gcaagtaaat 5520 acaaaaggac aaggagaaaa tgagagaggg caagcttttt cccactgggt tttttaacgc 5580 tgcccgtgga aggaacgcag aacttgcaag agggagtttg ggcgtaaaaa gggggataa 5639 // ID Copia-26_MLP-I repbase; DNA; FNG; 4622 BP. XX AC AECX01001250; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-26_MLP_; KW Copia-26_MLP-LTR; Copia-26_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4622 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001250; Positions 5750 1129. XX CC Positions [1916-2440] - Integrase core CC 'AAGAC' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 77..3799 FT /product="Copia-26_MLP-I_1p" FT /translation="MSTESSAPSPDPSLTDSEDETFTSISDSGTEIPGNSN FT MADKGESKIFGTPIQQFVQKLTSLLAKYDIETKLSDGNFPAWALEVKQVLN FT VIDYQKYITKLNFKAPSLTDEQHIKVKLVVSIWMMSLMDKNNKIRVQTMLK FT LRGCGDDDDDDEEDDDVAYEPALIWRFLKNHHQKISEAGLQTIEDAINDMK FT ILSTDSFKEHCDKFNNLIADFVKYRGRISSASAARKLIKTVKSRITENVSE FT NIYTKVDPLTREGVVQYLIDYEARNGGFSTPAIVEASHSNFSTSAAQSGGN FT YRQNKPRCSESKCISTTHDDDKCFAKPKNHAARDLWIAQMEAKRSRPQYTR FT PNQTSTTSNVSGLRQVTLPSASLAVASDNPFASLHVHFDDSNDISASASAL FT STFESSHTVFDEDSPAANVVNESNGLWALYDTGATHYMFKSDKLFTSDSMT FT RVEDSSKRLKLAGGDASLAVHSTGTVQLKSGSGKMFELNNSLYVPDLSQNL FT LAGGAMLRKGVQVLVHPNDTNCFSLVFKGEALFNGVFASNNLMYVSLEPVS FT PITGSTALNTSTEDLTQLQHRRLGHLNNRYLKIMCNHRSVEGLPTNLSQLN FT HCDVCSLSKNHKIPHSSTRPRASRHLENVHVNLSGIIRVKGLKNEMYYTMF FT CDDYSSYRHIYFMKDKTKETVFDIFRAYIALAERQTCKMIKQFTLDRGGEF FT LNELLGTELRERGITLHLTAAHTPEENGVSERGNRTISTKARSMMIESGSP FT LRLWVQACQTAVFLTNRTVTAALEGHKTPFEAWHFRKPSVDHIRVFGCLAY FT SLIRKEIRGSKFNPVSSKGVLVGFEEDNFNYHVYDLDSSKITITHNSTFNE FT NVFPFMTENQTATEVPPVSTELPNLKLRFFDEESDEEDVEVTTTINSTPNT FT LDPPAQKSTPTCVADAQPPQPSSPPAPPRRSGRIKGPVKYSAITIGENLEE FT STLTTAWELKQLFDCVLPKSNLVTSAHDVPRSYSKAVNGPEGTKWLAACKK FT EIKAMYDKKVWVLVDRPATSNVVRGLWLFRKKITSDPENPFKFKARFVAMG FT NTQLEGEDYFETFAPTGKPTSLRLLIALAAIFGWEVHQMDAVSAFLNSYLD FT EVIYVEQPEGFRVPGEEHKVCKLLKSLYGLKQAPKCWQDDVMDFLLSVNFI FT QCEVDHCIYIRSEGNLFTAVYVHVDNLAITGNDISSFKKEISQRWEMEDLG FT LATTVVGIEIKRINEHKYSICQESYTLKVLE" XX SQ Sequence 4622 BP; 1351 A; 1190 C; 967 G; 1114 T; 0 other; tggtagcgag agtataaaag atccgaccgt atcctctatc gtcaaccgtc atccactgtc 60 acttcgacct tgttttatgt caactgaatc atcagcacct tcaccagatc cttctctcac 120 tgactctgaa gacgagactt tcacctcaat ctctgactcc ggaaccgaga tccctggcaa 180 ctcaaacatg gccgacaaag gggaatccaa aatctttggt accccaatcc agcagtttgt 240 tcagaaactt acctccctac tcgccaagta cgacattgaa accaaactct cagatggaaa 300 ctttccagca tgggccttag aagtcaaaca agttctcaac gtgattgatt atcaaaagta 360 catcaccaaa ctcaacttca aagctccgtc tctgacagat gaacaacaca tcaaagtaaa 420 acttgttgtg 
tccatctgga tgatgagtct gatggacaag aacaacaaaa tcagagtcca 480 gaccatgttg aaactcagag ggtgtggaga tgatgacgac gacgatgagg aagatgacga 540 tgtggcatac gaaccggctc tgatctggag atttctcaag aatcatcatc aaaagatttc 600 tgaagccgga ttgcaaacga ttgaagatgc gataaatgat atgaagattc tcagcactga 660 ttcattcaaa gaacactgcg acaaattcaa caacctcata gcagactttg tcaagtatcg 720 tggtcgcatc tcatccgcct ctgccgctcg aaaactcatc aaaaccgtga agtccagaat 780 caccgaaaac gtatctgaaa atatctacac caaggttgac cctcttactc gagagggtgt 840 agtacagtat ctgattgact atgaggctag aaatggaggc ttctccaccc ctgccattgt 900 tgaagctagc cactcgaact tctctacttc tgctgctcaa tcaggaggaa attatcgaca 960 aaacaaaccg aggtgctctg aaagcaagtg tatatcaacc acgcatgatg acgacaaatg 1020 cttcgctaaa cccaaaaacc atgccgctcg ggatttgtgg atagctcaga tggaagccaa 1080 aagatcccga cctcagtaca ctcgacccaa tcagacgtcc actacctcaa atgtttcagg 1140 cttaagacaa gttaccctcc cgtcggcgag tctagccgtg gcttccgaca acccgtttgc 1200 ctcgctccac gtacactttg atgactccaa tgacatctca gcctcagcat ctgctctctc 1260 cacgttcgaa tcatctcaca cggtgtttga tgaagattca cctgctgcta atgttgtgaa 1320 tgaaagtaat ggactgtggg ctctatacga caccggagcg actcactata tgttcaaaag 1380 tgacaaactc tttacttccg actcaatgac tcgagtagaa gactcatcca agagactcaa 1440 actggcagga ggagatgcgt cactggctgt gcactcaacc ggaactgttc aactcaaatc 1500 tggctcagga aagatgttcg aactcaacaa tagcctctat gttcctgatt tatctcaaaa 1560 tcttctagcc ggtggagcta tgcttcgcaa gggggtgcaa gtcctggtac acccaaatga 1620 caccaactgc ttctcgcttg ttttcaaagg agaggctctg tttaatggtg tatttgcctc 1680 taacaacctc atgtatgtgt ctctcgaacc tgtgagtcct atcactggct ctactgcact 1740 caacacctca actgaagatt tgactcaact ccaacaccgt cgcttaggcc atctgaataa 1800 ccggtacctc aaaatcatgt gtaatcacag aagcgtagaa ggactaccaa ccaacttgtc 1860 tcagctaaac cattgtgatg tgtgctcatt gtctaagaac cataagattc ctcactcgtc 1920 caccagacct agagcatcac gccatttaga aaatgtacat gtcaacctta gtggcataat 1980 cagagtcaag gggttgaaaa atgaaatgta ttacaccatg ttttgcgatg attattcttc 2040 ttaccgtcac atctacttca tgaaagacaa aactaaagaa acagttttcg acatattcag 2100 agcctacatt gctcttgcag 
aacgacagac ctgcaagatg atcaaacagt ttaccctcga 2160 cagaggagga gagtttctta acgaactcct gggtactgaa cttcgtgagc ggggtattac 2220 tctccacctg acagctgctc acactcccga agagaacgga gtgtcggaaa ggggtaacag 2280 aactatcagt actaaagcca gatcaatgat gatagagtct ggatctcccc ttcgactttg 2340 ggttcaggcg tgtcaaactg ctgtattcct cactaaccgc accgtgactg ctgccctgga 2400 aggtcacaaa actccgtttg aagcctggca tttcaggaag ccatcagtag atcatatcag 2460 ggtgttcgga tgtcttgcat actcgctcat tcgaaaagaa atcagaggat caaagttcaa 2520 tccggtgagc tctaaggggg ttctggtagg ttttgaagaa gataacttca attatcatgt 2580 ctacgatctg gactcatcca aaatcactat cacacataac tccaccttca acgaaaacgt 2640 attccccttc atgaccgaaa atcagaccgc taccgaagtt ccacccgtgt ctactgaatt 2700 acctaaccta aaactgcgct tctttgatga agagagtgat gaggaagatg tggaagtcac 2760 aaccaccatc aactctacac cgaatacctt ggatcctcct gctcagaaat cgactcctac 2820 ctgtgtcgct gatgctcaac cacctcaacc aagctctcca ccagctccac ctcgccgctc 2880 tggaaggatc aaaggacccg tcaagtactc agccatcacc atcggcgaaa atctggagga 2940 gtcaactctc accactgcct gggaactcaa acaactcttc gactgtgttc tacctaagag 3000 caacttggtc acctctgctc acgacgtccc aagatcctac tcgaaagctg tgaatggccc 3060 cgaaggcact aaatggttgg ctgcgtgtaa gaaagagata aaagccatgt atgacaaaaa 3120 agtctgggta ctggttgaca gaccagccac ctccaacgtt gtcagaggcc tgtggctctt 3180 caggaagaaa atcacgtcag atcccgaaaa tcctttcaag tttaaggcaa gatttgtagc 3240 tatgggaaac acccaattgg agggcgaaga ttacttcgag acattcgctc cgacaggtaa 3300 accaacatct ctacgcctgt tgatagccct agcagcaatc tttggctggg aagtgcatca 3360 gatggatgct gtctccgctt tcctcaatag ctatctggac gaagtcatct atgttgaaca 3420 gccggaagga ttcagagtac cgggtgagga gcacaaggtg tgtaaacttc tgaagtctct 3480 gtacggactc aagcaggctc ccaagtgttg gcaagacgat gttatggact ttcttctcag 3540 cgtgaacttc atccaatgcg aagtagacca ttgtatttac atcagaagcg aaggaaactt 3600 gttcacggca gtgtacgttc atgtggacaa ccttgccata actggaaacg atatctcatc 3660 cttcaagaaa gaaatctctc aacgttggga gatggaagac cttggtttgg ctacaaccgt 3720 ggtgggtatt gaaatcaaac gcattaatga acataagtac tctatctgtc aagaatcata 3780 tactctcaag gtactcgaat gattcaactc 
tctagatgtc aaacctgcca gcactccttt 3840 cactgccaac cttaaactct acaaacccga cctgcaagag attgaagact ttgcttccag 3900 aaaattacct tatcgaagcg tcgtaggctc acttatgtac ctggcacagt gcactcgacc 3960 ggatctagct cacgctgtcg gaacactctc ccaacatctc gactggcctg gattccagca 4020 ttgggacgct gcgtgtcacg tactcaggta ccttcgtggc actgtcaacc tgggtatcgt 4080 ctactttcga ctagcgaacg ttgaacctgt caaaggtctc aagagtgaag tctgcccgca 4140 agcgctgtgc gacgcggact gggcagggga tcaagacacc agacgatcga cgacagggta 4200 tgtcttcata ctggccaatg gagctgtctc ctggaagagc cgactacaac ctacggtggc 4260 tctttcgtca actgaagccg aataccgagc tattaccgaa gccggccaag aggttctgtg 4320 gctcagaacg atgctcgcca aacttggtct cgaagatgcg aatcatactg tccttgaaag 4380 cgacaacaaa ggcgccatac acctcaccaa caaatcaatt ttccacggta gaacaaaaca 4440 catcgaaata cactatcact ggattaggga ggttgttaac tcaggacaga tttcactcaa 4500 acactgcccc acaaaattca tgatagctga tctcctcacg aagccgctgg gtactcaaca 4560 attcacaaat ctcagaaaac ttatgggtct caaacctata gtgtgaagaa cttgaggggg 4620 tg 4622 // ID Gypsy-55_MLP-I repbase; DNA; FNG; 5741 BP. XX AC AECX01002782; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-55_MLP_; KW Gypsy-55_MLP-LTR; Gypsy-55_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5741 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002782; Positions 11075 5335. XX CC Positions [4540-5019] - Integrase core CC 'CTGCA' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 404..1552 FT /product="Gypsy-55_MLP-I_2p" FT /translation="MASVNLEDVMRQLAELNARLTNETSLRQEAELRRQEA FT ERELNQMKQAQHSTPAPTNPVFQQPQIQATPKTPKVATPDKYDGSKGSKAE FT IFLDQLSLYIQLNSHLFGNDQARVTFALSYTTSKANIWGQRFTDQLLDDDR FT SSEVTWKNFIDEFKATFFDTERVSNAQKEIRALRQVKSVSDYWIRFSELSF FT VVRWNEDILMSLFEQGLKPEISLYMIMEKFEKVEEMARVAIKIDNRLHKRV FT SENQSMHQFLSTSNQMATPDPDAMDCLAYRMNISNEEYNRRGNTGACYSCG FT KLDHLIAECPMKNSSSRGRGGFGRGFGGGRGFYRGNQRYSGGSLYSKIHEL FT EARLKSKLDELDTQTGKVDKGKRVEEERRAESSKNGGARA" FT CDS 1606..5640 FT /product="Gypsy-55_MLP-I_1p" FT /translation="MSNIIDSLELNDTRIIATISLFDPKTATTELARALIN FT SGATHEAVSNKFVNRSCFTTSPLPERHSVTCFSGHKSLITHTGDYHVNNQE FT EETTFIVTELRDKYDVILGMPWIRRNHKSIDWSNGTLFDTINTEIATAEAV FT SSLRDQSSKDHELRPKGKATNSDEGVQVLDLFTPPQCECDHVSKPSNLESA FT GEYFPLLEQESKMNKLEDNSEEQEKDCKSQRPTTALMDHGKRPDRQARIVE FT MGVEFGNSSKPPQSEFATSPRSKLLGNVGKLYSPCARYRPIGTTFTRNFGQ FT TLTPLPKKTVLEAWNVSTRLAVEAAKGKVEKTAAELVPEDYHDYLEMFEKS FT KSNVLPPCRPYDFRVDLIPGATPQAGRIIPLSPKETEVLHEMLEKGLSNGT FT IRRTTSPWAAPVLFTGKKDGNLCPCFDYRKLNSLTVKNKYPLPLTMELVDS FT LLDADQYTSLDMRNGYNNLRIREGDEAKLAFICKEGQFEPLTMPFGPTGAP FT GFFQFFVQDILRSRIGKDVAAYQDDILIYTKPGVDHKKVVKEVLDILRAQN FT VWLKPEKCKFSQQEISYLGLVISRNQIKMDVTKVNVVRDWPAPKNLSEVQT FT FLGFANFHRRFMSHFSKIAQPLHELSQDDVKFEWTDERNLAFESLKMAFTT FT APVLTIADPYRPFVLECDCSDFALGVVLSQVSTNDNQPHPVAFLSRSLIKA FT ERNYEVFDKELLAVISAFKEWRQYLEGNPNRLSVTVYTDHKNLQSLMTTKE FT LTRRQARWAEILGSFDFEIRFRPGKQSTKPDALSRRPDLKPKEGDKLTFGR FT LLKPENLPTNAFIDSLDCMSSWVEEEAQVQIEFSDLNVELEALSVNEEMWT FT DEHILNEIRKASRHDHRVQQLINLCLEMPNSKLISDYTVEDDILYFREKAV FT VPDDDNLKLQILKSRHSSMLAGHPGRMRTLMLVKRNFHWPSMKMYINKYVD FT GCQSCQRVKTRNSKPFGALQPLPVPGGPWVDVCYDLITDLPESEGYDCILT FT VVDRFTKMAHFMACNKTMNSEDLAKLMIHNVWKIHGTPKSITSDRGNIFIS FT KLTQEINSALGIKTQASTAYHPQTDGQSKITNKAVEHFIRHFTSYKQDDWR FT RLLPIAEFSYNNNLHISIGMSPFRANYGHDVSFTGTQNSEQKLPAVSELID FT QINDVQNELREAMKIAQDEMKIQFDKKVLKAPDWEEGELVWLNSKHISTTR FT PTAKFSHRWIGPYKIEKRVNTNAYKLILPKEMQDIHPVFHVNLLREFKESE FT IEGQQEKPPAPIKIQEEDEYEVNEILNKRRIRGRVEYLVSWKGYASNHDSW FT 
EPQDNLMNAQDLVKEFNDKYPDAEKKYFRTRRR" XX SQ Sequence 5741 BP; 1877 A; 1090 C; 1312 G; 1462 T; 0 other; tattgcaacg tctcattctg gatagcagca actaaagatt ctaaaattca agatacatta 60 gaagaagaag aaaaatcgaa aagttattag ctcaaaagtt taaaagatca agtaattcca 120 agaaagaagt tatcagaaga ataaagtaga aaattaaagt aaaagatatt aatagaagaa 180 ggattaaaga agacacaaga ttcggtttag gactaaatta aatagttgaa ccttataatt 240 cactcatcta taactacccc gcatattctc atactaagtt agaactctgt ttggaactca 300 atctcttatc accaccgcca ccacgcctaa ctttacagct ccaacggaag attcgacctc 360 atcaaacggg gaagactcaa acgtgaacaa accattatca gagatggcct cagtgaattt 420 agaagacgtt atgcgtcaac tggctgaact caatgctagg ttgacaaacg agactagttt 480 gcgtcaggaa gccgaactta gacgccaaga agctgaaaga gagttgaacc agatgaaaca 540 agctcagcac agtacccctg ctccgaccaa tcctgtgttt caacaaccac aaatacaggc 600 aactccgaaa actcccaagg ttgcgacccc tgacaaatac gacgggtcga aggggtccaa 660 ggctgaaatt tttcttgatc aattgagcct gtacatccaa ctgaattcgc acttattcgg 720 caatgatcaa gctcgagtga cgtttgcctt gtcttacacg actagcaaag cgaacatctg 780 gggtcaacgc ttcacggacc agttactaga tgacgatcga agttcggagg tgacttggaa 840 aaactttatt gatgaattta aggccacctt ctttgacact gaacgtgttt ccaatgcgca 900 aaaggagatc agagctctga gacaagtgaa gtcggtgtcg gattactgga ttagattctc 960 tgagttgtcg tttgtagtta gatggaatga ggatatctta atgtcattat ttgaacaggg 1020 cctgaagcct gagatttctc tttacatgat tatggaaaag tttgagaagg tagaagagat 1080 ggctcgggta gcaatcaaaa ttgacaatcg attgcacaaa agagtcagtg agaatcaatc 1140 catgcatcag tttttgtcaa cctcaaatca aatggcaact ccggatcctg atgcgatgga 1200 ttgcttggcg tacaggatga acatatctaa tgaggagtat aacagaagag gcaatactgg 1260 agcatgctat agttgtggta agcttgatca cttaatagct gaatgcccaa tgaagaatag 1320 cagttcgagg ggaagaggag gctttggaag aggatttggt ggtggcagag gattttatcg 1380 gggtaatcaa cgttattcag gaggatcatt gtatagcaag attcatgaat tagaagctcg 1440 cttaaaatca aagttagatg aattagacac tcaaactggg aaggtggata aaggaaaacg 1500 ggtagaggag gagcgtagag ctgagagttc aaaaaatggc ggtgctcggg cgtgaaggtt 1560 gtgcctcacc cgagcgtaat taacttagaa tcaaatgaaa atgaaatgag caatattatt 1620 
gatagtcttg aattaaatga cacccgtata attgcaacta tttctttatt tgatcctaag 1680 accgccacaa ctgaactagc gagagcattg atcaatagcg gtgctacaca tgaggcagtg 1740 agtaacaagt ttgtgaatcg atcatgtttt acgacgagcc ccttgcctga gagacacagt 1800 gttacctgct tcagcggaca caaatcctta atcacgcaca ccggtgatta tcatgttaac 1860 aatcaagaag aagagacgac cttcatagtt actgagctga gagacaagta cgatgtgata 1920 cttggcatgc cctggatcag gcgtaaccac aagagcattg attggtcgaa tggcaccctt 1980 ttcgatacga ttaatactga aattgcaact gctgaagcag tttcgtcact gcgggaccaa 2040 tcctcgaagg accacgaatt gaggcctaag gggaaagcta cgaacagtga cgagggggtg 2100 caagtcttag acttattcac acccccgcaa tgtgagtgtg atcatgtttc taaaccttca 2160 aatcttgaat cagctggcga gtattttcct ctcctagaac aagaatcgaa aatgaacaaa 2220 ctcgaagaca acagcgagga gcaagaaaaa gactgcaagt ctcaacgacc gacaacagcc 2280 ttgatggacc atggaaaaag gcctgacagg caagctagga tcgttgagat gggggttgag 2340 tttggaaact cgtcaaaacc cccgcagagt gagtttgcta catctcctag atctaaattg 2400 cttggaaatg ttggcaagct ttattctccc tgtgcaagat atagacccat tgggacgaca 2460 ttcacgagaa actttggcca aaccttaact cctctaccga agaagactgt gcttgaggct 2520 tggaatgttt ccaccagact agcggtcgag gctgcgaaag ggaaagttga gaagacggct 2580 gcggagctag taccagagga ctaccacgat tacctagaga tgttcgaaaa gtccaagtcg 2640 aatgtcctcc ctccatgcag gccttatgat tttagggtag acctgatacc tggtgcgacc 2700 cctcaagctg gacggatcat ccctctgtcg cctaaagaga ctgaagtgtt gcacgaaatg 2760 cttgagaaag gattaagcaa tggcactata cgacgaacta cctcaccctg ggcggcacct 2820 gttcttttca cgggaaagaa agatggcaat ctgtgccctt gttttgatta ccggaaactt 2880 aattctctca cagttaagaa caagtaccca ttaccattaa ctatggaact tgtagacagt 2940 ttactagatg ctgatcaata tacaagcttg gatatgagga atggttacaa caacctgaga 3000 attagagagg gggatgaggc aaagcttgct ttcatctgta aggaagggca atttgagcct 3060 cttactatgc cctttggacc cactggcgcc ccaggttttt tccaattttt tgttcaggac 3120 atactcagat ctcgaatcgg aaaggatgta gctgcctatc aggacgatat cttgatttat 3180 acaaaaccag gagtggacca caagaaggtt gtcaaggaag ttctagatat cttacgagct 3240 caaaacgtgt ggcttaaacc tgagaagtgc aagttctcac aacaagaaat ttcatatctt 3300 ggattagtta 
tctctcgtaa tcaaatcaag atggatgtta ctaaagtcaa tgtggtaagg 3360 gactggccag ctccaaaaaa cctgtcggaa gtgcaaacgt ttcttggttt tgccaacttt 3420 caccggcgtt tcatgagtca tttttcgaaa attgcacaac cactacatga gttgtcacag 3480 gatgatgtta agtttgaatg gactgatgaa cgaaacttgg cttttgaaag tctgaagatg 3540 gctttcacga cggctccggt gttgacgatc gcagaccctt acaggccttt tgttctggag 3600 tgtgactgca gcgactttgc gcttggtgtg gtcctatctc aagtctcaac gaatgataat 3660 caaccacacc ccgttgcatt tctttcgcgg tcactgatta aagctgaaag gaactatgag 3720 gtgtttgata aggaattgtt ggccgtaata tcagctttca aggagtggag acagtatttg 3780 gaaggtaatc cgaatcgtct cagtgtcact gtttataccg atcataaaaa tttacagtcc 3840 ttaatgacaa ccaaagaact taccaggagg caagcaaggt gggccgaaat cttaggcagc 3900 ttcgattttg aaatccggtt tcgtccagga aaacaatcga cgaagccgga tgcgttgtcg 3960 agaaggccgg atctcaaacc taaagagggt gacaagttaa cgtttggaag attgctcaag 4020 cctgaaaact tacctaccaa tgctttcatt gattctcttg attgtatgag ttcatgggtg 4080 gaggaggaag cacaagttca aatcgaattc agtgatctca atgtcgagtt ggaagccttg 4140 agtgtgaatg aagaaatgtg gacagatgaa catatcctaa acgaaatacg taaagcttca 4200 agacatgacc atcgagtaca acagctgata aacctatgcc tagagatgcc aaactcaaaa 4260 ctcatatccg actacacggt tgaagacgat atactatact ttagagagaa agcagtagtt 4320 cctgacgatg ataatttaaa actacaaatt ctgaagtcca gacacagtag tatgttagct 4380 ggtcatccgg gacgaatgcg gactctaatg ctagtaaaac gtaacttcca ctggccgtca 4440 atgaaaatgt atatcaacaa atatgtagat ggatgtcaat cctgtcagcg tgtgaaaact 4500 agaaattcga agccctttgg agcacttcaa ccgctgccag ttccaggcgg tccttgggtg 4560 gatgtgtgtt atgacttgat tacagattta ccagagtcag aaggatatga ttgtattttg 4620 acagtagtgg atcggttcac gaagatggcc cattttatgg cttgtaacaa gacaatgaat 4680 tcagaggatt tagcaaagtt aatgattcat aacgtatgga agattcacgg aacacctaag 4740 tcaatcacgt ctgatagagg aaatatattt atatcaaaac ttactcaaga aatcaattca 4800 gctttaggaa tcaagacaca agcatcgaca gcttatcacc cgcaaacaga tggccagtcc 4860 aaaatcacaa ataaggcagt agaacatttt atacgacatt tcacttcata caaacaagat 4920 gattggagga ggttattacc aatagctgaa ttctcttata acaacaattt acatatttct 4980 ataggcatgt caccgtttag 
agcaaactat ggtcatgatg ttagctttac aggaactcag 5040 aatagtgaac agaagttacc ggctgtgagt gaattgattg atcaaattaa tgatgttcaa 5100 aatgaattac gtgaagctat gaaaattgct caagacgaaa tgaaaataca atttgacaag 5160 aaagtattga aagcgccaga ttgggaggag ggtgaattgg tgtggttgaa tagcaaacat 5220 atttcaacta caagacccac tgctaagttt tctcatcgat ggattggtcc gtataaaata 5280 gagaaacgag ttaacactaa tgcctataaa ttgattctgc cgaaagagat gcaggacata 5340 catccagttt ttcatgtcaa tcttcttcga gaattcaagg aaagcgaaat agaagggcag 5400 caagaaaaac cacctgcacc aatcaaaatt caagaagaag atgaatatga agtgaatgaa 5460 atattaaata aaagaagaat cagaggaaga gttgaatatt tagtgagttg gaaagggtac 5520 gcttcaaatc atgattcctg ggaacctcaa gataatttaa tgaatgctca agatttagtt 5580 aaagaattta atgacaaata tcctgatgcg gaaaagaaat attttaggac aaggagaaga 5640 tgagagaggg tgaagctttt ttcccactgg gtttttaatg ctaacccgtg gaaagatagc 5700 tagcctgtca agagggggct gagttataaa agggggagtg g 5741 // ID Gypsy-7_CCO-LTR repbase; DNA; FNG; 1233 BP. XX AC AACS02000003; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_CCO_; KW Gypsy-7_CCO-I; Gypsy-7_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-1233 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000003; Positions 3360448 3361680. 
XX SQ Sequence 1233 BP; 347 A; 301 C; 293 G; 292 T; 0 other; tgttacgcct gagggtcatc tgacccttca aaaaccgcga ataacacacc gccggactaa 60 tttcgtccgg taaaacgtac atctcgataa gtgactgaaa agccctaagg cactgtgttt 120 ttcataaccg gcggtcactt attctgtcgt atgagttgag accgagccca gagtcgtcaa 180 ctaagtcgaa ctctaagaac gactctgaaa gaggttgaac gtaacgcatc gttgatactg 240 tagtaattga cgggttaagg atgtgtactg gaccgattga agatcgaact acggtagtga 300 gaattcggca cgaagtcctc accgccggga gcaagactat ccatgggagt gcacgatcga 360 cttcaaatcg actagaaagc tagtagcgac ttaaccagaa cgacgacact tagcaccagc 420 gcctgtcagt accctaagag ggaacgccca taagcgccgc cggacggtcg gaatcagcca 480 tcagtcgcgg aaagctaaaa ggggacctct acgtattagg tacgagatcg ataagataga 540 agccgagata agataagcca gaacgacggg accaggaacg gatcacacga gatgttctat 600 tccgttggac aggtgtcctc tctaggacag atgaaacagg gtactaacag tcccatgacc 660 tgtcaacaga aaaagaagta gtacagagag aagggacaga acgatgaatc catacgccgc 720 cctccggcgg tagttacaaa aggggtatac catctgtttc cgcggaagca tcttcacgga 780 gtaacaatgg agtttcccac cacacattgg ttccgttccg actggacggg aactaccaaa 840 atccacgcgc ttcaacagcg catcaataga cgcctatggg catactcggc acgacaccgc 900 cggggcttct tttgtcgcgt ttctttccta tttcgtattt ctgattctct cgtacctccg 960 ctccgtctct ctactctcct tttctgtatc atgcgcatgt attcggtttg ttttatagtc 1020 tttgctagtc tttgttttaa agctcgtagt caaacatttt gtaccatata tataggcccg 1080 ccgggcaaaa gaatcaacag ataccccttt gactcgatta tatatcttgt cctgatcgac 1140 acaagtcgtt cagtaaaata aagaggaagt ctccgagcgt cgagtgaacc agttagcgag 1200 tcagagtcga gttagaattt gcgcgtgaat aca 1233 // ID Gypsy-14_LBS-I repbase; DNA; FNG; 10810 BP. XX AC ABFE01000655; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_LBS_; KW Gypsy-14_LBS-LTR; Gypsy-14_LBS-I. 
XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-10810 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000655; Positions 83374 72565. XX CC Positions [6747-7226] - Integrase core CC 'GTTTC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 411..7847 FT /product="Gypsy-14_LBS-I_1p" FT /translation="MSTINAASSNPSPSAPNQNDLPQPSSNAPLPDNMDST FT EDMSAIRRGMERGYALSPMYSLIIEYNREQGNNFSTAAEVSWILQQPNLKP FT SDIILKKDHPDPYTLVAVGEPRLRRLLRAVTELQTFLDRMANLIDERSNVF FT RVDPQDTMTTALQGCESRSQLEVAYSILLKRLLTAQQTVSKYEAQYRDEDT FT PLSPTSTLPDLYDDFDRLDDVDSRMRYLLQKIPHHQGHLSSAAQAAVHQGH FT SWDVIHPTLPLQPELQPHSQALPLPIPSSAPFEPDSSRQSIKEGKKKVEWG FT EASPWDEKSSSMEQGRDDNEGLEPSFGFQTPFKGGVRFFDVSNSPDQSAYF FT STPGVAPLPDVTVGLATPSVTPFADNVKQANQPRLALMSSKDAPSERNPHA FT RPGQTPPNSFPSFKGGSPPDDDPPGGGSGGRGGGGNGGNPFPPRGSPRDNS FT NPNPNPGGPYRPGGGSGDPGGGGGGGPPSAENQVAPPAPYGNIPASIKTEL FT KVEQLPEWDGNHWTAIDYFWEVQQLAHLGGWIPEALGYWLWFRLKDKSPVK FT SWFITLPLNYQTYMRSHYLKFLKGIKDGYLGHRWQLRMNNYYNSQHFRERN FT HERETPSEFIVRRIVYTRMLLSISGGPLEVFYIMRKAPISWGPILLISSIK FT DSSELYSWVTEHEEALLEAFRVSKGGQAPSIDNIVSQLKQMGLIPDKDKGG FT GSSTYQSPSTYQGPSTYQRRANLVENSNANPSIEPVDETVSPSAPENLPAN FT QSSDPSTNQHILREAYQVLKQRQRPPPVGGYPYAKNDHVTTKMGRLPPSPC FT KCCGSANHWDKECPDWNTYLEKAKRSANMAEFWAPNESEKTYVTVYSILLN FT ERLAGEIVNQPLLEDSLTQQDFRSASFLSQATAEGVSKSRKGSQVRKTRTT FT MEEVEDEDWLAYLAKPKSSEFLMEEVKQVKETLVSRTNSEEERVEPNFRSP FT SARPENETKETEEIPPTESSEWPGPPIPDIRVRLKKKRFAPAGASAVGVSV FT VAVQGWVGSTRNKPIDIRLDSCADVTLISQEYLESLQDRPPCQNGLKMNLW FT QLTDKDATLQGYVRIPIFFVSSDGILLETEAEAYVVPNMTVPILLGEDYHV FT NYELIVAHKVDFRSVVNFVGVPYSVPARGVSRTKDFGRMRQSTCTTASFIK FT ARLHKRNKARKARKIKSFGIEKRTVRAAEDYRLRPDECRRIRVDGHFEEDK FT EWLVEKNLLAAADDSTFVVPNVLISASNPWIPVSNPSPHPKVIRKGDVVGY FT LTDPQEYFDAPQTSQGLDKLVKMAEALVTIIAVSSSDSNQTSSTSEESEKP FT 
SEQAEPSLEKEVSEEEEPEAYGPKTAELPDTTEYPSLRMREFLDVGSLPEH FT LQERAWEMLEKRKKAFGFDGRLGHHPSKVHIRTVDGQVPIAVPMYGASPAK FT RAVINEQLDKWFEQDVIEPSKSPWSAPVVIAYRNGKPRFCVDYRKLNTATI FT PDEFPIPRQSEILSSLSGAQVLSSLDALSGFTQLEMDEDDVEKTAFRTHRG FT LFQFKRMPFGLRNGPSIFQRVMQGILAPYLWIFCLVYIDDIVIYSKTYEDH FT IDHLDQVLGAIEKAGITLSPVKCHLFYSSILLLGHKVSRLGLSTHNEKVRA FT ILELSRPTKLSQLQTFLGMAVYFSAFIPYYADRCYPLFQLLRKGAKWNWTA FT ECENAFNSIKSALQEAPVLGHPAEGLPYRLYTDASDEALGCALQQVQPIKI FT KDLQGTRLYDQLKRAHESGKKPPKLVVHLPSSIDDSIVAEDWSDSFEDTVV FT YIERVIAYWSRTFKSAETRYSTTEREALGAKEGLVKFQPFIEGEKVTLITD FT HAALQWAKTYENSNRRLAAWGTVFSAYAPHLSIVHRPGRKHSNVDPLSRLY FT RAPPPQDSPVKEDAVSLELNPIHIDFSANPSMGKAAFMAYSITDCLEECKE FT GHISTRSSKRKKEASPQKGAAELVRPTPVTNLTDVAGELTSEYWNATNPPP FT NLLVHLEEKMLRDWVKGYAKDPHFVKIWNDPKTRVDEWVPGHRFFRNDDGL FT MFFRDADYQPRLCVPLDQRRLILEEAHEQAFEGAHQGPEKLWQKLSGRFYW FT KRMKADLVKFVQTCDVCQKIKTPNFNKYGYLIPNPIPSRPYQSVAMDFIVN FT LPWSEGYNAIHVTVDRLTKHGTFTPTTTGLDAEEFGALFVKKIICRFGVPE FT SVICDRDPRWTSDFWKGVAKFLCTKMSLSSSHHPQHDGQTEIVNRFLEVML FT RAFVANNKESWALWLPLLEWAYNASVHSSTGTTPNFLMFGFEPRTPMDFLL FT PKDTTKESVKRSNSEEWLAQLQMLRESARQAIAHAQHHQARSHNKGRKTLE FT FSAGDKVLVNPHSLEWIESKGEGAKLIARWIGPFEILQKINPNVYRLRMGD FT NYPGSPVINIQHLKKYTEDKTHLDRTTLPESFTRRLESEEFEVEKIVGHRR FT IGKKATLKYLVQWANYGPQFDTWSTASDLKNSPILLKEYRAKHNL" FT CDS 8181..10781 FT /product="Gypsy-14_LBS-I_2p" FT /translation="MDSVIHHLNPIGVATVRADRDAIWIGLDSDTALIPDA FT AIVFRNLPDPGWSVDDFSESGQILPIDTVRNPDWYRSDEQWAPWTPTAFLL FT NERPWYDQLETAVPVEERLEGWSMAEEQRLVCSGDLIRTQACVRSIVEFDQ FT CFPPQAKAPLQYPAERLAKVYSTRKLVQINAAKAKRSILQALAFMSWWTAI FT RADWESQLNDTAVEIISKLLATTKGKRGVICDLERDWPVINIPLYLQHKIP FT FFYLWDFDVRADQRFSRLNPALNLTYWAVRQGTTLDLHRDLEEDDLNKIAR FT EAVKLDHYFQQVFTYRAAVDPSILSSYPAFIIDFVGWKRRPINRSEESTES FT LAKLYYYGVFDDNEEYEHKVVVFWRWRKREPRDEYLRCQYKTSLPGEEPAG FT ILRELYKFGYAPKPGVQYDDDTGLRVVRNRSPGSSLSLLERMGGALSGGRP FT SLQDRLSDDTPRSDTPNTQSAMSDDDLMTARILEVPDTLYHPRAINSPAAW FT IRHNENLLTNARQGTADRRASQGIGSTPFRRSQSPTRISDPIHSSHERPEI FT MFQRLLRDESAKITYTTSTWFAPHFAWNPEFLEAAYLFIPDEESEARLRYW FT 
ANCWDSVGTVKRLLTIAIEHGIRFHLSLPPDSVRRFRPIIVDNLDRSSASF FT IYNVGFQEPPLLPADNAATYCATYLARMNDVLRRPHACAFIAEGGQLSWIA FT RRWSGTRLVEEFMSGPSIQITVHNRGFYDSASEDASYLSHDIVSEQEKDLL FT LGYCSGSNGCSGRWLFPPADMFNGNFDLWTGEWNAALDHIYRRLADDIARG FT KAKLRTREKWKCWIRNNERGQRRPSYLPSATDFRDVMEGISKAGLKPTWHK FT EPLDNITFPERRVD" XX SQ Sequence 10810 BP; 2967 A; 2893 C; 2610 G; 2340 T; 0 other; aaggtggaca ctgtgggaat cggttgattc atctagccga caaaccaacc tcgtcaattc 60 ctcatacaac cgcgtctatt cgaaagatct aaaaagccga cacactgtcg aactaccgtc 120 ctacgaaacc gactccccct gtggtcggta gggggggtag acccgctttc tcctcgacgc 180 cttttggagc ttctaggatc tctagtaaag ccgactcgac ttccaccacc gcaacccctg 240 cacctgcatc tggcccatct gattcgctgt ggaggagagt ccccacttcc gaagctccgt 300 cttcctcatc gattaaaacc aaaacgaccc ctctaacgcg agagcagccc ccacaccaat 360 cgtccttgat cccaagaccg aaattctcgt ctcgttcccc tacgccgaag atgtcaacaa 420 tcaatgcggc ctccagcaat ccttccccct cggcgccaaa ccaaaacgat ttgcctcaac 480 cgagctctaa cgctccttta cctgacaaca tggactccac tgaggatatg tcggccatcc 540 gtcgaggaat ggagagggga tatgccctct ctccgatgta tagccttatt attgaataca 600 accgcgagca aggaaacaac ttctccaccg cggcggaggt ctcttggatc ttgcaacaac 660 ccaacttgaa accgagcgat atcatcctca aaaaggacca tcctgatcct tacacgctag 720 tcgctgtcgg agaacctcga ttacgacgac tcctccgagc agtgaccgag ctacaaacct 780 tcctcgatag aatggcgaac ttgatagatg aacgcagcaa cgtgtttcga gtcgacccgc 840 aagatacgat gaccaccgcg ctacaaggtt gtgaaagccg atcccaattg gaagtagcgt 900 actcaatact cctcaagcgt cttttgacgg ctcaacagac cgtcagcaag tacgaggctc 960 aatatcggga tgaggacacc ccgttatcac cgacctcaac cttgccagat ctatacgacg 1020 acttcgacag gcttgacgac gtagacagcc gaatgaggta cctgctgcag aagataccgc 1080 atcatcaagg ccatcttagc tctgcggcac aggcggctgt tcatcaaggc catagctggg 1140 acgtgataca ccccacgcta cctctacaac cggaactgca acctcactca caagctttac 1200 cgctaccgat accatccagc gctccattcg aacctgattc ctcgcgtcaa tcaattaagg 1260 agggtaaaaa gaaggttgaa tggggtgagg cgtctccttg ggacgagaaa tcctcgagca 1320 tggaacaggg aagagacgac aacgaagggc tcgaaccgtc tttcggcttc caaaccccgt 1380 tcaagggagg agtcagattc 
ttcgacgttt cgaacagtcc agaccaatca gcctactttt 1440 ctacaccagg ggtagcgcca ttgcccgatg ttacagtagg actcgccacg ccctccgtaa 1500 ctcccttcgc cgacaacgtc aaacaagcta accaaccacg cttggccttg atgtcttcga 1560 aggatgctcc atcggaaaga aatccacacg ctagaccagg ccaaaccccc cccaatagct 1620 tcccatcatt caagggagga tctccaccag acgatgatcc cccaggaggg ggtagcggag 1680 gtaggggagg tggagggaac ggaggaaatc catttcctcc acgcggatca ccacgcgata 1740 actcaaatcc aaacccgaat cccggaggac cctaccgccc tggaggaggg agtggcgacc 1800 caggaggagg aggcggagga ggacctccgt ctgccgaaaa ccaagtggcc ccaccagctc 1860 cgtacggaaa cataccagcc tctatcaaga cggaattgaa ggtcgaacaa ttgcccgaat 1920 gggacggcaa tcattggacg gccatcgatt acttctggga ggttcaacag ttagcccatc 1980 taggcggttg gatccccgaa gcattagggt actggttatg gtttcgattg aaggacaagt 2040 caccagttaa atcatggttt attacgctac cgttgaacta tcagacctac atgcgttcgc 2100 actacctcaa gtttctcaaa ggcatcaagg atggatactt aggtcaccgc tggcaactta 2160 ggatgaacaa ttattataat tcacagcatt tccgagagag gaatcatgag agagagactc 2220 catcggaatt catcgtcaga cgcatagtct atacgcgcat gcttctgtcc atcagcggag 2280 gaccacttga agtcttctac atcatgagga aagcccccat aagctggggg cccatccttc 2340 ttataagtag tataaaagac tcgagcgaac tctattcatg ggttacagaa cacgaagagg 2400 cactcttgga ggcgttccgt gtttcgaaag ggggtcaagc cccctcgatt gataacattg 2460 tcagtcagct taagcaaatg ggcctcattc cggataaaga caagggcggg ggctcctcta 2520 cttatcaaag tccctctacc tatcagggtc cctctactta tcaacgccgc gctaatctag 2580 tggagaattc taacgcaaac cctagcatag agcctgtcga tgagacagta tctccttccg 2640 caccggaaaa cctccctgcc aatcaatcct cagatccttc tactaatcag cacatcctac 2700 gcgaagcgta ccaggtcctg aagcaacgcc aacgtcctcc accagtagga gggtacccct 2760 acgcgaaaaa tgaccacgtc actaccaaaa tgggtaggtt acccccgtca ccctgtaagt 2820 gctgtggaag cgcaaatcac tgggataagg aatgtcccga ctggaataca tacctcgaaa 2880 aagcgaagcg gtcagcaaat atggctgagt tttgggcgcc aaacgagtct gagaagacct 2940 acgtgaccgt gtattccatc ttgctcaatg aaagattagc tggagagatc gtcaaccaac 3000 cattattgga agattccctc actcagcagg attttaggtc ggcatcgttt ctttcacagg 3060 caactgcgga aggagtgagt aagtccagga 
aggggagtca ggtaagaaag actcgcacca 3120 ccatggaaga agttgaggat gaggattggc tagcctactt agctaaaccg aaatcctcag 3180 agtttttgat ggaggaagtg aagcaagtga aagaaaccct cgtctcccga acgaacagtg 3240 aagaggaaag agttgaaccg aattttaggt ctccctctgc acgacctgaa aatgagacga 3300 aagagacgga agagatccct cccacagagt cttctgagtg gccaggacct cccattccgg 3360 atatcagagt ccgtctgaaa aagaaacgtt tcgccccggc cggcgcgtct gcagtggggg 3420 tatccgtagt agctgtccaa ggatgggtgg gatccactag aaacaaaccg atcgacatca 3480 ggttggactc atgtgcagac gtcaccttga tttcgcagga atacctcgaa agtctacaag 3540 atcggccacc ctgtcagaat ggattgaaga tgaacttgtg gcagcttact gacaaagacg 3600 ctacgcttca gggttacgtt cgaataccga tctttttcgt gtcctcagat ggcattctct 3660 tggagacgga agcagaagcc tacgtagtcc cgaacatgac ggttcccatc ctgttgggag 3720 aagactacca cgttaactat gaacttatag ttgcccataa ggttgatttc cgctccgtgg 3780 tcaattttgt gggagttccc tattcggtcc cagctcgagg agtcagtagg acgaaggact 3840 tcgggagaat gcgtcaaagc acctgcacga cggcaagttt cattaaagcc agactccata 3900 agcggaacaa ggcaaggaag gcgaggaaaa ttaagagttt tggcattgaa aaaaggacgg 3960 tcagagcagc tgaagactat cgccttcgcc cggacgagtg ccgccgaatt agagtagatg 4020 gtcactttga agaggataag gaatggctgg tcgaaaagaa cttgttagct gctgccgacg 4080 attcgacttt tgtcgtacct aacgtgctta tttccgcgtc gaatccctgg attccagtct 4140 ctaacccatc gcctcaccca aaagtgatca gaaaaggaga cgtggtggga tacttgacag 4200 atccacaaga atacttcgat gctcctcaga cttcacaggg tcttgacaag cttgtaaaga 4260 tggccgaggc tttggtgaca attatagcgg tttcttcgag cgactcgaac cagacctcgt 4320 cgacgtcaga ggagtctgag aaaccgtcag agcaggcaga gcccagccta gaaaaggaag 4380 tcagcgaaga agaagaacca gaagcgtacg gtccgaaaac agctgagctc ccagacacaa 4440 cggagtatcc ttctttgagg atgcgagagt tcttggacgt agggtctttg ccggagcatt 4500 tgcaagaacg agcctgggag atgctagaga aacggaagaa agcatttggg ttcgatgggc 4560 gcttaggcca tcacccgagt aaagtccata ttaggacagt cgacggacaa gtgccaatcg 4620 ccgtcccaat gtatggggcg tcaccagcta agagggcggt tataaacgaa caattagaca 4680 aatggttcga acaagatgtg atagaaccat caaagagtcc gtggagtgcc cccgtggtca 4740 tagcctaccg gaacggcaaa ccacggtttt gcgtcgacta 
tagaaaatta aatacagcaa 4800 cgatcccaga cgaattcccc atcccccgtc aatctgagat tttatcttct ctttccggag 4860 cacaagttct atcttccttg gatgcactct ccggttttac acagttggag atggatgaag 4920 acgacgtcga aaaaacggcc ttcagaacgc accgaggact ctttcagttt aaacgaatgc 4980 ctttcggatt aagaaacggg ccctcaattt ttcaacgagt catgcagggt attttagccc 5040 cgtacttatg gattttctgt ctggtataca tagatgacat agtcatttat tccaagactt 5100 acgaagacca tatcgatcat ctggatcaag tcctaggggc tatagaaaaa gcaggcatta 5160 ctctatcccc agtcaagtgt catttattct attcgtccat cttattgtta gggcacaaag 5220 tttcgcgttt gggactatct acccacaacg aaaaagttcg ggctatcctg gaattgagcc 5280 gccctacaaa actatcgcag ctacaaacgt tcttgggcat ggccgtctat ttttcagctt 5340 ttatccctta ctacgcagat cgttgttacc ctctgttcca gctattgcgg aagggagcca 5400 agtggaattg gactgccgaa tgtgagaatg catttaattc tatcaagagc gcccttcagg 5460 aagccccggt actagggcat cccgctgaag gtttgccgta tcgcctgtac acggatgcct 5520 cagacgaagc gctgggctgc gccctacagc aggtgcaacc tattaaaata aaagacttac 5580 aggggacgcg actgtacgat cagctgaaaa gggctcacga aagcgggaag aaacccccca 5640 aactggttgt ccatctacca tcctccatcg acgatagcat cgtagcagag gattggtcgg 5700 attccttcga agatacagtc gtctacattg agcgagtgat cgcctactgg tctcgtacat 5760 ttaaatcagc agaaacgcgc tactcaacga ccgagcgaga ggctctgggt gctaaggagg 5820 gacttgttaa atttcaacct tttattgaag gtgaaaaagt tacgctgatc accgaccacg 5880 cagccctcca gtgggcgaag acctatgaaa attcaaatcg gaggttagcc gcatggggaa 5940 cagtattttc ggcatacgct cctcatttat ccatcgtcca tcgaccaggt aggaagcatt 6000 ccaacgtaga tccgctatcg cgattgtacc gcgctcctcc accacaggac tctccagtca 6060 aagaggacgc ggtttcgttg gagttgaacc caattcacat agacttttca gcgaacccat 6120 ccatgggcaa agcggcattc atggcctaca gcataaccga ttgtctggaa gaatgcaagg 6180 aaggccatat cagcactcga agttcaaaac gcaagaagga agcctcaccc cagaaaggag 6240 ctgccgagct agttagacca actcccgtca ctaatcttac cgacgtggcg ggggaactga 6300 ctagcgaata ctggaacgcg acgaatccgc ccccaaatct gctagtgcac ttggaggaga 6360 agatgctccg ggactgggtg aaaggctatg caaaagaccc acacttcgta aaaatctgga 6420 acgatcctaa aaccagagtt gatgaatggg tacctggtca ccgcttcttc 
aggaacgacg 6480 atggattaat gttcttcaga gacgccgatt atcagccgag gttatgcgta cctttagatc 6540 aaaggaggct gattcttgag gaggctcacg aacaggcgtt cgaaggagct catcaaggcc 6600 cagaaaagct gtggcagaag ttgagcggga gattctactg gaaaagaatg aaagccgacc 6660 tggtcaaatt cgtccaaacc tgcgacgtat gtcagaaaat taagacgccg aacttcaaca 6720 agtatggata cctcattccc aatccaattc ccagcagacc ctatcagtcg gtagcgatgg 6780 actttatagt gaaccttccc tggtcggaag gctacaatgc gatccacgtc acagtagacc 6840 gactgactaa acacggcact tttacgccaa ctacgactgg attggacgcg gaggaatttg 6900 gggccttgtt cgttaagaag atcatttgtc gctttggagt cccggaaagc gtcatatgcg 6960 acagagatcc tagatggact tcagactttt ggaaaggagt ggcaaaattc ttatgcacta 7020 aaatgtcact atcgtcctct caccacccac agcacgatgg ccaaacagaa atagtaaacc 7080 gctttttgga agtgatgctg agggcattcg tcgccaataa caaggagtcc tgggctttat 7140 ggctgcctct attggaatgg gcgtacaacg ccagcgtaca tagttcgact gggacgaccc 7200 ccaacttctt aatgttcgga ttcgaacccc gcactccaat ggatttccta ctacccaaag 7260 acacaacgaa agaaagcgtc aagcggtcga attctgaaga atggctggct cagctacaaa 7320 tgcttaggga gagcgccagg caagcgatag cgcacgctca gcaccatcaa gctcgaagcc 7380 ataataaagg tcgaaagacg ctagagttct cggcaggaga taaagtctta gttaatcccc 7440 attctctaga atggatagag tcaaagggcg aaggagccaa actcatcgcg cgctggatcg 7500 gtcccttcga gatactccag aaaatcaacc ctaacgtcta cagactcagg atgggagata 7560 actatcctgg ctcacccgtt attaacattc aacacctcaa gaaatacaca gaagataaaa 7620 ctcacctcga ccggacaaca ttacccgaat cgtttacgcg acgtttggaa tcagaggaat 7680 tcgaagtcga aaaaattgtg ggtcatcgaa gaatagggaa gaaggccact ctgaagtact 7740 tggtacaatg ggcgaactac ggcccgcaat tcgacacatg gagcactgcc tctgatctga 7800 agaactcccc aattctattg aaggagtacc gcgctaaaca caatctctag acgtaatccg 7860 caatcacgaa attatacccc cctaacgact tcacgatctc cttcactttt attctgttcc 7920 ttcattttca atcttctttc gttcttactc aaattttttt tagacaatca gacgaaatag 7980 tcggagcaca tgtcagtcgc tcttcggact tatctatctc atccatcagg gacacgtgta 8040 cgaacaaaac accgccttgt cctttctacc ctatctttct gcccatccta ccccacgtct 8100 tcattcgtag ggactgtcca cgagtgaaat ctgggtaaac gacgagtcgt gcagaggact 
8160 gtcctgctcg atttaacgtt atggattcgg taattcatca tctcaatcca atcggcgttg 8220 cgactgttag agctgaccga gacgccattt ggataggact cgattcagat acggccttaa 8280 tcccagacgc cgccatagtc ttcaggaacc tacctgaccc cggatggagc gtagacgact 8340 tctccgaatc cggccaaatc ctaccaatcg acaccgtcag aaaccctgac tggtacagga 8400 gtgacgaaca atgggccccc tggacgccaa ccgcattcct tctgaacgag cgcccttggt 8460 acgatcagct ggagacggcg gtcccagtcg aagagagatt ggagggttgg tccatggcag 8520 aagaacaacg tttagtttgc agcggggatc tcatccgcac gcaagcttgc gttcgcagca 8580 tcgtagagtt cgatcaatgc ttcccccctc aggcaaaagc tcccctccaa tatccagccg 8640 agcgcctagc caaagtctac agcaccagaa agcttgtgca gatcaacgcg gcgaaagcaa 8700 agcgttcgat tctacaagcc ctcgcgttca tgtcctggtg gacagcaatc agagcagatt 8760 gggagtcaca gctcaacgac acggcggtgg agattatcag caagctcttg gcaacgacta 8820 aggggaagcg gggtgtcatc tgcgacctcg agcgcgactg gcccgtcatc aatattcccc 8880 tatacttgca acacaagatc cccttctttt acctttggga cttcgacgtt agagcagacc 8940 aacgcttcag ccgattaaac ccagctctca atctcacgta ctgggccgtt cgacaaggta 9000 caactttaga cctccaccga gacctcgaag aggacgacct taacaagatc gcaagggaag 9060 cggtcaaatt ggatcactat tttcagcagg tcttcacata tcgagccgcc gtcgacccat 9120 ctatcctgtc atcttatccg gcgttcatca tcgacttcgt aggatggaag cgcagaccca 9180 tcaatcgaag cgaggaatca acggaatctt tagcaaagct ttattactac ggcgtgttcg 9240 acgacaacga agaatacgaa cacaaagttg ttgtcttctg gagatggagg aagagggaac 9300 cccgggatga atacctgcgt tgccagtaca agacgagttt accaggggag gaaccagcag 9360 gaatactcag agaactatac aagttcggct acgcaccaaa gcctggtgtc cagtatgacg 9420 acgacacggg ccttcgagtc gttagaaacc gctcacctgg ctcctcgctc tctctcttag 9480 aacggatggg aggcgcgctt agcggcggta gaccatcttt acaagatcgc ttatccgacg 9540 acaccccacg aagcgacacc ccgaatactc agagcgccat gtcggacgat gacctcatga 9600 ctgctcgcat cttggaggtg ccagacacct tgtatcaccc ccgggccatc aacagcccag 9660 cagcatggat ccgtcataac gaaaacctat tgacgaacgc tcgtcaggga actgcggatc 9720 ggcgagcatc tcaaggcatt ggttcgacgc ctttccgacg gtcgcaatcg cccactcgta 9780 tctcggaccc catccactca tctcacgagc gacccgaaat catgttccag agattgctaa 9840 
gggacgaatc agcgaaaata acctacacaa cgagtacatg gttcgctccc cattttgcat 9900 ggaacccgga atttctggaa gccgcgtacc ttttcatccc agacgaagaa agtgaagcac 9960 gactccgtta ttgggctaat tgttgggatt ccgtgggcac agtgaaaagg ctcctgacaa 10020 ttgctatcga acacggcata cgattccatc tatccttacc acccgactca gtgagacgat 10080 tccgcccgat catcgttgac aacctcgacc gaagctcagc atcattcatc tacaatgtgg 10140 gctttcaaga gcctccactc ctcccagccg acaacgcagc cacatactgt gcaacgtacc 10200 tcgcccggat gaatgacgtc ctacgccggc ctcatgcgtg tgccttcatt gcagaaggag 10260 gtcagctgag ctggatagca cgacgctggt cagggactcg gctcgtcgaa gaattcatgt 10320 cggggccctc tattcaaatc accgtccaca accgaggctt ttacgactcc gcatccgagg 10380 acgcttccta tctttcacac gacatcgtct cagaacaaga aaaggatctg ctgctaggct 10440 attgctctgg ctcgaacggc tgctcgggac gatggctatt ccctccagct gacatgttta 10500 atggtaactt cgatctgtgg acaggtgaat ggaatgcggc actcgaccac atctaccgtc 10560 gattggccga cgacatcgca agagggaagg ctaagctccg aactcgagaa aagtggaagt 10620 gctggataag gaacaatgaa cggggccaac ggcgcccatc ctacttgccc tcagcgacgg 10680 atttccggga tgtaatggaa ggcatttcaa aagcaggatt gaagcctact tggcacaagg 10740 aaccgctgga caacatcact ttccctgaaa gacgagtgga ctaagcggga accggaaagt 10800 gggggggctt 10810 // ID Gypsy-21_LBS-LTR repbase; DNA; FNG; 403 BP. XX AC ABFE01001545; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_LBS_; KW Gypsy-21_LBS-I; Gypsy-21_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-403 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001545; Positions 17227 17629. 
XX SQ Sequence 403 BP; 86 A; 126 C; 60 G; 131 T; 0 other; tgtaacggat gtgacgcatc tgttactctt tctccttatc taatctaatt agtcggactc 60 tttaccattc ccccttgttt tctttcaaga cgattacgcg tccttcaaac gctgacgcgt 120 actctctcgc ttacgcgtcc ccctcggtga cgcgtacata ttccttagtt cacctattcc 180 acgctcctta gccaatcaga gtctctctat tcccttaaca tggccttggc catccccatt 240 cctttgcgac cctttttacg taacccttag tataaaaaga cccaatgtat ctagctagac 300 tctcagtgaa tttagtcgct aaaacggtct atcttcgcta ccctcagtgt atcaaaaccc 360 tcctctgtta ctctccagtc acagcggtca cgcttttgct aca 403 // ID Gypsy-16_LBS-I repbase; DNA; FNG; 7225 BP. XX AC ABFE01001152; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_LBS_; KW Gypsy-16_LBS-LTR; Gypsy-16_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-7225 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001152; Positions 72129 79353. XX CC Positions [5841-6332] - Integrase core CC 'CCTT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 301..2844 FT /product="Gypsy-16_LBS-I_1p" FT /translation="MVDSPSTSSLESNVSLATLSIPMPTPGTANAPLFKGK FT RVNDFLDSLEQHADSAKIPHNHLPSYVLRYCHTKVCKVISGSAHFSADDWV FT RARTYLLDLYGSNDAVPANSPDRLRQWCLKHGETGFVTSRKEVDKYYREFT FT ALSSNLTPTRMLANDVSLCFYRGIPTSLRTKIKKRIPVANLKTSAPPEVAT FT LLNLLRAEFDEDDLDAKVDSSNLDLDSDSDSDSSSSDSEDDIDKVTITKKK FT KKPTKKKTTFEKTVPAAPIAEPVGISPVDKITKQMEDLKLAHTEFLRSMNI FT TTNNNPSVSQIMRELRCFFCDSVNHRLGLHNCPEVKACINEGLVAYTPQGR FT LARSDGSDLPRAFGSDGGVAKILRDQRGTSSQMKGKGREPSRDLPPHMTSY FT AGLQFDGEDVLSNEVYNASTTSLVPAWRAPPSSTLAVTRSQKEKEVRFDPI FT KRPDRKENEAKSFSKPKEQRDRPPTPTLSNLGPPPSFQRQPVQSTPQAFNL FT RQPASTPKPPQVNTEDAFKNRRVAPSKAKQDVEMKDEPVKAKSTPQYHFTS FT DIQEMYDLDKIVRDKVNKTTVQLELGELLALSAFLQKSVSNMTKTRREYNT FT KPVTASIVEILEDEDDYTSELVGGYDDELSYPSSESAYTNTGYVESRIGLE FT FDEATEDKEEIMTRYASAVKIHPTPQPLFAMVTGRFRGKFAGFDVIFMIDT FT GSELNLMSLEFYEKTSLAIDLDGTRWSLKGINGRPVPLGGCVRDAEIKISG FT RRFDHHVFVSREGTGKQEIILGQPWLQWYSACIQYTRQGSMSMRIWQDGDG FT DKPGCMPQGPSIVIPLCTPGAPRNTATLNLDCRTRVEEIPDIDSGK" FT CDS 2925..7142 FT /product="Gypsy-16_LBS-I_2p" FT /translation="MGTYGFDSPSFISDLEPPDAYDLILQNIAHCSPRTSR FT PLLTPWTRRALTYENSALKLQMRTKYKTVDKKVRPVPSYMPDPAGQVFLPV FT TIPSLPPLPLDPPFIANFLPTRRLTQGRLQKIIDSVPKDFLLPREIDLLVF FT VLRARDQALAFEDSERGTFSDKYFPDYDIPVIEHVPWVQDPIRIPKAIEDT FT VRQMLLDQKAAGKYEYSTASYRSRIFAVGKPKGGIRLVADVQELNRVTVRD FT AGLPPRTDDFAESFVGHVIYGLADLFSGYDGQRLGITSRPLTTFSSLIGPH FT RSCVLPQGATNSLPEFQRCTTHTLQDEIPKNGGVFVDDVGLKGPTTTYGNI FT EVAPGIRRFVYEYATTFDRFLARFIEAGITASGKKLVLATPRLHIVGTIVS FT KEGWHLEHGLVSKILKWGPLTSVTDVRSFLGTAGVGRRWIRGFSLIAKPLT FT LLTRIAVQREFYFSPEAQEAQNKLKQLVSTAPVLIKLDYDAAKLLDRLDPS FT PRPSDHGLVVVGVDSCQNGAGWILYQMVQKEKHPVIFGSCMFSLTEANYSQ FT PKLELYGVFRAIKDLRHRIWGIHFRIDVDAKFLIEMVKQPDLPNAPMTRWI FT SYIALFDYVMHHVPSQSHVAEDGLSRRIRVPEDSDDEDAEEYLDRFMGSAH FT STPSSLSLFQFANSLSSESLHRFRPLPLDKNFLLDLLSTMRRTPKTPYASF FT CTSTCANVVSFLSTSEPSQSLGEEIRKIKSVPYDPSVRDSSKGSLVKYSLL FT SVTDNFSYTGREFEFRKVCTAEVIDCVLEGEARTLEIFSYERAYMSSLKQG FT ASPPSITDNPISLDLLDATLRADNRINYKDVPSQIEVTCAVHAYGVKDKDS FT 
PEMWAEIILYLKDDIMPARCEDSTERKSFIRQTKNFFLHDKGRLWKYEHKG FT KLPRLVIVDVDRRSALIAEAHNNVGHRGRDATYKTLSERYYWPNLYDQVAY FT FVRSCNICQLRSKSRPIVAFSPTWSTGILRRFDLDTIHMPDGLNGHKYLLQ FT ATDPAMSWVEARSVAKASSENWAKFLYEEVYSRFGCVLLCLVDGGKEFRGA FT VEILFKQYGIVVVTSTPYHPEGNGHAERSHQTLTNSIMRACGKETYRWPLY FT VHAGLWAMRCSTSRVTGYTPYFLLYGQRPFFAFDFSDRTWEMLDWHKVAST FT EDLLALRMQQILRRDKKLVLAMEEQKRTRQRAVDDFNSKHEKYLTSGDFIL FT GTWVLLHETWLDSQMGNKGALRWTGPYIVHRKLHDTTYQLRELDGTVIRGS FT VAANRLKIFYYREEHQTVRTVNQSEYSLHITATSSTSVHASTLTGTLNQSN FT LTTPPYPVSVKFGKVFFPSNLSLSFTPSLIVHNVDDSHIHCRALPTIADLQ FT PTKNTYISEVQYQSHSEATSLRYIDSSNVADLQAWALETIPLR" XX SQ Sequence 7225 BP; 1917 A; 1774 C; 1573 G; 1961 T; 0 other; gtggtgaccg agacagggga tttggttttc cgttctgcga ttagacaacc cctcaagaat 60 ctttgtggta cgtcccttct cttacgattg cgctgattgc ttggctttca ggcgcatata 120 ttcacccgaa attaaaccga ataaacgccg aaagtacgca tcctttcaag acgacgctag 180 gcttctatca aattctccat ttctgtgaaa agacgcttct caaacgctaa gtcttccgtt 240 taaaattttc tcccttttgt tgtcaaactt tttctttaaa aatttatatt ttcattcatt 300 atggtggatt ccccttcaac ttcatctttg gaatccaacg ttagccttgc tactcttagt 360 atacctatgc ctacaccggg aactgctaac gctcctttgt tcaaaggaaa gcgtgtaaat 420 gattttttag attcattaga gcagcacgcg gatagtgcta aaattcctca taatcacctt 480 ccatcttacg ttcttcgtta ttgccatact aaagtctgca aggttattag cgggtctgcg 540 catttttccg cagatgactg ggttagagcg aggacttatc ttttagatct ttatgggtca 600 aacgatgcgg tgcctgcgaa ttccccggat aggctacgac aatggtgcct caaacacgga 660 gaaacgggct ttgtcacgtc tcgtaaagaa gttgacaagt attatcgcga attcactgct 720 ttatcgtcga atttgacccc cacccgaatg cttgctaacg acgtctcttt atgtttttat 780 agagggattc ccacctccct tcgtacaaaa atcaagaaac ggattccagt tgcaaatctc 840 aagactagcg cccctccaga agtcgccacg ttattaaatt tattacgtgc tgaatttgat 900 gaagacgacc ttgatgctaa agtcgattcg tcaaaccttg acctcgactc tgattcagat 960 tcggattctt cttcgagtga ttcggaagac gacattgaca aggttacgat cacgaagaag 1020 aagaagaagc cgaccaagaa gaagacgact tttgaaaaga ccgttcctgc tgctccgatt 1080 gccgaacctg tagggatcag tcccgtggac aagataacga agcaaatgga ggatcttaaa 1140 
ctcgcccaca ctgagttctt acgttctatg aacattacaa ctaataacaa cccttcagtg 1200 tctcagatta tgcgtgaact aaggtgtttc ttctgcgact cagtaaacca tcgtctaggg 1260 ttgcacaact gcccagaagt caaggcgtgt attaacgaag gtttggtagc ttacactcct 1320 caaggtagat tggcccgttc cgatggttca gaccttccac gggcatttgg aagtgatggt 1380 ggcgtagcca aaatccttcg agaccaaagg ggtacttcaa gtcaaatgaa aggtaaaggt 1440 cgtgagcctt ctcgagactt accgccccac atgactagct acgccggact tcaatttgat 1500 ggagaagacg tgctttctaa cgaggtttac aacgcgtcta caacgtctct ggttccagct 1560 tggagagcgc ctccttcgtc tactcttgcc gttacgcgtt cgcaaaaaga aaaggaagtt 1620 cgttttgatc ccattaagcg accggatcgg aaggaaaatg aagctaagtc tttttccaag 1680 cctaaagaac aaagagaccg ccctccgact cctactctta gcaatttagg gcctccgccg 1740 tcatttcaga gacaaccggt tcaatcgaca cctcaagctt ttaatcttcg tcaacctgct 1800 tcaaccccaa agccgcctca ggtgaacact gaagacgctt tcaaaaatcg aagagtcgct 1860 ccttcgaagg caaaacaaga cgtagaaatg aaggacgaac cggtgaaagc taagtcaact 1920 ccacaatatc attttacgtc tgacatacag gagatgtacg acttagataa gattgttagg 1980 gacaaggtca acaagacgac tgtccagctt gagctagggg aattgcttgc cctatcagcg 2040 tttctacaaa agtcggtcag caacatgacg aagactcgtc gagagtacaa caccaagccc 2100 gttacggcga gcatagtgga aattctagaa gacgaagacg attacacatc agaattagta 2160 ggaggatatg atgacgagct gagttaccca tcctctgaaa gcgcttatac aaacactggc 2220 tatgttgagt ctcgtatagg cttggagttt gacgaagcga cggaagataa agaagagatc 2280 atgactcgat atgcgtcagc agttaaaatt cacccaacac ctcagccgct tttcgctatg 2340 gtgactggac gttttcgcgg aaaatttgca ggatttgatg ttattttcat gattgatacg 2400 ggctctgaac taaatttaat gtccttagaa ttctacgaga aaacttctct ggcgatcgac 2460 ttagatggaa cgcgctggtc cttaaaagga atcaatggaa gacctgtacc actaggcggc 2520 tgtgtgcgag atgcggaaat caagatatct ggacgacgct tcgaccacca tgtcttcgtc 2580 agtcgagaag gaactgggaa acaagaaatc atactcgggc aaccgtggct ccaatggtat 2640 tcggcctgta ttcaatacac tcgtcaaggg tcaatgagta tgcgcatatg gcaagacggc 2700 gatggagata aaccaggctg catgcctcaa gggccttcga ttgttattcc tttgtgcaca 2760 cctggagctc ctcggaacac agcgacactc aacttggact gccggacacg tgtcgaagaa 2820 atccctgaca 
tcgattcggg aaaatagagt cggacgcgta ccccgagcct ggtcaacgcc 2880 aggacgtgtc cgtaccaact cttgggcggt ctcttcactc attaatgggg acttacggat 2940 ttgattcacc ttcgttcatt tctgatttag aaccacctga cgcttatgat ttgattttac 3000 agaatattgc tcattgttcc ccaagaactt ctcggccgct tctaacgcca tggactcggc 3060 gcgctctcac atacgaaaat agcgccctca aacttcaaat gagaactaaa tacaaaacag 3120 tcgacaagaa agttcgacca gtaccaagct acatgccaga cccggctggg caagtctttc 3180 tccctgttac aattccttcg ctaccgcctc tccctctaga cccgcctttt atagcaaatt 3240 ttctccctac acggcgtctt acgcaaggac gactccagaa aataatcgac tctgtaccca 3300 aagactttct tttgcccaga gaaattgact tactagtttt tgttctacgg gcgcgagatc 3360 aagctttagc gtttgaggat tcggaaagag gtacattttc tgataaatat tttccagatt 3420 acgatattcc ggtcatcgaa cacgtacctt gggttcaaga ccctataaga attcctaagg 3480 ctatcgaaga tacggtgcgc caaatgctcc ttgaccagaa agcagccggc aagtacgaat 3540 attcgaccgc ttcttatcga tctcggattt tcgcagtggg aaaacctaaa ggaggaattc 3600 gattagtagc agacgtgcaa gagcttaaca gagttacggt gcgcgatgcg ggtttacctc 3660 caagaactga tgattttgca gaaagtttcg taggacatgt aatttatggc ctggcagatt 3720 tgttttcagg ttacgatgga caaagactcg gcattacttc cagaccgctt acaactttca 3780 gttctttgat aggaccccac cgttcctgtg ttctccctca aggcgcgacc aactctctcc 3840 ccgaatttca acgttgcacc acgcacacct tacaagatga aatacccaaa aatggaggag 3900 tatttgttga cgacgtaggt ctcaaaggtc caaccacgac atatggaaat atagaagtcg 3960 cacccgggat tcgtcgtttt gtctatgaat acgctacaac gttcgaccgc tttctagcac 4020 gtttcatcga ggcagggatc acggcttcag gaaagaaact tgttctcgct actccacgcc 4080 ttcacattgt tgggacgatc gtctcaaaag agggatggca tctcgagcat ggtttggtct 4140 caaagattct aaagtgggga cctctgacta gcgtcacgga tgttagatcg tttctaggta 4200 ctgcaggcgt tgggcggaga tggatacgtg ggttttccct tatagcaaag cctctgactc 4260 ttcttactag gattgcggtc caacgcgaat tttatttttc ccccgaggcg caagaagcgc 4320 agaacaagtt gaagcaattg gtgtctacag ctcctgtatt aatcaagctc gactatgacg 4380 ctgcgaaact tctcgatcgc cttgaccctt cacctagacc ctcggatcat ggactggttg 4440 ttgtgggtgt cgactcttgt caaaacggag caggatggat tttgtatcag atggttcaaa 4500 aagaaaaaca tccggtgatc 
tttggctcgt gcatgttcag ccttactgaa gcgaattatt 4560 cccaacccaa attagaatta tatggagtct ttcgagcgat aaaagacctt aggcatagaa 4620 tttgggggat tcatttccga atcgacgtcg acgcaaaatt tttaatagaa atggtcaagc 4680 aacccgacct tcctaatgct ccaatgacgc gatggatttc ctatatcgct ctttttgact 4740 atgtcatgca ccatgttcca tcacaatcac acgtcgcaga agatgggctg tcaaggcgga 4800 tccgtgttcc agaagattca gatgacgagg acgccgagga gtatttggac cgatttatgg 4860 gctcagcgca ttccacgcct tcatctcttt ctctcttcca atttgctaat tctttgtctt 4920 cagaatccct tcacaggttt cgtcctcttc ctctcgacaa aaatttcttg ttggatttat 4980 tgtccaccat gcgtcgcacg ccaaaaaccc catacgcttc attttgcact tcgacttgcg 5040 ctaacgtcgt atcctttttg tctacctcag agccatcaca atctttggga gaagaaatta 5100 gaaagatcaa gagcgttcct tacgaccctt cagtcaggga ttcatctaaa ggatccctcg 5160 taaagtattc ccttttgtcg gtcactgata atttttccta cacaggacga gaattcgaat 5220 ttcgaaaagt ttgcactgct gaagtcatcg actgcgttct ggaaggggaa gcacgtaccc 5280 tcgaaatatt ttcttacgag cgcgcttata tgtcttcgtt gaaacaaggt gcgtctcctc 5340 cgtcgattac tgataatcct atctccctag atctactcga cgcgacgcta cgagcggaca 5400 acaggatcaa ctacaaagat gttccgtctc aaatcgaagt aacctgcgca gttcatgcat 5460 acggtgtcaa agataaggat tcacctgaga tgtgggctga aattatcttg tacctaaagg 5520 acgacatcat gccggcgcgt tgtgaagact ctactgaaag aaagtctttt attcggcaga 5580 caaagaattt ctttttgcac gacaagggta ggctttggaa atacgaacac aaagggaagc 5640 ttccccgact tgtgattgtg gacgttgacc ggcgttcagc cttaatagca gaggcacata 5700 acaatgtagg acatcgaggt cgtgatgcga cctacaagac cctctccgaa cgatactact 5760 ggccgaattt atatgaccaa gtcgcttact tcgtccgctc atgcaatatt tgtcagttac 5820 ggtcaaagtc acgccctatc gtcgccttta gtcctacctg gagcaccgga atcttacgaa 5880 gatttgacct cgataccatt cacatgcctg acggcctcaa cggtcataag tatcttttgc 5940 aagccactga ccctgccatg tcctgggtcg aagcgcgttc agttgccaag gcctcttcgg 6000 aaaactgggc caagttcttg tatgaggaag tctattctcg attcggttgc gtacttctat 6060 gcctggtcga cggcgggaaa gaatttagag gagccgttga aatccttttc aagcaatacg 6120 ggatcgtcgt cgtgacctct actccctacc acccagaagg aaacggacac gccgagcgct 6180 cccatcaaac cctcaccaac tcaataatgc 
gagcctgtgg taaagaaaca tatcgatggc 6240 ctttatacgt acacgcggga ctttgggcga tgcgttgttc gacttctaga gtcaccgggt 6300 atacccccta ctttcttctc tatggtcaac gtcccttctt tgcttttgac ttttcagata 6360 gaacctggga gatgcttgac tggcacaagg tcgcctcaac cgaggacctg ctggcgcttc 6420 gaatgcagca aatccttcga cgcgacaaga aactcgtctt ggctatggag gagcaaaagc 6480 gtacgcgtca acgggctgtt gacgatttca atagtaagca tgaaaaatat cttacctctg 6540 gcgatttcat tttgggaact tgggtgttgt tacacgagac gtggttggat tctcagatgg 6600 gaaacaaggg cgcattaaga tggactggcc catacattgt tcaccggaag cttcacgata 6660 ccacgtatca gttacgagag ctagatggga cagttattcg aggttccgtt gccgctaatc 6720 gtttaaagat cttttattat cgtgaagagc atcaaacggt tcgaacggtt aatcagtcgg 6780 aatactctct tcacatcacc gcaacttcct caacttctgt tcacgcgtct acgttgactg 6840 ggacacttaa ccagtctaac cttacgaccc caccttatcc tgtttccgtc aaatttggca 6900 aagtgttctt cccaagcaat ctatcccttt ctttcacccc ttcccttatt gttcataacg 6960 ttgatgactc acacatccat tgtcgtgctc ttccaacgat cgcggacctt cagcccacga 7020 aaaacacgta catatcggaa gttcaatatc aatcgcattc cgaggcaact tctcttcgct 7080 atattgactc atcaaacgtc gccgatcttc aagcatgggc actcgaaact attccgctcc 7140 gctaagtttc tcatgattct tttctttttt tttctttttt cctctttcga aatgatgagg 7200 gcatcatttt aaattttctc cctat 7225 // ID Copia-16_MLP-I repbase; DNA; FNG; 3198 BP. XX AC AECX01001151; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-16_MLP_; KW Copia-16_MLP-LTR; Copia-16_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-3198 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001151; Positions 80405 77208. 
XX CC Positions [512-1030] - Integrase core CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS join(206..1030,1034..3178) FT /product="Copia-16_MLP-I_1p" FT /translation="MFHDNKNFSLVLNDLAIFNGFISDNNLMFILLKPVSG FT LNSSSSTSTSVTEITSSLQHRCLGHVSNKYLKLMARLDSVEEFEYIDENLT FT CDICSLSKNTKLPHNKTRPRARTFLENVHVDLSGIIRTAGLNNENYFILFC FT DDFSVFRHIYPLNDKTKEEVYEVFMAYIAVAKRQTGCSLKQFTLDCGSEFL FT NSLLGPKLKELGIVLHLTSGHAPEQNGVSERGMRIVNTRARSMMLESAVPL FT CFWYLACSTAVFLTNRCVTAALEGGKTPFEMWNFKPSINHLCVFGCQAFGL FT IRKELRQSKFSPVSSEGVLVGFDNDNFNYQIFDLSSRKVYLTHHATFNKDV FT FPFKSSASEVSSVPLEPDRKSVKVRFFDDDGDFGEVVPSILPTIKSNDSAT FT VDSDHQDVPPEIVPLAKTSDEAPHRSTRSAKKLHDSLRGMCTSGAEYEFDF FT ITESFFACLPPECHLVNLISNPDPKSYKRAMASADADRWKAACDKEFASLK FT EKDVYILVDPPTDCHVIRGLWVFRKKPLVNGGVKFKACFVAMGNTQIPGQD FT YGETFAPTGKPGSLRLLVAIAAIHGWEIHQMDAVTAFLNGFLEEELYVEIP FT EGYRTALSIGKVWRMKKYLYGFKQSPKIWQDDVEGFLIEIGFQQCEIDHCI FT YIRAKGDCFTAVYVHVDDLAITGNDISKFKVEILAKWEMEDLGIAHTVVGI FT QIQRIDDLTYSMSQQQYASTVLQRFNMTNAKPAVTPLSPNVKLLKATEDEA FT QEFAKTKLPYRSVVGSLMYLAQCTRPDMAHAGGVLSQPLESPGQIHWDSAI FT HVLRYLAHTINVGIVYSGKTQVVVTGQRSFECPISHCDADWAGDVNTRRST FT TGYVFVLAGGPISWRSRLQPTVALSSTEAEYRAITESCQELLWLRNMMAKF FT GFKDPNPTVLQSDNLGAIHLTSKSVFHARTKHIEIHHHWIREVVNKGDVVL FT KHCPTHLMVADLLTKQLPKEQFSTLRKSLGLRFLG" XX SQ Sequence 3198 BP; 875 A; 623 C; 726 G; 974 T; 0 other; agcctcactg atcaagattg acgatgctag taaatgcctt aaactagctg gtggggatgt 60 ttcgttggct gttcacagtt gaggagttgc taagttctca gctggagatg agacggtatt 120 tgaactgaag aattcgcttt acgtacctga actttcaaga aacctcgtgg ctggtggatt 180 actgaagaaa caaggtgtta gagagatgtt tcacgacaat aaaaatttct cattagttct 240 aaatgatcta gcaatcttca acgggttcat ctctgataac aacctcatgt tcattctgct 300 caagcctgtg agtggtttga actcatcatc atcaacatca acttctgtca ctgaaatcac 360 ttcatcactt caacaccgtt gcttagggca tgtaagcaac aaatacctga agctcatggc 420 aaggttggat agtgttgagg aattcgagta cattgatgaa aatttaactt gtgatatctg 480 ttctctatcc aagaatacaa aacttccaca taacaaaacc agaccacgtg ctcgcacatt 540 tttagaaaat gttcatgtag atttaagtgg aattatcagg 
actgctggtt taaacaatga 600 aaattatttt attttattct gtgatgattt ttctgtgttc cggcacattt atcctttaaa 660 tgacaaaacc aaagaggagg tctacgaagt tttcatggca tatattgctg ttgccaaacg 720 gcagactgga tgttccttaa aacaatttac ccttgattgc ggctccgaat ttcttaacag 780 cctacttggt ccgaagctca aggagttagg gatagttctt catctgacat ctggtcatgc 840 tcccgagcaa aatggtgtct cagaacgcgg catgcgcatt gtcaatacca gggcgcgttc 900 tatgatgctt gagtctgcag tgcctctttg tttttggtat cttgcatgca gcactgccgt 960 gtttctcaca aaccggtgtg taacggctgc tcttgaagga gggaagactc cctttgaaat 1020 gtggaacttt tgaaaacctt caatcaatca cctttgtgta tttgggtgcc aggcttttgg 1080 actgattcga aaagaattac gacaatccaa gttttcacct gtaagctcag aaggagtact 1140 cgtcggtttt gataatgaca actttaatta tcaaattttt gacttatctt caagaaaagt 1200 ttatttaact catcacgcaa cttttaacaa ggatgtgttt ccctttaagt catctgcttc 1260 cgaagtttca tcagtacctt tagaacctga tagaaaatct gtgaaggttc gcttcttcga 1320 tgatgacggt gattttggag aagttgttcc atctatttta cctacaatca aatccaatga 1380 cagtgccact gtggattcag atcatcaaga cgtacctcca gagattgttc ctttggcgaa 1440 aacgtctgac gaagctccac atcgttcaac tcggtcggcg aagaaacttc atgattctct 1500 tcggggaatg tgcacttctg gagctgaata tgaatttgat ttcattactg aatctttttt 1560 tgcttgtcta ccgcctgaat gtcatctagt caacttgata tcaaatccag atcccaaatc 1620 ttataaacga gctatggctt ccgcagatgc cgatcgttgg aaagccgcgt gtgacaagga 1680 gtttgcttcc ttgaaggaga aagatgtcta tatcttggtt gatcctccaa ctgattgtca 1740 tgtaatcagg gggttatggg tattcaggaa gaagcctctt gtcaatggtg gtgtgaagtt 1800 caaggcatgt tttgtggcta tgggcaatac ccagattcca ggccaggatt acggtgaaac 1860 atttgctcca actggtaaac ctggttcctt gcgtcttttg gtggccatcg cagcaattca 1920 tgggtgggag atccatcaga tggatgcggt aacagcgttt cttaacggtt tcctggaaga 1980 agaactttat gtcgaaatcc ctgaagggta cagaaccgct ttgtctatcg gtaaagtatg 2040 gaggatgaag aaatatctct atggtttcaa gcagtcgcct aaaatctggc aagatgatgt 2100 ggagggattc cttattgaaa tcggatttca gcaatgcgaa atcgatcatt gcatttatat 2160 ccgtgcgaaa ggtgattgtt ttactgcagt gtatgtgcac gttgatgatt tagcgatcac 2220 gggaaatgat atttcaaaat tcaaagtcga gattttggcc aagtgggaga 
tggaggatct 2280 cggaattgct cacacggttg taggaattca aattcagcga attgatgact tgacttattc 2340 gatgtctcag cagcaatacg cctcaacagt tcttcaaaga ttcaatatga ctaatgcaaa 2400 accagcagta actccacttt caccaaacgt gaagctactt aaggctactg aagacgaagc 2460 tcaggagttt gccaagacaa aacttcctta cagaagtgta gtaggctctt taatgtatct 2520 tgcccaatgc acacgtccag acatggctca tgcaggcggt gtcttgtcac aacctttgga 2580 aagtccaggt caaattcatt gggactcggc aattcatgtt ctgaggtact tggctcacac 2640 aatcaatgtt ggcatagtgt actctggaaa gactcaggtg gttgtgactg gacaaaggag 2700 ttttgagtgt cccatttctc attgcgacgc tgattgggct ggtgatgtca atactaggcg 2760 ttcgacgact ggttacgtgt ttgttttagc agggggacca atctcttgga ggagccgtct 2820 tcaaccaaca gttgcgcttt cgtcaacgga ggcagagtat agggctatca ctgaatcctg 2880 tcaagagttg ttgtggttga ggaatatgat ggcaaagttt ggtttcaaag atcctaatcc 2940 tacggtgctt caatctgata acttaggggc tattcactta acttcaaagt cagtctttca 3000 tgctagaact aagcatattg aaattcacca tcattggata cgtgaggttg tgaataaagg 3060 tgatgtggtc ttaaagcatt gtccaacaca tctgatggtt gcagatctct taactaagca 3120 gttgcctaaa gaacaattct caacgttaag gaaaagtttg ggtttgaggt ttttgggtta 3180 attcgctttg agggggtg 3198 // ID RESTLESS repbase; DNA; FNG; 4097 BP. XX AC Z69893; XX DT 28-AUG-1998 (Rel. 3.07, Created) DT 28-AUG-1998 (Rel. 3.07, Last updated, Version 1) XX DE DNA transposon RESTLESS. XX KW hAT; DNA transposon; Transposable Element; hAT superfamily; KW RESTLESS; transposase. XX OS Tolypocladium inflatum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Clavicipitaceae; OC mitosporic Clavicipitaceae; Tolypocladium. XX RN [1] RP 1-4097 RA Kempken F. and Kuck U.; RT "restless, an active Ac-like transposon from the fungus RT Tolypocladium inflatum: structure, expression, and alternative RT RNA splicing."; RL Mol. Cell. Biol 16(11), 6563-6572 (1996). 
XX RN [2] RP 1-4097 RA Kempken F.; RT "RESTLESS."; RL Direct Submission to Genbank (01-MAR-1996)Kempken F.and Kueck U., RL Ruhr-Universitaet, Lehrstuhl fuer Allgemeine Botanik, RL Universitaetsstr. 150, Bochum, Germany, 44780. XX DR GenBank; Z69893; Positions 1 4097. XX SQ Sequence 4097 BP; 973 A; 1061 C; 1072 G; 991 T; 0 other; cagagtgcgt aatcaaccaa ctacttccaa ccgttgaagc cattgggccc tccaactgtg 60 ggtccaaccc aactgagatc gcgcttccct tgcttggcaa ccagccaatt ctgggggcca 120 gttgattggg tttagatcca acaggttcca actcgttcta attccagggg tcaaggtaag 180 actccgactt acaacacctg tcaatttccc agttttggct catctcactg cccgctgacc 240 aatgataggc tgccgagcag catcccttcg cgcatcgccg tctcgtccct gccctgtccc 300 ttctcccgcg ccagcttcgc gcttttctcg cgccattctg gcgtctggtt agattggatt 360 cttgggcccg tcattaaggt tgtgctcgta aggctgactg gtggggtctc ggaagatgag 420 gagacaccac gcgaaggtct gcggggtctg cgaagtcgct agaacatcag ttcaagggcg 480 tatgttggtt aaggcgcggg tccctgagac ctgggatcgt ggactgacag ttattggaaa 540 tctacgactc catcaatttg agagattcca tctagaccta aagcctagat caagttaagt 600 gtagtccact tgaaagccgg tttctttatc tctcaagaag tttcgtctac tctcaccatc 660 ccatgttatc gttctatata ttgacgaaga ccgctgataa taatgatcag tttctatcat 720 cacacacata gctttctaga ccaaacgcga actaggaaat acatctccgg tcatgatgga 780 atgggtagag gcccccaacg agaggaatct agtgataata ttttaactag catgaacatg 840 attccccgta gagactgggg tgttattggg gcagcgtagt cttgcgctca ggaggatgtt 900 acaggaacaa acctacccta tatgcgcctg ataggggcta ctgggtatcc cgcctattgt 960 gcaatggcag ggctggtatc tcttaactat acgggttgcg atttgccgct aagataacac 1020 gcactttaac gtctctcacg ttgcagacat tcttcttgta agtcatctag atagctcatt 1080 taaatctata aaattaggtt gccgttaaac atatagagag catgcaattt tggcctccta 1140 tcaggacgat ggtttcggtt gtgagatacc tcttctgccc tagattactc cactgcatgg 1200 gctgtatccg ctaacatatc ttagcaccac tgcaggacaa ttgttacttg cacacgacgc 1260 gtcgggtcgc gtagcagttt ctgaacgcgt caatcgtccc tgcatacggc ctgagaaggc 1320 tgctctatcg tggccgataa aatggaggca aaagacgtta ctctctgggt gtcaaatgtt 1380 gcagtacgtc gtccatcgaa gagcgtagta ttccttcgct 
gatagactcc agtagtcttg 1440 cttcccagcc gccactgcaa tggatgccga tttgcaggac cttttctggg gccaggtagg 1500 ctcggaaacg tcgttgactc tacattcttc tccctgcccg tcgaggcgct ctactcagtc 1560 ttcttctctg tccgcttcgg ctccgcagac cccgccgagg aagatactta acctcgacga 1620 gaatcccacg cctaccgaga tagcgcactt cccgttcgag ctcttcaacg acgagcttcg 1680 ggctgccacg cctccgagga ctaagggtgc caaggttgct tggtggtggg tgaaagggtt 1740 ccgtatgagg cttaagagta acgacaagaa gcttcggtgg gtgtgccgtc tctgcgtgag 1800 gaggaagtgc aggacggtct cgcacttctc ttacgaatct aacggcagtg ccaatattat 1860 caagcatctg cgagatatcc atgggatcaa ggcaagcaat tgattcctgc catcttgtac 1920 gtcatcctaa ttttgcagga tccaggaagg aaagacgaga cgccgcctag ccgcactcag 1980 actctgacag aacacttcgg agtgactgca gcccctgccg accagcgcat cttttctact 2040 atcctcggga gactcaaact cggcgcattc gaagagcttc tcgttgactg gataacacac 2100 gataatctcc ctttccgact aatcgagagt gagcgcctcc gccgactgat tgagttcatc 2160 aacccgatgt acaaggacaa ggtcccgtct tccgcggtcc tccgatctcg actgcgatct 2220 atctacaatg gggcgaaagg agcagtgacg gagcatctaa aaaccgctcg tggcaagatt 2280 cacatcgcat ttgatggctg gacgtctcgc aatcagctct cgttgcttgg agtcaattgc 2340 ttctttgttg accaactatg gcggcaccgg agactgcttc tcgcgcttcc agctgtatct 2400 ggtcggcaca cgggcgataa cttagccaac gaagtggctg atgtcttggc agagtgggat 2460 cttggaagcg accggcttgg gtatatggtc cttgataatg ccagcaataa cgacacagct 2520 atggtggctc tcggcaagga attcgggttt gacccggacg agcgccgtct ccgctgtctg 2580 ggccacgtta ttaacctcgc tgtcaagcag ttgatattcg gcgaggcggc agacgcaatg 2640 gagcatacgg gcggtagcga accggattcc gaatactatt tcgactcttt acccgcagac 2700 acccttgcgc aatggcggag gagggggcca attggtcgac ttcacaacct caacgctgca 2760 gttttgaact cgccacagca cttagagtgc ctcgtgaagt ggcaagaaga agatctcaaa 2820 agcggcgtcc ttgaacgcat cgacccagag actgggaaga agcgagtgcc gctcagaccc 2880 atcgctgaca acgagacccg atggaactct cgccaccgta tgatggtccg cgcactgctt 2940 ctccgcagat acctcaatcg gattgtcgaa aaggcagaaa gagcgtggga aaggagcaag 3000 agaaagtcag tgaagccttc aatacttgac gacaagctgt cagaagagga ctgggacgtg 3060 gttgaggttt tcattcaggt cttgcggcca ttcgatgaga tatctgttcg 
tctacaaggc 3120 aacccgaaga ctagtgagga tgatcatgtt aacaccggct ccttttggga gtactttccc 3180 tcctttgagt acctcttaac gcatttagag gagctcaagc agaatcatga cctccttgat 3240 ggcttgggag aggacagcgc tgatatggtt actacccata tcaaccttgc gtggatgaag 3300 cttaacgagt attacgataa gctctggcct gttgcttata tcggtgctgt agtccttcac 3360 ccttgctttg ggtggccggc gattcaatac cactgggagg gacacgcgga tgcgaagcgg 3420 tgggaagaag actactcgac gagactcatg aagctgtgga aagaggagta cgctaatcgc 3480 gaggttcctg gcttggcgat gccgtcgacg gcgagcagcc gaacggggct gtcaggatat 3540 ggcgcgttcc tggcacagtc gctcggcaaa cgcaaaagag cgcatccgga tggcagagcg 3600 gggtttactt cgcagccacg cccaacgctt gatgagtacg aacgatatat tcagaccttt 3660 acccacgccg acgacaaata ccaattccgc cccctttctt ggtggcagga gcacgaaatg 3720 gagtatccga acctctgtcg gatggccact gacttgctgt ctatccctac aatgtcagca 3780 gagactgaaa ggtcgtttag tagcgcagga aagatggtat cgcctctgcg aactcgtctt 3840 gatcgacaca ctattgggat ggcgcagggt atgcggtcat ggagcaggga gggtattgtc 3900 ctaccgtcct ggtagtgatc ctgggatcgg cgttaagatg tgaagatcgg agatatccgt 3960 tggttccaaa caaccaactc caacccaact gagagcagcc acatccaact tcaatccaac 4020 tgaaatatac ttggctggat gtcaatccaa ctgcgaggaa ataccagttg agttgaattg 4080 gttgattacg cactctg 4097 // ID Gypsy-45_MLP-LTR repbase; DNA; FNG; 158 BP. XX AC AECX01001170; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-45_MLP_; KW Gypsy-45_MLP-I; Gypsy-45_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-158 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001170; Positions 134581 134424. 
XX SQ Sequence 158 BP; 41 A; 37 C; 22 G; 58 T; 0 other; tgttatgatc ctattatgat catatatgct tgtatgctac tactgagttc actctatact 60 cgagtctcac agagtcagag attttacatc tctacgttgc aatctcatta taactcaagg 120 acgccctttt tagcatagac tcttgatctc atctctca 158 // ID Gypsy-46_MLP-I repbase; DNA; FNG; 5797 BP. XX AC AECX01001225; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-46_MLP_; KW Gypsy-46_MLP-LTR; Gypsy-46_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5797 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001225; Positions 217529 223325. XX CC Positions [4544-5023] - Integrase core CC 'GTCTT' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 206..5416 FT /product="Gypsy-46_MLP-I_1p" FT /translation="MPNTRKHPDSLEERLDNPERLLSHRTQPAARSASVPS FT HATAVPLLYTNLASSLQPRTPRQGERVLAASKTIAELFNLNPVPLDQLSPA FT TSSILSHVVNPPTRRRSILPGTLPDATAMSMMSGSDTIDPGGASTSIPPSG FT TSTSVLPTELTDKGKVVQEPLEETVRRIQEESAEGMARVAAENDDLRAELR FT DIKALLAEMVGQRREEQHTPKLSDQPPVVTSTPAEADTTEVPRPHPDSTMH FT GQHLSATSDDTPALNSDGHRNPFANYISPPHAVPPLFHYPAQPIPSQVPYQ FT NVSSGIPRQHDLPPGVNYWYPAPAVEKEVLARHLKDSEIPQFTCGYGDVSG FT FRLWRYRIEARFNVKGLVNDSERLKVLPAALAVPHAVQWHRTHEAELHGKT FT WEEAMNMFEDGVLPAGWLRDAKQSLRELSQKPNEDMQAYIVRARNLQDTLS FT ASNCNDQDLAERIVSGTSTLFRESAAKDNLVEGAIDARTQKWSFAVFEKKA FT LETARFLNAVEATQPRSSGRPSAQATRSTTAPPPSATSQGYQRPPVDPAAR FT SERWAAFMRSTGRCPRCKTQCPKWLGGCDARPNMTYVPFPADFPRNPPYPP FT PKAAESTSSFVARPGAPRRGPVVPVASVQVDPPTGQFPDLGKADVAAYDEL FT VAALAVHGEEYAEEFSASNPAPRPIVLELMINGVPVRALMDTAAGTNLMSN FT HLARKLRLVRRKLLKPTVVRLAIDTNASHVNLTDFAIANIKSTNVVFGSTF FT FKLAELHGDQYDVILGTPFLRKHELDVSVASRSVIQPKTGRVIFEKEKSEE FT LKQLKFKEEKEKFLMDKRHDLVAAVFDNLDVIENASDLSVREMKMLKEFEI FT LFPDDLPDVDAFDDAEFIPPELQDEKSAIRHNIVLTDPNVVINERAYGYPH FT KYEDAWRKLVDAHVAAGRLRRSDSPYASPSMVIPKADPTVPPRWVCDYRKL FT NKYTVKDRSPLPNVDEAIRRVGTGKIYSILDQVNAFFQTLMRPEDIPLTAV FT KTPWGLYEWVVMPMGLTNAPATHQRRCEEVLGELVNKICVVYIDDIVVFSQ FT TVEEHEEHLRMVLERLKAAKLYCSIKKTKLFRRRIKFLGHEISADGIYADD FT TKVEKVANWKTPKTAKQLKEFLGTVQWMKKFIDGLAEYAGKLSPLTSSKKA FT SEKFVWGEKEQDAFDNIKRIITTLPTLKNLDYDSGEPLWLFSDASGHALGA FT ALFQGLDWETSSPIAYESRQMSPAEKNYPVHEQELLAVINALNKWRMLLLG FT MKVNIMTDHHSLTHLLTQRDLSRRQARWLETLSQFDLDFKYIKGLDNSVAD FT ALSRIEDVAVMEVQSKLSQSLIQRIKDGYLSDPFCQKLSKVLPLRANCVWK FT EGLMYVDDRLVIPVTEGLQSELIQVAHDTVGHLGVVKTADRLRSEFYWQRL FT GQDVEEFVRNCDSCQRNKARTTRLPGRLQSTDVPSRPMAAISLDFVGPFPK FT VQGYDMLLTCTCRLTGFTRLIATCQKDTAETTARRLFASWLTIFGAPKSMI FT GDRDKIWTSRFWKELQLLLGVSVHLSTAYHPQADGRAERTNKTVGQMLRHY FT VAGKHGKWLQMLPTVEYAINSAVNVATGVSPMEFVLGFPPRLFPIAETDIQ FT VGGGVKSWLEKRQESWAVWRDKLWASKVNQAVGYNSRRGPDLTLEVGDWVL FT IDSKDRQKLVKGPVAKLKARYDGPYEIIEVLNEGRNVRVQLSEGDKTHNVF FT HQSKLRKYNSDEMDMAA" XX SQ Sequence 5797 BP; 1526 
A; 1263 C; 1486 G; 1522 T; 0 other; ctttttttat ctatatccga acaccaaatc aaatcctttc gaaccaatcg atcaatcgaa 60 tccgaatttt ttttttttgc aaaacgaatc cgaagcttgc gccactgaat tgaactacgc 120 tgaaactgaa ctactgaatt gctgccgatc acttcgaact taatctgcgc gcgaataact 180 acttgttgtc tcacatcagt agttcatgcc aaatacaaga aaacatccag actcgctaga 240 ggaacgcctc gacaatccag aacgattact atctcaccgt acccaacctg cagcgcgatc 300 ggcgtctgta ccgagtcacg ctacagctgt acctctgctt tacaccaacc ttgcaagcag 360 cttgcaacct cgtacaccta gacagggaga acgggtctta gcggcctcga aaaccattgc 420 tgaacttttc aacctgaatc ccgtgccttt ggatcaactt tcgccagcaa cttcatctat 480 cttatcccac gtcgtcaatc cgccaacacg acgacgttcg atcttaccgg gaaccttgcc 540 tgacgcgacc gctatgtcta tgatgtcagg cagtgacacc attgatcctg ggggtgcctc 600 gacctcgata ccgccatcgg gaacttctac aagcgtttta ccgaccgaac ttactgacaa 660 gggtaaggta gttcaagaac ctctggaaga gactgtacgc cgaattcagg aggaaagtgc 720 cgaggggatg gcacgcgttg cggcagaaaa tgacgatctc agggctgaac tccgtgacat 780 caaggcgctc ttagccgaga tggtgggcca acgtcgcgaa gaacagcaca caccaaagct 840 gagcgaccaa cctccagtag taacttccac accggctgaa gccgacacca ctgaggtacc 900 gcgcccacac cctgattcaa cgatgcatgg acaacatttg tccgccacct cagatgacac 960 tcctgctctg aactcggatg gacaccggaa tccttttgcg aattatatct cgccgcctca 1020 cgcagtccct cctttgtttc actatcctgc tcaacctatt ccgtcacagg ttccgtatca 1080 aaacgtctca tcgggaattc cgcgtcagca cgatctgcct cctggtgtta actactggta 1140 cccagcacct gcggtggaga aggaggtact tgctcgccat ttgaaggaca gtgaaatacc 1200 tcaatttacc tgtggttatg gtgatgtttc cggattccga ttatggcgat atcgaattga 1260 ggcgcgattc aatgtgaagg gtctcgtcaa tgactctgaa cgattgaaag tactgccggc 1320 agccttggcg gtccctcatg cggtgcagtg gcatagaaca catgaagccg agctccatgg 1380 aaagacgtgg gaggaggcga tgaacatgtt tgaagatggc gtcttgccgg ctggttggtt 1440 acgtgatgct aagcaatcgt tgcgcgaatt gtctcagaaa ccaaatgagg acatgcaagc 1500 ttatattgta agagctcgaa acttgcagga taccttatca gcgtccaatt gtaatgatca 1560 ggatttagcc gagcgcattg ttagtgggac cagtacgttg tttagggagt cagctgcgaa 1620 agacaatttg gtggaagggg cgatagatgc aagaactcag aagtggtcat 
ttgcggtatt 1680 tgagaagaag gcccttgaaa ccgctcgatt tttgaatgcg gtggaagcca ctcaacctcg 1740 gtcaagcggt cgtccatctg ctcaagctac gcgatccacg acagcgccac ctccatcggc 1800 gacttcacaa ggctatcagc gtccaccagt agacccagcg gctagatccg agaggtgggc 1860 ggcgttcatg agatccacgg gtcgatgccc tcgatgcaaa actcaatgcc ctaagtggtt 1920 agggggctgc gatgctcgtc caaacatgac ttatgttcct ttccctgccg atttccctcg 1980 taacccaccc tacccaccac ccaaagcggc cgagtcgacc agttctttcg ttgcacggcc 2040 gggagcgccc cgaagaggtc cagtggtgcc agtggcaagt gtgcaagttg atccacccac 2100 cgggcagttt ccagaccttg ggaaggcgga tgttgctgca tatgatgagt tggttgcagc 2160 gctggcggtc catggtgaag agtatgcaga agagttctct gcgtccaacc cagctcctcg 2220 tccgattgtc ctcgaactaa tgatcaatgg tgtgcccgtt cgtgcgttga tggacacagc 2280 cgcggggacg aacctcatgt cgaatcacct ggcacgaaag cttcggcttg ttagacgtaa 2340 attgctcaag cctactgtgg tgcgtttagc catcgacaca aatgcatcgc acgtcaactt 2400 aactgatttc gccattgcta atatcaaaag tacgaatgtg gtttttggtt caactttctt 2460 caagctcgcc gaactccatg gtgatcagta tgatgtcatt ttggggacac ctttcttacg 2520 caagcacgag ttagatgtgt cggtggctag ccgtagtgtg atacaaccga agacggggag 2580 ggtgattttt gaaaaagaaa agagtgagga attgaaacag ttgaaattta aagaagaaaa 2640 agagaagttt ttgatggata aacgtcatga tttagttgct gcagtgtttg ataatttaga 2700 tgttatagaa aatgcctccg acttatctgt tcgtgaaatg aagatgctaa aagaatttga 2760 gatccttttt cccgatgatt taccggatgt ggatgcgttt gacgatgcag aattcattcc 2820 acctgaatta caagatgaga aatctgcaat acgacataac attgtcttga cggaccctaa 2880 tgtggtgatc aatgagcggg cttatggtta ccctcacaaa tatgaggatg catggagaaa 2940 gttggttgat gcccatgtag ctgcgggacg tcttcgacga tcagatagcc catatgcttc 3000 accgtcaatg gtaattccaa aggcagatcc tactgtaccg ccgcgttggg tttgtgatta 3060 ccgaaagctc aataaatata ctgtcaaaga tagatctcct ctgcctaatg tcgatgaggc 3120 tatcaggaga gtgggaactg ggaagattta ttccatattg gatcaagtga atgccttttt 3180 tcaaacactg atgcgtccag aggacattcc cttaacggca gtaaagacac cttggggttt 3240 atatgaatgg gttgtcatgc cgatgggctt aacaaatgct ccagcaactc atcaaagaag 3300 atgtgaagag gtgttagggg agttggtgaa taagatttgt gtggtttaca tagatgatat 
3360 agtagtgttt tcacaaacgg tagaggagca tgaggagcat ttgaggatgg tgttagaaag 3420 acttaaggca gcaaaacttt actgttcaat taagaaaaca aaattattta gacgccgtat 3480 caaattttta ggtcacgaaa ttagtgcaga tggaatatat gctgatgata ctaaggttga 3540 gaaagttgca aattggaaga caccaaagac tgctaagcaa cttaaagagt ttttaggtac 3600 agtgcagtgg atgaaaaaat ttatagatgg attagcggaa tatgcaggga agttgtcacc 3660 actcacgagt agtaagaagg cgagtgagaa gtttgtctgg ggagagaaag aacaagatgc 3720 cttcgataac atcaagagaa tcattactac tctaccaact ttgaaaaatc tcgattacga 3780 ttctggggaa cctctatggc tttttagtga tgctagtggt catgctctag gtgcggctct 3840 ttttcaaggc ttggactggg agacatcttc tccaattgca tacgagagtc gtcagatgtc 3900 acccgcagag aaaaattacc cggtgcatga acaggagttg cttgctgtta tcaacgctct 3960 taacaagtgg agaatgctgc ttttggggat gaaagttaac atcatgactg atcatcattc 4020 attaacccat ttgttgacgc aacgggacct gagcaggcga caggctcgat ggcttgaaac 4080 cttgtctcaa ttcgatctgg atttcaaata tatcaaaggt cttgacaata gtgtagccga 4140 cgcattatct cgcattgagg atgtggctgt gatggaagtc caatcgaagt tgtcacaatc 4200 tctgatacaa cgcatcaaag atggctattt gtcggacccg ttttgtcaaa agctgagtaa 4260 ggtcctgccg ttacgagcta actgtgtttg gaaggagggg ttaatgtatg tggatgatcg 4320 cttagtgata cctgttacgg aggggctgca gtcggaattg attcaggtgg cgcatgacac 4380 agtaggacac ttaggggtag tgaagactgc tgatcgattg aggagtgagt tttattggca 4440 gagattaggg caagatgtag aggagtttgt ccgtaactgc gacagttgtc aaaggaacaa 4500 ggcaaggaca acacgtcttc ctgggcgact gcaaagcacc gacgtaccat cacgacctat 4560 ggcggccatt tctttggact ttgtgggtcc tttccctaag gtacaaggat acgacatgct 4620 gttgacatgt acgtgtcggc tgacggggtt cacgcggtta attgccacgt gtcaaaaaga 4680 cacggcggaa accacagctc gtcgtctttt tgctagttgg ttgacaatat ttggggcgcc 4740 aaaatcaatg attggcgatc gcgacaagat ctggacctcg cgtttctgga aggagctaca 4800 acttttactc ggagtcagtg tgcacctctc aacggcttat catccgcaag cggatgggag 4860 ggcagagagg accaacaaaa cagtcggtca gatgttgagg cactacgtcg caggaaagca 4920 tgggaagtgg cttcagatgt tacccacggt agaatacgct attaactcgg cagtcaacgt 4980 cgcaactgga gtatcaccga tggagtttgt cttaggattt ccaccacggc ttttcccaat 5040 
tgctgagaca gacattcagg taggaggggg agtgaagagc tggttggaga agaggcagga 5100 gtcgtgggct gtgtggcgtg acaagctgtg ggcatcaaag gtaaaccagg cggtggggta 5160 taactctcgt cgggggccgg atttgacatt ggaagtgggt gattgggtac tgattgacag 5220 caaggaccgt caaaagttgg tgaagggtcc tgttgccaaa ttgaaggcca ggtacgatgg 5280 gccttacgaa atcattgaag tcctaaatga aggccggaat gtgcgagtac aactaagcga 5340 gggggacaag actcataatg tgttccacca gtcaaagctg cgcaagtaca attcggacga 5400 aatggatatg gcggcatagg aaggaaagct agtacagagg gggtcaaagt acgttctctc 5460 ctttcgtatg caccgccggt ggttctacta tgtcgttttg taagcattac cttggccacg 5520 ctgtgagcat cccttgttcg accgggtttt cttgcaggct cttcgactca acgacgtttt 5580 agaggggatt gtgatagttt atttcttttt ctttttgttt ctttttactt tattagtttt 5640 aattttaatt ttacaacttt tattattctt tttcttgact tttaatcaat ttcaatacct 5700 tggttcaact tttctttgat tatgtgatag tgcttttctt tctttgtttt ttcttttctt 5760 tattttgaag ggggtttttg tttttaggag gggagga 5797 // ID Copia-8_MLP-I repbase; DNA; FNG; 4723 BP. XX AC AECX01000970; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-8_MLP_; KW Copia-8_MLP-LTR; Copia-8_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4723 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000970; Positions 129113 124391. XX CC Positions [2030-2530] - Integrase core CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1049..3208 FT /product="Copia-8_MLP-I_2p" FT /translation="MPVTLPNMAAQVHVQSTPSLSRGKKRSNVSQPPSNVS FT SDVTCNHCSMRGHSWQTCHKLANQFFKQMLENQPNLSVPQRHPEANLVQPS FT SQSSGEPIPLSAWSPAPDTFIHQPEGSSFLVTVDLCALSTNEDKRVWILDS FT GCTQTCIPSFTSGELLLSKISSTPHIQNIRVANGIGLKVTQQRLLPLPWST FT TGLDFEVIEVPGLQHSLLSISQLRSLGYSTAFLQDRVEILSSDGAVVARGS FT YRGGLPVLNDVSNFCLNSNAQSTPNTSFYQWHCLLGHIGEVGTKQALSHLN FT ISVPLPKHDVEEVKNCIHCCRGKLHKKTFRSRSAYRSKTPFSVVHSDVAFL FT PLKSKAGYPYFVSFIDDFSKFTVCYSMKTKDQVHLYFKKFVNLVKTKFNSS FT VLCLRSDNGSEYVNKNMIDFCLQEGITSLCGAPYTPESNGVAERWNRTAAE FT RLRTLFSSSGVPLFFWPECLQQIVLCFNNFPTKTNIGLCSPLSALKQPNDD FT ITLFAPFGCALEYLLPPTLCSKLSPVSQEAVFLHLVPDGKYVVAYDRARHL FT IVKTSSFKFFIQKFPFLLNPNKHPTESLPLPWSDLPVLEEPIQPYGVITRS FT KTTVTSPVPSSQPSTPSSPYIPHDTSIEIEDLMPLSPPHPTPSLPEPIPLS FT PPPVVRPSAPIAVQRSSWVTREPERFTPSAFVLDTSSPEPKSYQKAMDTEE FT SLEWSKAAKTEINTLLKLGT" XX SQ Sequence 4723 BP; 1317 A; 1042 C; 858 G; 1506 T; 0 other; tctgactgta tcttaaacaa aatctttcta caagattctt catgtttctc ttttacttat 60 ataaagattt atgtttttgg ttttgattca gaccaacagg ttatgagccc ttaaagacaa 120 tctcgaaatc catctctttc tcgatattcc aactccattt cctaatcaac tttttccatg 180 gcattgacat cagctaccaa tagtggaaaa gctcctgatc gaccaccacc acctccttcc 240 actgggacaa gaccgaagga accgttgact cctcccacta atttcgacga tctgttttct 300 gttggttcga aatggggact gttggcatta cctacgtcat cggttgtcgg tctgactctt 360 cttgagccac ctggtgataa ttctaactac agtaattggg agctcgccat gaatggttcg 420 attttaggag ctggaatggg tcacgttttg aatcctaagt tacgagatgt tgaacaacct 480 aattttcatt tgacacaaca gaataatgtg atttgttcag tactgttacg tcatgctcat 540 cctaacaatt atgctgttat gagacctcat atgggtaaca cagctgcgat gtggcaagct 600 ttgtccgatt cccataacaa tgcttcgatg ggatccaaga tgctcgttct gccacagctg 660 cgatgtggca agctttgtcc gattcccata acaatgcttc gatgggatcc aagatgctcg 720 ttctgcgacg gcttatctcg ttgcgaaatc taaactcagc atggatgaca ttgtttgtgc 780 cgcgatcagt gcatcgttac cgagtgaact cgttgcgaaa tctaaactca gcatggatga 840 cattgtttgt gccgcgatca gtgcatcgtt accgagtgat tggaatgccg ctatcagcag 900 tctgatgaat 
caacaatcag tatctgcagc ctctctgatt tcatctttaa gacgagaagg 960 tattcgacga cgggatcaag catcctcgtc tttaattctg tcagttacta ttgatcgtaa 1020 tgatcgtaac caagctctac tttgatctat gcctgtcact ttaccaaaca tggcggcaca 1080 ggttcatgtc caatctactc cgagtttgtc tcgagggaag aaaagatcta atgtttctca 1140 acctccgtct aacgtttcta gcgacgtgac gtgcaatcac tgcagcatgc gagggcacag 1200 ttggcaaact tgtcacaaac tagctaatca attctttaaa caaatgctag agaaccagcc 1260 taacttatct gtccctcaac gtcatccaga agccaaccta gttcaaccat cttctcaatc 1320 atcgggtgaa cctattcctt tatctgcttg gagccccgca cctgacacct tcatccatca 1380 acctgaaggt tccagctttt tagtgactgt cgatttgtgt gctttatcga cgaatgaaga 1440 taaacgtgtt tggattttag attctggatg tacacaaact tgtattcctt cttttacatc 1500 tggggaatta cttctctcaa agatttcctc aactcctcac attcaaaata ttcgtgtagc 1560 taacggtatt ggtctgaagg tgactcaaca acgactttta ccacttccat ggtcaactac 1620 tggcttggat tttgaagtta tcgaagttcc aggtttacaa cacagcttat tatcaatttc 1680 acagctacga tcactcggtt actcaactgc gtttttgcaa gatcgagtcg aaatcttatc 1740 ttctgatgga gctgttgttg cgagaggttc ttatcgtgga ggtttaccgg tattaaatga 1800 tgttagcaat ttttgcctta actcaaatgc tcaatctacg cctaacactt cattctatca 1860 atggcattgt ctccttggtc atataggaga agtcggaacc aaacaagctc tatctcatct 1920 gaatatttct gtacctctgc ctaagcacga tgttgaagaa gttaaaaact gcattcattg 1980 ttgcagagga aaactccaca agaaaacttt tcgttctaga tcagcctatc gctccaagac 2040 accttttagt gttgttcatt ctgatgtggc gttcctacct ttaaaatcca aggcaggata 2100 tccctatttt gtctccttta tagatgattt ctcaaaattt actgtttgtt attcaatgaa 2160 gacaaaagat caagtacacc tttatttcaa gaagtttgtt aatttggtta aaacaaaatt 2220 caattcatct gtcctctgtt tgcgttctga taacggttct gaatatgtca ataaaaacat 2280 gattgatttt tgtcttcaag aagggattac gtctttgtgt ggagcccctt acaccccgga 2340 gtccaatggt gttgctgagc gttggaaccg taccgctgct gaacgattac gtactttatt 2400 ttcttcgtca ggagtacctt tatttttttg gcctgaatgt ctacaacaaa tcgtactatg 2460 ttttaataac ttccctacta aaaccaacat cggtttatgc tcgcctttat cagctttaaa 2520 acaaccaaat gatgatatca ctctttttgc tccgtttgga tgcgcgctcg agtatctcct 2580 acctccaact ctatgctcaa 
aactgtcacc agtatcccaa gaggcggtct ttctacactt 2640 ggttccggat ggaaagtatg ttgtagcata tgacagagct agacacctta ttgtaaagac 2700 ttcatctttt aaatttttta ttcaaaaatt tccttttctc ttgaatccca acaaacaccc 2760 aaccgagtct ttgccattac cttggtctga cctgccggtg cttgaagaac caatacaacc 2820 ttatggcgtt attactcgct caaagaccac cgtaacatct cctgttcctt cttctcaacc 2880 ttctactcca tcttcaccat acatacctca tgatacctcg attgaaattg aagatttgat 2940 gccactttct ccaccacatc ctacaccttc tctaccagaa cctattccac tttcacctcc 3000 tccagtcgtt cgaccttcag caccaattgc agtacaaaga agttcttggg tcacacgtga 3060 accagaacgt ttcacaccat cagctttcgt acttgacact tcatctccag aaccaaaatc 3120 ttaccaaaaa gccatggata ccgaagaatc gttagaatgg tcgaaagctg caaagactga 3180 aatcaatact cttctcaaac taggtactta gaaacttgtg ccaagaccta ctaatcgaaa 3240 ggtgctaagg tcaatgtggg tctttaaaag aaaactattg ccctcaggag aaattgataa 3300 atataaagga agactagtcg ttctggggaa tggacaaatt gctggtgttg attttggtga 3360 tgttttctca ccggcatcta gacaggaatc taatcgagtt tgcttacggt tgctggggta 3420 aaatcatgga aagtcaaagg ctacgatatt acagcggctt tcttacatgg aaaacgttta 3480 gaagaagagg tttttatgga gcaacccgag ggttttcgtg atcaaaattt tcctacttag 3540 gtgtgtcaac taattttacc tttatatggg ttacatcaag ccacaagaaa ttggaatgat 3600 cgattcacgg catccctgat tgatgtgggt cttcgacaat cggctcacga tcctacttta 3660 ttttttaaac ttgagaacaa tgtgctggtt gatttagtaa ctgtacatat agatgacatt 3720 tcggcaacca gtgaagatca tttccttcat tctctatctt catcacttcg tttgtctttc 3780 cctatctcag cggagtcaga tttgtcgcat catctatcca tttctattga aagatctaaa 3840 aagttacgtt atttcactct tcatcaaaaa tcttatatcg tcaaaataac aaaacagttt 3900 ttctctactg aacttcaaca agttccaaca ccatacatag caggcttttg gtctttgatc 3960 ccagcagatg atgtttccac ttcaaatcca aactacggtt caattattgg ggcattgtta 4020 tggcttgctc aatgtaccag accggatatc acctttgccg tcaatcgact ttcccagttt 4080 ttgaagaaga acaatgaagg gcattggaaa gcagctttac gagtattagc ttatgtttat 4140 catactcgtg atcttggtct cactttagga ggttatgaca atcaacttca aggctttacg 4200 gattctgact gggcagagaa tctacatgat tgtttgtcta cttctggtta cttgtttaga 4260 ttaggtgatg ggtttatttc atggaagtct 
agaaaacaga agactacagc actgtcgagt 4320 acagaggcag aatacatggc attgagtgat gcggggagag aggccatgtg gttgagacaa 4380 ttacttcaag aaatttgttt tattaataat gtaccaacaa ctattcatta cgataacacg 4440 ggatcagcag cgctggcgaa caatcctgtg catcattcta gaagtaaaca catagattta 4500 cgttatcatt ttattcaaca attaataaaa gacaagaaca ttgaattgaa acaagtaaac 4560 actacttttc aattagctga tttcctaact aaagctcttc ctagacctgc atttattaat 4620 cttagaaatc aattaaaatt caacaactta tcaaaatgat tcaattttat tctcttttct 4680 ctttcttaaa aattttttta cggacgtttt cagcatgggg ggg 4723 // ID Copia-9_MLP-LTR repbase; DNA; FNG; 471 BP. XX AC AECX01000958; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-9_MLP_; KW Copia-9_MLP-I; Copia-9_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-471 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000958; Positions 64033 64503. XX SQ Sequence 471 BP; 100 A; 135 C; 89 G; 147 T; 0 other; tgtcggaagg aacacattgg tttgctgcca gggtactcac gcccttagcc ctgatcgtaa 60 ccaacaacct atctcgacca gtcaaccctc atcttgatcg gaccagtgga ttacttcagg 120 gcaaggtcca gaccagcaag gctggctagg cagggatgta tgaacatcat ggttcaggct 180 agacactaga cacatgtaca ctctcatatc acatatataa tgtactattg ttcgctctct 240 ttgttgtctc ttttcctcac caattccata catcactagc tcgccttcct cttgtcattt 300 tctctcaatt gttgtctcac tcgtgtgtgc tgatcgtgcg cacctctcag gtatgcattg 360 caatctatga ccgattccat acatcactag ctcgccttcc tcttgtcatt ttctctcaat 420 cgttgtctca ctcgtgtgtg ctgattgtgc gcacctctca gtctgctgac a 471 // ID BCBOTYPOL repbase; DNA; FNG; 1224 BP. XX AC X81791; XX DT 30-MAY-2000 (Rel. 5.04, Created) DT 30-MAY-2000 (Rel. 
5.04, Last updated, Version 1) XX DE B.cinerea pol genes of retrotransposon Boty. XX KW Gypsy; LTR Retrotransposon; Transposable Element; BCBOTYPOL; BOTY; KW RNase; Long terminal repeat (LTR); retrotransposon; KW reverse transcriptase. XX OS Botryotinia fuckeliana OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Helotiales; Sclerotiniaceae; Botryotinia. XX RN [1] RP 1-1224 RA Diolez A., Marches F., Fortini D. and Brygoo Y.; RT "Boty, a long-terminal-repeat retroelement in the phytopathogenic RT fungus Botrytis cinerea."; RL Appl. Environ. Microbiol 61(1), 103-108 (1995). XX RN [2] RP 1-1224 RA Diolez A.; RT "BCBOTYPOL."; RL Direct Submission to Genbank (20-SEP-1994)A. Diolez, INRA, RL Station de Pathologie Vegetale, Route de Saint-Cyr, 78026 RL Versailles, FRANCE. XX DR GenBank; X81791; Positions 1 1224. XX SQ Sequence 1224 BP; 403 A; 268 C; 285 G; 268 T; 0 other; tttcaagacc agggtcctga ggctttaccg gaacacaagc catgggatca tgagataata 60 ttggaggaag gcaagatgcc tgtgcacacc ccaatttatt caatgtcagc cgatgagtta 120 aaaaggctca gagaatacat cgacgacaat ttagccaagg gatggatcag ggaatccgcg 180 tcccaagtgg ccagtccaac tatgtgggta cccaagaagg atggacccga tagactagtt 240 gtagactata gaaagcttaa cacactcact aagaaggatc gatatccact tccattagct 300 acggaattaa gagatcggtt aggcggacgt acgatattta ccaagatgga cctacgtaat 360 ggttaccact tgatcagaat gaaggaaggc gaagaatgga agaccgcttt caaaacaaga 420 tacgggctat acgagtacca agttatgccg ttcgggctaa ccaacgcacc agcgactttc 480 atgaggctta tgaacaatgt gttgtcacaa tacttggata cttgctgtat atgctacttg 540 gacgacatcc tagtatattc aaacaacaag gttcaacaca ttaaggacgt tagcaacatc 600 ctcgaaagcc tatccaaggc agacttgctg tgcaaaccaa gcaaatgcga attccatgtc 660 acagagacag aattcttggg attcaccgta tcaagccaag ggctcaagat gagcaaagac 720 aaggttaagg cagtgctcga atggaagcag ccaaccacaa tcaaggaggt acaatccttt 780 ctagggttcg tcaacttcta cagaagattt atcaagggtt attcagggat tactacaccc 840 ttgaccacgt taaccagaaa agatcaagga agcttcgaat ggactgccaa agcacaggag 900 tcattcgata cgctcaaaca agcagtggca gaagagccaa 
tactattgac ttttgaccca 960 gagaaagaaa tcatagtgga gacggactcc tcggatttcg ctataggagc agttctgagc 1020 caaccgggcc agaatggaaa ataccagcca atcgcattct attcccgaaa actatcacca 1080 gctgagttga attacgagat atatgacaaa gaattactgg cgatagtcga tgcatttaga 1140 gaatggcgag tatatttgga aggtcgatac acagtacagg tgtacacaga cttggttact 1200 tcaccacaac gagcgtaccg gcgc 1224 // ID Copia-37_MLP-I repbase; DNA; FNG; 4886 BP. XX AC . XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-37_MLP_; KW Copia-37_MLP-LTR; Copia-37_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4886 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR [1] (Consensus) XX CC Positions [2212-2736] - Integrase core CC LTRs are 93% similar to each other. CC Includes a fragment of EnSpm like sequence (masked by x). 
XX FH Key Location/Qualifiers FT CDS 372..2173 FT /product="Copia-37_MLP-I_2p" FT /translation="MSSEDADIISADDSQDNSTVTGGSSTSSSADTFGTVK FT QSLTESVSADNPPGQSRTLEPSSSVMAESSDSRVFGDAIQKYSQIMTNALS FT KYKVSDDLTDENYTEWSQSLMEVFRSLEFHHYVKVKNYRNQSLSAEEHEKT FT RFNLTTYILNRLDLINKVRTRNRLTDPEDPSEILYDPFLCWSFLKEYHNRV FT SEDKLEAVTKALYACQIVKSDTLSSFIDKFENLIREFYRLKGELSDVQSAR FT MLLSAIPSLSIKMKEYIHNTVVPLTREGVATYLRKYEERHGWTCSAIREAN FT AVSIRPNKKVSGECTPDECVGPHLSKNCWSKPENADKRLAFLAKICGKSGG FT GNLNSSTPITQESSVRGAKKVFDSSVNNASASMAFLSLNVEFTEDVKDNSS FT ITSVSASPSEFDEDPDIDASVSAALSSSRDWALHDTGATHHMFKDVKFFDQ FT TTLVKIEDTSKRLKLAGGDVSLAVHSRGVTKLLAGDDSVFELKNCLYVPEL FT TRNLVAGGLLKKLGVRELFHDSKNFSLVLNGLAIFNGFISDNNLMFILLEP FT VSGSHLSSIGASITEVSSSLQHRRLGHVSNKYLKLMAKHESVEEFNYSHYT FT KCX" FT CDS 2395..4866 FT /product="Copia-37_MLP-I_1p" FT /translation="MSYIAVAERQTGCPLKQFTLDRGSEFLNNLLGPKLKE FT LGITLHLTSGYAPEENGVSKRGMRIVNTRARSMMLESSLPIRFWYYACSTA FT VFLTNLCVTATLEGGKTPFEVWYFRKPSIHHLRVFGCQAYGLIRKELRPSK FT FSPVSSEGVLLGFENDNFNYKLFDLASKKVYITHHATFNEEHFPFKTLPSV FT PSAVDNIPERHPVQVRFFDDDDDLLDDQPVPASTLIDNSHQSSEPNVVPTV FT QPVTKPVEAPRRSNRASKKVHDTLRGMCSGAEFDDDFITESFFACLPPECN FT LAGVLSSPDIPKSYKRAMASVDAKLWKAACDKEFNSLRDKEVWELVDRPKD FT KNVIRGLWVFRKKSLVDGSVKFKARFVAMGNTQIPGEDYGETFAPTGKPGS FT LRILVALASVNGWEIHQMDAVTAFLNGLLEEELYVEIPEGYRHESTIGKVW FT RMRRSLYGFKQSPKIWQDDVESFLIEVGFEQCEIDHCIYIRSVGNLFTAVY FT VHVDDLAITGNDITKFKSQIAAKWEMEDLGIAHTVVGIQINRLNHLTYSMS FT QQQYTQTVLKRFNMTDSKPAVTPLSPNVKLLQSTEEEAEEFSKTKLPYNSV FT VGSLMYLAQCTRPDMAHAVGVLSQHLKSLNKLHWDAAMHVLRYLSHTINIG FT IVYSGNDSNIVAGQRSFECPVSHCDADWAGDVNTRRSTTGYVFVLAGGPIS FT WRSRLQPTVALSSTEAEYRAITEAGQELLWLRNMMAKFGFIDPNPTVLHSD FT NMGAIHLTSKSIFHARTKHIEIHYHWIREVVKKGELTIKHCPTHLMVADLL FT TKQLPKEQFSNLRRSLGLRFLG" XX SQ Sequence 4886 BP; 1333 A; 1044 C; 1042 G; 1424 T; 43 other; ggttagttaa tcgaaaagaa tcaattttta tttcaactct tttgttgtct gtttaagacc 60 tttcccttac gcgcgatctc aattcatttc gagctcgcag gtcagttatt attgttctcg 120 aagtgactag cggatagctg tagtcttact ccggaacctg tgccctcagg tcagttgtta 180 ttgttctcga 
cgtgactagc ggatagctgt agtcttacgc cggaacctgt gccctcagtg 240 ttcattgact gctacagatt atcctgcgac gacgtcccac ctagataccc ttgtggttct 300 agagtttcct ttcgtcgaga actctttatg gtagcgggag taaccgatct ttttgaccag 360 tttttgcttg tatgagttct gaagacgccg acattatctc tgccgacgat tctcaggaca 420 actcaaccgt taccggtggt tcaagcacct caagttcagc cgataccttt ggtacggtca 480 aacaatctct taccgagtcc gtgtctgccg acaatcctcc tggtcaatct cgtactctgg 540 aaccttcttc atccgtcatg gctgaaagta gtgattcaag agtatttgga gatgctattc 600 aaaagtattc ccaaatcatg accaacgctt taagcaaata caaagtgtct gatgatttaa 660 cagacgaaaa ctataccgag tggagtcaat ccctcatgga ggtatttcga tcgctagaat 720 tccatcacta tgtgaaagtg aaaaattatc gcaaccaatc tctttcggct gaagaacatg 780 aaaagacacg cttcaatctg actacataca tcttaaaccg attagatttg atcaacaaag 840 tcagaaccag aaatcgactc actgatccag aagatccctc cgagatcctg tatgatccat 900 ttctttgctg gtcttttcta aaagaatacc ataatcgtgt ttccgaagat aagctagaag 960 cagtcaccaa agcactctat gcttgtcaaa tagtcaaatc ggacactctt tcttccttca 1020 tcgacaaatt cgaaaacctg attagggagt tctaccgcct gaagggcgaa ctttcggatg 1080 ttcaatctgc aaggatgtta ctcagtgcaa ttccttcgct atctatcaaa atgaaggaat 1140 acatacacaa cactgtagta cccttaactc gtgaaggtgt cgccacttat cttcgcaagt 1200 acgaagagcg acatggttgg acttgttcag caatccgaga agctaacgcc gtttcgattc 1260 gaccaaacaa gaaggtttcg ggtgaatgta cccctgatga atgcgtcggt ccacatcttt 1320 caaagaactg ctggtccaaa cccgaaaacg cagacaagcg tctagcgttt ctggcgaaga 1380 tttgtgggaa atccgggggt ggaaacctaa actcgtctac tccaatcact caagaatctt 1440 ctgtaagggg agcgaagaaa gtatttgatt ctagtgtcaa taatgcgtcg gccagcatgg 1500 cgtttctttc actcaatgtc gagttcactg aagatgttaa ggataacagc tcaattacct 1560 ctgtctcggc ttcaccatct gaattcgatg aagacccaga catcgatgcc tctgtttcgg 1620 ctgctctttc ttcgtctcga gattgggctc ttcacgacac gggtgcaact caccacatgt 1680 tcaaggatgt taaatttttt gatcaaacta cacttgtgaa gattgaagac accagtaaac 1740 gactcaagtt agctggtggt gatgtatcat tggctgttca tagtcgtgga gttactaaat 1800 tattagctgg tgatgattca gtgttcgaac tcaagaactg tctttatgtt ccggaactta 1860 ctcgaaatct ggttgccggt ggcttattga 
agaaattagg tgttcgagaa ttattccacg 1920 attccaagaa tttttctctg gttctcaacg gactcgcaat cttcaacggt ttcatttcgg 1980 acaataatct tatgttcata ctgctcgaac ccgtgagtgg ctctcactta tcctccatcg 2040 gcgcttcaat aactgaagtc tcctcgtctc ttcaacaccg tcgattaggc catgttagca 2100 acaaatatct caagttgatg gctaaacatg aaagtgtgga ggaatttaat tattcacact 2160 acaccaaatg taxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxaaacc 2220 cggcctcgag ctcgtacttt tttagaaaat gttcatgtgg atttgagtgg aataattaga 2280 acaactggaa ttaataacga aaactacttt attttattct gtgatgacta ctctgccttt 2340 cgccacattt ttcctctgca agacaaaacc aaggaggaag tgtacgatgt ttttatgtct 2400 tatattgccg ttgctgaaag gcaaactggg tgtcctctca agcaattcac tcttgatcga 2460 ggtagtgagt ttcttaacaa cttacttggt cccaagctca aagagcttgg aattactctt 2520 cacctgacct caggatacgc tcctgaagaa aacggtgtct ccaaacgtgg tatgcgaata 2580 gtcaatacca gggcccgttc gatgatgctc gaatcctctt tgccaattcg tttttggtac 2640 tacgcctgta gtacggctgt ttttctcacc aatctttgtg tcacagctac tcttgaagga 2700 ggcaaaactc cattcgaagt gtggtatttt aggaaacctt ccatccatca cttacgagtt 2760 tttggctgcc aagcttatgg actgatcagg aaagaactac gaccttccaa gttttcgcct 2820 gttagttcag aaggtgtttt actgggattc gaaaatgata acttcaacta taagttattc 2880 gatctagcgt caaagaaagt ttatattaca catcatgcga ctttcaatga ggagcacttt 2940 cccttcaaaa ctcttccttc agtaccttct gccgtagaca acattcctga aagacatcca 3000 gttcaagtcc gcttctttga tgacgacgac gatctgctgg atgatcaacc agttcctgcc 3060 tcaaccttga tcgacaactc acatcaatca agtgaaccta acgttgttcc aactgttcaa 3120 cctgtcacaa agccagttga agctccaaga cgttctaatc gcgcttcaaa gaaagttcat 3180 gatactctca ggggcatgtg ttccggtgct gaatttgacg atgatttcat tactgaatca 3240 ttctttgctt gcttgccgcc tgaatgcaac ttagcgggtg ttctttcatc acctgatatt 3300 ccgaaatcct ataaacgagc aatggcttct gtagatgcga aactgtggaa agcagcttgt 3360 gataaggagt tcaactcgct tagagataag gaggtgtggg agttggttga tcgaccaaaa 3420 gacaagaacg tgatcagggg gttgtgggtg tttaggaaga agtcgttagt tgatggtagt 3480 gttaagttta aagccaggtt tgttgcaatg gggaatactc agattcctgg agaagactac 3540 ggggagacgt ttgctcctac gggtaagcct ggctcgttgc 
gtattcttgt agccttagct 3600 tcagtaaacg gctgggaaat ccatcagatg gacgccgtca ccgcttttct gaatggactg 3660 ttagaagaag aactgtacgt tgaaattcct gaaggatatc gtcatgaatc aacaattggt 3720 aaagtgtgga gaatgagaag gtcactctac ggtttcaagc aatccccgaa gatctggcaa 3780 gatgacgtcg agtcctttct cattgaagtt ggttttgaac agtgtgaaat tgatcactgt 3840 atatacatta gatcagttgg caatcttttt accgccgttt acgttcacgt ggatgattta 3900 gcgataacgg gaaatgacat cacaaaattt aaatctcaaa ttgctgcaaa gtgggaaatg 3960 gaggatttag ggatagctca tacagttgtc ggaattcaaa ttaatagact taatcattta 4020 acttactcga tgtctcaaca acaatacact caaactgttc tcaagcgttt caacatgact 4080 gattcaaaac cggctgttac acctttatca ccaaacgtca aacttctaca atcaactgaa 4140 gaagaagctg aagaattttc gaaaaccaaa ttaccttaca acagtgttgt ggggtccttg 4200 atgtatctcg ctcagtgcac gaggcctgac atggctcacg cggttggagt gttatctcaa 4260 catctcaaaa gtctgaataa acttcactgg gatgcggcta tgcatgtcct gcgatatctt 4320 agtcacacca tcaatattgg tattgtgtat tccggaaatg attcaaacat agttgcaggc 4380 caaaggagtt ttgagtgccc tgtgtctcat tgtgacgctg actgggcagg cgacgttaac 4440 actcggcggt cgactaccgg atatgtcttt gtacttgctg gaggacctat ttcgtggcga 4500 agtcgtcttc aaccaacggt ggcgctttcc tcaactgaag ctgagtatag agcgattact 4560 gaggctggtc aagagctatt gtggctgagg aatatgatgg ctaagtttgg ctttattgat 4620 cctaatccta ctgttttaca tagtgataac atgggggcaa ttcatttgac ttcaaaatca 4680 atttttcacg caaggactaa acatatagaa atacattatc attggataag agaagttgtg 4740 aagaaagggg aacttaccat caaacactgt cctactcact tgatggtggc ggatctttta 4800 acaaagcaac tgcctaaaga acagttttca aacttgagga ggagtttagg gttaaggttt 4860 ctgggttgat gctctttgag ggggtg 4886 // ID TELREP1_PC repbase; DNA; FNG; 2621 BP. XX AC AL592263; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Pneumocystis carinii telomeric tandemly repetitive DNA, DE TELREP1_PC. XX KW Satellite; Simple Repeat; Repetitive sequence; TELREP1_PC; KW tandem repeat; Telomeric repeat. 
XX OS Pneumocystis carinii OC Eukaryota; Fungi; Dikarya; Ascomycota; Taphrinomycotina; OC Pneumocystidomycetes; Pneumocystidaceae; Pneumocystis. XX RN [1] RP 1-2621 RA Seeger K., Quail M., Harris D., Hall N., Wakefield A., RA Smulian G.A., Cushion T.M., Stringer R.J., Keely P.S. et al.; RT "Direct submission."; RL Direct Submission to Genbank (10-MAY-2001). XX DR Genbank; AL592263; Positions 29922 32542. XX CC Tandem repeat: bases 1-1301 and 1302-2621. XX SQ Sequence 2621 BP; 834 A; 305 C; 918 G; 564 T; 0 other; gagagaagag agtagagagt aggatgagag atgcttcttc tgtttaaaat aaaaaggatt 60 gaattattaa gagggagaaa cgtcaattga atagatcttc tattcgtaaa aaggaggcag 120 agtaggatga gagatcattt attttaaaaa ggggtaacgt caatggaatg cgactactga 180 ggaatggtgc gtccgaggaa tggcgcttct gaggaatggc actactgagg aatggcgcgt 240 ccgaggaatg tacgcgggag tataaatagg agggaaggaa atagggtggg tggggcggtg 300 gttggtggtg gttagaaagg gttaggaagg agtagggggg ggagtagaaa aaaaaaaata 360 agagggggtg ggggagagag gagagggggg ggtcgaaagt cacgtgacgg agagagaaag 420 tcacgtgaag ataagagggg gtcacgtggc gataaaagga agtaggaagt actaagtaga 480 cgtcgattca taaagtgcta gcaccgggtt aacgtcaatt cataaagtgc tagaaagagg 540 tgaacgcaag taggaaaggt gggcgtaagt aaggttaggt gacgtctatt cacaaagtgc 600 tgcttcaggg tgaacgtaag tagaaaaggt gagcgtaagt aaggttaggt gacgtctatt 660 cacaaagtgc tccgggtagg tgtacgcaga atgtgggtca gtcagcggag ggagtactta 720 gtcggggaag tgctagaaag aggtgggcgc agaagtactg ggtgggcgta agtagggtta 780 ggtgacgtct gttcacaaag tgctagttct gggtgggcgt aagtaggaaa ggtgggtgta 840 agtagtagta ggttaacgtc tgttcacaaa gtgctagtag taggttaacg cagggtgaag 900 tgctatgaag gtaaggtcaa agtcacgtga ccagtagaag tcacgagaat aggtaggaag 960 gagttgtagt aggtaggttc gagtcctgag attacgcaga gggggcagga aacgcaaaat 1020 gaagtgctat gaaggtgggg tcaaagtcac gtgacctagg ccgaagtcac gtgatgtaaa 1080 gttgattacg taagcaagaa gtcacgtgac gtagaggaga tcacgtgaca ctgaccaaga 1140 gagcacgtga ctttgaccaa gagaacacgt gactttggaa tgaaaagata cgggaaatct 1200 gaaacaagca aacgaaagct aagaatttaa atattacgcc accaagttta ctggttaggg 1260 
gggttaggtg gttagtacgt tagggttagg gttagggtta gagtaagagt agagagtaga 1320 gagtagagag tagaataaga gatacttctt ttgttttaaa taaaaaggaa tgaattattt 1380 taaaaaagaa gagagtaacg tcaattgaat agattttttc ttcgtaaaaa ggaggcagag 1440 taggatgtga gatcttttct tttaaaaagg ggtaacagaa ttggatcgat cggcacttct 1500 gaggaatgcg actactgagg aatgtacgtg ggaggaatgt acagggtaca ggaacagggt 1560 ccaggggtgc gtgggggtat aaataggagg gaaggaaata gggtgggtgg ggcggtggtt 1620 ggtggtggtt agaaagggtt aggaaggagt agggggggga gtagaaaaaa aaaaataaga 1680 gggggtgggg gagagaggag aggggggggt cgaaagtcac gtgacggaga gagaaagtca 1740 cgtgaagata agagggggtc acgtggcgat aaaaggaagt aggaagtact aagtagacgt 1800 cgattcataa agtgctagca ccgggttaac gtcaattcat aaagtgctag aaagaggtga 1860 acgcaagtag gaaaggtggg cgtaagtaag gttaggtgac gtctattcac aaagtgctgc 1920 ttcagggtga acgtaagtag aaaaggtgag cgtaagtaag gttaggtgac gtctattcac 1980 aaagtgctcc gggtaggtgt acgcagaatg taggtaagtc agcggaggga gtagtcggtc 2040 ggggaagtgc tactgggtgg gcgtagaagt actgggtggg cgtaagtagg gttaggtgac 2100 gtctgttcac aaagtgctag ttctgggtgg gcgtaagtag gaaaggtggg tgtaagtagt 2160 agtaggttaa cgtctgttca caaagtgcta gtagtaggtt aacgcagggt gaagtgctat 2220 gaaggtaagg tcaaagtcac gtgaccagta gaagtcacga gaataggtag gaaggagttg 2280 tagtaggtag gttcgagtcc tgagattacg cagagggggc aggaaacgca aaatgaagtg 2340 ctatgaaggt ggggtcaaag tcacgtgacc taggccgaag tcacgtgatg taaagttgat 2400 tacgtaagca aggaagtcac gtgacgaaga gttgatcacg tgacactgac gaagagagca 2460 cgtgactttg accaagcgag cacgtgactt tggaatgaaa agatacggga aatctgaaac 2520 aagcaaacga aagctaagaa tttaaatatt aacgtcaccc ggtatccggt tagggggtta 2580 ggtggttagt acgttagggt tagggttagg gttagggtta g 2621 // ID Copia-1_CDC-I repbase; DNA; FNG; 4359 BP. XX AC NC_012867; XX DT 06-FEB-2011 (Rel. 16.02, Created) DT 06-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Candida dubliniensis genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_CDC_; KW Copia-1_CDC-LTR; Copia-1_CDC-I. 
XX OS Candida dubliniensis OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-4359 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Candida dubliniensis genome."; RL Direct Submission to RU (06-FEB-2011). XX DR Genome; NC_012867; Positions 1377250 1381608. XX CC Positions [1550-2062] - Integrase core CC 'AATAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 5..2830 FT /product="Copia-1_CDC-I_1p" FT /translation="MSLYNIFTGLTIPSEYILLSKANFTRWRKNFITIAEG FT VSENFGNYVKDVPLAAGASANNLQFNKQVDLLLQLSVDKEILREARTLGSM FT GKALYLEIVEEYSQMFTIDKVQVITSLWDKLLDLLVDIKKRLVTQKEFFQF FT WNLLTADEHEGILPFVWLHLSKSNFNLKYLEGFDPSLTVTGIKRFLTLHPE FT VNQTSGVLSLPVNYINNTTLSTIQITDANVEAKFRYQCFNCFGLGHTSSKC FT ALPRRGSIKIPNLEKKLDYYRQKKVFNRRRGGTNADESRIAETVSDSLSNS FT TKFTSTEANKWSNNQGSKNANVIIVLSVESLTNPTSASNFIIDTGASVNLC FT NDVSLLHDYTHFSEPHSVVAANGESLKVFGHGTLKFNHNSVEVEILYVGFA FT PNVAVNLLNPKSLIRGPQDSITLSHEGVVHSTLGKIGTFGDTSNCVMSPIV FT NPMSAAICAVLNRDQVATLHHSFGHPNATSFKKMLDLAGHVAKTADIKSPC FT DSCLQTKNFQLFPKASDGPHTKTPLQIIHLDVAGPFGGPAVDLSKVFLVIV FT DDFSRHKWVFPLQSKSDATEVIINWIRHWERYFAGRGEYKVSSIRSDNGGE FT FLNQDMSLFCLKQGIRHERTIPYNSHQNGKAERAIRSIMDKLRTLLCQSGL FT PSTFWCYSTIMAAHLLNITPSEVLNYQTSYEKWYGSAPKYSKLHPFGSTAY FT AHVPTTYRSKLQPNGVKCIFLGYPQTQSGYLLYDIQTKTIVVAKDVKFVDS FT EFLASSIDFSEINATKLTIPGVTRSSTTSETFVPSRTSQNFPTEPTAVVTS FT DEIDIIDNPSPHQSPEHSPVLTTVSSTPSSPSLPPPPIVQEGSEYEYSSDV FT STLLSSNSTVSDNLDLVIDGFGMMVDKSICGGDNTYVVQNLEVNEHSNKVY FT AVTKAHKKYKIDNVQIDSSASGCFTEYKWKLFYNQTTN" FT CDS 2865..4349 FT /product="Copia-1_CDC-I_2p" FT /translation="MQTQEKDNWLEAMSAEMAAHQTNQTWIEVPRPTNKQI FT LTCRWVFSKKPGNKFKARLVAHGHQQLKNQEFKETFSPVIRTESIKILLAV FT SALTNKIVHAMDVNNAFLNGILDSEVFMTQAPGFESENNTVYKLVKSIYGL FT REAPAVWNRVLSDVLVKDGFKRVESEISLYLKKGIMVGVYVDDILISAETE FT GEIDKVKLLLKSNFRMKDMKGITKFIGMNIKQSPLGIEISLSDYIEKMLED FT FGMTECNTVRTPTMSGLDLETIDAEPKLCDATEYRSLVGKLIFSANVCRFD FT 
TAYITGVLARHFHAPEERHYKAAKHVLRYLKGTKDFALSYNNHEGIQVFCD FT ADWASTSRDRKSLTGYLVKLGGSAISWSSKKQHSVSLSTTESEFYAMCSVA FT KEIVWILLVLDPMDLAIELPITVYSDNQSAISLASHPTLHARTKHISIRCH FT YIRDLIASGIIVFRHISTVEMQADLLTKGLEHVKFTRLLNKCGLKKSSLH" XX SQ Sequence 4359 BP; 1291 A; 855 C; 877 G; 1336 T; 0 other; ggttatgagc ctttacaata tcttcactgg cctcaccatt ccatcggaat acatcttgct 60 gtccaaagca aatttcacca gatggagaaa gaacttcatt actattgcag aaggggtttc 120 tgaaaatttt ggaaactatg tcaaagatgt tccccttgct gctggagcta gtgctaacaa 180 tttgcaattc aataagcagg ttgatttgtt attacaatta tcagttgata aagaaatttt 240 aagggaagct cgtactttgg gaagtatggg caaagctctt tacctggaaa ttgttgaaga 300 atacagtcaa atgttcacta ttgacaaagt tcaagtcatt acctcgttat gggataaatt 360 gcttgatctg ctggttgaca tcaagaaaag attagttacc caaaaggaat tttttcaatt 420 ttggaacctg ttgacagcag atgaacatga aggaatttta ccatttgttt ggcttcactt 480 atctaagtcc aactttaacc tgaagtatct tgaaggtttt gatccttcgt taacagtcac 540 tggtatcaaa cggtttctca ctttacatcc agaagttaac cagacatccg gtgtcctgtc 600 attgccagtt aattatatca acaacacgac tctgtccaca attcagatca cagacgctaa 660 tgttgaagca aaattcagat atcaatgttt taattgcttc ggtttaggtc acacttcttc 720 aaagtgtgca ctgcctagac ggggaagcat taagattcca aatcttgaaa agaaattgga 780 ttattaccgt caaaagaaag ttttcaatcg ccgtcgtggt ggtaccaatg ctgacgagtc 840 tagaattgct gaaacagtgt ctgactcatt gtcgaactca actaagttta ccagcactga 900 agctaataag tggtctaaca atcagggatc taaaaatgcc aatgtcatta tagttctgtc 960 cgtcgagtcc ctgactaacc ctactagtgc ttcaaacttt attattgata ctggggcgtc 1020 tgtcaactta tgcaatgatg ttagtctttt acatgattat actcattttt cagaaccaca 1080 ctctgtggtt gctgctaatg gtgagtccct taaggtcttt ggtcatggta ccttgaaatt 1140 caatcataac tccgttgaag ttgagatctt atatgttggc tttgctccta atgtggccgt 1200 taatttactc aatcccaagt cacttattcg tggaccacaa gactctatca ccttaagcca 1260 tgaaggtgtt gtgcattcca cacttggaaa aattggtact tttggtgata cgtcgaactg 1320 tgttatgtcg ccaattgtca atcctatgtc tgctgccatc tgtgcagtgt tgaatcgtga 1380 tcaagtggcc accttacatc actcttttgg tcatcctaat gctacctctt tcaaaaagat 1440 gttagacctg gccggtcatg ttgctaaaac 
tgcggacatt aaaagtcctt gtgattcatg 1500 tttgcaaact aagaatttcc aactgttccc taaagcatca gacggtccac acaccaaaac 1560 gccgcttcag attattcatc ttgatgttgc cgggcccttt gggggtcctg ccgtggactt 1620 atcaaaagtt tttctcgtta tagtggatga ttttagtcgc cacaaatggg tgtttccttt 1680 acaatcgaaa tctgatgcca ctgaagttat aatcaattgg atacgtcact gggagcgcta 1740 ttttgctggt cgtggtgaat ataaagtgtc atccatacgg tctgataatg ggggagaatt 1800 cctcaatcaa gatatgtcat tattctgttt gaaacaaggg attcgtcacg agcgtactat 1860 tccgtacaat tcccaccaaa atggaaaagc tgaaagagct attcgatcaa tcatggataa 1920 actgaggacc cttctttgcc aatcaggttt accttctacc ttctggtgct attctactat 1980 catggctgcc caccttctta acatcacacc atctgaagtg ttgaactacc aaacatcata 2040 tgaaaaatgg tatggcagtg ccccaaagta cagtaagtta cacccatttg ggagtacagc 2100 gtatgcgcat gtacccacga cgtatcgatc caagcttcaa cctaatggtg ttaaatgtat 2160 ttttcttggc tatcctcaaa ctcaatctgg ttacttgcta tatgatattc aaaccaagac 2220 cattgtggtt gccaaagatg ttaaatttgt ggactcggag ttcttggctt catctattga 2280 tttttctgaa ataaatgcaa ctaagttaac tattcctggt gtcactcgaa gctccaccac 2340 atcggagact tttgtccctt ctcgaacgag tcaaaatttt ccaactgaac caacagctgt 2400 tgtgacatct gatgaaattg atattattga caatccatct cctcaccaat cacctgaaca 2460 ttcacctgta ttaactactg tttcctcgac tccatcttct ccatcactac cacctcctcc 2520 catagtccaa gaaggatcgg aatatgaata ctccagcgat gtgtccaccc ttcttagctc 2580 caattctact gtttctgata accttgactt agtcattgat ggttttggta tgatggttga 2640 taaatctatt tgtgggggag acaacacata tgtggttcaa aacttggaag tgaatgaaca 2700 ttcaaataaa gtttatgcag tgactaaggc ccacaagaag tataagattg ataatgtcca 2760 aatagattca tctgcatctg gctgcttcac tgaatataag tggaaattgt tttataatca 2820 aacgaccaac tgaacaaggg attccaaaca catttaaaca agcaatgcaa actcaggaaa 2880 aggataattg gctagaagct atgtcagccg agatggctgc tcatcaaacc aatcaaacct 2940 ggattgaagt cccacggcca accaataagc aaattcttac ttgtcgctgg gtattttcga 3000 agaaaccagg aaacaagttt aaggctcgct tggtggcgca tgggcaccag cagctgaaaa 3060 accaggagtt taaagaaact ttttcacctg taataagaac tgagtctata aaaattttgc 3120 ttgctgtttc cgcacttacc aacaaaatcg tccatgcgat 
ggacgtcaat aatgcctttt 3180 tgaatggcat tttggattcc gaggtcttta tgactcaagc tccaggcttc gagtcagaaa 3240 acaacaccgt atacaagttg gtaaagtcta tttacggact tcgtgaagca ccagcagtgt 3300 ggaaccgtgt gttatctgat gtattagtta aagatggttt taagagagtt gaatcagaga 3360 taagcttata tttgaaaaaa ggaattatgg ttggtgttta tgttgatgat attctaattt 3420 cagcagaaac agagggagag attgataagg ttaaactgct cctaaagagt aatttccgta 3480 tgaaagatat gaagggtata accaagttca ttggaatgaa tatcaaacaa tctccattgg 3540 gaatcgaaat ctcacttagt gattatattg agaagatgtt ggaggatttt ggaatgactg 3600 aatgcaatac cgtgagaaca cctactatga gtggtttaga cttagagact atcgatgctg 3660 aaccaaaact atgcgatgct actgaatata ggtcgcttgt cgggaagttg attttctccg 3720 ccaatgtgtg tcgttttgac acagcttaca taactggtgt gttagctcgt catttccatg 3780 cacctgaaga acgacactac aaagctgcta aacatgtgct tagatattta aaaggcacga 3840 aggattttgc gttaagttat aataaccatg agggtataca agtattttgt gatgcagatt 3900 gggcatctac ttcaagagat agaaagtctt taactgggta tttggtcaaa cttggaggtt 3960 cggctatcag ctggtcttca aaaaagcagc actccgtctc tttgagtaca acagagagtg 4020 aattctatgc tatgtgctca gttgctaaag aaattgtatg gattctactg gttttggatc 4080 cgatggacct agctattgag ttaccaatca ctgtatacag tgataatcag tccgcaatta 4140 gtttagcgag tcatcctacg ttacatgcac ggactaaaca tattagtatc cgttgccatt 4200 atatcagaga tttaattgcg agtgggataa tcgtgttccg tcatatatct actgttgaaa 4260 tgcaggcgga tttattgacc aaggggttgg aacatgttaa gtttacaaga ttactaaaca 4320 agtgtggttt aaagaagagt agcttgcatt aagggggaa 4359 // ID Gypsy-4-LTR_AF repbase; DNA; FNG; 185 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Long terminal repeat of the Gypsy-4_AF LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-4_AF; Gypsy-4-I_AF; KW Gypsy-4-LTR_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. 
XX RN [1] RP 1-185 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-185 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-4_AF, a family of gypsy LTR retrotransposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 66-66 (2006). XX DR [2] (Consensus) XX CC This is a long terminal repeat of the Gypsy-3_AF LTR CC retrotransposon. Gypsy superfamily. It is characterized by 5-bp CC target-site duplications. XX SQ Sequence 185 BP; 44 A; 47 C; 37 G; 57 T; 0 other; tgtcgcatag cgtacgctaa tccatgtatt gcttatccat gtatctaacg gcctatggtt 60 tatccgtccc gttgcgcagt tggttccttt gctgaaccgt actgttatcc aagacgggta 120 gacagatagc cccatcttat gagaatacaa tcaacttacc attcaaatcg tgttggtctg 180 ccaca 185 // ID Copia-63_MLP-LTR repbase; DNA; FNG; 350 BP. XX AC AECX01000530; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-63_MLP_; KW Copia-63_MLP-I; Copia-63_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-350 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000530; Positions 196375 196026. 
XX SQ Sequence 350 BP; 89 A; 83 C; 46 G; 132 T; 0 other; tgttgagata attcctacag tcctacgtga ctaaacacaa ttacatcata atgacatatt 60 ccaaattgtc atctcagact gtcacacaac tgtattacgg aattcctctc tttagtcatt 120 agtgtgcttt acaacatgtc acatgtggga taaatctaca agttgttttt ctatataacc 180 tccctgaagt agcgtaagga aaggtggtct ttttcctcac ctttccttta tctctttcca 240 cgctgcttca ggtaggtcta gttctctctt tttctctttc atatatgaaa tatatttctc 300 acctttcctt tatctccttc cacgctgctt cagatcataa attgatctca 350 // ID Mariner2_AO repbase; DNA; FNG; 1888 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE A family of Mariner/Pogo DNA transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner2_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-1888 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-1888 RA Kapitonov V.V. and Jurka J.; RT "Mariner2_AO, a family of Mariner/Pogo DNA transposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 31-31 (2006). XX DR [2] (Consensus) XX CC This is a family of Mariner/Pogo DNA transposons. It encodes a CC 558-aa Mariner2_AOp TPase. 
XX FH Key Location/Qualifiers FT CDS 95..1768 FT /product="Mariner2_AOp" FT /translation="MPPKAARKLKDSIEQEGRILLAISAIKKQEIRTIAEA FT ARIYNIPRTTLRRRLNGHTFRAETRANGHKLTQNEENSLVHWILSLDQRGA FT PPRPAHVREMANILLLKRGFSDTPTTVGENWVYTFIKRREELKTQFSRRYN FT YQRAKYEDPKLLHKWFERVQITIMQYSIQPDDIYNFDETGFAMGLISTTKV FT VTRAELAGRPFLLQPGNREWVTSIKYISSRGPLPPCLIFKGKVHIEGWYEM FT GLPSDWRIETSANGWTTDEIGLRWLQQLFIPFTAGRTVGRYRLLILDGHGS FT HLTPQFDDICSQNNIIPLCIPAHSSHLLQPLDIGCFGPLKRAYGQLVENKM FT RLDFNHINKLDFLEAFPQARAQIYTTSNICSGFSATGLIPFNPKRVLSQLN FT IQLEATPPGSRPSSRSTNSVPKTPHNLKQLQKQETTLKKLLRARTKSPDSP FT TKIVIKQLFKGYERALNKATITKQEARELRAAHERILKKKKRSTRQLPIES FT GASVQEAQELIQGRNSTIEPITTASVDIGAPVESQRIRAPPRCSGCNILEH FT KITQYPNRQTI" XX SQ Sequence 1888 BP; 562 A; 431 C; 393 G; 502 T; 0 other; acgtaatcca cgggcgagcg gccatcccgg gcgagcggcc acgtcaacca atttcaacca 60 cgcttgaaga atctatatat ttcaaccacc aactatgcca ccaaaagcag ccagaaagct 120 aaaagattct attgaacaag aaggtagaat tctactagca atttcagcta ttaaaaaaca 180 agaaattcgc actatagctg aagcagcacg tatctataat attccacgta ctacccttcg 240 gagacgcctg aatggccata cttttcgagc cgaaacgcgc gccaatggcc ataaattaac 300 tcaaaatgag gagaattcgc tggtacactg gattctatcg ctagatcagc gtggagcacc 360 tcctcgacct gctcatgtac gagaaatggc caatatctta cttttaaagc gtggttttag 420 tgatactcct actactgttg gtgaaaactg ggtatatact tttattaaac gccgtgagga 480 gttaaaaact caattttctc gccgctataa ctatcagcgc gctaaatacg aggatcctaa 540 gcttttacat aagtggtttg agcgcgtaca aatcactata atgcagtata gcattcaacc 600 ggacgatatc tacaactttg atgaaactgg ctttgcaatg ggcttgatat ctactactaa 660 agtagttact cgagctgagt tagctggtag accttttctt ctacagccag ggaaccggga 720 atgggtcact tctattaagt atattagctc tagggggcct cttccacctt gccttatctt 780 caaaggcaaa gtccatattg agggctggta cgagatgggt ctaccatcag actggcgtat 840 agaaacgagt gcaaatggct ggactaccga tgagataggt ctacgttggc tgcagcagct 900 atttatacct tttactgctg gccgtacggt tggtcgatat cggcttttaa tactcgatgg 960 tcacggtagt catttaacac ctcagttcga cgatatatgc agtcaaaaca atattatacc 1020 actctgtata cctgcgcact catctcacct acttcagccg cttgatatag 
gctgctttgg 1080 ccccctaaaa cgcgcgtacg gacagctggt tgagaataag atgagattag acttcaacca 1140 catcaataaa cttgactttc ttgaagcctt tcctcaggcg agagctcaga tatatactac 1200 tagtaatatc tgcagcggtt tctcagctac tggtcttatt cctttcaatc ctaagcgtgt 1260 attatctcag ctaaatattc agctagaagc aacacctcca ggtagtagac ctagtagtag 1320 gtctactaat tcagtgccta aaacgcctca taatctaaag caactacaaa aacaggaaac 1380 tacactgaag aagctgctta gagctcgtac aaagagccct gattcgccta ctaagattgt 1440 gataaagcag cttttcaaag gctatgaacg agctttaaat aaagctacta ttacaaagca 1500 ggaagccaga gaactacgcg ccgcgcatga aagaatactt aaaaagaaaa agcgctctac 1560 tagacagctg cctatagaat caggcgcttc agttcaggaa gctcaggagc ttatacaagg 1620 cagaaattct actatagagc ctataactac tgcctcagta gatataggcg ctccggtaga 1680 aagccagcgt atacgtgctc caccaaggtg ttctggctgt aatatactag aacataaaat 1740 tactcagtat cctaatcgtc agactattta aattttttat agaaatgcac atttttcggt 1800 gttttgaata gcttcatttc taggaggtgc gtagagttct ggtgtggtgg ccgctcgccc 1860 gggatggccg ctcgcccgtg gattacgt 1888 // ID Copia-23_MLP-LTR repbase; DNA; FNG; 360 BP. XX AC AECX01001016; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-23_MLP_; KW Copia-23_MLP-I; Copia-23_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-360 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001016; Positions 3737 3378. 
XX SQ Sequence 360 BP; 96 A; 63 C; 73 G; 128 T; 0 other; tggatgaagg tgagtaattc agtttcgcgg gataggtact agattcaagg ttcactttct 60 gaatcataca atccctgctc tgtcaggtaa gtctctttaa tgttttgttt gttttcttat 120 taacttttag atagtttaat catgtgatgg aataataaaa ctgatccaag ttaatctcat 180 tacttcttgg atgaaggtga gtaattcagt ttcgcgggat aggtactaga ttcaaggttc 240 actttctgaa tcatacaatc cctgctccgt caggtgagta attcagtttc gcgggatagg 300 tactagattc aaggttcact ttctgaatca tacaatccct gctccgtcag atttgtaaca 360 // ID PCretro6_I repbase; DNA; FNG; 4239 BP. XX AC DQ097838; XX DT 08-MAR-2006 (Rel. 11.02, Created) DT 08-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Phanerochaete chrysosporium RP-78 Ty1/copia LTR retrotransposon DE (internal). XX KW Copia; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; PCretro6_LTR; internal portion; PCretro6_I. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-4239 RA Novikova O., Fursov M., Shutov O. and Blinov A.; RT "Divergent groups of LTR retrotransposons from Phanerochaete RT chrysosporium."; RL Direct submission to GenBank (2005). XX DR EMBL/GenBank/DDBJ; DQ097838; Positions 409 4647. 
XX FH Key Location/Qualifiers FT CDS 74..4237 FT /product="PCretro6_I_1p" FT /translation="MTASANSGSDSSSISASVRIEPLQGSENIAPWKVKIM FT DILTSLGLEDTITSDPPQAPEDADKATLIQHEKAKAEFKRRSLQALSHIRL FT RVSDAVLVYISAATTAKEAWDTLMQTFQPSGMYSYILAYRKFYQTRCPDGG FT DIPEHLGQMRRLRDEIHTIAGTARISDREFAFALLISLPDSWDQFIQAIDA FT TADNLTSAQVIARILQEDKRRKEREGAKGDSALLAKSKKKARPRCHNCGKQ FT GHLKKDCWASGGGAADKPKANDAVDGEFGFIAAEGQNSADPEPDIEKALSA FT LDESRLWYADSGATTHVANARELFHKYQPTAGGASISGLGGVSRIAGRGSV FT KLKCKVGNRIRKIWLHNVVHVPGAPHNLLSLTRADDAHCRYVGGKGTLTMY FT GPDGKILMVGRKLEGPGQLYRLSVAAELADQANVAVLGKRTWEEMHRILGH FT AHLASIKRLCLGLAEGVDVDMTSDDQFQCESCVRAKQHVQPFPDQATRTPD FT KLKVGEIVASDTWGPAQVRSVHSFYYYMSMTDLKSRFSGVYFSAQKSGLAK FT MALTEFRGLIRRMSRTGSSIMCLRIDNGTEFINNEFLQYCKSQGIVVETTA FT PYSPAQNGVAERLNRTLRESARAMLLAAELPRKFWPEAVSYACLIKNRLPH FT AALKGMTPYQALTGEKPDLSLLREFRSKCWVLDQSGNPGKLDAKSRPCIFL FT GFATDSKAFRCWDIGKNTFVKSRNVVFAARKREQEVLLPSPRSAPEGESAD FT TSSRAPEGQGAAAQAPADPSDPSAQKSSAAEGSGGVPDGKPAQPKAPTGVR FT PPTREASARLRSKPPVDYLRMHNPQARGPRQSKSTEEDAPADAAAEPDASS FT DSDDVVEYAYLGADLADDDPRSYRDALKRPDAARWQEAMQREYDQLTKLGC FT WDLVDLPAGRKAIGCKWVYRIKRNFSGAIIKYKARLVAQGFSQVPGVDYDE FT TYAPVMRPESLHILAAIAVILNLEWDIENAVGAYLNSQLKLTIYMRQPEGF FT DDGSGRVCKLNLALYGLKQSGREWNLLLDEFLRGIGFRASSVDPCVYLRID FT EGSPTFLAVHVDDFSLFAKTREIMDKLKGELSSRFEMTDLGPVRQILGYEV FT IRERDQRTLMLRQAAYIRKVLDRFNMADSNPVSVPMDPNTRLQKTPDAPPP FT DFPYREAVGSLMYAAVGTRPDISYAVQTLSQFCERPSTAHWTALKRVLRYL FT KGTAEWGIIYKAPEAQTTPIEVVGYSDADWGANPDDQKSISGYVFLLGGAP FT VCWASRKQKSVALSSMEAEYMAGSTAASQALWCRMLLEELGFAQPNPTLLY FT MDNQSALALARNTGTQGRAKHIDIRYHFLRDKISSKEISVAHCPGEDNPAD FT IFTKPLARQKFEHFRAMLGMSASRG" XX SQ Sequence 4239 BP; 957 A; 1232 C; 1265 G; 785 T; 0 other; ggttatgagc cccgactgtg ggctcgcctt ctggctctgc actgacctcg ccacccaggc 60 ctagactacc cccatgaccg ccagcgcgaa cagcggctcc gactcctcgt ccatctcagc 120 gtcggtccgg atcgaacccc tccagggaag cgagaatatc gcgccctgga aggtgaagat 180 catggatatc ctcacgagct tggggctcga ggataccatc acttcggatc cgccccaggc 240 acccgaagat gcggacaagg cgacgctgat tcagcacgag aaggctaagg ctgaattcaa 
300 gcgccgtagc cttcaggcgc tgagccacat ccgccttcgc gtgtcggacg ccgtactggt 360 gtacatcagc gcggctacca ccgccaagga ggcctgggac accctgatgc agacgttcca 420 gcccagcggc atgtactcgt acatcttagc gtaccgtaaa ttctaccaga cgcgttgtcc 480 cgacgggggg gacatccccg agcacctagg gcagatgcgc agactgcggg atgagatcca 540 cacgatcgca ggcacggctc gcatcagcga tagggaattt gcgttcgcgt tgctgatctc 600 gctgcccgat tcgtgggatc aattcattca ggctattgat gccaccgcag ataatctcac 660 gagcgcccag gtgatcgcgc ggatcctcca agaggacaag cgccggaaag aacgcgaggg 720 cgcgaaaggt gacagtgcat tgctagctaa gtccaaaaag aaggctcggc ctcgctgcca 780 caattgtggc aagcaggggc acctcaaaaa ggactgttgg gcatcagggg gaggcgcggc 840 agacaagccg aaggccaatg acgctgtgga cggcgaattt ggcttcatcg ccgccgaggg 900 ccagaactcg gcagatcccg agcccgatat cgaaaaagcg ctaagcgcgt tggatgagtc 960 gcggctgtgg tatgccgaca gcggcgcgac cacacacgtc gcaaacgccc gagagctgtt 1020 tcacaagtat cagccgacgg caggaggcgc gtcgatcagt gggctcgggg gcgtatcccg 1080 catcgcaggg cgtgggagcg tgaagctcaa gtgcaaggtc ggaaatcgca ttcgcaagat 1140 ctggctgcac aatgtagtgc acgtccccgg cgcgcctcat aatttattgt cactcacgcg 1200 cgccgacgac gcacattgtc ggtacgtagg cgggaagggc actctgacca tgtacggccc 1260 ggacggcaaa attctgatgg tcggacgcaa gcttgaggga ccgggacagc tgtaccggtt 1320 gtccgtagcg gcggaactgg cggatcaggc gaacgtagcc gtcctgggca agcgtacctg 1380 ggaggagatg caccggattc ttggacacgc ccacctggct tcaatcaaga gactctgcct 1440 cggcttagcc gaaggagttg atgtcgacat gacgtcagac gatcaattcc agtgcgagtc 1500 ctgcgtgcgc gcaaagcaac acgtccagcc gttcccagac caagctactc gtaccccgga 1560 caagctcaaa gtaggggaaa tcgtggctag cgacacttgg gggcctgccc aggtgagatc 1620 cgtccacagt ttctactact acatgtcaat gactgacttg aagtctcggt tctcgggcgt 1680 ctatttcagc gcgcagaaat cagggctcgc gaagatggcg ctgaccgagt tcagggggct 1740 gatccggagg atgagcagga ctgggagctc cattatgtgc ttgaggattg acaatggtac 1800 tgaatttatt aataacgaat ttctacagta ctgtaaatcc caaggcatag tggtagaaac 1860 aacggcacct tattcacctg cccaaaatgg tgttgctgag cgcctcaatc gtaccctccg 1920 tgagagcgcg cgcgcgatgt tactcgcggc agagctgcca cggaaattct ggccggaagc 1980 cgtaagttac 
gcgtgtctga ttaaaaaccg cctcccgcac gccgccctca aggggatgac 2040 gccatatcag gcgcttacag gcgaaaagcc ggacctgtcg ctgctacgcg aattcaggtc 2100 gaagtgctgg gttctggacc agagcggtaa ccctgggaaa ctcgatgcca aatcgagacc 2160 gtgcattttc ttagggttcg cgaccgattc aaaggcgttt cggtgttggg acataggtaa 2220 aaatacattt gtaaagtccc ggaacgtcgt ttttgctgca cggaaacgtg agcaagaggt 2280 cctgctacct tcccccagaa gcgcgcctga gggggagagt gcagacacga gcagtcgcgc 2340 tcctgaaggg caaggagcag cggctcaggc cccggccgat ccgagcgatc cgagcgccca 2400 gaaatcgtcg gcggctgagg gttcaggggg agtgcctgat ggcaagccgg ctcagccaaa 2460 ggcacccacc ggtgtgcggc ctcccacacg cgaggcaagc gcacgtttgc gttcgaaacc 2520 tcccgtagat tacttgcgaa tgcataaccc gcaggcccga gggccgcggc agtccaagag 2580 caccgaggag gatgcaccgg cagacgccgc agctgagcct gatgcgagct cagatagcga 2640 tgacgtcgtc gagtacgcat acctcggggc tgatctggcc gatgacgatc cccgatcgta 2700 tcgtgatgcg ctcaagcgcc cagacgccgc gcgatggcaa gaagccatgc agcgcgaata 2760 cgaccagtta accaagctgg ggtgctggga cttggtggac ctaccagccg ggcgtaaggc 2820 cataggctgc aagtgggtct atcgcattaa acgcaatttc tcgggcgcga tcatcaaata 2880 caaggccagg cttgtggcgc aagggttctc gcaggtaccg ggcgtcgact acgatgaaac 2940 ctacgcaccg gtaatgcggc ccgagtcgct acatatcctc gcggctattg ccgtcatcct 3000 caatctcgag tgggacattg agaacgccgt tggggcatac ctcaatagtc aattaaagct 3060 cactatttac atgaggcagc ccgaaggctt cgacgacggc tctggacgcg tctgcaagct 3120 caatcttgcg ctgtacggcc tgaagcagtc cggacgcgag tggaacctcc ttttggatga 3180 gttcctgcgc ggaatcggtt ttcgggcgtc tagcgtagat ccttgcgtat acctccggat 3240 cgacgaggga tccccgacct ttctagctgt ccatgtcgat gatttctccc tgtttgccaa 3300 aacgagagaa atcatggaca aactcaaggg ggaactcagt tcgaggttcg agatgaccga 3360 cctcggaccg gtccggcaaa ttctcggcta tgaggtcata cgcgaacgcg accaaaggac 3420 actgatgctc aggcaagccg cctacatcag gaaagtccta gatcggttca atatggccga 3480 tagcaaccct gtgtccgtgc ctatggaccc caacacgcgg ctgcaaaaga ctcccgacgc 3540 gccgccgccc gactttccat atcgggaagc cgtcggctcg ctcatgtacg ccgctgtggg 3600 cactcggccg gatatctcat acgcggtgca aacactcagc cagttctgtg agcgaccctc 3660 caccgcacac tggactgcgc 
taaaacgtgt attgaggtat ctaaagggca cagcggaatg 3720 ggggattatc tacaaagccc ccgaagccca gactacaccc atcgaagtag tcgggtactc 3780 agacgccgac tggggagcca atccggatga ccagaagtcc atatccggat atgtcttcct 3840 gctcggcggc gcccccgtct gctgggcttc gcgcaagcag aaatcggtgg cgctgtcgag 3900 catggaggcc gagtacatgg ccggctccac tgctgcctct caggcactct ggtgccggat 3960 gctgctcgag gagctcggat tcgcgcagcc gaacccgacg ctactctata tggacaacca 4020 gtcggcgctg gcgcttgctc ggaacaccgg tacccagggg cgcgcgaagc acatcgacat 4080 ccgctatcat tttctccgag ataagataag ctcgaaggaa attagcgtcg cgcactgccc 4140 tggggaggat aaccccgctg acatttttac caagccacta gcgcggcaga agttcgaaca 4200 cttccgcgcg atgcttggca tgtcggcgtc gagggggag 4239 // ID Gypsy-52_MLP-I repbase; DNA; FNG; 5715 BP. XX AC AECX01001645; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-52_MLP_; KW Gypsy-52_MLP-LTR; Gypsy-52_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5715 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001645; Positions 16160 10446. XX CC Positions [4363-4842] - Integrase core CC 'GCCAC' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(1639..2862,2866..5259) FT /product="Gypsy-52_MLP-I_1p" FT /translation="MEERADQAWRYHSYMRHLGLCGVCRTACGKSDCRGPA FT KGPYVTVPGLDVFNPGPRPSRRQDASSTNPSRQSNANRLPLAPGAPTSRPA FT GRPQNQAPLVASINEAPDLFTQDLARYEEADRVLVAHLANEKEDARCVKNP FT GSHTKSIILKLKINGVDVRALIDCAAETNLMADRLSKQAKIPRRRLVAPVE FT VRLAIDNKEAQPFYFTEFCFANVSSSEPPLRFGSTFFKLAPLGENFDVILG FT TPFLEKFHLDVSLHDRTVKHVPSKTILRESQNIVNEINEIKNEMISCVVEN FT LKQVQDSRDLTAREERVLKEFSNLFPDELPPVESEEEDDFFPPESQNDSSK FT VRHKIILTDPNVVINEKQYAYPQKYWGPWRKLVDQHLAAGQIRRSTSQYAS FT PSMIIPKKDPNAPPWVCDYRTLNKYTVKDRSPLPNVDEAVRLVGKGKVFSV FT IDQINSFFQTKMREEDIPLTAVKTPFGLYEWVVMPMGLTNGPATHQARVEE FT ALGELIGGICVVYIDDVVVFSDSVEEHEAHVREILRRLQAAKLYCSPSKSK FT LFRRKINFLGHEISADGICPDEQKVDKVTKWSTPKSQKQLLRFLGTVQFMK FT NFVDGLAHYVGTLSPLTSTKRKNEPFKWGKSEDQAFENIKRIMTTLPVLKT FT LDYESEDPIWLFTDASGHGLGAALFQGLEWDKASPIAYESRTMTPAERNYP FT VHEQELLAVINALQKWRLLLLGLKVNVMSDHHSLTHLMTQRNLSRRQARWL FT ETLSQFDLDFKYLKGEDNTVADALSRIEDVAALEVKSTFDEQTLQDIKTGY FT DMDHFCVRLCSTLPLREGTEVKDGLIFLDNRLVVPDTDGLRHQLITQVHES FT LGHLGTLKTLLQLRSEFFWPNLTTDVKKFVASCDTCQRTKARTTLTSGRMQ FT TSEVPHIPMEHIALDFIGPFPKVKNYDMLLSITCRLSGFVRTVPACQTDTA FT ERTAQRVFSAWLSIFGAPRTMIGDRDKLWTSSFWQELHRLLRIDVHLTTAY FT HPQSDGCSEKSNKTIVQILRQLVSSRHGKWLESLPSVEFAINSAVNVATGV FT SPFEFVYGRKPRLFPTNALPVETSPSIAKWIEKRQSLWAETRDRLWTNWIS FT QAVHYNSRRREGDTLAIDDWVLIDSKDQQQVVGGAGRPTSKLRPRYDGPYQ FT VLECLNDGRNFKLRLSSEDNSHPVFHISKLKKYRGEGHEDEKEGTGN" XX SQ Sequence 5715 BP; 1640 A; 1392 C; 1269 G; 1414 T; 0 other; cttttttatt ctatttctca accttattct tcttttcact atcgtgtgat ctctcttgac 60 tcattatctt gacaccttga cattccatca ccaaatcaat cgtcgatctt atgccattca 120 acttacctgc caatttctcg accaatccca aagcttttct tagaagatat agacaacccc 180 aatccgtcga ttcacctcct gcaccagcgc ctccacgtta caactggcaa gtagaaggag 240 aagacgtgga tacgcctaac aagacaatct gtacaattta cccgtcgtcg tctcacactc 300 tagctgcgtc aactgaacct gatccaacga tcacacccga taggcacata gttcgcgaat 360 tttttgatcg agcaaatcta tcgacgcctc tggtaccggg aagcttcatt cacacgccga 420 tgcaacaacc cccagccgga 
tcgtcgcaag gtattggtgg gatcgcggcg ctggaaccca 480 tgtctcgaga cgaactcgtc gagactctac aagcaaaagt tgaaagctta gaacaagaga 540 agtcggaatg gactgcaaag ttggaactag ccgccgctga ccggacccgt atcgatgcgc 600 tcgaattgac aatctcccaa ttactgacgg aacggtctcc tggtacagct cgtagtacag 660 gatctaacaa cccctttgcc tcgtttcgaa atcaattatc gactccgacc ccagctgccc 720 cgcgtgatca tagagccatc aacacggaga tcgcacaagc gttagttcaa tcatctcccg 780 tgatgactca gcaagattca ccgattcgaa acgcattcat acggccaacg gaaagtgtac 840 aacgagaacc gcgtcgtgtt tcattcacgc acctcatgga tccggagctg taacaaccaa 900 cggtaccgtc gagagcacct tacaacgtgc agtccgcgct cacatccaag tacctccctt 960 tcgtacacaa ccggccgaag tccaacagaa agtcaacaag gtttatgcaa gcgatcttaa 1020 gttatctgaa atccctaagt tcaatggacc atcggaaagt cccgcagcgc tgtttaattg 1080 gcgtaggtta atcgaacaat acttcgaatt gaaaaacgtt tccgaccatc gacaacggtt 1140 catcatcttg ggttcggtag tggttgaacc acgagcagga gcatggtata cgaactgagc 1200 aggagcatgg tatacgaact cacgtaccga tctcatgcat gaactggcaa cagaaacctt 1260 accgactgga tggttagaag aaacagagaa aagcattcgg cagttacgaa tggggattaa 1320 tgaagatttc aggtttatgt tggtagagct cgtgatctat acaatctagt acaagtcaag 1380 acatctctta cacccaagaa tctggcggaa tacatcgtgt ggggaacgcc cgacatcttc 1440 caacgatggg tcagagatcg ccaattgctg caagtaccgg acttcaaact cttaccattc 1500 atcgcgcagg caaacaacat atgggattta gttctggcca gcaatctcct ccctcgagtc 1560 caaaaccgtc actcagtcag tatccacacg cctacacctc gtggaacgaa cgtcaccaac 1620 gcacctgttg gccgatcaat ggaggaacgc gccgatcagg cgtggcgata ccattcatac 1680 atgagacatc tgggactgtg tggtgtatgt cgaacggcgt gtgggaaaag cgactgtcga 1740 ggcccggcaa aaggaccata cgtgacagtt cctggtctgg acgtcttcaa ccctggacct 1800 cgtccatcgc gtcgacaaga tgcatcatca acgaacccat cacgccagtc gaacgcgaat 1860 cgtctcccat tggcgccggg cgctccaacg tctagacctg cgggccgacc acagaatcaa 1920 gcacctcttg tcgccagcat caacgaagcg cctgatttat ttactcaaga ccttgcacgt 1980 tatgaggaag ctgaccgagt attggtggct cacttggcga atgagaagga agacgcaagg 2040 tgtgtcaaga accctggttc tcataccaaa tccatcatcc taaagctgaa aattaatggt 2100 gttgatgttc gggcgctcat tgattgtgca 
gcggaaacga atctgatggc ggatcgacta 2160 tcaaagcaag ccaaaattcc acgccgtcga ctagtcgcgc cagttgaagt gcgattggca 2220 atcgacaaca aagaagctca accgttttac ttcacggaat tttgctttgc caacgtttcg 2280 tcatcagaac caccattacg tttcggatcc accttcttca aactcgcacc cttaggagaa 2340 aattttgatg taattctggg gacccccttc ttagagaaat tccaccttga tgtatctttg 2400 catgaccgga ctgtgaagca tgtaccttca aagactatcc ttagggaatc tcagaatata 2460 gtcaatgaga tcaatgaaat aaagaatgaa atgatatcat gtgttgttga aaatctgaaa 2520 caagttcaag actcgagaga cttgactgct cgagaagaaa gagtcctgaa ggagttttct 2580 aatttgtttc cagacgaact acccccagta gaatctgaag aggaggatga ctttttccct 2640 cctgaaagtc aaaatgattc ctcaaaagtg cgtcataaga tcatattgac agaccctaat 2700 gtggtcatca atgagaagca gtacgcttat ccacagaagt actggggtcc atggcgaaag 2760 ttagtagacc agcatttggc tgcaggtcaa atcagacggt caacaagtca gtacgcgtcc 2820 ccttcgatga tcattccgaa gaaggatccc aacgcgccgc catgatgggt gtgtgattat 2880 cgcacactga acaagtacac agtgaaggac cgttcaccat tgcccaatgt tgacgaggca 2940 gtgcgattgg tggggaaagg aaaagttttt tctgtcattg accaaatcaa ttctttcttc 3000 caaacaaaaa tgcgcgaaga agatattccg ttgacagcag tcaaaacgcc atttgggctc 3060 tacgaatggg tggtcatgcc aatgggctta accaatggac cagcgacgca tcaggcacgt 3120 gtggaagaag ctttaggtga attgattggt ggtatctgtg tggtatacat cgatgatgtt 3180 gttgtttttt cagattctgt agaagaacat gaggcccatg tcagagagat tctgagacgc 3240 ttgcaagcag caaagctgta ctgctctcca tcaaaaagca aactatttcg acgcaaaatc 3300 aacttcttag gtcatgagat aagcgcagat ggaatttgtc cggacgagca gaaggtagat 3360 aaagttacca aatggtcgac accaaaatct caaaagcaac tccttcgatt tttaggcaca 3420 gtccagttta tgaaaaattt tgtagatgga ttagctcatt atgttggaac actatcacca 3480 ttaaccagta caaagcgtaa aaatgaacct ttcaaatggg gaaaatcaga agaccaggca 3540 tttgagaaca tcaaaagaat aatgacaaca cttccagtat taaaaacctt agattatgaa 3600 tcagaagacc ctatttggct ttttacggat gctagtggac acggcttagg tgcagcgttg 3660 tttcaaggct tggagtggga taaagcgtca ccaattgctt atgagagcag aacaatgaca 3720 ccggcagaaa gaaattaccc tgtccatgaa caagaacttc tggcagttat caacgcactt 3780 caaaaatggc gtctcttact cttaggcttg aaggttaacg 
ttatgtcgga ccaccattca 3840 ctcactcatc tcatgacgca acgtaactta agtagaagac aggcccgttg gttggaaacc 3900 ctttcgcaat ttgacctcga tttcaaatac ttgaaaggtg aggataacac tgtggccgac 3960 gctctttcaa gaattgaaga cgtagcagca ctcgaggtca agtcaacatt cgatgaacaa 4020 actttacagg acatcaagac tggctatgac atggaccatt tctgcgtacg gctttgttcg 4080 acactgccgc tacgggaagg gacagaggtg aaagacggcc ttatctttct ggacaatcgt 4140 ttagtggtcc cagacactga cggattgaga caccaattga tcacgcaagt tcatgaatca 4200 cttggacacc taggaacact gaagacctta ctgcaacttc ggagcgaatt cttttggccc 4260 aatctcacga ctgatgtcaa gaaattcgta gcttcatgtg acacttgcca acgaacaaag 4320 gcgaggacga ctctgacctc cggacgaatg caaacgagcg aggttccaca tatcccgatg 4380 gaacacatcg cgctagactt catcggccca tttcctaagg tcaagaatta tgacatgctc 4440 ctgtccatta cttgtcgact ctcagggttt gttcgtactg ttccagcctg ccaaacagac 4500 acggccgaaa ggactgctca acgagttttc agcgcttggc tttcaatctt cggggcacct 4560 cgtacgatga taggtgaccg agacaaactc tggacgtctt cgttttggca agagctccat 4620 cgacttcttc gcatcgatgt gcacctgacg acagcatatc accctcaaag tgacggttgc 4680 agcgagaaat caaacaaaac cattgttcag attctgcgcc aacttgtgtc ttcacgtcat 4740 ggcaaatggc ttgaatcgtt gccatcggtc gaattcgcaa tcaactccgc tgttaatgtc 4800 gcaaccggtg tctcgccatt cgagtttgtt tatggccgca agccccggct gttcccaacc 4860 aacgcacttc cagtggagac gagtcctagt atcgcgaaat ggatcgagaa acggcaatcg 4920 ctgtgggcag agacacgtga ccgtctttgg actaactgga tttcgcaagc agtacattac 4980 aacagtcgac gacgcgaagg tgacacattg gccattgatg attgggtttt aattgatagc 5040 aaggaccaac agcaggttgt tggaggagca ggcagaccga cttcaaaact tcgaccaaga 5100 tacgatggac cataccaggt cttggaatgc ctcaacgacg gtcggaactt caagttaagg 5160 ctcagcagcg aagataactc acaccccgtg ttccacatct ccaaactcaa gaaataccga 5220 ggggaaggac atgaggatga gaaggaaggg acaggcaatt gacctgggtg tcaaagtaag 5280 ttcccaccat tattagtatg caccgccgaa ggctacatcc cccaaaaaat cttttcaaaa 5340 acaactcctt ggccacgccc gtgagcaccg cacaactttg tattgtttca cttttcagta 5400 acgaaactat cacgaacctg gagcggagga tcaagctcga cacggcacat tttgcgacgt 5460 caaggattca aacaaaaaca cctttttctt attcactttt tagcggcaat 
ttcaattttt 5520 ttttacattc agttcagttt caattttttt tagtttcttt caatttcaca aatcagatgg 5580 tttttaaggg cgagtcatgc caattttttt tcttttcttt tttctcttct cttatgaatt 5640 ttttcttttc aacaattatg atggatacat ggtataatta ggtggaacat agaatttttt 5700 ttaggaaggg aggga 5715 // ID Gypsy-102_MLP-I repbase; DNA; FNG; 5610 BP. XX AC AECX01000530; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-102_MLP_; KW Gypsy-102_MLP-LTR; Gypsy-102_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5610 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000530; Positions 187747 193356. XX CC Positions [2722-3141] - Reverse transcriptase CC Positions [4391-4891] - Integrase core CC 'ATTAG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1453..4455 FT /product="Gypsy-102_MLP-I_1p" FT /translation="MNDTRMITQIPIHDPYSDTTVFARALVDSGATHEVIS FT TSFVEKHHLYTLPLPHSHSVTGFSGHKSKVTHMGDYHVLNDSSITTFIVTN FT LRDKYDAILGMPWIRRNHPRIDWNSCQVKTNHEVAVTLLTSSSPKPALKDQ FT EMEPKRHARNCDEGVEFVDDSFMPPQCEYFSHISKSSKEAVSKPLHHLELC FT DIENTPSRYVESPAVIPSVTLSNPIKALENHELEPKRQVRNCDKGVERASV FT SCIPPQSESFSPSQSVKRNQMSQQTHLLQNKPPGTISRPKLRATRTTRYSS FT KYISLARPGASQVDTAKASWNVSAKPAAQQVKDQIEKTAFKLVPKAYHGYL FT DMFEKNRSNVLPPHRPYDFRVDLLPGARPQAGRVIPLSPKESLVLDEMLDK FT GLKNGTIRQTTSPWAAPVLFTGKKDGNLRPCFDYQKLNAVTVKNKYPLPLT FT MELIDSLLNADQFTSLDMRNGYNNLRVREGDKAKLAFICKRGQFEPLTMPF FT GPTGAPGFFQYFIQDILKNHIGRDVAAYQDDILIYTQPGVDHEAVVKEVLD FT ILKAQNVWLKPEKCKFSQEEVSYLGLIISRNQIKMEEAKVKAVTEWPTPRN FT LSEAQTFIGFANFYRRFIDQFSKVARPLHALSQKDTKFSWTPECQAAFDGL FT KKSFTTAPVLKIADPYKPFVLECDCSDYALGAVLSQISDDDNELHPVAFLS FT RSLIQAERNYEIFDKELLAVVASFKEWRQYLEGNPNRLNVVVYTDHKNLQS FT LMTTKELTRRQARWAETLGSFDFKIQFRPGKQSTKPDALSRRPDLMPNVNE FT KLTFGRLLKPQNLPVDAFVDKFELMEIEDFMTSEEVPIVVEVDQEEEDREE FT QNQLPVWNDKQILNEIKVKSQIDPKVKEILTLCSEMPTSILLKDYSSENGI FT LYFKQKALVPNDQNLKLQILRSRHDSLLAGHPGRARTLALIKQSYHWQSMK FT AYVNRYVDGCQSCQRVKTRTTKPFGSLQPLPIPQGPWTDNMLRPHH" FT CDS 4517..5512 FT /product="Gypsy-102_MLP-I_2p" FT /translation="MAHFIPCRKTLALEELADLMIQNVWRLHGTPVTITSD FT RGNVFISCITKDVNKRLGITTQSSTAYHPQTDGQSEITNKAVEQFIRHFTN FT YKQDNWRSLLPMAEFSYNNNQHVSIGMSPFKANYGHDVSFMNVPSSDQCLP FT LIEEQMKQIKEVQKELKDTMQLSQEAMKHQHDKGVRRTPNWTKGTKVWLSN FT KNISSTRPTVNFAHRWLGPFPITKRISTNAYKIHLPKSMAKVHPVFHVNLL FT RKFEGSSIRGQDQEPPAPIIIENEEEYEVLEVLDKRMMNGKIEYLVSWVGY FT ESNHDSWEPEAGLKNAQELVRKFNEKYPQAEKRYHRVRGK" XX SQ Sequence 5610 BP; 1889 A; 1192 C; 1232 G; 1297 T; 0 other; tattgcaacg tctatttcga gatccaagag ttcaagaatc gaaaacgagt aaagaagaag 60 aaagaaagtc gactcaagtc cgagaagttt taaaagaaaa aagaaattca agatctagat 120 ttacaagaag tttaaagttt aaattaaagt atccccacaa ccaagcaatc agtatcaaat 180 aaagtttaaa attggttgaa ccgatctaca tccgcaaccc acataactta gtcacaacgc 240 cggaattcac actacctgga agcccatcct ccgcagggct 
gaacttcgaa gatacagtcg 300 aagacaacat gtcaaacgta aaccttgaca aagtcatgcg acaaatcaac gaactgaaca 360 acaggctcaa caaagaagct cgactccgtc aggaggaagc tattcgccgc cagaacgctg 420 aacgcttagc agaacaaagg cttatgcaat tagaggagtt gagaaactcg actcaaacgc 480 accctcaagc tcctgtatcg tcgaaccctc atccgagcat ccctgtagta ttgcatgtaa 540 agcctccgaa agtagctact ccggataaat ttgacgggtc taaaggacca aagccgaagt 600 ttttttaaat caactgaatc tttatatgca gatgaatgcg agctcgtttg cagatgaaaa 660 gtcacgagtc atttttgcga tgtcgtatat gactggcaaa gcaagcattt ggagtcaatc 720 gttaacggat caaatcctaa acgaagacca ggctcattta gttacctgga aatctttcac 780 tacctctttt aaagcgactt tctttgatac ggaaagggta gccaaggctg aacgcgaact 840 tcgtgcatta tcacaaactc aatctgtgtc cgattactgg atcaaatttg ctgaactagc 900 gctggtagtt aagtggcctg agtcagtctt aaaatctcaa ttcgagcaag gccttaaaac 960 agaaatatct gtatacatga tccgagacga atttgaaaaa gttgaagaaa tggctcaaat 1020 ggcgatcaaa cttgacaaca agatcaacaa acgtgtcaag gatattggta gtttcagtgc 1080 tggaaactca aacccgactg ccccaacccc tgctcgagac cctgatgcca tggattgttc 1140 tgcataccga ctaaacatat cgtctgatga atataaacga agaggtgcta cgtggtcgtg 1200 ttacaaatgt ggaggaaagg atcatctcat tgctgagtgt ccgatgttga agaagaaggt 1260 ctttggaaat tctagattta atgaagttgc aaaactgaaa gcaagaatag ctgaacttga 1320 aggaaagaat gatgggaatg aggagggaag agctgagtcg tcaaaaaatg gcgaagctcg 1380 ggattgttag ttgtgccaac cccaagcgat gtcaaatggg gtttggaatt gggtgatatt 1440 agtagtcttg aaatgaatga tactcgcatg ataactcaaa ttcctatcca cgacccctac 1500 tctgacacaa cagtattcgc gcgtgccctg gttgacagcg gagcaacgca cgaagtgata 1560 agtacaagct ttgtagagaa acatcatctc tacactctcc ctttgccgca ttctcatagc 1620 gtcacaggat tcagcggaca caaatccaaa gtgacgcata tgggagacta ccatgttctt 1680 aatgactcaa gcattaccac attcattgta accaacctac gcgacaaata cgacgcaata 1740 ttgggtatgc cgtggataag aagaaaccac ccaaggatcg attggaattc atgtcaagtg 1800 aaaaccaacc acgaagttgc cgttactctg ttaacgtcgt caagtccgaa accagcctta 1860 aaggaccaag aaatggagcc caaaaggcat gctaggaact gtgacgaggg ggtagagttc 1920 gtagatgact cattcatgcc cccgcaatgt gagtatttct cgcatatatc taaaagttcc 
1980 aaggaagcag tcagcaagcc tttacatcac ttagaactct gtgacatcga gaacacccca 2040 tcaagatacg ttgaatcacc tgcagttata ccaagtgtaa ctttgtccaa tccgatcaaa 2100 gccttggaga accacgagtt ggagccaaag aggcaagtta ggaactgtga caagggggta 2160 gagcgcgcaa gcgtctcatg tatacccccg cagagtgagt cttttagtcc ttctcaaagc 2220 gttaaacgaa atcagatgag ccagcagaca caccttctac agaacaaacc tcccggcaca 2280 atatcaaggc ccaagctacg agccactcga accacacgat atagcagcaa gtacatctca 2340 ctggcacgac caggagcaag ccaagttgat actgcaaaag cctcatggaa tgtgtcagct 2400 aaaccggctg cgcagcaagt gaaagaccaa attgagaaaa cagcattcaa attagttcca 2460 aaagcttatc atggttatct agacatgttt gaaaagaata ggtctaatgt gcttccaccc 2520 cacaggccgt atgatttcag agttgattta ctacctggag caaggcctca agcgggacgt 2580 gtaatcccgt tatcaccaaa ggaaagttta gttcttgacg agatgttgga caaaggactc 2640 aagaatggca caattcgaca aactacctcc ccgtgggcag ctccagttct gtttacggga 2700 aagaaggatg gtaatttaag accgtgcttt gattaccaaa aactcaacgc tgtaaccgtg 2760 aagaataagt acccactccc gctgacaatg gagctcattg atagcctatt gaatgcggat 2820 caattcacaa gtctagatat gcgcaatggc tacaataact tacgagtacg tgaaggcgac 2880 aaagcaaaac tggccttcat atgtaaacgt ggacagttcg aaccactaac catgccattc 2940 ggaccaacgg gggcgccagg atttttccag tatttcatcc aagacatctt aaagaatcat 3000 attggaagag acgtggctgc ataccaggac gatatcttaa tttatacgca accaggagtt 3060 gaccatgaag ccgtagtgaa ggaagtctta gacatactta aagctcaaaa tgtatggttg 3120 aagcctgaga agtgtaaatt cagtcaggag gaagtgtcat acttaggttt aattatttca 3180 agaaaccaaa tcaagatgga agaagcaaaa gtcaaggcgg taacggaatg gccaacacca 3240 cgaaatttgt ctgaagctca aactttcatt ggatttgcga acttttatag aagattcata 3300 gaccagttct caaaagttgc acgacctctc cacgcattat cgcaaaagga caccaagttc 3360 agctggacac ctgagtgcca agccgccttt gacggtctga agaaatcttt tacaacagct 3420 ccagttttga agattgctga cccatacaaa ccgtttgtac ttgaatgtga ctgttctgat 3480 tatgcactag gtgcggtgct ttcgcaaata tccgacgacg ataacgaact acaccctgtt 3540 gcctttcttt cccggtcgct gatccaagcg gaacgtaatt acgagatatt cgacaaagag 3600 ctactggcag tggtggcatc ctttaaagaa tggcgtcaat atctggaggg gaatcctaac 3660 
cgcttgaatg ttgtagtgta cactgaccac aaaaatctgc agtcattgat gaccactaaa 3720 gaacttactc gaagacaggc taggtgggcg gagacattag gaagtttcga tttcaaaatt 3780 caattcagac caggtaaaca atcaacgaaa ccggatgcct tatcaagaag accggaccta 3840 atgcctaatg ttaacgaaaa actgacgttt ggacgcctac tgaaaccaca aaatttaccg 3900 gttgatgcgt ttgtggacaa gtttgagctg atggaaatag aagatttcat gacatcagaa 3960 gaggtaccca ttgtagtgga agttgatcaa gaagaagagg acagagaaga gcaaaatcaa 4020 ttaccagtat ggaatgataa gcaaattctg aatgaaatta aagtgaagtc ccaaatcgac 4080 ccaaaagtga aagaaatact aaccctgtgt agtgaaatgc caacgtctat actactgaaa 4140 gattacagca gcgaaaacgg aatcctttat ttcaaacaga aggcgctggt accaaatgac 4200 caaaatctca agctgcaaat cttaaggtca aggcacgata gtttacttgc aggacatcca 4260 ggacgggcac ggacactagc gttaatcaag caaagttatc attggcaatc tatgaaagca 4320 tatgtaaatc ggtatgtcga tggatgccaa tcttgtcaac gagtaaagac acgaacaacg 4380 aaaccattcg gaagcctaca acccctcccg ataccacagg gaccgtggac tgataatatg 4440 ttacgacctc atcactgact tgccggaatc tgaaggatat gatagtatcc taacggtagt 4500 ggacagacta actaaaatgg cgcactttat accatgtagg aaaactttgg cgttggaaga 4560 actggcagat ttaatgattc aaaatgtatg gcgacttcat ggcacaccag taactatcac 4620 atcagaccgg ggtaacgtat ttatatcttg cataaccaaa gatgttaaca aaagattagg 4680 tatcactacc cagtcatcaa cagcatatca tccccagacc gatggtcagt ctgaaatcac 4740 aaacaaagca gttgaacagt tcatccgaca ttttacgaat tataagcaag ataattggcg 4800 atctctctta ccaatggctg agttctcata caacaacaac cagcacgtct caataggcat 4860 gtcgcctttc aaagccaact atggtcacga tgtaagtttt atgaatgtcc cgtcaagcga 4920 tcaatgcctg ccgttgatag aggaacaaat gaagcaaatc aaggaggttc agaaggaatt 4980 aaaagacacg atgcaattaa gtcaagaggc aatgaaacat caacacgaca agggagtaag 5040 aagaacaccg aactggacga aaggaaccaa agtatggctt agtaataaga acatctcctc 5100 gacgagacca actgttaatt ttgcacaccg atggcttggc ccttttccaa taaccaaaag 5160 gatttccact aatgcgtata agatccactt acccaagtct atggcaaaag tccacccggt 5220 ttttcatgtt aacttgttaa gaaagtttga aggtagcagc attagaggtc aagaccaaga 5280 accaccggcg ccaatcatta tcgaaaacga ggaagaatat gaagttctgg aggtactaga 5340 caaaagaatg 
atgaatggca agattgaata tttagtcagt tgggtagggt atgaatcaaa 5400 tcacgactca tgggagccag aagcaggatt gaagaatgca caagaattgg tacgaaaatt 5460 caatgaaaaa tatccgcaag ctgaaaagag ataccacagg gtacggggta aatgagaggg 5520 tgaggctttt ttcccaatgg gtttttaatg ccaacccggg ggaagatgct aacctgtcaa 5580 gaggaggttg agtcataaag gggggagtgg 5610 // ID TSE3_I repbase; DNA; FNG; 4595 BP. XX AC AJ439555; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 14-MAY-2010 (Rel. 15.06, Last updated, Version 2) XX DE Saccharomyces exiguus retrotransposon TSE3_I, internal region. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; gag; pol; protease; integrase; KW internal region; internal portion; RNaseH; TSE3_I. XX NM TSE3_I. XX OS Kazachstania exigua OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Kazachstania. XX RN [1] RP 1-4595 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439555; Positions 948 5542. 
XX FH Key Location/Qualifiers FT CDS 101..922 FT /product="TSE3_I_1p" FT /translation="MSKESKFDSKALKEFKDSMPIILGKDNSTPPDNYDAW FT EAMVQRRLEQYGYDDEIATECMLMVMKGQCAIWFHSYLTTYRNIHGKSPST FT QEACAACREQYGNESFDEENIQNLLDLQINFSNREPGITKFLQYRHLLERM FT NSHELILHIYIHTLPPAIRNYIFSQKAKGLDEAMALTRRKMKELEMDKTVS FT YRQSSERIKSIPRKFLNQSLNRRPVSVVTVVSLGIQKTSADGRKLVYQTNQ FT NLPQRYLRINYIVQKISRAIIFKIFLYFYYTCL" FT CDS 1005..4415 FT /product="TSE3_I_2p" FT /translation="MTIQISKRNNKSTLALADTGASHNFIEQRFIQGMGLV FT KQRLPEQSVVKGALNTSGLIQHYVRLKFKHNKDLYEEIFLIIDSPLPQPVV FT FGIGFFKQHPEILSRLISPPTSSSKPITMVGRRQILADLKNPDNQVHLCWL FT RKKDQENSETQSHKILQRFDDVVVDEIPQGNSNKPLATPTVQHHIDLVEGA FT APVAKRAYRLSASDRDELDKQLSELIASGNVIPSESPYAAPVIFVQKKDGT FT KRLCVDYRGLNDITIKSKFPLPLIEDVLDQLSGATIFSKLDLISGYHQVAI FT ADEDQYKTAFTTHRGQYSWRVMPFGLTNAPATFQRLMNYVLRDYINKICVV FT YLDDILIYSKNEKEHSEHVSTIINVLRKHQLYAKKSKCEFYVPKIQFLGHE FT LSAKGITPDKEKILAIKDWPTPKTYKDAQSFIGLAGYYRRFIKDFSYIAKP FT LHQLAAQKIKWTDECKESLDKLKRQLSTAPIIIPFDRTKQIVLTTDASSTA FT IGAVLELYGKGTLKSELVGVVAYLSHLLRDNELNWPIRDKELYAVIFAFKK FT WRHYLAGTHIIIKTDHHSLQYFKTSVLDSNLRLARWRDILEEFDYEIQYIK FT GSTNHADALSRPPMLNSLVASKVELPEEDLRELETAYSTDPFFSKIIKHLK FT DPALEPPIEVRTPLQNHKLEGNLLYYTKEENMPKRLCIPQAHNVKLFYTYT FT MMPLLLAIQDNIVHFKALLENYYWPGMENDIKRYVSTCRACQQNKHPVLLS FT PGTFHPLPSGTHRWSHINIDFLGGLVLSQGHDCIMVVIDRATKMAHFIPVT FT KGADSETVIDLFIDSVLKLHGFPVEILSDQDKLFTSKLWQRSMQRFKIATK FT FTSTYNPSTDGQVERMNRTIMEMLRHYLTDNPSAWVMLLPVVEFAYNNTYQ FT VSIQTTPFFANYGYHPRLPGFYHLITSGGQSEKEARGTELGDLDDRIIQQN FT NIFLIIQERIAAAQQKQALQYNKKHRHAEFEVGDKVLVHQQAYWPGYHKGL FT KLHHIWYGPFPVTAADGANLTLDLPRQRTRRNTTFHMKYIKLYDERTNATP FT TAPPVTPGQIRQRTNEITKIVSLDTQLNKIEAQWQHCEPSDISLVSPQDLQ FT RTPYLERLLDHYDMSEADKIGTNRFRKHR" XX SQ Sequence 4595 BP; 1587 A; 979 C; 829 G; 1200 T; 0 other; tggtagcgcc gcagaatcac gatcaaactt tactccacaa aatttgaacg ttatacttta 60 ttttaacatc aatttaaatt attctaataa cttatcgagc atgtctaaag aatctaaatt 120 tgatagtaag gcactcaaag aattcaaaga tagtatgcca atcattttgg gtaaggataa 180 ctccacacct ccagataatt acgacgcgtg ggaagcaatg gttcaaagaa 
gacttgagca 240 atatggatat gatgatgaaa ttgctactga atgtatgtta atggtgatga agggccaatg 300 cgctatttgg tttcacagct acttaaccac ttacaggaac attcatggca aatccccaag 360 tactcaagaa gcatgtgctg catgcaggga acagtatggt aatgaatcat ttgatgaaga 420 gaacatccaa aatttattgg acttacaaat taatttttct aatagagaac caggtatcac 480 taagttttta caatacagac atttactaga aagaatgaac tctcatgaac taattttaca 540 catctacatt cacaccttac caccagccat tagaaactac atcttttcgc agaaggcaaa 600 agggcttgac gaagctatgg cccttactag gagaaaaatg aaggaattag aaatggacaa 660 aaccgtaagc taccgtcaga gttcagaaag gataaagtca attccaagaa agtttttaaa 720 tcaaagttta aacagaaggc cggtaagtgt ggttactgtg gtttcactgg gcattcagaa 780 aacgagtgca gacggaagaa agctggtata ccaaaccaat caaaatctac cgcagaggta 840 tttaagaatc aattacatag ttcaaaaaat tagtagagcc attattttca aaatttttct 900 atatttctac tatacctgtt tataatggca aaacgaaccc aactaatttc tatcattctt 960 acttacactt acaccaactg gcacaagaca acgagctctt gaccatgaca atacaaatta 1020 gcaaacgtaa caataaaagt acgttagcac ttgcagatac tggagcatca cataatttta 1080 tcgaacaacg atttatccaa ggtatgggac tggtgaaaca acgtttacca gaacaatccg 1140 ttgttaaggg tgcattaaac acctcgggtc taatacaaca ctatgttaga ctaaaattta 1200 aacataacaa agacctatat gaagaaatat ttctgattat tgattcacct ctcccacagc 1260 cagtcgtatt cggtataggt tttttcaaac aacatccaga aatcttgagc aggctaattt 1320 cacctcctac gtcatcttcc aaaccgatca caatggtcgg acgccgacag atactggcag 1380 atctgaagaa cccagacaac caagtgcatc tatgctggct taggaagaaa gatcaggaaa 1440 attccgagac acagtcacac aaaatcttac aacgattcga tgacgtcgta gtcgacgaaa 1500 taccacaggg taactcgaac aaaccgctag cgactcctac tgtccagcat catatagatc 1560 tggtcgaagg tgctgcaccg gttgcaaaac gagcataccg actttcagct agtgacagag 1620 atgaactgga caaacaactc tccgaattga ttgcatcggg aaacgttatc ccatcagaat 1680 caccatatgc tgcaccggtt atatttgttc agaaaaaaga tggtactaag cgcttgtgtg 1740 ttgactatag gggactcaac gacattacaa tcaaatccaa gtttccatta ccgttaattg 1800 aagacgtact ggatcaactt tcaggagcca cgatattttc taaactagat ttgatctccg 1860 gttaccacca agtcgccatc gctgacgaag accaatataa aaccgcattc accactcaca 1920 
gaggacaata ttcatggcgt gttatgccgt tcggtttaac caacgcacca gccacgtttc 1980 agagactgat gaattatgtt cttagagatt acatcaacaa aatctgtgtt gtttatctag 2040 atgacatctt aatctattcg aaaaatgaaa aagagcattc agaacatgta tctacaatta 2100 ttaatgtact cagaaaacac caactgtacg ctaagaaaag caaatgcgaa ttttacgtac 2160 cgaaaatcca atttcttgga cacgaattaa gtgctaaagg tattactcca gataaagaga 2220 aaattttagc aatcaaggat tggcctacac ctaaaacata caaagatgct caaagcttta 2280 taggtttagc tggttattac agaaggttca taaaagattt ttcgtacatt gcaaaaccgt 2340 tacaccaatt ggcggcacag aaaataaaat ggacagatga atgtaaagaa tcactagata 2400 aattgaaacg acaactcagt actgcaccaa ttattattcc tttcgatcgt acaaagcaaa 2460 tcgtacttac gacagatgcc tcaagtacag cgataggagc agttttagaa ctatacggca 2520 aaggaacact taaatcggaa cttgtaggag tggtagcgta tctcagtcac ctacttcgag 2580 ataacgaatt gaactggcct attcgagaca aagaacttta cgcagtaatc tttgctttta 2640 agaaatggcg tcactactta gcaggtacac atattattat caagacagat caccactccc 2700 tgcaatattt taagacctcc gtattagata gtaacttacg tttagcaaga tggagagata 2760 tcctagaaga atttgactac gaaatacaat acattaaagg ttcaactaac catgcagatg 2820 cactatcgag acctccaatg ctgaactcgt tagtggcaag caaagtggaa ttaccagagg 2880 aagatctgag agaattggag actgcctaca gtacagaccc attcttttca aaaataatca 2940 aacatctgaa ggacccagct ctcgaaccac caattgaagt taggactcca ttacagaatc 3000 acaaacttga aggtaactta ctttattaca ctaaagaaga aaatatgcca aaacgtctct 3060 gcattccgca agcacacaac gtcaaactgt tttacactta caccatgatg ccgctgctgc 3120 tagccatcca ggacaacatc gtacacttca aagcattgct agaaaattac tactggccag 3180 gtatggaaaa tgacattaaa cgttatgtta gtacgtgtcg tgcttgtcaa caaaataaac 3240 atccggtact actgtcgcca ggcacatttc atcctttacc gagtggtact catcgatggt 3300 cacacattaa tatcgacttt ttaggtggct tagttctatc tcaaggtcat gattgtatta 3360 tggtggtcat tgatagagcc accaaaatgg cacattttat accagtgact aagggtgcag 3420 acagcgaaac ggtaattgac ttatttattg actcagtact caaattacat ggtttcccag 3480 tcgaaatact gtcagaccaa gacaaactgt ttacaagtaa actatggcaa cgtagcatgc 3540 aacgttttaa gatcgcaaca aaattcacat ctacctataa cccaagtaca gatgggcaag 3600 tcgaacgaat 
gaacagaaca atcatggaaa tgctacgaca ctacttaact gacaatccat 3660 cggcgtgggt aatgttgctc ccagtcgtgg aatttgctta caacaataca taccaagtat 3720 ctattcagac aacaccattc tttgctaatt atggatatca tcctcgactt ccaggcttct 3780 accatttgat cacaagtgga ggacaatcgg aaaaggaagc aagaggtact gaattagggg 3840 atctagacga cagaattatt caacagaaca atatcttttt aatcatccag gaaaggatcg 3900 cagctgctca acagaaacaa gcgttacaat acaacaagaa acacagacac gccgaattcg 3960 aagttggtga caaagtgttg gtacatcagc aagcttactg gcctggttat cataaaggtt 4020 tgaaacttca ccacatttgg tatggtccat tcccagtgac tgctgcagac ggtgcaaacc 4080 tcacattaga tttaccccgt caaaggactc gtagaaatac aacgttccac atgaaatata 4140 tcaaactata cgatgagagg actaatgcta caccaacagc accaccagtt acacccggcc 4200 agatccgcca acgcaccaac gaaatcacca aaattgtttc actcgataca cagttaaaca 4260 agatcgaggc ccaatggcaa cattgtgaac cttctgacat ctccctggtt tctccacagg 4320 atttacaacg cactccgtac ttggaacgct tattagatca ttatgacatg agtgaagctg 4380 ataaaatcgg taccaatagg ttccggaaac atcgttaagt acctacataa tgctatctat 4440 acacgtattg tgtctatcta cgttgctaat ttacactttc atagctatcc cgttatcgtc 4500 gatcctcgcc tagtaagcat aatttccaat cgcctcctat tattcttttc gtttatttta 4560 agagctgtgg gaccaactct actaaaaagg gggag 4595 // ID Gypsy-87_MLP-I repbase; DNA; FNG; 5439 BP. XX AC AECX01000135; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-87_MLP_; KW Gypsy-87_MLP-LTR; Gypsy-87_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5439 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000135; Positions 47997 53435. 
XX CC Positions [2713-3168] - Reverse transcriptase CC Positions [4272-4751] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1431..2549 FT /product="Gypsy-87_MLP-I_2p" FT /translation="MNFHQSRGPSSVSAVSALPQHMQRANNHRQPHVSAPP FT RSFQDVSAGFAQVPVQPYQLPPHMYNLSPTQYTLPSVQGPVPLQHPGFPQA FT MGVPAAVATQDPVQQVAALFSEYQNIDQLTYYDNLQTDSLDARPMFDPTGD FT GISDEPTISSISAIHFKNSSTMDHRLILSVDLIVNGVVFPAKALVDSGASG FT SFVDKAFVRKHDLSLSLRPFPLKCVTFDGSEGKDGLVTHSWKGVLSISDVT FT GSLFKSSISLDVTSLSGYDLILGMDWLRSHDGWVGGCGPSLRLDSPINHSF FT SKITGEYSSFASLSNLSSSAFSSSLITPSLSSSILSSPLIAPSSISWEKSI FT PLEFRRFADVFNSQGPTTLPPHRPAFDLEI" FT CDS 3486..5417 FT /product="Gypsy-87_MLP-I_3p" FT /translation="MAAILSQPDKDGNLLPVSFFSRKLTPAERNWQVHDQE FT LGIIVEAFEEWRAWLTGTPTPVTVFSDHANLCYFMTSQKLTPRQARWASYL FT SSFHFQILHTPGKINPADPLSRRPDYEVISNESNRMTLLKPYMVKDGVSVC FT SIDTTFIIPSAATRQLLIDSYKEYDNVDGSKPPPPFFTFQGGLWWFRDRLF FT VPPTLRQQILSAFHDSTSMGHPGLAHMLSAVTRTFAWKTVRKDIIDFVRSC FT DSCQRTKRSTQNQVGRLIPSAIPDWPWSTIGIDFIVKLPISSGCDSILVIT FT DHFSKGAHFIPCRESMSAPQLASLFISQFFRYHGFPDKIVSDRGSTFVLAF FT WISVQRQLRIHPAPSTAFHPQTNGQTERTNQVLESYLRHFIGYRQDDWVEL FT LPMAEFCFNNSTSSSTNLTPFFSWQGFHPRANSFTEFSRVPHADKFVEHLE FT ATQLNLLLSLKHAKQAQADQYNKHSREAPVYSPGELVWLSRQNITTARPSS FT KLDYRRIGPFPVVQMIGRNVVKLNLGSTYSRLHPVFNTSLISPYISPSVGG FT RVTPERRIEGQVAPTHDWRHVSAVLDYRKKGKAAPKYLLRWLGRPISDDTW FT VSLTDISSDLDLFLLNFHAKYPQFPCPSKLLDNRYAMGYQAALGNL" XX SQ Sequence 5439 BP; 1395 A; 1108 C; 1030 G; 1906 T; 0 other; ttatattata tatgtgtata gaatggttat tttgtttgtt tatatttttt gtttctttat 60 ctttctgtca tatcgactta tctctttaaa gaaattggtg ttcttctcaa cttacctctc 120 aaatcttaaa aattcttcaa catctatttc aatacttgtc aatttttctt ttaacactct 180 taatcatgtc tgcttctgat gcatcgccac aatctcaagc ttatcagctt tcttcaactg 240 aacaacaatt cgttaatcgt tatctggaac cacgtttgga tgcttttaat aagtcggtta 300 cgaaggtctt tgatgaacga aatgcaagtg cttcagatgc ggtggtaact ttagttaatg 360 atgctatcat tagagcaaaa aatgatgtga tgatggcttt acgtgcggag ttcagtaata 420 aaatttctgt tgatcacttt gatgcaaaac ttaaagaagg 
tttggattcg acttttgcca 480 cggtgaaacg tgatgttctc aatgttttgg acgaagaatt aatctctgtc aggcattctt 540 cgactttagt tcggtctgcg gaacctcctg ctttaccgga tctcgttttt agtggagatg 600 ctcgaaaggt tccaggtttt atcactacaa ttcgagatac tctcttttca tcttcttctt 660 gttttgcgaa tgaggatcga aaaatcacct gggtggcttg tcattttcca ttagcttcgg 720 ctcctcatga ttggtggtta agtttgctaa gacagaatga tcgcgaaaat ggtatttcgg 780 actcatcagc attttcggtc acaccaactc ctttcacgat agaacctctt acctctttag 840 cttcgttttt caaagaattt ttagataagt tcggcgataa gtttgctgaa gaaaggttgt 900 ttttggatct tcaaaatttc aagcaaggaa aactccctgt tggacaattt atttctcgat 960 ttaattctat ggctttccag cttcaacttt ccaatgtggt gttaaaaaat ttatttactg 1020 gaggtcttcg tccagatgtt aaggattgag cgcttaaaca tcctgattgg tttaagtgta 1080 agactgtgga ggaaaggcaa gcggtggcaa caatggcttc agaacaaata atgcatgtgg 1140 ttgcgaatag ttcatctttg ccagttcaac gttccttttc atctccgcct attgtcccga 1200 ttccccgtga tccaaacgct atggaagtcg acgttaacgc tatcaatgct cgcccaactt 1260 ttccgccctt gccgaatgga attttgtttc aattttttgt caaattctgt catgcacgaa 1320 agatgtgcca taaatgttta aagccgtatg actctactca ttgaactccg gatggtaaga 1380 gtttgtgtcc taacgctcct ccaaagacga ctcaagaaat caaaactttt atgaactttc 1440 atcaatctcg aggaccttct tcagtttcag ctgtgtcagc tttacctcaa catatgcaac 1500 gtgcaaataa tcatcgtcaa cctcatgttt cagctcctcc tagatctttt caagatgtgt 1560 ctgctggctt tgctcaagtc ccggttcagc cttatcagct accacctcat atgtacaatt 1620 tatctccgac tcaatacact ttgccctcag tacaaggtcc agtgcctctc caacatcctg 1680 ggtttccgca agcgatgggt gttccagcag cggtagcaac tcaagatccg gttcagcaag 1740 tagcggcctt gttctcagaa tatcaaaata tcgatcaatt gacttattat gacaacttac 1800 aaactgactc acttgatgct cgtcctatgt ttgatccgac gggtgatggt atttcggacg 1860 aaccaactat ttcatctatt tcagctattc atttcaagaa ctcatccacg atggatcatc 1920 gtcttatttt atcagttgat cttattgtta acggtgtggt ttttccggcc aaagctttgg 1980 tggactcggg agcttctggc agttttgtgg acaaagcttt tgttcgaaaa catgatcttt 2040 ctctttcgct tcgacctttt cctttgaagt gtgttacttt tgacggttct gaaggaaagg 2100 atggtttggt gacgcattct tggaaaggcg ttttgtcaat ttcggatgtc 
acaggttctt 2160 tatttaaatc atcaatttca ttagacgtta ctagtttaag tggttatgat ctaatcttgg 2220 ggatggactg gttgcgttct catgacggtt gggttggggg ttgtggtccc tctttacgtt 2280 tggactcacc gattaatcac agtttttcta agataactgg tgaatattcg tcttttgcat 2340 cattatcaaa tctatcttct tcagcatttt catcctctct catcactccc tctttatctt 2400 cttcgatatt gtcatctcct ctcatcgctc cctcttcaat ttcttgggaa aaatcaattc 2460 ctttggagtt tagaaggttt gctgatgtgt ttaattcgca gggtcctaca actttacctc 2520 ctcatcgacc agcttttgat ctggaaattt gattaaagga gggcgctgtg ccaacttatg 2580 gtggttccta tttacttttg aaggatgaag gattggatcg atactcaatt agcaaaaggc 2640 aatattcgac attcctcatc tccggctgct tctccaattt ttttcgttaa ggttcctggg 2700 aagaaaaacc gaccttgtgt agattataga tctttaaatt caatgactat ccgtgactca 2760 tttcctattc ctttgttagg agatttatta actaaggttt ctggctgtag atattttact 2820 aaattagatt taaaatccgc tttcaatttt attcgtgtta gggaagggga tgagtggaag 2880 actgctttta gaaccccctg gggtttgttt gaatgtctag taatgccttt tggattagcc 2940 aatggccctg cttgttttca acgtttcatt caatctgtct tatctgaatt tttaggtatt 3000 tcatgctttg tgtatattga tgatatttta attttttcca agactcgtca agaacaccag 3060 atacatgtag agcaagtttt actcaaattg caacagaatg aattacgtgt atctcaggac 3120 aagtgtctct tttatcaatc tgaagttacc tttttaggtt ttgttatctc tagcactgga 3180 ctgagaatgg accctaataa gttgaaacca ttgctgactg gccttatccc gtgaattcga 3240 aacagcttta cagattttta ggttttacca atttttatcg tcgttttatt catcaatttt 3300 cagacatttc ttctccactt acagctttga ctcaaaagac tgctgacgta gtgactggtt 3360 tagcaaatcc tgagtgtttt tatgctttca aaaatcttct acatgctttt acggttgctc 3420 ctcttctttc ccattttgac tttgagaaac ccaggtcatt acaagttgat tgttcaggtt 3480 ttgcaatggc tgcaatactc tctcagccgg ataaggatgg taatctttta cctgtatctt 3540 ttttttctcg taaattgact ccagctgaaa gaaattggca ggtacatgat caggaattag 3600 gcattattgt tgaagccttt gaagaatggc gagcctggtt aactggtact cctactccag 3660 taaccgtatt ttctgatcac gccaatcttt gttatttcat gacctctcaa aagttaactc 3720 ctaggcaggc gcgttgggct tcttatctca gttcttttca ttttcaaatt cttcatactc 3780 caggaaaaat taatcctgca gatccattat caagacgtcc tgattacgag gttatatcca 
3840 atgaatcaaa taggatgact cttttgaagc catatatggt aaaagatggt gtttcagtat 3900 gttctattga taccactttt atcattcctt ctgctgcgac tcgacaactt ctcattgatt 3960 catataagga atatgacaat gttgatggat ctaagcctcc gcctccattt tttacttttc 4020 aaggaggttt gtggtggttc cgtgatcggt tatttgttcc gccaacactt cgtcagcaaa 4080 ttctttcagc ttttcatgac tctacttcca tggggcaccc gggccttgca catatgttga 4140 gtgcagtgac taggaccttt gcttggaaaa ctgttcgtaa agatattatt gattttgtcc 4200 gaagctgcga ttcatgtcag agaacaaaac gctctactca gaatcaagta ggaagattaa 4260 tcccgtctgc gataccggat tggccttgga gcacgatagg tatcgatttt attgttaagt 4320 tacctatatc atccggttgc gattccattt tagtaatcac agaccatttt tcaaaaggtg 4380 ctcacttcat cccttgtcgg gaatcaatga gcgctcccca attagcatct ctttttatat 4440 ctcaattttt tcgatatcat gggtttcctg ataaaattgt ttcagatcga ggttcgactt 4500 ttgttttggc cttctggatt tctgttcaaa gacagctacg tatacatcct gctccctcta 4560 ctgctttcca cccacagaca aatggacaaa cagagaggac gaatcaggtt ttagagtcat 4620 atttaagaca ttttattgga tatcgacaag atgattgggt tgaactatta cccatggccg 4680 agttttgttt taacaattca acttcttcct ctaccaatct tactccattt ttttcttggc 4740 aaggttttca ccctagagcc aatagtttca ccgaattctc aagggttcca catgctgata 4800 aatttgtgga acacttagaa gcaactcaat tgaacctttt gttatctttg aagcatgcga 4860 agcaggctca agcagatcag tacaacaaac acagtcgaga agctccagtt tattcgcctg 4920 gggaattagt ctggttgtca cgacaaaata taaccacggc taggccttca tcgaaattag 4980 attatcgtcg tattggacca tttcctgtag tacaaatgat tggaagaaat gtggttaaat 5040 taaatttggg cagtacctat tcaagacttc atcccgtttt taatacctct ctcatctcac 5100 catatatatc tccttctgta ggaggtcgtg tcaccccgga aagacgtata gaaggacaag 5160 tggcacctac tcatgattgg aggcatgttt cagcagtttt ggattatcga aagaaaggca 5220 aagcagctcc caaatatttg ttaaggtggt taggaaggcc gatttcagat gatacttggg 5280 tttccttaac agacatctcg tctgatttag atttatttct tttaaatttt cacgccaaat 5340 atccacaatt tccatgtcct tcaaaacttt tagataatcg ttatgcgatg ggatatcaag 5400 cagcgttggg aaatctttag ggtagacgaa aatcattat 5439 // ID Gypsy-12_LBS-LTR repbase; DNA; FNG; 340 BP. XX AC ABFE01000693; XX DT 29-JAN-2011 (Rel. 
16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_LBS_; KW Gypsy-12_LBS-I; Gypsy-12_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-340 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000693; Positions 17331 16992. XX SQ Sequence 340 BP; 53 A; 106 C; 49 G; 132 T; 0 other; tgtcacagtc atggtcacgc cctcttcttt gcgacacatt caacgcgtcg catcgcgtct 60 tcatttatct atgtttgcca ttgttcccct tcttgactgt ctcttcgcag tcctctctct 120 ttgtgccttc ttgcacgccg tgtttacttg tcaccattcc tttcctccat tcattctctt 180 atgcatgtct tccttaggta tgctctcttg ttattcattt agtatatagc tactctgtgt 240 acttcgtagc tactcgtcta atcaacgcac attcctgcct ttgctcattt ccacccgtgt 300 gaacttctct cgtcgtctca atctaacctt gcccgtaaca 340 // ID Gypsy-27_MLP-I repbase; DNA; FNG; 6044 BP. XX AC AECX01000158; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-27_MLP_; KW Gypsy-27_MLP-LTR; Gypsy-27_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6044 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000158; Positions 28226 22183. XX CC Positions [3333-3758] - Reverse transcriptase CC Positions [4965-5444] - Integrase core CC 'AAGA' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 319..1989 FT /product="Gypsy-27_MLP-I_1p" FT /translation="MPLQRSPTRKAGDPIAPAATDSAVTGSTNPAAAPGST FT DTGTARRLTEREREILVSHLPASATSSQQLTAEQMDALTVDQRLVVNVLAS FT DYNEIPVVIPAAVYNFITKGLNEDLLDYVENDFPEKNEPIGVEGKLPENVG FT FRQHCKRVSNIISSDTSSATSNDDLDIRKLKPSPSFLESLDAKPAIDPLVG FT LNLKAHEVAAFREFQAMKANPDTLEAKISDLHLRGSRDGTPVLETRARTTS FT VQPTLLRVHDNSSVAKVVAVEPTGVRPAERLVVDPSLIIKGLGRDKIPKFS FT GINQDVTTYFDNFRVELEMRFCHPDVQHQVAFQWVAGAAKEILVKAVRDNN FT RPVSYDALTAFFMNAFPATVDLDTVDQELDALRQHGGETVIAYWTRHQLVT FT QKADILELPYDPVLTFKKRLVPGPVKEHVENYISMKKLDGITPQIKDVVAV FT AIERDQRPDIKHYTNANRQVSAIVSTSHHQNGNRKRQRSRMESSNGRTSTI FT EPICYNCNQKGHRFGSIDRPTCGAPITNKTRNYFKRNSKKSKGEPENAVAS FT GSGSSKGPAN" FT CDS 2211..6011 FT /product="Gypsy-27_MLP-I_2p" FT /translation="MGRYSKMSTDSESVSISTEHLMEDPLAVEKDIERTFH FT ERESPSSSTQQSQEFALGREIPISISISEVTENHSQTIVEDNASSPASEPR FT VPTTGYIDLDSVDSLAVLENNTSETRADFILTLNGVITKILIDTGASMSYV FT SDTYARKRNFLTSKNSDRPIVRGVWGQPYECKDIAEVKFSLSGLNYSHSCR FT VAPLTNHDVILGLDWITAYAVSTDWAKGEWTLRDKKARRAKFYPSRIPLPS FT LKHHTISGLSDIPIDELPCTRSQMRRAMRRPGTEVYLLQMSEALSPVDEVD FT GPREAPAPCLPRIEASDPRTKLRVENLLDEYKNLFDEVSQAPKQERVIQHL FT IDTGDAPPVSQPVRRLSPLLLDELRNKLDVLQKNGFIRPSTSPWSSPILFA FT RNASGKLRFCIDYRAVNALTKRDRHPLPLIQDCFDQLHGARVFSKFDLQQG FT FHQMKVSSSHVPKTAFSTRYGHYEWLVMPFGLVNAPSTFQRMMSDILRPFL FT DKFVQVYMDDILVYSSSPEEHEKHVRQVLQALENAELKVSGSKSLLFADEI FT QFVGHMISANGIRPMEDKIKAIQEWPRPSNVHDVRSFLGLSGYYRRFIKDF FT SKIASPLHELTSGNVTKKAKIEWQPKHEAAFIKLKSSMLTAPVLIVPDPPK FT PYIIETDSSDFAVGAVLLQVGPDGKEHPIAYESAKLNQAQQNYPAQEKELL FT AMIHAWRKWRNYVEGAVADTLVRSDHLSLTGLQTQVQPSRRLSRWIEEFAE FT MTIKVEYKTGVTNIVPDALSRRSDLLQSIFDDVTGLRDPSDWPLLVPYVLG FT QKELPEWVDAALVDQVVRGSLGFIWKPKEEVLYWKGTKDNPEESPFVPFSQ FT RIALIDRFHKKYGHRGRDGTLSLLKDRGWWPGRYSDVKEYCSYCPQCQIFD FT VPDKGQETAKQEPLPSVEPFERWAVDFISLPESKDGFKWILTMIDHCTNWP FT IAIPMKLATSANVAEALTRELIQVYGVPSEILSDRGANFLSKEMQAFYQGA FT HIRKLNTSGYHPRTNGKNERFNGVLEKALFKANRTNDPSRWPEYLPEALFA FT CRINKSTVTKWSPFELLYGVSPRLIGDRAKFRPVDIPTSESGNDARLEKLR FT 
AARSEATEASKKRSTENKARFDSQFVPDENGKSRSVIKTYEIGDSVKLRNE FT SHTKGEPYWYGPFTVYDSLGKNVYTLSDHTGSKFPHPIGGNRLKPAKIKDK FT SLGEVWALPPRLLQDIKNEDLKVSRTLKLKAKQLANTQIKITPRIKIVGKF FT ATDAAA" XX SQ Sequence 6044 BP; 1666 A; 1507 C; 1414 G; 1457 T; 0 other; ttggtagcga gatctattct ttttacgacg aaagctgata gtcatcattt cgtatttcct 60 tcgctattga ttctatctcg tgtctattct gttatttatt ccataacgtc atagtcgttc 120 gactctttct gggaccaata atctttcgag accagagcgc ggagatagtt cattctcagc 180 aaagctaatt gatcattgtc agctacctgt cttgcttttg aagaatatta atagtcttaa 240 atccatcatt gagtgaaagt gatactgacc tgaaattttt tcggactcct tttcgttttt 300 tttgcattaa cagtaacaat gcctcttcag cgaagtccta ccagaaaggc aggagatccg 360 attgcccctg cggccacgga ctcagcagtg actggatcaa ccaatccagc agcagcacca 420 ggatcgactg atactggtac agctcgtcgc ctaaccgagc gcgaacgaga gattctagtc 480 agccatctgc cagcctcggc tacgagttct caacaactta cggcggaaca gatggatgcc 540 ttaacggtgg atcagcgact cgttgttaat gttctggcga gcgactacaa cgaaatacct 600 gtcgtgatcc ccgccgccgt ttacaacttc atcacgaagg gattaaacga agatctcctt 660 gactacgtag agaacgactt ccctgagaag aacgaaccca tcggagtcga aggaaagttg 720 ccagagaatg ttggattccg tcaacattgt aagcgcgtct ccaacataat ctcctcagac 780 acatcttcag cgaccagcaa cgacgaccta gacatcagaa aactgaagcc atctccgtcg 840 ttcttagagt ctctggatgc caagccggcc atcgacccgt tagtcggcct taacctcaaa 900 gctcatgagg tagcggcttt tcgggaattc caggctatga aagctaaccc ggatacgctc 960 gaggcgaaga ttagcgacct ccaccttcgc ggatctcgcg atggtacccc tgtcctcgaa 1020 actcgtgcta ggacgacttc cgttcaaccc acgcttcttc gagtccacga caattcctcc 1080 gtcgccaagg tagtagcggt cgagccgact ggcgtcagac ctgctgaaag actagtcgtg 1140 gacccgagcc ttatcatcaa aggcctcggt cgagacaaaa tccccaagtt ctccggtatt 1200 aatcaagatg tcacgactta cttcgataat tttcgagtcg aactcgagat gagattctgt 1260 catcccgatg tccagcatca agtcgccttc caatgggtcg ccggagccgc caaagaaata 1320 ttggtgaagg cagtaaggga caacaatcga cccgtttctt acgacgcttt gactgcattt 1380 tttatgaatg ccttcccagc cacagtcgac ctcgacactg ttgatcaaga gctcgacgca 1440 ctgagacagc atgggggcga aaccgtcatc gcctactgga cacgtcatca actcgtgacc 1500 
cagaaggcag atattctgga actcccttac gatccagttc tcaccttcaa gaagcgcctc 1560 gttccaggcc cagtcaagga gcatgttgaa aactacatct ccatgaagaa gttggacggt 1620 atcacccctc agatcaagga cgtggttgcc gtcgccatcg agcgcgacca acgccccgat 1680 atcaagcact atactaacgc caaccgtcaa gttagtgcta tagtatctac ttcccatcac 1740 cagaacggca atcgaaagcg acaacgttct aggatggaaa gctccaacgg tcgaacctcc 1800 accatcgagc ctatctgtta caattgtaat cagaagggtc accgtttcgg ttccatcgat 1860 cgtcccactt gcggagcacc cattaccaac aagacgcgaa attatttcaa gagaaactcg 1920 aagaagtcta agggtgagcc ggagaatgcg gtagctagtg gtagtgggtc atcaaaaggc 1980 ccggcgaact agatgtagct ccctcttttt cgcctaatct gttatcttgt ccagttgtca 2040 gcgatgtagt atgtaatgca tttgtcttga cccagactaa acaagaggag cctaaaaatt 2100 cgttctcgag ggatgagccc tcgggatcag acagagaatc tgtcgtcctt tcagccagta 2160 agctagctga taattccgtc ccacttagag gggatcctga atattctgac atgggcagat 2220 attcgaagat gagtaccgac agcgagtctg tctccatttc aacagaacac ttgatggaag 2280 atcctttagc agttgaaaag gatattgaga ggactttcca cgagagagaa agtccctcct 2340 cctcgacgca acagtctcag gaatttgcgt tgggtagaga aataccaatc agtatttcta 2400 tctctgaggt tacagagaac cacagccaaa ctattgtgga agacaatgca tcctcgccgg 2460 ccagcgagcc cagagtcccc acaacggggt atatcgacct tgattcagtc gatagcctag 2520 ctgtgttaga gaacaacacg agtgagaccc gtgctgactt catcctcaca ttgaatggag 2580 tcataacaaa gatactcatt gatacaggtg catcaatgag ttacgtgtcg gatacttacg 2640 cacgtaaaag aaacttcctt acgagtaaaa actcagatcg tcccatagtg aggggcgtgt 2700 ggggacagcc ttacgaatgt aaggacatag ctgaggtgaa attctccctt tccggattga 2760 attactctca cagttgccga gtagcccctt tgactaacca cgatgtgatc ttaggactcg 2820 actggattac agcatacgca gtatccacag attgggcaaa gggtgagtgg actttgagag 2880 ataaaaaggc cagacgagcc aaattttacc cctcaaggat ccctctccca tctctcaaac 2940 accacactat aagtggcttg agcgatattc ccatagatga actcccctgt accagaagtc 3000 agatgcgtcg tgcgatgcga cgtcccggta cagaagtgta tttattgcaa atgagtgaag 3060 ccttgagccc cgtcgacgaa gttgacggac ctcgagaagc tccagctcct tgtttaccaa 3120 ggatcgaggc atcagaccct cggactaagc tacgtgtcga gaaccttctc gacgagtaca 3180 aaaacctctt 
cgatgaggta tctcaagccc cgaagcagga aagggtaata caacatctca 3240 tagacacagg agatgctccc cctgtttcac aaccggtacg tcgtttatcg cctttactcc 3300 tcgacgagtt aagaaacaaa ctcgatgtgc tgcagaagaa cggttttata cgtccttcga 3360 catctccttg gtcctcgccg atccttttcg ctagaaacgc gtcaggtaag ctgagatttt 3420 gtattgacta tcgagcagtc aatgcactta ccaagcggga ccgccaccca ctcccgttga 3480 tccaagactg ctttgatcaa ttgcatggag cccgggtatt ctcgaaattc gatttgcaac 3540 aaggctttca ccagatgaaa gtttcctcgt cgcatgtccc caagacagct ttcagcaccc 3600 ggtacggtca ctacgagtgg ctcgtgatgc cttttgggtt ggtgaacgcg ccctctacct 3660 ttcagaggat gatgtcggat atcctgcgac catttttgga caaattcgtc caggtctaca 3720 tggatgacat cctggtttat tccagcagtc ctgaggaaca tgagaagcat gttcgtcagg 3780 ttcttcaggc tttggagaac gctgaactca aggttagcgg ctccaagtca ctattatttg 3840 ccgacgaaat tcaattcgtc gggcatatga tatccgccaa tgggattcgt cccatggaag 3900 ataagatcaa agccattcaa gaatggccgc gaccttcaaa cgtccacgat gtaagatcct 3960 ttttgggatt gtcaggctat tatcgacgat tcattaaaga tttttcgaaa atagcatcgc 4020 ctcttcatga acttacctcc ggtaatgtga ccaagaaggc gaagattgag tggcaaccta 4080 agcatgaagc ggcattcatt aagttgaaaa gcagtatgct aactgcaccc gtgcttatag 4140 tgccggatcc ccccaagccg tacatcatcg aaacagattc gagtgatttc gcggtgggcg 4200 ctgtcctgct acaagtagga cctgatggga aagaacatcc catagcctac gagtctgcca 4260 aattgaatca ggcgcagcag aattatcctg cgcaagaaaa agagctcttg gccatgattc 4320 atgcatggcg caagtggcgc aactacgtag aaggagctgt ggcagatacc ttagtacggt 4380 ctgaccacct tagtctcact ggtctgcaaa cgcaggtcca gccatccaga cggcttagtc 4440 gatggattga agagttcgcc gagatgacca ttaaggtcga atacaaaact ggcgttacta 4500 acatagtacc agatgcgcta agtcgccgaa gtgacctctt gcaaagtatc ttcgacgacg 4560 tgaccggcct acgggatccc tcggattggc cgctcctggt gccttacgtc cttgggcaaa 4620 aggaattgcc agagtgggtt gacgccgctc tggtggacca agttgttcga ggatccctgg 4680 gattcatttg gaaacccaag gaagaagtcc tttattggaa aggcacgaaa gataaccctg 4740 aagagagccc tttcgtacct ttttcgcaaa ggatagctct tattgaccga tttcataaga 4800 aatatggcca tcgcggccgt gacggaactc tgagtctgtt gaaagacaga ggttggtggc 4860 cgggtaggta ctctgatgtc 
aaagagtact gctcgtactg tccacaatgc caaatctttg 4920 acgtacccga caaggggcag gaaacggcca aacaagagcc cttgccgtct gtcgagccat 4980 tcgaaagatg ggccgtagac tttatctctc ttccagagag caaagatggt ttcaaatgga 5040 ttcttacgat gatagatcat tgtactaact ggcccattgc cattccaatg aaactagcaa 5100 cgtccgcgaa cgtagcagaa gcgctcactc gtgaacttat ccaagtttat ggagttccaa 5160 gtgagatatt atctgataga ggcgcaaact ttctgtcgaa agaaatgcag gccttctatc 5220 agggagcaca catccggaag ttgaacactt ctggatatca tccgcgtacg aacgggaaaa 5280 atgaacgctt taacggcgtc ttggagaaag cgctattcaa agctaaccgc acgaatgacc 5340 cgtcgaggtg gcctgaatat cttcctgagg cattattcgc ttgtcgaata aataagagta 5400 cggtaaccaa atggtcaccg ttcgagctcc tttacggagt aagtcctcgc ctcatcggag 5460 acagggccaa gttcagaccg gtggatatac ccacatctga aagtggcaac gatgctcgct 5520 tggagaaact tcgtgccgcc aggtccgaag ccaccgaagc gtcaaagaaa agatccaccg 5580 agaacaaagc tcggtttgat tcgcagttcg ttcccgacga aaacggtaaa tctcgttctg 5640 ttattaaaac gtacgagata ggtgattcag tgaaattaag aaatgagtcg cataccaaag 5700 gtgagcctta ttggtatgga ccttttacag tatacgattc cttgggaaag aatgtgtaca 5760 cactgtcaga ccatacaggg tctaaatttc ctcaccctat tggaggaaat aggttaaaac 5820 cagcaaagat taaggataaa tctttgggag aagtttgggc attgcctcct cgactacttc 5880 aagatatcaa gaatgaagat ctcaaagttt ctaggacgtt aaaactcaaa gccaaacaac 5940 tcgccaacac ccaaattaaa attacgccaa gaatcaagat tgttgggaaa ttcgcaactg 6000 acgctgcagc ctagggatgc tgcaacattt taaaaagggg gtgg 6044 // ID Gypsy-1_GGr-LTR repbase; DNA; FNG; 295 BP. XX AC ADBI01000108; XX DT 12-MAR-2011 (Rel. 16.03, Created) DT 12-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Gaeumannomyces graminis genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_GGr_; KW Gypsy-1_GGr-I; Gypsy-1_GGr-LTR. XX OS Gaeumannomyces graminis OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Gaeumannomyces. XX RN [1] RP 1-295 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Gaeumannomyces graminis genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; ADBI01000108; Positions 40781 41075. XX SQ Sequence 295 BP; 67 A; 88 C; 74 G; 66 T; 0 other; tgtcacacgg ggattgagcc ggtgccgatt gagggccgtt aggacggcca acaggacgaa 60 tcacaaggac ctcgcatgta ctacgcgccc ggtcgagagg ggtagctccc ctcgccggga 120 cacgtacccg gtcatacccc tctctctctc ggtctctcag acggcctata taaaagaggc 180 ccttcgctca aggagctcct gggagattct gcttccttaa gcatctccta aaggccttga 240 gaaattatca atacattgag cactctccaa gccggttatt gatccctttg taaca 295 // ID Gypsy-2_LENY-I repbase; DNA; FNG; 4806 BP. XX AC AAPO01000122; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Lodderomyces elongisporus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_LENY_; KW Gypsy-2_LENY-LTR; Gypsy-2_LENY-I. XX OS Lodderomyces elongisporus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Lodderomyces. XX RN [1] RP 1-4806 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lodderomyces elongisporus RT genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; AAPO01000122; Positions 21830 26635. XX CC Positions [3663-4154] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1005..4790 FT /product="Gypsy-2_LENY-I_2p" FT /translation="MSETRKVKVSMDEVSVQRPEEVLEVKEELVERNDAKE FT VSVPNVQHSVIEDNSIELDEITQRTESIILQDSESGLEGSKELELFELTVS FT QGFDPAKVNLPPPSPVENSKFYVPVVMMLTAVKGKECMALLDSGSGEDLVL FT SKFIEGKIDSLPQDACLCDVIVAFGASRQVTKRVLLEFSINGIHFSRWFMV FT VPGFTKDMVLGLDFVGEHEDLISFKKRSFAGVSKENPIGLVDSDEFVEAVG FT NAAEVGLFTLKTEEKVDKMPMDAANRFPDLIKEYKDIFLQELKETPVSRGQ FT WDHRINVIPYVTAPSGKQYPLAKPEFEELKSQVRKMLDAGLIRQLGVGESD FT FNSPVLFVRKKDGSYRMCVDYRLLNLTVVQKQFGMPVVEQLIKEVSGYRYY FT STLDMTQSFHQIRLDDETSHVTAFTAFGKKFAYNVLPFGFTNAPAILQETV FT SQLIQEIPGCVNYIDDIIVYSNTIEEHRKSLLLLFEAFRRNKFFFKGSKCE FT LGVSKVTFLGHEVGQKGTRIPEVQLEVIKAIKTPDNPKDMRSALGFFNYFR FT HYIQNFSKIAAPLYEYATKRKVEFTQVHDKAFNQLKADLVKSDALVKVNYE FT LKPVRFQLEVDASHHAIGAVLKQVHEDKVVGVIEMVSRTLTVAEKNYPIRQ FT KELLLIVFAVKKFRHYILGYETVVYTDHQSLESLFASNTRPESERIIRWLE FT SLQEVQLKVRYRNGDANVLADVLSRLVDAGRVEVHDLDTTVAEVTNTEETV FT TQILLKENVIDQIKKSYTTDAYLTTILNLLTDTDRTRNPIPPELNSAIKKY FT SLTDGVLYYATQSGQKPVVGKSVARMVLEKVHSFGHFGITKCYFAIQPYLF FT VPKLLEVVTDYINSCDHCQRAKAIHGSVGGLMLPSDVPKDVFSTIHLDFVT FT GIPTTPEGYDCILVVVCALTKYCVCIPTRKSTKVVQTAKLLIDEVFSVFGV FT PHTIKSDKDIQFMNSIMKYVFEFYQIDFRTTSTNNPSTNGQVEALNKVVVQ FT TIKSFCHKEHALWSHYLKVVQFAINTNYAPAIRMTPYQAVFGRLPRDRTGI FT LDINMKHSSLSAEQLVRRAEAIHTQIKDTMSLSQDAMSHKANRGRNPGRFE FT VGDLILLHREAYWRPGKYKKLTDVYHGPFRLVKKINDNAFEVDLPRMNKKD FT RVINIKYFRKYIEQRQVFKEPPINLVEAEARVEDLSAILGYDAEKFEFDVT FT WVGCRPGHGSTIPIEWFYRKVPKVLRDTLIANAIDFLGDRFKDADDDLEEE FT V" XX SQ Sequence 4806 BP; 1385 A; 842 C; 1216 G; 1363 T; 0 other; cttaagtaga aagatattat aaactatctg gtagcgtcgc ttccataata tttttcttaa 60 gaaatcactg ctagtatgac tccaagatct gaaaccgcta gaaacgcgga atcccaaacc 120 gctcacttcc ctatcgaggc catggaggcc ttagtcaaca accaaggggc catgtttgct 180 aaaatggttg aagctatgga ggagattaag tttagtgctg cttcaggtta tgaatctaac 240 ccaaaggtat tggaaaatcc acgtgacccg gagaagttac agactttcat tagacaagtc 300 gaaactcggt cccaaaggta tcccgagtac agggataaca aggtggatta tgccagacgg 360 ttcatggctg ggtcggtctt ggaaagatgg gtcttaagtc aacccgatat 
ttggcaactt 420 agttggtcag atttcctggc tagactccgt aattccttct tggatccgtc gtacggagac 480 aaggttgaga tgcagttaga gaaactcaaa cagaccagta cggtcgcgaa gtatgtggag 540 gagtttactc gtctcaaggc actcttgcct ccggggctca gatcggaaag atcgttgaaa 600 agagtcttcg tcatgggttt aaagccagtt attcgtcctt tggtgttggc tggtttacgc 660 aatgacgctg tgactcttga tgacatcatt aatgatgctc ttatgcaagc cacaggtgtg 720 gaagctggat gggttgatga ccttaaagcc tattccaatc gttggaatga gtttaaagca 780 tccaccaatg ctgatgctat ggagttagat gcggtctcct tcaagaagct tagaccccaa 840 gagaagaagg tattgatggc aaatggtggt tgcttcaagt gtcgtaagac gggacacttt 900 gctaggcaat gtccaatggg agggaagaaa gctgacaaaa ctagagctag ttcaaaaaac 960 tagttaggtc tctgaagctg ggggacgtta aacgaaacag caggatgtcg gaaaccagga 1020 aggtcaaggt gtcaatggat gaagtttcgg tacagcgtcc ggaggaggtg ctcgaggtga 1080 aagaagagtt ggtagagcgt aacgatgcta aagaagtgag tgtacccaat gtacaacact 1140 cggtgattga agataattca attgaattag atgaaatcac acagagaact gaatcgatca 1200 tacttcaaga ttcggaaagt ggcctggaag ggtcgaaaga gcttgagctc tttgagctca 1260 cagtgagcca ggggtttgat ccagccaagg tcaatttgcc acccccctcc ccagttgaga 1320 acagtaagtt ctatgttccg gttgtgatga tgctgactgc tgttaaaggt aaagaatgca 1380 tggcattgct tgattccggt tcgggcgaag atttggtgct gtccaaattt atagaaggga 1440 agatcgatag tctccctcag gatgcttgtt tatgtgacgt gattgttgca ttcggtgcaa 1500 gtaggcaggt aacaaaaaga gtgttgcttg aatttagtat aaacggtata cattttagcc 1560 gttggtttat ggttgtccca gggtttacca aagatatggt attgggactc gacttcgttg 1620 gtgaacatga ggacttgatc tcgttcaaga aaagatcctt tgctggagtc agtaaagaga 1680 accctattgg gttagttgac agcgatgaat tcgttgaggc agttggcaat gctgcggaag 1740 ttggtttgtt cacattgaaa acggaggaga aggttgataa gatgccgatg gatgctgcta 1800 acaggtttcc ggatcttatc aaagagtaca aggacatttt cttacaagaa ttgaaggaaa 1860 cgcctgtctc cagggggcaa tgggatcata ggatcaacgt aattccatac gttacagccc 1920 caagtggaaa acaataccca ctagctaaac ccgagttcga ggaacttaag tctcaggtga 1980 gaaagatgct cgatgctggt ttgattagac agttaggtgt gggagaatca gatttcaatt 2040 ctccggtgtt gtttgtccgc aagaaagatg gttcataccg tatgtgtgtg gattatcgtt 2100 
tgcttaatct aacagttgtt caaaagcagt ttggtatgcc ggtggttgag cagttgatta 2160 aggaggtatc aggatatcgg tattactcca ccttggatat gacgcagagt tttcatcaaa 2220 tcagattgga tgacgaaact agtcatgtga cagctttcac agcatttggc aagaagttcg 2280 cttataatgt gttgccgttt ggatttacta acgctccagc tatcttgcag gagactgttt 2340 cacagttgat acaggaaatc ccaggttgtg tgaattatat tgacgacatt attgtttact 2400 ctaataccat tgaggaacat aggaagtcat tgctgttgtt gtttgaagct tttagaagaa 2460 acaagttctt ctttaagggg tccaagtgtg agttaggtgt atcgaaggtg acgtttttag 2520 gtcatgaggt tggtcagaaa ggaactcgaa ttccagaggt ccagttggaa gttatcaagg 2580 ccattaaaac tcctgataat cccaaggaca tgcgaagcgc gttaggtttt ttcaactact 2640 ttcgacacta catccagaat ttctccaaga ttgctgcgcc tttatacgag tatgctacca 2700 aacgtaaggt tgaattcacc caagtgcacg acaaagcttt caatcagtta aaggctgatt 2760 tagtgaagtc agatgcattg gtaaaggtca attacgagtt aaaaccagtt aggtttcaat 2820 tggaggtaga tgcttcacat cacgctatcg gtgctgtact taaacaggta cacgaagaca 2880 aggtggttgg tgtcattgag atggtttcca gaaccttaac ggtagctgaa aagaattatc 2940 ccattaggca aaaagaattg ttgctgattg tgtttgctgt aaagaaattt aggcattaca 3000 tattgggata cgaaactgtg gtatataccg atcatcagag tttagaatca ttgtttgcat 3060 caaatacccg tcctgaatcg gaaaggatca ttcgatggtt agagtcatta caggaagtcc 3120 aactcaaagt tcgttacaga aatggcgatg ctaatgtgct cgcggatgtc ttatcaaggt 3180 tggtagatgc tggaagggta gaggttcacg atttggatac tacggttgct gaagtcacga 3240 atacagagga aacggtcact caaatattgc tgaaagagaa tgtcattgat caaatcaaga 3300 agtcgtatac caccgatgca tatttgacta caatattgaa tttacttacc gatactgatc 3360 gtaccagaaa cccaattcca ccagaactca acagtgcaat taagaaatat tcgttaacag 3420 atggggtcct ttattatgct actcagagtg gacaaaagcc tgttgttgga aagtcagttg 3480 ctaggatggt attggagaag gttcattcat ttgggcattt cggtatcacg aagtgttact 3540 ttgccattca gccgtatttg tttgttccaa agttgcttga agtggttacg gattatatta 3600 attcgtgtga tcattgtcaa agagcgaagg cgattcatgg ttcggtgggt ggattgatgt 3660 tgcccagtga cgtcccgaag gatgtattca gtacgatcca tcttgatttt gttacaggaa 3720 ttccgacaac acctgaaggt tatgactgta ttttggtcgt ggtatgtgcg ctaactaagt 3780 attgtgtttg 
tattccaacg aggaaaagca caaaggtggt tcagactgcg aagttgctaa 3840 ttgatgaagt cttttcggta tttggggtgc cccacacgat taagtcagac aaagatattc 3900 agtttatgaa tagtatcatg aagtatgtct ttgaatttta tcaaattgac tttagaacta 3960 cgagtaccaa caatccatct acaaatggtc aagtcgaagc cttgaataaa gtggtggttc 4020 aaacgatcaa gagtttttgt cacaaggaac atgcgttatg gtcacattat ttgaaggttg 4080 tacagtttgc tatcaacacc aattatgctc cagctattcg tatgacacct tatcaggccg 4140 tgtttggtcg gcttcctaga gatcggactg gtattttgga tatcaacatg aagcatagta 4200 gcttgtccgc tgaacagttg gtgagaagag ctgaggctat acatacacag atcaaggata 4260 ccatgtcttt gagtcaggat gccatgtccc ataaagcgaa ccgggggagg aatcctggcc 4320 gctttgaagt tggggactta atcttacttc atcgagaggc atactggcgt ccaggaaaat 4380 acaagaagtt gacagatgtc tatcacggtc cattcaggtt ggttaagaag attaacgaca 4440 atgcatttga agttgacttg cctaggatga acaagaaaga tcgggttatc aatattaagt 4500 atttccggaa gtacattgaa caacgtcagg tattcaagga gccaccaatt aatttagttg 4560 aagctgaagc tagggttgaa gatttaagcg caattcttgg ttatgatgct gaaaagtttg 4620 agtttgatgt cacatgggtt ggatgtagac caggacatgg ttctaccatt cccatcgagt 4680 ggttttatcg aaaggtgcca aaggtattaa gggatacctt gattgctaat gctattgact 4740 ttctcgggga tcggtttaag gatgctgatg atgacctgga agaggaggtt tgatttaaca 4800 aggggg 4806 // ID TPA5_LTR repbase; DNA; FNG; 322 BP. XX AC AJ439553; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Pichia angusta retrotransposon TPA5_LTR, long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Long terminal repeat; RNaseH; TPA5_LTR; gag; integrase; pol; KW protease; retrotransposon; reverse transcriptase. XX OS Pichia angusta OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Pichia. XX RN [1] RP 1-322 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). 
XX DR Genbank; AJ439553; Positions 1 322. XX SQ Sequence 322 BP; 128 A; 45 C; 48 G; 101 T; 0 other; tgttgagata ttagtctata atatgtcacg tgactatgca gagatatgca tctcctgcat 60 aaatattatg tgaaacaaat cctaatcagt gaggattaaa gataagtttc agaaagcaca 120 ctaaagattt agagtggtat tagaatctac tataaataca gttagattct ccctcatctt 180 aagtgaacat tcaattggta agatttacat ataaatctta taaatataag tttacacaaa 240 ctactgatat aaggtagtaa aaggtatatt aaattatcca aaagatgaca acttcagatc 300 agttaaagtt agcattccaa ca 322 // ID Gypsy-108_MLP-LTR repbase; DNA; FNG; 707 BP. XX AC AECX01000606; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-108_MLP_; KW Gypsy-108_MLP-I; Gypsy-108_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-707 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000606; Positions 94176 94882. 
XX SQ Sequence 707 BP; 203 A; 186 C; 116 G; 202 T; 0 other; tgtcagccct tagccctgat tacaggagca cgggctaaca acatcacata taataacata 60 atattgtcaa aagccataag ttacatctca aggctcagtg tcaagagcca cttaaacaca 120 tgtacgtaat gtattagaca cataacaaac ctaacccaac cattgagaca agagtcacac 180 ggcaaccaca agttgacaca tgcccttgtg agtatataaa cccctccttc ttcccttgat 240 tcatcccaca tcttttacta caatcacaac acagaaccac attttacttg actcctactg 300 agaggatctt caagtatgtg gttagttgtg tagcttgtta aagaccattc ttgagcatta 360 cgggttccat tgctgaacac ccaaggcact gatttcaaga gcccagggca ttcattgtct 420 ctagaaataa accttaaagt agccaagctt cctccctctt aaggagtcta aagtagcctt 480 caggatcttt ggtttaccaa agtgtaaagc ctgcccctcg cagtctcacg aggttagagc 540 acattagctc tttattgccc ttatatttcc tttccctttc tcaaatcctc ttcttagagg 600 tcaactagct gtagtcttca cttgggtgta ccacgtgaag actgcacctt cctttgccta 660 ctctattttc tgtgaagagt acgcctaaca gacacatact ccttgca 707 // ID Gypsy-3_MLP-I repbase; DNA; FNG; 5769 BP. XX AC AECX01001703; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_MLP_; KW Gypsy-3_MLP-LTR; Gypsy-3_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5769 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001703; Positions 149203 143435. XX CC Positions [4569-5024] - Integrase core CC 'CACGG' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 246..2363 FT /product="Gypsy-3_MLP-I_1p" FT /translation="MPNTRKNHDVLEERVDNPDQILSSRNQKPGLLPSPSP FT LANSSTSVTQPPALFTRLADTLTYPPLRQGERILSASKSVAELYKLNSVPL FT DTLSPNTSALLTDILHRPTGSTRRHSFLPGALPDSSGISLDTVHPMDQTSS FT FSGEGQPLSDNHAHNAANNSTDFESVLRQVREESADNAARMAADNDDLRAE FT LADIKAMIKAMAGLRPPTTPPSTHAPIATSTPAQTTATAIPPPCPDSTVRH FT QPVEATPPSTVQHDSSGNQPPFAAYQSPTGPAGGFNRELPANYTMSQPMPP FT YWAYPMVPAEREVLARHLKDSEIPLYTCDYGDVTGFRLWRYRIEAKFKVKG FT LNSDEERLKVLPAALAIPHAVQWHRTHEAELEGKSWVEAMEMFESGVLPSG FT WLRDAKQALRELTQKSGEDMQTYLSRAQRLQDTVSVATCDDKELAERIIGG FT TNALFRETAAKDDLIENAIDLRTGMWSFSLFEKRALDTARFLQAFEITHTR FT TGSRQPNQGGRAAQSNVAPSTGGSTVSRHPLPPIDQQARSERWAAFMRSTG FT RCPRCKTQCSKWLGGCDAAPNMMYIPFPPDFPRRPPYPAPNPAAHAPVQAS FT SSTSRPGAPRRGPVIPVASVQVATPTKSVNEFPELGKADLAAYGDLVAALA FT THDEGYNGHAFSSELSSPPLILELMINGVAVRALIDTAAGTNLMSNRLATK FT TESGAA" FT CDS 3216..5024 FT /product="Gypsy-3_MLP-I_2p" FT /translation="MREEDIPLTAVKTPWGLYEWVVMPMGLTNASATHQRR FT CEEALGDLVNKICVVYIDDIVVFSQTVEEHEEHLKLVMERLKAAKLFCSIK FT KTKLFRRRMKFLGHEISAEGIYPDEAKVEKVAAWKTPKSAKQLKEFLGTVQ FT WMKKFIDGLAQYAGHLTPLTSSKKASGTFTWKEKEQAVFDNIKRIITTLPA FT LKNMDFDSNDPVWLFSDASGHALGAALFQGADWETSSPIAYESRQMTPAER FT NYPVHEQELLAVINALNKWKMLLLGMKVNVMTDHHSLTHLLTQRNLSRRQA FT RWLETLSQFDLNFKYIKGLDNSVADALSRIEDVATVHVQSHLSPGLRQRIK FT EGYENDAFCKRLSTVLPLRQNCLWKDGMMFMDNRLVIPSTEGLQLELIHLA FT HEAVGHLGSLKTAERLREEFYWPRLAQDVDNFVKRCDSCQRNKARTTRLPG FT RLQSTDVPKRPMSDIALDFVGPFPKVQQYDMLLTCTCRLTGFTRLIPTCST FT DTAERTARRLYGSWLTIFGAPTSMIGDRDKIWTSRFWQELQCLLGVRVQLT FT TAYHPQADGRAERTNNTVGQILRHAVNGKHGKWLQALPTVEYAINSAVTLP FT PGCHR" XX SQ Sequence 5769 BP; 1543 A; 1248 C; 1475 G; 1503 T; 0 other; ctttttttat ctatttcaaa ataccgatcg aactcagaaa aaccgattca gaaaccgatt 60 tttttttctc acaaaccgag tttttcgaaa ccttgcttgc gccgccgaat tgaacttgaa 120 tcgctgagat ctgaattgaa ttgcaataca cttgaacaac ctgactgcgc ttgaaaccac 180 tgatcacaac accggattga attactgcat gttgattgaa ttgattaccg atacaccttc 240 ttttcatgcc aaacactaga aaaaatcacg acgtcctgga agagcgtgtt gacaaccctg 300 
accaaatact ttcgagtcgg aatcaaaaac cgggattgct tccttctcca tcgccgttgg 360 ctaatagttc aacgagtgtg actcagccgc cggccttgtt cacgcgcctg gctgatactc 420 ttacctaccc accgcttcga cagggagaac gtatactgag tgcttcaaaa tctgttgctg 480 agctttacaa attgaattct gtgccgttag atacactgtc accaaatacc tcggcgttat 540 taactgatat actgcaccga ccaactgggt cgacgagacg ccactctttc ttacctggag 600 ccttacctga ttcttctggt atctcgttgg acaccgttca tccgatggat cagacctcct 660 ctttctctgg tgaaggacaa ccgctgtctg acaaccatgc tcacaatgct gcaaataata 720 gtaccgattt cgagtctgtg ctgcgccaag tgcgtgagga aagtgcggac aatgctgctc 780 gaatggcggc tgataatgat gacctgcgtg ctgagctggc cgatatcaag gcaatgatca 840 aagctatggc tggccttcgt cctccaacca cgccaccatc aactcatgcg cctattgcta 900 catccacgcc cgctcagacc actgctactg cgataccacc gccttgtccc gattccactg 960 tacgacacca gcctgttgag gccacgcctc cttcaacggt tcaacatgat tcatctggga 1020 atcaaccgcc ctttgctgct tatcaatccc cgactggccc agctgggggg ttcaaccgag 1080 aacttccggc gaattatact atgtctcagc ctatgccacc ttattgggct tatccgatgg 1140 taccagcaga acgagaggtt cttgccagac acctcaaaga cagtgagatc cctctttata 1200 cgtgcgatta tggtgatgta acagggttta ggttgtggag atacagaatt gaagcgaaat 1260 ttaaagtgaa aggattgaat agtgatgaag agcgactgaa agttttaccg gcagcccttg 1320 caatccctca cgctgttcaa tggcacagaa ctcacgaggc agaattggaa ggtaaatcat 1380 gggtggaagc tatggagatg tttgaatcag gggtgttacc gtcagggtgg ctcagagacg 1440 ctaagcaagc acttcgcgaa ttaactcaaa aatctggtga agacatgcaa acttacttaa 1500 gtagggctca acgtctgcag gatactgttt ctgtcgccac ttgtgatgat aaggaactcg 1560 ctgagaggat tattgggggg actaacgctt tattccgcga gacggctgca aaggatgact 1620 taatcgagaa tgccattgat ctgcgcacag gcatgtggtc cttttccctt tttgaaaagc 1680 gagccttgga cacggcgcga tttctgcaag ccttcgaaat aactcacact cgcacggggt 1740 ctcgacaacc aaaccaaggt ggtagggcgg cacaatcaaa cgtagcgcca tcaactgggg 1800 ggtctactgt ctcacgacat cctctaccgc ctattgatca gcaggcaaga agtgaaagat 1860 gggcggcgtt tatgcgttcg actggaaggt gccctaggtg caaaactcag tgctccaaat 1920 ggctgggagg atgtgatgcg gcgccgaaca tgatgtacat acccttcccg ccagatttcc 1980 cacgcagacc gccttatcct 
gcgccaaatc cagctgctca cgcaccggtt caagcatcgt 2040 catcgacttc tcgaccaggc gcacccagga ggggacctgt tattccggtg gccagtgtac 2100 aagttgcaac gcctactaaa tcagtcaatg agtttccgga attggggaag gctgatctcg 2160 cagcttatgg ggatttggta gcggcattgg caactcatga tgaggggtac aatggacatg 2220 cattctcttc cgaactatct tccccaccat tgatcctgga actaatgatt aatggggttg 2280 cggtgcgcgc attaattgat acagccgcgg ggacgaactt aatgtcgaat cggttggcaa 2340 caaaaactga aagtggtgcg gcgtaaattg ttgaaaccca ctactgtacg tttggcaatc 2400 gataccaaca gtgcagatat acatctgact gagtttgcta tagccaccgt caaaagctcg 2460 gattcggtct ttggtgcgac gttttttaag ctcagtaatc tccatgacga actatacgat 2520 gtcattctgg gacaccgttt ctaaagaagt atgaattgga tgtgtcactt tcaagaagat 2580 gtgtggttca gacgaagaag gggaaggtgt ggtttgatgc agcagagaaa gagaagagac 2640 ggggaacaga aaaagagatg aaagaaaaaa agtttttaac acaagctagg gaagatcttg 2700 tgaatgcggt gtttgataat ctagattcta ttaacgaggc tgcggagtta tcggttcgtg 2760 aaatgcaaat gttaaaaagt tttgagtgcc tttttcccga cgatctacct agtgtggatt 2820 tgttggatga tgtggagttt ttcccacctg aattacaaca tccgagttca aagataagac 2880 acaaaattgt gctgacggat cgaaatgtta tcatcaatga aaaaccatat gggaacccgc 2940 caaagtatat ggatgcgtgg aaacagttga ttgattcaca tgtagccgcg ggaaggttga 3000 gacgatcaga tagtccatat gcctcgccat gtttgataat cccaaaagct gatcctaagg 3060 ctttgcctcg atgggtgtgt gattattgac gtttaaacaa attcactgta aaggaccgct 3120 ctccactgcc aaatgtggat gaatgtatta ggattgttgg gactgggaaa gtatattcca 3180 ttctggacca ggtcaacgct ttttttcaaa ccctaatgcg agaggaggat atacccttga 3240 cagcggtgaa gactccgtgg gggctatatg agtgggtggt aatgccaatg ggattgacca 3300 acgcgtctgc aacccatcag aggagatgtg aggaagcttt gggagattta gtcaataaga 3360 tttgtgtggt atacattgat gacatagtgg tgttttcaca gacagtagag gagcatgagg 3420 agcatcttaa gttggtaatg gagagattga aagctgctaa gctgttttgt tctataaaga 3480 agactaagtt atttcgtcga cgcatgaagt ttttagggca cgaaatcagt gctgaaggga 3540 tttaccctga tgaagccaag gttgagaaag tcgcagcatg gaagacgccg aagtcagcaa 3600 agcaattaaa agagtttctg ggcaccgtac aatggatgaa aaaatttatt gacgggttgg 3660 ctcagtatgc aggacatttg acacctctaa 
caagtagtaa gaaggcgagt ggtacattca 3720 cttggaaaga aaaagaacag gctgtgtttg ataacatcaa aagaattatt accacgctac 3780 cagccttgaa gaacatggat tttgattcta atgatccggt gtggctcttc agcgatgcga 3840 gtgggcatgc cttaggtgct gcgttgtttc aaggggctga ttgggagacg tcatcaccaa 3900 ttgcatatga gagcaggcaa atgacaccag cggaacgcaa ttatccggtg catgaacaag 3960 agttgctcgc agttataaac gcgttgaaca aatggaagat gcttctgttg gggatgaaag 4020 ttaatgtgat gaccgatcac cattcgttga cccatttgct aactcaaagg aatttaagta 4080 ggaggcaggc gcgatggttg gaaactctct ctcagtttga tctcaacttc aaatatatta 4140 agggtctgga taacagcgtt gcggatgccc tatcacgcat tgaagacgtt gctacggtgc 4200 acgttcaatc tcatctttcg cctggattac gtcaacgaat caaagagggt tatgaaaatg 4260 atgcattctg taagcgactg agcacggtgc tacctttacg acagaactgt ctctggaaag 4320 atggtatgat gttcatggac aaccgtcttg ttataccgtc gacagaggga ctgcaattgg 4380 aactcatcca cttggcccat gaagcagtcg gacacttggg gtcattgaag acggcggagc 4440 ggctgcgtga ggaattttac tggcctcgtc ttgcccaaga tgtggacaat tttgttaagc 4500 ggtgtgacag ctgtcagcgc aacaaggctc gcacaacacg gctgccggga cgtctccaaa 4560 gcactgacgt tcccaaacga ccgatgtcag atatcgcttt agacttcgtt ggaccgtttc 4620 cgaaagtgca acaatatgat atgctcctta cttgcacatg cagactgact ggattcacta 4680 gattgatccc cacgtgctcg actgatacgg cggaacgaac agcacgaaga ctttacggca 4740 gctggctgac catctttggt gccccgacgt ctatgatagg ggatcgcgac aagatatgga 4800 cctctaggtt ttggcaagaa ctgcaatgtc ttctaggagt cagggtgcag ctaacaaccg 4860 cttatcatcc gcaagctgac ggccgcgccg aacgcaccaa taacactgtg ggacaaatat 4920 tacgtcacgc ggtgaatggc aagcatggga agtggctgca ggcattgccg accgttgagt 4980 acgccataaa ctccgcggtt acactgccac cggggtgtca ccgatgaaat ttgtgcttgg 5040 atactcgcct gccttatttc cgatagaagg gtcgccagta ggaatgtgtg atgatgttaa 5100 gaagtggatt gagtgtagac agggagagtg ggcaacatgg cgagataagt tgtggggtgc 5160 tcgagtcaat caagctgttg cttataatgc gcgacgaggg gctgatatga cattggaagt 5220 gggagattgg gtgttaattg acgcaaagga tagacaacaa ttggtgaaag gcccagtggc 5280 gaagttgaga gcgcggtacg aaggacctta tgaggtgtta gaggttttga atgaaggacg 5340 agatgtgaga gtgcggttag acaagggcga caagacatat 
gatacctttc atcaatccaa 5400 gttgcgacga tattgctctg atgagctgga cgaggatgtt tgagggaaag gaggggttag 5460 tactccttgg cacagaagta cgtttttctc cttcgtatgc accgcccagt gggattctac 5520 catatgtata taagcaaaac ctcggccacg ctgtgagcac aactttggtt cggcttgtta 5580 tttcaggttc aatgcgacat atacggcgag tcccgacgac atggagtttc cgttttttct 5640 tctcttcttc cttttttttt cttttctcgt attttctttt gtgtttcttt tcaaaaaatt 5700 tttttttttc tttgattatc tgtttgttgt ttgatttaag gggggatttt tctttttagg 5760 tggggaggg 5769 // ID Copia-1_SPDB-LTR repbase; DNA; FNG; 510 BP. XX AC ACOE01000107; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Spizellomyces punctatus genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_SPDB_; KW Copia-1_SPDB-I; Copia-1_SPDB-LTR. XX OS Spizellomyces punctatus OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; OC Spizellomycetales; Spizellomycetaceae; Spizellomyces. XX RN [1] RP 1-510 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Spizellomyces punctatus genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; ACOE01000107; Positions 22488 22997. XX SQ Sequence 510 BP; 151 A; 87 C; 135 G; 137 T; 0 other; tgttgagatt gaggcttaaa ccacgctgta tgatctataa atcatgtgag atcatgctgg 60 gatgtcatta gacagttcac atctgatttg gctaagttac aacgtgggga attggactta 120 gagtaatcag aatgatcaca gacgcgtgtg tgaatgattt aaaggggtaa atcatcatgt 180 aagcgctcct agaaggtctg gaaggtttgt gatgatctag atcttcaggt aatgtttgcc 240 aaggcatggc tgaagctgct gagcattgct gagcatggct gagcatgcaa acaagtgggc 300 tgaagcatgt aaacagtaaa caaaggcaga gggaggctgg agaatgtact gggttatata 360 aaggggactt ttgaacgtct ctggattaat acaagcgtgt tatcccacaa attactgcac 420 ccagaagtct tactagcggc cacaggcatt actaggcctt tcagaagtga tttctacatg 480 ggaaatcagt gttccaagcg ctaagtttca 510 // ID Gypsy-15_RO-I repbase; DNA; FNG; 6719 BP. XX AC AACW02000285; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 
16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-15_RO_; KW Gypsy-15_RO-LTR; Gypsy-15_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-6719 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000285; Positions 12424 5706. XX CC 'CCAT' target site duplication CC LTRs are 95% similar to each other. XX FH Key Location/Qualifiers FT CDS 168..1382 FT /product="Gypsy-15_RO-I_2p" FT /translation="MASYPTGQIALANANNQTFSQGFRPRLIEFYGYEGED FT FRHFQEILDSYLAITNTLSDDRKLIVLKSQLRRAAKIYFEKEILKRIPNVT FT YDKAIELLKQHYITPELIQSYELEFNEMFQGEQEHPQIFLARLREAADLAN FT ITSEAVIESRFRVGLLKEIKQFCIQSSAHTFQEWINHADGWWNANRPRKIA FT MVDNPFIPRNVNNALIYHDDNRYTKHHGANNHNVELIDTEERHIHAVPIND FT LRNNFGNNYMAPYSEIITGPNQLVTMEVKGSTNNNHLYTQPTMNRHNNQSV FT HTNSQHDLVNLIQETIRSELNLHQHSSQPTRNYNRRPRYDNYNNFHNNEYN FT RNGNNSYNYRSDNRYRNNYDHSYGRYEYDNHNRNPNQPTNNHNNQPQQGIS FT QPRYQNQQSKN" FT CDS 1685..4426 FT /product="Gypsy-15_RO-I_1p" FT /translation="MPKQPIQATGSMEVDSDLPIEIKPKPIKRKTKAPPIR FT YDIVSDVMNQKADISVGDLMVAAPTLRRKLASACRPKRIPVTETSKETMAV FT IEEDDIHTTAVYSKINIGDKTVKVLVDCGAAKTCMSKSLADALGLEIDAAS FT ESVFTLGNGSKQPALGVIYDVPIEVQEDLIIPCTVEVLPACPTHLIIGNNW FT LNRAKAKIDFNSSTLKVSYKNKKAEIEIFFLRKNTPLPKISSYIQSYQHPI FT SSTNSKETKKVHFENDKADSEDDEVEEESEESTDEESTEEEEVEDEEHSLL FT LLENDYKKDIQIKNLKNQCILEASSDGLTIPANSSKTIIIKKPKDELRGLM FT YSFEITSTKILQKTGILDPCHNFITNKKSLEIRIHNRSNDNIILDPGEEIG FT ILEKLNLKEDTIIQAYDVEKDIHLCTMEMTEAMDKETSDEDKETLEAEKYS FT KLEIGEMDRKIARMLKRLLKKYEEIFDWDNNSIGYTKLLRHKITVQEGTLP FT ISHRPYRISPLEAEHLQKELEKYTKLGVISPSNSPWAAPVILVKKKNGEYR FT MVIDYRKLNAVTKKDAYPLPRIDDLLDTLGKAKVFSALDMRAGFHQVPMEE FT DSKELTAFTTKFGIYHYNTLPMGLVNSPATFQRLIDLCFRPLINKCLVAYI FT DDLNVYSFNQQDHLHHLEQVFQCIQIANLKLNPEKCFFFKDHLKFLGYIVT FT 
KDGVQTDPEKIKKIIEYPQPKTLKQVRGFLGLASYYRRFIKNFAAIARPLH FT DQTKTTKQVPWTEKTTDSFETLKKLLTTAPVLTRPDFNKPFILVTDASKAG FT LGCILTQLDENDHEHPVIFASRGLRSSEINYAPTKLECLAVIWAVKMFRPY FT LLGKRFTIITDHSALHGLLKSPNPTGIIARWITTLAEYDFEIKYRPGRVNE FT SADFLSRLGY" FT CDS 4608..5951 FT /product="Gypsy-15_RO-I_3p" FT /translation="MEQYKIDQLTQYLQTMKIAEDATSRMKKYLKKEANKF FT TIYRDILYRYNTDNGIIRKVLNKKEAEEIMYAYHQHPLGGHLAYNNTLHKI FT SSRYYWDNMSHDIMEYVKKCYRCQMHGKKMLKEELYPVAVSAKPFDRVALD FT VKHVQTSRSGHRYIIAAIDYLTKYVEARPLRFQNASEIALFVYEDIICRHG FT CPTIMVSDNGKPFLSELIRQVCRNYNIIHKTTTPYNPQSNGLIERFNRTLG FT QILQRRSYEEKLDWDAYLPATLFAYRSIKQATTKHSPFFLVYGYEPRTPFD FT LDHHLYEKNSPKFEAKLWHRTTHQIHNLNRIREQATQAIKQTQIAQKKAIE FT KKLLDQSKELKPPFKIGELVLVFKDYMSTSWSGKLQDKWEGPFIIQHILGK FT GTYHIKSADIRDTKLRRVHGNRLKAYLLPKTQWIMENERILVHQMDTETQE FT LLH" XX SQ Sequence 6719 BP; 2646 A; 1290 C; 1131 G; 1652 T; 0 other; tttggtggtc actacgaggg aacaaatcaa caaatcaaca aatcacaaac attattaaca 60 agcaatcgga aaaatctatc aaattaaaac aaatcgtcaa gaaaactatc aaaactctac 120 ttcaagaaat acaaaacata ccaaaaacaa atatatcaaa tttcaaaatg gcctcatatc 180 ctacaggaca aatcgcgttg gcaaatgcta acaaccaaac tttttctcaa ggttttcgcc 240 caaggttaat tgaattttat ggctatgaag gagaagactt tcgccatttt caagaaatac 300 ttgattctta ccttgcaatc actaatactc tcagtgatga tagaaaactt atcgtgctca 360 aatcccaatt acgacgcgct gctaaaattt actttgaaaa agaaatactc aagagaatcc 420 ctaatgtcac atatgacaaa gccattgaat tgctcaagca acactacatc acacctgaac 480 taattcaaag ttatgaatta gaatttaacg aaatgttcca aggtgaacaa gaacatcctc 540 aaatattttt ggcaagacta cgagaagcag ctgaccttgc gaacattaca agtgaagcag 600 ttattgaaag tcgttttcgt gttggtctac ttaaagaaat taaacaattc tgcatacaaa 660 gtagtgccca tacattccaa gaatggatca atcatgctga tggatggtgg aatgccaatc 720 gtccacgtaa aatagccatg gtggataacc cttttattcc aagaaacgtc aacaatgccc 780 tcatatatca tgatgataac agatatacaa aacatcatgg tgcaaacaat cataacgtag 840 agttaatcga cacagaagaa agacatattc atgccgtacc tatcaatgac ttacgaaata 900 attttggaaa taactatatg gctccctatt ctgaaataat cacaggtcca aatcaactag 960 tcactatgga agtgaaaggc 
agtacaaata ataatcattt gtatacacaa ccaacgatga 1020 atagacataa caatcaatca gtgcatacca actcacaaca tgacttggtt aaccttattc 1080 aagaaactat acgcagtgaa ttgaatctac atcaacattc atctcaacct accagaaatt 1140 ataatcgacg tcctagatat gacaactata acaattttca caacaatgaa tataaccgaa 1200 atggaaataa cagttataac tacagaagcg ataacagata caggaataat tatgatcata 1260 gttatggaag atatgaatat gacaatcata atcgtaatcc aaatcagcca actaataatc 1320 ataataatca accccagcaa ggcatctctc agccccgcta tcaaaatcag caatcaaaaa 1380 actaaatggg tcggttgctt ttgataatca aaagaatggt caatcgaaca cccaacataa 1440 caacaaaccc attaaacaac aaaaccaaca acataatctc aatgtcatac ttacccaaaa 1500 tgaatacgat aacacatatc aagcccaaga tctttatgct gctataagac ctgaacaccc 1560 gcctgaagtc ctaagctcta aaccttattc taaaccgaca accagagaaa aatggaaaca 1620 accaagtagt gcttcagtaa ctaggcgtgt aaacaaacgt aatcaagtca aggaaacaag 1680 caacatgccc aaacaaccaa tacaggctac aggaagtatg gaagtagact ctgatctccc 1740 tattgaaatc aagccaaagc caataaaaag aaaaacgaag gccccaccta tcagatatga 1800 tattgtatca gatgttatga atcaaaaagc agacatatct gtaggagatt tgatggtagc 1860 tgctcccaca cttagaagga aattagcaag tgcatgtaga cccaaaagaa tacctgtaac 1920 agaaacatca aaggaaacta tggcggtaat agaagaggat gacattcata ccactgccgt 1980 atattcaaaa atcaatatcg gagacaaaac ggtgaaagta cttgttgact gtggtgctgc 2040 caagacgtgt atgtcaaaat ccctggcaga tgctttagga ttagaaatag atgctgcctc 2100 agaaagcgta tttaccttgg gaaacggttc taaacagcct gctctaggag taatatatga 2160 tgtacctatc gaagtacaag aggatttaat tattccgtgt acagttgaag tattgccagc 2220 ttgcccaact catttaatta tcggaaacaa ctggttaaac agagcaaagg ccaaaattga 2280 cttcaatagt tcaaccctga aagtgtccta taaaaacaaa aaggcagaaa tagaaatatt 2340 ctttttgcgt aagaatacac ctctaccaaa aatttccagt tatatccaaa gttaccagca 2400 tccaatcagt tcaaccaatt cgaaagaaac gaaaaaagtg cactttgaaa acgataaagc 2460 agattcagaa gatgatgaag ttgaagagga atcagaagaa agcactgatg aagaatctac 2520 tgaagaagaa gaagttgagg atgaagagca ttcactatta ctgcttgaaa atgactataa 2580 gaaagacata cagataaaaa atcttaaaaa tcaatgcatt ctggaagcct catcagatgg 2640 attaactatc ccggcaaact catcaaagac 
tatcataatc aagaaaccta aagatgaact 2700 taggggatta atgtatagtt ttgaaataac cagcaccaaa attctacaaa aaaccggtat 2760 ccttgaccct tgtcacaact ttattaccaa caaaaagagt ttggaaattc gaatacataa 2820 tcgctctaat gataatataa ttttggatcc aggcgaagaa attggaatct tggaaaaact 2880 caaccttaaa gaagatacga ttatacaagc atatgatgtt gaaaaggata ttcatctctg 2940 tactatggaa atgaccgaag caatggacaa agaaacctca gatgaagata aagaaacttt 3000 agaagcagaa aagtacagta agcttgaaat tggggaaatg gatagaaaaa ttgcaagaat 3060 gttaaaaagg ctcttaaaga aatacgaaga aattttcgat tgggataata atagtatcgg 3120 ctatacaaag ttgctacgac acaaaataac tgtacaagaa ggcacattgc ctatcagtca 3180 ccgaccgtat cggattagcc ctttagaagc agaacacttg caaaaagaac tagaaaaata 3240 cacaaaacta ggtgtaatat ccccttcaaa tagtccgtgg gctgcgccag tcatcttagt 3300 taaaaagaaa aatggagaat acaggatggt aatagactac cgaaagctta acgcagttac 3360 aaaaaaagat gcatacccat taccacgaat tgatgacttg ctagacacgt taggaaaagc 3420 taaagttttc tcagcattag atatgcgagc tggttttcat caagtaccga tggaggaaga 3480 tagtaaagaa ttaactgcct tcacaacaaa gtttggaata taccattaca atacattacc 3540 aatgggactt gtaaactctc ctgccacatt ccaacgactt attgatttat gttttcggcc 3600 actcattaat aagtgcctgg tagcatatat agacgactta aatgtatatt catttaacca 3660 acaagaccat ttacaccact tggaacaagt ttttcaatgt atacaaattg caaatctcaa 3720 actaaatcct gaaaaatgtt ttttcttcaa agatcatctc aaattccttg gctatatcgt 3780 aacaaaagat ggtgtacaaa cagacccaga aaaaatcaag aaaatcatcg aatatccaca 3840 accaaagaca ctcaaacaag tcagaggatt tttaggcttg gcttcttatt atagacgatt 3900 tataaagaat tttgccgcta tagcaagacc tttacatgat caaacaaaaa ctactaaaca 3960 agttccatgg acagaaaaga ccactgattc ctttgaaacc ttgaagaaac tgcttacaac 4020 cgctcctgtc ttgactagac ccgatttcaa taaacccttt attttggtca cagacgcttc 4080 aaaagcagga cttgggtgta tccttacaca attagatgaa aatgatcacg agcatccagt 4140 catattcgcg agtaggggac taaggtccag tgaaataaat tacgcaccga caaagctaga 4200 atgtttggca gtcatatggg cagtaaaaat gtttcgtcct tatctgttgg gaaaaagatt 4260 cacgattatc acagatcatt cagcattaca cggattactc aaatcaccaa atcctaccgg 4320 aattatagca cgatggatca caacacttgc tgaatacgat 
ttcgaaataa aatatcgacc 4380 tggaagagta aacgagagcg cagacttttt atcaaggctc ggatattaaa gatacacaac 4440 aatgaagaca tataaataac aatatcaatc tatatatata ttttaccaaa actacatgga 4500 ggaagggagg ggtagttgaa acaaaatcca taaaacctta taaaaacagt acaaaagaaa 4560 aaaataaaaa tcccgacata acataccaac ttcaaactat aagcaaaatg gaacaataca 4620 aaatcgatca gttgacacaa tatctacaaa caatgaaaat agctgaagac gcaacttcta 4680 gaatgaagaa atacctcaag aaagaagcaa ataagtttac aatttatcga gacatcttgt 4740 atagatacaa tacagacaac gggattatca gaaaagtatt aaacaaaaag gaagcagaag 4800 aaatcatgta cgcataccac caacatccat taggaggtca tctggcttac aataatactc 4860 tacacaaaat atcatctcgt tactactggg ataatatgtc acacgatatc atggaatatg 4920 tcaagaaatg ttacagatgc caaatgcatg ggaagaaaat gttgaaagaa gaattatatc 4980 ccgtagcagt ttcagcaaaa ccatttgatc gagttgcctt agatgtgaaa catgtacaaa 5040 catcgagatc aggacataga tatattattg cagctataga ttatcttaca aaatacgtag 5100 aggcgagacc cttacgattt caaaatgcat cagaaatagc attgtttgtt tatgaagaca 5160 tcatttgcag acatggatgc ccaacaatca tggtgtcaga taatggtaaa ccctttttaa 5220 gtgaactcat ccgacaggtg tgtagaaatt acaacataat acacaaaact acaacaccct 5280 acaacccaca aagcaatggc ctgatagaac ggttcaaccg aacacttgga caaatactac 5340 agaggcgttc ctatgaagaa aagttagact gggatgctta tttacctgcg acattgtttg 5400 cctatagatc aatcaagcaa gctacaacca agcattcccc ttttttctta gtttatggct 5460 acgaacccag aacacctttt gacctagatc atcacttgta tgagaaaaat tctcctaaat 5520 ttgaagcaaa gctgtggcat aggacaacac atcaaataca taacttgaat cgaatccgtg 5580 aacaagctac tcaagccatc aagcaaactc aaattgccca aaagaaagct atagagaaaa 5640 aattattgga tcagagtaag gagttgaaac ctccattcaa gataggtgaa ttagtattag 5700 tattcaaaga ctatatgtcc acctcctggt caggaaaact gcaagataaa tgggaaggtc 5760 cattcatcat acaacatatt ttgggaaaag gcacgtatca catcaagagt gcagacatcc 5820 gggatactaa attaagaaga gtacatggaa atcgattgaa agcatatttg cttcccaaga 5880 cacaatggat catggagaat gagcgaattc ttgtacacca aatggatact gaaacgcaag 5940 aacttcttca ttaaagtcac acacaaaagc atatatatac ctcaaagaaa atcaaaaaaa 6000 aaaaaattta tatcaaatat tttaaaacaa tgagcaacat gcaagaagta 
caagaaaact 6060 ctgctgccaa tactcaagga cccgtctact tggacactgt tgagaactat gtcaagaagc 6120 aatacattga acaaggaatc gaggctgcct gccaagcttt agacaatttc ttgatctttt 6180 acgagaacca atacaaaaat tataactacc aagtcgatgg acaaatagta agcaagtacg 6240 agtttatcac aaatactatg gctcgaatcg aatctgaaat catggaacag caagaagcaa 6300 gcatgaatga tgatgaagaa aatgaacaag agaacaatga agttatcttg gatgagcctt 6360 tgattcaaaa caaccagagc attaccgacg attcatggtt ctatgaatta tgtatgtccg 6420 agtattatga acggtatcgt tcattattgg aacagcaagc aatgattaat cgtcaaaaag 6480 aaattgtggc taaaagcttt gcacaaaatg taactgaatg ggcacacttg aaattcaaaa 6540 gacctggaaa caaccagcgt gacatttatt tatatgaaac agcaaagtat aatcaactca 6600 agactattct tccaaattgc accgtctacc gtactcttgt caccaaaatg aaaaatcacg 6660 atcaatgggg tcagttgatg tctcaaaccg aggtcggttc ttctcttggt ggggggatc 6719 // ID Gypsy-3_PCR-I repbase; DNA; FNG; 12036 BP. XX AC AADS01000631; XX DT 30-JAN-2011 (Rel. 16.02, Created) DT 30-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Phanerochaete chrysosporium genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_PCR_; KW Gypsy-3_PCR-LTR; Gypsy-3_PCR-I. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-12036 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Phanerochaete chrysosporium RT genome."; RL Direct Submission to RU (30-JAN-2011). XX DR Genome; AADS01000631; Positions 15894 3859. XX CC Positions [10948-11427] - Integrase core CC 'TCGC' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 37..969 FT /product="Gypsy-3_PCR-I_1p" FT /translation="MTTTPPESTTTSAGSVSTISSISTPTNSPLPPQVQTT FT IPATQTTPPKMASATVADLPLPGTKGAPKKFTGKHSEVEPFLHFYQRLCTK FT CNVTQDKDKIENLIHYCSRTVRETLQGLKSYEARNWRAFQDDFKNYYEAER FT DKKRFRISDLGKYITQSEKGKIKDLAAWMKYRRGYVRIAGWLLQHKKITST FT EYETNFWLGIPQRFRDILEARLMAQNPNHDLEKPFGTWKNPLARMMSARLL FT RTCLGEIALIPSASVRNIAMTPLPRSLKLQTPTQVIATLNHRMRSVITRDT FT AASLEAMQARKRQTKRRRM" FT CDS 1122..7454 FT /product="Gypsy-3_PCR-I_3p" FT /translation="MSVNDSRYAAIYFKAYTLNPDIRHIIPTPREQQMQAR FT RARPNERDVPPHMQGTSSAPGGYFQRTCFGCGESGHSINNCEAVQEYVKKG FT VITRNQRGRYIFTSGAPIMKATMDEPLVSAIKRAMSPQSNYISVFGIGSGT FT GTKQVYSAQHPRTAVHMSDADDSSDEEMNEAYPVTRSEKQTTANRKEKFDG FT VWVPPRRQMEKPKNKENVPVPQGIPRPPQAINPTPVDVHKQAFNPQDPRAY FT EEDPAVAKKIRKDKGRERGNEDITMGEPEPRKDKDEKRMPRKSDLQSAVDS FT KLVLDKVLRTPVTMAVGELLAVSKEISQQVQEVIKVKSVKSQEVKPNAHLA FT EYANSLPPEMPVAAAAFMPRTRGQLIKLRMECDGVPILAIVDTGSQLNIVH FT HKVWLNCLARPMDITQRITMGDANGGEGTLAGFVQHVPLSCGAVLTYANMY FT VGRKAPFDLLLGRPWQRGNFVSIDERLDGTYLQFKDKALRVNYEILVTPDD FT IDPDIAEYISRSTAQARAHEQGQYYETQERHTNYATASPLSVYAVTEAKTQ FT GRGLTRQYATANIEELPEEIKNSDATIVAKEDRDATQSQQDETGTKREKKH FT TQQADAPQKGFPKGEGSPEGHDEDTEVEEVYYLAKEGDEHDEEGAEEEIEE FT PLTMPGGLILLNEDAHFPEDGGPLGRLLMRAGQFCGRVISKLWDNLPMGGG FT QPDSTPNEVPEASSTASDSSLPSDEEETDCGDSLASQYSDEERHLPEELTE FT SLRHLANLIQQYDHDLNERMKLTKERFQKSQLLNLAPDLTQRKADRLSRWT FT DKGSRTEMMEDEEQEDDLPDLVPLDEPGEYEAPTSTADRRLESYAENNSVT FT DGEERENVPDRQPLAMPGRGESSEDADTRLIEPLKARPCYQSEGDLAEGER FT SEKKQATGDGLGDVPEPREEVFAIVLNENDETRVLQELTNMSANRMATRRT FT RHLQDIGRMLPRPPGTRTEREVAEARRDVETLLDRLGHMTASRYRLAQYLA FT EQRLLESLDRQLLQIAEDERNGYYHLSQPWRTPESDRQSCEREWQIRNAML FT GWATREQVLNVGRTYLARLIVDLDMRGTAATPLPGFVSTLAARYRECAARA FT QDEAPMVNATRQENGQETGNSPEAMEIEDLLDLSSLAGAVNDISDENLEPG FT DIPSEESLRASRATPHLLAQDGEHDWATQATWTEDDTGQQEALQSLTQSIQ FT ELEELIELTAASATLRATPVSRNDQGQAPAIHNGLQTPQDDEERDQPTILL FT IEIQDLPIEELRDEKENDSAQQAFKEALPKQRETRPPCPPVLADMATIPFA FT CVDLHHFPRLRTLVELACRLKIEGNVRLMDPDAEVSRGTKELHEEMDQHDR FT 
VYDHITLKFTHQLPRGVEGMIPEVDGSVLETYMTCHDLDSVLGQVDKPQAT FT SIGHSRWPLVFWPLEKPERIDAPDGEDVDMLQPIDYQRDNPTHITYYTART FT FYEIHPNGLDCAPLRFEGHAIIEAYNGCPNPRFLERNDVWLLTDTPLPRVL FT LTLSQPGYLPEAIRESLDRHNARLFYREVMGFEGGHRLDGLESWPTACQTF FT EEAGKWTFRLGLAGQRPESWMMHVPGPQERPAGPEEEAGQREGRRIRTTPV FT LELSAQADQTGATAWKIYRVRTEHVRDTACGILDSSRFRVLESRCFLGVCQ FT SWVPRHEEEEDFWTNDYRSWEPATWWRINGDARDGHASQASGLYGADADNL FT PHHDDYHRVHEIPPAGSAAEAAHHDHELSPPPYSPTFCASVGRTSQQHNRA FT DDAAADAAEPDSDMDDTQSASTRSTMGSDDEIRTPGHAEGGEYQQPRIVRP FT HLPNLVHPARSESWSEMSEAEEGEVREDWRQKDERRDREEDWEMTEGSVSD FT ASSTFSRAASEETRQTTPQHHTPSPVQHLYENAGPHIRPPHISFMYQYSLA FT RDEHGTIHMRYVRLRTYGEIIPPHLDPRRRTERPATTTTKIVAMTATRCHL FT LGRAQAVLHETQAYDLPTYLAEKWTLEEPTTRDLECLLPYTEEFGTPYLYA FT CEQLAIRQCRHRWAGLLDIPHAADYVRECDAILAYGLGEPDEDQVRADRLA FT GYIGTPSYITPVTLYHIGNHTSTPLPLDDYGEAH" FT CDS 7261..12003 FT /product="Gypsy-3_PCR-I_2p" FT /translation="MPPTTFANATRYSLTVWANRTKIKYEPTASQAISAPP FT ATSPRSPSIISEITRARPSPLTTMARRIEEDTRMKAHHGNNYGTRTTSTTD FT FGANAMRYGYVFLYFSFVLYELWTDTDIGLRPDILGLGFLLCTSAGTGQLG FT RSYVARTLASLCTRNAAALPPAPECESTMTSVRARGGNKNISPGTLFLFVC FT TCISRSATLIWTFDPDELFLVYKTLEKFVYFLCSIILTYGLEKWAYPLYIK FT LQKNTEPPKPAREPDIRDFATRRDFGAQATPGSAKRLGLASVLNIQQTAPT FT RIEPLHSASAREQGARHKSAPDLGQPSGMPQGLLRGSAQENPEARAGFEPA FT TQTFGARSAADKLPAAHPHIKSEKSQEYSSNTRTQTQGYYHTALATQAPED FT PRRPKEMLGNEEDIKCISDPKPLGGSRDVVNKEEWKGLYVTVLEEEDAESP FT NAPLLYTVAMEDFLATLDDDDEIICMDGEELFAACKPMHLPEVFAAYKRVD FT RKVKPVPAVFPEDARVERHFPEEPLDSLPELPTHPPQFTPNGGRLTAERMK FT EMKINEDGFLWPEEEKLFKHILQTHQDHFVWEDHERGSFREDYFTPYIIPV FT VPHIPWAFPNIPIPQGILDQVVQLLREKMEAGVYESSQSSYRSRWFCVRKK FT NGKLRIVHDLQPLNKVTIRDAGLPPNLDSFVEPFAGHQCYTVFDLYWGFDA FT RKVHPRSRDLTAFMTPLGLLRITSLPTGFTNSPAEFQACMSFILQHEIPHK FT ANIFIDDLPIKGPASQYKDQNGRPEVLPANPGIRRFIWEHANDVHRIIHRV FT GHAGGTFSPAKVQLSRPEVLIVGQKCTPEGRLPDTQKVEKVLSWPPLKTVK FT DIRAFLGLCGTVRIWIENYSVKARPLTELIRRGNEFIWDARREEAFCTLKK FT AITSAPALRQIDHKSDRPVILAVDSSYIAVGFILSQIDEEGRRRPARYGSI FT PMNEREARYSQPKLELYGLFRALRAWRIHLIGVKKLIVEVDAKYIKGMLND FT 
PDLQPNAAINRWIQGILLFDFELVHVPAAKHRGPDALSRKERAEGESMEEE FT DDSWLDDIALFAHATATHQLSSWQQGEWDLYHVQTEPEDVKQLRIYASVLP FT KPEQTLVDIYDFLRTLQTPSFDSIQEKNRFIKKSLKYFVKGPAMYRRNPHG FT IPTKCIFRKEKRKAILEAAHEQLGHRGELATFQTVKQRFYWPGLWNDVRHH FT VRSCHQCQIRALRKAEVPIMVSTPATLFVKIYLDIMHMPKAQAYVCIIAGK FT DDLSGVSEGRPLRNKKAEAVSKFFWEQILCRYGAVGWVVTDNGPEFMGAYS FT LLMDRYHLPHIKISAYNSKANGVVERGHFIIREAILKSCEGRIKEWPDYVS FT HAFFADRVTVRQQTGYSPYYLLHGTHPVLPFDLVEASFMVDGFTRNMETSD FT LLALRIRQLQKRPEDIAKAARTMKALRLKSKTQFEERYRARMINESLAPGT FT LVLIRNTAIEKSLDKKSKSRYFGPYEIVRRTAGGSYIIKELDGAHWRTEIA FT AFRLIPYVARGDKKELKKLAADLSKIDPVPSLEELERRADEEERTDEEEEE FT DSSSEDLEEDSESN" XX SQ Sequence 12036 BP; 3390 A; 3413 C; 3069 G; 2164 T; 0 other; tctggagccc accgcgaggg gggagtttca gtgtgcatga caactacacc tcctgagtct 60 actacgacat cagcaggatc agtatcaaca ataagcagca tatcgacacc gacgaattca 120 ccgttgccac ctcaagttca gacaaccatt ccggctacac agaccacgcc cccaaagatg 180 gcttcagcta ctgtagcgga tttgcccctt cccggcacga agggggcgcc gaagaagttc 240 acagggaaac actccgaggt ggaacccttc ctgcacttct atcagcggtt atgcaccaaa 300 tgcaatgtta cgcaggataa ggacaagatc gaaaacctta tccactactg ttcgagaaca 360 gtacgcgaaa ccctacaagg cctaaagagt tacgaagccc gtaactggcg agcctttcaa 420 gacgatttca agaattacta cgaagccgaa cgagacaaga agcgattccg aatctctgac 480 ttgggcaagt acattaccca gtcagagaaa ggaaagatca aggatttggc cgcctggatg 540 aagtaccgca gaggatacgt ccgcatcgca ggctggctac tgcaacacaa gaagattaca 600 agcaccgaat acgaaaccaa cttctggctt ggaatcccgc agaggttccg agacatattg 660 gaggctaggc tcatggccca aaaccccaat catgacctgg aaaaaccctt tggcacctgg 720 aaaaaccctt tggcgaggat gatgtctgca aggctgctca gaacttgctt aggcgagatc 780 gctttgatac cgagcgcgtc agttcggaac atagcgatga ctcctcttcc tcggagtctg 840 aaacttcaga ctccgacaca agtgatagcg actctgaatc atcggatgag gagcgtcatc 900 acaagagaca ccgccgcaag tctggaagcc atgcaagcaa gaaaaaggca aacaaaaaga 960 agaaggatgt aactgaacac caaagctctg gaattacttc agagtcggat tcagatgcac 1020 ctcctaaatt gtttcgccgc cagaaggaaa gaatcgtagc ggaaaagaaa tcctctactg 1080 aggataaaca gcttgaagat cttgttgaac agctttcgcg 
catgtccgtg aatgactcac 1140 gttatgcggc gatctacttc aaggcgtaca ccttgaatcc cgacatacga cacatcattc 1200 caacaccccg cgaacaacaa atgcaagctc gacgagccag accaaatgaa cgcgatgttc 1260 caccccatat gcaaggcact agctcagcac cgggaggata cttccaaagg acatgctttg 1320 gctgtggtga atcaggccac agcataaata attgtgaagc cgttcaagaa tacgtgaaga 1380 agggtgttat tacccgcaac cagcgcgggc gatacatctt cacgtccggt gctccaataa 1440 tgaaagcaac catggatgaa cctctggtat ctgctatcaa gcgagcaatg tctccgcaaa 1500 gcaactacat ctcggttttt gggatcggtt ctgggacagg cactaaacaa gtctactcgg 1560 ctcagcaccc aagaaccgca gtacacatgt ctgatgccga tgactccagt gacgaggaaa 1620 tgaacgaagc atatccagtc acccgatcag agaaacaaac cactgcgaat cgcaaggaga 1680 aattcgatgg agtctgggta cctcccagac gccaaatgga gaagccgaag aacaaggaaa 1740 acgtacctgt acctcaagga attccgcgac ctccgcaagc cataaaccca actccggtgg 1800 atgttcataa gcaggctttt aacccgcaag acccccgagc atacgaagaa gatcctgccg 1860 ttgccaagaa aatcagaaag gataaaggac gggagagagg aaatgaggac attaccatgg 1920 gagaacccga accccgcaag gacaaggacg agaaacgcat gcctcgcaaa tcggacctcc 1980 aaagtgccgt agactcaaaa cttgttctgg acaaggttct gagaactccg gttacaatgg 2040 cagtcggaga actgctggct gtatcaaaag aaattagcca gcaggttcaa gaggtcatca 2100 aggttaaatc agtcaaatcg caagaagtca agccaaatgc gcaccttgca gaatacgcaa 2160 attccctgcc tccagaaatg cctgtagctg ctgctgcatt catgcctcgg actcgtgggc 2220 aattaatcaa gctgcgtatg gaatgcgatg gagttccgat actcgccatt gtggatactg 2280 gttcgcaatt gaatattgta catcataagg tctggctaaa ttgcctggcc cgtccgatgg 2340 acattactca gcgaatcacc atgggcgatg caaacggagg cgaaggtact ttagctggat 2400 ttgtacagca tgtgccactg tcatgtggcg ctgtacttac ctacgctaat atgtacgttg 2460 gccgtaaggc tccctttgat ttattattgg gaagaccttg gcagcgagga aatttcgtct 2520 caatagacga aaggctggac ggaacatatc tgcaatttaa ggacaaagca ttgagggtca 2580 actatgaaat tttagtcacc cccgatgaca ttgatcctga cattgcagaa tatatttcgc 2640 gatctacagc acaagcacga gcccatgaac aagggcagta ctacgaaact caagagcgcc 2700 acacgaatta cgccactgcc agcccgttat cagtatatgc cgtaaccgag gctaagaccc 2760 aaggaagagg acttactcgt caatatgcca ctgcaaacat tgaagaactc 
cctgaagaaa 2820 taaagaatag tgacgccacc atcgtcgcga aggaagaccg cgacgccacg caaagtcagc 2880 aagacgaaac cggcaccaaa cgcgagaaga aacataccca acaggcggac gcgccacaaa 2940 agggattccc taaaggcgaa ggaagcccag aagggcacga tgaagatacc gaggttgaag 3000 aagtgtacta cctggcgaag gaaggagatg agcacgatga agaaggagca gaagaagaaa 3060 tcgaggaacc gctcaccatg ccaggaggct tgatcctcct caacgaagat gcgcacttcc 3120 cagaagacgg aggaccgttg ggaaggctgc tcatgcgcgc cggtcaattc tgcggccgcg 3180 tgatctccaa actgtgggac aacttaccca tgggtggggg acagcctgat tcgaccccca 3240 atgaggtgcc cgaggcttca agcacggcct cggactcaag cctgccgtct gatgaggaag 3300 aaaccgactg tggagactcg ttagcaagcc agtacagcga cgaagaacgc cacctacccg 3360 aggagctcac tgagagccta cgccacctgg cgaacctcat tcagcaatac gaccacgacc 3420 tgaatgaacg aatgaagctc accaaagaaa ggttccagaa gtctcagctt ctgaacctcg 3480 cccccgatct gacgcaacgc aaagccgacc gcctgtcgcg ctggacagac aaaggcagcc 3540 gcacagagat gatggaggac gaggagcaag aagacgactt accagatctc gttcccctgg 3600 acgagccagg agaatacgag gccccgacga gcacggccga tcgaaggctg gagtcatacg 3660 ctgagaacaa ctcggtcacg gacggagaag agcgcgagaa cgtacctgac cgccagcccc 3720 tggcaatgcc aggccgaggt gagtcttcag aggacgcaga cacccgtctg atcgagcccc 3780 tgaaggctag accctgctac cagtcagaag gcgatctcgc cgagggagaa aggagtgaga 3840 agaagcaagc aacaggagac ggattaggag acgtaccgga gccacgcgaa gaagtcttcg 3900 ccatcgttct gaacgagaac gatgagaccc gagtgctgca agagctcacg aatatgtcag 3960 caaaccgcat ggccacaaga cgaacgcgcc acttacaaga tatcggacga atgctgccga 4020 gaccgccagg aacgcgcaca gaacgggagg tagcagaggc ccgacgagat gtcgagacct 4080 tgctagatcg gctgggccac atgacggcct ccaggtaccg attagcacag tacctggccg 4140 agcaacgcct gctcgagagc ctcgaccgtc agctgttgca gatagcggag gacgaaagga 4200 atggatatta ccatttgagt cagccgtggc gtacccctga aagcgatcgg cagtcctgtg 4260 agcgtgagtg gcagatcagg aacgcgatgc tcggctgggc aacgcgcgag caagtcctga 4320 atgtcggaag aacgtaccta gcgcgactta ttgtcgatct ggacatgcgc ggcaccgcag 4380 cgacgccact acctggcttt gtgagcactc tggccgctcg gtaccgcgaa tgcgctgcaa 4440 gagcgcaaga cgaagcccct atggtcaacg ccacacgaca ggagaacggt caggagaccg 
4500 gaaactcacc ggaggccatg gagatcgaag atttgctcga tctatccagc ctggcgggtg 4560 cagttaatga catcagcgac gaaaacctgg aacccggcga catacctagc gaagaaagcc 4620 tccgggcaag ccgcgccacg ccccatctgc tggcgcagga cggcgagcac gattgggcca 4680 ctcaagcaac ctggacagaa gacgatacgg gtcagcagga ggctctgcaa agcctcaccc 4740 agtcgattca agagctggaa gaattaatcg aactcaccgc ggcaagcgcc actctgcgtg 4800 cgaccccggt atcgcgcaat gatcagggac aagcccctgc aatacacaac ggcctgcaaa 4860 cgcctcagga tgacgaagaa agagatcagc cgaccatact gctgatcgaa attcaagact 4920 taccgatcga agagctgcgc gacgagaagg aaaatgactc ggcccagcaa gcatttaagg 4980 aggccctgcc gaagcagagg gagactcgtc ccccctgccc tcctgtgctt gcggacatgg 5040 ccaccatccc cttcgcctgc gtcgacctgc accatttccc caggctgcgc acgcttgtag 5100 agctcgcatg ccgcctcaag attgagggaa acgtccgcct catggaccct gacgccgagg 5160 tctcaagggg aacgaaggag ctgcacgaag aaatggatca gcatgaccga gtgtatgatc 5220 acattacctt aaagttcact caccagttac ctaggggagt cgaggggatg atccccgaag 5280 tcgatggcag cgtcctcgag acgtacatga cctgccatga cctcgactcg gtgctggggc 5340 aagtcgacaa accccaggcg acctccatcg gccacagccg atggcccctg gtgttctggc 5400 cactcgagaa acccgagcgg atcgacgcgc ccgacggcga agacgtcgat atgctacaac 5460 cgatcgatta tcagcgagac aacccgaccc atatcactta ttacacggca cgcaccttct 5520 acgagatcca cccgaacggc ctcgactgcg cgccgctgcg cttcgaaggc cacgcgatta 5580 tcgaagccta taacggctgt cccaacccac ggttccttga aaggaacgac gtctggctgc 5640 tgacagacac gccgctgccg cgcgtgctac tcaccttgtc gcagccgggg tacctacctg 5700 aagcaatccg cgaaagcctc gatcgccaca acgccaggct cttttaccgt gaggtgatgg 5760 gcttcgaggg cggacaccga ctcgacggcc tcgaaagctg gccgaccgct tgccagacct 5820 tcgaggaggc cggcaaatgg accttcagat taggattagc cggacagagg ccagagtcat 5880 ggatgatgca cgtacctgga ccgcaggaga gacccgctgg acctgaggaa gaagcaggtc 5940 agcgcgaagg gagaagaatc cgcaccacgc ccgtgctcga gctcagcgcg caggccgacc 6000 agaccggcgc gaccgcctgg aagatctatc gagtacgcac cgagcacgtc cgcgacaccg 6060 cttgtggcat cctggactcc tcgcgctttc gtgtactcga aagccgatgt ttcctcggtg 6120 tatgccagag ctgggtgccc cgacacgaag aggaagagga tttttggacc aacgactacc 6180 
gaagttggga gccggcgacc tggtggcgca tcaatggtga tgcccgtgat ggccacgcga 6240 gccaggcatc cggactttac ggtgcggacg ccgacaacct tccgcaccac gacgattacc 6300 accgcgtaca cgaaatcccc cctgctggca gcgctgccga agctgctcac catgaccacg 6360 aactcagccc cccgccttac agccccacct tttgtgccag cgtaggcagg acgtcgcagc 6420 agcacaaccg cgctgacgat gccgccgctg acgccgccga gcctgacagc gacatggacg 6480 acacccagag tgcgtccacg aggtcgacta tgggctcgga cgacgagata cgcaccccag 6540 gccacgctga aggcggtgag taccagcaac cacgcattgt acgcccgcac ttacccaacc 6600 tcgtccatcc agctcgcagc gagagctgga gcgagatgag cgaggctgag gaaggggaag 6660 tacgcgagga ctggaggcag aaggacgaga gacgggatag agaggaggat tgggagatga 6720 ccgagggctc ggtgagtgac gccagctcta cgttcagccg cgccgcctcc gaggaaacca 6780 ggcaaacgac gccgcaacat cataccccct cgcccgtcca acacctctac gaaaacgcag 6840 gcccgcatat caggccgccc cacatcagct tcatgtatca atattcgctc gcccgagacg 6900 aacacggcac catccacatg cgctacgtcc gcctcaggac gtacggcgag atcatccccc 6960 cgcatctcga ccctcggcgc cgtactgaac gccctgctac cacaaccacg aagatcgtag 7020 caatgactgc gacccgctgc cacctccttg gccgcgcgca ggccgtccta cacgaaacgc 7080 aagcctacga cctaccgacg tatttggccg agaagtggac cctcgaggag ccgacgaccc 7140 gcgacctgga gtgcctgctg ccctacaccg aagaattcgg caccccgtat ctctacgcgt 7200 gcgagcagct cgccattcgc cagtgccgcc accgctgggc tggcctccta gacattcccc 7260 atgccgccga ctacgttcgc gaatgcgacg cgatactcgc ttacggtctg ggcgaaccgg 7320 acgaagatca agtacgagcc gaccgcctcg caggctatat cggcaccccc agctacatca 7380 ccccggtcac cctctatcat atcggaaatc acacgagcac gcccctcccc cttgacgact 7440 atggcgaggc gcattgaaga agatacgcgc atgaaggcgc accacggaaa taactacggc 7500 acacggacaa cctcgaccac ggacttcggc gcaaacgcaa tgcgctacgg atatgttttt 7560 ctttatttta gcttcgtttt gtacgaactc tggactgaca cagacattgg actgcgaccg 7620 gacattttag gactaggatt tttattatgt acaagcgcag gcaccggcca gctcgggcgt 7680 agctacgtcg cccgcacgct cgcgagcctc tgcacaagaa atgccgccgc gctgccaccg 7740 gctcctgagt gcgaaagcac catgacgagc gtcagggcgc gtggcggcaa caaaaacata 7800 tctcccggta ctcttttcct tttcgtatgt acttgtatta gtaggagcgc gacgctcata 7860 tggacatttg 
acccggacga acttttctta gtttacaaaa ctctcgaaaa atttgtttat 7920 tttctttgta gtattatctt gacctatgga ctcgaaaaat gggcttaccc cttgtatatt 7980 aaactacaga aaaatacgga gcctccgaaa ccggcacgcg agccggatat tcgagatttc 8040 gcgacgagac gagacttcgg tgctcaagcg acccccgggt cggctaagcg cctcggcctt 8100 gcctcggtct tgaatattca gcaaacggca cccacgagaa tcgaaccctt gcactcggcg 8160 agcgcccgcg agcaaggcgc gcgccacaaa tcagcacctg acctcggcca accctcgggc 8220 atgccacaag gattgctacg aggctcagcg caagaaaatc cagaagcacg agcaggattc 8280 gaacctgcaa cccagacctt tggcgccaga tcagccgcag acaagttacc tgctgcacat 8340 ccgcatataa aatcggaaaa atcacaagaa tattccagca atacccgaac tcaaacacaa 8400 gggtactatc acacagcctt ggcaacccag gcaccagaag accccaggag acctaaggag 8460 atgttgggaa atgaagaaga cattaaatgc atatctgacc ccaagccctt gggcgggtca 8520 agagatgtag tcaacaagga ggaatggaaa ggattatatg ttacagtcct ggaggaggaa 8580 gacgctgagt ctccaaacgc tccactgttg tatacagtag ccatggagga tttcttagcc 8640 actctggatg atgacgatga gattatttgt atggacgggg aagaactctt cgctgcctgc 8700 aagccaatgc atctaccaga agtgtttgcg gcctacaaac gagtagaccg caaagtcaag 8760 cctgtacctg ctgtattccc cgaggacgcc agggttgaga gacacttccc tgaagaaccc 8820 ctcgattccc tgccagaatt acctacacac ccaccacaat tcaccccaaa tggaggcaga 8880 ctcacagccg aacgaatgaa ggaaatgaag ataaacgaag acggattcct atggccagaa 8940 gaggaaaagc tattcaagca tatcctccaa acccatcagg atcactttgt ttgggaagac 9000 cacgaaagag gatcatttcg agaagactac ttcacgccat acattattcc agtagttccg 9060 cacatacctt gggcattccc taatattcct ataccacaag gaatattaga ccaggttgta 9120 caactactaa gggaaaagat ggaggcagga gtatacgaat caagccaatc atcttatcgc 9180 tcaagatggt tctgtgtacg caaaaagaac ggcaagctgc gcatcgtaca tgacttgcag 9240 cccctcaaca aagttaccat acgggacgct ggactacccc cgaacctgga cagttttgtg 9300 gaaccctttg ctggacatca gtgttacact gtattcgacc tctattgggg attcgatgct 9360 cgcaaagtac acccccgcag ccgtgattta acagccttca tgacacctct tgggctgtta 9420 cgaatcacat cactacccac aggctttacg aactcccctg ctgagttcca ggcctgcatg 9480 tccttcattc tacaacacga aatcccacac aaggccaaca tatttatcga cgacctaccc 9540 atcaaaggcc cagcatcaca 
gtataaagat cagaatggac gacctgaagt tctgcccgca 9600 aacccaggca tccggcgatt catttgggaa cacgccaatg acgtacatcg aataatccac 9660 cgagttggac acgcaggagg aacattttcc ccagccaaag tacaactgtc cagaccagaa 9720 gtccttatag taggacagaa gtgtacgcca gaaggacgac tacccgatac gcagaaggtg 9780 gagaaggttc tcagctggcc tccgctcaaa acagtcaagg atatacgagc attcctagga 9840 ctatgcggca ctgtcagaat atggattgag aactactcag ttaaagcacg accgcttact 9900 gagcttatcc ggagaggaaa cgagtttatc tgggatgccc gccgagaaga agcgttctgc 9960 accttgaaga aggccatcac atccgcccca gcactgcgac aaatcgacca taaatccgac 10020 agaccagtta ttctcgctgt agactccagc tacatagcag tcggatttat cttatcgcaa 10080 atcgatgaag aagggcgaag aagaccagca cgctacggct caattccaat gaatgaacgc 10140 gaagcacgct actcgcaacc aaagctagaa ctatatggtc tattccgggc attgcgtgct 10200 tggagaatac acctcatagg cgtcaaaaag cttatagttg aagtagatgc caagtacatc 10260 aaaggcatgc tcaacgaccc cgaccttcaa ccgaacgcag ccataaaccg ttggatacaa 10320 ggaattctcc tgtttgactt cgagctggta catgtgccag cagccaaaca ccgaggacca 10380 gatgctttgt cacgaaagga gcgagcagaa ggagaaagca tggaagaaga agacgacagt 10440 tggctggacg acatcgcact gttcgcccat gcaacagcca cccatcagtt atcctcctgg 10500 caacaaggcg aatgggacct gtatcatgtc caaaccgagc cagaagatgt gaagcaactc 10560 cgcatatatg cttctgttct tcccaagcca gaacagacat tagtggacat ctacgacttc 10620 ctgcgaacac tccagacacc gtcgtttgac tctatacagg agaaaaaccg gttcatcaag 10680 aaatccctga agtacttcgt aaaaggaccc gccatgtacc gaagaaaccc tcacgggata 10740 cccaccaagt gcatcttcag aaaggagaag cgcaaagcca ttctggaagc agcccatgag 10800 caacttgggc accgaggcga gctagccacg tttcaaactg taaaacaacg cttctattgg 10860 ccaggactat ggaacgacgt acgccaccat gtacggtcct gccatcaatg ccaaatccga 10920 gccctgcgca aggcagaagt accaatcatg gtatcgactc cagccactct ctttgtcaaa 10980 atatacctcg acatcatgca catgcccaag gcccaagcct atgtctgcat catcgctggc 11040 aaggatgacc tttcaggcgt atccgaaggg cgccccctgc gaaacaagaa agctgaagca 11100 gtctccaagt ttttctggga acaaatcctg tgtcgatacg gcgccgttgg atgggtcgtt 11160 acagacaatg gaccagaatt catgggtgca tacagcctac ttatggatcg ctatcacctg 11220 ccacacataa 
agatctcggc ctacaattcc aaagctaatg gagtggtcga acgaggacac 11280 ttcatcatcc gtgaagcaat actcaagtca tgcgaaggaa ggatcaagga atggcctgac 11340 tatgtcagtc acgctttctt tgccgaccga gtcaccgtcc gccaacagac aggatattcc 11400 ccatactatc tgttgcacgg cacacatcca gtcctaccct tcgatttggt ggaagcctca 11460 ttcatggtgg atggcttcac ccgcaacatg gaaaccagcg acctactggc cttacgaatt 11520 cggcaacttc aaaagcggcc cgaagacata gccaaggcag cgcgaacaat gaaggccttg 11580 cgtctcaagt caaagacaca attcgaagag cgatatcgag cccgcatgat caacgagtct 11640 ttagccccag ggactctagt attgatcagg aacactgcca tcgaaaaatc gctggacaag 11700 aaatcgaaaa gccgctactt tgggccctac gaaattgtac gacgaaccgc cggcggatcc 11760 tacatcatca aggaattgga tggagcacat tggaggacag aaatagcagc cttccggctc 11820 attccttatg ttgctcgagg agacaagaag gagctgaaga aactggctgc cgaccttagc 11880 aagatcgacc cggtaccttc attggaagag ctggaacgaa gagcagacga agaagaaaga 11940 acagacgagg aagaagaaga ggactcaagt tctgaggatc tggaggaaga ttccgagagc 12000 aactgaggac agttgcaatt tcaagcgacc cccgga 12036 // ID Coprina_Pc1 repbase; DNA; FNG; 5058 BP. XX AC . XX DT 29-APR-2007 (Rel. 12.04, Created) DT 18-MAY-2007 (Rel. 12.04, Last updated, Version 1) XX DE Coprina_Pc1 is a Penelope-like retroelement. XX KW Penelope; Non-LTR Retrotransposon; Transposable Element; KW Penelope-like elements; reverse transcriptase; Coprina_Pc1. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-5058 RA Arkhipova I.R.; RT "Distribution and phylogeny of Penelope-like elements in RT eukaryotes."; RL Syst. Biol 55(6), 875-885 (2006). XX RN [2] RP 1-5058 RA Gladyshev E.A. and Arkhipova I.R.; RT "Telomere-associated endonuclease-deficient Penelope-like RT retroelements in diverse eukaryotes."; RL Proc Natl Acad Sci U S A 104(22), 9352-9357 (2007)in press. XX DR [2] (Consensus) XX CC Coprina_Pc1 is a Penelope-like retroelement from the white rot CC fungus, Phanerochaete chrysosporium. 
Its single ORF contains CC homology to reverse transcriptases. No associated endonuclease CC has been found. Most copies are associated with telomeres and are CC 5' truncated by addition of reverse-complement P. chrysosporium CC telomeric repeats, (TAAACCC)n. XX FH Key Location/Qualifiers FT CDS 345..4142 FT /product="Coprina_Pc1_1p" FT /note="reverse transcriptase." FT /translation="MPAQDTTRRAVRATASSSTGHFPSFTRALSPSQASVV FT LGEQISFGTFDERCAAALAHCTLLDEVIATLPKLYQVALSPVLTDIHHDAT FT KRVEIAAQLERLRQHKAAGTLPSPLKGLGSTPAIQFMKGFLESQKPDFPGK FT APDIVKACAAKLLDGFIETKAAEHAWYESRLQAMEVTRRVLTAADTHWKDE FT LEQRYQVLEGFVRPPVEGQVPATVPSAEELHPLKFVTDPVMVAQHENIRRS FT IPRLASTVIMLVVNRALTQQQRNKAKRDLKDQADVEMATDDTAVDSAVKRL FT EKQVEGLVSGMKALTNGKGKVSFSSSSRTHVEDAHPFVSGQSPSSRPFSLV FT DHRARHEAQGVQLGKRLRQRQDDGHQGPSRRLRCSSGEAGSPQEGQGEGTR FT EGQEDPEEIVAYVRAAPWRYDAPHSYPDVILMLPAPRAIQLLLERVPLEVK FT RAARFRSRVFLGPHVTVPEHLQMHVSSSLRYLMYSVPSSNVVREAWEDFKD FT RFRWRVFFTEKLIDSPNDDDPDFDPDYVVPKARKRFDDGVRTTPAYIELGL FT RKGDKFVDDYISNVMPSVKATSRSLNLVWLDELKDYLSANKYIVSPSDKNL FT GVCILQADWATDQTRTLWNDPSNYRKLGSLLEADEIMEEKHERIFRAARLA FT RRLGKRQLAEFLVSKIHGKDAKTAGYTLGQFYGVPKVHKRPVKMRPIVPCH FT SVSQAPSATYVSKMLKPLVSRQPHVIHGSKHLVRQLESLQLDKHRKAWIVS FT GDIVAFYPNIPLENCIRVVTQMWRNSPEAEKLSPEERQLFLMSFMTANKRL FT VIQFGDEYAEQIRGLAMGIACSPDLANLYGSHYENPILARPDMKSKFAFFG FT RYLDDVLGIVYASSAEEAMSYANEIRYDGVEIEWSVSEYHTPFLDLFLYLD FT PVEKRLHWKPYSKPLNHRERIPWASHHPKDVKKGTFIGEMSRLATLCSKLE FT HYRDAISDLSNLYLARGYPDDAVRSWVKAHFSQRWERRLGEPIVRDEVFVL FT KSHFNPAWSAFNVHGLSEVVVESWLSSLSKLDGANQSQDRTWHAAESTDGS FT ELKIFRQTTLGQFQGEQSSLYPGESRPEGVAGRVGVANVAKPRVTYDPRDV FT MDVDELVALSEPSFSAHDLSVVWTDNALRVPKSLRIRYAMAELRLDADPQV FT NPALHVPLGVEDQVGDLGPASDAESVASGDDGTTNSKPYRICTAKKDSGIV FT QEDLDIRSVGFHDRAWLVSRKKIRNLGDIVNSWKKDIIQAALNSDSVPHMH FT VDEWQ" XX SQ Sequence 5058 BP; 1129 A; 1442 C; 1366 G; 1121 T; 0 other; atgatctgtg catacaaaac cttcgtactt ttgccgagaa agagtccctt aaggggattc 60 gttgctcccg acctcgggga gacttttctc cacaccaatc gcattacagc cgacgagcga 120 catttcgcct ctcgtcggcc cgccagtccc 
ttcgggcctg gcggttgcaa ggggtatata 180 tcccagtagg aaaccgctga tgagtcttca cgacgaaagc tcagcgcgac acaacagtgt 240 cggctgcctt cggttttcct gtgaagcatt ccgcgctcgc cgctcactac gctcgcgtct 300 cgcgcccccc aaaccctaat caaatcgagt aagcactctt caccatgccc gcccaagaca 360 ctactcgccg tgcggtccgc gcgactgcgt cttcatccac aggccatttc ccctcgttca 420 cccgtgcgct gtcgccttca caggcttcag tggtgctcgg cgagcagatc agcttcggca 480 cgtttgacga gcgttgcgca gcggccctcg cgcattgcac cttgcttgac gaggtcattg 540 cgacgttgcc gaagctttac caagtagcgt tgagcccagt cctcacggac atccaccacg 600 acgctacgaa gcgtgtggag atagcggctc aactcgaacg gctgcgccag cacaaggctg 660 caggcaccct cccctcacct ctgaaggggc tgggcagcac gcctgcgatt caattcatga 720 aagggtttct ggagtcacag aagcccgatt ttccggggaa agccccggat atcgtgaagg 780 cctgcgccgc aaagctcctt gacggtttca tcgagaccaa ggctgctgag cacgcatggt 840 atgaatcgcg ccttcaagcg atggaggtga cacgccgagt tctgaccgct gcggacacgc 900 actggaaaga cgagctcgag caacgttacc aggtcctcga gggattcgtt cggcctcctg 960 tcgaagggca ggtgccagcg accgtcccgt ctgcagagga actccatccc ctcaagttcg 1020 taacggaccc tgttatggtt gctcagcacg agaacatccg gaggagcatt ccccggctgg 1080 cctcgacggt catcatgctc gtggtgaacc gtgcgctgac tcaacagcaa cgtaataagg 1140 ccaagcgcga cttgaaggat caggcggacg tcgaaatggc tacggatgac accgcagtcg 1200 actccgctgt caagaggctc gagaagcaag tcgagggtct cgtctcgggc atgaaggcgc 1260 tcacgaacgg caagggcaag gtgagtttct catcttcttc gcgaactcac gtcgaggatg 1320 ctcatccttt cgtctcagga cagagcccct caagccggcc cttctcgctt gtcgaccacc 1380 gcgcaaggca cgaagcgcaa ggtgtccagc tcggcaagcg actccggcaa aggcaagacg 1440 acggccatca aggcccgtcc cggcgactcc gctgcagcag cggcgaggct ggatcgcctc 1500 aagaagggca aggggaagga acaagggaag ggcaagaaga cccagaagaa atagtcgcat 1560 acgttcgcgc ggccccttgg aggtacgatg cccctcattc ctaccccgac gtgattctga 1620 tgcttccggc accccgtgcc attcagttgc tcctcgagcg agtgccgctt gaagtgaagc 1680 gtgcagcgag attccgttcg agagtttttc ttggacctca tgtaactgta ccggaacatc 1740 tacaaatgca tgtatcgtct agcctccgct atttaatgta ctccgtcccg tcatcgaacg 1800 tagtgcgtga ggcatgggaa gactttaaag accgcttcag gtggcgagtt 
ttcttcactg 1860 agaaactcat tgactcccca aacgacgacg atcctgactt cgacccagac tacgtggtgc 1920 ccaaggcgcg caagcgcttt gatgacggtg ttcgcaccac acctgcgtat attgagctcg 1980 gccttcgcaa aggcgacaag ttcgtggacg attacatctc gaacgtcatg ccaagcgtaa 2040 aagccacttc tcgtagcctg aacttggttt ggctggacga gctcaaggac tacttgagtg 2100 ctaacaaata tatcgttagt ccttctgata agaatctcgg cgtatgcatt ttgcaggccg 2160 actgggcgac tgatcagacc cgtactctct ggaacgaccc ctccaactac cgtaagctcg 2220 gatcactgtt ggaggcggat gagattatgg aagagaaaca cgaacggatc ttccgcgctg 2280 ccagacttgc taggcggctt gggaagcgtc agttggcgga attccttgta tcaaagatcc 2340 atgggaagga tgcgaagaca gcgggctata cgctcggcca gttctatggc gtacctaagg 2400 tacacaagcg accggtcaaa atgcggccta tcgtcccgtg tcattctgtc tcgcaagccc 2460 cctccgctac gtatgtttct aaaatgctca agccactcgt gtcgcgtcag ccgcacgtaa 2520 ttcacggttc taaacatctc gtgaggcagc ttgagtcgtt gcagcttgat aagcacagga 2580 aagcgtggat tgtttcaggc gatattgtcg ccttctatcc gaacatcccc ttagagaact 2640 gtatccgtgt agtcacacaa atgtggcgga acagccccga ggcggagaag ctttccccag 2700 aagaacgtca gcttttcctg atgtccttca tgactgctaa caagcggctc gttatacaat 2760 tcggcgacga gtatgcagag cagatccggg gcctggctat gggcattgcc tgtagtccgg 2820 acttggctaa cctctacggt tctcactacg agaaccctat actcgccaga cccgacatga 2880 aatcgaagtt tgccttcttc ggtcggtatc tggacgacgt actcggcata gtctatgctt 2940 cgtcagcaga ggaggctatg tcgtatgcga acgaaattcg ctacgacggc gtcgaaatcg 3000 agtggtcagt gtcggagtac cacactccgt tcctggatct tttcctctat ctcgatccag 3060 tcgagaagag gttacattgg aaaccttatt ctaaaccact gaaccaccgt gaacgcatac 3120 catgggcatc tcaccacccg aaagatgtta agaaaggcac ctttatcggc gagatgtctc 3180 gtctggccac gctttgtagt aaactggaac actacagaga cgccatatca gacctgagta 3240 atctgtactt ggcacgtgga tatccagacg atgcggtgag gagttgggtt aaagctcact 3300 tttcacaaag atgggaacga aggcttggtg agccaatcgt acgggatgaa gtctttgtgc 3360 ttaaatccca cttcaatccc gcttggtctg cgttcaacgt ccacggcttg tctgaagtcg 3420 tcgttgaatc ctggttgtcc agcctgagta agctcgacgg agctaatcaa tctcaggacc 3480 gtacgtggca cgccgcggag agcactgatg ggtccgagct gaagatcttt cggcagacga 
3540 ctctcggtca gttccaaggc gagcagagca gtctatatcc cggtgagagt cgccctgaag 3600 gggtggccgg ccgagtgggc gtagccaatg ttgcgaaacc ccgtgtgacc tacgatccca 3660 gggatgtcat ggacgttgac gagcttgttg ctctcagcga acccagcttc tcggcacacg 3720 acctcagtgt tgtgtggact gacaacgcgt tacgagtacc taaatcccta cgaatacgat 3780 acgcaatggc cgagcttcgg ttagatgcgg atccgcaggt aaatcctgcc ctccatgtac 3840 ctttgggagt ggaggaccag gtcggtgacc tcggacccgc ctcagatgcg gagtcggttg 3900 ctagcggaga tgacggcaca acgaatagta agccctatcg tatttgtacg gcgaagaagg 3960 actcgggaat cgtccaggag gatctggata tacggtccgt tggattccac gaccgcgcct 4020 ggcttgtctc ccgaaagaaa attcgcaatc tcggggacat agtcaatagc tggaaaaagg 4080 acatcattca agcagcattg aacagtgact ctgttccaca catgcacgtg gacgaatggc 4140 agtaatgcga ttcgataact gctggccgca atggtcagag gatatttact tagcggcata 4200 cactttggtt caaacccaaa acccctaacc taagccctaa gcctaaatat tcattagcgt 4260 ctccctcaca ggcctaaacc ccctggagcg gagttgtacc cgttgaacgg tacagccagc 4320 agttatagag tattttgttg tacagcgtca ttatttaatg gatgcagtga gcgcccgcct 4380 tcgtggcgtg ggcgcttcgg tgatcgccgt gatggccgcg cacgtgtccc acgtgttcgt 4440 ctcgtggctg ctaatagcgt ctgtgatcat gtgatcttcg acccagggct gcggttaact 4500 ctggattcga tgcgcatgtt aatagccggc ccgaatccgg gtcggctccg aactgactcc 4560 gctaggttcg ggagactttc gccggggtgg ctgcgtgccg gacttcgtcc gggcactttg 4620 tggcgttgat ccgatcagat cgagatctcg agacttcgaa aggtgacttt tcggtagtcc 4680 gctgcccgct ggggtgttgt tggccccttc cgcatgacct gtacattcaa aaccttcata 4740 ctttcgccga gaaagagtcc cttaagggga tttgttgctc ccgacctcgg ggagacgttt 4800 ctccgcgcca atcgcattac agccgatgag agacatttcg cctctcgtcg gcccgccagt 4860 cccttcgggc ctggcggttg caaggggtat atatcccagt aggaaaccgc tgatgagtct 4920 tcacgacgaa agctcagcgc gacacaacag tgtcggctgc cttcggtttt cctgtgaagc 4980 attccgcgct cgccgctcgc tacgctcgcg tctcgcgccc cccaaaccct aaaccctaaa 5040 ccctaaaccc taataccc 5058 // ID NHT2_I repbase; DNA; FNG; 5432 BP. XX AC AY038360; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 
7.1, Last updated, Version 1) XX DE Nectria haematococca copia-type retrotransposon NHT2_I, internal DE region. XX KW Copia; LTR Retrotransposon; Transposable Element; KW COPIA superfamily; NHT2_I; internal region; KW target site duplications; internal portion. XX OS Nectria haematococca OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Nectriaceae; OC Nectria; Nectria haematococca complex. XX RN [1] RP 1-5432 RA Shiflett M.A., Enkerli J. and Covert F.S.; RT "Nht2, a copia LTR retrotransposon from a conditionally RT dispensable chromosome in Nectria haematococca."; RL Curr. Genet 41(2), 99-106 (2002). XX DR Genbank; AY038360; Positions 234 5665. XX CC 5bp target site duplications. XX SQ Sequence 5432 BP; 1645 A; 1368 C; 1180 G; 1238 T; 1 other; aatagttata agcccggtag cactttaggc gcttactata actgattccg accatcatgc 60 aagacggctt gcacggtgcc tttcactaca catcacacag cggatacaag ccctgccact 120 ctctagtctg tcgccgctct atctgcttgc ctacgggagt cgttccagcc cgtcttactg 180 cccaacacta agatttttcc aaagatgaca gccaagtcta aagtcacgct ctccagaccg 240 tccgaataga ctacctagta cgacgatttc gtctcaaggg ctaagacttc taagatcttc 300 gactttatcg atatcgataa ggatactccg atcctagaag agccaaagga gcccatttcc 360 tctaaagaaa tgcttaagct actcaacaag gctacttacg aggcttagac acacgcaaac 420 cagcagaacg ctgcggccgc tggaccgaga ccagagccgg ccatagacct gacagatggt 480 caatggatcc gactcactcg tctccagtca gaatacaagg tcaaacttac gaccttcgcc 540 aatcggcaac atgcctatgc tgaactcgct atttaggtcc gatcaaccat cgataagatg 600 tatcttagca acacgacatc taagtctaac cttcgcacta tcgtccgcga cctaaaatca 660 agccttgcac cctctgagga cgaacaaaag gaggaagcac gcttaagata ccgtcaggtc 720 ctggcccaga ccaagagggt taagccggag gactagctcg tagcttagaa taaggctaag 780 ttagaaggcg agagacaaaa gatcgctgag cttaaagggc agtctaccct taatgatttc 840 ctcattgcta tctttgcctt tgatgccact taggcaagct agcagaaggt tcagatcata 900 aataataaga agttcggcct caagatggat atcacactgc gcgcactagg ggaactgttc 960 cacgaacacg ttcgaagcac tagaataact agcaagatag ccgatagcgt atttgccgtc 
1020 tctaaagcga cctaaagcca ctatccgtgc ggactaccgt cctccaaaca tcgatagaag 1080 cccattgatt gcgctgctat gtagatcgca atcaatggcg aatccaagga cggaaggcaa 1140 gccaactcca gccgcgtcaa gacctacaag gaggctctaa agcaagccaa gtggaaaagc 1200 cttatggaca cagtcaaaga ctcagcgccc accacgagga cgaaccaact agaggaaacc 1260 agaagcacca acacaaaccc acgcaaacac ggcgactttg ctagcttaac catcgatacc 1320 aatggcctcc ccagtggaac caccgacgca atcttcgcag ctttgcgaga gaagcatcct 1380 ctttataatt cgacgatcta tgacggtgga gcaactaccc atattatcaa cgataggagt 1440 ctactaaccg aaatccgcga ggctggggaa accgactacg tcataatcgg cgaaggttct 1500 ttgaaggtag aggcacaagg cacccgaagg atagaaaaca tactagatag cgaagacaga 1560 agaaatacgc gcgctcttat cctccttaat gttacgtata tgccaagatt ccacacgaac 1620 atcgtcagcg cgaaaaggct tataaggaag ggattatggc attatgggtt caataacacg 1680 ctccgaatag gcacctataa ggacaatgac atcctttgcc gactcataga ccactatgac 1740 ctcgatatag tcgaatacaa gccagtttcc cgctcttact tcaagcttct acaatcagtg 1800 atctccatat tcaacgcctt ctaagatata ttctcctcct tccaagcgaa caaaccgaga 1860 cgatatagaa tagcgatttc aagggttccg cctccgccgc ggtcggattc ttccgaaata 1920 tagcatcttc gctcttgcca tgcagggcca caagcacttg aacaacttat cttatagacc 1980 aagggagtct agatcaaggg tccgactact attcaatgca cggcctacgg cacaggcaag 2040 gcgaccgaga ttatctcccg cagggaatca tcagaccggt caaaacaacc attttaccgg 2100 atcttcatcg atatctttga gttccctacg gcattcaata gacaccgcta cgttctacta 2160 atcacagacg agttcaacgg cataatgttc tcctagtcct tggcttcaaa gacggaagtc 2220 agtaagatca tcaaggactt taaggcccga gtcaagcgac acataggtgc atcgatctat 2280 aagattagaa tcaataacga aaggactatt attaatctac cttatcaaag ggactccgag 2340 ttttagacct aggccataga ggtaggaata gatatcgaac tcccaccgtt atacactaaa 2400 gagccgacta gtagagcgga acgacctagt agaatcaatt agctacggat gcggtatata 2460 attaggtatc ttccgtaaga tctctagcta gaattctact gtgtagccac ttggatatat 2520 aatatccttc ctagccgaag aaacaaataa atattaccta aggagaaaat aaaccgctgg 2580 tttcactagc actttcgata atatagaccg ccaaatatcg atttcaatat aacaaaggac 2640 ttccgaccag actggcgagg gatttatgca tatagatgca aagcatatcc cttaaacagc 2700 
cgctataagg ccgggaaaga cattaatcaa ttcaagcttg gcccacgagc acatattgga 2760 tatctcgtta gctactatgc cagcaatatc tatcatattt gggtccccaa gctcaatcag 2820 gttatcctat cccgagacat acgattcaat aagggggaat tctttgatct agagcaagag 2880 gaatagctca agacggaagc agtcgtggaa tataggctaa caccgcaggc attagagcca 2940 cttcctcagc gagactggga tacgatcttg gacgaatttc tctatgagtt cgatgactat 3000 atcttaggtt tcagcccaga cggttcaaac tcaggggtag aagacgctag aggaactaca 3060 gacaacacat cacagcctat acagcctcta tcgcaaagac ttctcatacc tctaggcgaa 3120 ccaccaattc ttccgacgcc agacccaaca ccacagctaa acaataggga cgcctcacca 3180 gcttccgggt caaccggctc gaacacctca acaccagacc cgccattgct agcagacccg 3240 agctcaccta cacaacaatc taaggctagt ctagacagtg atcttaacga atatcatgac 3300 accacgacac gaagatctac tgcgccacca ccagatggcc aattcaaaga catcaagcca 3360 agtcatgaca tttcaagttc aactaagtca cgcgatctta gtcctaaagg agtcacccgg 3420 gacataatcg tagttcaacc gctaaaccca cgccggcgac cttcctcaga aggcaacaac 3480 atgcacccgg acaagcagcc gtcaggtaac caaacaactc ctaaagctcg cggcaccacg 3540 cgcacacgcc gcagccaaaa agaactctac ggtaaccaac caactcgcaa gagcgaccag 3600 acaccaaagc caaatcggag gtataataat aatatgcaca gcgttttcac tacatagctt 3660 ccaaccgatc aacatccgaa gggaaacctc gcaacatttc atagcgtctt tctcgccgca 3720 gtctccaaga cggcacgaga ttaccgattt caccgggata accttatacg acttcctaag 3780 cgatatcagg atctagaaga ccacccaata ggtctagaat tcaaagcagc cataaggaag 3840 gagttacacg atctcctctg acgaggcacc tggagactag tcgaacgcga aaaagctcgt 3900 ggactacccc ttccgctgaa gtgggtgttt acctataagt tcgatcaaga tggctacctc 3960 taaaggtgca aggctcgcat ctgtgtccgt ggggatctyt aagacgatga tgggggaccg 4020 gaaacttatg ccgctacgct cgcagcaaag accttcagga tcataatagc aatagcggca 4080 cgattcgatc tcgagattcg ataattcgac gttggcaacg ctttcctcta tagcgacctc 4140 aagaaggacc aactagtcta cgtttagctt cctaagggat atgtggaact tggcttcctg 4200 aagccagggg aaacgtcgac tatgatcgcg gaactcgata aggccctcta cggtcttcgc 4260 gaggctctag tcctttggta taatgagatt tctgagacgc ttaaggaagt agggattgat 4320 cgtatggatg aagaaccttg cgtctttacc aacggcaaga tcctagttct catctatgtt 4380 gacgatatcc 
tgatccttaa cccccggcaa gaaaagaagg cagtcggcga cctcgtttaa 4440 tatctctaat ccaaatataa cctgcgcgaa gaagagttta aatggtatct tggcattaga 4500 gtcatcaggg atcgactaaa tcggaagatt tacctctgct aagacgctta tgttgagaaa 4560 atcgcacgaa aatttaagct ctgcgattca aaacttcgag tgctattaat cccgatcata 4620 acaatccctc tcgtcaagca taatagatag gcgtctaagg aagaaatcaa ggcgtattag 4680 gaacgagttg ggtctttgat gtatatcgca gttatgacta ggccggacat agcgcatgta 4740 gcagcctagc ttgcaaggtt cttgacaaac ctatcgcctg aatacctttc tgtagccaac 4800 cagtgcatcc gttacctcta cataatgaga ttccttgcca tcgtctatga tgggatgcac 4860 gcaggagagg ctcttgtgat cgcgagtgat gcgtcatttg ccgatgatat cgagacgcga 4920 tgcttatcgc agggatacat catgatacta tttaacggac cggtcgtatg gaaggctggt 4980 ctctaagata tagttatgac ttctacaacc gaagcagaaa tcttaagtct agagagaaca 5040 gccaaggaga gctacgcctt agaccgccta cttagggata tctctcttga tcttagacct 5100 ctaaagatat actgcgataa ccttcagtcg atccgacttg tggttgagga gaatcagaga 5160 attacgacga agttgcgata tgttgacgtc cagaatatgt ggctgaagta ggaattcaag 5220 aaaggtaggt ttctggtgga atatctcaag atagattaga tgccagcaga cggcttgacg 5280 aaagctctat caagatcgaa gttttaagca ctttcgatct atgctgaatc tgatagatgt 5340 taaacgggag gtggatttgg atgacggtag ttctatggct ggttgtgata gcttctagca 5400 gcgttgactt gatcactcaa cgctgggggg tg 5432 // ID Copia-3_MLP-LTR repbase; DNA; FNG; 327 BP. XX AC AECX01001156; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_MLP_; KW Copia-3_MLP-I; Copia-3_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-327 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001156; Positions 138858 139184. 
XX SQ Sequence 327 BP; 89 A; 64 C; 48 G; 126 T; 0 other; tgttacaatt atgatgtttc tcatttattt gacatttacg tatgtttcac actttacatg 60 tcacatgtcc aaaatcatgg atttcctgct cttattccaa ttcacatatg tatctttgat 120 tacgtgtaac atttctaaac ctaatcaaaa cctatgtatc gcttgttgtt tcatttataa 180 acagtactga cagagcgcca tgagctcttc tctcttcatc caagaatttc tcatacaatc 240 ttggttgaag gtgagtaact cgtggttgtg ggattcagat tagattcaat ttattaagtt 300 tgaatcatac tgatccctgc tctgaca 327 // ID YETI-I_PA repbase; DNA; FNG; 5839 BP. XX AC AJ272171; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 03-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE P. anserina gypsy-like LTR retrotransposon, internal sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW gypsy-like LTR retrotransposon; LTR; putative pol domain; KW YETI-I_PA. XX OS Podospora anserina OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; OC Lasiosphaeriaceae; Podospora. XX RN [1] RA Hamann A., Feller F. and Osiewacz H.D.; RT "Yeti--a degenerate gypsy-like LTR retrotransposon in the RT filamentous ascomycete Podospora anserina."; RL Curr. Genet 38(3), 132-140 (2000). XX RN [2] RP 1-5839 RA Gentles A. and Jurka J.; RT "P. anserina LTR retrotransposon internal sequence."; RL Direct Submission to Repbase Update (16-MAY-2005). XX DR GenBank; AJ272171; Positions 395 6233. 
XX SQ Sequence 5839 BP; 2172 A; 1019 C; 795 G; 1853 T; 0 other; tataatcttt aaataatcga agacgataaa caacacacta caacaccata cgcgctatca 60 ccacttaccc cgctttcctc ctttgttgat gtccaccaca tcatcatcag ctccaccctc 120 agatatcgtt gaaggcttcg ttccactacc ttgcaaccct tgggttaacg acgacgaaga 180 ctcggacttc actacaggag aatactcctt caacgcgtcg cccatcccgc gacccttcat 240 tcccggctac gccgcccacc ttcctctccc gacgtcacta gtcacggtgt cgcctacgta 300 cgcgaccgct cgaaacctac aagaagagct cgacaacgca tctatctcca catacaatat 360 atcggactat cgctcccctt cggccgagcc ggtccaggtc aatgaccaca tggagctcac 420 gctctactcc cctcaggact tcgaagtcat tataagcgat ccctccgaca tccagccgtt 480 acctggtact atctataatc cggcactata gattattgaa ctttatcgcc gaatgagcga 540 ggcttatgag ggtatacgac agcttactta ataataatat aaggagcgta ttaaccacta 600 ataactcgta tactttaaac agcaatacaa gtatatccgc gaacaatata ctatttacgc 660 ggatatagtt aagctcgggg tctaggtttc taaataataa atcaggcact tccggcagta 720 aattaaaaat aatagcgccg cttttgcgaa ctaaatctag aaagctatcg caattatttt 780 ttaaaacgaa taataacgtt aataaactat cgagtagctt agcgaagcgg ctagccgata 840 ctactccgct ttagtaacct tcgaaaagga aattatattt taaaagagat ttaataacga 900 tatttagaaa taaactaatt tacgtaaata aaatatcgct agactcttag ctaaggagtt 960 cgtcgatccc gctactctaa acgaatataa ataaatcgtt ataaaataga ttaaaaagat 1020 tataagaaaa gattttaagg aatttcgcac ttaaatagaa cggaatctta ataccgatat 1080 cgactctatc tagcgtagcg ttaattaacg cgtattaata taactatcgt aattaactat 1140 tattatatta ttaaaggtat tatccttaac ttagcgtatt aaggagcgct taactactta 1200 ggcactataa gatacttaag tacgccgtat agcttaataa actaatatct aattcgaatt 1260 atttcgggcg agtcctatat atataacact tacttttagt aataacggag gaggaaactt 1320 cttcccataa taaattatcg ttattaacgg aaaattaccc tctaatccta acgactcgaa 1380 tataaaggac cttaataata aacaataaga aagaggaccc cctaacggag gagataaagg 1440 attacctaga ggtcctctag cctcgcgtaa ccgcttataa tcactaaatc ccccgaagcg 1500 aaataacttt acttaattta tttattaatt ttcctaaata ttagtctaaa ttaacccggt 1560 atatatagta aaacccccgg cctataataa taaaaacttt tttaaattcc ggtcttaata 1620 gtttaaagta gaaatatata tagattccta 
ccctataaag ttccccaccg attaaattaa 1680 aattaattag gttagattac tttttaccga taaaagctaa atataatatt aatagtaaat 1740 taattaaata taaagactaa gctttataaa tacctaaagt aattatatta ccgttattaa 1800 agaatacttt cgggatatag ccgaaaaata ccgtaatacc tataaaatat aaaaactccg 1860 gtattaaaaa aatattttat aatacttaac cgagcttata gaccttaata aagtaattta 1920 ataatcggga attacgtttt agagttatat tttaaaggta cttccggacg aaattactaa 1980 attaatttac tcgagatagg aaagagtttt aaagataaat aagaatttcc tttacactat 2040 ccgaaaagta aattatattt ataaaaatat atttataaat cctaatatct taattaacta 2100 ccgggtaaaa aaagatatac ttacttcgac ttcggagcgc ttaaagtcgg gccctctcta 2160 ttaaagtcgt ttttaataat tattaaactt atccttaaaa aatagtaaag tattttctaa 2220 actaccgaac gttaaacccg atttaaaaaa tattaaataa gtaaccgtta aagaggcttt 2280 agcggggatc ttttaaaaaa acgttaatag gtataagaaa gacggttcta tttattaaca 2340 atatagtcgt aataattact ataccctcac ttacttcgct agaaaggata tttctagtaa 2400 cgacctcccc gcggctttac ctaaagtaat taaagttaaa cggaaggccg aagaaaaaaa 2460 gaaaaaaaaa aaataatatt tttaaagaaa gtacgaatcg acgctatcgg aatttacttt 2520 aacgataatc cactagtctt taaggtaccc gacaacgata tatccggcac cgaaacgggt 2580 ttttactaaa ccgcctactt acgccgatag gatactatca gttaaccggc gttattatta 2640 caggagttaa gtgcctttac gaaaacgacg aagaaaagta actacgacgc cggttaaagg 2700 taagagcccg gcctattatt aatataaata ttattaaagc tattataagc cgcccccgcg 2760 agtacttagt atacgtcctc cttaacctta gtagcgtagt gcctatccta tcggaagata 2820 tagttaagcg ccttactatt aaaactttcc accaccctcc taagttctac ctccagtcct 2880 ttactaaaaa aggaaatata aataaataag cgctctactc cgatactata gtacttaagt 2940 acgtagaaaa ctattttacg cagctttaat ttaaagtaaa taagtttaat ccggaatata 3000 atattatttt attatattaa tagctataaa aatattaacc ggctaggttc tataattagg 3060 acccccggaa aattatattt atatttctta tttaccgtaa gcggtatact agcctaatag 3120 tattcgggaa agttaataaa attattttta atataaccga cgatcgaagc tttattccta 3180 aatacttccg ggacctagta ccgctaatcc tttataatat cttttaaaag ttattaaatt 3240 ataagcccta aaactatact atcgacctaa tcgataataa aatattactg tagggcccta 3300 tttattcctt tagcggtaaa aaacttaaat ttttataaat 
ataattaaat aaaatattaa 3360 tagaaaacaa aattaggcct tccaaatccc cttacagtac tctcgtcttt ataattaata 3420 aagatacctc catcgaaaaa aaaggaaggt ataaggacta cctccggccc gtaatcgatt 3480 aaagagtatt taatactatt ataatcctta atcggtaccc tttatcgctt attactgagc 3540 tacaaaactg ctttacggga gccaaatact ttactaagat cgatttaaag aatagattta 3600 acctcgtccg aattaaaaaa agcaataaat aaaaaacaac tttccgttac cattatagtc 3660 tcttcgaatt tacggttata tattttaaac taattaacgc cctaactacc ttctaaacta 3720 tagtttataa agtccttagt aactttatta atataggagt attaatatat ataaataata 3780 tccttatata cgccgatact ataaaggagt acgaccggtt tacgaaagaa atattatagc 3840 ggctaaagaa taataatttt attattttaa ttaaaaaaaa tatataagta ataaagtaag 3900 ttaaatacct taaatatatt atttttaagt ataggatctc tatatttaaa aagaaaataa 3960 attatatttt aaattaaaaa aggctaacta cgcttaaagc aatttaatcc tttattaact 4020 tcgctaactt ttattaatat tttattaagg gatttttaat tattatttaa acctttatag 4080 ttttattaaa gttcgacgta tattaataaa aataaacgta ggttatagag cttacgttta 4140 ataaattaaa aaaagttttt attattaccc cgatccttat ttatttcgac cttacgaaac 4200 tcgtagtcct aaaaataaat acgtttaact tcgttttagg cggaataata ttctagaaaa 4260 agaacgataa taaactttac ttaatcgttt tttattttaa aaagtttact aaaccgaaaa 4320 taaattacga agtatataat aaagagttac ttacgattat taattatttt ataaaataat 4380 gtagatacct agagggagcg gattattata ttaaagtata ttccgactat tagaacctaa 4440 tatactttat tacggttaaa gtcttaaata aatattaagt aaggtaagcg taaaagctta 4500 ctgcctataa ctttattatt atatatcgtt taggacggtt aaatagtaaa accgatatac 4560 tttcgagact cgattaatat taacctaaaa aggggggaaa taaggactaa ccgattactt 4620 tagtcctaaa aaattattat tttttaccgt taaacgaagg actttttttt ttaatttctt 4680 ccgtacggtt tattagtatt ccggctatat tagtgtagga cgaagggttc ctcgctaaag 4740 tcaaaatagc cgcccgttaa aacgccggtt atattaaaga tctagcaata ttacctaaaa 4800 acgaatacga aaattaaagt ttactatact ataataacct attaaaagtc cctaacgatt 4860 tagtattacg aaaaaaaatt ctcgaatcta aatacaactc catcgtcgta agctacctag 4920 gtatagatag gactatcgac cttattcggc gtaatttcta ataattaagt ataaattagt 4980 atatccgtaa gtacgtacga aaatataaaa agtattaata aaataaaaac 
ctccgctata 5040 atacgtttag gttactttaa ttaatagaaa ttccgtagaa gccgtagaga gcgatttcta 5100 taaactttat tacggatttt ctattatcct ctaggtgcga ctcgatttag gttattataa 5160 atatatttac gaagataata tactttattc ctctatatat agaagcgaag aaaattaata 5220 atttaatccg gattttcgcg tataaatatt aatagctcta cggtatacta atagatatta 5280 ttttagacta cgatttacac tttactattt acttataaaa ggatttctta aagctcgtta 5340 gtattaaaag ccgtataagt acggcgtttt atttataaat aaacggttaa acggaaatcg 5400 ttaattaaaa actaaaaata tatttccgta cctttattaa ttacgagata attaattaaa 5460 ataaattact ttctatagca aaattcgcgt ataataatac ccgtaaagta ttaataagta 5520 tatcgccttt cttcgctaac tatagttatt accctacctt taataacctt tcgtcgaata 5580 ctattattta taacccttta agccggctat acgcctactg aataacttaa gtttataagg 5640 aagtatagaa ttccctaaaa aacacctaca aacgtataaa gcagtaagta aatttaaaaa 5700 gaaaagaagt accttcgttt aaacaggatt aattagtaat gcttaacgcc aggtatatta 5760 agactcgtcg accctctaag aagctggata agaagatgtt aggtccgttt aaaatccaaa 5820 aggtaatctc gcctaccgt 5839 // ID Gypsy-113_MLP-LTR repbase; DNA; FNG; 834 BP. XX AC AECX01000737; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-113_MLP_; KW Gypsy-113_MLP-I; Gypsy-113_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-834 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000737; Positions 44617 43784. 
XX SQ Sequence 834 BP; 222 A; 167 C; 152 G; 293 T; 0 other; tgtcagcgag ggcataagtt gcctagacaa cttacctctt agctaacaag taacagttat 60 ctacatgact taagaacatt gaacgagtta tattcatttc tattcatcac tccttttctt 120 tattagtttc tctcaaagat caattcttct atttctttct tgatattcta tctattttct 180 ttttccttta ttacataact ggatcattta aattcatcga actcgtgcga gagatcttga 240 attaatcaat gatgttatga aatcttgagg aacgtagact accccagttc ttggctctcg 300 aaaggattgg gaaccctctt cgagttccat tgaaatccaa aacctatctg actagcaagt 360 gggagctgct agtcttctgg tctggggcag tttacaatcc gacggatgga ttacataatt 420 gtattatggg acgactcttg gcgaagggag tgctgggctc aatcgggatt catttctgtc 480 atataatagc ggaactccgt gaaatacgta actcaagata ctatgtgcga agtgttctct 540 tgtaagcgtg gtaaagactg atgagggtat aaaacccggc ttgtttctcc tttgttccat 600 tttgttttct tcatctttta gtagaataat aaaaccaact aaatagcttt ccagcttaat 660 cagttccaca gcatccacta cattccaact tctgttaaag aatcctttaa tccatcttgg 720 ttagtgttgt ccacgcctca aactcattta gttccgctta tttctatatt atctgaaggc 780 cttagtttgg gttttgtgat taacttgacg gtaggaatac cgtgcttgtt gaca 834 // ID YLT1_I repbase; DNA; FNG; 8025 BP. XX AC AJ310725; XX DT 08-JUL-2003 (Rel. 8.06, Created) DT 08-AUG-2007 (Rel. 12.07, Last updated, Version 2) XX DE Yarrowia lipolytica retrotransposon Ylt1. XX KW Gypsy; LTR Retrotransposon; Transposable Element; YLI310725; YLT1; KW YLT1_I; YLT1_LTR; gag protein; pol protein; internal portion. XX NM YLT1. XX OS Yarrowia lipolytica OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Dipodascaceae; Yarrowia. XX RN [1] RP 1-8025 RA Senam S.; RT "Ylt1 of the yeast Yarrowia lipolytica - sequencing reveals RT uncommon features of an fungal retrotransposon."; RL Unpublished. XX RN [2] RP 1-8025 RA Barth G.; RT "Direct Submission to GenBank."; RL Direct Submission to Genbank (16-MAR-2001)Barth G., Institute of RL Microbiology, Dresden Technical University, Mommsenstrasse 13, RL Dresden, D-01062, GERMANY. XX DR Genbank; AJ310725; Positions 715 8737. 
XX SQ Sequence 8025 BP; 1565 A; 2544 C; 1883 G; 2033 T; 0 other; tctggtggac gacacctcgt ttttgtttcg agtgatagcc tgccagattt attgtgcctt 60 tactgtcgta ccagcgtatt ttacttccct ttccgttctt tcgttttcac catgtcgaaa 120 gtaacaaaag acgagttcca ggctctcacc gcgaagatgg acgccttgac gctctcccac 180 caggagatca ccactactct cgctaccgcg gttaatacca aggaattccg ctcagcctta 240 gacgagctca agcagtcaaa tgagtctttc aagacacatc aggctgggga gtttgaaaag 300 ttgcaccagt tggtgtcggc ccagcacgaa accatcgcga acctcagcaa gcgtgtcgac 360 tcacccccga gcaccagctc ccttgaaggc atttcccgta ttggcaaggc ctgattccga 420 atttgaaccc gacacccccc agaagggtaa ctctgtccta catggcatgg actttgctgc 480 cagtgacacg ggtctgttga ctatcgaaaa gaatccgacg ccctcaagtc catcatccgg 540 aagaggttta cccctccttg tccagcgaag aattccgaaa acagttgtac gcttacaaat 600 cttttgattc ccgggtggct acctttatcc tactccagga tgacatccgt tggagcacca 660 gagctcgggc tttccacctt tgggcggtgt caacgataag cagcccctct tcgatcaaca 720 gtacctccaa ctccgaagca cctttgaagc agagactgct gatgaatcaa ccaagccagg 780 tgcccacgct cgtctgaaca ccgcttgcga acgtctcctc cgggaccttt ttcgaaagag 840 tgggtttgac cacgaacctg ctgtctccat tgctgaacaa gaggaggaac tctgtttccg 900 gttcaaagat taccacaccg ctttctctat cgagacacta cgtgcgtacc tctctgctct 960 gagcaccgct ctgacctcct cacctctgcc ggttctctct cggtgtctcc agttgcagaa 1020 gctcgctccg actcacctgg gcaccgccct cggacgcgag atccctcgtg acaacactcg 1080 ttggctgagc cttggcatgg actccgaccc tgagcgtgac catcaagttg ccacggagct 1140 catccagccc ctcgccaaga tgatcaccca acatgtccaa cgacttgacg atctcaatga 1200 tgaggctctt ctgagatggc gacagcacac ggtatctttc gaagacgtgt ggatgtgggt 1260 cgagccttcc gccccacctg agccttccac gcctgaacct gacaaagtcg tcgttcacgt 1320 tgcgggacgt ggtcgttcca ccaggaagcc tgccgacggc cctgtctcaa ctgagactgt 1380 gcagaaaccc acaatccgtg cctcggatgc ctcaacctcc tgggcagctg acaaggctcc 1440 ttgtgttttc tgtggttcca ctgcccatgc actggtcaac tgcgatgact ccgaaggcag 1500 ccccttggtc aaggccaagt accttggcag cttcaagtct ttcacccgac ttggctacca 1560 gggttttgaa aagtatctat cgcctgttga tgctgacttc cctctaaaac agagcgctct 1620 ctacacttcc tggaagcaga aggaatggtc 
gaatccgctt atggagccta gacttcgaac 1680 tttcaatgcg cagtggcgtc ctgccctagc tggccgcgta aatgccgttg aatttgtaca 1740 cgttgaagac ccaggcgggc ctttggataa cagctatgac tctgactcat ctgcgtctgg 1800 ttatgacttc caggaccttc tccaacccgg cacgttcagt gtctgtctag gaggggtgcc 1860 tcgagatctt ctgtctgata cgttttcctc ctacgacgag cctcgcacta tcattgactc 1920 cactgactcc cagcttgagc tcacgaaaga ggcaatccac cagcacattc tccagatggc 1980 aaaacagccc acacccctcc ccgtcactta cgcaattgat gcgcctgtgt caggctccta 2040 cagtggcaac atgaagcgcc ttgtggcaca cccagccttc atgcagctcg tagctcgctt 2100 aactgctccc ccgggcacgt tcaaagcctc cacgcttgaa gcgggttttg taggtgctgt 2160 tatggccccg tcttccccag ttgccggccg tgtctacttt gttgatgcgg tccagcgatt 2220 gctcaacaag tacaacgtag tattgcctac gaagctttac gagtgtttcg ctacagtccg 2280 aaatgacctc atggagcccg cttacgccac cgaaggcgat cctcgccgac agtctctcaa 2340 gaccaacatc aacgccttga aaactgttgt cgacagcaag caccctgaca gaccagtggc 2400 tccgttgccg cgacgcagcc cccgtcgtga tgtccgtgaa gacatgcccc cgcctgcttt 2460 gcctcaagcg acaaaacgtg gtgccgcttc ttctacggta tcttccgctg ctccccccac 2520 tgctaagcga accaaggctg tagccaatcc ctcctcagtg gggccgactg actccgcatc 2580 ctccacgggc gctgttgttg acgtccccag ctctcgtgtg gctgttcacc ccccccgtct 2640 gggtgataac cactacgtga gccctggaac ccgtgtcacc aaccacatcc acgatgcctc 2700 tgccgttgcg gagaatgcac ccttaaaaga ttacaaggat gctctggacc gcctcccgtc 2760 tgacctagag gactccaagg ctctcttcac cagagatcat cctcgttacg acaatgcctt 2820 gctgaaggct gggcgtctcc ctctcttcaa gcttgaggcc attcatgctc tgccggaatc 2880 ggaaaaagca gacttgtttg agcgcatcct caaggcgtcg gatgttgatg gcctcgttct 2940 ttttgagctt ctccaggtgt gccctgacct cacaaagtac atttggaaga acttccgtca 3000 ccagcggcac cggcttgcgg gtcctgacat ccaagccatc gctcttgagc tcggggacga 3060 tctcatggag tgtgccatgg atcttgcgct caacgtgatt tcgtccacac cgtatgagct 3120 caactctggc accttcggtc gtcttcagga ggttttcacc accattgacc gtcagctgta 3180 cgacgacaaa gttggtaggc cactctcgcc ccacttcacc gacaacattg ctcttttcga 3240 ccaacagacc agcgcccttt ttgacagtgg ctcgtctaac aacgtcatcg acactgattt 3300 ctttgccctt gtactcgcga aggctggtgt cacccctgac 
cgggtgattg ttttttctga 3360 cggacagtcc cacgccacag tcgcaaacgg agccaaggta aaagttgatt tctgggcgct 3420 tctccctgtc actttcctgg gtgtcgttac tcttgaaaca ttttgtgtca tgaagtgcag 3480 catgaagtgt attctcggca ccggatacat cagcaaactg cgcatttcgt tcgaccatga 3540 tcgttaccgt gtcgcttcag tggagaaccc tggtaaccct ggcgtccggt gctaccctag 3600 tgacagacct tcagctggcc ttgttgctca ccttgggttg cttgaccgtc ttgtgaggcc 3660 cggccgccgg cctgtgcctg cgtttcctgc tgccaagctg tctcgccaaa acgtcatggc 3720 acacgttcga cccactccct cactctgtgg tgatgttgat gaagcaacaa atctccttga 3780 cgcctcatct ctcggagggc ctcagcctgg atcttactcc caccgttctg aaacagcggc 3840 aggctgtttt gccatatcct ttgctgatga ctgtgagtct gccggcccgc cttcgagcgc 3900 tgccaccgcc attctcaagg ctctagcgtc cagctccggc cctgatcccc ttggcattcc 3960 ccctcgtgat gaccctgacg agtcttactc ctcatcacct gggcaccacg gtgatgtctc 4020 tggcgagccc aactcctcat cacctgggca cacctccgct cccgtcgacg gtacgcttgg 4080 acctttactg cctaccgatg cctcatctgt ggatgactcc aacgagctct gctcctcctc 4140 ttcaggctcg gaagccccgt cggaagcccc gtccggagtc tcggccgcct tgtcagatgt 4200 aggcaccgtt ttccaggaga gctacacgtc gtaccgtttc catgacaacc ttgcagacaa 4260 tgctcttcat gccagacccc cccgacgctc atgctctcag tggttttatt tcttctgccg 4320 acctcgacag tgttttccag tatccacctc ctgcatcgcc ctgcagctgc tgccagcagc 4380 ctgtacgtga gtgtcgtgta cttggtaaca ctgttttcat catggctgac attggcgact 4440 cggctaccat cgttcaggtt caaccggata ctgacctgtc catgcgccaa cagctctacc 4500 ttgctgaagt tctcagtgat gcccaagaaa tcacccctga agactctctg cattctttat 4560 ctgtctccgt caatgccatg tacaagccgc tacacaagcg ttctctgcct ctaaataagc 4620 ttcgcccgga cggttccttt cctgtcggtg acggctccaa gccttctccg cgacatcgca 4680 acttctctgg cgacgagtct tgccaatttg atgccaaact tgctccagtc cttttcccgg 4740 ccgagcttgc tctctgtcgc catcgcatgt cggacacgga gggtgtctgg gctttcaacg 4800 aagaccagga gggtgtcctc agtcaccata ttgaggagcc caccaagatc tacgtggaag 4860 agggaggcgt tatcaactca aagcacttcc ctctccgcgg ggctatggtc ggcgctgcca 4920 aagacatcat catgaagggt ctcgccaacg gccagatgga gcccagctcc tccccccacc 4980 gcaacgcctg gttcctcgtg agcaaaaaga gctcgggata ccgtttcatc 
cttgactgcc 5040 agggcctcaa caagatcacc ttgagagatg ctttccaccc acccaacgcg gacctcctgg 5100 ctgagagttt ctgtggtcgt gctgtaactt ccctgcttga cattaagaat ggttacggtc 5160 agaaggagat tgctcccgag tcccgtgact tgacggcttt taacacagat tttggctcct 5220 atcggttaac gcgcctgcct caaggttggt gcaactctcc agcggtgttc caccgtgcca 5280 tgctgcgcgt acttgggccc ctctttccgg atcaggctgt tgttttcttg gacgatattg 5340 gcgttctcgg gcctaagact gactatggcg gagccatgca tgacgacttt ccgggctgcc 5400 gccggtacat cgtcgaacac atggacaacc tcatggctgt gcttcagaat ctctatgaag 5460 cgggtctgac tgtgtctttc gacaaggccg agcttttcgt cagtgaggct gagttcctcg 5520 gtttcctcac tacctctgaa ggccgcttcc cgtcccctgg cagttccgag aagatcgaat 5580 ctttcgagtt ccccactact gtccgtggtg tgcgctcttt tctcggtgct gtggtgtatt 5640 ttcgcatgtg gatccctcat ttcagcagta tcgctgcacc tctgtacgac tgcatctccg 5700 ctgcgcagaa ggctggcaaa ctcaagatca ccaagaccga ggccaccgag tctgctttta 5760 tggcgctaaa aaaggctatg gtgagccctg cggttctgca ccgctacgac cccaccttac 5820 ctattgtcat caccactgat gcgtcctccc tcggatgggg cgcagtaatg tctcacatcg 5880 tcagtgttgg ccctccggct gcccgtcgcc ccgtccgttt cgagagtggt ttgtggaacc 5940 ccactgagcg tacctacgca tccaccaaga ctgagtgcct tgctgtaaaa cgtgccttgg 6000 agaagtgccg tcactatgtc actggcgttc atttcgtgat cgaaactgac aaccaggccc 6060 tggttttcct actgcagcaa tcccgagttg aactccctaa cgctatgttc acccggtggt 6120 ttggttacat caaacagttt gattacgagg ttcgatttgt taaaggccga gacaatccag 6180 tggccgattg gctgagtcgt gagaaatttt ctgacttccg acctgtcgat tttcgccctc 6240 ctgttgctga tacagctcga caagctgatg agcttgctcc gcttgtgccc ccgacttggt 6300 cccctgtggc ctccatcact gtcctgtcca ttggtccaga gcccgttttc atccacaaag 6360 gcatctctct cgatctcatt ttcaccacca ttgcctccgg cgatcttgat cgagatggtg 6420 ttgacattcc gcctagactg cgtcaaatct gctctgagtt cttcattttt gacgacattc 6480 tcctactgat cagctcacct ggtctccatc gacgtgttct cttcacagag aaggaggtgt 6540 ctgaggttct ccgagctacc catgaacaat atggccaccg cggtgctgct gccatccttc 6600 atgctctccg tcgcctctat tactggccgg gcatggctga tcacgtgaag tcccatcgtg 6660 catcatgcgg cacgtgtgca aaagccacca accatggtct tctcaaggcg agtctccact 
6720 tcgtggttcc ccgtctcatt tgggagacag tccagctgga catcctctac cttcccgctg 6780 ttcatggtcc caccaaggag taccccgatc tggctgatcc cgccaaggct ctcgctgctc 6840 aaaccaccct caccgacttc ctccccactg cccctcagtt cactgatgtt tcctctgccc 6900 cccgcggacg gctaaatgtc actattgccc cctaccagta tgtccttgtc gctcgtgatg 6960 aattctctgg ctggcctgag gcggtgccct tacgaagtat caactctctc tccacagccg 7020 ctttcttcta cgatttcatc attgctcgtt ttggcgttcc ccgtcgggtc tacactgacg 7080 gtggtagtga gttcaagggt gattttaagc atctctgcga agatttccac atcaagcagg 7140 ttttcaccac tcctgctcat ggtcaatcga ctggcattgt ggaacgcggt caccaaaacc 7200 tcctccactg cctgcgcaaa tacggtcgtc agtggatctt atacctccac accgccctct 7260 gggctgaccg gtgcacccgt cgttcatcca cgggtaagtc accttttgag ctgatgtatg 7320 gtgtctctgg tgtcttgcct gtcgaaagtc gtttcctgac ctggaattac ctcagtggca 7380 agaccgacct ggcccacaac gaccccgctc atgctgcttt tctgcgcacc ttgcaacttg 7440 ctgcttctac tttcgaagtt ggctctgctc gtgaccacct gaccctgcaa cgccagcgtc 7500 aaaaggcgtt ctacgacaaa catcacaaca cagctgatac agatcccttg ggcgtaaatg 7560 atttcgtttt tgttcacgac ctccgtcccc acaacaagct gactcctcgt tggactggtc 7620 cctccatcgt gaccgcgtgt caccctgaga ccagcacgta cactgtcaac gatgttgacg 7680 gtgagaaccc gcggcgtatc caccgcaacc gcctcaaggt tttccacccc gcttccatcg 7740 ttgagttcca ggaccgcatg aaggaacatc agtcccgcga gtctgctctc cctgccattc 7800 ctggccgctt ttctgcctgt tctccacatg ttcttccccc ggcttcagtg gctactcgtt 7860 ccgtccgttc tgcagctacc actgcgtcga ctagagttac ttctcggtcg aagctcgctc 7920 gtgttgattc tgggcttgct cagggctcct tccttgctca gggcctttat tcttgattct 7980 tctaaccagg gcttcgggac gaaaccccgt tagacctggg caggt 8025 // ID Gypsy-20_RO-I repbase; DNA; FNG; 5697 BP. XX AC AACW02000339; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_RO_; KW Gypsy-20_RO-LTR; Gypsy-20_RO-I. 
XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-5697 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000339; Positions 21864 16168. XX CC Positions [2533-2991] - Reverse transcriptase CC Positions [4255-4734] - Integrase core CC 'CTATT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 10..5253 FT /product="Gypsy-20_RO-I_1p" FT /translation="MNSDVEPNPQPPPARSSPDEVATSLDKISISSSDNMI FT HPTIEEDTPMTEASSSNNNAETLSPEDFINKLRSQKAAMLKIIDSINDQNI FT QFVMQNNFEGLESNDKKLAFATKSLERFDKLIADQVNIISLNSAMKETPVS FT RHINNKYIPSKELPKFNVNPTTSAMYQLTQHSGNKNNNNATNEPSLEMFLR FT EFERAFRDHNVSIQDHWLSNLEICFESCDNNLHFDWFCRYVKKPVVELKRK FT VTWDDAKALLQEKFDLASQTTPQAWMKLLLNFKQRPDQSFADALHHFRLFS FT IGAKVSSTEGNLLNSLFVSRLYTTKFQDTVMAIITNHTKNLPSTEPAHPGL FT SIHAPCNAPSPLNMSWNNFEAILMKNMANLESSLLYIQKEMQREQVKKPTN FT DDSENYKKRKLAAFSPPINERITTVKTLPTSSNNNHSNNNNRKHTDFYEKI FT ADLKRRGICTFCESAKYTTAHAENCAQRKRYDQRKSNNQHNNKPVSSNSLN FT NTDKAIASLELTLEPNTACQNLNKSETNNLHPSTISSQNNNISSGSPSCDS FT SSEDEYDRIYKEFDRSLISPYLFDHFDFETINDVNIFALRNNHVGDIDNPF FT NSDSVSFSPVTPITLNGVNAYGIIDTGANISVMNKKFANDNNIEFNPVPGN FT LILANGSTIPRMQTTHFVNVEYDNVDQVIPHRFDVIDNSILSYENKILIGI FT DLLPKLSIHLMKVAIKHKSTEKEIDDSIVDKAYEPNISRAGTDKEQKTFDL FT AIKSYVEANKKLNKNTLCNIEDAVLHLPTPENYVANVKQYPIPYSVQPKVM FT EIINSWLDDGIIVPAKPSAWNLPMTVTFKKNLDGTKTDKIRLVLDPRMLNK FT VLPVDNHQLPLINDIFNSMTDAVVFSTLDLKSAFNQFPVFGPDQHKATFTA FT PNNLQYMYRGAPFGISTISQLFSRIMLNLFKDLPYVKCFVDDICVFSSSIH FT AHFLHVKKVLQILTNANLRVNFEKAYLAKSAVYLLGYCISAEGKKIDTRKL FT SNIHEWPRPTTQKQVQSFLGFVNYFRQHTPNASFLMAPLDALRSHDEKING FT PFIWTSTHQMHFDSIKRILSSELILSHPDMSHPFCIATDSSDFGTGCCLYQ FT EFEVTQSNGETSKIKRYIGFMSHSLSRSEKRYSVTMRELLGVVYALTQFHK FT FIWGTRFTLYTDHKALCYIHSQKNANSMLIKWLDVILDYNFNVVHVRGLDN FT ILPDKLSRLYPPKEPVEHEQDLNEGKRIASFRGNYATSNNINNKKRHRAHV FT 
KHNVSINVVLVDHKPTTSDALPYVYPATHSDDTVNYNRNLDISSNFHSFNK FT KSNDSNEYNDFNRLHDKNIFYVQSAHSVFKDYFVPPDSEHRELMTDAHNKV FT GHYGAEQMVKRLHNEGIHWPNLISDCVKFVRQCNECMKHNIEKKGYHPLRS FT IYSYYPGDHYAIDLGGPMHTTSISNNNYFMVIIDVCTRFCILRALADKKAD FT TILRALIDAFCIMGFPTKLQSDNGTEFKNSLSKDLANAMGYDHRFITPLHP FT SANGLSERFVQTVKKLLAKSTNGVGNDWDAFLPSIQLAMNNRISKRLNSTP FT FSLMFARKMIETYGFRSDKDKLKEVKGKPPMSHEELMKRIDYMTDIVFPAI FT AAKTKAQVELEQAKFNDSHRLADYAPGSHVMVRIPTKSGQLAPAYEGPFTV FT VRKNQGNAYILRDETGVLMPRSYTTTELKLISNEEIIELDDEGNEIQSYEV FT EAVLNHRGPPKNREYLVRWKNYSSEWDE" XX SQ Sequence 5697 BP; 1840 A; 1170 C; 938 G; 1749 T; 0 other; tttttttcga tgaattctga cgtcgaacct aacccacaac cgccgccagc acgctcatct 60 ccagatgaag ttgccacctc gttggacaag atatccatct catccagcga taatatgatc 120 catccaacaa tcgaagaaga cactcccatg accgaagcgt cttcatccaa caacaacgct 180 gaaaccctct cacctgagga ttttattaat aaacttcgta gtcaaaaagc tgccatgttg 240 aaaattattg acagcattaa tgaccaaaat attcaattcg taatgcaaaa caatttcgaa 300 ggattggagt ccaatgataa aaagttggcc ttcgctacaa agtccttaga acgttttgat 360 aagttaatag ctgatcaagt gaatattatt tcccttaata gtgcgatgaa agagactcct 420 gtctcacgtc acattaataa taagtatatt cccagcaaag aacttcccaa gtttaatgtc 480 aacccaacta caagtgcgat gtatcaactc actcagcatt cgggtaacaa aaataataac 540 aatgccacta atgagccatc tttagaaatg ttcctgagag agttcgaaag agctttcagg 600 gatcataatg tctcaattca agatcattgg ttatcaaact tggaaatctg ctttgaatct 660 tgcgacaata atctccactt cgattggttt tgccgctatg tcaaaaagcc tgttgttgaa 720 ttaaaaagaa aagttacctg ggatgatgct aaggcccttt tacaagaaaa gtttgacctt 780 gcctctcaaa ccacacctca agcatggatg aagctactcc ttaacttcaa gcaaaggcct 840 gatcaatcct tcgctgatgc tttacatcac tttcgtcttt tttcaattgg tgctaaggtt 900 tcatccactg aaggcaatct cctcaattct ctttttgtat caagactcta cacaacaaag 960 tttcaagata ctgttatggc tataattact aaccatacaa aaaatttacc ttcgactgag 1020 cccgcacatc caggtcttag tattcatgct ccttgtaatg ctccctctcc attaaatatg 1080 tcttggaaca actttgaagc tattttaatg aaaaatatgg ccaatctaga aagctctctt 1140 ctgtatatcc aaaaggagat gcagagggaa caagtaaaga agccaactaa tgatgatagt 1200 
gaaaattata aaaaaagaaa attagcagct ttctcaccac ccatcaatga aagaattact 1260 actgtcaaga ctcttcctac cagcagtaac aacaaccaca gcaataacaa caacagaaag 1320 cacactgact tttatgaaaa aattgctgat ctcaagcgtc gtggtatctg tactttttgt 1380 gaatctgcca aatacaccac tgctcatgct gaaaattgtg ctcaacgcaa acgctatgat 1440 caacgtaaat cgaacaatca gcacaacaat aaacctgtaa gtagtaactc tttaaataat 1500 acagacaaag ctattgctag tcttgagtta acgctcgagc ctaacactgc ctgtcaaaac 1560 cttaataaga gcgagactaa taatcttcac ccttctacta tttctagtca aaataataat 1620 atcagctctg gttctccctc ttgcgactct tcttctgaag acgaatacga tagaatttat 1680 aaagaatttg acagatcttt gatctcgccc tatttatttg accattttga ctttgagacc 1740 attaatgacg ttaatatttt tgctctgcgt aataatcatg ttggagacat tgacaatcca 1800 tttaattctg acagtgtgtc tttttcacct gttaccccaa taactcttaa cggtgttaac 1860 gcttacggta tcattgatac aggtgctaat atctctgtta tgaacaaaaa attcgctaat 1920 gataataata ttgagttcaa ccctgttccc ggtaatctca tccttgctaa tggttcgacg 1980 attccgcgaa tgcaaacaac tcatttcgtc aatgtcgagt acgataatgt tgatcaagta 2040 attccgcata gatttgatgt cattgataat agtattcttt cttatgaaaa taagatttta 2100 attggtatag atcttctacc aaagctctct atccacttaa tgaaagtagc cattaaacac 2160 aagagtactg aaaaggaaat tgatgattct atcgtcgaca aagcctatga gcctaacatc 2220 tctcgcgctg gtactgacaa agaacaaaaa acgtttgatc tggctattaa atcttatgtc 2280 gaagctaaca aaaaactaaa taagaacacg ttatgtaaca tagaggacgc agttttgcat 2340 ttacctactc ctgaaaacta tgttgctaat gtaaaacaat atcccattcc ttattcggtt 2400 caacctaaag tgatggaaat aattaatagt tggctcgatg acggtatcat tgtccccgca 2460 aaaccttctg catggaattt accaatgacc gtaacgttta agaaaaacct tgatggcact 2520 aagacagata aaattagatt agttttagat cctcgtatgt taaataaggt tttacctgtg 2580 gataatcatc aacttccact aattaatgac atttttaact caatgactga tgcagttgtc 2640 ttttctacac tggaccttaa atcagcgttc aatcagttcc ctgtttttgg ccctgatcaa 2700 cacaaggcta catttactgc tcctaataat ttacaataca tgtaccgagg tgctcctttt 2760 ggcatttcta caatcagtca attattttct cgtataatgt taaatttgtt taaggacttg 2820 ccttatgtga agtgctttgt ggacgatatc tgtgttttca gctcttccat acatgcgcat 2880 ttccttcatg 
taaagaaagt tttacaaatc ctcacaaatg caaatttaag agttaatttt 2940 gaaaaagctt acctcgcaaa atcagcagtt tatcttttgg gttattgtat ctccgctgaa 3000 ggaaagaaaa ttgatacccg taaactttcg aacattcatg aatggccgag accgaccact 3060 cagaagcaag ttcaaagctt cttaggattt gtaaattatt ttagacagca tacacctaat 3120 gcttcatttt taatggctcc tctggatgct ttacgttctc atgatgagaa aattaatggt 3180 ccctttattt ggacttctac acaccagatg cattttgaca gcataaaacg tatcttatcc 3240 tccgaattga ttttatcaca tccggatatg tcacatcctt tttgcatagc cactgattct 3300 tcagattttg gcactggttg ttgtttatat caagagtttg aagtgactca atctaatggc 3360 gaaacatcta aaatcaaaag atatattggt tttatgtctc attctctatc aagaagtgaa 3420 aaacgttaca gtgtcacaat gcgtgaattg cttggtgtag tttatgcctt aacccaattt 3480 cacaaattta tatggggcac acgatttacg ttatacactg accataaagc tttatgctat 3540 attcattccc aaaagaatgc caatagtatg cttattaaat ggcttgacgt tattcttgac 3600 tataatttca atgtggttca tgttcgagga ttagataata tccttcccga taaactctct 3660 aggttatacc cgccaaagga gccggttgaa catgaacaag atcttaatga aggaaaaaga 3720 attgcttcct ttcgtggtaa ttacgctact tccaataata ttaataataa gaaaagacat 3780 agagctcatg tcaaacataa cgtttctatt aacgttgttt tagtggacca caagcccacg 3840 acttcagatg cactaccgta cgtctaccca gccacacata gtgacgacac tgttaattac 3900 aacaggaatt tagacatttc tagtaatttc cattctttta acaaaaaatc taatgactct 3960 aatgagtata atgactttaa tcggcttcac gacaaaaaca tcttttatgt tcagtctgct 4020 cattcagttt tcaaggacta tttcgtgccc ccagattctg aacatcgtga gttaatgaca 4080 gatgcccaca acaaagttgg tcattacggt gcagagcaaa tggttaaacg tcttcacaat 4140 gagggtattc attggccaaa tttaatatct gactgtgtga agtttgtcag acaatgtaat 4200 gaatgcatga aacataatat cgagaaaaag ggatatcacc ctctgcgtag tatatacagt 4260 tattatcctg gtgaccatta cgctatcgac ttaggtggtc ctatgcatac cacttcaata 4320 tctaataata attattttat ggttatcatt gacgtttgta cacggttctg tatacttcgt 4380 gccctggccg acaaaaaagc agatacaatt ttacgtgcac tgatcgatgc cttttgcata 4440 atggggtttc ctacaaaact tcaaagtgat aatgggactg aatttaagaa ttctctctct 4500 aaagatttag ctaatgctat gggttatgat catcgattta tcactccttt gcatccttct 4560 gctaatggtt taagtgaacg 
ctttgttcaa actgttaaaa aattactcgc taaatctacg 4620 aatggtgttg gcaatgactg ggatgcattc cttccttcca ttcaacttgc aatgaataac 4680 cgtatatcta aacgcttaaa ttctacacct ttttctctca tgtttgcacg caaaatgatt 4740 gaaacttatg gctttagatc tgataaagac aagcttaagg aagtcaaggg taaaccacct 4800 atgtcgcatg aagaattaat gaaacgtatc gactatatga ctgatattgt ttttcctgct 4860 attgctgcta aaacgaaagc tcaagttgaa cttgaacaag ctaaatttaa cgattcgcac 4920 cgattagctg attatgcacc tggttctcat gtcatggttc gtattccaac taaatccggt 4980 caactcgctc cagcttatga agggccattt acagttgtgc gaaagaatca aggtaatgcc 5040 tatatcttac gtgatgaaac tggtgtcctg atgcctcgtt cttataccac tactgaactg 5100 aaattaattt ccaatgaaga gattattgaa cttgatgatg aaggcaatga aattcaaagc 5160 tatgaagttg aagcagttct taatcaccga ggtcctccaa aaaaccgtga gtaccttgta 5220 cgttggaaaa actacagcag tgaatgggat gaatagctga cagctgataa atttaatgat 5280 ccagacattc tacgtaaata ttggaaaaac atgggacaaa agtatgttcc tcctaaaacc 5340 gctaaaatca caaattctcc ttcttcttct aaggtactaa agaccaaccc atctggtacg 5400 attagcacta tgatgaagtc tgtatcccta gatgatgaca attctaattt ggctgatgtc 5460 gctgcacatt caaccaaccc aaattttaaa cgttcatatc ctcactcgat atcgcgccag 5520 tctaaaaaag ctcgtcgtga taaacaacac agacattcca caccaaatgt gcaagttgct 5580 gctactcgta ccagcaagag attgcgaaat tcctcttcat aaaaaaaaaa aaaaaaaaaa 5640 aagtgttttt caaaacaagt cactttttta ccgtaatctc acactggtgg ggggcta 5697 // ID Gypsy-1_MVPL-I repbase; DNA; FNG; 4787 BP. XX AC AEIJ01000645; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Microbotryum violaceum genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_MVPL_; KW Gypsy-1_MVPL-LTR; Gypsy-1_MVPL-I. XX OS Microbotryum violaceum OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Microbotryales; Microbotryaceae; OC Microbotryum. XX RN [1] RP 1-4787 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Microbotryum violaceum genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AEIJ01000645; Positions 9791 14577. XX CC Positions [3658-3981] - Integrase core CC 'AAGAA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(263..1471,1475..3202) FT /product="Gypsy-1_MVPL-I_1p" FT /translation="MPTPTAPSHRPTYKPNDPPQLHSTSMEALREYFMRLD FT FYNRRIARTPITSDYDKIEGAGMGLQVFALKTWFTQGLSKHLAKPYAQFKL FT ELICRAVPADFVWTQLAKLHRLRQAPGHNIDAFQEFSDQMRSLQMEIGFQV FT VSDQEVAKLVLLGTDPELCRIIRTHMVSSLAGWSDLLLEKLALDAPVVPTS FT PEAAEVFSDAHTDQPLEFDYQVFERIGREEWSVIAQRCQAIATQVSALQQR FT PTYSRPAAASAAPHDSCYRPFPDHGWPPPRRPTHRRRARVPGPEPRMLSLL FT HSQRRPHLAQLPPLLVTHGSVRHSVNANVDRSCASPRAPYGGGHSDLPRNW FT RFNRVRRPQQRLGGRQRSFVRTASLSPLPVSLAGAHGTLLASALVDLGLPQ FT TFLSEELVQLGLERRALEQHSKYTLAMQNQVPTVFTCTHFVRVPLELANGL FT WAAGPTYAEVAPLGKDLAIILGGNFIYRHKMELGLFPQPHLTCKANPIHAI FT DLLALSSQPCAATRVAAIAPVSDDEEQLRLAALDHRLRAEFADRFPADIPP FT VHLYQSPVRHRIELDTPSTVVNLRGYPLPKKYREAWYLLLQEHLAAGRLRP FT SRSAYSSPSFIIPKKGCDVDPMIAPRWVNDYRELNKHTIKDRTPLPLPDNI FT LSTCSNAQFWAKIDMTNSLFQTKMAEEDIHKTAVSTPWGLYKWTVMPMGLC FT NAPATHQRRVNEALQGLLGTICFVYLDDITIFADTLEEHEARVRQVLDALR FT RAELYCLPTKTNLVTVECSFLGHIINRAGVHADPKKIQRIEDWSLPKMVKE FT LRGFLRLVQYLRKFIPGLAEHTAALTPLTRQGLSSIATLWTPNEGRHFKAI FT KAIVTSLDCLRPPDHSANAALFWVMTDASNQGIGGVLLQGQEWKVARPIAY FT WSRQYIPAERNYPTHEQELLAIVEALKEWRIDLLGSHFHILTGHSTLEHFQ FT TQRTVLSCQQAHWLDTLAKFDYDL" XX SQ Sequence 4787 BP; 1014 A; 1617 C; 1194 G; 962 T; 0 other; cttttttttg acaacgttga gttcattggt accccgacaa tacggcctac gttcatcagg 60 cccggccgta gtcgaaccat cgaacccatc aactagccca gatcggtcga ggcaagccaa 120 tcccacaacc atagcacgca gactcccagc cccagcacag cccagcccag cactgcccag 180 cgcgcccagc aaccccagca cgcccacaca cgcaccgcct tcgcacgcct caccagccga 240 catcgcaccg tccccgcgca ccatgcccac gcccaccgcc cccagccacc gccccacgta 300 taaacccaat gaccctcccc aactccatag cacctcgatg gaggccttgc gcgaatactt 360 catgcgcctg gacttctaca accgtcgcat cgcccgtaca 
cccatcacga gcgactacga 420 taagatcgag ggggccggta tggggttgca ggtctttgcg ctcaagacct ggttcaccca 480 gggcctgagc aagcaccttg caaagccata cgctcagttc aagttggaac tgatttgccg 540 ggctgtacca gcggacttcg tctggaccca gctcgccaaa ctccaccgtc tacgccaagc 600 ccccggccac aacatcgacg cctttcagga attctccgat cagatgcgct cactgcaaat 660 ggagattggg ttccaggtcg tctccgacca agaagtggcc aagctggttc tcctgggcac 720 cgatccggaa ctctgccgga tcatccgcac ccacatggtc tcgtcgctgg ctggctggtc 780 ggacctcttg ctcgagaagc ttgccctcga cgctccggtg gtccctacca gtccggaggc 840 cgctgaggtt tttagtgacg cacacacgga ccaacctctc gagttcgatt accaagtctt 900 tgagaggatt ggtcgcgagg aatggagcgt cattgcgcag cgatgccaag ccattgcgac 960 ccaggtcagt gcgctacaac agcgccccac gtactcgcga cccgctgcag cctcggctgc 1020 accccacgac agctgctacc ggccattccc cgatcatggg tggcccccgc cccgtcgccc 1080 gactcaccgt cgacgagcac gcgtacctgg accagaacca cggatgctat cgctgctgca 1140 ctctcaacgc cgaccacatc tcgcgcaact gcccccgcta cttgtcaccc atgggagcgt 1200 ccgccactcc gtcaacgcca acgttgaccg ctcctgcgcc tcgccccggg ccccgtatgg 1260 tggcggccat tcggaccttc caagaaactg gcgattcaac cgtgttcgcc ggcctcagca 1320 gcgactcgga ggacgacaac gatccttcgt acgcaccgca tccctttccc ccttacccgt 1380 ctccctagcc ggcgctcatg gtactctcct tgcctccgcc ctagtcgatt tgggattgcc 1440 tcaaacgttt ctcagtgagg agcttgttca gtgactgggt ttggaacggc gcgccctgga 1500 acagcattcg aagtacaccc ttgcaatgca aaaccaagtc cccactgtct tcacctgcac 1560 gcactttgtt cgagtcccgc tcgagctggc caatgggctc tgggccgcag gaccaacgta 1620 cgccgaggta gcgccccttg ggaaggacct tgcaatcatc ttgggaggca acttcatcta 1680 tcggcacaag atggagctcg gcctcttccc tcagccacac ctcacctgca aagccaaccc 1740 catacacgct atagacctgc tggcgctgtc gtcccagccg tgcgctgcca cgcgggttgc 1800 cgcaattgct ccggtttctg atgacgagga gcagctccgc cttgccgcgc ttgaccatcg 1860 cctccgagcg gagtttgccg accgttttcc tgcggatatc ccgcctgtac acttgtacca 1920 atccccggtg cgacaccgta ttgagcttga cactccgtcc actgttgtca accttcgagg 1980 ataccccctc ccgaaaaagt atcgcgaggc atggtactta ctactacagg aacatctcgc 2040 ggcgggtcgc ctccgcccct ctcggtcggc atactcgtca ccctcgttca tcatcccaaa 
2100 gaagggctgc gacgtggatc ccatgatcgc cccgcgctgg gtgaacgact accgcgagct 2160 caacaagcac accatcaaag accgcacgcc tctgcccttg ccggacaaca tcctctcgac 2220 gtgctccaat gcgcaattct gggccaagat tgacatgacc aactccctct ttcagaccaa 2280 gatggcggag gaagacatcc ataagacggc cgtgtccaca ccttggggtt tgtacaaatg 2340 gaccgtcatg cccatgggcc tctgcaatgc ccctgccact caccaacgac gtgtcaacga 2400 ggcgctccaa ggcctgcttg gcaccatctg cttcgtctac ctcgacgaca tcaccatctt 2460 tgcggataca ctggaggagc acgaggcccg tgttcgccaa gtcctggatg cgctgcgccg 2520 tgccgaactc tattgcttgc caaccaaaac caaccttgtc actgtggagt gctcgttcct 2580 tggccacatc atcaaccgcg ccggggtgca tgcggacccc aagaagatcc agcgcattga 2640 agactggtca ctacccaaga tggtcaagga actcaggggt ttcctcaggc tggtgcagta 2700 tcttcgcaag ttcatccctg gccttgctga gcacaccgcc gcactgactc ccttgacgcg 2760 ccaaggactc tcatctattg ccacactctg gacacccaat gagggccgcc acttcaaagc 2820 catcaaggcc atcgtcacct cacttgactg ccttcggccg cccgaccact ccgccaacgc 2880 cgctctgttc tgggtgatga ctgatgcaag caatcaggga attggtggtg tcctcctcca 2940 gggtcaggag tggaaggtgg cgcgccccat tgcgtactgg tccaggcaat acatacccgc 3000 tgaacgcaac taccccacac acgagcagga actccttgcc attgtcgaag cactcaagga 3060 atggcgcatt gacttgcttg gcagccactt ccacatcctc actggtcact ccacacttga 3120 gcacttccag acccagcgca ccgttctctc ttgccaacaa gcccattggc tggacaccct 3180 ggccaaattt gattacgacc tgtgatacct gcccggcaag gacaacattg tagccgacgc 3240 catgagtcga tactctttca ccgacccact gcccacccta gtggccgctg tctcccacgt 3300 caagcttttg gatgctgtca agcagcagat ccttgacgtg tacgagaccg acccattctg 3360 ccagcaagcc atgtctaaca ttggctcggt cacatcggac ttcaagattg ttgatgcatt 3420 gctgtattta cggggccggc ttatcattcc atcactagcc ccgctccgtg agtccatcct 3480 tcacaatgct catgatgcgc aggggcacct aggtgatatg aaaacgtacc ggaccgttca 3540 gcaggcatat ggccaaacat gtcctgggac gtcaagcact acgtccaaca atgtgattct 3600 tgtcaacgga cgaaggctcg taccactcgc attgctggca aactccactc gctacctgtc 3660 ccggcacaac ccatagccga cattgccatc gattttgtcg gaccgctgcc cgccaacaaa 3720 ggctttgacc gtgtgttaac aatcaccgac cgcctgtcgg ggtacgtccg tttgctccct 3780 
gcacacgaag ctgacgctgc agcggaggtt gctgcgtgct tccatgaagg atggcaccgc 3840 ctctttggcc tccaccaaag cattgtctcg gaccgggaca agctgtttac aagcaagttc 3900 tggactgccc tacacaagcg cctgaacgtc aagctccagc tctctttggt gtttcacccg 3960 gagacagatg gtcgcagcaa gtaaacgaag aagactgcct tccagattct tcgagcgctc 4020 gtcaacaagg agcaatccaa ctgggcggag tgccttgctg tctgtgagta tgccatcaac 4080 tcgtctctca atgtcgctac cgggaaaacc cccttcgagc tcgtgcttgg ctacacgccc 4140 tccctcgccc cacttgccca tgtcgacggt gatgacaact tgctgtccgt tgaggagttg 4200 cttgcgcttc gattccaagc ctgcgaagac gcccgggacc agctcgccat ttccaaggtt 4260 cggcaggcag cacagtccaa caagaagcgc caggatgaac cctcttgggc cgttggggat 4320 cttgttcttc tcgactcaag tgatcgccga aaacggctcc acacccgcaa gcgccgcgca 4380 gccaagctca tggaccgctt tgacggtccc tatcgcattg tcaaggccca accggaaatc 4440 cctgcaactg aacggggatg atgcagcggt accattcttt cacacaggca agctcaagac 4500 ctaccgcaag aacgacacag ccctgttccc caaccgtgaa cctgcgagac tgggcccagt 4560 ggatgtaggt ggcgagccgg agtacatcat tgaagacatt accgacgaga ggatccgtgc 4620 tggcaagcag caatgactcg tctcgtggct cttgtggccg tcagacagca attcatggga 4680 gcctgccgag gcactggagg atacggaggc attggatagg tgggagcgtc ggaacgagga 4740 gtaggatttt tgaccggagg aggggatttg tttcaaggtg gggagag 4787 // ID MULE-3_Cglob repbase; DNA; FNG; 3331 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW MuDR; DNA transposon; Transposable Element; MULE-3_Cglob. XX OS Chaetomium globosum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Chaetomiaceae; OC Chaetomium. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). 
XX DR [1] (Consensus) XX SQ Sequence 3331 BP; 817 A; 930 C; 898 G; 686 T; 0 other; gggccttgct aggcgcgcac cccccaaaat tgctgcgcgc gcaccccccg cccagaattg 60 ctaggcgcgc accctgcagg gttagggtca gaggttcaac atcgccaacc cgcaatcaac 120 acaggtaagc tgcatagcaa aaacatgttt tctacggcta atctgaccgt tctagaatgc 180 taggaaccca ccagaatgtc gcaaaaccct cgcgcggtcg agccgcaggt gctcgaacaa 240 cctgcaccag ccctgccgct cgccacagag ttttcggctg caggcgagga ttctgacgaa 300 tacgacgata tcgatcttct tatcctccag ccaatcgacc tcacagaatc ggagacccta 360 cctcaccacc tcgcccaacg caacaccacg cccatctcca tcaattcgag cgattcaagc 420 gatcgaggag gctattcgga accttcttca ccaattgcaa gcgctataaa ccgcgcaggc 480 tttgttgagc ggccccgttt agaaccaggc gcgatttcca gccagtcaga gtgggacggt 540 tgcagccagg gctcggcttc ggagtggggt gggttcaccg accacgacgg ggatacgtct 600 gacgatgaag acattcccaa cccccctaac ggtatatcag cgcctactat tgagggccta 660 cagaacgccg tgaacgcctg ggcaaagaca agagggttcg cagttacccg gcaacatggg 720 aagaagaaca agcgaggcga atatgctcgg tattaccttg tttgtgaccg ctttggtcag 780 cctaggttcg gtgaatccgc tggcctacgt gaaaccggca cccgaaagtg cggctgcacc 840 tggaagtcag ttgccaagct tacccaagac ggctgggtgt tccggaattc tgacgagcct 900 caacaccata accatggacc ttcttctcat gcttccgccc atcctcaaca ccggaagatc 960 gccgctgagg ttcttgatac catcgtcaac gcttcaaagc atcacggaat ccgctccagg 1020 gagatcgggg cgctcatcag agacggcttc cccgattcca gctacatcag gaaagacatc 1080 tacaatgcga gggcgaagat ccggaaggaa aagcttggtg gttatactcc tgcaggtgcc 1140 ctgatcaagt cgtttgatga gaacggcatc aaataccgtg tcaagtgggc tgacgaggat 1200 gagactgagc tcctggcctt ggtgtttact ttcaacggct tgatggatat cacaaagcag 1260 tactcagacg tcatgcagat cgatctcacc tacaatacca actgctttgg ttatcctctg 1320 taccaagtgg cagggctcac cggcgccaac accatctaca actccatctt cggcttcatc 1380 gacaatgaga gaaaggaatc gttcgattgg ctttgccggg ggacacatga gctgcgagct 1440 gagttcagcg ttgagccccc catcgtcatc ttgaccgatc attgcaaaga gcttaaggcc 1500 gccctcctgg aggtattccc cgactcgcaa caacagatct gcatctatca cgtcataaag 1560 aacgtcctcc tcaatgccaa gaagaagttc aagagggttg aaagccctga tttccttgac 1620 gaagaggctt 
ttgaaggtga tgaggatgtt ggtgacgatg ggggtagtgc tgaggtagcc 1680 gctaggctga atgccgagga ggcaactgcc ttggcacgga tccgctcctt agatgaccct 1740 ggtgttacca ctgagtctcg cggccccttt ccccacgacc acaatggtgt tgaggctttg 1800 tttaaagtca tggtctattc ggaaacagag gatcaatttt atcaggccta gaaggcgttg 1860 aaggatgagt tttctgatca accaagtatc atggggtata tccagaagca atggatgcct 1920 ttgcggcacc agtgggcgca gtgctacacc cgaacgtacc gaaactacgg cgcaagagtg 1980 acgtcaccaa ccgagtcaag caatctcaac atcaagagct acctcctcga cggtagaagc 2040 gactgcgtcc gccttgttga gggtatcaag gagatggctg acgaacagct tgagcatctc 2100 cagcaagttc ttggccaaca gggaatgagt gtgagaaaga attggctgtc acggcggtac 2160 cttggctctc tacctaacaa ggtgacgtat aaagccttgg agttgatcaa ccgcgagtat 2220 cgctttgcca gggcagcgtt tcctggaaag aaacaaaagg cacgcccttt gccagcctgc 2280 aacgaaaaca actgtacggc tacgcagcaa tacggtatac cttgccgtca cagcatttac 2340 aacatcctgg cgcagtcact cgtcaaacca gatgccaaga tccgactctg ggacgtacac 2400 catcactggc acctcaagaa caggcttgta agtatcccca atatcaatat agtagcatgg 2460 ctaactgtag aaggacgagg gggaccccta cctccggatt cgtgatccca gggttgctac 2520 gaagcggaag ggtagaccga gaaacgagcc tgccgccgtg cccctagata tggcgatcat 2580 gtttgtcagg ggaggtgtgg cttccacccc tgcagctccg acgaactcca gtactccggc 2640 aacccccaga aggctccccc ccaccgccag ggcttcgaat ccattgtccc aggctcagcc 2700 gacaacccag cgacgcaaga ggacccagcc taggccccct aagggtactg cgccgcatct 2760 gaagcccagt atacgtcgta ggctgtcaca ggtggagttg gtgaagtctc cagaggttga 2820 tagagggttc gagaggttca agatcaaagc tgcaaagcgc ccatccttgc tcacggcgcc 2880 gaaggcaacg aaaaggggta gggtttctga ggtggctcct gaggctgcgc ctccgcgaaa 2940 acaagcctca aagcgcaacg cagtgctttc tacgacctca cagtcaagcg aacaggttca 3000 ggcgggctct gcagcgcaaa cccaagccgt tcaggacccg tctcccccca tgacgaggac 3060 tcggagtggc cgagcggtca aaccatcggc taaattacgt ggtgcggctc aggaggtcta 3120 aataggggta gttctacgta gtgtatgtag tctgagcgag tggaattgag ctgttttgcg 3180 aagtctttcg cttttggggc agagtaattc atttgtttta gttgcgacca tataaaccct 3240 aatcctttgg gcggggaggg ggtgcacgcc tagcaatttc gggcggggtg cgcgcgcagc 3300 aattttgggg ggtgcgcgcc 
tagcaagtcc c 3331 // ID REALAA_LTR repbase; DNA; FNG; 218 BP. XX AC AB025309; XX DT 03-JUN-2005 (Rel. 10.05, Created) DT 03-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE Alternaria alternata LTR-retrotransposon REAL complete sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; gag; pol; LTR-retrotransposon; REALAA_LTR. XX OS Alternaria alternata OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Pleosporaceae; mitosporic Pleosporaceae; Alternaria; OC Alternaria alternata group. XX RN [1] RP 1-218 RA Kaneko I., Tanaka A. and Tsuge T.; RT "REAL, an LTR retrotransposon from the plant pathogenic fungus RT Alternaria alternata."; RL Mol Gen Genet 263(4), 625-634 (2000). XX RN [2] RP 1-218 RA Gentles A. and Jurka J.; RT "A. alternata LTR retrotransposon REAL, LTR sequence."; RL Direct Submission to Repbase Update (03-JUN-2005). XX DR Genbank; AB025309; Positions 1 218. XX SQ Sequence 218 BP; 47 A; 65 C; 51 G; 55 T; 0 other; tgtaacgggc tgaacgccct tcagatagct cctcgataga atctcggtct actcgcagaa 60 tccgggtgct acctcgagca cccggattct gtcggcccga ttgttacccg ggcaaccagc 120 gttgtatata gatagccgcg tagctagaag cctagttctc tatcgaatac aacctttgta 180 taccttcgcg tttgctctac ggagcgcagc cccttaca 218 // ID Copia-21_MLP-I repbase; DNA; FNG; 4574 BP. XX AC AECX01002567; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-21_MLP_; KW Copia-21_MLP-LTR; Copia-21_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4574 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). 
XX DR Genome; AECX01002567; Positions 6461 1888. XX CC Positions [1841-2338] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 427..1593 FT /product="Copia-21_MLP-I_1p" FT /translation="MELMLSSIDATNEALIELTDTPKDAYAKLAKAHGSSG FT GVLAAATICDIATVRLEPGQSLSEFITRVRNLQNQLAQYANEDKDIALSPK FT LLAIFLLNGLGKDYEYLTAPFYADIATLTVSKVMDRLVLEVAKQSSSDQRS FT SASAFNARTSHVRPTSAGNHLGRKIGPGPNDLCNLDNHRGLSHTNRNCYQQ FT GRPRNRNSPTTVAANPKSSLSEAEMARRYMSIEAAHLAKKNAPPATAVPSA FT PAYAVSTQEFSAEAFVGDFPDEGFSAQGYVAGNGISSTYGYILADTAATRS FT MVSNQAYFLHMRPISAINITGISEGRQIATHIGSIVLSGFSHLDHTPTDIL FT VPNVLYVPTMKVNLLSLSQLCAAGASFSGSAESITVTGMPGNIFCV" FT CDS join(1880..2494,2498..4564) FT /product="Copia-21_MLP-I_2p" FT /translation="MGPFPSSMGGARYLVTFIDDFSRHATIFPIKNKSDVF FT ECFLKYKARVELLCDACIKYLHCDRGGEYRSNAFLSYLDKHGISLEQGPAD FT TPQQNSIAERYNRSLIERVRCVLAHGSIPIKFWAEIALAVCFTLNHSPHTF FT LKFKSPLSKWNQNLPGAGSHDPDPSFLRTLGCSAIFLSSKNPGKLDLKGRE FT GVLIGYELGAKAYRIDLEDKKVVITRVVLFNEEHLPFSHSPSFTDEPKFLV FT LEDPGYDIDSPPSVTSSSSHAPTVLPISPITSIPTSTRSPSVPLPYSTRQL FT SNTTSHLRPLILSPAASPAKSPKSPIPPDASIISSGSGIVHAPGGKKAVPA FT RQSKRVTGKPNRLGDLQAFHAGIDSSDEPTYKQAMAASDADEWLKAMQAEF FT DSLKAHHVGRLVARPVGARVIGGMWVLKKKRNEKGEVTKYKARWVCFGNHQ FT VEGIDFNATYSAVGKSDTFRLLVAIAAYLKCSVVQFDIITAFLHGLIKEHV FT YIQQVKGFIEPGCEGMVWELVKSLYGTRQGACDFSDHLRSVLKAFGFVTSE FT ADNCLFIYQHGSHFLYLHIHVDDGFLISTSDELIEDFRQHLLKTYEVKWKP FT RPTLHLGMRLTYLNDGGIFLDQSHYLQDMLEKFGMDDLNSAKIPFPVGMKL FT SAGSPDDIQAAAHLPYQSLIGSLNWASVSTRPDIAYAVSQLARYNSCYTFD FT HWNAAKHLVRYIAGSLSRGILFTGAMPVELKGFADADYANDLNDRRSVTGY FT LFTFGDAIISWQSRQQKSTALSTTEAEYMSLSDCARQAIWFKLLFKDLNLP FT VSAVSVSSVGEAVQLFNDNRGTVLLSKEPVVNERSKHINVRYHFIRDHVKL FT NNIVTAHVPTTAMPADFLTKPLAIEAFERCCGQASVCDCSS" XX SQ Sequence 4574 BP; 1129 A; 1048 C; 948 G; 1449 T; 0 other; taggttatga gccgtttctc tcgagtaaat ccacggtcaa ttcaactacg tcatcttcgc 60 tcgagctatc tcgtgattct cctttttatt ttaaacttgg tcgcaatgtc ttctccttcg 120 aatactactg gttctcccaa atcatcttca tcttccagat cagacgcctc ctcaactgct 180 actgtgcgac gtcagactgc 
agatctgaca tctgtctcac ctctttcgtc atctattgat 240 atgtctacga ctctagtcaa cgatgcgatt aaaatccctc aactaactgc tgggatgttc 300 accaactgga acaagcggtt ccattttgcc cttcaaactc gcggtttact cgcattcttg 360 gtttccgaca cacctccgac cgatgctgcc gagttagcta cttccaagcg taatcaaggt 420 tgagtcatgg agcttatgct tagttccatt gatgcgacta acgaggccct cattgaattg 480 accgacactc ctaaggacgc ctacgccaaa ctcgctaaag ctcatggctc tagtggaggt 540 gttttagctg ctgcaacaat ctgcgatatt gcaacagttc gactggagcc tggccagtct 600 ttatcggaat ttatcacccg tgttcgaaat cttcaaaatc aattagctca atatgcaaat 660 gaggataagg acattgctct ctctcccaaa ttattggcta tttttcttct caatggattg 720 ggtaaagatt atgaatatct tactgctcct ttttatgccg atattgccac tctcaccgtt 780 tcgaaagtca tggatcgttt ggtccttgaa gtagccaaac aatcgtcttc tgatcaacgg 840 tcatcagctt cagctttcaa cgcgcgtact tctcatgtcc gacctacttc cgctggaaat 900 catttaggac gcaagatagg tccaggtccg aatgatcttt gcaatctcga caatcatcgg 960 ggtttgtctc atacaaatcg aaattgttat cagcaaggtc gacctcgaaa cagaaattcg 1020 ccgactactg tggctgcaaa cccgaaatcc tccttatctg aagctgaaat ggctagacgc 1080 tatatgtcca ttgaggcggc tcatttagct aagaagaacg ctcctcctgc aacagctgta 1140 ccttccgcgc ctgcttatgc ggtatccact caagagttta gtgctgaggc ctttgtggga 1200 gattttcctg atgagggttt ctctgctcag ggctatgtgg cgggtaatgg tatctcatct 1260 acctatgggt acattcttgc tgatacggca gctactcgta gtatggtttc aaatcaagct 1320 tattttttac atatgcgtcc tatctcggcc atcaacatca cggggatttc agagggacgt 1380 caaattgcta cgcacattgg atctatcgtg ctatcaggat tctctcacct agatcacact 1440 cctaccgaca ttttagtccc taatgtgctt tatgtaccaa ccatgaaggt caatttatta 1500 tcgcttagtc aactttgcgc cgctggagca tctttttctg gttctgctga gagtattacc 1560 gttactggta tgccgggaaa catattttgt gtgtgaaaag actcttgaca atacattatg 1620 gcaaacaaag gttcgctcat ccatttcaga tttcgcatat tctgcgtcag ctgaacgttg 1680 gcattaccgt ttaggccatt taaattactc gtcaatcaag catttggttt caaatggtag 1740 tctcaaggtt tcattagcta gtaattctac ggctccttca tcttgtatta catgtcagaa 1800 ggggaaaatt tctcagcctc cattcttatc tcattttcct gcttctactt caccattaaa 1860 ttgtatacat tcagatgtga tgggtccttt tccttcttct 
atgggcggtg cacgttactt 1920 agttacattt atcgacgatt tttctcgcca tgcaactatt tttcctatta aaaacaaatc 1980 agatgttttc gagtgttttc ttaagtacaa agcgcgtgtt gaattgttat gtgatgcttg 2040 tattaaatat cttcattgtg ataggggcgg tgaatatcgt tccaatgctt tcttatcata 2100 tttggataag catgggatca gtcttgaaca gggaccagcc gacacgcctc aacaaaattc 2160 tatagcagag aggtataata gatccttaat tgagcgggta agatgcgtct tggctcatgg 2220 atcgatacca ataaaatttt gggccgaaat tgctcttgct gtatgtttca ctttaaacca 2280 ttcacctcac acatttctca aatttaaatc tcctttgagt aaatggaatc aaaatttacc 2340 gggtgctgga tcgcatgatc ctgatccatc gttcttaaga acgcttggat gttcagcaat 2400 ttttctatcc agcaagaatc caggaaagct tgatctaaaa ggccgagaag gtgttttaat 2460 tggttacgaa ttgggggcta aggcgtatcg aatataggat ctagaagaca agaaggtcgt 2520 tattactcgg gtggtattat ttaatgaaga acatctacct tttagtcatt cgccttcttt 2580 tacggatgaa ccaaaatttc tggtacttga agaccctggc tatgatatag attctcctcc 2640 ttctgtcact tcgtcttcat ctcatgctcc tacagtttta cctatttcac ctattacttc 2700 aattcctact tccactcgat ctccttctgt tcctttacct tattctactc gtcaactatc 2760 taatactact agtcatcttc gtcctttgat cttatcgcct gctgcttctc ctgccaaatc 2820 acccaagtca cctattcctc ctgatgcctc aattatctcg tccggctcag ggatcgtaca 2880 tgctcctgga gggaagaaag ctgtacctgc acgacaatcc aaacgtgtta caggtaagcc 2940 taatcgatta ggcgatctgc aagcttttca tgcaggaata gactcaagtg atgaacccac 3000 ttataaacag gcgatggctg cttcggacgc tgacgagtgg ctcaaagcga tgcaagctga 3060 attcgattct ctcaaagctc atcatgttgg aagattggtt gctagacctg taggagctcg 3120 tgtgattggt gggatgtggg tgttaaagaa gaaaagaaac gaaaaagggg aagttacaaa 3180 gtacaaggca cgttgggtgt gtttcgggaa tcatcaagta gaaggcatcg atttcaatgc 3240 tacttattca gctgtaggta aatctgatac ctttcgactc ttggtggcta ttgctgcgta 3300 tctgaagtgt tcggtagttc aattcgacat catcactgcg tttttgcacg gtttgattaa 3360 agaacatgtt tatattcaac aagttaaagg ttttattgag ccaggttgtg aagggatggt 3420 gtgggaattg gtgaagtcgt tgtatggaac ccgtcaagga gcttgtgatt ttagtgatca 3480 tctacgaagt gtattaaagg cttttggttt cgttacttcc gaggctgata actgcctttt 3540 catttatcaa cacggaagtc actttcttta tcttcatatt catgtagatg 
atggttttct 3600 tatctctacc tccgacgaac tgattgaaga tttccggcaa catctattaa aaacctatga 3660 agtcaaatgg aagcctcggc ctacgttgca tctgggtatg cgacttacct atctaaacga 3720 tggaggtatc tttttggatc agtctcatta ccttcaagac atgcttgaaa agtttggaat 3780 ggacgatttg aactctgcca agatcccttt tccggttggc atgaagttat ctgccggttc 3840 tcctgatgac atacaagctg ccgctcactt accttaccag tccctcattg gatctctcaa 3900 ttgggcttct gtgagcactc gtcccgacat tgcttatgcc gtgagtcaat tggcccgata 3960 caattcctgt tacactttcg atcattggaa cgccgcgaag catctggttc gatacatcgc 4020 tggatctctt tcgcgtggta ttcttttcac tggcgccatg cctgtcgagt tgaaaggatt 4080 cgctgacgca gattacgcca acgacctcaa tgatcgccgt tctgtgaccg ggtatttgtt 4140 taccttcggt gacgccatca tctcttggca aagtcgccaa cagaagtcaa ctgctttatc 4200 gacgaccgaa gccgaataca tgtcactttc agactgcgct cgacaagcca tttggttcaa 4260 attactcttc aaagacctca atcttccggt ttcggctgtt agtgtttcat ctgtgggaga 4320 ggctgttcag ctgtttaatg ataatcgtgg aactgtgctt ttgtcaaagg aacctgtggt 4380 aaacgagcgc tctaagcata tcaacgttcg ctaccatttc attcgagatc atgtcaagct 4440 gaacaacatt gtcactgctc atgttccaac caccgcaatg cctgcggatt ttttgactaa 4500 acctttggct attgaggctt ttgagcgctg ttgtggtcaa gcctcagttt gtgactgctc 4560 gagttagggg ggaa 4574 // ID Copia-56_MLP-LTR repbase; DNA; FNG; 592 BP. XX AC AECX01000392; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-56_MLP_; KW Copia-56_MLP-I; Copia-56_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-592 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000392; Positions 115971 115380. 
XX SQ Sequence 592 BP; 159 A; 119 C; 81 G; 233 T; 0 other; tgttgagata taccttgatc tctcttgtga tatgtttcag cggtcgtgac aggtaataga 60 aatcacacta agtaatctag gttggtttaa actatgactc ttagtattaa actacaacct 120 taaccttaat gatctttcat atttaattga acaaatccca atacgcgtac aggctctttt 180 ccttacccgt tattctacgg aaggaaaaga gtaagtcaac cccctttata ctatcacaca 240 cctcatttta tctcttcacc tttgtttctc tttctcaata tagtttcact atttccttat 300 tctcttttgt gttatactaa tcaatatgaa tgttttcttt actgttattt catctgaacc 360 cgtcacaacg ctagttagat caaagactga gcactttgat cccttttctc tttttatacc 420 ttgacctgat tcaaccttac aggtcagatt agatttaata ctatgaactc atcttttaat 480 ctttttgtct tatctaactg ttcattttcc tttaagatct tttgttctaa atctgttttt 540 gtataggtta gtaccatcag gtagatattg tgacgagagg tgaaggagct ca 592 // ID Copia-3_TMe-LTR repbase; DNA; FNG; 395 BP. XX AC CABJ01002243; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_TMe_; KW Copia-3_TMe-I; Copia-3_TMe-LTR. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-395 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01002243; Positions 15752 15358. 
XX SQ Sequence 395 BP; 103 A; 87 C; 62 G; 143 T; 0 other; tgttggattc ttggtccccg tcacatgagt tgtttctttc tttccttcaa acacttggta 60 gtctccttgc ttgtttcatt attgcgaagt taggagatca ggatatcttc aagctatttg 120 tttcattcat tcctttgaaa ctctctactc cttactggta gaaaaaccct ttttctttag 180 cgggagaaaa tatttcactt ccttgcttgg gaatgaaaat ttcacccaca cggactttat 240 attggaggaa acagatccta ttaagaagga cctggtcatg tagttagaaa aagtagtgaa 300 ctcttctgct attgcgaata tctcaagaac tcgaatctca ctctatctat caccatatac 360 aattcctcca ctcttatcag tctcttattt tcaca 395 // ID Copia-4_MLP-I repbase; DNA; FNG; 4962 BP. XX AC AECX01002163; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_MLP_; KW Copia-4_MLP-LTR; Copia-4_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4962 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002163; Positions 10122 15083. XX CC Positions [1727-2233] - Integrase core CC 'AAATG' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 125..4624 FT /product="Copia-4_MLP-I_1p" FT /translation="MNPTEATDAKSELISEETKSKVPRSESSQPFKQQNSI FT PFPTIEQKFKMTSIEQNDFSLSSYSSVVKLSTGTFNDWKLKLTTILGGKRL FT SKYILKDLKAPDESDEAALDEFETNSSRALAAIHATIDAENFQVIRTCSSP FT REAFKLLCSHDDDAGGLSTAQLFSDIISLRMSHDDNLNDHINKFRKLHNDM FT LSNLASTPDNMISEPFFAIILIKSLTSDYTPLVQSLLTNFQSLTLARLYSL FT LKIEATRNPTPGADTALVAGRNQSNPRSRPAQKKNSSNQQSTTRCSLGHTG FT HTNDQCRTKRWRAFLEYEKTKAPQPTSSHAAQLTHSAPEADEDVSYWEEAF FT SASTSNTNPIICDTGATSHMFSDLSLLSDLQHIKPTRIGVASQDGAIWAKS FT KGTVRFESLILRDVLYSPELTGNLISIGRLCDDGYRASFDKNLGIITDPTG FT KIVLKMRRNPQTNRLWTPELKIISSLAYFSYTDAQKLADLWHKRLGHLHPD FT AVISFLRRFKSISLSRKDFVSCDACAMGKLIQSPSTHPFHRSPDLLNVVHS FT DLLGPISPSTKSGFKYIITFIDDHSRFTLIYLLKSKNQAFESFKQYQALME FT KKCGRKIHKLKSDRGGEYSSNEFIQYLKQEGIEIERGPAERPTANSVSERF FT NRTVLGKIRSQLIQSGLPLHLWGELAKYAVLQINCSPHAALSHKTPMETFE FT SLLPGHVHPFDIDRLKPFGSLCFAVDRSQKSKVGALARRFIFVGLEEGARA FT ARLWDKQTGRVLVTGDVVYREDVFPGIHPSFSPNVPEELILPEFFDSPPLV FT QSTPTASIPPQPTQATNDRNVSRPSTPVRVDRTATPPPHRSSSPKLSSDHD FT SSNQPSPIYPKVRIPLSSSIHAPRLAVESPNRFEQLRDLYSTTPPQQHVQY FT REHVMLDTPSDPSIHSSAPSVNQSPTPHREQPEGNVLPISVDSSPISSVSS FT VRSPYRTPSPVPLSSQQPVDQSVRQDASDATQDQPRLSPSAPLPDPIPQPA FT PESEPAASQPKQKPAPKAVPTRRSARTKTAPDRYGFPATTNSTSVATTSTF FT GPMIITSKKSIIPDRWGFSATTGTDPDSPTYAQAMAGPDKQAWIHAMEEEY FT KSLMDHSVGKLVDPPPNANILGGMWIFNKKRDEHNRVIRFKARWVVLGNHQ FT IKGLDYDDTYASVGKIDSLRILLALSIAKSKGNRRTRRMKIRQFDVVTAFL FT NSNMKDVVYARQVLGFEHPTQKLRVWLLIKSLYGTKQAARRWQQHFGATTA FT GFEIFPVSSDSAVYVLKCKLGLLILHLHVDDSLVFCDNDELFFKFQTFINS FT KYKLKWTERPTLYLGIKLDIAEDGSYIKISQSHYIESCLERFAMVNCKSAK FT TPLPQKLILAPGSIEEIEEAKDIPYQELVGCLQWISACTRPDISYAVSQLS FT KYNSAWTVSHWMAAKHVLRYLRGTQDLTITYSGGTDEPQAYSDSDFSQCSL FT TRKSVTGYVITVSNGAVCWKSQRQSIVALSTSEAE" XX SQ Sequence 4962 BP; 1379 A; 1237 C; 939 G; 1407 T; 0 other; ggttatgagc ccagcgtttt cttttaacgt ctgcgtctcc tctttccatc acaaactatc 60 agtttcagac agatctaaag taatttaatc acattcccga attgactaca actcgcgttt 120 tcggatgaac cctaccgaag ccacagacgc caaatcagaa cttatctccg aagaaaccaa 180 atcaaaagta 
ccacgatctg agtcatctca acctttcaaa caacaaaact caattccttt 240 tccgacaatc gaacaaaaat tcaaaatgac atcaattgaa caaaacgact tctctctcag 300 ctcctattct tcagttgtca aactttccac tggaacattc aacgattgga aactcaagtt 360 gaccaccatt ctcggtggaa aaaggttatc aaagtatatt ctcaaagatc tcaaagcccc 420 tgatgaatcc gatgaggctg ctttagatga atttgaaacc aactcttctc gagctttagc 480 ggcgattcat gcgactatcg acgcagaaaa ttttcaagtc atacggacat gctcatcacc 540 tcgagaagcc tttaagctat tatgtagtca tgatgatgat gcaggaggcc tttctactgc 600 acaactattt tctgatatca tatccctcag aatgtctcat gatgataatt tgaacgatca 660 catcaataag ttcagaaaat tacataatga tatgcttagt aatttagctt ccacacctga 720 caatatgata tccgaacctt tctttgctat cattcttatt aaatctctta cctctgacta 780 cactccatta gttcagagtc ttttaaccaa ttttcaatct ctaactcttg ctcgattata 840 ttctctctta aaaatcgaag ccacccgaaa tcccacccca ggcgctgaca ctgctcttgt 900 tgccgggcgt aaccaatcaa atccaaggtc tagacctgct caaaagaaga actcatccaa 960 tcaacaatca accacacgtt gttctcttgg ccatacagga cataccaatg atcaatgccg 1020 gacaaagagg tggcgagcgt ttttagaata cgagaagacc aaggcacctc aaccaacttc 1080 gtctcacgct gctcagttaa ctcattctgc tcccgaggcc gatgaagatg tttcctattg 1140 ggaggaggct ttttccgcgt caacttccaa cacgaatcct atcatttgtg acacaggcgc 1200 aactagccac atgttctccg acttgtcact tctatctgat cttcagcaca tcaaaccgac 1260 tcgtataggt gtagcatctc aggacggagc gatctgggct aagagcaagg ggactgttag 1320 gttcgagtcc cttattctta gagatgtctt gtattcaccc gagttgactg gtaatcttat 1380 ctcaatcggt cgtttgtgtg atgatggata tcgtgcttcg ttcgacaaaa atctaggcat 1440 catcacggat cctacgggaa agatagtcct caaaatgcga cgaaaccctc aaacaaatcg 1500 gctatggaca cctgagctta aaatcatttc atccctcgcc tacttctctt atactgatgc 1560 tcagaagctg gctgatttat ggcacaaacg tctgggtcac cttcatcccg atgctgtaat 1620 ctcttttctt cgccgattta aatctatatc tttgtctcgt aaagattttg tttcttgtga 1680 tgcttgtgct atgggaaagc ttattcaatc cccgtctact catccatttc atcgctcccc 1740 cgatttactc aacgttgtac atagtgacct attgggtccc atatctccat ctactaaatc 1800 aggcttcaag tacatcatta cctttattga tgatcattct cgattcaccc tcatctacct 1860 tttgaaatct aaaaatcaag cttttgagtc 
gttcaaacaa tatcaagcct taatggaaaa 1920 gaagtgtggt agaaagattc ataaactgaa gtcggatcgg ggaggtgaat actcttcgaa 1980 tgagttcatt caatacctta agcaggaagg tatagagata gaacgtgggc cagctgagcg 2040 tcccacagca aattccgtct ctgaacggtt taaccgtact gttttgggga agatacgtag 2100 tcagcttatt caatcaggtc tcccgcttca tctctggggg gaactggcca agtatgccgt 2160 gcttcaaatc aattgttctc ctcatgcggc actatcacac aagactccta tggaaacatt 2220 cgagtctctt cttcctggtc atgttcatcc ttttgatatt gaccgtctca agccttttgg 2280 gtccctgtgt tttgcagtgg acaggagcca aaagtctaag gttggtgccc ttgctcgacg 2340 attcatattc gtcggattgg aggagggagc tagagctgct agactctggg acaagcaaac 2400 tggtcgagtg ttggtcacag gggatgtggt atatcgcgag gacgtctttc ctggaattca 2460 cccctctttt tcccccaatg ttccagaaga acttattctg cctgagttct tcgattctcc 2520 accactggtt caatctaccc caacggcatc gattccgcca caaccaactc aggcaactaa 2580 tgatcgtaat gtatctcgac catcaactcc tgtaagggtc gatcgtacgg caacacctcc 2640 acctcatcgg tcatcctccc caaagctatc atcagatcat gattcatcta accagccgtc 2700 tccgatctat cctaaagtta ggattcctct gagctcatcg attcatgctc cacgactggc 2760 agtcgaatca cctaatcgct ttgaacaact aagagatttg tattcgacta caccgcctca 2820 gcaacatgtt caatatcgtg aacatgttat gttggatacc ccatctgatc cttcgataca 2880 ttcatcggct ccttctgtaa accaatctcc tactcctcat cgcgaacaac cagagggaaa 2940 tgtgttgccg atctcagttg attcatcccc tatctcatct gtttcatccg tacgctcacc 3000 ttataggacc ccatctccag ttccactctc gagtcagcag ccggttgacc aatctgttcg 3060 ccaagatgca tcagatgcga ctcaagatca gccacgacta tctccttctg caccgttgcc 3120 tgatccaatc cctcagccgg cacccgaatc cgagcctgcg gcttctcaac ccaaacaaaa 3180 gcctgcaccc aaagctgttc caacacgtcg atcggcacgt actaaaacag cacctgatcg 3240 atatggtttt ccagcgacca ccaattcaac ttctgttgcc actacatcaa cctttggtcc 3300 gatgatcata accagcaaga agtccattat tccagatcga tggggttttt cggcgactac 3360 aggaactgac cctgatagtc cgacttatgc gcaagcaatg gctggtcctg ataaacaagc 3420 ttggattcat gcgatggaag aggagtataa atctttaatg gatcactctg tagggaagtt 3480 agttgatcct ccgccaaatg ctaacatcct tggggggatg tggatcttta ataagaaacg 3540 agatgagcat aatcgggtta ttcgtttcaa agctcgatgg 
gtcgttctag gtaatcatca 3600 aattaaaggt cttgactacg acgacaccta cgcctccgtc gggaagatcg actctcttcg 3660 tattttatta gctctatcca tcgccaaatc aaaaggcaac aggcgtacca ggagaatgaa 3720 gattcgacag tttgatgtgg ttactgcatt tctcaacagc aacatgaaag atgtagtcta 3780 tgctcgtcaa gtcctaggat ttgaacaccc aactcaaaag ctccgtgtat ggttattaat 3840 caagtcacta tacggaacta aacaagcagc tcgccgttgg caacaacatt tcggggcaac 3900 aacggcaggt ttcgaaattt ttcctgtctc atctgattct gctgtatatg ttttaaagtg 3960 caagctcgga ttactcatct tacatcttca tgtcgacgac tccctggtat tttgtgataa 4020 tgatgaattg tttttcaaat ttcagacttt cattaattcc aagtacaaac tcaagtggac 4080 tgaacgaccg acactttacc ttggcatcaa actggacatt gctgaagacg ggtcttacat 4140 caaaatatcc cagtcccact atatcgaatc ttgtcttgag cgttttgcta tggtaaactg 4200 caaatcggcg aaaactccac ttccacagaa gctaatctta gctcctggtt cgattgaaga 4260 gatagaggaa gctaaagata taccttatca agaactcgtc ggatgccttc agtggatatc 4320 agcatgcact cgcccagaca tttcatatgc ggtttcacaa ctctcgaaat acaattctgc 4380 ctggacggtt tctcattgga tggcagcgaa acatgtctta cgctatttac gtggaacaca 4440 agatttaaca attacttact ctggggggac tgacgaacct caggcgtatt ctgactcaga 4500 tttctctcaa tgctcattaa cgcgaaaatc ggtgacgggt tatgtcatta ctgtatctaa 4560 cggggcagtg tgttggaagt ctcaacgaca gtcaattgta gcattatcta cttctgaggc 4620 cgaataaatt gcagccactg aatgttctaa acatatggct tgggtgcgct cattctactt 4680 cgacataatg catcagttag aaggaccgac accgttttac atcgacaata cttcagcaat 4740 ctttactgct acaggagatg ggttgaagtc aaagtccaaa cacatagata ggcgttttca 4800 ttatattcgt gatcttatac aatcaaatca tctcaacatt catcatatcc cgagtgaaga 4860 gatgctggcg gatcatttaa caaagccatt gggtcctcaa gctcttaatc atgctctcaa 4920 acttaatcac cttaattaaa aatgatcttg aaataggggg ga 4962 // ID PYRET_I repbase; DNA; FNG; 6300 BP. XX AC AB062507; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Magnaporthe grisea Ty3/gypsy-type retrotransposon PYRET_I, DE internal region. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; PYRET_I; KW Ty3/GYPSY superfamily; gag-pol pseudogene; internal region; KW internal portion. XX OS Magnaporthe grisea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-6300 RA Nakayashiki H., Matsuo H., Chuma I., Ikeda K., Betsuyaku S., RA Kusaba M., Tosa Y. and Mayama S.; RT "Pyret, a Ty3/Gypsy retrotransposon in Magnaporthe grisea RT contains an extra domain between the nucleocapsid and protease RT domains."; RL Nucleic Acids Res 29(20), 4106-4113 (2001). XX DR Genbank; AB062507; Positions 476 6775. XX SQ Sequence 6300 BP; 2386 A; 1622 C; 1101 G; 1191 T; 0 other; ttttgatcgc tcctgttcag tactttgctt taaaatatac cgagcaatat ccgacaacat 60 gtcttcccag acgtccaaaa acaacacccc caacaccccg gctacgaaca gcaaagcggc 120 ccagaaggcc cctgttcggc ccaaggatga taacgataat aaggacgagg aagaccaatt 180 ccgtaggcag gttgctcaga ccaaccagaa gctccacgaa caaacgttcc gcgcaaccca 240 actccaaaag gaattacagg ctttccgaat tagtgctaac gttacctcaa ccccttttaa 300 tgggcgcgaa aaactcaaac ttaacccccc aaccacattc gatagcacac ctggacaatt 360 taaaaggtac cttatatagg tacgcattta ccaaacattc catttggaaa ttttccgcag 420 taaaacaaaa aaggtggtac acgcggccat tttcctccga ggacgaacct tgtcttggtt 480 cgaacctctg ttacaaaaat ggcttaatat accccccgaa aaacgacgcc aaaaagtcac 540 cgacattttt tcaaccttcg cagggtataa acgcaccttc cgttccttgt tccaggaacc 600 cgataaaaaa cgacaagccg aacgaaattt ggccaaccta cgacaaacca agtcagcctc 660 cgcttataca accgaattta aaaggctagc aacaaggctg gatataactg aaaaaaccaa 720 aatcctccaa ttctaccagg gactgaaatt aagaataaaa gacgaggtat ccaagctcga 780 ccgacccgag gatttcctcg aatacgtcga acttgccatc aaacttgaca accggattta 840 cgaacgccga caggaaaaac aaggcggaca gcgaatattc gtcaccttaa ttcgacaaac 900 caatacggga cgcaagtacc aacaccccca acccacatac cacagaaaca atggtggatg 960 gcagcagccc cgtcatatgg caccccaaac ctacaataca gcttatagta cccacaatgg 1020 acccatggac ctcagtgcca cacaacacaa aaagccccgt aaagacccca aaagcggcaa 1080 
gtgtttcaaa tgcggcaaaa cagggtatat cgccaaaaac tgtccgagaa accaggaaaa 1140 tgtccctaat tttaggggca agaccgaaaa accacgagga tttaatgcca ccaattacca 1200 gcaattacgg cctaaccaac acagcacgat aaactggacc acctgctaca atgacaattg 1260 tatggtacat aacagttcca aaaacgacgc aggatggtac ccacaaaagc cccgaggcat 1320 tcgaacccta acagtaaaaa accgagtacc cctagggaca ggataaccaa cgaaattata 1380 ccgacaaaag acccaattac ccgaatttac ggactcggat gcctccggaa aatcggatga 1440 ggaacctttg ggacaaaccc accatattag aacaaccaat caatccttaa tacaaaaatt 1500 aagcgaggat tccgagggaa agtccagcga ggaagagacc atgccctcga gccagtataa 1560 cgtaacaaag gaaaacaacc cctacatata cgaaaaaaag catatcgatt tataccgaga 1620 cgaccaatca gtacaatttc ccggattgcc atttcaggca caaatccgac atcctacagg 1680 accagttaat gcaatttttg gagaccaccc gaccttggac accaaacacc cttaatacgc 1740 agaaatcttt tgggcaaaat gtttccgaaa caattgtgga ctccatttaa gcaggaagat 1800 agtccacaat tttttcccac gacgacgcaa agcaacaggc atttacgaaa tatataccac 1860 aaccgaccta ccaaattggg aaattaagat aaaactcaat attgaaccag ccgcaatttt 1920 cacgcctaac gacaattatc ccatggcgtg ttgcaaccaa aaacaattcc catggtataa 1980 atgcacccga agcgtatgca aagtacatat gttagccaaa gcacaggcct ggcacgaaca 2040 gaaatcacgg caaccaggac ctgcaccaat gaccagaatt atgtggaacc ccaacgtacg 2100 agataacgag ataaccacga acgcacgcaa gacacccgtg ccagcgaagc acccacttga 2160 cctagtcgaa acaagaaacg gcacagggca ccgaacggcg acaacaggaa cgaaacaaag 2220 aaccctgaac gctacacaaa acaccatagc agcatatgaa cacatatttc tcgacatacg 2280 actcgatggc aaacccatac gagcactcct cgacaatgga gcacaaggca actatatctc 2340 accaagggtc gtagcgaaac gacggatacc ttggcagcaa aagaaggaac cataccagtt 2400 gcaaaccgtg gaaaaaaagg cagttttata cgggaacggc accatcaaaa tagagaccgt 2460 acacctttgg atggaaagct atggacgaaa ggaacaaatc accttaaaca ttacggaaat 2520 aggggacaaa aacgtgatat taggaatccc ctggcttaaa aagagtaacc cccggataga 2580 ttggacaacc ggccaaatcc aatgggaaga accgttagcc tttaaaggaa aatccgagaa 2640 acggacttca cgcaacgaac gtagggcata ggaacgaaat caacaaaaaa ttatggcgct 2700 attgaggaaa agcgagccac gcctggaacc aatatcggaa gaatcgcgac tatccatgag 2760 cgaaaaacga 
tcgaacttta ctacattgat cgacaatatc ccggccgaat accggatata 2820 taaacgatta ttcagccccg aattcgaaac aggattaccc gagcatagtc cgttcgacca 2880 cgaaatacca ttgaaaaaag gaacacagcc caaattccac aaaatatacg gcttaaaccc 2940 gacacaaatg gaggcactaa acgaatacct cgcaaaaaac ttcaaaaaag attatatcag 3000 accatcgata tcaccagcag gatacccaat cctcttcgta ccaaaaaaga acggaaaatt 3060 acgactatgc gtggattaca gacaattaaa cgacattacc ataaaaaatt gctacccact 3120 cccactaata agagaattcc gaaacatgct ttaccaaaca caatggttta cgacattaaa 3180 cctcaaagga gcatacaatt tgatccgcat aaaggaaggc gaaaaatgga aaaccgcgtt 3240 tcgcaccaaa cgaggatatt atgagtacct aataatgccc ttcggtctta ccaacgcccc 3300 agcaaccttc caaaccataa tcaaccacgt tctcagggaa tgcctaaaca ttttcgtcgt 3360 tatttacctg gacgacatcc ttgtcttttc caagacgttg gaaaaacata agcaacacgt 3420 tcacacaatc ttacaaaagc tgcaaaacgc taaactctta gttgagcccg aaaaatatct 3480 ctttcacagc aaacaaatca atttcctagg atacattatc gctcctggag aaattaggat 3540 ggagaaatcg aaaatccaag cggtaaagga atggcctcaa ccacaaaact ggacggaaca 3600 gacacaacag gcctttgaac agctacgaga cgcaataacc agagaaccaa tcctcaaaat 3660 accagaccca acgaaacctt tcgaagtcga aaccgacgca tcggattata cgatcggagg 3720 acaacttaat cagcgagacg aaaaaggacg gttgcatcct tgcgccttct tttcacaaaa 3780 attacatgga ccagaattca attaccaaat ttacgacaaa aaatttatag ccatcattcg 3840 aacatttgaa aaatggaaac cacaattatc cggaactaaa cacgaaatgt taatttacac 3900 ggaccacaaa aacctgaccc atttcaccat tagcaaaata ttgaacaaac gacaaattaa 3960 atggtcagaa ttcctgtcaa aattccattt taggattatc taccgaaaaa gaacagaaaa 4020 cggcagggcc gacgccctta accgaagacc agatcacgaa aacatagtgc caaaagaaat 4080 acgggttatc ctcaccacag acggaaacgg aaacctttta ccaacatacc ggagccttat 4140 aacaacaaat acggtaacca caccaaaaaa gatacggaag atccacggaa ataaagccca 4200 cggacaccaa ggaatttcca aaacatggaa acggctaaaa cagcattaca attttaaaag 4260 aacacgacaa aaaatacgaa aaaccatcaa agattgcgaa ctttgtacca aaagcaagtc 4320 cgcaaaacac aaattttacg gactcttaca acccttacca gcccccagca aagtatggca 4380 gaccatcaca atggacttta tcgtcaaatt acctctttcg gaaaaaccat tcactaaaac 4440 caaatacgac agcatattgg 
ttataatgga caaattcacc aaatacgcct actttctacc 4500 atacaaaaaa agcagcaacg ccgaagaaat cgcttattat acattcctac aaataattgt 4560 cagcaattac ggacttccaa aaaacatcat cacaaataga gacaaacttt tcatatcaag 4620 attttggaaa tccctgatgg aacaattagg aacaaaccac aaattatcca cagcattcca 4680 cccccagaca gacggataaa caaagcgaac caaccaaaca ctggaacaat acctaaaata 4740 ttatgtgaac cacaagtaag acaattgggt acggttatta cccacagcac aattcgttta 4800 caacagttta aaaaacgaaa ataccaagac aaccccgttt tacgccaatt atggatttaa 4860 ccctacagcg tacggagaac caagaaccac gattacggca ccgcgagccg ataaacaagc 4920 gagcgaatta cggcaattat acaaaaaact ccagcaggaa ttaaagttcg tacgaaaaag 4980 aataatgaaa tacgccaatt agcatcgaat aaagggacca tccttcaaaa agggaaatag 5040 cgtctacctt attcgacgca atatcaaaac acaacgacct aataacaaat tcgattttaa 5100 aaagcttgga ccttttaaaa ttagcaaaaa aatcagcgat accaactaca gactgtcatt 5160 accagacacc atgaaaattc accccacatt tcacgtatta ttactggaac cagcaccata 5220 cggcactagc atacaaaaaa cgatcgagat cgacgcagaa cagacctata acgttaaaca 5280 aatattcgac catagacgaa accacggcaa aataaagtac tttattaaat gggaaaacta 5340 tgggcatgaa aaaaacattt gggaacccct taaccacctc caggattgcc aggaaccgct 5400 ccgccaatac tatcaggagt tggatctgcg agcaaatcaa ccaaaaaaaa gaggacgacc 5460 taaaaaaaca ccacctacca ttcccaaaaa ttaaaggccg accaggattc gaaaccatac 5520 caccagacga acttacaaaa gaacgatcaa tatgcaatat aaaataaaaa caccagaaaa 5580 acgacgtacc aacggaatgt tgcgcaaaaa cattcatctc atccaacaac gaaggattca 5640 gaggatccat cgggaaaatc gaatcccaga caaaattgga atccggaaaa agcgactcca 5700 aggacatttc aggcaggacc tcggctgatc gtcgacgttc ctcctgttcc gcctcctcac 5760 gttccacccg atccaattcc tccaaattat caattccgcg agacacagct cgcattattt 5820 tttcaaacca catcttcttt tatttgcgaa aacgctcgac cttcgcatcg gccaaacgaa 5880 ttttttcgtc gcgcccggcg acgagcctcc tccgcagcct ttaattcctc ctccaatttg 5940 gtatactgtt ggccaatcgt gcgaaactga acaggcgaaa ttccaaggac atcgcacccc 6000 gaacggttcg accgcaaaca ctcgacatac cgagaggaat tttgaggaga aaccgcacat 6060 ttgggtattc ccctcgattt acaagcaaaa caaacaacca tctccaaccg tcgacaaaaa 6120 caaaaaaaaa agaagggcaa aacgacgctc 
tgtcgacgag cgaccaaatt tcggtttggc 6180 tacacgatta aatatggcgc aaaaaagtta gccaaaaagg aaggaaaccc aaaacaggga 6240 tcaagggaat acggactatt taaaggccaa aagaatttgg ccaaaggaaa ggaaacctga 6300 // ID Gypsy-30_MLP-I repbase; DNA; FNG; 6822 BP. XX AC AECX01000185; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-30_MLP_; KW Gypsy-30_MLP-LTR; Gypsy-30_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6822 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000185; Positions 26317 19496. XX CC Positions [5637-6149] - Integrase core CC 'ATATT' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(576..2819,2823..6239) FT /product="Gypsy-30_MLP-I_1p" FT /translation="MAYDVDIKRRNMSNIDINAPAAASVANAPSTTAKKGG FT RKASKAVEVVGSSATEDTEMNEAAAVLQPPTAEGREFEDQTDFVSFDNGDP FT AANTANGSMEVSAPVSSKQGKGLFDLPPTMTTGLRSGRVANPSRTEVRTQG FT SKGRPNRGPEGDNNSVQTTSVAVEGTRGRSGGSVGESQSSITQDSLSRDQP FT PHQANKTSPESAVQAKGKQRQQPVGRESTICGGQLPPQTQSASNGGTTAHT FT GESAVSSQERIDGINAGTASGTNAGGRTGGLSSSGRNDVSEKSVKSGSHDV FT PSTPLSSLVVPVVTVSRPLPGIVPKNLVSNPITSDSPLCVNIGLRSASASS FT NISSLSLPPNNISLTKLVELETLLRVNFSTINGQLKAQVDSIESFRNEMMA FT VRDLIDDVTETNNMLDSKLAAFEDILEQYGRDQQAIAKLMHLTSSASEELT FT RDTANMCEAMEQLTLDRETERQSICELKIDFTRQFQEMKDHLDAVARNVSR FT LQAEKNKVSSENVLYESSHQKIQGEAKTAHSKINNVVYEDGEFEKNSDIIE FT TSAPYTNPRNYPKIKSYPKFSGKPEEDWVDFIDEIDTFKDSYGMPDAEIVS FT KLPSILKGIAYTWFRAVIKQNRGKGWEHWKGLLAKKFGGSVWRQRQLTLLH FT QMKFSYNNEDITEFLTTMHRKIESVYPSASSEDIKEHILMKLPAEVRDTIS FT ISTKDMEDISEYLTTCERILTNQKDVNKPSMNSSRRVWRNENPTNNTVKDD FT IIGEKKKVTIPLQGDSKERIRNKTCHRCKGPWIPGHTCNKVLNIDVEDDES FT LTYSGDEQDLAENDDHQEENNDVMILETEVLHGNFLTDGLGQLRDVQEAEY FT VRTPNEVKSVCPARCTAVVNQKEVMIVLDSGAGGSVVSSSYLQSVDPEWKN FT HLVNEDTGRWRGYGSVLKPLGTYVASVVFGHERGNIRAMMKFVVMNNEGLP FT QYFIVGNNNLLLYGIRLHLCDKYFTLGNNLKRKFALTYDKAPPTQHITVVA FT GKQTGENQPSPTVGIPTPEPEQFEKAFSEAVWDADLPDEDKKLLSKVIRDF FT PMVFAHGKRQLGEVSVEEFDINLNIDGDKMPTNLKKKAYPCSPQKRKDIEE FT NIQELLDLGVLEEIDRTPRDCVISPVIIQYQNGKKRMCGDFRSLNDFTVSD FT IYGMPRVDAILHGLKGATRISLLDGYKGYHQFRCTERASNYLIIITHCGMF FT RYLRMPFGPKNGPSVFQRTMDRTFSKEIREGWMTIYIDDIIVHSKNTADHI FT EHLRRIFTKLEQINLTLAFKKCHFAFQSTKVLGHIVSGLLMSVDDNKVKAI FT SHIQPPTTVKEVQSFLGMCGYYRQYLKNYQLVALSLTRLIRRNEAFEWTNE FT RQQAFDKLKQMLQEAPSLSLPDFDKPFIVYTDASFIGLGAALHQKQVVDGK FT EIEVPICFISRSLRNGELRYGTTQLECLAVVWALEKLHYYLDGSTFEVVTD FT CSAVKSLLGMKTPNRHMFRWQIAIQEYRGRMTISHRAGDKHQNADCLSRNP FT MPNNSDNPAGVSQKDDTEIFGLHVVDLEHDFYASVAEGYSLCPNMRKIIMV FT LSKPDGTSNSEIISSLDEPWKKLFNQGCFIFEDDLLYYRKQGSHRLVIHDS FT MKEQILKLCHDDILAGHFSLDKSLHCVKNTAWWLNYNQDVADYISSCDTCQ FT KGNRKTGKTFGLLQEIQKPTKPWEIINMDFVTGLPPAGDLSYNSVLVIVCR FT 
LSKKAKFVPCHKDIDAKGLAHLWWKYALNECGLPTAIISDRDPKFTSEFWT FT SLMRIAGCQLKLSTAHHPQTDGLAERTIQTMEDLIRRYCAFGLLYKDSEGC FT THDWISLLPGLEFAYNSSVHSSTGRTPFELERGYIPQSPRMLTNKKLGKLN FT IIQQLGIFSHARIS" XX SQ Sequence 6822 BP; 2196 A; 1310 C; 1618 G; 1698 T; 0 other; attgggggtc tggccgaagt tactagatct gttgtccttt ataagtcttg cattatattt 60 gcattataga ggacaataca ccaactaaag attgttttat ttacatcact tcgtttgact 120 ctgttgaaaa gccaacatca aaactgatca ctaataaaac aatcgttctt ttgcgactgc 180 ttagttttca aaccaaccaa gcagcatcca ctactatcaa caatccagtc cgaaaagcgt 240 cctagcatct ttgaattcat tcacagaata agtgagtgaa taatataaaa agaggtcggc 300 ctgataattt actgacaaat tttgatagta gtattctaga tttaacctaa acctggttta 360 atctccatca tcgtacctgt tcagtatccg ataaagtttt tgaaaagtct cggttcgtct 420 gaaaattttt cgtttgtccg aatcaaaacg aagaaaaggc tgactttttg ggtccttgtg 480 gaaactttta ttttcgtttc gctctatttt tattcatctt ttcgtttgaa aaaaaagatt 540 cattggtgtt atattcaaat actgaaacaa ccgtgatggc ttacgacgtc gatatcaaaa 600 gacgtaatat gtctaacatc gatatcaatg ctccggccgc tgcctcagtt gctaatgccc 660 catctaccac ggctaaaaag ggtggtagaa aggccagcaa ggcagtggag gtcgtaggta 720 gttcagctac cgaagatacc gaaatgaatg aggcggcggc ggtattgcag cctccaactg 780 ctgaaggcag agaatttgag gatcaaacgg acttcgtgtc atttgataat ggcgatccag 840 ctgctaatac tgcgaatggc agtatggaag taagtgcacc tgtgtcatca aaacaaggca 900 aaggattatt tgacttacct ccgacgatga caacagggct taggtccgga cgggttgcaa 960 acccttcgag aactgaagtt cgtacgcagg gatccaaggg acgcccaaat cgcggtcctg 1020 aaggagacaa taacagcgta caaacgacaa gtgtcgcagt tgaagggaca cgtggcagat 1080 ctggagggtc ggttggagaa tctcaaagct ccatcaccca agatagctta tctcgggacc 1140 aaccaccaca tcaggccaac aaaacctcgc ccgaatcagc cgttcaagcg aaggggaagc 1200 aacgacaaca gccagttggg agggaatcca ccatttgtgg tggacaatta cccccacaaa 1260 cgcaaagcgc ctccaacgga ggtaccacag cacatacagg cgaatccgcg gtatcctctc 1320 aagaaaggat cgacgggatc aatgccggta cagcaagtgg taccaatgca ggtggtagaa 1380 caggaggact ttcaagttcc ggaaggaatg atgttagtga gaagagtgtg aagagtggat 1440 ctcatgatgt accttcgacg cccttatcat cgcttgttgt gcctgttgtt accgtttctc 
1500 gaccgttacc tggtattgtt ccaaagaatc ttgtatccaa cccaattact agtgatagcc 1560 cactctgtgt gaatattggt ttgagaagcg cttctgctag tagtaatata tcgtcactgt 1620 cattaccgcc gaataacatt tcactaacaa agttagtaga attagaaacg ttattaagag 1680 tgaatttcag taccatcaat ggtcaactaa aagctcaagt tgacagcatt gagtcattta 1740 gaaatgagat gatggcagta agagacttaa tcgacgatgt aacagagaca aacaacatgc 1800 tcgatagcaa gctggcagca tttgaggaca tattggaaca atatggacga gaccagcaag 1860 ctattgcaaa actgatgcat cttacaagct cagcgtctga ggaattgacc cgcgacacag 1920 caaatatgtg cgaggcaatg gagcagttga ccttggaccg tgagactgag agacagagta 1980 tttgtgaatt aaaaattgat ttcacgagac agtttcaaga gatgaaggat catctcgatg 2040 cggtagctag gaatgtttct agattacaag ctgagaagaa caaagtaagt tcagagaatg 2100 ttctgtatga aagtagtcat caaaagatac agggagaggc aaagacagcc cactcaaaaa 2160 taaacaacgt tgtctatgaa gatggtgaat ttgagaaaaa ttcggacatc atagagacaa 2220 gtgctccata taccaatcca aggaattatc ctaagatcaa gagctaccca aagttcagtg 2280 ggaaacctga agaggattgg gtcgatttca ttgacgagat cgataccttc aaagactcat 2340 atggaatgcc ggatgcggag atagtttcca aactaccatc aatcctgaaa ggcatagctt 2400 acacgtggtt tcgcgccgtg attaagcaaa atagaggaaa aggatgggag cactggaaag 2460 gattgcttgc aaagaaattt ggtggatctg tgtggcgaca acgtcagtta acactgttgc 2520 accaaatgaa attttcgtat aacaacgaag atatcactga atttctgacg acgatgcata 2580 ggaaaattga atccgtatat ccatcggcaa gtagtgagga tataaaagaa catatcttaa 2640 tgaaactacc ggcggaggta cgggatacaa taagcatcag taccaaggat atggaagaca 2700 tatctgaata cctgactact tgcgagagaa tactgacaaa ccaaaaagat gtcaataaac 2760 caagcatgaa tagcagtaga agggtatgga gaaacgagaa tccaaccaat aataccgttt 2820 gaaaagatga tattattggt gagaaaaaga aggttacgat tccgttacaa ggggattcaa 2880 aagagagaat tcgaaataag acttgccaca gatgtaaagg cccttggatt cccgggcata 2940 catgcaataa agttctgaac attgatgtgg aagacgacga atcgttgaca tattcggggg 3000 atgaacagga cttagcagaa aatgatgatc atcaagagga aaacaatgat gtaatgatcc 3060 ttgaaacaga ggttttacat gggaacttcc taactgatgg tcttggtcaa ctacgagatg 3120 tacaagaagc tgaatacgtc cggacgccca acgaggtgaa aagcgtttgc ccagcgagat 3180 
gtacggctgt tgtaaaccag aaggaggtca tgattgtcct cgattcggga gcaggaggta 3240 gcgtagtgtc aagcagttat ttacaaagtg ttgatccaga atggaaaaac catttggtaa 3300 atgaagacac cggacgatgg agaggatatg gctctgtgct gaaaccactt gggacttacg 3360 ttgcaagtgt ggtgtttggg cacgaacgag gaaatatccg ggccatgatg aagttcgtag 3420 tcatgaacaa cgaaggcctg ccacagtact ttatagttgg gaacaataac ttattattgt 3480 acggcataag actacaccta tgtgataaat attttacttt gggaaataac ctaaaaagga 3540 aatttgcttt aacatacgat aaagcaccac caacacaaca cataactgtt gtcgcgggaa 3600 agcagactgg agaaaaccaa ccgtcaccga cagtgggtat tccaacgccg gagccagaac 3660 agttcgaaaa agctttctca gaagcggtgt gggatgctga tttacctgac gaagacaaaa 3720 agttgctgtc taaagtcatc agagacttcc caatggtttt tgctcatggc aaaagacaac 3780 tgggggaagt gtcggtggaa gagtttgata tcaatcttaa tattgatggg gacaaaatgc 3840 ctacgaatct gaaaaagaag gcgtacccat gtagtcccca gaaacgaaag gatatcgaag 3900 agaacatcca agaactgtta gatttaggtg ttcttgagga aattgatcga acacctcgag 3960 actgcgtgat ttctccggtt attattcaat accagaatgg caaaaagaga atgtgcggag 4020 atttccggtc attaaacgac tttaccgtct cagatatcta tggtatgcct cgagtggatg 4080 ctatactgca tgggcttaag ggcgctacca ggatatcgct attggatgga tacaaagggt 4140 atcaccaatt tcggtgcacg gagagagcaa gtaactattt gatcattatc acgcactgtg 4200 gaatgttcag atatttacga atgccatttg gtcctaaaaa tggtccctcg gtgtttcaac 4260 gaacaatgga cagaacattc agtaaagaga ttcgcgaagg atggatgaca atatacatag 4320 atgatatcat tgtgcattcg aaaaataccg cagaccatat tgagcatctt cggcgaatct 4380 ttacgaagtt ggaacaaata aacttgacgt tggcattcaa gaaatgtcac tttgcgtttc 4440 agtcgacaaa agtacttggt catattgtat ccggtttact gatgtcagta gatgacaata 4500 aagtcaaggc aatcagtcac attcagccac cgactactgt aaaagaagta cagagttttt 4560 tgggaatgtg cggatattat agacaatatc tgaaaaacta ccaactagtt gcgctctcat 4620 tgactcgact gatcagacga aatgaagcgt ttgaatggac aaatgagagg caacaggcct 4680 ttgataaact gaagcaaatg ctacaagaag caccatccct atcgttaccg gatttcgata 4740 aacctttcat agtttacacg gatgccagtt ttatagggct gggagcggcg ctacaccaga 4800 aacaggtggt ggacggaaag gaaattgaag ttcccatctg ctttatatca cggtcactcc 4860 gaaatggaga 
gttgcgatat ggaacaaccc aactagaatg tctagcagtg gtttgggcgc 4920 tggaaaaatt acattactac ttggatggca gtacattcga agtagttact gattgctccg 4980 cggtaaaaag tctacttgga atgaagacac caaatagaca tatgtttcgt tggcaaatag 5040 ctatccaaga atataggggt cggatgacta ttagccatcg ggctggcgat aaacatcaaa 5100 acgctgattg tctatcgaga aatccaatgc cgaataattc cgataatccg gctggtgtgt 5160 cacagaaaga cgacacggaa atatttggcc tacatgtggt ggatttggag cacgatttct 5220 acgccagcgt agcagaggga tattcgctgt gtccaaatat gaggaaaatc atcatggttt 5280 tgagcaaacc tgacggaacc tcaaatagtg agatcattag ctcattagac gaaccttgga 5340 aaaaattatt taaccaaggg tgttttatat ttgaagatga cttactatat tatcgcaagc 5400 agggatctca tcgcttggtg atccacgata gcatgaagga acagatttta aaattatgcc 5460 atgacgatat actggcggga cattttagtc ttgacaaatc attacattgc gtaaagaata 5520 cggcatggtg gctgaattat aatcaagatg tagcagatta tattagctcc tgtgatacgt 5580 gtcagaaggg aaacagaaaa acggggaaaa cgtttggttt gttacaagaa atacaaaaac 5640 cgacaaagcc atgggaaata attaacatgg actttgtcac cggactacca ccagcagggg 5700 atctttctta taattccgtg ctggtgatcg tctgtcgatt atcaaaaaag gcgaagtttg 5760 tgccttgcca taaagatatc gatgccaagg ggttggcgca tctgtggtgg aaatatgcgc 5820 tcaacgagtg tggattaccc accgccataa tcagtgacag ggatccgaag tttacctcgg 5880 aattctggac gtcattgatg agaattgcgg gatgccagtt gaaactgtca acggcacatc 5940 atcctcaaac ggacggcctt gccgagagga cgatccaaac aatggaggac ttgatacgca 6000 ggtattgtgc atttggcctc ctgtataaag acagtgaggg ctgtacgcat gattggatct 6060 ccttactacc ggggctagaa tttgcctaca acagtagcgt tcattctagc actggaagga 6120 ctccttttga gctggagcga ggatacatcc ctcagagtcc caggatgcta acaaacaaga 6180 aattggggaa actcaatatc atccagcagc tgggaatttt ctcacatgca agaattagct 6240 agaatacatg cggcggattg catatcaaaa gcctttgctt atgagaagca aagatgggat 6300 aagacccata cagctccccc tttcaaagcg ggggatcagg tgctgttgtc aacggtgcat 6360 ttcaacaact tgaatagtaa ttcaaagtta aaagatcctt ttatcggacc gtttacggtg 6420 gtcaagatgg ttggcaacaa tgcagcggag ttagatctgc agggagccta ttcgaggcgg 6480 catcctgttt ttcctgtgtc tttgatgaag ctatatctat catcagattc ggacaggttc 6540 ccgaagcgga ctgtcaataa 
gaaagcggca cctgaattaa cagaagaaga aggggtgata 6600 catcgagtat tacaacaaag agtggtgacg aaaggaaaca agaaagtacg acaatttctt 6660 gtctccttca agaatcagtc cccggaccta tcacgatggg tccatgagga cgaaataccg 6720 aatggtacgg tgctcttacg aaaatttcgg aaagaggcac gtgaagagaa aaatatgaaa 6780 aaatgacaga acattttttt ttgttttttt tgtgtgaggg ag 6822 // ID Copia-2_CCO-I repbase; DNA; FNG; 4324 BP. XX AC AACS02000002; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_CCO_; KW Copia-2_CCO-LTR; Copia-2_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-4324 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000002; Positions 2677207 2681530. XX CC Positions [1597-2094] - Integrase core CC 'AATAG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1177..3567 FT /product="Copia-2_CCO-I_1p" FT /translation="MRHQLSNVLYVPQAPNCLLSATRLDAAGGTVEIGGGK FT CTMRNKDGRIIGFGVKENGLYKLKARAEESRKGKAHLASAPKLTWDQWHRR FT YGHISVTALERLNREQLVHGLDIDGASIPSRSCEACIEAKQTRKSFPKEAE FT HRSKIPGERFMGDIWGPARTISAGKFKYYIAFTDDATRYTTTLFIREKSEA FT FDRIKGHIAWLKKMGRSPRFVRFDNAKDLINSKLEALCREEGIEIEPTAPY FT SSSQNGVAERFNRTVLELARAMLIDSELPAFLWDEAVAHATYLRNRSPTRA FT LNYMTPYEAWHGKKPGVAHLRPFGCNVWVLDESENRSKLAPKSKQMKLVGF FT NDGSKLIRYYDPQKRNVKVSRNFAFNENDELKELEVVEIPGMRTEGESTSD FT SPVSQTQEQPKTEPSSPILPLIETTPSPSISGSANTTDDEDKTIRRPLRAR FT RNETDYKKLGNPDARTPGLRFTPTPMRSVSPSPTISRPTESSKAKAKSKET FT ANIALAAIGEPSERETAFISQFSEPDLPQNVEEALQGPESGRWKEAMDEEL FT KTLKEMGTWELGELPEGRTTVGSKWVFTKKRNEKGEVTKHKARLVAQGFSQ FT KPGQDYQYDGTFAPVMRFETLRTVLGTSAIRNLKLRQFDVKGAYLHGRLKE FT SIFMRQPPGFDDGTGRVCVLKRSLYGLKQAGNVWNHETQLQARGFRLQTTH FT RAITAAIFDTRTTTSLFSSYGSTISSQLRLQTPKRPDRVRTRTALRNQVTR FT TTLTYRGSPNSAGRPLHLAFTIPLHRHTLKALWARKCEPSINTHRS" FT CDS 3335..4324 FT /product="Copia-2_CCO-I_2p" FT /translation="MGRRYHRSFDYRRLNDQIESELAQHFEIKSLGQPSLI FT VGLQIQQDDHYISLSQSHYIDTLLKRFGLENANPVSTPIDPNVKLDAFDSK FT DGNSLGEEVDPRISSGYATMIGSLMYLALGTRPEICYAVNRLAQYTSRPRA FT VHFTALKRIFRYLKGTKHFALIFGGLDDEEMTEDLHIYTDADWANGPDRKS FT ISGYVITLAGGAIAWSSKKQNTVALSTAEAEYVATTHAAKQVLWHRHLLEE FT LEFFPPETSTIFSDNQAAIAISRHPEFHPRTKHIDLAYHFLRDLVANNTLE FT IVYVRTDLNLADLFTKGLPRSRHQDLTYEIGVISRKGE" XX SQ Sequence 4324 BP; 1244 A; 1122 C; 1077 G; 881 T; 0 other; ggttatgagc cccgcctact gggcttaaac catcactctt acactcgttt aaactcgctc 60 gactcgcact ccaactaccc acatttaact aatccatcga caccgacacc atgccagaag 120 aatacacggg cggctatcga atggagcttc tgaaggccag caattggtta ccgtggaagc 180 aacggatgct tgcggtattg cgtgaccttg gactggagaa gtacgtcttg caagacgctc 240 cgggatcaga gaaggtagca gctgggaaat cggacgaact cacgacggag gagcgagtgt 300 cgagaactgc atgggagcaa ggcgatgcca aggctaggac aaggatcgag ctcgctgtgg 360 gagacagcga aatgattcat atcagtggag caacatcggc acgccaaatg tggaaacaac 420 tcacccaagt caaggaatcc aaagggaaga ttggaattct ggccttacga cgacaactat 480 
ttcgagcgat ggcggaagaa ggttttgata tggtggagca cgtctcccat cttcgacagc 540 tacaggctca actccacaac ctcggaaacc tcatctcgga cgacgatttc ctcatgattc 600 tgatcacatc actgccggaa tcctgggatt cttacacatc tgcctgattt gggcgctaac 660 ggaaactcgc caaccatttc ctctttcgag ctcgttgcta tcttgattga agaagaacga 720 agaaggaaag gaaggaatga tggtgtgggt acgactctcc aggctgggag aggtcgtggc 780 ggaaatggaa aagccgggaa gtcagataag gaatgttaca actgcaagaa gaagggccac 840 atcaagtcgg aatgttgggc taaaggaggc ggaagggaag gacaaggtcc aaggggtcga 900 cgaggtccaa accgggcaaa tcaagcatcg gaaatcaaca gcacgcttaa cgaagtcgcg 960 tacattgccc aatttagagc tttaagctcc agggacttct ccaaatcaga ctggtacctc 1020 gattccggca ctacctcaca catatgcacc atccgaaacg cctttatcga ttatcaaccc 1080 ctcaccaacg ccactctcga tggcgtagga cctaccccag caatcgtcga agggcgcgga 1140 actattgccc tgaatttcga gttagacgga ggcttgatgc gccatcagct tagtaatgtg 1200 ctgtatgtgc cacaagcgcc caactgtcta ctctcggcta ctcgcttaga tgcagctgga 1260 ggaactgtcg agataggagg agggaaatgt accatgcgga ataaggatgg aaggatcatc 1320 ggatttgggg taaaggagaa tggactatac aagctgaaag cccgcgccga agaatccagg 1380 aaaggaaagg cccacctcgc atcagccccg aaactcactt gggatcaatg gcatcgacgt 1440 tacggccata tctctgtcac agcactcgaa cgcctaaacc gggaacagct cgtacacggt 1500 ctggatatag atggagcatc aattccatca cgctcatgtg aagcttgcat cgaagccaag 1560 caaactcgca agtcttttcc aaaagaggct gagcacagat ccaagatacc cggagaacgt 1620 tttatggggg atatttgggg cccagcgcga acgatttcgg caggaaaatt caaatactat 1680 atcgccttca cggatgacgc aacacgctat acgactacgt tatttatacg cgaaaaatcg 1740 gaggcatttg accgaatcaa agggcacatt gcatggctga agaagatggg ccgttcccca 1800 cgatttgtca ggttcgataa tgcgaaggac ttgattaatt cgaagctgga ggcactctgt 1860 cgagaggagg gaatcgaaat cgagccaaca gcaccatatt cgtcatccca gaatggagtt 1920 gcagagagat tcaatcggac tgtgttggag ctagcacgag ctatgctcat agactccgaa 1980 ctgcctgctt tcctttggga tgaagctgta gctcacgcca cctacctccg aaatcgatct 2040 ccaactcgcg cactcaatta catgacccct tacgaagcat ggcatggaaa gaagcccgga 2100 gtcgcccact tacggccatt tggatgtaat gtttgggtat tggatgaatc agagaataga 2160 tcaaaattag 
caccaaaatc gaagcagatg aagctggttg gctttaacga cggctcaaaa 2220 ttgatccgct attacgaccc gcaaaagagg aacgtcaaag tgtcaagaaa tttcgcattc 2280 aacgaaaacg acgagctaaa ggagctcgaa gtggtagaaa tcccgggtat gcggactgag 2340 ggggagagca catcggattc tccagtctcg caaacccagg agcaaccgaa aactgaacca 2400 tcgagcccta tcttacctct catcgaaaca acaccatctc cttcgatctc gggcagtgca 2460 aacacgacag acgacgaaga caagacgatt agacgcccat tacgtgctag aaggaacgaa 2520 acagactaca agaaacttgg gaacccggac gcacgcactc caggactgcg attcactcct 2580 acgcctatgc gatcggtctc tccatcccct accatctcca ggcctacaga atcgtcaaaa 2640 gcgaaggcaa agtcgaagga aacggcaaac atcgcactgg cagcaatcgg cgaaccatct 2700 gaacgggaaa ccgcttttat ttcgcaattt tcggagccgg acttacctca gaacgtcgaa 2760 gaagcgttgc agggaccaga atcgggacga tggaaggaag ctatggacga agaattgaaa 2820 acattgaagg aaatgggtac ttgggagctg ggagagcttc cagaggggcg aacaacggtt 2880 gggagcaagt gggtgttcac aaagaagagg aacgagaagg gggaggtcac aaagcacaag 2940 gcgagacttg tagctcaagg tttctctcag aaacccggcc aggactatca gtacgacggc 3000 accttcgcac cagtaatgcg attcgagacg cttcggactg tcctgggtac gtctgccatc 3060 cgaaacctca aacttcgtca atttgacgtc aagggcgcct atttacacgg aagattgaag 3120 gaatcgatct tcatgcgcca gccccctgga tttgacgacg gtactggtcg agtatgcgtc 3180 ctcaaacgtt ccctttacgg cctgaagcag gctggcaacg tctggaacca cgaaactcaa 3240 ctccaagctc gaggatttcg acttcaaaca actcacagag cgattactgc tgctatattc 3300 gataccagga cgacgacttc actattctcc tcgtatgggt cgacgatatc atcgcagctt 3360 cgactacaga cgcctaaacg accagataga gtcagaactc gcacagcact tcgaaatcaa 3420 gtcactagga caaccctcac ttatcgtggg tctccaaatt cagcaggacg accactacat 3480 ctcgctttca caatcccatt acatcgacac actcttaaag cgctttgggc tagaaaatgc 3540 gaacccagta tcaacaccca tcgatcctaa cgtcaaactc gacgcattcg acagcaagga 3600 cggaaactcg ctcggagagg aggtagaccc ccgaatttcg tcgggatatg cgacgatgat 3660 tggttctctg atgtatctgg cgcttggaac gcgtccagag atctgctatg cggtcaaccg 3720 actcgcccaa tacacatctc gccccagagc cgtccatttc actgccctca agcgcatctt 3780 tcgttacctg aagggtacca agcatttcgc actcatattt ggaggattag atgacgaaga 3840 aatgacggaa gacctccaca 
tctacaccga tgccgactgg gcgaatggac cggacagaaa 3900 atcgatttcc ggctatgtta tcaccttggc aggtggagca atagcttgga gctcgaagaa 3960 gcagaatacc gtggctctat cgactgcaga agccgaatac gtcgccacta cgcacgccgc 4020 aaagcaagtc ctgtggcatc gacacttact cgaagaattg gaattttttc cccccgaaac 4080 atcgaccatc ttcagcgaca atcaggccgc tattgcgatc agtcgacacc ctgaatttca 4140 tccaagaacg aagcacatcg acctcgcata tcatttccta cgcgatcttg tcgccaacaa 4200 caccctcgag attgtttacg ttcgcaccga cctgaatctt gccgacttgt tcaccaaagg 4260 actacctagg tcacgtcatc aagaccttac ctacgaaatc ggtgttatct cgagaaaggg 4320 ggag 4324 // ID LTR14_CN repbase; DNA; FNG; 450 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 30-MAR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR - consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW LTR14_CN. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-450 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-450 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-450 RA Gentles A. and Jurka J.; RT "C. neoformans LTR sequence LTR14."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX CC Average similarity to consensus is 97%. 
XX SQ Sequence 450 BP; 102 A; 124 C; 99 G; 122 T; 3 other; tgtcacgtgg ctgtcgccac ccgacctctt cttcatgtca gcacgattat gacngtaacc 60 cttcctcatt cccacttcag tccgcgtccg agcatacccc ttcgttttcg gccgcggcct 120 tcccccgaga gatatacaga gatagagaga tcgacaaccc tccancacct tcctcgccac 180 cgggtttcgg caacgcaaaa ggatgtagag ctggctggtg tgggaaagat gagtcagcag 240 agcttgaggt gtaaaggtat ataagacagt agttttccat agttttagat acttgtcgat 300 ccactattgt tgtcctctcc aagtattctt cgtctactta aagctatcat cttccggcat 360 tgatctggct gnctatttaa agaggccctc ggatcatttg ttgtccctaa cctgttgaag 420 gctatcgcca gaagagatag ccccctgaca 450 // ID Copia-5_TMe-I repbase; DNA; FNG; 4239 BP. XX AC CABJ01001229; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-5_TMe_; KW Copia-5_TMe-LTR; Copia-5_TMe-I. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-4239 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01001229; Positions 193501 189263. XX CC Positions [1932-2144] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 96..1358 FT /product="Copia-5_TMe-I_1p" FT /translation="MATTRAAAPTPSTAAPAYPNPYTSLSHSIEKLDGSMA FT TGKSNYVAWKFRVLRILKEKGLARALEDNADDDDSMGEQTTRDRINDQAFT FT IISLNIKNSQIPHIQAATSAKEAWESLAKVHQGIGSNGRMVLMQKLWSLHL FT KQGQDMSAHVNSFKELATQVANLSPDGVGIPDSDLVSMLSLSLLESYEPLI FT MAVQSRAENVTFDFLTGRLLQESTRRQAARSTTADHTSQPLSAFTAASGLR FT PGGLRGRGNARWGGRGFGRGMRGGPSGIGRGRGSPGQGMVSGRCHYCNKEG FT HWKNECYKRKGDLQQGSGGGHLAFMGYSKPVIGESDWIIDSGASRHLTARR FT ELLEDYISMLPTSIMIGNGKDINAIGQRNITLQTTSGMISLSGVLYVPDIG FT SNLLSVASIVDQGFHVELTRAGFSVNK" FT CDS 1935..3479 FT /product="Copia-5_TMe-I_2p" FT /translation="MTTPYTPEYNGIAERANRTIMDMVRCMLFDSGMGKEF FT WEFAALTAVHIINRLPSTSHENKTPFEKWFGKPPSIAHLRVFGYVAYWHIP FT SQTRTKLEPKARRCRMIGYKEESGSRVYRLYDTASKQVILSRDVLFDETTQ FT GVTADSGPSTTTPTESEREDRNQKETPSEKEDEEAEGTLLPSIDPDEKDAA FT TPSYDKDTIIVRPLQPQEQTRQAEENTQRTTNESRRVLRKHPTREMFRPNG FT GQALIAWTEEPQSLEEAFGGEDRKEWNAAWQSELESLRKNGTWVVEKAPPD FT RNIVGYRWLFRRKEDGRYKVRLVAKRYSQEPGIDFKETFAPVAKFTTLRVL FT LALVAENDWELHSMNVKTAFLNGVLEEEIYMQCPEGISEATQNGMTCRLVK FT PIYGLWQSPRAWYQKIHSFFLSHDFIRSTQDYSLYINYGRRVLVLVYVVDL FT VLAAAEVEDIGWIKGALTEAFEMTDLGELTNFLGLEIGRKRSQRLLTLHQR FT KYIDKILTSHRFPSMSYTT" XX SQ Sequence 4239 BP; 1266 A; 942 C; 1091 G; 940 T; 0 other; ttttaggtta tgagcccggc tcgcgcactt taattagcct tcatactccc tctatctgct 60 ttcagtctct attctaaact cacattatct ccacaatggc gacaacgcga gcagcagctc 120 caacaccaag taccgccgct ccagcatacc caaatccgta tacctctctt agccactcga 180 tcgagaagtt agatggatcc atggcgacag ggaaaagcaa ctatgtggca tggaagtttc 240 gcgtgcttcg tattctcaag gaaaagggtc ttgctcgtgc attggaagat aacgctgatg 300 atgatgattc tatgggagag cagactacca gagatcgtat caacgaccaa gcctttacaa 360 tcatctccct caacatcaag aactcgcaga taccacatat acaggcagca acaagtgcga 420 aggaggcctg ggaatcgcta gccaaggttc accaaggtat aggatcgaac ggacggatgg 480 tgctgatgca gaagctctgg agcctacact tgaagcaagg ccaggacatg tctgctcacg 540 tcaatagctt taaggaactt gcgactcagg ttgccaatct atcaccagac ggcgttggca 600 tccccgatag tgacttggtg tcaatgttaa gcctatccct cctagagtca tatgaaccgt 
660 taatcatggc tgtccaatcg agagcggaaa acgtcacctt cgactttctt acaggacggc 720 ttctgcaaga atcaacccgc agacaagcag cacgtagtac taccgcagat cataccagtc 780 aaccactatc tgcatttaca gcggcatctg gcttgcgtcc tgggggttta cggggcaggg 840 gaaatgctcg gtggggtggt agaggttttg gtagaggaat gcggggtggt ccttcaggaa 900 taggcagggg gcgcggaagt ccagggcaag gtatggtgag cgggcgctgc cattactgca 960 acaaagaggg tcattggaag aatgagtgct ataagaggaa gggagactta cagcagggtt 1020 cgggtggagg acatttggcg tttatgggct attcaaagcc agtgattggg gagtcggatt 1080 ggataatcga ctcaggtgca tcgaggcacc ttaccgcccg acgtgagttg ctagaagact 1140 atattagcat gttaccaaca tcgatcatga ttggaaatgg gaaggatatc aacgcaatag 1200 gtcagaggaa cataactctc caaacaacat ccggaatgat ctcactctcg ggagtgttat 1260 acgtacctga tatcggtagc aacctactca gcgtcgccag tattgtcgac caaggattcc 1320 atgtcgagct tacaagggcc ggattcagtg tcaataaatg aaataccgaa cgtgtaatcg 1380 gcaggcgaca agggaatatc tattttgtag ctgggctaca agaaatagca ttcgctggac 1440 tctcggatca gaaggacgct acaacaaagg aaatttggca tataagaatt gctcaccgat 1500 ctctaaacga acaaggtggt caatggatca caaagtcagt aatcggattc aacttaacag 1560 acaacgagaa gcaacaagcg agagtatctg ggatttgtgc ggaaggaaaa caagcaaggg 1620 aaagtctcac cggagaacgt gtcaagagcc aggagttgct acacacaatc cactcagatg 1680 tgtgtggacc gatggcgagt actggcttta tgggagagcg gtactttgcc acatttatcg 1740 atgaaggatc tgggcgtatt gctgtatcac ttcttacgta gaagttagaa gtgttcgaaa 1800 ggtttaagta gtacaaggcc aaggtagaaa gggagagcgg aaagagaatc aaatccatta 1860 ggtgcgatgg aggaggagag tatatgggaa acaaccttcg aaattacctc gcagaacaag 1920 gcattacgta atgtatgacg acaccataca caccggaata caatggtatc gcggaaagag 1980 caaatcgcac tatcatggat atggttagat gcatgctctt tgactcggga atgggaaaag 2040 aattttggga atttgcggct cttactgcag tgcacatcat taatcggcta ccctctacct 2100 cccacgaaaa caaaactccc tttgagaagt ggtttggaaa accaccgtca atcgcacact 2160 tacgggtttt tggctacgtg gcctattggc atattccctc ccagacgaga acgaaacttg 2220 aaccgaaagc acggagatgt cggatgatag gctacaagga agagagtgga agcagagtat 2280 atcgtctgta tgatacagcc agcaagcaag ttatactatc acgagatgtt ctgtttgacg 2340 aaacgacaca 
gggagtaaca gccgactcag gaccaagtac tacaacacca acagaaagcg 2400 agcgtgaaga ccggaatcag aaagagacac cgagcgagaa agaggatgaa gaagcagaag 2460 gaaccctgct tccttctatt gatcccgacg aaaaagatgc agcgactcct tcctacgata 2520 aagatacaat cattgtacga ccattgcaac cacaagagca gactaggcaa gcggaagaga 2580 acacccaaag gactacaaat gagtcacgaa gagtactacg aaagcatcct accagagaga 2640 tgtttcgccc aaatggtggg caggctctta tagcatggac tgaagaacca caaagcttag 2700 aagaagcatt cggtggagaa gacagaaagg agtggaatgc tgcctggcaa agtgagttgg 2760 agtcactacg gaaaaatgga acgtgggtag tggaaaaggc accgccggat agaaacattg 2820 taggatatcg atggctgttt agaagaaaag aggatggaag atataaggta cgccttgtgg 2880 cgaagaggta ctcacaggaa ccaggtatag acttcaagga aacatttgcc ccggtggcaa 2940 agtttacgac cctcagagtt ctcctcgcat tggtagccga aaatgattgg gagctacata 3000 gtatgaatgt caaaacggcc tttctcaatg gggtactcga agaggaaatc tacatgcaat 3060 gtccggaagg tatttctgag gcaacacaga atggaatgac ctgtcgtcta gtcaagccaa 3120 tttacggtct atggcaatcc cctcgtgcct ggtaccaaaa gatacattcc tttttccttt 3180 cccacgactt tattcggagt acgcaggact acagcctcta tatcaactat ggcagaaggg 3240 tgctggtatt agtgtatgtc gtcgatcttg ttctagcagc ggcagaggta gaagacattg 3300 gttggataaa gggtgcactt acggaagcct ttgaaatgac ggatcttgga gaacttacca 3360 actttctagg actagagatt ggcagaaagc gaagccaaag gctacttacc cttcatcaga 3420 gaaagtatat cgataaaata cttaccagcc atagattccc gtccatgtct tacaccactt 3480 gatccgcata cgagacatct cccacacaac aaagaaaccg aagggacaac tgttagcttg 3540 gacttgtacc agtctgccgt tggatctcta atgtacgcaa tcctgggtac tcgtccagat 3600 ctagcttatg ctgtcggatt agttagccag ttcaaccatg ctccactagt cgaacactgg 3660 gtagcggtga aaaggatttt tcgttatctg gttggaactc gtaccctcag gttgcaatac 3720 ggatccagta atcagagcgg aggatattcg gatgctgact gggcatctgg tcatgatagg 3780 aagtcagttg ggggttttgt ttttctcttg aatgggggcg cggtttcttg ggctagcaaa 3840 aaacagagtt cgatcgccct ttctaccacc gaggccgagt acatggccat gacatcggcc 3900 agcaaggaaa tcatatggct tcgagtactg ttggaagaag tgggcgcgct taaccatata 3960 acccagatgg cgacacttta cggagacaac cagggagctt tggtattagc tcgtaaccca 4020 gaatatcatg cccggacgaa 
gcacattgac atacagtacc gctttgttcg agaactcgtc 4080 caagacgaga aagtcaacct tgactactgc ccaagccccg atatgattgc cgacattatg 4140 accaaagccc tcccacgtcc cgcacacgag aaacatacca cggcaatggg aataatcgac 4200 aggactggaa aacaatacag aacgcttcgc gagggggcg 4239 // ID Gypsy-70_MLP-LTR repbase; DNA; FNG; 360 BP. XX AC AECX01001241; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-70_MLP_; KW Gypsy-70_MLP-I; Gypsy-70_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-360 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001241; Positions 65020 65379. XX SQ Sequence 360 BP; 77 A; 100 C; 55 G; 128 T; 0 other; tgtgataccc tgaatcccat tgttctcttt gttacctgtt tctttcctca ccgctttctc 60 aagaatgctt ttccttttac tttgtgtgct ttttaaatgt actctcattt ctcttatcga 120 caaatccccc ggattcgtct accaatgttg tcgaccggat ccaagtacca acggatccgg 180 ccatcctagc tttatctcga tgtctctgac atcaccgcgt tgtatcttcc cacctatttc 240 cttatttgta ctttgcagag atggatgtct atataaagac atccatccct cgcttgtgga 300 atgaaacctc aagcagcttc tcctacagtt tcatcgcacc gatccttgtt aggaataaca 360 // ID Gypsy-1_CCO-I repbase; DNA; FNG; 5407 BP. XX AC AACS02000001; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_CCO_; KW Gypsy-1_CCO-LTR; Gypsy-1_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. 
XX RN [1] RP 1-5407 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000001; Positions 101659 96253. XX CC Positions [4107-4595] - Integrase core CC 'ATAGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 117..5243 FT /product="Gypsy-1_CCO-I_1p" FT /translation="MMDTLKLRGRTLERIEPVRKARKPRNANRTTVVTDSP FT RRDPQTTQLDDETPTGVQLERACGLLPAVDGETSLCTEFDDEDVPLPELSS FT IIDDSSDESDDESFTVTPPFVFQAHVDFDNWGADENVTVKMAYNARMATVK FT DNGKHFPTLTAGKCTIDVLDTFHVACENKAYQKGLADDKVVKHLFGCFEDH FT RIITWIGANRATLAALNITEFMNRLKELVLERDWATEWKRKISTRRQKDNE FT TFEDFANEMIYWNNLLRGTEKHRDNERMKDLLFSNCVDEISDEYQLSDIPE FT QFENADFNEIAVFNEWIKKVIGIDNKIARLRALSKKLLDAERSNKKARTNN FT SGSTGGSSDGTRAGRRADWVPRLTEEEKDILANHRGCFKCRQFYAGHRWAN FT CPNGYPTMKNYKKITVEMAADAKKKYDEANKSSTAKGKAIAAVLPSDDIEY FT EDSSSEESAEKDSSAEILDENDDEEENDVSDPTTSVHLLWKACMAGTSDFP FT EDVLTMFDTGCHTVLMDSKVATKHGLKRYPLKNALPVNPAFITSTDAEPFS FT LTEFVRFDLWSNDNVFFSKKFKAILVDNLCVDILLGLPWLEAHNVIIDCAA FT RKCFINGVYDFVIPRQVRRPQVLPRSGKERTHQLRRNWKVMIAELKKVLKP FT IRDIVDRRWERRPKQRFIKSMVGAIKAAVINVELKEKADKMRAEFSDLFQP FT IPHASQLPDQVTASITLKDATKTIETRTYSCPRKFRDAWKKLIDEHLKAGR FT IRPSNSPHASPSFIIPKADPTAPPRWVNDYRQLNANSVADRNPLPRIDDIL FT ADCGKGKIFSKLDMTNSFFQTRMRPEDIKWTAVTTPFGLYEWLVMPMGFKN FT APAIHQRRVSEALKKYIGHFCHVYLDDIIIWSKDVSAHEKHVRLILEALRK FT NKLYLNGKKSKFFCESVDFLGHVISKDGIRPDGSKVDRVVNWPVPKNASDV FT RRFLGLVRYLGSFLPDLARWTSVLNPLTKKECDRNWPGWTSTHGRAFEEIK FT KLVVSSDCLTTIDHENPGENKIFVTTDASEKRTGAVLSFGPTWETARPVAF FT DSTPLKKAELNYPTHEKELLAIINALKKWRADLLGEQFFIYTDHRTLEFFQ FT TQKELSRRQARWMEYMLQFDGKIVYVKGEDNVVADALSRLPIVDNSIAAEK FT TAREFFPCDELPGSSTPPVGALFTPKGAGIDVCAMTEALSSVRFKKPKETN FT TSLTLDESFVRDILRGYQLDPWCQKFENAAKGMAAFKKKNGLWFVNNRLVI FT PKYSNCRALVFQLAHDRLGHFGFEKAYAAIKNEYYWPGMRSDLEKGYIPGC FT EDCQRNKDNTTKKAGPLHPLPVPDGRGESIAMDFIGPLPVDEGYDMLLTIT FT DRLGSDIQLIPVKSTITAEQLANVFFDKWYCENGMPSHIISDRDKLFTSKF FT 
WKALHTLTGIKLQLSSAYHPETDGSSERTNKTVIQSLRFWVDRNQKGWVKA FT LPRVRFALMNTVNASTGFTPFQLKMGRSPRIFPSLIPREGQPRLQDVLNVI FT EKLQHDVSEAQDNLLAAKIDQAHQVNKHRTDEPSFEPGDRVWLTTRHRRRE FT YLSQDSRRVAKFMPRFDGPYTILDAHKDTSSYTLLLPEHSNTHPVFHVSLL FT KKCVPNDDNKWPDRRHTIPKPIVTEDGQEEWEIERILDRKRAGKGWRYLVR FT WVGHGPECDVWLPGKEVEDCEALDVFLDGLRTNGNTDPDA" XX SQ Sequence 5407 BP; 1392 A; 1478 C; 1277 G; 1260 T; 0 other; cttttttttg cgtatctcca aacgagtcta cacgccactt agtgggcgtg gattggtagc 60 gcgtctagcg ctagaacgta tacgacggta gctgttgcta caacgacgat agacacatga 120 tggataccct caaattacga ggaagaacgc ttgaacgaat tgaacccgtg cgcaaggcgc 180 ggaaaccaag gaatgctaac cgaacgacgg tggtgacgga ctcgcctagg cgggatcctc 240 aaaccacaca attagacgac gagacaccga caggtgtcca attagaaagg gcgtgtgggt 300 tattgcccgc cgtcgatgga gaaacttcat tgtgtaccga gtttgacgac gaggacgtac 360 ccctacccga attgtcttct attatcgacg attcctctga cgagagtgac gacgagtcgt 420 tcacggtcac cccgcctttt gtgtttcaag ctcatgttga ttttgacaat tggggtgctg 480 acgaaaatgt gacagtaaaa atggcctaca acgctcgcat ggctacagtc aaggacaacg 540 gcaagcactt cccgaccctc actgccggaa aatgcaccat cgatgttctc gacaccttcc 600 atgttgcctg cgagaacaag gcttaccaaa aaggactcgc cgacgacaag gtcgtcaaac 660 atctcttcgg gtgttttgag gaccatcgta ttatcacctg gattggagct aatcgtgcta 720 ctctggctgc tctcaacatc accgagttca tgaaccgtct caaggaactt gtactggaac 780 gcgactgggc taccgaatgg aagcgaaaga tctcgaccag gcgccagaag gacaacgaga 840 cgttcgagga cttcgccaac gaaatgattt actggaacaa cctcttgcgt ggcacagaga 900 aacatcgcga caacgaacgc atgaaggacc tcctcttctc taactgtgtc gacgagatct 960 ctgacgaata ccaactctcc gacattccgg aacagttcga gaacgccgac ttcaacgaga 1020 tcgctgtttt caatgaatgg atcaagaagg tcattggtat cgacaacaag atcgctaggc 1080 tcagggccct ctccaagaaa ctcctcgacg ccgaacgatc caacaagaag gctcgcacca 1140 acaactccgg ttcgaccggg ggatcttcgg atggcacccg cgccggtcga cgcgctgact 1200 gggtcccgcg attgactgag gaggagaagg acattctcgc taaccatcgt ggctgtttca 1260 aatgccgtca gttctacgcc ggccaccgat gggccaactg ccccaacgga tacccgacca 1320 tgaagaacta caagaagatc acggtcgaaa tggctgctga tgccaagaag aagtacgacg 1380 
aggccaacaa atcctccact gccaagggta aagctattgc tgccgtactc ccctccgacg 1440 acatcgagta tgaggattca tcttcggagg aatcggcgga gaaggactcc tcggcggaga 1500 ttttggacga aaacgacgac gaagaagaga acgacgtgag tgatcctaca acttccgtac 1560 atctattgtg gaaagcctgc atggcgggga cctccgattt tccggaggac gtactgacaa 1620 tgtttgacac cggctgccat acagtcttaa tggactcgaa agtcgccacc aaacacggcc 1680 tgaagcgata cccactaaag aacgccttac cggttaaccc tgcattcatt acttccaccg 1740 acgcggaacc tttctctttg acggagtttg ttagattcga cttgtggtcc aatgacaacg 1800 tattcttttc caaaaagttt aaggccattc ttgtggataa tctctgtgtc gatatcctgc 1860 tcggtttacc ttggctcgaa gctcacaacg tcatcattga ctgcgccgcc cgtaagtgtt 1920 tcatcaacgg tgtttatgac ttcgtgattc cgcgacaggt ccgccgccca caggttctac 1980 ctcgttcagg aaaagagcga actcatcagc tgcgtcgcaa ctggaaagtc atgattgccg 2040 aattgaagaa agttctcaaa ccgatccgcg acatagtcga ccgccggtgg gaacgtcgac 2100 caaaacagcg tttcatcaag tccatggtgg gcgcgatcaa ggccgcagtt atcaatgtcg 2160 agttgaaaga aaaagccgac aagatgagag ctgagttttc cgatcttttt caacccattc 2220 cacacgcttc ccagttacct gatcaggtca cggcgtccat cacgctcaaa gacgccacga 2280 aaaccatcga aacccggacg tacagttgtc ctcggaagtt ccgagacgcg tggaagaaat 2340 tgattgacga acaccttaag gctggtcgta ttcgcccttc aaattcccct catgcatcgc 2400 cgtctttcat cattcctaaa gcggatccaa ccgcacctcc tcgatgggtg aacgactacc 2460 gccaacttaa cgctaactct gttgccgacc gtaatccctt accaaggatt gacgatattc 2520 tagccgactg cgggaaaggt aaaatcttct cgaaactaga catgactaac tcctttttcc 2580 aaacacgcat gcgtcctgag gacatcaaat ggactgccgt taccacacca ttcgggttat 2640 atgagtggct cgtcatgcct atgggtttca aaaacgcccc tgccattcat caacgccgag 2700 tctccgaggc cttgaagaaa tacatcggtc atttttgcca cgtctatctt gacgacatca 2760 tcatctggtc caaggacgtc tccgcacacg agaaacatgt ccgactaatc ctcgaggctc 2820 tccggaagaa caaactatac cttaacggta aaaagtctaa gtttttctgc gaatccgttg 2880 actttttagg tcatgtaatc tccaaggatg gcattcgtcc cgatggttcc aaagtcgatc 2940 gcgtcgttaa ctggcccgtt cccaagaacg cttccgatgt ccgccgcttc ttaggattgg 3000 tacgttatct tggaagtttt ttgcctgacc tcgctaggtg gacttcggtc ttgaacccgc 3060 tcacgaagaa 
agaatgcgat cgcaactggc cgggttggac ttccactcat ggtcgtgcct 3120 tcgaagagat caagaagctc gtcgtttcct ctgattgctt aaccaccatc gaccatgaga 3180 accccggtga gaacaagatc ttcgtaacca cggatgcttc tgaaaaacgc acaggcgccg 3240 tgttatcgtt tggccctact tgggagactg cccgacctgt cgccttcgac tctactccat 3300 tgaagaaagc tgaacttaat tacccaacgc acgaaaaaga attgctcgcc atcattaacg 3360 ctttgaagaa atggcgtgcc gacctcttag gcgagcagtt tttcatttat actgaccacc 3420 gaactctcga gtttttccag acacagaaag aactctcgcg tcgtcaagct cgctggatgg 3480 agtatatgct ccaattcgat ggaaagattg tgtacgtcaa aggggaggac aatgttgtcg 3540 ctgacgctct ttcgaggtta cccattgtcg acaacagtat cgccgctgag aagacagctc 3600 gcgaattttt cccttgcgac gaattacccg gttcatcgac gccgcctgta ggagcgctgt 3660 tcacgcctaa aggtgcgggt atcgacgtct gcgccatgac tgaagcgctc tcctccgtca 3720 ggttcaagaa accaaaggag accaacactt ccctcaccct tgacgaatct ttcgtccgag 3780 acattcttcg cggctaccag cttgatcctt ggtgccagaa attcgagaac gccgctaagg 3840 gcatggcagc tttcaagaag aaaaacggcc tttggttcgt caacaaccgt ttggtcattc 3900 cgaaatactc caactgccgt gccttggttt tccaactcgc tcacgaccga ctgggccact 3960 tcggtttcga aaaagcttac gctgcgatca agaacgaata ctattggcca ggaatgcgtt 4020 ccgatttgga gaagggctat attccgggat gcgaagactg ccagcgcaac aaggacaaca 4080 ccacgaagaa agccgggcct ttacatcccc tcccggttcc cgacggtaga ggagaatcga 4140 tcgcaatgga cttcattggt cctttgccag ttgacgaagg atacgatatg ctgttgacca 4200 tcacggacag attgggttcc gacatacagc tcattccggt gaagtccacg attaccgctg 4260 aacagctcgc taacgttttc ttcgacaagt ggtattgcga gaacggcatg ccttcccaca 4320 tcatttccga tcgcgataag cttttcacgt ctaaattctg gaaggctttg catacgctaa 4380 caggcattaa gttacagtta tcgagtgctt accatcctga aacggacggc tccagcgaac 4440 gtaccaacaa gaccgtcatc cagagccttc gcttctgggt tgatcgtaat caaaagggct 4500 gggtcaaagc tttacctcgt gttcgattcg cactgatgaa tacagtcaac gcctctacag 4560 gtttcacccc gttccaattg aagatgggcc gttcaccacg tattttccct tctctcattc 4620 cgcgagaagg tcaacctagg ctgcaagacg ttctgaacgt catcgagaag ctccaacacg 4680 acgtctccga agcccaagac aacctcctcg ccgcaaaaat cgaccaagct catcaggtca 4740 acaaacacag gactgatgaa 
ccgtcctttg aaccaggcga tcgagtctgg ctcaccacaa 4800 gacatcgccg ccgcgaatac ctttctcaag atagtcgtcg tgtcgccaag ttcatgccta 4860 gattcgacgg accgtacacc attctcgacg ctcacaaaga cacatcctcc tacaccctcc 4920 ttctacctga acactccaac acgcaccccg tcttccatgt ttctctcctg aagaagtgcg 4980 ttccgaacga cgacaacaaa tggcctgatc gacgccacac aattcctaaa cccatcgtaa 5040 cggaggacgg gcaggaggaa tgggagatcg agaggattct tgacaggaaa cgggcaggaa 5100 agggctggag gtatttggtt agatgggtgg gacatggtcc ggaatgcgac gtatggttac 5160 cagggaagga agtcgaagat tgcgaagctc tcgacgtttt tctggacggc ctccgaacga 5220 acggaaacac ggatcccgac gcttaattcc ttttttcttc cttttatttc ttttcccttc 5280 gagctactgg gactaacggt tgacttttcc ccactggggt ttttaatgca ccacgctaac 5340 aatgggtccc tcttttacga tagccttttt acgataattt cttctttttt ttcatagggg 5400 ggaaggg 5407 // ID Gypsy-1-I_AN repbase; DNA; FNG; 4909 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 8.11, Last updated, Version 1) XX DE Internal portion of a LTR retrotransposon from the gypsy DE superfamily - a consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1-I_AN; KW Gypsy-1-LTR_AN; Gypsy superfamily. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-4909 RA Kapitonov V.V. and Jurka J.; RT "Gypsy1_AN, a family of gypsy LTR retrotransposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(11), 188-188 (2003). XX DR [1] (Consensus) XX CC LTR retrotransposon. Gypsy superfamily. Internal portion. 
XX SQ Sequence 4909 BP; 1456 A; 1160 C; 1178 G; 1010 T; 105 other; ttaggaatcg ctcactaatc tcaataatag tatgaggaga ccttttacta tgacaatgga 60 agaagaaagt gtcacattgt tgctacagca gctccaggag ctccgtacag agatgcggac 120 tcagaaacaa cagctccaag aagagaataa cagcttacgg gcggaactac aggccgtacg 180 gaactcgcag ctgagaaacc atccaccagt tactactaca gttacatctg caacgcccac 240 cccctacgaa cgaagctatc cccgtcctcg tcacccggat gtcgaaccct ttactggaga 300 agaccctaag gactaccctc ctttccagat gaaccttcgt acgaagtttg caatcgacgc 360 cgcctgctac cctacagagg aggaacaagt ttactatgcc tacagccgcc tgagaggaaa 420 agccagccag cgtgtactac catggctctt ggctcgccag aaatctgaga ctcctgtgct 480 atgggcagaa ttctccgcgg tactagacaa ggcctttggt gaccctgacc gacagagaaa 540 ggctcttgta cgagtgaata caataaagca agggaracgt gactttgaag agttcttgaa 600 tgaatttgac gaagaacttc ttaatgctgg agggattaat tgggatgata accagaagaa 660 ggccttgttg gacacggcaa ttaatgttga gttgctaaaa gccatggttg gtattaggca 720 ggaggattcg tacgacaact actgtaatca actgcgcgaa atcaaccaca acctccagag 780 agtggccagg cttacaygaa aaggatctca cgctgctgtc cccatgcatg tcgcttgtac 840 aagaccagca ggaggctctg accggaccgg racccctgat caaatggact gggaagccac 900 ccatgctcaa attgcagccc tacaaaagga agtygcggcc ctccgtatga aagggaccag 960 gaccccaaga aaagctagtc aggcgcctgc agaggagaag caaaagaggt tgtctgaggg 1020 caaatgccta cgctgcggyg atcctgacca ctttatacaa gagtgcccta taaaacctac 1080 cagacgccct aggcaggtgg ccacagttca ggaagaacaa gaccaaatcg atgactacag 1140 caagagcgag tcggaaaacg aataacctct gtgcaaagtc gcgtacagag gggttataca 1200 gctagagaaa tactacttga ttggcaagat ttcaacagct cgcgcatgaa taccccccca 1260 ttcttggtgg argcactagt caaccatacc tataatgctc gtacaatgat agatacaggc 1320 tgcctgacct atggggtaat cagtgacaag tttgtcaaga tacatyaaat acctaccata 1380 cctatccacc cgaaaccttt caagggagtg accgggaata tagaggagat taataagatt 1440 rtacaggttc agctagacat cggggcgcat acagaaaaag gagcctactt ctatgtgata 1500 ccygataacc tgggctatga cytgatcttg ggactcccct ggctggagca acatgatgga 1560 aggttagagg ctaagagggg caggctgtac ctctgtacta ctggagtcyg tctatggagt 1620 actacaaaga ggcccttacc aaagctggac 
atagcacaga tatcagctgc aaccatggga 1680 ggatttatac aaaggaaaag gtgccatggc caagatatcg agatatttgc ggtctcattg 1740 gcagatatac agaaggcact ggccccaaag agacatattg acccccgtac aaagctacca 1800 aggcaatact ggaaatacct aaggctcttc gaacaagaca aagctgaaga actaccaccg 1860 caccggggag atgggattga tcacaaaatc gagctcgtac aggaggagag tgggaaggat 1920 cctgaagtcc cctggggccc cctttacaac atgacccagg aagaactaat agtcctccgg 1980 aaaacactct ctgaactact acagaaaggc tttatccgcg tgagccattc cccagctgca 2040 gccccagtac tctttgtaca aaaaccagga ggaggactgc ggttctgcgt cgactaccgt 2100 gctctaaatg ccattaccaa gaaggaccgc tatccattgc ccctgatcca tgagacactg 2160 aaccaaattg gacaagccag atggtttact aagctggatg tgtctgctgc cttccataag 2220 atccgcatag ccaaaggcca ggaatggatg actgccttcc gtacgagata cgggctcttt 2280 gaatggctag tcaccccttt tgggttggct aatgcaccga gtaccttcca aaaatacatc 2340 aactggaccc tccgggaata tctagatgaa ttctgctcag cctatattga cgatgtgctt 2400 gtctatacca atggggacct ccgccagcac cggaagcacg tatgaatggt cttgaagaaa 2460 ctggaagaag caggcctata tttggatatt aagaagtgcg aatttgagtg caaggagaca 2520 aagtacttgg gctttataat acaggcaggg aagggaatca aaatggaccc ggagaaggtg 2580 aaagcaataa aggaatggga aacccctact actataaagg gcgtccgagg attcctgggc 2640 tttgccaact tctaccgaag gttcatccct aacttctcag ggatcgtacg cccactaaac 2700 aacttgacaa agaaaggaac acccttcttg tggactaagg agtgccagga tagctttgat 2760 ctgcttaagg aaaagtttat tactggacct gtcctagcaa ccttcaaccc ttcctacygt 2820 acggtagtag agaccgactc ctcaggttat aatacaggag gagttctctc tcaatataat 2880 raaaaagggg aattgcaccc atgtgcctac ttctctaaaa ggaattctcc agctgaatgc 2940 aactacgaga tctatgacaa ggagctactt gcaattgtac gatgtcttga agcctgggat 3000 gctgaactgc gctcatgtgg agaattccaa gttattacag accacaagaa cctggagtac 3060 ttcttctccc caaggaaact gacagarcgr cacgtacgat ggtccttatt tctcagccgg 3120 ttcaacttca agttagtata taggaaaggg tcagccaatc agagagctga tgcactttca 3180 cggagagacc aagacatgcc tgatgatraa gatgacaggg tcaagtctcg tacgatgcaa 3240 ctytttaswr aaaaacactt gggraaratg gtagttgcca ccctycaacc aactgragag 3300 cmaccatgsg agccgtrtga raaargtgat atgtggaarg 
aggcactcaa rcaggatraa 3360 rgrtatartg argcartacw gtgcctgaar gatggagcaa ggaratttcc yccacaycta 3420 carttgaaag tcggaayctc rgaatgccar ytrgacgccc aagrcyatat cctcttccgy 3480 ggraggaggt gggtrcctgr kagtgaacag ctccgtacaa rtataattca rgctgcacay 3540 gactctatat tgacaggaca tcctggccgr gagcaaacat atwtgctggt tagccgtgaa 3600 tayttctggc ctaacatgtc ccargatatc agragattcg tccgraactg tgatatatgt 3660 ggraggacca artcttggag rgaccagaga argggrctat taaagcccct ccctgtgcct 3720 gatcgtccct ggcaggaggt ttcaatggat ttcattacag acctaccaga gagtgaaggt 3780 tgtacaaaca tcatggttat cacagaccgr ttaaccaaag gtgtgatact agaaggaatg 3840 tcagaraytg actctgagag tgtggcctgg gcmctcgtac gagtacttat aagcaaacay 3900 gggatcccga aggctatcac ctcggacagr ggaagccagt ttacaagtaa tacatgggcy 3960 cgcatatgta ccctgacagg gatyaaccgc cgrctatcta cagcctatca yccycagact 4020 gatggatcaa cagagagrat gaacagyaca gtggaracct acctccgcat statacctgc 4080 tatgaccaga rggactggaa caggytacty ccacttgcag agctrgcaat taatggccgt 4140 acatcaacag caacaggggt cagccccttc tayctaagcc atgggtayaa cctcagccca 4200 tttascccta ccgaggaggt agagcawcta gctgaagaac carccaagag tcctatccag 4260 aaaggggaag cyattgtacg gaaagttaag gaagccctag actgggctca agcctccatg 4320 gcctattccc aacagaatac agagaatcag gctaataaac acaggagccc ggccacaaac 4380 taccaagtgg gagataaggt ctggctaagt ctgaagaaca tctgtacgga ccgacccagy 4440 aagaaactkg actggaagaa cgccaagtat gaggttatag gcctggtggg cagycatgct 4500 gtacggctga atacaccccc agggatccat ccagtcttcc atgtggacct gcttcggctg 4560 gcttcatcag atccacttcc ttcccagaag aatgatgata sccagccccc trgcatcatk 4620 gtgaacggyg agraagaata catggtagag aaaatcctgg acgaacgtcs caggagatac 4680 gggagaggtc accggctgga atacctagtg aaatggtcag gctatgctcr gccaacctgg 4740 gaagctgcca cagctttgga ggaagtacaa gctctggatg agtggctgga tcgtacaaaa 4800 cartatagac ttcaggacgg ctcactaaac agagatgcat atataaaggc taaagcaaca 4860 tgacctaccc tgtgacctgt acttcctaca tagagggagg ggggggtac 4909 // ID FLIPPER repbase; DNA; FNG; 1836 BP. XX AC . XX DT 23-APR-1999 (Rel. 4.03, Created) DT 17-APR-2011 (Rel. 
4.03, Last updated, Version 2) XX DE Flipper, pogo-like autonomous DNA transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; transposase; KW pogo superfamily; FLIPPER. XX NM FLIPPER. XX OS Botryotinia fuckeliana OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Helotiales; Sclerotiniaceae; Botryotinia. XX RN [1] RP 1-1836 RA Levis C., Fortini D. and Brygoo Y.; RT "Flipper, a mobile Fot1-like transposable element in Botrytis RT cinerea."; RL Mol. Gen. Genet 254(6), 674-680 (1997). XX RN [2] RP 1-1836 RA Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (17-APR-2011). XX DR [1] (Consensus) XX CC Flipper is a pogo-like DNA transposon. It has 48 bp-long TIRs and CC is flanked by TA target-site duplicates. Flipper's ORF of 533 CC amino acids encode transposase. XX FH Key Location/Qualifiers FT CDS 149..1744 FT /product="FLIPPER_1p" FT /translation="MTKPYTEDDIAAALFAIAGGMSMRKACSEYGIPRTTL FT HNRINGHLSHKKGAQNLQKIAPVQERALANWILVQEALGTSPTHRQIRELG FT ESILNLEGGDLSLGKRWIHSFLERNPEIKTKRQYKIDNARINGATTEIISK FT FFEKLDLPAIKYIKPENRWNMDEAGIMEGQGFNGMVLGSSKRRFIQKKQPG FT SRTWTSFIECISATGRALLPLVIFKGKTLQQQWFPIKLDNYEGWEFTATDN FT GWTTDSTGLEWLKEVFIPQSAPTRPKEARLLVLDGHGSHETTQFMLECFKN FT NIHLLFLPPHTSHVLQPLDLSIFSPLKKEYRYHLNTLDSLTDSTPIGKRNF FT LACYQKARLKALTLRNITSGWKASGLWPQNRAKPLLSRLLLENSNQEVVYQ FT NPVSDDDPELQWNIHSSFIAWKTPQKGSDIRKYADIMEKVDETDIPTRRLL FT FRKIQKGFDAKDYELVQSKKRIKQLEQKLEEIIPKKRRMVKTSPNSRFAGI FT EAIYQAQIEAGDREIEIKDSDSIYETDTTGDCIEVE" XX SQ Sequence 1836 BP; 619 A; 348 C; 361 G; 508 T; 0 other; agttgagtac cccactttcg gaccacccct cttttggacc acctaaaatc ttaccccatt 60 ttaagcacta cctcccaact tcatctttaa taaatcaaca accacatatt caattgatat 120 aagattttaa tacattatca attaagctat gacaaagcct tatactgaag atgatattgc 180 tgcagcactt tttgcgattg caggaggcat gtctatgcgt aaggcttgct cagaatatgg 240 tattccccgc accactttac acaaccgtat aaatggccac ctttcacata aaaaaggtgc 300 acaaaaccta cagaagatag ctcctgtgca ggagagagct ctagcaaatt ggattttagt 360 acaggaagcc ctaggaacta gccctaccca 
tcgtcaaata cgagaattag gagagtccat 420 tctcaacctc gaaggaggtg atttatctct gggcaagcga tggatacata gttttttgga 480 aagaaaccca gagattaaga ctaaaaggca atataaaatc gataatgccc gtatcaatgg 540 tgcaactacc gaaattataa gcaagttctt tgaaaagttg gatttaccag caattaagta 600 tatcaagccc gaaaacagat ggaatatgga tgaagctggt ataatggaag gccagggttt 660 caatggtatg gtacttggga gttcaaagcg acgttttatt cagaaaaagc aacccggttc 720 aagaacgtgg acctctttta ttgagtgtat ctcagctacg ggaagagcac ttttaccttt 780 ggttatattc aaaggtaaaa cacttcaaca acaatggttt cccattaaac ttgataacta 840 tgaagggtgg gagttcactg ctacagataa tgggtggact acggattcta caggtttgga 900 atggctaaaa gaggtgttta taccacaatc agcaccaact cgaccgaaag aagcaagact 960 ccttgttttg gatgggcatg gaagccatga aaccactcaa tttatgcttg aatgcttcaa 1020 gaataatata cacctcttat ttttaccacc ccatacatcg catgtactac aacctcttga 1080 tttatcaata ttttcacctc tgaaaaaaga atatcgatac cacctcaata ctctcgattc 1140 attgacggat tctactccca ttggcaaaag aaactttctt gcctgctatc agaaagctag 1200 attaaaagct ttaacacttc gaaatatcac ttctgggtgg aaggcttcag gtttatggcc 1260 tcaaaaccgc gctaaacctc ttttgtccag attattgctc gaaaacagta atcaagaggt 1320 tgtatatcaa aatcctgttt cagatgatga tcccgagctt caatggaata tacattcatc 1380 ttttattgca tggaaaaccc ctcaaaaagg aagtgatatt cgaaaatatg ctgatataat 1440 ggagaaagtt gatgagaccg atattccaac tcgccggctg ctttttcgaa agattcaaaa 1500 aggatttgat gctaaggact atgaacttgt acagtccaag aaacgaataa agcaattgga 1560 gcaaaaatta gaagagatta tacctaaaaa aagaaggatg gtaaaaacta gtccaaattc 1620 gaggtttgca gggatagaag ctatatatca agctcaaatt gaagctggtg atcgggaaat 1680 tgagataaaa gactctgata gtatttatga aactgacact acaggggatt gtattgaagt 1740 tgaatagtgg ttgagttatt gaagtttaaa attcatatat atatatattt ttaggtggtc 1800 caaaagaggg gtggtccgaa agtggggtac tcaact 1836 // ID Copia-1-LTR_CCi repbase; DNA; FNG; 359 BP. XX AC . XX DT 21-JAN-2010 (Rel. 15.04, Created) DT 21-JAN-2010 (Rel. 15.04, Last updated, Version 1) XX DE Long terminal repeat of a Copia-1_CCi LTR retrotransposon - DE consensus. 
XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_CCi; KW Copia-1-I_CCi; Copia-1-LTR_CCi. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-359 RA Kapitonov V.V. and Jurka J.; RT "Families of self-primed copia LTR retrotransposons from diatom RT and fungus."; RL Repbase Reports 10(4), 552-552 (2010). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data produced by the CC Broad Institute http://www.broadinstitute.org/ in collaboration CC with the Coprinus research community. XX SQ Sequence 359 BP; 85 A; 85 C; 59 G; 130 T; 0 other; tgttggaata caatgcacat gtcacatcta ttccatttac gggtacaagg atgcccaatt 60 atgtacgaac gtttcatgta tccttatacg gacattctat tgttctctga tctcggatat 120 attaagtagt tctattgttc tcttgtatcc cttgttccca attcggacat cttattgttc 180 taccgtatat agacatccct ttgtccccag acgtatatag gtagtgactg caagtgggag 240 ttcccccatc gaactcttcc ccacttgcag gtatgtcctc attgtactta tttatatctg 300 tactaatatc atcgaactct tccccacttg cagtggtaca tcgtcttacg tgtttaata 359 // ID I-4_AO repbase; DNA; FNG; 5434 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE A family of I non-LTR retrotransposons - a consensus sequence. XX KW I; Non-LTR Retrotransposon; Transposable Element; endonuclease; KW reverse transcriptase; RNase H; I-4_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-5434 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-5434 RA Kapitonov V.V. 
and Jurka J.; RT "I-4_AO, a family of I non-LTR retrotransposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 12-12 (2006). XX DR [2] (Consensus) XX CC It is a family of I non-LTR retrotransposons. It contains well CC preserved ORF1 and ORF2 coding for the gag-like and CC endonuclease/RT/RNase H proteins. XX FH Key Location/Qualifiers FT CDS 163..1392 FT /product="I-4_AO_1p" FT /translation="MAIPPAGVAEPHPTNTEETPRIFTQEVFRPRIPLTRK FT RRKRGVDTEGNEPISNPEFATCGENNGRVTNQDVRQFFTSFKEALAHQTEI FT IEAARAEIRELKAEQQLLQTQNVELREEIQALRAKTEAQQLNTPPTKSWAA FT VVAGNPVPDPKTTVPRPRNEPNCVRISTAPTLDEEIDNDRFSRFLPTDAAN FT THIRTALLNAEPTKEVQVAGIGPTKTGYVIRFRDAQSAETARNNTEWLEEL FT GNNTKVVKPRFGIVVHRVPTEDFNLEENKKQGIEKIMIENDLYEKEFRVED FT IAWLKKRDMPMGKSASMGIWLDSPEAAEWIINNGLLVGQRFIGSIEPYRVE FT QKRCRRCQQFGHLAWSCKKQVKCGYCAREHDQRHCFPGIRPKCSDCNGEHP FT TGDRMCQAPLNPRSSQ" FT CDS 1392..5294 FT /product="I-4_AO_2p" FT /translation="MTTNLRILQLNIMKSRPGMEALINDHQSQNLDMLLIQ FT EPPMTAYRSHVNHSAWRLYRPTYTDESVRFRSLLYVNRRISTSSHRQIHCN FT HPDVVAIKIWTPELQYLIFSVYIPPVALYEAPEVSSAQQILEEIQTSIRQH FT AEGNNRVTKLILAGDWNRHHPAWSHRPVHHSFAEHAEELINFFQAHELQWC FT LAPGQPTFWSLKEPGKTSVLDLTLTNNTERLIKCQLYHHHYGSDHRGTYSE FT WSLQPKQNVKLKLKRAYDRADWTKVGQDILNLIDPQPRILSSQDLDQVVEN FT LVHTTTTVLDQHVPFLAPSPYSKRWFTPDLKVQQTEVNQIRRRWQDGCAIL FT GPSHPMTKTLFEEMRRKRRQWTRAIEKAKSGHWREFLDKAGEGHLWKAATY FT MRPRDADMSIPTLKVDTKEVTDNQEKAEVFLEAFFPKMADPGDEEVESPAE FT ELRWEPITEIEIHRSIRAAKGTTAPGEDGIPTLVWKQLWAYLKETITIIFT FT KSLDLGYYPNQWKRARIVVLRKPGKPDYSAPGAYRPISLLNTLGKLLEAVM FT ARRLSYWAEKYGLLPDTQFGGRPGRNTEQALLVLANAIDRAWVRSRVVTLV FT AFDLKGAFNGVNKTSLDTRLRAKGIPFKARQWICSFMENRQASVTFDDFET FT ENLPLEHAGLAQGSPLSPILFCFYNSDLVDQPVDSNGGASAFIDDYFRWRT FT SPSAEENIKKIQEEDIPRIDEWARRTGASFAAEKTELIHLTRRKSEHCKGQ FT ILINGQVIKPADTAKLLGVIFDKEMRWKEHIQRAVRRATKVNIALGGLRHL FT RPEQMRQLYQACVTPTIDYASTVWHNPLRDKTHLRLLRTVQRTALIRILSA FT FRTVSTEALEVESHILPTHLRLKQRAQITAARLSTLPGNHPIHGVIVRAIA FT RSSHIGSGQRFPLAETMRTMDLNRLQALETIDPTPLAPWRTQPFTEIEIEP FT DREKAKANASARQAMTGATVFSDASGQQNQLGAAAVALDKNQQILGSRQIS 
FT IGSMSYWSVYAAELMAIYYAIGLVFQLAQKNQTTATTTRGPATILSDSMSA FT LQAIANAWNKSGQRILQAIHQAAGELKARGIPLRLQWVPGHCGDPGNETAD FT RLAKEAVGLEKKHPFRHLLSREKGYIRDRISKEWEQEWRTSKKGGHLRKID FT RTLPSSRTRRLYGSLPRNRAYLLTQLRTGHSWLATYGKQHRFQEEEKCECG FT AVETVVHVLIDCPRLNRLRQELRRKIGRAFNNISDMLGGAEQGKEGRLQDA FT PQDSSVLGAVLDYAEASQRFRSRAPRGRQNRTPGIGQHRP" XX SQ Sequence 5434 BP; 1643 A; 1446 C; 1341 G; 1004 T; 0 other; gacgacagca acaagcatca tactgtacag tggtcgatcc agctgtagca ttatcgagcg 60 gggactctgt atgaatatat ggcaactgta gttgtgccat atcaaaattg ttgagtggtt 120 gaagctgtca ttgtgtgccg agcgggaact agtctcggcc cgatggcgat cccgcccgca 180 ggggttgccg aaccccatcc cacaaacacg gaggaaacac cacgaatctt cacgcaagaa 240 gtctttcgcc caaggatccc attgacaaga aaacgccgca agaggggtgt agatacggaa 300 ggaaatgagc cgatctcgaa ccctgaattc gcgacctgcg gggaaaacaa tggaagggtg 360 acaaatcaag atgtccggca attcttcacc agcttcaagg aggcacttgc ccaccaaaca 420 gaaatcatcg aggcagcaag agcagaaatt cgggaactga aagcggaaca gcagctcctg 480 caaacacaga atgtagaatt gcgagaggag attcaagccc tccgggccaa aaccgaagca 540 caacagctga acacgccccc gacaaaatcc tgggccgcag tggtagccgg taacccggta 600 ccagacccca aaacaactgt tccacgcccc cgaaatgaac cgaattgcgt tagaatcagc 660 actgcaccga ctctcgacga agaaatcgat aatgaccgct tctcgaggtt cctgccgacc 720 gatgcggcaa acacacatat cagaacagcc cttctaaatg cggaacccac gaaagaagtg 780 caggtggccg gcatcggccc cacaaaaaca ggatatgtca tccgattccg agatgcccaa 840 tccgctgaga cagcccggaa caataccgag tggcttgaag aactgggaaa taacactaaa 900 gtagtgaaac ctcggtttgg catcgtggtc catcgagtcc ctacagaaga ctttaatctg 960 gaagagaaca agaaacaagg aatagagaag atcatgatag aaaatgatct ttatgaaaaa 1020 gaattccggg ttgaagacat tgcatggcta aagaagagag acatgcccat gggcaagtcg 1080 gcctcgatgg ggatttggct tgattcaccg gaggcagcgg aatggatcat caacaacgga 1140 ctgctagtag ggcagaggtt catcggaagc attgaaccct accgggttga acagaaaaga 1200 tgccgtcgct gtcagcaatt tggccaccta gcttggtcat gtaagaagca agtaaaatgt 1260 ggctactgtg caagagagca tgaccaacgc cactgtttcc caggaataag acccaaatgc 1320 tcggactgca atggagaaca cccaacagga gaccggatgt gccaagcacc ccttaacccc 
1380 aggtcgtccc aatgacaaca aaccttcgta ttttacagtt gaatatcatg aaatctagac 1440 cgggaatgga ggctcttatc aatgatcatc agagtcagaa tctggacatg ctcctgatcc 1500 aagagccacc aatgactgct tatcgaagcc atgttaacca cagtgcatgg agactttacc 1560 gaccaaccta tacagacgaa tcagtccggt tccgcagcct tctctatgta aaccgaagga 1620 tctcaacatc atcacatcgg caaatccact gcaatcatcc ggatgtggtg gccatcaaaa 1680 tctggacccc agaactacaa taccttatat tctccgtcta tatcccacca gtcgcactgt 1740 atgaagcgcc agaggtttcc agtgcacaac agatcctaga agaaatccag acaagcatcc 1800 gacaacatgc agaaggaaac aaccgagtaa caaaactcat tctggctggg gactggaacc 1860 gccaccaccc tgcatggagc caccgtcctg ttcaccactc tttcgcagaa cacgcagagg 1920 agctgattaa cttctttcaa gcccacgaac tacaatggtg cctggctcca ggccagccta 1980 cattctggtc tctcaaagaa cctggaaaaa catcagtcct agacctcaca ctcactaaca 2040 acacagaaag actaatcaag tgtcaactct accaccacca ctatgggtca gaccatcgcg 2100 ggacatactc ggaatggagt ctccaaccga agcaaaacgt aaaactgaag ctgaaaagag 2160 cctatgaccg agctgactgg acgaaggtgg gccaagacat actcaacctg attgacccac 2220 aaccaaggat cctgtcaagc caggacctgg atcaagtggt cgagaatctg gttcatacaa 2280 caaccactgt ccttgaccaa catgtcccat tcttagcgcc atctccatac tccaagcgat 2340 ggttcacccc agatcttaag gtccaacaga ccgaagtcaa tcagatccgc cggagatggc 2400 aggatggatg tgcaatccta ggacctagtc acccaatgac aaaaactctc tttgaagaaa 2460 tgcggcggaa aagacgacaa tggacaagag cgattgaaaa ggcaaagtcc ggccactgga 2520 gagagttcct tgacaaagca ggcgaaggcc acctgtggaa agcagccacc tacatgcgcc 2580 cccgtgatgc cgacatgagc atcccaacac tcaaggtgga caccaaagaa gtcacagaca 2640 accaagagaa ggcagaggtg ttcctggaag ccttttttcc aaaaatggca gatcccgggg 2700 acgaagaagt ggagtccccc gcggaggagc tgcggtggga gccgattaca gaaatagaga 2760 ttcaccgatc aatcagagca gctaagggaa ccacagcccc tggcgaggat ggtatcccaa 2820 ccctcgtctg gaagcaacta tgggcatacc tgaaggaaac catcaccata atcttcacaa 2880 aatcactaga ccttggctac taccccaatc aatggaaacg ggcacgaatc gtggtactgc 2940 ggaagccggg taaaccggac tactctgcac ctggggccta ccggcccatc tcgctgctga 3000 acaccctggg gaaattactg gaagcggtta tggcccgaag gctgtcctat tgggctgaga 3060 
aatacggtct gctaccagac acgcagtttg ggggcagacc aggacgtaac actgagcaag 3120 cccttctagt acttgctaac gcgatcgatc gagcatgggt gcgatccagg gtggtcacgc 3180 tagttgcctt cgatcttaaa ggcgcattca atggggtcaa caaaaccagc cttgacactc 3240 gtttacgggc aaaaggtatc ccatttaaag cccgacagtg gatctgtagt tttatggaaa 3300 atcgacaagc gagtgttaca tttgacgact ttgagactga aaacctcccc ctagaacatg 3360 ccggcctcgc gcaaggatcc ccactctcac caatcttatt ctgcttctat aactcggacc 3420 tggtagacca accagtggac agcaacggtg gtgcatctgc atttatcgat gactatttcc 3480 gctggagaac cagcccatca gcggaggaaa acatcaagaa aatccaggaa gaagacatcc 3540 cgcggattga cgaatgggcg cgacgaaccg gcgcgtcttt cgccgctgag aagaccgagc 3600 taatacactt gactcgccgc aagagtgagc attgtaaagg gcagattctc ataaacggcc 3660 aggtcattaa accagccgac acggctaagc tcctaggggt catctttgat aaagagatga 3720 gatggaaaga acatatacag cgggcagtga ggagagcgac taaggtgaat atagcccttg 3780 gggggctcag acacctccga ccagaacaga tgcggcaact ctaccaagcg tgtgtaacgc 3840 cgaccataga ctatgcatct acagtctggc acaacccact cagagacaaa acacacctaa 3900 ggctcctgag aacggtccaa cgaacggccc ttatccggat cctctctgct tttagaacag 3960 tatccacaga ggcactggaa gttgaatccc acatattacc cacccacttg cgacttaagc 4020 aacgggcaca gataacagcg gcccgcctca gcacattacc aggaaatcac ccaatccacg 4080 gagtgatcgt tcgagcaata gcgcggagtt cacatattgg aagtggtcaa cgattccccc 4140 ttgcagaaac gatgcggact atggacctta accgtctaca agccctagaa accattgacc 4200 caacacccct agcaccatgg cgaactcagc ccttcacgga aattgaaatc gagcctgacc 4260 gagagaaggc caaagccaac gcctcagcaa ggcaagcaat gacaggtgcc acggtgtttt 4320 cagatgcatc aggacaacaa aatcaattgg gtgctgcagc agtagccctg gacaaaaatc 4380 agcagatttt ggggtcccga caaattagta tcggttcaat gagctattgg tctgtctatg 4440 cagcagaact catggcgatc tattatgcaa ttgggctggt ttttcaactg gcgcaaaaga 4500 atcaaactac tgcgacaaca acacgaggcc cagcaacaat cctcagcgat agtatgtccg 4560 cactgcaggc aattgcgaat gcatggaata agtcaggcca gcgaattctt caggccatcc 4620 atcaggcggc tggggagcta aaagcccgag gaatcccgct gcgactacaa tgggtcccgg 4680 gacattgcgg tgaccctgga aatgagacag cagaccggct tgccaaagaa gcagtaggcc 4740 tagaaaagaa 
acacccgttc cgacatcttt tgtcccgtga aaagggatat atccgcgata 4800 gaattagcaa ggaatgggaa caagaatgga gaacctccaa gaaaggaggg cacctccgca 4860 aaatcgatcg gaccttaccg tcaagccgca cccgccggct ctacggctca ctccctcgca 4920 accgagctta tctgctcacc caactccgga ccggccactc gtggctggca acatatggca 4980 agcaacatcg cttccaggaa gaggaaaaat gtgaatgtgg ggcggtagag acagtggtcc 5040 atgtgttaat tgactgcccg cggctaaaca ggctgcgcca agagctacga cgcaaaatag 5100 gaagagcatt caacaacatt tcagatatgc taggaggagc tgaacaaggt aaggaaggta 5160 ggttacaaga tgccccgcag gacagcagcg tcttaggcgc ggtactggac tatgcagagg 5220 catcgcagag atttcgaagt cgcgcgccac gagggcggca gaacagaaca ccaggcatag 5280 gccaacacag gccttgacga ggctccaagt tcgtcaggaa agtaatagtt tactgtaaat 5340 agacgcacag atagagtaca gatagtggcg gtcccgcata tcgccctgag gcgaggggtc 5400 ggcgtatgaa tcatcatcat catcatcagc tgta 5434 // ID Gypsy-90_MLP-LTR repbase; DNA; FNG; 1682 BP. XX AC AECX01000233; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-90_MLP_; KW Gypsy-90_MLP-I; Gypsy-90_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-1682 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000233; Positions 8294 6613. 
XX SQ Sequence 1682 BP; 450 A; 205 C; 306 G; 721 T; 0 other; tgtggtaatc aaaactacag tttgaatgta ttttgatttt tttcttatta taatttaatt 60 tattttttta ttatgaatta atttagtttt ttttattttt tttatcatta caaatttaaa 120 ttatttttta tcttatttta ttattttcta ttctttttta taatttcgat attttaattt 180 tttaatttta attattactt ttttttatta tcacaaattt tatattttta atttttaatt 240 tttttatttt gttaactttt aatcttgaat tttattttat tttttttatt ttatgggttt 300 atgaggtatg ggttagttag gggtgtatgg gatatgtaga gtacataagt gttaagtggt 360 aatttccaaa attatgggtt gtgcatggtt ttattaaggt atgcaggatt ttatgggttt 420 tccattatgg gtttgaacaa ggctgaattt cgaaaattga ggacttggag gaaaaagtag 480 ggtggtggta aactgagata agctttaggg ttcaaagtca aatggtcacc atagtagatg 540 ttacaacaac aaagttttgc aatacaacat tgaaaatgac aattacattt gacacaggag 600 aatttccgac attacaaaac ttgacaatga ctttcgaaga ctgagaaatg catagttttg 660 gcagggaact ttgaattcaa gggtaccaca gcaccatttc agggtagatg ttagttgggg 720 tagattctat tttcgcatat ttcccaacaa ctttaatttt aaacatcatc cttatcaact 780 ttagtggtaa tatcaggtgg taaactgggt aaatccaatt tcgagtattt tcacataaaa 840 tcgcatttca ttttaagggt acaaccaata gagttagagt ttagcttggc tgagcaaagt 900 tgtagagcgg cagcagcctt ttttttctgt ctcttcagaa tttttatatg gtcccttttt 960 cttatcttaa ttcagctttg aagggaaaat ggcggaaatt tgggtttttg tggtaaacag 1020 ctaagttata gtgctgcttg tgagcttgaa agtttgtaga ttgtcaatgt gtgtgcgctg 1080 aatctgtcta tcaactttat tttgggggtt cattgagagt tatgctgtga aaaagttgta 1140 tgaaggtgaa tttcgagttt gaggggtgtt ttttgtctta tctttttttt gtgtgtgttt 1200 caattttgat atagtcaagt gtctcttttc ttcccctttg ggatcctttt atttagcaaa 1260 aattattatt agaattgaga ataaagcttt gtatcttttt attattatta tctgggtaaa 1320 caactaatga aagttgggat tttataatat tagacccttt attcactttt aataatttct 1380 tgttgagttg aaagctcatt tgctcatttg gaaggtgcct gagtgcgcct agatagtgct 1440 acttttgatt ttattatcct cttttagcct ggttcttatt aatccaaaga cttgtaggtc 1500 ttaaatttaa ttcatcagta atcaaagtcc ttgtttaaga agggcatttc tttctttgtt 1560 tgaagtcttt caattcacca tctcaagact tattcttctt ttgattgtgt gtgttgttga 1620 tttggtcttc attcttcccc ttcttggctc 
tgctttcagc tgtgtactga aagagttcta 1680 ca 1682 // ID TCN3-I repbase; DNA; FNG; 5061 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 21-APR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR retrotransposon - internal consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW integrase; reverse transcriptase; RNase H; TCN3-I; KW internal portion. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-5061 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-5061 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-5061 RA Gentles A. and Jurka J.; RT "C. neoformans LTR retrotransposon TCN3."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX CC 502 bp LTR deposited as TCN3-LTR. 97% average similarity to CC consensus. 
XX FH Key Location/Qualifiers FT CDS 17..4700 FT /product="ORF1p_TCN3" FT /translation="HLPFYTISTMPKSAQDSPTELLARLVDRVQQQDDTLA FT LLTELLAKQNKLSATKERYTPRGIPLPKASDIPRFEGPLGDADRALTHLRR FT LQQVLRTHGLLTPSDDEEVEERRRVTVIELANNSIECAGLVGWVEQDGRRM FT EADGITTWETWSAAFKAKAMPSNWEFRESRALFRLSFHEVSSEGWKKFDNA FT VILHRAHLLGSKLYPTDQQLTSFYRAACPERLFLRLVDLPEFHVDDLDALR FT ALISNHVERIQHEDAASRIRSRPTNQVSSDNVQLPHSMSYYHDPSHALPYY FT LSPAGRHIREVFRKENRCYDCRKTGHQHSKCPTRTRAPTPKVNQLETELTA FT GDATSAYLTLAQTVPPADDDTCDATYALFHPLLSISVPVSADSRLSAPSPK FT FLLDTGASTTFVDPRLAARLGWEVKKGLVRMRVRLAGGLAGPLVTDMVVGS FT FSLGGRMYRVDGVLMDLHGTYDGILGLNFFARHGLLAESNSFVRLLEAGGV FT NLSALGLQKLGAPVSHAAANPASTTATVSFADTRPNLSQTRAAAESDSLTD FT VLRKLQTEFHDVFCDDLGDVRNFPTISKTKSGVRFEINLKHGATPHRSPPY FT RVPEALLPRFREMLLEHLNAGRLRYSSSPWASPAFLVSKGNGKFRMVCDFR FT ALNNVTVPDMYPMGNVQDILHRAARKGKIFAKLDCKDAFFQTLMKEEDIPK FT TAITTPLGLLEWVVMPQGIRNAPAAQQRRINEALQGLTGECCEAYVDDIII FT WGKDAKDLHDNIVSVLSALRRSGLRCSREKSKLFLDEVAFLGHIIRPGQIL FT PDPAKIARVEQFPLPVNSHQLHSFLGLVNYLRDFVPNLADHTAVLHATLPP FT NAAAEKAYYKAVKMHKGHLPEGWTGWRWSFGPAEKAAFEATRRAVSTVPCL FT AVIDYDAVKAGKQQVFLFTDASNTGTGAWIGVGTSRESAQPVAYDSRTFNS FT AQRNYPVHDRELLAIINALDHWRPLLYGIPVHVYCDHFTLQWFLGQRNLSP FT RQLRWLSTLKDFDLRIEYIKGEFNTLADYLSRHAPSDAAEPADPSLDQSPV FT SVHATMTYEPTLDPDTLRAIAQGYQGDVLFKEWLADPSTAPGVTFHDHDTH FT RLLLVDNRLCIPDVNTLREELMRQAHEGTAGHLGVEKTMEVLRSGYFWETM FT SKDVREFVRACHLCQQANAPTTKPAGPLHPLPVPRDKFDDIAIDFVGPLPS FT SGGHDYLLTITDRLTGFIELVPCSTTINARDLAILVWDRWVSRYGLPLSIT FT SDRDTLFTSRFWTTLWEQQNVKLKMSTAFHPQTDGASERTNKTVVQLLRSW FT VDRHGKSWVKYLPRVSQAMNNTVRRSTGFSPAQLVFGRRLRTLPSLPRPPS FT FIQASLPTRAEWTLAADRTDLSLADARDNLLLAKHRMAVQANRHRRPEVVY FT KVGDWVWLDTRNRLKEFRAGDGEYRAAKFFPRFQGPYQVQEANPALSVYRL FT HMNDRTYPKFHGHLLKPYLSSPRFHQTSPVTHQXDTGRRSILQILDDRVYR FT GHRQLRVVLSGDGPNGQWRNLDDLRTHDGFRALYDEYIGDDELALX" XX SQ Sequence 5061 BP; 1048 A; 1495 C; 1246 G; 1267 T; 5 other; cttttttcga aactagcacc taccttttta caccatatcc accatgccca aatcagctca 60 agactctcca acggagttac tcgcgcgtct ggtcgaccgc gtccaacaac aagacgatac 120 gctggcgctc ctcactgagc 
tgctggctaa acaaaacaaa ctatcagcta ccaaggaacg 180 ttatacaccc cgaggtatac cattgcccaa ggcgtcagac atccctcgtt tcgagggtcc 240 ccttggcgat gcggaccgcg cccttaccca tttgcgtcgg cttcaacagg ttttgcgaac 300 gcacggcctt ctgactccct cggatgacga ggaggtggaa gagagacgcc gtgtgacagt 360 gattgagctg gcgaacaatt cgattgagtg cgctggtctc gttggttggg ttgagcaaga 420 cggtcgccgt atggaggccg atggaataac gacatgggag acgtggtcag ccgccttcaa 480 ggctaaggcc atgccgtcca actgggagtt ccgcgaaagc cgtgctcttt ttcgtctctc 540 cttccacgag gtgtcgtcgg agggttggaa aaaattcgac aatgcggtga ttttgcaccg 600 tgcccacctc cttgggagta agctataccc gaccgaccaa caactcacca gtttttaccg 660 tgcggcctgt cccgagcgcc tttttcttcg cttggttgac ttgcccgaat ttcacgtcga 720 cgatctggac gcactgcggg ccttgatcag caaccacgtc gaacgaattc aacacgagga 780 tgccgcctct cggattaggt cccgtcccac caaccaggtt tcttccgaca acgtccaatt 840 accccactct atgtcctact atcacgaccc ttcccacgct ctgccatact atctctcccc 900 ggcgggccgc catatccgtg aagtcttccg caaggagaac cgatgctacg attgtcgtaa 960 aacggggcat cagcattcca agtgccctac ccgtacacgc gctccgactc ctaaggtcaa 1020 ccaactcgag actgaactga cggcaggcga tgccacctcg gcctatctga cgttagccca 1080 aaccgttcct ccagccgatg atgatacctg tgacgcgacg tacgctcttt tccatccttt 1140 gctctccata tccgtcccag tctccgctga cagccgttta tcagccccct cccctaaatt 1200 cttactcgac acgggggcgt ccacaacctt tgtggatccg aggttggcgg cgagactggg 1260 ctgggaagtg aagaaagggt tggtacggat gcgagtgcgt ctagccggag gtttggcggg 1320 tccgttggtc acagatatgg tcgtcgggtc gttttcccta ggtggtcgca tgtatcgagt 1380 cgatggtgtc ctgatggatc tgcacggtac ctacgatggt atcttaggac taaatttctt 1440 cgcgcggcac ggattactcg cggaatcaaa ttctttcgtt cggttattgg aggcgggcgg 1500 tgtgaacttg tctgcgttag gcttgcaaaa gttaggtgcg ccagtttccc acgccgccgc 1560 aaatcccgcc tcgactaccg ccacagtgtc attcgccgac acccgcccca atctttcgca 1620 gacccgcgcg gccgccgagt ccgacagtct gacggatgtt ctacgcaagc ttcagaccga 1680 attccacgat gttttctgcg acgacctagg cgacgtgcga aatttcccca ccatctccaa 1740 aacaaagtct ggcgtgcgtt tcgaaattaa cctgaaacat ggtgctactc cccaccgctc 1800 gccaccctat cgagtaccgg aagccctgct tccacgtttc 
cgagagatgt tattagaaca 1860 tctgaatgca ggacgtcttc gatactctag ttccccttgg gcctccccgg cgttcctcgt 1920 ttccaaaggt aatggcaaat tccgaatggt ctgcgatttc cgtgctctga acaatgtcac 1980 ggtccccgac atgtacccta tggggaacgt ccaggatatc ctccaccgtg ccgctaggaa 2040 gggcaagatt ttcgcaaaac ttgactgtaa ggatgccttt ttccagacgc taatgaagga 2100 ggaggacatc ccgaaaaccg ctatcaccac tcctctcggt ctcctagagt gggtggtaat 2160 gcctcaaggg atccggaacg cgccggccgc tcaacagcgt cgcattaatg aggcgttaca 2220 aggtttaact ggggaatgtt gcgaggctta cgtggacgat atcatcatct gggggaagga 2280 tgctaaggac ctgcatgata atattgtgag tgttctttct gccctacgtc gaagtggtct 2340 tcgctgctcg cgtgagaagt caaagctgtt cctcgacgaa gtagcgttct taggccatat 2400 catccgcccc ggacagattc tgcccgaccc cgccaaaata gcacgcgtcg agcagttccc 2460 cctcccggtc aattctcacc aacttcactc gttcctcggt ctcgttaact accttcgcga 2520 tttcgtacca aacctggccg accacaccgc cgtgcttcac gccactcttc ctccgaatgc 2580 ggcggctgag aaagcttact acaaggctgt taagatgcat aagggacacc tccccgaggg 2640 atggactggt tggagatggt cgttcggccc tgcggagaag gcggcctttg aagcgactcg 2700 acgcgcggtg agtactgtcc cctgtcttgc cgttatcgat tatgatgctg tcaaagcggg 2760 aaagcagcag gtctttctat tcaccgacgc ttccaacaca ggtactggcg cttggattgg 2820 tgtcggtacc tctcgggagt ccgcccagcc cgtggcctac gattcgcgca ccttcaatag 2880 tgcacaacgg aactatccgg tacacgaccg tgagctgctg gcgattatta atgccctcga 2940 tcattggcgc cctctgctat acggcattcc agtccacgtc tactgcgatc actttaccct 3000 tcaatggttc ctaggtcaac gtaatctctc tccccgtcag cttcggtggc tgagtactct 3060 gaaggatttc gacctccgca tcgaatacat caaaggggag ttcaatactc tcgccgatta 3120 cctctctcgt catgctcctt cggacgccgc cgaacctgcc gacccctctc tggatcaatc 3180 accggtttca gttcatgcca caatgacata cgagcccact ttggatccgg acacccttcg 3240 cgcgattgcg cagggttatc aaggtgatgt attattcaaa gaatggcttg ctgatccgtc 3300 tactgctcca ggtgttacct tccatgatca tgatacccac cgacttctcc tcgttgacaa 3360 ccgcttatgc attccggacg ttaatacact tcgtgaggaa cttatgcggc aagcccatga 3420 aggcactgct gggcatctgg gagtggagaa aaccatggag gtactcagga gtggatactt 3480 ttgggagaca atgtccaagg atgttcgtga gtttgtccgt gcgtgccact 
tatgtcaaca 3540 ggcgaacgca cccacgacga aaccggctgg tcctttgcat ccgctcccgg tcccacgcga 3600 taaattcgat gacatcgcta tcgattttgt gggaccactc ccttcttccg gcggacatga 3660 ctatcttctc acgatcacgg atagactaac cgggttcatt gaattggttc catgttctac 3720 cactattaac gctcgcgacc tcgctatctt ggtttgggac agatgggttt ctcgctatgg 3780 cctcccgctc tctattacgt ccgatcgcga tacactattt acctcacgat tttggacgac 3840 cctctgggag cagcagaacg tcaagctcaa gatgtccacg gccttccatc cgcaaaccga 3900 cggcgcctcg gagcgtacga acaagacggt ggttcaactc cttcgtagct gggtggaccg 3960 acatggcaaa tcttgggtta agtaccttcc tcgtgtttcc caagccatga ataatactgt 4020 cagacgttcc acaggtttct ctccggcaca actggtcttt ggccgccgac tacgtaccct 4080 tcctagcctt ccccgwcctc cctccttcat ccaggcgtct cttcctacgc gggctgaatg 4140 gactctcgct gcggatcgca ctgacctctc gctcgccgac gcccgcgata acctcctttt 4200 ggcgaagcat cgtatggccg tccaggccaa tcgccatcgc cgtcctgaag tagtctacaa 4260 ggttggggat tgggtgtggt tggatacccg aaataggttg aaggaatttc gtgctggaga 4320 cggggaatat cgcgctgcta agttcttccc ccgcttccag gggccttacc aggtacaaga 4380 agctaaccct gccctctcag tttaccgtct tcacatgaat gaccggacat acccaaaatt 4440 ccatggtcat ctccttaagc cttatctgtc gtcgcctagg ttccatcaaa catcaccggt 4500 aacccaccaa tyggacactg gacgtcgcag cattctycaa atcctggatg atcgagtgta 4560 tcgcggtcat agrcaactcc gggtagtact cagcggcgac ggtcctaatg gycaatggag 4620 gaatctcgac gacttgcgta cacacgatgg tttccgggcc ttatatgacg aatatatagg 4680 cgacgacgaa ctggccttgt gagtcccctt cctgtttgct cctgttttcc tttgtatacg 4740 tgtctggggg gctgcttgcg ttcctccgtg ggttttttgg ccattattgg tttttttccc 4800 tggcttgcgc tcagcaagac taaattaaca ggactttctc acagagtgtt ctcctgccca 4860 gtccttccaa cgtggatgga tttggaaagt aacgtgttga caggggggcg ctggcagtcc 4920 ccacggcctg gtgatcgttc gttcttttgc ccctattctc tccttttcaa ttttacgttc 4980 ctttgcctta tctcttaatt ctttctttta tctttttgat ccggtcgagt cttcccctgg 5040 ctctcatctg gacgggggag a 5061 // ID Gypsy-6_RO-LTR repbase; DNA; FNG; 580 BP. XX AC AACW02000056; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 
16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_RO_; KW Gypsy-6_RO-I; Gypsy-6_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-580 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000056; Positions 299169 299748. XX SQ Sequence 580 BP; 221 A; 87 C; 66 G; 206 T; 0 other; tgttgtagaa cagagtctga tcataacgga cattgaataa tcacaactaa aagaagtata 60 ttaaaattga taaaagataa taatttaaat ctttattaaa taaaatatat tcttcttctt 120 tttatataaa actatcacta ttcgattcat tctataaaat aaacatgaat actaaataag 180 tcatgatctt tactattcat catttctgta aatgaataat aatggatcag gtacctagtc 240 aactttgatc tgactggagg catttcatat aaatatatct ttcctctcaa tgaagacggg 300 acaactctcc cagatcagtt aaagaacaat aagatctata aaaacaacga attaaaaatt 360 ttcataatcg attcagtgat cactattcaa tcaaacgtgt caatgattgg atctacgctc 420 ataactactc atttacccgt caattgatag aagaatataa atagtagact ttgtaatgtt 480 ttctttttaa caattaagat tgtcttaaat aaagttcagt ctcaaaaagt attcttttgg 540 tttatctttg ttatttcttt tactttacca gaacataaca 580 // ID CIRT2_CA repbase; DNA; FNG; 1826 BP. XX AC AF205929; XX DT 11-MAY-2005 (Rel. 10.05, Created) DT 03-JUL-2007 (Rel. 12.08, Last updated, Version 2) XX DE Candida albicans transposon Cirt2 transposase. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW DDE transposase; CIRT2_CA. XX NM CIRT2_CA. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-1826 RA Goodwin T. and Holton N.; RT "Characterization of transposable elements of Candida albicans."; RL Unpublished. XX DR Genbank; AF205929; Positions 1 1826. XX CC 45 bp TIR. 
XX FH Key Location/Qualifiers FT CDS 160..1710 FT /product="Cirt2ORF" FT /translation="MAKAISDIKNGLLLSYRKAAKKYNLCHETIRKRMNGV FT LPKQIAHRGKQLFTPQQEKEIVKWIIESDEAGNGRSRHDIVEYAELFLLLD FT RQLASIGGSWFERFKKRHKEIHVVQGRKISSLRVKAATPDQIKEYFHKYDL FT IVRQHQIPNENIFNYDESGFIMGQGKSSRVAVPSYKNRTYVKSTEGRDSCT FT VIEAISMSGEKLIPGIIFKGQTLRTGWFNDDASDYYYSVSKCGYTSYWLSW FT RWLEEVFIPQVKEKTNQGKVLLIMDGHGSHKTKKFRETCEDNNIIPLYLPP FT HSTHLLQPLDLGIFGPIKYGYKVKLSKLAHALGTDPVKQQLFLLNYYEARQ FT EKLTKERIIRSWETAGLNPFDVDKVLNSSQMIAAKINKEIDDYENNRDEHT FT QQDDVEIGLYLRGTPDEELIDKLTKKIDFLELENTQLKTDVAHLQAELTSA FT KLEIDNYKEALKNKKPPGRSSAIPMDENQGFKRAAPYAKEYRQNPPKKNKR FT KALTDMTNCTNGSNSYIRSL*" XX SQ Sequence 1826 BP; 674 A; 290 C; 374 G; 488 T; 0 other; tacctcacgt gtcggaattt gttgacaccc cacgactttg gcaaaaaaaa taaaaaattg 60 taaacaaaca aataaccaat actctttcca ttttgtttca ataatacttc aatatggcaa 120 ttaaacggaa aaaaaaagca agtggtcgat gaagtggcta tggcaaaagc cataagtgat 180 ataaaaaatg gattattgtt gtcttataga aaagcagcaa agaaatataa cctttgtcat 240 gaaaccataa ggaaacgtat gaatggagta ctacccaaac aaatagccca tagagggaaa 300 caactcttca cccctcaaca agagaaggag atagtcaaat ggattattga gtctgatgaa 360 gcaggaaatg gacgtagccg tcatgatatc gttgaatatg cagagctatt tttactgttg 420 gacagacaac tggctagtat cggtggttct tggtttgaac gctttaaaaa aaggcataag 480 gagatccatg ttgttcaagg aagaaaaata tcaagtttga gagtaaaggc agcaactccc 540 gatcagataa aagaatattt tcataaatat gacctgattg tcagacaaca tcaaattccc 600 aacgaaaaca tatttaatta tgatgaatct gggtttatta tgggtcaggg taagagttca 660 agagtggctg tacctagcta taaaaataga acttacgtta agtctactga gggcagagat 720 agttgcactg ttatcgaagc aatcagcatg agtggagaaa aattaattcc ggggataatt 780 tttaaaggac aaactctaag aaccggctgg tttaatgatg atgcttcaga ctactactac 840 tcagtctcca aatgtggata tacttcttat tggttgtctt ggcgttggtt ggaggaggtt 900 tttattcccc aagtgaaaga aaaaactaat caaggaaaag ttttgctaat aatggatggg 960 catggaagcc acaaaactaa gaagttcaga gaaacctgtg aggacaataa tatcattcca 1020 ttgtatttac cgcctcattc aacccatttg ttacagcctt tggaccttgg aatctttgga 1080 ccaataaaat atggatacaa 
agtaaaattg tcaaaattgg cccatgccct tgggacagat 1140 ccagttaagc agcagttatt tttactgaac tactatgaag caagacagga aaagttaacc 1200 aaagaaagaa tcattcgatc ctgggaaact gctggcctta atccatttga tgttgacaaa 1260 gtgttaaatt cgtcccaaat gattgcggcg aaaatcaata aagagataga tgattatgaa 1320 aataatagag acgaacatac tcaacaggat gatgttgaaa ttggactata tttaagggga 1380 acgccagatg aagagcttat tgacaagttg acgaaaaaaa ttgattttct tgaattggaa 1440 aatacacaat taaaaacgga tgttgcacat ttgcaagcag aacttaccag cgctaaatta 1500 gagattgaca actataaaga agctctaaag aataagaaac caccaggaag aagttctgcc 1560 attcctatgg atgaaaatca aggcttcaag cgagcagcgc cctatgccaa agaatatcgt 1620 caaaacccac caaagaagaa caaacgaaaa gctttaacgg atatgacaaa ttgtactaat 1680 ggtagtaata gttatatcag aagtttgtag gaatttagtg aatagtcgta gagttagagt 1740 tgaatttatg tgttttatga ttattgtcaa attagtaatt ttttgccaaa gtcgtggggt 1800 gtcaacaaat tccgacacgt gaggta 1826 // ID Gypsy-11_RO-LTR repbase; DNA; FNG; 665 BP. XX AC AACW02000050; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_RO_; KW Gypsy-11_RO-I; Gypsy-11_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-665 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000050; Positions 1354 690. 
XX SQ Sequence 665 BP; 234 A; 110 C; 82 G; 239 T; 0 other; tgttaaggat acataagatt atagtcactt tatgttagta aaaggaatcg gatatgatcc 60 tgaatagtat gttgacataa cttggaatca tttataacat tattctcaga acagatcaat 120 aaaagacttt gattagattt ttatattaag ctttattcat attaacttca catagtcaaa 180 taatatggca tggttgtaat atagtttatt gatgatatga gcatgttgaa taaggaatga 240 caattactta ccggatcatc attaagtcat gtaaacattt gtcctaactg agatattatt 300 attagccata atatccaatc acggaactag aggattttat attcatccgt accgtctcct 360 aaatagtcct acatccagag ttatctgaaa ccatcccaat agcataattg acaatacaag 420 ccaggaacag gctaaactaa taattagaaa tagtaatcat ctattctgtc gaattgatca 480 taaatctgtc ctaattcttt atccgtatca tgtctataaa tactctcact ttttacatgt 540 agaattattt tttatcatat tcaatcttag ttatcttaac ttaataaaga aataccaagt 600 tctaacagta tttctttatc tttgttgtta tactactttt acctaaaacc cccctatcct 660 aaaca 665 // ID Gypsy-10_MLP-I repbase; DNA; FNG; 5634 BP. XX AC AECX01001617; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_MLP_; KW Gypsy-10_MLP-LTR; Gypsy-10_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5634 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001617; Positions 69474 63841. XX CC 'TATGA' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 365..1414 FT /product="Gypsy-10_MLP-I_2p" FT /translation="MEDIQRQLNELTNALNQEQTLRRQAEERLAENMQNQQ FT PANSEPPQPMQTAPNPADTAPQTKGPKVATPDKFHGTRGVPAEVFASQVQF FT YIMAHPYQFPDDRSKVIFAMSYLTGTASSWAQPLTNELFDSATSHLVTFER FT FVDNFKAMYFDTEKKTKAEKALRSLNQKGSVATYTHEFNMHASNTGWEVPT FT LISQFEQGLKKEIRVAMVLVQEPFTSIEQISNLAIKIENKLHGTTDTSTSH FT SSTPADPNAMDLSSTYTRLTDEEHAKRLRSGLCFKCNRHGHISADCPDRKN FT TCNPIRNPTNFRGRNGYQSRIAELEIKIASLCSKNDDQEDRGEGTSRADAS FT KNGGARD" FT CDS 1534..2523 FT /product="Gypsy-10_MLP-I_1p" FT /translation="MPHTSRATTTAPLSSLLLIDSGATHNVLGESFVNKTG FT LLHYAFESSRDISGFDGSPSRSSHEIDLIMHSDTSPSRFVITRVKDTYDGI FT LGMPWILRHGHKIDWKKGTFSQDELTIAVAAATSSSPKTLSLGPTLEPVRD FT ARRFGEGVCISNDTLTPPQCVYNSTFCDNHPVSAGKPVYILENLNNPETKL FT PTLSDPVSPESHIATAPAVSSIPENLSPGQLDPVRDTRDFDEGVCIRNDTL FT TPPQCEHDISPLSPRNPEQAGQQDPLLNNVDVEIHTAKTSWSTSARLAATG FT KKDAPPKISRGTCTVSLPPTPAFVCQVQSAKTPTKKAL" FT CDS 2780..5539 FT /product="Gypsy-10_MLP-I_3p" FT /translation="MDLVDSLLDSDKFTKLDMRNAYGNLRVAEGDEDKLAF FT ICKEGQFAPLTMPFGPTGAPGYFQYFMQDIFVGRIGRDVAVFLDDTLIHTK FT KGENHETAVDSVLDIFENHQLWLKPEKCEFSKDQVEYLGLIISHNQVRMDP FT MKVKAVSDWPEPRNVTELQRFIGFSNFYRRFIDHFSATTRPLHNLTKKATQ FT FVWDTRCRAAFNRLKTAFTTAPVLKIADPYRPFLLECDCSDFALGAVLSQR FT CDKDGQIHPVAFLSRSLAQAERNYEIFDKELLAIVASFKEWRHYLEGNPNR FT LEVVVYTDHRNLESFMTTKQLTRRQARWAETLGCFNFTIKFRPGYMNAKSD FT ALSRRPDLAPTNEERLTFGQLLRPENINDKTFTEIASMDSCFESEEVNLEN FT TDHWFDVDILGITDVDEVRIADMTEETMVTDNLIIEEIREKTKASSRLQEL FT MKVIENPFSSRIHATVANYQVKDGLIYHHGRIEVPDDNSLKMKITRSRHDG FT LIAGHPGRAKTLGLVRRCFTWPSMKAYINKYVDGCSSCLRVKSSTKKPYGS FT LEPLPIPAGPWTNISYDMITKLPISNGKDSILTVVDRLTKMAHFIPCNESM FT NSEQLADLLLREVWKLHGTPKTIISDRGSIFVSQITQELNKRLGITLKPLT FT SFHPRTDRQSEIVNKSIEQYLCHFIGYRQDDWEALLPTAEFSYNNKDHVSI FT GVSPFKANYGYNPTFGGIPLGEQCVPAVEERLKTLQDVQDELKECLQASQE FT EMKIQFDRKVRQTPEWKIGDQVWLSNENISTTRPSPKLEHRWLGPFSIIKK FT ISCSTYKLNLPISMKGIHPVFHVSVLRKHKIDEIEGRQEPEPAPIDIEGNT FT EWEVSEILDCRLKNKRREYLISWKGFSTEHNSSEPTTNLDNCKALIDDFNS FT KFPTASSKYKKRRRKK" XX SQ Sequence 5634 BP; 1812 A; 1352 C; 
1205 G; 1265 T; 0 other; tattgtcgga tcctatcgac aagcgacgag gactcctaag aaaaccagaa gaaaaagaac 60 ttacatcaaa accgaaaaga acaagacaga agatttacaa gattagatcg gatttaccag 120 attagattag aattacaatt aattgaagta accttaaaaa agagaacctt accgattcac 180 cattgaaaga caaaccttat ttacattcca aaaccttgat accaaccttc ccactctatc 240 gtacaacgcc acaacgtcac cgacatttag tacccaaaac ttcgacgaag acgtcgatac 300 tggaagtgaa ccggatcatt tgtttgtaga cgttgaatcc ctatcaaacc ccgactcggt 360 caacatggag gacattcaac gacaattgaa tgaacttacc aatgcgttga accaagaaca 420 aacattgcgc cgacaagccg aagagagact tgctgagaac atgcaaaacc aacaacccgc 480 gaactctgaa ccgccacaac ccatgcagac agcaccaaac cctgctgata ccgctcctca 540 gaccaaaggt cctaaagttg caacccctga caagttccat gggactcgcg gcgtaccggc 600 tgaagtcttt gccagccaag ttcaatttta tatcatggct catccgtatc aattcccgga 660 tgatcgtagc aaagtcatct tcgcgatgtc ctacctcact ggcacggcaa gcagttgggc 720 acaacctttg accaacgagc tttttgattc tgctacatcc catctggtca cgtttgagcg 780 ttttgttgat aactttaagg ccatgtattt tgacacggaa aagaagacga aggcagaaaa 840 agcattacgt agtttaaatc agaaaggcag tgtagctact tatactcatg aatttaatat 900 gcacgcgtct aacacaggat gggaagtccc aactctcatc agtcaatttg agcaaggtct 960 caagaaagaa atcagagttg caatggtatt ggtacaagaa ccgttcacct ctattgaaca 1020 aatatccaac cttgccatca aaatcgaaaa caaattacac ggaacaactg acacaagcac 1080 cagccatagc tccactcctg cagaccctaa tgctatggac ctgtctagca cgtacacacg 1140 gttaactgac gaagaacacg ctaaaagatt aagatctgga ttatgcttca agtgcaatcg 1200 ccacggtcac atctctgccg actgcccgga tagaaagaac acttgcaacc ccattcgcaa 1260 ccccaccaat ttcagaggca ggaatggtta tcagagtcgg attgccgaat tagagattaa 1320 gatagcttct ttgtgtagta agaatgacga tcaggaggat agaggggagg gtactagtag 1380 agcagatgcg tcgaaaaatg gcggagctcg agactgattg ttgtgccaat ctcgagcgaa 1440 ggggaagttg aggaattggt cgagttaggt tctagtcaaa ttgtaacatg caataacaat 1500 gatccaagat tattttacag aactgaactc cacatgcccc atacttcccg agccacaacc 1560 accgcgcctc ttagctccct actcttgatc gactcaggag ccacacacaa cgtgttaggt 1620 gagtcctttg tgaacaagac gggactcctc cattatgcct tcgaaagcag ccgtgatata 1680 
tccggctttg acggttcacc cagtcggtca tcacatgaaa tcgacttaat catgcacagc 1740 gacacatcac cttcaagatt cgtcatcacc agagtcaaag acacctatga cgggatcctt 1800 ggaatgccct ggatccttcg acacggacac aagatcgact ggaagaaggg gaccttttcc 1860 caggacgaac tcacgattgc cgttgcggcg gcaacgtcgt caagcccgaa aacactctca 1920 cttggaccca ctttggagcc cgtgagggac gctaggagat ttggcgaggg ggtgtgtatc 1980 agtaacgata cgttaacacc cccgcaatgt gtgtataatt ctaccttttg tgataaccac 2040 cccgtttcag ctggcaagcc tgtttacatc ctagaaaact tgaacaaccc cgaaacgaag 2100 ctcccaacgc ttagcgatcc agtgtcacca gagagccaca ttgcaactgc cccagcagtg 2160 tcgtcaatcc cggaaaacct ctcacctgga caattggatc ccgtgaggga cactagggac 2220 tttgacgagg gggtgtgtat tagaaatgat acgctaacac ccccgcaatg tgagcatgat 2280 atttccccac tttcaccgcg taatccagag caagctggcc agcaggatcc tctcctgaat 2340 aatgtagatg tcgagatcca cacagcaaaa acttcttggt caacttcagc cagattagca 2400 gccacaggaa agaaggatgc cccccccaaa atcagtcgag gaacttgtac cgtctcatta 2460 ccaccgacac ctgcatttgt ttgtcaagtc caaagcgcaa agactcccac caagaaggcg 2520 ttatgatttc aaagtcgaac ttgtcccggg cgcccaacca caggccagtt gagtgatacc 2580 actgtccccg acagagaacg cagctttgag cgagttaatc aacaacggac tgaccaatgg 2640 aacccttcgc cgaaccactt caccatgggc agcgcctgtg ctattcaccg gtaagaaaga 2700 cggtaatctc aggccatgct tcgactatcg taaacttaac gcactgacgg tgaaaaataa 2760 gtacccactt cccttgacga tggacctggt tgacagccta ctcgactccg acaaattcac 2820 taaactcgat atgcgcaacg catacggtaa cctacgcgta gcagagggtg atgaggacaa 2880 gttggcattc atctgtaaag agggtcaatt tgcaccattg accatgcctt tcggccctac 2940 aggagcgcct ggatatttcc agtacttcat gcaggatatc tttgtgggac gtataggacg 3000 ggatgtagct gtttttctgg acgacacact gatacacacg aagaaggggg aaaaccacga 3060 gaccgcagtt gacagtgtac tagatatatt cgagaatcat caactatggc tgaaacccga 3120 aaagtgcgag ttttctaagg accaagtcga gtacttagga ctcatcatct ctcacaacca 3180 agttagaatg gacccgatga aggtaaaagc agtgtctgac tggccagaac ctcgaaatgt 3240 cactgaactc caacgattta ttggattctc caacttttac cggagattca ttgaccattt 3300 ttcagctacg actcgaccac tccacaatct cacaaagaaa gcaacacagt ttgtatggga 3360 cacgagatgt 
agagctgctt ttaacagatt gaaaactgca ttcaccacag ccccagtcct 3420 gaagattgca gatccgtatc gaccattctt acttgaatgt gactgctccg actttgcctt 3480 aggagcggtc ttatcccaac gctgcgacaa agacggccag attcacccgg tagctttctt 3540 atcacggtct ttagcccagg cagagagaaa ctacgagata tttgataagg aactcctagc 3600 aattgtggca tcctttaaag aatggcgaca ctacttagaa ggaaacccca acagacttga 3660 agtagtagtc tatactgatc accgcaacct tgaatccttc atgacaacta agcaacttac 3720 aagaagacaa gcacgttggg cagaaacttt gggatgtttc aatttcacta tcaaattcag 3780 accaggttac atgaatgcaa aatctgacgc gttgtcgaga agaccagatc tcgcccccac 3840 caacgaagaa cgactcacat tcggacaact actccgaccg gaaaacatca atgacaaaac 3900 cttcacagag attgcaagca tggattcgtg ttttgagagt gaagaagtga atttagaaaa 3960 cacagatcac tggtttgatg tggacattct aggaatcacg gacgtagatg aagtacgtat 4020 tgcagacatg acagaagaaa caatggtcac ggacaattta atcattgaag agatacgcga 4080 gaagacaaag gcttcatcga gactacaaga acttatgaaa gttatagaaa atccattctc 4140 gtcaagaatt cacgcaacgg tagcaaatta tcaagtcaaa gacggactca tttatcatca 4200 cggcaggatc gaggtccccg atgacaattc attgaagatg aagatcacaa gaagtagaca 4260 cgatggactt attgcaggac accctggtcg agctaagaca ttaggtctgg tcagaaggtg 4320 ctttacgtgg ccctcaatga aagcgtatat aaataagtac gtagatggct gctcttcatg 4380 cctgagagtc aaaagttcta ctaagaaacc ttacggatcc cttgagccat tgcccatacc 4440 agcgggccct tggaccaaca tcagctacga catgattacg aaactcccaa tctcgaatgg 4500 aaaagatagt attttgacgg tcgtagacag actcaccaaa atggcacatt ttattccgtg 4560 caacgagagt atgaactcag aacaactggc ggacctctta ctgagagaag tttggaaact 4620 tcatggtaca ccgaaaacaa tcatatctga ccgaggaagt atatttgtat cgcaaataac 4680 gcaagaactc aacaaaagac tgggaatcac cctgaaaccc ttgacatcat tccacccacg 4740 caccgacaga cagtcagaaa tcgtcaacaa aagtattgaa caatacttgt gtcacttcat 4800 aggctaccga caggacgact gggaagcgct acttccaacg gctgagtttt cctataacaa 4860 taaagatcac gtttcaatag gagtctcacc gtttaaggct aactacgggt acaacccaac 4920 atttggaggc atacctttag gagaacagtg cgtaccagca gttgaggaga gactcaagac 4980 tttacaagat gttcaagatg aattaaaaga atgtttacaa gcaagccaag aagagatgaa 5040 aattcaattt gataggaaag 
ttcggcaaac accagaatgg aaaattggcg atcaagtgtg 5100 gctgagcaat gaaaatatat ctacaacaag acccagcccc aaactggaac accgctggct 5160 aggtcctttc tctattataa aaaagatttc ttgttcaact tacaaattga atttacccat 5220 ttcgatgaaa ggcatacatc ccgtatttca cgtttcagta ttaagaaaac acaagataga 5280 cgaaattgaa ggcaggcaag aaccagaacc agctccaatt gatattgaag gcaacacgga 5340 atgggaagta tccgagattt tagactgcag attgaagaac aagagacggg aatacttgat 5400 cagctggaaa ggattcagta ccgaacacaa ctcatcggaa ccaacgacga atttagacaa 5460 ttgcaaggca ttgatagatg attttaattc aaaatttcct acagcatcaa gtaaatacaa 5520 gaaaagacgg cgaaaaaagt gagagggcaa gctttttccc acagggtttt ttaacgctgc 5580 ccgtggatga atgcagaact tgcaagaggg ggtttgggca taaaaggggg atac 5634 // ID Gypsy-9_CCO-I repbase; DNA; FNG; 5860 BP. XX AC AACS02000011; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_CCO_; KW Gypsy-9_CCO-LTR; Gypsy-9_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-5860 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000011; Positions 167354 173213. XX CC Positions [4606-4935] - Integrase core CC 'GGTTG' target site duplication CC LTRs are 96% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 2479..4935 FT /product="Gypsy-9_CCO-I_1p" FT /translation="MSIPPSISLSSSTSKSSPSPSKSSPSFTSNPVPIAFV FT NASAYRRAVRMSGSQTFSLSISPSSDNVSRACAASTSTTDTPDLSNVPEEY FT HEFADVFSNAKAYSLAPHRPYDLRIDLDESQPLPPSRLYSLSRSRNPLALR FT EFLDENLRAGFIRSSTSPLGAPILFVKKKDGSLRLCVDYRALNRISKKDRY FT PLPLISDLLAAPARAKVYSKIDLKHAYHLVRIADGDEWKTAFRTRYGSFEW FT LVMPFGLTNAPSAFQRFMNDIFADMTDVSVIVYLDDILVFSDNLSQHREHV FT KEVLRRLRKHGLYASPKKCEFHADRVEYLGYILSPDGLTMSPEKVKVIQDW FT PEPRSVKDVQSFLGFANFYRRFIHGYSDIVVPLTRLTRKDTPFEFSSAARK FT SFETLKSAFISAPILVHWQPDRPLIIETDASDYALAAILSIQLESGEIHPV FT AFHSRTFQAAELNYDVHDKELLAIFEAFKVWRHYLEGAGDPVNVVTDHKNL FT EYFSTTKVLTRRQARWSEYLSQFNLVVRFRPGKLGTKPDALTRRWDVYLKQ FT GGSDFASVNPVNLRPIFASEQLASSLRASTIYIPALRASLVVDSASLHSDI FT KASYSSDPLASSQLPTPSDPRWELPSDGLLRLDGRIYVPDSPGLRLRVLQL FT KHDHPLAGHFGRNKTIELVRRDYVWPKLRSFVADYVNSCTTCRRTKTPRHK FT PYGLLKQLPIPLRPWESISMDLVEQLPSSGGFTDILVIVDRLSKQAIFIPT FT DQYLTSAELARLFVIHVFSKHGVPSNVTSDRGSEFVSHFFRALGEALDMKL FT HFTSGYHPEGDGQTETS" XX SQ Sequence 5860 BP; 1109 A; 1924 C; 1274 G; 1553 T; 0 other; gtagattccg gattcgactt cgtcttgtcg ttccaagtct acgtctatgt ccgatcaaga 60 ccgctacgac tttcgtcgcc ctcgccctcc tccttcagcg tccgcactgg gccaacctcg 120 acccttatcg cactccagac acgagtccct caccgatccc ttctacatct cgaattcgac 180 atcacctgcc atagctgcaa ccgctatttg gcaacctcct acgtctttcc tcggtcaagc 240 taccattcga cgcggcatgt ctaacgacgc tcgcgaagaa gagttcgtag cctcgcatct 300 aggcatccct ggtggcctcg ctaccagcgc cttcggtggt cttcgcgcct cagactctgg 360 gtctggcaac cccagtggtg ctgcaggtcg cgtctccaac tcgcgtcgtg gtgatccgtc 420 ttcctctggc aaaggaggcg atgatccact ccctttcgcc tttgatccct ctggctctcc 480 ccctcgcacg cctcctggtg atccgcctga tcccagggga ggtggtggtg gaggtccccc 540 aggtggcgac tctggtggtc ccccgccgtc tggcagcggt ggtggcgacc ctcctcctaa 600 ccatggtggt ggtggtggac ctggtgaccc gtctggcgat ccgcctggcg gagactccga 660 gactcaagct caagacgact ccaacgactt ccaacccgtc cttcgtcgca ttctcgagca 720 cctcgtcgag ggtcagcagt caaatgctcg tacccttgcg aaaggctctc gagaaactcg 780 agtctcagtc ctccagtacc cctagacttc gtgctcccga 
ccctttcgac ggctcagatc 840 ccgaactcct agacgccttc atcgcccagc tttacagttt atctcggtcg ctacatgcac 900 accttcaagc acccgagaga cgctgtggct actgccctct ccctacctca agggcaaggc 960 gtacacttac ttcgcatctc gaatcaatgt ggctcgcact accgatgcga cgttgccttg 1020 gtttgacaat ctggacctct tcatcgagga gctgactctc gtcttcggcc ccaagaaccc 1080 catcggcgac gcagaacgca aaattcgtgc gttgacgatg acaggacgtg cctaccagta 1140 taccctcgac ttcaatgagc tcgccgcacg catcaactgg aacgaagcgc ctttacgttc 1200 tcagtactac actggcctct ccgaccgcat caagaacgag ttggctaaac gtcctcggcc 1260 tgctacgctt ttggagttgc agaatgtcgt cttcgacatg gacgaacgtt ggcacgaacg 1320 tcaagaggag attcgtcgtg atcaagctct cctcaaggcc ctccaagcca agctatctgg 1380 caaggcttcg tcttcaccct ctctctcgtc tggctctttg gccaccccgg cttcgacttc 1440 atctaactcg accaagactt cgcctcagtc ggccggtaac acttcgtctt cgtcgacctc 1500 ttcctctgcc aaaggaaagt cgaacaagac ttcatcttct tcttcgtcta cgactccctc 1560 gacttcatcg tcgacaccct cgacgtcctc tgcccctcct cgcccttacg ccgacaagct 1620 tgacgctagc ggaaagctca agcctgagga gcgtgcccgc cgtatcaagg aaagcctttg 1680 tatggtgtgt ggcagcacag agcacaaggc gtcagagtgc cccaagtcta gacatcgcgc 1740 tcgctttgct tccgccggtc agcctgccac tgacacgtct gcgggaaaag cttgagcagt 1800 ctgctgacct cgcctcctcc agttgactgc agcctctcag caggagtgcc tgaggttcga 1860 cttaatgctg ctgctctctc cgatccccat gctctctcgt tttcggtctc cgtttattct 1920 tcgacgtctt cgcctatcga cctcgactcc ctggtggatt cgggttgtac ggactgtttc 1980 ctcgactcag actttgtcaa ttctcattcg actttgcaat tttacgaaat tccgccagtg 2040 ccccttagac tccttgatgg atctgttcgt acttggctca cttctgcagc ggatatcatc 2100 ctccgctttc cgtctggtga tgagttcttt ctcaagtgct tcatcacaaa gctggatgct 2160 ccgtatcccc tgattcttgg acacaactgg ctccgccgat acaatccgtt gattgactgg 2220 cgtcgtggtc agatactttc gttcgaatcg gccgataacc cctcttcttt gagctcactc 2280 gtcgcagcaa acgtcgagac ctccgtttct tcatccagcc ttccgtcaca ggcgtctcag 2340 ccttccgtgg tccctttttc gaccaagtct tccgtctctc ttccttcccc catcgacttc 2400 gactcactcg actccgacca tttatcaccg caggaaccga ttgttccttt acctcaagat 2460 tcgcctcaag actcgtcgat gtcgattcct ccttccattt ccctttcttc 
ttctacttcc 2520 aagtcttcac cttctccttc taagtcttct ccttctttca cttctaaccc tgtgccaatc 2580 gcttttgtga acgcttcagc ctatcgtcga gctgttcgca tgagtggttc gcagaccttc 2640 tcactttcta tttctccttc ttcagacaac gtctcgcggg cttgtgctgc ttcgactagc 2700 actaccgaca ctccagatct ttccaatgtc ccggaagagt atcacgaatt tgccgacgtc 2760 ttcagcaacg ccaaggctta ttcgcttgcg ccacatcgcc cttacgactt gcgcattgat 2820 ctggacgagt ctcagcccct tcctccgagt cggctttatt ctctctcccg cagccgaaac 2880 ccccttgcgc tacgcgagtt cttggacgag aatttgcgcg ctggtttcat tcgctcttcg 2940 acttctccgc taggcgcacc cattctcttc gtcaagaaga aggatggctc cctacgtctt 3000 tgtgtggatt accgggcgct taaccgcatc tccaagaagg accgctatcc gttacccctc 3060 atctcggacc ttcttgcagc cccagctcgt gccaaagtct attccaagat cgacttgaag 3120 catgcctacc accttgtgcg cattgcggat ggggacgagt ggaagactgc ttttcgaact 3180 cgctatggct ctttcgagtg gttggtgatg ccttttggtc tcactaacgc gccgtccgct 3240 tttcagcgtt ttatgaacga catcttcgcc gatatgaccg acgtctcggt tatcgtatac 3300 ctcgacgaca tcctggtttt ctccgacaat ctctctcagc atcgtgaaca cgtcaaagag 3360 gtgcttagac gtctccgcaa gcacgggctt tatgcttcgc ctaagaagtg tgagttccac 3420 gctgaccgcg tcgagtacct tgggtacatt ctttcacccg acggcctcac catgtcgcct 3480 gagaaagtca aagtcatcca ggattggcct gaacctcgtt cagtgaaaga cgtccagtct 3540 ttcctcggtt tcgccaactt ctatcgtcgt ttcatccacg gctactccga catcgtcgtt 3600 ccgctcacgc gccttactcg caaagacacg ccttttgagt tttcttccgc cgctcgcaag 3660 tctttcgaga ccctcaagtc tgctttcatt tctgctccta ttcttgttca ctggcaaccc 3720 gatcgacctc tcatcatcga gacagacgcc tccgactacg ccttggccgc cattctctct 3780 attcagctcg agtctggaga gatccaccca gtcgcctttc actcccgcac tttccaagcc 3840 gccgaactca actacgacgt ccatgacaaa gaactccttg ccatcttcga ggcgttcaaa 3900 gtctggcgtc actacttgga aggcgcaggc gacccggtca acgtcgtcac agaccataag 3960 aacctcgagt acttctcgac taccaaggtt ctcactcgcc gtcaggcgcg gtggtctgaa 4020 tacctttccc agtttaacct cgtcgtccgt ttccgtcctg gcaaattggg caccaagcct 4080 gatgcgctaa cacgacgatg ggacgtctac cttaaacagg ggggtagtga cttcgccagc 4140 gtaaatccgg tcaaccttcg tccaatcttc gcgtctgagc agctggcttc gtcattgagg 
4200 gcttcgacta tctacatccc tgccctccga gcctcgctgg ttgtcgactc tgctagtctt 4260 cattctgata tcaaggcttc ttattcttcg gaccccctcg cctcttcaca acttcccact 4320 ccttctgacc ctcgttggga gcttccatcc gacggtcttc ttagacttga tgggcgtatc 4380 tacgttccag atagtcccgg cttacgtctt cgagtcttgc agctcaagca tgatcaccct 4440 ctcgctggtc acttcggtcg caacaaaacg atcgaattgg ttagacgtga ctatgtttgg 4500 cctaaactgc gttcgtttgt cgcagactac gtcaattctt gcacgacttg tcgccgtacg 4560 aagactcctc gccacaagcc ttatggtctc ttgaagcagc tacccattcc cctccgtcct 4620 tgggaatcca tctccatgga tctcgtcgag caacttcctt cctctggtgg tttcacagac 4680 atcctcgtca tcgttgatcg cctgtccaag caagctatct tcatccccac agaccaatat 4740 ctcacgtctg cggagcttgc acgccttttc gtcatccacg tcttctccaa gcacggcgtc 4800 ccttcgaacg tgacgtctga tcgtggttca gagttcgtct cccacttttt ccgtgctttg 4860 ggcgaagcct tggacatgaa gcttcacttc acctcaggtt atcaccctga aggcgacgga 4920 cagaccgaaa cgagttaacc agaccttgga gcagtacctt cgggcgtatc tgttcttacc 4980 agcaggatga ttggcatcga ctccttcccc tcgccgaatt cgcttacaac aacgcgcctt 5040 cagctaccac tggtgtctcg cctttcttcg caaacaaggg ttaccatcca agtcttaaca 5100 tccaccctga acgcgactta gccagttccg ccgcgcaaga atttgcagtc gacctcgcct 5160 cgttgcacga gtaccttcgg gaatctatca ccgccgcaca gaagtcttac cagctatacg 5220 ccgatcgcaa acgtctagca cccccagact tcaaagtcgg tgatcgcgtc ttcgtcaaag 5280 ctgccttctt ccgaaccacc cgacctacca agaagttgtc tgaacgatat ctcggtcctt 5340 acgagattat tgctcaggtt ggcacccact ccttcaccct tcgacttcca gactccatgc 5400 gttccgttca ccccgtcttc cacgtctcga tgctcgaacc ccatcacgag tctcagattc 5460 ctcttcgtca cgagcctcct ccacctccgg tagaggtaga cggcgaattg gagtatgaga 5520 ttgccgaaat cctagactcg aaagtggatc gtcgtagacg cgcctgcaag cttctctatg 5580 aagtacgctg gttaggctac gagaacacag acgaagaacg ctcctggctc cttgctaccg 5640 agttgagcca tgccagggag ttagtcgaag acttccatcg tcgatatccc gacaagcctg 5700 gacccgagtg gtgacccgtc cgccttcgcc ttgttatctc gacttctcat tttttccttc 5760 gacttcattt tcttttcttt cacattccta tgcttcaatt tctcttcgac atcgacttca 5820 cctttggcct cgagggctcg gccgtaatca ggggggtagc 5860 // ID PIF_Harbinger-1_TreMes 
repbase; DNA; FNG; 3280 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW PIF_Harbinger-1_TreMes. XX OS Tremella mesenterica OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Tremella. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 3280 BP; 807 A; 844 C; 712 G; 917 T; 0 other; aggggtgctc agttctataa tggctagtcg cgacattatc acagataagt gaacacccgt 60 gataacgagc catttttatg tcataaatgc cacgtgtgag gtagaaaatg ccacgtgtaa 120 cccctccttc gcgtcctccc ccatgtcatc tcatgcaggt ttatgtatat gtatctcggc 180 cattgcattt ctcaactatc tactaagcaa aacccctcgg atttcgtgtc atgccttcca 240 aaacccctcg agctcacctc ctgcacgatt tagaagatgc cgcattgctt ctgtttcctc 300 tgctggacga tcatacctca agggcctgtt tggaagacat aattcaagca catcaacaaa 360 tctcagcaaa tcgttactta gatcgatacc cgactgtctc taacaatctt ccccgaccct 420 ctttctacca cagtgacatt ctcctcttcg cagaacgagg cattcgtcat aaacaagcat 480 ttcgcatgtc ccctagccat cttgatgaca tcgtcgattt tttcaaggat gatcctgtct 540 ttttttcaag aggaaacaag cctcaggccc caccaaagta tcagataggg ttactggtct 600 accgaatggc tcatggacat gactgcagga cattagatcg cgagttcggg gtctcaagcg 660 agttttcctt ccaattgctc tgtccatttt gctaattagt tagccggcac tgtattcaag 720 tggtgtgacc gggccctcat tgctatactg aaaaaacgat cttacttcat ctcctggccg 780 tcccccaccg aaagaattga aatcaaagct cgatttctct ctgaatactc aattcctcac 840 tgcgtaggcc tcatcgacgg ctatcatgtg aacctctcaa ctgctcctgc tagagatgat 900 gcaggggctt atcatagccg caaggagagg tatggcttca atgtcatggc tatcgtcgat 960 catacaaagc gttttcgata tattcattac ggctatcccg cttccagcag cgatcaacga 1020 gttcagcgag ctatcgaacc ccttaacacg ccctctcagt atttcagcaa ccaggagtac 1080 ctcctggctg attctggctt tacagcatcg tctgtcgtcg taccaatgtt taaaaagagt 1140 
gctggccaag cggttcttcg aggtaaaacc gcgtatttca actctcgagc ctctgtcatg 1200 agggtagggg tcgaacattc aataggtgtg ttaaaggctc gatggacgat acttcgatca 1260 atgtctatga gacttcgaac taaaagagac gaagctatgg ctcacgcaat cattgtagcc 1320 tgtacaattc tccacaacat cctcataaac accgacgaat attacaccga agaaaatact 1380 ggaaatttac caattgaacc acttcgagat gacctgcctg cagaggaggt agcagaggac 1440 gatttcttag gtgctggtcg tctcaggcca cacatgagaa gacgtcatga gttggtggag 1500 caaatgttag cattggagac agaggaaatc gacatagaca acttcatttt atcatgaaaa 1560 aaggtgagtg gtttcaacta cctatcactc tccgaccaat tgctccactc atcatattcg 1620 gtgtgaacgt ctccttcccc ttcttctctg tccctgtgag ctctatcata cctctcctct 1680 gcatctgcga tagcctgaac ccaggaaaca cgaagttcta ccggtagtcc ttgctctgca 1740 gtgagacgag cttgagcctc cttcgatgtt atgaccaacc tctcattttt agccctttcc 1800 ttcctctgtc tcttcgaatc ttctctttct tcctcttgga tagctagctt acgcttctct 1860 aggcctatct cctcccttct gctcgcagca atcccctcaa acaaccccgc aatacctcca 1920 gcattgtttt ttccttcccc cctcctggac gatgtgtaaa ggcctgtaaa tcagctatac 1980 cctgacctta gtggctatgc agtccatgtc accgtgcctg acgtacctgc agctcgcttg 2040 gtcagcctat gatcactaaa actgcgttcc ctagtcgcag gacgtgtatt ttcgctctga 2100 gtcgatgtat cttcagtttg agactcttgc tggcctagga agtcattcaa tgacatctga 2160 gatggtgatg gaggcctagg agagtgtgga gatctctcag gagaaggaga ggggtggtga 2220 gaacccctca taatcgtctc gagtgtgaca ttcgaatttc cggtggtacc tgtgcgaagg 2280 gattctgtcc cgccgtctct ttcggcaaat atggggtcca cagagtcata ttcaggacac 2340 ttcttcagta cagtggctgc agatcagcta aggaagcata ggagtgatga acagccagga 2400 gacagatgat acacaggtca caggaggtat tcggggagaa ggggagaagg gagagacggg 2460 gactcacctt taaccgtctg ttcagcttga tttatcttct ctggatcctc actaccatcc 2520 accatactct cgaataaacc cttccctgtg tgatttagcc agtcacgggc ctctcgccac 2580 cttcttaaga gttcacgaat ctagaagaca tcagcagagt tttagggggg gagaaaagga 2640 aggaaatcac acttaccttg ttttgtatct gatcagtcga tcttgtagtt ggacatcctg 2700 acttggacat ccactcagag cattccctag ccatccccag cttcttctta ccccctaggc 2760 catgttgata cttcttgaag ttatctccca tagctaacca ctcaataagg tagtcgagcg 2820 agcagtattg 
gccggtacct tgatcttttg tccagcctaa agttgctggg gggttgtcct 2880 gactatcctc tttgtccctg ttcctccttt gctgagttct agtagatggt tgccgagcag 2940 tggaggggat atgcagaacc tactgaagtt agcaggggta tgcgaagggg gtttaacgac 3000 atatttacag gtggtgacgg tgacctcaaa ttaccacccc ccgaggtgtc catagtatca 3060 gcttgaagct acagctctcc tctttcagat tcttggcttt ttatcgcttt aaagtgtctc 3120 tgtaagaaga tacgataact ctacggtgat gatgaaaact caacggtcac ttttcacctg 3180 tggtgatgtc gcgtttgttt gtcatcaaaa ttacccgaag aggaaaattt tacgtcattt 3240 taacacgtgt tattttggcg tatcagaaat gagcacccct 3280 // ID Gypsy-1_LENY-LTR repbase; DNA; FNG; 461 BP. XX AC AAPO01000065; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Lodderomyces elongisporus genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_LENY_; KW Gypsy-1_LENY-I; Gypsy-1_LENY-LTR. XX OS Lodderomyces elongisporus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Lodderomyces. XX RN [1] RP 1-461 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lodderomyces elongisporus RT genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; AAPO01000065; Positions 72706 72246. XX SQ Sequence 461 BP; 162 A; 76 C; 81 G; 142 T; 0 other; tgtcacagtg cgcaatgaac tagcgccatt gtgcctatat ttctattttc taaattatcg 60 gaaatatacg gatataaagg aatatgactt gttcattata atttagtgaa aaaataaagt 120 atataaatga gacaaatgta ctcaaagtaa acttgtatca atttatatag aattatcaat 180 aaatatcagt ttatatggtc acaacaagtt aagcacaatc tagtggatat tcagaggaat 240 tgatcagcct actgaatctc ttcacttata ggttgtgcaa ggcaaactac aacttacact 300 taaacactgt gttgttatta cagcatcttc agatactgga acactttctc agaatgtgta 360 ggttctaagc ataaatgtaa tcgctggttc tagtgagcga gaagcaagat tctagtgcac 420 agcagtcgca cacttacgtt aaataacgtg gccatgtgac a 461 // ID Gypsy-12_MLP-I repbase; DNA; FNG; 5847 BP. XX AC AECX01001319; XX DT 22-APR-2011 (Rel. 
16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_MLP_; KW Gypsy-12_MLP-LTR; Gypsy-12_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5847 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001319; Positions 72108 66262. XX CC 'GACAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 310..1320 FT /product="Gypsy-12_MLP-I_1p" FT /translation="MATADDVARIAAQLADLNTRLTEETALRQAAEARFAA FT SEARRTAQPTSAPAPAPTLVAPAPTPSGEAQIQVARPEQFTGTRGAAAEAY FT ASQVGLYVAVNASLFADDKTKVMFALSYLGGDAIQWAQPFLQRTLNPGDDQ FT PPTYQEFTEAFAAVFFDSDRKQCAEKALRALKQTKSAAEYTIQFNQLAPTS FT GWELPTLISHYRQGLKTDVRVAMIREHFERLEDITKLACAIDGELRGETVT FT YNAPRPVASDAMDISSARFAISSDEYGKRVKEGLCFSCGKSGHRARWCKGK FT GRRTGGGKVAELEAKIAAMEVEMRGGAGGSGGSAADRSKNGDARE" FT CDS 1710..5351 FT /product="Gypsy-12_MLP-I_2p" FT /translation="MPWLRENGHRIDWKTGTLRPKTEFTQIAALDHPPEKD FT IKTDKWKNLPRHLAAIDLASSNPKTTPDRSIARKGKARDSDEGIVSFTSAK FT PPQGEFDTTMRHTLIATDDDQVHFRQQLSKQTEPLGTQDHLAATALASSDS FT KTTPDSLDARMGNARDLDEGVVSYADAKPPQGEFNTALEPTLSVTDDDQVH FT FRPQIPPYEMPTPRTEEATIATDLQIAHAVSPVPKTTPNAVRMQMGNTRTL FT GEGALTLNETKPPRCEYHNIETTSLEAVDKLVVPGSRIGKIASASATWNVS FT AKLAADATKDKEEKPVEELVPTRYHRFINMFQKTNAMTLPPHQRYDFRVDL FT IPGATPQASKIIPLSPAEEVALDTMIDEGLSKGTIRRTTSPWAAPVLFTGK FT KDGNLRPCFDYRKLNALTVKNRYPLPLTMELIDSLRDAEDYSCLDMRNGYN FT NLHVKEGDEKKLAFICKRGQFEPLVMPFGPTGALGHFQFFISDILREKIGK FT ELAAYLDDLLIYTRAGVDHEKVVEEVLEILRSHNIWLKPEKCKFSRKEIDY FT LGLLISKNKVRMGPLKVSAVTDWPVPTNVTHIQRFLGFSNFYQRFIEDFSK FT ITRPLHDLTRDKTPFVWGEAQDKAFQTLKTSFTTAPILKIANPYEPFILEC FT DCSDYALGAILSQNDDEGVLHPVAFLSQSLVQAERNYEIFDKELLAVVASF FT 
KEWRHYLEGNPNRLEVTVYTDHKNLESFMTTKQLTRRQARWAETLGCFDFH FT ICFRPGSESTKPDALSRRPDLEPLAKDKLSFGSLLKPANLSEKTFNAELDC FT IEAWFEDEDVIHEDVESWFEQDVVEPTFSETLEIDAIERATDSPIWTDNLI FT IDRIREQSKLDPRIEELMKSTLASKESIKTPNEYKVHNEILYKDGRIEVPN FT DKTCKFEILRSRHSSALAGHPGRMKTLNLVQRQYRWPSMKAYINKFVDGCD FT SCLRVKSSNKLPFRSLEPLPIPARPWTDISYDLITDLPRSNGKNCILTVVD FT QLTKMGHFVPCTTKMTSDDLASLMIKHIWRLHGLPKTILSDRGSIFVSKIT FT DSLSKQLGIALHPLTAYHPQTDGQSEIVNKAVEQYLRHFTSYQQDDWEEHL FT LLAEFAYNNSTHMSTGVSPFKANLGYDLTFGRIPLADRCIPTVEERLQKIT FT EIQNELKEALIQAQDTMKRNYDAHVRPTPDWNEGDEVWLNSRNISTTRPSA FT KLDH" XX SQ Sequence 5847 BP; 1771 A; 1419 C; 1396 G; 1261 T; 0 other; cattgtagca tcttttcata caacagaccg ggaaagaaat tattacaaga tagatcacta 60 aagaagaaga agattaaaat tttttttaaa cagaaaagaa gaagaagaaa aaattgttgt 120 agtagcaaga agtcaagctt agtttaacct cagaagaaga agtattccac aatcaaacct 180 ttcacatcac atatctcaca tcacatcccc accacgccgt cctttcaacc accatcggaa 240 gacctcacgt ccacggaaga cgatacttgg cacgacgtcg gacaggatac ttttagccct 300 acatccccaa tggccacggc ggacgatgtc gcacgcatag cggcccagct agcagatctc 360 aacactcgtc tgactgaaga gactgccctt cgacaagctg ctgaagcccg gttcgcagca 420 tctgaagcca gacggaccgc acaacctact agcgctcctg ccccagcgcc cacccttgtt 480 gctcctgccc caacacccag cggagaggcc caaatccaag tcgctcgtcc tgaacaattt 540 accgggacac gaggcgccgc agctgaagcc tatgctagtc aagtgggatt gtacgtggca 600 gtcaatgcca gtttattcgc cgacgacaag acgaaggtga tgtttgcttt gtcttacctc 660 ggaggcgatg caatccaatg ggcccagccc ttccttcaac ggacgctaaa ccccggtgat 720 gaccagcctc ccacttatca ggagtttacc gaagcttttg cagctgtgtt cttcgattcc 780 gacaggaaac aatgtgctga gaaggccctt cgtgccttga agcaaacaaa gtcagcggcc 840 gagtatacga tccaattcaa tcaacttgcg cctacttctg gttgggaact tcctacgctg 900 atcagtcact accgccaagg cctgaaaacg gatgtccggg ttgcaatgat tcgggagcac 960 tttgaacgtt tagaagacat cacaaaatta gcctgtgcga ttgacggtga actccgaggc 1020 gagacggtaa cttacaatgc tccaagacca gtcgcctccg atgccatgga catctcttcg 1080 gctagattcg ccatttcgtc tgacgagtac ggtaaacgtg tgaaagaggg actatgtttt 1140 tcatgtggga aaagtggaca tcgagcaagg tggtgtaagg 
ggaaaggaag gagaacgggg 1200 ggtggaaagg tggcagaatt agaggcaaaa attgcagcaa tggaagtaga gatgagagga 1260 ggagcaggag gatcaggagg atcagcagcc gataggtcga aaaatggaga cgctcgggag 1320 tgacggatgt gccacccccg ggcttaggca aggataactt actgggagat atagataccg 1380 ttagtgacaa tgcaattcag ggtactgacc cacgcgtgtt cacctctata tccctatcca 1440 cgtctcattt tgccacatcc cctgaatcta aaaccacccc tgctcgagcc ctcattgact 1500 gcggctcgac acacgaggta cttggtacca agtttgttga gcgtagtggg ttgcctgtga 1560 cgagccttaa gaccgccggt gatgtatacg gcttcgatgg tgcgcctcgt accgttgctc 1620 acgatacctc cctttacatt gacgacgacg aatccaaaac tcgattcctc gtcacgaaaa 1680 tcaaggattc gtacgacgct atccttggca tgccttggct gcgagagaat ggtcaccgaa 1740 tagattggaa gactggaaca ttacgaccaa agacagaatt tactcagatc gccgcattag 1800 atcacccacc tgaaaaggac atcaagactg acaaatggaa gaatttacct cgacaccttg 1860 ctgccatcga cctggcatcg tccaacccga aaaccacccc ggatcgctcg atagctcgga 1920 aggggaaagc tagggacagt gacgagggga ttgttagttt tactagcgct aaacccccgc 1980 aaggtgagtt cgatacaacc atgagacata ccttaattgc cacagatgac gaccaagtac 2040 attttcgaca acagttatca aaacaaaccg aaccactggg gacacaggac catcttgctg 2100 ccactgcatt ggcatcgtct gactcgaaaa ccaccccgga tagcctggat gctcggatgg 2160 ggaacgctag agatcttgac gagggggttg ttagctacgc tgacgctaaa cccccgcaag 2220 gtgagttcaa tacagcatta gaacccaccc tcagtgtcac agatgacgac caggtacatt 2280 ttcgaccaca gatcccaccg tacgagatgc caacacccag aaccgaagaa gccacgatcg 2340 ccactgactt gcagattgca catgcagtgt cgccggtgcc gaaaaccacc ccgaatgccg 2400 tacgtatgca gatggggaat actaggacac ttggcgaggg ggctctcact ttgaatgaga 2460 caaagccccc gcgatgtgag taccacaaca ttgagactac atccctcgag gcagttgaca 2520 agctggttgt accgggatca cgtataggga aaattgcctc cgcgagcgcg acctggaacg 2580 tctcagccaa gttagcagct gatgcgacga aggacaaaga ggagaagcca gtagaggaac 2640 tcgtaccaac ccgataccac cgcttcatca atatgttcca aaagacgaac gctatgacac 2700 tcccaccaca ccaacgctat gacttccgtg tcgacctaat ccctggcgcc acacctcaag 2760 ctagtaaaat tataccgtta tcccctgctg aagaagtcgc tcttgacacc atgatcgacg 2820 aggggttgag taaaggcaca attcggagga caacatcccc atgggcagcc 
ccagtcctgt 2880 ttaccggaaa aaaagacggt aacttaagac cgtgcttcga ttataggaaa cttaacgcac 2940 ttaccgtcaa aaaccgatat ccactacccc taacaatgga actaatcgac agcttgaggg 3000 acgcggaaga ctattcgtgc ctggatatgc gaaacggcta taataacctg cacgttaaag 3060 aaggtgacga gaaaaaacta gcgttcattt gcaaaagggg tcagtttgaa cctttagtca 3120 tgcccttcgg accaacgggg gccctaggac atttccaatt ctttatctct gacatactta 3180 gggaaaagat cgggaaggaa ctagctgctt accttgacga tctactgatc tacacacgag 3240 cgggggtgga ccatgagaag gtggttgagg aagtccttga gatactacga tcgcacaaca 3300 tctggctcaa acctgaaaaa tgtaaatttt ccaggaaaga gattgactac ctaggattgc 3360 tgatatcaaa aaacaaagtc cgaatgggtc cgttgaaagt ctcagccgtt accgattggc 3420 ctgtccctac aaatgtaaca cacatccaac ggtttctagg attctcgaac ttttaccaaa 3480 gattcattga agatttctct aagattaccc gaccattgca tgacttaact cgtgataaaa 3540 cgccttttgt atggggggaa gcccaagaca aagccttcca gaccctcaag acctcattca 3600 ccacggctcc aattttgaaa atcgcaaatc catatgagcc gtttatcttg gagtgcgatt 3660 gttccgatta tgcgttgggc gctatcttgt ctcagaatga tgatgaggga gtacttcatc 3720 ccgtcgcctt cctgtctcaa tcgctggtgc aagctgaaag aaattacgaa atatttgata 3780 aggaactgtt agcggtagtg gcatctttta aggaatggag gcactatttg gagggaaatc 3840 caaataggct cgaagtcacg gtctacacgg accataaaaa cctcgaaagc ttcatgacga 3900 cgaaacaact aactagacga caagccagat gggccgaaac ccttggctgt ttcgattttc 3960 acatttgctt tagacctgga agtgagtcaa caaaacctga cgctttgtcg agaagaccgg 4020 accttgaacc attggcgaag gacaaactct ccttcggaag tctattgaaa ccagcaaacc 4080 tatcagaaaa aacattcaat gcggaacttg attgtattga ggcttggttc gaagatgaag 4140 atgtgatcca tgaggatgtc gaaagctggt ttgaacaaga tgtagtagaa ccgacattta 4200 gtgaaactct ggagatagat gcaattgaac gggcaacaga ctcaccgatt tggactgaca 4260 acctaataat tgaccgcatc agagagcaat ccaaacttga cccaaggatt gaggagctga 4320 tgaaatcaac actggcatcc aaagaaagta tcaagacgcc aaacgagtac aaagttcata 4380 acgaaatcct atacaaagac ggccgaatcg aagtaccaaa tgataaaact tgtaaatttg 4440 agatactgag aagtagacac agcagcgcat tagcaggcca cccaggaagg atgaagacac 4500 taaatctggt tcaacggcag taccgatggc catcgatgaa agcttacatt aacaaatttg 
4560 tcgacggctg tgattcctgc ctgagggtga agtcatcgaa caaattaccg tttagatcat 4620 tggaaccatt accaataccg gcaagaccct ggacggacat tagttatgat ctgattaccg 4680 atctgccaag gtcgaatgga aaaaactgca tactcaccgt ggtagaccaa cttacgaaga 4740 tgggacactt cgtgccttgc acaaccaaaa tgacatcaga tgacttggcc tccctaatga 4800 taaagcatat atggagattg cacggattac ctaagaccat tctttcggat cgtgggagta 4860 tttttgtatc aaaaatcact gattccctga gtaaacaatt aggtatagcc ctacacccgt 4920 tgacggccta ccacccacaa acagatgggc agtccgaaat tgtcaataaa gctgttgaac 4980 aatacctacg tcatttcact tcttatcagc aggatgattg ggaggagcac ttactgttag 5040 ctgagtttgc atataacaac agcacacaca tgtccacggg ggtatcaccc tttaaagcca 5100 acttggggta cgacctaact tttgggagga tcccattggc cgacagatgc atcccgacag 5160 tggaagagcg cctacaaaag atcaccgaaa tacagaatga actcaaggaa gcgttaattc 5220 aggctcaaga tacaatgaaa cgaaactatg atgctcacgt tcgcccaacg ccagattgga 5280 atgaaggtga cgaagtatgg ttgaacagtc gtaacatatc cacgacaaga ccaagtgcaa 5340 aactggacca ttgatggtta ggacccttca gcatagtgaa gaaaatatca aagtcagcct 5400 atcaactagc attgccaatc agtatgagta aagtacaccc cgtatttcat gtctcagtct 5460 taaagaaaca ctcaccggac ttgatcgagg aaagattcca ggagttacca ccaccaatag 5520 agatcgacgg cgaaaccgaa tgggaagtcg cagaaatact ggataaacga cgtcgaagga 5580 tgaaggacaa gtacctggtg tcttgtaaag gattcgatag aactgaggat ttgtgggagc 5640 cggctgagaa cttgatgaat gcaaaagaga tgatcgacgg gttcaattta aggtttccga 5700 aagcatcaga agaacatcaa agatcaaaga ggatgcgggg gtaagggaga gggaccgacg 5760 ctttttcccg ccgggttttt taatgccagg tcccggggag agacgcccag ccgccaagag 5820 ggagccgggg cgtaaagggg gggatag 5847 // ID SCTRANSP repbase; DNA; FNG; 1143 BP. XX AC M11280; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 11-NOV-1996 (Rel. 1.1, Last updated, Version 1) XX DE Yeast (S.cerevisiae) mitochondrial 21S rRNA gene, r1 intron DE encoding a putative transposase. XX KW DNA transposon; Transposable Element; SCTRANSP; gene conversion; KW transposase. 
XX OS Saccharomyces cerevisiae OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. XX RN [1] RP 1-1143 RA Jacquier A. and Dujon B.; RT "An intron-encoded protein is active in a gene conversion process RT that spreads an intron into a mitochondrial gene."; RL Cell 41(2), 383-394 (1985). XX DR GenBank; M11280; Positions 79 1221. XX SQ Sequence 1143 BP; 486 A; 104 C; 130 G; 423 T; 0 other; aatttacccc cttgtcccat tatattgaaa aatataatta ttcaattaat tatttaattg 60 aagtaaattg ggtgaattgc ttagatatcc atatagataa aaataatgga caataagcag 120 cgaagcttat aacaactttc atatatgtat atatacggtt ataagaacgt tcaacgacta 180 gatgatgagt ggagttaaca ataattcatc cacgagcgcc caatgtcgaa taaataaaat 240 attaaataaa tatcaaagga tatataaaga tttttaataa atcaaaaaat aaaataaaat 300 gaaaaatatt aaaaaaaatc aagtaataaa tttaggacct aattctaaat tattaaaaga 360 atataaatca caattaattg aattaaatat tgaacaattt gaagcaggta ttggtttaat 420 tttaggagat gcttatattc gtagtcgtga tgaaggtaaa ctatattgta tgcaatttga 480 gtgaaaaaat aaggcataca tggatcatgt atgtttatta tatgatcaat gagtattatc 540 acctcctcat aaaaaagaaa gagttaatca tttaggtaat ttagtaatta cctgaggagc 600 tcaaactttt aaacatcaag cttttaataa attagctaac ttatttattg taaataataa 660 aaaacttatt cctaataatt tagttgaaaa ttatttaaca cctataagtt tagcatattg 720 atttatagat gatggaggta aatgagatta taataaaaat tctcttaata aaagtattgt 780 attaaataca caaagtttta cttttgaaga agtagaatat ttagttaaag gtttaagaaa 840 taaatttcaa ttaaattgtt atgttaaaat taataaaaat aaaccaatta tttatattga 900 ttctataagt tatttaattt tttataattt aattaaacct tatttaattc ctcaaatgat 960 atataaatta cctaatacta tttcatccga aactttttta aaataatatt cttattttta 1020 ttttatgata tatttcataa atatttattt atattaaatt ttatttgata atgatatagt 1080 ctgaacaata tagtaatata ttgaagtaat tatttaaatg taattacgat aacaaaaaat 1140 ttg 1143 // ID Gypsy-40_MLP-I repbase; DNA; FNG; 5690 BP. XX AC AECX01001084; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-40_MLP_; KW Gypsy-40_MLP-LTR; Gypsy-40_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5690 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001084; Positions 36106 41795. XX CC Positions [2802-3221] - Reverse transcriptase CC Positions [4491-4970] - Integrase core CC 'CTGTC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 319..1422 FT /product="Gypsy-40_MLP-I_1p" FT /translation="MFEDTVEPPSEARMSNITLEDVMRELSLMNTKLNTET FT ELRKKAETESQEANAQSKELAQKLAQLESNHSQGQTYATSSTPAAHPQAPI FT PTPSHNKPPKIATPDKFDGSKGPKAEVFMNQIGLYMKMDASLFTDEQSQVT FT FALSYMTGKASIWSQAITDQLLDPERTHLVNWAKFIQSFRATFFDTERVSK FT AKREIRALKQIRSVSDYWIRFSELSLIVKWSENILLSQFEQGLKPEIAIYM FT INETFEEVEKMAHMAIKIDNKLHKRHNKSNYTPISPATPHISATDPDAMDC FT SAYRLNISSDEYNRRGTTGSCYSCGSPDHFIGSCPKVKKHRGYGGFRGGSN FT RGSEMSRLRARVAELDSQLSEVKGK" FT CDS join(1542..2618,2622..5594) FT /product="Gypsy-40_MLP-I_2p" FT /translation="MIDTRLIENVQLLDPNSDTTITARVLNDSGATHEAIS FT LKFVEKHKLHIDPLPKSHSVTGFSGHETQVTHTGDYCVNHEHQTTTFIITQ FT LRDKYDAILGMPWIKRNSNIIDWKESKLKTQANHIVMSTPQLAFMDPGMEP FT VRHARHSDTGVEFSNDSVKPPQCEYFTIPNPPVEEALCKLKQHLELEVKPP FT VNSMSGKESQTPADVTLTSLSNPKNTSTNLSEEKERNARKSDMGVEISNDS FT LKPPQSEFAIDMRPLPKEPLKKQSRLLQKPPGTVPKPIPRPPKMKLYTQAL FT RSSGIAIPEVHTMKASWNLSARLSADQDKNKPEQSATELVPECYHDYLDMF FT EKSKANGLPPHRVYDFVDLIPNATPQAGRVIPLSPKESAVLDEMLDRGLAN FT GTIQRTTSLWAAPVLFTGKKDGILRPCFDYRKLNAVTVKNKYPLPLTMELI FT DSLLDADEFTSLDMRNGYNNLRVREGDEAKLAFICKRGQFEPLTMPFGPTG FT APGFFQYFIQDVLKSHIGRDVAAYQDDVLIYTKPGVDHKAVVREVLAILKA FT QNVWLKPEKCKFSKKGVSYLGLIISKNQIRMDETKVQAVTDWPTPRNVLEV FT 
QTFIGFANFYRRFIGQFSKIARPLHELSQKDTPFVWTTAREQAFQDLKKAF FT TTAPILKIADPYRPFVLECDCSDYALGAVLSQVSSDDGELHPVAFLSRSLI FT QAERNYEIFNKELLAVVASFKEWRQYLEGNPNRLNVIVYSDHKNLQSLMTT FT KELTRRKARWAEVLGSFDFEIRFRAGKQSTKPDALSRRPDLMPKNGEKFTF FT GQMLKPQNLPNDAFIDKMDLIDSWVVNEDLDQVIDIEEFGMDEQHLDDEGD FT LIWNDEQILNEIRLKSKVDSKIGEVIEMKKGGWNSRMLKDYSSIEGILYFK FT DKIVVPNDNNLKVQILRSRHDSRLAGHPGRMRTLALVKRGYHWKSMKAFIN FT QYVDGCQSCQRVKARTTKPFGSLSPLPIPSGPWTDICYDLITDLPQSSGND FT SILTVVDRLTKMVHFIPCTTTITSDELAELMIKDVWRLHGTPKSITSDRGN FT VFISRVTKDMNRRLGIVTQSSTAYHPQTDGQSEITNKAVEQYIRHFTSYKQ FT DDWCSLLPMAEFAYNNNNHVAIGMSPFKANYGFNVSFTDVPTGDQCLPSVE FT QRMAQLNDVQKDLKDSMQITQEIMKEQHDKHVEATPDWEVGGKVWLNNKNL FT STTRPTAKFSHRWLGPFPILNRVSKNAYKLHLPSSMNQIHPVFHVNLLRKF FT EPSKINGQIQEPPPPITIQNEDEYEVAEVLDKRKRRGKVEYLISWKGYDPD FT HDSWEPEEAMNNAQDLIKKFNSKYSQAEGRYRRVRRKQ" XX SQ Sequence 5690 BP; 1899 A; 1201 C; 1253 G; 1337 T; 0 other; tattgtagcg tctataattg ggaatctgaa gaagctagag gatcaagaag aaagtcaagt 60 tattgaaagt atcaagaata agaaagctta ttaaagttta aagtttaaag tttacccggt 120 tagaaatcaa gtagtacaaa gaagatcaat ctgcaaggtt taaaagcaag attcaagaac 180 atacaattta aattagaaaa ctcaaaccat tctaaagccg atagttcatt cactccacat 240 ctctaagatc acactcgatt cacgacgccg gactttacgc taccacagag cccgacttct 300 acgagtcaca actcagacat gttcgaagat actgtagaac caccctccga agccagaatg 360 tctaatataa ctctggaaga tgtaatgaga gaattaagct tgatgaatac gaagctgaat 420 acagagactg aactccggaa gaaagctgag acggaaagtc aagaagctaa tgcccaaagc 480 aaagagttgg cacaaaagct ggcgcagctt gagtcgaatc atagtcaagg acaaacttat 540 gcgactagtt cgacccctgc agcccaccct caagctccaa ttccgactcc ttcacataat 600 aagccgccta aaattgcaac tccagacaaa tttgatggat ctaaaggacc caaagctgag 660 gtttttatga accaaatcgg actctacatg aaaatggatg cctctttgtt taccgacgaa 720 caatcacaag tgacttttgc gttgtcctat atgacgggaa aagcaagcat ttggagtcag 780 gccattaccg atcaactact tgacccggag agaactcatc tagtcaactg ggccaaattt 840 attcagtctt tccgtgcgac cttctttgac acagaacgag tatctaaggc caaaagagaa 900 attcgagctc tcaaacaaat ccgttctgtc tctgactatt ggatccgttt ctccgaactt 960 
tctcttattg tcaagtggtc tgagaacatc cttctttctc aatttgaaca aggtttaaaa 1020 ccggaaattg ccatctatat gatcaacgag acatttgaag aagttgagaa aatggcgcat 1080 atggccatta aaattgacaa taaactccac aaacgccaca acaaatcgaa ctatacgcct 1140 atttcacctg caactccgca cattagcgca actgaccctg atgctatgga ctgctccgcg 1200 tatcgtctaa acatttcatc tgacgagtac aatcgacgag gtaccacagg gtcgtgctac 1260 agttgtggga gtcctgatca tttcattgga agctgtccta aagtaaagaa acataggggg 1320 tatggaggat ttagaggtgg aagtaacaga ggaagcgaga tgagtagatt aagagcaaga 1380 gtggcggaat tagatagtca gttatcagaa gtgaaaggga agtagaaaga ggagggaagg 1440 tctgaagtgt caaaaaatgg agaagctcgg gattgttagt tgtgcctccc ccgagcaaat 1500 cttgtttaga agcagaagtt ggaattattg atagccttga aatgatagat acgcgtctta 1560 ttgaaaatgt ccaacttctc gaccctaatt ctgacacaac catcaccgct agagtcttaa 1620 atgatagtgg tgcgacacac gaagccatta gcctcaagtt tgtggagaaa cacaaactgc 1680 acattgaccc cttgcccaaa tcacatagtg tcactggatt cagtggacac gaaacacagg 1740 taactcacac gggagattac tgtgtcaatc acgagcatca gaccacgact tttatcatca 1800 ctcagttgcg agacaagtat gatgcaatcc ttggcatgcc atggatcaaa agaaactcga 1860 atatcattga ttggaaagaa agcaaattga agacacaagc caatcacatt gtaatgtcaa 1920 cgccgcaact agccttcatg gaccctggaa tggagcctgt aaggcacgct aggcacagtg 1980 acacgggggt tgagttcagt aatgactcgg taaaaccccc gcaatgtgag tatttcacga 2040 ttcctaaccc acccgttgaa gaagcacttt gcaagcttaa acaacattta gaacttgaag 2100 tcaaaccacc ggtaaactcg atgagcggaa aagaatcaca aacacctgca gatgttacct 2160 tgacatcttt gtccaatccg aaaaacacct ctacgaacct cagcgaggaa aaggagagga 2220 atgctaggaa aagtgacatg ggggttgaga tcagtaatga ctcattaaaa cccccgcaga 2280 gtgagtttgc tattgatatg aggcccttac ccaaagaacc actgaaaaag cagtcacgcc 2340 ttctacagaa accaccagga acagtcccga agcctatacc tcgaccaccg aagatgaagc 2400 tgtatactca agcactccgg tcgtcaggaa tcgcgatacc tgaagtacac acgatgaaag 2460 catcatggaa tttatcagca cgcctatcag cagatcaaga caagaacaag ccagaacagt 2520 cagcaacaga acttgtcccg gaatgttatc acgactacct agacatgttc gaaaaaagca 2580 aggcaaatgg actaccacct caccgagtct atgacttttg agtggatctg atcccaaatg 2640 cgacccccca 
agccggcaga gttataccat tatctcctaa agaaagcgca gtcttggacg 2700 aaatgctaga cagaggatta gctaatggaa ccatacaaag gaccacttca ctgtgggcgg 2760 ctccggtact attcactggc aagaaggatg gaattttacg cccttgtttc gactaccgta 2820 aactaaatgc agttacagta aagaacaaat acccgttacc attaacaatg gagctaattg 2880 atagcctttt ggacgcagac gaattcacga gcttagacat gcggaacggg tacaataatc 2940 tcagagtcag agaaggcgac gaagctaaat tagcattcat atgtaagcgc ggccagtttg 3000 aaccccttac aatgcccttt ggaccaactg gggctccagg cttctttcag tattttatcc 3060 aggatgtatt gaaatcacat ataggtagag atgtggcagc atatcaggat gatgtcctta 3120 tttacacgaa gccaggggtt gatcataaag cggtagtgag ggaagtcttg gcaatattga 3180 aagcgcagaa cgtatggttg aagcctgaaa aatgcaagtt ctcaaagaaa ggggtttcgt 3240 atttaggctt aatcatttca aaaaaccaga tcagaatgga cgaaactaaa gttcaagcag 3300 ttactgattg gcctacacct agaaatgtct tggaagtaca aacattcata ggctttgcaa 3360 acttctatag gcggttcatt ggtcaattct caaaaattgc aaggccttta cacgagttat 3420 cgcaaaaaga cactccattt gtatggacca cggcaaggga acaagcattt caagatctca 3480 agaaagcatt taccacggcg ccaattctca agattgccga tccttaccgg ccctttgtgt 3540 tagaatgcga ctgctctgat tacgcattag gagcagtctt gtcgcaggtc tcaagtgacg 3600 atggtgaact acacccggtg gcgtttctct cgcgttccct aatacaagct gaaaggaatt 3660 acgaaatatt caacaaggag ctgctagctg tggttgcgtc ctttaaagaa tggaggcagt 3720 atcttgaagg taatccaaac cggttgaatg ttatagtata ctctgatcat aaaaacttgc 3780 aatcgctaat gactacaaag gagttaacaa gacggaaagc gagatgggcg gaagtcttag 3840 gaagtttcga ctttgaaatc aggtttcgag caggcaaaca atcaacaaag ccggacgctt 3900 tatcccgcag accggatttg atgccaaaga atggcgagaa gtttaccttt ggacaaatgt 3960 tgaaacctca gaacttacca aatgatgcgt ttatagacaa aatggactta attgattctt 4020 gggttgtcaa tgaggatttg gatcaagtaa ttgatattga agaattcgga atggatgaac 4080 aacatcttga tgatgaaggt gacttaatat ggaatgatga gcaaatttta aatgaaattc 4140 gactcaaatc aaaggtagat tcaaaaattg gggaagtaat cgaaatgaaa aaggggggat 4200 ggaactcaag aatgttgaaa gactactcaa gcattgaagg aattttatac ttcaaggaca 4260 aaattgtggt acctaacgac aacaatctga aggtacaaat cttacgatca cgccatgata 4320 gtcgtttagc aggtcaccct 
ggaagaatga ggacgctagc tctagtcaaa cgaggatatc 4380 attggaagtc aatgaaagcc ttcatcaatc aatacgtcga tggttgccag tcatgtcaac 4440 gagtcaaagc acgaaccacc aaaccatttg gcagtctaag tccacttcca atcccaagtg 4500 gaccgtggac tgatatatgc tatgatttaa taactgacct tcctcagtca agtggtaatg 4560 acagcatctt aacagttgta gaccgcctca cgaaaatggt gcacttcata ccgtgcacga 4620 cgacaatcac ctctgatgaa ctggcggaac tgatgatcaa ggacgtatgg cgattacatg 4680 gaactccaaa gtcgattact tcagatagag gtaacgtgtt catctcgaga gtgacaaaag 4740 atatgaacag aaggctggga atcgtcactc agtcatcaac agcttatcat ccccaaactg 4800 acggacaatc tgagatcaca aataaagcag tggaacagta cattcgccac tttacatcgt 4860 acaagcaaga cgattggtgc tcattattac ccatggcgga gtttgcctac aacaacaaca 4920 accacgtagc cattgggatg tcgccgttca aggcaaacta cggctttaat gtaagcttca 4980 cggatgtacc aacaggtgat caatgcttgc cctcagtcga gcagaggatg gcgcaattaa 5040 atgatgttca aaaggattta aaagactcaa tgcaaatcac ccaagaaatc atgaaagagc 5100 aacatgacaa gcacgttgaa gctactcctg attgggaagt aggtggcaaa gtatggttaa 5160 acaataaaaa cctgtcgaca accaggccta cagctaagtt ttcgcataga tggctgggcc 5220 ctttccctat cttaaatcgt gtgtcgaaga atgcttacaa gttacattta ccaagttcga 5280 tgaatcaaat tcatccagtc tttcacgtga atcttctcag gaaattcgaa ccaagcaaaa 5340 tcaatggtca aatacaagag cctcctcccc cgatcaccat tcaaaatgaa gatgaatacg 5400 aagtggctga agtcttggat aaaagaaaaa gaagagggaa ggtagaatat ttgatcagtt 5460 ggaaaggata tgatcctgac cacgactctt gggaaccgga agaagcaatg aacaatgcgc 5520 aagatttaat caagaaattc aacagcaagt attctcaagc agaaggtaga taccgcaggg 5580 tacggagaaa gcagtgaggg tgaggctttt tcccacaggg ttttttaatg ccaacccgtg 5640 gaaagatgct gacccgtcaa aagggggttg agtcataaag gggggagtgg 5690 // ID Merlin-4_Roryzae repbase; DNA; FNG; 2154 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Merlin; DNA transposon; Transposable Element; Merlin-4_Roryzae. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. 
XX RN [1] RP 1-2154 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 2154 BP; 706 A; 354 C; 425 G; 669 T; 0 other; ggatggttgt acctttgatg ccgtcggact ttaaaatttc caatttcctt atttggaaat 60 cgataatttt cggaatatga gtgacacaat cggtagaata tcataaaatg atgcatgtcg 120 ttgtccagaa tggataaggt gtcgtctttg tatgtagagg gtcgcatgtt cgacccttgt 180 gtgtgtcatt tttttctgaa agttttagca gtcctttttt atgtctgagt tcttgctttt 240 tgtgacttaa ctattttaaa taaatttttt tttatttcgc gtttcccccc tgtttttttt 300 cccctttttt tttcttccct ttttacatct ttcaaattaa agttagtata tcatgtcttt 360 tatcaatatc tctaccgcaa actacaccaa tcctgctggt attcatcgtc aaagacctgt 420 tacttccccc aacagcccta ttgttatcgt tgattctgat gatgaagtag aagaaatgca 480 tttagaaacc gaacaaactt cagatttgga tgtagcagtt atttctatca ataatgatga 540 taacgaagag ttattggaaa tagaagcttt cgaagttgac gatgaggcac ataatagtaa 600 tcttttgccg gataattgga gcagaggtca aaatgttccc attatggcaa ccagaacttt 660 ttttttaact tttggaactg aagaagcgtg catgcagtaa gtataaaagt ataacaaagc 720 acacttgtat acaatattaa taactctttt tttttttata aattaggttt tttattaaca 780 acggaatcta ttatggaagc aacacagtgt accgttgtaa tggtgatgaa ggaagcgtca 840 tgtatttgga gcaaacagga agtgttccac gttggagatg caatggaatc atagttaatc 900 aggaaactgg tgagaaatgc tgcaatggac gtcaagttgc tgtacgaaat gcttcctttt 960 ttagtaatcg cagcttacca gcctatgatg cgctctccat tctttacttt tggagcttga 1020 aagtacgacg aatgacaatt tctaccatgg ttggatgttc accaaaggca gtacgagaaa 1080 ctttaaaaga ttggtaccaa gttttacaag aagatttaca gcagaacgac tgcaaaatcg 1140 gaggttatga tgtggatgga aatccaatcg tggtagaaat cgatgaatca aagtttggta 1200 aaagaaaata tcatcgagga catcgtgtgg agggtgtttg ggtggtaggt ggtgttgaaa 1260 aaacacctga acgaaagtgt tttcttgtgg tagtaaataa tagaaacact gaaacgatgg 1320 atacaatcat tcgaaattat gttgccgatg gttccatagt acatacagat tgttggagag 1380 catatgaaaa tatggtaaac cttggaatga acttggtaca tcgaactgtc aatcacagtg 1440 tgacatttcg tgacggtgat gtccatacca 
acaccatcga gggtaagctt aaaattggag 1500 tgtagaagtg tttataggtc tagttaataa aatcttttca ggaacttgga atggaataaa 1560 aattaatgta actcctgccc ttagaacaaa gaagatggta ccttggttgc ttatcgaatt 1620 tatttggcga agaaagcacc acaacgatat ctttggtggt ataattgatt gtcttaaaaa 1680 tgttagcttt gatcgagctc agcgtaatcc agcatggtta actgagcttg ctgcagaata 1740 ataataacaa taaaaaaaaa ttaaacaaag ctcaataaaa atgtcctcaa taatgttacg 1800 tacagagagc aagcattgtt aatatacttc gacaaaaaat atccaaaagc attactataa 1860 tcagttaaat aaaagaaaag cctatacaaa aaaagaatgc acaaaggaat caattcaaaa 1920 aaaaaatagc tcccacatag acacgcctgc agcatcggaa caaaaatacc gattacatgt 1980 ttcgaacagg cgacctcttc cgtcggaagt agacactcta gccattccgg acagcgactt 2040 gcataattcg actgtatttt gttgacgagc tgattcatat tcagaaaatt gttgatttcc 2100 aaataaggaa attggaaatt ttaacgtccg acggcatcaa aggtacaatt aacc 2154 // ID CRYPT1 repbase; DNA; FNG; 3563 BP. XX AC AF283502; XX DT 30-NOV-2000 (Rel. 5.1, Created) DT 30-NOV-2000 (Rel. 5.1, Last updated, Version 1) XX DE CRYPT1 is a HAT-like DNA transposon. XX KW hAT; DNA transposon; Transposable Element; CRYPT1; KW hAT superfamily; transposase. XX OS Cryphonectria parasitica OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Diaporthales; OC Cryphonectriaceae; Cryphonectria-Endothia complex; Cryphonectria. XX RN [1] RP 1-3563 RA Hillman I.B., Foglia R., Linder-Basso D. and Zhu P.; RT "An Activator-like transposon from the chestnut blight fungus, RT Cryphonectria parasitica."; RL Unpublished. XX RN [2] RP 1-3563 RA Hillman I.B., Foglia R., Linder-Basso D. and Zhu P.; RT "CRYPT1."; RL Direct Submission to Genbank (29-JUN-2000)Plant Pathology, RL Rutgers University, 59 Dudley Road, New Brunswick, NJ 08901, USA. XX DR GenBank; AF283502; Positions 1 3563. 
XX CC CDS 402..3239 CC /codon_start=1 CC /product="transposase" CC Transposase: CC MANSNAFNQLSHARYTSIQKQIDAHDATAQFSLGTLKDLPRIKVPGRSHAFVPARWMPTSSAKRKRHGSD CC IWHFGEKLFEVIVQADGTCKVAADWWFCRLCDQNNRQTLYTTRNSISTSTSAALKHLEGRDNEGHNLGKL CC KELVAEAEKEEVSPEELGRRKRRRINHRGLFNNSSLYTAPDALKDAANAPTRAVVSLAKKAITELVACCD CC IPLSTLEAPQFRQLLMVWNSTLTEHFFPRSDGTVMRWLFEAFHENLAGLQLQLQREAISKIHLSADLWTS CC PNHKGILAVVAHFVDSDAKLRNRLIAVRMINGQHTGENLQWHLHQVIMAFQLEERLGWLTFDNDSTNDKA CC TRLLFRKMYGFNHREAEKLRIERRCRCWGHIINLIVKGFLSGNAKQILEAEDSDADEVSRAVLYLPGLLL CC IFKQELEQQRNQWDQWQREGPIRRAHLIVVFLKASVQRVQDFEKIVQNSLFSPEDQDLEDTTAFLAEALP CC QPTAQDLTMPPPPLPTDPTDEQTGDISTLRPVVAQETRWNSTEAEISRLLRLKRPIDFWVKEQVRSGAKI CC QPLSEQDWRVLEATRDLLQPFLWHTKHHEGNAVAVHDVLSSLWDLEKHLRSFQERFGGRKTYAEEEILEE CC IIVSPNAEKPALQRPSSRYQAPSRPSSRPSPRPSPRPSSRLYSRPSSQRSSRPSPRPSPRSSPRRRASQS CC SDIFPSTPVREWPDVGSQQEAEIYPEECGLEGAEFLRESVAHALAILKRYKSKLRECPIYFAAAVLHPKY CC RLVGIRTAAPAIADDVEQEFKAFYEKYWAYKDPALSSASTPSLQEDQEHNKPKKQWRTAFRAQPRPKHQR CC PQSELEKYLQLEEDDEDDDGFEPLVWWAARKAEFPALSQMAFDVLSIPAMSSECERVFSQGKLTMSSQRR CC RMKSSTLELLLCLKDWLKNGTLAGPAAEMRRKGIA. 
XX SQ Sequence 3563 BP; 882 A; 934 C; 894 G; 853 T; 0 other; cagcgttcca cacaagtcaa gcggaatcaa gcggcttgac aagcgaagtt gacggtttgg 60 cgatggtctt caagacaagt aagcggcctt tttgaccaag cggcttgcaa gcgaacaagc 120 gaattgcctc agtcctattg tgggtcctta taggctcagt tgaagctatc gcggcccact 180 ctggatgcaa ccttgctggg gtttttggtg ttttggccta ggcttttgcc ctggcccttg 240 gctattcggc ctatatatcc gtattaagtc ttttcctccg ctggcaggca agctcttctg 300 tacctcttct acccttgtac ttggcatcct cgcgtttact tcctcgcgtc tcgcacctgg 360 gtgtcctcgc gtttatctat catcgctttc ctttcactat catggctaat agcaatgctt 420 tcaaccagct tagccacgcg agatatacct ctatccagaa gcagatcgat gcccacgatg 480 ccacagcaca gttctctctt ggtaccctca aagacctccc acgcattaag gttccaggac 540 gatcccatgc ctttgtccca gctcgctgga tgccaacatc atcggccaag aggaagcgcc 600 acgggtctga tatctggcat ttcggcgaga agctcttcga ggtcatcgta caggctgacg 660 gcacatgcaa agttgcagcg gattggtggt tctgtcgcct ctgcgatcaa aataatcgac 720 agaccttata taccacgcgt aacagtatct cgacaagcac atcagcagct ctaaagcatc 780 tcgaaggtcg cgataacgag ggccataacc ttggcaagct gaaagagctt gttgcagagg 840 cagagaaaga ggaggttagc ccagaggagc ttggccggcg aaagcggcga agaataaacc 900 atcgtggcct tttcaacaac agcagtctat atacagcgcc tgatgccttg aaagacgctg 960 ccaacgctcc aactcgtgca gttgttagtc tcgcgaagaa ggctataaca gagctggttg 1020 cttgttgcga tatacctctt tcaactctcg aggcgcctca gttccgccag ctgttgatgg 1080 tctggaatag caccctgacg gagcactttt tccctcgaag tgatggcaca gtgatgagat 1140 ggctgtttga ggcgtttcac gagaacctag ctggtcttca gctccagctc cagcgcgagg 1200 ctatctcgaa gattcatcta tctgctgatc tctggacctc gcctaaccac aaaggcattc 1260 tagcagtggt tgcgcatttt gttgatagcg atgcaaagct gcgaaatcgc cttatagcag 1320 tgaggatgat caacggccag catactggcg agaacctcca gtggcatctc catcaggtta 1380 ttatggcttt tcagcttgag gagcggctcg gctggcttac ttttgataac gacagcacga 1440 acgacaaggc aacaaggctt ctttttcgca agatgtacgg cttcaatcat cgtgaagctg 1500 agaagctgcg gattgaacgg cgatgccgct gctggggcca tattattaac cttattgtca 1560 aaggcttcct ctctggtaac gctaaacaga tactcgaggc tgaagacagc gatgctgatg 1620 aggtaagtcg agctgttttg tatcttcctg 
gcttattgct aatatttaag caggaactcg 1680 agcaacagag gaatcagtgg gatcagtggc agcgagaagg acctatacgc cgcgctcatc 1740 ttattgtagt cttcctcaag gcaagtgtac agcgggtcca ggacttcgag aagattgttc 1800 agaactcttt gttttctcca gaggaccaag atctcgagga cactacagcc tttttggcag 1860 aggcccttcc tcagccaaca gcacaagacc tcacgatgcc accaccaccg cttcctacag 1920 accctacaga tgagcaaact ggcgatatat ctaccttgcg gccagtggta gctcaggaga 1980 cgcgatggaa ctcaacagag gctgagatat cgcggctgct gaggctgaaa agacctattg 2040 atttctgggt taaagagcaa gttcgaagcg gtgctaaaat acaacctttg agtgagcagg 2100 actggcgagt gcttgaggca actcgagatc ttttgcagcc attcctatgg catacaaaac 2160 accacgaggg gaacgctgta gctgttcatg atgtcctcag ttctttgtgg gatctcgaga 2220 agcatttgcg gtcctttcag gagagatttg gaggcagaaa aacgtatgct gaagaagaaa 2280 tccttgagga gattattgtc tcgccaaatg cagagaaacc agccctccag cggccctcat 2340 cgcgatatca ggccccttca agaccttcct ctcggccctc gcctcgaccc tcccctcgac 2400 cctcctctcg actttactct cggccctcct ctcagcgttc ctctcggccc tcccctcgac 2460 cctcccctcg atcctcccct cgccgtcgag cttcacagtc ttcagatata tttcccagca 2520 ctccagtacg ggaatggcca gatgttggtt ctcaacaaga ggctgagata taccccgagg 2580 aatgtgggct tgaaggtgct gagtttctga gagaatcagt agcccacgcg ttggcaattc 2640 tcaagcgata taagagcaag cttcgcgaat gccctatata ttttgctgct gctgttttgc 2700 atccgaaata tcgcctggtt ggtattcgta cagcagcgcc agctattgct gacgatgttg 2760 aacaagagtt taaggctttt tacgagaaat attgggcata taaagatcca gccttatcat 2820 cagcatcaac acccagcttg caggaagatc aagaacataa caagccaaag aagcagtgga 2880 ggaccgcctt cagggcacaa ccaaggccaa agcatcaacg gccacagtca gagctggaga 2940 aatatctcca gctagaagag gacgacgagg acgacgatgg ttttgagccc ctcgtctggt 3000 gggcagcgcg taaggccgag tttccagccc tctcacagat ggccttcgac gtcctatcta 3060 tccccgcgat gtcgtccgaa tgcgagagag tcttcagcca agggaagctt acgatgtcct 3120 cgcagaggag gagaatgaag agcagcactc tcgagctcct tttgtgcctc aaggactggc 3180 tgaaaaatgg aacgctggca ggaccagcag cagagatgag gaggaaaggc attgcctagc 3240 agctttttgc caaaaaaagg cattcgaggg ggtcagaaat gtttctttca gcacttgagc 3300 gagggacctc gcgcggatca gagcgtaaca aggaaaacta 
tcatcatatg cctctgtacc 3360 ggttgaacca cgtgcgtagc tcatttgtta ctctcaccac ctttcgtttc gtatatctat 3420 cgtacattcc taatccggta tagtgaccaa gtcaccaagc gaatcaagcg gccaggtact 3480 agaagacaag caagcggcca aaatagttag caagtcaagt ggagggctac cgcttgcctt 3540 gacttgactt gtgtggaacg ctg 3563 // ID Gypsy-26_MLP-LTR repbase; DNA; FNG; 193 BP. XX AC AECX01000138; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 28-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_MLP_; KW Gypsy-26_MLP-I; Gypsy-26_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-193 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000138; Positions 110689 110497. XX SQ Sequence 193 BP; 65 A; 54 C; 33 G; 41 T; 0 other; tgttatgagc gtacactggg ggaaagactt gtgaaacatc agtacacatg tcacgagttg 60 aacccatatc ttgtaccaca caacactgta ctagcttttc ccttatccga caatcacaat 120 actaaggata caagaaccag acccagatca gaacctcact cccgtgccag aacccagaca 180 gtagacctta aca 193 // ID Gypsy-46_MLP-LTR repbase; DNA; FNG; 369 BP. XX AC AECX01001225; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-46_MLP_; KW Gypsy-46_MLP-I; Gypsy-46_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-369 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). 
XX DR Genome; AECX01001225; Positions 217160 217528. XX SQ Sequence 369 BP; 68 A; 73 C; 74 G; 154 T; 0 other; tgtaaggtta cacctcacaa tcacaggtgt gacatgtatt atcagactat agtagattta 60 gtgtacacag ttgtttgatt ttctttctat ctttctcatt ctctcttctg ggtttctttc 120 gagatagtct ttgtagcgtc tggagtttct ttgatctgat gtttagttga gctaggtatg 180 gatctttcta tctttctcat tctctcttct gggtttcttt cgagatagtc tttgtagcgt 240 ctggagtttc tttgatctga tgtttagttg agctaggcct agctttcgca ataccaattc 300 aggattctag atactcttgg ctccgccact cgtgcctctt agtgtagctc ctttttatag 360 gagcttcca 369 // ID copia-3-LTR_AN repbase; DNA; FNG; 249 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 8.11, Last updated, Version 1) XX DE Long terminal repeat of copia-3_AN LTR retrotransposon - a DE consensus sequence. XX KW Copia; LTR Retrotransposon; Transposable Element; KW COPIA superfamily; copia-3-I_AN; copia-3-LTR_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-249 RA Kapitonov V.V. and Jurka J.; RT "copia-3_AN, a family of copia LTR retrotransposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(11), 202-202 (2003). XX DR [1] (Consensus) XX CC It is a long terminal repeat of Copia-3_AN LTR retrotransposon CC characterized by 5-bp target-site duplications. CC The genome contains 10 copies of LTRs 8% divergent from the CC consensus sequence. XX SQ Sequence 249 BP; 68 A; 54 C; 59 G; 68 T; 0 other; tgatacggaa atatgaggcc acgtgatctc tggtagattg accggacatc gcctgtttct 60 agaaggttcc taccctctgt atatatatag tggggaagag aggaatacac acaagggaag 120 caatgaagga atctatcgtc aacaagatag gtgtcaatcg tcatatggca tccgatcctt 180 agtaggctac gttggtgtag ttgatgcagt ttcctcctgc cctttctcct ctgagcatat 240 tccacaaca 249 // ID Gypsy-111_MLP-LTR repbase; DNA; FNG; 192 BP. XX AC AECX01000696; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel.
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-111_MLP_; KW Gypsy-111_MLP-I; Gypsy-111_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-192 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000696; Positions 1303 1494. XX SQ Sequence 192 BP; 53 A; 50 C; 34 G; 55 T; 0 other; tgttatgaat gcttatacat ggaactcgtg taagagacac ttgacagttg gcgcacatgt 60 cacaggattg tggatcatct agacttgtac acacccttgt atcagagctt ttctcttcat 120 ctgacaatct caataagacc ctggaagcac cttcactccc gtttccaaga accttgagat 180 cccaccttaa ca 192 // ID Gypsy-16_RO-LTR repbase; DNA; FNG; 568 BP. XX AC AACW02000194; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_RO_; KW Gypsy-16_RO-I; Gypsy-16_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-568 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000194; Positions 321264 321831. 
XX SQ Sequence 568 BP; 241 A; 84 C; 63 G; 180 T; 0 other; tgttgtagaa cagtaactga ttgtagttga cattaataaa tataagtctt aatacaagat 60 cataattaaa ataaaataaa atacaagttt atttaaataa gatcatacaa tcaattccaa 120 ttaaacatgt tcttttagaa tatgtattgt attcatgaga gttctttgat aagaagaagt 180 atcaactaaa atgaccaaac ccaatccaat gtcaggaata ttgctaatct ttcacataat 240 tgaaataacg cgtaaaataa cgttattcaa atcagtgagg aattcccaga ataaaagata 300 atctatgaca gacatacaaa agaacaaatg aaaagttttc ataatcataa ttatagataa 360 cacttgctaa agtacctgaa tattagaaga acgataacaa tagaagatat caagccactc 420 actcaataag atataaatac ctgccttcaa ttagaataat ttttacttta acttaaactg 480 aaataaagat cagtctcaaa aagtacccaa actgtttctt gatctttgtt tttacctttt 540 cctttattat ttaaaccatt atacaaca 568 // ID Gypsy-35_MLP-LTR repbase; DNA; FNG; 362 BP. XX AC AECX01000146; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-35_MLP_; KW Gypsy-35_MLP-I; Gypsy-35_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-362 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000146; Positions 6826 6465. XX SQ Sequence 362 BP; 92 A; 86 C; 42 G; 142 T; 0 other; tgtaattccg gaacttctta tttccttctt tttacttttc tatcttttgt aaaacgcttt 60 cggaaaagcc ttacgacttt tcctgcaagt gtcttttcct ttttctatca tctcataatc 120 atcataatgg tctactatct atattcggga tttattccta aaatcccttg acttgtatct 180 tttccaaact cggattcctc taagcagaat cctctcgttg taaactctca agttgtactt 240 tcctttttca gtctctagtg tttttcaaaa acacttataa agaggatcca cttctcgctg 300 aaatgaatcc tcaagtcact tcgcttacat tttcaaaagc acctgatcct gtaagaatca 360 ca 362 // ID Mariner-9_AN repbase; DNA; FNG; 2125 BP. XX AC . 
XX DT 02-JUL-2007 (Rel. 12.06, Created) DT 13-JUL-2007 (Rel. 12.06, Last updated, Version 1) XX DE DNA transposon, Mariner superfamily, Pogo clade. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Mariner-9_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-2125 RA Galagan J.E. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-2125 RA Clutterbuck A.J., Kapitonov V.V. and Jurka J.; RT "Transposable Elements and Repeat-Induced Point Mutation in RT Aspergillus nidulans, A.fumigatus and A.oryzae."; RL Chapter in "The Aspergilli: genomics, medical applications, RL biotechnology, and research method." Edited by Goldman GH and RL Osmani SA. Publication expected 2007. XX RN [3] RP 1-2125 RA Clutterbuck A.J.; RT "Mariner-9_AN."; RL Direct Submission to Repbase Update (02-JUL-2007). XX DR [3] (Consensus) XX CC TA target site duplications; 45-bp TIR. Two complete copies in CC genome, 96% identical, both showing evidence of RIP, but ORFs CC AN3473.3 and AN8487.3 show similarity to pogo-subclade CC transposases. Consensus sequence includes the less RIP-affected CC alternative at each disputed site.
XX FH Key Location/Qualifiers FT CDS 100..1965 FT /product="Mariner-9_AN_1p" FT /translation="MTPMDAAIEAIESLKPGDSINYTKIAKEFGVNRITLS FT RPHKGIQRSRRDQYEEQRILNDQQAKDLIKYIDKLSGKGLYISHEMLRNFA FT KELTGKKPGNHWPGRFLKRHQIELSSAYTTAMDSNRKRADSAYKYSRYFDL FT LAQKLDKYKVEPGNIYNMDKKGFLIGMLSKGLRIFSKRKYKQGNFKQRLQD FT GNREWITAIACICADGTLLSPVLIYQAASSDIQDTWLQDFDPQHHKTFFAS FT SPSGWTNDKLGYAWLTGVFDRETKDKARRQWRLLFLDGHGSHLTMKFFDYC FT DDNKILLATYPPHSTHSLQPLDVGIFSPLSHAYSSELEAYLHISMGLSHIT FT KRDFFRLFFPAWVKALSSKNIISSWRTVGIHPFNPEIVLARFSREPQSRPS FT TSESSRSILGAEDWRKIKKLLHDVVEDVYSENTRKLSLAMHNLSTENILLK FT LQCEGLQIALQNEKKKRQRGKPLQFQLKASDDGGAVFYSPQKIQQARDLQL FT GKERAAEQLKASKEEQKVRRQQEKEAKQRLIEDRRKIRASQREIHRLEAEQ FT KRQEKEDARISKEAAKQLQIDFQQAKKTPRKSSKASNHTDTQDTGPPSHVV FT VEEVPPTVNRRGREIRLPQHFRTN" XX SQ Sequence 2125 BP; 649 A; 490 C; 481 G; 505 T; 0 other; cagtgggtga tcacctgacg atcgttccca cctgacgatc gattttcccc tcaccgatat 60 ttcaaccacc aaacgtcaaa ctcttgtatc tcattcaata tgactccaat ggatgcggcg 120 atagaagcaa ttgaatcgct aaagccaggc gattcaatta attatactaa aattgcgaaa 180 gagttcgggg tcaaccggat aactctgtca agaccccaca aaggaattca gcgctctagg 240 agagaccaat atgaagaaca gcgaattctc aatgaccagc aggccaagga tcttataaaa 300 tacattgata agctctctgg caaaggccta tatatatcgc atgagatgct tcggaatttt 360 gcaaaagaac tgacaggaaa gaaaccagga aatcactggc ctggccgctt tctaaagcga 420 caccaaattg aactctcctc tgcctataca actgctatgg actccaatcg aaagcgagct 480 gattctgcat acaaatattc gcgatacttc gacttattag cccagaaact tgataaatac 540 aaggtggagc cagggaatat atataacatg gataagaaag gatttcttat tggaatgctg 600 tcaaaaggtc tcaggatctt ctcaaagcgc aaatataagc aaggaaactt caagcagcgc 660 ctacaggatg ggaatcgcga atggataact gcaattgcct gcatctgtgc tgatgggacc 720 ttgctatccc cagtgcttat ttaccaggca gctagcagtg atatacaaga tacctggcta 780 caggatttcg atcctcaaca ccacaagacc ttttttgcct cctctccaag tggttggaca 840 aatgacaagc ttggatatgc ctggttgact ggagtttttg accgggagac aaaggataaa 900 gcgcggaggc aatggaggct cttattcctt gatggccatg gatctcacct taccatgaag 960 ttcttcgatt actgcgatga caataagatc cttttagcaa 
catatcctcc acattcaacg 1020 cattcactgc agccgcttga tgttgggatc ttcagcccgc tttcccacgc ctacagcagc 1080 gaactggagg catatctgca tatatccatg ggactaagtc atattacaaa acgggacttc 1140 tttcgcctct tcttcccggc ctgggtaaag gccttatcaa gcaaaaatat tatatcttct 1200 tggagaacag ttggaataca tcccttcaac cctgaaattg ttctggcgag atttagcaga 1260 gaaccgcagt caaggccatc aacaagtgag tcctcgcgct ctatattagg tgcagaagac 1320 tggcggaaga tcaagaagct cctccatgat gttgttgagg atgtatacag tgaaaacacc 1380 aggaagctta gtttggccat gcataacctc tctacagaga atattcttct aaagcttcaa 1440 tgcgagggcc tccagatagc cctccagaat gagaagaaga agcgtcagcg cggaaagcct 1500 ttacaatttc aattaaaagc ttcagacgat ggtggtgcag ttttttactc ccctcaaaaa 1560 attcagcagg cgcgagacct tcagcttgga aaggaaagag ctgctgaaca gctaaaggcc 1620 tctaaagagg agcaaaaggt ccgccggcag caagagaaag aggcaaagca gcgcctgatt 1680 gaggatcgca ggaaaatccg ggcatctcag cgagaaatac accgcctgga ggcagagcaa 1740 aagaggcagg agaaagagga tgcccgtata tcaaaggagg ccgcgaagca gcttcaaatt 1800 gacttccaac aggcaaagaa gactccaagg aagtcctcta aagcttcaaa tcatacagat 1860 acacaggaca ctggcccgcc atctcatgtt gttgttgaag aggtccctcc tacagtaaat 1920 cggcgaggcc gcgagatccg gctcccacag cactttcgga ccaattaaaa ttgacagaac 1980 tactctaaat tattactata ttatgccgcc aaaaatttga gtgataatat tagttgtatg 2040 tggttgaatt gcttcatgtt tgttgtgctg gtggtggttg aaatcgatcg tcaggtggga 2100 acgatcgtca ggtgatcaac cactg 2125 // ID Copia-6_MLP-LTR repbase; DNA; FNG; 479 BP. XX AC AECX01000139; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_MLP_; KW Copia-6_MLP-I; Copia-6_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-479 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000139; Positions 89300 89778. XX SQ Sequence 479 BP; 127 A; 91 C; 56 G; 205 T; 0 other; tgttggagta tattcatatc tacatgcgta atactaacaa ctctaaggtt atgtttctgt 60 tttaattgct taattactta tgtcatgaat atatccattt tgtactcacg tgatttaacc 120 taaccaaaga ttgtatcagt gttctcatat atcttgtttc ctgtttcgaa agcttttctt 180 attcattcaa gctttcttct tttacaaact ataaacaata cctcatccct aacatctttc 240 agcgtgcttt caagatatcc tagactgagt ttatttgttt ctgtttgttt ctgatcaggt 300 atgatagaac acaatcttgt tttcattctt tgctttcatt tattactatt attatgaaag 360 caattcattc aagctttctt cttttacaaa ctataaacaa tacctcatcc ctaacatctt 420 tcagcgtgct ttcaagatat cctagactga gtttatttgt ttctgtttgt ttctgatca 479 // ID TCA5_LTR repbase; DNA; FNG; 685 BP. XX AC AACQ01000342; XX DT 05-AUG-2005 (Rel. 10.08, Created) DT 05-AUG-2005 (Rel. 10.08, Last updated, Version 1) XX DE Copia-like LTR retroelement from Candida albicans. XX KW Copia; LTR Retrotransposon; Transposable Element; TCA5_LTR. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-685 RA Jones T., Federspiel N.A., Chibana H., Dungan J., Kalman S., RA Magee B.B., Newport G., Thorstenson Y.R. et al.; RT "The diploid genome sequence of Candida albicans."; RL Proc Natl Acad Sci U S A 101(19), 7329-7334 (2004). XX RN [2] RP 1-685 RA Jurka J.; RT "TCA5: annotation of LTRs."; RL Direct Submission to Repbase Update (05-AUG-2005). XX DR Genbank; AACQ01000342; Positions 265 949. XX CC LTRs are identical and ORF appears to be intact. 
XX SQ Sequence 685 BP; 277 A; 108 C; 122 G; 178 T; 0 other; tgttggaaaa atgaaatatg atcaatcctg catctagaac ctgtggcaga atgaaaccta 60 cgagattatg aatgacttgt gaatacaagt tgaatgttac agaatgttac caagaaggtt 120 acacttgaat atatgaatga ctagaaagtg aattgaatgt tacagaacct gaataacaat 180 gttacacgaa tgtgtgaatg atatgagttt atctatagta atgtgacata tacacaaagg 240 tgtgaatgac cgagaaaaca gatgttacat tacgggcact ggagagtgca agtctaaaga 300 atcttggagt agaaataagt aatataaaaa ggaccaaaga ttctttagag aaaagtaaat 360 gaaactatat tagattttat ataactaact aacaaataaa taaaaaatat aatatgtcta 420 caatgccacc aacttccaaa cgtactagaa agagaactag aaccgatgat aatgctgaac 480 caactattca agatccttca ccgccacttg ctaatgttga acccacaatt caagagactc 540 caccgctggt tgaagttagt gatgagacta attcaactga aatcaatgag acaaatagta 600 atactcatga agaaacaaat gtattaacta atgtgcactc ctctccaatc gagacagtta 660 ctgagaggaa cttcaatttt caaca 685 // ID Gypsy-42_MLP-LTR repbase; DNA; FNG; 184 BP. XX AC AECX01001151; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-42_MLP_; KW Gypsy-42_MLP-I; Gypsy-42_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-184 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001151; Positions 123285 123102. XX SQ Sequence 184 BP; 49 A; 53 C; 29 G; 53 T; 0 other; tgttatgatc tcacatgtca acactctgtg acattacttt atcacatgtc actaagtaga 60 ttaccacatc agttgtgcgt tgcaccagtt gtgcattcct catctgacaa tctacctata 120 tagccagaac caaccttgct gagccacgac tcttgtcgaa gaacccccga tcctggcctt 180 aaca 184 // ID Copia-14_MLP-I repbase; DNA; FNG; 4708 BP. XX AC AECX01002284; XX DT 22-APR-2011 (Rel. 
16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-14_MLP_; KW Copia-14_MLP-LTR; Copia-14_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4708 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002284; Positions 26445 21738. XX CC Positions [1990-2514] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 193..4692 FT /product="Copia-14_MLP-I_1p" FT /translation="MSSSPIDSNFEDTSITDTESNSSDKSHRTARQDPDPE FT STTSENSREIPPTIMSGEEPVAVFGNALQRYGQQLTNALSKFKVTEDLEDG FT NYPTWQRSIFDNLETLELHHYITVKDFVDKELSSDKVLKTKKVIVSHILNR FT LDKANHCQAINRLTNPEDPNSIIHDPFILWTFLKERHFLINAQRLAAITKS FT LSTITISRGDTLPGYLDKFENLFVEFTRYGGKMDDTQSAIRLIDSIDRLPE FT STVEFIHSTISPLTRREVTQYLRAYDTRHNFSTEATREARNVEQSASATSR FT GRRIECTESVCKGPHPAEKCWSKPSNFPERDDFLARRRANRSGWNQRSAAN FT SNSRPSNNPVRGMRRVNASPASASSITQSLGMMSLHTSFEVISLDTPPAAN FT MSTSNSSDTVWALHDTGATNHMFNSQSLFVPDSLKPVDDPNRRVKLAGGDA FT TLAVEGVGKVHLKAGDGSIFELTECLLVPELSKNLIAGGILKKKGVRETFD FT PTDPNCFALVKNGLALFNGIILENGLMNVVIESVSRPASNLNSSSEASAHL FT CSSILHRRLGHISQPYLDKMVKHGSLDGLVMKTEDITECDICPLSKNTKLP FT FNHSRPRALRFLENIHVDISGINRVKGLRNESYYILFCDDYSSYRHIFSLV FT DRSKTEVFKVFKSYIAVAERQTGCRVKQFTLDRGGEFLNDLLGPELDSLGI FT TLHLTAGHTPEQNGVSERGNRTVITKARCMMLESGVPQSFWYLACSMAVFL FT TNRSITKAVSDFRTPFEVWHFRKPSINHLKVFGCQAYRLIRKELRQTKYTP FT VASPGILVGYEQDNFNFHIYDIKDRKIYITPHVTFNENLFPFNSEKTSVNS FT GNDLVKSFYFDDDEMLMESVIETNHENGPAGSEEIEDSQTNLNPILTDPPD FT VKTTESSSNDTSPQIKQTNVPVEIRRSGRNQTSVNYKGMCNAASYESCEFQ FT SYFSCLPQDCYAVSSATPDPKSYKKAMLAGDSPNWRKACDKEMESLLKKGV FT WSLVDRPKSKCIIRGMWIFRTKINSDGTKKFKARFVALGNTQVEGLDYGET FT 
FAPTGKPTSFRLLVAMASINGWEIHQMDAVTAFLNGILEEEIYIEQPEGYV FT IVGYEGKVLKLNKSLYGLKQSPKIWQDDVQAYLVSIGFIQCQIDHCIYIRS FT DESLQKFTAVYVHVDDMAITGNDALQFKQEISSKWEMEDLGLAQIVVGIEI FT KRNNSHSYSLTQTRFAETVLERFNMSEVKSASTPLTPNLKLYRSTDEEVES FT FALLKLNYRSAVGSLMYLSQCTRPDLAYAVGVLSQHLDRPSISHWNAALQV FT FRYLKGTVHLGITYNGKISTRVSGQKSFDLPVSHCDADWAGDKSTRRSTTG FT YIFTLAGGALSWRSRLQPTVALSSTEAEYRAVTEAGQELLWLRRMMEAFGC FT HDAQPTILQSDNLGAIHLTSKSTFHGRTKHIEIHHHWIREVVKNGDIKLKH FT CSTGEMVADLLTKALGKQQFIKLRSKLGLIQNEPQ" XX SQ Sequence 4708 BP; 1389 A; 1069 C; 951 G; 1299 T; 0 other; cctttttaaa ggttcgtcta aaggagttta gcactcactt tcatctgtca gcccgtgtct 60 cgtcaacacc aggctaagtg tcttttacac ctttttaaag gtttctgtcg tagatcaaca 120 gcgtcgagct tgttttctta catggtagcg agaggttcga acgacacctc atattcgtat 180 tagatccatt tcatgtcctc gagtccaatc gattccaatt tcgaagacac ttccataacc 240 gataccgaat caaattcttc cgataaatct catcgaactg cccgacaaga tccagatcct 300 gaatcaacca cctccgaaaa ctctagagaa attcctccta ctatcatgtc cggagaagaa 360 cctgttgccg tatttggaaa tgctcttcaa cgttatggtc aacaactaac caacgctttg 420 agtaaattca aagttaccga agatctagag gatggtaact accccacctg gcaaagatcc 480 atctttgata atctggaaac tctcgaactg caccactata tcacggtgaa agattttgtg 540 gacaaggaat tgtcaagtga taaagttttg aaaactaaga aggtgattgt gagtcatata 600 ctcaatcgtt tggacaaggc aaatcactgc caagccatca atcgtttaac gaatccagaa 660 gatccgaatt caatcatcca tgatcctttt atcttatgga ccttcttgaa agaacgtcat 720 ttcttgatca acgctcaacg tctggctgca atcactaaat ctttaagcac catcactatt 780 tcacgtggtg atactttacc tgggtatctg gacaagttcg aaaatctttt tgttgaattc 840 actcgttatg gtgggaaaat ggacgacact cagtccgcaa ttagactcat tgattcaatt 900 gatcgacttc cggaatctac tgtcgaattc attcactcta ctatttcacc tctaacccga 960 cgtgaagtta cccaatacct tcgtgcatat gacacccgtc acaatttttc aactgaagca 1020 actcgagaag ccagaaatgt tgaacaatcg gctagtgcaa cttcaagagg acgccgaatt 1080 gagtgtaccg aatccgtctg caaaggccct catccggctg aaaagtgctg gtcgaaacct 1140 tctaacttcc ctgaacgcga tgattttctt gcccgtcgac gtgccaacag atccggatgg 1200 aatcaaagat ctgcagccaa ctccaactca cgaccctcaa acaatccagt 
cagaggtatg 1260 agacgggtaa acgcatcgcc tgcttctgct agctcaatta ctcagtcttt aggcatgatg 1320 tctttacaca caagttttga agtaatctct ctcgacacac ctcctgcagc caacatgtct 1380 acgtccaact cttccgacac tgtgtgggcc ctccatgaca ccggtgcaac caatcacatg 1440 ttcaattctc aatctctctt cgttcctgac tcgttgaaac ccgtcgacga tcctaatcga 1500 agagtgaagt tggcaggtgg agatgctacc ttagcagtcg aaggagttgg caaggttcac 1560 ttgaaagcag gagatggatc catattcgaa ctcactgaat gtctgcttgt cccagaactc 1620 agcaaaaatc tcatcgccgg aggaatcctg aaaaagaagg gtgtccgtga aacttttgac 1680 ccaactgatc caaactgctt tgcccttgtc aagaatggac tcgctctatt taatgggatc 1740 atccttgaaa atggactcat gaatgtggta atcgaatcag taagccgtcc ggcttctaat 1800 ctcaactcgt cttccgaagc atcagctcat ttatgctctt caattcttca tcgtcgctta 1860 ggccatataa gtcagcctta cctcgacaag atggtcaaac atggaagttt agatgggctt 1920 gtcatgaaaa ctgaggatat aacggaatgt gatatttgtc ctttgtctaa aaacaccaaa 1980 cttcctttca accactcccg tcctcgtgct ctcagatttc ttgaaaacat tcatgtagac 2040 attagtggca ttaatcgagt taaaggctta agaaatgaat cttattatat tctattctgt 2100 gatgattact ccagctacag acatatcttt agtcttgtag acagatcaaa gactgaagtt 2160 ttcaaagtgt tcaaaagtta tattgccgta gctgagcgcc agaccggttg cagagtcaaa 2220 cagtttactt tggatcgagg gggtgaattc ctcaatgatc tcctcggacc tgaactggat 2280 tcactcggta ttactctaca cttgaccgct ggacacactc ctgaacagaa cggagtgtcg 2340 gagagaggta atcgcacagt catcaccaag gcgagatgca tgatgcttga gtccggtgta 2400 cctcagtctt tctggtatct ggcttgttcc atggctgtat ttttaacaaa tcgaagcatt 2460 acaaaagcag tatctgattt ccgtacgccg tttgaagtct ggcactttcg aaaaccaagc 2520 atcaatcatc taaaagtctt cggatgtcaa gcatacaggt taattaggaa agaacttaga 2580 cagactaaat acacacctgt tgcatcccct ggtattcttg ttggatacga acaagacaac 2640 ttcaactttc atatctacga tatcaaagac agaaaaatct atatcacacc tcatgttaca 2700 tttaatgaaa acttgtttcc tttcaactca gagaagacat cggtcaattc tggcaacgat 2760 ttggtgaaat cattctattt cgatgatgac gaaatgctta tggaatccgt gattgaaaca 2820 aatcatgaga atggtccagc tggctctgaa gaaatagaag actcacaaac aaatctgaat 2880 ccaattttaa ctgatcctcc tgatgttaaa acgactgaat caagttcaaa tgatacatct 
2940 cctcaaatca aacaaaccaa tgtaccagtt gaaatccgac gctcaggtcg aaatcaaact 3000 agtgtaaatt ataagggaat gtgtaatgcg gcaagctatg aatcatgtga atttcagtcc 3060 tacttttcgt gtcttccgca agattgttat gctgtatcga gtgctactcc tgaccccaaa 3120 tcttacaaga aagccatgtt ggcgggtgac tcccccaact ggcgtaaagc ttgtgataaa 3180 gagatggaat ctctcctcaa gaaaggcgtg tggagcctgg tagatcgacc taaaagcaaa 3240 tgcatcatca gaggtatgtg gatctttcga acgaaaatta acagtgacgg caccaagaaa 3300 ttcaaggcga gatttgtggc tcttggtaac acgcaagtag aaggtctcga ctacggtgaa 3360 acattcgctc caactggaaa gccaacctca ttcaggcttc ttgtggccat ggcatccatc 3420 aatggctggg agatccatca gatggatgcg gttacagctt ttctaaacgg aattttagaa 3480 gaagagattt atattgagca accagaggga tatgtcattg tcggttatga aggcaaggtg 3540 ctcaaactga ataagtcttt gtatggtctc aagcaatcac cgaagatctg gcaagacgac 3600 gttcaggcct accttgtcag cataggtttc attcagtgtc aaatcgatca ctgtatctac 3660 atacgttcag atgagtctct tcaaaaattc acagccgtct atgttcatgt cgacgacatg 3720 gcgatcactg ggaatgacgc tcttcaattc aaacaagaaa tatcctcaaa atgggaaatg 3780 gaggatcttg gattagctca aatagttgtc ggaattgaaa tcaaaaggaa caactcacac 3840 tcttactcat taactcaaac tcgtttcgcc gaaaccgtac tcgaacgatt caacatgtct 3900 gaggtcaaat ctgcatcaac tccacttacc cctaatctta aactttacag gtcaactgat 3960 gaggaagtag agagttttgc tttactaaag ctaaactacc gaagtgcggt aggatcgttg 4020 atgtacttat cacagtgcac tagacctgat ctggcctatg cagtcggggt actgtcacaa 4080 catctcgata gaccttccat ctctcattgg aatgcagccc ttcaagtctt tcgatatctg 4140 aaaggaacgg ttcacttggg tatcacttac aacggcaaga tttcaactcg agtctctgga 4200 cagaagagtt tcgatttacc tgtctctcac tgcgacgctg attgggcagg cgacaaatcc 4260 acacgacgtt caactactgg ctacatattc actttggcag ggggtgcatt gtcctggagg 4320 agtcgtttac aaccaactgt agctctgtct tcaacagaag ctgaatatcg cgccgttact 4380 gaggcaggtc aagagttgct gtggctacgt cgtatgatgg aggcttttgg atgtcatgat 4440 gctcaaccta cgattcttca aagtgataat ttaggtgcta ttcatttaac aagcaaatca 4500 acttttcatg gaaggacaaa acatatagaa attcatcatc attggatacg tgaagtagtc 4560 aagaatggtg acattaaact caagcactgt tcaactggtg agatggtagc tgatctttta 4620 
actaaagcct tgggtaagca acagttcatc aagctacgaa gcaagttagg tttaattcaa 4680 aatgagcctc aatagtcttg agggggtg 4708 // ID LTR11_CN repbase; DNA; FNG; 616 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 18-APR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR - consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW LTR11_CN. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-616 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-616 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-616 RA Gentles A. and Jurka J.; RT "C. neoformans LTR sequence LTR11_CN."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX CC Average similarity to consensus is 94%. 
XX SQ Sequence 616 BP; 175 A; 123 C; 192 G; 126 T; 0 other; tgtcacgggc gtcacgtcag tcagcgtagc tgcccggcgc tcaactcgga tgatgtagaa 60 ctcgacgaat gaaagtgaag gattcacgcc tagcagagcc ggacggaaga attgccaggc 120 tgacggaaca gaagctggac ttagaggaag gaatagccag ggcctgaagg agcaggagct 180 ggactaaggg ctgactgaag cgacttaaaa gaaaagaatg ctcggaggca aggtagtgag 240 aaaggacaac agtattgatt ttctggggac tatgaaaact acgaaaagct atcttcttat 300 atacctttcc ttccttctgc ttactcatct ctctcgggag aaagttatcg ctaactcgtg 360 gcggggacgg agctgtcgga ccggggggaa atgaacactg accaagtgtt ggggaagaag 420 aggagagtgc tgaccaggcg aaacgaaaga agaggcaagc cgaaagatgc ctcggttgat 480 tttgcaagaa tcgggcgata aatcgcggtc gtctgaggtg gaggtttgtt tggttacttc 540 ggggacaagg tggaataata tcccgttgcc cggttaaaga cgccgcagag tgcgattctt 600 gcataacatc gtgaca 616 // ID Gypsy-3_PPM-LTR repbase; DNA; FNG; 887 BP. XX AC ABWF01002000; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_PPM_; KW Gypsy-3_PPM-I; Gypsy-3_PPM-LTR. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-887 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01002000; Positions 16145 17031. 
XX SQ Sequence 887 BP; 192 A; 221 C; 225 G; 249 T; 0 other; tgtggtggag gagagtgcgc taggactcaa gtacgatggc acgtacggtt tcggtttcat 60 tttatttttc tgtttatttc ttggtacctt tctctctctc atgcggttga tggctcggtt 120 gtatctgttt cgttccaact ctagggcttg cgaggacagt gtcgaaggcg aggggaaaca 180 gcagcgtagg ccgatcaggg tgccgatgtc gtcacgctct cgagggcggc gagaagcgca 240 agtaaatatg acgcgtcgga ggcgacaccc ttgtgacgct gagacgagag ctgactgtgg 300 actcagaccg gaggctctgg gcttagcttg cttgtgccca ggaaagggct tccatcccga 360 ttaggatagt gttttccatt ttgggacgat gtgtcgatca acggcctcgt agtatgtgcg 420 tcattctctc cttgcaggct gcatatgata tcatagctgt caccgtctct ctgtacatat 480 atataccgac gttggcgtcg atgaatccaa gttccctctt gcactcaaac tcccacattc 540 cctctctcgc gcaaaccgat caagtcatta gtctgtcgca ggtaagtgtt gaactcggta 600 gacggatatt acgagaaaat acgcgcatag tcattcggac tagcctcggg actcattcgc 660 tcgggcatcg tcattcggac gtgcatatcg agacgtatcc ggtattagac gtcgggcata 720 ctattgaagt tattgatcgt tatcgttcac gttcaattga tcgtggactc gttacgcata 780 tagtcagaca ttgtggccaa ataacatttt attacttcgc agctagcctc caacgtcttc 840 accacctttc aaccatcttc gaacatcgac tgtcgacgac cctccca 887 // ID Copia-59_MLP-I repbase; DNA; FNG; 4986 BP. XX AC AECX01000446; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-59_MLP_; KW Copia-59_MLP-LTR; Copia-59_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4986 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000446; Positions 40827 45812. XX CC Positions [2164-2688] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1345..4794 FT /product="Copia-59_MLP-I_1p" FT /translation="MTFLIDFQPIDVQPPPVRITNTSLEVIRTIDVSSGSI FT QTPIDPEVVYLRSSEVAETQHPSAQGSELVDFGNTSSIAPAEGIWALNDTG FT ASHHMFNDIKLFDASTIKPLEDTNKRLRLAGGDASLAVHSTGSVKLRAGDG FT TVFTLADCLYVPELSCNLIAGGALLKKGVVTRINPSEPECFSLVMGQCALF FT NGAFTCNLMLVSLEPVSSILSNQPQQQPNSHACHLQHQRLGHVSQRYINEM FT VKKESVEGLCENSSMVKSCPICLQSKAKRLPFSGTRPRASIFLQNFHVDLS FT GINCTLGLHNEAYFILFTDNYSSFRHIFHLCNKSKETVFNAFLKYITLVER FT QTSCKLIQFTMDQGSEFVNSIMREYCEETGIALHLTAAYTPEQNGVSECGM FT QTIIGRARSMMIQSGVPQLFRYEACSTAVFITNRTITTALPNGKTPYKVWW FT FCKPSVKHLRIFGCQAHLLIRKELRLGKYSPVTSEGVLVGFQDDNFNYYVY FT DLDSRKIILSHNVTFQEDVFPYKVSVSSNVKSGSSPLEFVDDTPRRLPVES FT ANPFQPLTTLEEDDDESLLLDQSHDRDSVTVPSAISPLSPAPASPTPAPTI FT RRSLRVKESTSYKGMGGCAIFDDHTFTSAFNILNEEFSSSLIDIPVPKSIK FT VALKGPEKDLWSQACDKEYNSLMSKGVWVLVDPSPNTNILRGFWLFRKKFN FT IDGTVSKHKARFVVMGNMPQAGVDFHETFSPTGKPSSLRLMIAIAATQGWD FT VHQMDAVTAFLNGVLDEDVYMYQPKGYVVAGQEGKVCHLRKSLYGLRQSPK FT IWYDEVVSFLLSVGFSQCTMDHCIYTRSIKSTSSTHSSFTAVYVHVDDLAI FT TGNDISTFKSQISLKWEMEDLGIASTVVGIQIIRQSQLCYTLTQAALVRAV FT LERFDMSNLKSASTPFPAGLKLYCGSDDEVEAFKHEGLNYSSAVGSLMYLA FT QCTRPDLSYAVGVLSQHLKSPARKHWDAVIHVFRYLSGTINSGISYNGDLA FT APTTIEGLRSFNRPDSHCDSDWAGDKESRRSTTGYIFQLAGGPISWKSRLQ FT PTVALSSTEAEYRAITKAGQELIWLRHMLEVFGFKDPNPTVLRSDNMGAIH FT LTTKSIFHARTKHIEIQYHWIRDTANIMYFSEVKNDYIYFTK" XX SQ Sequence 4986 BP; 1372 A; 1109 C; 971 G; 1534 T; 0 other; gtacttacct tgacctgatt accccaggtc tgtcttttag cttctcgaag ctgttttgtg 60 tggcttgttg ccagtactta ccttgacctg attaccccag gatccacttt attggcaact 120 gcccaacagc tcttgtagct attcgactcc attctttaca tggtagcaag agtctgatat 180 tatatcagat ctctttttaa acaacgacgt cttattcatg caatccgaag aagacaaact 240 ttcgatatca ataactgact ctgagtctca agacaaccca tcttcatctg aatcggactc 300 ggcttccacc gtaaaaagtt ttactggtca aagatcgttg atactaccta atttaaattc 360 tcctacattg ccttccaaca tcatgtccac tgaaccttca tcaaactcag atttaactcc 420 actccaatta ttcgtccata gaactaatgc tgctctgagc aagtttttta ttaagaatga 480 tctgacagac aacaattcaa ttattggcac 
cttctcattc atgaatccat cgagccttta 540 ggctatgaac tgtatttaga taggaaggat tgtcgagatg aatcactttc tgaagagaaa 600 cataagaagg tcaaattcat tattacaact tggatactta acaaatgtga cggtgtgaat 660 ggagaacgag ctcaagataa acttactcaa agagacccta atactaagga catgatcata 720 gtatacaacc cttttgtgtt atggaatcat ctcaaagttt atcattctaa cgtatcagaa 780 gctaaactca agaacattga aatctctttg cataacttaa ctcaacttcg tactgacagt 840 ctcaaggtcc atatcgacaa gttttcatct ttgttacgag aatttaataa ttttaaagga 900 gaaatgtctg atactcaaga agctcgcacg ttaatcaaaa gtcttaagcc tggttatgag 960 gtagccattc aaattatcta ttgaatcata aaaccactta ctctagaagg tgtaatacgc 1020 aaattattag aacatgaaga tgaacagacc gtttcacctt tatcacatca tgtcaaccac 1080 tactcagctc aacctagtac cttagtcaag tgtaccgcgg aacgatgtgt tggtagtact 1140 tttcaaaacc ctcataaacc cgaacagtgc tttaaactac cttctaattt tggtaaacga 1200 gacgagtgga tggctaagca agatgagaat tagaagaaat tcaaacgtcg taactttcat 1260 tcacaagctc cctatactga cttgtcatct agcatcagag gagttaagat tgtacgacct 1320 cctgtggcga ctgccaatca cgccatgact tttttaattg actttcaacc gattgatgtc 1380 caacctccgc ctgttcgtat cactaatacg tccttagagg tcatcagaac tattgatgtg 1440 tcatctggat ccattcaaac accaatcgat ccggaagtgg tttatctacg gtcttctgaa 1500 gtagccgaga cccaacatcc ttctgcacag ggatccgaat tggttgattt tggcaacact 1560 tcttctatcg ctcctgctga aggcatctgg gctttgaatg atacgggtgc ttcgcaccat 1620 atgttcaatg acattaaact ttttgatgct tcgacaatca aacctctcga agatacaaat 1680 aagcgtcttc gactagccgg aggagatgcc tccttggctg ttcattccac tggctccgtg 1740 aaacttcggg ctggtgacgg cacggtgttt actctggccg attgtcttta cgtacctgaa 1800 cttagttgca atctcatcgc tggtggtgcc ttactgaaga aaggcgtggt tacgcgcatc 1860 aacccatcag agcccgaatg ctttagctta gtcatgggac aatgtgcttt gtttaatggc 1920 gcgttcacct gtaatctcat gctcgtatct cttgaacctg tgagttcaat tttatccaat 1980 caacctcaac aacaaccaaa ttctcatgct tgtcatctgc aacaccaacg cttaggtcat 2040 gtgagtcaga ggtatatcaa tgagatggtt aagaaagaga gtgtggaggg gttatgtgaa 2100 aattcatcta tggtcaaatc atgtcctatt tgtcttcaat ctaaggctaa gagacttcca 2160 ttttccggta ctcggcctcg tgcatctatt tttcttcaga 
attttcatgt ggaccttagt 2220 ggtattaatt gtaccttagg attgcacaat gaagcgtatt ttattttgtt taccgacaat 2280 tattcttcat ttcgacatat ttttcaccta tgcaataaat ctaaagaaac cgtgtttaat 2340 gctttcttga aatacataac tcttgttgag cgacaaacta gttgcaaact cattcaattc 2400 actatggatc aaggcagcga atttgtcaac agcatcatga gagagtattg tgaagagacg 2460 ggtattgccc ttcatttaac tgctgcctat actccggagc agaacggagt atctgaatgt 2520 ggtatgcaga ctatcatagg aagagccaga tcaatgatga ttcaatcagg tgttcctcaa 2580 ttatttcggt atgaggcgtg ttccaccgca gtttttatca ccaatcgaac tatcaccact 2640 gctctcccta acggtaagac accatacaaa gtttggtggt tttgcaagcc ttccgtcaaa 2700 catctccgta tctttggatg tcaagctcac ctactcattc gaaaagaact tcggctgggc 2760 aagtattccc ctgttacgag tgagggagtg ctagtaggtt ttcaagatga taactttaac 2820 tactacgtat atgacctgga ttctaggaaa atcatactat ctcacaatgt tactttccaa 2880 gaagatgtgt tcccctacaa ggtttctgtc tcctctaatg ttaaatccgg aagctcgcct 2940 ctagaatttg ttgacgacac tcctcgtcga ctaccagtgg agtcggcaaa cccttttcaa 3000 cctttaacta cgcttgaaga agatgatgac gaatctctgt tgttggatca atctcacgat 3060 cgtgattctg ttactgttcc ctctgctatt tcaccgttat ctcctgcgcc tgcttctcca 3120 acacctgcac ctactattcg tcgatcacta cgtgttaaag aatcgacgag ctataaggga 3180 atggggggat gcgctatttt cgatgatcat acttttactt ctgcatttaa catcttgaat 3240 gaagaattct catcttcttt gattgacata ccagtgccga aatccatcaa ggtcgctctc 3300 aaaggtcctg aaaaggattt gtggtctcaa gcctgcgaca aagaatacaa ttcactgatg 3360 tctaagggcg tatgggtgct agtggatccg tctcccaaca ctaacatttt acggggtttt 3420 tggctatttc gaaagaagtt taacattgac ggtactgtgt ccaagcataa ggctcgcttt 3480 gttgtcatgg gtaatatgcc acaggcgggt gtagattttc atgagacatt ttcacccact 3540 gggaaaccgt cttctttacg tcttatgata gctatagctg cgactcaagg ttgggatgtt 3600 catcaaatgg acgctgttac agcttttctc aatggtgttt tagacgaaga tgtctacatg 3660 tatcaaccca agggttatgt tgtagctggg caagaaggca aggtctgtca cttaagaaag 3720 tctttatatg gtctcaggca atcacctaag atttggtacg atgaggttgt atcatttctt 3780 ttgagtgtgg gtttctctca atgcactatg gaccattgca tttatactcg tagcatcaag 3840 tcaacttctt ctacccactc ttccttcact gctgtgtatg tgcacgtgga 
cgaccttgct 3900 atcaccggca acgatatttc tactttcaaa tctcagatta gtctcaagtg ggagatggag 3960 gaccttggca ttgctagtac ggtggtcgga attcaaatca ttaggcaatc tcaactgtgc 4020 tacactctca ctcaggcggc ccttgttcgt gctgttctcg aacgctttga catgagtaat 4080 ctcaagagtg catcaacgcc ttttcctgct ggactgaaat tgtattgtgg aagtgatgat 4140 gaggtcgagg ctttcaaaca cgagggtcta aactatagta gtgctgttgg ttctttaatg 4200 tacttggctc aatgtactag accggatctt tcctatgcgg ttggtgtgct gtctcaacat 4260 ctcaagtctc ctgccagaaa acactgggat gcggtgatac atgtatttcg ttaccttagt 4320 ggaacaataa actcgggtat ctcgtacaac ggagatcttg ctgcacctac tactattgaa 4380 ggacttcgaa gtttcaatcg tcctgattct cattgcgact cagattgggc cggcgacaaa 4440 gaatctcgac gatccaccac agggtacatc tttcaactgg ctggaggtcc tatttcctgg 4500 aaatcccggc tacaaccgac tgttgctctt tcttcaaccg aagccgaata tcgtgctata 4560 accaaagccg gtcaggaact catctggtta agacacatgt tggaagtttt tggttttaaa 4620 gatcctaacc ccactgtatt acgcagcgat aacatgggcg ctatccacct taccaccaag 4680 tccatctttc acgcgcgtac caaacacatc gagatccaat accattggat tcgcgacact 4740 gcaaatataa tgtatttttc cgaggtgaaa aatgattaca tttatttcac aaaatgatta 4800 aaaatgcatt aattctgcac taattctgca tcaattctgc atttttctgc attaattctg 4860 cattattaat cattttgtgc attaatgtca atcatttttc accttggaaa aatacattat 4920 tttcacagtg tgaagtggtt aagcagggtg cgctcgttat aaaacacgta cctacggcgg 4980 aaatga 4986 // ID Gypsy-83_MLP-I repbase; DNA; FNG; 5947 BP. XX AC AECX01001005; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-83_MLP_; KW Gypsy-83_MLP-LTR; Gypsy-83_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5947 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). 
XX DR Genome; AECX01001005; Positions 31388 37334. XX CC 'CTCCA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 340..1404 FT /product="Gypsy-83_MLP-I_1p" FT /translation="MEELQRQLTELTNSVQEERRLRLEAEVRTLNAEARLA FT AIEATQATDAANQTPQTPAQTSEPASETAAQPAMPAATPRGPKVSVPDKFN FT GTRGGPAEFFASQIQLYMLAHPYLFTNDRSKVAFALSYLTGAASSWAQPLT FT LELFDANTSHTVTFDRFVTNFKAMYFDSEKKSKAERAIRALTQKTSVAAYT FT HEFNIHVSATGWETSTLVSQYEQGLKREIRVAMVMMQEEFSSIEQIANMAI FT KIDSKIHGVANSSTVTFHAPDPNAMDISSGFVWLSEEEKARRMRSGSCFKC FT NQQGHRANECSSRRTDKGKDRGGYKARIAELEFKLSALGSKSDGDVITTTS FT CAESSKNGGAQA" FT CDS 1980..5462 FT /product="Gypsy-83_MLP-I_2p" FT /translation="MHQPPQCEYDMTIPQMFTETASKLYTPQNSQIPTTPR FT TLPTTTTLNNPIHVIKNLQQSNPQIEETQEHAAADSSASSTPHTTPFGHKV FT EPTRDARHSDEGMCTISDTLASPQCESAPAHNPLPVEAAGKPVSPPISQID FT RIRTSNIVNCPNDPSGPSSLSYAAVATASSNPKKSLFKPGLEPQGHARKCD FT KGAAINIDVHQPPQCEFNHGLIPHAIETAGQPLHFQNRPLIIDTAKTSWST FT LARIAADEKAKLPAKTVEEMVPTYYHRHLHLFQKSKAQCLPPRRKYDFRVD FT LVPGAQPQAGRIIPLSPAEEAVLDEMIKTGLDNGTIRRTTSPWAAPVLFTG FT KKDGNLRPCFDYRRLNALTVKNRYPLPLTMDLVDSLLDADRFTKLDMRNAY FT GNLRVKEESEDILAFICKQGQFAPLTMPFGPTGAPGYFQFFIQDIMVGRIG FT KDTAAYLDDTMIYTKKGVRHESAVDSVLDIFNKHQLWLKPEKCEFSRSEVE FT YLGLLISKNKVKMDPAKVKAVKDWPAPKNTNKLQRFIGFANFYRRFIDQFS FT KTTQPLHDLTKLNKPYVWNNACETSFEKLKTAFTSAPILKITDPYRPFILE FT CDCSDFALGAILSQRSDDNGEIHPVAYLSRSLVQAERNYKIFDKELLAIVA FT SFKEWRHYLECNPNRLEVIVYTDHRNLETFMTTKQLTCRQARWAETPGCFD FT FVIKFRPGRKSDKPDALSRRPDLKPSEAERLTFGQLLKPENIGPDTFTTDS FT ANIDSFFLDESIDLEDAEKWFEIDVMGCSDIEAKPEETDGVLSDIKILELI FT REANPTDKQITTLMESVTNPISANMKKLTWMYEVKDGILYRHGKVEVPNDD FT NVKFNILRSRHDTLLAGHPGRNKTLSLTKRCFTWPSQKAYVNRYVDGCDSC FT LRIKSSTQKPFGTLQPLPIPAGPWTDISYDLITKLPISNGCDSILTVVDRL FT TKMAHFIPCKETMKAEELADLMINNVWKLHGTPKSIVSDRGSIFVSKITQQ FT LDRRLGIKLHPSTAYHPQTDGQSEIVNKEIEAYLRHFVDYRQENWEALLPT FT AEFVYNNRDHESIGVSPFKANYGFNPMFNLIPSSEQCIPLVEERLKLIQDV FT QKELTVCLELAQESMKHQFDKHVRQNPKWKEGDEVWLDSKNITTTRPSPKS FT GH" XX SQ Sequence 5947 BP; 1857 A; 1470 C; 1294 G; 1326 T; 0 other; 
tattgtcgga tccctatcga cggactgagg acataaattg caagaatcaa aaagaagaac 60 gaacgtttag aagaaagaaa aaacaagatc agaatcagaa ctaagattag aattagaaag 120 aaattcaaga tttagactgc atatcattgc cgcaagatcc ccccgcaacc ttatttgctt 180 acttaattga acacctgcag cacgtcgccc acatttagat cacccagtca cggggaagac 240 gattccaaac ccgaaacgcc cactccgttc gtagacgccg acgaatttac cggaacactc 300 atcatcactg aaccatctag ccgcgttaac cccgatctga tggaagaatt acaacgtcaa 360 ctgactgaac tcacaaattc ggtccaggaa gaacgtcgac tacgccttga ggctgaagtt 420 cgaacgttga atgccgaggc caggctcgct gcgattgagg caacccaggc cactgacgca 480 gccaaccaga cacctcaaac ccctgctcag acctccgaac ctgcttctga gactgccgcg 540 caacctgcga tgcccgctgc tacccctaga ggaccaaagg tctctgtccc ggataaattt 600 aatgggacta ggggcgggcc ggctgaattt tttgccagcc agattcagct ttatatgcta 660 gcccaccctt atttgtttac caacgaccgt agcaaagtcg cgtttgctct ttcctactta 720 accggtgccg cgagcagctg ggcccagccc ctcacccttg aactttttga cgccaacacg 780 agccacaccg tcaccttcga tcgcttcgtg accaacttta aagcaatgta cttcgactcc 840 gaaaagaagt cgaaggcaga gcgagccatc agagcgctta cgcagaagac gtctgtagca 900 gcctatactc acgagtttaa tattcacgtg tcagccacgg gttgggaaac ctctacactt 960 gtcagtcaat acgaacaagg attgaaaaga gaaatccggg tggctatggt tatgatgcaa 1020 gaagagttca gtagcatcga acagattgca aacatggcta ttaagattga cagcaagata 1080 cacggtgtag ccaactcgtc cacagtcaca tttcacgcac ccgaccctaa tgctatggac 1140 atatcttcag gttttgtttg gttgtcagaa gaggagaagg ccaggcgcat gagatccggt 1200 tcatgcttta aatgtaatca acagggtcat cgggcgaatg aatgctctag tagaagaact 1260 gataaaggaa aggatagagg aggttataaa gcccgtatcg cggaattaga atttaaacta 1320 agtgcgttag gaagtaaaag cgatggagat gtgattacta caactagttg tgctgaatcg 1380 tcaaaaaatg gcggtgctca ggcctgagcg atgtgcctat cctgagcaat tgtggggtaa 1440 tagctgaaat tgaacttgat gcaagtagaa tcttaacgtg caatgcaaac gatccacgtg 1500 ttttcctcca atgttcatta tcaacatcca attacccacg cgccacatca accaatccct 1560 tatttcctaa ctttcttata gactcaggtg ccacgcacga cgtgctgagt gagacttttg 1620 cccaaaagac tggtcttatc aaccacgcag tgtgtgcaac acaagtcgtg acaggattcg 1680 acggttcaag aagccacgcc 
tctttcgaaa ccaacctatt tctggaccat gacaccgaac 1740 cgacccattt tatcatcacg cgtatcaaag actcatatga cggaattctc ggcattccat 1800 ggatcaagaa gaaccatcga cgcataaatt ggcagacagg tacagtcagc tcgtatcacc 1860 acgacattgc gactgcatct gcagtttcgt caactccgca accaccctcg caagcccaag 1920 gtctggagcc caggagggac gctaggaact ttgacgaggg ggccgctgtc acaattgata 1980 tgcatcagcc cccgcaatgt gagtacgata tgaccattcc ccagatgttc accgaaacag 2040 ctagcaagct ttatactccc caaaattcac agatcccaac gacccctcga acgctaccga 2100 caacaactac cctcaacaac cccattcatg ttatcaaaaa ccttcaacag tccaacccac 2160 agattgaaga gactcaggag cacgctgcag ctgattcatc agcttcgtct acgccgcaca 2220 caaccccgtt tggtcacaaa gttgagccca cgagggacgc taggcacagt gacgagggga 2280 tgtgtaccat ttctgataca ttagcatccc cgcagtgtga gtccgcccca gcccataatc 2340 ccttacccgt tgaagcagct ggcaagccag tatctcctcc tatatcacag attgaccgta 2400 ttcgaacatc gaacatcgtg aattgtccaa atgacccgag tggcccttca agtttgtcct 2460 atgcagctgt tgcgacagct tcgtccaatc cgaaaaaatc ccttttcaag ccaggattgg 2520 agccgcaagg gcacgctagg aaatgtgaca agggggccgc tatcaatatt gatgtgcatc 2580 agcccccgca atgtgagttc aatcacggtt taatccccca tgccatagaa acagctggcc 2640 agcccttaca tttccaaaac aggcccctta tcatcgatac ggctaaaacc tcgtggtcga 2700 ctttggccag aatagctgca gatgagaagg caaaactacc tgccaagacc gttgaggaaa 2760 tggtaccaac gtattaccat cgtcacctcc atctctttca gaagtcaaag gcccaatgcc 2820 tacctcctag acgcaagtat gatttccggg tggatttggt gccaggtgca caacctcaag 2880 ccggtaggat cattccccta tcaccggcag aggaggcggt cctagacgaa atgatcaaga 2940 caggtttaga caacggcacc atcaggagga cgacatcccc atgggccgcg ccagtgcttt 3000 tcaccgggaa aaaagacggc aatctacgcc catgttttga ctaccgaaga ctgaatgctt 3060 tgacggtgaa gaaccgatat ccactgcctc tgacgatgga cctagttgat agcctgctag 3120 atgcggatcg gttcaccaaa ctagatatgc gcaatgcata tggtaaccta cgcgtcaagg 3180 aagaatccga agacatactg gcattcatct gcaaacaagg acagtttgcg cccttaacga 3240 tgccgttcgg acctacagga gcccctggtt actttcagtt tttcattcaa gacatcatgg 3300 taggacgaat aggcaaggac actgctgcgt atctcgatga tacgatgatc tatacgaaga 3360 agggagttcg ccacgaaagc gcggttgaca 
gcgtgttgga catattcaac aaacatcagc 3420 tttggctcaa acccgaaaaa tgcgagttct caagatctga agtcgaatac cttggcctct 3480 taatttcaaa gaacaaagtc aagatggacc ccgcaaaagt gaaagcggtg aaggattggc 3540 cagctcctaa gaacacaaac aaactccaac ggttcattgg ctttgctaac ttttacagga 3600 ggtttataga ccaattctct aagacaacac aaccacttca tgacttgaca aagctcaaca 3660 aaccttatgt ttggaataac gcctgcgaaa cctcttttga aaaactcaag actgcattca 3720 cgtcagcacc cattctgaaa atcacggatc cctacaggcc cttcatacta gaatgcgatt 3780 gctctgattt cgcgctaggc gctatccttt cacagagatc ggatgacaat ggcgaaattc 3840 acccagtggc ttatctatca cggtcactgg tacaggcaga aagaaattac aaaatctttg 3900 acaaagagct attggcaatt gtagcatctt tcaaggaatg gcgccactac cttgagtgca 3960 acccaaaccg actcgaagtg atagtgtaca cggatcaccg aaatttggaa accttcatga 4020 ctaccaaaca actcacttgc cgacaagcaa gatgggcaga aacacctggc tgctttgact 4080 tcgttattaa atttcgacca ggacggaaat ctgataaacc cgatgcacta tctcggaggc 4140 ccgatcttaa gccttcagag gctgaaagac taacatttgg acaactactg aaacccgaaa 4200 acattggacc tgatacgttt accaccgatt cagctaacat tgactcattc ttcttagacg 4260 aatcaattga cctagaagat gccgagaagt ggttcgaaat tgacgtgatg ggatgctcag 4320 acatcgaagc caagccagag gagaccgacg gagtgctgag tgacatcaag attttagaat 4380 tgatacgaga agccaaccca acggacaaac aaattacaac gctcatggaa tcagtcacaa 4440 atcccatatc tgccaatatg aagaaactca catggatgta cgaagtgaag gatggaatct 4500 tatacagaca cggcaaagta gaagtaccaa atgatgacaa cgtcaagttc aacatattac 4560 gaagccgaca cgatacactc ctagcaggac acccaggacg gaataaaacg ctcagtctca 4620 caaaacggtg ttttacctgg ccttcacaga aagcttacgt caaccggtac gtagatggat 4680 gtgattcctg tttacgaatc aaatcttcga cacagaaacc attcggaacc ttacaaccac 4740 taccgatacc ggctgggcct tggaccgaca tatcatatga cttgatcaca aaactgccaa 4800 tctccaacgg atgcgatagt atactcactg tggtggatcg acttactaaa atggctcatt 4860 ttatcccctg taaagaaacc atgaaggcag aagaattagc ggatttgatg atcaacaacg 4920 tgtggaagtt acacggtaca ccaaagagta tagtatcaga tagaggcagc atcttcgtat 4980 ccaaaatcac tcaacaattg gataggagat tgggtattaa actacatcct tcaacagcgt 5040 accaccctca aactgatggt caatcggaga tagtgaataa 
agaaattgaa gcttacttac 5100 gacattttgt tgactacaga caggaaaatt gggaagcact gcttccaacg gcagaatttg 5160 tgtacaacaa ccgggaccac gagtcaatag gagtatcgcc ttttaaggcc aactacggat 5220 tcaacccaat gtttaaccta atcccgtcgt cagaacaatg catcccttta gtagaagaac 5280 gactaaaact gattcaggac gttcaaaaag aactgacagt ttgtttagaa ttagcacagg 5340 aatcaatgaa gcaccaattt gacaagcatg tacgacaaaa cccaaaatgg aaagagggag 5400 atgaagtttg gttagattca aaaaatatta caaccacgag accaagcccc aaatcaggac 5460 attgatggct tggtcctttc aatatcagta aagtaatctc aagcactgct tacgcgctaa 5520 accttccact ctctatgaag gggatacata atacttttca cgtttcactt ttacgcaagc 5580 acaatccaga tacaatcgaa caacgggtac aggacgagag accagcaatt gaaattgaag 5640 gtgaagacga atgggaagtc tcagcaatat tggattgtag agtaagaaga aacagaaaag 5700 agtatttagt gaattggaca ggatttaatt caagccacga ctcgtgagaa ccagaatcaa 5760 acctcaagaa ttgtaaagat ttattaaaag aattcaagaa gcgatttcct aacactgaaa 5820 agaagaaaaa ggcacggaga agaatgtgag agggctaagc tttttcccaa gtggtttttt 5880 aacgctgccc ggggggggga gattgcagag cttgcaaaag ggagcttggg cattaaaggc 5940 gggataa 5947 // ID Gypsy-47_MLP-LTR repbase; DNA; FNG; 652 BP. XX AC AECX01001226; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-47_MLP_; KW Gypsy-47_MLP-I; Gypsy-47_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-652 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001226; Positions 30420 31071. 
XX SQ Sequence 652 BP; 192 A; 157 C; 137 G; 166 T; 0 other; tgtagcaggg ttgcagtaca acccggtaca aagaccccat gagtcacggg tctgtctgac 60 gcccgggaca agcagtatga cctagacatc gacgagtcgt cgatggagtc atacagacag 120 atgcaggcat agcgaagtac cggaatgctg agctacaaga gtagtacagc atctcaagat 180 acttactcag atagagatgc gagaacaaac caagactaag gtcgctcttg cctttggaga 240 ttcacagatg gagtcatgcc gctaccatac ctagagatgc tagtagggtt cccaaagtcc 300 tgggatctcg gcaaagcact ccaagcaagg aacgaatggc taagttcgat gaattcttca 360 tcacgccatt tgcatactaa gaatgagggt atagaccctc acttaggaag ctaaccacga 420 acctcaccag ggagtatctt ttcacttgac tcattagatg tataaataga gtcttcatgt 480 tcaatgcaat tcaggaccca gttccaacag ttcagaatta ctcttgtttg gaatactcgt 540 gccttatcca attgttcatc tttctgattc attatcacaa tcaacaatta tccttaagcc 600 actcaattcc atccctagtt gttcgtgagg tttctcccta agtcacttaa ca 652 // ID Gypsy-10_LBS-LTR repbase; DNA; FNG; 583 BP. XX AC ABFE01000310; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_LBS_; KW Gypsy-10_LBS-I; Gypsy-10_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-583 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000310; Positions 2908 3490. 
XX SQ Sequence 583 BP; 151 A; 162 C; 116 G; 154 T; 0 other; tgtaacaggg cacgaacacc taagcctaca gcttctctca ttcccttcct tacgatattt 60 cctatccaac gctacctatc catatgccaa ttcagggagt cctgtcgtca cgcaatcaca 120 gtagtcacat tcgagtacat atcactttgt cacatatgta aacaaatcat ctaactttac 180 catcatctct gttctgactc gcgcacttga gcatcgtcct cacgactcaa tcacgggtat 240 acaaggggtt tcggaagtaa ggtcagtaga attgttgata ccgtactgtg gagagctcgt 300 gtctctcctt gcgcactgat tgtgtttgca catctttccc gagggaaatc tcctagcgcc 360 tttctgcggc attccgagaa gggagagttc gtctacgctc gtggcattcc agagcaagta 420 gcaactcgtg taaagagatt gaccttgatt aactcctaga tactcgaaca agggtctcac 480 accataacgc aaacctatcc atacactaca gctcgccagg atcgcttcgt tgatagatac 540 acggaccgcg tcatcatacg caatccctga agaggttccc gca 583 // ID Gypsy-87_MLP-LTR repbase; DNA; FNG; 533 BP. XX AC AECX01000135; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-87_MLP_; KW Gypsy-87_MLP-I; Gypsy-87_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-533 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000135; Positions 47464 47996. 
XX SQ Sequence 533 BP; 156 A; 68 C; 80 G; 229 T; 0 other; tgtaatgatt tgtaatattg taaagaaata ttcaaatcta ctaggagata ttatcaaatg 60 taaaagagat ggaatgattg gaaatggttg ttatagcgaa tatggttgtt aggataagat 120 gttacaaact gtttatgtat atatatgttg ttagtttaca attgttttat tcttttctta 180 tcgaattatt aaagtagctt tatcactctt aaaaactaga acaaaactcg tgataagagc 240 aatgctccac gagaccctag acaagagccc ttacagtata attcaatcat tacaaataat 300 ttttagctat tttttgcgtg ttattttctt ttttttcata atttttctta ctcttttctc 360 gtgtttattt ttcttatatt cagatatatc aattagagag ttagttatat tcttttatct 420 gttaatcatc ttatcctgtt tatatgtgtc tccgtggagg attttccgta tatacttgtt 480 attactttta gatatagggt atcccctttt cgtttggact tgttattacg tca 533 // ID TSK1_LTR repbase; DNA; FNG; 322 BP. XX AC AF492702; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Saccharomyces kluyveri retrotransposon TSK1_LTR, long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Long terminal repeat; RNaseH; TSK1_LTR; gag; integrase; pol; KW protease; retrotransposon; reverse transcriptase. XX OS Lachancea kluyveri OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Lachancea. XX RN [1] RP 1-322 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AF492702; Positions 1 322. XX SQ Sequence 322 BP; 123 A; 53 C; 55 G; 91 T; 0 other; tgttggaacg aaatacaact atcgaccatc gactagtatt cgtgttacta gtatattatc 60 acacacggtg ttataaggtg acataaagaa tgagaaacag tcatctaaat tagtggaagc 120 tgaaacgcaa ggattgataa tgtaatagga taatgaacca gaaacatata aaaggaagaa 180 tatttgtata tagagttatc gactcccttt tctggattcc tagaaagatg aggagaactt 240 ctagtataac catataccaa ttattatagc cttaatcaaa aatggaatca caacaattgt 300 cccagtattc acccatttct ca 322 // ID Gypsy-65_MLP-LTR repbase; DNA; FNG; 647 BP. 
XX AC AECX01002583; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-65_MLP_; KW Gypsy-65_MLP-I; Gypsy-65_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-647 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002583; Positions 15911 15265. XX SQ Sequence 647 BP; 186 A; 158 C; 138 G; 165 T; 0 other; tgtagcaggg ctacagtatg gctcagaaga ggttcctgta catacagctg tactgaatac 60 gtacaggaag taatactagg gtatcactga tcaactcgtg ctgagccaga catagaactt 120 gtctacccaa gatgaatgca gggaatgaca agactccact cgaaggctag agcctcgagt 180 ggattagctt agagatgact ctaacaatga gatcgcttag aagtgaagag acgctcgtcc 240 taagacaagc gtctcaagaa tagctgaagg gatcatcttc aataaacgat gatgcacctt 300 tctggctaaa ggcctcaagc aaggaggaca agctcagata tacaagggta gagtgtcggc 360 cgaacgggtc cccataccag ctaactagtg ggaaccccat tcttgaccta ctgcatttct 420 cttacttccg aagtctctgc tctatgttgt tgcgtataaa tagagccgct ccttctttgc 480 aattaagaac cagttctcta agaattatct tttcgagaat accttcgtgc cctctgtccc 540 ccatcagaca tcgtccattc cctaagtctt ccctatctaa actcactacg taagtctaag 600 tagatttaat acaatccgta tacgaggcgc agtttcgccc cgttaca 647 // ID TY1C repbase; DNA; FNG; 114 BP. XX AC M25056; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 11-NOV-1996 (Rel. 1.1, Last updated, Version 1) XX DE S.cerevisiae Ty1 transposable element D15 DNA, segment 2. XX KW Copia; LTR Retrotransposon; Transposable Element; TY1C; KW Ty1 transposon; mobile element. XX OS Saccharomyces cerevisiae OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. 
XX RN [1] RP 1-114 RA Eibel H., Gafner J., Stotz A. and Philippsen P.; RT "Characterization of the yeast mobile element Ty1."; RL Cold Spring Harb. Symp. Quant. Biol 45, 609-617 (1981). XX DR GenBank; M25056; Positions 1 114. XX SQ Sequence 114 BP; 39 A; 12 C; 16 G; 47 T; 0 other; attttactgt atacttcatt aatatttggg aaatagacat attttgagat tattttttag 60 atcctgttta cgtaaaagac aataatatag gtgtgtaaat tataccctaa gttc 114 // ID DIRS-2_MLP-I repbase; DNA; FNG; 19097 BP. XX AC . XX DT 22-APR-2011 (Rel. 16.04, Created) DT 24-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW DIRS; LTR Retrotransposon; Transposable Element; Copia; KW Copia-66_MLP_; Copia-66_MLP-LTR; Copia-66_MLP-I; DIRS-2_MLP-I. XX NM Copia-66_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-19097 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX RN [2] RP 1-19097 RA Kojima K.K. and Jurka J.; RT "DIRS-type retrotransposons from fungi."; RL Direct Submission to Repbase Update (24-MAY-2011). XX DR [1] (Consensus) XX CC Positions [2947-3453] - Integrase core CC LTRs are 100% similar to each other. CC Contains an insertion of DNA3-2_MLP (masked by x). CC [2] Re-classification as DIRS based on the presence of tyrosine CC recombinase. 
XX FH Key Location/Qualifiers FT CDS 542..1621 FT /product="DIRS-2_MLP-I_3p" FT /translation="MSQPDISTDRLKELNRKHDDVHKFTTGIEHLALDGSN FT IARWKLRTARAVYEMTDVSSYWTSQRPDEELEIQMAIDRCAGRVINSTIHE FT DLKDLIADKSLAHNAMFTLDELFCQGGRTAQYVTFRSIMLRRYNPASTNLM FT TFINDVNKDFEKLKSVGFTWDEDIVKGMVFQLCAPTTGEYGMDIINATLDA FT QYQLDKKPFSSTEVRAAMQNLITTRRTVHESEGIQALSSSFAQMNTGNLNN FT RFDNQPRFTPPRYSTPSKPILMNFTGPTPPGPISALIKDRNKSEINPIKPG FT QAAAPKVLQGLFQCFHGGVFGHGTQRCPLWMKRINRTAPHFWDWRKTNLNR FT FYSIQALGLNPSSISAR" FT CDS 6440..7522 FT /product="DIRS-2_MLP-I_4p" FT /translation="MIQEAFDRDDIEGARKLTEKLHREEARQSMPQQTQQR FT RNAYVFDEESSNDPNEPRTGQRDRERAMHEPAQRTPSVTFLREGRQVVEKE FT ARSGANGEGEEGWFHGSKSEEESNKAKSIMAAKKGSLTLSNGDMIHNGQVF FT RCGSAKMEDALTPLSPHLTTKLKALKSFIPLPVFDEDFLMKDQKAWSLLPP FT KSEDKVEKEKRMYGGEGPVEELTMNFEAWSDCMELFCVHLRAADWEPVAEK FT FEHHMNIVKKIRKDFGWMVALRYCRLVRQGVMRETIDGSLGNWAELQDHLL FT TQAKTKAESFNERAYKTNPYAEGGPKASICPYTNQPKPSSLKHVASTTAQF FT ASNQASQAVNRHASTSSY" FT CDS 12387..11314 FT /product="DIRS-2_MLP-I_5p" FT /note="tyrosine recombinase." 
FT /translation="MPREVKRKKLTKSTKTYEVPGHFGRAVKQSTIPGAHL FT AVGSWAKSTLSGYSSGLSKYITFIESKGMTFDEKKPISNDDIYGFITWAGR FT TKINLPNPPKKPITAKTIDKYLDGINAWHIVKHLQKISIDPEIVKMLLRAT FT RREEEGEVLESAKKPVKVKHLYRLLSESYGKSEVQTLVADVGLIVFWGMLR FT LGKIFRTEKKDGAILNHHVEFGMSNKGEWAKIHLRKGKTASPNEIQVIHLQ FT AQPSVLDPVRALKRILKANPMKEREALVFQTTKGPLTKRVFINAVEKVWGK FT SMLERWSGHSFRVGGASLRANLGTPEKVIQKVGRWKSEAFRRYIKSYSLTE FT IEETKTFLDRIRLGS" FT CDS 8161..9765 FT /product="DIRS-2_MLP-I_2p" FT /translation="MNLEVWEKELYQVNLLSKHKYIIRMFTTGFTQGIKEA FT YVPNKDYYCPPNHSSAQLAREKIEENFKVELEAGRMFGPWTKQEVYKKIGF FT FRTNPLGAVVNRDGSFRAISDLSYPRHLVDVSSVNEGVNKKEFETSWDDFK FT QVAYFFTSYKGKLLLGIFDWAKAYRQIPVAENQWRFLMVLDFNDQVLLDTR FT VQFGGVAGCGTFGWAAEVWKELMVEKFKLLAGFRWVDNTLFVKPVEDTEAG FT SMLDIVSLSTEMGVTTNEKKWKEFTYVQKYLGFVWDGENKTVQLPREKVEE FT RLENISTYLKVERQSFKETEILVGRLNHSAYVFPQMKVYCSAIYRWMKEWK FT HKYALRDIPEEVVEDLAIWKHTLTVAEPLRTIPRPETRTVGWVGDASTSYG FT LGIIIGIYWARFKLKDRWDEEEPDGSKRGIAWAETVAIRLGWQMLKCLEDV FT RGRTYLALTDNTVSEGAINNGKSRDPLVNREWKRIQEDLLWHNSAIEAKRV FT KSADNAADALSRGLDSTKQKENMLLISLPADLKEFMYQT" FT CDS 14273..12669 FT /product="DIRS-2_MLP-I_4p" FT /translation="MNLSVWKQELRRYGIEVKYEHILKGFKHGFDQGIRDA FT EIKGMRYYCPEDHTSALKAKEKIEENIEAEIKRGRIFGPWSKEEAFEKLGF FT FRTNPLGAAVNGDGSFRPISNLSYPKNDPEIESVNFRVDKDDFETSWDDFK FT KVAYFFTMYKGRLVLGVFDWAKAYRQIPVAMNQWRFLMILGFNNQVYANTR FT VQFGGVGGCGTFGEAAEAWKEIMMIKFNLLQGFRWVDDTLFIKPADGSNQT FT TMKEIASASERLGVETNEKKWREFDVEQKYLGFVWNGENKTVRLPKEKIEL FT RINQIMKYLQADKQSFKQTEVMVGRLNHITFVFPQMKVYTAAIYRMMKNWG FT KKYALRKLSEEIVEDLNIWLDTLREANPLRVIPKPVVRNVGWVGDASTSYG FT IGIIIGKKWSQFKLKPNWHAREANGDRRGIAWAETVAVRLGLLMIEQLESV FT EGRNYLVLTDKTVTENAVVNGKSKDPLVNAEWKKIQERLIKNQYSITAKRV FT ISADNAADALSRGKVSKKKNEDKIVVDLPKDLNAFLRQM" FT CDS 17006..17689 FT /product="DIRS-2_MLP-I_6p" FT /note="tyrosine recombinase." 
FT /translation="DSLVRIFPCEVLWFDSYAEIGRGREGNLRVLLLYDQP FT RDCDMCLTCHLIVHSIVDTRTKPCLIWCTSDDGALLRRNVEFGESGGGKWV FT KLHLRKGKTAKPNEVQIIHLQEQPSVLDPVAAVRRIMEMNGSTDRDEFLFK FT TKKGLLKKKRFLTILSEIWGEAKIGTWTGHSFRVGGASIRANLGTSEKTLK FT KAGRWKSDSYKRYVKLFTTEELQKTEKFLDAINNNKVR" XX SQ Sequence 19097 BP; 5136 A; 3982 C; 4191 G; 5165 T; 623 other; actactgtgg aagccgattc tgggagtgag cactgacgtg ctatcgtact atctgctata 60 ggttatgagc tgcgctatcg ttgaaatcgc cttttaaatc ttgcaaagct cgtagtgtaa 120 gtctgaactg ccagtcagca tccggtcata tgtcagtagc tcgcactccc ccgaagaaat 180 tgtcacgtac cacggaaagt agtatccaaa cgcggtcgca gtcacaagct ttatcgtctc 240 gttcgaactc cgtcaattgt cacgtaccac ggaaagtagt atccaaacgc ggtcgcagtc 300 acaagcttta tcgtctcgtt cgaactccgt ctcgggatct caaaccggag gagaaacatc 360 gactagggaa aacacaccct tagccggtac tcattcatct ttgaatcatc ccttcactga 420 aaaccccccc acaaactatc tgaaccctgc tctactattg aacatcggac attgtcgtca 480 aaccattttt catcatactc aagaattgga tcatttatcg ttgaacgaag acactgcaag 540 aatgtcacaa ccggacatct ctacggatag actcaaagag ttaaatcgca agcacgatga 600 tgtgcataag tttactactg gaatcgaaca tctagctctt gatggatcca acatcgctcg 660 atggaagttg cgcaccgcaa gagcggtcta cgagatgacg gacgtttctt cctactggac 720 gtctcagagg ccagatgaag agttggagat tcagatggcg atcgatcgat gcgccggacg 780 agtcatcaac tcaacgattc acgaagacct aaaggacctg atcgccgata agtctctagc 840 tcacaatgcg atgttcactt tggatgaact cttttgccaa ggtggtagga ccgctcaata 900 cgttactttt cgctccatca tgctcagacg ctacaatcct gcttctacca acttaatgac 960 cttcatcaac gatgtgaata aagatttcga gaaactcaaa tcggttggtt ttacttggga 1020 tgaggatatc gtcaagggta tggtctttca actctgtgcc ccaactactg gggagtatgg 1080 tatggacatt ataaatgcga ctctcgacgc gcaataccag ctagacaaga aacccttttc 1140 ctctactgag gtacgagcag ccatgcaaaa tctcatcaca actcgtagaa cggtacatga 1200 gagtgaaggg attcaagcac tcagcagctc tttcgctcaa atgaacaccg gcaatttgaa 1260 taaccgattc gacaatcagc ctcgatttac tcctccgcgt tactcgactc ccagtaaacc 1320 aattttgatg aactttactg gacctacccc tccgggcccg atttcagctt tgatcaaaga 1380 cagaaacaaa tcggaaatca acccgatcaa gcccggacaa 
gcagcagctc ccaaagtact 1440 tcaaggcctt tttcaatgct ttcatggtgg cgtgttcggt catggaactc agcgttgtcc 1500 tctttggatg aaacgaatca atcgcacggc gcctcacttt tgggattgga gaaagacgaa 1560 tctcaataga ttttacagta tacaagcttt aggccttaat ccctcttcaa tctcggctcg 1620 ttaaattcaa gtagaacata ctccttcatc atatacgcca tctcaagtat cggcatcaac 1680 cgtagcttgg agtgcagaag gcaatcaacc tgactctgtc cccacagagt atttgttgga 1740 cagcggtgcc acgcatcatg tgagcaatgc gttgcctctt ctgttgaacc attcccctct 1800 tccccatcct atcagattga aaacagccgc tcaaggtgac aacgctctga ttgtaggaaa 1860 aggaaacctc cgagtgaagg cgacgaacgg atctgatgtc atcatcacag atgtctatta 1920 cagtcctcaa gctacaggaa tgcttatttc tcaagcagcg ctggtcaaaa atggagcaaa 1980 gttatggttc tggggaaatg atgttatttt gagaatgaga gatggtctat cagtagcagc 2040 ttcttattgt aatcggcgct gggttataaa tgcaatccaa actcatgtta tcaccaaaaa 2100 agaacctgag agagtaggct gtgtgaaagt gatgagcact cagcaagata ccgatttagc 2160 ctttctttgg cacagacgtt ttggtcacgt axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2220 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2280 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2340 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2400 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2460 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2520 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2580 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2640 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2700 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2760 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxgtagac 2820 atgaagagaa ttcgcaagct ttgcgctggc cagctaggcc taggcctgcc cttgtctatt 2880 ccatccactt ctttcacctg tgaagattgt ttggtatgca agagcacaag acagaggaag 2940 ttggggagat caggaagaga acacggactt ttagacatca tcgtatcaga cgtcatgggc 3000 cctttccctg aagatgtgaa tggtaaccgt tttacagtca ctttcagaga tgtagcaaca 3060 actttctcag acatagtcat cattaagaat aaatcagaag tacctgacac 
tttcactcgt 3120 gtaatcaaca aatgggaaag ggaaactggc tttaaagtta aacatgttag aactgacgga 3180 ggaggcgagt acataaagac aacattcaat tcttggctga aatcaaaagg catttttcac 3240 gaacactcga acccttatga accggaacaa aacggtgtgg cagagcgtct aaatcgaacc 3300 ataagtgata tgggtaggac gatgttggca gcatcacatc taccgcaatc attctggtct 3360 ttcgcatacg tggcggcatg ttaccttcat aacagacttc ctaactcgct gactggggac 3420 aaaacacctt ttgaactctt ctacggacgc aagcctcagt tagacatcat caggactttc 3480 ggatcgacag cttttgtaca ccgacatgaa gcgcagcgta gtggtaagct agacaattga 3540 gggagaaagt gcgtgatggt gggttactta gatggtggta aaggctggat cttctacgat 3600 gctggctcga aggtactact caagaccggc atcgcagtct ttccatatga agataagttc 3660 gtacatgaaa aatcttcatc ctcaccagaa gaacctactt catctaccac tttgaacaag 3720 ctactgaaca taccacctaa gaagggaagt ttagaattca ttgtcaatgc gctacaactt 3780 ggcaatttct cagacaaaat caaagtacac cacgaggacg cgcaagtaga agtcttaaca 3840 aactcagtga atctgtcggc tggactgaaa gaacctctat catataccgc agctatgaaa 3900 gacgaatacg ctaaggagtg gaaagatgct attgacgttg agctgaaagc catggacgac 3960 atgggagtgt ggaagattgt agactgcccc gccaacgtca acacaatcag cctcaagtgg 4020 gttttcaaac acaagcaact ggctgaggat ggaactccaa tcaaatttaa agctcgacta 4080 gtcgctcagg gattctcaca agtcgaagga atcaactatg acaaaacttt cgctcctacg 4140 gccagcatgg catctcttcg ttttgtctta attctcgcaa caagaatggg atgggttgtc 4200 cattcttttg acgtgacggc tgcatatctt catagtgaca tcatgaacaa actgtatttt 4260 agattgccac cgggatgtat ggaagcagcg aggaaggcta ataaggtttt ggaagcggta 4320 aaagcgttgt acggaactaa acagggtgcg aggtgttggt ggaagcatat tgacaagatc 4380 ttaatcaaac taggttttaa accatgtcag tttgatcaat ctctatatat ttgcaagcgg 4440 ggtaatgagg tgtgtattat ttgggtgcat gtagatgatg gggttgtgac gggtagcagt 4500 gtggagttgt tgagagagtt gtcgaaggcg ttgactgaag agttgaagat cagaaggcat 4560 ggatgtgaaa agattagagg atggtagttt tgaattatct caagctggtt taacgaacaa 4620 gatcgtcaga gacttcctag agactcccgg gaaggcaact acgcctttaa acgcatcaaa 4680 cttaccttca tcgccggaag aggacgaaat tgaagtcgac aaaacaagat acctttcggt 4740 cataggatgt ctcaattatc ttgcggtagc tactagacca gatctcacct acgctacgaa 
4800 tttcttggca agattctcgg ctagacctac ggctcaacac tgggcagcta ttcaaggagt 4860 gatgcgttat ctcaagacta ccggggttgt gaagttacat ctgaaaccgt tgatgagcga 4920 ggtgaagacg gggctacata cttatgttga tgcaaattgg ggaggggagt actcaaggtc 4980 cgtccatgga tacacgactt tctttctcgg atgcccgatt gcttggacgt cgaaaagaca 5040 aggttgcgta gcaacatcga catgtcatgc cgaatacatg gcattgggta cagcagcaag 5100 ggaagcggtg tggttgagga atttagtgga agatgtgata ggtgcaatgg ggccggttaa 5160 actgttctgc gataacactt cagctattca cgtatcgaag gacaattctt caaacaaaag 5220 aaccagacat actgataggg agttctatca aggctgccac tatactatac cagtatatta 5280 tactgttggt atagtatatt tcaagtatac tggaatacac ctcgaggcac gagttggtgg 5340 aaatgtccac tctcgtgaaa ggtctagatt aggagtgaca cttggcgtcc gagaacttct 5400 cggaagagat ttgtaattcg aaatctctct aagccattca gtcactcctc atctagacct 5460 ccacagcgct tagctcaagc ataaaagata aatcaaaata gttttaaaaa ttcaaattag 5520 aaaataagac gtttacaatg gatttaagaa atcaaataaa tcaagcaact ggaaatgtac 5580 aaagacgatt aactcgagct gaagcagcgg caactggagc gcagttggca gagaggttgc 5640 ctactcctag aagaggacaa ccagcaagga gagacgaaga aaggaacgaa gaagacgaaa 5700 gcgaggggag aaatgaggag gaatcacacg taccattgcg aagacaagca atagaagagg 5760 aagacgaaac agaggaatgg ggaggaatta aagaggtaga agacgacgag caagtcaacg 5820 acgatgatga gatactcgaa tttgagaggt taaagaatgg taagactaac agtgatatta 5880 ctgactttcc ttatccaatg tcctttcttt tctctttatt tccttttttc cttctatttc 5940 caaatttctt tttcttttct ttccccatct ttccccatct ttttcccttt ttattctcct 6000 tccttctttt ccccctttcc atcttttata tatgaggtat gaaaaaacag attaggaacg 6060 atgaacgaag aggttgaacg ggaaggacaa tcgaagcctg gcccggctac taataggtgc 6120 ccagcctaag aattggctta cagtagatat cttagcctgg cccggctgca aataggcgcc 6180 cagcctaaat cggctaacaa tagatgatta gcctagtctg gcaggagaat aagactcagc 6240 cgagacgggc ttacaatacg tgctatccgt tatcatgatc ctttttgatc ctccgctatc 6300 cgttatcatg atccgttgat ccaatttcca tcagtcgaat catccaacac cacatcgtaa 6360 tttcgtttgg aaacccaggt actgacacat cgaatggtta ttgtcatctg tagaagcgag 6420 gcgagaaacc ttagtggaga tgatccaaga ggcgtttgat cgagacgata tagagggagc 6480 
aaggaaactt accgagaaac tacatcgaga ggaagcgagg cagtcaatgc cacaacagac 6540 acaacaacgc cggaacgcct acgtcttcga cgaagagtca tcgaatgatc caaacgaacc 6600 tcgcactggc caaagagatc gcgaacgtgc gatgcacgaa ccagcgcaaa gaacaccaag 6660 tgttacgttc ttgagagaag ggaggcaagt cgtcgagaag gaagcgagaa gcggagctaa 6720 tggtgaaggc gaagaggggt ggttccatgg ttcaaaatcg gaggaagagt ctaacaaggc 6780 aaagtcaatc atggcggcca agaaaggatc attaactttg tcaaatggcg acatgatcca 6840 taatggacaa gtgtttagat gtggatcagc gaaaatggaa gacgcactga ctcctttgtc 6900 acctcacctc acgacaaagt taaaagcttt gaaatcattt atcccgttac cggtttttga 6960 tgaagacttc ttgatgaaag atcagaaagc atggtctctt ctaccaccaa agtcagaaga 7020 taaagttgaa aaggaaaaac ggatgtatgg gggggaagga ccggtggaag aactcacgat 7080 gaattttgaa gcatggagcg attgcatgga gctgttttgt gtacacttac gagcagctga 7140 ctgggaacca gttgcagaga agttcgaaca tcatatgaac atcgtgaaga agatacgcaa 7200 agacttcggt tggatggtgg ctttacgata ttgccgcttg gtccgtcaag gggtgatgag 7260 agaaaccatc gatggatctt taggcaactg ggcggagttg caagatcatt tgttaactca 7320 ggctaagaca aaggcagaaa gtttcaacga acgggcttac aaaactaatc cctatgcaga 7380 aggtggaccg aaggcgtcca tatgtccata tacaaaccaa ccgaaaccgt cgagcttaaa 7440 acacgtcgca agcacgacag cacaattcgc tagtaatcag gcaagtcagg cggtaaatcg 7500 acatgccagc acgagttcgt attgaggaaa tggaaactgg aaacgatacc agggaagtta 7560 tcgagatacc tatcgagaac ctgcacgtga gcaataccgg ggtggtgatc agcgggacac 7620 taggggtccg aagtggggag aagggaagcg aagagagcgg agcagaagcc cagagaagcg 7680 ctatggaaaa cgaggagacg ggaaatggaa acaatcgtga gccttgtagt tgaatacatt 7740 tgttgagaaa tgaaatgcat gccgtagtcg gcaattgcat gacaacgtcc caagcacatt 7800 agagtgcagt cttttccctt gtcccacctt tctctttctg agcttcatac atgaaagctg 7860 acgagaattt aaagcgtaca aagctttgac ttgacctaca acattgtcaa atcttcttat 7920 aaaaccctaa tttgacaact cagaataagg agtccctgac attaatttcg aacttttttg 7980 aataatttgg cctgttacga cagattagcc caagtagaaa gagtacgtcc tgacgatgag 8040 aaatcagaag ggccaagggt agacaaaata agacctgttt tagattcgtc gcagtcgtgg 8100 aaggagacag aggcacgaga cagacgcact agggtatggc caaatggccc gtctcataat 8160 atgaatttag 
aggtttggga gaaggagttg tatcaagtaa atctgctgtc aaagcacaaa 8220 tatataatca gaatgtttac aacgggtttc acacaaggca tcaaggaagc ttacgtaccg 8280 aataaagatt actactgtcc acccaaccac tcttcggcac aactagcaag agagaagata 8340 gaagagaact tcaaagttga actagaggca ggaagaatgt tcggcccatg gacgaaacaa 8400 gaagtctata agaaaatagg tttctttagg acgaatccgt tgggagccgt ggtaaacagg 8460 gacggatctt ttagagcgat tagcgatctg tcgtacccaa gacatttagt agatgtttcg 8520 tcagtaaatg aaggcgtcaa taagaaggaa tttgaaacat catgggatga tttcaaacag 8580 gtggcttact tttttacttc gtataaaggg aagctactat tagggatctt tgactgggcg 8640 aaagcctatc gtcagatacc agtcgcagaa aaccagtggc ggtttttgat ggtactagat 8700 tttaacgacc aagttctttt ggatacgaga gtgcagtttg gaggcgtagc tggttgcgga 8760 acgtttggat gggcggcaga agtttggaaa gagctgatgg tagagaagtt taagctattg 8820 gccggtttta gatgggtaga caatacattg tttgtgaaac ctgtagaaga cacagaggcg 8880 ggaagtatgt tagatattgt tagtttaagc acggagatgg gagtaacgac aaatgaaaaa 8940 aagtggaaag agttcacgta cgttcagaag tacttaggtt ttgtctggga tggtgaaaat 9000 aagacggtgc agttaccacg agaaaaagtt gaagaacggc tagaaaacat atccacttat 9060 ctgaaagtag aaagacagtc cttcaaagaa acagaaatat tagtaggaag gctgaatcat 9120 agtgcgtacg tatttccaca aatgaaggtg tattgttcag caatttacag gtggatgaaa 9180 gagtggaaac acaagtacgc cttacgagac atacctgaag aagtggtgga ggatttagcg 9240 atttggaagc ataccttaac agtcgctgaa cctttaagaa ccatccctag acccgaaaca 9300 aggacagtag gttgggtagg ggacgcatcg acgtcttacg gcctaggtat catcatcgga 9360 atatactggg cacgatttaa gctaaaagat cggtgggatg aagaagaacc tgatgggtcg 9420 aagcgcggga tcgcttgggc tgaaacagtt gcaatacggt taggctggca gatgttgaaa 9480 tgcctggaag atgtaagagg aagaacttac ctagctttaa ctgataatac agtatcggaa 9540 ggagcgataa ataatggaaa gtctagggat ccactggtaa atcgggagtg gaagcgaata 9600 caggaagact tgttatggca caatagtgca atcgaagcta agagagttaa gtctgcagac 9660 aacgcggcag atgcattgtc tagaggatta gattcgacaa agcagaagga gaatatgttg 9720 ctgattagct tgccggcaga cctgaaagaa ttcatgtacc aaacttgata agaaatagga 9780 aacatacctg tgacatcctt ttgaaccgtt tttgcagtca cacaacctag cgagaagagg 9840 taagctactt agtaagagag 
acgagaagat agcaggagtg agaggatcaa cttaccaaag 9900 gaaaacttgg gtatagtgaa aatggaggta agtgatcgcc agcagttgaa caacgaaagc 9960 tcgtgagata caatgactag caatagaggg gctccgttca tcgccttcga gcagctcgtg 10020 agataccatg agacttatgg taggtctcca aagggacttg aacgaaaggt actagataac 10080 ctaggggact ggtaaagagg aaagagaaat gtcagcgaca gaaccttcac atttgattga 10140 tagaaccacg tcccaggaaa gacaatacct aaatcgattt aataccttaa tcgatttact 10200 tcatagacat gccaccatat agaaacccaa ccaacgaatc aagaagacag tcagacttac 10260 caggtcactt tgaacaagca gcgagagacg cgacactagc cggagctcat atggcggcaa 10320 gtggttgggc taaatccacg ttaaaaggtt attcgtctgg actgtcgaag tatatcacct 10380 ttgtcgaatc aaccggaaag aaatttcgag atgacaagaa gatatcagca aatgaaatgt 10440 acaattttat agtgtgggca ggggaatcgg gagtcaccgt aaatgatcca ccaaagagaa 10500 aggtagcttt gaaaaccatt gataagtata ttgatggaat acaagcctgg cacttagtga 10560 aacatgtcgc taaaatcgaa ttagatgaag gaatcgttag gatcttattg agagcgacga 10620 agaaaaagga ggaaggtgtc ttgctcaact cagagaaaaa acctgtaact gtccagcaat 10680 tactcacgat gttggataga tgcacgggac gatcagaaga acacgaattc gtagcagcaa 10740 ccgcgctagt cgccttttgg ggtttaatgc ggttaggaga ggtgttccgg tttaaaaagt 10800 ccactctcac tgttacaatt taggtaacat ttatgtacaa gaaggggtga tatactaatc 10860 accatccctg cccttacctt gcctggcggt gaggtaagtg ctttatggga aagcgcgcgg 10920 agcgagcgcg agtttggttt tacccaacca aactcttggt ttgttgaaac caatcccttg 10980 gttttccaaa ccaatctggg agttggttga gatctggttt ggttttgcta accaaaggga 11040 tctcgcggct catttactat aaactggctc attccgatga aggggttgtc cgattcccct 11100 tatcccaagg atggtaaggt acagacctgc cgttcttggc cggttggaca accccttcta 11160 cggcatgtct gggactcaac tctcgagcat tcgagaagca tgttccgggg ttaatgcggt 11220 aataccgcca tttttgagat tcttctaacg aaattgaatt gccgtgtatg aaccttaaag 11280 gttgcatcaa cgttttcccg cctttgagtt tcaactaccc agtcttattc tgtctaaaaa 11340 tgttttcgtc tcctctattt ccgtgagaga ataacttttt atgtagcgtc tgaaggcctc 11400 agatttccac cgtccaactt tttgtatcac tttctctgga gttcctaaat ttgcccttaa 11460 cgaagctcct cccacccgga aagaatgtcc tgaccatctt tctaacattg attttcccca 11520 
gactttctcc accgcattga tgaagactct tttcgtcaag ggtcctttag tcgtctgaaa 11580 aaccaaggct tctcgctctt tcatcgggtt cgctttaagg attcgtttga gtgctctcac 11640 cgggtctaac accgaaggtt gagcttgaag gtggatgacc tgaatttcgt tcggtgaggc 11700 tgttttccct ttcctcaaat gaatctttgc ccattcgcct ttgtttgaca ttccgaattc 11760 tacgtggtgg tttaggattg caccgtcttt cttttccgtc ctgaagatct tgcctaacct 11820 caacattccc caaaacacaa tcaatccgac gtctgccact aaggtttgta cttccgattt 11880 tccataagat tcagacaata acctatatag atgtttgacc ttcactggtt tctttgccga 11940 ttccaagact tcgccctctt cttctcttcg tgtagctcga agtaacatct ttacaatctc 12000 tggatcgatt gaaattttct gtaaatgctt gactatgtgc catgcgttta tcccgtctaa 12060 gtatttgtcg attgtctttg ccgtgatagg cttctttggc ggattcggta agtttatctt 12120 tgttctaccg gcccaggtta tgaatccgta tatatcatcg ttactgattg gtttcttttc 12180 gtcgaacgtc attccttttg attcgataaa ggtgatatat ttcgagagac cggatgaata 12240 acctgatagg gtcgacttag cccagctacc aaccgctaga tgtgctcctg ggatggttga 12300 ttgcttgacc gctcttccga aatgtccagg tacttcatac gtttttgtac ttttggtgag 12360 tttctttctc tttacttctc ggggcatgat ttgattaatc tctgaagtta actgtcttct 12420 ttcatttggt tctagttact tgtcattgaa ctctccactc ggaaacctct ccttgatctc 12480 acttcgactc gcttacgagt cgtatatttt tttttttttc acacacccgg caagttggag 12540 accaaattgg ccaactccgt ggtgaacagt cgtgatctta atgaaagagg gcctgctttc 12600 tctcacgggt cacaaggggt ctcacattgg tttcaattca acttcattgc aacctccctt 12660 cccgactaca tctgtcttag gaaggcgttc agatcctttg gtaggtctac tactatctta 12720 tcttcgttct tcttttttga gactttccct cttgacaacg cgtctgccgc attgtctgct 12780 gatattaccc tcttagctgt tatcgagtac tgattcttta tcagccgttc ctggatcttt 12840 ttccattctg cattcactag gggatctttt gatttcccat ttaccactgc gttttccgtg 12900 accgttttgt ctgtcaggac caggtaattt ctaccctcta ctgattctag ctgttcaatc 12960 atcaataaac ccaggcgtac cgcgaccgtc tcggcccacg cgataccacg tctgtccccg 13020 ttcgcttctc ttgcatgcca attgggtttg agtttaaact gagaccactt ctttcctatg 13080 ataatgccta ttccatagga ggttgaagcg tctcccaccc agcctacatt tcgaacaacc 13140 ggttttggga tcaccctcag tgggtttgcc tctcgtaagg tatccagcca 
gatattcaaa 13200 tcttctacaa tctcttctga taatttacgt agtgcgtact ttttccccca atttttcatc 13260 attcggtaaa ttgccgcggt atatactttc atttgtggaa agacaaaggt aatgtggttt 13320 aatctaccta ccataacttc cgtctgttta aaagactgct tatctgcttg taagtacttc 13380 atgatttggt taattcttaa ttcaatcttt tcttttggca acctgacggt tttattttcc 13440 ccgttccata caaaacctag gtatttttgt tctacgtcga actcccgcca cttcttctca 13500 ttggtttcta ctcccaatct ttcacttgct gatgctattt ctttcattgt cgtttggttt 13560 gacccgtccg cgggttttat gaataatgtg tcatcgaccc atcggaagcc ctgaagtaag 13620 ttaaatttaa tcatcattat ttccttccaa gcttccgccg cctctccgaa agttccgcat 13680 ccgcctaccc cgccgaactg gactcgtgta ttggcgtaaa cttggttgtt gaagccgagg 13740 atcataagga atctccattg attcattgca acaggtattt gacgataggc ttttgcccaa 13800 tcaaatactc ctagcactag acgtccttta tacatagtaa aaaagtaggc taccttcttg 13860 aagtcatccc aagaagtttc gaagtcatct ttgtcgactc tgaagtttac tgattcgatt 13920 tcggggtcgt ttttcgggta cgaaaggtta cttataggtc gaaatgatcc gtccccattt 13980 acggctgctc caagtgggtt tgttctaaag aaacccaatt tctcgaacgc ctcttctttc 14040 gaccaaggtc cgaaaattct gcctctctta atctcagctt caatgttttc ttctatcttt 14100 tctttcgcct tcaacgctga cgtgtgatct tccggacaat aataccgcat tccttttatt 14160 tcagcatctc ttataccttg atcgaaacca tgcttaaacc cctttaggat atgctcatac 14220 tttacttcta ttccatatct tcgaagctct tgtttccaca ctgataggtt catttcacac 14280 gagggagttt ctggccaata tctgcctctt ctttctgacg gtacctctcc acttgacacc 14340 tttgggctta acctgttttg aagacaggaa ggagggtgtg gtggtatgaa ggggtcattt 14400 cgttttgatt tgggtcaagg tggcaagaat gtacacggaa ggcaaagggt atgtatcaca 14460 tgttagctac tcaatggcaa tttggcacgt gtctttcgac ttttatttca aggagcatgt 14520 gattccgact gtatcgttct tagcttacgc tcttttcccg ttcacaggtt ttccatttct 14580 atcccttcgg tcagggcttc tgcttttatc acgatggcta gcataccggt ctccatagca 14640 atcttctctt ttccttccat cgtcaaatcg tcctttatat cctccacctt tacctcttcc 14700 gttacctttg tagccacttg aggatgcatg tctattctct ttcgcatggt aagttgtttg 14760 atttgatcct gacgagttct tcggcttatt cgtgattgga cagatatttt cttttggctt 14820 cccggacgca taaggatttg tcttgtaatg 
acgtttgttg tagctttccg cggtcgtctt 14880 cgcctcggct aacaagatct cttgaagttc ggcatagttc ccaaccgatt tgtcaaccgt 14940 ttccctcatt acaccttgcc taactaactt gcagtacctt aacgctacca tccaaccaaa 15000 atctttacga agcttcttca ctactgctat gtgaccttca aactttgttg ctacgggcct 15060 ccaaccgcaa gctaccaggt gagtacaaaa aagttccata caatctcccc acacttcgta 15120 atcgatagtt aattcttcga ctggggcttc tcctccataa atccgtgctt cacctgactc 15180 cttcttatct gatttggtag ttcttgaaga ccacgccttt tgatccgcta ccagccattc 15240 ttcatcgaag acagtcaatt ggatgtatga agtaagtcct tttagaatta tagttagacc 15300 tggtgataaa ggcgctaatg actgtttcat ttgtgcaccc cctcctctta agatccttcc 15360 gttgtggatc acatccccgt tactcaattc taacaaacct ttcttcgaag acattatcgc 15420 gagagccttg tccgattcct cctttgacgt ggcgccttcg aaccacgctg acacatctgt 15480 tgcttcgtcg atggcgcccg ggacagtagg ctggttctgt cctgccggtg gttgaatgtg 15540 cacatcatga gacaggggcg gattggctgc tatagtttgg acgattggat ctttaactcg 15600 ttgtgagttt tgtggaggga caatttcctt gacgaatgct accaaagttc catctgacat 15660 tttggtatcg tcttcgtcta agacatatgg gttcttcgtt ggtctttcgt ccctgcctcc 15720 atccaggatc cgctttagag acttagcttc tttgtcgtcc cctctgtcgt aagcgtcttg 15780 gattaacgct ccgagggtgc ttctccttgc ctggtctgat tatttttcca atgagatgca 15840 caaacagtca gctgacgtcc tgtattcagc ttgtctttct cgtgatctaa acgcgatggg 15900 tggattcctt tctgcagtct tgttaaattt ccgcagcctt taagatatcc cggcatacga 15960 cttttggaga ctctattgtc taggcaccta ttttaagccg ggccgaggct gggttcctac 16020 tgtaagccgg accatggctg ggtccctatt ttaagccggg ccaggctggg ttcctattgt 16080 aagccggacc tggctgggca cctattcgaa gccgggccag gctttcttga agtacgttca 16140 tcttccctgt attctaccta aagcataaat ctttgggaaa aaaccgagcg taattatact 16200 ttttggactt cggcacaggt tctcttttcc atatatgttg gttaggtttc ctgcgccatt 16260 tacatcctat ctatggttta tcaaggcgga actgagaatt ttatgaagag gtggatttga 16320 aatgtgttac tgaagattgt gccgatgggt gggtacgata tcttggaatt attgtggtat 16380 atagaatgta gagaatttca tagacgaagt gtgggaatga gttgaaaagt ttatcagact 16440 tacttttatg atctgtacgc tcatttcctt gttcttcctc ttctctctcc tctgtacctt 16500 cctcgtttcc 
actacctatt cctccccact cttccatctc ttcttctacc gcttgaagtt 16560 cctcctccct ttcatttcgc ggttccacct gacgcctttc cttctttttc ctcaactcct 16620 gttgcgtttt cctcaagcct ttaggacgtt gtccatattt cttgattcct gacattcttc 16680 cgttttgatc accggtagga gttctttgtg acgttacgct tcgtcggccg tttccacccc 16740 gcttcggtgt tatgagtcga ttacctggtt gtacccccgt tgctgctgca tttgctctcg 16800 tcagaactct gtttgtttgt tctaattgga gctgaaatga catttttgtt gttttcttgg 16860 ttcttttttt aattttcttc tttctttcct tgatttatat cttgaaattt tctttgactt 16920 taatcggttt gatttgtgtg ttttatttta attttgttgt aataataata attttgaaaa 16980 gaacagaaaa ggttgaccta aataagattc gttggttcga atctttccgt gtgaggtgct 17040 gtggtttgac tcgtatgcgg aaataggtcg tgggagagaa ggaaacctta gagtattgct 17100 tctctacgac cagccgcggg actgtgacat gtgtcttaca tgtcacctca tagttcacag 17160 catcgtggac acacgcacca aaccgtgcct tatttggtgc acatcagacg atggcgcttt 17220 attgagacga aacgtagaat tcggagaatc tggaggagga aaatgggtga aactacattt 17280 acggaaaggg aagacagcga aaccgaacga agtacagatc atacacttac aagaacagcc 17340 ctcggtacta gatccagtgg ccgcggtaag aaggattatg gagatgaatg gcagcacaga 17400 tcgagacgag ttcctgttta agacgaagaa gggtttgctg aagaagaaaa ggtttttaac 17460 aatcctaagc gaaatctggg gtgaagcgaa gatagggact tggacaggtc actcctttcg 17520 agtaggagga gcatcgatta gagcaaactt aggaacaagt gagaaaacat tgaagaaagc 17580 aggccgttgg aagtctgatt cgtacaaaag atatgtaaag ttatttacta cggaagaact 17640 acaaaagacg gaaaagtttt tagatgctat aaataataac aaagttcgtt gaaagatatt 17700 caagaatata gatggatatt caagaaatat aatcacactc gttgagttaa atgcaccatg 17760 cattgacgca cctgagacca aagccgtgaa ggggattgtc ctccaggcca aggacggctt 17820 ttagcaagat taacagcttc cttggtagtg ggagcgtcgg acaacccaac caaccctcaa 17880 aggagggtgg gagggaagga ggctgaaagc cacccgcact cgtagagatt ggtttcgaag 17940 aaccaaaatg agcgcggggc acagcttccc ccgaagacct ctcacctcac cagaaggcaa 18000 ggtgagaggc gggtgggttt aaaactcagt taggataaca gagtgggttt tacattaaac 18060 ccacctctga tgatatcttt aagctatgaa ctaagctgaa gagtggatgg aatacacctc 18120 gaggcacgag ttggtggaaa tgtccactct cgtgaaaggt ctagattagg agtgacactt 
18180 ggcgtccgag aacttctcgg aagagatttg taattcgaaa tctctctaag ccattcagtc 18240 actcctcatc tagacctcca cagcgcttag ctcaagcata aaagataaat caaaatagtt 18300 ttaaaaattc ctacccaccc acccggcgaa ttaaggaccc acccaccgac ccaggctgaa 18360 agccacccgc actcgtagag attggtttcg aagaaccaaa atgagcgcgg ggcacagctt 18420 cccccgaaga cctctcacct caccagaagg caaggtgaga ggcgggtggg tttaaaactc 18480 agttaggata acagagtggg ttttacatta aacccacctc tgatgatatc tttaagctat 18540 gaactaagct gaagagtgga tatactagtc attatatgga atatactagt atatttatac 18600 ttatactatg ctgaaaatat acccagtata tttggattta ccaaacatac ctggaagaca 18660 cctacaagtt tttgacaagt ttttataaat atactgcagt atatttcaat tatactatac 18720 tatactgcac aatactatac catgtttgga aatatactat accaagtata tttcaggtcc 18780 agtatagtat agtgcagtat agtacagtat agtatagtgg cagccttggt tctattatgt 18840 gaatgaacaa ttattcaaag ggacggtgac tctgcattgg gtagaatcaa aaaatcaaaa 18900 ggctgacatc ttaaccaagg cactgggccc attgctgttt gcggctggga ggaagaattt 18960 atgtttggag tagtttttct tattttcttt tatattttct cttttgattt tttaaagata 19020 tatgtgtgcg cgtttttttc ttatcatcag tttcattttt tgtttcgaga ggagtgctgt 19080 ggcttgtcgt ggggggg 19097 // ID DNA-2_EN repbase; DNA; FNG; 4506 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 19-SEP-2005 (Rel. 10.1, Last updated, Version 2) XX DE Nonautonomous DNA transposon. Putative classification: MuDR DE superfamily - a consensus sequence. XX KW MuDR; DNA transposon; Transposable Element; Nonautonomous; KW DNA-2_AN; nonautonomous DNA transposon; putative MuDR superfamily; KW DNA-2_EN. XX NM DNA-2_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-4506 RA Kapitonov V.V. and Jurka J.; RT "DNA-2_AN, a family of nonautonomous DNA transposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(12), 204-204 (2003). XX DR [1] (Consensus) XX CC Nonautonomous DNA transposon. Putative classification: MuDR CC superfamily. CC 9-bp TSD. 
XX SQ Sequence 4506 BP; 1319 A; 868 C; 907 G; 1409 T; 3 other; gagtacacag gtgaggcggc gggtaaaata tgtcgggtgt ctcgctaagt tgcacgtgac 60 caaattaaga gtcatctgag ccacagtgtc atttacccgt caattcagaa tttcatctgc 120 ttaccgtaca cattgaagct gcgatcatca ggcctgcaac tagttttcaa tcatttttga 180 ctggaaaaaa ggttcctaca atgtctttga ttggacagaa caaatggttc aaaaaatata 240 aagaagaagt gaaaagtggc aagcttgact tgcctgcttt ggaattcgtg aggcttcatt 300 atcactacat tatcaattaa ctacttgcta agtccttact gcgtcaagga tgcgaatcag 360 aatgttgtca tatactatgg tgaagtcttc tgtcgctacg aagattgtgt gaagaatcgc 420 gtgggtattc aactattttc tagctgtttg tcaactgttc attaactgtt ttacaactac 480 ttagtcgcca ttctctacca ctaataatct ccgaacccat ctccgcgatc agcatgactg 540 taaattagag gagagcaaag ggggtcgcaa tgctcacaag acgattaatc tgggcatacg 600 tgaggatctc caactacctg gcaactgctt tctaactact tagaatggta caagggcctc 660 ttctccgagc aagatgcaca ccggccagtt gcccttgaag ataaacatga agccgttcag 720 cagcaaatta tcagcaatct acaaacccag tccaagtcag acctgtccgc ggccctacct 780 tccctgcccc gaaagaagga tggaacagta agattaatta cctgttagct actttatagc 840 tgcttgctaa ctactcaggt tcatatttct aacatgcgca agaaggtcaa agagctgggc 900 catatcatac cctgcaataa atgtccatct gctaaggact gctgcaagga tcagaatacc 960 tgtgagcact tccgtcattt tgagaatgat ctagaggagg acgaggacna agacaaagaa 1020 tgagacacat cttcgtagta actagaagtt ctgtcaggcg ttatggagct tgaattatca 1080 gaagttctct aataagttac cagttagttt acagctagtt atgaatcaat ttcctgcagt 1140 ttcttccgtc tttccataag ctctaattcc ttctcctcat tttgaagctg caaaacccga 1200 atttcttcct gttctttttt cagcttcatc ttaatttgct ccagctccaa ggcctgccgc 1260 tgttcctcaa aagataatrt atcctctgag ggtcttataa ggctttctga agaagatgac 1320 ctaaccattt ataaggtggt tagcaagtag atactaagtg attgctaagc attggcagag 1380 atattatcca cacaccggat taaccttggt cgtgatcgtg acgtacracc agatcgtgaa 1440 gataaatgtg ctgatgatga ccgacggcgt tttcgagatt ctagaatagt taattgctga 1500 tataatatat tgatagagaa gatacctacc ttcacgagct aaatggcgta ggtaattagc 1560 ttccatattt gctgtcctat acgagtgatg aactccaaag tagtcatggt tttgatactg 1620 aataatatct tgcttgtcaa gctttgcaga 
gctatattaa tgcaagtagt tagtttgtga 1680 ttaggaagtg gttaggaagt agttagtaag ccaataactt acttctgaac agcctcaatt 1740 aatgaaagct gtttaccacc agcatttgct ttatggtgtg actgctctgc tgaattagta 1800 tgattccgta tagaatcata caaagatgat ggaatcctag agcagttttt gttaagccca 1860 gctttaataa ctgcattctt tttatggaga gcccaattct gaatttttgg gtcctcatat 1920 gctaggataa attgttaata agtacctccc aaatatttac gaaacactta cctgcaagta 1980 agtcacataa ctgatcataa tcctcctctg atttacaatc aaccagactt gccatgcgac 2040 tccacacgcc tgttccttta ttgtcattac caacaatctc agtaatagtc cggaaaaaat 2100 ggacccggca gaaaataatg atgctctgta gttgccataa tagggaacga tggtagggat 2160 caatctcttt aagatatcga gccaggccta gctagttgtt aactacttat caagtaattc 2220 ccaagtggct gattaatgat tgacctacca gcatattgtt tcgtgtccat gtcaacaata 2280 attccagata ttccatttcc atggatagaa ttaaattcca caggttgacc tgttactttc 2340 tcaacaagag taaatactct tttgaataaa agatagtagc ctgtcgtagt ttcttggttc 2400 gtaaacacac acattaaggt gataactaga gtagtttgta agtggcttcc aagtagttgt 2460 taactggcta tggaatacct acttttcccg tgctcatgta ggtaggttgc aaaaactacc 2520 tcattcatgt ttttttgtct tatccttttg taggacatat caacttcaaa tgaggttaac 2580 ttgtttagaa gcatgatctg ctccttatag gcacataata tcataatgcc atcaggatca 2640 cggtatttct cctggatata ttgctagata ttgttataaa gcagctccca actagttata 2700 aaatatttat aatcttacct tcaaggtagg gtttatgtta gatatgtata ccacaccatt 2760 gaaatcctgg ccttcgggat atgtaagtaa acgttgtttc tgtataatgg cagatattct 2820 gtcaatattg gaaagggagg cgtgtatttc agctggtgta gaagcattat actgttgaca 2880 aaaagcctct aactcaggac ttcgtaggaa ttttgctatt gagtagttgg taactagtta 2940 tcgaacagta atcaaatagt attagacaac ttaccaaggg ttaggcttgg attccgcata 3000 tttctaatta ttgattcaat ccctttcaaa atctgctccg gtggcttgtg tggtggtggt 3060 ggaggatgtt gatgaattcc atgtgaagta aagaggatat agggagtttc ctgaatatta 3120 actggtacca atgcagtaaa gattacactg cattttatat gttccagctt gccggggcct 3180 tgaacatggt caaagtctag aagtagttta gaattggttt gccgaatgtt gccaagcagt 3240 tttaaagtag ttgggaacca cttaccacag aatttgcgcc gtgttgttag tggttcaata 3300 acagcacact cctcctgcga cgggagaatg gactcattaa 
ataaccgctc taagaattca 3360 agatcaattg atgtatggcc tttcagagag ctgtgataat gtttctgtag aggacctggt 3420 gcgctattta tgcagctaat atatggatgg tgcttattat tgatgtcctt ttagcaagtt 3480 agtaggtagt taataagcag tttccaagga cttaccgggt agaaagattt tcgaaataca 3540 ggcgcacagc ttgaaaggtt ttcgatacaa gctttgccac tgttgaagaa tctcttcttc 3600 gagcgataaa gactgaatag aagcaattcc ttgtaagtaa ttggcattta gtttccaggt 3660 agttccaaag tacttacctg aatgcattcc tcttccgaat atcattttca acaaggtcaa 3720 tatctttccg aattgtttgg atttcttgcc atgctttttc atcaaggtat gtatgataaa 3780 gaagtcgaag acttggcttg atatattcac agacaaaagc tcccgaacat ttccaactcc 3840 atttcttaac cttacagttc aagaaactgg aattgatatg tcttagccca tggaggcttg 3900 aacgagaata ctgaatcttc aagcaattaa caaacagttc ttaagcggtt acggaacagt 3960 tctgaagtac ttacgtcttg aactcgttgc tccgcctccg attgtgacat tttatctgct 4020 gcaatgatat acgtatgacc attgatatga gtctttggat actcaggaag gtcatctata 4080 tattcaatat agagcgtctg taaagggtgt aactttgcat cggtaaggtc tgtcaaagtt 4140 gttggttggt cctaagatat atatttagtt atgcagtact taagagctat acagttatat 4200 tttacctcca atacatcctc tcttacctcc tcttcaaacg tatctgatag aatatcatta 4260 atatcttctt cccaatcaac tggatctatg ttgtcatcca ttataaaatc actctagaac 4320 tgattgggaa gtactttgaa agtggtttgt acgttgggag tacgaagaaa taataaagga 4380 gacgaattga aaatgaaatt cggtaattaa ctactgtatt taaaggctca gttgactagt 4440 cgggttgcca agatttggtg gggccatcat ccgagttttc cgatgaatca cttgacctgt 4500 aacctg 4506 // ID Gypsy-11_RO-I repbase; DNA; FNG; 5122 BP. XX AC AACW02000050; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_RO_; KW Gypsy-11_RO-LTR; Gypsy-11_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-5122 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000050; Positions 6476 1355. XX CC Positions [3700-4176] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(25..2865,2869..4941) FT /product="Gypsy-11_RO-I_1p" FT /translation="MNSTVENSGANTAEMNSMGAANDSNDYSQDFNEGSSL FT SSAGMSIQDDSAMVVDSGSDFVSSSVVSNKKKDNVAINLTQLRSEMDEQLK FT MFCHIRASGDDAAIDEALRGIDKTKRRIAAMLECQSFYKKLDMHQNPTGGS FT NAVSAGLSLSHRDLPKFQLATSVIRPFPNEEVFESVEHFLTKFENIIEGSS FT YQSVEHVWKKFLPLCLPYSDNAWVETELKRCRNWSEARQAFVGHHGSSMSS FT SYYNDLVFTMTMSSKESISDYSKKFLQAVFYAGLPKDDPRIADRFLASLTL FT PVQTIIRMTVTRVEGKRSRPHNWTVEYLSQVGRDVLGDDNSLYAEATAMIP FT GANRVKKQEGVDHGLASKHRISKPIKANKLAKSFFCSHHGKNGTHNSKDCY FT SLKNRAEKNTEKKENNCYKCHQPWFKGHVCKKDEPRRVLAVSKVEKQEESQ FT IDKATAEVEAMMEDLSYDCKYPSKNKNKACNDNSAFNLLTPIFIENIKLIG FT LIDTGSDASFIDVISLNKKLKINKVNKITGSYNFLSHNNEISRIGITDPLQ FT FKYANGIQFKHSLEVMKFNKEFEFDILLGTDILPKMNIGLTGVAFKLDGEH FT SHSDTTANDNLILDNINIDRENKHEPDNSPAGTSNQRAEFFKIIKEAIDKN FT QSIPIESSCPMPESIIHLPTKEGATAYRRQYPIPHALRPVLDKQIKEWLET FT GTIIKSKINTSFNSPLLLVPKRNKAGEIVNHRVCLDVRLLNKILPPSFNYP FT VPLIRDIFDNLAGKKVFTTLDLSNAYHRFKVAPEDVHKLTFTHNNAQYSFI FT KGCFGVKMLTSQFQKCLATLFDGIDCVQNFVDDCIIASDNYEQHAKDVKLV FT IDKLTSVNLIINPEKCVWFQQSVRLLGFVVNTTGTKVDRKKLTNVENWPLP FT NKSNKQIQQFMGLINYFREYIPMISRVAEPITRLSNAANVEELTDEQTNSF FT YALKKILQSDMILHYPNLNKKFFVATDASLYGVAAVLYQKDEHDRDKYISF FT ISSSLTPSQRRWSTTKRELYAIVLALNKFRKFLWGRHFTVYSDHKALVYLH FT TQKIANSMMIGWIETLLDFDFDVVHIPGILNKLPDMLSRLYPPLQDSYKLV FT EDNVPSKLNNRNKKRVIAKRKKYSRDKIINVLATKLVETKNERTDYMTPPE FT EERDAILRETHSFGHYGYQAIVRDIHSRGMHWTNIYDEAKSIMSSCTECQK FT HNISKRGYHPLTNVMANRPFDHIAIDLAGPLPVTDEGYLFLLVVTDICTKY FT VVIRALKNKQSDTIAKQLISIFGDYGFPRIIQSNNGTEFRNALMTSLSKNL FT GIDRRYSTAYHPQGNGSAEASVKIALNTLRKMVQSNSRDWDHYLPIVQLCM FT NRYIKNKSLSSPFSLMYARRVNLPDDYTNKEKYPLPKDVMTIKELEERVNY FT MENVIFPAINERTQKINEEYSKRYNDKNILVNIPSGTHVMVRLNSRSGKLA FT PLYEGPYTVVRKNKGGSYELKDEQNELMHRNYTPSELKIVHIDESNIEDEY FT 
YELEAIRDHRGPSGNREYLVKWAGYGERANTWQKAGDFTDPTIIQKYWDKQ FT DELKKLEHERAEQLVNKASSNSKYNESNRSSTPKVSDEKKVHSVETKKRTL FT PTKLSREERLLKRRLNKEKK" XX SQ Sequence 5122 BP; 1787 A; 887 C; 1003 G; 1445 T; 0 other; ttttttactg ttaattactt gcaaatgaat tctactgttg aaaactctgg tgctaatact 60 gctgaaatga attctatggg cgcagcaaat gattccaatg attactctca agacttcaat 120 gaaggttctt ctttgtcttc tgctggtatg tctattcaag atgattccgc tatggtggtt 180 gattctggtt ccgattttgt atcttcttct gtggtgtcaa acaagaagaa ggacaatgtt 240 gccattaacc tcacgcaact gcgcagtgag atggatgagc aattgaagat gttttgtcat 300 ataagggctt caggtgatga tgcagccatt gatgaagcac ttcgaggaat agacaaaaca 360 aagcgcagaa ttgcagcaat gttggaatgt caatcctttt acaagaaact tgacatgcat 420 caaaacccca ctggaggtag caatgcggtt tctgctgggc tttccttgtc tcatagggat 480 ttgcctaaat ttcagttggc taccagtgtt atcagaccct ttccaaatga agaggttttt 540 gaatctgtgg agcacttctt gacaaaattt gaaaacatca ttgagggttc ctcatatcag 600 agtgttgagc atgtttggaa aaagttttta ccactttgtt taccttacag tgacaatgcc 660 tgggtagaga ccgagcttaa gagatgtcgt aattggtccg aagcaagaca agcattcgta 720 ggtcatcatg gatcaagcat gtcaagtagc tattacaatg acttggtttt tacgatgacg 780 atgagcagca aggaatctat tagtgattac tctaagaaat ttctgcaagc tgttttttat 840 gcgggacttc ctaaggatga tccacgtatc gctgatcgtt ttttggcatc gttgacgcta 900 cctgtccaaa ctatcatccg aatgactgtg acaagagtgg aaggaaaacg aagcagacca 960 cacaattgga ctgtcgagta cttatcgcaa gtgggacgtg atgtccttgg tgatgataac 1020 agtctgtatg ctgaagccac tgctatgatt cctggtgcaa atcgggtaaa gaagcaagaa 1080 ggtgtagatc atgggcttgc aagcaagcat cgcattagta agcctattaa ggcaaacaag 1140 ttggccaagt ctttcttctg ttcccatcat ggaaagaatg gcactcataa ctccaaggat 1200 tgttacagcc tgaagaatag agctgagaag aatacggaga aaaaagaaaa caattgttac 1260 aagtgccacc aaccatggtt caaaggccat gtatgcaaaa aggatgagcc tagaagagtt 1320 cttgctgttt ccaaggtaga gaaacaagaa gagagtcaaa ttgacaaggc cactgctgaa 1380 gtcgaggcga tgatggagga cttgagttac gattgtaagt atccaagtaa aaacaagaat 1440 aaagcatgta atgataacag tgcatttaac ttattgactc ctatctttat agagaacata 1500 aaacttattg gtttgattga tactggaagc gatgcaagtt 
tcattgatgt catttcttta 1560 aataaaaagc taaaaatcaa taaagttaac aaaattactg gttcctataa ttttttgtct 1620 cataataatg aaatatctcg aattggtata actgatccgt tgcaatttaa atacgcaaat 1680 ggtattcagt ttaaacattc attagaagta atgaagttta acaaagaatt cgaatttgat 1740 attcttcttg gaactgatat cttgcctaaa atgaatattg gtttaactgg tgtagcattt 1800 aaacttgatg gtgaacactc acactctgat actacagcca atgataatct aatcctagat 1860 aatatcaata ttgatcgtga aaataaacat gagcctgata attcgccagc tggtacttcc 1920 aatcaaaggg cagaattttt caaaataatt aaagaggcca ttgataagaa ccagagtatt 1980 cctattgaat cttcatgtcc aatgcctgag tccattatcc accttcccac taaagaaggc 2040 gctacggctt atagacggca atatccaatt ccgcacgctt taagaccagt attagacaag 2100 caaataaaag aatggttgga gactggtacc attatcaagt ccaaaatcaa cacatctttt 2160 aatagtccac tccttctcgt tcctaagaga aacaaagctg gtgaaattgt taatcatcgt 2220 gtgtgcttgg atgtgagact gctgaataaa attttacctc ctagttttaa ttatcctgtt 2280 ccattgatta gagatatttt cgataacctg gctggtaaaa aggtgtttac aacacttgat 2340 ttgtctaatg cataccatcg attcaaagtt gcacctgaag atgtgcataa attgaccttc 2400 acacataaca atgcccaata ttctttcatt aaaggatgtt ttggagtaaa aatgctaact 2460 tcccaattcc aaaagtgctt agcaacccta tttgatggga tcgactgtgt acaaaacttt 2520 gttgatgact gtattattgc tagtgataac tatgaacagc atgcaaaaga tgtgaaattg 2580 gtgattgata agctgacctc tgtgaattta ataataaatc cagaaaaatg tgtatggttt 2640 caacaatcag ttcgactttt aggctttgtt gtaaatacga caggcactaa agtggatcgt 2700 aaaaagctaa caaatgttga aaattggcca cttcctaata aaagtaataa gcaaatccaa 2760 caatttatgg gtttgatcaa ctattttaga gaatacatcc caatgatttc cagagtagct 2820 gaaccaatca caagattgag taacgctgcc aatgtggaag agttgtgaac tgatgagcaa 2880 acaaacagtt tctatgcttt aaaaaagatc ttgcagtcag acatgatttt gcattatccc 2940 aacttaaata aaaaattttt tgtggctaca gatgcatcgt tatatggtgt tgcggcagtt 3000 ctttatcaaa aagatgaaca tgatcgtgac aagtatatca gttttatctc ctcttcgttg 3060 acaccctcgc aaagacgttg gagtacaact aaaagagagt tgtatgcaat cgtattagca 3120 ctcaacaaat tcaggaaatt cttatggggc agacacttta ccgtttatag tgatcataaa 3180 gccttagttt atctacatac ccagaaaatt gccaattcta tgatgatcgg 
ctggattgag 3240 acacttcttg actttgattt tgatgtagtc catatccctg gtatattgaa taagcttcct 3300 gacatgttaa gccgtttgta tccaccttta caggatagct acaaactggt ggaggataat 3360 gtgccaagca aattaaataa ccgtaacaag aaaagagtta ttgctaaaag aaagaaatat 3420 agcagagaca aaattattaa tgttcttgct acaaaattgg tagaaacgaa aaatgaacgt 3480 accgattaca tgactcctcc tgaagaagag agagacgcta tacttagaga aacacattcc 3540 tttggacatt acggatacca agcaatagtc agagacattc atagtcgagg catgcattgg 3600 actaatattt atgatgaagc aaagagtatt atgagttcct gtaccgaatg tcagaaacat 3660 aacattagca agagaggcta ccaccctttg actaacgtta tggctaatag accgtttgat 3720 cacatagcta ttgatcttgc aggaccactt ccagtgacgg atgagggtta cttgttctta 3780 ttagttgtca cagacatttg tacaaaatac gttgtaatca gggctttaaa aaacaaacaa 3840 tcagatacca ttgcaaaaca gctaattagt atctttggag attatggatt tccacgaatc 3900 atccaaagta acaatggaac tgaattcaga aatgctttaa tgacaagtct atcaaaaaac 3960 cttggaatag accgtagata ctctactgca tatcatccac aaggtaatgg aagtgctgaa 4020 gctagtgtaa aaattgcatt gaatacattg agaaagatgg tacaatccaa tagtagagat 4080 tgggatcatt atcttccaat tgtgcaactc tgtatgaaca gatacataaa aaataaatca 4140 ctaagctctc cattctcgct catgtatgct cgcagagtaa acttacctga cgactataca 4200 aacaaagaaa agtatccact ccctaaagat gtaatgacca tcaaagagct tgaagaacga 4260 gtcaattaca tggaaaatgt tatatttcct gccataaatg aaagaaccca aaagattaat 4320 gaggagtaca gtaagagata caatgataaa aatatacttg taaatatccc cagtggaaca 4380 catgtaatgg ttagactaaa tagtagaagt gggaaacttg caccactata cgaaggacct 4440 tatacagtgg ttcgaaagaa caaaggagga tcatatgaat taaaagatga gcagaatgaa 4500 ttaatgcatc gtaactatac gccgtctgaa ttgaaaatag tgcatataga tgagtctaac 4560 attgaagatg aatactatga gcttgaagcc attagagatc atcgtggacc ttctggcaat 4620 agagagtatt tggtaaaatg ggctggttat ggcgaacgtg ctaacacatg gcaaaaggct 4680 ggtgatttca cagacccaac tatcatacag aaatattggg ataaacaaga cgaactgaaa 4740 aagttggaac atgagagagc tgaacaactt gtaaataaag catcctccaa ttccaaatac 4800 aatgagtcca atagaagttc cacacctaaa gtgagtgatg agaaaaaagt acactctgtt 4860 gagacaaaga aaagaactct tcctacaaag cttagtagag aagaaagatt gttaaagaga 
4920 aggcttaaca aggagaaaaa ataataatat atataaaaaa aaaaaaaaaa aaaaaaaaga 4980 tcacggacaa agatagtcac ggtaaattac cccacaattt aacatgaaaa tgatctacca 5040 aattgattag acatttcata acgcatacct attcaccttt atgccgaatc aacaaggcgt 5100 tgttgaatct ggtggaggat aa 5122 // ID Copia-35_MLP-LTR repbase; DNA; FNG; 365 BP. XX AC AECX01001650; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-35_MLP_; KW Copia-35_MLP-I; Copia-35_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-365 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001650; Positions 81499 81135. XX SQ Sequence 365 BP; 89 A; 91 C; 50 G; 135 T; 0 other; tgttaaatta cgtgtgcaat ggtcaacatc ctacttaatc acttagctac caattccgca 60 ttggtcctgg tatcatacct tttcttttct cttgaacgcc tcactcctta actatttgta 120 gacgttttac tcttgtccca aagcagtgca tgatactcac aaacctgata gaagaacgtt 180 aacctatgtc tagagaagtt acctatataa agaccttttg tttgcgtgtc atttgttttc 240 ctcactcaaa cattatcgtg tcctttattt gctaccttcc aacaggtgct tatcttcttc 300 tcatcactca ttagtgattc tcagcctttg cgctcgctta catcactaat acgtgattct 360 tatca 365 // ID Gypsy-23_LBS-LTR repbase; DNA; FNG; 236 BP. XX AC ABFE01002024; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_LBS_; KW Gypsy-23_LBS-I; Gypsy-23_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. 
XX RN [1] RP 1-236 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01002024; Positions 930 1165. XX SQ Sequence 236 BP; 46 A; 64 C; 30 G; 96 T; 0 other; tgtaatcccg ggattacgct ttgtttggca tccattctaa ttttagcgca tatttaactt 60 accatacgct tcgcttcttc ttctctttct ttagaaccac gaccatctct ctctctctct 120 ctctctcatt acgctacttt tccatactct cagaatcata cacctggtta acttctgtga 180 gtatcattaa gttctttgag tgttttctat cccctcttct aggggatagt tttcca 236 // ID Gypsy-13_MLP-LTR repbase; DNA; FNG; 162 BP. XX AC AECX01002076; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_MLP_; KW Gypsy-13_MLP-I; Gypsy-13_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-162 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002076; Positions 3904 3743. XX SQ Sequence 162 BP; 41 A; 44 C; 25 G; 52 T; 0 other; tgttatgagc cactattagg ctatgcttgt actgagtcct acttgagact atactgtgcc 60 tgtcagagat tttcccaaac ctcacgttgc aatctcatta caatctcatg gcacctttcc 120 tcatagactc ttgatcacca cttcacgagg tcctaatata ca 162 // ID Gypsy-78_MLP-I repbase; DNA; FNG; 13689 BP. XX AC AECX01001152; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-78_MLP_; KW Gypsy-78_MLP-LTR; Gypsy-78_MLP-I. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-13689 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001152; Positions 19751 33439. XX CC Positions [1117-1479] - Integrase core CC 'TGATG' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 7934..9103 FT /product="Gypsy-78_MLP-I_1p" FT /translation="MYVNVVLIMCQYGTLCIPYNTNNFSTPLGGENLFYSN FT IFFTLFFVFLFHPVFLFLLACSDTLPVTTWSCKTSMSSSDNPVLPCSTRQH FT PGATLNQVDNPERIIFPTRPQQPASHQIPSATFIDSPIRAASAPPAPDHTI FT RGPPLIQSKWDLIVNPPNPQGYLVVMATWPLPGISVSLDQLSDNLVPAASQ FT PSLPRPKPRRSVPGSFDPQLSILASEPSSTTQVMDQQSSSLTQDVNVARTK FT AENEQLCQDNVSLRAEMGDICRLLENLLAQQKPPVGDVTGGSSGDATSTQR FT PPHVDPSLDHGSQTAQIFTLTPVVNNQGNLFHVPASLPSVPETAHIPMISH FT VDLSKFRASDWPQYKGAFGDVAAFRLWQYQMESIFWVKQIDQAEDRF" FT CDS join(9197..10210,10214..13312) FT /product="Gypsy-78_MLP-I_2p" FT /translation="MVKMQSVVLPVGWDEAAKEKLRELTMKPNESCTAYCG FT RARLIQEEIGVEGCPDETLAYVVVGGMGGTFKAWAKMEKVVQKSLDATGRF FT SFPIFEERVGSIWLLTQAMDALGSNQSQLSEGRNSSTSAPAPNIQNQFQSN FT NGTQPFFRPALSSEEIMARNVCFGAYMCLIGLCPRCKSPCDKWLGGCTAKA FT NTAFFSVPIEFPRVVPYPPPKSISGNVVGRSTSTIGTPRLVPRKVDVAAVE FT SGGSAVVADVGDFPDLGRVDLAAYDALVAKLNKPTDEGEMVGSDEYAPLLN FT PVERSDDGKTLIVHWLVNGIVFCVLIDTGAGTCLMSERTAETLRVVRPLPV FT LLAVLPAIQSEPESFILKEYAFANLKTPHLAYNFGVTAFKITPLGGDYDLI FT LGAPFLSKHKLDVSISQRLLRNEKNSYRFREQTKIDEMNAAKDLKLRQEAL FT FKTVLDNVEKVGEVHEFSMKEVAMLNEFADLFPEELPALDQDVEDEDAEVF FT PCNIQNEALKIRHRIILTNPDVVINEKQFGYPRKHLNAWAKLINQHVAAGR FT LRRSTSQYASPSMIIPKKDPTALPRLICDYRTLNKHTVKDRGPLPNPDEVV FT RPVGTGKIYSSLDQINAFFQTRMEEKDIPLTAIKTPWGLYEWVVMPMGLTS FT APATQQRRCEEALGDLGNCVCVVYIDDIVIFLQTIEEHEQHLREVLQRLQA FT ANLYCGIKNTKLFCRRIQFLGHIISEDGISADDEKVEKVANWRTPKSAKHL FT KEFLGTVQWMKKFIDRLAHHVCKLTPLTSTKQKTEEFKWGTDEETAFQNIK FT 
RMITTLPVLRNLDYDSDEPIWLFTDASAQGLGAALFQGEDWETALPIAYDG FT RTMTNAERNYPVHEQELLAVVHALNKWWLLLLGLKFNVMTDHHSLTYLLSQ FT QNLSCRQARWLETLSQFDLNFCYLQGPDNSVADALSRVDMAAIVRSHPELG FT AEVKQKIIGGYMEDSFCQKLTKVLPLRDNCTMSDGFMLIDNRVVIPNVDTL FT REDLIKSAHQAVSHLGSLKTAPYLQSEFFWPGLTDNVDKWTSKCDDCQRHK FT ARTTLLPGRAQSTNLPRRPMSSVALDFVGPFPKVSGYDMILSCTCRLTSFV FT RLIPTNQKDSAERTARRLYSSWLSVLGALNDMVGDWDKLWTSRFWQELHKL FT LGIKINLATAYHPQADGRSERTNKTMGQVLRFSTQGQQGRWLAALPSVEYT FT INSAVNVVTGVLPMRFVLGHQPRLFPVASEVDMVGRDVEKWVSLRQNDWEH FT WRDKLWVLRVEQALQYNKRRGGDLKLQVGDLVLVESSNRAAVVGGKVAKLR FT ARYDGPYKILRVLNDGQDGQLQLSANNKSHDIFHISKLKIYWTDASEEAPE FT GQETTGCK" XX SQ Sequence 13689 BP; 3995 A; 2634 C; 2981 G; 4079 T; 0 other; cttttttgaa ctacacccat cgaacacaat cactcggatt cgataattca attcaatact 60 acagaataca tagagcgcca tactgacata tgattagcac tacatttaca tacatatttt 120 ttgcaatcca aattcagccg tacaccaagt tgtgtactag atgagtcaga ctgacacaat 180 agaattccct atttcactac ccaaaccttg agtgtcagta tggcacttca tgtagcacct 240 gccgcaacat aaagtgcata ctgacattca aggtagggtt agccggttca gggcttggtt 300 ttgagacaac cgtcacagcc atgaccaatc atctgtcatc tagggctaca tcattgcaca 360 taaccatact cactgtgaag ccatgaatct catttacgac atacttgacc tacccatcac 420 tatcatttta ttcacttgat acactcatca acacgacaac tttacaatcg taatcagaat 480 gccaacatgg cttgacaatc aaaagctgca tggagtgcta acttatggat tcattaataa 540 tctcaccatc aaacaaattc aatcccggct actgtctgaa catcaaatat ctgtaacaga 600 aaggacaata taccgataca acaagaaatt tgacctgaag aaaaatcaag gtgctccacc 660 aaagccaaac gagtccaatg ttgaagaagc aatcaacaaa cactatcacc aaggcaagac 720 aatgtcagag atgctcattg cactgaaaaa tgaggatctg attaatatta gtagacaaac 780 tctgaatcgg cactgcaaga atttaggact ttctcgtaga tatgatgaca ttcaaagagg 840 taaatacaca gttgaagaaa taacccaact tatcttagag gtccagaagg aaaaccatga 900 aaacactggg atccgatcgc tcaagtcttt attactcagt gaacacaaaa tacatatcaa 960 caggtacttg aatgttatca catctctatt cagattcttc attgttacag ttgcaattct 1020 ttcattgaca gggagccgct aacagaaatc ttaagacaag tggatcctga aggtgttcaa 1080 cgtcaacgtc gcagagttct gactcgccgt gcctttagat gtcctggacc 
aaaccatatt 1140 tggtcggctg atgggcacga caagcttcaa agttggggaa ttaggatata cggatttgtt 1200 gatgcttggt ctcgtcgcat acttggacta tatgtacatg tcaccaatag cgaccctcgg 1260 cacatcgata tatattatct ggatgtggtg aaacggcatg gtggtagccc acagcatctt 1320 cacactgatc gtggtactga gactggcaaa atggcagcca gccatgctat ccttgtttgt 1380 gtatttggtg gtctagacaa agtcgaggca ctcaaacggc acaaatacac taaatccgtt 1440 cacaatcaaa agattgagtg cttttggggt caaatgtgtt gacgtcatac acgcgggatc 1500 atcaatggat tttatgagtc gctggaaaag ggccaatata actctgataa tattttggat 1560 tggtgggttt cttctccatc tgtttgttca caaagttaca cctaacttac aaaatgcata 1620 ctcttttcaa agtttattat tcctttatat ttggatcccc ttcatgcaga agcacttaag 1680 gaaattcacc aaccggtaca acaacttcag aagatgaaag gatcctcgaa ctgcccttcc 1740 aagtggaact agtgctgatt ttgcttataa gaatcctcat gagttgtttg gggggattga 1800 tggcttgatt gaagtgccaa atgaagatgt tcaagtaatg atttcagaac aataccctga 1860 tatacaaagt ctctttactt acacaccatc ctggttttcc actgttgctg atgatttgat 1920 gccggggctt aacttgtacc atgataacct caagtattca aacatatgga aagcattcaa 1980 cttgatgcga caagcattga tccttcataa ctggagtgga actccattag acggcctcca 2040 atttcaatga tcacatagta atacaaagaa gaaaaactta tgctgaatat ctttcaaatt 2100 gaatcatctc tttctcttgt attcagtttc attttttatg tgcgattgat aaaagaaatc 2160 attcttcaaa tttgtttgtc aaaaatttta caatacccaa caaaaaatga atttgagaaa 2220 tattacaaag atttcaattt ttctttaagc tcttccatct gtgccttttt cttcttatct 2280 tttaacatct gaagtttacg aagcaactca tctgcatagt tttcagccaa tgtttctcga 2340 tatacctccc attcttgatc acacatgttg cctcgcatgt cagtaagaag tttgatgcgt 2400 gcaacaatac tttttgcgat ttgacaagac gatccaaatt gggttatctt cgcatcttcc 2460 ggtggaatag gctgatgatc atgtgtataa gtcttggagt gagtactata gtgtctaaca 2520 tctgaagccc aaaccatcac tacaacacca tcacattcat cgaaatcaat ttccacattg 2580 tagtcaggaa atacaaattg agcccctttg acgtagccgt gctttgaggt ccttccacgt 2640 cgatataatt tacctgtgtc acaatgtata aggctaaata atccaaaagt gaaaccagaa 2700 tgatcattat caacatggat catgttggca aactcatcac gtgttacaac aaagtttgac 2760 ccaaagaccc gtccatttgg agatttgaaa aacaacttcg aagaccaaga gggtgtgcga 
2820 catttagcag cccgatgaca attggattca aaggctccaa tagataggga ggcaaattgt 2880 tcagctagaa aatggttgat ggatgggaga tacatctggc ggcgtaggtc gttttccatc 2940 cgggcaacag aatgtgcagt tgttgcattg agagcatagg tacctaaatt aaaacataca 3000 tgaacaacca aaatttttaa aaaccaaaag aaataaatga gaaactatgc acctccaaat 3060 tttgtctttc tgtatccacc acgaaaaccc accacaaaca tctgaccaga tttcatatgc 3120 ttcagacctt cagctgcacc attggtatag cagcgataac gggctgttgc atgctgatac 3180 aaggtcatga ttgatttttt aaaattgtcg aatagtttgg cacaatcagg tgtaagttct 3240 cgcttcagct ttccattatt caaccgcttt cctgtatatt cttgctcgtt gaatcggacg 3300 accgcgagga gtttagttcc atatttatta agtataacta ctgtaccaaa agaaagtgac 3360 aaaagggcct gtccaccatg ctcagttcgg aagacaaagg tagcgtcttt gggaaggcca 3420 taccaatcac atattttttg tttgaattca gctctgatgc gcgattcaac ttgaagggtc 3480 aatcagaaga tggaagttgt ggggtattat gagcaaccca ttgtgcaaca attttactct 3540 tcttgattgc aactcgagtt ttgatctgac tagtattaaa ttttgcttga gcacgtttct 3600 gtgcagggga tctatacttc ttattactga agcggaataa aaaaacatca aaaattattt 3660 aaataatctt ccatgtcaga gatcaacttt ttttgagtgt cacaactctc ccacacttga 3720 acaatgaact agattaggtt tgctcaaggt ttgcgtcaaa ccctacccac aagaatttgg 3780 tgtgtgaaac caacccagat tcttggggag gacaagtgca gttaaaccca aagcagtgac 3840 catagatcat caaggtccaa aaccaacttg atagtggaat ggagacatca ggtagttctg 3900 catcttaact gtgtcaagat agagcatgca agatgaaatt ccatccatta tcaagttggc 3960 ctgtgggagc tctgctatcc tctagttgct gatcatgaca aaagaaagat gaatgaagaa 4020 tgaagaatga agaaatatgc atattggtat ctcacatcat gctgatttga taaataatgt 4080 cgatcgtttt taatgtatcg tttttttgac tggaatcaat ttgtattgat tgtgaagtaa 4140 attggactgt tactagaatt gaagtctaag gaattagaat tgtattgatt atgaagaatt 4200 gaaacaattg tgattagatt tgggtctttt caagacacaa actggtttat agactagaga 4260 aactaaccaa ttactagact ggacttggat ctaaagtgga ttgaatctgg attgaatcta 4320 tcaactgatt gatttttgaa tcatatagac agatcaatag tgatccaaat cgatttgatt 4380 ttgactagaa tcgattagtc ctgactagaa caggatgaat aacgtgtaat ttgaatgtat 4440 tgtttgaatc aattaactga ttgattttga ataaacgaga tggatcgatt gtgagtcaaa 4500 
tcgattagtt ttttcgagta ggattgaata gtttagacta caaaaggatg aatgatgtga 4560 atcgatttca aggtgttgtt tgaaatgatt aagtgattga ttttgaataa aagagatgaa 4620 ttgatagcag ttcaaagcga ttggatttgt tttttttgac tagaattgaa tagtttgaaa 4680 tggatgaagt atgaaagaga ttagcaagac atgattctat cttgtaaaat attgaacagt 4740 ggaagaactt actgaagaga ttcaagcaaa tgctagttga tttgagtagg gaagtataag 4800 atgaaagagc tcaaaagaaa tcgcttacgt ggccccattt ggataagata tgcttatata 4860 taggatatgc ttatatccaa tctcatgcaa ttgggtgcaa tttggtacgt ccaaaacata 4920 tcaatcaatc atgattgctc ctttctttca ttgctctctg ttttgtcaag ggaatatctt 4980 ggatttcaca aattgtaaaa ggtgaatcaa tgtataagga ccttgaaacc acaatacatg 5040 aaacacaatc tggacaaata aattgagtgg tatggacaac tgtcatgatt gggcctcatg 5100 ttcctcagag ctgcgatcat aatccataga agtgaagatc ggctaaagac tttcttgaca 5160 acagtgatca gaaacaaatt gcaaaagaaa attactacac atcaggtgat ttttgagaac 5220 ggagtctcat acggtcttga gtagtttttt gtttccagaa ctcatttggt gcattctcgg 5280 gtgcagtaaa gtctggattt cccccattag caatatgatc ttcaatgaaa tctttttcca 5340 tattttcact atggttttga tcggcttctt gaccaggtgg aaacagcagt cctagtttgt 5400 gacaaatttc atagcaccgg tgctgtttgt taaatctttg aatttcctct tcagctgtgt 5460 tagtctcctg aacccaagca ttgctgtaga ataagataat gaattcagta agattcgtat 5520 atatgtcata cacattaaat ttgctcactt tatatctaaa atgagtggat cagtgagata 5580 ttcaccaaat ccttgaagat cagcaactaa tctctgatga ttgttcttct catatacgta 5640 acaagtgaaa gcagaaaggg ccatatgggt tgaatcacat ggtgatcgta tcttgaaaca 5700 catgttggat gtgtacttct caaatttccc ttcttttaca agggattttt caaaaaacca 5760 aatgtttctt gggtcaagat cttggctcaa agatccagag tagactacaa aggattttac 5820 aacctgcaga aaaatcagaa ttctttcata cacaaccaat cttcaagaga cactcacttt 5880 taagtcttgt ataatactgt acataggaaa tcacccatca agcttctcac aatcgtcttg 5940 aaagcgcttc aagtactttg caacacccat gcaagcttgc attgttcgaa tataaggctc 6000 caaataagtc cctcgaaaca gtttatgatt gatcagctgt tttgcgatca tgttgtatac 6060 aattcctcca gattatccca atgctgcata gcagatccga gtcaagcctc ttcccacttc 6120 cttgtcagtg aaaactttga atctaaatgg aactggaatc catactgttt taccattcct 6180 ggcttgaagg 
acaagcttcc ttccttcaat gagttgtgtc ggcgatatag gatcatggac 6240 ggggagtggt tttgcaaaag ctgttacaca catctgtttg atgtgtgtcg aatcatctgc 6300 atgagaagaa tcagaatgga tgtttgaaaa cataagaaat gacagacaaa aaactcacaa 6360 ctgaaatctt tgacaactgt ctgagatgga gtgtagttcg ctttctcatt caaaaatcca 6420 tccaatacca ttgtacctag tacatcctct gaaaaattgg ttccgtagtt aagttgagtt 6480 ggtgggagat gaaaacaagc tgtactactt ccacggagag gagcagtgga agttgctgat 6540 agtagtgatg gattaataaa ggttggatca ggtagctggg ctgatgagac gttattggag 6600 agtgaggttg agaaattttg aatcaatttg actggcttcg aagacttttc attgtatgtg 6660 tatcaaactg atcactagca aattcgcaat catccaagaa ttgtgaacaa tcttgaataa 6720 cctttgcgtt ctttttgatt ctctttgatg gtttctttgg tgctaatata attgattcta 6780 attcgcttga gtcgtctaac tcagcttcag aggtggatat cttgtgattt cttttctttg 6840 atgctttatt tgatctctta tctttatctg ttttttctaa aataccaaat ctactatcat 6900 agcgcactga gttgaatgaa agttccagca tctttgatcc acctcgcagc aagtcagaga 6960 ggtcatcttt ttgtatttct atagcaggac aaaaattgtt cagatggaag tacttttcat 7020 taaacttgta ttgggtaggg aaatcaggaa gatcattctt tgcttcgtga atttcgggta 7080 gaaaactata ataaatgcta gaagcaactt gatcgtacag ttcacctgta gattcagtca 7140 atttaatcaa taatttgaaa gattttgctt gtgctggtaa gtatttcgtg gttattttcc 7200 ctttaagata catattcatc caagcattga agttgtaatg ggtgaggata ggttgtggtg 7260 aatttgactt tttaggaggt tcagatagcc cttgaagttc taacgatttt gcttgtcaga 7320 atttgttacg tagtatcatt gagtcttgtt tcttggctgt agctgactga tgcagttgaa 7380 atgcatggtt gccagtactt gaagtggatt gctgagcggt tatagaagat gaggggggat 7440 cgcagtcgac cctacaagca ccctcaattt ttttgtccat tgaaagactg ttacagattt 7500 tacaaatcaa agacattgtg tagtgatctg tagggttggt aaaacaatga taatagtgat 7560 tgattcagtc aagctttgaa agtttgttta tgttgtgatg aacctagaga atgcacgata 7620 actagtagaa agtgtaggta tatgatgttc tacatccagg gggatggcat cgcccataca 7680 tacatgttga tgagataaac aaaaacaggt gacctaagtg tcagtggggg tagataggtg 7740 gtaggcatac tagcactata ataaagaaga agatccttat cagtcagtat gtcactagat 7800 gtatcgcttg ccactaaggg aatcaccata ctgacactca aggtttgtgt agtgaaatag 7860 agaattctat tgtgtcagtc 
tgactcatgt agtacacaac ttggtgtacg gctgaatttg 7920 gattgcaaaa aacatgtatg taaatgtagt gctaattatg tgtcagtatg gcactctatg 7980 tattccgtac aataccaata acttttcaac gccactcggc ggggaaaacc ttttctattc 8040 aaacattttt ttcacacttt ttttcgtttt cctgtttcac ccagtattcc tgttcctgtt 8100 agcctgctca gatactcttc ctgtgactac ctggtcctgc aagacttcca tgtctagtag 8160 tgacaatcct gttctcccgt gttccactcg acaacatccc ggagcaacat tgaatcaagt 8220 cgacaatcct gaacggatta tttttcccac ccgccctcaa cagccagctt cacatcagat 8280 tccatctgca acttttattg acagtccgat acgtgcagcc tcagctcctc cagcccctga 8340 tcatactatc cgtggtcctc ccttgataca atcaaagtgg gatcttattg ttaatccccc 8400 caatcctcaa ggatatctgg ttgtgatggc tacttggcct ttaccaggca tctcagtatc 8460 attggatcag ctctctgaca atttggttcc agcagcttcc caaccttcgt taccgcgtcc 8520 taaaccgcga cgtagtgttc ctggttcctt cgatcctcag ttatccattc tcgcatctga 8580 acctagctcg acaactcagg tcatggacca acaatcttct tcattaactc aggatgtcaa 8640 cgttgctcgt acaaaagccg agaatgagca actttgtcag gacaacgtga gtctgcgcgc 8700 ggagatgggc gacatttgtc gactcctaga aaaccttttg gctcaacaga agccaccggt 8760 gggtgacgtg acgggcgggt cttctgggga cgcaacatca actcaacgac caccacacgt 8820 tgacccttcc ttagatcatg gctctcaaac tgcccagatt tttactttga caccggtggt 8880 caataatcaa ggaaaccttt tccatgtacc agcttctctg ccttcagtcc ctgaaacggc 8940 ccacattcct atgatttcac atgtcgactt gtcaaagttt cgggcaagcg attggcctca 9000 gtataagggc gcctttggtg atgtggctgc gtttaggtta tggcaatacc aaatggaatc 9060 catcttttgg gtcaagcaaa ttgatcaggc tgaagaccgc ttctgaatct tacctttggt 9120 gatagcgaac aatccagctt cttcttggtg ttgacgatca gagaggacct ttgtgggtaa 9180 aacgtgggag caggtgatgg tcaagatgca gagtgttgtt ttaccggtag gatgggatga 9240 agcggccaaa gaaaaattaa gggaactcac aatgaagccc aatgagtctt gcactgctta 9300 ttgtgggaga gcacgtttaa ttcaagaaga gatcggcgtg gaagggtgtc cagatgagac 9360 attggcgtat gtggtggtag gtggtatggg tggtacattt aaagcgtggg caaagatgga 9420 gaaggtggtt cagaagagtc ttgatgcgac aggaaggttt tcgttcccaa tcttcgaaga 9480 gcgtgtaggg agtatctggt tattaactca agccatggat gcattgggtt ctaatcaatc 9540 acaattgtca gaaggtcgta attcatcgac 
ttctgctccg gcgccgaaca tccagaacca 9600 gtttcagtca aataacggca ctcaaccttt ttttcgtccg gccttgtctt ctgaagaaat 9660 catggcacgc aatgtttgtt ttggggcata tatgtgtttg attgggttgt gtccaaggtg 9720 caaaagtcct tgtgataagt ggctaggtgg ttgtacggca aaagcaaata ctgcattctt 9780 ttcggttcca atcgagttcc cgcgagttgt gccttatccg cctccaaaat caatctcggg 9840 taatgttgtt gggcgttcaa cttcaacgat aggtacacca agactggttc ctcgtaaggt 9900 ggatgtagca gcagttgaaa gcgggggtag tgctgttgtt gcggatgtgg gtgacttccc 9960 ggatcttggt agagtagatc tggcagctta tgatgcgttg gtggccaagt tgaataaacc 10020 aactgatgaa ggtgaaatgg tggggtctga cgagtacgca cctcttctca atcctgttga 10080 gcgatctgac gatggaaaaa cactgattgt tcattggttg gtcaatggaa tcgtcttctg 10140 tgttttaatt gacacgggag cagggacttg cctgatgtcg gaacgtacgg ctgagacatt 10200 acgcgtggtt tgacgacctc ttccggtact gcttgcagtg cttccagcca ttcaatctga 10260 gcctgagtct ttcattctga aggaatatgc gtttgctaac ctcaagactc cacatctggc 10320 ttataatttt ggggtcacag ctttcaaaat cacacctttg ggaggagatt acgatttaat 10380 tttgggcgct ccatttttgt ccaaacacaa attagacgta tctatttctc agagactgct 10440 ccgcaatgaa aaaaacagct atcgttttag ggaacaaact aagattgatg aaatgaatgc 10500 agcaaaagat ctgaagttga gacaagaagc cttatttaag acagtgttgg ataatgttga 10560 gaaagttggt gaagtacatg agttttcaat gaaggaagta gcaatgttga atgagtttgc 10620 cgatttattc cctgaggagc tgccagcgtt ggatcaggat gttgaggatg aagatgctga 10680 ggtgtttcct tgtaacattc agaatgaagc cttgaagatt cgacatagga tcatactcac 10740 taacccagat gttgtcataa acgaaaaaca gtttggttat ccgaggaaac atttgaatgc 10800 ttgggccaag ttgattaatc agcacgtggc ggcggggaga ttgagacgat ctacaagcca 10860 gtatgcctca ccttcgatga ttataccaaa gaaggaccca acggcactac ctcggttgat 10920 ttgtgattat cgtaccttga acaaacacac agtcaaggac cgtggacccc ttcctaaccc 10980 ggatgaggta gtacgtccgg tgggcaccgg aaagatttac tctagtctgg atcaaatcaa 11040 tgcttttttt cagacacgga tggaagaaaa ggacatacct ttgactgcaa tcaaaacacc 11100 ttggggattg tacgagtggg tcgtgatgcc aatgggattg acaagtgctc ccgcgaccca 11160 gcagagacgt tgtgaagagg cgttgggtga tttgggcaat tgtgtgtgtg tggtgtacat 11220 tgatgatata gtgatttttt 
tgcaaacaat tgaagaacat gaacagcatt taagagaagt 11280 tttacaacga ctacaagcgg ccaaccttta ctgtgggatc aagaatacaa aattgttttg 11340 tagacgtatt cagttcttgg gtcatatcat cagtgaggac ggtatcagcg cggatgatga 11400 aaaggtggag aaagtggcga attggagaac ccccaaatca gcaaaacacc tcaaggaatt 11460 tcttggaacc gttcaatgga tgaagaaatt tattgacaga ttagctcatc atgtgtgcaa 11520 attgacgcca ttaaccagta caaaacaaaa aacagaagag ttcaagtggg gtacggatga 11580 agaaacagca tttcagaaca taaaacggat gattactaca ttgccagtct taaggaatct 11640 tgattacgat tcagatgagc ctatctggtt gtttacggat gctagtgcgc agggtttggg 11700 ggcggcactg tttcagggag aagactggga gacggctttg ccaattgctt atgacggacg 11760 tacaatgaca aatgctgaac gcaattatcc ggttcatgag caagagttat tggcagtggt 11820 tcatgcttta aataagtggt ggttattact gttgggattg aaatttaacg tgatgacgga 11880 ccaccattca ctgacgtatt tattgtctca gcaaaatctc agttgcagac aggcccgttg 11940 gctagaaacg ttatcacaat tcgaccttaa tttttgctat ttgcagggtc ctgataattc 12000 ggtagcggac gctttgtcaa gagtagatat ggcggcaatc gttagatcac atcctgagct 12060 tggagctgaa gtcaaacaga agatcatagg gggatatatg gaggattctt tttgtcagaa 12120 acttacaaaa gttctgcctt tgagggataa ctgcactatg agtgacgggt ttatgttgat 12180 tgataaccga gtcgtaattc ctaacgttga taccttacga gaagatctca tcaaatcggc 12240 acatcaggcg gtcagtcact taggaagttt gaaaacagct ccttacttac aatccgaatt 12300 cttctggccc ggattgacgg acaatgttga taaatggacg tcgaagtgcg atgattgtca 12360 gaggcataag gctcgtacaa cactgttacc tggccgagca cagagcacca acttacctag 12420 aagaccgatg agtagtgtag cattggattt tgtaggcccg ttccccaaag tatctggcta 12480 tgacatgatt ctttcttgca cttgtcgatt gacaagcttt gtgcgtttga tcccgacaaa 12540 tcaaaaagat tcagctgaac ggacggcacg ccgcttatat tcgtcttggc tttcagtcct 12600 tggcgcactg aacgatatgg ttggagattg ggataaactg tggacttcgc ggttttggca 12660 ggagcttcac aaattgctgg gaatcaagat caatttagcc accgcgtacc atccgcaggc 12720 agacggacgc tctgagagaa caaacaaaac catgggtcaa gtacttcgct tttcaacgca 12780 aggtcaacaa ggtcgttggc tagcggcact accttcggtg gaatacacca tcaattcagc 12840 tgtgaatgta gtgacaggag tcttgcccat gcgtttcgtc ttgggtcatc agccgaggtt 12900 
atttccagtg gcgtctgaag ttgatatggt tggtagagat gtagagaagt gggtgtcttt 12960 gcggcagaat gattgggagc attggagaga taagttgtgg gttttgaggg ttgaacaagc 13020 attgcaatac aacaaacgca ggggtggtga tcttaagctt caggtgggag atttggtatt 13080 ggtagaaagt tcaaacagag cagcggtcgt aggtggcaag gtggcaaagt taagagcaag 13140 atacgatggg ccatacaaga ttcttcgagt tctgaatgat ggtcaagatg gtcaacttca 13200 attgtcggca aacaacaaat cacatgacat ttttcacatc tcaaagctca agatttattg 13260 gacggacgcg tcagaggagg ctcctgaggg gcaagagact actgggtgca agtaagttct 13320 ctccttagta tgcaccgccg cgggtttcta ccccttgtcg ttttttcaca acaacacctt 13380 ggccacgtct gtgagcacga aaccttctct gtgttttacc aggactggcg cggcgacctc 13440 aactcgacgg agaatctcat ggattttggt cgtggttgtg gttatttctt ctgtttgctg 13500 tttcattttc attttcttcc ttttttttct ttctttctct tcttctcttc aaaaaaaaaa 13560 attattaggg tgcttggatt ttatgaagga ttttgtttta aatttctttt ttttctttct 13620 ttcatttcta tattttatga attacaattt ttccggttag aaagtagctt cattgttagg 13680 tgggggggg 13689 // ID Gypsy-8_CCO-LTR repbase; DNA; FNG; 436 BP. XX AC AACS02000003; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_CCO_; KW Gypsy-8_CCO-I; Gypsy-8_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-436 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000003; Positions 840702 840267. 
XX SQ Sequence 436 BP; 80 A; 120 C; 106 G; 130 T; 0 other; tgtaagggga ccagacgctt tctctgactt ctcctgtatc accggcccct atgtggagta 60 aggtaagcgt gattacgaga atcctacctg cagattctcg atggcttgga cgatgtgcgt 120 catacttcca tttcgggaag tagtttccca cgacccctcc aaaacctttc cgacacagct 180 tgctccactg atgtgtctca gatgctccac agcatgggtg gctgcatcaa ctgtatagtt 240 tagtcccacg gtctctagta gtcgatcatc gtcgctccgc actgacaggt ttccttaggc 300 ttttgtcttt ggattctccc ctgtcaggta ggcccttgta gttgtcgttg tttctcagag 360 gctcaataga ctttggattc tcccctgtca gacccgtgtg tgagtgtcca ccggtgcggc 420 tagggttgcc cttaca 436 // ID Tad1 repbase; DNA; FNG; 7029 BP. XX AC L25662; XX DT 26-APR-2005 (Rel. 10.04, Created) DT 03-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Neurospora crassa Tad1-1 LINE-like element ORF1 and ORF2. XX KW Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; TAD1_NC; Tad1. XX NM TAD1_NC. XX OS Neurospora crassa OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Sordariaceae; OC Neurospora. XX RN [1] RP 1-7029 RA Cambareri E.B., Helber J. and Kinsey J.A.; RT "Tad1-1, an active LINE-like element of Neurospora crassa."; RL Mol Gen Genet 242(6), 658-665 (1994). XX DR Genbank; L25662; Positions 1 7029. 
XX FH Key Location/Qualifiers FT CDS 158..1618 FT /product="Tad1_1p" FT /translation="MSCDERSDGVVSSLPMAIDGGARGLENSVHNPLAEIS FT ACNDPEAALTAKTSDKTEEDDETLKKFIARANKSTDYDTVSTEDRRWAREL FT LLEFAKARVGKSGSTRPEADKAGTDQFVTRKDFDDLKKDILKEIKKGAQEA FT PKRWADVVAKASPSKIGATLQQPPTTTKKFVPSNLERQLTIKGATIAAEFV FT NRSNEDTKTTLATCLGKKKPGLIVRAATRMPTTGDYVIVFDEPTRTWCWRN FT QAWAKEVFGPDAFITMSTVGVLVRGVPWDSVDNYTTAEAISNVAKERNPEA FT SIIRVRPWKRRDGESRGLLLVEVATASAACFLQDNLFLWDGGAYPCEPFQA FT SSNVQQCFRCWGIGHTARFCRQDDICARCGEAKHEGDRFGEVNCPSNDDKS FT LVYCKPCGKKGHCAYNRKECPILRKAIAKASVAHAERPRAFAPARTQPERC FT WSRRQRLWTRVSCQTPPPAPVLLLPWRDMLEEAQRGDGRER*" FT CDS 1812..5276 FT /product="Tad1_2p" FT /translation="MVQLKILYWNVGKSYERGKLLLEQEETYDIVAIQEPG FT RNLNGDIYRPRGGRYFGVDCSEKGRAVIYVNKKWNLKDLDFQAGKDWTAVT FT FKNLRDPTTVYSIYSPILTQGTPEHQWGSPLLEFIEAGPPAGNLVAVGDLN FT LHHPDWDLENRTSPGAARLLTWARRWRLSLLTPRGEPTRLGNATRGERDGT FT IDHAWLSPGVEAEYWGAQRCEQTGSDHCPQEIWVQVGRKNKKAEEEGGYSW FT SLMDTEKVKEASEKRISVPERIDTVEDLELAFRRLDGELTSIAEEFVPHRK FT KGRGRRANWWTLECKTATKEARRTYWEWFGCRTASNWREFVEATMKKKRII FT AKAQQATWRDGVAEASKNPSRLWPLERWARTRSHRPADPPKLPALQTQEGA FT AEIGDHSGKAEILADRFFPKVEVPTDHLPDWTEPPEGVDPIVISELVDEED FT VQDVLSRMAPNKAPGIDWYSNRFLKLCGQPFRQAMACLASASLRLGHFPQR FT FKDAKVVVLKKPGKSAAQQKLAGAWRPISLLSNVGKILEALVAKRLTQAAE FT EFNLLPEGQMGNRAGRSTEFAVRVVTDAVHTAWKLGAVTSLMLLDLKGAFD FT RVNHRWLLHTLWEMQLPTWIIRWVASFVAGRRGSLFFDDETSRPYAITAGV FT PQGSPLSPILFILFITPLYRKLATIPNTITVGFADDTNVVAVARTTEENCR FT TLQAAWEVCSGWAGARGMEFEATKTELMHFTRTRAPRTETLQLGDTVLQPT FT ESTRFLGVWLDRKLNYRAHAEAVKQKMTTQTNALTRLAAKTWGCSFARARE FT IYTKCIRSAIAYGASAFHQPTEVYGKPRGIVIGLAKYQTKCLRVVAGAYKA FT TPVRNLETETFCPPLDLYLNKRVRAFEERLARTDQARFVRGTAAWVASCVQ FT ARRARGRPPADKPESGAAKARWAEAWAPTEKLLAMAAGPPPESGTPPPASQ FT IPEPTAADREEWLEDATNREWVSRYNHHLAAAVAKSGQRYEFTIADLNPKF FT SGEAIQRHMVGTKKKKGRRRKPEEEEDEPYPLTKAKSSLLVQARTEVIGLR FT AFLHRRKVPDVLTPICACGLEKETFAHIVLNCLDAETLFWSDPDSWPLSKE FT ELQEYLDDGVKAERILSWALRLGRLKEYRLAVELEEDQREELARHS*" XX SQ Sequence 7029 BP; 1816 A; 1864 C; 2204 G; 1145 T; 0 other; caaattgcct actaattatg 
aacgttttct ctatactatt aattataata aaagaaactg 60 ctttgaaacc ggaaacgccg atttctggtc gaggagtgct ccaaaacgaa aactgacaac 120 ggcaatcgac tcaggagggt cgtttacgtc cctcacgatg tcctgcgacg aacgcagcga 180 cggcgttgtg tcgtcgttgc cgatggccat cgatggcggt gctcgtggac tcgaaaacag 240 cgtccacaac ccgctcgcgg aaatctcagc atgcaatgat cccgaagccg cactcactgc 300 gaagacctcg gataaaacgg aagaagacga tgaaacattg aagaagttca tcgccagagc 360 gaacaagtct acggactacg ataccgtctc gaccgaggac cgtcgttggg ctcgagagct 420 gcttctcgag ttcgcgaagg cacgcgtcgg aaaaagtggg tctacgcgac ccgaggctga 480 caaggctgga accgaccagt ttgtcaccag gaaggatttc gacgacctga agaaggatat 540 cttgaaggag atcaagaaag gcgcccaaga agcgccgaag cgttgggccg atgttgtggc 600 gaaagccagc ccgagcaaga tcggtgccac cctccaacaa ccacccacca ccactaagaa 660 gtttgtcccc tccaacctcg agcggcagct cactattaag ggagcaacga ttgccgccga 720 gttcgtaaac cgttcgaacg aagacacgaa gacgaccctg gccacttgcc tgggcaagaa 780 gaaacctggc ctgatagtcc gcgcggctac caggatgcca acgaccggag actacgtgat 840 cgtattcgac gagcctacac gtacctggtg ctggcgaaac caagcgtggg ctaaagaagt 900 tttcggcccc gatgctttca ttaccatgag caccgtgggc gtgctggttc ggggcgtgcc 960 ctgggacagc gtcgataact acactacagc ggaagctatc tcgaacgtag caaaggagag 1020 gaacccggag gcatctatta ttagagttcg gccatggaag agaagggacg gagagagcag 1080 aggcctgctc cttgtggaag tcgccaccgc atcggcggcg tgcttcctcc aggacaacct 1140 ctttctctgg gacggcggcg catacccatg cgaaccgttc caagcctcga gcaacgtcca 1200 gcaatgcttt agatgttggg gaatcggaca cacggcaaga ttctgccgtc aagatgatat 1260 ctgcgcccgc tgcggcgaag ccaaacacga aggagaccgc ttcggcgaag tgaactgtcc 1320 gtccaacgac gacaagtcac tcgtctactg caagccgtgt gggaagaagg ggcattgcgc 1380 gtataaccgc aaggagtgcc ctattctccg gaaggcgatt gcaaaggcgt ctgtcgcgca 1440 cgccgagagg ccgcgggcgt tcgcccccgc aagaacccag ccggaaaggt gttggagccg 1500 acgacagaga ctgtggacga gggtgtcctg ccagacgccc cccccagctc ccgtactgtt 1560 gctgccttgg agagacatgt tggaggaggc gcaaagaggg gacggccgag aaaggtaatc 1620 ggcccgtcca cccggccgaa gcctatgacc ccacaaggtc cgatagatgg atactataca 1680 cgctcgcaga gcgtcggggg aagtcggccg cgtggcggca 
atggcgccca cacgaccaca 1740 gcccccccca gggaagcagt acccccatcc tccgccgagg gatcagaagg aaccccatcc 1800 cagggtaact aatggtccag ctaaaaatac tctattggaa cgtggggaag tcgtatgaga 1860 ggggaaaact tctactcgag caagaggaga cctacgatat cgtagcgatc caggagccgg 1920 gtagaaatct gaatggggat atctatcgtc ctagaggcgg aaggtatttc ggggttgatt 1980 gttcggagaa gggaagggcg gtaatatatg tgaataagaa atggaatttg aaggacctag 2040 atttccaagc cggaaaggat tggaccgcag ttacgttcaa gaacttgagg gacccaacga 2100 ccgtctattc tatctactct cctatcctta cacagggaac gccggaacat cagtgggggt 2160 caccgctcct cgaattcatc gaggccgggc ctccggcggg aaacctagtc gctgtcgggg 2220 acctcaacct ccaccatccc gattgggatc tggagaacag gaccagccca ggggccgctc 2280 gcctccttac atgggcacga cgctggagac tcagtctact caccccccgt ggtgaaccga 2340 ctagactcgg gaacgcgaca cgaggagaga gggatggaac gatcgaccat gcatggctat 2400 cacccggggt cgaagccgag tactggggag cgcagcggtg cgagcaaacc ggctcggacc 2460 actgccctca ggagatctgg gtccaagtag gaagaaaaaa taaaaaagcc gaagaagagg 2520 gaggatatag ctggagtctt atggacacgg agaaggtgaa ggaggcgtcg gagaaacgga 2580 tatcggtccc agaaaggatc gacacagtgg aggaccttga gctcgccttt cgcaggctcg 2640 acggagaact tacaagcatc gcagaggaat tcgtcccgca ccggaagaaa ggtcgaggga 2700 gaagagcgaa ctggtggacg ttagaatgta agacggctac gaaagaagcg cggaggacgt 2760 actgggagtg gttcgggtgc cggacagcat caaactggag ggagtttgtc gaagctacga 2820 tgaaaaagaa gaggattatt gcgaaggccc agcaggctac gtggcgcgat ggcgtcgccg 2880 aggcctcgaa gaacccaagt cgactctggc cactagaacg atgggcccgg acccgaagcc 2940 accgaccagc agatccgccc aaactaccag cccttcaaac ccaagaaggg gcagcagaaa 3000 tcggagacca cagtggcaag gccgagatcc tagccgaccg ctttttcccg aaggtagagg 3060 tcccaacgga ccacctgccg gactggaccg agccaccaga gggcgttgac ccgatagtca 3120 tcagcgaact agtcgacgag gaggacgtcc aggacgtgct ctcccggatg gcgcctaata 3180 aggcccccgg gatcgactgg tactcaaaca ggtttctgaa gctatgtggg cagccgttcc 3240 ggcaggccat ggcatgctta gcatccgcca gcctccggct cggccacttc ccacagcggt 3300 tcaaggacgc gaaagtggtt gttctcaaga agcccggaaa gagcgccgcc caacaaaagt 3360 tagcgggcgc atggagacca atatcgctac ttagcaatgt aggaaaaatc 
ctcgaggcac 3420 ttgtggctaa gagacttacg caggcagccg aagagttcaa cctcctgcca gagggccaaa 3480 tgggcaaccg ggcaggccgt tcgacagagt ttgcagtccg ggtggtcacc gatgcggtgc 3540 atacggcctg gaaactcggg gccgtgacgt cccttatgct cctagatttg aaaggagcgt 3600 tcgatagagt taaccatagg tggcttttgc ataccctatg ggaaatgcag ctcccaacat 3660 ggattattag gtgggtagcg agcttcgtcg ctggtagaag aggttccctc ttcttcgacg 3720 acgaaacgtc ccgcccctat gcgattaccg ccggcgtacc gcaaggctcc cccctctccc 3780 ccattctgtt cattctcttc atcaccccgt tataccggaa gttggcaaca atcccgaata 3840 caattacggt cggcttcgca gacgacacca acgtcgtagc ggtagcccgt actactgaag 3900 agaactgccg gacactacaa gcagcgtggg aggtgtgctc gggatgggcc ggggcgagag 3960 gcatggagtt cgaagcaact aagacggagc taatgcattt cacgcggacg agggcaccaa 4020 gaacggaaac tctccaactc ggggacaccg tacttcaacc gacagagtcg acgcgattcc 4080 taggggtctg gctggaccgt aagctcaatt atagggctca cgcggaagcg gtgaaacaga 4140 agatgacgac ccagacgaat gcacttacaa gactagcagc gaagacatgg ggttgctcct 4200 ttgctcgggc aagggaaata tacactaaat gtattcgcag cgctattgca tatggagcct 4260 cagcattcca ccagccgacc gaagtctacg ggaaaccacg agggatcgta attgggctgg 4320 cgaaatacca gaccaagtgc ttgagagttg tcgccggggc gtacaaggct acgccggtac 4380 gaaacctaga gacggaaacc ttctgtccgc cgctcgacct ctacctcaac aaaagggtcc 4440 gggccttcga ggaaaggctc gcccggacgg atcaagccag gttcgttcga gggaccgcag 4500 cctgggttgc cagctgcgtc caagcccgaa gggctagagg tcggccaccc gccgacaaac 4560 cagaaagcgg tgcggccaag gctcggtggg ccgaggcatg ggcaccgacg gagaagttgc 4620 tggcgatggc agccggtccc cctccggagt cagggacccc gccaccagca tcgcagatac 4680 cggaacctac ggccgccgac cgagaggagt ggctggagga tgcaacgaat cgagaatggg 4740 tgtcgcgcta caaccaccac ctagcggcgg cggtagccaa atcaggccag cggtacgaat 4800 tcaccatcgc tgacctcaac cccaagttta gcggagaggc tatccaacga catatggtcg 4860 gtacgaagaa gaagaaagga agacgtcgaa aaccggagga agaagaagat gagccgtacc 4920 cactaaccaa agcgaagagc tccctcctag tccaggccag aacggaagtc atcggcctcc 4980 gggcattcct ccaccggagg aaagtaccgg acgtcttgac gcccatctgc gcctgcgggc 5040 tcgagaaaga gaccttcgcc catattgtcc taaattgcct cgatgcggag actcttttct 
5100 ggtcggaccc cgacagctgg ccactatcaa aggaggaact ccaggaatac ctggacgacg 5160 gtgtcaaagc ggaaaggatt ctgtcttggg cgttgaggtt agggaggctg aaggaatatc 5220 gtttagcagt cgaactagag gaggatcagc gagaggaact cgctaggcac tcctaggagg 5280 gaacggaaca tcaacgacac tagaacaacc ggaaagggaa cgaggaggag atgattgtgc 5340 taactacgtg tgctacctaa accgcaagcc acagcaggga ccgaagcacc atctaacgaa 5400 gcgcctagtt gccgctaaga agctgctcga gcagcgccaa catgcgagcc tggttcgcgg 5460 tgacctcacg gcgccactga atccagtcct cgttcatgac ggggcggccg ggagtatcga 5520 cggccatagg gctgccagga gcggaaccac caccaccacc accagggaca gcacgggcag 5580 gagtagcata accgctaccg ccggacacga cacgcgcagg agtgatagcg ccaccgctgg 5640 aaccagaagc aatccggacc ggattcatgg caccgccgcc ggagggggcg cggccgtgac 5700 ggacatccct cacgatcaac tcgcggtagc ccttctgctg gaggtcggcc acgttgagat 5760 gcgggggacg gggaagagtg ccaaggttgg tgggcaggcc atcccgctcc agacgcgcgt 5820 gcatacggtc gatgaaagag ttggcagcac gaacgaagga atccagtctc cgattgaacc 5880 caggctggcc acccgaaagc accgcagcag ctgccgccta gacagcgttg cgggcgtcgc 5940 aaaggccagc ggggacgtcc agcatgtact tgccggcggt gtgggagcag gtcgcgttct 6000 gggtgccagc agctagatgc agacgttaga tggcgacgaa ggaaggagaa agaaagggag 6060 gaacctaccg gacgccgaga tgtggcagac ccgcttgacg acagcctcgc ggaatgcgac 6120 gacagaggca ggcttatcgc cggtcctcgg gaaagcagcg aggtccaagg tgtggcagag 6180 gcaagggatc tcacggacga cactagccgg gttaacgcac aactccgaaa agcgagggga 6240 ggaaacatac cggttctggg gaaccacgag gtcaggaagg gcgcgaagga ggcgagcgac 6300 ggccgcaggc tccaagggag ccggggacgc agggcccggg gacggatcgg gggaggagac 6360 ctccacgtcg gaggtcagat cgatcgggct gggacgatgg cggccggagg aagccatctt 6420 gaaggaggtt gcgatgggga gaaggaagga ggaagccagg aacgggcgcc gcagggagga 6480 ttatatagac gaggaaggag gcccttgcgc cgttgcaaat ggaggtggag ccgcggggta 6540 gagagtgaag ggacggggcg gaggtaaaag caaacgggga gtgaaggcgg ggaagtgggg 6600 agacagacgg gaggctggga taagggggcg gagagaaggg aaaggaagga gcggtggaag 6660 acgggacgga gaagcggcga gggcggcgct cggggctcga atcctgttcc tattatccgt 6720 attgttctag tgttgggatg acgcagcggt ggaagagacg agatcggttc gctgtggcgg 6780 
tgggtgagaa ccgtcatagc tgtaagcaaa gtaggagcgg cgtggagaac tacgaccgag 6840 atggagctgg gggttttcat ccgacggtgc ttaacaccgt acggttttcc cctccagaaa 6900 actggagaag gaccccatgc caataaggcg ctatagttat agcataggga ataagactat 6960 taattaatta attattaatt ataatatcgc cgacttttaa taaaagggtt actataaatt 7020 aattataag 7029 // ID Gypsy-105_MLP-LTR repbase; DNA; FNG; 184 BP. XX AC AECX01000506; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-105_MLP_; KW Gypsy-105_MLP-I; Gypsy-105_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-184 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000506; Positions 209609 209792. XX SQ Sequence 184 BP; 43 A; 51 C; 40 G; 50 T; 0 other; tgttatgatc ctacagttgt aacttacgtg acacttgtac ttacatgtca cagaggggga 60 tgatgatctg gaggacacgg cctgtatggc ttgagtcctc atccgacaat ctaactacat 120 taccggaatc cccttggacc ctgttcctct cccttgccga caagcccctg tgacggtcat 180 aaca 184 // ID Copia-29_MLP-I repbase; DNA; FNG; 5228 BP. XX AC AECX01003109; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-29_MLP_; KW Copia-29_MLP-LTR; Copia-29_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5228 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). 
XX DR Genome; AECX01003109; Positions 301 5528. XX CC Positions [1910-2434] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 59..4546 FT /product="Copia-29_MLP-I_1p" FT /translation="MSAEDADPFTDDDSQDNSTVTEGSSTSSSADSSGTAR FT QDLSNPLSSDVLSDPLNLKDSVSNKMSDHAESRVFGDQIQKYSQIMTNALS FT KYKVSDDLTDENYVEWSQSLMEVFRSLEFHHYFKKENFRKATLSDEEHEKT FT LFNLTMYILQRLDTVNKVRTRNHLTDPKDASEIIYDPFKCWNFLKSYHNRI FT SEDKLEAVTRALYACQITKADTLTSFIDKFENLVREFYRLKGELSDIQSAR FT MLLGAIPSLSAETKDYIHNTVVPLTREGVGTYLRKYEERHGWTCAAIREVN FT AVSVRPSKKTSGECTPDECVGPHLAKNCWSKPENADKRLAFLAKIRAKSGG FT GNSSSNSTSLTSQTTSVKGVKKVFDASVNNASASMAFLSLNAEFDDVFSND FT SKETIYSASPSEFEEDPVISASVTAALSSTSRDWALHDTGATHHMFKDKKF FT FVQSSLINIDDANKRLKLAGGDVSLAVHSRGVARLLAGDETPFELKNCLYV FT PELSRNLISGATLKKQGVRELFHDSTNFSLVLNGLALFNGFISDNNLMFIL FT LEPVSGPHSASVSASSTEITASLQHRRLGHVSSKYLKLMAKLESVEDFEYI FT DENLNCDICSLSKNTKIPHNKTRPRARTFLENVHVDLSGIIRTTGFNNENY FT FILFCDDYSAYRHIYPLSDKTKEEVYEAFMAYIAVAERQTGCSLKQFTLDR FT GSEFLNSLLGPKLKELGITLHLTSGHAPEENGVSERGMRIVNTRARSMMLQ FT SALPIRFWYFACSAAVFLTNRCVTAALEGGKTPFEMWFYRKPSIHHLRVFG FT CQAFGLIRKELRQSKFSPVSSEGVLVGFEQDNFNYQIYDLSSRKIYITHHS FT TFNEEVFPFKSIQSVSPSIPNEPDRTSVKIRFFDDEDEDFTEPVTPITSPG FT EIHEESTPSAPVTSNLPTSNPTQPRRSERSSKKVYDTLRGMCTSGAEFDDD FT WITESFFACVAQDCFSAPDPKSYKRAMDSPEAKLWKAACEKEFASLRNKEV FT WELVDHPKDRNVIRGLWVFRKKSLVDGSIKYKARFVAMGNTQVPGQDYGET FT FAPTGKPSSLRLLIAIAAINGWEIHQMDAVTAFLNGILHEELYVEIPEGYR FT TASTVGKVWRMKKSLYGFKQSPKIWQDDVEAFLIQIGFYQCEIDHCIYIRS FT VGKLFTAVYVHVDDLAITGNDIARFKVEISAKWEMEDLGLANTVVGIQIQR FT IDDHSYSMSQHQYALTVLKRFEMTNSKPAITPLSPNVKILKSTEEEVQEFA FT KTKLPYRSVVGSLMYLAQCTRPDMAHAVGVLSQHLEQPNKQHWDSAMHVLR FT YLCHTINVGIVYSGNYPKIISGQRSFECPISHCDADWAGDVNTRRSTTGYV FT FVLAGGPISWRSRLQPTVALSSTEAEYRAITEAGQELLWLRNMMSRFGFID FT PNPTVLQSDNLGAIHLTSKSIFHARTKHIEIHYHWIREVVKKGDLSIKHCP FT THLMVADLLTKQLPKEQFSNLRKQLGLRFLG" XX SQ Sequence 5228 BP; 1458 A; 1124 C; 1084 G; 1562 T; 0 other; ttcgagaact ttatatggta gcgagagtgc aaacgatcta ttcgaccaga ttgcttgtat 60 gagtgccgaa gacgccgatc 
ctttcaccga cgacgattct caggacaact caaccgtaac 120 cgaaggctca agcacttcaa gttccgccga ttcatcagga acggctcgcc aggatttatc 180 aaatcctctg tcctctgacg ttctttcaga tcccttaaat ctcaaggatt ctgtatcaaa 240 caaaatgtcc gatcatgccg aatctcgagt gttcggtgat caaattcaaa aatactccca 300 aatcatgacg aatgcattaa gcaagtacaa agtgtctgat gatttgactg acgaaaacta 360 cgttgagtgg agtcaatctt tgatggaagt gtttcgatca cttgaatttc atcattattt 420 caaaaaagaa aattttcgaa aagcaactct gtccgacgag gaacacgaga aaacactttt 480 taacttgacc atgtatatct tacaaagatt agacactgtc aacaaagtca ggacccgtaa 540 tcaccttacc gaccccaagg atgcttctga aatcatttat gatcctttca aatgttggaa 600 ttttctcaag agctatcaca accgaatctc tgaagataaa ttagaggctg taactagagc 660 tctttatgct tgccagatta ctaaggcaga cactctcacc tcgttcatcg acaagtttga 720 aaatctcgtt cgcgaatttt atcgcctgaa gggcgaactc tctgatattc aatcagcgag 780 gatgcttctt ggagcgattc cctctttgtc cgctgaaact aaggattaca tacacaacac 840 ggtagtccct ttaactcgtg aaggcgttgg cacctatcta cgtaaatatg aagaacgcca 900 cggttggact tgtgcggcaa ttagagaagt gaatgctgtt tctgttcgtc cttcgaagaa 960 aacctccggt gaatgtactc cggatgagtg cgttggacct catctcgcca agaattgttg 1020 gtccaaaccc gaaaacgccg ataaacgact tgcttttctg gccaagattc gtgcgaaatc 1080 agggggtgga aattctagtt caaactcaac gagtcttact tcccagacaa cgtcagtgaa 1140 aggtgtgaag aaagttttcg acgctagtgt aaataatgca tccgccagca tggcttttct 1200 ttcactcaat gccgaattcg atgatgtgtt ttcaaatgat tcaaaagaaa cgatctactc 1260 tgcttctcca tcagaattcg aggaagatcc agttatcagc gcttcggtca ctgctgctct 1320 ttcctccacc tctcgagatt gggcccttca tgatactggt gctacgcacc acatgttcaa 1380 agacaagaaa ttctttgttc agtcctctct catcaacatc gacgatgcca acaaacgtct 1440 taaactagct ggtggagatg tctccttggc agtccatagc cgaggtgtag ctaggttatt 1500 agctggagat gaaactcctt ttgaactcaa aaactgtctc tacgttcctg agctttctcg 1560 aaacctgatc tctggagcga cgttgaaaaa acaaggagtg agggaattat ttcacgatag 1620 caccaacttc tcactagttc tgaatggtct cgctttattc aacggattta tttcggataa 1680 caacctcatg ttcattctac tcgaacctgt gagtggtcca cactcagcct cagtctccgc 1740 atcaagtact gaaataaccg cttcacttca acatcgtcgt 
ttaggacatg tcagcagcaa 1800 gtacctcaaa ctgatggcga aacttgagag tgtggaggat tttgaataca tagatgaaaa 1860 tttaaattgt gatatttgtt ctctgtctaa aaatacaaaa attcctcaca acaaaactag 1920 acctcgtgca cgtaccttcc tagaaaatgt ccatgtcgat ttgagtggta tcatcagaac 1980 taccggtttc aacaatgaaa attactttat tttgttttgc gacgattatt ctgcataccg 2040 acatatctac cctctaagtg acaaaacaaa agaagaagtg tatgaggctt ttatggcgta 2100 tatcgctgtt gccgaaaggc agactggttg ctctctgaag caattcacac tcgatagggg 2160 tagtgaattt ctcaacagtc tacttggtcc caaactcaaa gagttgggta tcacccttca 2220 cctgacttct ggacatgcgc cagaagaaaa tggtgtttcc gaacgtggta tgcgcattgt 2280 caataccagg gcccgttcga tgatgcttca atcagcatta ccaattcgtt tttggtattt 2340 tgcctgcagc gcggctgttt ttctgacaaa tcgctgtgtc actgctgctc ttgaaggagg 2400 caaaacaccc ttcgagatgt ggttttatcg aaaaccttca atccatcatc ttcgagtttt 2460 tggctgccaa gcttttggtc tcattagaaa agagcttcgt caatctaaat tttctccggt 2520 cagctctgaa ggtgtacttg tgggatttga acaggataac tttaactatc agatttatga 2580 tttatcctct agaaaaatct acattactca tcactcaaca ttcaatgagg aagtgtttcc 2640 gtttaaatca attcagtcag tctcccctag tattccaaat gaacccgaca gaacttcggt 2700 caaaattcgt ttcttcgacg acgaagatga agatttcaca gagcctgtaa ctccaatcac 2760 ttcgccgggt gagattcatg aagaatcgac tccttctgct cccgttacta gtaatctacc 2820 cacctcaaat cctactcaac ctcgtcgttc tgagcgttct tcaaagaagg tttatgacac 2880 tctacgagga atgtgtactt ctggtgctga gttcgacgat gactggatca ctgagtcgtt 2940 cttcgcttgc gtggcacaag attgtttttc ggctcctgat cccaaatctt acaagcgggc 3000 tatggattcg cctgaagcaa aactttggaa ggccgcctgc gagaaggaat ttgcttccct 3060 taggaataaa gaagtgtggg agttggttga tcatccaaaa gatagaaatg tcatacgagg 3120 cttgtgggtt ttcagaaaga aatcattagt tgatggaagc ataaaataca aagcaaggtt 3180 cgttgcgatg ggaaacactc aagtaccagg tcaagactat ggagaaactt ttgctcccac 3240 tgggaaacct agctctttgc gtttacttat cgcaattgca gcaatcaatg ggtgggagat 3300 tcatcaaatg gacgccgtca ctgcttttct aaacggtatt ctgcatgaag agctttacgt 3360 cgaaattcca gaaggctacc gcactgcttc taccgtcggc aaggtctgga ggatgaagaa 3420 atctttgtac ggattcaagc aatctccgaa aatttggcag gatgatgttg 
aagccttcct 3480 cattcaaatt ggtttctatc aatgtgaaat cgaccattgc atctatatcc gttcagtagg 3540 caagttgttc accgctgtgt atgtgcatgt agatgatctt gctatcacgg gtaatgatat 3600 tgcacgtttt aaggtagaga tctcagcgaa atgggagatg gaggatttgg gtctggctaa 3660 tactgtagtt ggaattcaga ttcaacggat cgatgatcac tcttattcaa tgtctcaaca 3720 tcagtatgct cttacggtcc tcaaacggtt tgaaatgaca aattctaagc ctgctataac 3780 tccactgtct ccaaacgtaa aaattttgaa atcaactgaa gaagaagttc aagaatttgc 3840 caagacaaaa cttccgtata gaagcgtagt aggttcattg atgtatttgg ctcagtgcac 3900 tcgccctgac atggctcatg cggtgggcgt gttgtctcaa catctcgaac aacccaacaa 3960 acagcactgg gattcagcaa tgcatgttct cagatatcta tgtcatacta taaatgttgg 4020 aattgtgtat tcaggtaatt atcccaagat catctctgga caacgtagct ttgaatgtcc 4080 tatatctcac tgcgatgcag actgggcagg agatgtcaat actcgtagat caacgaccgg 4140 ctatgtgttt gtgcttgctg gtggtcctat ttcttggcga agccgtctcc aaccgactgt 4200 ggcactatca tcaacagaag ccgagtatag ggctattacc gaagccggtc aagaactcct 4260 gtggttaagg aacatgatgt ctcgttttgg tttcattgat cctaatccca ccgttcttca 4320 aagtgataac ttaggcgcca ttcatctgac ctctaaatcc atctttcacg caagaacaaa 4380 acacattgaa attcattatc attggatacg agaagtggtt aaaaagggag atctttcaat 4440 caagcattgc ccgacacatt tgatggttgc agacctgttg acaaaacaac taccaaagga 4500 acaattctca aatctcagga agcaattggg tttacgattt ctaggataat actctttgag 4560 ggggtgtgtt aagatactat cctcttacga gtctaccatc ttcattgtta gtcaaagatc 4620 attatccatt tcaggtaatc agaaggtgca gatgcgaggt gtatgagttg gttaagaaga 4680 agtatgagtc ggttaagaag aaatgttgtg ttgtatggac ttggtgtagg aagttgacca 4740 aaggggttaa attgtggatg tatttcagtt gagctagtgg acaaatcggg aaatggaatc 4800 aaggggtagt ggaagaatct tttgggttat ctcttctcat tttgattttt gtttttccct 4860 cttttctgat tcttttcttg tcgaaaagaa tcagtgagtc tctctctatt agttctctat 4920 tattgtcttt ttataagcct ttttctacct ttactaacat ctcgactctc aactcagaat 4980 caaactgttc atttaattta tcctaatacc ttcttacgcg tgctcttcac gattcccttt 5040 gagctcgcag gtagataaat ttctcttctt aatttttctc tgatctctgt taatctttct 5100 ctatgattgt ttgtcttgtg atggttgtta ggttagtgtc ttgttatcgt cgtagacctg 
5160 ctgttcgctg tagtcttacg aagatacctg tgccctcagg taagttgtcg aaaagaatca 5220 aatcaaac 5228 // ID Gypsy-103_MLP-LTR repbase; DNA; FNG; 165 BP. XX AC AECX01000547; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-103_MLP_; KW Gypsy-103_MLP-I; Gypsy-103_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-165 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000547; Positions 12496 12660. XX SQ Sequence 165 BP; 37 A; 50 C; 25 G; 53 T; 0 other; tgttatgagc atattaccac ttatacacat atgcttgtac tgtgcatgtc cgcgctccca 60 tgtctaccct tgactgagcg ctgtgacttt gcttcttctc tatgttgcaa tctcattatc 120 aaagcacaac atcctagctc ctgtccctca cgtcagaccc taaca 165 // ID Gypsy-17_RO-LTR repbase; DNA; FNG; 398 BP. XX AC AACW02000311; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_RO_; KW Gypsy-17_RO-I; Gypsy-17_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-398 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000311; Positions 330148 330545. 
XX SQ Sequence 398 BP; 145 A; 71 C; 50 G; 132 T; 0 other; tgatgtatcc atcattcata actcgatatg aaaacagtat taataagcta catgtaaatg 60 tatccaatag acaagctaca ggttaattcg gataatcttc attaaggata agatgctgcg 120 taattgcacg gttaaactat attaaatcca ggaactaaat cacagtcagc cgcagtcaat 180 catatacgag tattcaatga caaatataac aatagaagaa cacgtaaccc ttatctattt 240 gtcatcagct atatcatact cattaattat taatccatgt catgaaatat tatccgtcta 300 tgaaaagtat aaatagagta caattttctt attaaataaa acaatctctt ttatcgagtc 360 tcgtcctctc ttgttccttt ttattcttgg ataaatca 398 // ID Gypsy-9_MLP-LTR repbase; DNA; FNG; 155 BP. XX AC AECX01002130; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_MLP_; KW Gypsy-9_MLP-I; Gypsy-9_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-155 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002130; Positions 20390 20544. XX SQ Sequence 155 BP; 39 A; 38 C; 28 G; 50 T; 0 other; tgttgtgatc cctataaggt gattatgctt gtactgagta gagggccgct accattaatc 60 tctgcgcgcc agttagttgt acactcatgc ttcaatctta gttactatat cagatcacct 120 atctctgatc tctgatacta ccgaagccca taaca 155 // ID Copia-38_MLP-I repbase; DNA; FNG; 5365 BP. XX AC AECX01000934; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-38_MLP_; KW Copia-38_MLP-LTR; Copia-38_MLP-I. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5365 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000934; Positions 40581 45945. XX CC Positions [2174-2404] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 3806..5347 FT /product="Copia-38_MLP-I_3p" FT /translation="MSGPDKQAWVHAMQQEFDSLTEHGVGKLVEPPPGANV FT LGGMWIFNKKRDEHNRVIQFKARWVVLGNHQIKGVDYNNTYASVGKLDSLR FT ILLALATVKTNRRTRGRMKVRQFDVVTAFLNGNMKDLVYAIQVKGFENPTL FT RHRVWQLIKSLYGTKQAARRWQQHFGATAAGFELHATTSDTAVYVLKSTLG FT LLILHLHVDDLLIFCDNDDLFLKFQTFINSKYQLKWTDKPTLYLGIKLDIS FT QDGSVIKISQSHYIEAVLERFAMVNCKPSKSPLPQKLVLTPGTIDEIEEAK FT NIPYQELVGCLQWIATCTRPDIAYAVSQLSKYNSAWTITHWTAAKHLLRYL FT KGTQDLSITYSGRIEEPQAYSDSDFSQCPLTRKSVTGYVVTVANGAVSWKS FT QRQSVVALSTSEAEYLAATECAKHMSWVRSFYFDIMHQLEKPTPFYIDNTS FT AIFTATGDGIKSRSKHIDRRFHYIREIIESNNLIIHHIPTEEMLADHLTKP FT LGPIALKHALQLNHMIEM" FT CDS join(587..2404,2408..3727) FT /product="Copia-38_MLP-I_1p" FT /translation="MNPTDATSESKPSESTSHNPSSQKTQLSLPKNSIPFP FT SKEFKMTSNSTDQKFSLDSYSNVVKLSTGTFNDWKLRLSTALGAQRLSKYI FT LQDLKAPTDPDELEEYETYSLRALAAIHSTIDAENFQVIQSCTSPREAFKA FT LCQHHDDAGGLSTAHLFSDLMTLRMNPDDNLSDHVSKFRKIHNDLLSNLSS FT TPDFKISEPFIAIILIKSLPSEYTPLVQSLLANFETLTLPRLYSLLKIEAT FT RASSTNPSDTALAASRPNFKRPGKKSDRPTNHNSSLKCSLGHAGHNDENCR FT TRKYRAFLEYEKNQQSSSSNHVNVSAQLSQSIPEADEDVSYWELAFSASTS FT SSAPIICNTGATSHMFSDKTLFSDLQHTRPTRIGVASQDGAIWAKHKGTVR FT FESIILRDVLYSPQLTGNLISVGRLCDDGFNASFTKTLGVITDSSGKEVLR FT MSRNKQTNRLWTPIVKSTHSTAMFTYTDPAELASTWHRRLGHLHPDAVIIF FT LRRHKLISLSRKDFLPCDSCAMGKLKQSPSTHSFHRSPGVLNLVHSDLIGP FT ISPATKTGLKYIVTFIDDHTRYSMVYLLKSKDQTFEAFKQYKALMENKCGS FT KIKKLKSDGGEYSSNKFLNFLQQEGIEIERGPANRPTADSVLERYNLTILS FT KTRSQLIQSGLPLYLWGELVKYCCIQINCSPTAALCHKLPIEVFESLLPGH FT VHPFDVDRLKPFGSLCFAVDRNRKSKMAPIARRYIFLGLEDGARAARLWDK 
FT ASGRVFVTGDVVYREGVFPAHHPTLSPNVKDEFILPEYPDEAVTSPGPSNI FT PEDSSADKNNPEEMSPIQDESKSLHNTPSIHSLESDGSSSAPSFRPSKIIP FT LSQSIHAPRPGTSSQPIPAEEPNQNQKITPIITDLRLRRQESLSPPSSHSL FT ASSVNQSSGVSSNDQPDVTFPASNRSPSIQSPVSVHETIRSPSLSPVPSAP FT TSDRQLSKSPSPEVQIQVTSPAKSPSPPARLPTPPPCIPTPPPVAPTKLAV FT KTDPPPMRRSSRTRQAPDRYGFLSQSNVAVSTSPMLITPH" XX SQ Sequence 5365 BP; 1495 A; 1337 C; 1030 G; 1503 T; 0 other; attcaaacct aatagctcaa attgtatcgc atatttatac ctggaagagc accgagtatg 60 ggattcttct ctcttcatcc aagttcttct atttaattct tggatgaagg tgagtctcaa 120 ctcggggatt tgtgataaga ttcaaggttc tcattctgaa tcatatctaa tcctgctctg 180 acaggtaagt tattctaatg ttgtcatttc atatttcata actttgtttt gtgttttcat 240 attgttttat gatttgaaat atcaaaactg atccaagttc ttctatttaa ttcttggatg 300 aaggtgagtc tcaactcggg gatttgtgat aagattcaag gttctcattc tgaatcatat 360 ctaatcctgc tctgacaggt gagtctcaac ttggggattt gtgataagat tcaaggttct 420 cattctgaat catatctaat cctgctctga cagatgaagt ttgacaggtt atgagcccag 480 cgtgatccac gtctgcggta aacattcctt tacattctat cagtttcagt ttagaacctt 540 atcaatctat tcactacccc gacttcgatg atctcaactc gtcttaatga accccacaga 600 cgccacttca gaatctaaac catcagaatc tacttcacac aatccttcga gccaaaagac 660 tcaactttcg ttacctaaaa actctatacc atttccttca aaagaattta aaatgacatc 720 taactctacc gatcaaaaat tttcgttaga ttcttattcc aatgtagtca agttatccac 780 cggtaccttc aacgattgga aacttcgtct atctactgct ttgggggctc aaaggctctc 840 taagtatatt ctccaggatc tgaaagcacc cactgatcct gatgagcttg aagaatatga 900 aacctattct ctccgagctt tagctgcgat acattcaacc atcgatgctg aaaattttca 960 ggtgatacaa tcttgtactt cgcctcgtga ggcattcaag gctctttgtc aacatcatga 1020 tgatgcgggt ggtttatcta ccgctcattt attctctgat ctaatgacgc tccgaatgaa 1080 tcccgacgac aacctctctg atcacgtatc taagttccga aagatacata acgatcttct 1140 aagcaacttg tcctcgaccc ctgatttcaa gatatctgaa ccgttcatcg cgataattct 1200 aatcaagtca ttaccatctg aatacactcc ccttgttcaa agcttacttg ccaactttga 1260 gaccctcact cttccgcgtc tctactcatt actcaagata gaagctactc gagcctcttc 1320 gacaaatcca tcggatactg cacttgccgc cagccgtcca aatttcaaaa ggcccggtaa 
1380 gaaatctgat cgaccgacca atcacaactc atctctgaag tgttctctgg gtcatgcggg 1440 tcataatgat gaaaactgtc gaacgaggaa gtaccgagca tttcttgagt atgaaaagaa 1500 tcaacaatca tcctcttcca atcacgtcaa cgtatctgct caactatctc aatccatacc 1560 tgaagccgat gaagacgttt catactggga gttggctttt tcagcctcaa catcatctag 1620 tgcgcccatc atatgcaata ctggggctac cagtcatatg ttctctgaca aaaccctttt 1680 ctcagatcta caacacactc gtcctactcg aatcggtgtt gcgtctcagg atggggcgat 1740 ttgggcgaag cacaaaggaa ctgtgaggtt cgagtccatt atacttagag acgtcttgta 1800 ttcacctcaa ttgaccggaa atcttatctc tgttggtcga ctatgtgatg atgggttcaa 1860 tgcttcattc accaaaacct taggagtgat aacagattca tcaggaaagg aagtccttcg 1920 aatgtctcgt aacaagcaga caaatcgact ctggacgcca attgtcaagt caactcattc 1980 aacggctatg ttcacctaca cagatccggc cgagcttgct tctacctggc atcgacgact 2040 tggtcacctt cacccggatg cggttatcat ttttctacgc cgacataaac tgatatcttt 2100 gtcaagaaag gatttccttc cctgtgattc ctgtgcgatg ggtaaactca aacaatcccc 2160 atccactcat tcatttcatc gttctcctgg tgttctcaat cttgtacata gcgacttaat 2220 tggtcctata tctcctgcca ctaaaaccgg tctgaaatac atagtaactt ttattgatga 2280 tcacacacga tatagtatgg tctacttgtt gaaatccaaa gatcagactt ttgaggcttt 2340 caaacaatac aaagctctca tggaaaacaa atgtggcagc aagattaaga aactcaagtc 2400 agattgagga ggagagtact cctccaacaa gttcctcaat ttccttcagc aggagggtat 2460 tgagattgaa cgaggtcctg caaatcgtcc aactgcagat tcagtattgg aacgatacaa 2520 cctcacgata cttagcaaga cgcgatcgca actcattcag tcaggtcttc ccttatactt 2580 atggggggag ctagtgaagt attgttgcat tcaaatcaat tgttcaccga ctgccgctct 2640 ttgtcataaa ttgcccatag aagtttttga atctcttctt cctgggcacg tacatccttt 2700 tgacgtagat cgcctgaaac cttttggttc attgtgtttt gctgtagatc gtaatcgtaa 2760 gtccaagatg gctcccatag ctcgtcgtta catttttctt ggtctggaag atggtgctcg 2820 agcggcacgt ttgtgggaca aggcatccgg acgggttttt gtgactggag atgtggttta 2880 ccgagagggg gtttttccag ctcatcatcc cactctatct cctaatgtca aggacgaatt 2940 cattctacct gagtatccgg atgaagccgt cacgtctcca ggaccatcaa acatccctga 3000 agactcttca gcagacaaaa acaatcccga agaaatgtca ccgatccagg atgagtcgaa 3060 
aagccttcat aacacacctt cgattcacag cctcgagagc gacggatcct catcagcgcc 3120 atcattccgg ccatccaaaa tcattccgtt atctcaatca atccatgctc ctaggcctgg 3180 cacctcatct caaccgattc cggcagagga acccaaccaa aaccagaaga tcaccccaat 3240 catcacagat ctcagactaa ggcgtcagga atccttgtct cccccatcgt ctcactcttt 3300 ggcatcgtct gtgaatcagt catcaggtgt atcttcgaac gaccaaccgg atgtcacatt 3360 cccggcttca aacagatcac catctatcca atcacctgta tctgtacatg aaaccattcg 3420 ctcgccttcg ttatcgcctg ttccatctgc accaacatct gacagacagc tatcaaaatc 3480 accaagccca gaagtccaaa ttcaggttac atcaccggcc aaatcacctt ctccgccagc 3540 tcgattgccc acaccgccgc cttgcatacc tacacctcca ccggtagctc ctaccaagct 3600 tgccgtgaaa acggaccctc ctccaatgcg cagatcgtcg agaacgcgtc aggcaccgga 3660 ccggtacggt tttctgtcgc aatcgaatgt tgcagtgtcc acgtcaccca tgttgatcac 3720 tccacattga acgttcatcc caaatcatta tgggttctct gctacaaacg gaacggatcc 3780 tgatagtcca acatattcgc aagccatgtc agggccagac aaacaggcat gggttcatgc 3840 gatgcaacag gagtttgatt ccctaactga gcatggtgtc ggaaaacttg tcgaaccccc 3900 tccaggagca aatgttcttg gggggatgtg gatcttcaac aagaagagag acgagcacaa 3960 ccgtgtaata caattcaagg ctcgctgggt ggttttggga aatcatcaaa tcaaaggtgt 4020 ggactataac aacacttatg catctgtagg caaactcgac tctctacgca tcctgttggc 4080 attggctact gttaagacca atagacgcac acgcggacgt atgaaggtga ggcagtttga 4140 tgtggtcacg gcctttctga acggaaacat gaaggatcta gtttatgcca ttcaggttaa 4200 aggttttgag aaccctactt tacgtcaccg ggtctggcaa ctaatcaaat cactctacgg 4260 aactaagcaa gcagccaggc gatggcaaca acacttcggg gcaactgctg ctggatttga 4320 acttcatgct accacttccg atacagctgt atacgttcta aaatcaactc tgggtcttct 4380 tatcttacat cttcacgtcg atgatttgct tattttctgc gacaatgatg atctctttct 4440 caaatttcag accttcataa attcaaagta ccagcttaag tggaccgaca aaccaacact 4500 ttacttaggc atcaagctgg atatatctca agacgggtct gttattaaaa tttcacaatc 4560 tcattacatc gaggctgttc tagagagatt cgcgatggtc aactgcaagc cttctaaatc 4620 gcctcttcct caaaagcttg tgctcactcc tggtactatt gacgaaatcg aggaggcaaa 4680 gaacattcca tatcaagaac tggttggatg ccttcaatgg atcgcgacat gcacacgacc 4740 agacatcgca 
tacgcagtat ctcaattgtc gaaatacaac tcagcctgga cgatcactca 4800 ctggacagcg gcaaagcact tacttcgtta tctcaagggc actcaagatc tttcaatcac 4860 ttactctggg cggattgagg agcctcaagc gtattccgac tccgatttct cacaatgtcc 4920 gttgactaga aagtctgtta ctgggtatgt tgtcacggtg gcaaatggag ccgtaagttg 4980 gaagtctcaa cgccaatcgg tggtggcttt atcaacgtcg gaggctgaat accttgctgc 5040 aacagagtgt gccaaacata tgtcgtgggt gagatcattt tattttgaca ttatgcatca 5100 actagagaag cctaccccgt tttacataga taacacttcg gctatattca ctgcaactgg 5160 cgatggaatc aaatcaagat ccaaacacat cgacagaaga tttcattata ttcgagagat 5220 tattgaatca aacaacttaa tcattcatca cataccaact gaagagatgt tagcggatca 5280 tttaactaaa ccattaggtc ctatagcact caaacatgca ttacagctta atcatatgat 5340 agaaatgtag ctgaaatagg gggga 5365 // ID Gypsy-1-LTR_AN repbase; DNA; FNG; 198 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 8.11, Last updated, Version 1) XX DE Long terminal repeat of a LTR retrotransposon from the gypsy DE superfamily - a consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-1-I_AN; Gypsy-1-LTR_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-198 RA Kapitonov V.V. and Jurka J.; RT "Gypsy1_AN, a family of gypsy LTR retrotransposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(11), 189-189 (2003). XX DR [1] (Consensus) XX CC LTR retrotransposon: Gypsy superfamily. Solo LTR. XX SQ Sequence 198 BP; 45 A; 57 C; 38 G; 58 T; 0 other; tgttatgggt cctttgccta tacaaggacc ttagacctta gtgactcggc caaggcctgc 60 gctgtcctga aggcggtgag ccacctacaa gacttcctcg caacaacaat ccttctttct 120 cctttcttct ttagcgattc cttcttgtac gtacggcacg tctagatagg aagatccatc 180 taaatacgtc ccttaaca 198 // ID Copia-19_MLP-LTR repbase; DNA; FNG; 671 BP. XX AC AECX01002810; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-19_MLP_; KW Copia-19_MLP-I; Copia-19_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-671 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002810; Positions 3122 3792. XX SQ Sequence 671 BP; 191 A; 127 C; 99 G; 254 T; 0 other; tgttatgact aatcggattg ttgtgttatg ctgttcgatg ctgtgtgatg tttatgtaat 60 gctgtgtgcc ctgatgacgt gtgtgctgac caatgatatg tttctctatg acatcattga 120 tatgtttcat ctcagaattg atttaagtac cgctctactc cgaagaactt tgtcccactc 180 ttcgcaaaac ccatttcaac ttaaatcatc ttctctattc agcgtttctc ttttcgctaa 240 aagttccttc gtacaaaact ctttcgactc gttctaaaaa gtgactcaat actacacgac 300 aatcttcttc aactataagg ctcaatctaa atcgaacttc agtctctatc aaaaaaggaa 360 gaaggtgaca ttactttcta atcactatgt tctatgattc aaagtcgtgt taagattttc 420 tcctctgact cctttttcaa atcgattcat ttaaaatgta gttcctcttt gattacattg 480 ttcgatttaa aattgctgtt ataaagttga atcttactag tgtagttatt gataaaggta 540 caactttctc ttttcatttc actctatcta aatgtatttg ttgcgggtta cttacaatag 600 tgttttattg ataaagaaat taaaatataa gttttcgtct taaagaaaga gagatcctac 660 cattcaccac a 671 // ID Copia-58_MLP-LTR repbase; DNA; FNG; 275 BP. XX AC AECX01000356; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-58_MLP_; KW Copia-58_MLP-I; Copia-58_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. 
XX RN [1] RP 1-275 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000356; Positions 13 287. XX SQ Sequence 275 BP; 61 A; 64 C; 56 G; 94 T; 0 other; tgtctttttt tttcataacc tcttacgcgt gctcctcacg ttttgcttcg agctcgcagg 60 ttagtgatca gttctcgttg ttgactagca tagaagctat agtctaacga ggtacctgtg 120 ccttcaggtt agtgatcagt tctcgtcgtt gactagcata gaagctatag tctaacgagg 180 tacctgtgcc ttcagtgtat ccatttcgta ctatacgctt ctttttgcag aacagtcaac 240 tcaagattct catcagagtc tcgagaacag cttca 275 // ID Gypsy-81_MLP-LTR repbase; DNA; FNG; 390 BP. XX AC AECX01001084; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-81_MLP_; KW Gypsy-81_MLP-I; Gypsy-81_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-390 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001084; Positions 47795 48184. XX SQ Sequence 390 BP; 77 A; 106 C; 57 G; 150 T; 0 other; tgtgattccc aatttctctc tttgtccgcc taattccttt gtttctctct agacgcgctt 60 ttcagtgctt ttcacttttc tatttctttt cttcttttgt ttcttttcta tgttgctttt 120 ctaaaaagac ggatcttgca tagatccgtc ccgatcatca ttgttaccaa ttgtagcccg 180 gatccagttg taccggatcc ggccattcct actgatcgta gtcatccatt ccaaggctat 240 cccttgcctc atattgtact acatctattt ccttattcgt atcttacttg tagacactcg 300 cgtatataaa ctgctcgcat ctctcaaggc aatggattcc tcaagtcact tcctgctcat 360 tctcgcccca aagattgtga gtagatcaca 390 // ID Gypsy-1_PCR-I repbase; DNA; FNG; 14334 BP. XX AC AADS01000368; XX DT 30-JAN-2011 (Rel. 16.02, Created) DT 30-JAN-2011 (Rel. 
16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Phanerochaete chrysosporium genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_PCR_; KW Gypsy-1_PCR-LTR; Gypsy-1_PCR-I. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-14334 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Phanerochaete chrysosporium RT genome."; RL Direct Submission to RU (30-JAN-2011). XX DR Genome; AADS01000368; Positions 15001 668. XX CC Positions [13119-13598] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 2568..5402 FT /product="Gypsy-1_PCR-I_3p" FT /translation="MEDLTQPSLKQKGKQRAEPMREGTTARTGPSGSKTDA FT EGVQRRQPRRSEIQALVNPRTVLDKVLNAPLTLGVGEFLAVSQEMSKQVQE FT AIKPKNALGKSKALDKAMISQVKERDTFFDTATCATASAFPTRHRGQLIRF FT RIEIDGSPITAIVDTGSQLNIVNRKIWQNTVGRPIDVTRQIVMGDANGGEG FT LLQGFVPDVPLTCGAVVTRASLFIGTQPPFDLLLGRPWQRGNYVSIDERRD FT GTYLLFKDQSLVVRYEMLVTPEYDVDPALQEYLDRTHHLVNMVTGIDTSDI FT DIEFEYKQPRLERRQSGRTARMIQAARIQEVNTDVEDEELLTDYSEAESYA FT TSSYTTSPPLSPTASLESIDTIIDLESSDEDTDGGAESDSSYASASSDVIR FT WLDLQESEDNEDVSMEGEESEDVPELEPTTDAYDKREGSAEVYYASRPSQL FT HQMSEEEPAEDEYNEPGPILVNEDSHVPERGPPGRSLWELIIETVQDSVNK FT VLDEGSANSSDMQRRAHTLEVQHGIRAMDGQRCQEETCRAALLATDVPYGL FT IDLECIQSSQVSNGRYADEARADVPLEEADSLHDVVMNADTGSTENETPEP FT QENVFSVQVVGRKVSKRVKVKEGRRNAPDECIMPTVTDLSEQDTGDPGSMR FT VEEDPCVEGQGRDGEERIVDSVGETDQSDSEVSMGENDSPTDSQPESVTSD FT DESSDSGSEADPWQIIHDEYDDKHEEVRQSRRTIEKCLARRLRHELKEIDK FT AEEREGYSRDLLCWLADDIRVRCKDDHDQVWRELAQDFWEETRRLIEAEQY FT ELKALGRDCPQDVPAKCEVQSGGRPRCNLLKAGHVLSVQVGDSDTLAESVQ FT ALNLEEGSQPDVEMRDESLPRVLTEPGTPVLQPLDTFPVPGNAPLPLTRQR FT WRPELVSDPLTEEDVIPILADEDAWLELVGLRYVNGEEIAH" FT CDS 6659..9562 FT /product="Gypsy-1_PCR-I_2p" FT /translation="MQLVHHEPHGHLGIHVSGSSLFTHEVLGPGRDAISFR FT VPTQCFPFNQALWYAYNARLYLTVRPGHGLLGVFAMRPRLGRYFNADTPGD FT 
DHAILKFKTDDGAAFYIPLAEPGKGTVARVSITTQLLMGCLVPEMYQAGDH FT MFPMIVAPQHLVQLLEETDGLFREYEMHVRGLLVLTYDPPPPVPEFKAALG FT RVREIMERDHPDVEIFGRDRLRITWPWGEWVLDRNGNMQALRDEEETYQED FT EMEDVRKRRDDEKPRPELIPLPQAVFDFAGQRVQSPSSRREEQGLVENGPR FT RDEMRARETEAKRFCEQARLLIAGYAALKSGKPCDGHDRRYAREKPRERGT FT DAKPPRLLATDSSPHMVVPRPELERSDITDFFKSLTADLGAPVSEDPRRRW FT FPIRLAVGPGPSTAASAEGASSSGSVSPTAYSAGSPLSSTTSFHSPPIPFS FT PPRPATASPRIDVGRRTQDVHEALKAVKEVMREVGVGREHGELHGTPAGAG FT EGMLHAARTALTPYLCRDTDSAHRQALRPSQVLHINAAVEADVIERPPTPY FT LKPVDEHAVPRYGTANFSPSPLQLPSPLLNHVPNGLTLSPQKQQVMPHTVR FT TQPLRLPWHGADEAAQSSTSPASPSSSCPSLVLSSPHDDDALSTLYERLEE FT GQHNASNYNAWNAAFTVNTDSMDVEEPDTRAAPPQDDFEVLDEAVIEEARR FT EFVHRATRNDMDARTDEEERSFRRDTTGSSSSASTQESPESAIEEPYTRPL FT TVEEARAAGPPPSMVAPAPRIPLPARPRFAPTFTYQQQATHGQGEGMPRVG FT ESAGELHSLLRLPNPKQAFARYQRDGYLGTLALAGAAAVRNTVLPALHSEV FT RDAICAGLLQDMVDARSGKTELSIEMLEMFYPYHEDRDTPYLYPEEQMSLR FT RCIFFWTARNSLGQHDERIKWLQSLLHLQSGNRADNRNVRRMRQNGRLGPA FT GRGRDVVTYGGSL" FT CDS 8712..14198 FT /product="Gypsy-1_PCR-I_5p" FT /translation="MTSRCSTRRSSKRLGASSYTAPRATTWTHARMKKNGP FT FGGTLLAAQALPVRKSPPNPRSRSLTPGRSPSRKHAPPALRLPWSRPPRAS FT PYQRALDSHRRSRISSKRLTDKAKACPGSESPQASCTRYCASPTLSRPSLD FT TSATDTSGPSLWQEQQQYATQYSLLCTVRSATPSALDSCKTWSTHAVGRPS FT SPSRCSRCSTRTTRTATRPTCTPRSRCPCDAASSFGRRGTPSASTTNASSG FT FKASSTCSQATGQTIATSDVCARMGALALQDADAMWSPTGDRFEAVSASFD FT VATVPRQANGRFLAIVESLPRTVEDASGWQRRDSCHRGMFSWFPTFLCFLL FT TLFRFSFLLFVWPEPTDAYVADSHALYATGSRAVWLFAGENGLKRGLARIL FT GLGTRTLDFLCATDAFCCVSDSDRFSLDFSDYSFDFSYLFYGFGLSTTDTD FT MDRDRDRETDSDPCSTMDSDSDSGSDTDSWHLNAALVLVAGSLTSMYALWT FT LYYQRKSSKLACQSLGNIQAQATPGPAVESDSMLEARNSISTTTLTMDTGP FT DSMTAMMSSLPAMRDIQPAANLAAASADITTSALETGTGAPHATRRNANDS FT CLLPSTVGALARFGSPDTCERRISDPRDLGAYLTEGLHATSLESGSMGAIP FT SNECNQPQTFYATIQEESLNIQWDDIEPVLVTTDMDKFMEGLDEEDTIITY FT DGEYMFSSKQQMSSPEVLTAYKRVDRKVKPVPAVFPEDARVLRQFPEDPLA FT SLPPLTPRPPTFEPNGGRLTFENLEAMNLNATGFLWPEELKLFQHVLQLNQ FT AHFVFEDSQRGSFREDYFSPYIIPVVPHVPWAFKNIPIPLGIHQKVLDLLR FT EKIVAGVYESSQSSYRSRWFCVLKKSGKLRIVHDLQPLNKVTIRDAGLPPN FT 
LDNFVEPFAGRQCYTVFDLYWGFDACKVHPQSRDLTAFMTPLGLLRITSLP FT TGFTNSPAEFQACMSFILQHEIPEYANIFIDDLPIKGPKSCYKDKLGRPEV FT LQANPGIRRFIWEHANDVHRIIHRVGHAGGTFSPSKVQLAQEEVLIIGQKC FT TPEGRLPDSQKIEKVLKWPPLKTVKDVRGFLGLCGTVRIWIENYSAKARPL FT AELIRHDVEFEWDTRRQAAFEELKQAITSPPALRPIDYDSERPVVLSVDSS FT IIAVGFILAQYDENGKKRPAKYGSIPMNSIESRYSQPKLELYGLFRALRAY FT RLYIIGVKHMIVEVDAKYIKGMLNDPDLQPNATINRWIQGILLFDFELVHV FT PANKHRGPDALSRKEMAEGEEIVEEDDSWLDDIPLLIPEQCGPWMNIVSLY FT TRLVSSVPEPTIRTFKACLYEPDEPMVEHVFAASTQQQERTLADISHFLQT FT TRTPQFQTTQEKQRIIQKALRFYVTGKAMYKRRGLQGPAKVIFRAHKREEL FT LQAAHEGQGHRGAEAVMAMLKERFYWPNMWQDIRHHVQSCHQCQIRSVRKA FT QVPIMVSTPSTLFLKVYVDVMFMPKSHGYRYIAAARDDLSGAAEGRSLKRN FT DALSMSKFLWEQVFCRYGAVGQVVTDNGKEVEAAFSELMDRFGVPQIRISA FT YNSKANGVVERGHFIIRESIVKACNGKVSDWPKHVCHAFFADRITVRRQTG FT YSPFYLLHGTDPVLPFDLTEATFMMDGYKRGMSSVELLALRLRQLQKRPED FT LSKAAEVLAKHRFKAKEQFEHRYRTRLIRESYPPGTLVLLRNKAIEATLER FT KHKDRYLGPYEVVRQTRNGAYILQELDGTVWRQAIAAFRLIPYVSRSSEYL FT EAIMEQLHDNAEDESSGTDPWFSDSEREYGSGKSAESSVNQDSSFEGSESD FT SYSPSSF" XX SQ Sequence 14334 BP; 3734 A; 3649 C; 3877 G; 3074 T; 0 other; agtagagtgt tgttgttgtt gttgtttgga gttgcgctct agcctggttt aatccctggt 60 agagtgttgg tgtttgatgc tctaggcttg gttagatccc aagtagagta tctgtgttgg 120 tgtttatgtt gtttggtgtt ttccggcaaa gtggtcaagc agacaggctt ttccttttgc 180 tgtgtgttta aggtcctgct acaaaacagt ttgtagtttg atcttacccc aggcgtcacc 240 ccacccttac catcttctca cacatccttg aacccccttc cacctctgta atttctccca 300 cgaacgcccg aggcagaagc tgaccaaaga taagtccgcc taactcggtt cctcgattca 360 aacactactc ctgcaagcta aggaggtttg aacaggatca accacgacat ctggagccca 420 ccgcgaggga gaaaacagag ttggttgttt gttctggtag atgtcaggag atcctccgac 480 gcctacgttt tcgtctacgt cttcgccacc gtcgtcagta ccgtcgttgt caagcacgtc 540 gtcccgacct acgtcacctc agcaagtagt tggttttatg tcacacgcgt ctacgtcgtc 600 aagtgttcac gttacggttc gcggtttagc atccatgtca tcaaggatgc ctgctggttc 660 gtcgcaagtt caaggccagc agcagcaaca gcctcagggt cagccccacg tgcagtcatc 720 gcaacagaca gcacaggcta ccatagctgc agctgcaccc ctaggaggtg gtgtagtcga 780 cctacccctc ccaggtcaga 
aaggagctcc taagaagttc acaggaaagc attcagatgt 840 gcttccgttc ctgtatttct atgagcgcct ttgtcagaag cacaatgtta cgtcagacca 900 agacatggtc gagagcatta cacagtattg ctcacgtagt gtgcgcaagt tcatggaggg 960 tttgccaagc tatcgaaccc ccagctgggc atcctttgtt cacagcataa agaagtatta 1020 tgaagcggac aaggattcca agcgttttgc taccagagat ctggagaagc tcgctcgtcg 1080 ttctaggact aagtctatca agacaatgaa ggcttgggcc aagtatatgc gagacttcat 1140 ccggattgct ggttggttga agaatcacaa caggctttca gactacgact acaagtattt 1200 ctgctggata ggaattcctg aatccttcag aaacttactt gagtctcggc ttgttgctaa 1260 gcttcccaac catgacctcg cagaaccctt caatgtggat gatgtggtcg aggtagcaga 1320 gtctctcttg catcgttgtc gctttgatcg agatcgttta ctatcagata cagaagtaga 1380 tgaatctgga aatgaaacaa cgtcagactc agatgaatca gattcaggat caggagaaga 1440 agacctatct gaagatgact ttatcagcaa gaggaagagg aagaagaaga agaagcgaga 1500 gtcgtcgagg gttagagaag ggaagaagaa gacccaggta gcgaagaagt catcgaagcg 1560 ctcagtcaag ttcaagaaga cagagcatgg gtcttcatct gaaagcgact ttgatgcccc 1620 atcgtcacgc atgacattac gtccacccac tcctaacaag gaactgtcta gggatgatct 1680 tgaaattgag gaattaatca cgcagatgca aggaatgcgg attgatgatc ctgcttacgc 1740 agtggcctat ttccgggcct gtaacaggaa ccccttagtc aagagcattg tggcagcacc 1800 agtagatcgg aggcgggctt ctgcaccacc agcgcctcca ttagtgccaa gagccaggaa 1860 cgcagatcgt gatccaccac cccattttgg aaatggatct tcgtcgttct cggatagaat 1920 ggaggatcgc ctctgttctg tttcggatgt ggacttacag gacacaccat gaatagatgc 1980 gagaagctgc aggagtactt tgctaatgga atcatcacca gaaatgagag tggtcaaatc 2040 attctcacca atggcggtat tctattcaag cgacgtaatg aagattggat atcagccatc 2100 aagcgagcca ctacgcctca ggccaacttt gtttcagtcc aggcaaccca gcgccaagcg 2160 actcaaacaa atttgcatct gcaagaaggc aagctccggt tcactactac tcaagtgaat 2220 cagagcaggt ggaaagtgat ggttacggtg acttcgcaga agatgaagaa gaggaggaag 2280 agaaggaagt ggaggtcctt gaggctacta ggtctacccg aggcattact acaaatcgta 2340 aggagcgctt cgacggtgta tggttaccct cttctaagca agcagaagag cgggcaaaga 2400 aaaaccttcc acctgttaca gtaaagcgtg gtcctccgag gaatgcccgt aaggatgcag 2460 atcagcgaca ggtgcaagta cctgtacctg 
taccggttcc agacccagta ccagtgccag 2520 tggatattcc ccctattgcg tttgacaccc atgatgatga cgcattcatg gaggatttga 2580 ctcagccatc ccttaagcag aagggaaaac agcgagcaga acccatgagg gaaggaacca 2640 ctgctcgaac gggaccttca gggagcaaga cagatgcaga aggagttcag cgtagacagc 2700 cacgacgttc agagatccaa gctcttgtga acccaaggac tgtgctggac aaggtcttga 2760 atgcccctct tacgttggga gtaggggagt ttttggcagt ttcacaagaa atgtcaaagc 2820 aggtacaaga agctatcaag cctaagaatg cccttggtaa gtccaaagcg ttggacaagg 2880 ctatgatctc acaagtcaaa gaacgcgaca cattctttga cacagcaact tgtgctacag 2940 cttccgcatt tcccactcgg catcgaggtc aactcattcg cttcagaatt gagattgatg 3000 gaagtcctat tacggcaatt gttgatacgg gatctcagct aaatattgtt aatcgcaaga 3060 tttggcagaa caccgtagga cgtccaattg acgtcacaag gcaaatcgta atgggagatg 3120 ccaatggtgg agaaggactg ctgcaaggct ttgtgcctga tgttccatta acctgtggtg 3180 ctgtagtcac ccgggccagt ctgtttattg gaactcagcc acccttcgat ctattgcttg 3240 gaagaccttg gcagcgaggg aactatgtta gcattgatga gcgtcgggat ggaacatacc 3300 tgctattcaa agaccagtct ttggtagtca ggtatgaaat gctagtcact cctgaatacg 3360 atgttgatcc tgcactgcag gaatatctag accgtactca ccatctagtc aatatggtga 3420 caggcataga tacgtcagat attgacatcg agttcgagta caagcagccc cggttagaac 3480 gacgtcagtc agggcgaaca gccagaatga tccaagcagc tcgtattcaa gaggtgaata 3540 cagacgttga agacgaggag ctgttaacag attattctga ggctgagagt tacgctacat 3600 caagctatac tacatcaccc ccgctaagtc ctacagcaag ccttgaaagc attgacacga 3660 ttatcgacct tgagtccagc gatgaagata ctgatggcgg tgctgaatca gacagcagtt 3720 atgcaagcgc cagcagtgac gtcatacggt ggttggactt acaagagtca gaggataatg 3780 aagacgtgtc aatggaaggc gaggagtcgg aagatgtacc tgaactagaa cctacgacag 3840 atgcttacga caagagagaa ggcagcgcag aggtgtacta tgcgtcacgc ccctctcaac 3900 tgcatcagat gtcggaagag gagccagccg aagatgaata taatgagccg ggacccatac 3960 ttgttaacga ggattcgcac gtacccgaac gtggtcctcc tggacggagc ttatgggagc 4020 tcattattga aaccgtgcaa gactctgtaa ataaggttct agatgaagga tcagcaaatt 4080 catcagacat gcaacggcgc gcacatacct tggaagtaca gcatggcata agggccatgg 4140 acgggcagcg gtgtcaggaa gagacttgcc gtgctgcctt 
gcttgccact gatgtgccgt 4200 acgggctgat tgacttagaa tgtattcagt ctagccaggt atctaatgga agatacgctg 4260 atgaggctag ggctgacgta cctctagaag aagcagacag cctgcacgat gttgtgatga 4320 acgctgacac tggctccact gaaaatgaga cacctgaacc acaggagaat gtgttcagtg 4380 tccaggtcgt aggaaggaag gttagcaaga gggttaaggt gaaggaagga aggagaaacg 4440 caccggatga atgcatcatg cccacggtca cagacctgtc agagcaagac acaggagacc 4500 ctggtagtat gagggtggaa gaagatccgt gcgtagaggg acagggtaga gatggtgagg 4560 agaggatcgt ggacagcgtg ggagaaacag atcagtcaga cagtgaagtc agtatgggag 4620 aaaacgactc acctacagac tctcagccgg agagtgttac ttctgatgac gagtcctcag 4680 acagcgggtc tgaggcagat ccgtggcaga taattcatga tgagtatgat gacaagcatg 4740 aggaagtaag gcagagtcga cgtaccattg agaaatgtct tgcccgacgt ttgagacacg 4800 agttgaagga gatcgacaaa gctgaggaac gcgaaggtta ctcgcgtgac ttgctttgct 4860 ggttggctga cgacattcgc gtacgctgta aggacgacca cgatcaggta tggcgcgagc 4920 tcgcacaaga tttttgggaa gagactcgcc gtctgattga ggctgagcag tacgagctca 4980 aagccttggg aagagactgt cctcaggatg ttcctgcaaa atgtgaggtt cagagcggcg 5040 ggaggccgcg ctgtaatctg ctcaaagcag gacatgtcct cagtgtacag gtgggagatt 5100 cggacacgct agcggaaagt gtccaagccc tcaacctaga agaaggaagt cagcccgatg 5160 tggaaatgcg cgatgaaagc ttacctaggg tacttacgga gcctggaacc ccagtgcttc 5220 agccactgga tacattccca gtaccaggaa acgcaccgtt accattaaca cggcagcgtt 5280 ggcgaccaga gctagtaagc gatcctctga cggaagagga tgttatcccc atcctggcgg 5340 atgaggatgc ctggctagag ctggtaggtc ttaggtatgt gaatggagag gaaatcgcgc 5400 actgagaatg gcgtgagact gtgcgcaagg cgaaacgcat tctacgggac cggttgcgtg 5460 agatcgacga ggatgaaagg aatggcactt acccaccgcc acctggagac ctatgaatta 5520 gcgcacgaag tgatcgggcc tccgcagaac gcctctttga tgtcgagtgt gactcggcgt 5580 caagagacct ctgggtgcgc cgtagtgcac tgttcaggca gctggagaac gagcggagcg 5640 ccgttcgaca cgtcttcagc cggcagcgaa cgccaggcct tcatgctgga ccacccgacg 5700 aagggggact tgaggggacg cgctatacat cactcgaacg tgagcgcgaa cgcgaagact 5760 caggatctgc aagcagcgcc atgacgtcag tagatcaatc ctcggtacct gaagaagacg 5820 taccttgggc tgggctggcg cttgcgcgac aggagctgag aggggaagga 
tgcccggcgg 5880 ccgaaagaac cgacactgca cgaacgcgtg caggacccct ggcacccatg cgcgtacgcg 5940 aagttcccgg gtctgcgacg cgacaggacc gcggcttgcc gagttttgtg cttgagggtg 6000 caaaccaagt gaatggctcg atctgtgaag ggcaaagggt atgtcaaacg tctcagagag 6060 tgagagaaag gcatacctgc gcagcagagg ccgcgaacgg ccgccgagtc gctgaaggcc 6120 tcgaagtacc gtgttcgacg cgagtgcaga ctcctgaatg cacacaagac gttagcgaat 6180 ccgaaaggga ccctgaagca agcggtaggc gtacctttgg aaatgagcgg ggcagtgaag 6240 ctgccgcgcg gctcactgac atgttttcgc ggcgctgcgc cgacgcacat gtgcaggtgt 6300 acgaagggca ttgggcacct aaagaaaggc gtgaatggct cgaacgcgta gcagggcagt 6360 acatacctac gatgctcacc tcgtgcgctg ccctgagtcc ggattcgttg aggattggca 6420 cgattgagag tcgccaggtc gagaggaagt cgcccaggta cctcagggca caggggactt 6480 ctgcgaccga aacgggagag ttagcaggac agaaagtgat aaggagcaga agggggactc 6540 accagaacgt gttccccggc tgctcagctc gagtcttggc tacacaccga cgttttcacc 6600 tgtttgcacg acttctgagc tttttcgacg cgttctcagc atgtcggatg cttgagacat 6660 gcagctcgtg catcacgagc cgcatggcca tctgggtatt cacgtttctg ggtcatcctt 6720 gttcacgcac gaagttctgg gccccggtcg cgacgcgatc tccttccgtg tgccgacgca 6780 atgcttcccc ttcaatcagg ccctctggta cgcatacaac gcgaggctat acctcaccgt 6840 gcgaccaggg catggcctgc ttggagtctt cgcaatgcgc ccacgcctgg gacggtactt 6900 caacgcggac acgccaggtg acgaccacgc gatcctgaag ttcaagacgg acgacggcgc 6960 agccttctac attcctctgg cagagccagg gaaaggcacc gttgcccggg tcagtatcac 7020 gactcagctg ctcatggggt gcctcgtacc tgagatgtac caggccggcg accatatgtt 7080 tcctatgatt gtcgccccgc agcacctcgt gcagctcctt gaggagacgg atggactctt 7140 tcgtgagtac gaaatgcatg tgcgaggatt attagtactt acctatgatc ccccgccccc 7200 agttcctgag ttcaaggccg cgctcggccg tgttcgcgaa atcatggagc gcgaccaccc 7260 tgacgtcgag atcttcggcc gcgaccggct gcggattact tggccgtggg gagaatgggt 7320 gctggatagg aatgggaaca tgcaggcgct gagggacgaa gaagaaacgt accaggagga 7380 cgagatggag gatgtgagga aacggcgcga cgacgagaag ccgcggccag agttgatacc 7440 gctgccgcaa gcggtgtttg actttgctgg ccagagggtg cagtcgccgt cgagccgtcg 7500 agaggagcag ggactagtag agaatggacc gaggagggat gagatgaggg cacgcgagac 
7560 cgaggcgaag aggttctgcg agcaagcgcg actgctgatc gccgggtatg cggccctcaa 7620 gtccggtaag ccgtgtgatg ggcatgatcg caggtacgca cgtgaaaagc cacgcgagcg 7680 aggcactgac gcaaagccgc ccagacttct ggccacggac agcagcccgc acatggttgt 7740 tccccggccc gaactggagc ggtcggacat caccgacttc ttcaagtcgc tgacagccga 7800 cctgggcgcg cccgtctccg aggacccacg gcggcgctgg ttcccaatca ggcttgccgt 7860 tgggccagga ccttcaaccg cggcaagtgc cgaaggcgca tcaagctcgg gctcggtgtc 7920 gcctacggct tactctgccg ggtctccgct ctcgtcaacc acttcgttcc actcgccccc 7980 catccctttc agccctccac gcccagcgac cgcatcaccc cgtatcgatg tcggacgacg 8040 tactcaagac gtgcacgagg ccctcaaggc tgtcaaggag gtgatgaggg aagtaggagt 8100 tggacgagaa cacggagagt tacacgggac tcctgcagga gcaggagaag gtatgcttca 8160 cgccgctcgg actgcgctga cgccctattt gtgccgagac acagattcgg cgcaccgaca 8220 ggcactgcgt ccctcgcagg tcctgcacat caacgctgca gtcgaggccg acgtgatcga 8280 gaggccgcct acaccatacc tcaagccggt ggacgagcac gcggttcccc gctacggaac 8340 ggccaacttc tccccctctc cactccagct accctcgcca ctgttgaacc atgtgcccaa 8400 cggactcacg ctgtcgccgc agaagcagca agtcatgcca cacacggtac gcacgcagcc 8460 gctgagactg ccttggcatg gcgctgacga ggccgcgcag agctctacca gcccagcctc 8520 gcccagcagc agctgcccaa gtctcgtact cagctccccc cacgacgatg acgctctcag 8580 cacgctatac gagcggctgg aggaggggca gcacaacgcc agcaactaca acgcatggaa 8640 tgccgcgttc acggtgaaca cggacagcat ggacgtagaa gagccagaca ctcgggcagc 8700 accaccacaa gatgacttcg aggtgctcga cgaggcggtc atcgaagagg ctcggcgcga 8760 gttcgtacac cgcgccacgc gcaacgacat ggacgcacgc acggatgaag aagaacggtc 8820 ctttcggagg gacactactg gcagctcaag ctctgccagt acgcaagagt cccccgaatc 8880 cgcgatcgag gagccttaca ccaggccgct caccgtcgag gaagcacgcg ccgccggccc 8940 tccgccttcc atggtcgcgc ccgccccgcg catcccctta ccagcgcgcc ctcgattcgc 9000 accgacgttc acgtatcagc agcaagcgac tcacggacaa ggcgaaggca tgcccagggt 9060 cggagagtcc gcaggcgagc tgcactcgct actgcgcctc cccaacccta agcaggcctt 9120 cgctcgatac cagcgcgacg gatacctcgg gaccctcgct ctggcaggag cagcagcagt 9180 acgcaacaca gtactccctg ctctgcacag tgaggtccgc gacgccatct gcgctggact 9240 
cctgcaagac atggtcgacg cacgcagtgg gaagaccgag ctctccatcg agatgctcga 9300 gatgttctac ccgtaccacg aggaccgcga cacgccctac ctgtaccccg aggagcagat 9360 gtccttgcga cgctgcatct tcttttggac ggcgcggaac tccctcggcc agcacgacga 9420 acgcatcaag tggcttcaaa gcctcctcca cctgcagtca ggcaaccggg cagacaatcg 9480 caacgtccga cgtatgcgcc agaatgggcg ccttggccct gcaggacgcg gacgcgatgt 9540 ggtcacctac gggggatcgc tttgaagcgg taagtgcttc gtttgatgta gccactgttc 9600 cacggcaggc taacggacga tttctcgcaa tagtagagtc attgccgagg acagtcgagg 9660 atgcctcagg gtggcagcgg cgcgatagct gccaccgcgg tatgttttca tggtttccga 9720 cgtttctgtg ttttcttttg actttgtttc gattttcttt tcttttgttt gtatggccag 9780 aacccacgga cgcttacgta gccgactcgc acgcgttata cgcaaccggc tcacgagctg 9840 tatggctctt cgcaggcgag aatggactca agcgcggcct ggcacggatt cttggacttg 9900 ggactcgtac gttagacttt ttatgcgcca ctgacgcttt ttgttgtgtt tcggactcgg 9960 acagattttc tttggatttt tctgactatt ctttcgattt ttcttatctt ttttacggat 10020 ttggactcag tactacggac acggacatgg atcgcgacag ggatagagaa acggactcgg 10080 acccttgtag tactatggac tcggactcag actcaggctc ggacacggac tcatggcact 10140 tgaacgccgc acttgtacta gtcgcaggga gccttacatc aatgtatgcc ctctggactc 10200 tgtattatca gcgcaaatca agtaagcttg catgccagtc actggggaac atacaagcac 10260 aagcgacccc cggaccggcc gtcgaaagcg acagcatgct tgaagccagg aactcgatat 10320 caacgacgac tttgactatg gatacaggcc cggactccat gacggccatg atgtcatcac 10380 taccagctat gcgcgatata cagccagcgg caaaccttgc tgcggcatcc gcggacatca 10440 cgacatcagc gctagagacc ggcaccggag caccgcacgc tacccgacgc aatgccaacg 10500 attcatgcct gctgccatca acggtgggcg ctctcgctcg atttggcagc cccgacacct 10560 gcgaaaggcg catcagtgat ccacgtgacc ttggcgctta cctgaccgaa ggcttgcatg 10620 caaccagcct agagagcggt agcatgggcg ctataccttc caatgaatgt aatcagcctc 10680 aaactttcta tgccactatt caagaagaaa gtctgaacat tcaatgggac gatatagaac 10740 cggtcttagt aaccactgac atggacaagt tcatggaggg cctagacgaa gaagacacca 10800 tcattacgta tgatggtgaa tacatgtttt catccaagca gcagatgagc tctcctgaag 10860 tacttacggc ttataaaagg gttgatcgta aggtcaaacc tgtgcctgca gtattcccag 
10920 aagatgcaag agtattgcga cagtttcctg aggacccatt agcctcactg cctccattaa 10980 ccccacgacc acctactttt gaacccaatg gagggcggct aacctttgaa aacctggaag 11040 caatgaatct gaatgcaaca ggatttctat ggccagagga attgaagtta tttcagcatg 11100 tacttcaatt aaatcaagcg cactttgtat tcgaagacag ccaacgaggc tccttcaggg 11160 aagactattt ctccccgtac atcattcctg tggtacctca cgttccttgg gcattcaaga 11220 atatccctat acccttaggg attcatcaaa aagtccttga cctgctacgg gaaaagatag 11280 tcgctggagt atacgagtct agtcagtcct catatcgatc aagatggttc tgcgttctca 11340 agaaaagtgg caagctaaga attgtacatg acttacagcc gctgaacaaa gtaactatac 11400 gagatgcagg attgcctccc aatctggata actttgtgga gccatttgct ggacgacagt 11460 gttatacggt ttttgacttg tattggggat ttgatgcatg caaggtgcac ccacaaagca 11520 gagaccttac agcattcatg acaccactag ggctactacg catcacttcg ctacctacag 11580 gctttaccaa ctcaccggct gaatttcaag cgtgcatgtc ttttatactg cagcatgaaa 11640 ttcctgaata cgccaatatc tttatagatg acctacctat taaaggtcct aagagttgct 11700 ataaagataa gcttggaagg ccagaagtat tgcaagccaa tccaggcata cgacggttta 11760 tctgggaaca tgcaaatgat gtacaccgga ttattcacag agttggacat gcgggaggaa 11820 cgttctcccc ctcaaaggtt caactagcgc aggaggaagt tcttattatt ggccagaagt 11880 gtaccccaga agggcggttg ccagatagtc agaagatcga gaaggttctg aaatggccac 11940 ccttaaagac agtaaaggat gtcagaggat tccttggtct ttgtggtaca gtacgcatct 12000 ggattgagaa ctattcagct aaggcacgac ccttggcaga actcatacgc cacgatgtgg 12060 agtttgaatg ggacacacgg cgacaagcag cctttgaaga actgaaacag gcaattacgt 12120 caccaccagc cctacgcccc attgactatg actcggaacg acctgtggtg ttatcagtag 12180 actccagcat aattgcagta ggcttcatac tcgcacagta tgatgagaat ggcaagaaga 12240 gaccagcaaa atacggttca atcccaatga acagtataga atcacgctac tctcagccca 12300 aactggaatt atatgggtta ttcagagcac tgcgagctta tcgcctttac attattggag 12360 ttaagcacat gatagtggag gtggatgcga agtacattaa aggaatgctt aacgaccctg 12420 accttcaacc taatgccacc atcaataggt ggatccaggg cattctttta tttgacttcg 12480 agttagttca cgtaccagca aacaaacatc gaggacccga cgccttatca aggaaggaga 12540 tggcagaagg agaagagata gttgaagaag acgacagttg 
gttggacgac atacccctgt 12600 tgatacctga acaatgcgga ccatggatga atattgtgtc gctctacaca cgcttagtga 12660 gctcggttcc tgagcccacc atcagaactt tcaaagcatg tttgtacgag cctgacgagc 12720 ccatggtaga gcatgtgttt gcagcatcaa cccagcagca agaacgaaca ttagcagata 12780 tttcccactt tctgcaaact acccggacac ctcagtttca gactacccaa gaaaagcagc 12840 ggatcattca gaaggcatta cgcttctatg taacaggaaa ggccatgtac aaacggcgtg 12900 gtttacaagg acctgccaag gttatattcc gtgcacataa gcgtgaagaa ttgcttcaag 12960 ctgcacatga ggggcaagga catagaggag cagaagcagt catggccatg ttaaaggaac 13020 gtttctattg gcctaatatg tggcaggaca ttcgccatca tgtccagtca tgtcatcaat 13080 gccagatacg aagtgtacga aaggcacagg tgcctatcat ggtctcaaca ccatcgacac 13140 tgttcttgaa agtctatgtg gacgtcatgt tcatgcctaa gtcacatggc taccggtaca 13200 tagcagcagc acgtgatgac ttatcaggag cagcagaagg acggtcactc aagcgcaatg 13260 atgcacttag catgtccaag ttcttgtggg aacaagtctt ctgcagatat ggagcagttg 13320 gacaagttgt cactgataat ggcaaggaag tggaagcagc ttttagcgag cttatggatc 13380 gatttggcgt acctcagatt cgaatatctg cttacaactc caaagcaaat ggagtggtgg 13440 aacgaggaca tttcatcata cgcgaatcca ttgtcaaggc atgtaatggc aaggtttcag 13500 actggcctaa gcatgtttgc cacgccttct ttgcagacag gataactgta aggcgacaga 13560 caggctactc ccccttctat ttgcttcatg gaacagaccc agtcctaccc tttgacctta 13620 cagaggctac ctttatgatg gacggctaca aacgcggcat gagctcagta gaattactag 13680 ctttacgttt acgtcagtta caaaagcgcc ctgaagactt gtcaaaggca gcagaagtct 13740 tagctaagca taggttcaaa gcaaaagaac aattcgaaca tcgatatcga actcgactta 13800 tacgcgaatc atatccacct ggtaccttag tgctactgcg taacaaagca atagaggcta 13860 ctttggaacg taagcataag gataggtatt taggaccata tgaggtggta cgacaaacaa 13920 gaaacggagc atacatccta caagagctgg atggtacagt ttggcgacaa gccatagcgg 13980 cctttagact cataccctat gtgtccaggt caagcgagta tcttgaggca ataatggagc 14040 agcttcatga taatgctgag gatgagagct caggcacaga tccttggttt tctgatagtg 14100 aaagggaata tggttctggt aaatctgctg agagctctgt aaaccaggac tcctcctttg 14160 aaggcagcga atcagactcg tattcaccct catcatttta gaacggccaa ctggggacag 14220 ttgaaacttc aagcgacccc 
cggatatcat gcatcaaaca tgtttgatac aagatatgga 14280 tgaatagagt ctaattcact gctcaaagac gagcgcgaga gaacagggac attc 14334 // ID Gypsy4-LTR_AO repbase; DNA; FNG; 669 BP. XX AC . XX DT 25-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy4_AO LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy4_AO; KW Gypsy4-I_AO; Gypsy4-LTR_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-669 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-669 RA Kapitonov V.V. and Jurka J.; RT "Gypsy4_AO, a family of Gypsy LTR retrotransposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 6-6 (2006). XX DR [2] (Consensus) XX CC This is a LTR of Gypsy4_AO. Solo LTRs and proviral copies are CC flanked by 5-bp TSD. 
XX SQ Sequence 669 BP; 189 A; 175 C; 158 G; 147 T; 0 other; tgttacagac ccttgtctag cgcaccttat cgaaggcgga cacacaaggc agaataacga 60 taggatgacg atggccgatt cctgagtgga aggcacccga cgaatcgttt atctagtagt 120 tattcggaaa gggctacgga cgccgatacc agctaccaca acaaagtcac gtgcatcgta 180 gcagtgacac gaccaaacgt atcagcgaca cgacaagctc gtagcactga caagatacat 240 cgtatcactg acacgaagag tcgtagcagt gacacgacgg gcaaggatat ataaggacac 300 tcatttatgt acttaagttt agttcttcgt gatcaataca catggttact acagagttac 360 ctttagtagc aagaccagtc tcaactcgtt cgcactcttc taactattaa gtatacgtac 420 ttaataggcc tggccctttg gactcgagtc tagtatcgac ggcagtcttg tgtcgactgc 480 atccctgtga cgacggcagt ctagtatcga cggcagtcta gtgtcgacgg aagcctcata 540 acgactcccg tcttccccag accgcaaccc aacagcgacg gaggtcaaag aacgacttca 600 ggttagtaac cctcgaaacg tggtccgcta actggtctca gacttgacga cgaagccccg 660 ggcgtaaca 669 // ID Mariner-5_AF repbase; DNA; FNG; 1981 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of Mariner DNA transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-5_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-1981 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-1981 RA Kapitonov V.V. and Jurka J.; RT "Mariner-5_AF, a family of Mariner DNA transposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 102-102 (2006). XX DR [2] (Consensus) XX CC This is a family of DNA transposons from the Mariner superfamily CC (Tc1 clade). The genome harbors six copies that are 98% identical CC to the consensus sequence. 
It encodes a 372-aa Mariner-5_AFp CC transposase (pos.590-1705). XX FH Key Location/Qualifiers FT CDS 590..1705 FT /product="Mariner-5_AFp" FT /translation="MVIEMWFHNLKVLMEQNYIHLRDLYNFDETGFQIGIG FT KDQWIVTREFKKPSFSPSNTNQEYTTVVEAISADGHFIPPFIIFPGKCILA FT GWFDVCDEPDYIIGLSDSGYINDILAFQWIQHFEHHTRRRMLGVKRLLLCD FT GYGSYMTYEFIDFCEKNNILLYFLPPHTSHILQPLDVGVFHAYKHWHSEAI FT EDATQTGCGKFTKVEFLSALFEIRRRTFKQRTLKHAFRLTGLNPWNLSVAL FT ERLQDSKLFTDESSSNSSFLITNTPQTARQIDRFNRHLLDLSPTEEDSFHT FT TLSKLVKAAKTQAILVEHLTERVRESDAAKLARQRCVQASRAHLQIGGIMR FT KEEVTCMKRIRQDYDELVEKNRLRPQWRK" XX SQ Sequence 1981 BP; 571 A; 450 C; 435 G; 525 T; 0 other; accgacccct tcgacgtagc ggcttagtaa gactaagcct tagtctacgg ggtcgcgcta 60 cgcgcgaccc tgccatctca atgaaaaata tcatctcaac gataaaagct tcaaatgtca 120 tatcctttct agaatttcta ttatttctca accacaacca tgccgaatga ctattatgag 180 atagaagatc aaattgagag ggctctgcgc gtattacggc gtcaagaaaa gccaaatatc 240 tcaaaaactg cgcgagagtt taaagtccca atgcagaggc ttcgacgccg cttccttggg 300 actccttcac ggtgtgacag ggccccaaca aatacaaagc tttccacaga gcaagaggcg 360 gctctcataa agtatattaa tatacttatc aagctagata ttcccccgcg gccgaaggca 420 atcagcaatg cagcaaattc aatactcttt cgtggacata ctgatccgat aacccccccc 480 cccttcaatt ggcgtgcatt ggacgaaacg cttccttgaa cgctatccag aatatcgagt 540 acgaaggcag agagctattg aattagagcg gaagcgggca cacgacccta tggtgattga 600 gatgtggttc cataacctga aggtgttgat ggaacagaat tatatccatc tacgagacct 660 ctacaacttt gatgagacag gcttccagat tggaatcggt aaagatcaat ggatagttac 720 tagagagttc aagaagcctt ctttctctcc tagtaacact aatcaagagt atacaactgt 780 tgtggaagca atcagcgcag atggccactt tattccaccc tttattatct tcccagggaa 840 gtgcatctta gcgggatggt ttgatgtctg tgatgagcca gattatataa ttggactgtc 900 agattctggc tacataaatg atatccttgc cttccaatgg attcagcatt ttgagcatca 960 tacacggcgt cgaatgctag gggtaaaacg cctacttctc tgcgatggat atggatccta 1020 tatgacctac gaattcatcg atttctgtga gaaaaacaac attctcttat acttcctccc 1080 tcctcataca agccatattc ttcagccgct ggatgttggt gtattccatg cctataagca 1140 ctggcatagt gaagctattg aggatgctac ccagacaggc 
tgtgggaagt ttacaaaggt 1200 ggaatttcta tccgccctct ttgagatacg acgcaggact ttcaaacagc gtactttaaa 1260 gcatgctttt aggctgactg gcctaaatcc atggaaccta tcagtcgctc ttgaaagatt 1320 acaggattcc aagctcttta ccgatgagtc gtcaagcaat tcctccttct taatcaccaa 1380 taccccccaa acagcgcgtc aaattgatcg atttaatcgt catcttcttg atctatcccc 1440 cactgaagaa gattcattcc ataccactct ttctaagctg gtaaaggctg ccaaaacaca 1500 ggcgattcta gtcgaacatc ttacggagcg ggtgagggaa tcagacgctg caaaactagc 1560 ccgtcaacgc tgtgtgcagg cttctagagc tcatctacag ataggtggaa tcatgcggaa 1620 ggaagaggtc acatgtatga agcgaattag gcaagattat gatgagctgg tagagaagaa 1680 tcgcttacgg cctcagtgga ggaaggtaat ggcggaactt caagagtact gccttgccaa 1740 aggaataatt attcgaaagc ggagaaaagt gcgaaaggta gtttaaacct agtagtcatt 1800 atttctatgt attttactag taaattcaga caggaaggtt gcctatttct tcatgatatt 1860 tgaatgttga taaactttat cgttgagatg atatttttca ttgagatggc agggtcgcgc 1920 tacgcgcgac cccgtagact aaggcttagt cttactaagc cgctacgtcg aaggggtcgg 1980 t 1981 // ID Gypsy-119_MLP-I repbase; DNA; FNG; 5762 BP. XX AC AECX01000800; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-119_MLP_; KW Gypsy-119_MLP-LTR; Gypsy-119_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5762 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000800; Positions 126242 120481. XX CC Positions [4521-5000] - Integrase core CC 'AATGA' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 394..1446 FT /product="Gypsy-119_MLP-I_1p" FT /translation="MEGSQQSGAEPTSDPMLALLAQMEARLKEMDAKLSEE FT TRLRQHSDRELAELRAERHRTSGVETSTTTTAPTVIQTTKTPKIATPDKFD FT GTKGRKAEIFANQVGLYILMNPALFPNDQIKIGWTLSYMTSKGAEWAKPVT FT QRLISGNEEDKMTWDGFVKHFEATFFDSERVAKAERSIRSLKQTGTVLAYS FT LKFGDLAPIVNWPDNILISQFESGLKPKIQVLIVRDEFKTLDQISELAIKI FT DNKLHRRGDDPIIDTKDTTTPLDPNAMDCSAMSFQISKEEYRRRTEGNLCY FT KCGRNGHRARECGGSYFRRGWKGKMRSGNEAKISTSEVKNEESVKEGRAEE FT SKNGVAQG" FT CDS 1446..3170 FT /product="Gypsy-119_MLP-I_2p" FT /translation="MKVAPTLSLSGDNSNIDLGAINIEIDACAMNDNRVFA FT TIPIFDPSQEAKFFACAMFDSGATHDVMNEKFVRQTKISTTKLEKPKPVTG FT FNGSQSFITHVAHHILKITNEPTPTTFLISRLMDNIDCIIGVEWISKNHEL FT INWRERTLKTPTRESSIAVACAALPGPKTTPITSRVESKEHARQSGKGVCV FT EVDTLTPPQCEFESLDSTLSVKTMSKHFSLPNRYVNRTDNNITKVEELTEE FT AGMHVATNKLVSSQPKKALTDHQDGLGHARKDDEGVCVKIDALAPPQRECD FT RISVNDSKKPVNKQNFSMNFTHRTRTPQLNSTRSYLPKPPVLKTPQLDAAR FT ASWNISAKLAAEKTEQVKKVSAADLVPEEYHEYLSMFEKSKSRVLPPHRPY FT DFCVDLLPGATPQASRVIPLSPAENEVLEEMIEEGLTTGTIRRTTSPWAAP FT VLFTGKKDGKLRPCFDYRKLNALTVKNKYPLPLTMELVDSLLNADRFTALD FT MRNGYNNLRVREGDEAKLAFICKAGQFKPLTMPFGPTGAPGYFQYFIQDIF FT GNRIGRDMPALLDDIMIRQDQGKTMRMR" FT CDS 3288..5108 FT /product="Gypsy-119_MLP-I_3p" FT /translation="MVKSKVKAVTAWPVPKNTGEVQQFIRFANFYRRFIED FT FSRIARPLHELTQKGIIFEWTPARDEAFRTLKDAFTKAPVLKIADPYKQFV FT LECDCSDYALGAVLSQVSDEDSELHPVAFLSRSLAQAEQNYEIFDKELLAV FT IASFKEWRHYLEGNPQRLNVIVYTDHKNLESLMTTKELTRRQARWAETLGC FT FDFEIRFRPGKKSAKPDALSRRPDLKPTDDCKLTFGQMLKPSNLPSDAFIA FT ELDIADNWFVPEDDWSQMFDDEYDKLHKGEAMVYALNKENPEKVWDDVLIM FT NKIREKSKEDKRLEQLMYICQQQEDSSWEEYTYVDGVLYFKGEVEVPNDNE FT IKIQILKSRHDGILAGHPGRMKTLRLVKQQYHWPSMKAYINAYVDGCHSCQ FT RVKTRTSKPFGSLQPLPIPAGPWTDICYDLITDLPISEGKNCILTVVDRLT FT KMCHFIPCTTTMDSKELATLMIKHVWKYHGTPKSITSDRGNIFISKITKEL FT NHQLGIVTQASTAFHPQTDGQTEIANKAVEQYLRHFVSYKQDNWTTLLDMA FT EFAYNNSPHTSTGISPFKANYGFDVSYSRIPTDEQCIPAHIRGYLQMSNVS FT QLLKTCLNS" XX SQ Sequence 5762 BP; 1933 A; 1227 C; 1292 G; 1310 T; 0 other; tattgaaccg tccataatca aaatcagaca 
tcagagactc aagtaattca agtagttaga 60 agaaaaatca aattacaagt aaagaagaag aaaatcgaag aaaatatcat cagaagttat 120 tagatattag aagcaaagtt taaagtttta aagttaataa acgcaatcaa tccagatctc 180 aaaccttaaa gacattgatt cagtggcaga acaattctca atcccgcact acccccgacc 240 aaccttattc ttatccagtg aactcaagtc gattcaaact taatctgaaa aaccgcaagc 300 accacgccaa acttcacatt accccggagt cggcccacta gcctgtcgaa cagtacagcc 360 ttctcgtcag ctgtaccatc ccgagcaaac agtatggaag gttcacaaca atcgggcgcg 420 gaaccaacat ccgaccccat gttagctcta ttggcgcaga tggaagcgcg ccttaaagaa 480 atggatgcta agttatccga agagacgcgg ttacgccaac attctgatag agaattggct 540 gaattaagag ccgagaggca ccgtacttca ggagtagaaa cctcaactac gactaccgcc 600 cctacggtaa ttcagacaac taaaactcct aagatcgcca caccagataa attcgacgga 660 acaaaaggga gaaaagcgga aattttcgct aatcaagtag gactttacat cttaatgaac 720 ccagcactct ttcctaacga tcagattaaa ataggttgga ctttatctta tatgacaagt 780 aaaggtgctg aatgggcaaa gccagtcact caacggctga ttagtggtaa tgaggaggat 840 aagatgacgt gggatggatt tgttaagcat tttgaagcaa cattttttga ctcggaaagg 900 gtggcgaaag cggaaaggtc cattagatca ctcaagcaaa ctggcactgt tctagcgtac 960 tcacttaaat tcggcgatct agcccctatt gtgaattggc cagacaatat cttaattagt 1020 caatttgagt caggccttaa acccaaaatt caagtattga tagtgaggga tgaattcaag 1080 acattagacc agatatcaga actagccatc aagatcgaca ataaattaca tcgaagggga 1140 gacgatccaa tcattgatac caaagacaca actacgcctc tagacccaaa tgccatggac 1200 tgttccgcca tgagtttcca aatatctaag gaggaatacc gccgtcgaac tgaagggaac 1260 ttatgttaca agtgtggaag gaatggtcat agagcaagag agtgtggggg aagttacttt 1320 agaagaggat ggaaaggaaa gatgaggagt ggaaatgaag cgaagattag tacatctgaa 1380 gtgaagaatg aagagagtgt gaaggaagga agagctgaag agtcgaaaaa tggcgtagct 1440 cagggatgaa ggttgctcct accctgagct taagtgggga taattcaaat atagatctag 1500 gcgctattaa cattgaaatt gatgcttgtg caatgaatga taatcgtgtg ttcgccacca 1560 tccctatatt tgacccatcc caagaagcaa aattttttgc ttgtgccatg tttgattcgg 1620 gcgctaccca tgacgttatg aatgaaaagt tcgtccgaca aaccaaaatc tcgacaacca 1680 agcttgagaa accaaaaccg gtcacagggt tcaatggctc tcaatccttt 
atcacccatg 1740 tggctcatca cattttgaaa atcaccaatg aaccaacacc caccactttc ctgatctctc 1800 gcctgatgga caacatcgac tgcatcatag gtgtagaatg gatcagtaaa aaccatgaat 1860 tgatcaactg gagggagcgt acattgaaga cacccacaag agaatcaagc attgcggttg 1920 catgtgcagc cttgccagga ccgaaaacaa ctcctatcac gtcccgagtg gagtcgaagg 1980 agcacgctag gcaaagtggc aagggggtgt gtgtcgaagt tgacacgtta acacccccgc 2040 aatgtgagtt cgaaagcctt gattccacac tatccgttaa aacaatgagc aagcatttct 2100 ccctcccaaa tagatatgta aaccgaaccg ataacaatat cacaaaagta gaagaactca 2160 ccgaagaagc agggatgcat gttgcaacca acaagttggt ttcgtcacaa ccgaaaaaag 2220 cccttacgga ccaccaggat ggactagggc atgctaggaa agatgacgag ggggtgtgcg 2280 tcaaaattga tgcattagca cccccgcaac gtgagtgtga tagaatttca gtcaatgatt 2340 caaagaaacc agtgaacaag caaaattttt ccatgaattt cactcacaga actaggacac 2400 cgcagttgaa ctccacacga tcttacctgc cgaaaccgcc agtgctgaaa acacctcaat 2460 tggatgcagc gcgagcgtca tggaatattt ctgcgaagct ggcagcagaa aagactgagc 2520 aggtaaagaa ggtatcagct gcagaccttg tcccggaaga ataccacgag tacctgtcga 2580 tgtttgagaa atccaaatcc agagtcctac cgccccaccg cccctacgat ttttgtgtag 2640 atctactacc aggagcaacg cctcaagcaa gccgtgtcat tccattatca ccagcagaaa 2700 atgaagtgtt ggaagagatg attgaagaag gacttacgac gggtactatc cgcaggacaa 2760 cttcgccatg ggctgctccg gttttattca caggcaagaa agacgggaaa ttaagaccct 2820 gttttgacta tagaaaattg aacgcactca cagtgaagaa caagtaccca ctaccgctaa 2880 ccatggagct tgtagatagt ttactgaacg cagacaggtt caccgcacta gacatgagga 2940 atggttataa taatttacgt gtaagggaag gggacgaggc aaagctggct ttcatatgta 3000 aagcgggtca gttcaaacct cttaccatgc cgtttggccc gacaggagca cccggctact 3060 tccaatactt catacaagac atctttggca atcgtatagg cagggacatg ccggctctct 3120 tagatgatat aatgattaga caggaccagg ggaagaccat gagaatgcgg tgaaagaagc 3180 actggaaaca ctgaacaaca aatgtatggt taaaaccaga aaagtgcaag ttcaatcagt 3240 ctgaaatctc atacttaggt ctttggttat cacacaacaa aatctctatg gttaaatcaa 3300 aagtcaaagc cgtcactgca tggcctgtcc cgaaaaacac aggggaagta caacaattca 3360 tcaggtttgc taacttctac agaagattca tagaagactt ttcacgaatc gcacgaccat 
3420 tgcacgaatt gacgcagaag ggcatcatct ttgagtggac accggctaga gatgaagctt 3480 ttaggacttt aaaagacgcg ttcactaagg caccagtact caagatcgcg gatccttaca 3540 agcagtttgt cttggagtgc gactgttcgg attacgccct aggagcagtc ctgtctcaag 3600 tatctgacga ggatagcgag ctccaccctg tagccttctt atcacgctcg ttggctcaag 3660 cagaacaaaa ctatgagatt tttgataagg aattactagc ggtgattgct tcgtttaaag 3720 agtggagaca ttatttagaa ggcaatccac aacgtctgaa tgtaattgtt tatacggatc 3780 ataagaattt ggaatcattg atgacgacga aggagttaac ccgccgtcaa gctagatggg 3840 cggaaactct tggctgcttt gattttgaga tacgattccg tcctgggaag aaatcagcaa 3900 aaccggacgc cttatctcgg agaccggacc tgaagccaac cgatgattgt aaactgacgt 3960 tcggacagat gctcaagcct agtaatctac caagtgatgc tttcatagca gaactagata 4020 tcgcggacaa ttggtttgta ccggaggacg actggtccca aatgtttgat gacgaatatg 4080 acaaacttca taagggagaa gctatggttt atgcactaaa caaggaaaac cctgagaaag 4140 tatgggacga cgtgctgatt atgaacaaaa taagagagaa gtctaaggaa gataagcgat 4200 tagaacaact aatgtacata tgtcaacagc aagaagattc aagttgggag gagtatactt 4260 atgtggatgg agtgctctat ttcaaaggag aggtagaagt gccaaatgat aatgagatta 4320 aaattcaaat cttaaagtca cgacatgacg gcatattagc aggtcatcca ggaagaatga 4380 agacgcttcg actggtcaaa cagcaatatc attggccctc aatgaaagcg tatattaacg 4440 cctatgttga tggatgtcat tcttgtcaac gtgtgaaaac aagaacgtca aaaccctttg 4500 gaagccttca accactccca ataccagcgg gaccgtggac tgatatatgc tatgatctca 4560 tcaccgacct tcctatatca gagggaaaga actgtatatt aacagtggtg gatcggttaa 4620 caaaaatgtg tcattttata ccatgcacaa caacgatgga ttctaaagaa ctagctacac 4680 tgatgatcaa acatgtatgg aagtatcacg gcacccccaa gtcaataaca tcggacagag 4740 gtaacatttt catatcgaaa atcacaaaag agctgaatca tcaactagga atagtgacac 4800 aggcgtcgac agcattccac ccacagacgg atggacaaac tgagatagca aacaaagcag 4860 tcgaacaata tctccgacac ttcgttagtt acaagcaaga taactggacc actttgctcg 4920 atatggccga atttgcatat aacaacagcc cacacacctc aaccggaatt tcaccattca 4980 aggctaatta cggattcgat gtttcatatt cgcggatacc tacagatgag caatgtatcc 5040 cagctcatat tcgcggatac ctacagatga gcaatgtatc ccagctgttg aagacatgct 5100 
taaacagcta gctgacgttc agaacgaatt gaaagactct ttagccttag cacaagaaac 5160 aatgaagcat aactacgaca agaagacaaa ggcaactcca aattgggaag tcagcaccaa 5220 ggtttggctg gactcaagac acatatcaac aactcgaccc agtgcaaaat ttggccataa 5280 gtggttgggg ccgttttcaa tatctcagcg aatatcagag aatgcttaca aactgacatt 5340 gccagaagca atgaaaagag ttcatccagt cttttctgta ggactattac gacgttacga 5400 accaagcagc atcaatgggc aagtccaacc tccaccagcg cctgtcgtca ttgacgaaga 5460 aaatgaatat gaagtcaacg aaatcctcaa caagagaaag tgtggatcat caattgaata 5520 tttagtcaat tggaaagggt acggtgcaga ggaagattcg tgggagccag aaagaaattt 5580 aaagaacgct caagatatag ttcaagattt taatattaga taccctgatg cggagaagaa 5640 ttataagagg acaagaaggg taaagtgagg gcaatgcttt ttccccacgt gggtttttta 5700 atgctagccc ggggaaagac gtcagctcag caagagggag ccggacgtaa agggggaagt 5760 ag 5762 // ID EGRT1 repbase; DNA; FNG; 687 BP. XX AC X86077; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 18-JAN-2011 (Rel. 2.02, Last updated, Version 4) XX DE E.graminis mRNA for a retroposon-type repetitive element. XX KW Non-LTR Retrotransposon; Transposable Element; retroposon; KW Repetitive element; EGRT1. XX NM EGRT1. XX OS Blumeria graminis OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Erysiphales; Erysiphaceae; Blumeria. XX RN [1] RP 1-687 RA Wei D.Y., Collinge B.D., Smedegaard-Petersen V. RA and Thordal-Christensen H.; RT "Characterization of the transcript of a new class of RT retroposon-type repetitive element cloned from the powdery mildew RT fungus, Erysiphe graminis."; RL Mol. Gen. Genet 250(4), 477-482 (1996). XX RN [2] RP 1-687 RA Thordal-Christensen H.; RT "EGRT1."; RL Direct Submission to Genbank (04-APR-1995)H. Thordal-Christensen, RL The Royal Veterinary & Agricultural Univ, Section for Plant RL Pathology, Dept of Plant Biology, Thorvaldsensvej 40, DK-1871 RL Frederiksberg Copenhagen, DENMARK. XX DR GenBank; X86077; Positions 1 687. 
XX CC This sequence is a putative master transcript of EgR1, a CC repetitive element from the fungus Erysiphe graminis f.sp. CC hordei. Sequence analysis of the cDNA revealed that EgR1 is a CC member of the retroposon superfamily with properties in common CC with SINEs and LINEs. SINE-like properties include the transcript CC size (approximately 700 bp) and the lack of major open reading CC frames. In contrast, the fact that the transcript is CC polyadenylated and is most probably transcribed by RNA polymerase CC II suggests a functional relationship to LINEs. A constitutively CC high transcript level is found throughout the asexual life cycle CC of the fungus. Small differences in band patterns of Southern CC blots were observed between two isolates of E. graminis f.sp. CC hordei, while the band patterns in an isolate of the very close CC relative E. graminis f.sp. tritici in general appear dissimilar. CC This may imply that the element is currently active. Recent CC dispersal is confirmed by the observation that an approximately CC 550 bp internal HinfI fragment is conserved in the majority of CC the copies in all three isolates. Approximately 50 copies are CC present in E. graminis f.sp. hordei. 
XX SQ Sequence 687 BP; 195 A; 170 C; 156 G; 166 T; 0 other; ctcaacacct gtcagtggtg tcccctacga acctccagct ctcactgtag agtctgcaga 60 gccaagattg agtaacaacc tcctcacttc gatgaggatt cccagtcgcc ttcgcgatct 120 gtatcgtctt catttctcat cacatccccc tatcaccatt atcatgaagc taaccacgat 180 cagatcagac gttagggtcg aagcccttct cgtcaccaac ccttgatcat ggaggaaaag 240 ccatccgatg agtttccgga gcgaacccag cacccgatac agagatggtg gattggatca 300 agatggcctc gaatgctaca aagaagggag atatagatga atatgcgcac ctcatgtggt 360 cccggtgtct aggcctcgcc tggctggggc tgcaggtttt ttgagacttt ttcccgcgca 420 cgcagtttcc attgctacca tttttatttt tttttgggta gaaggttccg tggaaaaggt 480 ggctgaattc cacgggtaaa tactgagctg aatggctatt catcatgggc aatatatcaa 540 tgatcttaca agtaacaaga gaggacagga gcactgtaca aggtgtggcc tgggccagag 600 aggagcccaa tgtgctagat agtcgaagac tcaagtagta caataaaaac ccacccactc 660 attactccaa aaaaaaaaaa aaaaaaa 687 // ID Copia-4_TMe-I repbase; DNA; FNG; 4832 BP. XX AC CABJ01002876; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_TMe_; KW Copia-4_TMe-LTR; Copia-4_TMe-I. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-4832 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01002876; Positions 13039 8208. XX CC Positions [1860-2381] - Integrase core CC 'GAATG' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1038..2609 FT /product="Copia-4_TMe-I_2p" FT /translation="MMDEQMWMTEDVENWKVKATQQMTVGGLGEQVVAGAA FT RRFCSVMTDTRNVDSSDLGLTTDKALIDKAWIGDAEENEGIWVLDSRCTSH FT ITNHKDLFLKETFRDLPRGTRHIKTATGKVVPAVGIGNVQLPIWIPERGRG FT TVQLCDVLYVPDAGLTNLISVLHLVEKKIKILFTSNIIEVYKDGLLSAIAI FT KSNRMFTLVTREIPIEAAFIMSDSTQLTLWHKRFGHLYADAVLKMSNKQLV FT NGMPVIQSTNGRHRCESCLEGKMTRLLFRRAEHRTSAPLEIVHSDSCVPMK FT HKSIGGSSYFMLLIDDYTKFTAVYFLKKKSEAAECFKNYRTHVEKVHSQNG FT GGYVIKTMRTEGGGEFTGATFLRELEKNGIEAHTTVPYTPQEDGISENRNR FT VLIGRANALLKQAGAQNSYWAEAIQTTAYLKNISITKGTHGMEATPFELWF FT GKKPKVEHLRIWGRTAYAYIHPANRADKKWSPRAEKLIFVGYTMMTKQFST FT KILHFFNKQRTVYSCRWKTPKRKSNLSP" XX SQ Sequence 4832 BP; 1568 A; 944 C; 1215 G; 1105 T; 0 other; ggttatgagc cactacgcat cagggtaaaa atacagagca cgtttgaggc ataccgattt 60 ctttcatacg tcaaattttt atttgttttt cagaaactgc gagagtagct gtaagaaaag 120 aaaagtgtcg taacgccgca atgagtgaac aagatggtat accattgggc aagtcaggga 180 tgatgataga gaaattggat gggacaacct accgcatgtg gtctataagg atgggtgcct 240 atctactcag agaaggtctt tgggatctta cttctggagt agaggagatt atgatagcac 300 ccgcaaaaga aagcaacaac ttcaaggaag accagcagca gtacatgaac cagagaagac 360 gtattcaaaa ggccaatggc gatttgaaac ttgcgatggc agattcgatt attatcaact 420 acaatgaatc gttttgggat atgccacaga gaatctggga cgacatcaag cggaactatc 480 aggtttaagt cagctacgac gccaaccact tacaggtggg actctatgaa tgtcagctca 540 aggagtgcgg aacaatactc aactatctca acaaacttaa agacattaaa gataaactat 600 ctctttgcgg tcaaataccc acaatgtcac aaatgatttt ccatgttttc cacagactcc 660 ccaagaccgc tgaatggaaa gcttggacct acgttatgga accacaattt tccaccacca 720 tcacgaactc cgactatatc agactacagt ctcagctcaa ggcgtctgaa gccaaattca 780 agagagacaa gtcaatcgaa cctggacaag ccttatttgc atcgagtaga agctcaagat 840 ggaaacaacc gaatgaggat tccggaaaca gcaagtccaa tacaagacat agctaaccta 900 ctagttcaaa acagttcacg agaaattgct ataagtgtag caagaaaggc taccgcgaga 960 gggaatgctg gtcaaaaggg acgagcagaa agaagaaagg tggaaaccta cagagcagaa 1020 acacagatag tgtggggatg atggatgaac agatgtggat gactgaagat gttgaaaact 1080 ggaaggtgaa 
ggccacgcag cagatgactg ttgggggcct gggggagcag gtagtagctg 1140 gagcagcaag acggttttgt agcgtgatga ctgatactcg gaatgttgat tccagtgatc 1200 tgggtctgac tactgataag gccttaatcg ataaagcctg gattggcgac gcagaagaga 1260 atgaagggat ctgggtgtta gactccagat gcacaagcca cattactaac cacaaagatc 1320 tttttctgaa ggaaacgttc agagacctac ctagaggtac gaggcatatt aaaactgcca 1380 cgggcaaggt ggtaccagct gtaggaatcg ggaatgtaca gctaccaatt tggataccag 1440 aaagagggag aggaacagtc cagttgtgtg atgttctcta cgttccagac gctggactga 1500 ccaatctcat ttcagttctc catcttgtgg agaagaagat caagattcta ttcaccagca 1560 acataataga agtgtacaag gacggattac tttctgcgat tgcaattaaa tcaaaccgga 1620 tgtttactct agttactcgt gaaattccta ttgaggcagc atttataatg agtgacagta 1680 cacagttgac tctatggcac aaacgtttcg gacacctgta cgccgatgct gtattgaaga 1740 tgagtaacaa gcagcttgtc aatgggatgc cggtgattca gtctacgaat ggtcgacatc 1800 ggtgtgaatc ctgtttggaa ggaaagatga cgagactact attccgccgt gcagagcaca 1860 ggacatccgc accactagag atcgttcact cagattcgtg cgtaccaatg aagcataagt 1920 caattggtgg aagctcctac ttcatgcttc taatcgacga ttataccaag tttacggctg 1980 tgtatttcct gaagaagaag agcgaggcag ccgaatgttt taaaaattat agaacccatg 2040 tggagaaagt gcactcgcag aatggaggtg gttatgtaat caaaaccatg agaacagagg 2100 gtggaggaga gtttaccggt gctacattct taagagagct agagaagaat gggattgaag 2160 cacacaccac cgtaccatat acaccacaag aagatggtat ttcagaaaat aggaatcgcg 2220 ttcttattgg aagggcaaat gcattactta aacaggccgg cgcacaaaac tcgtactggg 2280 cagaagcaat tcaaacgaca gcttacttga aaaacatctc aattacaaag ggaacgcatg 2340 ggatggaagc aacaccattt gagctttggt ttgggaagaa accgaaagta gaacacctac 2400 gcatctgggg ccgtacagcc tatgcctaca ttcatccggc caaccgagca gataaaaaat 2460 ggagcccaag agccgagaag cttatatttg ttggatatac catgatgacg aagcagtttt 2520 ctacgaagat actccatttt ttcaacaagc aacgaacagt gtattcttgc cgatggaaga 2580 ctccgaagag aaagtccaac ctatcaccat gaaacctcaa cacactggcc ctaccacgag 2640 gagtatggcg agggctaagc gtgaagaaga aggagaagct agtagtaaaa gtggaagctg 2700 gagtagtagt agaacgagaa gatgcgcagg acgagaatga ttgagatccg attacgaagc 2760 tccaaatcag gtacctgtga 
ggaaaggtgt aacagtgact ggaagtccac ctgttgaagg 2820 agaccctgtt gctgattatg atgtaaattc acaagatgca ctcgcacaag ttgaacacac 2880 ctggaggaag gtggattctc tacattctaa actcgatgaa tcccagccaa ggaatgagag 2940 gagagcacaa gcatccgtgg agagacagcg gatatttgaa ctcaattccg gcactactgg 3000 gggagttgct ggaatagacg aattggctaa aagaagacaa gacgatgagg acgacgcgaa 3060 cacagttttt gagcctgatc ttgtttttat ggtcaacgat ggaccaggca gtattgaaga 3120 ggccttgagg tgtccaggcc atgaccattg gattaatgca atcaataacg aactcaaatc 3180 gcttgagaat cacgagacat gtggagtgat agaccctgga agcctaccga ttgaagcaag 3240 accaattagt tcacgaatgg tactacaaga gaaggttgga gaggacggaa aggttgcacc 3300 gtacaaagct aggcttgtag catgcggatt ccgacagaaa cccggaatcg attttaccga 3360 gacatattct cctacaatct catacgctgt aatacgtatt atcctgttca aggtggcagc 3420 ggaggataag gaaattactc agttggatat tgtgaccgcg tttcttgaga gtcgaatcga 3480 tgaagaacta tatctgaagc tttctaagaa ttctttcatg acaaaagatg gaagcgtgag 3540 acttatgacg caggattcag gtagaaaggt gaagaaaccc gatacacaga tcatcgtaaa 3600 gttgaaaaga agcctttatg gtttaaagca agctggacac aattggtatc acatgctcga 3660 gacctacttt atcgagcaaa tgaagataaa accgtcactc tacaaagcag gaatctatgt 3720 ctcagacacc ggagcgatgg ttatcgcgtg ggtggatgat cttctgctta ttggaaaaaa 3780 gggagaagtt acggaaataa agaaccgtat acgtgagcgt ttcaggtaaa agacttggga 3840 aatgtcagat tttttctcag catgcttgtc gaacacgata gggataagag gacatgtatc 3900 tcagccaaca aacctacctg tcgaaaattc tcaccagatt ctctatggaa cactgtaaag 3960 ggtgctcgac acctcttgat cctaagatta aattgcacca aagaactgaa gtggaggaga 4020 gcgccaacct acgtacttat caggaggtag taggaagtct gacgtatgca gccattacaa 4080 caaggccaga catcgcctac gctgccggac tcgttggaca ttttgctgca aatccatcta 4140 ctcttcactg gcaggtggta aaaagaatac ttagatatgt tcaagcgaca aaggaatacc 4200 gtctgctgtt gggtagagga gcaagtaagc aaacaagcga tctagtactg acagtctacg 4260 ccgacgccaa ctttgcgggg gagattgacg gaatgaagtc tacaactggt ttcgtgatca 4320 ttgacccata cggagcgatt gtcaactgga aatccctaaa acagagaacc atggcaaagt 4380 ctacggcgga tgcagagttt aacgcgacag cgctagcagc cgaggaagga atctggctac 4440 aaaaggtcca aaaagagctc tacccgtcaa 
tttactggaa gcaggaagag aaggagagac 4500 cacacatcag aatattttat gacaatcagg cctgtattgc tagccttaca aatgggaagt 4560 ttagggcatc tacgaggcat gtcggtgtac gttatttctg gctgaaggaa atagttgaga 4620 aaggggaggc agagatagag tatatcagga cagatgagat ggtagcagac ggattgacga 4680 agtcactagc aaagacaaag catgagcaat gtgtagcgat gttaggcatg tactagactg 4740 aaggtcatgg aaccaatatt tttatttttt ttctcttttt atagtcaaaa agtttttttg 4800 tttttgagcg cactaggtgg gattaggggg ag 4832 // ID PiggyBac-1_FO repbase; DNA; FNG; 2063 BP. XX AC . XX DT 18-MAY-2011 (Rel. 16.05, Created) DT 18-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac-1_FO. XX OS Fusarium oxysporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; OC mitosporic Hypocreales; Fusarium; OC Fusarium oxysporum species complex. XX RN [1] RP 1-2063 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). 
XX DR [1] (Consensus) XX SQ Sequence 2063 BP; 533 A; 517 C; 491 G; 522 T; 0 other; ccctttgact gtcgtaagcc tccaccttcg tacctggccc cacataaccc ccgccgaacc 60 ttaggtacat ggagtgactt tgagaaccac caaaatcaac cccgcccatt tcatcatata 120 tctcatttta tgagttctca tggattccca acagctcatc agccaagcag tcaacgcata 180 tgatgataat agcttcatcc gtgtcgatga tagccgccgc gagcagcccg ctgataacac 240 ctgccagcca atacatgatc ctattcttga tcgtggctac aacttcgagg ccatggtcgt 300 tccaacccgg ccctttgaga taacggccct gccctcagag ccgctgttgc ttttccagca 360 attcgtgcct atttccttag tggaaagctg ggtcagtcac cggcttgagc agcatgaaga 420 tggcacccaa agactacatc cgcaatcgcg gctgctagct tggaagccga cctctactgc 480 ggaaatctac gtatggcttg cgacgttgat atatatgaga atacataagg agccaaacat 540 cgaggattat tggaaggttt caaaacccaa ggacattcgc ccagttcatc cagtagtcaa 600 atggatatct tataatcgct tccaattact atctcgacat cttcatatct atgatcctct 660 gactctcaat atagatgata tgagcttcta tggccgtaca ttcagtcggg ttcatgcatg 720 gtctgaccat atacaacata tatctaccgt cttctatata ccgggcacat ctattgctgt 780 agatgagtgt gtggtccgtt ttctgggaag atctcttgag acaacaacta tcccttccaa 840 gccaatcccc acaggcttta aagtctggac tgtagctcag cgaggctatt ttcttcggtg 900 gatttggcat aggccgggcc ggaagttcgg gcctgttggg gttaggccga cctatcggcg 960 cctgccctca cagctgcgga tgatgcgaca gcgccggcag cttcgagaga gaaacacgat 1020 ccatctgaac ccgactcaag ctgtggttat tgccttggtt aatttactgc cgaaatccac 1080 gtaccatgta tttctcgata atttgttctc ttcctgcgat ctattccgga gacttcggca 1140 gcgtggccat ggagccacgg gcacagcccg caagaactgt ggcatctaca agccccttgt 1200 caagctaaag gctatagata atacagctgc tggtagcctc gaatttaatg ttgttaaggc 1260 tatcccaaca gccgataaca aggtcaacca gatcgcttgg aaagacaatg cgcttgtgtt 1320 atttctgacg actgtgttta agggcaacga aaggcttgac tgcatcagga agaaaccaac 1380 aacagaccaa atgcaaacac gaccaataca gcgttttttt tggtgacgac cctgttaaac 1440 aggtttctat cccatctatt gctgctatct ataacaacga gatgaatgcc gtggatcgag 1500 gtgaccaaat gagagcttat tggggccttg accgtcgtgt gcgccgaggc ggctggaaag 1560 ccctggcttg ggactttctc ctggaaattg cccttgtaaa tagcttcatt ttgcagcagc 1620 gagggcagcc 
gcaatggaag gcagagattt cgcaaggcag ttggcggcaa cggcttgtga 1680 atgatcttat ggaagcgtat gcaccaaaga cgcaatcaag aaagagattt cggactggaa 1740 acgaatttac gcctacatta cagcacactc gcgttcgcaa aggaaaatct gaatgccttg 1800 cctgtcaagg cctccggcta ggtcagcgtc ggtctcgaag atcaagggca gcattatctg 1860 cgattagcgg taacagtcga agggcaccgc agtcacgata taggtgtcaa gaatgtaatg 1920 tagctttatg tagacttggc aattgctggg acctttggca caaccaaagt taggtgggga 1980 taaatgtacc tttgatttaa tgcattttta ttggctaacc ccgctgctag gtacggaagt 2040 ggaggcttac gacagtcaaa ggg 2063 // ID U35230 repbase; DNA; FNG; 692 BP. XX AC U35230; XX DT 24-FEB-1998 (Rel. 3.01, Created) DT 24-FEB-1998 (Rel. 3.01, Last updated, Version 1) XX DE Magnaporthe grisea short interspersed nuclear element (Mg-SINE). XX KW SINE; Non-LTR Retrotransposon; Transposable Element; U35230. XX OS Magnaporthe grisea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-692 RA Kachroo P., Leong A.S. and Chattoo B.B.; RT "Mg-SINE: a short interspersed nuclear element from the rice RT blast fungus, Magnaporthe grisea."; RL Proc. Natl. Acad. Sci. U.S.A 92(24), 11125-11129 (1995). XX RN [2] RP 1-692 RA Kachroo P.; RT "U35230."; RL Direct Submission to Genbank (01-SEP-1995)Pradeep Kachroo, RL Department of Microbiology and Biotechnology Centre, Faculty of RL Science, M. S. University, Baroda, 390 002, India. XX DR GenBank; U35230; Positions 1 692. 
XX SQ Sequence 692 BP; 158 A; 236 C; 177 G; 121 T; 0 other; cgaggaactg cgtttgccaa gacccctact gggccattat ttggccatga ggaccggcca 60 cggcgatttc aaggcctacc atgaccgttt caaccaccag gatgcaaaca cctcgtgtgc 120 ctggtgctgg aagcggacct cccctgagca cccggtgcac tgccgctatt cgcgggcggt 180 gtggagaaac tggccgtggc ctaataacga ccggccggcc gggccgccaa atcgcgccca 240 acgctggaaa ttcttccaga caagcttcgg gcaaccgaaa agctttgagg cgttttcgat 300 agccaccaac tacttcagcg cccgccccag agcggcccgc cagcgccccg cgcgcacgaa 360 cgcactttac gcctagggac accgatcgta aacgactcga actctgacga ggaatagact 420 tactgccttt tcacccacac ttcacggcac aagccgtcag acgaaccctc cgtgcacctt 480 aggcacgggg tcgcgtcagt gaactaaccc cctaagcagg accgggcccg aacccggtca 540 ggcacgatcc gcctctgccc tccttgtttt ccccctgtgt aaataaagaa gatagaacgc 600 gcgccgagat acccctcggg aggttgctaa cggccggcta acaagccggc cgagccgccg 660 gttaactaat actactacta ctactactac ta 692 // ID Copia-1_TMe-LTR repbase; DNA; FNG; 312 BP. XX AC CABJ01002173; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_TMe_; KW Copia-1_TMe-I; Copia-1_TMe-LTR. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-312 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01002173; Positions 62653 62342. 
XX SQ Sequence 312 BP; 70 A; 83 C; 41 G; 118 T; 0 other; tgttgaaata tgcttctggc actaccgtag tatcactcgc actattgatt ttctccttgc 60 attagtcttg atcgccatct caaaaccccc ttacaatatt ggtactattg aacatacctc 120 cttgcctttc ttttcttctc tctgtttctg tttacttcct actgttgatt tactcgcaac 180 ttgtgtctcg ctctcaacag tagatagtct gggtagtttg ctctctactg agaagtcaat 240 taagcaattc attattcttc tcgtcactca accagttcca aaaaccatca ttccgttctc 300 actgtttcaa ca 312 // ID PCretro5_I repbase; DNA; FNG; 4640 BP. XX AC DQ097839; XX DT 08-MAR-2006 (Rel. 11.02, Created) DT 08-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Phanerochaete chrysosporium RP-78 Ty1/copia LTR retrotransposon DE (internal portion). XX KW Copia; LTR Retrotransposon; Transposable Element; KW internal portion; PCretro5_I. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-4640 RA Novikova O., Fursov M., Shutov O. and Blinov A.; RT "Divergent groups of LTR retrotransposons from Phanerochaete RT chrysosporium."; RL Direct submission to Genbank (2005). XX DR EMBL/GenBank/DDBJ; DQ097839; Positions 462 5101. 
XX FH Key Location/Qualifiers FT CDS 98..4630 FT /product="PCretro5_I_1p" FT /translation="MPSKRSDTTAPAVCSGKPHIFDKLETEDRWYKSTGHA FT GWMPCELQPCPHRYSQPIGVVEDASWQVAVDPSQLKLFVARLSGRQPSPPR FT SASIQEVPEAEATAAPTLPPPSRSSERSVAEEELEYLDATSSPVARPSVSS FT SSIIRAFNSVPKLALEGSNYRLWHQRIVVAARGVQCDDLLEEPDVPATRKR FT EADALLSAILDTLPDSVFMSVSSTADVPADVMSAMKVRYGVSTAVSDAAAQ FT RKLFSMTCKDDRKLQAHLDSMLAVREQIIESGTSVSDKVFTDAIIASLPDS FT YKPIVNAYSATLLLDAARTGTPRPARSHELIPLLRAEAHSRYTVARGGAKD FT AVPTAASADARGQRGGRGKGKGRSAQQQQRGGNSANQSSREGEEVTCYKCQ FT GKGHTKKVCPSKNYAKRPEPAANAAQVTSAPAPASASAPAPAPAAKIVEVE FT DAWAASALAPESDAPPEREDTVDEALVSQSTDPVDIYDSGATHHMTPFRHL FT LYNYRPIPARTVRAAGKSHFAATGVGDMKLLMPNGNSWMRITLRDVYFAES FT MAATLVSLGKFDDAGYRAVIGGGYLRIMRGDAQFAAIPKIRGLYRYYHSGE FT GLALAAFSASLSLYQLHQHLGHLSYGYIKKLVASKAIQGLQLDPARRTEDE FT CSVCMRAKAARAPIAAKRSSPLAATFGEHLHLDVWGPAPVRTINHCRYALV FT MVDDHSRWLEEPLLRSKDEAFARFRDFVALIRTQSGAQLKVVSSDRGGEFT FT SHEFSEFLSRNGVVRRLTVHDTPEHNGVAERVHGTIFNMVRALLISSGLPR FT TLWGEAVRHAVWLYNRTPHAAIDFRTPYEVRFGSPPDLSGLKPFGAVCFVR FT NLSAGKLDARAVECRWLGFDPTSNGSRIYWPTSHKVSVERDIKFSSREVPL FT LEGEDYSLDPAPDSDSDNEQHPDAASDTSGTFPSDPPDDEPLATPRRSARL FT AQKRLAAHIIDLSHSELEATLEAAQSEALGHDPRSYAEAMRSPDAPAWQEA FT MDEEIRRLEQHCAWVYETAPSGAHVVGSKWVYRTKRDAQNAITGYRARLVG FT QGFTQIDGVDFFSDDTFAPVAKMASQRANAALAAQRDYEMAQIDIKSAFLY FT GPLKDDEVIYLRPPPGVKLQGLKTGQVLRLRVALYGLKQAGRRWALFLREI FT IADIGLTRSEQDHAVFYRHLPGNHVAIISSHVDDLTLIAPDQKTIEDIDRR FT IRARVEATPLQPLNWLLGIEIKRDRAKRTVSFSQRAYIDQIISRYGFEDIK FT PLAAPMDPHLVLSKEDCPSTAAEVAEMRHKPYRQALGALMYAAIATRPDIA FT YAVNQLARFAENPGMKHWNALRRVYAYLKGTRDLSLVLGGDARDGPLVGYT FT DADGMSTEGRQAVSGYAFLIGGAVSWSSKRQEIVALSTSEAEYVALTHAAK FT EALWLRNYLHEVWQMPLQPMQLYSDNQSAIALARDDRYHARSKHIDIRYHF FT IRYHIEHGNITVTYCPTEDMVADTLTKALPSMKAKHFASSLGLAKA" XX SQ Sequence 4640 BP; 850 A; 1692 C; 1240 G; 858 T; 0 other; gttatgagcc cttagaggcc gacctcagtc gaagtcattc gtttgcgcac cttcgttcgc 60 ccctgtctct cgctccccac cgctcgtttt cgttcgaatg ccctccaagc gttcagacac 120 caccgccccg gccgtgtgct ccggcaagcc gcacatcttc gacaagctcg agaccgaaga 180 
ccgctggtac aagtcgacgg gtcacgctgg atggatgccc tgtgagctcc agccctgtcc 240 gcaccgttac tcacagccca taggcgttgt cgaggacgcg tcctggcaag tcgctgtcga 300 cccctcgcag ctcaagttgt tcgtcgcccg tctgtccggc cgccagccgt cgccgccccg 360 ctctgcgtct atccaagagg tccctgaagc cgaagcgacc gctgcgccta cgctcccccc 420 gccgtcgcgc tcctctgagc gttctgtcgc ggaggaagag ctagagtacc ttgatgccac 480 gtcgtctcct gttgctcgcc cctctgtatc gtcgtcttca attatacgcg cattcaattc 540 tgtccctaag ctcgcgctcg aaggcagcaa ctaccgcctc tggcatcaac gcatcgtcgt 600 tgcggcgcgt ggcgtacaat gcgacgacct cctcgaagag cctgacgtgc ccgccacacg 660 caaacgcgaa gccgacgccc tgctgtctgc tatcctcgac accctgcctg actccgtctt 720 catgagtgtc tcgtccacgg ccgacgttcc tgctgatgtt atgtctgcga tgaaggttcg 780 ctacggcgtg tctaccgctg tttctgacgc cgcagcgcag cgcaagcttt tctccatgac 840 ctgcaaggac gatcgcaagc tgcaggctca cctggactcc atgctcgccg tgcgcgaaca 900 gattatcgag agcggtactt ctgtgtctga taaggtcttc actgacgcga taattgcgtc 960 tctgcccgac tcctacaagc cgatcgtcaa cgcctacagc gccacactcc tcctcgacgc 1020 tgcgcgcacg ggtacccctc gccctgctcg gtcgcatgag ctgatcccgt tgctgcgcgc 1080 cgaggcgcac tctcgataca ccgtcgctcg aggtggcgct aaggacgccg tgcccaccgc 1140 tgcgagcgca gatgctcgcg gccagcgtgg agggcgcggc aaggggaagg gccgctccgc 1200 gcaacagcag cagcgcggtg gcaactctgc taaccagtcc tctcgtgagg gggaggaggt 1260 tacctgctac aagtgtcaag ggaagggaca taccaagaaa gtttgcccat ctaagaacta 1320 tgcgaagcgt cctgagcccg ctgcgaacgc cgcgcaagtc acgtccgcgc ccgcgcccgc 1380 gtctgcatct gcgcctgcgc ctgcgcccgc cgctaagatc gtcgaagtcg aggacgcctg 1440 ggccgcctct gcgctcgcac ctgagtccga tgcgccaccc gaacgcgaag ataccgtcga 1500 tgaagccctg gtcagccagt ccacagaccc tgtcgacatc tacgactctg gcgccacaca 1560 ccatatgacg ccgttccgcc atctgctgta caactatcgc cccatccccg ctcgcaccgt 1620 gcgcgccgcc gggaagtctc acttcgccgc caccggtgtc ggggacatga agcttctcat 1680 gcccaacggc aatagctgga tgcgcatcac ccttcgcgac gtgtacttcg ctgaatcgat 1740 ggccgccacc ctcgtctctc tgggaaaatt cgacgacgcc ggctaccgcg ccgttatcgg 1800 cggtggctac ctgcgcatca tgcgcggcga cgcgcagttc gctgcaatcc ccaaaatccg 1860 cgggctctat cgctactacc 
actcgggcga aggtctcgct ctcgccgcct tttcggcctc 1920 gctctcgctg taccagctgc accagcacct tgggcatctt tcctacggct acatcaaaaa 1980 gctcgtcgcc tccaaggcga tccagggcct ccagctcgat cctgctcggc gcaccgagga 2040 cgaatgctct gtctgtatgc gcgcaaaggc tgcgcgcgcc ccgatcgccg cgaagcgctc 2100 ctcccctctc gccgctacct tcggagagca cctgcacctc gacgtctggg gccctgcgcc 2160 cgtgcgcacc ataaatcact gccgctacgc tctcgtcatg gtcgacgacc actcgcgctg 2220 gctcgaagag ccgctgcttc gctcgaagga cgaagccttc gcgaggttcc gcgactttgt 2280 ggcgctgatc cgcacccaat ccggcgccca gctgaaagtc gtctcctcgg atcgcggcgg 2340 tgagttcacc agccacgaat tctctgagtt cctctcgcgc aacggcgtcg ttcgccgcct 2400 caccgtccac gacacgcctg agcacaacgg tgtcgccgag cgcgttcacg gcactatctt 2460 caatatggtg cgcgcacttc tcatctcgtc tgggctaccg cgcacgctct ggggcgaagc 2520 tgttcgccac gctgtctggc tgtacaaccg cacaccgcac gccgccatcg acttccgtac 2580 gccatacgag gttcgtttcg gctcccctcc tgatctctct ggcctcaagc ccttcggcgc 2640 tgtctgcttc gtccgaaacc tctctgctgg gaagctcgat gcgcgcgcag tcgaatgtcg 2700 ttggctgggc ttcgacccta cgtcgaacgg gtctcggatc tactggccga cctcgcacaa 2760 agtctcggtc gagcgggaca tcaagttctc gtcgcgcgag gtgccgctcc ttgaggggga 2820 ggattacagc ctcgatccgg ctccagactc tgacagcgac aacgagcagc accccgacgc 2880 tgcctccgac acttctggca ccttcccaag cgatcccccg gatgacgagc ctctcgccac 2940 accgcgccgc tcagcgcgcc tagcccagaa gcgcctagcc gcccacatca tcgacctctc 3000 gcacagcgaa ctcgaagcca ccctcgaagc cgcgcaatca gaagcactcg ggcacgaccc 3060 tcgaagctac gctgaagcca tgcgcagccc ggacgcacct gcctggcaag aggcaatgga 3120 cgaagaaatc cgccgcctcg agcagcactg cgcatgggtc tacgaaaccg ccccgagtgg 3180 cgcgcacgtc gttggctcga aatgggtcta ccgcacgaaa cgcgacgcgc agaacgcgat 3240 taccggctat cgcgcgcgac tcgtcggcca aggcttcact caaatcgacg gcgtcgactt 3300 cttctcggac gacacgttcg cgcccgtcgc aaaaatggcg tctcaacgcg caaacgcagc 3360 tctcgctgcc caacgcgact acgagatggc ccagatcgac atcaagtccg ccttcctcta 3420 cggaccgctc aaggacgacg aagtcatcta cctgcgcccg cctcccggtg tcaagctgca 3480 gggactcaag actggtcaag tccttcgcct tcgcgttgcg ctctacggcc ttaagcaagc 3540 cggccgtcgc tgggcactct tcctgcgcga 
aatcatcgcc gacatcggtc tcacgcgctc 3600 cgagcaagac cacgccgtgt tctaccgcca cctgcccggg aaccacgtcg ccatcatctc 3660 ctctcacgtc gacgacctca cgctcatcgc gcccgaccag aagacgatcg aggacatcga 3720 caggcgcatt cgcgcacgcg tcgaagccac gccactgcag ccgctcaact ggctcctcgg 3780 catcgagatc aaacgcgatc gcgcaaagcg aacggtctcc ttctcgcaac gggcctacat 3840 cgaccagatc atcagccgct acggcttcga agacatcaag ccgctcgctg cgccgatgga 3900 cccgcacctc gttctctcga aggaggactg cccctcgacc gctgccgaag tcgccgagat 3960 gcgccacaag ccttaccgtc aagctctcgg tgcgctcatg tatgccgcca ttgccacacg 4020 cccagacatc gcctacgctg tcaaccagct cgcccgcttc gccgagaacc ctggcatgaa 4080 gcactggaac gcgctgcgcc gcgtttacgc gtatctcaag ggcacgcgcg acctctcact 4140 cgttctcggc ggcgacgcgc gcgacggacc tctcgtcggc tacaccgatg cggacggcat 4200 gtccaccgaa gggcgtcaag ccgtctcggg ctacgctttc ctcatcggcg gcgcagtctc 4260 ctggtcatcc aagcgtcagg aaatcgtcgc gctctcgacg agcgaagccg aatacgtcgc 4320 gctcacgcac gccgcgaagg aggccctctg gctgcgcaat tacctgcacg aagtctggca 4380 aatgcccctc cagccgatgc agctttattc ggacaaccag agcgccatcg ccctcgcgcg 4440 cgacgatcgc taccacgctc ggtcgaagca catcgacatt cggtaccatt ttattcggta 4500 ccacatcgag cacggcaata tcaccgttac ctactgtcct acagaggata tggtggccga 4560 cacgctcaca aaagcgctcc cttcgatgaa ggccaagcac ttcgcttcgt ccctcggact 4620 cgcaaaggct tgagggggag 4640 // ID Gypsy-2_RO-LTR repbase; DNA; FNG; 462 BP. XX AC AACW02000235; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_RO_; KW Gypsy-2_RO-I; Gypsy-2_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-462 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000235; Positions 282242 282703. 
XX SQ Sequence 462 BP; 161 A; 70 C; 58 G; 173 T; 0 other; tgttgtaaac atagttattt tctataagat atcagttgta ttaaaataaa taacgataca 60 caaaagatta aactatgatt gaacaattcc acacttttcc aaatttagaa caataagatc 120 agtatttgtc atcctagcga gtctaatcaa ttgtcaaccc caaaagtaat tagatcaaag 180 ttaattgact ggttattatt tctattgttc aacttcagat gcatatatta gaacatcgac 240 aagctttagc ttaatcattg gttcttcagc acgacattgt tgccgagtct atatctttta 300 tatctacgct cacaaagaat ataaatagat gccaactttt tatccattaa acggggctct 360 tttgcaatat aaagaaatct gcaataaaga aggattttaa ttttatattg ctgctattgt 420 ttttttgttt aatttttttt atttaatctt acgattaaaa ca 462 // ID Copia-61_MLP-LTR repbase; DNA; FNG; 184 BP. XX AC AECX01000544; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-61_MLP_; KW Copia-61_MLP-I; Copia-61_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-184 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000544; Positions 119341 119158. XX SQ Sequence 184 BP; 54 A; 31 C; 31 G; 68 T; 0 other; tgaagatggg ggggaattag gatcacaaag tgatcccata aagcgcatta ccattaaaga 60 ttcattttca ttatcatgtg cattatgtaa cacgtttctc ttcttacacc atgacgcatt 120 gttgagttct atagaatgat atatagagta gtctatctat ctaattgttt tcttttctct 180 atca 184 // ID Gypsy-3_CCO-I repbase; DNA; FNG; 5970 BP. XX AC AACS02000001; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_CCO_; KW Gypsy-3_CCO-LTR; Gypsy-3_CCO-I. 
XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-5970 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000001; Positions 82033 88002. XX CC 'GCTAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 3974..5014 FT /product="Gypsy-3_CCO-I_3p" FT /translation="MSDPPPCSPPNGTTKSTTGRCWPLSRLSRTGATSWKD FT SLNPSRLSLTIPTLNTGAPHKTSAVDRLVGLSTSPASGSRSPTALARQTPK FT PTPSLVSTSTRSPTPRTTNNRWYSNPSCLPSSPPRAWSSTPWRTVSDRPVP FT ENPRSWMDLPSSARQVPVVSPLVLPNGRRITALFTTRVESTFPQTTNSDAT FT FSNSATTILPLDIPVYMEPSNELNDSTGGRPCEPLSRSMWKVATSVPGRNT FT PSTPAPRLSPWRYPMAPGRPLESTLSPSFLMPMGMMPSSSSSTTTASFSMP FT SPAPRTSPLKESLTSTTEKSSVSTAFHSGSSATAAPSSPPKSCAPCSSVSA FT SNPT" FT CDS 4345..5967 FT /product="Gypsy-3_CCO-I_2p" FT /translation="MVINPLEDRIRQASARESQVLDGLAQLRKTGPRRLTS FT GLAEWEEDNGLVYYKGRVYVPPDDELRRDVLKQCHDDPTSGHPGVHGTLER FT VERQYWWPTMRAFVKKYVEGCDICARKKHAQHPRASTQPLEVPDGPWETVG FT VDLITQLPDANGHDAIIVFVDHYSKLLHALPCTSDITTEGVADIYYREVFR FT LHGLPLRFISDRGPQFASKVMRTLLKRLGIESNLTTAYHPQANGQTERANQ FT EVEKYLRLYVSRRQDDWDKHLPMAEFVINSRVHSAHNRSPFEVLYGYIPHF FT NIPVGKRTDLRPVDERIQQLQEVRKDAEAALRLEKAQQKDLYESGKRTAHQ FT FKVGDHVWLNAKDIQIKVPTRKLGDLQLGPYKITERIGDLDYRLELPPSLS FT RIHPVFHVDKLSPWKGNDVNGILPPPPEPVELEGELEYEVHDILDSRWKGR FT GKNRKLGVPGQLERLRVDGRHLGNRKRTWSTAPEIVKEFHQRLPLCTTSNF FT RHRLQLSPLAPTRELHRRSTYRLRMGTWTPPTVDHRGRSDSLEGG" XX SQ Sequence 5970 BP; 1352 A; 2130 C; 1343 G; 1145 T; 0 other; ttaggtcaag ctacctttgt gcactggcaa cgccgatcgt tgctcctgca tcatccgcat 60 ttctggctcc tggttgacct gccgatcagg tccctccctg gcaactctgg tccccgcact 120 ccgtcaagat ttccgccgca ccggaagaca cctcctgtct gtttaagccg cacttcgtcg 180 ttcgggtcag cgacggtcac cccatcgtcc tagacgtcca tctaccgagc ctcaaccgtt 240 ggaccgcctg ctccgaacag ccctgcccca accgttacca cgaaccttcc gtacacgact 300 
gggtctatat caactccctc aagttaaacg ccgaatctcc gtcccccgtt aatcagcaag 360 cggttgaaga attcgcccgc aacaacccca ggactccgcc ctccgaaacg caacccaccg 420 ccgctgaatc cccagaaacc ccgtaccaag cttaccgctc tgcaaacacc tctcctgctg 480 ccaaagccgg ttcattcaag cttccagatc ccgacaccga tcccgaagaa gaagacgagg 540 aagacgaaga agaagtcgtc cagtccctta gcgtccgatc agcatcccct gaacaggaaa 600 acatcctccc accgttcaat accctccact ccgctttagc cgcatccgca cccctcatct 660 ccgcacaacg cctcagctcc tccggtcgct ctacaccaac ttaacccgcc tctttcatct 720 ctcgcccggg tcatcccccc tctcacacct ctcggcagtt ggaggataca tctgacgctc 780 caagctcaga acagaaacct cccacttcca cccaaccaac tactctactc tcttccactt 840 cccaaaccac cgacgacttc acgatgtccg ccgccgatct cgcgaaactc caggaacagc 900 aaaaccaact caccaccgcc atttctaccc tggtcagcca ccttacagcc gccaaaaccg 960 cctcgagcaa gagcccggtc aaggcgcctg agcccttcca cggcaaggct gaagacgccc 1020 gacgcttcct ccagttcttt accaattggg ccggacacca accccagttg aagaaggacg 1080 acggcactcg ggatgacaag gaatggatct ccacggccct ctcgtacatg catggagaag 1140 ccggccgatg ggcatctcgc ttccttcagc agatcgccga tcatgatgcc gacagcacca 1200 aaccatggcc cttcgctggg ggaaagtggg ctaacttcct caccgagttc aagcaacgct 1260 tccaacccgc caacgacgcc caggctgccc tccaggagct tgagcagctc accctcggtc 1320 gaggaactgt agccgacttt gcctctcgct tcgtcgacat cttttcgcgc accaacttat 1380 ccgattcgga cgggatggct cgcttcatag aggaagctct ctcaggaaca ccttctctgg 1440 ctcgccctca gtaacctcat caaggaaccc gccaacctgg aggaactcgt cgctcaagcc 1500 atcaagaacg agttcgtcat gcggaacatg aaccccaact cgtcccgcac acccctggcc 1560 cagtacgctc ccgctcccac tcccgccccc gctcgtgatc cgtacgcaat ggagatcgac 1620 gctacccgac ctggccctag tggcaagacc taccaggact tcctcaatgc catgcgcggt 1680 cgatgcttcg gctgtggatc caccggtcac aacaagaagg atgggaatca tggggcgtta 1740 cgctgcaact actgccagcg tatgggacat gctgccaccg tctgccagga caagtacatg 1800 ggattcccac ctggacgagg gctctctcga cccccacgtc gtcgtgtcgc cgccacccaa 1860 gaagcaccct tttctctctt tgacgaacct gcccctgctg ccactgccac cgttgccgcc 1920 accacccccg ctgctccccc tgcccctgcc cctgcccctg cccccaccct cgcctcgctc 1980 tccgccaccc agcaacagca 
actcgaactc ttggctcaac tccagaggct ccaccagggt 2040 ttttgaagcg ggtgcccgtc cctgccgcaa cgcaggcacc ctttactcct ccgtacgcca 2100 cttccgaata tgattctgtc cgttcatgta ctattagctc aattcgcgaa ataaaccccc 2160 gttcatcaca cttccgcgtg aacgtgaggc tcagaggcag aaatcgcagt acgaactccg 2220 cggcaatgat tgactgtggg gcaacaggca agttcctgga caagcccttc gtcgatcggc 2280 acaacattac cacattcccc ctccgacacc ccatccgact cctcaacatt gacggtaccc 2340 ccaaccaggc aggcaacatc acacacttcg cacggctaga gctcacagtt gatggtcact 2400 ccgaatggac cgactttctt atcgccgacc ttggaggaga ggacatcatc ctcggactcc 2460 cttggcttcg caagatcaac cccaccattg actggcgtaa cggcaccctc caagttccct 2520 ctaaggtctc tagcgtcact attgaggagg ttccagacga ggacgcacac cctcccagtc 2580 gaggaactcc ctctggcgac gccatcctgg aacagattga taccacggtt gatccttctc 2640 cggtactccc cactcccagc cccagccctg ctcctgaacc ggaaccagaa ccgatcggga 2700 ccacccctct gtcgcatcag tgccaaccga accactcgca ggaaatggct acgagcaggg 2760 ctgatgtgag gaaacgggtg acgaactttg gtgcgctgca ggattcacct actcccaaca 2820 aattgccgag aaatctcaga aagccaaacc ccagaaaact ttcgaggaga tggttccccc 2880 gcagtaccgc cagcacgctt ccgtcttctc cgagtccgag tcgcatcgcc tacccgaaca 2940 cagaccctgg gaccacgcca tcgatctgat tcccggagcc ccggctacca tgcgcaccaa 3000 ggtttacccc atgtctcaga acgaacaaga agagctgaac cgattcctcg acgagaatct 3060 caagaaggga tacatccgac catccaagtc tcccctctct ctccccggtc ttctttcgtc 3120 aagaagaaag atggcaagct ccgtttcgtt caggactacc gcaagctcaa cgagatcacc 3180 gtcaagaacc gctaccccct ccccctggtc tccgacatcg tcaacagact ccgcggtgcc 3240 aagtatttca cgaaattcga cgttcgctgg ggttacaaca atatccggat caaggaagga 3300 gacgagtgga aggccgcgat tcgccaccaa ccagggtctc ttcgaaccac tcgttatgtt 3360 cttcggcctc accaactctc cggcaacatt ccaggctctc atgaactcta tctctctccg 3420 acctcatcgc tggagggaag gttgcagtgt atctcgacga catcctcatt tttacggccc 3480 acactcgacg agcaccggca agttgtacac gaagtcctgc atcgcctcaa gaagcatgat 3540 ctgtatctcc ggccagagaa gtgcgagttc gaacgtcagg agatcgaata cctcgggctg 3600 atcatccgcg agggcgaagt cagcatggac cccgccaagg tcgaagccgt ccgcaactgg 3660 ccagtccccc gcaacctccg cgccgttcgt 
ggtttcctcg ggcttcgcca acttctaccg 3720 ccgttttcat caaggacttt gccaccatcg ccccgaccgc tcaacgacct caccaagaag 3780 aaacatcccc tggcaatgga acgatcctca acagcaggcc tttgacaccc tccgcaatgc 3840 cttcacgtcc gcccccattc tgacgctctg ggatcctgta ccgccccact cgcattgaag 3900 tcgacgcctc cggctttgcc actggaggag cccttctcca gaagcaggat gacggcctct 3960 ggcaccccgt tgcatgtccg atccgcctcc atgcagcccg ccgaacggaa ctacgaaatc 4020 tacgaccggg agatgctggc cattatcgag gctctcaagg actggcgcca cttcctggaa 4080 ggactccctg aacccttcga gattgtcact gaccattcca accttgaata ctggcgcacc 4140 gcataagacc tcagccgtcg acaggctcgt tgggctctct acctctcccg cttccggttc 4200 tcgctcaccc accgccctgg caagacaaac acccaagccg acgccctctc tcgtctcgac 4260 gtccaccagg tctccgacgc cgaggacaac aaacaacagg tggtactcaa acccgagctg 4320 tttgccaagc tcgccgcctc gagcatggtc atcaacccct tggaggaccg tatccgacag 4380 gccagtgccc gagaatccca ggtcctggat ggacttgccc agctccgcaa gacaggtccc 4440 cgtcgtctca cctctggtct tgccgaatgg gaggaggata acggccttgt ttactacaag 4500 ggtcgagtct acgttccccc agacgacgaa ctccgacgcg acgttctcaa acagtgccac 4560 gacgatccta cctctggaca tcccggtgta catggaaccc tcgaacgagt tgaacgacag 4620 tactggtggc cgaccatgcg agcctttgtc aagaagtatg tggaaggttg cgacatctgt 4680 gcccggaaga aacacgccca gcacccccgc gcctcgactc agcccttgga ggtacccgat 4740 ggcccctggg agaccgttgg agtcgacctt atcacccagc ttcctgatgc caatgggcat 4800 gatgccatca tcgtcttcgt cgaccactac agcaagcttc tccatgccct cccctgcacc 4860 tcggacatca ccactgaagg agtcgctgac atctactacc gagaagtctt ccgtctccac 4920 ggccttccac tccggttcat cagcgaccgc ggcccccagt tcgcctccaa agtcatgcgc 4980 accctgctca agcgtctcgg catcgaatcc aacctgacca ccgcctatca tcctcaggct 5040 aacggacaaa cagaacgggc caaccaagaa gtcgagaagt acctccgctt gtacgtcagc 5100 cgccgacaag acgactggga taagcacctt cccatggccg aatttgtgat caattcccga 5160 gtccactccg cgcacaaccg atcccccttc gaagtcctct acggttacat ccctcacttc 5220 aacattcccg tcggtaaacg cactgatctt cgccctgtcg acgagcgcat ccaacaactg 5280 caagaggtac gcaaggacgc cgaagccgcc ttacgtctcg agaaagcaca acagaaggac 5340 ttgtacgaat ccggcaaacg aacagcccat cagttcaagg 
ttggtgacca cgtttggctc 5400 aacgctaagg acatccagat caaagtccct actcggaagc ttggcgacct ccaactcggc 5460 ccctacaaga tcaccgaacg cattggcgac ctcgattacc gccttgagct tcccccctcc 5520 ctatcccgca ttcacccagt gttccacgtc gacaagctct ccccctggaa gggcaatgat 5580 gtcaacggca ttctgccacc tccgcctgag cctgttgaac tggagggaga gcttgagtac 5640 gaggttcacg acatcttgga cagccgctgg aagggccgag ggaagaaccg taagctggga 5700 gtacctggtc agttggaaag gctacgggtc gacggacgac atcttgggaa ccggaagaga 5760 acctggagca ctgcaccgga gattgtaaag gagttccatc aacgactacc cctctgcacc 5820 acgtcgaatt tccgccaccg tcttcagctc tctcccctgg cgcccactcg agaacttcac 5880 cgacgctcca cctatcgact tcgaatggga acttggacgc cgcctaccgt tgatcatcga 5940 ggacgatccg acagcttaga aggggggtaa 5970 // ID Gypsy-13_RO-I repbase; DNA; FNG; 6774 BP. XX AC AACW02000268; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_RO_; KW Gypsy-13_RO-LTR; Gypsy-13_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-6774 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000268; Positions 154615 161388. XX CC Positions [5073-5372] - Integrase core CC LTRs are 94% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 145..1308 FT /product="Gypsy-13_RO-I_1p" FT /translation="MATYNTGHIALANNYNYKPVPVGFHGYEGEDFRYFLE FT RLESYLAINNVHDARKLAILRSLLKGAAKVFFEKDILKKLPDVKYEQAIEA FT LRSQYVTAELIQNYELEFNDMIQGEQEHPRIFLARLREAADLANIEDEAVI FT ESRIRAGLQPEIKRFCIQSSSKTLKDWMNHAEGWWNANRPRRIAMVDNPFI FT PRNANQALVYPSDSYYQPHHTPPNHNIELIDDYEYGVPVLPYREVQHTNMP FT VYNSNDHNMLAINQLAAMDTSKGYPSQQHGNRNHHRSQQTYNHQQIMETNQ FT QQDLVDLIQKTIRMEFNNHQQYQQPARNYNRNNRYNNSSNYNDYNNNSSNN FT NGNNNGNNRYGRYSNNYNNNNRYNTQMDNNNVQQQTSNTKQPSKN" FT CDS 1629..4370 FT /product="Gypsy-13_RO-I_2p" FT /translation="MQPQNIEVDMDLPIQEKQKHKAPRIKRTQPDIKYDIV FT SDVLRHKADIDIGDLITVAPALRRKLVSECRPKRKPKQDQASQQLQQIMAL FT LEDEELNTTAVYSTVSIGDKNIKALVDSGAAKTCMSKALADALGLEIDSAS FT DSIFTLGNGSKQTGLGLIYDVPIGIKDDMVIPCTIEVLPSCPTHLILGNNW FT LNRAKARIDFDTLTLKVKYKHQKAKLPIYYERKNVVLPKMRSYQQTYQPPV FT SLTNNKLEQSDEEQSDDSYDDDIEETESEEEESSENDTEEEITDDEKENQS FT LLVLEDGTKEEVTITNFKESHFIQATASGLTIPANSSMTLTLDKPEKEIPD FT LIYHFETTHPKLNNASGYFDTCSTLIVNKRTVEIRLFNRTNEVIIVEPEEE FT LGILERYNLHRDKVISAYTLQTNHDLCTIGTSDIRNKHKTKHKLLDTALYE FT KLEIGELPSDIRKQLRALLSRYDHVFDWNNDTIGRTDVINHKIIIQEDTMP FT ISHRPYRISPLEAEYLQKELDKYCKLGVISPSNSPWAAPVILVKKKNGEYR FT MVIDYRKLNTVTKKDAYPLPRIDDLLDTLGKAKVFSALDMRAGFHQVPLDE FT SSKELTAFTTKFGTYHYNTLPMGLVNSPATFQRLIDLCFRSLINKCLVAYI FT DDLNVYSNNDQEHLQHLELVFQCVETANLKLNPEKCFFFKDHLKFLGYIVT FT KDGIHTDPEKIKKIVEYPQPTTTTQVRSFLGIASYYRRFIKDFAAIARPLH FT DQTKTKKKIPWTQQTTESFETLKKLLTTAPVLARPDFNKEFILVTDASKLG FT LGCVLSQLDEDGKEHPVVFASRGLKPNEANYAPTKLECLAVIWAVKLFRPY FT LHGKKFMIITDHSALTGLLKTPNPTGLIARWIVTLSEYDFDIKYRPGRVNE FT SADFLSRLGY" XX SQ Sequence 6774 BP; 2705 A; 1327 C; 1147 G; 1595 T; 0 other; tttggtggtc actacgaggg tcaattacac tataattgaa aatactcaag aatcaacaaa 60 tcaaaactta caaaacctta actcacaaga atttacttgc tcaagcactc aaacaactta 120 tttacaaact caataacatc taaaatggcg acttacaata ccggacatat cgcgttggcc 180 aataattata actataaacc tgtacccgtt ggctttcatg gatatgaagg cgaagatttt 240 cgctacttcc tagaaagact cgaatcttat cttgctatca ataatgtaca tgacgcccgt 300 aaactcgcaa 
ttctcagatc tttacttaaa ggtgctgcaa aagtcttttt tgaaaaagac 360 atactcaaga aacttcccga cgtcaaatac gaacaagcaa ttgaagctct tagaagtcaa 420 tatgtcacag cagagctcat ccaaaactat gaattggaat ttaatgatat gattcaagga 480 gaacaagaac acccccgtat cttcttagct agactaaggg aagcagctga tcttgccaat 540 atcgaagatg aagctgtaat cgaaagccga attagagctg gtttacagcc agaaatcaaa 600 agattctgca tacaaagtag ctcaaagaca ctaaaggatt ggatgaacca cgcagaagga 660 tggtggaacg caaatagacc acgtagaatt gctatggtag ataatccctt tatcccaaga 720 aatgcaaacc aagctttggt ctatcccagt gacagctatt atcagccgca tcatacaccc 780 ccaaatcata acatcgaact tattgacgat tatgaatatg gtgtaccagt cttaccttac 840 cgagaagttc aacatactaa tatgccagta tacaattcaa acgatcataa catgcttgct 900 atcaaccagt tagcagctat ggatacttct aaaggttatc caagtcaaca acacggtaat 960 cgtaatcatc atagatctca gcaaacttac aaccatcagc aaattatgga gactaatcaa 1020 cagcaggatc tagtggacct tattcaaaag actattcgta tggagttcaa caatcatcaa 1080 caatatcaac aacctgccag aaattacaac cgcaataatc gttacaacaa tagcagcaat 1140 tacaacgatt ataacaacaa cagtagcaac aataatggca acaataatgg caacaatagg 1200 tatggaagat acagcaataa ctacaacaac aataataggt acaacacaca aatggacaat 1260 aacaacgtgc agcaacaaac ttccaataca aaacaaccgt caaaaaacta aatgggtcgg 1320 ttgcctttaa acaaggaaat ggtcaatcaa acacccaaaa taatacaaaa cccattaata 1380 atacacaaca gcatcaacta aacgccttat taacagaaac ggaaaataaa aactcaaagc 1440 ataataaaga tctatatgca gctatcagac cggaaaaacc tcctgaagtc ggaaccgcaa 1500 ccccatatcc aaaaaataga taagctaagg acaaaggaaa gcaagtagca ggaccttctg 1560 tgacgaaacg agtaacaaca agaagtcata tcgaagaagt tcatccgaca actagaccaa 1620 aggtatcaat gcaaccacaa aacattgaag ttgacatgga cttgcctata caagaaaagc 1680 agaagcataa ggcacctaga ataaaaagaa cacaacctga catcaaatac gacattgtat 1740 cagatgtgtt acgacacaaa gccgacatag atataggtga tttaattaca gtagcacctg 1800 cattaagaag aaaacttgta agcgagtgta gaccaaaacg aaagccaaaa caagatcaag 1860 catcccagca attacaacaa atcatggcat tactggagga tgaagaatta aacacaaccg 1920 ctgtgtattc aaccgtaagc attggcgaca aaaacataaa agcattagtc gatagtggtg 1980 ctgcaaaaac atgtatgtcg aaggctttag 
cagatgccct aggattagaa atagattcag 2040 catcagacag cattttcact ctaggaaatg gctcaaagca gacaggtctt ggattgatat 2100 atgatgtacc aattggaata aaagatgata tggttattcc atgtaccatt gaagtattac 2160 catcatgccc tacacacctt atcctgggaa acaattggtt aaaccgtgca aaagcaagaa 2220 ttgactttga caccttaacc ttgaaggtca aatacaaaca tcaaaaggcc aaactaccca 2280 tatattacga acggaagaac gttgtattgc caaaaatgag gtcttatcag cagacatatc 2340 aacccccagt tagcctaacc aataacaagt tagagcaatc tgatgaagaa caatccgatg 2400 atagttatga cgatgatatt gaagaaactg aatcagaaga agaagaaagc tcagaaaatg 2460 acactgaaga ggaaataaca gatgatgaaa aagaaaatca gtctctatta gtattagaag 2520 atggaacaaa agaggaagtt actataacaa atttcaaaga atctcacttc atacaagcta 2580 cagcatccgg attaacaata ccagctaact catcgatgac cttaacattg gacaaaccag 2640 aaaaagaaat acccgatttg atttatcact ttgaaacaac tcaccctaag ctaaataatg 2700 ctagcggata ctttgataca tgttctaccc taattgtgaa taagcggaca gttgaaatac 2760 gactctttaa tcgcaccaat gaagtaatta tcgtagaacc agaagaagag ctaggaatat 2820 tagaacgata caatttacat cgagataaag tcatatcagc ctatacttta caaacgaatc 2880 acgatctatg tacaatagga actagtgata ttagaaacaa gcacaaaacg aaacataaac 2940 tgttggatac tgcattatat gaaaaactgg aaataggaga attaccatct gacattcgaa 3000 agcaactgcg agctctactc agtagatacg atcatgtatt cgattggaat aatgatacca 3060 taggacgaac agacgttata aatcacaaga ttattataca agaagataca atgccaatca 3120 gccatagacc atatcgaatc agccccttag aagcggaata tcttcaaaag gaattggaca 3180 agtattgcaa gttaggagta atatccccat caaatagtcc ttgggctgcg ccagttatct 3240 tagtaaagaa gaaaaacggt gaatatagaa tggtaatcga ctaccgaaag ctcaacacag 3300 taacaaagaa ggatgcatat cctttaccaa gaatagacga tctattagat acgctaggaa 3360 aagccaaagt gttctcagca cttgacatgc gagctggatt tcatcaagta cccttagacg 3420 aatcaagcaa agagctgaca gcatttacta ccaaatttgg aacatatcat tacaatacct 3480 tacctatggg tttagtaaac tctccagcca ctttccaaag attgatagat ctatgttttc 3540 gatcattgat caataaatgc ttagtagctt atatcgacga cttaaatgta tactcaaaca 3600 acgatcaaga acatctccaa cacttagaac tagtattcca gtgtgtagaa acagcaaacc 3660 tgaagctcaa cccggaaaaa tgctttttct ttaaagatca 
tttaaaattt cttggatata 3720 tcgtaaccaa agatggtata catacagatc cagaaaagat aaagaagata gtcgaatatc 3780 cacaaccaac aactacaact caagtcaggt cattcttagg aattgcatca tactatagac 3840 gtttcattaa ggatttcgca gcgatagcta gacctttgca cgaccagacg aaaacaaaga 3900 agaagatccc gtggacacaa caaacaacag aatcattcga aacactcaaa aagttgctaa 3960 ccacagcacc tgtattagca agaccagact tcaacaagga gttcatttta gtaacggacg 4020 catcaaaatt aggactagga tgtgtattgt cacagttaga tgaagatgga aaagaacatc 4080 cggttgtatt tgcaagtcga ggtctcaagc caaacgaggc taactacgca cctacaaaac 4140 tagaatgctt agccgtcatt tgggctgtaa aactgtttcg accttattta catggaaaga 4200 aatttatgat cattactgat cattcagcct taactggttt actgaaaaca ccaaacccca 4260 ccggacttat cgcacgatgg attgtcacat tgtcagagta cgactttgat atcaagtatc 4320 gacctggacg tgtcaatgaa agtgcagatt tcttgtcaag actaggatat tagacgacta 4380 taactttaca actattatta ccaacatata tattactatt catcaatagg aggatgggag 4440 ggaagatgat acctaaaaca acaaaaatcc aaaaacaaca caaaaaaaaa aaaaatcatt 4500 cgaggagcac ggcccccaaa tttcagaaaa cgaaaatttt catacaaccg aaaacaccga 4560 gattagaaaa atggaaattt tacaaattta gaatacatca cattatcaaa aagaaccctg 4620 aaacatcaaa caatagatac agtatcacaa ataacgatta gtaacacaaa aactttattc 4680 aacaaaaaaa aatggaacaa aaacaaatag acatgctgaa acaatattta caaggcttga 4740 aattgccaga aggtatttca aagaagcaag caaaatacct acaaaaagaa gcacacaagt 4800 tcaccatata ccaaaacaca ctttatcgtt ataatactga gaatggtatt atacggaaag 4860 taatgaataa acaagaagca gaagaaatca tgtatacata ccatcagcat cctctaggtg 4920 gacacatggc ttataacaac acattgcaca aaatagcatc acgttattac tgggacaaca 4980 tgactaaaga tataatggaa tacgtaaaga aatgtcatag atgtcaaaag tatggaaaga 5040 aatctctgaa tgaagaatta taccctgtgc cagtatctac aaaacctttt gatcgaatag 5100 cattagatgt taagcatgta caagcgtcta gagctggata tcgatatatt atcgcagcaa 5160 ttgactacct caccaaatac gttgaagcta gacctatccg ttttcaaaca gcttccgaaa 5220 tttcactgtt cttatacgaa gagatcatat gtagacatgg atgtccaact attataattt 5280 ccgataatgg caagccgttt atcagcaagt ttgcagaaca tattcaatta ttcataaaac 5340 aactacacct tataacccac aaagtaacgg gttgatagaa cgtttcaaca 
gaacgttagg 5400 acaaatatta cagaaaagaa gtgtggacga aaaagaagac tgggatcaat atttaccagc 5460 ggcacttttc gcttatcgaa caattaaaca agcatctaca caacaaacgc cattcttcct 5520 actctacggt tacgaaccaa aaacgccatt tgacatggac tatcatatat acgaacgaaa 5580 ttcaccaaag tttgaagcca ttctgaaaca taggacaatc catcaaatac ataatctcag 5640 taacataaga aaagttgcag cgcaaaatat acgccaaacg caagaatcac agaagaagca 5700 gatagaaaac aagatattag atgaacgcaa agaactgaaa cctccattta gactaggaga 5760 cttggtactc atctataaag attacttatc cacatcttgg tcagcaaagt tacaagacaa 5820 gtgggaagga ccttatgttg tacaacacat attaggcaaa ggcacgtatc acatcaagag 5880 catgaatcca gaagatacca aattaagaag ggtacacgga aacagaatga aaccgtacct 5940 gttaccaaaa gtgcaatggt gtcaagaaaa cctgagacat gtcatgacca aattagatga 6000 acaaacaaag gagttatttc attaagatga aacaacatgt gactagtatc aatcagcaat 6060 acaaaaaaaa agtatataaa tagagaaaaa aaaatggaaa agctatattt tatttcaatg 6120 aacaaaacaa acgaacaagt ccccgtatca atggttaacc agaacaacaa tgcattttcc 6180 atggatatgg atacttacat caagaaggta tatcaagaac aaggctatga agcagcttta 6240 acagtaatga acgatttttt aataatgtgg gaagattgta atattggacc tacgaatctt 6300 acatatgaaa tcgatggtat cacatttaca agggaagaat atgcaattta cactgtagag 6360 ggaatcaagg caatgttaga aggacataat caggaagacc aagaagccaa ttgggaaata 6420 gaagatgatt ttgaattgtt aatgaacatt gaagcgctta ttcaaaacca atatgaacaa 6480 tataaggcgg ctataagaga aattgaacgt ttgcaagaac taaaggactc tctttgtgaa 6540 cttatatcat taaatgtatc gcgatgggcc cacttcaagt tcagaaggcc aaagcctact 6600 ccagaatcag taaccattta catagcagca aaacaaacag cccttaaaac aattgtacct 6660 ggttatagtc tcttccgtgc cttatcttta aaaatcaaac aaacagacgc ttggtcaaga 6720 atcgaggacg attcttcccg tggtccgacg atctgtgaca aagaagtcaa aaaa 6774 // ID Copia-2_LBS-LTR repbase; DNA; FNG; 234 BP. XX AC ABFE01000771; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_LBS_; KW Copia-2_LBS-I; Copia-2_LBS-LTR. 
XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-234 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000771; Positions 38105 37872. XX SQ Sequence 234 BP; 53 A; 55 C; 36 G; 90 T; 0 other; tgttagaaat tgacatctag atacattctc gtttaaacgc gtctcggact tttctactta 60 catgtcacac ccgtcacact cagtggttgt tcttctacac aacgctgagc ttgtacgtcg 120 tttcgttctc tttctttctt atcaatactc actggtacct agacttgtta tttcattagg 180 tactaaatac agtggttgtt cttctacaca acgctgagct tacttgttat ttca 234 // ID Gypsy-38_MLP-I repbase; DNA; FNG; 5711 BP. XX AC AECX01001140; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-38_MLP_; KW Gypsy-38_MLP-LTR; Gypsy-38_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5711 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001140; Positions 240711 246421. XX CC 'CAAAT' target site duplication CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 399..1445 FT /product="Gypsy-38_MLP-I_2p" FT /translation="MEDIQRQIAELQNALAGERNLQEQAEARSRQAEEHLA FT AIEAGHTRQPPAAQPAAPMPTQPEVAHTPKGPKVSTPDKFDGTRGSPAEVF FT ASQVQLYMLAHPHLFATDLSKVVFALPYLTGTASAWAQPLMHKLLDDAKSA FT TVTFERFVNNFKAMYFDTEKKSKAEKAIRSLSQKTTVAAYTYEFNMHATNT FT GWEVPTLISQFKQGLKRDIRVAMVLNQEEFTSVEQISNLAIRLDNKLHGVA FT DTTMHTATPARDPNAMDISSSYTRLSPDKRARRLRTGSCFHCAKQGHIASD FT CPTKSGERKGKGRVDYRARVAELEIKLAAMSGKESVIGDQMEGGSRSDISK FT NGGAQA" FT CDS 1835..5614 FT /product="Gypsy-38_MLP-I_1p" FT /translation="MPWIRANHRKIDWKTGNIDVAEVFTAFTDVESSTPTN FT PSLGPRTELWRDTRYRDEGICTNTSTLASPQSESCIFFKSPSLLEAEDKLF FT PLLQQQLDTPTPPEFIDSTTQDDAITTVGLSVTKTTPETNQVAAVNPASSR FT LPQTPAGPMEEPTGHARNDDEGACIFSSTLTPPQREFDILHSHALPETASK FT QNSPLNDSLTVAVDAANTSWSTSARLAADKKKNQLTKTVEDLVPTMYHRYL FT HMFQKSKAQCLPPRRKYDFKVELIKDAQPQASRIIPLSPAENAALDEMINA FT GLANGTIQRTTSPWAAPVLFTGKKDGNLRPCFDYRKLNALTVKNKYPLPLT FT MDLVDSLLDADEFTKLDMRNAYGNLRVAEGYEEILAFICKAGQFAPLTMPF FT GPTGAPGYFQYFIQDILRAHIGKDTAAFLDDIMVYTKKGMKHQESVFQVLD FT ILDKQQLWLKPEKCEFSRSEVEYLGLIISRNKIRMDPTKVKAVAEWPAPRS FT VNELQRFIGFANFYRRFIHQFSKTTRPLHNLTKDNTPYNWDDKCDKAFVAL FT KTAFTSAPILKIADPYKPFVLECDCSDFALGAVLSQRSDDDNELHPVAYLS FT RSLVQAERNYEIFDKELLAIVASFKEWRHYLEGNPNRLDVIVYTDHQNLET FT FMTTKQLTRRQARWAETLGCFDFQIKFRPGRHATKPDALLRRPDLEPDQAE FT KLTFGQLLRPENIGPDTFTEIATMESFFKSEDVKFNNAKHWFEVDVLGVSD FT ITTDEHLLEEPTLNDQALITLIRESTLKDSRLQELIQAAENPISSSIKKAV FT VAYKVKDGILYNHGRIEVPDDNDIKYKILRSRHDSLLAGHPGRSKTLNLIR FT RSFVWPSQKAYVNRYVDGCDSCLRSKATTQKPFGSLEPLPIPTGPWLDISY FT DLITKLPISNGKDSILTVVERLTKMSHFIPCNEAMSAEDLADLMVRFVWRL FT HGTPRTITSDRGSIFISQITKELDKRLGIRLHPSTAFHPRTDGQTEIVNKA FT IEQYLRHFVNYRQDNWESLLPIAEFSYNNKDHASTGVSPFKANYGFNPNFG FT GVPSSDQCIPSVENRLQQLNEVQSKLTECLNVAQQEMKAQFDKGIRTTPEW FT NVGDQVWLNSKNLSTTRPSPKLDHRWMGPFSIIEKISRSAYKLMLPVSMRG FT IHPVFHVSLLRKHKPDTVEERHTTEPQPIIINDTEEWEVMEILDCRKKFNK FT MEYLVSWKGFGPEHNSWEPRVNLKNCAELLQDFDTRFPKAAEKYKRTRRK" XX SQ Sequence 5711 BP; 1758 A; 1378 C; 1281 G; 1294 T; 0 other; tattgtcgga tctacttatt tcgacgagcg 
ccgaggaatc gagaagaaac agaatacacc 60 acattaccga tattagatta gaaaactttg tttgattgag attgaaacct tgtgtctgca 120 gaagaagatt caccacaaat accagcgaac tatttcagga ccaagattca gtagattagt 180 cagaattaaa attagataga ttagatttac ccagaacaag atcgattgaa ccagaacctt 240 gcagattcaa ctcaccagaa ccccgcactc cacaaccgac cacgtctcct ttgtacagaa 300 cccccagtgc cgacgatccg gacagtgact ctgagactga aacccccttc gttgacgtag 360 atccatcgtt acctgtcacc gagctcacta aaggcaaaat ggaagatatc caacgtcaga 420 tcgccgagct gcagaatgca ttagcaggag aacgtaactt acaagagcaa gctgaggcgc 480 gaagccggca ggccgaagaa catttggcgg cgatcgaagc aggccatact cgccaaccac 540 ctgctgccca accagcggca cccatgccaa ctcagcctga agtggcgcat acccctaagg 600 ggcctaaagt ctcgacccca gacaaatttg acggtacccg tggtagccct gcggaggttt 660 ttgcaagcca agtccagctt tacatgctgg cacaccccca cctttttgcc accgatctta 720 gcaaggtggt atttgcgttg ccgtacctga cgggcaccgc cagcgcatgg gcccagccat 780 tgatgcacaa actacttgac gatgccaaat ctgcgacagt gactttcgag cgttttgtta 840 acaactttaa agccatgtat ttcgacacgg aaaagaagtc taaggcggag aaggcaatac 900 ggtcattatc tcaaaagaca acggtcgcag cttacactta tgagtttaac atgcacgcca 960 cgaataccgg ttgggaggta cctacactga ttagccagtt caaacaggga ctgaaacgtg 1020 atattagagt tgctatggta ttgaatcagg aggagtttac ttcagtagaa caaatttcaa 1080 acttagccat cagattagat aacaagcttc acggagtagc tgacacaacc atgcacacgg 1140 ccactcccgc acgtgacccg aatgctatgg acatttcgtc ctcttacacc cgactctctc 1200 ctgacaaacg tgctcgacgt ctacgaaccg gatcatgctt tcattgtgca aaacaaggcc 1260 acattgctag tgactgcccg accaaaagcg gtgagaggaa agggaaagga agggtggatt 1320 atcgtgctag agttgccgag ctggaaatca aattagcagc catgagtggg aaggagagtg 1380 tgattggaga tcaaatggag ggtggcagtc gtagtgatat ttcaaaaaat ggaggcgctc 1440 aagcctgaag gttgtgccta gcttgagctt agaggctttg gatgattcaa taggaatagg 1500 agcgagtgga gttgtacctt gcaataacag tgatccacgt ttgtttctga cagcttcact 1560 ttccttgacc caaagacctc gcgccacacc attttttaga cccaccaccc gactcttgat 1620 tgactcaggt gccactcaca atgttttggg ggaagccttt gcctcagaga gagacctgtt 1680 acgacacgga gtgagcacat caagagatat caccggtttt aacggatcca 
aagccaaatc 1740 ttcacacaaa ttgaacttat tcattgacca cgactcatac ccaaccaatt tcatcatcac 1800 gagcctcaag aacacctatg atggcattct tggaatgccc tggattcgag caaaccaccg 1860 gaaaatcgac tggaagaccg gcaacataga tgttgctgaa gttttcactg ccttcactga 1920 tgtggagtcg tcaacgccga caaacccctc tttgggtccc aggacggaac tttggaggga 1980 cactaggtac cgtgacgagg ggatttgtac taatactagt acattagcat ccccgcagag 2040 tgagtcttgt atttttttta aatcaccgtc attgttagaa gcggaagaca agctttttcc 2100 cctactacaa caacagttag acacacccac gccgcccgaa tttatagaca gtacgacgca 2160 agacgacgcc attacaacag taggcttatc agttaccaaa accacaccag agacgaatca 2220 ggttgcggct gttaatccag cctcgtcacg tctgccacaa acccctgcag gtccaatgga 2280 ggagcccaca gggcacgcta ggaatgatga cgagggggcg tgtatttttt caagtacact 2340 aacgcccccg caacgtgagt tcgatatctt acactctcat gctttacccg aaacagctag 2400 caagcaaaat tctcctttga atgacagcct aacggtagca gtcgacgccg caaatacttc 2460 ctggtccacc tcagcacgac tcgcagcaga caagaagaag aaccaactga ctaaaaccgt 2520 agaggactta gtaccaacaa tgtatcacag gtaccttcat atgtttcaga agtcaaaagc 2580 tcaatgctta cctccccgac ggaagtacga cttcaaagtc gaactgatca aagacgcgca 2640 accccaagct agccgcatca tcccgctctc gccagctgaa aacgccgctt tagatgaaat 2700 gatcaacgcg ggccttgcca acggcacgat acaacgcaca acttcaccgt gggcggcgcc 2760 agtattattc actggaaaaa aagacgggaa cttgagaccc tgttttgact atcgcaagtt 2820 gaacgcatta acggtcaaaa acaaatatcc attaccgtta acaatggacc tagtggatag 2880 tctactggac gctgacgaat tcactaaact agacatgcgg aacgcatacg gaaatctacg 2940 cgtagctgaa ggctatgaag agatccttgc gttcatctgc aaagcgggcc agtttgcacc 3000 gttgacaatg ccattcggcc caacgggagc acccggctat ttccagtact tcatacagga 3060 catcttacgc gcacacattg gtaaagacac tgcggccttt ttggatgata tcatggtata 3120 tacaaagaag ggaatgaaac atcaagaatc tgtctttcaa gtcttagaca tcctagataa 3180 acaacaacta tggctcaaac cggaaaagtg tgaattctca cgatcagaag ttgagtactt 3240 aggattaatt atatctcgca ataagatccg aatggacccg actaaagtca aagccgttgc 3300 tgagtggcca gcgccccgga gcgtcaacga actccagcgt ttcattggat tcgctaattt 3360 ttacagaagg ttcatacatc aattctccaa gacgactaga cctcttcaca acttgactaa 
3420 agacaatacc ccatataact gggatgacaa gtgcgacaag gcatttgtag cattgaagac 3480 ggcattcaca tcagctccta tattgaagat tgctgatcca tacaaaccat ttgtattgga 3540 atgtgattgt tctgattttg cccttggggc ggtcttatct caacgcagtg acgacgacaa 3600 cgagctacat cccgtagcat atctttcacg atccctggta caagcagagc ggaattatga 3660 aatatttgat aaggagctct tagcgatcgt agcatcgttt aaagaatggc gacattacct 3720 ggaaggtaac ccaaaccgat tagacgtcat agtctacact gaccatcaaa acttagagac 3780 gttcatgact actaaacaac tcaccagacg tcaggctaga tgggctgaga cgcttggatg 3840 ttttgatttc caaattaaat tcagacctgg aaggcatgca actaaacctg acgcgctttt 3900 gagacgaccg gatttagaac ctgaccaggc ggaaaagctc actttcggac aactcttgag 3960 acctgagaac atcggacccg acactttcac cgaaattgct actatggaat ccttcttcaa 4020 aagtgaagac gtgaaattca acaacgccaa acattggttc gaagtggacg tgttgggagt 4080 atcagacatc acaacggacg agcacctatt agaagaaccg acgctcaacg accaagcgtt 4140 aatcactttg attagggaat caacactcaa ggactctaga ctacaagaac tgattcaggc 4200 tgccgaaaac ccaatatctt ccagcataaa gaaagcggtg gtagcttata aggtaaaaga 4260 cgggattttg tacaatcatg gccgcattga agtacccgac gacaatgaca tcaagtacaa 4320 aatactcaga agccgccacg atagcttgtt ggcaggccac ccaggtaggt ctaaaacgct 4380 gaacctgata cgtcgcagtt tcgtatggcc gtcacaaaag gcgtacgtta ataggtatgt 4440 cgacggctgt gactcttgtc tgagaagcaa ggcaacaaca caaaaacctt tcggatccct 4500 ggaaccgctt cccataccaa ccggcccgtg gctcgacatt agctacgact taatcactaa 4560 actaccaata tccaacggaa aggacagtat cctgaccgtc gtggagagat tgaccaagat 4620 gagtcatttc atcccgtgta acgaggcaat gtcggccgaa gacctagcag acttgatggt 4680 cagattcgta tggcggctac atggaacacc gaggactatc acatctgata gagggagtat 4740 tttcatatct cagataacga aggagttgga caagagactt ggaatacgct tacatccatc 4800 tacagcattc cacccacgca cagacggtca gacagaaatc gtgaacaagg ctatcgagca 4860 atatctacga cacttcgtca actatcgtca ggacaattgg gaatcactac taccaatcgc 4920 ggaattttct tacaacaata aagaccacgc ttctacaggg gtttcacctt ttaaagcgaa 4980 ttacgggttt aatcctaatt ttggtggagt accatctagt gaccaatgca tcccaagtgt 5040 tgaaaataga ttgcaacaat tgaatgaagt acaatccaaa ctgactgaat gtttaaacgt 5100 
tgcacagcaa gagatgaaag cacaatttga caaaggcata agaacaacac ccgagtggaa 5160 cgtgggtgat caagtatggc tgaacagtaa aaatttgtca actactagac cgagcccaaa 5220 gttggatcac agatggatgg gtccttttag catcattgaa aaaatatcac gatctgctta 5280 taaactgatg ttacctgttt ctatgagagg cattcaccct gtgtttcacg tttcattact 5340 taggaaacac aaaccggaca cggtggagga acgacacaca actgaaccac aaccaatcat 5400 cattaacgat acggaggaat gggaggtgat ggagatatta gactgccgta agaaattcaa 5460 caagatggaa tacttagtta gttggaaagg atttggaccg gagcataatt catgggaacc 5520 tcgagtcaat ttgaagaatt gcgccgaatt actacaagac tttgacacaa gatttccaaa 5580 ggcggcagag aagtacaaaa ggacacggcg taagtgagag ggcaagcttt ttccctctgg 5640 gttttttaat gctgcccgtg ggaggaacgc agatcttgca agaggaagat tgggcgtaaa 5700 aagggggata c 5711 // ID Gypsy-113_MLP-I repbase; DNA; FNG; 6761 BP. XX AC AECX01000737; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-113_MLP_; KW Gypsy-113_MLP-LTR; Gypsy-113_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6761 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000737; Positions 51378 44618. XX CC Positions [5573-6085] - Integrase core CC 'CAGTC' target site duplication CC LTRs are 96% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 508..3591 FT /product="Gypsy-113_MLP-I_1p" FT /translation="MAGDVDLNRRNMSNYEINAPVLAPAAVITTKASGKKD FT GKTGTGISAATDTVMADSMTSAEAQGGAGAIVDDQDDVLVYTHGDPSASAN FT NATMEVSGPVISKRGKGLFDLPPVITTGLRSGRVTNSSRHEVRPKGPQGRS FT NHGPSGNHHGIRAPGTPTQEPCGRLGSSIGDPQGSRDHLSRDPSPHKRSET FT SSESTVRSKSQQRIHTIDGTSTSTSRELPPQAQGAISGSSATYRAEPIVPI FT SQGNDGPITTGSYATNRDGARELPSAGRDDVSKKSVKGKEVSKPVPSPSLQ FT SSVIPPIVPVPSVIVSRPELGIVPTSLVSTPITTNSPLNVKVGSRGRASTS FT NINSLSLPPNNIALTKLVELETMLRVNFNTINGQLRSQVESIESFRDEMIS FT VRDLIDDVTETNSMLDSKLAAFEEILEQYGRDQREIANLMHLTSTALEELT FT RDTANMCEAMEQLTLDREIERQSTRELRVEFTRQFQEMKDHIDAVARNVSR FT LQADRNRAVMLETQPIMPKRGLTEEKRPYHTMVGDVFQEDIMSKKDSDIAE FT ISNTSYTNPRNYPKIESYPKFSGKPDEDWSDFVDEIDTFQDSHGMPDSEIV FT SKLPSLLKGIAYTWFRAVNKQHKKQGWEHWKGLIAKKFGGPVWRQRQLTLL FT HQMKFLYNEQDIIQFLTAMHRKIESVYPSASNEDIKEHILMKLPAEVRDTI FT SISTKGMDDISKYLKTCERILTNQKDVNRPSMNTGRRVWRNDTSNNTTVVR FT KDESSIEKKKVTIPLVGESKDRIRNKTCHRCKGPWAPGHTCNKVLNIDEED FT DESSVYSGEDEELVENDAIQGNDTDVMILETEILHGHFLTDNLGQLRDVQE FT AEYVRTPSEVIGVCPARCTAMVHQNEVTIVLDSGAGGSVVSSAYLRKVDPE FT WRNNLVNEDTGKWKGYGSVLKPIGTYVANVVFGHERGNIRAVMKFVVMDNE FT GLPQYFIVGNNNMLLYGMRLHLCDKYFTLGNNLKRKFALTYDKSIPAQQVT FT AVDGDGDKKEKYRDTWYTNTRT" FT CDS 3533..6157 FT /product="Gypsy-113_MLP-I_2p" FT /translation="METEIKRKNTEIRGIPTPEPDTFEKAFSEATWDTELP FT AEDKTLLSKVIRDFPMVFAHGKRQLGEVSVDEFDINLNIDGDKMPTSLKKK FT AYPCSPHKRKDIEDSIQELLDLGVIEEIDRTPRDCVISPVIIQYQNGKKRM FT CGDFRALNDYTVSDVYGMPRIDAILHGLKGATRISLLDGYKGYHQFKNTER FT ASNYLIIITHCGMYRYLRMPFGPKNGPSVFQRTMDKTFSKEIREGWMTIYI FT DDIIIHSRNTEEHVEHLRRVFTKLEQINLTLAFKKCHFAFKSTKVLGHIVS FT GILMSVDDNKVKAISHIQPPRTVKEVQSFLGMCGYYRQYLKNYQLVALSLT FT RLIRRNEAFEWTEERQRAFDNLKQMLQEAPSLSLPDFDKPFIVYTDASFVG FT LGAALHQKQVVEGKEVEVPICFISRSLRNGELRYGATQLECLAVVWALEKL FT HYYLDGSTFEVVTDCLAVKSLLGMKTPNRHMFRWQIAIQEYRGRMTISHRA FT GDKHQNADCLSRNPMPNTPDNPACVAPDDDTEVFGLHIVDLESEFYWSVAE FT GCALCPNMRKIVMLLKKPDGNTNNEIVCSLDEPWKKMFNQGQFIYEDDLLY FT YRKQGSHCLVIHESMKEQIMKLCHDDILAGHFSLDKTLNRVKNTAWWLNYN FT 
QDVIEYISSCDTCQRGNKKTGKTFGLLQEIQKPTKPWEIINMDFVTGLPPA FT GNVSYNSALVIVCRLSKKAKFVPCHKDIDAKGLAHLWWKSALNECGLPSAI FT ISDRDPKFTSEFWTSLMRIAGCDLKLSTAHHPQTDGLAERTIQTMEDLIRR FT YCAFGLLYKDSEGCTHDWVSLLPGLEFAYNSSVHASTGRTPFELERGYIPQ FT NPRMLTNKKLGKLDVHPSAGNF" XX SQ Sequence 6761 BP; 2117 A; 1310 C; 1665 G; 1669 T; 0 other; attgggggtc tggccgaagt tgctaggaat tcatacctac catcacgatt gagttctctt 60 attataacat cgcaatatat ctgcaggtta taataataaa ccgatcaaat cgcgattcga 120 ccttactaat aaagtttgtc gaatccgaat actgatattt ggtttgaaac gaacagctaa 180 agaaagtcat agtaatacac ctcaccacta acaacaatcc aaagccgatt ccttgtaagt 240 cgagcttggg agaatcggtt agttataaaa actgacagat tctgattgtc ctgtggtagt 300 acttgtccca tcgatcttgt tgaacttcga ttagaacaac atctagccga aaaacacagt 360 tttttacatt acaggttcgt ttgttcattg gggataagta cagaagtacg aatagaaact 420 gactaactga ttcacctttt caaatccgat cgtttcattt tccttcatac tttttactct 480 ttgcatccgc ttaactcgac catcataatg gctggtgacg ttgatctcaa tagacgaaat 540 atgtccaact atgagatcaa cgcacctgtt ctggctcctg cagccgtcat taccactaaa 600 gctagtggta agaaagacgg taaaacagga acgggcataa gtgcagcgac agacactgta 660 atggccgatt ctatgacatc tgcggaagct cagggaggag cgggagccat cgtcgatgac 720 caagacgacg tattagttta cacccacgga gatccgtcgg cgagtgctaa taacgcaacc 780 atggaggtta gtggccctgt gatatccaag cgaggtaagg gattgtttga cttacctccc 840 gtgataacca caggacttag gtcaggaagg gttacgaact cttcgagaca tgaagttcgc 900 ccgaagggac cccaaggacg atcaaatcat ggtccttcgg gaaaccatca cggcatacga 960 gcgccaggta ctccaactca agagccatgt ggccgacttg gaagctcgat tggtgatcct 1020 caaggctccc gagatcacct atctagggac ccatcacccc acaagagaag tgagacctcg 1080 tccgaatcca ccgtccgctc caagagccaa caacggattc ataccattga cgggacatca 1140 acctcgacaa gtcgagaact acccccacaa gcgcaaggtg ccatatccgg tagttccgca 1200 acatatcgag ctgaacccat cgtacccata tcccaaggga acgatggtcc cattacaacc 1260 ggtagctatg caaccaacag agacggagca agagaacttc caagtgccgg aagggatgat 1320 gttagtaaga aaagcgtgaa ggggaaagaa gtttctaaac ctgtaccttc accgtcgtta 1380 caatcatctg tcatacctcc tattgttcct gttccttctg ttatagtgtc taggcctgag 1440 
cttggtattg ttcctacatc ccttgtatca actcctatta ctacaaatag cccacttaat 1500 gtgaaagttg gatcaagggg gcgtgcgagc actagcaata taaattcact atcgttaccg 1560 ccaaataaca ttgcactaac aaagttagtt gagctggaga cgatgctgag agtgaatttt 1620 aacacaataa atggtcagtt gagatctcaa gtcgaaagta tagaatcatt tagagacgaa 1680 atgatttcgg tgagagactt aatcgatgat gtaactgaga ctaatagtat gctcgatagc 1740 aagttagctg cttttgagga gatattagaa cagtacggta gagatcagag agaaattgcg 1800 aatttgatgc atcttaccag tacagcgttg gaggagttga ctcgtgacac tgctaacatg 1860 tgcgaggcaa tggagcaatt gactctggat cgggagattg agagacagag cactcgagaa 1920 ttacgagtgg agtttacacg tcaatttcag gaaatgaaag accatatcga tgctgtagca 1980 cggaacgttt ccagattaca agctgataga aatagagcgg taatgttgga aacacaacca 2040 ataatgccaa aacgcgggtt aacagaggaa aagagaccat atcatacgat ggttggggat 2100 gttttccagg aagacatcat gtcaaagaag gactcggata ttgcagaaat atctaatact 2160 tcatacacga atccgaggaa ctatcccaaa atagaaagtt atccgaagtt cagtggaaaa 2220 ccggatgagg attggtcaga tttcgtggat gaaatcgata cattccagga ttcacacggt 2280 atgcctgatt ctgagattgt ttcgaagcta ccatcgttgc ttaagggtat tgcttatacg 2340 tggtttcgtg ccgtgaataa gcaacataaa aagcaaggat gggagcactg gaaagggctg 2400 atagcaaaga aatttggcgg accggtatgg aggcaacgac aattaacgtt gttacatcag 2460 atgaagtttt tgtataatga gcaggatatt attcaatttc ttacggctat gcataggaaa 2520 attgaatcag tatatccatc agcgagtaat gaagatataa aagaacacat cttgatgaaa 2580 ctgccggctg aggtacgaga cacgattagc attagcacta agggtatgga tgacatatcc 2640 aaatacttga agacgtgtga gaggatattg acgaatcaga aagatgtcaa tagaccgagc 2700 atgaatacag gacgacgggt atggagaaat gatacttcta acaataccac tgtcgtgaga 2760 aaggacgaga gtagtataga aaagaagaaa gttaccattc cgttggtagg agaatcaaaa 2820 gacaggatta ggaataagac ctgtcataga tgcaagggtc cttgggctcc gggacatacg 2880 tgtaataaag tcttgaatat tgacgaagaa gacgatgagt catccgtata ttcgggagaa 2940 gacgaggagt tggtagaaaa tgatgctatc cagggaaacg atacagatgt aatgatcctg 3000 gaaacagaga tattacatgg acatttccta acagacaatt tagggcaatt acgcgatgta 3060 caagaagccg aatatgtacg aacaccaagt gaggtgatag gcgtatgccc ggcgaggtgc 3120 acagccatgg 
tacaccagaa tgaggtcaca attgtgctag attctggggc aggcggaagt 3180 gttgtgtcca gtgcctattt acgaaaggtt gatccggaat ggagaaataa cttggtaaat 3240 gaagacaccg ggaaatggaa gggttacggc tcggtgctca aacctattgg aacttatgtt 3300 gcaaatgtgg tgtttgggca cgagaggggt aacatccggg cggtaatgaa gttcgtagta 3360 atggataatg agggacttcc gcaatatttt attgtgggaa ataataacat gttattatat 3420 gggatgcgac ttcatctttg tgataaatac ttcacgttgg gtaataactt gaaacgaaag 3480 tttgccttga catatgataa aagcatccct gcacagcaag taactgctgt cgatggagac 3540 ggagataaaa aggaaaaata ccgagatacg tggtatacca acaccagaac ctgacacgtt 3600 tgaaaaagcg ttttcagaag caacatggga tactgagcta cccgctgaag ataagacgtt 3660 gttatctaag gtgattcggg attttccaat ggtgtttgct catggcaaac gacaacttgg 3720 agaggtgtcg gtggatgaat tcgatataaa cctgaatatc gatggagata aaatgcccac 3780 tagcttgaag aagaaagcat acccttgtag tccacataaa cgaaaagata tcgaggacag 3840 catacaggag ctgctagatt tgggggtgat tgaggaaatc gaccgcacgc ctcgagactg 3900 tgtcatctcg cctgtgatta tacaatatca gaatggcaag aagaggatgt gtggcgattt 3960 tcgggccctg aatgattaca cggtctccga tgtgtatggt atgccgagga ttgatgcgat 4020 attacatggt ctgaaaggtg caaccaggat atcgttattg gacgggtata aaggatacca 4080 ccagtttaag aacacggaac gagctagtaa ttatctaata attataactc actgtgggat 4140 gtatcgatac ttgaggatgc cctttggtcc caagaatgga ccgtcggtgt tccaacggac 4200 aatggataaa acctttagta aggaaatccg cgaggggtgg atgacaatat atattgatga 4260 tattatcatt cactcgagaa acaccgaaga acatgtggaa catcttcggc gagtttttac 4320 aaaactcgaa caaataaatc taacgctcgc ctttaagaaa tgccatttcg cgtttaaatc 4380 aacaaaggtg ctaggtcaca tagtgtctgg tatattgatg tcggttgacg acaataaagt 4440 gaaagcgatt agtcatatac aacccccacg cacggttaag gaagtgcaga gcttcctggg 4500 tatgtgtggg tattataggc aatatcttaa gaactaccag ctagttgcgc tctccctgac 4560 tcgattaatt cgacgaaatg aagccttcga gtggacggaa gagcggcaac gagcttttga 4620 taacctgaag cagatgctgc aagaggcgcc atcgttatca ctacctgact ttgataagcc 4680 atttattgtc tacaccgatg cgagtttcgt aggattaggg gcggccttac atcagaaaca 4740 ggttgttgaa ggaaaggagg ttgaggttcc aatatgcttc atctcgcgat ccttgcgaaa 4800 tggcgaactg cggtatggag 
cgactcagct cgagtgcctg gcagtggtgt gggcattaga 4860 aaagttacat tattacctag atggtagcac gtttgaagtg gtaacagatt gtttggcagt 4920 aaaaagttta ctgggaatga aaacaccgaa caggcacatg tttcggtggc aaatagctat 4980 ccaagagtac aggggacgga tgactattag ccaccgcgcg ggtgataaac accaaaacgc 5040 tgactgcttg tcgcgcaacc cgatgcctaa tactcctgac aatccagcgt gcgttgcacc 5100 ggatgatgac acagaggtat ttggactgca tattgtggac ttagaaagcg aattctactg 5160 gagtgttgca gagggatgtg ctctgtgtcc caacatgagg aaaatcgtaa tgctgttgaa 5220 aaagcctgat gggaatacga ataatgaaat cgtatgttca ttggatgagc cttggaagaa 5280 aatgttcaac caaggacaat ttatctatga agatgatctc ctatactacc gtaagcaggg 5340 gtctcattgc ttggttattc acgaaagcat gaaggagcag attatgaagc tctgccatga 5400 tgatatactc gctggtcatt ttagtctgga taagacgttg aaccgggtga agaacactgc 5460 atggtggttg aattacaacc aggatgtaat tgagtacatc agttcctgcg acacttgcca 5520 gcgaggcaac aaaaaaacgg gaaagacgtt tggattacta caggaaatac aaaagccaac 5580 aaagccatgg gaaattataa atatggactt tgtgactggc ttgccaccag caggcaatgt 5640 ttcttataat tcagcgctgg taatcgtatg tcgcttgtcg aaaaaagcga aattcgtacc 5700 atgtcataaa gacattgatg cgaaaggact cgcccattta tggtggaaga gtgcgttgaa 5760 tgagtgtgga ttgccatcgg ctattattag cgaccgggac ccaaaattca cgtcggaatt 5820 ttggacctcc ttaatgcgga tagccggatg tgatcttaag ctgtcaacgg cacaccatcc 5880 ccaaacggat ggactcgcgg agagaaccat tcagacaatg gaagatttga taaggcgata 5940 ttgtgcattc ggcctgctgt ataaagacag tgaaggatgc acacatgact gggtatcact 6000 cttgccaggc ttggagtttg cttataacag cagtgtccat gctagtacgg gtagaacccc 6060 gttcgaactg gaaagggggt atatacctca aaatccgcgg atgctaacaa ataagaagtt 6120 aggaaagctg gatgttcacc catctgcggg aaacttttag catatgcagg aacttgcgcg 6180 attacatgcc gccgattgca ttacaaaggc ctttatgtat gaaaaacagc gatgggacaa 6240 aacacatact gttccgcctt ttaaagcagg agatcaagta ctattgtcga ctgtccattt 6300 taataatctg aatggcaatc caaaattaaa agaccctttt ataggaccat tcacgattgt 6360 taagatggta ggaacaaatg cagctgagct ggatttacag ggggcttatt ctcggcgaca 6420 tccggtattc ccagtgtcgc tgatgaagtt gtatatctcc tccgattccg ataaattccc 6480 gaggagaacg gtaaacaaaa aagcgactcc 
ggaattaacg gaagaggaag gtgtgataca 6540 acgagtatta caacaacgag tagtgactaa aggtaataaa aaagtaaggc aattccttgt 6600 ctcttttaaa aaccggtcac cggacttggc gcggtgggtc catgaagacg aaattcctaa 6660 cggaaccacc ctactgagac gattcaggaa ggaggcgaga gaggaaaaac tggtaaaaaa 6720 atgatgtaga catttttttc ggtttttttt gtctgaggga g 6761 // ID Copia-7_MLP-LTR repbase; DNA; FNG; 913 BP. XX AC AECX01000123; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-7_MLP_; KW Copia-7_MLP-I; Copia-7_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-913 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000123; Positions 22575 21663. 
XX SQ Sequence 913 BP; 212 A; 158 C; 196 G; 347 T; 0 other; tgttaagata atatcttgtg tagtggttaa gatactatct taggtagtag tcaaagatca 60 tttacgatta aggaaattgt ggatttggaa gaagaggttg attagttggt tggtgaacaa 120 tcggttaagc tagtgtggaa tgtaaggcaa gcggttagtt gtacaaaggg ttggagattg 180 gttatgcatg aatcaagggg tacaggaatt ttgtgaattg ctatatcttt tgttgtgttt 240 catttctccc acttgttctg attctttcct tcgtcaaaaa gaatcagtaa gtcatctttc 300 attattgttt cattttcact ggtactcaat ttaagataac taattttctt attctgttcc 360 tattcatctc acaagattat tagttgtgct tgttcatctt actcattttc cttacgcgtg 420 tccttgttgt tattcgagct cgcaggtaca aatcttcttc gtagctcaag tttctttcta 480 tagagtgtct gacggtgtta ttttctgttt cttctttgga tatgttggtt gtgtaggtct 540 gttcttgttt ctcgtcataa gactagcagt gaagctatag tctgacagga acctgtgccc 600 tcaggtgtgt tgaaaagaat caattattag ttgtgcttgt tcatcttact cattttcctt 660 acgcgtgtcc tcgttgttat tcgagctcgc aggtctgttc ttgtttctcg tcataagact 720 agcagtgaag ctatagtctg acaggaacct gtgccctcag gtctgttctt gtttcttgtc 780 ataagactag cagtgaagct atagtctgac aggaacctgt gccctcagga ttctgttttc 840 attactatag gtttcacttc tgaaagtctt acccagttcc aactggtgac tcgagatcaa 900 ggacgtctga aca 913 // ID Gypsy-2_GDe-LTR repbase; DNA; FNG; 301 BP. XX AC AEFC01000241; XX DT 12-MAR-2011 (Rel. 16.03, Created) DT 12-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Geomyces destructans genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_GDe_; KW Gypsy-2_GDe-I; Gypsy-2_GDe-LTR. XX OS Geomyces destructans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Leotiomycetes incertae sedis; Myxotrichaceae; OC mitosporic Myxotrichaceae; Geomyces. XX RN [1] RP 1-301 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Geomyces destructans genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AEFC01000241; Positions 43944 43644. 
XX SQ Sequence 301 BP; 73 A; 78 C; 78 G; 72 T; 0 other; tgtaaggatg cctccttact cgggttagtt acggtcgtca cgtggcgccc tcgagactcg 60 gggccattac ggttcaatcg caactgcgag ccttaggcac atagcgtagg gaaaccttaa 120 ggcaaacagg gagttagcag agtcggaggg gtttcgggta tttaagggaa caggaacaga 180 gggttatacc ttcatctttt ttgtatcctt cgtagtagac ctctaggata gaatacagca 240 tcacctccga cccgcactcc cacatgcacc tacagagagc cgtattccct gtgcccttac 300 a 301 // ID Gypsy-2_PCR-LTR repbase; DNA; FNG; 848 BP. XX AC AADS01000313; XX DT 30-JAN-2011 (Rel. 16.02, Created) DT 30-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Phanerochaete chrysosporium genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_PCR_; KW Gypsy-2_PCR-I; Gypsy-2_PCR-LTR. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-848 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Phanerochaete chrysosporium RT genome."; RL Direct Submission to RU (30-JAN-2011). XX DR Genome; AADS01000313; Positions 24796 25643. 
XX SQ Sequence 848 BP; 141 A; 334 C; 196 G; 177 T; 0 other; tgatgcgccc cgcgcgccgc tgcgcccgcc gacgcgtcag aacgcatgtc tcccgcacct 60 cgcatcgtca gccgataaag ccgaagccaa gacgactgca ccatgtgctc gcccgaatcc 120 tcccgaacgc gtcgcaaccg acacgtcagc gccccgagtc acgcacccga tgccgaaccc 180 tgccgagcgc ctcgccacat gggtcccacg ccgcgtccga gcagcgcctg ctcggcctaa 240 gtattcgccc gcgcaccaga cctcgctgcc cgctgcgagc ccccctccgt cctcgcggta 300 tcgtcgccga catcgcgcag gcctcgttaa gcggtttcac cgccgtcgga gagtgtagga 360 acatagtact gtatagttct gacgtagggt ggctgagtag aatagtataa aatatcccct 420 ctgtagtaga cagaatcgaa caagttcccc tctcgcccta agtcttacaa gcgtgtctgc 480 gaacatcgcc cagcgactcc tctgcgaccc gcgactctgc ctgcgaactc gtccctgcga 540 tccttgactt cgcccgagca cccgagccga gcgagcgacc tcgcgagcct cggcctcttc 600 ctcgcgtctt cctctcacct ttctcgcctt ctaccaccca cttgcccgct gaccgcggct 660 cgcgtttagt gttactaata gctctatttc cttttattct aggtacattc ggtcgcctcg 720 gtccccgtag tcgcgctcac ctctactcct actgctcgcg cgccctcggc cgacgcctat 780 cccgttcccc tccatctccc tctctacgtt ctccctttct ctcggcctaa ccgcgcgcaa 840 gcgcatca 848 // ID Gypsy-25_LBS-I repbase; DNA; FNG; 18796 BP. XX AC ABFE01002558; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_LBS_; KW Gypsy-25_LBS-LTR; Gypsy-25_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-18796 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01002558; Positions 19155 360. XX CC Positions [2880-3239] - Integrase core CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 162..3971 FT /product="Gypsy-25_LBS-I_2p" FT /translation="MDDQFSEEETRNLYLTKYLKHEKEKMTDKAASTVLVN FT YLEKYYSKQSQESGTASTVETMMNTIYNHDGKAQPSEVEGEHPATREVLVT FT KKYKPVAQKIRPVFQDLPDKFRIIRDIKGDPMDTLPKLNRNPPKFTPTGRY FT TEERREQFDKVHDGDFLWPEERKLLHYFMMENNEAFAWDDSERGSFKTEYF FT PPVDIPIIPHTPWVLKNIPIPPGLYQEVCRIIKTKLDAGVYEPSNSSYRSR FT WFCVLKKDGKSLRLVHSLEPLNAVTIAHSGLPPATDALATHFAGRACGGMF FT DLYVGYDERLLAETSRDLTTFQTPFGALRLVTLPMGWTNSVPIFHDDVTYI FT LQEEIPEVTVPYIDDVPVRGPATRYELPDGKYETIPENSGIRRFIMEHFEN FT ANRVVHRMKYAGGTFSGFKSVICAAEITVVGHLCTYEGRKPETERMKVIDL FT WGPCKDLHDVRAFLGTVGVCRMFIENFAKKAEPLNDLCKKGAPFVWGPAQE FT KSMDKLKSSLRESKALVPLDYENNPNPVVLAVDTSWKAVGFYIYQDSHTTG FT LRQYARFGSITLNDREARFSQPKRELFGLMRSLQAASYWLLGCRKLIVETD FT AKYLSGMLRNPEMGPNATINRWIDACLMFHFKLQHVQGKTFGADGLSRRNA FT QPGDDVYPPSDEHLDEPIGVIEVLPDPDGGEPPLDFETFKDQIDSRGGYIQ FT QLALDISDFKDELDREIAMTQKIAKDVKTRVDNDPDHTKSSPEQKEFLRTY FT MISTLTSPGLEARFDELETEQPYPETHRSFIGQSQDEFLLILREWLKNPKK FT IEGYSDKKYGAFVRFAKSFFLDKEDRLYRRSIESRHKLVVDKSHRMYMMRS FT AHDAMGHRGAYATQNMLTERFWWPEIERDVHWYVKTCQRCQERTKRVVEIP FT PTVTHTPSIFQRLHADTVHMTPKSNGCKYIVHGRCALSSWVEGKPLKSETA FT RTVGLWLFEEVICRWGCLEVIITDNGSVFLAAVKWLEEKYGIKGIRISPYN FT SKANGRIKRPHWDIIQMLTKCCGPKNLAKWYWFFWQVLWADRIMIRKRFGC FT SPFFMVTGAHPILPLDVQEATWLVKLPDRMLSTAELIGYRAKALAKHRQHV FT AEMRERVTLEKVKRVAKYELSHRHKISGRMINPRDLVLIRNSQVSSSLNSK FT MQPRYLGPMIVVRRTKGGSYIVCELDGSVNHNKIGAFRVIPYFARERIDLP FT EHFSTLIDASNEELDEIENTGVEERPNARDYYFEGVNLDGSSADESGDLEE FT IEDLSEEEA" FT CDS 16536..17981 FT /product="Gypsy-25_LBS-I_3p" FT /translation="MPDPSSSSIEEPAPTGRSSHRTMPLPGNKGALFFDKE FT KPIELLRFLDQMEDLFAEYGINSDVEKKKKLGKYTDQRTEFEWKAFKTFED FT SSFDKFKKALIADYPDARNAGKGTLTGLRAVCRENSRLAEKNLTELKILTR FT SFRAQQKLLMAPPVLVSNRELVDMFLGCLTDSFADQVKASLNIEQTRDRKQ FT KGNEDETTARRTEDPYDIIDIIDMAETIAGRLTGDSQDHPSNARAVLTGRS FT VQAREREIKTEHDEFEELRSITATFLDQIKVDQKQNSELRNMVQNMQIEMK FT KLLETLVHQQPNISNPQSKVASYKMTSDGCWYCDEPGHFTSNCPHREAHVA FT QKKIKLLGNRMYFTHNNTAVPRGNGEKSVRQIVEEASRQNLTLQNNMFAEP FT 
GEVFSQEAVVPGIIRLPDNNNGNEISVFTNQMRDTRDDVILNMNNQVLKLT FT DMLANLVTNPNKTRDVDASQFAVTCRSQTETQGQSGN" XX SQ Sequence 18796 BP; 5766 A; 4279 C; 4287 G; 4464 T; 0 other; ccagtaatga tgaaaatgca cactgaagag gatcttcaac cgaaaactga gaaagttttt 60 caaccctcga tgatttgatt gaaaaagatc aaggagaaat cgcgatagct ttcgaagaag 120 acggtgcggg aggatataaa gtcctatcgt acgccttacc aatggacgat caattctctg 180 aagaagagac taggaatttg tatcttacaa aatacttaaa acatgaaaaa gaaaagatga 240 cagacaaggc ggctagtacg gttctggtaa actacttaga aaaatattat tctaagcaaa 300 gtcaagaatc gggcaccgca tctacagtgg aaaccatgat gaatacgata tataaccatg 360 acggtaaagc gcaaccgagc gaagtggagg gtgaacatcc tgcgacaagg gaggtactgg 420 taactaaaaa gtataaacca gtcgcacaga aaatccgacc cgtctttcag gacctaccag 480 ataaatttcg aattattcgg gatataaaag gcgatccaat ggacaccttg ccaaagctga 540 atagaaaccc accaaaattc acacccacgg gacgctatac tgaagagcga agagagcaat 600 ttgacaaagt tcatgacggc gatttcttgt ggccagagga gcgcaaacta ctacattatt 660 ttatgatgga aaacaacgaa gcctttgctt gggatgatag tgagaggggg agcttcaaaa 720 cagaatactt cccacccgta gacattccaa taattccaca tacgccgtgg gttttgaaaa 780 atattccaat acctcctgga ctttatcaag aagtgtgcag aattattaag accaaattgg 840 atgctggggt atatgaaccc tcaaattctt cttataggtc aagatggttt tgtgtattaa 900 agaaagatgg aaaaagtttg aggcttgtac atagcctgga acctttaaat gcagtgacta 960 tagcgcattc agggttacca ccggcaaccg atgcacttgc tacgcatttc gctggtcgtg 1020 cttgcggagg aatgtttgat ttatacgtag gatatgatga aaggctcttg gcagaaactt 1080 ccagggatct gactacattt caaacacctt ttggagcttt aaggcttgtc actctaccca 1140 tgggttggac aaattccgtt ccaattttcc atgatgatgt tacgtatatt ttacaagaag 1200 aaattcctga agtaactgta ccatatattg atgatgtgcc tgtcagaggt cctgcaacaa 1260 ggtatgagtt gccagatgga aaatatgaga ctatacctga aaatagcggt ataagaaggt 1320 tcataatgga acacttcgaa aatgctaaca gggtagttca taggatgaaa tatgctggag 1380 gaaccttttc gggatttaag tcggtaattt gtgcggctga aataacagta gtaggccacc 1440 tatgcactta tgaaggcaga aaaccggaaa ctgaaagaat gaaagttata gatctttggg 1500 gaccttgtaa agacctacat gatgtcaggg cttttctagg aactgtaggg gtctgcagaa 1560 tgtttattga 
gaattttgca aagaaagcag aacctctcaa cgatttatgc aaaaagggag 1620 ctccttttgt atggggaccg gcacaagaaa agtctatgga caaattgaaa tcctcacttc 1680 gcgaaagtaa agcgctcgtg ccgttggatt acgaaaataa tcctaaccca gtggtactag 1740 ctgtagacac ctcttggaaa gcggttgggt tctacatata tcaggatagc catacgactg 1800 ggcttaggca gtatgccagg ttcggttcta taactttgaa tgatagagaa gctcgatttt 1860 ctcaaccaaa aagagaatta tttggtttaa tgcgatcttt acaagctgca tcatactggc 1920 ttctaggatg tcgaaaacta attgtggaaa cagacgcgaa atacctgtct ggtatgcttc 1980 gaaaccctga aatgggaccg aatgcaacca tcaacagatg gattgacgca tgtctaatgt 2040 tccattttaa gttacagcat gtccaaggaa agacctttgg cgcagatgga ttatcacgca 2100 gaaacgctca accaggtgac gatgtgtatc caccttcaga tgaacatctc gacgaaccca 2160 ttggtgtcat tgaagtgtta ccagatccag atggagggga gccccctttg gatttcgaaa 2220 ccttcaaaga tcaaattgac tcaagggggg gatatatcca acaattggct ctagacatct 2280 cagattttaa ggatgaacta gacagggaaa tagcaatgac acagaaaata gctaaagatg 2340 tgaaaacaag agtggacaat gatccagatc atacaaaaag cagcccagaa cagaaggaat 2400 ttcttcgaac ttacatgatt tcgactttaa cttcaccagg cctggaggcc aggtttgacg 2460 aactagaaac agagcaacct tacccggaaa ctcataggtc attcattgga cagtcacagg 2520 atgaattcct tcttatacta agagaatggc tgaaaaatcc taaaaagatc gaaggatatt 2580 ctgataaaaa gtatggcgct ttcgtaagat ttgcaaaaag tttcttccta gacaaagaag 2640 atagattata tcgtcgaagc attgaatcac gtcacaagct agtggtcgac aagtctcacc 2700 gaatgtacat gatgagatct gcacacgatg ctatgggaca taggggagca tatgcaacgc 2760 agaatatgtt aacggagcgt ttttggtggc ctgaaatcga aagagacgta cattggtatg 2820 tgaaaacatg ccagcgttgt caggaacgaa ccaagcgtgt ggtggaaatc cctcccaccg 2880 tgacacatac accatctatt ttccaacgac tacacgctga taccgtacac atgactccga 2940 aatcaaatgg atgtaagtac atcgtccatg ggaggtgtgc actgtcaagc tgggttgaag 3000 gaaaacccct aaaaagtgaa actgcaagaa ctgtcggctt atggcttttt gaagaagtca 3060 tctgcagatg gggttgtcta gaagtaataa taactgacaa tggttcggtt ttccttgctg 3120 ctgtaaaatg gctcgaagaa aaatatggga taaagggtat taggatatct ccgtataatt 3180 ctaaggcaaa tggcagaatc aaaaggcccc attgggatat tatacaaatg ctcacaaagt 3240 gctgtggacc caagaacctt 
gccaagtggt attggttctt ttggcaagta ctttgggctg 3300 acaggatcat gattagaaaa agatttggtt gttcgccgtt ctttatggta acaggcgcac 3360 atccaatatt acctctcgac gtacaggagg ctacctggtt agtaaaatta ccagatcgta 3420 tgttgtccac tgcagaattg attggctata gagccaaagc attagcaaaa cacaggcagc 3480 atgtggcgga aatgcgtgag agggtcacct tagaaaaagt gaaaagagtg gcaaaatatg 3540 aactaagcca ccgccataaa atcagtggtc gaatgataaa cccgagagac ctggtgctta 3600 tcagaaattc acaagtatcc tcgtcattaa atagtaaaat gcaaccaaga tacctgggac 3660 ctatgattgt ggtacgtcga acaaagggag gttcatatat agtttgtgaa ctggacggat 3720 cagtcaatca caataaaatt ggagcgttca gagtcattcc gtatttcgca agagaaagga 3780 tcgacttacc tgagcacttc agtacactta ttgatgcaag caatgaggaa ctggatgaaa 3840 tcgaaaacac aggcgttgaa gaaagaccta acgcaagaga ttactacttt gaaggagtta 3900 atcttgatgg gtcttccgca gatgaatccg gggacttaga agaaatcgaa gatttaagtg 3960 aagaagaggc ttagtagact tatgctttaa aattggctaa aaagtttaat aaaaacctcc 4020 agcagtttat aaacataatg cgcaagcgcc ttctagattt tccatgcgta gtcaactata 4080 catgcgggct ctaggtgtcc gtctacaaaa aagttcggat agttctacta ctccttagtt 4140 acttcacgct cagcaatctg ggcaagggca atttcgtaat cagcgacaag tacatcgcga 4200 agacgaagct ccaagtcgag gcgctgcatg ctaagaagaa cgcgttcacg aagatcgacc 4260 tcttgctgct tgaaataagc aagggtggga taagccggca tgcggttggt aacagcagca 4320 tgtgcgcgcg ctaagatctc cctgtacaga gggttgtcga tgagcaagga gttggcagac 4380 ttgtctataa caaaggacga gcgcgttagt ttacgattta tgaagataaa agataaatac 4440 gaactttgac gagactgaag atctgctaaa ccactaataa agggcggttc agtacgaggg 4500 cgtttgtcgc cagcgcaact gcctttggca tgctcgacga cgagggcctt gtgaaattcg 4560 cgactgttag tgtccatgcc tggaacagcg tcatagggaa tgctgaccgt ttcggcacgc 4620 ttggatttgg ctggggtctg taagttaaca acgtggcagt ggactcacgc aataatagca 4680 gcttaccttg ggaccagagg cagcgcgctt cttgcctttc tctttgggag aggaagcagc 4740 gtccttgact tcttcgtcac caaagaggtt gtccactgcc atctgagcaa agcggtacgc 4800 aatgttaaaa ttgtgttctg ctcaaaagaa aataaaaaat aacacacggt gtcagctgca 4860 gtgatagcgt tggggccatc acacccaccc atcaatccga ggggccgggg acaatctcga 4920 cctcgtcatc cttgtcctca acgacagcct 
tggacttgac aagcttgcgg ggacgctgtt 4980 tcaaaatgtg tccgcgctgc gttaataaaa tgagaaaaga tgaacaaaat aagtataaac 5040 gtgccttggc tttagcggcg gccattacct ctgcaatctc agcctgagcc tgagcttgag 5100 cagtaacagg tggtggaacg acctgcgcag cggcgcgcgc ggcgagaacg gtgcgacaat 5160 attctttgat agtgtcgagg tcggctgggg gcgcgatggc gtgaactcca gtgctgcggg 5220 tattgtactc gcctctaaca aggaaaagca cgttgtgcat tggctggtgc ataagcagac 5280 acggatcctt ggtgtcgata agctgaacct agtcaagtgc gcagtattag cctttgttcg 5340 aagaaataga gaatacaaag tttataacag aaaaattcac gacttacaca gtaattagcg 5400 gctacgccaa gaatggtggg agcggcagcg ctgttcagtc cctctttcat gtcctgataa 5460 gttttgaagg cagtgagagc ggactcgcga agcttgacag cggcgggaag agcgttgtgt 5520 ggaatagaca taatgagcga gtggtaaaat gagagtgcgt ggtagaaaag caaaaaactg 5580 ttctgtccga aaatttttcg ccactgcctg cttttatatg tgtgtctgag accaaggcat 5640 gcgcgttccg aaattccgcg ttgcaattca cttgaagcat gaaaaatttg gagttgcgcg 5700 ttccagcacg atcacgtcta aaaatagaat aaactttccg agccttggat ttcccgcgcc 5760 acgtgaccta tgaacttagt cgttttgact gaggacagtc aatatttagt gagggggagg 5820 gatcaaggct accttgatag ataaaaaatt cgatttcatg attcactaaa accaccaagg 5880 ggaaaagagt cataaaaacg cgaactagag acaataaaac cctaaggcag aatgaatttg 5940 atagaaaaat gcgaaataaa gtcgaaaatt cacaagttca cactctagaa agcatgcaaa 6000 aaggctctca cttaggaaca aatctcaaag ttcaagaaag acaaagtcaa gattctacta 6060 taaggacagc agaatctaac tacaaaagat gatgacacaa aaaggttgat atagaaagtc 6120 caaaaaacta agatgtccga atgcatgcca agtaaaagag tccgtaaaaa caagcgcacc 6180 tactcctcgt caacaagctc gtcgataacc tcctcctcct cctcctgaga atgggcggaa 6240 ggtggaagac caggcgaagc aacggcgata gaaggaataa gggacagcga gcggacgcgg 6300 cgttgacgag gtgtcagaga tgaaggccca gcaatgggag gagaatcgtc tcgctgaaaa 6360 agtgagacga gtctctccaa atcggccgtc gttgaacccg caggcactgg gaccccctga 6420 gtatctacaa agtcttgaac gaagctgtac cacgagccga actccttgcg agtctcgaaa 6480 acagtgccta gaagagttcc aggctcggcg gtgagacaca ggtccgcgat ggcgtttgcg 6540 aactctcgaa gattggcgtc gcgacgatgg cgaatggcct caaggactgc ggatagggtg 6600 cgcatatgaa gcgtgtcttc ttcgatggac tccatcagcg 
cagatatagc tagaaaaata 6660 agtgaaaagt cattaaaaag actataaaat aaaaataaaa taaaataaat catacctgaa 6720 ggcgcatgaa tagccaaccg gttacggatg acatccctga cggcccctcg gcgttcatca 6780 ttcagatagt actcacatga agaaacatgc gccgcgtcgc aaggaccgca gggagtgccc 6840 cacccacgaa aggtgcagtt cttgacagcg gaatccttgc gcgcagagca ctgaaggcaa 6900 ggttcgggca tctaataaaa caggtatcag tacctatgaa aactcagaca aaaatgagag 6960 gaggagacat acatggattt gtgtaccctt aagaaccgcg gggagctctt taagcccaaa 7020 tctagcagtg gcacgttcta aaaagaaagt cagcagggta accaaaatac tagtaaaata 7080 acgacttact aaatcgcttc ttgtcgaaca cgatgggctc ttggaaaata atgggagctg 7140 gaaaaaaccc gaattataaa gttaaaaagc aaattgagaa agagaaaaag cttacacttc 7200 ttacgagggc gaccgcgacg cttgggttga acagaagttt ccttctgctt gaccaccggt 7260 ggcttaaagg cagtacgctt ggcggaacta cgagtagtag gaggagtcct cgcccgagcc 7320 tttcccttgt caactcgaga tgaacctgca acaggcggag gcgtcttggc gcgagccttg 7380 cccttgtctg catgcaaagg cctagcgaga ggcacagagt ccgacgagct cgaacgggaa 7440 cggttgggat gttggatggc ctcgtcatca gacgagggtt tctcaataat cttgcgtcgt 7500 ttacctggct aagaaaccaa aaaagttagt aaaaatgtct aagcagagga aaatgaagaa 7560 acttacaggg ccaatagcca gcttcaaggg cgttgaagac ggaggatcga tgatctcgac 7620 cacatccgaa tcttcgtcca tgaccatggc acgagtacac ttcgaccttc gaacagggac 7680 aggcggcgtc gagtcaggcg aggaaactgg cagggactct gggccagggc tcggtagaaa 7740 caacagatcg gatcccgtag gagaagggga ctgaacgaca cgagctcgac ctcggggagt 7800 aatgatcccc ggaggtgacg aagacccgcg gtcgctcggc gcgtcctcct ctcctatgta 7860 ccccgtggta aagctggagt ctaggcattc gccctagaaa gataaaaatc attagaagaa 7920 aaataaaaag aaagaacgga agaaagttac ttacagggct ataaagcgca tgaccatcca 7980 atggttcttg cagccattcc tcccaggcag tactttggtt gcgttcttca acgcgttggg 8040 cggcattgta aacggacgcc agtaaacgag actcgtgtgg cgtaggacca ccaggacgag 8100 cgaggatgtc taaaaacaat gctaactaag gcaataaaaa gtcgaaaatc agcagagaat 8160 aaaagaaaag aacagagact caccatataa ctatgcaggc ggtatagcat agccataatc 8220 tcggcctcag acatagcgtt gaggcgggaa cgcgagaaga ggtcaatgcc gacttcgaac 8280 gcttgaacgg cggcgcgaca accaaggaat gcggtaggca tggtggggaa 
agttagaaag 8340 aaattgtgac aatcgctccc ataaagcatg aagtttatag gaacgtcagg cggaacatac 8400 ggaatagcga ggcactttcc gaagaatagt ctacgggaga agattgctga gaatccctcg 8460 gttccgtcga ccctgaaaat cttaatggac gtcaaaaacg ttaatcaact aaggcgtacc 8520 taggcaacaa taggaatatg atgtcatgtc ttcgaagaat aacgcgcctg cactagtaat 8580 agtaattgac gccgaaagat aaaagacacg cttgacagac ggtgatcttt tgaagacata 8640 tcaatataaa atccagaaaa acacgagggg aacgtcctca tacggggatc tagtgatgac 8700 cgaggacggt cagtattata gtgaggggga gatgatacag agtgtaacaa tacacccgta 8760 aaaatgcgaa ttttagcgat ttctcctata aaacctccta tagtagaaat acagtcacat 8820 tgatcataag gatcgttaga cgcaacccgg tctagtcgat taaagtttac atccgagaaa 8880 tgtgcacgcc ttagacagta ggaattgcgg tcggttcgtg accgaatcac ctcaatatac 8940 tcgaagtcta ataaaaggct acactaccat gaaatagtac gctaccgagt gttctagagt 9000 aaaaacaccc ccgcgcaggg tgttggagtt tacaccaaac cagggtctca ccttattatg 9060 gattaaaaga tattaattct aaagataaac gcgcaagcgt agaaactcga cgcgtacctg 9120 acgcgagaag tcattcggtc acgtgctacg aagcatgatc caagaagtca tgtggagaaa 9180 cgcatcacgc gcatcaacac atcacaaatc atggcatcat tgacccaaga aaactaagct 9240 tcgtgaaata ttccgaagcc tgccaaaaga gaaaaaacta tgaacgattt gccgtgcggt 9300 ccgaggcccg ggcatagact ctacactgtg tccgcacgta aaacaatacc ttttgttccg 9360 aaaaggtcta ctagacttag acaagtgata taaaatatac aaaaaacgtt tccccgaact 9420 tctacttaag ttctaacaaa acaagtgtgg taattctgtt taccgtacga acccgaactc 9480 gaagtaccga gcacgtacct gtggtgataa gtgtttaagg tgtttgtgtg tgtgtgttgt 9540 ttaggtgttc acgcatcttg tcagcgtgtc cgattacact attctcactc ccgcgtgaca 9600 cggtcttctc acagccacat gtccacctac catcactgac ctcttttcca caagactcag 9660 ttgttttccc gacggctggc ccttctgtcg atccgccagc tctcacaccg gatgaacaag 9720 cccatgttca tctaccatca ccgccagctg tacaccccgt tgaagcaccg tcaatcccac 9780 gaattctggc caagccgcca tcatccatga atccccttcc taatcctcga ttcgcctcgc 9840 agatgcggcc catcttcaca gaccaatcaa agcgtgaaca ggagttacga gaaaacaccc 9900 gtgttgtgga ggaacagcgg cttgctgcaa tcaagaaagt gaagcatacc gttactgtgt 9960 gcacgtggct ctcggtaggt ttccaaatca ttgtagtatc acctgttatt cacctgcttg 
10020 aaggatgacg atgggcccga ggaattcatc caccagggtg gcttcgtctg gcccaatttc 10080 tgtatatcaa cctcccttct tgctgccgcc ggctttcaaa caagcaatga ggatgcctgg 10140 taccatttgt acaaccgttc acgctgctcg tggcaacggt tcaaggccaa tcacgtcatt 10200 gccgtggata tgacgacgga ggtcctcatc aaagatgtca atgtgaagac ctgccatcgt 10260 ctcagtgact accttgccga cagcaggata aactctactc caaaaaactt tcaaacgaac 10320 cttcgtgctg agcgagtgag catcaaggtc aaactcgacc gccagttcat ggagttgaat 10380 ccatccaccc ctcactcttc cactccctct caatccaaac gacctcgtaa ggtctcttgg 10440 ccacgcactt ccatcgcact gaccaccaca caaccgccat cctccagtac tgccacagat 10500 agcgagttgt cgtctgacat gtattgtaca ccagtcaaac gtcgcaaact gcctgctggc 10560 aattcctcta ggacaactgc accagtcagc tcaagcacca ttctaggaga agacaaacgg 10620 tggccctctg actaccgggt tcgcgatgtc atccatgtcc tgcatagctg caaaccacct 10680 ccaccacgaa caaccattgc ccaacacttc tattcgttga ccggcatacc tttcaagagc 10740 tcgacgtact acgacgcatg gtccaggtgg gatgaagcca cacaagagca acgtgatcat 10800 gctgaagggt gtggcaagga tggtctttgg aaagactttg cggttcgtgt ttctcttgag 10860 aaaatgagac tcaaggtggc tcgccaacgg gtcctgcggc gccaacgcac tgctgaagct 10920 gaggaaagtg atggtagtga tgacagcgtc atcagtttct ctacgctgtc gtctgattaa 10980 attaacatat ataccatgtc ttgtcctcgt tatgtacgat gtcttgttct agttatgccc 11040 attctcgtta tgtatgtatt ttgaataaaa agctggtacc tcatcaatgt ggagaattgg 11100 gtggtttagc tccaccattg ggggttaagg ccttggcttg taagctttgc cttagttgcc 11160 ctattcctca tcatgaccat ttcatccagg aaaaaatcga acatatccac cgccatctct 11220 cgtcaaattg aatggaatcc gcccaaccag acatttgttt gctctgctat cgaggtgttg 11280 aaccaccagt tccttctgtt ctcaccaaga agctctggaa agagtgccgg attgcagttg 11340 ctttgacaga aagtgtccga cacagcatgt ttgttggtga aatggaagac gtcaacggca 11400 agcccaacat tgttgaaatc tgagtcatct ttgacgtgct ttgcaacact acaattgttg 11460 tcgaagatta tgtggttggg catgaacccg ccccggtaag ttcgctttat catctcctac 11520 ggcccaacgt gaatcaacaa tgtacttcat aaaaattaaa ttccgcttac cacaactgca 11580 gttattgact ctgcatagta aaacgtctct ctagctatga tcattccgca aggggcaaca 11640 agaatttgct cattgtgagt ccgcaaacgg ccaaattgcg 
cacggatttt cttttttgct 11700 gctgggagcg cgggagctgc cagtgaggtt tgtgtgatta ggttagtagc gctagtccca 11760 atagttggat tcaacgaggc ccgagcaaga gctgtttggg ttatcgggtc agcctcggaa 11820 gggatcactc ggtcgccaac aatctcatat gtctcgtcaa ggtcgtcacc ttgttcaatg 11880 atatgagcca ccggcaagtc aagagcctca ccattttttg ggtgagcaac ctttgctctc 11940 ttcagcttct ccttcagttg gaatgccgcc tgcccacgtt caacatggac accctcaacc 12000 tgctgatgaa tggggttatt acatgtctgt ttgtcaggtg ccgaaatttt atcgctacag 12060 cctatgatgg cgcatttttc gttgaaatgg acatgtgtag gacagaagcg atgatgcttg 12120 ttggcaagag gattagtgca atgcagtacg ccgcatgtcg ggcgacctag cgttaccccg 12180 tccatgacaa ccgcatgaac ggtgagccct aaaacgtgca tagtacataa aacatcattc 12240 tgataacaaa aaattgctta ccagtgtgcc catcatttac atattcacgg acacatttct 12300 gacaccggtg acaaagttct ggttgaccat agaggcgaat ccgatcattc ctttcgtgca 12360 tggctacctt gtaacgttca gattgtaagc cgccgtgagg tacctgaagg atggtgttgc 12420 gccgttggtg gtcctcaagc aacgctaaca aaatgaaggc atgtgcgaca tttttgccgt 12480 caagtttgcc cttgaatttc caagactcgg ggattttatc gttcgtcgtt gtgttgtgtg 12540 tgttcaagta aacggcggcg caattggcaa aggacaccca tgcgacgttg gtgcttgttc 12600 gccacatgtc tataagatga cgtttgatga attggtgatc tgcaatttgg aaaatgtcag 12660 gtggattgtc ataatagaat cttctctcat tcttcatgta aaaatcaagg tgataatcgg 12720 tcttgcagtc tggacaacta gatgagccta aatagagtga attaggaggc caattaatat 12780 acctttacaa gttaggaagc aggaatgaat cggaatgggt ccttcattaa gtgtgtagag 12840 gatcccctca cgttgggttg caatctggag ttttcttcca ttatctgtgt gtttacaatc 12900 agggttggta cagagctgtg agggaggcca cagattacaa gctgaaacta tgtagtagca 12960 atgtaattac ttcgttgaat atagaagtga ctggtgacaa actgaatcct ttatctgcgc 13020 cgtgattctt gaaggcagct gacacagcct tgtcatcaag gatactacaa gctgaccaca 13080 caatctcttt tacaactttc cagcactcct cgacctcctg ttcacttata atgcagcttg 13140 aagcaaggaa ttcaattatg ctaacaggaa ggaactgagg aacgatgtca ggaaggtgag 13200 acgattgttg ggcaagcagg atttcatttt tgagtcttga ggcatattga ataaagcatg 13260 atagcttgag gatgctgttt gcagcatcct cgcttatgat agcgttaagt aggccttgca 13320 tcggacgaca gagatcgcac 
tatgtgaacg tcagagggaa gctcgctccc tcataagtcg 13380 atgcttgtgt gtgacactga ggcttatgac gtcatatggc ctcgtcagtt agggtagcca 13440 gaagttgcac ttgagttgca cagctttttg acttgtgaaa tttacgcatt tccactcata 13500 taaggatttt tctgtttttc tatcagatta taaatggaat tagtatatat ataacactga 13560 gggaagaatt tatgtgaggc ggagaatgag gcagagtccg gcctaatcca gatgaggcgg 13620 agttcccatt tccgactttc tttgggaatg gaatgcaacg gaatccggtc ctttgtttgt 13680 ttgttttact tctgttacca aacaaaggca acattaccaa acacccccac gacagccacg 13740 ataaaccgac aacgccatgc caacccacga cgacgacgac tccacaacgg cgacgaccac 13800 ccagcaacga acggcaacga ccaccgccag cgacgtggca gccaaatgga cgacaacaac 13860 aacaccacca cgatccacga gcacccacca cgatccacga gcgcccacca cgatgaccac 13920 gcaaatgccg caacgccacg tcacaacgac gacaacgcga cgaacccacc accatccacg 13980 agcacccaca acgaccacgc acgccacaac gccgacgacg acgacgacgt tgacgtcgtt 14040 cgtcacccgc atctcacgac ccacgtaatg accacccact gcccacgaaa acaccataac 14100 acccacgaaa acaccacgaa cacccacgaa aacaccacgg cgcccacgaa aacaccacag 14160 cgcccacgaa aaccacaggc cgcccacgaa acgatgccca gcgcccacga aacgacgccg 14220 agcgcccaca aaacgatgcc cagcgcccac aaaacgacgc ctaacgccca caacgcaacg 14280 cccagcggac acgacaccaa tgaggacgac aacacgcgag gatgacgaca cgacgcgatg 14340 acgatgcgcg ccaacgagac cactccctcc ccctcccctg ctgctcacaa ctccttccct 14400 ctccctactg ctcacccatt acccaccatc cccttccccc tcccttactc cccctccttt 14460 cctgctcacc actccctccc tctccctacc cccttcattc ccaaccccct cccttcattc 14520 cctaccccct tcattcccta ccccctgtct tcattcccta ccccctgcct tcattccctc 14580 attccctacc tcccctctcc ctcactgccc tcccttcagt ttttcccccc ctctagttct 14640 tcattccccc aactccccag cacccctacc cctccactcc ttccccctca ctcgctccac 14700 tcctccccct ccctctatct agattcaatg tagataaatt ccatcatgta tttttaataa 14760 atgagtttaa atgaatcttc cctctttaag tataaatgtc ctaaggaatc ctaagattaa 14820 gttgtcagca gaaggcctga gttagattca gtcacataat atctccttac tttgtccaag 14880 gtctgatcag aactaatact gaaaactgat cagcttaaaa agtactgaga gaagtatgca 14940 taagtgtgta ttggtgtaaa tgtatgttgt atctgtgatt ttacttctat tccagtccca 15000 
ttcctgtgga ttctggtccc attcctgtgg attcctgtgg attcctgcag attccggtcc 15060 cattcctgtg gattcctgca ggaatgggag gggcactgta aagtactggg tccaaggccc 15120 gggcatagta tctaataata gagagaaaaa tcactaagca gaccgttcgt ctcggctgaa 15180 ccgatcgtga aaaccgttta cggtgattcc cgtgtatgca tcaatactga tgcgtcgcgt 15240 caccacaata gagaaaacac tacgaaaact taaaagaagt cagctcagtt gtgccacaca 15300 caaccgagcc acttcaaacc tttactaaaa atcagtatta attcgtccga cctttcggac 15360 aatgcacgga catgtaaata taaatagaga acatagaata gtataaaaag aggatgtgcg 15420 caggctgaaa cgcaaggaat tatcccccgc gttgaccagc tgtcgcactc ccagtagcat 15480 ctgaggtagt ggataagttt gtaatcaatt taaggacttt tcttcgcaga cataagtaac 15540 gtgtgcccgg aaccagcgtg ttaagtgggg atcacaaagc ttttgtaagg ttgtctcgtc 15600 aattagtaag agagctctct actattaccg aggaatccaa gtgagcgtgc ttgaagagtc 15660 tttaagacta gtttggaccg catcagaggc ataataaaag ctcaagggag ttataaaagt 15720 ctacaaagca caatttcctt gtcatttctt cgtatacagt tagaacgcaa tcccagaaca 15780 ccgagacagt cccaataaaa gctgtttccc ttcgaagact actaagtttc gaattctata 15840 ctgtgtccgc acgtaaaaca ataccttttg ttccaaaaag gtctactaga cttagacaag 15900 tgatataaaa tatacaaaaa acgtttcccc aaacttctac ttaagttcta acaaaacaag 15960 tgtggtaatt ctgtttactg tacgaacccg aactcgaagt accgagcacg tacctgtggt 16020 gataagtgtt taaggtgttt gtgagtatgt gtgttgttta ggtgttcacg catcttgtca 16080 gtgcgtctga ttacactatt ctcactcccg cgtgacacgg tcttctcatc tggagcccac 16140 cgcgagggtt ttcatttgtt ttgtttttca attatacttg aaaaagttgt ttgaaagcgg 16200 ttttcgaatt atttaaacat ctgtttgaat caaccaaaga ccatataaaa taaaaaaata 16260 gcctagtttc taataccaag aactagcatc aaaagtcgtt tacaacggca gccataaaaa 16320 tcagagaaac agccacatct aaaagagcta gagaaggtcc acaagtaccc ttacctccta 16380 catggaaacc caaacgcgct gctgagccag tcatacaacg ttacccgtta agaaaagaag 16440 tacctccaaa acctatacct agtcgacctg caaaagctgt tcgtcaatct gaaccaaaat 16500 cttcgaactc aaaagccaga agaaacccaa taatcatgcc tgacccatcg agttcaagca 16560 tagaagaacc ggcccctaca ggtcggtctt cacacagaac catgccatta cccggtaata 16620 agggagccct gttcttcgat aaagagaaac caattgagct tctacgtttc 
ttagaccaaa 16680 tggaagatct tttcgcagaa tatggtatta atagtgatgt ggagaaaaag aagaaattag 16740 gaaaatatac ggatcaacgg actgaattcg aatggaaagc atttaaaaca ttcgaagaca 16800 gctcctttga caagtttaaa aaggctctaa tagcagatta ccctgatgcg cgaaatgcag 16860 gaaaagggac cttaactgga ctaagagctg tatgtagaga aaacagtcgg ctagcagaaa 16920 agaacttgac agagttgaaa attttaactc gaagttttag ggcacaacaa aaattactca 16980 tggcgccgcc agtactggtg agcaaccgag agctagttga catgttctta ggctgtctga 17040 cagattcctt tgctgatcaa gttaaggcca gtttgaatat tgaacaaact agagaccgaa 17100 aacagaaagg caacgaggat gagaccactg ctagaagaac tgaagatcca tacgatatca 17160 tagacatcat agacatggca gaaaccattg ctggacgtct tactggagac tcccaagacc 17220 acccaagcaa tgcgagagct gtattgactg gtagatcagt tcaagcgcgc gaacgggaaa 17280 ttaaaacaga gcatgatgaa ttcgaagaat tgcgaagtat tactgcaacg ttccttgatc 17340 aaattaaagt agatcaaaaa cagaattcag aacttcgtaa tatggtacaa aacatgcaga 17400 tagaaatgaa gaaactctta gagactttag ttcaccaaca gccgaatatc agtaacccac 17460 aaagcaaagt ggcgagttat aaaatgactt cagatggctg ttggtactgt gatgaaccag 17520 gccattttac ttcgaattgt ccacatagag aagcacacgt agcccagaag aaaatcaagc 17580 ttcttgggaa ccgaatgtat tttacgcata ataatactgc agtaccaaga gggaatggcg 17640 aaaaatccgt tcgacaaatt gtcgaggagg ctagtaggca aaatttgact ctacaaaata 17700 atatgtttgc agagcctgga gaagtcttta gtcaagaagc tgttgtccca ggtataatca 17760 ggctaccgga taataataat ggaaatgaaa tttccgtttt tacaaaccag atgcgagata 17820 caagagacga tgtgattctc aacatgaaca accaagtgtt aaaacttacc gacatgcttg 17880 cgaaccttgt tacaaacccc aataaaacga gggacgtaga cgcgtcgcag tttgccgtca 17940 cctgtagatc tcaaaccgag actcagggac agtcgggaaa ctaaagagtc cgatgaaggg 18000 gccttccagt aagactaaac ccaatgctgg agaaccaaag gcaccggagg acgcaccaaa 18060 gaaaaggggt aggaaagttg tgattatcga agaatcagag gaagaagaaa ctcctcccga 18120 gaaaccagaa aatctcaatg gaccaaagaa actttgaaaa ataatcaaaa attacttgaa 18180 agaaatgatg atgtcgagga tcctaatcca aggcacaaac aattaccgta tctcggtatc 18240 ccagacttaa atagtaacgc gaaagaatcg gttgaagagt atgacccagt gcctttctat 18300 gagaagcgcc cgatagcata caagcattta 
gctccaatag aaaatgttaa aaatgaggaa 18360 caagtgttga acagtctgtt aaaagcacct gttactctaa gcgcggaagt acttatgagc 18420 atttctcctg gagtaaggca agaattgttt aaggccttag ctaaaaagaa ggtaccaatt 18480 cagactgctc acaatcgaaa agtaactata gtcgaggagg ttgataaaga cgcacctcca 18540 attaaaaagc aagaagataa ttttaatttc ggaaaaatca atataaatga cttggacatc 18600 aaagccactt ttatgtgcac aacagaggat gacggagtaa taccaaaggg ttctatagtc 18660 ctgacagatc ctgttgagca atatttgcaa ggcctaggct catcggaaac tccaaaggag 18720 atttacgtat ctaaagagtc ccatgcttta aaatcgatat atccagtaat taataagttt 18780 ggtcaagtag agagct 18796 // ID Gypsy-61_MLP-LTR repbase; DNA; FNG; 330 BP. XX AC AECX01002851; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-61_MLP_; KW Gypsy-61_MLP-I; Gypsy-61_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-330 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002851; Positions 1682 1353. XX SQ Sequence 330 BP; 76 A; 67 C; 83 G; 104 T; 0 other; tgtaagaggg ttacacatat tagagtctat ggggttacgc attatgggtt tagatggaga 60 gtaggatatg tttctgttcc acgccgactc cggtcagtcc tcactctttc gagagcctcg 120 tatgggactg cctggagtta ggtgagcgct tcttcaattt ctcttatatc ttgtacggat 180 ggtgttgctg actctttcga gagcctcgta tgggactgcc tggagttaga ttctttgtaa 240 tacatactga agttccaagt tagtgctttc gagagagcca ttttgaagag acaatccata 300 cccccagtga aggtcttcaa agaccttaca 330 // ID Gypsy-17_LBS-I repbase; DNA; FNG; 10745 BP. XX AC ABFE01001078; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_LBS_; KW Gypsy-17_LBS-LTR; Gypsy-17_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-10745 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001078; Positions 12062 1318. XX CC Positions [4861-5316] - Reverse transcriptase CC Positions [6880-7359] - Integrase core CC 'CCTAT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 670..8001 FT /product="Gypsy-17_LBS-I_1p" FT /translation="MSATETSMTQPFRPGGEGGYASSPMAKLMEEYQQERD FT EELSTLQRVSNLIAKKTLAGRDIIYLDSADANIALVNELLADISQVIGELQ FT SYLERTSGLIPERRKCFKVDPRDTFMNILTGATDLPQLHAAWKGFNRRITL FT AQENLSKYEAQYRQPYEDSNYVVPTSPISTDPEIYEVMSDLGELDSKMRYL FT YQSVPHLQEEIQSPRRLTDGSTWDKILPLPENLAELHYNESNTPFVGTRQI FT FSRDKGKGRSHENDIGIPTSPRMLNVGYGTPFRSSSQFFDKPDRQKFPLPV FT PTVLSKQSVLVGLGLPDTPAFQNLDDPSRGRNALQSRRSNPFEQRDQTNIN FT RGASTGTFNLPSNDPNHSNRGNGGNSNPSGGGGGINSGNNPYRSQDGDNES FT SSSSGTSNLVPRNHGGGGNGPPGGDPPGGGGGGPNRGNFGNQRNENQNQGL FT IPYGDTRATIRNDLKQDQLPIWDGNKDTAIEYFWKVQQLAALEGDIPQALG FT YWLWKSLKENSKIWWWFSTLSYSEQAKMRTHYLYYLKGIKDNYLGRTWQIS FT MNTKYESQSFRQEGFERESPPAFITRRIMHTRMLAASDDGGPTEVYLVMQK FT APISWGPILNIETVRSTSLLYSRATDHEYALIHAAKYESSNILTSDNLLST FT LRRLGISTDRNRPFERSAKLVSSDETKEDEVIHEAFLGQLSREECEHEISS FT SPRAIKEAYQVLKKRQRPPPKGGYPYCKNDHVTTKMGRLPPSPCKVCGSEN FT HWDKECPDWDVYQAKQGKSAYRIELNEIEDLENYYSSVYSVLVAERLAKEH FT KTIEDSEKGFDEAVLRAQEELHDVRERKTYKTTEPWRKQTMFAEEVEDEYW FT VEYRAKEKSETHLLYQIGEDDDTEQQKEVHTASKSERVPRPEDVNPERHSR FT DDFASSPNDKFKASTSEYVPSEEMKPAEIKDPPISSIDPPPSKERFIRIPK FT VRSRPEGTSAIGVSVLSVRGHVGSLNNSETDLRLDSCADITLISSEFYDSL FT VDKPKIKQGMRMQLWQLTDKDAKLRGFVSIPVYMVSEDGDVIETEAEAYVV FT PNMTVPILLGEDFQQSYETCVTRNVEEGTHISFRRHDYRIKAVPVERTKDF FT GRLRQSAYMVGQFVRRRLHRRNKNKRHRRKVKFGLEEKTVRAAEDYRLKPH FT 
ESKPIRVEGQLGEDREWLIQKNLLANANDSFFAVPNVLISAAHPWVPIANP FT TDHPRYIRKGEIIGSICDPGSYFDSPSSPEEFRQFQETADKIRTVIAVQMD FT NEGSKGQGGEDSEPEEYGPKTAAMPDPTVYSSSELEDLIDVGSLPDHLKER FT AWAMLRKRVKAFGFDGRLGNLPAKVHIRTVDGQVPISLPMYSSSPEKRAII FT NEQIDTWFEQGVIEPSKSPWGAPVVIAYRNGKPRFCVDYRKLNAATIPDEF FT PIPRQSEILASLSGAQVLSSLDALSGFTQLELAEEDIEKTAFRTHRGLFQF FT RRLPFGLRNGPSIFQRVMQGILAPYLWIFCLVYIDDIVIYSKSYEEHIDHL FT DKVLEAIEKAGLTLSPKKCHLFYGSILLLGHKVSRLGLSTHAEKVKAIIEL FT ERPRKLSQLQAFLGMIVYFSAFIPYYASICAPLFQLLRKGHKWIWGIEQEH FT AFQAAKSSLNSSPVLGHPMEGLPYRLYSDASDEALGCSLQQVQPIKVRDLK FT GTRTYERLRKAYENGLKPPRLVTSIGTAKKDSTFEDEWAPDFDDTTVHVER FT VIAYWSRLFKNAETRYSTTEREALAAKEGLVKFQPFIEGEDVLLVTDHSAL FT QWARTYENANRRLAAWGAIFSAYAPKLEIVHRAGRVHSNVDPLSRLPRAPP FT PQTSPPEVNEPVILAKETLDVRQEVERPAERMAAYCFTAWSIEDCLDTPRE FT AMINVRSRNKRSSEASQVDRPAPKEAIIQEQGNDRTKEEGEELDTLRTTTE FT YWGALNPPPTVHLSMDEKAKKEWREGYLHDPSFKTIATDPKYSHDRFTQGR FT RFFVDKDGMVFFSNEDYQPRLCVPTSQRNFVLMEAHENPLESAHAGPERLW FT QSLSPRFYWKRMKIDIEKFCKTCDVCQKSKFSNFNKFGLLIPNPIPSRPYQ FT SISMDFIVNLPWSDGFNAIYVVVDRLSKHASFIPTTTGLDSEGFADLFVKH FT IVCRYGLPESIITDRDPRWTADFWIGIAKFLQTRMSLSSSHHPQHDGQTEV FT VNRLLTTMLRAFVSSNKSDWSKWLHLLEFAYNSAIHSSTSAAPFHLLLGFH FT PRTPLDFTGTKRNDEVLNRALTPEAVTFLENMAMHRDSARRAIAKAQDQQT FT RSYNNGRKPIPHLKKGDRVLVNPHALEWIESKGEGKKLTQRWIGPFEVMQR FT INPNVYRLRMSSLYPGLPIFNYQHLKKYEESPIEFGPRTTLPETRTAVPAK FT EEYEVDRIIAERRTRKGLEYLVRWEGYSPLYDTWEPRKALTNAPEVVAKWQ FT RQGEGGDPPQ" FT CDS 8168..10744 FT /product="Gypsy-17_LBS-I_2p" FT /translation="MEFLYTDGDGIVPTSGREVMPAGEWIALNASAGKIPS FT TDVFQQALEDRAFQLEETCARPTAWTWTKEPAWFSQRYHWDAWNVIESFPD FT ENDNVPWYFNKGSAPIINNGGKFYLATDLREKAMDLLSRLWLCVESVVSNH FT PFIKGTDHPLKFNYLRLQSGWDSADSVDSITREARAKALEFLGFLNWWTSS FT VTRWDFSLHPWMVDFIRTFQLRSLKKRGVFLDLVRNWHHLNIAHLLAEDVP FT VYYFWMDDHTQYPSLTQLSPRILQAYHDACTALDRTEVDAEEIMGYDEEIT FT TIRRHDEFLQLHSPPDHISSPLFIDIPATATVYIVDFEGWARRPITDFVIV FT KDYAEKFHFFIDLEMTGGMVTIWRWKPRVTNPGSGQKAGTEGSGISVEAKR FT GNREIREIFKSLYAPSPKESFDKWGRLRLPGEVSSDSSAGSPNPNASPQRD FT EASVTMLPRPSWVPEAHPSIPVPRMTASEEIPSRWVQAMSAPSPLLSQSRA FT 
SSVRRSDTSERRSSRDSRRSASPPTSSRKQLLPASRSTFLAELRRLGDEFA FT VREATWSSKKPLNWNLEFLDVGFLLIPDPKAQARLRYWAACSGDASSMAAI FT LFKAICFSIPFAIGVKVEDFSRFKPEEVSDMDRLVGKPTCNVEPPFLYTAQ FT GALKAYYMSRVNDVIRRPHARILIGMGGPIAWLGRKWGGAELVAQFMSGPS FT PDVYVHRRGYIDSDDEHPLFLYTDEMSPQEVDVMFGCIRSDSDKDKSLYPS FT KDILDDGCFFWTGEWDSRMEDMFVDLTKEILQGTAKFRTPGMWNDYFRRRN FT RMSRGNRDRLNHIVPASLMHLHTRILDGFHVDWHKSRIARIEIPEEYRTRQ FT IGNRGA" XX SQ Sequence 10745 BP; 3003 A; 2729 C; 2576 G; 2437 T; 0 other; aaggtggaca ctgtgggaac caacagcaac ctggccggcg taacagcaga cttagacctc 60 ggagaggacg atagagagag atcgaataga tcgggagcat cacgatcaag gcgaggagct 120 cccccttctt ctctcagctt acctttccaa agatcatcta gatctgtctc gccaacgaaa 180 caatcttcta ccatcggcag tactaaatca ccgggattac ctgccagcac tcgatttcct 240 tccgttactc cttcgactct caaacccgct ttacgagccg actatactag tcgctttcag 300 acatcggata agcaaaacaa aggagcttct acctcaggaa gaaacgtcac cttcagacat 360 cacctattcc aaccagctag aaaccccatt aagactacct ttcaacctcg tctagccgac 420 tccgttatcg aatcatcgtt agcattcgat ttcaaatccc ctactcccgt tactcctact 480 acggacatca cggcgaccgg agaaattctg gcttcccctc ctactcaatt ccccgactct 540 ccacaaactc atctacagtc atccagtccc tctattccat ccaactctaa aaacaaccct 600 agtcccatat cacctccttc agtaacgaca ttatcttcga ggaatacgcc gtcacaggaa 660 gtagaagaca tgtcggcgac ggaaactagt atgactcaac cgttccgtcc aggcggtgaa 720 ggaggatatg caagctcacc aatggccaag cttatggaag aataccaaca agagagagac 780 gaggagcttt ctactttaca gcgcgtttct aacttaatcg caaagaaaac cctagccgga 840 agggatatca tttacttaga tagcgcggac gcgaatatag cgctcgttaa tgagctattg 900 gcagatatat cacaagtaat aggggagttg caatcatatt tggaacgtac ctccgggctc 960 attccggaaa gacgaaagtg tttcaaagtt gatccacggg atacgttcat gaacatctta 1020 accggggcaa cggacttgcc gcaattacac gcggcttgga aagggtttaa taggcgtata 1080 accttagcac aggagaacct ttccaagtat gaagctcagt accgacagcc ttacgaagat 1140 tcaaactacg tagtcccaac ttctcccatt tctaccgatc cggaaatcta cgaagtaatg 1200 tccgacttgg gggagttgga ttcgaaaatg aggtacctat atcaaagtgt accgcacctc 1260 caagaagaaa ttcagtcccc aagaagacta acggatgggt cgacttggga taaaattctg 
1320 cctctccccg agaatctagc cgaactgcac tataacgaat caaatacccc gtttgtagga 1380 acacgtcaaa tcttctccag agacaagggt aaaggtagaa gccacgaaaa cgatatcggt 1440 attcccacct cacctcgaat gttgaatgtg gggtatggca ctcctttccg ttcaagttcg 1500 caatttttcg acaaaccaga taggcaaaaa ttcccactac ctgtccctac cgtactttcc 1560 aagcaaagtg tcttagtagg cttaggcctt cccgacacgc ctgctttcca gaatttagac 1620 gacccttccc gaggaaggaa cgctctgcag tctagacgct caaatccttt tgaacaaagg 1680 gatcaaacaa acatcaatcg gggagcgtct accggtacat tcaacttacc ctccaatgat 1740 cctaaccact ctaatcgtgg aaacggagga aatagtaatc cttcgggggg cggaggagga 1800 attaacagtg gcaacaatcc atacagaagt caggacggcg acaatgaaag ttcgtcttcc 1860 tccggaactt cgaatctcgt tcctcgaaat catggagggg gcggcaacgg tcctcctggc 1920 ggagatcctc ctggaggagg aggaggaggt ccgaacagag gaaattttgg aaatcagaga 1980 aacgaaaatc aaaatcaagg cctgataccc tacggcgata ctagagcgac cattagaaac 2040 gatctaaaac aggatcaact tcctatatgg gatggtaaca aagatacggc aattgaatac 2100 ttttggaaag tacaacagtt agccgcattg gaaggtgaca tcccgcaagc actagggtat 2160 tggctatgga aaagcttaaa agagaactct aagatctggt ggtggttttc tactttgtcg 2220 tattcagaac aggcaaagat gaggacgcac tacttgtact atctcaaagg gatcaaggat 2280 aactatttgg gacgaacttg gcagattagt atgaatacga agtacgaaag ccaatcgttc 2340 cgccaggaag gatttgaacg cgagtcacca ccagcgttca ttactcgccg cattatgcac 2400 actaggatgt tagctgcttc ggacgacgga ggtcccacgg aagtttattt ggtgatgcag 2460 aaagcaccta tttcatgggg accaattctg aatatcgaaa cagttaggag tacatccttg 2520 ctttactcac gagccaccga ccatgagtac gcactcatac atgcagcgaa atatgagtca 2580 tcaaatatcc tcacatcgga caatttgctg tcaacgttaa ggcggttggg aattagcact 2640 gatcggaatc gccctttcga aagatctgct aaactcgtca gttctgacga aacaaaagaa 2700 gatgaggtga ttcacgaagc attcttaggt cagctaagta gagaagaatg cgagcatgaa 2760 atctcgtcta gtcctagggc aataaaggaa gcgtaccagg tgttgaaaaa gagacaacgc 2820 cctcctccta aaggagggta tccatattgc aaaaacgacc acgttactac taaaatgggt 2880 cgtttacccc cctccccttg caaggtatgc ggaagcgaaa accattggga caaagaatgt 2940 ccagattggg acgtctatca agcgaaacag gggaagtcag cttaccgcat cgaactcaac 3000 
gaaatagagg acctcgagaa ttattatagc agcgtgtatt cggttctcgt agcagaacgt 3060 ttagcaaagg aacataaaac gattgaagac tcggagaagg gttttgacga ggctgttcta 3120 cgagctcagg aggagctcca tgatgtaaga gaacgtaaga cctacaaaac aacggaaccc 3180 tggaggaagc aaacgatgtt cgcggaagag gttgaagatg aatattgggt ggagtataga 3240 gctaaagaaa aatcggaaac ccatctattg tatcaaatcg gggaggacga cgacacggaa 3300 caacaaaagg aagtacacac cgcttctaaa agcgaaagag ttcctagacc agaagacgtc 3360 aatcccgaaa gacattcaag ggacgatttt gcaagctctc ctaacgataa attcaaggct 3420 tcaacttccg aatatgttcc atcagaggag atgaaacctg cagaaataaa agacccacct 3480 atcagcagta tcgacccacc cccatcaaag gaaagattta tccgcattcc taaggtccgc 3540 tcgaggcccg aagggacatc cgccatcgga gtttccgtac tctccgtcag aggacatgta 3600 ggatctttaa ataacagtga gacagacctt cgattggact catgtgcgga cataactttg 3660 atatcaagcg aattttatga ctcgctagtt gataaaccga agataaagca gggtatgcgg 3720 atgcagctat ggcagttaac agataaagac gcgaagctaa gaggattcgt tagcatacct 3780 gtttacatgg tcagcgagga tggggacgtc atcgaaacgg aagcggaagc ttacgttgtt 3840 ccgaacatga cggttcctat acttctgggc gaagacttcc aacaatctta tgaaacttgc 3900 gtcaccagaa acgtcgaaga aggcactcac atttcattcc gacgacatga ttaccgaatt 3960 aaggcagttc ccgtggagcg gacgaaggac ttcggtcgat tgcgtcaaag tgcctacatg 4020 gttggacaat tcgtgcgtcg tcgccttcat cggcgtaaca aaaataagag acaccgtcgc 4080 aaggtaaaat ttggtctgga agagaaaacc gtgagagcag cagaagacta ccgtctgaag 4140 cctcacgaaa gtaaacctat ccgggtggaa ggtcaattag gagaggaccg ggaatggtta 4200 atacagaaaa acctactcgc gaatgcgaac gattcgttct tcgccgtccc taacgtattg 4260 atctcagcgg ctcatccttg ggtcccaata gctaatccaa cggatcatcc tcgctatatt 4320 cgcaaggggg agatcatagg atctatctgc gatccaggaa gttacttcga cagcccaagc 4380 tctccagaag agttcagaca attccaagaa acagcagata agatacgaac cgttatcgca 4440 gtacaaatgg acaatgaggg ttcaaaaggc caaggtggag aagactcgga acctgaagag 4500 tatgggccta aaacagcggc catgcctgac ccaactgttt actcttcctc agaacttgag 4560 gatttaatcg atgtcggcag tcttccagac cacttgaagg agagagcttg ggcaatgcta 4620 cgtaaacgcg tcaaggcatt cggttttgac gggaggctgg gaaatttacc agccaaggtg 4680 catattcgga 
cggtagatgg tcaagtgccg atatctttgc ctatgtatag ctcctctccg 4740 gagaaaagag ccatcatcaa cgagcagata gatacgtggt tcgagcaggg agtgatcgag 4800 ccttccaaaa gtccttgggg ggcgccagtt gtgatcgcct accggaacgg caagccgagg 4860 ttctgcgtgg actacaggaa attgaacgca gctacaattc ctgatgagtt tcctatacca 4920 agacagtccg aaattttagc ctcactgtcg ggagcacaag ttctatcatc cttggacgct 4980 ttatccggat tcacacagct cgagctagcg gaagaagaca tcgagaaaac ggcctttaga 5040 actcaccgag gcttgttcca gttcagacgg ctgccttttg gcttacgtaa tggtccttca 5100 atatttcaga gagtgatgca aggcatcctc gcaccgtacc tgtggatttt ctgcttggtg 5160 tatatagatg atatcgtcat ctattccaag tcgtatgaag aacacattga ccatctggac 5220 aaggtcctag aagcaataga gaaagcagga ctgactctct ccccgaagaa atgtcatcta 5280 ttttacggtt caatcttact tctgggtcac aaagtttcgc gtttagggct atcgactcac 5340 gcggaaaagg ttaaagccat tatcgaattg gagcgcccaa ggaagttgtc acaattacag 5400 gctttcttgg gaatgatagt ctacttttcc gcgttcatcc cctactatgc ctccatctgc 5460 gcccccttgt tccaactcct acgcaaaggg cataaatgga tatggggcat agaacaggaa 5520 cacgcattcc aggctgccaa atcctctttg aacagtagtc cagttctggg ccaccccatg 5580 gaagggcttc cgtatcgtct gtactccgac gcctcagatg aggctctggg ttgctctcta 5640 cagcaagttc agccgattaa agtacgagac ctgaaaggaa cacggactta cgaacgatta 5700 aggaaagcgt atgagaacgg attgaaacct cctcgtttgg tcacatctat agggacagcg 5760 aagaaggatt cgacgttcga agatgaatgg gctcctgatt ttgatgacac gacagttcac 5820 gtcgagcgag tcatagcata ctggtctcga ttattcaaga acgccgaaac acgttattca 5880 acaactgagc gcgaagcttt agctgcgaaa gaagggcttg tcaagtttca gcctttcatt 5940 gaaggtgaag acgtactatt ggtgacggat cattcagccc tacagtgggc gaggacctac 6000 gagaatgcaa atcgacgtct cgctgcgtgg ggagcaattt tttcagcata cgcgcctaaa 6060 ttggaaatcg tgcaccgagc gggtagagtg cattctaacg tagaccctct atcgaggctc 6120 ccgcgagctc caccgcctca aacttctcct cccgaagtca atgaaccagt gatactggct 6180 aaggagacgc tggacgtcag acaggaagtc gaacggccgg cagagagaat ggcagcatac 6240 tgcttcacag cctggtcaat agaagactgc ctggacacgc ccagagaggc catgattaac 6300 gtacgttcca gaaataaaag aagcagcgag gcatcgcaag tcgacagacc cgctcctaag 6360 gaggccatca tccaggaaca 
aggaaatgac cgtaccaagg aggaggggga ggaactcgat 6420 actctgcgga ctacgaccga atactgggga gctttgaatc ctcctcctac ggttcaccta 6480 tccatggatg aaaaagcaaa gaaagaatgg agagagggct acttacacga tccgtctttc 6540 aaaaccatag ctacagaccc gaagtattcc catgacagat tcacccaagg tcgccgattc 6600 ttcgtggaca aagatggcat ggttttcttc agcaacgaag attaccagcc ccgtctttgc 6660 gtgcctacga gtcaaaggaa cttcgtcttg atggaagcac atgaaaaccc tctggaatca 6720 gcgcacgcgg gacccgaacg attatggcaa tccctaagtc cccgctttta ttggaaaagg 6780 atgaaaatag atatcgaaaa gttctgtaag acctgcgacg tatgccagaa atcaaaattc 6840 tctaatttca acaaattcgg cttacttatt cccaatccca tcccctcccg accctaccag 6900 tcgatttcca tggactttat cgttaatctg ccatggtcag acggtttcaa cgccatttat 6960 gtcgtggttg atcgtttgtc gaaacatgcg tcgttcattc caaccacgac tgggctagat 7020 tccgaaggct tcgcggatct cttcgtcaag catatcgttt gtcgttacgg tttgccagag 7080 agcataataa cggatagaga ccccagatgg acagccgatt tctggatagg catcgcaaaa 7140 tttcttcaga cacggatgag cctatcgtca tcccatcacc ctcaacacga cgggcaaacg 7200 gaagtcgtaa atagattgct caccactatg ttgcgcgcat tcgtttcaag taataaatcc 7260 gactggtcga aatggttaca cctgttggaa ttcgcctaca acagcgcgat ccactcgtcc 7320 acgagtgcag caccttttca ccttttattg ggcttccacc ctcgaacgcc attggacttc 7380 accgggacca aacggaacga cgaagtcttg aatcgagctt tgacaccaga ggcagttacc 7440 ttcttggaga atatggctat gcacagggat agcgccaggc gagcgatcgc caaagcgcag 7500 gatcaacaaa cccgttctta taacaacggt cggaagccga taccccactt gaaaaaaggc 7560 gatcgcgtac tagtcaatcc gcacgcttta gaatggatcg agtccaaagg ggagggaaag 7620 aaactcactc aaaggtggat agggccgttt gaggtgatgc agagaatcaa tccgaacgtg 7680 taccgtcttc ggatgagtag tttgtatccg gggcttccta ttttcaatta ccaacatctc 7740 aaaaagtatg aagagtcacc tatcgagttc ggacctcgca ccactctccc tgaaacacgc 7800 accgccgtac cggccaaaga agagtatgaa gtggatcgga ttatagcgga aaggcggaca 7860 cgtaagggcc tagagtacct cgttcgctgg gagggttaca gccctctcta cgacacctgg 7920 gagccgagga aagcgctcac taacgcccca gaagttgttg ctaaatggca gcgtcagggt 7980 gaagggggag acccccccca atgatttgta ataacactac ctaacttatc tcgtctagat 8040 cagtgatgga tctcactgat agacgaccgt 
ataccctttc cttttgtttt ttgttctttc 8100 cttctttcct cttacatcgt ttctcttttc tcatttggat tcactgcgca tactcatcca 8160 cgacatcatg gaattcttgt ataccgacgg agatgggatt gtcccaacat caggtcgaga 8220 agtaatgccg gcgggggaat ggattgcgtt gaacgcatca gcaggtaaga ttccctctac 8280 ggatgtcttt cagcaagctt tggaggatcg agcatttcag ctggaggaga cctgcgcacg 8340 gccaacggcg tggacttgga cgaaggagcc cgcctggttc agtcaacgct accactggga 8400 cgcgtggaac gtaatcgaat ccttccccga cgaaaacgac aacgtccctt ggtactttaa 8460 caagggaagt gcgcccatca tcaacaacgg aggaaagttc tacttagcaa ccgatctacg 8520 ggagaaagca atggacttgc tttcgcgtct atggttgtgc gtcgaatcag tcgtctccaa 8580 ccaccctttc atcaagggca ccgaccatcc tttgaagttc aactatctcc gcttacagtc 8640 gggctgggac tcagcagaca gcgtcgactc cattaccagg gaagctcgag cgaaggcctt 8700 ggagttcttg ggatttctca attggtggac ctcgtcagtc actcgttggg acttttctct 8760 acatccttgg atggtagatt tcatcaggac tttccagctt cgcagcctga agaaacgagg 8820 cgtattcttg gatctagtca gaaattggca ccatctcaac atcgcacacc tgttagcgga 8880 ggacgtcccg gtttattact tctggatgga cgaccatact caatacccca gcctcaccca 8940 gctgtccccc cgcatccttc aagcctatca tgacgcctgt acagcgctgg acagaaccga 9000 ggtcgacgcg gaagagataa tgggctatga cgaggaaatc acaacaatcc gacgtcacga 9060 cgaattctta cagctccact ctccacccga tcacatctcc tctccactat tcatcgacat 9120 ccctgccacc gccacagttt acatagtaga ctttgaggga tgggctcgac gccctatcac 9180 agacttcgtg atcgtcaaag actacgccga gaagttccac ttcttcatcg acttggagat 9240 gactggaggt atggtgacga tttggcgatg gaaaccacga gtaaccaacc caggttctgg 9300 tcagaaagcg gggacggagg gctcaggcat cagcgtagaa gcaaaacgcg gcaaccgcga 9360 gattcgagaa atcttcaaaa gcttgtacgc cccttccccc aaggaaagct ttgacaagtg 9420 gggacgtctg cgtcttccag gggaagtcag cagcgactcg tctgcaggtt cccccaaccc 9480 caatgcttcg ccacagcgcg acgaggcttc agtcacgatg cttccacgtc ctagctgggt 9540 tcccgaagct catcctagca ttccagtgcc acgcatgaca gcttccgagg aaattccatc 9600 cagatgggta caggcgatgt ctgcgccttc tcctcttttg tcccagtcac gagcatcatc 9660 ggtccgccgg tctgacacct ccgagcgacg cagctcccga gactctcgac ggtcagcctc 9720 tccccccacc tcctcacgca agcagctact accagcatcg 
cggagtactt tcctggcgga 9780 attacgcagg ttgggagatg aatttgcagt tcgcgaagca acgtggtcca gcaagaagcc 9840 cttgaactgg aacttggagt tcctcgacgt agggttccta ctcattcctg accccaaggc 9900 tcaagctcga cttcgctatt gggcagcttg ctccggcgac gcatcatcta tggccgccat 9960 ccttttcaaa gccatctgct tcagcattcc gttcgcgatc ggggtcaaag tcgaggattt 10020 cagtcgtttt aaaccagaag aggtatcgga tatggaccga ctcgttggga agccaacctg 10080 taacgtggaa ccgccttttc tctacacagc tcaaggggcc cttaaggctt actacatgag 10140 ccgtgtcaac gacgtcattc gtcgccctca tgccaggatt ctcataggca tggggggccc 10200 tatcgcttgg ctaggtcgca aatggggcgg agcagaactg gtcgcgcagt tcatgtcagg 10260 gccttcccca gacgtctacg ttcatcgtcg cggatacatc gattccgacg acgaacaccc 10320 gttgttttta tacaccgacg agatgtctcc gcaagaagtc gacgtcatgt ttggctgtat 10380 ccgtagcgac agtgacaaag ataaatccct ctacccatcc aaggacatat tggacgacgg 10440 ctgcttcttc tggacgggcg agtgggatag tcgcatggag gacatgttcg tggacttgac 10500 caaggaaatc ttacagggta cagcgaaatt ccgcacgcca ggaatgtgga acgattattt 10560 tcggcgtcgg aatcgcatga gcagaggaaa tcgtgatcgt ttgaaccaca tcgtaccagc 10620 ttccttaatg caccttcata cccggatttt agacggtttc cacgtcgatt ggcacaagtc 10680 tcgcattgct cgcatcgaaa ttccggagga gtacagaacc cgtcaaatcg gaaatagggg 10740 ggctt 10745 // ID Gypsy-20_LBS-I repbase; DNA; FNG; 12713 BP. XX AC ABFE01001469; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_LBS_; KW Gypsy-20_LBS-LTR; Gypsy-20_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-12713 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001469; Positions 15711 2999. 
XX CC Positions [4858-5313] - Reverse transcriptase CC Positions [6853-7332] - Integrase core CC 'ATGTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 8135..9289 FT /product="Gypsy-20_LBS-I_4p" FT /translation="MDCLYTDNDGKVPTSGRERMIAGEWIALNASAGKIPS FT TEVFEQALEDRAFLLPELAARPTSWSWTKQPSWYSDKYHWDAWNVVDCFAD FT ENDTIPWYFSEGNSPWQDAGGKFHFDPSMRVKAEESLSKLWLCVEAITTNP FT PFLNGTPHPSKFNYLLLSAAWDSARGAASLMNDAKDRALELIGFINWWVSS FT VSGWQNSLQHWMVDYIAGFKLHDLKKRGVFLDLLKHSKSLNIGHLLAENVP FT VYYFWQEDMDDYPSFTRLSPTILQAYHDTCSSLDKTEVYGEDMQGFQDDIA FT TIKHYDEFFQLRRVPDHVASPISLDIPPNAVAYICDFEGWSAWLIKDPDLI FT RDYADRYHLSIEMDDRDTYVTIEVFTFPGIIHMESMWNPWNP" FT CDS 11456..12712 FT /product="Gypsy-20_LBS-I_5p" FT /translation="MPTEHGDHQALLPRPHWVSASQPQLSIPRPTSPITTV FT SSWAQAMAAPSSHSSSRASSLAQHSRSSGDYGSRRRSASPQRRHLHHANAM FT PSLRAVFVDDLRDLANQYPMVANPWVSKKPLTWNADFLDVGYLLIPNDRAQ FT ARLRYWATSSQDATTMLDLLLKAICHGLSFSIGVKVEDFGRFKPEDISDTD FT RLVGKPTSAIEAPFIYTAQGALRAYYLSRVNDIIRRLHARILIGMGGPEAW FT LGRKWGSSELVAQFMGGPSPDVYLHRRGYIDSDDEHPMFLYTDEMTPQELD FT VLFGCIRNDSDRDRSLYPSRDILDEGCFFWTGEWDTRMDSMFNDITKDILQ FT GSAKFRTPGMWNEYFRRLNRSHRGSKERLNQVVPSMLARLHAKIIDGFPVD FT WNKRRLSGIELPEEYRPRQIGNRGA" FT CDS join(661..1074,1078..7974) FT /product="Gypsy-20_LBS-I_1p" FT /translation="MSSTIETTSSQTFRPETGRGYEGSPMARLIRQFKESS FT DDDISTYQRAINLSAKSCLLSNDVVELEGSQRSFALHPTLLRDVTQLVGEL FT QVFLERTSALIPERSSYYKVDPRDTFLSILRESSDALQVHAAWMGLSGRLF FT AQENLWKYEAQYRSPLAGENVIMPTSPLSTDPGIYEAMEDLDDLDLRLRYL FT FDNVPHHQDQVKSPRKLRNGVPWNEVLALPGNLQENTYNRLPTIVEHSRDD FT QESNSQVKKGKRRITDEFVSPPTSPRGLNVGYGTPFKSSSQFFVRPGGIPL FT PPPETSAQQNILVGLGLPHTPAFENIPSSKDAPQATWVNTQVRSRASNPFE FT GRPLPPHMVDVQQTRDGLVRTTSIATNQRQRPVESRSEGGSSRHRSQSRNH FT ENRNQGNPPNGDPDGSDDDDDDGDEYPRRNNHDNREDRPSSYPTNHGGGGG FT NPGGGGGGNGPSSNGLYPNTQPQGNVPYGNLVATIRNELKQDQLPVWDGNK FT ETAIEYFWKIQQLAALEGDIPVALRYWLWKSLKENSRIWMWFTTLPFTEQS FT KMRTHYLHYLKGIKDNYLGRTWQISMNRKYESQSFRQEGFERESPPAFIVR FT RIMHTRMLVASDDGGPTEVYLVMQKAPISWGPILNLETIRSTSLLYSRATD FT 
HELALVHASKYESSNVLTADNLIQTLRKLGISTDRNRPYDRSAKLVLGKDD FT VIREAFLGQLSREECTQEINSDPEVMKEAFQILKKRQRPPPKGGYPYSKND FT HVTTKMGHLPPSPCKVCGSNNHWDKECPDWSFYEAKQTKAAYRIETDKDDD FT LENYYCSVYAVLVTERLTLEDKQGNESSDFHEAVLKEGNDHLPRERKSGDN FT SSWKPQKVFIEEEEDESWSEYRAKEKSTTHLLYQLGDDNDDDEILQKEAFS FT SHKKAPSSSRPEEDKDLPKEVKDPPGAAQDPPTETKDPPFPPLFKDKPFRI FT PKARSRPEGMSAVGVSVLSTRGFVGGLNNVETDLRLDSCADITLISYEFYE FT KLVSKPSIKQGMRMRLWQLTDKDAQLKGFVRIPIYMTTVEGDILETEAEAY FT VVPGMTVPILLGEDYQQSYELNVTRNVENGTHISFGRHDHRIQAIPVERTK FT DFSRLRQSAYMVGQYVRRQFHRRNKAKRHRRKMKFGIQERVVRASEDYRLK FT PHESKPIRVEGQLGEDKDWLVQKNLLSNANDTFFAVPNVLISAAHPWIPVA FT NPTDHPRYIRKGEIIGTLVDPATFFDSPKSLDELEKFQKAAEVIRTVIAVQ FT SQQEEPSRDESEEEQEQYGPKMAAMPDPDLYPSDQLEDLIDVGSLPDHLKE FT KAWAMLRKRIKAFGFDGRLGNLPAKVHIRTVDGQVPISVPMYHASPQKREI FT IDTQLNTWFEQGVIEPSRSPWSAPVVIAYRNGKPRFCVDYRKLNAATIPDE FT FPIPRQSDILASLSGAQVLSSLDALSGFTQLELAEEDIEKTAFRTHRGLFQ FT FKRLPFGLRNGPSIFQRVMQGILAPYLWIFCLVYIDDIVIYSKSYEEHLSH FT LDQVLAAIEKAGITLSPKKCHLFYGSILLLGHKVSRLGLSTHEEKVKAIME FT LERPKKLSQVQAFLGMVVYFSAFIPYYASICAPLFQLLRKGCKWNWGVEQE FT HAFQSAKSALESSPVLGHPMEGLPYRLYTDASDEALGCSLQQVQPIRVGDL FT KGTRTYVRLKKAFESGLPPPRLVPGLSVKCKDHEFDDVWAAEFDETVVHVE FT RVIAYWSRLFKNAETRYSTTEREALAAKEGLVKFQPFVEGESILLITDHSA FT LQWARTYENANRRLAAWGAIFSAYAPKLEIIHRAGRVHSNVDPLSRLPRAP FT PTHISPPETKEPSIRAKEDLGESQEVAPAEKMAAFSFAAWSIEDCIEEPKE FT VLINVRSRNKSRRETDELESPKMVEGDSAGEGSDELNTLDTTTEYWGAVNP FT PPTINIAMSEDAKRQWKLSYQEDPMFKNIASNSRHLYDKLEPGTRFFVDQD FT GMIFFNNEDYQPRLCVPAGQRNFILKEAHESPFESAHAGPERLWHSLSSRF FT YWKRMKLDIIRFCKSCDVCQKTKSPNFNKFGMLIPNPIPSRPYQSISMDFI FT VNLPWSEGFNAIYVVVDHLTKHASFIPTTTGLDAEGFALLFVKGIACRFGL FT PESIITDRDPRWSSDFWMGVARALQTKMSLSSSHHPQHDGQTEVVNKLLTV FT MLCAFVEGKRDQWAIWLHILEFAYNSAIHSSTGTTPFHLLLGFHPRTPLDF FT IGTKRSDETVSRSLNPDAISFLENLAMHRDSARAAIAKAQDQQARSFNKGR FT RPIPDLMKGDRVLVNPHALEWVESKGEGKKLAQRWIGPFEVLQRINPNVYR FT LRMSNLYPGLPVFNYQHLKKYEDSPAEYGPRHALPETRTTRPAKEEYEVER FT IIAERLTKKGLEYLVRWAGYSPLYDTWEPKRALTNVPEVVSKWKRLGDEER FT QEL" XX SQ Sequence 12713 BP; 3479 A; 3340 C; 2962 G; 2932 T; 0 other; 
aaggtggaca ctgtgggaat caacgacagc ctggtcggct taggcacaac aacggaagag 60 gagcttagaa cagaagcgag tgggggacga tctgaaggat cgaagcaacg gaaaccttgt 120 agaggcgccc ctccttcttc gttaagtcaa ccaactcctt ccgatcgttc tcgatctttg 180 tcaccttcta aatctaacag atctacttca actactgccc tacctagaac ttccacgtca 240 acgaaagcga aagctttctc cattccggca actcctcagc ctcccaagaa cgtaaagaag 300 cctatttcaa actatttcgg agcgcgcaac gatcaactgg acgtagccgc ttcttttagg 360 aggagacaac cgttagaggg ccgagcggct ttcaaacatc atcgctttct accacattcg 420 gcgtcgacta ctactactcc ctcttcctcc agtttccaac ctcagtttcc atccccggaa 480 tcacattaca cccagagctc ataccgagct ttcgaattta aaacacccgc tcccatttcg 540 ccaacaacgg atatcagcgc acaaggggaa atacttcctg cttcttctat atcccatcag 600 ctcagtatca cctctccacc aatcatagaa tcagtcctca ctgacaacga tactatagac 660 atgagctcta ccatagaaac tacgtcgtca caaacttttc gccctgagac aggcagaggc 720 tacgaggggt cccctatggc gcgactcata cgtcagttca aggagtcgag tgacgacgac 780 atatctactt atcagcgagc aataaacctt tccgcgaagt cttgtctact atctaatgac 840 gtagtggaac ttgaagggtc tcaaagaagc ttcgcattac atcccactct cctaagagac 900 gtcactcagt tagtaggaga gttacaagtt ttcctagagc gaacgtccgc tttaatccct 960 gaaagaagct cttactacaa agtagatcct agggatacat ttttatctat cttgagagaa 1020 tcttcagatg ctctacaggt gcatgcggct tggatggggt tgtccgggcg actatgattc 1080 gcacaagaga atctatggaa atacgaagca cagtatcgca gtcctttggc tggcgaaaat 1140 gtcatcatgc ccacttctcc tctttccacg gatccaggta tctacgaagc catggaagac 1200 ttggacgatt tggatcttcg attaaggtac ttattcgata acgtaccaca tcatcaagac 1260 caagtaaaat ctccccgtaa attgagaaat ggagtgccgt ggaatgaagt gctagcactc 1320 ccgggcaatc tgcaagaaaa cacttataac agactgccaa caatcgtcga acactcgcgc 1380 gacgatcaag agagtaattc gcaggtgaag aaaggaaaga ggaggataac cgacgaattt 1440 gtcagtccac caacttctcc tcgcggtttg aatgtcggat atggtactcc attcaaatcc 1500 agttcgcagt tcttcgttag accgggtgga atacctctac cgccaccaga gacatcggca 1560 caacaaaata tattagtggg cttgggcctt ccacacacgc cagctttcga aaacattcca 1620 agttcgaagg acgcacctca agctacgtgg gtgaatactc aagtacgatc tagagcttca 1680 aaccctttcg aaggtcgacc 
tctaccccct catatggtgg atgtccaaca gactagggat 1740 ggactagttc gaacgacttc cattgcgaca aaccaacgtc aacgccccgt agaatcgaga 1800 tcggagggag gctcttcgcg ccatcgatca cagtcaagaa atcatgagaa taggaatcaa 1860 ggaaatcctc ccaatggcga tccagacgga tcagatgacg acgatgacga cggggacgaa 1920 taccctcgca gaaataatca tgataacaga gaagacagac catcttccta ccccactaat 1980 cacggaggag gaggaggtaa tcccggaggg ggaggcggag gtaacggtcc aagtagtaat 2040 gggctttatc caaatacaca acctcaaggg aacgtaccct atggtaattt ggtagctaca 2100 atcagaaatg aactaaagca agatcagttg ccggtttggg acggtaataa ggagacggcg 2160 atcgagtatt tttggaaaat tcagcagcta gccgctctgg aaggcgatat accggtcgct 2220 ctcaggtact ggctatggaa gagcttgaaa gagaactctc gaatttggat gtggttcact 2280 accctaccat ttacggaaca atccaagatg cggactcatt atttgcatta ccttaagggc 2340 atcaaggata actatctggg gcgtacatgg caaatcagta tgaatagaaa atacgagagt 2400 caatccttcc gccaagaagg attcgagaga gaatcccccc ctgccttcat tgtccgacgc 2460 atcatgcata cgcgcatgct agtcgcttcc gacgatgggg ggcctacgga ggtctatctg 2520 gtaatgcaga aggctccgat atcatggggg cccattttga atttagagac aatccgttcg 2580 acttctttgt tatattccag agccacggat catgagttgg cattagttca tgcatctaag 2640 tatgaatcgt cgaatgtgct caccgcagac aatctgatcc agactctccg taaattgggc 2700 atttctacag accgaaatcg tccttatgat cgttcagcaa aattggtctt gggcaaggat 2760 gacgtcattc gtgaggcttt cctaggacag ttgagtaggg aagaatgcac gcaggagatt 2820 aattcagacc ccgaagtgat gaaggaagct ttccaaatac ttaagaagag acagcgtcct 2880 cctccgaaag gaggttaccc atacagcaag aatgaccatg tcaccaccaa aatgggacac 2940 ttacctcctt ccccctgtaa agtatgcggt agcaataacc attgggacaa ggaatgtcct 3000 gattggtcat tttatgaagc gaaacagact aaagctgcct atcggatcga gacagacaaa 3060 gacgatgatc tggagaatta ttactgcagc gtttatgccg tcttagtcac ggaacggcta 3120 accttggagg ataaacaggg taacgagtct tcggattttc atgaggcagt tctaaaggag 3180 ggaaacgacc acctcccgag agaacgtaag tccggcgaca attcttcctg gaagccgcaa 3240 aaggtcttca ttgaggaaga agaagatgaa tcctggtcag agtatcgagc gaaggaaaag 3300 tcaaccactc acctcttata ccaattagga gacgacaacg acgacgacga aatcctacag 3360 aaggaagcct tttcgtctca caagaaggcc 
ccttcctcct cgagacctga agaagacaag 3420 gatctaccaa aagaggtaaa ggatcctcca ggggccgctc aggaccctcc aacggaaacc 3480 aaggatcctc cgttccctcc tctgtttaaa gataaaccat tcagaattcc caaggcgaga 3540 tcccgtccag aaggaatgtc tgccgtcgga gtttcggttt tgtcaaccag aggttttgtg 3600 ggagggctaa ataatgtcga aacggactta cgcttggact cttgcgccga tattacgttg 3660 atatcctatg agttttatga gaaacttgtt tcgaaaccct ctattaagca aggtatgagg 3720 atgcgactat ggcagctgac agacaaggac gcacagctta aagggttcgt tcgcattccg 3780 atctatatga ctacggtgga aggcgacatc ttggaaacgg aagctgaagc ctacgttgtt 3840 ccaggcatga ctgtccctat cttgttgggc gaggactatc agcaatcata cgaattgaat 3900 gtcactcgca acgtcgaaaa tgggactcac atttcttttg gtcgacatga ccatcggatc 3960 caagctattc cggtagagag gactaaggac ttcagtcgat tgcgccagag tgcctacatg 4020 gttggacagt acgttcgacg ccaattccac cggaggaaca aggccaaaag gcaccgtcgt 4080 aagatgaaat tcggtataca ggaaagagta gtgagggcct cggaagatta tcgtctcaag 4140 cctcacgaaa gtaagcctat cagagttgag ggtcaactag gcgaagacaa ggattggtta 4200 gtgcagaaaa acctcctatc aaatgccaac gatacctttt ttgcggtgcc caacgtattg 4260 atatcagcgg ctcatccttg gataccggta gcgaatccga cggatcatcc tagatacatt 4320 agaaagggag aaattattgg taccctagtt gacccagcca ccttctttga ttctcccaaa 4380 tcgttggatg aacttgaaaa gttccagaag gcagcagagg ttatcagaac cgtcatcgca 4440 gtccagtctc aacaagagga gccctcacgc gatgaatcgg aggaggaaca agagcaatac 4500 ggccccaaga tggcggctat gcccgatccc gatttatatc cgtcagatca gttagaggac 4560 ttgatagacg taggcagcct accggatcat ttgaaagaaa aagcctgggc catgcttcgg 4620 aagcgtatca aggctttcgg atttgatgga cggttaggca acttacctgc aaaagtccat 4680 attcgaactg tggacggaca agtcccaatc tcagtcccaa tgtatcacgc ttctcctcaa 4740 aaacgcgaaa taatcgatac acaattgaat acctggttcg aacaaggcgt catcgagcca 4800 tcgagaagtc cgtggagcgc accagtggtg atcgcctacc gaaatggcaa acctagattc 4860 tgcgtggatt atcgaaagct gaatgcggcg acgatacctg atgaatttcc tatacctaga 4920 caatctgata ttcttgcttc gttatctgga gcccaagtct tgtcgtcatt ggatgcactt 4980 tctggattca cccaactaga gttagcggaa gaggacatcg agaaaaccgc attcagaact 5040 catcgtggac tttttcaatt caaacgctta ccttttgggc 
tgcgaaatgg tccttctatc 5100 ttccaacgag tgatgcaggg catcctagcc ccatatctct ggattttttg cctggtttat 5160 attgacgaca tcgtcattta ttctaagtcg tacgaggaac atctctcgca tttggatcag 5220 gttctggccg ctatagagaa agcaggcatc accttgtctc cgaagaaatg tcatctattc 5280 tatggatcca ttctactttt aggacacaaa gtatcgcgcc taggactttc aactcacgaa 5340 gaaaaagtca aagcgatcat ggagttagaa agaccgaaaa aactctcaca agtgcaggcc 5400 ttcctgggga tggtggtcta cttttctgcg tttattccat attatgcttc catttgcgcc 5460 ccgctgtttc aactattaag gaagggttgt aagtggaatt ggggagttga acaggaacat 5520 gccttccaat cagcgaagtc cgcactagag tctagtccag ttttaggaca ccccatggaa 5580 ggcctccctt acaggctcta taccgacgcc tcggatgaag cattgggatg ctcactacaa 5640 caggtgcaac ctataagagt aggagacttg aaaggaacac gtacttatgt tcgcctcaag 5700 aaagcgtttg aaagtggtct accgccgcct cgcttagtcc cagggttgag tgtcaaatgc 5760 aaagaccacg aattcgatga cgtctgggct gctgaattcg atgaaactgt tgtgcatgta 5820 gaaagggtca ttgcttactg gtctcgactc tttaagaacg cagagacgag atattccacg 5880 acggaacgcg aagcacttgc tgctaaggaa gggctggtca aattccaacc atttgttgag 5940 ggcgaaagca ttctcttaat cacagatcat tcagcgcttc agtgggcccg cacttatgag 6000 aacgctaaca ggcgattggc ggcatgggga gctatattct cagcgtacgc accgaagctg 6060 gagattatac accgtgcagg aagagtgcac tctaacgtgg acccgttatc tcgattgcca 6120 cgagctcctc ccacgcatat ttctcctccg gagacgaaag aaccgtcgat tcgagccaag 6180 gaggatctgg gagaaagtca agaggtcgct ccagcggaaa agatggcggc attctcattt 6240 gcagcatggt ctatcgagga ctgcatagag gagcctaaag aggtcttgat taatgtcaga 6300 tcgaggaata aaagtcgccg ggagacggat gagttggaat cccccaaaat ggtcgaaggc 6360 gatagtgcag gagaaggctc agacgagttg aatacccttg atacgacgac agagtattgg 6420 ggggctgtga accctccccc tacaatcaat atagccatga gcgaggatgc caagcgtcaa 6480 tggaagttgt cataccaaga agatcctatg tttaagaaca tcgcgtcgaa tagtcgtcac 6540 ctatacgata aactcgagcc agggactaga ttcttcgtcg atcaggacgg aatgatcttc 6600 ttcaataacg aggactacca acctcgacta tgcgtgccag cagggcaaag gaatttcatc 6660 ctcaaagaag cccacgaaag tcccttcgaa tcagcgcacg cgggaccgga gcgtttatgg 6720 cattccttga gttctagatt ctattggaaa agaatgaagt tagacatcat 
tagattctgc 6780 aaatcgtgcg acgtatgcca gaaaacgaaa tcacccaatt tcaacaagtt cgggatgcta 6840 ataccgaacc ccataccatc tcgtccctat caatccatct cgatggactt catcgttaac 6900 ttaccatggt ccgaaggttt caacgcgatc tatgttgttg tcgaccatct aacgaagcac 6960 gcgtcattta tacccacgac tacaggcttg gacgcggaag gtttcgcctt gttatttgta 7020 aaagggatag cgtgtcgatt tggcctgcca gaaagtatca ttaccgacag ggaccctcgg 7080 tggtcaagcg acttctggat gggcgtagct agagccttgc aaacgaaaat gagtctgtcg 7140 tcctcacacc atccgcaaca cgacggacag acggaggtag tgaataagct gctcacagtc 7200 atgttatgcg cttttgtgga aggaaaaaga gatcagtggg cgatctggtt acacatactg 7260 gaatttgcgt acaatagcgc tattcactct tcgactggaa ctaccccgtt ccatcttctg 7320 ctaggcttcc atcccagaac ccctttggat ttcataggaa ccaaacgatc agacgagacc 7380 gtttcgcgct cgttgaatcc tgacgcgata tcattcttag agaacctagc tatgcatagg 7440 gacagcgcca gggcggctat tgctaaagcc caagatcagc aggctagatc tttcaacaaa 7500 ggccgaaggc caattcccga ccttatgaaa ggggaccggg tccttgtcaa tcctcatgct 7560 ctagagtggg tcgaatccaa gggtgaagga aagaagctcg ctcagcgatg gatcggtcca 7620 tttgaagtgt tgcaacgtat caaccccaac gtctatcgat tgcgaatgag taatctgtat 7680 ccaggacttc cagtattcaa ctatcaacac ttgaaaaaat atgaggactc acctgcagaa 7740 tatggaccac gccacgccct acctgagacg aggacgacta gaccggcaaa agaggaatac 7800 gaagttgaga gaatcatagc cgaaagactc accaagaagg gcctggaata tttggtccgc 7860 tgggcgggat acagcccgct ctacgacact tgggagccga aacgagcctt gacgaacgtg 7920 ccagaggtag tctcgaaatg gaagagactg ggcgacgagg agcggcagga gttgtagatc 7980 tagaaggagt taacttattt cgccggttgg atcgagatcc gatcgatgga cgtcatcaca 8040 tgtttctttg ttcccctttc tcttcttttc tttttacgtt cttttcttat cttttaacat 8100 ctattacgat tccatcgctg ggacaacata cgccatggac tgcctttaca ccgacaacga 8160 tggcaaagta ccgacgtccg ggagggaaag gatgatcgca ggagagtgga ttgccctaaa 8220 cgcttccgcg ggtaaaatcc cctcaactga ggtattcgag caagctctgg aagaccgagc 8280 attcctttta ccggaattag ctgcacgacc tacaagctgg tcatggacaa aacagccctc 8340 gtggtactca gacaagtacc actgggacgc ttggaatgta gtggattgct tcgccgacga 8400 aaacgataca atcccgtggt acttcagcga agggaactct ccttggcagg acgcaggagg 
8460 gaaatttcat tttgacccca gcatgcgcgt caaagcggaa gagtcccttt cgaaactttg 8520 gctttgcgtg gaggccatca cgaccaatcc cccttttctc aacggcaccc ctcatccgtc 8580 caaattcaat tatctcctcc tctcagcagc ttgggattcg gctcgcggag cagcatcgct 8640 tatgaatgat gcaaaagacc gcgcgcttga gcttataggt ttcattaact ggtgggtttc 8700 gtcagtctca gggtggcaga attcccttca gcattggatg gtcgattata ttgcggggtt 8760 caagctgcat gatctgaaga agaggggagt cttccttgac ctccttaagc attcgaaatc 8820 gctgaatata ggtcatctac tggcggaaaa cgttcctgtg tattactttt ggcaagagga 8880 tatggacgat tatccgagct tcacgcgact ttccccaacc attcttcaag cttaccacga 8940 cacttgcagc tccctggata agacagaggt ctacggggaa gatatgcagg gtttccagga 9000 cgatatcgcg acaatcaaac actatgacga atttttccag ctaagacgcg tgccagatca 9060 tgtcgcctct ccaatctccc ttgacatccc tcctaacgca gtagcgtata tttgcgactt 9120 tgaaggctgg tcggcttggc tgatcaaaga ccccgacctc attcgtgact acgccgatag 9180 atatcacctc tccatcgaaa tggatgaccg cgacacgtac gtcaccatcg aggtctttac 9240 cttccccggt ataatccata tggaatccat gtggaatcca tggaatccat gaagaattcc 9300 atatggaatc catggaatca atgttggctg aggacacagc cagtttctga ttccatggac 9360 atcatggatt ccatatggaa tgatgctgga atggtcatgg aatgattaat tccatatgga 9420 ctccaacacc attccatatg gattccatca gattccatat ggaatgcagg catatccaca 9480 tggattccat ggagcagtcc atatggattc catggaatgg aattaatgac tattatacca 9540 ttagaataca ttgtccttac tataagtaag taaaataaat tttttttaca tctcgcggaa 9600 ttgaactctc gacctcgtga atgtaagcag cgctcaagcc gactgagcca tgaaggcatt 9660 attatatgag aaactaaagg tatgtatata ccgcttttag gcacgtgccg cgtcaaacgt 9720 gacagctgtg gctgaagcgt tgccacgtcg ttctttcctt cctaccacca ccatcaccat 9780 ccaccaccac cgtttcaaac cacgaccgcc cgcgacatcg acaacgacga cgacgacgct 9840 gacgctgacg acgcgtgacg agggcccagc ccaggtaagt gggaacaacc accccaccca 9900 ctctcctttt taacacaaac cacaggtgcc acgtcgcgac gtggcaacca gacggcgaac 9960 gacgaccgat gtcgtcgttc gccgtctctg ctcggccacg aggactcacc accacgaaca 10020 gcccaccaac acaccccacc aacgtcaacg agcgcccacc cccacgagcg cccaccgacg 10080 aaaacgagcg cccaccaacg tcaacgacct cacaccaacg acgaccagcg cccacgaaaa 10140 
cgaccgccca cggacgtcca gcgaccgcaa acgacgtccc cgcccacgaa aacgatgtcc 10200 accgcccatg aacgacgccc accgccaacg atgtcaaccg cccatgaaca acggccacca 10260 ccaacgacgt cccccgccca cgaagacgac gtccgccgcc caccgcccac gaaacaacgt 10320 ccgccgccca cgaaaaacgt gtaccgccca cgaaacaacg tccgccgcct accgaaaaca 10380 tccgccaccc accgaaaacg tccaccgccc acgaaacaat gtcccccgcc caccgaaaat 10440 gtccgccgcc catccacgaa cacagtgacg agcccaggtg agcctcccct ccctccatca 10500 ttattccaca tcacaccctc ccccattgct ccccttctaa ttgctgcccc cctctccacc 10560 tttccttccc cctccataac tccccacctc catcactctc cacccctccc ctccctcctc 10620 attctccaca actccctctc tccacaactc ccactcccct ccatcacttc cccctccctt 10680 gatcactcta tatctatacc tctccctccc cctcccttta atattatatg tagtaaaata 10740 aaattatgta gtctgtttca atacacctta gttttcttca tttattagtg tttaagttgg 10800 aactttgcag attaagtctg gacttgtgat tctatgataa taattgtgga attgggactt 10860 attccatctg aattccatat ggaaaaagta tggattccat ctaaaaaagt atggagtcca 10920 tcagctattc catggattcc atctgaaaga agtatggagt ccatcagcta ttccatggaa 10980 tccatctcca gaaagtatgg agtccatcag ccattccatg gaatccatct ccagaaagta 11040 tggagtccat cagccattcc atggaatcca tctccagaaa gtatggagtc catcagccat 11100 tccatggaat ccatccccag aaagtatgga gtccatctgc cattccatgg aatccatctg 11160 gaattcctgg aatccacccc ccttccatgc attccatatg gattaagccg gggagggtaa 11220 agtacaggtc accatctggc gatggaagcc acgatacgca gtcccctggg ttgtaccgag 11280 agcaggaatt ctggggactg gagttaccat agaagctcgg cggagtgatc gggagattcg 11340 cgaactttac aaaaatgctc acgcaccgga tagcaaacga caattcgacg aatggggtcg 11400 catcaccttg agcagccggc tgggagacat ctgcgaggac tatcagggta gccccatgcc 11460 aaccgaacac ggtgatcatc aagccctcct accacgtccc cattgggttt cagcctctca 11520 accacagttg tctattccac gtcctacatc acccatcact accgtttcta gttgggcgca 11580 agccatggcg gcaccttcaa gccactctag ctccagagcg tcgtcgctgg ctcaacactc 11640 gaggagctct ggcgattacg gatcccgtcg acgttcggcc tcacctcagc gacgtcatct 11700 acaccatgca aacgcgatgc ccagtctaag agcagtgttt gtggacgact tacgggacct 11760 ggcgaatcaa tatccgatgg ttgccaatcc ctgggtaagt aaaaaaccct 
tgacctggaa 11820 cgcagatttc ctcgacgtcg ggtaccttct cattccaaac gatagagcgc aggcacggct 11880 gcgttattgg gccacttctt cccaggatgc caccacgatg ctggacctct tgctcaaagc 11940 gatctgccac ggcttgtcat tcagtatcgg ggtcaaagtg gaggactttg ggaggtttaa 12000 gcctgaggac atctctgaca cggaccgatt ggttgggaag ccgacgtccg caattgaggc 12060 cccattcatc tacaccgccc aaggtgctct acgcgcttac tacctaagtc gcgtcaatga 12120 tatcatacgc cgtctgcatg ctagaatttt aattggcatg gggggccctg aagcctggct 12180 aggtcggaaa tggggcagtt ctgaactcgt cgcacagttt atggggggac cttcaccgga 12240 cgtttacctt caccgccgag gctatatcga ctcggacgac gagcatccaa tgtttctgta 12300 cacggacgaa atgacacctc aggagctcga cgtgctcttt ggatgcattc gcaatgatag 12360 cgaccgggat cgttccttgt acccctcgag ggacatcttg gacgagggct gcttcttctg 12420 gacaggggag tgggacactc gcatggactc tatgtttaat gatatcacca aggacatact 12480 gcaaggatca gcaaagttcc ggactcccgg catgtggaac gagtactttc gtcggctgaa 12540 ccgcagtcat agaggttcca aggagcgtct caaccaagtg gtgcctagca tgctggctcg 12600 tcttcatgcc aaaatcattg acggtttccc ggtggattgg aacaagcggc gtctttcagg 12660 aatcgagttg ccggaggaat acaggccacg tcaaatcgga aatagggggg ctc 12713 // ID Gypsy-1_ARO-I repbase; DNA; FNG; 5423 BP. XX AC ABVF01000027; XX DT 02-MAR-2011 (Rel. 16.03, Created) DT 02-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Arthroderma otae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_ARO_; KW Gypsy-1_ARO-LTR; Gypsy-1_ARO-I. XX OS Arthroderma otae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Onygenales; Arthrodermataceae; OC Arthroderma. XX RN [1] RP 1-5423 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Arthroderma otae genome."; RL Direct Submission to RU (02-MAR-2011). XX DR Genome; ABVF01000027; Positions 54155 59577. XX CC Positions [4320-4811] - Integrase core CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 193..1359 FT /product="Gypsy-1_ARO-I_1p" FT /translation="MEDWNAMDTAEGQRQGQENNGNGQQNSNQPPSWLGAF FT LEQQSTLMERQTEQIRTLTAQLATLRERVQEQGTPLVAHLTPQASPAPSIQ FT TPPHPDRPKPALPNAEKFSGERLLDYPSWRMEVNAKLTVDGESIGGPSQRV FT WYIFSRLEGVAARRVLPWVERNQYDPSQFTPAHFCAYLDHLFVDPARQEKA FT RRRVYTIRQGKRALAEFLAEFDQVIMEAGGHTWDDTAKIGALQMALDPKLL FT EYMIGVDTPESYEDYCSLLRRTDDRSREAQRVAQAQRIGLQGYRAWSSQNQ FT QPRALAPAVPRGDPMDIDSTAQKPARRAKWASPEEREKRRQGRRCLRCGGA FT GHFVAACPYLPPRPTTQVTDVTPAASGPPPDLEEEEDANYSGQGKD" FT CDS 1437..5420 FT /product="Gypsy-1_ARO-I_2p" FT /translation="MESPHIVVDAIVNQTAVARTMIDTGCRAYGVITSAFA FT RKQRIKRIPITPRALSGVNGPTGETITEVGYMTLDIAGNRQERVFMYILPR FT AAGYDIILGRGWMADQDASLEKQTRTLVFNRTGVRVEVGDMQRKMDIKEIS FT AAAFSCLYRRARREKAQKGDLDEQEAGCRPRTTSESIWRRPHQTAPSESVG FT QGAQAPTRPVWRCHHQTAPSESVGQGAQAPTRPVWRCHHQTAPIEPTAQST FT LCAQVTTEPVCGSLHQTAPVEIFAASMADIEKALKVKPRTDPRTKLPPQYY FT EYLDVFDQKVAETLSPHRPGIDHEIELVEQDERGNKPKIPWGPLYNMSRGE FT LLVLRKTLTELLEKGFIRVSNSPAAAPVLFAKKPGGGLRFCVDYRGLNKIT FT RKDRYPLPLINETLQRISKAKWFTKLDVIAAFHQLRIAEGSEWMTAFRTRY FT GLYEWLVTPFGLANAPSTFQRYINWVLRDFLDEFVSAYIDDILIFTEGPLE FT EHRAQVKRVLQKLREAGLPIDINKSEFEVTSTKYLGFIIDTKEGIKMDPAK FT VNAILDWQAPQSVKGVRGFLGFANFYRRFIKDFSRIAAPLTELTKKDAEFK FT WTSEADAAFNELKKKFVAEPTLLQFDPEQETILETDSSGYVVGGTLMQYDN FT EGLLRPCAFYSKRNSPAECNYEIYDKELLAVIRCLEEWEAELLAVGSFRIL FT TDHKNLEYFATVRRLTERQMRWAEFLSRFNFKMIYRPGILNSMADALSRRE FT QDMPKDISDERLAYREVKLLKDEWIDTSNSSKEDLPGIGIKLSAKPEVCVL FT PIAPNGRPLRHPGDGITPSGTGGHHQTGLETDELRSIWENARRNDPVILAL FT EQAVREKRARFPVHLGVKASITECSLAPNGELLWRGRRWVPDSEPLRTRIV FT QDTHDSVLSGHPGREVTTALVGRQFYWPGYARDIRQFLRNCATCRASNSWK FT DRRQGLLKPLPVPDRLWSELSMDFITDLPKSQRCTTLLVITDRLSKGVILE FT PCEKIDADTVAKIFIRTIYRRHGLPRAIVTDRGTQFTGALWTRICQVLGIR FT RRLSTAWHPETDGSTERMNQTVEAYLRKYVNYLQDDWHDWLPSAELAINNR FT DAAATGVSPFFLEHGYHLDVVDFPEDLTAGASDSRSPIQRADAILHKLKST FT GEWAQASMAAAQQEQQDFVNRFRDPGTQYEVGDYVWLDLRNIKTDRPSKKL FT DAKCGKFRVLKRIGSHAYRLDTPGTIHNVFHINLLRPAASDPLPSQRVQAW FT RPPGIVGEDGEPEEFEVDQIVKERTRKGQRQLLVKWTGWDKPTWEPASALQ FT 
ETAALDAFEARTRRGA" XX SQ Sequence 5423 BP; 1527 A; 1353 C; 1518 G; 1025 T; 0 other; ttacaaaaca catcattata ttggttatat tggttacatt ggacacattg gtcatattgg 60 atacattggt tatattggac acattggtta cattggacac attggttaca ttggacacat 120 tggttacatt ggacacattg ggtacatcga gtacacagaa gatattggaa acattggtat 180 aggaacattg gaatggaaga ttggaacgca atggatacag cggaaggcca acgtcagggc 240 caggagaaca atggaaatgg acagcagaat agtaaccagc caccgagttg gttaggagct 300 tttctagagc agcagagtac ccttatggag aggcagacgg agcagatcag aaccttaaca 360 gcgcagctag ccacgctgag agagagggta caggagcagg ggacacccct agtagcacat 420 ctcacccccc aagcctcacc ggccccctcg atacagacac caccacaccc tgatcggcca 480 aaacccgcct taccaaacgc ggagaagttc agtggagaaa gactattgga ctatccatca 540 tggaggatgg aagtcaacgc taaacttaca gtggatgggg agtcaattgg gggccctagc 600 cagcgagtat ggtatatttt ctcccgacta gaaggagtag cagcgaggag agtccttccc 660 tgggtagagc gcaaccagta cgacccaagc cagtttacac cagcacattt ctgtgcttac 720 ctggaccacc ttttcgtcga cccagctcgg caggagaagg ccaggcggcg ggtgtacact 780 atacgccaag ggaagcgagc actggctgag tttcttgcag aatttgatca agttattatg 840 gaagcaggag gccacacttg ggacgacaca gcaaagatag gagcactgca gatggcccta 900 gaccctaagc tcctagagta tatgattgga gtagataccc ccgagtccta cgaggactac 960 tgctccctcc ttaggcgaac ggatgatagg tctagagagg cacagcgagt ggcccaggcc 1020 cagcgcatcg ggctccaagg ttacagagca tggagctctc agaaccagca gccccgcgct 1080 ctcgccccag cggtccctcg aggggaccct atggatattg acagcaccgc ccagaagcca 1140 gctcgccgtg ctaagtgggc gtcgcctgaa gagagggaaa agcgtaggca gggccgtagg 1200 tgcctacgct gcggaggcgc aggccacttc gtcgcagctt gcccttacct gccgccacgc 1260 ccaacaacac aggtcacaga tgtgaccccg gcggcgtcgg gaccaccgcc agacctagaa 1320 gaggaggagg acgccaacta ctcgggccag ggaaaagact agctcctgtg ttaaagcgca 1380 cacaggagcc ccgagaagat atactgcaag aatggcaaag gttccagaaa ggaaagatgg 1440 aaagtccaca catcgttgta gatgctatag tgaatcaaac cgctgtagct agaactatga 1500 tagatactgg ctgtagagcc tatggagtaa taacatcagc ctttgccagg aagcagcgta 1560 ttaagcgtat tccaatcaca ccacgtgcac tctccggtgt gaatggaccg acgggcgaga 1620 cgataacaga 
agtagggtac atgaccctag atatcgcggg caaccgacag gagcgtgtgt 1680 tcatgtatat cctgccgcgg gcagcagggt acgacattat attgggaagg ggttggatgg 1740 cggatcagga cgcctcattg gaaaaacaga cgaggacatt ggtttttaat cggacagggg 1800 ttagagtaga ggtaggggac atgcagagga aaatggatat aaaggaaatt tcagccgcag 1860 cctttagctg cttatacagg agagcaagga gggagaaggc gcaaaaggga gacctcgatg 1920 agcaggaggc agggtgccgt ccacgaacaa cgtcagaatc tatctggcga agacctcacc 1980 agacggctcc cagcgagtct gtaggacaag gtgcacaagc accaacaagg ccggtctggc 2040 gatgccatca ccagacggct cccagcgagt ctgtaggaca aggtgcacaa gcaccaacaa 2100 ggccggtctg gcgatgccat caccagacgg cccctatcga gcccactgca cagagtacgc 2160 tgtgcgcaca ggtgacaaca gagcctgtct gtggaagtct tcaccagaca gctcctgtcg 2220 aaatattcgc agcgagcatg gcggacatcg agaaggctct aaaggtaaag ccacggacgg 2280 atcccagaac gaagctaccg ccgcagtatt acgagtacct ggacgtgttc gaccagaagg 2340 tagcagaaac gctgtcccca catagaccag gcatcgatca cgagatcgag ctagtcgaac 2400 aggacgaacg gggaaacaag ccaaagatcc cctggggccc cctctacaac atgtccagag 2460 gcgagctatt ggtgcttcgg aaaaccctta cggaactact agagaaaggg tttatccgcg 2520 tgagcaactc acctgcggcc gcccctgtac tgtttgccaa gaagccagga ggaggactga 2580 ggttctgcgt tgactatagg ggcctaaaca agatcacccg gaaggataga tacccccttc 2640 ctctaataaa tgagacacta cagcggattt cgaaggcgaa atggttcact aagctcgatg 2700 tgattgctgc ctttcaccag cttcggatag cggaaggatc ggaatggatg acggcgttta 2760 ggacacggta cggcctatac gaatggctcg taaccccatt tggtctagct aatgcaccga 2820 gcacattcca acgatatatc aactgggtac tacgggactt cctggacgaa ttcgtatcag 2880 cctatattga cgatatcttg attttcacag aagggcccct cgaggaacac agagcgcagg 2940 taaagagagt gctccagaaa ctaagagagg caggactacc tatcgatatt aataagagcg 3000 aattcgaagt tacttctact aagtatctcg ggtttattat cgatactaaa gaaggaatca 3060 agatggaccc cgcgaaggtt aacgctattc tggactggca agcaccgcaa tcggtaaaag 3120 gggtgcgagg gttcctaggg ttcgcaaact tttatcggag atttataaag gacttctcaa 3180 ggatagcggc ccccttaacg gaacttacta agaaggatgc cgagtttaag tggacgtcag 3240 aggctgatgc ggcctttaat gaactcaaga agaaatttgt ggccgagcct acccttttgc 3300 aatttgaccc ggaacaggag 
actatactag aaactgactc ttccggatat gtggtgggag 3360 gcaccttaat gcagtacgac aacgagggcc ttcttcgccc ctgcgccttc tactcaaaaa 3420 ggaactcccc ggccgagtgt aactacgaga tctatgacaa ggaactactc gcagttatcc 3480 ggtgcttaga agaatgggaa gcggaactcc tagcagttgg ctcttttaga atcctaacag 3540 accataagaa cctggagtac tttgcaacgg tccgacggct aacagagcga cagatgagat 3600 gggcagaatt cctgagccgg tttaacttca agatgatcta tcggccaggg atcctaaact 3660 cgatggcaga cgcgctctcg cggagggagc aggatatgcc caaagacata tcagacgaac 3720 ggttggccta tagagaggta aaactactaa aggatgaatg gatcgacaca agtaatagta 3780 gtaaagagga cctccctggt attggcatca agctctccgc gaagcccgaa gtatgtgtac 3840 tacctatagc gcccaacggt aggccattaa ggcaccctgg tgatggcatc acgccctctg 3900 gaactggagg ccatcaccag actggccttg agaccgacga actgcggagt atttgggaga 3960 acgcacggag aaatgacccc gttatactgg cactggaaca agcggtacgg gagaagagag 4020 cgaggttccc cgtccacctc ggggtaaagg catcgataac ggagtgctct ctagcaccaa 4080 acggcgaact gctctggaga gggaggaggt gggtacccga tagcgaacca ctacggacaa 4140 gaatagtaca agatacccac gactctgtgc tcagtggcca cccaggaaga gaggttacaa 4200 ccgccctagt aggtagacaa ttctactggc caggctacgc gagagatatt agacagttcc 4260 tacggaactg cgcgacctgc agagcctcaa actcctggaa ggaccgaaga caaggattac 4320 tcaagccctt acccgtaccg gatagattgt ggagtgaact ctctatggac ttcattactg 4380 acctcccgaa gagccagcgc tgcactaccc tattggtaat aacggatagg ctaagcaaag 4440 gggttatact ggaaccttgc gagaagatag acgccgatac cgtcgctaag atctttatta 4500 ggacgatata taggcggcac ggactaccac gggctatagt cacggatcgc ggaacgcagt 4560 ttaccggagc gctttggact cggatctgtc aagtcctagg gataaggagg cgcctatcga 4620 ccgcgtggca cccggagacg gacggttcca ctgagaggat gaaccagaca gtagaggcct 4680 acctacggaa gtacgtgaac tacctacagg acgactggca cgactggctc ccctcagcag 4740 agctcgcgat taataaccga gatgcggcgg ctacgggcgt gtcccccttc ttcctagagc 4800 atgggtacca tctagacgtg gtggattttc cggaagacct gacggcagga gcttctgact 4860 caaggagccc tatccaacga gcggacgcca tcttacacaa gctgaagtca actggcgaat 4920 gggcacaggc ttcaatggcg gccgcccaac aggagcaaca ggattttgtg aataggttta 4980 gggacccagg cacgcagtac gaagtagggg 
attatgtatg gctagatttg cggaatatca 5040 agacggaccg accctctaag aagcttgacg cgaaatgcgg aaagtttaga gtcttaaaga 5100 ggatcggaag ccacgcttat cgcctagata cccccgggac gatacataat gtgttccata 5160 taaatctact ccgtcctgca gcctcagacc cactaccttc gcagcgagta caggcctggc 5220 gaccacctgg gatagttggc gaggacgggg agcccgaaga attcgaggtc gaccagatag 5280 taaaggaacg gactagaaag ggacaacggc agctactagt gaagtggacg ggctgggaca 5340 agccaacctg ggaaccagcc tcggcgctac aggagaccgc cgccctcgac gctttcgaag 5400 cgagaaccag aaggggggcg taa 5423 // ID Gypsy-1-I_ACa repbase; DNA; FNG; 6414 BP. XX AC . XX DT 26-FEB-2009 (Rel. 14.02, Created) DT 26-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE An internal portion of the Gypsy LTR retrotransposon from DE Ajellomyces capsulatus - consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW reverse transcriptase; Gypsy superfamily; Gypsy-1-I_ACa. XX OS Ajellomyces capsulatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Onygenales; Ajellomycetaceae; OC Ajellomyces. XX RN [1] RP 1-6414 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from Ajellomyces capsulatus."; RL Repbase Reports 9(2), 358-358 (2009). 
XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 57..1076 FT /product="Gypsy-1-I_ACa_1p" FT /translation="MAPRTRAGTAPSGGASTSSPRARTPLPRQESEIPEPA FT AHLGMGPPAALPPPRPLHSYVYHGDKGIKVGDPPNYKGKTLSEFHSFTRAL FT ERKFRLNLSQFNTHEVRVIYAASLLEGTTATTWATKERELERSNAAMTWEE FT FKNFLLDQIEDHRNRAFTAAQRFNSLRQRDDENPSDFNARFEEYWEELSDD FT LNLLKTQLYLAKLTHKVRTTISQYQSPPTNRTELVNLASRIYLNNQHHEKK FT GDKTPSKQPTTGSKRRHDGSGADDAKKNSKRSSSSLTAKERQNRLDNNLCF FT NCGISGHFASDCRKSKKGRNRGDTSSKTTPLLANVSTEDKKERPKED*" FT CDS 1200..4925 FT /product="Gypsy-1-I_ACa_2p" FT /translation="MKGWKKIGPPGHAKFTDGTKCHRFGIIQTKCEATDSE FT GTSKGSSLLLHAVNIIDEDVILGLPWLRENNPLIDWVTQAWRYRLTDEEAA FT IEEPEEFHRKNLEDKIESIYCVAGDLDLEAGGEAATLTARLAVMSATEGGR FT TQRLFTDYKDVFSEKEAAKFPDTKVRHKIHLEEGATAPYGPLYNLSVKELD FT VLRKYLEEAQSKNWIRPSKSPAGAPILFVPKKDGRLRLCVDYRGLNKVTVK FT DRYPLPLIDEMIDRLSGARVFSKIDIRDAYHRIRVNENDIWKTAFRTKYGN FT FEYLVLPFGLCNAPATFQAYINEALRGLVDVSCIVYLDDILIFSKTEKEHD FT THVREVLDRLKQHQLYANTEKCSFYTQEIDFLGYIVGVDGISMDRSRVSAI FT EDWPAPRTYREVQVFLGFANFYRRFIYAYSRIAAPLTDLLKGSVKGRKTGP FT FILTQDAVRAFEELKQAFADATMLNHFDPGLPSQCETDASGNGVCGIFSQL FT TPETFQRWKDRGLITSGDTGNGMPVGEGFFTEPTRRREWRPVAFFSKKLSL FT TQRRYDTHDQEMLAIVESFKIWRHYLQGCKYPVQVLTDHANLRYFLTTKSL FT TGRQARWAELLSEYDFFIRYRPGRLNPADAPSRRPDYDIVDGDEDPTKGPL FT PSLQKKLAAGGWGSLKLGNGAGKAAARPYTGGGELGSPQGTGTEGHELLVP FT RFAAIELAEDETVYNLPTQRLLEFIGELQQGDALTAARIQKLKEEGSGSNL FT EGASPERGGAAEEPSRWQLGECGLLLLNGSVFVPRSRAVRQELLRIHHDDP FT HAGHFGLEKTEELLRRKYVWDKLRTDVKEYVETCDVCQRIKVPRHKQYGEL FT APLPVPKGPWQDLSMDFIVDLPKSTRVKGYDSILVIVDRFTKMAKYIPTHK FT KVKAPELADLFLDAIIRDHGSPKSLVSDRGSLFTSKYWSAFCYHLTIRRLL FT STAFHPQTDGQTERQNQTLEHHLRAFCSYHQNDWPRHLPTAEFAYNNAKNA FT TLKVSPFYACYGYNPSLPSDVADDVLKGGIDSRNVERDRGTEYPSVPEVKA FT RLELLDRIHGEAAKSIRRAQERQAEYYNRKHLPMKFSIGDQVLLSAKNIKT FT TRPCKKLSERWLGPFPIIRVVGKQAYELKLTSGFKSIHPVFHISYLEPYKQ FT RPGAEPPRPDGVEIEGNTEYLVEGILDKRMHYNKVQYLVKWEGYPSSENSW FT EPLEHLENAEEEIAKYEVASRDRPAARRRCG*" XX SQ Sequence 6414 BP; 1677 A; 1648 C; 1704 G; 1384 T; 1 other; attcgaatcg ctcgacctgg ttacacgttt gaatcagtcg 
agctctattg atcgatatgg 60 ccccacgcac ccgtgctggg actgcaccct ctgggggagc ctccacgtcg tcccctagag 120 cgcgtacacc tctaccccgg caggaatctg agattcctga acccgccgct caccttggca 180 tgggccctcc agccgccctg ccaccgccgc ggcccctaca ctcctatgtc taccatggag 240 ataaagggat caaggttgga gatccgccga attacaaggg caagacgctg tctgaattcc 300 acagcttcac gagagccctt gaacggaaat tccgccttaa cctgagccag tttaatactc 360 atgaggtacg agtcatatac gccgcgagcc tcctcgaagg caccactgca actacatggg 420 cgaccaagga aagggagctt gagcgtagca atgctgccat gacatgggaa gaattcaaga 480 acttccttct ggaccagatt gaagaccacc gtaaccgcgc cttcacagcg gcgcagcggt 540 tcaactcact acgtcaacga gacgacgaga atccttccga tttcaatgca agattcgaag 600 aatattggga ggaactgtct gacgatttga atctattgaa aacgcagctt tacctcgcga 660 agctgactca taaggtgcgg acgacgatat cgcaatatca aagtccgccc accaatcgca 720 ccgagcttgt caacctggct tctcgaatct atttgaataa tcaacaccat gagaagaagg 780 gggacaagac tccctcaaag caaccgacaa caggatcaaa acggcgccat gacggctccg 840 gtgcggatga tgcaaagaag aactcaaagc gttccagttc ttcacttact gcaaaggaac 900 gccaaaatag actggataac aacctttgtt ttaactgcgg aatctcaggc cattttgcca 960 gtgattgccg taagtcgaag aagggaagga accgaggaga tacctcctcg aagacgactc 1020 cccttctggc aaatgtatcc actgaagata aaaaggaaag acccaaggag gattgacaat 1080 cctcacgact ccgagtccat gtaaccttga agatgaatac cccatgaggg gaggtgccag 1140 tgaggtgcct agtcgactcc gaggcggacg ctaacttcat atcctattcc ttttcactca 1200 tgaagggttg gaaaaagata ggtccgccgg gccacgcaaa gtttacagat ggaacaaaat 1260 gccaccgatt cggcatcatt caaacaaagt gtgaggccac ggactctgaa ggaacctcca 1320 aggggagcag tctgctcctc cacgctgtca acatcataga tgaggacgtc atcctaggtc 1380 tcccgtggtt gagagaaaac aatcctctca tcgactgggt cactcaggcg tggcggtacc 1440 gcctcaccga cgaagaggca gctatagagg aacccgaaga gttccatcgg aagaatctcg 1500 aggataagat tgagtcgata tactgtgtag caggtgactt ggacttggag gcaggcggag 1560 aagccgccac cctcactgcc agattagcag tgatgtctgc cactgaaggc ggacgaacgc 1620 aacgtctctt cacagattac aaggacgttt tttcagaaaa ggaagcagcg aagtttcctg 1680 ataccaaagt gcgacataag atccatcttg aagagggcgc gacagcgccg tacggacccc 1740 
tttacaattt gtcagtaaag gaattggacg ttctgcgcaa atatcttgaa gaagcgcagt 1800 caaagaactg gatacgccca tcgaagagtc cagcgggggc acctatacta ttcgtgccca 1860 agaaggacgg taggctgcgc ctctgcgtag actacagggg cctcaataag gtgacggtta 1920 aggatcgcta tccactacca ttaattgatg agatgatcga tcgattatcc ggagcaaggg 1980 tgttctcgaa gattgacata cgggacgcct atcatagaat aagggttaac gaaaacgaca 2040 tttggaagac agcttttcgt accaagtacg gcaattttga atatctcgtt ttgccttttg 2100 gtctttgcaa cgctccggcc accttccagg cctacataaa tgaggcgctg cggggccttg 2160 tcgatgtctc ctgcatagta tacttggacg acatcctaat cttctcgaaa acggaaaaag 2220 aacatgatac ccacgttcgg gaggtgttgg atcgcttgaa gcaacaccag ctctatgcga 2280 acactgagaa atgctcgttc tacacacagg agattgattt cttaggttat attgttgggg 2340 ttgatggcat ttcaatggac aggagccggg tctccgccat cgaggactgg ccggcgccgc 2400 gcacgtaccg cgaggtgcag gtattcctag ggttcgcgaa tttctataga cggttcatct 2460 atgcgtattc gcggattgct gcccccctta cagatttgtt aaagggaagt gttaaaggca 2520 ggaaaacggg accgtttatc cttacccaag acgctgtgag ggcgtttgaa gaattgaagc 2580 aagcttttgc agatgccacg atgttaaatc acttcgaccc cggcctaccc agtcagtgtg 2640 aaactgatgc atctgggaat ggtgtctgcg gaatcttttc gcagttgaca ccggagactt 2700 tccaacgttg gaaggatcgg gggttaataa cctctggaga tacaggcaat ggaatgcctg 2760 taggagaagg ctttttcaca gaacctacaa ggcgccgtga atggagacca gtggccttct 2820 tctccaagaa gctatcgctc acgcaaagaa ggtatgatac tcatgatcaa gagatgttgg 2880 caatagttga atcctttaag atttggcgac actaccttca aggttgcaag tatccggtgc 2940 aagttctcac cgaccatgca aatttgcgtt acttcttgac aacaaagagc ttgacaggac 3000 gtcaagccag gtgggcggag ctgctttcgg agtatgattt ctttatccga taccgtccag 3060 gtcggcttaa ccccgccgat gcgccatcaa ggcgacccga ttatgatatc gttgatgggg 3120 atgaggatcc tacaaaggga cctctaccat cactgcagaa gaaactggct gcagggggtt 3180 ggggttctct aaagcttggg aacggcgccg gtaaggcagc tgcgcgaccc tatacaggtg 3240 gaggagaact cgggtccccg caggggactg gcactgaagg tcatgaactt ctcgtgccac 3300 gttttgctgc cattgaattg gcggaagacg aaacggtcta caaccttcca acccaacggc 3360 ttctagaatt cattggtgag ttgcagcagg gggacgccct tactgcggct cgaatccaga 3420 agttgaagga 
ggaaggatcc gggtcgaact tggaaggggc aagccctgag cggggcggtg 3480 ccgccgagga accaagtcgc tggcaactag gggaatgtgg actcctactg ctaaatggtt 3540 cagtctttgt ccctcgtagc agggcggtgc gccaggagtt attgaggatt catcacgatg 3600 atccccatgc tgggcacttc ggattagaga agactgaaga gctcctgaga cgtaagtacg 3660 tttgggacaa gcttcggaca gatgtcaagg agtatgttga aacttgtgat gtctgccaga 3720 ggataaaggt tccgcggcac aagcaatatg gcgagttggc gccactgcct gttccaaagg 3780 gcccatggca ggatctatcg atggatttca tagttgacct gccgaagtcg acaagagtca 3840 aaggatatga ctcaattcta gttattgtcg accgctttac gaagatggcc aaatatatcc 3900 ctacccacaa gaaggttaaa gcccctgaat tggctgacct gtttctagat gccatcattc 3960 gtgatcacgg ttctccgaaa tcgttagtgt cagaccgagg atcccttttc accagtaaat 4020 actggtcggc cttttgctat caccttacga taaggcgcct cctcagcacc gcgttccatc 4080 ctcagacaga tggccagact gaacgtcaga atcagacatt agaacaccat cttcgggcgt 4140 tctgttcata tcatcagaat gactggccta gacaccttcc gaccgccgaa ttcgcatata 4200 acaatgcaaa gaatgcaacc ttaaaagtgt ccccctttta tgcttgttat gggtataatc 4260 cttcattacc tagtgacgtc gcggacgacg tcctaaaggg ggggatagac agtcggaatg 4320 tcgaacggga tcgtgggaca gagtacccct cggtgccaga agtaaaagct agattggagc 4380 ttctggatag aatccacggg gaggcggcga aaagcatccg ccgcgcccag gagcgacagg 4440 cggaatacta taatcgaaaa caccttccaa tgaagttctc aataggggat caagtgttac 4500 tttcagcaaa gaacatcaag acgacgagac cgtgcaagaa gctcagtgag cgatggttgg 4560 gacctttccc gataataagg gtagtgggta agcaggcata tgagttgaaa ctaacctcag 4620 gattcaaatc tatccatcct gtcttccaca tttcctattt ggaaccctat aagcaacgac 4680 ccggggcgga gccgcctcgg ccggatggcg tcgagattga gggtaatacc gaatatcttg 4740 tagaaggaat attggacaaa cgaatgcact acaacaaagt tcaatacctt gtcaaatggg 4800 aaggttatcc atcatccgag aactcatggg aaccgcttga gcatttagag aatgccgagg 4860 aagagatcgc aaaatatgag gtggctagta gagaccgccc cgctgcacgt cggaggtgtg 4920 gttaggcaag agctgctgct tgagacttct tctttttccc aaagggtttt ttagagatga 4980 tctcgcggtg gtgagccggt gcctcgaaga cgctgtaaca ctcttctcct atttttccct 5040 aggggttttt aggagtttaa tgaggttacg cagctcaaag atctcagagc ggtaggaagg 5100 aaaaaaggga tatgaagaac 
caacaattaa tgcaatttta tttggcctga gggtcgctat 5160 tattctataa gggccgaagc ccacgaagat cgaagaaaac acctgaaggc ctcattcttc 5220 caccacctca cccccaccga actcctcctc caccgacgga gtccctggca ccggcacctt 5280 tgcctgccgt ggagttagcg ctgcggcgcc gcaggggggt gtaggaaagt acaacaaagt 5340 gtcaacttac caatttgcaa agcacctcag tcacctgctc caattggagc gagatcctct 5400 ccaacaaacg atttgtctcc gtggcgcttc caacggcggc gccggccgac cccttcttct 5460 tagggccggc cattctgaag tgtttttcca cctcggagac ccacgctttc gcctccttct 5520 ccagacgggc cttgggaacc tccccagaac ggacggaatc cgccaacgcc tggagatcca 5580 caactttggc cctgtactgc tttggcacct atacgccttg tcagaaccgg cacggcgccg 5640 ttacagaagg agcagtacgt acaggaaggc aggtgtcgtt gtttttcacg caacggaaac 5700 agcgtgaatg ggcggagggg cgggtgcaaa ggatcccccc ttctcgaagc tgcttggcgc 5760 agcgaaggca ggtagcggtg ccagccaggc gccgcaccgg ggtttcccca caaggagaaa 5820 gcaaaggaac catggaaccc tgcggacccg acccacaggc accctgagca gggctcgggc 5880 gctctgcagg aggggaacta cggactggac tcatttgagc cgctgtcccc ggaaccaagc 5940 tctccctgag gtcctctccg aacgaccatg cggggtcata catcgccctg ttgattggcg 6000 gggggggcgg cggtgtgaca gcggcagggg gggaggagca gaacccccac agctgctagg 6060 tgtctccgcc tccgcctcca acctggccct tttggccgcg acagaggccg cagctgcacc 6120 acgtagacgt tttttaccca ctagaaaaca tcagaataag cccccttact gtcttccaag 6180 gacgacgttc ccatgcactc actattgcar accaggaagg ggaccagggg gggggagaaa 6240 aagaaggact gcgcattcag cagaagaatc acgtcgaata agactcaccc attcctttat 6300 gccgcgcagc gcgcgcgctg ccaaagcggt tggcaaccgt tgctacacaa caaagataat 6360 tcccaatccg gtaatcttga ggcatcggga ctatgccttt aaaggggagg gcat 6414 // ID LTR-4_AN repbase; DNA; FNG; 528 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 09-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Long terminal repeat of a LTR retrotransposon - a consensus DE sequence. XX KW LTR Retrotransposon; Transposable Element; LTR-4_AN; solo LTR. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. 
XX RN [1] RP 1-126 RA Kapitonov V.V. and Jurka J.; RT "LTR-4_AN, a family of solo long terminal repeats in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(12), 210-210 (2003). XX DR [1] (Consensus) XX CC LTR retrotransposon. Solo LTR. CC 5-bp TSD, 98% identity to the consensus sequence. XX SQ Sequence 528 BP; 126 A; 145 C; 139 G; 114 T; 4 other; tgtcacgggc cgacgccgga gccagctggc gacgacctgg ccctgtgaca cgtgacccct 60 atccctcaat tcccacgtaa cgcggacggg cgccttaccg gtgcctatcc gccagtagga 120 atatgaggga tcgccagagc cgtaccggtc cgccagtagg aatatgaggg atcgccagag 180 ccgtaccggt ccgccagtag gaatatgagg gatcgccaga gccgtaccgg tccgccagta 240 ggaattcaag ggacgagccg tactgggatc aacccatata ttgttgcagt gggatgcggt 300 ggtrtacgca atcaccaatg cgtcactgta atggtattgc ttggagatgc cgaactaacc 360 acactggagg ttatatraag ggcctcctga tgccatgtay atagagatct ccaagcaatc 420 ctttcaatat tcttcagtac caatcragct ccctccattg gataattatc gttgatagtc 480 atcgcttcgt tgtagctaga gaaccccgac gttagaccac gtcttaca 528 // ID Copia-4_LBS-LTR repbase; DNA; FNG; 247 BP. XX AC ABFE01000971; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_LBS_; KW Copia-4_LBS-I; Copia-4_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-247 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000971; Positions 5506 5260. 
XX SQ Sequence 247 BP; 58 A; 61 C; 53 G; 75 T; 0 other; tgttgaagcc tgcatccgag atcgtgatcg aggatgatca cattacgtaa ccactactat 60 atctactgta caggggttcg ttcctttcct cagtcgctct tccatcggct gaggtgagta 120 acggatcccg cctttactat atcattggga ctaactcgag tactactcat agggtatact 180 aggtaggtta ccaatgatat ctagtcgctc ttccatcggc tgagggtata ctagtccttt 240 tccaaca 247 // ID Gypsy-95_MLP-LTR repbase; DNA; FNG; 391 BP. XX AC AECX01000488; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-95_MLP_; KW Gypsy-95_MLP-I; Gypsy-95_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-391 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000488; Positions 134500 134890. XX SQ Sequence 391 BP; 108 A; 101 C; 43 G; 139 T; 0 other; tgtataaata gaggaaccta ttcccactgt aatagtttcc tcacttcatc ttttatcaac 60 ttatctttta cttaaaaaca aactcttatc tgtcgcttta taaagatcct ttttataaaa 120 ccacgcaatc aaatcaatta ccttttgttt tgtctttttg aaacttaaac gcttctacct 180 gtcttcatcg ttatctttga aacttctcaa aagcatctca ccggcctccg ttatacctaa 240 ctgtcacccg tgcccctttc gactcaatca ccaacgtctg aatacagttc aagatctgtg 300 atcttgttca tctttcttat acttcgttca ccccaagttc ttcttacaga aagacttatc 360 aaataacgtg ggtaatcgca gccccgttac a 391 // ID Gypsy-92_MLP-LTR repbase; DNA; FNG; 144 BP. XX AC AECX01000330; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-92_MLP_; KW Gypsy-92_MLP-I; Gypsy-92_MLP-LTR. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-144 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000330; Positions 168707 168850. XX SQ Sequence 144 BP; 37 A; 36 C; 22 G; 49 T; 0 other; tgttatgagc ctataagact atgcttatac tgagttactg gatagttctt caatatccac 60 gttcactatt gtactagttc ttcacgttgc aatctataca ctaggcagca acacccttga 120 ctctatcctg gcacgtctct taca 144 // ID Copia-9_LBS-LTR repbase; DNA; FNG; 289 BP. XX AC ABFE01002869; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-9_LBS_; KW Copia-9_LBS-I; Copia-9_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-289 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01002869; Positions 30392 30104. XX SQ Sequence 289 BP; 70 A; 77 C; 39 G; 103 T; 0 other; tgttgggatg tcacgacacc gtgaacactc ccttttgtcc gttctcttta tttctttttc 60 ctttattccc ttagccactc gtacttacta tataaacccc ccagtagttt actatcacct 120 cagtccgtat aaccgagact gaggtgagcc actcttactc atatctgaga ttactcttac 180 ttactgtacc tagatctttc tctatctact ctaggtacat cactaatata gagtccatat 240 aaccgagact gagatctttc tctatctact ctagggttaa gattctaca 289 // ID Copia-43_MLP-LTR repbase; DNA; FNG; 130 BP. XX AC AECX01001209; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. 
XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-43_MLP_; KW Copia-43_MLP-I; Copia-43_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-130 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001209; Positions 5521 5392. XX SQ Sequence 130 BP; 33 A; 32 C; 23 G; 42 T; 0 other; tgtaccaatc atataggtca gagctatctt tgtagatttt gtgctgggta ttggttctca 60 tcgtgctatt gtgcacactc ttagttcccc aaacccacag taatcaacct tgttccaaga 120 cactcagaca 130 // ID Gypsy-23_MLP-I repbase; DNA; FNG; 5650 BP. XX AC AECX01000136; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_MLP_; KW Gypsy-23_MLP-LTR; Gypsy-23_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5650 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000136; Positions 177305 182954. XX CC Positions [4452-4931] - Integrase core CC 'AATAC' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 370..1413 FT /product="Gypsy-23_MLP-I_1p" FT /translation="MEDIQRQLAELTSSLAEEKNLRMQAEARSQQAKARMA FT ALKSNQRQPGNIPPAAHYQPPMQVDAIPKGPKVSVPDKFTGTRGGPAEIYA FT SQVQLYMLAHPSLFANDRSKVVFALSYLTGAASSWAQPMTQELLDESTAHK FT VTFERFVNNFKAMYFDTEKKSKAERALRSLTQKTTVAAYAHEFNIYATATG FT WETPTLISQFEQGLKREIRVAMVMVQEPFKSIEEIANFAIRIDSKIHGVSD FT HSSHTTPTVSDPNAMDLSAAFVRLSEEEKAKRLKTGSCFHCSQQGHRAREC FT PSRKRDGQFRGKGNFKTKISELEAKIAVLCSRDGGDVDRKEGPSRADSSKN FT GGAQE" FT CDS 1413..5552 FT /product="Gypsy-23_MLP-I_2p" FT /translation="MKVVPTLSRVVDFDGLDLGASRIISCNANDQRLFFNA FT SLSQSQNPLATPLHYPTARFLIDSGATHDVLSESYATSTGITEHVHRTSRV FT VTGFDGSSSRSSFETNLHINDNTSPTHFVITRIKDSYNGILGILWIKKNYS FT LIDWPNGIVLHHTSHIAAAKAVLSSPNTPSPGQGMDPLREARQSDKGACDL FT SDTLKPPQCEFDPVPSNSSHCTAGKPGSPLKKQTFQSQQHTILNVPEADPV FT ETSKTTDLDIAADDTASSRPSQNPEDLKEEPKGHVRNCDEGACDLLDTFKP FT PQCEFVTKSLCHVTELAGQQDPLPNNSALGISVANASWSTSAKLAAEEKKK FT LVSKPLEELIPAYYHRHLNMFRKAQAQRLPPRRQYDFKVELVAGAQPQASR FT IIPLSPAEEEVLNEMIKNGLENGTIRRTTSPWAAPVLFTGKKDGKLQPCFD FT YRKLNSLTVKNRYPLPLTMELVDSLLDTDKYTKLDLRNAYGNLRVAEGYED FT ILAFICKAGQFAPLTMPFGPTGAPGYFQFFIQDILVGRIGKDTAAFLDDTM FT IYTKPGENHELAVDGVLDILSKHQLWLNPEKCEFSKSEVEYLGLIISKNKI FT RMDPTKVKAVKDWPAPKNTTELQRFIGFSNFYRRFIDHFSSTTRPLHDLTK FT INNPYVWDKKCNDAFESLKTAFTTAPILKIADPYKPFVLECDCSDFALGAV FT LSQWCEDDGELHPIAYLSRSLVQAEHNYEIFDKELLAIIASFKEWRQYLEG FT NPNQLEVIVYTDHCNLESFMTSKQLTRRQARWAETLACFDFQIKFRPGRKA FT AKPDALSRRPDLKPDVHDNLTFGQLLKPENIGPDTFPVDLATLESFFEDKT FT VHLEDSEHWFEVDVLGVLDKDELNKDESKTCTDIEIINMIRDTSKTDRHIQ FT QLMDTIDNPISSKVKSATKMYEIKDGILYNSGRIEVPDNDHIKFHIVKSRH FT DSLLAGHAGQSKSLGLVRRSFTWPSQRAYVNRYVDGCDSCQRVKSSTKKPF FT GTLEPLPIPAGPWTDISYDLITKLPISEGFDSILTVVDRLTKMSHFIPCNE FT SMTAEKLADLMITNVWKLHGTPKTIVSDRGSVFVSQITKEIDKRLGIRLHP FT STAYHPRTDGQSEIVNKAIEQYLRHFVKYRQDNWSKLLPTAEFSYNNKDHE FT SIGVSPFRANYGFDPTFNIVPSADQCVPSVQERIKTIQEVQNEVADCLQLA FT QEVMKNQFDKGVQTTPNWNVGDQVWLNSRNISTTRPSPELDHRWIGPFNII FT EKVSTSAYRLNLPPSMKGIHPVFHVSILRKHDTDTIKERRQKEPTPIEIEG FT 
QDEWEVSEILDCRNQRNRKEYLVSWKGFGTEHNSWEPASNLKNSNDLVKEF FT NSLHPDASKRYKKRRRM" XX SQ Sequence 5650 BP; 1757 A; 1348 C; 1249 G; 1296 T; 0 other; tattgtcaga tcttttcatc atacggaact cgaagcctta gattgaaccc acatcacgac 60 tgaattgttt atcctcaagg aagatcgaaa taaactgaaa cagattagaa actcaaagtt 120 agaataaagt agaaactcag attgatataa ttagataaga ttagatttac aagattagat 180 tgaagcttaa actcaccgca aggccaaccg aaactctacc gaattctcag atcaatacaa 240 gccaccacgt ctccgacata caacacaccc tcactggacg atactgaatc ggactctgaa 300 aatccttcgg catttgtcga cgcaccccgc attacagact cagaccatct tgatacttcg 360 actgtgacca tggaagacat tcaacgtcaa ctagcagaac ttaccagctc actagctgaa 420 gaaaagaacc ttcggatgca agccgaagca cgatctcaac aagccaaggc tcgaatggct 480 gcgctcaaat ccaatcagcg ccaaccgggt aatatcccac cggctgccca ttaccagcca 540 ccgatgcagg ttgatgccat accgaaaggg ccgaaagttt cggtgccaga caaatttacc 600 ggtactagag ggggaccagc tgagatctat gccagccagg tccaactcta tatgctcgcc 660 cacccatctc tcttcgcgaa cgatcgtagc aaagtcgtat tcgctttgtc gtacttaacc 720 ggtgctgcta gtagttgggc tcaaccaatg acccaggagc ttctggatga atctacagct 780 cataaagtga cgtttgaacg cttcgtgaac aatttcaaag ccatgtactt tgacacggaa 840 aagaaatcga aagctgagcg agcgcttcga agtttaactc agaagactac agtcgcagct 900 tatgcacacg agttcaatat ctatgctacg gccaccggct gggaaacacc aaccctgatc 960 agccaatttg aacaaggcct caaaagggaa attcgtgtcg ctatggtgat ggtgcaagaa 1020 ccctttaaat ctatcgaaga aatcgccaat ttcgctatac gtatcgacag caagattcac 1080 ggcgtctccg atcatagctc tcacaccacg ccaacggtat ctgatcccaa cgctatggac 1140 ctgtcagctg cgttcgtgcg attatctgag gaagagaaag ctaagaggtt gaagactgga 1200 tcttgttttc actgttcgca acaaggtcat agagcacgcg aatgcccaag tagaaagagg 1260 gatggtcaat ttagaggcaa aggcaatttt aaaactaaga tcagtgaatt agaggctaag 1320 atagcggttt tatgtagtcg tgatggtgga gatgtagata gaaaagaagg acctagtaga 1380 gctgattcct caaaaaatgg aggggctcag gaatgaaggt cgtgcctacc ctgagccgtg 1440 ttgtggattt tgatgggtta gatttaggtg ccagtagaat tatatcgtgt aatgcaaatg 1500 atcaacgatt atttttcaat gcctcactat cccagtccca gaatccacta gccacaccac 1560 tacattaccc cactgcccgt 
ttcctcatcg actctggagc cacccatgat gtactgagcg 1620 agtcttacgc gacttcgacc ggtatcactg agcacgtcca cagaactagc cgagtggtga 1680 ctggatttga tggatccagt agccgctcct cttttgaaac taacttacac atcaacgaca 1740 atacatcacc aactcatttc gttatcaccc gaattaaaga ctcctacaac ggtatcttag 1800 gaatcctgtg gattaagaag aactactcac ttattgattg gcctaatggc attgttttac 1860 accatacttc ccacattgca gctgctaagg cggttttgtc cagtccaaac acaccctctc 1920 caggccaagg tatggaccct ttgagggaag ctaggcaaag tgacaagggg gcttgtgatc 1980 tatcagacac attaaagccc ccgcaatgtg agtttgatcc cgtcccctcc aattcctcac 2040 attgtacagc tggaaagcct ggttctccat tgaaaaaaca gactttccaa tcccagcaac 2100 acaccattct taacgtacct gaggccgacc ccgtagaaac atcaaaaacg acggatcttg 2160 atattgcagc tgatgacaca gcttcgtccc gtccaagcca gaaccccgaa gatctcaaag 2220 aggagcctaa ggggcacgta aggaactgtg acgagggggc ctgtgatcta ttagacacat 2280 ttaagccccc gcaatgtgag tttgttacca aatctctttg tcatgtcacc gaattagctg 2340 gccagcagga tccccttcca aataactctg ctctaggtat ctcagtggcg aacgcatcat 2400 ggtcgacatc tgcaaaatta gctgcagaag agaagaagaa actcgtcagc aaaccacttg 2460 aggaactgat cccggcttat tatcaccgac acctgaatat gttccgcaag gcccaagctc 2520 agcggcttcc accacgacga cagtatgatt tcaaagtaga actcgtcgca ggagctcaac 2580 cccaagccag ccgaataata ccactatctc cagcagaaga agaagtccta aatgaaatga 2640 tcaagaatgg actagagaac ggcaccatcc gacgcactac atcaccgtgg gcagcacccg 2700 tcctgttcac tggaaagaaa gatggcaaat tacagccttg ttttgattac cggaaactga 2760 actcgctgac agtgaagaac cggtaccctc ttcctcttac catggaattg gtggatagtc 2820 tcttagacac agacaaatac accaagctgg acctccgaaa cgcctatggg aatctcagag 2880 tggctgaagg ctacgaagac atccttgcat ttatttgtaa ggccggccaa tttgcccctt 2940 taactatgcc ttttgggcca acgggggcac ccgggtattt ccaattcttc atccaagata 3000 tactggtagg gagaattggg aaggataccg cggcgttttt ggacgacaca atgatctaca 3060 cgaaaccggg tgaaaatcac gaacttgcag tggatggtgt ccttgacatc ctgagcaaac 3120 accagttatg gctcaatcct gaaaagtgtg aattttctaa atccgaggtg gagtacctag 3180 gcctgattat atcaaagaac aagatccgca tggaccctac gaaagtcaag gccgtgaagg 3240 actggccagc cccaaagaat acaacagagc 
tgcagaggtt tataggtttc tcaaattttt 3300 accgtcgatt tattgatcac ttctcaagca ctacaaggcc tctccacgac ttaaccaaga 3360 tcaacaaccc ttatgtgtgg gataagaagt gtaatgatgc ctttgaatcc ctgaagaccg 3420 cattcacgac ggcccctatt ctcaagatag ccgaccctta caaaccattc gtgctcgaat 3480 gtgattgttc ggatttcgca ctcggtgcgg ttttatctca gtggtgcgag gacgacgggg 3540 aactacatcc gattgcctac ttgtcaaggt cgctagtaca agccgaacac aattatgaga 3600 tatttgataa agagctattg gccatcatag cctcttttaa agaatggcgt cagtacttag 3660 agggaaaccc caaccaacta gaagtgatcg tgtacactga tcattgtaac cttgagagct 3720 tcatgacatc taagcagtta actcgacggc aggcacggtg ggcagagaca ttggcctgtt 3780 tcgacttcca gattaagttc cgacctggca gaaaggcagc caagccggac gcattatcaa 3840 gacgaccaga tctaaaaccg gatgtacacg acaacctgac gtttggacaa ttgttaaaac 3900 cggagaacat aggacccgac acgtttccag tcgacctagc aaccttggag tccttctttg 3960 aggacaagac agttcactta gaagattccg aacattggtt cgaggtagac gtattaggag 4020 tgttagacaa agatgagctg aacaaagatg aaagcaaaac atgcactgac attgagatta 4080 tcaacatgat cagggacact agcaagacag acaggcatat ccaacaattg atggatacaa 4140 ttgacaaccc gatttcatca aaggtcaaga gcgcaacgaa gatgtacgaa attaaagacg 4200 gaatactcta caactcagga agaatagaag taccagacaa cgatcacatc aaattccaca 4260 ttgtcaagag cagacacgac tcattacttg ctggccatgc aggacaatcc aagtcattag 4320 gattggtacg taggagtttc acctggccat ctcagcgggc ttacgtcaac agatatgtcg 4380 acggatgcga ctcctgccaa cgcgtcaagt cgagcacgaa gaaacccttt ggcaccttgg 4440 aaccactacc tatccccgcg ggaccttgga cagatataag ttacgatttg ataacaaaac 4500 tccccatctc tgaaggtttt gacagtatac tgacggtagt agataggctg actaagatga 4560 gtcacttcat accttgtaac gagagcatga ccgcagagaa actcgctgac ttgatgatca 4620 cgaatgtatg gaagttgcac ggtacaccca aaacaatcgt atcagatcgc ggtagtgttt 4680 ttgtatccca aatcaccaag gaaattgaca aaagattagg aatcagacta cacccatcaa 4740 cagcgtatca cccgagaacg gacggccaat cagaaatagt caacaaagca attgagcagt 4800 acttgcgcca ttttgtcaaa taccgtcagg acaattggag caagctcctg ccaacagctg 4860 agttttctta taataacaaa gaccatgaat ccattggtgt atcacctttt cgagcaaatt 4920 acggatttga cccgacgttt aacatcgtac catcggcaga 
tcaatgcgtc ccgtcggttc 4980 aggaaaggat caagactata caggaggttc aaaatgaagt tgctgattgt ctacaattag 5040 cgcaggaagt aatgaagaac caattcgaca aaggagtgca gaccacaccg aattggaatg 5100 taggagacca agtatggttg aattcgagaa acatatcgac cacaagacct agtcctgaat 5160 tggatcatag atggattggt ccttttaaca ttatcgagaa agtttccaca tcagcttaca 5220 gattgaatct ccctcccagc atgaaaggta tacacccggt gtttcacgtc tcaatcctgc 5280 gaaaacacga cacagatacc atcaaagaac ggagacagaa agagccaaca ccaattgaga 5340 ttgaaggtca agacgagtgg gaagtgtctg aaatattaga ctgcaggaat caacgcaacc 5400 gtaaggaata cctggtcagt tggaaaggat ttggaacaga acataactcg tgggaaccag 5460 catcaaatct caagaatagc aatgatttag tgaaagaatt taattcactg catcctgacg 5520 cgtcaaaaag gtacaagaaa aggagaagaa tgtgagaggg taaagctttt tccccaaggg 5580 gttttttaat gctacctgtg gaagaatgca gaacttgcaa gagggagttt gggcattaaa 5640 ggggggataa 5650 // ID Gypsy-1_BFB-I repbase; DNA; FNG; 10429 BP. XX AC AAID01000422; XX DT 25-FEB-2011 (Rel. 16.02, Created) DT 25-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Botryotinia fuckeliana genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_BFB_; KW Gypsy-1_BFB-LTR; Gypsy-1_BFB-I. XX OS Botryotinia fuckeliana OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Helotiales; Sclerotiniaceae; Botryotinia. XX RN [1] RP 1-10429 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Botryotinia fuckeliana genome."; RL Direct Submission to RU (25-FEB-2011). XX DR Genome; AAID01000422; Positions 940 11368. XX CC Positions [5829-6311] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 874..3378 FT /product="Gypsy-1_BFB-I_1p" FT /translation="MEPIKFEGEAGQMRERDSISDSASDTSIPITASNSRE FT QRDTPEAPGEGPLPQQVKEQRKTESVKQKQSANVFPQDGHVSAAQLAAILE FT QLSRANAPPLPLPGQEGMPWFQGSNATEWCEGMEQLRRNYRMNEEDFRLRI FT PLQVERTLREDVKAMKEWQQEGWDWERKFKPAFLKEHLADDIHQKMYSRDY FT IRRLAKKYKQEGEDNADALARFTRQYNQVAQRLIRDKASTKSEVTETFLKN FT IPKRAARKCIAGLKIDLKKPGTVEWDKVFQWVMQYTDNESQFRILEVSDSE FT EDSGLEELIDTVTSFKKTKVQAKSPVIPKADSIEQITKMLESFSAITQQEK FT APKRTEGKVQILDPHSNYRIPLATRPQMSTYYPSLGSNNVQDGMDDSVEYV FT NVAQRRYCYFCQNQYPGKEDHSFKKDCPVFKEYIDYGYIHHNQYGELFWGP FT KDTKENPSPIIHDHKTQGSLAKAVSLRAKSNGWDQPQREVEQVQLIEDIYT FT AEVLNVDSPENFTIGEYVNAVSSEPRKRGRPPHGPSKVHKPPVKSAKNQNS FT TLASKVLKYAHLMSPEEDTLSHNGAEPDTEPMDIQDPNEPEIIYERVVDKT FT KKPRKRMMPRLMTNTFGSIEELDRDILSSTINVSLGLILRECPAIWKRWTT FT RVGDPTMPSELNSHLINAINRLPTNLRPEAPEQVNHTDLVYTTNEDRESLP FT KVKVSVNGLGPVNAIPDSGSTINLIDSVLARKLNLPISPVQTTIVGISADS FT LNLRGICRDVRVALGGVSNVINIYVMENARNELLLGMPWFIAGEVSFTYRD FT GAQYLTVVDDDRETQASVWSAGQKFQTVMQSEN" FT CDS 3543..6989 FT /product="Gypsy-1_BFB-I_3p" FT /translation="MLERSDEGKHIISSQTFEELFNIHIVNDDVEQVNTMY FT KNKATKVRPSNQPCNITGEPPGGRDDWFERDQLRNPYQEPSGRWKEFLIPR FT FNKEPEGYRLTPERTKAIDCGDLLTDLERDILMACLLNREGALAFDWTHAG FT RVQEDVAPPQRIRTVPHEAWQTPGFPVPKALVEVVSTMLRDRIKQGVFEPC FT DGPYRNPWFLVKKKTPGAYRIINAVVELNRHTIRDANLPPDADAFAENFAG FT CTVASLIDFFSGYDQIPLALESRDLTAFQTPLGLLRYTTLPQGATNSVAQF FT CRIIMKILGDLTPSVAISFLDDIGIKGPKTVYDNKEIVPGVRKFVLEHIQA FT IDKTLERLERAGCAVGAKSKWCYDGMEVVGYVVGSEGRKPVEAKIKKILDW FT PEPKSATEVRMFLGVCVYYRIWVEKFAQKAEPLYRLLKKDVEFSWTIEQEQ FT AMKSLKENLIKPPTLMPLDYGDTMIVGQIVLGVDASIDGWGAHLGQESGGK FT RTVARFESGLWNDAEKNYDATKRECRGVLKALRKCRFDLYGIHFVLETDAK FT VLVAQLNRAATDLPGALVTRWIAWIRLFDFEVRHVKGTAHTAADGLSRRPA FT TLSDVSEEDKEEDIDEWIADQLNNVEDVSTIHNFANFSRSYELDKGYNSES FT RKIAQYLTSLQRPEGMTSQEFSKFKKRALGFSVREGCLYRNGTRDTPHRKV FT IDEQQQRTSLIGNLHEAYGHKGVESTFDKVHKLYYWNGMYEDVKRFVKSCP FT NCQKRQPNVQEEPLHPTWVSTCWQKCGLDIMYMPPDNGYKYLVSLRDDLSG FT WIEAKPLRNATSQAVAAFIWNVVCRHSVFGKVSVDGGPENKAHVEYFLRKY FT 
GIKRVQISAYNSKANGQIERGHRPIADGLAKVMDGKSGWLRHLDAMLFADR FT VTTHRPTNLTPFYVIYGRHPVLPIETQYSTWRVLDWEKVTSREDLLALRLR FT QLEMHDLDMEEAVMRKKRYRLDGKARFDKDNNVIDKKITAGDIVLQFDPQS FT KIDMSRERKLGYRWLGPYRVNKVLPNKSTYILEEMDGTPIDGTYSGARLKK FT FVKREGEYIPVGGEEEDTGDITEDELEDPLEALLNPRRSERIAQRFTTTED FT MIDHIQENQDVTPPTNTNTRVVVEIPAISPSTRRAFPTVLE" FT CDS 7405..10293 FT /product="Gypsy-1_BFB-I_2p" FT /translation="MSDEDNSTGSGHHSSDNQSPAPSGNEEEQDQAMMIDV FT SDIRLQHMRHNAIITAGNEAILAMMRAHEAIQTFPEEERYAKYLEFNNWVD FT AQKDCGDNLKEMSMITIIRDNLHIKANVSTKQMKIDNKETLTRIEKIRRMR FT QKAIDKLSLLWHHSRKGVTFATILIMTYGTRFQYELTKVVKLADPDTLLEV FT VNGELIRRLTSPPKPGVRVNRHLMPSEFRMAFKVLDPKQELKAFEWDDESR FT KQLGLKYNMVGMLGEESSDYSEDVPPAGDSHIEFSAIPSPPPNEEVIANLE FT RWSRLGVTVAEEDKEPRQNCLMDKWIVRQAAPLGSEEEAENVEQPESCKCP FT PEALVSKDWYLALKEKFVAKQIGWMTMAELIPRRPEGLCNFHLRHLANVLG FT LKTTRVKSEYLVHRLKTVYKRRDRIDDVRIHDKYFAWFKLNEGANKERKKR FT ILGVLKFRPLNPSASEPAFELEPPATLTKVHEVGRRRALATDPILTMLMDR FT CSDSVSASIKMYCHHNRYEKQGLGFAHHCYYAPWQQGCRQDLMLWREVAKQ FT RMDQEYRLVTVPFPALIIQAEEDFDRTFVHGNARDLYTVSDYKVADHITAY FT VVHSHDENGENTSQTEVVHGRVKPSALNDTDWRTLAESTLGKGKMIFDHGF FT FHDITAKRRHQSYTNLTKSDQYMNGRNNSWALVAGGNPTSIRPYMDAYGDE FT PFAATIVPYTAVKKRNDGTLDPILENGMKYQDLRDAHSNLKPIFNKTVTWD FT GQHEIPGFAAECHLITENPVEQALVGRLPWDNHSVLIEASLLLEMTDDEFH FT DWYRRHQKEIWHQHQRAFTDVISKESAYGEKSYYYVASVEPAAGQKRKYAS FT ADEEEGMLGFSTDTSGEDATPPPSPQPPSPAPSPQGTRRVKRRRRELTGQD FT RRLNDERDENEGVTGVMRGGLSNSRGLPGRPRGSRRIREVIPGERRSGRLA FT GRSSQPES" XX SQ Sequence 10429 BP; 3233 A; 2340 C; 2513 G; 2343 T; 0 other; ttggtgacct acgcagggaa gaaaacgtcg aaaggtcaga tagctatgtg cgaacctgag 60 ctcttggatc gtcttcagaa agcaagtggt tattgggtgc aataggtaga ttgaggtaac 120 gggactgccg gaatacctgt agatggtatg gtgtcgcaac tcgtgattgt tgcggtgagg 180 gagacattgt ttccctaaaa actcaaaacc gatatgtggc ctcttggtgt aaaagtgggg 240 gggagtggaa aaactttatt gttgtatcca catccaccac agcaccgaga gaacatatcg 300 tgactatctc gaacgaaata gcctcctgcg gaaacgccgg tatcgcttgg aagtcaaggg 360 agaatcttcg ctttttcaat ctacaagtca ggtgggcggg gggacaccag ggtaaacaag 420 gtgaaaactc 
aaaagtctac ggggctagaa aattcctata gagttctgat tatacgtggt 480 cagaatgggg gtggactagc agccatctga tgggtattca ccggcgatac aagggtgaat 540 tctctcttgc ccaatgatga aagggagaca aagagcggag atacttcctt agccaagagc 600 tattccgaaa actcgggaaa gtaggaggtt acgagttaga gaaagcccaa cgtcaatatg 660 ccgaaaaagt ggtgaaatgt tggttgtcgt ggatgcaatg ggtctctttg taacttcgca 720 aacttttcgc gaagttatga agaagggtaa aagactccga agttgtacga aaacaacggt 780 ataaagtctt gctacccttc caagattgca acctgcaaca accgatcaag aataaaacat 840 tagaaaaaga aggaaagaat catttcagac aaaatggaac ccattaaatt cgaaggagag 900 gctggtcaaa tgagggagag agactcgatc tctgattcgg cttccgatac atcaattccg 960 atcactgcat ccaactctcg ggaacagaga gacacccccg aagctccagg agaaggacca 1020 ctccctcaac aagtaaagga gcaacgaaaa accgagtctg tcaaacagaa gcaaagcgcc 1080 aatgtgttcc cgcaagatgg gcacgttagc gctgcacaac ttgctgctat tttggaacaa 1140 ctgagcagag caaatgcccc accattgccc ctccccggac aagaaggaat gccgtggttc 1200 caaggttcca atgctacaga gtggtgtgaa ggcatggaac aactccgacg caactacagg 1260 atgaacgagg aagacttccg tcttcgaatt ccattgcagg tggaaaggac tcttcgagag 1320 gatgttaagg ccatgaaaga gtggcagcaa gaaggttggg attgggaacg caaattcaag 1380 cctgcatttc tcaaggagca cttggcagat gatattcatc agaaaatgta ttcgagagac 1440 tacattcgtc gtcttgctaa gaaatataaa caggaagggg aagacaatgc tgacgcactt 1500 gcacgcttta cgcgtcagta taatcaagtc gcccaacgct tgattagaga caaagcttca 1560 accaagagcg aggtcactga aactttcttg aaaaacatcc ctaaaagagc tgctcgcaag 1620 tgcattgctg gactcaagat agacctgaag aaacctggta cagtggagtg ggataaagtc 1680 tttcaatggg taatgcaata tacagacaat gaatcccaat ttaggatcct cgaggtgagc 1740 gattctgagg aagattcggg tctggaagag ttgatcgata ctgtcacttc attcaagaaa 1800 accaaagtgc aagccaagag tcctgtcatc ccaaaggcag acagtatcga gcaaatcacg 1860 aaaatgcttg agagcttttc tgcaataacc caacaagaga aagctcccaa gagaacggaa 1920 ggaaaagtgc agatacttga cccccattcg aactacagaa tccctcttgc tacgcgaccc 1980 caaatgtcaa catactaccc ctctctgggc tcgaacaatg tgcaagatgg tatggacgac 2040 agtgtagaat atgtcaatgt cgctcagagg agatattgtt acttttgcca aaaccaatac 2100 cctgggaaag aagatcacag 
tttcaaaaag gactgtcctg tcttcaaaga gtacattgac 2160 tatggatata tacatcataa tcaatatggc gagctgtttt ggggaccaaa ggatacaaag 2220 gagaatccct cacctatcat tcacgaccac aagactcaag gttcccttgc taaggcggtg 2280 agcttgagag ctaagtcaaa cggatgggac caaccacaga gagaagtgga gcaagtacag 2340 ctaattgaag atatatatac cgcagaagtt ctcaatgtcg acagcccaga aaatttcact 2400 ataggagaat atgttaatgc ggtcagctct gaacctagga aacgcggccg acctccacac 2460 ggtcctagta aggttcacaa accacctgtc aagagcgcga agaatcagaa ttctactctt 2520 gcttcgaagg ttcttaaata tgctcacttg atgtctcctg aggaagatac tcttagccac 2580 aatggggctg agccagatac tgaacccatg gatatccaag accccaacga accagagatt 2640 atctatgaaa gggttgttga caagacaaag aaaccaagga agagaatgat gcctaggctc 2700 atgacgaata ccttcggcag tatagaagag cttgatagag acattctctc atcaactatt 2760 aacgtgtctc taggactgat actcagggaa tgtccagcca tttggaaaag atggacaacc 2820 agagtaggcg atcctactat gccttcagag ttgaacagcc acttgattaa tgccatcaac 2880 cgacttccta ctaatcttcg acctgaagcc ccggagcaag taaaccacac cgacttagta 2940 tacactacaa atgaagatag agaaagcctc ccgaaggtta aagttagtgt caacggccta 3000 ggtccagtca atgcgatccc ggactcagga tccacgatca acctgatcga ctctgttttg 3060 gctagaaaac ttaatctacc catcagccca gtccagacaa ccattgtggg catcagtgca 3120 gactccttaa atctaagagg aatttgtaga gacgttcgtg ttgcattggg aggagtctcg 3180 aatgtgatca atatttacgt tatggaaaac gctcgcaacg aactattgct aggtatgcca 3240 tggtttattg caggagaggt tagctttaca tacagggatg gtgcacaata ccttacagtc 3300 gtggatgatg acagagaaac tcaagctagt gtttggagtg ctgggcagaa gtttcagaca 3360 gtgatgcaat cggaaaacta ggcactgaca cattgagggg acaatgtgtc ggcaatttcc 3420 ccaattcccc cagttctcgt aacttcgcaa acttttcgcg aagttatgac gatggcaaga 3480 cagagcattt tgcaagaata tatgcttccc ttagtaccga atttcaggag ttctgcttca 3540 acatgttaga aaggtcagat gaaggaaaac acattatatc ctctcaaacc ttcgaagaat 3600 tgttcaatat tcacatcgtg aatgacgatg tggaacaagt aaatacaatg tacaagaaca 3660 aagctacgaa ggtccgcccc tcgaatcaac catgtaacat cactggagaa ccccctggcg 3720 gaagagacga ttggtttgag cgtgatcagt taagaaatcc gtatcaagag ccatccggga 3780 gatggaaaga gtttcttata ccccgcttca 
acaaagaacc tgaagggtat cggctcactc 3840 cagaaagaac aaaggcgatt gattgcggag atcttttgac ggatttggaa agagacatac 3900 tgatggcctg cctcctcaat cgagaagggg ccttagcatt cgactggaca catgctggac 3960 gggtacaaga ggacgtagcc ccaccacaaa gaattcgaac tgttccccat gaggcttggc 4020 aaactcctgg tttcccagtt ccaaaagcct tagtggaagt ggtttctact atgcttagag 4080 atagaataaa gcaaggagtc ttcgaaccat gcgatgggcc ttatcgtaat ccgtggttct 4140 tggtgaaaaa gaagacccca ggcgcctatc gaatcatcaa tgctgtggta gagctgaatc 4200 gacacacgat tcgagatgcg aatctacctc ctgatgcaga tgcttttgcg gaaaattttg 4260 caggctgcac ggtagcttca ctaatcgact tcttttctgg ttatgatcag atacctctgg 4320 ctctcgagag cagagactta acagctttcc agaccccgct tggcctcctc cgatacacta 4380 cacttcccca aggagcaacc aactcggtag ctcaattctg tcgtatcatt atgaaaattc 4440 ttggtgattt gactccgtca gtggcaatat cctttctgga cgatatcggc atcaaaggac 4500 cgaaaacggt atatgacaac aaagaaatag taccgggtgt gcgaaagttc gttctagaac 4560 atatccaagc aattgataaa acccttgaac gccttgagcg cgcaggctgt gctgttggtg 4620 caaagtccaa gtggtgctat gatggaatgg aagtggtagg ctatgtagtt ggatcagaag 4680 gacgaaagcc tgtggaagcc aaaatcaaaa agatccttga ctggccagaa cccaaatcag 4740 caacagaagt ccgaatgttt ctaggtgtct gcgtgtacta caggatatgg gttgaaaagt 4800 ttgcacagaa agcggaacca ttgtatcgct tgctcaagaa ggatgttgaa ttctcatgga 4860 caattgaaca agaacaagcg atgaagtcgc tgaaggagaa cttaatcaag cctccaaccc 4920 tgatgccttt agactatgga gatacgatga ttgttggaca aattgtcctc ggagtggacg 4980 ctagcatcga tggatgggga gcacacttag gacaagaatc tggaggaaaa cgcaccgttg 5040 cgcgctttga gagtggacta tggaatgacg ccgaaaagaa ctatgatgcg acaaaaaggg 5100 agtgtagagg agtgttaaaa gctcttcgaa aatgtagatt cgatctatac gggatacact 5160 tcgttctgga aacagatgcc aaagtcttag ttgcccaact gaatcgagcg gcaacagacc 5220 tccccggagc tctggtcact cgatggatcg cgtggattag gctattcgac tttgaagtgc 5280 gacatgttaa aggcacagcc cacactgctg ctgacggact ttcaagaaga ccagccacac 5340 tctctgatgt atctgaggaa gataaggagg aagacattga tgagtggatt gcagatcaac 5400 tcaacaatgt ggaagatgtt tctactatac ataacttcgc aaacttttcg cgaagttacg 5460 aattggacaa gggatacaat agtgagagcc gaaagatcgc 
tcaatatcta accagtctgc 5520 agcgtcccga aggcatgaca agccaagaat tttcgaaatt caaaaagcgt gccttaggat 5580 tctccgtacg tgaggggtgt ttatacagaa atgggactcg tgatacaccg caccgcaagg 5640 ttatcgatga gcaacaacaa aggacatcac taatcggtaa cttgcacgaa gcgtacggtc 5700 acaaaggggt tgaaagcaca tttgacaagg tacacaagct ctattattgg aatggaatgt 5760 atgaagatgt caaacgattc gtcaaatcct gccccaattg ccagaaacga caacccaatg 5820 tccaagaaga acccctgcac cctacgtggg tgagcacctg ttggcaaaaa tgcggcttag 5880 atatcatgta catgcccccg gataacggct acaaatacct tgtttcttta cgagatgatt 5940 tatcaggttg gatcgaagcc aagccattga gaaatgcgac gtcgcaagcc gtcgctgctt 6000 tcatttggaa tgtcgtttgt aggcattcag tctttggcaa ggtttctgtg gacggaggac 6060 ccgaaaacaa ggcacacgtc gaatatttct tgagaaagta tgggatcaaa cgggtgcaga 6120 tctccgccta caactcgaaa gcaaacggcc aaattgaacg aggacacaga cccatagctg 6180 acggactggc aaaggtgatg gatggtaaaa gcggatggct gcgtcacttg gacgcaatgt 6240 tgtttgctga cagggtgacc acacaccgac cgaccaatct gactccattc tatgtgatct 6300 atggacgcca cccggtgtta cccatagaaa cgcaatatag cacatggaga gttttagatt 6360 gggaaaaggt tacttcccgt gaagatctct tggcgttacg tcttcgccag ttagaaatgc 6420 acgatttaga catggaggaa gcggtaatgc gcaagaaacg atatcgtctg gacggaaagg 6480 cgcgttttga caaagacaac aatgtgattg acaagaaaat cactgcaggg gatatagttc 6540 ttcaatttga tccgcagtcg aaaatcgaca tgtctagaga aaggaaactt ggataccgat 6600 ggcttggtcc ataccgtgta aacaaagtct taccgaacaa gagcacttac attctggaag 6660 aaatggatgg aactccaatt gatgggactt attcaggagc ccgccttaag aagttcgtca 6720 agagagaggg agaatacatc ccagtcggag gagaagaaga ggatacggga gatatcactg 6780 aagatgaact tgaagaccct ctggaagcat tgctaaatcc aagaaggagc gagcgaatcg 6840 ctcaacgctt tacgacaaca gaggacatga tagatcacat acaggagaat caagatgtca 6900 caccaccgac aaacaccaat actcgtgtcg tagtggaaat cccagccatt agcccatcaa 6960 cacgacgcgc ttttccaaca gtgctagagt aattgacggg gcgtcaacca tacaccaagg 7020 tagggtagaa tggtaaactt cgtttgtctt tgttagaaat cgagaacttc cttattcggg 7080 ttaagaatgc gtctctttgt attataccct ttcattgcct ttataattag atcgcttaga 7140 gaaccacatc taagtccaga agagctagaa aagttctata aaggcagaca 
gagtttccat 7200 tttctctctt cacttgaggt tcaaaacact acaacccttt actcagcagc tccctatctc 7260 aaaaatatct ctcttgcctc ctttaaatcc tacttctcac taattccctt ctcttatatc 7320 ttttcttaaa aatactttca ccttttctgt aaaatataac ttcgcaaact tttcgcgaag 7380 ttacattcaa cacagacttt cacgatgtcg gacgaagaca atagtacagg ttcaggccac 7440 cattcatcag acaaccaatc tccagcccct agtggcaatg aggaagaaca agatcaggcc 7500 atgatgattg atgtgtcaga catacgactt cagcacatgc gacacaatgc gataataact 7560 gcagggaacg aagctatact agcaatgatg agggcccatg aagccatcca aacatttccg 7620 gaagaagaac gatacgccaa gtacttagag ttcaataact gggttgatgc tcagaaagac 7680 tgcggtgaca acctgaaaga aatgtccatg atcaccatca ttcgagataa tctccacatc 7740 aaagccaatg tatcgacgaa gcaaatgaag atcgacaaca aagagactct cacaagaata 7800 gaaaagattc ggcgcatgcg acaaaaggct atcgacaaac tcagcctgct gtggcaccat 7860 tctcgtaaag gcgttacctt cgccacaatc ctcatcatga catacggaac taggttccaa 7920 tatgagctaa ccaaagtagt caaattggct gaccccgaca cgttactaga agtggtcaat 7980 ggagagctca tacgccgact gacaagcccc cccaagcctg gagtgcgggt taacagacac 8040 ttgatgcctt ctgagtttag gatggctttc aaagtcctcg accctaagca agaacttaaa 8100 gcctttgagt gggatgacga gagtcgaaaa caacttggcc ttaagtacaa tatggtcgga 8160 atgcttggag aggaaagttc cgactactcg gaagacgtcc caccggcagg agactcacac 8220 attgaattca gtgccatacc atctccacct cccaatgagg aagtgatcgc aaacctggag 8280 agatggtcaa gattgggagt cacagtggct gaagaagaca aagagccgcg acaaaattgc 8340 ttgatggata agtggatcgt gagacaagct gcaccattgg gaagtgagga ggaggcagaa 8400 aatgtggaac agcccgaatc gtgcaaatgc ccaccggagg ctctagtatc aaaggactgg 8460 tacctagcct tgaaagaaaa attcgtggcc aaacaaatcg gctggatgac catggccgaa 8520 ttaattccgc gcagacctga aggactttgc aacttccacc tccgccacct agcgaacgtg 8580 ctaggcctca aaactactcg agtgaagagc gagtatcttg tccaccgtct caaaactgtg 8640 tataaacgca gggaccgtat cgacgacgtg cggatacacg acaagtactt cgcctggttc 8700 aagctaaacg aaggcgcgaa caaggaaagg aagaagagga tccttggagt tttgaagttt 8760 cgacccttaa atccctcagc tagtgagcca gcgttcgaac tggagccacc cgccaccctc 8820 acaaaagttc atgaagtagg tagacgaaga gcgttggcta ctgatccaat cctcactatg 
8880 ttgatggacc gatgttccga ttctgtgtct gcatcgataa aaatgtattg tcaccacaac 8940 cggtacgaaa agcagggact tggcttcgct catcactgct actacgcccc ttggcagcaa 9000 ggctgcagac aagacctgat gctttggaga gaagtggcta aacaacgcat ggatcaggag 9060 taccgcttgg tgacggtacc attccctgcg ctcatcatcc aagcagagga agactttgat 9120 cgcacctttg tgcatggaaa tgctagggac ctgtataccg tatccgacta caaggtagcg 9180 gatcacatta cagcgtacgt cgtacattcc catgacgaaa atggtgagaa tacgtcgcaa 9240 actgaagtcg tacacggaag ggttaaacct tccgctctga acgatacaga ctggagaacc 9300 ttggctgaaa gtacgcttgg caagggcaag atgatcttcg atcatggatt ctttcatgat 9360 atcacagcta agagacgaca ccagagttac accaacttga cgaagtccga tcagtacatg 9420 aatggacgga ataactcgtg ggcactcgta gctggaggga accctacctc tatccggcca 9480 tacatggacg cgtacgggga tgagccgttc gcagccacaa ttgttccata tacggctgtg 9540 aagaaaagaa atgatggaac cttggatccg atcctcgaga atggcatgaa gtaccaagac 9600 ctgagggatg cccacagcaa cttgaagcca atcttcaaca agacagtcac gtgggatgga 9660 caacacgaaa ttccagggtt tgctgccgaa tgccatctga ttactgaaaa ccctgtcgaa 9720 caagctcttg tcgggagatt gccatgggat aatcactccg ttctcatcga agcttcactc 9780 ctcctcgaaa tgactgacga cgaattccat gattggtacc gtcgacatca gaaagaaatt 9840 tggcatcaac atcaacgcgc tttcactgat gtcatctcta aagaatcggc ttatggtgaa 9900 aaatcgtact attatgtagc ctctgtagaa ccggcagctg gacagaaacg aaaatatgct 9960 tcagccgatg aagaagaagg aatgctggga ttctctacag acaccagcgg tgaagatgct 10020 actccccctc catcgcccca acccccgagc cctgccccct ctccccaagg aaccagacgt 10080 gtaaaacgaa gaagacgcga actcacggga caagatagga gattgaacga cgaaagagac 10140 gagaacgagg gagttacggg cgtaatgaga ggtggccttt ccaacagtcg aggtctccct 10200 gggcgaccaa gaggttcaag aagaatacgt gaagttattc ctggagagag acgcagtgga 10260 aggcttgctg gtcggagttc tcaaccagag tcgtaaagga agacgataaa cgcatttcat 10320 aacttcgcaa acttttcgcg aagttatgat cgcagcaaaa agagcattgt tgtttggatt 10380 cttgacttta attgatcggg cgatcaattg accaaagtgg ggaggaatg 10429 // ID Gypsy-1_CAl-LTR repbase; DNA; FNG; 313 BP. XX AC . XX DT 04-APR-2011 (Rel. 16.04, Created) DT 04-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Candida albicans genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_CAl_; KW Gypsy-1_CAl-I; Gypsy-1_CAl-LTR. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-313 RA Jurka J.; RT "LTR retrotransposons from the Candida albicans genome."; RL Direct Submission to Repbase Update (04-APR-2011). XX DR [1] (Consensus) XX SQ Sequence 313 BP; 106 A; 57 C; 29 G; 121 T; 0 other; tgacgatcct gcatatttcg tcataattca cacattctta aaattatgca cacatccttg 60 aaatgtgtta atattcccaa cattatcaat tatatgtgtt caaaattggt tgcaaagtta 120 tcaactcaat tcacgctata taaaccttac aatttctcta catttttata tttttttata 180 ttggctttct tttagaatca atcaatactt tttttatcat ttagatacat ctttcatcta 240 ttaatagatt atctttctat atatcaaaac acgacacagt cacgtgccaa aaaggatata 300 agaaggaact tca 313 // ID Copia-23_MLP-I repbase; DNA; FNG; 5756 BP. XX AC AECX01001016; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-23_MLP_; KW Copia-23_MLP-LTR; Copia-23_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5756 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001016; Positions 9493 3738. XX CC Positions [2546-2818] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 690..2642 FT /product="Copia-23_MLP-I_2p" FT /translation="MNPNSSTNVNPESKPSKSSRSKPPTETTQPSLSQSKN FT SIPFPLIENKPSMTDQKEFSLSSYSNIVKLTSYTFNDWKLKLITVLGGQRL FT SKFLLKDLPEPTDPTELEDFETNSSRALAAIHATIDGENFQVIRTCTSPRQ FT AFKALCEHHDDAGGLSTAHLFSDLVSIKLHPDEDLSEHIAKFRKIHNDILS FT NLASTPDFKISEPFIAIILIKSLPSDYTPLVQSLLTNFETLTLTRLYSLLK FT IEATRNASSANTDTALSANRPRNNYKSKANRKNNRSGNHTPSTSNLRCSLG FT HSGHTDENCRTRKYKAFLEFEKQQNSKSSNHATTSVSAQLSQSIPDADEDV FT SYWESAFSATTSSKDPIICDTGATSHMFSDSSLITDLQSIKPTRIGVASQD FT GAIWAKHKGIVRFESLILRDVLYSPELTGNLISIGRLCDDGFNASFAKTIG FT TITDSTGKTVLRMRRNTSTNRLWTPIFDKPVPRAFFSYTDAEKFATLWHRR FT LGHLHPDAVILFLKRHKMLPLSRKNFLPCDSCAMGKLTQAPATHSFHRSPG FT VLNLVHSDLIGPISPSSKTGFKYVVTFIDDHTRYSTVYLLKSKNQTFDAFK FT QYKSLMEKKCGSKIMKLKSDRGGSIHLLNLSSSSKMRGSKKKKVQHIDQLL FT TL" FT CDS 2795..5230 FT /product="Copia-23_MLP-I_1p" FT /translation="MEVFEANLPGHVHPFDTDRLKPFGSLCFAVDRNQKSK FT VGPIAKRFIFLGLEDGARAARLWDKQTGCVLVTGDVVYWEDIFPAQHPSLS FT PKVQDELILPEYTTKRVSSTPESEPSLSPKPPKIASPNCFEVLQDTTESNE FT DESDGDKPPPSWPYQPKETIPLSGSIHAPKDIEPITDADTVNHPQPLLVKD FT NAIRDHTSISSKSSPSVRSAPSPHSSASSISDLIDQNPEDTPDVISPVSIH FT SSPLSSIKSLPGRESIDHTPPRVITPSPKTPPQVTVSPKPPTPPSVVSPKS FT PISVPDVTPTPQASSSTKPPPAPPPRIKAAKQSAPVAVRKSSRVSKPPERY FT GFLTESGVSKSSSSPMLITNSWTVIPNHYGFSATTGSDPDSPTFSQAMTGP FT DRQAWINAMQHEFDSLTEHGVGKLVDPPPDANILGGMWIFNKKRDEHNHVV FT CFKARWVVLGNHQIKGLDYDDTYASVGKIDSLRILLALSTIKPKAKGKRRM FT KVRQFDVVTAFLNGDMKDRVYAKQVTGFENPTLRHRVWLLIKSLYGTKQAA FT RRWQQHFGATAAGFELFPIDSDTAVYVLRTSLGLLILHLHVDDSLIFCDDD FT DLFDKFKLFIDSKYKLKWNDKPTLYLGIKLEISDDNSEIKISQSHYLDAVL FT ERFAMLNCKPAKSPLPQKLILTPGTEEEVEEAKNIPYQELVGCLQWISTCT FT RPDISYAVSQLSKFNSSWTITHWTAAKHLLRYLRGTQDLAITYSGGIAEPQ FT AYSDSDFSQCSLTRKSVTGFVVTVANGAVSWKSQHQPVVALSTSEAEYIAA FT TECSKHMAWV" XX SQ Sequence 5756 BP; 1666 A; 1366 C; 1066 G; 1658 T; 0 other; gtcgctaaag gctgaattca gctacccttt cgacccgggg ctggccccgg tcccaatttt 60 gggggtttga atttccaaaa aaatagtttt ttagcatctt caaatcttca agaacccatc 120 atattatacc agaattatag ctttcagggc tcaaacgaga 
gtctataagg cccaacagac 180 ttccaactag tcaaaaccag tgttcaaggt ggtaattgat ctgattttgt ggccaacctt 240 ttgttacgac tagagctatt aagtacttaa gctagcgggc ctaagtactt aggtggctaa 300 ggtactacgt ttattagcat aatacagcct aaatatatgc ttaggtccca aaatgtggga 360 attttagtaa aaaaaaaaat caacttttga accgttcaac ttttttgatt ttttttcaca 420 ttttcagatt gagctcattc agcccattcc ataaatgtcc tggtatagag tgtagtgtag 480 aattgagtta gggtaacacc gggtcgaagt tgaaattttg attttgcggg caaggggtag 540 ttggaatctg gctttagcga ctgtaacagg ttatgagccc agcgtgatcc acgtctgcgg 600 tactctattc catcacattt ctagcagatc agaattgaac cttatcagtc tagtccttct 660 ccgatcttcg aattacgccc gagacgtcaa tgaacccaaa cagctccaca aacgtcaacc 720 cagaatcgaa accaagcaaa tctagtagat ctaaaccgcc aactgaaact actcaacctt 780 cactttctca atctaaaaac tcaatcccat ttcctttaat tgaaaacaaa cccagtatga 840 cagatcaaaa agagttttct ttaagttcat actccaacat agtaaagcta acttcttaca 900 cgttcaatga ttggaagctt aagttaatta ctgtgttggg gggacagcgt ttgtctaaat 960 tcttactgaa agacttacct gaacccaccg atccaaccga attagaagat ttcgaaacga 1020 attcatcgag agctttagca gctattcatg caacaattga cggagaaaac tttcaagtca 1080 tcagaacgtg tacatcgccg aggcaagctt tcaaggcttt gtgtgaacat catgatgatg 1140 cgggaggtct ctctactgct cacctgttct ctgatctcgt ttcaatcaaa ctgcaccccg 1200 atgaagattt atcagaacac attgcaaaat tccgaaaaat tcacaacgat atacttagca 1260 atcttgcatc tacccctgat tttaagatat ctgagccttt cattgcgatc attcttataa 1320 aatctcttcc ctctgattat acccctttag ttcaaagtct tctcacaaat tttgaaacac 1380 ttactcttac tagactgtac tctcttctca agattgaagc gactcgcaac gcgtcttctg 1440 ccaacacaga caccgcccta tctgccaacc gaccacgtaa caactacaag tcaaaggcta 1500 ataggaagaa caatcgatct ggaaaccaca caccatccac ttcgaatctc aggtgctcac 1560 ttggtcattc aggacatacc gatgagaact gtcggactag gaagtataag gcttttcttg 1620 aatttgagaa acaacaaaat tcaaagtcat ccaatcatgc aactacctcc gtttctgctc 1680 aactgtctca atccattccc gatgccgatg aggatgtatc atattgggaa tcagctttct 1740 ctgccacgac ttcttccaaa gatccaataa tctgtgacac tggagcaact agtcatatgt 1800 tttctgattc ctctctcatc actgatttac aatccatcaa accaacacgt attggagtgg 
1860 cttctcaaga tggggcgata tgggcaaagc acaagggtat tgtcaggttc gaatctctaa 1920 tccttcgaga cgttctctat tctcccgaat taactggaaa tctaatctct attggacgcc 1980 tgtgcgatga tggtttcaat gcttctttcg cgaaaaccat tggcaccatc acagattcga 2040 cagggaaaac ggtccttcgg atgcgacgca acaccagcac caatcggctg tggactccaa 2100 tctttgacaa accagtcccg agagctttct tttcttatac cgatgctgag aagtttgcta 2160 cgctctggca caggagatta ggtcaccttc atccagatgc tgttatctta tttctcaaac 2220 gtcataaaat gcttccttta agccgaaaga attttcttcc ttgtgattct tgtgcaatgg 2280 ggaagcttac ccaagccccg gcgacccact cttttcatcg atctcctgga gtgctcaatc 2340 ttgtacatag tgatctgatt ggtcctattt ctccctcttc caaaaccggt ttcaaatacg 2400 ttgtcacttt tatcgatgat catactcgtt atagtacagt ttatcttctg aaatctaaaa 2460 accaaacttt tgatgctttt aaacaatata agtccctcat ggaaaagaag tgtggctcca 2520 aaatcatgaa gttgaaatcg gatagggggg ggagtattca tctgctcaat ttatcaagtt 2580 cctcaaagat gaggggatcg aagaagaaaa aggtccagca catcgaccaa ctgctgactc 2640 tgtagctgag cgatataata gaaccctact gagtaaagtt agatctcaat taattcattc 2700 cggcttgcct ctttcacttt ggggggaatt agtcaaatac acctgtcttc aaatcaattg 2760 ctcaccttca gcagctttac aaaatctgtc accaatggaa gttttcgagg caaatcttcc 2820 gggacatgta catccgttcg acacagaccg cctaaaaccg tttgggtctc tgtgttttgc 2880 tgttgatcgg aatcaaaaat ctaaagtcgg accaatcgcc aagcgtttca tctttttagg 2940 attggaagat ggcgcaagag cagctcgtct gtgggacaag caaaccggtt gtgtcttggt 3000 caccggtgat gttgtgtatt gggaggatat ctttcccgct cagcatcctt ccctgtcccc 3060 caaagtacag gatgaactca ttctgcctga gtacaccact aaacgtgttt cttcgacgcc 3120 ggaatcggaa cccagtttgt ctcccaaacc acccaagata gcttcaccaa attgtttcga 3180 agtgctccaa gacacgactg agtcgaatga agatgaatct gatggagaca aacctcctcc 3240 atcgtggcct tatcaaccaa aggaaacgat tcctctatca ggttctattc atgctcccaa 3300 ggatattgag ccgataacag atgccgacac agtgaatcac cctcaacctt tgttagttaa 3360 agacaatgct atacgggacc atacatcaat atcctcgaaa tcatctccat ctgttcggtc 3420 tgctccatcc ccacactcat cagcttcgtc gataagtgat ctgatcgatc agaacccaga 3480 ggatacaccg gatgtcatct ccccggtctc tattcactcc tcaccgttat cttcaattaa 3540 
atctctacct ggccgggaga gtattgatca tacacctcca cgggttataa ctccttcacc 3600 caaaaccccg ccacaagtga cggtctcgcc aaaaccaccc acaccacctt cagtggtatc 3660 gccgaagtct cctatatcag ttcctgatgt gactcctact ccacaagcct cgtcgtcaac 3720 taagccacca ccagcgccac cgcctcgcat caaagctgca aagcaatctg caccagtggc 3780 ggttcggaag tcttcaagag ttagcaaacc accagagcga tatggctttc tcacggaatc 3840 tggtgtctcc aaatcatctt cgtctccaat gttaatcacc aactcttgga ctgtcattcc 3900 aaaccattat ggattttcgg cgacgacagg ctctgatcct gatagcccga ctttttctca 3960 agcaatgaca ggtcccgaca gacaggcatg gataaatgct atgcaacacg agtttgattc 4020 actcactgag cacggcgttg gcaaattagt cgacccacct cctgatgcta atattctggg 4080 gggaatgtgg atttttaaca agaagcgaga tgaacataat cacgtagttt gtttcaaagc 4140 gcgttgggtt gtgttgggta atcatcaaat caaaggtttg gactatgatg atacatatgc 4200 atctgttggc aagattgact ctctccgtat tcttttagcg ctcagtacaa ttaaacccaa 4260 agctaagggt aaacgtcgaa tgaaagtacg tcaattcgac gttgtcactg cttttctcaa 4320 tggcgatatg aaggatagag tttatgcaaa gcaagtcact ggttttgaaa atcccaccct 4380 tcgtcatcgt gtttggctgc ttattaaatc actttatggc accaagcaag ctgcaagacg 4440 atggcaacaa cactttggag caacggctgc ggggtttgaa ctctttccaa ttgattcaga 4500 cactgcggta tatgttcttc gaacttcatt aggattactc atacttcacc ttcatgttga 4560 tgactcgtta atattctgtg acgatgacga tctatttgac aaattcaaac tattcattga 4620 ctctaaatac aaactcaagt ggaatgacaa gccaactcta tatcttggta tcaaactcga 4680 aatctctgat gataactctg aaattaaaat atctcaatct cattaccttg atgctgtttt 4740 ggaacgattt gcaatgttaa actgcaagcc ggcaaaatca cctctacctc aaaaacttat 4800 tcttacacca ggcactgagg aggaagttga ggaagctaaa aatataccat atcaagaatt 4860 ggttggttgt ttacaatgga tatccacctg cacaagaccc gatatatcat acgctgtatc 4920 tcagttatca aaatttaact catcatggac cattactcac tggacggccg caaaacatct 4980 tcttcgttat ctacgtggta ctcaagattt agctattact tactctgggg ggattgctga 5040 gccacaagct tactctgatt ctgatttctc tcagtgctca ctaacccgaa agtcagtaac 5100 cggatttgtt gttaccgtag caaatggggc tgttagctgg aaatctcaac atcaaccggt 5160 tgttgcttta tcaacatctg aagcggaata cattgctgcg acagagtgtt caaaacacat 5220 ggcttgggta 
tgatctttct acttcgacat catgcatcag ctcgaatcac caacgccttt 5280 ctatattgat aacacctcag caatctttac ggcaacaagt gatggcatca aagcacgttc 5340 aaagcacatt gatagacgtc accactatat tagagatctt attcaatcca atagtatcat 5400 catttaccac attccaagcg aagagatgtt agcagaccat ttgaccaaac cattaggacc 5460 agtagcttta aatcatgcac tcaagataaa taatatgatt tagaaatgcg ttgaaatagg 5520 ggggatgtta aagtaaaagt gtaaatgtta cattcagatt tcacgattta tgatatttca 5580 cgtatatgta tttctttctc atttctaatg tttgacatgt ttgtgattgt tacgcttaca 5640 agaatagttg tactcatttt aaacctaact ttaatcatac tgtacagttc atatacctga 5700 tggagcagcg aattctgaat tcttctttcc tcatccaagt taatctcatt acttct 5756 // ID Gypsy-109_MLP-LTR repbase; DNA; FNG; 151 BP. XX AC AECX01000596; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-109_MLP_; KW Gypsy-109_MLP-I; Gypsy-109_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-151 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000596; Positions 44019 44169. XX SQ Sequence 151 BP; 40 A; 39 C; 24 G; 48 T; 0 other; tgttataacc ttataacgag ttatgcttgt agggagaaca gtgcagcgct cccttgaact 60 ctgcgcttag accttgtttt gttccctaac tacaatctca gttataccta aagatcatct 120 ctgacctctg atcccaatca cgttcataac a 151 // ID Gypsy-16_MLP-LTR repbase; DNA; FNG; 328 BP. XX AC AECX01001344; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_MLP_; KW Gypsy-16_MLP-I; Gypsy-16_MLP-LTR. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-328 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001344; Positions 205310 205637. XX SQ Sequence 328 BP; 73 A; 70 C; 74 G; 111 T; 0 other; tgtaaggtgt tacataggga cgtacagcag tcatggctga agacaggttc tgatcaatag 60 ttgtcaacgg ccctagctcc acgaagatct ctttcttcac cgaaatcttc ggctggagct 120 aggtgagcta tcatttctct tttcatctct tttctctcta gtttgtttct cttccatttg 180 ttatcagtag agcgtaggaa ctgacttcac cgaaatcttc ggctggagct agaattgtgc 240 taggaattta tagaagggtt ttagtttaga tttagagcct tgctctcgtg cctcgtcatc 300 agtgaagact ccgctttgga gtcttaca 328 // ID Hop repbase; DNA; FNG; 3299 BP. XX AC . XX DT 09-MAY-2005 (Rel. 10.05, Created) DT 29-SEP-2005 (Rel. 10.1, Last updated, Version 2) XX DE Mutator-like transposon, partial consensus. XX KW MuDR; DNA transposon; Transposable Element; Interspersed repeat; KW mutator; Hop; MUDR1_FO. XX NM MUDR1_FO. XX OS Fusarium oxysporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; OC mitosporic Hypocreales; Fusarium; OC Fusarium oxysporum species complex. XX RN [1] RA Chalvet F., Grimaldi C., Kaper F., Langin T. and Daboussi M.J.; RT "Hop, an active Mutator-like element in the genome of the fungus RT Fusarium oxysporum."; RL Mol Biol Evol 20(8), 1362-1375 (2003). XX RN [2] RP 1-3299 RA Gentles A. and Jurka J.; RT "Mutator-like transposon."; RL Direct Submission to Repbase Update (09-MAY-2005). XX DR [2] (Consensus) XX CC 99 bp terminal inverted repeats. 
XX FH Key Location/Qualifiers FT CDS 193..2700 FT /product="Hop_1p" FT /translation="MDSIGIHRQFPDDALPPEGHYSSREELRSAINAWAAP FT RGYAFVIKRSSKTANGRTHVIFNCDRGAGRIPSLSDRRQTTTRRTGCLFSV FT LAKESLCKTIWSLRHRPGPHFSQHNHEPSFSEVAHPTLRQLSRQEEITVNQ FT LTNAGIAPKEIGSFLRITSNTLATQQDIYNCIAKGRRDLSKGQSNIHALAD FT QLNEEGFWNRICLDESSRVTAVLFAHPKSLEYLKTYPEVLILDSTYKTNRF FT KMPLLDIVGVDACQRTFCIAFAFLSGEEEGDFTWALQALRSVYEDHNIGLP FT SVILTDRCLACMNAVSSCFPGSALFLCLWHINKAVQSYCRPAFTRGKDNPQ FT GLGGESEEWKEFFNFWHEIVASTTEDIYNERLEKFKKRYIPDYINEVGYIL FT ETWLDLYKKSFVKAWVNTHLHFEQYATSRVEGIHSLIKLHLNHSQVDLFEA FT WRVIKLVLMNQLSQLEANQARQHISNPIRESRVLYSNIRGWISHEALRKVE FT TQRERLLKEVPVCTGVFTRTLGLPCAHSLQPLLKQNQPLLLNHFHSHWHLR FT RPGSPRFLIEPRKQFDRLTASSTLPPTSTQREPSTFERIEKALQPKAPPKC FT SRCHQQGHMMTSKACPLRYKHLLQAPTQTSTIQAPTQDSTTTHIITHTTTR FT SITRSLSPSSSSGSSIVSEIVAYTTTHTTHTTTRVLSPPAARPGTVPAANA FT IVVPALRADDPRAIFQRYKEAREAWFTTLPRGAYKTNQQYRRAMGLPLRYS FT KAEYDWCLDYKQMGSHCKVGNGTRDWTKEEMMSYLDWDRAENDRVEQSVEI FT EMAEQPFSRRRGMQDIWDAAERDIMLQESIFQGR" XX SQ Sequence 3299 BP; 973 A; 769 C; 711 G; 846 T; 0 other; gggaagccat acctgcacga ccgatggtgc acctaagatc acgtgacctt cgcaccaccg 60 gtcgtgcgaa taagatccgt tatgtaagag gcataggtgg ccttatggaa agggggtaaa 120 tttgcacccc aattcaacga tcaacagttt ctgcagcatc aattcaactt gccgtgcctt 180 tcttctacta taatggactc aattggcatt caccgtcaat tccctgatga tgctcttcct 240 cctgaaggcc actacagctc acgggaggaa ctacgttcag caattaatgc ttgggcagcg 300 ccacgaggtt atgcctttgt gatcaagaga tcttcgaaga ctgccaatgg aagaactcac 360 gttatcttca actgtgaccg tggagcaggg cgtattccct ccctttcaga ccgtcggcaa 420 actacaacac gtcgtacagg atgcctcttc tctgtattag caaaggaaag cctgtgtaag 480 accatatgga gtctcaggca tcgtcccgga cctcatttca gtcagcacaa tcatgaacca 540 agcttcagtg aagtggcaca tccaacactt cgtcagctat cacgccaaga ggaaataaca 600 gtcaatcaac tcaccaatgc cggcattgcg ccaaaggaga ttggatcctt cttacgcatt 660 acctcaaata cacttgctac gcagcaagat atctataatt gcattgcgaa gggcagacga 720 gatctctcta agggccagag taacattcat gcccttgcag atcagctcaa tgaggagggc 780 ttctggaatc gaatatgcct tgacgagagc 
agtagagtta cagcagtatt atttgcacat 840 ccgaagtcac tggaatacct taaaacatat cctgaagtgc ttatattgga ctctacatat 900 aagaccaata ggttcaagat gcctctcctt gatatagttg gagttgatgc ttgccaacgg 960 accttctgta ttgcatttgc attcctcagt ggtgaggaag agggtgactt tacctgggct 1020 cttcaagcgt tacgatctgt atatgaggat cataacatag gccttccatc tgtaatactt 1080 acagatcggt gcctcgcttg tatgaatgcc gtttcctcct gtttcccagg ttcagcccta 1140 ttcttatgcc tatggcatat caataaagca gttcagagct attgcaggcc tgcatttact 1200 cgaggtaaag acaatcctca aggtcttgga ggagagtctg aagagtggaa agagttcttt 1260 aacttttggc atgagattgt agcttcaacg actgaggaca tctataacga gaggcttgag 1320 aagttcaaga aacgttatat ccctgactat atcaatgagg tgggctacat cctggaaacc 1380 tggctagatc tctataagaa gagcttcgtc aaggcttggg tcaacactca ccttcacttc 1440 gagcaatatg ctacatcacg ggttgagggc attcattcgc ttatcaaatt acatttaaac 1500 cactcgcaag ttgatctctt tgaggcctgg agggtcatca agcttgttct gatgaaccag 1560 cttagtcaac ttgaggcaaa ccaagccagg caacatatta gcaaccctat tcgcgaatct 1620 agggtattat acagcaatat ccgtggttgg atatcacatg aagccctgcg gaaggttgag 1680 actcaacggg aacgactatt gaaagaggtt cctgtgtgta caggggtatt cactaggact 1740 cttggtctgc cttgtgctca tagccttcag cccttactga agcagaatca gccccttcta 1800 ctgaatcatt tccactcaca ttggcatctt cgacgcccag gaagcccccg gttccttatt 1860 gagcctcgta agcagtttga tcgtctaaca gctagttcaa cgctaccacc aacaagcaca 1920 caacgtgagc cttctacgtt tgagcgtatc gagaaggcac tacaaccaaa ggcaccgcca 1980 aagtgttcaa gatgtcatca gcaaggccac atgatgacct ctaaagcatg ccccctacgt 2040 tacaagcatc tattgcaggc tcctacccaa acctctacga tacaggcacc tactcaagac 2100 tctacaacaa cgcatataat aacacataca acaactcgct cgataacccg ctcactatcc 2160 ccatcatcct catcaggatc ctctatagta tccgagattg tagcctatac aacaacacat 2220 acaacacata caacaactcg tgtactatct cctccagcag ccagaccagg aactgttcct 2280 gcagcaaatg cgatagtagt gcccgcattg cgagctgatg atccccgtgc catcttccaa 2340 cgatataaag aagcacgaga ggcctggttt actacattgc ctcggggagc ctataagact 2400 aaccaacaat accgtagggc tatggggctg cctctacgtt atagtaaagc agagtatgac 2460 tggtgccttg actataagca aatggggagt cattgcaagg 
tgggaaacgg cactagagac 2520 tggactaaag aggagatgat gagttatctt gattgggata gagctgagaa cgatcgggtt 2580 gaacagagcg tagagataga aatggcagaa cagcctttct cacggcggcg aggtatgcaa 2640 gatatctggg atgcagccga gagagatatt atgttgcaag agagtatatt tcaagggaga 2700 tagcctagag gtataacccc acggtattaa catttgtaat actaagggca tatgtaatta 2760 taatatatta ttagaaagag aaaggaaagg caaattttca agtataagtg cagtgatcta 2820 atggcttaat ttaatagaga tataattgag cttatatata gggctaaatg taaatatatc 2880 taacacactc taccctccta attatagcaa ttaataaaga agacttacaa atccttcacg 2940 atgacttccg ctttattaat aatatggtcc tcaacaaggg agttaagcaa ggcggggata 3000 gcttccgcag cggctatggt gttataatga ggcattatac gttctttata gccctaagat 3060 agagcaatgt attgcagata gataatataa tacatattaa acagctaagc ttacaataaa 3120 ccattatatt acttcaattc tataaataac agctattttc gtagataaaa gagagataca 3180 cttacccaca tatgtacact cacctatgcc tcttacataa cggatcttat tcgcacgacc 3240 ggtggtgcga aggtcacgtg atcttaggtg caccatcggt cgtgcaggta tggcttccc 3299 // ID Gypsy-2_AM-LTR repbase; DNA; FNG; 1093 BP. XX AC ACDU01005031; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Allomyces macrogynus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_AM_; KW Gypsy-2_AM-I; Gypsy-2_AM-LTR. XX OS Allomyces macrogynus OC Eukaryota; Fungi; Blastocladiomycota; Blastocladiomycetes; OC Blastocladiales; Blastocladiaceae; Allomyces. XX RN [1] RP 1-1093 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Allomyces macrogynus genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; ACDU01005031; Positions 5004 6096. 
XX SQ Sequence 1093 BP; 202 A; 330 C; 288 G; 273 T; 0 other; tgtcacaggg cctcccgctg aggacgttct cagcgaccag ggtttagatt tggactcggt 60 tggtgatggg tcgtgtgacc aaagtcgcgc caacactggc tgatcactcc gactcactcg 120 atggggatgg cagtcgacag ttcaaatccg actccacttg accaaacgtg tcaggcctgc 180 gcctccgcca gccaccctga acctcttccc gacccagcaa accgccaagg cggacgatgg 240 gccgactatg tgcacgccat ccgccacacg gtgatgacaa tgctacaaac cttgcccaca 300 cgccctgatt tcgtcgcatt tttttgggga gtctggaggc cccacgcctt gttttttggt 360 tgttttcctt tgtcattccg atttggtcgc tgagcggcag cacaagcgtc gtgggattca 420 aaagctcggt gggcccgcct tggtgtggcc tttgccccct gcttgcccgc cgaggaaaac 480 gaactgcccg ctgacgattc tcgttgcctg tcgctcaggt ctgtactgcc attttctttg 540 ttgtccgcca ttttgggatg agcgagtttg gctatatcgt gccaaaacaa atacaaggcc 600 ccctgcttgc ccgccaagga aaacgaactg cccgctgacg attctcgttg cctgtcgctc 660 agagctctcg ctgtcaagaa cgtacgatat gacccgggtt ttgaggacgc aacgtgtgcc 720 gtccgccctg gttttgtacg cagagacggt tcgctcattg tgacaggtac aaattcgggc 780 ctcgcttctt ttccttcttt tggccaggcc actgtgacca caacgcgcag gactgtgcgc 840 gaagtgtgtg agaaaagccg tgcggcagac ctcccttgag agaaccgcac ggtaggcttc 900 ttcggaataa cgcgctgtct tcaggcacgc gtttttggtg attcatccca tttccgtcga 960 tcacaggccg tggtgtgact gtcagctttg cggcgaaacg ggcatccacg gaaacctgta 1020 ttttctcgct gggctattcg cctctgctat attgtccatt ccacctaccc gatactcctg 1080 tgtgcccccc aca 1093 // ID Gypsy-13_LBS-I repbase; DNA; FNG; 10784 BP. XX AC ABFE01000616; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_LBS_; KW Gypsy-13_LBS-LTR; Gypsy-13_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-10784 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000616; Positions 198696 209479. XX CC Positions [4887-5342] - Reverse transcriptase CC Positions [6891-7370] - Integrase core CC 'CTCC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 2419..4461 FT /product="Gypsy-13_LBS-I_3p" FT /translation="MYTRMLAASDDGGPTEVYLVMQKAPISWGPILNIETV FT RGTSLLYSRATDHEYALIHAAKYESSNILTTENLPATLKRLGINLDRNRTY FT DRTANLVSSKEENDVIHEAFLGQLSREECEQGISNDPEAIREAFQVLKKRQ FT RPPPKGGYPYSKNDHVTTKLGRLPPSPCKVCGSDNHWDRECPDWDVYQAKQ FT QKSAYRIELVEVEDLEQYYSSVYSVLVAERLAKEHRPSRETEEDFDEAVLR FT TQGPTLESRERKTEDTSKPWSRQTAFVEEIEDEYWEEYRAKEKSNSHLLYQ FT IGDEDDAEERKEAYSSSRGEGPRAENADPECNKVKEPKVSPTHDSPPFVFD FT HTAPNGEKPAEVKDPPPVKLDPPPSKEKCIRIPKVRTRPEGMSAIGVSVLS FT TRGFVGSLKNSEIDLRLDSCADITLISSEFYESLVDKPKMKQGMRMQLWQL FT TDKDSKLKGFVRIPIFMVSDTGDIVETEAEAYIVPNMTVPILLGEDYQQTY FT EMGVTRNVEEGTHVSFRRHDYRIRALPVERSKDFDRLRQSAYMVGQFVRRR FT LHRRNKGKRHRRKVKFGLEEKTVRAAEDYRLRPHESKPIRVEGQLGEDREW FT LVQKNLLANANDSFFAVPNVLISAAHPWVPVANPTDQPRYIRKGEIIGSIC FT EPEKFFDSPSSPEELLRFQEAAERIRTVIAVQ" FT CDS 4461..8015 FT /product="Gypsy-13_LBS-I_5p" FT /translation="MNHDNSNNQTEGQKEEESEPEEYGPKTAAMPDPTIYP FT SSQLEDLIDVGSLPEYLKERAWAMLRKRIKAFGFNGRLGDLPAKVHIRTVD FT GQVPISMPMYNSSPEKRAIINEQIDTWFEQGVIEPSKSPWSAPVVIAYRNG FT KPRFCVDYRKLNAVTIPDEFPIPRQAEILASLAGAQVLSSLDALLGFTQLE FT LAEEDIEKTAFRTHHGLFQFRRLPFGLRNGPSIFQRVMQGILAPYLWIFCL FT VYIDDIVIYSKSYEEHIDHLDKVLEAIEKAGLTLSPKKCHLFYGSILLLGH FT KVSRLGLSTHAEKVKAIMELERPRKLSQLQAFLGMVVYFSAFIPYYASICA FT PLFQLLRKGHKWTWGIEQEHAFQSAKSALNSSPVLGHPMEGLPYRLYTDAS FT DEALGCSLQQIQPIKVRDLKGTRTYDRLKKAYESGQKPPQLVAAIGTGKKD FT SNFEDEWAPDFDDTIVHVERVIAYWSRLFKNAETRYSTTEREALAAKEGLV FT KFQPFIEGENVLLVTDHSALQWARTYENANRRLAAWGAVFSAYAPKLEIVH FT RAGRVHSNVDPLSRLPRAPPPQTSPLEFNEPVIRAKESLDERQEVSVPAER FT MAAYSFAAWSIEDCLDFPKEAMINVRSRNKRDDITISAESDVPQSRQPPPS FT ESDNDELDTLTPTAEYWGATNPPPTIHLAMSEEAKTEWKKDYADDPAFSAI FT 
ATDPKFLYENVTPGRRFFRDEDGMIFFSNEDYQPRLCVPKGRKNFVLQEAH FT ENALESAHAGPERLWQSLSTRFYWKRMKIDIEKFCKTCDICQKSKFSNFNK FT FGLLIPNPIPSRPYQSISMDFIVNLPWSEGYNAIFVVVDRLSKHASFIPTT FT TGLDTEGFAHLFVKWIVCRFGLPESIITDRDPRWTTDFWLGIAKALQTRMS FT LSSSHHPQHDGQTEVVNKLLTTMLRAFTSGKKSEWAKWLHLLEFAYNSAVH FT SSTSAAPFHLLLGFHPRTPLDFLGTKRNDDVVGRALSPEAVTFLENMAMHR FT DSARRAIAKAQDQQARSYNKGRKPVPHLKKGDRVLINPHALEWIESKGEGK FT KLTQRWIGPFEVMQRINPNVYRLRMSNLYPGLPIFNYQHLKKYEESPTEFG FT GRIVLPETRTTAPAKEEYEVDRIVAERRTRNGLQYLVRWEGYSPLYDTWEP FT RKALTNAPEVVAKWQKQRDEKGPGDL" FT CDS 8180..10738 FT /product="Gypsy-13_LBS-I_4p" FT /translation="MEFFLYTDNDGIVPTSGHEVMTAGEWVALNASAGKIP FT STDVFQQALEDRAFQPGELCARPNAWTWTKEPAWYSQRYHWDAWNMIDYFP FT DEHDTIPWYFSEGNTPRVGSGGRFYLDTSFQEKAADSLAKLWLCIESIISN FT PPFVKGTDHPMRFNPLRLQAGSESSNSVDSLAREARTKALEFLGFLNWWSL FT SVTHWDASLQPWMIDYVAALHLRSLKKRGVFLDLVRNWHHHNVAHWVAEDV FT PVYYYWMEDHTQYPSLSRLSPRILQAYHDACTALDRTEASADDIMGYDDEI FT ATIRGHDEFLQQRLEPNHTTSPLFIDIPSTATVYIVDFEGWARRPITDFIT FT VQDYTERFHFFIDMEMTGGLVTIWRWKPRVFNDGSGHRAGTEGLGSSVEAR FT RGDREIREIFKSLYAPSAKECFDRWGRLRLPGEVSSGSSVGSDSPSIHSPA FT AEAPIQMLPRPSWAPQSHPDIPVPCPPLMDVPSRWVQAMMTPSPLAPHSRA FT SSAARHSSISPRHSSRDSRRSASPRTSSRRSQLPTSRSSFVMALRAIGDEY FT AVREATWTSKKPLNWNPDFLEVGYLLVADSRAQARLRYWAACSGDASTMAA FT LLFKAIRFSIPFAIGVKVEDFGRFKPEEVSDMDRLVGKPTCTTEPPFVYTA FT QGALKAYYMSRVNDIIRRPHARILIGMGGPIAWLGRKWGGMELVAQFMTGP FT SPDVYVHRRGYIDSDDENPKFLYTDEMSPQEVDVLFGCIRSDSDKDKSLYP FT SRDILDDGCFFWTGEWDARMEEMFADLTKDVLQGTAKFRTPGMWNEYFRRR FT SRMTRGPRDRLNQLVPASLLRLHSKILEGFNVDWHKSRIAHIELPEEYKPR FT " XX SQ Sequence 10784 BP; 3021 A; 2665 C; 2605 G; 2493 T; 0 other; aaggtggaca ctgtgggaaa tagcggcaac ctggctggtg taccagcaag tctcggagag 60 gacgataaag ttagatcgag caaatcaggg gttgcacgac caaggcgagg agcacctccc 120 tcttccttaa gcttacctac ccagagatct tctagatcga gctcgccatc gaagcgatca 180 actcctacga actacccgct ttcggacaac gctcttcgtt ctacttcatc tcctcaatca 240 ctcgctagct acaatcgcct ttcagtcatc acgccggcgg ctccgaaatc agctctacga 300 atcgactcct cgagtcgcac ttcaaacttc aatcaaacgg ttagaggagc aactgctagt 360 
agcaagactg tgaccttccg gcaccatcta tttcaaccta caacaaatcc tgtcaaaacg 420 acgttccagc ctcgttcaga ttctactaca gagtctagct tggcttttga tttcaaatcg 480 cctactccca tcactccaac caccgacgtt acggtatcag gcgaaatact actctctcct 540 ccgtcactac ttcctgacac atcttcgcac tctcacactc ttccgaataa cccgaatccg 600 agtcctacaa attccccttc cgtttctact atatcttcgc aaaatacgtc gatacaagaa 660 gtaggagata tgtcaactac tgaaaccaat ctgacacaac cgttccgtcc gggtggcgag 720 aatggctatg ccgactcgcc tatggctaga cttatgctag agtatcagga ggaaagagac 780 gaggatccgt ctacgcgaca acgggctatg agcttatatg cgaaaaagac tttagcggct 840 agggatctca tatacctaga tacggccgac gctaatatcg cgctagtaga cgacttacta 900 agtgaagtga gtcaggtctt gggggagctt cagactttcc tagaaaggac ttccggatta 960 ataccggaga ggtcaaagtt ctttaaagtc gacccaagga atagttttat ggacgtcttg 1020 cgcggagcta cagatttacc acaattacat gcagcctggg ctgggctcaa ctgacgcatc 1080 ggattagccc aggagaattt aatcaaatac gaagctcagt atcgtcaacc cttcaaatcc 1140 tcggatttcg ttgtcccaac ttcaccgatc tcgacagacc cagaaattta cgaagcaatg 1200 tcagatctag gagaattaga ttcgaaaatg agatacttat accagagtgt acctcatctt 1260 caagatgaaa ttcagtctcc aaggaagctg acggatggtt cggcttggaa cgatataatt 1320 cccctcccgg aaaatctagc cgaactacat tacaattggc aaggtaatgc tataagtact 1380 tcgagcggta aagaaaaagg gagagatatg aataacaacg ggagtcttcc tacttcccct 1440 cgaatgttga atgttggcta cggcacgccc tttcgttcca gttcgcagtt ttttagtaaa 1500 ccggatagac aaaagtttcc gttacctgtt cccagcgttt tagcgaatca aaatgtttta 1560 gtggggctag gcttaccgaa tactccagca ttccaaaaca tagatgacat acctcctagg 1620 cgactgcatg gattgcgagc acaatcgagg caatcaaacc cttttgaaca aaggagtcag 1680 acaaacttga atattgggac atcatctaat ccctctaatt ttcctcgcaa ctcagacttc 1740 cataactcca atctccttag tcaaggtacg aacgggggag gaagcggcgg aaatggcgat 1800 cctcctggag gaggaaatgg ggggcgagga aacgatcccc gtaggaatca agatgcagaa 1860 agcaatgagg atacggagca aaacaataac cgttacaatg cgtttcctcg gaacaacggt 1920 ggaagtgatc ctcccggagg agacccccct ggcggcggag gaggaaatgg tcctccagga 1980 agaggaaatt tcgggaatcc tagaaatgat ggtcagaatt cgggactgat accctacggg 2040 gacactagag ctaccattag 
aaacgatctg aaacaagatc aactgcccgt ttgggacggg 2100 aataagaatt cagcaataga atacttctgg aaggtgcagc agttagtggc tcttgaaggg 2160 gatattccac aagctttggg ttactggcta tggaaaagct tgaaggagaa ttccaagatt 2220 tggtggtggt tctcaacttt accgtttgca gaacaagcaa agatgagaac acactatcta 2280 tattatctca agggaatcaa ggacaattac ctaggaagaa cctggcagat cagtatgaac 2340 actaagtacg agagtcaatc cttctgacag gaaggttacg aacgtgaatc ccctcctgca 2400 ttcattacac ggcgcatcat gtacaccaga atgctcgcag cttcggacga tggcgggccg 2460 acggaagtat acttagtgat gcagaaggct cccatttctt ggggaccaat tttgaacata 2520 gaaaccgtcc gcggcacttc cttactatat tcgcgagcca cagatcatga atacgcgcta 2580 attcatgcgg ctaaatatga gtcatctaac attcttacta ccgagaattt gccggctaca 2640 ttgaaacgct tgggtattaa cttggatagg aaccgcactt acgatcgaac agcaaatctc 2700 gtcagttcga aagaagaaaa cgatgtcata catgaggctt ttcttggaca gttaagtagg 2760 gaagaatgcg aacaaggaat ttcgaatgac ccagaagcaa tacgagaagc ctttcaagta 2820 ttgaagaaaa ggcaacgtcc tcctccgaaa ggagggtacc cctacagtaa aaacgaccac 2880 gtcaccacta agctaggtcg tttaccgcca tccccatgca aggtatgcgg gagcgacaac 2940 cactgggata gagagtgtcc tgattgggac gtctatcaag ctaaacaaca gaaatccgct 3000 taccgaattg aactagtcga ggtagaagat ttggaacaat actacagcag cgtgtattct 3060 gtgctagtag cagagaggtt agcaaaagaa catcgtccat cacgagagac agaggaggat 3120 tttgacgagg cagttctacg aacacagggt cccacccttg aatcaagaga acgtaagacc 3180 gaggacacgt caaaaccatg gagtaggcag acggcattcg tagaggaaat cgaagacgaa 3240 tactgggagg aatacagagc aaaagagaaa tctaactctc acttgttgta tcaaatcggt 3300 gacgaagatg acgcggaaga gcgcaaggaa gcctactcgt cctccagagg agaaggtcct 3360 agagctgaaa acgcagatcc agaatgcaat aaggtgaaag agcctaaagt ttctccaacg 3420 catgattccc cccctttcgt ctttgatcat actgccccga atggagagaa acctgccgaa 3480 gtcaaagacc ctcctccagt caagcttgat ccgcctcctt caaaagaaaa atgcatacgt 3540 atccctaaag ttagaactag accagaagga atgtcagcaa tcggcgtttc agtattatca 3600 accaggggtt ttgttggatc tttgaagaac agcgagattg atctgcggct agattcatgc 3660 gcagacataa ctctgatttc cagtgaattc tacgagtcgt tagtagataa accgaagatg 3720 aagcaaggaa tgcgtatgca actttggcaa 
ctgacagata aagactctaa attgaaagga 3780 ttcgttcgca tacctatttt catggtttca gatacagggg atattgtcga aacggaagca 3840 gaagcttaca ttgtaccaaa tatgacagtt ccaatactct taggagaaga ttatcagcaa 3900 acatacgaaa tgggcgttac taggaatgta gaggaaggta ctcacgtatc gtttcgtcgc 3960 catgactatc ggattcgagc tctcccggtg gaacgttcga aggattttga ccgcttacgg 4020 caaagtgctt atatggtcgg tcaattcgta cgaaggcgcc ttcacagacg caataagggc 4080 aagcgccatc gccgaaaagt gaaattcgga cttgaagaga aaactgtgag agccgctgag 4140 gactatcgtc tgcgaccaca tgaaagtaaa cctatcagag tggaaggtca attaggagaa 4200 gatcgcgagt ggctagttca gaaaaactta ctcgcaaatg caaatgattc cttttttgct 4260 gtccccaacg tcctgatttc agcagcgcat ccttgggtac cagttgcgaa tcctacggat 4320 caacctcgtt atataagaaa gggtgagatt ataggatcga tatgcgagcc agagaagttc 4380 ttcgatagtc ccagttcgcc agaagaactc ctacgattcc aagaagcagc cgagaggata 4440 cggacagtaa tagctgtaca atgaatcacg ataactctaa caaccaaact gagggtcaaa 4500 aagaagagga atcagaacct gaggaatacg gaccgaaaac ggcagccatg ccagacccaa 4560 cgatttaccc ctcctcgcag ctcgaggatt tgatcgatgt tggcagttta cctgagtact 4620 tgaaagagag agcatgggct atgctgcgta agcgaattaa agcattcggt ttcaatggaa 4680 gattgggaga cctaccagcc aaagtccata tacggacggt cgacggacaa gttcccattt 4740 ccatgcccat gtacaattca tctcctgaaa aaagagcgat cattaatgaa caaatcgaca 4800 cgtggtttga acaaggagta atcgaacctt ctaaaagtcc ttggagcgca cccgtagtaa 4860 tagcttaccg caatggcaaa ccgagattct gtgttgatta tcgaaaactg aatgcagtaa 4920 cgattccgga tgagtttcct ataccccgtc aggcagaaat tcttgcttcg ttagcgggag 4980 cacaagtcct gtcctccctg gacgccttat tgggttttac gcaattggag ctagcagaag 5040 aagacataga gaaaacagcc ttcagaactc atcacgggct atttcaattt cgaagactgc 5100 ctttcggatt gcgcaatggg ccttcgattt ttcagagagt aatgcagggc atcctcgctc 5160 cttacttgtg gatattttgt ttggtctata tagacgacat cgtcatttac tctaaatctt 5220 atgaggaaca catcgaccac ttggataaag tcttggaggc catagagaaa gccggattga 5280 ctctttcacc gaaaaaatgc catttgttct acggttctat cttactacta ggacacaaag 5340 tttcacgctt aggactatcc acgcacgcgg aaaaagttaa agctattatg gaactggaac 5400 gtccgagaaa gctgtctcaa ttacaagcct ttctaggaat 
ggtggtttat ttctctgcgt 5460 tcatacccta ctacgcatcg atttgtgctc ccttgtttca actactgcgc aaaggtcata 5520 aatggacatg gggcatagaa caggaacatg cctttcaatc agctaagtca gctctgaatt 5580 ccagtccagt cctgggtcac cccatggaag gtctcccgta tcgattatac accgacgctt 5640 cagacgaggc tttgggttgc tccttacaac agattcagcc aattaaagta agagatctga 5700 aaggaacacg cacctacgat aggttgaaga aagcatatga aagcggccaa aaaccccccc 5760 aactagtcgc tgcgataggg acagggaaga aggactcgaa ctttgaggat gaatgggctc 5820 cagatttcga cgacacaatc gtgcacgtgg agcgagtcat tgcttattgg tcaagactgt 5880 tcaaaaacgc agaaacgcgg tactcaacta cggaacgcga agcactagcc gcgaaggaag 5940 gactggttaa atttcaaccc ttcatagaag gggagaacgt actattagtg acagatcatt 6000 ccgccctgca atgggcaaga acatatgaga acgctaatcg tcgacttgcc gcttgggggg 6060 ctgtgttttc agcatacgct cccaagttag agatcgtaca tcgcgccggc agagtacatt 6120 ccaacgtgga tcctctttca agactcccga gggctccacc tccgcagact tcccctttag 6180 aattcaatga gccagtaatt cgcgctaaag aaagtttgga tgaaagacag gaagtcagcg 6240 ttccagccga gaggatggca gcctactctt ttgcggcatg gtcaattgaa gactgtctag 6300 attttccgaa ggaagcaatg attaatgtgc ggtcccggaa taaaagagat gacattacca 6360 taagtgcaga gagcgatgtt cctcaatccc gtcaaccccc gccttccgaa agcgacaacg 6420 acgagttaga cacattaact ccgacggcgg agtattgggg agccacaaat cctcccccca 6480 ccatccattt agccatgagc gaggaggcga agacggaatg gaagaaggat tacgccgacg 6540 accccgcttt cagcgctata gcgacagatc cgaaattctt gtatgaaaac gtcaccccag 6600 gacgccgatt cttcagggac gaagacggta tgatcttttt cagtaacgag gactaccagc 6660 ctcgcctttg cgtaccaaag ggtaggaaga actttgtctt gcaggaagct cacgaaaacg 6720 ccttagaatc agcccacgca ggcccagagc gtttgtggca gtccttgagt acgcgattct 6780 attggaagcg aatgaaaatc gacatcgaaa agttttgtaa aacctgtgac atttgtcaga 6840 aatcaaaatt ttccaatttc aacaagtttg gtctgctaat cccgaatcct attccctctc 6900 ggccttacca atccatatcc atggacttca ttgttaacct gccctggtcg gaagggtaca 6960 atgcaatttt tgtggtagtg gatcgtctgt cgaagcatgc atcgttcatt ccaacgacga 7020 caggtctaga caccgaaggt ttcgctcatc tgttcgtcaa gtggatcgtt tgtaggttcg 7080 gattgccgga gagcattatc acagacaggg atccgagatg gacgacggac 
ttctggttgg 7140 gaatcgccaa agctctacag acacgcatga gtctatcgtc gtctcaccat ccacagcacg 7200 atggacaaac cgaggtggtc aataaactcc tcaccacgat gctgagggca tttacgtctg 7260 ggaagaagtc ggaatgggcg aagtggcttc atttgttaga attcgcctac aatagcgcag 7320 tgcactcgtc gacaagcgca gctcctttcc acctcttgtt agggtttcat ccacgtactc 7380 cgctggactt tttgggaacc aagagaaatg acgatgtcgt aggccgcgct ctgagtcctg 7440 aagcggttac gttccttgaa aatatggcta tgcacaggga tagcgctagg cgcgccatag 7500 ctaaagctca ggatcagcaa gcgcgttctt acaataaagg gcggaaaccg gtacctcact 7560 tgaagaaggg tgaccgagta ctgataaacc cacacgcgtt ggaatggatc gagtccaaag 7620 gcgaggggaa gaagctcacc cagcgctgga tcgggccctt cgaggtgatg cagaggatta 7680 accccaatgt ctaccgcctt agaatgagca acctataccc aggcttgcca atcttcaatt 7740 atcaacatct gaaaaaatat gaggagtcac ctaccgaatt cggaggaagg atcgtgctcc 7800 ctgaaacgcg caccactgcg ccggccaaag aagagtatga ggttgatcgt attgtggcag 7860 agcgcaggac acggaatggt cttcaatatc tagttcggtg ggagggttac agtcctctct 7920 atgatacctg ggaacccagg aaggcactca ccaatgcccc agaagtcgta gccaaatggc 7980 agaagcaacg cgacgagaag ggtccaggtg atttgtaata ctaccacatg ttaacttatc 8040 tcgtctagat cagcgatcga tctcgctgat aggcgacttg ttacccattc gcatttcgtt 8100 tgttcttttg ttctttcctt tctctttttc tttactactt ttcttaacaa catctcactc 8160 ccctctgctt tcaaatccca tggagttctt cctgtacacg gacaacgacg gaatcgtccc 8220 cacgtctggt cacgaggtca tgacagcagg cgaatgggtc gctctgaacg cgtcagccgg 8280 caagattcct tctacggacg tcttccaaca ggctctcgag gatcgggcat ttcaaccagg 8340 agagctatgc gcgcgtccaa acgcctggac gtggacgaaa gagccagcct ggtacagcca 8400 acggtatcac tgggacgcat ggaacatgat tgactacttc ccggacgagc acgataccat 8460 tccttggtat ttcagcgagg gtaacacgcc gagggtcggc tcaggtggga ggttctacct 8520 ggatacaagc ttccaagaga aagctgcgga ttctttagcg aagttatggc tctgcattga 8580 atctatcatt tctaaccctc cttttgtcaa gggcaccgat cacccgatga ggttcaaccc 8640 gctacgcctc caagcaggct cggaatcttc aaacagcgtg gattctttgg cacgagaggc 8700 tcgaacgaaa gccttggagt ttttaggctt ccttaattgg tggtcattgt cggtcactca 8760 ctgggacgcc tccctgcagc cgtggatgat tgactacgtt gccgctctac atctacggag 
8820 cctcaagaag agaggggttt tcttggattt agtacgaaat tggcaccatc ataacgtcgc 8880 acactgggta gcggaggatg tgccggtcta ttactattgg atggaggacc atacacagta 8940 ccccagcctc agtcgacttt ctccccgcat cctacaggcg tatcacgacg catgtaccgc 9000 tttggacagg acagaagcct ccgcagacga cattatgggc tacgacgacg aaatcgctac 9060 tatccgcgga catgacgaat tccttcaaca gcgactggaa ccaaaccaca ctacttcccc 9120 cctctttatc gacatcccat ctacggccac ggtttacatt gttgatttcg aaggctgggc 9180 ccgacgtccc atcaccgact tcattacagt tcaggattac acggagcggt ttcacttttt 9240 catcgacatg gaaatgacgg gcggcctagt gacgatttgg agatggaaac cgagggtatt 9300 caatgacggt tcaggtcaca gagcaggaac agaaggcttg ggctccagtg tagaagcccg 9360 acgcggggat cgagagatca gggagatctt caagagccta tatgcgcctt cagcgaagga 9420 atgctttgac agatggggtc gactccgcct tccaggcgaa gtgagcagtg gttcatccgt 9480 aggctcagat agtccatcca tacattcccc agcagcggaa gccccaatcc aaatgctgcc 9540 tcgtccgagc tgggcgcctc agtctcaccc cgatatcccg gtaccatgcc ctcctctaat 9600 ggacgttcca tcaagatggg tccaagcaat gatgacacca tcccctttag ctccacattc 9660 acgagcatca tctgcggctc gacactccag catatcgcca cgacacagtt ctcgagattc 9720 tcgtcgttcg gcctctccta ggacatcttc tcgtcgatcg caactcccaa catcgaggag 9780 ctccttcgtc atggccttac gtgcaatagg ggacgaatat gcagtacgcg aggcgacgtg 9840 gacaagcaag aaacctttga actggaatcc agacttcctg gaggtcggct acctattagt 9900 ggcggactcc agagcccaag ccaggctgcg ttattgggct gcatgttcgg gcgacgcatc 9960 aactatggcc gccctcctgt ttaaagccat ccgtttcagc atcccattcg ctattggagt 10020 gaaagtggag gatttcggga gattcaaacc ggaagaagtg tcggacatgg accgcttagt 10080 agggaaaccc acttgcacga ccgaaccccc ttttgtttac acggcacaag gcgccctcaa 10140 agcgtactac atgagtcgag tcaacgacat tatccgtcgc cctcacgcca ggatccttat 10200 tggaatggga ggtcccattg cttggttagg tcgtaaatgg ggaggaatgg agctggtagc 10260 ccaattcatg acaggaccct cgccagacgt ttacgtacac cgccgtggat atatcgattc 10320 cgatgacgaa aatccaaagt tcttatacac ggacgaaatg tcaccccagg aagtcgacgt 10380 ccttttcggc tgcattcgga gcgacagcga caaggacaag tctttatacc catccagaga 10440 tatcttggac gacggttgtt tcttttggac tggcgaatgg gatgcgcgca tggaggagat 
10500 gttcgccgac ttaaccaagg atgtcttgca gggcacagcc aaattccgca cacctggtat 10560 gtggaacgag tatttcaggc gccggagccg tatgactagg ggtccacgcg accgcttgaa 10620 ccagcttgta ccagcttcac ttctacgcct tcactccaag atattggaag gtttcaacgt 10680 ggactggcac aagagtcgaa tcgctcacat cgagctcccc gaagagtaca aaccccgtta 10740 aatcgggaat aggggggctt tgtgaggggt cagtatccct ctcc 10784 // ID Gypsy-59_MLP-LTR repbase; DNA; FNG; 216 BP. XX AC AECX01001344; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-59_MLP_; KW Gypsy-59_MLP-I; Gypsy-59_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-216 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001344; Positions 221258 221473. XX SQ Sequence 216 BP; 58 A; 54 C; 44 G; 60 T; 0 other; tgttatgatc tcaggatgca tgtaacagtg atctcaggat gcatgtaaca ggatgtgaaa 60 gatctgtagg agatgtcaca ggattggact ccaacctctg ttgtacacgt cacgcttgta 120 tatcttttcc tttccttctc aaccgacaat ctacatacaa tactagagaa gccactcctc 180 aatccccgtc gagagatccg tgccgtggcc ttaaca 216 // ID Gypsy1-LTR_AO repbase; DNA; FNG; 321 BP. XX AC . XX DT 23-JAN-2006 (Rel. 11.01, Created) DT 24-JAN-2006 (Rel. 11.01, Last updated, Version 1) XX DE LTR of the Gypsy1_AO retrotransposon - a consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Gypsy1_AO; Gypsy1-I_AO; Gypsy1-LTR_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. 
XX RN [1] RP 1-321 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Basturkmen M., Spevak C.C. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-321 RA Kapitonov V.V. and Jurka J.; RT "Gypsy1_AO, a family of Gypsy LTR retrotransposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 2-2 (2006). XX DR [2] (Consensus) XX SQ Sequence 321 BP; 79 A; 77 C; 70 G; 94 T; 1 other; tgtcacagaa atgataacga tcatgcaagg tgactgtaca gtataccctg tcccaggtgc 60 atggacatcc cggtggccga acggcgtctt cggtgtctat atccgtcacc tttctaatga 120 cgtacagtgg ccccgttagn aggattgatc attgtataca gagaatctta ggggtcctcg 180 gctataaatt agtactgtag tcattagtag gaaatcaaga atcagtcttc tgtctctagt 240 atctactgcc tagtgctttg cccacgtatc cttacctagt tgacaggtcc tctaccctga 300 ctagtagata ttcctctgac a 321 // ID POT2 repbase; DNA; FNG; 1861 BP. XX AC Z33638; XX DT 23-APR-1999 (Rel. 4.03, Created) DT 23-APR-1999 (Rel. 4.03, Last updated, Version 1) XX DE Pot2 DNA transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; POT2; KW Autonomous DNA transposon; pogo superfamily; transposase. XX OS Magnaporthe grisea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-1861 RA Kachroo P., Leong A.S. and Chattoo B.B.; RT "Pot2, an inverted repeat transposon from the rice blast fungus RT Magnaporthe grisea."; RL Mol. Gen. Genet 245(3), 339-348 (1994). XX RN [2] RP 1-1861 RA Kachroo P.; RT "POT2."; RL Direct Submission to Genbank (18-MAY-1994)Pradeep Kachroo, of RL Plant Pathology, University of Wisconsin, 1630 Linden Drive, RL Russels Laboratories, Madison, Wisconsin, 53706, USA. XX DR GenBank; Z33638; Positions 1 1861. XX CC POT2 is a pogo-like DNA transposon. It has 43 bp-long TIRs and CC is flanked by TA target-site duplicates. 
POT2's ORF of 535 CC amino acids encode transposase. Pot2 is present at a copy number CC of approximately 100 per haploid genome and represents one of the CC major repetitive DNAs shared by both rice and non-rice pathogens CC of M. grisea [1]. XX SQ Sequence 1861 BP; 590 A; 401 C; 367 G; 503 T; 0 other; taacgttgcg taccccctgt tcggcacccc ccctgttcgg cacccaaaaa taacaacact 60 tttattttta tcctccaact tctattataa atatctcgat gaatctgtca cttttggatt 120 tacttttttc tgattttaat tctccatttt ccaatgaagc aatatactga aaaacagctt 180 atatctgcaa ttaacgacgt caataatggc aatccaattg caaaaacctc ccgaaaatgg 240 ggaataccta ggtctacact tcaaagtcga cttaaaggtt ctcaacctta taaaaaagca 300 caaagccctt ttcaaaggct ttccacggaa caggaaaagc atttggctga ttgggtactt 360 acccaaacag ctttagggct tccgccaacg catcaagaat tacgcttttt tgccgaacga 420 attcttcaag ccgccggaga gacaaaaggc cttggaaaac gttggataac tcgttttttg 480 gctcgttatc caatccttaa aacccaaagg ccccgtcgaa tagataacgc ccgggttaat 540 ggcgctacta cggaggtaat taaatcttgg tggctttata ttacgaaccc ggttattaac 600 gctattaaac cggaaaaccg ttggaatatg gacgaaaccg gtataatgga aggcaaagga 660 tctaatggcc tagtattagg gcttaacggg atccggccgt tgcaacgaaa agagcccgga 720 acgcgtggtt ggacgactat aatcgaatgt atatcggcta cgggcgttgc cctccctccc 780 ctcgttatat ttaagggaaa aaacgtacaa caacaatggt ttcccacgga tttaagccct 840 ttcgataatt ggcaatttca tgcaaccgaa aacgggtgga caaataacca aacggctatc 900 gaatggttaa aaaaggtgtt tattccgtat acccaacctt taacccctga aaagcggtta 960 ttagttttgg atggccatgg atcacatata acggacgaat ttatgcttct ttgcttgcaa 1020 aataatattc aactcctata tttaccccct cattcgtcac acgttcttca accattggat 1080 ctatcggttt ttgggccgtt aaaagaagct tatcgacgtc aacttggatt tgttagccaa 1140 ttttgctgtt caacagttat tggaaaacga aatttcctac tttgttatcg aaaagccaga 1200 ttaaaagcat ttatagcaaa aaccattcaa tctggttggc gtacaacggg gttatggccg 1260 gtaaacttgg ttaaaccact tttaagccct tttttgttag aaaatagcaa cgccaacgtt 1320 ataaaagata aaaacaacgg tttgcaaagg gataaaacac cggaaagccc agcccaaaaa 1380 attaacgacc cgtctttact tatttggaaa acccctaaaa cgacccgaga tatccgactt 1440 
caactgcaaa aactttccca atccaacaaa accaacgcta cttcacgtct tttatttgca 1500 aaagtccaaa aaagcttcga agccaaagat acccttttgg ctagcgccca gcaaaaaatc 1560 agcttattgg aagcacaact ggaggcaata cggccggtta aaaggaggag ggtggttccg 1620 gatccaaacg agcttttggt taacaaacag aacattattg gattgcagga aaatgatata 1680 gaaaatttgg aacctttagc tgatgaagaa gaggttaatg aaccggagaa gcgtgaaaac 1740 gattgtattt ttgtccgttg ataattttat atctaaatca gtttataaaa gtaggcattt 1800 cgttgttaga ttttcagggt gccgaacagg gggggtgccg aacagggggt acgcaacgtt 1860 a 1861 // ID copia-1-I_AF repbase; DNA; FNG; 5242 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 08-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Internal portion of copia-1_AF LTR retrotransposon - a consensus DE sequence. XX KW Copia; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Interspersed repeat; COPIA superfamily; copia-1-LTR_AF; KW copia-1-I_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-5242 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-5242 RA Kapitonov V.V. and Jurka J.; RT "copia-1_AF, a family of copia LTR retrotransposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 51-51 (2006). XX DR [2] (Consensus) XX CC It is an internal portion of the Copia-1_AF LTR retrotransposon. CC Coding regions are corrupted by numerous stop codons generated by CC RIP. 
XX SQ Sequence 5242 BP; 1870 A; 765 C; 597 G; 1883 T; 127 other; aggttataag cccttatagc ttattataag gagatcttta tctttatctc tattccctat 60 attctatatt ttaaaacgcg ttctgtacgc atattcttat tagttagatt aaaattctaa 120 ctatatagat ctatactact cctccttcct tctatagtaa ttcttctata gaatctagcc 180 tatcttatta aattagaagc ctctaaggcc ctattaatct atctaattaa aatagatatt 240 aatattaata ttaaacccca caaggttatc cctaccctta agggtgaatc taactagact 300 atctagatag aaggactcta gggatacctt tatagctata atactaacta ttagaagctt 360 ctataataaa gatatattac cctaaactaa ctaatactct ataaaataaa ttaagagtat 420 atatactaaa gtctcctata aagacttaga gagtagtaat ctcttaagcc ttagtaagct 480 actaaaaatc ccctagcttc tatagtagta gtagctttaa cctcctctnt nctagtaata 540 gctactatag cnctaaatta acttntatta atagtcntag agacagaggt taattaaant 600 atctaagaat aaactacatg taattactag ctcctagaat gctatgagta ataaatagaa 660 gtatttactt atcttaacta ctaggtgctt ataatactct atactactat taataatata 720 gtaaaggtat atattttagg tcttactaat astagtctag gcttttaaga agctaaagag 780 cctatataaa taatctagta gagatagtct cttcttacta tagcaaaaga tatataggat 840 taaatataan ctaaatagta attataatrc ctttctanag aaatagcaag ctgtnattac 900 tragatatat aytctaggag atcttaacta gactagatta aatattactt attcctttaa 960 ttaattaaag gcatccctta tctagataac tttataataa atataraaat aataatagct 1020 agctaataga tagccttagc taatatctat acttagttyt aatattttaa atttaaataa 1080 yrcryaayag gcatggyryn gactnaatct aatactaatn aactatctac taattctrtt 1140 taaggggggt raggtagcta atatagtaga tatagtanta attaatytaa tacctyayag 1200 tctaattcta atagtaattt aactctaaat tagactaatt aatatcctcc trataggcct 1260 rtaactaarg accctaataa taaagatact agagttayct ggtgtyacta ytatagtaaa 1320 tttagtagtt atctcctatc taaatattac ttaaagaata ctacttctaa tccttctact 1380 actacctcct tatagagaga aggctagaga gggcatggct atagatatag tagatttact 1440 actaatacta cttcctattc taattaagat aataagaatt ataataaaga ggctaattag 1500 attatattta atagagactc cctctaggta aataatatca agctctcctg ctagtagata 1560 taatataaag aggttagtac tattaactag gactatccct ataactagat attagattct 1620 agtataagta atataataac tccttactat 
tctttattta ttaactatta aagagatagt 1680 tctactagtc trtatagcta taggttaggt cttctagact aaaggctatr gyaytattat 1740 tattaatcty aycctactaa atagcactra atatatyaga tytattatac tactaaaagt 1800 ctagcataca ctagctcttg cttataatct aatttctatt agaaagctta tagaattagg 1860 tattaatact atctttagga agaatagtgg tattaagctt atctataata gcttaattaa 1920 ggcctttgct aaaataatat agaattacta cttcctttat actactagct ataaatctat 1980 taaaatattt aaataactat agcagtctgg cagactatta ggcttagaag atattaaata 2040 tatatcttac cctagatata aagtatttat taatataata actattaagg aactatagct 2100 aattagtatt aaactagctt attaatatac tatatatcta tctaagcatt acctctgtat 2160 gcttcctaac tatataacta gcttatatct taagaaaggg gggttataac ttccttatag 2220 cctatatatc ataggtaagg gctaatagtt actatatagc taggataaat ctatatagtc 2280 tagccctagt aagttcttat atattaatat ctatagacta cttagtgtct aagcctatag 2340 cagagagata tactttctaa ctattataga taatactata tagttctact aggtcttcct 2400 tcttaaatct tatacagagg tcctagataa gcttatttag gtaattaatt acctagagat 2460 atagttctcc tataaggtta agaaaatctg tagagataat gctactaagc attaactaat 2520 ataattatat ataatagata agggtattat ctaggaccta ataccttctt atatactaca 2580 gcttaacagt gttatagaga ttaagaatta ttatcttatt aagcctatta tcttagttat 2640 agcagagaac tagctcccta agtatctata agggcacctg gttcttatag ttaactatct 2700 atctaattac ctcttctatt ctaagattag tataactchc tataaggcgc tctatagtat 2760 ctagctagat atctcctatt agtatacact aggctgtaag tgctaggtct taatacctaa 2820 ggaatagtat aagaagctgg atctatatat atctaagggc tgctttatta gctatgtaga 2880 agggttctat aagatctata atatatctac taagaagatt atttaatctt ataatattat 2940 ctttatagaa gagctacaga tatctactaa gttagatata acctataatc tagattaaac 3000 taattaatat agttaatagg ctattaatga gacaaattaa ctgctagaat aatatcatat 3060 tctaattaaa ttccttaaga taagaaatag tagatactta ctacttatat agcctctagt 3120 tctaattatt ctagctctag tcccttctaa cttagattct actatatcta agatacctaa 3180 ttagctaatc taatataaat ctataggtat ccttcctaat ctcctactcc cttcttattt 3240 taaatatact ctacagctta ataatgcttc taattatcct attattaaat aattagctaa 3300 tcacagccta gagctactta atctaactaa taagagccta 
tagattcctc ctataggacc 3360 tgcttaatag ataagtaaga cttaggcctt agaggatacc ctactatatt attctatata 3420 tatttagaaa ctattatatt atgctataga agchatggag ggcctggcta tatactttaa 3480 taagctattc ttaccttagc tagtttaaat agctgatata tatattctct agataattat 3540 taatcacatc ctagacctag ttaatagagt agaggaataa ttatatacct atatagttaa 3600 atctgctata gctaatctag atatctccta taatctatag agagataatt tccttcccta 3660 tacctatact aaggctatta tatataagca tagatctaac tagtattaag ctatctataa 3720 ggaagtctta gagcttatct ataatagtac tttttaagta attaactagt aaaaagagat 3780 aaggcttcta cctcttaaat agatatttaa gattaaaagg gacaggaccc ttaaagctta 3840 ccttattatc tagggattct gctaatataa aggaattaac tttaataaag tctatacagc 3900 tattactaag ctaataagct ttaagctttt tatagctatt atagcctata atagctacac 3960 cctatactat attaatatya ctayyryrtt tctacayacc tatcttaaag aattaatyta 4020 tatatatctt ctagagggcc tttaatagat aggtaaatat actttrytta taaagacttt 4080 atatagcctt aagyagtccc cytataaata gtayatgcta gtctatract tyctcctatc 4140 tattagtttt aaatatatat atactaatta ttctrtcttt attaaacaya gyattactat 4200 actactctat atagataata ttcttatact tttaaattct aataatytta ttaataactt 4260 ccttaagcag ctaggaaaat tatttaaata tactaataat agtaaggttt ctgtctacct 4320 agggattaat atactatatt aaaagaaagg tatttatctt tattagaaaa tatatatcct 4380 aaagatcctt aggtagttca gcctgtctaa ctgtcatccc tatactatac tatataataa 4440 taagragata ctctctaatt ctattagtaa ggctataaaa gatrayattc ttctatacta 4500 rgagatratt agcttrctta cctagcttat actagraacc tatctagata ttacttttac 4560 tatatctaag cttrcttact tyactaggaa tcctagccyt aattacttta ttatagtaaa 4620 gtatrtattc trctatctag yagagatrct yttactytta ttattttatc cttytatryt 4680 tagtaakctt aatagtttta ttaatactaa ttaggctagt cctyayacta ttaaattagt 4740 cttaatatct agatatatct tcctattagr taatttacyt atctcttart atttaaaasa 4800 gcagactagy attactctct tattaactaa attagaatat attactatag ccctyayagc 4860 ttaagaagta atctagctta tactaatctt atctaaactc tctatcctta atagggtcta 4920 ttatyrcctt cctatatrya tttryaytaa taattaaggg gcttarttat taarytataa 4980 tryagagtat yatacttaga ctaagtatat taatattara tattacttyc 
tttattaaga 5040 ggttatagct agtaaactag agtatctata tatactaacc actaagyagg cagctaatar 5100 ccttactaag ccyctartaa mgaataratt tatatagttt attaaatagc taggaatart 5160 taatctacta gagactatya taccttccta attaggtart aagttaagya yrtatrctaa 5220 cttcagcttr gagyaagggg gg 5242 // ID Gypsy-114_MLP-LTR repbase; DNA; FNG; 813 BP. XX AC AECX01000711; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-114_MLP_; KW Gypsy-114_MLP-I; Gypsy-114_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-813 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000711; Positions 45195 44383. XX SQ Sequence 813 BP; 222 A; 152 C; 139 G; 300 T; 0 other; tgtcagccag ggcatatgtc cctacgtgac atatcctcta actaacaaat ggaagcgcgg 60 aaccatacat gaaagactac tgtcatattt tctttactct tttctcactt tacatcagaa 120 tcattctttt ctacattaca tataatttta aatttttcat aatatctttt ttatataaca 180 tccattaact aattcaaatt aactgtggaa agtaagattg aattaattaa agaagttatg 240 atcccctgaa gggagtagaa gtctcagttc ttgactctcg agttggttgg aggacccagc 300 tcgagtttca aattcatcta aaaccaattc ggacataaac gcctgggaaa cgttagtcca 360 atggtctggg attacatacg accggatggg gtgatgacct ccttgatttt gggatactac 420 ttgaaaaggg aactataggt ttggtggatg atctctttgg gatttattaa atgtcatctt 480 aacggatttc cgtgttctta tttcagatat ttgataattg ttttgttcga gtgatgtttg 540 ttttgagtat aagaaccacc ttgttttctc ttagtttttt gtttctgcat tcctgagaag 600 atcattaatc taaacattag tttaaactag ttgctttcaa tctatcatcc gcgtccatat 660 tactccagtt accgttcaac ttctgttaaa tcctctctgg ttagtaatag ccacgccttg 720 atcttgaaat cacctcttta ttcttttgat ttgatctttg aagattccgt gaatcctagt 780 tcactcgacg ttctattaac 
gtgcttgttg aca 813 // ID Gypsy-2_BFB-LTR repbase; DNA; FNG; 149 BP. XX AC AAID01002526; XX DT 25-FEB-2011 (Rel. 16.02, Created) DT 25-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Botryotinia fuckeliana genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_BFB_; KW Gypsy-2_BFB-I; Gypsy-2_BFB-LTR. XX OS Botryotinia fuckeliana OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Helotiales; Sclerotiniaceae; Botryotinia. XX RN [1] RP 1-149 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Botryotinia fuckeliana genome."; RL Direct Submission to RU (25-FEB-2011). XX DR Genome; AAID01002526; Positions 8529 8677. XX SQ Sequence 149 BP; 28 A; 47 C; 29 G; 45 T; 0 other; tgtcacggcg ccaactacat gttctgtagt tgccttcgtg ccttaggcac ggactactag 60 cgtgccctgc ttcctataag tagggcctca ctcttccata gctcttccac ccttatggta 120 caatatacgt ctttacccgt gccttgaca 149 // ID Gypsy-75_MLP-I repbase; DNA; FNG; 6120 BP. XX AC AECX01001083; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-75_MLP_; KW Gypsy-75_MLP-LTR; Gypsy-75_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6120 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001083; Positions 89059 95178. XX CC Positions [4920-5399] - Integrase core CC 'GGAGA' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(2127..4061,4065..6017) FT /product="Gypsy-75_MLP-I_1p" FT /translation="MGYFRVDCTHSGDYHIDDESTITTFIITSLRDKYDLI FT LGMPWIRRNHSRINWKEGSLLRHNLSIAVADATSSRPEKTLTDQVLEPDRH FT ARQLDEGAEIVNDSSMPPQCEYFSCTINPSEETSGKLNHHLENSLPNEAAT FT PNSISRMESPAVNQELTLSNPPNSSEDHELDPMRKARKSDKGVEIVNDSFI FT PPQSESVNAQSTMSCETMKKQSPLLQGKISRLATRALPQGPARRLYSQIIS FT KQQHSLSHINAASASWNVSAKLNAELTQKAPTRTAAELVPKCYHKYLDMFE FT KSKSNVLPPHRPYDFRVDLLPGATPQAGRVIPLSPKESAVLDEMLEKGLEN FT GTIRRTTSPWAAPVLFTGKKDGSLRPCFDYQKLNAVTVKNRYPLPLTMELI FT DSLLDADEYTSLDMRNGYNNLRVREGDEAKLAFICKQGQFEPLTMPFGPTG FT APGFFQYFIQDVLKAHIGRNVAAYQDDVLIYTKAGVDHKAVVKQVLDLLKA FT QNVWLKPEKCKFSKSEIAYLGLVISKNQIKMDETKVKAVKDWPTPKNLGEV FT QTFIGFANFYRRFIGQFSKIARPLHELSQKDTAFEWTTERQKSFELLKDAF FT TSAPILKIADPYKPFVLECDCSDYALGAVLSQISDSDGQLHPVAFLSSLIK FT AERNYEIFDKELLAVVSAFKERRQYLEGNPNRLNVIVYTDHKNLESLMSTK FT ELTRRQARWAETLGSFDFEIRFRPGKQSTKPDALSRRPDLKPKDGEKLSFG FT QLLKPQNLPADAFIDELDLFESWVIDKQELGGAEVNELVDNSDTDDKDSEL FT LWNDARILEEIKSKSKSNPKIIEILKLCNGASWTKLVKDYSKINEVLYFKG FT KVVVPNDHNLKVQILKSRHDSRLAGHPGRMRTLALVKRAYHWKSVKAFVNN FT YVDCCQSCQRIKSRTMKPFGSLKPLPIPSGPWVDICYDLITDLPESDGNDS FT ILTVVDRLTKMAHFVPCKKTLSSDELADIMIKHIWKLHGTPQTITSDRSNV FT FISKLTKDIHKRLGITTQSSTAYHPQTDGQSEITNKAVEQYIRHFTSYKQD FT DWSSLLPMAEFSYNNKDHVAIGMSPFKANYSFNVSFTDVPTGDQCLPSAKG FT RLAQIKDTQRELKDALQFTQETMKAQHDKKVQATPNWKEGVMVWLNNKNIS FT TTRPTAKFSHRWLGPFPILKHISANAYKLDLPKSMEKIHPVFHVGLLQRFE FT RSKIANQDQVPPEPIIIQESEELEIAEILDKRRRRGKVEYLISWKGYDANH FT DSWEPENLMSNAQELINEFNIKYPQAETRYRRVRR" XX SQ Sequence 6120 BP; 2096 A; 1308 C; 1239 G; 1477 T; 0 other; tattgcaacg tcataattca agagatcgga gaagattgat aaattgaaga acaagcaaga 60 aaaaaaaaaa aaagaagaat taaaagcaat caaaacgaaa aaagaaaaga aagttaaagt 120 taattaaagt tagtagtaaa acaaaagtga agatcaaaga agcaaaacta tacggagaaa 180 gtaaagatta gaagattaga taattcaaga atcgattaaa gtttaagatc tgtaaaagtt 240 tgaagatcca gactcaatta ctagcctccc tcggaatcaa ccaccacgcc ggaattcacg 300 accccccaaa gtccgacttc tacaacttct acagaccacg actctgaatt cctagacgct 360 
ctcgaaccca cgtctaatcc acatatgtcg gaagtcaaca tgtcagaagc tagcttggcc 420 ggtgttatgc agcaactcaa taatctaacg gctagattaa atgaggaagt caatcgccga 480 gaacaggctg aagccaacca acgtgcatcc gacgccaaac gtattgaagt cgagcaaaga 540 ttactgcaat tagaacaaac gcaaacgtcg tgatcagctc ctgcagctca agctcctcct 600 atgccttcct caactatccc atcgtttcca gtaagcgctg gtgcaaccca cgcacaggtg 660 caacatatca gaccacctaa aatcgctact ccggataaat ttgatgggtc taaaggatct 720 aaagctgaga tttttatgaa tcagattgga atctacatgc aaatgaattc aacgtccttt 780 gttgacgaaa aatctcaagt tgcgttcgca atatcctaca tgaccgggaa agctagcatg 840 tggagtcaag cctttaccga tcaattgcta gattcagaac aagctcacct tgtcaattgg 900 actagctttg ccaactcttt ccgtgctaca ttctttgata ccaaaagact tgccaaagct 960 gaaaacaaaa ttcgtagtct caagcaagtt aaggctgtat ccgattattg gatcaagttt 1020 tctgagctct ccctagttgt caaatggcct gaaaatatcc ttatgtcaca ttttgaacaa 1080 cagtcaactc tgaaataacc cataccactt ataacacata ctcttgccta acccatactt 1140 tttttggtcc ataattttca ctttttagcc ccaaagccca ttgtgataac acatactcta 1200 tgaaaaccca taccttttgg ggtccataaa aaggtatggg ttgtttcaga gacaactgaa 1260 ggtctcaaac gcgaagtcag tatttatatg attcaacaag agttcaaatg tgtagaagaa 1320 atggcccaat tcgcaatcaa actcgacaac aaattgcaca accgatctca gagcttaacg 1380 atgccaacca gcacttcatc aactccaatc gccgatcctg atgcgatgga ttgctcagct 1440 tatcgaatca acatctccca agaataatat cgaaaaagag cggcaaatcg tgcttattat 1500 gaatgtggga aaaccaacca tgtgatcgcc aattgttata ttgcaaagag aaagaagaga 1560 ggaggaggag gattcagaag tagtgaagtt gatcagttaa aggccagaat tgcagaatta 1620 gatagccagt tgtcagaagt caaaatcagt gatacaaact taagtagatc agagttgtca 1680 aaaaatggag aagctcggga gtgctagttg tgcctccccc gagcaaaata cctttaggtg 1740 tggaagttgg tgacattgat agtcttgaaa tgaatgatac tagaattatt gaccatgtta 1800 ccctccacga cccaaactct gacacaacaa tcaccgcacg agccttaatt gacagtggtg 1860 ctacccacga ggctattagc ctcaagtttg tgagaatgta taaccttcac actgttcctt 1920 tatcacattc tcacagcgta accggattca gcggacatga atcccgagtt acagttgtct 1980 ctgaaataac ccataccttt ttatggaccc caaaaggtat gggttttcat agagtatgtg 2040 ttatcacaat gggctttggg 
gctaaaaagt gaaaattgtg gaccaaaaaa agtatgggtt 2100 aggcaagagt atgtgttata agtggtatgg gttatttcag agttgactgt actcactctg 2160 gcgactatca cattgacgac gaatccacaa tcaccacttt catcatcaca tccctcagag 2220 acaagtatga tcttatactg ggaatgccgt ggataagaag aaatcactca agaatcaatt 2280 ggaaagaagg cagtctatta agacacaatc tcagcattgc tgttgcggat gcaacatcgt 2340 cgaggccgga aaaaaccttg acggaccaag tattggagcc tgataggcac gctaggcaac 2400 ttgacgaggg ggcagagatt gtaaatgact catcaatgcc cccgcaatgt gagtatttct 2460 cgtgtactat caacccaagt gaagaaacca gtggcaagct taatcatcac ttagaaaaca 2520 gtttacccaa cgaagcagct acacccaaca gtatatcacg aatggaatca cctgcagtta 2580 accaagagtt gactttgtcc aatccgccaa actcctcaga agaccacgag ttggacccaa 2640 tgaggaaagc taggaaaagt gacaaggggg tagagatcgt aaatgactca tttatacccc 2700 cacagagtga gtctgttaat gctcaatcaa ccatgtcttg tgaaacaatg aaaaagcagt 2760 ctcctcttct acagggaaaa atctctcgat tggcgacacg agcacttcca caaggaccag 2820 caagacgcct gtattcgcag ataatctcaa agcaacagca ctctctctca cacatcaacg 2880 ccgcatctgc ttcatggaac gtatctgcta aacttaacgc ggaactaaca cagaaagcac 2940 caacacgaac agctgccgaa ctagtcccca aatgttacca caagtacctt gacatgttcg 3000 aaaagagcaa gtccaatgtt ctacccccac atcgtcctta tgacttccgc gtggatctat 3060 tacctggagc gactcctcaa gctgggcgag taatccctct atcacccaaa gaaagcgcgg 3120 tattggacga aatgctcgaa aaaggcttag agaatggaac aatacggcgt actacttctc 3180 catgggcggc tcccgtattg ttcactggga agaaggatgg cagtctacga ccatgctttg 3240 actaccaaaa attgaatgca gtgacagtga aaaatcgata tccgttgcct ttaacaatgg 3300 aattaattga cagcttgctt gatgccgatg agtataccag cttagatatg agaaacggat 3360 acaacaactt acgtgtaaga gaaggcgacg aagccaaact ggcgtttatc tgcaaacagg 3420 gtcagtttga acctctcacg atgccttttg gtcccactgg agctcctggg tttttccaat 3480 actttattca agacgtactc aaggctcaca taggacggaa tgtagcggct tatcaggatg 3540 atgttctcat ttatacaaaa gctggggttg atcataaagc tgtggtaaag caagtcctag 3600 atcttctcaa agcacaaaat gtatggctca aaccggagaa atgcaagttt tcaaaatcag 3660 aaatagcata cctaggacta gtaatatcta agaatcaaat aaaaatggat gaaaccaaag 3720 tcaaagctgt taaagactgg ccaactccga 
aaaatttggg agaagtccag acattcattg 3780 gctttgcaaa tttctatcga cgatttattg gtcaattctc aaagatagct aggcctttac 3840 acgaattatc acaaaaagac accgcctttg aatggacaac tgaacggcaa aaatcatttg 3900 aactacttaa ggatgcattc acctcagccc caatcctcaa aatcgctgac ccatacaaac 3960 cgtttgtatt ggaatgtgat tgttccgact atgctctggg cgcggtttta tctcagatct 4020 cagatagtga cggacaactt catcctgtag cttttctatc ttgatcattg attaaagctg 4080 aaagaaatta tgagatattc gataaggaac ttctagcagt tgtcagtgca ttcaaagaaa 4140 ggcgccagta cctagaaggg aacccgaata gactaaatgt gattgtatac actgatcaca 4200 agaatctgga atctctcatg tcaacaaagg aactaactag aaggcaagct cgttgggcgg 4260 aaacattggg cagttttgac tttgagatac gattcagacc ggggaagcag tcaactaaac 4320 cggatgctct ttcacgtcga ccagacctaa agcctaaaga cggtgaaaaa ctatctttcg 4380 gacaattact caagccacag aacttgccag cagacgcatt cattgatgaa cttgacttat 4440 ttgaatcatg ggtaattgac aaacaagaac ttggaggtgc tgaagtgaat gaattagttg 4500 ataatagcga cactgacgac aaagactcag aactgttatg gaacgatgcg agaatactag 4560 aagaaatcaa gagcaaatca aaatccaatc cgaaaatcat tgaaatattg aaactttgca 4620 atggagcttc ttggacgaaa ttagtcaaag attacagcaa gattaatgaa gtcttatatt 4680 ttaaagggaa ggtggtagta cctaatgatc ataatctaaa agttcaaatc ctcaagtcac 4740 gtcatgatag tcgtctagca ggccacccgg gaagaatgag aacattggca ctagtcaaaa 4800 gagcttatca ttggaagtcg gtgaaagcat ttgttaacaa ctacgtcgac tgttgtcaat 4860 cgtgccaaag aatcaaatca agaactatga aaccgtttgg tagcctgaaa cctctcccta 4920 ttccaagtgg accctgggtt gatatctgct acgatctaat taccgactta cctgaatcag 4980 atggtaatga tagcatactt acagtagtgg accggctaac caaaatggca catttcgtac 5040 catgcaaaaa gactctttca tcagacgaat tggcagatat catgatcaag catatatgga 5100 agttacacgg aacacctcaa accatcacgt cggacagaag taatgtcttt atctccaaac 5160 tcacaaaaga tatacataaa agattaggga taaccactca gtcttcaaca gcctaccacc 5220 cgcaaactga tggtcagtca gaaatcacta acaaggcggt tgaacagtac attagacact 5280 tcacaagtta taaacaagac gactggagct ctttattacc catggcagaa ttttcataca 5340 acaacaagga ccatgttgcc ataggcatgt caccgttcaa ggcgaattac agcttcaatg 5400 tcagtttcac agatgttcct acgggagatc aatgcttacc 
atcagccaag ggaagattag 5460 ctcaaataaa agacactcaa agggaactta aagacgcgtt gcaattcact caagaaacaa 5520 tgaaagccca acacgacaag aaagtgcaag caacaccgaa ttggaaggaa ggggtgatgg 5580 tgtggctaaa caacaaaaac atatccacca caagaccaac tgctaagttt tcgcatcgct 5640 ggcttggtcc gttcccaata ctgaagcaca tatctgctaa cgcgtacaaa ctagatctac 5700 caaaatcaat ggagaaaata cacccagtgt ttcatgtagg actactgcaa cggtttgaaa 5760 gaagcaaaat agcaaatcaa gatcaagtgc cacctgaacc tattatcatt caagaaagtg 5820 aagagttaga aatagctgaa atcctagata aaagacgccg aagaggaaag gtagagtact 5880 taataagttg gaaaggatat gatgctaatc atgattcatg ggaacctgaa aatttaatga 5940 gtaacgcgca agaacttatt aatgaattca atattaaata tcctcaagca gaaacaagat 6000 accgtagggt acggagatag tcatgagggt taaagctttt ttcccacagg gtttttaatg 6060 ctaacctgtg gaaagatgct aacccgtcaa gagggggttg agtcataaaa gggggagtgg 6120 // ID Copia-46_MLP-LTR repbase; DNA; FNG; 864 BP. XX AC AECX01001112; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-46_MLP_; KW Copia-46_MLP-I; Copia-46_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-864 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001112; Positions 457359 456496. 
XX SQ Sequence 864 BP; 257 A; 169 C; 136 G; 302 T; 0 other; tgttgaaata tccaatcgac gtagtataac ataataggag ataagccaag tcaaatgtca 60 gttattaggt aatggtttat atcaatttta accatgagat cacgaacgta cacaatcaag 120 aaagagtccc aagctaagct agattagagt agtagaagtc gtgagatgat tagagttcac 180 gtggtttaat tcacggacgg ctttattttg ttgtttcgtt ttatttcatt ttctttcttt 240 ttacttaagg actgtgtcga gtagcatgta gtgtttctct ttcccccatc tttgggaaga 300 gaaacgtacg tggccatttc atctttgtta tcaataaaag ctacttacca tgattgatct 360 aacatgttat cattattagc tttttgtcca tagatcaaat aattacctca tcattacttt 420 gccacttata ctaaacctga accattatag gtactaaact aactcatatt gttttattct 480 caattttact tacgcttcat attcttcttc attatatgtt tcttatgttg taggtcagtc 540 acgtccttaa gaacaggatc gtggagcttg ttatctaact tctatcacac acacattgct 600 aggtaaatat atataaatct ttgggaagag aaaccttttc gtccatagat caaatactta 660 cctcatcatt actttgccac ttatactaaa cctgaaccat tataggtcag tcacgtcctt 720 aagaacagga tcgtggagct tgttatctaa cttctatcac acacacattg ctaggtcagt 780 cacgtcctta agaacaggat cgtggagctt gttatctaac ttctatcaca cacacattgc 840 tagatccgta ttccaagaca ttca 864 // ID Gypsy-69_MLP-I repbase; DNA; FNG; 10027 BP. XX AC AECX01001171; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-69_MLP_; KW Gypsy-69_MLP-LTR; Gypsy-69_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-10027 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001171; Positions 47790 57816. XX CC Positions [8895-9239] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(4377..5603,5607..8816) FT /product="Gypsy-69_MLP-I_2p" FT /translation="MNPWRDGRESPAISSVFGRDNPPHQGSSSELSDPHTI FT RIDSPPNHRNGPRLPQSQTPVSRLAQGMSNLDFATNRRNLHQSGQHVPHAP FT QPMPGSQPHSWQRFAHRFQQDFGPQQMPQFAPPPQMFPEGQPQFAPPPPPS FT FAPPPRYSIPPHFVPPPHFGPAPVPQFAPAPHFVPQPVHNQFPFQPQHPQP FT AQQQQSSSTARPPSRAPTAEVMLDTRAPKIRDNLRFFGDNRGLKQFLVEIH FT DELDQITWKDDKSKINWIARHFTSLNASQSSTQIWFMGLLERNAYSQGYLN FT PYGNLKSLEYKLPELLTLDNFLDELIYKFGDKHADKTCREELEACKQGKLS FT IIDYNSKFEMLALHVKKTEEDKILLYVEGLHPSIQLEATRISGWVNETDLL FT RKQAMAVEAADILDLSKVTQVHPHLRIAGEVYKHPNHHPSTRPHHHQSQSS FT NRNNSGPVPMDIDVNDVSLQHDGSNPFPAIRKICNTKSLCYDCLKPYDDEH FT KRLRNLHGRRSCPNPPVKLEEKLKMLRTTVDSASQRQTQVSAMELDDMEYA FT AYTNLPTATIEETSRFVESFWESLSPPVYPTSTSETHHFSAQDIQVDAVRV FT QADDENPRRFLIPLHLVSDNLPVVVMALVDTGAMDSFIDFGFANLHHLNLN FT KKKIPQKVSGFDGGSSTSVTKEWCGVMSVTDVDGKRSTFKAKLGVTKLGGG FT HDVILGLPWMIENEVTLLMSKKGRWLEIGKSIVSAVIVKDEDVYLSLVSCK FT QDSSVISFSPSTFPDTTATTLHTVPSTSPLPSSDKSFLFNLPDSCKKYLHV FT FSQQDSVLPPHRSFNIAIDLKPGCEPPFGGLYNLAPNEQIELRAYLNEQLS FT KGFIRPSKSPAAAPIFFVKVPGKKNRPCVDYRGLNKITKRDSYPIPVMSWL FT LNQLRGCKFFAKIDLKAAFNLLRVAEGDEWKTAFQTPWGLFEYTVMPFGLA FT NAPAVFQRFIQWVLREYLDVFCFVYLDDILIFLKNEGDHAQHIEKVLSKLS FT EHKLTASPEKCQFFAKEVVFLGFVISTEGISMDPSKLRTITEWPFPRDLSD FT LQRFLGFSNFYRRFIANFSRIVGPLTSLTTKSSDATAGLRKESSRIAFEEL FT CRLFSNAPFLLHFDFNLPRVLQVDASGYAYSGILSQKSDEGQLRPVAYFSK FT KLTESERRWQIHDQELGAIVACFHEWRAWLMGTDTPVAVLSDHANLRYFMS FT AQMLTPKQARWASFLGEFNFEILHTPGKTNPADPASRRSDFACGKKDSAKV FT VLLGSRDIKHANISAIHINNPGSFDVSTYMPISDLSKKKITDAYLADGLIA FT GVHPTFLHFLDGLWWWRDRVYVPSVLRQGFLKEIHQSSTGGHWGSLKTMDL FT LSRSFGWPNLRKDVLEFIQGCASCQQVKVDHRPPQGQLIPLPIPDQPWSTI FT GVEFIVKLPLSSGFDLIMVVVDHLSKAAHFIPAKETWSAEELARSFIAQVF FT RFHGLPDTIRGSQT" XX SQ Sequence 10027 BP; 2657 A; 2307 C; 2077 G; 2986 T; 0 other; ttagaccgct ccgacccgag agtctcgttt tcacaattcc agatacaacc caggccctta 60 catttttatt ccgatcttct ttctagttga aattttctcc gggatcactt agatccacct 120 tttaatactt tgttccttta aattttctca aatcgttctc taaataaatt tcttcaggat 180 caattagatt 
cacttttaat atatttttta ctcttcaaac tttcgcagat cattctttta 240 tttttccttt tttgaaaaaa ttttatgagc actttgtttt cgcacatttc aatttcacaa 300 gctttcactg ggaaaatgta gtatttttcc caagtgaaaa atgattgacc tgaaggacaa 360 aaaaaattga aaaaggtgga attctgcatt cattctgcat gtgcattaac actgcatttt 420 tctgcattga tgctgcattt tttttcatta atggtgcatt aatggtgcat ttttatgcat 480 taacatagca tttttaatta tttttgacat taattctaat cattttgcat agccgaaaaa 540 tactacatct gtgcagtgtt ttatatatca attcagcccg agccccgaag actatccgtc 600 gggtagtggg cagcccggcc cccgaaccga cgggtagagg aactctttca gcccgtttcg 660 aagcccccga tcgagttttt tttgtaaaaa aaggctgccc ggggcccgaa cgagtaatcc 720 gccggatacc cggcgcctga gccaaaaagt gcgatttgcc cagggtttga acaccggacc 780 ggggaaatta gcctatatcc ggcggtctat gagatattta acatcaattg ctgcctgcac 840 ctgaccagat actgctaata tgacacgatg aagctcgact gattgttcta ttcgtcgact 900 tcatgacttt ttgattagcc tgcacactcc aatactaacc tcaaagaaaa catatacgaa 960 caaactaggt tactacggtt ataaactagg tttacaaagc tccctcctat ttccacctgg 1020 tcaccctgtg ccctttgatc tttcctcaga cagctcgtca tcaaagattc cttttgaacc 1080 cttatttttc acttcttgac ccttatattt cccttttcta gctttctatc agctttcaaa 1140 aactccgtaa atactaaatc atgtctcaga ttaccaacac gagtagtgtt tcatcgcaat 1200 ctgctatcac tactgtggat ttaaccactg aaccagatac tgctaatacc tccaccaccg 1260 agattatagc atctgctgtt cgtccagcaa gctacgtctg gcttcacttc gtcaagctcc 1320 gaaaacgtaa cgttaacaga tgtggtgtta tcaagcctga cggatcaaaa tgtggtgccg 1380 aattgaagcg tgatgagacg ggtagtacta agtcgatgaa gaatcatctc gagtccaagc 1440 acggaatccg tgatgctaat cttcctaatc aaagcaacat tcttgctgct ttcaagaagg 1500 tgaagactga tcacatggta agtcttattg tttatctttg tttaactttc cttaacacta 1560 atgatgtggt attacagcgc aaacttgatg caactagttt gatgactgct gtctcgtatt 1620 ttgtggccga ctgcgacctt gctttctcta ttgtggatcg aaactcttat cgagaactct 1680 tgacgctttg taactaccaa gttaacggaa tgctcaccaa acgccattcg cttgctcaac 1740 atactcgcaa aatctattac tattacgagg aatacataaa gaagcagtac ttatcttcaa 1800 ctcccgctat agctataact caagatgctt ggacctctcc aaacaatgtc ccattcatgt 1860 cacttactgg gcattttatc acggatgagt 
ggaagttgac cgatatcact cttggtattg 1920 ctgagattaa gggtgagttc gatttaatca atgattgatt ttatttctat gtatttaaca 1980 ttcaatctat atacttttct cttttgcttt ttttaaattt atttattatc tatgtatatg 2040 taggtgctca tgatgcagag acttttgccc atatcatatt cgactacttg aagaaattcg 2100 atgtcactca aaagttgtct gcgatgaccg cggacaacgt tggtacgaac gccgcgattg 2160 gccgaaagct tggattgttc cctgaaatca ctttcgatca ctcgactcaa gtacatggat 2220 gtgtggcaca tgtaatcaac ctcgcggcca aggaaggcct caaggaattt ggtgaggtga 2280 ccgacttgga agaacatcca acgagctcaa tggatgtccg gaacctcgtc gatcctccag 2340 atgtgcttca attcaacctc aagactatct atgcccgatg ccacggcctt gtgaagttaa 2400 cccgatctag tccacaacga gctcaggcct ttgctaatgt tgtcacggca gtgcgacacc 2460 ttgccaacac acttgattcc gataccgatc atattgaacc tcctgctccc cgtgctaaca 2520 accctcaaga tgaagaagat gttaattaca tgatgcaaga caaaaacagt gaagaagtac 2580 aatctactca gaagagtcgt catggtattg cgaatcgact ggtccccgat gttcccactc 2640 gttggaattc gtcctattac atgttcttaa ggcttctccg tttacgttca gcttgtgatg 2700 aattctgcaa gggtcgagat ttcaaaaaat ttgctttgtt acctgctgag tgggactacg 2760 tacaacagat gtgcaatttc ttagaaccac tcagcgcggc aactgatctt ctttgtcgat 2820 caaagtatcc gacgatgcaa caagtggtgc ctatgtacgt ggcggtgatt caaggtctca 2880 aattggtcag tatctcatta tcttaataac tcgttagatc aagtctccta actttattca 2940 actttggtca atcaggtgtc tcagaagtac gatcacgatc aattggttcc tgcctcgaag 3000 aaaatgattg caaagctcga agattacttc aagaaggtca taacaaaacc tggcccgatc 3060 tgtgctacta tacttgaccc acgcttcaaa cttctttatt ttcaagtaag tctaagtcaa 3120 agataacaca ccaattctac tatgtactga tgatattttc ataactttca aaacaggagc 3180 gagagttgat cttgagtgat tatgacctat tcactcaaga catcagatca atgttcgaaa 3240 tcgaagcgca gaagtatgag tacgctgcac ctgaagttcc agtgacctca aacgtacctt 3300 tgacctcttc gaactcttca actgctcagt acttcgaaaa tgccatctac ggcgaacgag 3360 acaatcttcg tgatcaatcc attggtactg aaatcaatcg atacttgtgc gagctacccg 3420 aaccgcgtga aaccgatgtt ttgtcttttt ggaaatctcg ccaggagatc tatcctggcc 3480 tggctaagat ggccaagata tttctagcca tcccagctac tagcactccc tccgagggtg 3540 tttttagtaa aagtaagaac atcttgagcc cccagcgtgc 
ttctctttcc tctttgaatg 3600 tggaggtact cttatgtctt aaagattggt accgtttgtt tggaccttta tttgttgttg 3660 atgacgagta ggaaagtaaa ctttccagca ctttgttttt ctaaatcttt agtaagattt 3720 tatatgccta tatgcctatg ccttgttttc ttttttcctt ctgtcctctt ggaccctctc 3780 atctactttt atctacaata cacgtctata agtagatatt tttgttaaac cccctatatg 3840 caattatccg ccggataacc cggaaatccg atgttttacc cagaaccaca cccgaagacc 3900 gatggtcggg gatccccggc catcttgggt ctcagagcct cttcccgaag accggaccta 3960 tcggggtacc cgtcgggtac tcgatcgggg ggcttcttcg gggctcgggc tgaatatcaa 4020 tacattcacc ctgtcaagtt ttctactgaa gatttgttag attcctttag tgttttaccc 4080 ttcttttttt cagattcccc acttgaagat ccctgtcaaa ccttatcagt ctttcggatt 4140 cttccttttt ttcattatcc ggaaacaaac cccttcttag acacaccgac tacgctcccc 4200 ctcaccgctg agtccttatt tcttcgttct ttcacatatt accttgatct agaggaatac 4260 gaaaggttat atctggatcg acgaattcga tattgctccg atctcctcac tccaccaact 4320 ttgcttaact cctcgccacc aacgtctttg gaatcttcga attcagatca agaacaatga 4380 atccctggag agatggaaga gaaagtccgg caatatcttc agtgtttgga agggataacc 4440 cacctcatca aggcagcagc agtgagctct cggatcctca cactatcagg attgattccc 4500 ccccaaatca tcgcaatgga ccccgtctcc cacaatccca gacccccgtt tctcgactag 4560 ctcaaggcat gagtaatcta gattttgcaa ctaacagacg taatttgcat caatcaggtc 4620 aacacgttcc tcatgcgcct cagccaatgc cgggatctca acctcattct tggcaacgct 4680 ttgcacatcg ttttcaacaa gactttggac cccaacagat gcctcagttc gcaccaccac 4740 ctcaaatgtt tccggaaggt caaccacagt ttgcaccacc acctccacct tcgtttgctc 4800 ctcctccacg ttattcgatt cctccacatt tcgtgccacc tcctcatttt ggacctgctc 4860 ccgtaccaca atttgccccg gcccctcatt ttgtgccaca gccggtccat aatcaatttc 4920 cttttcaacc gcaacatcca caaccagcac aacagcaaca atcatcatcg acagccagac 4980 ccccttcccg agcaccgact gctgaagtta tgttggatac tagagcgcct aagattcgtg 5040 acaatcttag attctttggt gataacagag gtttgaaaca gttcctggtg gaaatacacg 5100 atgagctgga tcaaatcacg tggaaggacg acaaatcaaa aatcaactgg atagcacgac 5160 attttacctc tttgaatgcg tctcaatcta gtacgcaaat ctggttcatg ggtcttttgg 5220 aaaggaatgc gtatagtcaa ggttatttga atccgtatgg taatttgaaa 
agcctagagt 5280 ataagttacc ggaattattg acgcttgata atttcttgga tgagttgatt tacaagtttg 5340 gagataaaca cgcagataaa acttgtcgag aagaattgga ggcttgcaaa caaggaaaac 5400 tatcaatcat cgactacaac tcaaagtttg aaatgcttgc tctgcatgtc aagaagacgg 5460 aagaagacaa gattttgttg tatgtggaag gacttcaccc tagcatccag ctcgaagcaa 5520 cacggatttc gggttgggta aatgagactg atttgttgag aaagcaggcg atggcggttg 5580 aagcagccga cattctagac ctttgatcta aggtcactca ggtccatccc catttacgaa 5640 ttgcaggaga agtttataaa caccctaacc atcatccttc gacacgaccc caccatcatc 5700 aatcgcaatc ttcgaatcgt aacaactctg gtcccgttcc aatggacatc gacgtgaacg 5760 atgtttctct ccagcatgac ggttctaatc catttccagc tatccgaaag atctgtaaca 5820 cgaagagtct atgttacgac tgcctcaaac catacgatga cgagcacaaa cgactgagaa 5880 acttacatgg cagacgatct tgtccaaacc cacccgtgaa gttagaagag aagttgaaga 5940 tgttacggac gactgtggat tcagcctctc aacgacaaac gcaagtctca gcaatggaat 6000 tggacgacat ggaatacgct gcttacacaa acctaccaac agcgaccata gaagagactt 6060 ctcgttttgt tgaaagcttt tgggaaagct tgtcgccacc agtttatcca acttcaactt 6120 cggaaactca tcacttctct gctcaagata tccaagtcga tgcagttcgc gttcaggcgg 6180 atgatgaaaa tccacgtcgg tttttgattc cattacatct tgtctctgac aatctccccg 6240 ttgtggttat ggcacttgta gacacgggtg caatggatag tttcatagat ttcggttttg 6300 ctaatctaca tcatttgaat ctgaataaaa aaaaaattcc acagaaggtt tcaggttttg 6360 atggcggaag cagcacgagt gtcacgaaag aatggtgtgg agtgatgagt gtaacagatg 6420 tggatggtaa acggtctacc ttcaaggcga aattaggtgt tactaagcta ggcggtggtc 6480 atgatgtcat tttaggattg ccatggatga ttgaaaatga ggtgacgctg ttgatgagta 6540 agaagggtag atggttggag atcgggaaga gtatagtgtc agctgtaatt gtgaaggatg 6600 aggatgtgta tttatcatta gtttcttgta agcaagattc ttcggttata tctttttctc 6660 cctccacttt tccagatacc acggcgacga ctcttcatac ggttccttca acatcccccc 6720 tcccttcttc ggacaagtct tttcttttta atcttccaga tagttgcaag aaatacttac 6780 atgttttttc ccaacaggat tctgtattac ctccccatag atcgttcaat attgcaattg 6840 atctcaaacc gggttgtgaa cctccgtttg gaggtttgta caaccttgcg cctaatgagc 6900 agattgagtt gcgagcatac cttaacgagc agttaagtaa aggcttcata cgtccatcta 
6960 agtcaccagc tgccgccccg atcttctttg tgaaagtccc tggtaagaag aacagaccat 7020 gtgtcgatta ccgaggcctt aataagatca ctaaacgtga cagttacccc attcccgtca 7080 tgtcttggtt gttgaatcaa ctgaggggat gtaagttctt cgcaaagatt gatctgaagg 7140 cagcatttaa tttgcttcgt gtggcagagg gtgacgagtg gaaaacggct ttccaaaccc 7200 cttggggttt gtttgagtac accgtgatgc cttttggact tgctaacgct cccgcagtct 7260 ttcagagatt tatccaatgg gtccttcgag aatacttgga tgttttttgt tttgtttatt 7320 tagatgatat actgattttt ttgaagaatg aaggcgatca cgctcaacat atcgaaaaag 7380 tgctttctaa attatccgaa cataaactga cagcttcccc agaaaaatgt caatttttcg 7440 cgaaggaagt agtattttta ggcttcgtga tatcaacaga aggtattagc atggaccctt 7500 ccaagctgag aacgatcacg gagtggccat ttcctcgtga cttatcggat ttacagcgct 7560 ttctaggttt ctcgaacttt tatcgacgtt ttatcgcaaa tttctctcgc attgtgggac 7620 cattgacatc attgacaaca aaatcatcag atgcgacggc aggactacgg aaggaatcgt 7680 caagaattgc atttgaagag ctatgtcgtt tgttctctaa tgcacctttt ctgttacatt 7740 ttgactttaa tttgccgcgg gtattacagg tcgatgcgtc aggatatgct tattcgggta 7800 ttctatcgca gaaatcggat gaaggacaat tacgtccagt ggcatacttt tcgaagaagt 7860 taactgaatc ggaacgtcgc tggcagatcc acgatcagga gctaggcgcg atagttgctt 7920 gttttcacga atggagggct tggttgatgg gcactgacac acctgtggcc gttttatctg 7980 accatgctaa tttgcgctat tttatgtccg cacagatgtt gactccgaag caggcacgtt 8040 gggcttcctt tttaggagag ttcaattttg agatattgca cactcctggg aagacaaacc 8100 cagccgatcc agcatctaga cgatcggact ttgcgtgtgg aaagaaagat tctgcgaaag 8160 tggtgctgct gggatcgagg gacatcaaac atgccaacat tagtgccatt catataaaca 8220 atccggggag tttcgatgtc tctacctata tgccaatatc tgatttgtca aaaaagaaaa 8280 ttactgacgc gtatctggcc gatggtttga tagccggcgt gcacccgacc tttttgcatt 8340 ttttagatgg gttatggtgg tggcgtgata gagtatatgt tccatcagtt ttgaggcaag 8400 ggtttttgaa agaaatccat cagtcttcaa ccggtggaca ttggggaagc ttgaagacga 8460 tggatttgct ctctcgttcg tttggttggc cgaatttacg gaaggacgtc ttggagttca 8520 tccaaggctg tgccagttgt caacaggtta aagttgatca ccgccctcca caaggacagc 8580 tcattcctct acccattcct gatcaaccat ggagtacaat aggtgttgaa tttattgtta 8640 
aattaccttt gtccagcggt ttcgacttga ttatggttgt tgtcgatcat ctgtcaaagg 8700 cggcccactt cataccggcg aaagaaactt ggtctgcaga agaattggca agatccttca 8760 ttgctcaagt ttttcgtttt catggcttac cagacaccat taggggctct caaacctgac 8820 ccggttttca aaaacccagt tgacccagtt gactttttgg cagtaacttt gtatataaat 8880 ttgaaattta ttgacagctc ccagccaaaa agtcaactgg gtcaactggg ttttttgaaa 8940 ccgggtcagg tttgagagcc cctattgtat ctgatcgcgg cacgaccttg gtctcgggat 9000 tttggactag tgttttgcgt ctattacgtg tttcgtcttc gccttcgaca gcttttcacc 9060 cacagactga cggccaggta gaacgtctga atgctttgtt gcaggattac ttacgtcatt 9120 atgtttccac agagcaggag gattgggcat cttggttacc aatggctgag ttttcgtaca 9180 ataactcaat gtccagttcc acaaaaatgt tgccattctt tgcgttgcaa ggctatcacc 9240 cacggtttaa ttccctgaca ggctcttctg gtcgaccgaa agctgatgcc ttggttgcac 9300 atttacagga tgttcaaaca cgtttgtcgg ataatctggt ccaagcaaag gagtcacaag 9360 cacggttcta caacaaaggc cgacaggttg atacgactta tgcgccagga gacttggtct 9420 ggttatcaag aaaacatttg aagacaaaac acccgagtaa caagctggat gtacaacgct 9480 taggcccctt ctctgtggta cggatggttg ggaggaatgc agctgagttg gatttgccac 9540 catctttgtc ccgactgcat cctgtgttta atgtgtcact tctaatgccg ttcgtaggag 9600 agactgctga cattacacct gcggaaaaga cgcggtggaa tacgttggtc cttcaacaac 9660 aggagacagt cagcactgtt ttggattatc ggctaacaag agacggcgtt catgaatacc 9720 tggtacggtt ctctggttct tcaaacttgg atgatcgttg gctgccactg tctcgactat 9780 cattgtcttt ggacacttgg ttggaacgtt ttcaccgcat tagcccttct gtaggacctg 9840 gacctgggga acttgtatgg atagcacggg caaagcgaca tgtggatcaa gaggctggtc 9900 ttttacctag tgaataagtt tttctctcag caaccaaccc ctttgtgcct aaacaagcca 9960 ttcaagacga gggtatgtga aggtgttgga agttttttct ctttgcggta catggaaggt 10020 cataatt 10027 // ID Gypsy-58_MLP-I repbase; DNA; FNG; 5613 BP. XX AC AECX01001648; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-58_MLP_; KW Gypsy-58_MLP-LTR; Gypsy-58_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5613 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001648; Positions 43205 48817. XX CC Positions [4358-4837] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 419..5020 FT /product="Gypsy-58_MLP-I_1p" FT /translation="MTTRRQAPPPNPKAPPVPLPDPEAIIRENRRLARSSR FT PDPAIARLAASQLLQTSSRARTDQPPPATPNRTHIKPSTLPQLPESREESP FT LSSVPSSPTPYQQHPKPFPTMSAPPDKPDNPDTDLMKAIIESQLASNRRME FT KFEDLMMKLMTPKESTPGTSTPEEPKPSAGLDLTKFRTSDGPVYKGPYRDA FT EPFLLWYLSLKTFFRTKGVTLDTDRITLVGCFLEEPNAHAFYEGGFNRFIK FT GTWKDFLVQLFDSCLPSDWNSRLFEKAQHLKMSPYEDFITYSTRARGIQVL FT INFHTVVINDHQLATFVEYGMIEELKTAVKLWGFTKSIKDFTYHTFENQCE FT TLYASLVASHAITKKIRANPTTSYNHNASYSRPRAAPTNTTGRLTDEEFTW FT KIHLYLDVLGKCHFCKGYCGSEYRGCKGPYVREKVIFPPGYVAPPKPANYI FT PPKARSSPPGTHNQAGRTNQQPARRPAPSYAKVAAAGEFPDLDEASVAALQ FT ALDEELDLAEAEGCVPPPKSPRVILEFTCNGKTLRALAYPGAEVNHLTDRA FT AEELHLKRRKLVKPTHLSLAVATESSPPPLTHFTTIDLTESTSGRQFSRTY FT CKLGDVGGGFDMILGTPFFNLFQFSVSINQRAITCERTGFKIYDYRVLEEL FT KEKHTKLLIPRITKPMRECREPWEREVRAQGDLSIMSVNEHSETKLLEEFK FT DLFPIDIPAVSDEAEDEGLFVDGSFPEKMQNESSKIRHKIILKDPEATIKE FT KQYNYPPKHMNAWKKLIDQHVAAGRIRRSTSQYASPSLIIPKKDPTELPRW FT VCDYRVLNSLTVRDRAPLPNVDRLVRLVATGKFFSIIDLTNAFFQTRMREA FT DIPLTAVYTPWGLYKWCVMPMGLTNAPSTHQGRLEEALGDLLNSICVVYLD FT DIVVFSNSPEEHEEHTRKVLQKLREANLYCSKKKTKLFRDEIKFLGHWISA FT KGIRVDGEKVEQVLNWKKPKSAKGIKKFLGTVQWMKKFIWGLEKYVNKLTP FT LTSTKLDASKFVWGPAEDEAFQNIKRLMATLPHLKNINFDSEDPLWLFTDA FT SRAGLGAALFQGKEWKLASPIAYESRQMTPAERNYPVHEQELLAVMHALQK FT WRMMLLGMKVNVMSDHHSLTYLLKQLNLSRRQARWIETLADFDVDFKYVRG FT EDNGVADALSRKGCDEDEVEIDGIECIAALVEAGPTLSPQLRKRICEGYER FT 
DSFYSAVLSALPLRDDCSVQEGLLFIENCLYIPCIDNLRMELVDETHSRLG FT HLGYLKTITDLRREFFWPRMAKDVNEFVRSCDVCQKTKSPTQAPLGKMLTP FT SIPDKPLECLAIDFVGPLPKVNNYDMILTVTCRLTGFTKLIPCGQTDTAEK FT TATRFFSNWTSLFGPPTTIISDRDKTWTSNFWKTLMRRSGISFHMSTAFHP FT QADGRSERTNRTVGQVLRSFTSKRQGKWLESLAAVEYAINGAVNVAIGKSP FT FKLIFGREQWLIATGVAEETDPAALTRWFKTRQDAWANARDALWTSRVQQA FT IQHNKHCRTKFVGVTGLR" XX SQ Sequence 5613 BP; 1612 A; 1346 C; 1265 G; 1390 T; 0 other; ggtaggactt ctttctctat atttctctac atttctcttt tatcatttag tatagccttc 60 cttagagata tcgggactaa tctcttgccc tttatcattc ggaaccagtg ctttcattga 120 tattgcaatt cacagattag atctcttggg ttctagaaaa ccctttgccg tgttctcacg 180 gtgtcccgtt cgtagtatcg agtgaaggtt ccttcgaacc ttacactttt tttttctcaa 240 acccaaatcc gagatttttt atatcattca aaatcaattc aaaaaatata aaccttaatt 300 gatactccag gaaaccttac tgaatgcgcc acctcgattc tattacgatc gacaccattc 360 actgagacct gaaaccttac attaccccac gttccgaccc ctcctctgtt catcgtgtat 420 gacgacacga cgacaggctc caccccccaa cccaaaagca ccgccagtac cattgccaga 480 cccggaagcc atcatccgcg agaaccgacg attggcacgc agctctagac cagacccagc 540 cattgccaga ctagccgcgt cgcaacttct tcagacatct tctcgagcca gaaccgatca 600 gccaccgcct gctacaccaa atcgtaccca tatcaagcct tctacgctac cacagctgcc 660 tgagtcccga gaagaatcac cactttcttc tgtaccttcg tcgcctactc cttatcaaca 720 acaccccaaa cccttcccaa cgatgagcgc accacctgac aagccagata atccagatac 780 ggatctgatg aaagccataa tcgaatccca gctcgccagt aatcgtcgaa tggaaaaatt 840 cgaagatcta atgatgaagc tcatgacacc caaagaatcg actcctggga cgtctacccc 900 agaagaaccc aaaccgtcag ctggattaga tttaacaaag ttccgaacat ctgacggacc 960 ggtttataaa ggaccgtacc gtgacgcgga gccttttctt ctatggtact tgtcgttgaa 1020 gacgttcttc cgaactaaag gtgtgacgtt ggatacagac cgaatcacac tagttggatg 1080 cttcttggag gaaccgaacg cacatgcctt ctacgaagga ggattcaatc gtttcatcaa 1140 gggaacctgg aaggactttc tcgttcaact ttttgactcc tgcttaccta gcgattggaa 1200 tagtcgcctg tttgagaaag ctcagcactt gaagatgtca ccgtacgaag acttcatcac 1260 ttacagtacc agagctcgag gcattcaagt cttgatcaac ttccacaccg tcgtgataaa 1320 tgatcatcaa ctggcaacct ttgtggagta 
cggtatgata gaagagctga agacggcggt 1380 caagctatgg ggatttacga aatcaatcaa agattttact tatcatacct tcgaaaatca 1440 atgtgaaaca ctttatgcgt cattggtagc ctcccacgcc atcaccaaga agatcagagc 1500 caatccgacc acgtcgtaca accacaacgc gtcgtatagt cgccctcgag ccgctccgac 1560 caacacgaca ggccggctaa ccgatgagga gtttacatgg aaaattcact tgtacttaga 1620 tgttctgggt aaatgccact tttgtaaagg ctattgtggc agcgaatacc gagggtgcaa 1680 aggcccctat gtcagagaga aggtcatctt ccctcccggg tatgtagctc ctcccaaacc 1740 ggcgaattac attccaccga aagctagatc gtcgcctcca ggaactcata atcaagccgg 1800 ccgtaccaat caacaaccag ctagacgacc agcgccatct tatgcgaaag tagcagcggc 1860 cggagagttc ccggaccttg acgaggcgtc agtagctgca ttacaagcgc tagacgaaga 1920 gctcgaccta gcggaagctg aagggtgcgt ccccccccca aagtcccctc gcgttatact 1980 tgaattcacc tgtaacggca agacgttacg agccctcgct tatccagggg ccgaggtcaa 2040 tcacctaaca gatcgggcgg cagaagaact tcaccttaaa cggcgaaagc tcgtgaagcc 2100 aacgcacctc agcttagcgg tcgctaccga atcttcccca cctcctttga ctcacttcac 2160 cacaattgat ctgaccgaat ccacgtctgg aagacaattc agccgaacat actgtaagct 2220 aggagatgtg ggaggaggat ttgatatgat tttagggact ccatttttca atttatttca 2280 attctctgtt tctatcaatc agcgtgctat tacttgtgag cgtacaggat tcaaaattta 2340 tgattatcga gtgttggagg aattgaaaga aaaacacacc aagttattga tcccacgcat 2400 aaccaagcct atgagagagt gtcgtgagcc ctgggagaga gaagtgcgag cacaagggga 2460 cctatcaata atgagtgtga atgagcatag cgagacgaag ttgcttgaag agttcaaaga 2520 tctttttccg attgacatac cggcggtatc agatgaagca gaagatgagg gactgtttgt 2580 agacggttca tttcctgaga agatgcagaa tgaaagctca aagattcgtc ataaaatcat 2640 cttgaaggat ccagaggcaa ccatcaaaga gaaacagtac aactacccac ctaaacatat 2700 gaatgcgtgg aagaaactca ttgatcaaca cgttgctgct ggaaggatac gccgctcaac 2760 tagtcaatac gcctcaccga gtttgattat cccaaagaag gaccctaccg aacttccaag 2820 atgggtatgt gattaccgtg tcttgaatag cctgaccgtg agggaccgcg cgccattgcc 2880 aaatgtggac aggttggtca ggctggtggc tacagggaag tttttttcca tcattgacct 2940 caccaacgcc ttttttcaaa ctcgaatgag agaagctgac atacctttga cggccgtata 3000 cactccgtgg ggattataca agtggtgtgt tatgccgatg 
ggacttacaa acgctccaag 3060 cacacaccag ggacgtctag aggaggcatt aggggatctt ttgaattcaa tctgtgtagt 3120 atatttagat gatattgtgg ttttttcaaa ctcacctgaa gaacacgaag aacacactcg 3180 caaagtactt caaaaattaa gagaagcaaa tctatactgt agcaagaaga agactaaatt 3240 gtttagagat gagataaagt ttttaggaca ttggatatca gcaaaaggga ttagagtaga 3300 tggggagaaa gtggaacagg tactgaattg gaagaaacct aaatcagcta aaggtatcaa 3360 gaagtttttg ggtaccgtac aatggatgaa gaaattcatc tggggcttag agaaatacgt 3420 gaacaaactc acgccgctca ccagtactaa gctcgatgca tcaaagtttg tctggggacc 3480 ggcagaagat gaggcctttc agaacatcaa acgcttaatg gcaactttac ctcacttgaa 3540 aaacatcaac ttcgactcag aggacccgct gtggcttttt acggacgcaa gtcgggcagg 3600 actaggagct gcattgttcc agggcaagga atggaaactg gcctcgccaa tagcatacga 3660 atcacgacaa atgacgccag cggaaaggaa ttacccagtc cacgagcagg aattgctagc 3720 ggtgatgcac gctttgcaga aatggcgcat gatgctgtta ggtatgaagg tcaatgttat 3780 gagcgaccat cattccttaa cctacctgct caaacaactc aaccttagtc gacgacaagc 3840 caggtggatt gaaacattag cggattttga tgtagatttc aagtatgtga gaggtgaaga 3900 taacggtgta gcggacgcgt tatcacgaaa aggatgtgat gaggatgaag tggaaataga 3960 tggaatcgag tgcattgcag cgctagtaga agctggtcca accctctcgc ctcaactgag 4020 aaagcgaatt tgtgaagggt atgagaggga ctctttctat agcgcggttc tctcagcttt 4080 acctcttcgt gacgattgct cggttcaaga aggccttctt ttcatcgaaa actgtttata 4140 cataccctgt attgataatt tacgcatgga actagtagac gaaactcact cgaggctagg 4200 ccacttagga tacctcaaga ctataacgga tttacgtcga gaattctttt ggccacggat 4260 ggcgaaggac gtaaatgaat tcgtgagatc ttgtgatgtc tgccagaaga caaagagtcc 4320 gacgcaagca ccgttaggaa aaatgctgac tccaagtatt cccgacaaac ctctcgagtg 4380 tcttgcaatc gattttgtag gaccgttgcc aaaagtaaat aattatgaca tgatactgac 4440 ggtgacttgc aggctcacgg gcttcactaa actcatccca tgtggccaaa ctgacacagc 4500 tgaaaagacc gctactcgat tcttctcaaa ctggaccagc ctgtttggac cccctaccac 4560 gatcatcagc gaccgcgata aaacctggac gtcaaacttt tggaaaacct tgatgcgacg 4620 cagtgggatc agttttcaca tgtcaaccgc atttcaccca caagcggacg gtcggagcga 4680 gcgtaccaat cgcactgtag gacaggtttt gaggtcattc acgtccaaaa 
gacaaggaaa 4740 gtggttggag tcactggcgg cggtagagta cgccatcaat ggggctgtta atgtggcaat 4800 cgggaaatcg ccgttcaagc tcatcttcgg acgcgaacag tggttaatcg caacaggtgt 4860 agccgaggaa acggacccgg cagctctgac acgttggttc aagacgagac aggacgcctg 4920 ggcaaatgct cgagacgccc tatggactag tcgcgttcag caggcgatac aacataacaa 4980 acactgtaga accaaattcg tgggcgttac tggactcagg tgactggaga ggtcgtcatc 5040 aaggaggagc agataaattg aaggagaaat ttgaaggccc ttaccgagtc atcaagtctt 5100 caaaccacgg tcaaaatgtc gaattagaac tcccgttggg agacaggcgt cacaatgtat 5160 ttcatgtttc aaaggtgaaa ccgtacgtcg agagagctat gggaggggtg aatacgtcgg 5220 attcaagagg tcaggaagta agttcctctc ctctgtatgc accgccgtag tatactacgt 5280 aaaaatccgt caaaacaaca cctcggccac gctgtgagca caaccttttc ggtattcttt 5340 ttctaggtca ccgaggcaaa tggacagcgg agaagcatat tcaattttat tcaccctatc 5400 aagctcaact acaagtcaaa ccttcagtta tttaattcct ttcttatttt ttcttttcta 5460 attttttctt ttttaatttc tgtttaattt tttttctctt gagtatgctt aatttttctc 5520 ctttctcttt ctttcttctt ttattttcaa gtctcaagcc ttgtggggag gctccgtaca 5580 aggcatgctc tattctttta taggagggga gac 5613 // ID TCA5_I repbase; DNA; FNG; 4220 BP. XX AC AACQ01000342; XX DT 05-AUG-2005 (Rel. 10.08, Created) DT 05-AUG-2005 (Rel. 10.08, Last updated, Version 1) XX DE Copia-like LTR retroelement from Candida albicans. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; TCA5_LTR; internal portion; TCA5_I. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-4220 RA Jones T., Federspiel N.A., Chibana H., Dungan J., Kalman S., RA Magee B.B., Newport G., Thorstenson Y.R. et al.; RT "The diploid genome sequence of Candida albicans."; RL Proc Natl Acad Sci U S A 101(19), 7329-7334 (2004). XX RN [2] RP 1-4220 RA Jurka J.; RT "TCA5: annotation of LTRs."; RL Direct Submission to Repbase Update (05-AUG-2005). XX DR Genbank; AACQ01000342; Positions 950 5169. 
XX CC LTRs are identical and ORF appears to be intact. XX FH Key Location/Qualifiers FT CDS 98..4159 FT /product="TCA5_I_1p" FT /translation="MISPDFLDFINKDTMDLQQYPTVYQTFLDRLICATID FT PHIKQSLKYRKLSGKKMLSEIISQFGSMTIKDKVNYSIIMATKIHSDVTTH FT LDKMNLLAQFYAFLMRQPQDLKPALLLIAGINDSRFNETYFHDNKELTISK FT LERYIINQNSKITPSVPTPSPRDAVTGLLVTQPTSALGQSEVFNTQCFNCF FT GLGHTARRCASPKRLGQINNLRSKLLAFETRSKSRKRFPPQPPPTNRSANS FT TIITNPSPTDDTISSTTEDSFPRDVFGWAASSDQIKSKDNLSLFFDTGASA FT HLINNLNLLHDYKPSKENKHVITANGDKIPILGTGTVKLQHGQHKISLRNC FT QYSPHLHINLISPRLLLDDSTSMTITQSGIYHSKIGQIGYYSTEDGNLIKC FT MFRPITIPHLSLYSQYVEMGLQSNNVLRNIPAFTVHIPQLHDSLGHTSTQQ FT VSNVMKRFNVTTDNIGTDCETCRLGKAITQIPKISTHTISSHCLELLHVDV FT HGPISVPSIFQERYFLVILDDYSKYLTVQPLCNKSDATAEIIEFINHWEKF FT FLGNGNYHTKILRSDNGGEFLNKTLTTYLDSKYITHQTSNAYEHHENGAAE FT RAIRSVKDMARVILLQSKLPVPFWSLATRCAAFVMNRLPHKTINGKIPYEV FT WTKQLVNLKMMKPFGSQVYVKIPIGVKSFSAQALSGIMVGYATNKKGYLVY FT DPTQNRIFTSSQIICHPSIYPAANLTFNEPLIISSKVTAAHLHPLTISNLV FT IPPTNAVSETPLANCVLSSNSSVCPKVCQLQTVLEHGEDKIYALIIPISIG FT NMKRTRTNENKICQLDESNNTTIPDSVILSANNVLLNLESRSSIPKSYKEA FT ITSNEKSKWADAMDSEFNSLQSNNTWSLEPLPEGRKAIGVKWVYTIKDTGR FT YKARLVALGYRQQAGVDFLETYAPVIRGESIKLIFALASKSKLKIHSIDVT FT TAFLNGEILELIFVKQPPGYEDKKRPNHVCKLNRSLYGLKQLPLMWNIKLN FT DVLIKEGFRRLGGDLGIYISKDKRTIMGVYVDDILICGPSDSEIEQVKNNV FT RKYFSITDNGLCRKFLGINVYQQANEIRLSLNDYIRRMIEELKLSVSETNP FT VSIPSDVNYEIFKVNENDDEKPCDQTKYRSLIGKLLFASNTIRFDIAYSVN FT SLSRFINDPKEKHWIAAVKVVKYLSGTQRYGICYNGNGDLNIYADSDWAST FT PSDRKSITGYIVTYAGAPISWRSKKQNVIALSTTEAEFMALTESIKEALWL FT IYIFRDINVILKLPIVIYEDNLLCQKLLENPRFHNRTKHIDLKYKFTKDHI FT EAGTIKVESTNSADNLADMLTKPLPKIKFKHLRWLAGLRPLD" XX SQ Sequence 4220 BP; 1381 A; 897 C; 723 G; 1219 T; 0 other; ggttattgcc tctatctcca ctgtggacaa tcaaagtctc ttgaaggata aaatttctta 60 tgatcattgg ttcagtacct tgaaagaaaa tgcaatcatg attagtccag attttcttga 120 ctttattaac aaagacacca tggatctcca acagtaccca actgtctacc aaacattctt 180 agatcgtctt atttgtgcca caattgaccc acatatcaaa caatctttaa aatatcggaa 240 gttatcagga aagaaaatgc ttagtgaaat tatctctcaa tttggttcta 
tgactattaa 300 agacaaggtt aactactcca taattatggc taccaaaatt cattctgatg tcaccactca 360 tttagacaaa atgaatttac tggctcaatt ttacgcattt cttatgcgtc aacctcagga 420 ccttaaacct gcccttttac ttattgcggg tatcaatgac tcacgtttca atgaaacata 480 ctttcacgat aacaaagaat taacgatctc taagttggaa cggtatatca ttaatcaaaa 540 ctccaaaatt actccgtcgg taccaacacc ttctccacgt gacgctgtta cgggtttact 600 ggttacccag cctacgtccg ctctgggaca aagtgaagtg tttaatacac aatgttttaa 660 ttgctttggg ttgggccaca ctgcacgtcg ctgtgcctct ccgaaacgtc ttggccaaat 720 aaacaacctt agatctaaat tacttgcgtt tgaaactcga tccaaatcca gaaagcgttt 780 tccacctcaa cctcctccta cgaatcggtc ggcaaactca acaataataa ctaatccctc 840 acctactgac gataccatct cgtccaccac tgaagattct tttccacggg acgtctttgg 900 atgggcggca tcatctgacc aaatcaaatc aaaggacaac ctttctttat tttttgacac 960 aggtgcctcg gcacatctta tcaataatct caatctactt catgattaca aaccctctaa 1020 agaaaacaaa catgtgatca ctgcgaacgg tgataaaatt cctatcttag gaactggaac 1080 tgtgaaactc caacatggtc aacacaagat atcacttcgc aattgccaat attctccaca 1140 tctacacatc aatcttatct cacccagact cttacttgat gattccacta gcatgactat 1200 cacccaatcc gggatttatc actccaaaat tggacaaatt gggtattatt cgactgaaga 1260 tggtaatcta atcaagtgta tgttccgtcc cattaccatt cctcatcttt cgttatattc 1320 tcaatatgtc gaaatgggtc ttcaatctaa caatgtacta cgtaacattc cagctttcac 1380 ggtccatatt cctcaactac atgactccct tggacacaca tctactcaac aagtttcaaa 1440 tgtcatgaaa cgtttcaatg tcactactga caacattggt acggactgcg aaacttgtcg 1500 gcttggaaaa gccattactc agattcccaa gatctcaacc cataccatct ctagtcattg 1560 cttagaacta cttcacgttg atgttcatgg accaatatcc gttcctagta tatttcaaga 1620 acgttatttt cttgtgatcc ttgatgacta ctcaaaatac ttgacagttc aaccactatg 1680 caacaaatct gatgctactg ccgaaattat cgaattcatc aatcattggg aaaagttctt 1740 tctgggaaat ggcaattacc atacgaaaat tctccggtcg gataatggag gggaattctt 1800 aaacaaaaca ttgactacct atcttgattc aaaatatatt actcaccaaa cctccaatgc 1860 ctatgaacat catgagaatg gcgctgcaga acgagctatt agatcggtta aagacatggc 1920 tcgagtaata ttgcttcaat ccaaattacc agtgccgttt tggtccctag caacccgatg 1980 
tgctgcgttt gttatgaatc gtcttcctca taaaacaata aatggtaaga ttccttatga 2040 agtatggact aaacaacttg tcaatctcaa aatgatgaaa ccgtttggct ctcaagtata 2100 tgtgaaaatt cctattggag tcaaaagttt ttctgcacaa gcactttctg gaatcatggt 2160 gggatatgcc actaataaga aaggctacct tgtatatgat cccacacaaa atcgaatatt 2220 cacatcctca caaataatat gtcatccgag catttatcca gcagccaacc ttacgtttaa 2280 cgaaccctta attatctcat cgaaagtcac ggctgctcat cttcaccccc ttaccatttc 2340 caatttagtt attccaccta ccaatgctgt atctgagaca cctcttgcaa attgtgtgct 2400 ctcctcaaat tcgtcagtat gtcccaaagt ttgccaatta caaactgtct tggaacatgg 2460 ggaggataaa atatatgcac tgattatacc aatatcgatc ggcaatatga aacgcacaag 2520 aacaaatgaa aacaaaatat gccagctaga tgaatcgaac aataccacca taccagatag 2580 tgtaatttta tcggctaaca atgtgttatt aaacttagaa tcgagatctt ccattcccaa 2640 aagttataag gaagctataa catctaatga aaaatccaaa tgggctgatg ctatggatag 2700 cgagtttaat tcattacaat ccaacaacac gtggtcactt gaaccactac cggagggacg 2760 caaagctatt ggtgtcaaat gggtttatac aatcaaggac accggtcgct acaaggctcg 2820 ccttgtggca cttggttatc gacaacaggc tggtgtggac tttctcgaaa cgtatgctcc 2880 cgtgattcgt ggagaatcaa tcaaactaat ctttgcactc gcgtcaaaat ccaaactaaa 2940 gattcattcc atagatgtta ccacagcttt cctcaacggg gaaatactgg aactcatatt 3000 tgtgaaacaa cctccgggat atgaagataa gaagcgtcct aatcatgttt gtaagctcaa 3060 tcgcagctta tatgggctta agcagctgcc actaatgtgg aacattaaat taaatgatgt 3120 acttataaag gaaggtttcc gtcgacttgg tggtgactta gggatataca ttagtaagga 3180 caaaagaaca ataatgggag tttatgttga cgacattctc atttgtggac cttctgacag 3240 tgaaattgaa caagtaaaga acaacgtgag aaaatacttc tcaataactg ataatggatt 3300 atgccgaaaa ttccttggaa ttaacgtcta tcaacaagca aatgaaataa gattaagttt 3360 gaatgattat ataaggagaa tgattgagga gttaaaatta tctgtctcag aaacaaaccc 3420 agtatctata ccatctgatg tcaattatga aatatttaaa gttaacgaaa atgatgatga 3480 gaaaccatgt gatcaaacca aataccgaag tttgataggc aagctcttgt ttgccagtaa 3540 tactataagg tttgacatcg cctattctgt caactcccta tccaggttta tcaacgatcc 3600 caaagaaaaa cattggattg cagctgtcaa ggtggtaaaa tatctcagtg gtactcaacg 3660 gtatggtatt 
tgttataacg gtaacggtga cttgaatatt tacgctgata gtgattgggc 3720 ttccactcca tctgatcgaa agtctattac ggggtacatt gttacctatg ctggagcgcc 3780 gataagttgg cgttccaaga agcagaacgt gatagccttg agtacgacag aagcggagtt 3840 tatggctctc acagagtcca taaaggaagc cctttggcta atatacattt ttcgagatat 3900 taatgtgata ttgaaattac caattgtgat atatgaagac aacctactgt gtcagaaatt 3960 acttgaaaat cctcgattcc ataataggac aaaacacatt gacttgaaat ataaatttac 4020 caaagaccat atagaagctg gtacaatcaa agtggaatca actaattcag cagataactt 4080 agccgacatg ctaactaaac ctttaccaaa aattaaattt aaacatttaa gatggctagc 4140 aggattaaga cctttagatt gattagataa tgataaaatg aaataaagat taatttggag 4200 atgcaggttg atggggagga 4220 // ID Gypsy-34_MLP-I repbase; DNA; FNG; 5709 BP. XX AC AECX01000176; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-34_MLP_; KW Gypsy-34_MLP-LTR; Gypsy-34_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5709 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000176; Positions 20451 14743. XX CC Positions [4509-4988] - Integrase core CC 'GTGGT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 370..1431 FT /product="Gypsy-34_MLP-I_1p" FT /translation="MSGMEISGTDLMTAILAWLEAMDLKLAEETQRREAAE FT QRSVAAEQRMAEMQQQAQQASQSQSANSAPAIPAIVPKDSATGFQAKPPKV FT ATPDKFDGTRGSKAEIFANQVGLYILMNPSQFPDDRTKIGWTLSYMTGKGA FT EWAKPVTHKLIHKPDKAMTWDSFSKTFEATFFDSERIAKAEKAIRALAQSG FT TVLSYSLRFNDLALVVKWPQSVLITQFEQGLKREIQVQMVRDVFTSLDQII FT ELAIKIDNKLHKRTEENQAETRDAPAHDPNAMDCSAFRFKISDDEYKRRWD FT KDLCFKCGNSNHIARECYSGKGKWRGKFRSHREAKISTSEVKLEGEERGSK FT SEESKNGEARG" FT CDS 1830..5609 FT /product="Gypsy-34_MLP-I_2p" FT /translation="MSENHDLIDWTTQTLKTEETDIATVKAVSSLPKTTRG FT DSDGVSLEQARNSDEGVCIHDTLTPPRCESFYLYSKPPIESAGKLGHFLNN FT RYQATVDEPEPDQRKETTDEIKEYGMVAADLSASSNLKTTHGDSKGGPVEQ FT AKKIDEGVCNPSDTQLTPPHRESFTAPSKPLNVTASKQAYSPFNRFNPVFR FT KQQAYRVQNWSKPWTAAPSPISTPHLDAARASWNMSAKLAAEASKDTPQKA FT AAELVPQEYHEYLKMFEKNESKVLPPHRPYDFRVDLVPGAEPQAGRVIPLS FT PAENKVLDEMIEEGLATGTNRRTTSPWVAPVLFTGKKDGKLRPCFDYRRLN FT ALTIKNKYPLPLTMELVDGLLHADKFTSLDMRNGYNNLRVKEGDEAKLVFI FT CRAGQFEPLTMPFGPTGAPGYFQYFIQDIFKDRIGRDMAAFLDDILIYTKP FT GEDHEKAVKEALETLRNQNVWLKPEKCKFGQKEIVYLGLQLSHNKISMDKS FT KVQAVTAWPTPRNINEVQQFVGFANFYRRFIQDFSRIARPLHELTKKDVKF FT EWNPARDSAFKLLKEAFTSAPVLKIADPYKQFVLECDCSDYALGAVLSQVS FT DDDGELHPVAFLSRSLIQAERNYEMFDKELLAVVASFKEWRHYLEGNPYRL FT NVIVYTDHKNLESRMTTKELTRRQARWAETLGCFDFKIRFRPGKHSTKPDA FT LSRRPDLKPDDNCKLTFGKLLKPENLPTDAFISELDDCSKWLSEDEECMEF FT YFDNEEEELAHIMVMEEEEIWDDSLILEEIRKKTKEDPRILELMQGIKNEK FT NEYSSHDGLLYYRGLVEVPNDHEIKMRILQSRHDSLLAGHPGQMKTLSLIQ FT RTYHWPSMKAFVNAYVNGCHSCQRVKPRTTHPFGSLQPLPIPAGPWTDLCY FT DLITDLPLSEGKDSILTVIDRLTKMCHFIPCNTTMSSEELARLMIKFVWKH FT HGTPKSITSDRGNIFISKLTKELNRQLGIKTQSSTAYHPQTDGQSEIANKA FT VEQYLRHFVSYKQDNWTGLLDMAEFAYNNSPHTSTGISPFKANYGYDVSYS FT RIPSSERCIPAVEEMLRQLSEVQEELQSSLRLAQDTMKYQYDKKVLETPSW FT DVGSQVWLNSRHISTTRPTAKFGHKWLGPFTIAQRVSTNAYKLILPESMSR FT IHPVFSVGLLRPYKPSSIKGQIQSPPAPIIVDNEEEFEVNEILNRRRRGSK FT VEYLVNWKGYGPEEDSWEPASGLDNAKELVDEFNIHYPTAEKDYRRTRRK" XX SQ Sequence 5709 BP; 1932 A; 1209 C; 1230 G; 1338 T; 0 other; tattgaagta tctacattta 
caaccctgga cgtcagagaa caagcaaaga agaagttaag 60 aaattgaaac caaagaaatc tgaagaaagg aagaaaaagt ctaaatcaaa gaagaacaac 120 tagaagtgaa gatcaatcta gggagtttaa aaatactctt atattcaaac tcaaacctta 180 ttataaacct gaacaacctt atcaccacct tatcaccacc gtattccccg cataaattct 240 ttcaccacgc ccgaattcca agtacctcaa agtcccactc agtcgctcag ctctgattca 300 atcgcataca cctctgcatc ttctaacgag gtcgttgaaa ccggactaga tcccgtaact 360 tcaaacgaga tgtcgggaat ggagatctcg ggaacggatc tgatgactgc aatcctagca 420 tggctcgaag ctatggatct gaaactagcc gaagagaccc aacgtcgtga agcagctgaa 480 caacgcagcg tcgctgctga acagcgcatg gctgaaatgc aacaacaggc ccagcaggcc 540 agtcaatccc aaagtgctaa ctcagcaccg gctatcccgg ctatcgtgcc aaaggactct 600 gctactgggt ttcaagcgaa acctcccaaa gtcgcgacac ccgacaaatt cgatgggact 660 cgaggcagca aagctgagat ctttgcgaat caagtgggac tatatatttt aatgaacccg 720 tcgcaatttc ccgatgaccg taccaaaatc ggatggacac tatcctatat gactgggaag 780 ggtgcagagt gggctaagcc agtcacccat aagcttatcc acaaacccga caaagctatg 840 acgtgggatt ccttttcaaa aacgtttgag gccactttct tcgactctga acgcatcgca 900 aaggccgaaa aggctatccg agcgttggct caatccggca ctgtattatc ctactcgctt 960 cgattcaatg accttgcact tgtggtgaaa tggcctcaat cagttcttat aactcagttc 1020 gaacagggcc ttaaacggga aattcaagtg caaatggtga gagatgtctt cactagtctc 1080 gaccagatca ttgaactcgc gatcaaaatt gacaacaagc ttcacaaacg gactgaagag 1140 aatcaagcgg agacaagaga tgcaccagcc catgatccaa acgcgatgga ctgctcggct 1200 tttagattta aaatctcaga tgatgagtac aaacgtagat gggataagga tttatgtttt 1260 aaatgtggta attcaaatca tattgcaaga gaatgttatt cgggaaaagg gaaatggagg 1320 ggtaaattca ggagtcatag agaagcaaaa ataagtacat cagaagttaa attagaaggt 1380 gaagagaggg gaagtaaatc tgaagagtca aaaaatggag aagctcgagg ttgaaggtcg 1440 tgccttcctt gagcttagaa aggagtggat caggtgtatc aatgggattg gttgaactgg 1500 aaattgacaa ctttgcaatt aaagacaacc gcatttttat caccgtgccc atttatgatc 1560 cctccaaaga cacgacccat tttgcccgtg ccatgcttga ttccggtgct actcataacg 1620 ttctgaatga aaagtttgta tactgaaata agctgacaac tcagacctta aatcagccaa 1680 aaccagtaac tggttttaat ggatcacaat cttatatctc 
aagaatagga gaatacatca 1740 ttgaagttaa tttaaaaaga aaaaattcaa atttattcct aatatcacag ttgaaagact 1800 cagtagattg tatattagga attgattgga tgagtgaaaa tcatgattta attgattgga 1860 caactcaaac gttgaagact gaagagactg atatcgcaac tgtgaaagca gtgtcgtcct 1920 taccgaaaac aacacgagga gactcagatg gagtgtcctt ggagcaagct aggaacagtg 1980 acgagggggt gtgtatccat gatacgctaa cacccccgcg atgtgagtct ttttatcttt 2040 actccaaacc gcctatagaa tcagccggca agcttggtca tttcctaaat aacagatatc 2100 aggcaacagt ggatgaaccg gaaccggacc aaagaaagga aacaaccgat gaaataaaag 2160 aatatgggat ggttgcagct gatttatcag cttcgtcaaa tctgaaaaca acacacgggg 2220 attccaaggg aggacctgtg gagcaagcta agaaaattga cgagggggtg tgtaatccga 2280 gtgatacaca attaacaccc ccgcaccgtg agtcctttac agcaccttca aaaccgttaa 2340 atgtaacagc tagcaagcag gcatactctc catttaacag gtttaaccca gtcttcagga 2400 aacaacaagc gtaccgagtc caaaactgga gcaaaccttg gacagcagcc ccatcgccaa 2460 tctcaacccc acacctagac gctgcgagag cctcgtggaa tatgtcagca aaactagcag 2520 ctgaagctag taaggataca cctcaaaaag cagcagccga actagtacca caagaatacc 2580 acgagtatct gaagatgttt gagaagaatg aatcaaaagt cttacctcca catcgaccat 2640 atgattttcg tgtcgatctg gtacctggtg cggaacctca agcaggaaga gttatacctt 2700 tgtcaccggc tgaaaacaaa gtcctagatg aaatgattga agaaggctta gctacaggaa 2760 ccaaccggcg aactacatca ccgtgggtgg cgcctgtttt attcactggg aagaaggacg 2820 gaaaattacg accttgcttt gattatagaa gactcaatgc tcttacgatt aagaataaat 2880 atcccctacc cctaacaatg gagctggttg atgggttact tcatgctgat aaattcacat 2940 cactagacat gaggaacggc tataataacc taagagtcaa ggaaggtgac gaagctaaac 3000 tagtatttat atgtagagct gggcaatttg aaccgttgac tatgccattt ggacctaccg 3060 gcgccccagg ttattttcaa tattttattc aggacatatt caaggataga attggaagag 3120 acatggcggc atttctagat gatatattga tttatactaa accaggtgaa gatcatgaaa 3180 aggcggttaa agaagcatta gaaacattac gaaatcaaaa tgtatggtta aagcctgaaa 3240 aatgcaagtt tggtcagaaa gaaatagtct atttaggttt acagctttca cataacaaga 3300 tctcaatgga taagtcaaaa gtgcaagcag taacagcatg gccaacccct aggaacatca 3360 atgaagtcca gcaatttgtg ggatttgcta atttctatag acgcttcatt 
caagactttt 3420 ccagaatagc acgtccactc cacgaattaa ctaagaagga cgtcaaattt gaatggaatc 3480 ctgcaagaga tagtgcattc aagttgttga aagaagcttt cacgtctgcg cccgtactaa 3540 aaattgctga cccttataaa caatttgtcc tagagtgcga ttgttctgac tacgcactag 3600 gagcagtact atcccaagtg tccgacgacg acggtgaact tcatccagta gctttcctct 3660 caagatcact tatacaagcc gaacgaaatt acgaaatgtt tgataaagaa ttattagccg 3720 tagtagcctc ttttaaggaa tggaggcatt acttagaagg gaatccttac cgcttgaatg 3780 tgatagttta tacggaccat aaaaacttag aatcacgcat gacaactaag gaattaactc 3840 gtagacaggc aaggtgggct gagacattag ggtgttttga tttcaaaatt cggttccgac 3900 cgggtaaaca ttcaacaaaa ccagatgcgc tatcacgacg acccgacctg aaaccagatg 3960 acaactgcaa gttaactttc gggaagctac ttaagcctga aaatctacca acagatgctt 4020 tcatttcaga actagacgat tgtagtaagt ggctaagtga agatgaggaa tgcatggagt 4080 tttactttga taatgaggag gaggaattag cccatataat ggtaatggaa gaggaggaga 4140 tttgggacga ctcattgatc ctagaagaaa tcagaaagaa gacaaaagag gacccaagaa 4200 tcttagaact gatgcaagga attaaaaatg aaaaaaacga atactcatct catgacggac 4260 tattgtatta tagaggacta gtggaagtcc caaacgatca cgaaatcaaa atgagaattt 4320 tacaatcaag acacgatagt ttattagcag gccaccctgg gcaaatgaaa actttaagtc 4380 tcatccagcg aacctaccac tggccctcca tgaaagcttt tgttaacgca tacgtcaatg 4440 gatgccactc atgccaaaga gtaaaaccac gaactaccca cccatttggc agtctacaac 4500 ccctaccaat ccccgcagga ccatggactg acctttgcta cgacctgata acggacctcc 4560 cattatcaga gggtaaggac agcattctga cagtaattga tagactgaca aaaatgtgtc 4620 acttcattcc atgtaacacc accatgtcgt cagaagaact agctcgattg atgataaaat 4680 tcgtttggaa acatcacgga acgccaaaat ccataacttc tgatcgaggt aacattttca 4740 tttccaaatt aaccaaagaa ttaaaccgcc aattgggaat caaaacccag tcgtcgactg 4800 cctatcaccc acagaccgac gggcaatcag agattgcaaa caaagcagtg gaacaatatt 4860 tacgccactt tgttagctac aaacaggaca actggacagg attactcgac atggcagaat 4920 ttgcgtacaa caacagtcct catacatcaa caggaatttc tcccttcaaa gcaaactatg 4980 gatacgacgt ttcatactca cgaatcccgt ctagcgaacg ttgtattcct gcagtcgaag 5040 agatgctacg acagttatct gaagttcagg aagaattgca aagctcctta agactggcac 
5100 aggatacaat gaaatatcag tacgataaaa aggttctgga aacaccatcg tgggatgtag 5160 gctcacaagt ctggctgaat agccggcata tatcaacaac acgcccaaca gcaaagtttg 5220 gtcataaatg gttgggacca tttaccattg ctcaacgtgt atcaactaac gcatacaagc 5280 tgatcttacc tgagtctatg agccggatac accctgtgtt ctctgtaggt cttctgcgac 5340 cctacaaacc aagttcaatc aagggacaaa ttcaatctcc gccagcacca atcattgtgg 5400 acaacgagga agaatttgaa gttaatgaaa ttcttaatag aaggagaaga ggttcaaaag 5460 tagaatattt agtcaattgg aaggggtatg gaccagagga agattcatgg gaaccggcca 5520 gcgggttaga caacgcaaaa gaattggtag atgaatttaa tatacattat ccaacagctg 5580 aaaaagatta tagaaggaca cggagaaagt agagagggta acggcttttt ccttacgagg 5640 tttcttaatg cctacccggg gaaagacgtc aactcagcaa gagggagcgg gacgtaaagg 5700 gggaggtag 5709 // ID Copia-1_MVPL-I repbase; DNA; FNG; 4265 BP. XX AC AEIJ01000200; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Microbotryum violaceum genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_MVPL_; KW Copia-1_MVPL-LTR; Copia-1_MVPL-I. XX OS Microbotryum violaceum OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Microbotryales; Microbotryaceae; OC Microbotryum. XX RN [1] RP 1-4265 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Microbotryum violaceum genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AEIJ01000200; Positions 46726 42462. XX CC Positions [1489-2025] - Integrase core CC LTRs are 92% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(10..1041,1045..3249) FT /product="Copia-1_MVPL-I_1p" FT /translation="MSLFAATDLPHSKVDKLHNSFVKLKDESNYASWHKKM FT TLLFMGHKVMGSVLGEQVKPLSVPRPPATDIPDAAELDASADRQKKIATWT FT DRDIWAQSHIMATLSPEIEGMVMYCETSHEMWDLLEKKYRKRGHNALYSGL FT VCLLSTKYRDGDSIDEHITKLLTYSAELMKIGKPVPDEYLNVFILFSLPPS FT WSVVNTWYKQAAAIDEKVTIRPSLVVEHIQSEAQRRATANMQLVPVKVSSS FT SNAGALSANALKMCDFCNRKGHNADERYTNPNGQGYRPPQGKQSRKRGKAR FT SVKSTQSSNQEAVAFSVMSTGQSAKSRFIIDSGAWKNHTGDHTQLTDFIED FT PEAETANGQWVVCPGHGTLSIETAGGRKLEAKKVYHMPGMPVGLLSVQQMT FT NSGLSVHFEPDKLCTILRNGEFIASTIVLDGPYELDIKQPKLALATHETVS FT APLMLWHRRFGHPSPQTIINMARSGAVQGLVLSDKLIRDCLHCVLAKSKRS FT PFTIKATIATEILKRVCIDLGFVPEPDHQGRTVYLAIVDQHSCGRWVFPLS FT SKLSEGVIEAFNMFRTSVENMTGKKIRFVRSDNGGEFTSKMFETYFKTNGI FT IHERTAPYTPENNGQVERLNGSIMTTVKAMLHDANLPKTFWSYAMQAAVYL FT SNRTTVACLNRVTPYKIIFGKKPRVSHIKPFGAVAFVHIDGTLRTKLDDKA FT VEGILVGFNNYDYVVWLKDQQKEVRTRHATFGRRKHHLLETSELVTDFEVP FT PISEISEKDSTVTPAEKEVPQQLVPVPDGYIQVQNGYQPGKYGELDMSNII FT PYKHRQALFAGPALDSSSSNLQFIRIDDVLPQIDGKWGKTYLCVSVPQGHF FT IPKTYEEAIPCPDAKFWIIAIREELGALENHNVFEVSFLPDGAIALGSRWV FT FTIKLDAAGRIIHFKARLVAQGFAQRPGIDFHETFAPVARMLTIRFLVALA FT IARSLKLVQFDFDTAFLNGKMTDDVYMRVPKGWTGVIKPGQCLKLIASMYG FT TKQAPRKWNRTLDQLMVEKKWTKCLSDVCLYFKQVGSKYIIVAFYVDDGLV FT ASTSQELIETEIGALQATFKLK" XX SQ Sequence 4265 BP; 1009 A; 1221 C; 1025 G; 1010 T; 0 other; tcttaggtta tgagcctatt cgctgctacc gacctgcctc acagcaaggt ggacaagctc 60 cacaactcct tcgtcaaact caaggacgag tccaactatg cgtcatggca caagaagatg 120 actttgctct ttatgggtca caaggtcatg ggcagcgtcc tcggtgaaca ggtaaagccg 180 ctgtcagtgc ctcgcccgcc tgcgaccgac attcccgacg ctgcggagct cgatgcatcg 240 gccgatcgcc agaagaaaat tgcaacctgg accgatcgcg acatttgggc tcagagtcac 300 atcatggcaa ctctcagtcc cgaaatcgaa ggaatggtca tgtactgtga aacatcgcac 360 gagatgtggg acttgctcga gaagaagtac aggaagcgcg gccacaatgc tctctacagc 420 ggcctcgttt gcctactctc gaccaagtac cgtgacggcg acagcataga cgagcacatc 480 accaagcttc tgacctactc ggcagagctg atgaagatcg gcaaacccgt tccggacgag 540 tacctcaatg tattcattct 
gttctcgctc ccgcctagtt ggtccgtcgt caacacgtgg 600 tacaaacaag ccgccgcaat tgatgagaaa gtaactattc gtccatcgct tgtggtcgag 660 cacattcaat cggaagccca acgtcgtgct acagccaaca tgcaactcgt tcccgttaaa 720 gtctcgagct catcgaacgc cggtgcgctc tccgccaacg ccctcaaaat gtgcgacttc 780 tgtaacagga agggacacaa tgcggacgag cgttacacca acccaaatgg tcaagggtat 840 cgacctcctc aaggcaaaca gtcaaggaag cgtggcaaag ccagatcggt caagtccacc 900 cagtcatcca accaagaggc tgtggcattc tccgtgatgt caactggtca atccgccaag 960 tctagattca ttatcgactc tggcgcgtgg aagaaccata ccggcgatca tactcaattg 1020 acggacttca tcgaggatcc gtaggaagcc gaaaccgcaa acggccaatg ggttgtctgt 1080 ccgggacacg gtacgctctc tatcgaaact gctggtggtc gaaaactcga ggcaaagaag 1140 gtgtatcaca tgcctggtat gcccgtggga ttactctccg ttcaacaaat gacaaactcg 1200 ggtctctcag tccacttcga gccggacaag ttgtgtacca tccttcgcaa tggcgagttc 1260 atcgctagca cgatcgtctt ggatggtccc tacgagcttg acatcaagca accgaaactc 1320 gccttggcaa ctcacgagac ggtcagcgcg ccgctgatgc tttggcatcg ccgcttcggt 1380 catcccagtc ctcagacgat tatcaacatg gcgcgcagcg gggctgtaca aggcctcgtt 1440 ctttcggaca agctcatccg tgattgcctg cactgcgttc tcgccaagtc gaagcgtagt 1500 cctttcacga tcaaggccac catcgccacc gagattctca aacgcgtgtg tatcgatctg 1560 ggtttcgttc ctgagcccga tcaccaaggt cgtaccgttt accttgctat tgtcgatcag 1620 cactcttgcg gtcgatgggt ctttccgctc tcctccaagt tatccgaggg agtcatcgag 1680 gccttcaaca tgtttcgaac tagcgttgag aacatgacgg ggaagaagat acgatttgta 1740 cgctctgaca atggcggcga gtttacttcc aagatgttcg agacctactt caagaccaac 1800 gggatcattc acgagcgcac tgcgccgtac acccccgaga acaacggtca ggtcgaacgt 1860 ctcaacggat cgatcatgac gacggtcaaa gccatgctcc acgacgcaaa tcttcccaag 1920 actttctggt cgtatgcaat gcaagctgcc gtttacctgt caaaccgcac aactgtggct 1980 tgtctcaaca gagtaacacc ctacaagatc atcttcggaa agaagcctcg tgtgagtcac 2040 atcaaaccct tcggcgctgt cgcattcgtc catattgatg gaactctacg gacaaagctt 2100 gacgacaagg ccgtcgaagg tatccttgtt ggtttcaaca actacgacta tgtcgtttgg 2160 ctcaaggatc aacaaaagga ggttcgaact cgccatgcta cctttggtcg gcgcaaacat 2220 catcttcttg agacttcgga gctcgtaact 
gacttcgagg tacctccaat ttccgaaatc 2280 tccgaaaaag actcgactgt taccccggcc gagaaggagg ttccacaaca actcgtgcct 2340 gtccccgatg gctacattca agtccagaac ggttaccagc ccgggaagta cggcgagctt 2400 gatatgtcca acatcattcc ttacaagcat cgtcaagcgt tattcgctgg tccagcactc 2460 gattcttcga gctcgaactt acaatttatt cgcatcgacg atgtcttgcc gcagattgac 2520 ggaaaatggg ggaagacgta tctctgcgtc tccgtccctc aaggccattt cattccaaag 2580 acgtacgagg aggcaattcc ctgtcctgac gccaagttct ggatcatcgc aattcgtgag 2640 gagctcggag cgcttgaaaa ccacaacgtc ttcgaggtct ctttccttcc agatggcgca 2700 attgctctcg gttcccgctg ggtcttcacg atcaagttgg acgctgctgg caggatcatc 2760 cactttaaag ctcgtctcgt tgctcagggt ttcgctcagc gtcccggcat cgatttccac 2820 gagaccttcg ctcctgtcgc tcgcatgttg accatacgat tcctcgtcgc gcttgctata 2880 gcacggtctc tcaagctcgt gcagttcgac tttgacaccg ccttcctcaa tggcaagatg 2940 actgatgacg tctacatgag ggttcccaag ggttggactg gtgtcatcaa acccggtcag 3000 tgcctcaagc tcattgcttc gatgtacgga accaagcaag ctcctcgcaa atggaaccgc 3060 acgctcgatc aactcatggt cgagaagaag tggaccaagt gcttgtccga tgtctgcctt 3120 tacttcaaac aggttggctc caagtacatc atcgtcgcgt tctatgttga cgacggcctg 3180 gtcgcttcga cttcacagga gctcattgag acggagatcg gagcactgca agccaccttc 3240 aagctcaaat gacaaggtgc tgtgtctcac tttctttctc ttgagttcaa acgtgatgcg 3300 ggctattgta ttgtgcatca gtctcgctac attacggaca tcctcgctcg attcggattt 3360 gcttcgtcca agcccaagac gactccgatg attgattgcg aagccaggga gctcgatttg 3420 ttgcccctga tcaaggatgt tcacctctac caatccatgg tcggtgctct tcagtacacc 3480 gctcagatgg tttgacccga aatcgccgcc gctgttcgca gtgctgcaca gcgtctcgct 3540 ggtcctaccg aaaatgactt gctcgcggtc aagaggatct tttgatacct cgttggcacc 3600 attgactttg gtctctgcta ccgacccaat gcatccacgg tcatcacagc ctactccgac 3660 gccagctggg ccaacaactt tgagactcgt cgctcagtcg gtgcctatgt gcgtctcctc 3720 ggtggtgctg ctatctcatg gcagtccaag caacaaaccc tcgtcactac atcgacaact 3780 gagtccgaga tcctcgccgt tttgtcggca accaaggagg tgatttggct tcatcaggtc 3840 gccaaggact tgagcgtcga gcagccctca gccacgacga tctacgagga taatcaggcg 3900 acaatcaaga tcacctacaa cccagctcac cacgctcgca 
ccaagcactt tgatgtggtt 3960 catcacttcg ttcgcgagcg cgtcaccctt ggtgacataa agctggtcta ctgcccaacc 4020 aattcaatga tggccaacgt tctcacaaag gggctcggtc cgattaagtt cgctgcgcac 4080 cgcaaagcga tgggaatggt tcaactttca aagctctgaa ccagggggag tgtcacagtt 4140 attggttaag agtctgaaga gtttctactg cgattaggtt tgatctcatt gttctaatcc 4200 ttatttgctg aagggaagat attatcacct cacaaatctt gccctggcct tacaatctat 4260 tagct 4265 // ID Gypsy-26_LBS-LTR repbase; DNA; FNG; 154 BP. XX AC ABFE01003004; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_LBS_; KW Gypsy-26_LBS-I; Gypsy-26_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-154 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01003004; Positions 2799 2646. XX SQ Sequence 154 BP; 31 A; 47 C; 19 G; 57 T; 0 other; tgtaaggata cctatttgct cttatttaca cacgcatcaa gctataagta gctcgcctct 60 agctttcact ctcttccttc cttttccccc gtacttcatc tcatatacac gttttggaac 120 tgttttctcc tcgcgtgatc agtaccttct caca 154 // ID Copia-20_MLP-LTR repbase; DNA; FNG; 453 BP. XX AC AECX01002710; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-20_MLP_; KW Copia-20_MLP-I; Copia-20_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-453 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002710; Positions 11901 11449. XX SQ Sequence 453 BP; 113 A; 86 C; 54 G; 200 T; 0 other; tgttggagtg cttatttgaa atgactgttt tcttctacat ctgtttctct ctttatttgt 60 tatgttcttt atttgttatg tttctttatt ctacacgtat atttgatgtg ttcacataac 120 tacatttcat attcaaatca tatgtagtat gttttctatt tcttaactct ctcatctaca 180 gatcacagtt gtccttatat acctgtagga tcacgcacat agactcaaac ttcatctgtt 240 ttgactctac ctcatttata agctacattg actaacttat aaactcgtgc atgacttaca 300 ggtaagatct ttgttttgtt ttctcttctc atttgttctg ttatcttttc tactatgctg 360 ttacttgtat aaaagaaatg aacttcatct gttttgactc tacctcattt ataagctaca 420 ttgactaact tataaactcg tgcatgactt aca 453 // ID TPA5_I repbase; DNA; FNG; 4239 BP. XX AC AJ439553; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Pichia angusta retrotransposon TPA5_I, internal region. XX KW LTR Retrotransposon; Transposable Element; RNaseH; TPA5_I; gag; KW integrase; internal region; pol; protease; reverse transcriptase; KW internal portion. XX OS Pichia angusta OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Pichia. XX RN [1] RP 1-4239 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439553; Positions 323 4561. 
XX SQ Sequence 4239 BP; 1149 A; 955 C; 758 G; 1377 T; 0 other; ggttatgagc cctgtcagcc acttccaatt agctaatcca ttttcagaaa catggaagct 60 tatgtcatcc aacaactatc tggagtggaa aagaagattt gaaaaatatc tacttttagc 120 aagtgaacat atttaccagt actataagac tggccaattg aatgttcaaa atgacgtcct 180 tattgatgag tccgttattc aagaatgtga tattgccttg aaagtgctcc tattgaaaac 240 tttgtctcaa aatgttattg acgtcctaga tcaaaatgtg atgtccggat ttgatcatat 300 aagagccatt gacaaaactt acgggaatat aagtttgcgt ggtttattac ttaggtacaa 360 gaaaatgaac aaagattccc cttctccttt agaacatatt agaaacttta ggatgctcgc 420 agaagatctg aaatttattg caccagattt agagactgtt gaagttctat tgtgcctctc 480 aacactgaac gatgattctc ttgaagataa gttgtttgtc aacaaactag aaagactatc 540 gttcatggat ttacaaaatg catatgctga aaatgagtta agaccttcat ttgcgtatac 600 aatcaatcaa tctaagcaaa tgcaagattc aaaatcttct aagaagaaaa agatcatatg 660 tactagatgt caaaaggaag gacacaagtc ttttgaatgt cgtgctcctg ctccagtctc 720 taagaagtca cttctaagtg atgaaactgg aaaagatgat gatacttatg gcgaaatctt 780 tgttgctgaa gcatttagaa ccatgggtga ggtatacaca gcgagaaact taccccttcc 840 tcatattgat gatgcttatt ggtttgactc gggagctact tgtcatattt ccaacaacag 900 aaaccatttc agcagttttg aaaaatatca tggttatctg caaggtttat cttcaggatg 960 tcaaattgaa ggaaaaggaa ctgtcctact atatcacaat ggtaaggata ttactttaac 1020 agatgtctat tatgttcctg aggccggaaa gaacttaatt gctttctgta aagtctttaa 1080 agcaggagga attcgcgttg atgaatcagg actatttgtc aacaataaac tgtttgctac 1140 ttatcaacca aatagactat tcaaatgcat attctctccc cagtactgtg cacaggctaa 1200 tctagttatg gactctcatt ctcatcaacg gtttggacat ccatctgaaa agacttctcg 1260 caaattgggt ttacctatct tgaaagaact ctgtgcaagc tgtcagtttg gacgaaacac 1320 tgcaacattt cccaaggttt cacgaacagt cgtcaaagct cctctagaac tactacatgt 1380 tgatgtgtgt ggcccattta acacaaaagg tacacaaaat gaacgatttt tcttaaccat 1440 tgtcgatcgt ttcactcata tggtggccgt ttacccatta caacataaat ctcaggtttc 1500 tcagctattg aaagagtata ttacttatgc tgaaaatcac tttcatcgtt tcccttataa 1560 ggtcatgcga gttcgaagtg ataatggcac tgaattctgc aataatcagt tactgacttt 1620 cttcaaacaa aaagggattc aacatgaact 
tactaatact tactccagct atcaaaatgg 1680 cgttgctgag cgtatgcatc ggactctcat tagtcgggtt cgaattctct tggctagctc 1740 tggttgtcct gacatgtttt ggcctcaggc actgaaattc gttgctctca ttatcaacca 1800 ggaaccttca tcctctattc atggggacat tcctcatatg cgctggtttg attctcaacc 1860 agattataca atgtatcatc cttttggatg tcaggcttat cctttaactc ccagtgttca 1920 tcggtcttcc aaactttctc cggtgtccac ttctagcatc tttatgggag tgagcgcccg 1980 ccgcaaagct tatatctttt atgatcctat tgctgacagt tttactgaat ctcaacatgc 2040 aacattttcc gactctcact ttccgttttt acaccaaact cgaaaatctg tgggtgttcc 2100 catttttgac ctatccgagt cctataggca gcaatcctct ggaattctgc cgtcattacc 2160 gaacccctct actgtgccaa tattgcctag aggtatatcc gatgtctcgt ctcgtccttt 2220 gtctcattct acttctgatc cggactatac tccctctgac tctatggact ataagtcttc 2280 tgattctatg gagctcgttc cacttgatca tattgaatct aatttggaat ctatttccga 2340 tcctactgtg gatatttctc ctgattattc ccctgcacca tctgtgccta tatcttctaa 2400 gttgattgac tatcccgttg aacgatctct tattccctac tcacaatcag cacatgatca 2460 ttctactgtt gttacttttg ctcctcgtcc tgagtgtcct tctgaaccct cccttcttcc 2520 tccaccaacc tccgcttcta tgaaccccga attgttaact cccgttattc ctgctgtgca 2580 tcctcgggat ccttctgaat ctgatgttgg acctgatcct ccccgctatc ctatgcggcg 2640 acttaatgcg gatgcctctc atgctcttct tgctgacgct gcctcagctg tctcttcttc 2700 caacgttcct acctcggtca ccgaggcctt aaactcatct gtttggtcta aggccatgca 2760 acaagaaatg gaggctcacc atctaaataa aacttggtcc ttggtccctc ttcctaaggg 2820 tcgacgtgct ttaggttgtc gttgggtttt cactgagaaa ctcccaagtc atcttcctaa 2880 agctcgtctt gtggttcaag gttttcgcca gatagagggt attgattata ctgaaacctt 2940 tagtccggtt gttcgttatg aaagtgtgcg cactcttctt gctcttgctg ctcaaaaaca 3000 tatggttatt catcagatgg atgtcactac ggcctttctt aatggtgatc tagctgagga 3060 gatttatatg actcagccca ttggctttgt tcatccgggt caggattctc ttgtttgtca 3120 tcttcagaag agtctatatg gtctgaagca ggcccccttg tgctggaatc tcaagattga 3180 ttctctacta gcctcttctg gtttccgcaa aattccatgt gagtttggtc tttatatctc 3240 gctcactcat cctttgatct atgttgctct ttatgtggat gatctcctca ttctctctac 3300 gtctcaggca cgtattactc ataccaaatc tcttctttct 
catcatttta ttatgaaaga 3360 tcttggcact gcaagccact ttcttggact agatatacat cagtctcttc cttctattgc 3420 tctctctgcg tccacttata ttcatcagat gctcgatgat tacaatcttt ctgcttgtaa 3480 tcctgtttcc actccttgtg atactcaatc ttgggtgctt tccgatgatc ctcctttatc 3540 caaccctact gaattccgca gcatggtagg gaagttgcta tttgctgcta acacagtaag 3600 gattgatatt gcatttgttg tctcaaaatt aagtcgtttc ttacaaaatc cacgcaaatc 3660 acatttagct aaagccagac atgtgatgcg atatttaaag ggcactgccg atttgggaat 3720 tgtttactcc aaagactccc atttcaaatt atccggattc tgcgactctg actggggaac 3780 tgatccgaat gatcggatct cagttacagg gtatgtcttt atcttagcag gtgctcccat 3840 tacttggaga tctaagaaac aaacaacagt ggcattatcg tcaacagaag cggaatatat 3900 ggctttaggc gatgctctca gagagctctt atggttaaaa caattacttg atcaacttga 3960 aatcaaatcc tctgaacctc cgacactcta ttgtgacaat accggagcat tatctttgat 4020 gaagcatcca gcatttcatc caagaacaaa acacattgat ataagacatc acttcatcag 4080 acaacatcta acctccaatg attttatctc aaaacatgta tcatcaggca tgaatctagc 4140 tgatccactt actaaagcat tagataagcc aaaatttgaa tctttgagac taaaatggaa 4200 atttgaaaaa ctttgacata gtatacgact atggtaggg 4239 // ID Copia-1_BDJ-LTR repbase; DNA; FNG; 676 BP. XX AC AATT01000134; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Batrachochytrium dendrobatidis DE genome: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_BDJ_; KW Copia-1_BDJ-I; Copia-1_BDJ-LTR. XX OS Batrachochytrium dendrobatidis OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; Chytridiales; OC Chytridiales incertae sedis; Batrachochytrium. XX RN [1] RP 1-676 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Batrachochytrium dendrobatidis RT genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; AATT01000134; Positions 105870 105195. 
XX SQ Sequence 676 BP; 215 A; 151 C; 70 G; 240 T; 0 other; tgttaaagaa atcagacctt tgttgaagta atcaaactct aaccgattac gaaccgttca 60 actaattaac ttacatcgga tataatatcc aagcagggta ttattcaact ctaacacaat 120 gtaacactag tgtgtcacat tgtatatccc ctaattcata acgtcaccat cattattgac 180 ccttatgttt ctataacgac cctacttact ttaacattgt attcatagtt accaatcata 240 tgtaactata tcccattata acgacacagc taacagctgt aatcaatcct attggccaag 300 ttatctccgg gtactatagc ctattatcaa atactacctg tttgagccac aagcctcaaa 360 caatctttta ctataagaac taagacactt tcttattaaa taaacatcct ctgaatatcc 420 tctgaatatc ctctgaatca atcatctaaa tatcatctga cataacttcg ttattatcta 480 ttaagttcaa ctcgttattc tgttgacttt aatttgcgtg actacttcct gattatcatt 540 caagatctct tctgaccgat cttttctaca gcggcagatt atatattatt accttatact 600 ttactaattg aaagactcag tctctttctc tatttttatc tctatacttc ttgcgcgata 660 atcaaaatct ttaaca 676 // ID Gypsy-79_MLP-I repbase; DNA; FNG; 5681 BP. XX AC AECX01001086; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-79_MLP_; KW Gypsy-79_MLP-LTR; Gypsy-79_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5681 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001086; Positions 19167 13487. XX CC Positions [2818-3237] - Reverse transcriptase CC Positions [4483-4920] - Integrase core CC 'GTGCC' target site duplication CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 2176..4920 FT /product="Gypsy-79_MLP-I_1p" FT /translation="MNSTNLAEVSRYPQTPLKNHDMRPEGEARNIEMGVVV FT ETNPEKPPPSKCDNAHSPFTKKNVSKQFSLLQRLRTGLRARPQAHNHCLLR FT EQPIGAMIDASKASWNVSARLAAEDAKEKPVRTAKEMVPDCYHEFLGMFEK FT SNSNVLPPHRPYDFRVDLVPEATPQAGRIIPLSPKENEVLNEMLEKGLANG FT TIRRTTSPWAAPVLFTGKKDGNLRPCFDYRRLNALTIKNKYPLPLTMELID FT SLLDADQFTSLDMRNGYNNLRVREGDKSKLAFICKAGQFKPLTMPFGPTGA FT PGFFQFFIQDILQSHIGKDVAAYQDDILIYTKPGVDHKAVVKEVLAILKAQ FT NVWLKPKKCRFSQKEIAYLGLIISRNQIHMDEGKVKAVRDWPAPKNLSEVL FT TFLGFANFYRRFIHHFSKIARPLHELSQDGVKFDWSKERNQAFETLKLAFT FT TAPVLTIADPYAPFILECDCSDYALGAVLSQVSKIDNELHPVAFLLRSLIK FT AERNYEIFDKELLAVILAFKEWRQYLEGNPNRLNVIIVYTDHKNLQSLMTT FT KELTRRQARWAEILGSFDFEIRFRPGKDSAKPDALSRRPDLVPASGERLTF FT GRLLKPENLPEDAFIDSLDAVEQWIEDESQIEMEINDLETSSGIWSDEDIL FT NEIRLKMKQDEKINEIARCCADMPSSKLLVDYSWGDGLLYFRGKVMVPNYN FT NLKLQILKSRHDSLLAGHPGRMRTLMMIKRAYHWPSMKAFVNKYVEGCSSY FT QRVKAQTSKPFGKLQPLPIPSGPWVDICYDLITDLPESEGFDSILTVVDRF FT TKMAHFIPCRKTMTSEELATLMIANVWKSHGTPRTITSDRGNIFISKLTKA FT MNSRLGIVTQALTAYHPQTDGQSEITNKAVEHYLRHFVSYKQDNWSELLPL FT AEFAYNNNLHV" XX SQ Sequence 5681 BP; 1940 A; 1140 C; 1285 G; 1316 T; 0 other; tattgcaaca tctatcttca agaatacaaa caacgggaca acactcaatt aatcaaagtt 60 aagaaaagaa gaaaaagaaa aggaaagaaa gaagttataa taagaaaaga aaatcaaagt 120 ttgaattaaa aagttttttt ttagaagaaa gaagaaagtt aaagtgcaag aaattgaaac 180 attcgctgca ggctatccaa tcgataccag atcttcaagc tgtaaaacaa acaaggttct 240 aacaattctc agatcatcca ctagttaatt cacaaactgc atcacttcat cagtcaaaac 300 gcccaaattc acattaccta ttgaaagttt gtcggaatct gaatctgaag gatcactatt 360 ctacgaaacc acagatcaga gaatggctac aactttggaa gacatcatgc gtcagttaaa 420 tgagctaaac gcaaggttga acgaggaaac ggcacttcgc caaacggcgg agcgtgaact 480 gcaggaaatg agaagaaatc aaacggctga acaatctgca ccccatgttg aaatgaatca 540 cccaccacag tttcagcccc acacgatccc tgttcaacaa gcaatccctc gtgcaccgaa 600 ggtcgctacc ccggacaaat ttgacggctc taaaggtttg aaggcagaaa tcttcatgaa 660 ccaaataggg ttatacatgc aattgaacgc tggtgtattt ccgaacgatc aggccaaggt 720 
tgcctttgcc ttaagctaca cgacaggcaa agccaacata tggggtcaaa cgttaacgga 780 tcagctactt gatagtgaga aagctcagac tgtaacttgg aacagattca tagactgatt 840 taagagcacc ttcttcaata gtgaacgtgt aactaaggcg gagaaagaga tgcgtgaatt 900 gaagcaaacg aaatctgcat ccgactattg gatccgtttc tctgaactgg ccttaatcat 960 caaatggcaa gatagtatat tgagatcaca attcaagcaa ggcctcaaga ctgaaatctc 1020 tgtgctgatg gttcgagatg aatttgagaa tgtggaggac atggctaaac tggcaattcg 1080 actagacaac aagatcaaca aacgcaatgc agaccaacct accttgaatc acacaactca 1140 gagcacacag tcaactcagc caattgaccc agacgccatg gattgctcgg cctattggtt 1200 ggacatcaca caagacgaat atcaacgtcg aggaacaacc gggtcgtgct acaactgtgg 1260 aaagagtggt aactttattg ccgactgtcc gatgagaaag agaagaggaa ggggtggagg 1320 tttttcaggc agtacttatt cgagaggttt tagtagagga cgttcaggtg gaagttactc 1380 aggtggttat caaggaggaa gttcaaggat tcaggagtta gagagtaaaa ttcaagcgcg 1440 tatagatgaa ttagatgctc aattagatgg aagtgataag aaagtagaag gaagagcgga 1500 ggaggcaaaa aatggaggtg ctcgggattg aaggttgtgc ctaccccgag cgtgatcaaa 1560 ttagggttaa atgaaaattt tattgattct cttgaaatga atgatacacg aattatcgac 1620 ttcattttgc tctatgaccc taagactgac acaaccaaga aagcacgtgc gctggtagac 1680 agcggtgcca cgcatgaagc tataagtgag aattttgtga agacccacaa ttttgagctt 1740 agccccctac cgcagagaag aagcgtcacc ggcttcagtg gacacgaaat gacaataacg 1800 cactcaggag actgctgtgt gaacaacaag acgaccggga ccaccttcat tgtcacaaca 1860 ctaagagaca agtatgacat cattttaggc atgccatgga tcaaggaaaa tcacgaatta 1920 attgactgga aagaaggaaa actcaagact cacgagatgg acattgcaac tgctgagaca 1980 gttttgttgc tgccaaaaaa cacctcgatg gaccactgat tggagcctaa gaggcaagct 2040 aggaacagta acaagggggt gcgagttaat cttgactcga caacaccccc gcaatgtgag 2100 tgtgcaactt ctcaaattaa tgacaatcat gcatcagctg gcgagtatag tctccctttg 2160 aaatttgata gtgaaatgaa cagcacgaac ttggctgaag tctcaaggta tccgcaaaca 2220 cccctgaaaa accacgacat gaggcctgaa ggggaagcta ggaacattga gatgggggta 2280 gtggttgaaa caaacccgga aaaacccccg ccaagtaagt gtgataatgc ccattcacca 2340 tttaccaaga agaacgttag caagcaattt tctctcctac aaagattgag aacaggacta 2400 cgagcgcgac 
cccaagcaca taaccattgt ctgcttcgag aacaaccaat tggagcaatg 2460 attgatgctt ctaaagcatc atggaatgtg tcagcaaggt tagcagctga agatgctaaa 2520 gagaaacctg tacgaacggc caaggagatg gtaccagatt gctatcacga gtttctgggc 2580 atgttcgaaa agtccaactc aaatgttcta ccaccacacc ggccttacga cttccgtgtg 2640 gatttagttc ctgaagcaac cccacaggct ggacgaatta tccccttatc tcctaaagaa 2700 aatgaagtac tcaatgaaat gttagagaag gggttggcga acggaaccat aagaagaacc 2760 acttcacctt gggcggctcc agttctattt acaggaaaga aagatggcaa tttaaggcct 2820 tgttttgatt accgaagatt aaatgcatta acaatcaaaa ataagtatcc cctcccactt 2880 acgatggaat taatagacag tcttctagac gcagaccaat tcacgagttt ggacatgaga 2940 aatggttata ataatttacg tgtcagggaa ggggacaagt cgaaactggc cttcatttgc 3000 aaagctgggc aattcaagcc gttaacgatg ccgtttggtc cgactggagc acccgggttc 3060 tttcagttct tcatacaaga catcttacaa agccatattg ggaaggatgt cgcggcgtat 3120 caagacgata ttctgattta cactaaacct ggagtagatc acaaggcagt agtgaaagaa 3180 gtcttagcaa tactgaaggc tcagaatgtc tggttgaagc ccaagaagtg cagattctca 3240 caaaaagaaa tagcttactt aggattaatt atctcgagaa atcaaattca catggatgaa 3300 ggaaaagtca aagcagtccg tgactggccg gcaccaaaaa acttatctga agtacttaca 3360 tttttagggt ttgcaaactt ctataggcgc tttattcatc acttttcaaa aatagctcga 3420 cctcttcacg aattgtcaca agatggagta aaatttgatt ggagcaagga acgtaatcag 3480 gcattcgaaa ctctgaaact agccttcacg actgcaccag tgctgactat agcagaccca 3540 tatgcgccat tcatcctcga gtgtgattgt tctgattacg cgttaggagc ggttttgtca 3600 caagtttcca aaattgataa tgaactacat cctgtagcgt tcctgttgag atcgctaatc 3660 aaggcagaaa ggaattatga gatatttgac aaagaattac ttgcagtcat attggcattc 3720 aaagaatggc gtcagtatct ggaaggaaat ccgaacaggt tgaatgtaat tattgtgtac 3780 acagaccaca aaaacctgca atctttaatg accacaaaag agttgacccg tcggcaagca 3840 agatgggcgg aaatcctggg gagcttcgat ttcgagatcc gttttcgtcc agggaaggac 3900 tctgctaaac cggacgcgtt atcgaggcga cctgatcttg tacctgcaag tggtgaaaga 3960 ctcacgtttg gcaggttatt aaagccagag aatctgccag aagacgcatt cattgactcc 4020 ctagatgctg tagaacagtg gattgaagat gaaagtcaga tagagatgga aattaatgac 4080 ctggaaacta gcagtggaat 
atggagtgat gaggacatct tgaacgaaat cagactgaag 4140 atgaagcaag atgagaaaat caatgaaatt gcgagatgct gcgccgatat gccatcttcg 4200 aaactcctgg tagattattc ttggggggat ggactactct actttagggg taaggtaatg 4260 gttccgaatt acaacaactt aaagcttcaa atcctaaaat cccgccacga cagcctatta 4320 gcaggccatc caggtaggat gcggacgttg atgatgatca aacgagctta ccactggccg 4380 tcaatgaaag cgtttgtaaa caagtatgtt gagggttgct catcatacca aagagtgaaa 4440 gcacaaacct caaaaccgtt tgggaaacta caaccattac caataccaag tggaccatgg 4500 gtagatattt gctatgattt gatcacagat ttgcctgaat ccgaaggctt tgatagcata 4560 ctaactgtgg tagatagatt tacaaagatg gctcacttca taccatgcag aaaaacgatg 4620 acatcagagg aactagccac actcatgatt gcgaacgtgt ggaaatctca tggaacacca 4680 agaaccatca cttccgacag aggcaacata ttcatatcca aattaaccaa agcaatgaat 4740 tcaagattag gtattgtaac gcaagctttg acagcctacc atccgcaaac ggatgggcag 4800 tcggaaatca caaacaaggc tgtagaacat tacttacggc actttgtctc ttacaaacag 4860 gacaactgga gtgagttgct gccactggcc gaattcgcct acaacaacaa cctacacgta 4920 tgaataggaa tgtcaccatt taaggcgaac tatggattcg atgtcagttt cacaggaact 4980 ccttctgagg agcagtgttt accctctgta gacgaaagat taagtcagct caaagaagtt 5040 taagatgagt tatcacacgc gatggaagaa gctcaagagt tgatgaaaca agaattcaat 5100 aagaaagctg tacaaacccc ggattggaat caaggagacc tagtatggtt gaatagcaag 5160 cacatctcga cgacgatacc aactgctaag ttttcacaca gatggatagg tccttacaaa 5220 atattggaaa gaatatcagc taacacctat aaacttctgt tgccgaaaga gatggaagca 5280 gttcacccgg tattcaacgt cagcttatta cgtaagttta agaagagtca aattgaagga 5340 caagtacaac ctccaccagc accaatctta attaatggca aagaagaatt tgaagtacat 5400 gaaatactaa acaagaggaa attaagaggt agaactgaat acctaattag ttggaagggg 5460 tatggacctg aacacgactc atgggagcca gagaaagaat taggaaatgc taaagaagca 5520 gtacaagaat tcaacacaag atatccgcaa gcagaaaata attatttcag gacaaggaga 5580 agagtgagag ggtgaagttt ttcccactgg gtttttaatg ctaacccgtg gaaagatatc 5640 tagtctgtca agaggagact gagatataaa aggaggagtg g 5681 // ID ASCOT1 repbase; DNA; FNG; 409 BP. XX AC AF054897; XX DT 15-SEP-1998 (Rel. 3.08, Created) DT 01-JUL-2005 (Rel. 
10.08, Last updated, Version 2) XX DE Ascobolus immersus Ds-like transposon Ascot-1 inserted within b2 DE gene. XX KW hAT; DNA transposon; Transposable Element; ASCOT1. XX NM ASCOT1. XX OS Ascobolus immersus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Ascobolaceae; Ascobolus. XX RN [1] RP 1-409 RA Colot V.; RT "ASCOT1."; RL Direct Submission to Genbank (20-MAR-1998)Dept. of Microbiology, RL Institut J. Monod, Universities Paris 6-7, 2, Place Jussieu, RL Paris 75251, France. XX RN [2] RP 1-409 RA Colot V., Haedens V. and Rossignol J.L.; RT "Extensive, nonrandom diversity of excision footprints generated RT by Ds-like transposon Ascot-1 suggests new parallels with V(D)J RT recombination."; RL Mol Cell Biol 18(7), 4337-4346 (1998). XX DR GenBank; AF054897; Positions 1 409. XX SQ Sequence 409 BP; 108 A; 102 C; 110 G; 89 T; 0 other; cagtgttctc aacagtcagt ccggcccgcc attttgggcg ggctgtagtc cgactgccaa 60 gtagcccaac ctagaaaatg gcctagacac cgtagtccag accagaggta cagactggac 120 tggtctatag acagactgta ggctgtagac catgccaagc ccagtcagaa ccaccttgac 180 actactaaaa atgtcatatt tccctctaaa gaagcctgaa aataggccaa aaatcagcgc 240 caagatgata tgaagcgggc tgtggacggt ctgccttaga atgatggtct acagtccgtc 300 cagaatgtgt aggctggtct gtagaccagt cgagtaccct agggtttggg ctggattgac 360 tgggcggtct gtagcccgtt tacagccgga cggactattg agaacactg 409 // ID Copia-53_MLP-I repbase; DNA; FNG; 5255 BP. XX AC AECX01000204; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-53_MLP_; KW Copia-53_MLP-LTR; Copia-53_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5255 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000204; Positions 17291 12037. XX CC Positions [1663-2163] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 118..3693 FT /product="Copia-53_MLP-I_1p" FT /translation="MSKDHDTKPSEPQIKYNMSNPSSGNTHFSTSAVASIP FT KLTTGNIYAWKTEVKIYLKMNGLYDFIKEEIKRPSEVQEKSRFDMRQAAAL FT YAIRNSIDSANLASIESIEDPKKAFETLVSQHGSDNGITTANTLTELFSLK FT YDHSIGITNYIAKVQDLHSKIRDMTAGDKDLQLSDRIYAILLVNSLPRNEF FT GLIIQHFLSNIKTISTSDVCARLRLEASSNTNSEEKFKEVYYTKSKRPTRI FT DNRKVGKSPKDLCHIHPNSKHTNEQCHTQKKNENKDSSSLSVEEMAKRYQA FT MMAKSINTNSASVHVAVENDSSNDFITYSAFNATTHITQKDQFLIDSGANT FT HIASSAKLLSNIHTIQPVNISGIGGQNGRITAKLSGTAYIHGKTHTGENRL FT IALEDVLLIPEAGVNIISVSAMLREGANLTSNQTNIILENELQDYIITGKG FT SDGLFLTQARLSSALFAAPASIPADIWHRRFGHLNYRSLSKVSPTSTSKLD FT CEPCTLSKAYRLPFSSHFPKSDVPLFRIHSDVLGPMPTASLGGGKYVVSFI FT DDATRYNCIRIMRTKSQVFSFFVQYITEVERYQNKKIAIFKSDRGGEYTSN FT QFTEFLSQKGITFERAPAETPEQNSVSERFNRSILERTRSTMIEGNIPKFL FT WGEIMMATSYLLNISPNASIDMETPLVLWNADIAGAHPPNTKFLRVLGCAA FT YPLLKSTELNKLSAKSRLCVHVGYEKGARAYRLWDLTTKKIIISRNVFFNE FT RLFPFKNKNNDEQSFITDDEFFPFNETMLEANKINSETAQIPFNTTSNNST FT TQENTTHLATSNPQSAQRPFSHFQSQPRGQQESLSQSSSPFLTPQQLSPIS FT PNFSEFSLSDKASDLISHNISPIEFTPKEIIPLHHSIHAPHNINKKKESQE FT KELKEKEMKELKEKELKELKEQELKQLKEKELKEIKEKELKELKENQLKES FT KEKQLKELKEKELKEQELKQLKEKELKEIKEKELKELKENQRKESKEKQLK FT ELKEKELKEKELKELKDLKEKELKEKQLQEKELKELKDKESKQMKENHENN FT TKEKENNLKIKIKITKQMKTDLESLKATEKSREEDNHQKNTSDSITKENNS FT EYPKDKNEPLPHPEPPIPNRQPIPIQNSTTISQRPIRDRKPPVRYGNLTLY FT AAKKSDDCPTYTEAMRGKERDTGSKQWKKNLTL" FT CDS 3672..5255 FT /product="Copia-53_MLP-I_2p" FT /translation="MEKEFNSLTSHNVGELVDAPPEANIIGGMWRFKRKRD FT EHGNIIKYKARWVALGNHQIWGVDFDKTYASVVQSDTMNMLFSLCASEDWE FT MEQFDIATAFLKGKMHMPVYTRQVQGFYNPKDPKKVWLLRQSIYGTRQAHR FT EFNTDLESKLRSIGFTPSKDDNSLFTLRREKELVHIPMHVDDGLTFSNSKP FT LLKEVKEKLHQLYDLVWTDEFTLHLGIKITRDCKNRTIHLSQEHYLKNVLD FT 
RFDMTHCNTAPTPFPTSVELTPGTDEEITDAAHLPYQQAIGCLNWAAVHTR FT PDIQYAVSTLARYSSKYTLRHWKVLKHLLRYVQGTLDRGILFKNHEVPSTE FT LRAYADADYAACTETRRSTTGYVFTLGGSLVSWKSRRQPTVALSTTEAEYM FT ALGDCAKHCLWFRRMISHLTQTPIPTSPISLPPLSIFNDNNGAVFLSQESA FT TNSRSKHIDIRHHFIRDLINSHQISTHMIDTKTMPADFLTKNAPKDMLNRC FT RFLIGNISSNETHNITTHLLPKPTKVSSKGGC" XX SQ Sequence 5255 BP; 1966 A; 1261 C; 829 G; 1199 T; 0 other; ttaggttatg agcccagcgc tctagacgcg tactatatat ttacataata caaacccttt 60 gtccatcaaa ggtactatca attaattcta tctctctata atcgtttttc ttcctccatg 120 tcgaaagatc acgacaccaa accatccgaa ccccaaatca aatacaatat gtccaatcca 180 agttcaggaa atacccattt cagcacctct gcagttgcct ctatccctaa actaaccacg 240 ggaaatatct acgcatggaa aactgaagtg aagatctatc ttaaaatgaa cggactatac 300 gacttcatca aagaagaaat caaacgaccg tctgaagttc aagaaaaatc tcgattcgac 360 atgcgacaag cagcagcttt atacgcaata cgcaactcta ttgattcagc aaacttggca 420 tccatcgaat ctatcgaaga tcctaaaaag gctttcgaaa ctcttgtttc tcaacacggc 480 tctgacaacg gcataacaac tgcaaatact ctaaccgaac ttttcagctt aaaatatgac 540 cactcgatcg gtatcacaaa ctacattgca aaagtccaag atctgcacag caagattcgt 600 gatatgacag ctggagataa ggaccttcaa ttatcagaca ggatttatgc tatactttta 660 gtcaacagct tgcctcgaaa tgaattcggc ctcataatcc aacatttctt gtcaaacatc 720 aagacaatct ccacaagcga cgtatgtgct cgactccgac tcgaagccag ttcaaacacc 780 aacagcgaag aaaaattcaa agaagtctat tacaccaaat caaaacgacc tacgagaatt 840 gataacagaa aagttggaaa atcgccgaag gatctatgtc atattcaccc gaattcaaag 900 cacaccaatg aacaatgtca cactcaaaag aaaaatgaaa acaaagactc tagctcactg 960 tccgttgaag aaatggcaaa acggtaccaa gctatgatgg ccaaatcaat caacaccaat 1020 tccgcatctg tacatgtagc agttgagaat gactcaagca acgatttcat cacgtactcc 1080 gctttcaacg caacaactca catcactcaa aaagatcagt ttcttatcga cagtggtgca 1140 aatacccaca tagccagcag tgcaaaactt ttatctaaca ttcataccat acaacccgtt 1200 aacatttctg gaataggggg acaaaatgga agaatcactg caaaactctc tggaactgcg 1260 tacattcacg gaaaaactca taccggcgaa aatagattga ttgctcttga ggatgtactt 1320 ttaatccccg aagcaggagt caacataata tccgtctcgg ctatgcttcg tgaaggtgca 1380 aaccttacca 
gcaatcaaac caacatcata ttagaaaacg aacttcaaga ctatatcatc 1440 actggcaaag gatctgacgg actgtttcta acacaggcac gtctgtcgtc tgctcttttt 1500 gccgctcctg catctattcc cgctgacatc tggcaccgtc gatttggtca ccttaactac 1560 agatcccttt ctaaagtatc accaacatcc acaagcaaat tggactgtga accatgtact 1620 ttgtcaaaag cctatcgatt acctttttct tcccactttc ccaaatctga tgtcccttta 1680 ttccgtatac atagcgacgt tttaggacca atgcccactg catccttggg aggaggaaaa 1740 tatgttgttt ctttcataga tgacgctaca agatacaatt gtattaggat catgcgaaca 1800 aaatctcaag ttttctcatt ttttgtgcaa tacatcactg aagttgaacg ttatcaaaat 1860 aagaaaatag cgatattcaa atcagatcga ggaggtgaat acacctccaa ccaattcaca 1920 gaattcttgt cacaaaaagg tataaccttt gagcgtgctc cagctgaaac acccgaacaa 1980 aactcagtct ctgaacgttt taaccgatcc atccttgagc gaactcgttc caccatgatt 2040 gaaggaaata tcccaaaatt cctgtgggga gaaatcatga tggcaacttc ctacctgctt 2100 aacatatctc ctaatgcctc aattgatatg gaaactccac ttgtactatg gaatgctgat 2160 atcgcaggag ctcatccacc aaatacaaaa tttcttcgtg ttctaggctg tgccgcatat 2220 ccactcctga agtcaaccga actcaacaaa ctttctgcaa aatcacgtct gtgtgttcat 2280 gtaggatatg aaaaaggtgc aagggcgtat cgtctgtggg acttaacaac caaaaagatt 2340 atcatctctc gtaatgtgtt tttcaacgaa agactgtttc cttttaaaaa caaaaacaac 2400 gatgaacaat cttttattac tgatgatgaa ttcttcccat tcaatgaaac catgcttgaa 2460 gcaaacaaaa ttaacagtga aacagctcag ataccattta acactacatc aaataactca 2520 acaactcaag aaaacaccac acacttggca acttcaaacc cacaatcagc tcaaagaccc 2580 ttctcccatt ttcaatcaca accacgtggc caacaagaat ccctatctca gtcttcatca 2640 ccatttttaa ctccacaaca actatcgcca atatccccta atttctctga attctcgctt 2700 tctgacaaag cttcagattt aatctcacac aacatatctc caatagaatt tacacccaaa 2760 gaaattatac ccctccatca ttcaatacac gcaccacaca acattaacaa aaagaaagaa 2820 tcacaagaaa aagaattaaa agagaaagaa atgaaagaat taaaagaaaa agaactaaaa 2880 gaattgaaag aacaagaact gaaacaacta aaagaaaaag aattaaaaga gataaaagaa 2940 aaagaattga aagaattgaa agaaaatcaa ctaaaagaat caaaagaaaa acaactgaaa 3000 gaactgaaag aaaaagaatt gaaagaacaa gaactgaaac aactaaaaga aaaagaatta 3060 aaagagataa aagaaaaaga 
actgaaagaa ttgaaagaaa atcaacggaa agaatcaaaa 3120 gaaaaacaac taaaagaact gaaagaaaaa gaattaaaag aaaaggaatt aaaagaactg 3180 aaagacctga aagaaaaaga actaaaagaa aaacaacttc aagagaaaga attaaaagaa 3240 ttaaaagata aagaatcaaa acaaatgaaa gaaaatcacg aaaacaatac aaaagaaaag 3300 gaaaataatc tcaaaattaa aattaaaatc accaaacaaa tgaaaactga cttagaaagt 3360 ctgaaagcaa cagaaaaatc gagagaagaa gacaaccatc aaaagaacac atcagattca 3420 atcaccaagg aaaacaacag cgaatatcca aaagacaaaa acgaaccctt acctcatcct 3480 gaacctccaa tcccgaatcg acaaccaatc ccaatacaga actcaaccac tatatcccaa 3540 cgaccaattc gagatcgtaa accacctgta agatacggta atctcacatt atacgctgca 3600 aagaaaagtg acgactgtcc cacttacacc gaagccatgc gcggaaaaga gcgagacact 3660 ggctccaagc aatggaaaaa gaatttaact ctttgacttc acataacgta ggtgaactag 3720 tcgacgcacc accagaagcc aatatcatcg gcggtatgtg gcgtttcaaa agaaagcgag 3780 acgaacacgg aaatatcatc aagtacaaag ctagatgggt cgccttagga aaccatcaga 3840 tttggggagt agatttcgac aaaacttacg catcagttgt gcaatccgac actatgaata 3900 tgctcttctc cctatgtgca tctgaagatt gggaaatgga acaatttgat atcgccaccg 3960 ctttcctaaa aggaaagatg catatgccag tatacactcg acaagttcaa ggtttctaca 4020 accccaagga ccctaagaaa gtatggctac tccgacaatc gatctatgga actcgacaag 4080 cccacaggga attcaacaca gacctggaat caaaactacg cagcatagga tttacaccgt 4140 caaaggatga caattcccta ttcactcttc gacgagaaaa agaacttgtt cacatcccga 4200 tgcatgtaga tgatggactc accttctcaa actccaaacc cctactgaaa gaggtaaaag 4260 aaaaactgca ccaattatac gaccttgtat ggacagatga attcacccta catttaggaa 4320 tcaaaatcac tcgagattgc aaaaacagaa caatccactt atcacaagaa cactacttaa 4380 agaacgttct cgaccgcttt gacatgacgc actgcaacac cgctcccaca ccttttccca 4440 cctccgtcga actaactcca ggaaccgatg aagaaatcac tgacgctgcc cacctaccct 4500 accaacaagc catcggatgt ctgaactggg cagctgtgca tactcgtccc gacattcaat 4560 atgcagtatc aactcttgca agatactctt caaaatacac tctccgccat tggaaagtac 4620 tcaaacacct cctacgatac gtacaaggaa ccctcgacag agggatactt ttcaaaaacc 4680 acgaagttcc ttccaccgag ttacgtgcat acgctgatgc agactatgct gcatgtacag 4740 aaactcgaag atccacaacc ggatatgttt 
tcaccctagg aggatccctt gtatcatgga 4800 aaagtcggcg acaaccaaca gtagctttat ccaccactga agccgaatat atggccctcg 4860 gtgactgtgc aaagcactgc ctatggttcc gacgaatgat ttcccatctc actcagacac 4920 ctataccaac atctcccatc tctttacctc cactcagtat tttcaatgac aataacggag 4980 ctgttttcct ctcacaagaa tctgcaacta actcacgctc aaaacacatc gacataaggc 5040 atcatttcat ccgagatctt ataaactcac atcaaatttc tacgcacatg attgatacca 5100 aaacgatgcc tgccgacttt ctcacaaaga atgctccaaa agatatgttg aacaggtgtc 5160 ggtttttaat cggaaacata agctcaaacg agactcacaa cattaccaca catctacttc 5220 cgaaacccac gaaagtatcg agcaaggggg gatgt 5255 // ID Gypsy-18_LBS-I repbase; DNA; FNG; 11801 BP. XX AC ABFE01001149; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_LBS_; KW Gypsy-18_LBS-LTR; Gypsy-18_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-11801 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001149; Positions 12525 725. XX CC Positions [7379-7864] - Integrase core CC 'ACAGG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 488..8500 FT /product="Gypsy-18_LBS-I_1p" FT /translation="MATPSLSNTDNADPSDKPKTTGRSTRATSKNLQLAEE FT SARGASTPAVPSAKGKTTATSPIAEESAKGATTPAVPPAKGKATVASQTKP FT SKPGLSSQKAPAPLPLPQSGSSAPQGSVVPINPPGRNLAFLHETGQILSDA FT PSLTYLTTISSISTLGAKDATVPTEHHRIFEGIATYEEVMIYDEPLSASLA FT RYHRQFNSAIDDDPLIYFGGVDGRLPGRSLEDSQGIRDLLWRWYSPHIDGV FT WNNLPSIIDQFHRINLDPEYINDYTVDSVKYYSVAPMFLEEVAQIFLAMYQ FT VLDAFQTFLEEGDRQPFAFDPEWKMLRMLAWHTNKKDILIGYVVLQRRCEV FT CNSHIKEYFNAMRKMFLQTDEQESVSTMYSTRPSIRSQFGRGSAREEVGKL FT LLRPDYDCIPAAYRDGVTPLTEAAVAAANRRNEKIPSNFYEATRTRIWKTL FT PTDAVAAARSASNASYKVRALDQAFAKSPQMATPPANAQSNQAFSQYAPSV FT VPTTQYAPSAVPTIATQSVKVPMPASISVIAHRRHAISTMPGMGRIELPTN FT DTAVTALVANTSVEQRPRPVVKPAQTVLFAGRQTPPHLATKAAEPPPAPAS FT SAPSRWTSYQFKDNRPIPSPSDEDLRSPGDPPPGGDPPDDDAGGNGPGGGG FT GPNSGPPGGPPSGPPGGGGDPPPSNPSGWVSQPYSPFPYSDEWQLNHKLNS FT SVVPAWDGHGSTAIEYISSMSFLARMSDKMKEGVAAMAPYKWSGRAKSWWE FT CLPVADQIYFRQDWEYMMVGLRIQFLDLRWSRERLYEFEAMFFRQKGHTTE FT DPIDFIQRRVRHHMFLHPEDLDGPGAVDRILRNQPIEWEKDLNDVTCPSIF FT ALQSAASRLATSLINDWQSSEIRARNTAVGQGDKSSIPGKGRYGRSAHLVS FT ADTPAEPPALIQDEGSDSSTDEGAEAFAATFRKRNEKSGTAGSSKWPKGKV FT MNGVAFPRDDSIKSARPPNGSCFICDSPHHFARDCSHYGKWDSLRSANLID FT VDLDHEVEAAADREYLAMLAEIRAEASSSAYAAKLSKISKEAFCMSAEMTG FT AKALHAKATFMNDNRNIRRRLASEPKGKDKEDLDPLNPGIAPLSRAKRRLR FT DSAKPPSYSMAPPPYTEDSKKEILIAPKGRSFPEGLGSLGTRALHTKAYVS FT STDGEVVQARLDSGADITLMSEDFWSSIPGLPKPKDGLRMKLYHLTGAAKV FT LGYIRTTLFMPADNGTFISFDMEAYVVRNMRVPLLLGEDFQTTYELGVTRY FT ATGHCNVHVGRTGWVIKASSAQRVDLGFEIRQAQTTQSFVRAKAARRAKWL FT KPIQGDPPVYADEDSIIQPHSVKGVRVTAPFAGREDWLVEKVIIGSEDGDI FT LAAPTTWINASVPVLPIANPSAFPRIIRKGELVGYLLDPSCALDSPSDDDT FT LARYVASADALRKVITGSLKAQDLADAQPKFHGPDDKLEQDDAWGPKTTAV FT PEDPVQGDILDAVNLGPDVPEEFQAPLADILRKNVAAFGINGRLGKVAAKV FT TIPLKENSAPVSVPMYAASPAKREVIDKQMKTWFESDVIEPSVSPWGFPVV FT VVYRNGKARLAVDYRKLNAQTIPDEFPIPRQTEIVQALSGAQVLSTFDALA FT GFTQLEMADEEKEKTAFRCHLGLWQFKRMPFGLRNGPSIFQRLMQGILSPY FT LWLFALVYIDDIVVYSKSWEEHLAHLDTVLGAIAAAGITLSPSKCFIGYSS FT 
ILLLGQKVSRLGLSTHQEKVQAIMEIARPTSVSDLQKFLGMVVYFSTYIPF FT YSFIAKPLFELLKKGSKWDWRAEHELAFRQAKEALSKAPVLGHPIQGSPYR FT LYTDASDVALGASLQQVQPIQVRDLKGTPTYDKLAKAWAEKKPIPHLFPVL FT TKDVTEQPEDDAWGASLDDTTVHVERVIAYWSRTFKAAERNYSATEREALG FT AKEALVKFQPFIEGETVILITDHAALQWARVYENANRRLAAWGAVFAAYPG FT LRIVHRAGRIHSNVDPLSRLPRIPPHNSPIVDEIAAIAPDDDKQGRAQAVE FT DCIHRATAPRAAFTVSCWEDVVERRAYANRYEPSESPAQREADRLAAAEAR FT LRRANKRNGTQSTESETTEPVVEEDVPMTNNAPAVSEPLSPVLNESVNPVE FT DVLPFPLDDHWTYPVGVKPSPLELDDEWLNKPHLLVAMNPSVVTTFAQGYP FT LDKYFVSRYVEDAPNPKTVLTPSHFRKSKNGLLYFIDADWSARLCVPESRI FT HFVLKWIHDSPYESAHAGPRRFLSRLRELFFWPTMHKDVMDYTKSCDTCQK FT IKTDRRAQAGGLCPAHIPARPFATVSLDLITGLPPSGEEGYTAIFAIVDKL FT TKFAIFLPTHDTLTQEQFADLFVDRVANVYGLPERIIADRDKRWSTAFWKS FT VVSHYGGVMALSSSHHPQTDGQTEILNATIEQMLRAYVATDRTSWAKWLSV FT LNYSYNSSVHSSTGYAPHFLLMGYKPRTSTVGLTPEGDPVARPFIASQTAE FT EFIGELEFHRAAARDALVLAQERQAASYNKGRRPAEILQKGDLALINPHTL FT KLVDVKGTGRKLVQRTISPFEVLEQINPQVYRLKLPPNYPMHPVFNLEHLK FT KYHPSPPRFDDRESLPSTRELVAYPEYEVESIVGHRLTTQKRGNRRMYLVH FT WKGHEPVDDTWVSEYELRNAPEITRAYLRANNL" FT CDS 8745..11765 FT /product="Gypsy-18_LBS-I_2p" FT /translation="MHSFNSNRRPARDSRVTSTSRRSSATGYLAKPCVDSR FT LLARLPIFEIQSHRIVQIGEEAWELWSPNSLQIPFLPGRVEDFMEISIASR FT RPDRRSDGHLGRFDACVSPQYSTGHSPWAPHVRRVPSSTLACPEFANILDV FT TDMRGSVNPRYVEELLQRNSAFESRLTELMAISQNNSAIAFEDEWRRSRPF FT ISQEDIEDLRSIRSFDRALDDVTHLQRKFKMKAAWIDWMEQLQSTPSWSRS FT SNRPPDMADDSYMGTWINDAWEEDVEWFLAHRIPCFVVHRLMGEELNLLFD FT AGHGGRVTSFITGTPIENLSHPHNRIEAFLRRQNVIITHSDDDDNIGKEEP FT SEPSDPAAARAASFSLSHRAWSRGGPAPVAYSVTAHVPSDPGTSTSASTRP FT SPLLEIDFAGWEAEPLDLVEIDPSRVSWIRPPPVVRPLERQKWEKWAEKNV FT NGQLFVSKVGKAYSNMDSASYFDRARNREVILLEEVPMLPGIVSEPEVFGF FT PCPRHLQFVEEPQNKAPRPAKSSYWLYTTMNPISGNVGLQPMYPDASSLPL FT KDEFRTGGSDRPPSPPSSPGHPALPDPLPFVSFDSPPSRGIPDSERPISPT FT DSLLASPRTVNWDSEFNEPMGPQIPAMSTEDVEMGTVDLGDVPSPSSTPLS FT SLPHQLSREITPNQRSTPRRSPPRAPPRSSAPKSAPRTQRQRSPPRPLHQS FT PPRRGPRKHARTRSRTPSPPRRRGDSYRPRAPAGSLGGDRHNPWPPAPAAN FT AWPTPSGTSNPWESTPTPLSWSTPNPLPSGPTPPSWGPSPGYMPWGFGPPT FT 
QNPWSIHYPGYYPSEQIPLPNWPPMSYPPMQPHHCPSCNNCASCGRPSLPA FT ASQDPHQPPPTTSSSSLLGRLRSPSSTDRPHTTAVPLAQRLRDPAPLIDRV FT APPASLSQRLQSPSPEFVQGSSSRPFQSNSTRSLHQRLANKEEEGEPSDRE FT ENVPTSRARRSRRAGRKQKEEDEQRIREGKQPAPGKGKRRDRRDRGNRGGF FT A" XX SQ Sequence 11801 BP; 2829 A; 3385 C; 2921 G; 2666 T; 0 other; ttggtggggg caccgcggac ttcacatccg atgttgaagt ctcgctccgg gagagctcgg 60 aggctgttga tcacgtcata gtaagtatga ctcatcactt acgagtcaca tcgttcctcc 120 atccctctcc ctcgccgtac cgtccactac ccactccttt ccactcgttg ctgagcgcag 180 tccctcctgc gttcgcgcga gtatgccctg gagcgctgcc acagcgttca cggttattcg 240 actcacctta gtaattccga ttatccgaat tcagttagtc gtctagccca ccgctagatc 300 cttcacccat cgccctcatt cattcagtcc acccctctcc caaccgactc tacccgattt 360 agctatgcca cacacaccgg atgaaggatc ttcctcgact tagtcggcga agttttacga 420 tcagacaggg cgccgccagt gtatataaga aatagactaa agtttcgtct ccttcagcta 480 cactaccatg gctacccctt ccttatccaa caccgacaac gccgacccct ctgacaaacc 540 aaaaactacc ggtcgctcaa ccagagctac ctccaaaaac cttcaactcg cggaagagag 600 cgctagaggt gctagcaccc cagctgttcc ttctgctaag ggaaaaacga cagctacgtc 660 gccaattgcg gaagagagtg cgaagggtgc tactaccccc gctgttcctc ccgcgaaagg 720 gaaagcgaca gtcgcgtcac aaaccaaacc atctaagcca gggttgtctt cacagaaagc 780 tccagctcca ctaccccttc ctcagtcggg ttccagcgcg ccgcaaggca gcgttgtgcc 840 gatcaatcct cctggccgaa acctagcgtt tttgcatgag acaggacaaa tcctttcgga 900 tgcgccatcg ctcacttact tgacgactat atcaagcatc tctacgctag gtgccaagga 960 tgcaacagtg cctacggagc atcatcgtat cttcgaaggc atcgctacct acgaggaagt 1020 tatgatctat gacgagccct tgtcggcatc cctggcccgt taccatcgtc agttcaactc 1080 agccattgac gatgacccct tgatctactt tgggggtgtt gacgggagac ttcccgggag 1140 atccttagaa gactctcagg gaattagaga tctcctatgg cgctggtact cgcctcacat 1200 tgacggcgtt tggaacaatc tcccatcgat aatcgaccag ttccatagga tcaatcttga 1260 tcccgaatat atcaacgact acactgtgga ctcggtgaag tattattcgg tggcgcccat 1320 gttcctcgaa gaagtggccc agatcttctt agccatgtat caagtactcg acgcctttca 1380 aaccttcttg gaggagggcg atcgtcaacc attcgcattc gatccagaat ggaagatgct 1440 caggatgcta gcctggcata 
ctaacaaaaa ggacatctta attggttacg tagtcttaca 1500 acgacgatgc gaagtctgca actcccacat caaggaatat ttcaacgcga tgcgaaagat 1560 gttcctgcaa acggatgaac aagagtccgt cagcaccatg tactccactc gcccttcgat 1620 tcgttcccag tttggacgag ggtctgcgcg tgaagaagtc ggaaaactcc tcctacgacc 1680 ggattacgat tgtattcccg ctgcataccg agatggagtc actcctctca cggaagcagc 1740 tgtagctgct gccaacagaa ggaatgagaa gattccctca aacttttacg aggccactcg 1800 aactcgtata tggaaaactc ttcctaccga tgctgtcgct gctgcgcgtt ctgccagcaa 1860 tgccagctac aaagtcagag cgctggatca agcgttcgct aaatcaccgc agatggctac 1920 tccccctgcc aacgcccagt ccaatcaagc gttttctcaa tatgctcctt cggttgttcc 1980 taccactcaa tatgcgccat cagcggttcc aactatcgct acccagagtg ttaaagtacc 2040 gatgccggct tccatttctg tcatcgccca tcgtcgtcac gccatctcca ccatgcctgg 2100 catggggcga attgagctgc caacaaatga taccgctgtt acagcgctcg tggccaacac 2160 gagcgttgag cagcgtcctc ggccggttgt gaaacctgcc caaacagtat tattcgctgg 2220 taggcagacc ccccctcact tagcgaccaa agctgcagaa ccgcctcctg cccctgcgag 2280 ctccgcgcct tctcgttgga caagctacca gttcaaggat aaccgtccca ttccgtcgcc 2340 tagcgatgag gacttgagat cccctggcga tcctcctcca ggaggtgacc ccccggacga 2400 tgatgctgga ggcaatggtc caggcggtgg aggcggtcct aatagcggtc cccctggagg 2460 ccctcctagt ggtcctcctg gaggaggcgg ggatcctccc cctagcaatc cgtcggggtg 2520 ggtcagtcaa ccgtactcgc cctttcctta ctcggatgag tggcagctta accacaagct 2580 caactcatcc gttgtaccag cttgggacgg acacggttct actgccatcg agtacatcag 2640 ctccatgtcc ttcctcgctc gtatgagcga caaaatgaag gaaggtgtcg cagctatggc 2700 gccgtataaa tggtctggcc gtgcaaaaag ctggtgggaa tgcctacccg tagcggatca 2760 gatctatttc cgtcaagatt gggaatacat gatggtgggc ctccggattc agttccttga 2820 tttacgttgg tcgcgtgaac ggctctatga gttcgaggct atgttcttcc gtcagaaggg 2880 acacactaca gaagatccca tcgacttcat acagcgaaga gtccgccatc acatgttcct 2940 tcatcccgaa gatctcgatg gtcctggcgc agtggaccgt atcttgcgca accaacctat 3000 tgaatgggaa aaggatctta atgacgttac ttgtccgtcc atattcgcgt tacagagcgc 3060 ggctagtcgt ctcgctacaa gcctaattaa tgactggcaa agtagcgaga tccgcgctcg 3120 taacacagcg gttggccaag gagacaagtc 
gtctatccca ggcaaggggc gttacgggcg 3180 atcagcgcac ctcgtctcgg cggatactcc tgctgagcct ccagccttaa ttcaggatga 3240 aggcagcgac tcgagtacgg acgaaggcgc cgaagccttt gcggcgacct tccgtaaacg 3300 taacgagaag agcggcacgg ctggttcgtc taagtggccg aaaggcaaag tcatgaacgg 3360 cgttgccttc cctagggacg atagtattaa aagtgctaga ccccctaatg gctcctgttt 3420 tatctgtgat agtcctcacc acttcgcaag agactgttca cactacggga aatgggacag 3480 ccttcgtagt gctaatttga tagacgtaga tttagaccac gaagtagaag ccgcagcaga 3540 ccgagagtat ttagctatgc tcgctgaaat cagggccgag gcctcttctt ccgcttacgc 3600 cgctaaacta agtaaaatct ccaaggaagc gttctgcatg agcgctgaga tgacgggcgc 3660 caaagcttta cacgctaaag ccacatttat gaatgataat cgaaacattc gccgacgttt 3720 ggccagcgaa cctaagggca aagataagga agacctggac cccttgaacc caggcatagc 3780 gccattatcg agagctaagc gtcgcctcag ggacagcgct aagcctccat cttactccat 3840 ggcacctccc ccgtacactg aagattcgaa aaaagagatc ctgatagcgc ccaagggtcg 3900 ttccttcccc gaagggttag gttccttggg aacacgggct cttcatacca aggcgtacgt 3960 atcttctaca gacggggagg tcgttcaggc gcgattggac tcaggtgcgg acattacctt 4020 aatgtcagaa gatttctggt cctctatccc ggggctaccg aaacccaagg atggattacg 4080 catgaaatta tatcacctca cgggcgcagc gaaagtccta ggttacatcc gaactacgct 4140 gtttatgcct gcagacaacg gcaccttcat tagtttcgat atggaagctt atgtagtacg 4200 caacatgcgt gtacctcttc tactaggaga ggactttcag actacatacg agctaggggt 4260 tactcggtac gccactggtc actgtaatgt gcacgtagga cggacaggct gggtgatcaa 4320 agcgtcatcc gctcagcgag ttgatctcgg tttcgagatt aggcaagctc agacgacaca 4380 gtcgttcgta cgcgctaagg ccgctcgtcg ggccaagtgg cttaaaccta tacagggtga 4440 ccccccagtc tatgcggacg aggatagcat catccagcca cactcagtca agggtgtccg 4500 agtaactgca cccttcgcag gacgtgagga ctggctcgta gaaaaggtca tcattggatc 4560 tgaggacggg gatattctag cagcccccac tacgtggatt aatgcgtcag tacccgtcct 4620 acccatagcc aatcctagcg cttttccgag gataatccga aaaggcgagt tagtagggta 4680 cttgctcgat ccgagctgtg ctttagacag tccctcagat gacgacacgc tagcgcgcta 4740 cgtcgcgtcg gctgatgctt tacgaaaagt catcactggg tccctgaagg cccaagatct 4800 cgcggacgct cagcctaagt ttcacggtcc tgatgataag 
ctggaacagg atgacgcttg 4860 gggccctaag actacagctg tgccagaaga tccagtacaa ggcgacattc tggatgccgt 4920 gaacctcggc cccgacgtgc cagaggaatt tcaagcgcct ttagcggaca ttttgcgtaa 4980 gaacgttgca gccttcggta ttaatggccg cctaggcaaa gtagcagcta aggttacgat 5040 cccgctgaaa gagaattcgg cgcccgtctc tgtgcctatg tacgctgctt cgcctgcgaa 5100 acgcgaagta attgataagc agatgaaaac ctggttcgaa tccgacgtca ttgagccatc 5160 agtcagtcct tggggcttcc ctgtagtcgt ggtgtaccgc aacggtaaag cacgactggc 5220 ggtggactac aggaaactga acgctcaaac gataccggac gaatttccta tcccgcgtca 5280 gacagagatc gttcaagcgc tgtcgggtgc tcaggttctt tcgaccttcg atgcactggc 5340 cggattcacg cagttagaaa tggcagacga agagaaggag aaaacagcct ttcgctgtca 5400 tttaggactt tggcagttca aaaggatgcc gttcggactg cgcaacggcc cttccatttt 5460 tcagcgtttg atgcagggga tcttgtcccc ttacttatgg cttttcgcgc tggtctacat 5520 tgatgatatt gtcgtgtatt ctaaatcgtg ggaggaacac ttagctcacc tggacacggt 5580 cctaggtgct atagctgcag caggaatcac gttatcccct tcgaagtgtt tcatagggta 5640 ttcctcaatt ctcctgctgg gtcagaaagt ttcacggttg ggattatcta cgcaccagga 5700 gaaggttcag gctatcatgg aaatcgcccg cccgacctcc gtgtccgacc tgcagaaatt 5760 tttgggcatg gtagtatatt tttccaccta tatccccttt tattccttta tcgcgaaacc 5820 cttgttcgag ctgctcaaaa agggctcgaa gtgggactgg cgggctgaac acgagcttgc 5880 ttttcgacaa gctaaggaag cgttatcaaa agcacctgtt cttggccacc cgattcaagg 5940 tagtccctat cggctgtaca cggatgcatc ggacgttgca ctaggcgcga gcttgcaaca 6000 ggtacagcca attcaagtca gagacttgaa ggggactcct acgtacgata aactagcaaa 6060 ggcttgggct gagaagaagc ccattcccca ccttttcccc gttttgacca aggacgtcac 6120 agagcagcca gaggacgatg catggggtgc gtctctagac gacactacgg tccatgtaga 6180 acgtgtcatc gcatactgga gccggacttt caaggctgct gaacggaact atagcgctac 6240 cgaaagagaa gcgttgggtg caaaagaggc cttagtcaaa tttcaaccct ttatagaggg 6300 tgagaccgtg atcctgatca cggatcatgc agcattacag tgggctagag tttacgagaa 6360 cgccaatagg agattggcgg cgtggggggc agtgttcgca gcatacccag gactccgaat 6420 agtacacaga gctggcagga tccattctaa cgtggaccca ctatccagac tccctcgcat 6480 accgccgcac aattctccta ttgtcgatga aatcgcagct atagcgcctg 
atgatgacaa 6540 acagggtaga gcacaagcag tggaagattg tattcaccgc gcgacagccc cccgagcggc 6600 gttcactgtg tcctgttggg aggatgtggt agagcgccgg gcttacgcca accgttatga 6660 accatccgag agccctgctc agagagaagc ggaccgttta gccgcggcag aagcccggct 6720 acgcagggct aacaagcgta atgggacaca aagtacggaa tcggaaacca ctgaaccagt 6780 agtggaggaa gacgtaccca tgacaaataa cgctcctgcg gtcagcgaac cactctcccc 6840 agtcttgaat gagtcggtta accccgtaga agacgtccta ccattccctc tggatgatca 6900 ttggacgtat ccggtcggcg taaagcctag tcccctagag ctggatgacg aatggttgaa 6960 caagcctcat ctgttggtgg ccatgaaccc ctcggtagtc accacgtttg cacaaggcta 7020 cccgttagac aaatacttcg tctcaagata cgtcgaggat gcccctaacc ccaagaccgt 7080 tcttacccca tcgcatttcc gtaaaagcaa gaacggttta ctatacttca ttgacgcgga 7140 ttggtcagcg cgtctgtgcg ttccggagtc tagaattcat ttcgtgttga aatggattca 7200 tgactccccc tacgagagcg cacacgcagg cccccggcgt ttcctgagta gactacgtga 7260 gctctttttc tggccaacaa tgcataagga tgtcatggat tataccaaat cgtgcgatac 7320 gtgtcagaag attaaaacgg atcgacgcgc tcaggctgga gggctatgtc ctgcacatat 7380 cccagccagg ccctttgcta cagtatcttt agacctcata acgggtctcc cgccctctgg 7440 cgaagaaggc tacacggcca tatttgctat agtggacaag ttaactaagt tcgccatttt 7500 cttaccgacc cacgacacat tgacccaaga acaattcgcg gatttgttcg tggaccgcgt 7560 ggctaatgtc tatgggttac cagagcgcat catagccgac cgcgataagc gttggtccac 7620 tgcgttctgg aagtccgtag tatctcacta cggaggagtc atggcgttgt catcttctca 7680 tcacccccag acggacggtc agaccgagat tcttaacgcc accatcgaac aaatgctgag 7740 ggcgtatgtt gccacggatc gtacctcttg ggctaagtgg ctgagtgtat tgaattattc 7800 gtataatagt tcggtacact cttccactgg atacgcgcct cacttcttat tgatgggtta 7860 caagccacga acttccacag tgggcttgac ccctgaagga gaccctgtcg cgcgcccgtt 7920 catcgcgagt caaacggctg aggaattcat tggtgaatta gaattccatc gagccgccgc 7980 tagggatgcg ttagtattag cacaagaacg tcaggcagct tcatacaaca agggtagacg 8040 tcctgcagag atcctgcaaa agggggatct agctttgatt aaccctcaca ccctcaagct 8100 agtggatgtc aaaggtaccg gccgcaaact cgtacagagg accatcagtc ctttcgaagt 8160 acttgaacaa ataaaccccc aagtatatcg attgaaactg ccgccgaact accctatgca 
8220 cccagtattc aatctcgaac acctcaagaa gtatcaccct tctcccccga ggtttgatga 8280 tcgagagagc ctacctagca ccagagaact ggtcgcctac cctgaatacg aagtagaatc 8340 tatcgtggga catcgtctta cgacccagaa gcggggcaat cgacgaatgt acttagtgca 8400 ttggaaagga catgaacccg tggacgacac ttgggtatcc gaatacgagt tgcgcaacgc 8460 ccctgaaatc accagagcct atttgcgagc caataacctc taaggaacga gagagatctc 8520 ggatctcgga tattaattcc cgcttatttg ctggtattga tgatcgaggg ttgataacaa 8580 ttttcatctc cgacatcctt ctttcttcct tacccccatt atcttctatc gtcgcgcggt 8640 ccccactgcg ttaagactca ttcgtacacc tcgcgcggtt cacactgcgt tgaaactatt 8700 cacccactct tttaattact tatagctact ttctaattac taaaatgcac tcgttcaatt 8760 cgaaccgtcg tcctgccaga gactcacgag tcacttccac gtctaggaga tccagcgcaa 8820 ccggctacct agctaagcca tgcgttgact ccagactgct tgcccggcta cccatcttcg 8880 agatccagtc gcatcgcatc gtacaaatag gcgaagaggc ttgggagttg tggtcgccaa 8940 attcactgca aattcctttc ctgcctggac gggtggaaga cttcatggag atcagcatag 9000 ccagccgtcg tcctgaccgc agaagtgacg ggcatctggg ccgattcgat gcctgcgttt 9060 cgcctcaata ctccactggg cattcgccct gggctcccca cgtacgacga gttccttcca 9120 gcacgcttgc gtgcccagag tttgccaaca tcttggatgt cacggatatg cgtggttctg 9180 tcaaccctcg ttatgtggag gaacttctac aacggaactc ggcgttcgaa agtcgcctca 9240 ctgaactcat ggctatcagc cagaacaatt cagccattgc attcgaagat gagtggcgtc 9300 gaagccgtcc cttcatcagt caggaggaca tagaggatct tcggtccata cgctcattcg 9360 accgggcact agacgatgtc acccatctac aacgtaaatt caaaatgaaa gcggcgtgga 9420 tagactggat ggagcagcta caatccactc cgtcttggtc tcgctcttcc aaccgtcctc 9480 ccgacatggc tgatgactca tacatgggta cttggatcaa cgacgcgtgg gaagaggacg 9540 tcgaatggtt cctagcgcac cgcattcctt gtttcgttgt gcacagatta atgggtgaag 9600 agttaaacct cttattcgac gcggggcatg gagggcgcgt tacttccttc atcacgggta 9660 ccccaattga gaatctgagc catccgcata atcgaatcga ggccttttta cgtcggcaaa 9720 acgtcatcat cacccactct gacgacgacg acaatatagg caaggaggag ccttcagagc 9780 cttcggatcc tgctgcggcc cgagcagctt cattctcttt atctcataga gcatggtcaa 9840 ggggaggacc cgctccggtc gcctacagcg tcaccgctca cgtcccctct gaccctggca 9900 
cctctaccag tgcgtcgact aggccttccc ccttgctcga aatcgatttc gccggatggg 9960 aagctgagcc cctggatcta gtagagatag acccgtcgcg cgtttcgtgg atccgtcccc 10020 ctccagtcgt tcgaccccta gagaggcaga aatgggaaaa atgggcggag aagaacgtaa 10080 atggtcagct gttcgtatcg aaggttggga aggcctactc aaacatggac agtgcctcct 10140 acttcgatcg tgcacgtaac cgggaagtta ttctattaga agaggtacct atgctcccag 10200 gtattgtctc agagcccgaa gtcttcggat tcccgtgtcc taggcacctc cagttcgtgg 10260 aggagcctca gaacaaggcg ccacgacccg cgaaatcatc atactggctt tatacgacaa 10320 tgaaccccat ttccggcaat gtaggcttgc agcccatgta cccagatgca tcatccctcc 10380 ctctcaagga cgagttccga acgggaggat ctgatcgacc gccttcacct ccttccagcc 10440 ctggccaccc agcactaccg gatcccttgc cattcgtctc tttcgattcc cctccttcca 10500 gagggattcc tgacagcgaa aggccgatct cgcccaccga ttccttatta gctagtccga 10560 gaactgtgaa ctgggattca gagttcaatg aacctatggg cccacaaatc ccggctatgt 10620 caacggagga cgttgaaatg ggtacggtag atttaggcga tgtgccttcc ccttcgtcta 10680 ctcccctttc gtcactccct caccagctgt ccagggagat tactccgaac cagcgcagca 10740 caccgaggcg ctcccctccc agagcaccgc caaggtcttc agctcccaag tccgccccta 10800 gaactcagcg ccagaggagt cccccaaggc cattgcatca gagcccacct cgccgcggac 10860 ctcgtaaaca tgctcgtact cgctctcgca ccccgtcccc tcccagaagg agaggagact 10920 cttaccggcc acgagctcct gctggcagtc tggggggcga tcgtcataat ccctggccac 10980 ccgcgcccgc tgctaatgct tggcccaccc ctagcggaac ttctaaccct tgggagtcga 11040 cccccacccc tctttcatgg tcgacaccca atcctcttcc ctcgggacca actccaccat 11100 cttggggacc ttcaccgggc tacatgcctt ggggtttcgg ccctccaact caaaatccgt 11160 ggtcaatcca ttatccgggt tactacccct cggagcaaat cccgttgccc aactggccgc 11220 ccatgagcta tcccccaatg cagcctcatc attgtccgtc gtgcaacaac tgcgcctctt 11280 gcggccgccc ctcattgcca gcagcaagtc aggacccaca tcagcctcct ccaactacgt 11340 ccagttcatc tttactggga cgcttgcgct ctccgtctag cacggatcga cctcatacca 11400 ctgccgttcc cttggcacag cgcctacgtg atccggctcc cttaatcgat cgtgtggctc 11460 caccagcttc cttatcacaa cgattacagt cgcctagccc ggagttcgtg caagggtcct 11520 cttcgaggcc cttccagagt aactcgacaa ggtcattaca ccaacgactg 
gcgaataagg 11580 aggaggaagg agagccttcg gatcgggagg agaacgtgcc aacgagccgt gcacgacgaa 11640 gtcgcagagc aggacggaag cagaaggaag aagacgaaca acggatcaga gaagggaagc 11700 agccagcgcc gggcaaaggc aagaggaggg accgtcggga tagaggtaac cgaggaggtt 11760 tcgcatgagg atgtaatgcg acactgcgcg caggggggca t 11801 // ID Gypsy-92_MLP-I repbase; DNA; FNG; 5706 BP. XX AC AECX01000330; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-92_MLP_; KW Gypsy-92_MLP-LTR; Gypsy-92_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5706 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000330; Positions 168851 174556. XX CC Positions [4504-4983] - Integrase core CC 'TGGGG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(1555..4317,4321..5610) FT /product="Gypsy-92_MLP-I_1p" FT /translation="MHDNRIITMIQLYDPSKDTTINARALIDSGATHEAVS FT SDFVSNNGLSSYPLPEPRSVTGFSGHEASVTHTGDYIVNNQNQETTFIVTK FT LRDKYDVILGIPWIKENFRLIDWERSTLKTDPMKIATVETVSSRLKNTSTV FT QGMEPKRHARQLDEGVSIIDTQKPPQCEFDNHPCLDLNHDSSKLVSSPIVQ FT LNHQEHEIDPKRESATAKTVSSSLPKALEGPKMEPIRQAKSCDEGACIVDM FT LKPPQCEFEGLSSIKFCEQMKKKASLQYFPKPQIPIRQNLHRSQQSGIMNP FT RRSYIQKSLLYALATAKTSWNLSAKLAAESNKNTPEKTAAELVPKAYHEYL FT AMFEKGNSNVLPPHRPYDFRVDLVPGATPQAGRVIPLSPKENKALNEMLNK FT GLANGTIRRTTSPWAAPVLFTGKKDGNLRPCFDYRRLNALTVKNKYPLPLT FT MELIDSLLNADQFTSLHMRNGYNNLRVREGDEEKLAFICKAGQFEPLTMPF FT GPTGAPGFFQYFIQDILKSRIGRDVAAYQDDILIYTQPGVNHQEVVKEVLE FT ILKKQNVWLKPEKCKFSQSEIAYLGLIISKNQIKMDKTKVQAVQDWPIPRN FT LSEVQTFVGFSNFYRRFISQFSKIARPLHALSQKDTPFEWTDQCQRAFETL FT KTSFTSAPVLKIASPYHPFVLECDCSDFALGAVLSQVSEDDGELHPVAYLS FT RSLIKAERNYEIFDKELLAVVASFKEWRHYLEGNPNRLSVIVYTDHKNLQS FT LMTTKELTRRQARWAEILGSFDFEIRFRPGKQSTKPDALSRRPDLAPSKED FT KLSFGQLIKPENLPEDAFIDELEVIEKWFDDERQVEEPAIDEIELEDENDN FT EEVWNDLKLIEEVKIRSRNDSRITETMKLCEEMPNSKLLKDYKCINRVLYF FT KNIIEVPNDAELKLMIISRHDSRLAGHPGRMRTLALVKRAFHWPSLKSYVN FT QYVDGCHSCQRIKARTTKPFGSLQPLPIPKGPWTDICYDLITDLPQSNGHD FT SILTVVDCLTKMAHFIACNKTMNSDQLADLMISNVWKLHGTPETITSDRGS FT VFISRITRDINRKLGIHTQASTAYHPQTDGQSEITNKAVEQYIRHFTSYKQ FT DDWSMLLPTAEFAYNNNEHVAIGMSPFRANYGYDVGFAGVASSGQQIPLVE FT DRFKQLREVQQELKDSLLITQENMKNQFDKKVLETPKWKIGMKVWLSSKHI FT STTRPTTKFSHQWLGPFKISSKISTNAYKLELPMSMKKVHNVFHVNLLREF FT RKGTIKGQEATQPPPIVIEEEEEYEVQEVLDKRRRGKGKMEYLISWKGYDS FT SFDTWEPEENLKNASELVKQFNSKYPNADESYRKAKGYK" XX SQ Sequence 5706 BP; 1985 A; 1163 C; 1239 G; 1319 T; 0 other; tattgcaacg tctaatcacc tataggatac ggtcaaaaag aagcaagaag aaaagaagtt 60 caaaaacgaa gaaatcaaac aaaagaaacc ggtaaaattg attaggatca aatctataag 120 caaggttgga aaatctcaaa gttaaaataa atttaaagtt agaaatcata aaagttcgaa 180 caaggaagaa gatcaaaata gtacggaaat taagatttaa gaagtgaaag ttaaaattag 240 aagtagaaga ataaactgta gtggtattac ttagatctca ccctcgtaac tccgcaatct 300 agtcaacacg 
ccggaattct cagtacccac aagcccaact aattcggatt caactgaatc 360 gaagtctaat cgacacaaga tctcagaagc acgaactgaa ttagatatgg atgaaatatt 420 gaaacaactt gcagatttga atacgagact ttccgaagag acgagacttt gacaacaaga 480 aacaactctc cgccagcaag ctgaacttcg aataaagcag ctggagcagg acaagctgtc 540 ggcccaaact caacaagtgc cgatgagtgg cattgatgtt cttccatcac aagcccaagc 600 tccgactcag atccctgtcg taacgcaatc agtaagacca ctgaaggtgg caacacctga 660 taaatttgat ggttcgaaag gccagaaggc cgagatcttt ttgaaccaac taggcctgta 720 tatgaggcta aacggcaatc agttcgcgga tgaacaatcc agagttgcgt ttgctctctc 780 gtacaccacc ggcaaagcca gtatttgggg tcaaagcatc atagaccaag tccttaacag 840 tgaaaccgct cacctggtca cttgggacaa attcgtcgat tcctttaaag caaccttctt 900 cgacaccgag cgtaccgcga aggctgagaa ggagttgaga gctctcaagc aaaccagaac 960 tgtttccgat tactggatca agttctctga gctcttgtta gttgtcaaat ggcctcaaca 1020 aatcctagtc tctcaattcg aacaaggatt gaaaaccgaa gtcttgattt atatgataag 1080 agaaactttc actgaagtcg acgagatggc caagtttgca ataaaacttg acaacaaaat 1140 tcacaagcga tcgacagaga ctagtcagag ttcggcaatg acccaagcca ctcctgtgca 1200 tgatcccgat gccatggact gtttggcgta tcgattggac atatcagctg aggagtataa 1260 aagacgaggt gcagatagag cgtgttatag ctgtggaagg aaagatcaca tgatagcgca 1320 atgcccgaca tctaaaagaa gaaaaggagg ttatttttca aaatttaatg atgctgagaa 1380 acgtattaat gctagattag ctgaataaga tagcaagata gaaaactcaa atgtcagtag 1440 tgaagttagt agagcagaca agtcaaaaaa tggagatgct cgggagtgac ggttgtgcct 1500 cacccgagca agttggagat aggagttgaa aagggtgatc tcagtagtct tgaaatgcat 1560 gataatcgaa ttattaccat gatccaactc tacgaccctt ccaaagacac aaccattaac 1620 gcgcgagccc tcattgacag cggggctacg catgaagctg taagttcgga ctttgtgagc 1680 aacaacggac ttagcagtta tccattgcct gaacccagaa gtgtgactgg attcagcggt 1740 catgaagcat cagtcacaca cactggagat tacattgtca ataatcagaa tcaagaaaca 1800 acattcattg tcacaaaact tcgagacaag tacgatgtaa tcttaggaat cccgtggatc 1860 aaggaaaact ttcgattgat tgattgggaa cgaagtacac tcaagacgga ccccatgaag 1920 attgccactg ttgaaacagt gtcgtcaagg ctgaaaaaca cctcgacggt ccaaggaatg 1980 gagcccaaga ggcacgctag gcaacttgac 
gagggggtga gtatcataga tacacaaaag 2040 cccccgcaat gtgagttcga taatcacccc tgtctagatc ttaaccatga cagtagcaag 2100 ctggtttcct ctccaattgt tcaattaaac catcaggaac acgagattga cccaaaacga 2160 gaaagtgcta ctgccaagac agtatcatca agcctgccaa aagccttaga aggtccgaag 2220 atggagccta taaggcaagc taagagctgt gatgaggggg cgtgtatcgt tgatatgctt 2280 aagcccccgc aatgtgagtt cgaaggcctt agttctatca agttttgtga acagatgaag 2340 aagaaagcat ctttgcaata tttcccaaaa ccacagatac caatccggca gaaccttcac 2400 cgtagccaac agtcaggaat catgaacccg agaagatctt acatccaaaa gtcactattg 2460 tacgccttag caactgcaaa aacgtcttgg aatctctcag caaaactagc cgcagaaagt 2520 aacaagaaca ccccagaaaa gacggcagca gaattagtcc caaaagcgta ccatgaatac 2580 cttgctatgt ttgaaaaagg aaattcaaat gtcctacccc cccatcgacc atatgatttc 2640 agagttgacc tagttccagg agccactcct caggctggta gagtaattcc cctatcaccc 2700 aaagaaaaca aagctctcaa tgaaatgtta aacaaaggac tggctaatgg aaccatcaga 2760 cgaacaacct ctccgtgggc ggccccggtt cttttcacgg gaaagaaaga tggaaatcta 2820 cgcccatgct ttgactatag aagattgaac gcgcttacgg ttaagaataa gtatcctctt 2880 ccattgacaa tggaattaat tgacagctta ttgaacgctg atcaattcac cagtctccat 2940 atgagaaatg gatacaacaa tctcagggtt cgtgaaggtg atgaagaaaa actagcattt 3000 atatgtaagg caggacaatt cgaaccgtta actatgccct tcggcccaac tggagcaccc 3060 ggattttttc agtacttcat tcaggatata ttaaagtctc gcataggaag agatgtagcg 3120 gcatatcaag atgatatact catatatact caaccaggag tgaaccatca agaagttgtg 3180 aaggaagtat tagaaatcct gaagaagcaa aacgtttggt taaagcctga gaagtgcaaa 3240 ttctcacaat ctgagatagc gtaccttgga ctgattattt cgaagaatca aatcaaaatg 3300 gacaagacta aagttcaagc agttcaagac tggccaatcc cacgtaacct atctgaagtc 3360 caaacatttg taggcttttc aaacttctac cggagattca tttcacagtt ttcgaagata 3420 gcacgcccat tacacgccct atcacaaaag gacactccct tcgaatggac ggaccaatgt 3480 caacgcgcat tcgaaacact gaagacatct ttcacctcag caccagtact gaagatagcg 3540 agcccgtacc atccatttgt tttagaatgt gactgttcag actttgccct aggggcagta 3600 ctatcacaag tctctgagga cgatggcgaa ctccacccgg tagcttactt gtcacgatca 3660 ctcatcaaag cagaaagaaa ttacgaaatt ttcgataagg 
agcttctagc agttgtggcg 3720 tctttcaagg aatggagaca ttaccttgaa ggcaacccca acagactgag tgtaatcgta 3780 tataccgacc acaagaacct tcaatcactg atgacaacaa aagaattgac ccgtagacaa 3840 gctagatggg cggaaatttt aggaagtttt gatttcgaga tacgcttcag gccgggtaag 3900 caatcaacga agccggatgc tctgtcgcga aggccagatt tagctccttc gaaagaggac 3960 aagctctcgt ttggacagtt aatcaaacct gaaaacctac cggaggatgc tttcattgat 4020 gaattagaag tgattgagaa gtggttcgat gatgagaggc aagtagaaga gccggcaata 4080 gacgaaattg aattggaaga tgaaaacgat aatgaagaag tatggaatga tttaaaatta 4140 attgaagaag ttaaaatcag atcacgcaat gactcacgaa tcacagaaac tatgaagcta 4200 tgtgaagaaa tgccgaattc aaagttattg aaagactaca agtgtatcaa cagagtgtta 4260 tattttaaaa atattattga agtccccaat gatgcagaac tgaaactgat gatcatttga 4320 tctcgacatg atagcagatt agcaggacac ccgggtagga tgcgcactct cgctttagtc 4380 aaaagagcgt ttcactggcc atcactaaag agctacgtaa atcaatacgt tgacggatgt 4440 cattcatgtc aaaggatcaa agcaagaaca acaaaaccct tcggaagttt acaaccgcta 4500 ccaatcccga aaggaccgtg gacggatatc tgctatgatc taatcactga cctaccacaa 4560 tccaatggac acgatagtat cctaactgtt gttgattgcc taactaagat ggcccatttc 4620 atagcctgta acaaaactat gaattctgac caattagctg atttaatgat ttcgaatgtt 4680 tggaagctcc atgggacgcc tgaaactatc acatccgata gaggcagcgt cttcatctct 4740 aggataacaa gagatatcaa caggaagcta gggattcata ctcaagcgtc gacggcttat 4800 caccctcaaa ctgacggtca gtccgaaatt acaaacaagg cagtggaaca gtacatcagg 4860 cactttacaa gctacaagca agatgattgg tcaatgctct taccaactgc agaatttgcg 4920 tacaacaaca acgagcatgt tgcgataggt atgtctccgt ttagggctaa ctacggctat 4980 gatgtaggat ttgcaggagt agcgtcgagt ggacaacaaa tcccactggt tgaggacagg 5040 ttcaagcaat tacgtgaagt gcaacaagaa ttgaaagatt cattattgat tacgcaagaa 5100 aatatgaaga atcaatttga taagaaagta ctggaaacac caaaatggaa aatcgggatg 5160 aaggtttggt taagtagcaa acatatatca actacgagac caaccacgaa attctctcac 5220 caatggttag gaccattcaa gataagtagt aaaatatcaa ctaatgctta caagttagaa 5280 ttaccaatgt cgatgaaaaa agttcacaat gtttttcacg taaatttatt aagagaattt 5340 aggaaaggca cgatcaaagg tcaagaagca acacaaccac cgccaattgt 
gatagaagaa 5400 gaagaagaat atgaggtcca ggaggtatta gacaaaagaa gaagaggaaa agggaaaatg 5460 gagtatttaa tcagttggaa aggatatgat tcaagctttg acacgtggga accggaagaa 5520 aatttgaaga atgcgagtga gttagtaaaa caattcaatt caaaatatcc aaatgcagat 5580 gaaagttata gaaaggcaaa ggggtacaag tgagggttat ggctttttcc cactgggttt 5640 tttaatgcca acccggggaa agattctaac ctgcaagagg gggttgagtc ataaaagggg 5700 gagtga 5706 // ID MarinerN-1_AO repbase; DNA; FNG; 492 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE It is a family of nonautonomous Mariner DNA transposons- a DE consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW MarinerN-1_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-492 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-492 RA Kapitonov V.V. and Jurka J.; RT "MarinerN-1_AO, a family of nonautonomous Mariner DNA transposons RT in the Aspergillus oryzae genome."; RL Repbase Reports 6(1), 40-40 (2006). XX DR [2] (Consensus) XX CC This nonautonomous family of Mariner DNA transposons is CC characterized by TA target-site duplications and 23-bp TIRs. 
XX SQ Sequence 492 BP; 178 A; 86 C; 53 G; 163 T; 12 other; cagtaaaaca ccgttataag caaccccgat atagggaatt tacgcgttat aagcaatcta 60 attgattgtc tnaaatcgat taccatacaa aattccttaa aaccccgata cagggaaatt 120 tyatcattct gtctaattct ctatacaaaa ttgcttataa gggartaaag tggatyctyt 180 tracyaatta trtattttta aaaatatata aagatcacgt gctctaytay tttttagtrt 240 ttttagattt aaaatcttac ctgtacaaca catcaacaac taaaatatat atatttctct 300 actacactat accttactat tagaataccc tttactacgc agaaagatcg ataaatatac 360 aatataaatt atatatacta cttaaaatcc tgattyctac tagtctatat agatatttca 420 atataagcaa tcccccgata taagcaattt aggcagccag ctcaattgat tgcttataac 480 ggggttttac tg 492 // ID PIF_Harbinger-1_CryNeo repbase; DNA; FNG; 3408 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW PIF_Harbinger-1_CryNeo. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). 
XX DR [1] (Consensus) XX SQ Sequence 3408 BP; 872 A; 818 C; 785 G; 933 T; 0 other; tagggaattc aaaacaggag cgtcgcctta aactggagct cactaaaccc tcagggtctc 60 agggaattca tctcagggtg gccctagaaa agtggaggtt aaaccctaga attccaccgt 120 ggcatttctg tttaaaaaga gtttgtgtaa agaagtatga aattacgtaa tttagtttta 180 ctaattaagc gatgcgacat gttacatgac gactgagatg catcttctgc tgtttataca 240 tacctatgca taaccaccgc attcttgccc ctcagttccc taatcataat catatttggt 300 cccactcgtc gatatgcccc aagtatcaaa gaaacagaag cttgagaaga aggcctacag 360 gatggctcgc atctcagctg ctctctgtct accatattca gctgagatta ggatgtttta 420 tacaactgtc tcacagtctc gataccttaa acgccctcac cagtactctg tcgtaaggaa 480 tcgaaactgg aatgcaagaa tcgattgcat tcatcaactg cctgagaggg atttcaaacg 540 caagctccga gtctcctacc aagagttcca ttcgattctt gacctcatca aggacagtaa 600 tgtttttatt tcacaaggac cacggaaaca agcccctcct ctgtatcagc taaccgtggc 660 cctgtataga tttggccacg aggggaattc ttgcaacgcg tatgaaatcg gccataactt 720 tgacatctca ggtatgtgtc tccaaggtca tcccctatag gaaacttgtg ctgactaaat 780 acagagggct catcacttct ttggacgtat agagtgattg aagcttttat gggtatcgaa 840 gaccaggtgg tggtgtggcc ggatgaaaga ggaaggtctt gcattgctaa taccttcgac 900 gaagaaagat tgcccggtgg ttgcgtgggg gtcatggatg gctgcctaat tccgtttgcc 960 atcaaacccc cgagaccaga tgcctcggac tttttctctt ataaatccag atatggattt 1020 tcaatcctag ccgtttgcga tgataaaatg cggatcacat tcgcccagta tgggttcccg 1080 gcttcttgtc atgatgcacg ggcttacaat agctccatcc tctcgagcca atcccaccgg 1140 ttcttcgccc caggccagta tgtggtagct gattcagcgt tccctgttgg agaccattgt 1200 atatctctgt tcaaaacccc ccgtaatgca gctcgcttag gtgaatcgga agtgagttag 1260 tgatcccctg cggacatttc aggtgaactg acatctactt tcaacagaga gtctttaacc 1320 aaagggcagc tggcgttcgt gtctcaattg aacatacttt tggtatactc aagagtcgat 1380 ggcagtcact tcgaggtttg cgactactga ttcgagatcg atttgatgag ggaatcgcat 1440 catgctggat tcgttcttgc gtgatccttc acaatcttct aattgccaca ggcgagtggt 1500 acataagcgg cgaaggagat gagatggatc gcgaggatgc tgaataccga aggcaagggc 1560 aagacgaaga ggagcgcata ttactgcaag gtaatggtcc tcatcgtcgc catcaagtta 1620 tgcacttgat 
gggcttagtt tagagccaac aatcggggga gagagccgga ccaaatcaga 1680 tgtgcgaact actatgcatg ttttattcct tgatggcaaa accttttaca tttgtctgtt 1740 ttcgttccat tctttcaggt catcggttgc cttctttttg gcttcctcat acgaggcccc 1800 ttccgccttg tggagcgcca aaaactcctt ccacatatca aaatcgactt ttttaacctc 1860 tgcaagagct ctcttttcct cggcagcaac gccctccttg aggatcgcat gcttcgcttc 1920 ttgcatgcga agaagatcct tgtgtctcct ctcggagttt gcctcgcttg cctcggcgag 1980 caactgtagt tggtcttcgg cgctacaagc tttggccctc gcatttacag agaggactcg 2040 ttgcctcgag agagcgccgg aaggagagac accagattca gaggctgagc tagggatatg 2100 gacagaagga gcggtcgaag tagatgccct gtctgcgata acctcgggac gatcggggca 2160 agcgggaact ggcatagtga caagaccctg ggtgcgtatc gattgagagg ggcctgatga 2220 tgtgacatct gcgatggctg cagctgaata gcaagtcaat agtacaaaca aagaatagaa 2280 atagactgga aaccgattca ctttcgtgat ggaggtcagt atcccattgg acaatatcat 2340 cgtcatcgcc ctcctcctct tccttctcct gcctgattct cttcctcaac ctctcacgat 2400 cctggtcctc atcttcgtcc agagtcgagt gcccaggttc aaccatagca gcggccgagt 2460 gcgacatgat cggcatcagc tcactatacc aagggcaaat gctaatggcg tcatctgggt 2520 aaaaacattc gtcaataagt gaacatgaca aatgatggaa aaggcgcaaa acatgagact 2580 cactgccatt aggccccccc gaacctgttt tgttgatctc atcgtttgca cttttgaagg 2640 tgctccacat tcttttaatc tatatatgtt ttgccagttg aggacatata agactagaga 2700 caacactgtc tgtacccacc tgtctcaaaa cagactgcac gtcgcgatta gtggcgcacc 2760 ccttctggac gagatactcc agagcaactt tactccagtg ctccctggtt tttcctgtca 2820 cattgtctga tttccatcta tagaacatca aagtcccatc tgagtcaggc gtacccagcc 2880 agtcgatcaa atggtcattg gcgttacgct ggtcattgcc aaggtcattc tcccagttta 2940 ttgttcgctg cctttttcct ttggtagcag agtcggcctg actgctgtcc tgaccttgtg 3000 ttgttgcgga ttcttttcct ttgtgcatga tatatgtaga gtgatgtgtc agagctaaag 3060 aagatggaag gtagaaacag atttttgctc cacacatgca gactacttgt aggcgcgttt 3120 tctccttcgt tcgttggaga taaggatccg catcatgttt aagaccatct taaatttaag 3180 tcgggcgtca gacccctacc ttaaacttaa gaccaaaaat taaaaaaatt acctggaaag 3240 gaagattttg aagttgccgc aagaccccgc cgccttaaat ttcagatccc agttaagggt 3300 ctataagggt tcaaaaaggg 
tccaaagagg ttaaattctt acgtaaaatg acgacttctg 3360 acttaaacca ggtttaagga ggtcttaaac gctgttttga attccctc 3408 // ID Copia-1_MPA-LTR repbase; DNA; FNG; 534 BP. XX AC ADBL01000478; XX DT 26-MAR-2011 (Rel. 16.03, Created) DT 26-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Magnaporthe poae genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_MPA_; KW Copia-1_MPA-I; Copia-1_MPA-LTR. XX OS Magnaporthe poae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-534 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Magnaporthe poae genome."; RL Direct Submission to RU (26-MAR-2011). XX DR Genome; ADBL01000478; Positions 1578 2111. XX SQ Sequence 534 BP; 133 A; 138 C; 124 G; 139 T; 0 other; tgttaaaaat aactattgca tgtaacaggg tcctaccctg taacccttac ctgttagggt 60 tagtttatgt aattagctta aggttaaggc acgcacctaa cctaccctgc actttctact 120 gctttgtaag gggttgggcc acctaccctg taagggttgg caccacgagt cacctaccct 180 gcagggcggc cacgcaagtg aatcgctccc acgattaagt cagatttgtc ggcctaacgc 240 ggagatgatt ttggtggccg tgggtcgatt ttttgcaaat cccatacaaa ataatttttc 300 ggaaattagc ttaggtcacc ggaacccccg cccgcgcaaa tcccgcggta ccgatgtcgg 360 tgggtatatg agggagtaga gttggctgcc cctcccaaga ataacaagtc tgtaacccac 420 aatttatttg ggggccctgc attgtaattt agtgcattgc agggcccaaa ccaattaaac 480 gtcagtccct gtgatagcgc tgcatcccct gcgcacccgc tatataatat aaca 534 // ID Gypsy-1_LBS-LTR repbase; DNA; FNG; 145 BP. XX AC ABFE01000018; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_LBS_; KW Gypsy-1_LBS-I; Gypsy-1_LBS-LTR. 
XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-145 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000018; Positions 52294 52150. XX SQ Sequence 145 BP; 42 A; 36 C; 39 G; 28 T; 0 other; tggaataggg tatggagcac cccttccaag gaggcacgtg atgcgtagat tagtcatcaa 60 gtagtgtacc acagcgtgta gagagaggga gaaaccacca cttcaccagg catccacagt 120 gtcaaaggca tcagtgccct ttcca 145 // ID Gypsy-2_LENY-LTR repbase; DNA; FNG; 527 BP. XX AC AAPO01000122; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Lodderomyces elongisporus genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_LENY_; KW Gypsy-2_LENY-I; Gypsy-2_LENY-LTR. XX OS Lodderomyces elongisporus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Lodderomyces. XX RN [1] RP 1-527 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lodderomyces elongisporus RT genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; AAPO01000122; Positions 21303 21829. 
XX SQ Sequence 527 BP; 165 A; 79 C; 120 G; 163 T; 0 other; tgatgttata tatcgtgccc aagggctcga ttagccaaga gatttaacca actgcaagtt 60 gatgctagtt cttacattca agttccaaga gttaaagagt ggagaaccaa ctcaagttat 120 atcgattaca tgagctacaa gaataggggt aattggacta ggagtggggt aaacgaatgg 180 gaatagcttg gcttgtattg cataagatcg catggcttga atcgtatgag attcgaggag 240 cggggtaagc gaacgaggcg gaatagcttg gcatgaagat atggaatatg aggagtcgtt 300 tagatatgtt gtcttttata tgtcttatat agcatatagt tcttgaggac aagttctggt 360 ataaatagag gacttgttct tcaagaagtt tgctttttag ttgtttatga attaacttgt 420 ttaccaatca acaaaaacac atttttatag taacactaat actatagatc aagcacctca 480 atttatacgt acctcgaacc ttgttgcaag tttcaaggtt catcgca 527 // ID Copia-44_MLP-I repbase; DNA; FNG; 4728 BP. XX AC AECX01001150; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-44_MLP_; KW Copia-44_MLP-LTR; Copia-44_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4728 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001150; Positions 80537 85264. XX CC Positions [1957-2481] - Integrase core CC 'GTTTA' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 55..4728 FT /product="Copia-44_MLP-I_1p" FT /translation="MSNREDILDDSQLPTPASTPAPSSEEIDDNHVDSDSS FT SIDSTHSAHTMVPASHQQINMATTAPEILDTDTPAQRQASKMTSILHKVAI FT STKLSSSNYVAWSDSIRFGLMAASYDHYLDAEEVTGQTIDPEIILATKKAI FT FYWLLASIESTQSTRFISMISKFENGIKTTPPSPSLLWKTIRDYHISNSES FT VKLMLRSDITDLSQGSTKDLLDYIDTFRAKVDAYLGSNGEMSEEEQARQFV FT RSLNREWAEKGCDLLDAGHVKFRNLETELKKAYQTRKMFSSNRQQISRAAE FT SSEASQSRRTGRWSTCNKNKCLGRDHPTKPHEQSECYHHPNNAKKMEDWKR FT SKQESGEWVEYPRGSRGLNRGGFRGRGRGRGNPQPARNQSNLTDFPNSDDL FT YSVFENLRLEDRDVSYNVEIDGQFSCTARPQLACVGDRCDLVALIDTGASH FT HMFHDASLFESATLMANHDPGAKLNLAGGGATLDIHSIGNVNLLNSKGEII FT KLKDCLYVPDLSRNLIAGGRLLRAGAVTTVLADPNFRIDHGKKELFIGQFI FT GEGSMMYVPIKSSVSHHALRSSISTANQLTILKLHYSLGHPSEKYLKRMWN FT LGYFDDVLPNNVNTNDFAIITKCPICPLAKNHRLPFSSTRPRATKFLENVH FT VDLSGIIRTSTVNQEEYYILFTDDYSSYRISFGLPNKSAETVFECFKRYIS FT FSERQTGEKLKMFSFDGGGEFINGILTPYLEDLGINTRITSPHTPEENGVA FT ERSNRTINTKARCMLIQSCIPVKYWYHAISYAVMLQNRTVTTSLNLKNTPH FT SLWKNRQPSMKRFHPFGCLAYRHIRKEIRGGKFEPVSRPGVLLGAAEDNHN FT FVVLDLESNKIYTSHDVTFQPLVFPFMKDAENNPDWMFIEDLPVEPTEDGE FT ELIDLNPNPPMCPDLCDEEDDVIEERLTPRTITEPPALIEESEVETEEPIP FT EPIEKEPTPAAPTPPAIEPRRSGRERHPVERYSPGDSNWIEAHPMRSIVPK FT GMRCDILSKVYQCREAMKARAEPKSYKMAMKMHDAEKWKAACDKEMQNIKN FT MGVWEIVDRPKNAPVVGGRWHFKYKVNPDGSISKHKARYVAKGYTQTEGID FT FNDTFAPTGRLASFRILVAVAAAKDWAIEQMDAIAAFLNSDLKEEIYLELP FT EGYDAERADGKVARLRKALYGLKQSARCWSDEVKSKLTSIGLIQNPHDACL FT WYGKDEYGLETLLYLHVDDMAITGDKIDEIKDLLKLKWKMEDLGPAHCVVG FT IEIHATDDGGYALSQSSFIQTVLERFNSEDCKPASTPFPGGTKILKASESD FT VIEFQQSGLPYNSLVGSLMYIAQGTRPDIAYAVGALSQHLSKPSLTAWQMG FT MHVLRYLKGTQCMGLNYSPKDNIISGNQSWSFPECHTDSDWAGDPSTRRST FT TGYLFKLNGAAVSWKSRLQPTVSLSSTEAEYRATTEAGQEVVWLRGLLSNI FT SLKQNFPTILCSDSTGAVALTQKSIFHARTKHIEVQYHWIREQVEKNLIKL FT RHISNKNMFADTLTKPLHPGPFKELRNQIGLDVVVGQLKQGV" XX SQ Sequence 4728 BP; 1445 A; 1042 C; 1049 G; 1192 T; 0 other; tggtagcgag agtttggttc aactcttatc tattctaatc gaaacgcaag tcttatgtca 60 aatagagaag atattctaga cgattcccag ttgcctacac ctgcatctac cccagctcca 120 agttctgaag 
agattgacga taatcacgtc gattctgatt ccagctcaat tgattcaacc 180 cattcagcgc acactatggt cccagcatct catcaacaga tcaacatggc aactactgcc 240 cctgagattt tagatactga tactccagca caacgacaag ccagtaaaat gacttcgatt 300 ttacataagg ttgctatttc cacgaagctg tcgtcgtcta attacgttgc ttggtcagat 360 agcatacgtt tcggattaat ggcggcatca tacgatcatt atcttgatgc tgaggaagtg 420 acaggacaaa ccatcgaccc tgaaatcatc ttagccacga agaaagccat cttctactgg 480 ttgctcgcta gcatcgagtc cacccaatcc acccgattca tatctatgat ctcgaagttc 540 gaaaacggca ttaaaactac ccctccatcg ccttcgcttc tatggaagac tatcagagat 600 taccacatca gtaattcgga atctgttaaa ctcatgctga gaagtgatat caccgatttg 660 tctcaaggct ctacgaagga tctccttgat tacattgata ctttcagagc aaaagtggac 720 gcttacctcg gctcgaacgg agagatgtct gaagaagaac aagcgagaca gtttgttagg 780 tcactcaatc gggagtgggc tgagaaaggg tgtgaccttt tggacgccgg tcatgtcaaa 840 tttcgaaatc ttgagaccga gctgaagaaa gcttaccaaa cccggaagat gttttcatcg 900 aatcgccaac aaatcagtcg agctgccgaa tcctctgaag ccagtcaaag tcgtcgaact 960 ggaagatggt cgacgtgtaa caaaaacaag tgcttaggac gtgatcatcc gactaaacct 1020 catgaacagt ccgaatgcta ccatcatcca aataacgcaa agaagatgga ggactggaag 1080 cgatcaaaac aagaatctgg ggagtgggtg gagtaccctc gtggcagtcg tggtttaaat 1140 cgaggtggtt ttcgaggaag aggtcgaggc cgtggcaatc cacaaccggc cagaaatcaa 1200 tctaatctaa ccgatttccc aaattccgac gatctctact cggtatttga aaaccttcga 1260 cttgaagaca gagatgttag ctacaatgtt gaaatcgatg gtcagttttc ctgcactgct 1320 cgtcctcagt tagcatgtgt tggagaccgg tgtgatcttg tggcgctcat cgatactgga 1380 gcgtcacatc acatgttcca cgacgccagc ttgtttgaaa gcgccacgct gatggctaac 1440 cacgacccgg gtgccaagtt aaatctggct ggaggcggtg ccacactcga catacattcc 1500 attggcaacg tgaatctatt gaattcaaaa ggggaaatta tcaaacttaa ggattgttta 1560 tatgtacctg acctttctcg aaatctgatc gctggtggga gattattaag agctggagct 1620 gtgactactg tacttgccga tccaaatttc agaatcgatc atgggaagaa ggaattgttt 1680 attggccaat tcataggaga aggtagtatg atgtatgtgc caatcaaatc atcggtcagt 1740 catcatgctc ttcggtcttc aatatcaaca gcaaatcaac tcaccattct caaactccac 1800 tattctctag gtcacccaag cgagaagtac 
ttaaagagaa tgtggaatct gggttatttt 1860 gatgatgtct taccaaataa tgttaatacc aacgactttg caattataac aaagtgtcct 1920 atttgcccgt tagccaaaaa tcatcgcctg ccattttctt caaccagacc cagagcgact 1980 aagtttcttg aaaatgtaca tgtggatttg agtggcatca taagaacctc aactgtaaat 2040 caagaagaat actatatatt attcacagat gattacagca gttatcgtat ctctttcggc 2100 ctgcctaata aaagcgccga gactgttttt gaatgtttca aacgatacat ctctttctca 2160 gaaagacaga ctggggagaa attgaagatg ttctcatttg atggaggagg cgaattcata 2220 aatggaattc ttaccccgta cttagaagat ctcggcatca acacacgaat cacttctcct 2280 catacaccag aggaaaatgg tgttgctgag cgttctaaca gaactatcaa cacaaaagca 2340 aggtgtatgt taattcaatc atgtatacct gtaaaatact ggtatcacgc aatctcgtac 2400 gctgtaatgc tacagaatag aaccgtaacc acgtctctga atctcaaaaa taccccacac 2460 tctctctgga agaatcgaca gcctagtatg aaaagatttc acccattcgg atgccttgct 2520 tatcgccaca tcaggaagga gatccgtgga gggaagtttg aacctgtctc tcgaccaggc 2580 gtcttactag gagccgctga agacaatcac aacttcgtcg tactagattt agaatcaaat 2640 aaaatttaca caagtcacga tgtgactttt caaccgctag tcttcccttt tatgaaggat 2700 gccgagaaca atccagactg gatgttcatc gaagacctcc cagttgaacc tactgaggat 2760 ggtgaagaac ttattgatct caatcccaat ccacctatgt gtcctgatct ctgtgacgag 2820 gaggatgatg tgatagagga gcgtctcact cccagaacca tcacagaacc tcccgcttta 2880 attgaggaaa gcgaagtcga aaccgaagaa ccaattcctg aacctattga aaaggaaccc 2940 actcctgctg caccaactcc gcctgctatt gaacctagaa ggtcaggaag agaacgtcat 3000 cctgtggagc gttactctcc tggagactcg aactggattg aagctcaccc tatgcgaagt 3060 attgtaccaa aaggtatgag atgtgatata ctatccaaag tataccaatg tcgtgaagca 3120 atgaaggcaa gagctgaacc aaagagctac aagatggcaa tgaagatgca tgatgctgag 3180 aaatggaagg cagcgtgtga taaagaaatg caaaatatca agaatatggg agtctgggaa 3240 attgttgatc gtcctaaaaa tgcgcccgta gtaggaggaa gatggcattt taaatataaa 3300 gtgaatcccg acggaagcat atcaaaacat aaagctagat atgttgcaaa aggctacaca 3360 cagactgaag gcattgattt caatgatacg tttgctccca ctggacgatt ggcttcattt 3420 cggattctgg ttgccgttgc cgctgctaaa gactgggcta ttgaacagat ggatgcaata 3480 gcggcgttcc tcaacagcga cttaaaagaa gagatttatt 
tagaattacc agaaggctat 3540 gatgctgaac gtgcagatgg aaaggtagca agattgagaa aggctctata cggcctgaag 3600 cagtctgccc gatgttggag cgatgaagtc aaatccaaac tcacgagcat tggattgata 3660 caaaaccctc atgatgcgtg cttgtggtat ggaaaagatg aatatggact agagacatta 3720 ctttaccttc atgttgatga tatggcgatc actggagaca agattgatga aattaaggat 3780 ttgctgaagc tgaagtggaa gatggaggac ctaggacctg ctcactgtgt cgtaggtatt 3840 gagattcatg caactgacga tggtggttat gcgttaagcc aatcatcctt cattcaaact 3900 gttcttgaac gatttaactc agaagactgc aagcctgcat caacaccatt tccaggtgga 3960 acgaagatcc tgaaagccag tgaatctgat gttattgaat ttcaacaatc aggactcccg 4020 tacaacagcc tagttggaag cttaatgtac atagctcaag gaaccaggcc ggacattgcg 4080 tatgctgttg gtgcgctatc tcaacactta tcgaagccat ctctcacagc ttggcagatg 4140 ggaatgcacg tcttgaggta cttgaagggt actcaatgca tggggttaaa ttattcaccc 4200 aaagacaata tcatttccgg taatcagagc tggtccttcc cggaatgcca tacggattca 4260 gactgggcag gagacccaag tactcgacgg tcaactactg ggtacctttt caaactcaat 4320 ggcgccgcag tgagctggaa aagcaggctt caacctacag tttcgctatc gtcaacagag 4380 gctgaataca gagcaacaac agaagcaggt caagaggtgg tatggttaag gggtttgtta 4440 tccaacatct cactcaagca gaattttcct acaatcttat gcagcgatag taccggcgca 4500 gtagctctca ctcagaaatc tatattccac gcgcgcacga aacacattga agtgcaatac 4560 cactggataa gagaacaagt cgagaagaac ctgataaaac ttcgacacat cagcaataaa 4620 aatatgtttg ccgatacttt aactaagcct ctgcaccctg gccctttcaa agagctacga 4680 aaccagatag ggttagatgt tgtagttgga cagctgaaac agggggtg 4728 // ID Gypsy-20_RO-LTR repbase; DNA; FNG; 300 BP. XX AC AACW02000339; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-20_RO_; KW Gypsy-20_RO-I; Gypsy-20_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-300 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000339; Positions 16167 15868. XX SQ Sequence 300 BP; 95 A; 62 C; 40 G; 103 T; 0 other; tgtcgtattt accttaataa ttttcttaat gtcgtgtgag atcactcaaa agattcttca 60 actaataatt tgactgtata tcatatatag cccaaagtgc tatatacgat aaacagccca 120 cacaatttat accgtactac ctcggaccct aaatcacagt attatattct aagacgtagt 180 ttgtattgaa tgggctcgct ctctcgtcct tactcaataa agaggttcaa gagactcatc 240 gatttttcga cttattttat tacttttatt aatcgagaga aataacacgc ctattccaca 300 // ID Copia-30_MLP-I repbase; DNA; FNG; 5049 BP. XX AC AECX01003112; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-30_MLP_; KW Copia-30_MLP-LTR; Copia-30_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5049 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01003112; Positions 6070 1022. XX CC Positions [2572-2958] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 3971..5032 FT /product="Copia-30_MLP-I_3p" FT /translation="MKKLGFKVCPVDNSLYTLRVGAHFIHVPMHVNDGMAF FT SNNKNFLNEFRENLKQFYKFRWNKNPTLHLGIHIKRYRVHRLITLDQSHYV FT DSILERFGMSDCNGVKTPLPQNVKLVTPTQEDSSEIEQYRAAVGMLNFLLV FT QTRPDISFAVSYLARFNSRHNDTHWAAVKHLLRYIKRTKSYTLQFGTNKAQ FT NMLVEGYADADYAGDVDTRRSTTGFVFYVYGSLVSWKSRRQHCVTLSTTEA FT EYLAIGDCAKHGLWLCRLLEHLCQEQSINVPIKLPLSNDNQGAVFLCNEAS FT VNNKSKHIDIRHHFIRELTREGKITVSHVSMKEMPADVLTKAVGANILTSS FT YDQLGISEIKS" FT CDS join(925..2568,2572..3492) FT /product="Copia-30_MLP-I_1p" FT /translation="MSLSKADAKDFKKVEDLECERAGVLEVIRASVTPTRL FT PVIRGIEDPKKAYETLLEHASQDDGLEVASLIAKVATIRFTGSEPISSFLD FT NINDLHTKLAEATSDDNTLKISDKLLAVFLLLSFPGDHFGTIRDQMFGDLT FT NLSTSKVLSQLQTKSTLSVVDESPVVMAATTRPSHTPQSRLVTPRTDKSPN FT APGVLREHWGFSHTNGVCSRQQKRQTGGHTQTNNSSSTPVSSLSDQEKVRR FT FNQLAAAGVIGFNAQASPVVKPTPHVPNQTETQPEGTVDCQFATSFNTIAE FT VPTQVSMTMSNVYPAASLTHKATLADTACNRHMFGDILLLDDLHEVDPVWI FT NVANADKSSQIKATMMGTAKLHAFDLDGAPSLVHIPNVLYSPALPANLISV FT TKLYESGYKIVDPHYGTNKSDVNMYFSDSRNIIPVYKDVGPAGFWRFYHFS FT EPCALSVVSPKPNETDLWHLRFGHLNNRSTSSIMENVLRVSPNAPTTCKAC FT TMGKQTRSSHTGNLPRSQVPFYRIHCDLAGPFPFASIEGFNYIMVLIDDAT FT RNWAVLLKSKSQAFEAFKQFHSMIGNQTSRKLAVFKSDRGGEFTSTAFTKY FT LSDNGIVHEMGPPESPQQNSVAERFMRTISERLQTQLIHGNLPVRLWGELI FT MATSFLLNLCPSKSIRYNCPEYAWQKHALRIKSPNIPYSCLCVIGCLAYTI FT PPGHHTKLAPRSIKSIMVGYEKNSNAYRLWDPKTSRILISNDVVFDETNFP FT LQTIDKTSTEELTISTIPCGTKFGSIHATHTQTPDPFVTSWVFLMIFKYPI FT HLSNLLAILDKTLSLLKGLVILLATTQLLNSTAVSTQRSMTGLRTTNPHTH FT RR" XX SQ Sequence 5049 BP; 1469 A; 1228 C; 965 G; 1387 T; 0 other; aggtatgatc ttgttattgt ctggtttcta tttgtatagt tagtaaagtt tgataagaaa 60 ttaattcaac tatattgaaa gcttcttctc tacaaccatt atatcatctc atcactccgg 120 attactcatt ccttgttgac ttctgactga tataattgtt gtttgggatt gatcaaggta 180 tgatcttgtt attgtttggt ttctatttgt tttgtttgtt tcgtttgata agaaattaat 240 tcaactatat tgaaagcttc ttctctacaa ccattatatc atctcatcac tccggattac 300 tcattccttc ttgacttctg actgatataa ttgttgtttg ggattgatca aggaatctga 360 gattcctata ggttatgagc 
ccaggctccc tggcactcga gtcgatcatc tctcaacaaa 420 aagtaaacta gagttctaga tacaatctct tggcacgaga atccgttcaa tctaaactcg 480 aaatactatt ccaatagatc ctagtcatct agaatcttct tgacaccctg aactgtacca 540 attcgatcat aatcactcaa attccggagt aatctaacac attcagttca ctgtcaaact 600 tcagagatta ccccacaatc acttccccac gctggatttg acaactatac cagatcccag 660 tcaatctgac ccacccaccc taaacccaaa ctcaactgtc gattcagtaa ctcaaatcaa 720 tacaactacc actgttttgg aaactgtcaa caatcgacaa tccgaaatga catctaatca 780 aaacgccact tctgtacctt ctaatcactc atccttctta atcgcccaaa tacccaaact 840 taatgacacc aataaagtcg attggcaatt aggattgaag acctacctta agggccgcag 900 acttgcaaat atgtctccac tgacatgtca ctctccaaag ccgatgcgaa ggacttcaag 960 aaggtagagg accttgaatg cgagcgagct ggtgtgttgg aagtgatccg tgcttctgtc 1020 actcccactc gtcttccggt aatccgagga attgaggatc ccaagaaggc ttatgagacg 1080 cttttagaac atgcctctca agacgatgga ttggaggtag catccctaat tgccaaagtt 1140 gctactatta gattcaccgg atctgaacca atatcttcct ttctcgataa catcaatgat 1200 cttcacacca agcttgcgga agcaacgtct gatgacaaca ctttaaaaat aagtgacaag 1260 ctactggcag tgtttctcct cttaagcttt cctggtgatc actttggcac aatcagagac 1320 cagatgtttg gcgatttgac aaacctatct acttcaaaag tcttatctca actccaaaca 1380 aagtccactc ttagcgttgt tgacgaatct cctgtcgtca tggcagcgac tacaagaccc 1440 agccatacac ctcaatcacg actcgtcact cctaggacgg acaaatctcc taatgctccc 1500 ggcgttctgc gtgagcattg gggattttcc cataccaatg gagtttgttc aaggcagcag 1560 aagcggcaaa ctggaggtca tacccagacg aacaattcat cctcaactcc agtctcatca 1620 ctctcagatc aagaaaaagt gagaagattc aatcaactcg ctgcagctgg cgtgatagga 1680 ttcaatgctc aagcaagccc tgtggtcaag ccgactcccc atgttcccaa ccagacggaa 1740 acacagcctg aggggactgt agattgccag ttcgcgactt cgttcaacac aatagctgaa 1800 gttccaactc aggtatcaat gacgatgtcc aatgtatacc cggctgcatc tttgactcac 1860 aaggctactt tggctgatac tgcttgtaac cgacacatgt tcggcgatat cctacttttg 1920 gatgaccttc acgaagtcga tcctgtatgg attaatgtag cgaatgcgga taaatcgtca 1980 caaatcaagg ctaccatgat gggaactgcc aagcttcatg cgtttgactt agacggtgca 2040 ccatcacttg ttcacatacc caatgtcctt tactctcctg 
ctctacctgc caacctaatt 2100 tcagtaacca agctttacga atctggatac aaaattgttg acccacacta cggcaccaac 2160 aaaagcgacg tgaacatgta cttctcggac agcaggaaca tcatcccggt gtacaaagac 2220 gttggtcccg caggattctg gcggttctac cacttctctg aaccatgtgc gttgtccgta 2280 gtatcgccta agcccaatga aactgatttg tggcatctac gatttggaca tttgaacaat 2340 agaagtacct caagtatcat ggaaaatgtc ctacgagtta gtcctaatgc accgacaact 2400 tgcaaagcct gcaccatggg gaagcaaaca aggagcagtc atactggtaa tcttcctagg 2460 tctcaagtcc ctttttatcg tattcattgc gaccttgctg gtccgtttcc ttttgcaagc 2520 attgagggtt ttaattacat tatggtgctt atagatgatg ctacacgttg aaactgggct 2580 gtcctcttaa aatctaaatc tcaggcattt gaagccttta aacaatttca ttctatgatc 2640 ggtaatcaaa cttcacgtaa attggctgtt ttcaaaagtg atagaggagg cgagttcacc 2700 agtactgcgt tcaccaaata ccttagtgac aatggtattg tccatgaaat gggacctcct 2760 gaaagccctc aacaaaactc agtggctgag cgcttcatga gaacaatttc tgagagacta 2820 caaactcaac tgatccatgg aaatcttcca gtgagattat ggggagaatt aatcatggct 2880 acttctttcc ttcttaatct atgcccttcc aagtccatcc gttacaactg tcctgagtat 2940 gcctggcaga agcacgcctt aagaatcaaa agtcccaaca ttccctacag ttgtctttgt 3000 gttattggtt gccttgcgta tacgatacct cctggccatc ataccaagtt agcaccccgc 3060 tcaatcaaat caataatggt tggatatgaa aagaactcga atgcgtaccg actatgggat 3120 cccaaaactt cccgtattct gatatccaat gatgttgtgt ttgatgaaac taacttccct 3180 ttacaaacca ttgacaaaac atccacagaa gaactaacaa tctcaacaat cccatgtggg 3240 acgaagtttg ggagcatcca tgctactcac actcagactc cagatccctt cgtaacttct 3300 tgggtgttcc tcatgatctt caagtaccca atccacctat ccaacctcct cgccattctg 3360 gacaaaacac tcagcctgtt gaaaggcttg gtgattttgt tggccaccac acagttgctg 3420 aactcaactg ctgtttccac acagcggtca atgacgggtc tgagaacgac aaaccctcat 3480 actcacaggc gatgaagggt cccaatcgag aggattggct tgctgcaatg tccgaggaat 3540 tcacttctct tcagctccat gccgttggac aacttgttga accaccacca gatgcaaata 3600 tactacctgg gatgtggcga ttgaaacaca agagagacga gtttggacga atcacaaaat 3660 acaaagctag gtgggtggcc ggtggtaatc atcaaataaa aggaatcgac ttcgatttga 3720 catacacatc tgttggcctc accaatactc tttgaacact atatgccctt 
gccgccaagg 3780 aagacctaga aatgcagaaa ttcgacattg aaacggcctt cttaaacggc aacatgaagc 3840 accgagtctt tgtacgtcaa gttactggct ttcacgaccc gtcaaaacct gggcacgtca 3900 tggaacttga ccgctcactt tatgaaccta ccaagcacat cgtgagttta atgaagactt 3960 ggacactaaa atgaagaaac tcggattcaa ggtctgcccg gttgataact ctctgtacac 4020 tctgagggtt ggagcccatt tcattcatgt tccaatgcac gtcaatgatg ggatggcttt 4080 ctcaaataac aaaaattttc taaatgaatt ccgtgaaaac ttgaaacaat tctacaagtt 4140 tagatggaac aaaaacccaa cactccacct aggcattcac atcaaacgct acagggttca 4200 tcgtcttatc actctcgacc aatcccatta tgtagattcg atccttgaac gattcggcat 4260 gtctgattgc aacggtgtca agactcctct tcctcaaaac gtcaaactcg ttacaccaac 4320 tcaagaagat tcatctgaaa tcgagcaata ccgtgcggct gtaggcatgc tgaactttct 4380 attggtacag acaaggcctg acatatcttt tgcggtaagc tacttagctc ggttcaattc 4440 taggcacaat gatacacatt gggcggctgt aaagcatctt ctccgataca tcaagcgaac 4500 gaaatcatac acacttcaat tcggtactaa caaagctcag aatatgctgg tggagggata 4560 tgctgatgca gactacgctg gtgacgttga cactcgtcgc tccacgactg gttttgtatt 4620 ctatgtctat ggatccttag tttcttggaa aagccgccgt caacactgcg tgactttgtc 4680 cacaacggag gcagagtacc tagctattgg tgattgtgca aaacatggct tatggttatg 4740 ccgcttactt gaacatttat gccaagaaca gtccatcaat gttcctatca aattacctct 4800 atccaatgat aatcaaggcg cggtattttt atgcaatgaa gcgtctgtca acaataaatc 4860 aaaacatatc gacatacgcc atcacttcat tcgagaactt acacgagaag gaaaaatcac 4920 tgtctcccat gtatcaatga aggaaatgcc tgctgatgtg ttgacgaagg ctgtaggtgc 4980 aaatatctta acatcaagct atgatcaact agggattagt gaaataaaaa gttagtgagc 5040 agggggggc 5049 // ID Copia-2_SPDB-I repbase; DNA; FNG; 6082 BP. XX AC ACOE01000236; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Spizellomyces punctatus genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_SPDB_; KW Copia-2_SPDB-LTR; Copia-2_SPDB-I. 
XX OS Spizellomyces punctatus OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; OC Spizellomycetales; Spizellomycetaceae; Spizellomyces. XX RN [1] RP 1-6082 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Spizellomyces punctatus genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; ACOE01000236; Positions 11665 17746. XX CC Positions [2225-2671] - Integrase core CC 'AACCC' target site duplication CC LTRs are 94% similar to each other. XX FH Key Location/Qualifiers FT CDS 2441..4723 FT /product="Copia-2_SPDB-I_2p" FT /translation="MDCGIQHQQTTAYSSQQNSIAERWNHTSLGRVCAIFA FT DMKLPWKLWAILIQMVTYLHDLTPNSLSDKTPHELLFQCRPNVTHLRHIGC FT IAYAHVPAAGRDKMANRARKTYLVGYAQQNGTKAYLVWDLVCNAIATSRDI FT KFNEDAPLELKYQQDPSTMDLLDETTSDDDQEYQIDYIMDKCQTRVHKPEY FT LVKWTGYTDPTWEPRVTVDDTQVLDTWENCIQAYAIDVHTGKINPESLTFK FT QAMATPDANDYLIACQQEIESLKAHGTYVTVPHPPHKHVISTKWVCHKVFN FT PDGTFKKYKARAVCQGFTQRPGIDFDDMYSPTLPLPILRLFLMYMVQHGLD FT IHSMDSVTAFLNSEMDKEVFIEQFEGFELPNKPRKSWVLMLKKALYGLKQS FT PLLWYQNLRTALEELGYCVVEISPCMFYKYAAPGHTNSDVQPSLLTLVIIM FT IFVDDLLILTKDPEQMNESKKELHSKFSFMDQGHITNHLGLAYEYDISADI FT RTLKFSNPMYIDNLLDHFKLMNCHAIATPYAMSVRSQQMQESAPFPNPKIY FT QEAVGALTWASISWQWDIATAVGYVARNVSTPTQCDWEAVKHIFKYLKGTR FT DFAITYTSLEKDPCGLRVYTDADYAGDLSDCKSMSGLLATWNGYPIHWKSS FT KQTCVAHSTTEAEYITASVGFREILWIHQILADMFKPIAGFQLVPTPLHID FT NESAATLAQTKAINQCMKHIDRHYHGICRGVTDKQIVILDMDTKEQMADYF FT TKPLTQEDHN" FT CDS 4814..5923 FT /product="Copia-2_SPDB-I_3p" FT /translation="MHGDCETRQLNMIDINYTEIPAPPKPPNPPAPPTPPN FT PPKGPNRSSQLSKSSLSFPSLSPSSSSSSSPSLTISANSSPSSVTTSSKSL FT VSFAISSRTTPAAFFFTCRAFSFWNSVGGKAGALPLLQALFRERSWRPRYF FT PRASRCMGLCTGKDRVTSCSVFAKDNCLCNTSSLSFRCCKKAQRRAASMNS FT SLSALSLHAGSGTFPTAFATFLLAVNTTASKTPLLGLSSSLGVLPCLPSGR FT APASALSDTTPLTAPSTKGAGPCHHFLPNPGNKGPPGDAALLSAESSRSCW FT ERPFPQDFSLGSSNFPFSLPFLDEGTSRTSGVVSCSADHSRISSANVFAWG FT GSPLALPLPLSSFGAIIQKKKKSWKHR" XX SQ Sequence 6082 BP; 1770 A; 1643 C; 1334 G; 1335 T; 0 other; ggttatgagc cctggaagta gagtaccact tcaaagaaca ccgcctctca catgcagcaa 60 
aatgctggca tgatgaggca gcgctcaagg aacttctaca tgcatgcaag gcataccata 120 tttggacgaa ccctccgagc aggacctcct aaatcagtct gtatatcatg acacactccc 180 ggaggaccct gaaaacacag acccagatgt tgggtacagc agatacctat tggaccctgt 240 acgggaccct ggggcgcacg tgcccaaaca ggaaatggac caaataaaga tggaaatatg 300 gccaagcact ctggctgagg gcaccagtga gagaagaatg gaatactctg cagaatatcc 360 tccaagtggg acatatgctg tgcgcccgga ggacatggac tcgagcaatt gaatagatag 420 gcttgaagca cttgttggac agctcattct cactgtaaaa agagaaaaca gcaaagaagc 480 tgtgaggact gaaccaacag tgcaaaccca ttatggaaaa gcacagtatg cggacgagca 540 aagggcaaga ctgcatacac ctagtggggg cataccacac agtgggccgc caaaccctct 600 cctacaacca cccagcactt accacatagc gatgtcagtg aggtcaaatt cacccacaag 660 ctcagtttac atgtcaacat caagtacggc agacaagcgc ggatgcagag catttgtgga 720 tgggatgcta aaggatatcg ggcaatataa tggggaacca aatactatga agctcaatac 780 atttctgggc agagtagaca tggtaataaa gcaccagatg gttccacctg acatggttgt 840 agactttgtg agcaccaaat tcatgggatt agctcaacaa tggtggacat cactgtcacc 900 aacaagacag gcaagcttta ctgcttggta tgatcaagag gatggacacg gacgaacaat 960 aactgggtta cggtctgcct tgtatgtgcg cttcatgcca cgaacacata tgcatgatac 1020 attgagatgg ttgaaacaat taaagcccaa gacaaaccag tcccagagca tggatgagtt 1080 tgtggcatat ttcaactctc agatatactc actaccatct atgcctgatt ttattaaaca 1140 ggactacttc atttctgccc taccggcaga agtagcagca ctgctgcaat ccaatgagga 1200 aaaccttacc tctctggaaa ccctacaaga agcggcaacc cgaatctatc cagtgattca 1260 caagcatgcg aaaccccagg gaggcagtat gcacgccgcc cctgcacagg tcacgcaaat 1320 ggacaatgca tctggacaat ccacgacttc caaaaaggga aagaaaggaa aacaaaagaa 1380 gcaaaatgag ggccacatcg agaagcaacc gaattcaaac caacactcaa atcaaaactc 1440 gaatcaaagt acaccttgac catgcagata ctgcaatgca tcacatccaa ggactgtatg 1500 tcccttgatc ctcaagctgg tccagaaaca atccaactgc ttgagtgcaa gcctaacaga 1560 ggcttgccat gtggaaatgg tacacgatga tgtgccgtac agtgctctta tcaattccgg 1620 cgcgactgac catatgacac ccaatagggg tcatttggat ccctccacca tatatcagga 1680 tgtgaggaat gtcagcttag ctgacaacac cattttaaaa tcaactgtac aaggcaatgt 1740 tatcatcaaa gcagatggat 
attcgactcc actcgaattg cgtgatgtcc tccatgttga 1800 tgggctgcag aaaaacctaa tctccatccc caaatgtaat gaaaatggtg tggatgtaac 1860 cttccacagt agtggagaag tttctctcat gaaaaacagg tgacaaattg caagcggcat 1920 atgatggaac aatgcatatt acctggaagg aaccctgaaa ccccagggct ttccagatac 1980 agaagtctgt gcaacagagg tatcacctgc ccttgcacat gccagactcg gacacatagg 2040 acaacatgtc ctaggaaaga tggcattggc tatgctcagc tggcccatag tgggggctca 2100 cagtcatagg gactgtgagg catgcatact tggaaaatca tcctgcacac catttccacc 2160 agcaaacaaa gaaccaagag aactttgtga gctaattcac tccaatctac aaggaccatt 2220 ctgaattaca ggcctggatg gagaatgcta cgctctcgtg tacattgagt acacctctag 2280 gtatggtgtc acttactgca taccaaacaa ggaagcagag accatcctct ccaaattcaa 2340 ggaattcaag gcatggctcg aatgcaccac agacaagaaa atatgcactg ttcatacaga 2400 ccgaggaggc gaatatcatg gcatattcca caagtacctg atggactgtg gcatccagca 2460 tcaacagact acggcatact ccagtcaaca gaacagcatt gctgagcgct ggaaccatac 2520 aagcttaggg cgagtatgcg ccatcttcgc agacatgaaa ctgccatgga aactatgggc 2580 catcctaata caaatggtga cctacctcca tgatctcact ccaaactctt tgagtgacaa 2640 aaccccccac gagctactat ttcaatgccg ccccaacgtc acgcacctca ggcacattgg 2700 ctgcattgcg tatgcccatg tccctgctgc tggcagagac aagatggcca acagagcaag 2760 gaagacatac ctagtggggt atgcccaaca aaatggcacc aaggcctacc ttgtttggga 2820 tctggtatgc aatgctattg cgacctccag ggacatcaag tttaacgaag acgcccctct 2880 tgaactcaaa tatcagcaag atcctagcac tatggatctg ctggatgaga ccacttccga 2940 tgatgatcag gagtatcaaa ttgactacat catggacaaa tgccagactc gcgtgcacaa 3000 gcctgaatac cttgtgaaat ggactggtta tacagaccct acctgggaac cccgagtcac 3060 tgtcgatgac acacaagtgc tggacacatg ggaaaactgc atccaagcat atgccattga 3120 tgtgcataca ggaaaaataa accctgagtc cctgaccttt aaacaggcaa tggcaactcc 3180 agatgccaac gactacctga tagcctgcca acaggaaatt gaatccctga aggcacacgg 3240 cacgtacgtc actgtaccgc atccgcccca taagcatgtg atcagcacaa agtgggtttg 3300 ccacaaagta ttcaaccctg atggcacatt caagaaatac aaagccagag cagtatgcca 3360 aggcttcacc caaaggccag gaatcgactt tgatgatatg tattccccaa ccttaccact 3420 tccaatcctc cgcctcttcc tcatgtacat 
ggtccaacat ggattggaca tacactcaat 3480 ggattccgtc actgcttttc tcaacagtga aatggacaag gaagtcttca ttgagcaatt 3540 tgaaggcttt gaactaccta acaagccaag gaaaagctgg gtactgatgc tgaaaaaggc 3600 gctctatggc ctgaagcagt cacccctcct ctggtaccaa aacctcagaa ctgccctgga 3660 agaactcggc tactgcgtgg tggaaataag cccctgcatg ttctacaagt acgccgcgcc 3720 agggcacacc aactccgatg ttcagccatc cctgttgacg ctcgtaatca taatgatctt 3780 cgtggatgat ctactgatcc taaccaagga tcctgaacaa atgaatgagt caaagaagga 3840 gcttcatagt aagttttcct ttatggacca aggtcacata accaaccatc taggactggc 3900 atatgagtat gacatctccg cagacataag gaccctcaag ttctccaacc cgatgtatat 3960 tgacaacctg cttgaccatt tcaaattaat gaactgccat gcaattgcca caccatatgc 4020 catgtccgtg cgcagtcaac agatgcaaga atcagcacct ttccctaacc ccaagatata 4080 ccaagaagca gtgggggccc ttacttgggc ttctatctca tggcaatggg acattgccac 4140 tgcggtcgga tatgttgcgc gcaacgtcag cacacctacc caatgtgatt gggaggcagt 4200 caagcatatc ttcaaatatc tgaaaggaac aagggacttt gccattacct acacatccct 4260 ggagaaagat ccctgcggtc tcagggtgta tacagatgcg gactatgcag gcgacctctc 4320 ggattgcaaa tctatgtccg gcttgctagc tacctggaat ggctacccca tccattggaa 4380 aagtagcaag caaacctgtg ttgcccatag caccacagaa gcagagtaca tcacagcaag 4440 tgtgggattc agagagatcc tctggataca ccaaatcctc gcggacatgt ttaagcccat 4500 tgctggcttc cagctcgtgc ccacccctct ccatattgac aatgagagtg ctgccacttt 4560 agcccaaacc aaggccataa accaatgcat gaaacatatt gataggcatt accatggtat 4620 atgcagaggt gtcacagata aacaaatcgt gattctggac atggatacaa aggagcagat 4680 ggctgattac ttcaccaagc cattaaccca ggaggatcat aattgactga gactgtgaat 4740 catggatggc ctggataata ttcaaatgca tgcatgaccc gcgtatgtgg aggcaaactc 4800 agtggaagct gaaatgcacg gggattgtga aacacgacaa ctgaatatga ttgatataaa 4860 ttatacagaa ataccagcac cacccaaacc accaaatcca cctgcgccgc ccacaccacc 4920 aaatccacca aaagggccaa acagatcatc ccaactgtcc aaatcctcgc tgtccttccc 4980 ctcactctcc ccctcttcct cctcctcttc ctcaccctcc ctcaccatct ctgcaaactc 5040 ctccccctca tcagtcacca cctcctccaa gagcttggtg agcttcgcaa tctcgtcgcg 5100 caccacgccc gcagcctttt tcttcacatg ccgggctttc 
tccttctgga actcagtagg 5160 agggaaggct ggggcccttc cactcctcca agccctattc agagaaaggt cttggagacc 5220 acggtacttc cccagagctt ccagatgcat gggactgtgt acgggcaagg atcgcgtaac 5280 ctcctgttca gtctttgcca aagacaactg cctctgcaac acctcttccc tatccttccg 5340 gtgttgcaag aaggcacagc ggcgcgctgc ctctatgaac tcatccttga gtgcgctaag 5400 cttgcatgca ggctctggga cttttcccac tgcttttgcc acttttctct tggctgttaa 5460 caccacagcc tcaaaaaccc ctctcctggg tctctcctct tccttaggag tcttgccttg 5520 tctgccctct ggtcgtgcac ccgcctctgc cctcagtgac accaccccac tcaccgctcc 5580 ctctaccaaa ggagccggcc cctgccacca cttcttgcca aatccaggaa acaaagggcc 5640 accaggggat gccgcactac tctccgccga gagctcgcgc agctgttggg aaaggccctt 5700 tccccaagac ttctcgctgg gaagctcgaa cttccccttt tccttgccct tcttagatga 5760 agggaccagc aggacatcag gcgtggtctc ctgctccgca gaccactcgc ggattagctc 5820 ggccaatgtc tttgcatggg gtggctctcc cttggccttg cctttgcctt tatcttcctt 5880 tggtgccata atacaaaaaa aaaaaaagag ctggaaacac cgttagcagc tagccaaatg 5940 agctctgcaa gtctgcaatg atgtcacggc aggtcttaca gttgatagaa gtatgtgtga 6000 aagtgtagaa agtcggtggg caatgggttc ttgtcttgta aagaaaatag cggttcagac 6060 ttttgacagg tctgagtggg gg 6082 // ID Gypsy-116_MLP-LTR repbase; DNA; FNG; 159 BP. XX AC AECX01000868; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-116_MLP_; KW Gypsy-116_MLP-I; Gypsy-116_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-159 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000868; Positions 41526 41368. 
XX SQ Sequence 159 BP; 36 A; 38 C; 32 G; 53 T; 0 other; tgtaatgatc cttatattga gtactatgct tatagggaga ggaagggtct tttcacctct 60 cttgttgtat taccgttgac ctcacactgc aatctagtta gcattaggac agcaccttgt 120 cgcctcatca tcgtccccgc tgtgttctgg atcataaca 159 // ID YLT1_LTR repbase; DNA; FNG; 714 BP. XX AC AJ310725; XX DT 08-JUL-2003 (Rel. 8.06, Created) DT 08-JUL-2003 (Rel. 12.07, Last updated, Version 1) XX DE Yarrowia lipolytica retrotransposon Ylt1. XX KW Gypsy; LTR Retrotransposon; Transposable Element; YLI310725; YLT1; KW YLT1_I; YLT1_LTR; gag protein; pol protein. XX OS Yarrowia lipolytica OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Dipodascaceae; Yarrowia. XX RN [1] RP 1-714 RA Senam S.; RT "Ylt1 of the yeast Yarrowia lipolytica - sequencing reveals RT uncommon features of an fungal retrotransposon."; RL Unpublished. XX RN [2] RP 1-714 RA Barth G.; RT "Direct Submission to GenBank."; RL Direct Submission to Genbank (16-MAR-2001)Barth G., Institute of RL Microbiology, Dresden Technical University, Mommsenstrasse 13, RL Dresden, D-01062, GERMANY. XX DR Genbank; AJ310725; Positions 1 714. 
XX SQ Sequence 714 BP; 171 A; 188 C; 143 G; 212 T; 0 other; tgtaacactc gctctggaga gttagtcatc cgacagggta actctaatct cccaacacct 60 tattaactct gcgtaactgt aactcttctt gccacgtcga tcttactcaa ttttcctgct 120 catcatctgc tggattgttg tctatcgtct ggctctaata catttattgt ttattgccca 180 aacaactttc attgcacgta agtgaattgt tttataacag cgttcgccaa attgctgcgc 240 catcgtcgtc cggctgtcct accgttaggg gtagtgtgtc tcacactacc gaggttacta 300 gagttgggaa agcgatactg cctcggacac accacctggg tcttacgact gcagagagaa 360 tcggcgttac ctctctcaca aagcccttca gtaccgccgc ctgtcgggaa ccgcgttcag 420 gtggaacagg accacctccc ttgcacttct tggtatatca gtataggctg atgtattcat 480 agtggggttt ttcataataa atttactaac ggcaggcaac attcactcgg cttaaacgca 540 aaacggaccg tcttgatatc ttctgacgca ttgaccaccg agaaatagtg ttagttaccg 600 ggtgagttat tgttcttcta cacagcgacg cccatcgtct agagttgatg tactaactca 660 gatttcacta cctaccctat ccctggtacg cacaaagcac tttattttct caca 714 // ID Gypsy-28_MLP-I repbase; DNA; FNG; 5788 BP. XX AC AECX01001249; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-28_MLP_; KW Gypsy-28_MLP-LTR; Gypsy-28_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5788 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001249; Positions 96694 102481. XX CC Positions [4586-5065] - Integrase core CC 'GAATC' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1667..5260 FT /product="Gypsy-28_MLP-I_1p" FT /translation="MPWLQKNGHRINWKDGSLQPKNPTRPTIGNTTTIASS FT MPTTTPNRPCAVEGKARNIDEGALTEGQISETKPPQCEYDTKDSPILVPDD FT KLLYPRINFNQTPEEHDNLLRPVDNTRTTSNTTDLAALPIATDQTVSPEPK FT NTPDRLHAVEGNARNLGEGALIDSHINGIKPPQCEYDTDKALLVVTDDKLF FT HPRINSEQNLPTNDLRPVATTPVVLPSPKNTPDQLHAEEGNTRNLGKGAHT FT EGLISVIQPPQCEHATVLSPSVVTDDKLFLPLNSFKPEVCAMAPSWNISAK FT LAAEAGKDKIEKPVEELVPTRYHRYINMFRKRNAMTLPPHRRYNFRVDLVP FT GATPQAGKIIPLSLAEEKALDTMIDEGLEKGTIRRTKSPWAAPVLFTGKKD FT GNLRPCFDYRRLNALTVKNRYPLPLTMELVDSLRNAERYTSLDMRNGYNNL FT RVKEGDESKLAFICKRGQFEPLVMPFGPTGAPGYFQFFISDILRDRIGKDL FT AAYLDDLLIYTPAVVDHEQVVEEVLKTLAEHSIWLKPEKCRFSQKEIAYLG FT LLISKNQVRMDPLKVSAVRDWPAPKNVNQTQRFLGFANFYRRFISNFSKIT FT RPLHELTQDNVKFEWTPKRNESFEILKNAFTTAPVLKIADPYKAFILECDC FT SDFALGAVLSQEDESGLLHPVAFLSRSLVQAERNYEIFDKEFLAVVASFKE FT WRHYLEGNPNRLEVVVYTDHKNLETFMTNKQLTRRQARWAEMMGCFDFHIR FT FRPGKQGTKPDALSRRPDLEPTQEEKWSFGTMLKPENLSESSFNAEINSFE FT AWFEGEDIVHEEVDDWFEDDIKHGNPQETYGIDAIDRSPTTVGPLWTDIEI FT LNRIRLLSAEDARLKVIIDDIKNKREHVPKSYTVTDNVLYRDGIVEVPDDC FT QVKLEILRTRHDSLLAGHPGRAKTLSLVQRQYRWPSMKAYVNRYVDGCDSC FT IRVKSTTSKPFGSLEPLPIPAGPWTDISYDLIPDLPLSNNKNCILTVVDRL FT TKMSHFIPCTTEMDADELATLMLANIWKLHGSPKTIVSDRGSVFISKITES FT LNKQLGIQLHPSTAYHPRTDGQTEIVNKSIEQYLRHFVSYQQNDWEELLPL FT AKFAYNNSTHTSTDISPFKANYGYDLALGRIPTNGQCIPTVERRLTIMQEV FT QDELKETLKRAQTAMKDQFDRGVRPTPDWDIGDEVWLSS" XX SQ Sequence 5788 BP; 1744 A; 1437 C; 1386 G; 1221 T; 0 other; cattgtagcg tcttatcaag gcaaaccgag agaagcaaga agaagataaa ttactcatca 60 aagttagaga agtcaaaaag aagaaatcac aagtttaaaa gaagttgaaa attttttcaa 120 agtggaagaa gttgaaattt ttttaaagtc accctcaagt ggtataaaga agaagaagtt 180 ataatcaaaa ttactaaagc ccatcgaaac ccacgtccta tattacaccc tcgtaacact 240 ccatcatgcc tacctacgaa cccccaactg atcactccac cgcggcatcc gatacctccg 300 attatctgtc aacgggtgaa atggacgctg aaactagtca ccccgctccc accatcgact 360 tacaagccat ggccaaccaa atcgagcagc tgagtcaaca actggccgcc gagaccgcat 420 gccgaacagc agcggaggct gcgactaaac ctgcggcgaa gactccaaaa 
gtcgcgaccc 480 ctgataagtt cgacgggtcc agaggtgcca aggccgaggc atttgcaagc caagttggac 540 tctacatcat taccaacaag gcgctattcg agaacgacga tgccaagatc acgttcgccc 600 tttcgtacct gactggagat gctgtcaagt gggcgcagcc gttcttgaac cacgtgctga 660 atccgttgcc ggatgcaaag cctctcacct acgacgagtt cgcctgttca ttcgaagcgg 720 tattctttga cagcgaccgt cagaaacgcg ccgaggctgc catccgagtc ctcaaacaaa 780 cccgctctgc tgctgaatac acggttatct tcaaccaact ggcaccaact actcagtggg 840 aaactccaac gctcatcagc cactatcgtc aaggtttgaa gagagatgtt tgacttgcca 900 tgatacgtga atcatttgcc gacctcgaaa gcatcacagc cctcgcctgc gccatcgaca 960 acgatattcg cggcgagtat ggaccccctc aaccgaccac cagtcgccca tcggatcctg 1020 acgctatgga catctcgagc acccgttttg atatcacgtc tagcgagtat aagcgtaggg 1080 gagatgaaag attgtgttat cagtgtggaa gtagtggtca tattgctaag tggtgtaaaa 1140 gaagagagaa agggaaagga aggagtcagg gtaaaatggc ggagttacat gctaagatag 1200 cggcgttaaa gagtagattg ggagaaggga gcaggtcaga ggagtcaaaa aatggcggcg 1260 ctcaagagtg acggacgtgc cacacttgag ccgaggtgat agggaggtta ttgaagttga 1320 tagtagtgat tatgaaaaca agaaattaaa agacccacgt atttttacta ccattacttt 1380 atcaaataac ccccaagcca cgtcccatac agaccccaat cctagcccca atgtccaagc 1440 ccgtgcactc gtcgactgtg gttcgacgca cgaagtgctt ggtacccaat ttgccgaccg 1500 aaccagcctc cctgtgacag ctctgcctgc cgcaggcgat gtgtatggtt ttgatggtca 1560 accccgcagt gtagcgcacg acgcgaagct gtttattgac gaagatcaag agtcaacaag 1620 atttctagtc acaaagatca aggacgctta cgacgtaatc cttggaatgc cgtggctcca 1680 gaagaatggc caccggatca actggaaaga cggaagcctt caaccaaaga atcctacgcg 1740 cccaacaatc ggcaacacca ccaccattgc gtcgtctatg ccgacaacca cccccaatcg 1800 accttgtgcc gtggagggga aagctaggaa catagacgag ggggctctca ctgaaggcca 1860 gatcagtgag acaaagcccc cgcaatgtga gtacgataca aaagactcac ctatccttgt 1920 accagatgac aagcttctct acccccggat taactttaac cagacacccg aagaacacga 1980 caacctactc agacctgtgg acaacacaag gaccacatca aatacgaccg accttgctgc 2040 gttgccaatt gccactgatc aaacagtgtc gcctgaaccg aaaaacaccc ctgatcgact 2100 tcatgccgtg gaggggaatg ctaggaatct aggcgagggg gctctcattg acagccatat 2160 
caatgggata aagcccccgc aatgtgagta tgatacggat aaagcattgt tagttgtaac 2220 ggatgacaag ctttttcatc cccggattaa ctctgaacag aacctaccta cgaacgacct 2280 gaggccagtt gccactacac cagtagtgtt gccttcacca aaaaacaccc ccgatcaact 2340 tcatgccgag gaggggaata ctaggaattt aggcaagggg gctcacactg agggccttat 2400 cagtgtgata cagcccccgc aatgtgagca tgctaccgtc ttgagccctt ccgtcgtcac 2460 agatgacaag ctttttcttc ccctgaatag ctttaaaccc gaagtctgcg cgatggcacc 2520 gtcatggaac atttccgcca aactagcggc cgaggctgga aaagacaaga ttgaaaaacc 2580 agtcgaagaa ttagtgccaa cacggtatca tcgttatatc aacatgttca gaaaacgaaa 2640 cgccatgacc ttaccacctc accgacggta caacttccga gtggatctcg ttccgggtgc 2700 gacgccgcaa gcgggaaaaa tcataccgtt atcactggct gaggaaaaag cattagatac 2760 aatgattgat gaaggtctag agaagggaac tatccgacgt accaagtctc catgggccgc 2820 gcctgtgctc ttcaccggca aaaaagacgg caacttgcgc ccgtgctttg actaccgacg 2880 attgaacgca ttgaccgtca agaatcgcta ccctttgcca ttaaccatgg aattagtgga 2940 cagtctgaga aacgctgaaa ggtatacttc actagatatg cgtaatggat ataataactt 3000 acgagtgaaa gaaggggatg agtcaaagct ggcctttatt tgcaagaggg gccaattcga 3060 acccttggtg atgccatttg gcccaaccgg ggcccctggc tattttcagt tcttcatatc 3120 cgacatcctt cgcgaccgga ttggcaagga cctagcagca tacctcgatg atttactaat 3180 ctacacacca gcagtagttg accacgagca ggtagtagaa gaagtcctca agacattagc 3240 ggaacactcg atttggctca aaccggaaaa atgccggttc tcccagaaag agatagctta 3300 ccttggacta ctgatttcga aaaatcaagt tcgtatggat cccttgaaag tatcggcggt 3360 gagagactgg cctgcaccca agaacgtcaa tcagacacaa cgattcttgg gctttgccaa 3420 cttctacagg cggtttatca gcaatttttc taagatcaca cgcccactac acgagcttac 3480 ccaggacaat gtgaagttcg aatggacccc caagcgcaac gagtcattcg aaatcctcaa 3540 gaatgctttc acaacggcac cggtgttgaa gattgctgat ccttataagg cctttatcct 3600 cgagtgcgat tgttctgact tcgcattagg ggcggtccta tcccaggaag acgagtctgg 3660 actactacac cccgtcgctt tcttatctcg atccctggta caggccgaac gtaactacga 3720 aatattcgat aaggaatttt tagcagttgt ggcctccttc aaggaatgga gacactattt 3780 ggaaggaaac ccaaatagac tggaggtggt agtatatacc gatcacaaga acttagaaac 3840 gtttatgact 
aataaacagc ttaccaggag acaggccaga tgggcggaga tgatgggatg 3900 ctttgacttc catatacgat tccgacctgg taaacaaggt actaaaccag acgcactgtc 3960 acgacgcccc gacctggaac ctacccagga agagaaatgg tcatttggaa cgatgctaaa 4020 accagagaac ttgtcggaat cctcattcaa cgcggaaatc aacagttttg aagcatggtt 4080 cgaaggtgaa gacattgtac atgaagaggt tgacgattgg tttgaagacg atatcaaaca 4140 cggaaatcca caagagacgt atgggatcga tgccattgat agatctccaa cgactgtagg 4200 acccctgtgg acagacatcg aaatcctaaa ccggatcagg ttgttatcag ctgaagatgc 4260 aagactcaag gttatcatag atgacatcaa gaacaagcgt gagcacgtac caaaaagcta 4320 cacagtgact gacaacgtgc tataccgtga tggaatcgtc gaagttcctg atgattgtca 4380 ggtaaaactg gaaatcctaa ggacacggca cgatagccta ttggcaggcc acccgggtcg 4440 agcgaaaacg ttaagcctag tacaacgcca gtaccggtgg ccatctatga aggcctacgt 4500 gaaccgatac gtagatgggt gcgattcgtg cattcgagtc aagtcaacaa cgtcgaagcc 4560 atttggatcc ctcgagccgt tgccgatccc agcgggaccc tggacagata taagctatga 4620 cctaattccg gacttacctt tatcaaacaa caagaactgc atactgaccg tagttgaccg 4680 actaacgaaa atgtcacatt ttatcccttg caccacggag atggatgcag acgagttagc 4740 aaccctgatg ttagccaaca tatggaaatt acacggctca ccaaagacta tcgtatctga 4800 tcgtgggagt gtcttcatat ccaaaatcac cgaatcgtta aataagcagc tgggtattca 4860 attacaccca tccacggcgt atcatcctag aactgacggt caaacggaaa ttgttaacaa 4920 gtcaattgag cagtatttaa ggcattttgt tagttaccaa cagaacgact gggaagaact 4980 cttacctctt gccaaatttg cgtacaacaa tagtacacat acttcgacag acatatcccc 5040 attcaaggcc aactatgggt acgaccttgc tctgggccga attccgacca acgggcaatg 5100 tatcccaacg gttgagagac gactaaccat catgcaggag gtccaggacg aactgaaaga 5160 aaccttaaaa cgagctcaaa cggcaatgaa ggaccaattc gaccgagggg taaggccaac 5220 tccagactgg gatatagggg atgaggtatg gctgagtagt tgaaatatct cgacaacacg 5280 cccgagcgcc aaactagacc accggtggtt gggacccttc agcatcgcca agaaagtatc 5340 accgtcagca taccgattag cattaccagc aacactagga cgagttcacc cggtattcca 5400 cgtgtcggtt ctgaggaaac acaaccctga ttcgatcgaa ggacgtattc aggaaccacc 5460 agaattagtc cagatagaag gcgaggatga gtgggaagtg aacgaggtcc tggataaaag 5520 aagaagacgg ggaaaagatg 
agtatttgat cagttggaaa ggttttggaa gagaagaaga 5580 ttcatgggaa ccaatggcta accttaataa cgcctcagaa gcgatcgcca agtttgatcg 5640 acagtttccg aaggcagtgg aagaacacag aagaacaaga cgggtatgag agaggggtaa 5700 ggttttttcc ccacggggtt ttttaatacc gccccgggga ggagggcagg gccgcgaaca 5760 gggagcccgg gcctaaaacg ggggatag 5788 // ID Mariner1_AO repbase; DNA; FNG; 1873 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE DNA transposon, Mariner superfamily, Tc1 clade - a consensus DE sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Mariner1_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-1873 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-1873 RA Kapitonov V.V. and Jurka J.; RT "Mariner1_AO, a family of Mariner/Tc1 DNA transposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 29-29 (2006). XX DR [2] (Consensus) XX CC This is a family of Mariner DNA transposons (Tc1 clade). It is CC 74% identical to Tao1 (491 mismatches: 450 transitions and 0 CC gaps). This transposon is a fine example of a strong RIP and fast CC RIP-induced change of the AT content (from 60% in Tao1 to 82% in CC Mariner1_AO). A transposase-coding ORF is destroyed by many CC stop codons introduced by RIP. 
XX SQ Sequence 1873 BP; 781 A; 166 C; 160 G; 751 T; 15 other; acgtaatcgg taagcgagct gcgcatgtaa gcgvgctgcg taccacacgc gttttcdacg 60 cgtcttaatt tcttagagaa tttaaatatt agctatacca ctaaaaatac ctaaattntt 120 aaaattacgn gttgagtaga aaggctggat tctattagct atattagctt taaaaaaaaa 180 taaatttagt agctttagaa aggcagctaa gatatttaat atatctaaat ctattctata 240 taatcgctta aaaagagctt tttttagaga tgatcnacgt gttaatagtt ataaattaac 300 tactagtaaa aagaaattac ttaaataata gattctatta ctaaataaat atagagtatc 360 tccccaacct gtatatatat aagaaatagc taatattcta tttttaaagt rtratactac 420 ttctttytyt aytattatag aaaaraaata gatatataay tttattaact atatatytar 480 ayttaaattc tactttatat attactataa ttattaatat attaaaataa agaattctaa 540 gattctaaat atttagttta aatagataaa taaaactatt taaaaatata gtatagtctt 600 aaataatatc tataatttta ataaaataag atttataata gatctaatag ctatagctaa 660 agttattatt agatctaata tactaaagaa actattttta ttatagttaa aaaactagaa 720 atagattact actatagaat atattaattc taataaatag actctatttt tctatattat 780 ttttaaaaat aagatatata tagagtttta gtataaaaat aataagattt tttttaatta 840 aaaaattaag attagttcta atagctaaat aaataataag attaatctat attagctata 900 gaattatttt atattttata ctattagcta atctatagaa aaatattatt tatttatttt 960 aaatagctat aatagttatt taatatctta atttaattaa atctatataa agaataatat 1020 tatactaatc tatatatcta tatatatttt ttataagctt taattattta atattactat 1080 tttcttacta ttaaaagatt tatataaaga tctaattaaa tctaaaatat attatagatt 1140 taattatatt aataaactta attttttaaa gatatatttt tagatctata tagagatctt 1200 tattaatagt aatatctata gtatttttaa agctattagt ctagtttctt taagctttaa 1260 atatattttt ttttaactaa taatttagct taagatctct atatctctag ataatagatc 1320 tagtagttaa aatagtttag tgtctaaaat attttatata attaaatagt taaaaaaatt 1380 agaatctata tttaaaaagc tacttaaaaa ttaatcttat agttttaatt tatctattaa 1440 aactagttta aattaattaa ttaaagatat taaataagct ataaataata taataatatt 1500 aaaatattaa ttaaaataat attagactat atataaaaaa tagcttaaaa aataaaaata 1560 ttttactaaa tagattattt ttaaaaaaga tatattaata tctaaagtct aaattcttat 1620 taaacgctaa aattagatag ataaagctat 
atagatagat agatctctat ctattaaatt 1680 agcttctaga cacgctctat ctaaatattc tagctatagt atatagagtt ataaaattat 1740 atactatcta aattaataga actacttatt tttatataga agtctaaatt tatagtattt 1800 taattatttt aattttttat atagcgcgtg gtacgccgct cgcttacatg cgccgctcgc 1860 ttaccgatta cgt 1873 // ID Gypsy-6_RO-I repbase; DNA; FNG; 4839 BP. XX AC AACW02000056; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_RO_; KW Gypsy-6_RO-LTR; Gypsy-6_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-4839 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000056; Positions 299749 304587. XX CC 'TCTTG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 35..4816 FT /product="Gypsy-6_RO-I_1p" FT /translation="MSTDQENPMVDPVVTSSPGNEGSIISDLSSPLSEPQG FT SMASKYASEPTPMDTDTATPAPKVLESTADIIKRKFQIIENLKNHAKLHFM FT EYMVLNEDDSDPAAALIAHQKFKECEEKVNRAKEALKSFTAMFEEVKPPVD FT GPSNHYLRLVVPNDLPTLQLKGDAIWRKKAECYDSAYDFCNTFETVLRAHG FT QALNSNWERLLPLCMNPEQVSWCREALLEKNLSWKQVRPMILDHFDTPYRK FT FLLMVEVGSMCQGTYESNREYSNRFQKMRREAGMEDSTLLAVTYFASLKAS FT VKSVAQLAISSHFGSRLPSSITQIIDLVLASGEDSAFAAKTPHKRARPMND FT DERTPHNAGNTSKAPFGTNKVLANKFKTPLANKGKYKPKPCTYCRKDWFQG FT HKCKEFLDAKNNNSNNKDNVVHINRMAVRTENNVSDEEDECEINSPLNRMA FT LDCKSKKEKELIVTRDFKSTNDNSITFPILVNNRKAITILDTGANFSSINK FT KFCFQNNFLIIPPKNNNVIKLADSDSTTKRIGLTEVNIKCNGKSFNHTFEV FT MNLTNDHDMSIGTDFMSKLGIGLTGLPYKWDDSKVSLDNSKVPEHKFNDFS FT ELLNKVDDEMSELENCPAGSSNEYQQALNYIKPLIQLNQAIPKGSFCTIPE FT SVVSIETPEGVTSYRRPYPVPLAYHKIVQDQIDEWLENGVIKRAPANTEWN FT SALTVVKKTNAKGEVTGYRVCHDPRHVNALLKSIYRMPLPKISELFEELKG FT ATVYSTLDLKSAFNSLKMNEEHAHKLAFSWNSVQYIPVGTPFGLKHVSSVM FT QRTMSIALENMSFAKCFVDDIVVASTSIEEHKQHLKQVIDKLTKVNLKLNP FT DKCTFFQRKINLLGFRISPKGVSLDLRKVANVQEFPVPKTGKDIMRYCGLI FT NYFRPHIPKASALLAPLDALRNEKSLIKLWNDKHQQCFDNLKKVLLENVIL FT SYPDMNQPFYVATDASNVGIGVILYQKINGKVHYISMIAKTLSKSERNYSA FT TKRELLAVVYALKRFHKYLWGNKFTLYTDHKALTYLHTQKIANAMLINWFD FT TLLQYDFKVVHLPGVDNILPDTLSRLYEIEDPVNELGGDKPHLLNRAAVKL FT PSEDKGEYMTPPDPEERKLLLLREHIKGHFGSDAIYHALKRKGIYWSNLKN FT EAVELVKSCIPCQQFNIIKKGYNPLRPITATLPGDSWGIDLAGPMKTSLNG FT NNYLLIMVDIATRYCVLKPLPDKQSMTIVKALIDVFSHYGFPRVIQSDNGG FT EFVNELMQLLAENAGYDHRLISSYHPRANGVSERWVQTAVNAIKKQIEGAK FT ADWDLYVPSTQLYLNGKHNERTKTPPFTLMYARNMNDFEDFSKEKNKATKE FT EINKQLLLNIKRMTEIVFPAIYERTKIITNNQKEKFDASHKLITIPENSYV FT MVKVNIKTNKLDPNYEGPYKVKRITQGGSYVLEDEMGELLSKNYPPSALKL FT ISQDEVISTDKFYQVEAILAHKKLKGKYLYKCRWKGYDKSDDTWEPASNFT FT DPKFITEYWQRVGIVPEDIKYRYDLQNNKGKSNNVKLINTSHTGKRKTRTD FT LNENSHSSTTSSNKVNGNINSKRSRRY" XX SQ Sequence 4839 BP; 1627 A; 909 C; 881 G; 1422 T; 0 other; ttttactttt tgaattccaa actttatcgc aaatatgtct actgatcaag aaaaccccat 60 ggttgacccc gttgtcacta gctcccctgg aaatgaaggt tctatcatct 
ctgatctttc 120 ttctcctctg tccgaacctc aaggaagtat ggctagcaag tacgcttctg aacctacccc 180 tatggatacg gatactgcta cacctgctcc taaagtgctg gaatccactg ctgatatcat 240 aaagcgcaag ttccaaatca ttgaaaatct caagaaccac gccaaattgc actttatgga 300 atatatggtt ctgaatgaag acgattctga tcctgctgcc gctcttattg cacatcaaaa 360 gttcaaagag tgcgaagaaa aggttaatag agcaaaggag gctctcaagt cgttcactgc 420 catgtttgaa gaggtgaaac cccctgttga tggtcccagt aatcattatc ttcgtctggt 480 tgtaccaaat gatctgccta cgctgcaatt aaaaggtgat gcaatttgga gaaagaaggc 540 tgagtgctac gacagtgcgt atgatttttg caacaccttt gaaactgtct tgcgcgctca 600 tggtcaagcc ctcaactcca actgggaacg tttgctgccg ctctgcatga accctgagca 660 agtctcttgg tgcagggaag cactgctcga aaagaatctc tcttggaaac aagtccgccc 720 tatgatcttg gatcatttcg acacccctta ccgcaagttc ttactgatgg ttgaagtcgg 780 ttccatgtgt caaggcactt acgagagtaa tagggaatac tctaaccgtt tccaaaagat 840 gcgtcgtgaa gctggtatgg aagacagcac tctcctggct gttacctatt ttgcttcatt 900 aaaagcatct gtcaagtctg tcgcgcaact agcgatatcc tctcactttg gctctcgctt 960 acctagctcg ataactcaaa tcatcgactt ggtacttgcc tctggtgaag attctgcttt 1020 tgctgccaag actccccata aacgtgctcg tcctatgaac gatgatgaac gtacacctca 1080 taatgctggt aatactagta aagctccttt tggaactaat aaagtattgg ccaacaagtt 1140 caagacacct ctggcaaata agggaaagta taagcccaag ccatgtactt actgccgcaa 1200 ggattggttc caaggtcata agtgcaaaga gttcttagat gctaagaata ataatagcaa 1260 taataaggac aatgttgtac atataaaccg aatggccgta agaactgaaa ataatgtatc 1320 cgatgaagag gatgaatgtg agataaacag ccctctcaac cgtatggcac ttgactgtaa 1380 gtctaaaaaa gaaaaagaat taatagttac tcgagacttt aaaagtacta atgataattc 1440 tataactttt cctatacttg taaacaatag aaaagccatc acaattttgg atacgggagc 1500 aaatttttcc tctataaata aaaaattttg tttccaaaat aattttttaa ttattcctcc 1560 taaaaataat aatgtcataa agttagctga ctctgactct actactaaga gaattggttt 1620 aactgaagta aacattaagt gtaatggaaa aagttttaat catacctttg aagtaatgaa 1680 tttaacaaat gatcatgata tgagtattgg tactgatttt atgagtaaac ttggtattgg 1740 tcttactggt cttccttata aatgggatga tagtaaagta tctttagaca attccaaagt 1800 acctgaacat 
aaattcaatg actttagtga actcttgaat aaagttgatg atgagatgtc 1860 tgaattagag aactgccccg ctggttcatc taatgaatat caacaagcat taaactatat 1920 taaaccatta attcaattaa atcaagcaat acctaaaggt tccttttgta ctataccgga 1980 atctgttgtc tcgattgaaa ctcctgaagg tgtcacatct tatcgacgtc cctatcctgt 2040 gcctttagca tatcataaga tagtacaaga tcagatagat gaatggctgg agaatggtgt 2100 aatcaagaga gctcctgcta acaccgagtg gaactcagct ttgactgtag taaagaagac 2160 caatgctaag ggtgaagtaa ctggttaccg tgtgtgccat gatcccagac atgttaatgc 2220 tcttctaaag tcaatttaca gaatgccctt acctaaaatt tctgaattat ttgaggaatt 2280 gaaaggagca actgtatatt ccactttgga ccttaaatct gcttttaact ctttaaaaat 2340 gaatgaagag catgcacaca aattagcttt ctcttggaac tctgtacagt atatacctgt 2400 tggtactcct tttggtttaa agcatgtatc cagtgtaatg cagagaacta tgagcatcgc 2460 attagaaaac atgtcctttg ctaaatgttt tgttgacgac atagttgtag catctacttc 2520 cattgaagag cataagcaac acttgaagca agtaatcgat aaacttacta aagtaaatct 2580 caagttaaat cctgataaat gtacattttt tcaacgcaaa attaatttac ttggtttcag 2640 aatatctcct aaaggtgttt ctcttgacct tcgtaaagtt gctaatgtgc aagaatttcc 2700 tgttcccaag actggaaagg atataatgag atactgtgga ctcataaatt atttccgtcc 2760 tcatattcct aaagcatccg cattattagc tccattggat gcgttaagaa atgaaaaatc 2820 attgattaag ttatggaatg ataagcacca acaatgtttt gacaacttaa agaaagtatt 2880 gttagaaaat gtaattcttt cataccctga tatgaatcaa cctttctatg tcgctaccga 2940 tgcctctaac gttggtatag gtgtcattct gtatcaaaag ataaatggga aagtacatta 3000 tatctctatg atagctaaaa ctttatctaa aagtgaacgt aactattcag ctactaagcg 3060 agaattatta gcagttgtat atgctctgaa gaggttccat aagtatttat ggggaaacaa 3120 atttaccctg tatactgatc ataaggcact cacttatctt catacgcaaa agattgctaa 3180 cgctatgctc ataaattggt tcgatacact gctccaatat gattttaaag tcgttcattt 3240 accaggtgta gataatattc tccctgatac actttctcgt ctatatgaaa ttgaagatcc 3300 tgtcaatgaa ctgggagggg ataagcctca tcttttgaat agagcagcag taaaactacc 3360 cagtgaagac aaaggtgaat acatgacacc tccggatcca gaagaacgta aactacttct 3420 tctacgtgag catattaaag gacactttgg ttctgatgca atttaccatg ccttaaaacg 3480 taaaggtatc tattggtcca 
atctgaaaaa tgaagccgta gaacttgtta agagttgtat 3540 tccatgccaa caatttaaca taatcaaaaa gggttataat cctctaagac ccattactgc 3600 cacattacct ggagatagtt ggggcattga cttagctggt cccatgaaaa cttccttgaa 3660 tggcaacaat tacttattga ttatggttga cattgctaca cgttactgtg tattgaaacc 3720 cttgcctgat aagcaatcta tgacaattgt aaaagcttta atcgatgtct tctcgcatta 3780 tggttttcca cgtgtaatcc aatctgacaa tggtggtgaa tttgtaaatg agcttatgca 3840 gttacttgct gaaaatgctg gttatgacca tagactcata tctagttatc atcccagagc 3900 taatggagta agtgaacgtt gggtacaaac tgctgtaaat gcaataaaga aacaaatcga 3960 gggtgctaaa gcagactggg atttatatgt tccttctaca caactctact tgaatggtaa 4020 acataatgaa cgtactaaga cgccaccctt cactttaatg tacgctcgta acatgaatga 4080 ctttgaagat tttagcaaag aaaaaaataa ggcaactaaa gaagaaataa ataaacaact 4140 ccttctcaat ataaaaagaa tgacggagat tgtattccct gccatatatg aacgtacaaa 4200 aataatcaca aacaatcaga aggaaaagtt tgatgcgtca cataaactaa ttacgatccc 4260 tgaaaatagc tatgtaatgg taaaagtaaa tattaaaacc aacaagttag atcctaacta 4320 tgaaggacct tataaagtaa aaagaattac tcaaggagga tcgtatgtac tagaagatga 4380 aatgggtgaa ttgttatcta aaaactaccc accttctgcg ttaaaactca tctcacaaga 4440 tgaagttata tcaacagata aattctatca ggtagaagcc atcttagcac ataaaaagct 4500 taaaggcaaa tatctttata agtgcagatg gaaaggctat gataagagcg atgatacctg 4560 ggagcctgcc tctaacttta ccgatcctaa attcattact gaatattggc aaagagttgg 4620 tatagttcct gaagatatta aatatagata tgaccttcaa aataataaag gaaaatctaa 4680 taacgtgaag ttaataaata cttctcatac aggcaaacga aagaccagaa cagatcttaa 4740 tgaaaacagc cactcttcta caacaagtag taataaagta aatggcaata taaattctaa 4800 aagatctaga cggtattagt tatacctggg aggggatta 4839 // ID Gypsy-2_TMe-LTR repbase; DNA; FNG; 539 BP. XX AC CABJ01000834; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_TMe_; KW Gypsy-2_TMe-I; Gypsy-2_TMe-LTR. 
XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-539 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (13-FEB-2011). XX DR Genome; CABJ01000834; Positions 153747 154285. XX SQ Sequence 539 BP; 119 A; 87 C; 105 G; 228 T; 0 other; tgtaaccggt atgggtactt ggttattggt tttgttttat cttttgtttt aagtatcgct 60 ttgatacttg gtttgtttta agtatcgttt gatacttggt tttgttttat gttttgtttt 120 aagtatcgct ttgatacttg gtttgtttta agtatcgttt gatacttggt tttgttttat 180 ggtttgtttt aagtatcgtt tgatacttgg ttttgtttta tgttttgttt taagtatcgc 240 tttgatactt aaagagtttt accaaagagt atcaattgat acccttgatt agtttcgttc 300 atgctagaca tgaacagatt tgtgtattta aagaccgcct gtttctacga tagaaaaggc 360 agacttcgac agacagtatt tcaatatact tggttgaacg gttacttctt tgttcctcag 420 tacgccctcg aagcctcctt tagttcccag ctctcgcctc ctctctggaa gagcaggaca 480 cctgccaact atcgtatcta ttatcgataa cgtgattagc accttgcgaa caagttaca 539 // ID Copia-21_MLP-LTR repbase; DNA; FNG; 515 BP. XX AC AECX01002567; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-21_MLP_; KW Copia-21_MLP-I; Copia-21_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-515 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002567; Positions 1887 1373. 
XX SQ Sequence 515 BP; 103 A; 104 C; 90 G; 218 T; 0 other; tgttggtgtt tgttacgacg gcgaatagtg ttgcatgatt gatctttacg tagtgtgtga 60 tcctttttct gttttctgat tctgatgatc tgttgcgacc tgtgatctta tttgttatgt 120 tttctactat ttgatttgtt gtcatgcctg ctcttttctt gtctagagtt taactcttcg 180 ctgcgtacat gtttttcttt ctcttcataa gctgcctatg agtcaaaggg tttgatcctc 240 atcgattcaa accctttatt cactttcact catctgcaat atccttactt acgacgtaat 300 tgcattaccg tgaatgactc ataggtaagc tcatttcagt ctagtagtgt tttcctctct 360 ctatctctca tttatttact gtgtgtgtgt gcgtgtttgt cctatgcgtg tttgtctgag 420 aaataaatgt tatgatcgat tcaaaccctt tattcacttt cactcatctg caatatcctt 480 acttacgacg taattgcatt accgtgaatg actca 515 // ID Gypsy-75_MLP-LTR repbase; DNA; FNG; 166 BP. XX AC AECX01001083; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-75_MLP_; KW Gypsy-75_MLP-I; Gypsy-75_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-166 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001083; Positions 88893 89058. XX SQ Sequence 166 BP; 41 A; 45 C; 26 G; 54 T; 0 other; tgttatgagc cgtattaggc tacttcagat atgcttatac atagtagacc acttatcttg 60 tacgctctcg agcacagacc atgttcctca cgttgcaatc tactcagtac tacaagggca 120 ctattgatca tctctcatct ctctctatct acgtcgagtc cttaca 166 // ID Gypsy-56_MLP-I repbase; DNA; FNG; 5713 BP. XX AC AECX01002490; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-56_MLP_; KW Gypsy-56_MLP-LTR; Gypsy-56_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5713 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002490; Positions 15884 21596. XX CC 'CCTTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 354..1304 FT /product="Gypsy-56_MLP-I_3p" FT /translation="MSSISLEDVMKQLAEMNMRLNEETSLRQKVEKELQQL FT KQEKHKSQQQRAAEDQFNPTSFQSIPNPTMQAPVQQMQSGQTALPRTHKVA FT TPDKFDGSKGTKAEIFLNQLGLYMQLNSSVFANDQAKVAFALSYTTGKANI FT WGQSFTDQLLNGDNAHMVTWQRFVESFKGTFFDSKRVSKAEKEIRALKQTK FT TVSDYWIKFSELSLVVKWPEEVLRSQFKQGLKSEISVYMIRDVFESVEEMA FT RVAIKLDNKIHKRTPEGSSVFTSGNQNASMSTPTDPNAMDCSAYRLNISAE FT EYKRRGSNGACYGCGKDNHFIADCP" FT CDS join(1585..3915,3919..5019) FT /product="Gypsy-56_MLP-I_1p" FT /translation="MKDTRIIDLISLFDPKTATTKTARALVDSGATHEAMS FT RKFASLSFFETTPLPQKRSVTGFSGHESIITHTGDYRVNDNDNETTFLITY FT LRDKYDVILGMPWIRRNHKTIDWELGRLRPSTNHEIAAVRSASSLPKTTSM FT DHDRRPERIARFSDEGVQVSKDLLSPPQCEFDIPFLNQVEKSVSDQDSPLE FT ISYNTETQTTTVTKPCRLLATKELQPKTTLPPPSQPLQDQEWSPKREARNG FT DMGVKLTSLCKPLQSEFDTTNVSSVLRKTGKHVSLGQRLNLTGTRSKSQTS FT YANIHRPTKAMIDAAETSWNVSTRLAVEASKGRTEKTAAELVPECYHEYLE FT MFEKSNSDVLPPHRPYDFRVDLLPNATPQAGRIIPLSPKETEVLSDMIEKG FT LANGTLRRTTSPWAAPVLFAGKKDGNIRPCFDYRKLNALTVKNKYPLPLTM FT ELVDSLLDADKFTSLDMRNGYNNLRVREGDEAKLAFICKSGQFEPLTMPFG FT PTGAPGFFQFFIQDILKTHLGKDVAAYQDDILIYTKPGVDHQKVVKEVLDI FT LKAQNVWLKPEKCKFSKKEIAYLGLIISRNQIKMDDTKVKAVRDWPAPKNL FT SEVLTFLGFSNFYRRFISHFSKIARPLHELSQDGVKFEWTTGRNEAFENLK FT LAFTSAPVLTIADPYRPFVLKCDCSDYALGAVLSQVSSLDGELHPVAFLSR FT LLIKAERNYEIFDKELLAVISAFKEWRQYLEGNPNRLNVIVYTDHKNLQSL FT MTTKELTRRQARWAEILGSFDFEIRFPGKKSAKPDALSRRPDLKPEDGDKL FT TFGQLLKPENLPCDAFIEELDLVDSWFITEDTSPIQIEELNSQNEFWTDER 
FT IIEEIKQKSIEDERVLDIIKLCNEMPNSKLLGGYSLNDDVLYFNDKLVVPD FT DDNIRLQILRSRHDSLLAGHPGRMRTLMLIKRSFYWPSMQRYVNQYVDGCH FT SCQRVKSRTSKPFGSLQPLPIPEGPWLDICYDLITDLPSSNDFDSILTVVD FT RFTKMPHFVACKKSMNSEELAKLMLNSVWKIHGTPRTITSDRGNIFISKIT FT KEMNNQLGIKTQASTAYHPQTDGQSEITNKAVEQYIQHFTSYKQDDWQDLL FT PMAEFAYNNNLHVSIGMSPFKAILRVRCELHGDPEY" XX SQ Sequence 5713 BP; 2003 A; 1148 C; 1197 G; 1365 T; 0 other; tattgcaaca tcttaaactc agtagaatag acgcattcaa gatcaaacga aggaaagaag 60 aaataaagaa aatcgaagaa gttatcaaaa ttttaaattc aaatcaagaa gaaacaaagt 120 tcagatctca agtttaaccg aagcatttaa agtagatatc agtacagatc aaaaaagtta 180 aagtttaaat caacaatagg attacaaatc aagtataaga agaagatcga accagtaccc 240 cgcaaaactc taattatcaa aaccctctca ccacgcctga attcaagatc ccccacgaag 300 atccttcgga agaagacgaa cagtctactg aaggaactca actagctgga actatgtcgt 360 ctatcagctt agaagatgtt atgaaacaac tggctgaaat gaacatgaga ttgaatgaag 420 aaacatcctt gagacaaaaa gttgaaaaag aactacagca gcttaaacaa gagaaacaca 480 aatcccaaca acaacgtgca gctgaagatc aattcaaccc tacgtcgttc caaagtattc 540 ctaaccctac gatgcaagcc cctgttcaac agatgcaatc aggccaaacc gctctaccac 600 gaacgcataa agtcgctact ccagacaaat tcgatggctc taagggaact aaagcggaga 660 ttttcttaaa tcagttaggg ttgtacatgc agcttaattc gtcggtattt gcaaatgatc 720 aggcaaaagt agcgttcgcc ctgtcctaca caacgggaaa ggcgaacatc tgggggcagt 780 cgtttacgga tcaactccta aatggagaca atgctcatat ggtcacgtgg caacgctttg 840 tagaatcttt taaaggcaca ttctttgata gcaaaagagt ttcgaaggcg gaaaaggaaa 900 ttcgagctct gaaacaaacc aagacagttt cggactattg gatcaaattt tcggagttat 960 cgttggttgt taaatggccc gaggaagttt tgagatcgca attcaaacaa ggcttaaaat 1020 ccgaaatttc agtttatatg attagagatg tctttgagtc tgtggaggaa atggcaagag 1080 tagcgattaa attagacaat aagattcata aacgtactcc agagggatcg agtgtattta 1140 caagtggtaa tcaaaacgcc tcaatgtcta cgccaacgga ccccaacgcc atggactgtt 1200 cagcgtaccg tctaaacata tcagctgaag aatacaaacg tagaggttca aatggagctt 1260 gttatggatg tggaaaagat aatcacttta ttgcggattg tccctagaat agaagacaaa 1320 gtggaagagg aggatattca aattcaagag gcagttattc aacttcaaga ggaggataaa 1380 
ttcaaggatc agtgaattag aagctcaatt gaaagcacgt atagatgaac tggatgctaa 1440 aatcacggga ggaagaaatg agagtaaaga agaaggaaga gcagatgtgt caaaaaatgg 1500 aggagcttga gtttgaatgt tgtgcatacc tcgagcttac aaaatttatt agggttagaa 1560 gataatgtca ttgatgaatt agaaatgaaa gacactcgta ttattgattt gatatcactt 1620 tttgacccaa agaccgccac aacaaaaact gcgagagctt tagtggatag cggagctact 1680 cacgaagcca tgagcagaaa atttgcatct ctttccttct ttgaaaccac cccactaccc 1740 cagaagagaa gtgtcactgg attcagtggc catgagtcga taatcacgca cactggtgac 1800 taccgagtca acgacaacga caatgagacg acattcttaa tcacttactt gcgagacaag 1860 tacgatgtta ttctcggaat gccatggatc agacgaaatc acaagaccat cgattgggaa 1920 cttggacgcc tcaggccatc aaccaatcac gaaattgcag ctgtaaggtc agcttcgtcc 1980 ctgccgaaaa ccacctcgat ggaccacgat aggaggccag agaggattgc taggttcagt 2040 gacgaggggg tgcaagtctc aaaagactta ttatcacccc cgcaatgtga gtttgatatt 2100 ccttttttaa accaagttga aaaatcagtt agcgatcaag attctccttt agaaatctct 2160 tacaacacag aaacacaaac tacgacagtg acaaaacctt gccgactact agccacgaag 2220 gaacttcagc cgaaaaccac tctgccacca ccgtcacaac ccttgcagga ccaagaatgg 2280 agcccaaaaa gggaagctag gaatggtgat atgggggtta agctaacaag cttgtgcaaa 2340 cccctgcaga gtgagtttga tacaaccaat gtctcctccg tccttagaaa aactggcaag 2400 catgtttctc ttggacaaag attaaacctc acaggaaccc ggtcaaagtc acaaacatct 2460 tacgccaaca tccatcgacc gacgaaagcg atgatagatg cagctgagac atcttggaat 2520 gtatcaacaa gactagcggt agaagcctct aaaggtagaa cagaaaagac agcggctgaa 2580 ttagtgcctg aatgctatca tgaataccta gaaatgtttg aaaaatcaaa ttcagatgtc 2640 ttacctcctc atagacctta tgacttccga gtagacctac ttccaaacgc aacacctcaa 2700 gcaggaagga tcataccctt gtcaccgaaa gaaacagaag tactcagtga tatgattgaa 2760 aaagggttag caaacggaac actaagaaga actacttcac cttgggcggc tccggtatta 2820 tttgcaggaa agaaggacgg taatatcaga ccctgtttcg actatagaaa actcaatgcc 2880 ttaactgtaa agaacaaata tccattaccc ctgacaatgg aattagtaga cagtttactc 2940 gacgcagaca aattcactag cttagatatg cgtaatggct acaacaactt gagagtaaga 3000 gaaggagatg aagccaaatt agctttcata tgcaaatcag gacaatttga acctttgact 3060 atgccctttg 
ggcctacagg tgctccggga ttctttcagt tcttcatcca ggatatactc 3120 aaaactcacc ttggaaaaga cgtggcagct tatcaggatg acatattaat ttacacaaag 3180 ccaggagtag atcatcagaa agtagttaag gaagtcttag atatattaaa agctcaaaat 3240 gtttggctca aaccggagaa gtgtaaattt tcaaaaaaag agatagctta cttaggatta 3300 atcatcagta ggaatcaaat taagatggat gacaccaagg ttaaagcggt acgagactgg 3360 ccagcaccaa aaaacttatc tgaagtttta acgttcctgg gattttcgaa cttttaccgg 3420 agatttatca gtcacttctc aaagattgca agaccacttc acgaattatc tcaagatgga 3480 gtgaaatttg aatggacaac tggaagaaat gaagcttttg agaatttaaa attggcattc 3540 acgtcggctc cggtattaac aattgctgat ccttatagac cgtttgtact caaatgtgac 3600 tgctccgatt atgcactagg agcagtccta tctcaagtct caagtctaga tggagaatta 3660 catccagtgg ctttcttgtc tcggttgtta atcaaagcag aacgcaacta tgaaattttt 3720 gataaggaac tgctagcggt gatatcggca ttcaaggaat ggcgccaata cttagaaggc 3780 aatccaaatc gtttgaatgt tatcgtatat accgatcata agaatttaca atctttgatg 3840 acaacaaaag aactcaccag aagacaggct cgttgggccg aaatacttgg cagctttgac 3900 ttcgaaatca gattttgacc gggaaagaag tcggcaaaac cagacgctct ttctcgaagg 3960 cctgatctga agccagaaga cggtgataaa ttaacgtttg gtcaacttct taagccagaa 4020 aacctacctt gcgacgcctt tattgaagaa ctagaccttg tagactcgtg gtttataact 4080 gaagacactt cgccaataca aattgaagaa ctgaactcac aaaacgaatt ttggactgat 4140 gagcgtatta ttgaagagat caagcagaaa tcaattgaag atgaaagagt attagatatc 4200 atcaagctat gcaatgaaat gcctaattca aagctgttgg gaggatactc tttaaatgat 4260 gatgtcttat atttcaacga caaactggta gtccctgacg atgataacat cagacttcaa 4320 atactgagat ctagacatga tagtctttta gctggccacc caggaagaat gcgtactctc 4380 atgctgatca aaaggtcctt ttactggcct tcgatgcaac ggtatgtaaa ccaatatgtt 4440 gatggatgtc actcgtgtca aagagtgaag agcagaacta gcaagccgtt tggaagcctt 4500 caacccctgc caattcccga aggaccgtgg ttagacatat gctacgatct catcaccgat 4560 ttaccaagct caaatgactt tgatagcata ttaacggtgg ttgaccgctt cacaaaaatg 4620 cctcatttcg tggcgtgcaa gaaatcaatg aactcagaag aattagccaa gttaatgctg 4680 aactcggttt ggaagattca cgggacacca agaaccatca cctcggacag aggcaacatc 4740 ttcatttcga aaataacgaa 
ggaaatgaat aaccagttag gcatcaagac tcaggcatcg 4800 actgcgtatc acccgcagac agacggtcaa tcagaaatca ctaacaaggc agttgaacaa 4860 tatatccaac acttcacgtc ctacaaacaa gacgactggc aagatctgtt acctatggca 4920 gaatttgcgt ataataacaa tttacacgta tccattggca tgtctccttt caaagcaatt 4980 ttacgggtac gatgtgagct tcacggggac cccgagtact gaccaatgct tgcctattgt 5040 tgaggacaga atcaataacc tgaaggaaat acacgacgaa ctcaaagaag caatgcagga 5100 agcacaaact aatatgaaac atcaattcga caagaaagtt ctagcagctc caaactggga 5160 gaaaggtcaa cttctatggc tcagcagaaa acatatatca acaactagac ccatcgctaa 5220 gttctcacat aaatggcttg gaccttataa gataattgaa aaagtgtcta ctaacgcata 5280 caaattacag ttgccaaagg aaatgaacga cgttcatccg gttttccacg ttaatttact 5340 cagagaatac gtcaagagca aaattgaagg acaagttgaa gaaccaccgc ctccagtaat 5400 aatccaagac acagaagaat ttgaagtgaa cgaggtttta aacaaaagaa gaagaagagg 5460 aaaaatagaa tacttagtta gctggaaagg ttatggacct gagaatgaca cgtgggaacc 5520 tgaaaatgct ttaatgaatg cgcaagaagt tgtcaatgat ttcaattcaa aatatcctca 5580 agctgaaaac aagtattata ggaaaagggg aagagtgaga gggtaaggct ttttcccact 5640 gggtttttta atgccaaccc gtggaaagat acctgacctg tcaagagggg gttgaggtat 5700 aaagggggag tga 5713 // ID Gypsy-21_LBS-I repbase; DNA; FNG; 9004 BP. XX AC ABFE01001545; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_LBS_; KW Gypsy-21_LBS-LTR; Gypsy-21_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-9004 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001545; Positions 17630 26633. 
XX CC Positions [5020-5481] - Reverse transcriptase CC Positions [6970-7461] - Integrase core CC 'ATGGG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 154..8970 FT /product="Gypsy-21_LBS-I_1p" FT /translation="MSNHYSLRTRPSRAGLAIQQGMVRTPRKVADILGSQI FT QPGTALSATPQVHASDNRLTEDLTQMTRSYSDVVASRPPSPVSATEGEAPS FT GEAEALARYARAEETLVNTTEVVSQHNTKNIVQTDSEIHKSDTSSLSEVSE FT ADDNKNPWTTVVLRRSRSLDSLKKDTKTTKKVKVVPNRVNKLTTEQDTVVN FT QAEKQLTNAQKEQLSRRYEKVQNPPTPRERSESRGEGPSTLKGKGADPRNW FT GGAQLSDSDLDVEAQRAALESFTNKRDKNLDYSSSDEDQPEKMGTTHQRNR FT KPSRAPSGKTLDRASIVPAPKVDAKRNAQLVKATRTNVPINQIAPKSYLGR FT ALDNIEKSDKPSRRRRRGYSSSSSDTFGSDSSPSSSSSETSSDSEDTRSDS FT SINSRRARKRPRHRSKRRHNHRRRSRSKKPKTLLKPIPPVEYDGAADARAY FT HRFVTEGTDYVTSGKVRKNRRAFVLSYYLKGKAYDFYTQSVSLNPHEWTLK FT EFFVQLFNYCFPSDYRSELREKLRKCFQNNKNVSEYAFELKELFNMIGVMD FT EREKAVKFWNGLRASIQRQLWLFGLNPEVSTWDEIQQGAETIETSEKFTDP FT QHRRNNNSGGGGGGHNGGGGGGHHGGGGGGSNGHRNSRQNPGRKNESERKR FT SGSQFSRGTSQTHSAPRQQSLQLGSNRSQSQAPKGNYSQTNHNKRDWKSSK FT PTNVQRERATPRLSDKEKSELLASGSCFNCKKPGHLSRNCPQGNTVKSNGN FT KPPGLTNYNIEFEPESSDDVKVLDSLELGMVEFWDLPSNYTYMFSYEPAWL FT EYDPEARRRTRLGDSLALMAEYVLDIMQPYPGDSEFLRSGVRERFEVFTRP FT NADFYQIYDRLTHKGCSIWSGHLKNHYFRLGEWYARWQARKYNLNERPKRP FT WTMGDAHAENAMCVLRSCIPTLYPTSDPEIDDEFRITVVSKNENQYFIHDE FT DFKEPLTVDKSFFENPHIDLGKWYRARRLQKSEALDDDLTIDFSIFDHVWA FT FSLFENDVVVNEDENLPELHPGSDSDDEMPGLQSCSDSDSEHDGPSGLWPI FT NEADSEEEDDDDLPDLQSVSDSDEESEDPPPDNTSEGELTDNPDDETMVHT FT SDTETETDERPYRPIGDVLSDTITRILEASSPYPGDSLYPRTFLTRFRAYR FT VDDDFIRIEDRLQQSAASLALERARRPDFAPARWYAHRCSLRSGYSARRWE FT EDIPQLVMGDVLQREITRQLIHGAPYELDENFADVRLSRRFYVYLDPFDDD FT LFIIQDTRWDAWIRLPRFLAEAPGFDLADWYEQQLAALIEFFECPDPQPKG FT HPGRKPDDDPGSGSGITVDKPRTVPVSCDCAKQPTHRETHCEGPALELSGI FT QVPRGSYPAVQRNAAIAKDASRKVPRPVVVTVKIDGQPARALLDSGSLSDF FT LSSTLADQLNVKRVKLDVPLALQLAVQGSRSKINSGARVRFEYQKIAEERY FT LDIINISNYDVILGTPWMFQHQVCVGLNPSRVVIGSDTALPVEGTSVTGVA FT SRAVSLEGGQLEAAREALRKYAKPLCKTAGETELPPLRVINHSIPLIDEGK FT 
VYPWRPSRCPEALLSQWIEKRDAYLRTGRWEITNASNTIPMLLIVKPRKPG FT EPALLRTVFDLRARNENTYKMTSPLPDPEGILRRAARRRFRSMMDGKDAYE FT QIRIDPAHVQRTAVTTPDGNMICHVIQQGDCNAPATYQALMNHIFSPYLGR FT FMDVYLDDVIIYSDTLEEHVKHVKLVIDVLTREKLYLSEKKLHFLCSELQI FT LGRIVSDEGIRMDPYKVDCVLNWKTPTNRDLLRGFLGSVGYLGDDIPNVRI FT PMGVLHGLTGDTVPFRWGFTEQRAFEDVKTLVQASRDHHRVPLDYADDAPQ FT VWMVTDGCATGVAGVVSQGENWKTAKIAAFYSAKLNSAQQNYPVHEIEMLA FT GVETMLRHRDILQGVRFKWITDHKGLEHLLRQRNLSGRQARWVEKISEFDF FT EVVYVPGSENVVADALSRLYSADSPGTVRTKSEYTYFDVVNEDTGGVEETS FT VPVLAGVEARVAVQKRPRKVLPGAETGRPETAKEFAARVKSHFVLKPPAER FT KEGESGNTTTTSSKLTIRIPPRAKDSSTLPRSVSGNQTSIEIDATAKSSKI FT DATAKSSPSLLNVISESQDGIDLLKELRGKYIEDPLFKVIIDNPREHRNFL FT IENELVYLKERERKLLCVPNISIRGRNAREIVLSEAHSLLAHLGASKTLNY FT LRDSLWWKTMVSDTKAFCESCMTCRRSKPSNQKPYGLLNPLPVPSHPWEAI FT GIDFVGPLPESKNRDGVFDSITVIICLLTAMVHLVPSRTTYNARQIAELMF FT EEVYKLHGLPKHIISDRDVLFTSIFWGHLHKLMGTKLKLSSAYHPETDGST FT ERANRTVTQMLRQCIGEKQTDWVAKLPAIEFAINSARSESTGYAPFFLNTG FT RMPKSMIWDTARTDEYPSVRTFALQRKLALISAHDSILAARVKQTRNANRK FT RQLAPFKENNLVYLSTKNITFPKGLARKLIPKFIGPYKILKDFNNQSFLID FT LPSHLKQRGVHNVFHAALLRIHVPNDDRLFPGRMDTQLALSEEESEGEWAV FT EKILAHSGSMENAVFQVQWKAGDVTWLPYYQITHLNALPVYLDLLGVEDIS FT KLPKGQGTPPVDDPQIFVGFIALQEAFKPHLERPSNSFTTRPFPSTTSRAL FT LKASPPRRQLTQPAIMTDIPTPAPIVTDDITPDAPIAPAAAMAAPKSTKES FT KYSTITHRCLDRPCLTIITVTDPVQKTSTSYHVGQIALYCLTDARLRKQKP FT PRHGLPAGYEHFATNFNIWAEEDQKKRFAGYDDALGTYDLAGEPIDLSDFN FT IAPEIIGWASKTAPLKRKDAPAPEPSKGAALTPKRMKIVDGLLWKVAESAA FT EEEEKAIKLKFKKYKKYTSTPHHPSSSSNSKKRDGDSSSGASAIAV" XX SQ Sequence 9004 BP; 2447 A; 2357 C; 2132 G; 2068 T; 0 other; tattttttga cccgacaact cgactcttaa aaacaaaccc gactttactc gtcaccacct 60 tgcccgtcga ctggaccact tcggattgat cacccggacc ctccgttcac tggcaacaca 120 aaaatctctg cttttaaaaa caattctttc tctatgtcta atcattactc gcttcgaact 180 cgcccctccc gcgctggctt agctatccag caaggtatgg tccgaacccc tagaaaagtt 240 gctgatatac tcggttcaca gattcagcca ggcacagcct tatcggctac gcctcaggtt 300 cacgcgtcgg acaatcgtct gaccgaggac ttgacgcaaa tgacgcgctc ttacagcgac 360 gtcgttgcct ctcgacctcc ctcaccagtg 
tctgccactg agggagaagc accctcgggt 420 gaggctgaag ctcttgcacg ttacgcgagg gcggaggaaa ctttagtcaa tacaactgaa 480 gtcgttagtc aacacaatac caaaaatatt gtacaaactg acagtgagat tcacaaaagc 540 gatacctcct cgcttagtga agtaagcgaa gcggatgata ataaaaatcc gtggaccacg 600 gtggtcctcc ggcgctccag gagcctggat tcacttaaga aggacactaa gactactaag 660 aaagtcaaag tagttcctaa ccgagttaac aagttaacca cagaacaaga cactgtagtt 720 aaccaggccg agaagcaact tacaaatgct cagaaagaac agctctcacg tcgttatgag 780 aaagttcaga acccaccaac gccgcgtgaa agatctgaat cgcgcggcga aggaccttcc 840 accctaaaag ggaaaggcgc tgacccccgt aattggggag gcgcgcaact cagtgattcc 900 gatctggatg ttgaggccca acgtgcagct ttggaatcct tcaccaataa gcgtgataaa 960 aatctggatt acagcagttc agatgaagac cagcctgaaa agatggggac cactcaccag 1020 agaaacagaa aaccctcacg agccccatca ggcaaaactc tggaccgagc gtcgatcgtt 1080 ccagccccga aggtggatgc caaacgtaac gcccagctgg ttaaagcaac tcggacgaat 1140 gtaccaatca atcagatcgc gcccaaaagt tatcttgggc gtgcgcttga taacatcgag 1200 aaatccgata agccctcacg gcgccgccgt cgggggtact catcctccag ctctgatact 1260 tttgggtcag actcgtctcc aagctccagc agctctgaga ctagcagtga ctcagaagat 1320 actcggtcgg atagctcgat caatagccgc cgggcacgaa agcgccctag gcacaggtcc 1380 aaacgacgcc acaatcaccg gcgtcgatct cgatctaaaa aacctaaaac cttgttgaaa 1440 ccgattccgc ctgtggaata cgatggtgct gcagatgcac gcgcctacca tcggttcgtc 1500 acagaaggaa cagactacgt tacctccgga aaggttcgca aaaatcggcg tgcctttgtt 1560 ctttcatact acctgaaagg caaggcctat gatttttata cccagagcgt gtccctcaat 1620 cctcatgagt ggaccctcaa ggaattcttt gttcaattgt tcaactattg ttttccctca 1680 gactataggt ctgaacttcg ggaaaaactt agaaaatgtt tccagaacaa taaaaatgtt 1740 tcggaatatg catttgagct aaaagaacta ttcaacatga taggagtcat ggatgaacga 1800 gaaaaagctg tcaaattttg gaatggactt cgtgcgtcta tccaaaggca gctttggctc 1860 tttggtttaa accctgaggt ctccacttgg gatgaaatac aacaaggggc tgaaaccatt 1920 gagacctcgg aaaaattcac cgacccccag caccgccgaa ataacaattc tggtggtggc 1980 ggcggtggtc acaacggagg tggcggcggt ggtcaccatg gaggaggcgg cggtggctcc 2040 aatggtcacc gaaatagtcg ccagaaccct ggaagaaaaa atgagtccga 
acgtaaacgt 2100 tccggctccc agttttctcg cggcacttcg caaacgcact ccgcaccaag gcaacaaagc 2160 ttgcaactgg gatcgaacag gtctcagtcg caggctccta agggaaatta ttcccaaacc 2220 aaccacaata aaagggattg gaagagttct aaacctacca atgtacaaag ggaaagagct 2280 accccacgcc tgagtgacaa agaaaaatct gagctattag cctcgggttc ttgttttaac 2340 tgcaagaagc ctggtcactt atcacgcaat tgcccccaag gcaatactgt taaatcaaat 2400 ggaaataaac ctcctggttt aactaactat aacattgaat ttgaacctga aagttctgat 2460 gatgtgaagg ttctggatag ccttgaactc gggatggttg aattttggga cctcccaagc 2520 aattacacat acatgttttc atatgaacct gcatggttag aatacgaccc agaagcacgg 2580 cgtcgtacac gcctcgggga tagtctggcc ctcatggcgg agtacgtgct tgacatcatg 2640 cagccgtatc ccggcgatag tgaattcttg cgctccggtg tcagggaacg cttcgaggtt 2700 ttcacccgtc cgaacgccga cttctaccag atttatgatc gcttaaccca caaaggttgc 2760 tcaatctggt cagggcatct gaaaaaccac tacttccgtt taggagagtg gtatgctcgt 2820 tggcaggcgc gtaaatacaa tttaaacgaa aggcccaagc gtccttggac gatgggcgac 2880 gcgcacgctg aaaacgcaat gtgcgtgcta cgaagctgta tccctacact ctaccccaca 2940 tctgatcctg aaatcgatga tgaatttcgt atcactgtgg tcagtaaaaa tgaaaatcaa 3000 tacttcattc atgacgagga tttcaaggaa ccccttactg tggataagtc atttttcgag 3060 aacccccaca ttgatttggg gaagtggtat cgtgccagac gtctccaaaa atcggaagct 3120 ttggatgacg atttaaccat tgacttctca atttttgatc acgtctgggc gttctcccta 3180 tttgagaacg acgtcgttgt taatgaagat gaaaacttac ctgaattaca cccaggaagt 3240 gattctgatg acgaaatgcc gggcctgcag tcttgttctg actcagactc agaacacgac 3300 ggcccctctg gattgtggcc aattaacgag gctgactcag aagaggaaga tgatgatgac 3360 ctacccgatt tgcagtcggt ttcggattct gacgaagaat cagaagaccc accccctgat 3420 aatacatcag agggagagtt aacagacaat ccggacgatg aaactatggt tcacacatcc 3480 gacactgaga cagaaaccga tgaacgccct taccgcccca tcggcgacgt cttgtcagac 3540 actataacca ggattcttga agcttcaagt ccctaccctg gtgactcgtt atacccacga 3600 acgtttctta ctcgattccg tgcgtatcga gtcgatgacg attttattcg cattgaggat 3660 agactccaac aatctgcggc gtcgttggcg ttggaacggg cgcggagacc tgacttcgca 3720 cctgcccgct ggtatgcgca tagatgctcg ctgcgcagcg gatattccgc acggcgctgg 
3780 gaagaggaca tccctcagtt agtaatgggg gatgtgttgc aacgggagat tacccgacaa 3840 ttgattcacg gcgcacctta tgaactcgat gagaacttcg ctgatgtgcg cctatctcgg 3900 cgattctatg tgtaccttga tcccttcgat gatgatttgt ttatcataca agacacacgc 3960 tgggacgcgt ggattcgcct ccctcggttc ctggcagaag ccccgggatt cgatttggcg 4020 gattggtatg agcaacaact cgctgcactc atagaattct tcgagtgccc tgatccacag 4080 ccgaaaggtc atccaggccg aaaaccagat gacgacccag gttcaggtag tggaattaca 4140 gtggataaac cacgtaccgt acctgtgagt tgtgattgtg caaaacaacc aactcatagg 4200 gaaactcact gtgaggggcc cgcacttgaa ctaagcggta tccaggtccc ccgcgggagc 4260 taccccgcgg tacagcgtaa tgcggccatt gcgaaagacg ccagtcgtaa ggtaccacga 4320 cctgttgtgg taaccgtaaa aattgacgga cagcccgcac gcgctttgtt ggattcgggg 4380 tcgctcagcg acttcctatc ctcgaccttg gcagaccaat taaatgtcaa aagagttaaa 4440 ttagacgtac ctctcgcatt gcagcttgca gtccaagggt ctcgttctaa aataaattct 4500 ggggccagag tgagatttga ataccagaaa atagcagaag aacgatacct cgacatcata 4560 aacatatcaa attatgatgt catactaggt acaccctgga tgttccaaca ccaggtctgt 4620 gtaggactta acccatcgcg tgtcgtcatt ggcagcgaca ccgctttacc cgttgaaggc 4680 acctcagtta ctggtgtcgc ttcgcgtgcg gtctcactcg aaggaggcca gctggaagca 4740 gcacgtgaag cactcaggaa atatgcaaaa ccgctctgta aaacagcggg agagactgag 4800 ctgccgcctt tacgggttat aaatcactcc atacctttaa ttgatgaagg caaggtgtat 4860 ccgtggcgcc cttctcgttg cccagaagcc ttgctttctc agtggataga aaaacgagac 4920 gcttacctgc gcacagggcg ttgggaaata accaacgcct ctaacacgat tccaatgctt 4980 ctaatagtaa aaccacggaa gcctggagaa ccagctctgc tcagaactgt ttttgattta 5040 cgagcacgga atgagaacac ttacaaaatg acctcgccgc tacctgaccc agagggtatc 5100 ctacgaaggg cagcgcgacg acgttttcgt tccatgatgg atgggaaaga cgcttatgaa 5160 caaattcgca tcgacccagc acatgtccaa cggaccgccg ttacaacacc tgatggcaat 5220 atgatatgtc atgtcattca gcagggtgat tgtaatgctc cagcaacgta ccaagcatta 5280 atgaaccata ttttttcgcc gtacctcggc cgattcatgg atgtttatct ggatgatgtt 5340 attatctatt cggatacact ggaggagcac gtcaagcatg ttaaattggt tattgatgtg 5400 ctcacccgcg aaaaacttta tctaagtgag aaaaagcttc acttcctttg ctcggaattg 5460 
cagattctgg gacgaatcgt ctccgacgag ggcattcgga tggaccctta caaggttgat 5520 tgtgttctga attggaaaac cccgaccaat cgcgatttgc tgcgaggttt cttgggatct 5580 gtgggttacc tgggggatga catccctaac gttaggattc ccatgggggt tctacatgga 5640 cttaccggtg acacagtccc tttcagatgg ggcttcactg agcaacgcgc tttcgaagac 5700 gttaaaacat tagtccaagc gtcccgagat caccatcgcg taccattgga ttatgcagat 5760 gacgccccac aggtctggat ggttacagac ggctgtgcga ctggagtcgc cggtgttgtc 5820 agtcagggcg agaattggaa gacggctaaa attgctgcat tctattccgc taagctgaat 5880 tcagcgcaac aaaactaccc cgtgcacgag attgagatgc tcgcgggggt agagacgatg 5940 ttgcgacacc gtgatatctt acaaggtgtt agatttaagt ggataacgga ccacaaaggt 6000 ctggaacact tacttcggca acggaatctc tctgggcgac aggcacgctg ggtggaaaaa 6060 atcagcgagt ttgattttga agtcgtttac gtcccaggat cagagaacgt cgttgccgat 6120 gccttatcgc gtctgtattc tgcagatagc ccgggcactg ttcgcacaaa gagcgaatac 6180 acttactttg acgttgtcaa cgaagacact ggtggcgtcg aggaaacatc tgtgcctgtc 6240 ctcgctggcg tcgaagctcg tgttgcagtc cagaagcgcc cgcgtaaagt tcttcctggt 6300 gccgagactg gacgtccgga gacagcgaag gaattcgctg ctcgagttaa gagccatttc 6360 gttctgaagc ctcctgcaga acgaaaggag ggcgagagtg ggaacacgac aacaacaagt 6420 tccaaactta ccatcagaat tcccccacga gctaaagata gctcaactct acctcgatct 6480 gtcagcggta accagacctc gatcgagatt gatgctacgg ctaaatcgag taagattgat 6540 gccacagcta aatcgagtcc gtcactgctc aatgtaatct cagagagcca ggacggcatt 6600 gatctcctca aggaattgag aggaaaatat attgaggacc cactattcaa agtcataatt 6660 gacaacccgc gggaacatcg aaattttctc attgaaaatg agctggtgta cctcaaagaa 6720 cgggagcgga agcttctctg tgtacctaac atctcaattc gaggcaggaa cgcccgcgaa 6780 attgtactat ctgaggctca ctccttgctt gcccacttag gagcaagtaa aactcttaat 6840 tatctaagag attctttgtg gtggaaaact atggtttctg acaccaaggc attttgtgaa 6900 agctgcatga cgtgcagacg aagcaagccc agcaatcaaa agccttatgg gttattaaac 6960 cccttgccag tgccaagtca tccttgggaa gcaattggaa tcgattttgt gggtccccta 7020 cccgaatcaa aaaatcgcga cggagtcttt gactccataa cggtcataat atgtctattg 7080 acggccatgg tacatttagt tcctagtcgg actacctaca acgcccgaca gatcgcggaa 7140 ttaatgtttg 
aagaggtcta caaattacac ggcctcccca aacatatcat aagtgatcgg 7200 gatgtcctat tcactagcat tttctgggga caccttcata agctaatggg gacaaaatta 7260 aaactgtcca gtgcctacca ccccgagacg gacggttcaa cggaacgcgc aaaccgaacc 7320 gtcactcaga tgctgcgcca gtgtatcggc gagaagcaaa ctgattgggt ggcgaaactt 7380 cctgctatcg agttcgctat taactctgcg cgctcagaaa gtacgggcta cgcaccattt 7440 ttcctgaaca caggacgaat gcctaaatcc atgatttggg acacggcgcg cacagacgag 7500 tacccctccg tccgcacgtt tgcccttcaa aggaagctgg cattgatatc agcccatgat 7560 agcatcctag cagcacgcgt taagcaaacc cgcaacgcta accgaaaacg ccagctcgca 7620 ccttttaaag agaacaactt ggtgtacttg tccacgaaaa acatcacctt tcccaaggga 7680 ctggcgcgga aacttattcc gaaatttatc ggaccgtaca agatcctaaa agattttaat 7740 aatcagtcgt tccttatcga cttgccttca catctgaagc agcgaggtgt gcacaatgtc 7800 tttcacgctg ccctgctccg gatccatgtc ccaaacgacg atcgtctgtt tccaggacgg 7860 atggacactc aactcgcgct tagcgaagag gaatcggaag gtgagtgggc cgtcgagaaa 7920 attttggcgc actcgggatc tatggaaaat gcggtgttcc aggtacagtg gaaagcaggt 7980 gatgtcactt ggctcccata ttatcagatc acgcacctaa acgcgctccc cgtttacctc 8040 gacctccttg gggttgagga tatttcaaaa ttgccaaagg gacaaggcac gccgcctgtt 8100 gacgatccac aaattttcgt cggctttatc gcattgcaag aagcttttaa acctcacctt 8160 gaacgacctt caaactcatt caccactcgt ccctttccat ctacgacctc ccgcgcgtta 8220 ctaaaagcct ctcctcctcg ccgacagttg acccaaccag ccatcatgac cgacatccct 8280 actcctgcac ccatcgtgac tgatgacatc actcctgatg cacccattgc ccccgccgcg 8340 gccatggcag cccccaaaag caccaaggag tccaaatact ccaccatcac tcatcgttgc 8400 ctggatcgac cctgcctcac catcatcacc gtcaccgatc ccgtccaaaa gacgtccacc 8460 tcttatcacg ttggccagat tgcgctgtac tgcctcaccg acgcgcgctt gcgtaaacaa 8520 aagcctcctc gccatggcct tccagctggc tatgagcatt tcgctaccaa cttcaacatc 8580 tgggcagaag aggatcagaa aaagcggttt gcgggctacg atgacgcact aggcacgtac 8640 gatctcgccg gcgaacccat cgacctctcc gacttcaata ttgcccccga gatcattggg 8700 tgggcatcca aaactgcgcc cttgaagcgc aaagatgcgc cggctcccga gccctccaaa 8760 ggcgccgcgt tgactccgaa acgcatgaaa attgttgatg gtttattgtg gaaggttgct 8820 gagagtgcgg ctgaagaaga 
agaaaaagct atcaagttaa agttcaagaa gtacaagaaa 8880 tataccagta ccccccatca cccttcttca tcctctaact ccaagaagcg cgacggtgat 8940 tcttcctctg gcgcgagcgc catcgccgtt tgaaggaatc cctgtaggat ccctggaggg 9000 cgta 9004 // ID Gypsy-84_MLP-LTR repbase; DNA; FNG; 215 BP. XX AC AECX01001040; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-84_MLP_; KW Gypsy-84_MLP-I; Gypsy-84_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-215 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001040; Positions 108471 108257. XX SQ Sequence 215 BP; 64 A; 52 C; 26 G; 73 T; 0 other; tgttatgatc tcaatttgat cattctataa ggatatgctt ataaactgta tagcttgttc 60 ttatatactt tgttatacgt catccttcat tgtacccttt cctctatggt acaattcaat 120 catcaatcat cataagatcc aacccagcag aatagccaca agccccaact cttagaacca 180 acagactagt ctctgtacgt gatctgctca tatca 215 // ID GYMAG2_I repbase; DNA; FNG; 5484 BP. XX AC AACU01001656; XX DT 04-SEP-2005 (Rel. 10.09, Created) DT 04-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE GYMAG2: Gypsy-type LTR retroelement from Magnaporthe grisea DE (internal portion). XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; GYMAG2_LTR; internal portion; GYMAG2_I. XX OS Magnaporthe oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-5484 RA Jurka J.; RT "GYMAG2: Gypsy-type LTR retrotransposon from the rice blast RT fungus Magnaporthe grisea."; RL Repbase Reports 5(9), 244-244 (2005). 
XX DR EMBL/GenBank/DDBJ; AACU01001656; Positions 6584 12067. XX CC LTRs differ by 1 bp indel. This appears to be a recent insertion. XX FH Key Location/Qualifiers FT CDS 172..927 FT /product="GYMAG2_I_1p" FT /translation="MPRLKDHPPASRKSQRLITNQTEQTDRPIDDPTRSES FT ETDFRPGTKSSVEDEIEVASSSRSPELNVNAQENNPRMMSDANDPASEIAR FT LEAEILRLRNALRNDTPSPRPYRELRHREPSVESAFGGTSFKAKGMAAWPA FT FTEFQGTGGINPAYNDKAKARADSPPKFAGDKTQFDSWLIKVADKFEEDVA FT IFRTEKSRMRYLMNLLEDKAEKAMITRYVSVTRPFSSAAEMIQILESMYHD FT PNQSIACQRSP" FT CDS 1540..2634 FT /product="GYMAG2_I_2p" FT /translation="MSTIAQIGSNPRPKSFIIRTQIQVNGVALSVKALCDT FT GADISLLISPAIAEQAAERLGARLQRLKTPLLLSDYRKQDAGRITHKLKAT FT LEIDGRRFSNQMFYVTESGHDMFIRQDWLVEQDVWIHPKTQTFAWPERTPS FT LAKFSPAIRLPNMLDKPDPVAQADAERRDRAFERETKRVQILRRPWRQTTF FT LEPRPTTPVVLGEDRDVVNIAALQHKLDTDPRQKRWKSCPIPTQPITLEIG FT PEKPAISLSAVGRSYQWKTNKGEPIPFPENKDPDHVELVRSKLPSRLAHLE FT GFFSKAASTNLPPHRPGHDVILELDKPKTGSPPTYRTPVEFLPLEKETVDE FT LLRIGFIEPCMQADPAPCPVRT" FT CDS 2805..5423 FT /product="GYMAG2_I_3p" FT /translation="MAVGSEYLTAFRTRQGTFQWKVLPFGLKVGPAWWQSF FT INAQLNELLDLFASAYADDVLVYSDGNEEEHWDQVEEVIYRLSRVDLQGDI FT KKSRFNVTTVDYLGIVMDAGLGIRIDPDKLQAISDWKFEDLTSKTAVRSFL FT GLCNYIRMFCHHASSVAEPLNRLLKKDAKFAMGPEQRRAFEEMKRLACDAP FT VMAFFTPGRPTKVETDASRNATGGAIWQQQPGGEWKPVGYFSKTMTPAERA FT YPIQDRELLAVVQTLKHYEPELLGTSFFVVTDHQALVYYSTKRLISTRQVR FT WADFLANFNITFQYRRGKDNIAADALSRKTADLPTVKAREKEERTMILIPP FT EKITPTVAAVSTADPNAHVVSGADLVDLIRQENEKQKLGQHQGKLVVPETT FT LDGQIFLRTALIREAHEPKIFAHAGQNKTIQMIKRRYFWEGMSQTIRKYIK FT NCHDCERNKGRHDKTPGLLHPLPIPNYVWEHVAVDGKDMPKDKFGYDYVWV FT FVCKFSRLIATIPGQKTDTAEILASRYYRYLYRFLGLPFVWISDNAGPFIS FT EFMETINELTGTKHRHGSALHPQTQGAVEITNQELDQKLRFYIDKYQTSWS FT VHLPALDFAHNAAWHSSLGMCPLKVVLGTEPRNPLSTDLPTTTVDSDQKRK FT ALQIVRQTKEVQELARQNALKTQARQEEQANKKRRPVDFGVNDYVFVKKKG FT FPTTAPTTRLDSQWTGPWQILEERGYSYVLDVPESFKGKNLFHADRLRKAA FT MDPLPQQKREPPPPEEINGEPEFVVDKVLASRLFGRSKILQYQVAWQGCDP FT DDTWYPAENFKNSATALDDFHKKYKDAAGPPKRLAIWIKAAAEDKLDEPNQ FT EDNVAEHGELNGKRKKRRHG" XX SQ Sequence 5484 BP; 1651 A; 1447 C; 
1346 G; 1040 T; 0 other; caattctact ctctaaagaa tcgctggcct agcgattaac aaataaggcg caacaaccta 60 tctttacgcc gctggcgagc gacgaaaata aatccgcacc cgacagaccc gacccgaccg 120 gagttcgaca ggaacggcaa ttccagtccc cgaccacgac cgatccgacc tatgcccagg 180 ctgaaggacc acccacctgc gagcaggaaa tcacaaagat tgattacgaa ccagacggag 240 caaacagacc gccccatcga cgacccgacc cgatcagaaa gcgaaacaga ctttcgccca 300 ggcaccaagt ccagtgtcga ggacgaaatc gaagttgcat cttcttcgcg gtccccggag 360 ctcaacgtga acgcacaaga aaacaacccc agaatgatga gcgacgcaaa cgacccagcc 420 agcgagatcg ctaggctaga agcggaaata cttaggctaa gaaacgcgtt gagaaacgac 480 acacccagcc ccaggcccta cagagaatta aggcaccgag agccatcagt ggaaagtgca 540 ttcggcggaa cgagtttcaa agcaaaaggg atggccgctt ggccagcctt taccgagttc 600 caaggaacag gcggcatcaa ccctgcttac aatgacaaag caaaagccag agccgattca 660 cccccaaagt tcgcaggaga caagacgcaa tttgatagct ggctgataaa ggtggccgac 720 aaattcgaag aagatgtcgc catttttagg actgagaaaa gtcggatgcg ttacctgatg 780 aatttgctcg aggacaaagc cgagaaagca atgattactc gttacgtttc agttacacgc 840 cccttttcat cagcagccga aatgatccaa atcctagaat ccatgtacca cgaccccaac 900 cagtccatag cgtgccagag aagcccttaa gaaacacgag tttgagctag gcaagggcca 960 agatatccac gagtttattg ccacgtttaa ttcactagcc cagcaggcaa aggttcggga 1020 agaagactgg aagcaaacct tgtggggatg tatacccgcc gacctcgatc accgtctgct 1080 gcacgatagt gagaacatcg acatagacta cgagacgttt tgccagtacg ttaccaaagc 1140 cgtctacagc aaccaactgg cacaagaaag gcgcaaagac cgagagagca ccgacaaaac 1200 cagcacgaag aacgagaccc ggagcaggca gaaaacagct tcgacgaaag tccgttcgcg 1260 ctacaagccc aaagattatc aaggggatag aacggaacca attacaatgg ctaaggccgg 1320 tcgctcactg acttatgagg agaaaaaggc gcattgggat gccaacacct gtttcatgtg 1380 cggaaggggt ggccataatt ccaaagattg tcccgaaaaa gagagaataa gggatgtcaa 1440 agcagtgaag ccgaagcaac aaacgagtag cttagattcg gaaaccacag acgaatcggg 1500 aaaagagtga gaccgccgca agcctcctcg gtggtccaga tgtctacgat agctcagatt 1560 ggtagtaacc cacgacctaa atctttcatt attcgaactc agatacaggt taatggcgta 1620 gcactatcag ttaaagctct ttgtgacacc ggagcagaca tttctctttt aatcagccca 1680 
gcaatcgctg aacaagccgc agaacggcta ggagcccggc ttcaaaggtt aaaaaccccg 1740 ctacttttgt cggattaccg caaacaggac gcaggacgca ttacgcacaa actaaaagcg 1800 accttggaaa tcgatggacg ccggttcagc aaccaaatgt tttacgtgac ggaaagtgga 1860 catgatatgt ttattaggca ggattggcta gttgaacagg atgtttggat ccacccgaaa 1920 acgcaaacgt tcgcatggcc tgaaaggaca ccgtcgctgg caaaattctc gccagccatt 1980 agacttccga acatgttgga caagcccgat ccggttgcac aagccgatgc agagcggcgc 2040 gaccgagcat tcgaacggga aaccaaaagg gtgcaaatac ttcgacgacc gtggcgccaa 2100 acaacctttt tagaacccag accaacgacc ccggtcgtct tgggtgagga ccgagacgtt 2160 gtaaacatag ccgcccttca acacaaactg gacaccgacc cccgacaaaa gcgatggaaa 2220 tcgtgcccaa tacccacgca gcccatcacg ttagaaattg gacccgaaaa acccgccatc 2280 agcctatctg ccgtcggccg ttcgtaccaa tggaaaacca acaagggaga accaatcccg 2340 ttcccggaaa ataaagaccc cgaccatgtc gaactggtcc gaagcaaact gccatcccga 2400 ctcgctcacc tggagggatt cttttcgaaa gcagcttcca ccaacctgcc tccacatcgc 2460 ccagggcacg atgtaatttt ggagctagac aagccgaaaa caggatcgcc tccgacgtac 2520 aggacacctg tggaattcct gccgctcgaa aaggaaacag tcgacgagct gctgcgcatt 2580 gggtttatcg aaccctgtat gcaggccgac ccagcgcctt gtcctgttcg tacctaaacc 2640 acactccaaa ggaacgccgt ttttgcaccg actaccgatg gatcaaccag tttttgaaag 2700 atcgattcgt accagccccg gatgtcaacg gaactatttt taattgtcgc aatgcaaaga 2760 ggtttacaaa aatcgacatt attcgagctt tcaaccggtt acgtatggcg gttggctcag 2820 agtacctcac tgcttttcgc acccgacaag gcacctttca atggaaggtg cttccctttg 2880 ggctaaaggt cggcccagct tggtggcagt catttatcaa cgctcagcta aatgaacttt 2940 tggacttgtt cgccagcgca tacgccgacg atgttttggt ttattccgac ggaaacgaag 3000 aagaacattg ggaccaagta gaagaggtta tttaccgatt aagccgagtc gacctccagg 3060 gagatattaa aaagtctcga ttcaacgtta ccacggtcga ttatcttggc atcgttatgg 3120 acgcaggtct gggcatccgt atcgatccag acaaactgca ggcaattagc gattggaagt 3180 tcgaagacct tacgtccaaa acagccgtgc gttccttttt gggattgtgc aattacattc 3240 gaatgttttg ccaccatgcc agcagcgtcg ctgagccgtt gaaccgactg ttgaagaaag 3300 acgccaaatt cgctatgggc cccgaacaga ggcgagcatt tgaggaaatg aaacgtttag 3360 cctgcgatgc 
accggttatg gcatttttta ccccaggccg accgaccaag gtagaaaccg 3420 acgcctcacg caacgcgacc ggcggagcaa tttggcaaca gcaaccgggt ggagaatgga 3480 agcctgtggg atatttctca aaaacgatga ccccagcaga acgcgcatac ccgatccaag 3540 accgagagct gctggcagtg gttcaaacgc tgaaacatta cgaaccagaa ttgttaggga 3600 caagcttttt cgtggttaca gaccaccaag ctttagttta ctattccaca aaacgactca 3660 tttcgactcg acaggtgcgt tgggcagatt tcctggccaa cttcaacatt acgtttcagt 3720 accgccgcgg caaagacaat atagccgccg atgccctgtc ccgcaagacg gcagaccttc 3780 caacagttaa agctcgggaa aaagaagaac gaacaatgat tttaatccca cctgagaaga 3840 tcacacctac ggtggccgcc gtcagcactg ccgacccaaa cgctcacgtg gtgagcggcg 3900 cagatttggt agacctaata cgacaagaga acgaaaagca aaagctaggc caacaccaag 3960 gaaagctagt cgtcccggaa acgactttag acggacagat ttttttgagg accgctctga 4020 tccgtgaagc tcacgagccc aaaatattcg cccacgcagg acagaacaag acgatacaaa 4080 tgatcaaaag acggtatttt tgggagggca tgagtcaaac cattcggaaa tacatcaaaa 4140 actgtcacga ctgcgagcga aataaaggca gacacgacaa gaccccgggc ctattgcatc 4200 cgctaccgat accgaattac gtttgggaac atgtagcagt tgacggaaag gacatgccca 4260 aagacaagtt cggatacgac tacgtgtggg tgttcgtttg caagttcagt cgcctgatag 4320 ccaccatccc aggacagaag accgacacag cagaaatcct ggcctcccga tattaccgat 4380 acctataccg tttcctagga ctacctttcg tttggattag cgacaacgcc gggccgttca 4440 tctccgaatt catggagacg ataaatgaac ttacgggtac taaacaccga cacggcagcg 4500 ccctgcaccc tcaaacacaa ggtgccgtag agataaccaa ccaggaactg gaccaaaagc 4560 tgcggtttta tatcgacaaa taccagacca gttggagcgt acatttgcct gctttggatt 4620 ttgcccacaa cgccgcgtgg cattccagcc ttggtatgtg cccgttaaag gtagtgctcg 4680 ggaccgaacc ccgaaacccg ctgtccaccg acctgccaac cacgaccgtt gattcagacc 4740 agaaacgaaa agcgctgcag atcgtccgcc agactaaaga agtccaagag ttggctcgcc 4800 aaaacgcgct aaaaacgcag gcgaggcagg aagaacaagc taacaaaaag cggcgaccag 4860 tagattttgg ggttaacgac tatgttttcg tcaagaaaaa agggtttccg acgaccgcac 4920 cgacgaccag attggattca cagtggaccg gaccatggca gattctagaa gaacgaggat 4980 atagctatgt tttggacgta cctgaatcgt ttaaaggaaa aaatttgttc cacgcagacc 5040 gcctccgcaa agccgcaatg 
gacccattac cacaacagaa aagagagccg cctccgccag 5100 aagagatcaa cggagaacca gagtttgtgg tcgataaagt tttagcgtcc cgattatttg 5160 gccggagtaa gatattgcaa taccaggtcg catggcaagg atgtgatcca gacgacacgt 5220 ggtacccggc tgaaaacttc aagaattcag cgacagccct tgacgacttc cacaagaagt 5280 acaaagatgc cgcaggaccg ccgaagcgat tggcaatctg gataaaagcc gctgccgagg 5340 ataagttgga cgaaccgaat caggaggata acgtggcgga gcatggagag ttaaacggta 5400 aaaggaaaaa gcgacgacat ggttaacttg acaagagcct cgcatgatct cctgcgtcga 5460 catacgaggt ttggaagggg gtag 5484 // ID DIRS-1_MLP-I repbase; DNA; FNG; 12121 BP. XX AC AECX01001077; XX DT 18-MAY-2011 (Rel. 16.04, Created) DT 18-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW DIRS; LTR Retrotransposon; Transposable Element; Copia; KW Copia-45_MLP_; Copia-45_MLP-LTR; Copia-45_MLP-I; DIRS-1_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-12121 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX RN [2] RP 1-12121 RA Kojima K.K. and Jurka J.; RT "DIRS-type retrotransposons from fungi."; RL Direct Submission to Repbase Update (24-MAY-2011). XX DR Genome; AECX01001077; Positions 183364 195484. XX CC 'CTAGC' target site duplication CC LTRs are 99% similar to each other. CC [2] Re-classification as DIRS based on the presence of tyrosine CC recombinase. 
XX FH Key Location/Qualifiers FT CDS 6430..8391 FT /product="DIRS-1_MLP-I_2p" FT /translation="MKFDKHFLSLVQTDLYQTVPSNRHTPQNKIIFLRETK FT TSEPPRGKKPLLVTIELRHEPDSDQDPRDEKEVERQIPKGKRRDPLPLKDR FT KTYTVHVDNPWPVPVKTDMKIDKWVALMKEVGLADDIPYIVDGFKHGFCLG FT IPQHELPGRLWYMPENHGSALAAKEQIENSLAKEVAAGRVAGPYTHEEIRE FT KLGFFRSNPLGSSENGDGSIRMISDLSYPVNEPETPSVNHFVDKNNYTTQW FT DDFRVVADFFRRNPGEWDLALVDWVKAYRQLSVHPCQRRFLIIRDFQGRLW FT VDLAVGFGGVASCGVFGAPADVWKTIVEKVLGLPKICRWVDDNLVFKRKWQ FT SISLKDIMDLSDSLGVKTSPKKNREFDLEQKFLGFIWNGENHTVRLPEEKL FT EQRRGQIDAILVPGSKWTYDQINSFVGRLVHTVNIVPHMAPYLNSIYRWLH FT SWMNEKARRKAPEDVIEDLTEWQVCLQEFNVCPIIQDREPEEVGWVGDAAS FT TAGIGVLIGTFWAMFKLNEDWQEAGLESGKRSIAWAETVAIRLGLILLDRI FT NVVGGRTFKVYTNNTTTEMCIKNQKSKDKWVNHEWKMIQRLLTRLNCNLRE FT VRVTSKENDADELSRGIVPTKLDWYREVKIEVPEDLKPLLSQVSCIDAPPG FT QKSMF" FT CDS 8638..9747 FT /product="DIRS-1_MLP-I_3p" FT /note="tyrosine recombinase." FT /translation="MNTSQLARQTKPYGRFKKNAQGIDISDSNEVQTQFLS FT GAWADGTLKHYNSAVVKLFRFAKLKGIEHKDLLPISPKLVAKFVAWGSKKI FT EEKAERDESVKSSTMKAYVAGLKAWHMFHGEMYPSQADRTVNTLLKSAAKT FT EATLYEQNVQRPPVLVSDLVNLHNEMSTEKEEDLAILAIALTAFWGTARLG FT ELVTDDAKRRMPKWRDITWSKNGKFARMALHEAKTASPGELQYLHLERQKS FT RLDPLKALENLYRFRPRKDSDDIFEVLRGKGKHRLGKSEVIQKMRKIWNCH FT RPKHKQILYGHSFRIGGASLRWNLGESREEIMNAGRWKSEAYLIYLRKFDQ FT KETRKTVKLLKEMKLEWETERGAQKEV" XX SQ Sequence 12121 BP; 3688 A; 2730 C; 2976 G; 2727 T; 0 other; cgccccctca aggacttcag gtctcaccaa catgcctcaa gcctagaagt tgttgcagtc 60 ggtagaattg ttgtctgcca agaggcttgg taagaagatc agccatcatt tcactagttg 120 gaatgtgttt aacaatcagt gctccttgtt ttacgacttc tcgaatccag tgatactgaa 180 tttcgatgtg ttttgtttga gcgtgaaaaa tagacttggt ggttaggtgg atggctccca 240 tattatcact tcgaagaacc gtcggattag gatcttcgaa accaaaaact cgcagcatat 300 tccgtagcca aatcagctcc tgccctgcct cggttatagc acgatatttg gcctcggtag 360 acaaaagagc tactgtgggt tgcaaacgag acttccagga tatgggtccg ccagccagct 420 gaaaaatgta gccggtagta aatctacgag tctccttgtc tcctgcccaa tcggcatcgc 480 agtgagaatt tggtaagcta aagcttttga gaccgttgac tatggtagct tccgagtgtg 540 
atccgtcgta cgtaatgccg aggttgatag tgccactaag atattgaaac acgtgtagtg 600 cagcatccca gtgtttacga gctggagagt ccaagtgttg agataatacc ccaaccgcat 660 acgacagatc tggtcttgtg cattgtgtga ggtacataag ggaccccact gcacttcgat 720 aattaaggcc ttcgagcaga aaagcacagc ttctttttca cttgaccggt agagtttaag 780 cccagccgga aacggagtag aagcactttt caagttagtc aaaccaaaac gctccaacac 840 ggctcgggtc aaagctgatt gtgttattgt gtaacataaa ggagagggcc gactgatttg 900 aatgcctaca actgtggtgg ctacacccag atattccatc tcccatctct tggtgatttc 960 taccttgaag gtggcgatgt cattgccagt gatagctagg tcgtcaacat gcacgtaaac 1020 agctgtgaaa gtgttggatt gatttcgagt gtaaatgcaa ggatcatcca cacattgggt 1080 aaagttaata ctgaggagaa aggcaaccac gtcgtcgtac cagatcttag gagattgacg 1140 cagtccatac agtgacttgt tcaggtaaca gactttacct tccatcccct ttaccacata 1200 accttcgggt tggtacatgt aaacatcttc ttccaaattt ccatttaaga aggcagtaac 1260 cgcatccatt tgatggatct cccaaccctg agtggcggca atggcaatga gcagtcgaag 1320 agatgatggt ttaccggtcg gggagaaagt cttgtgaaag tcaacaccct gttgttgcat 1380 atttcccatg acaacaaacc gggccttata ctttgagacc gtaccatcga gatgaaactt 1440 cttacgaaat aaccagaaac ccctcaagat gtttacccca actggtggat aaaccaatgt 1500 ccaaacattc tttttgagta gagattcgta ttatttaata catgcttgct gccaacagtc 1560 cttctcaggt cccctaagag cagtcttcat ggacttgggg ataggtacat ccacagtaga 1620 agagacaaaa tcttctgaga gaatactaaa agctgaaaca tatgtgtaat cgtcaaacat 1680 tgtgtgtccg cccattcctt tataattagg agcatctttc acccgacttg atcgacgaac 1740 ttcgatagat ggagtctcta acgcaggtga aatgacatct gcaggtgaag gcacagcttc 1800 tctttgctct tgaggtgtag ctttgggaac ttcgggtact agatcttctg gttcactatt 1860 gtcatcttct atgggcaatt ggtaaaaacg atttggtgtt ttgatagcaa gaggtacttc 1920 attttttgaa tgtgatggta acaacggtgt gttccctgtt tttagaggaa aaaccgattt 1980 gcagaagctg acattgtggg ttaaaataac tttcctagtt tccagatcaa agattctgta 2040 gttgaagttg tcatcttgaa acccgaggag aactccctcc gaggtgacca gtgagtattt 2100 acctggtcga atctcctttc gaatgagatg gttggcctga catccaaaaa cacgtaaatg 2160 actgatatta ggttttcgaa aattccatag ttcaaacggc gttcttccat caggaatggc 2220 tgaggtagtg 
aggcgatttg tgatgaacac tgcagtagca caagcttcat accaaaagtt 2280 cggaggcact cctgactgta ccatcataga gcgagctcga gtattgatgg tctgattccc 2340 tctctcagat acgccatttt gctcaggggt gtagcttgcg gtaaaatgta gtacaatacc 2400 ggtttcttca caaaatccat ccataatgga gttcacaaat tcgctccctt ggtccattgt 2460 gaactgaaca attttcctgc cagtttgacg ttcagccaaa gctatgtact tgagaaaaat 2520 gttaaaaacg gtttctttgg ttttattgca tagtggaaag atatggcgaa aagaggaaaa 2580 attgtccgtg aacatgataa agtggtgctc ttgatgtagt ccagagacac gattgatacc 2640 actaagatcc acatgaacat tctgaaggaa attgatagca cgaggtcgag tggaggaaaa 2700 aggaagcttc ttggatttag attgaataca tataggacaa tgattggcag gagcagtttg 2760 atcacataac ccatccacag atccatgctt attcatccta tccagatatt tgttgcttac 2820 atggcctaga cgttgatgtt gtagattgca agagtgagtg ttgatttgtg aactttgagt 2880 gaagacggca gaactcacag gttcaagaga gataagcata aggtttccgg aaaaagcccc 2940 gttaaataga gcgcattgtc ccatgaccaa actaaaacaa tctgggtctg ttgagttaat 3000 aagtgtaaca acacctttct ttaataaagc tcctccagcg attaaatttc tgctaacacc 3060 atgtaaatat ccgtgtagca cgtcgacgtg cacacacgga gagttttgaa ataaatctca 3120 aagtcttttg gtcggattga gactgagatt tttgctgagc accctttgtg ctttgtgatt 3180 tgtttaaagg atttgcagaa tcctttgttg tcagtggggg ccaactgact ttcaacctcc 3240 caggtgataa ggggaagatg tacttgtcgt ggaacctctt tgatgggtcg ttacagggtc 3300 cacggaaagt ctttcgtgtg tgcgcgtcga cgtgctacac ggatatttac acggtgtaag 3360 ctcaggaaca tatagacaat cggccaacgt gaaaattgtg ccatcgccag cccgtagttt 3420 aacggaacct gtagaatgta ccgctaacga tgcatctcct cctgctaggc gtaaacgttt 3480 attggtgtct tcaagaggct tgacggtaga agcgtcgaac aaagagatgt cattgaacat 3540 gtggtgggac gcccccgtat cgtttaacgc ccatataccc ttagatgggg ttactgagga 3600 agcagtagcg aaatcaataa gttctgaacc ctgagccgat agatcagtga cttggcgaaa 3660 ttcaggttgt tgatattcat agagtaccca gagcccgcgg aggtatctct gcagtccatt 3720 agagggttac ggtcaggaga taggtccata gggagcgccc ccatagaaca gagatgtgaa 3780 acaaacacgg tttcacatcc tatggctcta gagagacccg gggtgagaga acaagttttt 3840 ctcctctgct ctcccaccaa cccgttctct cccagtcagc cctccctcta cgcacatatc 3900 tcccagaaag cttgcgacat 
ttctctcccc tctgtctttc tacaactgct ctaagttttg 3960 aaacattatt ctacacgtga atgcatctgg aagaacaaga tagcggaaaa tttctggaga 4020 attacaagaa acaccagaaa aacgtaggaa agaaacttga aaaagagaag aaagagcaac 4080 aaaagaaaga agaagaggaa gaaaggaaga gagagcaaga agggcgacat caagggtacc 4140 agctgaacgg attcaccaag aagactccgc accccaagat caccaaatgt cttcaaagca 4200 cccaacgatg gaaaacggag gtgacggaga cggcattgat gggaaagtga taaatggatt 4260 ggtggaggag gaaggaaacg agatggagag cacagcagct cacttcggag gacaccaaca 4320 ggtgacattc ggaagcaggg aagaggtgga agatacaaga ggaaaggaga gacagctgaa 4380 acgatcagat tcagagtcca ccctggaata cctggacaag gagggagaag gggaagatag 4440 gttaggggac ctcatagaaa agaacagctg tgagtgcaga aataaaacga cttctcctca 4500 caaacggtgg tggtactgta atccctaaag gggaaaacct catgatgatt taagcctaga 4560 agcaaccact ccggaccaac catccgtaga gtataaggct gaaggctcag acccttctca 4620 tcaacccaat ccaaaccacc gagggataga tatctccagg catattttaa gccctgttga 4680 gaaaaacgtg atcagaagga aatcctctaa accagcacga atgacaggag ctgacctaac 4740 ggggaccccg tcgaatccca tccaccctcc agccggaata gcccacacag gagcccgagc 4800 cgagaagaaa ctgaaatcgg cagagagaga agagctggtc ctgatgatgg ttgcagcaat 4860 agacaaagag aagtttgacg tcgcggtagc cctgcgaaac caattcgaac ggaaatacgg 4920 aggaacggtg gaacagatca gcaaggaaac aggaaagaag gaaggaaaaa agaaagagaa 4980 gaagaaagga aggaaaggag gaaagaaggt aggaagaaca agaaaggaaa ggggaagtcg 5040 aagaagaaga gacaaaggag aaagagttct acctcgtcct ccagctcaga ctcggatgat 5100 tcggacaatg actccagtga ttcatcaagc tcctcagaca ccgactcgga ttcgtcttcg 5160 tctgctaaga agaaatcttc ttcaagctca tcctcttcat cgtcatcatc ctcatcatct 5220 cctagcgaca ctagcgacga cgatagagac aaaaagagga aaggaaagaa aaagtccttc 5280 tcaactttca ctctgagaga cttaccacgg gtgtgggcca aaggcttcaa gagactgaaa 5340 tactacgtac ccttgtcggt cttcaacaaa gcctacatca accgcttcca ctcactgaac 5400 cgagagctga aaccgggagt aaaacaacta tcaatcccag gagtctcgga gaacgaacta 5460 gaacgacaga tgacctacgg agaattcatc caggcctgcg accttgaggc cagatatgcc 5520 tcggaaatct acagaatgaa agactacgtt agattcgtcg aagaccacaa ggacgtgatt 5580 agtaagctca tgaccaaaca caactcgtgg 
atggtggctc tgagatacca cttagcaatc 5640 agagcagtca tttttcgaga ccgatcatcg tcgaagaaaa gcaagaagaa aaggaagaag 5700 gaggagagag ttgtcttacc aagggggatc caagcggacg tagaggagga ggccaggtgg 5760 gaagctcaac tggcaggcga catgcgcttc gcgggcaacc catacgctcc tggaggaccg 5820 aaacacgggt ttgatttctt gacaggtgaa ccagctgagt caacctaccc gacagaagag 5880 aatcagtgga gatctcgccg tcagaggaat aaaggtagaa acccaccggc ctcattcgga 5940 aacaacacga cgaccaccgt gtacgccaaa ggaaacggct cgagaggagg ttatggaagg 6000 tcgtacgccc aaggaagtgg aaaccacgcg agcacatccc agggaaacca ggactggatt 6060 gcccaaggcc agcagcatgc caaccaactc gtgagccaag ctccgaaagg gaaccccaaa 6120 gcgcctgtga acgccaccaa gaaccagaac cagaggtgag cacgaagacc acccgagtga 6180 aagagaagta catacctctg actgtttcct caaactctcc cacttgcaca aaaccaatgc 6240 cgtcgcaacc aaacaactcc cccccgatca aacaccacaa ctgatcatgg aaccggtcat 6300 acttgccggc ctaccgtaat tgaaatgttg attggttcct gggttgtcta aataacctct 6360 atcgcgggta cggtggctgt tttcagacaa tgacttattc gtgacaagga gaggagtgga 6420 gtaactcaca tgaaatttga taaacatttt ctgtctctcg tacaaacgga cctgtaccaa 6480 acagtgccat ccaacagaca cactccccaa aacaagataa ttttcctccg tgagacaaaa 6540 accagtgagc ctcccagagg aaagaagccc ctgcttgtga cgatcgaact aagacatgag 6600 ccggactcgg accaagaccc gagggatgag aaagaagtag agagacaaat acccaaaggc 6660 aaaaggaggg acccactccc attaaaagac cggaaaacgt ataccgtcca cgtcgataat 6720 ccctggccag tcccggtcaa aacagacatg aaaatagaca agtgggtcgc tctgatgaag 6780 gaagtaggcc tggcagatga catcccgtac atagtggacg gattcaagca cggcttctgc 6840 ttgggcatac cccagcacga acttcctgga cgactctggt acatgccaga aaaccacggg 6900 tcagccctag cagctaagga gcaaatagaa aactcactgg cgaaagaggt ggcggctgga 6960 agggtagcag gtccatacac tcacgaggaa atccgcgaga aactgggttt cttcagatcg 7020 aaccctttag gcagctcaga gaacggcgac ggctcaattc ggatgatctc ggacttgtca 7080 tacccagtca atgaacctga aaccccctca gtaaaccatt tcgtcgacaa aaacaactat 7140 acgacacaat gggacgattt cagagtggtg gctgacttct tcagacgaaa ccctggagaa 7200 tgggacctcg cactagtaga ctgggtgaaa gcgtaccgac aactgtcggt acacccatgc 7260 cagaggaggt tcttaatcat ccgtgatttc caggggaggc 
tatgggtaga cctggcggta 7320 ggatttgggg gcgtcgccag ctgcggcgtt ttcggagcac cggcagacgt gtggaaaacc 7380 atcgtagaaa aagtcctagg cctaccgaaa atctgtaggt gggttgacga caatttagtc 7440 ttcaaacgga agtggcaaag catatctctc aaagacatca tggacttgag cgactcactg 7500 ggtgtaaaga caagcccgaa gaagaaccgc gaattcgact tggaacaaaa attcctgggg 7560 ttcatatgga acggagaaaa ccacacggtc cgtttgcctg aagagaagct ggagcaaaga 7620 cggggacaga ttgacgcgat acttgtcccg ggatccaaat ggacctacga ccaaatcaat 7680 agctttgtgg gcagattagt ccacacagta aacatcgtgc ctcacatggc tccttacctg 7740 aattcaatat acaggtggct acactcatgg atgaatgaaa aagcaagacg taaggcgccg 7800 gaggacgtca tagaggatct cacagagtgg caagtatgct tacaagagtt caatgtgtgc 7860 cctatcatcc aagaccgaga gccagaagag gtaggctggg tcggagacgc agcatccacg 7920 gcagggatcg gagtcttgat aggcacattc tgggccatgt tcaagctgaa cgaggattgg 7980 caggaagcgg ggctggagtc gggaaagcga tcgatcgctt gggcggaaac cgtagccata 8040 cgactggggt tgatcttgct cgacaggata aacgtcgtag gaggtaggac cttcaaagta 8100 tataccaata atacgacgac cgagatgtgc atcaaaaacc aaaagtctaa agacaaatgg 8160 gtcaatcatg aatggaaaat gatccaacgc ctcctaacaa gactgaactg taacttacgg 8220 gaagtgaggg tcacgtcaaa agagaacgac gcggatgaat tgtccagggg cattgtaccc 8280 acgaagctgg attggtacag agaggtcaag atagaagtgc cggaagacct taagccttta 8340 ctttcacaag tgagctgcat agacgcgcca ccaggccaga aatccatgtt ctagcagagg 8400 gattgcaatg tagcagcaat gcatgaaatg agtcgagtcc gaatgagcca ctagcagcca 8460 gaaagactgg tgagatgagc gagccatccc tgtgagcctg agacgagtga gagacactag 8520 agcagcgatg aaacccaagc ggggttcctt tccgaactca cacggaaccg cttgctggtc 8580 taaatccttc accttcagac aaagacttga cagcctggat taccaccagg cccaaaaatg 8640 aatacatccc agttagcgag gcaaacaaaa ccatatggga ggttcaagaa gaacgcacaa 8700 ggcatcgaca tatcggactc caacgaagta caaacccagt tccttagcgg ggcatgggcg 8760 gatggcaccc tgaagcatta caactcagcg gtcgtcaagc tattccgctt cgcaaaactg 8820 aaaggaatag aacacaaaga tcttctacca atctcgccca aactggtagc gaagttcgtg 8880 gcctggggtt caaagaagat tgaggagaaa gcagaaaggg atgaatcagt aaaatcatcg 8940 actatgaagg catacgtcgc aggactgaaa gcctggcata tgttccacgg 
cgagatgtac 9000 cccagccagg cagacaggac agtgaacaca ctgctgaaat ccgcagcaaa gacagaggcc 9060 accctctacg aacaaaacgt acaaaggccc ccagtcctag tctcagacct cgtgaaccta 9120 cacaacgaaa tgtcgactga gaaagaagaa gacctggcga tcctagcgat agctctaaca 9180 gcgttctggg gaacagcccg gctaggtgaa ctcgtgaccg acgacgccaa gaggagaatg 9240 ccgaagtgga gggacataac ctggtccaag aacgggaaat tcgcgaggat ggccttacac 9300 gaggcaaaga cggcaagccc aggagaacta caatacctac acctcgaaag gcaaaagtca 9360 cgactagacc cactgaaagc gctggaaaac ttataccggt tcaggcccag aaaggactca 9420 gatgacatct tcgaagtcct aagaggcaag ggaaaacata gattgggcaa gtcagaggtg 9480 atccaaaaga tgaggaagat ctggaactgc catagaccaa agcataagca gatcctttat 9540 ggacactctt tccgtatagg gggagcttcc ctgcgatgga acctaggaga gtcgcgggag 9600 gagatcatga acgcaggaag atggaagtca gaagcctacc tgatttacct acggaaattc 9660 gaccagaaag aaacaaggaa gaccgtaaag ctcctgaagg aaatgaaatt agagtgggag 9720 acggaaaggg gggcacagaa ggaagtttga tgagggtgaa gctatctgca ggcaatacaa 9780 atttaaagtt tgccgatcac tacaggtatg gcttgtaacg ccacggaccc tgtaaacaag 9840 gtttgggcag gagcggcaga tctcaaactc acgggtgtcg agtgaacggg aggtttctca 9900 gggctcacgg ctctactacc cgcgagtaga tggggtcgag aaaccccgaa gatcacacac 9960 cgttctcgcc tcacaggggg tggtaaaata aggtacctag gtaccttaca ccttaccacc 10020 cccctagagg ccccccccaa accacgctga gagttctaga ggctgtacaa cacaacctcg 10080 agagctctca gaggtaggat ggagtgtagg cctaagggac tctagccctt gtaactacaa 10140 gggctcggga aaagacaatg ctaacacagc actaggcaag cttaagaccc tcacccacac 10200 taatggattc atagagtacc cagagcccgc ggaggtatct ctgcagtcca ttagagggtt 10260 acggtcagga gataggtcca tagggagcgc ccccatagaa cagagatgtg aaacaaatac 10320 ggtttcacat cctatggctc tagagagacc cggggtgaga gaacaagttt ttctcctctg 10380 ctctcccacc aacccgttct ctcccagtca gccctccctc tacgcacata tctcccagaa 10440 agcttgcgac atttctctcc cctctgtctt tctacaactg ctctaagttt tgaaacaccc 10500 acccgccgtc ccgggtcgag aaaccccaaa gctcacacac cgttctcgcc tcacaggggg 10560 tggtaaaata aggtacctag gtaccttaca ccttaccacc cccctagagg ccccccccaa 10620 accacgctga gagttctaga ggctgtacaa cacaacctcg agagctctca 
gaggtaggat 10680 ggagtgtagg cctaagggac tctagccctt gtaactacaa gggcttggga aaagacaatg 10740 ctaacacagc actaggcaag cttaagaccc tcacccacac taatggactc gatgtcatga 10800 tgaataggag cttgaataaa tgcagaggat acgtctagta cttgaattac ctcgtgaatc 10860 ttggtggata cagttgttgg aatttgctgc ttgacaggtt gaagatcgat aagaaaagtc 10920 attgcatgat tggcggaagt tgcaggaggt ctcacaattc taactcctct aatggcacta 10980 gcactagcgt tggaattgct ttgatgttga ttgtttcgac gtttgaatct tttcttgttt 11040 tcatcttgct ttgccatcca ctcatcacgc ttactaaaat tttcaggtct tttaaagcat 11100 tgatcgggtt tgtgcggatt ttgaaaagta tcaccaacac atctatcaac agtacacttg 11160 atcatttgac tcggttgagc tgaatagtga gaaacttgat gagacacttg atgtaacaaa 11220 gttccagcat attgttgttc gtcttcatat aataagagtt ctcgcttgac tttttcaaaa 11280 gtgagaggtt gaatgactcg atagatgatg gcaatagtta cttcataacc aggcttcaag 11340 ctcttaatca aagtttgagc aacctgacca tcttgcatat ctcctttata tttatgatat 11400 tcacgtaaca aagaggagaa cttgtatatg tgaactttca atgagtctga ttgtagttga 11460 gtcattgaaa gtaaggcagt ctcaatatgc ttgagcttgg cttcagaaat cgtcaaatga 11520 taatcattga gatgtttcca taaaacataa gggtcgtagt gaatagtcat ggttttcgtt 11580 tcagggtctc aaacagttaa agcatctcta gcacgttcac catttactcc atcacacttg 11640 ttcagtatcc atgtagtcaa tagaaatttg attttcttat gtttctcctc agtaagagtc 11700 ggatcaacat aatcacacac atctaagtaa gactcatagc ctaaagattc tatggattcg 11760 tgaatcaata agttccagta gttaaaattc tcatctgtaa gatcattttt gatgttgaat 11820 ttactgaggg atgaagtcgt gcaatgtatg aaagattgta acggcgttaa gttttcaagt 11880 gatgaagcct tgtcggtcaa aattggagat gagggcaatt tcaaagttaa aacaagagat 11940 tttagtccag tggaggtacg tttgactgtg tcagaagatt taacggaact cgatgtgtca 12000 gagtcagaat ttgattcggt gtcagttgga ttggtggaat cgtctttgtc aggtgattgc 12060 attcagttaa gttgtcggaa aaagttagag ttttgaacag tgtcaaagac tctcgctacc 12120 a 12121 // ID Gypsy-99_MLP-I repbase; DNA; FNG; 6904 BP. XX AC AECX01000442; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-99_MLP_; KW Gypsy-99_MLP-LTR; Gypsy-99_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6904 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000442; Positions 29025 22122. XX CC Positions [4656-5078] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1689..5144 FT /product="Gypsy-99_MLP-I_1p" FT /translation="MKKEDLPNWREVTAFDGRRSFIKFKIELKIENKEETE FT ELQVADLKGQYDVILGMPWFRRYGHLVDWAHSKIGCITSSPPKVPAEEGIE FT DKARKLEEGANRLSVVTPPRCESARSISVYKRTITGKDSVLEQVDVPDKVN FT NEEVSPDAIMAGQSDELGGQTSKCDPGVAAMSPSVKAEPPPSESNRLVPSV FT VKIIAGKDPPSKQDNRPNKVKRRKKCKNRHFWRKKKKALKSLDAPKAGHSG FT GLGGQTSTCDSGVAAICPSVNAEPPPSESNFLFVLDDVEDVVASNFFRHNI FT DQSGSEDVKVDTLKSSWSKSAELAASNPELLEEKSAADLVPVEYHDFLPLF FT EKKASLRLPPRRKYDFRVELVEGASPYAGRIIPLSPAENKALDKMVEEGLK FT AGTIRRSTSPWGAPVLFTGKKDGKLRPCFDYWKLNAMTVKVKYPLPLTMEL FT VDSLLDAEEFTSLDMRNGYNNLRVAEEDEAELAFICRAGQFEPLVMPFGPT FT GAPGYFQFFIQDILKDRIGKDVTAFLDDMMVYTKKFNVHKDAVRDVLQVCK FT DQSLCLKPEKCKFSQSEIAYLGLIISKNRIKMDPKKVEAVTDWPAPKNVSE FT VLSFVGFANFYRRLIDNFSKVARPLHELTKKGTTFEWTQARQNSFDTLKRL FT FTTAPVLKIADPYKAFVLECDCLDFALGAVLSQISDVDGELHPVAFLSRTL FT VPAERNYEIFDKELLAILAAFKEWRHYLEGNPNRLDVIVYSDHKNLESFMT FT TKQLTRRQARWAETLGCFDFEIRFRPGKQSQKPDALSRRPDLRPAVEDKLT FT FGQLLKPDNLRHDTFIADIAIDSFELLFEDEVGGQNFEDNFENKECIEIGA FT LLSYDEIWNDEMIMTLLKNKMKDDECIQNLIKSIALQNAKEDNSDEEYEVS FT EGLLYFRKRVVVPQNDKLKFEILRSRHDSRLVGHPGQARTLSLVERNFYWP FT SMKKYVNRYVDGCSSCQRVKSRNTKLFGFLKPLPVPAGPWTDICMDLITDL FT PLSEGKDTILTVVDRLTKMGHFIACNKTTDSMGLARLLIDNVWKLHGTPKS FT IVSDRGSIFVSRITREMNAQLGIATKPSTAFHPQTDGQSEVSNRSVEGYIR FT 
HFANYDQSNWVRNLSTFEFSYNYGIQTRNPDLFRVSQFGLFGCV" XX SQ Sequence 6904 BP; 2061 A; 1375 C; 1579 G; 1889 T; 0 other; cattgtacca tcttttaagt acagacgcta gtagaacaaa acaatattaa ctctaagctt 60 caaaacctca tctcacgtca caaccttatc gacgatacaa accaacgcca ccaatctcga 120 caatccatac aaaccaacgc cacaatcatc tgaaccaaac cttaccgaca caaacaaacc 180 aatacaatcg atacacgtca caagacgtca caaccaaacc gatacaatca atacaactga 240 tacaactatc ccttcgtacc actctctgta tactcttcga atacagtcta gtctccctgt 300 aatctaaaga cgacgaggta cgggattctt cgtaacgacg atcagaattc tcaggattcg 360 cacaattcat cgaattctaa ttcctcgtta cacactgctg aaacaaccgg tcaaatggcc 420 aacttagatg agatcgcttg acagatccag gatctcaacg ctcgtcttat caatgagaca 480 aaccttcgtc acgaggaaac tgcgcgacga attgcagcag aagctcgttt agaggcttct 540 attgctgctc atcccgctac tcaacctgct gtgtcacaag ctgtagctca catcccggcg 600 cctgttgctc acatggcgaa gatggctacc ccggacaagt tcgacgggaa gagaggtgct 660 aaggctgaga tgtttataca acaggtttcg ttgtatgttt taacaaacga acgctcattt 720 cccgccgagt acaacaaagt agcgttcgcc gtatcttatc tcacgggcga cgctagtgtt 780 tgggccgctc cttttgttcg caaaatcatt ggcgacgaca aggaatcggt caccttcaaa 840 agctttttag atgtattcaa aggtacctac tttgatccac atcgtgtaag caaggccgag 900 aacgcgatca gagcgttatg gcaaaccaaa tcagtattgt aatactcgac tcgcttcaat 960 caacttgcca gtattgttag ctgggaggac tcgaccttaa tgagtcattt taaaggacac 1020 ctcaagcccg aaatcactgt tccgttgatt agggacaatt cctctacttt ggatcaactg 1080 atcaaagcgg cgattgaagt ggatcacttg ttgcatccca atcgagacgg tatttctttg 1140 tttcatgagg gatcagagga aaaagtgatt gaagttcaag gagtagctga tagcggaggt 1200 gttagaagtg tgatgagatc tgatgatgct atggacttgt cagcggttcg tttcaattgt 1260 tcttacccgg aatatcaacg ccgcaagaag gaaaatctat gcttctattg tggcaaggcg 1320 ggtcattcgg tatcttgctg tactcaagcc aaggccaata aggggaagaa caaaggtttc 1380 aagggtcaat tttcagacgt tatgatcggt ggtagtggag gaggaagtag tagtatgacc 1440 agtagtagtg ttgtagagtc gaaaaatggg ggtgctcagg agtgaacgtt acgccaccct 1500 tgagcagttt tagtggtttt tgtgttggtg ttggtgcaaa taaattgtta agtaattcag 1560 acccaagact ttttaagtgc atctccgtcc acgatcccac ccaagccgta 
acccataaag 1620 ctctttgcct acttgactgt ggatcaactc acaatgtcat taaggagaag tttgttgatg 1680 agaagaaaat gaaaaaagaa gatttgccca attggagaga ggtgactgcc ttcgacggga 1740 ggaggagttt cattaaattc aaaattgagc tgaaaattga gaacaaagaa gaaacagaag 1800 aacttcaagt agcagacttg aaaggccagt acgacgtcat cttaggtatg ccatggttta 1860 gaagatatgg tcatctagtt gactgggccc atagcaagat tggatgcatc acctcaagtc 1920 cgccaaaggt ccctgccgag gagggtattg aggacaaagc taggaaactt gaggaggggg 1980 ctaatcgttt atcggttgtt acgcccccgc gatgtgagtc tgcccgctca atttctgtat 2040 acaagaggac tattactggc aaggattctg ttttagaaca ggttgacgtg ccagataaag 2100 tcaataatga agaagtgtca cctgatgcga tcatggccgg ccagtcagac gaacttggag 2160 gccaaactag caaatgtgac ccgggggtag cggctatgag cccatctgtt aaagcagaac 2220 ccccgccgag tgagtctaac cgtcttgttc catctgttgt aaaaattatc gctggcaagg 2280 atcctccttc aaaacaggat aaccgtccaa acaaggttaa acgaagaaag aagtgcaaga 2340 atcgtcattt ttggaggaag aaaaagaagg cactcaagtc actcgatgcg cccaaggccg 2400 gccattcagg aggactagga ggccaaacta gcacatgtga ctcgggggta gcggctatat 2460 gcccatctgt taacgcagaa cccccgccga gtgagtcgaa cttcctcttt gttcttgatg 2520 atgttgagga cgtcgtggct agcaattttt ttcgtcacaa tattgatcag tcaggatctg 2580 aagacgtgaa ggtggacact ttgaaatcgt cgtggagcaa gtcggcagag cttgctgcat 2640 ccaacccgga acttctagaa gagaagtccg cagctgacct ggtaccggtt gaataccatg 2700 atttcctgcc gcttttcgaa aagaaggcat ccctgcgttt accgcctcgt cggaagtacg 2760 atttccgagt ggaactggtg gaaggtgcat ctccttatgc tggaaggatc atccctttat 2820 ccccagcaga aaacaaagca cttgacaaga tggtagaaga aggactcaag gcaggaacaa 2880 tacgacgctc gacatcgcct tggggtgcac cggtcttatt taccggaaag aaagatggta 2940 aattacgacc atgttttgat tattggaaac tgaatgcaat gacagtgaag gtaaagtacc 3000 cactgccgct tacgatggag ttggttgata gcctgcttga tgcagaagaa ttcacatcac 3060 tggatatgag aaatggctac aacaacttac gtgttgctga ggaggatgaa gcagaattgg 3120 cgttcatctg tcgagcagga caattcgaac cattggtcat gccgtttggc ccaaccggag 3180 cacctggcta ctttcaattt tttattcagg atattttgaa agacaggatt gggaaagatg 3240 ttacagcatt tttagatgac atgatggttt ataccaagaa gtttaatgtg cataaagatg 
3300 ctgttcgaga cgtacttcaa gtttgcaagg atcaatcttt atgcttaaaa ccggaaaaat 3360 gcaagttttc acaatctgaa atcgcgtatt taggtttgat catctcgaaa aaccggatca 3420 agatggaccc caagaaagta gaggctgtaa ctgattggcc agcaccaaag aatgtgtctg 3480 aagtattgag tttcgttggt tttgcaaact tttaccgacg tcttattgat aacttctcta 3540 aggtagctcg cccgttgcat gaattaacga agaaagggac aactttcgaa tggactcaag 3600 cacgacaaaa ttcttttgac acgttaaagc gattattcac tactgcgcct gtgcttaaga 3660 tcgctgaccc ttataaagct tttgttctag aatgtgattg cttggatttc gctctagggg 3720 cagtgctgtc gcaaatttcg gatgttgacg gagagttaca cccagtcgct ttcttgtcaa 3780 gaactctagt accagcggaa cggaactacg agatcttcga caaagaattg ttggcaattt 3840 tggctgcttt caaagaatgg cgccattatt tggaaggtaa cccaaaccga ttggatgtca 3900 tagtttactc agatcacaag aatttggaat ccttcatgac gacaaagcaa ttaacacgtc 3960 ggcaggccag atgggcagaa acccttggct gctttgactt cgagataaga ttcaggccgg 4020 ggaagcagtc tcaaaaacca gatgctttat caagacgacc tgatttgcgt cctgcggtgg 4080 aggataagtt gacatttggc caattgttaa agccagataa cttgcgtcac gatactttta 4140 tcgctgatat tgctattgat tcttttgagt tgttgtttga agatgaagta ggaggtcaaa 4200 attttgaaga caatttcgaa aacaaagagt gtattgaaat tggtgcatta ttgtcatacg 4260 atgaaatttg gaatgatgaa atgattatga ctttactgaa gaacaaaatg aaagatgacg 4320 aatgtattca aaatttgatc aaatcgatag cattgcaaaa cgcaaaagaa gataattcag 4380 acgaggagta tgaagtttca gaagggcttt tgtatttccg taagagggta gtggtaccac 4440 aaaatgataa actgaaattt gaaattctgc gttcgaggca tgatagcaga ttagtcggac 4500 acccgggtca agcaagaaca ctatcgttgg tggaaagaaa cttttactgg ccgtcgatga 4560 agaaatatgt gaacagatat gttgatgggt gttcatcgtg ccaaagagta aagtcacgaa 4620 acacaaaact tttcggcttt ttgaaacctt tgcctgtccc ggctggcccg tggactgaca 4680 tttgcatgga tctaattact gatctgcctt tgtcggaggg gaaagacaca attttgactg 4740 tggttgatag gttgacgaag atggggcatt tcatagcctg caataagacg actgattcaa 4800 tgggtcttgc acgcttgttg atcgacaacg tatggaaact tcatggtacg ccaaaaagca 4860 tagtgtcgga tcgtgggtca atctttgtca gtagaatcac cagggaaatg aacgcccagt 4920 taggaatcgc aacgaaacct tctacggcgt ttcatccgca aaccgacggt caatccgagg 4980 
tatccaaccg tagtgttgag ggttatataa gacattttgc taattatgat caaagcaatt 5040 gggtcaggaa cttatcaact tttgaattct cttacaatta cgggatacaa actagaaacc 5100 cggatttgtt ccgggtttcc caattcggac tttttggctg tgtttgagaa actcggaatc 5160 tgggtcaaaa ttcccaattt ttttctactt tgaatccctg gacgccttca acttcccatc 5220 tttgccaact ccaaatctca aatccgcttt gatctcaact gacaggtcct ccttcaccac 5280 acacctccat ccactaccat cctctcatgc ctcattttga ccgtcaaaag ttgaaaactt 5340 caaaagtcaa taccagcttc cagttatgca agaacggctc ttgctcggaa cacacgtgga 5400 gtctttttgt tctcttaatc ccttttttat catctttcac taaactgact tttccttctc 5460 tttgggttgg ttgattagaa ttaagaccac ttgatagaac cagtagctcc gatcgtttca 5520 gagcacagtt tcttttacgc tctctgtaat gctaacatcg acttcagaac tcgtatcact 5580 ttatatgatc gattactagt tcatctcttt acctctcttt tttgtttcca tcaatctttt 5640 tatactaatc ttttctttct ctattcgatc tttgatttgg tgcctatagt ctcaagtatt 5700 gggaattagc agatacatgt tttttagtcg ctaagaactt tgcaatcaaa tttaaaccga 5760 gtttcctttt actttttttc aaagtctttg tgtaaggagg atgagttgat catcaataat 5820 atgatgagtc agagggtagt tcggactctg attggatcat ggtgccgtga tcgaagtgtg 5880 gagcggagat ggttgtggag atcttgtttg acaacatgga agtcaaaatt ggggctcttg 5940 attttttccg agtttcccag attctgaaat gaagttggga aactcggaaa ccaggaaaaa 6000 aaacccggtt tctactttgt atcccgtaaa taatctagat cacacagcga ttggtatgtc 6060 accattcaag agtaattatg gccacgattt aagtttagga agattaattc atggtgaaag 6120 atgtgtgatg gcggtcgagg aaaggttaaa tcaattaaca gaagtacaag aagagatcaa 6180 gtcacatatg aaaagaagcc aagacattat gaaaaagaat tatgacaaaa acgtaaagga 6240 ggaggaagaa tgggaggaag gtagcaaagt atggctgagt agtagaaatt tatcgacaac 6300 gaggccaacg gcgaaattta gccataaatg gctaggacct ttcaaaatag aaaagaagat 6360 ttcacctgtt gcttataagc tgacactacc attgtcaatg aaatgcttac atcctgtttt 6420 tcatgtttct ttgttaaggc agtttattcc ggacacaatt gaagaaagaa gacaagcaac 6480 accggagcca attgtattag agggtgtgga ggaatttgag gttgaagaaa tattagattg 6540 tagaaggaga agaggagttt tggaatattt attgagctgg aaaggttatg gaccggaaga 6600 agactcttgg gaaaaagagg attccttaag caacgctcaa gatttattaa atgacttcaa 6660 aatcaaatat 
ccaaatgccg aaaatggata caaacggaca cggagaaaga agtgagagat 6720 tcaagctttt tccctgaaag tgagggtttt ttaacactga atcgtggaaa gattgcagag 6780 ctgcaagagg ggcttgggca ttaatggggg agtaatgtta tgatctcatc ttggatcatt 6840 ctataaggat atgcttataa ctgtaaagct tgtctttata tactttgtta tatgtcacct 6900 tcat 6904 // ID Gypsy-14_CCO-LTR repbase; DNA; FNG; 496 BP. XX AC AACS02000009; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_CCO_; KW Gypsy-14_CCO-I; Gypsy-14_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-496 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000009; Positions 2251424 2251919. XX SQ Sequence 496 BP; 84 A; 154 C; 80 G; 178 T; 0 other; tgttacgaat tccgccccta tcttttctct tcgcgtcctt tctctatcac atgactctct 60 ttcttgttca catgatctct ttctttatct cgtgactgtt ttatcagtta attcttctta 120 ctgctgcggc gcagccttct ttgactcttt atgactcctt cttgctgcgg cgcagccact 180 ctctattgtt ctctaattta tctctgctgc gctccgcagt aacctgtgct cgcgcagccc 240 ttttctctga cttctcttga ctcatattca cgtagctaga taggtatata agctctgtag 300 atacctctct tgttccttag ctagttctcg tgaatcatac ccacttctcc tcttctctcg 360 aactcgctct agtcgtctta agacaatacc tcttgtgagg cactcttaga cgcctatacc 420 ctcgaactcg cctcggaacc acgactctac ggtcgacgtg catgctccct tgagttcgcg 480 tcccttgacc gtaaca 496 // ID Copia-11_MLP-LTR repbase; DNA; FNG; 351 BP. XX AC AECX01001249; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. 
XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-11_MLP_; KW Copia-11_MLP-I; Copia-11_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-351 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001249; Positions 54076 54426. XX SQ Sequence 351 BP; 69 A; 83 C; 57 G; 142 T; 0 other; tgttgaacta cttgatgtct catgatgaac tcctctcgta ctccttatgt catctgtcat 60 gtgtgctttc catctactgc ttcttgtcta ttgttttctt ttatctctca tgacggatca 120 tgtgactcct ctagttctta cataactaac aaatgtttgt tacgtagtag tgttgtgtat 180 atatacaacc tggagctgca ctaggaaagg gtctcttttc ctcacccttt ccatttcatt 240 ctcgtgcgct ccaggtatgt attttctttt tccttcatgt tgtgtgtgtt acttatgaaa 300 ttgaaaccac cctttccatt tcattctcgt gcgctccagt tgatcgcaac a 351 // ID MGRL3_I repbase; DNA; FNG; 5841 BP. XX AC AF314096; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Magnaporthe grisea gypsy-type retrotransposon MGRL3_I, internal DE region. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; MGRL3_I; gag; internal region; pol; KW internal portion. XX OS Magnaporthe grisea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Magnaporthe. XX RN [1] RP 1-5841 RA Kang S.; RT "Organization and distribution pattern of MGLR-3, a novel RT retrotransposon in the rice blast fungus Magnaporthe grisea."; RL Fungal Genet. Biol 32(1), 11-19 (2001). XX DR Genbank; AF314096; Positions 1372 7212. 
XX SQ Sequence 5841 BP; 1606 A; 1556 C; 1558 G; 1121 T; 0 other; ataataatat tcacgatacc gaagaaaaaa cacgaacgcc tataaggcac gataccgaaa 60 aagaaacacg aacgcctata aggcacgata ccgaaatggc caagaacgac gccgaacggc 120 agccgaccgc ccagttggga aggcaagagg tcgaatctga ccacggtcgc cccgagacag 180 gtcccccccg cgcagagcgc cagcgtcagc agcctacggt acgaacacgg gcccagcgtc 240 gcccgtcgac ccgggagcgc gtgggaaata ttgtgacggt taggggggag gacgagacca 300 ccgattcccg agaagacacc gacgaacccg ttcggttaga tgattgggac cagcgagttt 360 taccatcaat agatattgat gggccgacac gcaaaaccaa cgaggctcct ctttccgagg 420 aagaacacca gatgatagta gacgaattgg aagatcgcat tgcccatatg caggaccaaa 480 tcgatttact aaacaacacc aagcaagaaa acgaggccac catttcgcgc ctgctccgcg 540 aagcccgttc ccgacgagaa actaccgcga ttagtattgg taccggttcc ggtggttcgc 600 accacaaatc ccgcgtaaag gatcctccga cattctcaaa taaccccgac aaagacgagg 660 ttactttcga ggtttggcac cgccggattg agaacaagtt gttgctcgac ggagcgcatt 720 atcccaccga tgcagataag cgcgcctacg tcgagtcccg tctcggaggt gacgccgccg 780 acaccttgct cccttacctc aacgataccc atcccgatca gatcaccacg tacgacggta 840 taatgggcca tttgcgcgag gaatacaagg ataccaaact cgaggataca gcgcgggacg 900 aattggacaa gctgattatg tcgacgaagg acaagtttat gattttcaag aacaaattcg 960 tcaaactggc aggtcagagc ggccttccga agcgaagctg gaagcgggaa ttgcaccgcc 1020 gcctgcctac caacttgcga gtcgcaatgg ttatatacca ccaggatacc aacgtctcgt 1080 ttgacgccta cgtccgtacg gccgatggca tctcctatga cctcaccaaa gcctacgcct 1140 cgcggaccga agataagaat aagaccaaga ccacctcgat caagaagacc acaggagggt 1200 ctgttattcc acgtgctgga ggaccagccc ggaccaccgg aggcgctgtt gacgagggac 1260 agaagggcaa actgagtgtg gaggagatgc gtggacttat ccggacaggc aaatgtttcc 1320 aatgccgtga accgggccac attagcagga actgccccaa cggtgacacc ggtcgcccca 1380 ccatttccga ggaccgagtt aaccaaatta ttgcaagcta ccatggaaag ggtaccgata 1440 tgtttggaga agacgtcgtc atgtcggaaa actgattgcc ctggggaaat tggactccct 1500 cccagggaag gtgtggccaa ccaccgagat gtgcgaattg agagaaaaaa tcgctgcaaa 1560 gcaaaggctg ctggaagagc cgatacccta cgatttctta cgtagatatc taggcggcga 1620 ccccatgctt atagcggcca gcatttccct 
aaatggactc aattataaca ctcgtgcctt 1680 gattgacacc ggagcaaatg ggtacgtttt tatcagcaaa caacttgctc ggcgactgta 1740 cacaacgttg aaagccccta agattgtcgg cttccacccc caaaaggttg gaggctatga 1800 cggaaaagcc tcgcaacaga tcgatactgc tgcattggcc acttttgtga tacagggccg 1860 gaggttccgc gaaacgccga tgctggtatt ggatatgcca caagagatga tcgtgggacg 1920 actcttcctg gcggagcacg gaatacttac tgactgtcgt caccgacggc tccaattccc 1980 ggaaagccta ccctcattgg cggtatgtta tcgagatatc gacatgaccc aatcaccaga 2040 tccgaggtct gtgagcccca tacaccagtc ggacgcggat cgtcgagatg cggccctgga 2100 aaagaaagat caatcggcct cccgacagcg aatgatacaa cgacgaatcc aagccttgga 2160 aaaaatgaca gaacgccctc cgtcgatacc ccggagagca aaactgctgg ccaggtcggg 2220 acagacctgg gagatcccca cggcccggga aacttacgcc gatccgaatt tacaggagat 2280 gcaacgggaa ctttacgacc tgccgaggcc aaaggttgaa gtgacaagcc tgagtcatac 2340 ccaaaggaaa gagaccagag aggccgaagg ttcctctcgc atacaaaagg atgccaaagg 2400 aaaatacata ctgaagagag acgcggtggg ttggtacaaa gaccgacagg caagtgtcgc 2460 catgataggc gctatacaac ttatgcgttt ggtgcaggag gattcagccc tctactgcac 2520 ctcgataaac gaaatcgacg atgtcctgcg gcagaagaat gccgagaggc tgctccctga 2580 tgacgacgga gacctgcggg cgatgttgtt ggaaagagtt ccagaacatt accacgacct 2640 gttagacgtg ttcagcaaag tggaatccga tcaggtcccg ccttttcgac cgggatcaga 2700 ccataatata caactcgagg gtgaccccag aacattggga tatagtccac tttacaagat 2760 gacggaggaa gagttggaag cttgcaggaa atatctccaa gagaacctcc agaaaggttt 2820 tatagagcca ggatcgaccc cgtgggccgc ccccatcctg ttcgcacgaa aaggcgatgg 2880 aggactccgg ttttgcgtcg attatcgaaa actgaacgcc cttaccaaaa aagacgtctg 2940 tcctttaccg ttaatcgagg agacactggc ccggatctcc aaggctcgat tcttcaccaa 3000 gatcgacatt aggcaagctt tccaccggat ccgcatgaac ccggaacatc gagattacac 3060 aacctttagg acccgatacg ggaccttccg gtataatgtg ttaccttttg gtctcactaa 3120 cgggccggcc acgttccaga agtttatcaa cgagatcctg atggagtatt tggacgattt 3180 ctgttccgct tatatggacg acattctgat ttggagcgag actgaggagg aacaccagac 3240 tcacgtccgg caagtcctgg aaaggctcaa aaaggctgga ctccaggctg acataaaaaa 3300 gtgcgagttc cacgtcacag aaacacgatt cctgggcttt 
attataggca ccaagggcgt 3360 cgctgtcgac ccagataagg tagccgccgt gaagcagtgg gcaccaccca ctaccgtgaa 3420 aggagtgcag tccttcctag ggttttgcaa tttttacaga aagtttgtcc cagaatacag 3480 ccgggtagcc aaaccgctaa cgaatttgac gaaaaaagcg gtgccttttc aatgggacga 3540 acattgccgc gtcgctttcg atacactgaa aaaggcgtta acttccgcac ccgtccttgc 3600 ccattaccaa gcggattacg aaacaagggt agagactgat gcgtccggag gagtggtggc 3660 cggagcgttg ctgcagcaga acccaaacac ccaagaatgg catccgatcg cattcttttc 3720 cgaaactatg cagcaggcgg aactcaacta ccccatccac gacaaggaat tgctagcagt 3780 cgttagggcg ctgaaaacct ggagaccaga gctaatgggc accaaaaaga agtttgtggc 3840 gattactgat cataaggcct tggaatactt ttctacgaaa cggttgctca actctcggca 3900 agcggcctgg gccgacttct tttcccaata taattttgag atcacttacc gccctggcag 3960 cgagaacgtg ctggcagatg ctttgacccg caaggcagag gatgtacaaa atcaaaaaga 4020 acgtcaggaa gcagaacgct acatggctat tttccgtcct gtcgacagtg ctggcgacga 4080 cctttatgtg gtcaacccct tttcatattg gcaaactgtt gccgatggcg gtttcggagt 4140 gctgttggca gcgatcgaac caactggcgc agctggacaa ctgctgttcg ctgaagaggt 4200 gctcagggcc aacgagcatt caccactgct agaacagtac cgaattaagg cccgtaaccc 4260 cgaagagaag cactggacgt tgttgcacga caaacacgta ctgtaccgaa ggagattggt 4320 gatcccaggc gacggggaat ggcccgccaa agtgattcgt gaagttcatg ggactattgc 4380 tacggcccat cctggccgca ataagacccg ccgtttggtg gcacagcagt tttggtggcc 4440 aggcatgagc ggaatggtgg accgctatgt cgccaactgc tcctgtcgat cagccaaagt 4500 tccgagggat aagacccctg ggcttttaca accattaccc gtgccggatc gacaatggag 4560 caccatcgtc gtagatttta aatcaatgcc caagtcgaag tcgggaaacg ataacctgtt 4620 tgtaatgatc gatgcgctga cgaaacggtc gtgggccgtt ccatgcacaa ggacggctac 4680 cgcaaaggat gcagccatga tgtactacga gggtccttac cgcatttatg gtttacccac 4740 gaaagtcgtt tccgacagag gtccgcagtt cgtatcagat ttgatcgacg agatgtccaa 4800 gatcttgcag attaagtgga agctttcgac ggccgggcac tcccaaacag cagggcaagc 4860 agagattatg aacgcctaca tcgaccaaag gttgcgccct catattaacc atttccaaga 4920 cgattgggac aaacgtatgc ccgcaatcga tttggtgcag gcaacgctac cccatgactc 4980 tctcggagga ttttcacctt tcgagattgg aaacggctac ccggcccata 
tgcatttcga 5040 ttggacgcaa aggaccgaac taaagggact gcctacccgt gagcgactta caagaaccga 5100 agctcaagcg attaccaaaa agctcgaaag ctacgtggag gcggcacgca cccacctcca 5160 aatggcgcaa caacgaatgt gcgaccaagc caacaagcac cgccgggaac ccgatttcgg 5220 agtaggaagc gccgtctata ttattaaaaa gcattgggtg acggaccgtc ctagcgataa 5280 actagattac ccgttgacca gatgtagtta tgtgattaag gagaagcggg ggcattccta 5340 ccgtttggag ttgccagact cttggaaggg aagccgcgta tttcatgccg accgtcttcg 5400 gttagaccca cgtgacccat tggacggaca agccgcagaa aggcccgaag gcgaggtaat 5460 cgacgcgtca gaagatacca acgaagagga gtgggaagtc gaacgggtaa tttcctcgcg 5520 ggtgctacga ggtgtgcttc agtaccaagt ccagtggaga ggctgggacc ccgacccgga 5580 gttctacgac gcggaaggat tcaagaacgc ggctgtgcaa ttgcgacagt accataacgc 5640 ctaccccgac aaggccggcc ctcccatgcg cctagaggcc tgggaaaagg ctgccctcca 5700 gaatcggttc gacccgcccg acgagtacga caacgccccg gctacgggga acagcacagc 5760 cagcagattg cgacgtcgcc ggtgaagaca gaacaggaca cagcaagagt cctaaccgct 5820 cgcggggcgg acggggggta a 5841 // ID Mariner-1_AN repbase; DNA; FNG; 1361 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 8.11, Last updated, Version 1) XX DE DNA transposon, Mariner superfamily, Tc1 clade - a consensus DE sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW mariner superfamily; Mariner-1_AN; Tc1 clade; transposase. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-1361 RA Kapitonov V.V. and Jurka J.; RT "Mariner-1_AN, a family of DNA transposons in the Aspergillus RT nidulans genome."; RL Repbase Reports 3(11), 194-194 (2003). XX DR [1] (Consensus) XX CC DNA transposon. Mariner superfamily. Tc1 clade. CC Mariner-1_AN elements are characterized by TA target-site CC duplications CC and 58-bp TIRs. CC The 364-aa Mariner-1_ANp transposase is encoded by a single CC ORF (pos. 173-1264). Its copies are less than 1% divergent from CC the consensus. 
XX FH Key Location/Qualifiers FT CDS 173..1264 FT /product="Mariner-1_ANp" FT /translation="MAIPRPSTPPEAPLEVTEISERSQSSRWLSRDDRIRI FT LTLRDAGFTYQQISSQLGFTYRQVQYTCQNEQSTPRKPPGQRPKLSEEDMD FT NIITFISSSQRTRRLSYKRVIEELNLPCGETALARALKKRGYSRCKALRKP FT PLSDDTKRVRLAWALEHVNWTIEQWNRILWSDETWVTPGFHTRIWVTRRAG FT EELDETCIRSSTPKKRGWMFWGSFYGDTKGPCLFWEKEWGSINAESYCERI FT VPIIDGYLRLNRQQGNYLCLMHDGAPGHASKDTIAELHERSIYPISWPAFS FT PDLNPIEMVWNWMKDWIQERYPDDRQLSYDALREIVRASWDAVPTDFLKGL FT IGSMQARCQAVIEAEGGHTKY" XX SQ Sequence 1361 BP; 372 A; 314 C; 319 G; 356 T; 0 other; ctctggtata ccccggtaag aatgttagtc cggtgggcct tacaaggata tgactcctct 60 gaagcggtga ggccttcaaa gcaccgcttt cttaattcag cctagttttg aagctagtaa 120 aacctccaat atcgccatcc tcctgagatc catggtggtg agcattgcta atatggctat 180 ccccagacct agcacccctc cagaggcgcc tttggaggtg actgagatat ctgaaaggag 240 ccaaagttct agatggctaa gtcgcgatga tcggattcgc attttgactc tacgagatgc 300 tggttttacc tatcaacaga tctcttctca gcttggattt acctatcgtc aggtgcaata 360 tacctgccag aatgagcaat ctactcctcg aaagcctcct ggccagcgcc cgaagctatc 420 agaagaggat atggacaata tcattacctt tatctcttca tcacaacgta cgcgccgact 480 atcttataaa cgagttattg aagaactaaa tcttccctgc ggagaaactg cacttgctcg 540 agcacttaaa aaacgaggct attcccgatg caaagctctt cgaaagccac ctttatcgga 600 cgatacaaag cgtgtacgtc ttgcctgggc ccttgagcat gtgaattgga caattgagca 660 atggaatcga atactttggt ctgatgagac ttgggttact ccaggcttcc ataccagaat 720 ctgggttacc agaagagcag gagaagagct agatgagacc tgtattcgtt cgtctacccc 780 caaaaagcgt ggttggatgt tttggggatc attttatgga gatactaaag gcccttgcct 840 tttctgggag aaagaatggg gctctatcaa tgcagagagt tactgtgagc gaattgtgcc 900 tattattgac ggctatcttc gcctgaaccg acagcaaggt aactatcttt gtcttatgca 960 tgatggagca cctggccatg ccagcaaaga tactatagca gagcttcatg agcgtagtat 1020 ctatcctatt agttggcctg ccttctcccc tgatctgaac cctattgaga tggtatggaa 1080 ctggatgaaa gactggatcc aagagagata tccagatgac cgccagctat cttatgatgc 1140 cctacgagaa attgtacgag cttcatggga tgcagtccct acagactttt tgaaaggcct 1200 tattgggtct atgcaagcca gatgtcaggc agtaatcgag 
gcagagggtg gccatacaaa 1260 atattagtaa gatattagca ttaatacgaa cggcagaatc caaaggagtc atatccttgt 1320 aaggcccacc ggactaacat tcttaccggg gtataccaga g 1361 // ID Gypsy-80_MLP-I repbase; DNA; FNG; 5694 BP. XX AC AECX01001140; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-80_MLP_; KW Gypsy-80_MLP-LTR; Gypsy-80_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5694 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001140; Positions 224391 230084. XX CC Positions [2831-3250] - Reverse transcriptase CC Positions [4493-4972] - Integrase core CC 'GTCCC' target site duplication CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 3818..5593 FT /product="Gypsy-80_MLP-I_3p" FT /translation="MTTKELTRRQARWAEILGSFDFEIRFQPGKQSTKPDA FT LSRRPDLKPEAGDKLTFGRLLKPENLPSDAFIDCVDMIEDWIVDDLPIENE FT INALNSSEEIWTDQEIIEEIKNKSKEDIKINEIIQICRNMPNSKAISNYSV FT SEDVLYYNGKVVVPNNNDIKLKILQSRHDSGLAGHPGRMRTLMLVKRNFHW FT NSMKMYINKYVDGCQSCQRVKARTNRPFGSLQPLPVPQGPWLDICYDLITD FT LPASDGCDSILTVVDRFSKMAHFVPWRKDMNSEELAKLMLHNVWKIHGTPR FT SITSDRGNIFISKLTKEMNTLLGIKTQSSTAYHPQTDGQSEITNKAVEQYI FT RHFVSYKQDDWKDLLPLAEFSYNNNFHVSIGMSPFKANYGFDASLTGTPSN FT KQCLPATEERLKHIKEVQEELKIAMTEAQISMKKQFDRKVQETPTWKKGEE FT VWLSSRHISTTRPTTKLSHRWLGPYKIINRISTNAYKLSLPKEMKEIHPVF FT HVSLLRKFDKSEITGQISAELPPVNIEGQDEYEVEEVLNKRKLRKKTEYLV FT SWKGYGPHHDSWEPEENLINAKEMVEEFNNKYPQAEKTYFRTRRR" FT CDS join(1622..2437,2441..3394) FT /product="Gypsy-80_MLP-I_1p" FT /translation="MHDTRIIDMIYLYDPKTATTKIARALVDSGATHEAVS FT KKYLTQTSFETSELPQKRCVTGFSGHESVVTHTGDYCVNNKKEETTFIVTD FT LRDKYDVILGMPWIRRNHKFIDWERAKLKTEEMTELAAVNTVSSVPTTSLK FT DHILRPEGNARFSDEGVESSNSSHIPPQCEFAFNTIIEENERVSDQDTLLN FT FSPDETTAAHSLDEPKKTSTDHVLRPEREARNIEKGIESISNSQMPLQSES FT NTVPRSSRFGNVGKHFSPCVRYGTAGTRRTFDTVNSQPTKSMINAATTSWN FT VSTCLAVEASKDKPEKTAAELVPECYHEYLGMFEKANSNVLPPHRPYDFRV FT DLIPGATPQAGRIIPLSPKETEVLNEMLDKGLANGTLRRTMSPWAAPVLFT FT GKKDGNLRPCFDYRKLNALTIKNKYPLPLTMELVDSLLDADEFTSLDMRNG FT YNNLRVREGDEAKLAFTCKAGQFEPLTMPFGPTGAPGFFQFFIQDILKAHI FT GRDVAAYQDDILIYTKPGVDHKKVVKEVLDILRRQNVWLKPEKCQFSQKEV FT VYLGLIISRNQIKMDETKVKAVMEWPAPKNLSEVLTFLGFSNF" XX SQ Sequence 5694 BP; 1947 A; 1139 C; 1255 G; 1353 T; 0 other; tattgcaacg tctctaactt agtcaaacgg gaaggagcgg aaggattatt taaagaagaa 60 gaaagttcaa aagaaaatta aagttataat tggaaagcaa aagtttaagc gaattaaagt 120 taagaagatt acaaatctca ggaagtttag aaatcaattc aaagtttaaa gttgaaagta 180 tttagagtga tcacataatc ccattcagaa ggatctgccg agttaaaacc aatcaagtaa 240 ttattcaatt caatctacca aattcttcga aatcatctgc aggtcatcac ccgacatctt 300 gtttcaaaac gccagactac acgatacccc gcagctctac cccgagtgaa gtggaagaca 360 catttgtaga cgcacccgag gtcgaaacga tgtctagtat caatctagaa 
gaagtgatga 420 agaagttaga ggaactgagt gcaaagctag ccgaggagac aaacctttgt caaaaagctg 480 aaatggaagt gttgcaaatg agacaatctc aacaagttgc catggatcac gagcaatctt 540 cgactagtgc gaaccaaccg ccacccagtt tcacaattcc ccagcctgca cctgcaaaac 600 ctaaacctcc caaaattgcg acgccagaca aatttgatgg gtctaaagga ccaaaagccg 660 agatctttat gaatcaatta ggcctgtaca tacaaatgaa tcatcagtca ttttgcaatg 720 accaggcacg tgtcgcgttt gcattgtctt acaccactgg caaggcaaac acttggggtc 780 agatgtttac ggatcaactt ttagatacta caactgccca gttggtcacg tggaagttat 840 ttgtagattc tttcaaggca acattttttg actccgagcg cattccaaaa gctgaagaca 900 acattcgtgc tttaaaacaa accaagactg tcactgatta ttggatccgg ttctccaaat 960 tgtctttgat cgtgaaatgg cccgaatctg tttgaatatc tcaattcaaa caaggtctta 1020 agcgtgaaat aacggtacac atggtcagag atgagtttga taaagtagaa gatgctgcaa 1080 aaattgcaat caagattgac aacaacataa ataagtgaca tcaagacttt tcattgtcaa 1140 atcacactga acaggcaact tcgactacgg taacagaccc ggacgcaatg gattgttctg 1200 cgtataaact caacatatct gaggaagaat attgcaagag aggaagtaat agagcttgtt 1260 attcatgtgg taaaactggt cattatatta gagagtgtcc gatgaggaag agaggaagaa 1320 gtggttattc aaattcaaat ggtcattcaa attcaagtgg ttttagtaat aggtattcaa 1380 atagttatag gggtagagga ggatactcag gtggcagtaa ctcacaaatt gatgaattgg 1440 agactcaact gaaggcacgc ttagatgaga tagatgttca attaggcaaa agtgacaaag 1500 gaaagggtgt tgaaagtaga tcagaacaat caaaaaatgg cgaagctcga gattgagggt 1560 tgtgccaccc tcgagcttag aagatttagg attagaagga aatgttattg atagtttgga 1620 aatgcatgat accagaatta ttgacatgat atacctttat gatcctaaaa ctgccacgac 1680 caaaattgcc cgtgccttag ttgacagcgg tgccactcat gaagcagtga gcaagaagta 1740 tttaactcaa acatcctttg aaacatctga acttcctcaa aagagatgtg taactggctt 1800 cagtggtcac gaatcagtgg taacccacac tggtgactat tgtgtgaaca acaagaaaga 1860 agagacaact ttcattgtca cggacctaag agacaagtat gatgtcattc taggaatgcc 1920 ttggatacga cgcaaccaca aattcattga ctgggagcga gctaagttga agacagaaga 1980 aatgaccgaa cttgcagctg tgaacacagt ttcgtccgtg ccgacaacat ccttgaagga 2040 ccacatactg aggcctgagg ggaacgctag gtttagtgac gagggggtag agtctagtaa 2100 
tagctcacat atacccccgc aatgtgagtt cgcttttaac acaatcatag aggagaacga 2160 aagggttagc gaccaggata ctctcttgaa ttttagccca gatgaaacga cagcagcaca 2220 ctcactcgat gaaccaaaaa aaacctcgac ggaccacgtt ttgaggcctg agagggaagc 2280 taggaacatc gagaagggga tagagtctat aagcaactca caaatgcccc tgcagagtga 2340 gtccaacact gttcctagat cctctcgttt tggaaacgtt ggcaagcatt tttctccgtg 2400 tgtaagatat ggaactgcag gaacccgacg aaccttttga gacacagtca actctcaacc 2460 gaccaaatca atgatcaatg ctgcaacaac atcatggaac gtttcaactt gtctagcggt 2520 ggaagcctcg aaagacaaac ctgaaaagac ggctgccgaa ttagtccctg agtgttatca 2580 cgagtactta ggaatgttcg aaaaggccaa ctccaatgta ttacccccgc accgcccata 2640 tgactttcgc gtagatttaa tcccaggagc aaccccgcaa gctggacgaa ttatcccttt 2700 atccccaaaa gaaactgaag tcctgaatga aatgttagat aaaggattgg ctaatggaac 2760 cctgaggcga acgatgtcac cttgggcggc tcctgtctta ttcacgggga aaaaagatgg 2820 caatttaagg ccttgctttg attatcgaaa attaaatgct ctgacaatca agaataagta 2880 ccctctcccg ctaaccatgg agctagtaga tagtcttctt gatgctgacg agttcacaag 2940 tctggacatg aggaatggat ataataactt acgagttagg gaaggagatg aggctaaatt 3000 agcattcact tgtaaagccg gccagtttga acctctgaca atgccctttg gacctaccgg 3060 ggcgcctgga ttttttcaat tcttcattca ggacatcttg aaagcacaca ttggacgcga 3120 tgttgcagcg taccaggacg acattctgat atacacaaaa cctggagtcg accacaagaa 3180 ggtggtgaaa gaggtgttgg acatactcag aaggcagaat gtgtggttaa aaccagagaa 3240 gtgtcaattt tcgcagaaag aggttgtgta tttaggactc attatttccc gaaatcaaat 3300 caagatggac gagacgaaag taaaggcggt aatggaatgg ccagcgccga aaaacctctc 3360 tgaggtgctg acattcttag gcttctcaaa tttttaaaga cgttttatca accatttctc 3420 agagatagcc cgaccacttc atgaactttc aaaagacaat gtcaaatttg aatggactca 3480 agaatgcgac aacaccttta aaagcctcaa aagatccttc accacagctc cagttttaac 3540 aattgcaaat ccctacaagc cttttattct ggagtgcgac tgctcggact ttgccttagg 3600 agcagttctg tcgcaggtct ctgaagaaga caatgaacta cacctggttg catttctatc 3660 aagatcctta attaaggccg aaaggaacta cgaggttttt gataaagaac ttttagcggt 3720 catttcagca ttcaaggaat ggcggcatta cctcgaaggg aatccgcata ggttgaacgt 3780 gatagtgtac 
acggatcaca agaacttgga gtctctcatg acaactaaag aactgacgag 3840 acgccaagca aggtgggccg agatccttgg aagctttgac ttcgaaattc gctttcaacc 3900 aggaaagcag tcgacaaagc cagacgcgtt atcaagacga ccagacctaa agccagaagc 3960 aggagacaaa ctcacatttg gacgactact taagccagaa aacttaccaa gtgatgcttt 4020 cattgactgc gtagacatga tagaagattg gatagtagat gacttgccta tcgaaaatga 4080 gattaatgca ttgaatagct cagaagaaat atggactgat caggaaatca ttgaagagat 4140 taagaacaag tcgaaagaag atatcaagat caacgaaata atacaaatat gccgcaacat 4200 gccaaactca aaagccatat cgaattactc agtctcagaa gatgttttat actacaacgg 4260 aaaggtagta gttcccaata acaatgatat caaactgaaa atactccaat cccgtcatga 4320 cagcgggtta gccggccacc ctggtagaat gagaacttta atgctggtga aaagaaactt 4380 tcattggaac tcgatgaaga tgtatatcaa taagtatgtc gatggctgcc agtcttgcca 4440 aagggtaaaa gcaagaacga accgaccctt tggaagtcta caacctctac cagttcctca 4500 aggcccttgg ttggacatat gttacgacct gataacggat ctaccggctt cagacgggtg 4560 tgacagtatc ttaacggttg tagacaggtt cagcaaaatg gcacattttg tgccctggcg 4620 aaaggacatg aactctgaag aattagctaa gttgatgttg cataacgtgt ggaaaattca 4680 tggaactcct cgaagtatca cttcagatag agggaacatt tttatctcaa aactaacgaa 4740 agaaatgaac acattgttag gaattaagac ccaatcttca actgcatatc acccgcagac 4800 ggatggacaa tcagaaatca caaacaaagc ggttgagcaa tacatacgcc actttgtgtc 4860 gtacaaacag gacgattgga aagacctatt gcctctagcg gaattttcct acaacaacaa 4920 cttccatgtg tccattggaa tgtctccatt caaggctaac tatggatttg acgccagttt 4980 gacaggaaca ccgagcaaca aacagtgcct accagccacc gaagaacgac ttaagcacat 5040 taaagaagtt caagaggaat taaagatagc aatgactgaa gctcaaatat caatgaagaa 5100 acaattcgac agaaaagtgc aagaaacgcc gacgtggaag aagggagaag aagtgtggct 5160 cagcagtaga cacataagta ctaccagacc aaccactaag ttatctcaca gatggctggg 5220 accgtataag attatcaaca gaatttctac taatgcctat aaattatcat tgccaaagga 5280 aatgaaagaa atacacccgg tcttccatgt cagtcttctc aggaaattcg acaaaagtga 5340 aataacaggt cagatcagcg ctgaattacc tccagtcaac atagaaggtc aagacgaata 5400 tgaagttgaa gaagtattga ataaaagaaa attgagaaag aagactgagt atttagttag 5460 ctggaaagga tatggacctc 
atcatgattc atgggaacca gaggagaact taatcaatgc 5520 aaaagaaatg gtggaagagt ttaacaacaa atatcctcaa gcagaaaaaa cgtactttag 5580 gacaaggaga agatgagaga gagggtgagc ttttttccca ctgggttttt aatgccaacc 5640 cgtggaaaga tatctaactc gtcaaaaggg agtggaggta tagaggggga atgg 5694 // ID Copia-41_MLP-I repbase; DNA; FNG; 4278 BP. XX AC AECX01001336; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-41_MLP_; KW Copia-41_MLP-LTR; Copia-41_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4278 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001336; Positions 75555 71278. XX CC Positions [1800-2300] - Integrase core CC 'CTTTT' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 219..3674 FT /product="Copia-41_MLP-I_1p" FT /translation="MANPTLTTARIKNPRSESHTRESLTWESKPTTLSYSH FT EKLDATGVKPLEAPGPLSNYHQWQFVMGTIIRGSPYGYVIRENQPDPLPST FT DEHDRCKVSALILRYVDSSNYEYLSPFEDDPKSQWNALREAHQEASAGSIM FT FWLKKLITAKMGDQSITAHLDSMSADYQRFKALVTPQRPLTADSVFATAVS FT LSLTSDWQPILAPLLQRDSVTLTTVLKVLREEATRRSISSEDPETISSAKN FT TKQKEISCTHCSKAGHLIDDCRILKSKLDKYEVLKSEKATSEVSKSKPTKN FT QVRRAKAAKAKAASASLISSESDVSIDMNSEISAAAVIKASVGKADGWLVD FT SGTSQIMTPYSDALINKKPDDMMISLANGSNVKAVSKGLISLPFSDFNDIK FT SLYVPSLSEPLLSISKLADKNIVSVFDSEKVSFIKNSQFSGDIIGTGKRQG FT NLYYLDQKVRQSTSPDKTSRASVADVNLWHLRFNHPGYASLEKKLSTLNIK FT VDESDLRKLQTCSICIQGKMRRRNMSSRAGWRSSKPLSIIHSDVSSYSVVS FT RKGFKYFVTFIDDFTRFTKIYLMKSKDEVLSKFKLFHNEMTSKVDGVITEL FT RSDNGTEYINKDFTNFCIEKGITQTTGPPDSPQLNGVAERCNRTVKEKVRC FT CLIESLLPDSFWGDALEYAIETLNNMPTRTNEKFQSPTSLMGLRERQEHTF FT HAFGCKVWVHVIKPNSTLKPRARQAIFLSYLNNNEGAIVWDTEKKQALKAT FT SLVFLDKIFPGLKSNSSSDNFQPKEDNLLPWPSLESNQTPIPTNSPDTDLN FT QEAVHTRPIRSRRQPERYGKYGRAAFIESEKDPATYKKAKKSVNWDLWKLA FT AEAEMESLVGKGTWKLVPRPARRKIIRCKWIFKTKRNVDMSVYKLKARLVA FT LGFSQIKGIDFSEVFSPTSRQESIRLFLTLMAKNKWKGVAVDIKTAFLNGD FT LDEEIYMEHPEGFVNEENPDWVYRILRSLYGLKQSPRQWNIKLHEYLLSIG FT YIRSSHDPSIYYKKNEDGDIKSMLVTHVDDIAVTGTDKEIDNFRTLIKKRF FT EVSSDKPLSHFLSLNISKVKDQKVTVDQTHYISTLEKQFSKFNIKLFNSPN FT DDTFKTLLPSNEDDEISECPYSSLIGGLLWVAQCTRPDISFVVN" XX SQ Sequence 4278 BP; 1399 A; 799 C; 842 G; 1238 T; 0 other; ggttatgagc ccatctcggt taaaactata agaagttcac ctctgaaaac cgtttcgatt 60 caaatcggtt ttgaaagcct acggaaaggg ccgtctgtac aaagacgata aactacatat 120 agcaatcgaa taaactccga aagcttaaaa cgtcatacat tgaagtattc ataaaccttc 180 cttgttcaaa cactaactat cttctcggaa aatccaaaat ggcaaatccg actttgacaa 240 ctgctcgtat caagaatcct cgtagtgaat cacacactcg tgaatcactt acttgggaat 300 caaaaccgac tactctttcg tattctcacg aaaaactcga cgctactggg gtgaaaccac 360 tcgaagctcc tggaccatta tcaaactacc accaatggca gttcgtaatg ggcacgatta 420 ttagaggatc tccctatgga tatgtcataa gggagaacca acctgatcca ttgccatcaa 480 ctgatgagca tgatcgatgc aaggtatcag 
cgttgatact tcgatacgtt gattcctcga 540 actatgagta tctttcacct ttcgaagatg atcccaagag ccaatggaat gcactgcgtg 600 aagcgcatca agaggcatcg gcgggatcga ttatgttctg gttgaagaaa ttgatcaccg 660 cgaaaatggg tgatcaatcg attactgctc atctcgattc gatgtcggcg gattatcaac 720 ggtttaaggc gttagtaact ccacaacgac ctcttactgc cgattcagta tttgcaactg 780 cagtcagctt gtctctcaca tcggactggc aacctatact tgctccactc ttgcaacgtg 840 attcggtaac attgactact gttttgaagg ttcttcgaga ggaagcaact cgccgttcga 900 tctcttcaga ggatcctgaa actatttcat ctgctaaaaa cactaagcag aaggaaatat 960 cttgcactca ttgcagcaag gcaggccatt tgattgatga ttgtagaatt ctgaaatcca 1020 agcttgataa atatgaggtt ttaaaatctg agaaggccac ttcggaagtc tctaaaagta 1080 aaccgactaa aaatcaagtt cgccgagcta aagcagcaaa agcgaaagcg gcttcagcat 1140 cactcatttc atctgaatca gatgtatcga tcgatatgaa ttcagaaata tcggctgctg 1200 cagttatcaa ggcttctgtt ggtaaagcag atggttggct agtcgattca ggaacttccc 1260 aaatcatgac tccttattct gacgctttaa taaacaagaa accagatgac atgatgatca 1320 gtctagcaaa tggttcgaat gttaaagctg tctcaaaagg attaatctca ttaccttttt 1380 ctgatttcaa cgacatcaaa tctctctatg tcccatcttt gtctgaacca cttctctcaa 1440 tttctaagtt agccgataaa aacatagtat ccgtttttga ctccgaaaaa gtatctttca 1500 tcaaaaactc tcaattttca ggagatatca tcggaaccgg aaaacgtcaa ggaaatctct 1560 actacctcga tcaaaaggta cgtcaatcaa catctccgga taaaacgtct agagcttcag 1620 ttgctgatgt gaatttatgg catcttagat ttaatcatcc tggatatgca tctctcgaaa 1680 agaaattatc aacgctaaat attaaggttg atgaaagtga tttgaggaag ctacaaacct 1740 gtagtatttg tattcaaggg aaaatgagac gtcgcaacat gtcgagtcgt gcaggatgga 1800 gatcttctaa acctctcagt attattcatt ctgatgtgtc atcttattca gttgtgtctc 1860 gaaaggggtt caaatatttt gtaactttta ttgatgactt tacacgcttt acaaaaattt 1920 atttaatgaa atcaaaagat gaagttttga gtaaatttaa attgtttcac aatgagatga 1980 catcaaaagt tgatggtgta ataactgagt tgagatctga taacggcacg gagtacatta 2040 acaaggactt tacaaatttc tgcatcgaaa aaggaatcac tcaaaccact gggcctcctg 2100 attcgccaca attaaatggt gttgcagaaa gatgtaatag gactgttaaa gaaaaagtaa 2160 gatgctgttt gatagagtca ttgttaccgg atagtttttg 
gggggatgca ttagaatatg 2220 ctattgagac attgaacaac atgccgacaa gaacaaacga gaagtttcaa tcgccaactt 2280 cacttatggg tcttcgtgaa cgtcaagaac atactttcca cgcttttgga tgcaaagtat 2340 gggttcacgt tataaaacca aattcgacgc taaaacctag agcgcgtcaa gctatttttc 2400 tttcatattt aaacaacaat gaaggagcta tagtatggga tacagaaaag aagcaagctt 2460 tgaaagctac ttctttagtg ttccttgata aaatttttcc tggtctaaaa tcaaattcgt 2520 ccagcgataa ttttcagcct aaagaagata atctacttcc atggccttct ttggaatcaa 2580 atcaaacacc gattccaaca aattctcctg atacagattt aaatcaagaa gctgttcata 2640 ctcgaccaat ccggtcaaga cgtcaacctg aacgctacgg caaatatggt cgagcagctt 2700 ttattgaatc tgagaaggac cctgcgactt acaagaaagc taaaaagtcc gtcaactggg 2760 atctttggaa gttggcggct gaagctgaga tggagtcatt ggtgggtaaa ggaacttgga 2820 aacttgtacc tagacctgct cgaagaaaaa tcatccgatg taaatggatt ttcaaaacga 2880 aaagaaatgt tgatatgtca gtatacaaac tgaaagcacg attagtagct ttaggttttt 2940 ctcaaatcaa aggaatagat ttttcagaag tattttcacc tacttctcgt caagagtcaa 3000 tcagattatt tctcactctt atggcgaaaa acaagtggaa aggcgtagct gtggacatca 3060 aaactgcttt cttaaatgga gatctggatg aagagatcta catggagcat cctgaaggct 3120 tcgtcaatga agaaaatccg gattgggtat atagaatact tagatcttta tatggattaa 3180 aacaatctcc aagacaatgg aacatcaaac tacatgaata tttattgtca attggttata 3240 ttcgctcatc acatgatcct tccatttatt ataagaagaa tgaagatgga gatatcaaat 3300 caatgctagt tactcatgtt gatgacatag cagtgacggg aactgataag gagattgata 3360 actttcgaac tttaatcaag aaacgctttg aagtttcgtc agacaaacca ttatcacatt 3420 ttctttctct taacatctca aaagtcaaag atcaaaaagt aactgtagat cagacacatt 3480 atatttcaac attagaaaaa caattctcaa aatttaatat caaattattt aattcaccaa 3540 atgacgatac tttcaaaact ttacttccat caaatgaaga tgatgaaata tcagaatgtc 3600 cttattcaag tttaatagga ggcttattat gggttgccca gtgcactcgg ccagacattt 3660 catttgtagt taattgactt tcgcaatttt tgaagaaacc ttcatctcaa cactggaatt 3720 cggcgttacg ggtattgggt tatttatgta ggacgaagga attgaaatta actttaggag 3780 gaaatgattt gaatccgaca gcttattcag atgcggattg ggctgaagac cgtcatgaaa 3840 gaagatcaac aacaggttat gtctttatgt tgggagttgg accagtttcg 
tggagatctc 3900 ggaaacaaag gacaacatct ttatcaagca ctgaagctga atatatggcg atgagtgact 3960 cgtgtagaga agcacgttgg ttggtttcgt tattgaatga gttgagtatt tcgaaggata 4020 aagtaataaa gttgtgtgta gataatgaag gcgcagaagc actagcgcag aatccatctc 4080 atcattctcg aacgaaacat attcatacaa gatatcattt tgtacgacaa tgcgtggcag 4140 atggaatagt taaattagta catgtttcat cttcaaggat gctggctgac atgttcacaa 4200 aagggttatc taggttgtta ttaacaaaac ataggttatc attgagtatt ttttaactga 4260 gttgttagca cggggggg 4278 // ID Gypsy-2_MVPL-I repbase; DNA; FNG; 4655 BP. XX AC AEIJ01000741; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Microbotryum violaceum genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_MVPL_; KW Gypsy-2_MVPL-LTR; Gypsy-2_MVPL-I. XX OS Microbotryum violaceum OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Microbotryales; Microbotryaceae; OC Microbotryum. XX RN [1] RP 1-4655 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Microbotryum violaceum genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AEIJ01000741; Positions 25913 21259. XX CC Positions [3530-4009] - Integrase core CC 'CTCGA' target site duplication CC LTRs are 96% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 512..2110 FT /product="Gypsy-2_MVPL-I_1p" FT /translation="MWQHYRRLDNLRQRGGYREFSTARDLQLELGVRLVSD FT TQLVRMLLLHMDEELSQRLRLSDVIKGTGLSPDELDASIIKTDPSTSPADF FT DFQTFDAEARRLWQVISATRASVKATAQASAKIVRPTHTSHTIKSLRTTIS FT PSTPPPLPSEIRPPPMTDRERAFLKSNRGCFKCRKINAGHLSDTCTTWATT FT ACKVPPGWRQGPVSSTQVVSMTDIDEGVDDEGEQLHCLRDDEYDNGTDEER FT CEPIFLPVKLINDDGPVLRALVDTGASASFIADKEVDKLRLTRRKLQVPTS FT VSVAIQSHRVSHPVTEFVCVPLCTLNDRWSTQHVVLKIAPLTSLKLVLGRP FT FLKRHDMLVDCRRRQVLVPDPTLPDERIDLLTDRSASAVQAGVRACLARLE FT EQQAEESYLRAMEAEFRREFSDRFPVDIPPVSQYESKVRHRISLKPGMKRP FT RQPTYGTPMRWRAAWRRLLDEHHAAGRLRPSSSEYSSPAFIIPKKGMDTDP FT SIMPRWVNDYGILNAATVPDRTPLPLPEEILAVSVRA" FT CDS 2951..4636 FT /product="Gypsy-2_MVPL-I_2p" FT /translation="MGVHFNVLSDHESLKYLKTQENLSKRQARWIEQLADY FT NFDITYIPGGKNTVADAMSRYSFPQGQANSIQAVLVMEVNSQLRRRVVEGY FT EADPFCQQVKRNLNSLPGFSCADGVLYFEGRMVVPAVPQLWEDVLHDAHDA FT LGHFGPRKTFQQVSRTFFWPGLRSSCEAYISTCDLCQRTKPATTGPLGVSH FT ALGVPNEPMQEVALDFVGPLPKSQGFDMLLTITDCPSGYTRLIPSRAADTA FT KDVAKRFHEGWHRFFGPPIRMVSDRDKLFASYFWRAYHKLMGTRLAMSTLF FT HPETDGRSERTNKTVIQALQAVVNHQQNDWVRHLGNIEFAINASVNALTKK FT SPFEVVLGCSPRLLPTSLNRPDLPKVPAAKDLIAERKAVLVEVRDALAAAK FT VRQVEQVNRHRRREPEIAVGDLVMVDTRDRRLRFKTGQRKSAKLFDRFEGP FT YKVLATNVATSNYTLQLNEGDRSHPTFHVSKLRPYRANDLSYFPSRGQPRP FT KPVIVDGQEEWVVREILEETTRGKKRFRVWWEEYPQHEATLEPRKHLEGTE FT ALRRWELKKRTEGRV" XX SQ Sequence 4655 BP; 1060 A; 1353 C; 1287 G; 955 T; 0 other; attttttttt gacaacggtc agtacacgtc agcattaccg tcgtcaccct cgccgccgcc 60 gccaggtggg ctacagccct cactaaacct gatccactgt ttatacctct gccgttaagc 120 gattcggaga gcaccaactc gggagcctcc gaacccgcat ccgccgttac cgtgaccgac 180 tccgactctg acgacgaccg ccgccgtccc gccgcgcgca ggatgagcga accgcccgaa 240 gacgcctcgc tggaaactgc acgccccgcc ctacctttgc cgcgccgacc gtaccttatc 300 cggacttcga ctcttctggt ctcgccatga tcgctacttc aagagagaga agagcatcaa 360 gaccgatgag gacaagatcg acacgattgg ccagctgctc ctcggcccgg agctgaaggt 420 gtggtattcc agcgatgccg cgtcacatgc gaaaaagacg tacagcacct tccaacgcga 480 cttgacactt cgcgcgctgc 
cgtccgacta catgtggcaa cactaccgac gtcttgacaa 540 cctgcgacag agggggggct accgcgaatt ttcaaccgct cgtgatcttc aactggaact 600 tggcgtccgc cttgtcagcg acactcagct ggtccgcatg ctcctacttc acatggacga 660 ggaactctct cagcgactcc gcctatccga tgtaatcaaa gggactgggt tgtcacccga 720 tgaactcgac gcttccatta tcaaaaccga cccgagtacc agtcccgccg acttcgactt 780 ccagacattc gacgcagaag ccaggcgact ctggcaagtc atcagtgcca ctcgcgcttc 840 cgtgaaggcg accgcccaag cctcagcaaa aatcgtccgc cctacgcaca cgtcacacac 900 tatcaaatca ctgcgcacta ccatctcacc gtctacgccg ccgccgttgc cttccgaaat 960 ccgaccaccg ccgatgactg accgtgagcg tgctttcctc aagagcaacc ggggctgctt 1020 caagtgccga aagatcaatg ccggccacct gagtgacacg tgcaccactt gggcaaccac 1080 agcttgtaaa gtcccaccgg gttggcgaca aggtcccgtg tcttcgacgc aggtcgtttc 1140 catgaccgac atcgatgagg gagtggacga cgagggcgag cagctacact gcctacgcga 1200 cgacgaatac gataatggca cggatgagga gaggtgcgaa cccattttcc tccctgtgaa 1260 gctcataaac gatgatggac ctgtcttgcg agctctagtt gacacgggtg cttccgcttc 1320 tttcatcgca gataaggagg tggataaact acgattgaca cggaggaagc ttcaggtacc 1380 cacatctgtt agtgttgcaa ttcaaagtca cagggtctcg catcccgtca ctgaattcgt 1440 gtgtgtgcca ctttgtacct tgaacgaccg atggtctact caacacgtcg ttttgaagat 1500 cgcgccgttg acgtccttga aactggtact ggggcgaccg tttttgaagc gacacgatat 1560 gctggtcgac tgtagaaggc gtcaagtgct ggtcccggat ccaaccttac ccgacgagcg 1620 tatcgatcta ctgaccgacc gttcggcgag cgcggtccaa gcaggcgtcc gggcctgcct 1680 cgcccgtctc gaagaacagc aggcggagga gtcttaccta cgggcaatgg aggcggagtt 1740 tcgcagggag ttttctgatc ggttccctgt ggacatcccg ccggtctcgc agtacgagtc 1800 aaaggttcgt caccgtatct cgctcaagcc gggcatgaag agaccgcggc aaccgaccta 1860 tggcactcca atgcgctggc gtgcagcgtg gcgacggtta ctggatgaac accatgcggc 1920 cggtcgtctt cgtccatcct cttcggaata ctcgtcaccg gccttcatca ttcccaagaa 1980 aggcatggac accgaccctt cgatcatgcc gcgttgggtc aatgactacg ggatccttaa 2040 tgcggccacg gtaccagatc gcacaccgct tcccctcccg gaagaaatct tagcggtctc 2100 cgtgagggcg tgattctggt ccaagatcga catgacgaac agtttcttcc agacgaagat 2160 ggccgaggag gacatcccca agaccgcggt 
ggcaactccc tggggactct tcgagtaggt 2220 ggtcatgccg atgggtctgt ccaacgcacc agcaacgcat caacgccggg tcaacgaagc 2280 cttgtcaagt ctcatcggca agagttgttt tgcctacctc gacgacatca caattttctc 2340 gaatagcatt gaggaacatt ggactcacgt caaggaggta ctcgaggcgc tgcgtcgggc 2400 agatctgtgt tgttcgccga agaagaccga acttttccga atgtcttgcg tcttcttggg 2460 acatgtcatt tcacgcgagg ggattgcggc ggatcagtcg aagttcgagc atattctgga 2520 gtggccacgt cctcggactg tcaccgagct gcgaggattc ctgggcctcg ttcagtattt 2580 gcgcaagttc atcaacggcc tggcccaaca taccaaaccc ttggcggatt taacgtcgaa 2640 gaacgcaaat gtccgactca tgtggggtgc tgagcaagag agacacttca acgcgatcaa 2700 gaaaatcgtc acctcactgc cgtgtctcaa gccggtcgac cacacggact ctgccgaccc 2760 actctgggta atgaaggacg cgagcaatgt cgggatcggc gctgtgttgc ttcaggggca 2820 agattggtgt aaggcccatc cggtcgcgta ctggtctcgg cagtacatca gcgcggagat 2880 taactgcccg acacacgaac aggagctgtt agcggtggtg gacgcgttgc ggtagtggcg 2940 cgtcaacttg atgggggtcc acttcaatgt actgtccgac cacgaatcgc tgaaatatct 3000 aaagacgcag gagaacctgt caaaacggca agctcgctgg attgaacaac tcgccgacta 3060 caatttcgac atcacgtaca tcccaggagg caagaacacc gtcgcagacg caatgagccg 3120 atactctttc ccgcaaggac aggccaactc aatacaggcg gtgttggtga tggaagtcaa 3180 tagtcagcta cggaggcggg tggtagaagg ttatgaagcc gatcccttct gccagcaggt 3240 caagcgcaat ctcaattcgt tgccagggtt ctcttgtgcg gatggggtgc tctattttga 3300 gggaaggatg gtggtacctg cggtgccaca gctttgggag gacgttctac atgatgccca 3360 cgatgcgctc ggacacttcg gcccgcgcaa gacctttcag caagtgtcgc ggactttctt 3420 ctggccaggc ctgcgttctt cgtgcgaagc ctacatttca acatgcgatc tatgccaacg 3480 gaccaagcca gccaccacag ggccgttggg cgtgtcacat gccctggggg tgccaaacga 3540 gccgatgcag gaggtggccc ttgattttgt gggccccttg cccaagagcc agggattcga 3600 catgctcctc accatcaccg attgtccgtc aggttacacg cgcttgatac caagcagggc 3660 cgcggatacc gcaaaggacg tcgccaagcg ctttcacgag ggatggcacc gcttcttcgg 3720 cccaccaata cgtatggtct ccgatcggga taagctgttt gcatcgtact tctggcgcgc 3780 gtaccacaag ctgatgggca ctcggcttgc catgtctaca ttgttccacc ctgaaaccga 3840 cggccgcagc gagcgcacca ataagaccgt cattcaggcc 
ttgcaagccg tggtcaacca 3900 ccagcagaac gattgggtcc gacatctggg taacatcgaa ttcgcaatca acgccagtgt 3960 caacgccttg acaaagaagt cgccgttcga ggtggtgcta ggctgttcgc ctcgcttatt 4020 gccaacatct ctcaatcgac ccgacctgcc caaagtgcct gcagcaaaag acctgatagc 4080 cgagcggaag gcagtactgg tggaggtgcg ggatgcgttg gctgccgcca aggtacgaca 4140 agtggagcag gttaatcggc atcgccggcg agaaccggaa attgcggtgg gagatcttgt 4200 tatggtggat acccgtgacc gtcgacttcg atttaagacg ggacaacgca aatccgccaa 4260 actttttgat cgctttgaag gaccttacaa ggttcttgcc accaatgtcg ccacctccaa 4320 ttacacgctc cagctaaatg agggtgaccg atcgcatcct acattccacg tgagcaaact 4380 tcgtccttac cgagccaacg acctaagcta tttcccaagc cgaggacaac ccagaccaaa 4440 gccggttata gtcgatgggc aggaagaatg ggttgtgcga gagatcttgg aggagaccac 4500 aaggggaaag aagcgctttc gtgtgtggtg ggaagaatat ccgcagcacg aagccacctt 4560 ggagccgcgc aagcacttgg aggggacaga ggcgcttcgg cgttgggagc tgaagaagag 4620 gacggaggga agggtttgat tcaggagggg gagag 4655 // ID Gypsy-13_CCO-I repbase; DNA; FNG; 5421 BP. XX AC AACS02000012; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_CCO_; KW Gypsy-13_CCO-LTR; Gypsy-13_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-5421 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000012; Positions 324606 319186. XX CC 'CTTGC' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 128..5257 FT /product="Gypsy-13_CCO-I_1p" FT /translation="MVDSLELRGRTLERTVPVRKARRTRKNQTTTTMDSPQ FT RGRQIIQNDDDPIADEELLKACGLIPAVEGETSLCTDIDDDEPLPELSSIF FT DDSSDDESDEEPFDITPPLIVQTHVDFDNWGTDADVQQNSKMSYNTRMATI FT KDNGKHFPTLTAGKCTIDILDRFQTACENKAIQRGIEDNKVVKHLFGCFED FT HRVITWIAANRATLATVDITEFMKQLKELLLDRDWATEWKRKVSNRRQKET FT ESFEDFANEVIYWNSLLRGTEKIRTNERLKDLLFANCLDEISDEYQNSDIP FT EQFENADFDEIAVFNEWIRKVIGVDKKIARLRALSRKLFEAERATKKPRTN FT NASGPSDGNRAGRRADWVPNLTEDEKDILGKHGGCFKCRQFYAGHRWANCP FT NGYPTLKDYKKITPEMAADAKKKRDNTTNNASAAKGKAVAAVLPSDDFEYE FT DSSSEESGGQNTSGEILEENEDENEDDVSDPTTSIHLLWKACMAGTSDFPE FT DVLTMFDTGCHTVLMDSKVATKHGLKRYPLKSPLPVNPAFITSDDAEPFSL FT TQFVKFDLWSNDNVFVSKRFKAILVDNLCVDILLGLPWLEAHNIVIDCANR FT KCFVKDGYNFIVPQQVSRPKILPRSGRERTQKLRRDWKVMIAELKKVLKPI FT RDLVDQRWERRPKQRFIKAMVGAIKAAVINVELKDKADKLRDEFADLFQPI FT PHASQLPDQVTASITLKDATKAIETRTYSCPRKFRDAWQKLIDEHLKAGRI FT RPSNSPHASPSFIIPKADPSAPPRWVNDYRQLNANSVADKNPLPRIDDILA FT DCGKGKIFSKLDMTNSFFQTRMRPEDVKWTAVTTPFGLYEWLVMPMGFKNA FT PAIHQRRVSEALKQYIGKFCHVYLDDIIIWSKDAATHEKHVRLILEALRKN FT KLYLNGKKSKFFCESVDFLGHIVSKDGIRPDGSKIERVINWPVPKNSSDVR FT RFLGLVRYLGNFLPDLARWTSVLNPLTKKEYDRNWPGWTTEHGRAFEEIKK FT LVVSSDCLTTIDHDNPGENKIFVTTDASEKRTGAVLSFGPTWETARPVAFD FT STPLKKAELNYPTHEKELLAIVNALKKWRADLLGENFYIYTDHKTLEFFQT FT QKELSRRQARWMEYMLQFDGKIVYVKGEDNVVADALSRMPTVDDSAEAQRT FT AKEFFQWEDGQNLSKTSVAVLFTPKGAGIDVCAMTEALASARFKKVEEKPK FT SLSLDDSFVRDLIRGYQLDPWCRKFETAAKGMAAFKKRNGLWFVNDRLVVP FT KYSNCRSIVFQLAHDRLGHFGFEKAYSAIKNEYYWPGMRTDLEKGYIPGCE FT ACQRNKDSTTKRAGPLHPLPVPDGRGESIAMDFIGPLPVDEGYDMLLTITD FT RMGCDIQLIPVKSNISSEELANIFFDKWYCENGMPAHITSDRDKLFTSKFW FT KALHTLTGVKLQLSTAYHPETDGSSERTNKTIIQSLRFWVDRNQKGWVKAL FT PRVRFALMNTVNSSTGFTPFQLKMGRSPRVFPSLIPREGQPRLEDVVKAIE FT KLQNDVSEAQDNLLAAKVQQAHHANKKRGHEPSFEPGDRVWLTTRHRRREY FT LSRDSKRVAKFMPRFDGPYTILDVNKENSTYTLLLPEHSNTHPVFHASLLK FT KCIPNDNEKWPERQHIIPKPIVTETGQEEWEIDRILDRKRAGKGWRYLVRW FT VGHGAECDVWLPGKEVEDCEALDIFLEGLKDVQKSDAEIDV" XX SQ Sequence 5421 BP; 1446 A; 1446 C; 1321 G; 1208 T; 0 
other; cttttttttg cgtatttcca aacgagtcta tcacgccaac tgttgggcgt ggtatcggcg 60 ccttgcgcgc ttgaacgatt gaacgattgg acggtagcga agctacaaga acgacagacg 120 taccagaatg gtagacagcc tcgaattgcg aggaagaacg ttagaacgaa cggtgcccgt 180 gcgtaaagcg cggagaacca gaaagaatca gacgacgacg acgatggact cgcctcagcg 240 gggacgccaa atcatccaaa acgacgacga tcccatcgca gacgaagaat tgctcaaggc 300 gtgcgggcta atacccgccg tcgaaggcga gacttcactc tgcacggata ttgacgacga 360 cgaaccctta cccgaattat cttcgatttt tgatgattct tctgatgatg aaagtgacga 420 agaacctttt gatattaccc cgcctttgat tgttcaaact cacgttgact ttgacaattg 480 gggtactgac gccgacgttc aacagaattc aaaaatgtcg tacaacaccc gaatggccac 540 gatcaaagac aacgggaaac acttcccaac ccttaccgca gggaagtgca cgatcgatat 600 cctcgatagg ttccaaaccg cttgtgaaaa caaggccatt cagagaggca tcgaggacaa 660 caaagtcgtg aagcatttgt tcggatgctt tgaagaccat cgcgtcatca cttggatcgc 720 cgccaatagg gccacccttg cgaccgtcga catcaccgag ttcatgaagc aactgaagga 780 attattgttg gacagggact gggccacgga atggaaacgt aaggtttcca acaggcgtca 840 gaaggaaacc gagtcgttcg aggatttcgc caacgaagtc atttattgga acagcctcct 900 ccgtggcacg gaaaagattc gtaccaacga acgtttgaag gacctccttt tcgccaactg 960 cctggacgag atctccgacg aataccagaa ctccgatatc ccggaacagt ttgagaacgc 1020 cgacttcgac gaaatcgccg tcttcaacga gtggatccga aaagttatcg gcgtcgacaa 1080 gaagatcgct cgacttcgtg cgctgtccag gaaactcttc gaggcagaga gggccactaa 1140 gaagcctcgt acgaacaatg ccagcggacc ttcggacggt aatcgagctg gtcgtcgcgc 1200 tgattgggtc ccgaacctca ccgaagacga gaaggacatc ctcggtaagc atggtggttg 1260 tttcaagtgt cgacagttct acgccggcca tcgatgggcc aactgcccga acggttatcc 1320 gaccctgaag gattacaaga agattacccc cgaaatggct gccgatgcca agaagaaacg 1380 cgacaacacc accaacaatg cctccgctgc caagggcaag gctgtagctg ctgtcttgcc 1440 ttccgacgac ttcgaatacg aggactcctc ctcggaggaa tcgggtggac aaaacacctc 1500 gggagagatt ttggaggaga acgaagatga gaacgaggac gacgtgagtg atccaactac 1560 atcaatccat ttgttgtgga aagcctgcat ggcgggtacc tctgatttcc cggaggatgt 1620 actgacgatg ttcgacaccg gatgccacac agtcttaatg gactccaaag tcgcgacaaa 1680 gcacggattg aagagatacc 
ccttgaaatc ccccttacct gtgaaccccg ccttcatcac 1740 ttccgacgac gccgagcctt tttccctgac gcaattcgta aagtttgact tgtggtccaa 1800 tgataatgtt tttgtttcca aacgtttcaa ggcgattctg gtcgacaatc tttgcgtgga 1860 catcttattg ggtctccctt ggctggaagc ccacaacatt gtaatcgact gcgccaatcg 1920 taagtgcttc gtcaaagacg gttataactt catcgttcca caacaggtca gtcgccccaa 1980 aatcctccct cggtctggga gagaacgaac acaaaagctc cgccgcgact ggaaggttat 2040 gatcgccgag ctcaaaaagg tattgaagcc catcagagat ctggtggacc aacgatggga 2100 acgtagaccg aagcaacggt tcatcaaggc catggtcggt gccatcaaag ctgctgttat 2160 caatgtcgag ctcaaggaca aggccgacaa gttacgagat gagttcgccg acttgttcca 2220 accgataccc catgcttctc aactccccga ccaggttacg gcttctatca cgctcaaaga 2280 cgccacgaaa gccatcgaga ctcggacgta tagctgccct agaaaattcc gcgacgcctg 2340 gcagaagctc atcgacgagc accttaaggc aggaagaatt cgtccatcaa actcgcctca 2400 cgcgtcacct tctttcatta ttccaaaagc cgacccttct gcgccacctc gctgggtaaa 2460 cgattatcga cagctaaatg ccaactcagt tgcggataag aaccctttgc ctcggataga 2520 cgacatcctc gccgattgtg gcaaaggtaa gatcttttcc aaacttgaca tgacgaattc 2580 ctttttccag actcgaatgc gcccggaaga cgtcaaatgg acggctgtga caaccccctt 2640 cggcctctac gagtggttgg tcatgcccat gggtttcaag aacgcgccag ccattcacca 2700 acgcagagta tcggaggcct tgaagcagta cattgggaag ttctgccacg tttaccttga 2760 cgacattatc atttggtcaa aggatgccgc cacccacgaa aaacacgtac gcctaatttt 2820 ggaagccctt aggaagaaca agttgtacct caacggtaaa aagtccaaat tcttctgcga 2880 atcggttgac ttcttaggtc acatcgtgtc gaaagacggc atccgtccag atggctccaa 2940 gatcgagcga gtaatcaact ggccagtccc caaaaattcc tccgacgttc gacgattcct 3000 agggttggtg cgttacctcg gaaactttct tcctgacctc gctaggtgga catcggtctt 3060 gaatcccctc acgaagaagg aatatgaccg caattggccg ggttggacga cagaacacgg 3120 ccgtgcgttt gaagaaatca agaagcttgt agtctcgagc gactgtctta ccaccataga 3180 ccacgataac ccgggtgaga acaagatttt cgttaccaca gacgcttctg aaaaacgcac 3240 aggtgccgtt ctctccttcg gtcctacatg ggagacggct cgtccagtcg ccttcgactc 3300 aactccctta aagaaagcag aactcaatta cccgacccac gagaaagaac tcttggccat 3360 cgtcaacgcc ttaaagaaat ggagagcgga 
tttgctcggt gagaatttct acatttacac 3420 tgaccataaa actctcgaat ttttccagac ccaaaaggaa ctttcgcgtc ggcaggcacg 3480 gtggatggag tacatgctcc aattcgacgg caaaatcgtc tatgtcaaag gagaagacaa 3540 cgtcgtcgcc gacgcactgt cccgcatgcc cacagttgac gacagtgcag aagcccaacg 3600 aacagctaag gaattcttcc aatgggaaga cggtcagaat ttatcgaaga cctccgtggc 3660 ggttctattc actcctaaag gtgcgggtat cgacgtttgc gcgatgactg aagcacttgc 3720 atccgccagg ttcaaaaagg tggaggagaa gccgaaatca ttgtccttgg acgactcctt 3780 cgtcagagac ttgattcgcg gttatcaact cgatccttgg tgtaggaaat tcgaaacagc 3840 cgcgaaggga atggccgcct tcaagaaacg caatggattg tggttcgtca acgacaggct 3900 tgtggtgccc aagtattcca actgtcgctc tatcgttttc caactggctc acgaccgact 3960 gggacatttc ggattcgaga aggcctactc cgccatcaag aacgaatact actggcctgg 4020 tatgcgcaca gacttggaaa agggatacat tcccggatgc gaagcctgcc aaagaaataa 4080 ggatagcacc acgaaacgag caggtccctt acatccactt cctgttccag atggtcgcgg 4140 tgaatccatc gcaatggatt tcattggtcc ccttcctgtc gatgagggct acgacatgct 4200 cttgactatc actgaccgta tgggatgcga tattcaactg atcccggtga agtccaatat 4260 ttcttccgaa gaacttgcta acattttctt tgacaagtgg tactgtgaaa acggaatgcc 4320 cgcccacatt acttcggatc gcgataagct tttcacctct aaattttgga aagctttaca 4380 tacactgacg ggcgttaaac tccagctttc cacggcttat cacccagaga ccgatggctc 4440 gagcgagcgc acgaacaaaa caatcataca aagtcttcga ttctgggtag acaggaacca 4500 aaagggctgg gttaaagctt tacctcgcgt cagatttgcg ctgatgaaca cggtcaattc 4560 ctccacaggc ttcaccccct tccaactgaa aatgggacgc tctccgcgcg ttttcccttc 4620 ccttatccca cgagagggcc aacctaggct ggaagacgtc gtcaaagcca tcgaaaaact 4680 tcagaatgat gtctccgagg cgcaagacaa tctcctggcc gcgaaagtcc aacaagcaca 4740 ccatgccaat aagaagcgcg gacacgaacc ttctttcgag ccgggggatc gcgtttggct 4800 aaccacgcgt catcgacgac gggagtattt gtcgcgcgac agcaaacggg tcgcaaagtt 4860 catgccccgg ttcgatggcc catacacgat cctcgatgtc aacaaggaaa actccactta 4920 caccctactc cttcccgaac attccaacac tcatcccgtt tttcacgcgt cattgctgaa 4980 gaagtgtatt cccaacgaca acgagaagtg gcccgaacgt caacacatca tccctaaacc 5040 tatcgtcacc gaaaccggtc aagaggaatg ggagatcgat 
agaatcctcg atcgcaaacg 5100 agcgggcaaa gggtggagat acctggttcg ttgggtgggt cacggtgccg agtgcgatgt 5160 gtggctgccc gggaaggagg tagaggattg cgaagccctc gacatctttt tggaaggcct 5220 gaaggacgtc caaaaatcgg atgcggaaat tgatgtttaa ttgtattttt tctgtttgtt 5280 tttttttcct ttcgagctac tggggctaac ggttgtcttt ttccccaccg gggtttttta 5340 atgcaccacg tcaaaaatgg gcccctcttt tacgatagcc tttgctacga tacctttcct 5400 ttttttcggg gtagggagga g 5421 // ID copia-2-LTR_AN repbase; DNA; FNG; 136 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 8.11, Last updated, Version 1) XX DE Long terminal repeat of copia-2_AN LTR retrotransposon - a DE consensus sequence. XX KW Copia; LTR Retrotransposon; Transposable Element; KW COPIA superfamily; copia-2-I_AN; copia-2-LTR_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-136 RA Kapitonov V.V. and Jurka J.; RT "copia-2_AN, a family of copia LTR retrotransposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(11), 200-200 (2003). XX DR [1] (Consensus) XX CC It is a long terminal repeat of Copia-2_AN LTR retrotransposon. CC It is characterized by 5-bp TSDs. XX SQ Sequence 136 BP; 29 A; 41 C; 30 G; 36 T; 0 other; tgtcggggca tactgccctt cagtaccgcc tgcccctggg gttcgacctc gattagtcat 60 tatttagtaa gctagttgtt tagataggca ggccattccg gccacccaat gagagaacca 120 tccatctctt cccaca 136 // ID Gypsy-4_CCO-LTR repbase; DNA; FNG; 539 BP. XX AC AACS02000012; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_CCO_; KW Gypsy-4_CCO-I; Gypsy-4_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. 
XX RN [1] RP 1-539 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000012; Positions 1928092 1928630. XX SQ Sequence 539 BP; 100 A; 142 C; 100 G; 197 T; 0 other; tgtcgtgaac cgccggtata gaccctacct tatttttcat tttcgtttca acactggttt 60 gaaccctcga cttcacccgc cggacactac acacccggtt atccgacccg ccggaatgtc 120 taagtcacgc acctgacatt tccgcccaaa gtccctctca ttgttcttcg gaagacttct 180 ctcgtttgac actttacttc actccgtagg cttctcttct ttgacattta ccttccattg 240 atattcctat tggctggtgc actacatatg attcttcatt gttattttca cttcgtattt 300 cacctttgtt tatccgccgg atcgtctaga gtagtcgtac ttatagttga cgcgagtagg 360 atagattcct ttagcttgac ctcttgtctt ataggtacga ctgaccgcct tttagttcaa 420 ttattgttta acttgtctta tagacattct tattgctcaa cgtcgctcat cgcgttctaa 480 ggctccgccg gtttctggta ttggttgtgg tttgggttta gtgttcgctc ttcgcgaca 539 // ID AFUT1 repbase; DNA; FNG; 6914 BP. XX AC L76086; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 11-NOV-1996 (Rel. 1.1, Last updated, Version 1) XX DE Aspergillus fumigatus retrotransposon-like element (Afut1); DE reverse transcriptase; ribonuclease H and endonuclease DE pseudogenes; LTR. XX KW Gypsy; LTR Retrotransposon; Transposable Element; AFUT1; KW Gypsy group; LTR; Retrotransposon Afut1; endonuclease; KW reverse transcriptase; Ribonuclease H. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-6914 RA Neuveglise C., Sarfati J., Latge P.J. and Paris S.; RT "Afut1, a retrotransposon-like element from Aspergillus RT fumigatus."; RL Nucleic Acids Res 24(8), 1428-1434 (1996). XX DR GenBank; L76086; Positions 6 6919. XX CC Comparison of the peptidic sequences with other putative CC polypeptides CC of fungal LTR retrotransposons showed that Afut1 is a member of CC the CC Gypsy group. 
CC Afut1 is a defective element: the putative coding domains contain CC multiple stop codons due exclusively to transitions from C:G to CC T:A. CC 5 bp target duplication. CC 5' LTR 1..282 CC CDS 2228..2777 CC /note="homologue of gene encoding the reverse CC transcriptase in retroviruses and retrotransposons" CC CDS 2778..3489 CC /note="homologue of gene encoding the ribonuclease H of CC retroviruses and retrotransposons" CC CDS 3855..4778 CC /note="homologue of gene encoding the endonuclease of CC retroviruses and retrotransposons" CC 3' LTR 6633..6914. XX SQ Sequence 6914 BP; 2413 A; 1332 C; 1221 G; 1948 T; 0 other; tgtcacatgc tcagaacaac ataagctcag cgagccgcca gcggcaagcc ccctcttact 60 taggtaatta tatacccgat tctaaacaca ctcagtagaa ataggagttc taaatagctt 120 gggggagact aagcagtttg ttctaatcat taacacacat atattttcac taatacatct 180 atacatccct gaaacccagt attcctatcc atagccacag cctagttgcc gtcacatcgc 240 tggctaggac cccctagatt aattacctct aatcacctta tagtttaaac tgcttcccct 300 gcggcctact agatataaat acttaataat atgctaacta gtctaataat agggacttta 360 agggcctact gaaccagcat ctagaatcta gttaagaatt aacctttaag tacagctatg 420 ggtcctctat gagaagcccg cgtagcaatc ccccttctgt ggattgggtg gctagtaagt 480 taaggaccct ggaagtaaat atatcctaac tacttcagac tagcagcata atacctatac 540 ataaccataa tcctagctcc acgctgccag gtattactag taataccgcc ctaataatta 600 tgctagacca ggctagcctt gcttagctta tctaggggtt aagggcagct atagaccagc 660 ataataataa attaaggggc cttaagctag ctgacctagc tatataggat ggccacaggg 720 cctatgctgg cgctaagctg agaatattcc tatataacct ctctaaatac tacctggcta 780 agccctagac atttaataat aacataaata agatcctata cgcgacctcc taacttagca 840 tagatattaa gaatacttag cagaagcagc accctgaggg atataaggct agttataagg 900 aattcctata ggactatatt aaactatagg gggactacca gaacactact gtgtactagc 960 ttctatagct ctagtatagg agatatattt aatctttctt tacagagttg caagagttgc 1020 aaaatataat taagcctagc ctaggcctag agtggttcta gataaggata gctatagaga 1080 aactatatcc taatataaag gcagaagtcc agcacacgcc tagctataat taattatttg 1140 actacctcaa ggcccttgta gcctagcatg 
ataagatcct gtagtaggaa ggataaccta 1200 ctaccttata gttaactagg acaggcaact ataactacca tctgtagaag gcagatgccc 1260 taagtaggtg gaaacagtaa tatagtagct tactaaagac acccctaaga gctgctagcg 1320 acactaaaag gaaggattcc tcctttaata agcacctgct ggggaaacct aataataggt 1380 agggatcatc tagtaataag gactagcttc agaaggaggg atgctgcttt tattataagg 1440 aaaagaggca cctcctccta acctacctaa agagagccta acagagggga ggcactctac 1500 cctaagggcc tcctacctta caataggaac ccaagcccta ggagcttgat gtcttatact 1560 acactaggga accgctagta cctcctgtcc ttgctaaggt tgaaactata gttaatagga 1620 caggtgtaac aactaaggtg cttcttaact ctagggctag gcctaactat atattatact 1680 aatttataat atagaatggg tactaattta aaaatagtac aatgcttcct cctatagaac 1740 ttactaatag cttaaaagta aaaatctata gggaatagtc actaccagta tcagcaatag 1800 atagggaatt aactaagaag acattctaaa ttcccttctt aattactaat attacaaggt 1860 ataatatagt cttaggacaa aattagctcc agcaagccaa cccagacatt aattagaggg 1920 tgagtagctg gtattattac attaactcac tagggattaa gatcctggag ctaaaggagt 1980 tccttaagaa gtcacagggc aaggaccttt tcctctttat aattaattag gctagatata 2040 aagcccctaa tctaggaata ccactgcagt attaggaata cactagggta tttttagaaa 2100 aggaagctag taccctaccc ttagacagag cagtctataa gatatagcta cttaatagga 2160 agtcactact atatagcccc ctatttacta tattatagct agagctagag gctctgtgca 2220 aatatcttga taaaatactt aagaaaaggt ggattaggga atcatctagc ccagcagcag 2280 cactagtact ctttataaag aagctagata gaggcctgtg tctctgcatt aactactagg 2340 ggctgaatgc tattatagtt aagaattaat accccttacc ctagattaat aaattaatag 2400 atcgcatcta gggagtgaag ttctttacta agctagatct gtgggatgct tactattata 2460 tctatatata gcagggtaat aagtagaaga tagcattccg cacctgctat agctaattta 2520 aatatttagt tatgcctttt aggctaacta acacaccagt caccttctag gcttatatta 2580 ataagataat gcagggaatt cttaataagt tctgtattgt ctacctagat aatattctta 2640 tctactcaca gacagaagag gaatatgagt agtatattaa ggaggttctc tagcatctta 2700 atagtatgaa cctatatact aaactattaa aatataagtt ttataagaca gaagttaagt 2760 ttcttagctt tcttataggc taggaagggg tctgggtaga tcctatataa cttagcactg 2820 ttagtaagtg gctggtgcct tacttattct ataatattta 
gatttttctg gggtttatag 2880 gattcttcta atactttatt aaggcatact tacagatcac tattccccta acaaacctgc 2940 tgaagggggt aaagaataga tataatccta ggtcttttac ttagacagag gaggcccaga 3000 aggcattcta ggacctaaag gatgcattca ttaaaaccct aatccttgct cattttaacc 3060 ctaagaagct aatccttctt attactaata tattaggttt tataattata ggtattctcc 3120 tatagcctga cagtaactta atagagacat atagcagcca taattagcgc ctaattacat 3180 tctacttaag aaagctttat aatactaaat aatagtatga ggtatataac caggagttac 3240 ttataattat taagtgcttt aagtactaga gacactacct agaaggtagc tgttacctaa 3300 ttagggtcta gacagactat ataaacctaa cttacttttt tactactaag actttaaata 3360 taagacaggc ctagtgggca gagttgctcg cggcctataa ttttataatt aaatataaac 3420 taagacagct caaccccata gacatactat taaggtacta ggattataag ctaactaggg 3480 atgaagaaac tagagctagc ctcttaccta cacttcagag gaagctgaca ccaggcttat 3540 agttaaacta gaccagcata aataacccta gtatttatag tgtgttagtg ggggcaggaa 3600 gtctagaatt cctattccca agactgctag tgatggaggc tactagccct gaggttactt 3660 aggactcatc aatagcttgc ttaaaggaca ttatccagga actatagtat agggatacct 3720 atatataact agtttctaag tggctgcagg gggtccctaa taatacctag gttaggaact 3780 agtaaatcaa cagtaagggt ctcctgtaat ttaaagatgc tatttatata cctactagta 3840 ctacagttag gcaagagctt ttgaaaatac actataataa cccctacaca gggcatctag 3900 gggtagagaa aatagtggcc ttgctctagt ataaattcta ctaggaaggg atgtaacata 3960 atattaaaga atacatctat atatatgcag catattaaaa gactaagact ccccagcatc 4020 ttactacagg gaagcttgct ttactaccta tactaaaaga accctagagt aatttatcta 4080 tagattttat tacctcacta cccccattag catgaggtca agaagtattc aatgcaattc 4140 tagtaattat taattaatat acaaaaatag ctaaatatat ccctattagg aagataatta 4200 ccacagagga acttgctaat gtgttcttag aaaaggttat cctataatat agtatgctaa 4260 gaagcctggt aactaacagg gacacactat ttactagcta ctactagact aatttctata 4320 gatccttaag gattaagaaa tagataagta tagtatttta ttcacaaact aatagccaga 4380 cagaacgcca gaatcagata ctagaggtat acttaaggat ctttattaac tactagtaag 4440 ataactaggt tatatagtta cagatagcag agtttgcata taactacacc ccatatacct 4500 taacaaaaat cacacccttt aaggctgcct atggctacgt cctagtatta 
ccccagctta 4560 ctaatattac taataattta tcagtagtag tagtagagga atgcctaaaa gtacttcagg 4620 atctgtaaaa ctagctagag agtcacctag tagcagctta ggagacataa gctaggtatt 4680 ataaccagaa gcaccaggac atctcatttt ccattagaga aaaggtgcta ctcttattaa 4740 gaaacttaag aatatggcgt ctacataaga aacttaataa taaatacgac aggctgtttg 4800 aaattattac tattataggg aagtaggcat acaccctgca cttactaaaa ttatatagac 4860 aaatccaccc taccttctat atatccctcc ttaagaagtg gaaccctcag ggcaacagtg 4920 agacaccatc agaatcccag ccccttaata ttaataaaga agaggaatag gaggtagagg 4980 atatactagc agaatagact taacagggag agatataata tctagtcaag tagaaaggat 5040 tccctaacta taagaactca tgggaactag aagacaacct agttaatact atagatatgc 5100 taaatatata taaagagcaa gctagacagg ctgcagagca gccccattaa aggagaggaa 5160 ggagatagcc tttttcctat aaggttttta ctatctacca gctcctacct atcatcataa 5220 cagcataaac cttcctctat tttcctatag agtttagagg tttaatagga ataggtctat 5280 aactatagtc tatataaagc ttgtataaaa tagggacctc actatgctag cagcagtatc 5340 atattataaa aagacagata taggaataat tactactcct tactcttatt actagattta 5400 ctagtaaccc tagcaaagtt ctgaaacacc acagtcttat cctcaggggt gggggtcagg 5460 tgtcctataa tcatatttgc agcaatataa agcttaaata agttataatt aattaattac 5520 agagtataaa tgaccctgcc tatgttatac ttaggcataa agcctattat tacctctatc 5580 tcattgcaca acactataat agccctagta ttatacttag taattagagt ctcctaaaat 5640 atagcaactc agtgaacaag ctataggggg actggcttgc ataacttctt taaacactag 5700 tagtaataat acttataatt aatattcacc caaatatagg gataggccac cccctgctac 5760 cactcctact taatatagtg ctaataatct ataaagggga ggggcatagc gccatcagcc 5820 tacaggatag taggagaggt agatagagag ggtgataggg aggcagaata agagacagac 5880 tcagactagg aataattagc cactatagta gtaggtatag ataaagtaac acaacagcac 5940 tgaacagcag gggaaatagt aattttatta ctagactact tcctcatcag cttagtaggc 6000 ttaacctaag gtaactagac tctgcaaaat aagtacttac actagaacta tctaaacagg 6060 gcaagggaga agcattagcc tactaatact tctccctatg ggagcaagca ggagtttggg 6120 aatacttaat aagtatatta agcagtgtag acacgtcaga ctactagcaa aaaagaaata 6180 gaatattatt aaagggttag aatcaacaag agaaaggggg ggaactacta atatttatac 
6240 tactaagaaa gcaatactgc cagaccctta atacctatta atcttaatca cttaacaaga 6300 ccttactact atactcatta attattcgct cctgacttat tgtaaaggct aaatataata 6360 gcatgttaat aagataatac tataacatca taatagctgg gggaatctaa ctaatcacta 6420 gcatcacacc agacaaagcc tattaaggaa gataataatt ctattaagaa ctaacgcagt 6480 gtattccaac acccttctaa caccttttaa acactgatat aaagcccttt gttaaccgtg 6540 attaatagca aagaactact agtattgatc cttaacagct actttaggat tagcatggtt 6600 caggacaaac ctctctgaag gggggggata gttattatat actcagaaca acataagccc 6660 agtaagctgc cagtggcaag ccccctctca ctcaggtaat tatataccca attccaaaca 6720 cactcagtag aaataggagt tctaaatagc ttaggggaga ctaagcagtt cattctgatc 6780 attaacatac atatattttc actaatacat ctatacatcc ctgaaaccca gtattcctat 6840 ccatagccgc agcctagtcg ccgtcgcatc gctggctagg accccctagg tcgattacct 6900 ccgatcgcct taca 6914 // ID Copia-32_MLP-LTR repbase; DNA; FNG; 339 BP. XX AC AECX01001251; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-32_MLP_; KW Copia-32_MLP-I; Copia-32_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-339 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001251; Positions 140282 140620. 
XX SQ Sequence 339 BP; 86 A; 78 C; 54 G; 121 T; 0 other; tgttgataca caatgcgtat cactgagtac tcgacactta cctgcgattc agcactcaga 60 tagaatgact atacgctgag tcctttcctt gagttctaat tgtaagaact cagttcctga 120 gcaacttctc agcgttaaag caaattgtat ataatgtatc tagaatgctt gtattcattt 180 ctctctctga tcatcagaaa ttaataatat cattagattc atctttgctt tcactcttcg 240 gagtattcca tttcgtgaca ctctcatctt cgcatctctt ctcattcaag tgacagctct 300 ggtttagctg ttctcgcttg tgtgacacga cgctttaca 339 // ID Gypsy-3_AN_LTR repbase; DNA; FNG; 222 BP. XX AC . XX DT 02-JUL-2007 (Rel. 12.06, Created) DT 13-JUL-2007 (Rel. 12.06, Last updated, Version 1) XX DE LTR of LTR-retrotransposon, Gypsy superfamily, Pogo clade. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Alternative name "Dane LTR"; Gypsy-3_AN_LTR. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-222 RA Nielsen M.L., Hermansen T.D. and Aleksenko A.; RT "A family of DNA repeats in Aspergillus nidulans has assimilated RT degenerated retrotransposons."; RL Molecular genetics and genomics 265, 883-887 (2001). XX RN [2] RP 1-222 RA Galagan J.E. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [3] RP 1-222 RA Clutterbuck A.J., Kapitonov V.V. and Jurka J.; RT "Transposable Elements and Repeat-Induced Point Mutation in RT Aspergillus nidulans, A.fumigatus and A.oryzae."; RL Chapter in "The Aspergilli: genomics, medical applications, RL biotechnology, and research methods." Edited by: Goldman GH and RL Osmani SA. Publication expected 2007. XX RN [4] RP 1-222 RA Clutterbuck A.J.; RT "Gypsy-3_AN-LTR."; RL Direct Submission to Repbase Update (02-JUL-2007). 
XX DR [4] (Consensus) XX CC Probable LTR of Gypsy-like "Dane" element (Degenerate Aspergillus CC nidulans element), consisting of a single insertion, flanked by CC LTRs with matching TSDs, in a subsequently duplicated CC subtelomeric sequence. The internal portion is highly CC degenerate. 8 other full-length solo LTRs (95-98% identical to CC the consensus), and 17 fragments exist in the Broad genome CC sequence. XX SQ Sequence 222 BP; 37 A; 70 C; 50 G; 65 T; 0 other; tgtcacgggg ccagcccgag cctcatcctg agccttgatt ctgccgccgc tgaccgcccg 60 gttagctgag atttctggag ctccgactct gatccaaatc ggaacctcga gctacgtctt 120 gtcttgtcta tgcacctgtc tgatagcctg actctgtagc ctgcctgttg tatctactcc 180 gttatcctgt tctgaatata ttcctgagcc tgcaccttga ca 222 // ID Gypsy-61_MLP-I repbase; DNA; FNG; 5809 BP. XX AC AECX01002851; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-61_MLP_; KW Gypsy-61_MLP-LTR; Gypsy-61_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5809 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002851; Positions 7491 1683. XX CC Positions [4478-4957] - Integrase core CC LTRs are 99% similar to each other.
XX FH Key Location/Qualifiers FT CDS join(203..2911,2915..5377) FT /product="Gypsy-61_MLP-I_1p" FT /translation="MSYPGSTKPDYYLDNPEILLRKARKGKQPSTLESPPP FT PPPPSYLRRLEEAAVSGSPAPSDRILYASSTLATSTSEGARVRFSPTNISP FT SPSESTIVRSHFAQLFNREPLDKTPHPPGRFTFETPLVASSSLTTDPTLTT FT QAANITMSTAEETIESLRREVEGLRAQNEDLSEARKETRALQQLVRQLLEK FT QTTSATEHTPGPSFVSPGMAAPANAPSNMFEQYANRTPVSPTPLPRGPATS FT APATTVAHEQGYQSTPAGAAPGPSDNQPGSPERPFSPRHIPSPQYQSVPAA FT SPHFTREPTTYNAPPYHQEHVEMAPSVPISTTADRVKLSDLPKFTGKFGRP FT ADLFHWQELIEETFEVKNVTEDREKLKLLGSLLNEEAMAAWYQSNKETLRQ FT GSWRDAMDMIATGTLPHAWLTDTEEAFRRLEMKPQESFDAYVMRAQSIHRL FT IKRIGNVTDRHLAQYITWGAPQLFRDMVDREKHLKAEPFSFPVFKDAADGI FT WRFLIHSKLLLDPTAKPRAPAPSNSNSSTTINRPSNAPRTKTDEERADNGW FT RYHEYLRRSGICAVCKERCNNPSCTKKSTRFFSVPQDFDAGPRPTRNAPVA FT RANPAGAPTQRPAGRPSSSTPSARVAALDPVSDLLAEDVAAYEEADRFRES FT LIEEEAQQESGCVPVVPASPVILELTCNGQSIRALIDSGAGTNLMSDRMAA FT RLQLARRPLVAPVEVRLAIATGGTPIVLREFALANLKCDAPSIRFGAVFFK FT LAPLGEKYDVILGSPFLSKFHLDVSIHRRCVIQTSSGKIMYERRVLSEMQR FT VVCALQNLEKISEAESLCEKEESMINEFKDLFPEELPPVEQDEMLETFPEG FT MPDPGRQVRHKIVLTDPNVVINEKQYGYPLKHRESWRKLIDQHLKAGRIRS FT HSQYGSPSMIIPKKDPTELPRWVCDYRTLNKFTVKDRTPLPNVDEAVRLVA FT TGKVWSVVDQTNSFFQDRMREEDIPLTAVKTPWGLYEWTVMPMGLTNGPAT FT HQARNEEILGELVGKICVVYIDDIVIFSQTIEEHEEHLRMVLTRLRKAKLY FT CSIKKSKLFRRHIQFLGHDISTLGVCPDEEKVEKISKWSSPTNSKQLLKFL FT GTVQWMKKFIDGLSHYVGTLTPLTSTTLKGKPFKWGTAEEDAFNNIKRLIT FT TLPVLRNLDYDSGEPIWLFTDASGNGLGAALFQGARWDTSSPVAYESRTMN FT AAEKNYPVHEQELLAVINALNKWRLLLLGMRVNVMSDHHSLTTLLTQRNLS FT RRQARWLETLSQFDLDFRYLKGQDNSVADALSRREDVALCEIHASMDQEEL FT QAIRDGYNHDAFCIKLSKVLPLRDDCLLKDGLMYLDGRLVIPSHGTLREDF FT ITQAHDALGHLGTIKTLASLRHTFFWPSMAKDVETRVSSCDSCQRNKARTT FT KTAGRLQSNPIPQQPMESIAIDFVGPFPKISGYDMLLTCTCRLTGFVRLIP FT ACQRDTAERSATRLFNAWSSIFGLPSSIIGDRDKAWTSRFWKQLHDHLGVR FT INLSTAYHPQTDGRSERSNKTVGQILRHFAATKHGKWLESLPVAEFAINSA FT VNSATGVSPFHFVYGRIPRLFPVQGHTDEKDDDVHQWIERRQSEWAAWRDR FT LWSSRVEQALHYNLRRRQGDVINVGDWVLLDSTDRQQIVGGKSRGVGKLRA FT RFDGPYKVLEVLNGGRNFRLQLAPEDLSFPIFHISKIKIYRDTRGDLELGT FT SARK" XX SQ Sequence 5809 BP; 
1581 A; 1508 C; 1370 G; 1350 T; 0 other; ctttttttaa cttgcacact tcaaaagata ttcatccatc atttttctct gtcaacctca 60 aaaattttct gtcattctgt cattctgtct tctgtcaatc ggtcaaccat atcgcagcgt 120 agtaaggttc taccccggtc aactcagtca aagatttctc tgtatcaaac cttgttcatc 180 ctgtcaacca ctgcactgtc gaatgagcta ccccggatcg acaaagccgg attattatct 240 cgacaatcct gaaatactgc tacggaaagc cagaaagggt aagcaaccaa gcaccctcga 300 atctcctcct cctcctccac caccgagcta tttgcgacga ctagaggaag cagctgtctc 360 gggctcacca gcaccatcag accggatcct gtacgcctct agtacattag caacaagtac 420 ctcggaagga gcacgagtgc gattttcacc caccaacatc tcaccatcgc catctgagtc 480 gacaatagta aggtcgcact tcgcacagtt gttcaaccgg gaaccgctag acaaaacacc 540 gcatcctcct ggtcgtttca cattcgaaac accactcgta gcaagctcaa gcttgacaac 600 agaccccacc ctcactactc aggcagccaa cataacaatg tcaacagcag aggagaccat 660 cgaatccctg cgtcgagaag tcgaaggact gcgagcacag aacgaggacc taagcgaagc 720 acgtaaggag acgagggcgc tgcaacaact agtcagacag ttactggaga aacagacaac 780 ttcggctacg gagcataccc caggaccaag ctttgtaagt cctggcatgg ctgcgccggc 840 gaatgcacca tcgaacatgt tcgaacagta cgccaatcgg acgcctgtat cacccacacc 900 tctcccaaga ggtccagcaa cctcggcacc agcaaccacc gtagcgcatg agcagggtta 960 tcaatctacc ccggctggtg cagcaccagg accatccgac aaccaaccgg gttctcctga 1020 gagaccgttc tctccgagac acattccatc accccagtat caatcagtcc cagcagcgtc 1080 tcctcacttt actcgcgagc cgactactta caacgcacct ccttatcacc aggagcacgt 1140 tgaaatggca ccctctgtac cgatctcaac aacagcggat cgggttaaac tcagcgattt 1200 acccaaattc acgggaaaat tcggacgacc tgctgattta ttccactggc aagagttgat 1260 tgaggagaca ttcgaagtca agaacgtcac cgaagatcgt gagaagctta aattgctggg 1320 gtccttgttg aacgaagaag cgatggccgc ttggtatcag tcaaacaaag agactttacg 1380 tcaaggatcc tggagggacg cgatggatat gatagccact ggcacattgc ctcacgcctg 1440 gctcacggac accgaagaag cctttcgaag actcgaaatg aagcctcaag agtctttcga 1500 cgcttatgtg atgcgggctc agagtattca ccggttgatc aagcgaattg gaaacgtcac 1560 ggaccgtcac cttgcacaat acatcacgtg gggtgcacca caactattta gggatatggt 1620 agaccgcgag aaacacctca aggcggaacc attttccttc ccagttttta 
aagatgccgc 1680 cgatggcatc tggcgttttt taattcatag caaacttcta cttgacccta cggcgaagcc 1740 tagagcacca gccccgagca atagcaactc ttctacaacc atcaaccgac cgtccaacgc 1800 accgagaacc aaaacagacg aggaacgtgc tgataacggc tggcgttacc atgagtattt 1860 gaggcgatca ggcatctgtg cagtttgtaa ggaacgatgt aacaatccat cgtgcaccaa 1920 gaaatcgaca cgcttctttt cggtccctca agacttcgac gccggaccca gaccaacgcg 1980 taacgcacca gtcgccagag cgaatcccgc gggtgctcct acccaacgac ccgctggacg 2040 cccaagttct tcgacgccat cagcacgagt cgcagccttg gatcctgtat ctgatttact 2100 agcagaagat gtagcagcct atgaggaggc cgatcggttt cgagaaagct tgatcgaaga 2160 ggaagctcag caggaatcag ggtgcgtacc tgttgtgcca gcatcaccag tcatcttaga 2220 actgacgtgt aacggtcaat caattcgggc actcatcgat tcgggcgcag gcacgaatct 2280 catgtccgat cgaatggcag ctagattgca actggctcgg agaccgttgg tggcacccgt 2340 tgaggtacgc cttgccattg ctacgggagg gacacctatt gtcttacgag aatttgcctt 2400 agctaatctc aaatgtgacg ctcctagcat ccgattcggc gctgttttct tcaagctcgc 2460 accattggga gagaaatacg atgtcatcct gggttcacca ttcctatcta aattccactt 2520 agatgtctct atccatcgcc gctgtgtaat ccaaacatcg agtggaaaaa tcatgtatga 2580 aagacgagtg ttgagtgaaa tgcagagagt agtttgtgcg ctgcagaatt tagaaaagat 2640 cagtgaggct gagagtttgt gtgagaaaga agagtcaatg ataaatgaat ttaaagattt 2700 attccctgaa gaattacccc ctgtggagca ggatgaaatg ctggagacat tcccagaagg 2760 catgccagat cctgggaggc aagtaagaca caaaattgtg ctgactgacc ctaatgttgt 2820 gatcaacgag aagcagtacg gttatcctct taaacacagg gagtcgtgga gaaagttgat 2880 tgatcagcac ctgaaggccg gccgtattcg ttgatcccac agccaatacg gatctccttc 2940 aatgattatt cccaagaaag acccgaccga gctcccaaga tgggtttgtg attacagaac 3000 tttgaacaaa tttaccgtca aggaccggac gccgctaccc aatgtcgacg aagcggttag 3060 gttggtagcc actgggaaag tgtggtcagt tgtcgaccaa acgaattcat tctttcaaga 3120 cagaatgaga gaagaggaca tcccccttac tgcagtgaag actccatggg gactgtacga 3180 atggacagtg atgcccatgg gccttactaa tggaccagct acacaccagg cacgtaacga 3240 agaaattcta ggagaacttg ttggaaagat ttgtgtcgtg tatattgacg acatcgttat 3300 tttttctcaa acaattgagg aacacgagga gcacctacgt atggttctga cacggctccg 
3360 taaagcaaag ctctactgct caatcaagaa gagcaagctt tttcgacgtc acattcaatt 3420 cttgggacac gacatcagca ctctgggagt ttgtccggat gaagaaaagg tcgagaagat 3480 ctccaaatgg tcttccccta cgaactcaaa acaactgcta aaatttctcg gtaccgtaca 3540 gtggatgaag aaattcatcg acggtctgtc acactacgtg ggtacactga caccactcac 3600 cagtacgaca ctgaagggca aaccattcaa gtgggggaca gctgaagaag acgccttcaa 3660 caacatcaaa cgacttatca caactcttcc tgtcctgcgg aatctagatt atgattcagg 3720 agaacccatc tggctattca ccgacgctag cggtaatgga ctgggagcag ctctgtttca 3780 gggtgcaagg tgggacacgt cctcaccggt ggcatacgaa agcagaacga tgaatgcagc 3840 tgagaagaat taccccgtgc atgagcagga gttgttagcc gtgattaacg ctctgaacaa 3900 gtggcgatta ctacttttag gtatgagggt gaacgtaatg tcagatcacc actcgttaac 3960 aacattactg acacaacgta acctgagccg ccgacaagca cgttggttag aaacgctctc 4020 acagtttgat ctggatttcc gatatttaaa aggtcaagat aactcggttg cggacgcgct 4080 gtctcgacga gaagatgtgg ccttgtgtga aattcatgcg tccatggatc aagaggagct 4140 gcaggctata cgcgacggat acaaccacga cgccttttgt atcaagctca gcaaagtgtt 4200 gcctttgcgg gacgactgcc tactcaaaga cgggttgatg tacctagacg gaaggctggt 4260 aataccatcg cacggtacgc tgcgggagga cttcatcacg caagcgcacg acgccttggg 4320 gcacttgggc acaatcaaga ctcttgcaag tctccgtcac actttcttct ggccgagcat 4380 ggcaaaagat gtcgaaaccc gagtgtcctc ttgcgactcc tgtcaacgga acaaggcacg 4440 cacgacgaag acggcaggga gactccaaag taatccaatt ccgcaacaac caatggagtc 4500 aatagccatc gactttgttg gcccattccc gaaaatctca ggatacgaca tgttacttac 4560 ctgtacctgc cgtctgacgg gctttgtgcg attgatcccg gcatgccaac gcgacacagc 4620 agagcgctca gcaactcggt tgttcaatgc atggtcctcg attttcggcc ttccaagcag 4680 catcatcggg gaccgcgaca aggcgtggac ttccagattt tggaagcaac tgcacgacca 4740 tctgggggtt cgcatcaacc tatccaccgc gtaccaccct caaacggacg gtcgaagcga 4800 aagatcgaat aagaccgtgg gtcaaattct acgccatttt gccgcaacca agcacggcaa 4860 atggttggag tcattaccag tagcagaatt cgcaattaat tcagcggtga actcagcaac 4920 tggagtctca ccttttcact tcgtgtatgg ccgaatccct cgcttattcc cagtgcaagg 4980 gcacaccgac gagaaggacg acgacgtgca tcaatggata gaacgtcgac agtccgagtg 5040 
ggcagcctgg cgtgatagac tatggagcag ccgggttgag caggccttgc actacaatct 5100 cagacgccga caaggagatg ttatcaatgt aggagattgg gttctgttag acagcactga 5160 ccgccagcag atcgtgggag gcaagtctcg cggagtaggg aaactacgcg cacgattcga 5220 cggaccatat aaggtcttag aggtgcttaa cgggggccgc aacttccgac tgcaactcgc 5280 acccgaggac ctctcctttc cgatttttca catctcgaaa atcaagattt atcgcgacac 5340 aaggggggac ttggagctag ggactagcgc ccgaaagtaa gtcaccttct tctgtatgac 5400 accgccgccc ctactacacc ttgtcgaaat aaaacatacc atggccacac tgtgagcacc 5460 gcaatatgtc aatgcttgtt acaggtcagg tttccgacaa tggacaacga attcatcacg 5520 attacgggca cgattttttt tcatttttta ctcttatctc tcttaacttg cttttactca 5580 ttaggttacg gattgttttt gttttgattt ttttctttta acttacattc taatttcttt 5640 gggacgagtc atggtcagtt taagggcgcg ttatgccatt ggttacggtg tgattgaaca 5700 actacttcaa gaatttttct ttctcagttt tccttttctc tttttttttt ctttcaattt 5760 tctttacgga tttcttaacg cggatagcat tttttttagg aagggaggg 5809 // ID Copia-2_GDe-LTR repbase; DNA; FNG; 197 BP. XX AC AEFC01001875; XX DT 26-MAR-2011 (Rel. 16.03, Created) DT 26-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Geomyces destructans genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_GDe_; KW Copia-2_GDe-I; Copia-2_GDe-LTR. XX OS Geomyces destructans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Leotiomycetes incertae sedis; Myxotrichaceae; OC mitosporic Myxotrichaceae; Geomyces. XX RN [1] RP 1-197 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Geomyces destructans genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AEFC01001875; Positions 394 590. XX SQ Sequence 197 BP; 60 A; 45 C; 42 G; 50 T; 0 other; tgttgaagga ggaggttgct cagactgatt ggttgggcgt gtggctcgcc aaccgaatcg 60 ggttgcggac cacccacaaa taattccgta gatagcttca ctcagaaaaa caatacaaca 120 catattacag tgaactgttg ttcaactcaa cttgatgact tatactaccg ctgcaagtga 180 tatcatacac ttcaaca 197 // ID TSE5_LTR repbase; DNA; FNG; 370 BP. 
XX AC AJ439554; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Saccharomyces exiguus retrotransposon TSE5_LTR, long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Long terminal repeat; RNaseH; TSE5_LTR; gag; integrase; pol; KW protease; retrotransposon; reverse transcriptase. XX OS Kazachstania exigua OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Kazachstania. XX RN [1] RP 1-370 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439554; Positions 1 370. XX SQ Sequence 370 BP; 139 A; 59 C; 47 G; 125 T; 0 other; tgttgtgatt aatgctaagt tgcaatactt acattctaat aacaaagtag ataagttacc 60 aaataacatt taacttgcta cttccacttt aggaatcaac ttcctaaata ggcactattg 120 gccgctgtag gacaaaaagc ggataacaaa ttttctaaga aaaatttgat gtcatcgaga 180 aaaactaatc gttatataaa agaactcaat gacttatgta cactaaatac tagagtttat 240 acttactttg tacattatat agaagaactc taaccgaaat attttatatt aattgatatt 300 tataaaatat gtcctcaagt tccattaacc atatgtttgg aaatttctct ttagttaacc 360 tgcgtcaaca 370 // ID Gypsy-9_LBS-LTR repbase; DNA; FNG; 598 BP. XX AC ABFE01000442; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-9_LBS_; KW Gypsy-9_LBS-I; Gypsy-9_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-598 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). 
XX DR Genome; ABFE01000442; Positions 93331 93928. XX SQ Sequence 598 BP; 172 A; 153 C; 105 G; 168 T; 0 other; tgttacagac tctttttcgt acatatacgc gcagacactt acatattcct aagacgtatc 60 tagaccatac tggacagttc tagacacgca ctccagccag attccttttg actcatatga 120 ctcatatgac tcttatagac tagatacgac tcgtatgact ctacacaagt catacaaact 180 catacctctg acttgtactt gactcatacc cttagtatat aagtagggta cttctcgttt 240 gtattcggta gcttgaacct aggcaatcta agcttagttc tccctcatca gaccctcaga 300 aactaagatc gccctcgaac tctatagtct ttcgacttag agcctctttt gaaaccttga 360 cgataagtct tcaacgttga agacttaata cacgtttgac ttaacagaag tcagcgtatc 420 ttagaacctt ggactgacct tcgggcgcaa ggctactaga acctaccgtg tcgctttaag 480 ggctaagcga acgttgtagt gacccacttt gtctcaacga cgagaactgt ggtagacgaa 540 cgatcaaccg aactcgaaag cttaacataa ctgcccgtta cgcttaacaa ccgtaaca 598 // ID Gypsy-14_MLP-I repbase; DNA; FNG; 5965 BP. XX AC AECX01001397; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-14_MLP_; KW Gypsy-14_MLP-LTR; Gypsy-14_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5965 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001397; Positions 15439 21403. XX CC Positions [4764-5243] - Integrase core CC 'AATTG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 424..1488 FT /product="Gypsy-14_MLP-I_1p" FT /translation="MEEIQRQLAELSTSLSEERLLRQQAELRSQQAEARIA FT AIEAGRQPNPAAPAASTPMSPLVPQQTPKGPKVSTPDKFNGVRGGPAESFA FT SQVQLYMMAHPYLFPDDRSKVVFTLSYLTGPASSWAQPMTEELFNPETAEN FT VTFQRFIDNFRAMYFDTEKKSKAERAIRALTQKGTVAAYAHEFNLHAHKTG FT WEVSTLVSQFEQGLKREIRVAMVMVQEAFTSVEQIANLAIKLDNKLHGVPD FT VSTRSTAPSVDPNAMDISATYTRLSDDERARRLRSGSCFNCGTQGHIATRC FT PDRRASKPFKGRSGVGNFRTRISELEAQLAVMRGGSSNQIEERGSSNSGVN FT RANDSKNGAAQE" FT CDS 1488..5867 FT /product="Gypsy-14_MLP-I_2p" FT /translation="MKVVPILSADGKEDVIELGSSRILSCNENDPRLFFHT FT SLSIFPNPRATPPKTLEALFLIDSGATHNVLGENFATRHGLMGHATHSSRE FT VTGFDGSKSQSSYEIPIQLDQAPHAGTFIITRLKDNYDGILGMPWIQEHGS FT TIDWRRRKFVTTNSIAAATAASSHPEPPSHNLHLGPRRDARDTDEGTCILD FT DTLQSPQCEFDASIDSHPPETAGKLDFLAEFRTPSDHEDEHRNGCTDSLCQ FT VPENRESHAAANAVSSDPKTPSHNLHLGPRRDARDNDEGTGILYDTLQSPR FT CEYSNLPTKMLPRTAGKLQFSTEFIPHDHDHAESIAAGAPASLYTPHPPPT FT PTGPILEPLGHARKYDEGACVFDNTFQPPRCEFATPSERLTFEAAGKPVYL FT LNNITQVAAANTSWSTSARLAADEKKDTPVRPVEELVPTRYHRFLHMFKKS FT TAQRLPPRRKYDFKVNLIPGAEPQASRIIPLSPAENAALDEMVNTGLANGT FT IRRTTSPWAAPVLFTGKKDGNLRPCFDYRKLNAITVKNRYPLPLTMELVDS FT LLDADRFTKLDLRNAYGNIRVAEEDEDKLAFICKSGQFAPLTMPFGPTGAP FT GHFQYFMQDILLAHIGKDAAAYLDDTMIYTKKGVDHEGSVSDVLEILDKHE FT LWLKPEKCEFSKSEVEYLGLIISHNRIRMDPTKVRAVTEWPAPKNVSELQR FT FIGFANFYRRFIGHFSSLTRPLHDLTKDNVPYLWSTKCNQAFESLKTAFTS FT APILKIADPYKPFLLECDCSDFALGAVLSQRSEDDGELHPIAYLSRSLVQA FT EKNYEIFDKELLAIVASFKEWRHYLEGNPNRLEVIVYTDHRNLESFMTTKQ FT LTRRQARWAEILGCFDFKIKFRPGHQATKPDALSRRPDLKPDEADNLTFGQ FT LIRPENLTDDSFHTDVASVDCFFMDDSIKLHDAEHWFEVDVLGISDTRIEE FT LPTCDDVQPDETIIRNIREKTSTCPRLAELVQAIENPISTKLKSAVANYQV FT RDGVVYNKGRIEVPDDAEIKRQILKSRHDSLLAGHPGRAKTMNLVKRCFTW FT PSMRAYVHRYVDSCDSCLRVKPTTQKPFGSLEPLPIPAGPWTDISYDMITG FT LPPSNGKDSILTVVDRLTKMAHFISCNESMNATELADLMVNHVWKLHGTPR FT TIVSDRGSVFISQVTQELDRRLGIRLHPSTAYHPRSDGQSEIVNKCVEQYL FT RHFVSYRQDNWEQLLPTAEFSYNNNNHSATGMSPFRANYGYNPVYGGVPSH FT EQCVPVVEERLKMLAEVQSELTECLRLSQESMKEQFDRGIRQTPNWQVGDF FT 
VWLKSTNISTTRPSPKLDHKWLGPFSIIKKVSQSAYKLTLPASMKGIHPVF FT HVSLLRKHEPDTIAGRSEPPPPPVEVEGESEWEVFEILDSRIRYKKQEFLV FT QWKGFSSEHNSWEPLENLKNCKELITKFKEQYPDAANKYKRHRRKK" XX SQ Sequence 5965 BP; 1721 A; 1540 C; 1396 G; 1308 T; 0 other; tattgtcaga tctcaataac accgacggac tgtcaaggac cagatcgaaa tccgaaactc 60 tggattagaa accgaagatt gaaaaaattg aaactagatt gataccgatt atccacaaga 120 acaatacacc agatctgata accttaactt tgtttattga acccttcacc gaattcacat 180 tagattgaga gaactgagaa gaaattgata ccgaccttta gaattgatct gctgcaagac 240 gaatacacta gaattaggca aaactttaaa ccttagaatc accaccgact taccaccgca 300 tcccttcgac aaacttccac aacgtctccc acattcagca cctcatcgtt cgacgacgta 360 gaagacattg acaccgaatc tgatactgtt acgtctgctg ttactgccga gtcttcgaac 420 gcgatggaag aaattcaacg gcaactcgcc gaactgagca cgtccctatc agaggagaga 480 ctccttcgcc aacaggctga gctccgaagc caacaggccg aggctcgtat tgccgctatc 540 gaggcaggaa gacagccgaa ccctgcggca ccggcggcct cgactcctat gtcaccgcta 600 gttccccaac agaccccaaa ggggcctaaa gtctccacac ccgataagtt caatggagta 660 cgtggtggtc ccgccgagag ctttgccagt caggtacaat tgtacatgat ggcacaccct 720 tatctctttc ctgacgaccg aagcaaggtc gtattcacgt tatcctactt gaccggcccc 780 gcgagcagtt gggcacaacc gatgaccgag gaacttttta atcctgagac tgccgaaaac 840 gtcactttcc aacgttttat tgataacttc cgagctatgt acttcgacac agagaagaaa 900 tccaaggcag aacgcgccat acgggccttg actcagaaag ggacggtcgc cgcctacgct 960 cacgagttta atcttcacgc tcataaaacc ggctgggaag tttcgacact tgttagccaa 1020 ttcgaacaag gattgaaacg cgagatacgt gtcgcgatgg tgatggtgca ggaggctttc 1080 acttcagtag agcagattgc aaacctcgca atcaaacttg ataataaact gcacggtgtg 1140 ccagatgtct caacccgatc aactgctcct tctgtcgacc ccaacgctat ggatatatcg 1200 gctacttaca ccaggctctc agatgatgaa cgtgcgagac gtcttcggtc cggttcttgc 1260 ttcaattgcg gcacgcaggg ccatatcgcg accagatgcc cagaccgacg agctagcaaa 1320 cctttcaagg ggagatcagg agttggtaac tttaggactc gcatatcaga attagaggct 1380 cagttggcag taatgagagg tggtagcagt aatcagattg aagagagagg gagctcaaac 1440 tcaggtgtta atcgtgctaa tgactcaaaa aatggcgcag ctcaggaatg aaggtcgtgc 1500 ctatcctgag 
cgctgatggg aaagaagatg tgatagagtt aggttctagt agaatattat 1560 cttgcaatga aaatgatcct cgactatttt ttcatacctc tctatccata ttccccaatc 1620 cccgagccac accacctaag accctagaag ccttgttcct cattgattcc ggcgcgacgc 1680 acaatgtgct gggtgagaac tttgccactc gacatggcct gatgggccat gccacccact 1740 caagccgtga agtcacagga ttcgacggat ccaaaagcca gtcatcctac gagattccca 1800 ttcaactcga tcaagcacca catgcgggca ccttcatcat cacaagactc aaagacaact 1860 acgatggcat cttagggatg ccctggatac aagaacacgg ctcgactatt gattggcgac 1920 gccgcaagtt tgtcacgacc aattccattg ccgctgccac cgcagcgtcg tcacacccgg 1980 aaccaccctc tcacaaccta cacttgggac ccaggaggga cgctagggac actgacgagg 2040 ggacttgtat cttagatgat actttacagt ccccgcaatg tgagttcgat gcatctattg 2100 attcccatcc tccagaaaca gctggcaagc ttgatttcct tgcagaattt agaaccccat 2160 ccgaccacga ggacgaacac cgcaatggtt gtacggactc actttgtcaa gtaccggaaa 2220 accgtgaatc acatgccgct gccaacgcag tgtcgtcaga cccgaaaaca ccctctcaca 2280 acctacacct aggacccagg agggacgcta gggacaatga cgaggggact ggtatcctgt 2340 atgatacctt acagtccccg cgatgtgagt acagcaatct acctaccaaa atgttaccta 2400 gaacagctgg caagcttcaa ttttccacag aattcatacc acacgaccac gaccacgcag 2460 aatctattgc ggctggagca ccagcctcgt tgtacacacc acatccgcca ccaaccccaa 2520 ctggtccgat actggagccc ttggggcatg ctaggaaata cgacgagggg gcttgtgtct 2580 ttgataacac ttttcagccc ccgcgatgtg agttcgccac accttctgag agattgacct 2640 ttgaagcagc tggcaagccg gtgtatctct tgaataatat aacacaggtt gcagccgcta 2700 atacctcgtg gtcaacatcg gcaagactgg ccgcagacga gaagaaagat acgccagttc 2760 gacctgtgga agagctggtt ccgacgcgct atcaccgatt cctacacatg ttcaaaaagt 2820 caactgcgca gcggctaccc ccccgtcgaa aatacgactt caaggttaac ctgatcccag 2880 gcgcggaacc tcaggccagt aggataattc cgttatcgcc tgctgagaac gctgccctcg 2940 acgagatggt gaacactgga ttggcgaatg gaaccattag gaggaccacg tcgccgtggg 3000 cagcaccggt cctgtttacc gggaaaaaag acggcaactt acggccgtgc tttgactacc 3060 gaaagctgaa cgccatcacc gtcaagaacc gttaccccct accgttgact atggagctcg 3120 tcgacagtct ccttgacgca gacagattca ccaagcttga cttacgaaac gcatatggca 3180 acatacgagt ggcagaagag 
gacgaggata agttggcgtt tatatgcaag tccggccaat 3240 tcgccccatt gacgatgccc tttggaccaa cgggagcacc ggggcatttc caatatttta 3300 tgcaggatat cttactcgcg catattggaa aagacgcagc agcctactta gacgacacca 3360 tgatctatac gaagaagggt gtggaccacg aaggctcagt atcagatgtg ttggaaatcc 3420 tggataagca cgaactctgg cttaagcctg agaagtgcga attctcgaaa tcagaggtcg 3480 agtatttagg gctcattatc tctcacaata gaatacggat ggacccaact aaggttcgag 3540 ctgtgacgga atggccggct cccaagaatg tatctgagtt gcagcgattc ataggctttg 3600 caaactttta cagacgtttt atcggccact tttcctcttt gacacgtcca cttcacgacc 3660 tgaccaagga taatgtcccg tatctctgga gcaccaagtg caatcaagca ttcgaatcct 3720 taaagacagc tttcacgtca gcccccatac tcaagatagc agatccgtat aagcctttcc 3780 ttttggagtg cgactgctcc gattttgctc taggggcggt actctcacaa cgcagcgaag 3840 acgatggcga gcttcatcct attgcctacc tttcacgttc actggtacag gcggagaaga 3900 actatgaaat attcgacaag gaactcctcg ccattgtcgc gtccttcaag gagtggcgtc 3960 actatttgga aggtaacccc aacaggctcg aagtcattgt gtatactgac cataggaacc 4020 tggagagttt catgacaacg aaacaactca cacgccggca agcacgatgg gcggagatac 4080 tcggatgctt tgactttaaa attaaattca ggccaggaca tcaagccacg aagccagatg 4140 cattgtcgag aagaccggat ttaaaacctg acgaagctga taatttaaca tttggacaac 4200 tgatcaggcc tgagaatctg acagatgact ctttccacac cgacgtcgca agcgtagact 4260 gtttctttat ggatgattca atcaaactcc acgacgctga acattggttt gaggtcgacg 4320 ttttaggtat atcggatacc agaatcgaag aactgccaac gtgcgatgac gtccaaccgg 4380 acgagaccat catcagaaac atacgagaga agacgtcgac ttgcccacgc ctcgccgaac 4440 tagttcaagc catcgagaac cccatatcaa caaagcttaa atcagctgta gccaattatc 4500 aggtaaggga tggtgtggta tataacaaag gacgtatcga ggtcccggac gacgccgaaa 4560 tcaaacgaca gatcctcaaa agccgccacg acagcctgct agcaggccat ccgggtcgag 4620 ctaagacaat gaatctagtc aagcgatgct tcacatggcc atcaatgcgt gcatacgttc 4680 accggtatgt cgacagctgc gactcctgcc ttcgggtcaa accaaccacc cagaagccat 4740 ttggatcttt ggagccactc cccataccag ctggaccgtg gacggatatc agctacgata 4800 tgatcacggg tcttccgcca tcaaatggca aggacagtat attgactgtg gtcgacaggc 4860 tcactaagat ggcccatttc atctcttgta 
atgaaagtat gaatgcgacc gaacttgctg 4920 acttaatggt taaccacgtt tggaagttgc atggcacacc gcgcacaata gtgtcagacc 4980 ggggtagtgt ttttatatca caggtcacac aggagctaga tagacggttg ggcatcagac 5040 tacatccatc caccgcttac caccccagat cagatggaca gagtgagata gtcaacaaat 5100 gtgtcgagca atacttacga cactttgtca gctataggca agacaattgg gagcaactac 5160 tcccaacggc ggaattttcg tacaacaaca acaaccactc tgcgaccggc atgtctcctt 5220 ttcgagcaaa ttacgggtac aatccggtat atggaggagt accatcgcac gaacaatgtg 5280 ttccggtagt agaagaacgg ttgaagatgt tagcggaggt acaatcagag ctaacagaat 5340 gtttacgttt gagtcaggag tcaatgaaag aacaatttga ccgaggtatt cgacaaaccc 5400 caaactggca agtgggggat tttgtgtggt tgaagagcac caacatatca acaaccaggc 5460 cgagcccaaa actggaccac aaatggctag gtccttttag tatcatcaaa aaagtttctc 5520 aatcagcgta taaactgact ctacctgcgt cgatgaaggg gatacatccc gtattccatg 5580 tatctttact acgaaagcat gaaccagaca ctatcgcagg acgtagtgaa ccgccgccac 5640 caccagtaga agtcgaggga gagagtgaat gggaggtgtt tgaaatatta gatagtcgca 5700 taagatacaa gaagcaagag tttttggtac aatggaaagg attcagctca gaacataatt 5760 catgggaacc attagaaaac ctcaagaatt gtaaagaact tatcacgaaa ttcaaagaac 5820 aatacccgga tgcggcaaac aagtacaaac ggcataggag aaagaagtga gtgggcaagc 5880 tttttcccca cggggttttt taacgctgcc cagggacaga atgcaggtct tgcaagcggg 5940 agactgggca ttaaaagggg gatag 5965 // ID TSE1_I repbase; DNA; FNG; 4843 BP. XX AC AJ439547; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Saccharomyces exiguus retrotransposon TSE1_I, internal region. XX KW LTR Retrotransposon; Transposable Element; RNaseH; TSE1_I; gag; KW integrase; internal region; pol; protease; reverse transcriptase; KW internal portion. XX OS Kazachstania exigua OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Kazachstania. XX RN [1] RP 1-4843 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. 
RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439547; Positions 425 5267. XX SQ Sequence 4843 BP; 1762 A; 877 C; 810 G; 1394 T; 0 other; tggtagcgcc gcgaaaactt cgatggataa caatccattg aaaggttctc caagaacacc 60 gatcccaggt agtcaggaac caattaggaa cccaggtgaa actgttaatc caaaagctac 120 tcctatctat aatgaagatc ctagaatttt ttatcaaaaa tttggttctc caatgtttgt 180 caacccagct atgttcgcta atccatttca acaagcggaa tttatgtatc aacagattcc 240 acattgggct atgccaaaca tgttgggaca ccaacaacta cctcaacact ttacaggtgg 300 gcgtattcct tctgcatcat cagaaacagt aaatgaaagc gtaaataaca atcaatttac 360 tcaacaagaa attgctccaa ctgctactct aaaccgaatc aatgatcaat cagagtttag 420 tacatggatc aagaacttta ttatcttttt aagagaacgt aaacttgaac atgttatccc 480 acctgaaaat ggtcaatcca ccaattctgc tactgacgaa gaaaagaaat tcatcactga 540 tgtctttaaa tactttgtat ctccaaaagc ttatcctgct tggttctcca agcaatcaca 600 agaacgtttc attgaacttt atgatgtgat tgaaaacgca ctatacaagg aaacaggata 660 tgataacgaa agatctattg ctaaggagtt aaactccatt tactatgatg gtaaaactga 720 tcctttcttt tttcagaaaa aattgaataa cttacgtaag aaaggtttag aaatcggggt 780 tacaactact gatacaattc tatgtgaaag aattgtcgat catctttcag gtcctttcaa 840 accaattgct gataattacg atcgtaatta tacaaatatt tcattaactg atttattaga 900 tgaaattcaa agagtgtaca ataggcaaat cagggataaa acgtatacaa ataaaagcaa 960 ttcaaattca actgactata aacctacaac caagtcaact ctagaaaaac aagtaattaa 1020 caaatcaact caaaaagctg cttctaaagg tcgcaacaag aaactgttcc atattcaaga 1080 cgttgactca tctattgaag atcttaggca attcagaaaa taatttagat aatttattga 1140 ttgtagactc aggtgcagag atatccgtta tccgcgatat atcttttcta acaggaatta 1200 accataaccc aacagaccga ctattaggtg ctggggacca agaattgaag gttaatgctg 1260 ctggtactct tcgtttaaac ttcggtaaga cggtaattaa aatcaaagct ttagtatctc 1320 cagattcaac atgtaatcta atttcattac atgatttaga aaaagcaggt ttaattatcg 1380 atctacagaa ccgtattgtt ttaaacaaga atcataaaaa aattggtaac atcattgatt 1440 gtggacgtta catatgtcta ccgttaaaac taattactcc 
aaacaaaggt attcacaaag 1500 ttaacaatat ttcaaaaaaa ctacaaatgg cttttctaca tcgtctcttt ggccatatca 1560 atattaaaac aattaaagag tcaatatcta acaaactcat caaaaacatc tcgttaaatg 1620 acgtcgactg gtctcacatc gataaatttc aatgtaccga ttgtatgaaa ggtaaggcaa 1680 ccaaacacaa acacatagtt ggttccagat taaaatacca aaacaattac ggtccattcc 1740 aatacattca ttcagatcta ttcggccccg ttactggtgt ttctgctact tctccttctt 1800 attttatttc atttacggat gaatgcacta gattccgttg ggtataccct ttgcgcacca 1860 aatctgctga atcgatttat aacatattcg atcatttggt aagacaaatc gatactcaat 1920 ttaatacaaa gattttatca ttccatatgg accgtggatc ggaatataca aataccgaaa 1980 tgcaaatgtt ttttcaaaaa catggtataa tcccgattta tagttctact acagactcgt 2040 cttcaaatgg tgtggctgaa cgtagcaatc taacatttct aaacgattgc cgtacgttac 2100 tggtatctag tcatcttcca aatagtttat ggtttaatgc cgttgaattt gctactctta 2160 tgagaaacgc ttttatcaac tctacaaata aaatgtctcc tcgcggaaaa gctggtctcg 2220 ccggtttaga tgcaagtaca attttaccat tcggtcaaga ggttgtggta cataatcata 2280 aggtcaagaa caagttacat ccaagaggta ttaccggata tgcactttcc ccatcaaagg 2340 aatcacatgg atacttgatt tacctacctt ccacaaaaca gattattgat acttctaact 2400 atgtgttggt aaaaaacttg gctaacaaca ctactgccga taattcgatc tttgatgacc 2460 taattacaat ctacgaacaa gacatcgata cttccatctt agatggcact ccaatagact 2520 ctacttataa taataacaat gctgctttgg gtggcactgg tactggtaat ggcttggaag 2580 gttttgatga ttacaacaat gatgctttag gtggtactgg taatggtaat ggtaatggat 2640 tggaaggtat tgatgattac atcaagactg ctttgggtgg ttctggtcct ggtaatggta 2700 atggattgga agttattgat gataataacg agctacttca ttttgacgat gaaaattcgc 2760 ccctaaatga atccaacgat aatattccac ttgatgctct tgaggatatt tcaaatggtg 2820 ataacagcca agaaggaact gctattgatg aatttacaaa ttttactgac gactctcagg 2880 aagctgactt acatccgtta gattcaggag agacaaatca taactcaact ccaattgaca 2940 acacagctgc aaatgacact acaatagtaa ctgacgaaca atctcaaatc ttacctagtt 3000 cgggtggtag aaagaaaatt gaaagtatgc ttaaaaagaa cggcaactca ttatccactt 3060 ttcaaaatcc aagaaaaaga gtaagaagta atataactga caaaagtgaa actagaaaag 3120 atttgcaaga agagaaagat tataaggaaa ctggtattcg accaaagaaa 
ctaaggcgta 3180 gagtaaacta catcaaggct atcaagcatg tcatgcaaac aaaacaacta aatccgtcgc 3240 ttaactactt tgatgctatc gttcataata aatcaacaac tgaaagcaaa gaatacaaag 3300 ctgcttatga taaagaaatc aatcaactca tgaaaatgaa cacatgggat aacaatcaac 3360 tttatgacgc caaggatata ccatcaaaga aaataatcaa ttcaatgttt attttcacca 3420 ctaaaagaga tggtacaaga aaatgtaggt ttgttgctag aggtgatcaa caacatccaa 3480 gtacttatga cgaaaatgca attgcaaaca cagtacatca ctatgcattg atgacatcac 3540 tctcattagc tctagattct aagaaataca ttgtgcaatt agatatctct tctgcttatc 3600 tttatgctga tctatcagaa gaactataca ttagaactcc tccacacatg tccaaaagag 3660 gtaaagtaat gagactaaac aaatcattat atggcttaaa acaatcaggt gctaattggt 3720 acaacacaat caaggaatat ttgatcaaga agtgtaaact acaagaggtt aaaggttggt 3780 catgtgtttt taggaacaaa gacctaacag tttgtttatt cgttgatgat atggtggtaa 3840 cttcttcaaa ccgtgaatta gcaaataaat tcattgacac attgaaaaag aagtttgaaa 3900 caaaggttgt aaatactggt gagatagata accaaggtta tgcttattat gacattctag 3960 gtttagaaat agaatacaag ttcggttcaa agatgaagat tggtatggaa aagtctctac 4020 aatcaaagct aactacatta gacgtcaatc taaatcattc aggcaaaatg cttaaagctc 4080 cagcccctcc aggtactgtg ataactaaat gtgaacctga aacgactgag gatgaataca 4140 agaaagatgt aaaatggtta caaaggataa tcggtttggc ttcatatgtg gcttacaaat 4200 acagatttga tctactattc tatgtcaaca cattggcaca acatactctt tttccaagca 4260 aacaggtaaa aaggttagct acattgatgg tacaatactt atgggatact aaacataaga 4320 aactaatttg gtatgctaag gatacaaatc aaaatgatct tcatgccatt actgatgctg 4380 cattcgctaa cttagatgga tatggttctc aagtcggcta ctttataagg ctcaacaata 4440 aaattattgg tggttcttcg tccagagcaa agttaacttg tacatcatct actgaagctg 4500 aaatatatgc tgttagtcgt gctataccaa tgctagacag tttaaaatta ttaatcccaa 4560 agataagtcc tactaaatta aatgccgaaa taaaatcaga cagcatgtca acaataaaca 4620 ttgcgacatc tgatgacgac aaaaaattca gaaacagatt cttcggtacc aaggcaatga 4680 gaatcaggga tgaagtacaa cagcttgggt tgaatataaa atatattaac acagaagaga 4740 acactgctga cgtcttaact aaacctttat catacaagag atttttaaag ctcacagctg 4800 attggataag ctagaatatc aaacattcaa gatataggtg gta 4843 // ID 
TCN4-LTR repbase; DNA; FNG; 434 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 30-MAR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR retrotransposon - LTR consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW TCN4-LTR. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-434 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-434 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-434 RA Gentles A. and Jurka J.; RT "C. neoformans LTR retrotransposon TCN4."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX SQ Sequence 434 BP; 123 A; 82 C; 73 G; 152 T; 4 other; tgtaagtgca gagcactata ctcatttatt ctcnatgcat gcagtctttg tgcatgcatc 60 ctcctatgca tattgcatat gcaggaattt atgcatngca tttacatgca ggaatttatg 120 cagtgcattt acatgcatgc attcccatgt ataagctgta tgaatattca gcnatatttg 180 ttgtgtatgt antcatccag attatgtaac attccatgtg tataatgtta catttgaaat 240 acttagtaat gacgtggcat ttgatgtgtg taactttact ttctgacaca cagttggtac 300 ttagtaaaat agtagtcgca tattgttgta gtttcttcaa tactcaatcc acatgtatta 360 caaccagtgc atgttttgct agtcaactac acactatact actacgacac tcagtgagac 420 acccagtgtc taca 434 // ID Gypsy-1_GGr-I repbase; DNA; FNG; 7447 BP. XX AC ADBI01000108; XX DT 12-MAR-2011 (Rel. 16.03, Created) DT 12-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Gaeumannomyces graminis genome: DE internal portion. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_GGr_; KW Gypsy-1_GGr-LTR; Gypsy-1_GGr-I. XX OS Gaeumannomyces graminis OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Magnaporthales; OC Magnaporthaceae; Gaeumannomyces. XX RN [1] RP 1-7447 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Gaeumannomyces graminis genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; ADBI01000108; Positions 41076 48522. XX CC Positions [6307-6624] - Integrase core CC 'CTTCC' target site duplication CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1816..3435 FT /product="Gypsy-1_GGr-I_1p" FT /translation="MTVSQIVQDGLTISKDQARSFEKRMIEGSNQFAAKVW FT DVIAKTASNASIRDHAYEEMKKAHEAHRQTIEEMHYRFAHTIGSVEQWAGH FT TEHRVASTEQRLDHIKGQQEAISAYPLTTLEQQQAFAENIIAEVNRRLAKG FT KQVDTATILSRVASRQAFRQTTPAMQAMPPIPVSFAKTAHSKSSAFRSDPT FT MIRANTAPSLAPFALKQTSSMHRQAMREDPAGLMAAIHRQARESSYFQMLG FT MTSGVPPRGISSYRPPAPPGDNPSSSDLNGDGPRQPPGPPRRPRANSLPAG FT RDPSRRHGAIFMVNPVLLKAPPTFDGKKTDFKDWWYMVRAYINAVPQTFTG FT DDHKISWVGSYLKRAALRVHVGKARMLEAHGLPDTWNAYAGFLAKQFRDPN FT KERTDSIKLNALEYKDNITGFFMELTQLNATIKLTGTRFKDFVRKKLPREI FT SRLVFANKGHIPDVDEEFLESFQQAGVTYKLMMADKSTLQKLQTGDQGLKE FT KGKDEPASTSERSNAGRPRDKSSKAPYGQSSNSRTAANKGPTLA" XX SQ Sequence 7447 BP; 2059 A; 2277 C; 1723 G; 1388 T; 0 other; aataatttct catctctccg ggactgcctt gacagcacct atacctctgt taggaagggg 60 cgccatcaat ccaaactact cccaagcacc tacttgcctc catgggctcg attcccctcc 120 tcatcgcgca gatccttgat agcgttgccg cgattatacg ctcttgggca gatgaccaag 180 ccggcaagcc gaaatctgcc cctgtggacg aagagagtct tatcgggtta aatgtttccc 240 cttttcgttg ccagcactgc acccgccctg cgcttaaagg atcgcggtct tgcacttatc 300 atctcacaag gaccgaacgc gaccaacaat cccgctagaa gaccgacttt agcgtcggtc 360 accgtcacaa cggaccccaa caagcccttg caatcggcta ccccgcagcc gcctgctttt 420 gttaataggc caatcgataa gtcccctaat ccgctcgttg ctttaggcag aaaacccacc 480 cctgccttca aaaggctacg agcgccctcc gggtcaatca acccgcacca agtaacctta 540 ccaacagcca 
caggactcgc aattatcggt gaagatcgct tcgtacgagt cgagaggcag 600 caagaagacc tccgacaaaa cctgacggct cgagttgacg tcctgaccga ggactttgac 660 cacgcacgcg actccacacg ttccatcctc acccaagcaa gtcaacaata cgagtccacc 720 ttacagctcc tcgaggacag caagaccgag cgaacgactt acgagggagc tgttgcgcga 780 cgtatggaca ccatcgaaaa gcgcctccgt tacgaagcaa gccaggccct taacagcctc 840 gcccagcaga tcgagtccac acaaggaaac ctctccgccc tgtcacaaca agtccaagca 900 cgaggaaaga ccaccaaaaa acaactcgct gtcctccaag aagaatagag ggagactgtt 960 aggaaaattc ggttgtcctt cgttcggcag aagcaggagc acgacctttt cagacaaaag 1020 cttaatcgac tctcgctcct aacccacgcg gtgaaggcca acaccgagga cacaaacccc 1080 ttcgacgaga ttagggcacg ctataaggac ctccacaaag aaattatcag caactcggaa 1140 cggatccaac aacaaaacaa cgccatcgcc gccctaaagc acagcctcaa cgtggtagcc 1200 gagctttatc aggaccactc taggcagatc tcccgcatca cctcgccaga acccctgcct 1260 acgtacgacg aaagtcgacg cgcctccatc gaaccccttt cgctagacgc aattatggca 1320 gacggcgagc aagcacccaa gccccggggc cgggcctcct ctcagccccc caagaatact 1380 ccagaggcca ccaacggcag caacgcggcc tcttccggga acgacacgtc gggacctgcc 1440 actgaaccac ggaggccccg gttatcccac tcccgcgccc ccagtgagaa tcgaacgagg 1500 caggataccc ccatgttcga cccaaacact ggagccatcg ttttgtacac ggccccaact 1560 ttcgacgagc tttggcagag caacggggga aaacagcttt tgcccatccc ccaagggaca 1620 gattaccagc cccggatccc aggggaccca gcatactctc aatggatgaa ggaaatccat 1680 ggtggccttc aaacgctcca ccaagcagtg ctcgccttgc gcaatagggt acgggaccca 1740 acgcaggtac ccgaactggt gcaatcccaa atggcagagc tacgccagca atacgacctt 1800 acgcgttaaa actacatgac ggtctcccag atcgtccaag acggcctaac catttcaaag 1860 gaccaggcac gctcctttga gaaaagaatg attgaaggaa gcaaccagtt tgcggccaaa 1920 gtctgggatg ttatcgccaa aaccgccagc aacgcctcca ttcgcgacca cgcctacgag 1980 gaaatgaaaa aggcccacga agcccaccgg caaacaatcg aagagatgca ttaccgcttc 2040 gcccatacca tcgggtcggt cgaacaatgg gccggccaca cggagcatcg ggtggccagc 2100 actgagcaaa ggctggacca tataaagggg caacaggagg caattagtgc ttaccccctg 2160 accaccctgg agcagcaaca agccttcgcc gaaaacatta ttgccgaagt caacagacgg 2220 ttggccaaag gcaaacaggt 
agatacggct accatcctgt cccgagtagc ctcccgacag 2280 gcgtttcgcc aaaccactcc ggccatgcaa gccatgcccc caatacctgt ctccttcgca 2340 aaaacggctc attccaaatc ctccgccttc cgatccgacc ctaccatgat aagggccaat 2400 acagccccga gccttgctcc ctttgcccta aagcagacca gttccatgca ccgacaagcg 2460 atgcgtgagg accctgcggg ccttatggca gcgatacacc ggcaagccag ggagagctcg 2520 tacttccaaa tgttagggat gaccagcgga gtccccccga gaggaatatc ctcctatcga 2580 ccaccggccc caccgggcga taatccttcc tcttcggact taaacgggga cgggccacgg 2640 cagcccccag gaccaccgcg tcgcccccga gccaactcat tacccgcggg gagggaccct 2700 tcccggcggc atggcgccat ttttatggtg aaccccgttt tacttaaagc cccgcctacc 2760 ttcgatggca aaaagacgga ctttaaggac tggtggtaca tggtccgggc ctatataaac 2820 gccgtccccc aaacatttac tggcgacgac cacaaaattt cctgggttgg ttcttacctt 2880 aaaagggcag ctttaagggt ccacgtagga aaagccagga tgctggaagc acatggccta 2940 cccgacacct ggaacgccta cgcgggcttt ctggccaagc aattccggga ccccaataag 3000 gaacgaacag attccataaa gctgaacgcc ctcgaataca aggataacat taccgggttt 3060 tttatggagc taactcaact taacgctacc attaagctta ccgggacgcg ttttaaggac 3120 ttcgttagaa aaaagctccc aagggaaata agccgcctcg tttttgccaa caagggacac 3180 atccccgacg tggacgagga gttcctggag tcatttcaac aagccggcgt tacttataag 3240 cttatgatgg ccgacaaaag caccctccag aagcttcaaa cgggggacca aggacttaag 3300 gaaaagggaa aagacgagcc cgcttcgact tcggagcgct cgaacgcggg ccgtccccgc 3360 gacaaaagct ccaaggcccc ctacggccaa tcctcaaact cccgcaccgc agcaaacaaa 3420 ggtccaactc ttgcttaagc ccgcaaggac aacatttggt ccacaaccag ggccgccctt 3480 aatggcatca accagaaaga catcgatgct aggaaagagg aaggcatcaa ttgctggcgc 3540 tgcggtaaaa ataaccacca cacccgaaat tgcttcgcca gaaaagacac cgaagggaag 3600 gacctccctc cggcaccagc gaagaccgcg gccggaagca aaaggcccgc gattgagcct 3660 ccggatacga aggaggaaaa ccaagaaaag cccgccaaga aagttagaat tgctggcctc 3720 gccatacacg atcctaagcc cacttacaac cgctttttcg aagagattga tacagactcc 3780 gactgctaag ggttaagggt ttcgatgaca ccacccaaag gacaatggga aacaaataac 3840 gttacccaga ccgccatctg tgccttccta caggcgcgac tcacaggctc gtcccgggta 3900 atcacgagac ccatagttga catggcccta 
cgcattaaag gccaggaaga aagaacagtt 3960 agggtcctcc tagattgcgg cagcgacaac ggctacctta tttctaccca actagccgca 4020 gaactacagc cacacgttgc caaacgcacc atcccgcggg ttatgtccaa ctttacaggg 4080 cacaaggagg atcccttaac acattatatc ccccgagtca ccctgcgaca ctacgaccac 4140 cggtcttgcg tggaccttga cgtgtcggac ttggacccgg attgtgacgt acttttcccc 4200 gtccaatgga taacacggca ttacccaacc ggaatcttta aacgagacca tgacgacatc 4260 cgcttcgata ccgacgaatg caggtcctgt cgcaaaccca gatttcgcgg ggtttccgac 4320 ccacgagacc cggcaaacat ctccaacacc gaccccgcca caatcgcttt ggttaaagga 4380 atgacgaaaa tagctcccac aatggacagc acagccaacg aattcgacgg tgtccccgag 4440 aagttccgcg agttcctccc tatcatgtca ccagaggcag caatgcgttt acccgaccat 4500 cagccttagg accataagat ccatctaaag gagggagcaa cccttaattg gggacccatt 4560 tacaacctta gcgagcgaga actcggcgtg ctgaaaccat ggcttacaaa aaacctaaaa 4620 gcaggccggt tgtataaggg caaaggcgcg gtcggcgcac ctttcatatt tgccgaaaaa 4680 aaggacccca aggatccttt gcgaccagta atcgactacc gagacgttaa ctccaagacc 4740 attccggaac gatgcccaat cccgttgatt acggagctgc aagaccggct ccgcaacgcc 4800 aaatggatga ccaagatcga ctttaaaact ggcttccacc tgattcgcat aaagccggga 4860 tacgaatgga aaacagcgtt ccgatgcaaa tacggcctgt tcgagttcgc agttatgccc 4920 ataggtctca ttaacgcacc agcgaccttt taaagaataa taaaccacat atttatggac 4980 cttattaacg ccggcctttt agtctacctt aacgacctcc tcatctacgc aaaaacagag 5040 gaggagcacg acgacctcgt taaggaagta ttaaaacgcc tgacccatta caacctcgct 5100 gtcgcccctt gcaaatgcgt ttggggagcc caaaaggtta agttcttggg ccacatcatt 5160 agccaggaag ggatccgaat gttacaggac aaggtcgagg ccatcctcaa ttgggaggcc 5220 ccccgacagc tacgggaatg ccaatcggtg gttggctttg ccaacttcta ccaacggttt 5280 atcgcagggt tttccaaaat tgtcaaaccc ttaacgacag ccaacgccct cttaaagaag 5340 gactgaacct ggaccacgga aatgcaagcc gccttcgaca ctctcaaaca agcatttacc 5400 tcgcccccta tcctgcggca cttcgacccc attttaccag cgattattga aactaacgca 5460 tctaacttcg ccctggctgc cgtattatct taacgagacc cgatcgacgg gaaactctac 5520 ccggtggctt tccactcccg caaaatgacc ccggctaaaa tcaactacga aatccacgac 5580 aaagagttac tcgcgattgt ggatagcttc ggccaatgga 
agcattacct cgagggagca 5640 cagcatcgga ttaaggtttt tacggaccac caaaacctcg cttacttcac cacggccaag 5700 gtccttaacc gaagacaggc gagatgggca caacagcttt ctagctactg gttcctaatt 5760 acgtaccgcc caggcaagca aaacgagaag gccgatgtac tctccaaatt agagcaacac 5820 aggcctgaga aaggggggag tgagaaccag ccgattacga cgattttgca ggacaaacac 5880 tttaatccca ccttataagc accttccttt ttcctctcgt cggcaagact ttgcagcatc 5940 ccaaccatgc gatggaaccc cgcatttaca aacaaaatta ggcaaatcgc ctccttcgac 6000 accaagtacc aggaacattt gaaggccccc cacaaagggg aggaagtcga caatgggctc 6060 ctgtaccacc gctaccgact atggatccca gcacagctgg aacttaagcg cgaaatcttg 6120 aagtccgagc acgacaccaa agtcgccggg cacatgggaa tggacaaaac ccttgagctg 6180 attacccgcc acttttggtg gccagggatt gagggggacg tccgaaatta cgtgagaagt 6240 tgcctggaat gccaacgcaa caaagcaccc aggcatgcac cattcgggct attccagcct 6300 atggagctgc attaccgacc atggaacacc gtggccatgg acttcattac ggacctcccg 6360 gacagccaag gctgcgactc catctgggtc atggtagatc cgtttaccaa aatgggtcac 6420 tttatccccc taaaaaaggg acacaaaaag acggacgacc taatccgcat cttcgcccgg 6480 gagtactgga gattccacgg ggtaccacta gacattattt taaatcgaga tttacgattt 6540 accgcgcatt tgtggcagga cttccttaac ctcgtaggca ttaagcttcg aataagcact 6600 gctttccatc cccagacgga tggataaacg gaaaggctta accaaagcct taaagcctat 6660 ttacggtcct ttgttaacta cgaaatgtcc aattgggtag acttactgcc tatggctgaa 6720 ttcgcttaca acaatacctc tttagcaagt acctccgtgt cccctttttt cgctaattac 6780 ggattcgagc cagctgcgca caaccccccg aggccaggac gtactaagcc ctgcccagcc 6840 agcaaacttt atgcgcactg gatgcacgga gtccataacg aggccaagca acacttcgac 6900 gccgcccggg aacgaatgaa gaaatgggca gacaagaaac gacaagccct gcctgtcttt 6960 acagagggtt aatacgttat gcttaacgcc aagaacatta aaacccgtcg ccccattaag 7020 aaattcgact ccaagctcct cggccctttc cagatcaccc aggtcatttc ccctttagct 7080 atgcgactcg cgctacccct tacatggaaa atccacccaa ccttccatgc ctcccttatc 7140 gagccttacc gcgcgagggt ttagggccag cctgacccgg accaagtcct acgggacgca 7200 ccccccttgc tggacgtatt taaggtcgag gaggttaggg acttacgcca aaaggaagga 7260 gccgtacaat acctcgttaa atggttgggc ttcgacaaga agacggagat 
aacgtgggag 7320 ccctggaaga actttgaggg agaagcagcc aaggaggaag tcagggactt ccaccggaaa 7380 cacccgcgga aaccccgcga cccccgcgtc tagggcccgg gtcgtgccct taggaagggg 7440 aggattg 7447 // ID Copia-2_PPM-I repbase; DNA; FNG; 4212 BP. XX AC ABWF01005750; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_PPM_; KW Copia-2_PPM-LTR; Copia-2_PPM-I. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-4212 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01005750; Positions 46588 42377. XX CC Positions [1592-2110] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 5..2797 FT /product="Copia-2_PPM-I_1p" FT /translation="MSPTDTYEITLGSNESPVSRLPKLAEDGSNWVLFKAQ FT FKATVSSKGLLRFLEGRNKIPIELTAPGVDSDADENYENALDIWTGKHEAI FT HALLFQTIPETMKLRILSLPKASDAWNLVCKQYENQGEFVQMDILTRMHAL FT TTENGSDPRPTLNELQRLCTEYAAAGGSLDDAGYKAIILKCLPLSYRGVVR FT TILSSAHLIALATSVPTPGTSPSPPGTTSATPCSGLSPTALVEQIYAIARD FT EFALAGPVLANDSALVADSSNKCLNCGRAGHTKPNCWSKGGGKEGQGPRSK FT ARREKAATDAHESAKAAIAELDNDGDVFAFKAIDLPITDIHLRSATGNVPR FT LWDSGATRHLHPIRTDFANFRSIHPKPIRTADGRVFYTTGEGDVLLPTEYK FT GSKMKVKLQDVLYTLSIPHSLISVSRATKAGFSVRFEPDGCHLIAPNRRTL FT SLIPAHQGLYSTSTDSSAQEGALSAEVMSVSLHELHCRMGHAYSPALLKMV FT RAGVVKGIRLSDESSVFCNSCMRAKHAHEPFPKQRSSPRATKYGERIHTDV FT WGKAPVATLGGKEYYVLFLDDHSDEAIVHFLRRKSDTFGRYKAYRAWAKTQ FT RSAEHAKELQCDRGGEYLSEEYKAYLEGEGTIRRLTVHDSPQQNGKAERLN FT RTLTEHARAMLIDASLPKFLWGEAVSHAVWLRNRTTTSNTPSATPHELATG FT NKPNLAGLPRFGATVWVRIDPATKLDVKSKRGRWVGFDLQSKGHRVYWPDK FT RTVGMERDVHFEPDVSVAAVDVPLEGEQLLHDQHPPAEPVSQHAPGTDVVP FT VPESAIDGDANHDCDPPPHILPEPRAQRARKPSQWVKDLQSGTGTTGGRGA FT QKVPSSILEHGSLAADESSTYALAAMPGDEPSHHEALSGPEREAWLDSMHE FT 
ELARIESMGTYELVPPPPPGTNIVGST" FT CDS 2964..4199 FT /product="Copia-2_PPM-I_2p" FT /translation="MTGRFIKSTSKMLTSMETSTRLSTCVNPPGFKVPGRD FT DWVWKLLKALYGLKQAGLMWYEKVCELFAELGLTRSKHDYGVFFLFRIGDV FT IIIVIHVDDCTLVTSSKGLMIKLKSDLGSQYEIVDLGEARWLLGFEIQCDR FT HTRTLTLSQAGYIATLLERFRMTDAYALSVPLDPHVNLFDVELTAKDRSEM FT ATRPYARLVGSLMYTAIGTRPDVAFTVSLLARFMSDPVPVHWDAAKRVLRY FT LKGTRDLRLTFSGSDEGLIGFTDADWCSLPHRHSISGYVFTFSGGAVSWRS FT RKQPIILLSSTEAEYVAASEAGRELLWLRYLIGELTHPLQKATPLRCDNQS FT AIAIVDSGLLHARTKHIDIRFRFIQYVQDSGAASITYIPTGDMIANILTKA FT LPRSKIGKLVALLGLRLA" XX SQ Sequence 4212 BP; 958 A; 1268 C; 1105 G; 881 T; 0 other; ggttatgagc cctaccgaca cttacgagat cacgctcggc tccaatgagt cgcccgtcag 60 ccgcctcccc aagctcgctg aggatggatc gaactgggtc ttgttcaagg cccagttcaa 120 ggcgacggtg tcgtccaagg gcctattgcg gttcctcgaa ggtcgcaaca aaatacctat 180 cgaactgaca gctccaggcg tcgattcaga tgctgacgaa aattacgaga atgctctcga 240 catctggaca gggaagcatg aagccattca cgcactcctt ttccagacca ttccggagac 300 aatgaagctt cgcatccttt ccctgccgaa ggcctcggat gcctggaacc ttgtctgcaa 360 acaatacgaa aatcaagggg agtttgttca gatggacatt ctcacccgta tgcacgccct 420 caccaccgag aatgggagcg accctcgacc gacgctcaac gagctacagc ggctatgcac 480 agagtacgcc gccgccggag gctccctcga tgacgcaggc tacaaggcga tcatcctcaa 540 atgcctgccg ttgagctaca gaggtgtggt acgcaccatc ttgtcttctg cacacctcat 600 agctttggca acatcggtgc ccactccggg tacatcacct tcaccgcctg gtaccacctc 660 cgcgacacca tgctctggcc tatctccaac tgcactggtt gaacaaatct acgcaatagc 720 gcgagatgaa ttcgccttgg ccggccctgt tctcgccaac gactctgctc ttgtagcaga 780 cagctccaat aagtgcttga attgcggacg cgctggccac accaagccaa attgctggag 840 taaaggtgga gggaaggagg gccagggacc taggagcaaa gctcgccgtg agaaggccgc 900 aacggatgcg catgagagcg cgaaagccgc catcgctgag ctggacaatg acggtgatgt 960 cttcgcattc aaagcaatag acttgcccat cactgacatc cacttgcgct ctgcaaccgg 1020 caacgtacct cgcctctggg actcgggcgc aacacgacac ttgcacccca tacgcacaga 1080 ctttgcgaac ttccgcagca tccatcctaa gcctatccgg acggctgatg gtcgggtgtt 1140 ttacactacc ggagagggag atgtcctact gccgactgaa tacaagggca 
gtaagatgaa 1200 ggtgaagctc caggatgttt tgtacacact gagtattccc cactccctta tctccgtcag 1260 ccgggccacc aaagctggct tctccgtccg ttttgaaccc gacggctgtc atctcatagc 1320 ccccaacaga cgcaccttgt cgctcatccc cgctcatcag gggctgtact ctacctcgac 1380 agatagcagt gcacaggagg gagcgctcag tgcggaggtc atgtccgtgt cactgcatga 1440 gctgcattgt cgcatgggcc atgcatacag ccccgctttg ctcaaaatgg tccgtgcggg 1500 tgtggtcaaa ggtatcaggc tgtcggacga gtccagcgtc ttctgcaact catgcatgag 1560 agccaaacac gcccatgagc ctttcccaaa gcagcgctcg agcccacgtg ctaccaagta 1620 cggtgagcgc attcacacgg atgtctgggg aaaggcgcct gtggctactc tcggcggcaa 1680 ggaatactat gtcttgttcc tcgatgatca ctccgacgag gccatcgttc actttttgcg 1740 acggaaaagt gacacgtttg gccgttacaa agcatatcgg gcttgggcta aaacacaacg 1800 tagcgccgaa catgcaaagg agctacagtg cgaccgcggg ggcgagtatc taagtgaaga 1860 atacaaagct tacctcgagg gggagggaac catccgccgg ctcaccgtac acgactcgcc 1920 ccaacagaac ggcaaagccg agcgcctgaa ccgcaccttg actgagcatg ctcgcgctat 1980 gctcattgat gccagtctcc cgaagtttct ttggggtgag gctgtctctc acgctgtgtg 2040 gcttcgcaac cgcacgacca caagcaacac tcccagtgca accccgcatg agctcgccac 2100 gggcaacaaa cccaatctcg ccgggctacc tcgctttggc gccacagtgt gggtacgcat 2160 cgaccctgca accaaactcg atgtcaagtc gaagcgcggc cgttgggttg gttttgacct 2220 gcaaagcaaa ggtcaccggg tctactggcc ggacaagcgg actgttggca tggaacgtga 2280 tgtccatttt gagccggacg tttctgtagc tgcagtcgat gtgccgctcg agggggagca 2340 attgttgcac gaccagcacc ctcctgccga gccagtcagc caacatgcac ccggaacaga 2400 tgttgtacct gttccggaat cagctattga tggcgatgca aaccacgact gcgatccacc 2460 tccacacatt ctgcccgagc caagggcaca acgtgcgcgc aagccgtcgc aatgggtcaa 2520 agatctacaa tcaggcacag gtaccacagg cgggcgggga gcgcaaaagg ttccctcatc 2580 aatactggag catggaagcc ttgcagctga cgaaagcagc acctatgccc tcgctgctat 2640 gccgggagat gaaccgtctc atcatgaagc actgagtggc cccgagcgag aggcctggct 2700 ggactcaatg cacgaggaac tcgcacgcat tgaatcgatg ggaacttacg agcttgttcc 2760 gccgccgccg ccaggcacca acattgtcgg gtccacatag gtgctgcgaa agaagcgaga 2820 tgagaacaat aacattgcta agttcaaatc tcgcctctgt gcgcagggct tttcgcaggt 
2880 gcatggtgtc gactacatgg acaccgccgc ccccactgct cgccttgagt ccctccgcct 2940 cgtactagct ctcgcagccc agaatgactg ggagattcat caaatcgact tcaaaaatgc 3000 ttacctcaat ggagacctcg acgagactat ctacatgcgt caaccccccc ggattcaaag 3060 tgcccggacg agacgactgg gtctggaagc ttctcaaggc attatacggc ctcaaacagg 3120 caggtctcat gtggtacgag aaggtctgcg agctgtttgc ggagctgggt ttgacccgaa 3180 gcaagcatga ttatggggta ttcttcctct ttcgcatagg agatgtcatc attatcgtca 3240 tccatgttga tgactgcacc ctcgtgacga gctctaaggg gctcatgatc aagttgaaga 3300 gcgacctcgg ctcccagtat gagatagtcg acctcggaga agctcgttgg ctcctaggct 3360 tcgaaatcca atgtgaccga catacccgta ccctcaccct ctcgcaagcg ggatacattg 3420 caaccctcct cgagcggttc cgaatgactg acgcatatgc cctatcggtg ccgttggatc 3480 ctcatgtcaa cctttttgac gtcgaactga ctgcgaagga ccgtagtgaa atggcgacgc 3540 gcccgtacgc ccgcctcgtc ggcagcctca tgtacacggc aattggtaca cgccccgacg 3600 tcgcattcac ggtttcgctg ctcgcgcgat tcatgagcga ccctgtgccg gtgcactggg 3660 atgctgcaaa acgcgtcctt cgttacctta agggcacgcg agaccttcgc cttactttca 3720 gtgggtccga tgagggatta atcgggttca cagatgccga ctggtgttcc ttacctcacc 3780 gccactccat ctcggggtat gttttcacct tctcaggcgg agccgtcagt tggcgctctc 3840 gtaaacaacc aatcatattg ctgtcttcca cggaggcaga atacgtcgca gcttccgaag 3900 ctggtcgcga gctcctctgg ctccgatacc tcatcggcga acttactcac ccattacaaa 3960 aggccacgcc gttgcgttgt gacaaccaga gcgcgatcgc aatcgtcgac tccggtctgc 4020 ttcacgctcg gactaagcac atcgatattc ggttccggtt cattcagtat gtgcaagact 4080 cgggtgctgc ttccatcaca tacatcccta cgggcgatat gatcgccaac atccttacaa 4140 aggccctccc acgctcgaag attggcaagc tcgtcgctct tcttggcctg cgcttggctt 4200 gagggggagt at 4212 // ID Copia-64_MLP-I repbase; DNA; FNG; 5516 BP. XX AC AECX01000588; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-64_MLP_; KW Copia-64_MLP-LTR; Copia-64_MLP-I. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5516 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000588; Positions 45016 50531. XX CC Positions [2916-3416] - Integrase core CC 'CTCA' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 1227..5486 FT /product="Copia-64_MLP-I_1p" FT /translation="MLSNANSSVSHSSFLIAQIPKLNDTNRVDWVLGLKTY FT LKGRKLWKHVERDTSLPEADLKDLDKVEQLECERASLLKVICATVSPTRLP FT TIRGIENPKVAYERLIEQATQDDGLEVASLIAKVATTRYTRSETVTTFLDG FT ISDLHTKLAEATASDKDLSISDKLLAVFLLLSFPEEQFGTIRDQLFGDLKN FT LTTAKVFSRIRTKSALSSVDETPIALAVNARPPSRPPQRTTIPRTDKSPNA FT PCVLCEHWNFSHTNVVCSRQNCRQPSRPATNTHQSSANSLSDNEKIRRFNQ FT LAAAGVIGFNAEAQGAHNPPPPVESNIQTPASSEIPTADCQFATSFNVVAE FT DASMEVSLTISNNYPPPTETHKATLADTACNRHMFGDVMLLDDLRDVNPVW FT IKVANDDKSSRIMANKMGTAKLHAFKPDGSSTLVQIPNVLYSPSLPANLIS FT ITELYETGFKVVDPHYGTNTDDKNMYFSNSKHIIPAYKDSGPGGFWKFYHY FT SEPSACAAISIKPNLSDLWHLHFGHLNNRSTSSVMEDVLRLNPSPPTTCEA FT CTLGKQSRSSHSGSLPRSQVPAYRIHCNLAGPFPSASIGGFLYSMALIDDA FT TRRNWVMLLKSKSQAFDAFKRFHLMLSNQTSYKIAVFKTDRGGEFTSAAFT FT NYLNEHGIIHEMGPPESPEQNSVVERFNRTISSRLRSQLIHGNLPVRLWGE FT VMMATSFVLNLCPSKSICFSCPEYAWQTNALKIKSPNLPYNRLRVIGCLAY FT TIPPGHHHKLQSRSIRTIMVGYEKYSNAYRLWDPKSNRIMVSNDVVFNEHE FT FPLRRLDVEFTKELTVLNEDNWDEMWETACVNNNKSDQVNKFLGIPNQQNE FT SPPPPRRSQRQPQPVERLGDLVGYHTIAETYQCYHSQVDDGPENDEPSYSK FT AMNGPNREDWLKAMAEEFTSLQQHSVGHLVEPPPDANILPGMWRLKRKRDE FT FSRITKYKARWVAGGNHQIKGVDFDLTYASVGMTDTLRTLYSLSAKDDLEM FT EQFDIKTAFLNGSIKHKVYVCQVTGFRDQDKPKHVMMLDQSLYGTCQAHRE FT FNDDLDIKMKRIGFVVCPVDNSLYTLRKQLSFVHVHMHVDDGMTFSNDRNL FT LNHFKEQLSTFYKFCWNTNPTLHLGIHITRDRKQHTITLDQSHYCESMLDR FT FGMSNCNGVKTPLPQNLKLSTPTIEDSTEIENYRAAVGMLNFLSVQTRPDI FT AFAVGYLARFNSRHNDAHWAAVKHLLRYVKWTIHFNLTFGEHKAKDKVVEG FT YADADYAGDIDTRRSTTGFVFLVHGSLVSWKSRRQQSVTVSTTEAEYLAIG 
FT DCAKHGHWLCRLLEYLLQQPPIYVSISLPISNDNQGAVFLCNEASVNNKSK FT HIDIRHHFIRELTREGRITVSHVSTKAMPADSLTKSVGPNILLESYNQLGI FT YEIKRC" XX SQ Sequence 5516 BP; 1576 A; 1341 C; 1138 G; 1461 T; 0 other; ctcactccgg tcttccagat ctcctttcct gtttactgat aagtttgttg ttggatctga 60 tcaggtatga tacaattgtg ctagtcatgt ctttcttatt atgttttgtt tatgatataa 120 agaaataaaa acctcatcag tttagaaagt tttcttctca caacttctta tcataaaaca 180 ctcactccgg tcttccagat ctcctttcct gtttactgat aagtttgttg ttggatctga 240 tcaggaatca aagattcctt taggttatga gcccaggctt ccagccatat tcgatcaaca 300 ataaaagtag actagagttt taaatacgct ctcctggcac gagaccacgt caatttaaac 360 tcgaaatact atttcaaaag gacactagtt ttctagagtc ttctcggtat atgaattgac 420 aaattcgaac tatctaattt tgaataccgg agaaccaaat ctcagcccga gccccgatcg 480 ctacccgtcg agtagtggct tgcccggccc ccgaaccgac gggtagagag acctcccaag 540 cccgttcggg ggtccccgaa cgagtttttt tgaaaaattg tctgcccgaa ccccgaccgg 600 gtccccgccg gggactcgct ggctggacca aaaagggtcc ctcacccagg tttgaacacc 660 ggaccgggtc atttatcgat attgatgcca aatacatcga aaaaaagtgt ccacttgatg 720 tttaaattgc actcagggcc tcacatcaca gccatggcag tgctggcaag gttcccatat 780 gcttggtagg gcctcccctg acaatggact aggtagtttt tacctgtgat ggagtgtttt 840 aaatacattt tttactcgtc gggtacccgg cggatagtat ttttttcgat ttctacgccc 900 gacacccgac tcgcggggat ccccgatcat cgggggttgt agagcctccg cccgaatccc 960 ggacctatcg gggaccccgt cgggtccccg atcggggggc tggtccgggg ctcaggctga 1020 accaaatcta ttcaattcaa tactgaaacc ttaacgatta ccccacattc aaactggatc 1080 caattactga ttcagaacct aatcaatatc aaaatcaacc tcccaccctc ccgtcaaact 1140 caactgtcga tcctcataca accgatcgta cacgagacaa ctcaactgcc gatactgtcc 1200 aatctaacca aacagtcatc cgcacaatgt tatctaatgc caattcttcc gtttctcatt 1260 cttcgttcct cattgctcaa ataccgaaac tcaacgacac caaccgagtc gattgggtgc 1320 tgggacttaa gacctatctc aagggtcgca agctatggaa gcacgtcgaa cgcgacactt 1380 ccctacctga agctgatctc aaggatctgg ataaggtaga gcagcttgag tgtgagcgtg 1440 cgtcactgct caaggtaatc tgtgcaactg tctcacccac tcgtctccca accattcgag 1500 gaatcgaaaa tcccaaggta gcgtatgaac gtctcatcga acaagctacc 
caggacgatg 1560 gactcgaggt ggcatctcta attgcaaaag ttgccactac tcgatacacc agaagcgaaa 1620 ccgtcacgac ttttctcgac ggcatcagtg atcttcacac aaagttagct gaggcaactg 1680 caagtgataa ggacctcagc ataagtgata agttgttagc tgtattcctt ctacttagct 1740 ttcctgaaga acagtttggg actattcgtg atcaactatt tggtgatctc aaaaacctaa 1800 cgactgcaaa agtcttctca cgaattcgaa ccaagtcagc actgagctcg gtcgacgaaa 1860 ctcccatagc gttagctgtc aacgctcgac cacccagtcg ccctcctcag cgcacgacta 1920 ttccaaggac tgacaagtct ccgaacgctc cgtgtgttct ttgtgaacat tggaacttct 1980 ctcacactaa cgtggtttgt tcaagacaaa attgtcgtca gccatcaaga cctgccacca 2040 acactcatca gtcatcagcc aactcactat cagacaatga aaagataagg cgattcaatc 2100 aactagcagc cgcaggtgtg ataggattca atgctgaagc gcaaggggca cacaatcctc 2160 cacctccagt cgaaagcaac atccagactc cggcttcgag tgaaataccc actgccgatt 2220 gccaatttgc aacttctttc aacgtagtcg cggaggatgc atcaatggaa gtttctctca 2280 ccatttcgaa caactaccct cctcccactg agactcacaa agcgactctg gctgacacag 2340 cgtgcaaccg acacatgttt ggtgatgtga tgttgctaga tgacttgagg gatgtcaatc 2400 cagtatggat caaggtggct aatgatgata agtcttcaag gattatggcg aacaagatgg 2460 gtactgcgaa gctacacgca ttcaagccgg atggatcgtc cactctagtt caaattccca 2520 atgttctgta ttctccctct ctacctgcaa atctaatatc aatcactgaa ctctatgaga 2580 caggtttcaa ggtagtggat ccgcactatg gaacaaacac ggatgacaag aatatgtact 2640 tctccaactc caaacacatc attccggcat acaaggactc tggacctggc ggattttgga 2700 aattctacca ttactcggaa ccaagtgcgt gcgctgcaat ctctatcaaa ccgaacttgt 2760 ctgacttatg gcatctccac tttggccatc tgaacaacag aagtacctca tctgtcatgg 2820 aagatgtcct acgacttaat ccttctccac cgacaacttg tgaggcctgc accttgggca 2880 aacagtcaag gagcagccat tctggcagcc ttccaaggtc tcaggtccct gcttatcgta 2940 ttcattgcaa tttggccggt ccatttcctt ctgcgagtat tgggggtttt ctgtattcaa 3000 tggctcttat agatgatgca actagacgga attgggttat gttacttaaa tcaaaatctc 3060 aggcttttga tgcctttaaa cgattccatc tcatgttatc taatcagacc tcatataaaa 3120 tcgctgtatt taagactgat aggggagggg aattcaccag tgctgcgttt acaaactatt 3180 tgaatgaaca tggaataatc catgagatgg gcccacctga gagtcctgaa caaaattcag 
3240 tggtagaacg attcaaccgc actatctcga gccgtctccg gtcgcagcta atccatggca 3300 acttgcctgt tcgtctctgg ggtgaagtaa tgatggcgac gtccttcgta ttaaacttgt 3360 gcccatctaa gtcaatttgc ttctcatgtc ctgagtatgc atggcaaacc aatgctctta 3420 agatcaaatc gcctaatctt ccgtacaatc gtctaagggt aattggatgc ctagcttaca 3480 ctattccacc tggccatcac cacaagcttc aatcaagatc aatcagaaca ataatggttg 3540 gttacgagaa atactctaac gcatatcgcc tttgggatcc gaagtcaaac cgaatcatgg 3600 tgtcaaatga tgtagtgttc aatgagcatg agtttccctt acgccgtctt gatgttgaat 3660 tcaccaaaga acttactgtg ttgaatgagg ataactggga tgagatgtgg gaaacagcgt 3720 gtgtcaacaa caacaaatcc gatcaagtga acaaatttct tggaatacca aatcagcaaa 3780 atgaatcacc accccctcca cggcgttccc aacgacaacc tcagcctgtt gaacgattgg 3840 gtgatctagt cggatatcat acaattgccg aaacatatca atgttatcat agccaagttg 3900 atgatggacc tgaaaatgat gagccttcgt actcaaaagc catgaatggt ccaaaccgtg 3960 aggactggct aaaggcaatg gctgaagaat tcacgtcact acagcaacat tctgtgggac 4020 acttggtaga acctcctcct gatgccaaca tcttacctgg aatgtggagg ctcaaacgaa 4080 aacgcgatga attttcccgc ataaccaagt acaaagctcg ttgggtggct ggcggcaatc 4140 atcagatcaa gggagttgat tttgatttga cctatgcatc agtggggatg accgacacgc 4200 tcagaactct atactctcta tctgcgaagg atgaccttga gatggaacaa ttcgacatca 4260 aaacagcatt tctgaacggt tcaatcaaac acaaagtgta tgtttgtcaa gtaactgggt 4320 tccgtgatca ggataagcca aaacacgtaa tgatgctgga tcaatcgtta tatggaacgt 4380 gccaagccca tcgagagttc aatgatgatc ttgacatcaa gatgaaaaga attgggtttg 4440 tggtatgtcc tgtggacaat tcattgtaca ctctcaggaa acaattgtct tttgtccatg 4500 tacatatgca tgtcgacgac ggaatgactt tctcaaatga tagaaatctt ctcaatcact 4560 tcaaagaaca actctcaaca ttttacaaat tctgttggaa caccaatcct actctacatc 4620 tcggcattca cataactcgt gacagaaaac aacatacaat cacacttgat caatcacact 4680 actgtgaatc tatgcttgac cgatttggta tgtccaattg caacggagtg aagacgcccc 4740 tccctcaaaa tctcaaactc tcaacgccga caattgaaga ttcgactgaa attgagaatt 4800 atcgggctgc tgtcggaatg ctgaatttcc tatcggtaca aacccgtcct gacatagcgt 4860 ttgctgttgg ttacctggct agattcaact cacgacacaa tgacgcacac tgggctgcag 4920 
taaagcacct actccgatat gtgaagtgga ctattcattt caatttaaca tttggtgaac 4980 acaaggcaaa ggataaagtg gttgaaggat atgcggatgc ggattatgct ggtgacattg 5040 atactcgtag atctacaaca ggttttgttt ttttggttca tggttcttta gtttcctgga 5100 aaagtcgaag acaacaatcg gtgacagtat ccacgactga agctgaatac ttggctattg 5160 gagattgtgc caaacatggt cattggttat gtcgtctact cgaataccta ctccaacaac 5220 caccaatata tgtttcaatc tctctaccta tttcgaatga caatcagggt gctgttttcc 5280 tatgtaatga ggcatctgta aataacaaat cgaaacacat tgacatacgg catcacttca 5340 ttagggaatt gacaagggaa ggcagaatca cagtttctca cgtttcaacg aaagcaatgc 5400 ctgcagattc attaacaaag tcagttggac caaacatatt gttggaaagt tataatcagc 5460 ttggaatata tgagataaaa aggtgttgag caggaggggc tgttggaata aactca 5516 // ID Gypsy-39_MLP-LTR repbase; DNA; FNG; 272 BP. XX AC AECX01002289; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-39_MLP_; KW Gypsy-39_MLP-I; Gypsy-39_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-272 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002289; Positions 5923 5652. XX SQ Sequence 272 BP; 70 A; 46 C; 38 G; 118 T; 0 other; tgttatgatg tggtgtaatt gtcactgagt gtgtaatgtt tgtatgacat catagcttgt 60 atgctaatac atttctctat ctaaaatagt cttatatctt attagcatag atactttatc 120 tattgtagaa ttgttgttag attggattgg ttcttcctca cctttcctat atgttctaac 180 atctctcttc taggtaagtt attctatcat tgcaatatca acctttccta tatgttctaa 240 catctctctt ctagtgagaa tcaatcttat ca 272 // ID Gypsy-2_AN-LTR repbase; DNA; FNG; 176 BP. XX AC . XX DT 03-JUL-2007 (Rel. 12.06, Created) DT 13-JUL-2007 (Rel. 
12.06, Last updated, Version 1) XX DE LTR of LTR-retrotransposon, Gypsy superfamily, Pogo clade. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Gypsy-2_AN-LTR; LTR-retrotransposon. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-176 RA Galagan J.E. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-176 RA Clutterbuck A.J., Kapitonov V.V. and Jurka J.; RT "Transposable Elements and Repeat-Induced Point Mutation in RT Aspergillus."; RL Chapter in "The Aspergilli: genomics, medical applications, RL biotechnology, and research methods." Edited by GH Goldman and SA RL Osmani. Publication expected 2007. XX RN [3] RP 1-176 RA Clutterbuck A.J.; RT "Gypsy-2_AN-LTR."; RL Direct Submission to Repbase Update (03-JUL-2007). XX DR [3] (Consensus) XX CC Solo LTR. 5-bp TSDs. Solo LTR of retrotransposon. 3' half 95% CC identical to Gypsy-1_AN_LTR. 12 copies in the A. nidulans genome, CC which contains only fragmentary traces of an internal portion. XX SQ Sequence 176 BP; 33 A; 57 C; 35 G; 51 T; 0 other; tgttatggga tgctcccata acacgcaccg cgtagcgtac gcggctgtgg tcacgtggct 60 ccccatccat ctccaatctc tgatctgtcc tgtcccgctt tctcttcttc agcgattccc 120 tcttgtacat acggcacgtt tagataggaa gatccgtcta tacacgtccc ttaaca 176 // ID Copia-65_MLP-LTR repbase; DNA; FNG; 269 BP. XX AC AECX01000622; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-65_MLP_; KW Copia-65_MLP-I; Copia-65_MLP-LTR. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-269 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000622; Positions 35770 35502. XX SQ Sequence 269 BP; 58 A; 65 C; 59 G; 87 T; 0 other; tgttgatgtt ctgtctattt tctacttatg tgtgtgcgaa cttgtgtgtg gctttcacta 60 ggatacacct atgtgacatg tagtagttag tgtttagctc agtgccgatg gcggagtagc 120 tagatccctc gtcactctac gacttgggat cctattagct aacgtgcctt gatccaccat 180 cggcactcaa caggtactta tgcaattgtc actctacgac ttgggatcct attagctaac 240 gtgccttgat ccaccatcgg cactcaaca 269 // ID Gypsy-13_RO-LTR repbase; DNA; FNG; 347 BP. XX AC AACW02000268; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_RO_; KW Gypsy-13_RO-I; Gypsy-13_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-347 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000268; Positions 154268 154614. XX SQ Sequence 347 BP; 158 A; 54 C; 34 G; 101 T; 0 other; tgacaaagaa gtcaaaaaat gagaaaggtc aaaaaaatga aaaaagtcaa gaaaatgaaa 60 gaagtcaaaa agagaaaaaa aaaaaattta acaatataaa tatcccaact atttagatga 120 agaaaaagga cagtacagtt ctactcacat tttattttat atattttatt caataaacag 180 gcaaccttct cataaaaagt tcctgatccc tttatttttt ttttatcttt acctttattt 240 gatcttaatt aaaagatctg gtggtcacta cagtacctta tcttaaaaat ttcaagaaaa 300 acttctctaa aacactcaaa ctaaaccaac catactacaa aataaca 347 // ID Copia-24_MLP-I repbase; DNA; FNG; 4660 BP. XX AC AECX01002457; XX DT 22-APR-2011 (Rel. 
16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-24_MLP_; KW Copia-24_MLP-LTR; Copia-24_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4660 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002457; Positions 12784 8125. XX CC Positions [1985-2509] - Integrase core CC 'GAGTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 65..2554 FT /product="Copia-24_MLP-I_2p" FT /translation="MSSPHPVLADADDKSEDSTSDTSSSKTHKPSSQEPLP FT NLLSTSLSNFTRMTDKDELKFFGTPLQQFVQELNSILSKYTIDNKLSDGNF FT PEWSLAIKRVLKIIDYHKYLSKYDFKAPSLTDEQHLKVKLVISVWLLGLMD FT SQNKTRCQTMLKLANCNDSEDENEDSEDEDDFDYEPYLIWKFLKNHHQRIS FT EAGLQTIEDTISDMKILSSDSFKVHCDKFNNLIADFVKYKGQISSASAARK FT LIKTVKSQINENVSENIYSKVVPLTREGVVQYLIDYEARNGGFTTPAIIEA FT SQATVNHHFEAARTTYSIPQSSAAQAAGGNQRRFRPKCTEHRCISTTHTPE FT KCFAKPENRSARDLWIAQMEARRKGNHGNRFRNQSQTPSVNGIKQVSLPSA FT SPAFASSNPFASLHIFLDEDSVSVSASASETFEASHTVFDDDTPAANLVNK FT SSGHWALYDTGATHYMFKNDQLFASMSKIEDSSKRLKLAGGDVSLAVHSTS FT TVQLKSGTGKMFELNNSLYVPELAQNLLAGGAMLRKGVQVLIHPNDSNCFS FT LVFKGEALFNGVFAANNLMYVALEPVSPTFDSKASSTQAEDVTQLQHRRLG FT HLSHRYLRLMCKHKSVQGLTEDLTQLKHCDICSLSKNTKIPHSSTRPRAFR FT HLENVHVDLSGIIRVKGLKNEMYYTLFCDDYSSFRHIYFMQSKSKEEVLNV FT FKTYLAMAERQTCVKLKQFTLDRGGEFLNDLLGTELCERGIVLHLTAAHSP FT EENGVSERGNRTISTKARSMMLESKMPLQFWVQACSTAVFLTNRTVTAALS FT NNRTPFEVWHFRKPLVQHLKVFVAWPML" FT CDS 3043..3951 FT /product="Copia-24_MLP-I_1p" FT /translation="MVTTTHDVPRSYSKAVSGPEGDEWMNACKKEIQAMRD FT KKVWVLVDRPENCNVIRGLWLFRKKPTADPNKKFKFKSRYVAMGNTQIHGI FT DYFDTFAPTGKPTSFRLFIALAAINGWEVHLMDAITAFLNSDLDKEIYVEQ FT 
PEGFVVPGEEHLVCRLLKSLYGLKQAPKCWQDDVEEFLISINFIQCEVDHC FT VYIRCVDNLFTAVYVHVDDLTITGNDILSFKAEISKRWEMEDLGLATTVVG FT IEVKRESESVYSICQEVYATKILQRFDSLDLKPASTPLPPGLKLFKPSLDE FT IEDFACKKLPY" XX SQ Sequence 4660 BP; 1365 A; 1091 C; 957 G; 1247 T; 0 other; tggtagcggg agtccctacg actcaatcac tcctttactc aaagacctca tctaaagact 60 gtttatgtca tcgccacatc ctgtgctagc cgacgccgac gacaaatccg aagattccac 120 atccgatact tcttcatcaa agactcacaa accatcttct caagaacctc ttcccaatct 180 cttatctaca tcattgtcaa acttcacaag aatgactgat aaggatgaac tgaagttttt 240 tggcactccg cttcaacaat tcgttcagga gctcaactcg attctgtcga aatacacaat 300 tgacaacaaa ctttctgacg ggaattttcc tgaatggtca ctcgcaatta aaagagtgtt 360 gaaaatcatc gattatcaca aatatctctc caaatacgat ttcaaagcgc cttctttgac 420 tgatgaacaa catctgaaag tcaaacttgt gatctctgtg tggcttcttg gactgatgga 480 tagccagaac aaaactaggt gtcaaaccat gttgaagttg gcgaactgta acgactcgga 540 agatgaaaat gaagatagcg aagacgagga tgactttgat tatgaaccgt atctgatctg 600 gaagtttctc aagaatcacc accagagaat atcggaagct ggtctacaaa ccattgaaga 660 cacaatcagc gatatgaaga ttctgagctc tgattcattc aaagtgcact gtgacaagtt 720 caacaacttg atcgctgact ttgttaagta caaaggacaa atttcatctg cctctgccgc 780 tcgcaaactc atcaaaacgg tcaaatctca aatcaatgaa aacgtgtctg agaacatata 840 ctcaaaagtc gttcctctta ccagagaagg agtagttcag tatcttattg attatgaagc 900 tagaaatgga ggtttcacta ctccggctat cattgaagca agtcaagcga ctgtaaacca 960 tcactttgaa gctgctcgta ccacttactc aatccctcaa agcagtgctg ctcaagccgc 1020 tggaggaaat caacgtcgtt tccgaccaaa atgtactgaa cacagatgta tctccaccac 1080 ccatactcct gaaaaatgtt tcgccaaacc tgaaaatcgc agtgctcgtg atctttggat 1140 tgctcaaatg gaggcaagac gaaagggaaa tcatggaaat cgttttcgaa atcaatccca 1200 gactccaagt gtcaacggca ttaaacaagt ctcattacct tcagctagtc ctgcttttgc 1260 ctcctcgaac ccgttcgctt cccttcatat ctttcttgac gaagatagcg tttctgtgtc 1320 agcttccgcc tcagaaacat tcgaagcatc tcataccgta ttcgatgacg acactcctgc 1380 tgcgaacctg gtcaacaaga gcagtggtca ctgggctctg tatgacaccg gggcgactca 1440 ttatatgttc aaaaacgatc aactgtttgc ctctatgtca aaaattgaag attcgtccaa 1500 
acgactgaaa ctcgcgggag gagatgtgtc tcttgctgtt cactctacca gtacggtgca 1560 actaaaatca ggtactggca aaatgtttga gttgaacaac agtctttatg tgcctgaact 1620 ggctcaaaac ttattggctg gaggtgctat gcttcgtaaa ggagttcagg tcctgattca 1680 tcccaacgac tcaaattgtt tctcgcttgt attcaaagga gaggccttgt tcaacggagt 1740 gtttgctgca aacaatctca tgtatgtagc tctcgaaccg gtgagtccaa cttttgactc 1800 aaaggcatca tcaactcaag ctgaagatgt gactcaactt caacaccgtc gtttagggca 1860 tctgagccac cgttatctca ggttaatgtg caaacacaaa agtgtgcagg gtctgacaga 1920 agacctaact caattaaaac actgtgatat ctgttctcta tccaagaaca ctaaaatccc 1980 tcactcttca actaggccca gggcttttcg acatctagaa aatgtacatg ttgatctcag 2040 tggcataatc agggtcaaag gattgaaaaa tgaaatgtac tacaccttat tttgtgatga 2100 ctattcatct tttcgtcaca tctactttat gcaatctaaa tcaaaagaag aagtgttgaa 2160 cgtttttaaa acctatcttg ccatggctga acgtcaaact tgtgtgaagt tgaagcaatt 2220 tacccttgac agaggaggtg aatttctcaa tgacctcctg ggtaccgaac tttgtgagcg 2280 cgggattgtt cttcacctca cagccgcaca ttctcctgaa gaaaatggcg tgtcagaacg 2340 tggtaatcgc accataagca caaaagctag atccatgatg ttggaatcga agatgccttt 2400 gcaattctgg gtacaagcat gtagcactgc agtttttctc acaaatcgta ctgtcacagc 2460 tgctcttagc aacaatcgca caccgtttga agtttggcat ttcagaaaac cattggtaca 2520 gcatttgaaa gtctttgttg cttggcctat gctttaatca gaaaagaaat cagaggttcg 2580 aaattcaacc cagtcagcag tcatggtgtt ctagtcggtt atgatgagga caattttaac 2640 tatcagatct atgatctgac ctctcataaa atcatcatca cgcatcacgc tacattcaat 2700 gaagatctct ttccttttgg aaatgatccg gtcactaggc tatctcctct tcctccaaca 2760 agcaagggag tcaacattcg attctttgac aacgactctg atgatgagca ttttgaagag 2820 gtcctggagc cgactgactc tgataatcgt cctgaggttc ctacagttct tgatcctaac 2880 ccgccagcca ttaaagtatc cccagatgtt cgtcgctcag gcagagtgaa gcaagtagtt 2940 aagtacacag ccactgccgt gtgtgatgat atcaaagtct cttccttgat tgctaagtgg 3000 gagtctgctc aacatttaaa ctgtggagtt cccgaatgcg atatggtaac tacgactcat 3060 gatgtgccta gatcatactc aaaggccgtg tctggacctg aaggagacga atggatgaat 3120 gcctgcaaga aagagatcca agcaatgcgt gacaagaaag tgtgggttct ggtagacaga 3180 cccgaaaatt 
gtaacgtcat cagagggctc tggttgttca gaaagaaacc tacggcagat 3240 cccaacaaaa agttcaagtt caagtctcgt tacgtggcca tgggtaacac tcagatacac 3300 ggaatcgact actttgacac cttcgctccg actggaaagc ccacctcttt ccgactcttc 3360 attgcacttg ctgccatcaa tggctgggaa gtacacctca tggacgccat tacagctttt 3420 ctcaacagtg accttgacaa agagatatac gtagagcaac ctgaaggctt tgtagtaccc 3480 ggtgaagaac atctagtgtg ccggctgctt aaatctctat atggtcttaa acaggctcct 3540 aagtgctggc aggatgacgt cgaagaattc ttgattagca ttaacttcat tcaatgcgaa 3600 gtagaccatt gtgtatacat cagatgtgta gataacctgt tcactgctgt ttatgtacat 3660 gttgacgatc tcaccattac aggcaatgat attctttcat tcaaagctga aatctcaaaa 3720 cggtgggaaa tggaggacct cggattggct acgacagttg tgggcattga agtcaaacgt 3780 gaaagtgaaa gtgtttactc gatatgtcag gaagtgtacg ctacaaaaat ccttcaacgt 3840 tttgattcct tggatctgaa acccgctagt acacctctcc cacctggact caaactattc 3900 aagccaagtt tagacgaaat tgaggatttt gcctgcaaga aactaccata ctgaagcatc 3960 gttggttctc tcatgtatct cgcacaatgc accagacccg acctggccta cgcagtaggt 4020 gtgctttctc agcatcttga ctggccggga tttcagcact ggaatgctgc aaaccatgtt 4080 ctttgttacc ttgtgggaac cgtcaactac ggtatttgct attctggaga atcaccaaac 4140 aaaccggtca aaggtctcaa gagtcagcac tgtcctcagg ccttgtgtga tgcagactgg 4200 gcaggggaca aagatactcg caggtcaacc acaggctatt tattcatctt ggcgggtggt 4260 gcagtatcct ggcgtagcaa acttcaaccc actgtggcat tatcatcaac tgaggcagag 4320 tatcgtgcca ttactgaagc cggtcaagag ctcctgtggc taagaactat gctgacaaaa 4380 ctcggttatg aagactcaga tccaacaatc ctcaagagtg acaacatggg tgcgattcac 4440 ttaaccaaca aatcaatctt tcatgcaaga acaaagcatg ttgaaatcca ctaccactgg 4500 atcagagaag tggtaaaagc tggaaagcta actgttcaac actgccctac tcatctcatg 4560 acagctgatc ttctaacaaa gtccctaggt tctgctcaat ttactcacct tcgcaaaatc 4620 ttaggactca aacctatccc gtgatgaact tgagggggca 4660 // ID Gypsy-4_TMe-LTR repbase; DNA; FNG; 389 BP. XX AC CABJ01003198; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: long DE terminal repeat. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_TMe_; KW Gypsy-4_TMe-I; Gypsy-4_TMe-LTR. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-389 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01003198; Positions 29547 29935. XX SQ Sequence 389 BP; 91 A; 69 C; 94 G; 135 T; 0 other; tgtggaggga aaaaattgcg cgatgatttt tattcttagt atactattgc ttttagttag 60 agtttctgaa catagtctgg gttggcaaga gtgtgactgt tgtgtttagt atcgtggagg 120 aaaagttttc acaggactgc gtttttccac acaccttgaa ggtatataag aggctgtggg 180 aaacaccttg ggaacgatat tggcgattgg ttgaacgagc aacatccccg catcactatg 240 acttggattt ttgattgtgt ttgttgtttt atttcttgtg cgctttagtt acttcaaact 300 ctctactcta cattccgctt acaggctata cttgatccgc ctgagtgcgt tgcgtgaaat 360 agagccggga cccacgatat cgtttttca 389 // ID TY3-1p_I repbase; DNA; FNG; 4980 BP. XX AC AY198186; XX DT 15-APR-2008 (Rel. 13.04, Created) DT 15-APR-2008 (Rel. 13.04, Last updated, Version 1) XX DE Saccharomyces paradoxus Ty3-like retrotransposon, Internal DE region. XX KW Gypsy; LTR Retrotransposon; Transposable Element; TY3-1p; KW internal portion; TY3-1p_I. XX OS Saccharomyces paradoxus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. XX RN [1] RP 1-4980 RA Fingerman E.G., Dombrowski P.G., Francis C.A. RA and Sniegowski P.D.; RT "Distribution and sequence analysis of a novel Ty3-like element RT in natural Saccharomyces paradoxus isolates."; RL Yeast 20(9), 761-770 (2003). XX DR EMBL/GenBank/DDBJ; AY198186; Positions 1 4980. XX FH Key Location/Qualifiers FT CDS 333..1256 FT /product="TY3-1p_1p" FT /note="TyA3p, similar to the TyA3 ORF of the FT Saccharomycescerevisiae retrotransposon Ty3." 
FT /translation="FKQTNESPNLATPTSVMSFMDQISGGGSYPKLPVECL FT PNFPIQPALTFRGRNDSHKLKNFISEIMLNMSMIPWPNEASRIVYCRKHLL FT NPAAQWANDFVQEQGILEITFDTFIQGLYQHFYKPPDINKIFNTINQLSEA FT KLGIERLNKQFKMIWDRMPPDFMTEKAAIMTYTRLLTKETYNIVRLHKPKT FT LRGAMEEAYQTTALTQRFFPEFELDADGDTIIAAATRLHEEYDYDSDPQEN FT LLVQKRHVHAVRTRRSYHKPTATYHNRRSNNPSREECVKDRLCFYCKKEGH FT RLNECRARNASSSRY*" FT CDS 1216..4980 FT /product="TY3-1p_2p" FT /note="TyB3p, similar to the TyB3 ORF of the FT Saccharomyces cerevisiae retrotransposon Ty3; FT frameshift generates fusion with TyA3p coding FT region." FT /translation="TSVELVMRVLADTELESKDHKKLSITSRPIVHYIAIP FT EMDKTAEKHIKIKNTKIKTLFDSGSPTSFIRRDTVNLLNLPTHDTPPLRFR FT GFISTESTTTSEAVTLDLTVDNLQINVAAYVLDKMDYQLLIGNPILRRYPK FT LLYTILNTKQCTSAQKPKAYHSENVNYVKAKSAGNRGNSRNKTPSFAPTIP FT EATDQKSAGNRSNSRTNTMSFATTTPEATDPLTTLDNPGSTQSTFAQFLIP FT EEAIILEEDGKYSNVVSTIQNVEPKATDHSNKDTFSTLPVWSQQKYIEIIR FT NDLPPRPANIHSMPVKHDIEIKPDTRLPRLQPYHVTERNEQEINKIVQELL FT DNKFIVPSKSPCSSPVVLVPKKDGTFRLCVDYRALNKVTISDPFPLPRIDN FT LLSRIGNAQIFTTLDLHSGYHQIPMDPKDRYKTAFVTPSGKYEYTVMPFGL FT VNAPSTFARYMADIFRDLRFVNVYLDDILIFSESQEEHWKHLDTVLGRLKK FT ENLIVKKKKCKFASEQIEFLGYNIGIQKITPLQHKCAAIRDFPKPRTVKQA FT QRFLGMINYYRRFIPNCSKIAQPIQLFICDKSQWTEEQDKAIEKLKFALCN FT SPVLVPFNNKAIYRLTTDASKDGIGAVLEEVNAKNALVGVVGYFSKSLESA FT QKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARR FT VQRWLDDLATYNFTLEYLAGPKNVVADAISRAVYTITPEIPQPIDPENWKT FT HYKSDPLCSATLIYMKELTQHNVIPEGMSAFRSYHKKFQLSETFRKNYSLE FT NGIIYYRDRLVVPVKQQNEVIKLYHDHTLFGGHFGVTVTFGKIAPIYYWPK FT LQHSITQYIRTCVQCQLTKSHRPRSQGLLQPLPVAEGRWLNISMDFVTGLP FT LTTNDLNMILVVVDRFSKRAHFIATRKTADASQLINTLFRYIFSYHGFPKT FT ITSDRDIRITAEKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLLR FT AYASTNNQNWHTYLPQIEFVYNSTPTRTLGKSPFEIDLGYTPNAPTIKTEC FT EINARSFTAVELARHLKAITIQTKERLESAQIEMETNNNQRRKTLLLNIGD FT HVLVHRDAYFKKGTYMKVQPIYVGPFRVVKKINDNAYELDLDSHKRKHRVI FT NVQYLKEFVYRPDAYPKNKPISSVERINRANEVIAVIGIDTTHKTYLCRMQ FT DVDPTISVEYSEAEFYQIPEEIRKSILANFRQLYETQDNSEREED" XX SQ Sequence 4980 BP; 1750 A; 1195 C; 862 G; 1173 T; 0 other; gaattcacta 
gtgattcgaa acaatagctc caaaacggac aatattgagt atactaggca 60 gcctacttgc ctaagacgaa ccaaaccaac caaacgtata aatacctgaa caattagttt 120 agatccgaga ttctgcgctt ccacccttta gtgaaatcca gatcttatat agattatata 180 agacaagtaa catcaagtaa catttctgtg aatcacgtta ataataagtc tgacaacaag 240 ttactctcct aaacgacttt aggattgtca agacatccgg tattactcga gctcgtaata 300 caacatctgg tagcgctaaa ggttactaat tgttcaagca aaccaacgaa tccccgaacc 360 tagctacacc aacatctgtc atgagcttta tggatcaaat atcaggagga ggaagttacc 420 caaaacttcc ggtggaatgt ctccctaatt tcccgatcca accagcacta accttcagag 480 gtaggaatga ctcgcacaag ttgaaaaact tcatctctga aataatgtta aacatgtcta 540 tgattccttg gcccaacgaa gccagtcgta ttgtctattg tagaaagcac ttactgaacc 600 ctgctgctca atgggctaat gattttgtcc aagagcaagg catactggaa ataacattcg 660 acactttcat acaaggactc tatcagcatt tctataagcc accagacatc aacaaaatct 720 tcaacacgat caatcaacta tctgaagcaa aactgggtat tgaacgtctc aacaaacaat 780 tcaaaatgat ttgggacaga atgccaccag actttatgac cgaaaaagcc gctattatga 840 catatacccg cttactgaca aaggaaacct acaatatagt tagattgcat aaaccaaaaa 900 cactgagagg agccatggag gaggcttacc aaacaacagc actaactcag agattcttcc 960 cagaatttga gctagacgcc gatggagata ctataatcgc agccgcaact cgtttacacg 1020 aagagtacga ctacgacagt gacccacaag aaaacttgtt ggtccagaag agacacgtcc 1080 acgcagttcg gacaagaaga tcataccata aaccgaccgc gacttatcac aacaggagaa 1140 gcaataatcc atctagagaa gagtgtgtaa aggaccgtct atgtttttat tgtaagaaag 1200 aaggacaccg tctgaacgag tgtagagctc gtaatgcgag ttctagccga tactgaactt 1260 gagtcgaaag accacaagaa actttctatc acatcccgac ctattgtaca ctatatcgcc 1320 atacctgaaa tggacaagac tgccgaaaaa cacataaaaa taaaaaacac gaaaataaaa 1380 accctgtttg atagtggatc gcccacatca tttatccgaa gagataccgt aaatctcctg 1440 aatctgccaa cccatgatac cccgccgctc cgctttagag gattcatatc caccgaatcc 1500 accaccactt cagaagcagt tacgctcgac cttacagtcg acaatctgca aatcaatgta 1560 gccgcgtatg tactcgataa aatggactac caacttctaa tcggaaatcc aattctacgc 1620 cgctacccaa aactcctgta cacaatcctg aacactaagc agtgtacctc cgcccagaag 1680 cccaaggctt accattccga aaacgttaac 
tatgtgaaag caaaatccgc tggtaatcgt 1740 ggtaactcca gaaacaaaac accgtccttt gcccccacta ttcctgaagc aactgaccag 1800 aaatccgctg gtaatcgtag taactccaga acaaatacta tgtcttttgc cactaccact 1860 ccagaagcaa ctgacccgct tacgaccctc gacaatccag gtagtactca aagtacattt 1920 gcgcaattcc tgatacctga agaagcgatc atcctagaag aggatggaaa gtactccaac 1980 gttgtgtcaa ccatacagaa cgtagaacct aaagctactg atcacagcaa caaggacaca 2040 ttttcaacgt tgcctgtttg gtcacaacag aaatacatag agattatacg taatgatctc 2100 ccgccaagac cggccaatat ccacagcatg cccgtaaaac acgacattga aattaaacct 2160 gacacaagac tacctcgact acagccatat catgttacag aaaggaacga acaagaaatt 2220 aacaagatcg ttcaagaact gctcgacaac aagttcattg tcccctctaa gtctccgtgc 2280 agttctcctg tagtccttgt cccgaagaaa gatggtacgt tcagactttg cgttgactac 2340 cgtgccctga acaaggttac catctccgac ccattcccat tacccagaat cgacaaccta 2400 ttaagccgta ttgggaacgc ccaaatattc accacgctag atttgcacag tggttatcac 2460 cagattccaa tggacccaaa agaccgctac aaaaccgctt ttgtcacccc gtccggtaaa 2520 tatgaataca ctgtcatgcc attcggtcta gtcaatgcac ctagcacctt tgcaagatac 2580 atggccgaca ttttcagaga cttgagattt gtcaatgtct atcttgatga catattaatc 2640 ttctctgaat cccaagaaga acactggaaa catttagaca cagttcttgg aagacttaag 2700 aaggaaaatc ttattgtcaa aaagaaaaaa tgcaagtttg catcagaaca aattgaattt 2760 ctaggttata atattggaat tcagaaaata accccattgc aacacaaatg tgctgccatt 2820 cgagatttcc caaaacccag aacagtaaaa caagcacaac gatttttagg aatgattaat 2880 tactatagac gattcatccc aaattgctct aagattgcac aaccaatcca gctctttata 2940 tgcgacaaga gtcaatggac agaggagcag gacaaggcaa tcgagaaatt aaaattcgcc 3000 ctatgtaatt cccctgttct agtaccattt aacaacaaag caatttaccg attaactaca 3060 gacgcatcaa aagatggtat cggtgccgtt ctagaagaag tcaatgccaa aaacgcactt 3120 gttggtgttg tcggttattt ctctaaatca ttagaaagtg ctcaaaagaa ctaccccgcc 3180 ggtgaactag aactacttgg aattattaaa gcacttcacc acttccgtta tatgctccat 3240 ggaaagcatt ttacattgag aacggaccac attagcttac tgtcactgca gaataagaac 3300 gaacccgcac gaagagttca aagatggttg gatgacctag ccacttataa cttcacccta 3360 gaatacttag ctggacccaa aaacgtcgtc gcagatgcta 
tatcccgtgc cgtgtacact 3420 ataaccccag aaatacccca acctatcgac ccagaaaact ggaaaactca ttacaaatca 3480 gacccattat gcagtgccac cttgatttac atgaaagaat tgacacaaca caatgtcata 3540 cctgagggta tgtccgcatt ccgtagctat cacaagaaat ttcagttatc agagaccttc 3600 cgaaagaact actccctaga aaatggaata atctactatc gagaccgatt agtagtcccg 3660 gtaaaacaac agaacgaagt cataaaactg tatcacgatc atactttatt cggaggccat 3720 tttggtgtaa ctgtaacatt tggaaagatt gctccaattt actactggcc gaaactgcaa 3780 cattcgatta cacaatacat ccgtacctgc gtacaatgtc aactcacaaa atcacatcga 3840 ccacgttcac aaggactatt gcaaccgctt cccgtagcgg aaggaagatg gcttaatata 3900 tcaatggatt ttgtgactgg actaccctta acaacaaacg atttgaatat gatcctcgtt 3960 gttgttgacc gcttctcgaa acgcgctcac ttcatagcta caagaaaaac agcagacgca 4020 tcacagctaa taaatacgct attccgctat atcttttcat atcatggttt cccgaagaca 4080 ataaccagtg atagagacat ccgtataacc gcagaaaaat atcaagaact gacaaaaaga 4140 ttagggatta aatcgacaat gtcttctgca aaccaccccc aaacagatgg acaatccgaa 4200 agaacgattc aaacattgaa ccgattgcta agagcttatg cctccaccaa taaccaaaac 4260 tggcacacct acttaccaca aattgaattt gtctataact ccacaccgac cagaacactt 4320 ggaaaatcac cattcgaaat tgatttagga tatactccca atgcacctac tattaaaacg 4380 gagtgcgaaa ttaatgcaag aagttttacc gccgtagaac tggcccgaca cctcaaagct 4440 attacgatcc aaacaaaaga acgactagag tctgctcaga ttgaaatgga aactaataac 4500 aaccaaagac gtaaaacttt gttattgaac ataggagatc acgtactagt gcatagagat 4560 gcatacttca agaaaggaac gtatatgaag gtacaaccta tatacgttgg accatttcga 4620 gttgtcaaga aaataaatga taacgcttac gaactcgact tggattcaca caagagaaaa 4680 catagagtta ttaacgtaca atacctaaag gaatttgtat accgacccga cgcatatcca 4740 aagaataaac caatcagttc cgttgaaaga atcaacagag ccaatgaagt cattgcagtt 4800 atagggatag ataccacaca caaaacatac ttatgtcgta tgcaggatgt ggacccgaca 4860 atctcagtag aatactcaga agctgaattc taccaaatcc cagaggaaat acgaaagtca 4920 atattagcca attttagaca attgtacgaa actcaagaca actccgagag agaggaagat 4980 // ID Gypsy-1_CAl-I repbase; DNA; FNG; 3922 BP. XX AC . XX DT 04-APR-2011 (Rel. 16.04, Created) DT 04-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Candida albicans genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_CAl_; KW Gypsy-1_CAl-LTR; Gypsy-1_CAl-I. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-3922 RA Jurka J.; RT "LTR retrotransposons from the Candida albicans genome."; RL Direct Submission to Repbase Update (04-APR-2011). XX DR [1] (Consensus) XX CC Positions [2755-3261] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 268..3621 FT /product="Gypsy-1_CAl-I_1p" FT /translation="MSFPRTHSPRPSGSREQEDLTLMIKAFRDSMEAKLDL FT HSQKLTALVANIPRTDEGFEDLSQRITVLKNHQKAFLPKQEKEIGSLLHRQ FT REEEGDVKDIKTVVGEEKEELHQVEDFVLKDQEELRNVEKKVLKEEEELQK FT VEESMEKEKQELYQVEDFILQRDETVKKLGESNQSQQEPYTPATSGSDQRF FT RSQQPNIGNTLAQDLALIPKLDLEICKIAVKYPKLFETKLRPPPPRDFQYK FT IQLTDHTQIYSKPYKCNQEEQALIKDFINEKLEAGVLVPAPIDAWLHPIFP FT IRKTNANQSSTKIAVDLRRLNKVTVRMYTYPTDTKDLLSSLTDSHYFSALD FT LKNAFYQVSIHKDSIKYFGISTSEGNYCFTTLPFGAINSPTIFTNFVRQIL FT EGIPCIFIYMDDILIHTKTLHDHMSSLRRIMEKLNEHQLQMNYNKMQLLTT FT KINFLGYSIQANKISPDISKIQAIQNWELPTTTTQIRAFVNFSNHFRIFIP FT EIAKFTNPLNELLKNNNGKNIKIEHTQASIDGYKALKAAIIGLPTLQLYNP FT KLPTIIFTDASHMVVGGYICQPTFRNDKEVLVPIAFSSHKLTETQSRYAAM FT EKELLAIIVILEKFRYHCSNTVEIYTDYQSLASYLDKKTTPPPRIARFLDL FT IGSFSPKVYYLSGKKNFVADIITRYQTQNIKELVDEDKILGQTFTVKRNLK FT QQLLPRLEAIELENLNESQVHKIQTSLEQQQQHDLEDNDEELPLQLFKLMN FT DELFVIINNQLLKYLPRLEYNDICQTIHDKHHPSTRVTDYLCTLAYWHPDH FT LLIATNITRKCHYCQLNTSIREAIRPYRPLEPLKAFSRWGMDYSGPYFNTV FT QHRYILVAVEYVTGLTIAVPTLHKDADNAISLLQSIILIMSAPTELVTDQG FT KEFSSQALATLCDQNNIQHHITSAHHPRGNGRVEKVNHLLKKILKALTNDT FT MQDWDLKLYDALRIYNATPTIFNYTPLYLALGIEPHHNLNQLQKDLIENLQ FT KELPPEVQSTEEHEENPNDEQQEEGREQQISREEQQDGRDLVHLRIYELEA FT IKKARKLHTNLKTRRNAVQNIVKDPYGIPDLFTRDIGYTELELKHENMNQI FT SMVHIKFKKY" XX SQ Sequence 3922 BP; 1561 A; 744 C; 622 G; 995 T; 0 
other; gaaaattaat tttctgatta tactacttac tagattgcat aaagtcaata tctgattgat 60 acaacttggt tcattattca taaaacttaa caactaattc aacaagaaaa cccaacaaaa 120 aaatccaaat aaaataatca aaaaaatatt ataattaatt aattacaaaa aaaaaaaaca 180 aaaaatacac acacacatac acacacacaa aatcttgttg caaaaaaaaa agaaaaataa 240 taataatata ataagaatta attaacaatg tcgtttccac ggacacattc accaagacca 300 tctggttcac gagaacaaga agatctcaca ctgatgatta aagcttttag agattcaatg 360 gaagctaagc ttgacttgca ttcgcagaag cttactgctt tggtagcaaa cattcccaga 420 acggacgaag ggtttgaaga tttatcacaa aggatcactg ttcttaaaaa tcatcaaaaa 480 gcatttttgc ccaaacaaga aaaagaaatc ggaagtcttc tccacagaca aagagaggaa 540 gaaggtgatg ttaaggatat caaaacagtt gttggtgaag aaaaagaaga attgcaccag 600 gttgaagatt tcgttttaaa agatcaagaa gaattacgaa acgtcgaaaa gaaagttttg 660 aaagaagaag aagaattgca aaaagtggaa gagtcaatgg aaaaggaaaa acaagagtta 720 taccaggttg aagactttat tttgcaaaga gatgagacgg taaagaaact tggagaaagc 780 aatcaatctc aacaggaacc atatacacct gcaacttctg gttcggatca gagattcaga 840 tctcaacaac ctaacattgg aaatacctta gcgcaggatc tagcattaat tccaaaatta 900 gatctggaaa tttgcaaaat tgcagtcaaa tatccaaaat tatttgaaac aaaattaaga 960 ccaccaccac ccagagactt tcaatataaa attcaactca cagaccacac tcaaatttat 1020 tcaaaaccat ataaatgcaa tcaagaagaa caagctctca tcaaggattt catcaatgaa 1080 aaattagaag caggcgtttt ggtaccagct ccaattgatg cttggttaca cccaatattt 1140 ccaatcagaa aaaccaatgc caaccaatcc tccaccaaaa tagcagttga tttaagacgt 1200 ctcaataagg tcacagtacg aatgtacact tatccaacag acacaaaaga cctcttatcc 1260 tcactaacag attcccacta ttttagcgct ttagacttaa agaatgcgtt ctatcaggta 1320 agcatacaca aggatagtat aaaatatttt gggatttcaa catccgaggg gaattattgc 1380 tttacaactt taccgtttgg agcaatcaat tccccaacca tctttactaa ctttgtgaga 1440 cagattttag aggggatccc atgtatattt atatacatgg atgatatcct catccatact 1500 aaaaccttac atgaccacat gtcatcactc aggagaatca tggagaaact aaatgagcat 1560 cagcttcaaa tgaattataa caagatgcaa ttattaacaa caaaaatcaa tttcttaggg 1620 tacagcattc aagcgaacaa aatatcacca gatatttcca aaattcaagc aatacaaaat 1680 tgggaattgc ccacgaccac 
tactcaaatc agagcatttg tcaatttcag caaccacttt 1740 cgcatcttca tcccagaaat agcaaaattt actaatccat taaatgaatt attgaagaac 1800 aacaatggta aaaacataaa gattgaacac acccaagcat ccattgatgg ttacaaggca 1860 ttaaaagccg ccatcattgg attgccgacg cttcaacttt acaatccaaa actaccaacc 1920 atcattttca cagatgctag ccacatggta gtaggaggat atatatgtca accaacattt 1980 agaaatgaca aagaagtcct tgtcccgatt gcattttcat cacataaatt aacggaaaca 2040 caaagcagat atgctgctat ggaaaaggaa cttttggcaa ttattgtgat attggaaaaa 2100 tttagatatc actgcagcaa tacggtagag atctatacag attatcaaag tttggcatca 2160 tatttagata agaaaactac tccaccaccg agaattgcta ggtttttaga tctaattgga 2220 tcattttccc caaaagtgta ctatttaagt ggaaagaaaa atttcgttgc tgatatcatt 2280 acaagatatc aaactcaaaa tattaaggaa ttggtagatg aagacaagat actaggacag 2340 acttttacag tcaagagaaa tttgaaacaa caactattac caagattgga agcaattgaa 2400 ttggaaaatc ttaatgaatc acaggttcac aaaatccaaa cttcattaga acaacaacaa 2460 caacatgatt tggaagacaa tgatgaagag ttacctctcc aactgtttaa attaatgaat 2520 gatgagttat ttgtaatcat taacaaccaa cttttaaaat accttccaag actggaatac 2580 aatgatattt gtcaaacaat ccatgacaaa caccatccat caactagagt aacagactac 2640 ttatgcacac tcgcatattg gcatcctgac catctattaa ttgctacaaa cattacgaga 2700 aagtgtcact attgtcaact aaacacgtca attcgtgagg ccattagacc ataccgacca 2760 cttgaaccac tcaaggcatt tagcagatgg ggaatggact actctggacc atactttaac 2820 acagtccaac acaggtacat attagtagcc gtggaatatg tcactggttt aactattgca 2880 gtaccaacat tgcacaaaga cgcagataac gcaatcagtc ttttacaatc aatcattctg 2940 atcatgtcag cacctacaga attagttaca gatcaaggta aagaattttc atcacaagct 3000 ttggctaccc tatgtgacca gaataacata caacaccata ttacctccgc ccaccaccca 3060 cgtgggaatg gtcgggttga gaaggtgaac cacctattga agaaaatatt gaaagcatta 3120 accaacgata cgatgcaaga ctgggattta aaactatatg acgctttaag aatctacaat 3180 gctacaccta caatttttaa ctacactcca ctttatcttg cacttggaat tgaaccacac 3240 cataatttaa atcaattaca aaaagattta attgaaaatt tgcaaaaaga attgccccca 3300 gaggtccaat ccacagaaga acacgaagaa aacccaaatg atgaacaaca agaagagggc 3360 agagaacaac aaatttcaag agaagaacaa 
caggacggca gagatcttgt acacttaaga 3420 atttacgaat tggaagcaat taagaaagct cgcaagttac acacaaattt gaaaacacga 3480 agaaacgcag tccaaaatat tgtaaaggac ccatatggca ttccagacct ttttacaagg 3540 gacattgggt atacagaatt agagctaaag cacgaaaata tgaatcaaat ttcgatggtc 3600 catatcaagt tcaagaagta ttaggtaaag gtgcttataa attgagagac atcactggaa 3660 gagaaaaagg aatctacaat caggatcagt tgaagttagc atattcagca gacaacgatc 3720 caatacaggt ttttagttct tttaataaag aatatgatcg agtacaacaa aaattgttag 3780 acaaaattca atcagaaaga gatcatcaat taaattgttt gtcagtccaa catttacaca 3840 gacaaagaag gttactcgat atatccagct gtcttgagca aattctgcaa taatttcgct 3900 aatcattgga ggaaagggta ga 3922 // ID Gypsy-6_MLP-I repbase; DNA; FNG; 8604 BP. XX AC AECX01002027; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-6_MLP_; KW Gypsy-6_MLP-LTR; Gypsy-6_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-8604 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002027; Positions 16433 25036. XX CC Positions [5729-6265] - Reverse transcriptase CC Positions [7436-7948] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(2069..6265,6269..8569) FT /product="Gypsy-6_MLP-I_1p" FT /translation="MADWNPPPHPRSRKEPERYGNLQPTSSSRPGTDGARL FT RVGSRTGSTASGKQRSNESRGKLPVTNQPLTKDLADTTSTTGVGRVQLPVA FT GSSSPVLEMGTRPNNPQRREQAQDASEAIGPRGQESQESLGEGSRGERGGF FT QGADERLVHQRDHKMESLRTVDQNLRSESRRTGSSAANDGHEPAVDLASEL FT TPRAFGPLLQSQSVEHKPERLSSLFLGPSGSGQGPTADPSSLSLGPFLEKL FT QRTRRPGEKELALLSTQERVILRKFSEITQSEFILPMMTELTNYLGHFEQQ FT MNASQDFNQSFREFLKVFNGNLEILDKNLKDFHSDSYSKFSEILKKEVHIL FT EVVDAQYKELYRITRQDNKDLSSLDKKIVSFLNQFPLESLNQFLEKNFTST FT LSLSDINENLLDFKASLNEFCENRESEKPSVVQNDLVMTTLIENLHEKLIQ FT KLENKLYQDTSSSDEINKFLQDQNLAVENEKIQRSNFQDFLTKSVEEIKEN FT QTQFENSTNSKLDRIENMLRKLTNDNQRDVEHVQEPVQSSTLPRNRVGSMV FT RNFEGISQTPMQGGSTQNVRSVPPHLDRQDNGLRFRAETTPNLGNNPFFPQ FT NVGQSTMLQPDNSMVDLETTTELFRQASIKKELRKEYPNVKDWPTFSGEGE FT YNHQEFIDWVDAVPVRLQMPDILITSKLSIVFTGVARQWVMDVMKEDGAHN FT KTWEEWKEAIQTRFGNSQWRSSMERLFMRDKFRPEIHSDCVKWATRQQQRF FT RSFRPHANSSEVAEKVLWQLPSTIQIPVQNRIGPDSNWTDFILAFEDIAKS FT LTPVRRTQTYKSNFQPKTHEREQQVSSTKAQPSTSERKASRSCNNCGSTDP FT KHVWRTCKGKGKAINAVEADEEDQAYDDEEAGAPMEFVEYADSIADSESRH FT SADLNVAAIESTFDGEVDISQVQAEAEMPIITPMEESNKQISDAKLIRCKP FT ASGKAHTIGFHSLTRVIVNGQDAELLLDSGASCSVVGNHYLNNFIPDWKLK FT LMPCNNNMKFSGCSGSLFPLGIIKLEVLFPHKQGGIRIEAEFVVMENSHTK FT YFILGDEHLSMYGIDIFHSKDKFFTFGNDVKKKKFGLPLQRPILSGLQVKK FT EKINELDNEQPEVEKISCKKKDDLTPKINEAKFSPKLSKEEKDAFVNLILQ FT YKDEFGLGEKELGTIEKHQVSIKLTIEKPYPPILRKAAYPASPRNQVEIEK FT PIDKLLRLGVMRKVGSSEEVDVTTPIIIAWHNGKSRLCGDFRALNTYTEPD FT RYPLPRTEHVLLNLGKAKFITLMDALKGFHQMIVELASRRYLRIICHMGIF FT EFVRMPFGIKNAPAFFQRMMDIEFNKELREGWLKIFIDDIIIFHDDFNEHL FT KGLEIVLKRVKAMGMTISLKKSNFGFDEVKALGHLVGLWIAVDQNKTAAVM FT NKPPPKNIKEMQCFLGFASYYRNFLPDYGKIANCLYKLTSNGVAFEMTKQR FT LEAWNQIRKMITEAPVLLQPDFTKPFKLYVDASFEGLGAALHQVQIVDGKS FT REGPIVFISRQLKDSETRYGSPQLEALALVWALEKLHYYLDGVYFEVIMDC FT TGVRSLMNIKTPNRHMLRWMLAIQEYRPYMTITHRPGKMHNNADGLSRMAL FT PNDSSNPAWDPEDAEIDIPVTGISLSELSDEFFSEIKESYKSDPNTLKLVK FT LLSKDKADLSLATTLESPWRGYYAEGKISLQSGLLYFREKHTSVVVLVSKI FT 
HKEQILQVCHDEFLSGHLSADRTLERIHTTAWWIRWKLDAEEYVASCDRCQ FT KANKATGKRFGLLQQIEAPTYPWQIINMDFVTGLPPAGIDNYNACLVVVDR FT FSRRTRFLPCYKESSAMDIALLFWERLISDVGLPRVIISDRDPKFTSEFWK FT SLFKLMGTKLAFSTAYHPQTDGLSERNIGTFEDLVRRYCAFGLEFKDKDGY FT THDWVSLLPALEIAYNSSVHSTTKKSPFEVERGYCPRLPKDKFKTRDVEIH FT PTSLSFASMISKAREFAANCIRESEEYNKDRWDKTHREPNFKVGDQVLLST FT TNFNNMEGSKKLRDLFVGPFVISKLVGPNAVELILTGEMERKHPVFPVSLI FT KKYNLPDENRFHQKVSTVLPTFTPDTEKKFLKILREKRVKDKNNKDIKLYL FT VRYKNQGADADEWLPEDKVPNGKVTLRAYRASKRDHKT" XX SQ Sequence 8604 BP; 2909 A; 1615 C; 1704 G; 2376 T; 0 other; tttgggggcc tcatccgggt tttcagaacc tttttctttt tttttgagct tgttatagct 60 cagtctcatt tagaaaagta aaaaataaca aaaacatata tttcatttca ttatttgatt 120 ttatttatcc acctttaaat tcaaaaactg agcaaagctc agcatttttt gaatttagga 180 taatagcttg gaggaaagaa agaaattctt ttcccccccc aaatttcgca tttttttaat 240 ttttttaata ttttgagaaa taggatcaat agttgatcca tctacaagtt aattacatat 300 aattcattgt tgacttatat ctgtaattca ttgatttgta ttgtgtaatt agccaaacac 360 gtggctcaca taaattacat acacatatca tcaggtgact cgtgtacaat taccacaagt 420 agtaattgta cagaagcccc gcgcgtggtc cgaaagccac caccgtcacg ttgctgaatt 480 tcgcaatcag ctcagcattt cgaatccaac cacccccagt gatcgttcga ttaagccacc 540 ggaacagcaa ccgaagcgaa gtaacctgca aagaagaaac aagaacaacc agtacaaaca 600 tacttagatt gaaataatca aaatactcac atattcaatc gataaagtca atcaaaaccg 660 attagattat cattccttaa tcttttcatt ttacttgtga ttggatcgaa gattgtttga 720 attcgagaaa gatataaaag aaaacactat cgagttagtt gtttttttct ttttctttgt 780 tgtcataaat aaaacgataa ctgatcgtta ctattacatc cttatcttgt tttccgtgca 840 actatctgca cccgtaatcg atcattgtca ttgatctaat ttgaagagtt caaaaactgc 900 tcaataattg atcttcgcct tttaaaaacg attcattcat tcatccgggt actcagcata 960 aacataagat attgaaagat ttctatctag tgagtcatca cttttttttg tttccaagtt 1020 gagataataa aattcagaaa gataactgat agttgttatc ttttaatatt attgcttttt 1080 tgtttatttg tttcaattga atcatacgag taggattcat cagtcgaacc tcggttccga 1140 aaccttatca tccatttaaa atcgaatagg aactattacc tttaccatcg attggaacat 1200 cgctttaacg gttcttttac tatagatcgt cccagttagc 
ggttcttttt tcatctaatt 1260 tttgaatcat agagttatag ataataaaat tgacagcaat tttgttttgg tcttttctca 1320 ttatcttcat tggtcattct tcacattcaa aatagatttt gataacccca ctttcaattg 1380 gatagagatt ggaccaaaat taaaatacga tacatcgagt aagtaagatt gtaaatcttc 1440 tttatcaatc aattcagagt tacatatcaa agatcaagca atcgagttga tttctttttc 1500 atttctctct aattttgatt tttttatatc gatcttcgtt tgctagtagt gaatttagcc 1560 gacctaaatc atgaactacc tcagctaact caagctcatc tggaagacga tttgaaccga 1620 ggtcaacggg agctttcatt cgctaccgac ttcaatttca ctagtagctc ggttgctaca 1680 accccgtctc accaaaccga gtcatcagct gcgactcctc agcctcaacg gttccaaaca 1740 gcggaccctt gggactttga acgacctcgc agccctaccg aaagcgtctt ggactttatt 1800 ttagaaagaa gtccgatcaa gaagatacct ccaactgaat ttcgattcac tactcaaaga 1860 ttcgaaagta agtcaacacg ttgtgttctc tatcattttc attaagaaga gtaagaatct 1920 aacagagttg tactgttact ctctttttct ccttagcatc gactacaaac gaaactcaaa 1980 gtaagcaata gactcatttg cttttctact ttacttgtta taaactgaga agtttattct 2040 tttttgctgt tatcgtaatt tctttctgat ggctgattgg aatccaccac cgcatcctcg 2100 cagtcgaaag gaaccagagc gatatggcaa cctccaaccc accagcagca gccggcccgg 2160 gaccgatgga gcaagattga gagttgggtc caggaccgga tcaacagcca gcggtaagca 2220 aaggtcaaac gaaagcagag gtaagcttcc agttaccaac caacctttga caaaagacct 2280 agctgacaca acttcaacta caggagttgg ccgagtacaa ctacctgtgg caggatcttc 2340 gagcccagta cttgaaatgg gaacgaggcc gaacaatcct cagcgacgtg agcaagcgca 2400 agatgcttct gaagcaatcg gtcctcgagg gcaggaaagc caggaaagcc ttggagaagg 2460 atcacgaggt gagcgaggag gatttcaagg agcagatgag aggctggttc atcaacggga 2520 ccacaagatg gaatccctgc gaacagtcga tcaaaatctt cggtctgaga gcagacgaac 2580 cggaagctca gcagccaacg acggccacga accagcagtc gacctcgcat ccgaactcac 2640 accaagggca ttcggcccat tactacaatc acagtcggtc gaacacaagc cggaacgact 2700 ttcaagctta tttttgggcc caagtggaag tgggcaaggg cctacagccg acccctcatc 2760 actttcattg gggccgttcc tcgaaaaact tcagaggaca aggaggccgg gggaaaaaga 2820 actagccctt ttgagtactc aagaacgagt gattctgcgt aagttttctg aaattactca 2880 gagtgaattc atcctgccta tgatgactga actgacaaat taccttggtc 
attttgaaca 2940 acagatgaat gcttcacaag acttcaatca atcatttagg gaatttttaa aagtttttaa 3000 tggtaactta gagattttag ataaaaatct caaagatttt cactcagact catattcaaa 3060 attttcagaa attttaaaaa aagaagttca tattttagaa gttgttgatg ctcaatacaa 3120 agagctttat agaattacca ggcaagataa taaagatctt tccagtctag ataaaaaaat 3180 tgtttccttt ttaaatcaat ttcctttaga atcattgaat caatttttag agaaaaattt 3240 tacatctact ttgtctttga gtgatatcaa tgaaaatctt ttagatttta aagcttcttt 3300 gaacgagttt tgtgagaacc gtgagagtga gaaaccaagt gtggtccaga atgatctagt 3360 tatgaccaca cttattgaaa atcttcatga aaaattgatc caaaaacttg agaacaaact 3420 ttatcaggat actagttcaa gtgatgaaat caataaattc ctccaagacc aaaaccttgc 3480 cgttgaaaat gaaaaaattc aacggagcaa ttttcaagat tttctcacta agagtgtaga 3540 agaaattaaa gaaaatcaaa cacagtttga aaattctact aatagtaaat tagataggat 3600 tgaaaacatg ttgcggaagc tgacaaatga caatcagcga gacgtagaac atgttcagga 3660 accagtacaa agcagtacct tacctaggaa cagagttgga tcaatggtga gaaattttga 3720 aggtattagc caaactccga tgcaaggagg ttcaactcaa aatgtcagaa gtgtacctcc 3780 tcaccttgac aggcaggata acggactaag gttcagagcg gaaactaccc cgaaccttgg 3840 aaacaatccc ttctttcccc aaaacgtagg tcaaagcact atgttgcaac cggacaattc 3900 aatggtagat ttagaaacaa ccacggaact gttccgtcaa gcatcaatca aaaaggaact 3960 caggaaagaa taccctaacg tcaaggattg gcctacgttc agcggtgaag gtgagtacaa 4020 tcaccaagag ttcattgatt gggttgacgc agtaccagtg cgcctacaaa tgccagatat 4080 cttgatcaca tcaaagttat caattgtatt tactggggtg gcccgtcaat gggttatgga 4140 cgtcatgaaa gaagatggcg cgcataataa gacctgggaa gaatggaaag aagctattca 4200 aaccaggttt ggtaattctc aatggagatc ttcaatggaa agattgttta tgagggataa 4260 attcaggcca gagatccatt ctgattgtgt caaatgggct acaagacagc aacagaggtt 4320 tagatcattc agaccacatg caaactcatc tgaagtagct gaaaaagtac tttggcagct 4380 acccagcaca atccaaatcc cagttcagaa tagaattggc ccagatagca actggacaga 4440 tttcatttta gcttttgaag atattgcaaa aagtttgacg cctgttagac gcacacagac 4500 atacaagtca aattttcagc caaaaactca tgaaagagaa caacaagtga gtagcaccaa 4560 agctcaacct tctacgtctg agagaaaagc cagtagatcc tgcaacaact gcggtagcac 
4620 agatcctaaa catgtttgga gaacgtgtaa aggaaagggc aaagctatca atgcggttga 4680 agcagatgaa gaagaccaag cttatgatga cgaagaagca ggcgctccaa tggaatttgt 4740 agaatacgct gatagtatag ctgattcaga gtcacggcac tcagctgacc tgaatgttgc 4800 agcaatagaa agtacctttg atggtgaagt tgacatatca caagtccaag ctgaagcaga 4860 aatgccaatt attacaccta tggaagaaag taataaacaa atctcagacg ccaaactgat 4920 caggtgtaaa ccggcctcag gcaaagcaca tacaattgga tttcacagtt tgactcgagt 4980 cattgtaaat ggtcaagacg cagaattgtt acttgacagt ggagcatctt gctctgtggt 5040 ggggaaccac tacttaaata actttattcc agactggaaa ttgaagttaa tgccttgcaa 5100 taacaacatg aaattttccg gttgtagtgg aagtttgttt ccactaggaa ttataaaact 5160 tgaagtttta tttccgcata aacaaggtgg cattcgaata gaagccgaat ttgtggttat 5220 ggaaaattcc cacaccaagt atttcatact gggggatgaa cacttgtcta tgtacggaat 5280 tgatatcttt catagcaaag acaaattctt cacatttggt aatgatgtaa agaaaaagaa 5340 gtttggttta ccacttcaaa gaccaattct ttcaggttta caagtcaaga aagagaaaat 5400 caatgaactt gataatgaac aacctgaagt agagaaaatc tcgtgtaaga agaaagatga 5460 cttgacacct aaaatcaatg aagcaaagtt tagtcccaaa ctttcaaaag aggaaaagga 5520 tgcatttgtt aaccttattc tccagtacaa agatgagttt ggattgggtg aaaaagaatt 5580 aggtactatt gaaaagcatc aagtaagcat caaacttaca attgaaaaac cctaccctcc 5640 aatattgagg aaagctgcct atccggctag tcctagaaat caggtagaaa ttgaaaaacc 5700 catagacaaa cttcttagac taggtgtaat gaggaaagta gggagttctg aagaagttga 5760 tgtgaccaca ccaatcatca tagcctggca caacggcaaa tcacgcttat gcggagactt 5820 tcgagctcta aacacgtaca cagagcctga tagatatcct ttacccagaa ctgaacatgt 5880 cttgcttaat ttaggaaaag caaaattcat cacactgatg gacgcattga aaggtttcca 5940 ccaaatgata gtagaactgg caagcagaag atatctcaga ataatctgtc atatgggaat 6000 attcgagttc gtgagaatgc cgtttggaat aaaaaatgcg cctgccttct ttcaaagaat 6060 gatggacatt gagttcaaca aagaattgcg agaaggatgg ttaaaaatct tcattgatga 6120 tataatcatt tttcatgatg atttcaacga acatctcaaa ggactggaga ttgtcttgaa 6180 aagagtcaaa gccatgggaa tgacaatttc attgaagaag tcaaattttg gttttgatga 6240 agttaaagct ttggggcact tggtatgagg attatggata gcagtggatc agaataaaac 6300 
tgctgcagtt atgaataaac ctccgcccaa aaacatcaaa gagatgcagt gttttcttgg 6360 gtttgcaagt tactatagaa atttcctacc agactacggg aaaattgcaa attgtcttta 6420 taagctgaca tccaatggtg ttgcctttga aatgacaaaa caaaggttgg aagcttggaa 6480 tcaaattaga aaaatgataa cagaagcacc agttctgtta caaccagact tcacaaaacc 6540 tttcaagcta tacgtagatg cttcatttga aggtttaggt gctgcattac accaagtcca 6600 aattgtggat ggcaaatcta gagaaggacc tatagttttc atatctcgac aattaaaaga 6660 tagtgaaact aggtatggtt caccacagct agaagcgtta gctctggtgt gggctctaga 6720 aaaattgcac tattacttgg atggtgtgta cttcgaagtc attatggact gtaccggagt 6780 aaggtcactc atgaatatca aaacacccaa cagacacatg ttgagatgga tgttagctat 6840 tcaggagtac agaccttata tgaccataac tcacagaccg ggtaaaatgc ataacaatgc 6900 ggatggtcta agtagaatgg cactgccgaa tgacagcagc aatccagcct gggatcctga 6960 ggacgcagag attgatatac cagtgacggg tatcagcctt tcagaattga gtgacgagtt 7020 cttttccgaa atcaaagaaa gttacaagtc agatcccaat acgctaaagc ttgtcaaact 7080 tttgtcaaaa gacaaagcag atttgagtct tgcaaccact ttagaatcac cgtggagagg 7140 ttattacgca gaaggtaaaa tctcattgca atcaggtttg ttatatttca gagaaaagca 7200 tacatcagtt gtagttttag tgtcaaaaat tcataaggag cagatactac aagtatgtca 7260 tgatgagttc ctttctggtc acttatcagc agataggact cttgagagaa ttcataccac 7320 agcttggtgg ataagatgga agcttgatgc tgaggagtac gtagctagct gtgatagatg 7380 tcaaaaagca aacaaagcta cgggcaaaag atttggacta cttcagcaaa ttgaagcacc 7440 aacttatcca tggcagatta tcaatatgga ctttgtaaca ggtttaccac cagcaggtat 7500 tgataactat aatgcttgtc ttgtagtagt agatagattc tccaggcgaa ctagattctt 7560 accgtgctat aaagaatcgt cggcaatgga tattgcactg cttttttggg aaagattgat 7620 ctccgatgtt ggtttaccaa gggtaataat cagtgataga gatcccaaat ttacgtcaga 7680 attttggaaa agtcttttta aactcatggg tactaaatta gccttctcta cagcctatca 7740 tcctcaaact gatggtttaa gtgaaagaaa tataggtaca tttgaggatt tagttaggcg 7800 gtactgtgct tttggtcttg aatttaaaga caaagacggt tatacccatg attgggtaag 7860 cttattacct gctctagaaa tagcttacaa cagtagtgtg catagtacta caaagaaatc 7920 acctttcgaa gtagaaagag ggtattgtcc acgactgcca aaggacaaat ttaaaacccg 7980 agatgtagaa 
attcatccta catctttatc ttttgcatct atgatcagca aagcaagaga 8040 gtttgctgcc aattgtatca gagaatcaga agaatacaat aaagacagat gggataagac 8100 ccatagagaa cctaatttta aagttggaga tcaagtgctg ttgtcaacaa caaactttaa 8160 caatatggaa ggatccaaaa aacttagaga tctgtttgta ggaccttttg tcatatcaaa 8220 actagtagga ccaaacgcag tagagcttat actgacgggg gagatggaaa gaaaacaccc 8280 tgtgttccca gtctctctca taaagaagta taacttgccg gatgaaaata gatttcacca 8340 aaaagtcagt acagtcttgc caacctttac tccagacact gaaaagaaat ttctcaagat 8400 attgagggaa aagagagtaa aggataaaaa taataaagat attaaattat atctagtaag 8460 atataaaaac caaggtgctg atgcagacga atggttacca gaggataaag tgcctaatgg 8520 taaggtcact ttgagagcgt acagagcatc aaaaagagat cataaaactt gagaaggtgt 8580 atgacctctt ctgatggtgg ggaa 8604 // ID RPS1_CA repbase; DNA; FNG; 3313 BP. XX AC . XX DT 26-APR-2005 (Rel. 10.04, Created) DT 26-APR-2005 (Rel. 10.04, Last updated, Version 1) XX DE C. albicans repeat RPS1. XX KW Transposable Element; RPS1_CA. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-3313 RA Iwaguchi S., Homma M., Chibana H. and Tanaka K.; RT "Isolation and characterization of a repeated sequence (RPS1) of RT Candida albicans."; RL J Gen Microbiol 138(9), 1893-1900 (1992). XX RN [2] RP 1-3313 RA Gentles A. and Jurka J.; RT "C. albicans RPS1 repeat."; RL Direct Submission to Repbase Update (26-APR-2005). 
XX DR [2] (Consensus) XX SQ Sequence 3313 BP; 1140 A; 641 C; 594 G; 938 T; 0 other; gaattcaaca cacctggaat tgattttcac acactatatt catcttagtg ctttctctga 60 ccaagcataa atcaatacca acttaaatat caatatattg aaatgccttt gattaaaaca 120 ataagttaaa tctgattatt tctaatgcaa attcaatggc ttgcagttca atctctacaa 180 aataattaaa gtgacttcca aatatcaatc atgagtttca aaatgtgcat aatatgctcc 240 cgaaatgaaa aaccaactta tatcaatatg ctttgtgaaa taagggggca atgcatggtc 300 tcatcttctg aatgcaaatg gtatgcattt ctctgtagtc atttaatatc ccggttattc 360 aaataagtct taatatgacc aatcaccaat gttatcacta ctcttatata actaaaccaa 420 ttttcagaaa gacaatttga acaatgacac aaaaaccaag taacgactat gacaccaata 480 caagtgtgac tagttctgcc aaacacatta ctctaaaaca catcattgta gattatgaat 540 atcacttgaa tatgatatta ggtagattaa cggaccagcg tctaaattaa tgggaacttc 600 aaaatcaata ccaaatgacc aattataact tgacccatta tcaattctct gtaactccca 660 aatgataatg attcactata tactctcttt cattctcttg attggactag ataaatcaga 720 cattccgccg aaacaaaact ggaaaccgct agccgctatc acaaaatatt gaaactaatg 780 gcgttaatgg gattgctaat acatttcaaa agatcttaat tcaataacaa ttgttggatt 840 aattgggatt tgttttgtat tacttgtggt gtttgaaatg gttgtgtctt atagtgtact 900 aatgagatga ttaaattggg tgtaagaaaa tagttgtaaa gaaaatagat taatagaaat 960 cgcagtgcta gaaaagtact aaaaatttct aagaaaaaat tttgtatgca aaattatttc 1020 cgcaccttcg gaaattacac aaactaatct ccgtttccta acaaatttac tagtattttt 1080 gggctcgggt acacgaccct taactccgtc tccgtgtcac gtgattttcc catacaaaat 1140 cctgcacaat ccgccacaag tctcttatct ccgcattctc gggaatacct cagacaaaga 1200 agaaagatac tccacactca ttttccctcc cgcacgcccc agctccactt cttctcatcc 1260 cgatctaaca taagaatttg agaatacaat aaaaaataac agagatattg gagatcaaaa 1320 tattcaccat cgatttcaac aaaaatgcta ccaacaacca ccttcccata cttaatatta 1380 aacttaataa aatcatcagt tctatctcta caatataaga actaccaaga tagcagagct 1440 cagaagtatt ataattacta tattaacaat agaaatttaa tgcctcacaa tacaaaccaa 1500 acactcattc tacaattctc tagggattga ccaccaaaaa aaacgacaca aaatccaaac 1560 tacaatagtt aagaaggaaa gttgaagtgt gggaggtaga ggagggctac aagagccaat 1620 caggtgccga 
tttgtggcct acagttgtgg ttttaaaacc cgagagaatc gttaataacg 1680 gcaattagtt gggtgctgca ggagcaaaaa ggccgttttg tccatagtta agagcaccct 1740 ggtaaccccg tttgctaata gcacaaccaa ttgaagctgg tatttggtgg ctctggtgtc 1800 aatttatagc caacaataaa cattttcaaa tccgtctaga ccggtcaaaa gaagagttga 1860 gcttccatct ctgggtcaaa aaaggccgtt ttggccatag ttaaggccac cccctttctg 1920 tagcacaacc aattgaagtt ggtatttggt ggctctagtg ccgatttgta gagtcaagtt 1980 atagtgtttg catccgagag ttttgattta ttcagtgttg ttttcattgg ttgagggcaa 2040 aaagttcgca tcgagcagaa aaggtcgtgc ccggggcata gtggatagac aacgattact 2100 gatgaaccac atgtgctaca aagaccaaac tagggccgtt ttgaagctac aatcatgtag 2160 agtattgggt gtgaattagg catgaatcgg atcagaattg gttgagctat tgaagaaaat 2220 gttttctccg tggaaatgtg aaattaactc cgccaaggct gacacagtca gtttcgatgc 2280 tagaaagacc caactagtgc caattcatag ccataagatg taagtactat gactgtaaga 2340 gctgttagaa acaaggttca actgctttct gtagaacaaa aaaggccgtt tttgccatag 2400 ttaaggaatt tgcggtgatg tccgttgaag actgcgcgat gaaaaataac gctacaaaaa 2460 tcaaactagt gccgatttat acctttttct tatgagtgct aaccatgcaa gaactgttag 2520 aaacgaaata caactgctat ctgtggaaca aaaaaggccg ttttggccat attaagggag 2580 ccgcagctat gtctgatcac aactacgcga ccaaattcaa cgctacaaaa atcaaactag 2640 tgccgtttat acctttggat tatgagttct atccctgcaa gaactgttag aaacgaaata 2700 caactgttat ctgtggaaca aaaaaggccg ttttggccat agttaaggga gccgcagcta 2760 tgtctgatca caactacgcg accaaattca acgctacaaa aatcaaacta gtgccgattt 2820 atacctttgg attatgagtt ctatccctgc aagcactgtt agaaacgaga tttaactgca 2880 tcctgtggaa caaaaaaggc cgttttgtcc atagttaaga cggaaaaatt atgtatattg 2940 ttgacagaag atcgaatttg aatgagttaa tgacaaggct agtatcgatt tggaaccaca 3000 aaatgtgagt gtcaaagccg tgggatactg ttagaaaaga gatacaactg cataccgtgg 3060 gacaaaaaag gccgttttgg ccatagttaa gaatcactcg tatttttatt ttgactactg 3120 ctcgaccaat agaggcgtta caaagaccaa actaataccg ttttgaagcc aagacaaatg 3180 tttattatac gcacagatgg tgattgaggt aggcgaatga ttggagatgc tacaacaaaa 3240 aaggccgttt ttttatccct tttttcctaa caaattttcg acgaaaaaca ccaagatgcc 3300 ttgccaccta aat 3313 // 
ID Copia-2_BDJ-I repbase; DNA; FNG; 4558 BP. XX AC AATT01000295; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Batrachochytrium dendrobatidis DE genome: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_BDJ_; KW Copia-2_BDJ-LTR; Copia-2_BDJ-I. XX OS Batrachochytrium dendrobatidis OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; Chytridiales; OC Chytridiales incertae sedis; Batrachochytrium. XX RN [1] RP 1-4558 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Batrachochytrium dendrobatidis RT genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; AATT01000295; Positions 9712 5155. XX CC Positions [1634-2143] - Integrase core CC 'ATTAG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 5..4558 FT /product="Copia1-BDE_I_1p" FT /translation="MSPEESTRVKVADGLFIPLLTNTNFQDWNNGIHAAMS FT SFNLWELHIKIAPPEEKTNTFIVNDYKAYSILATTTGKENDDLTHNSEGAP FT LSAHDAYKAIHAHYSTGSTSSRIQLFADLNAIVYDPDQGVEVLYNSMIEIR FT KKLANQKRCYPDVEFIEMMISRLIVVSHHWGNLRTRYDRLSEEEFSTTTTR FT RDLLAEERSLISAGVLPPITTGEHAHQANTRSNTKLMSKNSNRNNANQMQS FT SNGRHTAPQNQDSKRSPFWPNGKRKVLCNNCGRLGSHTEKDCRLPTQNSNY FT YSQNNDTYGLNACDISKTQDHTSTLKTPQDEFMFQATNGTNGMQDKLILDS FT GATAHMICHKDWLTDLVPITREIHCSGAEKHVATAKGTLTIESEYNGRKST FT IHLADVLYVPTFKVNLLSETSFLDKGCDIDITRYSRTIYAANKKPILLCNR FT VKNLSIADCKIHTPIHQSYRSQIVDDSDNDDAYFSSIASPAVSLETWHKRF FT GHLNTDAIKALANLTNGLKLTSDKGLTNQNCSGCTQGKAHRAPFERIRESR FT TNGINELLHMDLAGPFEVESLGKAKYYLIIVDDFSRYSHLYTLQAKSQTSE FT RLKEHIALMETGTGIPVRRIRSDNGGEFLSKDFAAYLKKKGIEHQTTAPYT FT PQHNGVAERRNRSIGNAIRSMILSAGLPKRFWAEAAATAVYLQNRSPHSAI FT NNLTPYEKKFGKKPWVSHLKTFGCEANAVIPSTLLRKLDSRTSNCVMLGYK FT QGSKAYRLYDLDSKKIISTRDAKFNENNFPFKNQKTKQRSQQTGKLKIIQL FT EISRTQEIPVKKATPAKEVITVTETQQEEDLGNSSQTQPIIATSEYNDGTY FT PTVHQSRPNPEDSNQASQIIRSGGVTTKDPFESSSPSGFNPSRLPNPLRPY FT KGYIYEEITDEEPESHSSPPTGRGTSRNKQILSSLNEKDLKNNTRALTAIE FT 
TVEDKAELSTYSPLYKNPHSFPELIDEDFSYFSAGIEDNIIPTNWKQAMAT FT KDSQSWRLAADLEMNALHKNQTWIECQLPSGRKPIQVKWVFRIKRNSDGTI FT DKYKARLVAKGFVQIPGIDYNETYAPVLRFTSFRTLLTLAAILDLEIDHTD FT ANNAFLNGVITEDLYIELPEGYNNNINNKDSGRTVGKLQKALYGLKQSPHK FT WNEILVQECTNLGFIQCISDPCIMIKRDQHLFVMIAIYVDDILFVGNNRPY FT LDEAKKDLFKVFSMKDLGPIHTCLNIRVIRDRQNKRISVSQEHYLKEVLTR FT FNMTDCKPACTPFDPGIVLTPALETENTTDAPYREAVGSLMYAMVATRPDL FT GAAIGLVSRYLHKSNDSHWTAVKRILRYVKHSISYSLVLGGDSPTLTGYCD FT ADWAGDVDSRKSTSGYAFFIGNGCISWRSTKQTSVAVSTMEAEYIAAATAT FT RELLFLRTLLKELGFEQLNPTILYSDSQSAIANTKNLAPNHAATKHMDVKL FT KFLRDQVTNKTVSVEYVKTNDQVADILTKGLPRFAFDKLSDLLGLYPVQGV FT VG" XX SQ Sequence 4558 BP; 1591 A; 1058 C; 862 G; 1047 T; 0 other; ggttatgagc ccagaagaga gcacaagagt caaggtcgcg gacggcttat tcataccact 60 cctgactaat acaaatttcc aagattggaa caatggcatt catgctgcaa tgagcagctt 120 taatctatgg gagcttcata ttaaaatcgc cccaccagaa gaaaagacca atacattcat 180 cgtgaatgat tacaaggcat atagcatact tgccaccact actggcaaag aaaacgatga 240 cctaacccat aacagtgaag gagctccact aagtgcgcac gatgcctaca aagctatcca 300 tgcccactat tctactggaa gcacatcctc acgcatacaa ctatttgcag atttaaacgc 360 catagtttac gacccggatc aaggcgtaga agtgctttat aactctatga ttgaaattcg 420 caagaaactg gcaaatcaaa aaagatgcta tcctgacgtc gaattcatag agatgatgat 480 aagtcgactt atcgtggtat cccatcactg gggaaatcta cggactcgct acgacagact 540 gtcagaagag gagttctcta ccacaactac aaggagagat cttttggccg aagaacggtc 600 actaatttct gcaggagtac tacctccaat aactactggt gaacacgcac accaggcaaa 660 cacgagatca aatacaaagc tcatgtcaaa gaattccaat cggaacaatg caaatcaaat 720 gcaaagtagt aacggacgac acacagctcc gcaaaatcaa gattctaaaa gatcaccttt 780 ctggccaaac ggaaagagaa aagtcctatg taataactgt ggaagactag gctcccacac 840 agaaaaagac tgtagactgc ctacccaaaa cagcaactac tacagccaaa ataatgatac 900 atatgggtta aatgcttgcg acatctcgaa aacgcaagat catacaagta cgcttaaaac 960 acctcaagac gagtttatgt ttcaagcaac aaatggtaca aatggcatgc aggacaagct 1020 catactagat tcaggagcaa cagctcacat gatctgtcac aaagattggt tgaccgattt 1080 agtcccaatc acacgtgaga ttcactgttc tggtgcagaa aagcatgtgg caactgcaaa 
1140 aggaaccttg acgatcgagt ccgaatacaa cggaagaaag tcaacgattc atctggcaga 1200 tgtcttatat gtgccgacat tcaaagtaaa cttgctctcg gagactagct tcctggacaa 1260 aggctgtgat atcgatatca caagatactc tcggaccata tacgctgcta acaagaagcc 1320 tatattactg tgcaaccgag ttaaaaatct aagcatcgcc gactgcaaaa ttcacacacc 1380 cattcatcaa agctacagat cacaaattgt cgatgatagc gataacgatg atgcatattt 1440 ttcatcgatc gcttccccag ctgtatctct agaaacatgg cacaaaagat ttggccattt 1500 aaacactgat gctatcaaag cactagccaa ccttacaaac ggactcaagc tgacatcaga 1560 caaaggcttg actaaccaaa attgctcagg atgtacccaa ggcaaggccc accgagcacc 1620 atttgaacgg atcagagaat caagaacaaa tggaattaac gagctgctac acatggatct 1680 agcaggacct ttcgaagtag aatctttagg aaaggccaag tactacttga ttattgtaga 1740 tgacttctcg cggtattcac acctatacac actacaagcc aagtcacaga cctctgaacg 1800 cttgaaagaa cacattgcgc ttatggaaac tgggacaggt atccctgtaa gacgcatacg 1860 aagtgacaat ggcggagaat ttctctcaaa agacttcgcc gcctacctca aaaaaaaagg 1920 gatcgagcat cagactacag ccccttatac tcctcagcat aacggggtag ctgagcgaag 1980 aaacagatcc attggaaacg cgatcagatc aatgatccta tccgctggac taccaaaacg 2040 attctgggca gaagcagctg caactgccgt gtacttacag aataggtcgc cacattcggc 2100 tataaacaac ctaactccat acgagaaaaa gtttggaaag aaaccatggg tttcacatct 2160 caagaccttt ggctgcgaag caaatgcagt cataccttca acactactga ggaaattgga 2220 ttcaagaaca tccaattgtg ttatgctagg ttacaaacaa ggatccaagg cttatcgact 2280 atacgatctg gattcaaaga aaataatatc aacaagggat gcaaagttta atgagaataa 2340 tttcccattc aaaaatcaaa agacgaagca acgctcgcaa caaactggaa aattgaaaat 2400 tattcaacta gaaatttcaa ggacacaaga aataccagtc aaaaaggcaa caccagccaa 2460 ggaggttata acagtcaccg aaacccagca agaggaagat ctgggaaatt cgagtcaaac 2520 tcaacctatt atcgctacta gcgaatacaa tgatggaaca tatcctactg ttcatcaatc 2580 caggcctaac cctgaggact caaatcaagc gagtcaaatt atcagatctg gtggggttac 2640 tactaaagac ccatttgaat cttccagtcc atcaggattc aatccatcaa gactgccaaa 2700 cccacttaga ccatacaagg ggtatattta tgaagaaatt actgatgaag aaccagaatc 2760 acattcttct ccaccaactg gacgcggaac atccagaaac aagcaaatac tgagctctct 2820 
taacgaaaaa gatctcaaga acaatactag agcattaact gccatagaga ccgtcgaaga 2880 taaagctgaa ctatccactt attcgcctct ctacaaaaat cctcactcat ttcctgagtt 2940 aatcgatgag gacttctcct acttttctgc tggaatcgag gacaatataa ttccaacaaa 3000 ctggaaacaa gctatggcta ctaaagactc ccagagttgg agactagctg ccgacttgga 3060 aatgaacgcc cttcataaaa atcaaacctg gatcgagtgc cagctcccaa gtggacgaaa 3120 accaattcaa gtaaaatggg tattccggat caaaagaaat tccgatggta ctatcgacaa 3180 atacaaggct agactagtcg ctaaaggatt cgttcaaatc ccaggaatcg actataatga 3240 aacatacgca ccagtacttc gtttcacatc tttcagaaca ctacttactc tcgccgctat 3300 attggatcta gaaatcgatc acacagatgc taacaatgca ttcttaaacg gggttatcac 3360 tgaagatctt tacatcgaat tacctgaagg atacaacaac aatatcaaca acaaagattc 3420 gggcagaact gtcggaaaac tacaaaaggc cctatacgga ctaaaacagt ctcctcacaa 3480 atggaacgaa attcttgtcc aagaatgtac aaacttagga ttcattcaat gcatctctga 3540 tccgtgcatt atgatcaaac gtgatcaaca tttattcgtc atgatcgcca tatacgttga 3600 cgacatatta ttcgtcggaa acaatcgacc atatcttgac gaagctaaaa aggacttatt 3660 caaggtattc tctatgaaag acctaggacc aatccataca tgcctcaaca tacgcgttat 3720 aagagacagg cagaacaaaa ggatcagcgt atctcaagaa cattatctaa aggaggtact 3780 gacaagattc aacatgacag attgcaaacc tgcatgtact ccatttgatc ctggaatagt 3840 tctgacacct gctctagaaa cagaaaatac tactgacgcg ccatacagag aagccgttgg 3900 gtcattaatg tacgccatgg ttgctacaag acctgatcta ggtgctgcta tcggactagt 3960 ttccagatat ctgcacaagt caaacgactc tcattggact gctgttaaac gtatactaag 4020 atacgtaaaa cattcaatca gctactctct tgtacttggt ggggattccc ctactctaac 4080 aggatattgc gacgctgatt gggcaggaga tgtagactcc agaaaatcta cttcaggcta 4140 cgcattcttt attggaaatg gatgcatctc gtggagatca acaaaacaga cctcagtggc 4200 tgtatctacc atggaagccg aatatattgc agcagctacg gcaactagag aactattatt 4260 tctcagaact ctgctaaaag aactaggatt cgaacagcta aatcctacta tactttactc 4320 tgattcgcaa tctgcaattg caaatactaa aaacttggct ccaaatcatg ctgctacaaa 4380 gcatatggac gtgaagctaa agtttcttcg cgatcaagta acaaacaaaa cagtatcggt 4440 cgaatacgta aaaacaaacg atcaagtcgc cgacatactt accaaaggat taccacggtt 4500 cgcctttgac 
aagctttcag acctactcgg actttaccct gttcaaggcg tggtgggg 4558 // ID FOT1_FO repbase; DNA; FNG; 1928 BP. XX AC X64799; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 02-JUL-2010 (Rel. 2.02, Last updated, Version 3) XX DE F. oxysporum Fot 1 transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW transposable element Fot1; FOFOT1; FOT1_FO. XX NM FOT1_FO. XX OS Fusarium oxysporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; OC mitosporic Hypocreales; Fusarium; OC Fusarium oxysporum species complex. XX RN [1] RP 1-1928 RA Daboussi J.M., Langin T. and Brygoo Y.; RT "Fot1, a new family of fungal transposable elements."; RL Mol. Gen. Genet 232(1), 12-16 (1992). XX RN [2] RP 1-1928 RA Langin J.T.; RT "FOT1_FO."; RL Direct Submission to Genbank (17-MAY-1992)T.J. Langin, C N R S, RL Lab de Cryptogamie Bat 400, Universite` Paris-Sud, 91405 Orsay RL Cedex, FRANCE. XX DR GenBank; X64799; Positions 1 1928. XX SQ Sequence 1928 BP; 477 A; 493 C; 491 G; 467 T; 0 other; agtcaagcac ccatgtaacc gaccccccct ggtaaccgac ccccacctca gacacgtctt 60 cagacgcgtc cacaccagta tttaaatcca cgataaatcg agttcttctt caatttactt 120 ttttctttct tcatcctctt cttttacttc tacaatgccg gtatactctg cggacgacct 180 agaaaatgcc attgcagact tcaagaatgg ggtctctttg aagaccgccg cgaaaaaaaa 240 cggtctacca cccagcaccc tacgaggtcg cctcactggt gcgcaaagtc gtcaggtcgc 300 tcgccaagaa caactacgcc ttaccaccga tcaagaagat gaccttgagc gctggattct 360 gcgacaggaa aagctcggcc acgctccaac tcacgcgcaa gtgcgaacta tcgtccgcag 420 cgttctcgcg cgtcacgggg atcacgcgcc attaggaagg aagtggacta cgcgattcgt 480 ggagcgccac cctgccttga agacaaagtt gggtcgccgt acagactggg agcgtgtaaa 540 tgctgcgacc ccggcgaata tcaagcgctt gttcgacgtg tatgagaccg tggattggat 600 cccccccgaa cgacggtata acgccgacga gggcggcatt atggaaggcc agggcgttaa 660 cggcctcgtg atcggctcgt cacaggagag ccctaacgcg gtaccagtca aaacagcgac 720 cgtacgtacg tggacttcca ttattgagtg tatatcagcg gtcggggttg tcctccatcc 780 gctcgttata ttcaaagcga aaacgattca agagcaatgg 
ttccgacgcg aatttttaca 840 gaagcacctt ggttggcaag ttaccttctc aaaaaatggc tggacgagca actctattgc 900 gttagagtgg cttgagaagg tattccttcc ccaaacggct cctgcagatc cagctgatgc 960 ccgtttatta atcgttgacg gccatggctc gcatgcaacc gagcaattca tggccaagtg 1020 ttacctgaac aatgtttatc ttctcttttt accggcacat tgttctcatg tactccagcc 1080 tttagatctc ggttgctttt ctagtctgaa ggcggcgtac cgtactttgg ttggcgagca 1140 taccgctctg acggattcta cccgggttgg gaagcaaagg tttctcgatt tttatgcgag 1200 agcccgcgaa atcggtttcc ggaaggtaaa tattcgatct ggatggcggg cagctggctt 1260 atggcctgtg aatattaaca aaccgctcgc ttcgcgttgg gtgatggtgc tcacgaagtc 1320 ggcactacct ccctcagaaa ctctcgatat cgcaacgcca aagcgtggcg gcgacgttgt 1380 aaagcttttc tctgccaaaa gcagctctcc ttcttcacgg ctctcaattc gaaaagcggc 1440 tgcggcgtta gacaaggttg caatcgagct cgcgatgaaa gaccgcgaaa ttgagcggtt 1500 acgcgctcag cttgaagcgg cgcaaccgaa gaaaaaacgg aaaatcaggc aggatccgaa 1560 cgaatgcttt ataagcctcg cacagatact tgcagaggcc aatcgcgagc ctgatcaacg 1620 tgttattcag tcacagaaag gcgatcttga ttgtattgtg gtggatggga aaagtagctc 1680 tgagtcggag gaagatccag cccctgtgcg ccgatcaacc cgcgtcaggc gcgctacaaa 1740 aatgtatatt agacaggatt tgagtagcga agagagcgat tgagagacct ttaaatacca 1800 ttttctctgc atttttagct atttatttga caaagttgtg ttaaaatatt gtaaatagat 1860 cgatggattt tctgagtctt gcaggtgggt cggttaccag ggggggtcgg ttacatgggt 1920 gcttgact 1928 // ID Gypsy-2_MVPL-LTR repbase; DNA; FNG; 248 BP. XX AC AEIJ01000741; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Microbotryum violaceum genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_MVPL_; KW Gypsy-2_MVPL-I; Gypsy-2_MVPL-LTR. XX OS Microbotryum violaceum OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Microbotryales; Microbotryaceae; OC Microbotryum. XX RN [1] RP 1-248 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Microbotryum violaceum genome."; RL Direct Submission to RU (29-JAN-2011). 
XX DR Genome; AEIJ01000741; Positions 21258 21011. XX SQ Sequence 248 BP; 46 A; 61 C; 69 G; 72 T; 0 other; tgcaaggccc gaggcctttt gcccgggagc cggaagaagg tgatggaatt ttggggctta 60 gttggatagc ttgcgcacac ctagaacctc tcttggcttt cgccagctga ctgcctcgag 120 gttctaggtg agtgcaccaa tcccctatgt ttagtattgt agctgactgc ctcaaggttc 180 tagacttgaa ctacgactgt ttgatcctgt ctattgttct tggagcacgg ctgcatgcgg 240 tgcttcca 248 // ID Copia-9_LBS-I repbase; DNA; FNG; 4771 BP. XX AC ABFE01002869; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-9_LBS_; KW Copia-9_LBS-LTR; Copia-9_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-4771 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01002869; Positions 35163 30393. XX CC Positions [1927-2445] - Integrase core CC 'TGTAT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 94..3477 FT /product="Copia-9_LBS-I_1p" FT /translation="MRPTVFVLRLFLVMAPKESVAATSLSLNEALLKADAL FT RPLISLTRPNNVFVGEKLDKLKSNYKKWSKDMTHYLAINGLLSYVLGEKSK FT PSPTTEPHACENWVENDRFAYTAIAMNVSDDDEAELDMAKGAKAAWDTLRE FT RHQNEGPVRQVDLLRTALNIKCKKGTPLPQTCREICDAVDQAFAMGDFTAD FT LFCCIAIINSLEDFPHIRSSILCDLRASTKDKEYTSKDIRHYLESEETLHS FT ATKSFSTSDVALAIHTKSYSTPICSNCKREGHTDLYCISTGGGMAGKTIQE FT SKDARKRDREKARGNTNNAIKSNSNTGKIAVNVKDSTGKAFIIHVDPSDIS FT APSSSKTEFAGLASDLPESILPDSLENIEWSGWLALEEEPTTSLDWRKHTK FT PVDIAAISEISPLQQNKRTPISLDDLPFYVDTGATVHISPEQGDFLTLRPI FT AARSVKGVGGSSVTAIGLGNIKLHIARGAHIILENVLFIPNAAVWLISVST FT LARDSQAVAHFDEATCWITNKSSGATIAHGTLLPTKNLYSLNLLSPHAEHA FT FTISHAPDLETWHRCLGHANYQAVREMAKNGLIPGMPASFPPTNPPKCEFC FT VLGKQTKTPVPKTRKEGPGHKATRVLEKVWVDLSGQHLRSRAGNEYVMDIV FT DDYTSQLWSIPLKNKDDAFPELKAWELARESETSQKIGTYITDQGELKSDK FT MNEWLKSRGIEQRFTAPYTSAHIGRVERMHQTLMAKARTMRIYADCPPYLW FT DEFYLTAVHLHSKTLTRSLPGGITPWEKYHRRKLDYSYMREIGCRVFVLIQ FT NKHNPKVFDRSIECTLIGYDPNAKTYRCYHRESKRVISSYHVRFLESHDGH FT MRSLPATEDIPLTLNEILQNATPTPILSDDEEEDVLPDDLSQINNTQDPQH FT VANPTDDAQIRHSSRIAEKTNKPKPTRTEIAIQQSIDTGIRLRETRAERKK FT TLQDIQEEEMRNSPEIVENAAIAELREIFGTLNLSDVEGDRIDRALSAISE FT LPEIDPSTLEFEDEPRTWNEAKTSADTKRWEEGYCDELKSLKDMGVYKLIP FT RSDVPQGKKIRKGMPVFRIKRNEIGKAMRWKVRLVFKGCEQIYGKDYTKTT FT SPTMRMESWRILLHIAATLD" XX SQ Sequence 4771 BP; 1443 A; 1266 C; 1008 G; 1054 T; 0 other; ggttatgggc ctatagtcac acatggtgac cgaagaacct aaccctagat tttaatcttg 60 tcacatcaaa cacgttacga acgcgtaaat tagatgcgtc caactgtctt tgttctccgt 120 ctatttctcg tcatggcgcc aaaggaatcc gtcgccgcca ccagtctttc cctcaacgag 180 gcactgctta aagctgatgc acttcgtccc ttgatatccc ttactcgccc aaacaacgtt 240 ttcgtcggcg aaaagctcga caaactcaaa tctaactaca agaaatggag caaagatatg 300 acgcattatc tcgccatcaa cggtctcctc agctacgtcc ttggtgaaaa atccaaacca 360 tcccctacaa ctgaacctca tgcctgtgaa aactgggtgg agaacgaccg cttcgcctat 420 acggccatcg caatgaacgt ctccgacgac gacgaagcgg aattagatat ggcaaagggg 480 gccaaagcag catgggacac actaagggag cgacaccaga atgaggggcc 
agttcgacag 540 gtggatctct tgcgcactgc tcttaatatt aagtgcaaga aaggtacgcc actccctcaa 600 acgtgccgcg aaatttgcga cgccgtagac caagcttttg cgatggggga ttttacagct 660 gatctatttt gctgcattgc gatcatcaac tctctcgaag attttcccca tatccgctcc 720 agcattctat gtgatttacg cgcttcgaca aaggacaaag aatacacatc aaaggatatt 780 cgtcactacc tagagagcga agaaaccctc cattccgcga caaaatcatt ctccacatcc 840 gacgtcgctc ttgccatcca cacgaagtcc tatagcactc ccatttgcag taactgcaaa 900 cgtgagggcc acacggattt atactgcatc tccacaggag ggggaatggc aggcaaaaca 960 atccaagaat ccaaagacgc acgtaagagg gatcgcgaaa aagctcgagg aaataccaat 1020 aacgctataa agtccaacag caacactgga aaaattgccg ttaacgtgaa ggactccaca 1080 ggcaaagcct tcattatcca cgttgaccca tccgacatct ctgctcccag tagctcaaag 1140 actgagttcg ccggcctagc atcggacctt ccggagtcta tactccctga ctccttggag 1200 aacatagaat ggagtggttg gttagcgctt gaggaggagc ctacaacctc actggactgg 1260 agaaagcaca ctaaacccgt cgacatcgct gctatctctg aaatttctcc actccaacaa 1320 aacaagcgca cccccatctc tcttgacgat cttcccttct acgtcgacac gggagcaact 1380 gttcacatat ccccagaaca aggcgacttc ctcacgcttc gtccaatcgc agctcgatcc 1440 gttaaaggcg taggtggatc atccgttact gccatcgggc taggtaatat caaacttcac 1500 attgcgcgcg gagcgcatat aatcttggaa aatgttttat tcattcctaa tgcagcagtg 1560 tggcttatct cggttagcac acttgcgcgc gatagccaag ccgtcgctca ttttgacgag 1620 gctacctgtt ggatcaccaa caaatcttcc ggtgcgacaa ttgcccatgg cacgctcctt 1680 cctaccaaaa acctctattc cctcaacctt ctatctcccc atgccgaaca cgcctttact 1740 atatctcacg cgcctgatct cgaaacatgg catcgttgtc ttggccatgc aaactaccaa 1800 gctgtacgag aaatggcgaa gaacggctta atcccaggta tgcctgccag ctttcccccc 1860 acaaatccac ctaaatgcga gttttgcgtg cttggtaaac agacaaagac accagtacca 1920 aagacacgca aggaggggcc ggggcataag gctacaaggg tgttggagaa ggtgtgggtt 1980 gatttatcag gacagcacct gagatctcgc gcaggaaatg agtatgtcat ggatatcgta 2040 gatgactaca caagtcaatt gtggtcaatt cccttaaaaa ataaggatga tgcgttccca 2100 gaactaaagg catgggagtt agcgagggaa tctgaaacca gtcagaaaat tggtacctac 2160 atcaccgatc aaggggagct aaagagcgac aaaatgaacg aatggttgaa gtcccgtggt 2220 
atcgaacagc gcttcacagc gccctacacc tctgctcaca taggacgtgt agaacgaatg 2280 catcagaccc ttatggccaa agcccggact atgcgcattt acgctgactg ccctccctat 2340 ttatgggatg aattttatct gacagcggtg catcttcatt ccaaaactct cacacgttca 2400 cttccaggag gtataacacc ttgggaaaaa taccacaggc gaaaactgga ctactcttac 2460 atgcgtgaga taggatgtcg agtcttcgtt ctcatacaaa acaaacacaa cccaaaagtc 2520 ttcgaccgtt ccattgaatg caccttgata ggctatgatc caaatgcaaa aacttacaga 2580 tgttaccata gggaatctaa aagagtaata agctcttatc acgtacgctt tctagaaagc 2640 catgatggac acatgcgttc cctccctgca accgaggata taccattgac gctaaacgaa 2700 atccttcaaa atgctacacc cactcctatc ttgtcagatg acgaagagga agatgtctta 2760 ccagatgatc tctcacaaat caataatacc caagatcccc aacacgtggc caatccaaca 2820 gatgatgctc agatcaggca ctcatcacgt atcgccgaga agaccaacaa acccaaaccg 2880 acccgcactg aaattgccat ccaacaatcc atagacactg gaattcgact tcgtgaaact 2940 cgagctgaac gcaagaaaac cctccaagac atccaggagg aagaaatgcg caactctcca 3000 gaaatcgtcg aaaacgctgc catcgcggag ttaagggaga tatttgggac cctaaatctc 3060 agtgatgttg agggtgatcg cattgatcgg gccctatccg ctatatccga actacctgaa 3120 attgatccct cgaccctgga attcgaggac gagccaagga cttggaatga ggcaaaaaca 3180 tcagcagaca ctaaacgatg ggaagaaggg tactgtgacg aactcaagtc actcaaggat 3240 atgggtgtgt acaagctaat tcccagatca gacgtcccac aaggaaagaa aattaggaag 3300 ggaatgccgg tctttagaat caaacgcaac gagatcggaa aagcgatgcg ttggaaggtt 3360 cgtctcgtat tcaaaggttg cgagcagatc tacggtaaag actatactaa aacgacctca 3420 cctaccatgc gcatggagtc ttggcgcata ctccttcata tagccgcgac actcgattga 3480 gatgctcaac aaattgacat caagacagcc ttcctatatg gattacttcc tgaaggggag 3540 taccagtaca tggaacaacc ccaagagttt gaagaacctg gtaaggaaga ctggatatgg 3600 gaaatccagc ggggcctata cgggatgaag caaagtggcc gtatttggaa cataacgatg 3660 aatgagaaaa tgatctcatg gggcttcacc cgtctgtctt gcgaatcatg catctactat 3720 cgtaaaactg acacaggcac caccatctgc gctgtacatg tcgacgactt cctgtccatt 3780 gcaagcaaca aagacgaaaa tgagagattc aaaaacaaga tgcgtgaagc atggacaata 3840 tctgatctcg gtaatgttcg cttcgtcgtg ggcatagcag tagactggga tagacctaac 3900 aaaacggtca 
tgctctccca aacagcctta atcaacaaaa ttgtagctca attcggacag 3960 tgaaacgcct caccatccac attaccaatg gatcccggat taaaattgcg atgcgccaac 4020 tacaaaaata tgtcaaaggt cgagttggat gatatcaaaa aactcccata ccgctcactt 4080 gtcggatgtt tgctttacct gtccatcggc acgcgtcccg atatcacata ctccatccaa 4140 cagctctctc aatatcttga ctgctattca tatgctcact ggaatgcagc aatccgtgtg 4200 gtacgttatt tatcgggcac gcgaaatctc aaactacgtc tcggcggaac aaaccaaata 4260 tccttactag gttttaccga ctcagactgg gccaactgtc ttgacacacg gagaagtgta 4320 ggtggtcacg catattcact aggctcgggc gtcgtatcct ggcaagcgag aaaacagaaa 4380 accatagccg cctcatcctg tgaagcagaa tatactgcag ctttcgaagc ttcaaaagaa 4440 ggcatttggc tacgcacact cctcaacagc ataaatcaca caaccaccaa acccaccaca 4500 atctgctgcg acaacaacgc cgccataaat ctatcagaag acccagccct acatgaccgc 4560 ataaaacaca ttgatatcaa acatcacttc ctccgcgagc gcattcaatc caacgaaata 4620 tccctctcct acataaatac caatgacaat attgccgaca tattcactaa ggctctcgat 4680 ataaagaaat tcaatcgctt ccgcgggttc ctaggactca gctaattcac caagactcca 4740 cgcgaggagg agtattcaca gtgaggagga g 4771 // ID Copia-3_LBS-LTR repbase; DNA; FNG; 226 BP. XX AC ABFE01000651; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-3_LBS_; KW Copia-3_LBS-I; Copia-3_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-226 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000651; Positions 139963 140188. 
XX SQ Sequence 226 BP; 70 A; 43 C; 42 G; 71 T; 0 other; tggaataaca cctggtatat ggtcaagtat tccatgcatc ttgaggactt tgtagataga 60 tcgagattga tggattaata agtccgagaa catcaccgaa caattcgcac gtatctagta 120 attagtagta tattctagta gttacatttt gtagtatata agggcgacaa agctgccctc 180 gacatgatta ttatctttat cccctactac cactgcgcta tataca 226 // ID LTR1_AO repbase; DNA; FNG; 469 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE solo LTR of an unknown LTR retrotransposons- a consensus DE sequence. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW LTR1_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-469 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-469 RA Kapitonov V.V. and Jurka J.; RT "LTR1_AO, a family of LTR transposons in the Aspergillus oryzae RT genome."; RL Repbase Reports 6(1), 21-21 (2006). XX DR [2] (Consensus) XX CC It is a solo long terminal repeat of an unknown LTR CC retrotransposon. 
XX SQ Sequence 469 BP; 203 A; 11 C; 17 G; 211 T; 27 other; tataagyrct ytaytratat aytattatct atarataata ratctataaa gtatagttar 60 ttaatagact ttagaaatat agtaytagaa gayttaarat tatttagaat tatataataa 120 tttttttatc tagtataata raataaatta tttaaaaatt agaattctct atataattyt 180 aatataaata tatttatata ttatataagy tagtagtcta ttagtrrytt rtttatatag 240 aatytagtta tttaarataa ttactattat aactatattt tttttaaaaa awtaaantat 300 tttaatttat anttaaattt taaaatattt aatattttat tttaatanat tatataatta 360 taaaaatwta ttttatttat aatatttaat aatanaatta taaatatcta atatattatt 420 ttataataat tttaaataat taaaatatta ataaaatatt tttaaaata 469 // ID Gypsy-51_MLP-LTR repbase; DNA; FNG; 1619 BP. XX AC AECX01002217; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-51_MLP_; KW Gypsy-51_MLP-I; Gypsy-51_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-1619 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002217; Positions 36821 35203. 
XX SQ Sequence 1619 BP; 549 A; 220 C; 306 G; 544 T; 0 other; tgtggtgatc tatttatgat attaagcaaa aaaattaatc atctttactt taccagttta 60 ttagcaaact tcaatatagc agagaattcg aaattcaggt tttcagatca attgaagctc 120 ttagaggctt atgtatactt attagctcat tattatgtca tcttatttca aaattattat 180 taagatcaac atagtttact tacaaagtta ttacaaaaag acagaaaaag aaaagataag 240 aaaaattcga aaatataggt gtaaaagttg gaaaatggtt tttagcttat ttacaattaa 300 gttatgggtt ttataaggtt agaatatgaa agaacaacag tttattatac aggaacaatt 360 ttaatattca aggttgataa ggggaaacag aggttattag cagaaaggta ctttttcgaa 420 gttggtaaac aagggttagg cgtgtagttt acatatgtaa tggtggtagt gatggaattt 480 aaggttgcca agaggatttt ccgggatttt taaggttata gaagatgatg tcatattaca 540 tgagcaagga aaagggagta gcagctgtga aaagtgaaga aaataggtga aaatcgtcga 600 aaatggaggt tttcagctaa ttacatatgt caaatgacgt cagaataagt tggtagatgg 660 ctcattcaac gcagaagagt gtattacata atgtacaacc tttgaatcat tagtaacgga 720 ggtttatgta aaaagttata agggaaataa gaaattggtg tgaatctcga atttacagtc 780 acaagacaca tgtaacacca aaaaccttaa cttcacaact caatatctta ctcatcactc 840 atcagaagta ggtcttatag gtgtcagaaa tgatcagtaa cacgtgggct acaagatgca 900 ctttagtaat caaaaattgg agcacatctg aaaaaattcg taatctcaaa aattgggttt 960 cagctagttt acaaggaaaa aatcacctgt attcatctta agaagtgctg tacattacat 1020 cattgcgtca ttgatcactt ttggctgtga aaattacatc atacattagt aacaagatca 1080 ttacattgcg tcattacatg gtttttaggg ttatttaggc ggatttgatt ggtttttgag 1140 gctgttgact gttgtaaaga ttgttacatg tggatataag tactccttga tgattttggg 1200 attgtagcac ccttttctgc atcttatacc tttgttgtag tgttataaat catcatttat 1260 ttactatatt tgtactaaga tcacttaaat ccgccccttc aaaagcaagg agaaatttat 1320 ttacgtgtgt taaagactac aagtctaatt gggaataata cttacaacag tattgctagt 1380 aataaacata gtgtgtgact tttttgcgct taaaatctgg gaataatact taacaacagt 1440 attgctttat ttctctggtg ccttttatcc ttttctctgt tatcacaaag tggtcagcaa 1500 gtgcccttgt gctggtttct ttgaagaagg cagtctttag gcaacttcta ctatattaca 1560 catagtagat tatcccttct taggtgttgt aaagctcttg ttgagcttta aatactaca 1619 // ID Copia-1_PAD-LTR repbase; DNA; 
FNG; 279 BP. XX AC AEOI01000003; XX DT 24-FEB-2011 (Rel. 16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Pichia angusta genome: long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_PAD_; KW Copia-1_PAD-I; Copia-1_PAD-LTR. XX OS Pichia angusta DL-1 OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Pichia; OC Pichia angusta. XX RN [1] RP 1-279 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Pichia angusta genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; AEOI01000003; Positions 920581 920303. XX SQ Sequence 279 BP; 101 A; 45 C; 52 G; 81 T; 0 other; tgagataaga tagtcataga ttatgtcatg gctgtgcggc ttatcctgat atgaaggttt 60 atcggtctta gcacatgcat aatgtgagca cattctcaga ttatgcaagt ctgaatcaca 120 tatgagacta accaaataat atataagggt gtgtcatatc ccacttataa gtgagggact 180 acggtatacc aaaagtatta aatatctgat acatagtcac actattgaag gaagaataat 240 gactgtagac attacactta tcaaaacaaa gtttcaaca 279 // ID Copia-51_MLP-I repbase; DNA; FNG; 4567 BP. XX AC AECX01002953; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-51_MLP_; KW Copia-51_MLP-LTR; Copia-51_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4567 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01002953; Positions 5402 9968. XX CC Positions [1858-2370] - Integrase core CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 100..4554 FT /product="Copia-51_MLP-I_1p" FT /translation="MSSSDSPREREQSAPSPAASESSDTSTITSSSTVHPA FT DDTVSNVFSLSQLIPGTPLTMTTPTTSEDRIKIPELTDGNFPHWNKRFHFA FT LQTRGLLKFLTEDSPPDNAEGLALYRRQQGRVMEIIVNSLNAANDSLIELT FT DTPKIAYEKLAKAHGSSGGVLAAATICEIATARYESGQPLSDFITRIRTLH FT NQLAQYATEDKEIALSSKLLAIFLLNGLGKEFEFITAPFFADLSKLTVQSV FT MDRLVLESAKQSSAGQVSSTATAFSAKSSTTATSSTPANRRIGPGLNDLCN FT LPNHRGLPHTNKNCHAQNGQQRIRPKHPTSTSSNTLSESEMAQRYLKIEAA FT HQAKKNPLPQPSVSNAPTYAAVTTSNQFSALELADDFPDEGFAAQAYVASN FT EDIAYGVILADTAATRNICHNKDMFLHMKPISPVKITGISGEPQFAHHIGS FT IIIPGFSEDSHQPMDILIPDVLYVPTMVVNLLSLSQLCSNGASFSGSDTSI FT TVVGLSGNGYFVCRKTVDERLWQCKVRLLSSDFVFSASAERWHHRLGHLHY FT DGIRRLASDGSIKVSSSLSSAARSPCITCQQAKISRLPFTSHFPISSTILN FT RIHSDVVGPFPSSLGGAKYLVSFIDDCSRFASIFPIKNKSDVFDCFVRFKT FT RVELLLDSKIKFLHTDRGGEYQSSNFLSFLSQHGIALEQGPAETPEQNSIS FT ERYNRSLVERIRCNLHYTSLPTSLWAEVALATAFTLNQSPHTFLHYQSPLS FT VWNSFIRDTGIHGPDHSFLRTIGCSAIYLAPRISGKLALKGREGVLIGYEA FT GAKAYRIWDLGYKKVVVTRSVIFNEDIFPFSSLPSTTNCPSFIILEDSGYD FT IPDNVTYNAPVLPITPRVAHDSQISSPITGPILKPLLLSSVSPASSPPSIS FT RRNRSIAIPGNSCIVEGSPNSSNSATTKGKVDRPARATSKPQRFGNFIGHA FT GTDSNDEPTYKQAMDSPDADEWKKAMRVEFDSLQHHNVGKLVARPIGARVI FT GGMWVLKKKRDENGAVLKYKARWVCFGNRQVEGVDFHDTYSAVGKSDTFRI FT LVSIAAYLKCSVVQFDIMTAFLHGLIKERVYIQQVKGFVEPGCEGMVIELY FT KSLYGTRQGARDFSDDLRVKLLAFGFKTSKADDCLFIYRRKSHFIYLHMHV FT DDGFIVSNSNPLIEEFRLHMLKSYELKWKTKPNLHLGMRLNYHNDGAISID FT QTHYLQDLLERFGMEDLNPVKLPFPVGLKLVHGSPKEVSAAAFLPYQSLIG FT SLNWAAISTRPDIAYAVSQLSRFNSCYTFAHWNAAKHLVRYLKGSISQGIM FT FRGSLPSDLKGYGDADYANDPLDRRSVTGYLFTFGDAIVSWRSRRQNSTAL FT STTEAEYMAISDCARHALWFKALFHDLKLHVSAVSVSSVGEAIQLFNDNRG FT TVLLSKEPVINNRSKHIDVRYHFIRDHVRLKNIVTAHVPTKSMPADFLTKP FT LPIDAFRCCCDQVSSVECSS" XX SQ Sequence 4567 BP; 1170 A; 1023 C; 924 G; 1450 T; 0 other; ggttatgagc ctttttaccg agtaatcgct taattgttcg acacgtcatc gattagtgca 60 gatccttctt gattaatttt tcctcctcat cacagacaga tgtcttcatc agattctcct 120 cgggagcgcg aacagtctgc accttctcca gctgcttctg agtcttccga tacttcaact 180 attacttcgt cttccactgt 
tcatccagcg gacgacactg tttccaatgt tttctctctg 240 tcgcaactga ttcctggcac tccgttaaca atgactactc caactacttc cgaggatcgt 300 atcaagattc ctgagctgac tgacggcaat tttccgcatt ggaacaaacg tttccacttt 360 gcgttgcaaa cacgtggtct tctcaagttc cttactgagg actcgcctcc cgataatgcg 420 gagggcttgg ctttgtatcg acgtcagcaa ggtcgagtta tggaaatcat tgttaattca 480 ttaaatgctg cgaacgattc tctcatagaa ctaaccgaca ctccgaagat agcttacgaa 540 aagctcgcta aagctcacgg aagcagtggt ggtgtacttg ctgctgccac aatttgtgaa 600 atagcgactg ctcgatatga atctggtcaa cctttatctg acttcattac tcgtattcgg 660 actcttcata atcaacttgc tcagtacgca actgaggata aagaaattgc tctatcttcc 720 aagttacttg cgatctttct attgaacggt ttagggaagg aattcgaatt cataacggct 780 cctttctttg cagatctgtc taaactgact gtacagtctg tcatggatcg acttgtgctg 840 gaatctgcta agcagtcctc tgctggtcaa gtctcatcta cggctacggc ttttagtgct 900 aaatcctcga ccacggccac gtcctcaact cctgctaatc gacgtattgg acctggtttg 960 aacgacttgt gtaacttgcc taaccatcga ggacttcctc atactaacaa aaactgccac 1020 gctcagaacg gccaacaacg gattcgtccc aagcatccaa cctctacttc ttcgaacacc 1080 ttgtcagagt ctgagatggc tcaacgatac ttgaagatag aggctgctca tcaagctaag 1140 aagaatcctt tgccacaacc ttctgtcagt aatgcaccta catacgccgc ggtgactact 1200 tcaaaccaat tcagtgctct cgaactcgcc gatgattttc ctgatgaagg ttttgcagct 1260 caggcttatg ttgcgtctaa tgaggatata gcttatggtg taattttggc cgataccgcc 1320 gcaactagga acatatgtca caacaaagac atgtttctcc atatgaagcc gatttcgcct 1380 gtcaaaatta ctggtatttc aggagagcct caatttgctc accatatcgg atcaatcatc 1440 atccctggtt tctctgaaga cagtcatcaa cctatggata tcttaatacc tgatgtactt 1500 tatgtgccga ctatggtggt taatttatta tcacttagtc aactttgttc aaatggagca 1560 tctttttcgg ggtctgatac tagcatcact gttgtaggtt tgagtgggaa tgggtatttt 1620 gtctgtagaa agacggtcga cgagcgatta tggcaatgca aggttcgcct tttatcatca 1680 gattttgtgt tttctgcctc agctgaacga tggcatcatc gtttagggca tcttcactac 1740 gatggaataa gaaggcttgc ttcagatgga tccatcaaag tttcttcttc attatcttca 1800 gctgctcgtt ctccttgtat tacttgtcaa caggcaaaaa tatctaggct tccttttact 1860 tctcattttc ctatttcatc aactatccta aatcgtattc 
attccgatgt agttggtccg 1920 tttccttcat ctttaggtgg tgcgaaatat ctggtctctt ttatagatga ttgttctcgt 1980 ttcgcttcta tttttcctat caagaataag tcagatgtgt ttgattgttt tgtgcgtttt 2040 aaaactcgcg tagagcttct tcttgattcc aaaatcaaat tcctacatac agataggggg 2100 ggcgaatatc aatcttcaaa ttttttgtcc tttttaagtc aacatggaat agccttggag 2160 cagggaccag cagaaacgcc cgaacaaaat tctatctcag aaaggtataa tcgaagtcta 2220 gttgaaagaa taagatgtaa cttacattac acatcattac ctaccagtct ttgggcagaa 2280 gttgctttgg ccactgcctt cactctcaat cagtctcctc acacatttct tcattatcaa 2340 tcacctcttt cggtctggaa ttcgtttatc cgcgatacag gaatacacgg accagatcac 2400 tcgttcctaa gaaccattgg atgctctgct atttatctgg caccgcggat ctcaggcaaa 2460 ttagcactaa aaggaaggga aggtgtacta ataggatatg aagctggtgc taaagcgtat 2520 cgcatatggg atcttggata taaaaaggtt gtagttactc gttctgtcat cttcaacgag 2580 gacatttttc ctttttcctc attaccttcc acaacgaatt gtccttcttt catcatccta 2640 gaggattctg gttatgatat tccagataat gttacatata atgctcctgt attaccgatt 2700 actcctaggg ttgcacatga ttctcaaatt tctagtccta ttaccggtcc aattcttaag 2760 cctctgttat tatcttctgt ttctcctgct tcatcccctc catcaatttc tcgaagaaat 2820 cgttctatcg caatcccagg aaattcctgt attgtcgaag gctcacctaa ctcttccaat 2880 tccgccacga cgaaaggaaa agtggatcgt cctgcaagag caacaagtaa gcctcaacga 2940 tttggaaatt ttattggtca tgcaggtaca gattccaatg atgaacctac ctacaaacag 3000 gcaatggatt cacctgatgc ggacgaatgg aagaaggcca tgcgagtcga atttgattct 3060 ttacaacatc ataatgttgg taaattggtt gctagaccta ttggagctcg tgttatcggt 3120 ggtatgtggg tattgaagaa gaaacgtgat gaaaatggtg ctgttcttaa gtataaagct 3180 cggtgggtgt gttttgggaa tcgtcaggtt gaaggggtgg attttcatga tacgtattca 3240 gctgtgggca aatccgatac tttcagaatc cttgtttcga tagctgctta ccttaagtgt 3300 tctgtggttc aatttgacat tatgacagca ttcttacacg gtcttattaa ggaacgggtt 3360 tatattcaac aagtgaaggg tttcgtggaa cctggttgtg aaggtatggt cattgaatta 3420 tataagtctt tatacggtac acgccaggga gccagggatt ttagtgatga tctacgtgtc 3480 aagttgctag ctttcggttt caaaacctct aaggccgatg attgtttatt catctatcga 3540 cgaaaatctc atttcattta tcttcacatg cacgttgatg atggttttat 
tgtctcaaac 3600 tccaatcctt taattgaaga atttagactc cacatgctta aatcttatga attgaaatgg 3660 aagaccaaac ctaacttaca ccttggtatg cgattgaatt atcacaatga cggagcaatc 3720 tctatcgatc agactcatta tcttcaagac cttttggaac gttttggtat ggaagacttg 3780 aatccagtga aacttccttt tcctgtgggt ctaaagctgg ttcacggatc gcccaaagag 3840 gtatcggctg ctgctttctt accttatcaa tctcttatcg gctcactcaa ttgggcagct 3900 atcagcacca ggccggatat tgcttatgcc gttagccagt tgtctcgatt caattcatgt 3960 tacacgtttg cgcactggaa cgcagcaaaa cacctggttc gatatttaaa gggttcaatt 4020 tctcaaggca tcatgtttcg cggcagctta ccttccgacc ttaagggata cggtgatgca 4080 gactatgcga atgatccgtt ggatcgccga tcagttaccg gatacttatt caccttcggc 4140 gatgctattg tttcctggag aagtcgacga caaaactcaa ctgctttatc gacaactgag 4200 gccgagtaca tggctatatc cgactgtgct cgtcatgcat tatggttcaa ggcgttattt 4260 catgatttaa aactacatgt atcggctgtt tctgtttcct ctgtgggtga agctatacaa 4320 ctcttcaatg ataatcgagg aacggttttg ttatcaaaag aacctgttat caacaataga 4380 tccaagcaca ttgatgttag gtaccacttt attcgtgacc acgttcgtct caagaacatc 4440 gttacggctc acgtccctac taagtctatg cctgccgact tcctcaccaa gccgcttccg 4500 attgatgcct ttcgatgttg ttgtgatcag gtttctagtg tagagtgttc gagttagggg 4560 ggaatat 4567 // ID Mariner-7_AF repbase; DNA; FNG; 1825 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of Mariner DNA transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-7_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-1825 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-1825 RA Kapitonov V.V. 
and Jurka J.; RT "Mariner-7_AF, a family of Mariner DNA transposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 104-104 (2006). XX DR [2] (Consensus) XX CC This is a DNA transposon from the Mariner superfamily (Pogo CC clade). It is characterized by the TA TSDs and 23-bp TIRs. The CC genome harbors ~20 copies. The 518-aa Mariner-7_AFp transposase CC (pos. 181-1734) is 35% identical to the S. pombe ARS binding CC protein 1 (GenBank NP_596460). XX FH Key Location/Qualifiers FT CDS 181..1734 FT /product="Mariner-7_AFp" FT /translation="MPQRKSISTSQKVALRARKRLYPKATQKELREWFKDT FT YDHTLSSGLISDILSRKYDYLDADSLPSRDAKRQRRENWPELENALFKWII FT RVEGQISISAEIIQQKAQFFWEKIYPGQEMPTFSNGWLHNFQARRSVRWHR FT QHGEEGSVSAQADQEMLEIKQVLSAYSLKDQFNCDETGLFWKKTPARSLST FT RQLPGRKKDKARITALFCCNVDGSEKLLLWFIGTAKNPRAFQAAGVNIRNL FT NLIWRSNQKAWITTPIFTEFLRWFNRQMSGRNVVLLMDNFSAHQAAVAEIQ FT SSGYLLQNTLIIWLPANSTSRYQPLDQGIIFCWKSYWKRYWIHFILQEFEL FT NHDLIALMNILRAIRWGIQSWEFDLSGQVIQNCFRKALYSQLLYQEPVDPA FT VLDDIEKAFSLLKVSTPIQDLMDINTFLNPAEEAIQDTPEDIKSQILAQYG FT PELDDDSEGELEILLQISLDEALEALRKLCLYEEQQAEGIPSLIYKLDKHE FT CILLGRKLSLQTQRDIRSYFTG" XX SQ Sequence 1825 BP; 540 A; 384 C; 382 G; 519 T; 0 other; cagtaaaacc tcgttataac gagtctcgat ataacgagaa cctcgatata acgagcgcaa 60 ccccttgcct caattatttt cccatataaa taacgagatc cctcgctaca gcgagtctcg 120 ttaattttcc ccagcttccc atacaaaatc tcgttatatc aaggggctcc cttagctaat 180 atgccacaaa ggaagtctat ctcaacctcc cagaaggttg ctctacgtgc tcgaaaacgc 240 ctctatccaa aggcaacaca gaaagagctt cgagaatggt ttaaggatac ctacgatcat 300 actctttcat ctggtcttat ctctgatatt ttatcccgca aatacgatta tcttgatgcc 360 gattccctcc cttcccgcga tgcaaagcgc caacgccgtg agaactggcc agagctagag 420 aacgccttat ttaagtggat tatacgtgtg gagggacaaa tatctatctc tgcagagatt 480 atacaacaaa aggctcagtt tttctgggag aagatctacc cagggcaaga gatgccgact 540 ttcagtaatg gctggcttca taactttcaa gcaaggaggt cagtaagatg gcatcgacaa 600 catggagaag agggaagtgt ttctgctcag gcagatcagg agatgcttga gattaaacag 660 gtcttaagtg cttatagcct taaggaccag tttaattgtg 
atgaaactgg tctcttttgg 720 aagaagactc ctgcgcgaag tctttcaaca cgtcaacttc caggtcgaaa gaaagataaa 780 gcacggataa ctgccctctt ttgctgtaat gtagatggct ctgagaagct actactatgg 840 tttattggta cagcaaagaa tccacgagct ttccaagctg ctggtgttaa tatacggaat 900 ctgaatctta tatggcggag taatcagaaa gcctggatta caacaccaat ctttacagaa 960 ttccttcgtt ggtttaatag acaaatgagt ggtcgaaatg ttgtccttct tatggataac 1020 ttctctgctc atcaagctgc cgtggcagag attcaatcta gtggttatct actacagaat 1080 actcttatta tctggcttcc tgcgaactct acaagccgat atcagcctct tgatcaaggg 1140 attatcttct gctggaagtc atactggaaa cgatactgga ttcactttat ccttcaagaa 1200 tttgagctta atcatgatct aattgcctta atgaatattc taagagcaat tcgatgggga 1260 atccagagct gggaatttga tctttctgga caggtaatcc agaactgctt tagaaaggct 1320 ctatattctc aactattata tcaagagcct gttgatccag cagttctgga tgatatagag 1380 aaagcattct ctcttcttaa ggtctctaca cctatccagg acctaatgga tattaataca 1440 ttccttaacc cagcagagga agctatccag gatactcctg aagatattaa gagccagatt 1500 ctagctcagt atgggccaga gcttgatgat gattctgagg gggagcttga gattctacta 1560 caaatatcac ttgatgaggc tctggaagca ttaagaaagc tttgtttata tgaggagcag 1620 caggcagagg gtattcctag ccttatatat aagctagata agcatgagtg tatccttttg 1680 ggtcgaaagc taagcctaca gactcagcgg gatattcgaa gctattttac aggctaaatt 1740 atatatatcc ttgatataac gagaacctcc ctataacgag aaaattggtc tgtctggaat 1800 atctcgttat aacgaggttt tactg 1825 // ID Copia-4_MLP-LTR repbase; DNA; FNG; 574 BP. XX AC AECX01002163; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_MLP_; KW Copia-4_MLP-I; Copia-4_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-574 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002163; Positions 9548 10121. XX SQ Sequence 574 BP; 144 A; 96 C; 101 G; 233 T; 0 other; tgttaaagtt ttcaatctcg ttatgtattt taacttaatt gtgttttatt tcatatacca 60 tttgttaatt cacatgtagc gtttaagtgt gttatgttat gttttacaac tgtttgtcat 120 cgttattaac ctaatctttg ttttcaactg gtattaatgt tcaacctgga agagcagcga 180 agcaagtctc cttcttcatc caagatattc ttctacattc ttgggtgaaa ggtgagtact 240 caattcgtgg gattttgatt taggttcatt agatctgttc tgaaccataa tcatccctgc 300 tctgacaggt atgtagttca atcatgtttc attcatatct tgttcttttc tttattattg 360 tgatggaaat aaactgatcc aagatattct tctacattct tgggtgaaag gtgagtactc 420 aattcgtggg attttgattt aggttcatta gatctgttct gaaccataat catccctgct 480 ctgacaggtg agtactcaat tcgtgggatt ttgatttagg ttcattagat ctgttctgaa 540 ccataatcat ccctgctctg acagatttgt aaca 574 // ID Gypsy-53_MLP-LTR repbase; DNA; FNG; 168 BP. XX AC AECX01002318; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-53_MLP_; KW Gypsy-53_MLP-I; Gypsy-53_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-168 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002318; Positions 3549 3382. XX SQ Sequence 168 BP; 45 A; 43 C; 26 G; 54 T; 0 other; tgtaataacc cttatagcgg gttatgctaa tactgagata gaacccatct agacatttgc 60 gctcttgttg tattagtctt ttcctcacac tgcaatccag ttaatcaatc ctgggatcat 120 cacatcttag ctccaagtta tccctcgtta cgcactcagg tcataaca 168 // ID Copia-1_BFB-LTR repbase; DNA; FNG; 163 BP. XX AC AAID01001576; XX DT 25-FEB-2011 (Rel. 
16.02, Created) DT 25-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Botryotinia fuckeliana genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_BFB_; KW Copia-1_BFB-I; Copia-1_BFB-LTR. XX OS Botryotinia fuckeliana OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Helotiales; Sclerotiniaceae; Botryotinia. XX RN [1] RP 1-163 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Botryotinia fuckeliana genome."; RL Direct Submission to RU (25-FEB-2011). XX DR Genome; AAID01001576; Positions 1565 1727. XX SQ Sequence 163 BP; 43 A; 42 C; 34 G; 44 T; 0 other; tgtcgggata gggatcatgt gaccgtgcta gcgcttgcgc tagcgccttt cggcactaca 60 gcaatacgtt tcgtagtcct agcttttaga accctaagca atattacaac agtgtgaacc 120 atagtactat attgacttca ccatccccat ggtgatacca aca 163 // ID Gypsy-64_MLP-LTR repbase; DNA; FNG; 190 BP. XX AC AECX01002907; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-64_MLP_; KW Gypsy-64_MLP-I; Gypsy-64_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-190 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002907; Positions 2573 2384. XX SQ Sequence 190 BP; 46 A; 62 C; 36 G; 46 T; 0 other; tgttatgatc tctctgacac gggattacag ggatgtcacg atcctgtgac aggcaagact 60 cagagccgca cttgtaccag ctctcgagcc atccatgctt tgctccctat gccacaataa 120 ctatcatacg tggatcactc tctctccctt tgccccaaga accatcaccc cctgaagccg 180 ggtcataaca 190 // ID Harbinger2-3_TMe repbase; DNA; FNG; 3060 BP. XX AC . XX DT 13-AUG-2010 (Rel. 15.09, Created) DT 13-AUG-2010 (Rel. 
15.09, Last updated, Version 1) XX DE A family of autonomous Harbinger transposons - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW Interspersed repeat; Harbinger2; Harbinger2-3_TMe. XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-3060 RA Kapitonov V.V. and Jurka J.; RT "Harbinger2, a novel clade of Harbinger transposons in protozoan, RT fungi, choanoflagellate, and metazoans."; RL Repbase Reports 10(9), 1225-1225 (2010). XX DR [1] (Consensus) XX CC Harbinger2-3_TMe belongs to a novel clade, Harbinger2, of CC Harbinger DNA transposons. This clade includes transposons CC present in protozoan (brown alga), fungi, choanoflagellate, and CC metazoans. CC Harbinger2-3_TMe is a consensus sequence of a family of CC autonomous Harbinger transposons that were active in the Tuber CC melanosporum genome recently. The consensus was derived from 6 CC full-size copies ~98% identical to it. These copies are flanked CC by unusual (for Harbingers) AYT target site duplications. This CC transposon codes for the 281-aa TPase. XX FH Key Location/Qualifiers FT CDS 530..1372 FT /product="Harbinger2-3_TMe_1p" FT /note="Harbinger TPase." 
FT /translation="MLLARLAYPNRLSDLALKFGWPVERISRISSTVQSMI FT HDRWKHLLEWDAARLTPDKLAQYASVIARKGAPIRTVWGFIDGTIRGIARP FT SRRQRTCYNGWKHKHCLKYHAIITPDGLISHLFGPVDGRRNDAFLWRESNL FT PAILERYAHMPDGTPLQLYGDPAYSISNRLLSPYQGARISHNQRLWNRSMS FT RVRIVAEWAFKEMVNMFGFLDYVKGQKHLLQPVGVQFRVAALLHNAHVCLH FT RPQVTQYFNVIQENNGGNEQEEFIEEELLEPPLLLEYFHN" XX SQ Sequence 3060 BP; 824 A; 694 C; 633 G; 909 T; 0 other; agagggtttg cgcacgggga cctaggtggc ggttaacaga aaatagacct aggtggaaaa 60 ataagtttgc gcaacaggtc taggtggtga ttatagatga cggcagaatg aacacatagt 120 cttggaatga tatatctaaa aagtgtaaac aacaacccac aattgacatc aagcaactcg 180 aacatactca agaatgctag cctcacgttt ccaatacaag cgaaatgcac gacgacttga 240 agccttatta cgcctgagga cagcagttac acgaagtaac attcggaaga gaggagcgct 300 tcgagaagga ggacgaattg acttggatat gctggaagag caggagtggc aaaatctatt 360 ttggtaggtt tagagatcat gattgtacat taaaataaaa gggaaggaga agacttacaa 420 ataaagtttt acgagacgtg aaattgaaca gctggtagag gccttacagc taccagaaac 480 catcagagcc gacaaccgta ctattgagaa tcgccgaact gcgctttgta tgctacttgc 540 acgattagca taccccaatc gtctgagtga cctggctttg aaatttggct ggccagttga 600 gcgcatatct cgtattagca gcacagtaca gtcaatgatt catgataggt ggaaacactt 660 acttgagtgg gatgctgcac gattgacccc cgataagtta gctcagtatg catctgttat 720 agcgaggaaa ggtgcaccga tcagaacagt ttggggcttt atcgatggca cgattcgagg 780 aattgcacgc ccttctcgcc gacaacgtac ctgttataat ggctggaagc acaagcactg 840 cttgaagtac catgccatta ttactcctga tggtttgatt agccatctct ttggcccagt 900 agacggccga agaaatgatg catttctgtg gcgggagtca aacctaccag caattctgga 960 acggtatgca catatgcctg atggtacccc cctccaactc tacggtgatc cagcctatag 1020 catcagtaac cgtcttcttt caccatatca aggcgctcgg atatctcaca atcagcggct 1080 ttggaatcgt tcgatgtcac gagtaagaat tgtagctgaa tgggccttta aagagatggt 1140 aaacatgttc ggatttttgg attatgttaa aggtcaaaag catttacttc aacctgttgg 1200 agtgcagttt agagtagctg cactcttgca taatgctcat gtatgccttc ataggcctca 1260 agtcactcag tattttaatg ttatccaaga gaataatgga ggtaatgagc aggaagagtt 1320 tattgaagag gaactacttg aaccccctct attgcttgag tattttcata attaaatgta 1380 
taatttatca caatatatgt tggctatcgt agggttgatg tcgtccagcc gtcccattag 1440 cattttcacc acttcctcat gcctattgca agtggtccac aagagtggta ttcaactatt 1500 gttatttggc ttgttgggac tgacattgat ccgtctatat agtattctca ccaccccctt 1560 gtgcccatta caagtggcct acattagagg tatggtaccg gtatagtctt cctggttgat 1620 atcacaatct ccaatctcta ccaaaccagc tacaatctca acagtcccga agaagaatgc 1680 accgtgtaga gagctgaacg ggcggggttt attgaaatca agattgtaaa aatacagttc 1740 ctgagctttt aagatagttt tagtgggaaa tatgtagaaa tagaatagaa ctaatatgta 1800 aggggattgg tagaagaatg acagaattgg acgatgtatt cattcatttt tatgatatat 1860 acctcctgtc tttaccctca atattagaag gatataaaaa ttaaatttca ttatccttcc 1920 ttaatgcttc caatattaaa accaactcat ctttccttgc ctcgcgctcc aaatacgcat 1980 catgagaccg ccgttcactc tgttccctca tctcatctgt aagtcccaat attgcatgca 2040 tcaagtcttg atgcttttca tcctctcttt tttcaatttc cttcaactcg tctccatctc 2100 ttttcattcc ctccttaaac tcatataaaa cattatccac atcttttacc attacattct 2160 ttgccttttt ttttgccggc attaccccct ctccactatc cctatgccct actccgatat 2220 tctcttctct ctcccggact aaccctcaca tagaatcctc ccggactcgc tttccgtccg 2280 cctcaagttg tgcagctttc tgttgtgcag cctttcgtgt ctcgggatcg gtactggact 2340 ggtcccatcg ggttgcaagc tctaccatat tgagctcaaa ctctccgaat tcttcatcgg 2400 taccggtctt tctcaacgag atgagttcgg atttctaatt ctttccttat taacaatctc 2460 ctataagtac tttatatttc taatatacct tatatatttc aacaagtttt ttaactctct 2520 gccgacagga ctctccacta cgacatatgg gttgaggttc tatacttcct aaaattgtag 2580 aaacctctgc ccatttcgca gtagtcctcc ctctaccaca gtttatagga tctgtggcta 2640 acacttgccg taccagtgct cgatccatcc atgctgacca ttttgttgtt ctccaccccg 2700 ggcgtatagt tgcaggagaa ttgctcgatc ttgagttatc ggcggctgac tcttccacag 2760 tcgtggaact ggctgtagtg gctgactcaa aggaggaaag aaaataggta tgattggggt 2820 caggagtgga aggagactgc attttgtacg tattgggtat tcaaagttga cttgtattga 2880 tttgatacgg tatttctttc accggtgtca tggtatgaac tgatgagagt tcttcaggtc 2940 ttcgaagcca gtatttacac ttattccagg tggcgccacc cacctgggtg accctgactt 3000 ttagcttaac tcgagataac cggttttcct cccacctagg tccccttgcg caaaccctct 3060 // ID TY5 
repbase; DNA; FNG; 5376 BP. XX AC U19263; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 20-SEP-2007 (Rel. 12.1, Last updated, Version 2) XX DE Saccharomyces paradoxus retrotransposon Ty5-6p associated with DE autonomously replicating sequence, complete sequence. XX KW Copia; LTR Retrotransposon; Transposable Element; retrotransposon; KW TY5. XX NM TY5. XX OS Saccharomyces paradoxus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. XX RN [1] RP 1-5376 RA Zou S., Wright A.D. and Voytas F.D.; RT "The Saccharomyces Ty5 retrotransposon family is associated with RT origins of DNA replication at the telomeres and the silent mating RT locus HMR."; RL Proc. Natl. Acad. Sci. U.S.A 92(3), 920-924 (1995). XX RN [2] RP 1-5376 RA Zou S., Ke N., Kim M.J. and Voytas F.D.; RT "The Saccharomyces retrotransposon Ty5 integrates preferentially RT into regions of silent chromatin at the telomeres and mating RT loci."; RL Genes Dev 10(5), 634-645 (1996). XX RN [3] RP 1-5376 RA Zou S.; RT "TY5."; RL Direct Submission to Genbank (29-DEC-1994)Sige Zou, Zoology & RL Genetics, Iowa State University, 2278 Molecular Biology Building, RL Ames, IA 50011, USA. XX DR GenBank; U19263; Positions 1228 6603. 
XX SQ Sequence 5376 BP; 1707 A; 1208 C; 1006 G; 1455 T; 0 other; tgttgaatgt gataacccaa aagcatgata tgggtaatgt ttcagtactg tttcagaatt 60 gtttcagtaa tgttttagac aaggaaaaca tagagcagca aacctccgat ccgacagtac 120 ttaagaaacc atagtttctg tgtacaagag tagtacctat gtaattctta catttacata 180 acatatagaa aggtccaata aacttacaac attatgacat ataagctaga tcgtaattca 240 ctacgtcaac aggttatgag ccctgagagc aatgcttcag agaccataat taatctatct 300 aatcccaaca attataaaca gtggctgtac ggtatcgaga ccgctgctga atatgctaac 360 gaatatatga acgaattcgt tcataccgga gatatccaat caatgaaaag ggattacaat 420 ctcagcgcga atgatgaaag ctttgtcaaa accgtattta acagtttcct ggtaaagctc 480 tacaagaaaa ctatcgtggg tgaagctgca tgtgaaatga actggatatg tgatgattca 540 cttggaaggg tctctgctta tgatattttc tcgcacttcg aagaaaacta taatgaagtc 600 actattggat ccaggcttac tcttatagag gacttaccaa atatatcctc caagcctgta 660 gatgaaatcg cttccttttt gaaaacccta ttcacaatgc ttgaagacaa tagcgaagaa 720 caggacaaaa agaaaagacg cgataccaat atcgcgttgc tattaatgac cttcttaccc 780 gagttaaagg aatcattcca cgagaaattc ggtgactcta aagctcttca gctgtcacaa 840 gtcattagat tctgtaaatt aaaggcgtca tcgaattcat tatcttcagt ctcagatgca 900 ttggttgcac aagacagaag aagctatcaa aagaaaggaa ataagggatg tatgatttgt 960 ggggctgatc atcgcttaag caactgttct ctgcttaaaa gaagaatacc agaagccaga 1020 atctttaaat tatatcctaa tgacaagacg aatagatctt catctgctag tgttgcgatt 1080 cctgactatg aaacgcaagg ccaaacagca ggacagataa caccaaagtc ctggctctgt 1140 atgttatctt cgaccgtccc agctaccaaa tcctcagatt ggatttgtga cacaggatgt 1200 acttcacaca tgtgccacga ccgttctatg ttctcatcat ttactagatc ctctaagaaa 1260 gactttgtca gaggagtcgg cggttccata cccatcatgg gctccgggac tgtaaacatc 1320 ggcactgttc aattaaatga cgtatcctac gtccctgatt taccagtcaa cctaatatcc 1380 atttggaaac tatgtgctaa atccaactct tctgttacgt tcacaaaaga gggtgtcact 1440 gtgaaatcac ctgatgacgt gatttctacg gctgggaagt taaacaatta tctgtacatt 1500 ttcgatgatc ttacgcccgt aactaccttc tcttcgcaaa attacttctg ctctaaaaca 1560 ttggattcat ctaaaatgat aacttccgca gcgtttcata ccgttgcaga taaaatgttg 1620 tcgcaacaca tttctcccac tgctctcccg 
gtaaaatggc atgctcgtat gggccatccc 1680 ggagcagata tttacaattc cttggctaga actctgcgtt ttccaaaatt taagacggct 1740 gaatacacta tttgtcctac ctgctcacta gcaaaaggaa tcatcaaaaa gggtaaagtc 1800 tcgctcaaaa aatataccca acctcttcaa atggtacagg ctgatctctg tggtgggttt 1860 cgctaccaag agtttcagtc aaataaatat tttcttacta tccgtgatgc ctatagtcgc 1920 tactactctg taatacattt aaaatccaaa gcagacgctc cgataaaatt catggaatgg 1980 atcaacgaaa ccgaacaata ctttagctcc cggggtggat tcaaagtcgg atctgttcgt 2040 acagacaatg gtacagaatt cgtaaataaa aatcttcatg cgttttttaa atctaaagga 2100 atagagcatc agttaactat tccatatcat agttatcaaa atggtgctgt tgaacgtgca 2160 catcgtacca tcgaagaacg cactcgttgt ctccttatcg gggggcgtgt tcctccgtcc 2220 ttgtggtctg aagctgtttc ttgcgcagtc tatttaatca ataggtcccc tgtagtgtcc 2280 aaaaataaca gtatcccata ctgccggtgg ttcaacatcc ccgcaaaaga tttcggtatc 2340 gcacatcttc gaatttttgg atgtacagca tacgcaacct tacaacctag tcttcgagac 2400 ggcaaacttg ccccaactgt catatctggt gttatggttg gctatgactc taaccatcga 2460 ggatacagga tttatcatcc cgaaactggc cgcatctttg tgagcagtca agttcgattt 2520 gacgaacaca tgtttcctct tgctgataca gaggcagttc acgtctctca cgactttgcc 2580 acttccgcta ttgggggggt gtccaaatat cctgaaacag ggtcaaccgt ctctgctcca 2640 aagaacgacg gatctgactt ggcaaatttg ccaataactg ttcccaaaaa tgtaaatcaa 2700 ccagcacata aacctaatac cagtaacatc tcttcctctg atgatgatga ggatatttca 2760 atggaaatcg aaatggaaaa acctatccct gagtgtaacc aagacaactt accaaactcc 2820 ggatgtccac caacaaggat acaacattct aactttgaat ccttaccaac cgtgtctacc 2880 gaagacgaaa ctaattcttc tatggagaaa actcctgaaa gagttccagc ggcactaact 2940 tatcgagaaa ttccaaaatc atccgattca gaatatattc cgacatgccg aaatagaact 3000 agacgtgtta aaagaactaa taagaaacca acgcgatccc gcgaaataga aatatatgat 3060 atatcacgtc caaacgtaat atcgagtgac aacttacctg aagttagaag tgccaagcaa 3120 agaaagacgg tgtccaatac aaatgatact gtagcaagga caaatagact tccaaccgtg 3180 ctacgaactc tagactcaaa caacattgac acgctgcatg ttgccagtac tggtgaagaa 3240 gtgtccatcg aaagactttc aagcatggct cttcaggaag cgaagaacaa ttccgccaga 3300 actaatcaag ctaattctct tactgattgg tttccagtag 
gcgcaatgcc gatacctgac 3360 cagaggtatc tatccgttca cgatggaaca tatatcagcg actcacaaga tgtgggtgat 3420 actgacctca ctcctgctgt aaccaggcta gttactgaag agaattcaat cgaatctcct 3480 ccatcgttgg attcatcgcc tccaaatacc tcatttaacg cggctctaac tgctattatc 3540 catagcacaa aaaaaggaaa cccgaaaacc tatgcccaag caatgggaag gcctgacttt 3600 caagaatggc acaacgcatg cctcaaggaa ctttccgcgt tcaaagatca caatacgtac 3660 aaattggtgt ctcttccaaa gcaaagaaga gctcttggat cgcgctgggt attcacaata 3720 aaagactccg ggacgtacaa agctcgcctt gtcgcccaag gacatactca aaaggctggt 3780 attgactatc aagaaacttt tgcaccagtc attcgatatg actctgttag attatttctg 3840 gcccttgcta gctgcctcaa actaatagta tatcagatgg acgttgacac cgcgtttcta 3900 aactcaaaaa tgaatgagcc ggtatacgta aaacaaccac ccggatttat taatgaaagt 3960 aatcccgact atgtatggga actatacggc ggtatgtatg gactcaagca agccccatta 4020 ctatggaacg aacatatcaa caatactctt caaaagattg gttttcgtcg acatgaaggc 4080 gaacatggct tatactttcg ttccacatct gatggtccca tctacattgc cctatacgta 4140 gacgacttac ttgttgctgc tccctctccg aaaatatatg acagggttaa gcagaaacta 4200 acgaagttat actcaatgaa ggatctaggt aaagttgaca aattcctcgg tcttaacatt 4260 aatcaatttt caaatggaga catcactctc tcacttcaag actatattgc taaagctgca 4320 tctgaaagcg aaataaacat atgtaagcct acacagactc cgctctgtga ctcaaagcct 4380 cttttcgaaa caacttcccc gcacctaaag gacatcactc cttatcagag catagttgga 4440 cagcttctct tttgtgcaaa tactggtcgt cctgacatat cttatccggt ctcactactc 4500 tccaggtttc ttcgcgaacc tcgcgcaatc catttggagt ctgctcgacg agttctacgg 4560 tacctatata ccaccagaag tatgtgtctc aagtatcgtt ctggatctct gttggcacta 4620 actgtatatt gtgatgcatc tcatggagca attcacgatc tcccacactc tactgggggg 4680 tacgtgactc tacttgctgg tgctccagtt acgtggtcat caaagaaact caagggtgtg 4740 attcctgtat catctactga ggcagaatac attactgcaa gtgaaactgt catggagata 4800 gaatggattc aaaacttgtt tgaacactta ggccagccac ttatctcatc aacattatac 4860 gtagataatg aacctgctat aaaactatct aaacatcctg tatttcacac gagaacaaaa 4920 cacattgcct tgagatatca caagctaaga agtgcagtgg cagcaggcat aattaccata 4980 gagcatgtta ttacaaagag acaagttgct gacatattta caaaaatcct 
tccagcagaa 5040 tcattcaaag cacatagggc tgtcatggtg agggaaccag aaactgcaaa ataaccactc 5100 tcatgcgtat tcagttatgg ggggatgttg aatgtgataa cccaaaagca tgatatgggt 5160 aatgtttcag tactgtttca gaattgtttc agtaatgttt tagacaagga aaacatagag 5220 cagcaaacct ccgatccgac agtacttaag aaaccatagt ttctgtgtac aagagtagta 5280 cctatgtaat tcttacattt acataacata tagaaaggtc caataaactt acaacattat 5340 gacatataag ctagatcgta attcactacg tcaaca 5376 // ID PIF_Harbinger-2_Mlaricis repbase; DNA; FNG; 4581 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW PIF_Harbinger-2_Mlaricis. XX OS Melampsora laricis OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). 
XX DR [1] (Consensus) XX SQ Sequence 4581 BP; 1217 A; 965 C; 1012 G; 1387 T; 0 other; aggtggtgtc aaactaggaa aagttgaagt tttcaaaata aaaattttga accgtgtaat 60 taaatacatg tgatacagtt tcaactgtgt atttacagtt tcaactttgt atgtatttgg 120 agggtggcct ccctcgggct tcaccatttg tgaagaaaca aaaaagggct gatgccccca 180 tccttaataa atcaacacct acaccgggaa acgcgccagc aaccacctct cctcgatgtg 240 atatgcctcg aatcactcgt cgccaaaccg ttgtatcaga aattcgcacc gccctggatg 300 ctcttgacga taatgatcca acgattacat tccccctccg gtccagccta tcaacatttt 360 ctcatgcgat ggcggggtct tcgattgcac aactggctgt ctgcttacat atcaaatcat 420 ctcgtgttct cacagtctcc tgcatcataa tgactgtttt cagcttagca atgcaatggc 480 tagaaaacac atttaatgat tcagacaacc agctcgtatt tctttttcaa ctctggcatc 540 aagaagaagt tgaagactta ctcggtcagc tacacgcggc tcaaacccaa cgctatctag 600 taccacgagg gccacctgca cgaggaccac ccgatggacg ctttgccctc ctcttcaatg 660 agcgaagctc cgtgtttagg cacctggtta gtatttcttc ggtatctcaa actcgggtct 720 tgatgattga tgtgatcaaa gtcatatgtt gatttcttag gcaaggatgc aaaaggaaac 780 ttttgtgaag ttacttcaca tgattggcgg gcatcacatc tttcagaaca gatctcccag 840 ctcgccacag gccccacctg agtggcaact cttggtcgct ctggcgtatc tcgggttgac 900 tggaaatggt gctagtccta ggatgcttgc cctgggtttt ggtatatcag gtaagtaaat 960 tttactttcc attccctttg tccaacggca attgctaacc tcacttctca attctcgtag 1020 aaggtagtat ttataattat acaactcgat gtgttatagc aatcctttcg ttgaaggatc 1080 gcttttatcg atggcccgac cagacagaac gcagcaaaat caaaaaggcg tttgggcggc 1140 atagtttttt taaaggatgt gttggtgtca tagatgggac actggttact cttgctacag 1200 gtcctgagaa aaacattgag gattaccgaa ctcgaaaatc aacgtatgca ttgaattcaa 1260 tgcttgtttg tgatcatcgc aaacggatca tctacgcgac tcatgggtgg tgcggaagtg 1320 cccatgactc acgagttttt cgtaattcaa aggcaagtgt attcaactca atcatgtaca 1380 ccattaaggc tgatgaatct ctttctgaac agctctcaag gagccctcaa catttcttcg 1440 gcgaaggtga atatctgctg ggcgattcag gttacccttc aaggaaaaat ttagtttcaa 1500 atttcaaaaa gcctcgacat ataccccttc cagcagccca aaaccacttc aacaatgcgc 1560 tcagcaaact tcgatatgtg attgagcaca caatcttact gtggaaggct cggtggcagt 1620 ttgtgagaca 
gtgtcgagtg gtactgaaag gtgttcaatc agcaaagcga cttaactatt 1680 ttttgagtgc cacagtagtt ctacacaact tcctgatcga tgtccaagat tctcaaatcc 1740 ttcaagattt aggacacgat ggttttgatt ttgatgaagg tgatgatggc ggatccgaag 1800 ctattgatga tgatgtcata gaatcagatg acactcgtgc agctctattc actcaattcc 1860 aacaattgta tcattagttg ttattaattg taattacata tcaatgaaaa gaaatcaatg 1920 attttgttcg ctttttcatc cttggttgtg gatcgatgtc actatctacc tggtgaagta 1980 actcgaacag agttactcat atttgctaaa taagcgatcg attttgaaat catttggttt 2040 tcaacatcaa aacatgatcg attataaatg caaggtttta tatttgagcg tagcattggt 2100 aatttcatta cttcattttt gagagatgat acaatcatgt ttttgctggt atttgagacc 2160 cacaaatcca accttatcac gagaaaaaag gagaggtaca accatcaaag cacacaggat 2220 aaaatttttc ataacatcat atttcagtag tcatagaatc gatcgtttaa ggataatcga 2280 atacgtttat attgaaagga aaatactcgt tagcaaattt tcgattttgc ttagaagtcg 2340 gggattgtac atgacaagct gacaagctcc tcggctcaat cactatcact gtcgttgctg 2400 tcataatcca ttaaagcatc taaatcaggt ctagactctg accgatgttc aaaaaagtca 2460 cagatcaatc agtgatctca atccgattca agacaaaggt tgacttactt tgtttttggt 2520 tttcaatcag acgctctgcc aagttggtag cttctgtgga ggacattccc gtctccatca 2580 attcacgtat agtcttagta gttagctcaa gatcacgata atcattctca gatttctgct 2640 tgtgaattaa caactgaagc tcccacttct cacggcgcaa ctgcatcttc tcgcgcttct 2700 tggcgatttt gtcacgtggt gggggtttgg ggtttgccaa attcgtgatg gcagatgaga 2760 tagttttcat tgaactagac atctcttggc caatcttaag tgttccaata gccgtcttgg 2820 aatgagcttg aatagctcgt tcgcgttgtt cataattttc atccgccttt tcacgtgctt 2880 cagatgtgtt gatcttgagt acagacttgc ttggcgattg agccctaaag ccaattgatt 2940 ttaaaaaaga gcctgatggc cgtcgctttt tgactataac gtcgctgtgt gaatcgaagg 3000 aatcttcttt gatgctatcg gtagactcgc ggcgatgctt tcgggattgg gtgacttttg 3060 gttgatctgc attttggatg gccgccatca ggttatcgcg ttgtagaggg gctggtaggt 3120 tgaatgaatc atcattggta tcagatgggt caaagaggtc actaaaatct gggttgtcga 3180 tgttttgttc cacatctaag acgtcttgat tggccgtacg gagctggaaa gcatcgcgga 3240 tatcattctc gtttacgtcg tccacgaggt ctatctcatc aaagggcatg gctgatggtc 3300 gatcacccat gatttggtcc 
agttcgtaga atgatgggca tatcccctcg atttggtcta 3360 tatatacaag atgtgagcca cagtatcggt tcaaattcaa caagatatac ataccggaga 3420 atgatttctc tcctgagatt ccaattagtc ccccgccagt ttgcttggtc ctctctagag 3480 cttctcgata cctatcctct aacgagtgca tctaaatttt gaaactacat tagtatacac 3540 gtttcgctac ttccaatacg atagtgacaa acttgttgaa ccactgaatt tccatcgcgc 3600 ccaaagattc cattttgaat caagaattgc gagatatctt cggctatcac attcttcttg 3660 gaaccctttt cgcgccatct gttgtaatta ccggggatga ggacccattg gattaagata 3720 tcgctagccg attcgccatt caaattctta tgtcttgccc agtggcggtg aaaatcgtct 3780 tttgagttgg tcggcgtgca attaggagat tttaacgtgg cggtatcctg cactgctgaa 3840 ttggtcttat tggcctttgg cttgcgtcct cttttacctt tgactaggac aggtgatggg 3900 gacgtataag tcgttggctg agtattcatg ctaccgggct ggttaggcgg tggttgattg 3960 gggtaatgag gctgatcttg gacggcagaa tttccgtggg caattgctga tgcaggggga 4020 gtgctgatgt gatgactttg agggaatcct ccaaaatttc catctgaaga tccgtggaaa 4080 ataaagggcg tttgcggcat tggaccagac atgtgatcaa acggttcaaa cccgtgatca 4140 aattggggaa aatgttgtgg tggtccagtt tggtggtcat gagagggagg atcggggttg 4200 tagccataaa ttggaggtag aggttctgat ggaaagcatc cctcagggga tggtacgtaa 4260 tagttcatta gagctcttta gcgttgagac ttggaagaga tggccatcta tggaagtact 4320 tacaaacacg gcctcaagag acctgtccca acgcgcttta ccctgcccca gtgttctgtg 4380 tagatgttga tttattaagg atgggggcat cagccctttt ttgtttcttc acaaatggtg 4440 aagcccgagg gaggccaccc tccaaataca tacaaagttg aaactgtaaa tacacagttg 4500 aaactgtatc acatgtattt aattacacgg ttcaaaattt ttattttgaa aacttcaact 4560 tttcctagtt tgacaccacc t 4581 // ID Gypsy-26_MLP-I repbase; DNA; FNG; 6526 BP. XX AC AECX01000138; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 28-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-26_MLP_; KW Gypsy-26_MLP-LTR; Gypsy-26_MLP-I. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6526 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000138; Positions 117215 110690. XX CC Positions [5327-5806] - Integrase core CC 'CAAAG' target site duplication CC LTRs are 100% similar to each other. CC Includes an insertion of DNA3-2_MLP (masked by x). XX FH Key Location/Qualifiers FT CDS 344..1417 FT /product="Gypsy-26_MLP-I_2p" FT /translation="MEEIQRQLAELTTSVQEERRMRLEAEVRTLNAEARLA FT AIEAGQTPNQPSQAQVPPSAPSPDPAAQPTLQTAMQKGPKVSVPDKFSGIR FT GGPAEVFASQIQLYMLAHPYSFVDKRSKVVFALSYLTGAASSWAQPLTLEL FT FDDSTSHTVTFDCFVTNFKAMYFDSEKKTKAERAIRSLTQKTSVAAYTHEF FT NIHAAATGWETPTLVSQYEQGLKKDVRVAMVMAQETFTSIEQIANLAIRID FT GKLHGVSESSTISFHTPSDPNAMDVSAGFIRLSDDERARRMRAGLCFRCNG FT QGHRANECSERRVEKGKGKGGYGGFKSRIAELEIKLAAMNGKDESRIDVVG FT GSSRADVSKNGGAQE" FT CDS 4607..6430 FT /product="Gypsy-26_MLP-I_3p" FT /translation="MTTKQLTRRQARWAETLGCFDFQIKFRPGRNSTKPDA FT LSRRPDLKPSDDTNLSFGQLLRPENIGPDTFPVELSSFESFFADETVELNS FT AEHWFDIDVLGIDECDAAPQDSMTDDQIINLIRESSATDDRIRDLMEACQN FT PMSSRTKTATKIYDIKDGILYKHGKIEVPHVEHIKYQIARSRHDTLIAGHP FT GRSKTLSLVKRSFTWPSLKSYINKYVDGCDSCLRVKSTTQKPFGTLEPLPV FT PAGPWTDITYDLITKLPISNGFDSILTVVDRLTKMSHFIPCRESMTAEDLA FT DLMVKNVWKLHGTPKTIISDRGSIFISRITRELDKRLGIRLHPSTSYHPRT FT DGQSEIVNKVIENYLRHFVTYRQDDWERLLPTAEFAYNNRDHESIGVSPFK FT ANYGYNPTFNTVPSSDQCIPIVEERLKVLEEVQRELTVCLELAQDTMKNQF FT DKKVRHTPKWDVGDEVWLDSKNISTTRPSPKMSHRWLGPFNITKKISNSTY FT ELNLPLTMKGVHKVFHVSVLRKHNPDAIELRKQPERPPIEVEGEEEWEVVA FT ILDSRKRRNQVEYLVNWTGFNSNHNSWEPEGNLTHCKDLLKEFKSRFPDTA FT RKLRARRRMK" FT CDS 1417..2650 FT /product="Gypsy-26_MLP-I_1p" FT /translation="MIDVPILSQEGTVDEIELGASRICNANDPRIFFKSSI FT SPLHNSRATSSTSFHTLFLIDSGATHDVLSETFASETGLLDHAEKATRVVT FT GFDGSRSHASYETDLILDQDPNPTRFIITRIKDSYDGILGIPWIKKNYHRI FT 
NWADGCINYPQDHIAAAIAVSSRPEPPSRALGLEPQREARTIDKGMCIGNN FT TIASPQCEYNSHFDNHHPEAASKLCTLPNSQISDTETPTHNSNSTALAKIH FT ESTRPPKNSATADTVSSTPPTTPSGHTTGPMEEARKSDEGMCIKSDTLASP FT QSESDLLNYPKLQRTAGKLLSLPKSQPRAHIRTNARSPTSRISQQRSYAAA FT VTVSSIPKSSPTQPKMEPGGHARICDEGAVIGIDTDKPPQCEHVIPLQSSP FT IEAAGQLDPLQDNLPNIX" XX SQ Sequence 6526 BP; 1792 A; 1525 C; 1330 G; 1256 T; 623 other; tattgtcgga tcaatcaacc atcaggactg aggctttatt attagaagaa tcgataatcg 60 aaagaaaaga agaatcagtg attagaactt accggaaccc caagatttaa acagaaactt 120 agattatcac cgcaataact ccgctcaccg caagaccaat agaaccgaaa ccttatctga 180 aaaccttaaa aaccccacac ccgataatcc agacaacacg tcgccgacgt acagatcacc 240 cagccacggt gaagaagatt ccgattccga aacgccttca tcgttcgctg acgcagccga 300 taccggctat atcgacaatt ctctcgctct taaccccggc tccatggaag aaatccagcg 360 tcaactggct gaacttacca catcggtaca ggaggaacgc cggatgcgcc tcgaggccga 420 agtgcgcacc ttgaacgcgg aggctaggct agcagccata gaggccggtc agacaccaaa 480 ccaaccgtct caagcccaag tgccaccctc ggcaccgtct cccgatcccg ctgcccaacc 540 tactctgcag acggccatgc agaagggacc aaaggtctct gttcctgaca aattcagcgg 600 tatcagaggt ggcccagcag aagtattcgc tagccaaatc cagctataca tgttagcaca 660 cccatactcg ttcgtcgaca aacgaagtaa agtcgttttc gccctctcct atcttacagg 720 agcggcgagc agctgggctc aacccctcac attggagctc ttcgacgaca gtacgagtca 780 taccgtcact ttcgactgtt tcgttacgaa ttttaaggct atgtactttg atagcgaaaa 840 gaagaccaag gcagagagag ctatccgcag cctgacccaa aagacgtctg tcgcggcata 900 cactcacgag tttaatatcc acgctgctgc tactggatgg gaaaccccta cccttgtcag 960 tcaatacgaa caagggctga agaaagacgt ccgagttgcc atggtcatgg cccaagagac 1020 cttcaccagc atcgagcaaa ttgctaacct ggctattcga attgatggta aacttcacgg 1080 cgtctctgaa agctcgacga tctccttcca cacaccgtct gacccaaacg ccatggacgt 1140 atcagctggt tttatccggt tatctgacga cgaacgtgcc agacgtatga gagcgggcct 1200 atgctttcgg tgtaacggtc aaggtcatcg agctaacgag tgtagcgaga ggagagttga 1260 gaagggaaaa ggaaaagggg gttacggagg attcaagtct aggattgctg aattggagat 1320 taagttagcg gcgatgaatg ggaaggatga atctagaatt gatgttgtag gaggaagtag 1380 tagagcagac gtgtcaaaaa 
atggaggggc tcaagaatga tagatgtgcc tatcttgagc 1440 caggagggaa ctgtagatga aattgaactt ggtgctagta ggatttgtaa tgcaaatgac 1500 ccacgcattt ttttcaaaag ctcaatttca ccgctccaca attcccgagc cacatcctca 1560 acttcctttc acactctctt ccttatcgac tccggtgcca ctcatgatgt ccttagtgag 1620 acatttgcat cagaaacagg acttttagac catgctgaga aagctacacg tgtggttacc 1680 ggattcgatg gatccaggag ccatgcgtcc tatgagactg acctcatctt agatcaagac 1740 cccaacccaa cccgattcat aatcaccagg atcaaggact catacgacgg gattctcggc 1800 atcccttgga tcaagaagaa ctatcaccga atcaattggg ccgacggctg tatcaactac 1860 ccacaagatc acattgcggc tgcaattgca gtttcgtcac gtccggaacc accctcacga 1920 gccctaggat tggagcccca gagggaagct aggacaattg acaaggggat gtgtattggt 1980 aacaatacaa ttgcatcccc gcaatgtgag tacaattcgc actttgacaa ccatcatccc 2040 gaagcagcta gcaagctttg cacgctccca aattcacaga tatcagacac agagactcca 2100 acccacaact cgaattcaac tgcattggcc aagattcacg aatcgacgag accacccaaa 2160 aactctgcaa ctgctgacac agtttcgtcc actccgccaa caaccccatc aggtcatacg 2220 actggcccca tggaggaagc taggaaaagc gacgagggga tgtgtatcaa atcagataca 2280 ttagcatccc cgcagagtga gtctgatcta ttaaattacc ctaagctcca aagaacagct 2340 ggcaagcttt tgtctctccc taaatcccaa ccaagggcac atatcagaac caacgcacgc 2400 tcacccacat cacgcatcag ccaacaaagg tcttacgcag ccgctgtgac agtgtcgtcc 2460 attccgaaat caagccctac gcagcctaaa atggagcccg gagggcacgc taggatatgt 2520 gacgaggggg ctgttatcgg aattgataca gacaagcccc cgcaatgtga gcatgtcatc 2580 cctctacagt catctcctat tgaagcagct ggccagttgg atcctttgca ggacaacttg 2640 cccaatatcx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2700 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2760 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2820 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2880 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2940 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 3000 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 3060 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 3120 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 3180 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 3240 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxatcggggc agcgaagaca tcgtggtcca 3300 catcagcaaa acttgcagcc gacgagaaga agaacctccc taccaagaca gtcgaggaga 3360 tggtcccaac gtgctaccac cgccacctgc atatgttcca gaagtccaaa tcacaggtac 3420 tcccaccacg acgaaaatac gacttccgcg tcgatctagt acctggagca cagccacagg 3480 ccagtaggat catcccatta tcacctgcgg aaaacaaagt actggacgaa ctaatcaaag 3540 aaggactagc taatggcacg atacgacgca ccacatcccc atgggcagcc cccgtgctgt 3600 tcaccggcaa gaaggacggg aacctttgac cctgtttcga ctacaggaag ctgaacgcat 3660 tgacggtcaa aaataagtac cccctgccat tgacaatgga gcttgttgac agtttgctcg 3720 acgcggaaga tttcacaaaa ctggacctac aaaacgcata cggaaacctc cgggtagcag 3780 aaggggacga agataaatta gtgtttatct gcagccacgg tcaattcgcc cctctcacga 3840 tgccattcgg acctacaggc gcccctggat atttccaata ctttatgcag gatatattcc 3900 tgggaagaat tggacgagac accgcggcgt acctagatga cacaatggta ttcactaaga 3960 agggtgttaa ccacgaatcc gcggtcgact caatcctcga tatcctagat aagcatcaac 4020 tctggcttaa gcccgagaaa tgcgaattct caaagaagga ggtcgagtat ctcggtctca 4080 tcatctcaaa gaataaagtt aagatggacc ctacgaaggt taaggccgta aaggagtggc 4140 cagccccccg gaacacatct gaacttcaac gctttatagg ttttgccaat ttttaccggc 4200 gtttcatcga tcaattttcc aagacaactc gaccactaca caaccttacg aaactgaata 4260 cgccgtattc atgggacaat gcgtgtgaga aggcatttga aagtctcaaa actgctttta 4320 cttcagcgcc aatattgaag atagcggacc cctacaagcc ctttatcctc gaatgcgatt 4380 gctccgactt tgcactggga gcaatattat cccagcgttc agaggaagat ggtgaagttc 4440 atcccgtttc ttatttatca agatcactcg tgcaagctga gcgcaattac gaaatcttcg 4500 acaaagaatt acttgcaatc gtggcatcct ttaaggagtg gcgccactac ctcgagggta 4560 atccgaatag gcttgacgtg attgttaccg taatttggag tctttcatga cgaccaaaca 4620 actcacacgt cgacaggcac gttgggccga aactttaggt tgttttgact tccagataaa 4680 gttcagacca ggtcgaaatt caaccaagcc ggatgcattg tcacgacgac ccgatcttaa 4740 gccctctgac gacaccaacc tatcttttgg tcaactgtta 
cgaccagaga acattggtcc 4800 ggacacgttc ccagttgaac tctccagctt cgagtccttt tttgccgacg agacggtgga 4860 actcaatagc gcggaacact ggtttgacat tgacgtatta gggatcgatg aatgtgacgc 4920 agcgccacaa gactcaatga ctgacgacca gattatcaac cttatacgcg aaagcagtgc 4980 aacagacgac cgaatcagag acttgatgga agcatgtcag aacccgatgt catctagaac 5040 aaagactgcc acaaagattt atgacattaa agacgggatc ttatacaaac acggcaaaat 5100 cgaagtacca cacgttgaac acatcaagta tcaaatagca aggagcaggc acgacacgtt 5160 aatagcagga caccccggca ggagcaagac attatcgcta gtgaaacgca gtttcacctg 5220 gccgtcacta aaatcctaca tcaataagta tgtagacggt tgtgactcct gcttacgtgt 5280 gaaatctaca acacagaagc cctttggaac cttagagcca cttccagtac cggcaggacc 5340 ttggaccgac attacttacg acctgatcac aaaattgccg atatcaaatg ggtttgacag 5400 cattctaacc gtcgttgaca gactgacgaa gatgagtcat tttataccgt gtagggagag 5460 catgacggcc gaagacctag ccgatttgat ggtgaagaac gtatggaagc tgcatggcac 5520 tccgaaaact atcatctcgg atcgcggaag catctttatt tcaaggataa ctcgagaatt 5580 agacaagaga cttgggataa gattacaccc ttccacttct taccatcccc gtaccgatgg 5640 gcaatctgag atcgttaaca aagtgattga aaactacctt aggcacttcg tcacgtaccg 5700 ccaagacgat tgggaacgac tactgccaac agcagaattt gcttacaaca accgagacca 5760 cgaatccata ggcgtatcac cattcaaagc caattatggt tacaacccca ccttcaacac 5820 cgtaccatcc agcgaccaat gcatccctat cgtggaggaa agactcaagg tactagaaga 5880 agtacagagg gaactaacag tatgtttaga actagcacag gacacaatga agaaccaatt 5940 cgacaagaaa gtcaggcaca caccgaaatg ggacgtaggc gatgaagtat ggttggactc 6000 aaagaacatc tcgacaacac gacccagtcc taaaatgagt catagatggc tgggcccttt 6060 caacattacg aagaaaatct ctaactctac ttacgaatta aatttaccac tgactatgaa 6120 aggagttcat aaagttttcc atgtttcagt actccgaaag cacaaccccg acgcaattga 6180 actccgaaaa cagcctgaaa gaccgccaat agaagtagaa ggcgaggaag agtgggaagt 6240 ggtggcgata ctggactcca gaaaacgtcg aaaccaagtg gaataccttg tgaactggac 6300 aggtttcaac tccaatcaca attcatggga accagaaggt aatctcacac attgtaaaga 6360 cctactgaaa gagttcaaaa gcagattccc ggacacggca cgaaaactca gagcacgaag 6420 gagaatgaaa tgagggcata gctttttccc taaggggttt tttaacgctg 
cccagggagg 6480 aatgcaggac tcgcaagagg gagtctgggc attaaagggg ggataa 6526 // ID Gypsy-19_LBS-LTR repbase; DNA; FNG; 247 BP. XX AC ABFE01001248; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-19_LBS_; KW Gypsy-19_LBS-I; Gypsy-19_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-247 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001248; Positions 3617 3371. XX SQ Sequence 247 BP; 39 A; 72 C; 42 G; 94 T; 0 other; tgtaatcctg gtcccggatt actcttattt ggacctgtta ccattgtttg gacttgttgc 60 ttctctttcg tatttctact tcttcctgcg cgcttccgcg cttggactct taccaccatc 120 ttcccatacg catcttactc tcctgtaccc atactttccc tctagaacct ccagaatcat 180 agacgtggtt tatttctgtt ggttcgtctt gtgagcagtg ttatctaccc ctccgggtag 240 atttcca 247 // ID Copia-2_CCO-LTR repbase; DNA; FNG; 211 BP. XX AC AACS02000002; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_CCO_; KW Copia-2_CCO-I; Copia-2_CCO-LTR. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-211 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000002; Positions 2676996 2677206. 
XX SQ Sequence 211 BP; 56 A; 49 C; 32 G; 74 T; 0 other; tgttgagaat gttctcgtcg atacacgctg actcgttact agttagatta gttattgcat 60 tcttatgttt ccactttatt agaaattccc cgagcgctta tacacgcgct tatcgagcgc 120 tccaatgact taccaatcac gtctcgtagc tactataaat actgttctgt agaacacgaa 180 aatctatcgt tttattcctc tatttacaac a 211 // ID Gypsy-114_MLP-I repbase; DNA; FNG; 6897 BP. XX AC AECX01000711; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-114_MLP_; KW Gypsy-114_MLP-LTR; Gypsy-114_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6897 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000711; Positions 52092 45196. XX CC Positions [5705-6217] - Integrase core CC 'GATAC' target site duplication CC LTRs are 99% similar to each other. CC Includes a mariner insertion at positions 2492-2673 (masked by CC "x"). 
XX FH Key Location/Qualifiers FT CDS 508..2492 FT /product="Gypsy-114_MLP-I_1p" FT /translation="MADDVDITSWNMSTSDINAVPAMSSANTETTVKKGGK FT KGSKATGKVGGINTDTEMNDATAAEEAPGGMSSMNDDQDDLEVVIHGEPTN FT NNSNVGMDVSKPLTSRRGKGLFDLPPPMTSGLRSGRMASSTRHEIRPNPTK FT GRTAQDSGGDGSCLRVSGGSTEEPRGGLRGSIGENANPTSARDTISRDIPP FT HQTNETHPQALVRPKDKHTVRSDGVGSTSNGGQLRQQTETSTPRIEPIIPA FT SKGYDSSNSTGGTGATNGNGSGELRGSGRNDVSKENVTAPNVATIPLSSVI FT TKSTALRSVPSIVPNSLVSQPITQSSPLKVNVGLRSQSASSNLISLSLPHY FT NAMITRLAELETVMRVNFGTMNGQLKTQAESLQAFRNEISSVQELIEDVTQ FT MNNMLDSKLEAFEEALEQHRRDQQNIAKLMHITSTASEELTRDTTNMCEAM FT EQLTLNRETERQSIRELKIDFTRQFQEMKEHLDGVARNISRMQSEQDKNIL FT EKIPTEIKRGVSELENTKTDMFVGAREDTKYKADSEVIEISTPYTNPRNYP FT KIDSYPKFSGKPDEDWVDFVDKIDTFQDSYGMPDAEIVSKLPSILKGVAYT FT WYRAVHKQHRQKSWEHWKSLIQKKFGGPVWRQRQLTLLHQMKFSYNSHEVI FT EFLTELHRKLESX" FT CDS 5501..6853 FT /product="Gypsy-114_MLP-I_4p" FT /translation="MKEQILKLCHDDILAGHFSLEKTLNRIKNTAWWLNYN FT QDVIEYISSCDTCQKGNRKTGKTFGLLQEIQKPTKPWEIINMDFVTGLPPA FT GELSYNSALVIVCRLSKKAKFIPCHKDIDAKGLAHLWWKNALNECGLPSAI FT ISDRDPKFTSEFWTSLMQIAGCDLKLSTAHHPQTDGLAERTIQTMEDLIRR FT YCAFGLLYKDSEGFTHDWVSLLPGLQFAYNSSVHSSTGRTPFELERGYIPQ FT NPRMLTNKKLGKLDIHPSAGNFSHMQELAWLHAAECITKAFAYEKQRWDKT FT HTTPPFKAGDQVLLSTIHFNNLNCNQKLKDPFIGPFTVVKMVGSNAAELDL FT QGSYSRRHPVFPVSLMKMYLSSDSEKFPKRTANKKAAPEITEEEGEIQRVL FT QQRVVTKGNKKVRQFLVSFKNKSPDLSRWVQEEEIPNGTALLQKFRKEARE FT ERITKK" FT CDS join(2714..3619,3623..5428) FT /product="Gypsy-114_MLP-I_2p" FT /translation="MKLPTEVQDTISVSTRDVEDISEYLTICERILTNQKN FT VNKPSTNNIRRPWRNEVTNNNIGRKDDNIVEKKKVTIPLQGDSKDKIKSKI FT CHRCKGPWVPGHTCNKILNIDGDEDESDVYSGEEEDLVENDNVQNNENDVM FT ILETEILHGCFMTDDLGQLRDVQEAEYVQTPSKVTSVCPARCTAVVNQKEV FT TIVLDSGAGGSVVSSNYLQKVDQNWRHNLVNEDTGKWKGYGSVLKPIGTYV FT ANVVFGHKRGNIRAVMKFIVMDNEGLPQYFIVGNNNILLYGIRLHICDRYF FT TLGNNLKRKFLTYEKTIPVQRITVVAGSSEKKECVMRGIPTVEPEEFEKAF FT SEATWDTALPFEDRELLSEVIRDFPMVFAHGKRQLGEVTVDEFDINLNIDK FT DKMPSSLKKKAYPCSPQKRKDIEENIQELLDLGVLEEIDRTPRECVISPVI FT IQYQNGKKRMCGDFRSLNDHTVSDVYGMPRIDSILHGLKGATRISLLDGYK FT 
GYHQFRNTERASNYLIIITHCGMYRYLRMPFGPKNGPSVFQQTMDKTFSKE FT IREGWMTIYIDDIIIHSKNTADHAEHLRRVFTKLEQINLTLAFKKCHFAFQ FT STKVLGHIVSGILMSVDGNKVKAISHIQPPTTVREVQSFLGMCGYYRQYIK FT NYQLVALSLTRLIRRNEAFEWTDERQRAFDTLKKMLQEAPLLSLPDFDKPF FT IVYTDASFVGLGAALHQKQVVDGREIEVPICFISRSLRNGELRYGATQLKC FT LAVVWALEKLHYYLDGSTFEVVTDCSAVKSLLGMKTPNRHMFRWQIAIQEY FT RGRMTISHRAGDKHQNADCLSRNPMPNNSDNPACVAPEDDAEIFGLHVVDL FT ENEFYYSVAEGYTLCPNMKKIIIVLKKPDNNTNHEIISSLDEPWKKLFHQG FT " XX SQ Sequence 6897 BP; 2275 A; 1247 C; 1513 G; 1680 T; 182 other; attgggggtc tggccgaact tactaggctt taaactatct cgtcgaagga cttttataaa 60 tattgcacca tatctgcaca ttataagagc cctagacact gataactgac tattgcttat 120 tgatatatcg ataagaaaca gattagaaag cagtgctgat attcgttcgt actttttaaa 180 taccaaatac cagcttttcg catgtatcaa atctgcattc gcgatcaaac aatccggccg 240 aattagtcta acaactcaag atcaaaactt gtaagttgtg atagtatctc ttccgagaag 300 ggtaaaagac taactggcaa atagttatag ttaatcgaac aatcattcga agaaatcgaa 360 cataaaatcg ccgattttat taaaactaca aagttcgttt ttcttgtttt cgtcgcttcg 420 ttgaagtcct gtgataagtt gaacataaga actaacgaaa ggttgcgttc ttaaatcaaa 480 ctttacaact catttgaatc gatcgccatg gccgacgacg ttgatatcac cagttggaat 540 atgtctacta gtgatatcaa cgctgtaccg gccatgagtt ccgctaacac tgaaactacg 600 gttaagaaag gtgggaagaa gggtagcaag gctactggaa aagtgggggg tatcaacact 660 gataccgaga tgaacgatgc gaccgcggct gaggaagctc caggaggaat gagtagcatg 720 aatgatgatc aagatgatct cgaggttgtc attcatggcg aaccgactaa taacaacagt 780 aatgttggaa tggatgtaag taaacccctt acatctagaa gaggaaaggg tttgtttgac 840 ttaccacctc ccatgacttc aggacttagg tcgggaagaa tggcgagctc tacgaggcat 900 gagattcgcc ctaacccaac caagggacgt acagctcaag attctggggg agacggtagc 960 tgcttacgag tgtcaggtgg ctcaactgaa gagccacgtg gaggacttag aggctcgatt 1020 ggcgagaatg ccaatccaac ctcagcccga gatacaatat ctagggacat cccaccacac 1080 cagacaaacg aaacccatcc ccaagccctt gttcgtccca aagacaaaca taccgtgcgc 1140 tccgatggcg tcggctccac cagtaatggt ggacaactac gccaacaaac ggaaacatca 1200 acacctcgaa ttgaacccat catacccgca tcgaaagggt acgatagttc caattcaacc 1260 ggtggcacag 
gcgcaaccaa tggtaatgga tcaggagaat tacgtggttc cggaaggaat 1320 gatgttagta aagaaaatgt gacggctccg aatgtagcta ccattccttt atcatctgtt 1380 ataaccaagt ctactgcttt gcgctctgta ccgagtattg taccaaattc tcttgtatcc 1440 caaccgatta ctcaatctag tccgcttaaa gtgaacgttg ggttgagaag tcagtctgcc 1500 tcgagcaatt taatatcact gtcattacca cactacaatg ctatgataac aagattagct 1560 gagttagaga ctgtaatgag agtaaatttt ggcacaatga atggccaatt aaagacacaa 1620 gctgaaagct tacaagcctt caggaatgaa atatcatcag tacaagaact gattgaagat 1680 gtcacacaaa tgaacaatat gctggatagc aaattggaag catttgagga ggcattggaa 1740 caacacagac gagatcaaca gaacattgcc aaattaatgc acataacgag tacagcctct 1800 gaggaattaa ccagagacac aaccaatatg tgtgaagcaa tggagcagtt gactttgaat 1860 cgtgaaactg aacgacaaag tatcagagag ttaaaaattg attttacgcg gcagtttcag 1920 gagatgaagg aacatcttga tggagtagcc cgtaatatct ctcgaatgca gtctgagcag 1980 gataaaaata tcctagaaaa gataccgacc gagataaaac ggggtgtgtc agaattagaa 2040 aacacaaaaa cagatatgtt cgtgggtgca agggaagaca caaaatataa agcagattct 2100 gaagtaatcg aaatatctac gccgtacact aatccaagga actatccaaa aattgatagt 2160 tatccaaaat ttagtggtaa accggatgag gactgggtag atttcgtaga caaaatcgat 2220 acattccagg attcatatgg gatgcccgat gctgaaatag tatcaaaact tccatccatt 2280 cttaagggtg ttgcgtacac atggtatcgt gcggtacata aacaacacag acaaaagagt 2340 tgggaacatt ggaaaagttt gatacaaaag aaattcggtg ggccagtgtg gagacaacgc 2400 caattaacgt tactacacca gatgaagttt tcgtataata gtcacgaagt tattgaattt 2460 ctgactgaat tacacagaaa attagaatca cxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2520 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2580 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2640 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxcccatcg gcaagcagtg atgatataaa 2700 agaacatatc ttaatgaaac ttccaacaga ggtacaggac acaataagtg ttagtacaag 2760 ggacgtggag gatatctctg aatatttgac aatttgtgaa agaatattga ccaatcagaa 2820 aaatgtcaat aaaccaagca caaacaatat tcgtcgaccg tggagaaatg aagtgacaaa 2880 caataatatt gggagaaagg acgataatat tgttgagaaa aagaaagtca cgatcccatt 2940 acaaggcgac tcaaaagata 
aaattaagag taaaatttgc caccgttgca agggtccttg 3000 ggttcctggg cacacatgta ataagatcct aaacatcgac ggagatgagg acgaatcaga 3060 cgtgtattca ggagaggagg aagacctagt ggagaatgat aatgttcaaa acaacgaaaa 3120 cgatgtaatg attttggaaa cagaaatatt acatggttgt tttatgacag atgatctagg 3180 acagttgaga gacgtacaag aagcagaata cgttcaaact cctagcaaag ttacaagtgt 3240 atgcccagca aggtgcacag ccgtggtgaa ccagaaggaa gtcacaattg tacttgattc 3300 tggggcaggc ggaagtgtgg tatcaagtaa ttatttacaa aaagttgacc aaaattggag 3360 acacaactta gtaaatgagg atacaggaaa gtggaaaggt tacggatctg tgctaaaacc 3420 aattggaact tacgttgcaa atgtggtatt tggacacaag agaggtaata ttcgcgccgt 3480 aatgaagttc atcgtaatgg ataatgaggg actacctcag tacttcattg tgggaaacaa 3540 taatatattg ttgtacggga taagattgca catatgtgac aggtacttta ctctgggtaa 3600 taatctaaag aggaagttct aattaacata tgaaaagaca atccctgtgc aacgaataac 3660 tgttgtagct gggagcagtg agaaaaaaga atgcgtgatg aggggtatcc caacggttga 3720 gccagaggaa ttcgagaaag cgttctctga ggcaacttgg gataccgcgc taccattcga 3780 ggacagagaa ttattatcgg aagtcattcg agattttccg atggtatttg ctcatgggaa 3840 aaggcaactt ggtgaagtaa cagttgatga atttgatatt aatcttaata tcgataaaga 3900 taagatgccc tcgagcctaa agaagaaagc ctatccatgt agtccacaga aaagaaagga 3960 catagaggaa aacatacaag agttattgga ccttggtgtt cttgaggaaa ttgaccgtac 4020 acctcgtgaa tgtgttattt cacccgtgat aatacaatat cagaacggga aaaagaggat 4080 gtgcggtgat tttcgttctc tgaatgatca tactgtatca gatgtgtatg ggatgccacg 4140 gattgactcg atattacatg ggcttaaggg agctacaaga atatcattgt tggatggata 4200 taaaggatac catcaatttc gaaacacaga acgtgcgagt aattatttaa taattatcac 4260 acactgtggg atgtatcgat atcttagaat gccgtttgga ccaaaaaatg gaccatcagt 4320 atttcaacaa acaatggaca aaacattcag caaagaaatt cgagagggat ggatgacgat 4380 atatatagat gatatcatta tccattcaaa aaacactgca gatcacgctg agcatctgcg 4440 ccgggtgttt actaaactag aacaaattaa tttaactttg gccttcaaaa aatgccattt 4500 tgcatttcaa tcaactaaag tacttgggca tatcgtgtct ggtattctga tgtcagttga 4560 tgggaacaaa gttaaagcaa ttagtcacat ccagcctcca acgactgtaa gagaggtaca 4620 gagttttcta gggatgtgtg gatactatag 
acaatatata aaaaactatc aattagtcgc 4680 actgtctcta acaaggttga ttcgtcgcaa cgaagcgttt gaatggacgg atgagagaca 4740 aagggctttt gatactctaa agaaaatgct acaagaggca ccattgttgt ccttgccaga 4800 cttcgataaa cccttcatcg tgtataccga tgctagtttt gtaggactag gagcagcgtt 4860 acaccaaaaa caggtagttg acggaagaga gattgaggtt ccaatttgct tcatctcacg 4920 ctcgttgcga aacggcgaac tgagatatgg tgcaactcaa ctcaaatgtc ttgcagtggt 4980 atgggcatta gagaagttgc attactacct agacggcagc actttcgaag tggtgactga 5040 ttgttcagca gtaaaaagtc tgctggggat gaagacacct aaccgacaca tgtttcggtg 5100 gcaaatagcc atccaagaat atcgaggaag gatgactatc agccaccgag caggagacaa 5160 acatcaaaat gctgattgtt tgtcccgaaa cccaatgcca aataattccg ataatcctgc 5220 gtgtgttgca ccagaggatg atgctgaaat ttttggcctg cacgtcgtag atctcgagaa 5280 cgagttctac tacagcgtag cagaagggta taccttatgt ccaaatatga agaaaataat 5340 catagtcttg aagaaaccgg ataataacac gaatcacgaa attatcagtt ctttagacga 5400 accttggaag aagctttttc accaaggctg atttatcttt gaggacgatc tattgtatta 5460 ccgaaagcag ggatcacatc ggctagttgt tcatgaaagc atgaaggagc agatactgaa 5520 attatgtcac gacgatatac tcgcgggaca tttcagtttg gaaaagactc tgaaccgtat 5580 caagaatacg gcttggtggt taaattacaa ccaagatgtg atagaataca ttagttcttg 5640 cgatacctgt cagaaaggga atcggaagac tggaaaaaca tttggtctcc tgcaagagat 5700 acaaaaacca acgaagccat gggaaataat caacatggac tttgtaacag gtttaccacc 5760 tgcgggtgaa ctttcataca actccgcatt ggtcattgtt tgccgattgt ccaaaaaagc 5820 aaagtttatt ccgtgtcata aagacattga cgcgaaggga cttgctcatc tttggtggaa 5880 aaatgcattg aatgaatgtg gattaccgag cgctatcatt agcgacaggg accccaaatt 5940 tacatccgaa ttttggacat ctctgatgca aatagcgggg tgtgatctga aactctctac 6000 agctcaccat ccacaaacgg atggactggc agagaggacc atccaaacaa tggaggactt 6060 gatcagacgt tattgtgcat tcgggctatt atacaaagat agcgagggtt ttacgcatga 6120 ctgggtttct ctcctgccgg gattacagtt tgcttataat agcagtgtac attcaagtac 6180 tgggaggaca ccattcgaat tggaaagagg ttatatacct cagaacccgc gaatgctgac 6240 caataagaaa ttgggaaaac tggatatcca tccttcggcg ggaaattttt cccatatgca 6300 agaattggca tggttacatg ccgctgaatg cataacgaag 
gcctttgctt atgaaaaaca 6360 gcgatgggat aagactcata cgactcctcc ttttaaagca ggagatcagg tcttattgtc 6420 aaccattcac ttcaataatt tgaactgtaa ccagaaattg aaggatccat ttattggacc 6480 atttactgtt gttaagatgg ttggcagcaa cgctgcggaa ttagatctac aaggatccta 6540 ctcaagaaga catccagtgt tccctgtgtc tttgatgaaa atgtatttgt cttcagactc 6600 tgagaaattt ccaaaaagaa ccgccaataa aaaagcagct ccagagataa cagaggaaga 6660 gggagaaata caacgagtgt tacaacaacg cgttgtaaca aaagggaaca aaaaagtccg 6720 tcagtttctt gtttcattca agaataagtc accggactta tcaagatggg tgcaagagga 6780 ggaaatccca aacgggacgg cattattgca aaagtttaga aaagaagcac gggaagaaag 6840 aatcactaaa aaatgattta aacatttttt tcgttctttt ttttttgtct gagggag 6897 // ID Copia-36_MLP-I repbase; DNA; FNG; 4614 BP. XX AC AECX01001637; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-36_MLP_; KW Copia-36_MLP-LTR; Copia-36_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4614 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001637; Positions 5873 1260. XX CC Positions [1900-2424] - Integrase core CC 'GTTAT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 46..3192 FT /product="Copia-36_MLP-I_1p" FT /translation="MSTNPFDPPSPADSTNTYNESEASSSDRTVNQEFFDS FT STLPDSVNPLERVPSEVIMSTPDEGVVSLFGNAIQKYGQQLTTALFKFKIT FT DNLEDGNYASWSRVVFGNLDNLELHHYILTKDYKDPDLSAAQILKTKKILV FT NYILNHLDKSNNTQAVNHLTDPEDSLSLVYDPFSLWSFLKERHFLINSQRL FT ASISKTLSTVSIHRSDTISTFLDKFESLFIEFTRYGGKMDDTQSAIRLIDA FT ITTLPPSIVEFIHATVSPLTRKDVIKYLRDYDTRQNSFTTEATREVNHIES FT TPFISANSAVRSTRVMCTETVCHGPHPPDKCWSKPQNFRERDDFLARRRAR FT GSWNHRGRQFSTRPSNSNSSQVRGMQRVDTPSANSVADSIQMLSLNAEFDS FT ITSSPEETNHTSFELITSEPEANSLESINSDIIWALHDTGATHHMFNDKSL FT FVKDSLKPVEDKNRRLKLAGGDVSLAVDSIGKIQLKAGDNSVFELTECLYV FT PELRKNLISGGTLKRKGVREVYNDSEPTCFALVKDGLALFNGVILQNGLMN FT VKITPVSSSVHRDQSSSSINSSIVHRRLGHLSDQYLRTMKNHESVDGIDVL FT EDNFESCEICSLSKNTKLPFNHTRPRAIKFLENVHVDLSGIYRVNGFNNEN FT YFILFCDNYSCYRHIYGLTSKTKEEVFETFKSYIAVSERQTGCKLKQFTLD FT RGSEFVNNLLGPYLIELGIELHLTSGYAPEENGVSERGMRIVNTKARCLML FT QSGLPQKFWFLSCRAAVFLTNRTVTKALNDFRTPFEVWHFRKPTISHIRTL FT GCQAFRLIRKELRESKYSPVSSEGVLIGFEQDNFNYLIYDLEDNKIHTSHH FT VTFNEDVFPFLKLKSHQATSQSLVKMLYSDDEDEMIEVCELADKAKDSDDY FT RNPDADRLNTGPSSSELEDKIPEHEPIPKVASKNKFENLTMRKSTRNQTQV FT NYKGMCNYSEFEDEDFCTQVLFDGLPQCFNAVLVSQPDPKSFKKPMLAGDS FT PSWKAACDKDMSSLLKKEVWTLVKDQLTNQPFLECGYFERRQS" FT CDS 3318..4589 FT /product="Copia-36_MLP-I_2p" FT /translation="MAAIHGWEVHQMDAVTAFLNGILEEEIYMEQPEGYVV FT VGQEDKVLKLNRSLYGLKQSPKIWQDDVTQSLIALCFEQCTIDPCIYIRSN FT EEEQLFTAVYVHVDDMAITGNDSITFKAEISSIWEMEDLGLAQTIVGIEIN FT RRSPNIYTLTQTKFAITVLKRFDMLESKSASTPLTPNLKLYRATDEECKDF FT SLLGHNYRSAVGSLMYLSQCTRPDLAHSVGVLSQHLDRPGLQHWNAALQVF FT RYLKGTMHLGIVYSGEDNLTVSGQRSFSYPNSYCDADWAGDQNTRRSTTGY FT IFILAGGALSWKSRLQPTVALSSTEAEYRAITEAGQELLWLRRMMELFGCK FT DLNPTVLQSDNQGAIHLTNKSIFHGRTKHIEIQYHWIREIVKNGDLILEHC FT PTSEMVADLLTKALGKQQFIRLRSKLGIKM" XX SQ Sequence 4614 BP; 1434 A; 928 C; 898 G; 1354 T; 0 other; tggtagcgag agcttcgatc agctcaatct aactcagtct agtttatgtc aacaaatcca 60 ttcgatcctc ccagtccagc cgattctacc aatacctaca acgagtctga agcttcatct 120 tcagatagaa ccgtcaatca agagttcttc gattcctcaa 
ctttacctga ttctgtcaat 180 cctctcgaac gtgtaccatc cgaagtcatt atgtctacac cagatgaagg tgttgtttcg 240 ctttttggaa acgctatcca aaaatacgga caacaactaa ctactgctct gttcaagttc 300 aaaataacgg acaatcttga ggatggtaac tacgcgtcgt ggagtcgagt tgtttttggc 360 aatttggaca atctggagct tcatcattat atattaacta aagattacaa agatcctgat 420 ttatcggcag ctcagatttt gaaaaccaaa aagattctag tcaattatat cttaaatcat 480 ctagacaaga gtaacaatac tcaagctgta aatcacctta ctgaccctga agattctcta 540 agtctggtct atgatccttt ctctctctgg agttttctga aagaacgtca ttttctcatc 600 aattctcaac gtttagcttc tatctcaaaa actctaagta ctgtttccat acacagaagt 660 gatactattt caaccttcct tgataaattt gaaagtttat ttatcgagtt tactagatat 720 ggaggaaaga tggatgatac tcaatcagct attagactta ttgacgcaat cactacctta 780 ccgccttcaa tcgtcgaatt cattcatgca acggtttctc ctcttactcg aaaagatgtc 840 atcaagtacc ttcgtgatta cgacaccaga caaaactcgt ttacaactga agccaccaga 900 gaagtaaatc acattgaatc aactcctttc atatctgcca attctgccgt tcgatcaacg 960 agagtaatgt gtacagaaac ggtctgtcat ggtcctcatc ctccagacaa gtgctggtcg 1020 aaacctcaaa acttcagaga acgcgacgat tttcttgctc ggcgccgtgc tcgaggtagt 1080 tggaatcatc gaggaagaca gttttcaaca cgtccatcaa attccaactc aagtcaagtg 1140 agaggaatgc aaagagttga tactccttct gctaactctg ttgcagactc aattcaaatg 1200 ctgtcgttga atgccgaatt tgactcgatc actagttcac ctgaagaaac taatcacacg 1260 agttttgaac tcatcacttc cgaacctgaa gcaaactcct tagaatccat caactctgac 1320 atcatctggg cacttcatga cacgggagca acgcaccaca tgtttaatga taaatcacta 1380 ttcgtgaagg acagtcttaa gccagttgaa gataaaaatc gaagattgaa acttgctggg 1440 ggtgatgtat cgttagcggt tgatagtatt ggaaagattc aattgaaggc tggcgataac 1500 tctgtttttg aactcaccga atgtctctac gttcctgaat tgaggaagaa tttaatatca 1560 ggaggaactc tcaagagaaa aggggttaga gaagtgtaca atgattcaga accgacctgt 1620 tttgctttag ttaaagatgg actagctttg tttaatggtg ttatcttaca gaatggtctt 1680 atgaatgtca aaatcactcc tgtaagctca tcagttcaca gagatcaatc ttcaagttca 1740 attaactcat caattgttca ccgtcgtcta ggtcatttaa gcgatcaata cttaaggact 1800 atgaaaaatc acgaaagtgt agatgggatt gatgtcttgg aggataactt tgagtcgtgt 
1860 gaaatctgta gtctctctaa aaacaccaaa cttcccttta atcatactcg tcctcgtgct 1920 attaaattcc tcgaaaatgt acatgtagac ttaagtggaa tttatagagt taatggtttc 1980 aacaatgaaa attattttat cctattttgt gacaattact catgctacag acacatctat 2040 ggattgacca gcaaaactaa agaagaagtt tttgaaacct tcaaatcata cattgcggtt 2100 agtgagcgtc aaactggttg taagcttaaa cagtttaccc tggacagggg ttcagaattt 2160 gtgaacaatc tactcggacc ttatctcatt gagcttggaa ttgaactgca tctaacgtct 2220 ggttatgctc ctgaagaaaa cggtgtttct gaaagaggaa tgcgtattgt caacacaaaa 2280 gccagatgtt tgatgttaca gtcgggttta cctcaaaaat tctggttcct gtcttgtcgt 2340 gctgcagttt ttctaacaaa tcgaacagtc accaaagctc tgaacgactt cagaactcct 2400 ttcgaagttt ggcattttcg aaaacctacc atctcacaca taagaacact cggttgtcaa 2460 gcttttcggc tcattcgaaa agaacttcga gaatcgaagt actctcctgt tagctctgaa 2520 ggagtactga tcggatttga acaagataat ttcaattatt tgatctatga ccttgaagac 2580 aacaaaatac atacttcaca tcatgtcacg tttaatgagg atgtttttcc ctttctcaaa 2640 ttaaaatcac atcaagccac ttctcaaagt ttagtaaaaa tgctttactc cgatgatgaa 2700 gacgaaatga tcgaagtgtg cgaattagct gataaagcaa aggattctga tgactacaga 2760 aatcctgacg ctgatcggtt gaatactgga ccttcatcgt cggaacttga agacaaaatt 2820 cctgaacatg aacctattcc aaaagttgct tcaaagaata agtttgaaaa tttgacaatg 2880 cgaaaatcaa caagaaatca aactcaagtt aattacaaag gaatgtgtaa ttattcggaa 2940 ttcgaagatg aagatttctg tactcaagta ctttttgacg gtttacctca atgctttaat 3000 gctgtactag tctcacaacc cgatccaaag tcgttcaaga agcccatgtt ggcgggtgac 3060 tcccccagct ggaaggctgc ttgcgacaag gacatgtctt cccttctgaa gaaagaagtt 3120 tggacactag ttaaagacca actcacaaac cagccattcc tggaatgtgg atatttcgaa 3180 agaagacaat cctgatggtt cgattaagta caaggctcga tttgtggctt taggcaatac 3240 tcaagtcgaa ggtgaagact acggtgaaac atttgctcct accgggtaaa cctacatccc 3300 ttcgactact cctggctatg gcagctattc acggctggga agtccatcaa atggatgcgg 3360 tgactgcttt cttaaatggc attttagaag aagaaatata tatggaacaa cctgaaggtt 3420 atgttgtagt tggtcaagaa gataaagtac tcaagttgaa tagatcatta tacggcctta 3480 aacaatcacc gaagatctgg caagatgatg tcactcaaag tcttattgct ctctgctttg 3540 
aacaatgcac tattgatcct tgcatttaca tcagatcaaa cgaagaagaa cagttgttta 3600 cagcagtgta cgttcatgtc gatgacatgg ccataactgg aaacgactct atcactttca 3660 aagccgaaat ttcatcaata tgggaaatgg aagatttagg actggcccaa accattgttg 3720 ggatcgaaat aaaccgaaga tcacctaaca tttataccct aactcaaact aaatttgcta 3780 ttactgtcct caaaaggttt gatatgcttg aatcaaaatc agcatccaca ccactcacac 3840 caaatttgaa gttgtatcga gctactgatg aagaatgcaa ggatttctca cttcttggtc 3900 ataactatcg cagtgcagtt ggatctctca tgtatctttc ccagtgtacc cgaccggatc 3960 ttgcgcactc agttggtgtc ttgtctcagc atttggatcg gccaggttta caacactgga 4020 atgctgcact tcaagttttt cgatatttaa aaggaacgat gcatcttggg atagtgtact 4080 ccggtgaaga taacttaact gtgtctggtc aaagaagttt ttcatatcca aactcttact 4140 gtgatgctga ttgggctggt gatcaaaata caagaagatc aactacggga tacatcttta 4200 tcttagcagg gggtgcgttg tcttggaaaa gcaggctaca acctacagtt gctttatcct 4260 cgacagaagc tgagtatcgt gcgattactg aagcaggtca agaattgctg tggttacgga 4320 gaatgatgga attgtttggg tgtaaagatt tgaatccgac tgttcttcaa agtgataatc 4380 aaggtgcaat tcatttgaca aacaaatcga tttttcatgg tagaactaaa cacattgaga 4440 tacagtatca ttggattaga gaaattgtca agaatggtga tcttattctt gaacattgtc 4500 ctacgagtga aatggtagct gatttactta caaaagcttt gggaaaacaa cagtttatca 4560 ggttaaggag caagttaggg atcaaaatgt aactggcatt gtcttgaggg ggtg 4614 // ID FOT1_CA repbase; DNA; FNG; 1804 BP. XX AC AF259783; XX DT 11-MAY-2005 (Rel. 10.05, Created) DT 03-JUL-2007 (Rel. 12.08, Last updated, Version 2) XX DE Candida albicans transposon Cirt1 transposase pseudogene. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW DDE transposase; Cirt1; FOT1_CA. XX NM FOT1_CA. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-1804 RA Fan J. and Chaturvedi V.; RT "Structure, distribution, and evolution of Fot1-like transposable RT genetic elements, Cirt1 and Cirt2, in the human pathogenic RT yeast."; RL Unpublished. 
XX DR Genbank; AF259783; Positions 1 1804. XX CC 40 bp terminal inverted repeats. ORF interrupted by stop codons. XX FH Key Location/Qualifiers FT CDS 116..1696 FT /product="Cirt1ORF" FT /translation="MPKTKEVKQEVDETAIQNAIQDFKSGKFKTIAETARF FT YGLVPNTLRFRYNGGLPRKLAHTKEQLLSPEQEDEIVNWIVDSDADGHGRS FT RSDIIAFAELMLGTSESSPKIHESWYDRFKSRHDEIHTVEGRSISSLRAKA FT VTYEEILKYYRDYDLIVRQHKISHENIFNYDESRFIMGKGKSSRVAVPSYK FT NRTYVQSTEGRDSCTVIEAISMSGEALVPAVIFKGGSLRTGWFKDDAPDWY FT YTASKRGFTTNWLSLC*LKDIFVPQVKEKTSQGKVMLIMDGHGSHKTDEFR FT ETCEKNNIIPMYLPPHSTHLLQPLDLGIFGPVKSRYKTKLSKLAGILEDTP FT VRQRLFIMRYHEARQEKLTKERIIKSWETAGLNPFDPDKVLKSSQLIVKRI FT NDEIDQAEKDRERRAKERLRESEPSLEKLTKSQMIEKYTKDVKMLKSRICF FT LEAMNARLQFDLNIATAAAQNKKPEGVSSAIPMDENKGFKQVMGYIKEIRK FT DPKKKRRRKALGDITNTSKGSNSYTNFSGS*" XX SQ Sequence 1804 BP; 639 A; 310 C; 383 G; 472 T; 0 other; tacctcacgc ccggtggaaa attctctcct gggaaactta cgcgcaaaaa taaaaaaatg 60 taaacaatcc tttgaccaca ctcttctcct attcctcttc aatgtgccac cttacatgcc 120 aaaaacaaaa gaagtaaagc aagaagttga tgaaactgcc attcaaaatg ctatccaaga 180 ttttaaaagt gggaaattta aaacaattgc cgaaacagct cggttttatg gtcttgttcc 240 taacacattg aggtttcgtt ataatggagg attaccaagg aaattagccc atacaaaaga 300 gcaactcctt tctcctgagc aagaagatga aatagttaac tggattgttg attcagatgc 360 agatggacat ggacggagtc gtagtgacat aatcgcattt gctgagctta tgttaggtac 420 tagtgaatct tcccctaaaa tccatgagtc ttggtacgat cgcttcaaat caagacacga 480 cgagatccat acagttgaag gaaggagtat atcaagtttg agagcaaaag cagttacgta 540 tgaggagata ctaaaatatt atcgtgatta cgacctgatt gttagacaac ataaaatttc 600 tcatgagaac atatttaatt acgatgaatc taggttcatt atgggtaagg gtaaaagttc 660 aagagtagct gtacccagct acaagaatag aacctatgta caatccacag aggggaggga 720 tagctgtact gttattgagg caattagtat gtctggagaa gctttagttc ctgctgtaat 780 ttttaaagga ggttccctca gaactggctg gtttaaggat gatgctccag actggtatta 840 cacagcatct aaaaggggat ttaccacgaa ttggttatct ttgtgttgat tgaaggatat 900 ttttgttccc caagtaaaag aaaaaacaag tcaaggaaaa gtcatgctta taatggatgg 960 gcatggaagc cacaaaactg atgagttcag ggaaacatgt 
gagaagaata acattattcc 1020 tatgtacttg cctcctcatt ccacgcattt gttgcagcca ttggaccttg gtatatttgg 1080 cccagtaaaa tctcggtata aaacaaaatt gtcaaaattg gccggcattc tagaggacac 1140 tccagttcgg caaaggctat ttataatgag gtaccatgaa gcaagacaag aaaagctaac 1200 aaaggagaga atcatcaaat cttgggaaac ggcaggatta aatcccttcg accctgataa 1260 agttttgaaa tcatctcaac tcatagttaa acgaatcaat gacgaaatag atcaagctga 1320 aaaggataga gagcgtcgtg ctaaagaacg tcttagggaa agcgaacctt ccttagagaa 1380 attgactaaa agccaaatga ttgaaaaata tacaaaagat gttaaaatgc ttaaatccag 1440 aatttgtttc ctagaagcaa tgaacgcacg gttacaattt gatcttaata ttgctacagc 1500 agctgctcag aacaagaaac cagaaggagt gagctccgct ataccaatgg atgagaataa 1560 gggttttaag caggtaatgg gttacatcaa agaaatccgc aaagacccga aaaagaaaag 1620 gagaagaaag gcactagggg atattacgaa tactagtaag ggttccaata gttataccaa 1680 ctttagtggc agttaagtag atagttgtag agttaaccta caatataagt gttttattag 1740 tcgtgtaaag ctttgaaaaa aaaataagtt tcccaggaga gaattttcca ccgggcgtga 1800 ggta 1804 // ID Gypsy-80_MLP-LTR repbase; DNA; FNG; 162 BP. XX AC AECX01001140; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-80_MLP_; KW Gypsy-80_MLP-I; Gypsy-80_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-162 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001140; Positions 224229 224390. XX SQ Sequence 162 BP; 39 A; 40 C; 24 G; 59 T; 0 other; tgttatggtc tatttagtag catatgctta tagagcacgt tgtacctttt atctattatc 60 tggcgcatgc atgcatggag ctttttcctt acgttgcaat ctagttatta caacagcatc 120 acctttccct tctcatctca actagcccat caagtcctaa ca 162 // ID AFLAV_LTR repbase; DNA; FNG; 664 BP. XX AC . 
XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 04-AUG-2008 (Rel. 13.07, Last updated, Version 2) XX DE Aspergillus flavus LTR-retrotransposon AFLAV (LTR portion). XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; AFLAV_LTR. XX NM AFLAV_LTR. XX OS Aspergillus flavus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-664 RA Okubara P.A., Tibbot B.K., Tarun A.S., McAlpin C.E. and Hua S.S.; RT "Partial retrotransposon-like DNA sequence in the genomic clone RT of Aspergillus flavus, pAF28."; RL Mycol Res 107(Pt 7), 841-846 (2003). XX RN [2] RP 1-664 RA Hua S.-S.T., Tarun A.S., Pandey S.N. and Chang P.-K.; RT "AFLAV, a new Tf1/Sushi retrotransposon from Aspergillus RT flavus."; RL Direct Submission to EMBL/GenBank/DDBJ (21-NOV-2003). XX RN [3] RP 1-664 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (14-MAR-2006). XX DR [3] (Consensus) XX CC LTRs differ by 1bp substitution. XX SQ Sequence 664 BP; 183 A; 172 C; 155 G; 154 T; 0 other; tgttacagac ccttgtctag cgcaccttat cgaaggcgga cacacaaggc agaataacga 60 taggatgacg atggccgatc cctgggagga aggcacccga cgaatcgttt atctagtagt 120 tattcgggaa gggctacgaa cgccggtaac aacaccatca cgtgatacta ctcaggacct 180 accccagagg agtatatcgg gggtatacga cgctacgcac cgcaattgga tatactccat 240 cccctacccc taagggagtc acgatgggag tagataactc aagggtatat aaggacattc 300 attcatgtat ttagtttagt tcttcgtgat caatctacat tgttactaca gagttacctt 360 agtatcaaga ccagtctcaa ctcgttcgca ctctctcaac tattaggtat acgtacttaa 420 taggcctggc cctttggact cgagtctagt atcaacggca gtctagtgtc aactgcatcc 480 ctgtgacgac ggcaatctag tatcgacggc agactagtgt cgacggaagt ctcgtaacga 540 ctcccgtctt ccctagaccg caaccaacaa cgacggaggt caaagaacga cttcaggtta 600 gtaacccttg aaacgtggtc cgctaactgg tctcagactt gacgacgaag ccccgggcgt 660 aaca 664 // ID Copia-1_PAD-I repbase; DNA; FNG; 4003 BP. XX AC AEOI01000003; XX DT 24-FEB-2011 (Rel. 
16.02, Created) DT 24-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Pichia angusta genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_PAD_; KW Copia-1_PAD-LTR; Copia-1_PAD-I. XX OS Pichia angusta DL-1 OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; Pichia; OC Pichia angusta. XX RN [1] RP 1-4003 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Pichia angusta genome."; RL Direct Submission to RU (24-FEB-2011). XX DR Genome; AEOI01000003; Positions 924584 920582. XX CC Positions [1413-1703] - Integrase core CC LTRs are 96% similar to each other. XX FH Key Location/Qualifiers FT CDS 1539..3974 FT /product="Copia-1_PAD-I_1p" FT /translation="MNRVITSRARILLSSSGCLESYWPFAVQMATYLVNRE FT PSRAIQFEIPYSCWFHKSPDYSRYHPFGCKAYSLVDSHERPSKFSSVISPL FT IFVGYSKRRTAFQLIDPHTLKLKETDQVRFDDTCFPFKQDTSLDFKPQSSH FT VPIMVPPTYGLITDSHLLTTSIIPKEPVPCSRSPSPSGSDSAPMDPPSTTS FT DLHNASAAPPLGPMLVTIPTTIQDSVSPVLEHLEDGTTSAEVTASSERDST FT DHSTIRLLTAVNAPPTSEMDLHHCIHPVLYPGRRSKRSRLPITKYRRLNNS FT TDHSSALVALPGSDHVPSSVSEALSIPEWKQAMQREFQAQLDNHTWSLVDL FT PSGRRAIGCRWVYTMKPSTSTFPHKARLVAQGFSQIHGIDYNETFSPVIRY FT DSLRLLLALAASHSLSVYQMDVTTAFLNGNLVEELYMRQPPGFISTDHPQR FT VCRLHKSLYGLKQAPLCWNHTIDQVLVQANFTRIKSDYGIYVSHSSPPTYV FT ALYVDDLLILSKSLTQIKSVQSLLSSKFQMKDLGPVTTFLGLNIRQTPQSI FT TLSLSSYLTSILQDFGLDTCNPVTRVSTTTAWISDHDVPLTDPTLFRSMVE FT KLLFAANTARPDITFTVAKLSRYLKQPSQNHLTIAKHVLRYLKGTIDDGIT FT YTRTSSVELKGFCDADWGGILEDRTSTTGYIFMLANGPISWKSKKQNSIAT FT STTEAEYMAMSDAVKELLWLKQLMKELSVLGSYVPILFGDNTSSISLAKHP FT TQHQRTKHIDIRYHFIRDHILKGDLQIEYVDSQSNIADLLTKQLVREKLNS FT LKKKMHLAKI" XX SQ Sequence 4003 BP; 1140 A; 965 C; 727 G; 1171 T; 0 other; ggttatgagc cctgcgagcg ctttttgggc gaccaactca tttaacgatg atgataagct 60 tatgtcagcg gcaaatttta tgcgttggaa aacaaaattc acaaatactt taaattcatt 120 tgaaccatca gttgttgact atattcaaac tggtaatata cctttttccc tcgaacagtg 180 tggaggtaac gaacaagtta ttgaaattca 
agctgcttat gaccaactcc tgttattctg 240 ggtacacacc tcagtgtcac aaccagtaaa gaatctgatt agtcaggatc aaactggtta 300 tcagaaatat catcttctga tcaaatgtta tggacaaata tcctttagac agttgtacga 360 agaatactgt tcaattaact cgtcaaatgg acaagggcct atggaatggg gagcaaaatg 420 tgtcaatcta cttcaatgga atcaagaccc agcagtatgg ttaatcttct ccaaacttag 480 ttctctgcag aatgagaaac tagaacgata tctatttgct gactgcggaa atgaactaac 540 tattcaaaga gctgtcacag caatgcaatt atttgaacat gaaaccaagt ctggattcct 600 ggctactaaa cgttctatcc aaacgacgaa gaagaagaat attatttgtg caagatgtca 660 gaaagctggc cacaaatctc caaactgtag agcaccaact ccagtttcaa aaggaaaccg 720 ggcttaaaca gaaatcactt cacagacttt cgtacaacca atggatatat ttctggaata 780 gcatctggtg tgcaggtgaa cggaaaacga actgttgtcc tatctaaaag acacaagaca 840 gtgacactca aagatgttct ttatgctcca caatgtccat ccaacctgat tgctattaac 900 aaagctgtgc ttgctggtaa ggagattaaa attagtaaca agaaactatg gtgtggtaaa 960 gatttattag gaacttacca ccatcagaac aaactgttca aatatacata caaagtgatg 1020 tcttcctctt cttattcaaa ggcatattaa gcctcagtat cagatcttca tattgtaatg 1080 ggacatccac cgaagtccat ctctgcaaga cttcaaatcc caaatcctcc tcatccatgc 1140 caagattttc ttgctggaag gcctgttaaa tccaaacgaa agatctcatc tacgtctgtg 1200 tcgagacctc tcaaactctt acatgctgat atctgtggac cttttcccca cacaggaata 1260 aatggtgaga aatacttcct ggttattgtt gaccgattta cacatatcac ctcatcatat 1320 tgcttaaagg tcaaatcaga agccacagat cttattcagc aatttataac aaacctgaga 1380 cacaattgtc accccataat tacaaagtta ctaccttgcg tactgataat ggtggcgaat 1440 ttgttaacaa gtctcttgaa acttttcttg catccaaagg catcacgcaa gaactaacag 1500 ttgcccactc cagccatcaa aatggagttg cagagcgtat gaaccgagtc atcacttcaa 1560 gagcccgcat acttctttca agctcaggat gtcttgaatc ctattggccc tttgctgttc 1620 aaatggccac atatcttgtc aacagagaac ccagtagggc tatccaattt gagataccat 1680 attcgtgttg gttccataaa tctcctgatt attctcggta tcatccattt ggatgtaaag 1740 catattcact tgttgattct catgagcgtc catcaaagtt ttcttctgtt atctcgccac 1800 taatctttgt tggatatagt aaacgccgga cagcgttcca gctcattgat ccgcatactt 1860 tgaaattgaa agagactgat caagtgcgtt ttgatgatac ttgtttccca 
ttcaaacaag 1920 atacatcctt agatttcaag cctcagtcct ctcatgtccc aatcatggtt cctcctacct 1980 acggtctaat tacagattcc cacctgctta cgacctctat tattcccaag gaaccagtac 2040 cttgttcgcg atctccttcg ccttctggct ctgactcggc gccaatggat cctccctcta 2100 ctacatctga tttacataat gcatcggctg ctcctcctct gggcccgatg ttggttacta 2160 ttccgacgac tattcaagac tccgtctcgc ctgtcctcga acatctagag gacggtacta 2220 catctgctga ggtaactgct agctctgagc gtgactcgac tgatcattcc acgatacgcc 2280 ttctcaccgc agtgaatgct cctcctacat cagagatgga tctgcatcac tgtattcatc 2340 ctgtactcta tcccgggagg cgttccaagc gttctcgttt accaatcacc aagtatcgtc 2400 gactcaataa ctctactgac cattcatctg cacttgttgc ccttcccggg tctgatcatg 2460 ttccctctag cgtgtcagag gctctctcta ttcctgaatg gaaacaagcc atgcaacgcg 2520 aatttcaggc acaattggac aatcacacat ggtctcttgt tgacctccct tctggccgcc 2580 gtgctattgg ctgtcgctgg gtttacacta tgaaaccgtc tacatccacc tttcctcata 2640 aggcacgact ggtagctcag ggtttcagtc aaatccatgg tattgactac aatgagacgt 2700 tcagtcctgt cattcgatac gacagtcttc gtcttcttct tgcgcttgct gcctctcact 2760 ctctctctgt ttaccaaatg gatgtcacca ctgcttttct caatggcaat cttgtcgaag 2820 aattgtacat gcgacagcct cctgggttta tcagtactga ccatccccaa cgagtctgcc 2880 gtcttcacaa gagtctgtat ggtctcaagc aggcaccttt gtgctggaac cacaccattg 2940 atcaggtcct tgttcaagcg aatttcaccc gcattaaaag tgattatggc atttacgtgt 3000 ctcactcctc tcctcctacc tatgttgccc tatatgtcga tgaccttctt atcttgtcga 3060 agtcgcttac tcagattaag tctgtccaat ctcttttgtc ctccaaattc cagatgaaag 3120 atcttggtcc tgtcaccacg ttcctgggtc tcaatattcg ccagactcct cagtcgatta 3180 ctctgtcttt gtcttcatat ctgacttcca tcctccaaga ctttggtctt gatacctgca 3240 atccggtaac tagggtgtct actacaactg cgtggatatc tgatcatgat gttcctctga 3300 ctgatcctac cctttttcgc agcatggtag agaagttgtt atttgctgca aacacggcta 3360 gaccagacat cacttttact gttgcaaaac ttagcagata tcttaaacag ccaagtcaaa 3420 atcatctcac tattgcgaag catgtgttga gatacttgaa aggaactata gacgatggca 3480 ttacctacac tcgtacatca agtgttgagc taaaaggttt ctgtgatgct gactggggag 3540 gtattttaga agacagaaca tctacaactg gatacatctt catgctagca aatggtccta 
3600 tttcatggaa atccaagaaa caaaactcaa tagcaacatc caccactgaa gctgaatata 3660 tggcaatgag cgatgctgtg aaagaattat tgtggcttaa acaattgatg aaggaattat 3720 ctgttttggg atcatatgtt cccatactgt ttggtgacaa tactagttca atatccttag 3780 caaaacaccc cactcaacat caacgcacaa aacatataga tatacgctat cattttatca 3840 gagatcatat cttaaaggga gacctacaaa ttgaatacgt tgacagccag tcaaatattg 3900 ccgacctact gactaagcaa ttggtcagag agaaacttaa ctccctaaag aagaaaatgc 3960 atcttgcaaa gatttgaccc aatccacgac tatggtaggg tgt 4003 // ID Gypsy-21_MLP-LTR repbase; DNA; FNG; 160 BP. XX AC AECX01001225; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-21_MLP_; KW Gypsy-21_MLP-I; Gypsy-21_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-160 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001225; Positions 102221 102062. XX SQ Sequence 160 BP; 37 A; 38 C; 25 G; 60 T; 0 other; tgttatgagc cctttatgct tatactaagt tatgcttgta ctagtctcag ttgtacttcc 60 aatcactggt cagagatttt ctagtcttta cgttgcaatc tcgttatagt acttgaatac 120 ccttgaagtc ccttgatcag atccctccac gcttctaaca 160 // ID Copia-15_MLP-LTR repbase; DNA; FNG; 475 BP. XX AC AECX01001148; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-15_MLP_; KW Copia-15_MLP-I; Copia-15_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. 
XX RN [1] RP 1-475 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001148; Positions 109541 110015. XX SQ Sequence 475 BP; 118 A; 92 C; 70 G; 195 T; 0 other; tgagatcata atcataaggc ttattaactc atcatgattt ctttgttaca tcaattatac 60 acctgttatt gtaaagatga tatgatgtca taatgtttga tttccttata taaagcttat 120 acgtttcttt acttttagct ttttccttta tcgaactcaa acatactttt gttctcaatc 180 ttctttacca tcagattgtt ctcatcagtt gttcgatcag ttctcagctt cattgagctg 240 ataacttgat tctcaccagg tgtaagttgt gtcgtgttag tgttgtggat attgatgaaa 300 tcgaactcaa acatactttt gttctcaatc ttctttacca tcagattgtt ctcatcagtt 360 gttcgatcag ttctcagctt cattgagctg ataacttgat tctcaccagg tattgttctc 420 atcagttgtt cgatcagttc tcagcttcat tgagctgata acttgattct cacca 475 // ID TCG3_I repbase; DNA; FNG; 4639 BP. XX AC AY673967; XX DT 04-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Candida glabrata transposon LTR-retrotransposon Tcg3 (internal DE portion). XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; TCG3_LTR; internal portion; TCG3_I. XX OS Candida glabrata OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Nakaseomyces; mitosporic Nakaseomyces. XX RN [1] RP 1-4639 RA Neuveglise C.; RT "A LTR-retrotransposon in C. glabrata."; RL Unpublished, 2004. XX DR EMBL/GenBank/DDBJ; AY673967; Positions 965 5603. XX CC LTRs differ by a single bp substitution. 
XX FH Key Location/Qualifiers FT CDS 264..821 FT /product="TCG3_LTR_1p" FT /translation="MRGAVRKFRTLPDKVLINNIRRATDDDIKYDLTKIRE FT DLPSRITIKTFLKAIDNHYERSSKETVRHVNLIDRPKFITAASLRRARDAF FT ENICDEENMTCAIYATKHYLPPRIERQIDDLHWPSPSTILDELDNEIEKRK FT AKEKYFKSNKYKNKFNKRRYNPKRFNNNRRNNNSSGGGFSSNKNTKN" FT CDS 806..4510 FT /product="TCG3_LTR_2p" FT /translation="QKYKKLTNPSQQRYTNFYINSLSVVEDGPHISPSQQI FT AILKHQLLQYDYENKQPINRFEIPIKFCLNKQKTHTCLLDTGASTNLIRSD FT ILAGFSNIKYETITPIKLTVATGEHRYLKHKVQLSFTLSGQQHQQHFYVVP FT TKLKPSIILGAPFINNNPQLFIKILQQGNDSRSTTKATIAETTRGLDHDLR FT NPINEKYLLWVTTSDVSTTELPKELQDRYADLIVDDLPPTSTTTTVKHEIE FT LVPGTAPIAKRAYRLSPVKRAELEQQIKELISSGRISPSDSPFAAPVLFVK FT KKDGSSRLCVDYRGLNNATVKSKFPLPLIEDVLDSLHGAKIFSKLDLISGY FT HQVSVNEPDRYKTSFITHEGQYQWNVMPFGLTNAPATFQRLMNAVLRPYIS FT KFCVVYLDDILIYSKTREEHLHHISQVLDKLRKHSLYPKKSKCHFMLTQVQ FT FLGHVINANGISTDPEKINAIKQWPILRNYKDAQRFLGLANYYRRFIKNFS FT KMASPLYEFAAKKNTKWTTECHNAFISLKDALISAPILIAFDPKSPYQLTM FT TVDASDNCIGATLEYKDGRKPKGVIAYLSHKLHSYETRWHIRDKELYAIVF FT ALKKWTHYVQGSHVIIYTDHKTNVNLNRLALLSPRLARWAEVLANYDFEIK FT YIPGPRNHADILSRPPGEPEITDSTDNDLIEIDSDHTDVNNKQGNSPETQH FT LAKVFFDPRPIKDLKNVTIETINAEVLKDPYYENIAQYIRDPNRPIPSKYF FT ELVRRFRYFGDILVYQDKQDYVVYRVCIPKTLVHEIIREHHDTPLFGHRGA FT DTTYKHLLKGFYWPNMLDSIKQYIKRCPECRKGKHPTTHPYGPLHSHPIPN FT QRWSEISIDFIDAFKTSGDNKYDRILTIVDNFTKMAHFIPCRKKDFTAEML FT TDIFIQNIIKLHGRPLRILSDNDVLYTATAWTAVMNRLQIHRDYTTVASSS FT SNSFAERTNRTIEEILNTLVAHSPTKWSTYLPLAEFAYNNSYQATIDTTPF FT FANQGYHPLAINYYMPMQGDNAPYKLTDHPTIDEHQEAQEALYLKIKQKIA FT DQNAAKALKNNVNYQTHAYQPGDQVVVHQSVYKPPKRDEYNQLKQISKKPH FT YVWYGPFTIKAKKTDQTYLVRTHNRLDGNLKEFHIRHMRPYYKKLLPKRNV FT PPIDTAKADSVAHESTNVINVDWKHNDWRMAAQWAHCEPWDYSIYDQKDKP FT LLSHLEQRMDMFDTTISLKQALARL" XX SQ Sequence 4639 BP; 1621 A; 1137 C; 743 G; 1138 T; 0 other; tggtagcgcc cgcatctaat aggaatactc gtcctgctga agaaccagaa aggacctccc 60 gtcctgctga ggaaccagaa aggaccaccc gtccttctga aacccccgac tcagattatg 120 actcatcatt ggatctttac ccacctgatt cagaccttct cgaagaagaa agcaaactac 180 gccattattt gaaagacgaa gtcgaacacc tcacaaaaaa 
gatagcccgc agcaatgcca 240 gacgtaagga atggttgaca gacatgcgcg gtgcggttag aaaattccgc accttaccag 300 acaaagtctt aatcaataac attagacgtg ctacggacga tgatataaaa tacgatttga 360 ctaaaataag ggaggaccta ccttcgcgta ttaccatcaa gacattcttg aaggccattg 420 ataaccacta tgagagaagc tcaaaagaaa ctgttaggca tgtcaattta attgatcgcc 480 caaaatttat cacagccgcc tcgctaagac gtgctcgcga tgcatttgaa aacatctgtg 540 atgaagaaaa catgacttgt gccatttatg ccacaaaaca ttatttacct cctagaattg 600 aacgtcaaat cgacgacctt cattggccat ctccttccac cattctcgat gaattggata 660 acgaaatcga gaagcgcaaa gctaaagaaa aatatttcaa atcgaataag tataagaata 720 agtttaacaa acgtaggtac aatcctaaac gtttcaacaa taaccgcaga aacaataata 780 gttcaggcgg aggattttcc tctaacaaaa atacaaaaaa ctaactaacc catctcaaca 840 acgttatact aatttttata tcaattctct atctgttgtt gaagatggtc cacacatttc 900 tccatcccaa caaattgcta tcctaaaaca tcaactatta caatatgact atgaaaataa 960 acaacctatc aatcgcttcg aaattccaat taaattttgt cttaacaaac aaaaaacaca 1020 tacgtgtcta ctagacacag gcgcctcaac taatttaatc cgttccgaca tactcgccgg 1080 attttcaaac ataaaatatg aaacaataac cccaattaaa ctaactgttg ccactggaga 1140 acacagatac ttaaagcata aagttcaact aagctttacc ttatcgggcc agcaacacca 1200 acaacacttc tatgtcgtcc ctacaaaact caaaccttcc ataatacttg gagcaccatt 1260 tattaataac aacccccaac ttttcataaa aattctgcaa cagggtaatg attctagatc 1320 tacaactaag gcaacaatag cagaaacgac acgcggattg gaccatgatt taagaaatcc 1380 aattaatgag aaatacttac tatgggttac cacgtccgac gtttctacta ccgaattacc 1440 aaaagaacta caagaccgat atgcagacct gatcgttgac gatttacctc ccacatccac 1500 tactacaact gtcaaacatg agatcgaact agtacctggt acggcaccta tcgctaaacg 1560 cgcatacaga ctttcaccag tcaaaagggc cgaactcgaa caacaaatca aggaacttat 1620 cagttcagga agaatatctc catcagactc accattcgcc gccccagtac tttttgtcaa 1680 aaagaaagat ggctccagcc gtctatgcgt tgactatcgt ggactaaaca atgccacagt 1740 taaatcaaag ttccctttgc cccttattga agatgtatta gacagtctgc atggagccaa 1800 gatattcagt aaacttgatc tcatttcagg ataccatcaa gtatcagtca atgaacccga 1860 ccgttacaaa acttccttca taacacacga aggacaatac caatggaacg ttatgccctt 
1920 tggccttacc aatgccccag ccacatttca acgactaatg aacgcagttc ttcgcccata 1980 tatatcgaaa ttttgtgttg tctacctcga tgatatctta atctactcaa aaaccagaga 2040 agagcattta caccacattt cccaagtttt ggacaaacta cgtaaacaca gcctttaccc 2100 taaaaagagt aaatgccact ttatgttgac tcaagtccaa ttcctaggac acgtgatcaa 2160 tgcaaatggg attagcacag acccagaaaa gataaacgct atcaagcaat ggccaatatt 2220 acgtaactat aaagatgccc aaagattcct aggccttgcc aattattatc gtcgcttcat 2280 caaaaacttt tcgaaaatgg cttcaccact ctatgagttt gccgcgaaga aaaatacgaa 2340 atggacaacc gaatgtcaca acgcttttat tagtttaaaa gatgctttga tttctgcacc 2400 aattctaata gcatttgacc ctaaatcacc ttaccagctt actatgacag tcgatgcttc 2460 agacaactgc atcggtgcaa cattagaata caaagatggc agaaaaccaa aaggtgtcat 2520 agcataccta agtcataagc ttcattcata tgaaacgaga tggcacatcc gagacaaaga 2580 attatatgcc attgtgtttg cactcaaaaa atggacacac tatgtccaag gtagccatgt 2640 tattatctac acagatcaca aaaccaacgt taaccttaac cgattagcat tactcagtcc 2700 ccgcttagca cgatgggctg aagtcttggc taattacgat ttcgagatta aatatatacc 2760 cggtccccga aaccatgccg acattttatc acgccctcct ggcgaacctg aaattacgga 2820 ctcaactgac aatgacctta tcgaaattga tagcgatcac actgacgtaa ataacaaaca 2880 aggaaacagt cctgaaaccc agcacctagc aaaagtcttt ttcgatccta gaccgataaa 2940 agacttaaag aacgtcacga tcgaaactat caacgccgaa gtattaaaag atccttatta 3000 tgaaaatata gcacagtaca ttcgcgaccc taaccgtccc ataccaagca aatacttcga 3060 actagtcaga cgtttcagat atttcggtga tatactagta taccaggata aacaagatta 3120 tgtcgtctac agagtttgta taccgaaaac acttgtgcat gagatcattc gagaacatca 3180 tgacaccccg ttattcggtc atcgtggtgc cgataccaca tataaacatc tcttaaaagg 3240 attctactgg cctaatatgc tcgactccat caaacaatac atcaaacgtt gtcccgaatg 3300 tcgcaaaggt aaacacccta cgacacatcc atatggccca ttacactcac accctatacc 3360 aaatcagcga tggtcagaaa tttcaatcga ttttatcgat gcttttaaaa caagcggtga 3420 taataaatat gatcgtatct taacaatcgt tgacaatttc acaaaaatgg cacatttcat 3480 accatgcaga aagaaagatt ttacagctga aatgttgaca gatatattca ttcagaatat 3540 cataaaatta cacggtaggc ctctcagaat actatcagat aacgatgttt tatacaccgc 3600 
cacagcctgg acagctgtca tgaacagact acaaattcat cgtgattata cgacagttgc 3660 gtcttcctca agcaatagtt ttgccgaacg cacaaaccgc acaatcgaag aaatacttaa 3720 cacactagtg gcacattcgc caacaaaatg gtcaacctac ttaccactag ccgaattcgc 3780 ctataacaac tcgtatcaag caactatcga taccacacca tttttcgcta accaaggata 3840 tcatccttta gccatcaatt actatatgcc aatgcaaggc gataatgcac catacaaact 3900 tacagatcac cctacgattg acgaacatca agaggctcaa gaggctctat atttgaaaat 3960 caaacagaaa atagctgatc agaacgccgc gaaagccctt aaaaacaacg tcaactatca 4020 aacacatgct tatcagccgg gagatcaagt ggtagtgcat caatctgtct acaaacctcc 4080 taagcgagat gagtacaatc agttaaaaca aatcagtaaa aagccacact acgtgtggta 4140 cggtccattc acaatcaaag cgaaaaagac tgaccaaaca taccttgtcc gaacacacaa 4200 ccgcctcgac ggaaacttaa aagaatttca catccgacac atgagaccat actataaaaa 4260 gctcctacct aagcggaacg tccctccaat cgatactgcc aaagccgatt ccgtggccca 4320 tgaatccacg aacgttatca atgtcgactg gaaacataac gattggcgca tggccgcgca 4380 atgggcccac tgcgaacctt gggactactc aatttatgat caaaaagata aaccgctttt 4440 atcacaccta gagcaaagga tggacatgtt tgatacaacc atttccctca aacaagctct 4500 agcccgatta taacctttag tttaattact tgctaacgaa tcattgtttc ataacctacg 4560 cgccgtatat tgctaatcaa taacctgtat tatcttcctc ctgtacgctg tgggaccagc 4620 gcatttttaa acgggggcg 4639 // ID TY3 repbase; DNA; FNG; 5510 BP. XX AC M34549; XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 11-NOV-1996 (Rel. 1.1, Last updated, Version 1) XX DE S.cerevisiae Ty3-1 retrotransposon integrase gene, complete cds, DE and Cys-tRNA gene. XX KW Gypsy; LTR Retrotransposon; Transposable Element; TY3; integrase; KW transfer RNA-Cys; transposon. XX OS Saccharomyces cerevisiae OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. XX RN [1] RP 1-5510 RA Hansen J.L. and Sandmeyer B.S.; RT "Characterization of a transpositionally active Ty3 element and RT identification of the Ty3 integrase protein."; RL J. Virol 64(6), 2599-2607 (1990). XX DR GenBank; M34549; Positions 1 5510. 
XX SQ Sequence 5510 BP; 1955 A; 1306 C; 919 G; 1330 T; 0 other; aactttcatg gaaggaccac ctagttaata aaaagctcgc actcaggatc gaactaagga 60 ccaacagatt tgcaatctgc tgcgctacca ctgcgccata cgagcttgat tttctgaaag 120 tgttgtatct caaaatgaga tatgtcagta tgacaatacg tcaccctgaa cgttcataaa 180 acacatatga aacaacctta taacaaaacg aacaacatga gacaaaaccc gaccttccct 240 agctgaacta cccaaagtat aaatgcctga acaattagtt tagatccgag attccgcgct 300 tccaccactt agtatgattc atattttata taatatataa gataagtaac attccgtgaa 360 ttaatctgat aaactgtttt gacaactggt tacttcccta agactgttta tattaggatt 420 gtcaagacac tccggtatta ctcgagcccg taatacaaca cctggtagcg ttaaaggtta 480 ctaattgttc aaacgaacca tcgaaaagcc gaacctagct acaccacacc ccagtatgag 540 ctttatggat caaatcccag gaggaggaaa ttatccaaaa ctcccagtag aatgccttcc 600 taacttcccg atccaaccat ctttgacctt cagaggtaga aatgactcgc ataaactgaa 660 aaactttatc tccgaaataa tgttaaacat gtctatgata tcttggccga atgatgccag 720 tcgtattgtg tactgcagaa gacatttatt aaaccccgct gctcagtggg ctaatgactt 780 tgtacaagaa caaggtatac ttgaaataac attcgacaca ttcatacaag gattatatca 840 gcatttctat aagccaccag atatcaataa aatctttaat gcaatcacgc aactttccga 900 agctaaactt ggtattgagc gtctcaacca acgattcaga aagatttggg acagaatgcc 960 accagacttc atgaccgaaa aagctgccat aatgacatat actaggctat tgacaaagga 1020 aacctataat attgtcagaa tgcacaaacc agagacatta aaagacgcca tggaagaggc 1080 ttaccagaca actgcactaa ctgaaagatt cttcccagga ttcgaacttg atgctgatgg 1140 agacactatc atcggtgcca caacccactt acaagaagaa tacgactctg actatgattc 1200 agaagataat ctgacccaga atggatacgt ccataccgta aggacaagaa gatcttacaa 1260 taaaccaatg tcaaatcatc gaaacaggag aaataacaac ccatctagag aagaatgtat 1320 aaaaaatcgg ctatgcttct attgtaagaa agagggacat cgcctgaacg aatgtagagc 1380 acgtaaggcg agttctaacc gatcttgaac tcgaatcaaa agaccaacaa actcctttta 1440 tcaaaacctt accaattgta cactatatcg ccatccccga gatggacaat accgccgaaa 1500 aaaccataaa aatacaaaac acgaaagtaa aaaccctgtt tgacagtgga tcacccacgt 1560 catttatccg aagagatatt gtagaacttc tcaaatacga aatctacgag acccctccac 1620 tccgttttag aggattcgta gccaccaaat 
ccgccgttac atccgaagca gtcaccattg 1680 acctcaaaat caatgacctg catataactt tagccgcgta catactggat aacatggact 1740 accaattgtt aattggaaat ccaatcttac gccgctaccc gaaaatcctg cacacagtac 1800 tgaataccag agagagcccc gactccttaa agcccaagac ttatcgctcc gaaaccgtta 1860 ataacgttag aacctactcc gctggtaatc gtggtaaccc cagaaacata aaactgtctt 1920 ttgcccccac cattctcgaa gcaactgacc cgaaatccgc tggtaatcgt ggtgactcca 1980 gaaccaaaac cctgtctctt gcaaccacta ctcctgcagc aattgacccg cttacgaccc 2040 ttgataaccc aggtagtact caaagtacat ttgcgcaatt cccgatacct gaagaagcga 2100 gcatcctaga agaggatgga aaatactcca acgttgtctc aaccattcag agtgtagaac 2160 ctaatgctac tgatcacagc aataaggaca ccttttgcac tttgccagtt tggttacaac 2220 agaagtatag agagatcata cgtaatgatc tcccaccaag acctgccgac attaataaca 2280 tccccgtaaa acatgatatt gaaattaaac ctggcgcaag actacctcga ctacagccat 2340 accatgttac agaaaagaac gaacaagaaa tcaacaaaat agttcaaaaa ctgctcgata 2400 acaagttcat tgttccctca aagtcgcctt gcagctcccc tgtagtcctc gtcccgaaga 2460 aagacggtac cttccgactc tgcgtcgatt accgcaccct gaacaaagct accatctccg 2520 acccattccc attacccaga atcgacaacc tattgagccg tattggaaat gcccagatat 2580 ttaccacgct agatttgcat agtggttacc accagatccc gatggaaccc aaagaccgct 2640 acaaaaccgc ctttgtcaca ccatccggta agtatgaata taccgtcatg ccatttggct 2700 tagtcaatgc acctagtaca ttcgcaagat acatggctga tacatttaga gacctgagat 2760 tcgtcaatgt ttaccttgat gatatattaa tattctccga atctccagaa gaacattgga 2820 aacatttaga cacggtacta gaaagattaa agaacgagaa cctcattgtt aagaagaaaa 2880 aatgtaaatt tgcatctgaa gaaactgagt ttttaggcta tagtattgga atccagaaaa 2940 tagctccact acagcacaaa tgtgcagcaa tccgagactt tccgacgcct aaaacagtaa 3000 aacaagcaca gagattttta ggaatgatta attactacag acgattcatt ccaaattgct 3060 ccaagattgc acagccaatc caactgttta tttgtgacaa aagtcaatgg acagaaaaac 3120 aagacaaggc aattgataaa ctaaaagacg ccttgtgtaa ctcccccgtc ctagtaccat 3180 tcaacaacaa agcaaactac cgacttacaa cagacgcctc aaaagacggc attggtgctg 3240 ttctagaaga agtcgacaac aagaacaaac ttgttggtgt cgtcggttac ttctctaaat 3300 ccttagagag tgcccagaaa aactatcctg ctggcgaatt 
agaactactt ggaattatca 3360 aagcactcca ccacttccga tatatgcttc acggaaagca tttcacgtta agaacagacc 3420 acattagttt gttatcatta caaaacaaga acgaacccgc acgacgcgtg caacgctggt 3480 tagatgacct agccacatat gacttcacct tagaatacct agctggaccc aagaacgttg 3540 tcgcagatgc catatcccgt gccgtatata ctataacccc cgaaacatcc cgacctatcg 3600 acacagaaag ctggaaatct tactacaaat cagacccatt atgtagtgct gtcttaattc 3660 atatgaaaga attgacacaa cacaacgtca cacctgaaga tatgtcagcc ttccgtagtt 3720 accagaagaa actcgaacta tcagagacct tccgaaagaa ttattcccta gaagacgaaa 3780 tgatctatta ccaagaccga ctagtagtac caataaaaca acagaacgca gttatgagac 3840 tatatcatga ccatacctta tttggaggac attttggtgt aacagtgacc cttgcgaaaa 3900 tcagcccaat ttactattgg ccaaaattac aacattcgat catacaatac atcaggacct 3960 gcgtacaatg tcaactaata aaatcacacc gaccacgctt acatggacta ttacaaccac 4020 tccctatagc agaaggaaga tggcttgata tatcaatgga ttttgtgaca ggattacccc 4080 cgacatcaaa taacttgaat atgatcctcg tcgtagttga tcgtttttcg aaacgcgctc 4140 acttcatagc tacaaggaaa accttagacg caacacaact aatagatcta ctctttcgat 4200 acattttttc atatcatggt tttcccagga caataaccag tgatagagat gtccgtatga 4260 ccgccgacaa atatcaagaa ctcacgaaaa gactaggaat aaaatcgaca atgtcttccg 4320 cgaaccaccc ccaaacagat ggacaatccg aacgaacgat acagacatta aacaggttac 4380 taagagccta tgcttcaacc aatattcaga attggcatgt atatttacca caaatcgaat 4440 ttgtttacaa ttctacacct actagaacac ttggaaaatc accatttgaa attgatttag 4500 gatatttacc gaatacccct gctattaagt cagatgacga agtcaacgca agaagtttta 4560 ctgccgtaga acttgccaaa cacctcaaag cccttaccat ccaaacgaag gaacagctag 4620 aacacgctca aatcgaaatg gaaactaata acaatcaaag acgtaaaccc ttattgttaa 4680 acataggaga tcacgtatta gtgcatagag atgcatactt caagaaaggt gcttatatga 4740 aagtacaaca aatatacgtc ggaccatttc gagttgtcaa gaaaataaac gataacgcct 4800 acgaactaga tttaaactct cacaagaaaa agcacagagt tattaatgta caattcctga 4860 aaaagtttgt ataccgtcca gacgcgtacc caaagaataa accaatcagc tccactgaaa 4920 gaattaagag agcacacgaa gttactgcac tcataggaat agatactaca cacaaaactt 4980 acttatgtca catgcaagat gtagacccaa cactttcagt agaatactca 
gaagctgaat 5040 tttgccaaat tcccgaaaga acacgaagat caatattagc caactttaga caactctacg 5100 aaacacaaga caaccctgag agagaggaag atgttgtatc tcaaaatgag atatgtcagt 5160 atgacaatac gtcaccctga acgttcataa aacacatatg aaacaacctt ataacaaaac 5220 gaacaacatg agacaaaacc cgaccttccc tagctgaact acccaaagta taaatgcctg 5280 aacaattagt ttagatccga gattccgcgc ttccaccact tagtatgatt catattttat 5340 ataatatata agataagtaa cattccgtga attaatctga taaactgttt tgacaactgg 5400 ttacttccct aagactgttt atattaggat tgtcaagaca ctccggtatt actcgagccc 5460 gtaatacaac agaaagttcc attttggatg ctctatttat gggaatatga 5510 // ID Helitron-N1_AN repbase; DNA; FNG; 739 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 8.11, Last updated, Version 1) XX DE Nonautonomous rolling-circle Helitron DNA transposon - a DE consensus sequence. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW HELITRON superfamily; Helitron-1_AN; Helitron-N1_AN; KW Nonautonomous rolling-circle DNA transposon. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-739 RA Kapitonov V.V. and Jurka J.; RT "Helitron-N1_AN, a family of nonautonomous Helitrons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(11), 191-191 (2003). XX DR [1] (Consensus) XX CC Nonautonomous rolling-circle Helitron DNA transposon. 
XX SQ Sequence 739 BP; 213 A; 189 C; 135 G; 202 T; 0 other; ttaacataac tatacgaact ttaccctctg actccaccgc agtcacgtgt gatccactaa 60 attcgcagaa tgcaatatac acccatgatg tatcctacca ttacgttttg gcatctatag 120 cagctaactt gctgttgctt gcccttacta ataatataat ttataatctc taagtacagg 180 caattagaat cacctatgat tatagcatat tctttacgag cttctctacc taacaagttc 240 ataattgcta ttatatatat ttggatttgc tgtgacagtt agcagggcgc cggaaacagc 300 atatgacgct ctgggaactg gttttcttat ataaaactaa ttattctccg atccaaatcg 360 atcagataag ccgcacgata gtgataccga taccgatccc gtaccgataa cgtgcgtcgg 420 cggatcccgc agcgggactg cccggccgac tgttacacaa aagtatctat aaaagcgata 480 gacaccaaca gggacccaga ttccaaatag gctgtctata tataaagatt tacacaatgc 540 cagccatcac tgttaagcca ctaacaccgc cagccgggtc tgcaatcgac ttcggtgccg 600 tcattacaga tgttgacctg gagcatttga ctggtacctc tctccctcga gcaccctttg 660 atttattgtg tacaggtact aaatgcaatg acagacggag acttctccac gatacgctca 720 gccctgtaca cacaccttg 739 // ID Copia-1_ATe-LTR repbase; DNA; FNG; 412 BP. XX AC AAJN01000194; XX DT 12-MAR-2011 (Rel. 16.03, Created) DT 12-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Aspergillus terreus genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_ATe_; KW Copia-1_ATe-I; Copia-1_ATe-LTR. XX OS Aspergillus terreus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-412 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Aspergillus terreus genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AAJN01000194; Positions 1798 1387. 
XX SQ Sequence 412 BP; 98 A; 91 C; 82 G; 141 T; 0 other; tgtcagacag tagaggtgcc tgcctcaggc agccatttag tcagcagaat caggcatctg 60 tttcaggcag ccattattgt ctaggcaacc accagattga ttctattgat ttctattgtt 120 ttgatttgca ttgttctgat ctattctttc tgattctatt gcgtcatctg cttgcgttcc 180 tacgtcattc ctgattttga ttgcctgctt ctagaagact ctaggcctta ggaccatact 240 ataaataggg ccttctgacc catggttcag ggctagagtc aaccagagtt gtttactaga 300 ttcaattaat catctagttc aattgtcaat ttccttagct ctcattggtt catattatct 360 agtcaggtca atagggactt cccaattagg aaaccctaga gggcctaagg ca 412 // ID Copia-15_MLP-I repbase; DNA; FNG; 4531 BP. XX AC AECX01001148; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-15_MLP_; KW Copia-15_MLP-LTR; Copia-15_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4531 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001148; Positions 110016 114546. XX CC Positions [1565-2065] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1247..3688 FT /product="Copia-15_MLP-I_1p" FT /translation="MGCLLEDGYNLSHDKGVILMTKEGTGCLTGQVRNRVI FT ELSIAIGTSYSATCNSVTLSDYTTLHRQSGHTNPVRLNVMYGVRSPDDWFC FT EECAISKGHRLSYSNSLPKATFPLEVVHSDLSGKISSVSEGGCHYYFKLTD FT AYTNYKYVYIMKSKDETFSYFMKFKAEVETFHNRKIVSLVNDGGGEYMSKV FT FLKMLSENGTTSYISAPYTPQQNPVAERGNRTMTEKARTLLKHSNLPQSFW FT GEAVATAVFLENVTPMTKIDNQMPYKLWFNRPFDVNCLKPFGCRAYVLIPK FT KFRRKFDDTSFKGILLGYQLGMKNYRVRLENGKTVYSHDVTFNVNSFPGVG FT NNEEQGEGFVEVFNDNIVTSSSESEAPLLNLPSSGVLESATSPVLPFEPVI FT PPMPSSRMIEPALPPTDQLVHVPSGVPMQVEVVPDLPVPPIIPMELDSPPS FT TPPGQLVIRPSQQNLQIQLSPVNPVVLQKRPGWEWQPLSIPAPQDISSAIE FT ESNILEPGVKRHRANLAQRQRAHTARKLRAKAIRLQVKNYEDEHKLIDSSM FT ILRTAFTPGFHLTQRAMKTTAVAGVSIDPKTYLQALKSDNQFDWKKAIDNE FT LENMERRKVWEVCDLPAGKNAVGTTWVFKKKMGGDGELIKFKAQLCAQGFS FT QEYGEDYNETFAPTGRIATLRAINAMAALENLDVYQMDAVAAFLNGSPKET FT IYMKIPKGLTVNGATLTSVLKLKQALYGLKQAPKVWYDCLRAFFISIQLIP FT STSDPSLFISKDPTWKCIVHVHVDDLTIATDDFQRFRSLITQKFQMDEDVE FT YKYILGMKVTQN" XX SQ Sequence 4531 BP; 1398 A; 890 C; 960 G; 1283 T; 0 other; ggttatatcc actttaagtg gttatgagcc cagcgctctt gcgactagct accgccatca 60 tcatcttcat atctttgaac ttcattaaga agtctatcga acacaaacaa attcaaaaac 120 ctacgcatac atccaacttt gtcatgtcaa gtgataccaa gattcacatc gttaaactta 180 acaatgacaa ctatacggag tggaaaggcg acattacagg ttacctgatg tctcacggac 240 tccaggaatt tcttgttcct aatcatactc ctgttgctca agctattgca ggagaggaaa 300 agattaactt tgaagcttat gtcacccgtc aaaacaaagc cgccggtatt atgtacagtt 360 acttgactga accttttcga attcaaattg aatcagaagg gctcttatcg aatccagtcg 420 gtatttggaa gcacttgaaa gagaagtttc aatcgactag tgccgatagt caaggacgag 480 cttgccgaaa ctttctatga attcctttcg ttacccttgc tcaatatatc aaggatgtcc 540 gtaaaggaat gtcgttaatg gaagcatgcg gatgtgcaac gacgaatcca attcttgaac 600 cgctgttgtg tgaaggaatc atcttcaaac tgccggattc catggagact gttgtgtcac 660 tcatcacggc caaacaatca gagtcaggca aactttcctg taagacagtc ttgaccatgt 720 tagatgctca tctagtggat ttcaccgaac aacatcgaga ggacagttca atcgctctga 780 tgacgaccac cactgctgct cctgcttgat attcatatcc 
ttgatgtacc aacggtaaac 840 ataatccagc cgtaactaat catctgccgg acgcatgttg ggtggaaagg ccacatttac 900 ggcaaccacc acgtcaacct aattctcgtt cacaaacatc acgaactaga gcgacaacag 960 caatacttca acgattgtat cttgatgatg ttattgaaac ctcagcttta tcatgccttt 1020 ctaaatctcc gggtgctgct attttattag atagtgcgtg ctctgatcac atggtgtcag 1080 atcaaaaatt atttaaaaat tatgaatcta tggcatcgga agtctgctta gctgacggta 1140 actcaattca cattgccggc aaaggaggat tgtcattaca tctttcggga ccaaactggt 1200 tcttgtcgcc tatcacgttc ccgaacttgg aaactccctc atcagtatgg gttgtttgtt 1260 agaagacggg tataatcttt ctcatgataa aggagtcatt ctaatgacaa aagaaggtac 1320 cggttgtctt actggccaag tccgaaatcg agtcatcgaa ctcagcatcg ccattggtac 1380 gtcatattcc gctacttgca acagtgtcac gttatctgat tatactacgc ttcacagaca 1440 atccggacat acaaatccag tccgattgaa tgtaatgtat ggtgtgagat ctccggatga 1500 ctggttttgt gaggagtgtg caatttcaaa aggacatcga ttatcttatt caaactctct 1560 tcctaaagcg acttttccat tagaagttgt gcatagtgat cttagtggca agatatcatc 1620 tgtctctgag ggaggttgtc attattattt taaacttact gatgcttaca ctaattataa 1680 gtacgtttac atcatgaaat ctaaagatga gacgttttcg tactttatga agtttaaagc 1740 ggaagtggaa acttttcata atcgaaagat agtgagttta gttaatgatg gaggcggaga 1800 atatatgtca aaagtttttc tcaaaatgct ctctgagaat gggacaactt cctacataag 1860 cgccccttat acacctcaac aaaatcctgt tgctgaacgt ggcaatagga ctatgactga 1920 gaaagccaga acattgctta agcattcaaa tttgcctcaa tcattttggg gagaagccgt 1980 tgcaactgct gttttcctag agaatgttac tccgatgacg aagattgaca atcaaatgcc 2040 ttacaaattg tggtttaatc gaccatttga cgtaaattgt ttgaaacctt tcggttgtcg 2100 tgcttatgtc ctaattccta agaaatttcg aaggaaattt gatgacactt catttaaagg 2160 aattttactt ggttatcaat taggcatgaa gaactataga gttaggcttg aaaatggcaa 2220 gacggtttat tcacatgacg ttacatttaa tgtcaattct tttcccggag tcggcaacaa 2280 tgaagagcaa ggtgaaggtt ttgtggaagt gttcaacgat aacatagtca cttcttcctc 2340 tgaatcagaa gcgcctttac tgaatctacc gtcctcaggg gttttagaat cagcaacttc 2400 accagttcta ccattcgagc ctgtgattcc accgatgcca tcgtcaagaa tgatagaacc 2460 agcattgccg ccaacagatc aattggtgca tgtaccttca ggagttccta 
tgcaggttga 2520 agtggtacca gatctaccgg tgccaccaat aattccaatg gaactagatt ctccaccgtc 2580 gacacctcca ggtcaacttg tcatacgtcc ttctcaacaa aacttgcaga ttcagttgtc 2640 acccgttaat ccggttgttc ttcagaaacg accaggatgg gagtggcagc cactgtcgat 2700 tcctgcgcca caggacattt ctagcgccat cgaagagtcc aacattctgg aaccaggcgt 2760 gaaaagacat cgagctaatc ttgctcaacg tcaacgtgct cacactgctc gaaaattacg 2820 agctaaggca attcgtcttc aagtcaagaa ttatgaagat gaacacaaat tgattgattc 2880 tagcatgatt cttcgaacgg cctttacacc cggttttcat ctaactcaac gtgcgatgaa 2940 aactacagct gtggcaggtg tgagtataga tccaaagact tacctgcagg cgttgaaatc 3000 cgacaatcag tttgattgga agaaggcaat tgacaatgag ctggaaaaca tggaacgacg 3060 aaaggtgtgg gaagtgtgtg atttgccagc gggaaagaat gctgtgggaa ccacgtgggt 3120 atttaagaag aagatggggg gtgatggtga acttattaaa tttaaagccc aattgtgtgc 3180 acaaggcttc tcacaagaat atggagaaga ttacaacgaa acctttgcac cgacaggacg 3240 aatagccaca ttaagagcaa tcaacgccat ggctgctttg gaaaatttag atgtgtatca 3300 gatggacgct gtggccgcat ttctgaatgg aagtcctaag gaaacaattt acatgaaaat 3360 tccaaaagga ttgactgtga atggtgctac tttaacatca gttttaaaat taaagcaagc 3420 attgtacggt ttaaaacaag cgccgaaagt gtggtatgat tgcttaagag catttttcat 3480 ttcaattcaa ctcattcctt ctacctcgga tcctagtcta ttcatttcaa aggatcctac 3540 gtggaaatgc attgtgcacg tacatgttga tgatctaaca attgcgacag atgattttca 3600 acgatttcga agtctcatca ctcaaaagtt ccaaatggat gaagatgtag aatataagta 3660 cattttagga atgaaggtga cacaaaattg agaagaaaga acaatcactc tgacgcaatc 3720 tcaatatatt caaaatctat tagaggagta caacatggcg gattgtaaat cagtagggtg 3780 tccaatgatt cctggaagtt atttactgcc aggatcagat gatgaagtta gagaatttaa 3840 ggatttagat gaaaattatc ggcacggtgt tggaaaacta atgtatctta ataatgctac 3900 gcggccggac atttcctttg tagtctctca attgtctcag catctaaaca atccttcaat 3960 tcttcattgg caagccttca agcgagtatt aagatattca aaagggacga agacagttgg 4020 aatagtatta gggggaaaag atttagaatt gaaaggatat gcagacgctg atttttcttc 4080 atgtccgtat ataagaagat caacaggagg atactgtaca atggttggtg atagcacggt 4140 taattggaag agtaagaagc aggaaaatgt agctacgtca acaactgaag ctgagtatcg 
4200 gtctgcgtat gaaggtggcc aggatttggt gtggtttcga catttattaa acaatttatc 4260 aattaagcaa cactctgctc cggttttgaa tttagataac cagggggcaa ttgcattgac 4320 caagaatgaa caatttaaat cacgaacaaa acatgtcgat gttaaatatc attggcttag 4380 ggaattggtg gcagatgatc aaatgaaggt agaatacgta ttgacagaaa atatgttagc 4440 ggatattttt accaaggctt tgacaccaat caaacatcaa agattttgtt cattgctagg 4500 tttgcaggat gtagcaaaga cggggggaaa t 4531 // ID Gypsy-2-LTR_AF repbase; DNA; FNG; 167 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Long terminal repeat of the Gypsy-2_AF LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-2_AF; Gypsy-2-I_AF; KW Gypsy-2-LTR_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-167 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-167 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-2_AF, a family of gypsy LTR retrotransposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 63-63 (2006). XX DR [2] (Consensus) XX CC This is a long terminal repeat of the Gypsy-2_AF LTR CC retrotransposon. Gypsy superfamily. It is characterized by 5-bp CC target-site duplications. XX SQ Sequence 167 BP; 58 A; 35 C; 28 G; 46 T; 0 other; actatcacag aacagggtgt aaacactata aatatagctt actaaggcac agggagaagt 60 tcttagtagt gattccatag ctggtacatg tatcctaccc atcatccatc cataaaaaca 120 atacagtgtt ttgggtgttt ccatcaaatc catcagtata agcatca 167 // ID Gypsy-1_ADJ-I repbase; DNA; FNG; 4952 BP. XX AC ADAR01000039; XX DT 21-APR-2011 (Rel. 16.04, Created) DT 21-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Batrachochytrium dendrobatidis DE genome: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_ADJ_; KW Gypsy-1_ADJ-LTR; Gypsy-1_ADJ-I. XX OS Batrachochytrium dendrobatidis OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; Chytridiales; OC Chytridiales incertae sedis; Batrachochytrium. XX RN [1] RP 1-4952 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Batrachochytrium dendrobatidis RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; ADAR01000039; Positions 28591 33542. XX CC Positions [2170-2709] - Reverse transcriptase CC Positions [3829-4308] - Integrase core CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 200..1108 FT /product="Gypsy-1_ADJ-I_3p" FT /translation="MENYVEALQNVLQNMQSEIAALRTERSAPLVQATLNI FT PPPDRYNGNRSKFTTFVSQCNLAMRMHPTQFPSDQARVGFMISLLTGPAAD FT WATPLLAANDPVLNSLPEFINLFRRQFDDPDRERTAEARLKNFEQGQRSCA FT DYATEFLRLSADTDWNDGAKIFIFKGGLSSDIKTRLSYISACPRDLRGFID FT LCIRVDERITELQKEVPPFKPSKRLPSLTTFQPQSSNDMQVDAVVRGPLST FT QEKERRRQLGLCLYCGQPGHRAHDCPNRRSNRRTTAGSRSVVRQTNTSAST FT SDNSGNELSRR" FT CDS join(1168..2250,2254..4947) FT /product="Gypsy-1_ADJ-I_1p" FT /translation="MTDYGTKQRTVMCGSVGVARGMLDKDVDVVVPSGETG FT SFTQMPSTPPLYSHQPAKPTQHAKSVQCKTPSRSYSLVSSVSPSSIVIPVS FT LVLPSGRFVSCHSLVDSGASGNFIDESFIARSRIPLVTKQEPLHVEGVDGR FT PLSSGPISKETLPLRVVIGDFIQEMTFDVIKCPHQPLIFGYPWLSSCNPKI FT SWFSRVMKLPHSVSCLVSHFVSNSSSKINPVSSDNPVSYRSFPVNPVSVDD FT SHFIEPPVNDSCFVLPTPDIPSIQATTTPSSDIPLHLVEFASVFSKKKALV FT LPPHRQYDCPIDLEPGSTPKFGPLYSLSEPELKALRDYLDENLANGFIQPS FT TSPAGSPILFVKKKDGSLLCVDYRSINNITIKNRYPLPLINEMLDRLQGSQ FT VYTKLDLRGAYNLLRIRAGDEWKTAFRTRYGHFEYKVMPFGLTNAPATFQH FT FINDVLRNELDQFAIAYLDDILIYSKSEKEHLDHVRTVLKRLHESQLYCKL FT EKCEFSKDRISFLGYVISPKGIEMEREKLTAVLDWPQPKNLTDLRSFIGFA FT NFYRRFIDGYSKIAAPLTQLFKKSKVFEWTPDANNAFQMLKNAFVSDPTLV FT HADTTRPFIIESDASDFAIGCVLSQRDANNQLRPCSFHSRSLSSAERNYSI FT 
YEKELLAIREAFDVWRHYLEGAHHAVEVLTDHKNLEYLASARILNQRHARW FT SEFLSRFNIKIQYRPGSKNGKADALSRRSDYETQSGDIPDIPRTLLSAESF FT SETIDTPHHPSSVALQILPRTTTKDLLQLIQQSQQTDDYIMSLGTDPEFVV FT KEGLLLHQDRIVVPKSCYAAVLHSCHDVLAVGHPGIRRTLNLVSRKFWWPT FT MRADTKNYVSTCDTCARAKVPRHKPYGLLKPLETSDRPWGIITMDFISSLP FT TSNGHTCIFVIVCSLTKLAHFVSCPELPSADETATMFIENVFRLHGLPDSI FT VSDRGPQFTSHFWQALCKGLGIRTRLSTAFHPQTDGQTERVNQVINQYLRC FT YTSYQQDDWVSLLPLAEFAYNNLEHASIRSSPFLATYGYHPRIEFPTLLES FT SINVPSAQERIQHIQQNLALLKENLEKAKSDYKHFSDRNKVAEPVFTPGEL FT VWLLARNIKTTRPSKKLDYQRLGPFRVIEPIGTLAYRLELPKDIRIHPVFH FT VSLLEKHQRNEFADRQIIPPPPVIVENHLEYEVEKILDSKIVKGQLHYLVD FT WKGYTINDRSWEPVENISAPDKLAEFHSANPSKPKEPSRLQRRRRLERG" XX SQ Sequence 4952 BP; 1330 A; 1283 C; 1010 G; 1329 T; 0 other; ataaagttct aaagttcctt gatcctagtt ggtaacaagc acctggtttc ccttcgcctt 60 attttgcgat taagtacttc ctgacacatt gataaaactc ttaattttct gaaccacctg 120 cttgtctact tcttgtctct atccttgtct ttgtctttct ttctttcttt tatttcttat 180 ctcattccaa aatctcaaca tggaaaatta cgtggaagct cttcaaaatg ttcttcaaaa 240 catgcagagc gaaatcgctg ccctgcgcac ggaacgcagt gccccactag ttcaggctac 300 gttgaacata cctccccctg acaggtataa cggaaatcgt tccaaattca cgactttcgt 360 cagccaatgc aatttggcta tgagaatgca ccccacacaa tttccatccg atcaagcgcg 420 ggtcggattc atgatcagcc tgctcacagg ccctgctgcg gattgggcga ctccattact 480 cgcggccaat gatccagtct taaactcgtt gccggagttt atcaacctct tccgacgaca 540 attcgatgat cctgacagag agcgcactgc tgaagcgaga ttaaaaaact ttgagcaagg 600 tcaacgctct tgcgcagatt atgcgaccga gttcctgcgt ttatcagccg acaccgattg 660 gaacgacgga gctaaaatct ttattttcaa aggtggactg tcatcggata tcaaaacacg 720 cctgtcatac atttccgcgt gccccaggga tctacgtggt tttatagatt tgtgcatccg 780 tgtcgacgaa cggataacgg aattacaaaa ggaggttccc ccgttcaaac ctagtaaacg 840 attaccctcg cttacaacat tccagccgca gtcgtccaac gacatgcagg ttgacgcagt 900 cgttcgaggc ccgttatcaa ctcaagaaaa ggagcgccga cgccagctcg gattatgctt 960 atattgtggt cagccaggtc accgagcaca tgactgccct aacagacgtt ctaacaggcg 1020 aacaactgca ggctcaagat ccgtcgttcg tcaaaccaac accagtgcct cgacttcaga 1080 caattcggga 
aacgaactca gccgtcgcta agcggaggct tggttccggc tgtcacacct 1140 ttgcctcctg aaacagacac tgatggtatg acggactatg ggactaaaca acgaacagtt 1200 atgtgtggat cagtgggtgt agcccgtggt atgcttgaca aggatgttga tgtggtagta 1260 ccctctgggg aaactggctc ttttacacag atgccctcca cacctccttt gtactcgcat 1320 cagccagcaa aacccactca acatgctaaa tctgttcaat gtaaaacccc gtctcgttca 1380 tactcactgg tatcatctgt ttcccccagt tccatagtta ttcctgtttc ccttgtacta 1440 ccatctggtc gttttgtatc atgtcatagt ttggtcgatt ccggcgcgtc cggaaacttc 1500 atcgacgaat cttttattgc tcgttctcgt atccctcttg taaccaagca agaacccctc 1560 catgtagaag gcgtcgatgg tcgaccccta tcttcgggcc caatctcgaa agaaacactc 1620 ccgctgagag tggtgatcgg agattttatt caagagatga cgtttgatgt tatcaaatgt 1680 ccccaccaac cattgatttt tggatatcca tggttgtcca gttgtaatcc caagatatct 1740 tggttcagtc gtgttatgaa gctaccacat tccgtttctt gtcttgtttc ccatttcgtt 1800 tccaattcca gttccaaaat aaaccctgtt tccagtgaca atcctgtttc ttatcgtagc 1860 ttcccagtta accctgtttc tgtcgatgat tcacatttca tcgaacctcc tgttaatgat 1920 tcctgttttg tgttgccaac cccagatata cctagtatcc aggcaactac caccccgtca 1980 tccgatattc cattgcatct tgtcgagttt gcatctgtat tttccaagaa aaaggctctt 2040 gtattacctc ctcatcgaca gtatgactgt cccattgact tggagcccgg atctacacca 2100 aagtttggac cgctatattc gctatcagaa cccgagttga aggctctacg agactatctg 2160 gatgaaaacc tagcaaacgg ttttatccaa ccttccactt cacctgctgg ctctccaatt 2220 ctgtttgtca agaaaaagga cggttcattg tgactatgtg tagattaccg gtctataaat 2280 aatattacca tcaaaaatag atatccgtta ccactgatca acgagatgct agaccgactt 2340 cagggatcgc aagtctatac caagttggac ctccgtggcg catataattt attgcgcatt 2400 cgagcgggtg acgaatggaa gactgctttc cgaactcgat acggacactt tgaatacaag 2460 gtcatgccgt ttggcttgac taatgcacca gcaacgtttc agcactttat caacgatgtt 2520 cttcgaaacg aactagatca atttgcaatc gcctatttgg acgatatact gatctacagc 2580 aaatcggaaa aagaacattt ggatcatgtt cgcaccgtac tgaaacggct tcacgagtca 2640 caactatact gtaaactaga aaagtgcgag ttctccaaag atcgaatctc attcttgggt 2700 tatgtcatat caccaaaagg catcgagatg gaacgtgaaa agttgaccgc agtcctggac 2760 tggcctcagc caaagaatct 
cacggacttg cgttcattta ttgggtttgc caacttttac 2820 cgacgattca ttgatggata ttccaagata gctgccccac tcacccagtt atttaaaaag 2880 tccaaggtgt ttgagtggac acccgatgcc aataatgcat ttcaaatgtt gaagaatgcg 2940 tttgtctcgg acccaactct tgttcatgcg gacactactc gtccgttcat cattgaatcc 3000 gatgcgtcag attttgcgat tggttgtgta ttatcccaac gcgatgccaa caaccaactc 3060 cgtccgtgtt cctttcactc ccgctcactg tcaagtgcag agcgcaacta cagtatatat 3120 gaaaaggaac tcctggcaat cagagaggca ttcgatgtat ggcgccacta cttggaaggt 3180 gctcatcatg ctgtggaagt cttgactgac cacaagaacc ttgaatacct tgcttcggca 3240 agaatcctga accaacgaca cgctagatgg tccgagttcc tgtcccgatt caatatcaag 3300 attcaatatc gcccaggatc caagaacgga aaagctgatg cgctttcccg tcggtcagat 3360 tatgagactc aatccggcga tatccctgac atcccacgaa ctctactgtc cgcggaatcg 3420 tttagcgaaa ctattgatac tccacaccac ccaagttccg tggcactcca gatacttccc 3480 agaacaacta ccaaggatct attgcagcta atccagcaat ctcagcagac cgacgattac 3540 ataatgtcac ttgggactga cccggaattt gtagtcaagg agggtctgtt actacatcaa 3600 gatcgtatcg tggtcccgaa atcatgttat gcagctgtgc tacactcatg ccacgacgta 3660 ctagcagtcg gccacccagg tatccgacga actctgaatt tggtttccag aaagttttgg 3720 tggcctacta tgcgtgccga caccaagaat tatgtatcaa cgtgcgacac atgtgcccga 3780 gcaaaagtcc cacgtcataa accttatgga ctactgaaac ctctagaaac atctgatcgg 3840 ccttggggta taatcacaat ggattttatt tctagtttgc cgacatcaaa cggtcatacc 3900 tgcatatttg tcatagtatg cagcttgacc aagttggcac attttgtctc atgtccagag 3960 ctaccatcgg ctgacgaaac agcaacaatg ttcatagaaa acgtgtttcg tctgcatggt 4020 cttcccgact ccatagtatc agatcgcgga cctcagttta catcgcactt ttggcaagca 4080 ttgtgcaaag gccttggaat tcgaacccga ctatcgactg ctttccaccc gcaaacggat 4140 ggacaaactg aacgagtcaa tcaggtcatc aaccaatacc tgcgttgtta tacatcctat 4200 cagcaggacg attgggtatc gttgctgcca cttgccgagt ttgcgtacaa caatttggag 4260 cacgcatcca ttcggtcctc accattcctg gcaacatatg gatatcatcc acgtattgag 4320 tttcccacgt tactcgagtc cagtattaat gttccatccg cccaagagcg catccagcat 4380 atccagcaaa acttggcgct actcaaggaa aatctagaaa aggccaagtc agattacaag 4440 cacttttccg atcgaaacaa agtcgccgaa 
cctgtattca cacctggcga gttagtctgg 4500 cttttggctc gaaacattaa aaccactcgg cctagcaaga aactagacta tcagcgtcta 4560 ggccctttcc gcgtaataga acccattgga acactggcgt accgactcga gttaccaaaa 4620 gatatccgaa tccatcccgt attccacgtc tcactgttgg agaaacatca gcgaaacgag 4680 tttgccgata gacagataat tccacctcca cctgttattg tagaaaacca tttggaatac 4740 gaagtagaga aaatcttgga ctcgaagatc gttaaaggac aattacacta tctggttgat 4800 tggaaaggtt acacgattaa cgatcgaagc tgggaacctg tcgaaaacat ttcagctcca 4860 gacaagttgg cggaattcca tagcgcaaat ccgtcaaaac ccaaggaacc gtctcgactc 4920 cagagacgtc gtcgtttgga gaggggataa tg 4952 // ID Gypsy-4_AM-LTR repbase; DNA; FNG; 142 BP. XX AC ACDU01002060; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Allomyces macrogynus genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_AM_; KW Gypsy-4_AM-I; Gypsy-4_AM-LTR. XX OS Allomyces macrogynus OC Eukaryota; Fungi; Blastocladiomycota; Blastocladiomycetes; OC Blastocladiales; Blastocladiaceae; Allomyces. XX RN [1] RP 1-142 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Allomyces macrogynus genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; ACDU01002060; Positions 1689 1830. XX SQ Sequence 142 BP; 31 A; 38 C; 28 G; 45 T; 0 other; tgatgtgaag cagagctacc tgtatggctc ctttgcatac ccgttgtttt tgttgttgat 60 tcccgtttgt cattccatgt atataaacgc ccaaacgtac ctctgggtac gacaggctcg 120 ttccacactc gacaccatat ca 142 // ID Gypsy-18_MLP-LTR repbase; DNA; FNG; 192 BP. XX AC AECX01000171; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_MLP_; KW Gypsy-18_MLP-I; Gypsy-18_MLP-LTR. 
XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-192 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000171; Positions 118918 118727. XX SQ Sequence 192 BP; 43 A; 59 C; 39 G; 51 T; 0 other; tgttatgatc tgtgacactg gtctgcgggc atatcacaac ccgtgaaagg gggacggact 60 gagtcccact cttgtagtct gagcgtctcc aacttgctct tttccctatg ctacaataac 120 tatattacgt agtacccagc atcctctctc tctctgtccc acacgcaaga cccccgaagc 180 ctggtcgtaa ca 192 // ID Gypsy-12_CCO-I repbase; DNA; FNG; 6182 BP. XX AC AACS02000012; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_CCO_; KW Gypsy-12_CCO-LTR; Gypsy-12_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-6182 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000012; Positions 886892 880711. XX CC 'CTGGC' target site duplication CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(1178..2122,2126..3127) FT /product="Gypsy-12_CCO-I_1p" FT /translation="MITLISEEWRNTSCGTRRIVVRKLRKWEEFMPVVLLV FT VTVGPQILLKRLIDPLCLSVSFWVIAGCEVELHVQGFPQTAEEVGDELGAA FT VGGYMARNTMFGEDVEHKQSRKFCGGEAGVCRNKNGLFGQSIHHHQDIRVS FT AGCRELFDEIHRNGVPGAERNRQGLQQTIRLVTRRLVALADCAGADVVRNE FT SAEFRPDVVAAHQLNRLVLAVVSRKRMVVFELKYAEAECGVVRNVDASVLA FT EYTICGESPARVGRCGEVFGSDFVRGVSSLDVLEERSLINYDRCAECGEHD FT RSCAERCGQLLIDEDGSEVVRIDRVVTTSLFKVDVPTSSEGIGLHPQLSRA FT KADDKVELGEILRPARLPAGEDLGSGEIFQVLVVRHHIHRLRRSLEVMPPN FT LEGLEDSQELLVVDVVVELRSLESAGMECDGVDFARVELLGQDGGKGVVRG FT VSFHNQGSVGLPVHQDGCGSECGLEGFKCGTCGVAEHERGVFAGEAGERNS FT DVGVVVDETSIEVGKTEEGLYILHRARFRPVLDGLGLVRRHSEACRRQDVT FT QVFHALRMKFAFLCGGEETMLPQAAEDFLDVLAMILHVVGIDEDIIQVDDD FT TDINHVRKDIVHESLECGGRIGQPKRHHQPFIRPITCAECGLPLIPVRNAD FT " XX SQ Sequence 6182 BP; 1354 A; 1406 C; 2169 G; 1253 T; 0 other; gtacctccct gttaaggccg agccctcgag gcttgtgtta ggactcggtc catgcacaga 60 atctgatgtc cgcatcttgc aactcggcat catcagcatg cttgtgactc tgcggtgcgt 120 ttgcggagaa gtgaaaaata atttcttgtc ctcttgaagt ggaaatgaaa agtgcggaca 180 ggaaatgaaa gtgaggagag agaagcggag gcaagggaga aattgaagca taacgtgcgg 240 agtaagggaa gtcatagtga agtggaggaa aatgaaggca agtggaataa gtggagtaag 300 cgggaacatc aagaagagag gaatgcggaa caggaaaaaa gatcaaacag gtaaggcata 360 gaggagtgga ggaagcatag agaggaaagg aataatgcgg aaccacgaat tgaagagaaa 420 gaagcagaag aaacaggatg cggaaaattg cggagagcgg agagacgaag gtaaaacttg 480 ggaaaataaa aggatacaaa ggtgaagaag aaatgcggag tcaactgtgg aatgcgggga 540 atgcggggaa tcaccaatgg ggaccaggct tgtctgggta cctcttgtgg aactctccaa 600 cgatctcttg agcatgttca agctctgtgg caaggagcca gcttcgttcc tcgtccgtat 660 tctcgtatcc gcgccaccgc acttcgtaga ggagcttgca atgtttgcgg cgacgatcca 720 atttggagtc gaggatctcc gcaatctcgt attcaagttc cccatccacc tcgattggag 780 gcggaggcgg ttcgtggcga tgcggaatct gagaggtgta atgcggttcg agcattgaga 840 cgtggaagac ggggtgaaca gcgcggagag agtcggggag gcggagagtg taggaaaccg 900 tttccacttg ggcaatgatc tcgtaagggc caaagtatcg ttccgagagc ttcttggtag 
960 gtctggttgt gcggaagaac tgggccttaa cgaaaactcg gtcgcccacc ttgaagtctg 1020 gcggaggtaa gcgcttttgg tccgcatagc ggcgtgcgga ctcctgcgcg ctggcgatac 1080 tggtgcggag gtgttcgtgg agctcctgca ggtcgaccgc aaagtcgcgg gctgcggaac 1140 tcgcaagatc gcgttcaggg tgaacagaga tgttcggatg ataaccctta ttagcgaaga 1200 atggcgaaac accagttgtg gcactaggcg cattgttgta cgcaaactcc gcaagtggga 1260 ggagttcatg ccagttgtcc tgctggtagt tacagtaggt ccgcaaatac tgctcaagcg 1320 tctgattgac ccgctctgtc tgtccgtctc cttctgggtg atagccggat gtgaagtgga 1380 gcttcatgtc cagggcttcc cccaaactgc ggaagaagtg ggagacgaac tcggagccgc 1440 ggtcggaggt tacatggcca ggaacaccat gtttggagaa gacgtggagc acaaacagtc 1500 gcgcaagttc tgcggaggtg aggcgggagt ctgtcggaac aaaaatggct tgtttggaca 1560 gtctatccac caccaccaag atatccgtgt atccgccgga tgcagggagc tgttcgatga 1620 gatccatcga aatggagtcc caggggcgga gcggaatagg caggggcttc agcaaaccat 1680 aaggcttgtg acgcggagac ttgttgcgct tgcagactgt gcaggagcgg acgtagtccg 1740 caacgaaagt gcggagttcc ggccagacgt agtcgcggcg caccagctca atcgtcttgt 1800 tcttgccgta gtgtcccgca agcggatggt cgtgtttgag ctgaagtacg cggaggcgga 1860 gtgcggagtt gtccggaacg tagatgcgtc cgtcctggcg gagtatacca tctgcggaga 1920 gagtccagcg agggttggaa ggtgtgggga ggtgtttgga agcgatttcg tccgtggagt 1980 aagcagcctt gatgtccttg aggagcgctc cctcatcaac tacgatcgat gcgcggagtg 2040 cggggagcat gatcgaagtt gcgcggagcg atgcggacag ctgctcattg atgaagatgg 2100 gtctgaagtt gtgcggattg actgacgcgt agtcactacc tccctgttta aggtagacgt 2160 cccaacgtct agtgagggca tcgggcttca tccccagctt tccagggcga aagcggatga 2220 caaggttgaa ctgggagaga tactccgacc agcgcgcctg cctgcgggtg aggaccttgg 2280 aagtggcgaa atattccaag ttcttgtggt ccgtcatcac attcacaggc tccgcagatc 2340 cctcgaggta atgcctccaa atcttgaagg cctcgaagat agccaggagc tccttgtcgt 2400 ggacgtcgta gttgagctcc gcagcttgga aagtgcggga atggaatgcg acggggtgga 2460 tttcgccaga gtcgagttgt tgggacagga tggcggcaag ggcgtagtca gaggcgtcag 2520 tttccacaat caggggtcgg tcgggttgcc agtgcaccag gatgggtgcg gaagtgaatg 2580 cggacttgag ggtttcaaat gcggaacgtg cggagtcgcc gaacacgaaa ggggtgtctt 2640 
tgcgggtgag gcgggtgagc ggaacagtga tgtcggagta gttgtggatg aaacgtcgat 2700 agaagttggc aaaaccgagg aaggactgta tatccttcac agagcgcggt tcaggccagt 2760 cctggatggt cttggtcttg tccgcagaca tagtgaggcc tgcaggcgac aagatgtaac 2820 ccaagtattc cacgcgctcc gcatgaaatt cgcatttctt tgcggaggcg aagagaccat 2880 gcttccgcaa gcggcggagg acttccttga cgtgcttgcg atgatcctcc atgttgtcgg 2940 aatagatgag gatatcatcc aggtagacga tgacactgac atcaaccatg tccgcaaaga 3000 tatcgttcat gaatcgctgg aatgcggagg gcgcattggt cagcccaaaa ggcatcacca 3060 gccattcata cgacccataa cgtgtgcgga atgcggtctt ccactcatcc ccgtccgcaa 3120 tgcggactaa gtggtaggca tgcttcaggt caatcttggt gtagaccttc gcgcggccgg 3180 gtgcgtcgag gagatcggag atcagcggaa gcggatagcg atctttcttg gagatgcggt 3240 tgagtccgcg ataatcgacg caaaggcgga gcgatccatc cttcttcttg acaaaaagga 3300 tgggagcgcc taggggagag ttggaggagc gaatgaaacc agcccgcaag ttctcttcaa 3360 ggaactcgcg gagtgcggag gtctccgcag gagacaggga gtagagacga ctcggtggaa 3420 gcggagcggc agcgtcaagc tcgatcttga ggtcgtagga tcgatgcgga ggcaaggtgt 3480 aggccttggc attgctgaac acgtcggcaa actcgtggta ttcctctgga acagaggaga 3540 ggtcgacgag ttccgcatca actgctgcgg cgcatgcccg caccacattg tcgctcttgg 3600 aaagggaaat ggaaaaggag tgtgagccac gcatcgcgga tgcgcggcgg aaagcggctg 3660 cattgacaaa agaaattgga actggattgc tggtgaatgc gcaaggaggg gaagtgggcg 3720 gagcactagg ctccggtgcg gagagaggcg gagggatgct ggaggttgcg gaggtctcga 3780 cgtttgcggc ggcgagtgag ctcaatgcgg aggagttatc ggatgattcg aacgagagta 3840 tctggcctct gcgccaatca atcaacggat tgtaacgggc aagccagttg tgtcctagaa 3900 ccaacggata cagggcatcc agggttgtga tgaagcactt gatatggaat tcctcgcctg 3960 aggagatgcg gagcgtgatg tccgcagctt gcgataacca gcgcgaaatg gttccatcga 4020 ggagtttaag cggaacaggc ggaatgtcat agaattcgat attgtgcttg gatacaaatg 4080 cggagtcgat aaaacagtcc gtacaacccg aatcaatcaa ggcgcggagt tcagcagagt 4140 ccagagagaa ggaatgagtt ggggtgcgga gaatcgaaaa gagaatggaa agagcattgg 4200 gatcagagag ggctgccgca tttagtcgga cctcaagcct ttctgcggcg gagacgcagc 4260 cctgagggaa aggcgaggtc acagggctgc tcagtgattt cccgactgag aggagtcgga 4320 aacctgcgga 
gcggacgcgt tggccctgcg ggcacggcgg gtgcgcgggc agtccgctgc 4380 gcggtgtcct gaggcagcgc agacaaggca aagattgttc cgcatgcgac gttcgcgctc 4440 ctcagggcgg aggcgaccgt tagcgtccag gttgttgttc caagggcgct gctggccacc 4500 gccctgacgg ttgttgcggt tgccgcctcc agaggagtta gcggaggagg cggagggcgc 4560 ggacgtggaa gcgtgcggag cggagggttg ccgttcttgc ggacggcgga agttgtgtgc 4620 ggggaatgag ggtgacgaga gtggttgttg cggaagtgtg aggaggcgga atgatgcgga 4680 gtggagctgg aagaggaagg accaggacca ccggaggagt gattgttgcg ggggtgcgga 4740 gcagggttgg atccgcggtt ggtggtgcgg agtgcggcct gctccttttt gatctcctcc 4800 tggcgatcca ggtagcgctc gtcgatcttg aacacctcaa cctggaggtc cgcaagggtt 4860 gcggggttct cccgcttgga gagctcgttc ttgatgcggt cgaggagtcc attgtagtac 4920 tggtcgcgga gcggctcctc attccagttg atctttgccg caagctggtt aaaggccagg 4980 ttgtactggt aggcgcgtcc ggtcatcttg agcgccttca gcttgcgggt tgcgtccttg 5040 atggggtctt tgggcccaaa cacgcggatc agttcgtcct tccatgcgcg gaagctctgg 5100 aaccaatccg ggactcgagc gtctccgcgc gcgcgggcgg ccgcaatctt gtcgccaaag 5160 taggcatagg cgtcgccctt gaggtaggtg agggctgttg cgacgcggtc cgcatcgtcg 5220 cggaaaccgt cctggtacct cccgaagtag agggtgagct gggtgatgaa gttgtcgagt 5280 ttgtctgggt cctccccatc gaagggctca gggggtcgtg ccttggggcc ctcgcgcggc 5340 cttgtgagtg cggcctccat ggtccgcgcg aggcggcggt tggtccgcgt gaggccggcg 5400 aggagccgac tgaggagttg cggaacgtcc ggttcgtcgt ccgcatcgga gtcagatgaa 5460 tccgagctgt catttccatc atccgcaggg ccgccgggtg gaccgccggg tgggggccct 5520 cgattcggct gaggagggcg acccgggccc tgaggtggtc cgtctccgcc ctgcggagcg 5580 tctgcgggcg gttgcccgtc atccgagtct ggagggtctc cacccccgac aggcgggaca 5640 gcagggggtt cgtagggcgg aactgggtcg ttggagcctc ccgcaaccgc gttctggagt 5700 agcgatgcgg tgcggagggg tgcgggtgtg acataggata ggtccgccgg tgcaggcggt 5760 tcatcgactg ttctcccctc gccggaaaac gggagtaagg cggcggtgac agtggcctcc 5820 tctccgcgtt cctgcgctga ctgtgaggag agtccgcctg ggaattgcgg agtgtttgcg 5880 ggtgtggtcc gcgtctggac ctgtgtcgcc attgggtttc ggtaggcaga ggccgatgta 5940 gcgggcccaa ggaaggatgc gggcggaccc cagattggct gctcctgggg ccgcggggcg 6000 tcaggttggc ggaagttggg 
tgaaaaggga ttgtggaggg ggtcgtcgat gacaggcgga 6060 tgtcgacgat gatgtgtgga gattgagtag gaaaactccg gaggggtttg cggacgttgg 6120 aacacaatag agtagttagt gctcctggtg cgcataccac agcgtgcgga tccggaatct 6180 ac 6182 // ID Gypsy-72_MLP-LTR repbase; DNA; FNG; 171 BP. XX AC AECX01001226; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-72_MLP_; KW Gypsy-72_MLP-I; Gypsy-72_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-171 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001226; Positions 57763 57593. XX SQ Sequence 171 BP; 40 A; 43 C; 27 G; 61 T; 0 other; tgttatgacc ttacttgtat aaggcttacc tgttgtatgc ttatactgta ccatctagac 60 ttcttgtgat gcgtagagca gggactcttt tccttactta gcaatctaac tacaatacct 120 gaaggccctt tctatctctt gcatcttgtc ccatcaagcc aagtcctatc a 171 // ID Gypsy-22_LBS-I repbase; DNA; FNG; 11818 BP. XX AC ABFE01001864; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_LBS_; KW Gypsy-22_LBS-LTR; Gypsy-22_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-11818 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01001864; Positions 22593 10776. 
XX CC Positions [6976-7455] - Integrase core CC 'CTCGT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 601..7791 FT /product="Gypsy-22_LBS-I_1p" FT /translation="MEAEKTSRLGMENGFENSPMAKLLREYSGEAGETEDL FT DTSLVSTQTNVGLVLDLLELSEEMIVDRGPIRDKGLPRYHLDALTLGRITT FT VVTEQQIFLERAAGLIAGRTNFYKVDPGDTLLPILKGTSSAPQLRAAWVAL FT RHRIELGTKAWRKYITEYRLPIDSQPVLSLLSTLPELYLPLQDIAEQDDKL FT RYLYKHIPHHQEQLTSEGKRALEKTRSWLDILPVPDAFKNAFAENKIPSPS FT TLEVTDIRRASKGKEKEEQNPHKTGAQSSVWMGMETPFKSSNNWFVKPGES FT NRVKQPGTSRPPAEPNILLGIATPAAPQTVTAWEGREQPPHMAAQAQALRS FT ETKPGRAESRASSHASDRQGRHRRREHRETRDGPDEPPSSDDDSSSSRSRR FT SHRSHSWRRRSRTPKPRRRRSSTPRPKRRRSSPDDDGGDGSSSSDTSYSSS FT STHSRSRRRRRRRSKDSVVIPYGRIAPTIDAKLKQEDLPTWDGNPNTAIEY FT FWKVQQQATLGGYIPSALGYWLWLKLKEGSDVQNWFATLPFEEQSRMRGHW FT VDYLKGIKEGYLGRNWQFDIGEAYKAQYFRQPGHERELPKSFITRRIMYTR FT MLAKSDDGGPLEVHLVMARAPLAWRTILVLENIKSSSMLYTKAAEHEASLL FT EISRSRSQSSNTITSENLVSTLNKMGYTLDRPKFNNFPQNRRANLTSNERE FT ANPAGEGTKESYTTQAVSQESHQNEEEILAEVFQVLKKRQRPPPPGGYLFS FT KNDHVTTKMGRLPPSPCKCCGSANHWDKECPDWAVYLEKTAKSGYGNEREL FT IEEEMYYLSTYSILLSQRVASLQVDQTKLEKDFRPAVRSSVRDQSYHGRKT FT EVQPKKMAVTVEEVEDEFWEEDRNRAKSDTHLLRAEDDEDAPLEEPEERRP FT SPSKHPTPQRRTRDTAPPARKPTHQVSIEEMEDEDVVAARIKPTSPRHLLI FT PMDGIGDPEENYEPNRSEDIKTAEGNKEAHHNARRTESADIPTLKDLPPPP FT PDSKPIRLCKKRLTSPGMSSLGVSVLSTKGWVAGLENASVDLRVDSCADVT FT LISEEFFNTLKGVPRIQQGMRMKLWQLTDKNEELRGYVRIPIFMQTTEGVV FT LESEAEAYVVPNMTVPILLGEDYQLNYELGVTRNVETGTKLRFAGTDHEIA FT AVHVSRTPDFDRMRQSTMLVNKFAKAKVHRREKARRHRRKLKFGMEEKTVR FT AAEDCILKPHGCRRIKVEGQFDVDKDWLVQKSLLANANDSYFAVPNTLISA FT RNPWVPIANPTDQPRYIRRGEVIGSIHDPEDFFETPDSAERADVLYKHAAA FT IKAVIGAQMEAEGSSNKATTNVDNQQIPEEEEEYGPKTAAMPDLTDYPSAK FT IEELIDVGTLPDHLKEKAWNMLKNRQQAFGFDGRLGHLPTKVHIRTADGQV FT PIAVPMYGSSPEKRRVMDEQMDKWFEQDVIEPSISPWSAPVVIAYRNGKPR FT FCVDYRKLNAVTIPDKFPIPRQSEILSSLSGAQVLSSLDALSGFTQLELHE FT EDVEKTAFRTHRGLFQFRRMPFGLRNGPSIFQRVMQGILAPYLWLFCLVYI FT DDIVIFSKSYEEHITHLDKVLEAIEKAGITLSPNKCHLFYGSILLLGHKVS FT 
RLGLSTHSEKVKAILELERPKKLSQLQTFLGMVVYFSAFIPYYASICAPLF FT QLMRKGAKWKWGAEEEYAFQAAKEALRSGPVLGHPIEGRPYRLYTDASDEA FT ARCALQQIQPIQVKDLKGTRTYNRLKKAYEEGLPPPKLTTTLSAKTTDSPA FT DDKWGDDFDSSVVHVERVIGYWSRTFKGAETRYSTTEREALAAKEGLVKFQ FT PYIEGEKILLVTDHSALQWARTYENSNRRLAAWGAVFSAYAPGLEIIHRAG FT RVHSNVDPLSRLPRAAPDHVSPEIDPGPSIVTDFSLAEEQERTLNRTFNRE FT TFVAWSISECLEGPTSSCFTEARQGIADFEGEDPVSENQSASRGDELDTLP FT VAEEYWEASNPAPNLHVEMDASFIRDWVEEYQSDQSFRSVWNDEKREIENW FT KRDGRFLKDQRGLLFFLDEDYQPRLCVPKSKRNLILREAHENPLESAHAGP FT ERLYQTLSQKFYWKRMKADVLEYCKSCDPCQKTKFGNFNKFGFLIPNPIPS FT RPYQSISMDFIVNLPWSNNFNAIFVVVDRLTKQGTFIPCTTGLTAEEFAEL FT FVKHIICRFGIPDSIITDRDPRWTSDFWKGVARFLKTKMALSSAHHPQHDG FT QTEILNRHLTTMLRAYVSDDLADWATWLHILEFAYNNSVHGSTGASPNFLA FT YGFQPKSPLDFLLPKDSPRAKSALVPTTFMIKPGHLPAQSFQSPQVKYEMK FT CLGYFILALQLFHSYPPHFSFLLGNEMHFILLPVLFHNFFILCQLYFFSFH FT SLPVSFHSFACTFS" FT CDS 9324..11807 FT /product="Gypsy-22_LBS-I_2p" FT /translation="MDFLPLDDTGLVPQVDFRTIGEDLWVGLNASCDKIPS FT EASAYVDYQSNQRVADEFRDKGPTGLNLEMEWALDADWYDPEASWRPFLPL FT PSGMATEWYYQMDQGTPSDPESISPQYTIFPPLLSEMEKDLHRFESYVVAI FT AKSSIFPARAARPGLYDFEQLKGPFDSIRSLENFGANVKRQALDYLSFLAW FT WTLSASGWDVYLPQDTVDSIMDLGLDDRPKRGVLLDLDRDWKQMSIPHLLR FT HRVPVFYRWNESLQSNNRFLSLSPYILRAFDKRRKATPSERVFAIDLPEFT FT NDFEVMKDYDEFFQRRVFHNEPSPDLEFRADWHYAVVDFQGWMYRPIDLET FT AKELARRFGSHVVRRENRTSVIFRRWEAFQFDDTIAEPAGLAEAYVDRDPM FT RGNIEIREIHKSFFAPVQNQKFDLNGFPDYGPLPGNNLLRPTRPRSWVEAM FT SSAGHTSSRSSSSGPSRRQSSSKNLNRSSPYPRPRSSSPSFRATYQHRDES FT SPETTKEQFIRRLRVQGNIVGTASLWLSPVEAEWNPLFLEEGVLLFPDNRT FT QIRVRYWTICGSPSIRMRQVLDFAISRGMKFLIAIPFDALPHFHQLEKQNM FT ASLTKRTYDTGFQESPLTYSKGGSAFMDQYLGKLADILRRPHARAVIAMGG FT PTSWIARFYGGSRLVEEYMSGPSSQVTVHHRGGVTVAPFLDMPVFHDQLSK FT QEIELIHGYIPLGNPNEDRWAFPTSELLEEFSNHWRGEWNQGCERIMGNIA FT RALESGALAPLTRREWREYLRNNNRGEHAPSPGSIPKDSDFAFVENAIDSS FT YPVRWHGRRIRDIFLPEDFEVSSSGN" XX SQ Sequence 11818 BP; 3413 A; 2995 C; 2827 G; 2583 T; 0 other; aaggtggaca ctgtgggaaa tcaacgcgcc ctggccgact cgacaaccat agacgctaca 60 acagccaatt cgaatagtgc cgttcctccg atcaggagaa atttgagcgg 
agtcaccact 120 acaagaccgt cttactcatt caaccgtgcg gactcttcga aatcgacacc gcccgcttca 180 cgttcagcct ctcactctgc atccccgaag aaaggatcta caaagaccac cacgagcaaa 240 tcagcctctc aggagctttc ttcgaggtct cataagaagg ctcacgtgga agccatagtg 300 aaatcggcac cgttccgttc aaaaacatcc ctcacacccc gaagcgcaga aatcctcaac 360 acctttaata acacccctac ttccccgata cacgctctga atttttcagc agctacacct 420 tcccaaacgc cgactacgtt atccagcgtg cacgtcagcg gagaattcaa ggcagcaact 480 aaacccaact tcaaacggtc ccaaccaact cctgaagctc acacaacgga tccgtcgaac 540 ccagtaatct tacccccacc tacaactgga cagccaattg ggccgatcat tgaggtagag 600 atggaggcag agaagactag ccgattgggc atggagaacg gtttcgagaa ctcgccgatg 660 gcgaagttat tacgggaata ctcaggagaa gcaggcgaaa ccgaagacct ggatacctcc 720 ctagtttcta cacaaaccaa cgtagggcta gtactggacc tgttggaatt atcagaagaa 780 atgatagtag atagaggacc cattagggac aaagggctac ctcgatacca cttggatgcc 840 ctgaccctgg gcagaataac taccgtagtg acggaacaac aaatctttct ggaacgagcg 900 gctggtctca tcgcgggtag aactaacttc tacaaagttg acccggggga cacgctgcta 960 ccaatactga aaggcacttc aagtgcacct cagctacgcg cagcgtgggt agcgttaaga 1020 catagaattg aactgggcac gaaggcctgg cgtaaataca taactgaata ccgacttccg 1080 atagactcac agccagtcct ttccctgtta tctactctgc ctgaactcta cctaccacta 1140 caagacatcg cggagcagga cgataaactg cggtatcttt acaagcacat cccccaccac 1200 caagaacaac tcaccagcga aggaaaacgc gcgctggaaa aaacccgctc atggctagac 1260 atcctgccag taccggacgc gttcaagaac gcttttgcgg aaaataagat accatcacct 1320 tcgacgttag aagttacaga cataagaagg gcgtcgaaag gaaaagaaaa ggaggaacag 1380 aatccacaca aaacaggcgc ccagtcttcg gtttggatgg gaatggaaac gccttttaag 1440 agctccaaca attggttcgt gaaacccggc gaaagcaatc gagtgaagca gccagggacc 1500 tccagacccc ctgcagaacc gaacatccta ttaggaatcg ctactcccgc agcgcctcaa 1560 acggtgacag catgggaagg ccgggagcaa ccacctcaca tggccgcaca agctcaggcc 1620 ctgaggtccg aaacgaagcc aggaagagct gaatctcgag cgagctccca cgcaagcgac 1680 agacaaggtc gtcatcgtcg tcgagagcac agggagacca gagacggccc cgacgagcca 1740 ccaagtagcg atgacgacag tagttcatcc aggtcaaggc gatctcacag gagccacagt 1800 tggaggagac 
gttcacgcac gcccaagcct cggagacgac ggtcatcgac tccccgacca 1860 aaacgaaggc ggtcgtcacc agacgatgac ggaggagacg gaagcagcag cagcgacact 1920 tcatactcgt cttctagcac tcactcaagg tctagacgtc gtagacgaag acgttctaaa 1980 gatagcgtcg ttattcctta cggtcggatt gccccaacta tagacgcgaa attgaaacag 2040 gaagacttac cgacgtggga tggcaacccc aacacagcaa tcgagtactt ttggaaggtg 2100 caacaacagg ccaccttggg ggggtacatc ccttccgcgc taggatattg gctctggttg 2160 aagctgaagg aaggatcaga cgtccaaaac tggttcgcca ctttaccttt cgaagagcag 2220 tccagaatgc gtggacattg ggtcgactac ttaaaaggaa tcaaggaagg ttatttggga 2280 cgtaattggc agtttgacat cggagaagca tacaaggccc aatacttcag acaaccgggt 2340 cacgaaaggg agctacctaa gtccttcatc acacgtcgca tcatgtatac taggatgtta 2400 gctaaatcag atgacggggg acctctcgaa gtccatttag tcatggcgag agctccattg 2460 gcgtggagaa ctatcctagt actggaaaac atcaagtcat cctcgatgtt gtacaccaag 2520 gccgccgaac acgaagcttc tcttctcgaa atctcgagaa gccgttcaca atcatccaac 2580 acaatcacct ccgagaatct tgtgtccact ttgaataaga tggggtatac cctcgacaga 2640 ccgaaattca ataactttcc acaaaacagg agagctaatc taacctctaa cgaaagggaa 2700 gcgaatccag caggggaagg gaccaaggaa tcctacacaa ctcaagcggt cagtcaagag 2760 tcgcaccaga acgaggagga gatcctcgct gaagtattcc aagtcttgaa gaagagacaa 2820 cgccctccgc cacctggagg atacctgttt tctaaaaacg atcacgttac tacgaaaatg 2880 ggaagactac ctccgtcacc ttgcaaatgt tgcggtagtg ctaaccattg ggataaggaa 2940 tgccccgact gggcagtgta tctagagaaa acggcgaagt caggctatgg caacgagcgc 3000 gaacttatag aggaggaaat gtactacctg agcacctaca gcattctact ctcacagcga 3060 gtggcatcgc tgcaagttga ccaaacaaag ctggagaagg attttagacc ggcagttcgt 3120 agtagcgtga gagatcaaag ctaccacggg cgtaagaccg aagtacagcc aaagaaaatg 3180 gccgtcactg tggaggaagt agaggatgag ttctgggaag aagacagaaa ccgagctaag 3240 agcgatactc acctacttag agcagaagac gacgaggatg ctccgctcga agagcctgag 3300 gagaggagac cctcccctag caaacaccct acaccacaac gccgtacaag agacactgcg 3360 ccgcccgcca ggaaacctac acatcaagtc tccatagagg aaatggagga cgaagacgtc 3420 gtagcagcaa ggatcaaacc aacttcgcca agacacttgt tgatcccaat ggacggaatt 3480 ggcgatcctg aagaaaacta 
cgaaccaaac cggtcagaag atatcaagac agcagaggga 3540 aacaaagaag cgcaccacaa cgcacggcga acggagtccg ctgacatacc gaccctcaag 3600 gatctacccc caccaccacc ggactccaag cccattcgcc tatgcaagaa acgtctgaca 3660 tcaccaggga tgtcttccct aggagtttcc gtcctgtcaa cgaaggggtg ggtagcagga 3720 ttagagaatg ctagcgtcga cctcagggtg gattcttgtg ctgatgtcac tttgatctct 3780 gaggaattct tcaacacttt gaaaggcgtg cctcgtattc agcaggggat gaggatgaag 3840 ctgtggcaac ttacggacaa aaatgaagaa cttagaggtt acgtacggat ccccatcttc 3900 atgcaaacca cggagggagt ggtactggaa tctgaagcgg aagcatatgt agtgcctaat 3960 atgacagtgc ccatactcct aggagaagac tatcagctaa attatgagct cggagtcacc 4020 aggaacgtgg agacgggtac caagttacgt ttcgcgggta ctgaccatga aatcgctgca 4080 gttcacgtca gccgtactcc ggactttgac agaatgagac agagcaccat gcttgtcaac 4140 aagttcgcca aagccaaagt acaccgtcga gaaaaagcca gaagacatcg tcgcaaactc 4200 aaatttggca tggaagagaa gacagtaagg gcagccgaag actgtatctt gaaacctcac 4260 ggatgtcgcc gtattaaggt ggaaggccag ttcgacgtcg ataaggattg gttagtacag 4320 aagagcttgt tagcaaatgc gaacgactct tatttcgccg tacctaatac cctgatttct 4380 gcccgcaatc cctgggttcc tatcgccaac ccgacggatc agccaagata cattcgaagg 4440 ggagaggtaa ttggttccat acacgaccca gaggacttct ttgagacgcc cgactctgcc 4500 gaacgtgcgg atgttctata caagcacgca gcagctatta aagcagtcat tggagcccaa 4560 atggaagcgg aaggcagctc gaacaaggct acgacaaacg tggacaacca gcagatcccc 4620 gaggaagaag aggaatacgg acctaaaaca gcagctatgc ccgatcttac ggattaccca 4680 tccgcgaaaa tagaggaact gatcgatgtg ggtacactac cagaccacct gaaagagaaa 4740 gcatggaaca tgcttaaaaa tcgtcaacaa gcattcggct tcgacggaag gttgggacat 4800 ttacccacta aggttcacat taggaccgcc gacggtcagg tcccaatagc tgttccaatg 4860 tatggcagtt cccccgaaaa aagacgcgtg atggacgaac agatggataa atggtttgag 4920 caagacgtaa tcgagccttc tattagtccg tggagcgcgc cagtggtcat cgcatatcgc 4980 aacgggaaac cgagattctg cgtcgattac aggaagttga acgctgttac gatcccggac 5040 aaattcccaa ttccgcgaca atcggagatc ctttcctcac tatcgggcgc tcaggtctta 5100 tcatcgttag acgccctgtc aggattcact cagttggagt tacatgaaga ggacgtagag 5160 aaaaccgcct tccggacgca cagagggcta 
ttccagttcc gacgaatgcc cttcggcttg 5220 aggaacggtc catccatctt tcagcgagta atgcaaggca tcttggcgcc ctatttatgg 5280 ctattctgtc tcgtatacat tgacgacatt gtgatattct ccaaatctta cgaagaacac 5340 atcacgcact tggacaaagt tctggaggcc atcgaaaaag caggaattac cctctcgcct 5400 aataagtgcc atttgttcta tggttccatc cttctgttgg gtcataaagt gtcaagacta 5460 ggactttcaa ctcactcgga aaaagtcaag gccatattag aattggagag acctaagaaa 5520 ttatcccagc tacagacatt cctcggcatg gtcgtttatt tctctgcttt cattccttat 5580 tacgcttcca tctgcgcccc attatttcag ctaatgcgga agggcgctaa atggaagtgg 5640 ggggctgagg aagaatacgc ctttcaagca gccaaggagg cactccgctc tggcccagta 5700 ctggggcatc caatagaagg tcgaccgtac cgtctgtaca cagatgcgtc cgacgaagca 5760 gccagatgtg cgctacaaca aatacaaccg atccaggtca aagatctcaa aggcactaga 5820 acgtacaatc gtttgaagaa ggcttacgag gaaggtctac caccgcctaa gctcactaca 5880 acgttgagcg ctaaaacgac cgacagccct gcggacgaca agtggggtga cgactttgat 5940 tcttcagtgg tacacgtaga gcgggtcatt ggttattggt caagaacctt taagggcgct 6000 gagactcgat actcgacgac cgagagagag gctctggcag ctaaagaagg gctagtcaag 6060 ttccagcctt acatcgaagg tgaaaagatc ttactggtca cggatcattc agctctgcaa 6120 tgggcgcgta catacgaaaa ctccaaccgc cggctagctg cttggggggc agttttttct 6180 gcgtacgcgc ctgggctaga aatcatacac agagcgggga gggtacattc taatgtggat 6240 ccgttatcac gcttacctcg agcagcacct gatcacgtct ctcctgaaat cgacccggga 6300 ccgagtatag tgacagattt ctccctagcg gaggaacaag aacgcactct caaccgaacc 6360 ttcaacaggg aaaccttcgt ggcatggagc atatcagagt gcctagaagg accgacgtca 6420 tcttgtttca cggaggctcg acaaggtatt gcagacttcg aaggagaaga ccctgtgagc 6480 gaaaatcagt cagcatcaag aggagacgag ttggacaccc taccagtcgc tgaggaatac 6540 tgggaggcct ctaatcctgc acctaatctc catgtagaaa tggacgcaag tttcatcagg 6600 gattgggttg aagaatacca atcggaccaa tcctttcgct cagtctggaa cgacgagaaa 6660 agggaaattg agaactggaa acgtgacggt cggttcttga aggatcaaag gggattgctg 6720 ttcttcttgg acgaagatta ccaacctcgc ctatgcgttc ctaagtcaaa acgaaatcta 6780 atcctcagag aagctcacga aaaccctctg gaatcagcac acgcaggtcc agagcgccta 6840 tatcagacac taagtcagaa attctattgg aagagaatga 
aggcggacgt actggagtat 6900 tgcaagtcat gtgatccatg ccagaagact aagtttggca atttcaacaa gttcggcttc 6960 ctgatcccca atcctatacc ctcacgtcca taccagtcaa tctccatgga cttcatcgtg 7020 aatctgccgt ggtccaataa tttcaatgct atcttcgtcg tagtggatag actcaccaaa 7080 cagggaactt tcatcccatg cacaacgggt ctcaccgccg aggaatttgc agaactattt 7140 gttaaacata tcatttgtag attcgggata ccggatagta tcatcacaga ccgcgatccc 7200 cgctggactt cagatttttg gaaaggggta gcgcgtttcc tcaaaacgaa aatggccctg 7260 tcatcagcgc atcaccctca gcacgatgga cagacggaga ttcttaatcg acacttaaca 7320 accatgctca gagcctacgt atcggatgac ctcgcagact gggccacttg gctacacatc 7380 ttagagttcg cgtataataa ttccgtccac ggatcgacag gcgcttctcc gaacttcctc 7440 gcgtatggtt tccagcctaa atcaccctta gacttcctat taccaaagga tagcccaaga 7500 gctaaatccg cactagtgcc taccactttt atgataaaac ctgggcattt gcctgcacag 7560 tcttttcaat ccccccaagt gaaatatgaa atgaaatgcc taggctattt cattcttgct 7620 ttgcagctat ttcattccta cccaccacat ttttcatttc tccttggcaa tgaaatgcat 7680 ttcattcttt tgcctgtgct ttttcataac tttttcattc tttgccaact ttatttcttt 7740 tcttttcatt ctttgccagt atcatttcat tcttttgcct gcactttttc ataatttttt 7800 tcattctttg tcagtcttat ttcattcatt tgcctgtgct ttttcataac ttatgtattt 7860 gtttggttat gatgggatca aaatataaga catgtgatct tttagttgcc tcaatttaca 7920 tgtttattgc tcaattaagt tagattatgg cctcaaatac aaacaaatgt aaagatcaca 7980 agccaggggt tgattaggtc aaactatagc taataatttg agaatacata tgtttattat 8040 ttacaaacaa ataatgtgtc tagttatgaa aaggcttaga gaagcaatga aatgagtttc 8100 atttttttta agtgcaaccc taaccttaaa catggcaaac aatgaaataa taagttatga 8160 aaaggtgcag gcaatggaat gaaataggac tggcaaagaa tgaaaaaaaa attatgaaaa 8220 agcgcaggca aaagaatgaa atgatactgg caaagaatga aaagaaaaga aataaagttg 8280 gcaaagaatg aaaaagttat gaaaaagcac aggcaaaaga atgaaatgca tttcattgcc 8340 aaggagaaat gaaaaatgtg gtgggtagga atgaaatagc tgcaaagcaa gaatgaaata 8400 gcctaggcat ttcatttcat atttcacttg gggggattga aaagactgtg caggcaaatg 8460 cccaggtttt atcataaaag tggtaggtac cactcaaatc cgctacttac tcgctgagcc 8520 ctgaagtccg aaacttccta gacactctgg gcatgcacag ggatagcgcg 
agacgggcta 8580 tagcaaaagc acaagatgag caagcagcac agtataacaa ggggcgtaaa cctgtaccag 8640 acttcaagaa gggagacaga gtcctcgtga atccacactc ccttgattgg atcgacgcaa 8700 aaggaagcgg cacgaaattg aagcagcgat ggataggtcc gttcgaaata acacaaaaaa 8760 tcaatcccaa agtcttccgt cttcgcatga gcgataaata cccaggtttt ccagtcttca 8820 acatcgagca cctaaaaaaa tacgaggaat caggaccaga atggggagaa cgcactcgaa 8880 tgcctgaatc gcgcagaaca cagatcggat ccgaggaatt cgaagtggaa accatcgtag 8940 gacatcgccg caagcgcaac gcactccaat tcctagtacg ttgggcaggc tatggtccac 9000 agttcgacac atgggaacca cagagaagtc tgcggaacgc ctccattgta ctaaacgagt 9060 acaagaaaag gcataatttg tgaagaagca tcgcatttaa agccattcca cggctatagt 9120 agtggaacat tatgcctcat tttcattatc taatcgttct cacagatttt ccttagtcag 9180 catatttttc ttatctttta tatctttctc tttcttttat taccactcca tagtcgggtc 9240 aagcgaaagc gtcggatttc ccgcttaagc catctgacag gatcctcgag tctttttcgg 9300 atttccgttt cagaccgaga ttcatggatt tcctcccact ggatgataca ggcctcgtac 9360 cccaggtgga tttcaggacg atcggagaag atctctgggt aggtctcaac gcctcttgtg 9420 acaaaatccc atcggaagcc tccgcttacg tcgattacca aagtaatcag cgggtagcgg 9480 acgagttcag ggataaaggg ccgacaggac tcaacttgga gatggaatgg gccttggacg 9540 cggattggta cgatcccgag gcaagctggc gcccatttct gcctctgccc tcgggaatgg 9600 ccacagaatg gtactatcaa atggatcaag gtaccccgtc cgacccagag tcgatctcgc 9660 ctcaatacac catcttcccg ccgttgttat cggagatgga gaaggacctt catcggttcg 9720 agtcatacgt agtagccata gctaagtcat ctattttccc agctagagcg gcaaggccag 9780 gcctttacga ctttgagcag ctgaaagggc ccttcgactc aatccggtcc ctcgagaact 9840 ttggcgccaa cgtgaaaaga caggccttgg actacctcag tttccttgct tggtggacgt 9900 tgtctgcttc cggttgggat gtctacctcc cgcaggacac agtcgacagc atcatggact 9960 taggcctcga cgaccgtccc aagagaggcg tcctcctcga tttggacagg gactggaaac 10020 agatgagtat tccacactta ctccgtcacc gagttccagt cttttacaga tggaacgagt 10080 ctctccaatc aaacaaccgc ttcctctccc tatcccctta catactccga gcattcgaca 10140 aacgcaggaa agccacaccg agcgaacggg ttttcgccat cgacctgccg gaatttacga 10200 acgacttcga ggtcatgaaa gactatgacg aattcttcca acgccgagtc 
tttcataacg 10260 aaccctcgcc ggacctcgaa tttagggcgg attggcatta cgcggtagtg gatttccagg 10320 gttggatgta ccgaccaatc gacttggaga cagccaaaga attggcaagg cgtttcggat 10380 ctcacgttgt caggcgcgag aacaggactt ccgttatctt caggcgatgg gaggcttttc 10440 agttcgacga cactatcgca gaaccagcag ggctcgctga ggcgtacgtc gacagggatc 10500 ccatgagggg caacatcgaa atccgagaga ttcataaatc cttcttcgcg ccagttcaga 10560 accagaagtt tgacttgaac ggatttccag actatggccc cctaccagga aacaacctct 10620 tgcgtcctac tagacctagg agctgggttg aagctatgtc atcagcaggc catacttcga 10680 gcagaagcag ctcgtctggc cctagtcgcc gtcaatcctc gtcgaagaat ttgaacagat 10740 ccagtccata tcccagaccc cgatcatcct cgccgtcgtt cagagctact taccaacata 10800 gggacgagtc gtcaccagaa accacgaagg aacagttcat ccgtcgcctc cgcgttcagg 10860 gtaacatcgt cgggacggca tcattgtggc tatctcccgt tgaagctgaa tggaacccct 10920 tgttccttga ggaaggcgta ctcttatttc cagacaatag gacacaaatc agggtcaggt 10980 attggaccat atgcgggtcc ccctcaattc gtatgcgcca ggtattggat ttcgccataa 11040 gcagaggaat gaagttcctc atcgccatac ccttcgacgc ccttccccac ttccaccagc 11100 ttgagaaaca gaacatggcg agcctcacga aacggacata tgacacggga ttccaagaaa 11160 gcccgttgac atatagcaag ggaggatccg cttttatgga tcagtatctc gggaagctag 11220 cagacatctt acgtcggcca cacgcgagag cagtaatcgc gatgggcggg ccaacgagtt 11280 ggattgcgcg cttctatgga gggagtcgtt tagtggaaga atacatgagc ggaccgtcca 11340 gtcaggttac agttcaccac agaggcggag tgacggtcgc ccctttcctg gacatgccgg 11400 tgttccacga tcaactatcc aagcaagaaa ttgagctcat tcatggatat attcccctcg 11460 ggaatcctaa tgaagaccga tgggcgttcc cgaccagcga gctcctggaa gaattctcca 11520 accactggcg aggagaatgg aaccaaggtt gcgaacggat catgggcaat atagcccgtg 11580 cattggaatc aggagccttg gctcctctta ctaggcggga atggagggag tacctacgga 11640 acaataacag aggcgaacat gctccatcac ccggctctat ccccaaggat tctgatttcg 11700 ctttcgtcga gaacgctatc gatagctcct accctgtacg ttggcacggt cgtcgcatca 11760 gagacatctt cctgcctgaa gattttgagg tttcatcgag cggaaactag gggggctt 11818 // ID Copia-9_MLP-I repbase; DNA; FNG; 4272 BP. XX AC AECX01000958; XX DT 22-APR-2011 (Rel. 
16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-9_MLP_; KW Copia-9_MLP-LTR; Copia-9_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4272 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000958; Positions 64504 68775. XX CC 'CTGTT' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 381..2087 FT /product="Copia-9_MLP-I_2p" FT /translation="MWMALTSLHALSSPANKAKIFGQFYALSCPTSGDLKT FT FLSEVRLVDQQLTRIGFKVDSEVLAYFTLFKLPQELDSVRSALLFGGKEVT FT LKFVLDSLDQTSTTASTPTITVKTESALAAVPEHSPAASRPRCTHCKRMGH FT LVDTCYRLKPHLNPCNSLASVAEQVYTVSKSTSSNLTILDSGTTSHMFNIL FT SLLQNPQPCNIKVGIGESGRFMTATHMGTVSAVACSGSLQLGQALFIPELS FT RNLLCYNRFFEKGYYICPSTPGRFNPTNGSHILINGTVNNHLFYPDITFSS FT SGTLGSVSFIAHGIKTSTSLWHNRLGHPSPSYLSSILSLVGAGPSDEAVCA FT TCATSKAHRLPFSGNLPRPLAPLDVVHSDLSGMITPPTISGFRYYMKITDG FT FTSYRHIFLLRSKDEAFSQFQLYANNVETFHSRKIKKLVSDGGGEYVNKSF FT STWLSKKGITHQVTAPYTPEQNGVSERGNTITVERARCLLSTAGMPSNLWG FT EAVTTAIYLENRLPSKSTNFSTPYELWHGRVASYSHIRTFGCAAFVHLPKF FT KREGKFGPTAQRGVLLGFCEGT" FT CDS 2656..4041 FT /product="Copia-9_MLP-I_1p" FT /translation="MVPLTALRLDLLPKDFLKFPVSTLEKLMPLIPGQLST FT LRLLIAICIVFGWDFDQMDVVTAFLNGLLDKEIYMDPPPGLPNSEGLKCRL FT HRTLYGLKQSPNRWYHRIYTWMLSMGFVVSEHDPCLFIRAKPVSPCFVYIH FT VDDLAIFGKNITWFKDAIKKEFKMVDHGPAHFLLAMRITRTPDSISLCQDW FT YILSLLELFGMTECRPVSTPFVPDTHLVPATQSEIDELNALGVSYRGIVGC FT LGYLVQCTRPELAYPCSILGQFLENPGITHWMAAKHVLRYLSGTRLVGLTF FT RNKPSSLNIIGYSDSDWAACRFSRQSMCGYTFIICGGTISWRAKKQVSVAS FT SSTEAEYRGLLDAGKEALWLRGLYQSIIPSNSTDPNILFCDNQAAIALTKS FT SAFRANTKHIEAHFHWIRDQVSSGLISIPYCPTGEMAADIFTKALDRVKLV FT RFCAMMGIGPCVAPSAK" XX SQ 
Sequence 4272 BP; 960 A; 1251 C; 879 G; 1182 T; 0 other; ggttatgagc ccagcttagc gctaactcat acaaacaaat acaaacacaa gattaaaatc 60 cctataatga aataagaaga gcaatcagtc ctcggtcaaa attcattttt ttttttatca 120 tgtccgatga aaccccctct cgcgagcgat ctggaatccc tctgctcaaa tctgacaact 180 ttgcggtttg ggaattcaaa atgaagactt gactacgtgt tctcaaggtt ctgaacatcg 240 ttgaaggaaa cgaggcggca ccggatccaa tcacggatgc ctggaacgat agaatcaatc 300 tagctctatc tgaaatcgtc accaaattag atgacgccaa tacctcccat gtcttcgagc 360 acgtcaatga tcccgcagcc atgtggatgg cactcacttc tctccacgct ttgtcttccc 420 ctgccaacaa agcgaaaatc tttggccagt tctatgctct ctcctgcccc acttcaggag 480 acctcaagac ttttctgtct gaggttcggt tggttgacca acagcttact cgcataggat 540 tcaaggttga ctcagaggtt ctagcctatt tcaccttgtt caagctcccg caagaactcg 600 attctgtacg ttcggccctt ctgttcggtg gcaaggaggt cactctcaaa tttgttctcg 660 acagtctcga ccaaacttca accaccgcct caactcccac cattaccgtc aaaactgaat 720 cagcgctggc cgccgttcct gagcatagtc ccgcagcaag tcgacctcga tgtactcact 780 gtaaacgtat ggggcatctt gtcgacactt gctaccgcct gaaaccccac ctcaaccctt 840 gtaactcttt ggcttcagtg gcagagcagg tctatacggt ctccaaatcg acttcctcca 900 acctcaccat tctggattct gggaccacat cgcacatgtt caatatattg tcccttcttc 960 agaatccgca accttgcaat atcaaggttg gtataggtga atcaggtcgg ttcatgacgg 1020 caacccatat gggcacggtg tcagctgtgg cttgttcagg gtcactccaa ctcggtcagg 1080 ctctgtttat cccggagctt tcgagaaatc tgctttgtta taacaggttt ttcgagaagg 1140 ggtactacat atgtccttct acccctggaa ggttcaatcc cacgaacgga tctcacattc 1200 tcatcaatgg gactgtcaac aatcatctct tttatccgga catcacattt tcatcttcgg 1260 gaactttggg gtcagtatct ttcatagctc atggcatcaa gacctccact tccctttggc 1320 ataatcgcct gggtcatcct agcccatcct atctttcttc cattttatct cttgttggag 1380 ctggtccatc tgatgaggct gtgtgcgcca catgcgctac atcaaaagcc catcgtcttc 1440 ccttctctgg caaccttccc cgcccgttgg ctcccttaga tgttgttcac tccgatctga 1500 gcgggatgat cactcctcca acaattagtg gctttcgtta ctatatgaag ataactgatg 1560 gcttcacctc ctatcgacac atttttctac tcagatccaa agatgaagcg ttttcacaat 1620 tccaattgta tgcaaacaat gtcgaaacgt 
ttcactcccg caagatcaag aagctcgtat 1680 ctgatggagg tggggaatat gtcaataagt ctttctcgac ttggctttcc aagaaaggca 1740 tcactcacca agtcaccgct ccgtacacgc ctgaacaaaa cggcgtgtct gaacgcggca 1800 acaccatcac agtcgaaaga gcccgctgtc tactttcaac tgcaggcatg ccgtcgaatc 1860 tttggggtga ggcggtcacc acggccatct acctggagaa tcgccttcca agcaaatcaa 1920 ccaatttttc cacaccgtat gagctctggc acggacgagt ggcatcctac agccacatca 1980 gaacattcgg atgtgctgcc tttgttcatc ttcccaagtt caagcgtgaa ggcaagtttg 2040 gtcctaccgc tcagcgcggt gttctccttg gtttttgtga gggaacttga aacttcaggg 2100 taattgatcc gacttcaatg aagatcactg tttcccacga cgtctctttc gacgagtcct 2160 ccttcccatt ttcaacgctt tgccacaatc cggatctcaa ctccatctct gagctattcg 2220 aagaggatca atctccttcc atcgacactt ccaaacctcc ccaacgactt gttcttcgga 2280 ttgggcctcc tccggctacg attccacttc ctgactctcc aaccactgag tccacgacct 2340 atgagacggc cgagtccaat ccctcgactg cagcccccgt cagtccacct tctccttgtc 2400 ccgttcgcaa ttgacagcct cctgaacgat acggagaatt tggtgctctt gccttcaatg 2460 ccgagatcat acctcctggc gaaccggtca cttaccgaca agccttggcc tccgcagact 2520 gtcatcaatg gaagtcagcc atggaccaag agcttcaatc tctggacgag tgcaaaacct 2580 ggtcgctaat caagctcccc cccggtaaac attccattgg ttcaaaatgg gtctttaaaa 2640 tcaaatacaa gtcagatggt tccattgacc gctttaaggc tcgatttgtt gccaaaggat 2700 tttctcaagt tcccggtctc gactttggag aaacttatgc cccttatacc tggtcaactt 2760 tccaccctcc gccttctcat tgccatctgt atagtgtttg gttgggactt cgatcagatg 2820 gacgtcgtaa ctgccttcct caacggtctt ctcgacaagg aaatctatat ggaccctcct 2880 cccgggcttc ccaactcaga aggtctcaag tgtcgacttc accgaaccct ttacggcctg 2940 aagcaatcac ctaatcgatg gtatcatcgc atctatacct ggatgttatc catgggtttt 3000 gtggtaagtg agcacgatcc gtgtctcttc attcgtgcca aacctgtctc tccgtgcttc 3060 gtctatatcc acgtggacga cctcgccatc ttcgggaaga acatcacctg gttcaaagac 3120 gccatcaaga aagaattcaa gatggtcgat cacggtccag ctcacttcct gctcgccatg 3180 cgcatcactc gtacccctga ctcgatctct ctctgccaag attggtacat tctctccctg 3240 ctcgaactct tcggcatgac agaatgcaga cctgtctcaa ccccttttgt tcccgacacc 3300 catctcgtcc cggcaaccca atccgagatc gatgagctca 
atgccttggg cgtatcctac 3360 cgaggtattg tgggctgcct cggctacctc gttcagtgca cacgcccgga actcgcctat 3420 ccatgcagca tccttggtca gtttctcgaa aatcccggca ttacccactg gatggccgcc 3480 aaacacgtcc tacgatatct ttccggcact cgacttgtcg gtctcacctt ccgcaataag 3540 ccttcctccc tcaacatcat cggctacagc gattccgatt gggccgcctg ccgcttctcc 3600 cgtcaaagca tgtgtgggta caccttcatc atctgtggcg gtacaatctc ttggcgtgcc 3660 aagaaacaag tctccgtcgc ctcatcctca actgaagccg agtatcgagg tcttttggat 3720 gctggtaagg aggccctttg gcttcgtggc ctctaccagt cgattatccc atccaactcg 3780 accgatccga acattctctt ttgcgacaac caagctgcta tagccttaac gaagtcttcc 3840 gcctttcggg ccaacaccaa gcacatcgag gcccattttc actggattag ggaccaagta 3900 tcttccggcc tcatctcgat tccctattgt cccactggtg aaatggccgc tgatattttt 3960 accaaggcct tagatcgtgt caagttggtc cgcttttgtg ctatgatggg gatagggccc 4020 tgcgtggcac cttcggcgaa atgacggctc atgtcctgat cgcgtttttt tttcttcttt 4080 ctctttgcat caggacggtc acgtgggacc cgagtgcgtg ttatcctgtg ctgttttctt 4140 ttcaagctac tccacttatt ggttgcatgt gatcaggatg tatgctgctt atcatttttt 4200 tttctctttc tcagctttgc taccttcttt tcttgagagt tgtgaggcac cctgtgcatc 4260 aaatgggggg gg 4272 // ID Gypsy-94_MLP-I repbase; DNA; FNG; 5715 BP. XX AC AECX01000333; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-94_MLP_; KW Gypsy-94_MLP-LTR; Gypsy-94_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5715 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000333; Positions 23088 17374. XX CC 'ACATC' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 371..1462 FT /product="Gypsy-94_MLP-I_2p" FT /translation="MSDITIESLVKKMEEMNNRVMEESRLREEAEKRWLEL FT KNSIDQSNASNINTQTATQAPPSITTPAPVQQVQHIRPPKIATPNKFDGAK FT GQKAEVFVNQISLYMQMNADSFINEQAQVAFALSYMDGKASLWGQSMTDQL FT LDSERMRLVTWSKFIESFKATFFDTERVAKAEKEIRALRQVKSVADYWIKF FT SEISLIVKWPQSVLISQFKQGLKGEIRVHMVRDVFEDVEDMAKLAIKIDNE FT VNERNYDSLTHQAKVTAATPSTSTTPLNPDAMDCSAYRFNISNDEYQRRGN FT AGQCYNCGKDDHYVGGCPLRKNVGGRGTWRGSWRGRGGYRYKSKLAEIDGV FT KEEGKEESIAENSKNGGAQEC" FT CDS 1633..5619 FT /product="Gypsy-94_MLP-I_1p" FT /translation="MSRSFALKHRLSTEPLPQVRKVTGFSGHETQVTHTGD FT FHINTLNSTPTTFILTDLKDKYDVILGMPWIKRNWKLIDWKNSKLINEEST FT IAVTEVTSLIPTTTSKALVEPKRNARNINEGVEFENSLTPPQCECDNLPPH FT SFPEAAGKRSLLLELKNKPILVENEKHNEDEDMKTPSRNDTKTATIDKIVL FT SMSNQALKDQSGLEPERHARQNDKGVEFRTNSIKPPQSKCIQSFHHASKNI FT GKRFSLSNSVINNLSSRSQKPKSTMLKVREKMTPSQMMINAAKASWNLSAK FT LAAEQIQGAPEKTAAELVPECYHEYLYMFEKSKSNVLPPHRPYDFRVDIIP FT GATPQAGKVIPLSPKESEVLKEMLEKGLSNGTIRRTTSPWAAPVLFTGKKD FT GNLRPCFDYRKLNAVTVKNKYPLPLTMELIDSLLNADEFTSLDMRNGYNNL FT RVREGDEAKLAFICKEGQFEPLTMPFGPTGAPGFFQYFIQDILKNHIGRNV FT AAYQDDILIYTGPGVDHQAVVKEVLKILKEQNVWLKPEKCKFSQKEISYLG FT LIISKNQIRMDESKVKAVKDWPRPKNLTEVQTFLGFSNFYRRFISHFSKIA FT RPLHELSQKDMAFEWNEDRERAFETLKTAFTTAPVLKIADPYRPFVLECDC FT SDYALGAVLSQISEDDNELHPIAYLSRSLIQAERNYEIFDKELLAVVASFK FT EWRQYLEGNPNRLNVIVYTDHKNLQSLMTTKELTRRQARWAEILGEFDFEI FT RFRPGRQSAKPDALSRRPDLIPPEGTKLTFGQILKPDNLPKDAFLDDLDIA FT DMWFVNEDEIHIDEINEVEDEELESAEDNQRIWNDLKILSNIKEKSGEDLK FT MNEIKKLCQEMPNSKLLKDYKLNDDILYHRNKVVIPNDLELKLQILRSRHD FT SRLAGHPGRMRTLALIKRSYYWPGMKAFINQYVDGCQSCQRVKSRTEKPFG FT SLRPLPVPEGPWLDVCYDLITDLPISKGFDSILTVVDRLTKMAHFIPCMKT FT LTSEELADIMIRDVWRLHGTPRSITSDRGNVFISRATKDFHKRLGIKTQSS FT TAYHPQTDGQSEITNKAVEQFIRHYTSYKQNDWHDLLPFAEFSYNNNHHVA FT IGMSPFKANYGFDVAFTDVPLSEQCLPAVEDRMNQIKDVQRELRDAMNLTQ FT ETMKTQHEGKTKKSPSWKKGDKVWLSNKNIATTRPTVKFSHRWLGPFTITA FT HVSDNAYKLDLPKSMHRIHPVFNVNLLRKFEKSKIEGQNQRPSPPIIIDNE FT EEFEINEILDKRKKGKKVEYLINWKGYGPNYDSWEPESGLKNAKDIMNEFN FT RKYPQAEDRYKKARRKN" XX SQ 
Sequence 5715 BP; 2000 A; 1090 C; 1259 G; 1366 T; 0 other; tattgctacg tcttatttca ttcgagatcc aagagaaaac acttacggca agaaagaaaa 60 aatccgaaga gaagaaatca aaattttaga agtgaaagtg attagaagtt taaaacaaag 120 aaaaagttca ttaaaagacg aagaaagtta aagattgaaa ttaaaagaag tactgtgaag 180 aaagtaaaac agagaagaga ttcaaattag attaagttta tccgacagaa gattttccaa 240 accttatctt gattatccat tgtatcgacg caccccgcaa attcgctaaa aaccttaatc 300 cgccaccacg ccagacttca ctgcaccaga tagtccaacc ggacaggacg aggaattcgt 360 agacgctgaa atgtcggaca ttactataga aagcttggta aagaaaatgg aggagatgaa 420 caacagagta atggaagaga gtcggctgag ggaggaagct gaaaagagat ggttagaatt 480 gaaaaatagc atagatcaaa gtaatgcgag taatatcaat actcaaactg caacccaggc 540 cccaccttct ataacgactc cagctccagt acagcaagtg caacacattc gaccaccgaa 600 gatagccacg ccaaacaaat ttgatggcgc taagggtcaa aaggcggaag tttttgttaa 660 tcaaattagc ttgtatatgc agatgaatgc ggattccttt attaacgagc aagcacaagt 720 tgcgtttgct ttatcttata tggatggcaa ggcgagtcta tggggtcaaa gcatgacaga 780 ccaattactt gactcggaaa ggatgcgact tgtgacgtgg agcaaattca tcgaatcttt 840 taaggccaca ttctttgata ctgaaagggt tgctaaggca gaaaaggaaa tacgagcgct 900 gcgtcaggtt aaatcggtgg ctgactattg gattaagttt tcggaaatct cgttaattgt 960 aaaatggcct caatcagttt taatatctca atttaagcag ggactaaaag gtgaaattag 1020 agtgcatatg gtgcgggatg tgtttgaaga tgtggaggac atggcaaaac tagcaatcaa 1080 gatcgacaac gaagtcaatg aacggaacta tgactccctg actcaccaag cgaaagttac 1140 tgcggctact ccatccacct caacgacacc tctcaatccg gatgcgatgg attgttctgc 1200 ttatcgattc aacatatcta atgacgagta ccaaagaagg ggaaatgcag gtcaatgcta 1260 caattgtggg aaagatgatc attatgtagg aggatgtccg cttagaaaga atgtaggagg 1320 aaggggtacc tggagaggga gttggagagg aagaggggga tatagatata agtcaaagtt 1380 agctgaaatt gatggtgtta aagaagaagg aaaggaagag agtatagctg aaaattcaaa 1440 aaatggcggt gctcaggagt gttagttgtg ccacacctga gcggtaaatt gttgaatcta 1500 gaccatagta ttatcaatga acttgaaata aaagatacgc gtatttttca tgatatcact 1560 attattgatt cctcatctgc cacaaccgta attgccaaag ccctcgtgga cagtggagcc 1620 acccacgagg caatgagtcg aagcttcgct 
ctgaaacatc gactaagcac agaaccgtta 1680 cctcaagtgc gaaaggtgac tggttttagt ggacatgaaa cacaagtcac tcatacgggt 1740 gacttccaca tcaacacttt gaactcaaca ccgacaacat ttatcctgac cgacctgaaa 1800 gacaaatatg atgtaatctt aggaatgcca tggatcaaaa ggaattggaa gcttattgac 1860 tggaagaaca gcaagttaat caatgaagaa tcaaccattg cagttacgga ggtaacttcg 1920 ttaataccga caacaacctc aaaggccctt gtggaaccca agaggaatgc taggaatatt 1980 aacgaggggg tggagttcga gaactcatta acacccccgc aatgtgagtg cgataatcta 2040 cctcctcatt cttttcccga agcagctggc aagcgttctc tccttctaga attaaaaaac 2100 aaaccgattc ttgtggaaaa cgagaagcac aatgaagatg aagatatgaa aacaccgagt 2160 agaaatgata ccaagactgc tactattgat aaaatagtat tgtcaatgtc gaaccaagcc 2220 ttgaaggacc aaagtggatt ggagcctgaa aggcacgcta ggcaaaatga caagggggtt 2280 gagtttcgca caaactctat taaacccccg cagagtaagt gcattcaatc ttttcaccat 2340 gcatcaaaga acattggcaa gcgtttttct ctctcaaatt cagtgattaa caacctatca 2400 tcaagatcac aaaaacccaa gtcgacaatg ttaaaagttc gagagaagat gacaccgtct 2460 caaatgatga taaacgccgc caaagcatca tggaacctat cagcaaagct cgcagctgaa 2520 cagatacaag gagcacccga aaaaacagca gcagaattag taccggaatg ttatcacgag 2580 tacttataca tgtttgaaaa gtcaaagtcc aatgtattac cgcctcatcg cccttatgat 2640 tttcgagttg atatcattcc tggagccaca cctcaagctg gcaaagttat tccgctgtca 2700 ccaaaggaaa gtgaagtcct caaagaaatg ctagagaaag gcttgtcaaa tggcacaata 2760 agacgaacaa cttcgccgtg ggctgcacct gtgctcttta cggggaaaaa ggatggaaat 2820 ttaaggcctt gtttcgatta ccgtaaattg aatgcagtca cagtcaagaa taagtatcca 2880 ttaccactga caatggaact tattgatagt ttattaaatg ccgacgaatt cacaagcctg 2940 gacatgcgca atgggtacaa caaccttaga gtacgtgaag gcgacgaagc aaagcttgca 3000 tttatctgca aagaaggaca gttcgaaccc ttaaccatgc catttggacc aactggagct 3060 cccggtttct ttcaatattt tatacaagat attctcaaga atcatattgg aaggaacgtt 3120 gcagcctacc aggatgatat attgatttac acgggacctg gcgtagacca tcaagctgtt 3180 gtaaaagagg tactcaagat cttaaaagaa cagaatgtgt ggttaaaacc ggaaaaatgt 3240 aaattctctc agaaggaaat ctcatatcta ggccttatca tttcaaagaa tcagattaga 3300 atggatgaaa gcaaagtcaa ggcagtaaag gattggcccc 
gaccaaagaa tctcactgaa 3360 gtacagacat ttttaggttt ttccaacttt tatcgccgct tcatcagtca cttttccaaa 3420 attgcacgac cattacacga actatcacaa aaagatatgg ctttcgaatg gaatgaagac 3480 agggaaaggg cgtttgaaac attgaagact gcgtttacga cggcaccagt gctgaaaata 3540 gctgatccat accgcccgtt tgtgttagag tgtgattgct cagactatgc attaggagcg 3600 gtactatctc aaatttcgga agacgacaat gaattgcacc caatagccta tttatcacgg 3660 tctttgatac aagcagaaag aaattatgag atctttgata aagaactcct agccgttgta 3720 gcctccttca aggagtggag acaatacttg gaaggaaatc ccaacagact gaatgttatt 3780 gtttacacgg atcataaaaa tttacaatct ttaatgacaa caaaagagct aacacgacgg 3840 caagctagat gggcagaaat tctgggtgaa tttgactttg aaatacggtt tcgacctggt 3900 agacaatcag ccaagccgga tgcattgtcc agacgacccg atttaatacc accagaaggc 3960 acgaaattaa catttggcca aatattgaag ccggataacc tgccaaaaga cgcctttctt 4020 gatgatttag acattgctga tatgtggttc gtgaatgaag atgaaatcca cattgatgaa 4080 attaacgaag ttgaagatga agagcttgaa agcgctgaag ataatcaacg gatatggaat 4140 gatttgaaga ttttgtcgaa cattaaggaa aaatccggcg aagatctgaa gatgaatgaa 4200 attaagaaac tatgtcaaga aatgccaaac tcaaagctgt tgaaagacta taagttgaat 4260 gacgatattc tgtatcacag gaacaaggta gtcataccca acgacttaga attgaagtta 4320 caaattcttc gatctaggca cgacagccga ttagctggac atccgggaag gatgagaacc 4380 cttgcactaa tcaaaagatc gtattattgg ccggggatga aagcattcat aaatcaatac 4440 gtcgatggct gtcaatcttg ccaacgcgta aagtctcgaa cggagaaacc gttcggcagc 4500 ttacgtcccc tcccagtacc tgaaggacca tggcttgatg tctgttacga tttaatcaca 4560 gacttaccaa tatcaaaggg tttcgacagc atattaactg ttgttgatcg attgactaaa 4620 atggcacact ttataccatg tatgaagaca ttaacatcgg aggaactagc tgacattatg 4680 attcgagatg tttggaggtt gcatggaaca cctaggtcaa tcacatctga tcgaggaaac 4740 gtgttcatct cgagagcgac gaaagacttt cacaaacgac tgggaataaa aacccaatcg 4800 tcgactgcgt accatcctca aactgatggc cagtcggaaa taaccaacaa agcagttgag 4860 cagttcataa ggcattatac atcttacaaa cagaatgatt ggcatgatct tcttcctttt 4920 gcagagttct cgtataacaa caaccaccat gtagccattg gaatgtcacc cttcaaggca 4980 aactatggat tcgacgttgc ttttacggat gtgccattga gcgaacagtg 
tttaccagcg 5040 gtagaagaca ggatgaatca aatcaaagat gtgcaacgag aattgagaga tgcgatgaat 5100 ttaacccaag aaacaatgaa gacccaacat gaaggcaaaa cgaaaaaatc accgagttgg 5160 aaaaaagggg acaaagtttg gcttagcaac aagaacatag ctacgacaag gcctacagtt 5220 aagttttcgc ataggtggct aggtccattc acaattacag cgcatgtgtc agataatgct 5280 tataaacttg atttacccaa gtctatgcat aggatccacc cagtatttaa cgtcaattta 5340 ctccgaaagt ttgagaagag caagattgaa ggacaaaatc aaagaccatc accgccaata 5400 ataattgata atgaagaaga atttgaaatt aatgagatat tagacaaaag aaagaaaggg 5460 aaaaaagtag agtatctaat taattggaaa ggttatggac ccaattacga ctcttgggaa 5520 ccggaaagtg gactcaagaa tgcaaaagac attatgaatg aattcaacag aaaatatcct 5580 caagcagaag atagatataa gaaggcacgg agaaagaatt gagggtgaag ctttttccct 5640 atggggtttt ttaatgccaa cccgtggaaa gatgctaacc tgcaagaggg ggttgagtca 5700 taaagggggg agtgg 5715 // ID Gypsy-96_MLP-LTR repbase; DNA; FNG; 177 BP. XX AC AECX01000490; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-96_MLP_; KW Gypsy-96_MLP-I; Gypsy-96_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-177 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000490; Positions 129546 129722. XX SQ Sequence 177 BP; 50 A; 44 C; 27 G; 56 T; 0 other; tgttatgact ctacctgtga aaggttcaac acgcatgtca cacatagtag actatttgta 60 tatacgtcac tagctcatac tgtcctattt ccttcatgcg acaatcttat aatagagacc 120 agatgaccta ttgattcctc tcttctgttg acaagatcca cacctagggc cttaaca 177 // ID PIF_Harbinger-1_PB repbase; DNA; FNG; 3125 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 
16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW PIF_Harbinger-1_PB. XX OS Phycomyces blakesleeanus OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Phycomyces. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 3125 BP; 957 A; 579 C; 607 G; 982 T; 0 other; gggggtactc agccaggttg atcgctcaaa ttttaaattc agaacgcgat attattggac 60 taaaattgta atagcgacga tcaaaatttg atcgtctttt tgaaagatca aatttcgatc 120 aaatttcgat caaattttga tcgaaaaacg tacggctctc tgattggctg aatccttact 180 tttttatatg caatacgata ctgtttttat ctttctcctt aataaatcat agaatttact 240 gtcaacaaag tcatgtaaca cttctttcac tccgtataac atgccaaaat acagtgcaaa 300 gcaacaaaca gttgctgcct taaagaaatc gcgtaaaatc agaaaatttg tagcgaagga 360 aaaaaaaagt ctggtacaat ctttatcaac aagcgcagat gagctgttga aaaatatttc 420 agaagtcctt gaagaggaag ttgatgcaga gttggagaag atagctgttg tggaggatct 480 tgagcagaac cttcaagcca acagatactt acacaagggt ggcaatacat tgccaaagct 540 taaatcaaag gaagaaaaac ttgctttcct cgaagaccta gatgcagatg ggttcaaaga 600 agaaatcaga atgtccaaaa gctcgtttaa taagctgtac gagataatca aagaccattc 660 tttatataag tcaccgaaag gacataagca gactgatgta aagcttcaat tagctttggt 720 cttggagaga ctaggctcag acgggaatgc tgtatcctat agcaggcttg cacgaagatc 780 tggtgttgga ggtaagcgtc ttttagtaaa ctgacttccg ttaatctata taaacatata 840 attaatccat tgaactaatt attttaaact tattagaggg cagtgtctta aacttcacag 900 ttcgtttctt caaggtaatg ctatcaatgg aaaacatgta cgttttttgg ccgtcagaac 960 tcgagaagac agcaacgata atcccaagca atgagaaacc attcggattt cctgatcttg 1020 ttgggttttt ggatgggtgt ctgtttaact tgcagcatac accgtcgtgg aaaaagccag 1080 aattctttgg tcgtaagcgc acatactgtg taaacactct tggggtatgc gaccatcaaa 1140 agaagattcg ttatctctct tgcggatact tcggaagcat gaacgatgcc agggtgtttg 1200 aagaatcgac catgggtagc 
gaccccaata acttcttctc tggtgaccag tatgttttag 1260 ccgatgctag ttacattccc aagacgtacg ttgtccctgt cataaagaag ccaaaaaaca 1320 aggaattgtg tgatgcggac aaagcgttca attcatacat agcaatgatg aggataaaaa 1380 ttgagcatgc atttgggatc ctcaaagcaa gattctgctc tttgaagcga ctcccaatta 1440 agataagaag tcgaaaagac atgaacatgg tcaattcttg gatcagagtt tgtgtcattc 1500 tccataactt tttgatcgac cagacagatg acatgatgac aatgacgatg aagatttcat 1560 gggagaaaag agaaaaagaa gagatcgaaa ggatcagcag agaaggtgtg actggcactg 1620 tagatgacta tccagcaatg tcattgtcag atgagaatgc taagctcaaa cgtatccgta 1680 tcaaagaaga gatactcata aaaaacggag atgggaacct tcttaaaaaa aagtaaaaaa 1740 taaagaaata aattatctca aaagtagaaa tgatgtataa caggtattct atgatagttg 1800 ctgtcattat taaattgata gtctttaact accatacgtt ttattgtaaa cgtcgtcagt 1860 ttgctcttgc actttttcat ctgaccagtt ttgcactttg gccaattctg caactgcttt 1920 catcatgttt tcgaattttc tgtgtataaa ttcgcgttct cgcttaatgt gaagatcgtc 1980 cttaagaact tctagctttt ccctataaag ccgttctatt tttgcctcta gggacacaga 2040 agactcatta agttgtctcg tcacctcttc tgccaactct ttatattcga cgtctactct 2100 tctgcgtttt ctagaagagc ttgacttagg aggaggggag cagagtacag acatgttatc 2160 attttggctg tcatattcct cattgttgct gctatgactg ttttcagctg attcttcgtc 2220 cgtgttgtca tcatcgccat tgctttgctc gccgtcctcg tcatgtatcc attccattgg 2280 ctttgtggta tcacaaaccg ctggagaacc agtctttcta ttgcccataa ccttttccat 2340 ctgaaagaaa gcagggcaga tttgattcaa ttcagctatt gttatacaaa ttacattaaa 2400 acaaagagat ctagaattgg tattatatag atagtatcaa cttacactct aagccttctt 2460 tttcactggt agagccttcg tcgttagaat tatttttaat actattcttc caaacacgat 2520 aggcttctcc atattgtttt gtcaaaagat tgttgagcct cgactttatt tgggctgtag 2580 ttcgataaac cccttgctcc tgaaaatact ggttgcattt attcagtatt gtaacctttg 2640 acgacttgaa cgttcgtcct tccttgtcac cgccaagata catattgagg ttttctccgc 2700 cgttcatcaa catgaattgc tgaagtctct ccatagaagt cagaccatct ttaccaccat 2760 catgtttgta ccatgtctta tattcgtttt tgatagcatg gttatcttca gtggataaag 2820 atatttcgct gttagagtta gacatgtttt tttttcgtaa taaggtgtga agcaaaataa 2880 aggtgcaggt gtgaagaaag atgtcaattg 
tatgtttcct agattatata gcaatttttt 2940 tgcataatct tttattaaat gacgtataat cttggcatat gaatgtcatt aaaattttgg 3000 tcaaaatttg atcgaaattt gatcgaaatt tgatcttttg ccttttggtg tcctcaacta 3060 caaattgaca tcattggcta agattcacac aagaatttga gcgatcaacc tggctgagta 3120 ccccc 3125 // ID PCAL_I repbase; DNA; FNG; 5866 BP. XX AC AF007776; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Candida albicans Ty1/copia-type retrotransposon PCAL_I, internal DE region. XX KW Copia; LTR Retrotransposon; Transposable Element; PCAL_I; KW Ty1/COPIA superfamily; internal region; internal portion. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-5866 RA Matthews D.G., Goodwin J.T., Butler I.M., Berryman A.T. RA and Poulter T.R.; RT "pCal, a highly unusual Ty1/copia retrotransposon from the RT pathogenic yeast Candida albicans."; RL J. Bacteriol 179(22), 7118-7128 (1997). XX DR Genbank; AF007776; Positions 281 6146. 
XX SQ Sequence 5866 BP; 2024 A; 947 C; 1159 G; 1736 T; 0 other; gattagaagt cgatagtgat aatcatttcg tcccaaatta gcgttgtata aattcagtcc 60 tcagatttgt attattgatt gatagtttcg aagtttgaag gtacagaatt tcacaagatg 120 agttccgcaa agaatgatga taacgaaggg aaggtcatgg aaagtgttga tcaagctaat 180 gctattagta aggtggatga acatatcaag gctagattca atatgctttt cataaaattt 240 aatgacttac ctaagttggc cgtcggtaat cagaaaagcg tggataaatg gaatgaagaa 300 tttaaatatt tccacgttgc ttaccccgat gttttggaat ttttgcttga ctataatcct 360 aaagataaat tcaaggttaa aaaggtagaa ggtatttatt ttactggttg gtgtttacaa 420 atgtgtttac agtccatttt tgataggttc agattgatca tgatttctaa gctaccaaag 480 cacttgcaaa aggaagcaaa cttaatcaaa gctgcttatg atgctgttac taaatctaaa 540 gattatacca ttactagtaa gatcttgctg aagtttgtaa acgttgaaca tgagttagtg 600 gtttgctata accttccata tttgctgcag gtggaagaga aacttgagga aatactctac 660 aacacttcaa acgttgtcga tgagtatgtc cgtagtcttc caaatctcat aggtcaagtc 720 ttgtacttca atcatgtgaa gaaatcagag gctttaagtt tgtttttgaa tattcatgcc 780 tcatactact caaagtggat tcaagctgac aatgatacat cagtactccc aagttgctct 840 accatagctg aagaaatgtg tgatcatcct gattatgcta gattggttga cattccaagc 900 aacaaatatg aacttaatct tattgttagt ttaccagcac cagagaaacc aaaaggaaaa 960 ccagaggaga actcactgga acaatctcaa aagaagaacc tgaaatcaag aaagagaaat 1020 aagaaacatc caaaatcaga taacgataaa ggtgaaaaag aaaaagaaaa agaaaaaact 1080 tcactggaat gaaaaacagg tgctgcttct attaattgtg taatgaatat acataattgc 1140 agcaaaacca cgtttccagt agaaaattct cattctctta atgcttcttt gaacgtaatg 1200 aattttaaag gtttaaggtt taacaagtat ctagtgtatg atactggtgc cacaatatct 1260 gttgtgaaca ataaagatat attgctgaat gttaaggacg caacaattga agtttctgtt 1320 gctgatggtg ctacattaga agcagattgt attggtgatc taattatcag agtcggtatt 1380 gtctcgatta cgttagagaa tacattgtat ttaccagaaa gttcctttaa tcttgtgagt 1440 ttgaaacaaa ttgaagaacg aggatttaat gttcttatta ctaaagaatc agtgattgta 1500 tttaaccaaa atgtggctcc tactattatt gcttcaagga agaatgctgc tgatctttat 1560 atgggtcctc aattcagtga agaatcttta gaatgtgatt ttgattatga tggtttggca 1620 gatatgttgt ccaatgctaa ccaagatgac 
aaagataaat caagtatgaa tgaaatgtca 1680 gaatatcaag aacatgatta tagttctcga gcattaataa attctttgac ggaggttgat 1740 gttttagatg ttgaaatttc cccatatgga gttgaacaat tgctaccaac tggagataag 1800 aacgatattt ataatttcca tttgatgtca aatcatatgt ccattgagaa aatcttgttg 1860 ttacaaaaat accagggtct cgtacttcac acttcaaaag agagtcttca aaagattgct 1920 gattgtaagg tatgtctatt atcgaatgcc aaacagagaa gtcacaatca tcattcagaa 1980 agaaaagcct cgagaagaca tgagagactt cattgtgata ctctcggtcc atttaggtcc 2040 gaaaataaca agtggtattt aacgtctgtt atagatgaac atacgggtta cattgaagga 2100 attattacta aagacagaaa ggtaaaggat ctcttaattc aacgattaaa gatctggaat 2160 aatcggttta acgataaggt ggcatacttc agaagtgata atgctcctga gttcccacaa 2220 ccttctgatt tagctgagtt cggtatttgg agggagacta tagcggcata tctgcctgag 2280 cttaatggtc tcgccgaggt tgttaataaa ttgattttac aacagattta caggatcgtt 2340 gtgacacttg gtccacaaat actcaagttg atttattatg tgattcaata ttctattaca 2400 atgatcaacc acactccacg tcgttcactc aagggacaaa ccccttatgg ttgctattat 2460 caattaagtg agggaaattt ctaccggttt ccttttgcca tcgattgtgt cgttacattt 2520 agtaatgcca tcgaaaagaa ccgttacgga gttacatcaa ctaaaggagc tccttcatcg 2580 atcatgggtg ctgtgattgg ctacgctagc gattgtttta gttattacgt gttgctaaaa 2640 aatatgcggt gtgatattat ccttagccct aatgtccgta tattgcgaag ctatgaggtt 2700 attaactcct atctcaaaaa cttatccact acacctatgt cacacattgt tcctatggct 2760 gaaggtatcc agggaaggca actgggcgct cagtacgagg tacgcggaac atatgtggaa 2820 agtgaatatg acaatacaaa tgacgtgatg cacatgccca aagagtcata ttcagttcag 2880 ccagcatcgt ttactttaac tacgggtaac agttctaacg aatatgttat aaatgatgat 2940 ccagtacaga ttaccattga gaatcccgat gatttttcta accctcttca actaactgaa 3000 gaatcacacg atatggtatc cgaagtaaaa tcggatgaga atcctaaacc cagtctccac 3060 gagctaacac ctggggataa tccggtgtct aaacctcctc aacttggtac cgagacttca 3120 gtaataggga agtctaaaga gcctattaca aaccacacaa aggacgcccc ttccatccag 3180 gggagggacc ataaacgcct ggaatctact gctcaggttg gactatcaca ccaaccccag 3240 actggtactc ccgcttcgga ggagtcaaaa ttgtcaggaa cagatcattt cggtgtcgac 3300 gttgttaaag aaacagtctc agaagattgg catacttctg 
actacccaga aactagtgct 3360 gaagatgaac agcaaaatcc ctcgttactg gctaataaga atcgggtaac tgaaaaaata 3420 gatgagggag aaaatatttc atttccgggg ggtgatgatg attctgtcgt gatcaactca 3480 aatgttgagc aatctaatgt tgaaacagag gatgctggta acagtccaat tcaagacgaa 3540 gtttctcaag agggaagaat acttaatgaa caaactgata tagttgatac tgttgctaaa 3600 gttattgaga atgaaaaaat ctctcctatt aattcattag atgatcatac tgaacttgct 3660 acagactcgg gaaatgatag caattcaaca gaatccgaca ttcaatcgaa aaatgaaata 3720 tcaccagtga ttaatgagaa aaatactgaa ataatccaaa aacacattga aagtatcctt 3780 gctgataaga gattggatga atttgaaacg tataatgttg atgaaattga gaatgtgatt 3840 aatgacgatg acattgctga agctaatcca ctaccagatg aaaataatga tgttcagatg 3900 aatgagagtt ttgataataa tcatagcatg tcacgagcaa agaagaaata cacatttgag 3960 aaagaagtta acgaaaaaat tgctggtact aaacattcac ttgatacaac tgatccaaga 4020 gaagcaatca gagtgttaaa tactggtgaa accaagagaa tcgaacccaa gaaaagagag 4080 gtgcctatca ctgtgaaatt aaacaaaaga tcgcaataca agtcaccata tgttacaaga 4140 agtggtagaa cggttataaa ccccaagagg tatttacatg cggtcgtcaa caaaatcgac 4200 tataatgatc cgggatggat aaagtcaatg aatgctgaac tagagaaatt tagatcaaaa 4260 gatgtttacg aagaagttcc aattcccacc ggtgtgaagc ctatatctat gggttgggta 4320 catactgaga aaattgattc tctcaaaggt gttgttcgga aatcacgttg tgttgtccat 4380 ggcaacagac aaaaggaaaa attggattat gaccctttta gtgttagttc acctgttata 4440 gatcttgtga ctataagatt attgacaata ataggttgtg aattaggaat gacaattcaa 4500 catttagacg tcgagtcggc gtatctaaat gcctctatta ctcattcaaa tccaatttat 4560 gtctttcctc ctaaatcagt acctttgaag aaaaaccatt gttggttatt gaaacgttct 4620 gtctatgggt taaaacagtc gggtttggaa tggtatcaca ctatcaaaag agtattggaa 4680 gacattggtt ttactcaagt tttacacaat gatggtttat ttcacattga atatgaagag 4740 ggatcagtaa tatatttagg tttatatgtt gatgatattc ttatggttgg aagttcacaa 4800 aaagttattg ataattttgt ggatcaattg agagatcatt ttgaagttaa agtgtttggt 4860 gaaatatcaa attatcttgg tattgaattt cgtaaaaccg aatctggtta tattttatct 4920 caagaaaaat ttctcaagaa attacttaag gatttcaaac tagatgactc atatgggaaa 4980 aacataccct ggattccgaa tgacaaatat gaaaaggttg caataattcg 
tgaaaacgtt 5040 aatccagaga atgattttga aaaggttccg aatgagacat tgcttgaccc tgatgctaaa 5100 aaactatacc aaagtggtgt tggcctgctt ttatgggctg ccacaaacac acgtccagat 5160 atatcggtcg tagtgaattc gttgggttct aaatctgcaa atccaaatgt ccatgattat 5220 gagaaattga tttattgtct taggtatatc aaaaatagca tgggatatca cattgagtac 5280 aaaagaaaca gattgaatat accaccaaaa tcatttgtta tcgaatgttt cagtgatgcg 5340 tcatttgcac caggattgga tagaaaatct attagtggaa ctttgattta tgtgaatgga 5400 aatttggtgc aatgggcgac caaaaaacaa acggtcatag cacaaagctc agcagcttgt 5460 gaaatgttgg ctctaaatta tacaatgttg aaagctatcg aaataaaaaa ccatttaatg 5520 gatttgggtt ttgaagtagg taagatacat tgtcatcaag acaaccaagc tgtgattaaa 5580 gttttgagaa ataactattg tcacccacat cgaccaatag atatctgcta taagtttcta 5640 cgccaattga tcaatgataa agtattttca atatcctatg tgaagacaaa tgataattac 5700 gccgattgta tgactaagtg tctaagtcgt gctaaattca aagcattcgt tgagggtatg 5760 ataaaacggt tagacctaga agataatcaa acactgatac aaaatgcaat aacggcagaa 5820 taagtggatt tatcattact attatcgtaa tgctcaatca ggggag 5866 // ID Copia-48_MLP-LTR repbase; DNA; FNG; 311 BP. XX AC AECX01001003; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-48_MLP_; KW Copia-48_MLP-I; Copia-48_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-311 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001003; Positions 333 23. 
XX SQ Sequence 311 BP; 94 A; 58 C; 63 G; 96 T; 0 other; tgaacctaat agtacttcac catcaccgga atagagacct agaaagcaaa ccacaagttg 60 aacgtattag tgacggattg ggaaggaaga cgtgatagaa atggttatta tgaaacatgg 120 atgtagtaaa gagaatgtac ggattgggaa agggaaagag gagatttttc acctacttgt 180 agtattcctc tgttcacaat tatgttgttc ctctttttcc ctatctgact ttttactcta 240 tcatccaagt ccatttcctc acgatcatta tccttgaccg ttcgaatagg ttagctgcta 300 gtagcatttc a 311 // ID Gypsy-3_LBS-I repbase; DNA; FNG; 5372 BP. XX AC ABFE01000084; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_LBS_; KW Gypsy-3_LBS-LTR; Gypsy-3_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-5372 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000084; Positions 861256 866627. XX CC Positions [4094-4582] - Integrase core CC 'TAATT' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 491..5209 FT /product="Gypsy-3_LBS-I_1p" FT /translation="MSTLATVDPTSAKAPILTQGDISPSVMMDFENAALDF FT FMSKSIPADKQVTMVIPGIKDLRIRDWIAADRARIVALPFDKFMVEMRLNY FT LPPDWEDQVRNKILTSTLTALKTSFWNWSQNLLKLNCLLRGTASAFDESTL FT RNHLEAHLDDELKTKLRHSDARKDKVFKTWVTAVRLLDEAHAVENKHQREL FT IEETLTQRQSKRQNTSNEALRGPSRRGNTSQSNMSAGGSSSTYTRLPPLTD FT AERTLLNEHDGCTKCRRFYTDHRSHSCPNGFPAGKTYKTLTATDAMNAKKG FT KAVTKPATKAVAATSASIETVDSDNDISAAAAVLPDSPGDYSDSVEDQDVS FT RREVSPPFRAKHLVWNCQVHSLTEDFPVKTRALIDNGAHLVLIRPDLVDRL FT GLKKHKLPEPELVDVAFNNQKQKTQLYHYVKLSLTSLDSSWTSRTVRAVVT FT PGLCAPIILGLPWLIHNSIVTDHAARTCIDKNTLYDLLNPPVIKPPPPPKP FT KLKEQIKITKADKKLVLAELMMVCNDRINHLKLKPEKVKDFDVVGAVRERI FT EVLAMQDSLLQKETELKNEYGAIFEPIPHVNELPHEVVAEIHVKDAEKTIK FT SRSYPSPRKYKEAWQILIQQHLDAGRIRPSSSPCASPAFIVPKANPNVLPR FT WVNDFRQLNENTITDSHPLPRIDDILTDCAKGKIWATIDMTNSFFQTRMHP FT DHVHLTAVNTPLGLYEWLVMPMGLKNAPAIHQRHVTAALRHLLGKFCHIYL FT DDIVIWSNTVEEHEKNVRAVLQALRDARLYINPDKTKLFCTEINFLGHHIS FT ARGIEADLGKVNRIMSWPTPTSATEVRSFLGLVRYISAFLPALAEHTGVLM FT ELTTKASDKNFPLWAPKFQVAFDAIKAIVTGWECLTTIDFNKMPENKIFVT FT TDASDKRSGGVLSFGPTWETARPVAFDSMTFKGAELNYPVHEKELLAVIRA FT LKKWRVDLLGSPFFVYTDHKMLENFLSQKDLSRRQARWMEFLSQFDAKIVY FT IKGDDNTVADALSRLPTNTDSSTANASARHPYDFCEDDDTLCAVASISLSS FT SQNPWEAAKSLANVSAISNNVSATLKITADKDFLESVKSGYAEDAWCKTLP FT SATLSFSNLVLRDGLWYIGERLIIPRTGNLRELLFALAHDTLGHFGFHKTY FT GSLRTAYYWPNMRRDLEQGYVASCPECQRNKSSTIKPYSPLHPLPVPDQRG FT DSVAIDFIGPLPEDENKNCIITFTDRLGSDVQLVATRTDIMAEDLAILFFD FT KWYRENGLPADIVSDRDKLFISRFWRALHKLTGVKLKLSTSYHPETDGASE FT RTNKTINQALRFHVERNQMGWVRALPRVRFDIMNTVNKSTGFTPFQLRFGR FT SPRVIPPLVPAKSSATVADIDAWHVIRRLETDVLEAQDNLLRAKISQSVQA FT NKHRTLTFPFSIGSRVRLSTLHRHNEYKAKGEKRVAKFMPRYDGPYTVIDV FT DEEHSTVTLDLPNSPNICLTFHTSEVLPYVESNTTLFPSRRFEELDPIITD FT DSNEEFYIDRILDARRRGRGYQYLVRWHGYGQEHDRWLPGSELQDCEALDS FT WLASHG" XX SQ Sequence 5372 BP; 1431 A; 1469 C; 1137 G; 1335 T; 0 other; ctttttttga atttacaccg ccgttatcgg ttctgacata tcttgacacg cgcatatagt 60 caccgccttt ggaggcctga ttcaggtcat cgttcggcgc gtgtacacgc tgtcttgtac 120 
gtctgatcgt tgttcctgtc gttcacgaca catcaaccct gattttccgt ctgtaacagt 180 ggagggtact tgtacctgta tcacttaacc gccatcgccg taactgttta ccgcaacact 240 ggaactccct cgtttcacct gtacaaccgc ctctgtccaa caattactga ttcttgacag 300 tttggctcca accacacatg tgaggggcaa tcgtttacct tcacaaccac cgctagcctc 360 atcggcttca ccggttctcc caaatacgca tacatcgccg ccctcctctc ctgcatcgtc 420 tttctatctt tcatcgagct cttctgacga cgaaaaacgc gcatatactc agtcaaaaac 480 tcgtaacaag atgtctacac ttgctacagt cgatcctacc agtgccaagg cacctatcct 540 cactcagggc gacatctctc catcagttat gatggatttt gaaaatgcgg cacttgattt 600 cttcatgtcg aagtcgattc cagcggacaa acaggtcacg atggttatcc ctggtatcaa 660 ggatcttcgt atccgtgatt ggatcgctgc ggaccgcgcc cgcattgtcg ctctaccttt 720 tgacaagttt atggtggaga tgagactcaa ttaccttccc cctgactggg aagaccaagt 780 tcgtaacaag attttgacgt caacactcac cgctttgaaa acttccttct ggaattggtc 840 acaaaacctg cttaaactga actgcctact ccggggcacg gcgtctgctt ttgacgagtc 900 gaccttgcgc aaccacttgg aagcgcatct cgacgacgag ttgaagacta aactccgaca 960 cagtgatgcc cgcaaggaca aggttttcaa gacctgggtt actgctgttc gtctattaga 1020 cgaagcacat gctgtcgaga acaaacacca acgtgaattg attgaggaga cactcactca 1080 acgtcaatcc aaacgccaga acactagcaa tgaagcattg cgaggtccgt cgcgacgtgg 1140 caacacatca caatcaaaca tgtctgcagg cggctcctcc tccacatata cccgtctacc 1200 cccccttacc gacgccgagc gcacgctcct taacgagcat gacggttgca ccaagtgccg 1260 tcgtttttat acagaccacc gctcgcattc ttgccccaat ggttttccgg caggcaagac 1320 ctacaaaact ctgactgcca cggacgctat gaatgccaag aagggcaaag ctgttaccaa 1380 accggctacc aaagctgtcg ccgccactag tgcatccatt gaaaccgttg actccgacaa 1440 cgacatctct gcggctgcgg cagttttacc tgactccccc ggagattact ccgattcggt 1500 tgaggatcag gatgtgtcgc gtcgcgaggt aagtccacca ttccgtgcca aacatctcgt 1560 ttggaattgt caggtgcaca gtttgactga agacttccca gtgaaaacgc gcgcgttgat 1620 tgataatggc gcgcacttag tcctcattcg ccccgacctc gtagaccgcc ttggattgaa 1680 gaaacataaa ctgcctgaac cagaactcgt agacgtcgca ttcaacaatc agaagcaaaa 1740 aacgcagttg tatcattatg tcaaactttc tctcacttcg ttagactcgt cttggacttc 1800 gcgcaccgtt agagccgttg 
tcacaccggg tctctgtgcc cccattattc tcggactgcc 1860 ctggctcata cataactcaa tcgttacaga ccacgctgca cgcacttgta tagacaaaaa 1920 tactttgtat gatttattaa acccccccgt gatcaagccg cctcctccgc caaaacctaa 1980 gttgaaagaa caaattaaaa tcacaaaagc agataaaaag ttggttctcg ccgaactaat 2040 gatggtctgc aatgacagga ttaaccatct caaacttaaa ccggaaaagg ttaaggattt 2100 tgacgttgtt ggcgctgtcc gagaacgcat tgaagttcta gcgatgcaag attcactgct 2160 acaaaaagaa acagaattaa aaaatgaata cggcgcaatc ttcgaaccaa tcccacatgt 2220 caacgaatta ccgcatgaag tggtcgccga gattcatgta aaagacgcag agaaaacaat 2280 aaaatcacgt tcctacccat ccccgcgcaa gtataaagag gcatggcaga tattgattca 2340 acaacatctg gatgctggtc gtatccgtcc atcatcttca ccttgcgcct cgccagcatt 2400 tatcgttccg aaggcgaacc caaatgtcct accacggtgg gtcaatgact ttcggcaact 2460 taacgaaaac accatcacag actcccaccc gcttccaaga atagatgaca ttctgacaga 2520 ctgtgctaag gggaagatct gggcaacgat tgatatgacc aatagtttct ttcaaacgcg 2580 aatgcaccca gatcacgttc atttgacagc cgtgaacacc ccattaggtc tatatgaatg 2640 gcttgtgatg ccaatgggac tgaagaatgc ccctgctatt caccaacgac atgttaccgc 2700 tgcactacgc catctcctgg gcaaattctg tcatatttat ttggacgaca tcgtgatctg 2760 gtcaaacacc gtggaagaac acgagaagaa tgttcgcgcg gttctacaag cgcttcgtga 2820 cgctcgactg tacatcaacc cagataaaac taaactattc tgcactgaaa tcaatttcct 2880 aggtcatcac attagtgccc gtggaatcga ggctgacttg ggaaaagtca accgtatcat 2940 gtcatggcca acaccaacat cagctacaga agtccggagt ttcttgggac tggtccggta 3000 tatctccgct ttcctgcctg cacttgctga acatacaggt gtactcatgg aacttacgac 3060 gaaagcctcc gacaaaaatt tcccgctgtg ggcaccaaaa tttcaggtcg cgtttgacgc 3120 gataaaggcg attgtaacag ggtgggaatg cttaacaact atcgatttca acaaaatgcc 3180 agaaaacaag atctttgtga caacggatgc gagcgacaaa cgatctggcg gcgtactctc 3240 ctttggtcct acatgggaaa cagcccgccc agtcgctttc gattcaatga cgttcaaggg 3300 cgctgaactg aattatcccg ttcatgaaaa agaattactg gccgttatcc gtgcattgaa 3360 aaaatggcga gtggacctac ttggctcgcc attcttcgtt tacactgatc ataaaatgct 3420 ggagaatttc ttatctcaga aggatctctc gaggcgccaa gctcgatgga tggaatttct 3480 ctcccaattt gatgccaaga tcgtgtacat 
caaaggggac gacaacactg ttgcggatgc 3540 actctcccgc ttaccaacca acaccgattc ttcaacggct aatgcgtcag ctaggcaccc 3600 atacgatttt tgcgaggacg acgacaccct gtgcgctgta gcttcaatat ccctatcctc 3660 atcacaaaac ccttgggagg cagccaaatc gctagctaac gtgtcagcga tatcaaacaa 3720 tgtcagtgca acattgaaaa ttacggcgga caaagatttc cttgaatctg tgaaatccgg 3780 gtacgccgaa gacgcttggt gtaaaacgtt accctctgcc actctcagtt tctccaacct 3840 cgtcttacgt gatggccttt ggtacatagg tgaaaggcta atcataccgc gcacaggtaa 3900 ccttcgcgaa ttgttgtttg cactggccca cgacaccctc ggccacttcg gtttccacaa 3960 gacttatggc tcgctgagaa cggcatacta ctggccaaat atgcgaaggg acttggaaca 4020 gggctacgtt gcctcctgcc ccgaatgtca acgtaacaag tcgtcaacaa taaaacctta 4080 cagcccactg cacccactcc cagtacctga tcaacgaggc gattccgttg caatcgattt 4140 cattggccca ctaccagaag atgaaaacaa gaactgcata atcacgttca ctgaccgctt 4200 aggtagtgat gtccagttag tcgcaacacg gacagacatc atggctgaag atctagcaat 4260 cctatttttc gataagtggt atcgtgaaaa tggtctacca gctgacattg tttccgatag 4320 agacaaactc ttcatttcaa gattttggag agccctgcat aagctgactg gtgtcaaatt 4380 gaaactgtcg acgtcgtatc acccggaaac ggatggcgca agcgaacgga ctaacaagac 4440 catcaatcaa gccctgcgat ttcacgtcga gcgtaaccaa atgggctggg tccgcgcctt 4500 accgcgagtc cgtttcgaca tcatgaacac cgtaaacaaa tcaactggct ttacgccatt 4560 ccaactccgt tttggtcgaa gccctagggt tatcccacca ctcgtgcctg ccaaatcttc 4620 tgcaacagtc gctgacatcg atgcgtggca tgtcatcaga cgtttagaaa cagacgttct 4680 cgaggcacaa gataacctcc ttagggctaa gatttcccaa tctgttcaag caaataagca 4740 caggacgcta acctttccat tcagtatagg ttcacgtgtg cggctatcta cgctacatcg 4800 acataacgaa tacaaggcta aaggcgaaaa gcgtgtggcc aagttcatgc cacgatacga 4860 cggcccttac actgttatcg acgttgatga agaacattcc accgtgacct tggatctacc 4920 taactcccca aatatctgtc tgactttcca cacttccgag gtcctccctt acgtcgaatc 4980 caatactacc ctattcccat cccgccgttt tgaagaactt gatcccatca tcacagatga 5040 cagcaacgaa gaattctaca tcgatcgtat actggacgca cgtcgacgcg gacgcggtta 5100 tcaatacctg gtccgttggc acggttatgg ccaagaacac gacaggtggt tacctggttc 5160 cgaacttcaa gactgcgaag cgttggactc ctggctggcc 
tcacacggat gatctgcttt 5220 ttatgtagat tttctttcgc tcttgccagc cggtagcttt ttcccacagg gttttgacgc 5280 acccggcgtt cggattttac ttactcttct tactaacctt tcctgatttt tgttttgctt 5340 tgacgtatat tttttttcag aacatgggag gg 5372 // ID Copia-1_SPDB-I repbase; DNA; FNG; 3193 BP. XX AC ACOE01000107; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Spizellomyces punctatus genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_SPDB_; KW Copia-1_SPDB-LTR; Copia-1_SPDB-I. XX OS Spizellomyces punctatus OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; OC Spizellomycetales; Spizellomycetaceae; Spizellomyces. XX RN [1] RP 1-3193 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Spizellomyces punctatus genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; ACOE01000107; Positions 22998 26190. XX CC Positions [1537-2076] - Integrase core CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 183..1220 FT /product="Copia-1_SPDB-I_2p" FT /translation="MSNTDLDTKLMMEKFNGMNYHLWKFKMTMLLKEKGLW FT GVIEGDQSMEGTSEKDWRRKDEWAMAVIALSLSDAQLMHVQTAVTAKEAWL FT KLSDIHQRKGIASRLYLRRKLLTLKMEDGTGGMLGRINMVRTMAQQLEAIG FT APVSEEDQVITLLYSLPESYEQLVVSLEARSDSLTLEFLTSRLLHEETRRK FT EAGSEGDSKGKAFFSRDGGQKVGKHGGTKKIAGKCFYCNKKGHFKKDCRKF FT LADTKQQANQASTSQATQNEFLFFAQSEDDTKTSQGAVWWIVDSGASQHMS FT HRKDWMTNYMEMSPIQVHLADNRTVQAIGKGQITLETGRNRFCTPQRHVVC FT SQA" FT CDS 1201..3192 FT /product="Copia-1_SPDB-I_1p" FT /translation="MWFVPKLEKNLFSVRRATENGAKVEFGSKGCCIKTSD FT GQEAIQGTLGGGLYMFHAKEHANIASGMSLDLWHCRLGHLNVNSIKILARN FT QLGSVLEVKNHTETSPCEACAIGKITRCPLSKGEAARAALLLGVVHSDVCG FT PMKTPSHGGAKYFVTFIDDKSRMTAVYLLKEKSEVLSKFKEFEAWATNFTG FT QKIKTLRSDNGGEYVGEEFKKFCKQKGIQQQFSTPYTPEQNGVAERMNRTL FT VEMGRCMLHHSGLSYKFWGKAIMTAVYLRNRSPTAALSNQSTPFQEFTGSK FT PDLQHLRVFGCMAYTHIPKEKRSKLDEKAKKAVMVGSSTQSKAYRLWVPEM FT NQVVMSRDVIFDENLMWSTQQQLQSVPQQDQELQMPLGGEPSFLDDQEGLG FT TVQETQRPSSITPDNSSASEDDGDDFQDAEEVPELSRKSARFHKPPGEWWI FT AKPTQSNLAFAFTIGEVPQEEPSTLAEARTRHVAKQWEVAVKAEYDSLMEN FT QTWDLVKLPPGRSTVGCKWVFKLKLKSDGSVECYKARLVAKGYSQVQGLDY FT KETFAPVVKFATIRMVLTLAAMKDMEIHQMDVKTAFLNGTIEEDIYMDQPK FT GFTTAKGLVCKLKKSLYGLKQAPRAWNKVIDGHSKNLALKGVNQTTGCTST FT RRWASTLSCMWMILF" XX SQ Sequence 3193 BP; 972 A; 709 C; 850 G; 662 T; 0 other; ataggttatg agccctgata gctatcactg atattttcct gcttcaaatc tcttttctga 60 tggccacacc atcttcctga cggacctatc ctcttcccaa cctcatattt ggaccaccta 120 ccatacgagg atgatttcac ctcttcccca acaacaaatc cactcaccaa atcagctcag 180 acatgtcaaa cacggacttg gacaccaaac tcatgatgga aaagttcaat ggcatgaact 240 accacctctg gaagttcaaa atgacgatgt tgctgaagga aaaggggctg tggggtgtga 300 ttgaggggga tcagagcatg gagggaacct ctgaaaagga ttggaggaga aaggatgagt 360 gggcaatggc agtaattgct ctgtcactga gtgatgccca actcatgcat gtacagaccg 420 ctgtaactgc aaaagaggcc tggttaaagc tcagtgacat ccaccaaagg aagggaattg 480 ccagtaggct ctatctaagg aggaaattac tcactctgaa gatggaagac ggcactgggg 540 gcatgttggg acgcataaac atggtcagga caatggctca 
gcagttggag gccattgggg 600 cgccagtcag tgaagaagac caagtcatca ccctactcta cagcctacca gagagttatg 660 aacagctggt ggtgtctttg gaggccagga gtgattctct gacgctggaa ttcctcacat 720 cacggctttt gcatgaagaa acacgcagga aggaagctgg gagtgaagga gatagcaaag 780 ggaaggcctt tttcagcagg gatggtgggc agaaggttgg caagcatggt gggaccaaga 840 agatagcagg gaagtgcttc tattgcaaca agaagggtca cttcaagaag gactgcagga 900 agtttctggc ggacaccaaa caacaagcca atcaagccag caccagtcag gcaactcaga 960 atgagttcct attctttgct cagtcggaag atgacaccaa aacctctcag ggagcagtct 1020 ggtggattgt ggactcgggt gcctcacagc acatgtcaca caggaaggat tggatgacca 1080 attacatgga gatgtcacca attcaagtcc atctggcaga caacagaaca gttcaggcaa 1140 tagggaaggg ccagatcaca cttgagacag gcagaaaccg attctgtacc cctcagagac 1200 atgtggtttg ttcccaagct tgaaaagaat ttgttttcag taagaagagc cactgaaaac 1260 ggtgccaaag ttgagtttgg cagcaaggga tgctgtatca agacttcaga tggccaggaa 1320 gcaatccagg gaactctagg cggcggtcta tacatgtttc atgcaaaaga acatgcaaac 1380 atagcctctg gaatgagttt ggacctctgg cattgcaggt taggacacct gaacgtcaac 1440 agcatcaaaa tcttggctag gaaccagctg ggaagcgtcc tggaggtcaa gaatcacaca 1500 gaaacatcac catgtgaagc ctgtgccatt ggaaagatca ccaggtgtcc cctttcaaag 1560 ggagaagcag ctagagcggc tctactgctg ggggtggttc acagtgatgt gtgtgggccc 1620 atgaagacac caagtcatgg tggggcaaag tactttgtca cattcattga tgacaagtca 1680 aggatgacgg cagtttatct tctgaaggag aagagtgagg ttctcagcaa attcaaggag 1740 tttgaggcct gggcaaccaa cttcacgggg cagaagatca aaactctcag atcagacaat 1800 gggggagagt atgtgggcga agaattcaag aaattctgca agcagaaggg aattcagcag 1860 caattcagca ccccctacac accagagcag aatggtgttg ctgagaggat gaacaggacg 1920 ctggtggaaa tgggaaggtg catgctgcac cacagtggac tcagctacaa gttctggggc 1980 aaggcaatca tgacagcggt ctatctcagg aacaggagcc ccactgcagc actcagcaac 2040 caatcaacac catttcaaga gtttacaggc agtaaaccag atctccagca cctcagagtg 2100 tttggctgca tggcctacac tcatattcca aaagaaaagc ggtcaaaact tgatgagaag 2160 gccaagaagg cagtcatggt cggctccagc actcagagta aggcctacag attgtgggta 2220 cctgagatga accaagttgt catgtcaagg gatgtcatct ttgatgaaaa 
cttgatgtgg 2280 agcactcagc agcaactgca gagtgtacca cagcaagacc aagagctaca gatgcctctg 2340 ggaggggagc cttcattctt ggatgaccaa gaagggctag ggacagttca agagactcag 2400 agaccatcta gcatcactcc agacaactct tcagcatcag aagatgatgg ggacgacttc 2460 caggatgctg aggaagtgcc agaactcagc aggaagagtg ccagatttca caaaccaccg 2520 ggagagtggt ggattgcaaa gccaactcag tcaaatctgg cttttgcctt caccattggg 2580 gaagttccac aggaagaacc cagcactctg gctgaagcta ggaccagaca tgttgccaaa 2640 cagtgggagg tggctgtcaa agcagaatat gactctctga tggagaatca gacatgggac 2700 ttagtcaaac taccaccagg aagatccaca gttggctgca agtgggtttt caagctaaaa 2760 ctcaagtcag atggctctgt agagtgctac aaggccagac tggtagccaa aggatactcc 2820 caagttcaag gactggacta caaggaaaca tttgccccag tagtcaagtt tgcaaccatc 2880 aggatggtcc tgacactagc agccatgaag gacatggaga ttcaccaaat ggatgtcaaa 2940 acagcattct tgaatgggac cattgaggag gacatctaca tggaccaacc aaaaggcttc 3000 accacggcaa aaggacttgt ctgcaagcta aagaagagtt tgtatgggtt aaagcaagct 3060 ccaagagcct ggaacaaagt cattgatggg cactcaaaga acttggcttt gaaaggtgtg 3120 aatcagacca cgggctgtac atcaacaaga agatgggcgt ccaccttatc ctgtatgtgg 3180 atgatcttat tct 3193 // ID Gypsy-18_RO-LTR repbase; DNA; FNG; 535 BP. XX AC AACW02000280; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_RO_; KW Gypsy-18_RO-I; Gypsy-18_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-535 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000280; Positions 22058 21524. 
XX SQ Sequence 535 BP; 174 A; 97 C; 52 G; 212 T; 0 other; tgtaggaaca acaaggtcaa gtcaagccac ggtcatggtt acatgttgta acttaaagtt 60 catttgaact ttatttttct tttgaacttt attttttatt tcaactttat ttcttctttt 120 ggatttgatt cctgatcgag gaatcctctc ctaaaatttt atatcttttt tgaatataaa 180 aaggacaacc tttttatgac ttttatcttc acttctcttg aattatatat cttaaatata 240 ttttatcaat aaatcaagta tttttatctg aatcttttgg tataacactc ttaaattaca 300 cgcaagacaa gtcccttgaa gccaattatc ttcattgtaa acactttcaa aactcttatc 360 caattactct attacaaaca agactgatct acttaacaag atctggtggt cacttacaac 420 aagcaaatcc aatatcaata cttaactcaa gtcttatttt tattcttttt catctacttt 480 attctacttt ataaaacgtt aactcttaca agccattaaa ggaagcaaaa ttaca 535 // ID Gypsy-8_LBS-I repbase; DNA; FNG; 8434 BP. XX AC ABFE01000288; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_LBS_; KW Gypsy-8_LBS-LTR; Gypsy-8_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-8434 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000288; Positions 193477 201910. XX CC Positions [6230-6709] - Integrase core CC 'GTAGG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 207..2639 FT /product="Gypsy-8_LBS-I_2p" FT /translation="MDNTKKTANPGRKTNNSQSAGMTRQTSNTPTSGTTTT FT PRGREAKLETSTQNMFMPGTPELSEGLDIPEAPSFQTAQERDQPSIISEGI FT EPEHPGVEDLTDLGAPFAPEELPDRGSYKEVVVEFPACMPVSSFGKSVTET FT GLDRSEKIFALAGRLRQALQLGEAALRRPIYSSYGKIVSRTERREIFKEEM FT DVLRELLTNLYHFDNGANQVGFALQKIILVNLNQQLKSRRNEAEEDLITSG FT EGIPSLPKWGFTGKADEFWSANDFEILGACYRREVENFLAYIAEHHDFAKI FT RKNRRPEQRVVTSSRNNKRSVIKPPVVTAIRDYLPTFIENDSDYLPMFTEG FT EPDSISTHLKKSRTRFSHPMANVNSSAFGHPVQNNSSHALKELFGVKRHED FT PLGSNVPRGTANFEPRLICESQGEDNRSENSHSHQSRDSFQQQPRQGGGDP FT GDSDGDDSDDGEGNNGPRRGPRKPFSPSNPRKNPFSAIPGEGNSVVSKPPQ FT EPQFDTKLKVDAIPTWDGNSENLRRWILKINSLAKRSDIVFKQLGTLAPTR FT LTGSAEIWYYSQSVETRDRIEQDWNTLRAAIGEYYMNRAFLDKQKARANRA FT SYQDAGNGRETPSEYVIRKLELLQFVYNYTDRELINEIMEGAPSFWRSIIT FT PHLFQDLEQLRLSVKFHEDSLLNMGGGENSYQNANRQGSQGQAKENTQRSP FT YNPFRNVRVNLVGWTKATSNPQFPKDDSNVSPHGTPDEKGARPCRHCGSGK FT HWDRDCKYARKGEKRARANMVTTAAEDEQAQEDYDNAYYERFSDEEDLNDD FT ADFQKPSQL" FT CDS 3050..7405 FT /product="Gypsy-8_LBS-I_1p" FT /translation="MSVQPKIRTGQRINLIQVTGNALITGYVMLDLYFETE FT EGLVLIKVEAYVVKGMSAPLILGNDFADQYAISLLREEGQSTLLFGKSGRT FT KQVHNSITAPAFVDEDGHTFKVRARPDITSKVFKAKSHRKSQKLKRRSTRR FT TADNYVRTTGPIQIAPETTRLVKVQANFSKNSGLLFVEKSLATSGGPEDIY FT GCGDTLVCKNSPFMYVSNFSKKPITIPTGQAISQGQDPQSWLDKESQFSEK FT DRHTVNAHANLLRSVINLEGTLLKKNPFAQTSRSEVKSLHDASRRDYSAED FT PLAEPALEGGPKTAEMPEGPTNFKQVLEEVDISPNLTQLQTKLLQDVLKRH FT ERAFGLEGRLGYYAEEVDIPLLPNTKPISVPPFQASPANREVIDKQMDSWL FT QLGVIEPSKSPWGAPVFIAYRNNKPRMVIDLRRLNEQVIADEFPLPRQDKI FT LQSLEGSQYLTTLDALASFTQLSIKPEDRDKLAFRSHRGLFQFRRMPFGYR FT NGPAMFQRVMQGILAPFLWIFALVYIDDIVIFSKSFEDHLIHLESVLKAIS FT NAKITLSPGKCHFAYQSLMLLGQKVSRLGLSTHKEKVDAIVSLDNPRNIHD FT LQVFLGMMVYFSAYIPFYAWIVHPLFQLLKQENKWKWGEEEQNAYDLCKQV FT LTQAPVRAHAMPGLPYRIYSDACDFALAAILQQVQPIKVRDLKGTKTYELL FT ERAYKAKEPIPDLATHLVKEDSDIPPQGGWDKDFENTMVYVERVVAYWLRV FT LQSAERNYSPTEREALALKEGLIKFQPYLEGEKIFAITDHAALTWSKTFQN FT VNRRLLTWGLVFSAFPNMRIIHRAGRVHSNVDPVSRLRRRIPPQESPLDDN FT 
VNPLELKPVEDPLRNMYEELGPRFEEKLLTAATQFAEADLQVEDDCILSWN FT LPLVLPNGEEVEIPYSTSQSYSTTVQIDAHEMRRWKDAYIKDPHFKSVIES FT KDNETDLPLSQYHHSAEGLIYFEDSTGNTRLCVPKDLRIEVMQEVHNTVTE FT AAHGGYFKTYNRISGTYYWPRMSREIKVFVNTCDVCQKTKPRRHGPTGLLQ FT PIPIPSQPFEVVSMDFIPELPISNGFDNILVIVDKLTKYAIIIPTTTKVTE FT VETARLFFKHVIAKFGIPRQVISDRDTRWRGDFWKEICRLMGMRRSLTTSY FT HPQADGQTEVLNQGLEISIRAYIGPDRDDWSEILDALTLSYNSSPHTATGF FT SPAYLLRGYHPITNTTILGRPSSIDRTEVSSSRNADQDTIHDKALDLVEGF FT AAERAKARDALLLGQLFQKKAYNKGRLNWEFEKGDKVVINRKNLGLLRDEK FT GRGDKLLAKYEGPFEIIQKVSAVAYRLRMPASYGIHPVLNIQHLEKYQESL FT REFGERPCLQANRLNFDALPEYEVDKIVAERMRKGRNGRRIPIYRLRYTNY FT GPEGDTWETKRNLKNAPEILREWEKFKALQKKKSARTD" XX SQ Sequence 8434 BP; 2574 A; 2030 C; 1937 G; 1893 T; 0 other; ttggtggaca cactgggaaa tatccctacg atctcgtgag tatctccgca atcgatctcg 60 atcaaaatct tgggaaaaga acaaggactt ctgttacaga tggaacaaat cacacccaca 120 caacacatct tataccctac gccttacttc atcacgcgga atcatactac ccttatcaaa 180 tttcaaatca gtttacaaac gactctatgg acaacacaaa gaagacagcc aatccgggta 240 gaaagaccaa caattcgcag tccgctggga tgacacggca aacaagcaac acacccacat 300 ccggaacaac aaccacacct cgcggaaggg aagccaagct agaaacaagt acgcagaaca 360 tgttcatgcc aggtacacct gaactaagcg aaggactaga cattccggaa gcaccatcat 420 ttcagacagc acaggaacgt gatcaaccat cgatcatctc ggaagggatt gaacctgaac 480 accctggagt cgaggatctt acagatctag gagcaccctt tgctcccgag gaattgcctg 540 atcgagggtc gtacaaggaa gtagtggtgg aatttccagc atgcatgcct gtttcttcat 600 tcgggaagtc tgtcactgag accggattgg acagatccga aaaaatcttt gcattagcag 660 gtcgacttcg acaagcgctt caattggggg aagctgcctt acgccgaccc atctacagct 720 cttacggaaa gatagtatcg cgcacggagc gaagggaaat attcaaggag gaaatggatg 780 ttttgaggga gttactgacc aatttatatc actttgacaa tggcgctaac caagtcggtt 840 tcgctttaca gaaaattata ctcgtaaatt tgaatcaaca actcaaatcg cgcagaaacg 900 aagcagagga agatttaatt acgagcggcg agggtatccc ttccttacca aaatggggtt 960 tcacgggtaa ggcggacgaa ttttggagcg ctaacgactt cgaaatctta ggggcatgct 1020 atcgtcgaga agtcgaaaac ttcttagcct atatagcgga acatcatgat tttgcgaaaa 1080 tcaggaagaa tcgtagaccg 
gagcaacggg tagtcacttc tagtaggaat aacaagaggt 1140 ccgtaatcaa acctccagtt gtaacggcca tacgagacta cctcccaaca ttcatagaga 1200 acgattccga ttaccttccc atgtttactg aaggtgagcc agacagtatt tcgacgcatc 1260 tcaagaagtc gagaactcgc tttagtcatc caatggccaa cgtgaactcc tcggcgtttg 1320 gtcacccagt ccaaaacaac tcttcacacg cgctcaagga gttatttggt gtcaaaagac 1380 acgaagaccc attaggatcg aacgttccga gaggaaccgc caatttcgaa ccaaggctca 1440 tatgcgaatc tcagggggaa gataacaggt cagaaaattc acattctcac cagtcaagag 1500 acagttttca gcaacaaccg cgccaaggag gcggggaccc aggagattca gacggcgacg 1560 acagcgatga tggggaaggg aataatggtc cgcgcagagg acccaggaaa cctttctcac 1620 caagtaaccc acgaaaaaac ccgtttagtg ccatcccagg ggaaggaaac tcagtcgtaa 1680 gtaagcctcc ccaagaacct cagttcgaca ctaaacttaa ggtagacgct attcctactt 1740 gggacggaaa ttctgagaat ttacgacgct ggattttgaa aattaacagc ctggccaaac 1800 gatccgatat agtattcaaa caattaggta cgttggctcc tacgcggcta acaggctcgg 1860 ccgaaatctg gtattatagt cagagcgtag aaacacgtga taggatcgag caggattgga 1920 atacactacg agccgcaatc ggggaatact atatgaatcg ggctttcctg gacaagcaga 1980 aggcgcgcgc caatcgcgca tcctatcagg atgctgggaa cggacgagaa acacccagtg 2040 agtacgtcat acgcaagctg gagctattgc agtttgttta taactacacg gatagggaat 2100 tgatcaacga aatcatggag ggcgcgcctt ccttctggag atcaattatc acgccacacc 2160 tgttccaaga tctcgaacag ttacgattat cagtcaagtt tcatgaagat tcactcctca 2220 atatgggagg aggtgaaaat tcgtatcaga acgcgaaccg acagggaagt cagggccaag 2280 ccaaggaaaa cactcaacga agtccttata atccttttag gaatgtacgc gtcaacttgg 2340 taggatggac caaagcgact tcgaaccctc aattcccaaa agacgactct aacgtttctc 2400 cgcacggaac gccggacgaa aaaggtgccc gtccttgcag acattgcggt agcggaaagc 2460 attgggacag ggattgcaaa tacgccagaa agggagaaaa acgcgctaga gcgaacatgg 2520 ttacgactgc agcggaagat gagcaagcac aggaggatta cgacaatgcc tattacgaac 2580 gatttagtga cgaagaagat ttaaacgacg acgcggattt tcagaagccc tctcagctgt 2640 aaaggtttcg aagctgggag gggtaagtgt agcaacttct tctcgagtgc actccaacac 2700 tgaaacctgg cagcagaacc ttttgaatcc gtattccaat gctccctctc taaactcttc 2760 atttgtaaga cactcggcgc cgccaactat 
taatcgtaag acacgtcgaa aactttccaa 2820 agaaattaat agtaattcgt tgcatacgca gacacaagaa agccgtagta cagacaaacc 2880 aattatcgaa ttaaaaagac atctatctcg accacttgga tgcgcctttt tgggagcacg 2940 cgctacggaa actccggccc gcctcaacga ccctaaggga aaatcaatac cggtcattgt 3000 tgactcaggg tccgacatca ccctgatatc tcagagggcc ctggagcaaa tgtcagtaca 3060 accgaaaatt cgaaccggcc aaagaataaa tcttattcag gttacaggaa atgcgctgat 3120 tactgggtac gtcatgttag acttgtactt cgaaactgag gaaggattag tccttatcaa 3180 ggtcgaagcg tatgtggtta aaggaatgtc agcaccccta attctaggaa atgacttcgc 3240 tgatcagtac gcgatttctc tattaaggga ggaaggacag agtactctac tattcggaaa 3300 gtccggacga acgaagcaag tacacaactc tattaccgct ccagctttcg tcgacgagga 3360 cggtcacaca tttaaggtgc gcgctcgacc tgatataaca agtaaggtat tcaaggccaa 3420 atcacatagg aaatctcaga aactgaaaag gcggtcaact cggcgaactg cagacaatta 3480 cgttcgcaca acgggcccca tacagatcgc acccgaaact actcgattgg tcaaggtcca 3540 agccaacttt agcaagaatt ctggactttt gtttgtggaa aaaagcctag caaccagcgg 3600 tggaccggaa gacatatacg ggtgcggcga tacattggta tgtaaaaact cgccattcat 3660 gtacgtctct aatttctcca agaaaccaat caccattcct acgggacagg ctataagtca 3720 aggacaggac cctcagtcgt ggttagacaa ggaaagtcag ttttcggaaa aggatcgcca 3780 caccgtaaac gcgcacgcta atctcctacg tagcgtcatc aacttggaag gaactctcct 3840 caagaaaaat ccattcgcac aaacgtccag gagtgaagtt aaatcattac atgatgcgtc 3900 acgaagggac tactctgccg aagacccttt agccgaacca gcactggaag gagggcctaa 3960 aaccgcagaa atgccagaag gacctacgaa tttcaagcaa gtattagagg aagtcgacat 4020 ctctccaaac ctcacccaac tccaaacgaa attattacag gacgttctca aacgccacga 4080 aagggcattt ggacttgaag gaagattggg gtactatgcc gaggaagtag acattcccct 4140 tctcccaaat actaaaccga tttccgtacc accctttcaa gcgtctcccg ctaacagaga 4200 agtcatcgac aaacagatgg attcatggct tcagctgggg gtcattgagc cgtctaaaag 4260 cccctgggga gcaccggtgt ttatcgccta tcgcaacaat aaaccacgaa tggtaattga 4320 cctgagaagg ttgaacgagc aagttatagc tgacgaattt ccactccctc gacaggacaa 4380 aatattacag tcgttggaag gcagccaata tcttacaaca cttgacgctt tagccagctt 4440 cactcagtta agtatcaagc cagaagatcg agacaaacta 
gccttccgaa gccatagagg 4500 tctgtttcag tttaggagaa tgccattcgg atacaggaac ggacccgcca tgtttcagag 4560 agtcatgcag ggcattttgg ccccttttct ctggatattc gctctagtat acattgatga 4620 tatagtaatt ttttcgaaat cctttgaaga tcatctaatt catttggaat cggtcttgaa 4680 ggccatatca aacgcgaaaa ttacattgtc gcctgggaaa tgtcatttcg cctatcaatc 4740 tctaatgcta ctgggccaga aggtatcacg gttaggttta tctacacata aggaaaaggt 4800 tgatgcaatc gttagcctgg ataatcccag aaatatccat gacttacaag tatttctggg 4860 aatgatggtc tatttctccg cctacattcc cttttatgca tggatcgttc acccgctatt 4920 ccagctattg aaacaggaga ataagtggaa atggggggag gaagaacaaa atgcatacga 4980 cctttgcaaa caggtattaa cacaggctcc agtccgagca catgcgatgc ccggactacc 5040 gtaccgtata tattctgatg cttgcgactt cgctctagct gctattcttc aacaagtcca 5100 gccaattaaa gtcagagatc tgaagggaac taagacttac gagctattgg aacgcgcata 5160 caaagctaag gaaccaatac cagacctcgc aactcatctg gtgaaagaag actccgacat 5220 tccgccgcag ggaggatggg acaaggattt cgagaatact atggtgtacg ttgaaagagt 5280 tgtcgcttac tggttgaggg ttttgcaatc cgctgaacgg aactactcac cgacagagcg 5340 ggaggcttta gcattgaaag agggactaat caagtttcag ccgtatttag agggagaaaa 5400 aatattcgct atcactgacc atgcagcctt gacttggagt aagacgtttc agaacgtcaa 5460 tcgacgactt ctaacttggg gattagtatt ttcagcattt ccgaatatga gaataattca 5520 ccgagccgga agagtccatt ccaacgtgga ccctgtctct cgacttcgac gacgtattcc 5580 acctcaagaa agtcccctag atgacaatgt taacccatta gaattgaaac cagtggaaga 5640 cccattacgc aacatgtacg aggagctggg gccaaggttt gaagaaaaat tgttgacagc 5700 tgctacgcaa ttcgcagaag ccgatcttca agttgaagac gactgtatat tatcctggaa 5760 cttaccatta gtactgccaa atggtgaaga agtcgaaata ccgtactcga cctcgcaatc 5820 atattccact acggtacaaa tcgacgcgca cgaaatgcga aggtggaagg acgcttatat 5880 caaagatcca catttcaaat cagtaattga aagcaaggat aatgaaacgg atttaccttt 5940 atctcagtat catcactccg cggaaggact aatctatttt gaagattcca cggggaacac 6000 tcggttgtgc gtacccaagg atcttcgaat tgaagtaatg caagaagttc ataacacagt 6060 aacggaagca gcacatgggg gttacttcaa aacatacaat agaattagcg gaacgtacta 6120 ttggccaagg atgtctaggg aaattaaggt ctttgtgaac acttgtgacg 
tgtgtcaaaa 6180 gaccaaacca agacgacatg gacctacagg attgctccag cccattccaa tcccatcaca 6240 gccgttcgaa gtggtgagta tggattttat acctgaatta ccaatatcaa acggcttcga 6300 caatatttta gtgatcgtgg ataagcttac caaatacgca atcattatcc caaccactac 6360 aaaggtcacc gaagtagaaa cggccaggct cttcttcaag catgttatcg caaaattcgg 6420 gataccgcgc caagtaatct cggacaggga cacgagatgg agaggtgatt tctggaagga 6480 aatctgccga ttgatgggca tgcgacggtc cctgactaca tcataccatc cgcaagccga 6540 cggacaaacg gaggtattga atcagggtct tgaaatttca attcgtgctt atatcggtcc 6600 agatcgggat gattggagtg agattctgga cgctttgacg ctatcctata actcctcacc 6660 acacactgct acagggttta gtccagctta cctcttacga ggataccacc ctatcacaaa 6720 cactaccatc ctaggtcgac cgtccagcat agatcgaaca gaagtgtcga gttcaaggaa 6780 tgccgatcag gacacaatac acgacaaggc gctagatcta gtggaagggt tcgctgctga 6840 aagagcaaag gccagggacg cgcttctttt aggtcagtta tttcaaaaga aagcatataa 6900 caaagggaga ctgaactggg aatttgagaa aggagacaag gtcgtgataa atcgaaagaa 6960 ccttggcttg ctgagagacg agaaagggag gggtgacaaa ttgttagcca aatacgaggg 7020 accctttgag atcatacaga aggtgagcgc cgttgcttat cgacttcgta tgccggcgtc 7080 ttatgggata caccctgttt taaatattca acatctagag aagtatcagg agtcactgag 7140 ggagttcgga gaacgaccgt gtcttcaagc caacagatta aatttcgacg cgttaccgga 7200 gtacgaggtc gataaaatcg tggcagaacg catgcggaaa ggcaggaacg ggcggaggat 7260 accgatctac cgtttacgat ataccaatta cggtcctgag ggtgatactt gggaaacaaa 7320 gcgaaatttg aaaaacgctc cggagatatt gcgtgaatgg gagaagttca aggcgctcca 7380 aaagaagaag tcagcaagga cagactaagg tgattccaga aatttattta aacgaactgc 7440 ttcaccaaca actacatcat accattccag ttcaaatatc gaatcatcta cccccagtag 7500 aacactatca tctcactatg caatccgtcg aactatacaa ccccaactcg ccaaacttac 7560 acgtctcgtc tctcgcctct atgtactcct tatccgagga tctcaaccct ctcaactacc 7620 agttgactac ggacgacttc attgcacttg ctcaacccga cgggcaactt ttatataccc 7680 ttgatccatc cacctcgaca tctctatttt gggcgttgta tcgtttagtg cggtacatta 7740 cccgcgccgc atcacacgac ctctatcagg acgctaccat tccattcgcg tccaagggtt 7800 acatccctca aatttatttc gaccgaatcg gcatgcagcg cccagacatc ctcgccatcc 
7860 aggaatatca ttccatcaat ccgctttgcg gatataccag tagagaaaat ctcatcgcct 7920 ggtaccgcga cttccagaca atcggacgac tcacagttca gtggatccag accgcgcgcc 7980 gacagacgct tcaactggga gacgggacgg gatggaattg ggaggcacgc cacgggcctc 8040 gtctcgagta cccaaaccgt ctactcatca ccaacgacac ttccgagacg gaagccgatt 8100 ttttttccga cactagttct gaagggtcag agagcgaact gggcagctgg agtgctgaag 8160 gttccggtac gggaatcgcc catgttgaca accgcgtaat tttacaccca accaagtccg 8220 cttacgccgc ccgctacccc gatttcaatc cctttgccgg tttaccgctt atcaacaatg 8280 gctacgtcga cgataacaac aacatgtgcc gagccatgat tattcacccc gctatgggag 8340 ccaaccaggc cggtgcgact atcgattacg ctaatgcgct ttttacggtt caggatgcag 8400 aagagtaatc aaaactcaag gtcagggggg ggta 8434 // ID Gypsy-3_LBS-LTR repbase; DNA; FNG; 234 BP. XX AC ABFE01000084; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_LBS_; KW Gypsy-3_LBS-I; Gypsy-3_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-234 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000084; Positions 861022 861255. XX SQ Sequence 234 BP; 51 A; 67 C; 39 G; 77 T; 0 other; tgtaatcctg ggattaccac atcgggcagt tgtttggcat cggcatccgg actatttggt 60 ttcctacata cacgcgatcc tctcttaacc acgtaccatc atctactcct gcatacgcac 120 ccccttcttt tacctttacc ttctagatac cctcagaatc atatactcgg ttaaattctg 180 acggtttctc tgtgagtagt gttatctgtc ctgtctacac aggacagatt tcca 234 // ID Copia-7_LBS-LTR repbase; DNA; FNG; 265 BP. XX AC ABFE01002190; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. 
XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-7_LBS_; KW Copia-7_LBS-I; Copia-7_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-265 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01002190; Positions 12287 12551. XX SQ Sequence 265 BP; 56 A; 75 C; 50 G; 84 T; 0 other; tgttgagtat tatgattcgt cccgccgcta ccttacaggc ccactctttt cacgtgacct 60 accgtagttc accgtcgtcc gtttacttca gatcttgtat ctaacgtaca cggacgacag 120 gtgagtagta tagtttttac tctagatctt gtatctaacg tacacggacg acagtccgtt 180 tatatactac acttttcagt gttcacgttg gatcccactg tcttgggagt tcctcaagcc 240 gtctctccct gcgccaatcc caaca 265 // ID Gypsy-36_MLP-I repbase; DNA; FNG; 5382 BP. XX AC AECX01001004; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-36_MLP_; KW Gypsy-36_MLP-LTR; Gypsy-36_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5382 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001004; Positions 65014 70395. XX CC Positions [4110-4589] - Integrase core CC 'ACATG' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 216..4997 FT /product="Gypsy-36_MLP-I_1p" FT /translation="MNTRQAGPAEISPDPSKYLPDPEAIIRANRREQKLAK FT QKLPAPSDPDPSLRSSPENSSIQLPATPTRTRKDQNTLTPYKPPAPPFNVG FT KKMSDGQDSSSKSGAATGDLVNIILQSQVTSIKRMERMEEVVAKLVDLRVS FT PTADDQSESAREERGIDLAKFRTSDGPVFKGPYHDTDAFLHWFAALKTFYR FT TKGVVLDKDRITLIGSFIAEPTLHAFYEGGFNRFIHESWQEFLKSLFDAAL FT PKDWMDRLYERAQHLRMSPYEDFMTYSTRARTIQNLINFNEVTLNDYQLAK FT FVEYGMMEELKTSVKLWGFIKSDKSFKYHDFEHQVETLYQALVASHAITKK FT TRPTPGGSSSWTNRGFTAARRPQLDNDEFVWKIHSYLDLQGKCHFCKKYCG FT SEHRQCKGPYVREKTIFPPGYVAPPKPDNYVPPRAKSSPPSQTQAGRATNP FT PAGRPSTSYARVAAAGEFPELDQESVAALEALDEELKEDGTEGCVTQAKTP FT RVILEFECGGQRVRALADPGSEINLLTDRGADHLKLLKRQLVRPTQLGLAV FT TTSGQATILTHFTIANLKDEVSGRRFDRTYFKIGEVGGDFDMILGTPFFNL FT NQLSVSITQRAAVCERTGMKMLDFRFLRELREKEESMNQPRIVEPKVRDRE FT SWREAVSRPEKMAAAVGEVVESRGERLEAEFFTEFSDLFPVDIPAVSDEAE FT DEGLFVDGTFPERLQNESSKVRHKIILKDPNAIINEKQFNYPPKHMNAWKR FT LVDQHLAAGRIRRSTSQYASPSMIIPKKDPNELPRWVCDYRVLNSLTVRDR FT APLPNVDRLVRLVATGIFFSIIDLTNAFFQTRMREADIPLTAVHTPWGLYE FT WCVMPMGLTNAPSTHQGRLEEALGEYLNDICVVYLDDIVVFSNSEAEHVEN FT TRKVLQRLREANLYCSQKKTKLFREEIKFLGHWISAKGIRVDDEKVKQVVD FT WKTPKSAKGVKKFLGTVQWMKKFIWGLQNYVNKLTPLTSSKLDSTKFHWTT FT KEDNAFHNIKRLMTSLPHLKNIDFDSDNPLWLFTDASGSGLGAALFQGKEW FT KLASPIAYESRQMTPAERNYPVHEQELLAVMHALNKWRMLLLGMKVNVMSD FT HHSLTYLLKQKNLSRRQARWIETLADFDVEFKYIKGEENSVADALSRKDVE FT DEVLTSYDVNCAASLIEAGPTLAPGVKTRIQAGYSADGFYKALSSVLPLRD FT ECVEEDGLLFVEGRLYIPPSDGLRQELISEAHSRLGHLGNLKTIADLRRDF FT FWPKMAKEVELFVRSCDLCQRIKTPTTARPGMLMTPPMPTTPLDCLAIDFI FT GPLPKVNNSDMLLTCTCRLSGFIRLIPCNQTDTAEKTASRFFTSWIGTFGA FT PTSIISDRDKTWTSAFWKALMKRLGTSFHMSTAFHPQADGRSERTNRTVGQ FT VLRTFTMKRQGKWLESLPAVEYAINGAVNISTGKSPFELVFGKTQRSLDAT FT GPHEDDPLALQKWLDLRTNAWATARDSLWTSRVQQAVQHNKHHRPAPTIED FT GSWVLLDSGDWNGRHTGGVKKLREKYEGPYKVVRTFNHGLNLELELPAGDK FT RHRVFNVSKIKPYVERGGVEREEEVQK" XX SQ Sequence 5382 BP; 1576 A; 1241 C; 1250 G; 1315 T; 0 other; cttttttttt ctcaaaccca tcctgagaca tcgacgccac accatcgcaa tcactccgaa 60 ttgaacttta caactcaata cacttcgaat cccaaccttt 
cacttccgta ttcaaatcct 120 gaactctatt cgatctagtt catacgtacc ccgcattaga gatcccattc ctcggggtcc 180 taattatcga ttgactatac cattctcaga ctcgcatgaa taccagacaa gccggaccag 240 ccgagatatc gccagacccc agtaaatatc taccagaccc agaagctatc atcagagcca 300 acagacgtga acagaaacta gcgaaacaaa aattacctgc accatcagac ccagaccctt 360 cgttgagatc atcaccagag aacagttcga tacaattacc agcaacacca accagaacac 420 gcaaagatca gaacacactt acgccgtaca aaccaccagc gccgccgttc aacgtaggaa 480 agaagatgtc cgacggtcag gattcatcgt ccaaatcagg tgctgcgacg ggagatcttg 540 tcaacatcat tctgcaatct caagtcacga gcattaagcg catggaacgc atggaagaag 600 tagttgcgaa acttgtcgat ctacgagttt ctcccacagc cgatgatcag tcggaaagcg 660 ctcgcgagga acgtggcatc gatctagcca agttccgcac gtctgacgga ccggtcttta 720 agggtcccta tcacgacaca gacgccttcc tccattggtt cgcagcgttg aaaacttttt 780 accgcacaaa aggagtggtc ctcgataagg atcggatcac tctgattgga agttttatag 840 cggaaccgac tcttcacgca ttctacgagg gagggttcaa ccggtttatt cacgaatcat 900 ggcaagaatt ccttaagagt ttatttgacg ccgcgttgcc taaggattgg atggatcgac 960 tttacgaacg ggctcaacat ctacgcatgt ctccatacga agacttcatg acctacagta 1020 cgcgtgcccg aacgattcaa aatcttatca acttcaatga ggtcaccctc aacgattacc 1080 aactcgcaaa attcgtggag tatggtatga tggaggagct caagacctcg gtgaagttgt 1140 ggggttttat caaatcagac aaatcattca agtaccacga cttcgaacat caagtagaaa 1200 cactttatca agcgttggtc gcttctcatg ccattaccaa gaaaactcgt ccgactcctg 1260 gtggttcttc gtcatggacc aaccgaggct ttaccgcagc tcgaagacct caattagata 1320 atgacgagtt tgtgtggaag attcattcct atcttgatct tcaaggcaag tgtcacttct 1380 gtaagaaata ctgtggtagt gagcatcgtc agtgcaaagg accatatgtg cgagagaaga 1440 cgatcttccc accaggctac gtcgcacctc ccaagccaga taattacgtg ccgcctcgtg 1500 cgaagtcttc accaccgagc caaactcaag ctggaagagc gacaaaccca cccgctggac 1560 gtccatccac gtcgtacgcc cgagtcgcag ctgcaggaga attcccagag ttagatcaag 1620 aatcggtggc agcgttagag gctttagatg aagagttgaa ggaagatgga actgaagggt 1680 gcgtgactca agccaaaaca ccacgagtga ttttagaatt tgaatgtgga ggccaacgcg 1740 tacgagcact agctgatcca ggatcggaaa tcaatctcct tacagatcgt ggagcggacc 1800 
atctcaaatt gcttaaacga caactggttc gcccgactca actaggcctt gcagttacaa 1860 ccagcggtca agcgacgatc cttacccact tcaccattgc taatctgaaa gatgaggtat 1920 cagggagacg ctttgacagg acgtacttta aaataggaga ggttggagga gattttgaca 1980 tgatcctggg gactcctttt tttaatctca atcaattgtc tgtttctatt actcaacgtg 2040 ctgctgtatg tgaacgaact ggcatgaaaa tgttggattt caggttctta cgagaattaa 2100 gagagaagga ggagagcatg aatcagccgc gtattgttga gccgaaagtg agagatcgag 2160 aaagctggag agaagcagtg agcagaccag agaagatggc agcagcggta ggtgaagttg 2220 tagagagcag aggggagcgc ttggaggcag agtttttcac cgaatttagt gatctcttcc 2280 cagttgacat cccggcagta tcagacgagg cggaagatga aggcctgttt gtcgacggaa 2340 cattcccaga acgcttacaa aatgagagct caaaagtaag acataagatc atattgaagg 2400 atccgaacgc aatcatcaat gagaaacaat tcaattatcc gccgaaacat atgaatgcgt 2460 ggaaacggct ggttgatcaa catctcgcag ccggacgaat cagacgatcc acgagtcaat 2520 acgcttctcc gagtatgatc attccgaaga aggaccctaa cgaacttcca agatgggtat 2580 gcgactaccg cgtcttgaac agcctgaccg tgagggaccg cgcaccatta cctaatgtag 2640 accggttggt caggctggtg gctacgggaa tttttttttc aattattgat ttaaccaacg 2700 ctttttttca aaccaggatg agagaggctg atataccatt gacagcagtt cacacaccgt 2760 ggggtttgta tgagtggtgt gtcatgccga tgggtcttac aaacgcaccg agtacacacc 2820 aaggacgatt agaagaagct ttaggagaat acttaaatga tatttgtgtc gtttacttgg 2880 acgatattgt tgtattttca aattcagaag ctgaacatgt agaaaacact agaaaagtct 2940 tacaacgcct tcgagaagcc aatttatact gcagccaaaa gaagaccaaa ctatttcgtg 3000 aggagatcaa gttcttgggc cactggattt cagcaaaagg cattagagta gatgatgaaa 3060 aggtgaagca agtagtagat tggaaaacgc caaaatcagc aaaaggggtg aaaaaatttt 3120 taggaacggt acaatggatg aagaaattta tctggggtct tcaaaattat gtcaacaagc 3180 ttaccccttt gacaagcagt aaattagatt caacgaagtt tcactggaca acaaaagaag 3240 ataatgcatt tcacaacatt aaacgtctca tgacctcatt acctcactta aagaacattg 3300 acttcgactc tgacaatccg ttgtggctat tcactgacgc tagcggttca ggtcttggag 3360 cggcgctgtt ccaaggaaag gagtggaagt tggcatcacc aattgcatac gaatcgaggc 3420 aaatgacacc agctgaaagg aattatccgg tccacgaaca agaactttta gcagtcatgc 3480 acgcgttaaa 
caagtggagg atgcttttgt tgggtatgaa agttaacgta atgagtgatc 3540 atcactcatt gacgtatcta ttgaaacaga aaaatttaag caggagacaa gctcgatgga 3600 tagagacttt agctgatttt gatgtggagt ttaaatatat caaaggagaa gagaattcag 3660 ttgcggacgc tttgtcgagg aaagacgtag aggacgaggt tcttacaagt tatgatgtca 3720 actgtgcagc ttcacttatt gaggcgggtc caactttagc gcctggggtc aagaccagga 3780 ttcaggctgg atactcggct gatggatttt ataaggcttt gtcatcagtt ctgccattac 3840 gagatgaatg cgttgaagaa gatggactcc tctttgttga aggacgattg tatatcccac 3900 caagcgacgg gttgcgacaa gaactgatta gtgaagcaca ctcgaggcta ggtcacttag 3960 gcaacttgaa gaccatcgct gatctgaggc gcgacttctt ctggccgaaa atggcaaaag 4020 aggtcgaact atttgttcgc tcttgtgacc tatgccaaag gattaagacc ccgacaaccg 4080 cacgtccagg catgctgatg acacctccaa tgccaaccac acctctcgac tgccttgcca 4140 tagacttcat cggaccttta ccgaaagtca acaactcgga catgctcttg acttgcacgt 4200 gccgtctatc agggttcatt agactcatac catgcaacca gacagacaca gcagagaaaa 4260 cagcgtcacg gttctttact agctggattg gtacctttgg ggcgccgacc tcaatcataa 4320 gcgacagaga taagacatgg acgtctgcgt tctggaaagc actcatgaaa cgacttggga 4380 ccagctttca catgtctacg gcgtttcatc ctcaagccga tgggcgaagc gaacgtacaa 4440 accgaacagt gggacaggtc ttgcgaactt tcacaatgaa gcgccaaggc aagtggcttg 4500 agtctctacc agcggtcgag tatgccatca atggagctgt caacatttcc acaggcaagt 4560 cgccgttcga attggttttt ggtaagaccc agcggtccct tgacgcaaca ggaccacatg 4620 aggacgatcc gttggcttta cagaaatggc tagatctaag aaccaacgcg tgggcgacag 4680 caagagattc actctggaca agccgggtac aacaagccgt acaacataat aaacatcacc 4740 gaccagcacc cacaatcgaa gacggctcat gggtcctttt agattcaggg gattggaatg 4800 ggagacacac aggcggggtg aagaagcttc gtgaaaagta cgagggtccc tacaaggtcg 4860 tcaggacgtt caatcatggc ctcaacctcg agctcgaatt accggcaggt gataaacgtc 4920 accgagtctt caacgtgtca aaaatcaagc catacgtgga acgtggaggt gtggagcgcg 4980 aggaggaggt acaaaagtaa gttcctccca aagtatgcac cgcccgggta gctaccccta 5040 aaaaataagc tcgaaagaaa gacatacctt ggccactccg tgagcacata acttcgacat 5100 tttcttccca ggactccgtc gacaattggg gacgaacgtg gccacctttc tcttctaccg 5160 atttttatct ttattttgcc 
atttagcttt gtttctattt ttttcttttc ttctttctct 5220 tttcttttct ctcttttgtt tagttttctt ttaaagaaca ggtttcacta attttatagt 5280 aggattcaat caaattattt cttttctttg gattttgatt tttcaatttt tcttttcttt 5340 tcattaaagg ctggtcgaac atttccttaa aggaggggag ac 5382 // ID Copia-32_MLP-I repbase; DNA; FNG; 5274 BP. XX AC AECX01001251; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-32_MLP_; KW Copia-32_MLP-LTR; Copia-32_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5274 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001251; Positions 140621 145894. XX CC Positions [1720-2178] - Integrase core CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 117..1871 FT /product="Copia-32_MLP-I_2p" FT /translation="MTKGDDTKPTEPPVTYNMSTSNSGTTNFSTSAISSIP FT KLTMGNLFAWKTELKIYLKMNGLYEFIEQEIKRPSDFQERSKYDMRQAATL FT YAIRNTIDSANLASIESLEDPKLAFETLVSQHGSDNGIVTANTLTELFSLK FT YDHSIGITNYIAKIQDLHSKIRDLTAGDKDLQLSDRLFAILLVNSLPRNEF FT GLIIQHFLSNIKTITTSDVCARLRLEASSTLGNEEKSKEVYHIKSRKPART FT DNRRVGKLPKDLCHIHPNSKHTNEQCHTQKKNNTKDSSSLSTEEMARRYQA FT MMANSNQTTPTNLNSVSAHVAIENPSSNDFITYAAYNASTKTTKNTEFLID FT SGANTHITSQANLLSNIHTIPNVKISGIGGHDKQVSARLAGTAYIQGTTIS FT GEPRLIAIEDVLLIPNAGVNIISVSSMIKDGATLSSSKSALMLKNSTSDYI FT ITGKEIDGLFITQARRSPVLFAASASIPADVWHRQYGHLNYRSLSKVSPSP FT PRKLDCESCILSKAHRLPFSSHFPKSDTPLFRIHSDVLGPMPTTSIGGGKY FT IVSFIDDATRYNCIKIMRTKSQVYTSFVQYVTESERYP" FT CDS 1774..5274 FT /product="Copia-32_MLP-I_1p" FT /translation="MMLPGTIVLKSCAQNHKYTQVLYNMSLNLRGTHNKRV FT AILKSDRGGEYTSNEFSKYLSQQGIAFERAPAKTPEQNSVSERFNRSLLER FT TRSVMVDSNIPNFLWGEIIMSISFLLNVSPNAAIDMETPLNLWNADISGAH FT KANNDFLRVLGCAAYPLLKSTEYNKLSAKSRACVLVGYEQGARAYRLWDIS FT NKKIIVSRNVVFNERLFPFKNKIPIQTNEINTDVITFSPTPAQNQSTAPEI FT DVNHLPTENLTSTSSNSAVNDTTVLPVIPNSTVQQQTLPPQSPPPNPPQNH FT NLPFHISGPRDNNTISINSSPFLSPQRLSPISPNFSEFSLSEYPTNLSQKP FT EEIIEFTPKQLIPLSKSIHAPKEYTPAPQEIKIISETDTISELEKQKQKEN FT KIKTQKDHELEIEKQKQIRHQYELTLKLEKEKEKEKEKEREEQKQKETEKA FT KLLTYQKELNIRLEKRKQNQIKQQQELEEKLEKAKQKQIAHEKELQLTLEK FT EKEKEREKEKEREKEKEKEKENNKLIIRIKRPKNNNNNIALNKEEQSKLTK FT ESDNCTNISNTTPEEQIEMNNNETNPVENPVLPVTNKQTPKIIPHPPSQRP FT ARDRKPPLRYGNLTIYAAKRSADCPSYAEAMRGNERENWIKAMESEFNSLT FT SHNVGELVDAPSEANIIGGMWRLKRKRDEHGNIIKYKARWVALGNHQIWGV FT DYDKTFASVVQSDTMNMLFSLCASEDWEMEQFDICTAFLNGKMKLPVYTRQ FT VQGFYNPKSPTKVWLLKQSIYGTKQAHREFNSDLEEKLRSIGFTPYKDDNS FT LFTFRRNEEIIHIPMFVDDGLVFSNSKSLIKETREKLEQLYKLVWTTNPTL FT HLGVKITRDRTNRTIHLSQEHYLKSVLDRFDMTHCNTTSTPFPTSMELSPG FT TDEEIEDAAHLPYQQAIGCLNWAAVHTRPDIQYAVSTLARYSSKYTYRHWQ FT VVKHLLRYIQGTLDRGIVFKKHKTPSTELRAYADADYAACTTTRRSTTGYV FT FTLGGSLISWKSRRQPTVALSTTEAEYMALGDCAKHCLWFRRMISHLTQTP FT IPTSPISLPPLSIFNDNNGAVFLSQESATNSRSKHIDIRHHFIRDLIFSHQ FT 
ISTHMIDTKLMPADFLTKNAPKEMLNRCRFLIGNVNTKEIHLISTNTLPKP FT AKVSSKGG" XX SQ Sequence 5274 BP; 1930 A; 1297 C; 792 G; 1255 T; 0 other; tctataggtt atgagcccag cgctccagac gcgccctcta tacacttaca ctaccctttg 60 atccatcaaa ggcactaaat aactatacct acaaaccact ctcgctctct ctctctatga 120 cgaaaggcga cgacaccaaa ccaaccgaac ctccagttac ctataacatg tcgacttcaa 180 actcgggaac cactaacttc agcacctcgg caatatcatc gattcctaaa ttaactatgg 240 gaaatctttt cgcatggaaa actgaattaa agatttatct caaaatgaat ggcctctacg 300 aatttattga acaagaaatc aaacgcccat ccgatttcca agaacgatcc aaatacgata 360 tgcgacaagc agcaacgcta tacgctatac gtaacactat tgactccgcc aacttagcat 420 ctattgaatc actcgaagat ccaaaactag ccttcgaaac tctcgtatcg caacacggtt 480 ctgacaatgg tattgtcacc gcaaacaccc tcaccgaact cttcagtctg aaatacgacc 540 attccattgg tatcactaat tacattgcaa aaatccaaga cttacacagc aaaattcgcg 600 atctaactgc gggtgacaaa gacttacaac tctccgacag actatttgct atactactag 660 ttaacagcct ccctcggaat gaatttggtc tcatcattca acatttcctc tcgaacatta 720 agacaatcac cacaagtgat gtatgtgctc gactcagact tgaggctagc tctactcttg 780 gaaatgaaga aaaatcaaaa gaagtatatc atatcaaatc ccgaaaacct gcaagaactg 840 ataaccgacg agttggaaaa ctgccaaaag acttatgtca catacaccca aactcgaaac 900 acaccaacga gcagtgccac actcaaaaga aaaataacac caaagattct agctcacttt 960 ctactgaaga aatggctaga cgataccaag caatgatggc taactcaaac cagaccactc 1020 caactaactt gaattcagta tctgctcatg tagctattga aaacccttcc tctaacgact 1080 tcatcacgta cgctgcttac aatgcctcca ccaaaaccac caagaacacc gagtttctaa 1140 tcgatagtgg tgcaaataca cacatcacga gtcaagcaaa cttattatct aatatacata 1200 ctattcccaa tgtcaaaata tctggcatag gaggacatga taaacaggtt tctgctagat 1260 tagctggtac tgcatacatc caaggcacca ccatatccgg agaacctaga ctcatcgcaa 1320 tcgaagatgt actcctaatc cccaatgctg gagtaaacat catatccgtt tcatctatga 1380 tcaaggatgg agctactctc tccagtagca aatctgctct gatgcttaaa aactcgacaa 1440 gtgattacat catcaccgga aaagaaatag atggactgtt tataacacag gcacgtcgat 1500 ctcctgttct attcgctgct tctgcttcaa tcccagctga cgtatggcac cgtcaatatg 1560 gtcatctcaa ctacagatct ttgtctaaag 
tatcaccatc tccacccaga aaactggact 1620 gtgaatcttg tattctatca aaagcccatc gtcttccatt ttcctctcat tttcctaagt 1680 ccgatactcc tctttttcgt attcatagtg atgttctagg accaatgcct accacatcga 1740 ttggtggagg aaaatacatt gtatctttta ttgatgatgc taccaggtac aattgtatta 1800 aaatcatgcg cacaaaatca caagtataca caagttttgt acaatatgtc actgaatctg 1860 agaggtaccc ataacaagcg agttgctatt ctgaaatccg atcgaggcgg tgagtataca 1920 tcaaatgaat tctcaaaata tctttcccaa caaggtattg cttttgaacg tgctcctgcc 1980 aagactcctg aacaaaactc ggtaagcgaa agattcaatc gatctcttct tgaaagaaca 2040 agatccgtca tggtcgattc aaatatccca aacttcttat ggggggaaat catcatgtca 2100 atttcattcc tcttaaatgt atcacccaat gcggcaattg acatggaaac ccctctcaac 2160 ttatggaatg ctgacatatc cggtgctcac aaagcaaaca atgatttcct ccgagttttg 2220 ggatgtgctg cctatccgct attaaaatca accgaataca acaaattatc tgccaaatct 2280 cgggcttgtg ttctggtagg atatgaacaa ggagctcgtg cttatcgact atgggatatc 2340 tctaacaaaa agatcattgt ctcacgtaat gtcgtattca acgaacgact ctttcctttt 2400 aagaataaaa tacccattca aactaatgaa atcaacactg atgtcattac cttctctcca 2460 actccagcac aaaaccaatc aactgcacca gaaatcgatg tcaaccattt gccaactgaa 2520 aacctaacaa gcacatcgtc aaattctgca gttaacgaca cgactgtact acctgttatt 2580 cctaattcca ctgttcaaca acagacatta ccacctcaat ctccaccacc caatccgcct 2640 caaaaccaca accttccttt tcatatttcc ggaccacgag ataacaacac tatatcaata 2700 aattcatctc cgtttctatc ccctcaacga ctttcaccaa tttctcccaa tttctcggaa 2760 ttctcactat ccgagtatcc aacaaatcta tcccagaaac ctgaagaaat catagaattc 2820 actccgaaac aactaatacc tctatccaaa tcaatacatg ctcctaaaga atatacacca 2880 gcaccacaag aaatcaagat tatatcagaa accgacacaa tctcagaact agaaaagcaa 2940 aaacaaaaag aaaataaaat aaaaacgcaa aaagatcatg aattagaaat agaaaaacaa 3000 aaacaaatca gacatcagta tgaattaaca ttgaagctag aaaaggaaaa ggaaaaagaa 3060 aaagaaaaag aaagagaaga acaaaaacaa aaagagacag aaaaagcaaa actacttaca 3120 tatcaaaaag aactgaatat cagattagaa aaacgaaaac aaaatcaaat caaacaacaa 3180 caagaattag aagaaaaatt agaaaaagca aaacaaaaac aaattgcaca tgaaaaagaa 3240 ttacaattaa cattagaaaa agaaaaagaa aaagaaagag 
aaaaagaaaa agaaagagaa 3300 aaagaaaaag aaaaagaaaa agaaaataac aaactaatca ttagaataaa aagacctaag 3360 aacaacaata acaacattgc cctaaacaaa gaagaacaat caaaattaac gaaagaatca 3420 gataactgta ctaacatctc aaacaccaca ccagaagaac aaatcgaaat gaacaacaac 3480 gaaacaaatc cagtagaaaa tcctgtacta cccgtcacaa acaaacaaac tccaaaaatc 3540 attcctcacc ctccttctca acgacctgcg cgtgatcgaa aaccccctct tcgttacggc 3600 aacctcacaa tatatgctgc aaaacgaagt gctgactgtc catcctacgc agaagcaatg 3660 agaggtaatg aacgcgaaaa ctggatcaaa gcaatggaga gtgaattcaa ttccctaacc 3720 tctcacaatg ttggagaatt agttgatgct ccatcagaag caaacataat aggaggtatg 3780 tggcgtctaa aaagaaaacg agacgaacat ggtaatatca tcaagtacaa agccagatgg 3840 gtagctttag gtaatcatca gatatgggga gttgattatg ataaaacctt tgcatcagta 3900 gtccagtcag acacaatgaa catgttattt tctctctgcg cttctgagga ctgggaaatg 3960 gagcaatttg atatatgcac tgcttttctt aatggtaaaa tgaaacttcc tgtctataca 4020 cgacaagtac aaggcttcta caaccccaaa tctccgacaa aagtatggct tctcaaacaa 4080 tcaatctatg gaacaaaaca agcgcatcga gaattcaact ctgatctgga agaaaaactt 4140 cgcagcattg gcttcacccc atacaaagac gacaactcac ttttcacctt tcgaagaaat 4200 gaagagataa ttcacattcc catgtttgtt gacgacggtc ttgtattctc aaattcgaaa 4260 tccctgatca aagaaaccag agagaaacta gaacaacttt acaagctcgt atggaccact 4320 aaccccacat tacatcttgg cgtcaaaatc acacgagata gaactaatcg cacaatccat 4380 ctttcccaag aacattacct caagagtgtt ctcgatcgat ttgacatgac acattgcaac 4440 accacctcaa caccttttcc aacctctatg gaactatctc ctggaacaga tgaagagatt 4500 gaagatgctg cccacttacc atatcaacaa gccatcggct gccttaactg ggctgctgtt 4560 cacactcgac ccgatataca atatgccgta tcaacattag cacgctactc atcaaaatat 4620 acctaccgtc actggcaagt agtaaaacac ctactacgat acattcaagg aactctggat 4680 cgtggcattg tattcaaaaa gcacaaaact ccatccaccg aacttcgcgc gtatgctgat 4740 gctgactacg cagcttgtac cacaactcgc agatctacga ctggatatgt ctttacatta 4800 ggtggatcat taataagctg gaaaagccgt cgacaaccca ccgtagctct ttccactaca 4860 gaagcagaat acatggcact cggtgactgt gcaaaacact gcttatggtt tcgaagaatg 4920 atatcacacc taacccaaac acccatccca acatctccaa tctcgttacc 
acctctcagc 4980 attttcaacg acaataacgg cgcagtattc ctctctcaag aatcagctac aaacagccgc 5040 tccaaacata ttgatataag acaccatttc attcgtgacc taatattctc acatcaaatt 5100 tccacacaca tgattgatac aaaattaatg cctgccgact ttctcaccaa gaatgcccct 5160 aaggaaatgc ttaatcgatg ccgatttcta attggtaatg taaatacaaa agaaatccac 5220 cttatatcta caaatacact tcctaaaccc gcgaaagtat cgagcaaggg ggga 5274 // ID Copia-41_MLP-LTR repbase; DNA; FNG; 261 BP. XX AC AECX01001336; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-41_MLP_; KW Copia-41_MLP-I; Copia-41_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-261 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001336; Positions 71277 71017. XX SQ Sequence 261 BP; 66 A; 44 C; 35 G; 116 T; 0 other; tgttatggat gtaatctatg tgacattgat tgatattaca tcatagtagt ttgctaactt 60 tctcttcact gtaacaatat tcttataaca ttattagcat tagttattga tatactgtta 120 gttgttgtta gactggattg gtttctctct tcttcacctt tccaatattg tctaattgtt 180 tctttcaggt aagctcacta aatgcaattt acctttccaa tattgtctaa ttgtttcttt 240 cagagtgaat ccttctcatc a 261 // ID Gypsy-105_MLP-I repbase; DNA; FNG; 10023 BP. XX AC AECX01000506; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-105_MLP_; KW Gypsy-105_MLP-LTR; Gypsy-105_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. 
XX RN [1] RP 1-10023 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000506; Positions 209793 219815. XX CC Positions [8824-9303] - Integrase core CC 'TCTCA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS 6214..9924 FT /product="Gypsy-105_MLP-I_1p" FT /translation="MPWILKYGHLIDWKAGVLRTQEENIAATEMVSSYPKT FT PLPSPGMEPVRDARKCDEGMCINSDTLASPQCEFDSTISDHSREATGKHLS FT LLNNSVTKPITTTTTMEPLASTAVDSSPPTINPNGPLEEPKGHARNSDEGA FT SVLIDTVMPPQCEFDISKSQSLTESAGQPFSPLNTDVNVDAAKTSWSTSAK FT LAAEEKKKLPVKNVEELVPTQYHRHLHMFVKSKALGLPPRRRYDFKVDLIP FT GAQPQASRIIPLSPAENLVLDEMIKSGLANGTIRRTTSPWAAPVLFTGKKD FT GSLRPCFDYRRLNSLTVKNKYPLPLTMDLVDSLLDADEFTKLDMRNAYGNL FT RVAEGDEDILAFICRQGQFAPLTMPFGPTGAPGYFQYFMQDILVGRIGKDT FT AVFLDDIMIYTKKGEPHAPIVDIVLDILGKHQLWLKPEKCKFSKSKVEYLG FT LLISHNKIKMDPTKVRAVSDWPEPRNVSELQHFIGFANFYRRFINQFSKTT FT RPLHDLTKNNTQFKWDERCKKAFDSLKLSFTSAPVLKIADPYKAFILECDC FT SDFALGAVLSQRCDKDGEIHPVAYLSRSLAQAERNYEIFDKELLAIVASFK FT EWRHYLEGNPNRLDVIVYIDHRNLESFMTTKQLTRRQARWAETLGCFDFTI FT KFRPGRQSTKPDALSRRPDLAPTQEDKLTFGRLIRPENLSSDSFIKIDSIE FT CFFEDKTIELENAEKWFEVDVLGISEEEDLSNSVAQEDETHSDMEIIDCIR FT NSTKNCDRLQKIISTVHNPASSKVKEAVSRYSVKNGLLYNKHHIEVPNDNS FT LKLKILKSRHDSLLTGHPGRSKTLGLVRRCFNWPSIKSYVNKYVDGCDSCL FT RVKSIHQKPFGSLEPLPILAGPWTDISYDLITKLPTSNGYDSILTVVDRLT FT KMSHFIPCRESMSSEELADIMVKDVWRLHGTPKTIVSDRGGVFISQMIKEL FT NKRLGIRIQPSTAFHPRTDGQSEIVNKTIEQYLRHFVSYRQDNWSDLIPTA FT EFAYNNRDHASTGVSPFKANFGYNANFGGIPLGEQCIPAVEERLKILEEVQ FT TELKECLEASQEEMKVQFDKGVRDTPTWQVGDQVWLSNKNIATTRPSPKLE FT HRWLGPFSIIEVNSRSTYKLDLPAALRGVHPVFHVSLLQKHNPDEIKGRKL FT KEPSAVFIEGNEEWEVNEILDCRIRNRSREYLINWKGFSTENNSWEPVGNL FT KNSKDLVKQFDKKFPNAALKYKKRRRK" XX SQ Sequence 10023 BP; 3210 A; 2140 C; 2141 G; 2532 T; 0 other; tattgtcgga tcttcccgac gagcactgag gaactaaaga actagacacc accgatcgaa 60 gaaaagaacg aaccagatag attagaatta gaattgaaaa ccttaaaata gaattagaat 120 tgaaaccttg aagagattac 
atcagaactt cagaactcac ccgaatcccc agtttgaaaa 180 ccttatctta ggatcttcaa aaccttgtct tacatctcat accgaacgga taaaccccgc 240 agtaaaaccg atcaaccatc tactgaatcg agatctaaaa cgccaccacg tctcctactt 300 tcgaaacccc gcaattcgac acatcctctg agtcagacga cgacgtcgat ctcgagaacg 360 aagcgaataa tcacttcgtt gacgctgaaa ctgcagttga agctgaaact gcagcaatgg 420 aagatatcca gaggcagtta aatgaactca acgcttcgct agcggaagaa cgccgaggaa 480 ggaggacatc atttagaaat tctccaccac catcaagtgt gaaacacttg accttcttat 540 cgcactgacg ttcaactagg gatatgtaag tatgaatggc attgtacact tcatccttct 600 cttttgattt aaggcctgtg atgtgtcgaa aagacgtgtg atcgtcggta aagagaatga 660 aatacagact tcgacatatt gatggtgtac gcataatacc actcaagtca atatggatgt 720 tatctagcgg ttgtaaagat cttgtgcgag tttcggagtg aggctggcga tgggctttag 780 aaagatcaca tgaattgcat ggtaaagaaa ttggaagaac aagttttgat agcgacaaag 840 ggacccctaa cacaacatct cccttgatca tttgcttcag ataagcataa ttcgggtgac 900 ctaatttatg atggagtagt gttagagatt cagtgagatt gtcgtttatg gctgatgatg 960 ttgaaatgag ttcagatggt ggtgtacgca ccgcattgag ttgaaggagc atcgaggttg 1020 gatgaacctt tccggctagg acagtagcac cgtctcgctt gatgacgaaa gattgatcag 1080 atagccgttc cgttatccat ccttcgaaaa acaatcggcc tcctgcaatg aggttctgat 1140 tgagagccgg tacatacaga cagtccgtga gagttacaat gtctccagtt ggaccacaga 1200 tagtaacatt tcctcgtcct ttgatttcta aggttgagga tcctcccgct agacctactt 1260 ggtgattaga tgataacaag ggctgaaagc tgtctttatc gaagtacttt tcagatttga 1320 acatgtgatg ggttgtaccg ctatccacta taccaaaatg tttgttttca attgtggctt 1380 tagggtcgac tatgtatacg gcactagcgc ttgccgagaa ctccgatcta tccaaagctt 1440 cttgaaggga cggtagagat ggttattcgg acgtgacaga aatagatgaa gtagatggag 1500 tagatgaaat tgatgatgaa gtcgaagtgt tagatgtttt cggtgtagga cctctttcta 1560 tccactctcc tttagccttc ttcgcatcaa cccatgcttt gtatttgtta atatttttag 1620 ggtttctgat gcagttggcc tcagtgtgcg ttaatgacac gcattttcca ggagagcacc 1680 tagtttgcac atacccttga cgtttcctca tagaactggc tgcacttgat tgaatgtttg 1740 gaaacacagg agggttcgtc gggtgcaaca tagctctatt agtaatacga actttaagtg 1800 cagttactac atcgtcgtat tcaaaggtgt tgctatcagc 
gatatcttca gctaccttca 1860 tccaatcgga atttaaagac ataagaagct gttgaccaag ttcgacactt ccgatctttc 1920 caccagcact agtgtaagct gtgtacaatt ggttgaattc gtccatgtga atgggtagta 1980 gcgaatcacc ttgatttaaa ttgaatagaa gacgtctaag gagcatttga tgtgctggag 2040 ttcgtggtga atgatgatct cgaattttaa gccaaagttt actaggagtg gatgagatat 2100 ctggaccaat tgccgtttca acagtgatag ctggttcaaa acgatttcta ttcgcttgat 2160 ccaattttga taacatccaa tttgttaatg attctctaac aacactatta aatagttgta 2220 tgtctgcttc ggttgattct tcatctttat cagataatag aaaagatgta aaggataaga 2280 atcgaatacc ggataagatc tcggatgacc aatttggata gttggtatca ttaagctttt 2340 ccgtgattgt aattttggat aaagtattaa ccatggaaga agctatttga aatcgcctac 2400 taccaaatgc agtattggta gtagtgacgg tagtggtacc gattggagga attgttgttg 2460 acatggcagg aggtaacaaa ttttggatga gtgtctgatc actgattgat tcgggttctg 2520 atgaggattg aaactcttct tctgaagata ttacggaagc cgtgacgtga agtgaccgcc 2580 acggtttttt caaaacttcg gttttcggtt tttggttttc agatttcaga ttttcgactt 2640 ttcagtcttt tttgaatttc tgatctccta cggaagccca aacttattca atcgcctcaa 2700 agatcaagtt tgtttcgact cattggaaga cccgatcgaa catctcgaac actattgacg 2760 ccatcaatca ccacctcagc tgaaaggctt ccttcaaatc tatagcccgt tggctttcat 2820 atctcaaact catccacaac ctccacagat gagttcagtt tccattctct tttagaccaa 2880 atgagaaact gaaccatagc tttgatcttc atattgattg taaaagatgg gaggtgcatt 2940 ttcacttctt aagcaccttc ggtatgtctg ttgacttaaa cgttcattgg caaaaaaagc 3000 ataggatgag tcgtcatcac tcgagaatgc tttattctct agcctcgtgt atactacgat 3060 ctatgtaacc aagttgtgaa tttgtcattt aatagtgatc aacacatact ctttgagctc 3120 ttcatcttga caataaggac catgagacta ttgatcttca ccaagaagat cagaaagaag 3180 tatttccaga aaggagtatt tggaaccata ctgacacaaa ccattcaagc aaacatagga 3240 cgagcttgtt agttgcagac ccttaacaag atacaccgac tcaagattaa tgacattgca 3300 ataaacatga aaacaagcga ttgagaaagg caaaaaagcg aggtaaaact ttcatataat 3360 ctatcatcaa gctaatatac atgattagtc tcaatcttgt aggttcaaac tttggaaccg 3420 aatagcctcg agaggaagaa aagcaacatt tttcaccgcg taaaaaatga attcaataat 3480 ctcaccgttg aacgtaaaca acccgatcaa ttctgagttt actcacaccg 
cctaaccagc 3540 tggacacgtt cccactgata tgaatagttt ctgtgtagac ttgatcacca acggagctga 3600 tgagattgaa catttaccag gatgtcatca tcatggactt gttgctcttg aaaaagaagc 3660 aattcgccat tgtctttgaa ttgaatagaa atgtggtcat cgtcatagac ttctcacaaa 3720 tgtgcaaggc aaggggcact tttagactac aaaaatgaca gtgtctttgc taagggcaac 3780 gtagtgtagt cttattaatg caggggcttg aggcaaatgc atgtttggaa atcaaaaaga 3840 ctgaaaagtc gaaaatctga aaactgaaaa ccgaaaactg aaaaccgaag ttttgaaaaa 3900 actgtggcgg tcacttcacg tcacggcttc cgtaagatat cgattgagtt ggagaagtgg 3960 tattttgaac tagaggtacg gtaatctctt cgtcattcat aaatcaattg cgaatgaaat 4020 gtcggatcga tttgagtctc tcgctaccat gtaaatggct tgagacaccg tggtacctga 4080 tgttattgtg ctaaatgtaa gcacacttct ccaaagctcc acacgtatgc agacgtggga 4140 ctaacctgat tttattgtgc taaatgtaag cacacttctc caaagctcca cacgtatgca 4200 gacgtgggac taacctaatt gtttggaggt aaatagaaaa cgagaaataa gttgagagca 4260 acgatgatga gctcagacaa ggatctaagg gatttctctt tccaatagat atattaattg 4320 ctatcataac taacctgatg ttattgtgct aaatgtaagc acacttctcc aaagctccac 4380 acgtatgcag acgtgggact aacctgaaca acaaatagat gagaaacaaa acaacgtatt 4440 gagaagttag ataacgtaag aataagatat gaaatggtta tatatatgta cctaattgtt 4500 tggaggtaaa tagaaaacga gaaataagtt gagagcgaca atgatgagct cagacaagga 4560 tctaagggat tctaccagat tgatgaggtt agtgataatg cgaaaactaa ggataatcga 4620 aaacttactc tctttccaat agatgaggaa aagagagtaa gatgcgaagc taacaacata 4680 tatatctaaa catcatcgaa acataatcat tggggttgat caacatgaaa gttggttgag 4740 aaacatacga aaacgcagca caacactctt actactatga gattagaaaa gataagaaag 4800 cacactcgta acatctgatg atcacctttc acctcactat gacacttgta acgtatgtct 4860 tgattgtctt tcaacacatc taagatccca gcaagacaac gcacgagccg ccgcctctaa 4920 tcccaatcct cacattcatc caatgcaaga cttagcgaga ctagaacctg tgactaaagg 4980 tcctaaagtc tctactcccg acaagtttag cggatcaagg gggatgcacg ctgaagtctt 5040 cgcgagtcaa cttcaatttt atatgatggc ccacccttat ttatttcccg atgatcgtgc 5100 caaagtggtg ttttccctgt cttaccttac cggaacggcg agcagctggg ctcaaccgct 5160 taccaaggag ctattagacg aatcaacggc gcacttagtc actttcgaac gttttgttag 
5220 gaattttaaa gctatgtact ttgatacaga gaagaaagct aaagccgaaa aggcgattag 5280 aagcttaact cagaaaggga ctgtggccac atatacacat gaattcaact tacatgctgc 5340 caacactgaa tgggaaatcc ctaccctcat tagtcaattc aagcagggac ttaagaagga 5400 aatccgggtc gcgatggtat tagctcaaga accatttact agcattgaac aaatatccaa 5460 tatggcaatt aaactggata acaagattca cggtgttgca gacacctcca ccgctttacc 5520 aaaccacgct cccgacccca acgcgatgga tctatcacct atgaactcca ggctgaccga 5580 cgaagagaga gcctcagcca aaaggatgcg ctcaggcaac tgctttcgct gtaacgggca 5640 tggacatatc tccgcggcat gcccaaatag aaagaccaac acatcattca aaaacaaatc 5700 aaattacaat agtcgaattg ctgagttgga ggtacaattg gcagctttac gtagtgaagg 5760 gagtaacagt agagaatcat ctagtagagc agaatcctca aaaaatggag gagctcagga 5820 atgactgttg tgccatacct gggcggaaaa gagaattggg aagaagtgaa attaggagct 5880 agtgaaattg taaattgcaa tgaaaaagat ccgcgactat tttttcaaac ttcactccat 5940 accatcccca attcccgagc cacaaccacc gagacctccc actccttgat gtttcttata 6000 gattcaggag ccactcatga tgtcttagca gaatctttca tcgcaaagac aaacctccaa 6060 gaccacactt ttgagagtaa ccgatgcgtg acaggttacg acggattcac cagttgatct 6120 tcacgcgaga ccaatctgat ccttgaaggc gacacatcac caacccactt tgtcatcacc 6180 aagctcaagg accgatatga tggaatatta ggtatgccgt ggatactcaa atacggacac 6240 ctaattgatt ggaaagcagg tgtgctacgt acacaagaag agaacatcgc agccaccgag 6300 atggtgtcgt cctatccgaa aacaccctta cctagccctg gaatggagcc cgtgagggac 6360 gctaggaaat gtgacgaggg gatgtgtatc aattctgata cgttagcatc cccgcaatgt 6420 gagtttgata gcactatctc agaccattcc cgtgaagcaa ctggcaagca cctatctctc 6480 ctgaataaca gtgttaccaa acctatcaca acaacgacaa caatggaacc acttgcgtct 6540 accgcagtag actcgtcacc tccgacaatc aaccccaacg gtccactgga ggagcctaag 6600 gggcacgcta ggaacagtga cgagggggct agtgtcttga ttgatacagt gatgcccccg 6660 caatgtgagt ttgatatatc caaatctcag tctttgaccg aatcagctgg ccagccattt 6720 tctcccttga atacagatgt caatgttgac gcggcaaaaa cctcgtggtc gacctcagca 6780 aaattagcgg ctgaagagaa gaagaaatta ccggtcaaga acgttgaaga actagtccca 6840 acccaatacc accgacatct tcacatgttc gtcaagtcta aggcactggg gttaccacca 6900 
agaaggagat acgatttcaa ggtagacctc ataccaggcg cgcaaccaca agcaagcaga 6960 atcatcccgc tatcaccagc ggagaattta gttctagatg aaatgattaa atcaggttta 7020 gccaacggaa caatacgacg caccacctcg ccttgggctg ccccagtgct cttcacaggg 7080 aagaaggatg gcagcttaag gccatgcttc gactacagga ggctaaattc cctgacagtg 7140 aaaaacaaat acccgctccc actgacaatg gacctagtgg acagcctctt agacgctgat 7200 gaattcacga agctggatat gcgcaacgct tacggcaatt tacgcgtggc cgagggagac 7260 gaagacatac ttgcatttat ttgtcgacaa ggccaatttg ccccacttac catgcctttc 7320 ggcccaaccg gagccccagg atattttcaa tactttatgc aagatatcct ggtggggcgt 7380 attgggaagg acacggcggt tttcctagat gacatcatga tttataccaa gaaaggcgaa 7440 ccccatgcac ctatagttga cattgtcctg gatattttgg gaaagcatca gctatggttg 7500 aagccagaga agtgcaaatt ctccaaatcc aaagtcgaat accttggttt attgatttca 7560 cacaataaaa tcaagatgga tccaacgaaa gttcgagcag tgtcagattg gccagaaccc 7620 cgtaacgtct ctgaattaca acattttatt gggttcgcaa acttctaccg cagattcatc 7680 aatcaattct caaagaccac acgaccccta catgatttaa cgaagaataa cactcaattc 7740 aagtgggacg aacgctgcaa gaaggcattt gacagtctga aattgtcatt cacatcagct 7800 ccagttctaa agatcgcgga tccctataaa gctttcatac ttgagtgtga ttgctcagac 7860 tttgcgctag gggcggtgtt atctcaacgt tgcgacaaag atggcgaaat acaccctgta 7920 gcctacctat cgagatcatt agcacaagcg gaacggaact acgagatctt cgacaaggaa 7980 ttgttagcca tagtagcttc cttcaaggaa tggagacatt acctcgaagg aaaccccaac 8040 agactcgacg tgatagtcta catcgaccac agaaacctgg aatctttcat gacaaccaag 8100 caattgacaa ggcgacaggc tcggtgggcg gaaacactag gatgcttcga cttcaccatt 8160 aaattcagac caggacgaca gagcactaaa cccgatgcat tatcacgacg accagactta 8220 gctcctacgc aagaagataa attaactttt ggccgactaa ttagaccaga gaacctatca 8280 tccgactcgt ttatcaagat tgatagcata gaatgtttct ttgaagacaa gactattgaa 8340 ttagaaaacg ctgaaaaatg gtttgaagtc gacgtgttgg gaatcagtga agaagaagac 8400 ttaagcaact cagtagcaca ggaggatgaa actcattcag acatggaaat cattgattgc 8460 attaggaact caacaaagaa ctgcgataga ttacaaaaga tcatatccac agtacacaat 8520 ccagcatcgt caaaggtgaa agaagcagtc agcagataca gcgtcaagaa tggactatta 8580 tacaacaaac 
atcatatcga agtccccaac gataactcac tgaagctcaa gattctcaaa 8640 agccgacatg acagcttact cacaggacat cctggaagat caaaaacgct gggccttgtc 8700 agacgatgtt tcaactggcc gtcaatcaaa tcgtatgtca ataagtatgt agacggatgt 8760 gactcatgct taagggtgaa aagtatacac cagaagccct ttggttcgct agaacctcta 8820 ccaatactgg caggaccatg gacagatatc agctacgatt tgatcacaaa gctacctaca 8880 tctaacgggt atgatagtat actgaccgta gtcgatagac tgacgaaaat gtcgcatttc 8940 ataccctgta gagaaagtat gtcatcggag gagctcgcgg atatcatggt caaggacgtc 9000 tggcgactac acgggacccc aaagaccatt gtatccgaca gaggaggtgt cttcatttca 9060 caaatgatca aagaactgaa caaacgctta ggaatccgca tacaaccctc aacagccttc 9120 catccaagaa cagacggaca atctgaaata gtcaacaaaa ctatagagca gtacttacgt 9180 cattttgtct cataccgtca ggataactgg agcgacttaa tccctactgc tgaatttgct 9240 tacaataaca gagatcacgc ttccacaggt gtatcaccct ttaaggcaaa ctttgggtac 9300 aacgcaaact ttggaggtat acctttgggc gagcaatgta tacctgcggt agaggaaaga 9360 ctcaagatcc tggaggaagt tcagaccgaa ttgaaggaat gtttggaagc atcacaagaa 9420 gagatgaagg ttcaattcga caaaggcgta agagatacac caacatggca agtaggcgat 9480 caggtttggc tcagtaataa aaatatagca acaacaaggc caagtccaaa attagaacat 9540 cgctggctag gtcctttttc tatcattgaa gtaaattcac gatcaactta caaattggac 9600 ctacccgcag cactaagagg cgtacaccca gtattccatg tatcactcct acaaaaacac 9660 aacccagacg aaatcaaagg aagaaaattg aaggaaccca gcgcagtttt catagaagga 9720 aacgaagaat gggaagtcaa tgaaatactc gactgtagaa tcagaaacag aagtcgtgag 9780 tacttgatca attggaaggg ttttagcact gaaaacaatt cgtgggagcc agtagggaat 9840 ttgaaaaaca gtaaagattt agttaaacaa ttcgacaaga aatttcccaa cgccgcactc 9900 aagtacaaga agagaaggag gaagtagaga gagggctaag ctttttccca cgaggttttt 9960 taacgctgcc cgtggaaaga atgcagactt gcaagaggaa gttgggcata aaagggggaa 10020 taa 10023 // ID Gypsy-2_CCO-I repbase; DNA; FNG; 5493 BP. XX AC AACS02000013; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_CCO_; KW Gypsy-2_CCO-LTR; Gypsy-2_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-5493 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000013; Positions 136266 130774. XX CC Positions [2553-3059] - Reverse transcriptase CC Positions [4189-4689] - Integrase core CC 'ATTAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 330..1373 FT /product="Gypsy-2_CCO-I_2p" FT /translation="MSTHPSPLQVPPANDGQPAPQLLQPDQTQVQGQQVSA FT WDYMYQALGSLADSVANLVAAVSGYQPALAQASEVLAQSAAANTQVSEALA FT QRADKPKTKPDPPETYDGKLEDAQAFLDACVLYFKHIREADELRQISYALS FT KIKGGTNGMASAWANSQRAEIIADTLAYDDWKGFAKAFLAHFQVQNSRAEA FT ILAIQTIEMGDRSCEAYNTEFRSYMTRSGYTEVALIEEYKRGLNGRLLDKC FT HDQENLPTTLAGWMERAANLDRQYRIRKREKARFKTEKPAPKATSTPPPRP FT FFPSFRPTAQSTPTPPPPRPADPNAMDIDAHRRQGLCFTCHQKGHISRDCP FT TRPRS" FT CDS 1521..3923 FT /product="Gypsy-2_CCO-I_3p" FT /translation="MPELESHDVYARSKRDNARFYGKRVEPGSAEWVAQRL FT DDSDPSMARGILMSWVHPKKRQEVTDEKISQLRQLLDGPVVEALKEMARPK FT RFIRGHKGNQLDLDMVMSTLDDKRSFRTSGLLDSGCTGSCIDRDFVRRNNI FT RTRKLPIPIPVYNADGTENAGGAISECVELHVKIRDHIERLEFGVTELGKT FT DVFIGFEWLKRHNPSIDWRAGTIEFNRCPIECTPLVGSDPESEEIEYEDLE FT DGEHILVVDIQHEIRLRAFQTTSGKIAERKAKEEKAKTFEELVPSHYHDFK FT FVFSKQSFDELPPRRPWDHAIELIPGAETMDCKIYPLNPEEQQALDAFLEE FT NLQSGRIRPSKSPMASPFFFVKKKDGSLRPVQDYRRLNDITVKNRYPLPLI FT QELVDKLRKARYFTKLDVRWGFNNVRIKEGDEWKAAFRTNRGLYEPLVMFF FT GLTNSPATFQTMMNAIFREEINSGKVIIYMDDVLIFTETLEEHREMVRRVL FT SKFAEHKLYLKPEKCVFEAKEVEYLGVIVSHNQVRMDPAKVDAVRDWPTPR FT SKREVQQFLGFANFYRRFIKGYGEIARPLTELTGNKEWTWGQAQEVAFQAV FT KDRICSAPVLAIPNDTGKFRVEADASEFATGAVLSQQQPDGKWKPVAFISH FT ALNPTERNYEIYDKEMLAIMRALSDWRQYLLGAKHVFEIHSDHKNLGYFRK FT 
PQKLNRRQARWLTEMQQYHFTLHHKPGKQMTKADLLSRRADHDRGANDNED FT VILLKEETLQKGRSGCLRTRCQLLGTGQTLLEEKGQSGGQGSTAEGPRLD" XX SQ Sequence 5493 BP; 1540 A; 1552 C; 1305 G; 1096 T; 0 other; ttagctaaag actgcccctt ctcgactgtg tctatccccg gcaccctcga gtctctcatc 60 gccctccgcc ctaacaaggc aaccccactc gcgttactta ctcccttata tagttccttt 120 tctctttccc ctttcagtag gttatccttg gcgacacccc ttctgtagga aggcagcaac 180 caagagagct gcggaactta ggtccaaagc acgtagtgta gctcccctct gtgtggctaa 240 cacagtgaca accgggagat agtctagctt ttcttacctt acctttttct ttctcttgcc 300 cacgagccca ttgttctttt tacgatacca tgtccaccca cccctccccc ctacaagtac 360 cccctgccaa tgacggccaa cccgcccctc aactcctcca gcctgaccag acccaggtcc 420 aaggtcagca ggtctccgct tgggactaca tgtaccaagc cctgggaagc cttgccgaca 480 gcgtggcaaa cctggtggct gccgtttccg gttaccagcc agcccttgcc caagcttcag 540 aggtgttggc ccagtcagct gctgccaaca cccaggtctc tgaggcctta gcccaacgag 600 ccgacaagcc caagaccaaa ccggacccac ccgaaaccta cgacgggaag ctggaagatg 660 cccaggcctt cctcgatgcc tgtgtcctct acttcaagca catcagagaa gcagatgaat 720 tacgacaaat ctcctatgct ctctccaaga tcaaaggagg aaccaacggc atggcatctg 780 catgggcaaa ttctcaacgg gccgagatca tcgctgacac cctcgcttac gatgactgga 840 agggcttcgc caaggccttc ctcgcccact tccaggtcca gaactcccga gcagaagcga 900 tcctagccat tcaaaccatc gagatgggag atcgctcctg tgaggcttac aacaccgagt 960 tccgatcata tatgactcgt tctggataca ccgaagtagc actcatcgag gaatacaagc 1020 gagggctgaa tggcaggcta ttggacaagt gccatgacca ggagaacctt cccacgaccc 1080 ttgccggatg gatggagcga gctgctaacc ttgatcgcca gtaccgaatt cgtaaacggg 1140 aaaaggctag attcaaaacg gaaaaacctg ctcccaaggc gactagcact cctccaccaa 1200 ggcccttctt cccttccttc cgacccactg cccaatccac ccctactccg cctccccctc 1260 gtcccgctga cccaaacgct atggacatcg atgcgcaccg acgacagggt ctctgcttta 1320 cctgtcacca gaaaggacat attagccgag actgtcccac aagacccaga agttaacacc 1380 agaaccatcg acatcagaac cctcacggag gatgaacgtg ccgcgctgaa gaaaacactc 1440 gaggattttt acacggaccg ggagtgagcg cagacctgcc ggtcgataga agtacctttg 1500 ttcgatcccc atatgtacat atgccggaac ttgaatcaca tgatgtttat gcacgatcga 
1560 aaagggacaa tgcgcgattt tacggaaaga gagtagaacc gggctcagca gaatgggtag 1620 cccaacgact tgacgactca gatccctcga tggcaagagg aattctgatg tcttgggtgc 1680 atcccaagaa aaggcaagag gtcacggacg aaaagatatc ccagctcaga cagctattgg 1740 acggaccagt tgtagaagcc ctaaaggaga tggctcgacc taaacgattc attcgtggcc 1800 ataagggcaa ccagctggat ctagacatgg tgatgagcac cctagacgac aagcggagtt 1860 tccgaacttc tggactctta gatagtggtt gcaccggaag ctgtatagac agggactttg 1920 ttcgacgaaa caacatccga acaaggaagc tccccattcc catccctgtc tataatgccg 1980 atggtacgga aaacgcagga ggagctatct ctgaatgtgt tgaactccac gtcaaaatcc 2040 gagaccacat cgagagactg gagtttggag taaccgaact agggaaaacg gatgtcttca 2100 ttgggtttga gtggctgaaa cggcacaacc cctctattga ctggagggct ggtacgatag 2160 agttcaaccg gtgtcctata gaatgcaccc cacttgtcgg atcagaccct gaatcagagg 2220 aaatcgaata cgaggacctt gaggatgggg aacacattct cgtggtagac atccaacacg 2280 agatccgact acgagcattc cagacgacat ccggaaaaat tgctgaaagg aaagccaagg 2340 aggaaaaggc caagacattt gaagaactgg taccgtcaca ttaccacgac ttcaaatttg 2400 tgttctcaaa acagtccttc gatgaactac ctccgaggcg accctgggac catgcaattg 2460 aactcatccc tggagcagag accatggact gcaaaatcta cccattgaat ccagaagagc 2520 aacaagcact tgatgccttc ttggaggaaa acctgcagtc aggaaggatc cgtccgtcta 2580 aatcccccat ggcctctccg ttcttctttg tgaaaaagaa ggatggctcc cttcgaccag 2640 tccaggacta cagaagactt aatgacatca ccgtgaagaa ccgatacccc ctgccactga 2700 tccaagagtt ggtagacaaa ctcagaaaag cacggtactt caccaaacta gatgtccgtt 2760 ggggcttcaa taatgtccgc atcaaagagg gagacgagtg gaaagcggcg ttccgaacca 2820 acaggggact gtatgaacct ctggtgatgt tcttcggact tacgaactct cccgcaacgt 2880 tccaaaccat gatgaacgcc atattccgag aagaaatcaa ctctggaaag gtcatcatct 2940 atatggatga cgtccttatc ttcacggaaa cattggagga acatcgggaa atggtccgaa 3000 gagttttgag taagtttgcc gagcacaaac tttacctcaa accggagaaa tgcgttttcg 3060 aagccaagga agtggaatat cttggcgtca ttgtctccca caaccaagta cgcatggacc 3120 cagctaaggt tgatgctgtc agagactggc ctactccacg atcgaaacgc gaagttcaac 3180 agttccttgg gttcgccaac ttctatcgac ggttcattaa gggatacgga gagatcgcta 3240 
ggccactcac agaactgacc ggaaacaagg aatggacttg gggacaagca caagaagtag 3300 ccttccaagc agtaaaggat cgaatctgct ctgcacccgt ccttgctatc cccaatgaca 3360 ctgggaagtt ccgtgttgag gctgatgcat ccgaattcgc gactggagca gtcctctccc 3420 aacaacagcc tgatgggaaa tggaaaccag tagccttcat ctcccacgcc ctgaacccga 3480 cagaacggaa ttacgagatt tacgataagg agatgctcgc catcatgcgc gcactctcag 3540 attggagaca gtatcttctt ggagccaagc atgtctttga gatccactcc gatcacaaga 3600 atctcggata tttccgaaaa ccccagaagc tgaatcgccg ccaagctcgc tggctcactg 3660 agatgcaaca gtaccatttc acccttcatc ataagcctgg gaaacagatg accaaggctg 3720 acctcctctc gcgacgagcc gatcatgatc gaggagctaa cgataatgaa gatgtgatcc 3780 tcttgaagga agaaactctt cagaagggtc gaagtgggtg tctcaggacc agatgccaac 3840 ttcttggaac aggtcaaacg ctcttggaag agaagggaca aagtggtggc caaggctcta 3900 ctgcagaagg acccagactg gattgaatcc gacaaacacc tcgtgacgtt caaaggaaga 3960 gtgtacgtgc caaaggataa ggggctccga gaggacatca ttagattcca ccacgactcc 4020 ccggtatcgg gacaccctgg tcgatatcgc acccaggaac tgatcactcg gaactactgg 4080 tggccccgaa tccaagcaga catccgaacc tatatcgacg gatgcaacac gtgtcagcgg 4140 actaaacccc gtcgaaccgt cctggctgcg ccactacagc caaacgaggt cccctctctt 4200 ccatgggaga tagtgtcggt agacctaatc gggcccctcc cagagtcgaa aggatatgat 4260 gccatcatgg tagtggtcga cagattctca aagaagatag aggcaattcc caccaatgtg 4320 gagctctcgt cgcttggcgc agcgaaacac ttccgggact atgtgttcaa acaccacgga 4380 ctcccccgaa aagtaattag cgaccgaggc ccacaatttg tctcaaactt catgaaggat 4440 ctgatgaaac tccttggcat tgaaggaaac ccctctacgg cattccaccc ccagaccgat 4500 ggacagacgg aacggatcaa tcaggagata gaacagtact tacggatctt catcatacca 4560 aagacaggac gactgggccg actggctccc tttggcccaa ttctcctaca acgacaagat 4620 ccaccagtca accgggtaca gtccgttcta cctgaactat ggccaacatc catacaaggg 4680 aggcgaacca aggggcccaa tacggaatca ggctgcagag gactttgtca ccgagattaa 4740 agacatccgc aaagaggccg aagctgcctt gaaacgagcc gccgagacta tgaagcatta 4800 ctatgatcgg aaacggagac cctccaggcc ctactgagta ggcgacatgg tgtacctcga 4860 ggcaaccaac atcaatacgc agcggccagc caagaaactc gatgatcgga gatacggacc 4920 ctttaagatc 
accaagaagg tcggagcttc tgcttacgaa ctcgccatcc ccaagacttc 4980 ggaagaccat acacccggtg tttcaacgaa tcactcctca ccccatacca cgaacccatt 5040 tttccctctc agaagaaacc tccaccacct cctcccgagc tggtggacgg agaagtagag 5100 tatgaagttg aatccatcga cgactctagg ctctacaggg gaaaactgca gtacctggtg 5160 aactggaagg gatatccaaa ggaggagagg acgtgggaag atgcctccaa cctgaagaat 5220 tctcccgaac tcgtagaaaa gtttcaccga gagaatcctt ccgcacctag gcggatcacc 5280 aagaaactcc gattcatacc ctacgagaac ttcactgaac cttccggaag aatcttcgac 5340 tggaccgccg ggaagttcga agcctcctat tcccaaaacc gtcgacccaa agggactgcc 5400 gaaccaatgc cggaaaactt gccaaagtgc cgaaaagggt ggtgccctaa atgcaacgat 5460 gttcacgagg acgtgaacct taaggggggg tga 5493 // ID 5SrRNA_AN repbase; DNA; FNG; 129 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 8.11, Last updated, Version 1) XX DE It is a consensus sequence of 5S RNA genes. XX KW 5SrRNA_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-129 RA Kapitonov V.V. and Jurka J.; RT "5SrRNA_AN consensus."; RL Direct Submission to Repbase Update (NOV-2003). XX DR [1] (Consensus) XX CC It is a consensus sequence of 5S RNA genes. The genome contains CC more CC than 20 dispersed copies of 5S rRNA that are ~99% identical to CC the CC consensus sequence. The genome contains also ~20 pseudogenes of CC 5SrRNA that are less than 75% identical to the 5SrRNA_AN copies. CC Some of them share with 5S rRNA only the ~60-bp 5' end. XX SQ Sequence 129 BP; 29 A; 32 C; 35 G; 33 T; 0 other; caagtacata cgaccatagg gtgtggaaaa cagggcttcc cgtccgctca gccgtactta 60 agccacacgc cggctggtta gtagtatggt gggtgaccac atgcgaatcc cagctgttgt 120 atgtttttt 129 // ID Gypsy-66_MLP-I repbase; DNA; FNG; 5745 BP. XX AC AECX01002913; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-66_MLP_; KW Gypsy-66_MLP-LTR; Gypsy-66_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5745 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002913; Positions 3934 9678. XX CC Positions [4452-4931] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 234..1556 FT /product="Gypsy-66_MLP-I_1p" FT /translation="MSDNRLGFNRTTRRKPGDPIPPVDNPERILFPSRGVP FT APISCTPIRPTSAPPLLPLVESTGSTPLIVQQWDSIVSPPNPNGYPVITAN FT WPLPSDGLSLDQLSEDSLAVDRQVNHPASNLRHPRHSIPGNFDPHSSFISS FT GEETILNMANRQSTSQPPEFDVSAELARIKLDNDGLRSDNAGLRTEISEVR FT SLLQNFLQSQRPQDESREGNVTARGGSSTQQPSPGVGHEATPQIFTSTPAV FT GRQNLFNIPPSLPSVADTPVSVAPAPADLSKFRATDWPQYKGKFGDVAAFH FT MWQYQMETTFRVKQIDRPEDRFRILPMVLANDPASSWCRRSERNFEGKTWE FT SVMLEMQGVVLPVGWDEAAKERLRELAMKGNESVTAYCGRARMIQEEIGVD FT DCSDESLAEAVVGGTSGTFKAWIKMERVVKNSLDPLTKRFSFPVFED" FT CDS 2943..5324 FT /product="Gypsy-66_MLP-I_2p" FT /translation="MIIPKKDPTALPRLICDYRTLNKYTVKDRGPLPNPNE FT AVRLVASGKIYSSLDQINAFFQTRMEEKDIPLTAIKTPWGLYEWVVMPMGL FT TNAPATQQRRCEEALGDLVNKVCVVYLDDIVVFSQTVEEHEYHLREVLKRL FT RAARLYCGIKKTQLFRRQIKFLKHVIGDEGIMADEEKVEKVANWRSPKNPK FT QLKEFLGTVQWMKKFIEGLSKYVSTLTPLTSSKRKPEDFKWTVKEEDTFNN FT IKRMITTLPVLRNLDYDSEEPVWLFTDAIQGHNWETASPVAYDGRTMTGAE FT RNYPVHEQELLAVVHALNKWRLLLLGLKINVMTDHHSLTYLLKQRNLSRRQ FT ARWLEILSQFDLDFKYLQGADNSVADALSRIDTAALTESRPALDAETRQTI FT TEGYKTDPFCIKLAKTLPLRANCRWEDDLMLMDDRVVVPVNGELRTALVKS FT AHQAVGHLGNLKTYHRLKEEFFWPGLSDEVERWVRRCDECQRNKARTTLLP FT GRAQTTDLPKRPMSSIALDFVGPLPKVSGYDMLLSCTCRLTGFVRLIPTNQ FT KDSAEKTARRLYGSWLSLFGAPDEMLGDRDKIWVSKFWKELHCLLGISINI FT 
TTAYHPQADGRSERTNKTLGQILRFSTQGRQGRWLESLPSVEFAINSAVNS FT ATGVSPMYFVLGHQPRLFPIPSSRASSSRDVEKWVMSRQADWARCRDKLWV FT SRVEQAVQYNKRRGGDLKLEVGDQVLVDSTHRSAQVGGKVAKLRARFDGPY FT KVLEVINGGRDVRLRLSANDKSHDIFHISKLKKYQTDELVDNC" XX SQ Sequence 5745 BP; 1413 A; 1326 C; 1469 G; 1537 T; 0 other; cttttttaaa gtacacctcc gagttcgaag gcgtcgatcg attctcgaac tcaatctatt 60 gcgccactat tcaattaaac cttaaaccac tcaattcaat tcaaaatttt tttgaacact 120 cgattctcct tttttttatt atttctatta cctgctttag cctgacacct gaacttaaaa 180 ccagttaccc cgcacttcca gtatcctgcc cagtctcacc tgctgtcctg tctatgtctg 240 ataatcgtct gggttttaat cgaacaacac gtcgaaagcc aggagatcca atacctccag 300 tcgataaccc agaacgcatc ctgttcccat ctcgtggtgt tccggctccg atatcttgca 360 ctcctattag acctacctca gctccgccac tcttaccctt ggtcgaatct acgggttcga 420 cacctttgat agtacaacag tgggattcta ttgttagccc accgaatcct aacggctacc 480 ctgtgatcac ggcgaactgg ccgctaccat ctgacggtct ttccctggat caactttcgg 540 aagattcgct tgcggtcgat aggcaggtta atcatccagc ctcgaatttg cgccatcctc 600 gccacagcat ccctggaaac ttcgacccgc attcctcctt catcagttcg ggggaggaga 660 ccattctgaa tatggcgaat cgacagtcta cttcccaacc tcccgagttt gatgtgagtg 720 cggagttggc tcgaataaaa cttgataacg atggactacg ttcagacaac gctggtcttc 780 gaacagagat tagcgaagtt cgtagcctct tgcagaactt cttgcaaagt cagaggcctc 840 aggacgaatc tcgtgagggt aatgtcactg cccgaggggg atcttctact cagcaaccca 900 gcccgggagt gggtcatgag gcgaccccac aaatattcac ctcgacaccg gcggtaggac 960 gacagaatct tttcaatatt ccgccttcct tgccttcggt cgctgatacc ccggtctctg 1020 tggctccggc tcctgccgat ttgtcgaagt tccgtgctac tgactggccc caatataagg 1080 gcaaatttgg cgatgttgcc gcctttcaca tgtggcaata ccagatggag accaccttcc 1140 gtgtcaaaca aattgatcga ccggaagatc ggtttcgtat actcccgatg gtcctggcaa 1200 atgacccggc ttcgtcttgg tgccggcgat cggaacgcaa ctttgagggt aagacgtggg 1260 agtctgtgat gttggagatg caaggtgtgg tgttgccggt tggttgggat gaagcggcta 1320 aggagcgctt acgtgaatta gccatgaagg ggaatgagtc ggtaacagct tattgtggac 1380 gggctaggat gattcaagag gaaattgggg ttgatgactg tagtgatgag agtttggctg 1440 aggcggtagt gggagggaca tcagggacgt 
ttaaggcgtg gatcaagatg gagcgcgtcg 1500 tgaagaacag cctggaccct ctaacgaagc gcttctcgtt ccctgtcttc gaggattgac 1560 tcggttccat ttggctcctt gcccaatcaa ttgacgggcg gaacactggc cgatcacaat 1620 cgacggcggg accctcctcg actgcagcgg gtactacacc agttgttcat cctagatcga 1680 ctcactcggt taatctggcg gcttcgaacg ccaaccgtcc tcctctgtcg cctgaagagc 1740 ttcaagcccg taatgtccgt tttggcgctt acatgcgatc catcggctta tgtcctcgct 1800 gcaaaacctc gtgcaacaag tggttgggag gttgcgaagc gagacctagt gctacgtttt 1860 tttccgttcc tatggagttc ccgcgagcac caccctatcc atccgccaag ggtggcgcac 1920 ctatcccacc atctaaacct gcagtgccga caggtggtgc gagaccatct cgtcgtgtgg 1980 acgtggcgac tgtggaaacg ggtcaaagtt cggtggatgt tgcggcgacg ggtaactttc 2040 ctgacctggg acgtgcagat ctagctgctt acgagcaatt gatcaaccac ttgaacgggc 2100 cagccgatga cgatgttgtg gaggcggctg agtacgaagc ttctcctgat tgatctatgt 2160 atggcaagac cctgattatg agctggcttt tgaacggggt accccttcgt gttctaattg 2220 ataccggtgc agggacgaac ctcctctcgg aacacgcggt tgaccagcta cacctcattc 2280 gacggccttt gccggtacca atcacggtac gtcctgcgat cctttcagaa ccgacacctt 2340 ttgttttgaa ggagttcacc tttgctcacg tcaaggctcc tcttccctcc tttacgtttg 2400 gggcgacagc tttcaagata gcgccgcttg ggggagaata tgatgtgatt ttaggggcgc 2460 ctttcctttc taaacatcat cttactgtat ccttgtcccg acggctgctg cgcaatgaaa 2520 agaacagtta cgaatttttt gaacagtgtg tgattgatga gaagagaaaa actgaggagt 2580 tgaaaaagaa gagagatgta ttggtgaata cagtgttgca gaatttggag aaagtgaatg 2640 aagtacatga atttagcatg aaagaagtag ctatgttaaa agaatttgag gaccttttcc 2700 ctgatgagtt gccagcggtg tatgaaatgg atgatgatgc tgaagaaatt ttcccaactg 2760 agttgcaagc cgaatctagt aggattcgtc atcgtatcat tctcacccag cctgatattc 2820 aaattaacga caaacaatac gggtacccac ggaaacacct tgacgcgtgg aaaaaactga 2880 tcaatcagca tatcgatgcc ggttgacttc ggaaatcttc gagtccttat gcgtcaccat 2940 cgatgatcat accgaagaaa gacccgacag cccttcctcg cctcatctgt gattacagga 3000 ccctgaacaa atacactgtg aaagaccggg gacccttacc gaatcccaat gaagcggtga 3060 ggcttgtcgc atcgggtaag atctactcgt ctctggatca aatcaacgcg ttttttcaaa 3120 caaggatgga ggaaaaagac ataccactga ctgccattaa 
gactccgtgg ggtctgtacg 3180 agtgggttgt gatgcctatg gggctaacta atgcgccggc gacacaacaa cgtcgttgcg 3240 aggaggcctt gggggatttg gtgaataagg tttgtgtcgt gtatctggat gatatcgtgg 3300 tcttttctca gacggtggag gagcatgagt atcacttacg agaagtcttg aaacgtcttc 3360 gagcagctag actctactgt ggcattaaga agacccaact tttccgccga cagatcaaat 3420 ttctcaaaca tgtgattgga gacgaaggca tcatggctga tgaggaaaaa gtagagaagg 3480 tggcgaactg gcggtctcct aagaatccaa agcaactcaa agaattctta gggaccgtgc 3540 agtggatgaa gaagtttatc gaaggattat ctaaatatgt tagcacattg acgccgttga 3600 ctagttctaa gcgcaaacca gaggacttca agtggacagt aaaagaagag gacactttta 3660 acaacatcaa acgcatgatc accaccttgc cggtcctccg aaacttagat tacgactctg 3720 aggagccggt ttggcttttc accgatgcta ttcaaggtca caattgggaa accgcttccc 3780 cggttgctta tgatggtcga accatgacgg gcgcagaacg taattaccct gtacatgagc 3840 aggagttatt ggctgtcgtg cacgctctta ataagtggcg acttctcctt ctaggtttaa 3900 agattaatgt aatgacggat catcattctt taacatattt gctgaaacaa cgaaatctca 3960 gtcgcagaca agcacgctgg ttagaaatat tatcacaatt tgatctagat tttaaatatc 4020 tacaaggtgc agacaattca gttgcggatg cgctatcgcg cattgataca gcggccttga 4080 ctgaatctcg acccgcgctg gatgctgaaa ctcgtcaaac tattacggag gggtataaga 4140 cggacccatt ttgtatcaag ctagcaaaaa ccttaccact ccgggcgaac tgtagatggg 4200 aagatgactt gatgttgatg gatgataggg ttgtggtccc agtgaacggg gagttgagga 4260 ctgctttagt gaaatcggca catcaagcag taggccattt aggaaacctc aagacttacc 4320 atcgtttgaa agaagagttt ttctggccgg gattgagtga tgaggttgag aggtgggtca 4380 ggcgttgtga tgagtgtcaa aggaacaaag cgcgaactac tcttttgcca ggacgagcgc 4440 agaccacgga tctacccaaa cggccgatga gtagtatagc gctggacttt gtgggacctc 4500 tacctaaggt ttcaggttac gatatgctac tgtcttgtac ctgtcgactt accgggtttg 4560 tacgcttgat acctacgaat cagaaggact cggcggaaaa gacggctcgt cgcctttacg 4620 gttcttggtt atcgcttttt ggagctccgg atgagatgct gggtgaccgt gacaagatat 4680 gggtctctaa gttctggaaa gaactccatt gcctgttggg tattagtatc aatataacca 4740 ccgcgtatca tcctcaagcg gacggccgtt ctgagcgtac gaacaaaacc ttgggtcaaa 4800 ttctacgatt ttcaactcaa ggccgtcagg gacgttggtt agagtcctta 
ccttcagtcg 4860 aatttgccat caattcggca gtcaattcgg cgacgggggt ttcgcccatg tatttcgtac 4920 taggtcatca gccgcgactt tttccaatcc cgtcatcgag agcatcgagc agtcgggacg 4980 ttgagaagtg ggtgatgtct cgacaggctg actgggcccg ttgtcgagat aagctttggg 5040 tctcgagggt tgaacaggcg gtacagtaca acaaaagaag ggggggcgac ttgaagttgg 5100 aagtagggga tcaggtgttg gtagacagca cacatcgaag cgcgcaagtg ggggggaagg 5160 tggcaaagtt gagagcgaga tttgatggtc cgtacaaggt gctggaggtc attaatggag 5220 ggagagatgt ccgattacga ctttcggcca acgataaatc tcacgacatc ttccatatct 5280 cgaagctgaa gaagtatcag acagatgagt tggtggataa ctgttgagtg gggcgaagga 5340 ctaccctgtg caagtaagtt ccttccttaa gtatgcaccg ccgcggggtt ctaccacctc 5400 ccattttcgc aacaacacct tggccacgcc tgtgagcaca taatcttcga ccttgctctt 5460 ttcaggacgt gacgcggcgc ggggattcga cgattcttca aatgccagga tttacggatt 5520 tagttatcaa tggtttcctt ttctttcctt tcttttattt ctgtttcaat ttttaatttt 5580 ttattttctg tttcaatttt aatttctttt atctcacgtt aatttagttt cttcttttga 5640 ttatttcttc aattttgttt tgttctatgt tctgttttgt ttacttttga atttgggaat 5700 agtggaggaa gtctttaggg ggacttttgt tttagttggg ggggg 5745 // ID Gypsy-10_RO-LTR repbase; DNA; FNG; 487 BP. XX AC AACW02000074; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-10_RO_; KW Gypsy-10_RO-I; Gypsy-10_RO-LTR. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-487 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000074; Positions 350794 351280. 
XX SQ Sequence 487 BP; 154 A; 74 C; 86 G; 173 T; 0 other; tgaattggta tagtaggtgg agtgcgcctt gggatagatg ttgccgacgt catcacgttg 60 ctgagtgcag aggtgtgctt ataagttgtg ctgaatggaa gattgatgag aagggggcgt 120 ttttatcctt aactttattt tatataaact taaaccttaa ttccattctt ggattattgt 180 aacaagcaaa tacttattgt ctcgatcctt tgtgttcacg ttcattaaat ttatatataa 240 tctttattat atattttaac tgaacaatcc ctaattacta cttttattct attactactc 300 gtgaagaagt ttgctgactt aatctgaagg aagagattat ttttgacgaa tatactacaa 360 taatatactc agaaggaatt ggtggaatta ttgctatatt caaatcaatc actacgtgca 420 gcagtagagc aacaacaata catatcgtaa tcagtaatat tataacacct aagtaccaat 480 tgtctca 487 // ID Gypsy-3_AM-I repbase; DNA; FNG; 5655 BP. XX AC ACDU01007691; XX DT 07-FEB-2011 (Rel. 16.02, Created) DT 07-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Allomyces macrogynus genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_AM_; KW Gypsy-3_AM-LTR; Gypsy-3_AM-I. XX OS Allomyces macrogynus OC Eukaryota; Fungi; Blastocladiomycota; Blastocladiomycetes; OC Blastocladiales; Blastocladiaceae; Allomyces. XX RN [1] RP 1-5655 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Allomyces macrogynus genome."; RL Direct Submission to RU (07-FEB-2011). XX DR Genome; ACDU01007691; Positions 31903 37557. XX CC Positions [4379-4861] - Integrase core CC 'CCAGT' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 390..1655 FT /product="Gypsy-3_AM-I_2p" FT /translation="MTGEPATPATPLEQLAEVRAVVDQHLPAMMANDNALA FT TRTVALEGSVSEVKTAVAEQGAKLDALMVILQKLSNNSAPLPAAPSPDAVP FT VAAVASAASTAPAMAVAGAAASPVGDETAYGGNEKIPPPKFDGTGTYEAVR FT AWFSDVGKWVAFMRARKHSEMAIVLLLDRAFTGEAETYRVLQNATGQWPTT FT PEGIANTVSERFRPKTAALTKRRELQQFRFQSGGTMEQHVAAFAALAVTVE FT QASAAELYSCFLASVPTNVMTLLGDGRLNPPSEDQNLRWSFMEQVYTRAID FT IGAFSSFRTGSTCWRCMIAYRTTVCRSTLCKGAVAMVIAARVVVAVARMAV FT GVVVAAVAVTTARTTSTRSKRAAVIGAMRWAICVTIARAGRTRMSRFSIIN FT APAVLIALLALVRSSLLVAQVQILFIRT" FT CDS 1301..5572 FT /product="Gypsy-3_AM-I_1p" FT /translation="MLAVHDRVQDDRMQVNAVQGRRGHGNRGQGRSRRGQN FT GGRGGRGGRGRDDRQDDEYEVETRRCYRCDEVGHLRHNCPRRPNANVAVFN FT HQCTCRFDCSSRSGQVVPVGGTSPNFVHTDVNDGGTTQLLHVAVDVDVDDA FT REPLIVSPKDTADRRVDLGQGEERDDAVVEEDLEILAFGTDQPYAYDAVLA FT VSEEPRKGRTWSKKCMGERHRKKLLQEHDAVVMVDSGASHNVARRAVVERV FT GARIVRDGWLRVSGFDGVSRRVRREWAELRVMVAGAATTARCLVLDNVAWE FT VILGRPWLRECDPQLDWSSGALSLDGGATRVQPVAGGPSHIEVDLLSAEEC FT AAAMRDDPANVFMVSLNHIEVEGVGYTGTVVKQLVDEFPTVFAEPTGMPPK FT RDVDHRIELKEGARPVAQRAYQASPKELEELRKQLDELLRRGLIRPSKGSP FT FASPVLFVPKKDGGVRVCVDYRALNKITVRDTYPLPRAETLFRMLRGARYY FT TKLDLVSGYHQVRMDESDVHKTAFVTRYGTFEWRVLSFGLCNAPSTFMRAM FT NQVFEPYLDRFVVVFLDDILIYSKTKAEHERHVRQVLEVLRKNEFHVKPSK FT CEFFKERVEYLGHFVDRDGVHVDPAKVAAVRDWPVPKDQKELRGFIGLANY FT YARFVNGFAELAHPLTDLLADDVTYEWSAAADAAFRALKDALSSVPVLVLP FT DEDGDFEITTDASGFAVGAVLQQNGRPIEYFSRKLKPPERAYPVHDRELLA FT IVAALNRWRTYVHGKSVRCNSDHRTLEYFATQRELNGRQVRWLSSLAEYDL FT EIKYEKGERMPHADALSRRPDLRPDGSLLADALYTPLARGANGALRVAAVP FT LYKGEDGKFAVLQVRRSHKHDETMAIAVEGEHAQEPAEDLVQGVRPEVEPD FT MMRRVVAATAADRRLHAMLEQPPAGYVADDGILFKFVEVSGEARRVPLIPN FT DRELRTLWLSEAHDTPLAGHMGARRTLDLLRRWCYWRGMAADVEAYVRTCV FT SCQANKPKIGKTDGLLRPLPVPDRPWGRINIDLIGKLPDGEGGKDAIVTLV FT CALTKRLRVIPTVMTTDAEGLARLYIRYVLPDFGLPDCIVSDRDPRFASAF FT WEALWRRLGTRLAMSTAYHPQSDGQAEVANRTVQSMLRHFVNAQRTNWPTY FT LPFVEFAYNNSRAEPTGKSPFELWGVVPRGPVEIMLGNNIPVPQLERMAEQ FT LDAATRDARVSLDRSRAAMVMQVNQHRKHVEYLVGDRVWLSTKNIAARTAS FT 
GTELTSRKLLPKFIGPFNVEAVVGGGRACRLGLPAGYKVHPTFHVELLKPY FT RELDLARFPGRPYAGNDGTDDEHGWAPATADEPECVLREGMTGRRGEGDTL FT YLIKWQQRAIKESTWERAEDLREDADAVVKWREYRAQTPGVQVLSGVKAGK FT PTATVLRSLQ" XX SQ Sequence 5655 BP; 948 A; 1667 C; 1990 G; 1050 T; 0 other; actggtagcg tctgcaggtt cgttccttca agtgacggac tggcgttgga agtgtgcgtt 60 cgggcgctgg gcttccgcgg tttcgtttgc gctggttcct gtttggcgcg gtttctcgtt 120 ggcgctggtt cccatttggc gcggtttctc gtttgcgccg gttctcgttg gcgcagtgtc 180 tcgttgcgcc ggttctcgtt tggcgcggtg tctcagttgc gctggttctc atttggcgcg 240 gtgtctcgtt tgcgcctgtt ctcgatagcg cattttcgcg gttgcgccgg ttctcgttgg 300 cgcggtttac ttgttgtgca aattcgttgc gggttcctgc ccatcgagtt gttttctctc 360 gtctcgcgtt tttcatcacg gttggcgtca tgacgggcga accggcgaca cctgcgacgc 420 ctcttgaaca gctggccgag gttcgggcgg tggtcgacca gcacctgccc gcaatgatgg 480 cgaatgataa tgccctcgcg acgcggacgg tggccctcga ggggtcggtt tcggaggtca 540 agacggccgt ggccgaacaa ggcgcaaaac tcgatgccct catggtcatt ttgcagaagt 600 tgtcgaacaa ttccgcgccg ctgcctgcgg cgccgtcccc agatgcggtt cctgttgcgg 660 cagttgcctc tgctgcgtcg acggctcctg ccatggctgt cgctggtgcc gccgcgagcc 720 cggtgggaga cgagacggcg tatgggggca acgaaaaaat tccgccaccc aagtttgacg 780 gcacaggcac gtatgaggcg gtccgcgcgt ggttcagcga cgtgggtaag tgggtcgcct 840 tcatgcgcgc gcgcaagcac agcgagatgg ccattgttct cttgctcgac cgcgcgttca 900 cgggcgaggc ggagacctac cgcgtgctgc agaacgcgac gggccagtgg cccacaacgc 960 ccgaaggtat tgcgaacacc gtttcggagc gtttccggcc caagacggcg gccctgacca 1020 agcgtcgtga actgcagcag ttccgtttcc agtcgggtgg gaccatggag cagcatgtgg 1080 ccgcgtttgc ggccctggca gtgactgtcg agcaggcgtc tgccgccgag ctttactcgt 1140 gtttcttggc gtcggtgccg acgaacgtca tgacgctgct cggcgacggt cgcttgaacc 1200 caccgtcgga ggaccagaac ctgcgttggt cgttcatgga gcaggtctac acccgggcca 1260 tcgacattgg ggcgttctcc agctttcgaa cgggttcaac atgctggcgg tgcatgatcg 1320 cgtacaggac gaccgtatgc aggtcaacgc tgtgcaaggg cgccgtggcc atggtaatcg 1380 cggccagggt cgtagtcgcc gtggccagaa tggcggtcgg ggtggtcgtg gcggccgtgg 1440 ccgtgacgac cgccaggacg acgagtacga ggtcgaaacg cgccgctgtt atcggtgcga 1500 
tgaggtgggc catttgcgtc acaattgccc gcgccggccg aacgcgaatg tcgcggtttt 1560 caatcatcaa tgcacctgcc gttttgattg ctcttctcgc tctggtcagg tcgtccctgt 1620 tggtggcaca agtccaaatt ttgttcatac ggacgtgaac gatggcggca cgacgcagct 1680 tttgcacgtt gcagttgacg tcgacgtgga cgacgcacgg gagccgctca tcgtttcgcc 1740 gaaggacacc gccgacaggc gcgtcgactt gggccagggg gaggagcgtg acgatgcggt 1800 ggtcgaggag gatcttgaaa tcttggcgtt tggcactgac cagccgtacg cgtacgatgc 1860 ggtgctcgca gtgagcgagg agccgcgcaa gggccgcact tggtcgaaga agtgcatggg 1920 cgagcggcac aggaaaaagt tgctgcagga gcacgacgcg gtggtcatgg tcgactcggg 1980 cgcctcgcac aacgtcgcgc gacgtgccgt cgtggagcgc gtgggcgcgc gaatcgtgcg 2040 cgacgggtgg ttgcgcgtct cgggtttcga cggcgtctcg cgacgcgtgc gacgcgaatg 2100 ggcggagctg cgtgtcatgg tcgcgggcgc cgccacgacc gcgcggtgcc tggtgctcga 2160 caacgtcgca tgggaagtga ttttgggccg gccgtggttg cgcgagtgtg acccgcagct 2220 cgactggtcg tcaggcgcgc tgtcgcttga cggcggcgcg acacgcgtgc agccggtggc 2280 gggcgggcca tcccacatcg aggtcgactt gctttctgcc gaagagtgcg cggcggcgat 2340 gcgcgacgac ccggcaaacg tgttcatggt ctcgctcaac cacatcgagg tcgagggcgt 2400 tggctacacg ggcacggtcg tgaagcagtt agtcgacgag ttcccaaccg tgttcgccga 2460 gccaacgggc atgccgccga agcgcgatgt cgaccaccgg atcgagctca aggagggcgc 2520 gcggccagtc gcacagcgcg cataccaggc ctcgcccaag gagctcgagg aactgcgcaa 2580 gcagctagac gagctgttgc gacgcgggct gatccggccg agcaagggct cgccgtttgc 2640 gtcgccggtc ctgttcgtgc caaagaaaga cggaggcgtc cgggtgtgcg tggattatcg 2700 cgcactgaac aagatcacgg tacgcgatac atacccgctg ccgcgcgccg agaccctctt 2760 ccggatgctg cgtggcgcgc gctactacac gaagctcgac cttgtcagcg ggtatcacca 2820 agtgcgcatg gacgagtcgg acgtgcacaa gacggcgttc gtgacgcggt acgggacgtt 2880 cgagtggcgc gtgctcagct tcggcttgtg caacgcacca tcgacgttca tgcgcgccat 2940 gaaccaggtg tttgagccgt acctcgaccg gttcgtcgtc gtgttcctcg acgacatctt 3000 gatctactcg aagaccaagg ctgagcatga gcgccacgtt cggcaggtgc tcgaggtgct 3060 acgcaagaac gagttccatg tcaagccgag caagtgcgag tttttcaagg agcgggtcga 3120 gtaccttggt catttcgtgg accgcgacgg cgtgcatgtc gacccagcca aggtcgcggc 3180 agtgcgcgat 
tggcccgtgc ccaaggacca gaaggagttg cgcgggttca tcgggctcgc 3240 caactactac gcgcgatttg tcaatggatt cgccgagctc gcgcacccgc tgacggactt 3300 gctcgccgac gacgtcacgt acgaatggtc agccgcggcc gacgcagcgt tccgcgcgct 3360 caaggacgcg ctgtcgtcgg tgccggtgct ggtgctgcct gacgaggacg gcgatttcga 3420 gatcacgacc gacgcgagcg ggttcgccgt cggcgcggtg ctacagcaga atgggcgacc 3480 gatcgaatat ttcagtcgca agttgaagcc accggaacgc gcgtacccgg tccacgaccg 3540 cgagttactg gcaatcgtcg ccgcgttgaa tcggtggcgc acgtacgtgc acggcaagtc 3600 ggtgcggtgc aactcggacc accgcacgct tgagtacttt gcgacgcagc gcgagctcaa 3660 tggacggcaa gtgcggtggc tgtcgtcgct cgccgagtac gatctggaaa tcaagtacga 3720 gaagggcgag cgcatgccgc acgctgacgc gctgtcgcga cggccggact tgcgaccgga 3780 cgggtcgctg cttgccgacg cgctgtacac accactcgcg cgcggtgcca acggtgcgtt 3840 gcgcgtcgcg gcggtgccat tatacaaagg cgaggacggg aaattcgcgg tactgcaggt 3900 acggcggtcg cacaagcacg acgagaccat ggcgattgcg gtcgagggcg agcacgcgca 3960 agagcccgcc gaggatctcg tgcaaggggt tcgaccagaa gtcgagccag acatgatgcg 4020 gcgcgtcgtg gccgcgacgg cggcggatcg acggttgcac gcgatgctcg agcagccgcc 4080 cgcggggtac gtcgccgacg acgggatcct gttcaagttc gtggaggtca gtggcgaggc 4140 gcggcgggtg ccgctcatcc caaacgaccg cgagctgcgc accttgtggc tgtccgaggc 4200 ccacgacacg ccgctcgccg ggcacatggg cgcgcgtcgc acgctcgacc tgctgcgtcg 4260 gtggtgctac tggcgtggca tggcggccga cgtcgaggcc tatgtgcgga cgtgcgtgag 4320 ctgccaggcc aacaagccca agatcggcaa gaccgacggc ctgctgcgac cgcttcctgt 4380 accggatcgg ccgtggggcc gaatcaacat cgacctgatc ggcaagcttc cagacgggga 4440 gggcggaaaa gacgccatcg tcacgctcgt gtgcgcgttg acgaagcggt tgcgcgtgat 4500 cccgacggtc atgacgacgg acgccgaagg gcttgcgcgc ttgtacattc ggtacgtctt 4560 gcccgatttt ggcttgcccg actgcatcgt gagcgaccgt gacccgcggt tcgcgtctgc 4620 gttttgggaa gcgctttggc ggcggttggg cacgcggtta gcgatgtcga cagcgtatca 4680 tccacagtct gacggccagg ccgaagtcgc gaatcgcacg gtgcagtcga tgctgcggca 4740 tttcgtcaac gcgcagcgca ccaactggcc gacatacctg ccgtttgtcg agttcgccta 4800 caacaactcg cgcgccgagc cgacgggcaa gtcgccattc gagctgtggg gcgttgtgcc 4860 acgcgggccg gtcgagatca 
tgctcggcaa caatatcccg gtaccgcagc tcgagcgcat 4920 ggcggagcag ctcgacgcag cgacgcgaga tgctcgggtg agcctggatc gatcgcgcgc 4980 ggccatggtg atgcaagtca accagcaccg caagcacgtc gagtacctcg tcggcgaccg 5040 cgtgtggctc tcgaccaaga acattgcggc gcgcacagca tcaggcacgg agctgacgtc 5100 gcgtaagtta ctgcccaagt tcattggccc gtttaacgtc gaggccgtgg tcggtggcgg 5160 tcgcgcgtgc aggctcggat tgccagccgg gtacaaggtg cacccgacgt tccacgtcga 5220 gctgctgaaa ccgtatcgcg agctggactt ggcacgattt ccgggtcggc cgtacgcggg 5280 caacgacggc accgacgacg agcacggatg ggcaccggcg acggccgacg agcccgagtg 5340 cgtgctgcgc gaaggcatga cgggccggcg cggcgagggc gacacgctgt acttgatcaa 5400 gtggcagcag cgcgcgatca aggagtcgac gtgggaacgc gccgaggact tgcgcgagga 5460 cgcggacgcg gtcgtcaagt ggcgcgagta tcgggcgcag acgccgggcg tgcaggtgct 5520 gtcgggcgtc aaggccggca agccgacggc gacggtgttg cggtcgctgc agtagtggtt 5580 ggtggtgttg tggttgagtg gaggatcgat cgtggatccg acgcgtcgcg cgcctcggac 5640 gggcagggag ggagt 5655 // ID YPRIME repbase; DNA; FNG; 3187 BP. XX AC M58721; XX DT 24-SEP-2001 (Rel. 6.08, Created) DT 15-APR-2009 (Rel. 14.05, Last updated, Version 2) XX DE S.cerevisiae Y' element. XX KW Y' element; YSCYELR; YPRIME. XX NM YPRIME. XX OS Saccharomyces cerevisiae OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Saccharomyces. XX RN [1] RP 1-3187 RA Louis J.E. and Haber E.J.; RT "The structure and evolution of subtelomeric Y' repeats in RT saccharomyces cerevisiae."; RL Genetics 131(3), 559-574 (1992). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1128..3185 FT /product="YPRIME_1p" FT /note="Helicase." 
FT /translation="MQICALGNSYDAFNHDPWMDVVGFEDPNQVTNRDISR FT IVLYSYMFLNTAKGCLVEYATFRQYMRELPKNAPQKLNFREMRQGLIALGR FT HCVGSRFETDLYESATSELMANHSVQTGRNIYGVDSFSLTSVSGTTATLLQ FT ERASERWIQWLGLESDYHCSFSSTRNAEDVVAGEAASSNHHQKISRVTRKR FT PREPKSTNDILVAGQKLFGSSFEFRDLHQLRLCYEIYMADTPSVAVQAPPG FT YGKTELFHLPLIALASKGDVEYVSFLFVPYTVLLANCMIRLGRRGCLNVAP FT VRNFIEEGYDGVTDLYVGIYDDLASTNFTDRIAAWENIVECTFRTNNVKLG FT YLIVDEFHNFETEVYRQSQFGGITNLDFDAFEKAIFLSGTAPEAVADAALQ FT RIGLTGLAKKSMDINELKRSEDLSRGLSSYPTRMFNLIKEKSEVPLGHVHK FT IRKKVESQPEEALKLLLALFESEPESKAIVVASTTNEVEELACSWRKYFRV FT VWIHGKLGAAEKVSRTKEFVTDGSMQVLIGTKLVTEGIDIKQLMMVIMLDN FT RLNIIELIQGVGRLRDGGLCYLLSRKNSWAARNRKGELPPIKEGCITEQVR FT EFYGLESKKGKKGQHVGCCGSRTDLSADTVELIERMDRLAEKQATASMSIV FT ALPSSFQESNSSDRCRKYCSSDEDSDTCIHGSANAST" XX SQ Sequence 3187 BP; 901 A; 636 C; 811 G; 839 T; 0 other; aatttggcac cctacatgtt tttgttacta cacgtagatg agctatcgat tttttctgca 60 taccaagcaa gtttacctgg cgaaaagaaa gtcgacacag agcggctgaa gcgtgatcta 120 tgcccacgta aacccattga gataaagtac ttttcacaga tatgtaacga tatgatgaac 180 aaaaaggacc gattgggtga tattttgcat attatcttgc gagcatgtgc actcaatttc 240 ggggcgggtc cccgtggtgg cgctggtgac gaagaggatc gatctattac gaatgaagaa 300 cccattattc cctctgtgga cgagcatggc ctgaaagtat gtaagttgcg cagtcctaac 360 actccacgaa gactcagaaa aacactagat gccgtgaaag ctttattggt gtcgtcttgt 420 gcttgtaccg caaggattta gatatatttg atgacaacaa cggcgttgca atgtggaaat 480 ggatcaaaat tctgtaccac gaagtagcgc aggaaaccac gctgaaggac tcttatagaa 540 taactttggt accttcttct gatggtatat cagtatgtgg aaaacttttt taatcgcgag 600 tatgtccgcg gcttttactt tgcatgcaag gctcagtttg ataacctttg gggagagttg 660 aacaactgct tttatatgcc tacagtggtt gatattgcca acctcatttt gcgtaatcga 720 gaagttttgt tcagagaacc gaagcgagga attgacgagt atctggaaaa cgattctttt 780 cttcaaatga tacctgttaa atatcgtgaa attgtgctgc ccaagttgag aagagatact 840 aacaaaatga ccgcggctct taaaaataaa gtcgctgttg caattgacga gcttacggtg 900 ccacttatgt ggatgatcca ttttgccgta gataccctta ccgttatcca gagcttcagc 960 tactcgcttt tgccggtcct cagcgcaacg tatacgtcga tgatacaaca 
agacgcatcc 1020 aactgtacac tgattacaac aagaacggtt catcggagcc tcgactaaag acgcttgacg 1080 gactcacctc agattacgtg ttttattttg tcactgtgct aaggcaaatg caaatatgtg 1140 cgcttggtaa cagttatgac gcttttaatc atgatccttg gatggatgtg gtgggatttg 1200 aggatccaaa tcaagtaaca aatcgagaca tttcgaggat agttttgtat tcctacatgt 1260 ttctgaatac cgcgaagggc tgtctggttg aatacgcaac ttttcggcag tacatgaggg 1320 aacttccgaa gaatgcacct cagaagctga attttcggga gatgcgtcag gggttgattg 1380 ccctaggacg gcactgcgta ggtagcagat ttgaaacaga tttgtacgag tcggcgacga 1440 gtgaactcat ggccaatcat tccgttcaaa cagggcgaaa tatttacggt gtggattcct 1500 tttcgttaac tagtgtcagt gggacgaccg ccactttatt gcaggaacga gcttccgagc 1560 gctggattca gtggttaggc cttgaaagcg actaccattg ttcattctcc agtactcgga 1620 atgcggaaga cgtagtggca ggtgaggcgg cgagttcaaa tcatcatcaa aaaatttcaa 1680 gagtaacgcg aaaaaggccc cgagagccca agagtacaaa cgatatcctc gtcgcaggcc 1740 agaaactctt tggcagctcc tttgaattca gggacttgca tcagttgcgc ttatgttatg 1800 aaatatacat ggcagacaca ccctctgtgg cagtacaggc cccaccgggc tatggtaaga 1860 cggagttatt tcatctcccc ttgatagcac tggcatctaa gggcgacgtg gaatatgtgt 1920 cgtttctgtt tgtaccgtac acagtgttgc ttgctaattg catgatcagg ttgggccgac 1980 gcggttgctt gaatgtggcc cctgtaagaa actttattga agaaggttac gatggcgtta 2040 ctgatttata cgtggggatc tacgatgatc ttgctagcac taatttcaca gacaggatag 2100 ctgcgtggga gaatattgtt gagtgcacct ttaggaccaa caacgtaaaa ttgggttacc 2160 tcattgtaga tgagtttcac aactttgaaa cggaggtcta ccggcagtcg caatttgggg 2220 gcataactaa ccttgatttt gacgcttttg agaaagcaat ctttttgagc ggcacagccc 2280 ctgaggctgt agctgatgct gcgttgcagc gtattgggct tacgggactg gccaagaaat 2340 cgatggacat caacgagctc aaacggtcgg aagatctcag cagaggtcta tccagctatc 2400 caacacggat gtttaatcta atcaaggaga aatccgaggt gcctttaggg catgttcata 2460 aaattcggaa gaaagtggaa tcacagcccg aagaagcact gaagcttctt ttagccctct 2520 ttgaaagtga accagagtcg aaggccattg tagttgcaag cacaaccaac gaagtggaag 2580 aattggcctg ctcttggaga aagtatttta gggtggtatg gatacacggg aagctgggtg 2640 ctgcagaaaa ggtgtctcgc acaaaggagt ttgtcactga cggtagcatg caagttctca 
2700 tcggaacgaa attagtgact gaaggaattg acattaagca attgatgatg gtgatcatgc 2760 ttgataatag acttaatatt attgagctca ttcaaggtgt agggagacta agagatgggg 2820 gcctctgtta tctattatct agaaaaaaca gttgggcggc aaggaatcgt aagggtgaat 2880 taccaccaat taaggaaggc tgtataaccg aacaggtacg cgagttctat ggacttgaat 2940 caaagaaagg aaaaaagggc cagcatgttg gatgctgtgg ctccaggaca gacctgtctg 3000 ctgacacagt ggaactgata gaaagaatgg acagattggc tgaaaaacag gcgacagctt 3060 ccatgtcgat cgttgcgtta ccgtctagct tccaggagag caatagcagt gacaggtgca 3120 gaaagtattg cagcagtgat gaggacagcg acacgtgcat tcatggtagt gctaatgcca 3180 gtaccaa 3187 // ID Gypsy-73_MLP-LTR repbase; DNA; FNG; 148 BP. XX AC AECX01001139; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-73_MLP_; KW Gypsy-73_MLP-I; Gypsy-73_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-148 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001139; Positions 8977 9124. XX SQ Sequence 148 BP; 33 A; 42 C; 22 G; 51 T; 0 other; tgttatgacc tgtcacgcgt gttacatgta gagttcatac tagtcttgtc tcccgttgtg 60 catattgctt ttcctcactt agcaatctac tatattcacg agaccaccct taaagcttct 120 cttgcatctc cctctcgaag ctacaaca 148 // ID Gypsy-3_TMe-I repbase; DNA; FNG; 7878 BP. XX AC CABJ01003414; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Perigord black truffle genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_TMe_; KW Gypsy-3_TMe-LTR; Gypsy-3_TMe-I. 
XX OS Tuber melanosporum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Pezizomycetes; Pezizales; Tuberaceae; Tuber. XX RN [1] RP 1-7878 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Perigord black truffle genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; CABJ01003414; Positions 145278 137401. XX CC Positions [6558-6920] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(3306..4169,4173..5168) FT /product="Gypsy-3_TMe-I_2p" FT /translation="MRTPRQGVKTASSGLPNIPKGNKVCLRRPKWSKLRRR FT GNKHPSCYHTSDMSKRQNDGSTPHPKKRAFSAPKPTRYNCPEGHAFIHIIN FT PKRQHYQIRILLDSGSNVFLINKELVKELRIPFTERPNPVNIQGFDGVSTL FT SGGKFYTYPIWLEIGRNKHMSEVACEIAAAGDYGMIIPFGWWHSEHPLSNI FT GEPAKWEFKNSKCKNCVEEEDEGIADMFEYDETVAWDENAIQVGRIGTIED FT PNEINFDSVPKHYWKFKKLFLPQTAAKLAPRRTFDHEIKLVEEAKPSGPIY FT PLSDKQLKLLCEYLDQMLREGKITKSQSPFGAPILFTPKADGRLQLVVDYR FT NLNKITILNKYLLPLMSELRDRVAGASIFSKIDLKDGYNLIRIKKGDEWKT FT AFRSRYGHYEYKVMPFGLANALATFQDMMNEVLREFLDQGIVCYLDDILIY FT SKNEKEHIELVTKVLQRLTDFQFAISIKKSFFHVKEVEFLGYMVAVDGVSM FT STRKVDSVLNWRSPTSVKEVQIFQEFANFYRRFIENFSKICAPITETLKGD FT SKKFQWGPEQEKAFQELKRRFTSAPILHHFFPERETVVETDTSDFALGCIL FT SQFVEKRLHPVAFHSRKLNSAE" XX SQ Sequence 7878 BP; 2611 A; 1757 C; 1795 G; 1715 T; 0 other; gtatattgtt agattacctt gttaccaagg ctctctaatc ggttaccact tgcatgtgtt 60 cgattcacac tttcagtgaa gtacttaata ctacggtcgg cctagacacc attggagaac 120 tatcctgaac caattcaatt tttgaaccac cgactaagat tcacttacca tctattggtc 180 aagtgttttc tagccagttc gagctttcag aagtgccatt tctgttacaa gaagctcctt 240 cgtggtatac cgaccccaac ctagttccta cggaggaaca attccaaaaa atatacgaaa 300 cctggtacgg accagaattc agttggaaac ccttccttga ctgggatacc agcgaggaaa 360 gttctcttag cagcagattc tctgctacta tacctctagc atggagaccc ggaactcatc 420 cgaactctca ttaaaattta tctcctccta ccgcacgacg cggccaggac accaagtgtc 480 gcgccgcata ttctacttaa acggaaacgt cccaaaacgc gggcttcata gcccgcctac 540 gagaaaaaga agctcgcaaa cgagcaagag agatggctca acctcctaag cagatcctcc 600 caggagaacc aatggatacc 
tcaggagatg atcactctac aatctttagc cattcttcct 660 cacttcctcc cgctcttcgg aaacctattg gaggacaaga gggcagggct cacagtgtcc 720 ctccatcctc aagagcccct tctcctgctt cacgatctac cacaccacat caaaataccc 780 atgcagcaga tctcgcaaga cttgccgcaa gggtaggaat tgtcataggt caaatccgac 840 cagacatatc aaccagtatt gatactgcca atattgcatt tgacacgcta tcagcagatt 900 ctcaaaccaa tgcacgagga ctgatagctg tccagcaacg attgttggaa ggacaagatg 960 tacacgacaa tgacattgct attatgttac gtcagtgtac aatggtagaa gcaaatgttc 1020 gagaaatcca agaaaggatc aatagggaaa ttgccaccaa gttggctgca caagatgcac 1080 ggcaagaaca tcacgaagga gtaaccactc aactgcagcg agaagctcta gaacgagacc 1140 tcaccctgga acgtgccttc caagagttca gagcacaaca gacccagcaa aataccgcaa 1200 tgcgaaccct gatatccaat ttggccgcaa ggcaagatag ggactcagaa cggtatagtg 1260 actttacccg taccatccaa aaagaccaaa cccacatatc cgatacccta tcactagttc 1320 ttaacaagct taaggcacta gatagcggac aaccaggtgg tcggcacgct ttgttggaat 1380 cagtcccaga ggaatgggaa gagactgcag aacaagggga acctgaagaa atcccttctc 1440 gagacaaggg gaaacaaaga taaggagaac ctccaacctt tctaataccg ggtggcgacc 1500 cggatgatgg aagtgatgat ggagatgacg atggaaacaa agggaaacgt ggaggagggc 1560 catccagatt caagcgggga ttaacccccg aacagcgtgg aacagctgac gtacttgcag 1620 aagcattggc tcaaacagtt aaacgaccgg cagacccgcc cccgtttttc gaagcaaagg 1680 ccagtcaatg tatttaccat tggttaaccc aatatgtaga ctacttcgaa cggaacccaa 1740 gaatctggga tgatgattga gaacgcatca tttacgtatt ggggaggaca aaaggggatc 1800 aggtcagccc aatagctgta tggtatcgac gcgcaatgga tggtctccga ggagaagaac 1860 aagatccgga actatcttct tggactcgct ttgagcacga attcactagt cggtatgccc 1920 ctctcgaaga aggacgaaag gccatgcggg atatggaatc tgtggagtac aaaaatgata 1980 ttcagacctt tatggggaaa atgcataatc ttaatctcca cgcccgatta acgggaccag 2040 cttggcaaat ggcatgcgat aggaaattgc cctccaagat taggtggcga ctctccttaa 2100 acaaatttga taccgatgcc gaatgggaaa aagcagtccg ggaagcagga agggatgaag 2160 aggaacaaca gaaacgggac cacctggacc gggacaaatc aaaacccgta aggacgtctt 2220 cagataagaa acctacggac aaaattcaaa caaaaaggga gaaacccttc ataaaaggag 2280 gtggtccctc caagaaaagg aaacggggac 
cggaacccaa taaaccggta aagaaatctg 2340 ctggaactac gaagttaaca cacacaaact tcaaaactgc ccatgaagga atccctcagt 2400 caactatgga tgaaagaaag cgggatggca attgtactcg ctgtggtaag aaaaaccaca 2460 actacaaaac ctgctgggga caaattcagg tatctagaac aagggcagat gactgaccat 2520 tcaagacact aaggcctaga agggtggcaa cagttaaccg tcccagaaag atttccactg 2580 cccaagttcc aagaggtaat aaactctggg aactcggcga tgatgaagag atgacttaag 2640 tcatctcatt ggtaccgaca actacggtgt tcgggtagtc tagaagagac atactctcta 2700 ggaatcgaaa tcaacggcaa attaagggga cgccgaagcg aagaaaacga cttcttgcga 2760 tcatgacctg agtgggagta cacgcgtcct tcttgccgca aggacaaggg gattaataaa 2820 gagcaatgta cgcattggcg gcgcctctat gcttcccaag agagtgacac aagctcagtc 2880 taatgaccta atgcatgacc gttagggtag tcaggcacgt gacctggata ctgaaaggtc 2940 tccgataagg accaaagcct agacaggaaa cagacagcac caagggttcc gaagggagcg 3000 acagtagggc taagatgaag gatctagagg atgagcgaag cacatcttcg atctatatgg 3060 gaaataggta caagagctct agaccttacc tttagttact gcttcttagg tgccgcgctg 3120 gtaccaaacg gaactcgata aaatgaggac actgtctacg gttaagcgag ggtggagaat 3180 aagcctgaga aggttatcaa agagacagcc gaccagccct acgcacccgg cctttgtttc 3240 catgatatct ggacgatggc agagaccccc ttgattggtt ccacaggaag tataaagtgg 3300 aaggtatgag aacccctcgg cagggggtaa agaccgctag tagtgggctg cccaatatac 3360 caaagggaaa caaagtttgt ctgaggagac ctaaatggtc gaaactgcgc aggcgcggta 3420 acaaacaccc tagttgttat cacacctcag acatgtcgaa acgacaaaac gacggttcga 3480 cacctcatcc aaagaaacgc gcttttagtg caccaaaacc tacaagatat aactgtcctg 3540 agggacatgc ttttattcac attataaacc caaaacgtca gcactaccaa ataagaatac 3600 tacttgattc aggatccaac gtattcctga tcaacaaaga actcgtaaag gaattacgaa 3660 tcccgttcac agagagaccg aatccggtca atattcaggg ctttgatgga gtatcaactc 3720 tgtccggagg aaaattttat acgtacccta tttggcttga aataggtcgc aacaaacata 3780 tgtctgaagt tgcgtgcgaa atcgcagcag caggcgatta cgggatgata atccctttcg 3840 gatggtggca ttcagaacac cccttatcca acattggaga acccgcaaag tgggaattca 3900 aaaactccaa atgcaagaac tgcgtggaag aagaagatga aggaattgca gatatgttcg 3960 aatatgatga aacagtagcc tgggatgaaa atgcgatcca 
agtaggaaga attggaacta 4020 tagaggatcc taacgaaatc aacttcgatt ctgtcccaaa acactactgg aaatttaaaa 4080 agctatttct cccccaaact gcggcaaagt tagcacctag gagaactttc gaccatgaaa 4140 ttaaattagt cgaagaagca aaaccctctt gaggacccat ttatcctctg tctgacaaac 4200 aactcaagtt gttatgtgaa taccttgacc aaatgttacg ggaaggaaag atcacgaaga 4260 gtcaatcacc tttcggtgct ccaatcctgt ttacaccaaa agcagatgga agattacaac 4320 tggtagtgga ctaccgaaac ttaaataaga taactatctt aaacaagtat ctactaccgt 4380 taatgtcgga attacgtgat agggtagcag gagctagtat cttttcaaaa atcgatttaa 4440 aagatggcta caacctcatc cgaattaaaa aaggggatga atggaaaaca gctttccgat 4500 cgcgttatgg acactatgaa tataaggtca tgccgtttgg attagccaat gctctagcta 4560 catttcagga catgatgaat gaagtccttc gggaattcct agaccaaggg atcgtatgtt 4620 atttggacga tattctcatc tattcaaaaa acgagaagga acatatcgag ctagtaacca 4680 aagtgttaca aagactaaca gattttcaat ttgcgatttc aatcaagaaa tcctttttcc 4740 atgttaagga ggtggaattc ttaggctaca tggtagctgt cgatggggta tctatgagta 4800 cccgaaaggt ggattcagtg ttaaactgga ggtcacctac gtcggtcaag gaagttcaaa 4860 tattccaaga atttgcaaac ttctaccgac gatttataga aaacttctca aagatttgtg 4920 ctccaattac agaaacattg aaaggtgact ctaagaagtt ccagtggggt ccggaacaag 4980 aaaaggcctt tcaggagctt aagagaaggt ttacttcggc cccaatctta caccattttt 5040 ttccagaaag agaaacggtg gtggaaacgg ataccagtga tttcgcttta ggttgcatcc 5100 tttcccagtt tgtggaaaaa agactgcatc cagtggcttt tcactcccgc aaactaaatt 5160 ccgcagagtg aaattacgag atccatgata aggaactgtt ggcaatacta gaagcattca 5220 aagaatggcg tcaatacctt gtcggaactg aaaaacccat aacggtttat accgatcacc 5280 aaaacttgca atatttcctt acctcgaagg tctggagcgg acgccaaatc cgttaggcgc 5340 agaagttggt agactacaac ttcaggattg tctaccgtcc aggcacaaag ggaggtaaac 5400 ctgatgcact cagcagacgg ccggagtacc gttctgaggg gggagctgca catcgcgaac 5460 aggcgatatt aaaacccgaa catttccaaa tcagttccgt aacaagaact acaaaaaggg 5520 aaattcaaat atgcttaata gcaggtggct caccggttga tataagcaca ggtgatatac 5580 aggaaacagc aaactggaat aactcgaact ccgtgattcg agtaaaacgg ttagatacaa 5640 gagcaaggct accaaccaag ggttcacgga tggcagcggg acatgatctc 
tacacaatgc 5700 aggagaatat aattccagca cggggacaac tagttgtggc aactggaatc gcaatcggaa 5760 ttccaaccgg aacctacgga agaattgctc ctagaagtgg actagcagtc aaaagcggta 5820 tttctacagg ggctggagta atcgacacgg attataccgg cgaactcaaa gtattactat 5880 tcaatcatca agacgcagac tgtcacatta aagctggcga ccgaattgcc cagttagtga 5940 tagaaaaagt taatactgca gatatgatgg aaattgacaa tctagaagat accgaaaggg 6000 gtgcgtcggg attcggcagt acggacatcg caacaagggc tattacgatc cgggaattgg 6060 agccagtagt cagccagtta catgcctcag cggaacagaa tatatactgg tatggcacag 6120 ccgaaatacc tgaacgagcc ctcatgacta gctctatact agcccataag gaactcaaac 6180 aattcaatcc ggaattcttg gaacagatcc aaaaagccat cacggaagat agagagtggc 6240 ttaagagaag aaatgagtta gaaaaactat gagacaatga caaacctttc ccaaaagatt 6300 ggaccctatc cgatggattg ctctactaca aacaaagact atttgtacca aataaccaag 6360 acctttatac gacaatagct gaaggatgtc acgatgctaa gatagcagga cacttcagac 6420 aagaaaaaac ccttgagata attatgaggg acttctattg gaagggattg gcaaattggg 6480 taaatgatta cgtacggtcg tgcgatgatt gtcaacacaa caaatcccca cgacatgtga 6540 aatacggtct actacaacca ctatcagtac catatgcagc atgggcatca atatcggtcg 6600 acttcattac acaattacca caaagtcaag gctgtactca aattatggta gtggtcgacc 6660 ggtttacaaa aatggcccac ttcatcggat tagaaggaac agctactgca aaggacgtgg 6720 ctacagcctt tacaaaagaa atatggaagc tacatggctt accaacagaa attgtcttgg 6780 atatggattc caagttcgct ggagaatttt gggaatcctt gtgcaaaaca ctaagtataa 6840 aaagaaggat gtctacagca tatcatcccc aaacagatgg acaaactgag cgaaccaacc 6900 aagtgctgga aggttacttg tgaagtttcg tcaactatga tcaggatgac tggtaccagc 6960 tcctaccgtt agccgaacat gcttataata actcggcaac caatgcacat ggactaaccc 7020 ctttctacgc aaactatggt ttccacccac aaaccgaatg gatgaaggaa cacgaggcgc 7080 ataatcccgg agccacactc tatgcgaact ggataagaat gatccacgaa caagcaaggc 7140 aatcacttca acagacacga gaagctatgg ccaaatacta tgacagaaag gccaaacaac 7200 aacctgactt caaaatcggt gacaaagtaa tgttaaatgc aaaaaacact cgcacgcgtc 7260 ggaagtcgaa gaaactgagt ccaaaactat atggaccatt cgtgatcttg gagaaacggg 7320 gaaaccgtgc ctgcaaactc aacatttcgg atcgatggaa catctaccca gtgttccatg 
7380 tatctctgtt agaaccttac agggagtcgg tacacgcgaa ccgacaacaa cctcccagag 7440 aacccgagga cattgagggc gacttacaat gggagataga aaaaatagta agaagcgaaa 7500 tacgctctta ccgaaaaaga gtacgtagtc gctaccagga ggtacgtgaa ctatggtact 7560 tcgttaaacg gcaaggatgt tctgaagatg agaacacctg ggaacctgcc acgggcatgg 7620 ataacacaag agaaatggtg gatgaatttc accgcgataa ccctgaaatg ccacggcggg 7680 cagaggaagg acgacgagaa aaaggttttc ctgccatggc caagcaaacg cgggaggttt 7740 tttaccctac tcttatcggg tcctaaaaga attaatgttt aaggtttaat aattagttgc 7800 gttaattaaa ttttttaatt tttatttgaa tgtaccacaa aaggcattca ggacaaagcc 7860 ttaagaggag ggagctca 7878 // ID Gypsy-8_RO-I repbase; DNA; FNG; 5740 BP. XX AC AACW02000181; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-8_RO_; KW Gypsy-8_RO-LTR; Gypsy-8_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-5740 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000181; Positions 7485 1746. XX CC Positions [2973-3449] - Reverse transcriptase CC Positions [4736-5221] - Integrase core CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 196..1104 FT /product="Gypsy-8_RO-I_1p" FT /translation="MSTSVNIKSSSSLDKGMLRYLNEPKCFAGGNDYEEAT FT TWLDRMTRLQAATRMSDEEILFVAGDHLVEKAATWWKVVGKKSTDWKSFEE FT AFKDQYLADREGSWWRQLQTLKQGPKDSIDDVAFRMQELFDLLGNKNHDIQ FT VSMFLDAIDPTIAFEVDKDVTPSTLRDARARAKQVERSIQRYGARPGPSAP FT VLPDRDVFSVGSRNGYGRDDLSSAVSTMFSLADKLEKLTINLVRANDGMAQ FT ENAAASVKPRRGLVCFFCDEEGHKKYDCPKYLARQGNKEGDRSSPASGSNL FT TPISGKGSEHQ" FT CDS 1467..4130 FT /product="Gypsy-8_RO-I_3p" FT /translation="MMGNAHGGVLQNNGMPGGNGMSAHGIPEVIGVNATGG FT RAINNGGSREVLPSGVIPNKPVKKRTPRKKARRLPVKLRGGRTIWDILDQC FT KADITATELIALDVKAQKDMVDGIRFLRENKRKLRKPVEAMEVDQGTGVDR FT VGEAGSATNPAVVNVVDQDWETEDSLSDEASTDLSSMYFEDDGSDDDQASM FT VSETDDGVSVYCYPYNLQRMKVSSPLRGAIMINDKPVEVVFDTGASVSVIG FT KGLVDSLGLVPNGDTLPLTGFDNKKGPSSPIVMDVPIRIAGKLRPEHMCIQ FT PTGNDNLCLLGIPWFQAYGIEIDVVNSCVKIPTTSGMVKLQAYTTHLPVPV FT ASMSGGGVGGVVFPVVDSNVSTAATMGSDRQVYMVDASQHHCDNYEEDLLP FT VGEPIKEEIKFDAENITMGVPDELVEVIERYKNCFSEVSGLGRVKNYVMDI FT PLVSGATPIRSKPFRMTWQEEEALDAYLEELLDLDIIKPSNGLWTSPCFFI FT PKKDGTLRLVIDYRRLNKMIKQDAYPLPHIDELLDAVGGATVFSTLDCTSG FT YHQLPLNPEHAERTGFVTKKGTFSFNVLPFGITTGCSQYQRMMNSVLSKYV FT GDFILIFLDDILVYSKNMEDHKVHLSLLMEACQEVNLRLKRKKCRFGDSQV FT EYLGHLITGDGVLPSDYNIDKVKKFSVPANVDEVRSFLGLTGYYRKYVPNY FT ASVAEPLTRLTKKKVGFSWGAEQQAAFDHFLVALTRAPILVYPDRQKVQVL FT SVDASGKGLGAVLSQVDDAATMANERVISYASRGLRGSECNYAITHLEALA FT VVWGVTHYKHYFKGRHFILITDHSSLVYIFRPTRLTPKLSRWAACLLDYDF FT EIRYRAGVINPADVLSRMVLDECTAEVQSGDFIKN" FT CDS 4364..5620 FT /product="Gypsy-8_RO-I_2p" FT /translation="MAVSFGWFLALYRYIQDGTLPQDCDGKTRKRLEVHSK FT RWYIADDRLLQKSTARPLLHEGEALEVVKRLHEEGHFDILNTMERVNKYYV FT VSKCRELVTAVVKSCDTCQFRSRIKAVRNNPATVMKTPRYPFFMVGIDAVG FT PLQKTEKGNQYILTGIDYLTRWPVAMAVADITEETTVEFLYHEIVKNYGVP FT QYILSDRGANFISTYVHYFLRQIGCKNIMTTSYRPQVNGMCERLNQTLTQT FT MAKLARDGEEISQWDKYLDPALMALRSMVNTATGYSPSYLLFGYEFRSPAV FT WEAPRQDFVLGEELDALKDRVVMIQDKMEEVRTLARAKSDEQKAKAKIRYD FT ARVVDPRRYQVGEQVLLKDNTPTTKFSDKWLGPYTVQKVNKNGTYHLTGHN FT SQRLKHAVNGDRLRLYDKSAAHFGA" XX SQ Sequence 5740 BP; 1594 A; 954 
C; 1507 G; 1685 T; 0 other; gtcttctctt ttctttttat tatatttctt tttacactta ctttacgcct ttaaagtcaa 60 tactagttct ctttttcctg gttactgttt taaatttcaa gctattcaag gtaaagcata 120 atcaggggaa aagcatatat caagtggtcc ggcctaccag ataaagtgta ttaaaaggga 180 tattcaagtt tatagatgtc tacaagtgtt aatattaaaa gttcaagctc tctcgataaa 240 ggaatgttac gttatttaaa tgaaccaaag tgctttgcag gtggtaacga ttatgaagaa 300 gctactacct ggttggatcg tatgactcgt cttcaagctg ctactcgaat gtctgatgaa 360 gaaatattgt ttgttgctgg tgatcattta gtagaaaagg ctgcaacttg gtggaaagtt 420 gtaggaaaaa aatctactga ttggaagtca tttgaagaag cattcaagga tcaatatttg 480 gcagatagag aaggtagttg gtggcgtcag ctgcaaacgt tgaagcaagg tcccaaggat 540 tcgattgatg atgtggcttt ccgtatgcag gaactttttg atttgcttgg gaataagaat 600 cacgatatcc aagttagtat gtttttggat gctattgatc cgacgatagc atttgaggtg 660 gataaggatg tcacgccatc tactttgagg gatgctaggg cacgagccaa gcaagtcgaa 720 cgcagtatcc aaaggtacgg tgcacgtcct ggtccctctg ctccagttct acccgatcgt 780 gatgtctttt cggtgggtag tagaaatggc tatggccgtg atgatctgag ttctgctgtg 840 tctacgatgt tttctcttgc tgacaagttg gagaagttga ccatcaattt ggttcgagct 900 aatgatggta tggcacaaga aaatgcggct gctagtgtga agcctcgtcg tggtttggtt 960 tgcttctttt gcgatgagga gggacataag aaatatgatt gtccaaagta cttggctcgt 1020 caaggtaaca aggagggtga taggtcttcg cctgcgtctg gatccaactt gactcccatt 1080 tcgggaaaag ggtcagaaca tcagtagagg aggttactgg tgttcgtaca aaaaatgcaa 1140 atcctctatt aaatgaacaa gtcgaaatca aattggtcga cacggtgcgt gaggaggcct 1200 tctgtgacac tggtgaagtg tatgtaaaaa gaagagcaga gggtcctcct tcagttgcga 1260 atgggcctgg tagggtggct aaacatattg tcacgggcgg tggtaatgcg gttgggtcgg 1320 gtcctggaaa tttggtggct ccacttccac ctagcggtgt ttatggcggt gctcaaggaa 1380 atttctctgc tgacattcca gtgggtgcac cggagccatc cggtactttg ggaggtactg 1440 tcgtgcctaa cggtcaaggg cctgctatga tgggtaatgc acatggaggt gtattgcaaa 1500 acaatggtat gcctggtgga aatgggatgt ctgctcacgg tatccctgag gtaattgggg 1560 tcaatgctac tggtggtaga gcgatcaaca acggtggtag tcgtgaagtt ttaccaagtg 1620 gggtgatacc taacaagccg gttaagaaaa gaactcctcg taagaaagca agacgtttac 1680 
ctgtgaagtt gaggggtgga cgaacaatat gggacattct tgatcagtgt aaggcggata 1740 tcactgcaac agaattgatt gccttggatg tcaaagccca aaaggatatg gtggatggta 1800 tccgtttctt gcgggaaaac aagcgcaagc tgaggaaacc ggtagaagct atggaggtag 1860 atcaaggcac tggtgttgac agggtaggag aggctggaag tgctaccaat ccagctgtag 1920 tcaacgtggt cgatcaagac tgggagactg aggattccct atccgatgaa gcttccactg 1980 acctgtcctc tatgtatttt gaagatgatg gtagcgatga tgatcaagct tcaatggttt 2040 ctgagacgga tgatggtgtg tcggtgtact gttatccgta caatctgcag agaatgaagg 2100 tcagttctcc tttgcgtggt gcgattatga taaacgacaa accagttgag gtggtttttg 2160 acactggtgc tagtgtaagt gtcattggga agggcctggt agattctctt gggttagtac 2220 caaatggtga cactttgccg ttaactgggt ttgacaataa gaaggggcct agcagtccta 2280 tcgtgatgga tgtacctatt cgaattgcgg gtaagctacg acccgagcat atgtgtatcc 2340 aacctactgg taatgacaat ctctgcttgc tgggtatacc gtggttccag gcatatggga 2400 tagaaattga cgtggtaaac tcgtgtgtca agatccctac tacgtcaggt atggtcaagt 2460 tgcaagcata tactacccat ctacctgttc cagttgcgtc catgtctggt ggtggtgttg 2520 gtggtgtggt cttccctgtg gtggattcga atgtgtcgac agctgcgact atgggttctg 2580 atcgacaagt gtatatggtg gatgctagtc agcatcattg cgataattat gaagaagatc 2640 tgctgcctgt aggggaacct attaaagaag agataaagtt tgatgctgaa aatataacta 2700 tgggtgtacc tgatgaactg gttgaggtca tagaacgata caagaactgt ttttctgaag 2760 tttcgggtct tggtcgtgtc aagaactatg tgatggacat tcctcttgtg tcgggtgcta 2820 caccgattag aagtaaaccg ttcagaatga cttggcagga agaagaagcg ctggatgcat 2880 atttggaaga gttactcgac ttggatatta tcaagccttc taatggccta tggactagtc 2940 cttgtttctt tataccgaaa aaggatggta ctctgcggtt ggtgatcgac tacaggcgat 3000 tgaacaagat gatcaagcaa gatgcgtatc cgctgcctca tattgatgaa ttgttagatg 3060 cggtaggagg tgctactgta ttctctacgt tggattgtac ttcgggttat catcagttgc 3120 ctcttaatcc tgaacatgcg gagcgtactg gttttgtgac taaaaagggt accttttcct 3180 ttaatgtctt gccctttgga atcacgacag gttgtagtca gtatcaacgt atgatgaatt 3240 cggtgctatc taagtatgtt ggtgatttta ttttgatttt tcttgatgat atcttggtgt 3300 attcgaagaa catggaagat cacaaggtgc atttgtcgtt gttgatggaa gcttgtcaag 3360 aagtgaatct 
acgattgaag cgaaagaagt gtagatttgg tgactcgcaa gtagaatacc 3420 ttggtcactt gattactgga gatggtgtct tgcctagtga ttacaatata gacaaggtaa 3480 agaagtttag tgtgccggcc aatgtggatg aagtacgttc ttttctaggt cttactgggt 3540 actatagaaa gtacgtaccg aactatgctt ctgtggcaga gccacttacc aggttgacaa 3600 agaagaaggt tggtttctcg tggggtgctg aacaacaagc agcatttgat catttcttag 3660 ttgcgttgac aagagcgcca atcttggttt atcctgatag acagaaggtg caagtgctgt 3720 cagtagatgc tagtggaaaa ggcttgggtg cagtgctgtc gcaagtggat gatgctgcta 3780 ctatggcgaa tgaaagggta atctcgtatg cttctcgtgg tttgcgtggt agtgaatgca 3840 attatgctat aacgcatttg gaggcgttgg cagtggtatg gggagtcacg cattacaaac 3900 attacttcaa gggaaggcat ttcatattga ttacagatca ttccagcttg gtatacatat 3960 tcaggccaac tcggttaact cctaaattgt caagatgggc tgcgtgtttg ttggactatg 4020 atttcgagat aaggtatcgt gctggtgtga taaatcctgc tgatgtattg tccagaatgg 4080 tcttagatga gtgtactgcg gaggtgcaat cgggtgattt tatcaaaaat taagtggtgc 4140 tgtgaaaatc tgcttgaaat gaactgttat aaaaaaaaaa aaatttgtta atatatatat 4200 acacgaagtt gaataaataa aaagggatat cgaattgggg ggaaatcgag gataatccaa 4260 aatctttact tacaaaaaaa aaaaaaaact tttcgtttaa atttctattc tttgataatt 4320 aattattttg gtgttgttgt tttaattgtg gactgttcta ataatggcgg tttcctttgg 4380 atggttctta gccttgtatc gttatattca agatggtact ttacctcaag attgtgatgg 4440 taaaactcgt aaacggttgg aggtgcattc gaaaagatgg tatattgcag atgatcgttt 4500 gctacagaag agcacggcta gaccattgtt acatgaagga gaagctttag aagtagtcaa 4560 gcgtttgcac gaggaaggtc attttgatat tcttaatact atggagagag tgaataagta 4620 ttatgttgta tcgaaatgca gagaattggt gacggctgtg gtaaaatcat gtgatacatg 4680 tcaattccga tccagaatca aggcggttag aaataatcct gctacggtta tgaagactcc 4740 aaggtatccc ttctttatgg ttggtatcga tgctgtagga ccccttcaga aaactgaaaa 4800 aggaaatcaa tatatcctaa ctggtataga ctatttgaca cgatggcctg tggcaatggc 4860 tgtggctgat ataacggaag aaactacggt ggaattcttg taccatgaga ttgtcaagaa 4920 ttatggtgtc cctcagtaca tcctatcgga taggggtgct aattttatat caacgtacgt 4980 tcactacttc ctcagacaga ttgggtgcaa gaatattatg acaacaagtt atcgtcctca 5040 ggtgaatggt atgtgcgaaa 
ggttgaacca gacgttgact caaacaatgg ctaaattggc 5100 gcgagatggt gaagaaatat ctcaatggga taagtatttg gaccctgcgt tgatggctct 5160 acgatcgatg gtgaatactg ctacgggtta cagtcctagc tatttacttt ttgggtatga 5220 atttcggtct cctgcagtgt gggaagcacc tagacaagat tttgtacttg gagaagaact 5280 ggatgctcta aaggatagag tagtaatgat tcaagacaaa atggaagagg tgagaacatt 5340 ggcaagagcg aaatcagacg agcagaaagc aaaagcgaag attcgttatg atgcaagagt 5400 agtcgatcca agaagatatc aagttggaga acaagtactt ctcaaggaca atacgcctac 5460 aaccaagttt tctgataaat ggttgggacc atatactgtc caaaaagtga acaagaatgg 5520 aacttatcat ttaactgggc ataacagtca aagactcaag catgctgtta atggagacag 5580 attgcgactt tatgacaaga gtgctgctca ctttggtgcc tgatgtgttg acctcggctg 5640 cacaacaaca atttcgtaca tgggtgaatt caagacagaa tcctgccttt ttggtgaagg 5700 ttgttccatt taagaaagaa gaggctaggg gggccgtacg 5740 // ID Gypsy-103_MLP-I repbase; DNA; FNG; 8905 BP. XX AC AECX01000547; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-103_MLP_; KW Gypsy-103_MLP-LTR; Gypsy-103_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-8905 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000547; Positions 12661 21565. XX CC 'ATCTC' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(4842..6140,6144..8807) FT /product="Gypsy-103_MLP-I_1p" FT /translation="MSKKFASEAKFETSPLDQQRSVTGFSGHVSIVTHTGD FT YCVNNEFPPTTFIITKLRDKYDVILGMPWIRMNHHLVDWTNGCLKNSDRTE FT IATASLVSLLPTKSSMDHITRPKRNARNSSKGVQVLNDLSTPPQCEFDFNI FT NPMTPESAGECVSLLENSTPTKHQHTGIRDDIKMEKLPSESLQLPKTTSKD FT HKGDLERNARVVEMGVKSAIDLSNPPQSECDNLVRQNIHRVASKQRSPPIT FT FAKAGTTIRRLFGRHTMAKPTTTMIDSASASWNLLAKLAVEASKGKDDKPA FT SELVPPCYHEYIEMFEKAKSNVLPPRRPYDFRVDLVEGATPQAGQIIPLSP FT KESEVLNEMIEKGLANGTLRRTTSPWAAPVLFTGKKDGNLRPCFDYQKLNA FT LTVKKRYPLPLTMELVDSLLDADQFTSLDMRNGYNNLRVEGDEAKLAFICK FT EGQFEPLTMPFGPTGAPGFFQFFIQDILKAHIGKDVAAYQDDILIYTKPGV FT DHEKVVKEVLDILKKQNVWLKPEKCKFFKKEIGYLGLVISRNQIRMDETKV FT QAVCDWPAPKNLSEVLKFLGFSNFYRRFIHHFSKIARPLHELSQADVKFEW FT TDERNQAFESLKKAFTTAPVLTIADPYRPFILECDCSDFALGAVLSQVSAE FT DNQLHPVAFLSRSLIKAERNYEIFDKELLAVISAFKEWRQYLEGNPNRLNV FT IVYTDHKNLQSLMTTKELTRRQARWAKTLGNFDFEIRFRPGKDSSKPDALS FT RRPDLKPKDHEKLSFGQLLKPENLPNDAFIELLDLIDSWIIDESIGINSLD FT HVDSWIVNDSISMDEIEAEPKRKDVWSDERIIQEIRSKSCQDKRIMDVATL FT CNELPNAKLMKDYSYTDGILYKKEKLVVPNVNEIKLQILRSRHDSLLAGHP FT GRMRTLMLIKRTFYWPSMKAYINKYDGCQSCQRVKTRTTKPYGSLQPLPIP FT AGPWVDICYNMITDLPISKEHDCILTVVDRFTKMVHFLPCNKTMTSEELAT FT LMLRNVWKIHGTPRTITSDRGNIFISKLTKDMNRKLGIQTQSSTAYHPQTD FT GQSEITNKAVEQFLRHFTAYKQDDWVDLLPMAEFSYNNNLHVSIGMSPFRA FT NYGFDVSFSGTPTQEQCLPAVEARFSQLNDIHNELKAAMKEAQEAMAENYD FT RKVLASPTWNIGQEVWLSSKHVSTTRPTAKFSHKWLGPFKIEQRVSTNTYK FT LILPKEMQEIHPVFHVNLLREYVKSQIEGQEDIPPDAIMIQGNEEYELNEI FT LNRRKRRGKIEYLVSWKGYGPNHDSWEPEKALGNAKEVVKDFNKRYPQAEE FT QFKRTRGRK" XX SQ Sequence 8905 BP; 2852 A; 1788 C; 1971 G; 2294 T; 0 other; tattgcaaca tctgtcatat taagggaccc gagcaactgg attcaagtta agaagaaaag 60 aagaccacca aatcttaaga agaaaagaaa agaaagttta aagttaatct aaagaaccaa 120 agatttatta aagtttaaat taaagttaag attgaaatag aagtgtaaag actaaacttt 180 aaacatcatt ttgaaactac aacggaagca ccactctacc ccgcattctc aagatgtgct 240 ccgaaaccga aaatccgtcg gatactcgac ggagcatccg acggttccgg atgtcggatg 300 aggccttgga gctatccggt tcccggatac tccggccggt 
tcggttatcg gatgaggggg 360 tcagaaaatg cgatatccgt cggatatccg gcggatattt gcatatcatt gcagtctcag 420 gtagccgtgt gtttacatca atttaaggaa aaatattgaa taaaaaggga aacaagatga 480 taaaaaaaat aaaaactaat cactatcatc ctcagcatca cggctaggtt tcaaagggcc 540 aaacacccga tgccaatctt tcacacacaa gagagcctca acgtgtaagg acgcgagatt 600 agtacgctga ggccccaaga tcctttttgt cttgttaaaa atcccctccg acggacagct 660 agttgctgga atggcaagga aggtcatggc catttttgcc aatgcaggat acatatcgct 720 accacgtttt ttccaaaaat ctaggacctt agtgtcgcgg gattcgagtt tatcgttgag 780 atattgagtg atctccgcac tgatcgaatg gtcctgtgga gcgtctgggg caccaaaaat 840 agcgtcttcg aagtactcgg aagtagacgt tgcagtggat ttcctggtgg tagttggagt 900 gggcattgaa tcatcgactt catatttcaa ggcctctagt tcaaatgttg acctgatgtc 960 tgaagtaaag aggtcgtact cagaaaaaaa catggatcgt tccttttggt aaatgagaca 1020 tacttcctga tcagtacaac ctcaaagtaa gatgagtatt aagaaactca cttgaaaata 1080 gctcattttg aggcgtggat ccaggagggt agcacagact ggcgcaggtt tagtaaggac 1140 tttcttgaag tagtcttcaa gttttgagat catgttttta gacgcgggtt gaagttgatc 1200 gtgatcgtat tgatggcgta cctagatgta cattaagtca cattagccaa ttataattaa 1260 tgtgtattta aaatcaaaaa gatacaaata cttactaatc ttaatttttg caagattgca 1320 acatacaaag gaacaacttg atgcattgtt ggatactttg acttacagag tatgtcagtt 1380 gcttcgctca gaggttccaa gaagttgcac atttgtttga cgtaatccca ctcgagcccg 1440 gagagcgcat actttttgta ctcgttttgc aggcagaatt catcacatgc ttctcgtaga 1500 cgcaaaagtc ttcgaaacat gtgatatgaa gagttccatc tggtgggaac atcgacaatc 1560 aggcgagttg cagtttcagc cgggtttgtt tttgggggtg gttgaacatt gtcgacagta 1620 ggatcatcta atagataatc gagagcctca gtgtcagcag gatcagaatt atcagtatca 1680 gggtcggctt gggctgatgg aggattcgca gctggatttc tttgaggctg aggattgttt 1740 cttgaaatct gattcttctt cgcagcagca gcaatagcac gcttgagctt gacaatatca 1800 gcgaaatgtt gaccacgttg agggctggac cgagtacaag taactagacc atggcatcga 1860 ttgtagatgg tcttgaggtt tacttgagtc acgtcgggtg ggtcgacaag gttgttaata 1920 tccattaagt ttgatgaatc ttgctcttcg tcctcgtcga gaatttcacc aaaaaccttc 1980 agtccctctt ttgcagctag gttgataacg tgcgccacgc aaccatggac ttgtttcgtg 
2040 tgatcgaaag tgataccatc tgaatttgaa aggactcgac ctatagcagc gtttgtacct 2100 acattatcgg cagtcatcgc acacaactta tcggtaagtt catacttggt aagatattcg 2160 atgattaaac cggcaaagtt atcggcgttg tgtgcacctg gaagattggg tgtcagtcag 2220 cttgagttga tcagatcgat agaaaacaag atacactcac ctttgatttc cgcaatgcca 2280 agagtcaggt ccataagttc ccagttctcc gttataaagt gaccagtgag cgacataaac 2340 ggcttattat ttggagacgt ccatgcgtct tgagtaatgg cgattgcttt gcaaggtcct 2400 aagaaaagtt tcttgatatg atcttcatga tgataataaa gtctgcgcat atgtttagaa 2460 attgcatgtc gtccgacaag cattcctttt acttcaggat tgacaagctt caataatctt 2520 cggaaggatt ctttctcaac gattgagaag gcgaggtcgt tatcggcgat gagataacat 2580 atactagtct tgaggttgga aacatcaagt atatcctaac agtttttcga atcaaaagaa 2640 catgaacatc agctattgtc gatgtgttct ggagtaagaa gacaaaaaga ctcgccatgt 2700 gaactgattt catcttcttg aaacctgcag tcaagcttcc ttgtcttcca tcgattggac 2760 cgtcatcgcg gagattatgc ttgaaagtga ggtggttctt catcgaagta gtacttcctt 2820 tatgatcgcg tttcaacttc gtgccacatt cttttccatc cctcatgacg taacgacact 2880 gattgatatc tgagttttct aatcttttaa aatatttcca tacgtaactt tttcgagttg 2940 actgagattc cgtcgtatcg gagctatcgg catgggacaa atcaattgtg gttgaagtgg 3000 gggcttcgga catgttgatc aattgattta aggttgatta attgaagtga gaaggtcgtt 3060 tgcgcggtga taagtggatt tctcagacaa ttgtaaggtt tgaggtggtg gggaagatga 3120 tcgaaaacgc ccttttctct ttttttttat tttttttttt ttttggctgt accggccgga 3180 tatccgccgg atagccggcg agtagccggt tcgagtgtcg gatgggggtc cagcatgccg 3240 gatagtcaaa aataaccgtc ggtccggatg tcggatggag ccgaccacgc cggatattga 3300 ccggagcaca tcttgaccgc attcatctcc aaaacgcccg agttcaccac accttctcga 3360 gttagctcga ctttgtcgac agaaagtgac ttgtacatcg aagtatcaag tacattggac 3420 gagatgtcaa acatcacttt agaagatgtc atgcgacaag ttcaggatct taatcaacga 3480 ctgttagcca aaacctctcg acgtcaggaa gttgaaaaag aattgcaaat aatgaaggaa 3540 gaaaaacaac agcacttcca aaacattcca caagcgtcta atctctcaaa ccccgctcaa 3600 caacaacaac agatctctgc tgtccagcaa ccttctcaac accgtactcc aaaggtggcc 3660 acacccgaca aatttgacgg accaaaaggc ccaaaagccg aaatttttat gaatcaactt 3720 
ggaatttaca tgcggttgaa ctcatctgtg tttgaagatg aacaggcaaa agtggcattt 3780 gcactatcct acacaacagg aaaagcaagt atatggggtc agcatttaat ggatttaatg 3840 ttagatgaag ctcaagtgca caccgtaact tggtcaaggt tcatagattc atttaaggca 3900 acattttttg atggtgaaag agtatttaaa gctgagaaag acatacgaga cttaaagcaa 3960 accaaggcag tgtcagacta ctggatctga ttttctgaat tatcgcttgt tgtaaaatgg 4020 cctgaatctg tccttatatc tcactttcaa caaggtctca aatccgaaat cactatacat 4080 atggttagag atacctttca aacagtggaa gaaatggcaa aactggcgat caaaatagac 4140 aacactcttc ataatcgtaa cccagatggc attaatgcgt caacgtcgac tgtcagtaca 4200 ccaattcaaa cccccgatcc caacgcgatg gactgctcgg cttataaact caacatatca 4260 agtgaggagt ataattgtag aggtgcaatt ggtgcttgtt atgagtgtgg gaagaccggt 4320 cactttatag gaaactgtcc gagaaaaagg ggagcgaaga gggggtattt tagaggttcg 4380 ggttatcagg caagatcaaa tcactatggg ggatcaagtc attcaagaat tagtgaatta 4440 gaaagtcagt tgaaggcgcg tatagatgaa tatgatgaaa agatgggtag aagtgataaa 4500 gagaagaagg aggaaggaag agcaggaatg gcaaaaaatg gagatgcttg cactgcgtcc 4560 gccagagccg ttctcgacgt cgagatggtc ataaactaca aaggggccct ttgtagtcgg 4620 caacgttctt gacgtcgaga acgactctgg cggatgcagt gttgggattg aaggttgtgc 4680 ctcccccgag cgtaatcaat ttagattcaa atagagatat tatcgatagt cttgaaataa 4740 atgatactcg tgttattgat aagatccaac tatttgaccc taaaactgcc acaaccaaat 4800 ttgctcgagc cctaattgac agtggtgcca cgcatgaggc aatgagcaag aaatttgcaa 4860 gtgaggcaaa attcgagact tcacccctag accagcaaag aagcgtaacc ggcttcagtg 4920 gtcatgtgtc aatcgtgact cacacaggag actactgtgt gaacaatgaa ttccctccga 4980 ctaccttcat cataaccaaa ttacgcgaca aatacgacgt tatcttagga atgccctgga 5040 tcagaatgaa tcatcacctg gtcgactgga caaacggatg cttgaagaat tctgatagaa 5100 ctgaaattgc gaccgcttca ttggtctcgt tgctgccgac aaaatcctcg atggaccaca 5160 ttacaaggcc aaagaggaac gctaggaaca gtagcaaggg ggtgcaagtc ttgaatgact 5220 tatctacacc cccgcaatgt gagttcgatt ttaacattaa tcccatgact cctgaatcag 5280 ctggcgagtg tgtctctctc ctagaaaact caactcccac caaacaccaa catactggca 5340 ttagggacga catcaaaatg gaaaagctgc caagtgaatc tctacaacta ccgaaaacaa 5400 cctcaaagga 
ccacaaagga gacctggaga ggaacgctag ggttgtagag atgggggtta 5460 agtccgcaat agacttgtca aaccccccgc agagtgagtg cgataacctt gtcagacaaa 5520 acatccatag agtagctagc aagcaaagat ctcctcctat aacatttgct aaagcaggta 5580 ctacaatacg acgtctattc ggtcggcata ccatggctaa acccacaaca acaatgattg 5640 attcagcaag tgcgtcatgg aacttattgg ccaaactagc agttgaagcg tcaaaaggaa 5700 aagacgacaa gccagcatct gaattagttc caccgtgcta tcatgaatac attgaaatgt 5760 ttgaaaaggc taaatcaaac gtacttcctc caagacgacc ttacgatttt agagtagatc 5820 ttgtagaagg tgcaacacca caagcaggac aaataatacc tttgtcaccc aaagaatccg 5880 aagttcttaa tgaaatgata gaaaaaggat tagctaatgg aaccctccgc cgcacaactt 5940 caccttgggc tgcaccagtc ctctttacag gaaaaaagga tggaaattta aggccttgtt 6000 ttgattatca aaagttgaat gctcttacag tcaaaaaaag atatcccctg cctctcacaa 6060 tggagctagt tgatagtcta ttggatgcag atcaattcac cagcctcgat atgagaaatg 6120 gatacaacaa tctgagagtt tgagaaggag acgaagcaaa actagcattc atctgcaaag 6180 aaggtcaatt tgagcctctg actatgccct ttggtccaac tggagctccg ggtttcttcc 6240 aattttttat acaggatatt ctcaaagcgc atattggaaa ggacgtggcc gcataccagg 6300 atgacatttt aatctatact aagccaggag ttgatcatga aaaagttgtc aaagaagtac 6360 ttgatatatt gaaaaagcaa aacgtgtggc tgaagcctga gaagtgcaag ttcttcaaga 6420 aggagattgg gtacctagga ttagtcatat ctcgaaatca aataaggatg gatgagacta 6480 aggtgcaagc tgtgtgtgat tggccagcac caaaaaatct atcagaagta ttaaaatttt 6540 taggcttctc aaatttctat cgaagattca tacatcattt ctctaagata gctagacctc 6600 ttcatgaatt atcacaagca gatgtcaaat ttgaatggac agatgagaga aatcaggctt 6660 ttgaaagttt aaagaaagcg tttacaactg caccggtatt aacaatagcg gatccttatc 6720 gaccattcat actggaatgt gattgctcgg actttgcact aggtgcagtc ctgtctcaag 6780 tatcagctga agataatcaa ttacacccag tagccttctt atcacgttct ctaataaagg 6840 cagagagaaa ctatgaaata tttgataaag aactattagc ggtcatctca gctttcaaag 6900 agtggcgtca gtatttagag ggaaatccta atagactgaa tgtaatagta tacaccgacc 6960 acaaaaacct ccaatcactg atgacgacaa aagaactcac gagaaggcaa gcgcgttggg 7020 ccaaaacgct gggtaatttc gactttgaaa tcagatttag accggggaaa gactcaagca 7080 aacctgatgc cctatctcgt 
agaccagatc tcaaacctaa ggatcatgaa aaactatctt 7140 ttggacaact cctcaaacct gaaaatttac caaatgatgc atttatagaa ttgttagacc 7200 ttattgactc atggataatt gatgaatcca ttggaataaa ctctttggac catgtcgact 7260 cgtggattgt gaatgattca attagcatgg atgaaatcga agcagaaccg aaaagaaaag 7320 atgtatggag cgatgagcgc ataattcaag aaatcagatc aaaatcatgt caggacaaaa 7380 gaatcatgga cgtagccacc ttatgtaatg aactaccaaa tgcaaaattg atgaaagatt 7440 actcttacac cgatgggata ctgtacaaga aagaaaagtt agtagttcca aatgtaaacg 7500 aaatcaaact gcaaatacta cgatctcgcc acgacagcct actggctgga cacccaggac 7560 gcatgagaac actaatgttg attaaaagga cgttctattg gccgtcgatg aaagcataca 7620 tcaacaaata cgatggatgt caatcatgtc aaagagttaa gacccgcaca accaagccgt 7680 acggtagcct tcagccactt ccaatacctg ctggtccatg ggtagatatt tgttacaaca 7740 tgatcactga cttaccaatc tccaaagagc atgactgcat tctgacagta gttgacaggt 7800 ttacaaaaat ggtgcacttt ctcccttgca acaagaccat gacctccgaa gaacttgcaa 7860 cgctaatgtt aagaaacgta tggaagattc atggaacacc gcgtacaatt acttctgata 7920 ggggcaatat ctttatctcg aagctgacaa aagacatgaa tcgcaaattg gggatacaaa 7980 cgcagtcttc gacagcttac catccgcaaa cggatggtca atcagaaatt actaacaaag 8040 ccgttgagca atttctaaga cactttacag cgtataagca agatgactgg gtcgacctac 8100 tcccaatggc cgaattctcg tacaataaca acttacacgt atccattgga atgtcacctt 8160 ttcgagcaaa ttatggattt gacgtcagct tttcaggaac tcctacgcaa gaacagtgcc 8220 taccagcggt tgaagcaaga ttctctcaat taaatgacat tcacaatgaa ctgaaagcgg 8280 caatgaaaga agctcaagaa gccatggctg aaaattatga tagaaaagta ctggcatcac 8340 cgacgtggaa cattggacaa gaagtatggt tgagcagcaa acatgtctca acgacccgac 8400 ccactgctaa gttttcacat aaatggctag gcccgttcaa aattgaacaa agagtctcca 8460 ctaacacata taaattgatt ctgccaaaag aaatgcagga aatacatccc gtctttcatg 8520 tcaatctact acgagaatat gtcaaaagtc agatagaagg tcaagaagac atccctccgg 8580 acgcaataat gatacaaggc aatgaagaat atgaacttaa tgaaatttta aacaggagga 8640 aaaggagagg gaaaatagag tatctggtta gttggaaggg gtatggtcct aatcacgatt 8700 catgggagcc tgaaaaagcg ctaggtaatg caaaggaagt ggtcaaagat ttcaacaaaa 8760 gatatcctca agcagaagaa caattcaaaa 
ggacacgggg aaggaagtga gggtgaaagc 8820 ttttttccca atgggttttt aatgctcacc tgtggaaaga tatctaaccc atcaagaggg 8880 ggtgaggata tgaaggggga gtggt 8905 // ID Gypsy-31_MLP-I repbase; DNA; FNG; 6308 BP. XX AC AECX01000183; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-31_MLP_; KW Gypsy-31_MLP-LTR; Gypsy-31_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-6308 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000183; Positions 16074 9767. XX CC Positions [5238-5717] - Integrase core CC 'CCTT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 325..2298 FT /product="Gypsy-31_MLP-I_1p" FT /translation="MPSKNIDNVTPIEVTPDQWYKLCGHLPALAEGETYSP FT EVIAGFSDEQVMINETIESLAAGTCLCLDRYSQAILGVNALNFPHVLPRSR FT SRANTPAVNSPVGATETTAATSKGGSSKTPSAANTPLPLSDPGSTKTKSKP FT LSAPGGILNTTPMSYSAAIGTISPATGSAGMQNSGRCSTSGITTGRTITNQ FT PDTTRSANTFRKVDSPGFAPSTALGLTLGSGPPRNPKPDAPHLNTPRAKAF FT TSAVDSIMADKGKRSDRMEIEPSPLPRDGTGIDNSESRQSDITVTMRPDQL FT PGWLAYQESVARKIKAEWDAEILRGEDVEMSPPTRHDRASAMPSTSQRPDP FT TAGHFSPPGSPPPSSPASGARDAYQRPPPMAGRSIFENDAYRAFKAMNAKK FT APRFSGTNHKDAATWCASTAKSLSILGIDRSIWHRVGMQLVEHAALTKIER FT VIDTEFAPRNWEEFVTLLKKKFPSTLTVEALFTRFGNFAFRPNEKAVEAYG FT RFRVIQNDADTIGLKYEVEAMWIKKLPPKLRELVQTQVDQAREMGLKMNME FT QVVECTYNRDARIQENRETLYSTSDNSRKHSRNNSTKKPNRPSSSKRRTNN FT SSSISTDPPSKRCYNCGKDGHIFGTLSNPICPDAPTSRTMNYFKTHKKPEK FT DSASGPSKD" FT CDS 5322..6275 FT /product="Gypsy-31_MLP-I_4p" FT /translation="MIDHCTGWPLAIPMKTATSANVAEALLHQLIEVYGVP FT SEILSDRGLNFLSKEMDAFYKGCGIHKLNTSGYHPRTNGKCERFNGLLEQA FT 
LFKINKTLDPSRWPEYLAQALFAVRINKSTVTNFSPFELLYGVSPRLISDP FT AKLRPLENSPDPSSNKDRLEKLRSSRIKARQAQEERAKKNKSRFDSKFEDS FT PSKKASSRIVSYAIHDKVKLRNETRSKGEPAWYGPFEIFDNLGNNVYLLLD FT HNTNPFPHPISGNRLKPVHIRDHTLGDSWALPPRLLQEINREDIKISKTIL FT TAAKKLSKTQKATTSKIRIVGRYATT" FT CDS join(2616..4118,4122..5072) FT /product="Gypsy-31_MLP-I_2p" FT /translation="MPDLVVKDHLSVERPVNVFGKDTCGLTDAAPPAFSPA FT SEPSRQVSGDCGRSSAEILAAIESDSGLPRVDFVLTQNRMAVRVLLDTGAR FT TASYVLKSYVEQCGLSSQRVKKTTSISGVWGDSKLITHSSNVPITLGGLKF FT VHKCRIAPLQSYDIILGMDWISQYAVSTDWETWVWLLKDVKGNTVNFSPAD FT VDSPSESVHLIGGDPQLDEEDMPINRSQMRRFLRHPNMEAWVVCGSDRLDS FT MDEVGGPREELPSELPKISVDSPKLRAAAKALVDRYCTLFDPIDKAPEKEH FT VIEHLIDTGDSKPVSQPVRRMSPLLLSELQTKLATLHKNGFIRPSTSAWSS FT PVLFARNASGKLRFCVDYRAVNAITKRDRHPLPLIQDCFDNLHGAVRFLKL FT DLQQGFHQMKLALDSIPKTAFSTRYGHYEWLVMPFGLVNAPSTFQRMMSDI FT LRPYLDKFVQVYLDDILIYSKSDDEHITHVDLVLDALRKADLKVSGGKSEL FT FADEIFVGHMVSKDGIRPMSDKIDAIKAWPRPENVHDVRSFLGLAGYYQRF FT ISGFSKVASPLHELTAGNVTKRARIEWTPACEVAFITLKERLVSAPILIMP FT DPAKPYVIETDSSDFAVGAVLLQEGSDGKLHPVAFESYKLNAAQRNYPAQE FT RELYAIIHAWRKWRNYVEGAVADTVVRTDHASLTYLSTQVLPTCRLLRWIE FT EFGEMTIRVKYKKGSTNFVPDALSRRSDHTILAMYEPRNGLEDDSDWPLVL FT PYVKSDQRLPDWVTTTAIDTAVRRSHEFVWNANDETLIWTGGGDEPKESSP FT FIPFYQRADLMNTFHA" XX SQ Sequence 6308 BP; 1614 A; 1605 C; 1434 G; 1655 T; 0 other; ctggtagcga gagctaaaga ttctgaaatt cgtactgacg aataaatttt tttactatct 60 tttgattttt ttggaattat tataattcgt gtttacagaa acagttattt ccgtaaaatt 120 catctttttg acgcttgacg cctaagaaat tcgaattgta acagcgaata atttcaagaa 180 atcgaaagta agtaaataat tccgtataaa atttgttgaa attttttttg aaaaggataa 240 ggtgctcaat ttttttgatt atttatgttt ggtcgcgtgt ttcttctctt tgattaagaa 300 aaaacagcca taaataaaac aagcatgcct tccaagaata tcgacaacgt cactcccatt 360 gaagtcacac ctgatcaatg gtacaaattg tgtggacacc ttcctgcttt ggcagaagga 420 gaaacgtact caccggaagt catcgccggc ttcagcgacg aacaagtcat gattaacgaa 480 accattgaat cgctcgctgc cggcacttgc ctatgccttg acaggtacag tcaagcgatc 540 cttggagtca atgccctcaa ctttccgcac gtcctaccaa gatctcggtc tcgtgcaaat 600 accccggcgg 
tcaattcacc cgttggcgcc acggaaacga cggcagctac ttccaaaggt 660 ggttcctcga aaaccccttc tgccgccaat acaccattgc ctctctctga tcctggctca 720 actaaaacaa agtcgaagcc actttccgcc cctggtggaa ttttgaatac cacgcctatg 780 agctattccg ctgcgattgg caccataagt cccgcaaccg gctcggcagg gatgcagaac 840 tctggtcgat gttcgacatc tggcatcacc actggtcgca ccattacgaa tcaaccggac 900 accacccgtt cggccaacac gtttcggaag gtcgacagcc cgggctttgc accttccacc 960 gccttgggtt tgactctcgg cagtgggccc ccgcggaacc ctaaaccgga cgctcctcat 1020 ctcaacactc cccgtgctaa ggccttcaca tctgctgtcg atagtatcat ggcagataag 1080 ggtaaacgct ccgatcgtat ggagatcgag ccttctcctc ttcctcgtga cggcactggt 1140 atcgacaatt ccgagtcccg ccaatccgac attaccgtca ccatgcgacc tgatcagctt 1200 cctggttggt tggcctatca ggagtctgtc gctcggaaga tcaaggccga atgggacgca 1260 gagattctca gaggcgaaga cgttgaaatg tctcctccta cccgtcatga tcgggcctcc 1320 gccatgccca gtaccagtca acgcccggat ccaacggctg gtcatttcag tcctcccggg 1380 tctcctcctc ctagctctcc agcatcggga gctagggacg cttatcaacg tcctccacct 1440 atggccggtc gttcaatctt cgaaaacgac gcataccgtg cttttaaagc catgaacgcc 1500 aagaaggccc cgcggttcag cggcacgaac cacaaggacg ccgcgacttg gtgcgccagc 1560 acggccaaat ccttgtccat cttgggtatc gaccgatcca tctggcaccg tgttgggatg 1620 caactggtcg aacacgctgc tctcaccaaa attgagcgtg tcatcgacac cgaatttgcc 1680 cctcggaact gggaagaatt cgttacttta cttaagaaga agtttccatc caccttgact 1740 gtcgaagctt tgtttactcg tttcggaaat ttcgcttttc gcccgaacga gaaggctgta 1800 gaagcctatg gtcggttccg tgtcattcaa aacgacgccg acactattgg actgaagtac 1860 gaggtcgaag ccatgtggat caagaaatta ccacccaagc ttcgcgaatt agtgcagact 1920 caggttgatc aagctcgtga gatgggtctc aagatgaata tggagcaagt tgttgaatgc 1980 acttacaata gggatgccag gatccaggaa aatcgtgaaa ccttgtattc cacctctgac 2040 aactctcgta agcattctcg aaacaacagc accaagaaac ctaatcgacc ctcatccagc 2100 aagcgcagga ccaataacag ctcttccatt tccaccgatc caccttcgaa gaggtgttac 2160 aactgtggca aggatggcca cattttcggt actttatcta atccgatttg tcctgatgct 2220 cctacttccc gcactatgaa ctacttcaaa acccacaaga agcctgagaa agattctgcc 2280 tccggaccat caaaagatta 
ggcgaagaag gtatagctcc cctttattcg ccctgtctct 2340 ctttgtcatc ccttattgcg tttgaaaatt gcaatgaaac tgtgccactc gtttctgatg 2400 ataaaaacct tgagctaaat gtgccgttga tagagcacac aaatccactt cctttggaag 2460 tcggtcatga gaatttgttt gatgagaatc agtgcccgta tgtgtcaaat gacagacacc 2520 aagattcgct taataacgac gagcatcttg ccgaaacgag ctctgttgat tcaccccttt 2580 cccctccact tgacatctgt gtcggcctct cgaagatgcc tgatcttgtg gttaaagacc 2640 atctttctgt cgaaaggccg gtcaacgtct ttggaaaaga cacatgcggt ttaactgatg 2700 ctgctcctcc tgcattctca ccggccagtg agcccagtag acaggtatca ggggattgcg 2760 gccgtagctc ggccgaaatt ctagccgcta ttgaaagtga ctctggattg cccagagtcg 2820 actttgtttt gacacaaaac agaatggcgg tgcgcgtgtt actcgacact ggcgcacgaa 2880 cggcttctta tgttttgaag tcttacgtcg aacaatgtgg tctttcttca caaagggtga 2940 agaagaccac ttctatctct ggtgtttggg gggatagtaa attgattact cattcgagta 3000 atgttccaat aacacttgga ggacttaagt tcgtccacaa gtgcaggata gctcccttgc 3060 agtcgtatga tatcatcctc ggtatggatt ggatatctca atacgccgtc tccactgatt 3120 gggagacgtg ggtttggcta ttaaaagatg taaaggggaa cactgttaac ttttccccgg 3180 ccgacgtgga ttccccttcg gaatcagtcc atctgattgg cggagatcct cagcttgatg 3240 aagaggatat gcccatcaac agaagtcaga tgcgtcggtt cttgagacat cctaatatgg 3300 aggcttgggt agtctgtggc tcagatcgtc tggattccat ggacgaagtg ggtggtcctc 3360 gagaggaact cccatcggaa ctacctaaga tatcggtgga ttctccgaag ctacgcgcag 3420 ccgcgaaagc tttggttgat agatattgta ccttgtttga tccaattgat aaggctcccg 3480 aaaaggagca tgttatcgaa caccttattg acactggtga ttcgaagccg gtttcgcaac 3540 cagtaaggag aatgtctcct ttactcttga gcgaattaca aaccaagctg gccactttgc 3600 acaagaatgg gtttatacga ccatcaactt ctgcttggtc ttcgccagtt ctatttgcgc 3660 gtaatgcttc tgggaagctg cgattttgtg ttgactaccg cgcagttaat gcaataacta 3720 agcgggaccg tcatcctttg cctttgattc aggactgttt cgacaatctt catggggcgg 3780 tcagattctt gaaattagac cttcagcaag gcttccatca gatgaagctt gcccttgatt 3840 cgatcccgaa aaccgctttt agtactcgct acggtcatta tgaatggctt gttatgccgt 3900 ttggactagt aaacgcacca agtacttttc agcgcatgat gtcggatatc ttacgtccat 3960 accttgataa attcgtacag gtatatttag 
atgacatatt gatctattct aagtctgacg 4020 atgaacatat cactcacgtt gacttggtct tggacgccct ccgcaaagct gatttgaagg 4080 tcagtggagg caagtccgaa ctctttgccg acgaaattta gttcgtcgga cacatggtgt 4140 ctaaagatgg gattagacca atgagtgaca agattgatgc aataaaggca tggcctcgcc 4200 ccgaaaatgt tcacgatgta cgatcatttt tgggactggc aggttattat caacgtttta 4260 tatccggatt ctctaaagtt gcgtctcctc tgcatgaact tacggcagga aacgttacga 4320 agagagcccg gattgaatgg acaccggcct gtgaagtcgc tttcataacg ctcaaagaac 4380 gtcttgtgag tgcgccaatc ttgataatgc cagaccccgc gaagccttac gttatagaga 4440 cagactctag cgacttcgcg gttggtgctg ttctactgca ggaaggctcc gatgggaagc 4500 ttcatccggt agcttttgaa tcttataaac taaatgccgc ccagcgtaat taccccgccc 4560 aggaacgtga actatatgcg attattcacg cctggaggaa gtggcgtaac tacgtggagg 4620 gcgctgtggc agacaccgta gtacgtacgg accatgcttc tctcacttat ctttccacgc 4680 aggttctgcc gacttgccgt cttcttcgat ggattgaaga attcggcgag atgacgatcc 4740 gtgtgaaata taagaagggc tccaccaact ttgtgcctga tgccctcagc cggaggtcag 4800 atcacactat tctggccatg tatgagccca gaaatggact cgaagatgat tcggattggc 4860 cattggtatt accttatgtc aaaagtgacc aacgccttcc cgattgggtt acgaccactg 4920 caattgatac tgccgttcga cgttcacacg aatttgtgtg gaacgctaac gatgaaaccc 4980 tcatttggac tgggggtggt gacgaaccaa aggaatccag ccccttcatc cctttctatc 5040 aacgagctga tttgatgaat acctttcacg cttgatatgg tcatagggga agggacggaa 5100 ccctcagcct cctccataac cggggttggt ggcctggttg ttacgccgat gttgaatctt 5160 tcgtcaagca ctgtcctgct tgtcaaattt tcgactctcc ggaccttttt caggagacag 5220 atcgacaaca ctctttgcct tccgttgcac cttttgaacg ttgggcagtc gactttattt 5280 cccttccaga aagcaaggag ggatttaagt ggatactaac aatgattgat cattgtacgg 5340 gttggcctct cgccataccc atgaaaacag ctacatccgc aaatgtggcc gaagccttgc 5400 ttcaccagtt gattgaagtc tatggcgttc catctgagat cctctcggat cgtggactta 5460 acttcctatc aaaggaaatg gatgcgttct ataagggttg cggtatccac aagttaaaca 5520 cctcaggata ccaccctcgg accaacggga agtgtgaacg cttcaatgga ctcctggaac 5580 aggctctctt caagatcaat aaaaccctgg acccctcgag atggccggaa tacctggctc 5640 aggcgctgtt tgctgtccga attaataaaa gcacggtcac 
caatttttcc ccatttgagc 5700 tcctctacgg agtctctcct cgcttaatca gtgatccagc caagttgcga ccactcgaga 5760 actctccgga cccctcttcc aacaaggacc gattggaaaa gcttcgttca tcccggatta 5820 aggctcgtca agcccaggaa gaacgtgcga agaaaaacaa atcgagattt gattcaaaat 5880 ttgaggattc cccttcaaag aaagcttcat cccgtatcgt cagctacgcc attcatgaca 5940 aggtcaagct tcggaatgag acacgaagca aaggcgagcc cgcttggtat ggtcctttcg 6000 aaatttttga caaccttggc aacaatgtct acttgttgct cgaccataat accaatccct 6060 ttccacatcc tatcagcggg aatcgactga agcctgttca catccgcgac catactctcg 6120 gagactcttg ggctttgccc ccacgtcttc ttcaggaaat caatcgtgaa gatatcaaaa 6180 tttcaaaaac catcctcacc gccgctaaga aactatcgaa aactcagaaa gcaacaactt 6240 caaagattcg aattgttggt cggtacgcta cgacatagga atgccatgtt tttttagggg 6300 ggggatgt 6308 // ID Gypsy-86_MLP-LTR repbase; DNA; FNG; 231 BP. XX AC AECX01002125; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-86_MLP_; KW Gypsy-86_MLP-I; Gypsy-86_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-231 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01002125; Positions 19118 18888. XX SQ Sequence 231 BP; 58 A; 48 C; 55 G; 70 T; 0 other; tgtaaggtgt tacatgtaga cttatgggtt agacatatgg acttagagag ggcgccggat 60 tacttgtact cacaaacgcc ctagctccat gaagatcctt tctgcaccga atacccttcg 120 gatggagcta gttgtcggat ttcggaatac atagttgggt ttttagatta actttggaac 180 cttgttcccg tcagagccaa ccttagtgaa gatcctttcg aggatcttac a 231 // ID TCN2-LTR repbase; DNA; FNG; 966 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 30-MAR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. 
neoformans LTR retrotransposon - LTR consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW TCN2-LTR. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-966 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-966 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-966 RA Gentles A. and Jurka J.; RT "C. neoformans LTR retrotransposon TCN2."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX SQ Sequence 966 BP; 174 A; 298 C; 159 G; 333 T; 2 other; tgtaagaggc tagcctccga cccagttgac ccacatctca tgaggtcacc actttcccca 60 gcttctaatt ttcgagccgc gtttcaattt gcttccgctt ctttcctcca agttgcgcgc 120 gtcaaattgc atccgcgctc ccttctccca ggtcacgtgc tttatttttc ccctgtttct 180 ccgagccgcg tctcaattta cttccgctcc tttcctccaa gtcgcgcgcg tcaaattgca 240 tccgcactct ccctcccagg tcacgtgcat cattttactt ccgctttttt ctcttcttgt 300 tttctcttcc atttctcgtc ctaagtccga gtttcttgtc cgatttttca tgccacaagc 360 cccggtgttt accggagctt tggtcgcgac tggtttcgag agcgaaggca aaagtgcatg 420 gaaaggttga cacacaaatt cctttattct tttatccttg ttgtgttcaa ccttttcctt 480 tcttccgact ttacagtcac aagtaccttg ccttctcctc tgtaagcaga gctctctact 540 aattaccttc tcccctcaga ccccctggct aatcctgatc agtgttccta tcaattcttc 600 atcttgtctc ctttactagt cgctgatagg acataatcag gtaagcttcg ggctctttgg 660 gtttcacaca aaatttcctt tattcntttt atccttgttg tgttcaacct tttcctttct 720 tccgacttta cagtcacaag taccttgcct tctcctcttg ttcctatcaa ttcttcatct 780 tgtctccttt actagtcgct gataggacat aatcagcata tgcatttggg atcttgtttg 840 
cttctctcca gtccgaagtt aagcctcgag ccctgccagc cctaagagcc caccagttct 900 ctccaagcct tgtccctccg agctagcccc aacagtncag tgaaggtcta ttgtccggac 960 cttaca 966 // ID Gypsy-70_MLP-I repbase; DNA; FNG; 5986 BP. XX AC . XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-70_MLP_; KW Gypsy-70_MLP-LTR; Gypsy-70_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5986 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR [1] (Consensus) XX CC 'AAAT' target site duplication CC LTRs are 100% similar to each other. CC This is a reconstructed sequence (an insertion of a CC non-autonomous DNA transposon deleted). Therefore, it is listed CC as consensus. CC The original coordinates: Positions 48279 54167 Accession No CC AECX01001591. 
XX FH Key Location/Qualifiers FT CDS join(500..3757,4026..5744) FT /product="Gypsy-70_MLP-I_1p" FT /translation="MDEQNGFVQIPAEDIRLLRAQIQQTNIDSQNTAQAHA FT QALSELWEQYQQAYQRQALELADTKNIIQQLQTNQQHTYNKKPDAFRGEPR FT KFEGYGDNAETWILELESNFRIHHYPESEWGEMIGAYLDEESKMYWLSERN FT ATGGIITNYQEFRRKFLQKYNFSLTMDEIQEKIKLLYYRNNIHDYILQFRK FT LSCQLPMDKYCFFDRKFAFLDKLPANIREWANRPEIVDKEDMELIYSAARE FT RDRHNKVNNRSFKTPESKPTFERKFSRPFSRNHYHRSSNFGKPPAVTNSSG FT PIMDLDLMDKEKMKVTSQTECYNCHGHGHVARNCTNSKKPSKPNFKKPSYS FT NGDRRKQSLLMIDIDPRTSHGMDTTPLIGPDGTMMKASKESSLNLDDPDLD FT GREDLISKVKAYKETLKNPERAEELKKERDALTKLFQEQAEDLSYRCCKSC FT LIFGNEDSPETISKGIEYFNLDAKQVNNLMKRPLEEDSLGEGTSENPILVH FT SSPPLTSGYSGYLGLPIPGYEQRSPSPELANIDMSNLPFPTTLPTDEKAKM FT TPETESEQCLHLLPLGERTFSYPEVKMDGRRIDGFGFTNTPFHSPLGERTR FT EELGERELEHVKSWNKIMNHSKKRPRWTGYAVSGPKDLGDYDGEAILTLEE FT GGLYDKSLQLNIVDLATHSDSSSSLPTYTSHFRNYPLATILDTGAAANYIS FT EDKIKKMIELHPNRIKLEETQGQKVRLANGAHETSTKTATFELEIDNCNNG FT GFFRFTIKAYVLPLPNVELILGMPWFKEHKPKINFKNMEYEIEKNSHCHNF FT IPRPNGSNLYTMAVDALHSILEKAARDTAPECFEEEIKQVERKHKHFIDTG FT DSTPIKTHGRPLTPPEHQLIGQFVEEGLEQGIIEKTDSPWSSPLLLVKKPD FT GLTRVCVDYRSLNKVTRKNAYPLPRIDDAYQFLSGAKYFTTIDLKSGFWQV FT PMAPQDKEKTAFTCRQGHFQWSVMPFGLCNAPAMFQEMMNDILRDVIDKFA FT LVYLDDVIIFSRTPEEHLEHVKRVFEILKSNRLVVSPKKCQWGRNSLLFLG FT HIVDGSGIKTNPEKSTKSLNGHNLITSPRTDRSSKTGSFDYALYQKNLKNN FT NLRPIAFESRKLSKTEQNYSAQERELLAIVHGLKHFRGYIEGSPVLVRTDH FT ESLKHFKTQKFINRRLACFVDEIEFFDTHIIYRPGKEQLAADSLSQKPNTK FT FDLNPPEVAQPLFNMDGQKDPYKVLVEYKLKLLAGVDPATIGNGLFRMIQN FT EMVLYEDENLSTIPRIVATNEGRAESLAFQIHKDLGHRNVEDVITAVKKRY FT WFPCLSQTVRNSVSLCAACQVHAAPTKQQTLPIQMITRGKPFRKWGLDFVG FT PLVKTTNNHQHLVTAIDYGTGWAYAKPLKRTSATAAIDLLKEITLNHGLPV FT EITTDNGSEFISREFREYVQEMGILHITTTAYHPQANGLVERFHGTLLNAL FT RKMCSPYNQVRWDEYLNSCLFAHRASYSHSMKASPYFMAYGSEARLPSEKV FT SVNFNNSIENLELIHKQRNLDIRKHASRQDELIEDLNSRARERNTRTEETY FT TERSLKTGDLVLRSFEGQPSKLHPKWDGPFAVRECYPNGTYSLMTANGHIL FT NAKVNGARLKLYKGRSAEFFYASRELHKRDKRARRISATRC" XX SQ Sequence 5986 BP; 2056 A; 1448 C; 1158 G; 1324 T; 0 other; aataattatt tgttagtaaa aattaaccct 
tttccttttt ttttctaaaa gaaacacctt 60 tcccaaacaa accggagaaa agaagactcc ctactaaacc tgtgacctcc tcttctcaaa 120 acagtgctca acaacaccaa tcaaattcaa tctcgaaaaa aaacccaaac agcactaacc 180 ctcacaggct cgatatctta aagtcgtgag ttattctcaa attccttagc tagttatttc 240 gattatatct caaacgactg cttctcttcg attaaaagaa aaaaaaaagg acagactcaa 300 aagtcctcaa aacatacaac taacaaacca aacaaataaa acgcgcacca cctttttagg 360 tatcaacaaa gtaacaacta gaagatttta atctttataa agtacgtaca ttttttgtag 420 actctgatca agtcccacga aaaattgtta atagacaatc cgttattttt ctaaaaagaa 480 cagaaaataa aataaaacca tggatgaaca aaacggtttc gtacaaatcc cagcggagga 540 catcaggctc ctacgagccc aaattcaaca gactaatatc gactcacaga atactgccca 600 ggctcatgca caagcactct cggaactttg ggagcaatac caacaagcat atcaacgcca 660 ggctttagaa ctcgccgaca ccaagaacat catccaacaa ttacaaacta accaacaaca 720 cacttataac aaaaaacccg acgccttccg cggggaacct cgaaaatttg aagggtacgg 780 cgacaacgcg gagacctgga ttctagaact ggaatctaat tttcgcatcc atcattatcc 840 cgagagcgag tggggagaaa tgatcggggc atacttggac gaggaatcca agatgtattg 900 gctctccgaa cgaaatgcca ccggaggaat catcaccaac taccaagagt tcagaagaaa 960 gtttcttcaa aaatacaact tcagtctaac tatggacgag atacaagaaa aaatcaaact 1020 actatactac cgtaacaaca tacatgacta catcttacaa ttccgaaaat tgtcttgtca 1080 actacccatg gacaaatatt gcttcttcga cagaaaattt gcatttttgg ataagctccc 1140 tgctaacatt cgagaatggg caaatcgacc cgaaattgtt gataaagaag atatggaact 1200 aatctattcc gcagctagag aacgtgatcg acacaacaag gttaataacc gcagcttcaa 1260 gacaccagag agcaaaccca cattcgaacg caagtttagc cgacctttct cgcgaaacca 1320 ctaccatcgg tcatcaaact ttggaaaacc ccccgcggtt accaactcat caggacctat 1380 tatggacctg gacctgatgg acaaagagaa aatgaaggta acctctcaga ccgaatgtta 1440 taattgtcac ggacacggtc atgtcgcacg aaactgcact aactcaaaaa aaccttccaa 1500 accaaacttc aagaagccaa gctattcgaa tggcgatcgt cgcaaacaga gtttgttaat 1560 gatagatata gatccaagaa cctctcacgg aatggatacc acacccttaa taggccccga 1620 cgggacgatg atgaaagcct caaaggaatc tagtttgaac cttgacgacc cggatcttga 1680 tggaagagag gacctaatct cgaaggtcaa agcttataag gaaaccctca 
agaaccccga 1740 gcgcgcggaa gaactaaaaa aggaacgtga cgctctaacc aaactattcc aggaacaagc 1800 tgaggatctc tcataccgat gttgtaagtc gtgcctaatt tttggcaatg aagatagccc 1860 agaaaccatt tctaaaggta ttgaatattt caacctagat gccaagcaag tgaataacct 1920 catgaaaaga cccctggagg aagactcact tggtgaagga acgagtgaaa acccaattct 1980 ggtacactcc agcccaccgc tcacgtccgg atattcaggg tacctaggcc taccaatccc 2040 tggatatgaa caaagatctc cgtcaccaga acttgcaaac atcgacatgt ccaacttacc 2100 gttcccgaca actttaccca cggacgaaaa ggcaaaaatg actcccgaaa cagagtctga 2160 acaatgcctc cacctgttac ccctaggtga gagaacattc tcctaccctg aagtaaaaat 2220 ggacggtcgt cggattgacg ggtttggttt taccaacact cccttccact cacccctagg 2280 agaaagaact agggaggagc taggagaaag agaacttgaa catgtcaaat catggaataa 2340 gattatgaat catagcaaaa agagacctag gtggacgggt tatgcagtta gcggacctaa 2400 ggacctggga gattacgatg gcgaagccat cctaacactt gaagaaggag gattatatga 2460 caaaagccta caacttaata tagtcgatct agcaacccat agtgactcca gctcatcgtt 2520 accaacctat acgagccact tcagaaacta ccctttggcc accatcctcg ataccggggc 2580 tgcggcaaac tatatctccg aagataagat caagaagatg atcgaactcc accctaaccg 2640 gattaagctt gaagagactc aaggccagaa agtcaggtta gcaaacgggg cgcacgaaac 2700 ctcaaccaag acagcgacct tcgaactgga aattgacaac tgtaacaatg gcggcttctt 2760 caggttcacc atcaaggcat acgtattacc tttacctaac gtagaattaa ttctagggat 2820 gccctggttc aaagaacata agccaaaaat caattttaaa aatatggaat atgaaatcga 2880 aaaaaactct cactgccaca acttcattcc aaggccaaac ggcagcaatc tttacactat 2940 ggctgttgat gcactacaca gtatcctcga aaaggcggct cgcgatacgg caccggaatg 3000 cttcgaagaa gaaatcaagc aagtggaacg aaaacacaaa cactttattg acacaggcga 3060 tagtacacca attaaaactc acggacgtcc cttgacacca ccggaacacc agcttatcgg 3120 gcaatttgtg gaagaaggat tagaacaagg aataatcgaa aaaaccgatt ccccttggag 3180 ctcacccttg cttttggtaa aaaaacccga tggtttgaca cgggtatgtg tcgattatcg 3240 ctcactaaat aaagtaacac gtaaaaatgc ttacccatta ccgcgaattg acgacgcgta 3300 tcaatttcta tcgggcgcga agtacttcac cacgattgat ctaaaatccg gtttctggca 3360 agtacccatg gcaccgcagg ataaagaaaa aacagcattt acatgtcgac agggacactt 
3420 tcaatggtca gtaatgccct ttggactttg caacgctccg gcaatgttcc aggaaatgat 3480 gaatgacatc ctccgtgatg tcattgacaa gttcgcactg gtatacttag atgatgttat 3540 tatcttttct aggacacccg aggaacactt ggaacatgtt aaacgtgtat tcgaaatact 3600 aaaatcaaac aggttagtag tctcgccaaa aaaatgtcaa tggggcagaa actcattact 3660 gtttttggga catattgtgg acggcagtgg gatcaagact aacccagaaa aatcaacaaa 3720 atcattgaat ggccacaacc tgataacatc tccaaggtga gaggattcct aaacctctgc 3780 acctattaca aaaggtttat taaaggcttt tcctcaattg cttccccaat ttataagctc 3840 accgagggat ctccgaagcc ggggtcttcg attcattggg gggaagaaca acaattggcc 3900 tttaatacac taaaagaagc actgtcaaga tcagtacccc tccaacatcc aacaccattc 3960 cgcccatttg tattggacac cgacgcgtca ggaacaaaca tcggcgcggt gttacagcaa 4020 gatgaacaga cagaagttcc aaaactggaa gttttgatta cgctctgtat caaaagaatc 4080 ttaaaaataa caacctacgt ccaatagcct ttgaatcaag aaaactatct aaaacggaac 4140 agaattactc ggcccaagag cgagaactgc tcgctattgt acacggacta aaacattttc 4200 gcggatacat agaaggttcc ccagtcttag tcagaaccga ccacgaatcc ctcaaacact 4260 tcaagactca gaagtttata aaccgaagac tagcttgttt tgtagacgag atagaattct 4320 ttgatactca cataatatac cgacccggta aagaacaact tgcggcagac tccttgtcac 4380 aaaaacccaa caccaaattc gacctcaacc caccagaagt ggcacagcca ctttttaaca 4440 tggacggaca aaaagatccc tataaagtcc tagtcgaata caaattaaaa ctcctggccg 4500 gagtcgaccc agccacaatt ggaaacggat tgtttcgaat gatccagaat gaaatggtct 4560 tatatgagga cgaaaattta agcacgatcc ctcgaattgt ggcaactaat gaaggcagag 4620 ctgaatctct agctttccag attcacaaag acctgggtca ccggaacgtc gaagacgtca 4680 tcaccgctgt caagaaacgg tactggttcc catgcttaag ccagaccgtc aggaactcgg 4740 tgtcgctctg tgcagcatgc caagtacatg ccgcaccaac aaaacaacaa accctaccta 4800 tccagatgat cactagaggt aagcctttca gaaaatgggg cctagacttt gtgggaccac 4860 tcgtgaaaac tactaataat caccaacatc ttgtaacagc catcgattat ggaactggat 4920 gggcgtacgc gaagccactc aaaaggactt ccgccacagc cgcgatcgac cttctaaaag 4980 aaatcacatt aaaccacggt ttacccgttg aaatcactac tgacaacggc tcggaattca 5040 tttcacgcga attccgggaa tatgtacaag agatgggcat cctacatata actacaaccg 5100 
cgtatcaccc tcaagctaac ggcctagtag agcgttttca cgggacactc ctgaacgcct 5160 tacggaagat gtgtagcccc tacaatcaag ttcgctggga cgaatactta aactcatgct 5220 tattcgcaca ccgagcatca tactctcact caatgaaagc ctcaccttac tttatggctt 5280 atggaagtga ggcccgccta ccatccgaaa aagtaagtgt taacttcaat aactctatcg 5340 aaaacttgga attaatccat aaacaacgca acctagacat caggaagcac gccagcagac 5400 aggacgagct tatcgaagat cttaattcaa gagcacgaga acgaaatact agaactgaag 5460 agacctacac ggaaagaagt cttaagacag gagatctcgt cctacggagc ttcgaaggac 5520 aaccatcaaa gttgcatcca aaatgggatg gcccttttgc tgttagagaa tgttacccca 5580 acggtacgta ttccttaatg actgcgaatg gccatatctt gaatgctaaa gtaaacggag 5640 ctcgtttaaa actttataaa ggaagatccg cggaattctt ctatgcctct cgggaactcc 5700 acaaacgtga caaacgcgca agaagaatta gcgcaacccg ctgttgatat agaacgagaa 5760 gagttgttag cgagaagaga agaatacctc gtagatctgg caataaaact acataacgag 5820 ttatacgcag atgaagcaaa cagaaatctc agagacttct ccgccagagc tttgggcgct 5880 ctctcttact acttgcacgg cgcgtcggag gtattaaacg acgaaataca acacaatgaa 5940 aaccgtgctg aaaactagga agtttccata cttaaaaagg gggtga 5986 // ID LTR1_CA repbase; DNA; FNG; 282 BP. XX AC AF191499; XX DT 16-MAY-2005 (Rel. 10.05, Created) DT 16-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE C. albicans LTR sequence. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW LTR; LTR1_CA. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-282 RA Goodwin T.J. and Poulter R.T.; RT "Multiple LTR-retrotransposon families in the asexual yeast RT Candida albicans."; RL Genome Res 10(2), 174-191 (2000). XX DR GenBank; AF191499; Positions 1 282. 
XX SQ Sequence 282 BP; 106 A; 40 C; 51 G; 85 T; 0 other; tgttgaaatt aataatggag taatcctagt catgtgatct aacctaacac aaaatgtaag 60 aagagttttg tgtcttggta gtctggacac gcagaatgaa agctctagac gataaaagtg 120 tactttatat aaaagaagac gagaagtctt ctttactaac aagtagtgat atagtagaca 180 actaaagttt ataagaatat atgtacaaca ggtgagttat tatctattaa ttacttaatc 240 atatgctaag acagactctg tgacactcag tacaatctcg ca 282 // ID Gypsy-30_MLP-LTR repbase; DNA; FNG; 856 BP. XX AC AECX01000185; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-30_MLP_; KW Gypsy-30_MLP-I; Gypsy-30_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-856 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000185; Positions 19495 18640. 
XX SQ Sequence 856 BP; 241 A; 162 C; 145 G; 308 T; 0 other; tgtcagccag ggcataagtt gcttagacga cttatctcct ggctaactac ctaaggatgt 60 ttggagacaa acacacaaat gaaaccattg aaatacaaca tcttacaact ctcagtttct 120 ctcgtataaa tctattacaa ctgttttctt tattcttttt cctattttac ataattcttt 180 tacttttctc tttctacata tattcttttt cattatcaaa taaaccaatt aattcacaat 240 actcgcgtga gatgatcttg aattaattga ctttgatatg aactcctgaa tgatgttgaa 300 atttccagtt catggttctc gagtcggatt gggacaccac tcgagttcct actcatctaa 360 aaccaatcca gctaacatgt gggaactgtc agctggaagg tctggaaatg gaaacggatg 420 aaaggacgga ttacaaatga tattctggga taacatcttg gcgaatgagc gatgggctcg 480 gtattacatc aattcaaatt acgtatgtat tcaagtcggg gaattccggg tttacacatc 540 tggatgatat catgggtact tctgtttgtg tgtggtctag ttgtgagaag ggtataaaac 600 ccgcgttgtt ttctttatat tgtgattttt tcttcatctt ctattatact aatctaatca 660 actattagcc taaccagctt acgagttttt atcgcgtcca cctcattcca atatctttta 720 aaacaatctg tttcaatcca tcttggttag tgttgtccac gcgtttataa atttcgatct 780 gatttgtatt ataagaaacc gtaaattctc tatatattga gtttatttcc aggttgaata 840 accgtccttg ttgaca 856 // ID copia-1-LTR_AF repbase; DNA; FNG; 147 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Long terminal repeat of copia-1_AF LTR retrotransposon - a DE consensus sequence. XX KW Copia; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Interspersed repeat; COPIA superfamily; copia-1-I_AF; KW copia-1-LTR_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-147 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-147 RA Kapitonov V.V. 
and Jurka J.; RT "copia-1_AF, a family of copia LTR retrotransposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 52-52 (2006). XX DR [2] (Consensus) XX CC It is a long terminal repeat of the Copia-1_AF LTR CC retrotransposon. It is characterized by 5-bp TSDs. XX SQ Sequence 147 BP; 38 A; 38 C; 21 G; 50 T; 0 other; tgtgaatacc actctctcgc ctacgtttcc tgaataggct tagtgcctag acatctcgcc 60 ttaatggctc atgtaaatat cttcatttct ctctcttagc tagcgaagat agtcagaatt 120 aagtcactca tactattctt actcaca 147 // ID Gypsy-106_MLP-LTR repbase; DNA; FNG; 1765 BP. XX AC AECX01000915; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-106_MLP_; KW Gypsy-106_MLP-I; Gypsy-106_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-1765 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000915; Positions 12205 10441. 
XX SQ Sequence 1765 BP; 615 A; 207 C; 339 G; 604 T; 0 other; tgtggtgagc aagatctcta cttaagcaaa aaatttaaaa attgtgtgaa aattcgagtt 60 tttggtcaaa ggcaatatta ttattgaggt tgttagtcat caaggttatc aaagttgatt 120 tattattttc atttcattat ctttcagaat atatatttac ataagcaatt gataagatca 180 tttaagagca aagaaaaatt cgaaaagatc aataaaaata gtataagagt gaaaatcaaa 240 ttaagagaat tagtaagaag ggatttgagg aagaagaaca tttagtttgg atctttagga 300 atggaaactg ttatgaaaag aatgaaagaa gaagaagatt attaagcaag ttggagaaaa 360 attcgaaaat atgggtatga aagcttggaa agtggttttt agcttattat tagaagagat 420 attggttcag gaaggtttta ttgtgaaaga gaaggatcac attagtcatc aacaacttta 480 atcatgagga ttgataaggg aggtgtaaag ttattcaaga aaagggaaaa ttcgaagaag 540 ggtactatgg tattaattcc tgtgtagtga tgtcattgat ggattgattg cgccaagagt 600 ttttcagagt gatttttatg ctgtgtgatt gattggagtg ctgtggaatg atcttttgat 660 agaaggaaaa tcattattag acgtgaaaat agtcaaaatt catcaaaatt ggaggttttc 720 agttaattac atatggtaga acttgtcaga attgggtgtg tgatagttca ttcgacgcag 780 aataatcgat ttcataatgt acaatcttta aatcattagt aacggaggag tagtaaaaaa 840 gttataaggc aacaaagaaa ctggagtgaa tttagaattt ggaggttaca tgtgacattt 900 acatgacaaa accttatttt tgcaatcttt acaactcaat accttattca tcacacatca 960 gaaataggtg atataggggt gataaatgat aagaatcacg agtgctatca aatgagctat 1020 agtaatccaa aattgaacaa ctttacaaaa aattcgtaat tccaaaaatg gatttcagct 1080 atgaatatta ggaaaaaatc acctgtattc atcataagaa gatctgttca tcatttattc 1140 acatcaattg atcaaattca cgccttggat cattacaaac aataatattt atgacatcat 1200 atgattactt tggaggatat tctattgatt ttgacggttt ttgaggtttt ttggaggttg 1260 ttgaccattg tggaagttgg ttggaaggtt atatactatt gttaaggatt ttacgggatt 1320 ttgttcccct ttttactcat cttatacttt tgttagagtg tttttaatca ttattaaatt 1380 actttagcag tactaagttc actaaatagc ccctttacat caaaaggaga gatttcttta 1440 cgtgtgataa gactacaagt cttcatttgg aataacaatc aaactgattg ttgcttgcta 1500 gtaattaaaa ttgtgttttg ttgtgttcaa attggaataa cactacaata gtgttgagat 1560 ttctttattt ctaaaactca cgggaataat acttgaatta agtattgaga catatttctc 1620 tggtgcctta tcccaaagaa ttcagcaaaa 
ccccgtgttg attctttcaa agaggcagtc 1680 ttaggcaact tctactagtt agttatatta gtagattatc cctctttaag ctctgtaaag 1740 ctcttgttga gctttaagtt ctaca 1765 // ID Gypsy-7_CCO-I repbase; DNA; FNG; 15274 BP. XX AC AACS02000003; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-7_CCO_; KW Gypsy-7_CCO-LTR; Gypsy-7_CCO-I. XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-15274 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000003; Positions 3361681 3376954. XX CC Positions [7819-8313] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 1670..3127 FT /product="Gypsy-7_CCO-I_2p" FT /translation="MEYEISMDPRVDHLTTQQLYKNPRVPQLWKASHLVLE FT DDTAAEVWAYIERMETMFEQASVVLDRHKKKALVDFVKKVDLKDQWSRLTQ FT YRDGTYAQFKAEILSYYPGIKELQEGSLDKLEKIFNRYVRIKMHEEQKLMD FT LIAKVRTEVSKIRETERKRKIQLLSERTLVQDFLACLDRNFRMAVQVRLST FT ADQRRILAELKRESRANARTLDKRLKRMEAPKGPKVEPEEGKEGGPSTAPE FT GDLGTEVNPIQVEDDGDDDFDGELYDSSGLERVMPIVTKYTLDDLLYVAQV FT VAREHSNSSVFGFTTDLSDTGTADQDVINTAVAETLKRLGIKPKREEQHVQ FT DAGPPRGSQNMPPGMYQWSGPPKSDNDGFKLILERLDQQEAKWKDTLTGLR FT KEQKEQLTEFETRMEAKIDKRVDRPFSAIQLAQLVREETTLVNPTFRREGG FT TTRTRVPQAEGTSTFRVDQVNRATAKACGEMLTLVVESDEE" FT CDS 3440..4642 FT /product="Gypsy-7_CCO-I_3p" FT /translation="MTLRNEVSQLETEKAQSLGSAPNDTSAEAPAAADEKA FT AMWNALPSVLPKFASTRGFSMTTRKAAAAASSKPAGSSKTELEKSKKTAGV FT EKENTNKKECVEEKESESEEEEVDELQEEEEEPLRFVEVKRVGPPKRKKKV FT TWSDSQPTAKLTPSGTLPYVDVPPLSTKPLGAFKPKVNKEEEKSFEKEREV FT EEKGEPLPEIDRRGPAYNKRAPVEQNGAAKAIVNELLEVTVPVNIKSLMGV FT SPEAREYLKNLLTRKRVPTREVPIPRETEVFKRFFDDPHEVKRLHVGLQWV FT 
ENSEEAMAKAEAVVEQRALAEKQNRDYKEASTLRPAVRTVAEKGDEKPLGS FT ITISDPVEQYMIELGTEAKPVIVKAEESSSLRVIYPENQWSHEGRVLIRRR FT FDNLLNA" FT CDS 5806..8313 FT /product="Gypsy-7_CCO-I_1p" FT /translation="MDKVCRFIKERIDSGVYEPSNSSYSSRWFCVLKKDGE FT SLRIVHSLEPLNAVTIAHSGIPPATEELANHFAGRACGGCMDLFVGYDERR FT LDEKYRDLTTFQTPYGAMRLVTLPMGWTNSVPIFHDDVTHILQDEIPAVTK FT PYIDDVPTRGPATRYEDGKGGYETIPENPGIRRFVWEYLNDVNRVMQRMKY FT CGGTFSGKKSIILADEFTVVGHLCTYEGRKVNPERADAILKWGACKDVSDV FT RSFLGVLGTLRMFVKDYARRAEPIQELTRKEVPFRWGPEQAAAMADLQEAI FT RVAPVLKAIDYDRNVPLKLSVDTSYKAIGWMISVQDDDDPDKWHYARFGST FT TLSEREARYGQAKRELFGLMRALEENRYWLLGCRELIVETDAKYIKGMLQN FT AGNGPNATINRWIESILMYHFTLEHVAGSKFAIPDGLSRRVPHPDDKPKTP FT YDHSEEETGGLTGYRKPNPDDPEPLDFEEFKHEIDTRGGYLQDIADFEEDL FT ELDRAVADRVLSLGPKMETQFVNVEVLPDLSLKEDPKLRQEYDGESRRHPL FT IKEIDRKIPIVDKWLENPSVRPKGMSDAEYLAFARFSTHFFKKDGRLFRRG FT INGEHKIVVYPENRMYLMRTAHDATGHRGHYATKELLAKRFWWPDLDGDVG FT WYTRTCHVCQIRQKLFMRIPPTVTETPSIFQVLHADVIHMSIPSDKCRFIV FT HGRCALTQWMEGKALQKQNASTIGKWLFEDVILRWGCIKKIVTDNGGPFIK FT AVGWLEEKYGIRGIQVTGYNAQANGTIERPHFDVVNMLAKVTAGNLSKWKW FT YFPHIMWADRITIRKRFGCSPFYMVTGAEPVTPS" XX SQ Sequence 15274 BP; 4732 A; 3293 C; 4242 G; 3007 T; 0 other; cgagtccggg tttttaaaaa cgagtcaacg agtgatcccg ccgggacgag tttacgtatt 60 tttcgagttc aaattcgagt ttcgagtcta agtcgtgttt aaatcgagtc acgagtcgca 120 cttagtgttt cccttgtgtg tagagcacaa caaaaatacc aaacaccatc cgtataaagt 180 caagagcctt ttcaaagctc taaagcccta aaaaccgttc tatacgagtc agagagtctt 240 ctcctcgtta acttagtcta ctaaaaacaa caaccccatc gaacttgcgt gccaacgtac 300 agacagtacc taaaagtctt tagattcggc cgtcggtctt catcttcgtg tcacttgtct 360 cacaacaagt tccactcgat tcgggaccgc cggttgtctt cttgctcttc tttgtgcttt 420 ctgtgctagc cagcacatca tttggagccc accgcgaggg gttgtccaat ttccaagttt 480 aagtagacga agttttgcga aagcaggact taaagacaag tgtagcacga cctttcggct 540 gcgtcagatc ttgagttcgg acttaggtct ttgagctcga tccaacgtcc gccgggaaac 600 caaaaagcct acgagatttc cactatcaag aaaagaaatc ctaaggcacc aaaaccatct 660 gttgaggcgc gaataaatcc gctcacagga aaagcgtttg aagaaagaac tacgagaaga 720 
cagcgtcagg gacaaccttt ggaataccaa gagctagagt cggatagagc gctaaaacgt 780 caaatcaagg aggcaaaagc aaaaactacc tctggagaag acgaagaagc cgcagttgaa 840 gatctagttg atcccggcgc agcgacagag tcagaagagg aatcagttcc tccaacttcc 900 cagttaggag ccagtcaggg accaagtttc cccggcggct ttgaattacc aaaagcagaa 960 agcgtttcgt accgtcggtt agacaaactc aaaaccgacg tcgcagaatt agggtttcca 1020 tcttcactag gaactcccaa cgagccactt tacttcgaga aacatccgtt tagaagtagc 1080 taccgcgtca ccagagtctt ccgccactga aggcgatttt tcggacttat ccttagaaga 1140 cttaggaaag tcagttcaat ccccagcagt tcccaaacct tatagtccgc tagaagcttc 1200 tttaagccag cgagaagaag gtaaagaact tagtccagta acagaaagaa aacgaggtag 1260 aacttccccg ccggaagtga cgagagtaga aggtagcctt cgatctcaca aaagagacaa 1320 aggaaagcaa cgcgctactt cagaggatag aagttcgact tcagagccca ctacaaatac 1380 accaagcgaa cctactacaa gggaaagctt tgtggatttt ggaaacaaga gcgaagagga 1440 tatcgaacga ctccctgaac gcagggcaag aagaagaaaa aggacgaaga aaacgagcgt 1500 cttaagagaa acgccgagag gaaaggaaca agttaacgtt ccacaaccgc cggtaattca 1560 aacggctact tctgttcaac gagaacaaaa gccttcagtt tcgaacttcg agactcctac 1620 ccacgaaaaa gcacccttaa tctctccgtt gttacaaaac caccacaaga tggaatacga 1680 gatttccatg gatcccaggg ttgatcactt gacaacccaa cagttgtata agaaccctag 1740 ggttcctcaa ttatggaagg cgagccactt agtgttggaa gatgacacgg cggcagaagt 1800 atgggcttac atcgagcgga tggagacgat gtttgagcaa gcgagcgtag tgctggacag 1860 gcacaaaaag aaggcgttgg tcgacttcgt aaagaaagtc gatttaaagg atcagtggtc 1920 ccggctgacc caatacagag acgggacata tgcgcaattc aaggcagaaa ttctgtctta 1980 ctatcctggc atcaaggaac tccaagaagg aagccttgat aagttggaga aaatcttcaa 2040 cagatatgtt aggattaaga tgcatgagga acagaagttg atggacctaa ttgcgaaggt 2100 cagaacagag gtctcaaaaa ttcgagagac cgaaaggaag aggaaaatcc aactcctttc 2160 agaaaggacg ttagtccagg acttcctggc gtgcttagat cgtaacttta gaatggcggt 2220 ccaagtacgg ctctcaacag ccgaccagcg gaggattctg gcagaattga agagggagag 2280 cagagccaat gccagaacat tggacaagag gctgaagagg atggaagccc cgaaaggccc 2340 aaaggtggaa cccgaagaag gcaaagaagg aggaccctct acggccccag aaggtgatct 2400 tgggacagaa 
gtcaatccca tccaagtcga agatgatgga gacgacgact tcgatggcga 2460 gctctacgac tcctcaggtt tggaacgggt tatgcccatt gttactaagt acacgctaga 2520 cgatctgttg tacgtagcac aagtggtagc ccgggaacac tccaatagtt cggtctttgg 2580 cttcacgacg gatctaagtg acactggtac agcagaccag gacgtcatca acaccgccgt 2640 ggcagaaacg ttgaagagat tgggtattaa gcccaagaga gaagagcagc atgtgcagga 2700 cgctggaccc cctagaggga gtcaaaacat gccgccgggg atgtaccagt ggtcagggcc 2760 tcctaagtcg gacaacgatg ggtttaaact catcttggaa agactggatc aacaagaggc 2820 caagtggaag gatacactca ctggtctcag aaaagagcaa aaagaacaac tgacggagtt 2880 tgagacgaga atggaggcca aaatcgacaa gagggttgac cgaccattct cagcgattca 2940 actcgcacaa ctcgtcaggg aggaaacaac attggtcaat cctacgttcc gccgggaagg 3000 aggaactact cgaactcggg ttcctcaggc ggagggtacg tcgacttttc gcgtcgacca 3060 ggtcaacagg gctacagcca aggcgtgtgg cgaaatgctg acgctggtgg tggagagcga 3120 cgaggagtga cttgctggtt ctgtggtgag ccagctcatg cagttatggg ctgcaaaatt 3180 agaagggtga tgttggatac gggtcgtatt tgtcatgaag gtggcgatta tcgtatcctt 3240 gaaagtaata gaattattac gcccttcaga aatggggaag aatttactct agacctcatt 3300 agaaagagta tggagagtag cggccatacg ttggacctcg acgcgattaa gcgcgaggct 3360 cagtacgtac aagaggagcc gccggagcat aggtggaggg atccgttgac gggtcaacca 3420 acgcaagagc agttggtaca tgaccttaag gaacgaggtt agtcaattgg aaacggagaa 3480 ggcgcagagt ttagggtcgg cgcctaacga cacgtccgcg gaagcgcccg cggctgctga 3540 cgaaaaggca gcgatgtgga acgcccttcc aagtgttcta ccaaaattcg cgtcgacaag 3600 agggttttca atgacgacga ggaaagcagc agctgctgct tcttccaaac cggcaggaag 3660 tagcaagact gagctagaga agtctaagaa aaccgccggc gtagaaaaag aaaacaccaa 3720 taaaaaggag tgtgtcgaag aaaaagaaag tgagagtgaa gaagaagagg tggacgagtt 3780 acaagaggaa gaagaggaac cgttgaggtt cgtcgaggtt aagagagttg gacctcctaa 3840 gagaaagaag aaagtaacat ggtcagattc tcaaccaacg gcgaagctta cgccgtccgg 3900 aactctacct tatgtagacg tccctcctct ttcaactaaa cctttaggtg cattcaagcc 3960 taaagtgaac aaggaagagg agaaatcttt tgagaaagaa cgagaagtag aagaaaaagg 4020 cgaacctcta ccggaaatcg accggcgggg accagcctat aacaaacgcg cacctgtaga 4080 acaaaatggt gcggcaaaag 
cgatcgtcaa cgaattgctt gaagtcaccg taccggtgaa 4140 catcaagtcg cttatggggg tttcgccaga ggcaagggaa tacctcaaga acctcctgac 4200 tcggaagcga gtaccgacta gggaagtccc tatccctaga gagactgagg tgtttaagcg 4260 tttctttgac gatccgcacg aagttaagcg tctacacgtt ggcttacaat gggtagaaaa 4320 tagtgaggaa gcgatggcaa aagccgaagc agtggttgaa caacgggccc tagccgaaaa 4380 gcaaaacagg gactataaag aagcctcaac tttgcgaccg gcggtccgaa ctgttgcaga 4440 aaaaggagat gagaaaccat tggggagtat cactatctct gacccggtgg aacaatatat 4500 gattgaactg ggtactgaag ctaagccagt tatagtgaaa gctgaagagt cgtcttcgtt 4560 gagggtcatc taccctgaaa atcaatggag ccatgaagga agagtcctta ttagacggcg 4620 gttcgacaat ctgctcaatg catgaacaag ccgcaaaaga ccacggcctt acttgggatc 4680 cggcggtgac ggtgtggatg tcctcagcgg acaaaggatt gaagaaatca ttgggtttgg 4740 caagagacgt gccctttgtc tttggagaag taagagtgta cctccaaata catgtacttc 4800 cagacgtggg ttataggatc ttgcttggaa gaccgttcga gatgatagcg gaaacgctaa 4860 caaagaacaa tcgagatggg tcacaaacga ttaccctgac cgatcctcag aacggcaaag 4920 tggtgaggtt gaacacttat cctagaggag aacctcctcc agagttaaag gcagagttgg 4980 acgccaaccg gcggtctttt cagaactcga tgaatcatca atagatgtag gagagcttgc 5040 agtcgtcttt gaacagacag cttctggaga gattaaagtc gaaaaggtgt tccagtccgg 5100 ccataaaagt ggaaagttca gccaagaaga cgtgaagtta atgtttattc aagaagcatt 5160 aactagggaa gtagaaaata agaaagccgc taaaaaggtc gaagctttct tgcaacaagt 5220 caggacaggc aaaagagaca aatccgtcgg tgaagtcatg atgttagata ctgtgataaa 5280 acacttcgaa cagatttatc aacagttaaa agaggatatc atcgaggatt ctgccaagac 5340 ttcgtcaaca acctccagcg agaacaaaga cgaagtagta gcagggaacc ctaacgctaa 5400 accaaaaatt aagggtgtct ctacaggaaa gaagtataag aaagtggcag ataaagtaag 5460 acccatcttg gggactctag cagaggagtt tagaattatt cggaagatta taggaaaccc 5520 attggctgaa ttgcctaaga tgcctacgaa tccgccggaa ttcacacccc aaggaaggta 5580 caccacagag agaaaggaag ccatggataa ggtccacggc ggcgatttca tttggccaga 5640 agagcgaaaa ctattacact ggttaatagg agtacataac ctagctttcg cttgggaaga 5700 tgcggaaaaa gggagattca gacaagagtt tttccctgat gtccctattc cacgatacca 5760 cacacaccgt gggtattgag gaatagacca 
atcccacctg aacttatgga taaagtatgt 5820 agatttatca aggagaggat agattcaggc gtgtatgagc cttcgaactc gagttatagc 5880 tcgaggtggt tctgcgtact taagaaagat ggtgaaagtc tgagaattgt acatagtcta 5940 gagcctctta acgcagttac gattgcgcac tcaggcatcc cgccggcaac agaagaattg 6000 gcaaatcact tcgccggtag agcttgtgga ggttgcatgg acttgttcgt aggttacgat 6060 gagagacgat tggacgaaaa gtacagagac cttaccacgt tccaaacacc ttatggagcg 6120 atgcgtttgg ttaccctacc gatggggtgg accaattcgg taccaatctt ccatgacgat 6180 gtaactcata tcttacaaga cgagattcct gcagtcacga agccgtatat tgacgatgtt 6240 cctacgagag gaccagccac gaggtatgaa gacggaaaag gaggttatga aaccattcct 6300 gaaaatcctg gtatccggcg gttcgtatgg gaatatctaa atgacgtgaa cagagtgatg 6360 cagcgtatga agtattgtgg agggacattt tcaggaaaga agtcgatcat tttggcagac 6420 gagtttacag tagtaggaca tctttgtact tacgaaggaa ggaaagttaa tcccgaacgc 6480 gcagatgcaa tccttaagtg gggagcatgt aaagacgttt cagatgtacg atctttccta 6540 ggagtactag ggactttgcg aatgttcgtc aaggattatg cccgccgggc agaacctatc 6600 caagaactaa ctcgtaaaga agtaccattt agatggggac cagaacaggc agcagcaatg 6660 gccgatttac aggaagccat tcgagttgca ccagtattga aggctataga ctacgacagg 6720 aatgtgcccc taaaattatc cgtggatacg tcgtacaaag ccataggatg gatgatatcc 6780 gtccaagatg atgatgaccc tgataagtgg cactatgcta gattcgggtc tacaacgcta 6840 agtgagagag aagcaaggta cggccaagca aagagggaac tctttggctt gatgcgagct 6900 ttagaagaaa ataggtattg gttacttggg tgtcgagaat tgattgtcga aacggacgct 6960 aagtacatta aaggcatgtt acaaaatgct ggaaatggac caaatgcgac tataaatcga 7020 tggatcgaat ccatactcat gtaccatttc accttggaac atgtagctgg aagtaagttt 7080 gcgatacccg acggtctgtc acgtcgagtt ccacacccgg atgataaacc aaagactcct 7140 tacgatcact cggaagaaga gaccggcgga ctcactgggt acaggaaacc aaaccccgac 7200 gacccagaac cgttggactt cgaagagttc aaacacgaaa tcgatactcg tggaggatat 7260 cttcaagaca tcgcagattt tgaagaagat ttggagttag atagagcagt ggcggacagg 7320 gtgttgtcac tagggccgaa aatggaaact caattcgtaa atgtggaagt actgccagat 7380 ttatccttaa aggaagatcc taaacttcga caggaatacg atggagagtc ccgccggcac 7440 ccacttatca aagagatcga cagaaaaata ccgatcgtag 
ataagtggtt agaaaaccct 7500 tcagttcgac caaaaggaat gagcgacgcg gaatacctag cattcgctag gtttagtacg 7560 catttcttta agaaagatgg aaggcttttt cgtcgaggaa taaatggtga acataagata 7620 gtcgtttatc cagaaaatcg gatgtacttg atgcgtacag cgcatgacgc taccgggcat 7680 cgaggacatt atgctactaa ggaattacta gctaaacggt tttggtggcc agacttagac 7740 ggggacgtag gctggtatac gaggacttgc catgtctgtc aaatccgtca aaaattgttc 7800 atgaggatac ccccgacggt aacggaaacc ccatctatat ttcaagtctt acacgccgac 7860 gtgattcata tgagtattcc tagcgataag tgcagattca tagttcacgg aaggtgtgca 7920 ttgacacaat ggatggaagg gaaagccttg cagaagcaaa atgctagtac tattggcaag 7980 tggttgttcg aagatgtaat cttaagatgg ggttgcatca aaaagatagt gacagataat 8040 ggtggacctt tcataaaggc tgttggttgg ctagaagaga agtatggaat ccgtggcatt 8100 caagtaacgg gttacaacgc tcaagcaaac gggacaatcg aaaggcctca cttcgacgta 8160 gtaaatatgc tagccaaggt gaccgccggg aatttgtcta agtggaagtg gtatttccct 8220 cacattatgt gggcggacag aatcactata agaaaaagat ttggatgttc gccgttttat 8280 atggtaacag gagcagaacc agtaacaccc tcttgacata caagaagtga cctggctaat 8340 tgaacctccc gaatcgttcg tttccacgga agacttgctc gcgcgaagag ctaaggcgtt 8400 ggcaaagcat cgagacttcg ttgagtcagt aagaaataag atccaccagg cgaagaaaga 8460 tagagtagct aagtacgaga ttgaacaccg gcggacgatt aaagactaca acttcaagcc 8520 aggagacctc gtaataatga gaaactcccc aattgaggat agccttgatc gaaagctgaa 8580 acctaagtac aacggtccgt acgtagtgct gaatagaaac aaaggaggcg cctacatagt 8640 tgccgaaata gacgggtcag tataccagtc tgtagtggca gcatttaggc tattacctta 8700 ccacgcacga aagaagattg agctacctcg aaatattcac gagagcatcg acctagggcc 8760 ggcagagttg aacaaactag tgtcttcgcg gcgaaaaggc gaaccggcgg acgacatagc 8820 gttcgaagga atgcctaaag ggaatagacg aagcgaacag gtgaatgagg acaacgatct 8880 cgacgacttg ggaagctcaa ccgaggacgg ttgatatttc taagttgggg ggagaggagt 8940 accaataggg cgaaagaatg taactcaaaa agagtacatt ctgaagacca agtaaacatt 9000 aacctaaagt tcgcattaaa attttcgatt ttccaagtca ggaagactta gaaaggactt 9060 aagacaactt agagcactta ggaggagttc acacacagat agagaggtgc aaagcatgat 9120 gatattcaat atagtagtag tactagggaa aaagcgaaga atcgcaacaa 
aaacctaata 9180 ctactacaaa acgaggccca aaaacgagaa gaggctaagc ctcagtcatg tcaacatcac 9240 cggcggcatc gacgtcgccc ccaagagtgc tatccgcctc aagaggagga ggggaagcag 9300 gagactccac agaagaagaa ggaagatcct tctccctcaa gaggagggag ttcctttctc 9360 cagatgggcc tggatcctat catactcagc gaagagtact tcaaggaacc gaacgtcgag 9420 actgagaagg tgaacccgcc ggaggatctc agagcgaatg tcagtcagcg tcgcacgagt 9480 cgacgaaccg acgttgacag cgtcagagta gttggccaag atgttctgta cttgagacag 9540 aacaccattg acccactcgt tgttgacggc gaagtggtcg gctaagaaga agagagtgag 9600 taagaaaagt aagaaacaga agacgagaga tgacttaccc aggtcttgac aagcatcccg 9660 tcggggaaca ggttgggata tttggaggtc aaagcgttct tattgacctc ctcgagcgcc 9720 ttggcgtcga gcgaaacgcg agcgcgctta cccagctgag tagccttgag aaaagcggga 9780 gaggataagt cgagattcgg aacgaggtag taagggactc gtacgacggc agcagaagca 9840 ccggcggtgg tacgagggcg agaagggcca ccaccgcggc caccctgtaa agtcaaagag 9900 taagtaacga aacgaagaga gaaagaaaga aagaaactta ctcgaccacg agcgacggcc 9960 ttgccaccac gagtcgacga agttgaagta gaagcagccc caggaccgcc gttttcagta 10020 ccgacgattt cctgaaaaac ataagtcagt aaggaacaga aagggcaaga gataaacact 10080 taccacgtcg gaatcgtaaa tgatagcggg ggacttgggt tgcttgccat ctaagaaaga 10140 tgggtaagca aagatagtag tagaaagaga ataggactta cgctttttgg gagtaggagc 10200 ctcctcctcc tgagtcgaag ccccattagc gagcttttta atgttatgga acgccgagct 10260 gacgtcgaac gggtcagcat cggcaggaag ggggatggtg ggagctttca ccgcctcgcc 10320 agaccaagcg aggtggtaag agacctccgc cgggacgtcg agcccgatgt ccttgtcacc 10380 gcagaactcg acgaagacct tcagcgccgg gtgcatgacg cgaagttgct tctgaatgag 10440 agcggtgtag gccatctgcc cgccgtagag gcgacgctta gggtactaaa agaaaagcgt 10500 aagttagaaa ggaaacgagt aagcaaggaa aagagactca ccatgtaaga gacgagctca 10560 agggcagccg tcttgtactc cttaaggatg ctgaaaggaa tagcctcctt ggcgatcgca 10620 cgaagacgga ggtgaacgcc ggtgagcttg tggatgccac gagcgacgtt ggcgttatct 10680 tcaacgtaga cgcgcttcat gccatcccac acggacggcg ggatagactt ggccggcact 10740 gggagaccaa gaggaacctt ggggtcgaag ccgaggatct cctggaagac gttggaggcg 10800 ggagaagaca tggtgatagc aagagaggaa gagaatcgaa 
gagacgaaga ggagagcgat 10860 gagaaaggtg aaaagaacca gaggaggttg ctggagaaac tcgaccagga aaccggcggt 10920 ttttatagca aaaagcctct cagttttagg gaatttctat tggtagacgt gttcaaggaa 10980 aaaccgacac gaccacgatc tcacagtcga aaatcggtgg tgcagaacgt gccccaaaac 11040 cgaaaaacat gagtcttcct tagaaatttg tgcaaagtcg agtatcccga attggcaata 11100 ccgccggagg ggttgatggc ttggcaaacg cccacattgc cacttagttt gagaattgct 11160 tccgcggaca caatccgaaa gggttaagac gaaccccaga aggcaattca gccggacgga 11220 cggcaatttc accgtttcga gaaatcactc taagtctcag ataggatacc ggcagtagag 11280 atagccaaga aattggattt gaggctacac taatatgact acctaagagt cggtgtagta 11340 atccaataat gtacaagatg gtgcgaagga catgagtata caacaacaag aatgtgcgaa 11400 aaataaataa atacgagtgt ccccaataag cgcctccccc gccggcggcc agagtagaaa 11460 gaagaaatgt ctctcagaaa agagcgagag acgcgaaaag aaagaaagaa gaagaagaag 11520 aaaacgacga acttagtatt cgggggagtc agaaaggtct ccctcgtcgc ttgagttcga 11580 ggacgaggcg ccaccaccag aacgagcaga ggagagggcc gagctagacc tctcgtcgac 11640 gttcatcgcc gaaccctcga tgcctggcca ctctggagac tcctctgacg aaaagagacc 11700 aagagagtcc ccaatgataa cgtcagcact gggagctacg ccagtacgat cgtcgtagac 11760 tagaaagcaa taagttagta ggagtaagat aggaaaaaga gcgatatgac taccagtcat 11820 cttcgtcgtt cggagcagga ccaccgtccg ccgggtcgac aagagggtcg aatgcgggaa 11880 gagaggacgc gtcttgatcg aggaaatcgt cgaactcgtc gtcgatagga cgatcgagtt 11940 gcgacgggga ccagagacgg ccgtccttgg ccatcgggaa ctcctgcgca atgaacgaac 12000 caaagacgta aggagcgtga ggagggttgg cgacggtagg aaggagttcg tcgtcaggga 12060 tgggtacgcc ggaggcccga cgcgaacggg tgaagagctc gtccatacgc tggacaagct 12120 cggccttgtt gacggtgcta ccgtcggcac ccacgaggaa attatcgaag gtaaacatcg 12180 agcgaaactc gaggtacgca tcgaagtcgt tgttgaaaaa ctcgacgacg atggcctcgg 12240 gtccctcaga cgtttccaaa cgataaatgg ccgagaggag acgagcgtga agctcgtaaa 12300 ggtggaggga gtgggtgagg tactgattga gcgaatctcg caccacgtta gactggtgca 12360 agatgtcgcg cacgacagtg tcaatgtcta aagagtaaga cataataatc agtaccggcg 12420 gaggaaaaga aggttatgag gggacttaca atgagaagat ccctcagctg cacccgccag 12480 ttggttgcga aggccgcgac 
gagcgtcgat gttcaacgaa aacggacatg gcttgtccgc 12540 tcgaccgcag ttgccgcagc taagaccagg accacggtac tcacacctag ggatgcctaa 12600 gaaggcacac ctagcgcagg ggcccacaac ctgggagggt attaagttag taagaagtta 12660 aagagaaaga aaagagaaag gcaacttacc aaggcatggt cggacaacaa ccgccgggaa 12720 cccatgatac ctgtaaggtt ctggtagagg atgttgcgaa ggtcggtagt ctgacgttgc 12780 gcggtaggca tgttgcgacg aagggatccg acgaaggggt tgctcgagga gcgagacgca 12840 tcgtctctag catccgacga gcctccttca gagttagacg aagaagacga agacgaagag 12900 gaaggggagc ccttaggagg gactggtgga cgcaagtagt gcgcctggcg ttcctcccgg 12960 ctcaggtgtt gaatggagac gccagtcctc tgtgagaact gaagctcgag aggagtggtg 13020 tccctccctt gggacaggaa acccgtagga tcgtcaacgc catcccggcg gacgatcggg 13080 ttgaaaccaa cggaaggggg gatggaaaag ctctccatcg atcgactcaa ggttgcgcgg 13140 ggccagtacc ggtatagcgc gtacttgctt gaggaccttt gggagcacga gggaaagagg 13200 aaggaccgta gacggcgggt tgaccaggaa taggaacgat tttcctcgca gaggaaggag 13260 tcgcagggat aggagggatg ccgggcgtct tgggcgtctt aggagacacc tgaaaggagc 13320 gagatcgacg aggggttcga ggagcctctt gatgcacgtc ttcgccgcta ggcgaagggt 13380 cgagttcgtc gacagcgtcc ccgtcgtcaa gacccgtacc cgacgcgact acagaggacg 13440 acgcggaacg acgatcgtct aaggagtcgt cagctcgtcg gttaaaggaa cgaaagaaga 13500 gagaagagcc tacttaccca gagtagacac aggaggcacc gaagaagcag gagccgcctg 13560 ggagaccgac gcgctggtac caaagacttg acgtttccag actcgaagta ccagagggcg 13620 agagtgaagc aggtgcagag tcgcgtccct ctcctcgaag gagagcagtg ggcggtagaa 13680 caacgccagg cggaaaattg cgaggacgaa cagtaagacg aggaagttga gggtgaggat 13740 gacgaataac aggggtacca ggaggaggcg aagtatggtc agcgagccat gcagctcgct 13800 cccgcctgta cttggccaag gcctcttcaa aagaaacggc cacatcgata tagatgtgcc 13860 agtgctagat attggatccc tcagtaatac gccacccgcc ggaacgatca gaaacttact 13920 tgatagtgag gatctgccct ccaagtggcc caatccatag gcaacgctgg aggagaaatg 13980 agtcgccatc ctcgtaaagt gttgtcacga aggcgacgag gaatgaggaa cggcatctca 14040 gcgtcctaac aagagaatag taagtaagtg tccgagaaga gatagataag gaaaagacga 14100 accgtcgaag gaacgattgg acaggtcaac cttaagatag ccatctgggg gtcggtgaag 14160 
acgatgttca tcattcgacg aatgcccata tttcggtgga ctggttcagc ccaggcccgc 14220 caggagtaag gcgcagtaag agggtagcga ggatgaccag ggggaccgtc attgccttgg 14280 tagacagtca tgaaggccca tggtgtaggc cctcactgcc atcagcgaat gaccgccggg 14340 caacctctgt aagatacact gaaaagtcaa gtcagaggac agaaagttga agaaacagca 14400 caaagactca ccaagtcgtt ataagtggcg aaagataaga agtcccactc aggggaatat 14460 ataaacgcct cctcgaacct gaattggccg aggaaacgat tgaggtggtc gacgatgaaa 14520 ttccagcact gtcgaagata aggggaagaa cgaaagagac ggaacgtttc agggttgtcg 14580 tcaatttgga gacgttggtc gccaatttgg atgaggaact gctgaggcga gaagaatgcc 14640 tcgctagccg gagagagtag caggttgagg agtggacatc gatgaaagat agcacagtac 14700 aacagtaagc caggagcttg gagaagggaa gaccggcggt cgtcccctat atataggcca 14760 aagaacgacc ccgagcggat aatccctcta ggagcgcaga ctgacttcat gagaagaggt 14820 ggggaaagta ttaacgctgg tacttcccaa aaaggaaagt gatagaccct aacaggagtc 14880 aaaccacggc agcttgaccc ttaatatgtg gtatcaaacc ccactagaat ttaacgctct 14940 actcatctaa gtcaagggca gaaagatcag agaaagtaga aaagcgaaat tgtcgatcca 15000 aagtacgttt gaaagcaacg tttgatgtct tgtcgtaacg gcggcagatt caagagccac 15060 ttaagcgcca gatcacaaaa caccaccgta tcacagacaa gtccaagttt gggacgacgg 15120 gctagtctag cagaagtgaa ttggagttcc tgtaaggtgg cgaagaactt cagaagtcga 15180 tgtcaagagg agtgacggat cgtatatgtc gaagttccaa gaacttcgac aagaatctga 15240 ctggggacag tcctcattat aagttggggg gaga 15274 // ID Gypsy-3_RO-I repbase; DNA; FNG; 5604 BP. XX AC AACW02000296; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-3_RO_; KW Gypsy-3_RO-LTR; Gypsy-3_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-5604 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). 
XX DR Genome; AACW02000296; Positions 57628 63231. XX CC 'GTTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 39..1022 FT /product="Gypsy-3_RO-I_3p" FT /translation="MYIQGYIKMSSLNQENQVFSGALTGESQQGNKMMRYV FT NEPKSFSGYTKYPGGDICTEAYTWLNRMSRLKATAKLSDKEILFIVGDHLV FT EKAETWWNVVGSKSESWKEFSELFKKQYMVDQEDKWWRQLQTMKQGPQDSI FT DDVALKMEELFGLLGSMSDAFQVRTFLGAINPAIAFEVEKDETPNTFSAAK FT KKAKQIERSLAKYGANGVMGTRGFSNGTIVENDNVGSQHGWNSSSSTVSSL FT AERLEQLRINLVQLSNVVMGQERVAPQVVNPKPVRRGLVCFFCDEEGHRKF FT ECPKFLENQNGGAINQTPATGSNAIPVNQGKGLERQ" FT CDS 1151..3922 FT /product="Gypsy-3_RO-I_1p" FT /translation="MDTPPSVSASDAHVDKRPVAGTGQVYSGLQQATSSSQ FT PFGAGVPVVVGAPVGGGVPSNTQMGTLPGLQGGPTVKRVVKKRTRGKARRL FT PVKIKKKGNVWEVLDGTNAGLSVAGLIAMDRGIQRDVIDGIRFLREKQASL FT RSNEKVATKGRRIEENSSAVPMVINVVDQDGYGSDSSIVSGDDLVSNDDSD FT WQSTFSLSDGSNDSVNFSEDWDDGVSVYHYPYSLNQMKQGSPLKGVVNING FT KSVVAIFDTGASVSVIGKALAASLGLRPNGDTLALTGLENKQGDDSPIVSD FT VPICIAGKIRPEHMCIHDSGNTDLCLLGITWFQAYGIELDLINSVVKIPTT FT TGMVKLQGYTSHIKDRSLLNSNVMLPSVLDASASIKATVGSDDDRRVYMVQ FT NSQNHVDAYEDVLIPDGEALEEEFKYNADNITIGVLPELADVVEKYKECFS FT EVSGLGRVTGYKMDLPLVDGATPIRTKPFRISWQEEEVLDQYLQEMLDLNI FT IKPSNGLWASPCFFIPKKDGSLRLVIDYRKLNKMVKQDAYPLPHIDELLDS FT VGGATIFSTLDCTSGYHQLPLNEEHAERTGFVTKRGTFSFNVLPFGITTGC FT SQYQRMMHSILSKYVGDFVLIFLDDILVYSKSVEEHRYHLSLLLEACAKAN FT LKLKRKKCKFGESQVEYLGHVITADGVLPSDHNINKVKMFETPAGVDDVRS FT FLGLTGYYRKYVPGYATMAEPLTRLLKKTVPFSWGPDQQSSFDQFIKALTQ FT TPILSYPDRKKIQVLSVDASLKGLGAILSQVDDAETMANERVLSYASRSLR FT GSEVNYAITHLEALAVVWGVTHYKHYLKGRPFVLITDHSSLVYIFKPSRIT FT PKLSRWAAALLDYDYEVRYRPGKCNPADALSRICSKEDSDQSVVGYENGSV FT ANESKNDEKMVFAAGVTRGD" FT CDS 4595..5605 FT /product="Gypsy-3_RO-I_2p" FT /translation="MKTPLHPFFMVGVDAVGPLQPTEKGNRYILTAVDYLT FT RWPMAMCVQDIDEETTAEFFFSAIVKNFGVPQYILSDRGANFISSYVHYFL FT RQIGCKNIMTTSYRPQVNGMCERLNQSLVQTMSKIARDNDDILQWDKYLDA FT ALMVLRSTVNSSTGYSPGYLLFGYELRTPAVWVAPKEDFVLGEEMEALKDR FT VIKIQDKMAEVRKIAREKSDENKLKAKKRYDKRVVYRKSFEVGEKVLLRDT FT 
TPSTKLSDKWLGPYTVSQVNKNGTYLLVGKNSLRLKHAVNGDRLKLFNNDV FT KHMVPDVVAAAATEQFRSWLNTKKNPAFLVETQLKEEEESRGGVR" XX SQ Sequence 5604 BP; 1552 A; 919 C; 1425 G; 1708 T; 0 other; agtggtccgg cctaccagat aaagtatatt aatcggtgat gtatattcaa ggttatatta 60 aaatgtcaag cttaaatcaa gagaatcaag tcttttccgg tgcgttgaca ggtgaaagtc 120 aacagggaaa caagatgatg cgttatgtta atgagcctaa gagcttctct ggttatacca 180 aatatccagg tggtgatatc tgtacggaag cttacacttg gttaaaccga atgtcgcgtt 240 taaaagctac tgcaaagtta tcagacaaag aaattctttt cattgtgggt gatcaccttg 300 tagagaaagc tgaaacctgg tggaatgtcg ttggttctaa gtctgaatct tggaaagagt 360 tttctgagct gttcaagaaa cagtatatgg tggaccagga agacaagtgg tggcgtcaac 420 ttcaaactat gaagcaaggt cctcaggaca gtatcgacga tgtggccctg aaaatggagg 480 aactctttgg tcttctgggt tcgatgagtg atgctttcca ggtcagaacg tttttgggtg 540 ctataaatcc tgctattgcg tttgaagttg aaaaggatga aactcccaac acgttttctg 600 ctgccaaaaa gaaggctaaa cagattgaaa gaagtttagc aaagtatggt gctaatggcg 660 tgatgggtac tcgtggcttt agtaatggta ctattgtgga gaatgataat gttggttctc 720 agcatggctg gaactcctct tcatcgaccg tctcatcttt ggctgaaagg ctagaacaat 780 tgcggattaa cttggtgcaa ttgagcaatg ttgtcatggg tcaagagagg gttgctcctc 840 aggttgtaaa ccctaagcca gttcgtcgag gtttggtctg tttcttttgt gacgaagaag 900 gtcatcgcaa gtttgaatgt cctaagttct tggagaacca aaacggtggg gccataaatc 960 aaacgccggc tactgggtct aatgctattc ctgtgaacca gggaaaaggc ctggagcgtc 1020 agtaaaagag gtgactgacg ctcataaaat taatcaaaat cctcaaaatt ttggtgacgt 1080 caacttggtg gacacaatct gccaagatag tggtgatggt ttggctgagg tgtacgtgtc 1140 caaaagaaga atggatactc ctccttcggt ctctgcttca gatgcccatg tggataaacg 1200 accagtggcg ggtacgggtc aagtgtactc aggtttgcaa caagctacgt cgtcaagtca 1260 accttttggt gctggtgttc ctgtggttgt tggtgctccc gttggtggtg gtgttccttc 1320 aaacactcaa atgggtacat tgcctggtct tcaaggtgga cctacggtta aaagggtggt 1380 caagaaacgt acacgtggga aagctcgtcg tttacctgtc aaaattaaaa aaaaaggaaa 1440 cgtctgggag gtgctggatg gcacgaatgc tggtctgtct gtagcgggtc taattgctat 1500 ggatcgtggt attcaaagag atgtcatcga tggcattcgt ttcttacgag aaaagcaagc 1560 
atcccttaga tcgaacgaaa aggtagctac aaaggggcgt aggatcgaag aaaattctag 1620 tgctgttcct atggtgatca atgtggtgga tcaagacggt tatggtagtg attctagcat 1680 tgtttctggt gatgacttgg tttctaatga cgattctgac tggcagtcta cattttcgct 1740 ttcagatggt tcgaatgata gcgtaaactt tagtgaagat tgggatgatg gtgtttctgt 1800 ctatcattat ccttacagtc taaatcagat gaagcaaggt tctcccttga agggagtggt 1860 aaatattaat ggcaagtcgg tggtggctat ttttgatact ggtgcaagcg ttagtgtcat 1920 tgggaaagcg ttggcagctt ctttgggtct aagacctaat ggtgatactt tggcgttgac 1980 tggtctggag aataagcaag gtgatgacag tccaattgta agtgatgtgc ccatatgcat 2040 cgcgggtaag atacgtccgg agcacatgtg tatccatgat tcgggtaata ctgatctgtg 2100 tctgctgggg attacgtggt ttcaagccta tggtatcgaa ctggatttaa ttaattccgt 2160 ggttaaaata cctacgacaa ctgggatggt taaattgcag ggttacacgt ctcatatcaa 2220 ggatcggtcg ctattgaatt caaatgttat gcttccttcg gtgttggatg cgagtgcgtc 2280 aattaaagct acggtgggtt ctgatgatga tcgccgggtc tatatggtgc aaaatagtca 2340 aaatcatgtg gatgcttatg aagatgtgtt gattccagat ggtgaagcgt tggaagaaga 2400 atttaagtat aatgctgata atattactat tggtgtttta cctgaattag cagatgtagt 2460 agaaaaatac aaggaatgtt tttctgaggt ctctggtctg ggtcgtgtca ctgggtataa 2520 aatggatctt ccgctggtag atggcgcgac tcctattaga acgaaacctt ttaggatcag 2580 ctggcaagaa gaagaagtcc tagatcaata cttacaagaa atgctggatc taaatataat 2640 caagccatct aatggtcttt gggctagccc ttgtttcttc ataccaaaga aggatgggtc 2700 gttgcgtttg gtgatcgatt atcgtaaatt gaacaagatg gtcaaacaag acgcgtaccc 2760 tcttcctcat atcgatgaat tgttggactc tgtaggtggt gctacgatct tctctactct 2820 agattgtacg tcaggctatc atcaacttcc gctaaatgaa gagcatgcag aaaggactgg 2880 gtttgtcact aagcgaggga ccttttcttt taatgtatta cctttcggga ttactactgg 2940 gtgcagtcaa taccaaagaa tgatgcattc gattctgtcg aaatacgtgg gtgactttgt 3000 gctgatcttc ttggatgata ttttggtgta ttcaaaatct gtagaagagc atcgttacca 3060 tttgtctctg ttgttagaag catgtgcgaa agcgaatctt aagttgaagc gcaagaagtg 3120 taagtttgga gaaagtcaag tcgagtatct gggtcatgtc attactgctg atggtgtact 3180 gcctagtgat cataacatca acaaggtgaa gatgtttgaa actcctgctg gtgtggatga 3240 tgttcgttct 
tttctgggtc tgaccgggta ctatcgcaag tatgtccctg ggtacgctac 3300 gatggctgaa cctttgacgc gtttattaaa gaaaacagtg ccttttagct ggggtccaga 3360 ccaacaatcg tcatttgatc agtttatcaa agctcttacc caaacgccaa tattatccta 3420 ccctgatcga aagaagatcc aagtgctctc tgtggatgct agcttgaaag gtctgggtgc 3480 gatcctgtct caagtggatg atgctgaaac catggcgaat gagcgggtgt tatcctatgc 3540 ttctagaagt ctgcgtggaa gtgaagtgaa ctatgcgatt acgcatttgg aggctctggc 3600 tgttgtttgg ggtgttactc actacaaaca ttacctgaaa gggcgacctt ttgtattgat 3660 cactgaccat tcgagtctgg tgtacatttt caagccaagt cgtattacgc ctaagttgtc 3720 aagatgggct gctgctttgt tagattatga ctatgaagtg cgttatcgtc cgggtaaatg 3780 taatcctgcg gatgcgttgt ctagaatctg tagtaaggag gatagtgatc aatcggtggt 3840 gggatatgaa aatggatcgg ttgctaatga atcgaagaat gatgagaaaa tggtttttgc 3900 tgctggtgtt actcgtggtg attgaatgct gtaataattg atgaagatta ttttggaaaa 3960 actttttata tcgtgaattg ctttaaatct ttttactttt catcttcttt tacttatctt 4020 tttatcttta cttttacatc tttatctctc tttacaaaaa aattttactt cgctttttac 4080 tttggaaaag ggtgataaat aataaagttt actgttgcta taaaagaaga gtaattgggt 4140 ccttttaaag ttcaatatcc tttctgctta caagatttgg tctttaatcg tatgtttctg 4200 cgtttggtat caaatataat ggctattagt ttcgcttggt ttctgactat ctatcactat 4260 ttggctgatg gttcgctgcc tgaagattgt gataagaaga cggcgcaacg tataaggtat 4320 caagcaaaga agtggtgtcg gtctgatgat ggttccttgt tggaaaaaga gaacaggtag 4380 aagattacat cacgaaagcc gacgcgttag aagtgggtta aaaggataca cccaagaggg 4440 tcatttttgg tatactgaaa tactatagac aaaggttaac aaatactatg tggtttctga 4500 atgcagaaag ttggttacgg cagtagtaca ctcttgtgag gcttgtcagt tttgtgctag 4560 gatcaaggca gttaggtcta atcctggtgt gattatgaag actcctcttc atcctttctt 4620 tatggttggt gttgatgcag tggggccgtt gcaacccaca gaaaagggaa atcgttatat 4680 attgactgcg gttgattatt taacaagatg gcctatggcg atgtgcgttc aggatattga 4740 tgaagaaact actgcagaat ttttctttag tgctattgtt aaaaattttg gtgtacctca 4800 gtatatcttg tctgacaggg gagctaattt tatatcgtcc tatgtacatt acttcctccg 4860 tcaaattggt tgcaagaaca tcatgactac tagctataga ccacaagtaa atggcatgtg 4920 cgaaagatta aatcaatcac 
tggtgcaaac gatgtctaaa attgcgcgtg ataatgatga 4980 tatacttcaa tgggacaagt atttggacgc tgctttgatg gttctcaggt caacggtgaa 5040 tagctctact ggttatagcc cagggtatct gttgtttggt tatgaattaa gaactcctgc 5100 tgtttgggta gcacctaagg aagattttgt gcttggagaa gagatggaag ctcttaaaga 5160 tcgggtgatc aagattcaag acaaaatggc agaggtcaga aaaatagcaa gggagaagtc 5220 ggatgaaaat aaacttaaag caaagaaaag gtacgataaa cgggtggtat atcggaaaag 5280 ttttgaagtt ggagagaaag tcttgttgcg tgatactaca ccttcaacaa agttatcaga 5340 taaatggttg ggtccgtata cggtgtcgca agtgaataag aacgggacat acttattagt 5400 tggcaagaat agtctcaggt tgaaacacgc tgtgaatggg gatcgtttga aattgtttaa 5460 taatgatgtt aaacacatgg tgcctgatgt ggttgctgct gctgctacgg agcaatttag 5520 aagttggtta aatacaaaga aaaatccagc gtttttggtg gaaacgcaat taaaagagga 5580 agaagagtct agggggggag tacg 5604 // ID Copia-38_MLP-LTR repbase; DNA; FNG; 133 BP. XX AC AECX01000934; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-38_MLP_; KW Copia-38_MLP-I; Copia-38_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-133 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000934; Positions 40448 40580. XX SQ Sequence 133 BP; 35 A; 18 C; 17 G; 63 T; 0 other; tgttgaagta aatagattag ttacatgata ctcatttcac gtgcatgaca tttacatttg 60 atttatattt catttctaat ttactttcat atttgtgtag cgttacatgt ttttatgttt 120 tcatctgtac tca 133 // ID Gypsy-23_MLP-LTR repbase; DNA; FNG; 193 BP. XX AC AECX01000136; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-23_MLP_; KW Gypsy-23_MLP-I; Gypsy-23_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-193 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000136; Positions 177112 177304. XX SQ Sequence 193 BP; 56 A; 54 C; 30 G; 53 T; 0 other; tgttatggaa cttattacgt gtcacttcca gtgacattct cattacaaca tgtcacacgc 60 tttgtaccca gtcacatact tccacaacgc ttgtatcaga gattttccct catctgacaa 120 tctaactaga cacctggata agctgtgaag agcttcactt ccgtagcaag aacccagagg 180 accctccata aca 193 // ID Gypsy-118_MLP-LTR repbase; DNA; FNG; 391 BP. XX AC AECX01000825; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-118_MLP_; KW Gypsy-118_MLP-I; Gypsy-118_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-391 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000825; Positions 10229 10619. 
XX SQ Sequence 391 BP; 90 A; 111 C; 61 G; 129 T; 0 other; tgttattccc aatttcggca ttcccctttg ttatcgttgt acgcgctttc gaaaccatcg 60 acgcgctttc cactactttt gtatgctttc catttctttt atttctttta taatgctttc 120 catcggacag gctctatcca gatccgtccc gatcagacac atcatactac ttgtaaaccg 180 aatccagcag taccggattc ggtcttccat aactcatatc tgcttgtatc aagacgacat 240 ccctgtcgtc tttcgttgta ccacacattg ccatattcgt acttgtagag agatggagga 300 ctatataaag ccctcctctc tcccccttgg aatgaatccc caagttactt ctcctgcaca 360 cttaccgtac caagttgtga taggaatcac a 391 // ID Copia-53_MLP-LTR repbase; DNA; FNG; 355 BP. XX AC AECX01000204; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-53_MLP_; KW Copia-53_MLP-I; Copia-53_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-355 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000204; Positions 12036 11682. XX SQ Sequence 355 BP; 92 A; 95 C; 52 G; 116 T; 0 other; tgatacaaca cttgtatctc gaatcgtact cgacacttag tggccataag tattcagacc 60 atcatgctga gcttcactct gaagactcta agcatgttct catttccttt ctacgaacac 120 ttcttgtatt cttactcatc caatgttcag accgttccat ttgtatataa tccaaatgat 180 tgttgttttt ccttctctct caatcagcat caataatatt cattacattt ctattaaaac 240 gcttccactc cgagaacgct gttgacagat tctcgaagat ctcgtttgtg acattcgact 300 aacagtcctg aagcccgcac acggagcttc cacgacctgc acgtagcctg tctca 355 // ID Gypsy-2_GDe-I repbase; DNA; FNG; 7138 BP. XX AC AEFC01000241; XX DT 12-MAR-2011 (Rel. 16.03, Created) DT 12-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Geomyces destructans genome: DE internal portion. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_GDe_; KW Gypsy-2_GDe-LTR; Gypsy-2_GDe-I. XX OS Geomyces destructans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Leotiomycetes; Leotiomycetes incertae sedis; Myxotrichaceae; OC mitosporic Myxotrichaceae; Geomyces. XX RN [1] RP 1-7138 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Geomyces destructans genome."; RL Direct Submission to RU (12-MAR-2011). XX DR Genome; AEFC01000241; Positions 51082 43945. XX CC Positions [4513-4896] - Integrase core CC 'CCAAC' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 362..1672 FT /product="Gypsy-2_GDe-I_3p" FT /translation="MSRKTAPNPPPDNDGQPLRMTTRQTATAATTTGPSES FT PPDRIANLENEMRELKAMIAGLVASQTRIPSTPPPAADTPVRTIEADDNED FT EQSEQPAPTVRNTPGTVRLPPVVNARSPSIPVLPRQYTEDPPNRYKKSTIS FT EKITPLSDGIEHTFMQWSASIRDRLVVNEDHYPTDVSRRALIWGTTTGLAK FT TYLEPQYLSVTHAFRSADEMMDLLGSYFLTGNETEQAQNLFDDMQMGEKGH FT SNETFPKFKARFQSAAITGQVTESEWFRYMWNKLTPQFRSRVAVVKDQWKG FT DYLTMVRALTAFDQERRRNGELGSQSKAGELSFRGPARLSSSRFTPVEPAK FT TQPLRPTRSPLVSFQRTPFIPTAPKPDITGCVRTAMPAASGNCFKCGKLSH FT FQDKCPLNATVKEINWEDPETEETWEEAVELQSDENLEENDEA" FT CDS join(1768..4509,4513..5631) FT /product="Gypsy-2_GDe-I_1p" FT /translation="MLSLNELGVQTKALADTGANGYLVINKPFALRLSKAL FT QSPIRKLPYSVPIRGFQRKIQSHVSEYIRLHLTVEGRRIYNCPFVLLDLGA FT QDVIIGKKLFSRFRILLDSAKSRLVWPKEYPPTPSWSKEITLPYHSRETSI FT RHPAHQSDANRRDLAIHHEDRRHAGGRSMDMKVINPEYKEATPMPSSPPRN FT PKVPPPPMRTPTHGFVPVQICEIGTNAFYHEMMRSDAEFFQTSIYEIDQII FT QQRELEEDDETTSLIQEKLPFVHRNYADVFSKSESDRLPPHRIYDHRIQLE FT APLPNAFLPLYRQSTEELKATKQYLLENLDKGFIANSDSPFASPILFVKKA FT DGKLRFCINYRKLNGLTRNYPYPILRIDELLGRVSKAKIFTKLDIRQAFHR FT IRMHPDSEEYTTFQTRYESYECKVLPFGLTNGPATYQRYMNDVLMSYLDDF FT CTAYLDDILIYSEDPSEHIDHVHKVLQRLREARLQADIKKCEFNVTRTKYL FT GYILTTSGVEADPDKIEPLRNWIRPTTVTGVKSYLGFCGFYRQFIRDFGKI FT AKPLTAITRPSEPFLWTDECEAAFEELRRQLLTVQALHHFDPELPTKLETD FT SSDGVVAGVLSQEHPDKRWYPVSFYSHVLTGHEINWEFHDKELFAIVKAFA FT 
KWRPELASVRHRVDVYTDHRSLEYFMSMKVLTAKQVRWMELLSDFNFCITY FT TAGKNNQKADILSRRDQDLVMQQKVKLDSRARVLLGPARLHPRISAELANS FT YVASINAIDPLKAKMPFDLIEALCVDNRKSFAETRAKQPLPDGYALADQLL FT LYKDRLCVNRHTELCTKLIREAHDQVSTAHPSGRKTYQLLALKYHWIGMEA FT DCLRYVCNCVACRLSHTNVTKQQGFLHPLPIPEYPMQHLTMDFKEFPKDKH FT RFDSILVFMDLGKGSVSIPCHKTTDARQMAQLFIEWIYRFGHTPESIISDH FT GPQFISAFWREFCRIIGVKIKLSTAYHKQTDGQTEIMNRYIDQRLRPFVSH FT YQDNWSELIPIMDRVQMTLPHTSIGMTPYRLKYGTDPRTSWDWETPKATTP FT REVLNQTEAKAVASRMHSTWVYAKGNMVKAQEDMSRYANRCRRAIDWDVGD FT KVYLSTKNLKNHRPSRKLGQQWVGPYTVLEQIGHAFLDLPTGSQIHDVFSP FT DILIKAPNNPLTGQAAPEPSSEVVDGLEEWQVEKILTVRTRRKQLQYQVSW FT VGYDPDPEWYPASNFKNSPYKIREFHAEYPNLPGPPQDLPGWITDWETGEP FT SYDHERHGKAEENSVVAIYDTKLNTL" XX SQ Sequence 7138 BP; 1929 A; 2093 C; 1777 G; 1339 T; 0 other; ttatacaaaa aagatcacgc ctgccgggga cggccgtacg caaataatat cccacaactg 60 gaacgactca ggagaactgt cgaaacacca ccgaccgtga ctcggtgcca tttcgaagta 120 gagacctgac gacttcaaat agccgtgatt ccagagatct cacggacagc cagtggatca 180 gggtttgatc ctgcagagca agctggtaaa gtcgccgacg cgttggcgcc gaataggata 240 cgagatccta aagaaccccc tcagcacaga ggtaattgtg aacaagagtg ctgatagcta 300 cgagctaccg gaagtcagac tcttctgtat cgactgcgac tcgatacacc cccgaatcaa 360 gatgtcgcgc aagacggcac caaatccacc acccgacaat gacggtcaac cgctacgaat 420 gacaacgagg caaaccgcca cagcggccac gactacaggg ccatccgaaa gccccccgga 480 tcgcatagcg aacctcgaga acgagatgcg ggagcttaag gcgatgatag ccggccttgt 540 agccagccag acccgaatac catctacgcc gccaccggcc gcggacacac cagtgcgtac 600 cattgaagct gacgataacg aagacgaaca gtctgaacag ccagcaccga cagtgcgaaa 660 cacgcccggc actgttcgac tgccacctgt cgtcaatgcg cgctctcctt caatcccagt 720 tctaccccga cagtacaccg aggaccctcc gaatcgatac aagaagtcca ccatctctga 780 gaagattacg ccgctcagcg atgggataga acatactttt atgcaatgga gcgcgtcgat 840 ccgagaccga ctcgtagtca atgaggacca ctaccctacc gacgtctcaa gaagagcgct 900 gatatgggga actacgaccg gcctggcaaa gacctatttg gaaccccaat acctgtccgt 960 tacacatgcg tttcgcagcg ccgacgagat gatggaccta ctaggttcat acttcctcac 1020 cggtaacgag acggaacagg cccagaacct cttcgatgac atgcaaatgg 
gagagaaggg 1080 acactctaac gaaaccttcc ctaaattcaa agctcgattc cagagtgctg ctatcactgg 1140 acaagtcacg gagtcggaat ggttccggta catgtggaac aagttgaccc cgcaattccg 1200 aagccgcgtg gctgtcgtga aggaccaatg gaaaggcgat tacctcacca tggtccgagc 1260 cttgaccgcc ttcgaccagg agcgacgccg gaacggcgag ttgggctccc agtccaaggc 1320 gggcgagttg agcttccgag gaccggcccg cctgagctca tccaggttca cgcccgtgga 1380 gcccgccaag acccaaccgc tgaggccgac aagaagtcct ttggtttcct tccagaggac 1440 tcctttcata cccaccgccc ccaagccgga catcacaggc tgcgtccgga ccgccatgcc 1500 ggctgcatcc gggaactgtt tcaaatgcgg aaagctcagc catttccagg acaaatgccc 1560 cctcaacgct actgttaagg aaatcaactg ggaagacccc gagactgagg aaacctggga 1620 agaagctgtg gaactccagt cggacgaaaa tctggaggaa aacgacgagg cctagtaaaa 1680 ggctccttcc taggtctacg caaatgtgat ctgcgagatc tcgacctgca agaactcttg 1740 ggcgtaacct agttcttagt cgacacgatg ctgagtctga acgaactcgg tgtgcaaacg 1800 aaggctctcg cagacacagg agcaaacgga tacctcgtca tcaataagcc cttcgcgcta 1860 cgactgtcga aggcgcttca gagcccaata cggaaactac cctactctgt acccatacgt 1920 ggttttcagc gaaaaatcca aagtcacgta tcggagtata ttcggttgca cttgacagtc 1980 gaaggacgcc gaatatataa ctgccccttt gtgctccttg acctcggagc gcaagatgtg 2040 ataattggga agaagctttt cagccgcttc cgaatactac tcgactctgc aaagagccgg 2100 ttagtatggc caaaggagta cccgccaact cctagctgga gtaaggagat caccttgccc 2160 taccactcgc gcgaaacgtc cattcgacat ccggcacacc agtcggatgc gaatcgtagg 2220 gatctcgcga tccaccacga ggatagacgc cacgcaggcg gacgctcgat ggacatgaaa 2280 gtgataaatc ctgaatataa ggaggccacc cccatgcctt cgagcccacc ccggaacccc 2340 aaggttccac cacctccgat gaggacgcca acgcatgggt ttgtgcctgt gcagatctgc 2400 gagatcggta caaatgcctt ctatcacgag atgatgcgga gcgacgccga gttcttccag 2460 actagtatat atgagattga tcagatcatt cagcagagag aattggaaga ggacgacgag 2520 acaacatcac tcatccagga gaagctcccg tttgtgcatc gaaactatgc cgacgtcttc 2580 tcaaagtcag aatcggatcg acttccgcca caccgcatat acgatcacag gatacagctc 2640 gaggcccctc taccaaatgc ctttttaccg ctctaccgac aaagcactga ggagctaaag 2700 gctactaagc aatacctgtt ggagaacctc gacaaaggct tcatcgccaa tagtgatagt 
2760 ccctttgcgt cgcctatact attcgtgaag aaagcggacg gcaaactgag gttctgcatc 2820 aattaccgca agctgaacgg ccttacccgc aactatccgt accctatact gcgcatcgac 2880 gaattgctag gccgggtgtc gaaagcaaag atcttcacga aattagacat ccggcaagcc 2940 tttcatcgta tacgaatgca ccccgactcc gaagagtaca caacctttca gactcggtac 3000 gaatcctacg aatgcaaagt gctaccgttt ggcctaacca acggtcctgc aacctatcag 3060 aggtacatga atgacgttct catgtcatac ttggatgatt tctgtactgc ctacctagac 3120 gacatcctaa tatactccga ggacccttcg gaacacatag atcacgtaca taaagttttg 3180 cagcgtctac gagaagccag actccaagcc gatataaaga aatgtgagtt caatgttact 3240 aggacgaagt acttaggtta tattctcact acttccggtg tggaagcgga ccccgacaaa 3300 atcgaaccac ttcgtaactg gatccgaccg actactgtaa cgggagtgaa gtcctaccta 3360 ggattctgtg gtttctatcg acaattcatc cgagatttcg gcaagatagc gaagccactc 3420 acggccatta ctagaccttc ggagccgttc ctgtggacgg acgaatgcga ggctgctttt 3480 gaggaattgc gacgacaact cttgacagta caggcacttc accacttcga ccccgaattg 3540 cctacgaagc tggagactga ttcctccgat ggagtagtag ctggagttct gtcccaagaa 3600 caccctgaca agcgatggta tcccgtaagc ttctatagtc acgtccttac cggacacgag 3660 atcaattggg agttccacga taaggagctt tttgcgatcg tgaaagcctt tgctaagtgg 3720 aggcccgagt tggcttcggt acgacaccga gtcgacgttt acactgacca ccgatcgctc 3780 gaatacttca tgtcgatgaa ggttctcacg gcgaagcagg tccggtggat ggaactactc 3840 tccgacttca acttctgcat tacgtatacg gcaggaaaga ataaccagaa ggcagatatt 3900 ctaagccgaa gagaccaaga cctggtaatg caacagaagg tcaaacttga tagtcgtgcg 3960 agggtgttgc ttggcccagc tcgcctacac ccccgtatca gtgccgagtt ggccaacagt 4020 tatgtagcat cgatcaatgc catcgacccc ctaaaggcga agatgccctt cgatctgatt 4080 gaggcactat gtgtagacaa ccggaagtcg tttgccgaga cacgcgcgaa acaaccctta 4140 ccggatgggt acgccctggc cgaccaactc ctcctgtaca aagaccgcct atgtgtgaat 4200 cgtcacacag agctttgtac gaaactcatc cgggaggcac acgaccaagt gtcgaccgcg 4260 cacccaagtg gcaggaagac ctaccagttg ctcgccctaa agtatcattg gatcggtatg 4320 gaagcggact gcctccgata cgtatgtaac tgcgtagcgt gccgactttc tcacaccaat 4380 gtcacgaagc agcaaggttt cttacacccc ctaccaatac ccgaataccc tatgcaacac 4440 
ctaacgatgg actttaagga gttcccaaag gacaagcaca gattcgacag catcctggtg 4500 ttcatggact gactcggtaa gggttcggtg tccataccgt gtcacaagac caccgatgcc 4560 cgccaaatgg cgcaactatt catcgaatgg atatatcgat tcggacacac tccggagtcg 4620 atcatcagtg atcacggacc ccaattcata tcagctttct ggcgagagtt ctgtcgaatc 4680 ataggggtca aaattaagct gtcaacggcc taccacaagc agaccgacgg acagacggaa 4740 atcatgaatc ggtatattga tcagagactg aggcctttcg tatcccacta ccaggacaac 4800 tggagtgagc ttatccccat catggatcgg gtgcagatga ccctacctca tacctcgatc 4860 ggaatgaccc cctaccggct caagtacgga actgacccgc gcacaagttg ggactgggag 4920 acgccaaaag ccacgactcc cagggaagtc ttgaaccaga cggaagccaa agcggtggcg 4980 agccgaatgc attccacatg ggtttacgca aaggggaata tggtcaaagc acaagaggac 5040 atgtctagat acgcgaatcg atgccgacga gctattgact gggacgtagg agacaaggtt 5100 tacctatcaa ctaagaacct gaaaaaccac cgtcccagcc gcaagttggg gcaacaatgg 5160 gtcggaccct atacggtatt ggagcaaatc ggacacgcct tcctagacct accgacgggt 5220 tctcaaatac atgacgtttt ctcaccagac atactcatca aagctccgaa taacccgcta 5280 actggtcaag cagcgcctga gccaagcagc gaagtcgtcg acgggctaga ggagtggcag 5340 gtcgaaaaga tcctcaccgt gaggacccga cgaaaacaac tccaatacca ggtatcctgg 5400 gtaggatacg atccggaccc ggaatggtac cccgcctcga acttcaaaaa ctccccctac 5460 aaaatcaggg agttccacgc ggagtaccca aacctgccag gaccgccgca ggacctaccc 5520 ggttggataa cagactggga gacaggcgag ccaagctatg atcacgagag gcacggcaaa 5580 gcagaggaaa acagcgtcgt tgctatctac gacactaaac taaacaccct ctaagccagg 5640 gcaccctccg agccgtggcc cttgcggagg ccacccttcc caaatgtctc accggcgtcg 5700 caacaagcgg ctaccaccat gagaaacgaa gggtcgaggt gaccccacgc gaaagaccac 5760 gcccgagttc cctccgacac agggccaacc ccctactctc cgtaggaggc cccaacccca 5820 tcctccacct tcttctcctt ctccttctcc ttctccttcc cgcggggggc aggcgccagg 5880 accgacttcg cagcgacggc gacgacggca gcacccgcct ccaccgcggc caccacctcc 5940 ttcaggaggc cgggaacccc caactgctcc gtccgaagac ggcgcagcga aacacgaaca 6000 acctgacctg ggtcagcact gatacaccac caggaaagaa gattcactca cctcaagctt 6060 cgagtcgaca gactcgaggg cggcgcgggt tgcgcgggcc tcctcgtaga ggtcccgagt 6120 tgcctcgttc 
aggaacttgg ggacggaggc agcggcgagc aacaggacgc ggttggcctc 6180 gagggccgcg gcgacgacgt cggagtcgag gaggggggtt gccactttgg cggcgaggag 6240 gtcggcgaac tcgggcccaa gaaaccaggg cagctattcc caccaagtca acactgaaca 6300 aacccaaaaa cgagccaaat gcaaaccgaa aagcaacagg aaacccaccg gaacacacgc 6360 agaatgctgt gaagtgcagt aactgcactt cgagctggtc ggggaccaga cacaagtatg 6420 ggccgcagcc ccacgggaga ttgcccgaag gcagcggacg cacaggatgc ggaaggaatc 6480 cgaaagggcc ccagagggca cgggaaccgg catcttcaaa ttggaaatct gcgatttccc 6540 gtcagcccag cccgagttat caccaccgcg ccagcgcggt tcctcaacct accgaatcgt 6600 tcgaagttgt tttgtatcta aaaacaatca attagtatcc ttctcgacgt ccggaggggt 6660 cacttaccag tctcgcgagg gggggtcggc gttgttgttg ttgtggagaa aacaacaagc 6720 tgtgtgcgaa gctggaaatc atattctggc ttacaacaaa gccgtccgaa tcgggttagt 6780 gcgaaactcg cctttgacct cgaaattcta cgacgacaca taccgcgcgg actcgattcg 6840 ataccaatac ggacctttac cacactaatc accgcacgca caacaaagat gcgactctca 6900 gcacacagat tcagaccact tccgccaccc atccggccgc ctaccatagc cgtctcgaca 6960 aacaaacggc attccagacc ttgcgcccgc acctcacaca acaacccaaa caagggttcg 7020 tcgcggtcgc tcctcggtct cgattcttcc tccgcaccac acaacccata caataactca 7080 cacacggcgt gggcttgcat gctttcggga caaagcacct tctgaaggct gggggtag 7138 // ID TCN6-LTR repbase; DNA; FNG; 275 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 30-MAR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR retrotransposon - LTR consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW TCN6-LTR. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-275 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-275 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. 
et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-275 RA Gentles A. and Jurka J.; RT "C. neoformans LTR retrotransposon TCN6."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX SQ Sequence 275 BP; 86 A; 46 C; 76 G; 67 T; 0 other; tgtcagaatg attgatccaa cctggatctg gagagatcag tagggatctg gaggcatggg 60 aattgttatg gaggcatggg aattgttggt ggaggcatgg gaattgttgg tggtggcatg 120 ggaagaaggg gagattagat tacataagcc agaacactta caatatcgaa acaaataaag 180 acaaaggatt gtagaatagt ttccatgcaa cttagacatc tctggcaatc gacaccgcgt 240 gctcaagctc tctctagaca actctacatc tgaca 275 // ID Gypsy-4_MLP-LTR repbase; DNA; FNG; 175 BP. XX AC AECX01002004; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_MLP_; KW Gypsy-4_MLP-I; Gypsy-4_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-175 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002004; Positions 54728 54902. XX SQ Sequence 175 BP; 45 A; 44 C; 30 G; 56 T; 0 other; tgtaataacc acatgaggtg ttagtagcat tatgctaata gcgtgtatga gcggactcta 60 tagttttcac tcctctccgt tgtacttgat cctcacactg caatctagtt aatagtacca 120 ataggccttc atcctctatc tcatcacatc caccgtgtta gtccgggtca taaca 175 // ID PCretro6_LTR repbase; DNA; FNG; 369 BP. XX AC DQ097838; XX DT 08-MAR-2006 (Rel. 11.02, Created) DT 08-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Phanerochaete chrysosporium RP-78 Ty1/copia LTR retrotransposon DE (LTR). 
XX KW Copia; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; PCretro6_LTR. XX OS Phanerochaete chrysosporium OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Corticiales; Corticiaceae; Phanerochaete. XX RN [1] RP 1-369 RA Novikova O., Fursov M., Shutov O. and Blinov A.; RT "Divergent groups of LTR retrotransposons from Phanerochaete RT chrysosporium."; RL Direct submission to GenBank (2005). XX DR EMBL/GenBank/DDBJ; DQ097838; Positions 40 408. XX SQ Sequence 369 BP; 77 A; 111 C; 92 G; 89 T; 0 other; tgttgaagag cgggcgtgaa gaccgacgcg cgacatgcgc gggcgggcgg gcgcgcgcct 60 gtcagacccc tgacaccgcg gacccggagg actctgcgcc gtgcgcgcag gtcactccgc 120 acctcctgct cggtagacac gttctaccgc gccaccctta cgtgagctcg ctgtaccaga 180 cagcgagcga atattctagg taatatacgt cattttctgt agctttcccc gacgtgtatg 240 tagctagagt ataaatacaa tctcgaggac gccgccctcg tcgttctcgt ccgttcactc 300 gtctctacct gcgtgaataa gtctattcat tttctactct actacaagta agtctatata 360 attttaaca 369 // ID Gypsy-117_MLP-I repbase; DNA; FNG; 7087 BP. XX AC AECX01000853; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version 1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-117_MLP_; KW Gypsy-117_MLP-LTR; Gypsy-117_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-7087 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000853; Positions 262024 254938. XX CC Positions [5597-5857] - Integrase core CC 'TCTAT' target site duplication CC LTRs are 99% similar to each other. CC Includes a Mariner insertion at approximate positions 172-1095 CC (masked by "x").
XX FH Key Location/Qualifiers FT CDS 1136..4306 FT /product="Gypsy-117_MLP-I_1p" FT /translation="MSKPEYYLDDPEVLLRNLRKGKQAEVIETPPAPPPAK FT FSWRPETVVTGSPAPTDRILYASSAARETNTAELTRLRFSPPPSSPSHSEA FT TIVEQHFKKLLEQKRGINTPHPPGHLSTETPTVSRPTLDVTSVTMTGETSN FT PNPGNEDSVDSLRQQLEAMRLQNERLERVEKENLNLQRLVRELLEGKAQHS FT QAAEAASAPPSANMPNVFDRYIRRTPASPSPLPRDQSGPEARQQGQQAPPL FT ESTPVGQTIPGLGNRAASPPRPFSPRQAPSPHYQSVPAISPQYARATSVPH FT MQSFHQAVPEQFLQATPILTTANCVKLSDLPKFTGKFGVPADLFHWRSLIK FT ETFEVKNVIDDGERMKLLGSLLNDEAMAAWYQSNKESLRHGSWNEAMETMA FT MGTLPHAWLTDTEEAFRRLEMKPQESFNAYVARAQALHRLIKRQGQVTDRH FT LAQHITWGAPQLFRDMVDKDKLLFADPFVFPVFKDAADGIWRILIHSKLLP FT DPTAKPKAYSTSAGSTFSNNRQTITPRARTEEERADNGWRYHEYLQRNGIC FT AVCKEKCNDPRCNKRSNRFLSVPATFDAGPRPSRTTPAARLNPPGAATQRP FT AGRPAPTTTSKQIASVETFPDMLVEDIAAYEEADRELQRQIEAEVGMETPC FT VPQVPQSPIILELTLNGKPLRALIDSGAGTNLLAAKVARNLHIPGRPLVSP FT VEVRLAVATEGKPIVLTEFAFANLKSEEPNIRFGAVFFKLAPLGGTYDMIL FT GSPFLSKFKLDVSLHRRCVIHTSSGKILYEKEMKNELQRILCAVENLTKLS FT DVQELSKKEDSMLKEFEDLLPAELPPVEEGEAPAEIFPENMPDQSRQVRHK FT IVLTDPNVVINEKQYGYPIRHREAWRKLLDQHIKAGRLRRSHSQYASPSML FT IPKKDPSELPRWVCDYRRLSKFTVKDRAPLSNVDESVRLVATGKVWSIVDQ FT ISSFFQDRMREEDIPLTAVKTPWGLHEWVVMPMGLTNGPATHQARNEEILG FT DLVGRICVVYIDNIVIFSQSVEEHEAHVRMVLEKLREAKLYCSIKKSKLFR FT " FT CDS join(4445..5593,5597..6607) FT /product="Gypsy-117_MLP-I_2p" FT /translation="MKKFIDGLSHYVATLTPLTSTTLKGKPFHWGKAEEDA FT FNNIKRLITTLPVLHNLDYDSGEPIWLFTDASGSGLGAALFQGAEWDTASP FT VAYESRTMNPAERNYPVHEQELLAVINALNKWRLLLLGMKVNVMLDHHSLT FT TLLTQRNLSRRQARWLETLSQFDLDIRYLKGQGNSVADALSRRDDMAPCEI FT KSGFGDEELKAIREGSVQDAFCVKLRKVLPLREDCIVKDDLLYLDGRLVIP FT NYAGLRQNFINQAHAALGHLGPFKTLTRLRHTFFWPGMTKDVEKQLKTCDS FT CQRNKARTTTISGKLQANPIPAHPMEAITIDFIGPFPKVSGYDMILSCTCC FT LTGFVRLIPASQRDTAERSATRLFNSWSSIFGLPESIIGDDKAWTSRFWQR FT LHDLLGVRVKLSTAYHPQADGRSERSNKTIGQILRLSVSNKHGKWLESLPS FT AEFAINSAVNVSTGVSPFQFVYGRIPRLFPVELPSKEEDEDVHAWIQRRQS FT EWASWRDHLWGSRVDQAVQYNKRRRVGDTLKRGDLVMIDSHDRQQVVGGAS FT RGVGKLRARYDGPYEIQEVLNGGRDFKLKLQADDSTYPIFHISKLKAYHSK FT 
AEDADGETIATMKMVSSHPEPIPEGPTKEPVGHATKRDEGVCINMDALTPP FT QCECDNFISNPIHENVGKQVSLPNNSFHFRSEDDGHELETQRTTCSHCGDF FT VKSENNPMEQQLGDHWGMLG" XX SQ Sequence 7087 BP; 1752 A; 1430 C; 1436 G; 1545 T; 924 other; ctttttttaa cttatcaact ttgaaaccat ctattcggag ttcaatatca gctatcaatt 60 tttttttttt ctcatcagtc tacaattttt tttatcttcg gtcatctgtt ctgtcatctg 120 tcaatcggcc aataggtctc tgtcaaagca aatctgaaaa gacggcttgt axxxxxxxxx 180 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 240 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 300 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 360 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 420 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 480 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 540 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 600 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 660 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 720 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 780 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 840 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 900 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 960 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1020 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1080 xxxxxxxxxx xxxxxtcctg ccaacccaaa ccttacttgt tctgtattcc accggatgag 1140 taagccggaa tactacctcg acgatcccga agtactgctt cgaaacctca gaaaaggcaa 1200 acaagccgaa gtcatcgaga ctcctccagc accaccgcca gcaaagtttt cctggcggcc 1260 ggagacggtc gttactggat cgccggctcc taccgatcga atcctttacg catctagtgc 1320 ggcaagggag acgaacacag cggaattgac tagattgcgt ttttcaccac caccgagttc 1380 accttctcat tctgaagcca ctattgttga gcagcatttc aagaaattac tagaacaaaa 1440 gcgcggtatc aacacgcctc atccacctgg acatctttcg acagaaacac caacggtatc 1500 tcgtccgaca cttgacgtaa cttctgtcac 
gatgacaggc gagacgtcga atccaaaccc 1560 tggaaacgaa gactctgtcg actcgttacg ccaacagctg gaagccatga gacttcaaaa 1620 tgaaagattg gaaagggtgg agaaggaaaa cttgaacctt caaaggttgg tacgggagtt 1680 actggagggt aaagcccaac actctcaagc agcggaagcc gcatctgcac caccttcggc 1740 gaacatgcct aacgtcttcg accggtacat tcgacgaact ccagcgtcgc cttcgccctt 1800 acctcgtgat cagtctgggc cagaagcaag acagcaaggt cagcaagcgc cgcctctcga 1860 gtctacacct gtaggacaga cgataccggg cctaggaaac cgtgcagcaa gtcctcctcg 1920 acctttttct ccgaggcaag ccccatcgcc tcattatcaa tcggtccctg ctatatctcc 1980 gcagtacgct cgagctacta gtgtaccaca catgcaatct ttccatcagg cagtccctga 2040 acaattcttg caagcaacac caatcttgac cacggctaat tgtgtcaagc ttagtgatct 2100 cccgaaattt actggcaaat tcggagtacc tgccgacctt tttcattggc gtagtttaat 2160 caaagagaca tttgaagtca agaatgtcat tgatgatggc gagcggatga aactgcttgg 2220 atcactgctg aacgacgagg ccatggcggc atggtaccag tctaacaaag aaagcttacg 2280 tcatggttca tggaatgagg cgatggaaac aatggcaatg ggaaccttac cccatgcttg 2340 gctcacggat acggaagaag cgttcaggag gctagaaatg aaacctcaag aatcttttaa 2400 tgcttatgtg gcgcgtgctc aagcgttaca tcgattgatc aagagacaag gtcaggttac 2460 ggatcgacac ttagctcaac atatcacctg gggcgcccct caactctttc gagatatggt 2520 tgacaaagac aaactcctgt ttgcggatcc ctttgtattc cctgttttca aggatgcagc 2580 agatggtata tggcgcatct taatccatag caagttatta cccgacccaa ctgctaaacc 2640 caaagcgtac agcacaagcg ctggatcgac tttcagcaac aatcgtcaaa caatcactcc 2700 cagagctcga accgaagagg agcgtgctga taatggttgg cgttatcatg aatacctgca 2760 aaggaatggc atttgtgccg tttgcaagga aaagtgcaac gatccgagat gtaacaaacg 2820 atcaaaccgg tttctatccg tccctgctac tttcgacgcg ggaccaagac cctctcgtac 2880 tacgcctgca gcaagattga atccaccagg agctgcaacg caacgcccag cgggtcgccc 2940 agcgcccaca accacttcca agcaaatcgc gtcagtagag acctttccgg atatgctggt 3000 ggaggacatt gctgcgtatg aagaggcgga tagagagtta caaagacaaa ttgaagctga 3060 agttggaatg gagactccgt gcgtaccaca agttccgcaa tctcctatta tcctggagct 3120 gaccttgaat gggaaacctc tccgtgcgtt gatcgattcg ggtgcaggga ccaatttgtt 3180 agcggcgaag gtagcacgta atctacatat ccctggtcga 
ccactggttt ccccagttga 3240 ggtacgcttg gctgttgcta cagagggaaa gcctattgtt ctgacggaat tcgcttttgc 3300 taatttgaaa agtgaggaac cgaatatacg atttggggcc gtattcttca aactggcgcc 3360 gctaggagga acatatgata tgattcttgg ttctccgttt ttatctaaat tcaaattaga 3420 tgtatcgtta cataggcgtt gtgttattca tacctcaagc ggaaaaattt tgtatgagaa 3480 agaaatgaaa aatgaacttc aaagaatttt gtgtgcagtt gagaatttga caaaactgag 3540 tgatgtgcaa gagttaagta agaaagaaga tagtatgctt aaggagtttg aagatttgct 3600 cccagcagaa ctacctccag ttgaggaagg ggaagcgccg gccgagatct ttcccgagaa 3660 tatgccagac caatcgcgac aggtacggca taaaatcgtg cttactgacc ctaatgtagt 3720 tatcaacgaa aagcaatacg ggtatcctat aagacaccgc gaggcctggc gaaaactatt 3780 ggatcaacat ataaaggctg gacgactgcg acgatctcac agtcaatatg catccccttc 3840 gatgcttata ccgaagaagg acccatccga attgccacga tgggtttgtg actataggag 3900 gttgagcaaa ttcacggtga aggatagggc gccgttgtcc aatgtggatg agtcagtacg 3960 gttggtagca acggggaaag tttggtcaat agttgatcaa atcagctcat tctttcagga 4020 tcggatgcga gaagaggaca tccctctgac tgcagtgaag accccatggg gactacacga 4080 atgggtagtg atgcctatgg ggttaacaaa cggtcccgca actcatcagg cacgtaatga 4140 agaaatttta ggtgacctag ttggaaggat ttgtgttgtt tatattgaca acattgtcat 4200 tttttcgcaa tctgttgagg agcacgaagc ccacgtaagg atggttctag aaaaattacg 4260 tgaagcaaaa ctgtactgct ccatcaagaa aagcaagctc tttcgatgac aaataaactt 4320 tttaggccac gaaatcagcc aagcaggagt ttgcccagac gatgcaaagg tcgaaaaaat 4380 ctcaaaatgg tcatcgccgt catcttcaaa gcaactcctc aagttcttag gcacagtcta 4440 atggatgaag aaattcatcg acggattgtc tcattacgtt gctactctca caccactgac 4500 aagcacaact ttgaaaggta aaccattcca ttggggaaaa gctgaagaag atgctttcaa 4560 taacatcaag cgcctcatta caactttacc ggtcttacac aatctcgatt atgactctgg 4620 ggaacccatt tggcttttca ccgatgcaag tggaagcgga ctgggtgctg cattgtttca 4680 aggcgcggag tgggatacag cttctccagt agcttatgaa agccgtacta tgaatccagc 4740 cgagaggaat tatccagtgc atgaacagga gttgttagcg gtaatcaacg ccctcaacaa 4800 atggcgattg ttgctattag gcatgaaggt caatgttatg ttggaccatc attcactgac 4860 gactcttctg acgcagcgaa atctgagcag gagacaggct aggtggttgg 
aaactctgtc 4920 acagtttgac ttggatatca ggtatctcaa aggacaaggt aactccgttg cggatgcctt 4980 atcacgcagg gacgacatgg ctccatgcga gattaaatca gggtttggcg acgaggagct 5040 aaaggcaatc cgtgaaggtt cggtgcagga tgcattctgc gtcaagttac gcaaggtgtt 5100 gccgctacgg gaggactgta ttgtcaaaga tgatcttctg tatcttgacg gacgcttggt 5160 cataccgaat tatgcgggac tgagacaaaa cttcattaat caagcacacg ccgcgctggg 5220 gcatctagga cctttcaaga ctcttactcg gttgaggcat actttttttt ggccaggtat 5280 gactaaagac gtcgaaaaac aactaaaaac atgtgactcc tgtcaacgaa acaaggcaag 5340 aacgactaca atctccggca agctacaagc taacccaatt cctgctcatc caatggaagc 5400 gatcacaatc gacttcatcg gaccgttccc aaaggtatcc ggctacgaca tgatcttatc 5460 ttgcacgtgt tgtctaacag gttttgtacg attaattcct gcatctcagc gtgacacagc 5520 cgagcgatca gccacacggt tattcaactc ttggtcgtca atcttcggtc tgccagagtc 5580 catcattggc gattgagata aggcgtggac ctctagattc tggcaacgct tgcacgactt 5640 attaggagtg cgggttaaat tgtctacggc ttaccatcct caagcagacg ggcgcagtga 5700 acggtcaaac aagaccattg gccagatctt gagactctca gtttcaaaca aacacggcaa 5760 atggctcgaa tccttacctt ctgccgaatt cgccattaat tccgcggtga acgtttcaac 5820 aggggtctct ccgtttcaat ttgtgtacgg acgcatccca cggttatttc cggttgaatt 5880 accatcaaaa gaagaagatg aagacgtaca tgcatggatt caaagacgtc agtccgaatg 5940 ggcatcctgg cgggatcacc tctggggtag tcgagtggac caggcagtgc aatacaacaa 6000 gcgccgccgg gtgggagaca cattgaagag gggtgatctg gtgatgattg acagtcatga 6060 tcgacaacag gtggtaggtg gcgcaagtcg tggcgttgga aaactccgag ctcgttatga 6120 tgggccgtac gaaattcagg aggtgttgaa tggaggtcgt gatttcaaac tcaagcttca 6180 agcggacgat tcaacatacc ctatttttca catctcaaaa ttgaaggcct atcatagcaa 6240 agctgaagac gcggacggcg aaaccattgc gaccatgaaa atggtctcgt cacatccgga 6300 accaatcccc gaaggtccta caaaggagcc cgtggggcac gctacaaaac gtgacgaggg 6360 ggtgtgcatc aacatggatg cgctaacacc cccgcaatgt gagtgtgata attttatttc 6420 aaatcctatt catgaaaacg ttggcaagca agtttctctc ccaaataaca gttttcactt 6480 tcgaagcgaa gacgatggac acgaattgga aacacaaaga accacttgca gtcactgtgg 6540 tgattttgtc aaatccgaaa acaaccccat ggagcaacag ttgggagacc attggggaat 
6600 gctaggataa ttgacaaggg ggtgtgcatc gcattgatgc gctaacaccc ccgcaatgtg 6660 agttcgataa aacattcttt ttctttcaat ttccccgaaa tggctggcaa gcagttttct 6720 ctcttgaata acagattacc gcaacaagga cggaaagaaa aagctcaagt ttctttacga 6780 caaaacctca agttacttac gtcaagaatt catgcttaca atactaactt caagacctca 6840 acactcaaat taaattaatt acttacatca agatttcaag atttcattaa attcaaaact 6900 ttctgtttca aattttactc aattttttac aattcaatca agttttctct tttcatggtg 6960 acgagttatg gtcgaatttt ggagcaagtt ataccatttg atgattgttt ttgtttcttt 7020 gtttcttttc tttctctttt caatttggga tcattaaggg tacgggttta tttttaggaa 7080 gggaggg 7087 // ID Copia-10_MLP-I repbase; DNA; FNG; 4266 BP. XX AC AECX01000965; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-10_MLP_; KW Copia-10_MLP-LTR; Copia-10_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4266 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000965; Positions 285599 281334. XX CC Positions [1555-2055] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1054..4266 FT /product="Copia-10_MLP-I_1p" FT /translation="MVPDRDLFEHYESVSSTVALANGKKIEIIGKGNIRCN FT SADDEVILQALHVPELGCCLISMGALIMDGYSISSYVDDIIMTKKGAGGFI FT GHVVNHVIELDITLGTSSSAPSPRVNISISDYDVLHRQGGHPGSERLKIMY FT DVSEPSGWSCDICKLSKGHRLPYSGHFPSSKSPLDMVHSDLSGKISVPSEG FT GGQYYFKLTDFCTNYKYVYIMKLKSETFSCFLNYKSNVETLHNKKIVSLVN FT DQGGEYLSKEFQKLLKENGTTSHLSAPYTPQQNPVSERGNRTTTEKARTLL FT RQSKLPYRFWAEAVMTAVFLENITPAKITNNKSAYELWFGRSFDYSRLKPF FT GCRAFVLIPKQFRRKFDSTSMSGILLGYQVGMKNYRVRLDDGRVVYSHDVT FT FDTACFPGVENGDQVTETDLDGLFHEDTHLSIQDTVSVLNDAPSDAAPIVE FT VPDSPTEPEQPLPTSPASSPQQIAIAPTSPNKPALPADTSNENPFAEDEDD FT EDEVMKQLTEKARPGWEYHVWSKAPNDICSDIDTFNILPEGSSRRANTVKL FT TNTSPKTYRQALMGTDKTDWQRVITAELNDMDTRKVWDVVILPKGRRAIGT FT TWVFKKKLGGDGELLKYKARPCAQGFSQEFGEDYNETFAPTARLVSLRILC FT AVAAAEDLDVYQMDAVAAFLNASPEERIYMKIPKGLNIEDATENTVLELKQ FT ALYGLKQAPKVWYDTLKAFFKTILLLPSAQDPSVFISHDPKWKCIVHVHVD FT DMVIATNDFKRFHSAITKKFQMDENVDFKYILGMKVTRDRQAKTITLSQGQ FT YIRDLLEDYSMTDCKPVGSPMILNTYLVPGTDEENQQFLDLGISYRRAVGK FT LMYLNNATRPDLSFVVSQLSQHLNKPSIVHWTAFKRVLMFLKGTQDLGLVL FT GGSDLELNAFADADFAACPRTRRSTGGYVTRLGESTVNWTSRKQENVATST FT TEAEYRSAYEGGQDVVWVTNVMKGMGIGQRSSPIFKLDNQGAIALSKNEKF FT KRRTKHVDVKYHWLRELVKTKQLTVEYVPTVDMIADVMTKALSPAKHLNFC FT KALNLHDVIKMGGN" XX SQ Sequence 4266 BP; 1231 A; 944 C; 937 G; 1154 T; 0 other; gataaagaac catccctgaa tctctttggt tatgagccca gcgctccaat aagcgcttat 60 caatcaaaat ttttcgatcg actgtcgtcg atttttcgaa aacatttttc atcttcaatt 120 tactctcatc tcaatcatga acgataacgc gagctcagct agaactcatt ttcccaagct 180 ggccaagacg aattacgtcg aatggtcagg tgacgtcatc gctcacttga tgactgtcaa 240 cttggaggag ttcaccaaca tcgacagtcc acctgtcatt ccaaccaaaa ccgaattgaa 300 cgatgcttct gtgcaaattg ccgtatacaa cgccaagaga aagaaagctg ctggaattct 360 gtatggttgt atcgatcaag acaaccgcgt ccgaatcact tcaaaagacg ctgtctcgga 420 tcctattcaa atctggaaga tactcaagga gcactttcag tcgtcttctg acgaaaatca 480 ggcgagagcc tatctcaagt tcactgacct cacgttcacg aacctcgaat cctacataac 540 tgacactcaa catgcccttg ctggtatgct cgctgtcgat 
ggcattcaac acattgccct 600 gaaatatctc ggggaaacga ttgtcaccaa gcttcctgtc tcgatggaca tcaccaagac 660 gttactccga aaagaacgtc ctttgactcc tgaaaagtct taaactattt agaggatcag 720 ctctcggcca tcaaggacga ggaagagcat gcaactgcta ttgctctggc tgttcaccaa 780 cctagatctc ttcaacgatc tgtcactcaa cgccctctta atcttaaccg accgggcttc 840 gcttattgct ctaacggacg tcataacaat gacgtcaaga gtcacacctc cacagaatgc 900 tatcaagtta atcctgctct tcgacctcct cgtactactc gacctcgcgc cgcagctgtt 960 gccgtagata acactttctt tgaggcgaaa gcttttgttg tggtttcctc tgatcacgat 1020 tcaatcttgt tggatagtgg atgttctcat cacatggttc cagatcggga tctatttgaa 1080 cactacgaat ctgtctcgtc tacggtcgcg ttggctaacg gcaagaagat tgagatcatt 1140 ggtaaaggca acatccggtg caattcagct gacgatgagg tcatccttca ggccctacat 1200 gtaccagaac tcggctgctg tttgataagc atgggtgctt tgatcatgga cggttactca 1260 atctcaagct atgttgatga cattatcatg acgaagaaag gtgctggtgg ttttattggt 1320 catgtcgtta atcatgtcat agaactcgac attactttag gcacgtcaag ttccgctccc 1380 tctcctcgtg tcaatatctc aatctctgac tacgacgtct tacacagaca gggaggtcat 1440 cccggatccg agcgccttaa gataatgtat gatgtgtcgg aaccctcagg ctggagttgt 1500 gatatttgca aactgtctaa aggtcacagg cttccctatt ctggacattt cccttcctct 1560 aaatctccct tagatatggt tcacagtgat ctcagtggta aaatctctgt accatctgaa 1620 ggtggaggtc aatactattt caaactcaca gacttttgta caaactataa atatgtttac 1680 ataatgaaat taaaatcaga aacattttcg tgctttttaa actacaaatc aaatgttgag 1740 actctgcaca acaaaaagat cgtgagtctt gttaatgatc aaggtggaga atatctgtcg 1800 aaggaatttc aaaaattgct gaaggaaaat gggacgactt ctcacttaag tgctccttac 1860 acccctcaac aaaatcctgt ctcggaaaga ggcaatcgaa ccacgactga aaaagcaagg 1920 actttactga ggcaatcaaa attaccttat cgcttttggg ctgaggctgt gatgacggca 1980 gtgttcttag agaatatcac tccggctaag atcacgaaca ataaatcagc atatgagttg 2040 tggtttggta ggagctttga ctattccaga ctcaaacctt ttgggtgtcg tgcttttgtg 2100 ttaatcccta agcagttccg acgaaaattc gacagcacct cgatgagtgg tatactgctg 2160 ggatatcaag tgggtatgaa aaattatcgt gttcgacttg atgacggtcg tgttgtttat 2220 tctcacgatg tgacttttga cactgcctgt tttccgggtg ttgaaaatgg 
agatcaggta 2280 actgagactg atttggacgg tctttttcat gaggacactc atctatcgat tcaagatact 2340 gtctctgttt tgaacgatgc tccttcagat gccgcaccaa ttgttgaagt tcctgattca 2400 ccaactgaac cggaacaacc tcttccaact tcaccagctt catctcctca acaaattgcg 2460 atcgcgccga catcacctaa caaaccggct ttaccagctg acacttcaaa tgaaaatcct 2520 tttgctgaag atgaggatga cgaagatgag gtgatgaagc agcttactga gaaggcaagg 2580 ccaggttggg agtatcatgt gtggtcgaaa gcgccaaatg acatctgtag tgacatcgac 2640 accttcaaca tcttgcctga aggatcctcg cggagagcga acaccgttaa attaaccaac 2700 acttcaccca aaacttatcg acaggctttg atgggcacgg acaaaacaga ttggcaacgt 2760 gtgattactg ctgaattgaa cgacatggat actagaaaag tctgggatgt cgtaatctta 2820 cctaagggtc gtcgcgcaat cggcacgact tgggtgttta agaagaagct aggtggtgac 2880 ggtgaactgc tgaagtacaa agcgcgaccg tgtgctcaag gcttttctca agaattcggc 2940 gaagattaca atgaaacctt tgctccgaca gctcgactag tctctcttcg aattctctgt 3000 gctgtagcgg cggctgaaga cttggacgtc tatcaaatgg acgcagtggc tgctttcttg 3060 aacgcttcac ctgaagaacg aatttatatg aaaattccca aaggactcaa cattgaggat 3120 gcgacagaaa acactgtgct agagctcaaa caagcacttt atggtttaaa acaggctcca 3180 aaagtgtggt acgatacgct gaaagctttc ttcaaaacta ttcttctctt accgtcggct 3240 caagatccaa gcgtcttcat ctctcatgat ccgaagtgga agtgcatcgt acatgtacac 3300 gttgatgaca tggtcatcgc gactaatgat ttcaaacgct ttcactctgc cattacgaag 3360 aagtttcaaa tggacgagaa tgtggacttc aagtacattt taggcatgaa ggtgactcgt 3420 gatcgacaag ccaagacgat tacactctca caaggtcagt atattcgtga tctacttgaa 3480 gattattcta tgaccgactg taagcctgtt ggttcaccaa tgatcctcaa cacctatctt 3540 gtaccgggaa cagacgaaga aaatcaacaa tttctggact taggtataag ttatcgtcga 3600 gcagtcggaa agctgatgta tctaaacaac gcaacacgcc cagatctgtc ttttgttgtg 3660 tctcaactat ctcaacacct caacaaaccg tcaatcgtac attggacagc attcaagagg 3720 gtcttaatgt ttttgaaagg aacgcaggat ttaggtttag tactaggagg tagtgatttg 3780 gagttgaatg cgtttgcaga tgctgatttc gcagcatgtc caagaacgcg acgctcaaca 3840 gggggatatg tgacaaggct tggtgaaagt acggtaaatt ggacaagtag gaaacaagaa 3900 aatgtggcaa catcaacgac ggaggcggag tacaggtctg cttacgaagg aggacaagat 
3960 gtagtttggg ttacaaatgt gatgaaaggt atgggtattg gtcaaagaag ttcaccaatt 4020 tttaaactag ataatcaagg tgccattgct ttgtcgaaga atgaaaagtt caagagaagg 4080 accaagcatg tagacgtaaa gtaccattgg ttacgcgaat tggtcaagac gaagcaattg 4140 acggtggaat atgtgccaac agtagacatg atagctgatg ttatgacaaa ggctttatca 4200 ccggcaaaac atctcaactt ttgtaaggca ttaaatttac atgatgttat caagatgggg 4260 gggaat 4266 // ID Copia-6_MLP-I repbase; DNA; FNG; 4618 BP. XX AC AECX01000139; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-6_MLP_; KW Copia-6_MLP-LTR; Copia-6_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4618 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000139; Positions 89779 94396. XX CC Positions [2070-2570] - Integrase core CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS join(363..2720,2724..4604) FT /product="Copia-6_MLP-I_1p" FT /translation="MTTVASATHTSHSSHLLGCIPKLGDTNKVDWTIGIKT FT YLRSRKLWKYVERDIKLSGSDLKDPDKVETLEAERAGVLEVIRATVLPTRL FT PLICGIEDPKRAYEVLLEQVSQDDGLEIATLIAKVANTRYTGNESVSSFLD FT GINDLHTRLAEATSSDNDLKISDKLLAVFLLLSFPGDQFGTIRDQLFGDMK FT NLSTSKVMSRLRTKSALSTVDEVTIAMNAMAKPFHAPNKPNSAQPNPYRTN FT KSPNAPCILREHWQFNHTNGSCSKQIQRRMGNAGNQIIRPRNNAATNGNVS FT DAEKIKRYNQLTAAGVFGYNTTVPDSAPNPPAVSTPTADDNGPTCQFATSY FT NVVAIDADPNEASVNLSQTYPQASEVHRPTLADTACNRHMFGDAMLLDDLH FT EVAPVWINVANANQSSRIMANRMGIARLSAIDNKGLSVVVDIPNVLYSPDL FT PANLISITSLYEAGYKIVDPYYGTNVSDVNLYLSKDDHIIPAFKDDGPGGF FT WRFYHHSEPRAYSTVSSPSPDPNLWHLRFGHLNHRSTRDIMEDVVRFNPNI FT PSNCEACTLGKQARSSHSGKLPRSQVPMYRIHCDLAGPFPAQSTGGYLYTM FT VLIDDATRKNWIMLLKCKSHALNVFKQFHKMILNQTPYPIAVFKSDRGGEF FT TSTEFSNYLAEHGIVREMGPPESPQQNSVVERFNRTASERLRTQLLHGNLP FT VRLWGEVMLATSFILNLCPSKSIDFKCPEEAWQKLALQIKVPQLPFSRLRP FT VGCLAYTIPPGHRDKLAPRSIKAIMIGYEKNSNAYLWNPKEQRIMISNDVT FT FDEMKFPLSEKSNNDTEELAILNDESWDEIWDCNARTASETSPTTTFVQPT FT RPRRNVQPIDRLGDFIGYHTVADLNQPIYVSIDDGPENDEPNYSQAMKSPN FT REAWLDAMSREYTSLQAHNVGTLIEKPEGANVLPGMWRLKRKRDEFQRITT FT YKARWVAGGNHQIKGVDFDSTWASVGMTDTLRTLYSMAATEDLEMEQFDIE FT TAFLNGSMKHKVYMRQVTGFRDLTKPGHIVLLNASLYGTCQAHREFNDDFN FT KKMKIMGFKVCPVDNSLYSLRSGKSFIHIPMHVDDGMVFSNDKKMLASFRE FT TLKTHYKFQWNENPTLHLGVHITRNRSKRTITIDQQHYCDNILERFGLSNC FT NGVKTPLPTNIHLSTPIQDESEEIEDYRAAVGMLNFLSVQTRPDIGFAVGY FT LARFNSRHNKTHWSAVKHLTRYIKATSHYGIMFGNGETKKIIEGFADADYA FT GDLDTRRSTTGFVFYVYGSVVSWKSRRQQSVTLSTTEAEYMAIGDCAKHGL FT WLCWLLEYLIEGANLSVPVLLPLSNDNQGAVFLCNEASVNNRSKHIDIRHH FT FIRELIREGKITVSHVSTKEMPADILTKSVGSQILSSCNEKLGLNTKKSS" XX SQ Sequence 4618 BP; 1359 A; 1123 C; 935 G; 1201 T; 0 other; ggattctctg aatcctttta ggttatgagc ccagcgctta aagctacgat ctagatccga 60 ccgaaggtag attagagttc tgaacacggt ctggtagcac agaatccgtc aattcaaact 120 caaaataccc acccacctcg aaatcccctt ttcgaccttc aataagattc gaactatatc 180 aattctttca aattcttgaa ccttcgctga aattcgaaca acaatcttat tgactgtaca 240 actcaattac tccgcaatcc 
caagaatcca atctggaacc ttcggattca actgagtcta 300 gcgtctcgca acctacgtca aacccaccct cgggtaccga aaccaaactc actcagtcga 360 tcatgactac agttgcgagc gccacccata catctcactc ttctcatcta cttggatgta 420 ttcccaaact gggagacaca aacaaagtcg actggacgat aggcatcaaa acctacctac 480 gcagtcgcaa actctggaaa tatgtcgaac gtgacattaa gctttctgga tctgacctca 540 aagatccgga caaagttgaa acccttgaag ctgagcgtgc cggtgtgctt gaagtcatac 600 gtgcaacggt attgcccact cgattgcctc tcatctgtgg aatcgaggat cctaaacgag 660 cgtatgaagt actgcttgaa caggtctctc aagatgatgg ccttgagatt gctactctca 720 ttgccaaggt tgcgaacacc cggtacactg gaaacgaatc tgtctcgtcc ttcctcgacg 780 gaatcaatga tcttcacact agacttgctg aagctacctc gagtgacaat gatctgaaga 840 tcagtgacaa actattagct gtcttccttc tcctaagttt tccgggtgat caatttggaa 900 ccattcgcga ccagctattc ggtgatatga agaacttatc gacttcaaaa gttatgtcgc 960 gtcttcgaac aaaatcggcc ttatcaacgg ttgatgaagt tacaattgcg atgaatgcta 1020 tggccaaacc tttccatgct cccaacaagc ctaactcggc ccaaccgaat ccttaccgaa 1080 ccaataagtc gcccaatgct ccatgcatac tcagagagca ttggcaattc aatcacacga 1140 acggatcgtg ttccaaacaa attcagcgcc gaatgggcaa tgcaggtaat caaatcattc 1200 gcccgaggaa caatgctgct accaatggca atgtatcgga tgcggaaaaa atcaagcgat 1260 acaatcaact tactgctgcg ggtgttttcg gctataacac taccgtcccc gattctgctc 1320 ccaatcctcc tgctgtttcg acgcccaccg ctgacgacaa tggtccgaca tgccagtttg 1380 ccacgtccta caacgtagtg gcaatcgacg ccgatcctaa cgaagcctcc gtcaatctat 1440 ctcagactta ccctcaagct tcggaggtgc accgacctac attagctgat accgcatgta 1500 atcgtcacat gtttggcgat gcaatgttgt tggacgatct tcatgaggtt gcacctgttt 1560 ggatcaatgt cgccaacgcc aatcaatcgt ctcgaataat ggccaaccgc atgggaatag 1620 cccgactcag cgctatcgac aacaagggcc tatccgtggt cgttgatatc cctaacgttc 1680 tatactcccc tgacctccct gccaacttaa tctccatcac ctcattatac gaggctggat 1740 acaaaatcgt cgatccttac tacggtacaa acgtatcaga cgtgaatcta tacctgtcta 1800 aagacgatca catcattcct gcgtttaagg atgatggacc tggcggtttc tggcggtttt 1860 accatcactc cgagccacgt gcgtactcca cagtatcttc cccatctcct gatcctaacc 1920 tctggcacct acgattcggc catctcaatc acaggagcac 
ccgggatatc atggaagatg 1980 tcgttcgatt taatccaaat ataccatcga actgtgaggc ctgcaccttg gggaaacagg 2040 cgcgcagcag ccattctggc aagctaccac gctctcaagt gcctatgtat cgcatacact 2100 gtgacttggc tggcccattc cctgctcaga gcactggtgg atatttgtac actatggtat 2160 taattgatga tgctaccaga aagaactgga tcatgctact gaaatgcaaa tcacatgccc 2220 ttaatgtgtt caaacaattt cataaaatga ttttgaatca aacaccttat ccgattgcgg 2280 tttttaagtc tgaccgtgga ggcgagttta cgagcactga gttctcaaac tacctagccg 2340 aacatggtat tgtacgtgag atgggtcctc ctgaatcacc acaacagaat tctgttgttg 2400 agaggttcaa ccgaacggca tctgagagac ttagaactca actgttgcat ggcaacttac 2460 ctgtgcggtt gtggggtgaa gttatgcttg cgacatcatt tatcctcaat ctatgtccgt 2520 caaaatcaat cgacttcaag tgtcctgagg aagcttggca gaagcttgcg ttacaaatca 2580 aggtgcccca actgccattc tcccgtcttc gtccagtggg ctgcttagca tacactatcc 2640 cacctggtca ccgtgataaa cttgcaccta gatccatcaa ggcgataatg atcggatatg 2700 aaaagaattc taatgcttat tgactatgga atcccaaaga acaacgtata atgatttcta 2760 atgacgtcac ttttgatgaa atgaaatttc ctctatctga aaaatccaac aatgacaccg 2820 aggaactcgc tattttgaac gatgaatcat gggatgagat ctgggattgt aatgcccgaa 2880 ctgcctctga gacatcacca actaccacat ttgtccagcc aacacgacca agacgaaatg 2940 tacaacctat cgatcgtctt ggtgatttca taggatatca tactgttgct gatttaaatc 3000 aacctatcta tgtctctatt gatgatggac ccgagaacga tgaacccaac tactcccagg 3060 ctatgaaaag tcccaaccgt gaagcatggc ttgatgcaat gtcgcgtgaa tacacttcac 3120 tacaagctca taatgtcgga actcttatag agaaacctga aggcgctaat gtattgcctg 3180 gcatgtggag actgaaacgc aaacgtgacg aattccaacg tatcacaact tacaaagcgc 3240 gttgggtagc tggaggtaac catcaaataa aaggtgttga ctttgactca acgtgggctt 3300 ctgttggtat gactgacaca cttcgaacac tctactctat ggcggctacc gaggatcttg 3360 agatggaaca gtttgacatt gaaactgctt tcctcaatgg atcaatgaag cacaaagtct 3420 acatgagaca agtaacgggg tttcgtgatc tcaccaaacc gggacacata gtgttattga 3480 atgcctcact ttatggtact tgtcaagctc atagggaatt taatgatgat ttcaacaaga 3540 aaatgaagat aatgggtttc aaagtgtgtc cagtagataa ctctctatac tctctaagaa 3600 gtggtaaatc atttattcac ataccaatgc atgtcgatga cggtatggta 
ttctcaaatg 3660 acaagaaaat gttagcatca ttccgtgaaa ctctcaaaac tcattacaaa tttcaatgga 3720 atgaaaatcc aactctacat cttggagtcc atatcacgag gaatcgatcc aaaagaacca 3780 ttactattga tcaacaacat tactgcgaca acattctcga gcgctttggt ctttcaaact 3840 gcaatggcgt aaaaaccccg ctaccaacca acattcactt gtcaacacca attcaagacg 3900 agtctgaaga aatcgaagac taccgagctg cagttgggat gcttaatttc ttgtccgtac 3960 aaacccgtcc tgacataggc ttcgctgttg gatacctagc aagattcaat tcacgacaca 4020 acaaaactca ctggtcagcg gtaaagcatt taacgaggta catcaaagca acttctcact 4080 acggcataat gtttgggaat ggagaaacta agaaaataat cgagggattt gctgatgcgg 4140 attatgcggg agatttagac accaggagat cgacaactgg ctttgtgttt tatgtgtatg 4200 gttcagtggt atcatggaag agtagaagac aacaatcagt gacattatca actactgagg 4260 ctgagtatat ggcgatagga gattgcgcaa aacatggtct ttggttgtgt tggttactag 4320 aatacttaat tgaaggggcc aatttatccg ttcctgtttt actgcctcta tcaaatgaca 4380 atcaaggagc agtgttctta tgcaatgaag catccgtcaa taacagatca aagcacatag 4440 atattcggca tcacttcatc cgagaattaa ttcgtgaagg aaaaataact gtatctcatg 4500 tttcaaccaa agaaatgcct gctgacatct taactaaatc tgtggggtct caaatactat 4560 catcatgtaa tgaaaaatta ggactcaaca ccaagaaatc gtcgtgagca gggggggc 4618 // ID DNA-1_EN repbase; DNA; FNG; 3820 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 19-SEP-2005 (Rel. 10.1, Last updated, Version 2) XX DE Nonautonomous DNA transposon. Putative classification: MuDR DE superfamily - a consensus sequence. XX KW MuDR; DNA transposon; Transposable Element; Nonautonomous; KW DNA-1_AN; nonautonomous DNA transposon; KW putatively MuDR superfamily; DNA-1_EN. XX NM DNA-1_AN. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-3820 RA Kapitonov V.V. and Jurka J.; RT "DNA-1_AN, a family of nonautonomous DNA transposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(12), 203-203 (2003). XX DR [1] (Consensus) XX CC Nonautonomous DNA transposon. 
Putative classification: MuDR CC superfamily. CC 9-bp TSD. XX SQ Sequence 3820 BP; 1171 A; 676 C; 707 G; 1266 T; 0 other; ggtggttgca ggtgacctga ggtcaaaatt ttggtcgtgc gggacaaacc tgattatata 60 atggccttgg cagtcatgtg atgtacaacc tgtctttaca aagtttcagc ccgtactacc 120 tctttcaaat gtattaccat cgacatgtaa actagcttcc atactgccat tactagaggt 180 gctctaaact aaaagggaag attattactt cttaagtatt tagtaagcaa gggctctatt 240 acgaatcact aattagggtt gaaatggtat ctttggtgtc tctttccaat tgagaatagt 300 tatataggat tgaatacatc tcccgtacag gatatatata aaaggtcttt cgcacaaatt 360 ttcaattcaa ctacttggaa ggtacttgta aactagttat aaactactta gctcatgctg 420 ctgagcttcc tgtatatgta tcctcctctc cataagatca agttccaact tctcattttg 480 aagttgttgc aggcgaaagt cctcctcttt cttcttaaga tcagctctct tttgttgaag 540 ctctaattca agattctcta gctcaagctc ttgatgtctt tgttcaagat tcaaggcatt 600 tattgaaata gcacgtcgaa gattaccaga ccttattgac ctaatacttt gtaagtactt 660 tgcaactgct tgtaaaccac ttctaaagta cttggcagac ttactcatca tcaccaattc 720 ttgaggagtt ttgagacctt gatcttgtat ttccttgtgg taagggtaaa gctgattcgg 780 cttcagttaa gggagataaa gcagaccttc ggcgttttcg acttcctaaa agatagtagt 840 aagcaagtgg taggcaagta gtttgtaagc acttgcaaag caacacctga atacctactt 900 tcctaactta tatgctgaag ataattagct tccatattag aggtttgata taaatgatga 960 atattgaact cctggaagtt attatactat aagatatcat ccctatctag ctttgcagaa 1020 ctaggttatc ttagtagctt tgccagccag tttgcaagta gttagtgagt acttagaaac 1080 ttacttcttg actgcttaca ccagactcaa gtacttgcca gatatatata acttctgatg 1140 ggactgttca acagcatttg tatagttttg taataaatca aagtaataag gctgaatctt 1200 agagtatact ttatttaatc ctgccttgat cacagcccct ttcttttgag atgcccaatt 1260 ctggacatta gcattttcat actctatatc agttagtaac tggttaccaa gcagtaagta 1320 agtattcagt acttattaag agatccaata aagagtcata atcatcctcg gatttgcagt 1380 ctagaaggct tattattcgg ctccaaaggg gagaaccatg cttattagtt ctaatatatt 1440 ttaagattgt cctttggaaa tggactctgc agaagacaat aatatgctgt aaatactatg 1500 ttatatctca atattgagga ttaatttcag atagatagtg accaagtcct tggcagttag 1560 taagtatttt ggaagtagtt aataagtact tactggtata 
ttgttttgag tctatattaa 1620 taataatacc atagatccca gaaccatgga taggatcaaa atatattagc tgatgggaaa 1680 tcctctgaac aaggctgaaa acctgcttaa aaagtaaata ataaccctca gtagaatcaa 1740 tactggtaaa tacttggagt aatgtgataa ctagtaccca ggtagttagt aattgcttgc 1800 caagtagtta ataataatac ttacttttgc attggtctgg caggaatata gtaaaaagca 1860 ctttattaat atcttttgac tgtatttgtt tataagacat atcaacctta aaagatgaca 1920 gctgtaaaag tagttaaatt tgctctttaa aagtacaaag taccatggta ccctgagaat 1980 cataataata ttcttgaata tagtcctaag tacttgattt agtatctatt aactgctctg 2040 caactatata gcaagtactt accttcaagt tctggtcagt attctggagg aagataagac 2100 cattaatatc ctgtctgttt agataagata ttagatgttg cttttgaatt attgctgcaa 2160 ttcagtcctt attacaaaag ctagaataaa tctctgctaa tgtcgaagca ttgtactggt 2220 gacagaaatc ttcaagttgc ggatttcaaa ggaattgagc tagaactagt taccaactac 2280 ttcccaagtg gttggtgagt acttaccagt tgttagatta gggtcccgaa tctgttcaat 2340 aattctcttc acacctgcta gaattctttc aggtgccttg cttggtagta gtggtggatg 2400 tttataaatc ccgtgcgatg taaataatat ataggggcat aggtctgtat ttataggtac 2460 tagagtatta aagaccacat cacaggtagt gtgcttcaac tgaccagacc cctgaggatg 2520 atcttgatct ggtaattgct taataactgg ttgcaaatta gttaggaagt gcttaccaca 2580 gtatttttgg caacttgaca gaggtttaaa aacaccacat tcttcagtag ctggcagaat 2640 ctctttatta aagagatcct ctagaaactc caagtctatt actgtatgtc cttgaattac 2700 gcccctataa tgttttgtta aacctctata ggacctattt atacagccaa taaatggtgc 2760 atattctcta tgaatatcct gtatatttat tagctgcttc ttaagtactt agcaagtact 2820 tgagatatac tatctagtta tatcttttga aaactgcttt gcatgttggg agttgatcaa 2880 tgcatgcatg gcccttttcg aaaaaggcaa ctttggacca ataataacta agattgagta 2940 agtacttagt agccggttaa caagcagctt taatacctat atgcattatg ctttctaata 3000 tcagactcta agatctggat atctttctga gatctttgga tctcttgcta tgtatattta 3060 tcaacagatg tataataata agactgcagg gctggactga ggaatttata tgcataaatc 3120 ccagagcatc tccatgtcca ttttctgact tgacatccaa gaaatggaca gtaaacaggt 3180 tgtttttgtc caaaaagctg ttgttttgca tattagatct ggaaatactt agcagccagt 3240 taccaagtac ttgacatgca cttacactat cagcaagata ttccatttca 
acctgtgttc 3300 atcctttgga cacaacaaca tatgtatggc catatatatg cgttgtggga tattctggaa 3360 gattattgat atactcaata tttaatatta taagactgga ggagctttta gctcttgtaa 3420 gcggaataag catgtctggt tccttttatt tagttagcaa gcagtttaaa actaacttgt 3480 aagcagtggc gcaataacta tacctgtatt tctaaggaaa catcagcagc tggctcaata 3540 agatcatcat ccaaataggt tgaaatgttc tcaattgaaa tgttcccgga gtctggacac 3600 tccattgttc taactgctta gcaaccaatt gctaaatgct ttgatatttg tgtaggtcaa 3660 gtatcaataa attggaaaac agaaattgaa ttacaagtgt agacaccagt atttaaagac 3720 ctcgtgcggg acaaacttgc ttaataatga tcttggcact cacgtgcagg gggtgtaggc 3780 cgcatgagct caccgcttgg ctcatgcggc ctgtgtacac 3820 // ID Mariner-1_AF repbase; DNA; FNG; 1882 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of Mariner DNA transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-1_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-1882 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-1882 RA Kapitonov V.V. and Jurka J.; RT "Mariner-1_AF, a family of Mariner DNA transposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 96-96 (2006). XX DR [2] (Consensus) XX CC DNA transposon from the Mariner superfamily (Tc1 clade) The CC genome harbors ~20 copies that are 99.7% identical to the CC consensus. It encodes a 556-aa Mariner-1_AFp transposase (pos. CC 95-1762). 
XX FH Key Location/Qualifiers FT CDS 95..1762 FT /product="Mariner-1_AFp" FT /translation="MPPKARINSKNSVEQEGRVLLAVSALKNKEILNIREA FT ARVYNVPYTTLQRRLKGHTFRAELRANGHKMTQNEEDSLIRWILSMDQRGA FT APRPSHVREMANILLAQRGSTPTQTVGEKWVYNFINRHDEIKTRFSRRYNH FT QRAKCEDPKIILEWFNRVQITIMQHGITLEDIYNFDETGFAMGLVATAKVV FT TRAEMLSRPFLIQPGNREWVTSIECINSTGWVLPPCIIFKGKVHIEGWYQD FT TALPADWRIEVSENGWTTDQIGLRWLQKVFIPATTSRTTGRYRLLILDGHG FT SHLTPQFDQICTENDIIPICMPAHSSHLLQPLDVGCFSPLKRAYGRLIEDK FT MRLGFNHIDKFDFLEAYPQARTAIFSADNIKSGFSATGLIPLNPDRVLSQL FT NIQLRTPTPPGSRSTNSVPKTPYNLKQLKKQETTLKKLLRERTYSPPTPTK FT AVLGQIIKGCEMAMNNAALLAKENHDLRAAHEKHLQKQKRSRRQIETAVGL FT SIQEGQEIIQRRDQAAEAIPTIPPEQVVDTEQRPQRAPPRCSDCHILGHRR FT LQCPQRKNN" XX SQ Sequence 1882 BP; 541 A; 445 C; 428 G; 468 T; 0 other; acgtaatcgg taagcgagcg gatcacgcaa ccgagcggat caccaaacgt gttttgactt 60 ctatttcaaa agctgcagtg ctcagccagc cactatgcca ccaaaagcgc gtataaactc 120 aaaaaattca gttgagcagg agggaagggt cctacttgca gtatcagctt tgaaaaataa 180 ggaaattctc aatattcgtg aagctgcgcg tgtctataat gtgccttata ctaccctcca 240 gcggcgccta aaggggcata cttttcgagc tgaattacgc gcaaatggcc ataaaatgac 300 tcagaatgaa gaggattcac ttattagatg gattctatct atggatcaac gtggagcggc 360 tccccgaccg tcccatgtac gagaaatggc gaatatcctg cttgcgcagc gtggttcaac 420 tcctacccag actgttggag agaaatgggt atataacttc attaatcggc atgatgagat 480 caaaacccga ttctctaggc gctataacca ccagcgtgct aaatgtgaag acccaaagat 540 tatcctggaa tggttcaatc gtgtccagat cacaataatg cagcatggga ttacactgga 600 agatatctac aactttgatg aaactggctt tgcaatgggc ttagtagcta ctgctaaggt 660 agttacaaga gctgagatgc ttagtcggcc cttccttatc cagccaggga accgcgaatg 720 ggttacctct atagagtgta ttaactctac tggctgggtg cttccaccat gcattatctt 780 caagggaaag gtccatattg agggctggta tcaagataca gccttaccag cagactggcg 840 gatcgaggtc agtgagaatg gatggacgac tgatcagatt ggattacgat ggcttcaaaa 900 agtctttatt cctgctacta ccagtcgtac aactggtaga tatcgactat taattcttga 960 tggccatggg agccatctaa caccacagtt tgatcaaatc tgcactgaga atgatatcat 1020 tccaatctgc atgcctgcac attcatcaca tctcctccag cctctagatg 
ttggctgttt 1080 ctctcctctt aagcgtgcgt atggccgctt gattgaggat aagatgcggc ttggtttcaa 1140 ccatattgac aagtttgatt tccttgaggc ctatccacaa gctcgtacgg caatcttttc 1200 agcagataat attaaaagtg gcttttcagc aactggatta ataccactga atccagatcg 1260 ggtgctcagt cagcttaata tccagcttag aacacctaca ccaccaggca gccgatcaac 1320 taattctgtc ccaaaaacac cttacaatct caagcagctg aagaagcagg aaactacgct 1380 taagaagcta cttagggagc gtacatacag ccctcctacc cctacaaagg ctgtgctagg 1440 tcagattatc aaggggtgtg agatggcaat gaataacgct gcccttcttg caaaggaaaa 1500 tcatgatcta cgtgctgcac atgaaaagca ccttcaaaag cagaagcgat ctaggcggca 1560 gatagaaact gcagtgggat tatctatcca ggaagggcag gagatcattc aacgcaggga 1620 tcaggctgct gaagctatcc caactatccc tccagagcag gtagtagata cagaacaacg 1680 ccctcaacgg gcacccccac gctgcagtga ctgccatatt ctaggccata ggcgattgca 1740 atgtccgcag cgcaagaata actagattta gtaataaaat catgttttag gggttcaaaa 1800 tagcctccaa tttcggccgc ggccaaattc tatggtatgg tgatccgctc ggttgcgtga 1860 tccgctcgct taccgattac gt 1882 // ID Gypsy-47_MLP-I repbase; DNA; FNG; 5766 BP. XX AC AECX01001226; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-47_MLP_; KW Gypsy-47_MLP-LTR; Gypsy-47_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5766 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001226; Positions 31072 36837. XX CC Positions [3097-3600] - Reverse transcriptase CC Positions [4711-4941] - Integrase core CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 311..1732 FT /product="Gypsy-47_MLP-I_2p" FT /translation="MVATRDQIANAKTIVDCLPPPADGSAQYSAAELKSFT FT EEEREALRILGAEGPVSKKVVSSTVESQEMVTDEGPAMNPKGKASTVPASA FT APPSNPSAAPSSAPVTQEQFLEWKEWMLREEARKSVIKSKSSVSVSFCSVS FT EFVSDVGVDSPADRMATVALDDESVDRKPNLKAFMSADGDGQSKYEVLSRL FT HFKRVADKALHAAPGQPFVIADGPEDIVSDPEKARKAMADVTLEMFDGSMS FT SDFSIWASKFERAATLRFIHRSLFHRLAFTRFKDGALTHLENAMASATRPT FT SWESFVEWGKAKFRSSATEYSVKAKLRELKQRPNETVGQFYDRWNLFEQEA FT KVVGLIYSRSPEFVERLQPGVRKVVQAKISDQLLAGVEMPFADIALWAVRK FT DDEFRSNRESVSVIAEGSNSASNNKKRKKNQVKKEDRKCFNCNVVGHIFGS FT LENPGCPKAVTDQTCKYFEGKKEVKTLKD" FT CDS 2284..4941 FT /product="Gypsy-47_MLP-I_1p" FT /translation="MLLEADSEGRRVDFLSVASGRMSSFLLDTGGKDKYVS FT CSFVKKNGLVTMVSPSTLRVKGAFGQTIKCDQLCQLEFELAGLKFSISCRV FT VPLASFDIILGLAWINEFVVETNWTSGVWTLQDGRGLKAKFQPGRMGAPCE FT VVHELQVVEESCDDMKEVSRSGFRRFCRQKDVDIVLLQPFEPITEVNEVDG FT PREEVPASMPELAAMDKALKTRVQSLVEKFKDRFSEVKTAPKVKRVVDHLI FT DTDDKGPVSQPVRRMSPLLLDELRLKLAELLEKGMIRPSTSAWSSPVLFAK FT NASGKLRFCVDYQAVNELTKRDRHPLPLVQDCFDQLGGARFFSKFDLQQGF FT HQMKIAGRDIEKTAFGTRYGHYEWLVMPFGLVNAPSTFQRMMNDILREYLD FT DFVQVYLDDILVYSKSKEEHLEHIEKVFSALEKAELKISGKKSLIFAEEIQ FT FVGHIISHNCVRMMPEKVKAIQEWPTPGNVYDVRAFLGLAGYYRRFIRNFA FT RLASPLHGLTAGTTTKKQKVVWCPVHDLAFCSLKKALQEAPVLTTPNAERP FT FVIETDASDHAVGAVLLQEGEDGVEHPVAFESAKLNPAQQNYPAQEKELLG FT ILHAWRKWRVYVEGAVSDTIVRTDHASLTYLQQQVLPSRRLHRWLDEFAEM FT TIKIQYKKGSANIVPDALSRRSDLMEIQEVREGLRDPSDWPLLVPYILDQR FT DIPEWVDSTSLRMAVTSMHKLTYDKKAEALSFKEEDSPFVPFDFRPDLLEA FT VHRESGHRGRDGTLAFLKGRGWWPGRYKDVEEFCKHCSQCQIFDRAREGVE FT TGEQVPLPQVAPFERWAADFIQLPESKDGYKWVLTIIDHCTGWPIAVPLKE FT ATSANVAETFLREVFQQFGIPSEILTDRGQNFL" XX SQ Sequence 5766 BP; 1471 A; 1226 C; 1627 G; 1442 T; 0 other; gatccctaat ctaacccata gttttggtcc tgttacactg gtagcgagct tttagttttt 60 tgaagttctc tcgacttaac tagcactctg tgaaaccctt gcgcggatca tacgtcctgc 120 ttcgtgaaat tttttcgaaa gtctttcttt taaatcaatc aacgttagtt tttggccgta 180 ttttagtttc actctgggtt ctattttgtt cgcataaaac ttctccccga cacctccgga 240 ctcagccagt tgttagaagc 
tgatcatcag tgattgcttt ttaccttctc ttctttttgt 300 gaactcacag atggtggcta ctcgtgatca aatcgctaac gcgaagacga ttgtcgactg 360 ccttccgcct cctgctgatg gttcggctca gtactccgct gcggagttga agtccttcac 420 ggaggaggag cgtgaagcgc ttcgtatcct cggcgctgaa gggccggtta gcaagaaggt 480 tgtttcctcc acggtcgaaa gccaagagat ggtgacggat gaaggaccag ccatgaatcc 540 taaggggaag gcctctacgg ttccggcttc tgccgcacct ccgagcaatc catctgccgc 600 cccgagttcc gctcctgtca ctcaagagca gttcctcgaa tggaaggagt ggatgctgcg 660 tgaggaagca agaaagtcgg ttatcaaaag taagtcttcc gtctccgttt cgttttgctc 720 tgtttcagag tttgtttctg atgtgggtgt ggactcaccg gcagaccgaa tggcaactgt 780 tgcccttgac gacgagtctg ttgatcgaaa gccaaatttg aaggctttta tgtccgccga 840 tggagacggc caaagtaagt acgaagtcct gtcgagactt catttcaagc gagttgctga 900 caaagcttta catgcagctc ccgggcaacc atttgtgatt gcggatggcc cggaagacat 960 cgtgtctgat cctgagaagg ctcggaaggc gatggctgat gtgaccctgg agatgttcga 1020 cggctctatg tcgagcgact tttctatctg ggcctctaag ttcgaacgcg cggctacgtt 1080 gcgtttcatc cacagatccc ttttccatcg cctcgcattc acccgtttca aggatggagc 1140 cttgactcac cttgagaatg ctatggccag cgccactcgc ccaacttctt gggaaagttt 1200 tgtggagtgg ggaaaggcca agttccgttc cagcgctact gagtactcgg tgaaggccaa 1260 actcagggaa cttaagcagc gccctaatga gactgtcgga cagttctatg accgatggaa 1320 tctgtttgag caggaagcta aggtagtagg tctgatctac tcgcgttccc ctgagtttgt 1380 tgaacgcctt cagccgggtg ttcgtaaagt ggtgcaggcg aagatcagcg atcaactact 1440 tgctggagtt gagatgccct tcgctgacat cgcgttgtgg gcggttcgaa aggatgacga 1500 gtttcgtagt aaccgggaat ccgtttcggt gattgctgaa ggctcaaact ctgcgtcaaa 1560 caataagaag agaaagaaga atcaggtgaa gaaagaggat aggaagtgtt ttaactgtaa 1620 tgttgttggt cacatctttg gatctttaga aaaccctggt tgtcctaagg cggtaactga 1680 tcagacttgc aagtactttg agggtaagaa ggaggtgaag actttaaaag attaggcaat 1740 gatgatggga ttcagtcgtc ggcgccaagt agtagcgtat tatgcaataa agtttcagtt 1800 tctggttgtt ctagtacgct ttgtgagtcc aattgtgagt tgcctattga gtttcccaat 1860 gagccgccta gtataggcga acttgtgagt ttagttaggt ctgaagtcca cgaggtggct 1920 tccttgaccg atgaggtaag tcggttgtct tcgacaccgt 
tctgtgagga aaaagttgag 1980 gatggacgtt ttggttgtct caagacaagc caagctgtcc agtcagacgt tccaaacctt 2040 gagttgggtt tggaaggaat tcccgaaagt tcagaggacc ttcggggcac agtgagtgaa 2100 aatttttcaa gtccaaaacc tgcattagag gtgaagagtt ttcgagtagt tcccgatagt 2160 ggtgaacgta cctcggtaga ggagcaacgt ctaacggctg tcacagaacg gactagccca 2220 ccggccagtg ggccgctttt ggggcaagcc gggatcctcg atggtagctc cgtcgaggat 2280 ctgatgttgt tagaagcaga ttcggaagga cgccgcgtcg actttctctc agtcgctagt 2340 ggcaggatgt caagcttcct gcttgacact gggggtaaag ataagtatgt gtcgtgctca 2400 ttcgtgaaaa agaatgggct cgtaacaatg gttagtccgt cgaccttgag agtcaaggga 2460 gcctttggac aaacgatcaa atgcgatcag ttgtgtcagc tagagttcga actggcaggg 2520 ctcaagttca gtataagctg tcgagtggtt ccactggcca gcttcgatat aattctaggg 2580 ttagcctgga taaatgagtt tgttgtagag acgaactgga cctcgggggt ctggacgtta 2640 caagatggca gaggtcttaa agctaagttc cagcctggaa ggatgggagc accatgtgaa 2700 gtggtgcacg aactacaggt tgttgaggaa tcgtgcgatg acatgaaaga ggtttcgcga 2760 tccggtttca gacgtttttg tcgacaaaaa gatgtagaca tcgtgttgtt gcagcccttt 2820 gaacctataa ctgaggttaa tgaggtcgat gggccgcgtg aggaagttcc ggcttccatg 2880 cccgaattag cggctatgga taaagccctt aagacccggg tgcagagcct ggtagaaaag 2940 ttcaaggacc ggttttcgga ggtgaagacg gcgcctaaag tgaagagagt ggtagatcat 3000 ctgattgaca cggatgacaa gggacccgtg tcacagcctg tgcgacgtat gtccccgtta 3060 ctcttggacg agttgaggtt gaagcttgca gagcttctcg agaagggcat gatacgacca 3120 tccacatcgg catggtcgtc accagtgctt ttcgctaaga acgcgagtgg taagcttcgt 3180 ttctgcgtcg attatcaagc agttaacgaa ctcacaaagc gtgacaggca ccccttgccg 3240 ttggtacaag attgctttga tcagttaggg ggagcgaggt tcttctcgaa gttcgacctt 3300 caacaaggtt ttcaccagat gaaaatcgcg ggaagagaca tagagaagac agcgtttggg 3360 acgcgttacg ggcattatga atggctggtg atgccattcg gcctggtaaa cgccccaagc 3420 acctttcagc ggatgatgaa tgatattcta agggaatatc tggatgattt tgttcaagtg 3480 tatcttgacg atatcctagt gtattcaaaa tccaaggaag agcatttgga acacatcgag 3540 aaagtgttct cggctcttga gaaggcagaa ttgaaaataa gtggcaagaa atctttaatc 3600 ttcgctgaag aaattcaatt tgttgggcat attatctcac acaattgtgt 
tagaatgatg 3660 ccggagaaag ttaaggcaat tcaggagtgg ccaacaccgg gaaatgttta cgatgtaagg 3720 gcatttttgg gtttggcagg gtattaccgg aggttcatca ggaacttcgc caggttggct 3780 agtccattgc atggattaac ggctggtact acaacgaaga aacaaaaggt ggtttggtgt 3840 ccggtacatg atttggcgtt ttgcagtcta aagaaggctt tgcaagaagc accggttcta 3900 acgacaccga atgcagaacg tccctttgta atagagacgg atgcaagcga ccacgcggtg 3960 ggagccgttt tgttacagga aggagaagac ggggttgaac accccgtggc tttcgaatct 4020 gcgaagctga atccggcgca acagaattac ccagctcagg aaaaggagct cttgggcatt 4080 ctgcacgcgt ggagaaaatg gcgggtctat gtcgaaggcg ctgtaagtga taccatcgta 4140 aggacggatc acgcgtcgtt gacgtactta caacaacaag ttttgccgtc tcgtcgcctt 4200 caccgttggt tggatgagtt cgccgagatg acaattaaga tacaatataa gaaaggatcg 4260 gcaaacattg tccccgatgc gctcagtcgg aggtctgacc tcatggaaat tcaagaagta 4320 agagagggcc tgcgggaccc aagtgattgg ccgctgttgg tgccatacat acttgatcag 4380 cgggacatac cagaatgggt ggactccacg agtctgcgta tggcggtgac aagtatgcac 4440 aaactgacgt atgacaagaa agcggaagcg ctgtcgttta aggaagaaga ctcgcctttt 4500 gtgccattcg attttcggcc ggaccttctg gaggcagtgc acagggaatc tggacaccgc 4560 ggacgtgatg gaaccctagc gttcctaaag ggacgtggtt ggtggcctgg acgttacaaa 4620 gatgtcgaag aattttgcaa acactgcagt cagtgtcaaa tctttgaccg ggcgagagaa 4680 ggggtagaaa cgggagagca agtcccgttg cctcaagttg cgccgttcga gcgttgggcg 4740 gcggatttta tccagctgcc ggagtcgaaa gatggttata agtgggttct aacaatcatt 4800 gatcattgta caggctggcc tattgccgtc ccgcttaaag aagctacttc cgcgaatgtg 4860 gcggaaactt ttctgaggga ggtctttcaa cagtttggaa tcccctctga aatcttgacc 4920 gacaggggtc aaaacttttt gtaaaaggac atgatgcggt tttacaaagc cgcgcacatc 4980 agaaagttga caacttcggg ctaccatcct cgcacgaatg ggaaaagcga gcggctgaac 5040 gggattttgg agggtacctt gttccggttg aacaagacag gagaccctac caggtggccg 5100 gaatttttgt cacaagctgt gtttgcagtt cgcgttaatc aaagtacggt tacgggctat 5160 tcgccctttg agttattgta cggagtaaaa ccgcggttgt tagcggatag ggctaagata 5220 cgccctaaaa tattagaaga caaccagaaa aagaaaacca ccgctcgaga agctagactc 5280 gagcgattgg ccctggatcg agctgaggct cggaaattgc aagaagagag ggccttgaag 
5340 aataaggccc gtttcgacca gacgctagcc gtcaaaggta cgctgcagtc gtataaggtt 5400 ggagaaacgg taaagctccg gaatgagaca agaacgaaag gccagccgag ttggcacggg 5460 cctttcgaag ttttcgacgc gttgggcaac aacgtgtatt cgctggtgga cccaacaggt 5520 tctttatttc ctcatccagt gaatgggaat aggttgcagc ctgcgaattc taaaggtgga 5580 gcactagtca aaccgtgggc actaccggct aggttgcagc cggcttgtga caaggtggaa 5640 aaagtgaccg gcgacataaa caacgaagtt aaaaagttaa ctaaagctca gaaagacgct 5700 ttgaagataa agataaagct gccgaaagcg tccagtggaa tggacgctgt tcaagggggg 5760 gaaagt 5766 // ID Copia-62_MLP-I repbase; DNA; FNG; 4739 BP. XX AC AECX01000577; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-62_MLP_; KW Copia-62_MLP-LTR; Copia-62_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4739 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000577; Positions 14211 9473. XX CC Positions [2034-2576] - Integrase core CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 219..4715 FT /product="Copia-62_MLP-I_1p" FT /translation="MNEEDNLPSDQLLNTQSPSNSSSSSALPNDEAYQSAS FT EIGSVGEETFLLESLFPDMTSSIPPTTTITTTTPSNLFGGKDHQIAASVVN FT TLSKIVISDILNDTNYLTWSTDIRNGIKSLRFHPFLLSDKDENGIDEDIRL FT YNEVVRECLMTWMISKMDQTNKNRFEPMMTVKTELGPDVSPTPSALWVKIR FT DHHSPRTSAHQMLLCRSLFNLIQGESSLPLHMDEFHKAYIAYMNSGGKIGS FT VELGQQLLMSLNADWMKVAEDIADSDTFEYHSVVTALKVRIANRAMLRPQR FT TTVIEASAASPMRRRQNFVQTRCSPAKCVSMSHTSADCLRNPKNIDKYNAW FT VEAKKAKGEWIDRPPKPNSKQISSTSPSTSLITTDSAVPSLSSLQEALERS FT EFQASASAVFIVDPDANIDNDHFGIVDSGTTHHMFKSQKFFDLNHFKLLDT FT SNERVGLAGGTSTLKIKGQGNVTIMGPTGDLTTLTDCLFVPTLKQNLIAGG FT RLLHDRWITTLKDNSSFIIERHGKIALTGRVHPTSMLLQLNAVRTSPSSSA FT ITDTTTESLQLLHHKLGHPNFTYLKQMIRGEAVLGLPLSLSKVVLPLELPC FT NSCDLSKAHRQPHSYTRTRSLHLLDNVHIDLSGIMRTPAICKSLYFILFTD FT DHSSYHHITGLKSKEKDEVHGAIHTYLSLVERQCDRKIKSLTLDGGGEFLN FT NVLLPYCRSEGIYLRITAAYTPEENGVAERSMRTIVSKARSMLIQANLPIR FT LWLEAAKNAVFVNNRTTTTTLPKNKTPYEIWYGQKPDVSHVKAFGCLTYVL FT IRKPDCINKYAATSEHGVLVGHTEHNRNYKVFMLKDAKINITHDATFREDV FT YPFTQLPKFDISHLTTEDEAVNIPPLAINPAPPPPDDDDPVQHVVTVQEQA FT TPEPPVDIPPLPSSPPQQMPRRSEREIRPVDRFVPNSSMTFWETDCFIEQG FT DICAYAFAAGPTVRLVAEPGSYKQAMKHPNAHNWKNACDKEIDNMNHKGVW FT KLVARPLDAPVVGSRWHFKVKLNPDGSIIKYKARIVAKGFTQTFGVDYNQT FT YAPTGKPASYRIIICIASCNGWSVHGMDAIAAFLNSALKELVYMEQPEGYV FT NEGDEGKVCLLLQAIYGLKQSAHEWNEEFKLKLIKAGFTQAEGDECVYIRR FT RSTSEVVIFYLHVDDMAITGPLECIISFKKEIMTFWEMDDLGEVSCVVGIE FT TLRLSSHHYAIHQRSMTESLLLRFGLSDCKPASTPVQGGQKLLKSTVEEST FT AFAKLNYPYRSGVGSLMYISQCTRPDIAYAVGVLSQHLDNPCQRHWDAFRH FT VLRYLRGTINLGIHYHVEDNRLFQMKSSWNVPSTNVDSDWAGCKNSRRSTT FT GYLTTLCGGAISWRSRLQQTVALSSTEAEYRATTEAGQEVLWLRNLLRDVG FT FEWSGPVNLNCDNLGAIDLSSNAVHHGRTKHIDIEHHWIREQVQKDNITLS FT YCKSEDMTADLLTKPLHPGPFWNHMKGVGLKRCA" XX SQ Sequence 4739 BP; 1389 A; 1124 C; 972 G; 1254 T; 0 other; atttcgctta gatcactatt tgagcacatt gctcattaaa cccttttcat ttctcacttt 60 aacaatttcc tccattaact aggttagtac caaggtctgc atacctttgg tttggagaag 120 agtgctcacg tacacactgc tgttatcagg tcaaccgtct tcttagacat ttacatggta 180 gcgagagagt 
ctagtttcga tcaaaacaaa tcaattgaat gaacgaagaa gacaatctac 240 ccagcgatca attattgaac actcaatctc ccagtaactc atcatcgtca tctgctttac 300 caaacgacga agcctaccaa tcagcttctg aaattggttc tgttggagaa gaaacttttc 360 tactcgaaag tctatttcca gacatgacct cctcaatccc accaactacc accatcacaa 420 ccaccactcc atctaatcta tttggtggta aggaccatca aatagcggct tcggttgtca 480 atacgctgtc caagatagtt atctctgata tcttgaacga cacaaactac ctaacttggt 540 caaccgatat tagaaatggc atcaagtctc tacgttttca tcctttcctc ttatcggata 600 aagatgaaaa tggtattgac gaagacatca gactatataa cgaagtagtc cgtgaatgcc 660 taatgacttg gatgatctct aaaatggatc agacaaataa aaatcgattc gaaccaatga 720 tgacagtcaa aactgaactc ggtcctgatg tttctcctac tccaagcgct ctatgggtaa 780 agataagaga tcaccactct ccacgaacat ctgctcacca aatgctcctt tgccgatccc 840 ttttcaatct tatccaagga gaaagttccc ttcctcttca tatggacgag tttcacaagg 900 catatatcgc ttatatgaac tcaggaggaa agataggatc agtggaacta ggacagcaat 960 tactcatgtc attgaatgct gattggatga aagtagctga ggacatagcc gacagcgaca 1020 cttttgaata ccattcagtt gtaacggcgt tgaaagtccg aattgcaaac cgagctatgc 1080 ttcgacctca acgaactacc gtcatcgagg caagtgcagc aagtccaatg agaaggcgtc 1140 aaaattttgt tcaaactaga tgttcaccag ccaaatgcgt atcgatgtcg cacacctcgg 1200 ccgattgctt aagaaatcca aagaacatcg ataagtacaa tgcatgggtt gaagccaaga 1260 aagctaaagg tgaatggatc gatcgtcctc caaagcctaa ctcaaagcaa atttcatcca 1320 cttcaccatc aacctcctta ataacgaccg attctgctgt accatctttg tcttctctcc 1380 aagaagcttt ggaaagatct gaattccaag ctagtgctag cgcagtgttc atagttgatc 1440 ctgatgcgaa tattgacaac gatcattttg gtattgttga cagtggaact actcaccata 1500 tgttcaagtc tcaaaaattc ttcgatttga accacttcaa actacttgat acctcaaatg 1560 aacgggtagg attggcagga ggaacgtcta cgctcaagat taaaggccaa ggcaacgtga 1620 ctatcatggg tcctacgggt gacttgacaa ctctcaccga ttgccttttt gtccccacac 1680 tcaaacaaaa cctaatcgcg ggtggaagac tattacatga cagatggatc accactctga 1740 aggacaacag ttctttcatt atcgaacgtc atggaaaaat tgccttaacc ggacgtgttc 1800 atccaacttc aatgctatta caactcaatg ccgtgcgtac atctccaagt tcttcggcaa 1860 ttaccgacac cacaactgaa tctttgcaac 
ttttacatca caaactaggt cacccgaact 1920 ttacttattt aaagcaaatg attaggggag aagctgtttt gggtttacca ttgtccttat 1980 caaaagttgt attacctctt gaattaccat gcaattcatg cgatctttcg aaagctcatc 2040 gacaacctca ttcttacact cgcactagat ccttgcatct gttagataat gtgcacattg 2100 acttaagtgg cattatgcga acccctgcca tatgcaaaag cctttatttc atcttgttca 2160 ccgatgatca cagttcttac catcatatta ctggcttgaa gtcaaaagag aaggatgaag 2220 ttcatggagc tattcatacg tacctttctt tggtcgaacg ccagtgtgat cgaaaaatca 2280 agtccttaac ccttgatgga ggaggagagt ttctcaacaa tgtccttctg ccatattgca 2340 gatcggaagg catctatctc cgaatcactg ccgcgtacac tccggaagaa aatggagttg 2400 cggagcgatc tatgcgtaca atagtgtcta aagctaggtc tatgctcatt caggctaatt 2460 tgcctattcg actctggttg gaagcggcaa agaatgcagt ctttgtcaac aaccgaacaa 2520 caacaacaac cctaccgaag aacaagaccc cgtacgaaat ttggtacggc caaaagccag 2580 atgtctccca cgtaaaggca tttggttgtc tgacctacgt cctcatcagg aaaccagact 2640 gcatcaataa gtatgcagcc acttctgaac atggcgtctt ggtaggtcat accgaacaca 2700 accgtaacta taaggttttc atgttaaaag atgcaaagat caacatcacc cacgatgcaa 2760 cttttcgaga agacgtatac ccttttactc aacttcctaa gtttgacatc tctcacttga 2820 caactgaaga tgaagctgta aacatccctc cgttggcaat caatcctgct cctccgccgc 2880 ccgatgatga tgatcctgtt cagcatgtgg tcaccgttca agaacaagca acgccagaac 2940 ctcctgtaga cattccaccc ttgccatcaa gcccgccaca acaaatgccg cgtcggtctg 3000 aaagagaaat cagaccagtc gatcgtttcg ttcctaactc cagcatgacg ttttgggaaa 3060 cggactgttt catagagcag ggtgacatat gcgcttacgc gtttgctgca ggacctactg 3120 ttcgattggt ggctgagcca ggtagttata agcaggcaat gaagcatcct aatgcgcaca 3180 attggaaaaa cgcttgcgat aaagaaatcg acaacatgaa tcataaaggg gtgtggaagt 3240 tagtggcccg gccgctggat gcaccggtgg ttggttcgcg ttggcatttc aaggtcaaac 3300 ttaatccaga tggctcaata atcaaatata aggcccggat tgttgcaaaa ggcttcacgc 3360 aaacctttgg agtggattac aatcaaactt acgccccaac tggaaaacca gcatcatacc 3420 gcatcatcat ctgcatcgct tcctgtaatg gatggtcagt ccacggaatg gacgccatag 3480 ccgcttttct taactctgct ttgaaagaac ttgtatacat ggaacaaccc gagggttatg 3540 ttaacgaagg agatgaagga aaagtctgtc tcttactgca 
agcgatttat ggcttaaagc 3600 agtccgctca tgagtggaac gaagagttta aactgaaatt gatcaaggca ggttttactc 3660 aagctgaagg cgacgagtgt gtgtatatca gaagaaggtc tacatcggag gtggtcattt 3720 tctatctgca cgtcgatgat atggcaatca ccggaccttt ggagtgcatc atcagtttta 3780 aaaaggaaat tatgaccttt tgggagatgg acgacttagg tgaagtatcg tgtgttgtgg 3840 gcattgaaac tctgcgctta tcgtctcatc attatgctat acatcaacgg tcaatgaccg 3900 aatctctgtt gttacgattt ggcctgtctg actgcaaacc ggcgtctacc ccagttcaag 3960 gtggacaaaa gctactcaaa tctacagttg aggaatcaac tgcttttgca aaactcaact 4020 acccttaccg gtcgggtgtc ggaagtctca tgtacatttc gcaatgtacg cgtccagaca 4080 ttgcctacgc cgtaggtgtt ctgtcacagc atctagacaa tccgtgccaa cgccattggg 4140 acgctttccg tcatgtcctc cgatatctaa gaggtaccat caacttaggc atacactacc 4200 acgttgaaga caatcgactg tttcaaatga aatcttcatg gaacgtacca tctacaaatg 4260 tcgattctga ttgggccggt tgcaaaaact ctcgacgctc aaccaccggg tatctcacaa 4320 ccctctgcgg tggcgcaatt tcttggcggt caaggctaca acaaacagtc gcattgtcct 4380 caaccgaagc tgaatatcgc gcaacaactg aggctggtca ggaagttctt tggttacgaa 4440 acttgcttag agatgtaggt tttgaatggt caggaccggt aaatttgaac tgcgacaacc 4500 taggagctat tgatctctct tccaacgcag tacaccacgg tcgtacaaaa catattgaca 4560 tagaacacca ttggattcgg gagcaagtgc agaaggacaa cattacctta tcttattgca 4620 aatccgaaga catgacagcg gacttgctta caaagccact ccatccgggg ccgttctgga 4680 atcatatgaa aggagttggt ttaaagaggt gtgcttagcg tgtcttgatt gagggggtg 4739 // ID TCN4-I repbase; DNA; FNG; 5487 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 21-APR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR retrotransposon - internal consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW reverse transcriptase; TCN4-I; internal portion. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-5487 RA Goodwin T.J. 
and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-5487 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-5487 RA Gentles A. and Jurka J.; RT "C. neoformans LTR retrotransposon TCN4."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX CC 434 bp LTR deposited as TCN4-LTR. XX FH Key Location/Qualifiers FT CDS 2..4361 FT /product="ORF1p_TCN4" FT /translation="FFQNNAHKITPNAHSPNAQTPLMPPKGQTLKDTPPSE FT EANASPVEGDINAQVAAVILHLSKLTSQQSLFGTQGNHKPREVTIPQASSI FT PNFTGPIRDAPAVLLHIDRLEKLLRTNSLFSPQDDDAQTEKLRVIILEVAN FT NSIECTALKAWVRTTGRMMEADGESGWDAWTVAFKKAAMPLAWDQKELRSL FT LRLQFNQVSDWETFDNQAIQHRHNLMGTDLYPSDTSMATLYRGACPEPLFI FT RLMTTPEFRNGSLDEVRNLLELTVVKYLENRDQPTHARIKPQASVTSQTQP FT HLAHPISHYYQAQNPLPYYLSPAGKHIRELFAKENRCNDCRKVGHNFKTCP FT NRRGPTRVNSRHVDNHEVLIQELTSQLDDHLQLETSIDSDETSYALFHPPL FT VIQASAAADGRTSIAPPPPLRFLLDTGAGTSFFDPSVVARLGWKVRKDAVE FT RTVRLAGGKPGPVVRDVTSGTFRVRHAIYSGEGVVMELNGTYDGILGLNFC FT KRYHLLDQSPLFRRLLSSSGKVLTGNDVAVPTSPPPVSHDAKISPLAGTDV FT TVPMRHSKIFHKSKVSTTAAAPNEDVTVPTPHSKIFHKSKVSQTPVAPQPS FT AITTKFSAIPSSNSVSLGAAETASYAEIIAQLKNDFAEVFCDKLGDVADFP FT SITKTKSGVRFEIILKPNATPSHAAPYRVPETLLPRFREMIAEHLNAGRLR FT YSSSPWASPAFLVKKPNGQHRLVCDFRALNNQTVPDMYPMGNIQDILHRAA FT RRGNLFAKLDCKDAFFQTLMKDDDIHKTAITTPLGLLEWVVMPQGIRNAPA FT AQQRRINEALQGLAGECCEAYVDDIIIWGTDSEDLRRNCVSVLAALRRSGL FT RCSREKSQFYLTETSFLGHVIRPGQILPDPTKIARIEQFPHPTTPKALHSF FT LGLLNYLREFIPSLATHASVLHACLPPNTAAEKAYYRHLKEHKGRLQESWQ FT GWRWNYGPKERVAFDALRRAVSTVPCLTTIDYDAVKAGTLKVFLTTDASKV FT GMGAWIGVGTTKENAQPVAYDSRSFNSAQRNYSTHERELLAIVHSLDHWRP FT FLYGIPVYAFTDHFTLKWFLGQRNLSSRQIRWLDILKDFDVRIEYIKGEHN FT VLADHLSRHIGSDTLPPTDPALPEPDNCILAPVVTFAPTLAPEIIRTIAQG FT 
YDRENLFKDWLADPTTAPGVSTIPHDTHTLLLLDNRLCIPDVDTLREDLMR FT QAHEGTAAHLGIEKTVEVLREGYFWEGLVGDVREFVRACHSCQQANASTKK FT PVGPLHPLPVPRDKFDDIAMDFVGPLPSAGGHDYLLTITDRLTGFIELVPC FT STTLNARDLALLFWNCWVSRYGLPLSITSDRDALFTSRFWTTLWDEQGVRL FT KMSTAFHPQTDGTSERSNKTVGQLLRSWVNRQGTSWAKFLPRISQAMNNTV FT RRSTGYSPIQLVFGRRLRTLPGIARPPATLHSRVXPYSCX" XX SQ Sequence 5487 BP; 1167 A; 1655 C; 1184 G; 1478 T; 3 other; cttttttcaa aacaacgcac acaaaataac gccaaacgcc cactcaccaa acgctcaaac 60 gcccctcatg ccccccaagg gtcaaacgct caaagacact cctccctcgg aggaggcaaa 120 cgcatctcct gtcgaaggag acatcaatgc tcaggtggca gcagttatcc tccacctgtc 180 gaaattaacc agtcagcaaa gcctcttcgg tactcagggg aatcacaaac ctcgagaagt 240 aactatccca caagcctcca gcattccgaa ctttactggt cccattcgtg atgctccagc 300 tgtccttctt cacattgatc gtctagagaa acttctccgg actaattcgc ttttttcccc 360 tcaagatgac gatgcccaga cagagaaact gcgagtcatt atcctggagg tggcgaacaa 420 ctctatcgaa tgtaccgcgc tgaaagcatg ggtgaggact actggtcgga tgatggaagc 480 agatggcgag tcgggatggg acgcatggac agtggctttc aaaaaggcag cgatgccgct 540 cgcctgggat cagaaagaac ttcgttccct cctccgccta cagttcaatc aggttagcga 600 ctgggaaacc ttcgataacc aagccataca acaccgacac aatttgatgg gcactgacct 660 ctatcccagt gacacctcta tggccaccct ttaccgcggc gcttgtcccg agccattatt 720 cattcgtttg atgacgacgc cggagttcag gaatggaagc ttggacgagg tacgcaacct 780 gttagaactg acagttgtca agtacttgga gaaccgggat caacccactc atgctcgcat 840 caaacctcaa gcctcggtca cntcgcagac ccaaccgcac ctggctcatc caatcagcca 900 ctattatcaa gctcaaaacc ccctacctta ctatctctct cctgccggta aacacattcg 960 cgaacttttc gccaaagaaa accgatgcaa tgactgccgc aaggtgggcc acaacttcaa 1020 aacctgtccc aaccgccgtg gtcctacgcg tgtcaactct cgccacgtcg acaaccacga 1080 ggttctcatc caagaactta ccagtcagct tgacgaccat ctccagcttg agacgtctat 1140 tgactcagac gagacctcgt acgcactgtt tcatcctcct ctagtcattc aagcctctgc 1200 tgccgctgac ggacgtacct cgattgcccc tccccctcct ctccgatttc tcctcgacac 1260 gggcgctggt acatctttct ttgatccgtc agtggtagct agactaggtt ggaaagtccg 1320 aaaggacgca gtggagcgta ctgtgcggtt ggctggtggg aaacctgggc cggtagtcag 1380 
agatgtgacg agtggaacgt ttagggttcg acatgcaata tattcaggcg aaggagttgt 1440 catggagttg aacggtacat atgatgggat cttgggtctc aatttctgca agcgctatca 1500 tctacttgat caaagtcctc tttttcggcg tctcttgagc agctctggga aagttctcac 1560 cggcaatgac gtcgccgttc ccacgagtcc cccaccagtt tctcatgatg cgaaaatttc 1620 tcctctcgcc gggactgacg tcaccgttcc catgcgccac tccaaaattt ttcacaagtc 1680 aaaagtttcg acaactgctg ccgcgccaaa cgaagacgtc accgttccca cgcctcactc 1740 caaaattttt cacaagtcaa aagtttcgca aactcctgtc gcgcctcagc cttcggcgat 1800 aaccaccaaa ttttcggcga ttccttcatc aaattctgtc tctctcggtg cggccgaaac 1860 tgcttcctac gccgagatca ttgcacaact caagaacgac tttgcagaag ttttttgtga 1920 caaactgggt gatgtagccg atttcccaag cattacgaag accaagagtg gtgttaggtt 1980 cgaaatcatc ttgaaaccta acgctacgcc cagtcatgca gctccctacc gggttcctga 2040 aactcttttg ccccgtttcc gtgagatgat tgctgaacat ctgaatgcag gtcgtctacg 2100 ctactctagt tctccttggg cgtcacctgc ctttctcgtc aagaaaccca atggccaaca 2160 ccgtctcgtc tgcgactttc gcgcactcaa taaccagacc gtgcccgaca tgtatcctat 2220 gggaaacatc caggacatct tgcaccgtgc tgctcgacgc ggcaatcttt tcgccaagct 2280 cgactgtaag gatgccttct ttcagacgtt gatgaaggac gacgacattc acaagacggc 2340 gattacaaca ccgcttgggt tgctggaatg ggtcgtcatg ccgcagggca ttcggaatgc 2400 acctgctgcc caacaacgtc gcatcaatga agcactgcaa ggtctcgctg gagaatgttg 2460 cgaggcgtac gtggacgata tcatcatctg gggaacggac agcgaggatc ttcgtcgaaa 2520 ttgtgtgagt gttctcgcag ctctgcgtcg gagtggtttg cgctgttctc gtgagaaatc 2580 acagttctac ctcaccgaaa cctctttcct cggccatgtc atccgtcctg gtcagatact 2640 ccccgacccc acgaaaatcg cacgcattga acagttccct caccccacga ctcccaaggc 2700 tttacactcg tttcttggtt tattgaacta tctccgtgag ttcataccca gccttgctac 2760 tcacgcttcg gtcttgcacg catgtctccc tcccaacacc gccgccgaga aagcctacta 2820 caggcacttg aaggagcata agggccgttt gcaagagtca tggcagggtt ggaggtggaa 2880 ctatgggcct aaggaacggg tagcctttga cgctttgcgt cgcgccgtga gtactgttcc 2940 ctgcttgact acaatcgact atgatgctgt caaggctggt actctcaagg ttttcctcac 3000 cacagatgca tccaaagtcg gcatgggcgc ctggataggc gtcggtacta ccaaggaaaa 3060 cgctcaacct 
gtcgcctacg attcccgctc gttcaactct gcacaacgca actactccac 3120 acacgagcgc gaactcctgg ccatcgtcca ttctcttgat cattggcgcc ctttccttta 3180 tggtattccc gtgtatgctt tcacagatca ttttacgctc aaatggttct taggtcaacg 3240 caacttgtcc tctcgacaaa tccgctggct ggacatcctc aaagacttcg acgttcgtat 3300 agagtatatc aagggtgaac ataacgtctt agccgaccat ctttcacgac acattggctc 3360 cgacactctt ccgcccaccg atcccgccct gcccgagccc gacaattgta ttctcgcccc 3420 cgtggtcacc ttcgctccca cattagcacc ggagattatt cgtaccattg cccagggtta 3480 tgaccgtgag aatctgttca aggactggct cgctgatcct accacagctc ctggcgtctc 3540 taccatcccc catgacactc atactctcct ccttctcgat aaccgcctct gcatccccga 3600 cgttgacact cttcgtgagg accttatgcg tcaagcacat gaaggcactg cagcacactt 3660 gggcatagag aaaacagtgg aggtattacg tgaaggctac ttctgggaag gcttggttgg 3720 agatgtccgt gagttcgtcc gtgcttgtca ctcatgtcaa caggcaaatg catccaccaa 3780 aaaacctgtc ggtcccttgc atccgctgcc ggtccctcgc gacaaattcg acgatatcgc 3840 tatggatttt gttggtcctc tacccagcgc tggcggccac gactatctcc tcaccatcac 3900 cgaccggtta actggcttca tcgaactcgt cccatgctcc accaccctca acgctcgcga 3960 cctcgccctt ctcttctgga actgttgggt ctctcgctat ggcttaccgc tctccatcac 4020 ctcggatcgt gacgctctct tcacctctcg cttttggaca accctttggg atgaacaggg 4080 tgtccggttg aagatgtcga ctgcttttca tccgcaaact gatgggactt cagagagatc 4140 aaacaagaca gtaggacaat tgttgcggag ttgggttaac agacagggca cttcatgggc 4200 taagttcctt cctcgtattt ctcaggctat gaacaatact gtccgtcgtt ctacagggta 4260 ctctcccatt cagcttgtct ttggccgccg cctccgcacg cttcctggca tcgcccgccc 4320 gcccgccacc cttcactcgc gagtccntcc ctactcgtgc tgactggacc gccgctgctg 4380 ctcaacaaga tctcttcctg gccgacgctc aggacaacct ggtcttggct aagcatcgca 4440 tggctatcca agctaaccgc caccgtcggt cggagatcat ttacaatgtt ggagattggg 4500 tctggttgga tactcgcaat agactcaaag agttcaagtc tggtgatggg gatttccgtg 4560 ctgctaagtt cttcccacgt ttccaaggtc cttacaaggt actggctgct cactccgacc 4620 gctcagtgta taccttggat cttccagact ctccctctgg cagttacacc aagttccatg 4680 ccaaccttct caagccctac ctctcatctt ctcgctttca tcagacctcn cctctgactt 4740 cttcagcgcc atcaagccct 
tcctctagtt cctccccaag gttattacag attctggatg 4800 atcgtgagtt ccgtggtcgc cgtcagcttc gtgtggtatt ctctggcaat ggtcccactg 4860 gccaatggcg ctacctggat gacctccgcc ctctggctgg tttccagtcc ctgtatgagg 4920 agtacttggg tcctgatgat cttgccattt gatcattcct tcctattctt ctctattcct 4980 ctccttattt tctctttctc gtccgcctgt tccttcttcc tttccatttt cttttctctc 5040 ttcatttttt ctttttatct tcttctctct tcttcttttt ctggctcatt tcttctcttc 5100 ctttttgctt attttctggt ttcctctcct cttttctttt cttccatagt gtcatggctt 5160 ggcttgggct gctctgttgg gttatttttt ttggccttta tggtttgttt ttcccctggc 5220 ttgtgctcaa caagactaaa tcttaaaagg ctgcgctcag gagtggttac ctgctgcgtt 5280 ctccttcccc atcttatcca tcctggatct ggtgacaaga atttacagag tcagcccatg 5340 acccccatgc ttatagctgt ttttttctcc tttctttttt gcctctttgt ccttctctta 5400 ttacttctat ttctttttcc tgtccctgcc tcctggtaac tgctgttctt gcttccaggc 5460 ataggctctt tgacagatgg gggagag 5487 // ID Copia-26_MLP-LTR repbase; DNA; FNG; 891 BP. XX AC AECX01001250; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-26_MLP_; KW Copia-26_MLP-I; Copia-26_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-891 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001250; Positions 1128 238. 
XX SQ Sequence 891 BP; 197 A; 204 C; 164 G; 326 T; 0 other; tgttgtggta atgtaatctt tatgtcaagt ccttcacaat caagtcctaa gtacgaagta 60 caagtagagt gcgtgggaca ggtgtgtagt aggtatggta gtgatgaatg gtttcaaggc 120 tttataagta ctcaatatgg atgaatgaga ttaactgaat caggaaaggt agttggaatg 180 tgaggatgtg tagtatcaag ttatcgtgtt catttttcct ctttgagaat agctcctgac 240 tactgcatgt gcgctcttgt gtttcatccc tgtgctgaat ttttcatgtg tgtgttgttt 300 tcctcatata cactccctct ctcgagctct ctcttccttc ctcaccgtta ggaatagaaa 360 gctcaagagt acgtattttg atttttctca ccactgtgtt tatctggatg tgtgctcatt 420 tctctcgaat agtctttccg tccttctctc tcctgctcat ctgcaggtaa ggcgttgatt 480 ccatccaccc atacttcttc tatcattcga gtctttcact ttagagctca atactcacac 540 ctgtctgtcc aggtaaatac ttgttacttt aatactgttt agttctcttt ttgcttagtt 600 tgaataaaag accgttagga atagaaagct caagatcttt ccgtccttct ctctcctgct 660 catctgcagg taaggcgttg attccatccg cccatacttc ttctatcatt cgagtctttc 720 actttagagc tcaatactca cacctgtctg tccaggtaag gcgttgattc catccgccca 780 tacttcttct atcattcgag tctttcactt tagagctcaa tactcacacc tgtctgtcca 840 ggttccttct gctctatttg cgtagatcat tatccgcatc tgattatttc a 891 // ID Copia-4_LBS-I repbase; DNA; FNG; 5134 BP. XX AC ABFE01000971; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-4_LBS_; KW Copia-4_LBS-LTR; Copia-4_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-5134 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01000971; Positions 10640 5507. XX CC Positions [1942-2472] - Integrase core CC 'TTCAT' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 70..4917 FT /product="Copia-4_LBS-I_1p" FT /translation="MTVGESSSSDPVSTPASSRPISLLSPLPNTPLASDTS FT HPSIPIQSVKESSWGDVAVLDKNMNNYAAWNRHVIRILQLSSGLDQYLDGS FT LPAPDPHYEPRAHRHWKLNNAAIQAFIFLKCASSEHQFIENCTTAEDIWSI FT LKKRHVHQGPMTQVTLIQEALSVRYSSSVPFAETTLVLRNLNRRIWDMGAP FT TSEGFLCILMLLVLSADDSLSPVRDSIVTGLSSSTADRAYTSAHIVDRLDY FT EQQARSMTIARTVSVPAEAHVARGSCSSDSKQTICSNCKKPRHTSEFCIQP FT GGGMAGKTIAEAQAAREAKRGKKSKEKMSKPAGSIIQSGNQAYIVDADGKA FT HEIISSSLSSAAMSTVPDSAHFLHTDDINLIDPLVLDSMCAADIAEYAHIA FT ELSWLATQDSLHASVDWHERRRNTEDLGLAAVTAAPLPASSRRTVLSLDTS FT PFLLDSACTTHISPDRSDFMTLHPITDRTVTGVGGSSINALGIGTIKLIIA FT KGSHILLENVLFIPASTVRLISIGCITESLQCSVTFDTTTVTLKNCSGSLF FT ATGTRLLTRKLYSLNCTRLSAEHAFVMADLDTWHRRMGHASNQSILDLATK FT QLAQGMLINLSRSPPKCDSCIHGKQGRTPVPKIRQGERSNRKLGIIYVDLT FT GPEAVKSASGNQYVMNIVDDHSSHPWTFCLKLKSDVLPFLQSWARRAESES FT GERIGIICIDGGELKSDAMNAWCDANGYTLQVTAPYTSAHNGRAEHMHLTV FT MNRMRAMRASTPSVPPNRWDEFAMTAGYLSARTPTRTLGKTPFEVWHGKKP FT DLSHLCEIGSRAFALILHHNPKIYERSFECVLVGYSPHSKAYRLYHPSTHR FT LFESFHVKFIERKDDISHPLYPGCVIDLPVTDDPINITPPSSPPIPSPPSS FT SSSSGSSPKHTSVQDEEESIPSASNPVWQPGPTHEIEVPVPLLHNPANVVP FT VLADDIVPVPADDTDIVPRRSGRPPAPSTKAAENLGIKQIPHVAQAVLESR FT EAGHRLKEQWAQAKFEQRQQVLDLRASTTNVDVVQTPSLPTNAVPDDTLPP FT LVTLDSEADFVAFCEAYAAELAFPLINPRDPDEPTFREAMKSPDSAKWTLG FT IQDELKSLKDMGVYKLVPRSDVPAGRKILRGKWVLILKRDELGDPVRHKAR FT FVVLGYEQIFGQDYGDTTSPTARMESVRLLLNIAAAKDWDIQQIDVKTAFL FT YGLLPADEAQYLEQPESFEEPGKEDWVWCLQRGLYGMKQSGRIWNKTMHKA FT MLNWGFTRLHADPCVYYRVTSLGTVLTAVHVDDFLITASSPEASRSFKEEL FT KSLWTISDLGEARFCVGIAICRDRAKRLVSISQTALIDRIIEQFGQTDADP FT ISTPMDPSVAKSLTRPLPSDPPLSPGNAYDLTRIPYRSLVGSLMYLAVGTR FT PDISFAVAHLCQFLDCYRRPHWNAAIRVVRYLKGTRLLTLTLGGNSELDLV FT GFSDSSHADCPDTARSTMGYCFSVGGTVFTWSSCQQKTVSNSSTEAEYIAL FT SEASREALWLRQFLREVHLLKPGPTVLLCDNNGAKALSSDPTHHSRSKHID FT VRHHFVRERVDDGSLTIWRVPGFDNVADIFTKALPRPDFTHLRPFLGLR" XX SQ Sequence 5134 BP; 1151 A; 1471 C; 1076 G; 1436 T; 0 other; ggttatgggc ctatttggtt tctaggcacc ccgctgtact tgtcgaccaa cttcaaccct 60 tgattaacga tgactgtagg 
cgagtcttct tctagcgacc ctgtgtcgac ccctgcatct 120 tctcgcccaa tctctttgtt gtctcctttg cccaacaccc ccctcgccag cgacacgtcg 180 catccttcca ttcctattca gagtgtcaag gaatctagct ggggtgatgt ggcagttctt 240 gacaagaata tgaataacta tgcagcttgg aaccgacacg tcatacgtat cctccagctt 300 tcatctggtc tagaccaata ccttgacggt tcccttccag ctcctgaccc tcattatgaa 360 cctcgcgccc accgccattg gaaattgaac aacgccgcca ttcaggcgtt catctttctg 420 aagtgcgcgt catccgaaca tcagtttatc gaaaactgca caacggccga ggacatctgg 480 agtatcctga agaaacgcca tgttcatcaa ggtcccatga cccaagtcac cttaatacaa 540 gaagctctct cggtacgcta ttcgtcctca gtccccttcg ccgagaccac tctcgtttta 600 cgcaatctca atcgacgaat ctgggatatg ggcgcaccta catctgaggg tttcctctgc 660 attcttatgc tactggtgtt atccgccgac gactctctat ctcctgtccg tgactccatt 720 gtcaccggct tgtcatcttc cactgccgac cgcgcatata cgtcggcaca tattgtcgat 780 cgtctcgact acgaacaaca ggctcgttct atgactatcg ctcggactgt atcagtgccg 840 gctgaagctc acgttgcccg cggatcatgc tcctcagatt ccaagcaaac catctgttcg 900 aattgcaaga aaccacgaca cacctcagag ttttgcattc aacctggggg tggaatggcc 960 ggcaaaacca ttgctgaagc acaagcggcg cgtgaggcta aacgaggcaa gaagtccaag 1020 gagaagatgt caaaaccagc tggcagcatc attcaatctg gcaatcaggc ttatatcgtt 1080 gatgccgatg gtaaggctca cgaaatcatc agttcgtctt tgtcctctgc tgctatgtct 1140 actgtccccg attccgctca cttccttcat acggacgaca tcaacttgat cgatccactt 1200 gtcttggact cgatgtgtgc tgctgatatt gcggaatatg cgcatatagc cgaactttca 1260 tggctcgcca ctcaagattc tctccatgct tccgttgact ggcacgagag gaggagaaac 1320 actgaagatc taggtctcgc agcagtcacg gcagcacccc ttccagcctc ttcccgacgt 1380 acagtccttt ctttggatac atcgccattt ttactcgata gcgcctgtac gacccatatc 1440 tcacccgatc gctccgactt catgacgcta cacccaatta ccgataggac tgttaccggc 1500 gttggcggat cttcaatcaa tgctctcgga attggcacaa tcaaattaat cattgcaaaa 1560 ggatctcata tcttacttga aaatgtcctc ttcattccag cttccacagt tcgtctcata 1620 tcgatcggtt gcattaccga atctcttcaa tgttccgtca ccttcgacac taccacggtt 1680 actctcaaaa attgttctgg ttctctcttc gctactggta ctcgactcct gacacgtaaa 1740 ctatatagcc tcaattgcac tcgactgtca gcagaacacg 
cattcgtaat ggctgatctt 1800 gacacctggc atcgacgaat ggggcatgca tccaaccagt ccatccttga cttagctacc 1860 aagcaactcg ctcaaggtat gcttatcaac ctatctcgat cccctcccaa atgtgacagt 1920 tgtatccacg gtaaacaggg tcgcacccca gtaccaaaga ttcgtcaggg agagaggtcg 1980 aataggaagt tagggataat ttacgtggac cttactggtc cagaagccgt aaaatcagcg 2040 agtggcaacc aatatgttat gaacattgtg gatgatcatt ctagccaccc atggaccttt 2100 tgtcttaaat taaaatccga cgtgttacct ttccttcaat catgggcgcg tcgagctgag 2160 tcagagagtg gggaacgaat tggcatcatt tgtattgatg gtggagaact caaatcggac 2220 gcgatgaatg cttggtgtga tgctaatggt tacacactac aggtcactgc accatacact 2280 tccgcccata acggacgcgc tgaacacatg catcttactg tcatgaacag aatgcgtgct 2340 atgcgtgcat caacgccaag cgttcccccc aacaggtggg atgaatttgc tatgacagct 2400 gggtacctat ctgcccgcac tccgacgcgt acccttggca aaacaccatt tgaagtttgg 2460 catgggaaga aacctgacct atcccatcta tgtgaaatcg gctcacgggc tttcgccctc 2520 attcttcatc acaatccgaa aatatacgaa cgctccttcg aatgtgtcct cgttggctac 2580 tccccccatt ctaaagctta tcgtctttac catccctcga cgcatcgtct ctttgaatca 2640 ttccacgtca aatttattga acggaaagac gacatttctc acccgcttta ccctggatgt 2700 gtcattgatc ttcctgtgac tgatgaccct atcaatatca cacctccctc ttccccacct 2760 attccctctc ccccttcttc ttcttcttct agtggttctt cgcccaagca cacctctgtc 2820 caagatgagg aggagtccat tcccagtgca tccaatccag tttggcaacc tggacccact 2880 catgaaatcg aagtacctgt ccccttactt cataaccctg ctaatgtcgt tcctgttctt 2940 gcagatgaca ttgttcctgt ccctgctgat gacactgaca ttgttccacg cagatctggc 3000 cgtcccccgg ctcctagtac caaggcagct gagaacttag gtatcaagca aatcccgcac 3060 gttgctcaag cagttcttga atcacgcgag gctggtcacc gcctcaagga acaatgggct 3120 caggctaagt ttgaacagcg tcaacaagtc ctcgaccttc gagcatcaac tacgaatgtt 3180 gatgttgttc aaactccctc cttgcctact aatgctgttc cagatgatac tttaccacct 3240 ttggtcaccc ttgactcaga agcggacttt gtcgcgtttt gtgaagctta cgctgctgaa 3300 cttgcgttcc ctcttatcaa cccacgtgat cctgatgaac ctactttccg cgaagctatg 3360 aaatctcctg attccgccaa atggacgctg ggtatccagg atgaattgaa aagcctcaaa 3420 gatatgggtg tttataaact cgtcccccgt tctgacgttc ctgcaggtcg 
caagatctta 3480 cgtggtaaat gggtccttat tctcaaacgt gatgaacttg gcgatcctgt ccggcacaag 3540 gctcgattcg ttgtattagg ttatgaacag atcttcggtc aagattacgg cgacaccact 3600 tcgcctactg ctcgcatgga atctgttcgc ctgcttctca acatcgccgc tgctaaggac 3660 tgggatatcc aacaaatcga cgtcaagact gctttcctat acggtctcct acctgctgat 3720 gaggcgcaat accttgaaca gccggaatcc tttgaggaac ctggtaagga ggattgggtt 3780 tggtgtttgc agcgtggttt gtatggcatg aagcaaagcg gacgtatatg gaacaagaca 3840 atgcataaag ctatgttaaa ttggggcttt acccgtttac atgctgaccc ttgtgtttat 3900 tacagagtca cttcgctggg cactgttctc accgctgttc acgttgatga ctttctgatc 3960 actgccagct cacccgaagc atctcgttca ttcaaggagg agctcaaatc cctctggacg 4020 atttccgacc ttggtgaggc acgcttctgc gtgggcattg ccatttgtcg tgatcgggct 4080 aagcgtctcg tctctatttc tcaaactgca ctcattgacc gcattatcga acagtttggc 4140 cagaccgacg cagaccccat ttccactcct atggatcctt ccgttgcgaa aagcctcaca 4200 cgtcctttac cttccgatcc tcccctttcc cctggcaatg cctacgactt gacgcgcatt 4260 ccttaccgtt ctttggttgg ttcgctaatg tacttggcag ttggcactcg accggacatc 4320 tcattcgctg ttgcccactt atgccagttc cttgattgct accgacgccc ccattggaat 4380 gctgccatac gtgttgtccg ttatcttaag ggtacacgct tacttacgtt aactctagga 4440 ggcaattccg aattggacct tgttggcttt tctgattcct ctcatgctga ttgcccagat 4500 accgcccgct caactatggg ttactgcttc tctgttggtg gcactgtctt tacctggtct 4560 tcctgccagc agaagactgt ttccaactct tccactgagg cggaatacat tgctcttagt 4620 gaggccagcc gtgaagccct ctggttacgt caattccttc gagaagttca ccttctcaag 4680 cccggaccaa ctgtcctctt atgtgacaac aatggtgcaa aggctctctc ttcggatccc 4740 acccaccatt cacgtagcaa gcacattgac gtccgtcatc actttgttcg cgagcgcgtg 4800 gacgacggtt ccctcaccat ctggcgcgtc cccggtttcg acaacgtcgc tgatatattc 4860 accaaggcgt tgcctcgtcc cgactttact cacctgcggc ctttccttgg acttcggtga 4920 tgcttgtgtg aggaggagca tttctggttg gcgtttcgat ttttaatctc cttgacctag 4980 gtagattttc tttcgacctt gccagctggt agctttttcc cactgggttt tgacgcaccc 5040 ggcgcttgga tttcacatat ctatttctcc tcttttatct tctcttttta tctttctttt 5100 ttatcctctc accaatcctc aatttgagga ggag 5134 // ID Gypsy-37_MLP-LTR 
repbase; DNA; FNG; 962 BP. XX AC AECX01001016; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-37_MLP_; KW Gypsy-37_MLP-I; Gypsy-37_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-962 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001016; Positions 17778 18739. XX SQ Sequence 962 BP; 330 A; 211 C; 133 G; 288 T; 0 other; tgtcaactat aagaagttct gcggagcaga actcctacaa cgtagacaaa acttacaatg 60 taagccccaa aacatagacc tcagggttga accctttctt gcacataaaa ttatatatac 120 aaacaaaatt acaacgctac cactcagtct caagagtgaa aatcgagaac cctgaaaact 180 aaaaagctat gacgaaaaca ttaataataa aagtaaccgt agctacgact aaaataaact 240 tcgcctggtc ctaacaggaa tcaagggtca gacagaccac cagaccctga ccaggcaaag 300 tttgaagaag tcttactaca accaggccag tttagaatat ccgaaccctg gtaaaaaagg 360 taaaacgctc ttaacaaatt ctcatataaa actagatagc ttctctaaaa gaaaagagag 420 gtcatcccat actttaacct ttttaacgag aacacctact tcatttagat acttttctac 480 cctttgtctt agaaaacacc tcgtctgaaa actattcctt cttacttacg acttatcaag 540 atagtgttac ctttttagac atcttaccga tagacatctt taaaataact ctctactcct 600 gattttcgtt ttctcttctc gtttcacttc ttcattttct tccttaagaa gttctttaaa 660 gattgtttag tgagaactac tcgactaaag acttcatacc cattttgatt gaactatttg 720 ttcaacactt ttgctaaagc aaatctgaag ctagtttact cttgtgtcca cttcagaata 780 agcgtactct tctatttaga ggaagagatt cgcttgaaat cactctcagt acatatactc 840 aagtttttct acgaagaatt gaaaaataaa ttcttagctt ataagctcag aatatacatt 900 tcgtaagttc atttcagctt agacctacct actagtcaac agttcgtagt ctcgagctga 960 ca 962 // ID MarinerL-1_AO repbase; DNA; FNG; 9143 BP. XX AC . XX DT 25-JAN-2006 (Rel. 
11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE A family of long Mariner/Tc1 DNA transposons - a consensus DE sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MarinerL-1_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-9143 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-9143 RA Kapitonov V.V. and Jurka J.; RT "MarinerL-1_AO, a family of long Mariner DNA transposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 39-39 (2006). XX DR [2] (Consensus) XX CC The consensus sequence represents an unusual family of long CC Mariners. It encodes a 326-aa Mariner/Tc1 transposase CC (MarinerL-1_AO-1p) and is characterized by 29-bp TIRs and TA CC TSDs. The genome harbors 3 copies of MarinerL-1_AO (99% CC identity). In addition to the transposase, this transposon CC encodes two unclassified proteins: MarinerL-1_AO-2p (three exons, CC pos. 7004-5698) and MarinerL-1_AO-3p (two exons, pos. 8920-7298), CC and a DnaJ chaperone-like protein (two exons, pos. 4510-5307). CC The MarinerL-1_AO-2p and MarinerL-1_AO-3p proteins are important CC for transposition. They are also encoded by other long Mariners CC present in A. oryzae (MarinerL-2_AO), A. fumigatus CC (MarinerL-1_AF, MarinerL-2_AF) and Gibberella zeae. 
XX FH Key Location/Qualifiers FT CDS 261..1238 FT /product="MarinerL-1_AO-1p" FT /translation="MAPNLAPSQHQIICDMIKCDPSLTNAQIAEAANCSTR FT AIPRIRSNLRLFGSSKAPPNKGGRPRSISPIMLEALCDHLLEKPDLYLDEM FT AIFLWDEFQIYATTSSIRRALSSKGWSKKAARQKAKERNSDLRDMYFHLIS FT DFHSYQLVYVDESGCDKRAGFRRTGWSPLGTTPIQVSKFHRDQRYQILPAY FT SQDGIILSRIFRGATDTSVFEDFIEELLLQCGKWPEPRSVIVMDNASFHYS FT ERIDQMCLEAGVKLVYLPPYSPDLNPIEEFFAELKAFIRRHWSVYEDSPEQ FT GFDTFLEWCIDVVGARKQSAKGHFRHAGLTVEES" XX SQ Sequence 9143 BP; 2551 A; 1973 C; 2083 G; 2536 T; 0 other; cagttggagt cgttactgct gtcatcccct tatactcttc gattgttttt cgaaccctaa 60 tgccaagcac gctagtctat tataggaaag gatccggatt aatgtgtttt cataacgcgg 120 tactgtatgg tacttctgta ttatatcacc gaagctcatg tatcttacat gtatatatta 180 tacagacaca accttggtta ccccaccatg atgtttcctg cagataatct cctgacgatc 240 aatcttacca cagggatatg atggcaccca acctggcgcc ttcgcaacat caaattattt 300 gtgatatgat caagtgcgat ccatcactta ctaatgccca gatagctgaa gctgctaact 360 gcagcacacg cgcaattcct aggattcggt caaatctccg gctattcggc agtagcaaag 420 cccctccaaa taaaggtgga cgcccacgaa gcatctcacc aataatgctg gaggctcttt 480 gtgatcatct tcttgaaaag cctgatctat accttgacga aatggccatc tttctatggg 540 atgagttcca aatatacgca actacatcta gtatcaggcg ggctctgtct tctaaaggtt 600 ggtccaaaaa ggcagctcgg cagaaagcaa aggaacggaa ttcagatctg cgggatatgt 660 atttccattt aatctcagat tttcattcct atcagcttgt atacgtggac gaatctggat 720 gcgataaacg agctggcttc cgacgaacgg gctggtctcc tctaggtaca actcctattc 780 aagtgtctaa atttcatcgt gatcagcggt atcaaatatt gcctgcatat tctcaagacg 840 gtatcattct gtcccgtatc tttcgaggtg cgaccgatac ttcagtcttt gaggatttta 900 ttgaggagct gcttcttcaa tgtgggaaat ggccagaacc gaggtctgtc atagttatgg 960 acaatgcatc ttttcattac tccgagcgga tcgaccaaat gtgcttagaa gcgggggtta 1020 aactagtgta tttgccgcca tattcgccgg acctaaatcc aattgaagag ttctttgctg 1080 agttgaaagc cttcattcgg cggcactgga gcgtgtatga agatagccca gaacaaggat 1140 ttgacacctt ccttgaatgg tgcattgacg ttgtaggggc aagaaaacaa agtgcgaaag 1200 ggcacttccg gcacgccggt ctgactgttg aagagtcatg atgcagggac aattgtttgt 1260 gaggcgatct 
cgccattttt gtgtacattc attgagtctc ctcagagcta gccaaaagat 1320 actaactatt agggtagata tatctgtttg ctactaaaca aacatcgaat cgcttcgatc 1380 cgatagctat gtacagtaca gtagagtaat tattatcatt ctgtactctc tgattccata 1440 atagtgaatg ccctcctctc caaaaggtcc aataagtcaa acaatggttc ccccggtcgt 1500 ttctcaggtt ttgatttagc aattagatta aatggtcggc tttctgtttt tcgcagacta 1560 atcagatatg cgtagcaaat cgttctttta acggtggcta aatcaatcag tatcgtcata 1620 aaatgctgtt gaataataac aaaccatatc cttctttttc ttcaactgtt ttccttccgg 1680 tcatgatttc gaagcacctt tcttctggcc atataatttt tgcattcgct ttaattcgat 1740 cagttcttgg ctttaaaaca caacggagat tcgataggac cacctagaat ttgctggcgg 1800 agttgcgaca cctgcatttg atgcaaaagg atcccgcgat cctcgttgct aaggtatgct 1860 gtcactgaac tcatattttc ctacactcag tagtgttgag caaatattga tgatacgttc 1920 gtcggggttg tcggtggttg gagaaagtag gaagggctct gcagaccact atcaggtcgg 1980 tcctaatata tatagcttgt gcggggtaat gtggacgatg gcttcttcca tgacgccctg 2040 ggtgttttgt tgaccagact tctctattgt aatcgaaaga agtacaggtc ttaagaacaa 2100 tagccagtcc accattaatg tcgaagtttt tctattctga tttacaactg attgcaaaaa 2160 cggcgcaaat gtttacacac ttcttctttc tgaaaccctt aagtctaaac tgtatttgga 2220 aagaccacag aagagctata tttttacgaa taatggataa aacggtgaaa aaagatatta 2280 ctgtaaagaa tatttgtaac gccttataca ggtgatccac aggtggtgat attttttcgg 2340 cttgtgagga ttactgccct ggtacgggac atgtccttgg gtggactaac aaatgggaca 2400 aggaccctca tattaggctt ctcggcagca tcctaaaata attcagcaat cttcgatggc 2460 tttatcgata gagccacacc agttgaccca gcccacatta ggaacagtct catttgcaat 2520 ggccgacata gctatgctac aaaaataaat cctagggtag aggatggctt atcaccaact 2580 tccattttcc tctctttctt ttgcttcttt tctacggcca tcacaataat cgctttcgat 2640 ttgaactcta atcccgtcct aaacccgtct gaatcacttc agggcaaaga tggtgaataa 2700 taaatccata accacggtac atagactgtc acatctttca gtgaaaggga tggttgatat 2760 ggaaccactt tccgccggtc cactctgttt aaacaaatta gccacgatgg ttgaaatttt 2820 atgggtagat agcagttact tacttccttg ccgattcgtg tacctttcgc ctccgacaac 2880 gtagtctcgt gggattttac gcacaaacag ggcttcgtac ttggcattgt ggaatgcgtt 2940 atcttgaatc caaaacccca 
gttctctttt taggtcgtat attgttgccc attcaagatc 3000 gtcatcctca tccaagcaga ttttccagag aacgacactc ttcttgccat cgttcacccg 3060 gcgcagtgct tcggttttgg ccgtgtcgaa atcaagatat gcggaaatga acggtgtcgg 3120 gtatctgttc ttccagttca gatgtcgctg gatatattcc acgagctcta tagcatcgtc 3180 ttgtataagg ggatccagtg ttactcggga agaagtattc cctgcgataa aaccttgtcc 3240 cggttcaaat gcagcaacgg agtctttgtc agttacccgg tagaaaaaca tcgtagttgc 3300 gcgtactgat aagtagccga gaggagcagg ggtctgattt caaaatgcag cagggtgaga 3360 aaggtcgaaa tacggcgctt taagaatatc aggacttcag gatcctgaaa ctagtgtaac 3420 aaaagggaca gaagaaccag gaataattcg aatatatata gcgtcgcctc cctcgctgta 3480 caataaatgt gatggcataa gatacagcaa ccacggtgcc tgttggatgc ctcacaccaa 3540 tttggcctat tgacgagcta tgttgggcac acgcgcctga acggtcccat tgtgcagccc 3600 catgttgaca aacggccgac actaccccgc cataatgtcc atgtggcata gggtaattgg 3660 cttgctaaag agccgtcatg tgaaatcgtg taaccttcat ggctatgatt taaatcagca 3720 gggataaatt ggcagggata tagccaagcg tcgctaggcg tgcttacctt agtaaatcat 3780 atcttgtatg gactctagtt tcgatggagc tagatcagga gccatttgtg ctgtggagga 3840 tgatcaacga gatcttccgg aaaggatctt tggtgggctc ctggggtatg ttctccacgg 3900 cagaaggatg cgcttgtata caagttacag gccacgtaaa atacgccgag tcattcaaaa 3960 cacaattaac taccattcaa tacagtggcc tctactgtgg ctttaggtgt atcttagaga 4020 cctgaagagt atgaacaatt gcaccagttt tgcaatcagc tgtacgggat gcctagtatg 4080 gatagagata tgcgcaacgg gaaagtaaaa aattgcatac agctggttgt aaaaacggcg 4140 caaataccca tacttttcct ttttttctgg gctctagtgc taagcctttt taggttaggt 4200 ttcggatcag ctatgatacg cgtaatatac tcgttatact ggtatgggcg tataaaatta 4260 tgccgtacgg ctctgtgatg agctcagcga gagtggggca aagagtgagt gggtgagcac 4320 ttccagcctt ggcttttgac ggaataataa cggcccgaag atatgacgct tccctaaacg 4380 tcacctcccc cccgccccac agtcagtgaa ccttcagcca tcaagctcca ttattttgtg 4440 gatcagactg ctcctgttcg ttagcattcg atcattattt acctggattc acgtcaaatt 4500 agctttcaca tgtcacccac aagtgtcggg cacgactatt atgaaattct cggaatatcc 4560 cacgacgcac agccggctac tgtcaagtta gcatacaaac gtctggcttt agccagacac 4620 ccagatagac ggaagaacga acctaatgca 
acagctgatt ttcaactagt aagatttatc 4680 gctatctact acctaccctt ctggcctctt agggctcgcg cgccttgtta ttgatacagc 4740 tagaaacagc ttagcgaagc ttacggggag ctgtctgata ttcataaacg ccaagaatac 4800 gacaagctat atcgctcagc gatacttccc gggaaaatca agagtcaaaa gatagcagag 4860 cttgaagaac gacttcggca gtttgctctt aaacgcgagg gatcgaaaac attactgtac 4920 aacacaaaga aagaccttat tagactccgc gctgagaaag acagtgtcaa aggagaaaaa 4980 gaacgcctcc tgaaggaaag agctacagag gagacatggt ggtcttacat ctcgtcctta 5040 atgataggaa acacggtgga attcaaccag cggagacaac gacgagagcg tgagataact 5100 gactcgattg ggaaacaacg gacgaaggaa tggaatattg atcttaaact ggcggaggtt 5160 caatatcttg aaagaatgct cgattctatc tcttctgctg agattgaaat caaagttgag 5220 ataacgaaaa tagaagagcg ctggcgcgaa aggctatcat tgcaggaaat ggaaagggta 5280 ttggcgaaat ggaaaaatca aagataatta gcgaagggaa ctcgagtagc aacacggcat 5340 agatctacga aggcagaaac tatagccatc agtcatatat tcaaaaaatt gtggtagagt 5400 atagcgaagt gtgctaagtg gtgccaactg aagaataatc agtggcagga ggaactttgg 5460 tggatttggg acgaaataca cacgtggtaa gaaatgtcct tgtatgagag gatacaagcg 5520 acggaagccg cgctgagtca ccccagtgca tagttacgtt ttaatacaga agctggtaac 5580 agatgtccgg aggaatagtc gtaaaaaagc ttagcctaat cccgattagg gcttctcaaa 5640 cataggaaga gtataaacat ttgcgccatt tttgcaacct agtgtaaacg aatggaatca 5700 aaaaacacat aatgtgttaa gccatccacc aagtcgtaaa ctcattatat ggcccaagtt 5760 cgttgaaccg tctgctttca catcaacctc ctctatgtct cgaagaatat tctctatgta 5820 ttcacctgca ccgctaagtt tcattataag tgcgatagca cgatgaacat caaggagccg 5880 ccgtgagggc gtatcaatta cacgagtggg actaagggtt aaagtccgcg tgactgggaa 5940 gagtggatca cgtaaaaatg gacttcgctc tgttgaatca attttgtact gatatggcac 6000 gcctgtgggt tcgaaataaa tctgaaattc accgaacata cgatgatagt cgagcgttaa 6060 agtaagggcg ttaatggggc tgtcaatctt cggaccatca attagatgga tgacaccagg 6120 gtcaaacata tctaaaatcc gaagcacatt ctttttcgag tcgctctatg gaactatgtt 6180 agcggagcga tgaagcatgt tttagatcag acatactagg tctgcatctc cagaggaaac 6240 tgttgtaaga caatgtggta gaatatgggc cacttccagg aactgaaagc ggtcacttga 6300 ttcgtttttc aattctattc cttcatcgtc cttgcaatct 
tccccatact gctcgaaacg 6360 ttttctagcc tcgctcttat caaattttcg agaaattacg caacgatagc gatcacgcac 6420 aaggcaactt tttcgcaaga tagatacacg gtatggtgtg ccggagggcg tagatgtttg 6480 tattgcggat aatgatgcag gcgtcggttg cggagtcttg acagatgaag ctcggactgt 6540 gatcatccaa gttaaatagt gcttctcaca gtagaatggt caatagccaa catacgcgga 6600 aggaggaaat tctcaatgat gtaatcagcg aattcttcga tcgcgctctt tgctttgttc 6660 ttctcgtctg gactccaaga cgtaaagtta tcgaagaacg tcaagacaat cgtaatatca 6720 gaatcaatca aatcagcagg ctgtgaacat agattttcgt atatcgacga gaaaaagaac 6780 gttaaaaatg tatccttagc taccacatgc tcatatgtgg ccttaataag tgccgctggc 6840 ttgtaacctt ttcgtgcact cctttcgggg ccataacact gaattaagac ctgaagaagg 6900 ttggctgccg attgactttg gtggggtggt aatgagaaag gttttgagaa gttcaggact 6960 ttttctaaag atgactggtg tcgatgcaaa ggatgtgaag gcatcatcga taagcccaat 7020 ccattgtgtg aggtatgcag aggaagccaa ccaaggatgt ttctacaacg cgcctcaggt 7080 cacgtggttg ggagatcgtc gggttcttgt caagtcgaga actgttaaaa agttagttgc 7140 tcatgtaccc gctagtccca cttaagactg tatcgttatc ggtttatata ataaatcttg 7200 gatgactgta acaatatata tatatattcc agtagttaat tgggctagtg acgggttaag 7260 ctatggaaca atacgatcga ttcaacgcgc tttggtctca gtcatgtctg tattggttgg 7320 cccaatctct aatgtcgcta acgaaccggc gtgcgacacc aattatcacc ccttgcttga 7380 cgaagaagtc tgggtcttcc ttcgaaacct gttgaaggtc taatccattt tccaacgcca 7440 catcacgtgc ttttttaata tggtctctat aaatctcacg gccaacacgc gacaagtgcc 7500 aattggcata ttcctccact gcatcatcta agaacccagg aatatcaaca gagtcaatac 7560 agttgggact tgagcttgac tgctcagccg taacagatct gtctgttagg ctttgcgatg 7620 actgcgcggg gaggacattg atattgattg gtggacaaat tgatccactt gcggaatgct 7680 tcgggttctt ttgtttttca agccgcagtg cctcctctgc atagagttgt tcacggacat 7740 cgtcaggaat atcatcatgc gtatccaaga tgcctccacc ctcaacgtat ttgacaagtc 7800 tcctcagatg gtgcgttcta agtttgtaat gcttctttcc aacagggtcc agccagcaat 7860 actgcccttc atggcggcag ggtggcccag gacaacgcat cgtttgatac acttcccgcc 7920 aatatggacg ttgtccagaa gcctgttcag catcgatctg ggcgtctcgt tctgtaagca 7980 ttctcctagt tactgatgac tttcctctct tatctgtatt ccgtgaaaga 
ggagggccac 8040 tgtcctctat atagtttatg gatataaaaa gtttgagctt cttgccaata tgaaacagat 8100 ttccccacat taagagctgt ttctctatag gtttccaatc aatattagtg ccgtcaaaac 8160 gtttgttcag atcagattgt ccacgttcgt ttacagatac tctgactgta gtatcatctg 8220 atctcacacg ttggttgtga cgtatttttc gacgcataac attttcagca tcctgtgtta 8280 tcttcgccca gtgtgaactg ggtgctacag ccaagtcctg ttcagtgtcc tttgacacag 8340 ttcggttgtt cagagttacc ttccactcaa tagtataatg aatacaaggc tttcctctat 8400 gttgcctcgt agtcctttct tcgggctcct ggaagaaacc cagatgattg ggctgggatt 8460 gatgcaaggg agtataaggt tcatcaagta catgttcagg tgatgggcaa aatacggatg 8520 gcgtacgatc tctaccgaag tcaccagggg tgggggcata cgatggagtt tgtatccacg 8580 gatcaggtgg ctgaagctga gaggcatcgt catcgtagta aggactaaac gtcatcccct 8640 caaggcagta gatgccactg agaagcctag tgttgggatc atcatatgtt agcctacacc 8700 atatgggtgt cccagcaaga gtgtccgtga gggaagaggt gcagctaaca aaaccagtaa 8760 aatgatcagg ttcatggaca atgaactaag acaggtacag tattgtagcc ctacccgtct 8820 tggttaacct ggtaaggtca aaaaggatcg aaccgtggct cagtacaaac aaaaggaatg 8880 ttaacagttt gcgggagatg caaggcacat gctttgtcat gtttgacgcg tttgcagtgt 8940 agaagcttcc agctaccgta gattactgat acaaactcaa tacactattt ctataacctt 9000 actgttcaat acagtacgat caaaatttcc ggaatattaa tgttacggtt accttccata 9060 tgtagactag cgcacttggc attagggttc gaaatacgat caaagagtat tggggggggt 9120 gacagcagta atgactccaa ctg 9143 // ID Gypsy-86_MLP-I repbase; DNA; FNG; 5841 BP. XX AC AECX01002125; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-86_MLP_; KW Gypsy-86_MLP-LTR; Gypsy-86_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5841 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01002125; Positions 24959 19119. XX CC Positions [4449-4928] - Integrase core CC 'GCGTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 162..5342 FT /product="Gypsy-86_MLP-I_1p" FT /translation="MGFDLPSNFLSDPEEIIRRQRQRPATIESPAAPTPPR FT YNYRPEAPTDIPESNYTIFASTTPAPIATSTPATNLPDPSDPLITPDSRIV FT RQFFNSKLGDETPRPPGSFIYTPQPTNTMDLGSGSKGKAPEVEEDADKDAL FT IARLQAEVATLQADRSRIMALEASMSKLLASHDATAQGSGGSNVTAGNTHH FT HSHFNNAETPTPLAPRNLMSDAQVTPQMQVPPNSVHNSGGSAALTHISGGQ FT TDLLSDDIDDPPAATPIVNVPAHTAPVPAPTINIPIHTGFIPESQPGTARR FT VSFLVQPLTTAPTDKAVRAGDIKLSEAPKFTGPAESPAALFNWRRLVEQFF FT KLKRLDDVQERLMILGSIVVEPRAGNWLRSMEAELSEMSWREIMHAMAVET FT LPTGWLYDTERAIRQLKMKAQEDFKTYANRARDLYSLIEVESSITVKGLAE FT YVVWGAPDVFQRWVTDRKLLKVERFKWLEFIAAASDIWSLLVASNLVPKVT FT ANQVPAFNQTSNARPYNQPRTSTWSNDRRTVEERADAAWHYHQYLRHLGIC FT SACRTKCGNPQCQGPIKGPYITLPPLSVFDPGPRPARRTINTNVNATTSAR FT PAGAPTQKPAGRPAAPLPVRAIEQIESPSAAPDLFDKEIQRYEEADRILIA FT HMEEETTGCVEEHSRTKSIILRLVVNGVPLRALIDSAAEANLMSEATAKRA FT RIPRRKLVAPIEVSLAISTESKPHIITEFCFANVSSDDPKLRFGSTFFKVA FT PLGNSYDIILGTPFLQKFHLDISLHNRSVTHVPTKTVLLEESVKSEIKDAM FT TKSMVACVIQNLERVESTRDLSKREEQVLREFADLFPEELPAVADDEEDDF FT IPPEGQNPSSKVRHKIVLTDPNVVINEKQYNYPQKYLPAWNKLIEQHLKAG FT RIRKSTSQYASPSMIIPKKDPNALPRWVCDYRTLNKYTVKDRSPLPNVDEA FT VRLVSSGKIYSIIDQINSFFQTQMREEDIPLTAVKTPFGLFEWTVMDMGLT FT NGPATHQSRVEEALGNIIGTYCVVYIDDVVVFSNSVEEHELHVREVLRRLK FT AANLYCSPAKSKLFRTKINFLGHEISGEGVCPDNAKVEKLAKWKTPSSQKQ FT LLKFLGTVQFLKKFIDGLSHYVGTLSPLTSTKRKNDPFQWGQKEDEAFENI FT KRIITTLPVLKKMDYESDEPIWLFTDASGHGLGAALFQGKKWDTSSPIAYE FT SRTMTPAERNYPVHEQELLAVINALQKWKLLLLGMKINVMSDHHSLTHLLS FT QRNISRRQARWLETLSQFDINFNYIKGEENSVADALSRRDEVAALQMDSAL FT DETTLDIIKTGYKTDSFCQRVIKTLPLREGTEWRGDLMFIDNRLVIPEMRG FT LRKQLIQQTHLALGHLGTLKTLAQLRAQFFWPEMANDVKSLLSTCDSCQRS FT KARTTLPSGRLQATDIPRLPMDDICLDFIGPFPKVRGYDMILSCTCRLTGF FT 
VRAIPTRQTDTAEQTAQRLFAAWLAIFGAPRSMIGDRDKAWTSRFWQELNR FT LMQIKVNLTTAYHPQSDGRSEKSNKTIVQILRQMVDSKHGRWFTSLPAVEF FT AINSALNVATGVSPFEFVFGRKPRLFPVSGNLQSTGSDVSKWIEERQAAWA FT QYRDKLWTSRISQAVHYNSRHGSGEVLNVGDWVLIDSKDRQQVVGGDGRPT FT SKLRPRYDGPYEVKEILNDGRNFRLKLDDNDKSHDVFHISKLKRYRWREEE FT CDTMVSK" XX SQ Sequence 5841 BP; 1763 A; 1421 C; 1298 G; 1359 T; 0 other; cttttttaat ctaaaaacat tcaatcaact tattttatca caatcaaccc aacgcgattc 60 catttcaatt ccttatttcc tttttttatt tcgaatctat cgacatcgtt tgatcgacac 120 ctgtcgtgag aaccactacg acaactgaca accgacgctt catgggattc gacttaccat 180 cgaatttcct tagtgacccc gaagagatca ttagacgcca acgccaaaga ccagccacaa 240 ttgagagccc agccgcaccg acacctcctc gatataatta tcggccagaa gcccctacgg 300 acatccccga gagtaactat actattttcg ctagcacaac accagcacca atcgccactt 360 cgacaccagc taccaatcta ccagacccct cggacccttt gatcacccct gacagtagaa 420 tcgtacgaca attttttaac tccaaactag gagacgaaac cccgagaccg cctggtagtt 480 tcatctacac gccacaaccg accaacacca tggatctagg aagtggtagc aagggcaaag 540 cgccggaggt ggaggaagac gcagacaagg acgcactgat cgctagatta caagccgaag 600 tagcaacact tcaagctgat cggtctcgta tcatggcttt ggaggcatca atgtcgaaat 660 tattggcatc acacgacgcg acggctcaag gtagcggggg ttcgaacgtc acggcaggca 720 acactcacca ccattctcat ttcaacaacg cggaaacgcc gactcctctg gcaccacgga 780 atctcatgag cgacgcacaa gtcacgccac aaatgcaagt accaccaaat tcggtgcaca 840 actctggcgg atccgcagcc ttgacacaca tatcaggtgg acagacggat ttactctccg 900 acgacatcga cgatccaccc gcggcgactc caatcgttaa tgtacccgct catacggctc 960 ccgtccctgc tcccacaatc aacataccca ttcatacagg tttcatccca gaaagtcaac 1020 cagggacagc ccgtcgagtt tcattcttgg tgcaaccttt gacaacggca ccaactgaca 1080 aagcagtccg cgcaggggat atcaaacttt ccgaagcacc gaaattcacc ggtcctgctg 1140 aaagccctgc ggccttgttc aactggcgac ggcttgttga gcagttcttc aaactcaagc 1200 gactagacga cgtgcaagaa cgacttatga tcttgggaag catcgttgtc gagccacggg 1260 ctggaaattg gttgagaagt atggaagctg agttgagtga gatgtcatgg agagaaatta 1320 tgcatgcgat ggcggtggag acattaccta cgggttggct gtacgacact gagagagcaa 1380 ttagacaatt gaagatgaag gctcaagaag 
atttcaagac ttatgctaac cgcgcaaggg 1440 acttgtatag cttgatagag gtagagtcgt caatcacggt gaaaggttta gcggagtatg 1500 tggtatgggg cgcacctgac gtttttcaaa gatgggtgac ggatcgtaaa ttactcaagg 1560 tggaacgctt caaatggctc gaattcattg cggcagcgtc ggacatctgg tctttactag 1620 ttgccagcaa cttggttccg aaggtcaccg cgaaccaagt tccagcgttc aatcaaacca 1680 gcaacgcacg cccctacaat cagccaagga cttcgacctg gtccaatgat cgacgaacgg 1740 tagaggaacg agcagatgca gcatggcatt atcaccagta cctacgacac ctcggaattt 1800 gctcagcgtg ccgaactaaa tgcggcaatc ctcaatgcca aggcccaatc aagggaccct 1860 acatcactct accgccgttg agcgttttcg accctggacc gcgccctgca cgtcgtacta 1920 tcaacaccaa cgtcaacgct acgacctcag caagaccagc aggagcacct acacaaaaac 1980 ccgccggacg accagctgca ccattaccag tcagagcaat tgaacaaatt gaatcacctt 2040 ctgcggcacc tgatctcttt gataaagaga ttcaacgcta cgaagaggcg gataggattt 2100 taattgctca tatggaagag gagaccacag ggtgcgtaga agagcattct cgcactaagt 2160 ctatcattct ccgtctggta gttaatggtg ttcctctgcg tgctctcatt gattccgcag 2220 ccgaagccaa cctgatgtcg gaggcaacag caaaacgggc aaggatacct cgtaggaaac 2280 ttgtggcacc cattgaggtg agcctcgcaa tttctactga gagtaaacct cacatcatca 2340 ccgagttttg tttcgctaac gttagttcgg acgatccaaa gttacgtttc gggtcaacat 2400 tcttcaaagt cgcaccattg ggtaattcat atgatatcat cctaggcacg cccttccttc 2460 aaaaatttca tctagatatt tctcttcata accgttccgt cacgcatgta cccacaaaga 2520 ctgttctact agaggaaagt gttaagagtg aaatcaaaga tgcaatgaca aaatcaatgg 2580 tggcttgtgt tatccagaac ctcgaaagag ttgaaagtac gcgagaccta tcgaaacgcg 2640 aagagcaggt gttgagagaa tttgcggatt tatttcctga ggagctacca gcggtcgccg 2700 atgatgaaga agatgacttc atcccgcctg aagggcaaaa cccgtcttca aaagtcagac 2760 acaagattgt cctgacggac ccaaacgtgg taattaatga gaagcagtac aattatccgc 2820 agaagtattt acctgcgtgg aacaaattaa ttgaacaaca tctgaaagca ggaagaatac 2880 gaaaatcaac aagccaatac gcgtcacctt caatgattat cccaaaaaag gatcccaatg 2940 cactaccaag gtgggtgtgt gactatagaa ctttgaacaa gtacacggtc aaagatagaa 3000 gtccactacc aaacgtggat gaggcggtca gacttgtgag cagcggaaag atttactcaa 3060 taatagatca gattaattca ttttttcaaa cacaaatgcg 
cgaagaagac ataccgctga 3120 cagcagtgaa gaccccgttt ggattattcg aatggacagt catggacatg ggtcttacaa 3180 atggaccagc aacccaccaa agtcgtgtag aagaagcctt gggcaatatc attggcacat 3240 attgtgttgt gtacattgac gacgtagtag ttttttcaaa ttcagtagaa gagcacgagt 3300 tacacgttag ggaggtctta cgacggctaa aagctgcaaa cctttactgt tcacctgcaa 3360 aaagcaagct ttttcggaca aagatcaact tcttaggaca tgaaattagt ggtgaagggg 3420 tttgccccga caatgctaaa gtcgaaaagc tagcaaagtg gaagacacca tcatcacaaa 3480 aacaactgct caaatttcta ggcacggtgc agtttttgaa aaaattcatc gacggtttat 3540 cacattatgt tggtactctt tcgcctctga caagcacaaa acgaaaaaat gacccattcc 3600 aatgggggca aaaggaagac gaagcatttg agaatatcaa gagaataatc actacacttc 3660 cagtgctcaa gaagatggac tatgaatctg acgagccaat atggttgttt acagacgcca 3720 gcggtcacgg cctaggagcg gcactatttc aaggaaagaa atgggacacg tcatcgccca 3780 tagcatacga aagcaggaca atgacaccag ctgaacgtaa ctacccagtc cacgaacaag 3840 aattgttggc tgttattaat gcactacaaa agtggaagct gctactacta ggaatgaaaa 3900 taaacgtcat gtcagatcac cactccctca cgcacctttt gtctcaacgg aacattagta 3960 gacgacaagc tagatggtta gaaacactct ctcagtttga cattaacttc aattacatca 4020 aaggtgaaga gaattctgta gcagacgcat tatcacgaag agatgaggta gccgcactgc 4080 agatggattc agcattggac gagacaacgc ttgacatcat taagactgga tacaaaacag 4140 actctttctg ccagcgagtg atcaaaacgc taccgctgcg agaaggcacc gaatggcgcg 4200 gagacctaat gttcattgac aatagattag tcatcccgga aatgagaggt ttgcgtaaac 4260 aactcattca acaaacgcac ctagcactgg gacatctagg aacactcaaa acactcgcgc 4320 aattacgggc ccagtttttt tggcctgaaa tggcgaacga cgtcaaatca ttactatcaa 4380 cttgtgacag ctgccagaga tcgaaagccc ggacgacgct accatcaggg agactgcaag 4440 ccacggacat accacgactg ccaatggacg acatctgcct tgacttcatc ggaccctttc 4500 caaaggtcag aggctacgac atgatcctgt cgtgcacttg ccgacttacc ggctttgtac 4560 gagctatacc aacacgccaa accgacacag ccgaacaaac agctcaacgt ctatttgcag 4620 cctggttagc aatcttcggc gctccaagaa gcatgatagg cgaccgagac aaagcgtgga 4680 catcgcgctt ttggcaggag ctcaaccgat taatgcagat taaagtcaac ctcaccacag 4740 cttatcatcc acaatcggac ggccggagtg agaagtcgaa caagacaatc 
gtacaaattc 4800 tacgacaaat ggttgacagt aagcacggac gttggttcac ttcgttacca gcggtcgagt 4860 ttgcaatcaa cagcgctctg aatgtggcta caggagtctc acctttcgaa tttgtcttcg 4920 gtagaaaacc tcgactattt ccagtctcgg gaaacttaca gtcaaccgga tcagacgtct 4980 cgaaatggat tgaagaacgc caagcagctt gggcacagta ccgcgacaag ctgtggacca 5040 gtcgaatcag tcaagcagtt cactacaaca gccggcatgg aagtggcgaa gttctgaatg 5100 tgggagattg ggtactcata gacagcaaag atcgacagca ggtggtaggt ggggatggaa 5160 gaccaacatc aaaactacga ccccgctatg atggaccgta tgaagtcaag gaaattctga 5220 atgacggccg caacttccga ctgaagcttg acgacaacga caaatcccac gacgtcttcc 5280 acatatcaaa gctgaaaagg taccgatgga gggaggagga gtgcgataca atggtgtcaa 5340 agtaagttcc tcccataagt atgcaccgcc gggcaactac gccaaatcaa attgtgtata 5400 aaacattacc ttggccacac atgtgagcac cgaaccgggc gtctggcttg tttcctctcc 5460 acgatagata cgaagcatat aaacgaagaa caaattacga gattaagatc aagacttcaa 5520 gattttcatt atttaattac atcaagactt caatatttta attattcaat tacttcaaag 5580 atattcatca aggagcaata taggtggaaa aatatcaggt cctttttttg ttttctgggt 5640 gcttttcaat tttttctttt cctttttcga attttaattt tcaggttgcg atacattttc 5700 aatttttttt agttttcttt ctaattgcac tagggacgag tcatggtcaa ttttttgggg 5760 cgagttatgc catttttttt tctttttttt cagttgattt ttatttattt ttcaaaattc 5820 tttttcttag gaggggaggg a 5841 // ID Gypsy-82_MLP-LTR repbase; DNA; FNG; 206 BP. XX AC AECX01001127; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-82_MLP_; KW Gypsy-82_MLP-I; Gypsy-82_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-206 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001127; Positions 34646 34441. 
XX SQ Sequence 206 BP; 61 A; 48 C; 21 G; 76 T; 0 other; tgaacctcct tttaaggagt catttaactg aagtctgtct ggatttatcc aaacagacca 60 gttctcctct tccctcttct ctcttttaac cttataattt tattgtaata catcatattg 120 aaatatatct ttaattaact aaaataacca acccacctag ccatttttag tgagcattga 180 gtagtctttt gaatacccaa ccctca 206 // ID NHT2_LTR repbase; DNA; FNG; 228 BP. XX AC AY038360; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Nectria haematococca copia-type retrotransposon NHT2_LTR, long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; KW COPIA superfamily; Long terminal repeat; NHT2_LTR; KW retrotransposon; target site duplication. XX OS Nectria haematococca OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; Hypocreales; Nectriaceae; OC Nectria; Nectria haematococca complex. XX RN [1] RP 1-228 RA Shiflett M.A., Enkerli J. and Covert F.S.; RT "Nht2, a copia LTR retrotransposon from a conditionally RT dispensable chromosome in Nectria haematococca."; RL Curr. Genet 41(2), 99-106 (2002). XX DR Genbank; AY038360; Positions 6 233. XX CC 5bp target site duplication. XX SQ Sequence 228 BP; 63 A; 56 C; 36 G; 73 T; 0 other; tgtcgcatat caccactagg ttcgtgcgac tggatatgaa acgcggccca tactttaggg 60 tcgtatatcc ggtttcacat ccaacttgtc gcatcaagga ttccaatcct aacgatatga 120 gcttaaatag ctctgcctga acacatagct atttaatttc ttttgttctt cagataattc 180 tcatcaattg actagtgcca tcaataatcc ctatacagcc aattatct 228 // ID Gypsy-13_CCO-LTR repbase; DNA; FNG; 188 BP. XX AC AACS02000012; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the mushroom Coprinopsis cinerea genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-13_CCO_; KW Gypsy-13_CCO-I; Gypsy-13_CCO-LTR. 
XX OS Coprinopsis cinerea OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Psathyrellaceae; OC Coprinopsis. XX RN [1] RP 1-188 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the mushroom Coprinopsis cinerea RT genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AACS02000012; Positions 319185 318998. XX SQ Sequence 188 BP; 46 A; 48 C; 37 G; 57 T; 0 other; tgtaagggcg ggccttacac gatcacgtgt agtcattcca tctcattaga ctgcccgccc 60 acacaagtag tatcacctta tctctcttgt acgtgtactc gaaaaaccgc tatagactcg 120 attcatacgc attagacgca ttgtgagtgt ttgctgttct cctgtttctt aaggagaaca 180 gatttcca 188 // ID Copia-56_MLP-I repbase; DNA; FNG; 4859 BP. XX AC AECX01000392; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-56_MLP_; KW Copia-56_MLP-LTR; Copia-56_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4859 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000392; Positions 120830 115972. XX CC Positions [2061-2585] - Integrase core CC 'AGAGG' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 113..1027 FT /product="Copia-56_MLP-I_1p" FT /translation="MSGRESASDESQLPTPASTPAPFQEEIDKTNANSQLE FT DQSDHSSSAETAQSSHTVTPAGQDPSLVMASNDPIDTDTPAQRQSIKMNSI FT LRKIAISTKLTASNYTAWSDGIRFGLMATSYDHYLDSEDAGGQIDAEIVFA FT TKKAIFFWLLASIESTQSTRFISMISKFENGIKSTTPSPSLLWKTIRDYHI FT SNSESVKLMLCSEITDLSQGSTKDLIDYIDIFRSKVDAYLGSNGEMSEEEQ FT ARQFVRSLNREWAEKGCDLLDAGHVKFRNLKTELKKTHQTRKMFSSNRQQS FT SRIAETSESSQGN" FT CDS 1812..3560 FT /product="Copia-56_MLP-I_2p" FT /translation="MMYVALRSAVSPSSHQSSISKTNQLTILKLHYSLGHP FT SEKYLKRMWGLGYFKDVLPKTITSKDFEIITKCPVCPLAKSHCLPFLSTRP FT RANKFLGNVHVDLSGIIRTSSVSCEEYYILFTDDYSSYRVSFGLPDKSAET FT VFECFKRYIIFAERQTGETLKMFSFDGGGEFINGLLTPYLEDLGIVTRVTS FT PHTPKENSVAERSNRTINTKARCLMVQSCVPIKYWYHAVSYAVMLQNRTIT FT TSLNLQTTPHSLWNDRLTSMKRFQPFGCLVYRHIRKEIRGGKFEPVLRTGV FT LLGATENNHNFIILDLETNKIHISHDVTFQPLVFPFMKDAEDGPDWMFIED FT LPIEPTDDKESLNEYPIDSINQRDTIDDVDDLIIETTHPSPIKEPQTINDD FT QIDPSLIHPLPADEVLPIEAEQPVPELPAAEQPGPKEPRRSERAKQPIERY FT TPGKTNFIEAFPMRSIVPKMSYSNCSWSVYQCREAMKARAEPKSYKMAMKA FT PDADKWRAACDKEMQNIMDMGVWEIVDRPKDAPVVGGRWHFKYKLIQTVVC FT QNIRPDMLLKGILKLKALILTKLLLLLDNWLLFVYW" FT CDS 3780..4859 FT /product="Copia-56_MLP-I_3p" FT /translation="MGLTQNPHDACLWYGKDEDGLETLIYLHVDDMAITGD FT KIAEVKSLLKQKWRMEDLGPAHCVMGIEINSTEEGGYSLSQPGFIQTVLER FT FNAENSKPASTPLPGGTKIIKASDNDLIEFQRTKLPYNSLVGSLMYIAQGT FT RPDIAYAVGALSQHLSKPSLFAWQMGMHVLRYLKGNQSLGLIYLPQNTAIE FT GNQSWFYPECHTDSDWAGDPSTRRSTTGYVFKLNGAAVSWKSRLQPTVSLS FT STEAEYRATTEAGQEVVWLRGLLSNISLNQESPTILCSDSTGAVSLTQKSI FT FHSRTKHIEVQYHWIREQVKKKTIKLRHISNKNMFADTLTKPLHPGPFREL FT RDQVGLDVIHGHLKQGE" XX SQ Sequence 4859 BP; 1551 A; 1069 C; 1029 G; 1210 T; 0 other; cctctaggaa attgtcacgg gatattcagg tacctcttcc cggatttttc gcttttcatg 60 gtagcgagag tcagttcgac tcttagttat tgatttagaa aacgaaaatc atatgtcagg 120 aagagagagt gcttcagacg aatcccaatt acccacaccc gcatccactc cagctccatt 180 ccaagaggag atcgacaaga caaacgccaa ttctcagctc gaagaccagt cggaccattc 240 tagctcagcg gagacagctc aatcatctca tacagtaacc cctgctggcc aagatccgag 300 
tctagtaatg gcgtctaatg accccataga taccgatacc cctgctcaga ggcaatccat 360 caaaatgaac tcaatcctcc gtaagatagc gataagtact aaacttactg cttccaacta 420 cactgcttgg tcagatggaa tacgcttcgg tttaatggca acctcatatg atcactacct 480 cgattcagaa gatgccggag gacagatcga tgccgagatt gtttttgcaa ccaagaaggc 540 tatttttttc tggcttcttg ctagtatcga atccacgcag tccacccggt tcatatcaat 600 gatatccaaa tttgagaacg gaatcaagtc gactacacct tcaccgtccc ttctatggaa 660 gacaatccga gactatcata taagcaactc ggaatcagtc aaactcatgc tctgtagtga 720 gattactgac ttatctcaag gatccaccaa agacttaatc gattacattg atatctttag 780 atcaaaagta gatgcatacc ttggatcgaa tggggaaatg tctgaagaag agcaagcacg 840 tcaatttgtc agatcactca accgcgagtg ggctgagaaa ggctgcgacc ttctggatgc 900 cgggcatgtg aagtttagaa atctaaaaac tgagctcaag aaaactcacc agactcgtaa 960 gatgttctca tccaaccgac aacagtccag ccgaatcgcc gagacatcag aatccagtca 1020 aggaaattga acaggtagat ggcagacttg cagccgaaac cgttgtatgg gccaagagca 1080 ccccaccaag ccgcacgatc aatcagagtg ttaccatcat cctaataacg ctggtaaaat 1140 agagctggaa acgatcgaag caagaggctg gcgagtgggt tgaataccca agaggtggtc 1200 gtggaggaaa tcgcggtggt tctcgaggtg gtttttgtgg aggttttcga ggaagaggac 1260 acggtcagag cagttctcat gaaaatcgag caactagtcg acatatcgtg gattctgact 1320 tccctaattc agaggatctt caatcggcat tcagcaacct aagattggaa gatcgcgaag 1380 tcagctacag tgtggaattg gacggaaatt cctcatgctc agcacaacct caattgtcat 1440 gcataggaga tcattgtgat tcagttgccc taatcgatac cggcgcctca catcacatgt 1500 tccatgaccc cggacttttt gaatcagcta ccttgatgga aaatcatgac cctggagcta 1560 agctgaacct agctggaggc ggcactaccc tcaacatcca ctctattgga aatgttaact 1620 tactgaattc aaaaggagaa aatatcaaat taaaagattg tctttatgtt ccagaattat 1680 cctgaaacct cattgctgga ggaagactac tcagagccgg agctgtaaca actgtcctcg 1740 aagatccaaa ctttcgaatt gatcatggaa agaaagagct tttcattggt aggttcattg 1800 gagaaggaag tatgatgtat gtggcattac ggtcagctgt cagtcctagt tcccaccagt 1860 catcaatttc aaaaacaaat caactcacca ttctgaaatt acactattcc ttaggtcacc 1920 caagcgagaa atatctaaaa aggatgtggg ggttgggtta ttttaaagat gtacttccta 1980 agactattac ctcaaaagat 
tttgaaatta ttaccaagtg tcctgtctgt cccttagcaa 2040 agagccattg tctgccattt ttgtccacta gaccacgagc aaacaaattc ctaggaaacg 2100 tacatgtcga tctaagtgga atcattagaa cctcatctgt gagttgtgaa gagtactata 2160 tcctcttcac tgatgactac agtagctacc gagtttcttt tgggttacca gataagagcg 2220 ctgaaaccgt cttcgaatgt ttcaagcgat acatcatttt tgccgagaga cagactggag 2280 aaacattaaa aatgttctca tttgacggag ggggagagtt tataaatggc ttgctcaccc 2340 catacttaga ggatttaggt attgtcacca gagtaacgtc gccacacact cccaaagaaa 2400 acagcgtggc tgaacgttca aaccgaacca tcaatacaaa agcaagatgc ctgatggtac 2460 agtcgtgtgt accaatcaaa tactggtacc atgcagtatc gtacgctgta atgctacaga 2520 atcgtaccat tacaacctcg ctcaaccttc aaacaacacc tcactctctc tggaacgata 2580 ggctgacaag tatgaaaaga tttcaacctt ttggctgtct agtttaccgt catatcagaa 2640 aggagattag aggaggaaaa tttgaacctg tgttgcggac aggagtcctc ctaggagcaa 2700 cagaaaacaa ccacaacttt ataatccttg accttgaaac aaataagatt catatcagcc 2760 atgatgtaac ttttcaacct cttgttttcc cttttatgaa ggatgccgag gatggacctg 2820 attggatgtt cattgaggat cttcctattg aaccaacaga cgacaaagaa tcattaaatg 2880 aatatcctat tgactcaata aatcaacgtg acacaataga cgatgtcgat gatttgataa 2940 ttgaaaccac ccacccaagt ccaattaaag aaccgcaaac aataaatgat gatcaaatag 3000 acccatcctt gatccatcct cttccggcgg acgaagtcct tcccatagaa gctgaacaac 3060 ctgttcctga attacctgct gctgaacaac ctggcccaaa ggaaccaaga cgatcagaga 3120 gagcaaaaca acccattgaa cgatatactc ctggtaaaac aaacttcatc gaggcattcc 3180 caatgcgtag catagtccca aagatgtcct attcaaattg ttcttggagc gtataccagt 3240 gccgagaagc catgaaagct cgtgccgagc caaaaagtta taaaatggcc atgaaagctc 3300 ctgatgctga taaatggagg gctgcctgtg ataaagaaat gcaaaatatc atggatatgg 3360 gagtttggga aattgtagac cgcccaaaag atgctccggt ggttggtggc cgatggcatt 3420 ttaaatataa attaatccag acggtagtgt gtcaaaacat aaggccagat atgttgctaa 3480 agggtatact caaactgaag gcgttgattt taacaaaact tttgctccta ctggacaatt 3540 ggcttctttt cgtatattgg tagcagttgc ggctgggaaa ggatggtcaa tagagcagat 3600 ggacgccatt gctgctttcc tcaacagaga cctcaaggaa caaatttatc ttgagttacc 3660 agaaggttat gatgcggaaa gagcaaacgg 
aaaagtcgca caattaaaaa aaggcacttt 3720 acggcctcaa gcaatcggcg agatgctgga gtgacaaggt taaggaaaaa tttatcagca 3780 tggggttaac ccaaaatcca cacgatgctt gtctatggta tggaaaggat gaggatggtc 3840 tagaaacttt gatatattta catgtagatg atatggctat taccggcgat aaaatagcag 3900 aggttaagag tctattgaaa caaaaatgga ggatggagga tctgggtcca gctcactgtg 3960 tcatgggaat tgagataaat tctactgaag aaggtggtta ctcattaagt cagcctggtt 4020 tcattcaaac agtgctagaa agattcaatg cggaaaacag caaacctgca tcaactcctc 4080 tccctggcgg cactaaaatt attaaggcta gcgataatga tctaattgaa tttcaacgaa 4140 caaaacttcc ttacaacagc ttagttggta gcttaatgta cattgctcaa ggcacaagac 4200 ccgacatagc ttacgcggtt ggagctttat cacagcattt gtcaaaacca tcactatttg 4260 catggcaaat gggaatgcac gtacttagat acttgaaggg gaatcagtca ttaggattaa 4320 tttacttacc tcaaaatacc gctatcgagg gaaatcaaag ctggttctat ccggaatgtc 4380 ataccgactc cgattgggct ggtgatccca gtacccggag atcaacaacc ggctatgtat 4440 ttaaactcaa cggagcagca gtcagttgga aaagccgtct tcagccaaca gtctccttat 4500 catcaactga agctgaatac agagcaacaa cagaagcagg acaagaagta gtttggcttc 4560 gtggtctttt atccaatatc tcactcaatc aggaatctcc aacgatcctc tgtagtgata 4620 gcacaggagc agtatccctt actcaaaaat caatattcca ttcccgaaca aaacatattg 4680 aagtacagta ccactggatt agggagcaag tcaagaagaa aactatcaaa ctacgacaca 4740 ttagcaacaa aaatatgttt gctgacactc tgacaaaacc ccttcacccc ggaccattca 4800 gagaactccg tgatcaagta ggattagatg taatacatgg acatctgaaa cagggggag 4859 // ID Gypsy-16_MLP-I repbase; DNA; FNG; 5743 BP. XX AC AECX01001344; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-16_MLP_; KW Gypsy-16_MLP-LTR; Gypsy-16_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5743 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001344; Positions 205638 211380. XX CC Positions [4387-4866] - Integrase core CC 'TGGGG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 126..3194 FT /product="Gypsy-16_MLP-I_2p" FT /translation="MGFALPTNFTNDPESFLRRQRQLRQRQQDTDTSPAAP FT PPPRFNWKPEAPLDTPDSNFTLQASITPRPITTSTPAGFPDPFEPLITPDS FT RIVREFFNSRLSDPTPRPPGSFIFTPYNAMESSSGSKGKTVEVQDDSDKDA FT LIERLQAEISSLKSDRDRIHALEASMSRMMSMQQPTITTGGGSTGSNTTTQ FT PTNWFSRFRSPGTPTPAGPKTPPTTAGTPAQAMRTEAVPLTSIVEEHVDRN FT EERTREDAAEYSLPRASYAAAPWSHLGAEAVDDTPRRVSFAQASTVPTAYR FT HNAAHDEPDNRVVRACDIKLSDAPKFTGPVEDPAALFNWRRLIEQFFKLKR FT LDSTEERLIILGTIVVEPRAGNWLRRMESELVHLPWTEIMHALATETLPTG FT WLYDTEKAIRQIKMRPNEDFKVYANRGRDLYALVEVESSISIKNLAEYVVW FT GAPEVFQRWVADRALLKVANFKWFEFIAAASDIWLLLQSSNLLPRGGFNTN FT TAPRQTGFQPRPTQYAQAATMNDRRTPDQRADYAWHYHEYLRHKGICSACR FT TECGNPNCQGPIKGPYIQLPPLSVFDPGPRPIRRATTLAAPAPSRQNPPAG FT APTSKPAGRPAQSVAIRAVSESPSLSYAPDLSDKEIERYEEADKILVAHME FT EELSGCVDETKRTRSIILQLMINGTPMRALIDSAAEANLISEPSAMKAKIP FT RRRLVKPVEVSLAISSDATPFVINEFCFANVSSADPKLRFGSTFFKVAPLG FT NNYDVILGTPFLEKYHLDVSLHNRTVTHVPTHTVLLEESAKKEIENSMKNS FT MIACVIQNLERVKTTRDLSKREERVLEEFSSLFPDELPAVAEDEGDDFIPP FT EGQNASSRVTHKIVLTDPNVVINEKQYNYPHKYLNAWSKLVTQHLKAGRIR FT RSTSQYASPSMIIPKKDPNALPHWVCDYRTLNKYTVKDRAPLPNVDEAVRL FT VSTGKIFSIIDQINSFFQTRMREEDIPLTAVKTPFGLFEWTVMAMGLTNGP FT ATHQGRVEEAKLLCSVY" FT CDS 3706..5280 FT /product="Gypsy-16_MLP-I_1p" FT /translation="MASPIAYESRTMSPAERNYPVHEQELLAVVNALQKWK FT LMLLGLKINVMSDHHSLTHLLTQRNLSRRQARWLETLSQFDLDFNYIKGEH FT NTVADALSRVEEIAAIEITAMLDDQTVQDIKAGYKNDPFCLRVKKTLPLRE FT GTEERDEMIYVDGRLLIPHIDKLRDRLIDKTHNALGHLGSLKTLAQLRHQF FT FWPEMTKDVTNFVSSCDSCQRSKARTTLPSGRLQATEVPRRPMEDISSDFI FT GPFPKFRGYDMILSCTCRLTGFVRAIPTSQTDTAERTAQRLFGAWLAIFGA FT PKSMIGDRDKTWTSRFWQELNGLLHIDVKLTTAYHPQADGRSEISNKTIVQ FT ILRHLVENRHGKWLEALPAVEYAINSAVNVATGISPFEFVFGRRPRLFPVT FT 
GKLTDKGSDTTKWIEERQGAWAQYRDKLWTSRITQALHYNNRRSAGEVLKC FT NDWVLIDSKDRQQIVGGQGKPTSKLRPRYDGPYQVSECLNDGRNFRLRLND FT DDKSHPIFHISKLKRYRWREDEQGTGAQK" XX SQ Sequence 5743 BP; 1713 A; 1404 C; 1270 G; 1356 T; 0 other; cttttttaaa ctatcaccga aactaaaaat tcaaacttta aatcgacctc tgagcactct 60 gaaaacttaa ttcatctgac atccgtgaga taatcgaatc cttgacaagt catcaagtct 120 ttttgatggg attcgcatta cctacaaatt tcaccaacga cccagaatct ttcctacgcc 180 gacaaagaca gctccgacaa cgccaacaag acactgacac ctcgccagca gcgccacctc 240 ctcctagatt taactggaaa ccggaagccc ctctcgatac acctgacagc aactttacct 300 tacaagccag cattactcca agaccaataa ccacatccac acctgccgga tttcctgatc 360 ctttcgaacc attgattacg cccgatagta gaattgtaag agagtttttt aactcgagac 420 tttccgatcc gacccccaga ccccccggta gtttcatttt taccccttac aacgcgatgg 480 agagctcatc gggttcgaaa ggcaagacgg ttgaagttca agatgactca gacaaagatg 540 ctctcataga acgcctccaa gcagagatct cgtcactcaa atccgatcga gaccgaattc 600 atgccttgga ggcttcaatg tctcgtatga tgtcaatgca acagcctaca atcacaacgg 660 gaggcggcag caccggctca aacacaacaa cccagcctac caactggttc tcacgatttc 720 gaagccctgg tactccgact cctgcaggac ctaaaacacc acccaccacc gcgggaacac 780 ctgctcaagc catgagaact gaagccgtcc cattaacttc aattgtcgag gagcatgtcg 840 atcgcaacga ggaacgtacc agagaagacg cagccgagta ctccctgccg cgagcatctt 900 acgcagcagc cccatggagt cacctaggag ctgaagccgt cgacgacaca cctcgacggg 960 tttcttttgc tcaggcttcc accgtaccaa ccgcctatcg tcataatgca gcacacgatg 1020 aaccagacaa ccgcgtggtc cgagcttgtg acattaagtt atcagatgca cctaaattca 1080 caggcccagt cgaagatcct gcagcactct tcaactggcg acgacttatc gaacaattct 1140 tcaaattgaa acgattagat agcactgaag aacgtctgat aatcctaggt accatcgtag 1200 tggaacctcg agcagggaac tggctaaggc gaatggagag tgaactcgta cacttaccat 1260 ggactgagat tatgcacgca ctggcaacag aaacgctacc caccggatgg ctttacgaca 1320 ccgaaaaggc aattcgacag atcaaaatga gaccaaacga agactttaaa gtttatgcca 1380 atcgaggacg agatttgtat gccttagtcg aagtcgaaag ctcaatctcg atcaagaact 1440 tagccgagta cgttgtttgg ggggcaccag aagtgtttca gcgttgggtg gcggatcgag 1500 cattactgaa ggtggcaaac ttcaaatggt 
tcgaattcat cgcagcggct tcggacatct 1560 ggttacttct tcaatctagt aatttgttac ctcgtggagg attcaacacc aacactgcgc 1620 ctcgtcaaac cggttttcaa ccgagaccga ctcaatatgc tcaagctgcg acgatgaatg 1680 accgacgaac tcctgatcaa agggctgatt acgcgtggca ctaccatgaa taccttcgac 1740 acaagggtat ttgttcagcc tgtcgaaccg agtgtgggaa cccgaattgc cagggaccaa 1800 tcaagggacc ttatatccaa ttgccacctt taagtgtatt cgaccctggt ccaagaccga 1860 tacgtcgagc aacaactctc gcagcaccag cacccagtcg acaaaatcca ccagcgggag 1920 cgccaacgag taaaccagct ggtcgtccgg cacaatctgt agcaatccga gcggtatctg 1980 aatcaccttc actcagttat gcaccagatc tatcagacaa agaaattgaa cgctatgaag 2040 aggccgacaa gatcttggta gcgcacatgg aggaagagct atccgggtgc gtagacgaaa 2100 ctaagcgtac aagatccatc atccttcagc tgatgataaa tggaactcct atgcgcgcct 2160 tgatagactc agcagccgag gccaatctca tatcggaacc ttcagcaatg aaggcaaaaa 2220 taccacgccg acgactcgtt aagcctgttg aggtgagtct agccatctct tccgacgcca 2280 ctcccttcgt aatcaatgag ttttgctttg cgaatgttag ctcagctgac ccgaagttgc 2340 gatttggatc aactttcttc aaggtcgcgc cgttagggaa caattatgac gtaattttag 2400 ggacgccgtt tttggaaaag taccaccttg atgtttcact tcataaccgc acggtaacac 2460 atgtacctac tcatacagtc ctgttagaag aatccgcaaa gaaagaaatt gaaaattcaa 2520 tgaaaaattc aatgattgct tgtgtaattc aaaacttgga acgagtcaaa accacccgtg 2580 acttatcaaa acgagaggag cgcgtgctag aagaattctc tagcttgttt cctgacgaat 2640 tacctgcagt agcagaggat gaaggagatg atttcattcc gcctgaaggg cagaatgcgt 2700 cttcgcgagt gacccataaa attgtcctca cagaccctaa cgtggtgatt aatgagaagc 2760 aatacaacta tccacataag tacttgaacg catggtcaaa gcttgtgaca caacacttga 2820 aagcgggacg cattcgacga tcaactagcc agtatgcgtc accgtccatg attataccaa 2880 agaaagatcc caacgcttta ccacactggg tttgtgatta tcgaacgttg aacaaataca 2940 cggtgaaaga cagggcgccc ttacctaatg tggatgaagc agtcaggctg gtaagcacag 3000 ggaagatatt ctctattatt gatcaaatta attccttctt tcagactagg atgagagaag 3060 aagacattcc cttgaccgcg gtgaaaacac cgtttggatt gttcgagtgg acagtaatgg 3120 caatgggact aacaaatggt ccagccacgc atcaaggccg tgtagaagaa gcgaaattat 3180 tgtgtagtgt ttattgatga tgttgtagta ttttcaaatt 
cagtggagga acacaagatt 3240 catgtcagag aagttttacg ccgactacaa gcagccaaac tgtattgttc tccagcaaaa 3300 agcaaattat tcagaacaag catcaacttt ttggggcatg aaatcagcgg cgagggagtt 3360 tgtccggacg atgccaaagt ggagaaaatt gctaagtgga aaacaccaag tacacaaaag 3420 caacttttga aatttctagg cacggtacaa ttcttgaaaa agtttattga cggattgtca 3480 cactacgtag gaacgttatc acccttgaca agctccaaac gaaaaaatca accttttcag 3540 tggggcagga aagaagacga agcttttgaa aacatcaaga ggatcatcac gacattaccg 3600 gttctgaaac agattgatta cgactcggaa gatcctgtct ggttgttcac ggacgcaagc 3660 gggcatggac taggagcagc tttatttcaa ggagcaaaat gggacatggc atcaccaatt 3720 gcatatgaga gcagaacgat gtcaccggct gagcgtaatt acccagtcca tgaacaggaa 3780 ctgttggcag tagtaaatgc acttcaaaaa tggaaattga tgctcctagg gttgaaaatt 3840 aatgtaatgt cagatcatca ctcattgacg cacttactaa ctcagaggaa tctcagccgc 3900 cgccaagccc gctggttgga aacattatca cagtttgacc ttgatttcaa ctacatcaag 3960 ggtgaacaca acacagtcgc ggacgcattg tcacgagtcg aagaaatagc cgccattgag 4020 atcactgcga tgcttgacga tcaaaccgtt caggacatca aagcaggcta caagaacgac 4080 cccttctgct taagagttaa gaagacatta ccgctgcgcg aaggcactga agaacgagac 4140 gagatgatct acgtcgacgg aagactctta attcctcata ttgacaagct acgggatcga 4200 ctgatcgaca aaacccacaa cgccttaggt caccttggaa gtctgaaaac cttggcgcaa 4260 ctacgacatc aattcttctg gcctgagatg acaaaagacg tcacgaactt tgtatcatca 4320 tgcgacagtt gtcagcgatc aaaggcaaga acgacgcttc cgtcaggtcg acttcaagcg 4380 acagaagtcc caagacgacc aatggaggat atctcgtcag attttattgg tcccttcccc 4440 aaattcaggg gctatgacat gatactttct tgcacttgta gattgacagg gttcgtacgc 4500 gccattccaa cttcccagac ggacaccgcc gaacgaacag ctcaacgact attcggtgcc 4560 tggcttgcaa tctttggcgc ccccaagagc atgatcggag accgcgacaa aacgtggact 4620 tcgcgatttt ggcaagaact caacggcctg ttgcacattg acgtcaaact gaccacagcg 4680 tatcaccccc aagctgacgg gcgaagtgaa atctcaaaca agacaatcgt gcaaattcta 4740 cgacacctcg tcgaaaaccg acacggaaag tggctagaag ccttacccgc cgttgaatac 4800 gcaattaata gtgcagtaaa cgtggcaacg ggcatttcac cattcgaatt tgtgttcgga 4860 cgtcgacctc gactttttcc tgtgactggt aaattaacag acaagggaag 
tgacactacg 4920 aagtggatag aagaacgtca aggagcctgg gcgcaatacc gtgataaatt atggactagc 4980 aggataacgc aagcattaca ttacaacaat cgaagaagcg cgggcgaggt cctcaagtgt 5040 aatgattggg tactgattga cagtaaggat cgccaacaga tcgtcggagg ccaagggaaa 5100 ccaacttcga agctgcgacc ccgatacgat ggcccatacc aagtcagcga gtgtttgaat 5160 gatggacgca actttcgtct gcgactcaat gacgacgaca agtcacatcc aatttttcac 5220 atttcgaagc tgaaacgcta ccggtggagg gaggacgagc aaggaactgg ggcccaaaag 5280 taagttcctc ccaaattgta tgcaccgccg gacgtactac gtcaagtttc aaaaacatta 5340 cctcggccac ttgtgtgagc acccccttgg acgtctgctt ccttcacagg atcatcaaca 5400 aggttgcaag catagtggtg ttcacgatag atattagatg cattacaaga ctcaagatca 5460 cgactttact tactgactta aatgataatg acgaatgaag atcaagaatt caagatgcaa 5520 gactcacgga ctcactcaat tttttattct ttttttagtt ctcttttttt ttctttctct 5580 gtttctgttt cttttttttt ttctcaatta ctcaactcaa atattgaatt tcttttctta 5640 aggacgtgtt atggtcacta tttaagggcg agttatgcca cttggttttt tttgtttgtt 5700 gttttgttgc ttggttcagt tttttttttt agaaggggag gga 5743 // ID HOBS_I repbase; DNA; FNG; 5139 BP. XX AC DQ370139; XX DT 09-MAR-2006 (Rel. 11.02, Created) DT 13-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Ustilago maydis retrotransposon HobS LTR retrotransposon DE (internal portion). XX KW Copia; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; HOBS_LTR; internal portion; HOBS_I. XX OS Ustilago maydis OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina; OC Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Ustilago. XX RN [1] RP 1-5139 RA Kamper J., Kahmann R., Bolker M., Saville B.J., Banuett F., RA Kronstad J.W., Gold S.E., Perlin M.H., Woesten H.A.B. et al.; RT "Living in pretend harmony: the genome of the biotrophic fungus; RT Ustilago maydis."; RL Unpublished (2006). XX DR EMBL/GenBank/DDBJ; DQ370139; Positions 751 5889. 
XX FH Key Location/Qualifiers FT CDS 7..4497 FT /product="HOBS_I_1p" FT /translation="MSPINDQDLAFSSMRVGLPKRLMSSEDYFEWATSMQN FT VLSCKNANLWFIIEGRLVKPEEHLGEGDLKEVKLGNKFPTKDIAEYYRADV FT EARSILLNSLGPAQQALVDTSTTARKVWEKLRENYAQNVAQQIASLEAQLA FT NLYQGDDKINVYSYKLETICRKLDHVDAPVSGLRKLRTFLRGLGPQHDVWR FT KIFYFNTRLFFQKEGDSDETANKKALEDYEIAVSTIMAEEAEQKSFRRQYP FT ARAMQAQSQPVKKGKDKFCTNCKRDNHNLEDCFMEGGPKHKDRTEKQKQKK FT TKKVTGNLAQVDSDEMNLCLHVSTPDDNVAPQNETWIIDSGASRHMTGDKT FT LFSTYGPSPVQEVFVADNRGVPVAGMGNVRLVMSNSKGSRKSITLQDVLHV FT PGLGNNLFSTPQVQRLGGSINFTKKTVEIFDKKGRLALRGKRRGDVNYLLV FT EGTTTAVAKLVTSEKALDQAKLWHQRLGHLHMQATLKTASLTDGMNLKAMS FT GPSVGNNCETCIKSKHRHAPIKSRGPKTTRPLELVHMDLAGPLPEGLSKEK FT YYLLMVDDCTRYCFGAALIYKSSAFQAFRTIDRWTQTQLGKRICRVRTDNG FT GEFLSREFSNYLNHRGIGREVTPRFTPQSNGLVERTNQIVKDYIRCMLEEA FT NLTTQYWPFAFSHGLKLRNMSATSTDSSKTPHEGMHGKRQDLQGLRVFGCK FT AWARVPDELRKSLDPKSVECIHLGHVSNNHPYIYRLMDVETGQIFTSRHVI FT FRENERIRRKSEAPFEELSDDETGTTGNNLPRPGLPAPVRSSLNIPRTSPS FT SEEPSQPVGATNYPHLASIEEAQLADTTESGDSLESPTQQLVPSAESTDDE FT FHEPINLIPSRRRPQDIRGRDSPSRHQEIDDESDDSLALLPTPHQTVGGDT FT TTVGDTRTVGDKNNANESTGDESRYNLRARPHRLGDYARHVTTNLSKPPAT FT LKQARMRADWPLWEGAIQAELKSHESNKTWTLVDHPQNKATNVVSCKWVFA FT IKKKADGSLDKYKARLVARGFTQRYGYDYDETFSPVVKATTLRILIGLAAA FT FDWKIVHWDAVTAFLNGRLSAEVYMTMPPGHEVPGKVCFLNKAIYGLKQAG FT REWYLFATKVLEQLGFTKLQEDHCLFHSKKAGRQILLALYVDDLVAASPKA FT SELAWLHTEIQTHFKITDQGDLSSVLNVSVSKSTNSTSLGQPGYIQKILDR FT FQMLEAKPAFTPLPATGIAHPENPEHCSVADKELFQQLVGSVNYLACYTRP FT DVAYAVQALSRYLAQPTIHALSAGKHLLRYLKTTQDYRLRFPKLASGRNLT FT LEVFTDADFANQKAIYSPNQELTTKNKIVIPVDTTNTPRKSVTGMIFLMNG FT SPISWLSKQQPIIATSTQMAEYIAAAEGAKEALWIRSLFHSLQLRGKEAIP FT HYIDNQAAIQLCKNPVLHKATKHIDIIYHKIRELAAVGVINIEYTESGEQR FT ADALTKTLNRQQIEKFCKEIGLKDRSNEKSSQ" XX SQ Sequence 5139 BP; 1493 A; 1489 C; 1189 G; 968 T; 0 other; aaggttatga gcccaatcaa tgatcaagac cttgccttct cctctatgag ggttggacta 60 cccaaacggc tgatgtcctc ggaggactac ttcgaatggg cgacctcgat gcaaaatgtc 120 ctttcgtgca aaaacgcgaa cttatggttc atcatcgagg gacgactcgt caaacccgag 180 gaacatctgg gtgagggaga 
ccttaaggaa gtgaagctcg gcaacaaatt ccctactaag 240 gatatcgccg agtactatcg agctgacgtg gaagcacgca gcatactcct caactccctt 300 ggacccgccc agcaagccct ggtcgatacg tcaactactg ctcgaaaagt ctgggagaaa 360 cttcgcgaga actacgcgca gaacgttgct cagcagattg cttcactcga agcgcagttg 420 gcgaacctct accaaggaga tgacaagatc aacgtctact cctacaagtt ggagactatc 480 tgcaggaaac ttgaccacgt ggatgctccg gttagcggac ttcgcaaact gagaaccttt 540 ctgcgtggcc tcggtcccca gcacgatgtc tggcgaaaga tcttctactt caacactcgt 600 ctcttcttcc agaaggaagg agattccgac gaaacagcca acaagaaggc cctggaagac 660 tacgaaattg ccgtcagcac gattatggct gaagaagccg agcaaaagtc tttccggcgc 720 caatacccag ctcgggccat gcaagcccag tcgcagcctg tcaaaaaggg caaagacaaa 780 ttctgcacta actgcaagag ggataaccat aacctcgaag actgcttcat ggaaggaggt 840 cccaaacaca aggatcgcac cgagaagcaa aagcagaaga aaaccaagaa agtgacggga 900 aacctggcac aagtcgactc cgacgagatg aatctgtgtc tccacgtgtc cacacccgat 960 gacaacgtcg ctccccaaaa tgaaacctgg atcattgact ctggcgcaag ccgacacatg 1020 accggcgaca agaccctctt ctcgacgtac ggaccatctc ctgtccagga ggtattcgtc 1080 gctgacaaca gaggagttcc ggttgctggc atgggcaacg tcagactggt gatgagtaac 1140 tcgaagggat cgcggaagtc gattacactt caagatgtcc tccacgttcc aggacttggg 1200 aacaatctct tctctacacc acaggtgcaa cgcctcggtg gaagcatcaa cttcaccaag 1260 aaaacggtcg aaatcttcga caaaaagggt cgactggccc tgaggggcaa acgccgtgga 1320 gatgtgaact acctgctagt ggaaggaact accaccgcag tagccaaact cgttactagc 1380 gagaaagcct tggaccaagc caaactctgg caccaacggc taggccacct ccacatgcag 1440 gccacgctca agacagcatc gcttaccgac ggcatgaacc taaaggctat gtcgggcccc 1500 tctgtcggga acaactgcga aacctgcatc aagtcgaagc ataggcacgc accgatcaag 1560 agccgtggac ccaagacaac tcgaccactc gagctagttc acatggatct tgcggggcct 1620 ctgccggaag gcttatccaa agagaaatat tacctactca tggtggacga ttgcacgaga 1680 tactgcttcg gggcggcact gatctataag tcatcagcct tccaagcatt ccgaaccatc 1740 gatcgatgga cccaaacgca actgggaaag cgtatctgcc gagtccgaac ggacaatggt 1800 ggtgaattct tgagcagaga gttctcgaac tacctcaatc accgaggtat aggacgagag 1860 gtcactccaa gattcacacc acaatccaac ggtctcgtgg 
agcgcactaa ccagattgtc 1920 aaggattata ttcggtgcat gctagaagaa gcaaacttga ctacccagta ctggccattc 1980 gccttcagtc acgggctgaa gcttcgaaac atgtcggcca ccagcacgga ttcttccaag 2040 acacctcatg aaggaatgca tggcaaacgc caggatcttc aaggcctccg ggtctttggg 2100 tgtaaagcat gggcacgcgt acctgacgaa ctccgcaaat ccctggatcc caagtctgtt 2160 gaatgcatac acctgggaca tgttagcaat aaccaccctt acatatacag actcatggac 2220 gtagaaaccg gccagatctt cacaagtcgg cacgtcatct tccgggagaa cgagcggatt 2280 cggcgaaaat ctgaagcccc cttcgaggaa ctttcagatg atgaaactgg aaccacggga 2340 aacaatctcc cgaggccagg attaccggcc ccggtccgct catcgctgaa catcccaagg 2400 acttccccat cgtccgaaga acccagtcaa ccagtgggag ccaccaacta ccctcatctc 2460 gcctcaatag aggaagcgca attagcagat actaccgaga gcggtgattc actggagagt 2520 ccaactcaac aacttgttcc ttcagcagaa tccacagatg acgaattcca tgaaccgatc 2580 aaccttattc cctcaagaag gcgacctcaa gacatccgag gccgggactc cccatcgcgg 2640 catcaagaaa tagatgacga gtccgatgac tctctcgctc ttctcccgac tcctcaccaa 2700 acagtgggag gagatacaac aacagtggga gatacaagaa cagtgggaga caaaaacaac 2760 gcaaacgaga gcacgggtga cgagtcacgt tacaacctta gggcaaggcc ccacagactt 2820 ggcgattacg cccggcacgt gacaacaaat ctttcaaagc caccggccac cctgaaacaa 2880 gctcgcatgc gcgccgactg gcccctatgg gaaggtgcca tccaggcaga actcaaaagt 2940 catgaatcca acaagacttg gaccctcgta gaccacccac aaaacaaagc caccaatgtg 3000 gtcagctgca aatgggtatt cgccatcaag aaaaaggccg atggatctct cgacaaatac 3060 aaggcacgac tcgtcgcacg aggcttcaca cagcgatacg gatacgacta cgacgaaacc 3120 ttctctccag tagtaaaggc caccacgtta cgcatcctca tcggcctcgc cgcggcattc 3180 gactggaaga ttgttcattg ggacgcagtc actgctttct taaacggacg cctatcggca 3240 gaagtataca tgacgatgcc tcccggtcac gaagtacccg ggaaggtctg cttcctgaac 3300 aaagccatct acggtctcaa gcaagccggc cgcgaatggt acctcttcgc tacgaaggta 3360 ctcgaacagc tcggattcac gaaactgcaa gaagaccact gcctctttca ttccaagaag 3420 gccggacgac agatcttact tgcattatac gtcgatgacc tcgtcgctgc gtcacccaaa 3480 gcatcggaac tcgcctggct ccacaccgaa atccaaactc attttaaaat cacagatcag 3540 ggcgatttat cttctgtgct caacgtcagc gtgtcgaaat ctaccaattc 
cacttcccta 3600 ggccaacctg gttacatcca aaagatcctc gatcgtttcc agatgctcga agcgaaaccc 3660 gccttcacac cattacccgc cactggcatt gctcatcccg aaaaccccga gcactgctcc 3720 gtcgcggaca aggaactctt ccaacagctc gtaggctctg tcaactacct ggcttgctac 3780 actcgaccag atgtggcata cgcggtacaa gctctcagcc gctatttagc tcaaccgacg 3840 atccacgcac tctctgctgg aaaacacctt ctccgttacc tcaagactac gcaggactat 3900 cgcctccggt tccccaaact agcgagtggg aggaatctga ccctagaggt cttcacggac 3960 gctgattttg caaaccagaa ggcgatttac tctcccaatc aagagttaac caccaaaaac 4020 aagatcgtca taccggtcga cacaacaaac acccctcgca aaagcgtcac aggaatgatc 4080 ttcctaatga acggttcccc aatcagctgg ctatccaaac aacaacctat catcgcaacc 4140 tcaacacaaa tggctgaata tatcgcggcc gcggaaggcg cgaaagaagc gttgtggatc 4200 agaagcctgt ttcattccct tcaacttcga ggaaaagaag caatacctca ctacattgac 4260 aaccaggcag cgatccagct atgcaagaac ccggtacttc acaaggctac aaagcacatc 4320 gacatcatct accacaagat acgcgaattg gccgccgtcg gtgttatcaa catcgaatat 4380 accgagtcag gggagcaacg agcggatgcg ctcacaaaga cgctcaaccg tcagcagatc 4440 gaaaagttct gcaaggagat tggcctgaaa gacaggtcca acgagaaatc ctctcaataa 4500 actccatgac ggaaacatct gcctttcaca accttcaatt cgccctcgct gaacgcgtgc 4560 gaaaactagg cagatcgata caacagaaag cctgttggcc gcagctttcg acctgatggg 4620 tttggatcat ccaaagaagg tcccatcaag gttcaaggac ttgcaaaagg actggaaacc 4680 caactgacac agtggtatcc acaaaagtca ccatgcacac ttcctgattc tccctccact 4740 ggaggataac gagaacgaat acatgatgta ttcagggaac aaaacctcac gcgtcgccaa 4800 aaacgagctg tgaaagttcc caaatcacga gcatcacgta ggtttcccag gaccatggag 4860 cagtcaagta gtcaacgaaa gctcaccgaa cgaagcttca acgagggccc acctagagga 4920 aagattcgct ggaaagagcc gcagcgcacc tcaaatgccc cagacactta cacttcgatc 4980 cgtaagagga cacggcgcac acaccttcca aacttggacg gaaagtttta cgccgcacac 5040 atcaagacag agagcaacat ggcgcgatcc ccaccgagtg ggagcgtcag tgtctctgcg 5100 attcatgatt tgggcactca aacaccattc gagtgggag 5139 // ID TCA1_LTR repbase; DNA; FNG; 388 BP. XX AC AF043301; XX DT 05-AUG-2005 (Rel. 10.08, Created) DT 05-AUG-2005 (Rel. 
10.08, Last updated, Version 1) XX DE LTR-retrotransposon from Candida albicans (ltr). XX KW LTR Retrotransposon; Transposable Element; TCA1_LTR. XX OS Candida albicans OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; mitosporic Saccharomycetales; OC Candida. XX RN [1] RP 1-388 RA Chen J.Y. and Fonzi W.A.; RT "A temperature-regulated, retrotransposon-like element from RT Candida albicans."; RL J Bacteriol 174(17), 5624-5632 (1992). XX DR Genbank; AF043301; Positions 5232 5619. XX CC LTRs are identical suggesting a relatively new insertion. CC However, no obvious ORF could be found indicating that this is a CC non-autonomous element. XX SQ Sequence 388 BP; 120 A; 70 C; 79 G; 119 T; 0 other; tgttcgctat agagagattt cctagccgga atgcacgaca atcctgagac ggaagtcgat 60 cgtcgatgcc catggtgcgt ggtgaaaaat tttcttagaa aatttgttct ttccttcaac 120 tgcttttaag aaagagaggt tcaagtggtt taagtacgac ggtcacaaag attgcggctt 180 atgaggcccg aactgagttg aaatacaaaa tcaagatata attatatacc ttacttgtcc 240 atattgtttt ataatacatt cttcagatat ttaaatttct gtgtatcaac ctataaaaca 300 gagatacatt cagtgcattt agtatactga gtgaactggt acctgtgaca ttcaagataa 360 ctgtttcgcg cacgctggca gacgaaca 388 // ID Mariner-3_AN repbase; DNA; FNG; 1848 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 8.11, Last updated, Version 1) XX DE DNA transposon, Mariner superfamily, Pogo clade - a consensus DE sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW mariner superfamily; Mariner-3_AN; Mariner-3a_AN; Pogo clade; KW transposase. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-1848 RA Kapitonov V.V. and Jurka J.; RT "Mariner-3_AN, a family of DNA transposons in the Aspergillus RT nidulans genome."; RL Repbase Reports 3(11), 196-196 (2003). XX DR [1] (Consensus) XX CC DNA transposon. Mariner superfamily. Pogo clade. 
CC The consensus sequence was reconstructed based on multiple CC alignment of 3 CC copies that are ~98% identical to each other. CC Mariner-3_AN elements are characterized by TA target-site CC duplications and CC 13-bp TIRs. CC The 525-aa Mariner-3_AN transposase is encoded by a single CC ORF (pos. 185-1759). CC The Mariner-3_AN transposase is most similar to the CENP-B CC homologue CC protein 2 (CBHP-2) from Schizosaccharomyces pombe. Most likely, CC CBHP-2 is CC also a transposase rather than a protein involved in a mitotic CC function CC (Irelan, J.T. et al., Genetics 157:1191-1203,2001). XX FH Key Location/Qualifiers FT CDS 185..1759 FT /product="Mariner-3_ANp" FT /translation="MAPPKKLSDVQRKALRDWVHSQSPRPTQKACIAWFQS FT RYNHRLSQSTVSDILSQQYQYLDSECNPSSATRKGIGQWQDLEAILYEWHH FT ILDCKGAYITGDILIEKARQIWSSLPQYRDQPLPAFSSGWLHRFKTRYNIK FT QRTYHGEAGSVQEEAEKEMKAIHIFAGKYNEEDIYNMDETGLFWRMPPLQS FT LSSINRPGIRKDKSRISIICCVNASGSDRLPLWVIGNARTPRALRNINISA FT IGIRWQWNKKAWMNQIIMREWLLDFYQHIGQRSVLLAMDNLPAHLSGLELA FT PPPPNVRICWLPKNSTSRFQPLDQGIIQNLKIYYRKQWLRYMLSYYERNLD FT PLQSVTILDCIRWLVRAWHHDVQSSTILACFYKSTLVQDPIELPVEAPDLR FT PLYTQVQQSGRLSDCMDISFFLNPAEESPEPISSGNEISSDALLEQLIAEA FT SGNADIYPNNLDDDSGEPAPLPKPQDALDAVRLLISYMEGQDTSKTPILRS FT LERLERDIEGEIITAKAQGTLDSWLSNAR" XX SQ Sequence 1848 BP; 518 A; 435 C; 392 G; 503 T; 0 other; cagtgcggcc ccgccaagcc gatcctcgtt tagccgataa tctcgcttag ccgatatttt 60 tttgtgggat gaattccatc ctatataatc atacctcgct ttactatgta accccgatcc 120 ccgatatatc gagaaccata ttgtctatct gaaatctcat tcttattgga tacttcatca 180 atatatggct ccaccaaaga aactttctga cgtccagcgg aaggctctaa gagattgggt 240 tcatagccag tctcctcgtc caacacagaa ggcctgtata gcatggtttc aatctcgcta 300 taatcaccgc ttgagccagt ctactgtctc tgatatcctc agtcaacaat atcaatacct 360 tgactctgaa tgcaatccat cctcagcaac ccgcaagggc attggccagt ggcaagacct 420 tgaagctatc ctttatgaat ggcatcatat acttgattgc aaaggggcat atatcactgg 480 cgatatcctt attgaaaaag cacgtcaaat ctggagttct ctgcctcaat atcgtgatca 540 gcccctacct gcatttagta gtggttggct acatcgattc 
aaaacacgct ataatatcaa 600 gcagcggaca taccacgggg aagctggctc agtacaagaa gaggctgaga aagagatgaa 660 ggcaatacat atatttgctg gcaaatataa tgaggaggat atttataata tggatgaaac 720 tgggcttttc tggcgtatgc cgcctttaca gagtctatct tccattaata ggccaggaat 780 caggaaggat aagagtcgga tatctataat atgctgtgtt aatgcctccg gatctgatcg 840 attaccactc tgggtaattg gaaatgcacg tacgccacga gctcttcgca atatcaatat 900 ctcagcaatc gggattcggt ggcaatggaa caaaaaagcc tggatgaacc aaattatcat 960 gcgagaatgg ctcctggact tctatcaaca tattggccag cgatcagtcc ttcttgcaat 1020 ggacaacctc cctgcacatc tttctggcct agagctggca ccaccacctc ccaatgtacg 1080 catctgctgg ctcccaaaga attcaacaag ccggttccaa cctcttgatc aggggattat 1140 ccagaacctg aagatctatt atcggaaaca gtggttaaga tatatgcttt cttactatga 1200 aaggaacctg gatccgctgc aatctgtaac aattctagat tgcatacgat ggcttgtacg 1260 ggcctggcat catgatgtcc aaagctcaac tatcctagcc tgcttttata agagcacgct 1320 agtccaggat cctatagagc ttccagttga agcacctgat ctaaggccac tttatacgca 1380 ggtacagcaa tctggtaggc tatcagactg catggatatc tccttctttc tcaaccctgc 1440 agaagagtct ccagagccaa ttagctctgg gaatgagata tcctcagatg cattacttga 1500 gcaactaatt gctgaggctt ctggaaatgc agatatatat cctaataatc tggatgatga 1560 ttcaggcgag ccagcccctc ttccaaagcc tcaggatgct cttgatgctg tacgacttct 1620 aatctcttat atggagggtc aggatacgtc caaaacacct attcttagat ctcttgagcg 1680 gttagagcga gatatagagg gtgaaattat cacggcgaag gctcagggta ccttagatag 1740 ttggcttagt aatgctagat aatgacaaaa acttcatctt ggcgataacc tcgtttaggc 1800 gatatttttt gctgggatga cttgtatcga ctaaacgggg ccgcactg 1848 // ID LTR12_CN repbase; DNA; FNG; 146 BP. XX AC . XX DT 30-MAR-2005 (Rel. 10.03, Created) DT 30-MAR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans LTR - consensus. XX KW LTR Retrotransposon; Transposable Element; Interspersed repeat; KW LTR12_CN. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. 
XX RN [1] RP 1-146 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-146 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-146 RA Gentles A. and Jurka J.; RT "C. neoformans LTR sequence LTR12_CN."; RL Direct Submission to Repbase Update (15-MAR-2005). XX DR [3] (Consensus) XX CC Average similarity to consensus is 90%. XX SQ Sequence 146 BP; 32 A; 22 C; 40 G; 52 T; 0 other; tgttggatga gcgagaagat gggagcctgg gatacggagt tccgagagag atcgtttgtg 60 gcgtgggagt tgcgtggctg atttttcttt tagattctat aatagtttct ttcttttctt 120 tatgcattca tagacacctt acaaca 146 // ID Copia-47_MLP-I repbase; DNA; FNG; 4680 BP. XX AC AECX01001103; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-47_MLP_; KW Copia-47_MLP-LTR; Copia-47_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4680 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001103; Positions 338572 343251. XX CC Positions [2093-2596] - Integrase core CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 407..4663 FT /product="Copia-47_MLP-I_1p" FT /translation="MSSNANVSHSSLLLAQIPKLNHTNKVDWVLGIKTYLK FT SRKLWKHVERDTSLSETDAKDFDKVEQLEAERASVLEVIRCTVEPTRLPAI FT WGIEDPKKAYELLHAQASQDDGLEVAALIAKVATVRFNGSETISTFLDGIS FT DLHTQIAEATASDDDLKISNKLLAVFLLLSFPSDQFATIRDQMFGDMKNLT FT TSKVVSRLRTKSALSTVDENPIAMAASIKHPPAPAHRLTIPRTDKSPNAPC FT VLQEHWQFTHTNGVCMKQRRNRGPTRPMNSSQPSSNQGLTDSEKIRRYNQL FT AAVGVVGFNTEASSQPTQATTPTPSTAAPVPNHTSEAACQFATSYNTIATP FT VIDEVSFVASPSYPSPSDTHKATLADTACNRHMFGDALMLDDLRDVAPVWI FT NVANADNSSRIVATKMGTAKLHAFGPDGSPSLVEIPNVLYSPSLPANLISV FT TELYESGFKMVDPHYGTNTNDLNMYFSNSKHILPAYKDAGPGGFWKFYHYS FT EPRASFVKSSNPNNTDLWHQRFGHLNHRSTSSVMEDVLRLNPDALKTCEAC FT TLGKQARSSHTGALPRSEIPGYRVHCDLAGPFPAASTGGYLYTMVLIDDAT FT RKNWVILLKTKPQAFEAFKQFHSMLSNHAPNNKISVFKTDRGGEFTSNEFN FT QYLRENGIVHEMGPPESPEQNSVVERFNRTAASRLRAQLIHGNLPVRLWGE FT VMMATSFVLNLCPSKSVKFNCPEYAWQTLALNMKVPNLPYNRLRVIGCLAY FT SIPPGHQNKLIPRSIKTIMVGYEKNSNAYRLWDPKSNRIMISNDVIFDEGS FT FPLRHLDKSTTEELTILNDDSWDEVWDIPTSSQSTTDAIHDFLQIPTPPAI FT PPPVPRRSERGAAPVERLGNLIGYHTVAELNQASISQTDDGPENDEPSYSQ FT AMKGPNREEWLAAMAEEFMSLQIHEVGRLVEPPPDADILPGMWRLKQKRDE FT FSRITKYKARWVAGGNHQIKGVDFDSTYASVGLTDTLRTLYALAATDDLEM FT EQFDIETAFLNGKIKHCVYVCQVTGFRDTSKPKHVVELDRSLYGTCQAHRE FT FNDDFDTKLKSMGFKVCPVDNSLYTLRDGDLFIHIPMHVDDGMAFSNSKSF FT LKQFRTNLSQHYKFRWNENPSLHLGIHIKRDRSQRTITLDQSHYCDAMLER FT FGMSQCNGVKTPLPQNIRLTTPSLEESEEIEEYRCAVGMLNFLSVQTRPDI FT AYAVSYLSRFNSRHNQTHWAAVKHLLRYVRRTSSFDLMFGTSKAKNLPVEG FT YADADYAGDIDTRRSTTGFVYFVKGSLVSWKSRRQHCVTLSTTEAEYLAIG FT DCAKHGLWLCRLLEHLHQQSSISVPIKLPLSNDNQGAVFLCNEASVNNKSK FT HIDIRHHFIRELTREGKIMVSHVSTKEMPADVLTKAVGANILSSSYDQLGL FT SQIKK" XX SQ Sequence 4680 BP; 1344 A; 1140 C; 990 G; 1206 T; 0 other; ataggttatg agcccaggct tacagcttta aacgattctc aatactcaac gatagtagat 60 taaagttata aatacgatct cttggcacga gaaaccgtct aatttaaact tgaaatactt 120 ttaatcgaac ctggtattgt tccagattct tccgataatt gaattaggca ttcgaatctt 180 tcatttcgaa gtccggagaa ccataatacc tcaattcaat tgtcaaacct tagtgataca 240 gtcttaaccc cacgaaaact 
atatcactct ggatccgaca gaacctacag atcccagtca 300 atctgaccaa cccaccctcc caaactcaac tgtcgatcct gaaccaaaca atccgtcgat 360 ctctacccca catacagagt cgaaggaagc cgtgacgcct cctgtcatgt cttccaacgc 420 caacgtttcg cactcttcac ttctactagc acaaataccg aagctcaacc acaccaacaa 480 ggttgattgg gtacttggaa tcaaaacgta cctgaagagt cgcaaacttt ggaaacacgt 540 agaacgcgac acctccctat ctgagaccga tgcgaaagat ttcgataagg tcgagcagct 600 cgaagctgaa cgcgcgtctg ttctagaagt cattagatgc acggttgagc ctactcgctt 660 acctgctatt tggggaatcg aagaccccaa gaaagcatat gagcttttac atgcccaagc 720 ttctcaagac gacggcttgg aagtggctgc actcatagcc aaagttgcta ccgtacgctt 780 taacggttcc gaaacgattt ccaccttcct tgatgggatc agtgatctac atacacaaat 840 cgcagaggcc actgcaagtg atgatgatct gaagattagc aacaagcttt tagctgtttt 900 ccttcttcta agttttccaa gcgatcaatt tgcaacgatt cgagatcaaa tgtttggaga 960 tatgaagaat ctcacaacat cgaaagtcgt atcccgtcta cgaacgaagt ctgctctcag 1020 cacagttgat gagaacccaa tagccatggc tgcttcgatc aagcatccgc ctgctccagc 1080 tcatcgtctg acaatccctc gtaccgataa atcccccaat gcaccttgtg ttctacaaga 1140 gcactggcag ttcacgcaca ctaacggtgt atgcatgaag cagaggcgaa accgtggtcc 1200 aactcgacca atgaatagta gtcagccatc atccaaccaa gggcttactg attcagagaa 1260 gatacgaagg tataatcagc tcgcagctgt tggagttgtc gggtttaata ccgaagcttc 1320 atctcaaccg actcaggcga ctactccgac accgagtact gcagcccctg ttccgaatca 1380 taccagtgaa gcggcatgcc agtttgcgac atcctacaat acaattgcga caccggtaat 1440 cgacgaggtc tcattcgttg cttcaccaag ctatccatct ccatctgaca ctcacaaagc 1500 cacgctagcc gacactgctt gcaacaggca catgtttgga gacgcgctga tgcttgatga 1560 tctacgtgat gttgctccag tatggattaa tgtggcaaat gccgataact cgtcccgcat 1620 tgttgccacc aagatgggta cggcaaagct tcacgctttt ggaccggatg gatcgccttc 1680 tcttgttgaa atacccaacg ttctttactc accgtctctt cctgcaaacc tcatttcggt 1740 cactgagcta tacgaatctg ggttcaagat ggttgatcca cactatggca caaacaccaa 1800 tgatctcaac atgtatttct cgaactcgaa gcacatcctt ccagcttaca aggatgccgg 1860 tcccggtgga ttctggaagt tctatcatta ctccgaacca cgtgcgtctt tcgtcaagtc 1920 atcaaatcct aacaatactg acttatggca tcaacggttt 
ggtcatctaa atcacaggag 1980 tacctcaagt gtcatggagg atgtcctacg acttaatccg gatgcattga agacttgcga 2040 agcctgcacc ttggggaaac aggcgaggag cagccatact ggcgctttgc cgaggtctga 2100 aatccctggt tatcgggtgc attgtgatct cgcaggtcca tttcctgctg caagtactgg 2160 aggttatctt tacactatgg tacttataga tgatgctact cgcaaaaatt gggttatact 2220 tcttaaaaca aaacctcagg cgtttgaggc ctttaaacaa tttcattcaa tgctttcaaa 2280 tcatgctcct aataataaaa tttctgtgtt caaaaccgac cgaggtggag aatttacaag 2340 taatgaattc aatcaatatc ttcgtgaaaa tggcatagtg catgaaatgg gcccacctga 2400 gagtcctgaa caaaactctg tagtggagcg tttcaacagg actgctgctt cacgcctacg 2460 agctcagctt atacacggca atctgcctgt tcgtctttgg ggtgaggtaa tgatggcgac 2520 atcatttgtt ctcaacctgt gtccctccaa atctgtaaag ttcaactgtc ccgaatatgc 2580 gtggcagacg cttgcgctca acatgaaagt gcctaacctc ccgtacaacc gactaagggt 2640 cattggttgc ttagcgtact ccattccacc tggtcatcaa aacaaactga ttccacgttc 2700 aataaagacg ataatggtag ggtacgagaa gaattcaaac gcgtatcggc tctgggatcc 2760 gaaatccaat cgcatcatga tttccaatga cgtgatattt gatgaaggtt cttttcccct 2820 tcggcatctt gacaagtcga caacggaaga actgacgatt ctaaatgatg attcatggga 2880 tgaagtctgg gatattccta cctcatcaca atcgaccact gacgcaattc atgactttct 2940 gcaaatccct actcctcctg caataccacc acctgtacca cgacgatcag agcgaggagc 3000 agccccagtt gaacgacttg ggaatctgat aggctatcac accgtcgcgg aactcaatca 3060 agcttcgatt agccagactg acgatggtcc agaaaatgat gaaccatcat actctcaagc 3120 aatgaaaggt ccaaatcgag aagaatggtt agcagcaatg gcagaggagt ttatgtcctt 3180 acaaattcat gaagtaggtc gccttgtgga acctcctcca gatgctgaca tcttacccgg 3240 catgtggcga ctcaaacaaa agagggatga attctcacga atcactaagt acaaggctcg 3300 ttgggtagct ggcggcaatc atcaaatcaa gggcgttgat ttcgactcca catatgcttc 3360 agttggcctg accgacacct tacgaaccct atacgctctt gctgctacgg acgatctaga 3420 aatggagcag tttgatatcg agacggcttt tctgaacggg aagattaagc attgtgttta 3480 tgtttgtcaa gtaacgggat tcagagatac atcaaagcct aaacatgtag tagaactcga 3540 cagatctctt tacggtactt gtcaagcgca tcgagagttc aatgatgact ttgacaccaa 3600 actcaaaagc atgggattca aagtctgtcc tgttgacaac tccttgtaca 
ctctgaggga 3660 tggagactta ttcatacata tacctatgca cgttgacgac ggcatggctt tttcaaacag 3720 caagtctttc ttaaagcagt ttcgaaccaa tctcagccaa cactacaaat tccgttggaa 3780 cgaaaacccg tctctccacc tcggcataca catcaaacgt gaccggtctc aacgaaccat 3840 aactctagat cagtctcatt actgtgacgc gatgctcgaa cgctttggta tgtctcaatg 3900 caatggagtc aagacaccat tgccccaaaa catcagacta acaactccgt ctcttgagga 3960 atcagaggaa atagaggagt accggtgtgc cgtcggaatg ctcaattttc tttcagtgca 4020 gaccagaccg gacatagcct atgcggtcag ttatctttca cgtttcaatt cccgccacaa 4080 ccaaacgcat tgggcagcag tcaaacatct actacgctac gtgcggcgaa cgtcatcttt 4140 cgacctaatg tttgggacga gcaaggctaa gaacttacca gttgagggat atgctgatgc 4200 ggattacgct ggggacattg acacgaggag gtcaactacg ggttttgtgt attttgtgaa 4260 gggatctttg gtatcatgga aaagtcgtcg acagcattgt gtaactcttt caactactga 4320 ggcagaatat ttagctatag gagattgtgc aaagcatgga ttatggttat gccgattact 4380 cgaacattta catcagcagt caagtatcag tgttcccatc aaacttccac tatcaaatga 4440 taatcaaggg gcggtgtttt tgtgcaatga agcttcggtc aataataaat caaaacatat 4500 agatataaga catcacttta taagagaatt gacgcgtgaa ggaaaaatta tggtttcaca 4560 tgtttccaca aaggaaatgc ctgctgatgt gctgacgaag gctgtaggag ctaacatttt 4620 atcaagtagt tatgatcagt taggtttatc tcaaatcaaa aagtagtgag cagggggggc 4680 // ID Gypsy-48_MLP-I repbase; DNA; FNG; 5594 BP. XX AC AECX01001259; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-48_MLP_; KW Gypsy-48_MLP-LTR; Gypsy-48_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5594 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001259; Positions 39953 45546. 
XX CC 'GAAGG' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 314..1405 FT /product="Gypsy-48_MLP-I_2p" FT /translation="MATIDDINRLAQQMTELNTRLAEETARRNKLSAQLTE FT ETTRREQAEARLAQLEANRPTTTTTSPAPTPTVPPPAPAPTTTTTPHMIKV FT ATPDKFEGTRGALAEAYASQVGIYIAMNSALFRTDSSKVMFAISYLTGEAI FT KWAQPFLQRILNPTTENEVTYNEFSHAFESVFFDSDWQKRAETALRALKQT FT QSAAEYTIKFNQLAPATKWELPTLISHYRQGLKSNVRIPMIRDNFDTLEDI FT TKLACAIDNDLRGEYTEPAVSSRPSDPNAMDILSARFDISSDEYRRRATDN FT LCFKCGKSGHCARWCKVGNGNRDGFRGRGGGKGKLAELEAKIAAMEGGLNQ FT LTTRDSGSSAADLSKNGAARE" FT CDS 2096..5494 FT /product="Gypsy-48_MLP-I_3p" FT /translation="MTNHSPLLTKCRLLRKPPQIALRPGRGTLGNMARGLF FT PLERLSPREVSLAELVPCPLVQLTSFYCPTTARLQLSATTSWNVSAKLAAD FT DAKGRPEKPVEELVPTHYHRYLNMFRKKNSMTLPPHRRYDFRVDLVPGATP FT QACKMIPLSPAEEKALDVMIDEGLAKGTIRRTTSPWAAPVLFTGKKDGNLR FT PCFDYRKLNALTVKNKYPLPLTMELVDSLRDAEDYTSLDMRNGYNNLRVFE FT GHEKRLAFICKRGQFEPLVMPFGPTGAPGYFQFFISDIFRDRIGKDLAVYL FT DDLLIYTPAGVDHEKVVTEVLEVLQAQNIWLKPEKCKFSRKEIDYLGLLIS FT KNKVRMDPLKVSAVTDWPAPKNVSQIQRFLGFANFYRRFIDGFSRIARPLH FT DLTCADVPFVWSEAQQKAFDILKVSFTSAPVLKIANPYKAFTLECDCSDYA FT LGAVLSQLDDKGVLHPVAFLSRSLVQAERNYEIFDKELLAVVASFKEWRHY FT LEGNPNRLEVTVYTDHKNLETFMTTKQLTRRQARWAEALGCFDFHIKFRPG FT HKSTNPDALSRRPYLEPSAGEKLSFGSLLRPENLSEDSFRADLDSIEAWFT FT DETIEHDDVETWFEQDISQPTHEAVEIDALDKSNDSPIWTDDQIMERIREV FT SNDDLRIQGLMQALDGPTVKELDAALKEYEVRDGVLYHNGLVEVPDDKKCK FT YEILRSRHDSVLAGHSGRAKTLSLVQRQYRWKSMKTYVNQFVDGCASCARI FT KPSSMAPFGALEPLPIPAGPWTDISYDMITNLPLSNGMNCILTVVDRLTKM FT GHFIPCTTEMDAGELATLMLSNVWKLHGAPKTIVSDRGSVFIAKLTESLNK FT QLGIALHPLTAFHPQSDGQSEIVNKAVEQYLRHFVSYRQDDWVEHLPLAEF FT AYNNGTHSATGYSPFKANVGYDLSFGRIPTRERCIPEVEERLKNIEDIQNE FT LKENLTRAQEAMKKTYDEGRRPTPDWNEGDEVWLNSKNISTTRPSAKLDHR FT WLGPFSIEKKISTSAYRLRLPATMSRIHPVFHVSVLRKCSPDAIEERIQVE FT PQPIEIDGEDEWEVEAVLDKRLRRGKAEYLISWKGCNCSEDSWEPEVNVIN FT AKELIDNFNLRFPKAVEEHQRKRCM" XX SQ Sequence 5594 BP; 1693 A; 1379 C; 1327 G; 1195 T; 0 other; cattgtagca tctaacccga aacagaccga gaggacatca agaagaaaag aagaaaagaa 60 
gttaattaaa gaattgaaat taagaaatat agtgacaaag tttaaaacga agaaaaataa 120 agttaaagtt aagaagaaga aagtttatta aagttaagaa gaagaaagtt tattaaagtt 180 aaagtaaatc aaatccacat tctgatctac caccaatccc accacaacca cgccgtcctt 240 tcaaactccc cccaaggacg acgctgaatc tctcaggagc gagtcctggc acaaagtcgt 300 aaccggtaac ccaatggcaa ccatcgacga tatcaatcgc ctagcccaac agatgaccga 360 attaaacact cggttagctg aggaaactgc acgccggaac aaactgtccg cgcaactcac 420 ggaagagact actcgacgtg agcaagctga agcccgcctg gcccaacttg aagccaaccg 480 accaaccacc accaccacgt caccggcacc aacaccgaca gtcccaccac ctgcaccagc 540 cccaaccacc accacgaccc cgcatatgat caaagtcgct accccggaca aatttgaagg 600 cactcgcgga gctctcgcag aagcttacgc tagtcaggtt ggaatttaca tcgcaatgaa 660 ctcggcacta tttcgtaccg acagctcaaa ggtcatgttt gctatatcgt acctaacagg 720 ggaagccatc aaatgggctc aacctttctt gcagcgtatt ctaaacccaa ctactgagaa 780 tgaagtcacc tataatgaat tttcccacgc ctttgaatct gttttctttg actctgattg 840 gcaaaaacga gccgagactg cccttcgtgc gttgaagcaa acccaatcag ctgccgagta 900 tacaatcaaa ttcaatcaat tggcgcctgc aactaaatgg gaactaccga cattaatcag 960 ccattatcga caaggtctaa agagcaatgt gcgcatacca atgatccgcg acaacttcga 1020 caccctagag gatatcacaa aactcgcatg tgctattgat aatgatctac gaggagaata 1080 taccgagccc gccgtgtcat ctcgcccgtc agacccaaat gccatggata tcttgtcggc 1140 acgctttgac atatcatcag acgagtatcg acggagagca acggataatt tatgcttcaa 1200 gtgtgggaag tctggtcatt gtgctcggtg gtgcaaggtt ggcaatggta atcgtgatgg 1260 gtttagagga agaggtggtg gaaaggggaa gcttgcagag ttagaggcga agattgcagc 1320 tatggaaggt ggtttaaatc agttgacgac tagagattcg ggtagttcgg cggcagatct 1380 gtcaaaaaat ggcgccgctc gagagtgacg gacgtgccac cctcgggcct gtgtaaggag 1440 gaattattag cggaatcaga tgctgtgatg ttaaatgcaa agacaaacct agaccctcgt 1500 gtgtttacat ccatatcact ttcccaagcc tcctgtgcca cgtcccccaa cccagacagt 1560 accccagccc gtgcgttgat cgattgcggt tcaactcacg tggtattggg caaaaaattt 1620 gccgattgca ccggactccc tctgaccagc ttaaagaaag caggagaggt gtacggcttt 1680 gatggagccc cgagaacctt cgcccacgat gccgacctgt tcattgacga cgacgagaca 1740 aagacaagat ttttggtcac 
ccaaatcaag gactcctacg acgccattct cggtatgcca 1800 tggctacgcg agaatggaca ccgtattgac tggaagaacg gtatattaag acctaaagac 1860 tcacctggcc acatcgcagc catcaattgg gcttcgtcca acccgccaac caccccagat 1920 cgccttgagg cccggaaggg gaacactagg gaaagtgaag agggggccat cagcatgact 1980 gatattacgc ccccgcgatg tgagtccaac attgcacagt cacccttgtt tcgtgaaaca 2040 gttgacaagc agatacacaa cttacctttc ataggacaga ccacgaccaa cccagatgac 2100 caaccactcg ccactgctga ccaagtgtcg cctactccga aaaccacccc agatagcctt 2160 gaggcccgga aggggaacac taggaaatat ggcgaggggg ctctttccgt tggagagatt 2220 aagcccccgc gaggtgagtc tagcagaatt agtaccttgc cccttggtcc agttgacaag 2280 cttctattgt ccgacaacag cacgactaca gctgtccgca acaacgtcat ggaacgtgtc 2340 cgccaaacta gctgccgatg acgccaaagg ccgacccgaa aagccagttg aggaattagt 2400 accaacccac tatcaccgct atctcaacat gtttcgaaag aaaaattcga tgaccttacc 2460 tccacaccgc agatatgact tccgcgtcga tctcgtacca ggagcaaccc cgcaggcttg 2520 caaaatgatc ccactgtcac ctgcggagga gaaagcactc gacgtaatga tcgatgaagg 2580 cctggctaaa ggtacaatcc gacgtaccac ttccccgtgg gcggcacccg tcttgttcac 2640 tggcaaaaaa gacggcaatc tccgtccctg cttcgactac cgaaaattga acgcgttaac 2700 tgtgaagaat aagtacccct taccattgac aatggaattg gtggatagct tacgtgatgc 2760 agaggactat acgagcctgg acatgcgaaa cggttataac aacctccgag tattcgaagg 2820 ccatgagaaa agactagcgt tcatctgcaa aagaggacag tttgaaccat tagtgatgcc 2880 ctttggccct actggagcgc cgggttattt ccaatttttc atctcagata tatttcgtga 2940 tagaatagga aaggacttag ccgtgtattt agatgactta ctgatatata caccagcagg 3000 agtggatcat gaaaaggtag tgacagaggt acttgaggtc ctacaggcac aaaatatatg 3060 gttaaaaccc gaaaaatgca agttttcgag gaaggagatt gactaccttg gcttactgat 3120 ttcaaagaat aaagtccgca tggaccctct taaggtctcg gcagtgactg actggccagc 3180 ccctaagaac gtctcgcaaa ttcaacggtt cctcggattc gccaatttct acaggcggtt 3240 catcgacggc ttctcacgca tcgctcgacc attgcacgac ttgacctgtg ccgatgtccc 3300 attcgtatgg agtgaagccc agcagaaggc attcgatatc ctaaaagtgt ccttcacatc 3360 cgccccagta ctgaaaattg cgaacccgta caaagccttc acgctggagt gtgactgctc 3420 cgactatgcc ctaggcgcag tactatccca 
actcgacgac aaaggtgttt tgcaccctgt 3480 agcgttctta tcacggtctc tcgtccaagc ggaaagaaac tacgaaatat tcgacaagga 3540 gttattagcg gtggtggctt ccttcaagga gtggcggcac tacctagagg gcaatcccaa 3600 tagactcgaa gtgaccgtct acactgatca caagaactta gagacattta tgactacaaa 3660 acaacttaca cgaagacaag cccgttgggc ggaagcgtta ggttgttttg acttccatat 3720 taagtttaga ccaggccaca aatccacaaa tcccgacgcc ctttcacgac gaccgtattt 3780 agagccgtcg gcaggggaga aactgtcatt tggtagctta cttagacctg aaaacttgtc 3840 tgaagattcc tttcgagcgg acctagatag catagaagcg tggttcacgg acgaaacgat 3900 tgaacatgat gatgtagaga catggtttga acaagatatc agccaaccaa cccacgaagc 3960 tgttgaaatc gacgcgttgg acaaatcaaa cgactcacca atttggaccg atgatcaaat 4020 tatggaacgg atccgagaag tttcaaatga cgatctacga attcaagggt tgatgcaagc 4080 actggacgga ccaacagtaa aggaactgga tgccgctctg aaggagtatg aagtgcgaga 4140 cggagtctta tatcataacg gcttagtgga agtaccagat gacaagaaat gtaaatacga 4200 gatcctacgg agccgacatg atagtgtatt agctggacat tcaggaagag ctaagacact 4260 aagcctagtt cagcggcaat atcggtggaa atcgatgaag acttacgtta atcaatttgt 4320 agacggatgt gcgtcgtgtg ctcgcatcaa accatcgagc atggcaccat tcggagcgtt 4380 ggagccccta ccaatccccg ctggcccctg gaccgatatc agctacgaca tgattaccaa 4440 cttaccactt tcgaacggca tgaattgcat actaactgtt gttgaccgac ttacaaagat 4500 gggacacttc attccctgca caacagaaat ggacgcagga gaattagcaa ctttgatgct 4560 gtctaacgta tggaaactgc atggagcccc aaagacaatt gtttcggacc gaggtagtgt 4620 attcattgct aaacttaccg agtctttgaa caaacaatta ggtattgcac tccacccgtt 4680 gacggcgttc cacccccaat ccgacgggca gtctgagatt gtcaacaaag cagtcgagca 4740 atatctacgt catttcgtga gctacaggca ggacgattgg gtcgaacatt taccattagc 4800 tgagtttgcg tacaataatg gaacacattc agcgacggga tactcacctt ttaaagccaa 4860 tgtagggtac gacttatcct ttggaagaat cccaacaaga gaaagatgca tccccgaagt 4920 ggaagaacga ctgaagaaca tagaggacat tcaaaatgaa ctgaaggaaa atttaactcg 4980 agcacaggaa gcaatgaaaa agacttacga cgaaggaaga agaccgacac cagactggaa 5040 tgaaggcgac gaggtttggc ttaacagtaa aaacatctcg acgacacgcc cgtcggcaaa 5100 actcgaccac cgctggctgg gccccttcag cattgaaaag 
aagatatcca cgtccgccta 5160 cagattacgg ttaccggcaa caatgagtag aattcacccc gtctttcacg tgtctgtact 5220 aagaaaatgc tcaccagatg ctattgagga gcgcatacag gtagaaccac aaccgatcga 5280 aattgacggg gaggacgaat gggaagtcga ggccgtatta gacaaaagac tgaggcgcgg 5340 gaaggctgag tacttgatta gttggaaagg ctgcaattgt tcggaagact catgggagcc 5400 ggaggtcaac gtcattaacg caaaagagtt gatcgacaat tttaatttaa ggtttccgaa 5460 ggcagtggaa gaacaccaaa gaaaacggtg tatgtgagag gggtgaagct ttttcccacc 5520 gggtttttta atgcaagccc cggggaagaa cgcagggcca agaacaggga gcctgggcgt 5580 aaaagggggg atag 5594 // ID Gypsy-22_MLP-LTR repbase; DNA; FNG; 410 BP. XX AC AECX01000122; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-22_MLP_; KW Gypsy-22_MLP-I; Gypsy-22_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-410 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000122; Positions 184939 184530. XX SQ Sequence 410 BP; 114 A; 92 C; 61 G; 143 T; 0 other; tgtaatgata tatacatcat cacagacata cacataatat aatacgtaga cctagacgta 60 gacttagact taacatattt tcacttactc tctgtgtgcg cacgatcacg gactggctcc 120 ttagggaact caagactgat cttgattcta ttacctttgg agccaggtta gcagttttca 180 ttaccttatc tccttttata ttattctttc cttttatata atactttgta tatccctata 240 gagggaactc aagactgatc ttgattctat tacctttgga gccatatttc tagcaataat 300 aaaaccatac ctactttggt tctagaaact tcaagtccac attgtggcat ctgatcctta 360 gttccctccg agagacttcc ttcgtagtga ggacattcca gtcctttaca 410 // ID copia-1-I_AN repbase; DNA; FNG; 4906 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 
8.11, Last updated, Version 1) XX DE Internal portion of copia-1_AN LTR retrotransposon - a consensus DE sequence. XX KW Copia; LTR Retrotransposon; Transposable Element; KW COPIA superfamily; copia-1-I_AN; copia-1-LTR_AN; KW internal portion. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-4906 RA Kapitonov V.V. and Jurka J.; RT "copia-1_AN, a family of copia LTR retrotransposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(11), 197-197 (2003). XX DR [1] (Consensus) XX CC Despite its young age (98% identical LTRs, Copia-1-LTR_AN) ORFs CC are CC damaged by multiple stop codons (possibly by RIP). XX SQ Sequence 4906 BP; 1872 A; 862 C; 871 G; 1301 T; 0 other; gttatgagcc cggcattgct ctagggactg ggatacagaa ttattacctc tttttgtcat 60 atcgaagata tctgcagccc cgtaccgagc tatacagaca agctaggagg tggttgaaca 120 ttacccaaag gacactggca acccggagcc tactccttcg aatatacaac gagttccgct 180 gagatcggaa tatcttgacg acaaaaatgg cgattgcata taataaagac agcgccacga 240 aggtccaagc catcctgagc tcgcgggatg actggcgcgc atggtttcaa gtcattaaag 300 accatgcaaa caagcaagaa gtatgggaat attttgatcc tgatgctgat aacaaaacaa 360 ggcctgagct accgaccaaa ccgacaattc tatcagcaaa taaggccttg gaaaatatat 420 actaatatta tcaaataatg ttgcaggaat ggaaggatac gcgaaataca atctaatcaa 480 taaataaagc tatttacagc ttagtagctc tacagataag agagcttata gcagataaag 540 aaatatataa aatattgtaa atactgaaac agtaatacac cccaagtgat taagaggcag 600 atttcaaagc tttgaagaac tataacaatg tcagatctta atcaatttga caaggaaaga 660 tttctgcatg gcttaataag tttgataata cttatttagc aattaaacac tgcaatctcc 720 tagaaagtaa taacaagtat ataaaacaac agtttctggc agctatatta ccagtttcct 780 attcctttgc agatagacag gcagagatga taaacaacct gatatataag aaggagaact 840 tctatatact cctaggacaa tatcagactt acctggctaa tactaaaggc ttcaagacaa 900 caacttccag aatagtattt gcaatacttc atagccaaag caaacccaat aacaacagat 960 actggagctt atatatatat agaaagaact atatatacag cagttgctgg tatataatac 
1020 acttaaaaca actaaaataa tagaaaccta acaaagagat taaaagcaag attagcagag 1080 ccattacaaa agataataag gttagaagaa agatcaaaac cttgatagaa aataataagc 1140 aataaaataa gggcaataag cagaaagaaa gcaacaaacc agagaagtat tatattataa 1200 tacttgtttt tagtacagaa attagcagct tattaaagaa ttactttatc ctagactctg 1260 gagctttaat acatgtttgc aataatatct caaggtttga aaactataat ctatcagcaa 1320 ccagaattct ttgtactggt aatacaataa taaggataca aggcactggg agtatcaaga 1380 tctgtccaaa ttatagtaga gaatcaggaa atattattat taccttgact aatatagcct 1440 atgtgccagg tctttatacc agcattatta gagctagaag acttaaacaa gccagatata 1500 gctaggattt tgacaataat attattaaga aagaaaataa tattatcttc aagatcagag 1560 attacctatc aggtctctgg gttgtagagc aacaaagcaa taatatatat acttttacaa 1620 ctatagataa ttaacaatca gcaaaactac tacttctaaa aggaaatata gatatttggc 1680 attaaaggat ggcttatact tatattaata ctctgcagca tcttcctgaa gcagttacag 1740 gtatcaagat taataatctg gatataaacc atgattctaa gcaaccatgt aaagattaca 1800 agttggcaaa tatgcctcag caaatctctc agagactaat aataactact actgctctgc 1860 tagagcaagt atacttcaac cttatcaaaa tacaacctgg tctgaacagg gataaataga 1920 tcacctattt ctataataaa gccacaagaa tatattttat ctttacctac taataaaaga 1980 gcaaatatgt caacactgta cagaaattta taaactagat gaagaatctt ttcagcctca 2040 ctatcaaata tctgaaaagt aataataaga aaatactagg gtatcagatt acaatattta 2100 aagcagcaga aggcattgct tactagtact tagtagttac aatactaaat cagaatagag 2160 ctgcagaata atcaggatat cttcttatta caaaggcaag gcaggttcta ataggtgcct 2220 gcttactgca agacctttgg ccctggatta ttgctttagt tgcaaatatt atcaacagaa 2280 cacctacaag agctatagga tagaaaatac cataggagat gcttacagga aagaagccaa 2340 atctagccaa tttctacctg attggttgta tatcttatac atatcagcag taagaaaaag 2400 gtgtaaaaat agctccaaga gcttaccaag gtattctagt aggctatata gcctctaata 2460 tatggtttat atggaaccta aagaagcagc gagttgagac agcaagagat atcaagtttg 2520 ataaaacaag gctatacaat ctatcagatc cttttataga ggatgagtta agtattagct 2580 tacctatact accagtggag atatagatat tacctggcag ggctaacagg gaagcaatta 2640 atatagatat taagctctcc gaatatataa tacctagtac aatacaagca gaatctccca 2700 
ataagcagag cattagaaat gacgaaaaat tgactgagat aggagaagct ggtgttgagg 2760 aaccacaaaa ggaagataca gagaaatata ggtattctgc cagaaaaact atactgtctt 2820 tgcctctgct attacctata cctgatagaa tactgaactc aggcaattct ggcttgaatt 2880 agatgccagg agcctttaca gaagcagaag aatagtcaga atctgcagaa caagaaagag 2940 aggacttact aagctgatat ctccttgaat tgctctaaga agatcagaat caggcaagca 3000 gagagactgc aaatctccct aataccttag agggtacaga taatcttgat ttgcaagcct 3060 tatcaagatc agggggtagg agatcaggaa gacagtatct ggacaggctt gatacagcaa 3120 atataataga aggaagatgg cagacttgtc cttgaaaaga tcaagcagat taccaggcct 3180 atacagccat cgaagcacta tttatcaaag agccagaaag ggtcaaatat atatttacta 3240 tagcactaaa tactgcagaa gagagcaaga aataatagca ttgcagcaag cttcctgaac 3300 tactgaagaa ctggtttaat ttactatagt atctgctgaa gaatgaattt atagcagcag 3360 cgcgcctaga aattagagga ttagagataa aagaagtatt tatacctgtt aaacaagagg 3420 aggcagcagg gaagcagatc ctgctactca aataggtatt tatatacaag tttaacaagg 3480 atagttactt tacaaaggca aaggcatgta tctgtataag gggagatctt gaaaaggatt 3540 atactgctaa taactatgct ataactactt tagcaagaat atttagagta gtcatagctt 3600 taatagcagc ctttgacctg gatacagacc agaaagatac tattaatata taccttaatt 3660 ccttacttga tacgcctatc tatatacaaa taccagatag cttcaaagta tctggaaaga 3720 tatagaaggt tagaaaagta ctatataggc tcaggaagtc tggacagctt tggcagcacg 3780 acctgaaaat aatacttata aagtttggac ttctactagt tctagaagaa gaatgcctct 3840 ttttaaataa gtatataatt attctggtat atattgataa tattattatt attaacctac 3900 caacaccagc agccagagag gcagctaagt acttcaaaga taaactggca aagcaataca 3960 agcttcagca tataggagaa gtaggatggt tcttaggggt aagagtactc agagataggc 4020 ctaacaagaa gctctggctc tgccaggact cctatattaa gcatcttgca gcatattttt 4080 acctaaacag ccttgcaaaa tggccagaaa tactattagc aagcttatat aacctctctc 4140 caaataaata ccaagcaaca gaagctcaga tcaaagaata ttaaataaaa attagctcta 4200 tacagtaccc tgcagttcta acacaagcag atattatata tactgttagc catttatctg 4260 aagccttaac aaatccttta ccagactata ttcaggctgc aaacagggtt attatatacc 4320 tatatataat atattttctg gcaattaaat actcaggcaa caatactggc aaggaagttg 4380 ttatattagc 
ctctaataca gtatttgctg actaacctga ccagcaaagc tctgcaggat 4440 acctttgcaa gctatataat agcctgatta aatagaaatc tagaaaacaa tatacagtca 4500 caactttaat aactaaagct gaatatctag ctttatctaa tacagcaaag gcaatacact 4560 ggtagaaaag ggtattcaga accatgggtt ttgatccgca acatcagata gcagtatact 4620 gtgacaacca acagaccatt aacttgctta cctctaaaaa tatcaagcaa caatctaagc 4680 tctgttatgt caatatatat agatcctggc tttgtcaaga ggttcaggaa ggccgactct 4740 atgttaaata gatacctacc aaccagatga ttgcggatgg ccttaccaag agcctcataa 4800 tctaaaaata ccaagaattt gtcaggatgc tcaacctagt agatgtgaaa gacctcatcc 4860 aatcccaaga aggaaactga aggactctta atgcactgag ggggta 4906 // ID Gypsy-88_MLP-I repbase; DNA; FNG; 5491 BP. XX AC AECX01000283; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-88_MLP_; KW Gypsy-88_MLP-LTR; Gypsy-88_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5491 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000283; Positions 80988 86478. XX CC Positions [4171-4650] - Integrase core CC 'CTACA' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 453..3266 FT /product="Gypsy-88_MLP-I_2p" FT /translation="MGNTLLEETTSTNDLLRAILQTQQAGIQAANESALRI FT SKLEEERIIAAKESAQKIALLEEKLLVLQLNESSPRRTVEVEDATSRGVDL FT TKFKSSDGPAFKGPYRDTESFLKWFTALKIFFLVKKIEKDEDRIILTGSYL FT QETNLQSFWAHNIENFKNITWKEFKASLFRAALPSDWNDKLREKILLLRMS FT YNEDFKSYSTRARSVQSLMNHDNLILNDYTLATHLVLGIIPELKSSIRLHE FT ALVEEGFDYNKFEERCEVLYEDLIVKRVILKRAGPSSYRSAPTNPTFHHST FT KTSPPVTAEDEEARKRAFIWKIKLYLDLAGLCHYCKKQCGSEYKQCRGTFS FT KERVNFPPGYTAPPKPDNYTPSRARLASSPAGRPTQPPAGRPVRVNAAAVT FT FPEYDAVTVAAYEEIDCELKEIQELGEEELLQPGEGGYVDQPISRPVVILQ FT LTCNGKPIRALADAGSETNLIADRVVDTLGLRRRKLLKPTIVGLALDSGSQ FT PPMLTEFTTASLKHLSSDMTFDHTYMKIGQLGQSYDIILGAPFLSRHELSV FT SCAKKAVISERSFVLLYDFREELKLKDKIEESKLALVAASDEKRESDRWKL FT WEKKLTQKAEPLRRNAPSSWEEWEETYFSQYHELFPVDIPAISDEAEEQGE FT FTDGSFPNKMQDPSSTVRHKIILTDPNAVVNERQYPYPHRHMKAWRKLLDQ FT HIAAGRLRKSSSQYASPSLIIPKKDPLELPRWVCDYRTLNSMTVRDRSPLP FT KVDELVRLVSSGKIFSLLDQTNAFFQTRMREADIPLTAVKTPWGLYEWCVM FT PMGLTNAPATHQSRLEEALGELINTVCVVYLDDIVVFSNSLEEHKKHLTLV FT LERLKKANLYCSRKKTKMFRREVKFLGHVVSTDGVRADEEKVEQIRSWESP FT KSTKGVRKSWYCTMDEEICTWPRKIRGETHTPDQ" FT CDS 3094..4947 FT /product="Gypsy-88_MLP-I_1p" FT /translation="MWSQQTEYERMKRKLNKYVLGSPRNQRKEFENLGTVQ FT WMKKFVHGLEKYVGKLTHLTSSKLDKKNFKWGDMEEAAFQNIKRIMTTLPC FT LKNVDFDSTDPLWLFTDASGTGLGAALFQGKEWKGASPIAYESRQTTTAER FT NYPVHEQELLAVIHALQKWKMLLLGMKVNVMSDHHSLTYLLKQRSLSRRQA FT RWLKQLADFDLQFQYIKGEDNSVADALSRKDSEDFQAGVEMVAALALSQTS FT LSDDFRREVIRGYKTDKFCTAVRDGAPLRDDCYIDEGLIFIDGRLLIPNVE FT SLRLNLIEEAHKRVGHLGVLKTVSSLRLDFFWPRMSKDVEFRLKSCEICQK FT TKARTSLPNGHMRTPNVPSEPLSNIAIDFIGPLPKINNYDMLLTVTCRLSG FT FTRLIATNQADTAEKTASRFFGAWMGTFGVPQLIISDRDKAWSSKFWKALV FT ERLNFSFHRSSAYHPQADGRSERTNKTVGQVLRTFTAKRQTKWLEALPSVE FT FAINSATNVATGLSPFEAVFGRPAKLFPSCGSIDDSPPSLETWLRQREGTW FT AQIRDNLWTSRVQQALQHNKRHVDLKVEPDGWVLLDSGDWRGRHSGGVDKL FT KERYEGPYKVLDTFNNGQSV" XX SQ Sequence 5491 BP; 1613 A; 1323 C; 1228 G; 1327 T; 0 other; cttctttttt ctcgaaccaa tcccgaatcc cgtcaacctt gtggttccaa tacaatcgca 60 aaacaacccc gcatatgacg 
accaatcccg attcccgtac aacccgtagc agcttaaacc 120 atacactcac cggcatagtt gataatccag aagatatcat aagatgaaca aactccgcgc 180 cgacttatca ccgacgttca attgaagaag aaggacgata cgaagaaaat aaggttttgt 240 ttgcattcat tcctaagcct atacctgcaa ccaaaaagcc atacaatcca aacccacggc 300 ttcctactac cccagtacaa caagtaccgg caagtttccc gcgatcacca tctcagactc 360 gtgtctccca aatcctgttt gaagagaaac gacttctaca ggatcgatct tctaagccat 420 ctcaatcatc ttctctgcgc tcctctacaa caatgggcaa cactttgttg gaagagacta 480 cctcgaccaa cgacctcctt cgtgctatcc tgcagactca acaagcaggg attcaagcgg 540 ctaatgaatc ggccctacga atctcaaagc tagaagaaga gagaatcata gcggccaaag 600 aatcggcaca gaagattgcg ttgctagaag agaaactcct cgtattacaa ctaaatgaat 660 cctcaccgag acgaacagtc gaggtggaag acgccacgag tcgtggcgta gacttgacaa 720 aatttaagtc atcggacgga ccagccttca aggggccata ccgggacacc gaatcctttc 780 tcaagtggtt cacagcacta aagatcttct ttcttgtgaa gaagatcgag aaagatgaag 840 atagaatcat actcacgggt tcttacctgc aagagaccaa tttacaatcc ttttgggctc 900 ataacatcga aaacttcaag aatatcactt ggaaagagtt caaagccagc ctattcagag 960 cggcactacc atccgattgg aatgataaac tgagagagaa gattttatta ctcaggatgt 1020 cttacaatga agattttaaa tcatatagca ccagagcgcg atcggttcag tcactcatga 1080 atcacgacaa cttgatctta aacgactaca ccttggcaac tcacctagtc ttagggatta 1140 tacccgaact taaatcgtct atacgcttac atgaggctct agtggaggaa ggatttgatt 1200 acaataaatt tgaagaacgt tgcgaagtct tatatgaaga cttgatcgtc aagagagtta 1260 tcttgaaacg agcgggacct tcttcatacc gttctgcacc aaccaatccc actttccatc 1320 attcaaccaa gacatcgcca ccggtgacgg cagaggatga agaagctagg aagcgtgcgt 1380 tcatttggaa gatcaagttg tatctcgatt tggcaggctt gtgtcactac tgcaagaaac 1440 agtgcggcag tgaatacaag caatgcagag gtactttctc taaagagcga gtcaactttc 1500 caccgggcta cacagcacca cccaaaccgg acaactacac tccatctcgc gctcgattgg 1560 catcctcccc tgcgggtcga ccaacacaac ccccagctgg cagaccagtt cgagtcaacg 1620 cagcggcggt aaccttccct gagtatgacg cggtcactgt agcggcatat gaagagattg 1680 actgtgagct caaggaaata caagagttgg gagaggagga gctacttcaa cccggagaag 1740 gagggtacgt ggaccaacct atctcgcgac cagtggtgat 
cctacagcta acctgtaacg 1800 gcaaacccat ccgcgccctg gctgatgcag gatcggagac gaacctgata gcggacagag 1860 tggttgatac cttgggacta cgtcgaagga aactactgaa accgacgatt gtaggactcg 1920 cactcgattc aggcagccaa ccacccatgc tcacggagtt cacaacagca tccctcaagc 1980 acttatcgtc ggacatgact tttgatcata catacatgaa gatcggacaa ttaggtcaat 2040 catatgatat aattctgggt gctcctttcc tttctcgtca cgagttatca gtatcatgtg 2100 caaagaaggc tgttattagt gaaagatcct ttgttttgtt gtatgatttt agagaagaat 2160 tgaaattgaa agataagatt gaggagtcaa aactggcctt ggtagccgca tcagatgaga 2220 aacgtgagag tgatcgttgg aaattgtggg agaaaaaatt gacacagaaa gctgaaccct 2280 tgagaaggaa tgccccaagt tcgtgggagg aatgggagga aacttacttc tcacaatacc 2340 atgaactatt ccctgtagat attccggcaa tctcggatga agctgaagag caaggtgagt 2400 tcaccgatgg atccttcccc aacaaaatgc aagatccatc atccacggta cgacacaaga 2460 tcatcttaac ggacccaaac gcggtagtaa acgagcgcca atacccttac ccgcaccgtc 2520 acatgaaagc ttggagaaaa ctactagacc aacacatagc agcagggagg ctccggaaat 2580 cttcaagcca atacgcatct cccagtctga tcattccaaa gaaggaccct ttggaacttc 2640 caaggtgggt atgtgattac cgcaccttga atagcatgac cgtcagggac cgttcaccgc 2700 ttcctaaggt ggacgagttg gtcaggctag tatcgtcagg aaaaattttc tcattattag 2760 accaaactaa tgccttcttc caaactcgaa tgagagaagc tgacatccct ctcaccgctg 2820 tcaagacacc gtggggactt tacgagtggt gcgtgatgcc tatgggtctc acaaatgcac 2880 ccgcaacaca ccagagtcga ctcgaggagg cattagggga attgatcaac accgtatgcg 2940 ttgtatatct ggacgacatt gtagttttct cgaattcatt ggaagaacac aagaaacatt 3000 taactttggt tttggaaaga ctcaagaagg ctaatcttta ctgtagtaga aagaagacaa 3060 agatgttcag gcgtgaggtg aagttcttag gacatgtggt ctcaacagac ggagtacgag 3120 cggatgaaga gaaagttgaa caaatacgtt cttgggagtc cccgaaatca acgaaaggag 3180 ttcgaaaatc ttggtactgt acaatggatg aagaaatttg tacatggcct agaaaaatac 3240 gtggggaaac tcacacacct gaccagtagt aaactcgaca agaagaattt caagtgggga 3300 gatatggagg aagcagcatt tcaaaacatc aagaggatca tgacgacact cccatgcctg 3360 aagaacgtgg attttgattc aacggaccca ttatggctat tcaccgacgc gagtgggacg 3420 ggtttaggag cggcgttgtt tcaagggaaa gagtggaaag gggcgagccc 
aatagcttac 3480 gagtcgagac aaacgacaac ggccgagcgc aactacccgg tacacgagca agagctatta 3540 gcagttattc acgcattaca gaagtggaaa atgctcctct taggcatgaa ggtgaatgtg 3600 atgagcgacc atcactcgct gacctacctt ttaaagcagc gatcactcag taggcgccaa 3660 gctcgctggc tcaaacaatt agccgatttt gaccttcaat ttcaatacat caaaggggag 3720 gacaattcgg tggccgatgc gttgtcaagg aaagactctg aagactttca agcgggagtt 3780 gaaatggtag cggcattagc actctcacaa acctctctct cagatgactt tcgacgcgaa 3840 gttatacgtg gatacaagac tgacaaattc tgtactgctg tccgtgacgg agcgcctctt 3900 agagatgact gctatattga cgagggcctg attttcatcg acggaagact tctgattccc 3960 aacgttgaaa gcttacgctt gaacctcatc gaagaagcac acaaacgagt aggacattta 4020 ggagttctaa agaccgtatc aagccttcga ctagatttct tttggccacg aatgagcaaa 4080 gacgtagaat tccgcctgaa gtcttgtgaa atttgtcaga aaacgaaagc acggacgtca 4140 ttacccaatg gtcacatgcg cacacccaac gtaccatccg aacccttatc aaatattgcc 4200 atagatttca ttggaccttt accaaagatc aacaactatg acatgttact gacggtcact 4260 tgtcgacttt ctggcttcac gcgactgatt gcaactaatc aagccgacac agcggagaaa 4320 acggcgtcaa gattctttgg cgcttggatg ggcacttttg gagttcccca attgatcatc 4380 agcgacaggg acaaggcgtg gtcttcaaag ttttggaagg ctctggtaga acgcctcaac 4440 ttctcctttc accgatcgtc ggcgtaccac ccccaagctg atggcaggag tgaacggacc 4500 aacaagacag taggtcaagt attacgtact ttcaccgcca aacgtcagac aaaatggtta 4560 gaagccctac cctcggtaga gttcgcgatc aacagtgcca caaatgtagc gacaggtctc 4620 tctccctttg aggctgtgtt tggaagaccg gctaagcttt tcccttcctg cggttccatt 4680 gacgactctc caccctcact cgagacctgg cttcgtcaac gggagggtac ctgggctcag 4740 atacgcgata acttgtggac cagccgggta caacaagcat tacaacacaa caaacgtcat 4800 gttgatctga aggttgaacc agacggttgg gtattactcg attccggtga ttggagaggc 4860 cgccattcag gcggggtcga caagctcaaa gaacgatacg aaggaccata caaggtcctg 4920 gacacattca ataacggaca gagtgtttga attgatttac cagtgggcga caaacgacat 4980 aatgttttca acatctcgaa attgaagcct ttcgtggaga actcggggtt agggcctgag 5040 gcccaaaagt aagttctctc cttagtatgc accgccgggc tttctacgcc gtttgtcata 5100 aaatcattac cttggccaca ctgtgagcgt cgaacggggg ctccgcttgt ttcattcaca 
5160 gcaaacaatc acacaaagta ctcgaggcga cactggcgta cttcaccttt aatcaagttc 5220 acattattaa ctcataattt ttctcttctc ttacgtctct ttttttttct tctctttttt 5280 cttctttctt tttcttctta cttttcttct ttaattattt tacttatttc acgggggatt 5340 aagatcaaga tgggacacaa aaatttccaa catcgggtta ctgttccttg tttctcattc 5400 tcatttctct ttctcttttc tctttcaatt attactcatt ttcacggctc tcgcacctcg 5460 gcactcaagc cttttttata ggaggggaga a 5491 // ID Copia-13_MLP-LTR repbase; DNA; FNG; 469 BP. XX AC AECX01000951; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-13_MLP_; KW Copia-13_MLP-I; Copia-13_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-469 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000951; Positions 10780 11248. XX SQ Sequence 469 BP; 129 A; 99 C; 70 G; 171 T; 0 other; tgagtatgac cttctctcat gacagaggtt ccagatgtcg tgacatgtta ctggaaatga 60 atagtacata agccacaagt cttaatacaa cttttatact aaacatacta aactaaatct 120 caaaccctaa ttatcttcaa cttaaattgc gccactcctg ggacgcgtgt acactctctc 180 ttttcctcat ctgttcattc acggtatagg aaaagagtga gttactccac atttctcttc 240 tatagtttca tctctctatt ctttctcatt tcttcttatt tctttattac taattgaata 300 tgtgaccttt tcatctgatg acatgtcgac actagattca atactgagac catacgtgtc 360 gaattcatat tctatctttt catcttgact ataaaccttt aggttagtca ggtagttagg 420 acgagaggtg atttgagact cacctctaga tttgtcattt gggttatca 469 // ID Gypsy-57_MLP-LTR repbase; DNA; FNG; 1127 BP. XX AC AECX01001694; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-57_MLP_; KW Gypsy-57_MLP-I; Gypsy-57_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-1127 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001694; Positions 13433 12307. XX SQ Sequence 1127 BP; 344 A; 304 C; 212 G; 267 T; 0 other; tgtcagcacc cgagccttcc gatggaagag ccaaaggtac tccaagaaga gattgagcaa 60 taattaataa ataacataaa aatagataca acttaaggtc tagtttcaaa gaccccaaac 120 ccaataaaag cataatgaaa gaattacaac ttataggtct agtgtcaaag accgcgaacc 180 acatcacgaa ggaataccaa gtgtgaaacc cacaagactc caaggcccct agcacgagta 240 caagacataa gtcacccttc ccaaaaggga aggaggctga gatgaaatca acagccgtag 300 aagagacgac gagcctctcg acctaatgtc cttcaaacag atgaggcaac aggagccgtt 360 cctaccaacc aactaactaa cgtgggcgcc cacgaagcaa gttcaaacgc agcatatcag 420 caaccacaag ctggcagtcg agagaaaggg tataaatacg accttctcga caggaagaag 480 agaaggcatc tgattcaact tacttacttt cgtatagata tttctatccc gaaaacacct 540 gtcaagagcc caactatact cgttttctag aatctttaga atcacccgtt agttccactc 600 ttgatcctga agcctgagga aatcctttcc cctacgctta tctaatcagt tctctgcttc 660 atactgaagc cccttatctg tccgtattcc tgttagttcc cttggattat cgcctccgga 720 gcctctgctt cccagtcctt aatcctagcc aacagctcga agtccacgcc tggataaggc 780 tccatccttc tactagttag ttggaattct tctggtctat tgatcttgat ccgtcatcgc 840 tcagctgacg agatccctgt tcccaacagg ctaccttcta gacttgtcta gtaggatccg 900 aaccggctta ggtgtgacca accttcaacc gaaaacctcg gtaagccgct atctctccaa 960 aatcacgctg aaggattaga gtacttaccc gtctcgatat cccttgtgaa acatcttgta 1020 gtgttactac catacctatt cctcgtaacc agagaaatag cagtcagggc ctcgaaccta 1080 gtaagtctag accttacagg ctcatataga gacttgccac cctgaca 1127 // ID 
Polinton-1_GI repbase; DNA; FNG; 11954 BP. XX AC AC163889; XX DT 14-MAR-2006 (Rel. 11.02, Created) DT 14-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of autonomous Polinton DNA transposons - a fossilized DE genomic copy. XX KW Polinton; DNA transposon; Transposable Element; KW Interspersed repeat; Maverick; Tlr; Polinton-1_GI. XX OS Glomus intraradices OC Eukaryota; Fungi; Glomeromycota; Glomeromycetes; Glomerales; OC Glomeraceae; Glomus. XX RN [1] RP 1-11954 RA Kapitonov V.V. and Jurka J.; RT "Self-synthesizing DNA transposons in eukaryotes."; RL Proc. Natl. Acad. Sci. USA 103(12), 4540-4545 (2006). XX DR GenBank; AC163889; Positions 5506 17459. XX CC This transposon is characterized by 440-bp terminal inverted CC repeats and 6-bp target site duplication. It encodes family B DNA CC polymerase (POLB-1_GI, it is corrupted by two stop codons), CC retroviral integrase (INT-1_GI), and two unclassified proteins CC (PGI1-1_GI and PGI2-1_GI). PGI1-1_GI is similar to the B263R CC protein from African swine fever virus. 
XX FH Key Location/Qualifiers FT CDS 1948..1160 FT /product="PGI1-1_GIp" FT /translation="MLTFSELTPSLTVTEFKINVEGLDYTVLGHKLEPTKD FT VIAINSNFIHKAFEGYEQYLSKPKGERRCRRSTPDNPLNRRRRTGDGSTFN FT ACIEFVIILADTVHTIRYFPKSGSIQVFGSFDPVAIFLRYLSECSLPEFSS FT VELVGGNKALLHNYKFVINIGGDKFINLSTLAYALESIDGIREILPFPIKY FT IKNDAGDIHSKIAIIFTSKIRVHIWPKSGKVNIFGTKTELDADMVYKFLQK FT LFSNPNDFGDFICDAPIPDCER" FT CDS 5690..2775 FT /product="PGI2-1_GIp" FT /translation="MAAIPPDPITEILNNRTTIANSVAGIRIHLDRTTIAA FT HPHINHLFNTANDSLDAIIRYANELRNIIEDQDNMEDRLEGLLDDAHNREE FT YLRHELNNTRATILRTRRTYEDAYANKVRHRQHWEVLAQNTQIQLANIQAQ FT YANSQIQLANVQRERDESRRNAHRLLQRYNTETERSRRRANGIIQQARAWR FT GQFLNCRNHSQNLQNQVNNLNQQILALQNNPPVIQQPIMAGYAPKKFRGAS FT GEDPELWLQEFRQWCESAGLDPAANARTRVRIHGVFETLLEDDARDWYETH FT IKGKNWECVNLLDNTGVANLAAFNALNNGAIQAVAANQFRGGANVLHGQAA FT AVNTITGANFIPDHTVWDEDWSTAEGRPTDIAVNNPNANNGGTIVAPGIRI FT GQLIYRFKHYFPTITSEKSKLAFNAIVQGSDTVSRFYSKLRRMVRLAYPTL FT PEVNQNELVRQQFLNGLSSENKLDARRIGLENSVASILNKLEEVEKYRTDM FT PPTPIVTYQDPTLADIENLINSKIPMTTARSSAVPVPSSSPFSSPQNDTSF FT QRLLALAYKLGLPRDIDISKVTLSDLEIFIDTELNKNLPSDHIYRGNQVFG FT ISSGTRKPKKSKKCSSCGKSGHTKSSCPKSKKGKRKTNYAHDSSDSSNSSD FT ASDSSDSSDSDSDSRHACYGLKKKPTERKRVDKKPTDKKKSQLERQRIIFE FT VFIQLLKLLVQSFVNAVPKETVISAYNALNAKFINYKEPVLSQLKGEPSIK FT TREKVWDNVKDLFTSILHPMISVVTNSMAANLIRKNDDVICNNDLWSSIAG FT IGIVNRKSASDVVTIKTKVVAPESEKSLVIPTTIFDTGSDSSLISNNIVKR FT LNLDVDRSNAPDLSGVATKSDTIGTVYGLGILVYDGENSKKIEDDFMVVKS FT DKDFLLLGVPWIDRANVILDFQNRQLTIPLSSRKKITIPISLHKRKTNVTS FT LQMDTIDLKKIHTLEED" FT CDS 5969..7010 FT /product="INT-1_GIp" FT /translation="MYLPLPVPQIFIDPTFNKQIEQNYILQQIYYHPCGYY FT RTPKKLWAKVQAEGHDFNISDITEWLHKQAIWQIHSPIPKYVPQVSFNKIT FT RPNMYHQADILYMPYDTVDRKLYKYCLNIVDVASRYKASVPLEDRSSARVA FT KAFKKIYSKADCPLIWPRVLQVDGGSEFKDEVIRLMDEKGVRIRVGTTHKS FT QAIVERYNRTLAERLFKIQDAKELLTEGVNTAWVKNLSDIVNELNNSNTRL FT LGMSPVEAIQKDQVYALPSKIRKDRPVGAEEIRLPAGSLVKYLLDNSDYQG FT RRRATDPIWSTKVYTIESSTVVDGQPVIYRLSDGPKKIFTREELLVVPSNT FT MLPPX" FT CDS 10382..7011 FT /product="POLB-1_GIp" FT 
/translation="MPILPSTDINGTFNFALARIMQKVEDYVNYGSGWEFY FT RVEKIFIEISQFQPPTGAGHIPLPKDLATKKGVVNPANDDDKCFQWEILAA FT LHPVEKNAERISKYKEYVNELNFEGIEFPVQADEVILRRFERQNPTIALCI FT CEWRDHRLCPIYVTDRDDAEGRKIIDLVLISNGEKQHYCWIKNMSRLVNKR FT TKDGHATFVCRWCISHFTHQQEIHDKHVAICQGLKKTPQANRMPSVKKGND FT IYEFKNWKRRMQVPYYFVADFEALVMDIPPTDEDKDKKTKKVQEQIPCSYS FT YIKVRYDGVSESQKIFTGENAAQKFVIEMLNEAEAIRNEFRNPMEMMPLTT FT *EQVSYDNAINCWICRNPLDGNKVRDHCHITGRYRGAAHRGCNLDLSIKPR FT EMHIPVIFHNLSGYDGHIIMQGIGAMECEDDIDPIPYNMEKYMAFKLGSLR FT FIDSLQFMKSSLDKLASNLGAEKCRAQECSNPQHLWRIDAGRCFAHPENFK FT IT*SQIPPELLEIYLKKGVYPYEYMDDWKKFEKTSLPPKGAFYSKLNETHI FT SDKEYEYAQYVWEKAGCKTMQDYHDIYLKTDVLLLADIFQNFRKMALKKYG FT LDPLWYYSTPGFAWDALFLMTGQRLDLITDQDMYMMVEQGLRGGISMVSKR FT YARANNPGMGEGKWTTDKPKSSILYLDANNLYGWAMLQYLPTGNFHWIKEE FT NELFNIQRQIESNEIPDDSSEGYILKVKLEYPQALHSQHTDYPLAPERMKV FT KKEWLSKKQQEIIARSGQRYTPTDKLIPNLFDKDEYVVHYRNLQYYVSQGL FT VIKKVYEAIKFEQAPWMKPYIEFNTAERAKAKNDFEKDFYKLMNNSVFGKT FT MENLRKRVRVSVVQPQTHPKKYKKLTSDPAFKGRKIFSENLVAIHRRKVEV FT MLNRPTYVGMSVLDLSKLCMYQFYYDTLKVRYGEKIQLCYTDTDSLLIQIQ FT TEDINADLIDMADQFDFSDYPKDHPVRKALGDKTDINMKIPGKFKDECNGA FT VIAEFIGLRPKMYSILKVGDETTNPKHGIRKAKGVPSKVVKKEFHHERYNK FT ALFDPKHNDTVTFRAIRSDRHTINVIEMSKVGLSPMDDKNGLHKIISQCMH FT METIGYHNLKINIIYE" XX SQ Sequence 11954 BP; 3556 A; 2244 C; 2129 G; 4025 T; 0 other; agtagtatga aaaccccaat tctggctcac ttccaccata taatctaaaa tactactact 60 ttatataatc taattaccac catcatataa cttaattacc atcatataat ccacctacca 120 cttttaatat aatttaacta ccactttata taatctagtt accaccatca tataatttta 180 attaccacca tataatccac ctaccacttt tcatataatc tagttaccat catatagttt 240 tattaccacc atacaattct attaccacca cacaatctga aaaaattaac cgaacaagaa 300 ataggatttc aaagccaacg gagccaaaaa aggtaaggaa atattaaccc atccaggaaa 360 ccagccatgg agtcaataga gcggaatagc aaggcaagca attaccctta taccgagaaa 420 aaatgtaatt gaataaaaga ttcacatata aaagacttac aagatgtcac aatacagacc 480 attactaaga cgtatactag gccagtttga actcaacata gaagaatggg taaaggaaat 540 aatgtggtat ttagatatag catatttaaa agctttagat tccgaattta gcaaagaaag 600 
tatgttagta tctggattag aagagctagt agaaatattt gatgagatgt tagaaggtga 660 tgatatatgg atacatccat cacttttatc acgaataaga tattctattc taccagatga 720 gatggcaata tatgaaagat tttcagatat cattaagaac ttctcagagg ataaagaatt 780 agcatatgca ttattagact tagtagaaat attatcaaaa aagaaaccta caacattcaa 840 cattatagat tacaatatac aaagaatatt cacgaatcct aatgatgaat atagaaatga 900 attcagcatc taccatatta aaggagatga agatatggat gaagacacag atgaagacat 960 agatgaagta gagcagaatg aattcaatga atactatcag aatcttgcac cgatcaccga 1020 gagtcagctg gatagactaa aaaatgagtt ggctataact ctatgtgacg aatgtcttat 1080 gccaacaaga agtcaggaat gcgattgctg gattttacaa atagagggtt tttagttttc 1140 agtgaaacca ttttttttac ctctcacaat ctggaatagg tgcatcacaa ataaaatccc 1200 caaaatcatt cggattagaa aacaacttct gtaagaactt ataaaccata tcagcatcca 1260 attcagtctt agtgccaaat atattaacct taccagattt aggccagata tgtacccgaa 1320 tctttgaagt aaagattata gcaattttag aatgaatatc acctgcatca ttcttaatat 1380 actttatcgg aaagggaaga atctcacgaa taccatctat agattctaaa gcatatgcta 1440 aggtagaaag attaataaac ttatcacccc caatattaat cacaaattta taattgtgca 1500 ggagagcctt atttccaccc acaagttcta cagaagaaaa ttcaggtaag gaacattcag 1560 aaaggtaacg taagaagata gcaacaggat caaaagaccc gaaaacttgt atactaccag 1620 atttaggaaa ataacgaata gtatgtacag tatcagctaa gatgataaca aactcaatgc 1680 atgcattaaa agtgctacca tcccctgtac gtcgccgtct atttaatgga ttgtcaggtg 1740 tacttcgtct gcatcgcctc tcaccttttg gtttagacaa gtactgttca taaccctcaa 1800 aagctttatg aataaagttg ctattaatgg caataacatc cttagtaggt tcaagcttat 1860 gaccaagcac agtgtaatca agcccttcaa cattgatctt aaattcagta acagtaagag 1920 atggagtaag ttcactaaaa gtaagcatgt ctggtttttt ctattgatag aaaagaaatg 1980 ttcaaattat tttatcctaa gaaaaaatct aattaaataa aacttcttca tcaatatctc 2040 atatgagata ccatgacaaa caatagctta cctttcgaac ttttagaaaa gatattcaaa 2100 acttcatatg aaggtactta ctcccgttgg cattatcttt gcaattatgc attagtgtgt 2160 cgaaaatggg ctgtagtagc aaactctctc ctatggggtg aaacagacct ttataacctc 2220 tataatatta aggagttcca aatgtacaaa catctcacaa aaccagggac agtatgtgga 2280 aaatatatca 
gaaagcttaa tatggatgga gtattattat ggcctatctg tattgtaaaa 2340 atgctacaag cttgccccaa tatccaggaa cttagcataa tagattatca atactatggt 2400 ggaaaaggtg acgttagaga tttgctaagt gaaatcctgc atatattacc aaaccttcaa 2460 aagctagata tcagatatag tcgatatttt aaaaatagaa atgccattga aaaactcata 2520 gaaacccgta aagacctaga aatcagagta acatggcaaa tgtaatattt gtaaaggcta 2580 atcactgcca aaacatttgg gcactgccct gttactgccc gtcactgccg aaaaaatttg 2640 ggcactgccc tgtcactgcc cgccactacc catcattgcc gcagaagttt gggaaacaca 2700 tggatttttc ctagcaagat gacgctttaa cttagaggga aagtcaaaac ttgctccaca 2760 tagatgacag gttaatcttc ctcgagagtg tgaatttttt ttaagtcgat agtatccatc 2820 tgcaaggaag tcacgttagt tttccgttta tgtagagaaa ttggaatggt aattttttta 2880 cgtgatgata gagggatggt cagttgccga ttttggaaat ctagaatgac attagcacga 2940 tcaatccaag gtacacctag caataaaaag tccttatcac tcttaacaac cataaagtca 3000 tcttcgattt ttttagagtt ctccccatca tatacaagaa taccaagacc atatactgtc 3060 ccaatagtat cagattttgt tgcaacacca ctaaggtcag gagcattact tctatcaaca 3120 tcaagattta ggcgctttac aatattgtta gatatcagag aactatcaga accagtatcg 3180 aaaattgtcg taggaataac taaagatttt tcactctcag gtgcgactac tttggtctta 3240 atagtgacta catcactagc agacttgcgg ttaacaattc caatacctgc tatagaagac 3300 cataagtcat tattacatat aacatcatca tttttacgga ttaaatttgc agccatacta 3360 ttagttacaa ctgaaatcat aggatgaagg atagatgtga aaaggtcctt aacattatcc 3420 caaacttttt cacgtgtttt tatcgagggt tctcctttaa gctgactgag tactggttct 3480 ttataattaa tgaatttagc attcagagca ttgtaagcag atataacagt ctcttttggt 3540 acagcattaa cgaaggattg aaccagaagc tttaagagtt gtataaaaac ctcaaagata 3600 atacgctgcc tttcaagttg tgattttttc ttatcagtag gctttttatc tacacgtttc 3660 ctttctgtag gctttttttt taatccatag caggcatgtc tagagtctga atcactatca 3720 gaggagtcag aggagtcaga ggcgtcagaa gagttagagg aatcagagga gtcatgtgca 3780 taatttgttt tcctctttcc cttcttagat tttggacaag aactcttagt atgaccagat 3840 tttccacaac ttgagcattt ctttgatttc tttggtttgc gtgtacctga acttatacca 3900 aatacctgat taccacggta aatatggtca gatggtaaat ttttattaag ttctgtatca 3960 ataaatattt caagatcact 
aagggttact ttacttatat cgatatctct tggaagacct 4020 aatttatatg ctaaagcaag taatctctga aatgatgtat cattttgagg acttgaaaat 4080 ggactagaag atggtactgg tactgcagat gatctagccg tagtcatagg aattttagaa 4140 tttataagat tctcaatatc cgctaaagtt ggatcctggt atgtaactat aggagtaggt 4200 ggcatatctg tcctgtattt ctcaacttct tccaatttat tcagaataga tgcaactgaa 4260 ttttctaaac caatacgtcg cgcatccaac ttattttcac tggaaagacc atttaagaac 4320 tgttgccgta ctaattcatt ctgatttacc tctggaagag ttggatatgc aagtctaacc 4380 attcttcgta atttagaata gaatcggctt acagtatcac taccctgaac aatagcatta 4440 aatgcaagct tagacttttc agatgttata gtaggaaagt aatgcttaaa acgatatatt 4500 aattgtccaa tacgtattcc aggagccaca attgtacctc cattattagc attaggatta 4560 ttaacagcta tatcagtagg acgaccctcg gctgttgacc aatcttcatc ccatacagta 4620 tgatcaggta taaaattagc accagtaata gtattaactg cagctgcttg tccatgtaat 4680 acattagccc ctccacgaaa ttgattcgct gctacagcct gtatagcgcc attattcaaa 4740 gcattaaaag cagcaagatt ggccacacca gtattatcca gaagatttac acattcccaa 4800 ttttttcctt tgatatgtgt ttcataccaa tctctagcat catcttcaag taaagtctca 4860 aagaccccat gaatcctaac acgagttcta gcattagcag caggatcaag tccagcagac 4920 tcacaccatt gcctgaattc ttgaagccaa agttctggat cttcaccaga tgctcctcta 4980 aattttttag gtgcatatcc tgccattatc ggttgttgta ttactggagg gttattttgt 5040 aatgctagaa tctgttgatt caaattattg acctgatttt gcaaattttg actatgatta 5100 cgacagttta aaaattgccc tctccatgct cgagcttgct gaatgatacc attagcccgc 5160 cttctacttc gttctgtttc agtattatag cgctgtaata atctatgagc atttcttcta 5220 ctctcatcac gttctctctg gacattagct agttgaattt gactgttagc atattgagcc 5280 tggatgttag ccaattggat ctgagtattt tgtgcaagga cttcccaatg ttggcgatga 5340 cgcactttat ttgcatatgc atcctcatag gtacgtcttg ttctcagtat agtagcacgt 5400 gtattattaa gttcatgtct aagatattct tcacggttat gtgcatcatc taacaaacct 5460 tcaagtctat cctccatatt atcttgatct tctataatat ttcgcaactc attagcatat 5520 cggataatgg catcaagact atcatttgca gtgttaaata aatgatttat atgagggtgt 5580 gctgctatag ttgtacgatc aaggtgaata cgaataccag ctacactatt tgctatagtt 5640 gtccgattat ttaagatttc tgtaataggg 
tcaggaggaa ttgctgccat gtttttccgt 5700 tgtctattag caagttcttt tattcgatct ttaagttgaa tatttagagt atcaacttca 5760 atcagctgac gttctaatgc ttgaagccct tcctccttta gtgatgccgt ttctaatgct 5820 ttcttaagct ttgcacgcaa ctcagcattt ttttgcctaa gttcttctat ctcagattca 5880 tgaccctcga ttacaccata caggtatttt atatattgct cttgtgattg caatgtatgt 5940 gcaagctttg taatctgtgt ctttaacgat gtatcttcca ttgccagttc cacagatttt 6000 tatagaccct acatttaata aacaaatcga acaaaattat atccttcaac aaatctatta 6060 ccatccatgt ggatattatc gtactcctaa aaaactttgg gctaaagtgc aagctgaggg 6120 ccatgatttt aacatatctg atattacaga gtggttgcat aaacaggcta tatggcaaat 6180 acactcacct atacccaagt atgttccaca ggtttctttt aataagatta ctcgaccaaa 6240 tatgtatcat caagcagata tcttatatat gccctatgat actgtagata gaaagctgta 6300 taaatactgt ctgaatattg tagatgttgc tagccgatat aaagcttcag tacctctaga 6360 agatcgcagc tctgcacgtg ttgctaaggc atttaaaaag atttatagta aagcagactg 6420 cccgcttatt tggccaagag ttttacaagt agatggggga tcagaattta aggatgaagt 6480 tatccgatta atggatgaaa aaggtgtgcg tattcgggtg ggtactactc ataagagtca 6540 ggctattgtt gaacggtata atcgaacttt agcagagaga cttttcaaaa tacaggatgc 6600 aaaggaactc cttacagaag gtgtgaatac tgcctgggtt aagaatcttt ctgatattgt 6660 gaatgagcta aataattcta atactcgtct gctaggtatg tctccagttg aggcaatcca 6720 gaaagatcaa gtgtatgcac taccttcaaa aattcggaag gatagacctg taggtgcaga 6780 agaaatacgt ttaccagctg gttctcttgt gaaatactta cttgataact cagattacca 6840 aggtaggaga cgtgcaactg atccaatatg gtctactaaa gtttatacaa ttgaatcttc 6900 cacagttgtt gatgggcagc cggttatata taggttatct gatggaccta aaaaaatatt 6960 tactcgagaa gaattattag ttgtgccatc taatactatg ctgcctccta ctcatatatt 7020 atattaattt tcaagttgtg gtatcctata gtctccatgt gcatacattg tgatattatc 7080 ttgtgcaatc catttttatc atccataggg gataacccca ccttcgacat ttctataaca 7140 tttatggtat gtctatcaga tcggattgct ctaaaggtga ctgtatcatt atgtttagga 7200 tcaaagagtg ccttattata ccgctcatgg tggaactctt ttttaactac ctttgagggg 7260 acacctttag ccttacggat accatgtttt ggattggtgg tctcatcccc taccttgaga 7320 atggagtaca ttttgggacg gaggccaata aattctgcta 
taactgctcc attacattca 7380 tccttaaatt tccctgggat tttcatattt atgtcagtct tatcacctaa tgcttttcgt 7440 acaggatgat ctttaggata atcactgaag tcaaattggt ctgccatatc tatgaggtct 7500 gcattgatat cctcagtttg gatttgtatg agaagtgagt ctgtatctgt ataacataac 7560 tggatttttt caccatatcg tactttaaga gtatcatagt agaattgata catacagagc 7620 ttcgagaggt caagcacact catacccaca tatgttggac ggttaagcat tacctctact 7680 tttcgtcggt ggattgcaac aagattttct gagaatattt tacgaccctt gaatgcagga 7740 tctgaggtaa gctttttata tttcttggga tgggtttggg gttgtactac tgagacacgt 7800 acacgtttcc tgagattttc catagtttta ccaaatacac tattattcat gagtttatag 7860 aagtccttct caaaatcgtt tttagcctta gcacgttcgg cagtgttaaa ttctatatat 7920 ggtttcatcc atggagcctg ctcgaatttt atagcttcat agaccttctt gattactaag 7980 ccctggctaa cataatactg gagatttcta taatgaacta catattcatc cttatcaaat 8040 aggttaggga tgagcttatc cgtgggtgta tatcgttgtc ctgaacgtgc tattatttcc 8100 tgctgctttt tactaagcca ttccttctta accttcattc gttctggtgc aagggggtaa 8160 tctgtgtgtt gtgaatggag ggcttgaggg tactccagtt tgactttaag gatatatccc 8220 tcagaagaat catctggtat ttcattactt tctatttgcc tttgaatgtt aaataactcg 8280 ttttcctcct ttatccagtg gaagttacct gtaggtaagt attgcaacat ggcccatcca 8340 tataggttat ttgcgtctaa gtatagaata gatgacttgg gcttatctgt ggtccattta 8400 ccttctccca tacctggatt atttgcacga gcatatcgtt tacttaccat agagattcca 8460 ccacgtaatc cttgttctac catcatgtac atgtcctggt cagttataag atccaatctt 8520 tgccctgtca ttaagaatag tgcatcccat gcgaagcctg gtgttgaata gtaccagaga 8580 ggatctaacc catatttttt caaggccatt tttcggaaat tctgaaatat atcggctagt 8640 aggaggacgt cagttttgag ataaatgtca tgataatctt gcattgtttt acatcctgct 8700 ttttcccata catactgtgc atattcgtat tctttatcac taatgtgggt ttcattaagt 8760 ttgctataga atgctccttt aggtggtaaa ctagttttct cgaacttttt ccagtcatcc 8820 atatattcat acgggtagac acctttctta agataaatct cgagtaattc aggtggtatt 8880 tgacttcatg ttatcttaaa gttttcggga tgtgcaaagc atctacctgc atcaattcgc 8940 cataagtgtt gtggattact acattcttgt gctctacatt tttctgcacc taggttggat 9000 gctaatttgt cgagacttga tttcatgaat tgtaggctat ctataaatcg 
gagggaacct 9060 aacttgaatg ccatatactt ttccatgtta taagggatag gatcaatatc atcctcacat 9120 tccattgcac caattccttg cataattatg tgtccatcat atcctgataa attgtgaaag 9180 attactggga tatgcatttc acgaggtttt atggataagt caagattaca tcctctatgt 9240 gctgcacctc gataccttcc tgtaatatgg cagtggtctc ttactttatt tccatctaag 9300 gggttgcgac atatccagca gtttattgca ttatcataac ttacttgctc ttaggttgtt 9360 aatggcatca tttccattgg gtttctgaat tcattgcgaa ttgcttcagc ctcatttagc 9420 atttcgatta caaatttttg tgctgcattt tctcctgtaa atatcttctg tgattcactt 9480 acaccatcat atcttacctt aatatacgag tatgagcagg ggatttgctc ttgtactttc 9540 ttggttttct tatctttgtc ttcatctgtg ggtggtatgt ccattactag ggcttcaaag 9600 tctgcaacaa agtaatatgg gacttgcatg cggcgtttcc agttcttgaa ttcatatata 9660 tcatttcctt tctttactga tggcattcgg tttgcttgtg gggttttctt taatccttga 9720 catattgcta catgtttgtc atggatttct tgttgatgtg tgaagtggga gatacaccat 9780 cggcagacaa atgttgcatg tccatctttt gtacgtttat ttactaaacg gctcatattc 9840 ttaatccagc aatagtgctg cttttccccg ttactgataa gtactaaatc tattatttta 9900 cggccctctg catcatccct atcggtaaca tatattggac ataggcggtg gtcacgccat 9960 tcacagatgc agagggcaat agttgggttc tgtcgctcaa atcttcttag aattacctca 10020 tcagcttgta ctggaaattc gataccttca aagtttaact catttacata ctccttatac 10080 ttggaaattc tttctgcgtt cttttctact ggatgtaatg ctgctaggat ttcccactgg 10140 aagcatttgt catcatcatt agctggattt actactccct ttttcgtggc aagatctttt 10200 gggagtggta tatgtcctgc tcctgtgggt ggttggaatt gcgagatttc tataaatatt 10260 ttttctacac gatagaattc ccaacctgat ccgtaattta catagtcttc tactttctgc 10320 atgattcttg ctaatgcaaa gttaaatgta ccattaatat ctgttgatgg taggattggc 10380 atattgcgag ttttaaatgg tatgttatat ctgtagatat actctactcc atcatcttca 10440 ctaaagttac catctccgaa taatccctca aggtaagaat ctaaacgtct tattgtggca 10500 atattccaat tcttactttt attgatccca atcgatttct ttcatcccga agaatattct 10560 ggattgtttg actgtgtgct gtaagaaatg aagcatagtc atgctctcca ggtgttccta 10620 tagcaatatc agccgtaaag tttgagtctg ttaactcctt tcctatacgt acctgatcat 10680 gacgttcaaa gactacctca taatgtgggt ctgggtcaat ttgcatcatg 
cggattatct 10740 cttcattggt taccgccgct tgtgcttcac ggatttggca gatctcatcc tcatattcct 10800 gcaattttgc atttgagcgg ggatatggga tggacttgac aggtacctgc ttgaattcag 10860 ctagaactga tcgggtgact tgtctgtgtt gggcacttct tgcttgatgt gttgacatct 10920 tgtaaacttt ttatactcct cttttattca attacatttt ttctgaaaat aaaattaatt 10980 gttcgcctac cctatgaggg taaaaaaaga gggagacaga ttagtactgt cttcctaaac 11040 tattcctcac catctgagtc atcatcatcc gagtcaccga acacatcctc gtcccgatgt 11100 attctctatc cttaagtaac ttatagttgt aagttacaat ggcgaatagc ttcttttacc 11160 ataatatgct ctactgcggc atttgttttc tctttacaag ccttttgata tgcctcccat 11220 tcaagttcat ggaggtcacg atttctatta ccctcctcct ctgatcgacc aagagcatca 11280 tttgcttttt ccttcttttc tactgccctg tcgtatgcct cctcggcttc tttatgccgc 11340 ttccagagta ctcgttcagc ttcagctctc cagattgggc tgacttctag aagatgtgtc 11400 ccgaaatttc tacttggaat gtattccagg atatggccta caagttcggg gattactgat 11460 actgcggtat ataccatgtt atgtactatt tatctgtata gttacttggt acctaccttt 11520 tattcaatta cattttttct cggtataagg taattgcttg ccttgctatt ccgctctatt 11580 gactccatgg ctggtttcct ggatgggtta atatttcctt accttttttg gctccgttgg 11640 ctttgaaatc ctatttcttg ttcggttaat tttttcagat tgtgtggtga taatagaatt 11700 gtatggtggt aataaaacta tatgatggta actagattat atgaaaagtg gtaggtggat 11760 tatatggtgg taattaaaat tatatgatgg tggtaactag attatataaa gtggtagtta 11820 aattatatta aaagtggtag gtggattata tgatggtaat taagttatat gatggtggta 11880 attagattat ataaagtggt agtattttag attatatggt gggagtgagc cagaattggg 11940 gtttttatac tact 11954 // ID Copia-49_MLP-I repbase; DNA; FNG; 4981 BP. XX AC AECX01002337; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-49_MLP_; KW Copia-49_MLP-LTR; Copia-49_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. 
XX RN [1] RP 1-4981 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01002337; Positions 31562 26582. XX CC Positions [2233-2634] - Integrase core CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS join(400..2634,2638..4947) FT /product="Copia-49_MLP-I_1p" FT /translation="MSQGQDIDQFFAQSSSQVPPSEQSNSEDEADQTTRFI FT SQLLTSPSNTNKAPINPNLQPTSYKSDPTPLSKSHFESIMSGPSSPNTISP FT QQQAMLLNQSLGRSSKILTDENYVLWASFIRGGLKSLFLSDYLNSDDIKLE FT NSQFTDEVSRACSTNWMLNNMDDINRSRFEPKITVYGDSGPDTQDLPSKLW FT KAVTNHHSSRSEELRLLYERALNLITQASNVPILTHIQNFQNAVTKYKTAG FT GQMSDEDLGRKLLISLNSSHFQDAREIAISGIKEYDKVVSELKKRLDAVLM FT LTKSSVHSKVPVVIHAAEASSVSNSFSRNRQTLKCTKKKCVGVNHSPDQCF FT KRPGNEHLQREWIEQRVKLGQWNGDPPKDFAASSASAATLNEPTIEQLENV FT FNSLNASASHISAVSLSKLSLSSNPHDLGVLIDTAASHHMFKEKHIFVNYV FT DMANSNEYLNMAGGDATLKIHGRGDVQFFGPDGQRFDLHDCLHIPNLKRSL FT IGGTILLRDGFDFIHKKVEGRFEIIKDNQRAFQGVLNDDVNLMRSYRPVSP FT ESSPEANVVNQSTDPNILNLHRRLGHLNLRYMKSMVSQDCVKGLNVNVSSF FT PTAIACDSCDLSKSHRLPHNKLHVHSSNLLENIHLDLSGIIRTSAICGSVY FT FMLFTDDHSRYRHVVGLSEKSADKVFIKIRQYFSLVERQCNAKIKSLTLDN FT GYEFINDTLVPYCEEIGIYLRTTATYTPEENGVAERSNRTVTEPAAMMIEA FT NLPIRFWLYAIKASVYLKNRTITSSLPDGKTPFELWHGRQPDITHVRPIGC FT LCFVVIRKAIRHGKFNQVSRQAILLGHTEHNLNYEVFIFETNSIVISHDVT FT FREDVFPFKKLKSFDLSHLSFEGDEDIMLSDPSALEPLHEEEGDDALVPGG FT EGLHQDETELDIQPLIDQVEPIEPNESHPNICRSERERRPIERYTPSASYA FT YWDDEGAFTDLHDAFPCAFAVGSIVRLIQEPHNFISAMTGSDKDEWKMACD FT KEMKNMEDRKVWRLVPRPSDQSVVGCKWHFKVKLNPDGSINKHKARIVAKG FT YTQTYGVDYTDTFAPTGKPASFRTVVAFAAYHGLPVHSMDAIAAFLNSGLK FT HKIFMEQPQGYEIEGDDLVCKLLQALYGLKQSAREWNDDFKLKCLKAGFTQ FT SPADECVYIRRRAHDMCLFYLHVDDLAITGTNIDDFKKEIGGFWPMEDMGI FT STCVVGIQIARAGLNHYVLGQEAMIKLLLKRFGMTEAKVASTPFPGGTKLT FT KSTPDEARSFNLLNLPYRSGVGSLMYLSQCTRPDITYAVGCLSQHLENPSL FT RHWEAFKHILRYLKGTMNHCIHYKKPILPSKSSNLPLISSNNGFSLPEHFA FT DSDWAGDRSTRRSTTGYVFMLCGGAISWRSRLQQTVAKSSTEAEYRAANEA FT GDEMIWLARLLASIDLPQATPYLLNSDSLSTIDVSENAVMHGRTKAIEIHH FT 
HWLREKVKEGVIKLVYCPSEDMLADILTKPLHPGPFNHFRERIGVRQIVE" XX SQ Sequence 4981 BP; 1494 A; 1036 C; 1019 G; 1432 T; 0 other; ggtaaatata ataatctttg ggaaagagaa taaagcgctc ttatagactc aaagatcagt 60 tatctcgcat tcgataactt atactaatca gatcatcaat actctgtacc aacttatagg 120 tactaattgt tatttctttc tattcttcat aagtctttat cttcatttat ctgattagct 180 attattgtta taggtcagag gtctttctat cagattgttg tttcgaggtt gtgcttatca 240 gagcagtagt gcatactacc aggtcagagg tctttctatc agattgttgt ttcgaggttg 300 tgcttatcag agcagtagtg catactacca ggtccctatt ccgagactct aaaatattca 360 catggtagcg agagtcacat ctttcgatcc tacaaacgaa tgagtcaagg tcaagatata 420 gatcaattct ttgctcaatc atcaagccaa gtacctcctt ctgaacaatc gaactccgaa 480 gacgaagcag accagacaac tagatttatt tcacaattac tcacatcacc atctaatact 540 aataaagcac ctatcaatcc taatttacaa cccacatcat ataaatctga tcctacacca 600 ttatctaagt ctcattttga atcaatcatg tcaggacctt caagtcctaa taccataagt 660 cctcagcaac aggctatgct ccttaatcaa tcactaggca gatcatcaaa gattttgacg 720 gatgagaact acgttttatg ggcgtctttc attcgtggtg gtctgaaatc tttatttctt 780 tccgactact taaactctga tgacatcaaa ctagagaaca gtcaatttac tgacgaagtc 840 agtcgagcat gcagtacaaa ttggatgctg aacaacatgg acgacatcaa tagatcaaga 900 tttgaaccta agataactgt atacggtgac agcggacctg atactcaaga tctgccttcc 960 aagttatgga aggctgttac aaatcatcat tcaagtagat cggaagagct tcgtttatta 1020 tacgagcgag ctcttaatct cattactcaa gcttcaaacg tgcctatttt aactcatatt 1080 caaaactttc agaatgccgt tacgaaatat aaaactgccg gtggacaaat gagcgacgaa 1140 gaccttggtc gaaagctcct aatttccttg aattcatctc attttcaaga tgcacgagaa 1200 atcgctatct cgggtatcaa ggagtacgat aaagtcgtat ctgaactaaa gaagagactt 1260 gatgcggttt tgatgcttac gaagagttcc gttcattcta aagtccctgt tgtcatccat 1320 gctgcagagg ctagttcggt ctcaaattct ttctcacgaa atcgacagac cttgaaatgc 1380 acgaagaaga agtgtgtagg agtcaatcat tctccagatc aatgtttcaa aaggcctgga 1440 aacgagcacc ttcagcgaga atggatcgaa caacgcgtga agctcggtca atggaacggt 1500 gacccaccta aggacttcgc tgcttcttcg gcatctgcgg ctactctcaa tgagccgact 1560 attgagcaac ttgagaatgt gttcaattca cttaatgctt 
ccgcaagcca tatctcggct 1620 gtatcgttgt ctaaactctc tctatcatca aatcctcacg atcttggggt tttaatcgac 1680 accgcggcgt ctcaccacat gttcaaagag aaacatatct ttgtaaacta tgtcgacatg 1740 gcgaatagca atgaatatct taacatggct ggtggtgacg ctactctgaa aattcatggc 1800 agaggagacg tacaattttt cgggcctgat ggtcaacgtt ttgatttaca tgactgtcta 1860 cacattccca atctgaaaag gagtctgatc ggaggcacta tcttgcttag agatggtttc 1920 gatttcattc acaagaaggt tgaaggtcga ttcgagatta tcaaagacaa tcaacgagcc 1980 tttcaaggtg ttttaaatga cgatgttaat ctaatgaggt cgtataggcc ggtttcacct 2040 gaatcaagtc ctgaagcaaa cgtcgtaaat caatcaactg acccgaacat ccttaacctt 2100 catcgccgtt taggtcacct taatctgaga tacatgaagt caatggtatc acaggattgt 2160 gtaaagggtt taaatgtgaa cgtctcgtca ttccctaccg ctattgcatg tgactcctgt 2220 gacttgtcaa aatcccatcg tctccctcat aataaacttc atgtacacag ctcaaatctt 2280 ctggagaata tacatctcga tcttagtggg attatacgca ccagtgccat atgtggtagt 2340 gtgtacttca tgttgttcac ggacgatcac tcaagatatc ggcatgtagt tggtctatct 2400 gaaaagtcag ccgataaggt gttcattaaa attcgacaat atttttcatt agttgaaaga 2460 caatgtaatg caaagatcaa atcactcacc cttgacaatg ggtatgaatt cattaatgat 2520 actctggtcc cgtattgtga ggagataggt atctatctta ggacaactgc aacctatact 2580 cctgaagaaa acggagtggc tgaaagatct aaccgaactg tgacagagcc tgcctgagct 2640 atgatgatcg aagcaaatct tccaattcgg ttctggttgt acgcaatcaa agcgtcggta 2700 tacttaaaga ataggactat tacttcatcg ttaccggatg gtaaaactcc ttttgaattg 2760 tggcacggtc gtcaacccga catcacacac gttagaccaa taggctgctt atgctttgtg 2820 gtgatcagaa aagccattcg ccatgggaag tttaatcagg tgtctcgaca agctatatta 2880 ttaggacaca cagaacataa tttaaattat gaagttttca ttttcgaaac caactctata 2940 gttatttccc atgacgttac tttcagagaa gatgtttttc cctttaagaa gcttaagtct 3000 tttgatctat ctcacctatc atttgaaggc gatgaagaca tcatgttatc ggatccttcc 3060 gctcttgaac ctcttcatga ggaagaaggc gacgacgccc ttgttcctgg tggtgaagga 3120 ctgcaccagg acgaaaccga gctcgacatt cagcccctta ttgatcaagt ggagcctatc 3180 gaaccaaatg aatcacaccc taacatttgc aggtcagaac gtgagagaag accaatagaa 3240 cgttatactc catccgccag ttatgcttac tgggatgatg aaggagcttt 
caccgactta 3300 catgatgcgt ttccatgcgc cttcgccgtc ggatcaattg ttcgtctgat tcaggagcct 3360 cataatttta tatctgctat gactggttcg gataaagatg aatggaagat ggcttgtgat 3420 aaagaaatga agaacatgga agatagaaaa gtatggcgtt tggtaccccg accatcggat 3480 caatcagtag ttggttgtaa atggcatttt aaagtaaaac ttaaccctga cggttctatc 3540 aacaaacaca aagctaggat tgtcgccaaa ggatacactc aaacctatgg agtagattac 3600 accgacactt ttgcgcctac cggtaaacct gcctcttttc gtaccgttgt ggcttttgcg 3660 gcttaccatg ggcttcctgt acactcaatg gatgcaattg ctgcattctt aaatagtgga 3720 ctgaaacata agatatttat ggaacaacca caaggctatg aaattgaagg tgatgacctg 3780 gtatgcaaat tacttcaagc tctatacgga ttaaaacaat cagcaagaga gtggaatgat 3840 gactttaaat tgaaatgttt gaaagccggt tttacacaat ctccagctga tgaatgtgtc 3900 tacattagga gacgtgccca tgatatgtgt ttattttacc tccacgttga tgatttagca 3960 atcactggta ccaatatcga cgatttcaag aaggagattg gcggtttttg gccaatggaa 4020 gatatgggaa tctcaacctg tgtagttggc atccagatag ctcgagccgg tttgaatcac 4080 tatgtactag gacaagaggc catgataaaa ttgttactca aacgtttcgg catgactgaa 4140 gccaaagtcg cctctacccc cttccctggt ggcacgaaac ttaccaaatc aacgccagac 4200 gaagcgagat cttttaattt attaaatcta ccttatagaa gtggggtagg cagtctcatg 4260 tatctatcac aatgcacaag gccggacatt acatatgcgg ttggctgtct ctcacaacat 4320 ctcgagaacc cgtctttacg acactgggag gcattcaagc atatacttag atatttaaaa 4380 ggaacaatga accactgcat acattataag aagccaatct taccatctaa atcctccaac 4440 ttaccattaa tatcaagtaa caacggattt tctttaccag aacatttcgc tgattctgat 4500 tgggcgggtg acagaagtac aagacgatcc accacgggct atgtttttat gctgtgcggg 4560 ggagcaataa gttggagaag tcgactacaa caaacagttg cgaaatcatc tacggaagct 4620 gaatacaggg cggcgaatga agctggtgac gaaatgattt ggctagctag attgcttgcg 4680 tcaatagatt tacctcaagc aacaccttat ttattgaatt cagatagctt aagcacgatt 4740 gatgtgtcag agaatgcggt aatgcatggc aggacaaaag ccatagagat acatcaccat 4800 tggttacgtg agaaggtcaa ggaaggtgtt attaagctgg tttattgtcc ttctgaggac 4860 atgttggcgg atattttaac taagcctctt catcctggac cttttaatca ttttagagag 4920 cgtattgggg tcaggcaaat tgttgaatag agatatttta attgtgtcga ttgagagggg 
4980 g 4981 // ID Mariner-1_ABr repbase; DNA; FNG; 2098 BP. XX AC . XX DT 16-APR-2011 (Rel. 16.04, Created) DT 16-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE Mariner/Tc1-type DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-1_ABr. XX OS Alternaria brassicicola OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Pleosporaceae; mitosporic Pleosporaceae; Alternaria. XX RN [1] RP 1-2098 RA Clifton S., Fulton L., Fulton B., Godfrey J., Minx P. RA and Nelson J.; RT "Draft sequence assembly of the Alternaria brassicicola genome."; RL Direct Submission to EMBL/GenBank/DDBJ (10-MAR-2009)Unpublished.. XX RN [2] RP 1-2098 RA Jurka J.; RT "DNA transposons from the Alternaria_brassicicola genome."; RL Direct Submission to Repbase Update (22-MAR-2011). XX DR [2] (Consensus) XX FH Key Location/Qualifiers FT CDS 93..1940 FT /product="Mariner-1_ABr_1p" FT /translation="MDPIQEAIAEIESRDPGEQFSYQQIAKKYGVERTTLM FT RRHKHQNDDYGVRNQSLHPEREAELVRYIETLTERRLPPTRTMIQQFASQL FT AGKPVSESWVSRFLRRHPNHLISRSGKAMAKERTKADSGAKYNLYFKLLHE FT KIEEYNIQPTHIFNMDEKGFQLGRVGNTKRIFSRRLYEQKGARQALEDGSS FT EWITVIACICSDGKALSPTLIFQGANGAVQSSWVEAVQAGEHSVFTTSSPS FT GWSNNDIGLAWLKQVFERETRRHASTGYRLLLLDGHGSHVTMDFIEYCNDN FT KILLFVFPPHATHTLQPLDVVMFKPLATAYSTQLMGYLQDSQGLLNLTKGD FT FFQLFWRAWCSVFKPPLIKRSFEATGIYPANPDVVLKKFPKEASDSDSSQS FT VLSGEDWLKLKSIVRREVKDQGSKDVKKLQRSLCHIAAQNSILHEEIRGLR FT QSLAIKERRPKQSFTLQLDEDEVYHGGAKLWSPRSVQRARDRRASQQQQQE FT LEKLQKAKQAEIKKAARDCEAQLKAAKRVERERRADEKRKEEAAKQAEKQH FT QQLINNTKKLIKLSQKGKRKASQPHTPATKRQKRGGVAPAVVGAARVARAA FT PPRSRRQRPITPSKKISE" XX SQ Sequence 2098 BP; 624 A; 532 C; 500 G; 442 T; 0 other; tacagtatcc gtgcgccagt tgcacacggt caccagttgc acacccccac caccacaaca 60 acaaaacttc aacgcgtcat aactcagcga ccatggatcc gattcaagaa gcgattgcag 120 aaatcgaatc gcgcgatcca ggagaacaat tctcgtacca acaaattgcc aaaaagtatg 180 gcgtagaacg aacaacgctg atgagacgac acaaacacca aaacgacgac tatggcgtac 
240 gcaaccaatc cctccaccca gaacgcgaag ccgagcttgt acgatacata gaaacgctta 300 ccgaacgacg tttgccacct acaagaacaa tgatacaaca gtttgccagt cagttggctg 360 gcaagccggt ctccgaaagc tgggtttccc ggttcctccg ccggcatccc aaccatctca 420 tctctcgctc gggcaaagcc atggccaagg agcgtaccaa agctgattca ggggccaagt 480 acaacttgta tttcaagctt ttacatgaaa agatagagga gtacaatata caacccaccc 540 atatattcaa tatggacgaa aagggatttc aacttggccg ggttggcaat acaaaacgta 600 tattcagcag aagactctac gagcaaaaag gggcaaggca agcacttgaa gatggctcaa 660 gcgagtggat aacggtgata gcttgtattt gttcggatgg caaagctttg agtccaactc 720 tcatcttcca gggagccaac ggagctgttc aatcaagctg ggttgaagct gtacaagcag 780 gagaacactc agtatttact acgtcatcac cctctggctg gagcaacaac gatattgggt 840 tagcttggct caaacaggtg tttgagagag aaaccagacg gcatgcttca accgggtatc 900 gattactact ccttgacggc cacggatccc atgtaactat ggattttatt gagtattgta 960 acgacaacaa gatcctcttg tttgtatttc cacctcacgc tacccatacg cttcaaccac 1020 ttgatgtggt gatgttcaaa ccccttgcaa cagcttactc aactcagttg atgggatacc 1080 tccaggacag ccaggggtta cttaatttaa ccaaaggaga cttcttccag ctcttctgga 1140 gggcttggtg tagcgtattc aaacctccac tcatcaagag gtcgtttgaa gccactggta 1200 tatacccagc caacccggac gttgtactta agaagtttcc caaggaggct tcagattctg 1260 acagcagcca gtcagtcctc tctggtgagg attggctcaa gctcaagtca atcgtacgcc 1320 gtgaggtgaa ggatcagggc agtaaggacg tcaagaagct tcaacgaagt ttgtgccaca 1380 tcgcagctca aaacagtata cttcatgaag agatcagggg cctcaggcag tccctagcca 1440 tcaaggagag gcgaccaaag cagtccttca ccctccagct tgatgaggac gaggtttacc 1500 atgggggagc caaactgtgg tcgcccagga gtgtacaacg agctcgcgat cgtcgggcat 1560 cacaacaaca acaacaggag cttgaaaagc ttcaaaaagc caagcaagcc gagatcaaga 1620 aggcagctcg cgattgtgag gctcaactca aagccgcgaa gcgtgtagag cgtgagagac 1680 gggcggatga gaagaggaag gaagaggctg caaaacaggc cgagaaacaa caccagcagc 1740 tcatcaacaa caccaaaaag cttataaaac tttcccaaaa gggtaagcgc aaggcttcac 1800 aacctcacac gccagctaca aagcgtcaaa agcgtggtgg tgttgctcca gctgtagtgg 1860 gagccgcaag ggttgcacga gctgctccac cgcgtagtcg acggcaacga ccaataacgc 1920 cctccaaaaa 
aatatctgaa taagccaaaa ctttcgtagg cctgtatata gtataatcct 1980 agctttgcgc aattttatta cgatttgtgg tggctctaca acctcatctt tttgaatata 2040 ttgttgttgt tggggtgtgc aactggcgac catgtgcaac tggcgcacgg atactgta 2098 // ID Copia-1_MVPL-LTR repbase; DNA; FNG; 176 BP. XX AC AEIJ01000200; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Microbotryum violaceum genome: long DE terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_MVPL_; KW Copia-1_MVPL-I; Copia-1_MVPL-LTR. XX OS Microbotryum violaceum OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Microbotryomycetes; Microbotryales; Microbotryaceae; OC Microbotryum. XX RN [1] RP 1-176 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Microbotryum violaceum genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; AEIJ01000200; Positions 42461 42286. XX SQ Sequence 176 BP; 36 A; 46 C; 35 G; 59 T; 0 other; tgtttagatt ggtctgcgcc tgagtgacac ttttctcctt agtcttactc cgttcgctct 60 cgcaacggtt ctgacataag tgtcctcatg tgaattgtca ttacatctat tgttcgctct 120 cgcaacggtt ctgacataag tatcctcagt gcgggataca tagcgtccca aatcca 176 // ID Copia-34_MLP-I repbase; DNA; FNG; 2746 BP. XX AC AECX01001687; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-34_MLP_; KW Copia-34_MLP-LTR; Copia-34_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-2746 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001687; Positions 58635 55890. 
XX CC 'AATAT' target site duplication CC LTRs are 99% similar to each other. XX FH Key Location/Qualifiers FT CDS 59..2668 FT /product="Copia-34_MLP-I_1p" FT /translation="MSTPSSEFSEEISLTDTEEESNSSSSSSAQTARLPDS FT SSMSNEEPMSVFGNALQRYGQQLNNALSKFKVTDDLEDGNYPTWKRSVYDN FT LETLELHHYVIVKDFVDKELTSDKVLKTKKVIVNYILNRLDKPNHCQAINW FT LTDPEDPNSILHDPFILWSFLKERHFLINAQRLASISKSLSVITISRGDTL FT SGYLDKFENLFIEFTRYGGKMDDAQSALRLIDSIDRLSDSTVEFIHSTVSP FT LTRRDVVKYLRDYDTRHNFSTEATREARPAELSANTVSRGQRFECTESMCK FT GPHPADMCWSKPQNFKERDDFLACRRANRTTGRQRSNFNPNASLSNPQVRG FT MKRVTPSANMVAESLERMSLHTSFKVISLDDSTSANLTNAMSSDNSIWALH FT DTGATHHMFNTSSIFVPDSLRPVEDSNRRLKLAGGGVSLAVESIGKVLLKA FT GDGTIFELTDCLYVPELSKNSIAGGNLKKKGVREVYDCTDPMCFALVRNGL FT ALFNGFILDTGLMNVLMDPVRSVKPEINQASNSHSHSIIHHRRGPQSQTYL FT NSMIKHGSLDGVATQVENFKTSDSSASPLTGNNKSINLSPSPNPDSNQIIH FT SDKKLFQPSQTQIQIPSTRLLTPQSTQNNTSANNIDRPSVDQISPSHQKES FT MSNRDHHQNYQSDDNNHYPYNQFNHTNHQKDQLDPSPNHSTTIKPTFGTQV FT AYPISRCTPISSSSNPSPVVHPISNKITDTPVNINHSPTQYSISPTVVQQS FT NQTQNDINYKASNAIPDTKLYKEDVLACNSLNWQQACNEKMNSLLKENVWT FT SVNCHVNKIMLFFIAIILVPFAVSTDQCFTFEPGIMRLTFIGFTKLSAGVT FT FLLNTVQLKRWLLTYSQ" XX SQ Sequence 2746 BP; 843 A; 651 C; 479 G; 773 T; 0 other; tggtagcgag aggttcgatc aacgaaacct cataatcaat tctgtcttaa cccgatttat 60 gtcaactccg tcgtcagaat tctcagaaga gatatcatta accgataccg aagaagagtc 120 aaattcatcg tcttcctcat cagctcaaac tgctcgttta cctgattcca gtagcatgtc 180 caacgaagaa cctatgtcgg tgtttggaaa cgcacttcag cgttatggcc agcaattgaa 240 caatgccttg agcaaattca aagtgacgga tgatcttgaa gacggcaact acccaacctg 300 gaagcgatct gtctatgaca atctggaaac tcttgaactt caccactacg tcattgtcaa 360 agattttgtt gacaaagaac ttacttccga taaggttctt aagactaaga aggtgattgt 420 caattatata cttaatagat tagataagcc gaaccactgt caagcaataa attggttaac 480 agatccagag gatccaaatt ccattctcca tgatcctttc atcttgtggt ctttcttgaa 540 ggaaagacat ttcttgatta atgcccaacg cttagcttct atctccaagt ctctcagtgt 600 tatcaccatc tcacgaggag acacgttgtc gggatatctg gacaaattcg agaatttatt 660 cattgaattc actcgttacg gtggcaagat ggacgatgct caatctgccc 
ttcgtctaat 720 tgattctatc gatcgtcttt cggattctac cgtggaattc atccattcca ccgtctcacc 780 cttaactcgt cgtgatgttg tcaaatacct acgtgactat gacactcgtc ataatttttc 840 cacggaggct acccgtgaag ctagaccagc cgaactatca gctaacactg tgtctagagg 900 tcagcgattc gaatgtactg agtctatgtg caagggaccc catccagcag atatgtgctg 960 gtctaagccg caaaacttta aggaacgtga cgactttctt gcttgtcgac gtgctaacag 1020 aaccactgga cgtcaacgat ccaatttcaa tcctaatgct tcactctcaa atcctcaagt 1080 cagaggtatg aagagggtta ctccctccgc aaacatggtt gccgaaagcc tcgagagaat 1140 gtcccttcac accagcttca aggtgatctc tctggacgat tccacttccg ccaacttgac 1200 aaatgctatg tcttctgata actctatttg ggcattacac gacaccggcg ccacccatca 1260 tatgtttaat acatcaagca tctttgttcc tgacagctta cgacctgttg aggactcaaa 1320 tcgaagactt aagttagctg gagggggtgt atctctagct gttgaaagca tagggaaagt 1380 tctcttgaag gctggtgatg gaacaatttt tgaacttact gattgtttgt atgttcctga 1440 gctaagcaag aattcaatag ccggagggaa tctgaagaag aagggggtta gagaagtcta 1500 tgattgtaca gatcctatgt gctttgcttt agttcgaaat ggattagctc ttttcaatgg 1560 ctttattctt gataccggac ttatgaatgt actcatggat ccagtaagat cggtcaaacc 1620 tgaaatcaat caagcctcaa attcgcactc tcactcaatc atccatcatc gtcgaggtcc 1680 tcaaagtcaa acctatctga attcaatgat taaacatggt agtcttgatg gtgtagcgac 1740 tcaagtagaa aactttaaaa caagtgattc ttcagcatct cccttgactg gcaacaataa 1800 atcaatcaac ttatcaccgt ctcctaatcc cgactcaaac caaatcattc attcagataa 1860 aaaactattt caaccatctc aaacccaaat tcaaattcct tctactcgat tgttaacgcc 1920 tcaatctact caaaacaaca cctcggctaa taatattgat agaccttctg tcgatcaaat 1980 ttcaccatcc catcaaaaag aatcaatgtc taatcgcgat caccatcaaa actatcaatc 2040 tgatgataac aatcactatc catacaatca atttaaccat actaatcatc aaaaagacca 2100 acttgatcca tcaccaaatc attctacaac cattaaacct acttttggaa ctcaagtggc 2160 ttacccgatc tccagatgca ctcctatctc atcatcatcc aatccttctc cagtagtcca 2220 tcctatttca aacaaaatca ctgatacccc tgtgaacata aaccattctc caactcaata 2280 ttcaatctca ccgactgttg ttcaacaatc aaaccaaact caaaatgaca tcaactacaa 2340 agcatctaat gcaatacctg atacaaagtt gtacaaggaa gacgtgttag catgtaactc 2400 
ccttaattgg caacaagctt gcaatgagaa aatgaactca ctgctcaagg agaatgtatg 2460 gaccagtgtt aactgtcatg taaacaaaat catgctattc ttcatagcaa taatcttggt 2520 accatttgcc gtatcaacag atcaatgttt cacatttgaa cctggaataa tgagattaac 2580 tttcattgga ttcacaaagc tgtcagctgg ggtgacattt ctcttaaaca ctgtccaact 2640 gaagagatgg ttgctgactt actcacagta gctcttggca agcagcagtt caccaaactt 2700 aggagcaaat tgggaatcaa gtgacctcat tggtcttgag ggggtg 2746 // ID Gypsy-1-LTR_PPl repbase; DNA; FNG; 863 BP. XX AC . XX DT 09-APR-2009 (Rel. 14.04, Created) DT 09-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Long terminal repeat of the LTR retrotransposon - a consensus DE sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-1-LTR_PPl. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-863 RA Bao W. and Jurka J.; RT "Gypsy LTR retrotransposons from fungi Postia placenta."; RL Repbase Reports 9(4), 924-924 (2009). XX DR [1] (Consensus) XX SQ Sequence 863 BP; 197 A; 256 C; 214 G; 196 T; 0 other; tgtggtaggg ccaagtggtt cactggcaaa atccgcagga agacgacttg caagcgcggg 60 tcgcgcgctt cgtgttcgga accatgatcg ctccgtctcg gtctccatcg ctggaccacc 120 tcgtcaacct cgcacctccc gaacaactac gcaatgcatt ccatggctaa acgacgacgc 180 agtgcgcatc aagcaggaat cggcgcagca tatcgtgcga agatcaagga caataagata 240 agacacgcgc gtgaggcgtg tagggtcggc agaaacattg cttgttcgtt cttttctctc 300 tctcttcact cctgtaacct ccgttctgga cggcctgagc gacgcggatt gcgcgcggga 360 catagggtcg agcgcgtgat cgtgcctcct ccgcttgttg gggggagcaa cgcgttcgta 420 aatagagtcg agactcgtca cgctgactcc ccgaacaata gcccgcgatc gcatgatgtt 480 tagcgatggt actgttcttt cccggcacac cacgactgat atgcatgtac tcgactactt 540 catgtgaaga cttctaggtt acgggtatat agggagaaga aatgggcctg tacgagccag 600 cgaaccacgc gcaattcaca attctcttaa tctaaatatt cgaacctttg ctggatcctt 660 ggcgacgctc tcgctcctcc tcaaagtcta cgaaatcctc caggtacgcg atcctctctt 720 cgcgattcac tccttccagt ctcagctagc gctggggcta ttctatctaa tctgatcccg 780 aactctcagt ccacgaggag 
gctgtactaa cgtcgacaac acccgggttc cgatcccacg 840 agcacaaaac ccccactacg aca 863 // ID Gypsy-28_MLP-LTR repbase; DNA; FNG; 189 BP. XX AC AECX01001249; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-28_MLP_; KW Gypsy-28_MLP-I; Gypsy-28_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-189 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001249; Positions 96505 96693. XX SQ Sequence 189 BP; 45 A; 61 C; 36 G; 47 T; 0 other; tgttatgatc ctctatcacg gatcatggga ggctatgtca aagactgtga cacttcaccc 60 ttgttgggta actggagacg cacacaccag ttgtagacct tttccctcac gctacaataa 120 ctatcatagc tggatcacac ctctctccct ttgccctcac gcgccagcac ccagacccgg 180 gtcctaaca 189 // ID HARBINGER1_CN repbase; DNA; FNG; 3368 BP. XX AC . XX DT 31-MAR-2005 (Rel. 10.03, Created) DT 21-APR-2005 (Rel. 10.03, Last updated, Version 1) XX DE C. neoformans Harbinger DNA transposon. XX KW Harbinger; DNA transposon; Transposable Element; CNIRT1; KW HARBINGER1_CN. XX OS Cryptococcus neoformans OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-3368 RA Goodwin T.J. and Poulter R.T.; RT "The diversity of retrotransposons in the yeast Cryptococcus RT neoformans."; RL Yeast 18(9), 865-880 (2001). XX RN [2] RP 1-3368 RA Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., RA Bruno D., Vamathevan J., Miranda M., Anderson I.J. 
et al.; RT "The genome of the basidiomycetous yeast and human pathogen RT Cryptococcus neoformans."; RL Science 307(5713), 1321-1324 (2005). XX RN [3] RP 1-3368 RA Gentles A. and Jurka J.; RT "C. neoformans harbinger transposon."; RL Direct Submission to Repbase Update (31-MAR-2005). XX DR [3] (Consensus) XX CC An autonomous DNA transposon, around 43% similar to Harbingers in CC zebrafish and pufferfish. It has 17 bp perfect terminal inverted CC repeats, with CCC-GGG termini. There are only a few copies in the CC genome, some of which have TAA TSDs. XX SQ Sequence 3368 BP; 852 A; 812 C; 827 G; 874 T; 3 other; ggggtagaca aaatgcagct cattgttatt ttttatttct gttagggaag caaagggaga 60 cctgatttca gggtggaggc ttaggaaatg acgacagaga gtctagaaca ggcccatagg 120 aattagtgaa tttgcttcgc gacgcgagtg acctcgacag agacgcgtct gtctttgcaa 180 catgataatc gcgtctgttg cgttctgaat cctatcctct tcaacatcca gcgctccaag 240 agcagtttac caccctaaga acaacgtagt gaatatgcca agacgatcag ctaagtataa 300 agcgacagca cgagccagag agctctatcg aatggcgact atcctcaatc ataagcagct 360 caagrccagt ytgtccattc cgtatcatgc tctcaggaca tctcgttacc tcaatcgtcc 420 gaagaagtat ggtcagcttc gtagtaggga tgctgttgcc agaattacaa gcattcatga 480 gataccagat gaggatttcc gccgcaaact gcgcgtcaac catacggaat ttcgaaagct 540 tctctgcctt atcaaggacc atccagtttt cgtctctcat ggtccaagaa agcaggcgaa 600 ccccctttta cagcttacag ttgcgctcta tcgactggga cattgcggat gtgctgcgag 660 cacttttgag ataggagagc agtttggggt gtcaggtgag tgcccacgcc acactgcatc 720 cagaaacatt aaaccgtttc ttcgagattg attgttatcc atcagagggc acgtcggcca 780 tatggacgac tcgggtaatc aaggccatcc tgtcattgga gcgaaacaac gtatactggc 840 cagatgaaaa tgagaggaag gctatagaca ggcactttga agaggaggag gacattcctg 900 atggatgtgt ggggattatc gacggcttcc atgttccctt cgcttacaag cctgctcgcc 960 atgatgctgt cgatttcttt tcatacaaag ggcgctacgg cttcaatatc ctaggaattt 1020 gcgaccattt gaagaggatt cggtacttcc agtacggtta tcctgcttct gctcatgatg 1080 ctcgtatctt caagaactgt tctctctttg aagaggctaa tgccgacgct cagagtaaca 1140 gagaagccat gttgcaaggc cgagctgttc attcagaaat 
gatcagtcaa ggtgaatatc 1200 ttttggcaga ttcggcattt cctgctgggg attggtgtgt accgcttttc aagcgtcgga 1260 gaggtcagaa cgaccttgat cggccagaag taagtttatt tgtccgcata tctcatgtat 1320 agacctacat gctcatatat acaacaggcc aagttcaata agaagtgttc gagtgcccga 1380 gtcaagattg agcatgccta cggaatcctc aaaaaccgct ggcagagtct taggagcctt 1440 cgcgtgaaga ttcgcaatgt cagagatgaa ggcgttgcga catgctggat ccgggcctgt 1500 gtggtattgc acaatctgct cattgacaca ggtgattggt acaatcgact ggatggtgac 1560 gctgatgaac ctgatgatta catcgacatc gaaaggatgc aagaaaggat ggagagggtg 1620 ttagagcggc acagggaaga agaggaggat gaggcacgtg atgcaagaga tgcttcgaat 1680 ctcacgcgaa agagggtgat ggagcgaatg caggaaatag agcgcaggga tcttctatga 1740 tcatatctgc aggggagagg aagagacgac tgagaatgca tatcataatt atagagaggg 1800 agctccaggc tcaggcccca ccataacttt cgcctcctcc caacttttcc cagacctcat 1860 cagcatctcc atcttcctgt tccaactctc cgccaacatc ttcatctttt ctgtcgccgc 1920 attctcccgt gctatcgtca cattctcctg cgcgatattc atcatgtctg catgatgttt 1980 ctcctgcacg gagattttcc gttcttgaac ggccaatagc tcctgatgcc gccgatcgtc 2040 gttgccctcc tgcctcatga ccagctcatc catcttgtca tctgccgaga cagcctttgc 2100 cttggaagac cttatggatg ctctctggct ctgggtttgg gctcggattg gtgtagacac 2160 tgagcccagc acagatgagg accgtgcagg ttgagacgaa gagcggctct ccctgcggat 2220 agctgcaatg attggagcat ctgcctctga ctcggtctcg ctagcctcac ctgcaccgtc 2280 cccaaagaga ccaccatcct ctcctgacaa ctcatcttcc gagtcatcag cagccatttc 2340 gcccctttgg cgctctatca aaccatcgag rgcagcgaga tctctgtcgg gacgattgag 2400 agaggcgtgg agggtcgaag aggcgtggtg ggcagtgacg gaggccctgt cgccaaggac 2460 aggtaaaagg atctcgtaga aggggcaaac cttctttcgt tgagctatta atgttcagtt 2520 agccacgacg tgcccatgct gcctgccaaa gtacttttac acaccaagga gactctcgtc 2580 gtcgatctcc atcgccccag caccagtccc ggtccctatc tgggatgcct tagtgaagct 2640 ctcgacaatt tgattgatct ttcatcatat cagcgctcat gagcatacca aaaacgaaaa 2700 agacatattt actttgagag caacactttg aaactggcgt cgactgtcgc agccattgtc 2760 agtaaggtac tgtagacact ttctcgacca aaaatccttc ttctcatgac cagttttcca 2820 gttatgaaag ttggggagcc ttccattctc aggaatggtt gacaaccata 
cagcaacgtg 2880 ctcctctgcc gagacaccat cgctgttggt atctgtcgcc cattgcttgt tggagtttcg 2940 ctgcttcttc tgagtcgcct cagacgcttg attctcgttc tcgttctgcc ttttaggggg 3000 catgattggt gattgttgag atgaacggag tctgagataa aaataagtca atggtggaat 3060 gtgtaagatt tctaagagaa aaaaacacgt tacgcgtctt gaaatcatgc attctgaagc 3120 acgaagagcg agtttgaatt gtcaaatgaa ttaaacgacc tccagaggcc gtttaattcg 3180 aacggctttt ttcaagttat ttcaagagga atttcctttc atggtaaatc cgactcggac 3240 ttctgccaat tcaattccag ggtttttacg ccacaaaggg ttcaggaatg tcattttccc 3300 tgagggaatt ggcagctcgt tcatcaattc agaattatca tcgaaatagc atgcattttg 3360 tctacccc 3368 // ID Gypsy-5_PPM-LTR repbase; DNA; FNG; 473 BP. XX AC ABWF01004803; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-5_PPM_; KW Gypsy-5_PPM-I; Gypsy-5_PPM-LTR. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-473 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01004803; Positions 225 697. XX SQ Sequence 473 BP; 84 A; 154 C; 109 G; 126 T; 0 other; tgtcacgagt tggtacggcg tctagccatt tttaattttt tcctcttcgc gggctcgaac 60 ccgcgacctc tgtactacat gcctgatatg gacacgtcgc tatctactgt acgattagat 120 tagatgggcg gcagtgcgcg gcacagctcg ggctccgctg acacagcctc tctgactcac 180 cggcccccgg ggccacgtca gcatgctgcg cacggaccag ctgtgcacgg accagtgcgc 240 acagcctccc ctaaacttcc cctgtatata aacctgcctc tgctgcccta gggacctcag 300 tttcgacctc ttgtccctag gtcagtaagc ccctgtacat acttactttc ctatgctaac 360 ttacttgtcc ctagccttag gtttgagcgt tgctcaccat cgcttcggtg agatctagtc 420 ggacgccccc agcttttagg ttttctcctg tcgcttgacc tcaagtcgcg aca 473 // ID Copia-29_MLP-LTR repbase; DNA; FNG; 279 BP. XX AC AECX01003109; XX DT 22-APR-2011 (Rel. 
16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-29_MLP_; KW Copia-29_MLP-I; Copia-29_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-279 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01003109; Positions 22 300. XX SQ Sequence 279 BP; 52 A; 71 C; 55 G; 101 T; 0 other; tgttcattta atttatccta ataccttctt acgcgtgctc ttcacgattc cctatgagct 60 cgcaggttag tgtcttgtta tcgtcgtaga cctgctgttc gctgtagtct tacgaagata 120 cctgtgccct caggttagtg tcttgttatc gtcgtagacc tgctgttcgc tgtagtctta 180 cgaagatacc tgtgccctca ggttttccct ttttactaca cgcgaatctt agcggtctgt 240 tcagacttcg attctctgga atcacaagac attctctca 279 // ID CACTA-1_Mlaricis repbase; DNA; FNG; 7089 BP. XX AC . XX DT 16-MAY-2011 (Rel. 16.05, Created) DT 16-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW EnSpm; DNA transposon; Transposable Element; CACTA-1_Mlaricis. XX OS Melampsora laricis OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX CC incomplete_on_the_right. 
XX SQ Sequence 7089 BP; 2040 A; 1599 C; 1569 G; 1881 T; 0 other; caccgccaag aaattcacgt cggaatgccc cttccgatgt cacatcgtgg acggtgttcc 60 gaagcagggt cggggccaaa ttgtgcgata gaattgttgt gcgatgtgta atcacatctt 120 cactgtccga tgggatccgc agtccgattg aggatcgctt tggcacaacc gatgaaactg 180 agcgccgatc ccaaaccgca ttggtggccg ccgactgaac ctcagtctga tcaggtccga 240 tggctactcg tatagcgctc atccgactaa aacaaatgtg cgatacaatc atcgcatcgc 300 atcccctctg agcaactaaa tgtcgggcct tttgtcccga ttcaatcatt ttttcgattt 360 catgtcggtg tttgaatcgc gattcaatcc cctgaatgcc cgattcaatc tcgctggcga 420 ggcaacttcg gttgcatctt agcgacgcat tccaaaagca gccgatgcaa tcggcgtagg 480 accgatttgg tctcgaatcg acacaaaaat tgcagaaaaa ttttctgtct tgcgagctct 540 tcgaacttcg aatctgagct cttcgagctc ttcgaacttc gaatcttaag ctcgagtgtc 600 agaggtctgg gcttcgagcc tcaagagacc ttgacatata aagcttcgaa gttcgaactt 660 gactttgaag ttcgaacttg cccttgaaga ctcagcacgt agccactctt gcgattaagt 720 gcttccatgt agtcagggtg atgcggagag gctgagatcc aatactctcc tttgatgttc 780 agcatgtatg catcctcaga atagtgtgtt cttctgagga tgaatagtcc agatgcgcat 840 ttcaatccat ataagatagg gttcaactca cgcgatcatc ttcatattca acagtttctc 900 gcttcatctc actttgacac catttatcaa ttcattaagg ccaatcaccc aagaaagcaa 960 agtcttgaaa gatgaaaacg ttagctctga taccttgcca ctgttccgac tgtgaaaaaa 1020 acacctacca acatccccag accaatgcta aagtatacgg aacgctctgg agacctgcaa 1080 atcacaaaaa acatgtcaag agcttcggga cacgcgcgac atgcacggca caaaaactct 1140 ctttatcctc ggatatttca ggtaggtaga cgttctctct tgttaaattg ttaatattcc 1200 ttgagagtct gagagcaagc ggttgataga tacacgatca ccatcacaag atgcccaaaa 1260 ggcctcaatg gaaagtttgc ctcttgcttc ttcgaactta gaaacaacag ctttcaatcg 1320 taaagccaca gaaagccact tccaaccatc atgggatggg ccaatcatag attcaagtgt 1380 gtttcatcat cttttattca atagagttct atctttatgt gaatttcgtt gaatttttga 1440 atttttgaca agcaggcata tacagtacaa cacactacga caagtttcga cgttggaagc 1500 cttctgacct ggcttacttc cttatgatca catgtttaca ccttatcaaa aatatcagtg 1560 tggaggctgc cacgctttgc ttgcatgtca taggtgcagt ctccaccttc aacaccgatg 1620 aaaagtcttc gcaggccact ttaccaaagg 
ctatgagtac atttcttgat ggactacgga 1680 ttgagccaaa gattcgcaga acagtgtgct gccccaaatg cttcagtatg tactatggtg 1740 ataagattcc tacattttgc acgtctcaag ttactcccag gagccgtgta tgttcagagc 1800 agcttttaca cgaagcaggt attcatgatg ggacaacccc aaaagcatgc cggctctttt 1860 cgactgtatc ctttactgat tggttagctc gttttttgaa tcgacctggg atcgaggatt 1920 tgctggaatg gagagacgac tatttcgaaa cgcaaggtcc gatcatgaag gatgtccggg 1980 acgcttcaat gtggcactcg cttcgtgggg aagatggaca gcggtttacc tcctctcccg 2040 gaaatctggt gtttgctttg aatgttgatt ggttcaatgc aaatggaaac aaggctgcgt 2100 cgaaacaaat atctcttgga actattgctt tagtttgttt gaatctgcca ccagagattc 2160 gttcatctca agagaacata tttctggtag ggctgacccc tggtccttcc gaaccggacg 2220 tagagcaaat gaaccatgtt ctggcaccct tagtccaaga actcaaggtc ctatggaagg 2280 gtgtcaagtt ttcttctacc gccaagtatc ctacacatgg tagagtaatc cgtgctatgc 2340 taggtcccat tatctgtgat cttcccgcaa tacgaaaggt tttggggatg gcaggccatt 2400 catcaaatcg acatgtttgc tcactttgtc tactcggaac agacgacata catctattac 2460 gaattcctaa agaaagaaga cgatgcacac atacacttcg cgaacaggct ttggcctgga 2520 gagatgcaaa aacgataaca cgccggaagg aaatctttgc tgaatacggc gtgcggtact 2580 ccgtcctatg ggaacttccg tattttgatc ctcttagcca cacaattgta gaaccgatgc 2640 acaacatctt tcttggtatc atcaagaacc atggcacgag ggcttttgga atgaagtccg 2700 acccacaatc tgccgaggat ccaaggaacg ggggtgctag cgatgatgaa gactcggaga 2760 cggagactgt agcatcagac gtgtatatga aggattgttc tgttgacgca gcttcagaaa 2820 acccagacac ggaattagga tcatcacctc cccctccaac tccagagtat ctatcagcag 2880 agagcaatga caaattgttg tttgcatttg aagaaatgaa cttagaaagc gatacttgct 2940 ctatgacatc acccgaggat tatcaagcaa agcaagagca gacagaaagc tttttagaag 3000 atgacgcatt cgatgaagac ctctctgact tggaagatgc ttatgaagat gaactcggtt 3060 ggagattttt tgatgattct cagaacctag ctacattgca agatatcaca agaaatataa 3120 accttccttc ctgggttgga cgcgtgccta gcactgtagg gacacgcaag ggcgggaaat 3180 tgaaggcaga tgaatgggtg attctcttcc aggtcatgat gatacctgcg ataatttctg 3240 tcttgcacaa agatcagggt gctacagact tcacggaaca caacttcgtc cgcaatgcgc 3300 tccatcttat aagtgtattg aacattgttc ggaggttgga 
actgaatgag cgcgacattg 3360 gatccctccg ataccatctg cgaagctatc gaaatgggta ttccaaactt tatagttcac 3420 ttcccgtcct gcctaatcat cacatggcgc ttcatcttcc tgaatgtatc caacggtttg 3480 ggcccgcgcc tttctggagt gcatggttgt ttgagagact taatggtaaa ttggctaaaa 3540 taccgaataa taatcatcct agtaagttca taatatcgac taaataagag attgtgtctg 3600 aactgacaaa agcacgtcag attcccgcga attgacaaga ctcaggaagt ttaccttaga 3660 gcagaacctg cggccagttg tcaaaatctt atgtgaatct cttccaggag aaattgctac 3720 gaggattttg aagttcttgg agcccagtgg tacaccacga ggcagtacct ccggatacga 3780 agctcgaaca ggcatccctg tacatgcgga tcctgtctat cactacgacg aaaaaaaagc 3840 cgaaaccctg gactacgaca cccacgagaa cctggcacgc cgtatgaaag agttgggatg 3900 cggggatgtt gttgtggcta gccgtatttt cagaacaccg cttcctccgc cccatcacat 3960 attactagga cggatggtga atcaaaaaaa aatggtaaag catgaagaca tatcctactc 4020 aacttgggag catcaccagg ggaatagcat tgttatgtat cagtctaaag atgtgcgccg 4080 tccaatggca tatcggtggg gccgaatcaa tgagatattt atcactaatc ctgggtcgca 4140 aggcaatcca aaggcgcagc tatggttcga agttctccga ttctccgatt tgagagcttc 4200 tgagaaagcc aagcatggtc ttgataattg gccaaatgcg agaatgactg ttgtgtacac 4260 ctcctccaaa aaaaaggatc tggtacgggt agatgagctt gttgcccaag gggcagcttg 4320 ggatacccca gaaggatgct tcggaattaa gcagccaacc acactgatta taaatttgag 4380 caagtttaat tgcgagcaaa cttcctgatt ttcataatct atgggccact tgatagactt 4440 ttgtttctga tcttttgtta gttgaaatga attttattgt ctttacgaga catccgaaag 4500 tgatacttga ttccaaccaa tgaagaacta cgtaaagttt taaagcctca cgtcttgcta 4560 ggtaagacac cctgttccta cttgcaacac ctcatgagtg ctttgtatca aacgctccaa 4620 tgcttgactc tcatagacac cacagattcg aagtacaatc attctacttt gaaattgcct 4680 agtagacgct acaattttga agtagcatga tcttacttcg aactgacata ttagacatgt 4740 aagttcgaat gtttgaaata gaatgatgct acttcgaacg tacatattag acaccacagg 4800 atgtaaggtt ctgggtagga tcatccttcc tactttgaac ttcagggatt ttaagatatc 4860 aagtacagat aatacgaatc gagatgtctc gagctagagc atcaagtgaa acaaaccttg 4920 aaaccaaacg tgggggcgga aggcacaatc gggcgaagag atgagcaaaa caccgaatct 4980 gaagagactc gtgcaataga caaactattg acagggagat gatctcgagc 
ctcgttgata 5040 aatgttttca acagtccata gtgaccggat caagctgaat tgagacagat caggcaatgg 5100 agcaggttct ggactgtgtc agtcttgccg atctcaactt cgcaagcagc cttcactcca 5160 tcaatagcgg ccttcaatat ggtacgtaaa gtagacaaaa tcgtcgaatc taccgtaatc 5220 gacttagact ccataccaga gtgcgaagcc ttcaactcca gatgatactc cgagtaaaat 5280 tcgcctcatc catggtaaaa agtggatcga gagtattccg agagcttcga gaagaagtca 5340 taatctcacg gggatgacac attagcctag gttataggag tataatatta tgcaaaaggt 5400 gaagggtaaa gaacgacttc aaaccatgca agcagcaaac atccggacat ggccgggaaa 5460 accaggaaaa tgcgagtcca atgacatcga ttgggtgacc gcgcctgtgc ctccaatata 5520 caggaaagtt gcaagcagcg atgagatgag aaaaacatta aaattaaaaa gaaaccctga 5580 tgagaaaggc gacaagacat aaaaaattat aagggcttgt tagtcaaaat gtgttttatc 5640 atcttcacaa gtcatctaag gtatactcga tttgtttgta catacaaccg aaaatgatga 5700 agctataact acgtaaaagt tcaaaggctg ctgtattgat tggtcagaaa ccctgcccca 5760 acttgcgaca atcataagtg ccttgtatcg aacactcaga tgctcaatca tcgtggacat 5820 cgtatattcg aagtaggatg aggatccggt ttcgacggga agtcatcgta aattacggac 5880 aagacaatgc tagccaaagg gtactggcgt atccagcagg tgaagaatga aactcacctt 5940 caaaccggaa acacagtaaa cataacatac cttcatacag tgtggactct cgcgtgtcac 6000 gcgtttttgc ttgacatgat cacagggagg accactcgtt aatgaggatc cttcatacaa 6060 tgacgtttct tgtgaaaaga gggaggtctt cacggagtag aggatgaggc cgacccacag 6120 agtagaggat gagcccgatg gaagatgtca ttttactcag gatatcaagg ttttggcaca 6180 agagcgaaca tgcgtttaat cgaacttggc atattacgcg cctttgtgag attatgtcag 6240 ctgaactact ggtcaataat gcgtagagcg tgagcctggt ccattggttt acgcttgacg 6300 ttttacgacg aactttgcga agtgcgtaga gtggcccttg ttattaatcc ctctgtacta 6360 taacaaaggt ttgtcatgtg tgttccatgt tcaaactttt ggtctcgaac caattacgcg 6420 tttgcgaact ctgacatgtg tagcttgacg tagagatcaa aaaaccttga agggtgcgct 6480 gagcgtgaat acgcactgtg aatacgcact gagtcacctg aaagcttatg gtctactgca 6540 tattttggat tactcaggga ggatgcgcag agtcgacttc ggaatttcgt agaatacata 6600 tttgatttaa gatgagcaag gttcaaaatg cttatgtgtt aaagcttagc tttcgacacc 6660 tttgccacac gtataaagcc gagtcttcgt cgtctttgtt cccaattcct caatccaatc 
6720 caaacttcat agtcccctct ctcgattttc tttccaagat tccccatcga aatcaaagag 6780 tacttcctca catctttttc gttcccaaat cccttgttcg aacttcgaac ttcaacattc 6840 atcgtacttt tttcgtttcg ttcctaaatc cctagtccga ctccctcact tttcacttag 6900 atggaagatt caaaatccct tactccagac gaggaatctc acatcccggg tactattata 6960 gaaatcaaat ctaaatcccc tcgatcggtt cgttgcgaga ccgagccgga cgaggctttg 7020 atcgggaaac acaagtactc gaaggtgtac gagtttgcca atccttgttc aagatgcgcc 7080 aatactggt 7089 // ID Gypsy-18_RO-I repbase; DNA; FNG; 6932 BP. XX AC AACW02000280; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-18_RO_; KW Gypsy-18_RO-LTR; Gypsy-18_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-6932 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000280; Positions 28990 22059. XX CC Positions [5085-5561] - Integrase core CC 'ATAC' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 177..1373 FT /product="Gypsy-18_RO-I_2p" FT /translation="MATLPTEQIVLAQTNNQSQGYRPRLVEFYGYEGEDFR FT HFQELLDSYLVLSNTHSDARKLAILRSQLRRAAKMYFEKVILKDRPQVTYD FT EAIELLKNYYITPELIQNYELEFNEMAQGEQEHPQIFLARLREAADLANIT FT SEAVIESRYRAGLLKEIKLFCIQSSARNFQDWVTHSEGWWNANRPRKIAMV FT DNPFIPRNVNNALIYHDDNYHSHHTINNHNIELIDAGERPTQIIPLNDLHP FT HGVLDTGITNHHAINGSQQLSTLEVTRNNYHAQTYPQQARKNKVNYHNQVN FT EQQNLVDLIQQTIRNELNNQQQQPPARNYNRRNRYQDNYQNYNQNGNLGGH FT SNNGYNNNGYNNNYPCNNYNNQGWNKNSYSHPPYRSQHPQQSVQHLQPQQS FT NQSKN" FT CDS 4704..6050 FT /product="Gypsy-18_RO-I_3p" FT /translation="MEQRKINQMVIYLETLQIPEDETKSLKKFLLRESSNF FT TLYKNVLYRYNTENGTIRKVLNKQEAQEIMNAYHQHPLGGHLAYNNTLHKI FT ATRYYWENMAKDIKDFVQTCPRCQVFGKKSLREELYPVPVSIKPFDRIAID FT VKHVQTSRSGHRYIVAAIDYLTKYVEARPLRFQSSSEIALFLYEDVICRHG FT CPTILVTDNGKPFLSGLVRKVCQAYSIIHKTTTPYNPQSNGLIERFNRTLG FT QILQKRSDTEKEDWDIYLPATLFAYRSIKQATTKQFPFFLMYGYEPKTPFD FT IDSRIFEKNSHKFEAVLWHRTSHQLHNLNWIREQAVQAIKQTQTAQKKAIE FT NKILEERKELKPPFRLGDMVLLYKDYMSTSWSGKLQDKWEGPFIIQNNLGK FT GTYHIKSMDPNDTRIRRVHGNRLKSYAIPNTMWNTEGERSISSLILDKETN FT ELVQ" FT CDS 1646..4468 FT /product="Gypsy-18_RO-I_1p" FT /translation="MIATQPMVTPPPIATQPYIAAQPIAAATQPTVTLQPE FT AMETDQRTIQKNTKSKTLRKKTKAPPLEYDIVSDVMNQKADISFKNLIVAA FT PALGRKLAAASRPKRIPVEVGNHEETMAMIEDEEINTTAVYSKVTIGEKDV FT KALIDCGAAKTCMSKALANTLGLYIDAASESVFTLGNGTKQPALGVIYDVP FT ISVKRGMVIPCTVEVLPSCPNPLIIGNNWLNRAKAKIDFNNSSLKVSYKNK FT KAELEITFLRKTSSVPKISSYTQGYEKPVSLTNSEQAKHVHFEDELQDEDS FT SDQDDESTEEEDSAEEELEDEQTETLLLLENEKEDDIQAINQGTDILLEAP FT QNGLTVPSSTSKTIWIKKPKNEQRNLIYNLEITNPKVINSMGCFDSCLNLI FT TNRKSLEIRIFNRSPTDMSFDPGEEIGIIEKISPESDTIIQAYELDSSPQL FT CMIETTEFALENEMENNNKLLESEKYKKLDIGMLDAETMRNLRKLLKKYEN FT IFDWDNNTIGRTNLLKHKITIKEDTMPISHRPYRISPLEAEHLQKELDKYC FT KLGVIEPSNSPWAAPVILVKKKNGEYRMVIDYRKLNAVTKKDAYPLPRIDD FT LLDTLGKAKVFSALDMRAGFHQVPMEEDSKELTAFTTKYGTYHYNTLPMGL FT VNSPATFQRLIDLCFRPLINQCLVAYIDDLNVYSLNKHEHLQHLEQVFQCV FT QIANLKLNPEKCFFFKDHLKFLGYIVTKEGLQTDPSKIQKIVEYPQPKTIK FT 
QVRGFLGIASYYRRFIKNFAAIARPLHDQTKTTKKVPWTNETTESFELLKR FT ALTSTPILSRPDFNKPFILITDASKLGLGAILTQLDDNGYEHPVIYASRGL FT KSTESNYAPTKLECLAVIWAVKLFRPYVLGKKFTIITDHSALNGLLKTPNP FT TGIIARWITILSEYEFEIKYRPGRVNESADFLSRLGY" XX SQ Sequence 6932 BP; 2681 A; 1365 C; 1201 G; 1685 T; 0 other; tttggtggtc actacgaggg tcaaaatcaa caaatatacg aattaattga acaatcaaat 60 taaacttatc aaaaattcaa aaaaaaaaaa ctatacatca agaatctcaa aacctttatt 120 caaacaatat ttttccattc gaaatctcaa acgactcagt tagttacaat caagaaatgg 180 ctactttacc tacagaacaa atcgtgttgg cacaaacaaa taaccagagt caaggctatc 240 gaccaagatt ggttgaattc tatggttatg aaggcgaaga ctttcgtcat tttcaggaac 300 tcttagattc gtaccttgtg cttagtaata cacacagcga tgctcgcaag cttgctattc 360 ttagatctca attacgaaga gccgccaaaa tgtactttga aaaggttatt ctcaaggatc 420 gtcctcaagt aacctatgac gaagcaattg agcttttaaa gaattattat attacacctg 480 aacttattca aaattacgag ttggaattta acgaaatggc tcaaggagaa caagaacatc 540 ctcaaatctt tctggcacga ttacgagaag ctgcagatct tgccaatatc actagtgaag 600 cagtgataga aagtcgttat cgtgcagggc tcttgaagga gatcaaatta ttctgcattc 660 aaagtagtgc ccgtaatttt caagactggg ttactcattc agaaggatgg tggaatgcaa 720 atcgcccacg caagattgcc atggtagata acccttttat tcctcgaaat gtgaataatg 780 ccttgatata tcacgatgac aattatcatt cgcatcacac gataaataat cacaatattg 840 aattaattga tgctggagaa aggcccactc aaatcattcc tcttaatgat ttacatcctc 900 atggagtact ggacactggt atcactaatc atcatgctat caatggatca cagcaattat 960 ctacattaga agttactcga aacaattacc atgcccaaac ataccctcaa caagcccgaa 1020 aaaataaggt aaattatcat aatcaagtta acgaacaaca aaatctagta gatctcattc 1080 aacaaactat tcgcaatgaa ttgaacaatc aacaacaaca acctcctgca agaaattata 1140 atcgccgtaa tcgctaccaa gataattacc aaaattataa ccaaaatgga aatttgggag 1200 ggcatagcaa caatggctat aataataatg gctataataa taactatcca tgcaataatt 1260 acaacaatca aggatggaac aagaatagtt atagccatcc tccctatcgt tcacaacatc 1320 cccaacagtc agtgcaacat ctacaacctc aacaatccaa tcaatcaaaa aactaaaggg 1380 ttcggttgct tttgaaaaac aaagaaatgg tcaatcgaat aaccaaaacc ttaacaaatc 1440 cattaataac catccacaca atctcaatgc actactcacc 
caaaacgaaa tttacccaac 1500 agattacact catgacttat tcgctgcagt aagacccgac tttccacctg aggtcactac 1560 agcaaaccct tatgaaaaac caatcaaaag tactcgtgaa agaggaagga gaacaactac 1620 aaaagctaaa tctaaagaga ataacatgat tgccacacaa cctatggtta ctccacctcc 1680 gatagctacg caaccatata ttgccgcaca accaattgct gctgcgacgc aacctaccgt 1740 taccttgcaa cctgaggcga tggaaactga ccaaagaacc attcaaaaaa ataccaaatc 1800 taaaacgctt agaaaaaaga ccaaagcccc gccattagaa tatgacatcg tatctgatgt 1860 aatgaaccaa aaagctgata tctcatttaa aaatctgata gtcgctgctc cggctttggg 1920 aagaaaatta gccgctgcta gtcgtcccaa acggatacca gtagaggttg gcaatcatga 1980 agaaacaatg gcgatgatcg aagatgaaga gataaataca accgctgtat actctaaagt 2040 cactattgga gaaaaggatg tcaaggcctt aatcgactgt ggtgctgcaa agacatgcat 2100 gtccaaagca cttgctaaca cacttggact gtatatagat gctgcgtcag aaagtgtatt 2160 cactctcggt aatggaacaa aacaaccagc tttgggagtt atctacgatg ttcctatatc 2220 agttaaacgc ggtatggtga taccttgcac ggttgaagtg ttaccatctt gcccaaaccc 2280 acttattatt ggaaacaact ggttaaatag agcaaaagca aaaatcgatt ttaacaattc 2340 atcattaaag gtttcataca aaaacaaaaa ggctgaattg gaaattacat tcttaagaaa 2400 aacttcatct gtaccaaaga tatcaagcta tacacagggc tatgagaaac ccgtcagcct 2460 aacaaattca gaacaagcta aacatgtcca ctttgaagat gaattacaag atgaagatag 2520 ctcagatcaa gatgatgaaa gtacagaaga agaagattca gcagaagaag aattagaaga 2580 cgaacagaca gagactctct tattattaga aaacgaaaag gaagatgaca tacaagcgat 2640 aaatcaagga actgatatac tattggaagc ccctcaaaat ggattaacag taccatcgag 2700 tacttcaaag acaatatgga ttaagaaacc aaaaaatgaa caaagaaatt taatttacaa 2760 cctggagata acaaacccta aagtaataaa ttctatggga tgctttgatt cttgtttaaa 2820 cctgattaca aaccggaaaa gtctggaaat aagaatattc aatcgctcac ctacagatat 2880 gtcatttgac ccaggcgaag aaattggaat cattgaaaaa attagtccag aaagtgatac 2940 cataattcaa gcttatgaac ttgattctag tccacaatta tgtatgatag aaacgacaga 3000 atttgctttg gaaaatgaaa tggaaaacaa caataaatta ttggaatcag aaaaatacaa 3060 gaaactcgat attgggatgt tggatgcaga gactatgaga aatctacgga aactattaaa 3120 gaaatacgag aacattttcg actgggacaa caatactatc gggcgtacaa 
acctattaaa 3180 gcacaaaata acgataaagg aagatactat gccgataagt catagaccat accgtatcag 3240 tccactcgaa gcagaacatc ttcaaaagga actcgataaa tactgtaaac taggagttat 3300 cgaaccctca aacagtccat gggctgctcc tgttatatta gtcaagaaga aaaacgggga 3360 ataccgaatg gtaatagact atagaaaact caacgcagtt acgaaaaagg atgcataccc 3420 cttacctcgt atcgatgatc tgttagatac gttaggaaaa gcaaaagtat tctcagcctt 3480 agatatgcgt gcaggatttc atcaagtacc gatggaagaa gacagtaaag aactaactgc 3540 attcaccaca aaatacggca cataccacta taatacctta ccaatgggat tagtcaattc 3600 acctgcaact tttcaacgtt tgattgattt atgtttcagg ccattgataa accagtgcct 3660 agtcgcttat atcgatgatc tcaatgtata ttctcttaac aaacatgaac atcttcaaca 3720 tttggaacaa gtatttcaat gcgtacaaat agctaacctc aaattgaatc cagagaaatg 3780 cttctttttc aaggatcacc tcaagtttct tggatatatt gtcacaaaag aaggactgca 3840 aactgaccca agcaagattc aaaagatagt cgaatatcct caaccaaaaa caattaaaca 3900 ggtccgagga tttctgggaa ttgcttctta ttatagacga tttatcaaaa actttgccgc 3960 tatagcaagg cctttacatg atcagacaaa aacaacaaag aaagtaccat ggacaaacga 4020 aacgacggaa tcattcgagt tgctaaaaag agcacttact tctacaccga ttttatcaag 4080 acctgatttc aacaaaccat tcatcttaat aaccgatgca tcaaagttag gtttaggagc 4140 aattctaact caattggatg ataatggtta tgaacatcct gttatttacg caagtcgagg 4200 actcaagtca accgaatcaa attatgcacc taccaaatta gaatgcctgg ctgtcatatg 4260 ggcagtaaaa ttgttccgcc cctatgtact tgggaaaaag tttacgataa ttaccgatca 4320 ttcagccctg aatggcttac taaaaacacc aaaccctact ggaataatcg caagatggat 4380 tactattctg tcagaatacg agtttgaaat caaatatcga ccaggaagag taaacgaaag 4440 tgccgatttc ttatctcgac ttggttacta aaagattcaa caactacagc tactttacta 4500 ttaatatatt accaatctta tattcttatc aaactacatg gaggcaggga ggggtagttg 4560 aacaaaaaac ccagaaaact caacaaaaac aagacaaaaa taggaaaaac tcaaaacaaa 4620 acaaaaggac aataataaac aaaaactcgt attaaaaagg caaaataaca aaacaaatta 4680 catcattcaa caaaaagaca aaaatggaac aacgcaaaat taatcagatg gtgatatacc 4740 tggagacctt acaaatccca gaagacgaga caaaaagttt aaagaaattc cttttaaggg 4800 aatcttcaaa tttcaccttg tacaaaaatg tcttgtacag atacaataca gaaaacggga 
4860 ccattcggaa agtactcaat aaacaagaag ctcaagaaat aatgaatgcc tatcatcaac 4920 atcctttagg tggacatcta gcatataaca atactttaca caaaattgca actagatact 4980 attgggagaa catggctaaa gatatcaaag attttgtgca aacatgtcct aggtgtcaag 5040 ttttcggaaa aaagtcgcta agagaagaac tatatccagt tcccgtttcc atcaaaccat 5100 tcgatcgaat cgctattgat gtaaaacacg tacaaacatc gagatcggga catcgatata 5160 ttgtcgcagc cattgactat ctcacaaaat atgtcgaagc aagaccatta cgtttccaat 5220 cgtcctcaga aatagcttta ttcttatatg aagacgtcat ttgtagacat ggttgtccaa 5280 cgatacttgt cactgataat ggaaaaccct ttttgagcgg tttggtacga aaagtctgtc 5340 aagcttactc aattattcac aagaccacaa caccgtacaa cccacagagc aacggattga 5400 ttgaacgttt taatcggaca ctcggtcaaa tcctacagaa aagatccgat actgaaaaag 5460 aagattggga tatctacttg ccagccacat tatttgcata tcgatcaatt aagcaggcaa 5520 caacaaagca gtttccattc tttctaatgt atggatatga acctaaaaca cctttcgaca 5580 tagatagccg gatattcgaa aagaattctc acaaatttga agctgtatta tggcacagaa 5640 cgtcccacca gcttcataat ctaaattgga tacgtgaaca agcagtacaa gctatcaagc 5700 aaactcaaac agctcagaaa aaagcaatag aaaataaaat cctggaagaa cgaaaagaat 5760 taaagccacc ttttaggctt ggggatatgg tactcttata caaagactac atgtccactt 5820 catggtcggg aaagctccaa gacaaatggg aaggcccctt tatcattcaa aataatctgg 5880 gaaaagggac ttatcatata aaaagtatgg acccaaatga cactagaata agaagagttc 5940 atggtaatag gctgaaatcg tacgcaatac caaacacaat gtggaacaca gagggcgaaa 6000 ggagtatttc aagcctaata ctagataaag aaacgaacga actagttcag taaacaaaaa 6060 aaaaaaaaaa aaaaaacaaa caaataaaca aaaaattttt atataaatat tcgagtatat 6120 acaaacactc atcaatcaac aatcaataac aagcagtaac aatggatcaa acccaatacc 6180 ttaaggaaca agccaaaatc acagctagca agatggtaca agacattact gaaggaaatg 6240 gaggacatgc atccaaattc attaaagaac ttttacggga ctatttcaag ccactgtata 6300 ttgcccgagg aatgtatggt cttaaagatg ctttggagaa ataccttgac tacatggaaa 6360 acgaagaagg ccttgccgga attgaatatt ttgttggtga aaaggcaact aaacgtgaag 6420 actatattaa caaggtagcc caaaaggtac aagcagaaac cgatgacgag gaaatccaag 6480 agccaggttc caacgaaaga actctccgaa actggctgat gaccgagtca tctacacttg 6540 
aacaagaaca agatgaaagc aatccacctt ttgaattgtc ttgggaaatg ttgaatgaga 6600 tgtcaactat ttctttgttt gaaaaagcca taaaaagtga ctatgctgta ttagtagcct 6660 accacaaaga agaagctaaa attgcaaaac tcaaggaaca agtggttgag cgcctggcgg 6720 aaaacattac tgagtggagc catatcaagt tcgaccaacc aagaaataat gaacgcggcg 6780 tacagcttta tgttggaaac cgaaacgaag cgttgagagg actgatgaag aattatgatg 6840 ttcacaaagc gcttcacagg aagattagga agatagatac ttggggtaaa gtttctttgc 6900 tcggggacga acaacccgtc ggtacgggca tc 6932 // ID Helitron-1_AN repbase; DNA; FNG; 6809 BP. XX AC . XX DT 09-DEC-2003 (Rel. 8.11, Created) DT 09-DEC-2003 (Rel. 8.11, Last updated, Version 1) XX DE Autonomous Helitron rolling-circle DNA transposon - a consensus DE sequence. XX KW Helitron; DNA transposon; Transposable Element; KW HELITRON superfamily; Helitron-1_AN; Helitron-N1_AN; KW rolling-circle DNA transposon. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-6809 RA Kapitonov V.V. and Jurka J.; RT "Helitron-1_AN, autonomous family of Helitrons in the Aspergillus RT nidulans genome."; RL Repbase Reports 3(11), 190-190 (2003). XX DR [1] (Consensus) XX CC Autonomous Helitron rolling-circle DNA transposon. CC Helitrons in the A. nidulans genome do not contain a 3' CC palindromic CC element present in Helitrons from other eukaryotes. CC The genome harbors several nonautonomous copies (97% identity). CC inserted in the TA target site, no duplications. CC The 1439-aa Rep/Helicase Helitron-1_AN is encoded by a CC single ORF (pos. 987-5306). 
XX FH Key Location/Qualifiers FT CDS 987..5303 FT /product="Helitron-1_ANp" FT /translation="MRPLNIGRMDLECPDCHGLHWKAERDKGTSIAAPKFG FT KCCGGGNNFIPPVEVPTFLQRLFNGSDHDNGRHFRQNARLYNCAFSMTSFN FT AGEDPRLKGQHGPFQIQGQLVHFLGPLLPDADRPPAFAQLWIYDRLTDAAR FT DRNRALATAVRCTRFSDLRPGIVGELTEWFELHNRFSQQFMSATEQLFANE FT ARGTPAELLLGPGINLVAVEGTDKRRYNLPREGEQVAALLIDPTVNDLDHG FT TFREVILQLRHPVNGSGLKRIDPSHAGYLPMQYPLFFPYGDDGWHWGCRRL FT DGLRLSARMYHAYRVHIRRREFSPWHYGGRLWQQYLVDAWGTIEQAKLEWI FT RHNQTTIRAELYSGLTDALAAADGDVLVANNTGQRVILPSNVVGTPRYMQQ FT LFQDAMAICQFYGPPSLFITFTANPAWDEVTRELRPGETWEDRPDIVSRVF FT NILRAEMVDELCKKKLFGVAPGRFFTIEYQKRGLPHMHLVLFLEERERFLD FT AAHIDEMVSAELPDPREDLELYKLVKKHMIHAPCGPVYNSRAPCCDKHSDS FT NMIYCTKRFPKAEQYETQPIEEGYPLYRRRADPRGAYRIKAKNNDMVRIDN FT TWVVPYNPYLLKRFRSHINVEVCRGVDVIKYITKYIYKGPDRASMRSKVAD FT EVDLYLDARYVGASEAVWRLLRFPLHQEWPPVTALHVHEPARHLVYYNSNA FT GMRELEDCIDGGKSMLMGFFEYNAHAANPANAALALNRYLYAKMPQFFTWD FT KADRIWRPRTRNRFAIGRMYHCSPNAGERFYIRLLLTVCVGPVSFEDLRTH FT DGILYPTFKAACNARGLLKDDREWHHAFEESVGSAIGAQLRTLFVVALTSG FT TLNDPPCLWEEFKCRICSDLEHYEIRRMDNPPDIEDRHIDYGLFIIARMLA FT EHGERETLDRYGLPLWTAAWGRLEPQTDVLVPFIPPVDLARRVDALIPSLN FT IDQRRHFDTVSAAMADRSGECFYLQGAGGTGKTYLYTALYYHLRAQGKTVL FT CIASSGIASLLLPYGRTSHSTFGIPLALHEESTCAVTLRSTRARVLAGVDL FT IIWDEAPMQHRHAFEAVDRLLQDILKVKQLFGGISVLTGGDWQQCLPVVPK FT APRAGIISATLRRSYIWPRLRAILRLTQNMRLPSVGINRLFSQLLARMSVD FT NTMHGTLELPDYVMDSASMSVEELCERVFPAADMTHCHTADFQASDPDFFA FT GRAILSMRNSALVEFNDRITDSMRSQESMRYSADEALTDNVAEGVEEITHE FT FLQSVDLPGLPPARLRLKVGMPIMLLRNLRATEGLCNGTRMQIVELCRYTI FT RARILTGDFRGSVHLIPRITLYSKPGDLHYVLSRTQFPVRPCFAITTNKSQ FT GQSLQQVGVDLRVPAFSHGQLYVAMSRVTDVRRLSVLLPPGVRTTNNVVYP FT EVLQDIASLDDVPDWDDGMVTDEAA" XX SQ Sequence 6809 BP; 1426 A; 1807 C; 1820 G; 1756 T; 0 other; ttaacattac tatacgaact ttaccctctg actccaccgc agtcacgtgt tatccactta 60 attcgcagaa tgcaatatac acccatgatg tattctacac ccacccgtca cttattcttc 120 ttcgcgtccg ttacggaccg ctacgggccg ccttctgtca ataaacacct cgctgccgtg 180 ttcttcccaa ttccacatat ctgtatagtt catattgcac tctcctacta ttgcatatgc 240 
ctagaaaacg ttcggcgcct gtttccggtc accaccgatg taatatctgt cggcgtgact 300 ggccggattt cgaatttttc aggccgcttg atccccccgg gcgtggctct cctccatgga 360 agtcttgctt cacctgttga cgccagagcc agcttcggcg tggtccgcgt gttagtgttc 420 ctttacagcc gttgagcgcg accaatctca atatacgggc caatgagact cctgccagcc 480 cggtggttac tgtgtttgag gacaatttct tggatccgcc tccctcagtt ctccattcag 540 gtcctccgcc tccgccgccg ccatctagcc cctcgcccgg attggttgca gtccgttcca 600 ggccgcagag atcacggcgc gctgtagagg gctatagata ctgctcgcgc tgcttgcgct 660 cgaagcctga ggatgaattt tctgccacac ggagatctca ggcgacacgc tgccatacct 720 gccgggtttg ttatcctcta tgcatgtgtt tctgtccgct aacccctgtg ttgccttagg 780 gtgatgatta ccagcagtct gcttcaatcc ttgcaaacac tccacctccg gcggacctaa 840 cagatataca taatgccggt tatgatgagg attttcaagc cggtgtctcg gatgatgagg 900 atgagccacc cgctgcagcc ccccctctgc cgatttctgt tgggaacacg tccgtttacc 960 ggggctaccg gtctcagctt caccgtatga ggccgctgaa tataggccgg atggacctag 1020 agtgccctga ttgtcatggg ctacattgga aggctgaacg ggataaaggg acgtccattg 1080 ctgccccgaa atttggcaag tgttgcggcg ggggaaacaa ttttattcca cccgttgagg 1140 tgcctacgtt cctacaacgg ctgttcaacg gcagtgatca tgataatgga aggcatttcc 1200 gccagaatgc ccgcctgtac aactgtgcat tcagcatgac aagtttcaat gcaggtgagg 1260 atcctcggtt gaagggccag cacggcccgt ttcagataca gggccagtta gtacacttcc 1320 ttggccccct gctgccggat gcagacaggc ctcctgcctt tgctcaacta tggatatacg 1380 atcgtttgac ggatgcggcg cgggatagaa accgcgcatt ggcaactgct gtccgttgca 1440 cgcgtttttc ggacctaaga ccaggtattg ttggtgagct aacagagtgg tttgagctgc 1500 ataatcgttt ttcccagcag tttatgtccg ctacggagca gctatttgcc aacgaggccc 1560 ggggcacgcc tgctgagttg ttgcttggac ctggaattaa tttggtggcc gtagaaggta 1620 cagataagcg ccgctataat ctgccgcggg agggagagca ggtggctgca ctgctgattg 1680 atccaaccgt aaatgatctg gaccatggca cgtttcgcga agtcatcttg cagttgcgcc 1740 acccagttaa tggcagcggg ttaaagcgca ttgatccaag ccatgctggc tatctgccaa 1800 tgcaataccc gctgttcttt ccatatggag atgatggatg gcactggggc tgccgccgtc 1860 ttgacgggtt acgactatcg gcccgtatgt atcatgccta ccgggtccat atccgccgcc 1920 gtgagttcag cccctggcat 
tacgggggcc ggctctggca gcaatacttg gttgatgcgt 1980 ggggtacaat tgagcaggcc aaattagaat ggatccggca taaccagacc accatccgcg 2040 cagagctgta ttccggtttg acggatgctc ttgctgctgc cgacggtgat gttttggttg 2100 caaataacac tggccagcgg gttatcttgc cctctaatgt ggttggcacg ccccggtata 2160 tgcagcagct gttccaggat gcaatggcaa tctgccagtt ctatggtcca ccatctctat 2220 ttatcacatt cactgccaat ccggcgtggg atgaggtcac ccgagaacta cggcccggcg 2280 agacgtggga ggatcggccg gatatagtgt cgcgtgtttt caatattttg cgggccgaaa 2340 tggttgatga gctctgcaaa aagaaattat ttggagtcgc gccaggccgt ttctttacta 2400 ttgagtacca gaaacgtggc ttgccccata tgcacttggt tctctttcta gaggagcgcg 2460 agcgctttct agatgcagct catattgatg agatggtctc tgctgagttg ccagacccgc 2520 gagaggatct ggaattatat aagttggtaa agaagcacat gatccacgcg ccttgcggac 2580 cggtatacaa ttcgagagcc ccctgttgtg ataaacactc tgattcgaac atgatctatt 2640 gcacgaaacg gttccccaag gctgagcagt atgagaccca acctatcgag gagggctatc 2700 ccctatatcg tcgacgggcg gatccaaggg gcgcataccg gatcaaagcc aagaacaatg 2760 atatggtccg catcgacaac acgtgggttg tgccttacaa cccgtatctg ctcaagcgct 2820 tccgttctca catcaatgtg gaggtctgcc ggggtgttga cgtgattaaa tatatcacta 2880 aatatatcta caagggtccg gaccgcgcgt cgatgagatc gaaggttgcg gacgaggttg 2940 atctctatct agatgcccgt tatgtcggcg catccgaagc tgtatggcgc ctgctccgct 3000 tcccactaca ccaggagtgg ccacctgtaa ccgcgctgca tgtccatgag ccggcccgtc 3060 atttagtcta ctacaacagc aatgctggaa tgagagagtt ggaggactgt atagatggcg 3120 gaaagtcaat gttaatggga ttctttgaat acaatgccca tgctgctaat ccggcaaatg 3180 ccgcgctggc attgaatcgc tacctgtacg cgaagatgcc tcagttcttt acttgggata 3240 aagcggaccg gatttggcgt ccgcgaaccc gaaatcgatt tgccattggc cgtatgtatc 3300 actgcagccc aaatgcaggt gagcgttttt atatccggct cctgctgaca gtctgcgtgg 3360 gacctgtatc attcgaagat ctgcggacgc atgatggcat tctctatccc actttcaagg 3420 ccgcttgcaa tgctcgcggg ctgttaaagg atgaccgcga gtggcatcat gcctttgaag 3480 agtccgttgg atccgctatc ggcgcgcagt tacggacatt gttcgtggtg gctctgacca 3540 gcggtacatt gaatgacccg ccatgcctgt gggaggagtt taagtgccgg atttgcagcg 3600 atttggagca ctatgagatc cgacggatgg 
ataatccgcc tgatattgag gaccgtcata 3660 ttgattacgg cctcttcata atcgcgcgta tgttggcgga gcatggcgaa cgagagacgc 3720 tggataggta tggattgcct ctatggacgg ccgcatgggg ccgtctggag ccgcaaacgg 3780 acgtgttggt gccttttatt cctccagtag atctggctcg gcgggtggat gctctgattc 3840 cctcccttaa tattgatcag cgccgtcatt ttgatacagt gtcagcggct atggctgacc 3900 ggtccggtga atgtttctat ttgcagggcg ccgggggaac aggtaaaact tacctttata 3960 cagctctata ttaccacctg cgagcccagg gcaagacagt cctctgtatt gcctcatccg 4020 gtattgcttc gctgctgctc ccttacggac gaacatccca ttctactttt ggcatcccgc 4080 tggctctgca tgaggaatca acctgcgctg taaccctccg cagcacgcga gcacgtgtgt 4140 tagcgggtgt ggaccttata atctgggatg aggcccctat gcaacatcga catgcttttg 4200 aggctgttga tcgtctcctg caggatatcc ttaaggttaa acagctattt ggaggcatat 4260 ctgtccttac aggtggtgat tggcagcagt gtcttcctgt tgtgcctaag gcacctcgcg 4320 ccggcattat atccgccacc ctgcgcagat cctatatctg gccaaggtta cgggctattt 4380 tacggctcac ccagaatatg cgtctcccgt cagttggcat taatcgccta ttctcccagt 4440 tactcgctcg catgtctgtt gataatacaa tgcatggtac tttggagctg ccggactatg 4500 taatggatag cgcgtccatg tccgttgagg aactctgcga gcgggtcttt ccggcagcag 4560 atatgaccca ttgtcacact gccgactttc aggcgtctga cccggacttt tttgcgggcc 4620 gtgcaatact ctcaatgcga aactctgcct tggttgagtt caatgaccgt attacggact 4680 ccatgcgcag tcaggagtct atgcggtatt ccgctgacga ggccttaacg gacaacgttg 4740 ctgaaggggt ggaggaaatc acccatgagt tcctgcaatc tgtggatctg ccaggtcttc 4800 ccccagcaag attacgattg aaggttggta tgccaatcat gttgctgcgg aatttacggg 4860 ctacagaggg tctttgcaat ggtacgcgga tgcagattgt ggagttatgc cgctatacaa 4920 tccgcgcgcg catcttgacg ggtgacttca gaggctcagt gcatctcatc ccccggatta 4980 ccctgtattc aaagcctggc gatctgcatt atgtgctgtc acgaacacag tttccggtcc 5040 gtccatgctt tgcaatcacc acaaataagt ctcagggtca gtctttgcag caggtaggtg 5100 tggatttgcg ggtccctgct ttctcccatg gtcagttata tgtggcaatg tcgcgggtta 5160 cagatgtgcg gcgacttagc gtcttgctgc cgccaggtgt tcggaccact aataatgttg 5220 tttaccccga ggtcttgcag gatattgcaa gcttggatga cgtgccagat tgggatgatg 5280 gtatggttac ggacgaggca gcctaattat gaccagaata 
tatagtgttt acaggtatgc 5340 atatacgtgt ctgattcttt tgcggtctga atttactttg ctttcatagt atccggtccc 5400 ggtgaaaact atttacgggt gattatgacc tgccaagagt cagaatctcc tcatccagtg 5460 aggagagctc agactcatct tcatctcctc ctccctctcc ttcctcctct tcctctccac 5520 caaacccctg gaaaccatca tcatcggcgt actcggcggc taccaggagc agcgtctggg 5580 cgagagcttc ctgggcagca gcggcggcag atgcctgtcg ggccgccacg gccagcatgg 5640 tgtgtaagcg ctgggcgcgg cgcgggaccc tatctgcatg ttcacgggtg tcatgactgg 5700 caaaatgaac ccccttgcgg gcagaggtgg ccggggcttt gcctttgcag cagcgggcat 5760 tctcttcctc ctcttcatcc tcctcctcct cctccgcctc ctcatcttcc tcttcttcct 5820 ctccagcaac aagcgcagca agggcggcag cacgcttgcg ggcgtgcttg ggggtagtag 5880 ttgcaagatt tttgactaat attacctgtc agcagttatt ctttcggtcc ggatgataga 5940 ccatttactg tgaaagtcac actgcttgcc ctcactgcca aagtagcagt tagcacaagc 6000 cccctggaat cgcccagcta ccactataca accttcaaac agtccacttt ccttaatgtg 6060 gcagtgggtg catggcttgg gcgcatgaga accaaccatc taccccagaa gggcctcaga 6120 attgctgtta cggacgtaac tgaaggtttt gccgtcccta atgtgctttt gccgaatctg 6180 aggctcgcag acagccggca tggcgagcag ggctgcctgg gcagcggatg gtttaggata 6240 tttggcggcc cagtgccact cggcgaccag cgctgcaggc gggatttcgg ccgtggatga 6300 agaagaggat gtggtggagg acattgtggt tggatggatg tataatgaaa actggctatc 6360 tacaacttac gggccgtcaa atgctttata tagacccctt gcagcccatt gttgtagttt 6420 acagcacttt gtgtttgtat tctggatatc cagacaataa tgtattatct taggtccttt 6480 gacggtccgt cgattctttt tctggcaggt tatcgattcc cattgacgtt cagagtacaa 6540 tcaatgtgtt ggtccaaaat caattgacgg tccgtcagac ctccgtagca gctctatcat 6600 tggtcgaaaa tttatttacg gatcgggata ccgtttctcc ggtccgccga aggctaccaa 6660 agtcccgcag ggcctgccgg gaggcccccc cgggagggtg ccccgcaggg caagctgagt 6720 cacccggagc gtctcggcca tcgaagcgct cccggagcga agcgcaggtc agccagcgta 6780 ggtaggccag cgtagggtca ggaagcttg 6809 // ID Gypsy-82_MLP-I repbase; DNA; FNG; 10260 BP. XX AC AECX01001127; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 
16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-82_MLP_; KW Gypsy-82_MLP-LTR; Gypsy-82_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-10260 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001127; Positions 44906 34647. XX CC Positions [8204-8725] - Integrase core CC LTRs are 98% similar to each other. XX FH Key Location/Qualifiers FT CDS 2055..4853 FT /product="Gypsy-82_MLP-I_3p" FT /translation="MSSRASQPPSRASSASTRGSTRSKRSVEPTSRGTRGT FT PMGELFQPLPPSGKPNPTLVDNFAKENIINKKILFLTTEDSAATGRLLFPT FT ELSGPSACSSSPTPACPIPSLPASSEPPSTPVPGSSTTPTHSGYQQQQHAS FT SSSSCTVLPADKGGRGGPTSPSNQQSQARLGRSESCRQQLELQASIGDGQG FT GPQGTGRLPWRQTLPFESRTDCRLQPLPDQSSDHGGIRSTETGCRSIESAP FT QTSSRGGDRRVSRDFIQPIGHDADGKKDETRQEERFRQEREREGEGAVSIL FT PPPTSSFHSHPSVLYTETPFTGPPLDNSPNLLSPSLKSCKSEEDQLSQGNS FT QRLFNQSYLVSNLNSQISSSVVSASNSQLDLDELITRKSYAIYEKVENIKP FT TLNLLKNQCEQSEQVRLQEFKLSCQLQKRSYDDIMKNLLELKEELVEIKTL FT SYEFKNQEQHQTAYEDVKVKFAELKEKFSEIQEIKKIILEIKRDSQDEIKD FT KGKGKEKEHIPISEIKEASASTSNTSQGVKQGVTEIQNSEVTPMYSLNKTQ FT ASVLQIREKLNTSLEKATLSASQQNMIKQITNLASQIATRISDNEKKNLLK FT FSGYDKNFQDTINKISQQRNKEKQQVIETSDTIVSDLGEIKNTIKQLDIKL FT DKQLRLPIKDPYNKIEELLTKFGKHETHVSSTIDEIKEDTNITKIRENLNE FT QQQQFESSMMKSYHSLQENIKENHEQLTKELLKQQDIQMKSFTEQMLKQQV FT ILNQKILEKNVEIQQQLLQMDDNANTARLSIIQELKPSIQNNNNNGPTPIN FT QINPSITYRHLHTPFEENPRVDPSLRGEARFRQSTAEPRERTVSYVEPVPT FT HRDFHSPAIRERTIMREETIPPRERSFMREDTVQTGITESAREDNVTKALT FT KLWPTGKDWKKSAETVNIIIMITLTGVMI" FT CDS join(4871..6946,6950..9361) FT /product="Gypsy-82_MLP-I_1p" FT /translation="MDPLVVVSHLATAFTGTAAQWYMDKRKTRGSMTWNEW FT KEEIHKRFGNNVWKGNMEELFMKDKFDANTHTDPTAWSLKQKKRISAFDPD FT 
SSEDRIVNKILNKVDGDARNAIKSQLSTPYEWDMFLMVFKDIFENTTIITK FT KIYLNRPRRPFIENRATFRDDKLQGPSSTRVQPSTSERKPSRACPNCGSKD FT PKHQWRGCKDKKINVIEEDTEESDNMNDEFDFQGGQLSGDSSSDSEGENSP FT NQHQQNVCMIDTHEDNEKSINILQAEADVPHSWQDDMQMGNTIDARLLKSK FT PAEGMAHTLGHHAMARALVNNCKVPILLDSGASCSIVGKHFLSEIIPDWQE FT RIMPSSHVKFSGVGSKLHALGVISLPVIFPHVKQSIRINAEFVVMENANNK FT YFILGAENLSQYGFDIFHSKERYFTIGNNNKSIKFALMQHKEILSIKPDNT FT MESPTNEDIHQLRGKLLESEFGPNLTYSQKEDIIQMVIKYKDQFGLGEQPL FT GVIKNYPVKIELTIDKPYPPILRKGAYPASPRSRKEIEKHIEELLKMGIIR FT KVGSDEEVDITSPVLIAWHNDKSRLCGDFRALNQYTKPDRYPLPRIDQSLT FT NLFNAKYITLMDIMKGFHQNIVEICSRKYLRIICHLGIFEYIRMPFGIKNA FT PAFLQRMMDTEFSKELREGWLKVYIDDIIVFHTTWEEHLEAMEVLLRAKAM FT GMTISLKKCHFGFEECKALGHRVSGLWVSVDQNSVAAVLQKPCPKDKQELS FT SFLGFTSYYRAHIPNFGIITRSLYKLYAKGVVFEMTKERIDAVNKIKHILT FT TAPILFHPDFEKPFKLYVDASIEGLGAALHQTQIIDGKPKEGPIVFISRKL FT TDTESRYSSPQLEALALVWALEKLHYYLDGSYFEVITDCTGVRSLTNLKSP FT SRHMSRWMMAIQEYKPFMTITHRPGKFHNNADGLSRMALPNDSSNPAWEPE FT EMERDIPVMGISLCELSEEFFDEVKTSYQKNSNTAKITRILSAQNTDLSLS FT STLKQPWKDGLAQGKISLESDLLYFREKHTANLVIINAEHIQQTLHVCHDE FT FMSGHLSEDRTVDRIKSTAWWPNWRQDVEEYVKTCERCQKANKATGKRFGL FT LQRIEEPMYAWEVINMDFVTGLPPSLINNYNCVLVIVDRFSKRTRFLPCYK FT EATAMYIALLFWERLISDVGLPQIIISDRDPKFTSEFWKSLHTLIGTTLAL FT STTYHPQTDGLSERNISTLTEIIRRYCTEGLCYTDKDGHTHDWHTLLPALE FT LAYNSSIHSTTGKKPFEVERGYCPRLPKDQIKNKNVEFHPTSLSFFDMLGK FT ARARAAQCIEDSVTYNQERWNKTHKEPKFVVGEQVLLLTTNFTNLQGPKKL FT QDQFVGPFVILEFHGSNAVEVALTEEFGRKHPVFPISLIKKFHASDKSKFP FT DREIPKKTPIRFETDGEKIFSHIIKQREIQVNGKSSTLYLVRYKNRSADED FT EWLPADKVPNGKTTLRDFRAQKRAHKPSEKK" XX SQ Sequence 10260 BP; 3865 A; 2009 C; 1757 G; 2629 T; 0 other; cctttggggg cctcatcgtg tgttgaaacc caaatttcaa cctcaacctt ataccaaaag 60 tctgagactt catacctcta gacaaaaaca taaagaacat caactgtagg actaagagtg 120 attaatcact aatcaactcc agcaatttca taacaagcca ctccaaatcc cttataaaac 180 aaagtcaaat cattgacctt tttttttaaa aaaaaagaaa aatacacagc acaaattcta 240 tcgagaatta tagcaatcaa aacaaagaca caagtttttt tttttgactt cctttatttc 300 ttccccaaac taatctaaaa aggaaaagaa gtaacaataa acaaatttct attattatta 
360 ttatctatta cttctttcct aaccaaagtt agaactcttt caaaacaata ctcaaagaaa 420 gactgttgtt caatctcgag ataccatttt cgagtgttat tttaaaatca agaccgagta 480 aagtcgcaga ccgacaagtc ggatacttgc attacatata cacataatca aatttcgaaa 540 atcccgaaaa cccatataca aatagatata ccaaaacaaa aagaaaagga agatatccaa 600 tcacagaaca caaaaggaaa ccaatttctc ttttagtatc ttacaagtgg acaaacaatc 660 aaaaaagata tcgaacagaa gaaatcaccg ggggatcaaa ccaaaaacaa gaaatcagaa 720 cattccgatc ggaccaaaca acaagtcttc gaaaaagaaa agggtaaggc ttacctcttt 780 ctctggaata atcaagccac agatctcgtt ttctttttat taagtagaat caattcaaaa 840 caagcgaaac acactgacct agatcccatt actttcctat cagaacaaat tcattctatt 900 cctacctacc accgcatacg attaatcgag agaaacgatt gatattaacg acgagattgc 960 ataaacaaca cctacaggac aagagtgagt taaattaacc atagaagctc atgaaatcag 1020 atcgaagaac taacacacgt catcataaca gtgatttgtg aacaaatcag cgccggaagc 1080 gaaaccgatc aacaagctct tttgtataag agaaccattg attgatattc acctcagtcc 1140 acaaacagtt tccttcagaa tcaagcagac tgctatcctt atttttatcg ctcgatcaaa 1200 cgtcaaaaac tttttgagtt tattaaagtt atttataaaa taaagagtaa taacaacaat 1260 acccttatcc tcctcttcta ttttctaatc tctcatcctg aaatcgaact taagttgaaa 1320 actgatagtg cattctcttc taccttcatt actctatctt tttgttttat cttttctttt 1380 ctgttaagtg tccttcccac atttattcaa cttgttggac ttggcaaatc acggtacccc 1440 tcagcaacaa cctgctgaag ctgatcgtac gcaccattcc accccagctt taccggatca 1500 ccctaactac gacggttccg tcgtagaatt tgaagacgca tacggaaata cgtcacaaga 1560 gccaatcatc gttgacccga ccataaccac ccctgccgcc aatccattag ttatcgaagt 1620 cgattctccc ggatctgacg gtactgcacg agaaccgttt cgagcagaag gatctccacc 1680 agagaaccta ctaactaatc aacaaagtga gtattttttt ataaaatcga ttgatattct 1740 tattatcact agttgttgtt gagaagctgg ttactgatat aaaaaaacaa acattcatcg 1800 actggaatca gaaatccttt ctcatctagg caaccaaggt tcaggagctc tagctcaaga 1860 acaagggcaa caacaccatc aaccaggcta tcgtaagctt tatttaatac cttctctaca 1920 aaaaaaaaga gaaacttgaa tactaatcga ttcattacat attttttttc cttcaccaac 1980 aaatccctta tcaatttcct cgttcacatt attttttttt cttctattcc attgattgat 2040 cgtcaccaat 
caacatgtct agcagagcct cccaaccacc aagccgggct agctcagcca 2100 gtacacgagg atcaacgaga tcgaaacgat ccgttgaacc aacctcacga ggcacgagag 2160 ggacaccaat gggggaatta ttccaacccc taccaccctc aggtaagcca aaccctacat 2220 tagtggataa ctttgctaaa gaaaatatta ttaacaaaaa aattctattc ttgacgactg 2280 aagactcagc agcaacaggc cggttattat tcccaaccga attatccggc ccatctgcct 2340 gctccagtag cccaacaccc gcctgtccca tcccatcatt accagcctca tccgaaccac 2400 catccacacc cgtccccggt tcatcgacca caccaaccca ctccgggtac cagcaacagc 2460 aacatgcctc ctcctcctcc tcctgcactg tcttaccagc agacaaggga ggccgaggag 2520 gcccgacttc gccaagcaat caacaatctc aagcgcgact tggacgaagc gagagttgca 2580 ggcaacaact cgaacttcag gcgagtattg gggatggcca aggaggccca caaggaactg 2640 gaagacttcc ttggcggcag acgttaccgt tcgaaagtcg aacagattgc aggttacaac 2700 ccttaccgga tcaatcctca gaccatggag gaattcgatc aaccgagacc ggatgcaggt 2760 ccatcgagtc agcgccgcaa acgagctcga gaggaggaga tagaagagtt agcagagact 2820 tcatccaacc tattggacat gatgctgatg gtaaaaagga tgaaacaaga caagaagaga 2880 gattcagaca agaaagggaa agggaaggag aaggagcagt aagtattctg cctcctccta 2940 cttcttcatt tcactctcac ccttcagttc tttatactga aactcctttc actggtccac 3000 ctttagataa ttcccctaat cttttatcac cttctctcaa atcatgcaaa tcagaggagg 3060 atcagttaag ccaaggaaac tctcaaaggt tgttcaacca atcgtattta gtttcaaatt 3120 taaattccca aatttcttct tcagttgtat ctgcttccaa ctctcaatta gatttagatg 3180 aacttattac acgaaaaagt tatgcaattt atgagaaagt tgaaaatata aaaccaactt 3240 tgaatttact aaaaaatcaa tgtgagcaaa gtgagcaagt gagacttcaa gagttcaagc 3300 tgagttgcca actacaaaag agatcttatg atgatattat gaagaatctt ttagaattga 3360 aagaagaact tgttgaaatt aaaacactaa gttatgagtt caaaaaccaa gaacaacatc 3420 agacagctta tgaagatgtt aaagtaaaat ttgcagaatt gaaagaaaaa ttttctgaaa 3480 tccaagaaat taagaaaata attctcgaaa taaaaagaga ctctcaagat gagataaagg 3540 ataaaggtaa aggaaaagag aaagaacaca tacctatttc tgaaattaaa gaagctagtg 3600 catccacatc taatacaagt caaggcgtta agcaaggagt aactgagata caaaactcag 3660 aagttactcc aatgtatagt ttgaataaga ctcaagcctc agtattgcag atcagagaaa 3720 aactaaacac ctcacttgag 
aaagcaacac tatcagcttc ccaacagaat atgatcaaac 3780 aaatcacaaa cttagccagt cagattgcta cgagaatctc tgataatgag aaaaagaatc 3840 ttttaaaatt tagtggatat gataaaaatt tccaagacac aattaataaa ataagtcaac 3900 aaaggaataa agaaaaacaa caggtcatag aaacaagtga tacaatagtg agtgatttag 3960 gagaaataaa aaataccatt aaacaattgg atattaagtt agataaacaa ttaagattac 4020 caattaaaga cccgtataat aaaattgaag agttgcttac aaagtttgga aaacacgaaa 4080 ctcacgtaag ttcaacaatt gatgaaatta aagaagatac aaatataaca aaaattcgtg 4140 aaaacttaaa tgagcaacaa caacaatttg agagtagtat gatgaaatct taccactcat 4200 tacaagagaa tataaaagaa aatcacgaac aattaacaaa ggaattattg aaacaacaag 4260 acatacaaat gaagtctttt acagaacaaa tgttgaaaca acaagtaatt ttaaatcaaa 4320 agatattgga aaagaatgtt gaaatacaac aacaactgtt acaaatggat gacaatgcta 4380 atacagcaag attgtcaatc attcaagaac ttaaacccag catacagaat aacaacaata 4440 atggtcctac accaataaac cagataaatc caagtataac ctatagacat ttacatacac 4500 cttttgaaga aaatccaaga gtagatccat cacttagagg agaagctaga tttagacaaa 4560 gcacagcaga accacgggaa agaacggtga gttatgtaga acctgtacca actcacagag 4620 actttcacag cccagcaatt cgtgaaagaa caattatgag agaagaaaca atcccacctc 4680 gtgaaaggtc atttatgaga gaagatactg ttcaaacagg tataacagaa tccgctaggg 4740 aagataatgt gactaaagct ttgactaaac tctggccaac aggcaaagat tggaaaaaat 4800 cagcggagac ggtgaatata atcataatga ttacattgac tggtgtgatg atttaattat 4860 taccctgaaa atggaccctc ttgttgttgt aagccattta gcaacagcct ttacgggcac 4920 agcagcgcaa tggtacatgg ataaacgtaa aactagaggc tcaatgacgt ggaatgagtg 4980 gaaagaagag attcataaaa ggtttggcaa taatgtttgg aagggtaaca tggaagagtt 5040 atttatgaaa gacaaatttg atgccaatac acacacagat cccacagctt ggtctttgaa 5100 gcaaaagaag agaatttcag cttttgatcc agattcttct gaagacagaa tagtcaataa 5160 aattttaaat aaagttgatg gtgatgctag aaatgcaatc aagagtcaac tctctacacc 5220 atatgagtgg gacatgtttc taatggtgtt caaagatatt tttgaaaata caaccatcat 5280 tacaaagaaa atatatttga atagaccaag gagacctttt attgaaaaca gagccacctt 5340 tagggatgat aaattacaag gtccctctag tacaagagtc caaccaagca catcagaaag 5400 gaagccttcc agagcatgcc caaactgcgg 
aagtaaagat ccaaaacatc aatggagagg 5460 atgtaaagat aagaaaataa atgtgattga agaagacact gaagaaagtg ataatatgaa 5520 tgatgagttt gactttcaag gaggacaact ctcaggggat tcaagctcag acagtgaagg 5580 agaaaattct ccaaatcaac atcaacaaaa tgtctgcatg attgatactc atgaagataa 5640 tgagaaaagc atcaatatat tgcaagcgga agcagatgtg cctcactcat ggcaagatga 5700 catgcagatg gggaatacaa ttgatgcaag attactgaaa tccaaaccag cagaaggaat 5760 ggcacataca ctaggacatc atgcaatggc tcgagcattg gtaaacaatt gtaaagtacc 5820 aatactactg gacagcggag catcatgctc aattgtggga aaacatttct tatctgaaat 5880 tataccagac tggcaagaaa gaatcatgcc cagtagtcat gtgaagttct ccggagttgg 5940 cagtaaactt catgctctag gtgtcataag cttaccagta atatttccac atgttaaaca 6000 gtcaattaga ataaatgctg aatttgtagt aatggaaaat gcaaacaaca agtacttcat 6060 attgggcgca gaaaacttga gccaatatgg atttgatata tttcatagta aagaaagata 6120 ttttactatt ggcaataata acaaaagtat taaatttgct ttaatgcaac acaaagaaat 6180 tctatcaatt aaaccagaca atactatgga gtctccaaca aatgaagaca tacatcaact 6240 tcgaggaaaa ttattggaat ctgagtttgg accgaatctt acctatagtc agaaagaaga 6300 tataatccag atggtaatta aatacaagga tcagtttgga ttgggagaac aacctctggg 6360 agttattaag aactatccag tgaaaataga actcacaata gataaacctt acccgcctat 6420 cttgagaaaa ggcgcatacc ctgctagccc aaggagtaga aaagaaatag aaaaacacat 6480 tgaagaacta cttaagatgg gcataatacg aaaagtaggt agtgatgaag aagttgatat 6540 aacttcgcct gttctaatag catggcataa tgataaatct agactttgtg gagactttcg 6600 tgcgcttaat caatacacca agcctgatag atatccctta ccaagaatag accaatcctt 6660 gactaacctt tttaatgcaa aatatattac tctcatggat ataatgaaag gattccatca 6720 gaacatcgta gaaatatgta gcagaaaata cctcagaatt atttgtcatt taggaatctt 6780 tgaatatata aggatgcctt ttggaataaa gaatgctcca gctttcttac aaagaatgat 6840 ggatacagag ttttcgaaag aattaagaga aggatggctc aaagtctaca ttgatgacat 6900 catagttttt cataccacat gggaagagca tctagaagca atggaataag tattgctaag 6960 agccaaagcc atgggtatga caatatcttt aaagaaatgt cactttggtt ttgaggaatg 7020 caaagctcta ggccaccgtg tgtcaggact atgggtatca gtggaccaga actctgtagc 7080 agcagtacta caaaaacctt gccctaaaga caagcaagaa 
ttaagctcat tcttgggatt 7140 tacaagctac tacagagccc acattccaaa ctttgggatc attactcgta gtctctacaa 7200 actatatgct aaaggcgtag tatttgaaat gactaaagaa agaattgatg cagtaaataa 7260 aataaaacac atacttacaa cagcgccaat cttgtttcac cctgactttg agaaaccttt 7320 taaactctat gttgatgcat caattgaggg tttaggagca gcactccatc aaacacaaat 7380 aattgatggg aaaccaaaag aaggccctat tgtttttata tccagaaaat tgacagacac 7440 tgaaagtaga tattcttcac cacaactaga agctttagca ctagtttggg cactagaaaa 7500 attacactac taccttgacg gcagctactt tgaagtaata acagattgca caggagtaag 7560 atctctgaca aatctcaaat ctccaagcag acacatgtcg agatggatga tggcaataca 7620 agagtataaa ccattcatga caattactca tagaccagga aaattccaca ataacgctga 7680 tggactaagt agaatggcat tacccaatga ttccagtaat ccagcatggg aacctgaaga 7740 aatggagaga gacatccctg taatgggaat cagcttatgt gaattgtctg aagaattttt 7800 tgatgaagta aagactagct atcaaaagaa tagtaacaca gctaaaatca ccagaatatt 7860 gtctgcgcag aatactgatc tgagcctgtc aagcacactc aaacaacctt ggaaagatgg 7920 actagctcaa ggaaaaatct cacttgaatc tgacctacta tattttagag aaaagcatac 7980 tgcaaatctt gtaataataa atgctgaaca cattcaacaa acattgcacg tgtgccatga 8040 tgagtttatg tcagggcatc tctcagaaga ccggacagta gacagaatca aaagcacagc 8100 ttggtggcca aactggagac aagacgtaga agaatatgtg aagacttgtg aaagatgcca 8160 aaaagcaaac aaagccacgg gaaaaaggtt tgggttactt caaagaatag aagaaccaat 8220 gtacgcatgg gaagtcatta acatggattt tgttactggt cttccaccat cattaattaa 8280 caattacaac tgtgtactag tcattgtaga cagattttca aaaagaacca ggttcttacc 8340 atgctacaaa gaagcaacag caatgtatat tgcattacta ttctgggaaa gactaattag 8400 tgatgtggga ttaccacaaa taatcatcag tgacagagat ccaaaattta cttctgaatt 8460 ttggaaaagt ctgcatacac ttattggaac aacattagct ctttccacca cgtaccatcc 8520 acagactgat ggtctcagtg aaagaaatat tagtacctta actgaaatca tcaggagata 8580 ttgcacagaa ggactctgtt acacagacaa ggatggacac acacatgact ggcacacatt 8640 gctaccagct ctagaattag catataatag cagcatacat agcacaacag gaaaaaagcc 8700 ttttgaagta gaaagaggct attgtcccag attaccaaaa gaccagataa aaaataaaaa 8760 tgttgaattt catccaactt ctttaagttt ctttgatatg cttggtaaag 
caagagctag 8820 agcagcgcaa tgtattgaag attctgtaac ttataatcaa gaaagatgga ataagaccca 8880 taaagaacct aagtttgtag taggtgaaca agttttactc ttgaccacca attttactaa 8940 tctacaagga cctaagaaac ttcaagacca atttgttgga ccttttgtaa tactggaatt 9000 tcatggaagc aatgcagtgg aagtagcatt gacagaagaa ttcgggagaa aacacccagt 9060 attcccaatt tcactaataa aaaaatttca tgcttcagat aagagcaaat tccctgacag 9120 agaaataccc aagaaaactc ctatcagatt tgaaactgac ggagaaaaga ttttctcaca 9180 tatcataaag caaagggaaa tacaagtcaa tggtaaatct tcaacacttt atcttgtaag 9240 atacaagaac agaagtgcag atgaagatga atggctacca gctgataaag tacccaatgg 9300 caagacaacg ttgagagact tcagagcaca aaaaagagct cataaacctt ctgaaaagaa 9360 gtaatgacct cctttgaggg tggggaatgt cagccttggg tacatataca tgtacccaaa 9420 actaacttat aatattgtac ttttacttat aaaccttatt acatgtacat acagcttata 9480 aaacacatgt aacatatatt aaaacatata caaacatcaa ttacatatgt atatacactt 9540 aaaaacatat gtaacatgta ataaaatacc aattacacct cataactcag tgtcaagagt 9600 taagatgctt atactacata aaatacataa cacgtatatg taattatgtt atatcaataa 9660 taataataaa taatagtaat tattaaaccc taaaccacag tgatgataca atcaaatatc 9720 actacaatta ggtttggaga ggaggatata taaacccctc ctctctcagc cttcaaggat 9780 tttccctcaa gcatttttac acacctcttt taaagaacac tttaaagttc aattgaagta 9840 tataatcaag agtcttataa taaataagac tagtgattca cttcatttct ttaaggaagt 9900 tcctgactag aagataaagt aatttcatac ttcaccccct tttctttgga tcctctactt 9960 cccccaagta gatttattct tcaggttagt gaccttccaa taaactatca ggttagtcac 10020 cttcagaaca actacaggtt agtgacctag atcaactcat aggttagtgt accttcagaa 10080 caactacagg ttagtgacct acatcaactc caggttagtg acctcttgaa taactacaag 10140 ttggtgactt tattacaacc aggttagtga cctccacatc actcttcaat cagttacctt 10200 gaagattgaa ggggagcttt ggatccagct ttcaacaaat tatattttta taattaacct 10260 // ID Gypsy-6_LENY-I repbase; DNA; FNG; 2205 BP. XX AC AAPO01000005; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Lodderomyces elongisporus genome: DE internal portion. 
XX KW LTR Retrotransposon; Transposable Element; Gypsy-6_LENY_; KW Gypsy-6_LENY-LTR; Gypsy-6_LENY-I. XX OS Lodderomyces elongisporus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Lodderomyces. XX RN [1] RP 1-2205 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Lodderomyces elongisporus RT genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; AAPO01000005; Positions 10627 12831. XX CC 'ATAAA' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 99..2066 FT /product="Gypsy-6_LENY-I_1p" FT /translation="MVSANANTRPLLQPLLNILPSLGPQVLHIFINHLPFE FT WFLNSNITFTVPLNDPSKILANQISQTLTPKHFTTTFRNLDVSILSEFSIL FT DLVSGIQLSDTAPSWIALLWHAITKVVLISINWYNIFHYWSLLTFIFTLFA FT VLNTIYNLLSSIVKRHFNRRTKKPSKNNQHQVYNFHSRGLSNLIFYTNERV FT QAKPVKSKTTLFGWIWRTLKFPFRFCFSTLLRYLRISQSNTKPIGTISYTR FT YRLRNLDFTTRETLLKTLIPQIRYVQFNAAHLYQAIVDAGNRADWSELAEF FT RSARSKITRRKHLWSFLSFMLRWNVPDTVYDKYCQRSYDKALTLTTPVKTY FT HELLQLLRELDALPYLRPNFDTVIVQLLRTAFLHKPIHKVIKQIHKETTSY FT WDFCYVFDSSFYELVSPNIKGKPCRRPPVHHFLRNGLPRCYRDLPTTPTPI FT AATALKPITHIPDTNHQLIVSTLDLETTPDTAPETLKDIIDENKAQKLEIT FT SSPARPDKHRVNINRTAPFPPNRPLSKTAMAAELAITNSTTGLLGLGTMAL FT TTSHRHSPPCVVFKKPDPRGIVYNSRDINYRHVREYYRESPTLQPLSHELS FT SAYNQLRIYFATYKWPTTFFDTSWALFIPKNCPLKLHTLSNFFSVGCGEGT FT ALSQGTF" XX SQ Sequence 2205 BP; 642 A; 588 C; 342 G; 633 T; 0 other; tggaggtcct aaccagatac acatcgtccg tagctacact acgccaactc gcagacagct 60 tcactatcat tatctgcctt attattggag tcatcactat ggtcagcgcc aacgctaaca 120 cccgaccatt gctccaacct cttttaaaca ttctgccatc tcttggccca caagttctcc 180 acatatttat caatcacttg ccattcgaat ggtttctcaa ttctaatatc acttttacag 240 tcccacttaa cgacccaagt aaaatattag ctaatcaaat tagtcaaacc ttgacaccta 300 aacacttcac tacgacgttc cgtaacttag acgtaagtat tctatccgaa ttttcaattt 360 tagacctcgt ttctggaatc caattaagcg acactgcccc ttcctggatc gcactccttt 420 ggcacgcaat tacgaaagtc gttcttattt caatcaattg gtataacatt ttccactact 480 ggtcgcttct 
cacctttatc ttcacactgt tcgccgtgtt gaatacaatc tataaccttc 540 tttcttctat tgtcaaaagg catttcaaca ggagaaccaa gaaaccaagc aaaaacaacc 600 aacaccaagt ctacaatttc cacagtcgcg gactctcaaa cctcatcttc tacacgaatg 660 aacgagtcca ggctaagcct gtaaaatcaa agactacctt atttggttgg atctggcgaa 720 ccctcaagtt tccttttcgt ttttgctttt caacgctttt gcgttattta cgaatttctc 780 agtcaaatac taagccaata ggtaccatca gttacacaag gtatcgcctt cgtaatttgg 840 actttacaac tagggaaaca cttcttaaaa ccctcattcc acagatacgg tatgtccaat 900 ttaatgccgc tcatctttat caagctattg tcgacgcagg caaccgtgcc gactggtcag 960 aattagctga atttcgctct gctcgctcaa aaattacacg cagaaaacat ttatggagct 1020 tcttaagttt catgttacgt tggaacgtac cagacacggt gtacgacaaa tactgccaac 1080 gctcatatga caaagctctt actttaacaa ctccggttaa aacttatcat gagttgctcc 1140 aactccttcg cgaacttgac gccttacctt acttgagacc taattttgat acggtcatag 1200 tacaactctt aagaaccgcg ttcctgcaca aaccaatcca taaagttatc aaacaaatcc 1260 acaaagaaac cacatcttat tgggatttct gttacgtttt cgattcaagc ttttatgaat 1320 tggtttcacc taacattaaa ggtaaaccat gtcgccgtcc tcctgtccat cactttttaa 1380 ggaatggttt accacgttgt tatcgggact tacccaccac tcccacacca attgccgcca 1440 ctgcgcttaa acccattaca catattcctg atacaaatca tcaacttatc gttagtactc 1500 tggacttaga aacaactcct gacacagctc cagaaacact caaagacatt attgatgaaa 1560 acaaagcaca gaaacttgaa atcacatctt cacctgcacg tccagataag cacagagtca 1620 atatcaaccg gactgctcct tttccaccaa atcggcctct ttctaaaacc gctatggccg 1680 ctgaattagc tatcaccaac agcacaacgg gattgctggg acttggtacg atggcactca 1740 ctactagtca tcgccacagt ccaccttgtg ttgtctttaa gaaacccgac ccccggggta 1800 tcgtgtataa ctctcgtgac atcaattaca gacacgttag agagtattac cgggaatcac 1860 cgactttgca gcctttaagt cacgaattat ccagcgctta caatcaattg cgcatttact 1920 ttgccacata taaatggcct actacttttt ttgacaccag ttgggcatta ttcattccaa 1980 aaaattgtcc attgaaatta catacattat caaatttttt ttcagtgggg tgcggagagg 2040 gtaccgcctt atcacaaggc accttttaac gaacagcccc caattttaag tcctgggcta 2100 cacccaaatt tataacttac ttatgctggc gaaaagtcca tgccttccct tgtggcaacc 2160 aaggaccgtc atttagtaac 
tcctcctcaa gcaacctagg gggga 2205 // ID Gypsy-1_PPM-LTR repbase; DNA; FNG; 399 BP. XX AC ABWF01007905; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_PPM_; KW Gypsy-1_PPM-I; Gypsy-1_PPM-LTR. XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-399 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01007905; Positions 17939 18337. XX SQ Sequence 399 BP; 84 A; 98 C; 83 G; 134 T; 0 other; tgaaacattt atctttccgt tctatttctt gtactactag cgcccatact ggtgcctgtt 60 ttagcgctca cccgcaggtt gaacattatg tagcttacat aatggggtcc atgcgcactt 120 tgtgcctggt attgtaccac tacgtttcct tctttactaa cgtgaccaca cagatacagt 180 ggtctgtgcg tagccggtac ttgtataaat agttgtacca ttagttggtc aaactctaag 240 ttcgacctgg ttattcattc aagtctctgc tcctcctcgc ttggggtagt ggtcattctc 300 gacttccttg ctcgatctta aggatgacgc actgtccata agtcctcgct cgtcgtctac 360 gagtcaggca tcactatagg agtatagact gttgtatca 399 // ID Mariner-4A_AF repbase; DNA; FNG; 1995 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 08-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A subfamily of nonautonomous Mariner DNA transposons - a DE consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Interspersed repeat; Mariner-4_AF; Mariner-4A_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-1995 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. 
oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-1995 RA Kapitonov V.V. and Jurka J.; RT "Mariner-4_AF, a family of Mariner DNA transposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 99-99 (2006). XX DR [2] (Consensus) XX CC It is a subfamily of Mariner-4_AF from the Mariner superfamily CC (Tc1 CC clade). XX SQ Sequence 1995 BP; 599 A; 444 C; 460 G; 492 T; 0 other; tccgtgacta gcccacccac accgcttccc gctgtgggtg agctacaccg taatcaccaa 60 tttcaccgta atcaccaatt ttcgatattt tgacctttcg aagctattcc caccacccaa 120 catgcctaaa tctattaaat ttaatgaatc tgacctcctt aaggcctgcg aagccgctca 180 ggcccaaaat aaaccgaata tctccaagat tgcgcgtgaa tatggcgttc cttattcaac 240 actacgtgat tgcatcaaaa agggcagaca ggctcgtaca gctcggaaac cagtgaataa 300 agcacttgat gggtaccagg aggaagcctt aatacagtgg atagtctgga tgcgagatca 360 taacatacca gtgacaccta agctactaga agagtttgca aatcagtcac ttcaacgcgc 420 tggtgaagct agacaggtta gtagggtatg ggcatatcgc tttgaaaaac gactcccaga 480 acacctcaat ctgggccctg tgaagcaaaa gacgaaggaa tcaaagtgta tcaaggctga 540 ggatgctggt ttactagcga attggtataa tcagcttgcc aatgtggtta aagatacacc 600 accacgattg gtatacaact ttgatgaatg tggcttccga cctggcgaag gcaaggcaag 660 gaatgtgatt ggattaaaag gttcttgcct tgatcttgct gaatctgaga agggtgagaa 720 tataacaact attgaatgta ttgctgcaga tggttggcag atggatccat ggtttatctt 780 taaaggcaag ctcctactct tttagaccat tcttttctga cctttcttgc ttcctaaagg 840 caacgggatc ttcatggaat gttggtttaa cgagagcgag gccctaccac caaatacaac 900 gatagctacg caagccaatg gctggatatc agatgaacta gcccatcaat ggcctcaaag 960 ctttatcaag gcaacaaatg agcgtacaaa gagaggagag aaacgaatac ttatatttga 1020 tggtcatggc tcccatctta ctgttgattt cttacagaca tgcgaagata atggggttat 1080 tccctttgga ttccttcctc atacaacaca cctttgccag ccactggatg gcaagccatt 1140 cttgagctat aagcaacact tccgacgtat aaataatgag ctatcttact gggctggtga 1200 gccagtaggg aagtcagaat tcttacgggt gattggacct gtacgggaga aagccttcaa 1260 ccaacgaatt atccgtgagg ccttcaaaga tcgtggtatc tggcctgttg atggtagtaa 1320 gatagtcgac aatcttgcta tccaggcatg ggaacaaatt 
ccagatgtct acgcgcctga 1380 tcttgatgca tgccttagag agacaccctc tccaccacct atctcctcat ctagtgtgga 1440 tatcacccct cgaaggacga ttcaggccct taagaagaat caggcaaagc tatctaagca 1500 taaagatctg cttacaccaa agctacagcg gaatcttgaa cggatatttg aatataatca 1560 aattgctgct gagcatctgg ctatagcgaa tgaaacaatc aatcgaatca gggctgcgta 1620 agccccccta cggcgccaac acactaagcg acatgttaag ccactcagtc aggatggtat 1680 actaaaagta cgtgatgcga atcgatcaat tgctttaagg aaggccaaag atgctgctgc 1740 acaagagagg cgtttacaaa ggcagtggga gaaagtgcat ggtaaacccc caccaccagc 1800 acctacacaa gagaatctgg tatcaaatgg atcagcaggg gcagcagatg aaaatagtga 1860 tgtttttttc ttagatagtc agccaatgtg ttgagaatag cttcaaaata tcgaaaattg 1920 gtgattacgg tgaaattggt gattacggtg tagctcaccc acagcgggaa gcggtgtggg 1980 tgggctagtc acgga 1995 // ID Copia-58_MLP-I repbase; DNA; FNG; 5224 BP. XX AC AECX01000356; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-58_MLP_; KW Copia-58_MLP-LTR; Copia-58_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5224 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000356; Positions 288 5511. XX CC Positions [1893-2417] - Integrase core CC LTRs are 98% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 57..4556 FT /product="Copia-58_MLP-I_1p" FT /translation="MSSDHADNPIDDSQDNSTVTEGSSTSSSADSFGTVHQ FT DLSNPLSSDVPPGTSTSKIPVSNKMSENTNSRVFGDQIQKYSQIMNNVLSK FT YKVSENLTDDNYIEWSQSLMEVFRSLEFHHYVKIKDYRNSTLTNEEHEKTC FT FNLTTFILHRLDSVNNVRTRNHLTDPADACEIVYDPYMCWNFLKTYHNRVS FT EDKLEAVTKALYSCQITKTDSLTSFVDKFENLIREFYRLKGELSDIQSARM FT LLGAIPSLSIETKEYIHNTVIPLTREGVGTYLRKYEERHGWTCAAIREVNS FT VSVRPSKPTSGECTPEHCLGPHLAKNCWSKPENAKKQLAFLSKIRGKTGGG FT NSNSASSHHSQAPQVKGVKKVFDSSVNTASASPAFLSLNVEFDDEPGGKQV FT TASPSEFEEDPDISASVSASLSSSRDWALHDTGATHHMFKDKKFFDPSTLV FT KIDDASKRLKLAGGDISLAVHSRGVAKLLAGDETPFELKNSLYVPDLSRNL FT IAQALLKKQGVRELFHDGTNFSLVLNGLAIFNGFISDNNLMFILLEPVSGQ FT HSTSTSISTSVTEITASLQHHRLGHVSNKYLKLMAKHDSVEEFDYLDENLN FT CDICSLSKNTKLPHNKTRPRARTFLENVHVDLSGIIRTTGFNNENYFILFC FT DDYSAYRHIYSLSDKTKEEVYEAFMAYIAVAERQTGCLVKQFTLDRGSEFL FT NNLLGSKLKELGITLHLTSGHAPEQNGVSERGMRIVVTRARSMMLESALPI FT RFWCYACSAAVFLTNQCITTALEDGKTPFEVWFFRKPSINHLRVFGCQAFG FT LIRKELRQSKFSPVSSEGVLVGFEHDNFNYQIYDLSSKKIYITHHSTFNED FT VFPFQTLSTKFPLLPAEPLKNSVRVRFFDDDEDDDLVDSLDKIVTLRPSSP FT AVDPPTDSLGIQPSEKTVQEPPRRSNRSSKKIHDSLKGMCSSSAEFEFDFI FT TESFFACLPPECNLSHVLSNSPDPKSYKRAMASSESELWKAACDKEFNSLR FT EKGVWLLVDRPSDRNVIRGLWVFRKKILVDGSTKFKARFVAMGNTQIPGQD FT YGETFAPTGKPGSLRILIAIAAIHGWEVHQMDAVTAFLNGILHEELYVEIP FT EGYRTQSTVGKVWRMKKSLYGFKQSLKIWQDDVEGFLIEIGFYQCEIDHCI FT YIRSVGNLFTAVYVHVDDLAITGNDIAKFKVEISSKWEMEDLGLAHTVVGI FT QISRVDEHSYSMSQQQYALTVLKRFDMLESKPAITPLSPNVKLLKSTEEEA FT HDFALTKLPYRSVVGSLMYLAQCTRPDMAHAVGVLSQHLESPNKLHWDLAM FT HVLRYLNHTVNIGIVYSGTAPKVVTGQRSFQCPVSHCNADWAGDANTRRST FT TGYVFVLAGGPISWRSRLQPTVALSSTEAEYRAITEAGQELIWLRNMMARF FT GFTDPNPTVLQSDNLGAIHLTSKSIFHARTKHIEIHYHWIREVVKKGDLAI FT KHCPTHLMVADLLTKQLPKEQFSNLRKAMGLRFIG" XX SQ Sequence 5224 BP; 1443 A; 1087 C; 1086 G; 1608 T; 0 other; ctttaacttt atatggtagc gagagtgcaa cgatctcttc gacccaattt gcttgtatga 60 gttccgacca cgccgataat ccaatcgacg attcgcagga taactcaact gtaaccgaag 120 gctcaagcac ttcaagttcg gccgattcct ttggaacggt tcatcaagat ttatcaaatc 180 ctttgtcatc 
tgacgtacct ccaggaactt ctacctcgaa gatacctgta tctaacaaga 240 tgtctgaaaa caccaattcc agagtgttcg gtgatcagat tcaaaaatac tctcaaatta 300 tgaataacgt gttaagcaag tacaaagtat cagaaaatct tactgatgac aactatatcg 360 aatggagtca gtctttgatg gaagtttttc ggtcgctcga attccatcat tacgttaaga 420 tcaaagacta ccggaattcg actctgacaa atgaagaaca cgaaaagact tgttttaatc 480 ttactacctt catcttgcat cgtttggatt ccgtcaataa cgtcagaaca agaaatcacc 540 ttacagatcc agctgatgct tgtgagatcg tttatgatcc gtatatgtgc tggaactttc 600 ttaaaaccta ccacaaccgt gtttccgaag ataaactcga ggctgtcaca aaagcccttt 660 attcttgcca aattaccaag accgattctc tgacttcttt cgttgataag tttgaaaatc 720 tgattcgtga attttatcgc ctaaagggcg aactttcaga cattcagtca gctaggatgc 780 tgttgggtgc tattccttca ctttcaatcg aaacaaagga atatattcat aacaccgtaa 840 ttcctttgac tcgcgaaggt gtcggaactt atcttcgaaa gtacgaagag cgccatggat 900 ggacgtgtgc ggctattcga gaagtcaatt ccgtctctgt tcgtccttca aagccaacct 960 ctggtgaatg cactccggaa cactgtcttg gacctcatct ggctaagaat tgctggtcaa 1020 aaccagagaa tgccaaaaaa caattggctt tcctttcaaa gatacgagga aaaactgggg 1080 gtggaaattc taattccgct tcttctcatc attcacaagc tcctcaagtc aaaggagtta 1140 agaaagtatt cgactcgagc gtgaatactg cctctgcgag tccggctttt ctttctctca 1200 atgtcgaatt cgatgatgaa cctggaggaa aacaagtcac cgcctcgcct tcagaattcg 1260 aagaagaccc ggatataagc gcatctgttt cagcatcact ttcttcgtct cgtgattggg 1320 ctctacatga cacaggagcc acacatcaca tgttcaagga caagaagttt tttgatcctt 1380 ccactcttgt caagatcgat gatgcctcca aacgactcaa gttggccggt ggtgacattt 1440 cgttggctgt acatagtcga ggagttgcta agttgttggc aggtgatgaa acaccttttg 1500 aactcaagaa cagtctctat gttccggacc tgtcaagaaa cctcattgct caagcacttc 1560 tcaagaagca gggtgttaga gaacttttcc acgacggcac aaacttctct ctggtgttaa 1620 atggtctcgc aattttcaac ggcttcatct cggacaacaa tcttatgttt atcctccttg 1680 aacctgtgag tggacaacac tctacttcaa cctcaatctc gacttctgtc actgaaatca 1740 ctgcatcact ccaacatcat cgattagggc atgtcagcaa caaatacctc aagttaatgg 1800 caaagcatga cagtgtagag gaatttgatt accttgatga aaacttaaac tgtgatatct 1860 gttctttgtc caagaatact aaactacctc 
ataacaaaac tagacctcgt gctcgcactt 1920 tcttagaaaa tgtacatgta gatttgagtg gtattatcag gacaaccggt tttaacaatg 1980 aaaattattt tatcttattc tgtgatgatt attctgcgta ccggcatatc tactctctaa 2040 gtgataaaac gaaagaagag gtttatgagg catttatggc gtatatcgct gttgctgaga 2100 gacagactgg ctgtcttgtc aaacagttta ctcttgatcg tggaagcgaa ttcctcaaca 2160 atcttctcgg ctctaaactc aaggagcttg ggattactct ccatttgaca tcaggacatg 2220 caccggaaca gaatggtgtc tctgaacgcg gtatgcgcat agtcgttacc agggcgcgtt 2280 cgatgatgct cgagtcggct ttaccaatca gattctggtg ttatgcttgt agcgcagctg 2340 tctttctaac aaatcaatgt ataacaacag ctttagaaga cggaaagact ccttttgagg 2400 tatggttttt tcgaaaaccc tcaatcaatc acttacgggt tttcggttgc caagcttttg 2460 gtcttattcg gaaagaactt cgacaatcta agttttctcc cgtcagttct gaaggcgtcc 2520 ttgtgggatt tgagcatgac aacttcaatt atcaaattta tgatttatcc tcgaagaaaa 2580 tctacattac acatcattct acattcaatg aagatgtgtt cccttttcaa accttatcaa 2640 ccaaattccc tctacttcct gctgaacctc taaaaaattc agttagagta cgcttctttg 2700 acgacgatga agatgatgac ctggtagact ctcttgataa aatcgtgaca cttagaccat 2760 catcgccagc tgttgatcct ccaaccgata gtttgggtat tcaaccttca gaaaagacgg 2820 ttcaagaacc tcctcgtcga tcaaaccggt cgtcaaagaa aatccacgat agcctgaaag 2880 gaatgtgttc ttccagtgct gaatttgagt ttgactttat cacagaatct ttctttgctt 2940 gtctaccacc cgaatgcaac ctgtctcatg ttctttccaa tagtcctgat ccaaaatctt 3000 ataaacgtgc tatggcgtcg agtgaatcag agctgtggaa ggccgcgtgt gataaggaat 3060 tcaactctct aagagagaaa ggtgtttggt tattagttga tcgtccttca gatcgtaatg 3120 tgatcagggg tctgtgggtg ttcaggaaga agattttggt tgatgggagt accaaattta 3180 aggcgcggtt tgtggctatg ggaaataccc aaatacctgg tcaggactac ggtgaaactt 3240 tcgcaccgac aggtaaacct gggtctttac gtattctcat tgctatagcg gcaattcatg 3300 ggtgggaagt tcatcaaatg gatgccgtaa cggcatttct taacggaatc ttgcacgaag 3360 agttatatgt cgaaattcct gaaggttacc gtactcaatc aactgttgga aaagtctgga 3420 ggatgaagaa gtcactgtat ggttttaaac aatcactgaa aatttggcaa gacgatgtgg 3480 aaggatttct cattgaaata ggcttctatc aatgtgaaat agaccattgt atttacatcc 3540 ggtctgttgg aaacttattc acagctgttt atgttcacgt 
ggatgatcta gctatcaccg 3600 gaaatgacat tgcaaagttt aaggtggaaa tctcgtcgaa atgggagatg gaggatttgg 3660 gattggctca tactgttgta ggtatccaaa tttcacgtgt ggatgaacac tcttactcca 3720 tgtctcagca gcagtatgct ttgactgttc tcaaacgctt cgatatgttg gaatcaaaac 3780 cggctataac acctttatct ccgaatgtca agttattgaa atcaactgaa gaagaggctc 3840 acgattttgc cttaaccaaa cttccttaca ggagcgtggt tggttcattg atgtatttgg 3900 cacaatgtac tcgccctgat atggctcatg cggtcggggt tttatcacaa catctcgaaa 3960 gtcccaacaa acttcattgg gatttggcca tgcacgttct taggtattta aatcacactg 4020 tcaatattgg tattgtctac tccggtaccg cgcccaaggt tgtcacaggg caacgaagct 4080 ttcaatgtcc agtttctcac tgcaatgctg actgggctgg agatgccaac acccgacgat 4140 caacaactgg ctatgttttc gttctagctg gtggtccaat ttcttggaga agccgacttc 4200 agccgacggt ggctttgtcg tcgacggaag ccgagtaccg cgcgatcacg gaggccggtc 4260 aagaattaat ctggttgcgt aatatgatgg cacgttttgg atttactgat ccaaatccta 4320 cagttcttca aagtgataat ttaggagcta ttcacttgac ttcaaaatct attttccatg 4380 ctagaactaa gcatattgaa atacactatc attggattcg agaagtagtt aagaagggtg 4440 atcttgcaat caaacactgc ccaacccact tgatggtggc agacctttta actaaacaac 4500 ttcctaaaga acagttttca aatcttagaa aagctatggg gttaaggttt ataggataat 4560 gctctttgag ggggtgtgtt aagatattag atactatctt gttatttggt caaagatcat 4620 tatttactgg aggtaatagg ttaatgttca tttatggtaa ttggatgagt tggaagtgaa 4680 gtgaggtggt tagactagaa gatgagatgt tatgcgtggt tagcgttgaa gagtgttaga 4740 tgtacaaagg gggttatgcg gaaagttgaa ccaaggggta actgatggat tcttttgggt 4800 ttattagttc tcattttgta tttctctttt tcctctcttt ttctgcttct tttcgtgtcg 4860 aaaagaatca gtgagttccc tcttatttca tcttcctgaa tcagctcttt tcaacaagtc 4920 tctttactta gtcttactga tactgtgatc ctcatctcag atcaaatcaa ttttgtcttt 4980 ttattttcat aacctcttac gcgtgctcct cacgttttgc tttgagctcg caggtagtga 5040 tgatttttgt ctttcttatc aagtcttaga ttcttctctg atcttttaca tatctttcct 5100 ttattatcta tttgtgctgt taggttagtg atcagttctc gtcgttgact agcatagaag 5160 ctatagtcta acgaggtacc tgtgccttca ggtgggtgtc gaaaagaatc aatcaaatta 5220 attt 5224 // ID Gypsy-1_CBW-LTR repbase; DNA; FNG; 683 
BP. XX AC CP000289; XX DT 15-JAN-2011 (Rel. 16.02, Created) DT 15-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Cryptococcus bacillisporus genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_CBW_; KW Gypsy-1_CBW-I; Gypsy-1_CBW-LTR. XX OS Cryptococcus gattii OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Tremellomycetes; Tremellales; Tremellaceae; Filobasidiella; OC Filobasidiella/Cryptococcus neoformans species complex. XX RN [1] RP 1-683 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Cryptococcus bacillisporus RT genome."; RL Direct Submission to RU (15-JAN-2011). XX DR Genome; CP000289; Positions 710116 710798. XX SQ Sequence 683 BP; 144 A; 171 C; 137 G; 231 T; 0 other; tgaaaggctc catgcattcg agtggttggc caaagcccgt ggccaaagcc cgaggtcaca 60 tttcggagaa tttcccacgt gtgtgggtat caagttataa ccggttcctt tcgttgcatt 120 tcgggatcaa attcaatgca ctcgaagcat ttccgccctt ccacagtttc acttcgtccg 180 agctacaact ttatcattta tttattgtta tatcttcatc atttttattc ttatttttct 240 tctattttct ccaagcaccg tcccagacac tgaggatata aggccgcaga tctgcaacaa 300 ctctccttct tttttatcct tcaggaagag ttcttttgag cggtcctcgg ttgtctggtg 360 ggtgtcttgc ttctctttta tttcggagat ggattgttag ctgactgggc aggaatagta 420 cgtagaacat ttattgactc ctgcccaggt atgtttgtga tccttcagta agagttcttt 480 tgagcggtcc tcggttgtct gtacgtagaa catttattga ctcctgccca gattatttcg 540 tacatgcaca tctgtagacc gtctgccgtt ctctcttgcc gcacactctg aagtcgattt 600 tcgcctcaaa tcaatccatc ttttcgccgt caagccagtc ataacccagc cgtagtggac 660 acacctttcc aaggtctgtt aca 683 // ID PYGGY_I repbase; DNA; FNG; 5654 BP. XX AC AF533703; XX DT 03-JUN-2005 (Rel. 10.05, Created) DT 09-JUN-2005 (Rel. 10.05, Last updated, Version 1) XX DE P. graminea LTR retrotransposon PYGGY -internal sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; gag; pol; PYGGY_I; internal portion. 
XX OS Pyrenophora graminea OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Dothideomycetes; Pleosporomycetidae; Pleosporales; Pleosporineae; OC Pleosporaceae; Pyrenophora. XX RN [1] RP 1-5654 RA Taylor E.J., Konstantinova P., Leigh F., Bates J.A. and Lee D.; RT "Gypsy-like retrotransposons in Pyrenophora: an abundant and RT informative class of molecular markers."; RL Genome 47(3), 519-525 (2004). XX RN [2] RP 1-5654 RA Gentles A. and Jurka J.; RT "Internal portion of P. graminea LTR retrotransposon."; RL Direct Submission to Repbase Update (03-JUN-2005). XX DR Genbank; AF533703; Positions 225 5878. XX SQ Sequence 5654 BP; 1783 A; 1288 C; 1446 G; 1137 T; 0 other; ttttgtattc gatcctagtc aacagagcct tcctataaga ccctgttaac aagtcgcgcg 60 ttaattacct taaccgctac tttgttgcgt ataagcaaga gtgtctgctg tacaagatta 120 gtgttgttct attgcccgaa aggtccttag acagcaaaga ggcataagct atcgcgagat 180 tgctagccta agactgtagc ttcgcgagac gcttagtaac aacactactt aggtgtaggc 240 ttactagttg caaaagccgt tttagagacg tgagcgatag taggcgaagt cctattagtc 300 ctataatagt taaggaatag gcaacgaccc tttactatag ttagagcaag cgtcgtcgga 360 agtcctatcg gtcctctaca gagtaaggca taggccgcga cagtttagct agaaagttag 420 ctaagaagag ttcgcgagcg gtgaggaacg acagtccaaa agacgcgact tataacgact 480 accgatagtc gttcggaaaa gccctgctta gtgacaggca acgagaagcc ctaccaagaa 540 ggagtaggcc acagtagaaa cctgtgcgag acgaccgaag aacggttaga aaaagacaac 600 catggcgagc gggagagtac cgctaacgac cgacgatttg gggctatacg ccgccacgag 660 actcgacgaa ggagcgaagg aggaggacgt tgacgcgcag atccggagct ggctgagcga 720 gagcaactgg cctcaggaag cgatagagtt gacaatagga tcggcagcga agagggcaca 780 tgagataagg aagggcaagg ttccgcaaca agcggaaggg caatcgagtc aacaagcagg 840 ccaacaaccg caagtccagc aggacaacgt gtcccagata atgacccaaa tgatgggcct 900 gttaggaccg atggctagtc ggctagaagc cttggagaga cagcgcaccg acgaactcca 960 ctcggccact actcagatta ggacgccagc gcccgtccct acaaagagca aattccccca 1020 tccagagccg tttgatgggg atagatcgaa atatttggcc tttcgctata agacaaaagc 1080 gaaacttcga cacgagtatg agggagctct aagcatcgct 
caagatagag tacgtaagtt 1140 agtaggtgtt tccggcaagg ccgcagatgt actgctaccc tgggccgaaa cgtaccaaga 1200 ctacagtagc atcgacgagc tatggcactt tatggaccag caatacgacg acccgcacca 1260 gaagtcaaag gctcttaatc agctctcgaa cctaaggcag ggcaagttgt tagttaggga 1320 ctaccacatg gagtttaacc ggcttaagat ccagttaggc gaacgctttg gcgaggcagc 1380 aaagaagagt atgtttctaa agggccttta tacaaagctg caggaagccc tagctacagt 1440 ggacgaggac ctattgtacg aacagcttat taataaagct atccagacat ccgacaacct 1500 ataccgggta agcctaacag cctgggcccg acaagggcac acgctagccg aacagtctac 1560 tccgaagact actcagcgcg aggctagccc taaggcgata gactgggaac tgacgaaggt 1620 gggtcaagct cgactagaca agagcaagca ggagaaacac caatttaagt gctacaactg 1680 cggcaagtca ggctatatag caagacagtg tccgaacatc gtccgagcgc ggaaggcggc 1740 ccctagccct cagcaccagg aagcgcttga agaggattcg agtaccgact cgggaaaaga 1800 gcagctctga gccaaagtct cggcccagag cccttgaaga tggatcagaa gactcgcagc 1860 gaatggaatc agtttcagca agaacacatg cgcactctac ctatcactat agaccttatt 1920 gtgaatgacg tcacgaacgc cgtcgcgctc gtagactgta gatgtctatg ctacgcgctg 1980 gtcaataaga aattcgctta ctgttactat ctagagcgtt tccagatacc tgctcgacta 2040 atagaaggag ttaatagcaa gttatcagaa atttataagg tagcccgatt ctcgctcaga 2100 atgcataggc acgaagagat agcctacgcg tacgtaatgg atttgtcctc agaggaggat 2160 atatacctag gcagaggctg gatagaccac caagacgtga cggtagcccc agctaagaag 2220 agtatcttta tccactcgaa agggattcgg gtgaggtcaa cagaaggagg ttcagcaggg 2280 aaacctcgac aaatcaacgc agcaggattc gcggcgctta tccgtaggca gaaacgcgcc 2340 ccgaacagcg ttcagatctt tgccgcatca attgcggata tcgacaaggc cttaagaccc 2400 aagaagaagg tggacgtgag agcgctatta ccagaacagt acaaggagtt ctacgatctg 2460 gtcgacccaa agagagcaga aaagctccct ccgcaccgag gtcctagagt agaccacaag 2520 atcgagttag acctaaagga cagtcaacca ctatggggac ccctatacgg catgttacga 2580 ggagagctac tagtactcca aaaggagctt acatccttgc ttaacaaagg atttattcgc 2640 gtaagtagct cgccggtatc cttactagtc ttgttcgcaa aaaagctagg aggagggctc 2700 cgactatata tcgactatcg ggccctaaac gccatcacaa gagaaaagaa ccgataccta 2760 ttaccactta tccgagaaac acttaacaac attagcaagg caaaatggtt 
cactaagcta 2820 gacgtaattg ccgccttcta taagatctgt gtagtagaag gtaacaagtg gaaaacagcc 2880 ttccgaacgc gctttagctt atataagtgg ttagtaaccc cctttagtat agtaaactcc 2940 ctaagcacct tccaacggta tatcaactgg actctaagag agtacctcga cgagttctgt 3000 tcggcctacc tagacgacgt gttaatctac acagatggta gcctagaaca gcaccaagac 3060 catgtccgaa aggtcctcag gaaactccaa gaaagtgggc ttaacgtaga tatcaagaag 3120 tgcgagttcg gggtgaagtc tactaagtac ctaggattaa ttattgacgc ggagaaaggc 3180 atccggatag acctagaaaa agttaaggca attatagagt gggaaccgct aagactgtga 3240 agggcgtacg tttatttttg ggattcgcaa acttctaccg gagatttatc cagagacttc 3300 tcaggaatta ctgcccctct tactcgcctt acgggcgacg taagctttaa ctggggagaa 3360 gaagagcaag ccgtattcga gaagctaaag agaatctttg taacagaacc aatccttgcg 3420 accttctact ctgaacgtga tacgatccta gagtgcgact cctccagata cgcaacagga 3480 ggagtgctat cccagtataa cgacgaagga gtactacgtc cttgcgcgta cttctctaag 3540 aagaacaacg tctacgaatg caactacaag atccacaaca aggagttgct agctattatc 3600 tgttgcctta aggaatggga tgctgaactg cgtttagtca agagttttaa ggtgataaca 3660 gatcacaaga acctcgacgt actttataaa gcaaagatgt taacaaacgt taaatctgtt 3720 aggcaggctg ctgagtcgat tcaatataga gatcctgtat aggcggggaa acagaacgtg 3780 agagctgatg cgctttcaag gagagaacaa gaactggcaa aggacgcaga agacgaacgc 3840 ttaaggaagc gcgttataca ggtcttgaag ccaacgcgta ctggtatgat gaagctagcg 3900 aggacgataa ggactcagaa gaggctggat tttatcggcg aagattcgcg tatgacgaac 3960 agaagaccgt ccaagcctga ctaaagagca gtcagaacaa gtagacaaac aggagacagt 4020 cctagtccaa ctgaagggca gtcagagcag caagaagacc tagcccaagg aaacgagctc 4080 gaggagctat ggctagaggc tctgaagaac gatcggcagt accaagaagc aacgcaggca 4140 gtgatacggg ccgaacgaag gttcccccca agcttagcgt aaaagtgctc gatctcagag 4200 tgtgaagtta gtaaccaagg agggttacta tacagaggca gaagatgggt gccagacaac 4260 gagagtctac gcaccaaaat aatcagtgga gtacacgaat ctctctcaag tggtcacacc 4320 agaggagaga gattacctta caaagataat ctgccgagaa gattcttctg ggcagagcat 4380 gataggtaga atcagacgct acgtgcgcaa ctgcgatgta tgcggaagaa ctaaaccata 4440 gagagaaggt cccaaggact cctaaaacca ttgctaatac tagaccggat atggaaggag 
4500 atctcaatag actttattaa gggactgcta actagcgaag gcataacttg tcttatagtt 4560 gttactaata ggcttagcaa gggatcaatc ttcatcctat tacctaacat caagatagaa 4620 acagttgtcc gagcgttcct tagacaagta gtagcctacc actggttacc agaagcaata 4680 acttccgata gaggtagcca gttcgtgagc gtgttatagg agcgcctctg cgagatcctg 4740 aagattaggc gacagttatc aacatccttc tacccccaaa ctaatagctc gactgagagg 4800 ataaacagcg tgtgggaagc ctacacccga gcgtttatca gctgggccca aacagactgg 4860 gcatctctct gcccaatagc ccaaatagct attaatagta gggacgcaac atcaacagga 4920 gtagcgccat tcttcttgca acatagatat aacatagatc cgctgcaact agaaatccct 4980 caaggagccg accagaagaa gtacaccgcc gaagaatatt cagaccgcaa gaaggcagaa 5040 gctatcgtag ctaagttccg agacgtattc aacctagcgc aagcaagcat ggccacagcc 5100 aaacagaata agaaaggcca ggcaaatagg caccgcaaag aagcaaagag ataccaggtt 5160 agtgacaaag tctggttgcg gctcgataaa caatacagga caggacgaca gtcacagaag 5220 cttgactaga agagtgctaa gtatacagta gtctaagtaa tcgatagcca cttagttaag 5280 ctggataccc ctccgggaca ccacccagtc tttcacgttg acagactacg ccttgccaac 5340 tcagatcctc taactagcca ggtacaagac aactatcagc cgttgctgtt acaaataggt 5400 gacaagaaca agtgggttgt agaggagatt atgagtgaga aatggaaacg ccgagagcgt 5460 ggttggcgcc tttactacga ggtgaaatgg gctagataca ctctaacaga tcttcagcca 5520 ggagaattgc tagaggaaat agaagctcta gaatcgatag aaggttatac aaaggacgtt 5580 cggagcgagg aggggcgctt accagaaggg tttcgccggg ataaccccta agaaggggag 5640 ggagaagggg ggta 5654 // ID Gypsy-4_LENY-LTR repbase; DNA; FNG; 370 BP. XX AC AAPO01000014; XX DT 12-FEB-2011 (Rel. 16.02, Created) DT 12-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Lodderomyces elongisporus genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-4_LENY_; KW Gypsy-4_LENY-I; Gypsy-4_LENY-LTR. XX OS Lodderomyces elongisporus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Lodderomyces. XX RN [1] RP 1-370 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Lodderomyces elongisporus RT genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; AAPO01000014; Positions 540629 540260. XX SQ Sequence 370 BP; 115 A; 63 C; 76 G; 116 T; 0 other; tgttctatat cggacctaag gtacgattag aatagggctc agaaaactga cgtcacattt 60 tgtcatgtga ttttgtggct tagcggatgg caagatgaga tgcagcacat acgctgcgta 120 ttgctacgta aatagttttc gttatgtgtc aagtcggact atataaggag gacttgcgct 180 ctcgaattta gtttatagtt tctacttaga aatcaacttg cgtttcaaca aataagtgtt 240 taagacattc aagagtttgt agaaggtaca acagtagtta gttgtgccac ataccacatt 300 ccaacgaact cataaccttg ttgcaagttc aaggttgact caactataat attaaggaca 360 ttataaatca 370 // ID Copia-7_LBS-I repbase; DNA; FNG; 4703 BP. XX AC ABFE01002190; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-7_LBS_; KW Copia-7_LBS-LTR; Copia-7_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-4703 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01002190; Positions 12552 17254. XX CC Positions [1689-2219] - Integrase core CC 'GAAGT' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 171..2453 FT /product="Copia-7_LBS-I_2p" FT /translation="MDTDTSRKSPLAKLGIISYNEWESAMYGRMMYLGVSR FT IINGKETVISDPGSSSDVTSASMAAYNAYATRCEKAAGEIYQWLDEANKVH FT VLSIREDPKAMWDKLALIHNKSAPNSRFNSLSDLFSIRLKEDETLTQLTAR FT VEGAMQTVVNLRPKEPPYTLSLLDEELAIMAMIRALPREDYNPFISSVLLL FT STLTKDAVLEAFRVEETQRRGTQAEIEAAAVIAAAAAARDIKCHWCGGPHM FT QRECASYIEARSSAQKLPASGGKKKNRGGRGAGRGANRDSANASKVEEGCD FT GVSVKAESAVCISASTKASADTHWNADSGATAHMTPHRNWISSDFKPWRVP FT VHIANGHIVYSTGKGSVHFRPSGRQSNGRALEDLIITDVLHVPDLANNLLS FT VLTLTTKRAFTVVIRGSKLSFIRKHKTLFTATVDSSTTAYLDGTVVVPNIS FT SNISAVRGAVIDRNLLHKRLCHLGHDRLERFIDEDLASDTVLTSDAPLDNI FT CESCIAGKQHRHPFPHTADRATGALDRIFSDVHGPMSVRNRSGKRYWVTFI FT DDAFRWMEVYDIAKKSDVFGAFKLFKALVEKQTGRKIKCFHVDQGGEFTLK FT EFVDFLMAEGIRIKFTTRATPQQNGVVERNNRTTAEALTSMLNEANLPMSF FT WGDALNVFRHVHNRSPTAALPKGSTPYSGFKGRKPRCGHLRVFGCRAYVLI FT GRDKRKSLQGHTIKGIFMDMGILRTMLVGGFMIQFRRRPTSRVMSYLMNPD FT FLVPQPNPLP" FT CDS 2489..4672 FT /product="Copia-7_LBS-I_1p" FT /translation="MDLPEHEDPFDEDDPVFPPALEAPSNDLPPLVGADLP FT PLVGADSPPPARRRREHALPPPPFELPTGPRTRKPVQKYDDYYSSIQRQAD FT AKADAKRLRNLVPDIPAPALPIPVNDNTDKLLWDASLGPADEFGEQEVNFN FT LDLVSDEALMMAGRTFVENGTDSGGYSAFSATLQSSTCAMKVGAHDTNPRN FT YREAMGSDHSEEWYTAMCLEMNAIERNGTWRVVYLPAGKKSIGSRWVFKVK FT HLPDGCLDKFKARVVAQGYSQRPGVDFDETFAPTARWNAIRTILALAAVED FT MHLESVDIRAAFLHGVMPEEMEIYMNLPDGFPAHPAPDIIKQPGDGRPVAR FT LLKGLYGLKQGAYLWHKRMNEVFIKIGFVRVVSDPCVYVYLRDKVRIIIPV FT HVDDMTIASASRPAILKVIDQLREYFDLNHLGPAVGLLGVRLTRDRPNRKL FT WIDQRAYAVDLLSRFNMLDANPVTTPMDPNIKLRKSQSPQTSQEVEEMSRI FT PYIQATGGLLYLALATRPDIVYSVGVLCRFNSNPGLDHWKAVKHLLRYIRG FT TLDYRIEYSAIASTPAPTFFKAFSDADHGGNLDNGRSTTGMLLMVAGGAVS FT WSSKIQTIVANSTTEAEFIAASETGRELCWMRNFLDEIGTTQVGPSDLKMD FT NNSAISVSKQPEHMGRLKHLDRHWFWLRQAVYDRKIIPSYIPTAEMAADIL FT TKALTRDLVEKFRRMMGVVGEWSKEPLQ" XX SQ Sequence 4703 BP; 1128 A; 1269 C; 1102 G; 1204 T; 0 other; ggttatgggc cccggcccca gacattcttg aactcctccc aacacacctc gaaggctcca 60 ctctcacgtg tcgcagccct ctcctccaat ccctcgctcc gctcagtcgt cgcagcccga 120 gattcacaca cctgtcagat 
cgttatctga tctaaccgat ccctgccatt atggacacgg 180 acacctcgcg caagagcccg cttgcaaagc tcggcatcat tagctataat gaatgggaat 240 ccgcgatgta cggccgcatg atgtacctgg gggtcagtcg cattatcaat gggaaggaaa 300 ccgtgatctc cgaccctggt agttcctccg acgtcacttc agccagtatg gccgcgtaca 360 acgcctacgc aacacgatgc gagaaggcag ctggagaaat ctatcagtgg ttggatgaag 420 caaacaaggt gcatgttctg tccattcgag aagacccgaa agcgatgtgg gacaagctcg 480 cactcattca caataaatcc gcgccaaact cccgtttcaa ctctctctcc gatctctttt 540 ccattcgcct caaagaagac gaaaccctca ctcagctgac tgctcgtgtt gaaggggcta 600 tgcaaactgt cgtcaatctt cgccccaaag aaccgccata caccctttcc ttgcttgacg 660 aagaactcgc aatcatggcc atgatacgcg ctcttcctcg cgaggactac aaccccttca 720 tatcctctgt cctactctta tcgactctca cgaaggacgc cgtcttggaa gcctttcgcg 780 ttgaagagac tcaacgtcga ggcacccagg ctgagatcga ggccgccgct gtcattgctg 840 ctgccgctgc cgctcgagac atcaaatgtc attggtgtgg tggtccgcac atgcaacgtg 900 aatgcgctag ctatatcgag gctcgcagca gtgctcagaa attgcctgca tctgggggga 960 agaagaagaa tcgtggagga cgtggagctg gacgtggcgc aaaccgggac tcggccaatg 1020 cctccaaggt cgaagagggc tgtgatggtg tttctgtcaa ggccgaatcc gctgtttgta 1080 tctctgctag caccaaggcc tcagctgaca cacattggaa tgcagactct ggcgccacag 1140 cgcatatgac acctcatcgc aattggattt cgagcgactt caaaccttgg cgtgttccag 1200 tccacatcgc aaatgggcat attgtttatt caactgggaa gggatcagta catttccggc 1260 cttcaggtcg acagtcaaat ggaagggcat tggaggatct tatcatcaca gatgtacttc 1320 atgtaccaga tcttgccaat aatctcttat ctgttcttac cctcacgacg aagcgagcat 1380 tcactgtcgt gatccgcggc tccaaattgt ctttcattcg gaaacacaaa actcttttca 1440 ctgcaaccgt cgactcatca acaacggctt atcttgatgg aacagtcgtt gtccccaaca 1500 tttcatccaa tatttctgcc gttcgtggag cagtcattga ccgcaacctt cttcataaac 1560 gcctatgcca tcttggccac gatcgccttg aaaggttcat tgacgaagat ttagcatcag 1620 atactgtcct cacatcagat gctcctcttg acaacatctg tgaatcatgt atcgctggaa 1680 agcaacaccg gcacccattc cctcataccg ctgatagagc gactggtgct ctagatcgca 1740 tttttagtga tgttcacggt cccatgagtg tacggaaccg ctctgggaaa cgttattggg 1800 tcacatttat tgatgacgca ttccgttgga tggaagtcta 
tgatatagcc aagaaaagcg 1860 atgtttttgg agcattcaaa ctcttcaagg ctcttgtcga aaagcagact ggccggaaaa 1920 tcaaatgttt ccatgttgat caagggggcg agtttacact caaggaattc gtcgactttc 1980 ttatggcaga gggcattcga atcaagttta cgacgcgtgc tactccacag caaaatggag 2040 ttgttgaacg gaataatcgg accacagctg aagcactcac ctcaatgctt aatgaagcaa 2100 acctcccgat gagcttctgg ggtgatgctc tcaatgtatt ccgccatgta cacaaccgtt 2160 ctccgacagc tgctcttcca aaaggttcta caccttacag tggtttcaag ggcagaaaac 2220 ctcgctgtgg ccatcttcgt gtttttggct gccgtgcgta tgtcctcatt ggtcgggata 2280 agcgcaagag cttgcaaggt cataccatca aaggtatttt catggatatg gggatcctca 2340 gaaccatgct ggttggaggt tttatgatcc agtttcgaag aagacctaca tctcgcgtga 2400 tgtcatattt gatgaatccc gatttcctgg tacctcaacc aaatcctctt ccttgaactc 2460 tgattttgct ccaccccttg ccaatttcat ggatctccca gaacatgaag acccgtttga 2520 tgaagatgat cctgttttcc ctcctgcttt agaggcacct tctaacgatt tgcctccttt 2580 ggtgggagcc gatttgcctc ctttggtggg agctgattct cctccacctg cacgtcgtcg 2640 ccgtgaacat gctctccccc ctcctccttt tgagctccct actggccctc gcactcgcaa 2700 gcctgtccag aaatatgacg actattattc ctcaatccaa cgtcaagcgg atgcaaaagc 2760 tgatgctaag agactacgta atcttgtgcc tgacatccct gctccagccc ttcccatccc 2820 agtgaatgac aacactgaca agctgctttg ggatgcatct cttggccctg ctgatgaatt 2880 tggcgaacag gaggtgaatt ttaacctcga tcttgtgtct gatgaagctc tcatgatggc 2940 tggacgtacg ttcgtcgaga atggaaccga ttcgggggga tacagtgctt tttccgcgac 3000 attgcaatcg tctacctgcg caatgaaagt tggcgcccat gacaccaatc cacgtaacta 3060 tcgagaggcc atgggaagtg atcactccga ggagtggtac actgcaatgt gtctggagat 3120 gaacgcaatt gagaggaatg gcacctggcg cgttgtatat cttcctgcag gaaagaagtc 3180 aattggctca cgttgggtgt tcaaggttaa acaccttcca gatggctgcc tcgataagtt 3240 caaagcgcgt gtggtcgccc aaggctatag tcaacgccct ggtgtcgact ttgacgaaac 3300 ctttgcacca acagcacgct ggaatgctat ccgcactatt cttgctcttg cagcagtcga 3360 ggatatgcac ctagagtcgg ttgacattcg cgccgcattt ctgcatggtg ttatgcctga 3420 agaaatggag atttatatga atctccctga cggatttcct gcacatcctg cacctgatat 3480 catcaaacaa cctggagatg gacgaccagt cgctcgcctt ttaaagggtc 
tttacggtct 3540 caagcagggc gcatatcttt ggcataagcg gatgaacgag gtcttcatca agatcggttt 3600 tgtgcgtgtg gtctcagatc cttgcgtata tgtctaccta cgtgataagg tccgcatcat 3660 tatccctgtc catgtcgacg acatgacaat tgcatcagca tcacgaccgg ccatcctcaa 3720 ggtcattgat caacttcgcg agtactttga tctcaaccat cttggaccag cagtcggcct 3780 tcttggtgtg cgtctcaccc gtgatcgccc taaccgcaaa ctctggattg accagcgtgc 3840 ttacgctgtt gaccttcttt cacgcttcaa catgctcgac gccaatccgg taactactcc 3900 catggaccct aacataaagc tcagaaagtc ccagtcccca caaacatcac aggaggtcga 3960 agaaatgagc cgaatacctt atattcaagc tacaggaggc ttgctctacc tggctctcgc 4020 aacacgtcct gacatcgtgt attcggtggg agttctatgt cgttttaact ccaatcccgg 4080 gcttgaccat tggaaggctg ttaaacacct tctccgctac atacgtggga ccttggatta 4140 tcggattgag tactccgcca ttgcctcgac acctgcacca acgtttttca aagcattttc 4200 tgacgctgat cacggcggta atcttgacaa cggccgatct actactggga tgctacttat 4260 ggttgctggt ggtgctgtta gctggagcag caagatccaa accattgttg ctaactccac 4320 aaccgaggcc gagtttattg ctgccagtga aactggacga gagctttgct ggatgcgaaa 4380 tttccttgat gaaattggta cgacacaagt tggtccctcg gatctcaaaa tggacaacaa 4440 ttctgcaatc tccgtctcaa agcagcctga gcatatggga aggctcaagc acctcgaccg 4500 ccattggttt tggttgcgac aggctgttta cgataggaaa atcattccct cttacatccc 4560 tactgctgag atggcagcag atattctcac caaggcacta actcgagacc ttgtcgagaa 4620 gttcaggcgc atgatggggg ttgttgggga gtggtccaag gagccactgc agtgatattg 4680 tccgggacat catcaggtgg gag 4703 // ID Copia-2_VA-I repbase; DNA; FNG; 5447 BP. XX AC ABPE01003718; XX DT 13-FEB-2011 (Rel. 16.02, Created) DT 13-FEB-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Verticillium albo-atrum genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-2_VA_; KW Copia-2_VA-LTR; Copia-2_VA-I. XX OS Verticillium albo-atrum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetes incertae sedis; Phyllachorales; OC mitosporic Phyllachorales; Verticillium. XX RN [1] RP 1-5447 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Verticillium albo-atrum genome."; RL Direct Submission to RU (12-FEB-2011). XX DR Genome; ABPE01003718; Positions 19760 14314. XX CC Positions [2162-2653] - Integrase core CC 'GGTTT' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 377..5239 FT /product="Copia-2_VA-I_1p" FT /translation="MDSGKHRVTLTRREEWRTWLNSVRAVATSYEAWDLVD FT PDKEADRRPKRAKKPTAPSFEDETRDLSTLLTLYKIHLDIYNRESAGLQEV FT QKYILNTVDRSLSAVFMNEESSQEWLKALKDHFAPSDMGQKQYIRQRWEEH FT MAMIKKMDHEKWVDDFILLCIEVEKHGIPQLKEAADQNVQFLRAILSVSPG FT WAEAECAHQVRRELDDRDGGKVLALRYKSYLRDYQPHRLTTKPSQTSFYSD FT SVTFMGKDRNGQSDRTSAQDSHRDSPQQPNSSPSRRPFCDCVQQKRAPKAC FT QPICLCPNIAEHRRKKSYQSDQSLLDYIDEAKKRMKPSTLQFWNEKIKEDK FT KKRQEARGTSGVAVSAFNNFLGDDDEQSCDDEYTNFMSNFSVADSDEFQRT FT NRGGDLRHEFLIDSGTPNHITNNKDLLTNFREATDTVRYSTGGGKSSPVEG FT HGTMAIDTLDSKQNVISITCHDVSYIKDYPTTLISARQLQRQGLGLDSLRN FT MLFNPATGSEIPLQEKGGQYAVNTLHSTTSFYHSKKQRAAMVSDSRLWHER FT LAHAHPEAIQQLPYSCTGVGVKGPTTFECESCGVAKSKKIVSRRQPTYTTT FT NPFERVNLDFFSPGKSYNGMESCLVITDAYSGKIVVRILPARADGFQAFIN FT FEMFLWRQYHLVIKVLRVDWDSALRSNLVTWSEERGILLERSAARTPEQNG FT AAERNGGTIFTIARTLTSAAGFPEDLWPEVVQTAAYLHNLTPKGRLQWATP FT EGRLNDWLNSKGLAARTTAPGLAHLRRYGCRAYPLTADAQEGRNRKQKLAP FT RAHIGYLCGYDSSNIYRIWAPEMKAVLRTRDVGFDESTVFKDAQSPKPILQ FT DEFQALDVPVADAQNPFNSSLDIDEFDVATIDLFEKARTFQLGGKINAEIL FT RPREGHTASRDDVLPSPRPTPETETGQGQPAVSEGVEDAVIDTIQGAEDAV FT EDTIPVPDTIQVTEEAIEDTIEVARSPESTPQHTPADQSSAEDETETVEAR FT PSVAPAPQPSPSTQIQGTRRSARDRKPSQFKDGTPTDQALGARRSKKMPPS FT SHLTVEYSFFHTQTPSNKKPLLRTDMPAEPRSWKEMLHSDHKKQWLEASVQ FT EVNTIENMDTIEQVPFESVDQEIHKVLGLTWVWKYKVDSTGYVTKFKARLC FT VRGDQQPKNEMETYASTLAAKTFRTLMAICAEFDLDTHQLDAVNAFPNAHL FT DETVYVWAPQGWDLMQEMQQKKGAPQRRTIFKLKRALYGLRRSPLLWHKLL FT SEAMRKIGLVALEEDNCVFMGRGMIMFFYVDDIILMNRKEDRQAAEELKSQ FT LKSLFEMRELGEVRWFLGIRVVRDRSQRKLWLSLENYITEKMKDLNLQPST FT RGHLTPLSTDPAPAPKDYEASAESIQLYQRKIGSINYAATTTRPDVAKAAS FT RLAESLLRPTELHHLAADHCLRYLYRTRTLSLSYGGSGDHSTSLYCASDAS FT 
FGDDPITRFSSQGYTTMLFGGPIDWKATKQRSVVTSSTEAELVALSVAARN FT YLATLRLIKELQLRFHGKVPIQMFCDNRQTLLLLTSERPQATTKLKHVDIH FT RLWIRQTVRQGKISVDWMSTDKIVSDGLTKILPTQKHQNFVKLLQMECVPA FT VHQ" XX SQ Sequence 5447 BP; 1479 A; 1534 C; 1401 G; 1033 T; 0 other; gttatgagcc cgggcgacga tacagctcct tgaccttcga tatctacagc agggcgccta 60 cggccaatac gcctgctgcg aagctgtcta cagctgaatc gccgctgcac ttcaacagtc 120 aattgacgcg acgtttcgcc attgacgacc actggccaag ggccgttacc gacatcctcc 180 aggagagcga tcatctctcc acgacacatc gaagcaggag aaatcagctc ctcaagacca 240 attgaacctc cgagacgtgt ggcctagggc ttgagtacac gcggggtcaa ttgatgatat 300 tcaacataca cgcgatcagt cgcgaccatc gaacgacctc cccgacggtc tccgacaacg 360 cccagggatc ctcaagatgg actctggtaa gcacagagtg acacttacaa ggagggagga 420 atggaggaca tggctcaact ccgtcagagc cgtggcgacc tcctacgagg cctgggatct 480 agtcgaccca gacaaagagg ctgacaggcg gcccaagagg gccaagaagc caacagcacc 540 gtcattcgaa gatgagacgc gagatctcag cacactgctg acgttgtaca agatccattt 600 ggacatctac aacagggagt cagcgggcct ccaggaggtc cagaaataca tcctcaacac 660 agtcgacagg agcctttcgg ccgtcttcat gaacgaggaa agcagccagg agtggctcaa 720 ggcgttgaag gaccacttcg ccccttccga catgggacag aaacaataca ttcgccagcg 780 ctgggaggag catatggcga tgatcaagaa gatggaccat gagaagtggg ttgacgactt 840 catcctactc tgtatcgaag ttgagaagca cgggattcct cagcttaaag aagccgcaga 900 tcaaaatgtt caattccttc gggcaattct ttccgtgtct ccagggtggg ccgaagcaga 960 gtgcgcccac caagttcgcc gggagctgga tgatcgtgat ggagggaagg tactggcgtt 1020 gaggtacaag agctacctta gagactacca accgcatcgc ctcaccacca agccatctca 1080 gacgtccttc tacagcgact cggtgacctt catgggcaag gaccggaatg gtcaaagtga 1140 tcggacgagt gcccaggatt cccatcgtga ctctcctcaa cagccaaaca gctcgccttc 1200 tcgcagacca ttctgcgact gtgtacagca gaagagggca cccaaggcgt gtcagcccat 1260 ctgtctgtgc ccaaatatcg cggaacaccg gaggaagaag agctatcaat ccgatcagag 1320 cctgctggac tacattgacg aagcgaagaa aaggatgaag ccaagcacat tgcaattctg 1380 gaacgagaaa atcaaggagg acaaaaagaa gcggcaggag gcaagaggca ccagtggagt 1440 cgccgtctct gcttttaaca acttcctcgg tgacgatgat gaacagagct gcgatgacga 1500 
gtacaccaac ttcatgagca acttctccgt cgctgattcg gacgagtttc aacggacaaa 1560 ccgaggaggg gatctccgtc acgagttcct gatcgactcg ggtacaccca accatataac 1620 caacaacaaa gacctcctca ccaactttcg cgaggccacc gacactgtaa gatacagcac 1680 aggaggaggc aaatcaagcc ccgtcgaagg gcatggcacg atggccatcg acaccctcga 1740 cagcaagcag aacgtcatca gcatcacctg ccatgatgtc tcgtacatca aagactaccc 1800 gacgacgctt atttcagcca ggcagctcca acgccaagga ctcggactgg actcgctcag 1860 gaacatgctg ttcaacccag ccactggctc agagatccca cttcaggaga agggtggcca 1920 gtatgcggtg aacaccctac atagcaccac ttccttctat cactccaaga agcagagagc 1980 cgcgatggtc agcgactcga ggctctggca cgaacggctg gctcatgccc acccagaggc 2040 catacagcag ctgccataca gctgtactgg agttggggtg aaagggccaa caacattcga 2100 atgcgagtct tgcggggtag cgaaatccaa gaagattgta tctcggagac agccaacgta 2160 cacgacgacg aatccatttg agcgcgtcaa cctggacttc ttcagccctg ggaagtcgta 2220 caacggcatg gagtcgtgcc ttgtaataac ggacgcatat tcggggaaga ttgttgtcag 2280 gattctgcct gccagagcag atggcttcca ggcgtttatc aactttgaga tgtttctctg 2340 gagacagtac catctggtca tcaaggtcct gagagttgac tgggactcag cactacgcag 2400 caatctggtt acttggtcag aagagcgcgg cattctcttg gagagatcag ccgcaaggac 2460 accagaacag aacggtgccg cagaacgcaa tggaggcacg atattcacca tcgcaagaac 2520 ccttacctcg gctgctgggt tcccggagga tttgtggcca gaggtggtac aaaccgctgc 2580 atatctccac aatctcacgc cgaagggacg cctacagtgg gcgacaccag aaggaaggct 2640 gaacgactgg ctcaactcca agggccttgc cgctcgaact acggctccag gcctggctca 2700 cctgcgaaga tacggatgca gggcgtatcc tctcacggca gacgcccaag agggacgtaa 2760 ccgaaagcag aagctggctc cgagagccca catcggctat ctctgtggat acgacagcag 2820 caacatatac agaatctggg ctccagagat gaaagcagta ttgaggacaa gagacgtggg 2880 tttcgatgag tccacagttt tcaaggacgc ccaatcccca aagccaattc tacaggacga 2940 attccaggct ttggacgtgc ccgttgcgga cgcccagaat cctttcaact cgtcactgga 3000 cattgacgag ttcgatgttg ccaccatcga cctgttcgag aaagcccgga cattccaact 3060 tggaggaaag atcaacgcgg agatactgcg accccgggaa ggtcacacag ccagcagaga 3120 tgatgtcctg ccgtcaccac gtccaactcc agagacggag actggacaag gccagccagc 3180 tgtctctgag 
ggggtggaag atgccgtaat agacaccatt caaggggctg aagatgctgt 3240 ggaagacacg atccccgtgc cagacaccat tcaagtgact gaagaggcta ttgaagacac 3300 tatcgaagta gctcggagcc ctgaatcaac acctcaacac acaccagctg atcaaagttc 3360 agctgaagac gagacagaga ctgtggaggc ccgaccatca gtggcaccag ccccacagcc 3420 gtcgccatca acacaaatcc aaggcactcg aaggagcgct agagacagaa agccgtcaca 3480 gttcaaagac ggcaccccca ccgaccaggc tctaggagcc cgccgcagca agaaaatgcc 3540 tccatcatca catttgaccg tggaatactc attctttcac actcagactc catcgaacaa 3600 gaaaccgctt ctccgaacag atatgcccgc ggagccaaga tcctggaagg agatgcttca 3660 ttctgatcac aaaaagcagt ggctagaagc aagcgtccag gaggtcaaca ccattgaaaa 3720 tatggacaca atcgagcaag tccccttcga atcggtcgac caggaaatac acaaggttct 3780 cgggttgacc tgggtctgga aatacaaggt tgactccacc ggctacgtga ccaagttcaa 3840 ggccaggctc tgcgtacgag gagaccaaca gcccaagaat gagatggaaa catacgcgtc 3900 gacactggca gccaagacgt tcagaacgtt gatggccatc tgtgcagagt ttgacctaga 3960 cacgcaccag ttggatgccg tcaacgcctt tccaaatgca cacctcgacg agacagtcta 4020 cgtctgggcc cctcagggct gggatctcat gcaggagatg caacagaaga agggcgctcc 4080 acagaggcgc acgattttca agctgaagag agctttgtac ggtttgcgcc gatcaccgct 4140 tctctggcac aagctgctgt ccgaggctat gcggaagatc ggcctcgtgg ctctcgagga 4200 agataactgt gtcttcatgg gcagaggcat gatcatgttc ttctacgttg acgacatcat 4260 cctcatgaac cgcaaagagg atcgccaagc ggccgaagag ctgaagagcc agctcaagtc 4320 tctgttcgag atgcgagaac ttggcgaggt cagatggttc ttaggcatca gagtggtcag 4380 agatcgtagc cagcgcaagc tctggctgtc tctggagaac tacatcactg aaaagatgaa 4440 ggacctcaac ctccagccat ccaccagagg tcatttgacg cctctatcaa cagacccagc 4500 acctgctcca aaggattacg aagcctctgc tgaatccatt caactatacc aacgcaaaat 4560 tggttccatc aattacgcag caaccacaac acgacctgat gtggccaaag ctgcctcaag 4620 acttgcggag tctctgctca ggccgacaga actccatcac cttgcagcag accactgtct 4680 ccgctacctc tatcgcacaa ggactctctc gctgtcctac ggaggatcag gagaccattc 4740 tacatcgctg tattgtgcga gcgacgcatc attcggcgat gatcccatca ccagatttag 4800 ttcacaagga tacaccacca tgctctttgg aggtccaatt gactggaaag ctacaaagca 4860 gcgatccgtg gtcacttcta 
gcaccgaagc tgagctggtg gccctcagtg tcgcagcgag 4920 gaactacctt gccactcttc gtctgatcaa ggagcttcag ctacgttttc atggcaaggt 4980 tccgatccag atgttctgtg ataaccgcca aacgctgctg ttactcacta gtgagaggcc 5040 tcaagccact acgaagctca agcatgttga catccatcgt ctttggatcc gacaaacagt 5100 cagacagggg aagatatccg tggactggat gagcactgac aagatagtga gcgatgggct 5160 tacaaagatt ctcccgactc agaagcatca gaacttcgtg aagcttctgc agatggagtg 5220 cgtaccagcg gtccaccagt gaacagcttg ctcaatttcg cccacctctc ctaacactta 5280 cacacatcct tatttgcttc acaagcacga tttccttcat ctgccctgag aacgccaact 5340 gttccaaagg caccaaattc tctaccaaaa gccgccaaca ggcatctgcc tgatctttcg 5400 tggttcagat ggtcagagag aggctcgcct acccatctga gggggtg 5447 // ID Gypsy-88_MLP-LTR repbase; DNA; FNG; 386 BP. XX AC AECX01000283; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-88_MLP_; KW Gypsy-88_MLP-I; Gypsy-88_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-386 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000283; Positions 80602 80987. XX SQ Sequence 386 BP; 116 A; 77 C; 61 G; 132 T; 0 other; tgtaatgtcg tcgaagactc cattacaaac atagtacaga catatattct tatacatgat 60 tacacgacct ttttgatatc tctcgtctcg cagaacacta actcctcagg gaatctagac 120 tgattagatt tcatctcttt ggagttaggt gagcaacaac atcatttcac ttatatcatc 180 ctatttcctt attatatctt gtatggttcc tcagggaatc tagactgatt agatttcatc 240 tctttggagt tagataataa aaactagtat agtttggttc tagaaaatca gtcgagatca 300 ctacgtgatt ctcagttccc tcacattccc gtcactgttc ggtaaaaata agattaaatt 360 agtgaggact caataggtcc tttaca 386 // ID Mariner-3a_AN repbase; DNA; FNG; 1860 BP. XX AC . 
XX DT 09-JAN-2004 (Rel. 9, Created) DT 09-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mariner-3a_AN is a subfamily of Mariner-3_AN - a consensus DE sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW mariner superfamily; Mariner-3a_AN; Pogo clade; transposase. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-1860 RA Kapitonov V.V. and Jurka J.; RT "Mariner-3_AN, a family of nonautonomous DNA transposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(12), 211-211 (2003). XX DR [1] (Consensus) XX CC It is a subfamily of Mariner-3_AN. CC This subfamily is characterized by the TA TSDs and 15-bp TIRs. CC The CC 524-aa Mariner-3a_AN transposase is encoded by a single ORF (pos. CC 194-1765). CC The Mariner-3a_AN and Mariner-3_AN consensus sequences are 89% CC identical to each other. The Mariner-3a_AN consensus sequence CC was reconstructed based on multiple alignment of 3 copies ~1% CC divergent CC from the consensus. 
XX FH Key Location/Qualifiers FT CDS 194..1765 FT /product="Mariner-3a_ANp" FT /translation="MAPPKRLSDVQRKALRDWVHSQSRRPTQKACIAWFQA FT HYNHRLSQSTVSDILSPQYHYLDSECNPSSATRKGIGQWQDLEAILYEWHH FT TLDCKGAYISGDILIEKARQIWSSLSQYRDQPPPAFSSGWLHRFKQRYNIK FT QRTYHGEAGSVPEDVEEEMKAIRTIAGQYNEDDIYNMDETGLFWRMPPSQS FT LSSVNRPGIRKDKTRISMICCVNASGTDRLPIWVIGKAHKPRALRNINISA FT IGIRWQWNKNAWMNQIIMREWLLEFYQHIGQRSILLTMDNLPAHLSGLELA FT PPPPNIRICWLPKNSTSRYQPLDQGIIQNLKIYYRKQWLRYMLSHYERDLD FT PLESVTILDCIRWLVRSWHHDVLSSTILACFYKSTLVPDPIQLPVEAPDLK FT PLYEKVQQSGNLSDCMDISFFLNPAEESQEPTSSSNGMSSEVLLEQLITEA FT SGSTDIYSDDLDDDTAEPAPLPKPQDALDAVRLLISYMEGQDASKAPILRS FT LERLERDLEGEIITARAQGTLDSWLSNA" XX SQ Sequence 1860 BP; 514 A; 439 C; 404 G; 503 T; 0 other; tacagtgcgg ccccgccaag ccgatactcg cttagccgat aacctcgctt agccgatatc 60 tcatctcggg atgaattcca tcccatataa atttacctcg ctttactatg taaccccgat 120 ccccgatata tcgacaagca tatcgtctat ttgaaaagct cgtcctagtg gaaaccttat 180 tctaaccaat catatggctc caccgaaaag gctttctgac gtccagcgga aggctttgag 240 agactgggtt catagccagt ctcgccgtcc aacacaaaag gcctgtatag catggtttca 300 agctcattat aaccatcgct tgagccagtc tactgtctct gatatcctca gcccacaata 360 tcattatctt gactcggaat gcaatccttc ctcggcaact cgcaaaggta ttggccagtg 420 gcaagacctt gaggctatcc tttatgaatg gcatcataca cttgattgca aaggggcata 480 tatcagtggt gatattctta ttgaaaaagc acgccaaatc tggagttctc tatcccagta 540 tcgtgaccag cccccacctg ctttcagtag tggttggcta catcggttca aacaacgcta 600 taatatcaag cagcggacat accacggaga agctggctca gtaccagaag acgttgagga 660 agagatgaag gctatacgta cgattgctgg ccagtataat gaggatgata tctataatat 720 ggatgaaact gggcttttct ggcgtatgcc tccttcacag agcctatctt ccgttaatag 780 gcctggaatt aggaaggata agactcggat atctatgata tgctgtgtca atgcctctgg 840 gactgatcga ttaccaatct gggtaattgg gaaggcacat aagccacgag ctcttcgcaa 900 tatcaatatc tcagcaattg gaattcgatg gcaatggaac aagaatgcct ggatgaacca 960 aattattatg cgtgaatggc tcctggagtt ctatcaacat atcggccagc gatcaatcct 1020 tcttacaatg gacaacctcc ctgcacatct ttctggccta gagctagcac cacctcctcc 1080 taatatacgc 
atctgctggc tgccaaagaa ttcaacaagc cggtaccagc ctcttgatca 1140 gggtattatc cagaacctga agatatatta tcggaaacag tggttaagat atatgctttc 1200 ccactatgag agggaccttg atccactaga atctgtgacg attctagact gcatacgctg 1260 gcttgtacgg tcctggcatc atgatgtcct aagctcaact atccttgcct gcttctataa 1320 gagcacactg gttccagatc ctatacagct tccagttgaa gcacctgatc taaaaccact 1380 atatgagaag gtacagcaat ctgggaatct atcagattgt atggatatct ccttctttct 1440 taaccctgca gaggagtctc aagagcctac tagctctagt aatgggatgt cctctgaggt 1500 attacttgag cagctaatta ctgaggcttc tgggagtaca gatatatatt cggacgatct 1560 agatgatgat acagctgagc cagcacctct tccaaagcct caggatgctc ttgatgctgt 1620 aagactactg atctcttata tggagggtca ggatgcatct aaagcaccta ttcttagatc 1680 acttgagcgg ttagagcgag atttagaggg tgaaattatt acagcgaggg ctcagggcac 1740 cttagatagt tggcttagta atgcttagat aataataaaa acttcacctt ggcgataacc 1800 tcggataggc gatatttttt gctgggatga cttgtatcga ctaaacgggg ccgcactgta 1860 // ID TDH2_LTR repbase; DNA; FNG; 376 BP. XX AC AJ439551; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Debaryomyces hansenii retrotransposon TDH2_LTR, long terminal DE repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; KW Long terminal repeat; RNaseH; TDH2_LTR; gag; integrase; pol; KW protease; retrotransposon; reverse transcriptase. XX OS Debaryomyces hansenii OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Debaryomycetaceae; OC Debaryomyces. XX RN [1] RP 1-376 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439551; Positions 1 376. 
XX SQ Sequence 376 BP; 150 A; 59 C; 53 G; 114 T; 0 other; tgttggtttt gtgtcactat cagaatctat aaagattctg aaacaaatca ttgtaagaac 60 cataaatagt catgctagtg atactgatag tcataagagg atatgggaat aatcttaatg 120 agttgaaacg aaacaagaat ataaattagg aatgaatctc caagaagaag atgaattttc 180 taattaacga acagcaatta agcagaatat acttaattaa gactacccta aaaagtcatc 240 gataggtaga tcgatcactt aactacttct atagtaacca atattgatat ataagaccag 300 tatatctata atactaatac ataagtattc ttccctacat atttcaggta ttcacacgac 360 ctttatcata atccca 376 // ID Gypsy-111_MLP-I repbase; DNA; FNG; 5862 BP. XX AC AECX01000696; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-111_MLP_; KW Gypsy-111_MLP-LTR; Gypsy-111_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5862 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000696; Positions 1495 7356. XX CC Positions [4733-5212] - Integrase core CC 'TGTGG' target site duplication CC LTRs are 97% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 1677..2909 FT /product="Gypsy-111_MLP-I_3p" FT /translation="MVTGFDGSSSRSSFETSLYINQDSTLTLFIITRIKDT FT YDGILGIPWIKKNSARIDWKQGLIHDSTHDIAATAVVSSSLLQPSLAQGLD FT PLRDARCNNEGMCVLTDTSTSPQCEFDTLLPSVSPCTAGKPQSLLNLQNQT FT TPRGIHDKDKTDELLTADVWTQAPHVAAAKAVLLSPSRPSPAQGMDPMREA FT RSNNEGMRVSIDTSTSPRCEFKPFLIALSPCTAGKPCHFPNLEPHVDQPGT FT TPKTGRIATKGQQPIIAADTSASSIPTQNPEDLTSIPKGQARNFDEGACDS FT QDTVQPPQCEFDLPTPSAVIESAGQQEHFPDNSPPSINAANASWSTSACLA FT ADEKKKAVIKPLEETVPAYYHCHLNMFRKSKAQCLPPCRKYDFRVQLVPGD FT NLRPAGSSHFRQRRTRP" FT CDS join(2804..4027,4031..5590) FT /product="Gypsy-111_MLP-I_1p" FT /translation="MPPALPKIRLQSPTSPRRQPQASRIIPLSPAENEALD FT EMITEGLASGTIRRTTSAWAAPVLFTRKKDGKLRPCFDYQKLNALTVKNKY FT LLPLTMDLVNSLLDADEFTKLDLRNACGNLRVAEEDEDKLAFICKQGQFAP FT LTMPFGPTGAPGYFQYFIQDILLGRIGKDTAAFLDDTMIFTKPGENHEQAV FT DGILDILSKHQLWLKPEKCEFSKKEVEYLGLLISKNKVKMDPTKVKAVRDW FT PAPQNVNELQRFIGFSNFYRRFIDHFSKTTRPLHNLTRTNVPFEWNNECNN FT AFESLKTSFTTAPVLKIADPYRPFLLECDCSDFALGAVLSQKCKEDGELHP FT VAYLSRSLVQAERNYEIFDKELLAIVAAFKEWRHYLEGNPNRLEVIVYTDH FT RNLESFMTTTQLTHQAWWAETMGCFNFVIKFRPGRKTAKPDALSRRPDLKP FT HEDKKLTFGQLLRPENIDHDSFPTELASMETFFVDESIELEDAEKWFEVDV FT LGVSNEEEEETNHEKTLTDDKIIDLIRQANGNDPRMSELMNIIQNPISLDI FT KKAVKKYKIKDGIVYNSGRIEVPEDNKIQFEIIRSCHDTLLAGHAGRSKTL FT SLVRRSFVWPSQRAYVNRYVDGCDSCLRVKSSTEQPFGTLEPLPVPAGPWL FT DISYDLITKLPKSNECDSILTVVDRLTKMAHFLPCKESMSAEELADLMITN FT VWKLHGTPKTIVSDRGSIFVSQITKEIDKRLGIRLYPSTVYHPRTDGQSEI FT VNKAIEQYLHHFVSYRQDDWATLLPTAKFSYNNKDHESIGVSPFKANYGFN FT PNFNAVPSAEQCVPAVEGRLKTPSKVQKELTECLKLTQETMKHNSDRTVQD FT TPNWNIGDQVWLNSQNITTTTTRPAPKLDYQWLGPFSIKEKISTSTYKLNL FT PLSMKGIHPVFQYKTRMNGKSQAF" XX SQ Sequence 5862 BP; 1798 A; 1454 C; 1325 G; 1285 T; 0 other; tattgtcaga tcttataatc caagacagga ctgaagaact ccctcgactt accgataaaa 60 ttctcgatct tcacaagatt caagattgaa tggtttgaaa ccaatagttt agaactaaaa 120 agattaccac cgattcggat tccttcggaa ctgacgattg acctgcatca actgaaccat 180 aaccttaatt gtgatctgcg actcgtagaa caaagaacca caccgaacct tatcaaatca 240 ccgatctatc tgcgacacca 
cggctcccac gtacagcgcc ctcccgctag acaacactga 300 ctcagattac gacaccggat cagatcccgg aatctccaca tgcttcgtcg acgccaactc 360 tagtacaggc accgagaaac ctgaacctac agggatggaa gaaattgaac gtcagctcaa 420 caagctagct aattcattag cagaagaaag atccttacgt gctcaagcca aagcccgact 480 ggctgcgata gaagccaacc agcagagaag ccaacctcag ggtaactcga tgccagtacc 540 aatgaatcct cagcctgcct cgatcccaaa gggaccaaag gtctccgttc ctgataaatt 600 cagcggcacc agaggaggac ctgccgagat cttcgcgagt caggtgcaat tgtacatgct 660 cacacacccg cacttatttt tagaagatcg aagcaaagtg gtctccgctc tctcgtatct 720 cacgggagct gctagcgctt gggctcagcc catgactcag gagctgtttg atgagtcaac 780 cacccacaag gtcacgttca agcgcttcgt cgccaattcc aaagcaatgt atttcgacac 840 cgagaagaaa tccaaggcag aacgagcgct ccgtgcctta tctcagaaaa ctacctagcc 900 gcgtatgcgc acgagtttaa catacacgcc gccgccacgg gttgggaaac ccctaccttg 960 atcagccagt tcaaacaagg ccttaaaaga gagataaggg tggccatggt tatgatgcaa 1020 gacccgttca aatcgatcaa agatatcgca aacctagcca tcaggatcga cagcaagatc 1080 cacggagttt cagaccacac cactcacacc actactgtct ctgctgatcc gaacgcaatg 1140 gacatatcag ccaattacgt gtgtttaacg gacgaagagc gtgcttgtcg attacgaact 1200 ggatcatgct ttcagtgcgc aaagcaaggt catagagcta atgtatgccc tgacagaaga 1260 gctggtagtg gagggaaagt gaaaggagga tttaaagcta aaattcatga attagaatct 1320 aaattggcag ctatgagtag taaagatgaa actgtgatgg atagagggga aggaccaagg 1380 cgtgctgagg cctcaaaaaa tggaggggct caggcatgaa tgttgtgcct aacctgagcc 1440 tgactaggga ttggattgga gtagatgtgg gtgctagtaa acttatcaaa tgtaatgaaa 1500 atgatcctcg tttgttttat cgcgcctctc tgtcccacat tcccaatccc caagccacaa 1560 aaccaaatcc tctttttgcc tattttctca tcgactccgg agcaactcac gacgttctga 1620 atgagtcatt tgcggcaaca tctaacctcc ttgcctgagc cactcgtacc gacagaatgg 1680 taacgggatt tgatggttct agcagcagat cctcatttga aacatcactt tacatcaatc 1740 aagactcaac cctgaccctg ttcatcatca caagaattaa agacacatat gatggtatct 1800 taggaatacc gtggataaag aagaactcag cccggattga ttggaaacaa ggactcattc 1860 acgacagtac tcacgacatt gcggccacgg cagtggtttc ttcaagcctg ctacaaccct 1920 ctttagccca aggattggac ccattgaggg acgctaggtg 
caacaacgag gggatgtgtg 1980 ttttaacaga cacgagcaca tccccgcaat gtgagtttga cacattatta ccttctgttt 2040 caccttgtac agctggcaag ccccaatctc tcctgaattt acagaaccag accacgccaa 2100 gaggcatcca cgacaaggac aagaccgacg aacttctcac cgctgacgta tggactcagg 2160 cacctcatgt cgcagctgca aaagcagttt tgttaagccc gtcaagaccc tctcccgccc 2220 aaggaatgga cccaatgagg gaagctagga gcaataacga ggggatgcgt gtttcaatag 2280 acacaagcac atccccgcga tgtgagttca aaccgttttt gattgctttg tcaccctgta 2340 cagctggcaa gccctgtcat ttcccaaatt tagaacccca cgtcgatcaa cctgggacaa 2400 cgccaaagac cggcaggatt gctacgaaag gacagcaacc aataattgcg gctgacacgt 2460 cagcctcgtc aattccgacc caaaaccccg aagatcttac cagtatacct aaggggcaag 2520 cgaggaactt tgacgagggg gcttgtgatt ctcaagacac agtacagccc ccgcaatgtg 2580 agtttgatct ccccactcca tctgcggtga ttgaatcagc tggccagcag gaacactttc 2640 cagataacag tcctccaagt atcaacgccg ccaacgcttc ctggtcaaca tccgcatgcc 2700 tagcagccga cgaaaagaag aaagctgtca tcaaaccact agaagagacg gtaccagcct 2760 attaccactg tcacctgaac atgtttcgca aatctaaagc tcaatgcctc ccgccttgcc 2820 gaaaatacga cttcagagtc caactagtcc caggcgacaa cctcaggcca gcaggatcat 2880 cccactttcg ccagcggaga acgaggccct agacgaaatg atcacggagg gtcttgctag 2940 cggaactatt cgacgtacca catcagcgtg ggccgcccca gtcctcttca ccaggaagaa 3000 ggatgggaag ttacgaccgt gctttgatta ccaaaaactg aatgcgctta cagtcaaaaa 3060 caagtacctg ctcccactaa ctatggacct agtcaatagt ttgttggacg cggatgaatt 3120 caccaagctg gatttgcgta atgcctgtgg caacctacgt gttgcagaag aagatgaaga 3180 caaacttgct tttatatgta agcaggggca gtttgcaccc ttgaccatgc cttttgggcc 3240 gaccggagcg ccgggatatt tccaatactt catccaagac atattattgg ggaggattgg 3300 aaaggacaca gccgcattct tggacgacac catgattttc accaaacctg gcgagaatca 3360 tgaacaagca gtcgatggca tattagacat cctcagcaag catcagttgt ggttgaagcc 3420 ggaaaagtgc gagttttcaa agaaagaagt cgaatattta ggcctactca tttctaagaa 3480 caaggttaaa atggatccta caaaagtgaa agcagttagg gactggcctg caccccaaaa 3540 cgtgaatgaa ttgcaacgtt tcattggctt ttctaacttc taccgccggt tcattgatca 3600 tttctccaaa acaacaagac cactgcacaa cttgacccgg acaaatgtcc 
cttttgagtg 3660 gaacaacgaa tgcaacaacg ccttcgagtc tctgaagaca tctttcacaa cggcgccagt 3720 cctaaagatt gcggatccct accgtccatt cctattagaa tgtgactgct ccgattttgc 3780 attaggggca gtcttgtccc agaagtgtaa agaagatggt gaactacacc cagttgcata 3840 tctgtcacga tcgttggtac aagctgagcg caattacgaa atctttgaca aggaactcct 3900 tgccatagtg gctgccttca aagaatggcg ccactatttg gagggcaacc caaatcgact 3960 cgaagtcatc gtatacacgg accaccggaa cctggaaagt ttcatgacaa caacgcagtt 4020 aacacattga caggcttggt gggcagaaac tatgggatgc ttcaacttcg tgataaagtt 4080 caggccgggt cgtaaaactg caaaaccaga tgcgctgtca agacgacctg atttaaagcc 4140 gcacgaagac aagaagctaa cgtttggtca actcttgaga cctgagaaca tcgatcatga 4200 ctcattccca acggaactgg ccagcatgga aacatttttt gtagacgaga gcattgaact 4260 agaggatgca gaaaagtggt tcgaggtaga cgtgttggga gttagcaacg aggaggagga 4320 ggagaccaat catgagaaga cgcttaccga tgacaagata atcgacttga taagacaagc 4380 taatggtaat gacccaagaa tgtctgaact catgaacatt atccaaaacc caatatcatt 4440 ggacatcaag aaagcagtga agaagtacaa gatcaaggac ggtattgtat acaactcagg 4500 acgcatagaa gtccccgagg acaacaagat ccaatttgaa atcatacgaa gttgccatga 4560 tacactactg gcgggtcacg caggaaggtc aaagactcta agcctcgtac gtcgaagttt 4620 tgtgtggccg tcgcaaagag cttacgtcaa cagatacgtt gacgggtgtg actcttgctt 4680 acgagtcaaa tcaagcacgg aacagccctt cggtacatta gagccactcc cagtcccagc 4740 gggcccttgg ttggatatca gctacgatct cataaccaag cttccaaaat caaatgagtg 4800 tgatagcata ctaacggttg ttgatcgact tactaagatg gcccactttt taccatgtaa 4860 ggaaagcatg agcgcggagg aactcgctga cttgatgatc acaaacgtgt ggaagcttca 4920 tggtacccca aagacaatag tgtcggacag gggtagcata tttgtgtcac agatcaccaa 4980 ggaaattgac aaacgactcg gcatccgttt atacccttcc accgtgtatc accctaggac 5040 ggacggtcaa tccgagattg tgaacaaggc cattgagcaa tatttacacc actttgtctc 5100 ataccgccaa gacgactggg caacattatt acctacagcc aaattttcgt acaacaataa 5160 agaccatgaa tcaataggcg tctcaccctt caaagcgaat tacgggttta atccaaactt 5220 caacgcggtc ccgtctgccg aacaatgcgt gccggcagtg gagggaagat taaaaacacc 5280 gagcaaagtt caaaaagaac tgactgaatg tttgaaattg acacaggaga caatgaagca 
5340 taactccgac aggacggtac aagacacacc aaactggaac ataggggatc aagtatggct 5400 gaactcacaa aacattacaa ctacaactac aagacccgca ccgaaactgg actatcaatg 5460 gctaggtccc tttagtatta aagaaaagat ttccacttca acgtataagc tgaacctgcc 5520 tttgtctatg aaaggaatac atcctgtatt ccaatacaag actcgaatga atgggaagtc 5580 tcaagcattt tagactgccg cacacgacgc aacaagaagg aataccttgt gagtcggaag 5640 gggtacagtg ccgagcataa ctcgtgggaa ccagtagcaa acctcaagaa ttgtcaagaa 5700 ctagtgaagg aattcaataa tctatttcca caggcgtcaa ggaggtacag aaaaaggaga 5760 agaatgtgag agggtaagct ttttcccaat gggtttttta atgctacccg gggaggaatg 5820 cagaacttgc aagaggaagt ttgggcatca aaggggggat aa 5862 // ID TKM1_I repbase; DNA; FNG; 5224 BP. XX AC AJ439546; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Kluyveromyces marxianus retrotransposon TKM1_I, internal region. XX KW LTR Retrotransposon; Transposable Element; RNaseH; TKM1_I; gag; KW integrase; internal region; pol; protease; reverse transcriptase; KW internal portion. XX OS Kluyveromyces marxianus OC Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina; OC Saccharomycetes; Saccharomycetales; Saccharomycetaceae; OC Kluyveromyces. XX RN [1] RP 1-5224 RA Neuveglise C., Feldmann H., Bon E., Gaillardin C. RA and Casaregola S.; RT "Genomic evolution of the long terminal repeat retrotransposons RT in hemiascomycetous yeasts."; RL Genome Res 12(6), 930-943 (2002). XX DR Genbank; AJ439546; Positions 386 5609. 
XX SQ Sequence 5224 BP; 1846 A; 1222 C; 1063 G; 1093 T; 0 other; tggtagcgcc gctaagacac ctattgtccc ctcagacaat caggcaagtc cagaggaatc 60 aggtagcatc gatgtcgagc ataaggccaa tgccgacgac gaagcatccg gatcccacga 120 aggtcatcat gaaaattttt caaaggattc aacaaaaacc ggaactccac aggacaccct 180 acacaaagag cctgtggacg gttctcccgt tccaaattac gcccaacaaa gctggtacta 240 ccatacacag ccaccgccac agttttatcc atctccatac gctaattatg ggcctggtcc 300 ttacactcct ccgtgggcca acatgaatat gcccatccca ggagctaaca cgaacccaga 360 taaaactgga ggacattatc aaacaacggg tccatctggg gacagctccc aatacaaccc 420 tgcgtactac ttcccgtacc aattggaacc aaaaccaggg ttgcaaagcc aaacaacgac 480 cgaatgtact ttcggccaat tgcacaatcc cagaaaaaca ttcattctgt caaaggtaaa 540 ctcacctcac tattacgaca gctgggtaag agaaatgtgc caggcaatgc tcgagcaaaa 600 cttgggccac ctcattccaa cagaggataa caaaacccca gaaaccccag aacctgcaga 660 gaagaggtac atagaagaaa tccatatagc atgtgtgcct gaagaaaaat acccaaagtg 720 gttaaagtcg agttacgacg aaggtgaagc actaattgac tgcttcaagg aaggaattag 780 aagaataaag gcggaagaag atcctgaggc aatcgtgtta gccctagctg gactaacact 840 agactacaga gagtctattc ccaacttcgc caagagatta aggaaaatcc accagagaac 900 ggtaaacgca aaattcccca tggcagaaaa gatcgtcatt tccagagcac taaaggctct 960 accagaacga tacgagaaag ttgacatcaa cttcaccaaa tcaaacgatc agagcttcaa 1020 taacttcata aacatcctcc taacaacgga gccaaggatg aaaagctcaa atcaatatga 1080 taatcaacgg ttttacacca attccgaaag gaagcaggtg gacagacgtc caaacagaaa 1140 aaccgtacaa cacattggat caaaaaccgt agagggggat cttaggcctg actcagaggt 1200 ccaggaagaa taattcccca gtcgatcccc agagagactt tatgcttgat ggtggagcaa 1260 cagtctctgt gatatatgac aaaaagttaa tccacaattt tacatctgaa tctaatcaaa 1320 ttcttgtcga cgttcaacaa aacgaagtcg ccgtgaaagg tgagggtaac ttagaactca 1380 agtttaaggg caaacgaatt tccctacccg ccatatatgc accatcaaca cacaccaata 1440 tcatcagtgt agaagatcta acaaaatcac aggcgtatct agatctaaga aggaactgcc 1500 tgctatccaa gaacgggaaa accatcgctc cgacccacaa gttcgcgggc ctaaggtggt 1560 tatctcgcaa aaatgctata gaactaccaa accagactca atcagtctat gcaatcacac 1620 ccagatcggt aagatctgcg ccagacaagt 
tctctctgac aagcatccac aacatgtttg 1680 gacatatgaa tatcaattac atccgagaat cattcagaaa aggcttaatt caaggtgtga 1740 aagaagacga tgtagactgg acaggagtca gctcattcca atgccaacac tgtatggaag 1800 gaaaagccaa acgtaataac cattacgtga acgctagaaa agattatacg aaggaatatc 1860 ttcctttcga atacttgcat accgacgttt ttggaccagt aagagtacaa agaacccgta 1920 ctactccaag gtacttcatt gcattcatag acgaggtcac aaaatacata tggaccttcc 1980 cgttactaca taaaacagca gaagaagtag ccccgacatt caaggaagtc gtcatgctga 2040 tttatacaca gttcaacacg agagtgaaaa ccatccagat ggacaaagga tcggaatacc 2100 tgaacactaa ggtacagaaa ttcctaaggg aaagaggaat tgtttcgaga gaaacaaccg 2160 ttgctgattc aaaagcaaat ggagccatag aaagacagca ctatacactt ttgaatgact 2220 gtagaacatt cttgagacaa gctaacctac gccctagatt gtggtatcat gccgtcgtat 2280 actctacagt aatgaggaat tctctcctca atagaagtat aggaacgtcc ccgagaaaca 2340 gggcggggat gtcgggactg tcattcagag acattcttcc cttcggacaa cccgtgattg 2400 tccacttacc caacccaaaa tcgaaattac aagctcgagg agtcctaggc tatgccctcc 2460 atccatccac tagatcatac gggtacatca tagtcgtagg gaagaagaag aaaataccta 2520 tcgacactcg aaactatcgg gtcctgaact acccaccagg tgcaacaatc tcagaagacg 2580 aggtgcagta catgatcgac cgtatggaaa ataacgacgc agaatctcaa gacgatatag 2640 agtcgaactt tgaaccaaat tatacggata tggagcaacc cattcaccac acagcggact 2700 atttcccgaa tacaaccgcc tcaaacatcg agacagacca atacaataat gatagctttg 2760 gtctgcatta cgggggtgat tccgtaccgc ccgagtcgtc cagtagcgag gacgaactgt 2820 tccccacaga cgaatcagaa aacgattcag actcatcgga ccaatcattc aatgatgatg 2880 gagaccccat gtcccctcca tattccgggg gtgaggaaca gatagtacca acaccagccc 2940 cgattagacg cgttccacca atggaaccac catctcctgt cgaggactct cccccacctt 3000 tatctgggac agacctcgac gacttatttg gagaatctaa cataaataat tacatcccag 3060 aagatacaga tctactggca ctaaaccatg aatctgttcc ggaacccgat accgtggtac 3120 cagaaacaac gaacattgaa caagacaata ttctaccatt agtaactaac gaaaactcta 3180 atcctccaaa cgatagctct gaccagtccg gcgacgagtc aggcgaagaa tccggcgaag 3240 agtcagtcga caacagactc aagtccatac ctattttcaa tggaaacaag cacaaagatg 3300 caagaactgc tgaagcagat ttggattctt tgtacggggg 
tggagataat acagaaaata 3360 acaatggacc aactttagaa gaggtgttca gatcgatcga agaagatcca ttcatgctaa 3420 cacagaaaag accgagatca cgtgcccgct atcgcgaatc taaccaagac agttgcgatt 3480 ccgggggtga ctacgaatca ggttcagact cagacgggtc atctgacgag tcacctcaga 3540 agggaagaaa aatacaacgt gtgaactatg ttaacgcggt cctaaaacca gtgaatgtaa 3600 taccacttaa tatgtcctta aactactcgc aggcaatatc tcggaacaga aacgaagaag 3660 agaaggatgc tttccagaag gcataccaga aagaaatagc acagctaacc aaaatgaaca 3720 cttggaacga agaattaata gatgcctcaa ctcttcctaa gaagaagatc ctaaactcaa 3780 tgttcatttt caccactaaa agagataatt ccaagaagtg cagactagtt gctagaggag 3840 accaacaagc cgcagacacg tatgacacgg aattaaaggc aaatacagtg gacaacctag 3900 ctctcatgac agtcttagca ctaacactag actacaacct aacagcgttc cagctcgata 3960 tctcatcggc ctacctctac gcagacctta aggaagaact gtatatcaga gctcctccac 4020 acatgaatgc taaaaacaag gtactaagac taaataaatc actctatgga ctaaaacaga 4080 gtggagcaaa ttggtacgaa ctaatcagat cgttcctaat taagaaatgt gacctgattg 4140 aagatagaat gtggaaatgt gtttttagag acaaagaacc gctgaaactt attatatgtc 4200 tgtttgtcga tgacatgctg gttgtaggaa acgacgtcaa gtatatcaag aaattcatat 4260 caaagctatc taagagattt gatacaaaga ttgtaaatga tggttcacac aggccagaag 4320 atggagtaaa cgagtatgac attttgggca tagaattaga gtataagaaa aaagaataca 4380 tgaagttcgg aatgcagaag tctctagagg acaagctgcc acaactggga atacccctac 4440 tcccaaacgc taaaatcagg aaggttccgg gggtgcctgg agactacatc ttctcaggaa 4500 aggaactgac attaaacgaa agagaatata aaagcaaagt taaacacctg caaagaattg 4560 taggactagc gtcctacgta ggacataagt tccggtttga catcttgtac tacgtgaaca 4620 tcttagcaca acatcaactg tatcccagcg ccaaggtcct agacagggct gcacaattat 4680 gccaatactt gtgggataca agagataaga aactagtttg gcattattct ggtcccaagg 4740 aaaacaacgt taccgctgta tcagatgcag catttgcagg gaaccaagat tttaaatcac 4800 aatcaggaac tctttacctg agaaacaaca agcccatagc agccaaatca agaaaaatca 4860 agttaacttg tatctcgtcc acagaagccg agatatacgc aatcagtgaa agcctgccaa 4920 tactacgtgg gttagaacac ctagtaaaca agttacaaga cataaaagca acagtaaagg 4980 ttaaaacaga cagtcaacca tcaatggcaa taataaacgg cacggatgac 
tcagcatgcc 5040 tcaagaaaca cattggtagt agggcaatga ggataagaga tgagtgcgat gatctcggac 5100 ttacactcga atatatcccc acaaaagaaa acaatgctga cgttttaacc aaacccctat 5160 ccgtgaagct attcaaactt ctcacagagg actggataca atagctttct cctagtaggg 5220 ggtg 5224 // ID Gypsy-11_MLP-LTR repbase; DNA; FNG; 596 BP. XX AC AECX01001578; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-11_MLP_; KW Gypsy-11_MLP-I; Gypsy-11_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-596 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001578; Positions 40990 40395. XX SQ Sequence 596 BP; 144 A; 141 C; 75 G; 236 T; 0 other; tgtagcaggg ctacttctta gtcctaacta caataggtta caacggtcgt tatactaatt 60 agataccata ccgttacctt ataaactagc ctggccctca tttgtaaacc agctagatta 120 tatctcagca aggcgcacaa gccttttact ttgtacgcct tccttttttt ttgtctcatt 180 gatgtcatcg gctgtttgtg ccttgacaat cactttgtat aaatatcgaa agattctctt 240 gttattcaaa cttttggtat cttatctttt atcactttac ttttgcttaa aacttgtttt 300 cgcatctttt gtcttttcaa accttctgtt atcaaacttc ttttttctta tcttaaagat 360 cccattattg catttctctg aaagaacaat ttagaattac tacttaaaag catctacgtg 420 ctttgtccta tactttagaa ctcagccgtc cccttttaac gttcttctac ttcaattgtc 480 tttcctcaaa gcatagatct gtgatcttta cttctttgtc catattcttc taagctcccc 540 cttaaggtca ctggatctta tcacttaacg tgggtaccta tctccacccc gttaca 596 // ID Gypsy-12_MLP-LTR repbase; DNA; FNG; 195 BP. XX AC AECX01001319; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-12_MLP_; KW Gypsy-12_MLP-I; Gypsy-12_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-195 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001319; Positions 66261 66067. XX SQ Sequence 195 BP; 52 A; 58 C; 34 G; 51 T; 0 other; tgttatgatc cctaactcag gatcacaggg atgtcacaga atcgtgacat gtagagagtc 60 tgagccgcac ttgtaccaaa ccacatatca tccagacttc tctcctcatg ctacaataat 120 catatcacgt agtggtcacc aatagtctct ctccctctct gccctcttac gttgacccag 180 aggccggtca taaca 195 // ID SINE3-2_AO repbase; DNA; FNG; 3294 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE SINE3 nonautonomous non-LTR retrotransposon - a consensus DE sequence. XX KW SINE3/5S; SINE; Non-LTR Retrotransposon; Transposable Element; KW SINE3-2_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-3294 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-3294 RA Kapitonov V.V. and Jurka J.; RT "SINE3-2_AO, a family of 5S rRNA-derived nonautonomous non-LTR RT retrotransposons in the Aspergillus oryzae genome."; RL Repbase Reports 6(1), 46-46 (2006). XX DR [2] (Consensus) XX CC SINE3-2_AO is a family of SINE3-like retrotransposons in the A. CC oryzae genome (a conceptual consensus sequence, 2 copies). The CC 65-bp 5' terminal portion (pos. 6-70) is derived from 5S rRNA CC (pos. 
3-67, 98% identity to the A. oryzae 5S r RNA) and includes CC the polIII internal promoter. SINE3-2_AO encodes a hypothetical CC protein (pos 400-2500). There are several diverged families of CC SINE3-2_AO-like elements in the A. nidulans, A. fumigatus, and A. CC oryzae genomes. XX SQ Sequence 3294 BP; 925 A; 744 C; 770 G; 855 T; 0 other; gaatcagtac atacgaccat agggtgtgga gaacagggct tcccgtccgc tcagccgtac 60 ttaagccaca tattacgcgt ggtcagtaag tataatggcc tttagccctt taaaaagctg 120 aaacaacggt gactatcgac ttagggtgag tgctggtgat tggggtgagt gccctatatc 180 cggccatcac cagcggtgat tcctcctcac caggggtgat gcagctaaca acggagccac 240 ccacaatgat ttttcgtgct aaatttagga agacacggga ttcccagatg cagataaata 300 cgtcttgtgt ctttcaaaga aatcctttct tgtttcatcc tcaacacgtc tgcctttctt 360 tatccatacg agcacgtgac atcacatgac catgttttcg aacgagcacc tcgaaaagtt 420 gctggaagag cgcaagagac tagccgagtc acgtggattc accctagagt atcaacagga 480 acaggaaaag attgagaata ctgtttgtca ccctcgaatt gctccagcca cggaggagaa 540 gtacgaacga gcggttacca actgggcgct gtaagccaat cttactacta tgatatatta 600 gcattgagat agcagaagct aatatagatc aggtggaggc tctctcggag cgaacccaag 660 gatgcaaatc ttaccagaga agatcccgac ccgacgccac agcaactaaa actattcgca 720 gaaagctacg ttgtttcccg aaagacaaag ccctctcaga agtcagcatg taataatttc 780 acttgtttca cctctaagtg ggagcgtgaa acgtcgcgaa cgcttccctt gggtttaaaa 840 aaggatgttc tgaatgtacg ggctgctcat gtcctcatat ttattactgg ggcgactaat 900 tatagatagt atatccggac tgtgttaaca gaaagacaca gtcttcctac taagccaaga 960 gagcggttct tggtgacggc gaaagatata gatcatcttc ttcaccatct atttggcgaa 1020 gacaaccatg attatgtaca cgagcgtgca agagtacaga ctgcaagttc actggctctc 1080 ttttctggct ctgctgctag agctggtgct attgtcgaat cgagctcgta caggaaaacc 1140 aatgaatgcc tctactataa ggtaattagt ctatatcacg cgtgacaatt ctttctgact 1200 tctattgcag catctgactt tcaatctaaa atggagtggg gatactggtg ggttgaaacg 1260 ctgggtcgta attgaccccg aatttctcaa aggactacgc tacagagatg acaaaatgat 1320 gtgggtatat attcagccgt tcatacttgc ggtagctaaa ttacgatcag accaaagaat 1380 tggtttcgtg aacatcctgt gcttggtaaa 
agctttgtgt tttgggtcat tgtgcatggc 1440 attgcggatg gtgcattcaa gggcatctca accgtcgaag aattgcttga aaaacgaccc 1500 ccaaaaggaa gagagtcgtg gaccctcgag tggaaggagg atgcgaaaga gttacctttc 1560 tttcgaatga ctacttccga aggtcccaaa gcaaacgaag cgtggacatt ctcttcgtta 1620 cgtcaccatc tcactagttt ggctgaacgg gatggcttca gagaccgctt acgagtgcat 1680 ggaatcaggg gtgctatggc aaacaaaatc gatcgtaagg tgtatatcaa tctataggag 1740 gaacataatg ctaactattt tttctgctta gccaaggcat cagccgcgac ccgcggccag 1800 gcgttagatc atatggatca tgactcatac ttgaaatacc agtcatcgct caaagcagta 1860 gatatgatgg ctctgtacca tgacctggat ccggattatg aatgtcgtga aatggaacag 1920 agcatggccc accaccgtga tcaaaatgtt ccgcttcgac ttgatgctgc ctctctcgca 1980 gagttcgaaa aagatgagga agtgattctg attaaccaac ggatcagtga actgacgcag 2040 gagattcatg gccggcctga taaacatgcc gacttagtat cggagagatc taaactttac 2100 acccggaaag ccaaaaagct ccgcaccaaa cgttccgaat tcattgaaaa ttggtggaat 2160 gtttgctatg acgagtacat tattgggaac gactttttgg aacgcgacac tacctgcctt 2220 tttcaaattt accgtaaata tatgcctgaa agagcacgct taaatgacaa cattttcaag 2280 caagtgcctc tcgatagtga tgttgggagg caatgtttac gtgatgcggt tagtctttgt 2340 acttcagtcg agaaggtggc atattatccg ggcatgactc ccgtagaggg aaaatgtcca 2400 atatgtgcca agcagatgtc aaggtttgcg tgtgcttttt attttttggt agtcgccttc 2460 tattgctaat attaaccagc attggtctcc aagaacggtc aaagcatatt ttgcaatgca 2520 aacgcaaatc attgaatgcg gcaccatacc agcagaccta taggaacggc aaacgtgccc 2580 atcgccatac ccgtcgaagc ttcgttcaat tctgttattt gtgtgctgaa ttaatttgca 2640 acgaagaaga ttggatcaac cattgccaat ctcacctgca ggctctacag ccgaggtgcg 2700 ggctcttaac cttccgctat acactagttg cagcgggtct atgtccattc tgtctgggtg 2760 atgagaaaaa gagagcggac gagcggttcc aacagtgggt caagaaagcc acattgttaa 2820 accatattga cagacacctt gagaatttgc cccccgaagg aaatgtttcg tgccctcacc 2880 cttgctgtaa tggaaggcaa tattccgatg tatctgacct ccgtcgccac ttcttcaaca 2940 gacattctat agcggagcct aggtccaact gtgttgcaag gaaaaggcca tggactgaga 3000 catcaatggc tctggaatcc gttcatgggg atcacgatgg cgagggtaaa ataaagggag 3060 gatgaatggt tactgagttt ggtggccacc attcatggtt 
tcaaagccca gtcaagtgac 3120 cacctctaat cgccttgcgc aagatactag ctatctccct acaacattac ctgtgatata 3180 tcatcagcaa tgctgtacta tggccctaaa taaacgtccg aattgtctgg aggctctatt 3240 atttggcaat gcatcccctc gcacgaagtc ttcttttaac ccatacttta atgg 3294 // ID P-1_AllMac repbase; DNA; FNG; 5042 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW P; DNA transposon; Transposable Element; P-1_AllMac. XX OS Allomyces macrogynus OC Eukaryota; Fungi; Blastocladiomycota; Blastocladiomycetes; OC Blastocladiales; Blastocladiaceae; Allomyces. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). XX DR [1] (Consensus) XX SQ Sequence 5042 BP; 1036 A; 1453 C; 1491 G; 1062 T; 0 other; caggggagct aagtaagggg aaggacagcc tgtgttgagg ctcgggtaaa ttttcctcca 60 tcatttcttc cacgtcactc agctcttctc tcgcccacgg ctaagtccac cacggcagcg 120 ctgccgacca ctggctgcaa ccaccgacgc tcatccacca cgacgagtac caacagctgc 180 tgcaagcaac ggcagtgcgc gcccgcacct gcccgcagcc tgctgtctcg ccggcaccca 240 acaccaaccg tcgccgcctt tggccacctg cacgcccgct cccgtttgtg atccgtcgtc 300 gcctctcgtc cggccgtgcg cgccgaaaag gagccgctac gaccatgaag ctgccaacaa 360 gttatattgg cgacggacct cccgactttc catttgcgat ccatcgacaa ggtcggccgc 420 caaaagacgt aatcacgaaa gatcgtgcgt ccaactggcc cccgacacgg acggtgtgct 480 ttgccacggc cgggtactgg cacccggaac agagcctgaa ggactcgagc tacttttcct 540 ggcgctggtt tcctggcgac tacggcaagc gcacgctcgt gctcgcgttc atggacccga 600 agcgcaactc agccagttcg ggcctcccgc cacggcaaat catcttgcat gccaaggaga 660 agagcccgga catcttcacg tccgccgaaa tcatgtatct cggccacgcc atcccgcttg 720 ctgagctgcc caacattcca cccacgctct acccacctgg cggcgccgag gaccatgtct 780 tgatcaggac gtgtcacgac ctcttggatt tgctcgacaa gtgtggacgg tacgccacta 840 ccaccatgtc atgtcgcgcg gtttcgatgc gtgcatctca catcagaacc gtgtccacca 900 gctggaacca ttgccagggc gtctccgacg acgatattgt caaggccgtg 
tcggcgcaac 960 cgctgctcct gctctcgccg tttccgttca aggatgcgaa gaagctccac cgactcggcg 1020 acgtgtgcaa catgccgttt gcgtttcggc aagctgtgat ccccaagacg attcgcgccg 1080 cgccatgcac ccatgtcgca gatacagtgg gcagcatgca ttgcgcaagc tgttattcgt 1140 tggcgcgtac tttgcggtgg cgggtcgacc aagtcgatga gctggtggcg cgcactgtgg 1200 atgggaatgt gcacatggtc acccatgaca tgacgctgct cgagtatgtt cactgcattt 1260 ttggttccat ggtgatcggg ccacaattgt gaccaactga ttcgtcctgc cagtttcatc 1320 cagtacgccc agcccgacct ttacgacgag ctgtccaagc tcccctttct cgtcgaattc 1380 atcaaggcga ttggcacgcg cgtggcagca ccgggtgggg gccgtgggat tccatggtca 1440 gacacgattc gtggtggcgc gctcgctctc tattgcattt cacggtctgc gtacctgacg 1500 ttgctctcat ttggattccc gcttccgtcc cccaaggtct tggatggcga cattaaggaa 1560 ttctcgtcgc cgcctggcat cacggccaac tcggtgatgc ggctgcttgc gctgctcgac 1620 atggagctcg acaagttgcg tgccgagttc tcggacgtgc ccgaggtcga gcttcaccgg 1680 ttttcacggt gcggcctgct tgcaattgac gaggtccatt tgaacgccga attgttgaca 1740 agtgctgcgc acgaaatcaa gggtgctgtg aacctcggcc cgattctcac gccgagcctc 1800 ggcgtctcga ccaagacgga gcaggacgag gtgcctcttg caaagagcgc gctccaggtc 1860 atgtttgtct cgttgtcaac caacgttgtg tggccccttg gctacttcca gacgcaggac 1920 cttaagacga acggcctggt ttcgattctc gagacgctct ttatcgaggc cgggggtggc 1980 gagtttgacg tgatgtgttt ggcgcttgac ggatcatcaa tcaaccgctc tgcgagcaac 2040 ttttatgacg agaagggcat tgagcaccca ctctaccccg ggcagtgcgt gcggatcctc 2100 caggatctgc cgcacgcaat gaaaaagtgc cggaattcct tgctgaattg ccacgatcac 2160 ttcgagtttg ggaagttgaa aatgggctgg tccgtcttgc acgagttgta cgactacatc 2220 aaggccgaga agagcacgct cgacattctt ccaaagtttg ggccacaaca tctcaacgtc 2280 gatacatggg ccaaaatgcg cgtgccattg gccgcgcgcg tcatgtcccc tgacgtggcc 2340 aagcagctcg aggatgagct atttgccaag ggccgcccac tggcagcagg aacggccacg 2400 ttcatccgcc atgtgcacac gttctacagc atgtttaccg accgcttgcc gctgatcgac 2460 ccggcccatg atactttgtc ggcgcgcgct gcatgggccc gagatcaacg cgcagccgtt 2520 gcgtctgcgt gtgtccacga tttggctggc gataacgatg gcccgcatgc cgagcggatc 2580 ctagcggacc ttgccgagtt ggtcgattgg ttcaccaagt ggaaagagga ggccgatggt 
2640 aagaagaagg ttgcgcgggc tcacgtcaag cggcttatca ggagcggtgg taccacggcc 2700 gagttggagg cagcaaagcg tcaccgcgac gcgctgaaga aggtcttccc gcctgagcaa 2760 acgtttgacg atatttgcaa ggccatatct actttctcca ccgtgtgctg caacctctgg 2820 gccaagtatc cgtttgggat ccgcatctac ccgcgacggt ttggcacgaa tgttttggag 2880 tcgtttttct cgctcgtccg cacggttggc gggcacaacg acaaagtcgc tccacagcac 2940 atggctctcg ctgcgcatca tttctcgcac agccgcttat ttcgggcccg gctcgaggac 3000 atgcaggatt cgtacaatgt cgttcttcct gaagaccttg cagcaccagt tgaggagaat 3060 gtaggagatg gacgtgtgtt gcagcgtgag gaataaacaa attcaatcat catcatcatc 3120 atcatcatca ttgccaactg agacacacgc tcccctaact gagctcatca tgctcatcat 3180 ggtgcgccag ttgacgcggc cagtgatttg cgtcttattt gcggcagtgt actttgccgt 3240 caaatgcacc gtcagctcag ccagatgcaa cgtcaaaaac cggcgccgat agagccgcga 3300 gaacgtgtcc aaatgcttgc caagaacccg gcctgcattg gttgctgcag caaaagtggc 3360 ccacacctcc aggtgacggg gcgagacaat cacttcgtgg acgcggcgat gcgcccacat 3420 cgtgggcgct gcgcgcttgt cgggcccacg ttcttggagc gcgagaataa ttgcagcatc 3480 gagccaaaga atgtaggaaa tggcagcgtc ggacgcgtaa ataaggcgat cgatctcaca 3540 ttgagcgatg ctctgcacaa caggaacgtt tgacgcgcgc cgcacggcca actcactttc 3600 gggaacttga aggtgggcga ggtaggcgag aatgctcggc gcgtgctgcg tgctcgaata 3660 gcgcctgcag tgctcccgca gcgcgaaaat gagccagccc gtaatgtagc cgcacttgcg 3720 gagttggaga atcgagaagc gtgcggtggg cacagtcggt gtgccggctg tgagatcagt 3780 gggcgcgaca ggaccaactt gagagacaag ggcagctatg tcggggtacg gctcgaaatc 3840 gcgagggtga ttgatgagct ggtcgaccat gacccagagc atgcgcaaaa acagccggat 3900 gtggatatcc ttgaggtcct cgcgcgagaa ctgagagttg acaggatttt tgagcggatc 3960 gagtaccagc ggaaggggca aacgggctct gaggaaagtg acaacagatt tgttgccaac 4020 cgcaatcaca gcacgccggg tctgcaccat gggcgacttg tagcagccat agatggtgga 4080 gagctcaggg atcagcagca tgagcaagtc aacacggtcc ttttcattga aattcgagaa 4140 gttttcaaca agtgggtgca tgaacgtcga gttcacctcg tacgctgcaa gcttcttgat 4200 ggcctcaata gtgcgcaaaa aaatgtcgga cgacatgaga aacgtggaaa cgactgctgc 4260 cttcacgtcc aacggcttgt cggccacttg tgcacgcttc ggatcgacgt gagcatcgag 4320 
ctggatcgtg gccgcattgc gcttggacgg gaccgcaggt gggcgcaaat caaattcctg 4380 gaccccagga ggagccgcac gctggacctg cagcgactcg cgcacagcgc cgtactggtg 4440 cgtgagcgtt ttggctgact caccaccttt gctgcaatcg tcgttcaggc cgcaatcgtc 4500 gtctgggtca gaatcgtccg agtcgctgga gtccgagtct gattcggatt cggcgcagct 4560 gtcgtcgatg tcatcagtgc cgcgccaatc gtctgagttg tggattttgc tggactcgtg 4620 gtcaacgacg atttcctggt tgtcgagggg ggccaggaca aactcgacaa attcgtcttc 4680 gattgtcggg taactcatgg tcgtagcggc tccttttcgg cgcgcacggc cggacgagag 4740 gcgacgacgg atcacaaacg ggagcgggcg tgcaggtggc caaaggcggc gacagttggt 4800 gttgggtgcc ggcgagacag caggctgcgg gcaggtgcgg gcgcgcactg ccgttgcttg 4860 cagcagctgt tggtactcgt cgtggtggat gagcgtcggt ggttgcagcc agtggtcggc 4920 agcgctgccg tggtggactt ggccgtgggc gagagaagag ctgagtgacg tggaagaaat 4980 gatggaggaa aatttacccg agcctcaaca caggctgtcc ttccccttac ttagctcccc 5040 tg 5042 // ID PUNTRIP_NC repbase; DNA; FNG; 1874 BP. XX AC AF181821; XX DT 08-NOV-2002 (Rel. 7.1, Created) DT 08-NOV-2002 (Rel. 7.1, Last updated, Version 1) XX DE Neurospora crassa DNA transposon PUNTRIP_NC. XX KW DNA transposon; Transposable Element; PUNTRIP_NC. XX OS Neurospora crassa OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Sordariaceae; OC Neurospora. XX RN [1] RP 1-1874 RA Margolin S.B., Garrett-Engele W.P., Stevens N.J., Fritz Y.D., RA Garrett-Engele C., Metzenberg L.R. and Selker U.E.; RT "A methylated Neurospora 5S rRNA pseudogene contains a RT transposable element inactivated by repeat-induced point RT mutation."; RL Genetics 149(4), 1787-1797 (1998). XX DR Genbank; AF181821; Positions 74 1947. XX CC 44bp terminal inverted repeats. 
XX SQ Sequence 1874 BP; 630 A; 350 C; 287 G; 607 T; 0 other; tagacgtgct aaccccaacc ccggtcacgc cccaccctcg gtcatataac ttcatcacca 60 ggaatcaaat aacattgcaa actagccact tttacaggga ttttaatact tcaaaatgtc 120 ataattttat tcctaatact cttcaataat ataaaggatt ggctggaggg ctctccccaa 180 ctcttaaagc agtctcatcg tcgtcgttac cgtagcaacc cttctgcgct taggaccctc 240 ctttaaagca tcacaaactt ctattatcta ttctatctgt acaaatttat cgttaccaat 300 ctctttaacc ttccttttac ctctaggctt ctataactcc aattaaatct ataaatatta 360 aatttagtaa tccttagtag cgaacttaat gttcttaata ttaagggcct tggctgcctt 420 agtgaaaaca gtcctaatat tacggtcgat agaggagaaa ttgtaaatac ctcccaatat 480 atccctagac ctctaaggaa ttggaaaaat aatgtttcct atcgaaataa gttcgtttag 540 tagtattcta ggtagtaaag gctcggctag caaggcattg ttattaataa ctaccgcggg 600 atcgtcaata ataatagatt tatttactag ctatattcct atttttctaa accctaatct 660 aaggtttcta atacttaccc cccttcttaa aacttcctaa taatacttaa ggaaccgtta 720 tttatttact aatacaaata ttagaaaatt agcgaaatcc ctaattaaat ctctaaaata 780 agatttaata attgaaaaaa taaagtggtt aagcggttaa gttcttatag gaagtataag 840 aaggtaaaaa aagtaattaa atattgtatt aaaattactt caataaggaa gtcgttaatt 900 atataattaa tatacttatc gaagattaaa agacgctaac tatcgtcgat tagcttcatt 960 tctaatagat aaacctcctt caactataat agtacaattt ctaaattgct ctatctacta 1020 gcaatactaa tatacttcca actcttaagt tcgtagtcca tatcaaagaa ccattatcct 1080 tagaggttaa taccaaaaaa aataacaacg ggaatcaatc tcttaccctc aatagtacca 1140 aacttaataa tagttaccca aattgttata tcggtcgtta ttaaatatac tttcttcgtc 1200 aaaatagtcc caaggacttt accgtctatt aattttaatt cttaaagatt atatttattt 1260 atattagata tattagtagg aataataccc ttatcgcgaa tctaaacttc caaaaagtcg 1320 aagaattttg taaaggcttc cttaatagac cccctaactc taactaactc aaaggggctc 1380 gaaatcttta ttataacaat tagatttctt tttataaaac agtctaccca atgcgccccc 1440 aacttattat aattgcctcc tctttttaat atcgctatta taaagccaac atatttaacg 1500 acggttaaga gcccaactaa ctttctcttt atttaatatc catttgtata agaaggattc 1560 ttaggcgggg gacagccgtt attagggtat tttaattata cgaagggagg aggtgcctcg 1620 tatacggtcg gagatagtag attatagtac 
gctatacttc ttagttacta tacgctagga 1680 aagtccgttg cggaccttag atatcgctta gtcgagtaaa gacgaagcta ttgcgaccgg 1740 aatcaatcaa atcgatagta ttttagtaga gttattgtta ttaaaagatc gagagtatga 1800 aagtgtatag gagaaatggc ggtgtgggga tgtgaccgag ggtggggcgt aaccggggtt 1860 ggggtcgcga cgta 1874 // ID Gypsy-1_ARO-LTR repbase; DNA; FNG; 281 BP. XX AC ABVF01000027; XX DT 02-MAR-2011 (Rel. 16.03, Created) DT 02-MAR-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Arthroderma otae genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_ARO_; KW Gypsy-1_ARO-I; Gypsy-1_ARO-LTR. XX OS Arthroderma otae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Onygenales; Arthrodermataceae; OC Arthroderma. XX RN [1] RP 1-281 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Arthroderma otae genome."; RL Direct Submission to RU (02-MAR-2011). XX DR Genome; ABVF01000027; Positions 53874 54154. XX SQ Sequence 281 BP; 52 A; 97 C; 52 G; 80 T; 0 other; tgtaaggggt tggaccccta ttcccttctc gggtcaccgc gcgctccgta atcgacgagg 60 cagcaaggcc tgtctggaga tgccatcacc agactccccc tttttactct cgcgcacgtg 120 accctcaaca cggtcacatg acctcaacgc ctgaagctat agctccatcc cccatcaccg 180 acttccttcc ttttcccctt tcttcttctt ccccatgtgt tttgtcttcg acttacggat 240 acccttagtt agttgaatag aactctcttc ggccgttaac a 281 // ID Gypsy-17_RO-I repbase; DNA; FNG; 5122 BP. XX AC AACW02000311; XX DT 28-FEB-2011 (Rel. 16.03, Created) DT 28-FEB-2011 (Rel. 16.03, Last updated, Version -1) XX DE LTR retrotransposon from the Rhizopus oryzae genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-17_RO_; KW Gypsy-17_RO-LTR; Gypsy-17_RO-I. XX OS Rhizopus oryzae OC Eukaryota; Fungi; Fungi incertae sedis; Basal fungal lineages; OC Mucoromycotina; Mucorales; Mucoraceae; Rhizopus. XX RN [1] RP 1-5122 RA Jurka J. 
and Kohany O.; RT "LTR retrotransposons from the Rhizopus oryzae genome."; RL Direct Submission to RU (28-FEB-2011). XX DR Genome; AACW02000311; Positions 330546 335667. XX CC Positions [3772-4275] - Integrase core CC 'GTAAG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 52..4911 FT /product="Gypsy-17_RO-I_1p" FT /translation="MSINSNLLASNEEILFSESDPAFIGPRLEKSTMVNGV FT ELSSHGSPISNTEDIEMEEVEVGAPISPLAGVKISGAAPPSQVLGGRNLTA FT TIQTLTKQVEQVAAAITSQELTGKQLANLSREYTTKSTILASLLDLQAKLA FT NNLAIQTNSSAINGSNSQPTVVPQNLPLFQYTGNVENSALPVYKDIAECIR FT RFEDILFAYRLDVQVHWQRLIPFSLTHPQRSWYDELRVKKNLSWATFKEQL FT TLKYGIESTEAQAHASRELLTISMQPDESVEKYTDRFLKLKREATPVADDF FT LAILFVNSLPGSLQNHVNMAIVSLDQEAKNDINLLSSIARKFHTKQQTLPN FT TSTRKFNNQPSQVNGLNPPSGIHKRNHRTDGRTCVIHGKCNHTTDRCKKAQ FT ELGITGSMNKETFSKRCFKCNAPFTREHLNVCPKRNQPLSAGSASSSSTAP FT TITRAFRSLQLTGSEDVPMGTKASKYNPANQQDHDEEQDLDDEAVKDIHYY FT SSLCKLQPFNTKLHSFNNKRTDSFFVPITIENVRTWALVDTGCNASTISPD FT LAKIINVRTFPVSGKLQLASLNTLVPRTAITDQIKVFYNGINVLHKFEIFK FT FDKDIPICIGTDLMPKLNINLTGLAVTWDSPNISTIPDPIDPSPIKPNKSP FT AGTHQQREQLMDALQPYLSANAKIPSTSACPLPEAIVRLETEPGKTTYRRQ FT YPIALSIQPKVQEQVDKWLADGVIEIAPPNTAFNSPLLTVRKKDANGNYSD FT DIQVCIDPRHINAILIKDRTDRFQLPLIAELHQKMSAATLFTVLDLKQAFH FT RLPLAEEHRALTAFTFNGQQYQFCRAPFGLTPIPNHFQRVLTTLLHDLPYV FT TCFIDDLTIGTGPDIEEHIKCITTVIERLTSAKFILNVDKCHFMQTSVNIL FT GFTISHNSLSLDSRKVANALAWPIPKTGKDIQRFLGLANYFRSHLPNFSTM FT TAPLDKLRNKGSLEHIWTDKYLQAFRNIQTALSNAPVLSIPDMRHELQLAT FT DASNTGIGGVLYQIIDDKTRYIGFAARALSKSEMNYSTTKRELLAICYLFE FT RFHKWLYGSHFTLYTDHKSLVYLGTQNIPNPMMLNWFETIFNYTFRVIHRP FT GIENVLPDSLSRLFSDYTEKRLGGDSNPQQLSIKNKITKSQNKRTTENSTN FT IKQRAVQLVDYMTPPEEERHNLLLKAHLLGHFGADAIVKTIHNDNLHWINI FT KKDALQLVSNCPECQQFNIGKHGYHPPRSILPEGPFDHVCIDLGTLNVTST FT EGHNFILVLVDLFTRFTILRPLMDKTAVSIANELRDIFCTFGWPKKLSSDN FT GTEFTNEIFNSLMETCGIDHRLSLPYNPLGNSTNESFVGIAKRALVKRLQG FT KKDEWNLFLPSIQYAMNCKYSRLHHSRPFVLMYNRQPNEFQDYSNTQSSIP FT NSSQQYAKALENKLDNMNNIVIPAIRNRIMTTQRIDNEYFTKKHRIVHQQF FT 
PTNSEVMIKNVNRNSKTDPRYEGPFVVHGVTKNGSYILTDKTGALLSRDVP FT TSHIKLISTDNVVNNRADQQQYDVQAIIQHKGNNTSNYKYLVRWKGYPPEY FT DTWEPASSFDDMSMIEQYWARRNVNNNIGKRKRSSHTNLPTKTNNLNKRSK FT TK" XX SQ Sequence 5122 BP; 1633 A; 1131 C; 903 G; 1455 T; 0 other; ttttttttga cgaattctct ttactctatt actactacac tattctcaaa catgtctatc 60 aactctaacc ttttggcttc caacgaagaa attctttttt ctgaaagtga tccagctttc 120 attggtcctc gacttgaaaa atctaccatg gttaatgggg tggaactctc tagtcatggt 180 tcgcccattt ctaatacaga agacatagag atggaagaag tagaagtcgg ggcacccatt 240 tccccacttg ctggtgtcaa gatcagtggc gcagcaccac cttctcaggt cttaggtggt 300 cgtaatttaa cggctaccat tcagactttg acgaaacagg ttgagcaagt tgctgcggct 360 attactagtc aagagcttac gggaaagcaa ctggctaacc tttctcgtga atacaccacc 420 aagtccacca tcttggcctc cttgttggac ctgcaagcaa aactggctaa taacttggct 480 atacagacca atagctctgc cattaacggt tctaacagtc aacccacggt agtccctcaa 540 aacttacctt tgtttcaata tactggtaat gtggagaact cggccttgcc tgtctataaa 600 gacattgctg agtgtatccg acgttttgag gacatcctct ttgcgtatcg tttggatgta 660 caggtgcatt ggcagcgcct catccctttc tcacttaccc atccccagcg ttcatggtat 720 gatgaattac gggtaaagaa gaacttatct tgggccacat tcaaggagca attaaccttg 780 aaatacggca tcgaatccac tgaggctcaa gctcacgcct caagagagct gttaaccatc 840 agcatgcaac ctgatgaatc tgtggaaaaa tacaccgacc gcttcttgaa gctcaagcgt 900 gaagccacac ccgtagctga tgattttttg gctatcttgt tcgtgaactc ccttcctggc 960 agcctccaaa atcatgtcaa tatggccatc gtcagtctgg accaggaggc aaaaaatgat 1020 atcaacctcc tttcaagtat cgcaagaaag tttcatacga aacaacagac cttgcccaat 1080 acgtcaacca gaaagttcaa taaccaaccc tctcaagtca acgggttgaa ccctcccagt 1140 ggtattcata aaagaaacca ccgtacggat ggaaggactt gtgttatcca tgggaagtgc 1200 aaccacacaa ctgatcgttg taagaaggct caggaactgg gtataactgg cagcatgaac 1260 aaagagacgt ttagtaagcg ttgctttaag tgcaacgctc cttttactag ggagcattta 1320 aatgtttgcc ctaaaaggaa tcaacctctt tccgctggtt ctgcctcctc ctcctctact 1380 gcacctacca taacccgagc attccgctca cttcagttga ctggcagtga agatgtacca 1440 atgggtacaa aagcttctaa gtacaaccct gccaaccaac aagaccacga tgaagaacag 1500 
gatcttgacg atgaagccgt caaggacatc cactactact catcactatg taagttgcaa 1560 ccgtttaata caaaacttca cagttttaat aacaaacgta ctgattcctt ttttgtaccc 1620 atcactatag aaaatgtccg tacttgggct ttggttgata ctgggtgtaa tgcttctacg 1680 atctctcctg acctagcaaa aattataaac gttagaactt ttcctgtttc aggaaaatta 1740 caattagcta gtttgaatac cttggtacct agaactgcta ttacagatca aataaaagta 1800 ttttataacg gaatcaatgt cttgcataaa tttgaaattt ttaagtttga taaagatata 1860 cctatttgta ttggtacaga tcttatgcct aagttaaaca ttaacttgac aggtttagct 1920 gttacgtggg atagtcctaa tatctctaca atacctgatc caatagaccc ttcacctatc 1980 aaacctaaca agtcacccgc tggcactcac caacagcgtg aacaacttat ggacgcatta 2040 cagccatatt tgagcgctaa tgctaagatc cctagcacat ctgcatgtcc tctacctgaa 2100 gcgatagttc gcttggaaac tgagccaggt aaaacgactt atagacgaca gtatcctatt 2160 gcactgtcaa tccagcctaa agtacaagaa caggttgata aatggttagc agatggcgtg 2220 attgaaatag cccctcctaa tactgcattt aactctcctt tgttgactgt acgtaagaag 2280 gacgctaatg gtaattatag tgacgatata caggtatgta tagaccccag acatatcaat 2340 gcaatactca ttaaagatcg taccgaccgc tttcaattac cattgatagc tgagttgcat 2400 caaaaaatga gtgctgctac gctattcaca gtacttgacc tcaaacaagc ttttcatcgt 2460 ctacctctag ctgaagagca tagagcactc acagctttca catttaacgg tcaacaatat 2520 caattctgta gagcaccatt cggtttgacc cccataccaa atcacttcca acgtgtgcta 2580 actactctct tacatgactt gccatacgta acatgtttca ttgatgattt aactatcggt 2640 actggacccg atattgagga acatatcaaa tgcataacta ctgtcatcga acgacttaca 2700 tcagctaaat tcattcttaa cgttgataaa tgtcatttta tgcaaacttc tgtaaacatc 2760 ctgggcttca ctattagcca caatagtctc tcactggata gcagaaaggt tgcaaacgct 2820 ttagcgtggc caataccaaa gactggaaaa gatatacaac gctttctagg tctcgctaat 2880 tattttcgaa gtcatctccc aaatttctct accatgaccg caccgttaga taaactacgt 2940 aataaaggtt cattggaaca catttggact gacaaatatc tacaggcttt tcgaaatata 3000 cagacagcct tatctaatgc tcctgttttg tctatacctg acatgcgtca tgaattacag 3060 ctagctacag atgctagcaa cacaggaatt ggcggtgtac tttatcaaat tatcgacgat 3120 aaaactcgct atattggatt cgcagcaaga gcactgtcga agtcagaaat gaactactcg 3180 acaactaaac 
gagagctgtt agccatctgt tatctttttg aaagatttca caagtggttg 3240 tatggatctc atttcacgct gtatactgat cataaaagtt tggtttatct tggcacacaa 3300 aacattccta atcccatgat gctgaattgg ttcgaaacta ttttcaacta tacattcaga 3360 gtaatccatc gtccaggaat tgaaaatgtg ttgccggatt cactttctcg tcttttttca 3420 gactatacgg aaaaaagact ggggggagat agtaatcctc agcaactttc cataaaaaat 3480 aaaattacaa agtcacagaa caaacgtact acagaaaaca gtacaaatat aaaacaaaga 3540 gctgtacaac tagtagacta catgacacca ccagaagaag aacgtcataa cctattatta 3600 aaggcacatc tacttggaca ctttggggct gatgctattg taaagacaat ccacaatgat 3660 aatttgcact ggattaatat taaaaaagat gcattacagc tcgtatccaa ttgtcctgaa 3720 tgtcaacaat ttaatattgg caaacacgga tatcaccctc cccgtagtat tctaccagag 3780 ggtccttttg atcacgtctg tattgacctt ggtaccctga acgttacatc aacagaagga 3840 cacaatttca ttttagttct agttgatctt tttacaagat ttacaattct aagaccatta 3900 atggataaaa ctgcagtatc catagctaat gaacttagag atatcttttg cacttttggt 3960 tggcccaaaa aacttagttc agataacggt acagaattca ctaatgaaat atttaattca 4020 ttaatggaaa cctgtggtat tgaccataga ctttcactgc cttataaccc attgggtaat 4080 tcgactaacg aatcgtttgt tggaatagcc aaacgagctc ttgttaaaag attacaaggc 4140 aagaaggacg agtggaatct tttccttcct agtattcaat atgctatgaa ttgtaaatat 4200 tctagattac atcattccag accattcgtg cttatgtata ataggcaacc aaatgaattc 4260 caggattatt ctaatacaca atcctcaata cctaactcta gtcagcaata tgctaaagca 4320 ctagaaaata aactagacaa tatgaacaac atagtaatcc ctgctatacg taaccgtatt 4380 atgactactc aaagaattga caacgagtat tttacaaaga aacaccgcat tgtacatcaa 4440 caatttccta ctaatagcga agttatgata aaaaatgtga atagaaatag taaaactgac 4500 ccaaggtacg aggggccctt tgtcgtacat ggagtcacaa aaaatggtag ctacattttg 4560 acggataaaa caggtgcatt attatcgcgc gacgtaccta cttcgcacat caaacttatc 4620 tctacagaca acgttgttaa taatcgcgct gaccaacaac agtacgatgt gcaggcaatt 4680 atccaacata aaggaaataa cactagcaat tataaatacc tagtaagatg gaaaggttat 4740 cctccagaat atgatacctg ggaacctgcc tcaagttttg atgacatgag tatgatagaa 4800 cagtactggg caagacgtaa cgtcaataat aacattggca aacgtaaacg ctcatctcat 4860 actaatttgc caaccaagac 
taataatctt aacaaaagaa gcaagacaaa gtaacgagct 4920 gaatatggtc ttttcccatt actttttttc ctaaacaggt ttttatgggt ttttatcgat 4980 tatataccag aagagcatgc tctctctttt ctctacactt acgtctttaa tatgaacagc 5040 ctaagcaatc atattagata taatccatgt actcttcgtt attcatgatg ttcatcggtt 5100 gttccaacct ggaagggggc aa 5122 // ID DNA-3_AN repbase; DNA; FNG; 6094 BP. XX AC . XX DT 09-JAN-2004 (Rel. 9, Created) DT 09-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Nonautonomous DNA transposon. Putative classification: MuDR DE superfamily - a consensus sequence. XX KW MuDR; DNA transposon; Transposable Element; Nonautonomous; KW DNA-3_AN; nonautonomous DNA transposon; KW putatively MuDR superfamily. XX OS Emericella nidulans OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Emericella. XX RN [1] RP 1-6094 RA Kapitonov V.V. and Jurka J.; RT "DNA-3_AN, a family of nonautonomous DNA transposons in the RT Aspergillus nidulans genome."; RL Repbase Reports 3(12), 205-205 (2003). XX DR [1] (Consensus) XX CC Nonautonomous DNA transposon. Putative classification: MuDR CC superfamily. CC 9-bp TIRs. subterminal palindromes. 
XX SQ Sequence 6094 BP; 1730 A; 1279 C; 1317 G; 1766 T; 2 other; gtggtcgcag ataactccct aaattccctg agctaactct aagtcgacca tgccgtttat 60 ggttagcgcc tcccaaaaag gaatggccga cttagagtta cctcttgacc gactttttct 120 ttcctccccc ttacatttcg ttaccacaac acattcctat atccaaactc caggtacata 180 actagtcgaa atctctttta aatctagtca agaactagat acttaacctt catactgctt 240 agagatagcc tttgaagctc ttatttcgct ctttgtctca ctctcacctt cctcctcctc 300 ctccaatcct ttttgctcgg gactagtcca agactagtca acgattagta tgccgagcat 360 tcgcgataac gatctccgaa agtccccaga gtactgtcac tatctcgagg cagttaagga 420 cggggagctt acgctgccgg atttcaagat agtaagccga cccgactagc tttaagccta 480 gttactgact agtccacagg acgacaatgg cgtgcctgat atccatccat atgaagtcta 540 ctgccgagtg aagggatgcc tcaagcgtac agtgagtcta ctgctatact agtttctgac 600 tagttattaa ctagttgaca ggttccctct gccaacagaa acatattggt caagcacttg 660 aaggacaaga actcccacgg catggagttt acattgcaca atggacctcc cactatgaag 720 gaactgatgg aggccaaagg caagtcctat ctagattact tggtgactag tccctgacta 780 gtcaagtagc atggtatgaa ggcttgtttg aaggcactgt tctcccaacc ccgactccta 840 ccaagaagcg caagcgagct gcgtaagttt ctgtgagtct aactagtgta ttagctaata 900 tatagcagca ccaagtccaa ggaccacaat actaagggtg tcgagaattc gtgagtttct 960 tctcccattt caactagtcc ttgactagtc actaactact ttgcagcaac gagggtgaag 1020 ctggaaatga tcaggacaat ggcgagggcc cgtaagtaca gccattcaat gcagactagt 1080 tgctaactaa tctgtgacta gttcaagtgg tccgtacgcc gtgcataccc ctgtgactgg 1140 taggaatttg agcaagcctg tcttgccgcg cgatgagaaa ggaaaggcaa gttacattca 1200 gccccgtacc taggatcagt gctaatttat aacctctagc cactctttat gcagatccgc 1260 cgtgagggta gcaaggcagc taaatcagct ggtgagaaag gaaccatacc ctgcaagacc 1320 tgtcgcaacg caaagggcaa aggtaagcta tccaagctag tttgggacta gattctaact 1380 agtctcagca ccgtgtggtt caaagccata ttgcgagttt tggcgctttt tctcatcgat 1440 tgacgaggca aagggagcga gtatgcaacc tcgtaagtca gactcagaca aagaccaact 1500 agttattgac cagtcttcta gaaggctctg ttgtggatct tgaggccctg gagagttcct 1560 ccaacaatcc ggagacaagc aagtcctcgt cggactagtc actaactaga ctctaactag 1620 ttgcagacat ggataatgca aaagagacaa 
gcaatgaaga aagtggtaag acatttctcc 1680 tttgtggttc tggactagtc tttgactagt cacagtctta aacaaggaaa atgagcatga 1740 aaatgaggag gaaaaggctg ctgagcccga ggaagtgcag ggtgatggca gacatggtag 1800 gttaatacct tgttagttat tgctagtcac tgactagtca ataactagtc tctgaacacc 1860 ttgcaatcac tccgtttgcg cagctgaaca gtggtgagga taatagtagt aagttattct 1920 agcttcagag ttataggaga ctagatacta actagtatta gttgcaacta acctggatct 1980 cagagacttt ggcctcaatc tagaatctat ctagttgtca actagactgt ggtatcattg 2040 tcttttattt tcctagtcct ggaactagct tctaactagt ctccctaata tgtggctgtc 2100 ttgttttttt tttttgtttc cctacccgga tatctagtcc ccttctaggt tctgttaacc 2160 tctcgggctc tgatttagtt taacgcaaac ctgagattag tttctaacta gtctctaggt 2220 tttctatcca cctttaattg taataataaa tacaagcaac gtttatacgt caaaagcatt 2280 tataaacttt taccctaaag tagcttgctt gtgtgtttag tttataatta gtctcttatt 2340 aatttgatgt aggtaagccc gccacaaata tatattttta acaagatacc gtggaaaaac 2400 ttcgtgctat cacaaaacag tatacaaaaa ataagcttaa caatctattc tccgcttggt 2460 gatgctaaag ggctttcaat agaccttgta agtgaaggag atggagccgt caatccgcta 2520 ccctgcctct ggtcagttgg tctcagcaat gtaccctgtg agtcttgata gactagttgg 2580 tgactagtct ctgtagatgg aacaaatggt gcttgagagg gcacgtatgg cagagctcya 2640 gtcatctgcg gaacatatac tgggcccggg ttgactagtt gtgaactaga tggtaaatat 2700 ccaggtagag gtggatattg cactggattt acaccaacta gttgaggact agttggctgt 2760 agtagacgct gctgcataag ccgagtctct acctcatatc gatctagacg tgcttggcgc 2820 tcacgaagat caagtatatg ttgttcatgg ctagcctgct gatcgcgata gttctgagaa 2880 gcaatctggg ctagggatgg agacctagac tagtcagcaa ctagtcccgg attagtcctt 2940 catgacttac cgctgtggtg tagaacttcg gcttactcct ctagtataac tcctacccct 3000 ggtactagac ctggagcggc gtggccgtgg agatccagag gatgagaaag atctagaagg 3060 acgtgctctt ctaggtagta ctccttctat attttcctct tcctcaatct cctgattacg 3120 gcgtcgtttt ctttctaata ctggttagta accagttagg cactagtcaa ggctagactt 3180 acggtccctg attaaattct ctgtaattct tgagatctca tcgtctggcc gataggaatg 3240 ggtaatgtta taatgtatcc gcacagagta ctgatgcata tctcgtacat caagaaaata 3300 tgcactatga gaactagtca gagtctaaac aaagcctagt 
cttgacttag aaacttacct 3360 agtaactgcc tttagaattg gtaggtgctt cccactaaaa gcatatgatt tttggtgggt 3420 ttgttcacct gcattggtat gcatacgggt ttgattaaaa tgctgttcag ggatttgtga 3480 gcaagcttta tttaatccag cagaaactac cggatgcagc ttatgttcag cccaagttgc 3540 cacgcctgga gttggatcct ctatatcaat cagttactag ttgctaacta gttatggtct 3600 ggacttacct agtaattgct ctaaaatatc catatactgg gctcgggtag gggcatgtaa 3660 aagactaatc attgccttaa ataagactgt atgacgttgg ttctggccaa ttatattctc 3720 aatcccacgg ataaaatgaa cccggcagaa gataagaatt ctttctaact gccagtctgg 3780 ttcatgatgt tctggatcaa gctcttgtag gcattttccc aaacctaact agttagcaac 3840 tagtgggtaa ctagtcagtg cacaaagaaa acataccgtc cctctgtcct tcatccatat 3900 caacaacgat agtttcaatg ccggttccat gaagatggaa aaactgtact ggatggcctg 3960 aaagatcatg aatcatctca aagacctttt taaatagaaa atagtaggca cggcctgttt 4020 gttggttcat aaacgcacga aggagtgtga taactagaag gtgattagtt attgactagt 4080 ctgggactag agcaggacta gtattgtact taccttttcc atgatcctct aaaaaaacag 4140 caaagataac ctcgttaaac ctgccctcac gtacgcgttt gaaggacatg tcaacttcaa 4200 atgagcgtag gctttgaaat agcttggcct gatcttgcaa tatacaaata ataattgtaa 4260 tatcaccttc atgatagatt tctcgcacat aatgctagtt agatgttatc cactaatccc 4320 cgactagtca gtaactagat agggactaat ctttacatac cttaagatat tcattccggt 4380 cgcattctag ttgtagtgca ttaagatctt ggccatcagg ataaaatgca agcttctccc 4440 ggtaaataag agcctgtatt cgatccatat ttgctagagc aggatgaagg cctataattg 4500 agtcctcatt atatcgcgcg agtaagtcct gcatttgagg actgcggata agttggccta 4560 gtctagttag caactaattg tggattagaa agggattagt ttaatagcag tactgactaa 4620 ttcgaagatc aggattctgc atgtcacgaa taatataaga gatttcatct gcaatttcgc 4680 cagggatctt ggctggaggk aggtggagga tgagtatgta ttcctgttga tgtaaaaaga 4740 taatagggaa aggcctccag gtctaatgga atcataacat taaagatcgc agtacaaggt 4800 tgcgactcta ttagtgatat atcatcacca tgcagaaagc cttagcaaga attagttagc 4860 aattagtcca agactagccc aagactagct gtataactca ctgcagttac agtacgttct 4920 ctttgaccgg cttgtgtcaa tcattgcaca gtattttggt tcaataattc ctctctgctc 4980 gaataattga gctagatata ggatatccgt atctgtacga aatgcaagag 
aatcaaaaaa 5040 atgttgttta tgggtttccg gtgttgaatt agagcaacat atataggatg aagcatggcc 5100 catttgaagc tgactagtta gtaactagac ccagactagt ctaccctaag gtacttacag 5160 cttcgccaag taccatctta ggagcacacc ctacttgttg cggccgacaa gcttgatgtc 5220 tcttaaatac acgaagcaca ccaaagaaat gtctatataa acatgattag tccttgatta 5280 gtctctaatc aaataaagac tagttactca cgagattgct ttctgttttt taggatccat 5340 ttctgtctct tcaataagtt gtcgctcctg tcggatatag ttccagtcat cttctgtaac 5400 cttgtaatgg tgcatatttc taaggttagg atggagataa gagcatattc gaacgcctga 5460 acactggtac tgtattcgag taaccctgcc tcgaatatag ggtgattgag catcataacg 5520 ctccaactgg cgagcatggt attgagtctg tgactagtca gcaactagtc ccttattagt 5580 tatagaacga acatttttaa taagtcttcc cttatcttcc ctgctcctta cccggttcgg 5640 atcaagtggg acaatatagg tatagccgtt ctgatgagat tcaggatagt aaggtaacat 5700 atagcagtat tcaagtttag atgtgattag aggacgtgtt ggatgattgc taagaataac 5760 tctgtcaatc tgctgccagt tagcaactag tcacaaacta ggtaagaaaa atgaagtctt 5820 gtacctgttc tggaacgtca atagcacctg gaatttgcgg agttgtagac atggtggcag 5880 caagttgcaa actagtcagt gactagtctg agactatttc ttattggcag attgaagttg 5940 aggtaaataa aggaggaaaa gtccattact gacaagtggt attagtattt tagctaactc 6000 taagtcgacc ttactccttt gggttagtgc gctcaaaaga ggaatagccg acttagagtt 6060 agctcgcgac atttaccacg tcacatgtgg tctc 6094 // ID Gypsy-53_MLP-I repbase; DNA; FNG; 5338 BP. XX AC AECX01002318; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-53_MLP_; KW Gypsy-53_MLP-LTR; Gypsy-53_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-5338 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002318; Positions 8887 3550. 
XX CC Positions [4138-4617] - Integrase core CC 'GATTG' target site duplication CC LTRs are 100% similar to each other. XX FH Key Location/Qualifiers FT CDS 236..1354 FT /product="Gypsy-53_MLP-I_3p" FT /translation="MDNINTNLANILSQLTTLNSKLDTEIAERQANQAETQ FT QRLNDLAQRVDAAQAATVPLPPDASPPSATPAATNPQPQSTTRGGNESITT FT ADIKLLMSSTRGPKVGTPDKFDGSKGDAAEAFVNQVGLYLLANDKSFPDDR FT SKIVFTLSYFTGEANQWAAPFFKRLLQPVEGEELTFESFAASFEATYFDSD FT RQNRAQRDLRALQQTGSVADYTTKFMALAARTGWGELEHISHYKIGLKQEI FT RINMILRTFTTLTDITTFAIAIDNELHPGANHSTPRTSTIPTARDPDAMDL FT SATRVGVSREEMARRAEKGLCYKCGKGRHRAADCGRRDEKGNFIRSGGSSS FT RIAELEALVAELGGKGKGRADGSKNGDARD" FT CDS 3736..5241 FT /product="Gypsy-53_MLP-I_2p" FT /translation="MDNIKSDEQLLKDIRQKTPLDNDIMEIINATTSPISS FT QVKDALREYEVVDDLLYRNGRVVVPNDNKLKADILKTYHDSKLAGHPGRAK FT TLSLVNRFYFWKGQKAFVNRYVEGCSSCQRIKPTLMKPFGTLEPLPIPAGP FT WTDISYDMITGLPDSQGYNCILTVIDRMTKMAHFIACKETDGAEQLADLMM FT KEVWRLHGTPKTIVSDRGSVFISKITSQLDKRLGIRLHPSTAYHPRSDGQS FT EIANKAVEQYLRHFVTYHQDDWAQLLPPAEFAYNNSTHTSTGVSPFKANYG FT YDLTLGPVPSEEQCIPVVEERIQRLLEVKKELQACLSGAQESMKSQFDKGV FT RDTPAWKVGDKVWLSSRNISTTRPSPKLDHRWLGPFVILKKISTSAYKIEL FT PLTMKGIHPVFHVSVLRKFIPDTIETRKIVKPTPIEIEGENEWEVEAILDC FT RKRYGKLEYLVSWKGFGKEDDSWEPASNLVNSQEMVDEFHVRFPNATSKYK FT RSRRRK" FT CDS 1462..3342 FT /product="Gypsy-53_MLP-I_1p" FT /translation="MSQNYPPKRATSTPFALCLLDCGATHNAVSEGFIKKF FT EFKTNPLSTPRTVSAFDGQSKQLTDEALLTIDDDQSPTKFIVTQLKDSYDA FT LLGMPWFRKYGHRIDWSKGTLMADPTTSIATANAVSLLPTNPLEPARDARN FT SSEGVCIGADSSKNTLIPPQCESIFGNLEQLNIGATCKFDGPLLQIDSPLE FT TTGPPYTPATPPMDPIATEFSVSSNPKTSSVPGDAERNARNCEEGVCIRAD FT VSKDTLIPPQCECANVNHQIIPEAASKRTPLPNNCAIDYESSICAAKTSWS FT TAAKIAVREHKDQSNKPVEELVPARYHKYINMFRKSSAQILPPRRKYDFKV FT NLVPGATPQAGRIIPLSPAEDEVLKKMVQDGLANGTIRRTTSPWAAPVLFT FT GKKDGNLRPCFDYRRLNALTVKNKYPLPLTMDLIDSLLDAEDYTKLDLRNA FT YGNIRVAEGDEDKLAFICKEGQFAPLTMPFGPTGAPGCFQYFIQDILLGRI FT GKDTAAFIDDIMIYTKTGGNHEEAVEGILEILSKHSLWLKPEKCEFSRKEV FT DYLGLLISKNRIRMDPAKVKAVSDWPTPKNTTEVLRFIGFANFYRRFIDQF FT SRKARPLHDLTCKDTPFKGQMKGRGRLKS" XX SQ Sequence 5338 BP; 
1636 A; 1214 C; 1269 G; 1219 T; 0 other; tattgcagtg tctcatctta gtacgacgga taacccaatt caagatctca atcacacatc 60 gactgaatcg actgaatcaa accttatctg aaaattgaag atttgcatac tttaaggttc 120 agaaataaac tcaagaacaa cggcgaactt cacagtgcat tccggagaac acgactcaga 180 gagctcattc gaggctgtat cagcgcaaga tttagatcta gacgaactcg acgtaatgga 240 taacattaat acaaacttag ccaacatatt gtcgcaactt acgacgttga attcgaagtt 300 agacaccgaa atagctgaaa ggcaagccaa tcaggctgaa acccagcaac gacttaatga 360 tcttgctcag cgagtagacg cggcgcaagc ggcgactgtt ccattacctc ctgacgcttc 420 acccccatct gcaacccctg ctgctacaaa cccacaaccg cagagtacca cccgcggcgg 480 taatgagtca attacaactg ccgacatcaa actcctgatg agctcaacac gaggtccaaa 540 ggtgggaacc ccagacaaat ttgatgggtc aaagggagac gctgccgagg cgttcgtgaa 600 tcaagtgggc ttgtacttac tcgcgaacga caaatccttt ccggacgata ggtccaaaat 660 cgtttttacc ctgtcttatt tcacgggcga agccaatcaa tgggctgctc cattcttcaa 720 acgattactc caacctgtgg aaggtgaaga actcacgttt gagagctttg cagcgagctt 780 cgaagctacc tacttcgact ctgatcggca aaatcgagcc caacgagacc tccgagcgct 840 acagcagact ggttcagtgg cagattatac aaccaaattt atggcgttag cggcgcgcac 900 gggttggggg gagttagaac atatcagtca ttacaagata ggcctcaagc aagaaattag 960 aatcaatatg atcttacgca ctttcactac cttgaccgac atcactacgt ttgctattgc 1020 tatcgacaac gagttacacc caggagcaaa tcactcaaca cctcgcacca gtacaatccc 1080 aaccgcaaga gatcccgatg ctatggacct ttctgctact agagttgggg tatcaagaga 1140 agagatggcg cgtcgagctg agaaaggatt atgctataaa tgcggaaaag ggagacatag 1200 agcagctgat tgtgggagaa gagatgagaa gggaaacttt attagaagtg gtggtagtag 1260 ttcgagaatt gctgagttgg aggcgttagt agctgaatta ggagggaagg ggaagggtag 1320 agctgatggg tcaaaaaatg gagatgctcg agactgatag atgtgcctat ctcgagcact 1380 gaggaggatt taattgattt tggagctatt acttacttac aaagaaatgc aagtgatccg 1440 cggtttttct tactattacc aatgtcccaa aattaccctc ccaaacgtgc cacatccacc 1500 ccttttgccc tatgccttct tgactgcggc gctacgcaca acgcggtcag tgaaggcttc 1560 atcaagaaat ttgaatttaa gacaaatccc ctatcgactc ctcgcacggt atcagcgttt 1620 gacggtcaat caaaacaact caccgacgaa gccctactca ccatcgacga 
cgatcaatca 1680 ccaacaaaat tcattgttac acaactcaaa gactcctacg acgccctatt gggtatgccg 1740 tggtttcgta agtacggtca caggattgac tggtcgaagg gcaccttaat ggcagacccg 1800 acgacaagta ttgcgactgc taatgcagtt tcgctgctac cgacaaaccc cttggagccc 1860 gcaagggacg ctaggaatag tagcgagggg gtatgtattg gagctgactc cagcaaaaat 1920 acactaatac ccccgcaatg tgagtcaatt tttggaaatc ttgaacaact taatattgga 1980 gcaacttgca agtttgatgg ccctctatta cagattgact caccactgga aaccactgga 2040 ccaccgtata cgccagccac accaccaatg gatcccattg caactgaatt ctcagtttct 2100 tccaatccga aaacatcctc cgtgccagga gacgccgaga ggaacgctag gaattgtgaa 2160 gagggggtat gtatcagagc tgatgtcagc aaagatacac taataccccc gcaatgtgag 2220 tgtgctaatg ttaatcacca aataattcct gaagcagcta gcaagcgtac acctcttcca 2280 aataactgtg ctatagatta cgagtcgagc atatgcgctg ctaaaacttc ttggtcaacg 2340 gcagcaaaaa tcgcggtcag ggaacataag gatcagagca acaaaccagt cgaagaacta 2400 gtcccggccc gatatcacaa gtacattaac atgtttcgca aatctagcgc tcagattctc 2460 ccgccacgtc gaaagtatga cttcaaggtc aatctggttc caggtgctac gcctcaagct 2520 ggcagaatta taccactatc tccagcggag gatgaagttc taaagaaaat ggtacaggac 2580 ggtctggcga atgggacaat cagacggacc acttccccat gggcagcccc agtgttattc 2640 accggcaaga aagacggcaa tctaaggccg tgctttgact atcgtagact aaatgcttta 2700 acggtaaaaa ataaataccc attaccgctg acgatggatc tgatcgatag tctattggat 2760 gctgaagatt acaccaaact tgatcttagg aacgcatacg gcaatatcag agtggccgaa 2820 ggcgatgagg acaagcttgc atttatatgt aaggagggtc aatttgcacc tttgacgatg 2880 ccttttggac ctacaggtgc accgggttgt tttcaatact ttattcaaga tatattactt 2940 ggaaggattg ggaaagatac agctgctttt atagacgaca taatgattta tacaaagaca 3000 ggtggtaatc atgaagaagc agtagagggg atccttgaaa tcttaagcaa acattcgttg 3060 tggctgaaac cagaaaagtg tgaattctca cggaaggaag tggactacct gggcttgttg 3120 atatcaaaga atcggattcg tatggaccca gctaaagtga aagctgtatc agattggcca 3180 acaccaaaaa ataccacaga agttttacgg ttcattggtt tcgcaaactt ctaccgaagg 3240 tttatcgacc aattttcccg aaaagctcga ccactacacg atttgacctg caaggacacc 3300 ccattcaagg gtcagatgaa aggcagaggg cgtttgaaga gctgaagcac gcattcacga 
3360 cagcaccggt cctgaaaatc gcggacccta acaaagcagc ttacaagacg tcaggccaga 3420 tgggcggaga cattagggtg ttttgatttt gtgattaagt ttcgacctgg cagagccgcg 3480 acacaacctg acgccctgtc acgaagaccc gatttagcac ccacatccga ggaaaagctc 3540 tcttttggac agctactacg tccgtccaat ataactgatg agacttttgc tgagatagat 3600 gcttttgaag cacagttcac caccgaggac gcggtcgagc accctagagc gtcggagtgg 3660 ttccaattgg atgtgttggg agtcgcagaa gaggtcaaca atcaaatttc ggaagaagat 3720 accgacgatc aacccatgga caacatcaaa tccgacgagc aattattgaa ggacatcaga 3780 caaaagacgc cactcgacaa tgatattatg gaaatcatta acgcaacgac tagcccaata 3840 tcttctcagg ttaaggatgc gctacgggag tacgaagtgg tagatgacct gctgtatagg 3900 aatgggaggg tagtggttcc aaatgataac aagttgaagg ctgacattct caagacctat 3960 catgacagta agctagcggg acacccaggc cgcgcaaaga ctttgagctt ggtcaatcga 4020 ttctactttt ggaaaggaca gaaagctttt gtgaaccgtt atgtggaagg ttgtagttcg 4080 tgtcaacgta taaaacccac actaatgaaa ccttttggaa ctttggagcc actccctata 4140 ccagcagggc cgtggaccga tataagctac gacatgatca cgggactgcc ggattcacaa 4200 gggtacaact gcatattaac ggtgatcgac agaatgacca aaatggcaca ttttattgcc 4260 tgtaaagaaa ctgacggtgc tgaacagctg gctgatctta tgatgaagga ggtttggaga 4320 ctccacggaa cgccaaaaac catagtatcg gatcgtggca gcgtattcat ttcaaaaatc 4380 acgtcgcaac tggacaaaag gttgggcatt cgactacacc catccacggc gtatcacccg 4440 agatccgatg gccaatcaga aattgcaaac aaggcggtgg agcaatatct aagacatttt 4500 gtaacttatc atcaagacga ttgggcgcaa ttactaccgc ctgcagagtt tgcgtacaat 4560 aacagcacgc atacttcgac aggggtgtca ccctttaaag ctaactacgg ttacgattta 4620 actctaggac cggtaccctc agaagaacaa tgcatacctg tcgtggaaga acggatacaa 4680 cgtcttttgg aagtgaaaaa agagcttcaa gcatgtctgt cgggagcaca agagtcaatg 4740 aagtcccagt tcgacaaagg cgtacgcgac acaccggcgt ggaaggttgg ggataaagta 4800 tggcttagta gccgaaatat atccaccact aggcctagcc cgaaattaga tcatcgctgg 4860 ctaggacctt tcgttatttt gaaaaagatt tcaacttcag cctataagat agaactgccg 4920 ctgactatga aggggatcca ccctgtgttt catgtttctg tcttaaggaa gttcatacct 4980 gacacgatcg aaacacgaaa gattgtgaaa cccacaccca tcgaaataga aggcgaaaat 5040 
gaatgggaag tggaagcaat tttagattgt agaaagaggt acggtaaact ggaatatctc 5100 gtcagttgga agggatttgg caaggaggac gactcttggg aaccagcatc caatttagtc 5160 aactcgcaag aaatggtaga tgaattccat gtaagatttc ctaatgctac gagtaaatat 5220 aagagatcaa ggagaagaaa gtagagggtc aagctttttc cccacggggt tttttaatgc 5280 tgacccgggg aagagtgcag ggccgcaaga ggggcctggg cgctaaagag ggggatag 5338 // ID Gypsy-25_MLP-LTR repbase; DNA; FNG; 386 BP. XX AC AECX01000928; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_MLP_; KW Gypsy-25_MLP-I; Gypsy-25_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-386 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000928; Positions 66317 66702. XX SQ Sequence 386 BP; 110 A; 79 C; 58 G; 139 T; 0 other; tgtaatgtgt cgaaacacca ttacaaaacc tataaacata aatctagcta taaacatatt 60 cagacctaaa cagatattat atctttccaa attcttcacg cacacaattg gctccttagg 120 gaatctttga ctgatttaga ttcatatcct ttggagccag gtgagctcat catatttctc 180 ttttaacatt tctttattca ttatttgtaa tccttaggga atctttgact gatttagatt 240 catatccttt ggagccagct tttaattata atttttattg gcattagaat cagtcggaac 300 caccttgtgg tatcctgttc cctccgctca cgtttacgag cgctatcgag atcatttagg 360 tgaaggactt tcagaagttc cttaca 386 // ID Gypsy-2_PPM-LTR repbase; DNA; FNG; 1016 BP. XX AC ABWF01003175; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Postia placenta genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-2_PPM_; KW Gypsy-2_PPM-I; Gypsy-2_PPM-LTR. 
XX OS Postia placenta OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Polyporales; Postia. XX RN [1] RP 1-1016 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Postia placenta genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABWF01003175; Positions 6039 5024. XX SQ Sequence 1016 BP; 201 A; 340 C; 221 G; 254 T; 0 other; tgataggcgc caagcccctc tgggccccga tacagctctt ttcggcagtc gcatcccgcc 60 cggcacttcg gctcagtcac ccaacacttc gacttcgcct tcgacgctct tcgacatctt 120 cgacggggca cgacgcttgc tcgaagctcg gcgcggtcgt cccgatgcct cgcgcataga 180 tctcggaaca cctcccgcat tcggcggaca atagggccac ttgacgccat tcggccaagc 240 gcgtgcatcg ctccatgcca caccgacgta gccggtcttc cccgattagg tatcgaacct 300 cgcggcatgc cttcaacgac ggataccggt ccgaagtgtc tcgcatgctt ctcttcatct 360 ccgcgttcag catgacgtca ctctaccgac agacttgata tgtgccactc ggtgaggcag 420 cttcgtgcgg aacaatgcaa agccccttgt actagtgtgc cactctcgat agtcatagcg 480 tcctatgcgt agtataaaat ctgtcgtact tctttccaaa aacacaagat ccggttcccc 540 cctcgagttc agagctcata agtcaagttt gccctgcgtc taagccctca actgcgtcac 600 ttcattcctc gctgcataac ctcgaaacct cgtctactgc gtgtactcgt cgataccgtc 660 tcgacgaacc tttcgacttc ggctattgtc gaagaactgc gttgcacctc gaaccacgtc 720 ctagctaagt ccaccctatc gaatcgtgtc tcttccctct atgatacatc gaagagctct 780 gccgagcttc gatcgtctcc actaaccacc cacagcctcc gagtagcttc gacaacgtgt 840 cgagccacct cgtaccttcc ctagtgttcg tagccttttc tgaaggcgtt tctcgtgttc 900 actcgaacca cgggacacct tgctgctaac tcccctctca ctaagactat aatctgtctc 960 tctagtccct ttaggtcgca cccaggttag gggttagggt ttcgtaggtc gtatca 1016 // ID Gypsy-1_ADJ-LTR repbase; DNA; FNG; 227 BP. XX AC ADAR01000039; XX DT 21-APR-2011 (Rel. 16.04, Created) DT 21-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Batrachochytrium dendrobatidis DE genome: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_ADJ_; KW Gypsy-1_ADJ-I; Gypsy-1_ADJ-LTR. 
XX OS Batrachochytrium dendrobatidis OC Eukaryota; Fungi; Chytridiomycota; Chytridiomycetes; Chytridiales; OC Chytridiales incertae sedis; Batrachochytrium. XX RN [1] RP 1-227 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Batrachochytrium dendrobatidis RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; ADAR01000039; Positions 28364 28590. XX SQ Sequence 227 BP; 61 A; 59 C; 43 G; 64 T; 0 other; tgaggatcac agccatgagg accttggatt aattaccttg cggttacaca acaccctaag 60 gactcatgcg aaccgccata aaggacactc aactccctaa ggactcatgt tgaccgccgt 120 atggacacac actccctaag aacctgtgtg atccctttat ggctgtactc ttgttgaggt 180 ctataaatag accatacttg tcatgtgtat tttcctcagt tttatca 227 // ID Gypsy-1_MLP-LTR repbase; DNA; FNG; 169 BP. XX AC AECX01000978; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-1_MLP_; KW Gypsy-1_MLP-I; Gypsy-1_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-169 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000978; Positions 294 126. XX SQ Sequence 169 BP; 39 A; 35 C; 37 G; 58 T; 0 other; tgtaataatc cgtgttatgc tagtaccaga ttatgcttgt agtgaggtgc acggagcggg 60 agttgtactt tgttccttcc acgcttgtag cttttctctc catactgcaa tctagttagt 120 tagctatagg aacctcatca gtgttcactg cgtacattgg atcataaca 169 // ID Copia-55_MLP-I repbase; DNA; FNG; 4670 BP. XX AC AECX01000203; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. 
XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-55_MLP_; KW Copia-55_MLP-LTR; Copia-55_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4670 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (21-APR-2011). XX DR Genome; AECX01000203; Positions 37226 41895. XX CC Positions [2147-2422] - Integrase core CC 'CTCA' target site duplication CC LTRs are 97% similar to each other. XX FH Key Location/Qualifiers FT CDS join(26..2143,2147..4612) FT /product="Copia-55_MLP-I_1p" FT /translation="MSSSSERPQSEEPSLTDDESTESTTSSSSSASNQTAK FT LQISTSSTFDTLLNSSNTMSTEETPNALFGNSLQKYGQQLNNALSKFKLTD FT VLEDGNYPVWSRCIVVNLETIELHHYVLVKDYVDSELTKDQVLKTRKIIVN FT YILNYLDKKNQTQSVTHLNDPNDPLEIIYDPFKLWLYLKERHFLVNAQRLA FT FITKSLSTIKITRGETLTSFLDTFENLFVDFLRYGGKMDDMQSAIRLIDAI FT DTLSSHTVEFIHTTVVPLTRIGVAKYLREFETRHSFSTEVTRGTNFSEVNA FT NSAYRGNRVVCTETVCRGNHPPEKCWSKPENFKERDDFLARRRGIKSNQHT FT NVKSQPSFNGSVPSSQEQVRGMKRVTDQASANAVTASLEMMCLNTELTPLK FT SNDVTSVSIHTEMMTGHTIFEEVDSEPSANASKVENSMTIWALHDTGATHH FT MFNVKSIFVQDSLKPIEESNLRLKLAGGNVSLAVESVGRVQLKAGDGTVFE FT LSECLYVPELSKNLIAGGVLKMKGVREVYDKTEPNCFALVNKGLAIFNGVI FT LPNGLMNIIIDPVSNPNTIKSEPSSCSSIFHRRLGHISPQYLRQMVKNGSV FT DGIKDLDFEFMNCDICSLSKNTKLPFNHTRPRATKFLENVHVDLSGINRIT FT AVGNEKYYILFCDDFSSFRHIYGLRDKSKEEVKSVVMSYIALSERQTCCKL FT KQLTLDGGEFLNNSLGPELRNLGIVLHLTAGHTPEQNGVAERGNRTVMTKA FT RCMMIESGVPLSFWFLACEAAVFLTNRTITKAVSNFKSPFEIWYFRKPSIS FT HLRVFGCKVFRLIRKELRTSKYLPVSSEGVLVGFYQDNFNYRIFDLNDKKF FT HITPHVTFNEDQFPFRVHPSRISDNETDVIRTFYFDDDDSSEEINVETNAE FT ISNENSDEVEACEQNFQSVPITPSETPEVIHETVDSTDNPSNIDSNDSQPS FT VPTQTRFTFRNIPKVDYKGMASMAVFEDTDYLSQFDCLPPEVNHVSSSTPD FT PKSYKKAMSSGDSQLWTEACNKEMSSLLKKDVWTLVDRPTHKAVIRGMWIF FT RKKDTADGSIKFKARYVAMGNTQVEGVDYGETFAPTGKPTSFRILVAMAAI FT HGWEIHQMDAVTAFLNGILEEEIYIEQPEGFQVKGQEHMVLKLNRSLYGLK FT 
QSPKIWQDDVEGFLIEIGFHQCEVDPCVYIRSDEENEKFTAVYVHVDDMAI FT TGNEIEKFKTEISLKWEMEDLGLAKMVVGIEIQRRGNQIYALTQSKFALTI FT LNRFEMTTSKSASTPLTPNLKLYRATDEEAAKFLLLKLNYRSAVGSLIYLS FT QCTRPDLAYAVGVLSQHLDKPGIQHWNAALNVFRYLKGTINLGIVYSGKDV FT SSVSGQRSYQFPISHCDADWAGDRSTRRSTTGYIFTLAGGALSWKSRLQPT FT VALSSTEAEYRAITEAGQELLWLRRMMGYFGCYDSEATVLESDNLGAIHLT FT SKSIFHGRTKHIEIQYHWIREVVNKKEMIIKHCPTSEMVADLLTKALGKQQ FT FIKLRSKLGVKS" XX SQ Sequence 4670 BP; 1406 A; 955 C; 931 G; 1378 T; 0 other; ttcgacacct gtctttcaac tgtccatgtc tagttcttcc gaacgtcctc aatcagaaga 60 accttcactt accgacgacg agtctacgga atcaaccact tccagttcat cttctgcatc 120 aaaccaaacc gccaaacttc agatttccac ttcatcaact tttgatacgt tgttgaattc 180 ttcgaacact atgtcgaccg aggaaactcc aaatgcatta tttggaaatt ctctacagaa 240 gtatggtcag cagttgaata atgctttaag caaattcaag ttaaccgatg tcctagaaga 300 tggtaactac cctgtctgga gccgatgtat agttgtaaat ctcgaaacca tagaattgca 360 ccactatgtc ttggtgaaag actatgtaga ctctgaactc acgaaagatc aagtactcaa 420 gaccaggaag atcatagtca attatatctt aaactattta gacaagaaaa atcaaactca 480 gtctgtcacc catctcaatg atcctaatga tcctttagag atcatctatg atcctttcaa 540 attgtggtta tatctcaaag agagacattt tctcgtaaat gcccaacgtt tagctttcat 600 cactaaatca ttgagtacaa ttaaaatcac tcgtggtgaa actttaactt ctttccttga 660 tacttttgaa aatcttttcg tcgactttct acgatatgga ggtaaaatgg atgacatgca 720 gtcagctatc aggctcatcg atgcaattga tacgttatcg agtcacactg tcgagttcat 780 tcatacaact gttgtccctc ttactcggat tggagtcgcc aagtatcttc gcgagtttga 840 aacccgtcac tccttctcta ccgaagtcac tcgtggaacc aatttctctg aagtcaacgc 900 taattcagct tatcgaggaa atcgtgttgt ctgtactgag actgtgtgcc gaggtaacca 960 tcctccagag aaatgttggt cgaaacctga gaacttcaag gaaagagacg atttcttagc 1020 tcgacgaaga ggcatcaaat caaatcaaca tacgaatgtc aaatctcaac cttcattcaa 1080 cggatccgtt ccctcttctc aagaacaagt tcgtggaatg aaacgagtga ccgatcaagc 1140 ttcagccaat gccgtcactg cttctttaga aatgatgtgc ctcaacacag aacttacccc 1200 attgaaatca aatgatgtaa cttcagtttc tattcacact gagatgatga ctggtcatac 1260 catttttgaa gaagtagatt cagaaccttc cgccaacgct tcaaaagtcg 
aaaactctat 1320 gactatttgg gcgttacacg acacaggagc aactcatcac atgttcaatg tcaaatcaat 1380 ctttgttcaa gatagtctta aaccaattga agagtcaaat ctaagattga aacttgctgg 1440 tggtaatgtg tcactcgcgg tagaaagtgt tggacgagtt cagttgaaag ctggagatgg 1500 aactgtgttt gagttgtctg agtgtttata cgttccggaa ttatctaaga atttgattgc 1560 tgggggagtt ttgaagatga aaggagttag agaagtgtat gacaagaccg aaccgaactg 1620 ctttgcttta gtcaataaag gactagccat ctttaatggg gtcatcttac ctaatggact 1680 catgaacatc attatcgatc cagtaagcaa tccgaacacc atcaagtctg aaccttcatc 1740 ttgctcatct atctttcatc gtcgcttagg tcacatcagt cctcagtact taagacaaat 1800 ggtgaaaaat ggaagtgttg atgggattaa ggaccttgat tttgaattta tgaattgtga 1860 tatttgctcg ttgtctaaaa atacaaaatt acccttcaat catactcgtc ccagagctac 1920 taaatttctc gagaacgttc atgttgatct tagtggaata aatcgtataa ctgctgtagg 1980 aaatgaaaag tactacattt tattttgtga tgatttctca agcttccgtc atatctatgg 2040 gcttcgagat aagagcaaag aagaagtgaa atctgtggtt atgtcctaca ttgctctgag 2100 tgaacgccaa acatgttgca agttgaaaca actcactttg gattgagggg gagaattctt 2160 aaacaactct ctcggacctg aacttcgtaa cttgggaatt gtccttcatc tcacggcggg 2220 tcatactccg gaacagaatg gagtagctga gaggggaaat cgtacagtta tgactaaggc 2280 ccgatgcatg atgatcgagt ctggagttcc cttgagtttc tggtttttag catgtgaagc 2340 tgcagttttt ctcaccaatc gaaccatcac caaagctgtg tccaatttca aaagtccttt 2400 tgaaatatgg tatttcagaa aaccctctat ctctcatctt agagtgttcg gatgcaaagt 2460 gttccgattg attcggaagg aactcagaac gtccaaatat cttccagtta gctctgaagg 2520 agttctggtt ggtttctatc aagacaactt caattataga attttcgatc ttaatgacaa 2580 gaaatttcac attacacctc atgtcacttt caatgaagat caatttccgt ttcgagttca 2640 cccatcaagg atttctgata atgaaaccga tgtgatcagg accttctact tcgatgatga 2700 tgattcttcg gaagaaatca acgttgaaac aaacgctgag atctctaacg agaattctga 2760 tgaagtagaa gcctgtgaac aaaattttca atcagttcca atcactcctt ctgaaactcc 2820 tgaagtcatt cacgaaactg ttgattcaac cgataatcca tcaaacatcg attcaaatga 2880 ttctcaacct tcagtcccta ctcaaactcg attcaccttt cggaacatac ctaaagttga 2940 ctacaagggt atggccagca tggcagtctt tgaggatacc gattacttgt cacagttcga 
3000 ttgtcttccg cctgaagtga atcatgtgtc atcttccacg ccggatccga aatcttacaa 3060 gaaggccatg agctcgggtg actcccagct ctggaccgaa gcttgtaaca aggagatgtc 3120 ttcccttttg aaaaaggatg tgtggaccct tgtcgatcgt ccgactcaca aagcagtcat 3180 cagaggcatg tggattttta gaaagaaaga cactgctgat ggttccatca agttcaaagc 3240 tcgatatgta gctatgggga atactcaggt tgaaggtgtg gattatggcg aaacctttgc 3300 ccctacgggc aaacctactt cttttcgtat tcttgttgct atggctgcca ttcatgggtg 3360 ggaaatccat caaatggacg ccgtgactgc ttttctcaac ggcattcttg aagaagaaat 3420 ttacatagaa caacctgaag gatttcaagt caaaggtcaa gaacatatgg ttttgaagct 3480 gaaccggtct ctttatggtc taaaacagtc accaaagatc tggcaagatg atgttgaagg 3540 attcttaatt gaaattggat ttcatcagtg tgaagttgat ccatgtgttt acatcagatc 3600 agatgaagaa aatgaaaagt ttacagctgt atatgttcat gtagatgata tggccatcac 3660 tggaaatgaa attgaaaaat tcaaaactga aatctcactg aaatgggaaa tggaagatct 3720 tggattggct aagatggttg ttggtattga aattcaaaga cgaggaaatc aaatttacgc 3780 tttaactcaa tccaaatttg ctctaaccat tcttaataga tttgaaatga ctacttcaaa 3840 atctgcttca acacctttaa ctccaaatct aaaactctat cgtgcaactg atgaagaggc 3900 tgctaaattc ttattactca aactgaatta tagaagtgct gtgggctctc ttatttactt 3960 atcacaatgt acacgtcctg acttagccta cgcagtgggt gtcctgtctc agcatctcga 4020 caaaccgggg atccaacact ggaacgcagc tctcaatgtt tttcggtatc tcaaaggtac 4080 aattaatctt ggtattgtat actcgggtaa agatgtctct tcagtctccg gtcaacgaag 4140 ctatcaattt cccatctctc actgtgatgc agactgggct ggagatcgca gtactcgccg 4200 gtccaccact ggttatattt tcactcttgc tggtggtgcg ctctcatgga aaagtcgttt 4260 gcaaccaact gttgcattat cttcaacaga ggctgagtat agagcgataa ctgaagcggg 4320 tcaagaatta ctctggttaa gaagaatgat gggttatttt ggttgttacg attcagaagc 4380 tactgtccta gaaagtgaca atctcggtgc aattcatctt actagcaaat caatctttca 4440 tggacgtaca aaacatattg agatacagta tcattggatc cgggaggttg tcaacaagaa 4500 ggagatgatt attaaacatt gtccaacaag tgaaatggtt gctgatttac ttacaaaagc 4560 acttggtaag cagcaattca tcaagttgag aagtaaatta ggtgtgaaat cttaacggtg 4620 aaagtcttga gggggagtat tggattattt tcatcatcga gctgaactca 4670 // ID 
Gypsy-34_MLP-LTR repbase; DNA; FNG; 167 BP. XX AC AECX01000176; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-34_MLP_; KW Gypsy-34_MLP-I; Gypsy-34_MLP-LTR. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-167 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01000176; Positions 14742 14576. XX SQ Sequence 167 BP; 52 A; 35 C; 28 G; 52 T; 0 other; tgtgataatc cataactcac attatctcta tagctagtta tgcttgtact gtagaagcgt 60 gagaactagt cagtgctcga ggtatcagga ttgatacctc atacttcaat ctcagttata 120 atataagaag accttctctg atctccataa gtcctgaagc cataaca 167 // ID Gypsy-1-I_AF repbase; DNA; FNG; 6638 BP. XX AC . XX DT 28-FEB-2006 (Rel. 11.02, Created) DT 07-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE Internal portion of the Gypsy-1_AF LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-1_AF; KW Gypsy-1-LTR_AF; Gypsy-1-I_AF. XX OS Aspergillus fumigatus OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC Neosartorya. XX RN [1] RP 1-6638 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-6638 RA Kapitonov V.V. 
and Jurka J.; RT "Gypsy-1_AF, a family of gypsy LTR retrotransposons in the RT Aspergillus fumigatus genome."; RL Repbase Reports 6(2), 60-60 (2006). XX DR [2] (Consensus) XX CC This is an internal portion of the Gypsy-1_AF LTR CC retrotransposon. Gypsy superfamily. ORFs coding for a Gypsy-like CC polyprotein are corrupted by numerous stop-codons. XX SQ Sequence 6638 BP; 2774 A; 963 C; 1045 G; 1855 T; 1 other; aattaaacac catacagtat aaatcgcttt aggaagttta attaactctt aatattaaat 60 attatagcag gaaatgcgcc acagatccct actagctcta gtattagtta ggcacctact 120 acgcagtaat aacagctata gcctttaata gaagagctaa gaggcatggt taagtagctt 180 taggaatata tctaataact agaaggttaa cttacagcta agtctattaa agttagacta 240 cctaaactat ttaatagaaa caggtctaag ttacaagtat ttcttatata attagacatg 300 tatatccata taaataggga aaaacttatc actaaggtag aaagggtctt gcttacagca 360 acctatctaa ctagactagt atttaattag tttaaaccct ttatcaggga ctattaagag 420 catgaagagg ataactaagc taataaaact aacgagatat tcactagcta taatgctttt 480 aaaaaataat tagaaagcac tttcagtaat attaataaag aaaggaatgc taagtaataa 540 ctatagcaac tcaagcaaac aggatctgta ggggaataca tattaaaatt ctaatagatt 600 atatcctatc tgaattagga caaggacatg aacattgcaa gatttaagga aggtttgaag 660 ccggagatct aggagaagtt aatctagata gaataactag ataaacttag caagataatt 720 aaataagtag ttaagattaa taataagtta taagacttct atatataaag atgtgaatga 780 agcagctgga atagctttag caattaataa accataaact attaagtaaa taataggaga 840 cccgctcaac tgtggaacta aggatactta gatccttata ggctataact aatagagtta 900 gatgtaatat acacagcact ttctaataag gaaagagaac ggcgacataa agagaaacta 960 tatttttagt gtgggaagct aggacatata tctaaggact gcaagaaaaa cctagggaag 1020 ggtaagcagt aagagaagaa atagatataa gctataaagg agcttagtgc tataattaga 1080 agaaatagat ataataccac tagcactatt aagataaaga aaagtaatga ataactcaga 1140 aagttgtata aggaatgtac aaacttaagt aataaggaaa ttaaaataga gcttaagaaa 1200 gaagactcta ccgactataa aagcataaag gataattagt ctgctaaaga cttaaatatc 1260 cttatagcta ctaattagaa gttacctaat agtaatgcag acagccttat taattaggag 1320 aatattatac ctagagggct tatttactaa 
tagataacat taacctataa gatatatcag 1380 gttattagtc ctaacttaga aaatactaac tataataagc tagcttaaga ggactttcct 1440 gtaatagatt agactactat taactaacct attaatctac ttaataaaga gtatagaact 1500 tagtaggtag gccttaatcc taacaaggag cagccctata agattcctaa gacacctaga 1560 aataaatccc tataagataa taataaagta gactaatatg ccgcatggct atgacagcta 1620 gataggaaat agttagtaga actagaagcg cactaactaa ataagtaaac ccttaaaaag 1680 gttaaacaag gggtgaaagg atgcatatat cagaacttag aatattagac ctaagttaaa 1740 tcttaggaat agtatataac acactgcaga aagtatatag tgtattacta acactgcagg 1800 tagtagaata aagagtatta ctataaactt aagagggtat tagaaaagag actaccagga 1860 ctattttata ggaaatataa ttataactag tatctatgca aatacttcta aggacacccg 1920 caatataaat aagtactata gattacttat tacagtaata attatagcat ctactataat 1980 aataaatata tagcaaaatt ttagccagag gtactggtaa ttatgaagga gaataacact 2040 tatctatgct agaagcttaa atactcttac agaggatata ggtaatactt agaatataat 2100 acaatgtatt agacagcatg ctttaaagac taatacatag tacactacta aggcaaatta 2160 gttagaggat atttccccat gctactacta ataactaggg cactaaaata ggaaatagag 2220 tacctcacag caatagtgca gcataaaaag tacttaaaga tagttataaa cattcttaat 2280 taattaacta aagtaataat taactcagga gcaataggca actttataaa cccagccttt 2340 ataaggaagc tagggattct aggaaaaata aaggcagtgc ctaaactaat cacaggtcta 2400 aatagagaga atttaggaac tctcttaatt atgactgaat caggaccagt cctaatagtt 2460 atattaggat atattaaata tctaaacttt aatattgtgc taataggatg gtataatatg 2520 gtgttaggga ttctatagct caggaatcat aacctagcga ttaactagaa tataaattaa 2580 atctagttta actataaatg cccacaacag gactaggttt aaggggaaac cagggcctta 2640 tagtatatat aattaaagga tagcaataat aacgctaagc agtcaaaggg tggactgaag 2700 aaagaagtaa taactagcta gaggataata gtagtattag tagcaactaa actattataa 2760 aggatcataa taatagagct actaggatgg gcactagagt aagatagtaa ttatactaat 2820 attataacac cagaaagtga cgataaaggt atctgggaaa taaaatacag agaaataatt 2880 aatcccagga caggcagtcc caggaatatc cttgacccag aaccctactt attcttagca 2940 ataataaatc aggactagga tgatcctaga gtaatcagtc ctaggaacac taaatctagg 3000 aataaattag ggttagaact aattcttcct taggaatata 
gtaatttcaa gaaattattt 3060 aaataaccta aataatatat tctaccagaa tatagtaaat ataactactg gattccttta 3120 caggaaggaa agtcactagt atataagaag atatactaaa tatcaggaaa agagctacaa 3180 gcgcttaagg agtatattaa taagtaacta taacttagga agatataact attaaactta 3240 ctagcaggac atagtatact atttatacta aaaaaggata gaagtctata actctgtata 3300 gactacaggc tattaaatat aattataatt aaggattatt atctacttcc gttaatccac 3360 gacatctaag attaaattcg aggagcgaaa tggttcacaa agttagacat tatggatgta 3420 tataatcata ttcgaattat agaaggcgaa gaatagaaga cagcttttag aataaagttc 3480 agatatttta aatacctagt tatactattt agattaacta acacaccagc attatttaaa 3540 aggtttatta aggaagtact acacaaggtt ctgcactgct ttgtagtagt ctacttagat 3600 aatatcttaa tctttttaga aaataaaaat aaatatatag aatatgttaa ggaggtttta 3660 caagagcttc aggaagcaag catcagacta aaactcaaga agtatgaatt ctatgtctaa 3720 gagactaagt tcttaggata ctagatatct actaaaggca tctatataga ctaaaacaaa 3780 gttataacta taagagaata gccttaacta agaaacgtta aggaagtcca gcaattcatt 3840 aggcttataa actattatta ataatttatt aaaggatacg taaaaatcat gacaccactt 3900 tttaaactac ttaagaaggg gaaagagttt aagtggacta aagaatataa aatagcattt 3960 actaggatta aagagaaggt tataactata ctaatcctag tgcaacacaa cctggataaa 4020 gaaataacta tcaagaccaa cacatctgac tacgcaatta gaataagaat aacataacta 4080 ggatctgatg gaagaccacg actagtcata ttctatttaa gaaagctaat ttaagcagaa 4140 cttaactata acatccataa caaagaacta ctagctatag tggtagtatt taaagtctag 4200 agagtctacc tagaaggagc atagtaaaca gttattataa aaacagatca taagaacctt 4260 acattcttta taactataaa agaactaaca aggagacaag ctagataggc tgagacactc 4320 tcacagtata actttaagat tattcactat aaaggaacag aaaataggca agctaatata 4380 cttagtcaat aactagatta caagctttaa gggaaaataa ttaaactagc catattaaaa 4440 taacaggagg ataggttact aatatataat tactagactc ttacagccat aattaagtta 4500 caggaggacc cacttatcta aagaattatt aaagtaacta aaaaggacaa aattatttaa 4560 gaaacactta aaaacttagt agataataaa cacctcacta cagacaacag aggaataata 4620 tatttctaag gacttatcta tatacccgag tgtatgagaa ctaagattat cacaaagtac 4680 cataacaacc taatgcatag atatatagga actaaaaaaa cagccaaagc 
catcttaaga 4740 aattattatt tcctaaatat aagaaggaag gtacaaggat atatttacta gtataaaact 4800 tgcattagag acaaacctac tagatactag ccttatagta aaatacaatc accagaagca 4860 ctgaaataac tataggaata gatcactatt aactttatca gactactact agagttaaaa 4920 gggtataatt acctaataac agtcactaac agacttacaa aatttatcta tttaattcta 4980 ataataacaa agataacggc atcacaactc gcaagcctac tcatagggca tgtgattatg 5040 aattatagaa tgccacgata cattacatta gatagagata agttattcat gttaaagttt 5100 tggcaattat taatagattt aataggaatt aagcaaagac ttataactat atatcatcca 5160 caagctaata gacaaactaa ataaacaaac taaataatta aataatattt ataacactat 5220 gtgaattact agcaagataa ttaggtagtt tacttactaa tagcataatt cacatacaat 5280 aatactattc attcaactac aggggaaata cctttctttg ctaattatag atataaccct 5340 atattaatta gagaactata aaacaaagta cccacagcag aagaagccgc agaaatagtg 5400 gatactatta actatttaag gacttaactt ttaagggaca ttaaattcat gaacctctgc 5460 atagtaattt actataatta gaagcacagg agtgctctag acttaaaaaa gagagagaaa 5520 gtttacctac tctgcagaaa tattaaaata aaaagactaa gtcagaaact agactatcag 5580 aaaattagac tatttattat tacagaaaaa ttaggactag ttaactataa actataacta 5640 ctaaaaagca tgttaaagat tcatccagta tttcatatct tattattaga actagcacca 5700 gaaaacacaa agattactaa aaatatagaa attaaggaca acatagaata ggaatataaa 5760 gttaaataaa ttctagatca caagtaagtc agtagaaaac catactacct tataaaatag 5820 aaaggttata atacctcaga aaacacatgg gagcctatta agaatctgac aggctgccac 5880 caacttattt aacagtacta tttaaaggga aatcaaagtt ctcccagaag gaggggggca 5940 tctaatcaat ctctgctatt atctrattaa aattaggatt atcagaagtc acagctaact 6000 attaggcctc agcagtagga ttaggctaat tagataactt cttctcatct aagatctata 6060 acaactcagc atcatgttct gacattttaa aaccacattc tttaagaaac tattattatt 6120 tgcgaagacg agaatgacga gccaaggtag tgataatctt taactagacc ttagctaagc 6180 gagcttgaag ctgctctagt ttaggttata ataacttaag tttattatta gccttaataa 6240 tatcattagc aaccttctct tctgcttatt ttaaaatact ctactctttt acagtataga 6300 actccttctt atataagcaa ccttgatggg tgcaagtagc atatttagaa taataataag 6360 tttaatccat cacacactcc taaccagaag aagaatagtg ctcacacaag aaagacaaaa 
6420 taaaactatt ataataaata tactcttctt acttagcaca gacagcagga tagttacagg 6480 aaggtttcta gggaatatct aacaatatgt cagcacgacg cttaggcatg ataggcgaat 6540 ataaggaaaa gagaaaaact caggagcaca cttacctggg aagcactgtc cttttaaatt 6600 aagttttaag gacaaaacac taggagaggg ggaggtga 6638 // ID Gypsy-24_LBS-I repbase; DNA; FNG; 6363 BP. XX AC ABFE01002141; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: internal DE portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-24_LBS_; KW Gypsy-24_LBS-LTR; Gypsy-24_LBS-I. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-6363 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01002141; Positions 30096 23734. XX CC Positions [5172-5690] - Integrase core CC 'CCAC' target site duplication CC LTRs are 100% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 446..1663 FT /product="Gypsy-24_LBS-I_2p" FT /translation="MSSPQLGAPQQTAQLSPVKPPQDLPPIMSAPVTLRSF FT SGETDDDVQPGEFLKTFRCFTTSARITDDDLIITSFGDHLKYGSPADDWFT FT ELTSTKQTWKEVETVFLQCFPPVEKAKRTETKLERELCELRLKTEDLGKKE FT KYAGEEVYTHVIFAEKALSLAKQAKISTGSNSIWKVRDELPDIIRQKVKET FT YPTWTEFCAAIKGVEMAHIRDGVKKYQKEKEEKEKTEAAIASLQRLHLQGQ FT RRPPAAPTSSVSNVSQTQQSSTLGGQCNNTPQSNSTNTNAYMGPSSGQGNL FT PHPAPSPITEADRSALICSLALYPMQPNTEAGIAAWHAQLREWKAKNGETA FT EVTVNMGFPLQPGGEAPGSGECYRCGRGGHRRIDMRCTTDQINNKERTFRT FT ICGRILRAAPTS" FT CDS 1945..3240 FT /product="Gypsy-24_LBS-I_1p" FT /translation="MHWLVLHGPQGEKVRVKALFDGGAMCTSFFMKAQHRL FT CGQTKPSNRRLRMANGAIVQSKAVWNGVLELEGIQAEAEFEIFDSGGSWEF FT LFGKPLLHRFRAVHDFDTDTVTIRSAHESITLHNFAGGNTPSAPTGVSLTL FT DVEQQEISAGGSSGVNPPPRQVSYTDIVTPLVQYDESNPFLGFAGDTLETA FT DESAMCTDEEHAWDEEEFTEPERNSETQGRQSEEQGMNQGGGNTPPLREVL FT SENLTWEDVKETDDTHAAASEVVANAAQKPTTAESIHVHRVEPLLLEHEDH FT SGGNEQPPSRGVQTQTTGLQMSNPADTPCLVLPVSSITDATSEEALYTRHT FT EPFQPMRVARILDLVQIGDDVTMDQREEIRQLVAEFTDCFALSLSEVNLIP FT DAVHKLNIPEDATFRTKLPQRSYNPDQRAFMEARLTRC" FT CDS 3237..6362 FT /product="Gypsy-24_LBS-I_3p" FT /translation="MLKAGIIRSIHPGDVKCSAPSVLAQKAHENTGLSLDE FT LKHKVNDECVKHNLPAAFDLPQRPAPTVNAATFTRPKKWRLCQDFGEINKV FT TPIAPVPQGDIRAKQLRLSGHRYIHIFDFAAGFYGIAIHPDSQPYITFHLE FT GRGHFAYEHMPFGVTGGPSEFGHTVGQRMHDLIADGTCENFVDDGGSAADS FT FEEGVVKLRRILERVRRERLSLSPGKLQLFMTEAVFAGARVGPGGVSPDSS FT KLTAVVDWKIPEDASHLEGFLGLTAYFRDLVKGYATLEKPLRDLLRAVDIP FT NGAKKSAYQRIMKAYKLQPHWKAEHTTTFINLKARLVSEPVLTAPQFDGTH FT FILTTDACKDAFAGVLSQRIKTTLPGGKEVTRLHPIGFASKRTSTSEEKYK FT PFLLEFAALKYSFDKFSDIVYGYPVEVETDCQALRDILLSDKLNATHARWR FT DGVLAHNIVDVRHIPGKVNIADSISRQYEGTEKAAGDGSEWTVTPDWEEVT FT GLVHDLYQVADLPDLTVLKERFKDEPLYLDVIDAIIGHSSQDATVWEKKRA FT QHQKTQYMIDEGKLWFVAGGSGTRARARRECVSRAEAVALAKIEHEQGGHW FT HRDTMKMALLDQYHSPKLDESIVKAILDCARCKNFGGTHLHSLLQPITRCH FT PFELLVGDYLSMPVGKGGYHTAGIYLDTCSQHVWGYKFKTHGTAITTNKSL FT DDIFHNFAPPETFMADGGKHFKNREVAENCERWGTKLHTVAAYSPWVNGLV FT KGTNKLLLYVLARLCAPEVGEDGWQTTTWDKLPTTWPDHFDKAIRILNWRI 
FT LPALKFSPKEVLLGLVVNTLKTPLEVSSSFLPPADVDTHMTYAAQQQLDGY FT AEAVHHAVQRKTAFNRRVKASKAGVVEFKTGQLVQVYNNKLALTLSTERKL FT APMWSPPRRVAERLLNSYKLETLEGAPLDGLFHARRLRSFTPREGTTLAAE FT QKNFEDALTAEEPSRAEVPAQEIEAQEVVDGEDSQEAVLRDSEDDSGGEAG FT FFYDEEEEEVQEEDAMGIGARVAARRRGRLHNGGGQ" XX SQ Sequence 6363 BP; 1752 A; 1702 C; 1662 G; 1247 T; 0 other; cctggtgatg agtgcgtgat tcgtactttt acatgcgcat atctctcgtc ttaaaacgtc 60 cgtcctcata tcaccgtctt aaatccacca tcccacgtca tacgtcacac cgtcccacgt 120 catacgtcac accgtcaacg ccaccacgtc caacgtccag cgtctaacgg cttttcatct 180 ccgaaatcaa ctcggctgca aacgtgtttc caacgtttca ccaagctcat cgtgtacccc 240 gcacccataa agcaagctgg cacccaacaa tacctggata actcagccaa gagtcaacgg 300 cagattcgtg gctagagaca gagccgctac cattacaccc tcatctaata caacctcggg 360 tacatccacg ccatctactg aaagttcgct cacgctttcg gacatactag agcctgcctc 420 tccgttcgag gatctctttg gatcaatgtc aagtcctcaa ctaggagccc cacaacaaac 480 cgcacagctc agccccgtta aaccaccgca ggaccttcca cccattatgt ctgccccagt 540 caccctacgc agcttcagcg gcgaaacaga cgacgatgtc cagccaggcg agtttctgaa 600 gaccttccgc tgcttcacaa cgtccgcccg catcacggac gacgacctca tcatcacatc 660 cttcggagac cacctcaagt acgggtcccc tgcagacgac tggttcacag agctaaccag 720 cacgaagcag acgtggaaag aggtagaaac agtgtttttg caatgctttc cgcccgtcga 780 gaaggctaaa aggacggaaa ccaagttaga gagggagttg tgtgaattaa ggttgaagac 840 agaggatttg gggaagaagg agaaatacgc aggcgaagaa gtctacaccc acgtcatatt 900 cgcagaaaag gctctcagcc ttgcaaaaca agccaagatc agcacggggt cgaattcgat 960 ctggaaggtc agagacgagc tacccgacat tatacgccag aaggtcaagg aaacgtaccc 1020 aacctggacc gaattttgtg cagcgatcaa aggggtagaa atggctcata tacgagatgg 1080 ggtgaagaag taccagaagg agaaggagga gaaggagaaa acggaggcgg ctatagcaag 1140 cctacaacgc ttacacctgc agggacaacg tcgcccccca gccgcaccca catcctcagt 1200 atcaaacgtc agtcaaaccc agcagtcatc aaccttggga ggtcaatgta acaacacgcc 1260 acaatccaat tcaaccaaca caaatgcata catgggcccg tcaagtggcc aaggcaacct 1320 gccccaccca gctccctcgc caatcactga agctgatcga agcgcgttaa tctgcagctt 1380 ggccctctac cccatgcaac ccaacaccga agcaggcatc gcagcatggc 
acgcccaact 1440 cagggaatgg aaggcgaaga atggcgaaac tgctgaggtt acagtaaaca tggggtttcc 1500 actacaacca gggggtgagg cccctggttc aggggagtgc tacagatgcg gaaggggagg 1560 acatcgccgc atcgacatgc gctgcacgac tgaccaaatc aacaacaagg agcgcacgtt 1620 ccgcacgata tgtggccgca tccttcgagc tgcgcctacc tcttgagtca acttcgtcag 1680 cgatgcagac gatgaattca gctggctcaa ctatcaagcc ttcaagcccg cgccgagccc 1740 gggaaatggg gaagggccgc ctgcataagt gacgagcggg cggccccgat ttcagtgaga 1800 cttacaggag tcgaccgccc cagcagtcag aacggtttca atgatgtata taacgcatgc 1860 gagaattcat cagtcattga tttgtacgct gttggggatg aggataagac atgggataca 1920 ctacgacaaa aggggccgcc attcatgcat tggttggtgt tgcacgggcc ccaaggcgag 1980 aaagttaggg ttaaggctct tttcgacggg ggagccatgt gcacctcctt tttcatgaaa 2040 gctcaacacc gactttgtgg ccaaactaaa ccatccaatc gacgcttacg catggcgaat 2100 ggggctatcg tacagtcgaa agctgtatgg aatggagtgc tggagttaga aggaatccag 2160 gcggaagcgg aattcgaaat cttcgacagc ggaggcagtt gggagtttct ttttgggaaa 2220 ccgttgctac atcgcttcag ggcggtccat gacttcgaca cggatacagt gaccatccga 2280 tctgcacacg agtcaatcac gctgcacaat ttcgctggag gaaatacccc atcagcacca 2340 acaggcgtta gcctcacatt agatgtggaa cagcaggaaa tctcagcagg gggctcttcg 2400 ggcgtgaatc cccctccgag gcaagtctca tacacagaca ttgtgacacc tttagttcag 2460 tatgacgagt cgaatccttt cttaggtttt gcaggtgata cattggaaac agcggacgag 2520 agcgccatgt gcacagacga agagcatgca tgggatgaag aggaattcac agagccagaa 2580 aggaacagcg aaacccaggg gagacagagt gaggagcagg gtatgaacca ggggggaggt 2640 aacacgcccc ccttgaggga agtacttagc gaaaatctca cttgggaaga tgtaaaagaa 2700 actgacgata cacatgccgc agcctctgaa gtcgtcgcca acgccgcaca aaagcccaca 2760 actgccgaaa gcattcatgt acaccgggtc gaaccactgt tattagagca tgaggaccat 2820 agtgggggga atgaacaacc cccctcgagg ggagtacaaa cccaaaccac tggtctgcaa 2880 atgtccaatc cagctgacac gccctgcctc gttttgccag tttccagcat tacagatgcc 2940 acgtcagagg aagcactcta cacccgtcac acagagcctt tccagccaat gcgtgtggcc 3000 aggatcttag acctcgtgca gataggtgat gatgtcacca tggaccaacg tgaagagatc 3060 agacagcttg tcgcagaatt cacggactgt tttgcactat cccttagcga agtcaacttg 
3120 atcccagacg ccgtgcacaa actcaacata ccagaagacg ccacgttccg tacgaagctg 3180 cctcaacgtt catacaaccc agaccaaagg gcctttatgg aggcaaggtt gacgagatgt 3240 tgaaagccgg catcatacgc tctatacatc ccggagacgt caagtgcagt gcaccctcgg 3300 ttttggccca aaaagcccat gagaacacag gtctctctct ggacgagctc aagcacaagg 3360 tgaatgacga atgcgtgaaa cacaacctac ctgcagcatt cgacctcccg caacgcccag 3420 ccccaaccgt gaatgctgct acgttcacaa gacccaagaa atggcgccta tgccaagatt 3480 ttggagaaat caacaaggtc acgcccatcg ccccagttcc ccagggggac attcgcgcca 3540 aacagctcag gctatcggga catcgatata ttcacatttt cgactttgca gcaggcttct 3600 acggcatagc tatccatcct gattcacagc cgtacatcac atttcatcta gaaggacgtg 3660 ggcatttcgc ttatgagcat atgccgtttg gcgtcactgg cggtccatct gagtttgggc 3720 acactgtagg ccagcgtatg cacgacctca ttgcggatgg tacctgcgaa aattttgtgg 3780 acgatggtgg gtcagcggcg gattcctttg aggaaggtgt agtgaaatta aggcggatat 3840 tggaacgtgt acgcagggaa cggttgtcgc tgtccccagg aaaactccag ctattcatga 3900 ctgaggcagt atttgcggga gcacgtgtcg gaccaggggg tgtcagccca gactcaagca 3960 agctcacagc ggttgttgac tggaaaatac ctgaggacgc ctctcatctg gaaggctttc 4020 tcgggttaac tgcatatttc agagacttgg tcaaagggta cgctaccctc gagaagccgc 4080 tgagggatct attacgtgca gtcgacattc ctaacggcgc caaaaagtca gcttaccagc 4140 gtattatgaa ggcttacaaa ttgcaaccac actggaaggc agaacacacg acgacgttta 4200 taaatctaaa agcacgcttg gtatcggagc ctgtgctaac ggcccctcaa ttcgatggca 4260 cccatttcat tctcactacc gacgcatgca aagacgcttt cgcaggagta ctatcgcaga 4320 gaatcaagac gactctacct ggtggaaagg aagtaacccg cttacaccca attggttttg 4380 cgtctaagcg aacgtcaacg tcggaggaga agtacaaacc tttcctcctc gagttcgctg 4440 cattgaaata ttcattcgac aaattctcgg acattgtata tgggtaccct gtggaagttg 4500 aaacagactg tcaggctctg cgcgacatct tacttagtga caaactgaat gcaactcatg 4560 cacgatggag agatggtgtg ctggcacaca acattgtcga cgtccgacac ataccaggca 4620 aagtcaacat cgctgacagt attagcaggc agtacgaagg cacagaaaag gctgcaggtg 4680 atggcagtga gtggacggta acaccagact gggaggaggt aactggcctg gtacacgatc 4740 tctatcaagt ggccgacctt ccggacttga cggtcctgaa ggagaggttc aaggacgaac 4800 
ccctgtacct cgacgtcatc gacgcaatca tcggccactc gtctcaggac gctacagtgt 4860 gggagaagaa acgtgcgcaa catcagaaga cccagtacat gatagacgaa gggaagttgt 4920 ggtttgtggc agggggaagt ggtacacgtg cgcgagctag gagggaatgt gtatcaaggg 4980 ctgaggcggt cgcactggcg aagatcgaac atgagcaggg cggacactgg catcgtgaca 5040 ccatgaaaat ggcgttgttg gatcaatacc acagccccaa gctggacgag tccattgtca 5100 aagccatcct ggactgtgca cggtgcaaga actttggagg tacacaccta cactcccttc 5160 tacaaccaat aacccggtgt catcccttcg agctgctggt gggcgattat ctgtcaatgc 5220 ctgtagggaa aggcggctac cacactgctg gaatctacct cgatacgtgt tcgcagcatg 5280 tgtgggggta caagtttaaa actcacggaa ccgccataac aaccaacaaa tctcttgacg 5340 acattttcca caatttcgct ccccccgaga cgttcatggc agatggtggc aagcatttca 5400 agaatcggga agttgcagag aactgtgagc gttggggtac taaattacac acagtggcag 5460 catactctcc atgggtcaac gggctcgtca aaggcaccaa caaacttctc ctttatgttc 5520 tagcacggtt atgcgcacct gaggttggcg aagatggatg gcaaaccacc acgtgggaca 5580 agctaccgac aacgtggccg gaccacttcg acaaggccat ccgcatcctc aattggcgga 5640 ttctgccagc tctcaagttc agcccgaagg aggtcttact gggattagtg gtgaacacgt 5700 tgaaaacacc actggaggtc agttcatcct tcttaccgcc cgctgacgtc gacacgcaca 5760 tgacatatgc ggcacaacaa caactcgacg gctacgccga ggcagtgcat catgctgtac 5820 aacggaagac cgcgttcaat cgtagggtca aggcctccaa ggcaggagtt gttgaattca 5880 agacagggca actggtacaa gtatacaaca ataaactcgc cctaaccctt agtacagagc 5940 gcaagctggc ccccatgtgg tcaccccctc gccgcgtcgc agaacggctc ctcaactcat 6000 acaagcttga aacattggag ggcgctcctc tcgacggcct attccatgca agacgcctcc 6060 ggagtttcac gccaagggag ggcacaacat tagcagcaga gcagaagaat ttcgaagatg 6120 cgcttactgc ggaagagccc agcagagcgg aagttccagc acaggagatc gaagcccaag 6180 aggtagtaga cggcgaggac tcacaggaag cggtgttgcg agattcagag gacgacagcg 6240 gaggagaggc gggttttttc tatgacgaag aagaggagga agtacaggag gaggacgcga 6300 tgggaattgg agccagagtg gcagcgagga ggcgaggacg cctccataat ggaggggggc 6360 aga 6363 // ID CGT1 repbase; DNA; FNG; 3255 BP. XX AC L76172; XX DT 13-AUG-1999 (Rel. 4.07, Created) DT 13-AUG-1999 (Rel. 
4.07, Last updated, Version 1) XX DE CGT1 is a non-LTR retrotransposon - a partial sequence. XX KW Non-LTR Retrotransposon; Transposable Element; CGT1; ORF1; ORF2; KW reverse transcriptase. XX OS Glomerella cingulata OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Hypocreomycetidae; OC Hypocreomycetidae incertae sedis; Glomerellaceae; Glomerella. XX RN [1] RP 1-3255 RA He C., Nourse P.J., Kelemu S., Irwin A.J. and Manners M.J.; RT "CgT1: A non-LTR retrotransposon with restricted distribution in RT the fungal phytopathogen Colletotrichum gloeosporioides."; RL Mol. Gen. Genet 252(3), 320-331 (1996). XX DR GenBank; L76172; Positions 1 3255. XX CC ORF1: <1..1739 (has a stop codon) CC ORF2 (reverse transcriptase): 1702..>3255 CC There are about 30 copies of CGT1 in the genome and the complete CC sequence length is 5.7 kb [1]. It has 3' poly(A) tail and is CC flanked by 10-15 bp-long target-site duplications. XX SQ Sequence 3255 BP; 875 A; 1017 C; 742 G; 621 T; 0 other; atggcaatta ttacatccgg acgggccgag cccaacttcc ctcctacata ccaccccacc 60 ggcacccact gttagtgccg cgctgggtct gccaccctaa cactaacacc gcccaaccta 120 ccccccaacg acaagaaccc tcgcccgcga tggcctcaca gccagccgcg ggcccctcgc 180 agggccctat cggcctccta tcctcgatgc ataacctgcc gaagacccct ccccccccta 240 cctccctacc cccgcgtccg aggggcctca cgcccccccc tatccagctc ctgggaagag 300 ggcgacgatc ggccactgcc gcccttcagc aagcccggga aggccgggcc ctccgagatg 360 aggccctaga cctccttgca aaaggactag ataccctcat tggaacggct ccagcccgcc 420 tgaaggctac agcccaggac cttatcgact ccttcaccag atttgcaact gcagaaatct 480 ctggacccac ccctaccccc agccccggaa aaacgtacgc ccaggccgtt ggaggctcca 540 ctataggtgg acccctctcc ggccccggcc ccggccccag gccgagccga acaacaggct 600 ggcaggccac caactacgca aacaagccaa aacgggccca ggatctccgc ctactagttc 660 ggctccctgc tgaacatatc cctacatcta gatccctggc ccctcactca gttaagatgg 720 tagttaccaa agagcttaac ctacagccat aagacctagt tgaagtccga catactgcta 780 ctggcttctc gctcaagccc cagaacaagg aaatccaagc cctactcctc cagagggctc 840 ctgagatcaa 
gaactgtctc acggctgaga aggtagaggt gcccacaaag tggtaccaat 900 ataatatccc attctgcccc ctctacttca caagccttga tggtagtagg gtaagcatag 960 aatctgaaat agaagaggag gtccttctaa ggacaaagca aagacttctc tcctggcgcc 1020 caacccgagg tagcctaaac ctggctacag ccacgcaaag cctgattatt gccctcccac 1080 aggaggttaa ggaaggcttc cgcctattcg gccagtcagg ccctgcccgt ctgcttcccc 1140 caaggaagcg tacccctaaa cgacactctc ctggctgcca gggctttcac tcaaaccgtt 1200 tttgcactag gcaagccctc tgtgagaact gcagcaaacc cttagatact catgaggctg 1260 ggctatgcct agctagatct aagtgcgcta actgctatgg gcccttccca tcaagccata 1320 cagactgccc agctagacct ttaatagtga atgggagcct taccatccca acccggaagg 1380 aactgcgagg gatccgaaaa gctggagagg aaacaactaa gaacctctct gctagccaga 1440 cctgccccct aggaagcaca cagcaagcca acgctacaca ggggcaacag ccagcatcaa 1500 aacgccctcg cccccttaca cctacacagg actcaattgt tgttgcagat accactagtg 1560 tggctggaag cagcagtagt atggatgagg atgagactga agactctgaa gccctccccc 1620 ttactaaccg ccctatccgc ccgcttccaa gcagtagcag aagccagagg gtagcacaga 1680 agcctgacta taacgttatt aatgcctaca cccaccttga cctcaactca gagccttagg 1740 ttcaggccag cttccccaca ggcactacgg gtcgcccagt gtaacgtggg gaagataagt 1800 ccagctcata cagctctcct agagctctgc tgggcagatc agatcgatat agtactgatc 1860 caggagccct ggataggcta tagagataag atgcaaatca acacacaccc aggctataac 1920 tcttatgtgc cagtagactt ctggaacagc agggacacaa ggcctagggt tatgacctat 1980 acacgcaaaa gtctaaggat acgaattcag caacaaagac ctagcccttc aagggatatc 2040 ttatgggtga aagctggcga cctaactatt atcaatatat ataggccacc aggggagcct 2100 attaaccaca ccactactac cctactagat tatcaaccac cagagggaac ccttattgct 2160 ggggatctga acgcccgcca tccctcgtgg gagcctggat cctctagccg gaaccgggga 2220 gacgatattg cagagtgggc agaaggctat aatctcaagt ttattgggga agctggggct 2280 ccaacccata cccatgggca cacactcgac cttaccttct caaatattcc ctttgcagag 2340 gcctatatca gcgcacacct acatccagga gctgaccacc aagcaatcat aataatcata 2400 cctataaaca agagccagga gcaatggtgt cagtctagcc acccaatagt ctcagattca 2460 gccctacttc ggctggccga cctagtacag atggggatcc cagccctgcc taccttaggg 2520 gatataccta ctccagcaga 
gatagactcc ctagctaccc aactagtcag cctcctctct 2580 acagctatcc aagcagttgg gaagagacgg actaaccagg gaagtagagc cccctggtgg 2640 acagaggagt gctcagaggc ccaccgagca gttattgagg ctaggcgtca gcaagcccaa 2700 gaacctgggc tagcagtacc accgcctatc ctagaggagc agaaggcctt tctgaccaca 2760 gtcaggaagg caaagcgcgc ctactgggct gctaaggttg aggaggctgg gactaaccag 2820 aatctatatc agatattaag ctggcgtaag tctggtctag ctgttaaagc cccacctcta 2880 ctagtagatg gacaggctat caacaacagt tacgataaag cccaagcctt acggaaagct 2940 atactagaga gatttgatgc tgcagacgac ctacctggga acccattcac tgttcctatg 3000 gttataaggc gtctaatccc ctggcagcac acagtcagca tagaggaagt acgccaggca 3060 actctcacta aagctaccac ccctgggatt gatgggatca ctacaaagtt actacaatct 3120 atttgggggc taatagctac acctgtacag aagctctttc aggcctgtgt ccaaacaggc 3180 tacttcccta ggaccttccg caaagcagag gttgttatga tccagaagct ggggaaaaca 3240 gacttctcaa agact 3255 // ID Copia-1_MLP-I repbase; DNA; FNG; 4240 BP. XX AC AECX01002023; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-1_MLP_; KW Copia-1_MLP-LTR; Copia-1_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4240 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01002023; Positions 12563 16802. XX CC Positions [1662-2159] - Integrase core CC LTRs are 95% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 465..4238 FT /product="Copia-1_MLP-I_1p" FT /translation="MKEHISNFNSLITSLGTSLASMPDFAKMSTGLNACIF FT LSSFNSDESLTPLIQTLYKSEPFTYASVYESMMLEANQRDDVESNMTVNFA FT HNQRNQAFNRFKKPSSSNQNSPNSQSRNQHILSRNPQPQFPNPSSSSQTNN FT GYFRPRGIIPTRFGNQRSDQNQESIREMRERLNRLENQMKSNPSNNSGSMN FT ANSVSRNNNQDETMDGSCFMIESFSAVSASIRSDSFDLTYDSGATDTTVNN FT FGLIIDPKPMFRVLNTYGGTITVTHVGKMSIGGEIIYPVFYVPAGPRNLIS FT ASQLEDHGLTIHHVPSGVIIRKANRIILKFPRDGKLFVAKGRSEVNNIQSI FT KPLTDWHIILGHPSDEYLKQFLVFRNIKVNSFQSSKNCEICVKCKLKARPH FT NHPLPMSTKPFERLHFDVLQVSVLCKARKNYVLVIIDDFSRFNCIYLMKTK FT SESESCLMSFIKEIRNKLGITPAYLHSDRGGEFSSTRFMTYIQEQGISVER FT GPANSPQTNGVAEKFNYTLLVKMRCLLAQSGVPINLWDEAAKYASYLINIL FT PTSRLNGKCAMDVLASVNSSIEPIRDISKTLPFGLKTYVKSTTHGKNFPIG FT VPLLFLGYEPYSDAGRFFDTQSRNIIISRDFVPLPITVNYSDPELLRKPPS FT TLPREAVNSMPGPSPATVPRSFPVEVELPPDPDVSVHDPSLDGSRNDPQED FT LSGQDSQGQESSRRHQEPSPRDSTPDGPSPEQPDGSSPKQPPRKGYAFVPN FT FTTAPRDISSAINEDNILESDGARVTRSRARVLGSEGGEILLAEATTVDKA FT LSNPDELQHWKKAMLDKFESLLSRGTGELVDPPKGEIIIGGMWRLTRKLNE FT FGEVERFKARWVALGNHQIHMLHYFDTYASVARNESFKVLLALAVQHGLYV FT FQFDVETAFLYGDMDAIVYVSQVSGFEVKGQEHKVWKLRRSLYGTKQAPRK FT WREHLVNTLRSLNFNATEGDESLFSNSSYSTFIHMHVDDGFGVARTRAEGQ FT DFLTRLSAHYKLKTKEMPTRHLGYTLDWKEDGSVILHQEDFCVKMLKELKM FT ENSNPVKTPAAQDLHAIVATPAQDADKGLYLKATGMLSYLALHTRPDILFT FT VNLLSQFNNKPTLSHWSAIRHLMRYLNGARRIGLHYTKSVASDNLLTGWAD FT ADYATSLVTKKSTSGFCVTLFGNMISWTTKKQSVVAQSTTEAEFISSNKCA FT KQLRWLSILISSLNIKLKKPPLNVKIHNDNSGAVIISKEAQLNPNSKH" XX SQ Sequence 4240 BP; 1257 A; 927 C; 841 G; 1215 T; 0 other; tggtagcgag agtctagtct tggcaacaat tcactttata gtattagata caatcgaata 60 tgtcttcctc tgtcgatatg gacacttcct caacttcttc cgaatatgat actggaatcg 120 atgtctcaaa attcagattc tttaaactca atcaaaccaa ctgggcaaag tggcagatca 180 tggtcgacag cgttttaacc agcaaaggtt ttgataaact gttagatgtt gactggaatc 240 gggacaacaa agaaactcaa acttatcgtg tccgaaatgc ttgggctttt tctctcctat 300 gggacattgt tgaatcagac cttcatgata taatcataaa taattccccg agtttctggc 360 atacctatca agccctcgga agagcatgtg gaatgttatg tattgttacc gtctcagcta 
420 aacttcgaaa gacttttaac actgtctaaa atgtgggaac ttcgatgaag gaacacattt 480 ctaatttcaa tagtctgata acttcactcg gaacctcctt agcatctatg cctgattttg 540 ctaagatgtc taccggtctt aatgcctgca tttttctgtc gtctttcaat tccgatgaat 600 ctctaacccc tctcatacaa actctttata aatcggaacc attcacatat gctagtgttt 660 atgaaagtat gatgctggaa gcaaatcaac gagacgatgt cgaatctaat atgacagtaa 720 actttgctca taatcaacga aatcaagcat tcaatcgttt caagaagcct tcctcttcga 780 atcaaaattc tcctaatagt caatccagaa atcaacatat tctatctcgc aatcctcaac 840 ctcaatttcc taatccatct tcttcctcgc aaacaaacaa tggctacttt cggcctagag 900 gaataattcc aaccagattt ggaaaccaga gatcggatca gaatcaggaa tcaattcgag 960 aaatgcgaga aagactgaat cgtttagaaa atcaaatgaa gtcgaatcca tcaaacaatt 1020 ccggatcaat gaatgcaaat tcggtctcta gaaacaacaa tcaggatgag accatggacg 1080 gatcatgttt catgattgaa tctttctctg cggtatctgc atctatacgt tctgatagtt 1140 tcgatttgac ttacgattcg ggggcgactg acacaactgt caacaacttt ggtcttatca 1200 ttgacccgaa gcctatgttc agagtattga atacatatgg tggtactata actgttactc 1260 atgtcgggaa aatgagcata ggaggggaaa tcatctatcc tgtcttctat gttcccgctg 1320 gacctagaaa tctgatctcc gcatctcaac tagaagatca tggtttaacc attcatcatg 1380 taccgagtgg tgttataata cggaaggcaa atcgcataat actcaaattc cctcgagacg 1440 ggaaactttt tgttgctaaa ggaaggagtg aagttaacaa tattcaaagc atcaaacctt 1500 tgacggattg gcacattatt cttggacacc cttccgatga atatctcaaa caatttctgg 1560 tcttcagaaa catcaaagtc aatagtttcc aatcgtcaaa gaactgtgaa atttgtgtga 1620 aatgcaaact aaaagccaga cctcacaatc accctcttcc gatgtcaaca aaaccatttg 1680 aacgtttaca ttttgatgtc cttcaagtct ccgtgttgtg caaggctagg aagaattatg 1740 tgctagtaat aattgatgat ttctctagat ttaattgtat atatctaatg aagacaaaat 1800 cagaatctga atcatgtctt atgtcgttta tcaaggagat tagaaataaa ttaggcataa 1860 caccagccta tctacattcc gacagaggtg gagaattcag ctcaactcga tttatgactt 1920 acattcaaga gcagggtata tcagttgaaa gaggacctgc taatagccct cagaccaacg 1980 gtgttgctga aaagttcaat tatactttat tagtcaaaat gagatgtctg cttgctcagt 2040 ctggtgtacc gatcaatttg tgggatgaag cagcaaaata tgcctcttac ctcatcaata 2100 ttttacctac 
aagtcgcctt aatgggaagt gtgcaatgga tgttctagcg tcggtcaact 2160 cttcaattga accaatacgt gacatcagca agactttacc ttttgggttg aaaacgtacg 2220 tgaagtcaac cactcatgga aaaaattttc ctattggtgt tccacttctt ttcttggggt 2280 atgagccata ttctgatgct ggtcgatttt tcgatacaca atcccgaaat attatcatca 2340 gtcgtgattt cgttccactt cctattactg tcaattattc cgaccccgaa ctactcagaa 2400 agccgccctc aactctacct cgagaggcag tcaattcaat gcctggtccc tctcccgcaa 2460 ctgttccaag gtcttttcca gttgaagtag aactacctcc ggatcccgac gtctcagttc 2520 atgatccttc actggatggt tctagaaacg atcctcagga ggaccttagt ggacaagatt 2580 cacaaggtca agaatcgtca agaagacatc aagagccctc tcctcgagat tcaacacctg 2640 acggtccctc ccctgaacaa ccagacggtt cctcccccaa acaaccaccc aggaaaggct 2700 atgcatttgt tccaaatttt acgactgcac ctcgcgacat atcatctgcc atcaatgaag 2760 ataacatact tgaatcggac ggtgctaggg ttacaaggtc tcgcgctagg gtgcttggaa 2820 gtgaaggtgg tgaaatactc ctagcagagg caacaaccgt tgataaagct ttgtcaaatc 2880 cggatgaact tcagcattgg aagaaagcga tgttggacaa gtttgagtcg ttgctgtcgc 2940 gtggcactgg ggagttggtt gatcctccaa agggtgaaat tattattggg gggatgtggc 3000 gactcaccag gaagttgaac gagtttggtg aggttgagcg ttttaaagct cgatgggttg 3060 cattgggaaa ccatcaaatt catatgctac actatttcga cacctacgca tccgtagcac 3120 gaaatgagtc attcaaagtc ttgctagcgc ttgcggtgca gcatggcctg tacgtttttc 3180 agtttgatgt cgaaacggcc tttttgtatg gtgatatgga tgctatagtg tacgtttctc 3240 aagtcagtgg gtttgaggtt aaaggtcaag aacacaaagt ttggaaactc aggcgttccc 3300 tgtacggtac caaacaagcg cctcgcaaat ggcgtgaaca tctggtcaat actctcaggt 3360 ctttgaactt caatgcgact gaaggcgacg agtctctctt ctctaactct tcctattcaa 3420 catttattca tatgcacgtt gatgatgggt ttggggtggc tcggaccagg gctgaaggtc 3480 aagatttctt gacccggcta agtgcgcatt acaaactcaa gacaaaggaa atgccaacac 3540 gacatcttgg ctatacgctt gactggaagg aggacggttc agtcatttta catcaagaag 3600 acttttgtgt caagatgctc aaagaactta agatggaaaa ttccaatcca gtcaagactc 3660 cagccgctca ggatctacat gctattgttg ctactccggc tcaagatgct gacaagggtc 3720 tttatctcaa ggcaactgga atgcttagct atctagcctt acatactcgt cctgatatac 3780 tattcactgt caatctactc 
tcccaattta acaacaaacc aactttgtct cactggtcag 3840 caattcgtca tcttatgcgc tatcttaatg gagcaagacg aatcgggcta cactacacga 3900 aatctgtagc ttctgacaat cttctgactg gctgggcgga tgctgactat gcgacgtcgt 3960 tagtaaccaa gaaatcaacc tctgggttct gtgtgactct ttttggtaat atgatttcat 4020 ggacgactaa gaagcaatct gtagttgccc aatcaaccac tgaagcagaa tttatatctt 4080 ccaataaatg tgcgaagcag ttaaggtggt tatcaatttt aataagcagt cttaacatca 4140 aattgaagaa acctcctcta aacgttaaga tacataatga caactcgggt gctgttataa 4200 tcagtaagga agctcaactc aacccaaatt ctaaacacat 4240 // ID Copia-18_MLP-I repbase; DNA; FNG; 4654 BP. XX AC AECX01001037; XX DT 22-APR-2011 (Rel. 16.04, Created) DT 22-APR-2011 (Rel. 16.04, Last updated, Version -1) XX DE LTR retrotransposon from the Melampsora larici-populina genome: DE internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia-18_MLP_; KW Copia-18_MLP-LTR; Copia-18_MLP-I. XX OS Melampsora larici-populina OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina; OC Pucciniomycetes; Pucciniales; Melampsoraceae; Melampsora. XX RN [1] RP 1-4654 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Melampsora larici-populina RT genome."; RL Direct Submission to RU (20-APR-2011). XX DR Genome; AECX01001037; Positions 71027 75680. XX CC Positions [1945-2469] - Integrase core CC 'TTCAT' target site duplication CC LTRs are 99% similar to each other. 
XX FH Key Location/Qualifiers FT CDS 79..4644 FT /product="Copia-18_MLP-I_1p" FT /translation="MSNDHVASSPDPSLTDSETELEDSFSSLSESRTETPG FT SDGMPEATDSKTFGTPIQQFVQKLTSLLAKYNIDTKLSDGNFPVWATEVKR FT VLLAIDYQKYLTKVDFKAPSLTDEQHAKVKLVISIWLLSLMDAHNKTRCQT FT MLKLQNCNDKADLDDENDDDDDDDINYEPALIWTFLKNHHQKISEAGLQTI FT EDAISDMKILSSDSFKVHCDKFNNLIADFVKYKGRISSASAARKLIKTVKS FT RINENVSENIYNYVVPLTREGVVQYLIDYEARNGGFTTPAIVEASQSTYST FT AAAQSGGGNYRRNKPRCTENKCISSSHDDDKCFAKPKNHAARDIWIAQMEA FT KRNRPQYTRQQSSNSSSNVTGIKQLSLPSASVAVASNNPFASLHIYLDEPG FT EICASATALSTFESSHTVFDEDTPAANVVNDSGGHWALYDTGATHYMFKND FT QLFTSASMVKVEDSSKRLKLAGGDASLAVHSTGMVQLKSGSGKMFELNNSL FT YVPDLTQNLLAGGAMLRKGVQVLVHPNNQNCFSLVFKGEALFNGVFASNNL FT MYVSLEPVSPITDSTATNISAEDSSNLQHRRLGHLNNRYLKIMCNHKSALG FT LPKKLSQLDECETCSLSKNVKIPHSSTRPRASRHLENVHVDLSGIIRVKGL FT KNEMYYTLFCDDFSSYRHIYFMKDKTKETVYEIFRAYIALAERQTCKMIKQ FT FTLDRGGEFLNDLLGTELRERGITLHLTAAHTPEENGVSERGNRTISTKAR FT SMMIESGTPLRFWVQACQTAVFLTNRTITASLSGNRTPFEAWHFQKPTVDH FT LCVYGCLTYSLIRKELRGSKFAPVSSKGVLVGFEEDNFNYHVYDLASMKIH FT ITHNATFNENSFPFLEDSPTKSSQNTPPSTLPDVSLKFFDSDTDDDEEDAV FT SPKIDEVIPEPVPSVNEPCHGVVQNPPSVVTNDQTAPLRRSARTKGPVNYS FT ASAVCDDLDESSLRASWELTQLFDCVLPLCNMVTSAQDVPKSYSKAVNGLE FT GIEWLKACKKEIQAMQDKKVWELVDRPDSSNVIRGLWLFRKKPTSNAKNPV FT KFKARFVAMGNTQVEGEDYFETFAPTGKPTSLCLLIAIAAIFGWEVHQMDA FT VTAFLNCDLHELIYVEQPEGFRQPGEEHKVCRLLKSLYGLKQAPKYWQDDV FT KEFLLSVKFVRCEVDHCIYIRNQGEKFTAVYVHVDDLAITGNDILTFKLEI FT SQRWEMEDLGLASTVVGIEINRLDTFRYSICQEAYALKVLERFNSLDVKPA FT STPFTANLKLYKPDQEEIDDFALRKLPYRGVVGSLMYLAQCTRPDLAHAVG FT TLSQHLDWPGFQHWDAACHVLRYLRGTFNLGITYSCDSTITHVRGLKSELC FT PQAMCDADWAGDRDTQRSTTGYVLILANGAVSWKSRLQPTMALSSTEAEYR FT AITEAGQELVWLRTMLTTLGLEDKNPTVLESDNKGAIHLTNKSIFHGRTKH FT IEIHHHWIREVVNSGKLSLQHCPSHLMIANLLTKPLGSQQFASLRKLLGLK FT PITRRT" XX SQ Sequence 4654 BP; 1353 A; 1165 C; 951 G; 1185 T; 0 other; tggtagcgag agtataaaga tccaaacttt cacctcgaga cacaactttt catcatacta 60 tcataccttc aaaggtttat gtcaaacgat cacgttgctt catcaccaga cccttcgctc 120 actgattccg aaactgaact tgaagactct ttctcatcgc tgtccgaatc 
acgaactgaa 180 acaccaggat ctgacggaat gcccgaagcc actgactcaa agacatttgg cacgcctatc 240 cagcagtttg ttcagaagtt aacgtctctt ctagccaagt acaatatcga tacgaagcta 300 tcagatggca actttcctgt gtgggccacc gaagtaaaac gtgtgctcct cgccattgat 360 tatcagaagt atctcacaaa agttgacttc aaagctccgt cacttactga tgaacaacac 420 gcaaaagtca aacttgtcat atcgatatgg ttgctaagtc ttatggatgc tcacaacaaa 480 acccgatgcc agaccatgct caaactccaa aactgcaatg acaaagccga tcttgacgat 540 gagaacgatg acgatgatga tgatgacata aactacgaac ctgccttgat ttggaccttt 600 cttaaaaatc accaccaaaa gatatcagaa gccggacttc aaaccattga agatgcgatc 660 agtgacatga agattcttag ctcggactcg ttcaaagtcc actgtgataa attcaacaat 720 ctcatcgctg atttcgtgaa atataaaggt cgtatatcat ctgcttctgc tgctcgaaag 780 ttgatcaaaa cggttaaatc aaggataaat gaaaacgtct cagaaaacat ctacaactat 840 gtagttccac ttactagaga aggtgtggta cagtacctta tcgactacga agctaggaac 900 gggggtttca ccacgcctgc cattgttgaa gctagtcaat caacgtactc cacggccgct 960 gctcagtctg ggggtggaaa ctatcgacga aacaaacctc gatgcacgga aaacaagtgt 1020 atatccagct ctcatgacga tgacaaatgc ttcgcaaaac ccaaaaatca tgctgcgagg 1080 gacatctgga ttgctcagat ggaggccaag agaaatcgtc ctcagtacac tcgtcaacaa 1140 tcgtcaaact cttcctctaa tgtgaccggt atcaagcaac tgtctctccc atctgccagc 1200 gtagctgtag cctccaacaa cccttttgcc tctcttcaca tttatcttga tgaaccagga 1260 gagatatgtg cctcggctac tgctttatcc acctttgaat cgtcgcacac tgtctttgat 1320 gaagatacac ctgccgctaa tgtagtaaac gacagtggtg gtcactgggc cctgtatgac 1380 accggcgcca ctcactatat gttcaagaat gatcagctct tcacctctgc ttcaatggtc 1440 aaagtcgaag attcatctaa acgactcaaa cttgccggag gagatgcatc actcgccgta 1500 cactccacgg gaatggtcca actgaagtcc ggttctggga aaatgtttga acttaataac 1560 agcttatatg ttcctgatct tactcaaaat cttcttgctg gaggtgccat gttaaggaaa 1620 ggtgttcaag tcctggttca tccgaacaat caaaactgtt tctctcttgt ctttaaaggg 1680 gaggctttgt tcaacggcgt atttgcatca aacaacctga tgtatgtttc actcgaacct 1740 gtgagtccaa tcactgactc taccgcaacc aacatctcgg ctgaagactc ttccaatctt 1800 caacatcgtc gactaggtca tctgaataat cgttacctta agattatgtg caatcacaaa 1860 
agtgcacttg gactgcccaa aaaattatct cagcttgatg aatgtgaaac ctgttctctt 1920 tctaaaaatg tcaaaatccc ccactcatcc actagaccaa gagcctcacg tcatttagaa 1980 aatgttcatg ttgatcttag tggtatcatt agagttaaag gattaaagaa tgaaatgtac 2040 tacaccctct tctgtgatga tttttcctcc tatcgccaca tttacttcat gaaagacaaa 2100 acaaaagaga ctgtatacga aatcttcaga gcctacattg ctctagccga acgtcaaacc 2160 tgcaaaatga taaaacagtt caccctggac agaggaggcg agttcctgaa cgacctccta 2220 gggactgaac ttcgagaaag aggtatcact cttcatctca ctgctgccca cacacctgaa 2280 gaaaatggtg tgtctgagcg cggaaacaga actatcagca caaaagcaag atccatgatg 2340 attgaatctg gtacccctct cagattttgg gtacaagcat gtcaaacggc cgtcttccta 2400 accaacagaa ctatcactgc gtcgctgagt ggaaacagaa ccccatttga agcgtggcat 2460 ttccaaaagc ctaccgtcga tcacctttgt gtgtacggct gtcttaccta ctcgctaata 2520 aggaaagaac tccgtggctc aaagttcgct ccagtgagtt ccaagggagt cctagttggt 2580 tttgaagagg acaactttaa ctatcatgtg tatgatctgg catcgatgaa aattcacatc 2640 acacacaatg ccacattcaa tgagaactcg ttcccttttc ttgaagattc tccgaccaaa 2700 tcatctcaga acacgccacc ttccacctta cctgatgtct cgctgaaatt ctttgactct 2760 gataccgacg atgatgagga ggatgctgta agccccaaga ttgatgaagt tatacctgaa 2820 cctgtaccgt cagtcaacga gccgtgtcac ggtgttgtcc agaacccgcc ctccgttgta 2880 acaaatgatc agactgcacc tcttagacga tcagcacgca ccaagggtcc tgtgaattac 2940 tccgcctcag cagtgtgtga tgacttagat gaatcatccc tccgtgcctc ctgggaactt 3000 actcaactat tcgactgtgt ccttccgctc tgcaacatgg tcacctccgc tcaagatgtc 3060 ccaaaatcgt actcgaaagc tgtaaatggt ctcgaaggca ttgagtggct gaaagcctgc 3120 aagaaggaaa ttcaagctat gcaggacaag aaggtgtggg aattggttga cagacctgat 3180 tcttcaaatg tgatcagggg tctttggctc ttcaggaaaa agcccacctc caacgccaaa 3240 aatccagtca aatttaaagc acgctttgtg gccatgggca acacacaagt tgaaggtgaa 3300 gactacttcg aaacatttgc ccctaccgga aaaccaacct ctttatgtct cctgattgcc 3360 attgctgcca tatttggttg ggaagtccat caaatggacg cggttaccgc gtttctcaac 3420 tgcgatcttc atgaactcat ctatgttgaa cagcctgaag gtttccggca acccggtgaa 3480 gaacacaagg tctgtcgtct tctgaaatca ctttatggac tcaaacaagc gccgaaatac 3540 tggcaggacg 
atgtaaagga atttcttcta agtgtcaagt ttgtgaggtg tgaagtcgac 3600 cactgtatat atatcagaaa tcaaggcgaa aaattcactg cagtgtatgt ccacgtcgat 3660 gatctagcca tcaccggtaa cgacattttg acgttcaagt tggaaatatc ccaacgctgg 3720 gaaatggaag acctaggatt ggcatccacg gtggtaggaa tagaaatcaa cagactggat 3780 actttcagat actcaatttg tcaagaagcc tatgcactca aagtactgga aagattcaac 3840 tccctagacg tcaagccagc tagcacgccc ttcactgcga atctgaaact ttacaaaccc 3900 gatcaagaag agattgatga tttcgctctg cgaaaactac cctatcgggg tgttgtgggt 3960 tctttaatgt accttgctca atgtacgcga cccgacttag cgcatgcagt tggtaccctc 4020 tctcagcacc ttgactggcc tggtttccaa cactgggatg ccgcttgtca tgttcttcga 4080 tacctccgtg ggaccttcaa tcttggaata acctactcat gtgactccac catcactcat 4140 gttaggggtc tcaaaagtga gttatgtcct caagccatgt gtgacgctga ctgggctgga 4200 gatagagaca ctcaaagatc aaccaccggc tacgttctca tcctcgccaa cggagctgta 4260 tcttggaaga gtcgacttca accaaccatg gctctttcct ccaccgaagc tgagtacaga 4320 gcaatcacgg aggcgggaca ggaacttgta tggctcagaa ctatgctaac tacattaggc 4380 cttgaagaca aaaacccgac tgtccttgaa agtgataaca agggcgctat tcatctcact 4440 aacaagtcta tctttcacgg cagaacgaaa cacattgaga ttcaccatca ctggattcgg 4500 gaggttgtaa actcaggaaa actctcactc caacactgtc cgtctcacct gatgattgcc 4560 aatctcctca caaaaccgtt aggatctcag caatttgcaa gccttaggaa gttactgggt 4620 ctcaagccta tcacgagaag aacttgaggg ggcg 4654 // ID MULE-1_Cglob repbase; DNA; FNG; 3159 BP. XX AC . XX DT 17-MAY-2011 (Rel. 16.05, Created) DT 17-MAY-2011 (Rel. 16.05, Last updated, Version 1) XX DE DNA transposon: consensus. XX KW MuDR; DNA transposon; Transposable Element; MULE-1_Cglob. XX OS Chaetomium globosum OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Sordariomycetes; Sordariomycetidae; Sordariales; Chaetomiaceae; OC Chaetomium. XX RN [1] RP 1-3053 RA Yuan Y.W. and Wessler S.R.; RT "The catalytic domain of all eukaryotic cut-and-paste transposase RT superfamilies."; RL Proc Natl Acad Sci U S A 108(19), 7884-7889 (2011). 
XX DR [1] (Consensus) XX SQ Sequence 3159 BP; 763 A; 912 C; 851 G; 633 T; 0 other; gagtgttcct atttaaacag tagttttttt ctatttggac agtaaaggtc cgtgcccgta 60 ctaaaattga tgcgcccaca ccactgttgt cactattaca cacctaatca cagtgcaaag 120 tcctgggaca tgacatacct gccaattgaa gacagcaaca acaccaagac ttcgtcaatt 180 cgcgatcgcg aggtcgtttt cttccgtttt tttcgcccac agccgcaact tccctttatc 240 atcgccacat tcctcgtttt acgatggatt tcaccgattg ggatgccttc ctaggcgcgc 300 cgccgtcgtc tgaactgtcg gaaggcgccg gacttgggca aatcgacgat ttctatcaag 360 aaggacagac acagttgtct gatgctatcg cgaccgcaga aaccgccaat tccttgcaag 420 aagacgccct gctgccgccg ctgccgccgc tgatggcctt gtcaggcgtt gaggaagtca 480 tagagactgc agaatcggcg ccagagcccg tcattccatc gccgccgccc tcggggctgt 540 acgactcttt tgacgagctt ttggccttcc tgcagacgtt ccatcgcgac aacggcgcgg 600 ctatcaagat tctgaagagc gcaaaccctc gcgaggtcta tggcgagaag aaatacacat 660 actatgtcct tgcctgcgat cgcgccggct ctcaaacgtc cacaagcaca ggaatccgaa 720 gatcctcgac tcagcggctc gactgcccgt tcaaagtcat agcatcagcg acgaagcgtt 780 ccggctagaa gtggtcgtat cgcgttgact gccccgagca caaccacccg ccgtctctcg 840 acccgtcagc acacataatg catcgccggc gtaccctagc ccagcagcaa caagcgcgag 900 atttgaccaa gcaccgggcc cttcctgcgc gcgagatgtt agagatcatg agagactcta 960 gcggcgccga acctacgttc tttcgtcagt cagacatcta taacgacaga cagaagatca 1020 ggatcgaagg actcaacggc ttgtcagcca cccaggcctg gattaagcag cttcaagata 1080 acaaccttcg tcactggatc aagattgacg atgacaacaa ggtcgagggc gtcctgtgga 1140 cttacccttg gccagagaag atgtggcgcc agtttcctga agtcctagga cttgacaaca 1200 cgtacaaaac gaaccgtttt cacatgtatc ttttcgaggt cataggcatc acggaccaga 1260 agtcggtagc caacttcgcg tttggcctca tcaacacaga gaaggaagac ggcttcctat 1320 ggctttgtca acagcttgag gaccttcgtc aagaccttca tgtgccggca ccgaccgtgg 1380 tgatcacaga caaggagaca gccctcaaaa atgccctcac ggcgaccttc ccgggtgcgc 1440 agcaacagct gtgtgtctat cacatcaacg cgaaggtgag ggcaagaatc cggtcaagat 1500 ggaaggccga agacggcaga agcgacgacg aggttgacga ggctggcgac aatgaggccg 1560 aggtcgctga cggcgatggc gacctcgtcg cccgcgctgc tgaacaagag gtcgccgagc 1620 atggctgtgc 
ggccttgctg cctgccgccg accctttgac tcgcgatgga atgttccgtg 1680 cgtggcaaca agtcgtgtac gccgaagacg aggctgattt ccagtcagca tggaacaaga 1740 tgaaggctat atacaatcca acgcagaacc acatcctacg ctatatctcg aaggaataca 1800 tgccgtaccg cgagcagtgg gcgcgctgct ttatcaagcg ataccgcaac ttcggtcaac 1860 gagttaatag tcctgtggaa acggcacaca aggacgtgaa gtccttcctc atcacgggca 1920 caagcgacct cctccacctc cacaacgcga tctccctgat gctgcagaaa aaggagagag 1980 actacaacga gaaggccgcg gcgatgcaaa tgcggcagcg ccaacgcttc atccagcagc 2040 aagggtggct agggaacatc cccatgacag tctcctacgt cgccgtcgat ctccttgccc 2100 aacagcaccg gctagccgtg gccgcgatgc cctcaagccg tggcctgatg cctgagccgc 2160 tgaggccttg caccggcagc ttcatgcagc agtacgcgct tccttgcagc cacatgatct 2220 tcaagaagct ggacgacggc gaggccctca gcaaggaaga tgtccacccg cgatggtggc 2280 tggccaagcc tctggtacga tttcctatga ttttctatgg aaaagctact aactgcttgg 2340 tcccaggaca ttgacgagcc cttactccga ctccgcgatc ctgacgtggt ccaaagcctc 2400 cgtggccggc ctcgcctgcc aaataacgag aagatgacgg tgccgtcatc cctgagggct 2460 gacgacggcc aagatgaagc ccagcagcag gagacggcgc agtcaacgca gcagcagcgg 2520 cgacggcagc caaagcagac ttcgcagtca catcgccgtg agcgccgtgc gaagccaagc 2580 gcgcgtcgcg atctatcaca atttgaggtt gaaggctcgc aaaggtcatc acaaaggtcg 2640 tcccggagcc gcggtcgcgg ccgcacccgg gcagcttcgt cgccgtcgac aaggagttcg 2700 cagtcacgcg taaattcgca gtctacgaga actgcaagag gacgatcgag gaggtcagcg 2760 atagcctctt ctacggcgag gctggccgcg atagctgcag ccgcctttga gaccgacccc 2820 gagaccgacc ccgagaccga gtctgaggct gaggacgtga ttgaggtcat cccggccacc 2880 caacccggcg ctgacgagcc ggatggacta gccggcccgt gagaagtgtt gacgttgaat 2940 tagaagggcc ggaaatgcgt tttttgaaga attggcaccg attgatagtc gatagacctt 3000 atagcgtgtg attttgtatg aagatatcct ttcccagtgt cttacagtgt attttggtgt 3060 gtgggtgtgg caattgcgat tttggggggt aaatctagct attaatgtgg acctttactg 3120 tccaaataga agaaaactac tgttcaaata ggaacactc 3159 // ID Gypsy-25_LBS-LTR repbase; DNA; FNG; 346 BP. XX AC ABFE01002558; XX DT 29-JAN-2011 (Rel. 16.02, Created) DT 29-JAN-2011 (Rel. 
16.02, Last updated, Version -1) XX DE LTR retrotransposon from the Laccaria bicolor genome: long DE terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-25_LBS_; KW Gypsy-25_LBS-I; Gypsy-25_LBS-LTR. XX OS Laccaria bicolor OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; OC Agaricomycetes; Agaricomycetidae; Agaricales; Tricholomataceae; OC Laccaria. XX RN [1] RP 1-346 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from the Laccaria bicolor genome."; RL Direct Submission to RU (29-JAN-2011). XX DR Genome; ABFE01002558; Positions 359 14. XX SQ Sequence 346 BP; 106 A; 70 C; 87 G; 83 T; 0 other; tgttggatgg tggctcacaa attgtatcaa tggatgcgga ggttgctaag aaattagcag 60 tcccttggga tcctgatata accattcaaa tgcaaagtgc gaatagaact gtggaaaaga 120 cccttggtct cgccaaaaat gttccattta acttcggagg aattatgatt tacctgcaag 180 tgcatgtaat acgggaccca gcctataaag tactattggg caggcccttt gacgtgttaa 240 cagggagcgt ggtcgtaaac tctacggacg gaggacagac ggtaaccata acggatccaa 300 actctggcag aagagcaatg ttacccactt ttgacagagg aaacca 346 // ID Mariner3_AO repbase; DNA; FNG; 1908 BP. XX AC . XX DT 24-JAN-2006 (Rel. 11.01, Created) DT 02-FEB-2006 (Rel. 11.01, Last updated, Version 1) XX DE A family of Mariner/Pogo DNA transposons - a consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner3_AO. XX OS Aspergillus oryzae OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; OC Eurotiomycetes; Eurotiomycetidae; Eurotiales; Trichocomaceae; OC mitosporic Trichocomaceae; Aspergillus. XX RN [1] RP 1-1908 RA Galagan J.E., Calvo S.E., Cuomo C., Ma L.J., Wortman J.R., RA Batzoglou S., Lee S.I., Spevak C.C., Basturkmen M. et al.; RT "Sequencing of Aspergillus nidulans and comparative analysis with RT A. fumigatus and A. oryzae."; RL Nature 438(7071), 1105-1115 (2005). XX RN [2] RP 1-1908 RA Kapitonov V.V. and Jurka J.; RT "Mariner3_AO, a family of Mariner DNA transposons in the RT Aspergillus oryzae genome."; RL Repbase Reports 6(1), 32-32 (2006). 
XX DR [2] (Consensus) XX CC This is a family of Mariner/Pogo DNA transposons. It encodes a CC 560-aa Mariner3_AOp TPase; TA TSDs, 110-bp TIRs. XX FH Key Location/Qualifiers FT CDS 121..1800 FT /product="Mariner3_AOp" FT /translation="MPDSYSDIEIRIVQACTIAQGQKKPNISALARQFNVP FT YQRLRARIHGRANLSSRPISTKTLDDSQEKALIRWIRQLDNLYSPPTAGMI FT EQSANQILQRNITDGQSRTVDKNWVYRFIKRLPEEFKLIQQKPKDKKRLDA FT EDIGVLQHWYDCLEAFIKNIPPKNIYNFDETGFQLRQGKTQKVVTTMPLRA FT ARGNPSKEVGELISAIECIAADGFTLPPYFIFKGTYHLERWYDADIPEEYR FT ISLSPKGYTTDKISFDWIQHFHRHTKHRISTKKEVRLLFFDGHESHLTYEF FT LQFCGLHYIIPYCFPPHTTHLVQPLDGQPFQVYKHFYRKRNNELAQRGAEM FT DDKSDFLKEIHSIRTMTFKQRTIRDAFEKRGLYPLDSEKVMKSLREALETA FT PELEIITTPSPPPSSSSPPSTIRGLRRSISKAQSFINNSPELDQSFVRRLD FT RVFQSSLETTELAAQLKDDLQQHLRYRKPQDRRKSQKRVKYHGPLTVYDAK FT RRIADRTEVERLQGLRQIRKTGALEYDKPPQPTNTGDLPSTEAGQVDREGP FT RLPYWIDTQGDVV" XX SQ Sequence 1908 BP; 586 A; 439 C; 396 G; 487 T; 0 other; gccgtctctt aggtcagcag acccaccacg cacttcggcg tgttggtctg cctcacccaa 60 acgataattc ccacctaaac gataaatttg aagctattca ggctcctata taccaccacc 120 atgcctgatt cttattctga tatcgagata cggatcgtac aggcctgtac aatcgcccaa 180 ggccaaaaaa agcccaatat ctctgcgcta gcgcgtcaat ttaatgttcc ttaccaacga 240 ctccgggcgc gtattcatgg ccgagcaaat ctttcctcac gtcctatctc aaccaagaca 300 cttgatgatt cacaagaaaa ggcactaatt cgttggattc gtcagcttga taatctatat 360 agtccaccta cggccggaat gattgagcag agtgccaacc aaatactgca gcggaatata 420 accgacgggc aatctcgtac agttgacaag aactgggttt atcgctttat taaacgtctt 480 cctgaggagt ttaagctcat tcagcagaag ccgaaagata agaagcgtct tgatgcagaa 540 gatataggtg ttctccaaca ctggtacgac tgcttagagg cctttatcaa gaatatacct 600 cccaaaaata tatacaattt cgatgaaact ggctttcaac ttagacaggg aaagactcaa 660 aaagttgtaa ctactatgcc tctacgagct gcaagaggaa atccctctaa agaagtaggg 720 gaactgatct ctgcaataga gtgtattgca gcggacggtt ttactcttcc tccatacttt 780 atattcaaag gaacatatca tcttgagcgg tggtacgatg cagatattcc agaggaatat 840 cgaatatctc tatctccaaa aggctatact acagataaga ttagttttga ctggattcag 900 cacttccacc gtcatacaaa acaccgtatc tctacaaaaa 
aagaggttcg attactattt 960 ttcgatggcc acgagtccca tcttacctat gaatttcttc aattctgtgg actacactat 1020 attataccat attgctttcc tcctcataca acacatcttg tacagcctct tgatggacaa 1080 ccttttcaag tctataagca cttttatcgc aagagaaaca atgagctcgc tcaaagaggt 1140 gctgagatgg acgataagag tgatttttta aaagagatcc attctattcg tacgatgacc 1200 ttcaaacaac gtacaatccg ggatgccttt gaaaagcgag gtctatatcc cttagattca 1260 gaaaaggtga tgaaatctct acgtgaggct ttggaaactg ctccagaact tgagataatt 1320 acaacaccat caccaccacc atcaagctca tcgccaccat ctacaatacg aggtcttcgt 1380 cgaagtatta gtaaagcgca gagctttatt aataatagcc cggagcttga tcaaagcttt 1440 gtacgacgcc ttgaccgtgt attccaaagc tcgcttgaaa ctactgaatt agccgctcag 1500 ctcaaggatg atctacagca gcatcttcga taccggaagc ctcaggatag acggaaatca 1560 cagaaaaggg ttaaatacca cgggccacta actgtatatg atgcaaaacg ccgaattgcg 1620 gatcgtactg aggtggaaag actacagggt cttaggcaaa ttagaaagac gggagctttg 1680 gaatacgata aaccaccaca accgacaaat acaggggatc tgcctagtac agaagctggc 1740 caggtggata gggaaggccc tagattacct tattggatag atactcaagg cgatgttgta 1800 taggagcttg aatagcttca aatttatcgt ttagatggga attatcgttt gggtgaggca 1860 gaccaacacg ccgaagtgcg tggtgggtct gctgacctaa gagacggc 1908 //