ID SPIN_Og repbase; DNA; PRI; 2836 BP. XX AC . XX DT 23-OCT-2008 (Rel. 13.11, Created) DT 23-OCT-2008 (Rel. 13.11, Last updated, Version 1) XX DE SPIN_Og, an autonomous member of the SPIN family of hAT DNA DE transposons. XX KW hAT; DNA transposon; Transposable Element; autonomous; SPIN; KW SPIN_Og. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-2836 RA Pace J.K., Gilbert C., Clark M.S. and Feschotte C.; RT "Repeated horizontal transfer of a DNA transposon in mammals and RT other tetrapods."; RL Proc Natl Acad Sci U S A 105(44), 17023-17028 (2008). XX DR [1] (Consensus) XX CC SPIN_Og is a member of the hAT superfamily. The TIRs are 16-bp CC long and are flanked by 8-bp TSD. XX FH Key Location/Qualifiers FT CDS 665..2473 FT /product="SPIN_Og_1p" FT /translation="MTMDRVEKNVKKRKYSEDFLQYGFTSIITAGIEKPQC FT VICCEVLSAESMKPNKLKRHFDSKHPSFASKDTNYFRSKADGLKKARLDTG FT GKYHKQNIAAIEASYLVALRIARAMKPHTIAEDLLLPVAKDIVRVMIGDEF FT VTKLSAISLSNDTVRRRIDDMSADILDQVIQEIKSAPLPIFSIQLDESTDI FT ANCSQLLVYMRYINDGDFKDEFLFCKPLEMTTTAHDVFDTVGSFLKEHKIS FT WEKVCGVCTDGAPAMLGCRSGFQRLVLNESPKVIGTHCMIHRQILAMKTLP FT QELQEVMKSIISSVNFVKASTLNSRLFSQLCNELDAPNNALLFHTEVRWLS FT RGKVLKRVFELRDELKTFFNQKARPQFEALFSDKSELQKIAYLVDIFAILN FT ELNLSLQGPNATCLNLSEKIRSFQMKLQLWQKKLDENKIYMLPTLSAFFEE FT HDIEPDKRITMIISVKEHLHMLADEISSYFPNLPDTPFALARSPFTVKVED FT VPETAQEEFIELINSDAARTDFSTMPVTKFWIKCLQSYPVLSETVLRLLLP FT FPTTYLCETGFSSLLVIKSKYRSRLVVEDDLRCALAKTAPRISDLVRKMQS FT QPSH*" XX SQ Sequence 2836 BP; 904 A; 543 C; 557 G; 830 T; 2 other; cagcggttct caacctgtgg gtcgtgaccc ctttgggggt caaacgaccc tttcacaggg 60 gtcacctaag accattagaa aacacatatt tccgatggtc ttaggaacca agacaccgct 120 cctctatccg tctccaggca ggtctgccca catgcagata cacccacata ygagtacccg 180 gcgtgatgac atcatcacac caaccccatc acatacaccc cgtacaaata caggtgtatg 240 tgacagggtt ggtgccataa tgtacttatg cggaccagtc acacatgtgt agagagcagc 
300 tactgtgttg aaagcagcta ctgtgttgaa agcaccagta ttggaggtaa aatgacactt 360 catgaattat aattactggg taaatgtaaa atcatgtact gtaaaatcat caactactgc 420 aaaaaaaaaa atatatctgt accatgggaa cttaatctgg atgctgattg gtctttttat 480 attcagctgt ggttgatgtg aatactgccc ccttgtgata gtaacaggta tgtaaaaaaa 540 cacaacacag agatggtaaa tcataggaaa ctttaatgaa ctgtattgac tgaactatgc 600 catgtatcat cttttgtatt attaaagcta ttgttatata ttattttcat tagcaaacca 660 tcccatgaca atggatcgtg tagagaagaa tgttaagaaa agaaaatata gtgaggattt 720 tttacagtat ggttttacct caataattac agcaggaatt gagaaaccgc aatgtgttat 780 ttgttgtgaa gttctatcag ccgaatctat gaagccgaac aaactaaaac gccattttga 840 tagcaagcat ccgagctttg ctagcaagga taccaactat tttagaagca aagctgatgg 900 actcaagaaa gccagacttg acactggtgg caagtaccac aaacaaaaca tagcagccat 960 tgaagcttca tatttggtgg cactcagaat cgccagagct atgaaacctc acaccattgc 1020 tgaggattta ctgttgccag tggccaaaga cattgttcga gttatgatcg gagacgaatt 1080 tgttacaaaa ttgagtgcaa tttccttatc taacgacact gtccgcagaa gaatagatga 1140 catgtctgct gatattcttg atcaggtaat ccaggaaatt aaatctgctc cacttccaat 1200 atttagtatc cagcttgatg aatctacaga cattgcaaac tgttcacagt tactggttta 1260 catgaggtat attaatgatg gcgactttaa agatgagttt cttttttgca aacctcttga 1320 aatgacaact actgcacatg atgtatttga cacagttggt tcatttctga aagagcataa 1380 gatctcttgg gaaaaggttt gtggtgtttg cacagatggt gctccagcta tgctaggatg 1440 tcgatctgga tttcaacgtt tggtactgaa tgagtcacca aaagtcatcg gaactcactg 1500 tatgattcat cggcaaatat tagcaatgaa gacgctgcca caagagttac aagaagtaat 1560 gaaaagcatc ataagttctg tcaattttgt aaaggcgagc actttaaaca gtcgactgtt 1620 ttcgcaactg tgcaacgagt tggatgcgcc gaacaatgct ctgctatttc acactgaagt 1680 gagatggttg tcgagaggaa aagttttaaa acgtgttttt gagcttcgtg atgaactcaa 1740 aacgtttttt aatcagaaag caagaccgca gttcgaagca cttttcagcg ataaaagtga 1800 actgcagaaa atagcttact tggttgacat ctttgccatc ttgaatgagt taaatttatc 1860 actgcaagga ccaaatgcaa catgcctcaa tttgtctgaa aagatccgat cattccaaat 1920 gaaacttcag ctttggcaaa aaaaattgga tgaaaataaa atttacatgt tgcctacctt 1980 atctgctttc 
tttgaggaac atgacattga accagacaaa aggattacga tgataatttc 2040 tgtgaaagaa cacttgcaca tgcttgcaga ygaaatttca tcgtactttc caaatctacc 2100 tgacacccca tttgcacttg ccagaagccc attcacagtc aaagttgaag atgttcctga 2160 gacagcacaa gaggagttca ttgaacttat taacagcgat gcagcgagaa ctgatttctc 2220 tacaatgcca gttacaaaat tctggatcaa gtgtttgcag tcatatcctg ttctgtctga 2280 gactgtgttg cgccttcttc ttccatttcc aacaacatat ctttgtgaaa cagggttttc 2340 cagcttgttg gttatcaagt ctaaatacag aagtagactt gttgtggaag atgatcttcg 2400 ttgtgctctt gcaaagactg ccccgagaat ttctgatctg gtgagaaaga tgcaatctca 2460 accttcgcac tgacgttggc tttttacgca tactgtcaca aaatgtagca atgtagttta 2520 ctgttggtta tattaagact gttacccatg ctacaccatg cttcaagaca aaatttcatt 2580 tatttgtaat tagaaataaa tatttcacca atatataatt acatattgtt tttgtgatta 2640 atcactatgc tttaattatg ttcaatttgt aacaatgaaa atacatcctg catatcagat 2700 atttacatta cgattcataa cagtagcaaa attacagtta tgaagtagca acgaaaataa 2760 ttttatggtt gggggtcacc acaacatgag gaactgtatt aaagggtcac ggcattagga 2820 aggttgagaa ccactg 2836 // ID MacERV2_LTR1 repbase; DNA; PRI; 565 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MacERV2_LTR1. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-565 RA Smit A.F.; RT "MacERV2_LTR1 - ERV1 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC chr2.nib:121220437 1% Probably better: ERV2; <80% similar to CC ERV1. 
XX SQ Sequence 565 BP; 143 A; 134 C; 119 G; 169 T; 0 other; tgaaaggaca caaagataga tgtgtacccg ccccccccgc tacacccttt gttctgctct 60 ctaatctttg cacataccag agatttcgta agttctgtga gtacctgttt ttctgcacgt 120 accagagatt ttgttttgca cataccagag atttcgtaag ttctgtgagt atctgttttt 180 ctgcacgtac cagagatttt gtaagttctg aggcaaggtc acaagacgtg tttaagtaag 240 ataaactctt gctgccataa acctgctctc ccgcctcaaa ggttgaaccg aaatatcaga 300 aatggcggga accaatcata gttagccaaa tcgccttgtt caaacactag ccaatcatat 360 atctgatttg tataataact ctatgcccac ttttcttaga ctatataaca ctgctcggag 420 ctcagtgggg gagctctcct gcccgtctcg tttcgcgagc gagtgagagt tccaggttcg 480 aacctgtaat aaagatcctt gctgcttagc tttgactctg gactctggtg gtcttcttcg 540 gggaataaac ggtctgggca taaca 565 // ID LTR35A repbase; DNA; PRI; 547 BP. XX AC . XX DT 04-AUG-2008 (Rel. 14.02, Created) DT 04-AUG-2008 (Rel. 14.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; LTR35A. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-547 RA Smit A.F.; RT "LTR35A - ERV1 Endogenous Retrovirus from primates."; RL Repbase Reports 9(2), 570-570 (2009). XX DR [1] (Consensus) XX CC Somewhat younger subfamily of LTR35. 
XX SQ Sequence 547 BP; 135 A; 169 C; 110 G; 133 T; 0 other; tgagacagag tagggacggg gcttggcttc agctcacccc cactagagca ttctttcatg 60 cattcccact gatcacaaaa cccacaccac tacctcactg acaccataat gtttaaccat 120 gccttttact taaagaattc caggaactgg ccttaggaga taaccaaggt tgcggagtgt 180 cccacctcgg gaaggaatgc tgaacaattg atttacagcc ttgttgccgc cggccagacc 240 accaggtggc ccattactca agataaccat cgcaaccaga taatgctgac ctgcataccc 300 tacccctcac gtgctttgcc cagcccagcc tgcataccct acccctgatg tcaattcccg 360 cgctttgcct aataaaaaag ccctaccggc tcttttcggg gagtcagtca gggaattctc 420 tctctctctt gtgctgcctc ccttatgccc gggcataagc tccaatgaag ccttgtctgg 480 gaaaactctt tcggcctcat gtcaatttct attgcattga gagcccaaga acccatggtc 540 ggtaaca 547 // ID LTR25-int repbase; DNA; PRI; 7188 BP. XX AC . XX DT 05-MAR-2004 (Rel. 13.06, Created) DT 26-JUN-2008 (Rel. 13.06, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW LTR25-int. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-7188 RA Smit A.F.; RT "LTR25-int - ERV1 Endogenous Retrovirus from primates."; RL Direct Submission to Repbase Update (26-JUN-2008). XX DR [1] (Consensus) XX CC mer4 group. 
XX SQ Sequence 7188 BP; 2154 A; 1513 C; 1386 G; 2053 T; 82 other; tattttggtg cattggccgg gaaagaaagg aattcatcag aagggtgagt aagagtggac 60 ntttaactct ttactttcat ttctgaggct tgtcctcagt tttttttttt ttcttctcaa 120 gaagcaagcg aaacactggg cccctgtcag ccagttaaaa gaccagtagn gtggctncna 180 gccctaaaag actnagggga caggnntgct ggagaggant ttgccaatcc cccattgccg 240 taagntgttg ggaatgttgg ctctgttcca atccagtttc ctttcacgga gggcctagcc 300 atcgtgtggg actgaaagga ggtcctaggg caactgaaga tttctggctg aggctacacc 360 tcagtgttac ctgaaggccc ttggactaac tccagtcccc gacagcccat gcagggtgtc 420 ggcacaagga cttccagtct tttctattgc attttctttc tttcttttca cggctatcat 480 gtctcctatc ccttctttgt atgcaatgtt gtgggtgttt ttgcaaccta gggatataat 540 cttgctgggt aaagtcagtc agtgtcttag taatcaggaa tgtaactcaa agaattgttg 600 tttttgtgat ttcctagaaa cagggggaat tcaagatttc agtctaaatt ttcacctagt 660 aagggccttt ctgtccccca ataatagaca ttcatggcac tgnatgggag gatatttcac 720 cctgagtgaa taccctcctt tgcatttgag ttgttttttt tcctccatgt gaaagctcag 780 cactgtccaa tgaatctaaa cagttccttt atgagacaag ttaattttct tttgctgggg 840 ggcatgctat ggggacagcc tatcaaaccc caaacntctc tttctaactt ttgcctgaaa 900 agaatttagt cagagttttt acctaacatt tctaacctta cagcaccacc tagtggaatg 960 ggatttttct ccatggggag ccttgtcagc cctctgcccc aaacctctag tttcccaatt 1020 cttttccctt ttacatccct ctatcagtga tcaggcccca tgccctatct gtagacagga 1080 aaactccact ttcaacagcc gggaggaagc catcctgaca agacagatct tgcttcaata 1140 ctatctccat caaaggaagg acagccattc aatttttacg ctctttttga ggcacctgtt 1200 ctgcatccaa ctacattggn atttaaacaa aaaggggatt ttatgtttga aagtnaatcn 1260 gtcccattct ctgggattcc gatntttccc tggggccata gcaagggaag ccagagatgg 1320 tgttaggaca ctccctccat taaagtnttt gcccaaatcc aactactaca taatctctcc 1380 caggcccctg gggtaccttg ggagcctttt gggccgagtg ggtctaggaa accagcaggg 1440 tggaaagcta gggtcttgcg caggtgagca catgactagt cctgccnact agctcctccg 1500 gatccatggg tgaaggtcat gcttgcatcc atgggtggca cctatgacgg tcgccgggac 1560 ccagaggaca aggaagtgaa gaggagaagg gggatgccct ttctctcttt ccctccaccc 1620 tgggtcacnc agaagggaag aaggagactg 
aggaacgcct ttgtctctcc tctttttcta 1680 gatgggtaac aaaccatctn cagtctgcac tcccctcnag tgcattctga aacactggaa 1740 ctcctttgac cctgagactc tgaagagaaa atggcttata ttctattgca caagggcgtg 1800 gccatcttac caggggagac agaaggcctg gccttctgag ggaagtnttg atttcnacac 1860 tatccaacaa ctagatcttt tctggtggga gggaaatggc agttccctat gtacaagctt 1920 tctttgccct gtgagacaac ccagatcttt gtaagcattg taaagttaac cctgccctct 1980 tggcagccat gtcaggcaag cctacaaagg ataattcccc aaagtcagag agacaaaccc 2040 ttggggaacc ctcaaatgca acttccgggt gccctacctg ccccncttat ttngggcccc 2100 caatanccat atcatcagct cctccggttg tgccactnaa gaaaccccna cattcgctgt 2160 tgcccctgca gaaaatgccc aatggacatg gtgctactag ggttcaagtt cccttctcat 2220 tgcaggacct tagacaaata aagggggacc taggcaagtt ctctaatgac cctgatagat 2280 atatagaggc tttccaaaat ttaacccaag tgtttaatct tacgtggaga gatgctatgc 2340 tacttttaag ccaaacccta actgttacca agaaacaggc agccttacag gcagcagaaa 2400 cattcagaga caaacagtat ctcctatagc cagtcnaaaa gaaacccagt caaagttaaa 2460 gaggtgaaaa agagacagaa tccccattcc caataggaag agaaacagtg ccccttaaaa 2520 atcctaattg gagccccagt gatcccatag atgagtggaa aagaaaacac tttctgatgt 2580 gcatactaga aggcttgcaa agaaccanaa ccaaacntct taattactct aagctntccn 2640 tgttaaatca gaaaccagat gaaaatccct cagcctttnt ggaaaggctg agagaagctt 2700 tagtaaaaca cacctccctg tntcccgatt caataaagaa caggtttatt actcaggcag 2760 cccctaatat cagaaggaag ttgcngaaac aggccctgtc caaanatctt tctagntttt 2820 ctcatcctca agttgaaact ttgcagtatg taaataacac tctcctctgt gccccaactg 2880 aggaggtctc aggaaggcac tgaggctctc ctcaatttct tagctgaaag ggaatatagg 2940 gtctcaaaat ctaaagctca gctctgtcaa acttcagtaa agtacctagg tctagtctta 3000 tcagaaggga cnagaacacc gggtgaggaa agaattaagc ccattttctc ttttcccttt 3060 cccaaaactc ttaacagtta aggggattct tgggcattac nggattttgc agactgtggg 3120 tacctgggta cggtgaaata gctnaccctt tataccacct cataaaagaa actcaagcag 3180 ctaaaactca ctccctaact tgggaacctg aggctcaaaa gcctttaacc agctaaagca 3240 agccttactt aaagcaccng ccctcagtct tcccataggg aaggcattta atctttatgt 3300 ntcagaaagg aagggaatgg ccctgggagt tttaactaag 
gctcaaggtc cagctcaaca 3360 gccagtgggt tacctaagca aggaacttaa cttggtggct aaaggatggc cagcctgcct 3420 ccgagcagtt ncagtggtgg ctttgctggt gccagaggcc actaagttaa ccatggggaa 3480 taacttaact gtttacatcc cacacaatgt agcaggactg ctgtcctcta aaggaagtct 3540 ctggctaaca atcacctcct caaatatcaa gctttgctgc tagagggatc tgcagtccag 3600 ttaaaaacct gcccttgcct gaacccagcc actttctccc agaggaaact ggagaacctg 3660 aacatgattg tgaacaggta gtggtgcaaa ctggtaaaag aaataagaag aatcactgtt 3720 tatattctct gtaaagtttt aattaataaa taaagatttt cttaaagngc actcagctta 3780 attaaaagtg gatatccaag ctataggtat attcaaaagg cctttatgtt tttctcttca 3840 taaatcttgt tttcctggaa gaggnttttt tctcanttga ctgaattact tttntccact 3900 ctgtcttgcc actgttggtg catgcatgga aggccctaaa ataacttctg gtggcctggg 3960 actcctcggg aaaacagaaa aggcaccaca gatcccattt tggaaaaaat ctctgttttc 4020 ctcatggaac ccctagaatt agaggtggat aagtccctct caaaatctgt ttttgtcttc 4080 cagctatgct tgtttattag gccccggaaa ctatattcct agccctgttc ttaaaaggcc 4140 tcaaccagag gccaataatc caattaggaa actggcaaac aaaaaatcta tagctactgg 4200 atcttcttct gtttgtctgg tggttatata tgtgntgtgt gtgatgtcta ttaaaaaanc 4260 tctaattaat tggcntanaa ataagcactt aaataaaata tttttaagaa aaaantaaag 4320 gctgtagtgc ctctcggttc acgtaacttt aatntttaag aaataaaaac gtcttggaga 4380 ttnttggtaa aatacaaacg tcttcaagat gtaaanaggt ggtctaaatt acgcaggtca 4440 gatactaggt ttgctaaatg ttttaaggtt gtaaactgct tctttggcct ttaagaactg 4500 tcaacttgcc tgcttcacaa tnggtaaggc ctggggacat atggaagtaa ccacgcccct 4560 aactatactg gaagaagtca aactttatct gcacctagca cataattaaa acaacttacc 4620 aggttttaca ttaaagttaa aattactaaa agttaccatt ataacatgta attgagacta 4680 ctgaaaatgg atttgcatgc aaggtgtgta aaaacagtaa aatgttttta gtaaaagatt 4740 ataagaaggc atagaaatnt acattttgcc taggagtaaa agattgtctt aaattaaata 4800 aagtaaaagn tttaagcaaa ttgtggaaag actgtaaaaa ttaatcttgc aaangaaact 4860 ctgtntgtna anatattaac taaattcaaa aggatattat atggttttcc tttaaattaa 4920 gcattnaaat aaaagcacaa caaggctttc ttaagatgct aatctgctct ttagcaaaat 4980 ttntaaaggg ttataaaagg tttgtgaaaa tctnacctca tggtcaaact 
ggttaagatt 5040 aaatagaatt gtctataaga tttcattaaa aattgggatt aacattaata gtaaactaat 5100 gcaagggtga aatttggctt tctctcttga acagaatttt tatgtaatan taaaggctaa 5160 tgaaaggttt ttgctttttc aaatttttga gtcatcattt tggcaaaaca aataacttat 5220 ggtaatctaa aattctattt cataatatca agtgttttaa aactctaaca tatttaacag 5280 acttcccaaa atnaaacttc agtttcaagg ttgtctttcc tgacccctgg cttttgggtg 5340 ctacagaggc ccctagaaca tccaaaagaa aggcaaacag gattatttaa catgtttaga 5400 tacatgggat tgccaaaatg atgtctaatt tcttcaggtt atatttcagt aaataatatt 5460 aacatatgtt ccaaaactgt atggaatgtc taaggttcta atgtntgaat atgtgctatc 5520 aattacaatt aagnttatta tgttgggtta ttgtaaacca cagaaataac caaatttctt 5580 tgtcaatcgt gtttctgact gtaaccatcc tggacatttt gtcattnaca gacaattgtc 5640 ttgttttaat cctcttcaaa aaatggttta taatcagctg tgggacttta acaggtgctc 5700 tcaaatgcag gtttctgata acagaaaaac gtacagaact cataaaaagc taaaatgttt 5760 acgaatatca agcagaacaa gagttaacga aatagactaa actaatagaa aactaaagca 5820 atgtttttaa cttttgcttg gaacattgct gatccttatt ttgttttttt caggtnagga 5880 aacttttgag ctagctanag cttttaacaa ctgagcaagg tatactcctg taaacaaaat 5940 ttggaacatg tttgtttctc tctgcctggt tcttctaaaa ttcagaaact agttgtgagt 6000 attcttaact tacaacaata tagttgtttg catcagtgca acaaaatcca ttttcttttg 6060 caacgagaca caatngaaaa atgctggttg ttttaccaag gctttgactg gaagggtgtg 6120 tttcccttta aggaatcaag cttgacttgc aaagccaata aaagcccctt gggaaaactg 6180 gcctcatacc ttgtctacac agtccccgta cagggttcct ggcctgtggt gagtaaagaa 6240 tgtcactttc taacaggctt agaaacctat gctcttggga cctcaagaag aaaggagttt 6300 acccaactca caggtatttg agggtacaaa tccatggctn ggcccagctt taaaaagtcc 6360 tatctaagat tccttntgga acagagttcc atcaaagcca atcaaaaagg cctatgtaaa 6420 gataattatt cttgctgcac tttatgcaaa taatcaggcc aagtataana ctaaagtcta 6480 ttttgcaaac aattcagtct atcgtgantt gtttttaaca aaaatgagga ctaaagagaa 6540 agaaattatg tttcaaanct tatcatacat ttgtcattaa cttctagtct cattagttgt 6600 ttttaagttt ttgnctacat tttaaactaa ccctgcttat tcctgtaagc caaccagcaa 6660 tctccggctg cagctcagaa aaacagaaag ggatgggtaa tgtaaaaatc tagatcaata 
6720 ttctagttct gggcaattat tctgcaaatc ctgccgggta atggaaataa atagggtgcc 6780 cataacccag aggtttcctt tntcagaaaa gtaagaccaa ggaagctaac caaagccaag 6840 ccccatgcac ccaaatctta gcaggcataa ctatagccac cagttatcag ggcgtgtcag 6900 cagcctcaan atttttaagc ttgtccttac cccccttgtc tcattttaat acatgtcctc 6960 taataaccca aattgtttct tttcacctaa aagctatcaa gctccaaatg gtaatgcaaa 7020 tggaaccatg cataaacacg cctttcttct gaggacactt aaaccagccc cgggaggaat 7080 cctagctgct gttccctaca caacacccct ctccagcagg aagtagccag aaanatcaat 7140 gcccaatctc cctaacagca gttagggtct ccactcctga ggggggac 7188 // ID LTR14C1_Mim repbase; DNA; PRI; 405 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14C1_Mim. XX NM LTR14C1_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-405 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2971-2971 (2009). XX DR [1] (Consensus) XX CC >94% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. 
XX SQ Sequence 405 BP; 97 A; 106 C; 115 G; 87 T; 0 other; tgtaaaacag gaatgagagc aggagtgact ccatgacagg ctgcacttcc tggagtaggc 60 ctaagtttcg gtttccccga aacttcgcct actacaccct tccccgaagt gacgcgcgcc 120 gcttttgaag gagccaatca ggagccgaca cgatcagcca cttttaaagg aaccaatgga 180 ctaagggggg agggggggaa ggtgtaccca gcgtgtataa cttgcagaaa agtataaaag 240 cttgacttat accccagggc ggggtcctgg ttcgtaggga ggccactgcg ttggtgctct 300 gggacctgga ccctagctcg agctagccaa taaactcctt tgtgctttgc agcctcagcg 360 attctgcctc tctgttcctg gggccgaagg aaaccggctc taaca 405 // ID LTR27D1_TS repbase; DNA; PRI; 748 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW Endogenous Retrovirus; Transposable Element; LTR27D1_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-748 RA Bao W. and Jurka J.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 11(5), 1630-1630 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 748 BP; 144 A; 241 C; 180 G; 183 T; 0 other; tgagagagga gtaagcaaga agccagttag gcagttaggg cgggtctccg gtaggactcc 60 attttacctg gagaaacagc tgcaagctgc aggctccgga caaaagccac acccattccc 120 ttttcgctcg ctttctctct ttttcgcgca ccttttccta attggccctt tttcaatcct 180 ctccccatcg gccctttaca ctatcatgcc acctttttag tccttactac tgaccaatta 240 ggttagtgta ttttcagtcc ctactaccaa ccaatcagca tgcccgagta ctatgtaaac 300 ccccggcccg gcctcagcgg tctctttttg gccccagcaa ccccttgggg gcctggcaac 360 ccacttgggg gcctggcccc tctcggggcc ccctctcgct gtgagagctg ctaacacttg 420 cttcctgaat aaacttcgac ttgctcgccc tttcggtgtc ctcgatcctt aatcctctcg 480 gtcgtgagac aagaacctgg gattcatctg ggattcatcc agacaagaga gctgtaacct 540 ttcctgaccg ggaagggaat tctccaagca agagctgaga gctgtaacac tcctcggggc 600 tccgcggctg tcgggcaccc ccgagttcca gggcgccata acgtcctagg tcaccacgtg 660 ggtctcaggg cgccattgtg cccttggact cgtccttgga ccccacgtgg gtcccaggct 720 ggcagtgcag ccctgtgcga ctgcaaca 748 // ID npiggy1_Mm repbase; DNA; PRI; 240 BP. XX AC . XX DT 24-MAR-2010 (Rel. 15.09, Created) DT 24-MAR-2010 (Rel. 15.09, Last updated, Version 1) XX DE npiggy1_Mm is a nonautonomous piggyBac element found in DE Microcebus murinus that shares TIRs with the autonomous DE piggyBac1_Mm element. Age estimates suggest it has been active DE within the past 15 my. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW npiggy1_Mm. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-240 RA Pagan H.J.T., Smith J.D., Hubley R.H. and Ray D.A.; RT "PiggyBac-ing on a Primate Genome: Novel Elements, Recent RT Activity and Horizontal Transfer."; RL Genome Biol. Evol 2, 293-303 (2010). 
XX DR [1] (Consensus) XX SQ Sequence 240 BP; 95 A; 34 C; 44 G; 67 T; 0 other; ccctttgcac tcggatgtcg agtgtgactc gacacggtta gcaaaaatta tagagattaa 60 aattactctt tgaatgtatc aataatttga aatataaaaa aatccaaata aataagtttg 120 tatgaaaaga aactccagtt ttttattcta ctgccacgct ttgtaaaatc tggggtattt 180 aaaaaattaa atcccgagta gaataaagga atcgagaaaa aagcaagcga gtgcaaaggg 240 // ID LTR8_Mim repbase; DNA; PRI; 586 BP. XX AC . XX DT 09-NOV-2009 (Rel. 14.11, Created) DT 09-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR8_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-586 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2962-2962 (2009). XX DR [1] (Consensus) XX CC >89% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 586 BP; 154 A; 165 C; 111 G; 156 T; 0 other; tgaaaccgcc ctttataaaa ttaataaagg gccacaaggt gggaactgtg gtaggggaaa 60 tatatatata accagccatt gttccctagc ttgctttcct ataattgctt actactcagg 120 agtcacatag ctgatggtca taaagattct tagctttctt cattgctccc atagataaca 180 tcacccttgt gaaacctaag gctagttttt gagatatctt ccaggccctg cattccggtg 240 gaacggctga cccacccaga ccagtggccc ataccaagga actgactcaa ctggtcttgt 300 gacccccacc caagaactga ctcagcaaag aagacaattt ctacacccct atggttccac 360 cccgaaccca gccaattagc agacccaatt gcctagccct ttgcccgcca aactatcttt 420 aaaaaccctt tctcccaaat tctcggggag atggatttga gaacttcctc ccatctcctc 480 gcatggtgcc tgcgatatta aactctttct ctgctgtaac ccctgctgtc tcagtgtatt 540 ggcttcctct tgagcagcgg gcaatgaacc tggtaaggct gtaaca 586 // ID LTR8D_Mim repbase; DNA; PRI; 432 BP. XX AC . XX DT 11-NOV-2009 (Rel. 
14.11, Created) DT 11-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR8D_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-432 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2961-2961 (2009). XX DR [1] (Consensus) XX CC ~88% identical to consensus. 4bp tsd. CC Similarity to LOR1a_LTR from primates. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 432 BP; 107 A; 130 C; 80 G; 115 T; 0 other; tgtaactggc cccatgaaca aatttcaatg acccctatat aatcctctcc cttaactaac 60 tacttgacct tttgagttgt ttcccagaaa tggtggattt tctgcaagac aaaggagctg 120 agattctgaa ggcccctggg cagacttcct gatgccaaga ttcaccatcc cccacaacct 180 gcccccaatg cacgtagccc cttgttagtt acaccatgac ctgcccaagg cacgtggccc 240 ctcctttaaa agcccatgat ctcccttagt cggagagacg gatttgaggc agcttctccc 300 attctcccga caagacgtct gtcatattaa aaattccttt cttccttggc aatcctcgtt 360 gtctcaataa ttggatcttt gtgcgacgag caactggacc tagaccgaaa cccccttgcg 420 gtttcgacaa ca 432 // ID LTR1F2 repbase; DNA; PRI; 739 BP. XX AC . XX DT 01-MAR-2009 (Rel. 14.06, Created) DT 18-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; LTR1F2. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-739 RA Smit A.F.; RT "LTR1F2 - ERV1 Endogenous Retrovirus from primates."; RL Repbase Reports 9(6), 1177-1177 (2009). XX DR [1] (Consensus) XX CC 10.5% subs, 35 copies. 
XX SQ Sequence 739 BP; 167 A; 249 C; 206 G; 115 T; 2 other; tgatacggac aggaggcagg gaaatactgg gtagaagagg gcggggtccc cggcgagggc 60 cccaccctca agcctggacc cgcggcccta aatgagaaca nncatccctg ttttcccgcc 120 cgaatgttgc cttttccaaa accaccctgg cccgccacgc cccccatcct gtacccataa 180 aaaccccaaa ctccactggc agaggagcag agcggcgcgg cagagaagga gagaagagaa 240 gaagcgtctg aacgtcgaga ggagttcggc tggggacggt cggagaggag ttcggccggg 300 gacggccgaa ctccagggga agattatctt cccactccat cccctttcca gctccccatc 360 ccgctgagag ccacctccat cactcaataa aacctccgca ttcaccatcc ttcaagtccg 420 tgtgacctga ttcttcctgg acgccggaca aggacccggg taccaagagg gcagggtgta 480 aaaggctgtc accctgactc tccactgagc tggttaacac ttagccgtcc gcggacggca 540 actgctaaaa gagcattaat tgtaacacac ccctagacgc tgccgtgggg ccggagccca 600 aaagcgctcg ccccggcccc ggcacccgct cgcctgcgtg ctccccctcc cgcaaggggt 660 ttgagcgcgg cggccgagta agcgagccac acccctgtcg caagtcccgc gaaggggtca 720 agggaactct cccgtctca 739 // ID CERV1_LTR repbase; DNA; PRI; 409 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 29-MAY-2011 (Rel. 16.05, Last updated, Version 2) XX DE Long terminal repeat of chimpanzee endogenous retrovirus CERV1 - DE a consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW Long terminal repeat; Chimpanzee endogenous retrovirus; CERV1; KW CERV1_LTR; PTERV1a_LTR. XX NM CERV1_LTR. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-409 RA Skaletsky H., Hughes F.J. and Page C.D.; RT "Consensus sequence of an endogenous retrovirus CERV1."; RL Repbase Reports 4(7), 190-190 (2004). XX DR [1] (Consensus) XX CC >99% identical to consensus. It is named PTERV1a_LTR in CC RepeatMasker libraries. 
XX SQ Sequence 409 BP; 122 A; 112 C; 88 G; 87 T; 0 other; tgaaaagagc cgggcacatt cctcagcccc gggctcaaaa caaacaagcc cagtacaaac 60 acatcccatc ctcccatccc accacatatc accatatatc tcttaaactt cccccgggct 120 caaaacaaac aagcccagta caaacaccac caggaaagtc tccgataagg ggacagatga 180 ggggacagcc gttcaaagtt ttactgaaag agcgggaacc aaaagaattc ctttgttccc 240 ctgtaacttt caggctataa aaaagcaaac actcgcattg ttcagggccc tcttgtatgc 300 ggtggaatgg agggaccagg ttcgaacttg tagtaaagat ccttgccgct tggctttgac 360 tctggactct ggtggtcttc tttggggaac aaacggtctg ggcataaca 409 // ID Tigger3c repbase; DNA; PRI; 602 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE Mariner DNA transposon from primates. XX KW Mariner/Tc1; DNA transposon; Transposable Element; GOLEM; mariner; KW Tigger3c. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-602 RA Smit A.F.; RT "Tigger3c - Mariner DNA transposon from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC (MER7C) 15% div. XX SQ Sequence 602 BP; 166 A; 139 C; 114 G; 183 T; 0 other; cagtcatgcg ccacataacg acgtttcggt caacgacgga ccgcatatac gacggtggtc 60 ccataagatt ataatggagc tgaaaaattc ctatcgccta gtgacgtcgt agccgtcgta 120 acgtcgtagc gcaattactt tatttttaaa taaatttagt gtagcctaag tgtacagtgt 180 ttataaagtc tacagtagtg tacagtaatg tcctaggcct tcacattcac tcaccactca 240 ctcactgact cacccagagc aacttccagt cctgcaagct ccattcatgg taagtgccct 300 atacaggtgt accatttttt atcttttata ccgtattttt actgtacctt ttctatgttt 360 agatatgttt agatacacaa atacttacca ttgtgttaca attgcctaca gtattcagta 420 cagtaacatg ctgtacaggt ttgtagccta ggagcaatag gctataccat atagcctagg 480 tgtgtagtag gctataccat ctaggtttgt gtaagtacac tctatgatgt tcgcacaacg 540 acgaaatcgc ctaacgacgc atttctcaga acgtatcccc gtcgttaagc gacgcatgac 600 tg 602 // ID LTR12 repbase; DNA; PRI; 826 BP. XX AC . XX DT 01-OCT-1995 (Rel. 1.09, Created) DT 26-JUN-2008 (Rel. 
13.06, Last updated, Version 4) XX DE LTR from human ERV9 endogenous retroviral sequence (HRES-1/1). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; LTR12; KW Long terminal repeat; PTR5; PTR7. XX NM LTR12. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-826 RA La Mantia G., Pengue G., Maglione D., Pannuti A., Pascucci A. RA and Lania L.; RT "Identification of new human repetitive sequences: RT characterization of the corresponding cDNAs and their expression RT in embryonal carcinoma cells."; RL Nucleic Acids Res 17(15), 5913-5922 (1989). XX RN [2] RP 1-826 RA Levy S.L., Lobelle-Rich A.P., Elder H.J., Payne S. RA and Montelaro C.R.; RT "An unusual retrovirus-like sequence identified in human DNA."; RL J. Gen. Virol 71, 1613-1618 (1990). XX RN [3] RP 1-826 RA Lania L., Di Cristofano A., Strazzullo M., Majello B. RA and La Mantia G.; RT "Structural and functional organization of the human endogenous RT retroviral ERV9 sequences."; RL Virology 191, 464-468 (1992). XX RN [4] RP 125-826 RA Smit A.F.; RT "LTR12."; RL Direct Submission to Repbase Update (FEB-2000). XX RN [5] RP 1-826 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR [4] (Consensus) XX CC LTR from human class I ERV9 endogenous retrovirus (HRES-1/1). CC Copies on average 8-9% diverged from consensus sequence. CC [5]. 
XX SQ Sequence 826 BP; 232 A; 237 C; 194 G; 162 T; 1 other; tgagaggtga agccagctgg gcttcctggg tcgggtgggg acttggagaa cttttctgtc 60 tagctagagg attgtaaacg caccaatcag cgctctgtgt ctagctaaag gtttgtaaac 120 gcaccaatca gcactctgta aaaacgcacc aatcagcgct ctgtgtctag ctaaaggwtt 180 gtaaacgcac caatcagcac tctgtaaaaa cgcaccaatc agcgctctgt gtctagctaa 240 aggtttgtaa acgcaccaat cagcactctg taaaaacgca ccaatcagca cagcactctg 300 taaaatggac caatcagcgc tctgtaaaat ggaccaatca gcaggacgtg ggcggggcca 360 aataagggaa taaaagctgg ccacccgagc cagcagcggc aacccgctcg ggtccccttc 420 cacgctgtgg aagctttgtt ctttcgctct tcacaataaa tcttgctgct gctcactctt 480 tgggtccgca ctacctttat gagctgtaac actcaccgcg agggtctgcg gcttcactcc 540 tgaagtcagc gagaccacga acccaccggg aggaacaaac aactccggac gcgccacctt 600 taagagctgt aacactcact gcgaaggtct gcggcttcac tcctgaagtc agcgagacca 660 cgaacccacc ggaaggaaga aactccggac acatctgaac atctgaagga acaaactccg 720 gacacaccat ctttaagaac tgtaacactc accgcgaggg tccgcggctt cattcttgaa 780 gtcagcgaga ccaagaaccc accggaagga accaattccg gacaca 826 // ID ERV1-4_TSy-LTR repbase; DNA; PRI; 558 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-4_TSy-LTR; ERV1-4_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-558 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1201-1201 (2010). XX DR [1] (Consensus) XX CC ~92% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 558 BP; 145 A; 136 C; 91 G; 186 T; 0 other; tgaaatagct aatgaaatac ataaaataat tgcaataact tgcttgacat aagtaactcc 60 attttgttga aatcctccat cttagctgcc tgcctaacct aaaaatgacc ccccaatgta 120 tacgtgtgat tactgtaacg ttcccatgac tacagaccca cagctttgtt cccctgagct 180 gttcacgctg accgatcatg ctgaccattt gcaatactcc ctagctacaa gttccgtgta 240 attgtatgct ttgttaactg aaatcagatg ttgtgccaag aaattgttaa ggaattatta 300 cttacaaaat tccccctgcc acgtccctat agtttgttct tatattttgt tccttttgtt 360 tttagttcct tgtaccctca ccccctgcta tgtagccaac ctgcagattt tacctatata 420 agctttttgc tgattatatt ctgcgtcaag agcttcatgg caagagccta ctcttgaccg 480 cgcgctaata aaggactctt aatattaatt tggacttggc gtgctggcgt gtctccttct 540 cgcaatccgg tcataaca 558 // ID ERV1-4D_TSy-LTR repbase; DNA; PRI; 468 BP. XX AC . XX DT 06-APR-2010 (Rel. 15.09, Created) DT 06-APR-2010 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-4D_TSy-LTR. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-468 RA Jurka J.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1253-1253 (2010). XX DR [1] (Consensus) XX CC ~88% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 468 BP; 115 A; 119 C; 75 G; 159 T; 0 other; tgaagtacct gtaaaactta taagcaaccc cacattgtaa cagattctcc atattagctg 60 ttttagctgc tttaattaac ctaaaaatga ccccctacat agctttccag taatcagagg 120 ttgtaatctt cccgtaacta agagcctaca cccttggttt ccccaaatat tggaaactcc 180 ctaaccacaa gttccgcatt tatttgtata cgtttgcaca ttttttcttt ttgattttaa 240 tggtctttat gttcgtgctt tatgccatgt aactcaatgt accttatgtt tgcaccccat 300 gctatgtagc ctattcagca gtttttgcct ttataaaccc tctccctgct tcattcgggg 360 ccgagagttt ttggaggcat gagccccctc tcggtcgccg gcttattaaa ggactcataa 420 tctgactctg cgtgttggca acatttttct cgcactccag acacaaca 468 // ID LTR13B2_OG repbase; DNA; PRI; 382 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR13B2_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-382 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1587-1587 (2011). XX DR [1] (Consensus) XX CC >91% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 382 BP; 91 A; 123 C; 83 G; 85 T; 0 other; tgtgagggcc gagccgacgc cattttaatg acggccaccg ctgccatctt aggcccaatg 60 agtcagcata acgtcccccg cccatcccaa agatagcacc accccatggc gcgaaaccta 120 tgcccaatca cgagctaatg cctaacctgt aatctgccca tgcccctagc aaccaatcaa 180 aattgtaccc gtactttacc cctccccttc ttgttttctg taccctataa aaactgattg 240 cacctccggg gcggggctct tcctcggtgt gctcaccagg agagtccagg cccgggcttg 300 aataaagctt gcctcatgat ctttacatcg gagtgtgtct cggactaact cttgggggat 360 cgggaaaaat ccaggcacaa ca 382 // ID MacERVK1_LTR1d repbase; DNA; PRI; 282 BP. XX AC . 
XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Cercopithecidae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; LTR; KW MacERVK1_LTR1d. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-282 RA Smit A.F.; RT "MacERVK1_LTR1d - ERV2 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 3%. XX SQ Sequence 282 BP; 76 A; 61 C; 72 G; 73 T; 0 other; tgtagaggac tacgtgctcg caaacgaggc gttcccgata agtcctgctc ttgcaaacga 60 agcagggcgt tgggggcttg tttatgtgta aacatcttga aaatccagaa agtcagggaa 120 aggtcagaaa aacaacaatg tgtcttgtga cttggcaaca ttccacaaac gactgtataa 180 aataaagcag agcgcgccat tcgaggcggc cgccatgttt gtcttgtctt gtgttgtctt 240 gtgtgttcat tcctttgttt aggaaacacg cggaccccaa ca 282 // ID LTR71B_TS repbase; DNA; PRI; 447 BP. XX AC . XX DT 14-DEC-2009 (Rel. 15.09, Created) DT 14-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an endogenous retrovirus - consensus. XX KW Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR71B_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-447 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1278-1278 (2010). XX DR [1] (Consensus) XX CC ~91% identical to consensus. Target site duplication varies from CC 4-6bp, therefore classification is uncertain. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 447 BP; 109 A; 103 C; 100 G; 135 T; 0 other; tgtaagaaac aaaatggtga ttcattttca aaatggcgat ttgttacttc acttcctggt 60 tcttcctcct ttcttggcag gctttctctt ttggcaggct tcctttggtg ggcattgaga 120 cccaatcaga atccagagac atgtatttta ccttatttgt gtagaaactg cttgtgtttt 180 accttatttg tacaaaatta ttgcttaggc gacgtccggg aaagtgatgt aacctctgaa 240 gtactcagcc aatggggaac caggggaggg acttgcactt tagggcataa atacactggc 300 tgtgactgcc tcagtgtgcc tgctcacccg agaggcaccc attcttgcaa gaacaagatt 360 aaaagagctg cttcactccc ccaatctctg agtctccgtg tcgtgctttg atgatgaaac 420 tgaggagggt caattccacc tcccaca 447 // ID PTERV1a repbase; DNA; PRI; 7738 BP. XX AC . XX DT 14-MAR-2006 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Pan troglodytes. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; PTERV1a. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-7738 RA Smit A.F.; RT "PTERV1a - ERV1 Endogenous Retrovirus from Pan troglodytes."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC <2% div. ORF1 436-2385, ORF2 2386-6027, ORF3 6006-7691 The other CC three subfamilies show a deletion of the N-terminal, aspartyl CC protease encoding region of the ORF2 product. Closest match to CC MuLV (ORF2 product is 59% identical, 72% similar). 
XX FH Key Location/Qualifiers FT CDS 436..2382 FT /product="PTERV1a_1p" FT /translation="AARLGGQQLSLIWAAPARPRSRQLTLTRASGARLRSE FT LPRFGGQQPYLSCPVGGQQLSLIWAAPARPRSRQLTLTRASGARLRSELPR FT FGGQQLPLTRAAKVRLRLDIINKSDQISFTGIQTHIPLQEETTNYYRMGNT FT QSTPLSLLTSNFKEVRARGHDLGIEIRKGKLITLCRSEWPAFDVGWPPEGT FT FRLAVITRVKSKIFLPGRAGHLDQIPYILIWQDLVENPPPWLSPFQLASEP FT CKALVARPLKSKQPTAPPHPVLPDSGDPLFTEPPPYPSGPQAPAPLAELRE FT GAGGREAAGTHGPAERESNFEGPAGRTRGRTSRTSPPQPPDSTVALPLREI FT GPPDDTGIPRLQYWPFSTSDLYNWKTQSARFSDNPKDLLALLDSVMFTHQP FT TWDDCQQLLRILFTTEERERIQIEARKLVPGDDGQPTANPDLINATFPLTR FT PAWDYNTAEGRGRLHLYRQTLMAGLRAAARKPTNLAKVYSILQGKTESPAT FT YLERLMEAFRQYTPIDPEAPGSQAAVVMSFVNQAAPDIKRKLQKLEDLEGK FT RIQDLLQIAQRVYNNRDTPEEKQFKATEKMTKVLAAVVQKEHLQPEYTQPR FT RPPRHDNLSKDQCAYCKGAGHWVRDCPKKKPRGQGPGPPRSTPVLVTQDED FT " FT CDS 6006..7688 FT /product="PTERV1a_3p" FT /translation="LTANMQLGSLTLTLVALVAAGENIKPAPNPFVWRFWL FT YENQTHPGQPHKPGKLVASADCPSSGCNSPILLNFTDFPVAKPVAPIICFE FT YDQTEYNCKHYWWHQSAGCPYNYCNIHKYQWWGGEEQIDPRWPFHRRRDRD FT LSYTWIVRDPWNSRWTTPQHGAVYYSSASTWPSSHLYLWRGLVQVRPLVHG FT NIQRQENRLTQDLRPFSWLKLLQEGLELANLTGLHSLSGCFLCATLGRPPL FT TAVPLPWGSSTSAQANNHQNLSYAPIPNVPLYLNPSQEKFPYCFSGTNSSL FT CNITATPPNITLRAPSGIFFWCNGTLSKNLSSPSVTNLLCLPVTLVPRLTL FT LTAGEFLGYTGNWTSAVIHPDPRPRPARAIFLPLIAGISLTASFMAAGLAG FT GALGHTLIESNKLYQQFAVAMEESAESLASLQRQLTSLAQVTLQNRRALDL FT LTAEKGGTCMFLKEDCCFYINESGLVEDRVQQLRKLSTEVRTRQFASAADQ FT WWNSSMFSLLAPFLGPLLSLLFLLTVGPCVVNRILRFVKERFNTVQLMVLR FT AQYQPVNAETESDL" FT CDS 2386..6024 FT /product="PTERV1a_2p" FT /translation="GRRGSDPLPEPRVTLQVEGSPVQFLVDTGAQHSVLVK FT TNGKLSSKSSWVQGATGVKKYPWTTQRTVNLGAKNVTHSFLVIPESPCPLL FT GRDLLTKMGAQIHFLPEGPVVTNPHNRXVSILTINLEDEYRLHQEKAAPDQ FT DIATWLQQYPEAWAETGGLGLAKHRPALFIELKPGTDPVRVRQYPMPLEAK FT RGIAPHIRRLLDQGVLRPCHSPWNTPLLPVRKPNSGEYRPVQDLREINKRV FT VDIHPTVPNPYTLLSTLNPKHQWYTVLDLKDAFFSLPLAPQSQKLFAFEWN FT DPDRGISGQLTWTRLPQGFKNSPTLFDEALHEDLGEYRRKHPEIILLQYVD FT DLLIAAETQEACIQGTKGLLQALGNLGYRASAKKAQICKPEVIYLGYLLKG FT GQRWLTDARKQTVLQIPRPQSTRQVREFLGSAGFCXLWIPGFAELAKPLYQ FT 
ATRGQQPFNWTDEAELAFQQIKTALLSAPALGLPDVTKPFHLYVDENKGVA FT KAVITQNLGPWRRPVAYLSKKLDPVAAGWPPCLRMIAATALMVQDADKLVM FT GQELRVVTPHAIEGVLKQPPNRWMSNARLTHYQGLLLNPLRIIFLPPTTLN FT PASLLPNPDLDAPLHDCTEILAQVHGVREDLQDRPLPDADLVWFTDGSSFM FT HQGQRYAGAAVTSETEVIWAEPLPPGTSAQKAELIALTQALTLGAGKKLTV FT YTDSRYAFATAHIHGAIYRERGLLTAEGKEIKNKQEILALLTALWRPEKLA FT IVHCPGHQKLTTPTAQGNFLADQTARNVAKAPSQLLALQLPDPGPRDLPYF FT PEYSEQDLQWIDKLPLKQIQNGWWTDTNDQTILPEKLGQQVLEHIHRTTHL FT GARRMIDLIRRSKLKIRHIAETASSIVTSCKVCQLNNAYPQSQAAAGTRLR FT GTRPGIYWEVDFTEIKPGKYGYRYLLVFVDTFSGWTEAFPTKRETAQVVAK FT KILEDILPRYGFPIQIGSDNGPAFVAKVSQDLASILGANWKLHCAYRPQSS FT GQVERMNRTLKETLTKLTIETGANWVVLLPYALFRARNTPYKLGLTPYEIM FT YGRPPPLVPSLKDDLLKSETENVSEFLFSLQALQKIHQEIWPKLRELYETS FT PPPTPHPYQPGDWVLVKRHRQETLEPRWKGPLQVLLTTPTALKVEGIASWI FT HYTHVKPVDPTSDLLGPITAAAAEAPDTWTVDRAKNNPLKLTLRRQHNSLQ FT TCS" XX SQ Sequence 7738 BP; 2014 A; 2210 C; 1882 G; 1629 T; 3 other; tctgggggcc cgtccgggat tccccaagcc caccagaccc ctggtcaacg gatctgctag 60 gatcgatcta ctgataggtg agctggctcg tctccgtttg tctgtctgtg tctgttctga 120 atccgaatct gtgactcgcg aggtctgaaa ctggagctgg cacagtcctg gcggacgcgc 180 tataggacgg ccagcggaga ccggtgggag acgtcccctg gctctcatct gatctatatt 240 gcgatctgag ctgccccggt ttggcgggca gcagccgtct ctgatctgag ctgccccggt 300 ttggcgggca gcagctgtct ctgatctggg ctgccccagc gcgaccgcga tccaggcagc 360 tgactctgac ccgggcttcc ggtgcgcgct tgcgatctga actgccccgg tttggcgggc 420 agcagccgta tctgagctgc ccggttgggc gggcagcagc tgtctctgat ctgggctgcc 480 ccagcgcgac cgcgatccag gcagctaact ctgacccggg cttccggtgc gcgcttgcga 540 tctgaactgc cccggtttgg cgggcagcag ccgtatctga gctgcccggt tggcgggcag 600 cagctgtctc tgatctgggc tgccccagcg cgaccgcgat ccaggcagct aactctgacc 660 cgggcttccg gtgcgcgctt gcgatctgaa ctgccccggt ttggcgggca gcagctgcct 720 ctgaccaggg ctgccaaagt gcgcctgcga ttagatatca ttaataagtc agatcagatt 780 tcctttacag ggattcaaac ccacattcct ttacaggaag agactacaaa ttattacagg 840 atgggtaata cccagagcac tcctctatct ctccttacga gtaatttcaa agaagttaga 900 gcaaggggcc atgatcttgg tatagaaatc aggaaaggaa agctaattac 
tctgtgtcgc 960 tccgaatggc ctgcctttga tgtggggtgg ccgcccgaag ggaccttccg acttgctgtc 1020 atcactaggg taaagtccaa gattttccta cctgggcgtg cgggccactt agatcaaatc 1080 ccatatatcc tcatatggca ggaccttgtt gagaacccgc ctccttggct gtcccctttc 1140 caattggcct ctgaaccctg taaggcactg gttgctcgac cactaaaatc caagcaacca 1200 actgcccccc cccatcctgt tctacctgac agcggggacc cactgttcac agaaccccct 1260 ccgtacccct ccgggcccca ggccccagcc cccctggctg agctgcggga gggagcaggc 1320 ggacgggagg cggccggcac acacgggccc gctgaaaggg aaagtaactt tgaagggccg 1380 gcggggcgga cgcgagggcg cacttcgcgg actagccccc ctcagccgcc tgactccacg 1440 gtggctttac cccttcggga aataggaccc ccggatgaca caggaatccc caggctccag 1500 tactggccat tctccaccag tgatctgtat aactggaaga ctcagagtgc tcggttttca 1560 gacaacccca aagatttact ggctttactg gatagtgtca tgttcaccca ccagcccact 1620 tgggatgatt gtcagcagct cctccgaatc ttgttcacca cggaagagcg agagagaata 1680 cagatagaag ctagaaagct ggtcccgggg gacgacggtc aaccgactgc caaccccgac 1740 ctcataaacg caacctttcc tctgaccagg ccggcgtggg actacaacac ggcagaaggt 1800 aggggacggc tacaccttta tcgccagact ctaatggcag gtctccgggc agctgctcgc 1860 aagcccacta atttggctaa agtatattct attctgcagg gaaagacaga gagcccagct 1920 acctacttag aaagattaat ggaagctttt agacagtaca cccccataga tccagaggct 1980 ccaggaagtc aggcagctgt tgtaatgtct ttcgtaaatc aggcagcccc agatattaag 2040 agaaaactcc agaaattaga agacttggag ggaaagcgga ttcaggacct ccttcagata 2100 gcccagcggg tttacaataa cagagatact ccagaggaaa agcaatttaa ggccactgaa 2160 aaaatgacca aggtcctggc agcagtggta cagaaagagc atctacagcc agagtacacc 2220 caacctaggc ggcccccccg gcatgataat ctgagcaaag accaatgtgc ctattgtaag 2280 ggggctggcc actgggtaag agactgcccc aaaaagaaac cacgaggaca gggacccgga 2340 ccccctaggt ctacacccgt actagtcact caagacgaag actagggaag acggggttcg 2400 gaccccctcc ccgaacctag ggtaactttg caagtggagg ggtccccagt ccagttcttg 2460 gtcgatacgg gagcacagca ctcggtctta gttaaaacta atgggaaatt atcctccaaa 2520 tcctcgtggg tacaaggggc cacaggagtt aagaaatacc catggacaac acaaagaaca 2580 gtaaacctcg gagccaagaa tgtaacccat tctttcctgg tcatccctga gagcccctgt 
2640 cccctattgg ggagagacct gctaactaaa atgggagcac agatccattt cctccctgag 2700 gggcccgtcg tgaccaaccc ccacaatcga nccgtgtcca tcctgactat aaacctagaa 2760 gatgagtacc ggctccacca ggagaaagcg gcccctgacc aggacatagc aacctggctc 2820 cagcagtatc cagaagcgtg ggcggaaacg gggggcttag gtctagcaaa acaccgtcct 2880 gccttattta ttgaacttaa gcctgggaca gaccccgtgc gggtacgcca atacccgatg 2940 cccctagagg ccaagagagg gattgccccg catatccgcc ggctccttga ccaaggggtc 3000 ctncgcccat gtcactcacc ctggaatact ccattgttgc cggtacgaaa acctaatagt 3060 ggagaataca gacctgtaca agacttaaga gaaatcaaca agagggttgt ggacatacat 3120 ccaactgtac ctaacccgta taccctccta agtaccttaa accctaaaca tcaatggtac 3180 actgttttag atttgaaaga tgctttcttt agtttgcctt tagcccctca gagccaaaag 3240 ctcttcgcct tcgagtggaa tgaccctgat aggggcataa gtggccaact gacatggacc 3300 aggctgccgc agggattcaa aaactctcct accctgttcg atgaggccct ccatgaagac 3360 ctgggtgagt accgacgtaa acaccctgaa ataattttac tccagtatgt tgatgacctc 3420 ctgattgctg ctgagaccca agaagcttgc atccaaggga ccaagggtct cttacaagct 3480 ctagggaatc taggctaccg agcctcggca aagaaagctc aaatctgtaa gccagaggta 3540 atatatctag ggtacctgct taagggaggg cagcgctggc taacagacgc ccggaaacaa 3600 actgttctgc agatccccag gccacaatcc acccgacaag tgagagagtt cctggggtcg 3660 gcaggatttt gcngactatg gatacctggg ttcgcagaac tggctaaacc cttgtatcag 3720 gcaacacggg ggcaacagcc atttaattgg acagacgaag ccgagttggc cttccaacag 3780 attaaaaccg ccctactctc cgcgcctgca ctaggactac ctgatgttac caagcccttc 3840 cacttatacg tggatgaaaa taagggtgtc gccaaggcgg taataactca gaacttaggc 3900 ccctggcgga ggccagttgc ctacctgtca aagaagttag acccagtagc tgccgggtgg 3960 cccccttgtc tccgaatgat tgcggccacg gctctgatgg tgcaagatgc tgataaactt 4020 gtcatggggc aagaattgcg ggtcgttact ccacatgcca tcgaaggtgt actcaaacag 4080 ccacctaatc gatggatgag taacgcccgg ctcacccact accaaggact actactaaat 4140 cctctcagga taattttcct gcccccaacg accttaaacc ctgcctcgct gctgcccaac 4200 ccggacctgg acgccccact ccatgactgc accgagatac tagctcaggt gcacggagtt 4260 cgagaagacc tgcaggaccg cccacttcct gacgccgacc tcgtctggtt cactgatggg 4320 
agcagcttca tgcatcaagg ccagaggtac gctggggcgg cagtaacttc agagactgag 4380 gtaatctggg cggaacccct gcccccgggg acatcggccc agaaggccga actgatagcg 4440 ctcacccaag ctcttacctt aggggcgggg aaaaagctga cagtatatac agacagccga 4500 tatgcttttg caacggcgca tatacatggg gccatttaca gggagcgagg gttactgacg 4560 gctgaaggaa aagagataaa aaacaagcaa gagatcctag ccctgctaac agccctatgg 4620 aggccagaaa aattagccat tgtacattgc ccagggcatc agaaactaac tactccaact 4680 gctcaaggca actttctggc agaccaaact gcaagaaatg tggcgaaggc tcccagccaa 4740 ctccttgcac tccagctccc tgacccgggc ccccgggact tgccatattt ccctgaatat 4800 tcagaacaag atctccagtg gattgacaaa cttcccctga aacaaatcca gaatgggtgg 4860 tggactgata ctaatgacca aaccatccta ccagaaaaat taggacaaca ggtgttagaa 4920 cacatccacc gaaccaccca cctgggggcc cggcggatga tagacctgat cagacgctcc 4980 aagctcaaaa tcagacatat agctgagacg gccagcagta tcgtgacaag ttgcaaagtc 5040 tgccagctta acaacgcata cccccaatct caagctgcag caggaacaag gctcagggga 5100 accaggcccg gtatctactg ggaagtagat tttactgaaa taaagccagg aaagtacggg 5160 taccggtact tacttgtctt tgtagatact ttttcagggt ggactgaagc attcccaacc 5220 aaaagagaaa ctgctcaggt cgtagcaaag aaaattctgg aagatatcct tcccaggtat 5280 ggcttcccca tccagatagg gtcagataat gggcccgctt tcgtcgctaa ggtaagtcag 5340 gacttggctt ccatccttgg ggcaaattgg aaactacatt gcgcttacag gccccagagt 5400 tcaggacagg tagaaaggat gaatcggacc ttaaaagaga ccttaactaa attgactata 5460 gagactggcg ctaattgggt agtccttctc ccctatgctc tgttccgggc ccgtaatacc 5520 ccttacaaac tgggcctcac cccttacgaa atcatgtatg gcagacctcc acccctggtt 5580 cctagcttaa aagatgacct gcttaagtct gaaacagaaa atgtctctga attcttattt 5640 tccttacaag ccttacagaa aattcaccaa gaaatctggc ccaagctgag agagctatat 5700 gagaccagtc ccccaccgac accccatccg taccagccgg gagactgggt cctggttaag 5760 cgacaccgac aagagaccct agagcccagg tggaaaggac cactccaagt actcctgacc 5820 acacccaccg ccctgaaggt agaaggcatt gcgtcgtgga tccactacac ccacgtcaag 5880 ccagtggacc caacctccga ccttctgggg ccaatcacgg cggcggcggc tgaagcaccg 5940 gacacgtgga ctgtggacag agctaagaac aaccccttaa aactcaccct gcgccggcag 6000 cataactcac 
tgcaaacatg cagttaggta gtctaactct aacattagtc gccctagtgg 6060 ccgctgggga aaacataaag ccagctccta atccctttgt ctggagattc tggctttatg 6120 aaaaccaaac ccaccctggg caacctcata agcccgggaa attagtggcc agtgcagatt 6180 gcccctcctc agggtgcaat agcccaattt tactaaattt taccgatttc ccagtagcca 6240 aaccagtggc accaataata tgcttcgagt atgatcagac tgaatacaat tgtaagcact 6300 attggtggca ccaaagtgcc ggctgccctt ataactattg taacatccat aaataccaat 6360 ggtggggtgg agaagaacag atagatccca gatggccctt ccatcgcaga cgagatagag 6420 acctttcata tacatggata gttagagacc cctggaactc ccgctggacc acgcctcaac 6480 acggggctgt atactactcc tccgcctcca catggcctag cagtcacctc tatctgtggc 6540 ggggtctagt gcaggtacgg cccctggtcc atggaaatat ccagcgacaa gaaaaccgcc 6600 tgacacaaga tttacgtcct ttttcctggt taaaattatt gcaagaagga ttagaacttg 6660 ccaaccttac aggacttcac agcctgtctg gctgctttct atgtgccact ctagggcgtc 6720 caccgctaac cgctgtcccc ctgccatggg gatcatccac ctctgcccaa gctaacaacc 6780 accaaaacct ctcatatgcc cctatcccta acgtgccact atacctaaac cccagtcaag 6840 agaagtttcc ctactgtttc tcaggaacta attccagcct ctgcaacatc actgcaacgc 6900 cccctaacat caccttaagg gctccgtcag gcatattctt ctggtgtaat ggaacattat 6960 ctaaaaacct atcaagcccc tctgttacca acctactgtg tcttcctgtc acattagttc 7020 cccggttaac tctacttact gccggcgagt tcctagggta taccggtaac tggactagtg 7080 ctgttattca cccagaccct agaccgagac ctgcacgagc catatttctc cccctcattg 7140 caggaatctc cctcaccgca tccttcatgg cggccggact ggctggggga gccctaggtc 7200 acacccttat agaaagtaac aagctgtacc aacaatttgc cgttgctatg gaggagtcag 7260 ctgagtccct tgcctccctc cagcggcagc tcacgtccct agcacaggta accttgcaga 7320 accggagggc cttagaccta ctcactgctg aaaaaggggg aacgtgtatg tttctaaagg 7380 aagactgttg tttctacata aatgaatcag gactcgtgga agaccgagtc caacagttac 7440 gcaagttaag cacagaagta agaacacggc agtttgcttc agctgcagac caatggtgga 7500 actcatctat gttttctctg ttagccccct tccttggacc cctgctgagt ctactatttc 7560 tgcttaccgt aggaccttgt gttgttaaca gaattttgcg gttcgttaaa gaaaggttta 7620 acactgtaca actcatggtc ctcagagccc aataccaacc tgtaaacgct gaaacagaat 7680 cagacttata agacccaaga 
ttggctctaa aaaaatacct gaaaagaaag ggggggaa 7738 // ID LTR13B_OG repbase; DNA; PRI; 585 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.11, Created) DT 18-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR13B_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-585 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2857-2857 (2009). XX DR [1] (Consensus) XX CC ~94% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 585 BP; 144 A; 185 C; 107 G; 149 T; 0 other; tgtaagggcc agaccagcgc cattttgttg tctactacat aagctctgtt aaaccgcagg 60 gccactccct gctgcctccc ccacccctgg ttgcacagcc attaagccgt tgaattgagt 120 cacctaaggc agacagctgc cataataggc aataagtatc agttcccttg aacagctccc 180 ataataggca ataggtttct acttccttga aattcaggta ttctccttca actacctaat 240 taggctacac cacctctagc cccggttaag aaatccccca cccctcacct gtaccaatcc 300 aaatcctcct ttactagtga aatcatcaac caccaatgca aatcctcctt tactaatgaa 360 atcaccaacc accaatcaag agtgtacccg tacccgaccc tacccttctt gctttctgta 420 ccccataaaa acttgctcca ccccgggttc ggggctcgtc ctcggcttgc atgccaggag 480 agtccgggct tcccgggcct gaataaaact cacctcttgc tttttacatc ggagtgtgtc 540 tcggactgtc tgttggggcc acttgggaga atttcgggca taaca 585 // ID Alu3_TS repbase; DNA; PRI; 298 BP. XX AC . XX DT 09-APR-2010 (Rel. 15.04, Created) DT 09-APR-2010 (Rel. 15.07, Last updated, Version 3) XX DE Alu-like SINE element - consensus. XX KW SINE1/7SL; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Alu3_TS. XX NM Alu3_TS. 
XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-298 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(4), 634-634 (2010). XX DR [1] (Consensus) XX CC ~85% identical to consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 298 BP; 70 A; 71 C; 103 G; 53 T; 1 other; ggccgggcgc agtggctcag cctgtaatcc cagcactttg ggaggccgag gcgagaggat 60 tgcctgagcc cgggggttcg agaccagcct gggcaacttg gtgagacctt gtctctacaa 120 taaataaaaa attagccagg cgtggtagcg cgcgcctgta gttccagcta cttggaaggc 180 tgaggcggaa ggatcgcctg ggcccagcag gttggggctg cggtggccgt gagcatgcca 240 ctgcactacg gcatgggggg angagactcc aaatcttaaa aaagtctgaa aaggaaga 298 // ID LTR8E_Mim repbase; DNA; PRI; 595 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; nonautonomous; KW LTR8E_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-595 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1720-1720 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 595 BP; 160 A; 162 C; 125 G; 148 T; 0 other; tgaaaccgcc tttacaaatt gataaagggg tgcaaggcga gaactgtggt aagggcctag 60 ctaaaaataa ggcacaggcc tagctaaaaa taatgataat aattagccac tgtccccctg 120 tttgcttctc tataattgcc actctggagc cacatagctg gtggtcataa agattcttag 180 ctttctccat agataacatc acctttttga aacctagggg tagtttctga gatatcttcc 240 aggtcccgca ttccagtgga acggctgacc cacccagacc agtggcccat accgaggaac 300 tgactcaact ggtcttgtga cccccaccca agaacctggt cagcaaagaa gacaatttct 360 acaaccctat ggttccaccc cgaacccagc caattagcag actcaattgc ctatgccaaa 420 ctgtctttaa aaaccctgtc tcccaaattc tcggggaggc ggatttgaga aatttctccc 480 atctccccgc ttagcgctat gcggaataaa ctctttctct gctgtaaacc ctgctgtctc 540 agcgtattgg cttcctcttg ggcagcgggc aaatgaacct ggttgggcgg caaca 595 // ID ERV1-1B_TSy-LTR repbase; DNA; PRI; 460 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-1B_TSy-LTR; ERV1-1B_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-460 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1191-1191 (2010). XX DR [1] (Consensus) XX CC ~91% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 460 BP; 127 A; 123 C; 87 G; 123 T; 0 other; tgtggcagag agggggctga tagagttaaa agtattcctt ttgaaagtca atcggcaagc 60 tctgaaaacc cgccagacca gttttgcttg agaaaaccaa atagccaaga atgatggctt 120 tcatggaaat gcaaaagagt gaccaattca caaacttacc caaatcctct ctttttgaag 180 ttatcagttt acaaccccta ctcgctctgc atttcctgga aaccatttgt tctttgaaaa 240 tctgattttg ctgagtacaa ccttgtgaat tccaaagaaa atgtgtaatc gctcaacacg 300 acccccctcc cttacctaaa cttaggtata agacccctct cctccctttc cggggtgctc 360 agcctgggag cattagcccg ccgagtctgt gccggcacaa ataaatctgc tacttcctaa 420 aacagcctcg gtgtcacggc ctctctgtgt tcccactaca 460 // ID LTR16_Mim repbase; DNA; PRI; 573 BP. XX AC . XX DT 06-NOV-2009 (Rel. 14.11, Created) DT 06-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR16_Mim. XX NM LTR16_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-573 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2977-2977 (2009). XX DR [1] (Consensus) XX CC >90% identical to consensus. 4bp tsd. CC Similarity to LTR16_OG from bushbaby. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. 
XX SQ Sequence 573 BP; 171 A; 139 C; 93 G; 170 T; 0 other; tgaaggagct caggaaattt caccccaaaa tatgactcct tggtataaag aatattttga 60 attaaaggcc attcaagatc aaaaagcatt ggagggggct ttccctctat ctgcataaac 120 cggactgacc aatcaaaaga tcaaaggggc aattgacttc ccttcctctc cctgttatct 180 caatatattg caggaaggag gatcaagaat gcaaccagac ctggcccaaa tcatttaaat 240 ataatacctg tctctcaggt taatttaatt tgctcaggtt aatttaattt gcaaagagaa 300 tcatttacaa gtcaatctgt ttcccccatc catttatcct ccctagcacc atttgttccc 360 cctaaacaga attacctgta ctcctcatct ccccctcccc tccaaaagga caggtataaa 420 aatatctgaa cttcattggg atattgggta atcactctgt gattctcccc atgtgcatgg 480 taaataaacc tttctcttat taatctgcct taattgtgag ttgatctttc agcgaacttt 540 cgggggaaag ggaagtttcc cctcacccct aca 573 // ID LTR1B_Mim repbase; DNA; PRI; 592 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR1B_Mim. XX NM LTR1B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-592 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2947-2947 (2009). XX DR [1] (Consensus) XX CC ~94% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. 
XX SQ Sequence 592 BP; 166 A; 162 C; 103 G; 161 T; 0 other; tgtgaactaa aataaaattt taaggctcac ccccccaccg gctgactgaa tggaccccct 60 cgtggccaaa ggaatatcct aaaactaaat tgtctgccag gaggagggag gtcagacatg 120 cctcatcatg cccccctccc ttcttgggga catcctttgt aacccattaa caggcctaag 180 ggtatgcaag acaaacctgc aggtcctcaa tttacacaac aaatctatgt ccggtggctt 240 atctctgata acagctcctt atgttaaaac attccaagcc tttagacaaa gcttcatgtc 300 tttaaccaat tacaagccaa agaatcttta aacccaccta taacctgtaa gcccccgctt 360 cgagatggcc cacctttttg ggccaaacca atgtatgcct cccacgtatt gatttatgac 420 tttacctgta acccctgtct ccctgaaatg tataaaacca aactgtaacc cagccacacg 480 agtccacttg ctcaaggcct cttgggagtg gctctgggtc atggtcctca aatttggctc 540 agaataaatc tctttaaaat tattttacag agtttggctt ttttccttga ca 592 // ID LTR8B_Mim repbase; DNA; PRI; 613 BP. XX AC . XX DT 11-NOV-2009 (Rel. 14.11, Created) DT 11-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR8B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-613 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2959-2959 (2009). XX DR [1] (Consensus) XX CC >85% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 613 BP; 159 A; 166 C; 123 G; 165 T; 0 other; tgaaaccgcc tatattaaag taatgtaagg ccataaggca agaactgtgg taaaagtctg 60 atttctacag aagtatagac atagctaaaa gttaaccagc cattgtttcc tagcttgcct 120 tcctgtaatt gcttactgct taggagtcac gtatccttga ctggcgtaag atttgtgact 180 tccccaatcg cttctataga taacatcgct attgtaaaac ctaagactgg tctttgagat 240 atcttccaga ctctgcattc ggggggacca actgacgcca accagaccag tggccatgca 300 tgggaactga cccaacaggt cctgtgaccc tctacccagg aactgtctca gccaagaaga 360 caatttctgc ttcccaaccc tatgagttca tccccagcca atcagcagac ccaattccct 420 agccctttgt ctgccaaact acctttaaaa accttagccc ccaaattctc agggagatgg 480 atttgagaaa tttctcccat ctcctcgctt ggcgccttgt gatattaaac tctttctctg 540 ctgcaacccc tgctgtctca gtgtattggc ttcctcctgg gcagtgggca aggaacccgg 600 ttgggcggta aca 613 // ID MacERV1_LTR1 repbase; DNA; PRI; 477 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MacERV1_LTR1. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-477 RA Smit A.F.; RT "MacERV1_LTR1 - ERV1 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC Many more subs 2%; some close to 0%. 
XX SQ Sequence 477 BP; 143 A; 126 C; 91 G; 116 T; 1 other; tgaaagaaat caggcacatt cctcagcccc gggctcaaca caaacaggcc cagtacaaac 60 acatcccacc atatatctct caacttcctg agcccagtac aaacacatcc caccatatat 120 ctctcaactt cctgagccca gtacaaacac accacctgga aaatccccga taaggacaca 180 gccgtttgtg gttttactga aagcgcggga accaatagaa aaactgctaa atgtataact 240 taaaatgtaa accaattgta atgctgtaac caaaagaatt cccttgttcc tctgtaactt 300 tacgtatcct atttacntcg ggctataaaa ggcaagcact cgcattgttc ggggccctct 360 tgtatgctgt ggaatggagg gaccaagttc gaacttgcag taaaagatcc ttgccgcttg 420 gctttgactc tggactctgg tggtcttctt tggggaacaa acggtctggg cataaca 477 // ID ERV2-1_Mim-LTR repbase; DNA; PRI; 328 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; nonautonomous; KW ERV2-1_Mim-LTR. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-328 RA Jurka J.; RT "Endogenous retroviruses from the mouse lemur."; RL Repbase Reports 11(5), 1525-1525 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 328 BP; 63 A; 88 C; 102 G; 75 T; 0 other; tgtggggcgc ggtgtcaacg ccattacaag ttggcgcctg tttccggctt tgcctagagc 60 agcaaacaag ttcagcgcac gcgcaattgt cggttgccct gtggcgcaag catgcaaacg 120 aagggtcctg ctatggacta atgctttggt ggctcgacag ctgagcggcc tatcccagct 180 taggggttgt gcttctaggg tatatagcag cctgcgcgct gccgggctgg gtcttccgca 240 tcatgtaagt ctaaagggaa ccccattaaa gcactgtcag aagaactccg gttgccgcgt 300 cttccttgct ggcgaggcgg gcgcgaca 328 // ID HERVE_a repbase; DNA; PRI; 7847 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 
13.08, Last updated, Version -1) XX DE ERV1 Endogenous Retrovirus from Catarrhini. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; HERVE_a; KW HARLEQUIN; LTR2. XX OS Catarrhini OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini. XX RN [1] RP 1-7847 RA Smit A.F.; RT "HERVE_a - ERV1 Endogenous Retrovirus from Catarrhini."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC Some of the copies are hominoid or OWM specific. CC Lineage-specific copies 4.5-5% substituted. Only 82% similar to CC original HERVE, which probably goes with the older LTR2B/C LTRs. CC ORFs: gag 588-2147, pol 2148-5738, env 5698-7824. CC 91% Identical to HARLEQUIN. XX SQ Sequence 7847 BP; 2207 A; 1867 C; 1991 G; 1776 T; 6 other; tttcttggtt ccctgaccgg gaagcgaggt gattaacgga cggtcgaggc agccccttag 60 gcggcttagg cctgccctgt ggagcatccc tgcgggggac tccggccagc ttgagcgacg 120 cggatcctga gagcgctccc gggtaggcaa ttgccccggt ggaacgcctc gccagagcag 180 cgcgtggcag gcccccgtgg aggatcaacg cagtggctga acaccgggaa ggaactggca 240 cttggagtcc ggacanctga aacttggtaa gactagtctt tggaacttgc ccactccatt 300 tgagtggaag cgtggcctga tcacccacgg cgtgcctgta ccggcacttt ggtttttgtt 360 ttcgacttga cttggattgc ttgatacttt ggttttggtt ttgacctggc ttggatttct 420 cgatactctg attttggttt tgattctggt ttggtgtaaa ctgtaaaagt gtgtgtgtgc 480 cctttttacc cgttctttgt tttgtggtgt gcgtgtggtg tgagcgtggt gttttgtctc 540 gaggaaacac gggtcaggca caaagtaagc ccaccccact aggaactatg ttgaaaaatt 600 tcaaaaaggg atttaaggga gactatggag tcaccatgac accaggaaaa cttagaactt 660 tgtgtgagat agactggcca gcattagagg tgggttggcc atcagaagga agcctggaca 720 ggtcccttgt ctcgaaggta tggcacaggg taacctgtaa gccagggcac ccggatcagt 780 tcccatatat agattcttgg ttacagctag ttttggaccc cccacagtgg ttaagaggac 840 aggcagcagc agtactagta gcaaagggac agttagttaa ggaaggttct cnctccaccc 900 gccgagggaa gtcggcacca aaagtcctgt ccgacccaac accagaagaa tcatggcagg 960 aattggtacc agcagtaccc cctccttatc gagaggaagg 
gctccccacc cctgagccca 1020 cagcacctac acctccacca gataaccaca cccctagacc acccagagta gacaaaagag 1080 gaagtgaagc cgcgggagaa actcctccgt tggcagctcg cttacggccc gagactggaa 1140 tccaaatgcc cctgagagag cagcgatata ctggggtaga tgaggacgga cacatggtgg 1200 aaaggcgtgc ctttgtgtat caacctttca cctctgctga cctcctcaat tggaaaaata 1260 atactccatc ttacactgaa aagcctcaag ctttaattga cttgctccaa actattatac 1320 agactcataa tcctacttgg gctgattgcc accagctgct catgtacctc tttaatacag 1380 atgaaaggcg aagggtgctc cgggtggcaa ctaagtggct anaggagcac gtcccagccg 1440 attaccaaaa cccccaagaa tatataagaa ttcagctgcc aggaacagac ccccagtggg 1500 acccgaacga gggaccagac atggagaggc taagacggta ccgtgaggca ttaatagaag 1560 gtctaaagaa aggggctcaa aaggctacaa atgtaaataa ggtctctgag gtcatccaag 1620 gaaaagagga gagtccagcg caattctatg aaagactgtg tgaggcttac cgtatgtaca 1680 ctccttttga tccagatagc cctgaaaatc agcgcatgat taacatggcc ttagttagtc 1740 aaagcgcgga agatatcagg agaaaattgc agaaacaggc tgggtttgcg ggtatgaata 1800 cctcacggtt actggaaata gccaatcaag tgtttgtgaa tagagatgca acaagccgca 1860 gagaaagccg taaggaaggc gaacgccagg ctaggcgaaa cgccgactta ctggccgcgg 1920 ccattagggg aattcccccg aaaggagagg gaaagggggg ttccgggaag aatacccagt 1980 ctaatcgccc acgcttgcaa cgtaaccaat gcgcctattg taaggaaata ggacattgga 2040 aagataagtg tccccaactg aaggaaaagc aaggtgattc ggaacaaaag acctcagata 2100 aagacgaggg agctttgttc aatctggctg aagggctact ggactgaagg ggaccgggct 2160 caagcgcccc caaggagccc acggtcagga ttacaactgg gggcaaggac attaagtttt 2220 tggtcgatac tggtgctgaa cattcagtag tgaccacccc ggtcgccccc ttatccaaga 2280 aaaccattga tataatcgga gcaacaggag tttccactaa gcaggctttc tgtctaccac 2340 ggacctgctc ggtgggggga cgtgagatag ttcaccagtt cttgtacatg cctgactgtc 2400 ccttgccctt gctgggaaga gacttgctta gcaagctgag agccaccatc tcctttacaa 2460 aacagggctc tttacagcta aagttaccgg gaacaggagt tatcatggcc cttacggtcc 2520 cccgggaaga agaatggaga ctttttctaa ccgagccagg ccaagagata aaaccagctc 2580 tagctaagcg atggccccga gtatgggcag aggataatcc tccgggactg gcggtcaacc 2640 aagcccccgt actcatagaa gttaagcctg gggcccaacc aattagacaa 
aagcagtatc 2700 cggttcccag agaagctctc gaaggaatcc aggttcatct caggcgcttg aaagcctatg 2760 gaattatagt tccttgccag tctccgcgga acacccccct cctgcctgtc cctaagccag 2820 ggaccaagga ctaccggcca gtacaggact tgcgcttggt caaccaagct acagtgactc 2880 tgcacccaac agttcctaac ccttacacat tgttagggct gctgccggct gaggacagct 2940 ggtttacctg tctggactta aaagatgcct tctttagcat cagactagct cctgagagcc 3000 agaagctgtt tgcctttcag tgggaagatc cggagtcagg tgtcactact cagtacactt 3060 ggacccggct tccccaaggg ttcgagaact cccctactat cttcggggag gccctggctc 3120 gagacctgca aaagtttcct gctaaagacc taggctgcgt cttgctcctg tacgtggacg 3180 accttctgct gggacactcc acggcagtcg ggtgtgcaaa agggatggat gccctgcttc 3240 ggcacctgga ggactgtggg tataaggtgt ccaagaagaa agctcagatc tgcagacagc 3300 aggtacgcta cctgggattc actattcgga aaggggagcg cagcctgggg tcagaaagaa 3360 agcaggtcat ctgcagccta ccggaaccta gaaccagaag gcaagtaagg gaattcctag 3420 gagctgtggg gttttgcaga ttatggattc caaactttgc agtactagcc aaacctttgt 3480 acggggttac aaaggggggc gaccgggagc cttttgaatg ggggcctcta caacagcaag 3540 ccttttgtaa gttaaaggaa aaatttatgt cggccccagc cctaggacta ccagatttga 3600 caaagccctt tacactctat gtgtcagaaa gagaaaaaat ggcagttgga gttttaaccc 3660 agactgtggg gccctggcca aggccagtgg cctatctctc aaaacaacta gatggggttt 3720 ccaaaggctg gccaccatgt ctaagggccc tggcagcaac agccctgtta gcacaagaag 3780 cagataaact aacccttggg caaaacctga atataaaggc cccccatgct gtggtaactt 3840 tgatgaatac caaaggacat cattggctaa caaatgctag attaaccaag taccaaagct 3900 tgctatgtga aaatccccgc ataactattg aagtctgtaa caccctaaat cccgccaccc 3960 tgctcccagt atcagagagc ccggtcgagc ataaccgtgt agaggtgttg gactcagtct 4020 attctagcag acctgacctt cgggaccagc catgggcatc agtagactgg gagttatacg 4080 tggacgggag cagcttcatc aacccacaag gagaaagatg tgcaggatat gcggtggtaa 4140 ctttggacgc tgtcattgaa gccaaaccgt tgccacaggg cacttcagcc cagaaggctg 4200 agctcattgc tttaactcgg gctctagaac tcagtgaagg taagactgta aacatctaca 4260 ctgactctcg atacgccttt ctaaccctcc aagtgcatgg agcattatat aaggaaaagg 4320 gcctgttaaa ctctggggga aaggacataa aatatcaaca agaaattcta caattattag 
4380 aggcagtgtg gaaacctcag aaggcggcag tcatgcactg caggggacac cagcgagcct 4440 ccacctcagt ggccttagga aactctcgag ctgattcaga agctcgaaaa gcagcatcta 4500 ccccttaccg ggcatcggta gcagccccct tactccctca aacgcctgac ctggtaccta 4560 cctattctaa ggaagaaaaa gacttcttcc acgcagaagg ggggcaagta ataaaaggag 4620 gatggatcag actgccagat gggagggtag ctgtgccgca gttgctggga gccacagtcg 4680 tattggccat gcacgaaacc actcatctag gncaagagtc acttgaaaaa ttgttaggcc 4740 ggtacttcta cgtctcacac ttgccagccc ttgccaaagc agtagcacaa cggtgcgtta 4800 cttgccgaca gcacaatgcg aggcaaggcc ccactgttcc gcccggcata caagcttatg 4860 gagcggctcc ttttgaggat cttcaggtgg atttcacaga aatgccgaaa tgtggaggta 4920 acaagtattt gctggttctt gtgtgtactt actctgggtg ggtggaggct tatccaacac 4980 gaactgaaaa ggcctacgag gtaacccgtg tgcttctccg agatcttatt cctaggtttg 5040 gactgccctt acgaatcggc tcagataacg ggccggcgtt tgtggctgac ttggtacaga 5100 agacagcaaa ggcattagga atcacttgga agctacgtgc cgcctaccga cctcagagtt 5160 ccggaaaggt ggagcgaatg aatcggacta tcaaaaatag tttagggaaa gtatgtcagg 5220 aaacaggatt aaagcggata caggcccttc ctatggtatt gtttaaaatt agatgcactc 5280 cttctaagaa aacaggatac tccccttatg aaatactgta tcataggcct cctcctatac 5340 tacgggagct tccaggcact ccccgagagt taggtgaaat tgaattacag cgacagctac 5400 aggctttagg aaaaattaca caaacaatct caacttgggt aaatgagagg tgtcccatca 5460 gcttattctc cccagttcac cctttctctc caggtgatcg cgtgtggatc aaggactgga 5520 acgtagcccc tttgcggcca cggtggaaag gacctcagac cgtcatcctg accaccccca 5580 cggctgtaaa ggtagaagga atcccagcct ggatccacca cagccgtgtg aaacctgcag 5640 ccgctgaaac ctgggaggca aaaccgagcc cggacaaccc ctgcaaagtg actctgagga 5700 ggacgacaag ccctgctcca gtcacacccg gaagctgact ggtctacgca cggccgaagc 5760 atgaggagaa tcatcgtggg actcattttc cttataattt ggacttgtat agtaaaaact 5820 tccactgatt ttccccgcat ggaggactgc tctcagtgta tacatcaggt taccgaggta 5880 gggcaacaag ttaaaacaat ctttctgttc tatagttact atgaatgcct aggaacttta 5940 aaaggaacat gtttatataa tgacactcag tacaaggtat gtagcccagg aaacgacnga 6000 ccagatgtgt gttatgaccc ctctgagcct cccatgtcca cagtttttga aataagatta 6060 
aggactgaag actggtgggg actcataaat gatacaagta aagtattagc cagaacagaa 6120 gaaaaagggg tgcccaaacg cataatcttg aaatttgatg cctgtgctgt cattaatagc 6180 aataagttag gaaggggatg tggctctttt agttgggaaa aaggctatat gaccgaaaat 6240 aagtacattt gtcatgaatt aggactgtgt ggaaatgaat gtggatactg gtcttgtgtc 6300 atttgggcca cttggataaa aaatgaaaag gatccagtcc accttcagaa aggaaaaaat 6360 ggcccttcct gtactaaggg acaatgtaac cccttagagc tagtaataac caatcccctt 6420 gatcctcgct ggaaaaaagg ggagcgtgtg accttaggaa tcgacggggc cggactggat 6480 cctcgagtaa atatcttggt tcgaggagaa gtttacaaac actctcctga gccagtgttt 6540 caaactttct atgatgaact aaatgtgcca gtaccagaaa ttccaggaaa aacaagaaat 6600 ttgtttttgc aattagccga gcgtgtagcc cagtctctca atgtcacttc atgttatgta 6660 tgtggaggaa ctgtaatggg agatcaatgg ccatgggaag cccgagaatt agtacctaca 6720 gacccagttc ctgatgaatt cccggctcaa aagaatcacc ctgataattt ctgggtccta 6780 aaancctcaa ttattggaca atattgcata gctagagaag gaaaagaatt cactcacccc 6840 gtaggacgac ttagttgtct gggacagaaa ctgtataatg gtaccacaaa aacagtcact 6900 tggtggagtt caaatcacac agagaggaat ccatttagta aattcccaaa gttgcaaacc 6960 gtgtggaccc acccggagtc ccaccgggac tggacagccc ccactggatt atactggata 7020 tgtgggcata gagcttacgc caaattacct gaccggtggg caggtagttg tgttattggc 7080 actattaaac catctttctt cctactgccc ataaaaacag gcgaactcct gggcttccct 7140 gtctatgctt cccgcgaaaa gagaagcata gctataggaa attggaaaga tgatgaatgg 7200 ccccctgaga gaatcataca atattatggg cctgctactt gggcacaaga cggctcgtgg 7260 ggataccgga cccccattta catgatcaac cgaatcatac ggttacaagc tgtcttagaa 7320 ataatcacta ataaaaccgg cagagccttg actattctgg cccggcaaga aactcagatg 7380 agaaatgcta tctatcaaaa tagattggct ctcgactact tgctagcagc tgaaggaggg 7440 gtctgcggga aatttaacct tactaattgc tgtctacaca tagatgatca agggcaagta 7500 gttgaagaca tagttagaga tatgacaaaa ctggcacacg tgcccgtgca agtgtggcat 7560 ggatttgatc ctggggccat gtttggaaaa tggttcccag cgctaggagg atttaaaact 7620 cttataatag gagttataat agtaatagga acctgcttac tgctcccttg tttgctacct 7680 gtacttcttc aaatgataaa aagcttcatc gctaccttag ttcaccaaaa tgcttcagca 7740 caagtgtact 
atatgaatca ctatcgatct gtcttgcaag aagacatggg tagtgagaat 7800 gaaagtgaga actcccacta ttgagtgaga ttctcaaagg cggggaa 7847 // ID LTR1C3_OG repbase; DNA; PRI; 533 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 05-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1C3_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-533 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1670-1670 (2008). XX DR [1] (Consensus) XX CC 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 533 BP; 136 A; 143 C; 97 G; 157 T; 0 other; tgatacaggg tttcccccat tcagggctgg acaaggaccc ccgttttccg ggcctaagca 60 aacagacttt ttctctccaa actgcactct ctgaatacaa gcaagacagg agaaaagcag 120 tttttccagt atttcatgta agctaggcct cttctggagt ggaatgcata ctccctgccg 180 cttttgccca gagctggaag gactgcattc atttgttaat acactcattc atttggtcac 240 taatcaacat tgaaaagatg ttttctgtct gccaagtccc ggccaagtgt aatttttact 300 ccaaagatct ccctatttac aaacagtata taaaccccta gacaaaagac ccatgtgggg 360 atttcccact tgggtctccc tcctctactg ccagaggctc tggtgccttt cattccttct 420 ctattccttt ctatctaata aatcctatct tatcactgat ctctgtggtc cgtgggttca 480 ttcttcgaat ctccgagacc aaggacctac tgcagaaaga aattccggca aca 533 // ID ERV1-1_CJa-LTR repbase; DNA; PRI; 598 BP. XX AC . XX DT 12-JAN-2011 (Rel. 16.02, Created) DT 12-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW ERV1-1_CJa-LTR. 
XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-598 RA Jurka J.; RT "Endogenous retroviruses from the common marmoset."; RL Repbase Reports 11(2), 690-690 (2011). XX DR [1] (Consensus) XX CC >86% identical to consensus. 4bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. CC We thank the Marmoset Genome Sequencing Consortium for making CC their data publicly available. XX SQ Sequence 598 BP; 130 A; 165 C; 159 G; 142 T; 2 other; tgagggactg tgttaaggaa cttctcttcc cgggtccgcg aggggcgtgg tttcaaaggc 60 aggagtttcc cggctccacg cctccatccg ggaaccagag gggtttcaaa tgtaaagttc 120 tttgtctggg tttcccgcgc gcgggctagg tagggaaagc caggtgaggc tcctttgtcc 180 caaaaanccc gccgaaagct ctgggaggag ggggctctgt aactcagagt tgtggaccaa 240 gcagccaatc cggtatcaga gtcaaagatc aatcactcca gaggaagttt gaaagaactc 300 ctctcattgg acgagaacat gaatggggag ggaataactc aagggttaaa actccagctg 360 ctccttacct atacggattc cttctctggg tcccctccca cgacagcgtg ggagctgtct 420 ctctctctct ctctctctct gtctaataaa tcgcttttcc gcaaaactct ttgggtccac 480 gatatttatt cgaacggcaa actcaccgcg cctccggggc ccctttcccc attcttcggg 540 gttagngagg ccaagaacct cttcggaact gggagctggg tacgctcccc cggttaca 598 // ID piggyBac1_Mm repbase; DNA; PRI; 2527 BP. XX AC . XX DT 24-MAR-2010 (Rel. 15.06, Created) DT 24-MAR-2010 (Rel. 15.06, Last updated, Version 1) XX DE piggyBac1_Mm element: consensus sequence. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac1_Mm. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-2527 RA Pagan H.J.T., Smith J.D., Hubley R.H. 
and Ray D.A.; RT "PiggyBac-ing on a Primate Genome: Novel Elements, Recent RT Activity and Horizontal Transfer."; RL Genome Biol. Evol 2, 293-303 (2010). XX DR [1] (Consensus) XX CC piggyBac1_Mm is an autonomous DNA transposon reconstructed from CC the Microcebus murinus draft assembly. Comparison to previously CC identified piggyBac transposase ORFs suggest that this is a novel CC family. Lineage-specific activity suggests it was inserted more CC recently than the divergences from Lemur catta, Otolemur CC garnettii, and Cheirogaleus medius. XX FH Key Location/Qualifiers FT CDS 1037..2347 FT /product="piggyBac1_Mm_1p" FT /translation="MKAFLGVILNMGVLNHPNLQSYWSMDFESHIPFFRSV FT FKRERFLQIFWMLHLKNDQKSSKDLRTRTEKVNCFLSYLEMKFRERFCPGR FT EIAVDEAVVGFKGKIHFITYNPKKPTKWGIRLYVLSDSKCGYVHSFVPYYG FT GITSETLVRPDLPFTSRIVLELHERLKNSVPGSQGYHFFTDRYYTSVTLAK FT ELFKEKTHLTGTIMPNRKDNPPVIKHQKLKKGEIVAFRDENVMLLAWKDKR FT IVTXLSTWDXSETESVERRVXGGGKEIVLKPKVVTNYTKFMGGVDIADXYT FT STYCFMRKTLKWWRTLFFWGLEVSVVNSYILYKECQKRKNEKPITHVKFIR FT KLVHDLVGEFRDGTLTSRGRLLSTNLEQRLDGKLHIITPHPNKKHKDCVVC FT SNRKIKGGRRETIYICETCECKPGLHVGECFKKYHTMKNYRD*" XX SQ Sequence 2527 BP; 830 A; 412 C; 546 G; 715 T; 24 other; ccctttgcac tcggatgtcg agtgtgactc gacgcggtta gcatcggttg cagctcgtat 60 gttgagccat actcgacctg tagtttcacc gaggggggaa gggggatttt tgtctatttt 120 tccagtattt ttcttgtttt cattagcatg aaaggacaag taaaatgtaa atgccgtctc 180 aactgatgcc accacctaag cttrtctttt ggataggcag caatcttgag taggagggtc 240 caaggcatga gtcsttgcat catgcctttt ygagcttgca tcatcttccc ttggtaaaga 300 accttcctta tcttttcrtc tcctaggttt gtttaggttt ctattagcat atyacagatk 360 ttgtcmtgac acrgggttta gttggatgat tggtttgccc cctagcagtc scacgttctg 420 agtgaatttc ccyycctcag gatggttaag ttttccaaag acatagaaag cagcgatgat 480 gaattttact tcgaaaacga agacaaaagt gaaaaatgta ataatgacga aattgagttc 540 tctgaggatg cgagtggaga tgaacaaatc gctggtccta gtggaactac agaaaggaaa 600 acgtcccttg ctttgcccaa aaatctggct gaaagtactg acagtgacat tgaattcata 660 aaggcaaaac gaagtcgcac aattgtttat tcgagagtga tgtagatata 
ggtgatatca 720 ttgagaaaty gggtataaga cccagtgaaa gttangtttc taggggagaa aacaggaaaa 780 agaaaagtgg acatctacat ctgtgaatga caaagagcct agcagaatcc ctttctccac 840 tggtcagtta catgttggac ctcaagttcc ttctggatgt gccacaccaa tagacttttt 900 tcagttgttt tttactgaga cgttgataaa aaatataacg gatgagacaa atgagtatgc 960 taggcataaa attagccaga aaraactatc ccagcratcc actbggaata attggaaaga 1020 tgtgacgata gaggaaatga aggcttttct tggtgtaata ctaaatatgg gtgtattgaa 1080 tcaccccaac ttgcagagtt attggtccat ggattttgag tctcatatac cattttttcg 1140 atcdgttttc aaaagagaaa ggttcttgca aatattttgg atgttacact tgaaaaayga 1200 tcaaaagtca agtaaagatt tgagaacaag aaccgagaag gtaaactgct tcttgtcata 1260 ccttgaaatg aaatttaggg agagattctg tcctggaaga gagattgctg ttgatgaggc 1320 tgttgttggt ttcaagggaa agatacactt catcacatat aatcctaaaa agccaacaaa 1380 atggggtatt cgtttatatg tcctctcaga ttcaaaatgt ggttacgtac attcatttgt 1440 cccttattat ggaggaataa catctgaaac gttggtgaga cctgatctcc cttttactag 1500 cagaatagtt ttagagcttc atgaaagatt gaagaactca gtacctggtt ctcaaggtta 1560 tcatttcttc actgacaggt attacactag tgtcactctt gcaaaggaat tgttcaagga 1620 aaaaacacac ttgacaggga caataatgcc caataggaaa gataatccac ctgttatcaa 1680 acatcaaaag ctaaagaaag gagagatagt ggctttcagg gatgaaaatg taatgctkct 1740 tgcgtggaaa gacaagagga tagtcacart gctcagtaca tgggacrctt cagagactga 1800 gtctgtggaa aggagagtgc vtggtggtgg gaaggaaata gttctcaaac ccaaggtbgt 1860 gaccaactat acaaagttta tggggggtgt tgatatagct gatcrctaca caagcacata 1920 ctgttttatg aggaagactc taaaatggtg gcgtacattg tttttctggg ggcttgaggt 1980 cagtgttgta aattcctata tattgtacaa agaatgccaa aaaagaaaaa atgaaaaacc 2040 aataacgcat gtgaaattta ttaggaaact agtacatgat cttgttggag aattcagaga 2100 tggtacacta acttccaggg gtagattatt atccactaac ttagagcaac gtttagatgg 2160 aaagctccat ataatcacac ctcaccctaa caaaaaacac aaagattgtg ttgtttgttc 2220 taacagaaaa atcaaaggag gaagaagaga gacgatttat atttgcgaaa catgtgaatg 2280 taaaccaggt cttcatgtgg gtgaatgttt caaaaaatat cacaccatga aaaattatag 2340 agattaaaat tactctttga atgtatcaat aatttgaaat ataaaaaaat ccaaataaat 2400 
aagtttgtat gaaaagaaac tccagttttt tattctactg ccgcactttg taaaatctgg 2460 ggtatttaaa aaattaaatc ccgagtagaa taaaggaatc gagaaaaaag caagcgagtg 2520 caaaggg 2527 // ID Charlie12 repbase; DNA; PRI; 2873 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE hAT DNA transposon from primates. XX KW hAT; DNA transposon; Transposable Element; Charlie12. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-2873 RA Smit A.F.; RT "Charlie12 - hAT DNA transposon from primates."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Autonomous elements that gave rise to MER30 CC elements. Only two copies in genome (on 2q37.3 and 7q22.1). CC Reconstructed from these; only MER30-like sequences are CC consensus. Product =>55% similar to Charlie8. See CC Charlie12_GG in chicken.lib (March 2002). XX SQ Sequence 2873 BP; 957 A; 478 C; 550 G; 888 T; 0 other; caagcttgtc caacccgcgg cccgcgggcc gcatgcggcc caggacggct ttgaatgcgg 60 cccaacacaa attcgtaaac tttcttaaaa cattatgaga tttttttgcg attttttttg 120 taattttgta atttgactat gtgattctca agtgtgaact tcgtagacaa caatggaatg 180 gaaaggggta atcagagtat gtcacatcca caaataatgg actagtatcc ataatttgca 240 ttataattgc aaacagttca gcaagaagaa attaaaagcc attaaaatta actaacaaag 300 agcgagagga cgctattgca gtgccacgtg aagtgaattg tgacaaagaa ttcaagtgtg 360 tattctggac aaggcagagc gctgctgtac accagacact gtcgccattt gtacgcatgt 420 gtggcttcag ttttatggtg gtgtgcgact tccagtgtgt tatcttctaa tcacggacag 480 ttgaattgac accttcaggc gttatatgtg cttctggcga cttgagcaat gatatttgat 540 aatatttcaa cctacctttg taaatgatag gtgtttaata ttagttagaa gacctataag 600 tattagtctt cttatttggt tattaaacct tgctttctat tttcctgtct attaattttc 660 ataataaatg taaatcgagt tgcttatcta ttattatttt ttaaaattcc agcatatggc 720 ttctacaatg tctcaaaaga aaacaaaccc cccccaaaat aaaaaataca gatgagggaa 780 gattgctcaa cgaaaagtgg acagatgact acttttttgt caaggcaaat agtaaggcac 840 
tctgcttgat ttgtagggaa tttgtgccag tttcaaagac tataatttga aaaggcatta 900 tatgcaaaga cgtgctgcca aatttggtgc gtatcaagga atgtgtcgta aggacaaaaa 960 tagcagaact gaaaaaatgt ctgtcttcac aaaaaaaatt ttttttaaag ttgcaactca 1020 aacagtctat tgtaaaagct agttatatga tagcaaattt aatagcaaaa agcaaaacta 1080 tttacagatg gtgagtttat taagcaacgt atgggaggca tggcatatat catttgccct 1140 gataaaaaag aagatatctc taaaatcagt ttgtcttgcc ggaatatagc caggtgaatt 1200 ggagaaattg gaaagtctat gaaaagagcg taaaactgct aattttaaat tttgtgcttt 1260 ggcgatggat gaaagcactg atgctacaca tatggcacaa cttgccattt ttattagagg 1320 cattgatgac gaatagaatg tcatcattat ataaagccat ataatgaaga aataataaac 1380 ccatataatg aagaaatggc ttttttagtg ccattaaaaa acagagtaaa tcaagagatt 1440 tatacgaagt agtaaaagat gcattaaagc aattttcttt gtgcgttgtg aacatacctg 1500 gtatagttac tgatgatgcc cctgcgatgg tacgtaaaag agagggagtt gtaaaattaa 1560 tagaaaatga tgcagttgcc gcctgaaact cacttttgat gatgtgtcat tgtatagtac 1620 atcaagaaaa tttatgcaca aaagctttaa aaatggataa catcatgcaa attgtcatca 1680 aggctgtgaa tttcataggg gccaagagat tgaatcattg ccaattccag gaattcctta 1740 aaagtatgga tgctgactat agcaacatca tttacttttc ggaagtaaag tcgagacaga 1800 tgttgaaaag attttatgat ttgcgacatg aaatcgagtt atttatggta tcaaaaacaa 1860 aatttgtgcc agaacttgat gacgaaaact ggcttacaga tttagcattt ttagtggatt 1920 tgaccactca tttaaatgag ttaaacatga atcttcaagg tgaaaaccaa cttctcaata 1980 caatgtttca aaccataaca gtgttccaaa tacaattgaa attatggcaa gctaaaatta 2040 aggcaaacag ttttacggat ttcaacacat ttgctaaaca cgggcttgtc aacagcaaaa 2100 agtattctgc cttgcttttt gatttgataa aggaatttga aaacaggttt taagatttct 2160 ggaaaaataa tcaatatttt ggtatagttg caactccatt ttcagccaac ataaatatgt 2220 tacctgcgaa tgcatacagc tgcaatgtga cattcaactt aaagaaaaat ctcatcaggc 2280 ttctttcctg gactttgtaa gacctatctt cccagagaca aatatccctc gcttcacagt 2340 catgccttac tcatgtcgtc ggtttttggc agcacctgcg tttgtgagca actgttttca 2400 aggatgaagc acacgaagag taaaattaga accaaaatat ctgaggagca ccttgagaac 2460 tcgctgagaa ttgcaactac ttccatcgaa ccagatattg atgcattagt ttctcaaaaa 2520 caatgtcaaa 
tatcccacta gttttatgtt gtcctctttt acttttataa taaaaattat 2580 caaaaaatta atgacgtttt attacttaga tacgtacatt ttctatgtca gtgattgcaa 2640 agttgggacc tgcttgacga ttttaaaaga ccctctgaaa ggggcagcac atggttagat 2700 tatgatgcga ggactttttt gcttatctgt ggtggtggat atcacgaaaa ttatgcacag 2760 accttttttt tttagctcat cagctatcgt tagtgttagt gtattttatg tgtggcccaa 2820 gacaattctt cttccaatgt ggcccaggga agccaaaaga ttggacaccc gtg 2873 // ID ERV1-2_TSy-LTR repbase; DNA; PRI; 580 BP. XX AC . XX DT 28-JAN-2010 (Rel. 15.09, Created) DT 28-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-2_TSy-LTR; ERV1-2_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-580 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1195-1195 (2010). XX DR [1] (Consensus) XX CC ~95% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 580 BP; 153 A; 134 C; 138 G; 154 T; 1 other; tgtggcagga ggcggcatat gctcaaatat atttcttgtg tgattgagtg aataaacaca 60 gaaacacatg tggctgattc gggatcagca gttggcctca cctgggctgc cactagccca 120 gcggcaagaa ggaaaggttg atagcagtta aaggtatttc cctttgaaag gcaagttgca 180 actttaaagg cccgcccggc gtcgggcagg ttttgtttaa gaaaggccaa atggcccaga 240 gactggctat ttcatggaaa tggaaaagag tcttatctaa aacccccctt ctgaagttga 300 cggtttacaa gcctgcagtt cctggaaacc atttgttctt tgaaaaatct gattttgctg 360 agtacaacct tgtaagattc caaaggaagt atgtaatcac tcaacacgac ccccctccct 420 tacctaaacg agggtataag accccaggtc ncctgttctg gggcgctcga gcttgatttg 480 gatgcatgaa attctcgggc ccgctgtgct agccagcgta ataaagtgcc acttcctata 540 atagtctcgg tgtcacgatc tctctatttt tcctgctaca 580 // ID CYN-II2 repbase; DNA; PRI; 172 BP. XX AC . XX DT 02-MAY-2006 (Rel. 11.04, Created) DT 02-MAY-2006 (Rel. 11.04, Last updated, Version 1) XX DE SINE element from Cynocephalus - consensus sequence. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; CYN-I; CYN-II2. XX OS Cynocephalus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Dermoptera; Cynocephalidae. XX RN [1] RP 1-172 RA Schmitz J. and Zischler H.; RT "A novel family of tRNA-derived SINEs in the colugo and two new RT retrotransposable markers separating dermopterans from RT primates."; RL Mol Phylogenet Evol 28(2), 341-349 (2003). XX DR [1] (Consensus) XX SQ Sequence 172 BP; 34 A; 44 C; 65 G; 29 T; 0 other; ggctggcccg gtggtgcact tggtagagtg cggcgcttgg gagtatggcg gcgctcccgc 60 cgagggttag gatcccatat agagaccggt ccgctcactg gttgagcgcg gtgcgggcgc 120 gacaccgagg gttgcgatcc cgttgccggt cacgaaaaaa gacaaaaagg aa 172 // ID ERV2-1_OG-LTR repbase; DNA; PRI; 367 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. 
XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERV2-1_OG-I; KW ERV2-1_OG-LTR. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-367 RA Jurka J.; RT "Endogenous retroviruses from bushbaby."; RL Repbase Reports 11(5), 1477-1477 (2011). XX DR [1] (Consensus) XX CC >98% identical to consensus. Low copy. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 367 BP; 83 A; 118 C; 71 G; 95 T; 0 other; tgtcgggagc caaatgacgc cagatggccg cccatcccca ctgtctggag tatgctaacc 60 tttatcagat aaggagctgc ttttccctta tcagacaagg agctgttctt cccagagaag 120 ccataaaatg tttcagacac atcgtttaca ttcccagcag gagattgatg caagctctaa 180 ctcctgcatt cctctgcccc ccacctcttg tcttctctga atgctataaa aactgatctg 240 ttacccccat taaatgaggc cttgacaaga aatctgcctg gcctcgcctc ctctttctct 300 cgcccattct ctttacaggc tgggtccccc tcgccccccg aataaccgtg tcccgcagga 360 cgggaca 367 // ID LTR25_TS repbase; DNA; PRI; 385 BP. XX AC . XX DT 11-DEC-2009 (Rel. 15.09, Created) DT 11-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV3-type endogenous retrovirus - DE consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR25_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-385 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1277-1277 (2010). XX DR [1] (Consensus) XX CC ~88% identical to consensus. 5bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 385 BP; 94 A; 86 C; 90 G; 115 T; 0 other; tgtggggtat tgtccagaca tggactgttt ccccctccct cagggagtgg cctctggcca 60 aggctggtct ctgaagggta tgcatattcc cagtaatgag tcatgggatg caggtgacca 120 agcaaagctg aaaacaatgt agtcgtcact cttaatcaat gtaattgtaa acaccgtata 180 ggaatttgaa taattaactt gcttgattat aaaactgctt gctgtactgc cctgaggtgc 240 ggattcttca ctgacctgcc gccattggtg cctcctgtaa gttttaataa accaatgtct 300 ttccctgctg gacctctggg tgacttcttt ggaataggta aacattatgt gcaagccctc 360 tcagatttgg ggttaccgca ttaca 385 // ID LTR25_OG repbase; DNA; PRI; 403 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR25_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-403 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1597-1597 (2011). XX DR [1] (Consensus) XX CC ~92% identical to consensus. 4 bp TSD. Similarity to LTR19_Pca CC from hyrax. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 403 BP; 90 A; 93 C; 117 G; 103 T; 0 other; tgagagagga taccggagag gggcgaggtc tgggagaaca ggtcttctca gggagaccac 60 gaggatgcac ctgcagggcc ctctccatgg tgactcagct cttgggaatt tataggctta 120 actccctgga acttggaaat ttccaagttc tgtgaaatcc tgtagttctg taggggctgg 180 agtagcatct cccgcttgat tcaaactggc caatgggaag ggtgcctgtg gggtggaact 240 ttttgactgt caataaaaag ggaaagacca agccagtcgg ggagcgcggc catttggaaa 300 tcttctatgc cgtaagctcc cggccgatga ataaagccct tcctcttata actatggtgt 360 ttgggtgctc tttctctccg ttgggcctcg tcttcctgca aca 403 // ID LTR22B_OG repbase; DNA; PRI; 491 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 
16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR22B_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-491 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1593-1593 (2011). XX DR [1] (Consensus) XX CC >88% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 491 BP; 125 A; 138 C; 100 G; 128 T; 0 other; tgtgatgccc taactccatc cattttgagc aagtgaacca gacttactcc attttgttaa 60 gagcttgcag cccccccccc cttttgtaac caaaatagcc ccttgagcaa gcagcgaccc 120 tgcaaaaaca ggaaggaaat ttagctgaga gctgcagaat cctgtgacaa ctgaaaaaaa 180 tcttgtggaa accacaatcc tgtcacaagt ctgtctggtc tgtctgtgtg cagtttttag 240 acccctgtaa gacccctccc catagctcta accttgcaga tattagcttc tgtaaaattc 300 tgcttacgct agggcttccc atcccccgct tgaagtgcca tataaaaacc ttgcaccccc 360 gcttcaaggg ggtcaagaga ctttgtggca tgagcctgct cttgactggc cggcccaata 420 aaggaccctt gcttaatttg aactcagtgt gctggcgttt ctctataatt ccactcgcgg 480 cgggtacaac a 491 // ID MER101_I repbase; DNA; PRI; 6639 BP. XX AC . XX DT 05-MAR-2004 (Rel. 13.06, Created) DT 05-SEP-2008 (Rel. 13.1, Last updated, Version 2) XX DE ERV1 Endogenous Retrovirus from primates (internal portion). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MER101-int; MER101_I. XX NM MER101-int. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-6639 RA Smit A.F.; RT "MER101-int - ERV1 Endogenous Retrovirus from primates."; RL Direct Submission to Repbase Update (26-JUN-2008). 
XX DR [1] (Consensus) XX CC closest to PRIMA4-int; probably non-autonomous 16% subst; CC related seqs predate radiation. XX SQ Sequence 6639 BP; 1770 A; 1380 C; 1324 G; 2130 T; 35 other; ttcttggtgt cagaagcggg atttgaagca accccgattc tcctcccggc gccgtcctga 60 accaacgcat tggtgcctgc aagagcccct tgagctcagt tgtcttctca ctggagatgc 120 caggataggg ataggtaggg taatggcctc nggtaagtcc tctcgaattc agacctccca 180 tgccttggtt gaggtctcag agactttttc cccgtaggtc tcgtccatcg accccagagg 240 gacttcnttt agccatgggc ttgggagggg cttctcagcc agtcccccca tctggactct 300 gaaggggaca ttccctttgg ccccaggctt agnggggggc ctttccancc agttctctcc 360 atgcggatgc cggaggggct ctccctgtcg cccgaggttt agnaggggct tcccagtcaa 420 ctcggccggt ccataaggtt ttgtggggac gctcacgcta agggcactgg aagggacgcc 480 ttcgtcaggt cagtgcaatg ggtaacccat gttttaagtt ttccctgntg atcagccgcc 540 ttccggcact ccagctggct ttatgagcaa aaattatggt ccagggagtt gtaattggct 600 aggtctctgg acaaaaatta cccaggataa tcttaaactt tgttggccaa agtggggctc 660 ctttgaaatt ccaaaacttg cctatctgcg cgcacaattg gaacaaagaa aacaccgaac 720 ctcccagaga caatgggaag cctttttcag ttggtacttt gaaagttcta aatggaatca 780 agaggctacc attgcctccc ttagagaaaa taattccaaa ttgagtgagc gccttaatga 840 aatagaaaga gattagcgct tgcgagactg aaactaaagc tgtnaaaaca cctctggaag 900 acccttcttc cactagcccc ccttgcctgc cactctgcct gtcttctgca ccttttttca 960 ccacctctgc tttatccttc actccctcct tcctctttca ctctttcagc tggttatttt 1020 gaaaaggttt ctaaattttc tctaggctcg tctgtgtgtt tccttgtaaa atcctgtgat 1080 aaattcctgt gattttatgt taccttggca tccattttaa tcctcctcta acacacccag 1140 actccttgtt gagaaagctt aaattctctc tgtgcttgag atgtaaattt gctaccctgt 1200 tttctctaaa attcggtaag ggcttcagcc atgtgggaca gataaatttc agcctgttcc 1260 atttacagag acgcagtttg aatccaactg tccttttaaa ctagtgagtt ttacctgact 1320 catggctaaa gttttaaaat taaagctata agatctttat ttgtgtctgt ctgtattttt 1380 ntgtatatgt gtgtatacat gtctgttcgt atattgtcta cggtaccaaa ttggcttata 1440 aataaatgag tactcataaa ttaagcaaat aagcccaaat gcttttcaag ttcatgtgac 1500 ttagtaatct tttggcggat gggactagtc taatattgtt ggtttgatgg 
gaatggctgt 1560 gtcttctgag ttatcagcaa aatatgcatg tatttaactt tagggttctt gcttttatga 1620 tacttgcctg gcatgcagta atgtaaaatt ggttgataga aaatttagct tgggatgatg 1680 gctagatttg tctagtgtct catgaagttt tccaggcata attnttaaga gtgaatggat 1740 tggatggatg taaatgggat aaaagtttat aaatnaactt ttgataatgg ttatgttttg 1800 taatatgttt acttgggagg gcttctcaaa tntctttagt aactataccc ttagagtttt 1860 gctaagctaa attaaatgat ggatattcat tgaatgtcta gatcatttnc agataagata 1920 taatgctgag acattnattg ctgaatatga gtttaggctc atatactttt ggcttcttat 1980 ttcagagaaa caaaagttat ttggatctgt tagtaaaaat gtcctgttcc atattaaaaa 2040 gntgttctgt tagaaagcct atgtctctgg aaattgtaaa atgtgtattc atggattgtt 2100 ggtacatgat tggcagttaa aagttgctta cttcctaggt tttcactgaa aattagggtt 2160 actaagagtt aacattgtaa ttaatgtgtg tgattaaact actagagatg agaaagacca 2220 ttctgtatgc aagtgtatga ggagggtagg atgtattttt ggtaaggaag gttgaaaaga 2280 aaagagaata attttgtatg agaaagaatc ttgtgtggta aatttttntc ctanagtaaa 2340 atgactggtt atttaagaaa gaggaagtat aggacaaagc agaaagtcca agcatgtcat 2400 aaatggtcta agtaaatcat gataaggttt atgaaaagaa agtttataaa aggaattttc 2460 tgtgtgatca ggttggctac aattggaagg aaattgttta tgggtctttc taaggattga 2520 gctttgatgt tagaaatgca ctgatgcaga acttaaaaat ttggtcccct gtgttagaac 2580 aaggttttct taaaatgttg atttgctctt agtaaaattg caagaggttt tgatttttaa 2640 ttctgaaatc tgtttcctta acagccatcc tctaaactac aaacagtttc tatttctgcc 2700 acatttcttc ctgagatcta tctaatttcc ctagtttcag gttggaaatg cagctctcct 2760 tctttctacc cttgaaaagg tatatctttt tgcttggctg gggtgataac cctctccttc 2820 aaccttttcg tcagctcctg taactttttc tccggttcta acactgccgt tatggcctga 2880 tgctaaaatg tttatcttga aggtctagaa aggcaatgtt tccttcagta caacttgatt 2940 ctgtactttt ggcttttctt gatgtgtctg aattgttcca tgtaaccagg aaacttccta 3000 tgctgttact aaaaaccacg tattcccctg ctcaaggtac tagttttctt gtttacattc 3060 ctctataata tgggtacact cataaccctg gacacactct tcctgtgcct gattaaattc 3120 aagtaccctt ttcatcaggt ttaactttca ggttatctaa atgggctttc cgtaaggaga 3180 agcaatcacg ctgcaggagg tttttttttc tttgcctttt aggtaactgg cctaggaaac 
3240 aaagattctg tgttttacca agataatttc ctgtgcttca tgttgtcttt attgggtttt 3300 tgattactta ggaaaactga gctttaaaag ggttaaggtt tttacatcca tgtaactttc 3360 tgtattgctt ttgaagtctt ttgattatca ctctggttaa atgaataact attatttagc 3420 agtgacctgt gattctgttt aatcaagtac tttgaacctt ttgacatctt tggcaggttt 3480 ccccaggatc aaaatcctaa attaagtctt tttgacctaa aattaacttt aggattttcc 3540 agttgggccc ctggagagca tcaaagaatt atctctcatc ttgtagagat attaaatgat 3600 taggcttatt tggtaaatca tatgggaagc attgtcaaat aagaaatggt gtttaacttc 3660 ctttaagtta catttgtgta aatgtgttat taaaatgtgt tccaaaattg catgagattt 3720 ctaaaattcc gatatgtcat gatatgtatt atcagtcatg attntgatta ttatgttaaa 3780 tgnttgtatg ccacaaaaat aactaaattt ccttgtcaat tgtgaactct catcagattt 3840 ttgaccatgg ctgttctggg tttttgtcat ccacagttat tgttttaaat tcttctctag 3900 aagcatttgc aatcagtata gtccaaaatt gctttaatca agcaaagcaa aattaattac 3960 atgaaattaa gtanttgata aggataactt tatgactttt atttaaaatg ttggttctnc 4020 atttaaattt tttttcagat tcaaggaant tttctttcat aagntattta tagtttgcaa 4080 taatttggta aagtatcctt tatgaacaaa agtggaagca tttgcttttt ctccctactt 4140 gattcctcca aaattcagaa actatttntg agtattctta ttttatttat ataagttcaa 4200 taaaaatctg ctctctcttt ataagcagga tacaattgga aacnttggtt atattgccaa 4260 ggttttgact gaaatgtcat atttaagaat gtgcataaaa tgcctggctt caagagttcc 4320 cagccttaca gtgagtgagt aaaaattgtc acttcctggc aggcccaaga accttaagac 4380 tgtaagtaaa atctaaagcc tgccttggtt tggcttccta gcctcaagag gttctaaaat 4440 ctgagattcc tatatgatca atgtggagag aaaaagttat gtttctaggg aaaacactaa 4500 agtacacctg ttattagatt gtagccctgt gcattgtttt caagtccttg ttatctgcct 4560 gtagactgga ctggatcctg aattctccta atttcctnca atatttggct acaactaaat 4620 cccgataaag tcccccggcc ctcttccccc aagcaagact agggatgctc cggggacatt 4680 caggggattt cccctnctta aanctaacca actaggggaa ttagatatta aaattggaga 4740 caaactagac ccataggata ctatggtccc cttgtctcaa agcagttgat gctgtctctt 4800 cctttgtaaa agccacagag aagatagtca cggggccacc tctcactgtc tscattccat 4860 actctgtcga ggctctcctc aattcacatc actggcagca tttgtaaaat ttgtccacaa 4920 
tacaatactg gaaaaccatt acatgcctcc atggaccact tcccattacc gaatggtccc 4980 tttgaggtat ggcaacaaga ttttattcag ctccctgcct ctcaaggata ccagtatgtg 5040 ctagttatgg tttgcatgtt ttcacattgg gttgaagcct tcccctgtcg acaggccata 5100 gccatggcag tagctaaggc cctattggaa aaattatacc aacctgggga gtctctcaag 5160 agcttcacag tgactgagga actcatttta cagggcaaat tattnaaaat gtttgtaaaa 5220 tttggcctat ttatcaacat ctccattgtg cttaccaccc ccagtcctct ggggcggtgg 5280 aacagaccaa cggaataata aaagcccaat tggcaaagat ctgtgcggta tttagcctgc 5340 catggcccga ggccctttct ttagtcntcc ttaaccctgg catgctttca acgncccctc 5400 cccggagttc tgaattcatc cagatggtcg gggtagccaa ccagcctatg atggttccta 5460 aatctctacc tattcccttc caactaggac ccctcactgg cagtcattgc ttttcgcttg 5520 tcccatcggc ccccatacac ctcctggaaa gggacttctt agaaacctgc caggcccata 5580 tttccttctc ccaaaagggg gaaataatgc ttgagttatc ctcaccagga gattttgcca 5640 cagaaacggc ttttacccaa attcccatct attcagttag ccccaacact acccaccctg 5700 ctctccaaga gctacctgag agtctttggg cacaatccaa caccgatgat aacccctcag 5760 attatgaaga tgatagttgt ggacactctc aaggatgcga tttacctgga ggggttaacc 5820 ctagtgaagg agctagatta ggaaggtttc tggggtcctg gtttggacta ggccctgctt 5880 ggaatgaata tatggtcaga aacctttccc gcactgttaa cagaattgcc cgctccaccg 5940 cccgagccat cagggcacaa cagaggtccc tagattccct tgcttatgtg gtcctagaca 6000 accacattgc tttagactat cncctcgctg cacagggtgg tgtttgtgct gtcgctaaca 6060 cttcctgctg cacctgggta aatacttcca gtcaggttga attggaaaca tctaagatcc 6120 taaagctggc caaatctctg aaagggacac cttcagaaag cctcctggct ggacttactg 6180 ggttaaattt ccaatttcca gatattttca gctggcttcc cctggtatag gattccttct 6240 gcgttccgcc ctacaagtct taatgatcct cctcatattt gggctaagca tttggctcct 6300 ctttaaaatc gttctagcct gttttaacag atgtctgcaa gagaccccca ccaggatcgt 6360 gctgacccaa caccttgaga ctttaaactc actccagccg gaaacnggaa ccaacttaac 6420 ccaagagact ttgattcaaa tttaacaggt acctgagtgc ctctcgtaag taaatggctc 6480 tagttgctca gttggccact gccctgccac naggatccct gcgcgggact agatggaccc 6540 ggagcaggta gccaaccact ctggcaccat gatgggatgc aaccaaccta ttcgatcatc 6600 agtgctgtct 
gctgacaggt tttgatnaaa gggggggaa 6639 // ID REP522 repbase; DNA; PRI; 1817 BP. XX AC . XX DT 13-SEP-2000 (Rel. 5.08, Created) DT 04-AUG-2008 (Rel. 13.07, Last updated, Version 2) XX DE Human DNA repetitive subtelomeric-like sequence (a consensus). XX KW Satellite; Simple Repeat; REP522; Repetitive sequence; KW subtelomeric sequence similarity; telomeric. XX NM REP522. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-1817 RA Roschenthaler F., Schable F.K., Thiebe R. and Zachau G.H.; RT "Of orphons and UHOs. Delimitation of the germline repertoire of RT human immunoglobulin kappa genes."; RL Biol. Chem. Hoppe-Seyler 373(4), 177-186 (1992). XX RN [2] RP 1-1817 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR [1] (Consensus) XX CC CC [2] (follows L1 fragment; includes palindromic 'MER122'). XX SQ Sequence 1817 BP; 259 A; 606 C; 523 G; 396 T; 33 other; atctacccaa aaactttttc ccccatcatt ntttccccgc cttcttttcc cgaccgcctt 60 cggccccctc cccctcgcca ccctctttct tcctccatct accccaaaac tttttcccca 120 ccatttttcc cccaccgtct tttcgcaaag ccttctctgc tcctcaaaac cttttcccca 180 ccgctttccc cctccctggc caccctcttt ccccctcccg ctctcgtcac cctcttttnc 240 ccctccatct acccaaaaac nttttcccca ccgtcttttc caaagccttc tccccactcc 300 tgctgctcac caccctcttt tccccctcca tctacccaaa aactttttcc ccccaccctc 360 tttccgcaaa accttctccc gctctccctc ctttctccct gctcgccacc ctctttcccc 420 ctccatctac ccnaaaactt ttttccccat cttctcctcg ctgccttttc gcaacgcctt 480 ctccgctcgc cactgccctc tttccccttg ncgctaacca ccctctttac tcccctccat 540 ctatcccgaa actattttcc ccctcctacc gctccagcca cgctgcngtc tccgtcgccg 600 ccaccaaccg cagcgaggcg agccgtggtg ccgcaggctc cagcctccag natgcggcng 660 gtggctnccc ttccggtctc ctctaagccg ggcacggagc agctcngcgg gcagacacag 720 aagaacctgg aacggcctga cnccccctca gcatcattta tatactgagg ttatgcanat 780 gaggttcctg gactacatgt tctgattgga tgagagaaaa gcctcnaggc ctactctgat 840 tggactttgt tatcatgttc 
tgattggatg agagcaagtc ttaggacaac caatcagagc 900 atgaaaataa agtccaatca gagtaggcct agaggttttc tctcatccaa tcagaacatg 960 tagtccagga acccacttgc ataacctcgt atataaagca tgctgaggng gcgtcaggcc 1020 attccaggct ctcctgtgtc tgccngccga gctgctctgt tcccagctta gaggacnagg 1080 agaggggaac cgccgcctgc tggaggctgg aggctggagc ctgcggcacc gtggctcgcc 1140 tcgctgcggt tggtggtggc gacggagacg gcagcgtggc cagagcggta ggagggcggc 1200 cngcggcggg agcttgnccn gcggcaggag gaggagggga gggccgcact gcccacggct 1260 ggaggctgga gcctgcgcca ccgcggctgc gctcgctgcg gttggtggtg gcgncggaga 1320 ctgcaggccg gccagagtgg tagaagggcg tggggtaggt gcgctatccg gggctgcact 1380 gcccgcggcn gggggcnggt tgggggcgct atccgaggcg gcactgcctg cgtcgggtgg 1440 cactggttgg gngcgctntc tggggctgca ctgcctgcgg ggcggtgggg gncgggttgg 1500 gtgcgctatc cggggctgca ctgcccgtgg cggggggcng gttgggggcg ctatcccaga 1560 ctgtactgct ggcggcagtg gggcgggtta ggggcgctat ccggggctgc actgcccgcg 1620 gcggggggcg ggttgggtgt gctatccggg gctgcantgc cggcggcggg gggnggttta 1680 ggggcgctat ngggtgctgc actgcccgtg gtgcggggag gcggggcggc ttgggtgtgn 1740 tgggtgcgct gtngcggggg ggcgacactg ctggtggcag cggncggggc gggttggggg 1800 cgctgtcaag ngctgca 1817 // ID Garnel1 repbase; DNA; PRI; 94 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.12, Created) DT 06-APR-2010 (Rel. 15.05, Last updated, Version 3) XX DE SINE element - consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Garnel1. XX NM Garnel1. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-94 RA Jurka J.; RT "SINE elements from the bushbaby genome."; RL Repbase Reports 9(12), 3114-3114 (2009). XX DR [1] (Consensus) XX CC The top youngest sequences are >93% identical to consensus. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project. 
XX SQ Sequence 94 BP; 25 A; 27 C; 32 G; 10 T; 0 other; gggcggcgcc tgtggctcaa aggagtaggg cgccggcccc atatgccgga ggtggcgggt 60 tcaaacccag ccccggccaa aaactgaaaa aaaa 94 // ID HAL1-1C_Cja repbase; DNA; PRI; 2327 BP. XX AC . XX DT 13-JUL-2010 (Rel. 15.07, Created) DT 04-NOV-2010 (Rel. 15.12, Last updated, Version 2) XX DE HAL1 non-LTR retrotransposon: consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW HAL1; HAL1-1C_Cja. XX NM HAL1-1C_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-2327 RA Bao W. and Jurka J.; RT "HAL1 non-LTR retrotransposons from marmoset."; RL Direct Submission to Repbase Update (13-JUL-2010). XX RN [2] RP 1-2327 RA Bao W. and Jurka J.; RT "Origin and evolution of LINE-1 derived "half-L1" RT retrotransposons (HAL1)."; RL Gene 465(1-2), 9-16 (2010)doi:10.1016/j.gene.2010.06.005. 
XX DR [1] (Consensus) XX SQ Sequence 2327 BP; 842 A; 543 C; 433 G; 509 T; 0 other; gagaacgatc caagatggcc gatcgctaac atcccgggat tgcagctctc agggaaggcg 60 cggagaacta gaggacgcca cactttcaga caaagtctgg tcgctcacgg agcagaagat 120 cccccagtgg tggaaacaca cgggtcgcca gcgcgactct cgtggtcggc gcagcggttc 180 cgccggcacc tcggcgcggc agctctcggc gcagagtaaa cgggaccggt tccccttctg 240 accgaggttt ggagccccgg gaaggcagag tcgcctacta cggaaacaag aaggaagccc 300 gacaggagaa tcctgggcag aaaagcacca tcagttttaa cgccgctgct ctggccctgg 360 gaactaacaa cctggacgtc cactcaagag acctaatctg aaagttggta atttcaaaga 420 cgacaggagg ataaatttac aatgacggga agaaaccagc gtaaaaaagc tgagaatact 480 caaagtcaga acgcctctcc ctctaaagat gatcacagtt ccacatcaac aatggaacaa 540 ggcttgatgg agaacgagcg cctcctgatg acagaatcac tcttcaagga atggataata 600 acaaacttcg gtgagttaaa agaacatgtt gtagcccaac gtaaagaaac taggaacttt 660 gaaaaaaggt ttgatgaaat cctattgaga atagacaact tagagaggag tatgagtgaa 720 ttaatggaac tgaagaatac aatacaggaa ctccgagaag tatgcacagg tttaaacact 780 cgaattgttc aagcagaaga agggatatca gaggtcaaag tccaacttaa tgaaataaaa 840 cgtgaagaaa agattagaga aaaaaggata aaaaggaatg agcaaagtct ccaagaaatg 900 tgggactatg tgaaaagacc aaatttacgt ttgataggtg tacctgaatg cgacggagag 960 aatgaatcca agctggaaaa tacccttcag gatattattc aggaaaattt tcctaaacta 1020 gcaaagcagg tcaacattca accccaggta atacagagaa caccacaaag atattcctca 1080 agaagagcaa ccccaaggca cataatcgtt agattcacca gggttgaaac gaaggagagg 1140 atactaaggg cagccagaga gaaaggtcag attacccaca aaggcaagcc tatcagactt 1200 acagcagatc tctcggcaga aactctacag gccagaagag agtgggggcc aatattcaac 1260 atcctcaaag aacagaacct tcagcccaga atttcatatc cagccaaact aagcttcaca 1320 actgaaggaa aaataaaatc ttttatgaac aagcaagaac tcagagattt tattaccacc 1380 aggcctgctt tacaagagct tctgaaagaa gcattacaca cagaaagaaa caaccagtat 1440 tagcctttct aaaaatacac caaaaagtaa agagcaccaa cataaagaag aatttacacc 1500 aacaaatgga taaaacagcc agtcaacatc aaatggcagt aaccctaaat ttaaattgac 1560 taaattccca atcaaaagac acagccaaaa cccaacggca tgttacatcc agacctgttt 1620 cacatgcaag 
gatacacaaa gactcaaaac aaagggatgg agaaagattt accaaccaaa 1680 tggagagcaa aaataaataa taaataaata aaaagcagga gttgcaattc ttgtatcgga 1740 taaaatagat tttaaagcaa caaagatata gtggtaaaag gatcaatgca acaacaagag 1800 ctaacgatcc taacacccag ataggagact tagattcaat gagacagaaa attaataagg 1860 atatcaagga ctcgaactca gatccagaac aagtaaactt aataaatatt tatagagctc 1920 tccacttcaa atacacaaaa tatacattct tgtcaatacc acatcacacc tacccattag 1980 tttaaatgaa acattgattg gccattatta atacccaatt tttttcaaaa taaagcaata 2040 tttccattta ctctccctct ttctcttcct ctttcttcct ctcctttact tatttttttt 2100 ttttctttcc ttctctcaaa aaaaaagaaa tcaacttgta aacctctaga tccaggtcgg 2160 caatgtctct ttcattgctt gatttccttt cttcccttcc ctccctccct ccctccctcc 2220 ccgcttcctc ccttccttcc ttccttcatc ccttccttcc tccctaccgt ccttcttccc 2280 ttccttcctg ccttcctccc ccccccaaaa aaaaaaaaaa aaaaaaa 2327 // ID LTR77c_TS repbase; DNA; PRI; 940 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW Endogenous Retrovirus; Transposable Element; LTR77c_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-940 RA Bao W. and Jurka J.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 11(5), 1634-1634 (2011). XX DR [1] (Consensus) XX CC Elements are ~79% identical to the consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 940 BP; 195 A; 309 C; 274 G; 162 T; 0 other; tgagacagga gaaagctaga cacctgatag gcagacagta gggcgggtcc cggtagctgc 60 aagccccacg gactggagtg aacacaccca cttccaggtc cgtggacttc atctttcgcg 120 cactttttcc tgattggccc ttcccaaccc tctcccgatt ggtaaattta ctaagaccag 180 accatggcgc ctctctctga ttggtgctgt tgccaaccaa tcagcaggct ctggagacta 240 tataaactcc tgcctgcaga ggcaacccgg caaccctttc gggacccctc ctgctgccgg 300 gagcttttct gtcctctaat aaattcctac ttactcaccc tccggttgtc cgcgtccctc 360 attcttcctg ggcgcgagac aagaacccgg actcgaagag cgctgctgaa gaccgccaga 420 cttgaagacc agatctgaag accgccggac ttgaagagct gcacacgccg gaggctggct 480 tgccagccga cggagccggc agagctggca gagcgggcgg agccagcgaa gccgacggcc 540 ggagcggagc tgacgcggcg gagctgatgg caggagctga caggggcaga gctgaggggc 600 ggagctgacg agcagagctg atggggccca gggctggcgg ccgccgaacc aacagagctg 660 tagcacttct cggggcctgg cttgccggcc caccgagccg acagagctgt gacacttctc 720 ggggctcggc tcaccgaccc cccacccccc caatcaagag agctgcaaca ctctttgggg 780 accccgcagc tgctggcatc cccgagctct cgggtgccac cgcgttcccc atctggatgc 840 catcatcccc acgtggatcc cgggctggca gtacagctcc gggaactgca acatccctgg 900 gggcccagct cgccagccga tcggataaaa aactgtatca 940 // ID Tigger4a repbase; DNA; PRI; 236 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE Mariner DNA transposon from primates. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW Tigger4a; ZOMBI_A. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-236 RA Smit A.F.; RT "Tigger4a - Mariner DNA transposon from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC (MER46A) 14% div. 
XX SQ Sequence 236 BP; 78 A; 55 C; 47 G; 56 T; 0 other; caggttgagc atccctaatc cgaaaatccg aaatccgaaa tgctccaaaa tccgaaactt 60 tttgagcgcc gacatgacgc tcaaaggaaa tgctcattgg agcatttcgg atttcggatt 120 ttcggattag ggatgctcaa ccggtaagta taatgcaaat attccaaaat ccgaaaaaat 180 ccgaaatccg aaacacttct ggtcccaagc atttcggata agggatactc aacctg 236 // ID LTR22B_TS repbase; DNA; PRI; 646 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV3-type endogenous retrovirus - DE consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR22B_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-646 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1274-1274 (2010). XX DR [1] (Consensus) XX CC >95% identical to consensus. 5bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 646 BP; 157 A; 166 C; 132 G; 191 T; 0 other; tgagagaccc aagagattaa aaaaaaaact caaaaagaac tattactata tgaattccta 60 ccccacgttc actgtatttg tttgcagtgt ttgtacttag aactttgtta agtttaattt 120 ttgtttgtcc cataaaataa ccgattgctt ctgcaactgc tgctagcagt gtgtgaatgt 180 taagcaataa gcccctcccg gacattcctt tgcttagata acggtagact ggtggcgcca 240 gccacccccc ccccccgtgg gttctgcctt gtttcaaacc tcctgcctgc ttgctaaacc 300 tagcttatct ggcattccat tccgcccgtt tggcagttac ttgtccatca caagtaacta 360 agcctgagcc cgcgcaaatc gaactcgaat caaccaatca aagtctttgt aactgcgtca 420 tatagttttt atccaatcct gtgtgcacaa gtttgtaaag ccaccccacc ccttcagaat 480 tatgtataaa aggtgcgtgt tagcctgggt cggggctctc gtccctgaag ctgctgcgtc 540 ggcttcaaac gtgagcccag actcgagcct gaataaagac cctcgtgtgt ttgcatcgga 600 gctggctcct tggtggtctc tcggattttg aattttgggt ttaaca 646 // ID pSIVgml-I repbase; DNA; PRI; 6583 BP. XX AC . XX DT 07-JAN-2011 (Rel. 16.02, Created) DT 07-JAN-2011 (Rel. 16.02, Last updated, Version 2) XX DE Internal portion of pSIVgml - consensus. XX KW Endogenous Retrovirus; Transposable Element; lentivirus; KW pSIVgml-LTR; pSIVgml-I. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-6583 RA Gifford R.J., Katzourakis A., Tristem M., Pybus O.G., Winters M. RA and Shafer R.W.; RT "A transitional endogenous lentivirus from the genome of a basal RT primate and implications for lentivirus evolution."; RL Proc Natl Acad Sci U S A 105(51), 20362-20367 (2008). XX DR [1] (Consensus) XX CC It is an endogenous lentivirus. 
XX FH Key Location/Qualifiers FT CDS 4717..6555 FT /product="pSIVgml-I_3p" FT /translation="CFYQYFSPLEEEVDPWDSSLGLFTDWVSGAHMQWLTQ FT RAQEWRGYCQPMNCTXANNFTRNCTRPYVDYESRPENIQETISHMQLNCTN FT STCVWKECKQRLFFRGNPPLDAQTFRLCVRPPFALRRCPPTNRTDWRQPYK FT CSEQCLTSCTEAVNITVETLWQSQGVLNPNQTGVSCYQEGMRVTVQTEHDP FT IGIKVLQTVKIPKMTCNLTGAQNNSGQKGIVDPCYFLCYNATKKGRGGNGN FT PIVLISCKYNGTSGTLTNCERVFKVSMPGPQDPLYYPTYPGEKWLLHLPME FT ETGDPVQCNASFQWLSRSVALHDTGVLKPNIYSSFGAEEAWRDMVENYIVY FT RFQEWTVVPVQEGEIILSRPRREPVTIIMALISLFTQVRTXINVAAMHWAG FT TVVTAVGVLVDNEGLLWESQRRLGNLVDHLAFQVQVLQARLQAMEYVLQVQ FT ESWSELGCVPSLCLTDIPWNDTWGVPNITQTWKEWDDQFWKNYGEARELWH FT RQFLEGRGKLAKIRKDLAQTALAQKVKDALNSIPKWLWTGLAVIIVLIIII FT QVACMGLKMLTKAIVSWAGFLALQPEAPRDVEPDAAIYKNNGYANGWRDYW FT QLWSGGTLFQMTRRP" FT CDS join(2506..3201,3205..4287,4257..4592) FT /product="pSIVgml-I_2p" FT /translation="PVDLYVKAAIKIGRETLIRLGVVPILTVPVAKDIWDS FT FAWTAGEVAWFPEVRHQAPPLIVERWMKMVSEPIKGARTWYIDGSKKRAQK FT ARAGIWTEGEKAQVQELEGSNQKAELAALLYALQQEDQELNIITDSQYVMK FT VLRLVPWVSDSPLVQSIIQAVEKKQAIYLDWVPGHKGIPGNHKIDEEIQYW FT QGLVIQGTGILPKREEDVGYDLQIPEDVYLQGLERRSVPLNLVQWEKDQWG FT LIVAKSSMAQMGVIPLGGVIDSGYRGPIIIILWNLNRKAVLLKAGKRVAQL FT VIMSLLHEELQQVQQVKIDTARGEGAFGSTGTYFLEAIPRAESDHELWHSG FT VKALMQDFGISQMVAKAIVHKCPNCQGKGSAITGVVDYTPGTWQMDVTHWE FT GHKLLVAVETASGLTWAKTIPDETAKTTLLATLELHSVFKVSHLHTDNGLN FT FTAERFTNALAWLGIKHSTGIPYNSHSQGVVESTNKLLKEMLHKIRPKMET FT VHAAVYMALFVINFKQRGGVGGTTRYERHLDMGLEDLQNYHFKNLDSYHVY FT FKQPPQKTWQGPARLLYKGQGAVVCEDQGKTIAVPRRYCKIITGGETLLQD FT HNRRGIAYRMQECWTDSAFRVFQQQGPLLPTLRDWSEWREASLQLLPYMAP FT VTAEERDWFDLLAYALPKVPLSVLIPIFRGPQEKWRVSLQRWLWYTHFARG FT HWEHN" FT CDS join(30..731,744..1121,1031..2107) FT /product="pSIVgml-I_1p" FT /translation="FLQTVSLXELQGKMEKGKMGDGQCLGRDTXKKAQRVR FT VRGTGKPMHKLGNFVWAVKIAAAVAERSIDKTAVGVFRRWPPKGGAGDINV FT ALDTLLCYGLQRKGSQWKVQRVPQMWQRWVGWQEQLYGKKEQDPEAEAAAY FT PVVNRGQGWAYEPMSTRTVAAWIRQTREKGLTSPETITYWGLISQDLSSRE FT QVQLLEVIPGLQADKDMLGAYLGERAREWDAQPQQPLPYTFCSYQGIRPFA FT 
ISAQGREAAQVFRAWITQGLMSLAQLQAPHPGSTKILQGPKEPYGEFINRL FT FLQINQEGAPEEVKTYLKGHLSIENANADCQKAMSHLRLEMTPVQPSALEP FT PQGGEMGLYPTLKELLNARDDTSAAFCTGTSPRGRDGVVPHPQGIIKLDHR FT PLVTLKVGRQSVTLLIDTGADNTIIHPKDWKPVGMEEGIINIGGIGGSQKG FT ILYKQVPISLADRQIRRTVIRAVTPINLLGRDNLVSLGIGVVMLMAQMSVK FT IVPLPVELMPGCDGPRVKQWPLTQEKYQALAEIVSKLEKEGKVSIAEVSNP FT YNTPVFAIKKKSGKWRMLIDFRVLNARTKKGAEFQLGLPHPAGLQKKDNVT FT ILDIGDAYFTIPSDPTFKKYTAFTLIPPNNQGPAKRFVFNCLPQGWVCSPA FT FYQRTMSDILQLWKQAHLEVMLYVYMDDLLIGTDLPLREHRRLVQELRSML FT LGWSFETPEENVQDQWPLQ" XX SQ Sequence 6583 BP; 1991 A; 1301 C; 1724 G; 1563 T; 4 other; gctggcgccc aacgtggggc tggacttgat ttctgcaaac ggtgagtttg kgggagttgc 60 agggcaagat ggaaaagggg aagatgggtg atggacagtg tctagggaga gacacggkga 120 aaaaggcaca gcgggtaaga gtgagaggta ccggaaagcc catgcacaag ttagggaact 180 ttgtctgggc agtaaagata gctgcagcag tcgcagaaag atctatcgac aaaacagctg 240 tgggggtttt taggcgctgg ccaccaaagg ggggggcagg ggacattaat gtagcactag 300 acaccctgct atgttatggg ttacagcgaa agggttcaca gtggaaagta cagagggtgc 360 cgcagatgtg gcaaaggtgg gtaggatggc aagaacagct ctatgggaag aaagagcagg 420 atccggaggc agaagcagct gcctacccgg ttgtaaatag gggacagggg tgggcttatg 480 agcctatgag taccagaact gtcgcagcat ggattcgaca gactagagag aagggactta 540 ctagtccaga gacaatcaca tattggggtt taatatccca agatctgtcc agcagggaac 600 aggtccaact gctggaagtc attccaggac ttcaggcaga caaggatatg ctgggggcat 660 atctaggaga aagggcacgt gagtgggatg cgcaaccaca acagccattg ccctatactt 720 tctgctcata ttaggggatt tgacagggga tcaggccttt tgccatatca gcgcaagggc 780 gagaagctgc acaggtcttt agggcctgga taacgcaggg cttaatgagc ttggcccagt 840 tgcaggcgcc acacccaggg tcaacaaaga tcttgcaggg accaaaagag ccttatgggg 900 aatttattaa tagactgttt ctccagatta atcaggaagg agccccagag gaagtaaaga 960 catatcttaa ggggcatctc agcattgaaa acgctaatgc agattgccaa aaggccatga 1020 gtcatcttag gctagagatg acaccagtgc agccttctgc actggaacct ccccaagggg 1080 gagagatggg gttgtacccc accctcaagg aattattaaa ttagatcacc ggcccctagt 1140 gactctgaag gtagggagac aatcggtcac cctcctgata gacaccgggg ctgacaatac 1200 aattatccat 
cctaaagatt ggaaaccagt aggaatggaa gaggggataa ttaacatagg 1260 gggaattgga ggttctcaaa aggggatatt atacaaacaa gtacctatta gtctagcaga 1320 taggcagata cggagaactg tcataagggc agtgacccct ataaatttac tagggaggga 1380 caatttagta tcactaggaa ttggagtagt gatgctaatg gcacaaatgt cagtaaaaat 1440 agtgccgctg ccagttgagt taatgcctgg ctgtgatggg ccaagagtaa aacagtggcc 1500 ccttacgcaa gagaaatatc aggctcttgc tgaaatagta tctaaattag aaaaagaggg 1560 aaaagtcagt atagcagagg taagtaatcc ctacaacact ccggtgtttg ccattaagaa 1620 aaaatcaggc aaatggagaa tgctcattga ctttcgagtg ctaaatgctc gaaccaaaaa 1680 gggagctgaa tttcaactgg gcttgcctca ccccgccggc ctacaaaaaa aagataatgt 1740 caccatacta gatattggtg atgcttattt taccatacca tcggacccca catttaaaaa 1800 gtacactgcc tttactctaa taccacctaa taatcaggga ccagccaaga gatttgtgtt 1860 taattgtcta ccacaagggt gggtgtgtag tccagccttt taccaaagga ctatgagtga 1920 catcttacaa ctatggaaac aagctcatct tgaggtcatg ttatatgtct acatggatga 1980 ccttctaatc gggacagacc tccccttgag ggagcataga aggctggtcc aagagcttag 2040 gagtatgctt cttggctgga gctttgaaac tcctgaagaa aacgtgcagg accagtggcc 2100 gctacagtag atgggatatg agttacaccc taataattgg cagttgcagg tccgaaaatt 2160 agagttacca gatcacccca ctttaaatga agtcccaaaa ctggtgggaa ttattaattg 2220 ggctagtcaa attatttcag ggcttaaaat aaagaagctt actgctatga tggcagggaa 2280 tcaggatctc aatagaaaga tagaatggac taaggaagct agaaaggaag ctgaagaggc 2340 agctaagctg ctctaggagc tcccagcagg ggggtatgtc gaccccttga aacaggtgga 2400 ggctagaata gcttttgtag gtttcctcga ggtaacctac gacgtccatc aggagaatat 2460 catcctttgg tgtggcagag tagggtccag caaagcttat tgtaaccggt ggatctgtat 2520 gtaaaagcag caataaaaat aggcagggaa accttgatta ggctaggagt ggtccccata 2580 ctcactgtcc cagtagcaaa agatatctgg gatagttttg catggactgc aggggaggta 2640 gcatggttcc cagaagtaag acatcaggca cctcctctaa tagttgagag atggatgaaa 2700 atggtatcag aacccattaa gggggcaaga acttggtata tcgatggatc taagaaacgg 2760 gcccaaaaag ctagagcagg aatttggaca gagggagaga aggcacaagt acaggaactg 2820 gagggctcaa atcaaaaagc agaattggca gccttattgt atgccttaca gcaggaagac 2880 caagaattaa acattatcac 
tgattctcaa tatgtaatga aagtgctgcg actcgtgcca 2940 tgggttagcg attctccctt ggtgcagagc atcatacaag cagtagagaa aaaacaggct 3000 atctatttag attgggtgcc aggtcataag ggaatcccag gaaatcataa aattgatgaa 3060 gaaattcaat attggcaagg tttggttatc caaggcacag gtatccttcc taaaagagaa 3120 gaggatgtag gctatgattt acaaattcca gaagatgtgt acctgcaggg cttggaaagg 3180 cggtccgttc cgttgaactt gtgagttcaa tgggaaaaag accaatgggg gttgattgtg 3240 gcaaagtcct ctatggctca gatgggggtg attcctttag gtggagtcat agattctggg 3300 tatagaggac ccatcatcat catcctatgg aatcttaata gaaaggcagt actccttaaa 3360 gccggaaaaa gagtggctca actagttata atgtctctac ttcatgagga gttgcaacaa 3420 gttcagcagg tcaaaattga cacggcccga ggtgaaggag catttggttc cactggaacc 3480 tatttcttgg aggccatccc tagagcagaa agtgatcatg aactatggca ctcgggggtt 3540 aaagctctca tgcaggattt tggaatatct caaatggtgg ctaaagccat cgtgcataaa 3600 tgtcctaatt gccaagggaa agggtctgcc attacagggg tggtggatta caccccgggg 3660 acatggcaga tggatgttac ccactgggaa ggacataaac tgttagtagc agttgagact 3720 gcttctgggt taacatgggc taaaactatc cctgatgaaa cagccaaaac cactttgttg 3780 gctacattag aactgcacag tgttttcaaa gtgagtcatt tacatacaga taatgggctt 3840 aatttcactg ctgaaagatt tactaatgct cttgcctggt taggcattaa gcactccaca 3900 ggcatcccct ataattctca ctctcaaggg gtagtggaat ctaccaataa gttgttgaaa 3960 gaaatgctcc acaaaattag acccaaaatg gagacagttc acgcggctgt ctatatggct 4020 ttatttgtca ttaattttaa acaaaggggt ggagtgggag gtacaactag atatgaaaga 4080 catttagaca tgggattgga agacttacaa aattaccatt tcaaaaattt ggactcgtac 4140 catgtttact ttaaacagcc acctcaaaaa acctggcagg gaccagctcg tctcctttat 4200 aaggggcagg gagcagtggt ctgcgaggat caaggaaaga caatagcagt acctagacgc 4260 tactgcaaga tcataacagg aggggaatag catacagaat gcaggaatgt tggactgact 4320 ctgctttccg agtgtttcaa cagcagggac cgttgctgcc gaccctgagg gattggagtg 4380 agtggagaga ggcctctctg caactactgc catatatggc accagtaaca gcggaagaaa 4440 gagactggtt cgaccttcta gcttatgctt tgcctaaagt acccctttct gtcttaatac 4500 ctattttcag ggggccgcaa gaaaaatggc gggtaagctt gcaacgctgg ctttggtata 4560 ctcactttgc gcgggggcac tgggaacaca 
attagcattg cttaaaaccc cacccattgt 4620 tagacttctt accaataata cagaaccccc tatagtgttc tgtgagtctg agcaggggca 4680 cttgggatgt gctccagctc tattttcgta tgttaatgct tctatcaata tttctcaccc 4740 ttagaggaag aggttgatcc atgggatagc tctttgggat tgttcacaga ctgggtatcc 4800 ggagcccata tgcagtggct gactcaaaga gcccaggaat ggaggggata ttgccaacca 4860 atgaactgta cgcrggctaa taactttact agaaattgta ccaggcctta tgtggattat 4920 gaaagtagac ctgaaaacat tcaggagaca atttcacata tgcagttaaa ttgtactaat 4980 tcaacctgtg tgtggaaaga gtgtaaacaa agattgttct tccggggtaa cccacctctt 5040 gatgcccaaa cctttagact ttgtgttaga ccaccttttg ctttaagaag atgtccacca 5100 accaatagga cggactggag gcaaccttat aagtgctctg agcaatgcct aacttcctgt 5160 acagaggcag taaatataac tgtcgaaact ttgtggcaat cacagggagt gttaaaccca 5220 aatcaaacag gggtgagctg ttaccaagag ggaatgaggg taacagtcca aactgagcac 5280 gaccccattg ggataaaggt cttgcagact gtgaaaatac caaaaatgac ctgtaatctt 5340 acaggagctc aaaataacag tggtcaaaag ggcatagttg acccctgtta tttcctttgc 5400 tataatgcca caaagaaagg aagaggaggc aacggcaatc ccatagtgct tatctcttgt 5460 aagtacaatg ggacgtctgg gacgttaacc aattgtgaaa gagtttttaa agtgtcaatg 5520 ccagggccac aagatcccct atattatcca acctatcctg gggaaaagtg gttactgcat 5580 ctcccaatgg aggaaacagg ggatccagta cagtgtaatg cctctttcca gtggctatca 5640 aggagtgtgg cgttacacga cactggggta ctgaagccca atatctatag ctcctttggg 5700 gcagaggaag catggagaga tatggtagaa aactacatag tgtatcgttt ccaggaatgg 5760 acagtggtac ctgttcagga gggagaaata atattatctc gaccaaggag ggaaccagtg 5820 acaataatca tggctctgat atcgctgttc actcaggtta gaacagstat aaatgtggcc 5880 gcaatgcact gggcgggaac tgtggtgacc gcagttgggg tacttgtgga caacgagggg 5940 ctcctatggg agtcacagag gcgcttaggc aacttagtgg atcacctagc gttccaggta 6000 caggtgttac aagcgagatt gcaggctatg gagtatgtgt tgcaggttca agaatcttgg 6060 tctgaattag gctgtgttcc ttcactgtgc cttacggaca tcccttggaa cgatacatgg 6120 ggagttccca atataaccca gacttggaag gagtgggatg atcaattttg gaagaattat 6180 ggtgaggcta gagagctatg gcatagacag tttttagagg gacgaggtaa attggcaaaa 6240 atccgcaaag atttagctca aactgctcta gcacaaaaag 
tcaaggatgc tctcaattca 6300 attcctaaat ggttgtggac tggcttagca gtaataattg tactcattat cataattcaa 6360 gttgcttgca tgggacttaa aatgcttact aaggcgattg taagctgggc aggctttctt 6420 gcattgcagc cagaggcgcc tcgcgacgta gagcccgacg ccgccattta caaaaacaac 6480 ggctacgcaa atggatggcg cgactactgg caactctgga gcggtgggac ccttttccag 6540 atgactcgcc ggccttaaaa gaaaaagaaa agaaagggtg acc 6583 // ID BSRa repbase; DNA; PRI; 142 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE Satellite from primates. XX KW Satellite; Simple Repeat; BSRa; BSRb. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-142 RA Smit A.F.; RT "BSRa - Satellite from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX SQ Sequence 142 BP; 30 A; 44 C; 46 G; 22 T; 0 other; gctgggccca gcgatatgtc acaatgcccc ctgtgggcag ggcccaggca gaagagtcac 60 atcacctggg tgctgggccc agcgatatgt cacaatgccc cctgtgggca gggcccaggc 120 agaagagtca catcacctgg gt 142 // ID MacERV6_LTR4 repbase; DNA; PRI; 459 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MacERV6_LTR4. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-459 RA Smit A.F.; RT "MacERV6_LTR4 - ERV1 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC despite 5bp TSD with Bias for GCGAC! 6%. 
XX SQ Sequence 459 BP; 98 A; 161 C; 114 G; 86 T; 0 other; tgtctggacg gagggaggag ggaaacaaag aacaaaaggg actaagatgg cgtatttccg 60 ggttcttcat caccaacttt cccgcgcccg gggaaagaca caggtcaact gcgcaggcgc 120 aacctgacgt ccgaccgagg aaaccgaaac ctacctggcc gcgcctaccg cacggccccc 180 gacccgccca tgtccggcct actgccctcc cactcccagg cccaagacat aaagccgctc 240 cgggcagacg cgcggcgcga acttcctcgg cccctcctca tatgcggacc caggaacttc 300 gcccgagaac gccggagcga cttcctcggc ctccaccgcc ggagaccggt gaacttcgcc 360 ctttcttctt tcacgttggc tagctaataa agtttctttt taccttgcct acttgccttt 420 tctctggcgc ctgctctggt ggtcgcacaa aacaaatca 459 // ID hAT-2N2_TS repbase; DNA; PRI; 506 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.1, Created) DT 14-DEC-2009 (Rel. 15.07, Last updated, Version 3) XX DE hAT-2N2_TS is a family of non-autonomous DNA elements found in DE Tarsius syrichta mobilized by hAT-2_TS. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-2_TS; hAT-2N2_TS. XX NM hAT-2N2_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-506 RA Novick P.A., Smith J., Ray D. and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). 
XX DR [1] (Consensus) XX SQ Sequence 506 BP; 79 A; 135 C; 130 G; 158 T; 4 other; caggggtcct caaactacgg cccgcgggcc acatgcggcc gccgaggaca tttatccggc 60 ccaccgggtg tttttgccgc cgctgcctgt cctgcctagc agccgactcg tccaggcccg 120 cagtgcgcat gtgtggaatg tgcgtcgcac tctccgactc ccctccttct ctctgtctct 180 cgactcctcc tctccgtctc gggtgtgatc ggacgagtca cgagcttgcc tgtgcagagc 240 ctgctgctgc ctgaggaccg aggtaagaac aagttaggat ttdttttttt ttgaagttag 300 gaggtctatt tttttttttt taaattttgc arttagtagg gccttttttt ttcggttaag 360 gggggccttt ttttccctga agttaggagg tctwtttttt wttttttgca gatagggggc 420 gccttttttt taaactatag tccgcccctc caacggtctg agggacagtg aactggcccc 480 ctgttttaaa agtttgagga cccctc 506 // ID ERV1-Mim_I repbase; DNA; PRI; 5417 BP. XX AC . XX DT 31-OCT-2009 (Rel. 14.11, Created) DT 31-OCT-2009 (Rel. 15.11, Last updated, Version 3) XX DE Endogenous retrovirus-like element: consensus of the internal DE portion. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-Mim_I; ERV1-Mim_LTR. XX NM ERV1-Mim_I. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-5417 RA Jurka J.; RT "Endogenous retroviral elements from the mouse lemur."; RL Repbase Reports 9(11), 2821-2821 (2009). XX DR [1] (Consensus) XX CC Top sequences are >99% identical to consensus. ORFs corrupted by CC mutations. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. 
XX FH Key Location/Qualifiers FT CDS 446..1405 FT /product="ERV1-Mim_I_1p" FT /translation="MGQGKSTPLSLTLTHWKEVRDRAHDLSVIIRKGPWQT FT FCASEWPSFGVGWPSDETFNLSVISAVKKKIFAPHPQGHLNQGPYIIVWQD FT LTQNPPSWVKMPVPQPSLALVAQTAPSDPARRPLPLTDPQAESLLLESPPP FT YVPQSQAQAARPGPGTAQPTAPMVEGPAQGTRSRRGLSPDSIVACPLRPVP FT VPNSGPGSDDDTAPLQLLQYWPFSTADLYNWKSQNSNFSQNLGDLINLLDS FT VLFTHQPTWDDCQQLLKVLFTTEERERIQGEARKLVPGRTADLLQTREPST FT ELFPLNGRFGTAMRLKVGSVSGSTARL*" FT CDS 2315..4558 FT /product="ERV1-Mim_I_2p" FT /translation="MEDAFQALRAAMLEAPALALPDPTKEFHLFIDEKRGN FT SQGGTHAGPGTLEETGGVLVKKKLDPVAAGWPACLRIIAATALLVKDADKL FT TLGQRLQITTPHAIEGVLKQPPGRWITNARLTHYQGLLLDAPRIEFRAPAA FT LNPATLLPTPAAAAPERDCLEILAETQTTRRDLKDRPLPGSDLTWFTDGSS FT FIRDGRRYAGAAIVDDQGKLVWAARLPQGTSAQKAELIALTEALSRARGKR FT LTVYTDSRXAFGTVHIHRALYRERGFITAEGKEIKHMPEILHLLEAVLLPK FT AVAVVHTPGHQSGDSVEARGNRRADAEAKKAAEDTPQPNILHLSLPPPGMG FT RLPPLPDYSSVDEVWASEQLREDGWLRDEGHRLIVPELLRRHLLKHLHQTT FT HLGKRKMLQLLDTAQLRFKSQGRVPDDIVKSCRACQVMQPGRTRGTHAGTR FT ERGRQPGLFWEVDFTEVKPRKYGYRYLLVMVDTFSRWVEAFPTKGETALTV FT AKKILEEIVPRYGLPEGIRSNNRPAFVCQVSQGLARAMGIDWKLHCAYNPQ FT SSGQVELINRTIKETLTKLALETGGDWVTLLPFALFRARNTPYHLGLTPFE FT IMYGTPPPVVPRMSPEALPGHPKEVLQAVQALQRVHSQVWPALRAIYRPEE FT EGAGHPFHPHQPGDWVWVKRHNSRTLEPRWKGPYQIILVTPTALKVDRIVA FT TWLHHSHVRPANKKDQERCQDQWKATADPKNPLKTRLTRVAATAQDR*" XX SQ Sequence 5417 BP; 1358 A; 1384 C; 1542 G; 1128 T; 5 other; tttgggggct cgtccgggat nggagaggcg acccctctta aaaagaccgg tggcaccagg 60 cttggaggta ggccgactcc ggagtcgtcc tttatgtttg tctctgtgtt gtatgtgatg 120 tgtcagtttt gaagtttgaa tttggaacga cagggtacta actccgtgag gaggaagttc 180 ctctggacca agatagggtt cttggcggcc agaaaaactg tcgccctgga ggacgctcca 240 gggattaggg gtgagtgacc aagtggcctt ggtcgactcc atttgtttgc ctgcaagttg 300 ttgcaggatt gggcaaggta accctagggt aaggtatcgt cctttgtctg tgtttgtgtg 360 caaaagcaag tggcttcggt atttgtctac tgtctgtgtt gtcgtccttt tctctttaat 420 tctccttttt actcatccag gaattatggg ccaagggaag agtactcctc tctccctcac 480 cctgacacat tggaaggaag tacgagaccg ggcccacgac ctgtccgtca tcatccggaa 
540 gggcccctgg cagacgtttt gtgcctctga gtggccctcc ttcggcgtgg ggtggccctc 600 agatgaaact tttaaccttt ctgttatttc tgcagtgaaa aagaaaatct ttgcaccgca 660 cccccaagga cacctgaatc aaggtcccta tatcattgtc tggcaggacc tcacccaaaa 720 ccctccttct tgggtaaaga tgccggtgcc tcagccttct ctcgccctag tggcccagac 780 ggctccatca gatccagctc ggcgacctct acccctaact gatccccagg cagaatccct 840 cctcctggaa tctcccccac cctatgtccc ccagtcgcag gcgcaggcag cccggccagg 900 gccggggaca gcccagccca cggcccctat ggttgagggt ccggcccaag gcactaggag 960 taggagaggg ttgtctccgg attccattgt cgcctgcccc ctacgcccag tgccagttcc 1020 caactctggg ccagggtcgg atgacgacac agctcccctc cagctcctcc aatattggcc 1080 cttttccaca gctgatctct ataattggaa atctcaaaac tcgaacttct cccagaatct 1140 gggagaccta attaatcttt tagactctgt tcttttcact catcagccca cctgggatga 1200 ttgccagcag ttgcttaagg tgctcttcac aacggaggaa agagaaagga ttcaggggga 1260 ggcgcggaag ctggttccgg ggaggacggc agacctacta caaacccgcg aaccatcgac 1320 cgaacttttc cccttgaacg gccgctttgg gactgcaatg aggctgaagg tagggagcgt 1380 ctccgggtct accgccagac tctgatggcc ggtctccgta tggcggcgcg aaagccgacc 1440 aatttggcca aggtaggaga tgttcgtcag gggccagaga gcccggcagc atacttagag 1500 aggatcatgg aggccttccg gcagtacacc cccatagatc ccactatgga agagagcaag 1560 gcagctgtta tgatggcgtt tgtcaatcag gcggcccccg acattaggcg caaggtgcag 1620 agaatagata gattgggcga gaagactctg caggacctgt tagaagtggc ggagaaggtc 1680 tataacaata gagagacgcc agaagagagg ttagagagga tcaggataga aaataggaaa 1740 ttccaagctg gaaaagcacg gaaagcaaac agagagatgg ctaaaatcct gctagccgcc 1800 acaagagggg ggcagatagg gtcagaggat agggaaaggc cccggcggga aaggctaggc 1860 aaggatcaat gtgccaattg caaagagcat ggacattggg cctgagagtg ccccaaaaga 1920 aaggggggcg aagacttgga ggtcctgaaa aagtcacggt tgcaggacag gtgatagacg 1980 aatagggaag atggggttcg gtccccctcc ccaaacctag ggtaactttg caagtggagg 2040 ggaacccagt cagcttcctc atagatacag gagcagaaca tttggtacta acggaagaca 2100 caggaaaatt gtccagtaag accagctggg tgcagggggc aacaggagcc aaactatatc 2160 ggtggaccac gtggcagaga ttggatttgg gttcaggata aactcgccta ccctgttcaa 2220 cgaggccctc 
catgatgacc ttgggttttc ggaaaaggct cgacctctgt atgaagggcg 2280 taaggcgggg cgagcatggg agtggactgt gcaaatggaa gacgcttttc aggccctgag 2340 ggcagccatg ctggaagccc cggccctagc actccctgat cctacaaaag agtttcacct 2400 gttcatagac gagaagaggg gaaatagcca agggggtact cacgcaggcc ctgggaccct 2460 ggaagagacc ggtggcgtac ttgtcaaaaa aaaactagat ccggtggcgg cggggtggcc 2520 agcgtgcttg cgaatcatcg cggccacggc cctgttggtc aaagacgctg ataagctcac 2580 cctagggcag cggctgcaga tcaccacccc gcacgccata gagggggttc tgaagcagcc 2640 accgggacgg tggatcacaa atgcgaggct gactcattac caaggacttt tgctggacgc 2700 accccgcatc gagttccggg cccctgccgc cttgaatccg gctacgcttc tgccaacccc 2760 cgctgcagcc gcacctgaac gtgattgcct tgagatccta gccgagaccc agacgacccg 2820 tagagacttg aaggatcgcc cccttccagg tagcgacctg acctggttca cggacggaag 2880 cagtttcatc cgggacggac gcaggtacgc aggggcggcc atagtagacg accaaggtaa 2940 acttgtctgg gcggcacgcc ttccgcaagg gacatctgct cagaaggcag aactaatagc 3000 gctgacggag gctcttagcc gggcccgagg aaaaaggctg acggtgtaca ctgacagccg 3060 ctntgccttt gggaccgtgc acatacatag ggccctctac agggaaaggg gcttcatcac 3120 ggcagaggga aaggaaatta agcacatgcc tgaaatactc cacctactag aggctgtttt 3180 gctgccaaag gcagtggcgg tagtccatac cccgggacac cagagcgggg attccgtgga 3240 agcacggggc aatcggagag cggacgctga agctaagaag gctgcagaag acactccgca 3300 gccaaacatc ctgcacctta gcctgccacc cccaggcatg ggacggttgc ctcctctgcc 3360 ggactattcc agtgtagatg aggtctgggc ctcagagcag ctacgtgaag atggctggtt 3420 gcgggacgaa gggcatcgcc taatagtgcc agaactctta agacgccacc tgttaaagca 3480 cctgcatcag accacacatc tagggaagag aaagatgctg caactgctgg acactgctca 3540 actaaggttc aagtcacaag gacgggttcc agatgacatc gtcaaaagtt gccgagcctg 3600 tcaggtcatg cagccgggga ggaccagagg gacccacgca ggtacgaggg aaagggggag 3660 gcagccagga ctattttggg aagtggattt cacagaggtc aagcctagaa aatacgggta 3720 ccgatattta ctggtcatgg tagatacgtt ttccagatgg gtagaagcct tccctacgaa 3780 aggagaaaca gctttgacag tagctaagaa aatattagaa gaaatagtcc ccaggtatgg 3840 actgccggaa gggataagat caaataatag acctgcgttt gtctgtcagg ttagtcaagg 3900 gctggcccgg gctatgggga 
tagattggaa attacattgt gcatataatc cccagagctc 3960 tgggcaggta gagctaataa atagaacaat aaaggagacc ctgactaaat tggccctgga 4020 gactggcgga gactgggtga ccctccttcc cttcgccctg ttccgagccc ggaacacccc 4080 ttatcacttg ggtttaaccc cttttgaaat catgtacggc acaccccctc ctgtagttcc 4140 taggatgagc cctgaggctc ttccgggaca tcccaaagaa gtgttgcaag cggtgcaggc 4200 tctgcaacgt gtccacagtc aagtatggcc ggccctccgt gctatctacc ggcctgagga 4260 ggagggtgcc ggacacccgt ttcacccaca ccagccaggg gactgggtgt gggtgaaacg 4320 gcataacagc agaaccctcg agccgaggtg gaagggcccc tatcaaatta ttcttgttac 4380 ccccactgct cttaaggtcg atagaatagt agcgacctgg ctacaccact cccacgtgag 4440 acctgctaat aaaaaagacc aggagcggtg ccaggaccag tggaaagcaa ctgcagaccc 4500 gaagaatcct ctgaagacca gactgacccg agtggcggcg acagcccagg accgctgaca 4560 cagacttgga ctggggttgt ggcatctaaa gattgactgg gtgacaaaca tggacgggtc 4620 gcggtggggg gttggttatt gtgttttggg gatgttgtgt ttttggtcca atgtttttgg 4680 gatagagacc tacagctgta cttaccgccc cgaaccccca ccaaccccat aaccttacat 4740 gggtctaaag attactgtat aatggtgtta gtgttcccaa aaataatata ccacactgag 4800 gagacaatgt atgagggcat aatagggcga agagtcacca acctaatttt taaaagaaaa 4860 agagagccct tcacagccat aaccctggct actctttttg gcttagggac aataggagca 4920 ggaactagta tctcttctct ggcaatgcag caaagagggt ttaatactct gagggcagct 4980 gttgatccga ggtagtgctc cagaaccaga gggggttaga tctagtattc ctccagcaag 5040 gaggactgtg tgctgccctc aaagaagaac gttgtttcta tgcagaccat acgggagttg 5100 ttagagaatc tatagctaag gtcagggaag ggcttgcccg ccgcaaacgg gaatatgaac 5160 ggcaagcagg ttggtttgag tcttggttta acagttcccc ctggttaact actctactct 5220 ccactctgct agggcccctg cttatactca tcttgctcct aacctttggg ccttgtatac 5280 tcagccgact agtgactngt ctgagagcgg gtcggcgcng ttcagctact ggtnctgcaa 5340 cgacactatc aacctcttgc tggagatgaa gtcctgtagt ttcaagatta aaactttcca 5400 aaagaaaaag gggggaa 5417 // ID LTR77_TS repbase; DNA; PRI; 761 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 
15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR77_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-761 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1281-1281 (2010). XX DR [1] (Consensus) XX CC >90% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 761 BP; 181 A; 240 C; 196 G; 144 T; 0 other; tgagacagga gatagctaga gaaaggctag gcagacagac aggcgggtcc tctggtaaca 60 atgtagcgaa ctcccggcag gcctctcagg ctcagcattg gtaaacaagc tccctgcgag 120 cctctcaagc tagagcttaa gtactcactg ctgggatcga aaagcaacca atcagctcac 180 cattaggccc ctacctcctg ccgattggtc cttaatactg accaatcaga acagctacag 240 ctcacagcta gtgctagcca atcggcttgg acttagggct atataagcac caggttttcg 300 gccccagatg ggcaacccct tcgggtcccc tcccactgtg ggagcttctc tgtcctctca 360 ataaatcctg ctttctcact ttccggctgt ccgtgtccct cattcttcct gggcatgaga 420 caagaacctc gaccctgaag agctgcaacc tgaagagctg cagaagatca tcggagccga 480 agagccgtgc accccagagg ctggcttgcc agccgacgga gccggctgaa ccaggagagc 540 tgccggaaga aggcgagact ccgaagctga gacacccggg ggctaggctt gccagcccac 600 cgaaccagca acactctcag ggcttggctt gccagcccat tagagctgac acttccgggg 660 ctaggcttac cagcccctcg acgaccctct ctggggcttg gctggccgac ccattgagct 720 gacactctcg aagcccaggc tgacctgaaa aatccacatc a 761 // ID L1-4_TS repbase; DNA; PRI; 6258 BP. XX AC . XX DT 11-APR-2010 (Rel. 15.05, Created) DT 26-JUL-2010 (Rel. 15.08, Last updated, Version 4) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-4_TS. XX NM L1-4_TS. 
XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-6258 RA Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(5), 770-770 (2010). XX DR [1] (Consensus) XX CC >89% identical to consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 6258 BP; 2469 A; 1399 C; 1188 G; 1194 T; 8 other; gggattaaga tggccgacat gaggcagcta gagtgagttt ctcctacaga gtaaaccaag 60 atagagtttc ctcagcaccc agagaggaac ggatccagca gcatgcccca cgagtctcca 120 aaggagtgat aataaccact tagagcgaaa aaagaagaaa aaaatccacc gacctgggct 180 ggaactggcc ggactctcgc ctgnaccacc acagtccacc gcagcctccg gacgcagtgg 240 ggaaggtgag tgaggggaat tcaggagttc tgtgctcctg ccgcgaactt gcagggctcc 300 ctgagccttg cgggacccca gtccctcacg ggaaactgcc ttacccacag ccgtgctctg 360 ccgtggagtg gtgcaggaaa aaagccttga gcagcgccag ccgccatttc ccaggaactc 420 aacccttgca gggtggagcg gctgctttcc agccggcctg ggaggggacg ggccccaagg 480 gcaaactccg cggctgcaga cagttcctgg ctgggcgggt tgcctgagaa cacctgaggc 540 cctgaggccg gctgcacctt gacattcaga tcgcagattg cctgaggcca aggcgggagc 600 cgggtgggac gcgttgccat agcaacctct cactcccctt gtgaactctt ctgtgcctgc 660 tctgcacaac agcgttaccc cagtggcctg ggactccctc ctaatcccca ctggggtagc 720 taagagtgca cacttgtggg gtgcttcagc tcctttcccc cgcactcccc cgaaggctca 780 aaacagacag agaacttggg aaatccccag agccccgccc tcggcctagg ctggctgggc 840 acttccttgg agcaacactc gccaaggaga aaccctacag ccaccatcgc agctggcttt 900 ctcccacaag cgccacctcc tggccggagg tcaacagtac agcccaccac atcggatata 960 atacctcagc tttggggaag gcctgatccc aaacgccagc taacaacact ttccaagcca 1020 ctccggccac tcaggacgcc gtgagccagt gcaggtacac agcacagggc tgccacanct 1080 ggcaattgag aaactcacca caccaaggct atatataacc aagggaaccc tacagtgcct 1140 acgtcagccc cctgccaacc tcaaataaga agaaatagtc tacccaaatg 
agaaggaacc 1200 agaaaaataa ttctgacaat atgaaaaaac agagttctcc aacaccccca aaagaccaca 1260 ccaactcttc agcaacgaac cctaaccaaa atgaaatttt tgaaattcca gacatagaat 1320 tcaaaaggtt gattataaaa ttggtcaatg aacttcagga gaaaattgaa aaccaacata 1380 aagaaattta aaagattcag gacatggatg aaaaattctc caaagagata gacaccttaa 1440 aaaaaagctg aaattctgga aatgaaagaa tcatttaggg aagtacaaaa tgcagtggaa 1500 agttttaaca gtagactaga acaagcagaa gaaaggatct cagaacttga agacaaggct 1560 tttgaattaa cccaatcaga caacaataaa gaaaaaagaa tcaagagaat ggaacaaagt 1620 ctccaagaaa tgtgggatta tgtaaaacgt ccaaacctaa gaatcatagg tgttcctgag 1680 gaagaagaga aagcaacagg tttggaaaac ctatttgagg gaataatgga ggaaaacttc 1740 cctggccttg ccagagatct aaacatccag atacaagaag cccaaagaac ccccagaaga 1800 ttcattgcaa aaaggtcatc cccaaggcat ataatcatcc ggttatgcaa agttaatgtg 1860 aaggaaagaa ttttaagagn agtgagacaa aagcatcaaa taacttacaa aggtaaacct 1920 atcagactaa cagcagactt ctcagaagaa accctacaag ccagaaggga ttggggcccc 1980 atcttcaatc tcctcaaaca gaataacngt cagccaagga ttttgtatcc cgcaaaacta 2040 agtttcataa atgaaggaga aatgaagtcc ttcgcagaca agcaaacgct gagggaattt 2100 gtcaccacta gaccggctct acaagaaata ctcaaaggag ttctaaatac tgaaacaaaa 2160 ggtcgaaata caccagtaga aaaatgctgg aaatcataaa gttcacaggc tttatggaac 2220 accaacacaa tagagaaaat aaataaataa agcaaccaga taacaaccag tatgatgaat 2280 agaatagtac cttacatatc attattaact ctgaacgtaa atggtctcaa tgccccacta 2340 aaaagatata gactggcaga atggataaaa aaacacaacc caaatatctg ctgccttcaa 2400 gagacccact taactcacaa agactcctat agactcaagg taaaggggtg gggaaaaata 2460 ttccacgcaa atggaaacca aaaacgagca ggagtagcca ttctcatatc agataaaaca 2520 gactttaaat taacaacagt aaaaaaagac aaagatggct attatataat gataaaggga 2580 tcaatccaac gagaagacat aacaatttta aatatatatg cgcctaacac cagagccccc 2640 agctttataa aacaaattct actagaccta aaaaaagaga tagacagcaa tacaataata 2700 gtgggagact tcaacactcc actgacagca atagacagat catcaaggca gaaagtcaac 2760 aaagaaacat tggagttaaa cnggactcta gagnaaatgg aactaacaga catctacaga 2820 acattccatc ccaaaactac agaatataca tttttctcat cagcacatgg aacattctcc 
2880 aagatagacc atatgatagg ccacaaatca agtctcaata aattcaaaaa aatcgaaatc 2940 atatcaagca ccttctcaga ccacagtgga ataaaactag aaattaactc caagaagaac 3000 tcgcgaaacc aaacaaaaac atggaagtta aacaatctgc tcctgaatga tccttgggtc 3060 aacaatgaaa tcaagatgga aattttaaaa attcctcgaa acgaatgaca acaatgaaac 3120 aagttatcaa aacctctggg acacagccaa agctgtgctc agaggaaaat ttatagcgct 3180 taaggcctac attaataaga ctgaaagatc gcatatcgac aacctaacgt cacacctcaa 3240 ggaactagaa aaacacgaac aaaccaaacc caaagctagc agaagaaaag aaataacgaa 3300 gatcagagca gaactaaatg aaattgaaat taaaaataca aaggatcaat gaaacaaaaa 3360 gttggttctt tgaaaagata aacaaaattg atagaccgct agctagatta attaagaaac 3420 gaagagagaa gactcaaatt agctcaatca gaaatgaaaa tggagacatt acaactgata 3480 cgacggaaat tcaaaagatc atccgagact attatgaaca cctctatgca aacaaactag 3540 acaatctaga ggaaatggat aaattcttgg aaacatacaa ccccccaacc ttgaatcagg 3600 aagaaataga aaccctgaac agaccaataa cgagtagcga gattgaaaca gtaattagaa 3660 gtctcccaac aaaaaaaagt ccaggacctg atggactcac agctgaattc taccagacct 3720 tcaaagaaga actggtacca attttactga agctattcca caagattgaa gaggagggaa 3780 tcctccctaa ctcattctac gaggccagta tcactctgat accaaagcca ggaaaagaca 3840 caacaaaaaa gaaaactaca gaccaatatc ccttatgaac atagacgcaa aaatcctcaa 3900 caaaatatta gcaaaccgaa ttcaacagca cataaaaaat aatacaccac gaccaagtgg 3960 gttttatccc agggatgcaa gggtggttca acatacgcaa gtcaataaac gtgatccact 4020 acataaacag aattaaaaac aaaaatcata tgatcatttc aatagacgca gaaaaagcat 4080 ttgataaaat ccagcatccc ttcatgataa aaaccctcag caaaataggc atagagggaa 4140 cattcctcaa aataataaaa gccatatacg acaaacccac agccaatgtc atcctgaacg 4200 gagaaaagtt gaacgcattc cccctcagaa ctggaacaag gcaaggatgc ccactttcac 4260 cactnctatt caacatagta ctggaagtcc tagccagagc aatcaggcaa gaaaaagaaa 4320 taaagggtat ccaaattgga aaagaagaaa tcaaactatc tctgtatgcc gacgatatga 4380 tcttatacct agagaatcct aaagactcct ccaaaagact cctggacttg ataaatgaat 4440 tcggtaaagt ttcaggatac aaaatcaaca cacacaaatc agtagcactg ctatacacca 4500 atagcgacca agctgagaat caaatcaaga actcaattcc atttacaata gcagccaaaa 4560 
agctaaaata cctaggaata tatttaacca aggaggtgaa agatctctac aaggagaact 4620 acaaaactct gatgaaagaa attgcagagg acacaaacaa atggagaaac atcccatgct 4680 catggattgg aagaatcaac attgttaaaa tgaccatatt acccaaagta atctacagat 4740 tcaacgcaat ccctatcaaa ctaccaacgt catttttcac agaattagaa aaaaaaatcc 4800 taaaattcat atggaaccaa aaaagagcca gaatagccaa agcaatccta agcaaaaaga 4860 acaaagctgg aggcatcaca ttacctgact tcaaattata ctacaaggct gtagtaacca 4920 aaacagcatg gtactggtac aagagcagac acatggacca atggaacaga atagagaatc 4980 cagaaataaa cccaaatacc tacaaccaac tgatctttga caaagcagac aaaaacatac 5040 actggggaaa ggacacccta ttcaacaaat ggtgctggga aaattggata gctacatgca 5100 gaaaaatgaa acttgatccc tatctctccc catatacaaa aattaactcg agatggatta 5160 aagacttaaa tgtcagacct gaaaccataa aaatcctaga agaaaaccta ggaaaaactc 5220 ttctggacat cggcctaggc aaagaattta tggctaaaac cccaaaagca aatgcaacaa 5280 aaacaaataa ataagtggga cttaattaaa ctaaaaagct tctgcacagc aaaagaaata 5340 atcaacagag caaataggca acctacagaa tgggagaaga tattcgcaaa ctatacatct 5400 gacaaaggac taatatccag aatctacaaa gaactcaaac aaatcagcaa gaaaaaagca 5460 aataacccca ttaaaaagtg ggcaaacgac atgaacagac atctctcaaa agaagatata 5520 caaatggcca acaaacatat gaaaaaatgc tcaacatcac tcatcatcag ggaaatgcaa 5580 attaaaacca caatgagata ccaccttaca ccagtcagaa tggccattat taaaaaaaaa 5640 aaaaaataga tgctggcgcg gatgcggaga aaagggaacg cttatacact gttggtggga 5700 gtgtaaatta gtacaacctc tatggaaagc agtatggaga tttctcaaaa actaaaagta 5760 gatctaccat ttgatccagc aatcccacta ctaggtatct acccaaggaa cagaggtcgt 5820 tatataaaaa agacacctgc acccgcatgt tcatcgcagc acaattcaca attgcaaaga 5880 tgtggaatca accgatcagt ggataatgaa aatgtggtat atatacacca tggaatacta 5940 ttcagccatc aaaaagaatg aaataatgtc ttttgcagca acttggatgg agctggagan 6000 catcatccta agtgaactga ctcaggaaca gaaaaccaaa caccgcatgt tctcactcta 6060 tagtgggagc taaacggtcg caaagataga gaatggtata atggacactg gagactcaga 6120 aaggggaggg ggaaggggtg gagggcaaaa actacctatg ggacaatgtc acgggagggg 6180 gccaaaatgg tgggtacact taaagcccag accaaaacta tttgtacccc ccaaagctat 6240 tgaaattaaa 
aaataaaa 6258 // ID LTR1C5_OG repbase; DNA; PRI; 456 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 05-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1C5_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-456 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1672-1672 (2008). XX DR [1] (Consensus) XX CC 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 456 BP; 118 A; 133 C; 91 G; 113 T; 1 other; tgatgcaggt tccctccgct cagggctgga cacggactcc attttctggg cctaagcagg 60 ctcataaacc agtttgatca atgttccatc taagccacca ctccctagga agagcggaat 120 gcccaaaaca rgaccacatt cctacacaat ggcagtttta aaattaactt ctgcccacag 180 gcctctggcg ccaatgggct gtgacatatt ttagacatac attccatttc cccaaacccc 240 cctgcctgca gggaaggaga aactgcatat aaacccctag acagagagcc aaaatcggct 300 aacccactcg ggaacccctc tgctgtgccg gaggctttgc tctttctttt ctcttatcta 360 tcaataaaat ctatcctgtt gcttactcgt ggtggtccgt gaattcattc ttcgatttcc 420 cgagaccaag gacccggcag agaaattccg gtaaca 456 // ID LTR11_Mim repbase; DNA; PRI; 370 BP. XX AC . XX DT 09-NOV-2009 (Rel. 14.11, Created) DT 09-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR11_Mim. XX NM LTR11_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-370 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2967-2967 (2009). 
XX DR [1] (Consensus) XX CC ~90% identical to consensus. 5bp tsd. CC Distant similarity to LTR8A_ML from bat. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 370 BP; 78 A; 117 C; 81 G; 94 T; 0 other; tgtaagggaa actgcatttg tataggcaac aggtttctta ggccaaacta agatgactgc 60 cacagtgttt tttccggccc ggcccccccg gctagctggc cttccgggga ggcatgagtc 120 agcacactca tgtaaccaag gtgttatctc ctgtgtgaac ccatgtgatc acgctttccc 180 atggcatgtt gccactattg ttcagagcca tatataagcg ctcgccatgt tctcggccat 240 gcttttcgcc actgctgtat ccccccgaca ataaagagca tgtctcacct gcctgctgcc 300 actcgccttt tcttccaatt tccgaagcct gcgccggagc acacgctagc cacagagcat 360 cctcctcaca 370 // ID LTR1A2b_OG repbase; DNA; PRI; 648 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1A2b_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-648 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1579-1579 (2011). XX DR [1] (Consensus) XX CC ~91% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 648 BP; 171 A; 167 C; 172 G; 138 T; 0 other; tgataccgga caaagtggcg ccccagccag ctcagctgag cggggctggg ggacctgccg 60 cctccgtctt ggctgcggca gccggcagag gaaccggcag gctggagcaa gcggagggag 120 ctttcctctc ccaccggagg ctccagtagc aaaaaagacc agaccgagca ggggaaaaca 180 ctggcgattc ctccagtgca tccctgattg gtccattttc aatcctgctc ctgattggtc 240 tattttcagg agaaaataga ccaatcagct cagctttcag ctcattggac aactgcccct 300 atataaaccc ctgagtagag agcctagggg cagatcgtcg cagaccttcg agggagagag 360 agcaggagag caaagacctc agcagatcgc cgcagaccat tgagggagag agagcagaag 420 aggaagagct gtaacacttg tatcctgctc tgcaaggagg tttggctctt tgtaagagct 480 gtgacacttg tataaaacct atcttatcct gctctgcaag gtttggctct ttgtaagagc 540 tgtaatacct gttaaataaa acctaacttc catcctgtgg tccgtcgatt cattcatcga 600 atcgtcgaga ccaagagcct ggaaatccaa tatcaaaagt ccgtatca 648 // ID SUBTEL2_sat repbase; DNA; PRI; 87 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE SAT Satellite from primates. XX KW SAT; Satellite; Simple Repeat; SUBTEL2_sat. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-87 RA Smit A.F.; RT "SUBTEL2_sat - SAT Satellite from primates."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX SQ Sequence 87 BP; 0 A; 42 C; 27 G; 12 T; 6 other; gcgcctctct gcgcctgcgc cggcgcsscg cgcctctctg cgcctgcgcc ggcgcsscgc 60 gcctctctgc gcctgcgccg gcgcssc 87 // ID LTR7B_OG repbase; DNA; PRI; 331 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR7B_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-331 RA Jurka J. 
and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1679-1679 (2008). XX DR [1] (Consensus) XX CC 5 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 331 BP; 74 A; 102 C; 72 G; 83 T; 0 other; tgatagaatc aggtgtacct gcccaaataa acttgcccca cccaggcagg gtcaggttca 60 cctgcccagg cagggtcagg ttcatggccc gcccaggaag ggtcagactc atgtccctcc 120 caggaaatag gaatgccacc aatcctggct gccctaaccc tttaaatccc tgtccgccat 180 agcccacgtg tggatttctc cctagtcagg aatctcaccc cacctgcact ctaggtggaa 240 taaaagccac tttaattgca tgaattggat ctccgttttc ctttcgtctc gcgtctcgga 300 atgattttat ccttgggtcc ggaacctttc a 331 // ID MER41G_Mim repbase; DNA; PRI; 557 BP. XX AC . XX DT 09-NOV-2009 (Rel. 14.11, Created) DT 09-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW MER41G_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-557 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2988-2988 (2009). XX DR [1] (Consensus) XX CC >88% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 557 BP; 159 A; 137 C; 116 G; 144 T; 1 other; tgttagagta ggagactagg agacagggct gagaccggat gcaggggctt gmagattggg 60 tgcaggtgtt aaagtaaaca accctcaagc agaagctaac tagaggggct gataaccaca 120 ggcaagaacc ggcttcaact ggcttgtcag ctatctctgc caccttacat gaactttggc 180 taattacaat atcatttaca ttgtggtttt aaaatttctc cgcccttgaa atcaccatga 240 cagttccgag acaaccatat aaagtatgaa aatgggaggg aacccaattc taggaattac 300 tacccaaatt cttagtaaaa gccaacccct tggccttgaa tattcctccc ctcaattagt 360 aagactacaa aagatctaaa accacttcct cccagtgcag ctgctctttt tcgagcctgc 420 ccgctctctc tcctgagagt gtactttcgc tttcaataaa aagctgtcgc ttgcttggct 480 tacgtggtgc gtcctgaaat tcttttcccg acgatcacca agaacttgga aaaacctcca 540 cttggaggcc ggtaaca 557 // ID MER9a1 repbase; DNA; PRI; 513 BP. XX AC . XX DT 04-AUG-2008 (Rel. 14.02, Created) DT 04-AUG-2008 (Rel. 14.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Catarrhini. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW MER9a1. XX OS Catarrhini OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini. XX RN [1] RP 1-513 RA Smit A.F.; RT "MER9a1 - ERV2 Endogenous Retrovirus from Catarrhini."; RL Repbase Reports 9(2), 571-571 (2009). XX DR [1] (Consensus) XX CC Some copies integrated after split from orangutan. 
XX SQ Sequence 513 BP; 128 A; 134 C; 115 G; 136 T; 0 other; tgttgggagc aagcccccca aaatctggcc ataaactggc cccaaaactg gccataaaca 60 aaatctctgc agcactgtaa catgttcata atggccctaa cgcccaagct ggaaggttgt 120 gggtttacgg gaatgagggc aaggaacacc tggcccgccc agggcggaaa accgcttaaa 180 ggcattctta agccacaaac aatagcatga gcgatctgtg ccttaaggac gtgctcctgc 240 tgcagttaac tagcccaacc tattccttta attcggccca tcccttcgtt tcccataagg 300 gatactttta gttaatttaa tatctataga aacaatgcta atgactggtt tgctgttaat 360 aaatacgtgg gtaaatctct gttcggggct ctcagctctg aaggctgtga gacccctgat 420 ttcccacttc acacctctat atttctgtgt gtgtgtcttt aattcctcta gcgccgctgg 480 gttagggtct ccccgaccga gctggtctcg gca 513 // ID SPIN_NA_2_Og repbase; DNA; PRI; 80 BP. XX AC . XX DT 23-OCT-2008 (Rel. 13.11, Created) DT 23-OCT-2008 (Rel. 13.11, Last updated, Version 1) XX DE SPIN_NA_2_Og, a non-autonomous member of the SPIN family of hAT DE DNA transposons. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; SPIN; KW SPIN_NA_2_Og. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-80 RA Pace J.K., Gilbert C., Clark M.S. and Feschotte C.; RT "Repeated horizontal transfer of a DNA transposon in mammals and RT other tetrapods."; RL Proc Natl Acad Sci U S A 105(44), 17023-17028 (2008). XX DR [1] (Consensus) XX CC SPIN_NA_2_Og is a member of the hAT superfamily. The TIRs are CC 16-bp long and are flanked by 8-bp TSD. XX SQ Sequence 80 BP; 22 A; 20 C; 23 G; 15 T; 0 other; cagcggttct caacctgtgg gtcacgaccc acaggaactg tattaaaggg ccgcagcatt 60 aggaaggttg agaaccactg 80 // ID LTR12_TS repbase; DNA; PRI; 547 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR12_TS.
XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-547 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1269-1269 (2010). XX DR [1] (Consensus) XX CC >95% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 547 BP; 175 A; 86 C; 114 G; 172 T; 0 other; tgtcggagtc cagctccgca ggattcagga acccaggaag gatgtgtgga taaggcgaga 60 aacgagtcgg accagagttc tgtttcgcac tctggccacc gaagcacagc agtgctgatt 120 tatttatatc ataatttttg gcaagcaaat atcagtaaat taacagtaaa aaatgattaa 180 caattaataa taagttcatg acgtgattgt gattaagatt cttacgtata attatgtgct 240 tcagtaatgt ctacattttt aatatttctt agaaaaagag ctagctgttc ttggaaatag 300 taacttatta aaagggcaag gttaatatta gccaaaaggt tagcaagtgg aaagtaaaac 360 ttatactcta taaaggtaaa tagtctatat agatataaag gtgctaggca tataacctat 420 gtagtctcct tctcttacag tgggaatggt taggcctaat gatattggag gagggctgag 480 tatctaagcc tatctttaat gacttatgtt tcagtgtgtt cctcattttc tcacttcctg 540 gtccaca 547 // ID Alu2_OG repbase; DNA; PRI; 237 BP. XX AC . XX DT 06-APR-2010 (Rel. 15.04, Created) DT 06-APR-2010 (Rel. 15.04, Last updated, Version 3) XX DE SINE element - consensus. XX KW SINE1/7SL; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Alu2_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-237 RA Jurka J.; RT "SINE elements from the bushbaby genome."; RL Repbase Reports 10(4), 639-639 (2010). XX DR [1] (Consensus) XX CC The top youngest sequences are >87% identical to consensus. 
CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project. XX SQ Sequence 237 BP; 62 A; 61 C; 72 G; 42 T; 0 other; ggccgggcgc ggtggctcac gcctgtaatc ctagcactct gggaggccga ggcgggtgga 60 ttgcttgagc tcacgagttc gagaccagcc tgagcaaaag cgagaccccg tctctactaa 120 aaatagaaaa actgaggcaa gaggatcgct tgagcccaag agttggaggt tgctgtgagc 180 tatgacgcca cggcactcta cccagggcga cagcttgaga ctctgtctca aaaaaaa 237 // ID LTR6_Cja repbase; DNA; PRI; 546 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR6_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-546 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2919-2919 (2009). XX DR [1] (Consensus) XX CC >89% identical to consensus. 4bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. 
XX SQ Sequence 546 BP; 119 A; 165 C; 108 G; 154 T; 0 other; tgtaataacc tagcttgtgc tagcctaact gactccatct tagctgtaag ctccagctta 60 gcttctccct ccctgcctcg gccccctcag gtgcattact gggtcacgct gcactcaaag 120 ctttgaaccg ttaagagtgg ttcacactgt tcacagtacc tgtaaccact gtaaccgcat 180 cctgcttggc aagtctacct gctattgtga ataccctcct tggtgattga ttgtatggaa 240 agttccccct gctacccccc ttgatgattg tatggaaata cccttagcaa ccaacaatcc 300 tgggaacctc ctgccccctg gttaccagct tccttatcta acttgcttgt tctgcttctg 360 taaaattccg cttcagctag gctccccctc ccctacctaa atcaaggtat aaagggaaat 420 caagcccctt cctcggggcc gagagaattt tgagcgttag ccgtctctcg gtcgccggca 480 aataaaggac tcttaatttg gaactcagag cgtggcgctt tccttctgac tcgctcggtt 540 acaaca 546 // ID L1_RS1_5end repbase; DNA; PRI; 2248 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE L1 Non-LTR Retrotransposon from Cercopithecinae. XX KW L1; Non-LTR Retrotransposon; Transposable Element; LINE; KW L1_RS1_5end. XX OS Cercopithecinae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae. XX RN [1] RP 1-2248 RA Smit A.F.; RT "L1_RS1_5end - L1 Non-LTR Retrotransposon from Cercopithecinae."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC 2 subs, both with RS1_3end 0-1% (Macaca-specific probably). 
XX SQ Sequence 2248 BP; 786 A; 535 C; 544 G; 382 T; 1 other; gggggcggag caagatggcc gaataggaac agctccagtc tccaactccc agcgcgagcg 60 acacagaaga ccggtgattt ctgcattttc aactgaggta ctgggttcat ctcactgggg 120 agtgccggac gatcggtgct ggtcagctgc tgcagcccga ccagcgagag ctgaagcagg 180 gcgaggcatt gcctcacctg ggaagcgcaa gggggaaggg aatccctttt cctagccagg 240 ggaactgaga cacacaacac ctggaaaatc gggtaactcc caccccaata ctgcgcttta 300 agcaaacagg cacaccagga gatcatatcc cacacctggc cgggagggtc ccacacccac 360 ggagcctccc tcattgctag cacagcagtc tgtgatctac cggcaaggca gcagcgaggc 420 tgggggaggg gcgcccgcca ttgctgaggc ttaagtaggt aaacaaagct gctgggaagc 480 tcgaactggg tggagctcac agcagctcaa ggaaacctgc ctgtctctgt agactccacc 540 tctggggaca gggcaataac aaacgcagcc gaaacctctg cagacgcaaa cgactctgtc 600 tgacagcttt gaagagagca gtggatctcc caacacggag gttgagatct gagaagggac 660 agactccctg ctcaagtggg tccctgaccc ctgagtagcc taactgggag acatccccca 720 ctaggggcag tctgacaccc cacacctcac agggtggagt acacccctga gaggaagctt 780 ccaaagcaag aatcagacag gtacactcgc tgttcagaaa tattctatct tctgcagcct 840 ctgctgctga tacccaggca aacagggtct ggagtggacc tcaagcaatc tccaacagac 900 ctacagctga gggtcctgac tgttagaagg aaaactatca aacaggaagg acacctacac 960 caaaacccca tcagtacatc accatcatca aagaccagag gcagataaaa ccacaaagat 1020 ggggaaaaag cagggcagaa aagctggaaa ttcaaaaaat aagagcgcat ctcccccggc 1080 aaaggagcgc agctcatcgc cagcaacgga tcaaagctgg acggagaatg actttgacga 1140 gatgagagaa gaaggcttca gtccatcaaa tttctcagag ctaaaggagg aattacgtac 1200 ccagcgcaaa gaaactaaaa atcttgaaaa aaaagtggaa gaattgatgg ctagagtaat 1260 taatgcagag aaggtcataa acgaaatgaa agagatgaaa accatgacac gagaaatacg 1320 tgacaaatgc acaagcttca gtaaccgact cgatcaactg gaagaaagag tatcwgcgat 1380 tgaggatcaa atgaatgaaa tgaagcgaga agagaaacca aaagaaaaaa gaagaaaaag 1440 aaatgaacaa agcctgcaag aagtatggga ttatgtaaaa agaccaaatc tacgtctgat 1500 tggggtgcct gaaagtgagg gggaaaatgg aaccaagttg gaaaacactc ttcaggatat 1560 catccaggag aacttcccca acctagtagg gcaggccaac attcaaatcc aggaaataca 1620 gagaacgcca caaagatact cctcgagaag 
agcaactcca agacacataa ttgccagatt 1680 caccaaagtt gaaatgaagg aaaaaatctt aagggcagcc agagagaaag gtcgggttac 1740 ccacaaaggg aagcccatca gactaacagc agatctctcg gcagaaactc tccaagccag 1800 aagagagtgg gggccaatat tcaacattct taaagaaaag aattttaaac ccagaatttc 1860 atatccagcc aaactaagtt tcataagtga aggagaaata aaatccttta cagataagca 1920 aatgcttaga gattttgtca ccactaggcc tgccttacaa gagaccctga aggaagcact 1980 aaacatggaa aggaacaacc ggtaccagcc attgcaaaaa catgccaaaa tgtaaagacc 2040 atcgaggcta ggaagaaact gcatcaacta acgagcaaaa taaccagtta atatcataat 2100 ggcaggatca agttcacaca taacaatctt aaccttaaat gtaaatggac taaatgctcc 2160 aattaaaaga cacagactgg caaactggat aaagagtcaa gacccatcag tctgctgtat 2220 tcaggagacc catctcacac gcagagac 2248 // ID GarnAlu1 repbase; DNA; PRI; 273 BP. XX AC . XX DT 06-APR-2010 (Rel. 15.05, Created) DT 06-APR-2010 (Rel. 15.05, Last updated, Version 3) XX DE SINE element - consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; GarnAlu1. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-273 RA Jurka J.; RT "SINE elements from the bushbaby genome."; RL Repbase Reports 10(5), 778-778 (2010). XX DR [1] (Consensus) XX CC The youngest sequences are >97% identical to consensus. The CC 5'-end is homologous to Garnel1 (tRNA-derived), and the 3'-end is CC Alu-like. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project. XX SQ Sequence 273 BP; 67 A; 74 C; 87 G; 45 T; 0 other; ggctcggcgc ctgtggctca agcggctaag gcgccagcca catacaccta aggtggcggg 60 ttcgaatccc cagcccgggc ccgccaaaca acaatgatgg ctgcaaccaa aaaatggcca 120 ggcgttgtgg cgggcgcctg tagtcccagc tacttgggag gctgaggcag gagactcgct 180 tgagcccagg agttggaggt tgctgtgagc gtgatgccat ggcactctac ccagggggat 240 agcttgaggc tccgtctcaa aaaaaaaaaa aaa 273 // ID LTR20_Mim repbase; DNA; PRI; 426 BP. 
XX AC . XX DT 11-NOV-2009 (Rel. 14.11, Created) DT 11-NOV-2009 (Rel. 14.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR20_Mim. XX NM LTR20_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-426 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2980-2980 (2009). XX DR [1] (Consensus) XX CC ~90% identical to consensus. 4bp tsd. CC Strong similarity to LTR20_OG from bushbaby. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 426 BP; 111 A; 101 C; 96 G; 116 T; 2 other; tgtaacagaa gtaagggcct gaaaaagggc aaaatgtttt acgagttgtc tctttaaaac 60 ccctcaccct ttttggggaa ttaaaacctg cattcctgcc cgaggccagt aatcagaggg 120 gcagaggagt gttctttgtt ctttgtttta aaccacagga gatggctcaa cggaattgtc 180 cgggcagagg tcacgagatc gtcttcaccg gagatgctat agttaaacag caatagcccg 240 aagctgaaaa ccccctttaa aagctctgta tttctgctta aagggaggac gatggtnctt 300 taagacgaga gtctgccatc ctcctcattt gccggcaaat taataaactt ctctttcctt 360 ttcctcaaac cgcttgtcct cgttcttcgt tcggccncgg ggacaagtac cgaactttcg 420 gtaaca 426 // ID LTR15B_OG repbase; DNA; PRI; 910 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 02-JUN-2011 (Rel. 16.05, Last updated, Version 1) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERV1; KW LTR15B_OG. 
XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-910 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1590-1590 (2011). XX DR [1] (Consensus) XX CC >91% identical to consensus. 5 or 6 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 910 BP; 255 A; 222 C; 211 G; 222 T; 0 other; tgctgggaga atggggaccg aagaccccat tcttggcaaa ggaaagcacc cccagaaaaa 60 gaggaaaact aaaaagtaag ctgagtgagt cggaacaaag aaaagtcagc tagtttacat 120 ttaagctagc cattgctagc aagcttcttg ccttttctga aattctagcc tgttggggat 180 acagcttaca aacaccctgc tgggtctgtg agcactctga ggagaataat ggccatagaa 240 gaggctatgg caagtcctag aaaagataca tcttctaggg catttcctgc ccattaaaca 300 ggcaaagcag atacccattg tggagatttg catttgcctg gggcagagac caagggagta 360 gctcgagctc tgagatatgg gcagaagagc ttatatccct tatcgcagag gaaaatcctc 420 tgtggtggtg atgggaggaa aatgcctccc tcccaccgat ccctcagagg agattaacgc 480 tggtggcgtt ttactccaca gagaactagt ggagcttaac acctccctag aaatctgtgt 540 gtgtgacctc aagcaggagg ctctcccaat acaaggctca gatttcagat tactgaaggc 600 tacccttaag gttaataata ataacaatag gagctgatat gatcaatccg agaaactgct 660 gacatctttg ttccctccct catcttaccc tctgggcaaa tagacttcaa taaaatcaga 720 gtggagcaaa cactctgggc cattgccgga atcccacgat gggcaatgga cccctgagct 780 cccactttta attctactct atgtctcttg tctttcttta ccatcgcttc taactctata 840 tttcttcagt cgatcgccgc cagtttccac aggtctcagc ggacgctgct ctcggggcgg 900 gaccccaaca 910 // ID LTR39_Mim repbase; DNA; PRI; 782 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; nonautonomous; KW LTR39_Mim. 
XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-782 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1726-1726 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 782 BP; 206 A; 214 C; 162 G; 200 T; 0 other; tgagatggag cagggacccc tcttaggggc ctgccaggcc ccccccaagc atggaaataa 60 aggaaaaatc ttgagtccct tcaagggaaa ttccaggcac ctagctagcc ttgaaaagta 120 aatgagcaac ctgataagca agaagatagt aataacaata gtcccccaag caagccagag 180 tcacaaggtg ttttggttcc ctatggaaac taaaagataa catcttaaca tatgttcttg 240 agttgttttt cagaaaccca cccccaccag atggaaaatg ccaaccgcta tcacgtagac 300 ctcagataag gggaactgag gactgaactc tgaccgccgt tctttgttct aaatttcttc 360 ctgaggggcc tggagagagt cacacccaca ggccaaacct taacattcct ttccgctgac 420 cccaagtttt tagacaaggc cttgcttcct taaccaattg ctaatcaaag aatctctgaa 480 tccacctatg acccgtaagc ccccgcttca agatatcccg cctttttggg ccaaaccaat 540 gtataacctc catgtattga tttacgattt tgcctgcaac ttctgctttc ctgaaatgta 600 cccctgcctt taaaaaccct tgcttgtaag ccatcgggga gttcgggtct taagcgttag 660 ctgcccgttc tccttgcttg gcgccatgca ataaatgcct ttctttctct cgctgcaaat 720 ctcagcgtca gtgtctggct ttgctgcgcc gggcgagcgg accccagttc ggttcggtaa 780 ca 782 // ID LTR9_OG repbase; DNA; PRI; 450 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.11, Created) DT 18-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR9_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. 
XX RN [1] RP 1-450 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2854-2854 (2009). XX DR [1] (Consensus) XX CC ~89% identical to consensus. 5bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 450 BP; 65 A; 185 C; 58 G; 142 T; 0 other; tgttagataa ttgcctcaga aagccccttc cccaaaagcc ccttcctgct cttaaagtta 60 caactcctcc ccaagcacag ggagtggccc gcttacctca tcaccctgtt ccaattaccc 120 gctgcccgcc ccaacgactc tttccgccgg agcagtaact ccttattggt tatcccacca 180 tcccgtcacc attctgagac ttccctatat aagggtacgt ttctccctct ctctctctct 240 ctctccctct cccccctccc cttctctgct ctccctctct ctcttctctt ctccctctct 300 cctctctctc tcccctcttc tctctcttct cttctctttt tctctcctcc ttggtgctgc 360 tctgcagcct ccctccccca ccaataaaga cctttcgtac aagccttggt ggtctgtgag 420 atctctgccg gtcgaagtgc ctccctttca 450 // ID CYN-III2 repbase; DNA; PRI; 240 BP. XX AC . XX DT 02-MAY-2006 (Rel. 11.04, Created) DT 02-MAY-2006 (Rel. 11.04, Last updated, Version 1) XX DE SINE element from Cynocephalus - consensus sequence. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; CYN-I; CYN-III2. XX OS Cynocephalus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Dermoptera; Cynocephalidae. XX RN [1] RP 1-240 RA Schmitz J. and Zischler H.; RT "A novel family of tRNA-derived SINEs in the colugo and two new RT retrotransposable markers separating dermopterans from RT primates."; RL Mol Phylogenet Evol 28(2), 341-349 (2003). XX DR [1] (Consensus) XX SQ Sequence 240 BP; 61 A; 66 C; 80 G; 33 T; 0 other; ggccggcccg gtggcacact ggatagtgcg ccacttggga agcgcggcgg tgctcccgcc 60 cgagggttcg gatcccacat acagaccggt tcccgctcac tggctgagcg aggcgcggga 120 gcagcgccga gggttgcaat ccgttgccgg tctccggtcc ggtatggggg caacactgag 180 ggttgcgatc cgttgccgga cacggaaaaa gacaaaaaga caaaaaaaaa aaaaaaaaaa 240 // ID MacERV3_int repbase; DNA; PRI; 7463 BP. XX AC . 
XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MacERV3_int. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-7463 RA Smit A.F.; RT "MacERV3_int - ERV1 Endogenous Retrovirus from Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC chr1.nib:209738444 ORFs gag 176-2041, pol 2042-5611, env CC 5554-7437. XX FH Key Location/Qualifiers FT CDS 176..2041 FT /product="MacERV3_int_1p" FT /note="gag." FT /translation="MGKGDRRVRHLPTYAPGDALAVVWEKADDSVSLLKSV FT GRSPLPSEYFVILWRHSLAARLLLTCLVFVFVTFVLSLLYVDEMGQTLTTP FT LSLTLTHFPDVRARAHHLSVEVRKGRWKTFCSSEWPTLHGEWPRDGTFNLS FT IILQVKAKVMDPGPLGHPDQVAYIIIWEDLVRNPPSWVKPFLHSPSPSQST FT LLALEAPKNRNPDPXKPVLPDEPQRDLLLLDPLPPPPQNPLLGPPPYASPL FT PPVLSPALSSTASAPTLSPTSPSAPPSTPSPSPAPPKLTPRTPPPTPPRLR FT LRRTEDPDGPSTWQSSLFPLRTVNRTVQYWPFSASDLYNWKTHNPSFSQDP FT QALTSLIESILLTHQPTWDDCQQLLQVLLTTEERQRVLLEARKNVPGPGGL FT PTQLPNEIDEGFPLTRPDWDYETAPGRESLRIYRQALLAGLKGAGKRPTNL FT AKVRTITQERDESPAAFMERLLEGFRMYTPFDPEALEHKATVAMAFIDQAA FT LDIKGKLQRLDGIQTYGLQELVREAEKVYNKRETPEEREARLAKEQMERED FT RRDRVRDKHLTKILAAVVREKGPGREGEKRRRPKVEKDQCAYCKERGHWIK FT DCPKRPKDQKKPAAVLTLGEDSE*" FT CDS 2042..5611 FT /product="MacERV3_int_2p" FT /note="pol." 
FT /translation="GCQGSGAPPEPRLTLSVGGHPTTFLVDTGAQHSVLTK FT ADGPLSSRTSWVQGATGGKLHKWTNHRTVNLGRGMVTHSFLVVPECPYPLL FT GRDLLTKLGAQIHFSETGAQVLDRDGQPIQILTVSLQDEHRLFDAPVITSL FT PDVWLQDFPQAWAETGGLGLAKYQAPIIIDLKPTAVPVSIKQYPMSREAHI FT GIRQHINKFLELGVLRPCRSPWNTPLLPVKKPGTQDYRPVQDLREINKRTM FT DIHPTVPNPYNLLSTLKPGYNWYTVLDLKDAFFCLPLAPQSQELFAFEWKD FT PEKGISGQLTWTRLPQGFKNSPTLFDEALHRDLTDFRTQNPEVTLLQYVDD FT LLLAAPTKEICIQGTRHLLQALGEKGYRASAKKAQLCQTKVTYLGYILSEG FT KRWLTPGRIETVARLPPPRNPREVREFLGTAGFCRLWIPGFAELAAPLYAL FT TKESTPFTWQTEHQLAFEALKQALLSAPALGLPDTSKPFTLFLDERQGIAK FT GVLTQKLGPWKRPVAYLSKKLDPVAAGWPPCLRIMAATAMLVKDSAKLTLG FT QPLTVITPHALEAIVRQPPDRWITNARLTHYQALLLDTDRVQFGPPVTLNP FT ATLLPVPEDQPSPHDCRQVLAETHGTREDLKDQELPDADHTWYTDGSSYLD FT SGTRRAGAAVVDGHNTIWAQSLPPGTSAQKAELIALTKALELSKGKKANIY FT TDSRYAFATAHTHGSIYERRGLLTSEGKEIKNKAEIIALLKALFLPQEVAI FT IHCPGHQKGQDPVAVGNRQADQVARQAAMAEVLTLATEPDETSHITIEHTY FT TPEDQEEAKAIGAIENKDTKNWEKGGKIVLPQKEALAMIQQMHAWTHLSNR FT KLKLLIEKTDFLIPKASTLVEQVTSACKVCQQVNAGATRVPEGKRTRGNRP FT GVYWEIDFTEVKPHYAGYKYLLVFVDTFSGWVEAYPTRQETAHIVAKKILE FT EIFPRFGLPKVIGSDNGPAFVSQVSQGLARILGINWKLHCAYRPQSSGQVE FT RMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNTPNRFGLTPYEILYGG FT PPPLSTLLNSFSPSDPKTDLQARLKGLQAVQAQIWAPLAELYQPGHPQTSH FT PFQVGDSVYVRRHRTQGLEPRWKGPYIVLLTTPTAIKVDGISTWIHASHAK FT AAPGTPGPTPPETWRLRRSEDPLKIRLSRI*" FT CDS 5554..7437 FT /product="MacERV3_int_3p" FT /note="env." 
FT /translation="DMETPTLRGPAQDKTLSYLAPCLLLALLPCVAGSNNP FT HRPYNLTWQVTDFSTHEVLDKTSKIAPMGTWFPDLYFNLDKIAKIDDMEGG FT EWRKQARRVSVSRNGFYACPGFRTGEMRKTCGEIDALFCASWSCITTNDGE FT WKWATKPWYITMSFVQRCTRTRYSKTCNLVRIKFEDAAKSDNRWISGLIWG FT LYLYQKPLYGIPIQIKLIVNPITAPVAVGPNQVLSETRKPPVPAPREPQPR FT APKSTSPPLISTSSKYTPSAQNVTRGPLDLGIGDRLLNLIKGSYFALNQTK FT PEFTSSCWLCLATGPPYYEGIASTNNFTNSANPTGCAWEQQRKLTLAEVSG FT SGTCIGQVPPSHQHLCNVTLTVPSSNHYLVPSETDWWACNTGLTPCVSTAV FT FSSGTHYCVLVQVVPRVYYHSGDSFDLRYEQKTHTRPKREPISLTLAVMLG FT IGVAAGVGTGTAALVHGNHHLQQLRVAIDEDLRAIEQSITKLEESLTSLSE FT VVLQNRRGLEIVFLKEGGLCAALKEQCCFYADHSGVVKDSMAKLRERLDKR FT KKERESQQSWFENWYNQSPWLSTLISTILGPLILLTLILTFGPCILNRLLT FT LIKNRLNIVHAMVLTQQYQTLRTEEEAQD*" XX SQ Sequence 7463 BP; 1999 A; 2127 C; 1735 G; 1601 T; 1 other; tttggtgcgt tggccgggaa gtggggtcgt ccgaggaccc ccgacccatc cggcggagac 60 ccatctggcc cgggccacgg actgctgact gaacggacct accaggtact ttcgttttgt 120 tctgtctgtc ttgccggcta actctgaact ctggggagta ctccttctga attaaatggg 180 gaagggggac agacgtgtcc ggcaccttcc cacttacgcc ccgggggacg ccctggcggt 240 agtctgggag aaggctgacg actcagtcag cctcctcaaa tctgtaggca ggtcgcccct 300 gccgtctgaa tattttgtga tcttgtggcg ccactctctg gccgcgcggc ttctccttac 360 ttgtctggtc tttgtttttg ttactttcgt tttgtccttg ttatacgtgg acgaaatggg 420 acagacgttg acgactcctt tgtctctaac cctgactcac ttccctgacg tccgggctcg 480 agcccaccac ctctccgtag aagtccgtaa gggacgatgg aaaacattct gctcgtccga 540 atggccaacc ctccatgggg agtggccccg ggacggaaca tttaacctct caattatctt 600 gcaggttaaa gcaaaagtga tggatcctgg gccactcgga cacccggacc aggtggccta 660 cataattatt tgggaggatc tggtccgaaa tcctccctct tgggtgaaac ccttcctcca 720 ttccccttcc ccatcccaat ctaccctcct tgccttagaa gccccaaaga atcggaatcc 780 ggacccgcnt aagccagtcc tcccagatga accccagagg gatctcctcc ttcttgaccc 840 cctgcctcct ccacctcaaa acccccttct gggacctcca ccttacgctt cacccttgcc 900 ccctgtcttg tccccagctc tttcctctac cgcctcggcc cctacccttt ctccaacttc 960 tccctcggcc cctccctcca ccccgtctcc ttctccagcc ccgcccaaac tcacccctcg 1020 gacgccgccg ccgacacctc ctcgtctccg cttgcggcgg 
actgaggacc cagatggccc 1080 ttccacttgg caatcctccc tttttcccct ccgtaccgtc aatcgcacgg tccagtactg 1140 gcccttctct gcctctgacc tctacaactg gaaaacccat aacccttcct tttcccaaga 1200 cccccaggcc ctaacctcgt tgatagaatc cattctcctc actcaccagc ccacttggga 1260 tgattgccag caactcttgc aggtcctcct aaccactgaa gaaaggcagc gagtcctcct 1320 ggaggcccgg aaaaatgtgc caggaccagg aggcctccca acccaacttc ccaatgaaat 1380 agacgaggga tttcccctca cccgcccgga ctgggactat gaaacggcac caggtaggga 1440 gagtctccga atctatcgcc aggctctgtt ggcgggtctc aaaggggcag gaaagcgccc 1500 cacaaatttg gccaaggtaa ggaccataac tcaggaaagg gacgaaagcc cggcagcctt 1560 catggaaagg cttctggaag ggttccgaat gtatacccca ttcgatccag aggccctaga 1620 acataaggct accgtagcta tggcattcat agaccaagct gcattagata tcaaaggaaa 1680 actccaaagg ctagatggaa tccaaaccta tggattacag gaattggtta gggaggcaga 1740 aaaagtgtat aataagagag aaacccctga ggaaagggaa gccaggttag cgaaggaaca 1800 gatggagcga gaggatcgta gggaccgagt gagggataag catttaacaa aaatcctggc 1860 ggcagttgtg agagagaaag gaccagggag agagggagag aagcggaggc ggccaaaagt 1920 ggaaaaagac cagtgtgcct actgcaaaga acggggacat tggatcaaag attgccccaa 1980 gcgtcctaaa gaccaaaaga aacctgccgc tgtcctcacc ctaggtgaag atagcgaata 2040 ggggtgtcaa ggctctggag ccccccccga gccccggcta actctctctg taggggggca 2100 ccccaccacc ttcttggtgg acacaggggc ccaacattca gttttgacaa aggcagacgg 2160 gcccctgtct tcccgcacat cctgggtcca gggggcaaca gggggaaaat tgcacaagtg 2220 gactaaccac cggacagtta atcttggacg aggaatggtg acacattcct tcttggtggt 2280 acctgaatgc ccctaccccc tcctagggcg agatcttctg accaagctcg gagcccaaat 2340 ccatttctcc gagacagggg cccaggtatt agatcgggac ggtcagccca tccaaatctt 2400 aactgtgtct ctgcaagatg agcatcggct tttcgacgct ccggtcatca ctagcctccc 2460 cgatgtttgg ttgcaagatt ttccccaagc ttgggcggaa acgggaggac tcgggctagc 2520 taagtatcaa gccccaatca taattgattt aaagcccacg gcggtgcccg tgtctatcaa 2580 gcaatatccc atgagccgag aggctcatat aggaattcgg cagcacatta acaaatttct 2640 agaactcgga gtgttgcgac cttgtcgctc gccctggaac actcctcttc tgccagtaaa 2700 aaagcctggt actcaggatt acaggcctgt ccaagacttg agagaaatta 
acaaaagaac 2760 catggacatc catcccacgg tccccaatcc ttacaactta ctcagcacct taaaaccagg 2820 ctataactgg tatacagtat tagatttaaa agatgctttc ttctgtttac ctctggcccc 2880 ccaaagccaa gaactctttg cctttgagtg gaaggatcct gagaaaggaa tttcgggcca 2940 attgacctgg acccggcttc cccaaggatt caagaactct cccactctct tcgatgaggc 3000 tcttcatcga gacctgaccg acttccggac ccaaaatcca gaagtgactc tactccagta 3060 tgtggatgac ctcctcttgg ctgctcctac aaaggaaatc tgtatacaag gtaccaggca 3120 tctactccag gcactgggtg aaaaaggata ccgggcatcc gccaagaagg cacagctctg 3180 tcagaccaag gtaacatatc tggggtatat cctgagtgaa gggaaaaggt ggctcacccc 3240 tgggcgcata gagacagtgg ctcgccttcc accaccacga aatcccaggg aggtacgtga 3300 attcttggga actgctgggt tctgtcgctt gtggataccc ggttttgctg aactggccgc 3360 ccccctttat gcactcacca aggagagcac ccctttcacc tggcagacag agcatcaatt 3420 ggcttttgag gcactaaaac aagcactctt gtctgccccg gcccttgggt taccggacac 3480 ctcaaagccc tttaccctct tcctggacga gaggcaaggg attgccaaag gggtcttgac 3540 ccaaaaatta gggccttgga aaagaccggt agcatacctg tctaaaaagc tggaccctgt 3600 ggcggccggc tggcccccgt gtcttcgtat catggcagcc accgctatgc tggtcaagga 3660 ctctgctaag ttaacccttg ggcagccact gactgttatt accccacatg ctctagaggc 3720 catagtgcgg cagcccccgg accggtggat aaccaacgca cgcctaaccc actaccaggc 3780 cctcctactg gacacggacc gcgtccagtt tggccctccg gtcaccctaa accctgctac 3840 gctgctgccg gtaccagaag accaaccaag cccacacgat tgtcggcaag tactggctga 3900 gacccatgga acacgggaag accttaaaga ccaagaactc ccagacgcgg atcacacctg 3960 gtacacagac ggcagcagtt accttgactc aggtacccgg agggcgggag cggcggtagt 4020 agatggccac aacaccattt gggcacaatc actacctcct ggcacgtctg cacagaaggc 4080 tgagttaata gcactaacca aggccctaga gctgtccaag ggaaagaaag ctaacattta 4140 tactgatagc cggtatgcct ttgcaacggc tcatactcat ggaagtatct atgaaagaag 4200 aggtctccta acctcagaag gaaaggaaat caagaacaaa gctgaaataa ttgccttatt 4260 aaaagccctt tttcttcctc aagaagtggc tataattcac tgccccgggc atcagaaagg 4320 acaggatcca gtcgcagtag gaaacagaca ggccgaccaa gtggccaggc aagccgccat 4380 ggcggaagta ctgaccctag ccacagaacc tgacgaaacc agccacataa ctattgaaca 
4440 tacttatacc ccagaagacc aggaagaagc aaaagccata ggggctatag aaaacaaaga 4500 cactaaaaac tgggaaaaag gaggaaaaat agtccttccc caaaaggagg ccctggcaat 4560 gatccagcag atgcatgcct ggacacactt gagtaatcga aagctaaaat tactgattga 4620 aaaaactgac tttctaatcc caaaggcaag taccctcgta gaacaagtga catctgcctg 4680 taaggtctgt cagcaggtaa acgctggggc tacccgagtg ccagaaggga aacgaactcg 4740 tggtaaccgc ccaggagtct attgggaaat agacttcact gaagtaaaac ctcactatgc 4800 tggatataag tacttactgg tgtttgtaga taccttttca ggatgggtag aagcctaccc 4860 cacccggcaa gaaacggcac acatagtagc caagaaaatt ttggaagaaa tctttcctag 4920 attcggactt cccaaggtaa ttgggtcaga taacgggccg gccttcgttt ctcaggtaag 4980 tcaggggctc gccaggatat tggggattaa ttggaaatta cattgtgcct atagacccca 5040 gagctcagga caggtagaaa gaatgaatag aacaataaaa gagaccctta ctaaattgac 5100 cttagagact ggtttaaaag attggagacg cctcctatcc ttagctctgt taagggcccg 5160 aaatacgcct aaccgttttg ggctcactcc atatgaaatc ctctacggag gacctccccc 5220 tttgtcaacc ttgcttaact ccttctcccc ctccgatcct aagactgacc tacaggcccg 5280 gctaaaagga ctccaagcag tacaggccca aatctgggcc cccttggcag aactgtacca 5340 accaggacat ccacagacca gtcacccctt ccaggtggga gactctgtct acgttagacg 5400 gcaccgcact caaggactag agcctcggtg gaaaggaccc tacattgttc tcctgaccac 5460 gcccacagcc ataaaggttg acggaatctc cacttggatc cacgcatccc acgccaaggc 5520 tgctccaggg acgcccggac caacaccacc tgagacatgg agactccgac gctccgagga 5580 cccgctcaag ataagactct ctcgtatcta gccccttgct tattactagc cctccttccc 5640 tgcgtcgctg gcagtaataa cccccaccgg ccatataacc tgacctggca ggtaactgat 5700 tttagtactc atgaggtctt agataaaacc tcaaaaattg ctcctatggg gacttggttc 5760 cctgacctct atttcaacct agacaaaata gcaaagatag atgacatgga agggggagaa 5820 tggagaaaac aggctagaag ggtgtccgta agccggaacg ggttctatgc ctgtcccgga 5880 ttcaggacag gagaaatgag aaaaacttgt ggagaaatag atgccctgtt ttgcgctagt 5940 tggtcctgta taactactaa tgatggagaa tggaaatggg ccacgaaacc ctggtacata 6000 accatgtcct ttgtccagcg ctgcaccaga acccgatatt cgaaaacttg caatctggtc 6060 cgcatcaagt ttgaagatgc ggcaaaatct gataaccgtt ggatatccgg gttaatatgg 6120 
ggcctatatt tataccaaaa gccactgtat ggaatcccta tccaaatcaa attaatagtc 6180 aaccctatca cagcccctgt cgcagtagga ccaaaccaag ttttatcaga aacaaggaag 6240 cccccggttc ccgcaccgag agagccccaa ccaagagctc ctaaaagcac ttccccgccc 6300 ttaatctcca cctcctctaa atacacaccc tcagcccaaa acgtcacccg cgggcccctt 6360 gacctgggaa taggtgacag gctcctaaat ctcataaagg gctcctattt cgctttaaac 6420 cagacaaagc cagaatttac ctcctcttgc tggctatgtc tggcaacagg ccccccttac 6480 tatgaaggca tcgcctccac taataatttc actaactccg ccaatcctac tggatgcgcg 6540 tgggaacaac aaagaaaact aaccctggct gaagtttctg ggtcgggaac ctgcataggc 6600 caagtgcccc ctagtcatca gcatctttgt aatgtaacct tgacagtacc cagctccaat 6660 cactatttgg tcccctccga gacggactgg tgggcttgca acactgggct caccccctgt 6720 gtatccacag ccgtttttag cagcggcacc cactattgcg tgttggtaca agttgttccc 6780 cgagtatact atcactctgg agactccttt gatctccggt atgagcaaaa aactcatact 6840 agaccaaaga gagaacctat ctccctcacc ctcgccgtca tgttaggaat tggggtagcg 6900 gctggagttg ggaccgggac agcagcccta gtgcatggta accatcatct gcaacaactt 6960 agagtagcca tagatgaaga ccttagagcc atagaacaat ctatcacaaa acttgaagag 7020 tccttgactt ctctgtctga agttgtatta caaaaccgac gaggactaga aattgtcttt 7080 ctgaaagagg gcgggctctg tgcagccctg aaagagcaat gttgttttta tgcagatcat 7140 tcaggagtag ttaaagattc tatggcaaaa ctaagagaaa gattagacaa gagaaaaaaa 7200 gagagagaat ctcagcaaag ttggtttgaa aattggtaca accaatcccc ttggcttagc 7260 accctaatct ccaccatctt aggacccctc atcctgctca cgctcatcct gactttcggg 7320 ccatgcatac tcaaccgctt actcaccctt attaaaaata gattaaacat agtacatgct 7380 atggttctga cccaacaata ccagaccctc aggactgaag aagaggctca agattgagcc 7440 tctgacacaa aaagaggagg gaa 7463 // ID HUERS-P1 repbase; DNA; PRI; 6263 BP. XX AC . XX DT 10-AUG-1998 (Rel. 3.07, Created) DT 31-JUL-2008 (Rel. 13.07, Last updated, Version 2) XX DE Primate HUERS-P1 repetitive element - a consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MER4I-group; LTR8; MuLV; HUERS-P1. XX NM HUERS-P1. 
XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-30 RA Harada F., Tsukada N. and Kato N.; RT "Isolation of three kinds of human endogenous retrovirus-like RT sequence using tRNA pro as a probe."; RL Nucleic Acids Res 15, 9153-9162 (1987). XX RN [2] RP 1-856 RA Kapitonov V.V. and Jurka J.; RT "Direct submission."; RL Direct submission (July 1998). XX RN [3] RP 1-6263 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR [2] (Consensus) XX CC Originally, a consensus sequence of the 856-bp 5'-terminal part CC of the internal portion of HUERS-P1, an LTR retrotransposon with CC LTR8 long terminal repeats, was obtained [2]. It was reported in CC 1998 that the remaining portion of HUERS-P1 is related to the CC MER4I-group [2]. At that time only 10% of the human genome was CC sequenced and it was not enough sequence data for an accurate CC full-length consensus sequence, which was finally derived in 2004 CC from the completely sequenced human genome [3]. CC HUERS-P1 has Pro tRNA related PBS [1] analogously to MMLV, HERVR CC (BaEV), HUERS-P3 and HUERS-P2. It has 4 bp target site CC duplications like all the other members of MER4I-group [2]. CC Individual sequences are on average 90% identical with the CC consensus sequence (some subfamilies are only 5% divergent from CC the consensus) [3]. 
XX SQ Sequence 6263 BP; 1727 A; 1122 C; 1367 G; 2033 T; 14 other; aatttggggg ctcgtccggg attgcccttg tggctacctg cccgtggttc ggtagccccc 60 ctccggcgat ggatccagag gccagcccaa gtggccgcct agttctcttg gactgggggc 120 tgactctggt actctctcta ccggcggggc gctgccgacc caatgtgcat ggatttaatt 180 gcaatggaga aatagtcctg gggagacgtc ccntaactgt agccctatca cagggtgtct 240 gtctgtagcc ccatggcggg gtgtctgtct gtagccccat tgcggggtgt ctggattggt 300 gagtatccta ggcgctgcca acgcctcctt ccttctcccg actggtttgt agccctatgg 360 tggggtgtct gtagccccat cgcggggtgt ctgtttgtag ctccaccatg gggtgtctgt 420 gtctgtagcc ccattgcggg gtgtctgttt gcagctcctg gggggtctcg gttggctctt 480 cctaactagt aggaagagtc ttggtttggg agacttctcc tcaatcagga agatttcggg 540 gaggtttctc agacggagaa taggaggata gtttggaagg gatactcttg gagttcttgg 600 ttagggatct gatttggaag gccttctgtc cgtctcgtct ttgtgtgtgt ttgtatatgt 660 ggaggggatc tcagaaggag ttgctgatgg aagtccagca ggcctaactc agagaaccct 720 ccttatttgt ctggtcacat tcggtgagcc ctaaagaaag ctcaacaggc ctgtctcggg 780 gtgactatct gctcttcgcc ttgcccagag accccattgt gaattaccgt tcggaggtcg 840 tccctcccca cctggagtgg atcaaagaca acagggacca acgggaaaaa gtttgagctt 900 tgccaggttg atattgggtg ctgaacgagg tgactagtgt ctgttttgtt atgtgtattt 960 tgctgggatg gaaaatgtta attcggttcc ccatgcagcc cattgggcag catcttgcaa 1020 attaagaatc ttgcctatgg ttccataaaa cagnaaaggg tgattttctc ttgtaaagtg 1080 gcttgaaccc cacagctatg gcacaagcga gcagggtcat cagaagccgc tccgttcttc 1140 tggaagctgc agagaaaggg aacccggaaa cctggtatgc cagcaaaaag ggtaagaaat 1200 tcttaccagc caagtttctg gtctctctct ctctctttct ctgtctgngt aaaacagtaa 1260 actatttgtc tcctctgcaa gggtttgatt aatagaaaaa aggatttgtg agactagtct 1320 taggctgtag caaatctggt gtactttgtg ctaagaattt gtctttctgt gttctgtaat 1380 ggagagaggg gtatcacagg atagaacgtg ggtttaggac ccctataagc ctgcttttca 1440 agccagctcg gcaggctggt cagttacaaa ctttgctacg ggtccctgaa accaataccg 1500 tatgaaattt ctctgtcttg ttttgtgtcc ttaagagctt aaccttgtga ccatgtgggg 1560 atactttctc ttggtttcca ccatccagag gacaggaatt ttggggttca tgtcatagtt 1620 agccctaaaa atttttcttg agcagttaaa 
agcctttgca agcttgaaat tggcttctct 1680 aggctccttc tgggaaaagc aatagaaact gctcaatgct gtatagctca gtagctaagg 1740 ctttatcttt tgacagtggt ggcctgggtt caattgttgg cttctggaat gattcctttc 1800 tggtttgtta tttgtgtaac tttgccattt attgaggttt cttcccccca tanttagctt 1860 ctgatttcct ctcttgaatt ttcctttctc tgaactacct tgnggagatt ctaaatcttg 1920 taaaaaagaa actgcttacc atgtctttga agcacctggg aggttacctt tggtaaagtt 1980 cagaagccag aaatattggc cgcttggcat ggctaaagtc gggtaataag agatctgaaa 2040 ggatttcttt tttaaagagc actatggtta aaagtcagct taattaaaag tggataaaca 2100 agctatagat atatttaaaa ggcctttatg tttttctctt cttgganctt gtttttctgg 2160 aaaaaggttt tttcttctca gtcgactgaa ttatttttct ccattttttt gtcttgccac 2220 tcttaatgca cacatgagag gccctaagat aacttctggt agcctgggac tcattgggaa 2280 aaacagagga ggcgccacag accccgtttt gggaaaaaaa aaccctctgt tttcctcatg 2340 aaaccccagg aattaaaagc ggatagatcc ctctcaaaat caaaggctct gttctgtttt 2400 gcattgtgtt atctgacggt tttgagtttt gggggtatca gaaattactt cgcattatga 2460 gagagctttg gtgtgtaata actaggtagg aaatatactt taagggatgg ctaatagtag 2520 ttatggaggg atacttgact ctttgcacac ttggatcaga gaagcatgct cttggccacc 2580 tggaagataa ggaaacatcc ccacccccca ctgggagatg agactcccat gagggatggg 2640 ctgattacaa aatgggctga ttggctttgg gttgccttgc aatgaaatgc agggtagaag 2700 cactgcactg tcttctcccg tagtatttcc ctccttttgg ggatccagga tccagtataa 2760 aatggcaccc ttaattttgg ggatctgtct ttgccttcag ctgcttattt gctgcttatt 2820 tggccctaga aatgcatgct ttcctggccc tgttcctcca agggctccac cctgaagcca 2880 gtaatccaat taagaaactg gcaaatgaaa aatcttacaa gtgctgaatc ttctgtctgt 2940 gtgtatttat atgtgttgta tgtttatata taaaagagct ctgattaatt ggcttagaaa 3000 aataagcgct taaatcaaat attttgtcag aaaaatagaa actttaatgc ctttttgttc 3060 acatgacttt agtaatcttt tggaaataaa gacagtttta aagattattg gtaaaataaa 3120 atgtcttgaa aatgtagaca tttggtctaa attaaggtca gatatcagat ttgctaaatg 3180 ctttaaggtc aaactgtttc tttgactttt gaaaattgtt cgatttacct actttggagc 3240 attagattat agataaggcc tggggacata tggagagcca tgcccnctag ctatgctgaa 3300 aagagtcaga ccttatcttc acttctgtct gatgtcctag 
gctccacccc tagtacataa 3360 ttaaaatcgc ttacttatca ggtttttcac taaaaataaa agttgctaag agttaacatt 3420 gtaacatgta attgagacca ctggagaaac agttttacat acaaggtgtg tagggaatgt 3480 gtttttggta aaagattata agaaggcatg ggaatatggc ttttgttaaa gggaatgtaa 3540 ttttgtctag ttcagagggt tttaaagatt gtcttaacct aaaagagtaa tgggacaaaa 3600 ctgaaggttt aagcaaagtg aaaagggttt gtaaagggtt gatcttgtaa aaaaagttct 3660 gtgggtataa acaagttggc taagatttga aagaaattat ttagcttttt ttccataggt 3720 taaaacatta aaatcatact gatgtggggc cagaatctgg gcccatgtgt ccgaataaca 3780 gggttttctt agaaaattga tctgctgttt gatggaaaat tgtaaagggt tctaaaaagt 3840 ttatgaaaat cttaccttat ggtcaaacta attaaaactg gatagattta taaaatttta 3900 tttaaaaact agctttaaca ttaaagatgc actaatgcaa acatgaaatt tggttttctc 3960 ttttgaagan gatttttatg taatgttaaa agataatgaa agggttttgt tttccccttt 4020 gggtaaatgg cagggaaaaa agggaggaga gagagaagag acagattcag ttggcctcat 4080 gctatcttca ttgggtcttg tttggaaagc taagtctcct ctatcagagt aaaggttttt 4140 cttttttaaa aanatttttg gagttatcat tttggccaaa tgaatgactt atggtgacct 4200 gggattctat tttgtgatat ccagtgtttt aaacctttga tatttgacaa actttccaaa 4260 atcaaattat aaattatgtc tctttctaac ctaatatttt agatattagg tcctctaaag 4320 tccaaaaatg acatttggct tatttggtat aaaaatcata caggaagcat tgtcaaatat 4380 gaaatggtgt ttggctttct ttgggctata tttgtgtaaa tgtgttattg gtatatgttc 4440 caaaattatg taaaactcct ataattctaa tatgacttag tatatgttat cagtaataat 4500 tataattatt atgttaaatg actgtgtgcc acagaggtaa caaatttcct tgtcaattgt 4560 gtctttaact gtggctgccc taaaatgttt ttgtcatcca cagacaattg ttgtctcgct 4620 ttggtcctct ttaaaagatg gttttataat cagctataaa atttaacagg tgctcttaaa 4680 tgcaggtttc tgattaataa cttggagatt gtgacattag aatagaggaa aaaactttca 4740 aatagaagag tgaatggtgt ttggttttct ttggactgta tttgtataaa tatgttatta 4800 gtatgtgttc caaaattatg ggaaacttct ataatgctga tatgatttag tgtacattat 4860 taataattat aattgttatg taaaattgtt gtatgccaca gaagtaacca aaattcctag 4920 tcaattgtag ctttaatagt ggctatagac ttttgtcatc cacagacatt ttgtcttgct 4980 ttggtccttt tcaaaaggca gtttataatc agatataaga ctctgagtgc 
aggtctcaga 5040 taactttaaa aattgtgcta ttncaaactt ctaggactct catggagagc tgatgtgtta 5100 aacattgcta anccttttgt tttcagagtc aaganaactt atttctttag agctatttgc 5160 aacttttaac aagtgagtaa aatatactcc tgtgaacaaa atttggagca tatttgttcc 5220 tctctacctg atttctccag aatttggaaa ctatttgtga gtattctcaa tttatggcag 5280 tatagttaat tgcataagtg caataanaat ctgttttctt ttgtaacagg acacaattgg 5340 agaaattggt tattttacca aggctttgac tggaatggca tgcttccttt aaagaatcaa 5400 agttgactta tagagccaat taaagcccgt tggggaatct ggcctcatac cttgtccaca 5460 cagagtccct gtacaaggtt cctgacctgt ggtaagtaaa gaatgtcact ttctaacagg 5520 cccaggaacc ccaagttatc ttgggacctc aagaggagag gaatttgccc aactcatagg 5580 tatttgaggg tacaaaccca tggctgggct cggcttttaa aaagtcttat ctgagattct 5640 tcacggaaca gagttccatc aaagccaatt tnaaaagcct aagtgaaaaa taattattct 5700 tgctgcactt catgcaaata atcaggccaa gtacagtaag actaaagttt attttgtaaa 5760 caaatcagtt ctatcatgat ttgtttttaa taaaaatggg gactggagag agaaaaatta 5820 tgcttcaaaa gaaaaactat agtacacctg ttgttagctg ttcttgaggt tttttctgca 5880 gtttagacta aattctaaat tctttgtggg ttagaagtcc ccaaactaat gctttcaaat 5940 ctttgctttt aaaattggga attgtactcc tcatcctagg actcattatt taccttatag 6000 taggctgttc acttaaacac tgtagtaaaa ctatagatga gaatactaat gtttttgcca 6060 tgcaagcctt ggaagcccag ccaggcctgc atgagtacgc tcagacagtt gcaaagcggt 6120 tccactcttc tcaccttggg gttcactccc attcccacta cgtcccctgt cagcaggaag 6180 aagccagagc gatcgacggc cttttcccat cttcatagcc tacaccttaa gattaaggtg 6240 ttataaaacc caaagggagg gat 6263 // ID MacERVK1_LTR1 repbase; DNA; PRI; 334 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Cercopithecidae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; LTR; KW MacERVK1_LTR1. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. 
XX RN [1] RP 1-334 RA Smit A.F.; RT "MacERVK1_LTR1 - ERV2 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 4%. XX SQ Sequence 334 BP; 92 A; 79 C; 86 G; 77 T; 0 other; tgtagaggac tacgtgctcg caaacgagac gttcccgata agtcctgctc ttgcaaacga 60 agcagggcgt tccttccctg caaacaggga ggacaaagga gccagctgca aacagcagac 120 cctgggggct tgtttatgtg taaacatctt gaaaatccag aaagtcaggg aaaggtcaga 180 aaaacaacaa tgtgtcttgt gacttggcaa cattccacaa acgactgtat aaaataaagc 240 agagcgcgcc gttcggggcg gccgccatgt ttgtctcgtc ttgtgttgtc ttgtgtgttc 300 attcctttgt ttaggaaaca cgcggacccc aaca 334 // ID ALRY-MINOR_PT repbase; DNA; PRI; 171 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE Minor repeat unit of chimpanzee alpha repetitive DNA from the Y DE chromosome centromere - a consensus. XX KW SAT; Satellite; Simple Repeat; ALRY-MAJOR_PT; ALRY-MINOR_PT; KW Repetitive sequence; satellite DNA. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-171 RA Hughes F.J., Skaletsky H. and Page C.D.; RT "ALRY-MINOR_PT."; RL Repbase Reports 4(12), 310-310 (2004). XX DR [1] (Consensus) XX CC The major repeat unit found in the chimpanzee Y chromosome CC centromere CC consists of 28 copies of the minor 171 bp unit. XX SQ Sequence 171 BP; 50 A; 29 C; 36 G; 56 T; 0 other; cattctgaca aacttctttg tgatgtgtgc attcatctca cagagttgaa cctttctttt 60 gattgagcag ttttgaaaca ctctttttgt agaatctgca agtggacatt tggagcgctt 120 tgaggcctat ggtggaaaag gaaatatctt cacataaaaa ctagacagaa g 171 // ID ERV1-3_TSy-LTR repbase; DNA; PRI; 544 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. 
XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-3_TSy-LTR; ERV1-3_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-544 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1199-1199 (2010). XX DR [1] (Consensus) XX CC ~94% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 544 BP; 124 A; 116 C; 139 G; 164 T; 1 other; tgtaagaggt gatgaattgt gtggaacctg tgtgaagcct tatatattaa gtgtgatgaa 60 ttatgtgaaa tgtaatgaaa ctataacctc atatattaag tatgctgaaa ctgtgtgctg 120 tgacctctcc cctaaaactt agtaattctt ggaaaccctg gaaagaacgc cctacttgga 180 agccccaagt ggctatgtta gataagttaa ccagcttttt ctgcttctgt aacccccgct 240 ggcagtgtgt gagtgttgtg cgataagccc ctttcaaatt ctggcgggct tttctctgtt 300 tkaattgtgg cgggcaagcc cctcttctgg cgggctttgt gttgttcgaa tttgaattat 360 aacggccttg tggattaggc gggcttttgc atatttaaga aaggccccag ccgcagtgtg 420 gggccctcgc ctagagactt gtgagtctgg tgggtggccc tggccagtcg aacctggcta 480 ataaacctct ctcttggtaa attgtgtgtc ggtgccgtga atcaaacgct cggaccccac 540 atca 544 // ID LTR5_RM repbase; DNA; PRI; 717 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR5_RM. XX OS Macaca mulatta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Cercopithecidae; Cercopithecinae; Macaca. 
XX RN [1] RP 1-717 RA Jurka J.; RT "Long terminal repeats from Rhesus macaque."; RL Repbase Reports 11(5), 1629-1629 (2011). XX DR [1] (Consensus) XX CC ~95% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated by the CC Broad Institute Mammalian Genome Project. XX SQ Sequence 717 BP; 173 A; 160 C; 167 G; 217 T; 0 other; tgtggggaaa agaaagagag atcagcctgt tactgtgtct atatagaaag aagtagacat 60 aagagactcc attttgttct gtatttgaga tgctgttaat ctgtgaccct acccccaacc 120 ttgtccttgc aagagacatg tgctgtggtg actcaaggtt taatggattt tgggctgtgc 180 aggatgtgtc tttgttaaac aagtgcctga aggcagcttg ctggttaaaa atcctgcccg 240 tccctgggca atggaacatc tcggtgtaaa acccgattgt atgctctgtt tactgagata 300 ggagaaaacc gccttacggc ataaggtggg acttgctggc gggacttgct ggcgcaatgc 360 tgctaagagg tttatggaga tgtttgcata tgcatatcaa ggcacagcat tttcctttga 420 acttattcat gtcacagaga tctttatcca tatgtcttac tgctaatttt ctccctaaaa 480 tgatcctatt gtcctgccac tcccttatct ttaagatggt aaagataatt atcaataaat 540 actaagggaa ctcagagacc ggtgccggcg tgggtcctct gtaagctgag cgccggtccc 600 ctgggcccac tttttctttc tctatacttt gtctctgtgt ctcatttctt ttctcaagtc 660 tctcgttcca cctaacgaga aacgcccaca ggtgtggagg ggcaggccac cccttca 717 // ID MacERV2_int repbase; DNA; PRI; 7289 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MacERV2_int. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-7289 RA Smit A.F.; RT "MacERV2_int - ERV1 Endogenous Retrovirus from Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC chr2.nib:121220437 2% or less. ORFs: gag 347-1990, pol CC 1991-5414, env 5577-7229. 
XX FH Key Location/Qualifiers FT CDS 347..1990 FT /product="MacERV2_int_1p" FT /note="gag." FT /translation="VLSVSPKRNIKPFLPLPGPDLSDTPLCFTLIFVPPSL FT CRSSPKMGNNQSTPLSLLVTNFKDVKARGRNLSVELKKGKLVTFCRSEWPS FT FGVGWPSEGTFCLSIITKVKTKIFLPGQSGHPDQIPYILVWQDLVENPPPW FT ITPFIPEPCKVLVTRPTRQRTPSAPSAPVLPDSQDPLTLEPTLPPPYPHLI FT PQDPVPQGGASVEGNREAEAAESESNPGGPAGRTRGRVQRDQASRLPDSTV FT ALPLREIGSLDDTGLSRLMYWPFSTSDLYNWKSQNARFSDNPKDLTSLLDS FT VMFTHQPTWDDCQQLLRILFTTEERERIQVEARKLVPGDDGQPTANPDLIN FT AAFPLTRPRWDYNTAEGRGRLLIYRQTLMAGLRAAARKPTNLAKVYSVLQG FT KTESPAVYLERLMEAFRQFTPMDPEAPENQAAVVMSFVNQAAPDXKRKLQK FT LEGLEGKQXQDLLQMAQRVYNNRDTPEEKQFKATKEMTKVLVAAFPQAGNG FT QNRKQKKQGPRQGLEKDQCAYCKERGHWIKDCPKKRRPANPTSVLVTQDSD FT *" FT CDS 1991..5415 FT /product="MacERV2_int_2p" FT /note="pol." FT /translation="GGRGSDPLPEPRVTLQVEGSPVRFLVDTGAERSVLTK FT PIGKMSKKTSWVHGATGIKKYPWTTQRTVDLGTGKVSHSFLVIPESPCPLL FT GRDLLTKMGAQIHFEPEGPKITDSQNRPISILTVTLEDEYRLHQEQKPPDQ FT EIDSWLQRFPEAWAETGGLGLAKHRPAIFVEVKPGTDPVRVRQYPMPLEAR FT EGITPHIRRLLDQGVLRACHSSWNTPLLPVRKPNSTDYRPVQDLREVNKRV FT MDIHPTVPNPYTLLSALHPEKQWYTVLDLKDAFFSLPLAPKSQELFAFQWT FT DPERGINGQLTWTRLPQGFKNSPTLFDEALHEDLGEYRRQHPEITLLQYVD FT DLLIAARSPETCVQGTEDLLKTLGELGYRASATKAQICKSEVTYLGYLLKG FT GQRWLTKARKETVLRIPRPQSTRQVREFLGSAGFCRLWIPGFAELAKPLYQ FT ATKERQPFNWTEEAELAFQQIKTALLSAPALGLPDVSKPFHLYVDESKGVA FT KAVLTQYLGPWQRPVAYLSKKLDSVAAGWPPCLRIIAATALMVRDADKLIM FT GQELRVITPHAIEGVLRQPPDRWMSNARLTHYQGLLLNPLRITFLPPTSLN FT PASLLPNPDLDAPSHECTEILAQVHGVREDLQDRPLPDTELIWFTDGSSYV FT HQGQRYAGAAVTSETEVIWAEPLPPGTSAQRAELIALTQALTLGAEKKLTV FT YTDSRYAFATAHIHGAIYRERGLLTAEGKEIKNKQEILALLTALWKPKKLA FT IVHCPGHQKPTTPIARGNFLADQTARSIAKAPSQLLALQLPDPGPRXLPCF FT PDYTEQDREWMDTLPLKQVKNGWWTDINDQTILPEKLGRQVLEHIHRTTHL FT GARRMIDLIRHAKFRIRRIAELASDVTTNCKACQLNNACPQTQTAAGIRCR FT GTRPGIYWEIDFTEIKPGKYGYQYLLVFVDTFSGWTEAFPTKRETAQVVAK FT KILEEILPRYGFPVQIGSDNGPAFVAKVSQDLASILGANWKLHCAYRPQSS FT GQVERMNRTLKETLTKLTMETGANWVVLLPYALFRARNTPYRLGLTPYEIM FT YGRPPPLVPSLKDDLLKPETENVSELLFSLQALQKIHQEIWPRLRELYEAG FT 
PPPTPHPFQPGDWVLVKRHRQETLQPRWKGPLQV" FT CDS 5577..7229 FT /product="MacERV2_int_3p" FT /note="env." FT /translation="PCAVIPMAVTMSSSPMLLCLMHLTLLTAAPSNPYVWR FT FWLYENKTHPGETPQAGKLLASADCPPSGCNTIVYLNFTRFQIAQPAIPMI FT CFEYDQTEYNCKNYWWHQSAGCPYSYCKTHSVRYRREQQGWYFYQTESPVT FT YTWIIRDPWDSRWTTPQHGGVYYSSTSTWPSSHLYLWRSLVQIQPLIHTQI FT HRQETKLSQDLQPFSWLTLLQEGLAFANLTGLGDLSSCFMCATLGRPPLTA FT VPLSSPPTNYTGFNSSIVTIPDVALYRDPYQEKYPYCYSAFSNSLCNQSAT FT YPNSPIYAPPGVFFWCNMTLLKTLSKSISDGQLCIPVTLVPRLTLLTPAEF FT IGWAGTAPTETARHRRAVFLPLIAGISLTTSVVAAGLAGGALQHSMIENDK FT LLQQFSVAMEDSAYSLASLQRQLTSLAQVTLQNRRALDLLTAEKGGTCMFL FT KEECCFYINESGLVEERVQRLHKLSLEMKKQQFTTAANNWWSSSMFSLLAP FT LLGPLMSLLLLFTIGPCVVNKILQFVRERFDTIQLMVLRSHHQPLLHPESE FT ATL*" XX SQ Sequence 7289 BP; 1967 A; 2020 C; 1688 G; 1607 T; 7 other; taacaaatgg gggctcgtcc gggatttccc ccaagcccac cagaccccca agtcaatgga 60 tcttaatctg taggtaagcc ggctttgtca ctgtctctgt ctttgtctct gtctgtctcc 120 ctgaaatttc gcgaaatccg taatctgtaa tctgtattgg tctgttctgt aagtggcacg 180 gtcttggcgg acgcgctaaa agacagccac tcgggactgg tgggagacgt cccccagtgc 240 ccatctgggg gcctccatcc ttggttgccc cgtctgactc aggtcggaag tccgacacgc 300 gctcgcgaag gctcatcgtt tntcactgtc tgtccgttat tgttaagtgt taagtgtaag 360 tccaaaacga aacataaaac cttttcttcc ccttccagga ccggacctgt cggatactcc 420 cctctgcttc acactcattt tcgtccctcc ttctctttgt aggagcagtc caaagatggg 480 caacaaccag agcacgccgc tctcactcct tgttaccaat tttaaggatg tgaaggctcg 540 ggggcgtaac ttaagcgtag aactaaagaa gggaaagcta gttactttct gccgttcgga 600 gtggccctct ttcggcgtag ggtggccctc cgaaggaact ttctgtctct ctattattac 660 taaggtaaaa actaagattt tcctgccagg gcagtcagga catcctgatc aaattcctta 720 tattctagtg tggcaggatc ttgtggaaaa cccaccgccc tggataactc cattcatccc 780 tgagccctgc aaggtcctag taacgcgacc tacaagacaa aggactccct ctgctccctc 840 ggcccctgtg ctgccggaca gccaggaccc cctaacgtta gaacccacnc ttcccccacc 900 atatccgcac cttatcccgc aggacccagt cccccagggt ggtgcttccg tagaggggaa 960 ccgagaagcg gaagcagccg agagcgaaag caacccgggg gggccggccg gccgaacgcg 1020 aggccgcgta cagcgggacc 
aagcttcccg actccctgac tccactgtag ccctgcctct 1080 cagagaaata ggatcgctcg atgataccgg gctctcccgt ctcatgtatt ggcctttttc 1140 cactagtgat ttatacaatt ggaagtccca aaatgctcgg ttctcagata atcccaaaga 1200 tctaacctcg ttgttagaca gtgtcatgtt tactcaccag ccgacctggg atgactgtca 1260 acagctcctc cgaatcctgt tcacaacgga ggagcgggaa agaatccaag ttgaggccag 1320 aaagctggtt ccaggagatg acggccagcc aactgcaaat cctgacctca ttaacgcggc 1380 tttccctttg acccggccca gatgggacta caacacggca gaaggtaggg gacgactgct 1440 catttatcgc cagactctaa tggcgggtct ccgggctgca gctcgcaagc ccaccaattt 1500 ggctaaagta tattctgtgt tacaaggtaa gacagaaagc cccgctgtgt atttagagag 1560 attaatggaa gccttcagac agtttacccc tatggacccg gaagcaccgg aaaatcaggc 1620 agcggtagta atgtcctttg taaaccaggc agcaccagat atnaagagaa aactccagaa 1680 attagagggt ttagaaggaa agcagatnca ggacctcctt cagatggcnc agcgagttta 1740 taataatagg gatactccag aagaaaaaca gttcaaggca accaaagaaa tgaccaaagt 1800 cttagtagca gctttccccc aggcaggaaa tggccaaaac agaaagcaga aaaagcaagg 1860 ccctaggcaa ggattagaaa aagatcagtg tgcttattgc aaggaacgtg gccactggat 1920 taaagactgc ccaaagaaac ggagaccagc taaccctacc tccgtactcg tcacccagga 1980 ctctgactag ggaggacggg gttcggaccc cctccccgaa cccagggtaa ctttgcaagt 2040 ggaggggtcc ccagttcgct tcttggtcga taccggggcg gaacgctcag tcctaaccaa 2100 gccaattgga aaaatgtcta agaaaacatc ctgggtgcac ggggccacag gcatcaaaaa 2160 atacccctgg acaacccaga ggactgtaga cttaggaacg ggaaaagtct cccattcctt 2220 cctagtcatc ccagagagcc cctgccctct actagggagg gacttgctga caaaaatggg 2280 ggcccaaatc cactttgagc cagaagggcc caagataact gattcccaaa acaggccaat 2340 atctatcctg actgtcacct tagaagatga gtacagactc catcaagagc aaaagccccc 2400 cgatcaggaa attgactctt ggctccagcg tttccctgaa gcgtgggcag aaactggggg 2460 cttagggcta gctaaacatc gacctgccat atttgtggaa gttaagccag ggacggaccc 2520 ggttcgggtt cggcaatatc ctatgcccct ggaggcaaga gaaggaatca cgccccatat 2580 ccgccgactc ctagaccaag gagtcctacg ggcttgccac tcatcctgga atactccact 2640 gttacctgtt cgtaaaccca acagtacgga ctacaggcca gtacaggatt taagagaagt 2700 taataaaagg gtcatggaca tacatcccac 
tgtgcccaat ccatacaccc ttctgagtgc 2760 cttacatccc gaaaaacagt ggtataccgt tcttgaccta aaagacgctt tcttcagcct 2820 tcctctggct cccaaaagtc aagaactctt tgcatttcaa tggacggatc ctgagagggg 2880 catcaacggt cagctaacat ggaccaggct gccacaaggg ttcaagaact cgccaaccct 2940 gttcgatgaa gcccttcatg aagatctggg tgagtaccgg cggcaacacc ctgaaataac 3000 tcttttgcaa tatgttgatg atctcttaat agcggccagg tccccggaaa cttgtgttca 3060 agggactgag gacctcttga agactctggg ggagctaggc taccgagcct cagcaacaaa 3120 ggctcagatc tgcaagtcgg aggtaactta tctggggtac ctattaaaag gggggcaacg 3180 ctggctaacc aaagcccgaa aggaaacagt cctacgcatc cctagacccc agtcgacacg 3240 gcaagtgaga gaattcctgg ggtcggcagg gttctgtagg ttatggatac ccggatttgc 3300 tgagttagcc aagcccctat accaggcaac aaaggaacgg cagcccttca attggacgga 3360 agaggctgag ctggcctttc aacaaatcaa aactgccttg ttatcagccc ccgcgctggg 3420 gctccctgat gtctccaagc ccttccactt atacgtggat gaaagtaagg gtgttgcaaa 3480 agccgtgtta acacagtacc taggcccctg gcagaggcca gttgcctatt tgtcaaaaaa 3540 attagattca gtggctgctg gctggccacc ctgcctgcgg ataatcgcgg cgaccgccct 3600 aatggtccga gacgctgata aacttatcat ggggcaagag ttgcgcgtta taaccccgca 3660 tgccattgaa ggcgtcctcc ggcagccgcc ggaccgatgg atgagtaacg cccggctcac 3720 ccactatcaa ggactgctgt taaaccccct cagaataact tttctgcccc caacctcttt 3780 aaaccctgct tcactgctgc caaatccaga cttggacgcc ccgtcccatg agtgcactga 3840 gatactggct caggtgcatg gggtgcggga ggacctgcaa gatcgtccgc tccccgacac 3900 agaactcatc tggttcactg atggcagcag ctacgtccac caaggccagc ggtatgcagg 3960 agcggctgta acgtcagaga ctgaggtaat ctgggcggaa cccttgcctc cagggacatc 4020 ggcccaaaga gccgaactga tagcactcac acaggccctc accctagggg cagagaaaaa 4080 actaacagta tacacggata gccgctatgc tttcgctact gcgcacatac atggggccat 4140 ctacagagag aggggactat taacagccga aggcaaagaa ataaaaaaca agcaagagat 4200 cctagctcta ttaactgctt tgtggaaacc aaagaaattg gctatcgtac attgccctgg 4260 gcatcagaaa ccaactacgc caattgctcg aggcaacttc ttagccgacc aaaccgcaag 4320 gagcatagca aaagctccca gtcagctcct cgcgctccag ctccccgatc ctggcccgcg 4380 aaanttgcca tgttttcccg actataccga acaggatcgc 
gaatggatgg acacgcttcc 4440 cctaaaacag gttaaaaatg ggtggtggac tgatataaat gaccaaacca tcctgccaga 4500 gaaattagga cggcaggtgt tggaacacat ccaccggaca acccacctgg gtgcccggcg 4560 aatgatagac ttaatcaggc acgccaagtt cagaatcagg cgcatagctg aactggccag 4620 cgatgtcaca acaaattgca aagcctgcca actgaacaac gcttgccccc aaacccagac 4680 cgccgcagga attaggtgta gaggaactag gcctggtatc tattgggaaa tagactttac 4740 tgagataaag cctggaaaat atgggtatca gtacttactt gtctttgtag atactttttc 4800 aggatggaca gaagcgttcc caactaagag agaaactgct caggttgtag caaagaaaat 4860 tttggaagaa atcctcccca ggtatggctt tcccgtccag atagggtcag acaatggacc 4920 agccttcgtt gctaaggtaa gtcaggattt ggcttccatt cttggggcaa attggaaatt 4980 acattgtgct tataggcccc aaagctcagg acaggtagaa aggatgaatc ggactctaaa 5040 ggagacctta actaaattga ccatggagac tggcgctaat tgggtagtac ttctccccta 5100 cgctctgttc cgggcccgaa atacccctta cagactgggc ctcacaccat atgaaatcat 5160 gtatggcaga cccccacccc tggtccccag tctaaaagat gacttactca aacctgaaac 5220 tgaaaatgtc tctgagctct tgttctcctt acaagcctta cagaaaattc accaggaaat 5280 ttggcccaga ctacgagaac tgtacgaggc aggacctccg ccgacgcctc atccgttcca 5340 gccgggagac tgggtcctgg tcaagcgcca ccggcaagaa actctccaac ccaggtggaa 5400 aggaccactg caagtactcc tgactactcc caccgctctc aaagtagaag gcatcgcttc 5460 gtggatccac tacacacacg tcaaaccggt ggacccanca tccgaccttc tcgagccgtc 5520 tggagcaccg gttacatgga ctgtggacaa agctaagaac aatcccttaa agctaaccct 5580 gcgccgtcat ccccatggcc gtaaccatgt ctagctcacc tatgctttta tgccttatgc 5640 atctgactct cctcactgct gcgccgtcta atccctatgt ctggaggttc tggctctatg 5700 agaacaaaac tcaccccggg gaaacccccc aggcgggtaa actgctggct agtgcagact 5760 gccccccctc agggtgtaat actatagttt acctcaattt cactaggttc cagattgccc 5820 aaccagcgat acccatgatc tgttttgaat atgatcaaac tgaatataat tgtaaaaatt 5880 attggtggca ccagagtgca ggttgcccat atagctactg taagacgcac tctgtccgat 5940 accggagaga acaacaaggg tggtattttt accaaactga gtctcccgtc acttacactt 6000 ggataattag agacccatgg gactctcggt ggacaacccc acaacatggg ggagtatatt 6060 actcatcaac tagtacctgg cccagtagcc atttatatct gtggagaagt 
ctagtccaga 6120 ttcaaccctt aatccataca caaatccaca ggcaagaaac caagctatca caagacttgc 6180 agcccttctc ctggctaacc ctactacagg aagggctagc attcgccaac ctcaccggtt 6240 taggcgacct gtcctcttgc tttatgtgtg caaccctagg aagaccacca ttaacggctg 6300 ttcctctatc ctcccctccc acaaactata caggttttaa ctcgtcaata gtcacgatac 6360 ccgatgtggc cctgtaccgt gacccatacc aagagaaata cccctattgt tattcggcat 6420 ttagcaacag cctctgtaac caatcggcta cataccctaa tagtcccata tatgctcccc 6480 ctggtgtgtt cttctggtgt aatatgactc tgttaaagac tctgtccaaa agcatctctg 6540 acggacagtt gtgtatccct gttaccctag ttccccgact gacactgcta accccggcag 6600 agttcatagg ctgggcaggg accgccccaa ctgaaactgc gcgacataga cgagcagttt 6660 ttctaccact aattgccgga atatccttaa ccacctctgt cgtcgcagcg gggttagcag 6720 gaggggccct acaacactcc atgatagaaa acgacaagct gttacaacaa ttctctgttg 6780 ctatggaaga ctccgcctat tccctagcct cccttcagcg gcagctcacc tccctagcac 6840 aggtcaccct tcaaaaccgc agagccttag acttactcac ggctgagaaa ggaggtacct 6900 gtatgttcct caaagaagaa tgctgtttct atataaatga gtcagggtta gttgaggaaa 6960 gagtccaacg cctacacaaa ctaagcttag aaatgaaaaa gcaacaattc actacagccg 7020 ctaacaattg gtggagctct tcaatgtttt cccttctggc acccctttta ggccccctaa 7080 tgtctttact acttctattc actataggcc cctgtgtggt aaataaaatt ttacaatttg 7140 tcagagaaag gtttgatacc atccaactaa tggtcctccg tagccatcac cagcctctcc 7200 tgcacccaga aagtgaggct acattataag actctggaaa tccaagattg gacatccagt 7260 tacttatggg agggggaaat gaaaggaca 7289 // ID HSATII repbase; DNA; PRI; 170 BP. XX AC X03460; XX DT 01-OCT-1995 (Rel. 1.09, Created) DT 04-AUG-2008 (Rel. 13.07, Last updated, Version 3) XX DE Human satellite II DNA. XX KW SAT; Satellite; Simple Repeat; HSATII; KW Satellite repetitive element. XX NM HSATII. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 21-79 RA Prosser J., Frommer M., Paul C. and Vincent C.P.; RT "Sequence relationships of three human satellite DNAs."; RL J. Mol. Biol 187, 145-155 (1986). 
XX RN [2] RP 1-170 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR GenBank; X03460; Positions 1 59. XX CC CC [2] general. XX SQ Sequence 170 BP; 38 A; 42 C; 24 G; 66 T; 0 other; ccattcgatt ccattcgatg attccattcg attccattcg atgatgattc cattcgattc 60 cattcgatga ttccattcga ttccattcga tgatgattcc attcgattcc attcgatgat 120 tccattcgat tccattcgat gatgattcca ttcgattcca ttcgatgatt 170 // ID MacERV6 repbase; DNA; PRI; 7386 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; MacERV6. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-7386 RA Smit A.F.; RT "MacERV6 - ERV1 Endogenous Retrovirus from Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 3% subst ORfs: gag 141-1886, pol 1887-5411, env 5584(ATG)-7323. CC The gap between pol and env seems unusual, but shows no CC similarity to known ERV proteins. There is a second subfamily, CC about 10% diverged, few copies to work with. XX FH Key Location/Qualifiers FT CDS 141..1886 FT /product="MacERV6_1p" FT /note="gag." 
FT /translation="MGTSQSKIPRNTPLGCLLQNLEALHLAQDLKRKRLIF FT LSTVAWPQYKLDNQSQWPPDGTLDYNILLDLSNFCQRLGKWSEIIYVQGFW FT DLRSRPDLCAQCSMAQVLLVKTQPSAPTPQAEEDSTSISDEADGGAPPSRP FT TNPPPYAPPTAPPSATPLTSPISAHTRSKTAVAAPTSQPSTSQPAPTPSTG FT QPASTPSTGQPVSTYPTGQSVSTSPVSLLAPLREVAGAEGLVRVHVPFSLA FT DLSKIEKRLGDFSANPTLYSKEFRYLCQAYDLTWHDLQVILSSSLNPEERE FT RILAAARQHADQWHLTDATVPLGEMAVPSIEPDWDYQPQQPGRRRRDIMVQ FT CLLAGMQAASNKVVNFDKLKEIIQHPDENPASFLNRLTEALAQFTRLDPTS FT PAGAAVLASYFISQSASDIRKKLKKVEDGPQTPIQDLVKLAFKVFNSREEA FT AEAAELDKEKRRAVLQAQALVAALQPALPTLPAGKTQGKSPKGACFRCGDP FT KHWADKCPQAKGSLPSDPCFKCGTTGHWASKCPNPRSPTAPCPVCHQEGHW FT KSDCPAVRTGPAPQRGASPQRSEGSFQLLHLDDD*" FT CDS 1887..5411 FT /product="MacERV6_2p" FT /note="pol." FT /translation="RCPHSETPVTLAEPRVTLQVAGKPISFLIDTGATYSV FT LSSYGGPSQPSTISVVGVDGKPSNPRQTPMLNCCLDYAYFAHSFLIMPSCP FT TPLLGRDILTKLKASINLPSHPLPSSNVTLIMPLCTHNQGPNPPSPVFPVP FT VDPQVWDTSTPVVARHHLPVLIHLKDSTHFPSRPQFSLPLRHRQGIKPIID FT RLLQQQILIPTHSPCNTPILPVKKPSGAYRLVQDLRIINEAVIPTRPVVPN FT PYTLLSRIPPETSHFTVLDLKDAFFSIPLHPDCYFIFAFTWEDPETHVSTQ FT LTWTVLPQGFRDSPHFFGQALAKDLAQCPFRSSTIIQYVDDLLLCSPSHDV FT SLQDTATLLNFLGRLGYRITKQKAQLCLPKVIYLGIELTPVSKSLTTDRLS FT LIQSLXPPSNGEEILSFLGLVGFFRHWIPNFGVLAKPLYQAAKESPTGPLS FT DPKLIATHFTRLKSCLLSAPTLSLPNPLYPFYLFTDERHKIATGLLAQPIG FT TTFRPVAYLSKQLDTTVQGWQPCLRALAAAAELTKEALKLTLGGPLTVHST FT HRLRDLISHKCISHLTPSRIQLFHVLFLENPDIKIATCSSLNPATLLPTPN FT TSHTPEHSCPEIIQLALPTHLHLRDQPLPDPQLTFFVDGSSFVDPHGNRHA FT GYAVVTNTQVIEAKPLPLGTTSQKAELTALTRALHLSEGKTVNIYTDSEYA FT YMIAHTHSVLWQERGFLTTKGTPIINGKHIQRLLEALTSPKEVAIVHCRGH FT QKPTNPIAQGNNLADTTAKSIALSTENPRSLCFLTPQYTPSYSSEEKNKLL FT QNPHATITPDQWIFIHNRVVLPEAQTQHILRDIHNSLHIGHQALYNFLNPI FT IECPSLLSHLKRISQQCLICLKANAQGGIRNAHPSHQLRGHQPGEDWQIDF FT THMPRHKKLRYLLTLVDTFSGWIEAYPTTGESASIVASILIEQIIPRFGLP FT RSIQSDNGPAFISRVIQLVTDSLNITWKLHIPYHPQSSGKVERANGLIKQQ FT LTKLSIETRQSWVTLLPLALTRLRATPRGPTGLSPFELVYGRPLALQELPS FT LPTPLAPYLPYLSLLRQLLREHAERSLPHPTQGNSEAPSALSPGDQVLLKD FT LHAKGLTPKWKGPYTVILTTSTAAKLLGHSSWVHLTQLKRAPPPQTQWQST FT ELSPTRLRITRLNTN*" FT CDS 
5584..7323 FT /product="MacERV6_3p" FT /note="env." FT /translation="MISAMLNLPSTPLLPLLWFTLIIPASLTNPKFVWRFS FT ITETWSTDNQAHSQTQGTADCSPQGCQAALLLNFHLSSVGNYDRPVICFLY FT DQTEYNCKNYWQETNLGCPYNYCNMHEIGLMCANGICTPNDRPFVRNRTSG FT GYILXIKDPWDPRWAQGVKGGLYATSWSSYPTATLQIKRVYVQQVPLPKSK FT QVHPPKSVQALQNLTSVVKSHEQKIQKLLSPPSPPNNEDPFSWLTLIRQGL FT NLTQAAGVRNLSHCFLCAALGKAPLVAVPLPTAFNITTDSTSSSQATSLPQ FT VPLYRNPQSQTLPFCYSTPNSSWCDRTQAPSRTQTAPVGGYFWCNQTLSKT FT LNHTSITQSLCVPVSLVPSLTLYSEGELSELASQLSPSNNIQKRAVFLPLI FT IGVSLASSLVASGLGTGALTHSIQSTQTLSTQVQAAIEASAESLASLQRQI FT TSVAQVAAQNRRALDLLTAERGGTCLFLGEECCYYVNESGLVDTNVQTLNK FT IKKELQQFNAPLTPGPPVWLLPVVQQMLPFLIPILILCLMLCLAPILIKFL FT RARVQEITRVTFNQMLLHPYTQLPTSDPNYAP*" XX SQ Sequence 7386 BP; 1929 A; 2348 C; 1340 G; 1767 T; 2 other; gtgaggcctt caacccggag acgtccgtct tgaagttcct caattctccg ttagtggctc 60 caggagcccc cggtctctag cagcggcaaa ggacgctttg ccactgcgtg aggtcggttt 120 ccatttctgg tggttaaaca atgggaacct cacaatccaa aatccctagg aacaccccct 180 tagggtgtct cctccaaaac ttggaagcct tacatctagc tcaagattta aaacgaaagc 240 ggctaatttt cctctccact gtggcatggc cgcagtataa attagacaac caatctcaat 300 ggccaccaga cggcacactt gactataaca ttctattaga tctctctaat ttctgccaaa 360 ggcttggaaa gtggtcagag attatctatg tccaaggctt ttgggacttg cgctctcgcc 420 cagacctgtg cgcccagtgt tcaatggccc aggtcttgtt agttaaaacc cagccctcgg 480 cccccacacc acaagcagag gaagattcta cttccatctc agacgaggcc gacggcgggg 540 cacctccctc ccgacctacc aacccccctc cctatgctcc tcctaccgcc cctccctcag 600 ccactcctct tacctctccc atttccgccc acactcgctc caagaccgcg gtagcggccc 660 ccactagtca gccctctacc agtcagcctg ccccaacccc ctctaccggt cagcctgcct 720 caaccccctc taccggtcag ccggtctcaa cctacccgac tggtcagtct gtttcaacct 780 cccctgttag tttgttggcc cccctccgag aggtcgccgg agccgaagga ctagttagag 840 tccacgtccc tttttccttg gcagaccttt ccaagattga aaaacggctc ggcgatttct 900 ctgctaaccc caccctatat tccaaggagt ttcgatacct atgtcaggca tatgatctaa 960 cctggcatga cttacaggtc attttgtcct cctcccttaa cccagaggaa agagagcgta 1020 ttcttgcagc cgccaggcaa catgcagatc aatggcattt 
aactgacgcc accgtcccgt 1080 taggagagat ggctgtcccg tccatagaac cagattggga ctaccaacca cagcagcccg 1140 gccgccgtag gagagacatt atggttcaat gtctcttggc cggtatgcag gcagcctcta 1200 acaaagtggt caattttgat aaactaaagg aaatcatcca gcacccagat gagaacccgg 1260 cttccttcct aaatcgcctt acagaggcac tagcccaatt tactcggcta gaccccacct 1320 ccccagccgg agcagctgtc ctagcctcct attttatctc ccagtcggcc tcagatattc 1380 gaaaaaagct aaaaaaggtt gaagatgggc ctcaaacccc catacaggac ttagtaaaat 1440 tggcctttaa agtttttaac tccagagaag aagcagctga ggccgcagag ctagacaagg 1500 agaaaagaag ggctgtgctt caggcgcaag ctctagtggc tgccctccaa ccagcgttgc 1560 ccactctgcc ggcagggaag acacagggca agtctcctaa gggcgcttgc ttcagatgcg 1620 gagaccccaa gcactgggcc gataagtgcc ctcaggcaaa ggggagtcta ccgtccgatc 1680 cttgtttcaa gtgtggcaca actgggcatt gggccagtaa gtgcccaaat cctcgctcgc 1740 ctaccgcgcc gtgtccggta tgccatcagg aagggcattg gaagtctgat tgccccgccg 1800 tcaggacagg ccctgcgcct caacgtggtg cctctcctca aaggtcggaa ggctccttcc 1860 agctcctaca cctcgacgac gactgacggt gcccacactc ggagaccccg gtcaccctcg 1920 ccgagcccag ggtaacgcta caggtagcgg gtaagccaat ttcttttctc atcgatacgg 1980 gggctaccta ctctgttttg tcatcctatg gtggacctag ccaaccctcc actatttcag 2040 tagtgggggt agacggtaag ccatctaacc ctcgccagac cccaatgtta aactgttgcc 2100 tggactatgc atattttgcc cactctttcc tcatcatgcc ctcctgtccc acccccctct 2160 taggacgaga catcttaacc aagctcaagg cttccattaa cctgccctct catcccctcc 2220 ccagctccaa cgttacccta ataatgcctt tgtgtaccca taatcagggc cccaatccac 2280 cttccccggt gttcccggtg ccggttgacc ctcaggtgtg ggatacatct accccagttg 2340 ttgctcgaca tcacttacca gtccttatcc acctgaagga ctctacccac ttcccgtctc 2400 gccctcagtt ctctctccca ctgcgtcacc ggcaaggaat caaacccatc atcgaccgtc 2460 tacttcaaca gcagatttta atccccaccc attctccatg caacaccccc atcctccctg 2520 ttaagaaacc ctcaggagct tacagactag ttcaagactt acgtatcatt aatgaggcag 2580 taatcccaac acgcccagta gtccctaacc cctataccct tctttcacgc atacctcctg 2640 agacctccca cttcaccgtc cttgatctaa aagatgcatt cttttctatt cctttacatc 2700 ccgattgcta ctttatcttt gctttcactt gggaagatcc cgaaacccat 
gtctctactc 2760 aactcacttg gacggttctt ccccaaggtt tccgagacag cccccacttc tttgggcagg 2820 cattggccaa agacctagcc cagtgcccat tccgctccag caccattatc cagtacgttg 2880 atgatttact tctatgtagc ccctcacatg acgtttcctt acaggatact gccaccctac 2940 ttaacttctt aggtaggctg ggctatagga ttaccaaaca aaaggcccaa ctttgccttc 3000 ccaaggttat atacctaggc atagaactta ccccagtgtc caaatcgctg actacagata 3060 gattaagtct cattcagagt ctcanccccc catctaacgg agaagaaatc ctatctttct 3120 taggtcttgt aggattcttc cgacattgga tcccaaattt tggggtcttg gccaaacccc 3180 tctaccaagc tgccaaggaa tccccaacag gacccctatc agacccaaaa ttaatagcta 3240 cacacttcac acgccttaaa tcatgtctcc tgtccgcccc tactctctcc ttacccaacc 3300 ccctgtaccc attctacctc ttcactgatg aacgtcacaa gatagctaca ggtctcctgg 3360 cccagccaat aggcacaacc ttcagaccgg tcgcttatct ctccaaacaa ctagacacca 3420 cagtacaggg atggcagccc tgcctacgcg ctcttgctgc agcggcagaa ctaaccaaag 3480 aagctttaaa acttacctta ggaggccccc ttacggtcca ttccacacat cgtcttcgag 3540 acttaatttc ccacaaatgc atcagtcatc tcactccttc ccgcatacag ttattccatg 3600 tcctattttt agaaaatcct gacattaaaa tagccacctg ttcttctctc aatcctgcta 3660 ctctcctgcc tactccaaat acctcacaca ccccagaaca ttcatgccca gagattatcc 3720 agttagctct cccaacccat ctccacttac gggatcaacc cctcccagac ccacaactga 3780 cattctttgt ggacgggagt tcattcgttg acccccatgg aaacagacac gcgggatatg 3840 cagtagtcac taacacacag gtgatagaag ccaagcctct accattaggc acaacctcac 3900 aaaaggcaga actcactgcc ttaacccggg cactccatct ctctgaggga aaaacagtta 3960 atatatatac tgattccgag tatgcataca tgatagcaca cacacactcc gtcctctggc 4020 aagaacgtgg gtttttaacc accaaaggaa cccctatcat aaacgggaaa catatccaac 4080 ggttgctaga agctctaacc tcccctaaag aggtagctat agttcattgt agaggacatc 4140 aaaaacctac caaccccatt gcccaaggca acaacttggc cgataccact gcaaaatcca 4200 tagcactatc tacagaaaac cctcgatccc tatgtttctt aactcctcaa tacacccctt 4260 cttattcctc agaagaaaaa aacaagcttt tacagaaccc acatgcaact attacccccg 4320 accaatggat ctttatccat aaccgtgtcg tcctccctga agctcaaaca caacacatct 4380 taagagatat tcacaactct ttacacatcg gacaccaggc cctatacaac tttttaaacc 
4440 ccatcattga gtgtccatcc cttctttctc acctcaagag aattagccaa cagtgtttaa 4500 tctgcctaaa ggcaaatgct caagggggaa tacgcaacgc ccatccaagt catcagctta 4560 gaggacatca acccggagag gactggcaga ttgatttcac ccacatgcct aggcacaaaa 4620 agcttcgtta cctactcacc ctggtagata ccttttcagg atggatagaa gcctacccca 4680 ccacaggaga aagtgcatcc attgtagcct ccattctcat agaacaaatc attcctcgtt 4740 ttgggcttcc ccggtcaatc caatctgata acggcccggc ttttatttcc agagtgatcc 4800 agctagtgac agattctctc aacattactt ggaagctaca tattccatac catccccagt 4860 catccggaaa ggttgaacgg gcaaatggac tcatcaagca acaactaacc aaactttcca 4920 tagagacacg tcaatcgtgg gtaaccttgc ttccccttgc acttacccgg ctacgagcta 4980 cacctagagg ccccacaggc ctcagtcctt ttgaattagt atacgggcgc cccctcgcac 5040 tacaagaact ccctagcctc cccacgccgt tagcacccta ccttccatac ctttctctct 5100 taagacaatt acttagagaa cacgctgaga ggtcacttcc ccaccctaca caaggaaatt 5160 cggaagcccc ctcagccctt tcaccaggcg accaggtttt actcaaggac ctgcacgcaa 5220 aaggccttac ccctaagtgg aaaggtccat acacagtaat tcttaccact tccacagcag 5280 ccaaactctt aggccacagt tcttgggtcc accttactca actgaaaaga gccccgcccc 5340 ctcagactca gtggcagtcc actgaactgt cacccacccg tctgagaata accagactta 5400 atacaaacta aatgagctac gcccttctga aattctcccg ctttctcact attctccatc 5460 tcaacttaac tgaatttccc ctcatagacc ctctagaaac cttccttccc gacccttccc 5520 ccgaccagta ggtttcccga ctttttactc tcatagaaga cctagcttgg caaggtgccc 5580 tttatgattt cagcaatgtt gaacttaccc tctacacctt tgttaccact attatggttc 5640 actctaatta ttcctgctag ccttactaat cctaaatttg tatggagatt ttctataact 5700 gaaacctggt ctactgacaa ccaggctcac tcacaaacgc aaggtactgc agattgctcc 5760 ccccagggct gtcaagctgc cctcctcctt aactttcacc tgagttctgt cggcaactac 5820 gaccgcccag tcatatgttt cttgtacgac caaacggaat ataattgtaa gaactactgg 5880 caagaaacca acctggggtg tccatacaac tattgcaaca tgcatgagat aggtttaatg 5940 tgtgcaaatg gcatttgcac ccccaatgat cgaccttttg taagaaatag aacatcagga 6000 ggatatattc ttantattaa ggatccctgg gacccccggt gggctcaggg agttaaaggt 6060 ggactatatg caacatcttg gtcctcatat cccaccgcaa ccctccaaat aaagagagta 6120 
tacgtgcaac aagtacccct ccctaagagt aaacaagtac atcctcctaa gagtgtacag 6180 gccctacaaa atttaacctc agtcgtaaaa tctcatgaac agaaaataca gaagctgtta 6240 agccccccaa gcccccctaa caatgaggac ccattctcat ggctaacact tattcgccag 6300 ggacttaact taacccaggc tgctggagta aggaacctct cccactgctt cctttgcgct 6360 gcactcggaa aagctccctt agtggcagtc ccactaccaa ccgctttcaa tatcaccacc 6420 gactctacca gctcctctca agcaacatcc ttgcctcaag tcccattata ccgtaatcca 6480 cagagtcaaa cccttccttt ttgctactct actccaaact cttcgtggtg tgatcgcaca 6540 caagctccca gcaggaccca gacggccccc gttggaggct acttttggtg taaccaaact 6600 ctatctaaaa ctcttaacca tacctccatc acccagtccc tttgcgttcc ggtatcttta 6660 gtgcctagct taaccctata tagcgaagga gaattgtctg aactagcttc ccagctcagc 6720 cccagcaata acatccaaaa gcgggccgtt tttcttccct taattatcgg ggtttcccta 6780 gcatcgtccc tagtagcctc aggacttgga acaggggccc tcacccactc gattcaatcc 6840 acacagaccc tctccactca ggtccaggca gccatagagg cttcagctga aagcttagcc 6900 tctttacaac gtcaaatcac ctcagtggcc caggtagcag cacaaaatag acgggcatta 6960 gatcttctta ctgctgaaag agggggaaca tgcctcttcc taggagagga gtgttgctac 7020 tacgtaaatg aatcaggcct ggttgacacc aacgtccaaa cattaaacaa aatcaaaaag 7080 gagcttcaac aatttaacgc ccccttaacc cccggaccac cggtatggct gctgcctgta 7140 gtacagcaga tgcttccatt cctaattccc atactaatcc tctgccttat gttatgtctt 7200 gctcccattc taataaaatt tctccgagcc agggtccaag agatcacccg agtcaccttc 7260 aaccaaatgc tcctgcatcc ctacacccaa ctgccaacct cagaccctaa ctacgcccct 7320 taacagcagg aagcagccag acagaccacg gcgccctaaa ttcttataat caataagagg 7380 ttggac 7386 // ID MacERV1_int repbase; DNA; PRI; 7259 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MacERV1_int. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. 
XX RN [1] RP 1-7259 RA Smit A.F.; RT "MacERV1_int - ERV1 Endogenous Retrovirus from Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC chr1.nib:3750103 2% ORFs: gag 372-1910, pol 1911-5549, env CC 5528-7213 There may be another subfamily, 5% different and CC closer to PtERV1a. STILL TO DO. XX FH Key Location/Qualifiers FT CDS 372..1910 FT /product="MacERV1_int_1p" FT /note="gag." FT /translation="MGNTQSTPLSLLTSNFKDVRARGHDLSMEIRKGKLIT FT LCRSEWPAFDVGWPPEGTFRLPVITRVKSKIFLPGRAGHLDQIPYILIWQD FT LVENPPPWLSPFQLAPEPCKALVARPLKSKQPTAPPILFYLIAGTHCPQNP FT LRTPPGPRPQPPGLSRGKEQADGRRPAHPDPLKVKVTLKGRRGGREGALCG FT LAPPQPPDSTVALPLREIGPPDDTGIPRLQYWPFSTSDLYNWKTQSARFSD FT NPKDLMALLDSVMFTHQPTWDDCQQLLRILFTTEERERIQVEARKLVPGDD FT GQPTANPDLINATFPLTRPAWDYNTAEGRGRLHLYRQTLMAGLRAAARKPT FT NLAKVYSVLQGKTESPATYLERLMEAFRQYTPIDPEAPGSQAAVVMSFVNQ FT AAPDIKRKLQKLEDLEGKQIQDLLQIAQRVYNNRDTPEEKQFKATEKMTKV FT LAAVVQKEHLQPENTQPRRSPRHDNLSKDQCAYCKEAGHWVRDCPKKKQRG FT QGPPRSTPVLVTQDED*" FT CDS 1911..5549 FT /product="MacERV1_int_2p" FT /note="pol." 
FT /translation="GRRGSDPLPEPRVTLQVEGSPVQFLVDTGAQHSVLVK FT TNGKLSSKSSWVQGATGIKKYPWTTQRTVNLGARNVTHSFLVIPESPCPLL FT GRDLLTKMGAQIHFLPEGPVVTNSQNQPVSVLTITLEDEYRLHQERAAPDQ FT DIATWLQQYPGAWAETGGLGLAKHRPALFVELKPGTDPVRVRQYPMPLEAK FT EGIAPHIRRLLDQGVLRPCHSPWNTPLLPVRKPNSGEYRPVQDLREVNKRV FT MDIHPTVPNPYTLLSTLNPKHQWYTVLDLKDAFFSLPLAPQSQKLFAFEWT FT DPGRGISGQLTWTRLPQGFKNSPTLFDEALHEDLGEYRRKHPEITLLQYVD FT DLLIAAETQEACIQGTKGLLQALGNLGYRASAKKAQICKSEVTYLGYLLKG FT GQRWLTDARKQTVLQIPRPQSTRQVREFLGSAGFCRLWIPGFAELAKPLYQ FT ATRGQQPFNWTEEAESAFQQIKTALLSAPALGLPDVTKPFHLYVDESKGVA FT KAVITQNLGPWRRPVAYLSKKLDPVAAGWPPCLRMIAATALMVQDADKLVM FT GQELRVVTPHAIEGVLKQPPNRWISNARLTHYQGLLLNPLRITFLPPTTLN FT PASLLPNPDLDAPLHDCTEILAQVHGVREDLQDRPLPDADLVWFTDGSSFV FT HQGQRYAGAAVTSETEVIWAEPLPPGTSAQKAELIALTQALTLGAGKKLTV FT YTDSRYAFATAHIHGAIYRERGLLTAEGKEIKNKQEILALLTALWRPEKLA FT IVHCPGHQKPTTPIAQGNFLADQTARSVAKAPSQLLALQLPDPGPRDLPYL FT PDYSEQDLQWIDKLPLKQIQNGWWTDTNDQTILPEKLGQQVLEHIHRTTHL FT GARRMIDLIRRSKLKIRHIAETASSIVTSCKVCQLNNAYPQSQAATGTRLR FT GTRPGIYWEVDFTEIKPGKYGYRYLLVFVDTFSGWTEAFPTKRETAQVVAK FT KILEDILPRYGFPVQIGSDNGPAFVAKVSQDLASILGANWKLHCAYRPQSS FT GQVERMNRTLKETLTKLTMETGANWVVLLPYALFRARNTPYRLGLTPYEIM FT YGRPPPLVPSLKDDLLKSETENVSELLFSLQALQKIHQEIWPKLKELYETG FT PPPTPHPYQPGDWVLVKRHRQETLEPRWKGPLQVLLTTPTALKVEGIASWI FT HYTHVKPVDPTSDLLGPITAAAEAPATWTVDRAKNNPLKLTLRRQHSSLQT FT CS*" FT CDS 5528..7213 FT /product="MacERV1_int_3p" FT /note="env." 
FT /translation="LTANMQLGSLTLTLVALVAAGENTKPAPNPFVWRFWL FT YENRTHPGQPHKPGKLLASADCPSSGCNSPILLNFTGFQIAKPMAPIICFE FT FDQTKYNCKHYWWHQNAGCPYNYCNIHRSRYWGKEEQLDPKWPFHRRRDGD FT FSYTWIVRDPWNSRWTTPQHGAVYYSSSSTWPSSHLYLWRGLVQVRPLVHG FT NIQRQENRLTQDLRPFSWLKLLQEGLELANLTGLHSLSGCFLCATLGRPPL FT TAVPLPWGSSTSAQANNLRNLSYAPIPNVPLYLNPSQEKFPYCFSGTNSSL FT CNITATPPSTTLRAPPGIFFWCNGTLSKNLSGPSVTNLLCLPVTLVPRLTL FT LTAGEFLGYTGNWTSTAIHPVPRPRPARAIFLPLIAGISLTSSLIAAGLAG FT GALGHTLIESNKLYQQFAVAMEESAESLASLQRQLTSLAQVTLQNRRALDL FT LTAEKGGTCMFLKEDCCFYINESGLVEDRVQQLRKLSTEVKTRQFASAADQ FT WWNSSMFSLLAPFLGPLLSLLFLFTVGPCVVNRILQFVRERFDTVQLMVLR FT AQYQPVNAETESDL*" XX SQ Sequence 7259 BP; 1917 A; 2068 C; 1738 G; 1535 T; 1 other; tctgggggct cgtccgggat tccccaagcc caccagaccc ctggtcaacg gatctaccag 60 gatcgatctg ctgataggtg agccggctcg tctctgtttg tctgtctgtg tctgttctga 120 acctgtatct gtgactcgcg aggtctgaaa ctgaagctga cgcagtcctg gcggacgcgc 180 tataggacgg ccagcggaga ccggtgggag acgtcccctg gctctcgtct gatctagatt 240 tttatctgaa ctgattctga cccgggcttc cggagcgcgc ttgcggctag atattattaa 300 tacgccagat ttcctttaca ggaattctaa cccatattct tttacaggaa gagactacaa 360 attattacag gatgggtaat acccagagca ctcccctatc tctccttaca agtaatttca 420 aggatgttag agcaaggggc catgatctta gtatggaaat caggaaggga aagctaatta 480 ctctgtgccg ctccgaatgg cctgcctttg atgtggggtg gccacccgaa gggacgttcc 540 gacttcctgt catcactaga gtaaagtcca agattttcct acctgggcgt gcgggccact 600 tggatcaaat cccatatatc ctcatatggc aggaccttgt tgagaacccg cctccttggc 660 tgtccccttt ccaattagcc cctgaaccct gtaaggcact ggttgctcgg ccgctaaaat 720 ccaagcaacc gactgccccc cccatcctgt tctacctgat agcggggacc cactgtccac 780 agaaccccct ccgtacccct ccgggcccca ggccccagcc ccccgggctg agccgcggga 840 aggagcaggc ggacgggagg cggccggcgc acccggaccc gctgaaagtg aaagtaactc 900 tgaagggccg gcggggcgga cgcgagggcg cactttgcgg gctagccccc ccccagccgc 960 ctgactccac agtggctcta ccccttcggg aaataggacc cccagatgac acgggaatcc 1020 ccaggctcca gtactggcca ttctccacca gtgatctgta taactggaag actcagagtg 1080 ctcggttttc agacaacccc aaagatttaa tggctttact 
ggatagtgtc atgttcaccc 1140 accagcccac ttgggatgat tgtcagcagc tcctccgaat cttgttcaca acggaggagc 1200 gagagagaat acaggtagaa gctagaaagc tggtcccggg ggacgacggt caaccgactg 1260 ccaaccccga cctcataaat gcgacttttc ctctgaccag gccggcgtgg gactacaaca 1320 cggcagaagg taggggacgg ctacaccttt atcgccagac tctaatggcg ggtctccggg 1380 cagctgctcg caagcccact aatttggcta aagtatactc tgttctgcag ggaaagacag 1440 aaagcccagc tacctactta gaaagattaa tggaagcttt tagacagtac acccccatag 1500 acccagaggc tccaggaagt caggcagctg tcgtaatgtc ttttgtaaat caggcggccc 1560 cagacattaa aaggaaactc cagaaattag aagacttaga gggaaaacag attcaggacc 1620 tccttcagat agcccagcgg gtttacaata acagagatac tccagaggaa aagcaattta 1680 aggccactga aaaaatgacc aaggtcctgg cagcagtagt acagaaagag catctacagc 1740 cagagaacac ccaacctagg cggtccccca gacatgataa tctgagcaaa gaccaatgtg 1800 cntactgtaa ggaagctggc cactgggtaa gagactgccc caaaaagaaa caacgaggac 1860 agggaccccc taggtctaca cccgtactag tcactcaaga cgaagactag ggaagacggg 1920 gttcggaccc cctccccgaa cctagggtaa ctttgcaagt ggaggggtcc ccagtccagt 1980 tcttggtcga tacgggagca cagcactcgg tcctagttaa aactaatgga aaattatcct 2040 ccaagtcctc gtgggtacaa ggggccacag gaattaagaa atacccctgg acaacacaaa 2100 gaacagtaaa cctcggagcc aggaatgtaa cccattcttt cctggtcatc cctgagagcc 2160 cctgtcccct attagggaga gacctgctaa ctaaaatggg agcacagatc catttcctcc 2220 ccgaggggcc cgtcgtgacc aactcccaaa atcagcccgt gtccgtcctg actataaccc 2280 tagaagatga gtaccggctc caccaggagc gagcggcccc tgaccaggac atagcaacct 2340 ggctccagca atatccagga gcttgggcgg aaacgggggg cttaggacta gcaaaacacc 2400 gtcctgcctt gtttgttgaa ctcaagcctg ggacagaccc cgttcgggta cgccaatacc 2460 cgatgcccct agaggccaaa gaagggattg caccacatat tcgccggctc ctcgaccaag 2520 gggtccttcg cccatgtcac tcgccctgga atactccatt gttgccggta cgaaaaccta 2580 atagcggaga atacagacct gtacaagact tgagagaagt caataagagg gttatggaca 2640 tacatccaac tgtgcctaac ccgtacaccc tcctgagtac cctaaaccct aaacatcaat 2700 ggtacactgt tttagatttg aaagatgctt tcttcagttt gcctttagcc cctcagagcc 2760 aaaagctctt cgccttcgag tggactgacc ctgggagggg cataagtggc 
caactgacat 2820 ggaccagact gccgcaggga ttcaaaaact ctcctaccct gttcgatgag gccctccatg 2880 aagacctggg tgagtaccga cgcaaacacc ctgaaataac cttactacag tatgttgatg 2940 acctcctgat tgctgccgag acccaagaag cttgtattca agggaccaag ggtctcttac 3000 aagctctagg gaatctaggc taccgagcct cggcaaagaa agctcaaatc tgtaaatcag 3060 aagtaacata tctggggtac ctgctaaaag gagggcagcg ctggttaaca gacgcccgga 3120 agcagactgt tctgcagatc cccaggccac aatccacccg acaagtaagg gaattcctgg 3180 ggtcggcggg attttgtaga ctatggatac ctgggttcgc agagctggct aaacccttgt 3240 atcaggcaac acgggggcag cagccattta attggacaga agaagccgag tcggctttcc 3300 aacagatcaa aaccgcccta ctctctgcgc ctgcactggg actacctgat gtcaccaagc 3360 ccttccactt atacgtggac gagagcaagg gtgtcgccaa ggcggtaata actcagaact 3420 taggcccctg gcggaggccg gttgcctacc tgtcaaagaa attagaccca gtggctgccg 3480 ggtggccccc ttgtctccga atgattgcgg ccacggccct gatggtacaa gatgctgata 3540 aacttgtcat gggtcaagaa ttacgggtcg ttaccccaca tgccatcgag ggtgtactca 3600 aacagccacc taatcgatgg ataagtaacg cccggctcac ccactaccaa ggactactac 3660 taaatcccct caggataact ttcctgcccc caacaacctt aaaccctgcc tcgctgctgc 3720 ccaacccgga cctggacgcc ccgctccatg attgcaccga gatactagct caggtgcacg 3780 gagttcgaga ggacctgcag gaccgcccac ttcctgacgc tgacctagtc tggttcactg 3840 atgggagcag cttcgtacac caaggccaga ggtacgctgg agcggcagtg acttcagaga 3900 ctgaggtaat ctgggcggaa cccctgcccc cggggacatc ggcccagaag gccgaactga 3960 tagcgctcac ccaagctctt accttagggg cagggaagaa actgacagta tacacagaca 4020 gccgatatgc ttttgctacg gcgcacatac atggggccat ttacagggag cgagggttac 4080 taacggctga aggaaaagag ataaaaaaca agcaagagat cctagccctg ttaacagccc 4140 tatggagacc agaaaaattg gctattgtac attgcccagg gcatcagaaa ccaaccactc 4200 caattgctca aggcaacttt ctggcagacc aaactgcaag gagtgtggca aaggctccca 4260 gccaactcct tgcactccag ctccctgatc cgggcccccg ggacttgcca tatctccctg 4320 attattcaga acaagatctc cagtggatcg acaaacttcc cctgaaacag atccagaatg 4380 ggtggtggac tgatactaat gaccaaacca tcctaccaga aaaattagga caacaagtgt 4440 tagaacacat ccaccgaacc acccacctgg gtgcccggcg gatgatagac ctgatcagac 
4500 gctccaagct caaaatcaga catatagccg agacggccag cagcatcgtg acaagttgca 4560 aagtctgcca gcttaacaac gcctaccccc aatcccaggc tgcaacggga acaaggctca 4620 ggggaaccag gcccggtatc tactgggaag tagattttac tgagataaag ccaggaaaat 4680 acgggtatcg gtacttactt gtctttgtag atactttttc agggtggact gaagcattcc 4740 caaccaaaag ggaaactgct caggttgtag caaagaaaat tctggaagat atcctcccca 4800 ggtatggctt ccccgtccag atagggtcag ataatgggcc ggccttcgtc gctaaggtaa 4860 gtcaggattt ggcttccatc cttggggcaa attggaaact acattgcgct tacaggcccc 4920 agagttcagg acaggtagag aggatgaatc ggaccttaaa ggagacctta actaaattga 4980 ctatggagac tggcgctaat tgggtggtcc ttctccccta cgctctgttc cgggcccgta 5040 atacccctta cagactgggc cttacccctt acgaaatcat gtatggcaga cccccacccc 5100 tggttcccag cctaaaagat gatctgctca agtctgaaac agaaaatgtc tctgaactct 5160 tattttccct acaagccttg cagaaaattc atcaagaaat ctggcccaag ctgaaggaac 5220 tatatgagac cggtcccccg ccgacacccc atccgtacca gccgggagac tgggtcctgg 5280 ttaagcgaca ccgacaagag accctagaac ccaggtggaa gggaccactc caggtactcc 5340 taaccacacc caccgccctg aaggtagaag gcatcgcgtc gtggatccac tacacccacg 5400 tcaagccagt ggacccaacc tccgaccttc tggggccaat cacggcggcg gctgaagcac 5460 cggccacgtg gactgtggac agagctaaga acaacccctt aaaactcacc ctgcgccggc 5520 agcatagctc actgcaaaca tgcagttagg tagtctaacc ctaacgttag tcgccctagt 5580 ggccgctggg gaaaacacaa aaccagctcc taatcccttt gtctggagat tctggcttta 5640 tgaaaaccga acccaccctg ggcaacctca taagcccggg aaattattgg ccagtgctga 5700 ttgcccctcc tcagggtgca atagcccaat tttactaaat tttactggtt ttcaaatagc 5760 caaaccaatg gcaccaataa tatgctttga gtttgatcag actaaataca attgtaaaca 5820 ctattggtgg caccaaaatg ccggctgccc ttataactat tgtaacatcc atagatcccg 5880 ttattgggga aaggaagaac agttagatcc taagtggccc ttccatcgta gacgggacgg 5940 ggacttttca tatacatgga tagtcagaga cccctggaac tcccgctgga ccacgcctca 6000 acatggggct gtatactact cctcctcctc cacatggcct agcagtcacc tctatctgtg 6060 gcgaggtcta gtgcaggtac ggcccctggt ccatgggaat atccaacgac aagaaaacag 6120 actaacacaa gacttacgtc ctttctcctg gttaaaatta ttacaagaag gattagaact 6180 
tgctaacctt acaggacttc acagcttgtc tggctgcttt ctgtgtgcca ctctagggcg 6240 tccaccgcta accgctgtcc ctctgccatg gggatcctct acctctgccc aagctaacaa 6300 cctccgaaac ctctcatatg cccctatccc taacgtgcca ctatacctaa accccagtca 6360 ggagaagttt ccctactgtt tctcaggaac taattccagc ctctgcaaca tcactgcaac 6420 gccccctagt accaccctaa gggctccgcc gggcatattc ttctggtgta atggaacatt 6480 atctaagaac ctatctggtc cctctgttac caacctactg tgtcttcctg tcacattagt 6540 tccccggttg actctactaa ctgccggcga gttcctgggg tacaccggta actggactag 6600 tactgctatt cacccagtcc ctagaccgag acctgcacga gccatatttc tccccctcat 6660 tgcgggaatc tccctcacct catccctcat tgcggccgga ctggcggggg gagccctagg 6720 tcacaccctc atagaaagca acaagttgta ccaacaattt gccgttgcta tggaggagtc 6780 ggctgagtcc cttgcctccc ttcagcgaca gctcacgtcc ctagctcagg taaccttgca 6840 gaaccggagg gccctagacc tactcactgc tgaaaaaggt ggtacatgta tgtttctaaa 6900 ggaagactgt tgtttctaca taaatgaatc agggcttgta gaggaccggg tccaacagtt 6960 acgcaagtta agcacagaag taaaaacacg gcagtttgct tcagctgcag accaatggtg 7020 gaattcctct atgttttccc tgttagcccc cttccttgga cccctgctaa gtctactatt 7080 tctgtttacc gtaggacctt gtgttgttaa cagaatttta cagtttgtca gggaaaggtt 7140 tgacaccgta cagctcatgg tcctcagagc ccaataccaa cctgtaaacg ctgaaacaga 7200 atcagactta taagacccaa gattggctct agaggtacct gagaaaaggg gggaatgaa 7259 // ID LTR22C0 repbase; DNA; PRI; 502 BP. XX AC . XX DT 28-AUG-1997 (Rel. 2.07, Created) DT 04-JUN-2009 (Rel. 8.11, Last updated, Version 3) XX DE Long terminal repeat of endogenous retrovirus - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; HERVK22I; KW LTR22; LTR22C0. XX NM LTR22. XX OS Hominidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RA Kapitonov V.V. and Jurka J.; RT "LTR."; RL Direct Submission to Repbase Update (25-AUG-1997). XX RN [2] RP 1-502 RA Lavie L., Medstrand P., Schempp W., Meese E. 
and Mayer J.; RT "Characterization of the human endogenous retrovirus family RT HERV-K(HML-5)."; RL Direct Submission to Repbase Update (07-DEC-2003)(in RL preparation). XX DR [2] (Consensus) XX CC Putative LTR of endogenous retrovirus. CC The LTR22 family is represented by two major subfamilies. CC LTRs associated with HERVK22/HERV-K(HML-5) proviral loci are CC represented by three major subfamilies: LTR22, LTR22A, LTR22B. XX SQ Sequence 502 BP; 111 A; 121 C; 145 G; 125 T; 0 other; tgtaggagat cggtcagggt ggtgggaaaa attataggaa agacgcaaac cttcttggaa 60 ggccgggagg ttttgcaaaa gcttcggaaa aggatttggc tgaaggcagc cagattctct 120 tatccggagc ctgagagcaa aggttagata acaaggggat gtaaagaaat tgatctagat 180 aagttagttt acttaggcct cggaacctgg cctttaatca tccgcgcgca ggactgctct 240 ctccaggggt ggggggggcg accatgttaa ttacccacaa gtgtgttgac tcaaagcctt 300 tgtcattaaa tctgtactga ataaatgccc gcagcgccgg cttgtcaggg ccgcggctgc 360 tgactcttta cgactcttta cggcaccctc ctcggtgtct gtgagcggcc cggtccccta 420 gcccgctctt tcactggata cctgtgtctg agtgcatttt ttcatccgtc gctcggccag 480 ggtctgcggg tcggacccgg ca 502 // ID MacERV6_LTR2a repbase; DNA; PRI; 494 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MacERV6_LTR2a. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-494 RA Smit A.F.; RT "MacERV6_LTR2a - ERV1 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC despite 5bp TSD 5-6%. 
XX SQ Sequence 494 BP; 120 A; 154 C; 122 G; 98 T; 0 other; tgtttgggtg agggagaaag gacaagatgg aggaaggtga acaagaaggc acaatccatg 60 ttgcttccgg gttcttcctc accaactttc ccgcgcgcgg gaaaatgcag cccgcgcccg 120 ggaagatgca gatcaaccga gcatgcgcca ggtgacgtca atccgaagag atcgaaactt 180 acccggccac gcctacggag acgcccctat cacgccctta tcccgcccac tgccctcccc 240 cttccagtac caatgcataa aagtccgccg ccggcaggag ccggcgtgac ttcttcggcc 300 cccgcattcg tggaccggag aacctcaccc gagagcgccg gcgcgacttc cctggccccc 360 cacacctgag gaccggagaa cctcgcccga gagtgtgtgc atatttgcaa taaaagactg 420 ccgctttctt acgtactttg gcctcatgtt taattattta gctctcctaa attaagttaa 480 attaaattaa gaca 494 // ID LOR1b_LTR repbase; DNA; PRI; 461 BP. XX AC . XX DT 16-MAR-2006 (Rel. 11.03, Created) DT 10-APR-2007 (Rel. 11.03, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LOR1; KW LOR1b_LTR; LTR retrotransposon; MER4 group. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-461 RA Smit A.F.; RT "LOR1b_LTR - a subfamily of endogenous retroviruses from RT primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC mer4 group; 4bp duplication. XX SQ Sequence 461 BP; 117 A; 130 C; 82 G; 131 T; 1 other; tgaaaccggc ccaattgtcc catagaactg atgtttatgg tttctttgaa taaacataga 60 aattgaccct cccagtctta aaacttgaga aagttacatt tgtcttatct gagttccttt 120 ctcaggaaac caaccatcag gcctcccaga tagtatcaag gaactgaaac ttaccagatc 180 actgcatccg gacaatgaga cgtcagaccc ctcacccatc atgattgctt ccttacccct 240 ccctaattcc tgttttcccg cacatggtta catttcttcc ctgctatata aacccctaat 300 tttagtccat cagggagatg gatttgagac tgatctcccg tctcctcggc tgcagcaccc 360 gattaaagcc ttcttccytg gcaatactca ttgtctcagt gattggcttt ctgtgcggcg 420 agcaacagga cctagaccga acccctggcg tttcggtaac a 461 // ID MER65_Cja repbase; DNA; PRI; 529 BP. XX AC . XX DT 14-OCT-2009 (Rel. 
14.12, Created) DT 14-OCT-2009 (Rel. 14.12, Last updated, Version 2) XX DE Long terminal repeat of retrovirus-like element. XX KW ERV1; Endogenous Retrovirus; Transposable Element; MER65_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-529 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(12), 3035-3035 (2009). XX DR [1] (Consensus) XX CC ~81% identical to consensus. 4bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. XX SQ Sequence 529 BP; 154 A; 143 C; 90 G; 142 T; 0 other; tgtgaaagtt gtcagattca aaatggagtc acttgtgtca aaccctggca aatggagccg 60 gggaaggcca tgaagggagg gctctcatgc acgatttgcc tgataacagg aactatcaca 120 agaaattttt ccaaaccgca gcttactaca tgagtcacac aaggacagct agccgcttat 180 acaagaacac ttgcctgaca cactgtctca caagcccaat ccaaaactgc aagggcctaa 240 ccataactcc aagattacaa gtcctaccta gcaactgctg acactcgcca atcagaactc 300 gccagctctt gtaagacgct gccagcacca atgagctttc tttcaaaaca acttgcataa 360 cctcctcttt ccccagtaaa ccctaacctt ttcctttgtt ctccggacat accagaggcc 420 accctggtct gtatgtatgt cctggattgc aatcctactt cttgtatatt attcccaaat 480 aaaacctttt tacttagaga ttcatctcta ttattttttt atgttgaca 529 // ID ERV1-Mim_LTR repbase; DNA; PRI; 421 BP. XX AC . XX DT 31-OCT-2009 (Rel. 14.11, Created) DT 31-OCT-2009 (Rel. 15.11, Last updated, Version 3) XX DE Endogenous retrovirus-like element: consensus of the long DE terminal repeat. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-Mim_LTR. XX NM ERV1-Mim_LTR. 
XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-421 RA Jurka J.; RT "Endogenous retroviral elements from the mouse lemur."; RL Repbase Reports 9(11), 2822-2822 (2009). XX DR [1] (Consensus) XX CC Top sequences are >99% identical to consensus. 4bp tsd. ORFs CC corrupted by mutations. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 421 BP; 85 A; 129 C; 89 G; 118 T; 0 other; tgaaaggccc aaactagcct aaaggcccaa actagcctaa ccagcttttc gcttctgtaa 60 ctatgcttgc tcataggtaa ttaagacccc tacccctccc tttttcttta tcttttcccc 120 taagtgtcca ttgcaagttc tcggaattgg gtagtggtcg tgcaactggt tgtgcaagat 180 ggagctactg gaatttcccc accctaaact ccccggccac ctgcgtgtgg tctcgcccca 240 taaaaaccct aagcttgaga gcttcggggc ggcagcctcc ccatcttggt gatgctgttt 300 agcccctgcg cgcgctggaa ataaatcctc ttgctctctt gcatcaagcc tctggactct 360 gagtcttttt gggcggtcgt ctctctctcc caaacgggct gtacatttct acggcccaac 420 a 421 // ID PTERV1c repbase; DNA; PRI; 7292 BP. XX AC . XX DT 14-MAR-2006 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Pan troglodytes. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; PTERV1c. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-7292 RA Smit A.F.; RT "PTERV1c - ERV1 Endogenous Retrovirus from Pan troglodytes."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC <2% div. 
ORF1 1499-3302, ORF2 3203-5581, ORF3 5560-7245 Most CC copies of this subfamily seem to have a frame shift in the CC beginning of ORF2 through a 1 bp insertion at pos 3293 (CAC to CC CAAC). lib20040702. XX FH Key Location/Qualifiers FT CDS 1499..3199 FT /product="PTERV1c_1p" FT /translation="SGLPQRDAIQAATLTRAAKVRLRLDIINKSDQISFTG FT IQTHIPLQEETTNYYRMGNTQSTPLSLLTSNFKEVRARGHDLGIEIRKGKL FT ITLCRSEWPAFDVGWPPEGTFRLAVITRVKSKIFLPGRAGHLDQIPYILIW FT QDLVENPPPWLSPFQLASEPCKALVARPLKSKQPTAPPHPVLPDSGDPLFT FT EPPPYPSGPQAPAPLAELREGAGGREAAGTHGPAERESNFEGPAGRTRGRT FT SRTSPPQPPDSTVALPLREIGPPDDTGIPRLQYWPFSTSDLYNWKTQSARF FT SDNPKDLLALLDSVMFTHQPTWDDCQQLLRILFTTEERERIQIEARKLVPG FT DDGQPTANPDLINATFPLTRPAWDYNTAEGRGRLHLYRQTLMAGLRAAARK FT PTNLAKVYSILQGKTESPATYLERLMEAFRQYTPIDPEAPGSQAAVVMSFV FT NQAAPDIKRKLQKLEDLEGKRIQDLLQIAQRVYNNRDTPEEKQFKATEKMT FT KVLAAVVQKEHLQPEYTQPRRPPRHDNLSKDQCAYCKGAGHWVRDCPKKKP FT RGQGPGPPRSTPVLVTQDED" FT CDS 3203..5578 FT /product="PTERV1c_2p" FT /translation="GRRGSDPLPEPRVTLQVEGSPVQFLVDTGAQPFNWTD FT EAELAFQQIKTALLSAPALGLPDVTKPFHLYVDENKGVAKAVITQNLGPWR FT RPVAYLSKKLDPVAAGWPPCLRMIAATALMVQDADKLVMGQELRVVTPHAI FT EGVLKQPPNRWMSNARLTHYQGLLLNPLRIIFLPPTTLNPASLLPNPDLDA FT PLHDCTKILAQVHGVREDLQDRPLPDADLVWFTDGSSFMHQGQRYAGAAVT FT SETEVIWAEPLPPGTSAQKAELIALTQALTLGAGKKLTVYTDSRYAFATAH FT IHGAIYRERGLLTAEGKEIKNKQEILALLTALWRPEKLAIVHCPGHQKLTT FT PTAQGNFLADQTARNVAKAPSQLLALQLPDPGPRDLPYFPEYSEQDLQWID FT KLPLKQIQNGWWTDTNDQTILPEKLGQQVLEHIHRTTHLGARRMIDLIRRS FT KLKIRHIAETASSIVTSCKVCQLNNAYPQSQAAAGTRLRGTRPGIYWEVDF FT TEIKPGKYGYRYLLVFVDTFSGWTEAFPTKRETAQVVAKKILEDILPRYGF FT PIQIGSDNGPAFVAKVSQDLASILGANWKLHCAYRPQSSGQVERMNRTLKE FT TLTKLTIETGANWVVLLPYALFRARNTPYKLGLTPYEIMYGRPPPLVPSLK FT DDLLKSETENVSEFLFSLQALQKIHQEIWPKLKELYETSPPPTPHPYQPGD FT WVLVKRHRQETLEPRWKGPLQVLLTTPTALKVEGIASWIHYTHVKPVDPTS FT DLLGPITAAAEAPDTWTVDRAKNNPLKLTLRRQHSSLQTCS" FT CDS 5560..7242 FT /product="PTERV1c_3p" FT /translation="LTANMQLGSLTLTLVALVAAGENIKPAPNPFVWRFWL FT YENQTHPGQPHKPGKLVASADCPSSGCNSPILLNFTDFPVAKPVAPIICFE FT 
YDQTEYNCKHYWWHQSAGCPYNYCNIHKYQWWGGKEQIDPRWPFHRRRDRD FT LSYTWIVRDPWNSRWTTPQHGAVYYSSASTWPSSHLYLWRGLVQVRPLVHG FT NIQRQENRLTQDLRPFSWLKLLQEGLELANLTGLHSLSGCFLCATLGRPPL FT TAVPLPWGSSTSAQANNHRNLSYAPIPNVPLYLNPSQEKFPYCFSGTNSSL FT CNITATPPNITLRAPSGIFFWCNGTLSKNLSSPSVTNLLCLPVTLVPRLTL FT LTAGEFLGYTGNWTSAVIHPDPRPRPARAIFLPLIAGISLTASFMAAGLAG FT GALGHTLIESNKLYQQFAVAMEESAESLASLQRQLTSLAQVTLQNRRALDL FT LTAEKGGTCMFLKEDCCFYINESGLVEDRVQQLRKLSTEVRTRQFASAADQ FT WWNSSMFSLLAPFLGPLLSLLFLLTVGPCVVNRILRFVKERFNTVQLMVLR FT AQYQPVNAETESDL" XX SQ Sequence 7292 BP; 1752 A; 2137 C; 1861 G; 1538 T; 4 other; tctgggggcc cgtccgggat tccccaagcc caccagaccc ctggtcaacg gatctgctag 60 gatcgatcta ctgataggtg agctggctcg tctccgtttg tctgtctgtg tctgttctga 120 atccgaatct gtgactcgcg aggtctgaaa ctggagctgg cacagtcctg gcggacgcgc 180 tataggacgg ccagcggaga ccggtgggag acgtcccctg gctctcatct gatctatatt 240 gcgatctgag ctgccccggt ttggcgggca gcagctgtct ctgatctgag ctgcccggtt 300 gggcgggcag cagctgtctc tgatccgagc tgccccggtt tggcgggcag cagccgtctc 360 cgatctgagc tgccccggtt tggcgggcag cagccgtctc tgatctgggc tgccccagcg 420 cgatcgcgat ccaggcagct gactctgacc cgggcttccg gtgcgcgctt gcgatctgaa 480 ctgccccggt ttggcgggca gcagctgtct ctgatctggg ctgccccagc gcgatcgcga 540 tccaggcagc tgactctgac ccgggcttcc ggtgcgcgct tgcgatctga actgccccgg 600 tttggcgggc agcagctgtc tctgatctgg gctgccccag cgcgatcgcg atccaggcag 660 ctaactctga cccgggcttc cggtgcgcgc ttgcgatctg aactgccccg gtttggcggg 720 cagcagctgt ctctgatctg ggctgcccca gcgcgaccgc gatccaggca gctaactctg 780 acccgggctt ccggtgcgcg cttgcgatct gaactgcccc ggtttggcgg gcagcagctg 840 tctctgatct gggctgcccc agttggcggg cagcagccgt atctgagctg cccggttggg 900 cgggcagcag ctgtctctga tctgagctgc cccggttcgg cgggcagcag ctgtctctga 960 tctgagctgc cccggtttgg cgggcagcag ctgtctctga tctgagctgc cccggtttgg 1020 cgggcagcag ccgtctctga tctgggctgc cccggctgan cgggcagcag ctgnctctga 1080 tctgagctgc cccggtttgg cgggcagcac tgccgatctg agctgccccg gttnggcggg 1140 cagcagctgt ctctgatctg ggctgcccca gcgcgaccgc gatccaggca gctaactctg 1200 acccgggctt 
ccggtgcgcg cttgcgatct gaactgcccc ggtttggcgg gcagcagctg 1260 tctctgatct gggctgcccc ggcgcgaccg cgatccaggc agctgnctct gatctgagct 1320 gccccggttt ggcgggcagc agccgtatct gagctgcccc ggtttggcgg gcagcagctg 1380 tctctgatct gggctgcccc agcgcgaccg cgatccaggc agctaactct gacccgggct 1440 tccggtgcgc gcttgcgatc tgaactgccc cggtttggcg ggcagcagct gtctctgatc 1500 tgggctgccc cagcgcgacg cgatccaggc agctactctg accagggctg ccaaagtgcg 1560 cctgcgatta gatatcatta ataagtcaga tcagatttcc tttacaggga ttcaaaccca 1620 cattccttta caggaagaga ctacaaatta ttacaggatg ggtaataccc agagcactcc 1680 tctatctctc cttacgagta atttcaaaga agttagagca aggggccatg atcttggtat 1740 agaaatcagg aaaggaaagc taattactct gtgtcgctcc gaatggcctg cctttgatgt 1800 ggggtggccg cccgaaggga ccttccgact tgctgtcatc actagggtaa agtccaagat 1860 tttcctacct gggcgtgcgg gccacttaga tcaaatccca tatatcctca tatggcagga 1920 ccttgttgag aacccgcctc cttggctgtc ccctttccaa ttggcctctg aaccctgtaa 1980 ggcactggtt gctcgaccac taaaatccaa gcaaccaact gccccccccc atcctgttct 2040 acctgacagc ggggacccac tgttcacaga accccctccg tacccctccg ggccccaggc 2100 cccagccccc ctggctgagc tgcgggaggg agcaggcgga cgggaggcgg ccggcacaca 2160 cgggcccgct gaaagggaaa gtaactttga agggccggcg gggaggacgc gagggcgcac 2220 ttcgcggact agcccccctc agccgcctga ctccacggtg gctttacccc ttcgggaaat 2280 aggacccccg gatgacacag gaatccccag gctccagtac tggccattct ccaccagtga 2340 tctgtataac tggaaaactc agagtgctcg gttttcagac aaccccaaag atttactggc 2400 tttactagat agtgtcatgt tcacccacca gcccacttgg gatgattgtc agcagctcct 2460 ccgaattttg ttcaccacgg aagagcgaga gagaatacag atagaagcta gaaagctggt 2520 cccgggggac gacggtcaac cgactgccaa ccccgacctc ataaacgcaa cctttcctct 2580 gaccaggccg gcgtgggact acaacacggc agaaggtagg ggacggctac acctttatcg 2640 ccagactcta atggcaggtc tccgggcagc tgctcgcaag cccactaatt tggctaaagt 2700 atattctatt ctgcagggaa agacagagag cccagctacc tacttagaaa gattaatgga 2760 agcttttaga cagtacaccc ccatagatcc agaggctcca ggaagtcagg cagctgttgt 2820 aatgtctttc gtaaatcagg cagccccaga tattaagaga aaactccaga aattagaaga 2880 cttggaggga aagcggattc 
aggacctcct tcagatagcc cagcgggttt acaataacag 2940 agatactcca gaggaaaagc aatttaaggc cactgaaaaa atgaccaagg tcctggcagc 3000 agtggtacag aaagagcatc tacagccaga gtacacccaa cctaggcggc ccccccggca 3060 tgataatctg agcaaagacc aatgtgccta ttgtaagggg gctggccact gggtaagaga 3120 ctgccccaaa aagaaaccac gaggacaggg acccggaccc cctaggtcta cacccgtact 3180 agtcactcaa gacgaagact agggaagacg gggttcggac cccctccccg aacctagggt 3240 aactttgcaa gtggaggggt ccccagtcca gttcttggtc gacacggggg cacagccatt 3300 taattggaca gacgaagccg agttggcctt ccaacagatt aaaaccgccc tactctccgc 3360 gcctgcacta ggactacctg atgttaccaa gcccttccac ttatacgtgg atgagaataa 3420 gggtgtcgcc aaggcggtaa taactcagaa cttaggcccc tggcggaggc cagttgccta 3480 cctgtcgaag aagttagacc cagtagctgc cgggtggccc ccttgtctcc gaatgattgc 3540 ggccacggct ctgatggtgc aagatgctga taaacttgtc atggggcaag aattgcgggt 3600 cgttactcca catgccatcg aaggtgtact caaacagcca cctaatcgat ggatgagtaa 3660 cgcccggctc acccactacc aaggactact actaaatcct ctcaggataa ttttcctgcc 3720 cccaacgacc ttaaaccctg cctcgctgct gcccaacccg gacctggacg ccccactcca 3780 tgactgcacc aagatactag ctcaggtgca cggagttcga gaagacctgc aggaccgccc 3840 acttcctgac gccgacctcg tctggttcac tgatgggagc agcttcatgc atcaaggcca 3900 gaggtacgct ggagcggcag taacttcaga gactgaggta atctgggcgg aacccctgcc 3960 cccggggaca tcggcccaga aggccgaact gatagcgctc acccaagctc ttaccttagg 4020 ggcggggaaa aagctgacag tatatacaga cagccgatat gcttttgcaa cggcgcatat 4080 acatggggcc atttacaggg agcgagggtt actgacggct gaaggaaaag agataaaaaa 4140 caagcaagag atcctagccc tgctaacagc cctatggagg ccagaaaaat tagccattgt 4200 acattgccca gggcatcaga aactaactac tccaactgct caaggcaact ttctggcaga 4260 ccaaactgca aggaatgtgg cgaaggctcc cagccaactc cttgcactcc agctccctga 4320 cccgggcccc cgggacttgc catatttccc tgaatattca gaacaagatc tccagtggat 4380 tgacaaactt cccctgaaac aaatccagaa tgggtggtgg actgatacta atgaccaaac 4440 catcctacca gaaaaattag gacaacaggt gttagaacac atccaccgaa ccacccacct 4500 gggggcccgg cggatgatag acctgatcag acgctccaag ctcaaaatca gacatatagc 4560 tgagacggcc agcagtatcg tgacaagttg 
caaagtctgc cagcttaaca acgcataccc 4620 ccaatctcaa gctgcagcag gaacaaggct caggggaacc aggcccggta tctactggga 4680 agtagatttt actgaaataa agccaggaaa gtacgggtac cggtacttac ttgtctttgt 4740 agatactttt tcagggtgga ctgaagcatt cccaaccaaa agagaaactg ctcaggtcgt 4800 agcaaagaaa attctggaag atatccttcc caggtatggc ttccccatcc agatagggtc 4860 agataatggg cccgctttcg tcgctaaggt aagtcaggac ttggcttcca tccttggggc 4920 aaattggaaa ctacattgcg cttacaggcc ccagagttca ggacaggtag aaaggatgaa 4980 tcggacctta aaagagacct taactaaatt gactatagag actggcgcta attgggtagt 5040 ccttctcccc tatgctctgt tccgggcccg taatacccct tacaaactgg gccttacccc 5100 ttacgaaatc atgtatggca gacctccacc cctggttcct agcttaaaag atgacctgct 5160 taagtctgaa acagaaaatg tctctgaatt cttattttcc ttacaagcct tacagaaaat 5220 tcaccaagaa atctggccca agctgaaaga gctatatgag accagtcccc caccgacacc 5280 ccatccgtac cagccgggag actgggtcct ggttaagcga caccgacaag agaccctaga 5340 gcccaggtgg aaaggaccac tccaagtact cctgaccaca cccaccgccc tgaaggtaga 5400 aggcattgcg tcgtggatcc actacaccca cgtcaagcca gtggacccaa cctccgacct 5460 tctggggcca atcacggcgg cggctgaagc accggacacg tggactgtgg acagagctaa 5520 gaacaacccc ttaaaactca ccctgcgccg gcagcatagc tcactgcaaa catgcagtta 5580 ggtagtctaa ctctaacatt ggtcgcccta gtggccgctg gggaaaacat aaagccagct 5640 cctaatccct ttgtctggag attctggctt tatgaaaacc aaacccaccc tgggcaacct 5700 cataagcccg ggaaactagt ggccagtgca gattgcccct cctcagggtg caatagccca 5760 attttactaa attttaccga tttcccagta gccaaaccag tggcaccaat aatatgcttc 5820 gagtatgatc agactgaata caattgtaag cactattggt ggcaccaaag tgccggctgc 5880 ccttataact attgtaacat ccataaatac caatggtggg gtggaaaaga acagatagat 5940 cccagatggc ccttccatcg cagacgagat agagaccttt catatacatg gatagttaga 6000 gacccctgga actcccgctg gaccacgcct caacacgggg ctgtatacta ctcctccgcc 6060 tccacatggc ctagcagtca cctctatctg tggcggggtc tagtgcaggt acggcccctg 6120 gtccatggaa atatccagcg acaagaaaac cgcctgacac aagatttacg tcctttttcc 6180 tggttaaaat tattgcaaga aggattagaa cttgccaacc ttacaggact tcacagcctg 6240 tctggctgct ttctgtgtgc cactctaggg cgtccaccgc 
taaccgctgt ccccctgcca 6300 tggggatcct ccacctctgc ccaagctaac aaccaccgaa acctctcata tgcccctatc 6360 cctaacgtgc cactatacct aaaccccagt caagagaagt ttccctactg tttctcagga 6420 actaattcca gcctctgcaa catcactgca acgcccccta acatcacctt aagggctccg 6480 tcaggcatat tcttctggtg taatggaaca ttatctaaaa acctatcaag cccctctgtt 6540 accaacctac tgtgtcttcc tgtcacatta gttccccggt taactctact tactgccggc 6600 gagttcctag ggtataccgg taactggact agtgctgtta ttcacccaga ccctagaccg 6660 agacctgcac gagccatatt tctccccctc attgcaggaa tctccctcac cgcatccttc 6720 atggcggccg gactggctgg gggagcccta ggtcacaccc tcatagaaag taacaagctg 6780 taccaacaat ttgccgttgc tatggaggag tcagctgagt cccttgcctc cctccagcgg 6840 cagctcacgt ccctagcaca ggtaaccttg cagaaccgga gggccttaga cctactcact 6900 gctgaaaaag ggggaacgtg tatgtttcta aaggaagact gttgtttcta cataaatgaa 6960 tcagggctcg tggaagaccg ggtccaacag ttacgcaagt taagcacaga agtaagaaca 7020 cggcagtttg cttcagctgc agaccaatgg tggaactcat ctatgttttc tctgttagcc 7080 cccttccttg gacccctgct gagtctacta tttctgctta ccgtaggacc ttgtgttgtt 7140 aacagaattt tgcggttcgt taaagaaagg tttaacactg tacaactcat ggtcctcaga 7200 gcccaatacc aacctgtaaa cgctgaaaca gaatcagact tataagaccc aagattggct 7260 ctaaaaaaat acctgaaaag aaaggggggg aa 7292 // ID HERV1_I repbase; DNA; PRI; 8801 BP. XX AC . XX DT 01-JUL-2005 (Rel. 10.06, Created) DT 05-AUG-2008 (Rel. 13.07, Last updated, Version 4) XX DE Human endogenous retrovirus HERV1 - a consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW Internal sequence of human endogenous retrovirus; HERV1; HERV1_I. XX NM HERV1_I. XX OS Haplorrhini OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates. XX RN [1] RP 1-8801 RA Polavarapu N., Bowen N.J. and Mcdonald J.F.; RT "Consensus sequence of human endogenous retrovirus HERV1."; RL Repbase Reports 5(6), 145-145 (2005). XX RN [2] RP 1-8801 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (14-MAR-2006). 
XX DR [1] (Consensus) XX CC LTRs of the element are deposited as HERV1_LTR in CC Repbase. Target site duplications are 4 bp. There is ~6% CC divergence between HERV1 elements and their consensus CC sequence. The HERV1 consensus sequence encodes gag, protease, CC reverse transcriptase, RNase H, integrase and envelope CC proteins. Pol proteins are 50-55% identical to Human CC endogenous retrovirus HCML-ARV. HERV1 intra-element LTR CC identity is 93% to 96%. Gag region: nt 914..2521, Polymerase CC region: 2687..6341, Envelope region: 6657..8552. CC [2] 6% subst level. Only 3 or 4 copies in human genome. ORFs: gag CC 870-2618, pol 2658-6308, env 6534-8768. Close (75% at DNA level CC over full length) to HERV15 and HERV3. XX FH Key Location/Qualifiers FT CDS 870..2618 FT /product="HERV1_I_1p" FT /note="gag." FT /translation="MGGKASKPTPLECMLKNFKKGFDGDYGMKLTPQRLRT FT LXEIDWPSFNVGWXAEGTIDREIIGRVFRVVTGVGEQPGHPDQFPYIDSWL FT SVIQTHPKWLQACFEDYCKTLVARTKQGTIEKTHKAQXQEKEQQGKQEKPV FT LQAPPEELETPPPYVPIYPSLARLRQEATPAAASGGSDSEESTPQTSPRRE FT EPGPLPDKSKEELQDEVGRLRSGRARAMQMPLRETRGQIYLDAQNEVQGGE FT RLYVYQPFSTTDIFNWKQHTPSYTEKPQALIDLMQSIFLTHNPTWADCKQL FT LLSLFNTEERCRVIQAAHQWLESNAPVGTGDVRXYAQQALPIETDPGWDPN FT QAQGLLNLLRYREALIQGIKAGGKKATNTGKVSEVHQKPDESPSEFYERLC FT EAYRLYTPFDPEAAGNQCMVNAAFVSQAQNDIKRKLQKLEGFEGMNISQLI FT QVATKVFVNRDEEAKREAKRRVKEKAEFLAAALVEREAGFARGHGRGHGCG FT HGRGQARPGQETRTGQEGQPRLERDQCARCKQRGHWKDECPEKEKDKGNNQ FT GQNGWTGPPSAAGQGIVGSDVDLIGLAGANDYLED*" FT CDS 2658..6308 FT /product="HERV1_I_2p" FT /note="pol."
FT /translation="MVSMVVGGRKTDFMVDTGAEHSVVTQTIGPLSKDYAN FT IIGATGITEKTPFFKSKRCMIGDQEVQHKFLYLPNCPVPLLGRDLLQKLQA FT QISFTPSGDMTLSLGQKKAMVLTLTVPKAEEWRLYESSCQECGKGYSVAER FT EKLFTDLLLKLPGVWAEDNPPGLAVNQAPVIVELLRGTYPVRIRQYPIPIE FT AXQGIAKHLKWLLEFGIIERCVSSWNTPLLPVLKPSGDYQPVQDLRAVNEV FT AATLHAIVPNPYTMLGQIPASAAWFTCLDIKDAFFCIRLAPISRDIFAFEW FT GPSQYTWTRLPQGFKNSPTIFEEALASDLKAFTPPSDRCVLLQYIDDLLLA FT APTRKECIQGTESLLRVLWEAGYKVSKEKAQICGQGVQYLGFYVSQGRREL FT GRERKETVCSIPQPDTRRQVREFLGAAGFCRIWIPNYSLLAKPLYEATKXG FT EKEPLLWGKEQDMAFKEIKKALIQAPALGLPDMTKPFYLYVHERKEMATGV FT LVQMLGSWYRPVAYLSKQLDLVATGWPPCFKALAATALLAEDANKLTFGQK FT LIIQVPHTVVTLMEQRGHRWLSNPRMLRYQGLLCENPYITLETVNTLNPAT FT LLPIECAEHGKPPLCAPGYHCCVETVDEVFSSREDLKDQPLKDPDVEYFTD FT GSSFISEGIRKARYAVVTLSSVAEARPLPVGTSAQKAELIALTRALLLAKG FT KSVNIYTDSRYAFATLHAHGAIYKERGLLTTEGKEIKNKKEIEQLLEAVWA FT PKEVAVIHCKGHQTGGSDEATGNRKADKEAKRAAMTEXTKKEETYAMPLLE FT PPLAXTPNYSSSEKAWFAQENGSYQKGGWWKFSDGRLAIPEAIAPRFIKQF FT HQGTHMGKTALEILVGRYFXVPRLTAITRAICEQCVTCAQNNPRQGPTRPP FT GIQETGAVPCENLLVDFTELPQAGGYRYMLVFVCTFSGWVEAFPTRTEKAR FT EVTRILLKDIIPRFGLPLTLGSDNGPAFVAEVVQQLTQMLKIKWKLHTAYH FT PQSSGKVERMNRTLKQLLKKFCQETHLRWDQVLPMVLLRVRCTPTKLTGYS FT PYEIVYGRPPPLISQVKGDLKEIGELTLRRQMQALGEVMQEVQGWVRERIP FT VSLTDAIHPFQPGDSVWVKRWNPTTLGPLWDGPHIVIMSTPTAVKVAGITP FT WIHHSRLKPAASVQDRWTSQQDPDHPTRLILQRNQGAAEKDDCPAPTTPEA FT GRSTHG*" FT CDS 6534..8768 FT /product="HERV1_I_3p" FT /note="env." 
FT /translation="MLSLQLLSNLLPRRMPVHSINLITLVTIMLTGMGGNQ FT DNCHHCMIEAWSGKGITKTLLYQTYYECTGTPLGTCVYNQTSYSVCDPGNG FT QPQVCYDPGLLPYDFWFEIQIGKPLLPSYANPKDVGTGKLVSKTQVFPYSH FT KGSVSIYFDACQAAHLSNLNNLGVVCKNLGQERVSSKAAKIITGEPEEECP FT DCNIQWTTHEFSQRLYAGRVALLASQEAKIGCATKTCNPLNLTILKPNMPF FT WTKGHQGELSFDREGANLGIPLIIIKKTQRAKVQVSPMSQFRFFKSFNKHF FT NPKEPKVQIPPMSAENLFAQLAESIATNLGVTSCYVCGGTSIGDQWPWEAR FT ELMPQDNFTIPEFVTKFNANPSVWLLKTPIIGRYCIARWGKDFQTQVGDTT FT CLDQQYFEESKNKTQWRSFIDNSSVPDFNPLSQFPALNQSWYQLDAPNVWK FT APAGLYWICGTKAYQLLPEKWTGACVLGTIRPSFFLLPLKQGEDLSYPVYN FT EERKRTRRNVFTQISTVEKINTNIKKDIEIGSWKDNEWPPERIIKYYGPAT FT WAQDGSWGYRTPIYMLNRIIRLQAVLEIIVNETARALDLLAIQATQMRDAI FT YQNRLALDYLLASEGGVCGKLNLTNCCLQIDDNGRAVMEITARMRKLAHVP FT VQTWSXWSPNSLFGGWFSWFGGFKTLIIGFIAIIGGCLILPCLLPLLIRSI FT QSTIEAIVDRTTTTKIMALQKYQPVPQEEYVPTQEEINDCGALY*" XX SQ Sequence 8801 BP; 2615 A; 1996 C; 2243 G; 1929 T; 18 other; tttctggggg ctcgtccggg attggagatg gcaggtttct gtctcctttg cctgtgggct 60 agagccccag gacgcgggag atttgggacc cttggcgcca ctgggtaaga cttagcccag 120 aaggagaacg gctctcccgt gtcttggagc cttcccctga cagcgcaaac ggaaccgact 180 caggagttgc aggacagtca caggagcagc gcgcaggcag actactgaac cacggtaagg 240 ttgggccctg ggaaagcccg tcccataagg acagaagggg agcttgatca cctcccaggg 300 aacgaccact tatccaaccc agagtggctg ggggcggcag gagtggcctg ccaatttgga 360 tgaacctcgt gtccccacta caagcgaaag tggttcactg gatctggagg cagaaactgg 420 gagtgtgtgg gtgcgtgtga acctacccgg gacacaagga aggcttgttt catccgatga 480 ggtggggtag gagtggtgtg tgtatgtgtg tgaatgtggg agcctaacta ggctcacccg 540 ggacacgaga gaggctcgtt tcatctgatg aggagtcctg gggcagggga ggtgtgtgaa 600 agtgtgtgaa agagatggtc tcgggagagg ccaatgcggg gagtgatgtg gggaggtgca 660 gatctcttag cgcagactgt gtcctccgag gcgagtgtgg gacgagccag acctaggtca 720 ctgcgtaagg ccgacaggat tagcttcata gcttcacagc agtagttggc tgtgacctgg 780 ccaagcagca tccgaacctc ccgtaatagg acccggtctg gtgaatccga gagtgaaagt 840 gagagtgaaa gtgcgccgcg agggaggaaa tgggaggaaa agcatcgaag ccaactccat 900 tagaatgtat gttgaagaac tttaagaaag gctttgatgg tgattatgga 
atgaagttaa 960 caccgcagag gctaagaacn ctttntgaaa ttgactggcc ctcctttaat gtagggtggn 1020 cggccgaggg aaccatagat agggaaataa ttggccgagt gtttcgggtg gtcactgggg 1080 tcggagaaca gcctgggcac cctgatcagt tcccatacat agactcctgg ctgagcgtga 1140 ttcagaccca tccgaaatgg ctacaggcct gctttgaaga ttattgtaag actctagtgg 1200 ctcgtacaaa acaaggaacc atagaaaaga cccacaaagc gcagncccag gagaaagagc 1260 agcaaggaaa acaggaaaaa cctgtcctac aggccccgcc agaagagcta gagactccac 1320 ccccttatgt tccaatttac ccatctctgg caaggcttag gcaggaagcc actccagcag 1380 ctgcctccgg agggtcggac tcagaggaga gtacccccca aacttcacca cgtagggaag 1440 agccagggcc actgcccgat aaatcaaagg aggaactcca ggatgaggtt ggccgtctcc 1500 ggtcaggacg cgcccgagcc atgcagatgc ccctcagaga aactagagga cagatctatt 1560 tagatgcaca aaatgaggtt caaggaggag aacggcttta tgtttatcag cccttctcta 1620 ctacagacat tttcaactgg aagcagcata ctccctccta tacggaaaaa ccccaggctc 1680 ttatcgacct aatgcagtcc atcttcttaa ctcacaaccc aacctgggct gactgcaagc 1740 agctccttct gtcactgttc aatacagaag aacgctgcag agtaatacaa gcggctcatc 1800 agtggctaga aagcaatgcc ccagtaggca caggagatgt caganagtat gcncagcagg 1860 ctttgccaat agaaactgac ccaggctggg acccaaatca ggcccaaggg ctgctgaact 1920 tgctgagata ccgagaggct ctgatacaag gaataaaggc tggagggaaa aaggcaacaa 1980 acactggaaa ggtttcagag gtccatcaga aaccagatga aagccccagt gagttctatg 2040 agaggctttg tgaggcttac cggctctaca caccttttga cccagaagcg gcagggaatc 2100 agtgcatggt taatgcggca ttcgtgagtc aggcacaaaa tgacattaag cgaaagttgc 2160 agaagctgga ggggtttgaa ggtatgaaca tttcccagct tatccaggtg gcaaccaaag 2220 tgtttgtaaa tcgagatgaa gaagcaaagc gggaggccaa gcgtagagtg aaggaaaagg 2280 cagagttctt ggccgcagcc ctggttgaaa gagaggctgg atttgcaaga ggacatggac 2340 gtggtcatgg atgcggtcac ggtagaggac aagctaggcc aggtcaggag accaggacag 2400 gtcaggaagg tcagcctaga ctagagagag atcaatgtgc gagatgcaag caaagagggc 2460 attggaaaga tgaatgtcca gagaaagaaa aggataaagg caacaaccag ggacagaatg 2520 gctggacagg gcctccttcc gccgctgggc agggcatagt aggatccgac gtggatctaa 2580 ttgggctggc aggagccaac gactaccttg aagactgaga cagaccgggc tccatctcat 
2640 taggccccga ggagcccatg gtctcaatgg tagtaggggg ccgaaaaaca gactttatgg 2700 tagacacagg tgctgaacac tcggttgtga ctcaaacaat tgggccgtta tcaaaagact 2760 atgctaatat tatcggggcc acaggtatca cagaaaagac accttttttc aaatcaaaga 2820 gatgtatgat tggagaccaa gaagtccaac acaagttctt atacttgcca aactgcccag 2880 tgccgctgtt gggaagagac ttgctgcaga agctgcaggc tcagatctcc ttcacaccga 2940 gtggagatat gaccttgagc ctagggcaaa aaaaggctat ggtactaacn ctcacagtnc 3000 ccaaagcaga ggagtggaga ctttatgaaa gtagttgcca ggagtgtgga aaggggtaca 3060 gcgtagctga gagagaaaaa ctgttcacag acttactcct taagttacca ggagtctggg 3120 cggaggacaa tcccccnggg ctagcagtga atcaagcacc tgtcatagta gagctactac 3180 gaggaaccta cccggtgcgg atccgtcagt atcccattcc catagaggct naccaaggga 3240 ttgcaaaaca cttaaaatgg ctccttgaat ttggaataat agagagatgt gtctcctcat 3300 ggaatactcc cctactgccg gtgttaaaac cttctggtga ctaccagcct gtacaagact 3360 taagggcagt caacgaggtg gcggctacac tgcatgctat tgtgcctaac ccgtacacta 3420 tgcttggaca aatacctgct agtgctgctt ggttcacatg cttggacatt aaagatgcgt 3480 tcttctgcat ccgattagcc cctataagcc gagacatttt tgcctttgag tggggcccat 3540 ctcagtatac ctggactaga cttccccaag gatttaaaaa ctccccaact atctttgaag 3600 aagcactagc ctcagactta aaggctttca cgccaccnag cgaccgntgt gtcttactgc 3660 aatatataga tgatctgctg ttagccgcac ccacnagaaa ggaatgtatc caaggaacag 3720 agagtctcct tcgagtgctg tgggaagctg gctataaagt gtctaaggaa aaggcacaga 3780 tctgtggcca aggagttcag tatcttggct tttacgtctc ccaagggcgg cgtgagcttg 3840 ggcgggagcg aaaagagact gtctgtagca ttcctcagcc agacacaagg cggcaagtgc 3900 gggaattcct aggggcagct ggtttctgcc gtatatggat ccccaactac tcgctcctag 3960 caaagccttt atatgaggcg accaaagngg gggaaaagga gcccctcctt tggggaaaag 4020 aacaggacat ggctttcaag gaaatcaaga aagctttaat ccaggcccca gcactagggc 4080 tgccagacat gacaaagccc ttttacctgt atgttcatga aagaaaagaa atggctacag 4140 gagtcttagt gcaaatgcta gggtcatggt atcggcctgt agcatatttg tccaagcaac 4200 tggacttggt ggctacggga tggccaccct gtttcaaggc gctggccgcc actgccttgt 4260 tagccgagga tgctaacaag ctcacatttg gacagaagct gataattcag gtgccccaca 4320 
cagttgtcac cctgatggaa cagagaggac atcgttggct ctctaaccct aggatgctaa 4380 gataccaagg gctcctatgt gagaatccat acatcacctt agagactgtg aataccctaa 4440 atccggccac actgttacca atagaatgtg cagaacatgg gaagcccccg ttatgtgccc 4500 cagggtacca ctgctgtgta gaaacagtgg atgaagtctt ttcaagccgg gaagacttaa 4560 aggatcaacc cttaaaagac ccagatgttg aatattttac tgatggaagc agcttcatat 4620 ccgagggtat cagaaaggcc agatatgccg tagtcacatt aagctcggtg gctgaggccc 4680 gccccctacc ggtaggaacc tcagcgcaga aggcagaact aatagctctc acaagagcac 4740 tacttctagc gaagggaaag tcagtgaata tctatactga ctcaagatat gcttttgcca 4800 ccttgcatgc tcatggagcc atttacaaag aaagaggact gttaactact gaaggaaaag 4860 aaataaagaa caaaaaggaa atagagcagc tcctagaggc tgtatgggct ccaaaggaag 4920 tagcagtcat ccactgtaaa ggacatcaaa caggaggaag tgatgaggct acaggaaaca 4980 gaaaagcaga caaggaagca aaaagggctg caatgacaga aanaacaaag aaagaagaga 5040 cttatgccat gcccttatta gagcctcccc ttgcaganac tcctaactac tcgtccagtg 5100 agaaggcgtg gttcgcgcag gaaaacggga gttatcagaa aggaggctgg tggaagtttt 5160 cagatgggag gcttgccatt ccagaagcca ttgccccccg atttataaag cagtttcacc 5220 aaggaacaca catggggaag acagctttag agattctcgt agggcggtat ttctntgtgc 5280 cacgcctaac tgccatcact cgagccatct gcgagcagtg tgttacttgt gcccagaata 5340 acccaaggca agggcctact cggcccccag gaattcagga aacaggagca gtgccatgtg 5400 aaaacctgct tgtagacttt accgaactgc ctcaggctgg aggctatcgg tacatgttag 5460 tgtttgtctg caccttttca gggtgggtcg aggcattccc caccaggaca gaaaaggcac 5520 gagaagtaac caggatatta ttaaaagaca ttattcctag atttggactg cctctaactt 5580 taggctcgga caacggacca gcttttgtgg cagaagtagt acagcaacta actcagatgt 5640 taaaaattaa atggaaactg catacagcct atcatccaca aagttctgga aaggttgaaa 5700 ggatgaaccg gacactgaaa caactgttga agaagttttg ccaagagact catttaaggt 5760 gggatcaggt attgcccatg gtccttctcc gagtcaggtg tacccctact aaattaaccg 5820 ggtattcacc ctatgagata gtgtatggcc gaccaccccc actcatatct caggtaaaag 5880 gagatttaaa ggaaattgga gaactgaccc taagaagaca aatgcaggca ttaggtgagg 5940 taatgcaaga agtacaaggg tgggtaagag aaagaatacc tgttagcctt acagatgcaa 6000 tacatccctt 
tcaacctggg gactctgtat gggttaaacg ctggaatcct accaccctcg 6060 ggcccttatg ggatggcccc catattgtga tcatgtctac ccctaccgct gttaaagttg 6120 caggtattac accttggatc catcacagcc gactgaaacc tgcagcctca gttcaagacc 6180 ggtggacgag tcagcaagat ccagatcatc caactcgact gatcttgcag aggaaccaag 6240 gcgcagcaga aaaagacgac tgccctgctc cgaccacacc ggaggctggt cggtcaacgc 6300 acggctgaag cttgaggaaa cgtcaagccc tgctctagtc acacaactgg aagctgacta 6360 gtctacgcat ggctgaagct tgaggaaacg tcaagccctg ctctagtcac acaaccggaa 6420 gctgactagt ctacgcacgg ccgaagcctg aggaagccag cgctagataa gtaaatgtgg 6480 attgaatttg caagtgtagt tatactattg cttatactga ttgtcttgct gtcatgctat 6540 ctttgcaatt gctatcaaac ttgttgccca ggaggatgcc cgtgcatagt ataaacttga 6600 tcacactagt gacaataatg ctaacaggca tgggaggaaa ccaagataat tgtcatcatt 6660 gtatgataga agcttggtct ggtaaaggta taactaaaac cctgttatat caaacttatt 6720 atgagtgtac aggaactccc ctggggacat gtgtttataa tcaaaccagc tactccgtct 6780 gtgacccggg aaatgggcaa cctcaagtat gttatgatcc aggcctccta ccctacgact 6840 tctggtttga aattcaaata gggaaacctt tgttaccctc atatgccaat cctaaagacg 6900 ttggaactgg gaaactcgta agcaaaacac aggtattccc ttattcacat aaaggatctg 6960 tttctatata ttttgatgcc tgtcaggctg cacacctcag caacctaaac aatctaggag 7020 tagtctgcaa gaacttagga caagaaagag tcagcagcaa ggctgctaag atcataacag 7080 gagaaccaga agaagaatgt cctgattgta acattcaatg gaccacacat gagttcagcc 7140 aacgccttta cgcagggaga gtagctctgc ttgccagcca agaagcaaag attggttgtg 7200 cgactaaaac atgcaacccc cttaatctga ccatattaaa gccaaacatg cctttttgga 7260 ctaaaggaca ccaaggagag ctaagctttg atcgagaagg agcaaatcta ggtattccgt 7320 taatcattat taaaaagact caacgagcta aagttcaagt cagtccaatg tcacagttca 7380 gatttttcaa atccttcaat aaacatttta accccaagga gccaaaagtt cagattccac 7440 caatgtcagc cgagaaccta ttcgctcagc tagctgaaag tattgctact aatcttggag 7500 tcacctcatg ttatgtatgt ggaggtacca gtataggtga ccaatggccc tgggaggcta 7560 gagaactaat gccacaagat aactttacca taccggaatt tgttacaaag ttcaatgcaa 7620 acccaagtgt ctggctatta aaaaccccta tcattggaag atactgcata gcacgatggg 7680 gaaaagactt tcaaactcaa 
gtaggagata caacttgttt agatcagcaa tatttcgaag 7740 agtctaagaa caagacacag tggagaagct ttatagacaa ttcctctgta ccagatttta 7800 accctctctc tcagtttcca gcgctaaatc agtcgtggta tcaactagac gctccaaatg 7860 tttggaaagc accggcagga ctatattgga tctgtgggac aaaagcctac caactattgc 7920 ccgagaagtg gaccggagcc tgtgtgttag ggacaataag accgtccttc ttcttgctcc 7980 cactgaaaca aggggaagat ttaagttacc cagtctataa cgaagaaaga aaaaggacca 8040 gaagaaacgt ctttactcag ataagtaccg tagaaaagat aaacacaaac ataaagaagg 8100 acattgagat aggtagctgg aaagataatg aatggcctcc tgaaagaatt atcaaatact 8160 atgggccagc tacgtgggcc caagatggat catggggcta ccgcacccct atttacatgt 8220 taaaccgaat tataagattg caagcagtac tagaaatcat agtcaatgaa acagctcgag 8280 ccttggatct gttagctata caggccactc aaatgagaga tgctatatac caaaataggc 8340 tagcattaga ctacctccta gcctcagaag gaggagtttg tggcaagctt aatttgacca 8400 actgttgctt acaaatcgat gataatggaa gagctgttat ggaaattact gctagaatgc 8460 ggaagttagc ccatgtccca gtccagactt ggtccnggtg gagcccaaac tcactttttg 8520 gaggatggtt ctcatggttt ggaggcttta aaactttgat aattggtttt atagctataa 8580 ttggaggatg tctaatactg ccttgtcttt tacctctcct catcagaagc atccagtcta 8640 ctattgaagc aatagtggac cgaacaacta ccaccaaaat aatggcacta caaaaatacc 8700 aaccagtccc ccaggaagaa tatgtgccta cacaggaaga gataaacgat tgtggtgctc 8760 tttattaatc tacatttatg gcgagcacca aaggggggga a 8801 // ID LTR1C1 repbase; DNA; PRI; 648 BP. XX AC . XX DT 18-FEB-2009 (Rel. 14.06, Created) DT 18-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; LTR1C1. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-648 RA Smit A.F.; RT "LTR1C1 - ERV1 Endogenous Retrovirus from primates."; RL Repbase Reports 9(6), 1173-1173 (2009). XX DR [1] (Consensus) XX CC 8.5% subst outside CpGs. 45 copies. 
XX SQ Sequence 648 BP; 134 A; 221 C; 177 G; 115 T; 1 other; tgatacagaa cggctgggct cccggctaaa ccccaccctt aagcctggaa ccgcggccct 60 aagtgaaaac agctgacccc gtttttccgc ccaaatgttg cctttttggc ctgccacgcc 120 cctatcctgt gcccataaaa agacttcagc tggcagagca acacaagcgg ctgagcgtcg 180 gggatacaag cggctgagcg ncggggatac aagcggctga gcgtcggaga ctacggatag 240 acgcggctaa cttcagacgg tgcggcttca gggaaagatc accttcttcc cgcaccatcc 300 cctttccaac tccccatccc gctgagagcc acttccatcg cccaataaaa tcctccgcat 360 acactaccct tcaatccgtt cgtgtgacct gattcttcct ggacgccgga caagaacccg 420 ggtgccgaga gggcaggggc ttggacgctg ctgcggggcc cgcacagagc ctgctcccgc 480 cagagaggag cgaccggccg gttccagcgt tcgttccctc cggttcccgc actcgcttgc 540 tcgcacgctc cctctcgcga ggagtggcca gcggcgggct gagtgaaacg agccactcca 600 gttcccgccc acgaaggggg tcaaggtcaa gggaacaatc ccgtctca 648 // ID GarnAlu3B repbase; DNA; PRI; 255 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version 1) XX DE SINE element - consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW GarnAlu3B. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-255 RA Bao W. and Jurka J.; RT "SINE elements from the bushbaby genome."; RL Repbase Reports 11(5), 1744-1744 (2011). XX DR [1] (Consensus) XX CC Compared with GarnAlu3, GarnAlu3B contains an internal CC deletion. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project. 
XX SQ Sequence 255 BP; 67 A; 68 C; 82 G; 38 T; 0 other; gggcggcgcc tgtggctcag tgagtagggc gccggcccca tataccgagg gtggcgggtt 60 caaacccggc cccggccaaa ctgcaacaaa aaaatagccg ggcgttgtgg cgggcgcctg 120 tagtcccagc tactcgggag gctgaggcaa gagaatcgcc taagcccaag agctggaggt 180 tgctgtgagc tgtgacgcca cagcactcta ccgagggcga caaagtgaga ctctgtctct 240 aaaaaaaaaa aaaaa 255 // ID piggyBac2_Mm repbase; DNA; PRI; 2211 BP. XX AC . XX DT 24-MAR-2010 (Rel. 15.06, Created) DT 24-MAR-2010 (Rel. 15.06, Last updated, Version 1) XX DE piggyBac2_Mm: consensus sequence. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac2_Mm. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-2211 RA Pagan H.J.T., Smith J.D., Hubley R.H. and Ray D.A.; RT "PiggyBac-ing on a Primate Genome: Novel Elements, Recent RT Activity and Horizontal Transfer."; RL Genome Biol. Evol 2, 293-303 (2010). XX DR [1] (Consensus) XX CC piggyBac2_Mm is an autonomous DNA transposon reconstructed from CC the Microcebus murinus draft assembly. The element is highly CC similar to piggyBac2_ML from Myotis lucifugus (BLASTN e-value=0, CC 96% identity), suggesting a horizontal transfer event of a CC piggyBac element between mammals. 
XX FH Key Location/Qualifiers FT CDS 277..2115 FT /product="piggyBac2_Mm_1p" FT /translation="MPSLRKRKETNETDTLPEVFNDNLSDIPSEIEDADDC FT FDDSGDDSTDSTESEIIRPVRKRKVAVLSSDSNTDEATDNCWSEIDTPPRL FT QMFEGHAGVTTFPSQCDSVPSVTNLFFGDELFEMLCKELSNYHDQTAMKRK FT TPSRTLKWSPVTQKDIKKFLGLIILMGQTRKDSWKDYWSTDPLICTPIFPQ FT TMSRHRFEQIWTFWHFNDNAKMDSCSGRLFKIQPVLDYFLHKFRTIYKPKQ FT QLSLDEGMIPWRGRLKFXTYNPAXITKYGLLVRMVCESDTGYICNMEIYTA FT ERKKLQETVLSVLGPYLGIWHHIYQDNYYNATSTAELLLQNKTRVCGTIRE FT SRGLPPNLKMKTSRMKKGDIIFSRKGDILLLAWKDKRVVRMISTIHDTSVS FT TTGKKNRKTGENIVKPTCIKEYNAHMKGVDRADQFLSCCSILRKTTKWTKK FT VVLYLINCGLFNSFRVYNILNPQAKMKYKQFLLSVARDWITDDNNEGSPEP FT ETNLSSPSSGGARRAPRKDQPKRLSGDMKQHEPTCIPASGKKKFPTXACRV FT CAAHGKRSESRYLRKFCFVPLXRGKCFMXYHTLKKYSELXFXSLIVVSKIQ FT NVIIYXKTTXKVXY*" XX SQ Sequence 2211 BP; 705 A; 433 C; 453 G; 603 T; 17 other; cacattgcvt accdctcayg agttttctcg tgtttcnbgc accatctgtt aaggaccgct 60 cacgagtttt ctcgtttttc acgcgccatc tgttatggac cttagatgtc aacacactgt 120 cttgtccaca tcgaatgaag cgatctcatt ggtggaaacc gtgcaggtca atctacgaaa 180 aactatataa ttgcacgaac ccataaagca ttgcagttac attgtatttt ggtcattcga 240 atagtcttcg tcttcaagtt cctggcgctt ttagaaatgc cctctctcag aaaaaggaag 300 gaaaccaacg aaactgatac acttccggaa gtatttaacg ataatttatc agatattcct 360 agtgagatcg aagatgcaga tgactgtttt gacgattccg gagatgattc tactgattct 420 actgagagtg aaattattag acctgtaagg aagcgcaagg tggcggtgct ttcaagtgat 480 tccaacactg acgaagctac tgataattgt tggtctgaaa ttgacacacc accacgctta 540 caaatgtttg aaggtcatgc tggggtcact acatttccat ctcagtgtga ctctgtaccc 600 tctgtgacca atctcttttt tggtgatgaa ttgtttgaga tgttgtgcaa agagctgtcc 660 aactatcacg atcaaaccgc aatgaaacgc aaaacaccat ctagaacact aaagtggtct 720 ccggttacac agaaggacat caagaaattc cttggcctaa ttattctgat gggtcaaaca 780 agaaaagata gctggaaaga ctattggtca acagatcctt tgatatgtac ccctatattt 840 ccacagacaa tgagtcgcca tagatttgag caaatatgga cattctggca tttcaatgat 900 aacgccaaaa tggacagttg ctcggggaga cttttcaaga tccaacctgt gctggattat 960 ttcctgcata aatttcgaac aatatacaaa ccaaagcaac agttgtcttt 
ggatgaggga 1020 atgattccat ggagaggacg tttaaaattt crcacrtaca acccagcgaa hataacaaaa 1080 tacggtttac ttgttcggat ggtgtgcgag agtgacaccg gctatatctg caatatggag 1140 atatacactg ctgaaagaaa gaaattgcaa gaaactgttc tttcagtcct tggaccctat 1200 cttggcatat ggcaccatat ttaccaggat aattattaca atgctacatc tactgctgaa 1260 ttgctgctac agaacaaaac tagagtctgt gggactatta gggagagtag aggtttaccg 1320 ccaaatttga aaatgaaaac atcaagaatg aagaaaggtg acataatatt ttccagaaaa 1380 ggcgatattc ttctcctagc atggaaagac aagcgggttg tccgaatgat atcaacgatc 1440 catgacactt ctgtctcgac aacaggaaaa aaaaatagaa aaacgggaga gaatattgta 1500 aaacctacct gcatcaagga atacaatgcc cacatgaaag gcgttgaccg tgcggatcaa 1560 ttcctttcgt gttgttccat tctaaggaaa acgacgaaat ggacaaaaaa agtagtgctg 1620 taccttataa actgtggact tttcaattca tttagagtgt acaacatcct caatccacaa 1680 gcaaaaatga agtataaaca gtttctgcta tcggtggcga gagactggat aacggatgac 1740 aataatgaag gctctccaga accagagaca aatctgtcca gcccttcctc tgggggtgca 1800 aggagagcac ctcgtaaaga tcaacccaaa aggttgtcag gtgatatgaa gcagcatgaa 1860 cctacrtgta ttccagcgag tggaaagaaa aaatttccta cgasagcctg cagagtttgt 1920 gccgcccatg gaaaaaggag cgaatctaga tacttacgta aattttgttt cgtccctctt 1980 crtagaggaa aatgttttat gyagtaccat acgttaaaaa agtactcgga actttwgttt 2040 arttcgttaa ttgttgtaag taaaatacaa aatgttataa tttattgwaa aacaacacyt 2100 aaagtgmatt attgatctta gttatgatga tttaaataac gtgcagtttg cccaaaaacg 2160 tgcggtccct ggcatatgtc ttagagattt ctatgcggta cgtaatgcgt t 2211 // ID MER4A1_LTR repbase; DNA; PRI; 600 BP. XX AC . XX DT 16-MAR-2006 (Rel. 11.03, Created) DT 10-APR-2007 (Rel. 11.03, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; MER4A; MER4A1_LTR; MER4A1__LTR. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. 
XX RN [1] RP 1-600 RA Smit A.F.; RT "MER4A1_LTR - a subfamily of ERV1 Endogenous Retrovirus from RT primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC mer4 group (differs mostly by a deletion). XX SQ Sequence 600 BP; 182 A; 151 C; 113 G; 153 T; 1 other; tgtgaaagga aaataaatct cgggacccca aaatcactaa gccaagggaa aagtcaagct 60 gggaactgcg tcaggcaaac ctgcctccca ttttattcct aaataagata gctacaaaga 120 taaaaaagct acatacctcc ctcacaattt gcccacaagg aaattccttg tggacaaagg 180 acagacagaa ctcaaagtca tccctctgag gctcacctga gacaaatgca tatctgattg 240 cttcctctgc cctattgttt atgtaaaaat gcagattcac tgagccagac taaattgtgt 300 attcagtgaa aggctgatca aggactcaaa agaatgcaac cttttgtctc ttatctacct 360 atgacctgga agcccccgct tcgagttgtc ccgccttncc ggaccgaacc aatgtacatc 420 ttacacatat tgattgatgt ctcatgtctc cctaaaatgt ataaaagcaa gctgtacccc 480 gaccaccttg ggcacatgtc gtcaggacct cctgaggctg tgtcacgggc gcgtccttaa 540 ccttggcaaa ataaactttc taaattgatt gagacctgtc tcagatattt tgggttcaca 600 // ID LTR7A2_OG repbase; DNA; PRI; 269 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV3-type endogenous retrovirus - DE consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR7A2_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-269 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1584-1584 (2011). XX DR [1] (Consensus) XX CC >85% identical to consensus. 5 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 269 BP; 63 A; 96 C; 52 G; 58 T; 0 other; tgatagggcc gaatgtaggg acccgcccaa atagaaccgt cctgcccagg aaagaacgag 60 tccgcccagg aaggaatgag tgcgccaatc cccacccccc caactcttat attcccaccc 120 cgccttaacc cacgtgctga cttctttttc gaactcagcc cgctcgcacc cgagtgaata 180 aaggccacgt tgcccacaca gaacctggac tccgcttcct ttcgttcatg tctcggaata 240 acctctcatt ttcggtcctt aacctctca 269 // ID LTR5_TS repbase; DNA; PRI; 379 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV3-type endogenous retrovirus - DE consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR5_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-379 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1261-1261 (2010). XX DR [1] (Consensus) XX CC ~94% identical to consensus. 5bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 379 BP; 75 A; 158 C; 72 G; 74 T; 0 other; tgttaggctc ggggagaagc agcactcacc ttcccccatc tgcccctcca cccaccgatc 60 ccctatgcgc ccctccaccc caaacccaca aaccacatac cagcttcctg gaacctgccg 120 gcagctgctg agaagggcca gacagaaact ttccagccca cgccggatgc gacgctttgt 180 ctccccgctt cactgccgcc cccaacaccc tcccagatac agcaccctta aaaacccaag 240 cctgcccctc cttgggcgcg actcccctgg cccatctttc cggaccacgg gacctcgccc 300 gggagtttct cccccaataa acgcttggtt tataccggcc ctttgtcctg tgcctctttc 360 ggtatctcca aacgttaca 379 // ID LTR26B repbase; DNA; PRI; 531 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from primates. 
XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; LTR26B_LTR; LTR26E; LTR26B. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-531 RA Smit A.F.; RT "LTR26B - ERV1 Endogenous Retrovirus from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC <15% div. XX SQ Sequence 531 BP; 146 A; 148 C; 94 G; 143 T; 0 other; tgaagccatc ctcacagggt taacaagaat tctggacaga aatatagtta taattaagca 60 ttaatcaggc tgcactttga cccacttcct tgtaaccaaa agtcacataa cactagatac 120 tgaccatttg catccccatt gttcctatag ataggatttc tgacgttaga atcataaggc 180 ttttgtttaa gaattgctta agcagatcct gaattccagt ggaacagctg acgccaacca 240 gtttgaagac ccccacagag gaaccgaatc agcatgagaa tacagtttct tcatctccct 300 gtcccatgac ttcaccctgc actcttcgac caatcaatga tctccacact tcggcccact 360 ccaaaacctt taaaaaccct agccccaaac tcctcgggga gatggatttg aggtttcctc 420 ccatctcctc attcggcggc cctacgatta aacctctttc tctgctgcaa cctggtgtct 480 cggcgtattg acttgccgtg tgcatcgggc aacgaaccta ttacggttac a 531 // ID ERV1-3B_TSy-I repbase; DNA; PRI; 5231 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Internal portion of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-3B_TSy-LTR; ERV1-3B_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-5231 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1196-1196 (2010). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX FH Key Location/Qualifiers FT CDS 1604..2926 FT /product="ERV1-3B_TSy-I_2p" FT /translation="MENGTRYAGAAVVTLQDTIWAEPLPQGTSAQKAELIA FT LTEALKAGTGKTVNIYTDSRYAFATVHVHGTIYKERGLLTAEGKTIKNREE FT ILNLLAAVWAPKKVAVIHCPGHQQGNSNEAVGNRKADQTAKEVARRRTACL FT LALPPPVLCEPPVYTEEDFCAAQKLNATTNPEGWIQLQDGRTFLPQALGRK FT LLRQLHGGTHLGKTKLTELIRKEFYLPHLDQLAQELIQGCIPCQQTNATKV FT SVSTXGARHRGVTPGRHWEIDFTEVRPGQYGYKYLLVLVDTFSGWVEAFPT FT KTETASVTVKKLLNDIIPRYGLPISIGSDNGPAFVSKLTEGIAKALGINWK FT LHCAYHPQSSGQVERMNQTLKETLTKLTLETGEGWVTLLPYALLRARCTPY FT IKGLTPFEILFGRPPPLLPXVGLEKALELSNHNLLQSLQALQKVQNQ" FT CDS join(630..740,772..1233) FT /product="ERV1-3B_TSy-I_1p" FT /translation="MGQQGSKPRTPWSWSYIILRIFRMLQSSREIECLKGN FT IRKLQLATRGHFQPLNSRESPANCLPVPPRPDSVHHCLVHSYPAAPGLDEK FT AGTNSSHVLALQEAKPPILQQPPEDEENPRHIRWWDKGRVFKSREFLARLI FT LEVVLPTSPLLAHHAPLLTGHPPRVRQPQARTQLLRPWKKPRRSSLCGKPR FT G" FT CDS join(3613..4104,3993..4643,4445..5068) FT /product="ERV1-3B_TSy-I_3p" FT /translation="MPQAGHFHCAEWGCETLSPWKTEDPFIRLQRQSPAPT FT CNQLETCNPLKLTVIQWDAYGIQKIWEKGRAWGLRLYVSGRDPGTEFAIRR FT KVIPHVLSPMGPNRNLFLPNPQARPQAPKPTQDKPRSVNLTTVAATSKPIA FT TLKPTRPMLLFQLPHLGTKIFTSPCFNNCCCNLKAYCNLKAYKTNVTLPIT FT PLGNENLYLPMLAAVHRLAIATQPNLARGCWLCLDPSPPYYVGVAVNSSIG FT NTGGHIQVLNHTTTQLRGLCPWGQNPALTLGDLQGQGACLHTSSYNIESSP FT YREACLASSRINASQLGTPEGLNMVVAPPTTWFACTRGITPCVALELLXDT FT PSSVFWCTYCPRCTLLKGPQDGNNSNSRTLDGPKGSPSYGCCTPHYLVCLH FT TGHNPVRRPRAPXRHSELCVLVHILPQVYITEGPAGWEQLELQNFRRTKRV FT PILVPILVGLGIAGSAAMGAAALAIGDQNLKELSRHVDADLSNLETSISQL FT EQQVDSLAEVVLQNRRGLDLLLMKEGGLCAALGEACCFYANRSGIIRETLA FT LVRENIRHRESKRQASENWYQSLFSWSPGSQPWSQPLPDLCS" XX SQ Sequence 5231 BP; 1341 A; 1481 C; 1295 G; 1104 T; 10 other; tctgggggct catccgggat accccagtgc ttaatccacg ggttttaagc cgaccggcct 60 tgtgagcttg ctagcctgca aactcccgaa tagctgagcc gtccctggca ggctcacaac 120 gccttttgga gccctcagag aggtaagtca ggttctcatt ttgtcttgcc tttgtctgct 180 tttgtctgcg gtagtgcgca ctctgggttg gactcgcagg aaagctggac gcagcggtaa 240 ttccagagtc cgaggtcccg gtgagacgtc accgggtcct ctggtttgga accctgagcg 
300 accaaccgtc ggataagcca ggagctgaaa caggctccgg agctgaaaga ggctcaggcg 360 gccgattggt ggtggatggg ctcagtgtcc attgttttgg tgtcccttgt ttcccgtttg 420 gaactcagag cggccagttg tcggataagc caggagctga agcgggctcc agagctgaaa 480 gaggctcagg cggccaacta acgaataggc tcggtgtcca atatctccca aggaccgtta 540 ggtcctcctg ctctgtgtgt gtctttgttt cgttgtgtct tgtgtttgta actattgtaa 600 ccctcatttc cattgctgca gttgaaggta tgggacagca ggggtctaag cctcgcaccc 660 cctggagctg gtcttatatc attttaagga ttttcaggat gctgcagagc agtcgggaga 720 tcgagtgctt aaagggaaac taaccacttt ctgctccctt gaatggccta aattcgcaaa 780 ctgcagctgg ccaccagagg gcactttcaa cccctcaata gcagagagag tccagcgaat 840 tgtctgccgg taccacccag accagattcc gtacatcact gcctggtaca ttcttatcct 900 gcagcgcccg gcctggatga aaaggccggg actaactctt ctcatgtttt agccctacag 960 gaagctaagc cccccatcct ccagcaacca ccagaggatg aggagaaccc ccgccatatc 1020 cggtggtggg acaagggcag ggtgttcaag agcagggaat tcttagcccg gctcatacta 1080 gaagtggttc tccctaccag ccccctcctg gcccaccatg ccccactctt aaccggccac 1140 ccgccgaggg tccgacagcc ccaagcccgg actcaactac tgagaccttg gaagaaaccc 1200 cggcgatcct ccctctgcgg caagccccgg gggtagatgt gggtgggacg ggaccccgac 1260 cctttatggt ttatgtcccc ttttctacta gtgacctcta caattgggac aggacctaac 1320 tgtcacagcc acacatgcga tagaggccct tctccggggg gccccagcwa aatggatttc 1380 caatgcccgg ctcacccatt atcaggcatt gcttctcaac cagccacgca tcaggttcca 1440 gcgaactgcc agcctaaatc ccgccaccct gctgccggcg gggacctgcg gcacccatga 1500 ctgcatcgag ctcactgagt tcctccagaa gccgcgcccg gacctaacgg acgttccctg 1560 gccagaacca gacttaaccc tctacacgga cggcagtagc ttcatggaaa atggaaccag 1620 gtatgcgggg gcagcggtgg taacgctgca ggacacaata tgggccgagc cccttcccca 1680 agggacctca gcccaaaagg cagaattaat cgccttgact gaagccctaa aagcaggcac 1740 aggaaagact gtcaacattt atactgacag caggtatgcc tttgcaactg tgcatgtaca 1800 tgggacaatc tacaaggaaa gggggttgct taccgctgaa gggaaaacta taaaaaatag 1860 agaggaaatt ttaaacctct tggcggcagt ttgggcccca aagaaggtgg ctgtcataca 1920 ctgcccaggc caccagcagg ggaactcaaa tgaagcggtt ggcaacagga aagcggatca 1980 aacggccaaa 
gaggtggccc gccgccgcac tgcctgcctg ctggcactac ctcccccggt 2040 actatgtgag ccgccggttt acacagagga agatttctgt gcagcacaaa aactgaatgc 2100 gaccacgaat ccggagggat ggatacagct ccaggacggg aggaccttcc tcccccaagc 2160 cctgggacgc aagctcctac ggcaacttca cggaggcact catctgggaa aaactaagct 2220 aactgaactc ataagaaagg agttctactt gccccacctg gaccagctgg cccaggaact 2280 catccaaggc tgcataccct gccagcagac taacgccact aaggtcagtg tatccactcm 2340 aggggcccga caccggggag taacccctgg gagacattgg gaaatagatt ttacagaggt 2400 aaggccaggg caatatggat ataaatactt gttagtcttg gtagacactt tctcgggatg 2460 ggttgaagca tttcctacga aaacagaaac tgcatctgta acagttaaaa aacttttaaa 2520 cgacatcatc ccgagatatg gtctmcctat ctcaatagga tcagataatg gkccggcctt 2580 tgtctccaaa cttacggaag gtatagcaaa ggctctgggg ataaattgga aattacactg 2640 tgcttatcat cctcaaagtt caggacaggt agaaagaatg aatcaaacac ttaaggagac 2700 cttaaccaaa ttaaccctag agaccggcga gggatgggtg acgctccttc cctatgccct 2760 cctaagggca agatgtaccc cctacatcaa gggcctaacc ccctttgaaa tccttttcgg 2820 gcggcccccg ccgctacttc caamagtagg cttagaaaaa gcgctagaac tgtctaacca 2880 taatttactt cagtcactac aggcacttca aaaagtacaa aaccagtgac tgagaccgtg 2940 cgggccgcct tccaggagcc gagtccggta aaccctcacc catccagccc ggagaccaag 3000 tatgggtcaa acggcacaaa accgaggctc taaccccaag atggaaaggc ccacacgtgg 3060 taatcttgac taccccaacg gcggtgaagg tcagcggtgt ccgtccctgg atccatcatt 3120 cccaactaaa gaaggcagac cccctgccga gaccagcgag gaaccgccga tgaaatggaa 3180 ggtcatcaag ccggccgccg gagaagaccc actaaagata agactttcgt gggccccgat 3240 tccctaggtc catcgtgtaa ctggaggcaa ctgtgtcccg ccctcccagg ggggctagta 3300 gccctcctta ttcttttaag ccttccccaa tgtacagcat attaccwtgc cccgggaaac 3360 tattctggga actggcaaac gctgtcactg gaactgtagt ctccacctac cacgggaatg 3420 ggcagccggt ctttaaattt gacctttgta gactgtggcc cagtttagcc ataactgagt 3480 atgattaccg cctggcactg agcggttcaa cccttggaaa aggggctggg gatgcatttc 3540 caaggacatt gaagatgaaa aacaggttgc agcgtataac ttctatgcct gcccggctgc 3600 tgsccmaggc caatgccaca agcagggcac tttcattgtg ccgaatgggg ctgtgaaaca 3660 ctttccccct ggaaaaccga 
ggaccccttc atacgccttc aaaggcaaag ccctgcgccc 3720 acttgtaatc agcttgaaac atgtaacccc ctaaaactca ccgttataca atgggatgct 3780 tacggcatcc agaaaatatg ggaaaagggc agagcatggg gactcagact gtatgtttca 3840 ggaagggacc ccggaacgga gtttgccata cgaaggaaag ttatacccca tgtcctttcc 3900 cccatggggc ccaaccgtaa tctctttctg ccaaaccctc aggcaagacc tcaggccccc 3960 aaacccactc aggacaagcc caggtcagtt aatttaacaa ctgttgctgc aacctcaaag 4020 cctattgcaa ccttaaagcc tacaagacca atgttactct tccaattacc ccacttggga 4080 acgaaaatct ttacctcccc atgttagcag cagtccacag gctggccatt gccacccaac 4140 caaacttggc caggggctgc tggctatgcc tagatcccag ccccccatat tatgtggggg 4200 tggccgtaaa ctcctcaata ggtaataccg gggggcacat acaagttctt aaccatacta 4260 ctacccaact caggggatta tgtccatggg gacagaatcc tgctctcacc ctgggagatc 4320 tccaagggca aggagcctgt ctccacactt catcctataa catagaaagc tccccctatc 4380 gggaagcttg cttggcttct tccagaataa atgcctccca gctgggcact cccgaaggcc 4440 ttaatatggt tgttgcaccc cccactacct ggtttgcctg cacacggggc ataaccccgt 4500 gcgtcgccct agagctcctc maagacactc cgagctctgt gttttggtgc acatattgcc 4560 ccaggtgtac attactgaag ggcccgcagg atgggaacaa ctcgaactcc agaactttag 4620 acggaccaaa agggtcccca tcctagtgcc tatccttgta ggattgggaa ttgcagggtc 4680 agcmgccatg ggggccgctg cgctggccat aggagaccaa aatcttaagg aactaagcag 4740 gcatgttgac gcagatctgt ctaaccttga gacaagcata tcccaactag aacaacaagt 4800 agactcttta gctgaggtgg tgctccagaa taggagggga ctagacctcc tgctcatgaa 4860 agaaggaggg ctctgtgcgg cattaggaga agcctgttgc ttttacgcca acaggtccgg 4920 aataatccgg gagacactcg ccctggtcag agaaaatatt cgccacagag agtctaaaag 4980 acaggcttca gaaaactggt accaatcctt attttcctgg tcccctggct cacaaccctg 5040 gtctcagcca ttgccggacc tctgctctta atcctaatat tcgtaaccgt ggggccatgc 5100 ctaattcaat gtcttctaaa ttacataagg aacaggctaa caaccacaaa cctgatgtta 5160 ctgagaaccc aagttaagtc cctgcacgac gagtcaatga tttgactcta ggaacaactc 5220 aaagggggga a 5231 // ID L1-1_Cja repbase; DNA; PRI; 6561 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.11, Created) DT 18-OCT-2009 (Rel. 
14.11, Last updated, Version 2) XX DE L1-type non-LTR retrotransposon (consensus). XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-1_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-6561 RA Jurka J.; RT "L1-type elements from the marmoset genome."; RL Repbase Reports 9(11), 2846-2846 (2009). XX DR [1] (Consensus) XX CC ~95% identical to consensus. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project. XX FH Key Location/Qualifiers FT CDS 1406..2431 FT /product="L1-1_Cja_1p" FT /translation="MGRNQRKKEENTRNQNTSPPTRDHNSSPAREQSWTEN FT ECDEMTESDFRRWVMRNFRELKEHVLTQCKETKNLEKRFDEMITRMDNLER FT NMSELMELKNTTRELREACTSFNSRIDQAEERISEVEDQLNEIKREGKIRE FT KSAKRNEQSLQEMWDYVKRPNLRLIGVPECDEENESKLENTLQDIIQENFP FT NLARQANIQVQEIQRTPQRYSSRRATPRHIIVRFTRVEMKEKMLRAAREKG FT RVTHKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFI FT SEGKIKSFANKQVLRDFVTTRPALQELLKEALHIERNNQYQPLQKHTKC*" FT CDS 2495..6322 FT /product="L1-1_Cja_2p" FT /translation="MAVSNSHITILTLNVNGLNAPIKRHRLANWIKSQNPS FT VCCIQETHLTCKDTQRLKIKGWRKIYQANGEQKKAGVAILVSDKIDFKATK FT IKRDKEGHYIMVKGSIQQEELTILNIYGPNTGAPRYIRQVLNDLQRDLDSH FT TIIVGDFNTPLSILDRSTRQKINKDIQDLNSDLEQANLIDIYRTLHPKSTE FT YTFFSAPHHTYSKIDHIIGSKSLLSKCKRTEIITNSLSDHSAIKLELRIQK FT LTQNRTASWKLNNWLLNVDWINNEMKAEIKKFFETNENEDTTYQNLWDTFK FT AVSRGKYIAISAHMRRVERSKIDTLSSKLKELEEQDQKNSKPSRRQEITKI FT RAELKEIETRKTLQKINKSRSWFFEKINKIDRPLARLIKKKRENNQIDAIK FT NDKGEITTDPTEIQTIIREYYKQLYAHKLVNLEEMDKFLDTCVLPSLNQEE FT VETMNRPITRSEVEAAIKSLPHKKSPGPDGFTAEFYQTHKEELLPFLLKLF FT QIIQKEGILPKSFYETNIILIPKPGRDSTRKENFRPISMMNIDAKIFNKIL FT ASRLQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIHHINRTKNKNHMIIS FT IDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIKAIYDKPTANIILNGQKLE FT AFPLKSGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQIGKEEAKLSLF FT ADDMIVYLEDPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRLKES FT 
QIKNELPFTIATKRIKYLGIQLTRNVKDLFKENYKPLLNEIREDTNRWRNI FT PCSWLGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLNFIWNQKR FT ARIAKSILSKKNTAGGITLPDFKLYYKATVIKTAWYWYQNRDIDQWNRTEA FT SEATQHIYNHTIFDKPDKNKQWGKDSLFNKWCWENWLAMCRKQKLDPFLTP FT YTKINSRWIKDLNIRPNTIKTLEENLGKTIQDIGVGKDFMTKTPKALATKA FT KIDKWDLIKLHSFCTAKETVIRVNRQPTEWEKIFAVYPSDKGLISRIYKEL FT KQIYKKKTNKPIQKWAKDMNRHFTKEDIHEANKHMKKCSSSLVIREMQIKT FT TLRYHLTPVRMAIIKKSGDNRCWRGCGEIGTLLHCWWECKLVQPLWKTVWR FT FLKDLEIEIPFDPAIPLLGIYPKDYKSFYYKDTCTRMFIAALFTIAKTWNQ FT PKCPSMIDWTGKMWHIYTMEYYAAIKNDEFVSFVGTWMNLENIILSKLTQE FT QKMKYRMFSLIGGC*" XX SQ Sequence 6561 BP; 2505 A; 1497 C; 1376 G; 1182 T; 1 other; gggggaagat ccaagatggc cctttaggag cagctcagga ttgcagctcc cagtgaaagc 60 gcagagggtg agtggacgcc gcatttccag acggatcttt attgcccaca gaccaggaga 120 ttcccaggcg gaggagcccc acgggtcgcc agcgcggctg tttcggccgg cgcggctgtt 180 tcggccggcg ccccggcgcg gcggctctcc gtacaaaata cactggtctg gttgcccttt 240 taagctggca attggagctc cgggaaggca gattcgccca ttcatctgat taaacgggnc 300 tgaaacaggg agccaggcca ggagattccc gggcagcgcc gttgtttcag ccggcgcagt 360 gggtcgccgc acgggaaatc acacagatcc cggcgccctt tcagcaggcg actggaacac 420 ctgggagaga gtcaaccgtt caacttaaaa aaaaaaaaag gctctgaggc agggagccag 480 gtgatcaggc tcggcgggtc ccacccccac aaaaacaaac aaacaaaaaa aaacagcaat 540 tggaaacgct cggggttgag agtttcacgg caagcacagc tgaacccggg acggtccagc 600 tccgtggggg aggggcgtcc gccattaccg aggcactccg cccctacgga ggtagtccgc 660 cattgctgag gcagcccgcc gttgccgagg caacccgcca taacagagag agtccgccat 720 tacagaggcg ggccaccatt gccgaggcag ttctaaccac acccatataa acaggactgc 780 agggaagttc acacggcagc agggcggagc ccacagcagc tcagcaaagc ctctgcaggc 840 agacagtgac taggctgcct ccttgctggg cagggcagcc ctgaaaaaaa aaaaaaggca 900 gcagcacaac ggatactcat aaataaagcc ctaactcccc gggacagagc acctggggaa 960 aaaaaggggg tttatgagtt ctgctgcagc agacttaaac gtacctgcct agcagctctg 1020 aatgaacaac ggagctcaca gctcagcact tgagctccta taaagaacag actgtctcct 1080 caagcagctc cctgaccccc gtatatccaa agagtcacct cacaaaggac tgatcagact 1140 gacatttggc gggcatcatt ctgggacaaa 
gatagcagaa gaagaaactg gtagcaaccc 1200 ttactgttct gcagctgctg caggtgatcc ccaggcaagc agggcctgga gtggacctca 1260 gcagtcctac agcagagggg ccagactgtt agaaggaaaa ctaagaaaca gaaataactt 1320 catcatcaac aatctggacg tccactcaga gacccaatct gaaaatcagc aactacacag 1380 acgacaggtg gataaatcca caaagatggg aagaaaccag cgcaaaaagg aggaaaacac 1440 ccgaaaccag aacacctctc ctcctacaag ggatcacaac tcctcaccag caagggaaca 1500 aagctggacg gagaatgagt gtgatgaaat gacagaatca gacttcagaa ggtgggtaat 1560 gagaaacttc cgtgagctaa aagaacatgt tctaactcaa tgcaaagaaa ctaagaacct 1620 tgaaaaaaga tttgacgaaa tgataacaag aatggacaac ttagagagga atatgagtga 1680 attgatggag ctgaaaaaca caacacgaga acttcgcgaa gcatgcacaa gtttcaacag 1740 ccgaattgac caagcagaag aaaggatatc agaggtcgaa gatcaactca atgaaataaa 1800 acgagaaggc aagattagag aaaaaagcgc aaaaaggaat gaacaaagtc tccaagaaat 1860 gtgggactat gtgaagagac ctaatctacg tttgataggt gtacctgaat gtgacgaaga 1920 gaatgaatcc aagctggaaa atactcttca ggatattatc caggaaaact tccccaacct 1980 agcaaggcag gccaatattc aagtccagga aatacagaga acaccacaaa gatattcctc 2040 aagaagagca accccaaggc acataatcgt cagattcacc agggttgaaa tgaaggagaa 2100 aatgctaagg gcagccagag agaaaggtcg ggttacccac aaagggaagc ccatcagact 2160 cacagcagat ctctcggcag aaaccctaca agccagaaga gagtgggggc caatattcaa 2220 catccttaaa gaaaagaact ttcaacccag aatttcatat ccagccaaac taagcttcat 2280 aagtgaagga aaaataaaat ccttcgcgaa caagcaagta ctcagagatt ttgtcaccac 2340 caggcctgct ttacaagagc tcctgaaaga ggcactacac atagaaagga acaaccagta 2400 ccagccactc caaaaacata ccaaatgcta aagagcatca acaaaatgaa gaatctgcat 2460 caactaacgg gcaaaacagc cagctagcat caaaatggca gtatcaaatt cacacataac 2520 aatattaacc ctaaatgtaa atgggctaaa tgcaccaatc aaaagacaca gactggcaaa 2580 ttggataaaa agccaaaacc catcggtgtg ctgtatccag gaaacccatc tcacatgcaa 2640 ggatacacaa aggctcaaaa taaagggatg gaggaagatt taccaagcaa atggagagca 2700 aaaaaaagca ggagttgcaa ttctcgtctc tgataaaata gactttaaag caacaaagat 2760 caaaagagac aaagaaggac attacataat ggtaaaagga tcgatacaac aagaagagct 2820 aacgatccta aatatatacg gacccaatac aggagcaccc 
agatacataa ggcaagttct 2880 taatgactta caaagagact tagactccca cacaataata gtgggagact ttaacactcc 2940 actgtcaata ttagacagat caaccagaca gaaaattaac aaggatatcc aggacttgaa 3000 ctcagacctg gaacaagcaa acctgataga catttacaga actctccacc ccaaatccac 3060 agaatataca ttcttctcag caccacatca cacctactct aaaattgacc acataattgg 3120 aagtaaatca ctcctcagca aatgcaaaag aacggaaatc ataacaaaca gtctctcaga 3180 ccacagtgca atcaagttag aactcagaat tcagaaacta actcagaacc gcacagcttc 3240 atggaaactg aacaactggc tcttgaatgt tgactggata aacaatgaaa tgaaggcaga 3300 aataaagaag ttcttcgaaa ccaacgagaa cgaagacaca acataccaga atctctggga 3360 cacatttaaa gcagtctcta gaggaaaata tatagcaata agtgcccaca tgagaagagt 3420 ggagagatcc aaaattgaca ccctatcgtc aaaattgaaa gagctagagg agcaagatca 3480 aaaaaactca aaacctagca gaagacaaga aataactaag atcagagcag aactgaagga 3540 gatagagaca cgaaaaaccc ttcaaaaaat caataaatcc aggagctggt ttttcgaaaa 3600 gatcaacaaa atagacagac cactagccag attaataaaa aagaaaagag agaataacca 3660 aatagatgca ataaaaaacg ataaagggga aatcaccaca gatcccacag aaattcaaac 3720 catcatcaga gaatattaca aacaactcta tgcacataaa ctagtaaacc tggaagaaat 3780 ggataaattc ctggacactt gcgtcctccc aagcctaaac caggaagaag tcgaaaccat 3840 gaatagacca ataacaagat ctgaagttga ggcagcaatt aagagcctac cacacaaaaa 3900 aagcccaggt ccagatgggt tcacagccga attctaccag acacacaaag aggaactgtt 3960 accattcctt ctgaaactat tccaaataat ccaaaaagag ggaatccttc ccaaatcatt 4020 ttatgagacc aacatcatcc tgataccaaa acccggcaga gactcaacaa gaaaagaaaa 4080 cttcaggcca atatccatga tgaacataga cgcaaaaatc ttcaataaaa tactggcaag 4140 ccgattgcaa cagcacatca aaaagcttat ccaccatgat caagtaggat tcatcccggg 4200 gatgcaaggc tggttcaaca tacgcaagtc tataaacgta attcaccaca taaacagaac 4260 caaaaacaaa aaccacatga ttatctcaat tgacgcagag aaggcctttg acaaaattca 4320 acagcccttt atgctaaaaa ccctcaataa actcggtatt gacggaacgt atctcaaaat 4380 aataaaagct atttacgaca aaccaacagc caatatcata ctgaatgggc aaaaactgga 4440 agcattccct ttgaaatctg gcactagaca aggatgccct ctctcaccac tcctattcaa 4500 tatagtactg gaagttctag ccagagcaat caggcaagaa aaagaaataa 
agggtattca 4560 aataggaaag gaggaagcca aattgtctct atttgcagac gacatgatag tatatctaga 4620 agaccccatc gtctcagccc aaaatctcct gaaactgata agcaacttca gcaaagtctc 4680 aggatacaaa atcaatgtgc aaaaatcaca agcattccta tacaccaata acagacttaa 4740 agagagccaa atcaagaacg aactgccatt cacaattgct acaaagagaa taaaatacct 4800 aggaatacaa ctaacaagga acgtaaagga cctcttcaag gagaactaca aaccactgct 4860 caacgaaata agagaggaca caaacagatg gagaaacatt ccatgttcat ggttaggaag 4920 aatcaatatc gtgaaaatgg ccatactgcc caaagtaatt tacagattca acgctatccc 4980 catcaagcta ccaatgacct tcttcacaga actggaaaaa accaccttaa acttcatatg 5040 gaaccaaaag agagcccgca tagccaagtc aattctaagc aaaaagaaca cagcgggagg 5100 catcacacta ccggacttca aactatacta caaggctaca gtaatcaaaa cagcatggta 5160 ctggtaccaa aacagagata tagaccaatg gaacagaaca gaggcatcgg aggcaacaca 5220 acatatctac aaccatacga tctttgataa acctgacaaa aacaagcaat ggggaaagga 5280 ttccctgttt aataaatggt gttgggaaaa ctggctagcc atgtgcagaa agcagaaact 5340 ggaccccttc ctgacacctt acactaaaat taactccaga tggattaaag acttaaacat 5400 aagacctaac accataaaaa ccctagaaga aaatctaggc aaaaccattc aggacatagg 5460 agtaggcaag gacttcatga ccaaaacacc aaaagcattg gcaacaaaag ccaaaataga 5520 caaatgggac ctaatcaaac tccacagctt ctgcacggca aaagaaacag tcattagagt 5580 gaatcggcaa ccaacagaat gggaaaaaat ttttgcagtt tacccatctg acaaagggct 5640 gatatccaga atttacaaag aactaaaaca gatttacaag aaaaaaacaa acaagcccat 5700 tcaaaagtgg gcaaaggata tgaacagaca ctttacaaaa gaagacatac atgaggccaa 5760 caaacatatg aaaaaatgct catcatcact ggtcattaga gaaatgcaaa tcaaaactac 5820 attgagatac catctcacgc cagttagaat ggcgatcatt aaaaaatctg gagacaacag 5880 atgctggaga ggatgtggag aaataggaac acttttacac tgctggtggg agtgtaaatt 5940 agttcaacca ttgtggaaga cagtgtggcg attcctcaag gacctagaaa tagaaattcc 6000 atttgaccca gcaatcccat tactgggtat atatccaaag gactataaat cgttctacta 6060 taaggacaca tgcacacgaa tgttcattgc agcactgttt acaatagcaa agacctggaa 6120 ccaacccaaa tgcccatcga tgatagactg gacagggaaa atgtggcaca tatacaccat 6180 ggaatattat gcagcaatca aaaacgatga gttcgtgtcc tttgtaggga catggatgaa 
6240 cctggagaac atcattctca gcaaactgac acaagaacag aaaatgaaat accgcatgtt 6300 ctcactcata ggcgggtgtt gaacaatgag aacacatgga cacagggagg ggagcactac 6360 acactggggt ctgttggggg gaatagggga gggacagcgg ggggtgggga gttggggaga 6420 gatagcatgg ggagaaatgc cagatatagg tgaaggggag gaaggcagca aatcacactg 6480 ccacgtgtgt acctatgcaa ctatcttgca tgttcttcac atgtacccca aaacctaaaa 6540 tgcaataaaa aaaaaaaaaa a 6561 // ID MacERV4_LTR1a repbase; DNA; PRI; 438 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Cercopithecidae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; LTR; KW MacERV4_LTR1a. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-438 RA Smit A.F.; RT "MacERV4_LTR1a - ERV2 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 1.4%. XX SQ Sequence 438 BP; 96 A; 147 C; 78 G; 117 T; 0 other; tgtccggagc tgcacgcccc ggccatagcg aataataatt gaggattaaa cgcctgagct 60 atattcattt ccaccccaca ccttctccct atctttgcct tttttcccct gtactaatac 120 ctcattaaag atggcgctct tcctgcttct tcttcactca cttttcccgc gcccgggaaa 180 attgttactt aatagcgcaa gcgcaacatg acgtccgacc ggagaaaccg aaactaacct 240 ggccacgccc tcggcaatga gatcatttcc gccttagccc aaccccttcc cttccaagtg 300 tatataaggc agtgcattac cgccattaaa cgagacttga tcagagcact gtcttgtctc 360 catttctcgt gtctcttgtt ccccaaattc ccaccccctc ctccagggcc tgctctgact 420 atcccgcggg ccgggata 438 // ID L1-1b_TS repbase; DNA; PRI; 5968 BP. XX AC . XX DT 03-MAY-2010 (Rel. 15.05, Created) DT 03-MAY-2010 (Rel. 15.07, Last updated, Version 2) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-1b_TS. XX NM L1-1b_TS. 
XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-5968 RA Bao W. and Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(5), 767-767 (2010). XX DR [1] (Consensus) XX CC ~92% identical to the consensus. The 5'-end is different from CC that of L1-1_TS. XX SQ Sequence 5968 BP; 2331 A; 1417 C; 1099 G; 1118 T; 3 other; aggagactcc tgaagatggc gacggcatag gtggatccct gagactctcc gaacgtgagt 60 ttcccaaaaa gactattact tttcttcctg cacccccctg cccaccgcca gccgcactgg 120 tacgaggcct ggggactttt ggtcccacgc agggccagac attgcggccg gccgcggctg 180 ctccactata tgtgattccg gccggctggg gaatcccaaa tcccggatcc cagctccmtc 240 cctggctaca tcgtcatgac cagcacccag caccaagctg ccggtcaccc agggaggctc 300 tgggcgcccg cccctcagag ataccggtcc catttctggc tgaagtgccc tgtgagggac 360 ccagagccac cacctgccct ctggagcttt ggagatctga agccccgggc ccaccctggc 420 tgaggcactc agcgtgcgac cacaagcctg tgtaaggggg cagaacccca agctggtgta 480 aggtgccgat cgcaggcttc cagctccaga ggagggaacg cccgaccctc tcctccaccc 540 cagcctgctg gaggcacccc agatccagcg cccagctgtg cagctggtcc cgcctagaga 600 cccacaggag catccctaaa agtgaaagta aaactaccac ctccgccatt ttcaaggcat 660 atttacatcg catcgccccg cccacctatc agagcctcca cctcggttcc cacggtgacg 720 tcattaaaca ggagacaaag cccctcccct acgctgaata gagaccagtg tcacagatct 780 cccaggcagc aggctaactg cctgcaacaa aaacagaggg gagaggctgg agattagagc 840 tcaagggact ttatccacta agggacagag agaaaaatct acacaaatga agaaaaacca 900 aaagaagaat atgggtctct cccagactcc tgggagggca gacactgaga aaactgactt 960 tggaatgcaa acaatgaaga gcccccagaa tgactggtct caaaatacaa acctagacat 1020 caagacatta atggagagat taaaaagaat tgaggagact caagaagaaa ctagaaagga 1080 gctgatatct gagataacag taataaagaa tactgtgaat gaaataaaca acaaactgat 1140 aagcatggaa agcagaatta cccaagcaga agaaagaatc tcagagcttg aggaccaaaa 1200 tatagaamta acccaaactc tcaaaaacac agaaaacaag ctcaaaaaga cagaaaaaaa 1260 accttcaaga gatgagcgat 
tacctcaaga ggcctaacct aagagtaatt ggtctgcctg 1320 aggcagaaag agagacagag accacactgg aacaaacctt ccatgagatc attcaagaaa 1380 acttccctta tctaatcaat gatgcaaaaa ctcaaacaca agagatccag agaacccctg 1440 caagacaaca aatgagaaaa ccaactccta gacacataat aattcgccta aacaaagtag 1500 gcataaaaga aaaaatccta aaggcagcaa gagaaaaagg ccagatcacc taccatggaa 1560 aaccaattag aatagcagca gatttatcta cagaaaccct gcaggctagg agagcttgga 1620 gccctatctt caaagtccta aaagataaac aatttcaacc aagaataacc taccctgcca 1680 gattaagctt catcagcgag ggagaattaa aatctttccc agatatccaa tccctaagaa 1740 cttacgctgc caccaaacca cctctacatg aaacacttaa gaaagtacta aacacagaag 1800 aaaaggggga aaaaagagca acgttcttca caagagtaca ggaaaaagaa taaaatacac 1860 atgaaccaac ccccaaaaca aaagaaagac aataaaccaa gtggaagaac aactctataa 1920 gaactctatg atagggatga actctcacat ttcaataatt agcctgaatg tgaatggact 1980 aaatgcacca ctgaaaagac atagaatggc aaaatggata aaatatcacc aggcaacaat 2040 atactgcctt caagagaccc atctcactag aaaggacatg cacagactca aagtaagagg 2100 atgggaaaca aatttccagg cgaatggaac acaaaagaaa ggaggagtcg cgatcctaat 2160 ttcagacaaa ataccattta agctatcaaa aattaaaaaa gatacagagg gccactacat 2220 aatgataaaa ggttcactcc atcaacaaga aatatctatc ctaaacatat atgcacctaa 2280 cataggtgca ccaactttta taaagcaact cctaggaaaa ctaaagaaag atattgactc 2340 taacaccatc ataactgggg actttaatac cccactcaca accctcgaca gatcatcggg 2400 acaaaaaatc agcaatgaga tccggaacct caatgtgact ctggaccaaa tggacttaat 2460 tgatacctac agaacactcc atccaaagac cagagaatac acattctact catcaccgca 2520 tgggacgtat tccaagatcg accacataat cggccataaa tcaagcataa gcaaatttaa 2580 aaggaccgaa attctaccat gcaccttctc ggaccacagt ggaataaaaa taaacattga 2640 caccaacaag gtccccccaa aacccacaaa gacatggwca ctaaacagca tgatgctaaa 2700 caactcctgg gtcaatgatg aaatcaaaac agagatcaaa agatacctgg aaacaaatga 2760 aaatgaagaa acatcttacc aaaatctctg ggatgcctta aaagctgtag taagagggga 2820 atttatatcc ctacaaacac acatgaagaa aatggaagga gcacaaatta atagcctaac 2880 aagccaccta aggaagctgg aaaagcaaga ccacaaaaac cctaatttca gcagaagaat 2940 ccagatcacc aaaataaaag cccaaatcca 
ggacatagaa gacaaaaaga caatacaaaa 3000 aaatcaatga aacaaaaagc tggttcttcg aaaggataaa caagatcgat ggtcccctag 3060 ctagactgac caagaaaaag aaagaaaaaa cccaaataag cacaatcaga aacacaaaag 3120 atgaagtcac atctgaccct gaagaaatac aaaagatcat cagagactac tatgtacact 3180 tgtatggaaa caaacttgaa aacctcaagg aaatggagga ctttctgtca tcacacaacc 3240 tgcctaggtt gaaacaagaa gaaattgaga ccctaaatag accaataaca atcaaggaaa 3300 ttgactatgt aataagaaaa ctacctacaa aaaaaaagcc ctggaccaga tggctttcca 3360 gcagaattct acaagacatt taaggaggaa ctgattccaa tcctactgaa gctatttcag 3420 gcgattgaga aagatggaac cctccccaaa tcattttatg aagctaacat cacattgata 3480 cccaagccag gtaaagatcc aacaaaaaaa gagaactaca ggccaatatc tttgatgaac 3540 atagacgcta aaattctcaa caagatccta gcaaaccgga ttcaacaaca catctcaaaa 3600 atcatccatt atgaccaagt aggcttcatc cctgggatgc aaggctggtt caacattcgt 3660 aaaacaataa atgtaattaa atacatcaac agatgtcaaa acaaaaacca catgattata 3720 tcattagatg cagaaaaagc ttttgataaa atccagcacc ccttcttgat aaaaaccctc 3780 gaacatctag gcatacaggg aacatacctc aaagtagtaa aagccatcta cgagaaaccc 3840 acagccagca tactcctaaa tggacaaaaa ttggaaccat ttcccctgaa aactggaaca 3900 agacaaggat gcccactctc acccctcctg ttcaatatag tattggaagt cctggctaga 3960 gcaatcagag aagagaaggc aatcaggggt atccaaatag gaaaagagga agtcaaatta 4020 tctctctttg cagatgacat gatcgtgtac cttgaaaacc caagagaatc tgtcaaaaac 4080 ctccttacac tgataaaggc cttcggcaaa gtctcaggat ataaaataaa tgtgcaaaag 4140 acaatcgcat ttctttacac caataataaa caaacagaaa cccaaataag aagcacaatt 4200 ccattcacaa tagccaccac aaaaaaaatg aaataccttg gcatcttcct aaccagagac 4260 gtgaaagacc tttacaatga aaactacaaa actctgctca aagaaatcaa agatgacaca 4320 aacaagtgga aaaatatccc atgctcatgg attggaagaa tcaacattgt gaagatgtcc 4380 atcttaccta aggcaatcta cagattcaat gcaataccta tcaaattacc agcaacattc 4440 ttctcagacc tagaaaaaac aacacaggaa ttcatatgga aacacaaacg accaagaata 4500 gccagaacaa tcctcagcaa aaaaaaaaaa caaagcaggt ggtatcacat taccagactt 4560 caaactttac tataaagcta caatcatcaa aacagcttgg tattggtata ggaacaggca 4620 tatagaccaa tggaatagaa ttgagattcc agaggcaaga 
cctcaatttc tcaaccaact 4680 catcttcgac aaagcctcca ccacctacca ctggggagag gagaacctat tcagtaaatg 4740 gtgctgggaa aactggctga ccacatgcag aagattgaaa caggacccct atctatcccc 4800 atacacaaaa attaactcca aatggatcag agacctaaat gtaaaacctc aaaccataag 4860 aaccttagaa aatgaaggac ataccctcat ggaaattgga actggcatcc aattcctgaa 4920 caaaactcga aacccacagg ccataaggga taagatagac aagtgggacc tcattaaact 4980 gacaagcttc tgcaaagcca aagaaaccat caagagagca gggagacagc ctacagactg 5040 ggaaaaagta tttgccaact ccaggtctga caaaggctta acatcctgga tctacaagga 5100 actcaaacgt gctgaaaaga aaaaaacaaa caaccccatt ataaaatggg caaaagatat 5160 gaacagacac ttcacaaagg aagacatccg agcagccaac agacacatga agaaatgctc 5220 aacctcacta atcatcaggg agatgcaaat caaaaccaca ctgagatacc acctaactcc 5280 agtcagaatg gcaattatca acaactcaaa aaataacagc tgctggagag ggtgtggcga 5340 aaagggaaca ctcctacact gttggtggga gtgtaaacta gtgcaacctc tgtggaaagc 5400 agtgtggcga ttcctaaaag ctctaaacat caacctccca tatgaccctg caatccccct 5460 actgggaata taccctgaag aactcaaatc actctataaa aaagatacct gcacacgaat 5520 gtttatcgca gcattgttca caatagcaag aacctggaac caaccatgct gtccatcaaa 5580 agaggactgg attaaaaaaa tgtggtacat atacacgatg gaatactatg cagccataaa 5640 aaagaacaaa atcatgaatt tcgcagcaac ctggatggag ctagagtcta taatactgag 5700 tgacctctca cagaaacaaa gatctgagta tcacatattc tcactcatat agtggacctt 5760 gatcatccaa tgcactacca taagaaaatg actgacagtg ttgggaaact atgggggggg 5820 atgggactaa tggtagtaaa catctgtctg gggacgggga gacacctgtt atcaacaagg 5880 gggcctgaat gaagcatatt tgtataccta acccttaact gtaccccaca atatcaaaat 5940 aaaaaatatt gattaaaaaa aaaaaaaa 5968 // ID LTR1A2 repbase; DNA; PRI; 837 BP. XX AC . XX DT 18-FEB-2009 (Rel. 14.06, Created) DT 18-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; LTR1A2. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. 
XX RN [1] RP 1-837 RA Smit A.F.; RT "LTR1A2 - ERV1 Endogenous Retrovirus from primates."; RL Repbase Reports 9(6), 1171-1171 (2009). XX DR [1] (Consensus) XX CC 9.5% subst outside CpGs. Ca. 90 copies. XX SQ Sequence 837 BP; 196 A; 264 C; 247 G; 130 T; 0 other; tgatacggaa gtgctgggaa gggaagggcg tggtcccttt aaatgatacg gaagggggga 60 agggaagtgc tgggtagagg agggcgtggt ccctggctag ggctccaccc ccggcctgtg 120 cccacggacc taggtgagga caggcacttc tgccttcctg cccaaatgtt gcatttccca 180 agaccaccct ggcccgccac gcccccatcc tgtgcctata aaaaccccga gaccctagca 240 ggcagacaca caagcggctg gacgtcgaga ggagcacatc ggcggaagaa cacacaagcg 300 gctggacgtc gagaggagca cgccgacagg caccggcacg ccggcaggcc accgaccggc 360 ggaacgacgc ggagtttggc cggggcagtc ggaggagagc cgggccgccg agcggcccga 420 ctccagggga aaaccatctc ccttctggct cccccatctg ctgagagcta cttccactca 480 ataaaacctt gcactcattc tccaagccca cgtgtgatcc gattcttccg gtacaccaag 540 gcaagaaccc gggatacaga aagccctctg tccttgcgac aaggcagagg gtctaattga 600 gctggttaac acaagccgcc tatagacggc taaactaaaa gagcaccctg taacacacgc 660 ccactggggc ttcaggagct gtaaacattc acccctagac actgccgtgg ggtcggagcc 720 ccacagcctg cccgtctgta tgctccccta gaggtttgag cagcggggca ctgaagaagc 780 gagccacacc cccatcgcac gccctgcgag ggggacaagg gaacttttcc cgtttca 837 // ID GarnAlu2 repbase; DNA; PRI; 267 BP. XX AC . XX DT 06-APR-2010 (Rel. 15.05, Created) DT 06-APR-2010 (Rel. 15.05, Last updated, Version 3) XX DE SINE element - consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; GarnAlu2. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-267 RA Jurka J.; RT "SINE elements from the bushbaby genome."; RL Repbase Reports 10(5), 779-779 (2010). XX DR [1] (Consensus) XX CC The youngest sequences are >89% identical to consensus. 
CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project. XX SQ Sequence 267 BP; 70 A; 71 C; 80 G; 46 T; 0 other; ggctcggtgc ctgtagctca gtggctaggg cgccagccac atacaccgag gctggtgggt 60 tcgaacccag cccgggcctg ccaaacaaca atgacaacta caaccaaaaa atagccgggc 120 gttgtggcgg gcgcctgtag tcccagctac ttgggaggct gaggcaagag aatcgcttaa 180 gcccaagagt ttgaggttgc tgtgagctgt gacgccacgg cactctaccg agggcgacat 240 agtgagactc tgtctcaaaa aaaaaaa 267 // ID LTR14_Cja repbase; DNA; PRI; 662 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR14_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-662 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2925-2925 (2009). XX DR [1] (Consensus) XX CC >91% identical to consensus. 6bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. 
XX SQ Sequence 662 BP; 170 A; 178 C; 149 G; 165 T; 0 other; tgttgggaaa agcagcaaag gagaaaggac ttacagagtg cctacataaa ttggttatga 60 gtaacatata aacaaggaac aacaaactgc ggaaggccgc aggaggcctc tgaggaggaa 120 ggcctctcca tgcctcatgt tccggtgcag ggtgacctag gctctgttac cataaacatt 180 gtgctcaagg acgtaaaacc ccttcgtagc actatgacgt agccagactt gtaggctcct 240 tgtaaggccc ggttccacca gctgcacata gataataagc taaagataac cacatgttct 300 tgttttgtga aactcccaaa ccgccactgt tatcaatcct taggatgaca caccagttcc 360 ggctcactga ttcactccac catcactcca cccctaccct atagatcaaa gtgattgtaa 420 tcaataaata gtgtggatga tcagagctcg gggccttcac tgcctcctcc agagtaataa 480 gtgatggccc cctggtccca ctgtctctct taatctgtct ttttctcatt cctttgtcgc 540 caccgaactc ggggtaccca cgggtgatgt agggctggct ccctacattc tggtgcccaa 600 tgtggggctc atggatccct acattctggc acccaatgtg tggggctcat ggattcccta 660 ca 662 // ID LTR21_OG repbase; DNA; PRI; 818 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR21_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-818 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1592-1592 (2011). XX DR [1] (Consensus) XX CC >90% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 818 BP; 200 A; 201 C; 156 G; 261 T; 0 other; tgttagggtg aaagaaaagc aggcctaact ccattttgtt gtaactctta ttcctgagaa 60 tcagtgggga ggcaaagcag gcctgactcc attctgttgt tttgttacat tttattgctt 120 taaaaaaaac cggttccgct tgcctccccg cccattgcat ggtattttaa gcgtacctgg 180 ttagaactgt tagaattatt agtattgtgt agaaagtcac taactaccct cgtgacaaca 240 accctcgtga ttatgttaag gtggtcttca taccatccgt atatgatatt gaaatgttaa 300 aagaacaagc ttgactccat tttactattt actccctttt accttcaaaa ctagttcctc 360 ttgtttgccc tcccattgtg taactgtaaa tgcatttgct tgtgattgtg gccctagtga 420 ttatagcttt ttatgatcat gccctttacg acccttacct gtaagttccc ctccctgcat 480 ttccccccta ggacattgtg gtttgtacct attttgtaag gaagaaaaat cactaatcgc 540 tctcgtgatc atagcattgt gtaaaatgac taaccgccct accctcgtga ttccccctac 600 ccctttaact gctgttttcg cttctgtaat aacgcttgct tgccccccgg aaaagttttg 660 ccctataaaa accagagcca cagacaactc ggggctgctc tcagaaaccc catgttggga 720 cactgaggca gtcgccggcc ggctcttcaa taaaaggact cctctttaaa ttcgacttgt 780 ctggcggtgt ggtctttgag agactcctgg gcataaca 818 // ID LTR2A_OG repbase; DNA; PRI; 638 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR2A_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-638 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1676-1676 (2008). XX DR [1] (Consensus) XX CC 6bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 638 BP; 133 A; 173 C; 112 G; 220 T; 0 other; tgtggagagc gggctgtgag actggcaggg ccctcacagc ctggggaaca ttcagggaat 60 gaacccccaa ctgcttctcc tccgttccca gaaacagcca tattcctttc ttcatgctcc 120 ctctgccaga agtcacgtag ctctttcttt gtctagttaa ctatgcagag atccttgtga 180 tctttccctt ccttctcctt gtcatatgct aacgaagttg agtagtaacc actaagtcac 240 atagttcttc ctttgttttg ttaactatgc tgagatcctc gtgatctttc ccttcttact 300 tcttgtcata tgttaacgaa gttgagtaga aaccattaac aagatgattt gtctttatta 360 aagtcattta cattttccta tcgttactgt aagaccaaaa ctatgtcttt tgtttatgct 420 atttccccat attgtctatg ctacttcccc ctggacaacc ctgtacagat actataaaat 480 aaagctgatt ttgtgctttg gtgctggctg cagccatcag ctcagtcagt cctcccgatc 540 ccatcctttg tttccgtgtc ttctcttgtg ctgtattttc ctcattcccc accaattcca 600 ctctggttcg ctccccttcg ccgcgctggt tcgcgaca 638 // ID LTR2_Mim repbase; DNA; PRI; 680 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR2_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-680 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2951-2951 (2009). XX DR [1] (Consensus) XX CC ~96% identical to consensus. 6bp tsd. CC Similarity to LTR2_Vpa from alpaca. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 680 BP; 154 A; 160 C; 184 G; 182 T; 0 other; tgttgggggc tgattctggt gttatgcctt atactaatca ttcactatag tctgtaatgt 60 acgtgtgctg cctgacgcag cagtagtcag cattttcttc agacacggtt gctgccttgt 120 gacttgataa gaggcctgag ttgttgcaca taagcaaatg ctcccaactc ccacacggag 180 ggcccccgtg gtgggagtct ggaggccatg agaacaagaa caagaagaac tctcccaaga 240 actggccccc gattgatggt tggtagtcaa tgacgggtaa gactccccat ggaggggggc 300 gacctaaaac aggcacaacc gtgggggcca gggaggagcc gccgcctggg attgagacca 360 agaagcactt caaatggctg cattgtttta tgttgcctat ctcggccttg gcttttgagc 420 tctgtccggc accttaactt tagtaaccgt tttgaggtct gagaccatgg gaaattgttc 480 ccttttgaaa caactccgga attggctgat atcgctgcta tgctcagatg ctcacgtaca 540 aggaactaac cttgggtctg gggggtataa aaataaagcg accggtgcat tggtcactcg 600 agtcattgtc atgtccatgt gtgtctgtgt cgttatttta tgttctgtgt catcccccgt 660 cctgtaatcg ggccacgtca 680 // ID LTR13_Mim repbase; DNA; PRI; 589 BP. XX AC . XX DT 11-NOV-2009 (Rel. 14.11, Created) DT 11-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR13_Mim. XX NM LTR13_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-589 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2969-2969 (2009). XX DR [1] (Consensus) XX CC >89% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. 
XX SQ Sequence 589 BP; 139 A; 168 C; 137 G; 145 T; 0 other; tgttaggtag taggagggat tcggtctcct agggacgaaa gtgggtgctt tgcctggtgc 60 aattgcccga gagggcggcc cccatcttgg tcctgcccca ccctaactcc agactgccat 120 acctggggag gagggaatac cctacgaatc gccaatcaaa cttagcgcgc cagctgcaaa 180 agaatgcaga ggactctcta gcccacgggc caagcgcata ggtaccatag agtcggaaat 240 gacccacatg cgtagtaatg caactcccaa ctgtccaatc aaagtggttc catggtgacg 300 tctgagccac gagggaggtc ccaatccggg cactataaaa caagacgcag accataacca 360 ggcccttttg tgcccttcct tttgctctcc ctttgctgcg acaatgggtc cggtcgtcat 420 ctgtggcata tgtatcatgc tctgtaaacc tatatcttgc tcttgcccta taatcctatc 480 tttctctcct caataaacct cattttcatg cttgccttac tttggtgtgt ctggtcattc 540 ttcggccatg agcgcaccaa gaaccgacat ttcagactga aacctgaca 589 // ID TINE1 repbase; DNA; PRI; 86 BP. XX AC . XX DT 31-DEC-2009 (Rel. 15.03, Created) DT 22-NOV-2010 (Rel. 15.07, Last updated, Version 7) XX DE SINEs from the LTR portion - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; TINE1. XX NM TINE1. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-86 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(3), 517-517 (2010). XX DR [1] (Consensus) XX CC The youngest copies are >96% identical to consensus. This CC sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 86 BP; 29 A; 25 C; 12 G; 20 T; 0 other; ggcaaccccc tcgggtcccc tcccactgtg ggagcttttc tgtcctctca ataaatcctg 60 ctttctaaaa aaaaaaaaaa aaaaaa 86 // ID LTR54_Mim repbase; DNA; PRI; 444 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. 
XX KW ERV1; Endogenous Retrovirus; Transposable Element; nonautonomous; KW LTR54_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-444 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1727-1727 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 444 BP; 140 A; 104 C; 74 G; 126 T; 0 other; tgttaaacaa aaattatggg aggccattgt tttggactga gctcctgcac taggccccaa 60 cagaccagac caaaccaaaa tggagtcact catgctaaat gccacataat caaactgaaa 120 ctttaaggaa gcagatagat cccaaaacag accagttttt cctgaaaaca ggagattcca 180 gtctacctga gtcagcgtaa taaggaagtc ccctctgctt taacccttac aaaaaagtaa 240 cctgaagtaa cctgatgtta accaatcagc ttttttccta ttgttctgtt tccttgttcc 300 caccttacaa aacccactgt tctgccattg cccagtggga gctctcattc tattttgtag 360 aatggaggct gccccgattc atgaatcaca aataaaagcc aattagatct ataactaaat 420 ttgttgtaat tttgtctttt gaca 444 // ID LTR2B_Mim repbase; DNA; PRI; 663 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR2B_Mim. XX NM LTR2B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-663 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2949-2949 (2009). XX DR [1] (Consensus) XX CC ~93% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 663 BP; 156 A; 153 C; 179 G; 175 T; 0 other; tgttggggac tgacattata ctttcctggt ttaagaatta aatttagtga gtaatgtacg 60 tgctctgccc agagcgtttg agagtaatgt gttccggaca gggtccttgt gacaaacaag 120 gttgctgcct agaggcataa caaaggacct gagtttctgc aagtaagcaa gagctgccca 180 gccccacggc agtgggggtc tggaggccac acgataaaca cattgttcct aggattgctg 240 cttagcccca taagatggct ggttagtcaa tgacgggtaa gattcctcag ggaggaacaa 300 cctaagacag acacagccgc cgggggccag cctagaggaa ctggggacgg aaaatgcccc 360 ccgtggctgc cttgcccaac cttgctaatc tcggtctgtg atctatgcct ggcgcctaga 420 gcaaccacct ggaaaccttg agtcagggga catctgtgtc cttaaggcta cgtgttccgg 480 aatttatggc cattgctgta atgcctgggt ggtcacgtat aaggaactaa ctttgatttt 540 ttagagtata aaaataaact gaccagtgca ttgggcactc gagtcttgta ctgtctttgt 600 gtgtgtctgt gtctttcttt tgtgttctgt gttcatcccc cgtcctgcaa acgggccacg 660 gca 663 // ID LTR12_OG repbase; DNA; PRI; 529 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.11, Created) DT 18-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR12_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-529 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2856-2856 (2009). XX DR [1] (Consensus) XX CC ~90% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 529 BP; 126 A; 165 C; 122 G; 115 T; 1 other; tgaagaggtt aatgtaactc catttgggaa tgaagaggtt aatgtaactc catcttggaa 60 ctaactcccc actcccctcc tgggcaaggt cgcccaggtg gaatcagtct atcagaagga 120 gactggctcc ctcccccgca cttctgggca gccagtctga aggggaccgg ctcccccccc 180 gcacttctgg gcagccagtc tgaaggggac cggcttcccc cctccgcact tctgggcaag 240 gtcaatcagc tcagagggag ccagtctatc agaaacagac aagcagcccc cctctagaaa 300 gtccctaacc gtaaacagct cagccaatag caaccccncc cgacaaaaca cccgatgaaa 360 ttcctctata cgctttaaaa acctccgctc gcttcacttc ggggtcgccc ctccccggct 420 acgctgtagg gtggagtgac ccaggcatat gcttgcattt aaataaaagc acttctttgt 480 ttttacatgt gtgtatggtg gtctctcagc ggcgatttag gtcccaaca 529 // ID ERV3-2_CJa-LTR repbase; DNA; PRI; 463 BP. XX AC . XX DT 12-JAN-2011 (Rel. 16.02, Created) DT 12-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE Long terminal repeat of an ERV3-type endogenous retrovirus - DE consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; ERV3-2_CJa-I; KW ERV3-2_CJa-LTR. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-463 RA Jurka J.; RT "Endogenous retroviruses from the common marmoset."; RL Repbase Reports 11(2), 693-693 (2011). XX DR [1] (Consensus) XX CC ~90% identical to consensus. 5bp tsd. LTR is related to LTR5 CC subfamilies from the same species. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. CC We thank the Marmoset Genome Sequencing Consortium for making CC their data publicly available. 
XX SQ Sequence 463 BP; 111 A; 178 C; 85 G; 88 T; 1 other; tgtaaggccg gttgcctcgc cgggagctcg gacacgccca tccctctcca cgcctgaaaa 60 ctacgtcacc agaatctgca ctcancaccc gcccacctct gcactccgca ccagaatctg 120 cactcggcac taaactctgc actcggcacc tagcccgccg ttggaaaacc ccgccaaaat 180 ttaaaagaag ccccgccaaa acctgcccaa tagcacagcc cgctcaccgg aaatcccgcc 240 tacctaccaa tagcagctcg ccccgtccac gtcctgtttc tccacagcca atcaaacgcc 300 ttcactccct ataaaacccc acgcccgaga ggagtcgggc gcgacttctc tggccccttt 360 ccccgggacc acggaacctc gcccgggagc taaataaatt ggcatttaat tttcttgtgc 420 tggcctcagt ttcctcattt taaactcggc aataaacctc aca 463 // ID LTR41_Mim repbase; DNA; PRI; 451 BP. XX AC . XX DT 31-OCT-2009 (Rel. 14.11, Created) DT 31-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR41_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-451 RA Jurka J.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2982-2982 (2009). XX DR [1] (Consensus) XX CC Top sequences are >89% identical to consensus. 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 451 BP; 115 A; 112 C; 108 G; 116 T; 0 other; tgagacaggg aactggcagg cactgctttg ggcattgctg cagcattacc tgagtcagga 60 agtctcaaaa caaagtttca aattaccctg atcaaaacag gatggaggca gggcaagttc 120 caaccggtta gaaccaagat ggtggagaaa tagagggtgt caccccgatt ccaggaaaac 180 accgcccctt ccatgaatat tcctcccctt gtttagccta taattaagcc gttgcttaaa 240 tagaaaggca acaggaacca gggggccggt tctcttctgc agggagaact tgcctctgct 300 ctgtctgtgg agtagacttt ctatagtttc tgaataaatc tcgctttcac tttatactgt 360 cggctcgctc ctgaattctt tcttgcgcga agccaaggac ccacttggac ctcaagtggc 420 tcccagcatt gggagtcact ttcctgtgtc a 451 // ID LTR5_Hs repbase; DNA; PRI; 968 BP. XX AC . XX DT 16-MAR-2006 (Rel. 11.03, Created) DT 10-APR-2007 (Rel. 11.03, Last updated, Version 1) XX DE Long terminal repeat of ERV2 Endogenous Retrovirus from primates. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; LTR5; LTR5_Hs_LTR; LTR5_Hs. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-968 RA Smit A.F.; RT "LTR5_Hs - a subfamily of endogenous retrovirus from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC HERVK LTR (youngest subfamily < 1% div). 
XX SQ Sequence 968 BP; 250 A; 238 C; 227 G; 253 T; 0 other; tgtggggaaa agcaagagag atcagattgt tactgtgtct gtgtagaaag aagtagacat 60 aggagactcc attttgttat gtactaagaa aaattcttct gccttgagat tctgttaatc 120 tatgacctta cccccaaccc cgtgctctct gaaacgtgtg ctgtgtcaac tcagggttaa 180 atggattaag ggcggtgcag gatgtgcttt gttaaacaga tgcttgaagg cagcatgctc 240 cttaagagtc atcaccactc cctaatctca agtacccagg gacacaaaaa ctgcggaagg 300 ccgcagggac ctctgcctag gaaagccagg tattgtccaa ggtttctccc catgtgatag 360 tctgaaatat ggcctcgtgg gaagggaaag acctgaccgt cccccagccc gacacccgta 420 aagggtctgt gctgaggagg attagtaaaa gaggaaggaa tgcctcttgc agttgagaca 480 agaggaaggc atctgtctcc tgcccgtccc tgggcaatgg aatgtctcgg tataaaaccc 540 gattgtatgc tccatctact gagataggga aaaaccgcct tagggctgga ggtgggacct 600 gcgggcagca atactgcttt gtaaagcatt gagatgttta tgtgtatgca tatctaaaag 660 cacagcactt aatcctttac attgtctatg atgcaaagac ctttgttcac gtgtttgtct 720 gctgaccctc tccccacaat tgtcttgtga ccctgacaca tccccctctt cgagaaacac 780 ccacagatga tcaataaata ctaagggaac tcagaggctg gcgggatcct ccatatgctg 840 aacgctggtt ccccgggtcc ccttatttct ttctctatac tttgtctctg tgtctttttc 900 ttttccaaat ctctcgtccc accttacgag aaacacccac aggtgtgtag gggcaaccca 960 cccctaca 968 // ID ERV1-1_TSy-I repbase; DNA; PRI; 8121 BP. XX AC . XX DT 28-JAN-2010 (Rel. 15.09, Created) DT 28-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Internal portion of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-1_TSy-LTR; ERV1-1_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-8121 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1192-1192 (2010). XX DR [1] (Consensus) XX CC ~92% identical to consensus. 
CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX FH Key Location/Qualifiers FT CDS join(1299..2468,2472..4013,4145..4876,5151..6122, FT 6621..7637) FT /product="ERV1-1_TSy-I_1p" FT /translation="MSDFTSGECRGAFQMPLRETRGPIYYDQEGQIQGGQR FT TFVYQPFTTTDLLNWKNHTPSFTEKPQAMIDLIQSIIQTQRPTWADCRQLL FT LTLFNTEERRRITQGAIKWLEEHAPAGALNAQAYAQNQFPEEDPQWDPNSN FT QGLQMLERYQEALLGGMREGGEKAINMSKVSEVFQGADESPSQFYERLCEA FT FRLYTPFDPEATENQRMVNATFVGQSQGDIRRKLQKLEGFAGMNVSQLLEV FT ATKVFINRDKEAQQEADRRQKKKMAMLAAAIAGYQITGPPGGNRGRGRGRR FT QPGRQFPRARPGPPQPAQGPLNSIYQPTQGTSQPPMKLGRNQCAYCYQEGH FT WKGECPQRIADEEAQQKNGPQGSTYQQDPPAPNGFMGLASMEGLQQEARPG FT SLLSLGPQEPMVSMTVGDQRMNFMVDTGAEHSVITQPVGPFSQKRANVMGV FT TGRQARHPFTMPRRCTVGGHEVIQEFLYLPDCPVALMGRDLLGKLRAQITF FT DSPNHVTLRLRGPEAKIMTLTVPREEEWRLYSLTEASQKSPELPFRVPGVW FT AEDNPPGLARNIPPVIVELKPGGEPVHQRQYFIPRKAQIGIHKHLERLLHY FT GILRVCQSAWNMPLLPVQKPGTDDYRPVQDLRAVNQVAVTLHPVVPNPYTL FT LGLIPAEAAFFTCLDLKDAFFCIRLAPRSQPIFAFQWEDPTNGTKGQLTWT FT RLPQGFKNSPTIFGRALASDLKAFPAHQCGCTLLQYVDDLLLAAPTRENCM FT QGTQQLLTLLWKAGYKVSKKKAQICQEKVRYLGFHLSQGQRQLGPERKQAV FT CSIPVPTTRRQIREFWGAAGFCRIWIPNFSIMAKPLYEATKGGEREPLLWE FT KAQQQAFEQIKQALTNAPGLGLPDVTKPFFLYVHERVGTRRGLDSNARVMA FT RQGASCVINLMEYKGQYWLTNARMVKYQGMLCENPRIRLEVVKTLNPATLL FT PIEPGQPDHDCIEIINEVFSSRPDLTDQPLRDPEVEYFTDGSSFIQEGERF FT AGYSVVTLNSTVEAKPLPIGTSAQKAELIALTRALQLAAGLRANIYTDSKY FT AFTTLHVHGALYKERGLINSGGKDIKYGNEILELLDAVWAPKEVAVIHCRG FT HQKGNAITARGNRKADQEAKLATKETVETTAITVALLPAHPTAGTQSLPRS FT AACGGAPFEDLIVDFTELPRCQGYKYLLVFVCTFSGWVEAFPTRTEKATEV FT ARLLLKEIIPRFGLPITIGSDNGPAFVAEVVQLMTKGLQIKWKLHTAYRPQ FT SSGKVERMNRTLKSQLSKLCQETHLGWVQLLPIVLLRVRSSPTKQTGFSPF FT EILYGRPPPLVKGIQGDLKELGNLTLKQQMQALGSALNAIHYQVRERLPIS FT LTTDAHSFKPGDMVWVKEWNVQPLKPLWRGPFTVLLSTPTAVKVAEIVPWI FT HYSRVKPASQDWECTTNPSAPLKLTLQKVTEADGKPETDSPSPALATPGLA FT SLRTAEVLVAPSEAQIGFVGISAPAPKAEKLLKEVVQFVTPPATQQPKLTP FT 
FPQPWEVGQNLFINLAENIARTLGVTNCWVCGGALMTEEWPWKGTSLDAYQ FT LLQWNHSVTVRTDNLPQKWILSSKVIGEDCLSRAGSAYTQWVGETPCKRIL FT YWNSTHRTWWPTRPVWYWAPAYATKSSQCTIVTQTISNCTVLSSEQANPFQ FT GIPIISQSWTDLSSVNPDLWKAPKGLFWICGKVAYAQLPALWKGTCTIGII FT QPGFFLLPNPRGDELGIPLYESLKTRDARSLDQAPNIGGTQVWKDDEWPPQ FT RIINTYGPATWAQDGSWGYCTPIYIAKSHYQVTSCLRNYN" XX SQ Sequence 8121 BP; 2257 A; 2058 C; 1994 G; 1810 T; 2 other; tttcttggtg cattggctgg gaaagggaga gttcggagat ggtgtgttcc acctcctttg 60 tcatcggggt atgacctccg gggtgacggg agaggcaatc tcgcctggcg cgcaatggac 120 cttttgatcc gtaggaggga ggcccctccc ggcgcactga ggctcggatc tcgaaagcac 180 aacagaaccc aaagaggcac gggaactggc gtgaggagca ccgcccgacg gaaagggtag 240 gaacaagggc ctttgcatag acctgccagt agatggcaga aggggagctt gatcacctcc 300 cggagtccgc tagtagtctc cagtctgaag gaaaggactg aaaggcagtg gaaggaaaca 360 ctgtctcgag acgaacccgt tgactcctag gattagggtg gagggttgga gaatgtgctg 420 tgtgattgcg tgtgaatgag acggacagcg gaagggttcc gacctgacca aagggccgaa 480 gcgagtgttg agtcccgatc taagccggtg acgacctcat acagctgaag gcagccccaa 540 ccccgtcata gggttggtcc gggttggggg tttataccaa cctgccaatt gctaagaggt 600 gtctgaaata tctccgcgag gagtacggct ggaggaggac gaagcagctg tttcccatac 660 cccccgttcc ctcatttgtg tgtgcctgcc atctgttagg gggaggaaat gggtggaaac 720 caaagtaaga atacccccct tgattgtatg cttaagaact ttgacaaatg tttctgtgga 780 gattgttatg gtgtcaaact gtctcggaga aagctccgta cgttctgtga aatagattgg 840 ccctctttcg gggttggatg gccgccagag ggatccttag atagaaaaac cattaggaga 900 gtatatagaa ttatagtagg agagccaggt cacccagacc aattccctta catcgactgt 960 tggcaggatg tggtccttgc acggccctct tggctaaaag cttgctggga agacaattgc 1020 aaaattatgg tagccctcaa aacagcccag tccaaatgcc catcagagtc tgatgagttg 1080 gctgacaagc cggtgttggc tgatgaacca gaaatcccac caccttattc tccagtctac 1140 ccaccattgc cacctgctcc aatggcacca gccactgtta gaggctcaca attggaatgc 1200 acccctaagc ctagtacgtc tcttaaacaa tcagaggcca cacccgaacc accgttctga 1260 ctccccacac cccacccaga aagcagctgc cagaatgcat gtctgacttc acatcagggg 1320 aatgtcgtgg tgccttccag atgccactga gggagactag gggacctatt tattatgatc 1380 
aagaaggcca gatccaaggt gggcaacgca catttgtgta ccagcctttc accaccacag 1440 acctcctgaa ttggaaaaat catacgccgt ccttcacgga aaagcctcag gctatgatcg 1500 acttaataca gtccattatc cagacccaga ggccaacctg ggcggactgc cgccaactcc 1560 tcctgacgct attcaatact gaggagcgtc ggagaattac acagggggct ataaaatggc 1620 tagaggaaca cgccccggct ggagcactca acgcccaggc atatgcccaa aatcagttcc 1680 cagaggagga cccccaatgg gaccccaaca gtaaccaggg gctgcaaatg cttgaacggt 1740 atcaggaggc cctcctagga ggtatgagag aaggagggga aaaggccata aatatgagca 1800 aggtctctga ggtgttccag ggagctgacg agagcccaag ccaattctat gagagactct 1860 gtgaagcctt tcggctatac accccctttg atccagaggc aaccgaaaat cagcgtatgg 1920 taaacgcaac ttttgtcgga cagtcccagg gggatatccg acgtaagcta caaaaactag 1980 agggctttgc tggaatgaat gttagccagc tgctagaagt tgccactaaa gtgttcatta 2040 accgagataa ggaggcccaa caagaggcag accgaaggca gaaaaagaaa atggccatgc 2100 ttgctgctgc cattgcgggg taccagataa caggcccacc gggaggaaac agaggcagag 2160 gtcggggaag aagacagcca ggtcggcagt ttcccagagc gaggcccgga ccgcctcagc 2220 ctgcccaggg acccctgaac tccatctatc agcctacaca agggacatca caacccccta 2280 tgaagttagg gagaaaccag tgtgcctatt gttaccaaga aggtcactgg aaaggtgaat 2340 gcccccaacg aatagctgat gaggaagccc aacagaaaaa tggcccacaa ggctccactt 2400 atcaacagga cccaccagct cctaacggct ttatgggcct ggcctctatg gaaggactgc 2460 aacaagaata ggccagaccg ggctccctac tatctttagg tccccaggag cctatggtca 2520 gcatgacagt aggggaccaa cgtatgaact tcatggtcga cacgggagct gaacactccg 2580 tgatcaccca gcctgtaggt cccttctccc aaaaacgggc taatgttatg ggagtcactg 2640 ggcgtcaagc cagacatccc tttacgatgc cgcgacgctg cacagtcggc ggccatgagg 2700 tcatacaaga gttcctgtat ctcccagatt gccccgtagc cttaatggga cgagacctcc 2760 tgggaaagct tcgagctcag ataacctttg actctcctaa tcacgtaact ctgagattaa 2820 gaggcccaga ggctaaaatc atgaccctca ctgtcccccg ggaagaggaa tggcggctct 2880 acagcctaac ggaggcctca caaaaatctc cagagctgcc atttcgagta ccgggagtgt 2940 gggcagagga caatcctcca ggactggcaa gaaatatacc ccccgtgatt gtggaactaa 3000 agccaggggg cgagccggtc catcagaggc agtacttcat cccacgcaaa gcacaaatag 3060 gaatccacaa 
gcacctggag aggctcttgc attatggaat cctccgggtt tgtcagtctg 3120 cttggaatat gcctctgttg ccggtgcaga agccaggcac ggacgactac cggccggttc 3180 aagacttgcg agccgtaaac caggtcgcgg tcactctaca ccccgtcgtc ccaaacccgt 3240 atacgctgct gggtctcatc cctgctgagg ctgctttctt cacctgcttg gaccttaagg 3300 acgctttctt ctgcatccgc ctggccccaa gaagccagcc gatcttcgct ttccaatggg 3360 aggacccgac aaatggcacc aagggacagt taacttggac ccggctgccc caaggattca 3420 agaattctcc aaccatcttc ggtagggcac tagcctcaga tttaaaagcc tttccagcac 3480 accagtgtgg ctgtaccttg ctccagtatg tagatgactt actgttggca gcaccaaccc 3540 gggaaaactg tatgcaaggg acacaacaac tgctcaccct cctgtggaag gcgggttaca 3600 aggtctcaaa gaaaaaagcc caaatatgtc aggaaaaagt taggtatcta ggttttcatt 3660 tgtcccaagg ccagcgccaa cttggccctg aaaggaaaca ggcagtgtgt tcaataccag 3720 tcccaactac ccggcgtcaa atccgggagt tttggggggc tgcggggttc tgtagaattt 3780 ggatacctaa cttctccatc atggcaaaac ccctctatga agccacaaag gggggtgaac 3840 gggagcctct gctatgggaa aaggctcagc aacaggcctt tgaacaaatt aaacaggccc 3900 tcaccaatgc acctggtctg ggactcccag atgtgactaa gcctttcttt ttatacgtgc 3960 acgaacgtgt tggcacgcgt aggggtcttg actcaaatgc tagggtcatg gcatagaccc 4020 gtggcatatt tatccaaaca gctagattct gtggctcagg gctggccgcc ttgtttacga 4080 gccctggcag ccacggcact cctgataaca gaagctgaca aactaactat gggacaacat 4140 ctaacgtcag ggtgcctcat gcgtgataaa tttaatggaa tacaaggggc aatattggct 4200 aaccaacgct cgaatggtca aatatcaggg aatgctctgt gaaaacccac gcattcgcct 4260 agaggtggta aagaccctaa atccggccac cttactacct atagagccag gacagccaga 4320 ccatgattgc attgaaataa taaatgaagt gttctctagc cgtccagatt taacagacca 4380 gcccctcagg gaccctgaag tcgaatactt cacggacgga agcagtttca tacaggaggg 4440 ggagcgcttc gcgggctatt cggtggtaac tttaaactcc acggtcgagg cgaaacccct 4500 gccgatagga acttcagctc agaaagcaga actcattgcg ctcactcgag ccctccagct 4560 agccgcaggg ctacgcgcta atatctatac agactccaag tatgctttta ctaccttgca 4620 tgtgcatggg gccttataca aggagagagg tttaatcaat tcagggggaa aagacataaa 4680 atatggcaat gaaattctag agctactaga cgctgtgtgg gcaccaaagg aggtagcagt 4740 catacattgc agagggcatc 
aaaagggaaa tgcgataact gccagaggaa atcgcaaagc 4800 agatcaagaa gcaaagcttg caacaaaaga aactgtcgaa acgactgcaa taaccgtggc 4860 actgcttcca gcccactgac aaactggagc ccttgctaca cgccaaatga gaaaacatgg 4920 tttggaaccg aaacgggaca ctacttacaa gccggatggt ggaaatttca agatggccga 4980 atagcaattc cagaatcatt ggctccagca tttgtcaggc agtttcacca agggacacac 5040 agcggacgga ccgctcttga aaataccctc agtaagtact tctatgtccc ccgactctcc 5100 agtatagcgc gggtggtctg tgaacaatgt gtaacttgtg ctcaaaataa cccacggcag 5160 ggacccaaag tctcccccgg agtgcagcat gtgggggggc accctttgag gatcttatcg 5220 tggactttac agaactccca cgctgccaag gctacaaata tctccttgtt tttgtgtgta 5280 ctttctcagg atgggttgag gccttcccca caaggacaga aaaagccact gaggtggccc 5340 gactcctgtt aaaggaaatc attccccggt ttggactccc gatcactatc ggatcggaca 5400 atggaccggc atttgtagcc gaggtagtac agttaatgac taaaggactg caaatcaaat 5460 ggaagctcca tacagcctat aggccacaaa gttcagggaa ggtggagcga atgaaccgga 5520 ccttaaagtc acaactaagt aagttatgcc aagaaactca tttaggctgg gtccagcttc 5580 tgccaatagt ccttctaagg gtaaggtcta gccccaccaa acaaacaggt ttctcgcctt 5640 ttgaaatcct ttatgggcgg ccaccccctc ttgttaaggg catacaggga gacttaaagg 5700 agttaggaaa tcttactcta aaacaacaaa tgcaagcctt aggttcagcc ttaaatgcca 5760 tccattatca agtcagagaa cggttaccaa ttagtttaac taccgatgcc cattctttta 5820 agccaggtga tatggtatgg gtaaaagaat ggaatgtaca gcctctaaaa cccctctgga 5880 ggggcccatt tactgtcctt ttgtctaccc caactgcagt caaagtagcc gaaatagttc 5940 cctggatcca ttacagcaga gttaagcctg cctcccagga ctgggagtgc accacaaatc 6000 cgtcggctcc ccttaagctg accctccaga aagtcaccga ggccgacgga aaacccgaaa 6060 cggattcccc aagccctgct ctagccactc cagggctggc tagtctacgc acggctgaag 6120 tttgaggatc catcggcctt gcttcagcca cacaccggas gctggctgat ctacgcatgg 6180 cagaagctta aggattcagc ggaccttact gataagccat tgtttcgctt tgtcagcctt 6240 tgtatttgcc aattgctata ctgtttgtca ttttaattgc tgtaggactg actttatttg 6300 ctataggact gacagagtcg acccctacaa attgaacatg ttgggaaaag gtctcactag 6360 catttccgtt ctttcttata ataggaagtt ttcttgtgct tttatgtact tctgcctaat 6420 ttgggtgtct atcgcttccc ccacacaccc 
agcatgtaaa tgtatagaaa ctatctggcc 6480 agacgtaaaa gtcttccatc gttattcatc ccagccagaa gtttgttata ctgacaaaca 6540 tatctgtacc cataatggcc agctgtactt tactggatta tcaaaacaga gctaccagta 6600 cactaaagct ggtgtcatag cttgtagcac cctcggaagc acaaattggg tttgttggga 6660 tctctgcccc tgcacccaaa gccgagaaac tgcttaaaga agttgtacaa tttgtaactc 6720 ccccggctac ccaacagcca aagttaaccc ccttccctca accatgggaa gtaggccaaa 6780 atctgtttat aaatttggct gaaaacatag cccggacact tggtgtcacc aattgttggg 6840 tatgtggagg agccctaatg actgaagagt ggccttggaa gggaactagt cttgatgcat 6900 accagctcct tcaatggaac cattctgtaa ctgttagaac agacaaccta ccccaaaaat 6960 ggatcctctc ctcaaaagta attggagaag actgcttaag cagagcgggc tctgcttata 7020 ctcaatgggt aggagaaact ccttgcaaaa gaatattata ctggaattca acccaccgga 7080 cctggtggcc aacaaggcct gtttggtatt gggcacctgc ttatgcaact aaatcttctc 7140 aatgtactat tgtcacccaa actataagta actgtacagt actatcctct gaacaagcta 7200 acccatttca aggtatcccc attataagtc aaagctggac agacctgtcc tctgtaaatc 7260 cagacctctg gaaagcccct aaagggctgt tttggatatg tggaaaggtt gcatatgccc 7320 agctcccagc attatggaaa ggcacatgta ccataggaat aatccagcca ggatttttcc 7380 ttctgcctaa cccaaggggg gatgaactgg gcattcccct ttatgaaagc ctcaaaacac 7440 gggatgctcg atcccttgac caagctccta atataggagg gacccaggtt tggaaagatg 7500 acgaatggcc accccagaga attataaata cttatggccc agccacatgg gcacaagatg 7560 gaagttgggg atactgcacg cctatatata tagctaaatc gcattatcag gttacaagct 7620 gtcttagaaa ttataactaa tcaaacagcc atggcttttg agcttgctgc tcaacaacaa 7680 gcccaaatgc gcgcagccat ttatcagaac cgcttagctt tggactacct gctagcggaa 7740 gaaggaggag tatgtggtaa atttaatagc tctgactgtt gtttgcaaat agatgataat 7800 agtaaagcta taactgatat agccaccaat attagaaagc ttgcccatgt cccggtgcaa 7860 aaatggcaag gcataaaaat tggcaattgg tttgaaaaca tgttctcggg tctgggagga 7920 tttaagtaca ttattggatc tgtagtccta ttggtgggat cctgtcttat cctcccttgc 7980 atagccccaa taattatgaa ckccatctct aagtttgtgg aaacagttgt tgaacgaaaa 8040 actgcggccc acataatgtt aatgcaccaa attgaagatg atgctcttaa cccatgaagg 8100 gtcgagcatc aaagggggga a 8121 // ID 
piggyBac2b_Mm repbase; DNA; PRI; 999 BP. XX AC . XX DT 24-MAR-2010 (Rel. 15.06, Created) DT 24-MAR-2010 (Rel. 15.06, Last updated, Version 1) XX DE piggyBac2b_Mm element: consensus sequence. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac2b_Mm. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-999 RA Pagan H.J.T., Smith J.D., Hubley R.H. and Ray D.A.; RT "PiggyBac-ing on a Primate Genome: Novel Elements, Recent RT Activity and Horizontal Transfer."; RL Genome Biol. Evol 2, 293-303 (2010). XX DR [1] (Consensus) XX CC piggyBac2b_Mm is a truncated form of piggyBac2_Mm from Microcebus CC murinus. It contains an open reading frame and differs from CC another truncated form, piggyBac2a_Mm, by a 44 bp indel. XX FH Key Location/Qualifiers FT CDS 51..815 FT /product="piggyBac2b_Mm_1p" FT /translation="MDLRCQHTVLSIRESRGLPPNLKMKTSRMKKGDIIFS FT RKGDILLLAWKDKRVVRMISTIHDTSVSTTGKKNRKTGENIVKPACIKEYN FT AHMKGVDRADQFLSCCSILRKTMKWTKKVVLYLINCGLFNSFRVYNVLNPQ FT AKMKYKQFLLSVARDWITDDNNEGSPEPETNLSSPSPGGARRAPRKDPPKR FT LSGDMKQHEPTCIPASGKKKFPTRACRVCAAHGKRSESRYLCKFCLVPLHR FT GKCFTQYHTLKKY*" XX SQ Sequence 999 BP; 328 A; 188 C; 213 G; 269 T; 1 other; cacctttcgt accgctcacg agttttctcg tgtttcgcgc gccatctgtt atggacctta 60 gatgtcaaca cactgtcttg tccattaggg agagtagagg tttaccgcca aatttgaaaa 120 tgaaaacatc aagaatgaag aaaggtgaca taatattttc cagaaaaggc gatattcttc 180 tcctagcatg gaaagacaag cgggttgtcc gaatgatatc aacgatccat gacacttctg 240 tctcaacaac aggaaaaaaa aatagaaaaa cgggagagaa tattgtaaaa cctgcctgca 300 tcaaggaata caatgcccac atgaaaggcg ttgaccgtgc ggatcaattc ctttcgtgtt 360 gttccattct aaggaaaacg atgaaatgga caaaaaaagt agtgctgtac cttataaact 420 gtggactttt caattcattt agagtgtaca acgtcctcaa tccacaagca aaaatgaagt 480 ataaacagtt tctgctatcg gtggcgagag actggataac ggatgacaat aatgaaggct 540 ctccggaacc agagacaaat ctgtccagcc cttcccctgg 
gggtgcaagg agagcacctc 600 gtaaagatcc acccaaaagg ttgtcaggtg atatgaagca gcatgaacct acgtgtattc 660 cagcgagtgg aaagaaaaaa tttcctacga gagcctgcag agtttgtgcc gcccatggaa 720 aaaggagcga atctagatac ttatgtaaat tttgtttggt ccctcttcat agaggaaaat 780 gttttacgca gtaccatacg ttaaaaaagt actaggaact ttaattgttt aattadttgt 840 ttaattgttt ttgtaaataa aaatgttata attattgaaa aacaacacct aaagtgcatt 900 atgatctgta gttatgatga tttaaataac gtgcagtttg cccaaaaacg tgcggtccct 960 ggcgtatgtc ttagagattt ctatgcggta cgaaatgtg 999 // ID MacERV5a_LTR repbase; DNA; PRI; 456 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from DE Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MacERV5a_LTR. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-456 RA Smit A.F.; RT "MacERV5a_LTR - ERV1 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 5 bp TSD, but ERV1-class proteins. 5-6%. XX SQ Sequence 456 BP; 89 A; 167 C; 90 G; 110 T; 0 other; tgttaggcag ggatctagac ccgacatggc ggcgccaccc ggcgtagcag gccctttgtt 60 cgagacttcc cttcctcttt gaacacaccc gcagacttgc atacgtcaga ggttccggct 120 gggcaaccac actcccccat gacctgcagc tcacctgcat tccacaagtt cgtaaacagc 180 agtttcggtt aacagtttcc aaagacccct tcctcatgac ccttactcga ctcctgccct 240 agtagtttca agacccccca ggccctgttt gcacaaccag cttccctacc tctcgggcta 300 taaaaagccc ctaccctcct ccctcagcgc gacttcctcg gcccgcacct ttggaccgag 360 gaacctcgcc cgcgagtcct aataaaggct attctcactg ccatttgcct tgtgtcttcg 420 ttcccggttc ggctccagcc tcagtttacc tttaca 456 // ID LTR71C_TS repbase; DNA; PRI; 489 BP. XX AC . XX DT 14-DEC-2009 (Rel. 15.09, Created) DT 14-DEC-2009 (Rel. 
15.09, Last updated, Version 2) XX DE Long terminal repeat of an endogenous retrovirus - consensus. XX KW Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR71C_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-489 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1279-1279 (2010). XX DR [1] (Consensus) XX CC >88% identical to consensus. Target site duplication varies from CC 4-6bp, therefore classification is uncertain. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 489 BP; 114 A; 108 C; 107 G; 160 T; 0 other; tgtgagaaat aagaatgtga tttattttca aattggagcc tagcttatct tgggttctca 60 agatggtgtc taaatgcttt gtgcttctcc tttgctgatt cacttcctgc ttctttgtgc 120 ttccttttcg cgggcttttt actttggcgg gcttcatgtt taaacttttg gcaggatcca 180 atcagaatag taatcctgta agccaatcgg caaataggac atgcaaatga gatgattctg 240 taaagccaat caaaatgtgt catgtaggca tcctacggca aagcctgtaa ctcctgcaag 300 tattcctcca atgaggaacc aggagaggga cgtgcatttt agggataaaa gtgctgattc 360 ttcctgcttt ggtgtgcctg cccaccagac acccgatctt gcaagaccgt cattaaagtc 420 tcgcttctgc tgcactccat gtctctgtgt ccatcctttg attttggaca ggtgagttcg 480 tttctcaca 489 // ID CYN-I repbase; DNA; PRI; 89 BP. XX AC . XX DT 02-MAY-2006 (Rel. 11.04, Created) DT 02-MAY-2006 (Rel. 11.04, Last updated, Version 1) XX DE SINE element from Cynocephalus - consensus sequence. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; CYN-I. XX OS Cynocephalus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Dermoptera; Cynocephalidae. XX RN [1] RP 1-89 RA Schmitz J. 
and Zischler H.; RT "A novel family of tRNA-derived SINEs in the colugo and two new RT retrotransposable markers separating dermopterans from RT primates."; RL Mol Phylogenet Evol 28(2), 341-349 (2003). XX DR [1] (Consensus) XX SQ Sequence 89 BP; 29 A; 21 C; 19 G; 20 T; 0 other; ggctagttag ctcagttgat tagagcacag ccttaaacac caaggttatg ggttcaaatt 60 cctgcactgg ccagctgcca aaaacaaaa 89 // ID LTR20C_Mim repbase; DNA; PRI; 406 BP. XX AC . XX DT 11-NOV-2009 (Rel. 14.11, Created) DT 11-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR20C_Mim. XX NM LTR20C_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-406 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2979-2979 (2009). XX DR [1] (Consensus) XX CC ~89% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 406 BP; 94 A; 103 C; 103 G; 106 T; 0 other; tgtaacagag ggaatggcct gaaaaagggc aaaaatgttt tctgtctctt caaaaccccc 60 ccaccctttt tgagaactaa aacctgcatc cctgcctcag gccagtggtc ggaggggcag 120 gggagtgtcc tttgttcttt gtgccacagg agatggctca agggaattgt ccgggcggag 180 gtcacgagat tgtcttagcc gaagatggga tgaatcagaa cctttaaaag ctctgtactt 240 ctgctcagag gcaggacgtt ggtactttga gacgggagtc tgccaacctc ctcatttgcc 300 ggcaaattaa taaacttctc tttccttctc ctcaaaccac ttgtcctcgt tcttctgatg 360 cggcctcggg gacaagtgct gaactttcgg taacagttgc ccgcca 406 // ID LTR18_Mim repbase; DNA; PRI; 770 BP. 
XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; nonautonomous; KW LTR18_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-770 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1723-1723 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 770 BP; 192 A; 213 C; 155 G; 210 T; 0 other; tgtaaccggc tccataagca agaaaatgtt ttctcaatag tgtttttagt tccctcatat 60 ttcaaatgag ttttcagtgt tcacattgtg gctggggctg gcagctctaa gtggccttga 120 cagtgagttt tatctcagcc ctggtagctt agcagctaca gtctcaaagt aggcagaaaa 180 tagagagctc tcagtaagat ttgctgacca gaactttgta actttctgtt gcactgatgc 240 atcattctct tttgagtttc tttcccagaa acggcggaac ctccttgcag ccacgagata 300 actacaaagc ctcacaccac ctgtttgtcc ggagaagaca agataagcga cactccacca 360 cagatcgtgc ccaaagaccc actgagcctg cgcacaaggc tgaaagaatg accaatatta 420 acccctagct cattataata ctaaaatccc caccctggga gggacttatc gccatttttt 480 gatcatggga cgtatgtact agcatgattt ctcactgcgc ttgcgtgccc tgcactccac 540 cccaaacacg caatgatgct cactcccctc atgcattatt catttcgcct ccagaaaaac 600 cccctgaccc tgttggtcgg ggaggcggat ttgaggcagt ccataacctc cccctcccag 660 ttgacgcgtc ccctgcaata aatcctttct tccttgacaa tcctcgttgt ctcagtaatt 720 ggctttctgt gcggcgagca acattgaacc cccttgtggg ttcgttaaca 770 // ID LTR7D_OG repbase; DNA; PRI; 306 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR7D_OG. 
XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-306 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1681-1681 (2008). XX DR [1] (Consensus) XX CC 5 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 306 BP; 70 A; 96 C; 61 G; 79 T; 0 other; tgatagaatc aggtgtacct gcccaaatag acctgccccg cccaggcagg gtcaggttca 60 tgacccgccc aggaagggtc agactcatgt tccctcccag gaaataggaa tgccgccaat 120 tgctgccctc cctgaccctt taaatcccca tccgccataa cccacgtgtg gaattccttt 180 cttgatttca ccccacctgc atccaggtgg aaataaagac cacattattt ctacagaacg 240 gggattccgt tttcctttgt ttcgcgcctt ggaataatct ttcttcctcg ggtccagaac 300 ctttca 306 // ID LTR77d_TS repbase; DNA; PRI; 813 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW Endogenous Retrovirus; Transposable Element; LTR77d_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-813 RA Bao W. and Jurka J.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 11(5), 1635-1635 (2011). XX DR [1] (Consensus) XX CC Elements are ~82% identical to the consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 813 BP; 179 A; 258 C; 202 G; 174 T; 0 other; tgagagagga gaaaggaaga aaccggatag gcagacagat gggtcccagt aaaaacttct 60 gctgcaagcc ccacggactg gagtgagaac acctatttcc aggtccatgg actgcccctt 120 tcatgtgctt cttgattggc ccttttcaac cctctcccga ttggtgcatt tcaatcccac 180 accatggtgc ctctctctga ttggtgctac caccaaccaa tcagcacgct ctcaagacta 240 tataaacccc tgactgcaga ggcatcgttg gcaaccctct ttcgggtccc ctcctgctgc 300 cgggagcttt cctgtcttct aataaattcc tacttactca ctctctggtg tccgcgtccc 360 tcattcttcc tgggcatgag acaagaaccc ggaactgctg aactagaaga gctgctgaac 420 tagaagagct gcacacgctg actggaagag ctggacccac caagctgaga gagcttcaac 480 actccttggg ggcctggctt gccggcccac cgaaccgaca gagctgtgaa cacttcttgg 540 gggctggctc actggcccac cgaaccggca gagctgtaaa cacttctcgg ggctcggctc 600 accgaccccc aaactaagag agctgtaaca ctccttgggg gctctgcagc tgttggcatc 660 cccgagttct cgggtgccac cgcattcccc tatccggatg ccattcgcgt ccccggccac 720 cgtgtggatc ccgggctggc agtgcagctt tgggaactgc aacatccctt gggggcccag 780 ctcgccaact cgcagggaca aaaaactgca tca 813 // ID LTR14_Mim repbase; DNA; PRI; 481 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14_Mim. XX NM LTR14_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-481 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2976-2976 (2009). XX DR [1] (Consensus) XX CC ~98% identical to consensus. 4bp tsd. CC Similarity to LTR14B2_Sar from common shrew. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 481 BP; 109 A; 122 C; 146 G; 104 T; 0 other; tgtaagggcc tggttgcagc ccaaagctgc tggctcactt gtgcgagccc ggataacaga 60 aaattggccc aggggaggag taatcggctt ctggaaactg ctagcttggg gcccaaaggt 120 agagctatcg gctaaacgca tatttctgct tctgagaatc gcttgcttgc agctagacgc 180 ataggtacgg tgccagataa gggagaaagg cccctttgcc gccggcgggc taccagtcca 240 ccaatcattt taaagactaa cacgcacaat cagcttgtgc agcgcgggtg ttcaagaggg 300 aggggggata aaagggcagc cccagctttg gtcagggtcc ttgcctgtaa gagcgaccac 360 tgcgctggca ctctagggcc tggaccctgg ctagccagaa aataaagctc ctcttgagtg 420 attgcatcct tggtgtcttt gttcgtctgc ctggcggggt gcaggaagcc ggtccctaac 480 a 481 // ID LTR22_TS repbase; DNA; PRI; 886 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV3-type endogenous retrovirus - DE consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR22_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-886 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1275-1275 (2010). XX DR [1] (Consensus) XX CC >94% identical to consensus. 5bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 886 BP; 219 A; 230 C; 182 G; 255 T; 0 other; tgagagactc aaaggactca aaaactatac ttgctatgaa agaaacgttt actatatgaa 60 ttcctacctc acgttcactg tattttataa acccctacct cacgtttgct gtattttata 120 aatccctacc tcaaacccca taagactagg catatgtaag gcgcctggta gtaactgtga 180 ccgtaaagtt acaaaatatt tgcagtgttt gtacttagaa ctttgttaag tttttgtttg 240 tcccataata accatgttaa ttgaaataac cggcttcttc tgctggcagt gtgtgaatgt 300 taagcaataa gcccctcccg gacattcctt tgcttgaatt atggtgggct ctgtggcgcc 360 agccactctt ctggtgggct ctgtgttatt tcaaacctgc ttgtatgccc gagctgattt 420 atgcacagtc acagtgactg tgtaaaccgc agcttatcta acattccaga tggcttaact 480 aactccactc tgcccgtgta gccgttagta gtcaatcacg ggcagccgag cccgcgcccg 540 cgcaaatcaa aaacgaaccc accaatctca agctctcctg tcccctgaca tcaccgcgga 600 cctaattgag aactaattct gcaaggtccg tggacttcgt tttctacgtc atgccttctt 660 ctaattggat aatgtctcaa gcctaggaaa aaccggtcac gcccagtccc tccctcgtaa 720 ctgtgtatat aaactacttg ctgtactagg tcggggctct cgttcctgta ccgctgcgcc 780 ggttacagac gtgagcccag actcgagcct gaataaagac cctcgtgtga tttgcatcgg 840 agctggctct ttggtggtct ctcggattca ggattcgggt acaaca 886 // ID HAL1-1B_Cja repbase; DNA; PRI; 2773 BP. XX AC . XX DT 13-JUL-2010 (Rel. 15.07, Created) DT 13-JUL-2010 (Rel. 15.07, Last updated, Version 1) XX DE HAL1 non-LTR retrotransposon: consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW HAL1; HAL1-1B_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-2773 RA Bao W. and Jurka J.; RT "HAL1 non-LTR retrotransposons from marmoset."; RL Direct Submission to Repbase Update (13-JUL-2010). XX RN [2] RP 1-2773 RA Bao W. and Jurka J.; RT "Origin and evolution of LINE-1 derived "half-L1" RT retrotransposons (HAL1)."; RL Gene 465(1-2), 9-16 (2010)doi:10.1016/j.gene.2010.06.005. 
XX DR [1] (Consensus) XX SQ Sequence 2773 BP; 970 A; 612 C; 590 G; 601 T; 0 other; gagagagatc caagatggcc gtctagcagc agctcagggt tgcagctccc agtgaaggcg 60 cagggagtga gtggacgcca gacttccaga cgaatttttg ttgcctacgg accaggagat 120 tcccagcgga ggagccccac gggtcgccag cgcgactctt ttggccggcg cggctgtttt 180 gccggcgtcc cggcgcggcg gttctcagtg cggagtatgc gggactgggt gcccttttag 240 ttggcgtttg gagctccgag aaggcagagt cacccattca gctgattgag agggggactg 300 aagccgggag ccagaccggg agattcccgg gcagaaaagc gccacgaatc tcagcgccgc 360 tgtttcagcc ggcgcggtcg ccgcacggga aattgcatag atcccggcgc cttttcggcg 420 ggcgactggg gcacctggga gagaatcgac cgttcaatta ggaaaagaaa aaaagggggc 480 tccgaggcag ggagccaggt gatcgggctc ggctggtccc acccccacaa aaaaacagca 540 attggaaacg ctgacggttg agagtttcac agcaagcaca gctgaacccg ggaaggtcca 600 gctctgtggg ggaggggcgt ccgccgaagg aaaacaaaga aacagaaata acatcaccac 660 caacaagctg gacgtccact cagagaccca atctgaaagt cagcaattac aaagacgaca 720 ggtggataaa tccacaaaga tgggaagaaa ccagcgcaaa aaggctgaaa acatccaaaa 780 tcagaatgcc tctcctcctt caggggatcg cagctcctca tcagcaaggg aacaaggcct 840 gatggagaat gagtgtgatg aattgtcaga atcaggcttc agaaggtgga taataagaaa 900 cttctgtgag ctaaaagaac atgttctaac ccaatgcaaa gaaactaaga accttgaaaa 960 aaggtttgac gaaatgctaa tgagaataga caatttagag aggaatataa gtgaattaat 1020 ggagctgaaa aacacaacac gagaacttcg cgaagtatgc acaagtttta acagtcgaat 1080 tgatcaagca gaagaaagga tatcagaggt cgaagaccaa ctcaatgaaa taaaacgaga 1140 agacaagatt agagaaaaaa ggataaaaag gaatgagcaa agtctccaag aaatatggga 1200 ctatgtgaaa agacctaatc tacgtttgat aggtgtacct gaatgtgacg aagagaatga 1260 atccaagctg gaaaatactc ttcaggatat tattcaggaa aactttccca acctagcaag 1320 gcaggacaat attcaactcc aggtaataca gagaacacca caaagatatt cctcaagaag 1380 agcaacccca aggcacataa tcgtcagatt caccagggtt gaaatgaagg agaaaatgct 1440 aagggcagcc agagagaaag gtcaggttac ccacaaaggg aagcctatca gactcacagc 1500 agatctctca gcagaaaccc tacaagccag aagagagtgg gggccaatat tcaacatcct 1560 taaagaaaag aactttcaac ccagaatttc atatccagcc aaactaagct tcataagtga 1620 aggaaaaata 
aaatcttttg tgaacaagca agtactcaga gatttcatca ccaccaggcc 1680 tgctttacaa gagcttctga aagaagcact acacatagaa aggaacaacc agtatcagcc 1740 tttccaaaaa tataccaaaa ggtaaagagc atcaacataa tgaagaattt acatcaacta 1800 atgggcaaaa cagccagcta gcatcaaatg gcagtattaa actcacatat attattatta 1860 atcctaaatt taaatcgact aaatccccca atcaaaagac acagacaggc aaattggata 1920 aaaagccaaa acccatcggt atgctgcatc cagacccatc tcacatgcaa ggatacacaa 1980 agactcaaaa caaagggatg gaggaagatt taccaaccaa atggagagca aaaataaata 2040 aataaaaagc aggagttgca attctcgcct ctgataaaat agactttaaa gcaacaaaga 2100 tcaaaagagg caaagaagga cattacataa tggtaaaagg atcaatgcaa caagaagagc 2160 taacgatcct aaatatatac gcacccaata caggagcacc cagatacata agacaagttc 2220 ttaatgactt ataaagagac ttagactccc acacaataat agtgggagac tttaacatca 2280 cattgtcaat attagacaga tcaacgagac agaaaattaa caaggatatc caggacttga 2340 actcagaccc ggaacaagta aacttaataa acatttatag aactctccac cccaaataca 2400 caaaacatac attcttatca gtaccacatc acacctactc taaaagtgac cacataattg 2460 gaagtaaatc actcctcagc aattgcatag catttcacag gtttaaatga aatattggtt 2520 ggctgcttgt ttgttatttt cctcccttat tccttcctgt ctccattgtt aagcacaatt 2580 attggcataa aagaggtttt caatacccat ttttagactg cattctagcc tgggcaataa 2640 agcaagactc ccgttctctc tccctctttc tcttcctctt tctttctctc tttcattctt 2700 tttttttttc tttttttctt tctttctctc tttttttttc tttctttttc ttcctttctt 2760 tctttaaaaa aaa 2773 // ID MacERV6_LTR3 repbase; DNA; PRI; 468 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MacERV6_LTR3. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-468 RA Smit A.F.; RT "MacERV6_LTR3 - ERV1 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). 
XX DR [1] (Consensus) XX CC despite 5bp TSD 6%. XX SQ Sequence 468 BP; 99 A; 167 C; 117 G; 85 T; 0 other; tgtctggacg gggggagagg acaaagacga ctaagatggc gcatttccgg gttcttcatc 60 accaacttac ccgcgcgcgg gaaaatgcag cccgcgcccg ggaaaaatac agaccaactg 120 cgcaggcgca acgtggcgtc cgatcgagga aaccgaaact tacctggccg cgcctacgga 180 acgcccccga cacgcccgtg tcccgcctat tgccctccca ctcccaagcc ttagacagaa 240 aagccgctcc cggcaggcgc gcggcgcgaa cttcctcggc ccctcctcat atgcggacct 300 aggaacctcg cccgagaacg ccggagcgac ttcctcggcc tccaccgccg gagaccggtg 360 aacctcgccc tttcttcctt cacattggct agctaataaa gtttcttttt acctcgccta 420 cttgcctctt ctctggcgcc tgctccggtg gtcgcataaa acaaatca 468 // ID LTR19_OG repbase; DNA; PRI; 927 BP. XX AC . XX DT 19-OCT-2009 (Rel. 14.11, Created) DT 19-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR19_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-927 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2865-2865 (2009). XX DR [1] (Consensus) XX CC ~84% identical to consensus. 6bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 927 BP; 239 A; 195 C; 240 G; 253 T; 0 other; tgtgggtgga aatgtgggag aactggctga ggagtagagg cagaggaagc cgtcctcttg 60 aagcatttat ttagattgga atttatgaaa tattttaaca attagggaat gcggattgtg 120 aggcaaggga taaaggacag atagaagatg tggatatctc agagagtgtg cgcagggaga 180 ccacaatgca tatcgggagc cccaaaagtc tgttgaacac gacagcacag gccttggaaa 240 aaaggacaag aaatgtctta ctgaggcctg acccttcccc cacagagagc ctgtacaaag 300 gtctctgggg gctaaatgag ccgttgcgag atgtgctggg aacagcacgg gcagctggct 360 ccttgataag agaaacaatc tcttggggct agataagaga aacagcttct ttctcccccg 420 ctcccggtat gggtatggcg tcagcctgtc tgccccacag ggacccggag acttaattgc 480 ctcctgtgga aactgtgtgc gtggtcaaac ctgctgtctg ctctcgcatc ctgcttctaa 540 gtcttgttct ttgttccatt aactgcatcc caattcttta gtttataaat aacatagagt 600 ggagtataag tagttaaaga gcaacttagc tgaagtggtt aaagagtagc tagttaaagt 660 ggtaaatacc ataaaggatg acgtatagtg cagttttttg ctttccccca tgattattca 720 tttgctgatt tgaagacaat gtatgtaata aatactgagt taacccagag cacggggcct 780 tcgctgttcc tcgcattagg tggcgatggt cccctgggcc cactcttttt aaatcctact 840 ttgtctcttt gtctttattt ctttcattct ctcgtcaccg ccaagcattg gggataccca 900 cgggttgtgt gggggctggc ccctaca 927 // ID ERV1-4B_TSy-LTR repbase; DNA; PRI; 481 BP. XX AC . XX DT 06-APR-2010 (Rel. 15.09, Created) DT 06-APR-2010 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-4B_TSy-LTR. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-481 RA Jurka J.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1251-1251 (2010). XX DR [1] (Consensus) XX CC ~92% identical to consensus. 4bp tsd. 
CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 481 BP; 111 A; 126 C; 89 G; 155 T; 0 other; tgaaatagct aatgaaatat gtaaaataat tgcaatagct tgcttgacat aagtaactcc 60 attttgttaa aatcctccat cttagctgtc tgcctgacct aaaaatgacc ccccccagtg 120 tgtacgtgtg attgttgtaa cgttcccatg accacagacc cacagccttg ttcccctgag 180 ctgttcacgc tgaccattct tgctgaccat ttgaaaactc cctagccaca agttccgtgt 240 aattgtatgc cttgttaact gaaatccaag agattgttcc ttttgttttt agttccttgt 300 accctcaccc cctgctatgt agcccacttg cggattttgc ctttataagc ttcttgctga 360 ttccactcag tgtcgagagc tttcttggcg tgagcctact cttgaccgcg tgctattaaa 420 ggactcaaaa ttcgaacccg gtgtgctggc gattctcctt tctcgcggtt cggctataac 480 a 481 // ID ERV2-3_TSy-I repbase; DNA; PRI; 2927 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Internal portion of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV2-3_TSy-LTR; ERV2-3_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-2927 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1206-1206 (2010). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX FH Key Location/Qualifiers FT CDS 2652..2912 FT /product="ERV2-3_TSy-I_3p" FT /translation="YSTITQPDINIVPSLSEKCLSWRGCKTLFTHSFFIHF FT FSFFMVPDIQFHCHRSNCTIFNSASFSHHQTSRLFYQTCPEENSSFKFKK" FT CDS join(1974..2024,1978..2295) FT /product="ERV2-3_TSy-I_2p" FT /translation="MTHLLGFPLWRALSQEQLTYLDSPYGELSAKNNETAP FT ILEKTSAFYHCYNRSLFSVLCKKCYITNCLNASIKHTHIMLLMQQSYVLLP FT VNLSTPWYFDPGMYALNLIFYALFRPKRFIIALIL" FT CDS join(349..1062,1066..1581) FT /product="ERV2-3_TSy-I_1p" FT /translation="MKDWYQRNGGKDMPADAYSIWNVLRQALETEPSYGTF FT SRPPPREATPLLKARPSSRASSRIFPLSQPAEDTDTPDSSEDEGGGDXNQN FT NKTQDDDRDHEQLLPEDEAKLEDEAAKYNNPDWPPGARNFMSVPSSSPAVK FT VCKIKRPHVYAVGFQGAIQQAHEQGELLHCFPISYEWDDPTWEALPYKLLK FT ELKSAVHDYGPTSPYTMTLVEGLAGRWMTPYDWFQVAKACLPGGSFXLKME FT YKKLTKKELVKNEKRRGHNTVTKDMLMGEGAHVGVADQLXLSKXVLKAVSA FT MALGAWKRLPPPDQKVPGLVNVKQKSDEPYEDFVARLVEAVDRMVLSKAAA FT DLIVKQLAFENASPTCQALIRSIRKSGDVSDFVKACAEVTPSYMQGVAIAA FT ALQGQTYFSVSTKSKQ" XX SQ Sequence 2927 BP; 912 A; 575 C; 588 G; 841 T; 11 other; tctggcgccc gaacagggac ctgaagaagg taagctcccc ctgatatgcg gtgtgactga 60 ggacttcacc catagggttc tgcaggcccc ctgcgagtga gtgcaggtag agagagggta 120 aacagcaaga actgtttgag aagttagcac ctctgtcact atgggaaatt cagggtccaa 180 ggacagacgt atctatgcct ctattattaa aagaacttta aagagaaaag gtatagaggt 240 aagtaagaaa acattaatag aatttttaca gttcgtccaa acagtaagcc cttggttccc 300 tgataaggaa cgcttgatct ggatacctgg gaccaggtag gtaaaaatat gaaagattgg 360 tatcaaagaa atggggggaa ggacatgccc gcagatgctt attctatatg gaatgtgtta 420 agacaggcac tagaaacaga gccatcctat ggcacctttt ctagacctcc tcctcgggaa 480 gccacacccc tattgaaagc aagaccatct tctagggctt cgtcgaggat tttccctcta 540 tcgcagcctg cggaggatac agacacacca gatagtagtg aggatgaagg gggaggtgat 600 maaaatcaga ataataagac tcaggatgat gacagagacc atgaacaatt gttaccggag 660 gatgaggcta aattagaaga tgaggcagca aaatataata acccagattg gccaccaggg 720 gccagaaact tcatgtctgt tccctcttcc tcacctgctg ttaaggtttg taaaattaag 780 agacctcacg tttatgcagt cggctttcaa ggggccattc agcaggcgca tgaacaaggg 840 gaattgttac actgctttcc 
aatctcatat gaatgggatg acccgacttg ggaggcactg 900 ccttataaac ttttgaaaga gctaaaaagc gcagtccatg attacggccc wacctcccct 960 tatactatga ctctggtgga agggctggct ggtcgatgga tgacccccta cgattggttc 1020 caggtggcma aagcttgctt gccaggaggk agcttttwcc tgtgaaaaat ggagtataaa 1080 aaattaacca aaaaagagtt agttaaaaat gaaaaaagam gaggacataa tacggttacc 1140 aaggatatgt tgatgggaga aggcgctcat gtgggagtcg cagatcaatt gasactatcw 1200 aaggawgtgt taaaggctgt ttcagcaatg gccttaggag catggaaaag actgcctccc 1260 cctgatcaga aggtcccagg tctggttaat gttaaacaaa agtctgatga accctatgaa 1320 gactttgtgg ccagattggt ggaggcagtc gatagaatgg tgctaagcaa agcggcagca 1380 gatttgatcg ttaaacagtt ggcttttgaa aatgcmtctc ccacttgtca ggcattgatc 1440 agatctatta ggaagtcagg ggatgtttct gacttcgtaa aggcctgtgc agaggtcact 1500 ccttcctaca tgcagggtgt tgctatagca gcagcacttc aaggtcaaac ctacttctca 1560 gtttctacaa aatcaaaaca ataaaaaaaa ataatacaac agcgtgtact tgttatacat 1620 tttaagactt ttgtaaaact tttaacatat ctcattctac aggaattcct tataatcctc 1680 aaggtcaggc aattgtaaaa cgctctaatc aatatctgaa aacatacatt caaaaactxc 1740 agaacggaga ttataaataa tcatctcctc atcatattct taaccatgtg ctctttgtaa 1800 ttaatcacct aaatctggac agttctaatc aatcagcatt ttttaggcat tattccttta 1860 acagttctcc atgccctctc attaagtgga aagacctttt gagtaaccaa tggtgtggac 1920 cggatgtcct cttgacttcc ggaggagagt tcgcctatgt gttcccaaag gacatgactc 1980 acctacttgg attcccccta tggcgagctc tcagccaaga acaatgagac tgctccgatc 2040 ttggagaaga cgtcagcctt ctaccactgc tataatagat cacttttttc ggtactttgt 2100 aaaaaatgtt atattaccaa ttgtcttaat gcatccatta aacatactca tattatgctt 2160 ttaatgcagc aatcatatgt tttacttcct gttaatcttt ccacgccatg gtattttgac 2220 ccagggatgt atgctttgaa tttgatattt tacgctttat ttagaccaaa aagatttatt 2280 atagctttaa ttttataaat tattgcttta attgttaaaa taagatttat agccgtattt 2340 accattactt tggtttaata agtacatatc gctgatcatg ttaataaatt atttaaaaat 2400 attaacttgg ctttatttac ttaggaacaa atagataaag gcattatgga tagattaaat 2460 gctttaaaaa gtgctttaat atatattgat aactagattc aacatattaa agttcaatta 2520 tcaactacat gtcatgctca atataaatgg 
atttatatta cccctttacc ctataatggt 2580 acagacatta cttggtccca ggtgcagtca catctaaatg gcatctggaa atccagtgga 2640 actgctctta atattcaaca attacacaac cagatataaa cattgtccca agcctatctg 2700 agaagtgtct cagctggaga ggttgcaaaa cactttttac ccacagtttc ttcattcatt 2760 tcttctcatt ctttatggtc cctgatatac aatttcattg ccatcggagt aattgtacca 2820 tttttaattc tgcttctttc agtcatcatc agacatctag gctcttctat caaacgtgcc 2880 cagaggaaaa ttcatctttt aaatttaaaa aataaaaaag ggggaga 2927 // ID LTR2C_Mim repbase; DNA; PRI; 662 BP. XX AC . XX DT 06-NOV-2009 (Rel. 14.11, Created) DT 06-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR2C_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-662 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2950-2950 (2009). XX DR [1] (Consensus) XX CC ~94% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 662 BP; 154 A; 155 C; 180 G; 173 T; 0 other; tgtaggggac tgacatatat tttcctagtt taagaattaa atttagtgag taatgtacgt 60 gctctgccca gagcgtttga gagtaatgtg ctccggacaa ggcccttgtg acaaacaggt 120 tgctgcctag aggcataaca atagacctga gtttctgcat tgaggcaaga gctgcccagc 180 cccacgataa gtggaggcct ggaggccaca caataagcac attgttcctt tgattgctgc 240 ttagccccat aagatggctg gttagtcaat gacgggtaag acccctcaag ggaggggcga 300 cctaagccag gcacagccgc aggggatcgg cctaagagga actggggacg aaaaatgccc 360 cctgtggctg ccttgcccaa ccttgctaat ctcggtctgt gatctatgcc tggcgcctcg 420 agcaactgcc tggaaacctt gagtcagggg acatctgtgt ccttaggcta cgtgttccgg 480 aatttatggc cattgctgca atgcctgggt gggcacgtac aaggaactaa ctttgatttt 540 tggggtataa aaataaaacg actggtgcgt tctgcactgc agtcttgtaa ccatctttgt 600 gtgtgtctgt gtgtcttttt gtgttctgtg ttcatcctcc gtcctgcaaa cgggccacga 660 ca 662 // ID hAT-2N3_TS repbase; DNA; PRI; 626 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.1, Created) DT 14-DEC-2009 (Rel. 15.07, Last updated, Version 3) XX DE hAT-2N3_TS is a family of non-autonomous DNA elements found in DE Tarsius syrichta mobilized by hAT-2_TS. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-2_TS; hAT-2N3_TS. XX NM hAT-2N3_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-626 RA Novick P.A., Smith J., Ray D. and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). 
XX DR [1] (Consensus) XX SQ Sequence 626 BP; 143 A; 169 C; 126 G; 184 T; 4 other; caggggtcct caaactacgg cccgcgggcc acatgcggcc cgccgaggac atttatccgg 60 cccaccgggt gtttttgccd ccgctgcctg tcctgcctag cagccgactc gtccgggccc 120 gcagtgcgca tgtgtggaat gtgcdtbnga gcactctccg actcccctcc ttctctctgt 180 ctctcgactc ctcctctcag taatctcagg aaccatgcac tcaaaatggc aaccatcttt 240 ggcagcactt atgtctgtga acagactttt tccagaatga aacatctgaa atcttccaac 300 cagatctaga ctaactgatg cacacttgca tcacttgtta cggactagca gtgacaaata 360 tggaacccgg acattgacca tctcattagc caaaagcagg cccatagttc ccattgaaat 420 actggtaagt ttgttgattt aactttactt gttcttcatt ttaaatattg tatttgttcc 480 cgttttgttt tttcacttca aaataagata tgtgcagtgt gcataggaat ttgttcatag 540 tttttttttt taaactatag tccgcccctc caacggtctg agggacagtg aactggcccc 600 ctgttttaaa agtttgagga cccctg 626 // ID LTR1C1_OG repbase; DNA; PRI; 526 BP. XX AC . XX DT 19-OCT-2009 (Rel. 14.11, Created) DT 19-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1C1_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-526 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2852-2852 (2009). XX DR [1] (Consensus) XX CC ~91% identical to consensus. 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 526 BP; 131 A; 149 C; 105 G; 141 T; 0 other; tgatacagga ttccaccact cagggctaga cagggagccc cccattctct aggcccaggt 60 gagctcagag gtcagacccc catttccagg cacaagcaga ctgccttttt cgggacacac 120 cctctgggct ctgataagat catgaacaaa gcagcctgtg atggtttttt gggtggttcc 180 accttgggta ggagcacaca ccctaatccc ttctcaggga caaagactgt aaaggactga 240 taactacatt cattcatttg ttaatgcact catttccacc taccaactgc cagccatgtg 300 taatcctcta gacctaatac ccctagataa aagagcccac gtggagactc ccgagagagt 360 tcttcagttc tttccccctt ccagaagctc tggtgctttt tacttcttct gtattccttt 420 ctaacttcta ataaaagtcc tatcttacca ctgatctgtg gtccgtgggt tcattcttcg 480 aatccccgag accaaggacc tactgaagag cgaaattccg gtatca 526 // ID LTR4_Mim repbase; DNA; PRI; 398 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR4_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-398 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2953-2953 (2009). XX DR [1] (Consensus) XX CC ~97% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 398 BP; 107 A; 120 C; 86 G; 85 T; 0 other; tgccctgttc attttaaact ttctcacctt tgacactgcc ggtaaaacag ccgccgaccg 60 cctttggcac ccatccacca cggatactta tgcacaggcc ttatggaagg agccgcttac 120 aggcaaatgg ctggggcctg accctgtcct catatgggga aaaggacatg cctgcatcta 180 cgatacccag gcaggaaacg ccagatggct accagagaga gcgattaagt tatataaccc 240 acccagggaa tcccctgaga agaattctta attctgttct cttccagaaa tacaatgact 300 ccccgagaaa aggcgctgcc cctcctgtgc ctggttctgg tcgcgatcaa gagcaccacc 360 gcagccaaca tccatcagat ctacaactat acatggca 398 // ID LTR5B_Cja repbase; DNA; PRI; 465 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR5B_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-465 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2917-2917 (2009). XX DR [1] (Consensus) XX CC >92% identical to consensus. 5bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. 
XX SQ Sequence 465 BP; 107 A; 176 C; 85 G; 97 T; 0 other; tgtaaggccg gttgcctcgc cgggagctca gacacgccca tccctctccc cgcctgaaaa 60 ttacgtcacc agaatacgcc catcatctag ctccgccttg gaaatcccct ctgcactcag 120 cactaagctc tgcacccggc acctagcccg ccttggaaat cccctctgca ctcaacacta 180 agctctgcac tcggcaccta gcccgctcac cggaaatccc gcctacctac caatagcagc 240 ttgccccgtc cacgtcctgt ttctccacag ccaatcagaa cgccttcact ccctataaaa 300 ccccacgcct gagaggagtc gggcgtgact tctctggcct ctttccccgg gaccacagaa 360 cctcgcccgg gagccgaata aattggcgtc taattgttct catgcaggcc tcagtttcct 420 cattttaaac tcggcaataa acctcacaag acaataaccc ttaca 465 // ID ERV1-4E_TSy-LTR repbase; DNA; PRI; 526 BP. XX AC . XX DT 06-APR-2010 (Rel. 15.09, Created) DT 06-APR-2010 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-4E_TSy-LTR. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-526 RA Jurka J.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1254-1254 (2010). XX DR [1] (Consensus) XX CC >83% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 526 BP; 110 A; 157 C; 90 G; 165 T; 4 other; tgaaatatct gtaaaaccta taatagcctg taatagcttg cttgacataa gtaactccat 60 tttgttaaaa atcctccatc ttagctgctt gcctacccag aaatgacncc ccccngcgtg 120 tgactgttgt aacgttccca tgatcacaga cccacatcct tgttccccct gagctgttcc 180 ccctgagcgt ttggaaactc cccagccaca agttccgtgt aattgtactc ttgcttaacc 240 gaaatcagat gttacgccaa gaaattgttc cttgcaaaat tcccnctgcc acatccccat 300 attttgttcc tatattttgt tcccttttgt ttttagttcc ttgttcccct acccccctgc 360 tatgtagccc acttgcggtt tttgccttta taaactcctt gcttgctccg ctcggggtcg 420 agagctcttt gtggcgctag cccactctcg accgccggct attaaaggac tcaaaattcg 480 actctcagcg tgttggcgac tcctttctcg cgntccggct ataaca 526 // ID CYN-II3 repbase; DNA; PRI; 184 BP. XX AC . XX DT 02-MAY-2006 (Rel. 11.04, Created) DT 02-MAY-2006 (Rel. 11.04, Last updated, Version 1) XX DE SINE element from Cynocephalus - consensus sequence. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; CYN-I; CYN-II3. XX OS Cynocephalus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Dermoptera; Cynocephalidae. XX RN [1] RP 1-184 RA Schmitz J. and Zischler H.; RT "A novel family of tRNA-derived SINEs in the colugo and two new RT retrotransposable markers separating dermopterans from RT primates."; RL Mol Phylogenet Evol 28(2), 341-349 (2003). XX DR [1] (Consensus) XX SQ Sequence 184 BP; 47 A; 51 C; 62 G; 24 T; 0 other; ggccggcccg gtggcgcacc ggagagtgcg ccgcttggga gcgcggcggc gctcccaccg 60 agggttcaga tcccatgtgg agaccggttc ccgctcactg gctgagcatg gtgcgggcac 120 gacaccgagg gttgcaatcc cgttgccggt cagtaaaaaa gacaaaaagg aaaaaaaaaa 180 aaaa 184 // ID SPIN_NA_1_Og repbase; DNA; PRI; 225 BP. XX AC . XX DT 23-OCT-2008 (Rel. 13.11, Created) DT 23-OCT-2008 (Rel. 13.11, Last updated, Version 1) XX DE SPIN_NA_1_Og, a non-autonomous member of the SPIN family of hAT DE DNA transposons. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; SPIN; KW MITE; SPIN_NA_1_Og. 
XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-225 RA Pace J.K., Gilbert C., Clark M.S. and Feschotte C.; RT "Repeated horizontal transfer of a DNA transposon in mammals and RT other tetrapods."; RL Proc Natl Acad Sci U S A 105(44), 17023-17028 (2008). XX DR [1] (Consensus) XX CC SPIN_NA_1_Og is a member of the hAT superfamily. The TIRs are CC 16-bp long and are flanked by 8-bp TSD. XX SQ Sequence 225 BP; 67 A; 48 C; 55 G; 55 T; 0 other; cagcggttct caacctgtgg gtcgcgaccc ctttgggggt cgaacgaccc tttcacaggg 60 gtcgcctaag accatcctgc atatcagata tttacattac gattcataac agtagcaaaa 120 ttacagttat gaagtagcaa cgaaaataat tttatggttg ggggtcacca caacatgagg 180 aactgtatta aagggtcgca gcattaggaa ggttgagaac cactg 225 // ID MER72_Mim repbase; DNA; PRI; 383 BP. XX AC . XX DT 11-NOV-2009 (Rel. 14.11, Created) DT 11-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW MER72_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-383 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2989-2989 (2009). XX DR [1] (Consensus) XX CC ~90% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 383 BP; 106 A; 98 C; 65 G; 114 T; 0 other; tgtgaactaa aaataaaatc ctaagccctc tgctaactga atggacccct ctcggccaag 60 gggaccccag aaacacctta aaactgagtt cccagccatg acgggatggg agagataaag 120 cacaattaca tgcatatgat ttcccttcat aaatattcat gactcctcct atagcttgtt 180 gaatatgtat atttgaccac cccactcagt ataaaatcct gttccctttt gtctcttctt 240 tgaagcacat gtgcctggct tctggccggg gctctgcttc ccaacctgtt ggaatggcca 300 cccaacaggt tgctaccctt tatgagaaat aaagctctcc tttttccaaa gtataaacct 360 cgttaatttt tttttagtta aca 383 // ID LTR77a-int_TS repbase; DNA; PRI; 1801 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Internal portion of an ERV1-type endogenous retrovirus - DE consensus. XX KW Endogenous Retrovirus; Transposable Element; LTR77a-int_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-1801 RA Bao W. and Jurka J.; RT "Retrovirus repeats from tarsier."; RL Repbase Reports 11(5), 1738-1738 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 1801 BP; 530 A; 373 C; 356 G; 540 T; 2 other; ttttggtgct atgacttgga ataatggatt cagaagggta agtgaaagca gacctgtctc 60 ttcacttcca tttctgaggc ttcttatcct caaagtagtt ttctatctca aaatcaacct 120 tggcctctgc cagccactta aaagcagata gcacggctgc tgttctataa gtcacagaat 180 gcaggattgc tggaaagggc ttagtcaatc ccccattgcc cttgggtgtt gtgaatgttg 240 gctctgttca tagccggttt cctttcatgg atgtctagac gttgcatgga tctggattaa 300 gtttgggagt aattgatgat ttctggccaa gggtacacct tggtgttacc tgaaggctct 360 ctgatggatt ccaattcctg acctcctggt agggtatcta caaaagactt ccgatctttc 420 ctattgcatt ttctttctct caaagctgcc gtggttccta tcacttcttt gtattccatg 480 ttaatgactt ttatgcaatg tgatgaaatg ctggtatttg ctagaataag ctttaaaatt 540 aaggaatgtg acaaggagca atgttgttct tgctatttaa aaaaaaacag ggggacatga 600 gaaaaaagca aagggaggtt tttcttcctt cagtacatct agtctgcttt tttgtgtcag 660 gaaccaacat agttttgcta atgtgaatgg gatggacctg catagtcctt aactttctcc 720 ctttggacag aattttgtgg agccaaaaca cctcaccttg agaactgcaa ggatgtattg 780 ttttgtttaa ggcaaatttg gattcaaagc tacaccttag cctttcaagc ttcatccacc 840 tgcttaagag acttttcaag atataagtta ttaaggaaat acagactttt atcacatgga 900 atactatcaa ccttatgccc taaatatcag gtagaccatt tatttacttc actaagacct 960 aaattcaaat tggactgtcc tatctgtaat aatgaaaatc atacatggaa caggaatact 1020 tccataagaa tgtctcacca aataatggct acaatgtgtc aaagaccatg ggagggatat 1080 tcaatagatc catacatctt tctgagagac tttttacacc ctcagtttga atagtcacat 1140 agaaaaggag ggttctacct acaaattaac ccttctcatt tctgtgagtg ttggcaattg 1200 accaacatta tactgatacc tttcaaggaa atkaaagkag gggaattcct atgcaaagac 1260 tggtgctttt gttactcata tggctggtcc aacctactcc cactcttaag atgtgcatgc 1320 atgaacaaat gcatatgact tctttcagta tccacaccac tctctctgag taccctagct 1380 caaattcaag tacttcaatg catctttcag atttccatat gggaagacac aaaacagcaa 1440 gaaggtaaaa aatctgggag gtagtctaga aaaatcagag gtagtctcag ttgggggcga 1500 gaggctgggg cttgcacagg taagcacaac taagttctgt caccacagtc atatttggcc 1560 catgcataat aaaccctgtt gtaaaatgtg cttcttatca cttagagttc atcaaacttc 1620 agatggtgat gcagatggag atcagcatga 
cagaatcttt cttctgaggc ccattagacc 1680 atccactgga gaaaacccaa actgccacat aaacaatgtc cccacacagc ttaaagaagc 1740 tagaacagtc attgcccctt tcccctaaca gtagctaggg atactactcc agaggaggta 1800 g 1801 // ID hAT-2_OG repbase; DNA; PRI; 2797 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.1, Created) DT 21-APR-2009 (Rel. 14.1, Last updated, Version 1) XX DE hAT-2_OG is a family of autonomous DNA elements in Otolemur DE garnettii found also in Anolis carolinensis, Microcebus murinus, DE Myotis lucifugus, Monodelphis domestica, Echinops telfairi, DE Xenopus tropicalis and Schmidtea mediterranea. Fewer than five DE elements exist in the Anole genome at 2246bp in length. XX KW hAT; DNA transposon; Transposable Element; hAT-2_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-2797 RA Novick P.A., Smith J., Ray D. and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). 
XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 778..2583 FT /product="hAT-2_OG_1p" FT /translation="MISRKRKIDSECRIFKEQWTYDYFFMQYKERAVCLIC FT QNIVSVFKEYYLRRHYQTQHKDKYDCLVREVRKDKILKLKNTLTTQQNTFV FT KQKQLNISSLRASFQVAKLIACTGRPFVEGEFVKECLLSVAKEMCPEKADL FT FSTVSLSGPTITRRIEEMGDNLHQHLQNSAKKLSYFSLALDKSSDVRDSAQ FT LLIFIRGMNDYFEVTEELAALQSIKGTTTGEDIYEKVCQTVNGLELDWAKL FT ASVTTDGAPSMVGSKKGVIAHINQEMDKHNHSHPIAIHCLIHQQVLCSKSL FT KWDSVMKIVVSCVNFIRANALNHRQFQEFLSELNVAYEDVLYHTEVCWLSR FT GRVLKHFYDLLPQITAFLLSKNKEVPELNDAEWKWHLAFLTDVTELLNSFN FT VQLQGKGKLICDMQSHVKAFEVKLGLLIKQVKEENFCHLPTTQNLLAEKPL FT VAFPNKTCVDSLEKLQKEFQFRFKELHLHEQDIQLFRNPFSIDIENVDTIY FT QMELAELQNCDSLKDAFKSSSLPNFYASLPSETYPNLRNHALKMATIFGST FT YVCEQTFSRMKHLKSPTRSRLTDAHLHHLLXLAVTNMELDIDHLISQKQAH FT SSH*" XX SQ Sequence 2797 BP; 845 A; 526 C; 594 G; 817 T; 15 other; cagtgttaat gcaaatdgtc agcgctcagt gttaatgcaa attgtcagtg gtcagtgtta 60 ttgcaaccat tgtcagtagt cagtgttaat gcaaatggtc agtgctcagt gttattcgct 120 tgggggcccc aaactggtaa tctgcctagg gccccatggg aacttaattt ggctctgcag 180 agagtgaagg agtaggaahc ccaatttaat tgacagtaag tgcatttata ttctgattgc 240 tattcggttg tgtatgatgt tdtatgttgt gtvctgtgta agccctggtt cacactgttg 300 caatctctga gcavtgtgag tttagccata cgcttgtatg gctgaaatcg cattagattc 360 agaagaaaaa aggcatacgt gccttctttt ttccttcagt ggaatctgat cgcatgggtc 420 ttctcaccca tgtgatcaga tttgcctgtg cdagttcaca gatcdcagtg cagttcacac 480 agggtagtgt gaactggaaa gntggtggag gaactvgctc tgtaatcgtg ctagttcccv 540 caccdcacca gtgtaaacct gaggtaaaag hggagttcca ccaatatggg cactggtgag 600 gctgaaatga tgtggactgg caaggctgca ttgatgggca ctgatcagac tgcattgatg 660 gvcagtgcag tctatatgtc tctgtgtggg caaagttatt gctggtatat tgtttttgta 720 gtgctgtata tatatatagg tattttacta atagcaattt ggaatcccta ggaaacaatg 780 atatcaagaa agagaaaaat tgactcagag tgtaggatat tcaaagaaca gtggacttat 840 gattactttt tcatgcagta caaggaaaga gctgtgtgtt tgatatgcca gaatatagtg 900 tctgtgttca aagaatacta tttgcgtcga cactatcaaa ctcaacataa agataaatat 960 gattgtttgg tcagagaagt gagaaaagat aaaatattaa aactgaaaaa 
tacattgaca 1020 actcagcaaa atacttttgt gaagcagaag cagctaaata tttcatcact gcgagcaagt 1080 tttcaagttg ccaagctaat agcgtgcact ggcagaccat tcgtggaggg agaatttgtt 1140 aaagaatgcc ttctttctgt tgccaaagag atgtgtccag agaaggccga tttatttagt 1200 acagtgagtc tttcaggacc tacaattaca cgaaggattg aagaaatggg agacaatttg 1260 catcagcatt tgcaaaactc tgcaaaaaaa ctttcctatt tttccttggc actcgacaaa 1320 agtagtgatg ttcgtgattc tgcacaactt ctaattttta ttcgtgggat gaatgactat 1380 ttcgaagtca cagaagagct tgctgcactg caaagcatca aaggaacaac tacaggagag 1440 gatatctatg aaaaggtttg ccaaactgtg aatggtttgg agctggactg ggctaaacta 1500 gccagtgtga caactgatgg tgctcctagc atggtggggt ctaagaaagg agtaattgct 1560 cacattaacc aagagatgga caaacataac cattctcatc caatagccat acactgcctc 1620 atccaccaac aagtgctgtg tagtaaatca ctgaagtggg actctgtcat gaaaattgtg 1680 gtatcttgtg ttaacttcat tagagctaat gcactaaacc acagacaatt tcaggaattt 1740 ctgtctgagc taaatgttgc ctatgaagat gttctgtacc acacagaagt ctgttggctg 1800 agtcgaggga gagttttgaa acatttctat gacttacttc cacagattac agcttttctg 1860 ctttcaaaaa acaaagaagt accagagctc aatgatgcag aatggaaatg gcaccttgcc 1920 tttctgacag atgtaacaga gctactcaac agtttcaatg tgcaacttca aggaaagggg 1980 aagctcatct gtgatatgca atcacatgtg aaagcatttg aagtaaaatt aggcctcctc 2040 atcaaacaag tgaaggagga aaatttctgc catctcccca caactcaaaa tctgttagca 2100 gaaaaaccat tggttgcatt cccaaacaaa acatgtgtgg attcactgga aaagttgcaa 2160 aaggagttcc aatttagatt taaagagctt catctccatg aacaggacat acagcttttc 2220 cgtaacccat tttctattga cattgaaaat gtggatacaa tttaccaaat ggaactggct 2280 gaactgcaga attgtgactc tctgaaagac gcattcaagt caagcagcct tcctaatttc 2340 tatgcatctc tcccctctga gacatatcct aatctcagga accatgcact caaaatggca 2400 accatctttg gcagcactta tgtctgtgaa cagacttttt ccagaatgaa acatctgaaa 2460 tctccaacca gatctagact aactgatgca cacttgcatc acttgttang actagcagtg 2520 acaaatatgg aactggacat tgaccatctc attagccaaa agcaggccca tagttcccat 2580 tgaaatactc gaaagtttgt tgatttaact ttacttgttc ttcattttaa atattgtatt 2640 tgttcccgtt ttgttttttt tacttcaaaa taagatatgt gcagtgtgca taggaatttg 
2700 ttcatagttt tttttttttt taaactatag tccggccctc caatggtctg agggacagtg 2760 aactggcccc ctdtttaaaa agtattgagg acccctg 2797 // ID MER9a2 repbase; DNA; PRI; 513 BP. XX AC . XX DT 04-AUG-2008 (Rel. 14.02, Created) DT 04-AUG-2008 (Rel. 14.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Catarrhini. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW MER9a2. XX OS Catarrhini OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini. XX RN [1] RP 1-513 RA Smit A.F.; RT "MER9a2 - ERV2 Endogenous Retrovirus from Catarrhini."; RL Repbase Reports 9(2), 572-572 (2009). XX DR [1] (Consensus) XX CC Found a few times at orthologous sites in Rhesus, so active over CC the catarrhine hominoid split. XX SQ Sequence 513 BP; 125 A; 139 C; 116 G; 131 T; 2 other; tgttgggaac aggcccccca aaatctggcc ataaactggc cccaaaactg gccataaaca 60 aaatctctgc agcactgtga catgttcatg atggccataa cgcccacgct ggaaggttgt 120 gggtttaccg gaatgagggc aaggaacacc tggcccgccc agggcggaaa accgcttaaa 180 ggcgttctta agccacaaac aatagcatga gcgatctgtg ccttaaggac atgctcctgc 240 tgcagntaac tagcccaacc catcccttta nttcggccca tcccttcgtt tcccataagg 300 gatactttta gttaatctaa tatctataga aacaatgcta atgactggct tgctgttaat 360 aaatacgtgg gtaaatctct gttcggggct ctcagctctg aaggctgtga gacccctgat 420 ttcccacttc acacctctat atttctgtgt gtgtgtcttt aattcctcta gcgccgctgg 480 gttagggtct ccccgaccga gctggtctcg gca 513 // ID LTR1B0 repbase; DNA; PRI; 742 BP. XX AC . XX DT 18-FEB-2009 (Rel. 14.06, Created) DT 18-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; LTR1B0. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-742 RA Smit A.F.; RT "LTR1B0 - ERV1 Endogenous Retrovirus from primates."; RL Repbase Reports 9(6), 1172-1172 (2009). 
XX DR [1] (Consensus) XX CC Very close to LTR1B, but with among others an 80 bp deletion. CC 10% subst outside CpGs 25 copies. XX SQ Sequence 742 BP; 173 A; 244 C; 211 G; 114 T; 0 other; tgatacggac aggagacagg gaaatactgg gtagaagagg gcggttcccc ggcaaaggcc 60 ccaccctcaa gcctggagac ccgcggccct aaatgggaac aggcattcct gttttcgcgc 120 ccaaaaagtt gccttttggc ccgccacgcc ccctatcctg tacccatata aaccccgaac 180 cccaggctcc agaagcagac gagcagacga gcgaggagac aagcagacga acggcagaac 240 ggcgcggcag agaaagagag gaggaacgtc tgaacgccga gaggagttcg gctgggggcg 300 gtcggagagg agttcggccg ctggacggcc aaactccagg ggaagatcat cttcccactc 360 catcccccct tccggctccc catccatccc gctgagagcc acctccacca ctcaataaaa 420 cccccgcatt catccttcaa gtccgtgtgt gacccgattc ttccgggacg ctggacaaga 480 gctcgggata cagaaagctg tcacactggc cctctgccct tgcagaaagg cagagggtcc 540 actgagctgg ttaacactca agccgtccgc ggacggcaag gctaaaaggg cacactgtaa 600 cacacgccca cttgggctcc tgcacctgtc cgtctgcgtg ctccccctcc cgtaaggggt 660 ttgagcagcg gcggcgaccg aacaggcgag ccacacccct gtcgcacgtc ctgcgagggg 720 gtcagggaac tctcccgttt ca 742 // ID ERV1-4_TSy-I repbase; DNA; PRI; 5114 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Internal portion of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-4_TSy-LTR; ERV1-4_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-5114 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1200-1200 (2010). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX FH Key Location/Qualifiers FT CDS 1473..2804 FT /product="ERV1-4_TSy-I_1p" FT /translation="MLKEVFPSSRPNWDPNTSAGSKALDTYHWTLLAGLKG FT AAQKPINLSKTTEVLQGPEESPGAFLERLQEAFRTYTPFDPSAPENARAIN FT LAFVAQAALDIRKKLQKLEGFAGINISQLLEVAQKVFNNRDAEKQKQAVQA FT ADRAARRQAKILVTALRAPGTDSKPPQHQVYTVLDLKDAFFSIPLSPVSQP FT IFAFEWTDPDSGDTSQLTWTRLPQGFKNSPTLFGEALQQDLSGFRALHPEC FT TLLQYVDNLLLAAADLDTCLQGTKALLQLLQDLGYRVSAKKAQLCSTEVSY FT LGYRLKKGKRALTSARKEAILQIPTPTTKKQVREFLGAAGYCRLWIPGFAE FT IAKPLYTATAGDQPLNWTATEEQAFQQLKTALTQAPALALPDVTKPFHLFI FT HEQRGIAKGVLTQTLGPWRRTGGLLVKKTGPSGLRLALLSPGHCGHGHTS" FT CDS 2899..4056 FT /product="ERV1-4_TSy-I_2p" FT /translation="MSNARITQYQSLLLDQPRLTFSPTQCLNPATLLPDDN FT PETPVHDCEDVLDISQASRPDLRDTPLSQADADLFTDGSSLVIEGVRKAGA FT AVTTAHDVIWSQALPQGTSAQKAELIALTQALRWGTGKRVNIFTDSRYAFA FT TLHVHAMIYKERGLLTAAGKTIKNKEEILALLEAIWLPKEVAVIHCRGHQK FT SDTPEATGNRLADQAAREAAELPVEPLTILLTKTFPAPSLPKQPTYTQEEE FT EQADLAGAHKDSAGWWVKDSKIILPEALGHTLVKQLHSTTHLGGTKMTELL FT NHFYHINDLKSIAATAAAMCPACAQVNAKQGPRPPDGIRARGLSPGEKWEI FT DFTEIKPPHAGYDYLLVFVDTFSGWVEAFATKNETALRLQGFY" FT CDS 4218..4874 FT /product="ERV1-4_TSy-I_3p" FT /translation="MNRTLKSTLTKLIIETGEKWVNLLPLALLRTRCTPYK FT AGFTPYEIMFGRPPPLLPRLRDPILEEITNQSLLKYLQSLQQVQAAVHSGV FT RAVLPTPSSTPCHPFQPGDLVFVKKFQKEGLTPAWKGPFIVIITTPTALKV FT DGIPAWIHHTRVKRARETTLVPSTALRRQGTPNIHSLIQQGKTATVQPPGL FT QPRTSRTLSRSPFDVHLNRETPIDWLLIS" XX SQ Sequence 5114 BP; 1299 A; 1361 C; 1134 G; 1319 T; 1 other; tttggcagct ccagtgagaa tttttccacc agcacaagca gccgcctcac tccgaccctc 60 cccagctggg aggagagatg taggggaggt gcctcctgag attctcaggc cccttaacct 120 ctcatctggc tctgaggggg ttggaacgac tgctggcagc tactgcctca aaagaacctg 180 agtccatcta caatctccta ggaggtaaga atttattttt ggttttctgt ttaattgctt 240 gtgaccatgg tttctgtttc tgtttctgtt taagggagtc atgacctgac gaggttgccc 300 tttggacact attcattagg ccaagacgtt gactgaatag agtcctctgg ttttctgccc 360 tgaggttgtt ccagggatcc gttttctgcc ctgaggttgt tccagggatc cgtaactctt 420 ctgacttggt ccagacgtgc ccgtctgtca ttgtgtttta attgtttgtt tgttgtttgc 480 tcattgtttg 
ttatgggttc acagaattca tgacctgtca gtcttctgcc taatgaggcc 540 actccttaga cactgttcgt taagccaaaa cgttggctaa acagagtctc agtttagtat 600 ctggtttctg tttgggtacc cgctttgact tctaactgcc tatggtattt ctatatccct 660 gtcatccttg gcctctttcc ctgctttgca ttcttgctgt gctctttgcc acagggcttg 720 cttctgttgc actttcaaac tggtcagcag gccagaaact tctattagcc tttgcttatc 780 ttctcgtggt tggaggtctg atcctcatta ggtgcctcct gtaatttact ggttttgaca 840 atatggggtc acgggcttct aagccctccg gtcccctgga ctgtataata aggaatttct 900 ctgtcgggtt tgtggaggac tacagtgtcc atctctccaa aggcagactc tggactctct 960 gtgaaaacaa ttggcctgcg tttaatgtta aatagccctc cacagggtct ctggatgttc 1020 atgttactta gtctgtctgg aaggtcatta cagggacccc tggtcatcct gatcagtttc 1080 cctatatcaa ccaatggtta aatttagtcc aaaacccccc tccatggtta aaaatatgtt 1140 ttcttaacta gaacccttct gtaaaccata ctatacggag acaaagcttc taacttaaaa 1200 acaacagggt gctatataca aaggcccctg gctaactctt caaggaaaac ttgttttacc 1260 tgtttctcaa gagaatgaaa tcttgccaaa atcgccatgg ttttaacctc cctggtaaaa 1320 tcaatcatgc gcattcatct tcccacctgg gataattgcc agcagctcct ttttaaccct 1380 atctacttct aaaaagccag aagcgcatta aaactgagac taaaaagtct gtcctggcct 1440 cagcagctgg tacagaggag gagagtcaag aaatgttaaa agaggttttt ccctcctctc 1500 gccctaactg ggaccccaac acctcggcgg ggtccaaggc actagatacc tatcactgga 1560 ctctattagc wggtttgaag ggggcagccc aaaaacctat caatctttct aaaactaccg 1620 aggttctcca gggacccgag gaatctccag gagcattcct tgagcgcctc caagaggcat 1680 ttcggactta cacccctttt gatccctcgg ctcccgagaa tgctcgagct attaatttgg 1740 catttgtggc tcaggcagct ctggacatcc gaaaaaagtt gcagaagctg gaaggctttg 1800 caggaataaa tatttctcag ctgcttgaag tagctcagaa agtttttaac aaccgagatg 1860 ctgaaaaaca gaaacaggca gtacaggcag ccgatagagc tgcaagaaga caggccaaaa 1920 tcctggtaac ggccctccga gccccgggaa ctgattccaa gcctcctcaa catcaggtat 1980 atactgtact ggatttaaag gatgccttct tttctattcc tttgtcccct gtcagtcaac 2040 ccatctttgc ttttgaatgg acagatcctg actcagggga caccagtcag ctaacttgga 2100 cccggttacc acaaggtttc aagaactccc caactctttt cggggaagcc ttgcaacagg 2160 atttgtcagg gttccgtgcc 
ctacaccctg aatgtacttt actccagtat gtagataatc 2220 tcttgcttgc ggctgcagac ttggacactt gtttacaagg tactaaggct ctgttacagc 2280 tcctccaaga cttgggctac cgtgtgtcag cgaaaaaggc acagctctgc tcaaccgaag 2340 tctcctacct gggataccgg ctaaaaaaag gtaaacgagc cctgacctcg gcccggaaag 2400 aagccattct tcaaattccc acgccaacca caaagaaaca ggtacgggaa ttcctggggg 2460 cggcagggta ttgccggctg tggatcccag gatttgctga aattgccaag cccctgtaca 2520 cagccacggc aggggatcag cccctaaact ggactgctac tgaggaacaa gcgttccagc 2580 agttaaaaac ggctctgacc caagccccgg ctctagccct cccagatgtt accaagcctt 2640 tccacttgtt cattcacgaa caacgaggaa ttgcaaaggg ggtcctcact cagaccctag 2700 gaccctggag gagaaccggt ggcttacttg tcaaaaagac tggacccagt ggcctccggc 2760 tggccctctt gtctccgggc cattgcggcc acggccatac tagttaagga agcagataaa 2820 ttgactttcg gccaaaacct ggctttgact gttccccatg ccgttgaaac tctgctccgg 2880 ggagcctctg gccgctggat gtcaaacgct cgtatcaccc agtatcagag tctcttgctt 2940 gaccaaccta gactaacttt ctctcctact cagtgcctca atccggccac cctgctgccc 3000 gatgacaatc cggaaacgcc agttcatgac tgcgaagacg tgttggacat ctctcaggcc 3060 agccggcccg atcttcggga cactccctta agccaggctg atgctgacct gttcacggat 3120 ggtagcagcc tggtaataga gggagtccgc aaggctggtg cagccgtaac cacggctcac 3180 gacgtaatct ggtcacaggc gttgcctcaa ggtacctcag ctcaaaaggc tgagttaatt 3240 gctttaactc aggccctgcg ctggggaaca ggaaagcggg taaacatctt caccgacagc 3300 cggtatgctt ttgctacact ccatgtgcat gccatgatct acaaggaacg aggtcttctt 3360 accgcggcgg gtaagactat aaaaaacaag gaagaaatct tagccctcct agaggctata 3420 tggctcccca aagaggtagc cgtcattcat tgcaggggac accagaaatc agacactcca 3480 gaagcaactg gaaaccgcct ggcggaccag gcggctcgag aggcagcaga actcccagtt 3540 gaacccctga ctatcttgct aactaaaact tttccagctc cttctctccc aaaacagcct 3600 acctataccc aggaagaaga ggaacaggcg gacctagcgg gagctcacaa agactccgcc 3660 ggatggtggg tcaaggactc taaaattatc ctgcctgaag ccctgggaca caccttggta 3720 aaacaactac attctactac ccacttggga ggtactaaaa tgactgagct gttaaatcat 3780 ttttatcata ttaatgactt aaaaagcata gccgcaacgg cagccgctat gtgccctgcc 3840 tgtgcccaag tcaatgcaaa acaagggccc 
agacctccgg acggtatcag ggctcgagga 3900 ttaagtcccg gagaaaaatg ggaaatagac ttcactgaaa taaaaccccc tcatgcgggg 3960 tacgactatc tgctagtatt tgtagatact ttttctggat gggtagaggc ctttgccacc 4020 aaaaatgaaa ccgcactacg gttacaaggt ttttattaaa tgaaattatt cctaggtttg 4080 ggctccccct ctctattggc tctgacaacg gtcctgcctt cacctcgtcc atagtgcagc 4140 tggttagtaa ggcactaaac attaactgga aattacattg tgcttaccga ccacaaagct 4200 ctggacaggt agaacgcatg aaccgaactc taaaatctac tttaaccaaa ttaataatag 4260 agaccggtga aaaatgggta aaccttttgc ctttagctct tctccggacc cgatgcaccc 4320 catataaggc tgggtttact ccatatgaaa tcatgtttgg aagaccccct cctctgttac 4380 cccggttgcg ggatcccatt ctagaagaaa taactaatca atctctgtta aagtatctcc 4440 agtccctcca acaggtccag gctgctgtcc actccggcgt tcgtgctgtt ttgcccactc 4500 cgtcctccac gccttgtcat ccgttccagc ccggagacct ggttttcgtc aaaaagttcc 4560 agaaagaagg actgactcct gcctggaagg gaccctttat cgtcatcatc accactccaa 4620 cagcactcaa ggtcgatggg ataccagcct ggattcacca cacccgagtc aagagggcca 4680 gggaaacgac actcgttcca tcgactgcgc tgaggcggca agggacccca aatatccact 4740 ccctcattca acaaggaaag acagccaccg ttcagccacc tggactgcaa ccaaggactt 4800 caagaaccct ctcaaggtca cctttcgatg tacatctaaa ccgtgagact ccaatagact 4860 ggttgctaat ttcttgaatg cattgtctca acctttaact cttgtttttg taatcccact 4920 tgctttggct tttggcgcag gattgcttac tgtagctcct gcagattgga aatgtggcaa 4980 aaaggctctg ttaatgctat cttatctcct atgcgtaaat tctgtaaaac ttatgcatgt 5040 aaaaacttac tataattcca tacccatgac cgatgagtca acgaattgac tcatgtacaa 5100 ctaaaagggg ggaa 5114 // ID LTR3_Cja repbase; DNA; PRI; 418 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR3_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-418 RA Jurka J. 
and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2914-2914 (2009). XX DR [1] (Consensus) XX CC ~93% identical to consensus. 4bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. XX SQ Sequence 418 BP; 106 A; 89 C; 121 G; 102 T; 0 other; tgttgtacca gagcgagcat agaaagtcaa caccaagaca aaagtatgcc aaattgcagg 60 gtctttattg ccggcggcca cggaggactc acgtctctcc aacccgtggc cccgaaagga 120 gggtgtcaag accttttatg cccgtaaacc acctcctggg cgggggtgag ggtgaggggc 180 ttcggacatt ccgattgtta cttaagtggt tcacagaagt caagataagc gttagtttac 240 acattgcctg ggtagggtgg aaattacaga ccttagttac ttattacaag tcgcaggggc 300 caagatggcg tgggtttgga ctgcttgcct gtcacatcag ggttacagag agcagtttca 360 acagaaagcc gaggtatggt tgaggtatgc agttttatgg ggcttttacc atgtcaca 418 // ID PTERV2b_LTR repbase; DNA; PRI; 491 BP. XX AC . XX DT 14-MAR-2006 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from Pan DE troglodytes. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW PTERV2b_LTR. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-491 RA Smit A.F.; RT "PTERV2b_LTR - ERV1 Endogenous Retrovirus from Pan troglodytes."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC LTR1b/ptervc lib20040702 3 copies in panTro2, 2 of which are CC full ERVs. 
XX SQ Sequence 491 BP; 134 A; 140 C; 106 G; 111 T; 0 other; tgagaaacca tagagaaaac atagccacta aaacaccaga tggccaggag tcagggtgcg 60 tcaatggcta tcaaaaacac cagatgccca gggtaagaca gggagccccc cccccccccc 120 agtgttcccg gaactaaagc agaaaaacat ttccaggaaa tacctcctcc cgcctataag 180 ccccctgacc aaccagtatg agacagctag ctcagaccat aattagaacc aatcagttac 240 ttccaaacct cgcgccctaa aactgttcaa atgtgtaacc caatcagctt attgtaacct 300 ccccctgttt gaatttgtgg tttttgcctt tataagcagt ctgtaacagt cattcggggt 360 ctcctggcct catgtgctgg ggaccctagc gcgctagtaa taaatagtgt ctctttgctg 420 tgatctccgt gtcgagtggt ctctggcggc ggccccgtcc cgaagcagga atcttgagta 480 aggttccaac a 491 // ID LTR22_Cja repbase; DNA; PRI; 567 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR22_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-567 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2929-2929 (2009). XX DR [1] (Consensus) XX CC ~89% identical to consensus. 6bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. 
XX SQ Sequence 567 BP; 143 A; 121 C; 150 G; 153 T; 0 other; tgtgggaatt cactcagggt ggtggcaaaa atattaaggg aaaaatatta gagaaagtta 60 tagaatatag tctcaaaccc ttttggaagg cctaagggtt ttttctatag ttcttggctg 120 aaggcagcca gagtctcttt gcaggagcca gagagattag ggcgcaaata caaaggaatg 180 tgagtagttt acctagctag cttgtttact catgtggtct taaaactaac ctttgagccg 240 gatggccctc tcggggggag gtcgaccagg gatattaccc actaatggtg tttgctttgg 300 gcatcggaac ctgtccttta atctttaacc tctagtggtg ttgactcaag cctttgtcaa 360 ttaaacttta ctaaataaat gcgagtctca ctggctggtg ggggccggcg gtcgcaactg 420 tttacagcac tctccaggga gtctgtaagc ggccacggac cctcagccga actggcaaag 480 cataatatct gtgtgtcagt gtactttatt catccgtcgc tgagtcaggg tctgcaggac 540 agacccccgc aggtggtgac cccgaca 567 // ID CERV2_LTR repbase; DNA; PRI; 544 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 17-AUG-2004 (Rel. 9.07, Last updated, Version 1) XX DE A DNA sequence of chimpanzee endogenous retrovirus CERV2 - long DE terminal repeat. XX KW ERV2; Endogenous Retrovirus; Transposable Element; CERV2; KW CERV2_LTR; Chimpanzee endogenous retrovirus; KW Long terminal repeat. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-544 RA Skaletsky H., Hughes F.J. and Page C.D.; RT "Original sequence of an endogenous retrovirus CERV2."; RL Repbase Reports 4(7), 192-192 (2004). 
XX DR [1] (Consensus) XX SQ Sequence 544 BP; 150 A; 150 C; 120 G; 124 T; 0 other; tgagagaccc cagagaaaaa caccagatgg tcactagaaa aacaccggat ggccactaga 60 aaaacaccag atggccagga gtcagggtgt gtcaatggct atcaaagaca ccagatgccc 120 agggtagggc aaggagcccc tctccccccc agtgttccca gaactaaagc aggaaaacat 180 ttccaggaaa tacctcctgc cacctataag ccccctgacc aaccagcgtg agacagctag 240 ctcaggccat aattagaacc aaccagttac ttccaaacct cacgccctaa gacttttcaa 300 atgtgtgacc caatcagctt attgtaacct ccccctgttt gaatttgtaa cctccccctg 360 tttgaatttg tagtttttgc ctttataagc agtgtgtaac agtcattcgg ggtctcctgg 420 cctcatacgc tggggaccct agcgcgctag taataaagag tgtctctttg ctgtaatctc 480 cgtgtcgagt ggtctctggc ggcggcccta tcccgaagta agaatcttga gagaggttct 540 aaca 544 // ID LTR22C repbase; DNA; PRI; 509 BP. XX AC . XX DT 16-MAR-2006 (Rel. 11.03, Created) DT 10-APR-2007 (Rel. 11.03, Last updated, Version 1) XX DE Long terminal repeat of ERV2 Endogenous Retrovirus from primates. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; LTR22; LTR22C_LTR; LTR22C. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-509 RA Smit A.F.; RT "LTR22C - a subfamily of endogenous retroviruses from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC 6bp duplications, 9% divergence from the consensus. 
XX SQ Sequence 509 BP; 125 A; 110 C; 138 G; 127 T; 9 other; tgtaggggtt cggtcagggt ggtgggaaaa attataagaa gaaattatag gaaatagaca 60 caaaccttct tggaaggccg ggaggttttg caaaagcttc agkaawgggt ttggctgaag 120 gcagccwaat tctcttatcc ggagccwgag agcwwagggt agataacaag ggaatgtaaa 180 ggagtttatc tagataagct tgtttactca tgtggcccga aamctgacct ttaatcattc 240 gtgcgcagga ctgctctcta ctcggggggc ggccatgtta attacccaca agttgtgttg 300 actcaaagcc tttgtcatta aatctgtact aaataaatgc cmgcagcgcc ggcttgtcag 360 ggccacggct gctacaactc tttacagcac cttcctgggw gtctgtgagc ggcccggtcc 420 ctcagctgga ctggcaaagc agaatatctg tgtgtcagtg tactttattc atccgtcact 480 cggtcagggt ctgcgggtca gacccggca 509 // ID LTR7_TS repbase; DNA; PRI; 420 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV3-type endogenous retrovirus - DE consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR7_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-420 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1263-1263 (2010). XX DR [1] (Consensus) XX CC ~94% identical to consensus. 5bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 420 BP; 105 A; 94 C; 110 G; 111 T; 0 other; tgaagtaggg tccgagcgat taaatgacag cccaccgact caaattagca aaggagaggt 60 tttattagtca aggctcttgc tggccagcag ccgcttgcag ggaactagtt ccgagcaagg 120 gccccgagct gcaacacagg gctgtcttat gtagtttaga tttacatcct ggcgggctaa 180 agatagaact tgggacttcc cagggctctt atcgcaacat tctggaggcg tgggcaagca 240 agcaagttta cagaagcaga tgtagctggt tatgcaggtg cggcggataa gggtacaatc 300 aattgcgtaa tgtctgcaga gaaagtttcc aggaaaggag taatgcctgc ctcaaggtta 360 gagctattcc tgcttttccc ccttacactt ctttctttgt tctctttgtt ccttcttaca 420 // ID HERV-Fc1 repbase; DNA; PRI; 4629 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE ERV1 Endogenous Retrovirus from Hominidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; HERV-Fc1. XX OS Hominidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-4629 RA Smit A.F.; RT "HERV-Fc1 - ERV1 Endogenous Retrovirus from Hominidae."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC Apparently active after the split from orangutan (Bénit, Calteau CC & Heidmann, Virology 312 (2003) 159-165). Only contains an CC intact env ORF (2815-4569), so should have been non-autonomous. CC Remnants of gag 466-1649, pol 1865-2534. 
XX SQ Sequence 4629 BP; 1082 A; 1506 C; 959 G; 1082 T; 0 other; ttggtgccga aacccgggag gggacacctc ctaagccccc ccagaggctc ggggggactc 60 ccctcctggt cggatcaggc ctctccctca gtcaggtcag gcttctcctc cacggccatc 120 cgtccgtttc gtccggttac ttgccgccag gtcgcagctg ctgcagctac tccagtccaa 180 ttcggccgac gctgggtgag tacccctcct ttttcctttt gtccgttcct ccctggccga 240 gagtcatgcg cacgcccagg gagagtttcc ttcctcaagg gaaggccagt ccgggtcacc 300 aggtgaccca agtttacttc cccaggggaa gtccaaatcg gcactgacga ctcggagacg 360 tccgtgtctg aagtagccga tccgaggctc caggagccgc gtggtctgag tgaccccaga 420 aggacgcttc tgctgtccct cagaccgctg ccataaaggg aagaggatgg ggtccaccca 480 gtccaaaatc gcgcaaaaca cccccttagg gtgcctcctg cgcaacctcc caactttaca 540 actcgaccaa gatttaaaac gaaagcgact aattttcttc tgcacggttg cctggccgca 600 atatacctta gacaaccaat ctcgctggcc ccccgaaggc acactcgact tcaatatcct 660 aaacgacctt accaattttt gtcagaggcg aggcaaatag tcaaaaatca aatttgttca 720 aaggttctgg gacctccgct ctcgtcggac cgctgccgcc aagtgttttc gctggcgcaa 780 gtccctctgg ctagccttcc ccttgaagtc tggccagcct ctcttgccgt taatcctgtc 840 cgcggccccc atcttagtct ctccgccgcc atctccttct gcaccgccgc catcttacta 900 cctgcttcct caccgccgcc atcttacttc cttttttctc tgccgccatt ttagttcttc 960 tgccgccatt cggccgccgt tttaattccc gttagttccc atttgttctt ttaaccctgc 1020 ccacctaact ccttggcttc cgtcttaccc ccattcttat ttccacctcc ccgtagtgcc 1080 ataccagtcc actacatcta caactcctaa cacattcgct gcgggcagtg atatccgcta 1140 atcctggatg aggcagcgga gggcccccaa acccctatcc aggacttagt aaagctggcg 1200 ttcaaagttt ttaattcccg agaggaggcg gctgaggtac aacgacaggc aagactgaaa 1260 caaaaagttc agctccaaac ccaagccctg gcggctgccc tgcaaccggc attccctaag 1320 agccccggca ggagaggtag aggtacaatc tcccgggccc cgtctggcgc ctgcttcaag 1380 tgcggcaact cgggacactg ggccagccgg tgccctagcc aacagcaacc gtcctgcccg 1440 ccttgcaact gtttcaagtg cggcaatcca ggtcattggg caaaacagtg cccaaacccc 1500 aagccgccaa cgcgcccgtg ccctaactgc caacaaatgg ggcactggag gtcagactgc 1560 cccggcctcg gagcggccgc tgtgtctcca catggcgacc cctccccgga tggcgaaggt 1620 gccctccagc tcctccaact ggacgacgac 
tgaagaggcc cagactcggg aacccctctc 1680 acccttgccg agcccagggt aatgcttcag gtagcgggta agtccatttc ctttttgcta 1740 gacacaaggg ctacctactc tgttttgcca tcttttagcg ggcccagccg cccctcctca 1800 atctctgtta taaggattga tggcactccc tccacctacc gccagacgcc ttcactgccc 1860 tgccgcctag accactatat aactttcttg aacccataat ctaccatcct tccctttatt 1920 ccttactaaa gcaaatacat cgagttatct tcttacttta gtaaacactt tctcaggtta 1980 gattaaagcc tgccctacca ctcataaaac agcagaggta gtagcttcaa ccctcattga 2040 acagataatc ccgagatttg gcctgctttt atctccaaaa tagtcaaaca ggtgacaacc 2100 acacttggcg ttaactggaa gctacacact ccataccatc cgcagtcttc tggaaaagtg 2160 gaacgcgcca acggccttgt caaacaacac ctaatcaaat tggctctcga gacgcgccaa 2220 tcgtgggtaa ccttacttcc ctttgccctc gcgcggctcc gggcagcacc ccgaagcccc 2280 acaggcctta gcccctttga actcctatac gggcgcccct tcctctttca agagctccct 2340 gtgaataccc cacctcttgg cacgtacctg ccctacctca ccctgttaag ggagctgcta 2400 agagaacacg ccgaccgcag ccttccaaag cccggaccgc tcagcccaga cagtccggcc 2460 ataataaccc caggagatca ggtactagta aaagacctcc aggcaagagg tctctccccc 2520 cggtggaaag gcccctatac ggtaattctt acaacaccga cggcagctaa acttataggc 2580 cttccctcct ggtaccatat ttcccatctt aagagggcac ctacacaaca tcaggccact 2640 tggactgtca cctccctctc cccaaccaaa ctgaaactct ctaaatcaaa taccgcatga 2700 attctccgcg tgacaggctc caacaattta ttcaggttct tctcgaggaa agctggtcat 2760 tccctacttt tgctaacacc cttcgctggc ctgaaaatct gttgtcctat atagacgaac 2820 tggtgtggca aggctccctc cagaactttc accaacatga agttcgcttt ttttgtcttg 2880 tcactcttct tatcttgcac accagtcgta gtagacaagc cccctctcag actccctctc 2940 actgggtttt tttccctcac tgagaattgg agttccggac aggcagtctc ctctagacta 3000 gtagccacgg cagcatgccc gccagcaggg tgccaggcac ccatagcttt cctaggtcta 3060 aaattctctt ccctaggccc ggctagaaaa aaccctgcac tttgcttcct gtatgatcaa 3120 agtaactcca aatgcaatac cagctgggtc aaagaaaatg taggctgtcc gtggcactgg 3180 tgcaatatcc atgaggcatt aattcgtact gaaaaaggat ctgacccaat gttctatgtc 3240 aatacctcca ctggaggacg ggacggcttt aacggattta acctccaaat ctctgaccct 3300 tgggaccccc gctgggcctc cggtgtagat ggaggactat 
atgagcacag aacttttatg 3360 tatccagtag ctaagatccg cattgccagg acccttaaaa ccactgtcac agggttatcc 3420 gacttagcct cctcaatcca gtcagccgag aaagagcttg ccagccagct tcaaccggca 3480 gctgaccagg ccaagtcctc ccccttctcg tggttaactt taatctcaga aggtgcacaa 3540 ttgctccaat ccacaggggt acaaaacctc tcccactgct tcctctgtgc agccctcgga 3600 agacctccct tagtagcagt tcctctccct acccccttta attatacaag aaattcatcc 3660 acccctatac caccggtccc gaaaggacag gtcccactat tctcagaccc tacaagacac 3720 aagttcccgt tctgttactc taccccaaat gcctcttggt gtaaccagac caggatgctt 3780 accagcgccc cggcaccgcc cggaggctac ttctggtgta actccacgct aactaaagtt 3840 cttaactcaa ctggtaatca caccttgtgc ttacccgtct ctctcatccc tagcctgacc 3900 ctatatagtc aggacgaact tagccatctg ctagcctgga ccgagccaag gccgcgaaat 3960 aaaagcaaac gggctatttt cctaccctta gtactaggca tctccttagc ctcctcctta 4020 gtggcatcag ggctaggaaa aggagccctc acccactcaa tccaaacatc tcaagatctg 4080 tctactcgcc tgcagttggc cattgaggca tcggccgagt ccctggcctc cctacagcga 4140 cagatcactt cggtagcaca ggtcgcggca cagaacaggc gggccttaga cttgcttacg 4200 gcagaaaaag gagggacatg tttattctta caggaagaat gctgctatta tctcaacgag 4260 tcaggggtag ttgagaccaa tttacaaact ttaaaaaaaa aaaagatcca agaggagtta 4320 aaacactcct acgaccctct ccgccctggt ccttcttggt ggttttcacc cgtggttcaa 4380 cagatgctcc ctttccttat cccaattata attctctgta taataatgtg ttttgcccca 4440 atcctagtac agtttctccg ccagcggata caagaaatca ccagggtcgc tttcaaccag 4500 atgttacttc acccctacgt ccaactgcca acctctgatt taggccccct ccccaacgac 4560 gccccttaac agcaggaagt agccagacga tttcgtcgcc ccttttctat aaccaaaaag 4620 aggttggac 4629 // ID SAR repbase; DNA; PRI; 84 BP. XX AC X03461; X03462; XX DT 01-OCT-1995 (Rel. 1.09, Created) DT 04-AUG-2008 (Rel. 13.07, Last updated, Version 3) XX DE Human satellite I DNA. XX KW SAT; Satellite; Simple Repeat; SAR; Satellite repetitive element; KW simple sequence DNA. XX NM SAR. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 84-43 RA Prosser J., Frommer M., Paul C. 
and Vincent C.P.; RT "Sequence relationships of three human satellite DNAs."; RL J. Mol. Biol 187, 145-155 (1986). XX RN [2] RP 1-84 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR GenBank; X03461; Positions 1 17. XX CC Positions 1 to 17 repeat unit A CC Positions 18 to 42 repeat unit B CC [2]. XX SQ Sequence 84 BP; 26 A; 4 C; 12 G; 42 T; 0 other; acagtatata atatatattt tgggtacttt gatattttat gtacagtata taatatatat 60 tttgggtact ttgatatttt atgt 84 // ID LTR14_TS repbase; DNA; PRI; 471 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-471 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1271-1271 (2010). XX DR [1] (Consensus) XX CC ~92% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 471 BP; 126 A; 108 C; 100 G; 137 T; 0 other; tgttggagac cactgccctg cttgaactct gctgaaaaca tacttggcgt acgtgcaaag 60 tcacaaccag ccaagaatgc ctataatggc aaagccagat aggcatgtat gctgaggcaa 120 gttctaaaag tgcagtgtct gttgtgttag atagataaaa agttatacat tgtcttgcaa 180 gcagctgcac tatattttgt aaacagtcac gctttatcac aaaagctttc taagatcaaa 240 acagggagtt ttctttgcac tgtctgtact ttgtccttga aggcttactt aactacatca 300 ttttgtacta ataaaagtct ggtgggtaag aaactcgggg cctttgtcct gtgaggcaaa 360 gtgccccctg gttcccccac cttctagttc taaaagtgtc ttgtgaattg tgttataccg 420 cgccgttcct ctttcaggat cctttaccac ccagccacgc aggacgcagc a 471 // ID LTR1A_OG repbase; DNA; PRI; 787 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 05-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1A_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-787 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1667-1667 (2008). XX DR [1] (Consensus) XX CC 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 787 BP; 202 A; 213 C; 201 G; 171 T; 0 other; tgataccgga caagtggcgc cccagccagc tcagccgagc ggggctggag gacctgccgc 60 ctccgtcttg gcggtgttgg cggcagccgg cagaggaacc agcaggctgg agcaagcgga 120 tgggagcttc cctcccccac caaaggctcc agtagagcgg cgctagagga cctgccggca 180 cagaaaccag caggctggag caaaaggggg agcttccctc ccccaacgga ggctccagta 240 gccaaaagaa ccgaacacgt gcggggaaag acactggcgc cgtgctgtct cctcacgtgg 300 acttctggcg attcctccag tgcatccctg attggtccgt tttcaaaacc tactcatgat 360 tagtctattt tcaagaccca aaagctcatt ggacaactgc ccctatataa gccagagacc 420 ctgagcacag acttgggcag atctcaggag aaagagcaga agagcaaaga ggtttggctc 480 tgtaacactt gtatcctgct ctatgagatt tgcccctttg taagagctgt aaacactcgc 540 aaggaggttt gcccctttgt aagagctgta aacactcgca aggaggtttg ctcctttgta 600 agagctgtaa cactcgtatt ataaaaccta tcttactgct aatcctactc tacaaggagg 660 tttggcttgt tataagagct gtaacactta tacctgttga ataaaaccta tcctgtggtc 720 cgtcgcttca ttcttcgaac cgccgagacc aagagcctgg aaattcagta tcaaaaatcc 780 ggtatca 787 // ID LTR13C1a_OG repbase; DNA; PRI; 427 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR13C1a_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-427 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1589-1589 (2011). XX DR [1] (Consensus) XX CC ~86% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 427 BP; 83 A; 155 C; 92 G; 96 T; 1 other; tgtgagaacc agaccaacgc catcttgggg cgccgccatt ttgttgttta ctgctaagct 60 atgttcctcg cccctccacc cccccctccg ctggcagccc acaaaaccgc atcctgctgc 120 ctcctccacc cctggttgca agtagcttgg cttgcaaccg ttgctcaagc cacctgcgag 180 ctgccmaagg cctcagttac ccctagacca atcaagccac accaagtaga ccccccagat 240 accccccttc ttgccttctg taccccataa aatcccccgc catcttaggg tcggggccgc 300 tctatctcct gctgcgtcag gtgtggacgg cccagacccg agtctgcaat aaagtgcctc 360 ttgctgtttg catcggactc ggtctctgag tgttctgtgg ggaaaatccc aaattccggg 420 cacaaca 427 // ID ERV2-1_CJ-I repbase; DNA; PRI; 6406 BP. XX AC . XX DT 12-JAN-2011 (Rel. 16.02, Created) DT 12-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE Internal portion of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR7_Cja; KW ERV2-1_CJ-LTR; ERV2-1_CJ-I. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-6406 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 11(2), 789-789 (2011). XX DR [1] (Consensus) XX CC ~95% identical to consensus. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. CC We thank the Marmoset Genome Sequencing Consortium for making CC their data publicly available. 
XX FH Key Location/Qualifiers FT CDS 1460..2230 FT /product="ERV2-1_CJ-I_1p" FT /translation="MLAGEGNYATSDAQMQYDAGLYAQIQTAGTKAWRKLP FT TKGDVSASLTSIKQGPDEPFSEFVHRLITAAGRIFGNADAGTDFVKQLAFE FT NANAACQAAIRPYKKKTDLSGYIRLCSDIGSAYQQGLAMAAALQGYTVKQF FT LSNQSKNRCFQCGASGHYAKDCNADKKPSLNTKVPGLCPRCKRGKHWANEC FT KSKRDAQGQPLSPHQGNGMRGQPQAPKQKQIYGAVSFVPPTNNPFQASAEP FT QQEVQDWTSVPPPTQY" FT CDS 2242..2994 FT /product="ERV2-1_CJ-I_2p" FT /translation="MGVQALPTGIFGPLPEGSFGLILGRSGLTLQGLQVLP FT GVIDNDYTGEIKVMATSPKVISTVSPGQRFAQLLLLPLRKTSNRVVKKDRG FT NLGFGSSDVYWAEVISPDKPMMTLWLDGKEFTGLVDTGADVTIISKTQWPA FT TWPISATITHLTGIGQSKNPEKSTKMLKWEDKKGNQGQIQPYIVAGLPFNL FT WGRDLLAQIGMVIGSPNEIVAAQMLKHGYVPGTGLGKNQTGILSPIESKPK FT TDRKGVGHFQ" FT CDS 3243..5603 FT /product="ERV2-1_CJ-I_3p" FT /translation="MVLMGALQPGLPSPVAIPLNYYKIVIDLKDCFFTIPL FT HPDDQKRFAFSVPSTNFKEPMKRYHWKVLPQGMANSPTLCQSFVALAIQPV FT RDTWKEIYIIHYMDDILLAGKVGQDVLSCFKALQTSLTQHGLLIAPEKVQL FT TDPYTYLGFQLKGSQITTQKVQLRLDKLKTLNDFQKLLGDINWLLPHLKLC FT KADLKPLYDILNGDPNPTSPRELTSEGIQAIQTVEHAINHQTITFVDYSKP FT LQFIICQTSLSPTAVFWQTAPLMWVHLPNTPKRVLEPYYLMVANLIIQGRT FT MGKEYFGYEPSLIIQPYSKEQVQWLMQTTEAWPIALASFSGQLDNHYPPHK FT LITFANRHEFIFPKVTKPEPIPDGLVVFTDGSSTGIAAYVTQGHTVQFQTQ FT SSSAQLTELQAVIAVFSAFPNQPLNVYTDSAYIAQSIPNLETAPIIKHTSS FT AAKLFKQLQQFIISRTHPFFIGHLRAHSGLPGPLSQGNQQADLATRVFHTY FT ATEAKDPLSQAQHTHNLHHLNAHSLRLLHHITREQARQIVKECTQCATHLP FT VPHLGVNPRGLVPNALWQMDVTHVSEFGNLKYVHVCIDTCSGYVFATLQTG FT EATKHVIGHLLSAFATLGCPQKIKTDNGPGYTSTAFARFCSQLNIQHITGI FT PYNPQGQGIVERTHLTLKLTLQKIKKGEWYPIKGSPRNLMAHSLFILNFLT FT LDKNGHSAAERHWQSETQTKFASVMWKDPMTGSWHGPDPVLIWGRGSVCVY FT AKEADSARWLPERLVKQIDSKGTSGCEDNNPSQGMS" XX SQ Sequence 6406 BP; 1931 A; 1581 C; 1406 G; 1487 T; 1 other; aactggcgcc caacgtgaga gctcgagtac gagggataca gtgagggacg ccgtcgaggt 60 cagccggcac agaaggatta agaaacccgg aggaaagagc gtcgtgggag cctgccacgg 120 actcgcgctc cggtaagcgg acggcgaata tgggacagga aatcagccag catgagttgt 180 ttatagagag tttaagtaag gctttaaagg cgcgaggagt aaaggtgaaa ataaaacaat 240 tgtgtaaata ttttgactat attcaggatg 
tctgcccttg gtttccccag gaaggaacca 300 tagaccaaaa acggtggaaa agggtaggag atgctttgaa agattattac agtactttcg 360 gacccgaaaa ggtcccagtc acagcctttt cttattggaa cctgatagat gaaattttaa 420 ctcagagacg ccacgatccc ctcattgcca atgtaatctc ccagggagag tctctcctaa 480 aaggggaact cgataagagc aacctcggca gtgctaagag cccagatgac tccaagcaat 540 tgtctgtaaa tgaagaggcg cagtcgccac aagttaacga ggatgaggat ttgatgtcct 600 ttgatagtga ctccgagaag ggagcgcccg gagggtcaca gtgcccctcc gagagccaaa 660 atccgccttt agaaaaaacc aaaagagggc ctaaagggcg ttacccaggg gtccgacaga 720 cctaaagtta aatcttcaaa ccagaaaaag agacccgcct ggctaatgga aagggcaaaa 780 aattctgaag acgagggcat agatgaggga atagactggg gtgacttgga ggagaggcgg 840 cccggtataa aaaccaggac tggtcccccc atgggccgcg acctccgcct tacagggcgg 900 ggccctcggc acccccaata gcatgtcccg ttgttgatcc ccgaaaagaa ttacaagaaa 960 aaatcaattc cctaaaagaa cagatwaggc ttgaagtaga acatcaagag ctcattgaac 1020 agcttgaaaa aataaagata ggtaaagcca ggaatgaggc taagactcaa acacccaaaa 1080 cgtcatcctc cccgacgggg agcggcacta aggcaacctc cccagactaa ccccttgcta 1140 gaccttgatg atccgagcgc gttccccgtg acagaaacca ccgaccagca ggggaccgct 1200 tggaggcatc atgcggtttc gattttaaaa taatcaaaga actaaaaact gcggtggcac 1260 aatacggggc tactgctccg tacaccactg cgattttaga atctgtggcc gaaaattggc 1320 ttaccccagg ggattggcag acttggcccg agccaccctt tcagggggtg attatttgtt 1380 gtggaaatct gagttttatg aatgctgtaa ggaaactgcc cgccgcaacg cacaggctaa 1440 taataattgg aattttgaca tgctagctgg tgaaggcaat tacgctacca gcgacgcaca 1500 aatgcagtat gatgcgggtc tgtacgccca aattcaaact gcgggtacaa aggcctggcg 1560 gaagctgccc acaaaagggg atgttagtgc ctcccttacc agcataaaac aagggccaga 1620 tgaacctttt tcagaatttg tacatcgcct tattacagcg gcaggcagaa tttttggcaa 1680 tgcagatgcg ggcacagatt ttgttaaaca actagcattt gaaaatgcca atgctgcctg 1740 ccaagcggct atccgccctt ataaaaagaa aacagacctc tcagggtata tacgcctgtg 1800 ctcagacata ggctcagcct atcaacaggg actagctatg gctgcagccc tacagggcta 1860 tactgtgaaa cagttcttat ctaatcagtc aaaaaatagg tgctttcaat gtggcgccag 1920 tggtcactat gctaaggatt gtaatgctga taaaaaaccc tcccttaata 
ctaaagtccc 1980 tggcttgtgt ccacgatgca agagagggaa gcattgggcc aatgaatgta aatcgaaaag 2040 ggatgcccaa gggcaaccac tttccccaca tcagggaaac gggatgaggg gccagcccca 2100 ggcccccaag caaaaacaaa tctatggggc agtcagtttt gttcctccaa ccaacaatcc 2160 gtttcaagcc tctgccgagc cacaacagga agtgcaggat tggacctctg ttccacctcc 2220 cacacagtac taacacctga aatgggggtt caggcattgc ccacaggcat ctttgggccc 2280 ctgcctgagg gttcttttgg acttattcta gggcgaagtg gcctcaccct acaaggtctg 2340 caagtcctcc ccggagtaat agataatgat tacacaggag aaatcaaagt catggcaacc 2400 tcccctaaag tcataagtac tgtctcccct ggacagagat ttgcacagct cttgctctta 2460 cctctccgta aaaccagtaa tagagttgta aaaaaggaca gaggtaactt aggctttggt 2520 tcctctgacg tctattgggc agaagtgata tctcccgaca agcccatgat gacattatgg 2580 ctagatggaa aggagtttac cgggctagtg gataccggag cagatgtgac tataattagc 2640 aaaacccagt ggcctgccac atggccaata agtgctacta ttactcatct caccggtatt 2700 gggcagagca aaaatcccga gaaaagtaca aaaatgctaa aatgggaaga taagaaggga 2760 aatcaaggcc aaatccaacc ctatattgtt gcaggcctac ccttcaacct ttggggaagg 2820 gatttgctgg ctcagatagg tatggtcata ggtagcccta atgagatcgt cgccgcccag 2880 atgcttaaac acggatacgt gcccggcact gggttaggaa aaaaccaaac tggaatttta 2940 agccctatag aatctaaacc aaaaacagac aggaaaggag taggacattt tcagtagggg 3000 ccgcggtctg tcctgtagcc catgcagaca aaattacttg gttgtcagac aaccccatat 3060 gggttgacca atggcccctc tcttcagaaa aattagctgc cgcacaacaa ctggtacagg 3120 aacagttgca agcggggcat atagaaccta gcaattctcc ttggaacaca ccaatcttcg 3180 ttataaggaa aaagtcagga aaatggcgat tgctacagga tcttagagaa gtcaataaat 3240 ccatggtcct catgggggcc ctacaacctg ggctcccttc tcctgtcgcc atacccctaa 3300 actactataa gattgttatt gatttaaagg actgtttctt tactattcca ttacatccag 3360 atgatcaaaa acgttttgct ttcagtgtgc cctctaccaa ttttaaggaa cctatgaaaa 3420 gatatcactg gaaagtgtta cctcaaggca tggctaatag tccaaccttg tgccaatctt 3480 ttgttgcctt agccattcaa cctgttagag acacctggaa agagatctat attatccatt 3540 atatggatga tattctttta gcaggaaaag ttggacagga tgttttgtct tgttttaaag 3600 ctttgcaaac atctcttact caacatggtc tgttaatagc tcctgaaaag gtgcaactca 
3660 cagaccccta tacgtactta ggcttccagc taaaaggaag ccaaattact acgcaaaagg 3720 tacaactcag gctagataaa ctgaagacac tcaatgactt tcagaagtta ttgggagata 3780 ttaattggct actacctcat ttgaaactat gcaaagcaga tctcaaaccg ctttatgata 3840 ttctaaatgg ggacccaaac cccacatcac ctcgcgagtt aacctctgaa ggcatacagg 3900 ctattcaaac tgtagaacat gccattaatc atcaaaccat caccttcgta gattatagca 3960 agccactcca atttatcatt tgccaaacct cattatcccc taccgctgta ttttggcaaa 4020 ccgctccact catgtgggta cacctcccta acacgccaaa aagagtcctt gaaccatatt 4080 accttatggt agcaaatcta attatccaag gccgaacaat gggcaaggaa tattttggtt 4140 atgaaccttc cttaattatc cagccgtaca gtaaagaaca agttcagtgg ctcatgcaaa 4200 ctactgaggc ttggcctata gcattggctt ccttttcagg acaattagac aatcactatc 4260 cccctcataa gttgattact tttgccaata gacatgaatt tatctttcca aaagtcacaa 4320 aacctgagcc cattcccgat ggactcgtag tgttcactga tggttcatct acaggtatag 4380 ctgcctatgt cactcaaggc cataccgtgc agtttcaaac tcaatcatcc tcggctcagc 4440 ttactgagct gcaagcagtt atagcagtat tctctgcttt ccctaatcaa cccttaaacg 4500 tttacactga cagtgcatac attgctcaat ccatacctaa tttagagaca gctcccatca 4560 tcaaacatac ctcttcagct gcaaaattat tcaaacaatt acaacagttc atcatcagca 4620 gaactcatcc tttctttatt ggacacttgc gggcccactc aggattgcct ggaccactgt 4680 cacaaggcaa tcaacaagcc gacctagcca cccgcgtttt tcatacatat gccacagaag 4740 ctaaagaccc actatctcaa gctcaacaca cacacaacct acaccacctt aatgctcact 4800 cccttagatt attacatcat attaccaggg aacaagctag acaaatagtt aaggaatgca 4860 cacaatgtgc cacccattta cccgtgcccc atttgggggt caaccctagg ggcctagtcc 4920 ccaatgcact ctggcagatg gatgtaaccc atgtttcaga atttggcaat ctcaaatatg 4980 ttcatgtgtg cattgatacc tgcagtggat atgtgtttgc cacactccaa acaggagaag 5040 ccactaaaca cgtcattggg cacttactgt ctgcttttgc cacccttggc tgtccacaaa 5100 aaattaagac cgataatgga ccagggtata ccagcacagc ctttgcccgg ttctgctctc 5160 aattaaacat acagcatata actggcatcc cctataatcc tcaaggccaa gggatagttg 5220 agagaaccca tctcacctta aaacttaccc ttcaaaaaat aaaaaagggg gaatggtacc 5280 ccatcaaggg gtcccccaga aacctcatgg cacattcact atttatactc aattttttaa 5340 
ctttagacaa aaatggacac tccgctgcag aacgccattg gcaatcagaa actcaaacga 5400 aattcgcatc cgtgatgtgg aaggatccta tgactggctc ctggcacgga cctgaccccg 5460 tattgatttg gggtagagga tcagtctgtg tatatgcaaa agaagcagat tctgccagat 5520 ggctaccaga aagattagtg aaacaaatag acagtaaagg aacatcagga tgtgaggata 5580 ataacccctc ccagggaatg tcttaagccc tgaggattat tccctgttcc ccctctttcg 5640 atcttcagtc agaaacatgt ggaccttaat attgatatcc ctgtaatcac tctcatacct 5700 aagaccgaca gcggtttcga tcacaatatg atgtaagatc agcttgactc cctagccgaa 5760 gtagtcctgc aaaaccgacg aggcctagat ctgctcaccg cccagaaagg aggactgtgc 5820 cttgcattag atgaaaaatg ctgcttctat gccaaccgtt ctggtgttgt cagagataag 5880 ataaaagccc tgcaagaaga cctggcagaa agaagaaagg cactcttcga taatccactg 5940 tggggagggt tgaatgggtt cctcccctat ctcctccccc tcctcggccc ttttgtcggg 6000 ttcctcctac tcctcactgt tggacctctt atattcaaca aggtcatggc ttttgtcaaa 6060 caacagattg acgccataaa aatgcagccc ctacaggtcc attaccataa gctggaaatg 6120 gccgaccggg aaatagagat ggatcaaggc tacggtggaa gctcgataag caccgccaac 6180 cgtctctgac attgcctggt gagatggacc agttagccaa tgacgggcaa cctgagagcc 6240 tagcagtaag atgctcccgc ctaagacagg ggaccgggat acgaatcccg ctcaccctat 6300 gacgggtaag ggtaaaaagc atcactgcag gtgtgtctct tgacctaaga caggcatggt 6360 ccccacaggc agtcggccaa acgaaataaa ataaaaaagg gggacc 6406 // ID LTR12_Mim repbase; DNA; PRI; 430 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR12_Mim. XX NM LTR12_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-430 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2968-2968 (2009). XX DR [1] (Consensus) XX CC >94% identical to consensus. 4bp tsd. 
CC Similarity to LTR12_Vpa from alpaca. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 430 BP; 118 A; 97 C; 99 G; 116 T; 0 other; tatgttccgg caggtgtgaa cagctcttgg ccgaagaaaa acccgagcgg cacacggaga 60 gttggagagt cagctttatt tcgccggcgg gctcagagag gcatatctgc caccaaactc 120 tgagcgcccc tttttcgttt tcttttagtt ttataccttt ttgggggtta cagttagcca 180 atggcaagtt ttcacaaaag tcacctcatt tacatagtag tcagccaatc agaagtatgt 240 cccaaaagtt acttcattta catagtagtc agccaatcag aagtatgtcc caaaagttac 300 ttcatttaca tggtagtcag ccaatcagaa gtatgacccc aaatcacctc atcagctgtg 360 agtattagta gctgctcaat ttaggagcgg gataagcgaa acaggcgggt tttggtggaa 420 cgccctgaca 430 // ID LTR5B1_Cja repbase; DNA; PRI; 522 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR5B1_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-522 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2916-2916 (2009). XX DR [1] (Consensus) XX CC >93% identical to consensus. 5bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. 
XX SQ Sequence 522 BP; 112 A; 198 C; 95 G; 117 T; 0 other; tgtaaggccg gttgcctcgc cgggggccaa atggacacac ccattcctct ccgtacctga 60 aaactacgtc atcattctga gcgcattact cagcatctgg cctctccacg cgggctcccc 120 acacccattt ccgcattcca atctcagtcc tgcactcatc agcaaactag gcactcggcc 180 ccgccgaaat ttaaaaaatc ccgccaagac ctgccctatg gcagcttgcc ccgtacgtcc 240 tcagccagcc cacttgccag cagccccgcc tggctgccaa tggcggctcg cctcgtccgt 300 gcccagtccc ttcataacca atgagactac ctccgctttg tttcccccgc agccaatcaa 360 ctacctccac tccctataca accccacgcc ctaagaagag aagcgcgact tctctggccc 420 ctttcccgga ccatagaacc tcgcccggga gctgaataaa ttggctttta cttttttact 480 ccggtctccg tttcctcatt taactcggca ataaacctta ca 522 // ID PTERV2b repbase; DNA; PRI; 7744 BP. XX AC . XX DT 14-MAR-2006 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Pan troglodytes. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; PTERV2b. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-7744 RA Smit A.F.; RT "PTERV2b - ERV1 Endogenous Retrovirus from Pan troglodytes."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC Also with PtERV2c LTRs. High diverged matches as if third sub CC there (too few matches right now) Closest match outside chimp to CC BaEV, but quite distant lib20040702. 
XX SQ Sequence 7744 BP; 1922 A; 2262 C; 1910 G; 1624 T; 26 other; tttgggggct cgtccgggat tggtcaccgc ttgacgaccc tcgacccgca cggcgaagac 60 ccactcggcc cgggccatcg acggacgacc aggcaccttc aggtactttc gttttgtttt 120 gttcgtcttg cttgtctgtt tgagttgccg gctaatttgg ttgagtactc agtctgaata 180 agtgtggaag gggagcagac gtgctcggca ccttcccact tgcgccccgg gggacgccct 240 ggtggttgtc tggaggagaa ttgacgaccc cgtcaatctc ctcgcctctg taggcaggtt 300 cttcctgcca tctgaatccg tagaccgccc tggcggtcgt ctggaggaaa actgacgatt 360 ccgtcagtct cctcgcctct gtaggccggt tttcccggcc atctgaatcc ttcgtagaac 420 tgtggcgccg atctttggcc gcgcggcttc tgtgtgtgtg tatactcgtt ggttttcctg 480 ctgtcttgtt tgtgtgtgtg ttagaaattg aggacacgac tatgggacag actctgacga 540 ctcctctatc tttgactctt gctcactttc ctgacgtaag ggcgcgagcc cacaacctct 600 ctgtagaaat tcgcaagggg cgatggaaaa ccttttgctc ctccgaatgg cccaccctcg 660 gtgtagggtg gccccagggc ggaacttttg acctctcgat tatcttgcag gttaaaacaa 720 aggtgatgga cccagggccc agcggccacc ctgaccaggt ggcctatatc ctcacctggg 780 aagacctggt tcgggaccca ccctcatggg tgaagccctt cctcccctcg gcttccccct 840 cccagtcgac cctcctcgcc gtggaaaccc cccgaaacca gaccccggtc cctttgaagc 900 ctgtcctccc ggatgagagt cagaaggatc tcctcctcct agaccctctc cctcctccgc 960 ctctcaatcc cctccttaat ccccctcctt actccacccc ttcggcgcca cccctttctc 1020 cttccacccc ttcgacccct tccacctcct tgtatccaac tctttcccca acctcttctt 1080 ccaccccctc tctgacccca anccccacgc cggcaccacc tgacctcacc cctcagaccc 1140 cncctcngac accccgcctc cgcttgcggg ggggcaaagg accctggcga ccagtccacc 1200 tggcaggcct cgctttttcc cctccgcacc gtgaatcgca cggtccagta ttggcccttc 1260 tcggcctcag atctctacaa ttggaagacc cataaccctc ctttctccca agatccgcag 1320 gccctgaccg ctctgataga atctatcctc ctcactcacc aacccacctg ggatgactgt 1380 caacagctct tgcaggttct tctgaccaca gaagaaaggc agcgagtcct cctcgaggcc 1440 cggaaacatg tgccagggcc aggaggactc ccgacccagc tccccaacga gatggatgag 1500 ggatttcccc tcacccgccc ggattgggac tatgaaacag catcaggtag ggagagtctc 1560 cgaatctatc gccaggctct gttggcaggt ctcaaagggg ccggaaagcg ccccaccaat 1620 ttggccaagg taagaactat tactcaggaa 
aagaatgaaa gcccggcggc attcatggag 1680 aggctcctag aggggtttcg aatgtacact ccattcgatc cagaagcccc ggagcacaag 1740 gccaccgtgg ctatgtcatt catagatcag gcagcactag acataaagag caagctccag 1800 agattggacg ggattcagac ttatgggctg caagaactag ttagagaagc agagaaagtt 1860 tataacaaaa gagaaactac tgaggagaaa gaagctaggc tagcaaagga acaggaggaa 1920 cgagaagatc gangagatcg taagagggat agacatttga ctaaaatcct ggcggcagta 1980 gtgacaggga aagggccagg gagaagagcc agggagagag gggggagagc gaaggcgccc 2040 gaaggtggat aaggaccaat gtgcctactg caaagaacga ggacattggg tcaaagactg 2100 tcctaaacgt cctaaggacc ggaagaaacc caccccggtc ctgaccctgg gagaggacag 2160 cgattagggg cgtcagggct ccgaagcccc ccccgagccc cggataaccc tttctgtagg 2220 ggggcgcccc accacctttc tagtagacac cggggcccaa cattcagttt tgacaacagc 2280 agacgggccc ctttcatccc gcacctcttg ggtccaagga gcaacaggag gaaagctgca 2340 caagtggacc acccaccgaa cagtaaacct tggaaaaggt atggtgactc attctttctt 2400 agtagtacct gaatgcccct atccccttct gggacgggat ctgttgacca agctcggagc 2460 ccagatacac ttctcggaaa gaggggccca ggtaatggac gaggatggtc agcctatcca 2520 gattttgact gtgtccttgc aagatgagta tcggctcttt gagaccccta tcctcaccag 2580 cccttctgat aattggctac aagaatttcc ccaagcttgg gcagagacag ggggacttgg 2640 actggcaaaa tttcaggctc caattatagt tgacctcaaa cccaccgcgg tgcccgtgtc 2700 tattaggcaa taccccatga gccgagaagc ccgtatgggc atccagcagc atattaataa 2760 atttctagaa ttaggagtct tgcggccatg tcgctcacct tggaacacgc cgctccttcc 2820 agtaaagaaa cctgggaccc aagattatag gcccgtccaa gacttaagag agattaataa 2880 gaggactatg gacatacatc ccacagtccc caacccttac aacctgctca gctccttgag 2940 accagatcac aactggtata cagtgctaga cctaaaagat gcattctttt gcttgcctgt 3000 ggctccccaa agccaagagc tttttgcctt tgaatggaga gaccctgaga agggaatctc 3060 aggccagcta acttggactc ggcttcccca agggttcaag aattccccta ccctctttga 3120 cgaggccctt caccgagact tgactgactt tcgcacccag cacccagatt tgactctgct 3180 ccagtatgta gatgacctcc tcctggccgc ccccactaaa gaagcctgtc tacaaggcac 3240 caggcacctg ctccaggagc tcggagacaa aggataccga gcatctgcca agaaagcaca 3300 aatctgccag actaaggtaa cctacttggg gtacatctta 
agtgaaggga aaagatggct 3360 cactcctggg cggatagaga cagtagcccg cattccgcca ccccggaacc ccaaggaggt 3420 gcgtgagttt ctggggactg ccgggttctg ccgcctgtgg atacccggtt ttgctgagtt 3480 ggcagccccc ctttatgccc tcaccaaagg gagcaacccc tttatctggc tggaggaaca 3540 ccaacaggcc ttcgaagctc tgaagaaggc actcctctct gccccagccc tcgggctacc 3600 tgacacatcc aagcctttta ccctctttgt agatgagaga agggggatag ccaagggggt 3660 cctaacccaa aaactggggc catggaagag accagtagcc tacctgtcta agaaactgga 3720 ccctgtggcg gctaggtggc ctccttgcct ccgcatcatg gcagccaccg ctatgctagt 3780 caaagactct gccaagttga ctcttgggca acctttgact gtcattaccc cgcacgcctt 3840 ggaggccata gtgcggcagc ccccggaccg ttggatcacc aacgcccgcc tgacccacta 3900 ccaagccctc ctgctggacg cggaccgcgt cagctttggc cctccggtca ctctgaatcc 3960 tgccaccttg ctacctgtac cggaagaccc gctgagtccc cacgactgtc gacaagtgct 4020 ggcggagacc cacgggactc gagaagacct ccaggactac gaactcccag acgcggacca 4080 cacttggtac acagacggta gcagcttcat ggacgcaggt acccggaggg cgggggcggc 4140 ngtagtggat ggacatgcta cgatatgggc gcaggcactg cctcccggaa cttctgctca 4200 aaggctgagc taattgctct aacaaaggcc ttagagctat cgcaggggaa gaaggctaac 4260 atctacacag acagtcggta tgcctttgcc acagcccaca ctcatgggag catttacgag 4320 aggcgaggtc tcctaacatc agaagggaaa gaaatcaaga ataaggctga aataatcgcc 4380 ttattaaagg ccctcttcct ccctaagaag gtggccataa ttcattgtcc tggacatcaa 4440 aaaggacatg accctgtcgc ccagggtaac aggcaagctg accaagcagc caagcaggct 4500 gctagagtag agacattgac attagtttca gaaaccagca aggctgacca gatgccccct 4560 cccacaagct atacctatac accagaggac cagaaagagg cagtagcctt aggagccaca 4620 gagaaccaag agactaaaaa ttgggaaaaa gacgggaaga cagtcctccc acaaaaagag 4680 gccatggcca tgctgcaaca gatgcacgct tggacacatt taagtagcaa gaaactgaga 4740 ctgctcattg aaaagactga cttcttaatc cccagggttg gtacccttct ggagcaagtg 4800 acgctcgctt gcaaggcctg tcagcaagta aacgccgggg ccacgcgagt cccggcgggg 4860 ataagggcac ggggcaaccg ccctgggacc tattgggaag tggacttcac tgaagtaaag 4920 cctcactgtg gaggatataa atatttatta gtatttgtag acacattttc aggatgggta 4980 gaagccttcc ccacccggca agaaacggcc cacatagtgg ccaagaaaat 
attagaagaa 5040 attttcccca ggttcggact ccccaaggta atcgggtcag acaatgggcc agccttcgtc 5100 tcccaggtaa gtcagggact tgccaggata ctggggatta attggaaact tcattgtgcn 5160 tacagacccc agagttcagg gcaggtagaa nggatgaata ggactattaa agagacctta 5220 acaaaattga ccttagagac tggcttaaaa gattggagac gtctcctgtc cctagctctt 5280 ttgagagccc gaaatacgcc taatcgcttt gggctcaccc cttatgagat cctctacgga 5340 ggaccacctc ccttgtcaac cttacttgat tcttactccc cctctgacct taagactgac 5400 ttgcaggctc gcctgaaagg actgcaggca gtgcaagccc aaatttggac tcctctggca 5460 gaactgtacc agcctggaca cccacagacc agtcaccctt tccaagtggg agactccgtc 5520 tatgtcagac gacatcgctc ccaaggacta gaaccccggt ggaagggacc atacatcgtc 5580 ctcctcacca cacccactgc tgtaaaagtt gacggggtcg ccgcctggat ccacgcatcc 5640 cacgtgaaag cagctccgga ggtgtccgga ccagcgtcgc ctgagagatg gagacttcgt 5700 cgctccgggg accccctcaa gataagactc tctcgcgtct aaccctttgc ctattgttag 5760 cccttttcct tccctgtgtt accgggggca gcaaccccca tcagccctat cgattgactt 5820 ggcaaataac taattttgaa acccatgaag tcctcaacga gacttcacac gtagcccctt 5880 tgaatacctg gttccctgac ctatatttta acttagacaa aatagccctg accgataaga 5940 tggagggcgg tgagtggaga aagcaagcna ggaggatntc cgtaagcngg aacgggttct 6000 acgtctgccc nggattcngg acaggaccna tgaaaagnac ctgtggngga ntnatgtccc 6060 tatactgtgc nagttggtct tgtgtgacan ctagtgatgg ggaatggaaa tggaaaaccc 6120 agccctggta cgtaaccatg tcctatgtcc agccctgtac caggacccgg tactcggcca 6180 cctgtaactt aatccgtgtc aaatttgagg aggccgcgaa aactgacccc cgctggacaa 6240 ctggactaat ttggggccta aatttatacc aancgccggc atntgggctc cctatccaaa 6300 tcaggctatt agcnaacccg atctcagtca aggaggcagc cgcggtcccg gtggggccaa 6360 acccggttct aacaggggga gcacctcctc agtcagggag ccggcaaaaa gccccgaccg 6420 ccggtgccca gtctcctcca ggagcgccaa acccagcatc ttctcagtca agtaacccac 6480 cttcccagac agggaaccca gttctaacag ggagtgcacc ttctcagtca ggggactcag 6540 ttataacagg gaaggcgcct cctcagtcag gttccccgtc tcccccacca tccaaatccc 6600 cattggcgct cccggagacc acccgcatgc ctccggaccc ggaaacaagc aatagactcc 6660 tcaacctcat cagaggcgct tacctcgccc tgaaccagac aaggcctgaa tccaccacct 
6720 cctgctggct ctgcctggcc tcgagccccc cttactatga aggtattgcc tctattagta 6780 attttactaa ctccactagt cattctgggt gtgcatggga acagcacaag aaacttaccc 6840 tagcagaagt gtcagggtcg ggaacctgta taggccgggt gccccccagt caccaacatc 6900 tctgtaatag aaccctggca gtacccagaa ctagtcacta tctaataccc tctgggccag 6960 actggtgggc ttgcaacacc ggactcaccc cttgtgtgtc cacggctgtc ttcaacagca 7020 gtgaagacta ttgtatatta gtacaagttg tgcctcgagt ttattatcaa actgcagagt 7080 cttttgaatc ccagtttgag cagaaatccc tcactagaat gaggagagaa cctgtttccc 7140 tcaccctcgc tgttatgcta ggattaggag tagcggctgg ggtcgggaca ggaaccgcgg 7200 cactagtgag tggcagctac cacctgcagc aactcagggc agccgtagat gaagacctca 7260 gggccataga gcactctatt actaaacttg aagaatcact aacctccctg tctgaagtag 7320 tactccaaaa tcgacgggga ctggacataa tttttctaaa agagggcgga ctctgtgcgg 7380 cccttaaaga acagtgttgc ttttatgctg atcattctgg agtagttaaa gactctatgg 7440 caaaacttag aaaaagacta gatgatagac aaaaagaaag agaatcccaa caaagctggt 7500 ttgaagcttg gtagtacaac caatcccccc tggtttagta ctctcatctc caccatccta 7560 gggcccctga ttctgcttgt gcttgttttg actttcgggc cctgcatttt tagccgcgtg 7620 gttancctaa ttaaagatag attaaacnta gtgcatgcca tgnncctgac ccagcagtac 7680 caggcagtta agactgacga agagactcaa gattgagcct ctaagttaca aaaagaggag 7740 ggaa 7744 // ID LTR12F repbase; DNA; PRI; 519 BP. XX AC . XX DT 14-MAR-2006 (Rel. 13.06, Created) DT 26-JUN-2008 (Rel. 13.06, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; LTR12F. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-519 RA Smit A.F.; RT "LTR12F - ERV1 Endogenous Retrovirus from primates."; RL Direct Submission to Repbase Update (26-JUN-2008). XX DR [1] (Consensus) XX CC Preference for insertion in NTAN. 
78 4.00 3.00 0.00 LTR12F CC 1 100 (419) LTR12B 1 103 (564) 73 8.11 0.90 0.90 LTR12F CC 101 211 (308) LTR12B 79 189 (478) 212 2.07 1.24 0.00 CC LTR12F 278 518 (1) LTR12B 192 435 (232) Its 3' end does CC conform to that of the other LTR12 subfamilies (LTR12B has an CC exceptional extension). XX SQ Sequence 519 BP; 150 A; 135 C; 129 G; 105 T; 0 other; tgagaggtga agccagctgg acttcctggg tcgagtgggg acttggagaa cttttctgtc 60 ttacaagagg attgtaaaat gcaccaatca gcgctctgta aaaacgcacc aatcagcgct 120 ctgtagctag caagaggatt gtaaaatgca ccaatcagcg ctctgtaaaa tgcaccaatc 180 agcgctctgt aaaatgcacc aatcagcagg atcctaaaag tagccaatcg cagggaggat 240 tgaaaaaagg gcactctgat aggacagaaa cggaacatgg gaggggacaa ataagggaat 300 aaaagctggc caccccagcc agcagcggca acccgctcgg gtccccttcc acgctgtgga 360 agctttgttc tttcgctctt cacaataaat cttgctgctg ctcactcttt gggtccgtgc 420 catctttaag agctgtaaca ctcaccgcga aggtccgcgg cttcattctt gaagtcagcg 480 agaccacgaa cccaccggaa ggaaccaact ccggacaca 519 // ID LTR11_TS repbase; DNA; PRI; 573 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR11_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-573 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1268-1268 (2010). XX DR [1] (Consensus) XX CC ~97% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 573 BP; 163 A; 146 C; 113 G; 150 T; 1 other; tgagaggacc gtaatagagt ataccataga cttaggttaa gagcatgcta taacccaata 60 aatatgtaat cggcttgtgg acaatagcac agccattttt agttttctgt cttaaaacat 120 aacctgtaac tgtttaaagt ttcagttatg ggaaaaactg caaagctcca gttacaagct 180 tcaacccctt cccttatctc actttcccct tagcaacaac caatcaggtg ccagggggta 240 atcacctaag tttccagatg tctctgcaac ttccctagct agtggaccaa ggtcagcccg 300 agtcaggcta gccaatcctc cacgattaag ctgaccacca gaggttacta cccaattaaa 360 tggtaacama actggaaagc ccccccaaac tcctcctccc tttgtgtaaa agcataaaag 420 gcatctgctt aaggaaatca gggcctctga atgtactccg ctgtgtcggt tgtcgacaga 480 ggcccaggct cgagcttgta aataaagacc ctcttgaact ttcatcggaa ttggctcttg 540 gtggtctctc ggattgaact ttcggccact aca 573 // ID hAT-2N1_MM repbase; DNA; PRI; 198 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.1, Created) DT 21-APR-2009 (Rel. 14.1, Last updated, Version 1) XX DE hAT-2N1_MM is a family of non-autonomous DNA elements found in DE Microcebus murinus mobilized by hAT-2_MM. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-2_MM; hAT-2N1_MM. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-198 RA Novick P.A., Smith J., Ray D. and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). XX DR [1] (Consensus) XX SQ Sequence 198 BP; 41 A; 43 C; 46 G; 68 T; 0 other; caggcgtcct caaactacgg cccgtgggcc acatgcgggt gtttttgccc gtttgttttt 60 ttacttcaaa ataagatatg tgcagtgtgc ataggaattt gttcatagtt tttttttttt 120 ttttaactat agtccggccc tccaacggtc tgagggacag tgaactggcc ccctgtttaa 180 aaagtttgag gacccctg 198 // ID ALRa_ repbase; DNA; PRI; 172 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE SAT Satellite from primates. 
XX KW SAT; Satellite; Simple Repeat; Centromeric; ALRa_. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-172 RA Smit A.F.; RT "ALRa_ - SAT Satellite from primates."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX SQ Sequence 172 BP; 53 A; 29 C; 39 G; 51 T; 0 other; ttgtagaatc tgcgaaggga catttgggag ctcattgagg cctatggtga aaaagcgaat 60 atccccagat aaaaactaga aagaagctat ctgagaaact gctttgtgat gtgtgcattc 120 atctcacaga gttaaacctt tcttttgatt cagcagtttg gaaacactgt tt 172 // ID ERV2-1_Mim-I repbase; DNA; PRI; 6119 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Internal portion of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; nonautonomous; KW ERV2-1_Mim-I. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-6119 RA Jurka J.; RT "Endogenous retroviruses from the mouse lemur."; RL Repbase Reports 11(5), 1524-1524 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX FH Key Location/Qualifiers FT CDS 933..1847 FT /product="ERV2-1_Mim-I_1p" FT /translation="MTPRDWSQLVKAVLTTGQYLDWKSINQEECMEQSRRN FT AQRGQPAWNFDMLTGQGQWVNNQVAYPEEVYAQINSIAIKAWKSLPNRGEV FT KGNLTKIIQAPAEPFSDFVARLVEAAGRIFGDLDTAMPLIEQLAFEQCTTE FT CRNAIAPWKNKGLNAWLKACREIGGPLTNSGLAAAVLAAQXKQERNCYNCG FT KPGHLKRQCRXPPKQGSSSNKQPGICPTCGKGRHWASECRSMKDKDGNPIV FT RDQNGKPVPKQKNGWRGPRPQGPQIYGAMQGNTQERSLLAPPGSQGEPLKV FT LQDWTSVPPPQQY" FT CDS join(1697..2614,2618..5275) FT /product="ERV2-1_Mim-I_2p" FT /translation="MAGPPTPGPANIWGHAGEYPGEVAPSASGEPRRATKG FT SAGLDLCATTTTVLTPLMGCQPVPSDFKGPLPDNTVGLVLGRSSSTLQGLI FT IHPGVIDSDFEGQVKILCSSTRGIVSIYPGDRIAQLLILPSLHSLFPSKQK FT IRGEKGFGSSGGNCAYLSLNLDERPTMDLTIQGKTFTGLLDTGADTSIISE FT RWWPKTWPLAPSAQTLQGLGYASSPVISAQTLKWKDNEGREGSFQPYVLPL FT PVNLWGRDLLRQLDFKLTNDYSVQSQKLMKGMGCVPGKGLGRNLQGRTEPI FT EPVPRPSKQGLGFSGPLAAGDPLPIPWNTEEPVWVPQWPLPSEKLLAAHNL FT VQEQLTLGHLEPSQSPWNTPIFVIKKRSGKWRLLHDLRAINAQMQLMGPVQ FT RGLPLLSALPKHWSLIILDIKDCFFSIPLHPQDKSRFAFTLPSINHEQPDA FT RYQWQVLPQGMANSPTMCQLYVASAIQPVRQKFSKVRIIHYMDDILLCHCD FT DGVLRQAYASLQAHLKEKGLLVAPEKVQEGEVGAFLGSSILPDKIVPQRLS FT IRRDALKTLNDFQKLLGDINWLRPFLCLTTAELKPLFQILEGDSHITSPRT FT LTPEALKALHKVEEAIQKAQLVRIQHHDPFSLCVLPSPTLPTAVLWQDGPL FT LWVHPQASPGKVIGHYPTCVAEIALKGISLAVSHFGRHPDSLICPYTANQV FT NTLCATLDAWAILRCTFPGSIDNHYPRHALLQFAMRNELIFPRVTSLLPLV FT DALDVYTDGSKTGIGSYVIQDKVYTKQFNYSSPQLVECAVVGLVLSTITEP FT INIISDSFYVVNAVKQLENSFHILAKSTVFSVFTEIRNTIQNRKNPFSIQH FT IRAHSLLPGPMAAGNAMADRATRALWVSDGHAAAIDFHKLYHVPAETLRLK FT FSITRADAREIVLQCPSCAPYRTVVHTGVNPRGLRPLHMWQMDVTHVPSFG FT KLQYVHVSVDTCSGVLHATPLAGEKASYVIIHCLEAWAAWGKPRVIKTDNG FT PAYTSKSFQQFCTRLGVEHRTGLPYNPQAQGIVERAHGTLKMYLQKQKGGD FT ALQLGLSPKGRLSFALFTLNFLNLDVKGRSAADRHVTPAPPQKELVKWKDV FT LSNLWKGPDPVLMRSRGAICIFPQDQDNPIWVPTRLTRTVQQPPEQHGDPV FT ATPPDDSMDDREDGSGATLEDPVSASPADACDT" XX SQ Sequence 6119 BP; 1553 A; 1491 C; 1540 G; 1527 T; 8 other; agtggtgccg aaacccggga accgaaacaa ctttgaagca ccagcggaga cgacctcgac 60 agaggcgagt tcagaactga cgtgtgggac ggccaaaact cccctgcttc caaaaaatgg 120 
gtaatattct gctgcggcgc tttttgtggc ctattacatc acaaggagtg cactggagga 180 aaggaccctc ataaaatttt tgaaggaggt ggaccgtgta gctccttggt tcctccactc 240 gggttctttg actattccct cttgggggaa gttaggacgg gaccttcgta gtgcggggac 300 tctgaggcca ggcaccctcc ttgttcacaa cctaattatg gagtgcctgt atgatgaaac 360 tcaggcccgc gcgtagcggc tagccaaaga gcattaaatg acttgcaaga cgcaagttcg 420 gagagaagca gtaggggagg gaccgcggta aaaggtgaaa aaaggaatat agagctcaaa 480 aaaggaaaat gcagggaggg aaagcgcagc ccgatctgcc taattgtgac cccttaccag 540 attttgacaa tctacatttg tctgacactg ctgactctga tcactcgtca gagtccaggg 600 ggggagtcag atgaggaact agaaaagaaa gaggaaccag ctttcaggaa gcggcaggtt 660 atgagcggta ccgcccacct ccgtatgccc cgccttatga ggaamtaagc gttmgcccca 720 ggcagagagg cgcackgcag ggtgactcat tcatccaccc tgaagagcga aaagagctgt 780 tccttacctt tcctgtcttt gagaatgata atcaaagggg ctgggagcca gtaaacgcca 840 agcaattgaa agagctggcg gaggctgtcc gaaactgggg acatggggcc tcctacacct 900 tatctttggt agagagactg ggcaatwcag caatgacgcc cagagattgg agtcagctcg 960 tcaaggcagt actgactaca ggtcagtatt tagattggaa gtccattaac caggaagagt 1020 gcatggagca atcccggaga aacgcccagc gcgggcaacc agcatggaat tttgacatgc 1080 tcacagggca ggggcagtgg gtcaacaatc aggtcgccta cccagaggag gtatatgccc 1140 agatcaactc catagccatc aaggcgtgga agagcttgcc caatcgtggg gaggtcaagg 1200 gtaacttgac aaaaattatc caggcccctg ctgaaccatt ctcagacttt gtagctaggc 1260 tagtagaggc agcaggaagg atatttgggg atctagacac agccatgccg ctcattgagc 1320 agcttgcctt tgagcagtgc actacggaat gccgtaatgc aattgcccct tggaagaata 1380 aaggccttaa tgcctggcta aaagcatgca gagagatagg gggtcccctg accaatagtg 1440 gcctggcggc cgctgtttta gcagctcaak caaagcagga acgcaattgc tacaattgtg 1500 gcaaacctgg gcaccttaaa aggcaatgta gagsacctcc taagcaaggg agctcctcta 1560 acaaacagcc aggcatctgc cctacctgtg gcaaggggcg acattgggct tctgaatgtc 1620 gctcgatgaa ggataaggat ggaaatccca tagttcggga tcaaaatgga aagcctgtac 1680 ctaagcaaaa aaacggatgg cggggccccc gaccccaggg cccgcaaata tatggggcca 1740 tgcaggggaa tacccaggag aggtcgctcc tagcgcctcc ggggagccaa ggagagccac 1800 taaaggttct gcaggattgg 
acctctgtgc caccaccaca acagtattaa cccccctcat 1860 ggggtgccaa ccggtcccat ctgattttaa gggaccgtta cctgataata ckgtaggact 1920 cgtcctcggc aggtcttcat caactctgca gggattaata attcacccag gagttattga 1980 ttcagatttt gaagggcaag taaaaattct ttgctcttcc actagaggca tagtttccat 2040 atatccagga gaccggatag cacagttgct aattttacct agcctccatt ctttattccc 2100 tagtaaacaa aaaatcagag gagaaaaagg ttttggctcc tctggaggaa actgtgcata 2160 tttgtcattg aatttagatg aacgtcctac tatggactta accatacaag gaaagacatt 2220 tacaggcctc ctagatacag gagctgacac tagtataatt tctgaacgct ggtggccaaa 2280 aacgtggcca ttggccccct cagctcagac tctgcaggga ttgggatatg catcctctcc 2340 agtgatcagc gcccagacgc tgaaatggaa ggataatgag ggtagagagg gtagttttca 2400 accttatgtt ctgcctttgc ctgtgaatct ttggggtaga gatttgctac ggcaattgga 2460 cttcaaattg accaatgact attctgttca aagtcaaaag ctgatgaagg gcatgggctg 2520 tgtcccaggt aaagggcttg ggaggaactt gcagggaaga actgaaccta tagagcctgt 2580 gccacgacct tctaagcaag ggctgggttt ttcttagggg ccgctcgccg ctggggatcc 2640 tctgcccata ccctggaata cagaggagcc agtgtgggta cctcagtggc ccttaccctc 2700 tgaaaagtta ctcgcggccc ataatctagt gcaagagcag cttactcttg ggcatcttga 2760 accctctcaa tctccttgga atacccctat ttttgtgatt aagaaaaggt ctggtaaatg 2820 gagactgttg catgacctta gagcaattaa tgcccagatg cagcttatgg ggcctgttca 2880 aagaggactg ccccttctct cagctttgcc taaacattgg agtctaatta tcttggacat 2940 taaagactgt tttttctcta ttcccttgca cccacaagat aaaagcagat ttgcattcac 3000 gcttccttcc ataaaccatg agcagcctga tgcaagatat caatggcagg ttttgcctca 3060 gggcatggct aacagtccca ctatgtgcca actttatgtg gcctcagcta tccagccggt 3120 taggcagaag ttttcaaagg taagaattat acattatatg gatgatattc tcttgtgtca 3180 ttgtgatgat ggcgtacttc gccaagctta tgcgtccctt caggctcact taaaggaaaa 3240 aggcttgctg gtcgctccag aaaaggtgca agagggagaa gtaggcgcgt ttcttggtag 3300 tagcattctc cctgacaaaa ttgtgcctca aagactgtcc atccgaagag atgccttaaa 3360 aacattaaac gattttcaaa aactgctagg ggatatcaat tggcttcgac cttttttatg 3420 tcttaccaca gcagaactga agccattatt tcagattcta gagggagact cacacatcac 3480 gtctcctaga actttgactc ctgaggccct 
aaaggctctt cataaggtag aagaagccat 3540 tcaaaaggcg caactagtgc gcattcaaca ccatgatcct ttctccctct gtgtactccc 3600 ttctcccacg ctccctacgg cagtgctttg gcaggatggt ccccttttat gggtgcatcc 3660 tcaagcctct ccaggaaagg tgattggcca ttatcctacc tgtgtggccg agatagcttt 3720 aaagggtatt agtttggctg tgtcccattt tgggcgtcac cctgatagcc tgatttgtcc 3780 atatactgcc aatcaggtga acaccttatg tgccactctt gatgcatggg ccatattgag 3840 atgcaccttc ccagggagca ttgacaacca ttatcccaga catgccttgt tgcagtttgc 3900 aatgaggaac gagcttattt tcccaagagt aacttcgctg ctacctttgg ttgatgcctt 3960 ggatgtgtat accgatggct ctaagacagg tattgggagc tatgtcattc aggacaaggt 4020 gtataccaaa cagtttaact attcttctcc tcagctagta gagtgtgccg tagtcggtct 4080 ggttctcagt accatcactg aacctatcaa catcatctca gactcctttt atgtggtcaa 4140 cgcagttaaa cagttagaga actcctttca cattcttgcc aagagtacwg tcttttctgt 4200 atttacagaa attaggaata ctattcaaaa taggaaaaac ccattttcca ttcagcatat 4260 cagagctcat tccttgttac ctggacctat ggctgcggga aatgctatgg cagatcgcgc 4320 cactcgcgcc ctctgggtgt ctgatggcca cgccgccgcc atagactttc ataaacttta 4380 tcatgtgcca gctgaaacct tacgacttaa gttcagcatc acacgtgcag atgctcgtga 4440 gattgtactg caatgtccaa gttgtgctcc ttatcgcact gttgttcata cgggcgtgaa 4500 tcctaggggt ttgcgtcctc tacatatgtg gcaaatggat gtaactcatg tcccatcctt 4560 tgggaagctt cagtatgttc acgtgtctgt tgacacgtgc tcaggcgtcc ttcatgccac 4620 gcctctggct ggagaaaagg caagttacgt tatcattcac tgcctagaag cttgggcagc 4680 ctggggcaag ccccgggtta ttaagacaga taatggcccg gcttatacct caaaatcttt 4740 tcagcagttt tgcactcgct tgggtgtgga gcaccgtacc ggtcttccct acaaccctca 4800 agcgcaaggc atcgtggaac gtgctcacgg cacgctcaaa atgtacttac aaaaacaaaa 4860 agggggagat gccctacagc tagggctatc acccaaagga cgcctctcct ttgctctctt 4920 tactttaaat tttttaaatt tagatgtcaa gggacgttcg gcagcagaca ggcatgtcac 4980 gcctgctccg ccacagaaag aattagtgaa gtggaaagat gtgttgtcta acctatggaa 5040 aggcccggat ccagtgttaa tgagatccag gggagctatt tgtatttttc cacaggacca 5100 ggacaatccc atctgggtgc ccacgcggct gactagaacg gtgcagcagc ctccggagca 5160 gcatggggac ccggtggcta ctcctcctga cgacagcatg 
gatgatcgcg aggatggcag 5220 cggagccacg ctggaggatc ctgtcagtgc ctcccctgcc gatgcctgtg acacataact 5280 caccaatttt cccccgcttc ttttctacta atgtttccct gggcactgcg tacatgccgt 5340 tggacacgca gagtagtaaa ctggagggaa atagatcctt ctccttgaag gggactttgt 5400 gttttgtgtg gggttaacgg aagctgcacg gacttggccc ccctgtccat tggtggctgg 5460 gggtatggca gggaatgcta gtttcgcatg gaatcttagc gactcttttg aaggttctgt 5520 gagggtgatt gcgggagggc gaaacgcgac ctttgcccct acacctgtct gggtttggcc 5580 cccgtttatc tgggtggtat ccacggaaaa accctctggt acacatgtga actgttcctc 5640 taatacgtgc aattatacat tgtgttggaa tgccacgtct cacccctctg ccattgttac 5700 ctgcttgcct agatacattc ctgttcctgt tgaagctcct agctctatga ctctgtttcg 5760 gtcttgcagc cacggaccat gtggtattgg cacagacatt tgctgcgctt gaactgagac 5820 agtctgcaca ggtgtggttg accatgctaa aaggttagcc tagttttggg ttgcacccgc 5880 tcctaccccc aggtatttta gcagacatgg cctttgcact ctgaggggtc tctcccattg 5940 caccaaataa ggcatgtctc ctatcccaca gccagccttt gcacacctcc gagttgtgct 6000 cattgcacga ggataggtgg ttgctggtga aagtgaggct ctgcaacctt gatcgcacgg 6060 tcgaaggacg gtgctgaacg gttgctcagc tctcattaaa ttaataaaac agggggaga 6119 // ID LTR14_OG repbase; DNA; PRI; 556 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.11, Created) DT 18-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR14_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-556 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2860-2860 (2009). XX DR [1] (Consensus) XX CC ~90% identical to consensus. 6bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 556 BP; 129 A; 143 C; 141 G; 143 T; 0 other; tgtcaggtcc cttcgtggtc gccacttatc agggcccggg actggctggt ctccggccct 60 ggcttaacta ggttccggac gcggaccgct aagctacctt gctcaggaac aattgctctt 120 aggtgcctga gggctattta ctttagcaga agggagagag agagagagat agtctttgag 180 agagtaaagc agtcttctta gaaagataga tcttagtctt tagcagagct tagtgatctt 240 cagcagagag acgccgtgct ggcgttctgc aggcaggaga ggagacagag tcagagtaag 300 cgggtccgtg atagaatgcg catgctcccg actgccttct cctccagcct aatataaggc 360 tgatctcgtg cctctcaaca ccacaggtgt agagactgcc aattcaccaa tgctctcatc 420 tcgcgagctc tcatgcttgt taggagagct aaatccctct ccatgccctt ttcaccaagg 480 tgttccatta ttgagctata caggtatttc ttataactct cactcctcct agccataggc 540 tatacgggct gctaca 556 // ID LTR6_Mim repbase; DNA; PRI; 576 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR6_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-576 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2957-2957 (2009). XX DR [1] (Consensus) XX CC ~92% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 576 BP; 135 A; 181 C; 147 G; 113 T; 0 other; tgttgcggca gtgtgtacag ctagcgttac agccttgtta cagctcttat tcatttgacc 60 ggggaaagaa ccgagcggca cacgaagagt cggagaacag cctttattct tctacagcag 120 gctcagcaca cctagtgtta cagctctctt cgtgtcttcc gatgttctcc cccttcctgc 180 ccttctgccc ctgcttttat acccttggta gggctcaaga agcgtccaat caactacaag 240 ctttaacatc caatcagcaa caagctttaa catccaatca gcaacaagct ttaacatcca 300 atcaacaaca agctttaaca tccaaccaaa aacagtggga ttcaaaaatt acccaatcag 360 gatcacgcaa gcagcgtggc cggaaacagg cgcggcacgt gggcctgggg cattccggca 420 cgcggggtcc cggcggcacg cgggcgtccg gggcggctgg ggccggcccc agcagcgtgc 480 ctctaaccgg ccacgcgcag cactggcccc cggagatgcg gaactacggg gaggcggcaa 540 tcggccgcgt cccggcgccc gacattttcc gcatca 576 // ID L1A_Mim repbase; DNA; PRI; 6545 BP. XX AC . XX DT 07-JAN-2010 (Rel. 15.01, Created) DT 07-JAN-2010 (Rel. 15.11, Last updated, Version 4) XX DE LINE element - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1A_Mim. XX NM L1A_Mim; LTR6_MD; LTR86_MD. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-6545 RA Jurka J.; RT "LINE1 elements from the mouse lemur."; RL Repbase Reports 10(1), 10-10 (2010). XX DR [1] (Consensus) XX CC >98% identical to consensus. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. 
XX FH Key Location/Qualifiers FT CDS 1659..2378 FT /product="L1A_Mim_1p" FT /translation="MKNQFKELQNTVESLKNRVDQTEERISELEDNTLQLN FT KSVTEIEQRNKRKEQSLQELWDYVKKPNVRVIGLPEGEEDNTQGLDKLFED FT IIEENFPGLAQNLDIQVQEAQRTPGRFNANRKTSRHAVIRLTKVSTKEALL FT RAVRQKKQVTYKGKPIRITSDFSNETLQARRDWGPILTLLKQNNAQPRILF FT PAKLSFVYEGEIKTFSDKQRLREFTKTRPALQEVLKTALRTEHHNNNPRI" FT CDS 2423..6295 FT /product="L1A_Mim_2p" FT /translation="MAQDRNHSNNIQPNRMISNLPYLSVLSINVNGLNSPL FT KRHRLAEWIRKYRPSICCLQETHLTCKDAYRLKIKGWRSIFQANRSQKKAG FT VAVLISDDLVFKPTKVVKDKEGHYIMVKGTVQQEEITILNIYAPNLGAPRF FT IKQTLLELSKWINSNSIIAGDFNTPLTARDRSSKQKINKEIMDLNKTLEQL FT GLTDIYRTFYPKSTEYTFFSSAHGTFSKIDHILGHKENLKKFKKIEIIPCT FT FSDHSGIKLEINPNRNSHFYTKTWKLNNLLLNDYFVNEEIKTEIKKFYEEN FT DNGETSYQLLWDTAKAVLRGKFISINAYNQKTRRSQIDNLMKRLKELEKEE FT QTNPKPSRRSEINKIKSELNEIENREAIQEINKTKSWFFEKINKIDTPLAK FT LTKSRKEKSLISSIRNKKGDITTDPKEIQDTIYEYYKNLYAHKLENVEEMD FT KFLETHSLPRLNQEEIDSLNRPISTAEIETAIKNLPKKKSPGPDGFTPEFY FT HTYKEELVPILQKLFHNIEKNGNLPDTFYEANITLIPKPGKDATKKENYRP FT ISLMNIDAKIFNKILANRIQTLIKKIIHHDQVGFIPGMQGWFNIRKSINAI FT HHINRSKNKDHMILSIDAEKAFDKIQHPFMIRTLKKIGIEGTYLKMIQAIY FT DRPIANIILNGERLKSFPLRTGTRQGCPLSPLLFNIVLEVLATAIRQENGI FT KGIQIGAEEIKLSLFADDMILYLENPKDSTKKLLELINEFSKVSGYKINTQ FT KSEAFIYANNNLIENQIKDSIPFTIATKKLKYLGIYLTKEVKDLYRENYET FT LRKEIAEDVNRWKSIPCSWIGRLNIIKMSILPKLIYRFNAIPIKIPSAFFT FT DIEKIILRFVWNQRRPRISRAILGNKNKMGGINMPDIKLYYKAVVIKTIWY FT WHKNRNIDQWNRCENPDIKPSSYSHLIFDKADKNIRWGKESLFNKWCWENW FT IATCRRLKQDPHLSPLTKTNSRWITDLNLRYETIRTLEEKVGNTLLDIGLG FT KEFMKKSPKAITAATKINKWDMIKLQSFCTAKEIVMKVNRQPTEWEKIFAS FT YASDKGLITRIYLELTKIRKKKSNNPIKKWAKDLNRNFSKEDRRMANKHMK FT KCSTSLIIREMQIKTTMRYHLTPVRMAFIKKSPNNKCWRGCGERGTLLHCW FT WDCKLVQPLWKAIWRYLKAIQVNLPFDPAIPLLGIYPNDPVTLYKKDTCTR FT MFIAAQFIIARLWKQPKCPSIQEWINKMWYMYTMEYYSALRNNGDIAHLIF FT SWLELEPILLSEVSQEWKNKHQIYSPANWY" XX SQ Sequence 6545 BP; 2461 A; 1490 C; 1351 G; 1243 T; 0 other; ggggggggcg gagcaagatg gcggacgaat aacaccgcca gacagagggt ctctgcagaa 60 aagacagatt ctagcagaaa ctagaggaaa gaagcaagaa gacgagcata 
cagcggacaa 120 gggccggaag gaggggtacc tgagaccccg ggagactcca cgggaggagg ctgcggagga 180 gaactggagg ctgagaccac cggagcagcc cggagaccag cggcaagggt aggtggattt 240 gctgtttccc ctcccctgca ttcgggactg ctggtgggct ccccagcggg tggagagacc 300 tgcggacacc agcccagaga ctgccgccgc ctgccaacgg tgagcctgta gcagacgtgg 360 caccaggttc ccaacttcct ccgggcacct ccgtgtgcac ggacccgagc cgcgcggcag 420 gcgccatatt gcctcctcct cccctccgcc gaccctaccc gcggctgccc agagagacaa 480 tacagccacc agccggaggc acctccaggg aacgggacct tcccgtttgg gaccccgccc 540 gccctcccag gtgctgctgg caccgtgttc ccaggagaac ggtgccgact cagaggctga 600 gagacataga cccagcttgg gctccctgtg ggtgaattag gaccggaaat cctctccctg 660 gtgggaatac agtttgaact ctgggaccca gaggtcggac ctgcagacca gatcccctgc 720 accgagggct agcattgccc ggggcacaga agggttatac gtgaacagcc tactgaggtc 780 tgtgtgcctc caggggcgga tcggcgtcct agagggcaac cctcctccca ggaggaggcc 840 gtgcgcccaa cccaggtggc gttcctgtgc agggaacctc cccgccggca tcacagtccg 900 gggaggcctg gtggcttgtg gtctggcctg ctggcagagg cccaggagta gctgcggagt 960 tggggagggt ggaaagaagc gaggcctgct gcagactgcg ggtctcagac agccccaccc 1020 ccacacccag actttctggc tgagcgggac cattccagcc ccgccctgac agctttccct 1080 ggaagcagag aacagaactt tgacccctgc taacggcctg agggcaggct tacccaaccc 1140 agctccgccc agaacgagag ctgataacag gactcaaaat caacaccata gcctgttcct 1200 ccaagcaaac gccacctact gacagggacg gcatcttgca cagcctttcc acggcaccca 1260 ctgactcaat atacagggag tggtccaatt tcacccacag gcaccaccta acgcctcaga 1320 aactaaacaa ggtgtgtgaa tacccaaaca ataacctaag gaaagaaaca acaactgatc 1380 gacatgggaa gaaatcagcg aaagaactca ggaaatatga agaaccaaac ggaaaacaca 1440 cccccaaaga ggagcaccag ccccctagaa acggacaccg accaaaatca ggcaaccaat 1500 atgacagaag aggaatttcg tatgtggatc ataagaacac tcacccagct gcaacaacaa 1560 ctcaataacc aacaccaaga aaccacaaaa agcctccagg atatgggacg gccaaagaaa 1620 tagacacagt gaagaaaagt gtaaccgaac tcctggaaat gaagaatcaa ttcaaggaac 1680 tacaaaatac agtggaaagt ctcaagaaca gggtagatca aacagaagaa agaatctcag 1740 agcttgaaga taacaccctc caattaaata aatcagtcac agaaatagag cagagaaaca 1800 agagaaaaga 
gcaaagccta caagagctgt gggattatgt gaagaaacct aatgtgaggg 1860 tcatagggtt accagaaggg gaagaagaca acactcaagg gttggacaag ctgtttgaag 1920 atataataga ggaaaatttc ccaggccttg ctcaaaatct tgatatacaa gttcaagaag 1980 ctcagaggac ccctgggaga ttcaacgcaa acaggaagac gtcacgacat gcagtcatca 2040 gactgaccaa agtatcaact aaagaggccc ttctaagagc tgtaagacaa aagaagcaag 2100 taacatacaa gggaaagcca attcgaataa catcagactt ctctaatgag actttacaag 2160 caaggagaga ctggggcccc attctcactc ttttgaaaca aaacaatgcc cagcctagaa 2220 tattattccc tgcaaaacta agcttcgtat atgaaggaga aataaaaaca ttctcagaca 2280 agcaaaggct cagagaattc accaagacaa gaccagccct acaagaagta cttaaaacag 2340 cgttacgcac ggaacatcat aataataacc cacgaatata aaaacaacca aaacccaaag 2400 atattaaagg ccagatatta caatggctca agacagaaat catagcaaca acatccaacc 2460 caacagaatg atcagtaatc taccttacct atcagttctc tcaataaatg tgaatggctt 2520 aaactctcca ctcaagagac ataggctggc tgaatggata agaaaataca ggccaagtat 2580 atgctgtctt caggaaacac atctaacctg caaggatgca tatagactaa aaataaaagg 2640 gtggagatca atattccaag caaatagaag ccaaaagaag gctggtgtgg cagttctaat 2700 ttcagacgat ttagttttta aaccaacaaa agtagtaaaa gacaaagagg gtcattatat 2760 aatggtgaag ggcacagtcc aacaagaaga gataacaatt ttaaatatat atgcacccaa 2820 cttaggtgca cccagattca taaagcaaac cttactggag ctaagcaaat ggattaatag 2880 caactccata atcgccggag atttcaacac cccactgacg gcacgagaca gatcctccaa 2940 acagaaaatt aataaagaaa taatggactt aaacaaaact ctagaacaat tgggtctgac 3000 agacatctac agaacattct acccaaaatc cactgaatat acgttcttct catcagctca 3060 cgggacattc tctaagattg accatatcct aggacacaaa gaaaatctca agaaatttaa 3120 aaaaatagaa atcataccat gtaccttctc agatcacagt ggaataaaac tagaaatcaa 3180 ccctaacaga aactcacatt tctacacaaa aacgtggaaa ttaaacaacc tcctactaaa 3240 tgattacttc gtaaatgaag aaatcaagac ggaaataaaa aagttctatg aagaaaacga 3300 caatggagag acaagttatc aactcctctg ggacacagct aaagcagttc tgagaggaaa 3360 gtttatctcc ataaatgcct ataaccaaaa gacaagaaga tcacaaatag acaatctaat 3420 gaaacgactc aaagagctgg aaaaagaaga acagaccaac cccaaaccca gcagaagaag 3480 tgaaatcaac aagatcaaat 
cagaactaaa cgaaattgaa aacagggaag ctattcagga 3540 gattaataaa acaaaaagtt ggttctttga aaaaataaac aaaattgaca caccattggc 3600 taagctaacg aaaagcagaa aagagaaatc tctaataagc tccatcagga ataaaaaagg 3660 agatatcaca actgatccca aagagataca agatacaatt tatgaatact acaaaaatct 3720 ttatgcacac aaactggaaa atgtggagga aatggacaaa tttctagaaa cacacagcct 3780 ccctaggctc aaccaggaag aaatagattc cctgaacaga ccaatctcaa cagctgaaat 3840 agaaacagca attaaaaatc tccctaaaaa gaaaagtccc ggtccagatg gcttcacacc 3900 tgaattttac catacttaca aagaagaact agtacctatc ttgcagaaac tattccacaa 3960 catcgagaag aacggaaacc tccccgacac cttttatgaa gcgaatatta ctctgatacc 4020 aaaaccagga aaggatgcaa caaaaaaaga aaactacaga ccaatatccc taatgaatat 4080 agatgcaaaa attttcaaca aaatcttagc taaccgaatc cagacactta tcaaaaaaat 4140 aatccaccac gaccaagtgg gcttcatccc agggatgcag ggatggttca acatacgtaa 4200 atctataaat gcaattcacc acataaacag aagcaaaaac aaagaccaca tgattctttc 4260 aatagatgca gaaaaagctt ttgacaaaat tcaacaccct ttcatgatac gaacacttaa 4320 gaaaataggc atagaaggga catacctaaa aatgatacaa gccatatatg acagacccat 4380 agccaacatc atactgaatg gggaaagatt gaaatcattc ccacttagaa ctggaaccag 4440 acaaggctgc ccactatctc cacttctgtt caacatagtg ctggaagtct tggctacagc 4500 aatcagacag gaaaatggaa tcaaaggtat ccaaataggg gcagaagaga tcaaactttc 4560 actgtttgct gatgatatga tattgtatct agaaaacccc aaggattcaa ccaagaaact 4620 cctggaactg atcaatgaat ttagtaaagt ctcaggatac aaaatcaata cacagaaatc 4680 agaggcattc atatacgcca acaacaatct aattgagaac caaatcaaag actcaattcc 4740 cttcacaata gcaacaaaga aattaaagta cctaggaata tacttaacca aggaggtaaa 4800 agacctctac agggagaact atgaaacact gaggaaggaa atagcagagg atgtaaacag 4860 atggaaatcc ataccatgct cgtggatcgg cagactcaat atcatcaaaa tgtctatact 4920 acccaaactg atctacagat tcaatgcaat acctattaaa atcccatcag cattcttcac 4980 agatatagaa aaaataattt tacgcttcgt atggaaccaa agaagacccc gaatatcaag 5040 agcaattcta ggcaacaaaa acaaaatggg aggcattaat atgccagata tcaaactata 5100 ctacaaagct gtagtaatta aaacaatatg gtattggcac aaaaacagga atattgacca 5160 gtggaacaga tgtgagaatc ctgatataaa 
accatcctca tatagccatc taatctttga 5220 caaagcagac aaaaacatac gctggggaaa agaatccctc ttcaataaat ggtgctggga 5280 aaactggata gccacctgta gaaggctaaa acaggaccca cacctttcac ctctcacaaa 5340 aaccaactca cgctggataa cagacttaaa cctaaggtat gaaactatta gaactctaga 5400 ggaaaaagtt ggaaacactc tcctagacat cggcctgggc aaagagttta tgaagaagtc 5460 cccaaaggca atcacagcag caacaaaaat aaataaatgg gacatgatca aactacaaag 5520 cttctgcaca gccaaagaaa tagtcatgaa agtaaacaga caacctacag aatgggagaa 5580 aatttttgca tcctatgcat ccgataaggg actgataact agaatatact tagaactcac 5640 gaaaattagg aagaaaaaat caaataaccc cattaaaaag tgggcaaagg acttgaacag 5700 aaatttttct aaagaagaca gaagaatggc caacaaacat atgaagaaat gctcaacatc 5760 tctaatcatc agggaaatgc aaatcaaaac cacaatgaga tatcacttaa ccccagtgag 5820 aatggccttt atcaaaaaat ctccaaacaa taaatgctgg cgtggttgcg gagagagagg 5880 aacactccta cactgctggt gggactgcaa actagttcaa cctctgtgga aagcaatatg 5940 gagatacctt aaagcgatac aagtgaatct accatttgat ccagcaatcc cattgctggg 6000 catctaccca aatgatccag tgacactcta caaaaaagac acctgcactc gaatgtttat 6060 agcagcacaa ttcataattg caaggctgtg gaaacagccc aagtgcccat caatccaaga 6120 atggattaat aaaatgtggt atatgtatac catggagtac tattcagctc taagaaacaa 6180 tggtgatata gcacatctta tattttcctg gttagagctg gaacccatac tactaagtga 6240 agtatcccaa gaatggaaaa acaagcacca gatatattct ccagcaaact ggtattaact 6300 gagtagcacc taagtggaca cataggtgct acagtaatag ggtattgggc aggtgggagg 6360 ggggaggggg gcgggtatat acatacataa tgagtgagat gtgcaccatc tgggggatgg 6420 tcatgatgga gactcagact tttgggggga gggggggaaa tgggcattta ttgaaacctt 6480 aaaatctgta cccccataat atgccgaaat aaaaaaaata attaaaaaaa aaaaaaaaaa 6540 aaaaa 6545 // ID LTR43_I repbase; DNA; PRI; 4798 BP. XX AC . XX DT 19-JUL-2006 (Rel. 11.07, Created) DT 10-APR-2007 (Rel. 11.07, Last updated, Version 1) XX DE Internal portion of ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; LTR43_I. 
XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-4798 RA Smit A.F.; RT "LTR43_I - ERV1 Endogenous Retrovirus from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC mer4 group. XX SQ Sequence 4798 BP; 1372 A; 1146 C; 874 G; 1402 T; 4 other; tttccggtgc catgactcag aggtttgtct gcttcgttgg tttcagtttc ccttcactac 60 tggtgagtac tatggcagcc agagacccct gattgactat cactgctttc cccagatcta 120 ttaaggtttt gggggaggac cttttaactc actcacattc tttgagcaac taattgtgat 180 tgctttccat ttggctgctg cttttacagt gtttacaatt accttatttg gatggaacgc 240 cctgattatt cagccttggg acttttgctg cttctgtttc actttttgtt ttgctgttcc 300 tcccaggact gcacctgatc tgtacttact ggctattgta acnacttgtt tgttaatcaa 360 gtaatctctt caaagatttt tgttcacctt gagggacaca ttagatctac ttttgccaac 420 agtccccatt cctccaggct ctgtgtgttc tgagactcct ctgagtctca gaggagtgtg 480 ttctgaacgt ctcctctgag aacaggagac gttccaagag gccatccatg ttgagtgcag 540 gatgtgtggc cacatggatg tgtagtcatg gggactataa ccaggcattc caagcatgat 600 gactggacat taaaaatggc agatcagtga aataaggaag ggcttgttgg tgagacatcc 660 aggctccccg gctggcagca gagatcactt cagttcagct tggagacgtc cagcaccagt 720 gagacctaga atggtgcatg gcaaatgccc atgacctcct agggcctcag tttcatgggg 780 attcaaggga acaccctgga ctccatcgtc cagcttagct cacagggatg ccgatgacct 840 cctggatttt ggtacatgtt tctgtggttg caggattctc ttgttaccta gaaagccacc 900 tcctctactg tcactgaaac acctctaggg tatatactaa acattggaat attttgaaac 960 tgtataaatt aaaagataat aggtnttttt tttaaataaa tataataaac aattgccaaa 1020 gagtaaaact attgatacaa tcctcaccac tttaaggctt aaggttttct tttccatcac 1080 tgagtctctc cctttcctct cattcttcca cttacaaatc tccaaaacaa ttctcacgca 1140 ctgtgacttt gctcccttca gctgatttat cagttcatcc tgatagcctg ataggtgaca 1200 agcagaggtg aggacttcaa agttcacacc aagtagatct agttcactgt ggccctcctt 1260 gacaggaggt ttgtgaagct ggcagggctt ccgtccaggc tgtgcactgt ctgggaatcc 1320 tcatttgcaa tgtctggaga tcttcatttt tcttactact aacaatcatc ttgttatgtt 1380 
tgcacttctt tgcatttcac cccttttgaa ttctgtcctt ccatgaaaat ttattgtcct 1440 ttttgatcca tctgtattca cagactttca tttgctttct ttttctctct aacccgtaag 1500 actgataaaa attgtcctaa agtttctttc tttctgcttt gtgtgtcagg gctcctctgc 1560 ctttggtgag agcagagttt tatctttacc ggaagaaaac taattgctgg gtgaaatata 1620 ttttctacca aattcccctt acgagaccta gaaagcctaa tgaacatagc tacttacatg 1680 tcctaagctg ttattttaag gccaaaatta aaacattaag ggcacatata aggttggcca 1740 ttactaacct gaaaaaaaag ataaataaat ttccatgatt aggtcttttc aacactgcat 1800 agtcccaaac aatactgttt tacaattaga gtttttgttg ttgttgctgt ttttaaataa 1860 aaagaaagga agtttngagg atgatcaggg attttccaag ggcccagggg aacctgacat 1920 tattccccct actaaccaga cagctctata ctaagaccag tcccttagag actgatacca 1980 aatctattat gctcatgtta ttcaaaagaa tttggggaaa tctaacataa ttaatgactc 2040 tataataaga aatataccag ctgggtgcaa cagtgntacc tcctaccaac aactttcctc 2100 ccttacaatc tagtccaggg ttactcttca aacctcttaa gcttctactc ctgtagtcct 2160 tcctcacttg acacacagtc ttctgcaccc cgtccttatc agcttgttca ccaaacactc 2220 cctaaagagc ccagtcctgc tgggacaact catagcagag tatcctattg cccccctaaa 2280 acaaaaagca acctactctc actctctatc tgtatctccc tctctcaggt aacacacaga 2340 acaacaacca aatcctctta gagacctact tcatgagtca gtctgtccca gatatcagga 2400 aaaagtcaca aaactagtca taaatcccca agtcccaata aatgaactgc taaacctaac 2460 ttttggtgtc tttaattacc aagacagagt ggaaaaggca catagagatc aaagggaaga 2520 aaagagagac aaaagatagt cccaattttt ggccttcact cactatgcga aaactcccac 2580 ctccaggtca tcctgagtgg aacccaaggg ctattcctgc acttataaaa agcctggaca 2640 ccggagctaa gtaagtaaca aaggccttca ggcttgcaaa ccctctggag cctgtcatca 2700 atgtgacaaa gaagggcaat ggaagaagga ctgtctccaa ctctgaaggg aggagggact 2760 cctaattcct tattgtccct ggctaaagac taaagagacc aaaggcaaaa aacagctcct 2820 atgtggcaat cagccccagt cacagcaatg gagcctcgga tgaccctgga catgacaggc 2880 aaaaatatca atatcctttt aaagacagag gctggcctgt cagttctcac tgtctgccct 2940 gggcctctgt ctaccaaaca cgacactgtc attggtgtta atagcaaact ccagactagg 3000 attttcactc taccatgcag ctgaccaact tctgctgcag taaaacttag gggtgtaggc 3060 ctttggtgtg 
tttatcaaaa ataaaaaatg attcctttta agtcatcaca gaaacttgaa 3120 acaaagactc caagctattc ctatgaagca ctggaggatc taaggctcct gtccaaaaac 3180 agccaagacc caaaacatca ggcaattaat gttgcctcag cataagcttc tattcaagaa 3240 aacaactcac agtgaaatgt gatgttttta tttttttctt atttatttac tgtattttag 3300 gcgcttttag taaaacgacc ttatctgcta aagaaataat aaatcatact actaatttat 3360 aaaaattaac tcagtcttgc tggctttgca tgactaccaa aatttaaaaa tgtgcaaaac 3420 ctgtttctcg ggaagaatgg gccaacattc ctatacacct cctggaacaa actttggacc 3480 ataatgtgga aatatctgac taaacaaaca atacaaagag agttccttgg acctggccac 3540 tgccagttca aacttccatt tttatctatg aataatagct tcactctgcc aaggggaaaa 3600 ttgctttctt accttgcttt ttacccagag caattcccct tctgccttta cagcaaccat 3660 gccagtttca ctccttttat agaaaaactc cacaagagag tcagtatatc taaacctttc 3720 tcacagaatc atttatacac ctcatgatag aaccctaaag ggggaacttt atttcaaaaa 3780 gcttattaac accactcaac tctaccatcc tctaattagt ccagtgacca ccaaatttcc 3840 attactttta ccacctcgat gcaaaatgct tttgcagcac aaatttcacc atcacatata 3900 atttgcttgt gttggtattt gtggatcttc agcacgtcta caactccctc cacaatggaa 3960 gggacgatgt cccatagttt acatttcccc ttatctacct tttgcattgg ctaacaaatc 4020 tctccctttc cccatgtacc aacatcacaa gatccaccgc tgagcaggat tccttgttcc 4080 cttgggatta gtgctatcct ctctatcggg actagcagag ccagccacag agacagagcc 4140 ttgggaaccc agcataaact gtctcaggag accagagtgg ccctctgaca aacagcagag 4200 agcctcacta gacttcagca acagctggac ttcctggcag tcctacaaaa ccgaagagcc 4260 ttagaccttc tcacagttgg acaacgagga acatgtttgt atctagaaga agaatgttgt 4320 tttcgcatca atcaaattac aaatatatat taatagcatt ttcttggaat aagaaaatca 4380 ttacccaggc agacaaaatt gaatatttag gagcttccgt gggaacttgg aagcaatggc 4440 tgttttctgc cttgctccct ttaacaatgc cagtcattac catatgttta gctctaactt 4500 ttggtccaac tttgtttaaa atgctgattt cttgctttgt cacctacagc aaatcccggt 4560 tcatgtgatg gttttgcaag gcttccaacc tttggctgct aatgagctat ctcacatctt 4620 gcccaccagt cccctgaaag acatggctta cacactgtta gactaggcag gaaaagactt 4680 cagggcccag gttaggcaag gacaatgccg cactcagcag gaagcagctc tggaagaaat 4740 gacctagcct ctcatcctcc 
cgtatgatta tgggtcctaa gatcttttag ggaggaat 4798 // ID LTR1B1a_Mim repbase; DNA; PRI; 637 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; nonautonomous; KW LTR1B1a_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-637 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1717-1717 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 637 BP; 178 A; 168 C; 109 G; 182 T; 0 other; tgtaaactaa aaataaaatc ctaagccccc cgctgactga acggaccccc tcttggccaa 60 ggggacccca gaaaaacctt aaaactgagt tcccggccat gaagggaagg gaggtcggac 120 acgcctcatt ataccccctc ccttttggag tttagactaa ttgcaaacag gacctaaggc 180 catgcaaggc aaaggttaag tcacgcctgc aggccatcaa tttgcttaac agatcacttt 240 gagtcagtat attgtggctt gctctatgat taacagactt ccttatctta aaacattcct 300 ttctactgat tccaagcctt tagacaaagc tttatttctt taaccaatta caaatcaaag 360 aatctctgaa cccacctata acctgtaagc ccccgctttg agatgtcccg ccttttcagg 420 ccaaaccaat gtataccttc catgtattga tttatgattt tacctgcaat tcctgtctcc 480 ctaaaatgta taaaaccaaa ctgtaaccct ggccgcctcg ggcgcacttt ctcaggacct 540 cttgagatat gttccctggg ccgtggtcac tcatattcgg ctcagaataa acctctttaa 600 attattttac agagtttggg tttttttccc attaaca 637 // ID ERV2-4_TSy-I repbase; DNA; PRI; 2907 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Internal portion of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV2-4_TSy-LTR; ERV2-4_TSy-I. 
XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-2907 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1208-1208 (2010). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX FH Key Location/Qualifiers FT CDS 2138..2905 FT /product="ERV2-4_TSy-I_2p" FT /translation="MILKQPPYLMVPVSLPSLWFDEYGVKALYEVNSLLQR FT SKRFIATLVLGITTLITVATSLSVSAVALAKEVHTATFADQLSKNVSLALA FT TQEIIDQKLESRVNALEEAILIIGQELITLKIKLALRCHSEFKWICVTPLQ FT VNKSIHTWDKIKNHISGIWNHSDLSIDLSKLQQEIQDISQAEKQFNSDQLA FT QSFFNNLSSFTNQKSLISILINVGICGVIILFLLCLLPVIFRLLKNNINRL FT TVEFHGFVLKNKKGG" FT CDS join(141..584,575..1000) FT /product="ERV2-4_TSy-I_1p" FT /translation="MIMGQSESKERKLFISILMHMLTKRGIKVSSTQLSHF FT LHFVQEQCPWFPEEGTVNLATWTKVGEELKLYYTLHGPERVPVNTFALWNI FT IRDVLNPQQEADKLPTKLKEEPGERTPLLTPVPSQAMLATAPVDDDHLSPG FT NEENYREEGRRRIGLELGTQIYLRPLFLKLPTLLPGKTISQFWVEQWPMPS FT EKLKAAKDLVQQQLEAGHIEPSNSPWNTPIFVIKKKSGKWRLLQDLRPLLP FT CVLGDKIFQHSFLVMPTCSAPLLERDIINKIKTHIILCPHHCAFWVPFV" XX SQ Sequence 2907 BP; 928 A; 555 C; 557 G; 867 T; 0 other; agtggcgccc aaacaggggc ctgaaaggta agctttgttc taattagaac cgggatggat 60 aggaccttac ccagggtgct gcagacccct ctgtagagat aggcagagtg aggaagggta 120 agtagtttaa gagccatgat atgatcatgg gccagagtga atctaaagaa agaaagctgt 180 tcataagtat tctgatgcat atgcttacta aaagaggaat taaagtttcc tctacacaat 240 tatctcattt tcttcatttt gtgcaagaac agtgcccctg gttccctgaa gaaggcaccg 300 ttaatttagc aacatggaca aaggttgggg aggaattaaa gctgtattac actctccatg 360 gtccggaacg agtgcctgta aatacttttg cattatggaa tataattaga gatgttttaa 420 atccacaaca agaagctgat aaattaccca caaaattaaa agaggaacca ggtgaaagaa 480 ctcccctatt aacccctgtt ccttcccagg caatgttagc cactgcccct gttgatgatg 540 atcatttatc 
ccctgggaat gaggaaaatt atagagaaga aggataggtc tggaattggg 600 tactcaaatt tacctaaggc cattgttctt aaagctgcca accctattac ctggaaaaac 660 aataagccag ttttgggtag aacaatggcc tatgccatca gagaaactta aggcagcaaa 720 agatttggtg caacagcaac tagaggctgg acatatagaa ccttcaaaca gcccatggaa 780 tactcctata tttgtcataa agaaaaaatc gggtaagtgg agacttttac aagatttaag 840 gccattactt ccatgtgttc tgggagataa aatatttcaa cactccttcc ttgttatgcc 900 aacgtgttca gctccattat tggaaaggga cattataaac aaaattaaaa ctcatattat 960 tctttgccct caccattgtg ctttttgggt gccttttgtt tagctttgga tttccctaat 1020 agttctacgt tggatcctat ctaagccctt ccactgcata taaacttatt gctttggtta 1080 aagctctgag gttaggaaaa atgtaaagaa agttgttggt atgaggtatg tttttagtaa 1140 gaaaggataa aggaaagaga aataattgta taggaaagaa tcttgtatgg tacatttttg 1200 ttctataata gaatggctgg ttaaggaaag tatagaacaa aaatagaagg ttcaagcatg 1260 tcaaaaaatg gtatgagtat gtcgaccatg gtaggtgaaa gtttgtgaaa agaagtgaaa 1320 ggaattctat aaacactcga ggctacaaaa aggttctata aatttaccat taatatcaag 1380 tacactaatg ctaggcctgc atgctattca aacaccagaa catgagcttc ttaattaact 1440 ccaaaagaat tttctccttt aaacaacaga gggggaagaa agaaacaaaa ttcaactagc 1500 cctcccccac tgtctttgtt acaaaagagc aaggtttttt tttaaattct gagaattgta 1560 aaagatttgc atttagtcta ccttctaaga attttaaaca gcccttacag gtcaatggaa 1620 aggaccagat atccttttaa cctgcgggag aggatgtgct tgtgtttttc cccaggattc 1680 atcttcacct atttgggttc cggatcgcct gattaggcat gtccggcctc ccacccccac 1740 cacagttcaa ctcccgaaag caggaacaaa accgcctgag ctcgactgat gacgaggaag 1800 aagaaaccct cgttcctcag ctaaaacatc ttgctctcga gaggcctcca cgaccgcgac 1860 cgcgttctgc tgcttccacg agaacctgca aaccaccaac ttggggacag ctgaaagtgc 1920 tgacgcatga tgcagaggac ttggttcggt ctcaaaatac ttctcttact ccctctactg 1980 tttttgttgt tatgctggct tacagtcacc ttatgtattt ttggttacca aaagttctaa 2040 ttctttaatt attacccaaa atggaccaaa atataacata tcatgtaaaa attgtattct 2100 tactaattgt ttaactgatt tttcttatag agaatttatg attcttaaac aacctcctta 2160 tttgatggtt cctgtaagtc ttccctccct ctggtttgat gaatatggtg taaaagcatt 2220 atatgaagtt aattcgttat 
tacaacgctc aaaaaggttt atagcaactc tggttttggg 2280 catcactacc ttgataactg tcgccacatc actttcagtt tctgctgtag ctctagctaa 2340 agaagtacat acagctacat ttgcagatca gttatctaaa aatgtgtcct tggcattggc 2400 tactcaagaa attattgatc aaaagttaga gagtagggtg aatgctctag aggaagctat 2460 tcttatcata ggacaagaac ttataacctt aaaaattaag ttagctttgc gttgccattc 2520 agaatttaaa tggatttgtg taacccctct acaggtgaat aaatctatcc atacatggga 2580 taaaatcaaa aaccatattt ctggaatttg gaatcattcg gacctaagca tagatctctc 2640 taaattacag caagaaattc aggatatcag ccaggctgag aaacaattta attctgacca 2700 gttagctcaa tcatttttta ataatctctc ctcttttact aatcaaaagt ctctaatttc 2760 cattttaatt aatgtaggaa tttgtggagt cattatctta tttcttttat gtttgcttcc 2820 agtcatcttc cgattactga aaaacaacat caaccgactg accgtggaat ttcacggctt 2880 tgtattaaaa aacaaaaaag ggggaga 2907 // ID LTR77-int_TS repbase; DNA; PRI; 5551 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Internal portion of an ERV1-type endogenous retrovirus - DE consensus. XX KW Endogenous Retrovirus; Transposable Element; nonautonomous; KW LTR77-int_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-5551 RA Bao W. and Jurka J.; RT "Retrovirus repeats from tarsier."; RL Repbase Reports 11(5), 1737-1737 (2011). XX DR [1] (Consensus) XX CC Its corresponding LTR is LTR77_TS. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 5551 BP; 1562 A; 1235 C; 1310 G; 1444 T; 0 other; gttttggtac ccaacatggg gctcgacgga agggtgagtt aaagcggacc tatcacttta 60 ctttcatttc cgaggatcct cgtcctcaag ttttttttac tctctctaaa gaacaaatta 120 ccgggactct gccagccagt taaaagcgaa tagcatggct gccggtctac aatacgcaga 180 ggtcaggctt gctggagagg acttggttaa tcccccatca ctctcaggtg ttgggaatgt 240 tggccctgtt ccgatccagt ttcttttcat ggaggtctag ccgtcgcgtg gaattggaag 300 aagcttgggg gtaattgagg gcatctggcc aaggctacac cttggtgtta cccaaaggct 360 tccaggctgg ccccaattcc cgacagcccg acagggtgtc ggcagaaggc ttccagtctc 420 tcctatcaca cttttttcag cctttcctat cacattttct tctcttcggg attgccatgg 480 gtactgtccc cacttggtgc ttggtgacgt gggtgttttt gtagcttagg gagctggcat 540 tactagacag tgttggcaac tggcttgatc cttgttgaat aagagctcag gacaaggtgt 600 gttcattggt ccacgaggca aaagggtagg aagctttggt ttcgtttttt catgttggga 660 agcattgtaa ttggcaagaa tgagaaactg ctttatatga ggaatttaag tcttgtgaag 720 ttaggtccct acatttgggg actctatggg tagacaaatg tcaaatgaat gtgcagactt 780 tatgttgaaa tgcttggctg acggtgggtt taggcttttc agaagatgtc agggactgtc 840 tcaaaagatc ctgtaccggc actttcatgt ctccatattt gtgggccttg ttaggaaagt 900 aatctgaatg ttgtaacctg gctcaaacca tgggcctcct ggccaggggc tcatggaaag 960 tcccaggtaa accccccccc ctttgtcaga atcctgctcc taaacccaat cattccaagg 1020 cctttttgaa gacctagtct ggtaaggagg aatgctaggt ttgaggaatc cagaggcaca 1080 gtcgacagcg cgtaggacag gtattaatgc gtaggtgagc atgactattc ctactgaccg 1140 ggtcgctacc gctatacttg ggtggaggtc acgctcgcat ccaaggggac gccatgagca 1200 gagaaggaca ggaaagtgag gggacgctct cctttctttg atccttattc atcatgtgcg 1260 ttggttggag gagggaaagc ttagggatgc ctttctttca cctctctctt attcagatgg 1320 gtaacaggcc atctggctat ctgcagtgtg cactcaagag tgcatcctgg gacatgggac 1380 tcctttgatc ctcagacttt aaacagcatc ttaaattctt ttgcacccaa gcgtggctga 1440 attataagtt gcaggatgga gaaacctggc ctccagaggg aagaatcaat cataatacaa 1500 tcgtgcagct agacagtttc tgtagatgcc agggcaaatg gtccgaggtc ccatatgtac 1560 aggctttctt cgcattacga gaaaatccgg acctttgtca atgctgtaga attgattcag 1620 ccctcctagc agtcatctca agggatggtc 
ccaatggacc ggggatgcca accccaaagg 1680 cgccactggg gaaggcatca tctcatgagg ccactccacc tggtccatct tgccctcctt 1740 attcaggtcc tccttgtact tcaaaccatt cccactctaa aacccctcat tctagaactc 1800 cggcattact tatgccccta caagagatgc ttggtgaata tggccctagt aaggtacagg 1860 tacccttttc cctatgagac ttgaaacaga tcaaggaaga tcttggaagg ttctcagatg 1920 accctgacaa gtatatagat actctccaga atttaatgca agtatttgag ctctcctgga 1980 aggatgttat gttgctctta ggccagaccc taatgaccac tgagaaacaa gcagccctgc 2040 aagtggcaga gaagtttgga gatgagcttt gcatttcata tagtgccaga gaaggggagg 2100 agccctatcc aataggcatg tctgcagtgc cattggaaga ccctaaatgg gaccccgata 2160 gtgacttggg agaatggagg aggaaacatt tcctggtgtg catattggaa agcttgagaa 2220 aatagaacta aacctctgaa ttattccaag ttagccatga tagaccaagg ggtcgatgag 2280 aatccctcag cttttctgga aaggctgaga gaggctttga ttaaacacat ccctagaccc 2340 tgaatcagtt gaagggcaac taatcttaaa agacaagttc attactcagg cagctccaga 2400 tatcaggagg aagctgcaga agcaggccat aggaccagat agtactttag agaatctcct 2460 gaaagtggcc acctcagtct tttataacag gatcaggagg aggcacagga aaaagagagg 2520 agacacaaga caaaggcaag agctctagta gctgccttat aggcatatac accccagaat 2580 tcccaaggtg tacctgctaa ctgttatgga tgcagcaagc caagacatta aagaagtaat 2640 agggtgccaa catggaaggc tgcagaacca gagggacaaa agaggcatct caattggcag 2700 tcacctttag tgacaaaact ccaacagccc atcagagacc tggtaacaac ggacctgcta 2760 aggggatggt gtctatagcc ataggtacaa tctgtggggg cactagaagc ctggttgccc 2820 caaagaggcc taaacctcca ggaccttgtc cactgtacag gccagagagc cattggaaag 2880 caggtatgct ctcttctgaa gggaggagag aacacctagc tcaggaatga tggccactga 2940 ggaggcctgg aaacctgcta cagtgcctgg catagctcac catcacccca accatctcca 3000 ggtgaatatt agggcactgg aacccaaatt aatatctgaa ttaacacagg tgccagccat 3060 ttggtcctgg ccatccatgt tggaatatag ccttgttgta attcaagttg ctaggggtta 3120 attcaaagtg taaattgtag aaggttatat agaaagatga tgggagaagc cagaaaagca 3180 cagaaaagat gtaaagaaag ttgttaggaa ggatgcaaag gacagaatgg ttttgtatag 3240 gaaggaatct tgtatggtaa atttttgtcc tgcaataaag tggatggtta cttaaaggaa 3300 ggaaggtata ggacaaaaaa atagagggtt catgcatgtt 
gaaaaatggt atgtcgtaga 3360 atggtatatg gaactaagga gactcaatca gctagaactc attttctaat ttggacatca 3420 gaggctgaga aagcctttga ccaattaaaa caagccttaa ttgcagcacc ggctcttagt 3480 ctttccatag gaaagacatt taacctttat gtgtctgaga gaaaaggaat ggccctggaa 3540 gttctgaccc agacccaggg tccagcccaa caacctgtag gctacttaag caaggaactt 3600 aatctggtgg ctaagggatg gccagcctgc ctaagagcaa tggcagcagt agcgttgttg 3660 gtgccagaag ctattaagct aacaatggga gaaaaaccta actgtgtata ctccacataa 3720 tgtggcagga ctgctatcct ctaaagggag cctctggcta actgataacc gccttcttaa 3780 atatcaggcc ttattattag aaggagctgc aatccactta aaaccatgtt cttctctgaa 3840 cccagccacc ttcctccctg acaatggtgg ggaatgcgaa catgattgtg aacaagtggt 3900 agtacaaacc tatgcagcca gagaggactt aaaggaaacc cctgtagaaa acccagactg 3960 gattctcttt actgatggaa gttcatttgt ggagcaaggg atccgtaagg cagggtacgc 4020 agtagtcact tccaatgaca tcattgaaag tgctctttcc tcaaatacaa gtgctcagct 4080 agccaaatta attgctctta taagggccct caaattaagt aagggaagtc agtaaacatc 4140 tatctgctcc aagtatgcct ttttagtcct tcacacccat gccgctattt ggaaagaaag 4200 acattttctt acagctaatg gatcccctat taaatatcat tgggagattg gcaaactatt 4260 gtcttcagtc ttcctccctc gggaagtggc agtaatacat tgtaaaggcc atcaaaaggg 4320 aacagatgaa atagctgaag gaaatagact agcggatcga acagccaagt tagcagcaag 4380 gaaaacccag atttctgatc catttgaggc ccctctgatt tgggagggcc cctcaaaaga 4440 tataaaaccc caatattctc ctgcagagat agaatgggtt acatctcagg gatacatttt 4500 tcagccttca ggatggctgc aatcagagga tggcaaactt catctgccaa cctcctgcca 4560 atggaaggtt cttaaaaccc tccatcagac ctttcacctc agaagagata aaactttcca 4620 actaaccgaa agctgttttc agtaaaggtt actggaatag acttgtggat ccatcacact 4680 agagtgaaag catggagtcc aggagaagtt gcctcagctg acccggaaga gcatacagaa 4740 tcaaactaaa gatcgtaaaa gataagtgcc actaattaac tgcctatgaa tttcctcctg 4800 atggttctac ctatacttgc tgcctttgtt ttcatcttga tttgtaccgt tagacgtctc 4860 tgtcaaggac cccttaatcc tgagcgtcca tgggattgtt ttcttcccta gaagtctccc 4920 tcacaggaac aagcaaaatg cctactctat cttgcttcct gtttttagta tcattagtcc 4980 tagcttctta taagcaccct ctctcaccct ccattacgta cgtggctcaa 
cagggtaatt 5040 attcaaacgt tggatatgct cacggcggaa tgtggaggaa catgcgcatt attaaatgaa 5100 acttgttgtt tccatattaa cacttcctcc caagtagaag agactttaca ggttataaaa 5160 caataataaa atcattgact ctttacaaga acaggcagtt tcaggaccca gttggttaca 5220 atctttttta tcctctctag ggatttccaa tatatggaca tgggtttttc ccttaatagc 5280 tccttttgtt gtattgcttg tgtttggtcc atgtatagtt aaccttcttg taaagtttgt 5340 ttcttctcgc ctagaggcca tcaaactcca aatggtgatg cagatggaac cttgtatgca 5400 agcttctttc taccgaggac ctttagatcg ccctccggag gatccctagc tgccgtcccc 5460 atacgacgcc catgatcagc aggaagtagc cagaccggtc atcgtcctta ctccctaacg 5520 gcagttaggg ttactactcc agagggggga a 5551 // ID MER4_OG repbase; DNA; PRI; 534 BP. XX AC . XX DT 19-OCT-2009 (Rel. 14.11, Created) DT 19-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; MER4_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-534 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2867-2867 (2009). XX DR [1] (Consensus) XX CC ~81% identical to consensus. 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 534 BP; 134 A; 147 C; 90 G; 162 T; 1 other; tgaggactga actctgacct ttttttttct cttgcccaaa ttcctttcta agaggcctgg 60 tgagtcacgc ctacaaacga taaaatctca ttaaacgggt ttttattaac ccggtataac 120 gtggcttact ttccaacctg attccggtat agcatcacgt gacagataga agacccctcc 180 tatcttaact caagcattcc tttctactga ctccaggtct ttagacaaag cttaactctt 240 tcaaccaatt gccaactaaa gaatccctaa aacccaccta tgacttgtaa gcccccgctt 300 cgagatgtcc cgccttttcg ggccgaacca atgtatacct tccacgtact gatttatggt 360 tttacctgta attcctgtct ccgtgaatgt ataaaaccaa actgtaaccc ggccgcctcg 420 ggcgcatttt ctcaagacct cttgagacca tgttccccgg gccgcggtca cncatattcg 480 gctcagaata aacctcttta aattatttac agagtttggg tttttccgtt tgca 534 // ID L1-3_TS repbase; DNA; PRI; 7127 BP. XX AC . XX DT 09-APR-2010 (Rel. 15.05, Created) DT 09-APR-2010 (Rel. 15.07, Last updated, Version 4) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-3_TS. XX NM L1-3_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-7127 RA Bao W. and Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(5), 769-769 (2010). XX DR [1] (Consensus) XX CC ~91% identical to consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX FH Key Location/Qualifiers FT CDS 2008..3036 FT /product="L1-3_TS_1p" FT /translation="ITELKRKTRDLSQMRKNQRKTSGNFKCTKDKTPEKSD FT LGAQTMEVTNDWPQNTTGIREWMERVNKTQERQDATLKELVTEILAIKNFI FT QEINNKMTSMESRINLAEERISELEDQNMELTRSVKNIERRLRKKEQSLQE FT MCDYIKRPNLCLIGIPEEEREMENNLEQVFQEVIQENFPHLTRDVTSQAQE FT IQRTPTRHQMRRPTPRHIVICLHKVGTKEKILKAAREKGQTTYWGKPIRIA FT ADLSAETLQARRDWSPIFKVLKDKQFQPRISYPAKLSFISDGELKSFPDIQ FT SLRDYAASRPALQETLKKVLSTEERKKRMTTHFPREQQSTESTENTAQQET FT " FT CDS join(3406..5667,5671..6933) FT /product="L1-3_TS_2p" FT /translation="MIKGSIHQQEISILNIYAPNTGAPAFIKQLLSKLKKD FT IDSNTIIAGDLNTPLTALDRSSRQKINKEIQNLNLTLDQMDLIDTYRLFHP FT TTTEYTFYSSPHGTYSKIDHILGHKSSINKFHKVEILPCTFSDHSGIKINI FT NTNNISPKPTKTWTLNSMMLNNYWVNTEIKAEIKRFLETNENEETSYQNLW FT DAMKTVLRGEFISLRAHIKKMERSQVESLTNHLRELEREDHQNPNFSRRIQ FT ITKIRAQIWDIEDKNIIENINKTKSWFFERINKIDGPLARMTKKKREKAQI FT NTIRNTKDQITTDPEEIQKIIRDYYVHLYGNKLDNLNEMEDFLTSHNLPRL FT KQEDIETLNRPITTQEIDSVIRKLPTKKSPGLDGFPAEFYKTYKEELIPIL FT LKVFQAIEKDGILPKSFYEANITLIPKLGKDPTKKENYRPISLMNIDAKIL FT NKILANRIQKHISKIIHHDQVGFIPRMQGWFNIRKTINVIKYINRCKNKNH FT MIISLDAEKAFDKIQHPFLIKTLKHLGIEGTYLKIVSAIYDKPTANILLNG FT QKLETFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIRGIQIGKEEVK FT LSLFADDMILYLENPRESVKNLLALIKDFGKVSGYKINVQKTVAFLYTNNK FT QAETQIKSTIPFTIATQKMKYLGIFLTREVKDLYNENYKTLLKEIQEDTNK FT WKNIPCSWIGRINIVKMSILPKAIYRFNAIPIKLPTSFFSDLEMTIQKFIW FT KHKPRIAKTILSKRNKAGGITLPDFKLYYKATVIKTAWYWYKNRHIDQWNR FT IEIPEAKPQFLNQLIFDKASNNNHWGEENLFSKWCWENWLTTCRRLKQDPY FT LSPYTKINSKWIKDLNVKPQTIRILENAGDTLMEIGTGNQFLFRTLKAHTL FT RNKIDKWDLIKLTSFCKAKETIKRAGRQPTEWEKIFANCISDKGLTSRIYK FT ELKRAKRKKTNSPIKKWAKDMNRHFSKEDIRAANRHMKKCSASLIIREMQI FT KTTLRYHLTPVRMAIINKTNNNRCWRGCGEKGTLLHCWWECKLVQPLWKTV FT WRFLKELEIDLPYDPAIPLLGIYPEELKSFYKKDTCTRMFVAALFTIAKTW FT NQPCCPSKLDWIKKMWYIYTMEYYAAIKKDKHMDFAATWMDLESIILSDLS FT QKQRTEYHMFSLISGP" XX SQ Sequence 7127 BP; 2574 A; 1584 C; 1433 G; 1536 T; 0 other; gaggtcggga agatggcggc cgagccagtc ggactatgcg agtcccggtg tgagtgagtg 60 agtgtatgga aaattttgtg tacgtgctgg tgagcgagtg ggcatgtgag 
tggctgactg 120 agtgagggct agggagctag tggccagggc tctcctggtg cagcggctgg tgccacagct 180 gctgggcttg gaccctagcg attcccggga cgcagaagcc gagcgccatt ttctttttct 240 tttctttttt ttttaaatca acaaaccctc cctgggaaga aaatgctggg agagactgtc 300 taagtggggg ctcgtgagca agtggccagg cccctggtgc aacagctggt gctgcagctt 360 ctctgcttcg tccctggctc ttcccaggtc gcagaagccg agcgccattt ttttttttaa 420 acagactaaa gcggcgggga gaaactatcg ccccaggctc tccctcacca ttgcagcagg 480 agagagactg ccggctccag gctttgcctc accacttcag cacctggcag ggcactggca 540 gcaggggctg cggagggagt ccagaggcaa agtcccatgg aagaggacaa gagaggcacc 600 ccaccaccat cttgaaggcg gggcaggaaa gtgctccgac ttgctcacca cggagcaaca 660 gacccaatct ctgtgactgg cggatcaact actcagggat tgatcgtgac ccagagggaa 720 tatcctgaga gcacaaactt cggagggacc tgacgagtgc tgagttccag gcgccccacc 780 ccctaccctg atccagccgt gggacaaagg gtgtggccag aaaaggacga ggaagaactc 840 caaagagatc agtggtgagt ccggggctga cccaaacccc ctcctcccca gcaatcctct 900 gaacatcaga cccacacagc tgggccacga ctgagaggga caaacccagg cggagcctgg 960 acatcagtcg caataggaga actccattcc aggataggcc tggctggggc acccccgtga 1020 gagtacatac ctggggaaga gacaatcccg tacctcccca ccggaggaga tctccctgac 1080 catacgagag tgggagggga ggactgcagc cccacccacc tccactgccc ctcctccttc 1140 cctgcttcgg acactgagga gagagtgaaa gaggtggctc tgcaggagag acctgattag 1200 aggaacaatt acagagcctg ccacaagagg gcaggttttt ttgtttttta atttttaata 1260 atttttttaa atttttgact gtttgttagt tcatctattt ttaaaatatt tttcctttgt 1320 gcatgaatgt gtatgtgagg gtttgatctg catcccccca aaattttttt tcttttcttt 1380 ttctgtttct ttcttctttt atttgtcgtt ttctgtttat gtgtgtatgt gaatgattga 1440 gtgttcatct ttcaattttg ttgttgttgt tggtttgttt ttgctctttt gttcttctgg 1500 gtttttttta attttcattt tttatagttt aaggtgtgtg tcaatatatg tgtgagggtt 1560 cgacctgaat tcccctctcc ggtctttctg tttttcttct ctattcactt tgtgctaggc 1620 atgtgtgtgt gtgtttagtt ttcttggttt tttgttgttg tttgttttgt ttcactttgt 1680 tgttattcca gtttgttttg ggggtgtgtg tgtatgtctg tgtgtgtgtt ttgcattggt 1740 ttctgctttg catcatcata tagagtggtg gtggggagtg ggtctcaggg aacacatacc 1800 acaataagtg 
aacattgtaa tttgcaagta gtttctgaat cctagcgccc catcccctac 1860 tttattcgaa tttcaaaaac tacaagagac agcaaagctt cccatccacc atcctatttc 1920 aaaatagaac aaatccagag cccggatcag ggaagaatag aggactgagc aaacatcaga 1980 agggaaagcc tctcccactg acactgaata acagaactaa agagaaaaac aagagatctg 2040 tcacaaatga ggaaaaacca aaggaagacc tcaggcaatt tcaagtgtac caaagacaaa 2100 actcctgaaa aatcagacct tggtgcccaa acaatggaag tcaccaatga ctggccccag 2160 aatacgacag ggattaggga atggatggaa agagtgaata agactcaaga aagacaagat 2220 gcaacactga aagaacttgt aacagagatt ttggcaataa aaaatttcat ccaggaaata 2280 aataacaaga tgacaagcat ggaaagcaga attaatctag ctgaagaaag aatctcagaa 2340 cttgaagacc agaacatgga attaacccga tctgtaaaaa acatagaaag aagactcaga 2400 aagaaagaac aaagcctaca agagatgtgc gattatatca agaggccgaa cctatgcctg 2460 attggtatcc cagaagaaga aagggaaatg gagaacaact tggagcaagt attccaggag 2520 gtaatccaag aaaactttcc ccatctcacc agagatgtga ccagccaagc acaagagata 2580 cagagaaccc ccacaagaca tcaaatgaga agaccaaccc ctagacacat agtaatttgc 2640 ctacacaaag taggcacaaa agaaaaaatc ctaaaggcag caagagagaa aggtcagact 2700 acctactggg gaaaaccaat cagaatagca gcagacctat cggcagaaac actacaagca 2760 agaagggact ggagccctat attcaaagtc ctcaaagata aacaatttca accaagaatt 2820 tcctatccag ccaagctcag cttcatcagt gatggagaat taaaatcctt cccagacatc 2880 caatccctaa gagattatgc agcttccaga ccagctctac aggagacact taaaaaggta 2940 ttaagcacag aagaaagaaa aaaaagaatg accacacact tcccaagaga acagcagagc 3000 acagaatcaa cagagaatac agcgcaacaa gaaacttgaa aacacacaca tacatcaacc 3060 ccaaagccaa aagaaaacaa gcaaacaaag aaaaaacttt ataagaacct catgacaggg 3120 ataaacaatc acatttcaat aatcagcctg aatgtgaatg gactaaatgc accactgaaa 3180 agacacagaa tggcaaactg gataagaaac catgacccaa ttatttgctg catccaggag 3240 actcatctca ctacaaggga tgcacacaga ctcaaagtta aaggatggaa aatgagtttc 3300 caggcaaatg gatcacaaaa gaaggcagga gttgcgatct taatatcaga caaaacaacc 3360 tttaagctat caaaaattta aaaagatgca aaaggacact acataatgat aaaaggttca 3420 atccatcaac aagaaatatc catcctaaac atatatgcac ccaacacagg agcaccagca 3480 tttataaagc aactactaag 
taaactaaaa aaagatattg actctaacac tattatagca 3540 ggggacttga ataccccact gacagcccta gatagatcat cgaggcaaaa aatcaacaag 3600 gagatccaga acctaaactt gacactcgac caaatggact taatagatac ctacagatta 3660 ttccacccaa caaccacaga atatacattc tactcatcac cacatggaac atactccaag 3720 atcgatcaca tccttggcca taaatcaagc ataaacaaat tccataaggt tgaaatcttg 3780 ccatgcacat tctcagacca cagtggaata aaaataaata tcaacaccaa caacatttcc 3840 ccaaagccca caaagacatg gacactaaac agcatgatgc tgaacaacta ctgggtcaac 3900 actgaaatca aagcagaaat taaaagattc ctggaaacaa atgaaaatga agaaacatct 3960 taccaaaacc tctgggatgc catgaaaaca gttctaagag gggaatttat atctctacga 4020 gcacacatca agaaaatgga aagatcacaa gtggagagcc taacaaatca cctaagggag 4080 ctggaaagag aagaccacca aaaccccaac tttagcagaa gaatccaaat caccaaaata 4140 agagcccaaa tatgggacat agaagacaaa aatatcatag aaaacatcaa caaaacaaaa 4200 agctggttct ttgaaagaat taacaagatt gatgggcccc tagccagaat gaccaagaaa 4260 aagagagaaa aagcccaaat aaacacaatc agaaatacaa aagatcaaat cacaactgac 4320 cctgaagaaa tacaaaagat tatcagagat tactatgtac acctatatgg aaacaaactt 4380 gataacctaa atgaaatgga ggactttctg acatcacaca acctccccag gttgaaacaa 4440 gaagacattg agacactaaa tagaccaata acaacccagg aaattgactc tgtcatacga 4500 aaactaccta ccaaaaaaag ccctggactg gatggctttc cagcagaatt ctacaaaacg 4560 tacaaggagg agctgatacc aatcctattg aaagtattcc aggcaattga gaaagatgga 4620 attctcccca aatcatttta cgaagctaac atcacactga tacccaaact gggtaaagat 4680 ccaacaaaaa aagagaacta caggccaata tcccttatga acatagatgc aaaaatcctc 4740 aacaagattc tagcaaatcg gatccaaaaa cacatctcaa aaatcatcca ccatgaccaa 4800 gtaggcttca tccccaggat gcagggctgg ttcaacattc gcaagaccat aaatgtaatt 4860 aaatacatca acagatgtaa aaacaagaac cacatgatta tatcattaga tgcagaaaaa 4920 gcttttgata aaatccagca tcccttcttg ataaaaaccc tcaaacacct aggtatagaa 4980 ggaacatacc tcaaaatagt aagtgccatc tacgataaac ccacagctaa catattgcta 5040 aatggacaga aactggaaac atttcccctg aaaactggaa caagacaagg ctgcccactc 5100 tcacccctct tgttcaacat tgtgttggaa gtcctagctc gggcaattag acaagagaag 5160 gaaatcaggg gtatccaaat aggaaaagag 
gaagtcaagt tatccctctt tgctgatgat 5220 atgatcctat accttgaaaa tccaagagaa tctgtcaaaa acctgcttgc actgataaag 5280 gactttggca aagtctcagg gtacaaaata aatgtgcaaa agacagttgc attcctatac 5340 accaacaaca agcaggcaga gacccaaatt aaaagcacaa tcccattcac aatagccaca 5400 caaaaaatga aatacctcgg catcttccta accagagaag tgaaagacct ttacaatgag 5460 aactacaaaa cactgctcaa agaaatccaa gaagacacaa acaaatggaa aaatattcca 5520 tgctcatgga taggaagaat caacattgtt aaaatgtcca tcctaccaaa ggcaatctac 5580 agattcaacg caatacccat taagttacca acatcattct tctcagacct ggaaatgaca 5640 atacagaaat tcatatggaa acataaatga ccacgaatag ccaaaacaat ccttagcaaa 5700 agaaacaaag caggaggtat cacacttcca gacttcaaac tttactataa ggctacagta 5760 atcaaaacag cctggtattg gtacaagaac aggcacatag accaatggaa caggatagag 5820 attccggaag caaagcctca atttctcaac caactcatct ttgacaaagc ctccaacaac 5880 aaccactggg gagaggagaa cctattcagt aaatggtgct gggaaaactg gctgaccaca 5940 tgcagaagat tgaaacagga cccctaccta tcaccataca caaaaattaa ctctaaatgg 6000 atcaaagacc taaacgtaaa acctcaaact ataagaatct tagaaaacgc aggagacacc 6060 cttatggaaa ttggaactgg caaccaattc ctattcagaa ccctaaaggc ccatacctta 6120 agaaataaga tagacaagtg ggacctcatc aaactaacga gcttctgcaa agcaaaagaa 6180 accatcaaga gagcagggag acagcccaca gaatgggaaa aaatatttgc caactgtata 6240 tctgacaaag gcctaacatc taggatctac aaggaactca aacgcgccaa aaggaaaaaa 6300 acaaacagcc ccattaaaaa gtgggcaaaa gacatgaata gacacttctc aaaagaagat 6360 atacgggcag ccaacagaca catgaaaaag tgctcagcct cactcatcat cagagaaatg 6420 caaatcaaaa ccacattgag ataccaccta accccagtaa gaatggccat cattaataaa 6480 acaaacaaca acagatgctg gcgaggatgc ggagaaaaag gaacgcttct acactgctgg 6540 tgggaatgca aactagtgca acctctttgg aagacagtgt ggcgatttct gaaagaacta 6600 gaaattgacc ttccatatga cccagcaatt cccctattgg gaatataccc ggaggaactc 6660 aaatcattct acaaaaaaga tacctgcaca cgtatgtttg ttgcagctct attcacaata 6720 gcaaaaacat ggaaccaacc atgttgccca tcgaagctgg actggataaa aaaaatgtgg 6780 tacatataca caatggaata ttatgcagcc ataaagaagg acaaacatat ggacttcgca 6840 gcaacttgga tggatttgga gtcaatcata ctcagtgatc 
tatcacagaa acaaagaaca 6900 gagtaccata tgttctcact cataagcgga ccttgaacaa ttataatact ataagaaagg 6960 gattggcagt agtgggaaac tgtcagggga ggggggtggg ttggcagttg agggaaactg 7020 ctaggggagg gaggggcata cctcatcaac aagggtacct gcataatcaa catttgtata 7080 cctaaccctg aattgtaccc cacatcttta taataaaaaa agaaaaa 7127 // ID LTR7_Mim repbase; DNA; PRI; 549 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR7_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-549 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2958-2958 (2009). XX DR [1] (Consensus) XX CC ~91% identical to consensus. 5bp tsd. CC Similarity to LTR7Y from primates. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 549 BP; 136 A; 145 C; 139 G; 129 T; 0 other; tgttaggttt gttccaggtg ggaacccgaa aagtatgtat aggacagaat gtggttaatg 60 caaaacttag gcttgtaaaa caatagccgc aggagtctgg ggagtcccgg actttgtaga 120 tctgacagac agctgcagcc agcattcctt ccccctccct ccctgatgga gaccaaggct 180 atccctccct agctctcctg ataaggacga gccagagcag aaggatggat gcgactgggg 240 caaaggggtc gtaggagttg gttgtttgaa gaagaattta ctacaaacag ctgttctgcc 300 caccgtcctc tgcactgcac gtagcaggaa gcaattcttc acctgccagt ttggcccaga 360 aacaaagaac tgaacagggc aggtgggagg gggtcggcta tccccgttta taaaaaccct 420 cactaaaccc ccttctgtgc tgactctctt ttcggactca gcccacccgc acccgggtga 480 ataaaccagc catgttgctc ccacatagcc tgtgtgtgaa atgtgtcctt tgggtcattc 540 accctttca 549 // ID CYN-III3 repbase; DNA; PRI; 229 BP. XX AC . XX DT 02-MAY-2006 (Rel. 11.04, Created) DT 02-MAY-2006 (Rel. 
11.04, Last updated, Version 1) XX DE SINE element from Cynocephalus - consensus sequence. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; CYN-I; CYN-III3. XX OS Cynocephalus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Dermoptera; Cynocephalidae. XX RN [1] RP 1-229 RA Schmitz J. and Zischler H.; RT "A novel family of tRNA-derived SINEs in the colugo and two new RT retrotransposable markers separating dermopterans from RT primates."; RL Mol Phylogenet Evol 28(2), 341-349 (2003). XX DR [1] (Consensus) XX SQ Sequence 229 BP; 50 A; 68 C; 77 G; 34 T; 0 other; ggccggcccg gtggcgcact gctacagtgc gccgcttggg agcgcagcgt cgctcccgct 60 gagggttcgg atcccacata cagaccggtt cccgctcact ggctgagtga ggtgcaggcg 120 caacgccgag ggttgcgatc ccattgccgg tcccggtccg gtacgggcgc gacactgagg 180 gttgcgatcc gttgccggac acgaaaaaag acaaaaagga aaaaaaaaa 229 // ID L1P4c_5end repbase; DNA; PRI; 1583 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE L1 Non-LTR Retrotransposon from primates. XX KW L1; Non-LTR Retrotransposon; Transposable Element; LINE; KW L1P4c_5end. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-1583 RA Smit A.F.; RT "L1P4c_5end - L1 Non-LTR Retrotransposon from primates."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC L1PA16. 
XX SQ Sequence 1583 BP; 378 A; 518 C; 443 G; 226 T; 18 other; gggttcaaga tggcggacta gaagcagcta gtgtgcgccg ctctcacgga gaggaaacaa 60 agtggcgagt aaatactgac tcttcaagcg gatcgtctaa gaaaccatgt caggatccat 120 caagggagca aggggacaca cggagaacag agaagagcga agctgggcag ccgcccaccc 180 gggaccagcg cggagccagg agaagctccc taacacaggg aaagggtgag ngagtgagag 240 cccccagggg atccacactt cccacaggga cctgtgcaat cctgggaacg ggagaaccct 300 cctgaccccc ncgggcctct agactgatac agagagccgc ccagagtttt tgcggaggca 360 acactcaagn ccacagggag ccccacaggc cttggatccc ngagcagccc ggcgccagct 420 gccatagccc caatagaggc cacagtcgtg gtgccnggga gcagtcagat tgctccactc 480 ctcttcgcca gacaaggctc ggcnccagct tccagcacag tggccccgcc tctgcctgaa 540 ctctgtgggc aggcacagct ctgtgttccc cgggaagcac cnggacggcg gancgggcga 600 ctccacccac ccccgctgct cntagccggg cgggncncgc cggctngggc ttccagcaca 660 gcagncccgc ctctgcctga actctgcggg tgggcacagc tctgtgttcc cccgggaagc 720 acccagatgg cagatcgggt gactccaccc acccccgctg ctcctagcca ggcgggacnc 780 gccagcttgg gcttccagca cagcggcccc gcctctgcct gaactctgcg ggcgggcaca 840 gctctgtgtt cccccgggaa gcacccagat ggcggattgg gtgactccac ccacccccgc 900 tgctcctagc cggacgggac tcactggctt gggcagcgcc caagcaggag ggagccccca 960 ctctcagaac actgagaggg gtgagacgcc tgggttcatg ggctggcggg ggagcagggt 1020 gtgcctccct ccgcagggcc agcccaggaa gggtatggcc tgtctgccag ccgcggcccc 1080 tgcctgaggg agccccgcgg cccagaacac ctaacaaagg aaatgcaggc acggcgccag 1140 tgatcagang gggctcctcc aaggcccagg agcggacctg gtgagggggt catctctctc 1200 cccacccnac cacagagcac tactgcgaac tgcgccaaaa tacaaaagag ccacgtggct 1260 gagtaagagc ctatctgccg gccatcactc ttaagcgcca cctactggat cgcagcccaa 1320 attacaacac caaaaatatt ttgccagtat acagcgcctg tgaaacctaa ggcaaaaatc 1380 cagccacaaa taaagatcct gtacagagcc ctggccttct gaaagcaccc agaaatgaag 1440 ccaactgact atactcaact tacatcacag ttaaaggaac accagccctc ncagatgaga 1500 aagaatcagc acaagaactc tggcaattca aaaagccaga gtgtcccctt acctccaaan 1560 aaacacacta gtcccccagc aat 1583 // ID ERV2-1_TSy-I repbase; DNA; PRI; 7114 BP. XX AC . 
XX DT 06-DEC-2009 (Rel. 15.09, Created) DT 06-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Internal portion of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV2-1_TSy-LTR; ERV2-1_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-7114 RA Jurka J.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1202-1202 (2010). XX DR [1] (Consensus) XX CC ORFs are not fully reconstructed. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX FH Key Location/Qualifiers FT CDS 345..1718 FT /product="ERV2-1_TSy-I_1p" FT /translation="MNFSPRRKWMEEEVIGYHKRKNLANVAKAPPPIPFTR FT NKVKTPKFTPGYVSLANVGMQGARKKAEEEGDFTLSSCFPVIYMEEGTVNE FT APHWEPLPYKLIKELKEARTRYGPTAPYTITIMEQFATRWMTPQDWRQLAK FT SCLSGGAYLLWETEFEDQVRIQKAANQQRDPNQFHISSNMLLGRGLYESVD FT SQLILPKEVLLQVNSCASEAWKKLPINSTQTASLSDLKQKTDETYEEFLSR FT LTTAVNRVISNAEAANILIKQLAFENANSTCQALIRPIRRTGQLTDYIRQC FT ADINPAXMQGMAIAAALKGETYAQFVQGMQNGNNRSKNSSVLTCYNCGQPG FT HISRNCPAKRGFQEALMQQPNESSRPKSLCPRCQKGFHWARDCKSKFHKNG FT TLLTKQRSTVPIPGNGQRGRPRPQVTIGAASSMNPFVPFVPSQNSSEPPQG FT AQDWTSVPPPQQY" FT CDS 2383..5076 FT /product="ERV2-1_TSy-I_3p" FT /translation="MRSLKGAWKEQTGHKRTCSGCREERPVWYWLFKFALG FT AIVPEAADPITWKDDKPVWVEQWPMPSEKLKAARELVQQQLEAGHIEPSTS FT PWNTPIFVIKKKSGKWRLLQDLRAINATMEAMGALQPGLPSPVAIPADYNI FT IVIDLQDCFFSIPLSSQDCKRFAFSLPSHNLKQPFQRYQWKVLPQGMKNSP FT TLCQRFVDAALQDIRNQYPNLYLIHYMDDILLAHADRAHLHNVLNETVTAL FT ANCGLKISPEKIQTNPPFTYLGRVLNTQTVSHQPLQLRKNHLVTLNDFQKL FT LGDINWLRPYLHVTTADLKPLFDILQGDPSPTSKRVLTPEAKEALTLVEKA FT IESQRVKRVSYDIPWELIVLATSHTPTGCLWQEGPMEWLHLPVSSKKVLAS FT YPYLVALLIIKGRQRSVELFGKDVDNIIIPYDKTQFQTLLQTNDDWQVALI FT 
DFRGQILFHLPRNPLLQFVKCNPIIIPRACSLVPLEGAVLVFTDGSSNGKA FT VAVINDKPHVLSTSETSAQKTELRAVLYAFTALKDKRFNLYTDSRYVQGLF FT PHIETATIPVSKTTILELLIQLQQLIHVRKEQFFVGHVRSHSGLPGPLHEG FT NQLADNLTKPAIYATLETEAVTQAQLSHNIHHQNATALRYEFQIPREAARQ FT IVKNCPACPTFVNVPAGVNPRGLKPNVLWQMDVTHVPTFGKLSYVHVTVDT FT FSHVILASARTGEAYKDVVQHLFFCFSQLGMPKQLKTDNGPAYTSKSFQQF FT CSQFNIFHSTGIPYNPQGQAIVERAHQTLKNQIHKLQEGEFRYSSPHHVLN FT HALFVINHLNLDSSGNSAMARHWNPEGNVKPMVKWKDLLTGQWRGPDILLT FT SGRGYACIFPQDASTPVWIPDRFIRNAPANPKTETGGMNPVESKE" XX SQ Sequence 7114 BP; 2085 A; 1459 C; 1520 G; 2049 T; 1 other; agtggcgcct gaacgtggga cctcaaggta agctttctct ttctttttgg gatgcaggac 60 ccctctgtag agacaggcag agcaaggaag ggtgagcagt ttaagagcca tggcagaatc 120 atgggccaga gtgagtttgt tcataagtat tctgatgtct attagtattt tgcacatatt 180 gtccgataga ggaaccggtt gtcttgactc ttgcattttg tacgccttag ctttatggaa 240 tattatccgg gatgttttaa acccacagta cgattcctcc aaaattccgg ccaaaaataa 300 agacactccc ctgctggccg tagcctcggc gcctccttta gacgatgaat ttttccccga 360 ggaggaaatg gatggaggag gaagtgattg ggtatcataa gcgaaaaaat cttgctaatg 420 ttgccaaggc acctcctccc atacccttta ctagaaataa ggtgaaaaca ccaaaattca 480 cacctgggta tgtgtctctg gctaatgtag gtatgcaggg agccagaaaa aaggcagaag 540 aagagggaga ttttaccttg tcttcttgct ttccagtaat ttatatggaa gaaggcacag 600 taaacgaggc acctcactgg gagccgctac catataaatt aattaaggaa cttaaagagg 660 ctcgtactcg gtatggtcct acggctcctt atactattac tattatggaa caatttgcca 720 ctcggtggat gacaccacaa gattggcgac aacttgctaa aagctgtttg tcaggaggag 780 cctatttact atgggaaact gaatttgaag atcaggtaag gatccagaag gctgctaacc 840 agcagaggga tccaaatcag tttcatattt ctagcaatat gctgttagga agaggattat 900 atgagtcagt tgacagtcaa ctaatcttac ccaaggaagt tttactccag gtaaattctt 960 gtgcaagtga ggcttggaag aagttaccta ttaattctac tcagactgcc tcactatctg 1020 atcttaagca gaaaacagat gaaacttacg aggagtttct ctctagactt acaacagcag 1080 taaatagagt aatctctaat gcagaagcag caaatatcct tattaaacag ttggctttcg 1140 agaatgctaa cagtacttgc caagctttga ttagaccgat tagaagaaca ggacaattaa 1200 ctgattatat caggcagtgt gctgatatta accctgcckt 
tatgcagggt atggcaatag 1260 ccgccgcctt gaagggagaa acttatgcac aatttgtgca gggaatgcaa aatggtaaca 1320 atagatcaaa aaacagctcg gttttaactt gttataactg tgggcagcct ggccatatta 1380 gtaggaactg cccagctaaa cgagggtttc aggaagctct gatgcaacag cctaatgaga 1440 gctctcgacc aaagtctctg tgtcctaggt gtcagaaagg atttcattgg gctagggatt 1500 gcaaatccaa atttcataaa aatggcactc tgttaactaa acaacgatct acggtgccga 1560 ttccgggaaa tgggcagagg ggccggcccc ggccccaagt aacaataggg gcagcttcca 1620 gcatgaaccc gtttgttccc tttgtcccat ctcagaactc atcagagcca ccccagggag 1680 cacaggactg gacctccgtt cctccaccac aacaatatta actccagatg ttccagttac 1740 cccaattcca acgggagtga agggccctct cccggatgga atagtgggta taatattagg 1800 aagaagttcc ttgtctctac aaggagtttc tgtgattcct ggagtggtgg actctgacta 1860 cacaggggaa atccaggtac ttgttgctcc ccctagcaaa actgtacaat tttatgaagg 1920 tcaaagaata gctcagttac tactcttacc ctatcataaa atgggcaagt ctctaaccga 1980 taagcccagg ggtgactctg gatttggctc aagcaatttc gctttctggg ttcaggaaat 2040 tactaactcc agacctctaa aagatttata tattcaggga gttaaaatta aaggtctgct 2100 agatacaggg gctgatgttt cttgtattgc tgggaaagac tggccgtctt cctggccaac 2160 acaagcagcc ccatcaggac tcattggtat tggtcgagcg ccctctgtag ccaaaagttc 2220 tcaaatctta aattggacag ataacgaaac ttcaggaacc ttttgtccct acatcattcc 2280 ttccatcccc attacacttt ggggacgcga tattcttgct caaatgggca tgcttctgta 2340 tagtccagat gacaaggtgt cagcgcaaat gctacaaata ggatgcgatc ccttaaaggg 2400 gcttggaagg aacagacagg gcataaaagg acctgttcag gttgcagaga ggaaagaccg 2460 gtctggtatt ggcttttcaa atttgcccta ggggccattg ttcctgaggc tgctgatccc 2520 attacctgga aagatgataa accagtttgg gtggaacaat ggcccatgcc ctcggagaaa 2580 cttaaggcag caagagaatt agttcaacag cagttggagg ctggccatat agagccttct 2640 accagtcctt ggaatactcc tatatttgtt ataaaaaaga aatctggaaa atggagactt 2700 ttacaggatc taagagccat taatgccacc atggaggcta tgggtgcatt gcagcctggg 2760 ctgccatctc cggttgctat tcctgctgat tataacatca tagttattga tctacaggat 2820 tgcttttttt ctataccttt aagttctcaa gactgtaaaa gatttgcatt tagtttacct 2880 tctcataatt taaagcagcc gttccagaga tatcaatgga aggtcctgcc 
acaaggtatg 2940 aaaaacagcc ctaccttgtg ccagaggttt gtagatgcag ccttgcagga tataagaaac 3000 caatatccta atctttattt aattcattac atggatgata ttttattggc tcatgcagat 3060 cgtgctcact tgcataacgt tttgaatgaa acagtaacag cattggctaa ttgtggtttg 3120 aaaatttctc ctgaaaaaat tcaaaccaac ccgcccttta cttatcttgg ccgagtgtta 3180 aacactcaaa ctgtgagtca tcaacctcta caattaagga agaatcattt ggttacctta 3240 aatgattttc agaaattatt aggagatatt aattggctga gaccttattt acatgtcact 3300 actgcagatt taaagcctct ttttgatata ctgcaggggg atccctcccc cacttctaaa 3360 agagtcttaa cacccgaggc aaaagaggcc ttaactctcg tggaaaaggc cattgagtcc 3420 caaagggtga aaagggtaag ttatgatata ccatgggaat taattgtgtt ggctacttct 3480 cataccccaa cagggtgctt gtggcaggaa ggacctatgg aatggttaca tttaccggtt 3540 tcctctaaaa aagttttagc atcttatccg tacttagtag ctttgctaat aatcaaagga 3600 agacagagaa gcgtggaatt gtttggaaaa gatgtggata atatcatcat cccatatgac 3660 aaaactcagt ttcaaacctt gttgcaaact aatgatgatt ggcaagtggc tttgattgat 3720 tttagaggac aaattttgtt tcatttgcct agaaatccat tgttgcagtt tgtgaaatgt 3780 aatcctataa ttattcctcg tgcatgttcc ttagtccccc tcgagggagc agtattggtt 3840 tttactgatg gctcttctaa tggtaaagca gtggcagtaa ttaatgacaa acctcatgtt 3900 ttaagtactt cagagacttc agcacaaaaa acagaattaa gagctgtctt atatgccttt 3960 actgctctaa aagacaagag atttaacttg tatactgatt ctcgttatgt tcaagggctg 4020 tttcctcata ttgagacagc aactattcct gttagtaaaa ccacaatttt ggaattgctt 4080 atacaactgc aacaattgat tcatgttaga aaagagcaat ttttcgttgg ccatgtcaga 4140 tctcattccg gccttccggg acctctacat gagggtaatc aattggctga taatttaaca 4200 aaacctgcca tctatgcaac tttggagaca gaggcagtca ctcaagctca actatctcat 4260 aatattcatc atcaaaatgc tactgcttta cgttatgaat ttcagatacc tagggaagca 4320 gcaaggcaga ttgtcaagaa ttgtccagca tgtcctactt ttgttaatgt tccagcagga 4380 gttaatccac gaggattaaa acctaatgta ttatggcaaa tggatgtaac tcatgtacct 4440 acatttggaa aattgtctta tgtgcatgtt actgtggata cgttttctca cgttatttta 4500 gcctcagcaa gaactggaga agcttacaag gacgtggtgc aacatttatt tttttgtttc 4560 tcacaattag gtatgccaaa acaacttaag actgacaatg gtcctgcata tacatcaaaa 
4620 tcttttcaac agttttgttc tcaatttaat atttttcatt ctacaggaat tccttataac 4680 ccgcagggtc aagccatagt tgaaagagcc catcaaactt taaaaaatca aatacataaa 4740 ttacaagagg gggagtttag gtatagttct cctcatcatg ttttaaatca tgctttattt 4800 gttataaacc atttaaattt ggattcttca ggcaattcag ctatggctag acactggaac 4860 ccagaaggaa atgttaaacc tatggtaaaa tggaaggatc ttctcacagg ccaatggaga 4920 gggcctgata tattactaac cagtgggaga ggatatgctt gcatatttcc acaggatgcg 4980 tcaactcctg tttggattcc tgatcgattc attcgaaatg cccccgctaa cccaaagact 5040 gagacgggtg gcatgaatcc tgttgaatca aaggaataga ccggagaaaa atgtaaacaa 5100 aataagggtt ttcacttcct ttgtcctctc ctattcctcc tacaactcca attcctgcct 5160 tccctactac tcttgtttct tgtaatctgt tctgtgagtc ccttcgcctt tatctgggaa 5220 ggaagggcag gcgtctacgg ccgtggcgcc tagctctgac tttcagtctt tctcttttac 5280 cttcggagaa ggggcaggtg ccaaaggcca cggcacctga ccccagcctc cagtctttct 5340 cttttacctt tggagaaggg gtgggtgcca aaggccacgg cacctaaccc caacctccag 5400 acttctcacc cccccctctt ctcaaggggg tggctcagac aaaaggatgt taaaaattct 5460 tctttgttgt cttcttatga taaatttctc tcaagttcaa ctttcccaga cggaagtaaa 5520 ggaatcatat aagtgaatcc tgttagctaa ctacaaaata ggtgttaata agcagactag 5580 actggaagga caagttcttt tggatgtatt ttatctctca gtcagcactg gcgcaaactc 5640 caaaaattag aaaagggccc acaaacccct caactataac tattggacac agcctttaaa 5700 gtctttaaca actgggagga gaaaagaatg caggaccaaa aacataagat aaacttactg 5760 cgtgcagtcc gagacatcga ggtgggagaa gagctcacca tctgctacct ggatatgctg 5820 atgaccagtg aggagcgccg gaagcagctg agggaccaat actgctttaa gtgtgactgt 5880 atccgatgcc aaacccagga caaggtgatc tgcaactctt tcaccatctg tactgcagag 5940 atgcaggaag tgggtgttgg cctgtatcct agtatgtctt tgctcaatca cagctgtgac 6000 cctaactgtt caattgtgtt taatggacac cacctctcag cgtacctgct tgccccttca 6060 atgatatgga aaaggactgg tcactcatcc agcgagatac tggcttcctg aaatggagag 6120 actgtgctta taactctaag gtggactata ctgtacctgg tgaaagacta tatattagtg 6180 actggggttt taatggctct accttgctgc aactaaaggc actccaatga atgggtacca 6240 gacaagccat tcaacctcat ttgtggagag ccctcgagcc cttcaaccag tcagcctgtc 6300 
taggcctgtt ggacaaactc agcagtacca cattagggat gcttgcagtt ctcttacgtg 6360 tttttagtat gaaataagta ctaagttgtt aaaattgagt atcatagcaa tgtttataag 6420 gtgtcttgta ctgggtgtat ttttaactaa ttgtttaact aatgcctttt acgaaggttt 6480 tgttattctt gagcagccat cttatttgat gattcttgta taattgtcat ctcgttggta 6540 tgatgcatat ggtttacaga tattgtatga ggctaatgct cttttagaaa gacagaaaag 6600 atttttggct acttttaatt ttaggagtta cagctttaat agctactttt gagctgtagc 6660 attatctagg gaagttcata caactacgtt tgctgatcgg ctgtctaaaa acgtttctat 6720 tgccatagct attcaggaaa tcatagacca gaagttggaa agtaaggtta gtgctttgga 6780 ggaaggagtt ttagcagtag ggcaagaact tgtgacctta aaaaagaaat tggctctcca 6840 atgtcactct gagtttaaat ctgtccatac ttgggacaaa gtaacttgga atttggaacc 6900 attcagacct gagtatagac cttactaaat tacatcaaga aattcaggca tcatttgcta 6960 atcaaaaatc attaatgtct atactcatca atgtgggaat ttgcggagca gtgattcttt 7020 tgttgctgtg tttactccca atcatcttca gagtgctaaa gaagagcaat atggagtttc 7080 atggtttcgt gctaaaaaat aaaaaagggg gaga 7114 // ID LTR1F1 repbase; DNA; PRI; 729 BP. XX AC . XX DT 01-MAR-2009 (Rel. 14.06, Created) DT 18-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; LTR1F1. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-729 RA Smit A.F.; RT "LTR1F1 - ERV1 Endogenous Retrovirus from primates."; RL Repbase Reports 9(6), 1176-1176 (2009). XX DR [1] (Consensus) XX CC 9.5% subs, 45 copies. 
XX SQ Sequence 729 BP; 172 A; 241 C; 203 G; 112 T; 1 other; tgatacggac aggagacagg gaaatactgg gtagaagagg gcggttcccc ggcaaaggcc 60 ccaccctcaa gcctggaaac ccgcggccct aaatgggaac aggcattcct gttttcgcgc 120 ccaaangttg ccttttggcc cgccacgccc ccctatcctg tacccatata aaccccaaac 180 cccaggctcc acgagcagac gagcagaaga gcagaagagc ggcagagcgg cgcggcagag 240 aaggagagaa gagaaggagc gtctgaacgt cgagaggagt tcggctgggg acggtcggag 300 aggagatcgg ccgctggacg gccaaactcc aggggaagat catcttccca ctccatcccc 360 tttccagctc cccatccatc ccgctgagag ccacctccac cactcaataa aacccccgca 420 ttcaccatcc ttcaagtccg tgtgacctga ttcttcctgg acgccggaca agaacccggg 480 taccaagagg gcactgagct ggttaacact taagccgtct gcggacggca gagctaaaag 540 agcactaatt gtaacacacc cctagatgct accgtggggc cggagcccaa aagcgctcgc 600 cccggctcct gcacctgccc gtctgcgtgc tccccctccc gtaaggggtt tgagcgcgcg 660 gcggccgaac agacgagcca cacccctgtc gcacgtcctg cgagggggtc agggaactct 720 cccgtttca 729 // ID LTR2B_OG repbase; DNA; PRI; 541 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR2B_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-541 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1677-1677 (2008). XX DR [1] (Consensus) XX CC The 3' AA termini is atypical for endogenous retroviruses. 6bp CC TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 541 BP; 122 A; 150 C; 90 G; 179 T; 0 other; tgtggagtgc ggattgtgag gcggcaagac ccccacagcc cgggaggtgc gttcaaggaa 60 cgacccccaa ctgcttctcc tcccttccta aaacaacaat agtcctttct ccttgctccc 120 tctgccagaa gtcacgtagc ttctgtatac tttcctatct tttgcctaaa agcacatagc 180 cttctttccc taaagttaat ctttacccca tgtactaaga ttcacgtcct taagaatatt 240 gtttatgata aaactccttt tacttatcca gttacttccc ctcagccttg taagatgaaa 300 atattcccac tcgctttcct tgctgatttg caacattgtt aatataagca accttgttta 360 ggtactatat aaataaagct gttcacacgg ttcggtgctg gctgcagtca tcagctgcgc 420 cagaccttcc gatcccatct ctttgtcttt cgtgtaatct atgtctcttg tgtgatttat 480 tttctcaatt ccccgccatt cccactctgg ttcatttact cttcgcgctg gttcgcgaaa 540 a 541 // ID LTR5_Mim repbase; DNA; PRI; 410 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR5_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-410 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2956-2956 (2009). XX DR [1] (Consensus) XX CC >98% identical to consensus. 5bp tsd. CC Similarity to MacERV5a_LTR from Cercopithecidae. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 410 BP; 94 A; 152 C; 80 G; 84 T; 0 other; tgtaaggctg gtggcagggg ccccctcccc tccctagatg gaaaaagaaa acgctcagtg 60 acctgacccg ccaaggacaa actgccaggc cccactgaga ggaaccacac ccagatctcc 120 acctggattc gctcctggat tccacaaata cctccagccc ctcctatttc tcaacgacct 180 tccaaggtca caatagagaa tcccagctct tcccgccaca agtccctagc cccgcccaaa 240 gggtataaaa ggcctcagcc ccttcaaact gcggcgactt ccggggcccc cctctcttgg 300 ggccccagaa cctcgtccac gagtgtataa taaagccacg tggtttggcc ccccctactc 360 tttctctctc tgtgtctctt ttcttttcgc caccgccgaa aaaccttaca 410 // ID L1-6_TS repbase; DNA; PRI; 6717 BP. XX AC . XX DT 03-MAY-2010 (Rel. 15.05, Created) DT 03-MAY-2010 (Rel. 15.07, Last updated, Version 2) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-6_TS. XX NM L1-6_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-6717 RA Bao W. and Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(5), 772-772 (2010). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 6717 BP; 2530 A; 1570 C; 1299 G; 1318 T; 0 other; gatggcatcg gatccggtca gagtgtgaga gtctctgcat gggtgagtgc acaaaaaaac 60 taatctgcgt ctgtttgtgt gtgcaagtgg gtgagcgaat gtccaactga ggggagacac 120 gggatcaagc acccaggccc cctgcagaaa tggctacaat ttgccactgc tctgaattcg 180 gcacctgctg cacgcagaag gatccaccat ccctggggca tctctcggga aaaggcagaa 240 gagatcagct ggcacctggc ccacaccacc acctgcggca acaggccagg catgggcagc 300 agggaccgtg gagaaagcac ggggaggaaa tctggagctg agccccacca gagggaaaaa 360 aacacggacc gccaccattt tctctgtcct gggactgggc agagacctgc tttatcttgc 420 ccaattctga gctgcctgca tcctgagcag cctgcatcct gagaccaagg ggtgatctct 480 gcacccctct gaaatcgacc atcatttgcc cagagaggat actccaagtg catgataccg 540 gagagacaga cgccccaccc ccacactgga ccaaacacag acgcatgcaa tcagagtggg 600 agacctagca gcagacaatt tgggagcttt tcccgggaaa atctgctgga gcgagctgaa 660 cgctggcagt gagggccgcg gagggaacgc agggagggag cctgaggcag actcccaagg 720 aagcagtggg gagagcccac cctgagcctg gacggggctc cagagccttg ccaccaggtc 780 ccgagatccg gagactggta actgctggga agggagcaat agctcccggc agagaaactg 840 agcagactct ctaatgcctg ggcagagaag tgctccaggg tggcctctgc ggaacagcag 900 aactgcatct tgagaccagg cgctgatctc tgcgttgcgt tggggctgct ctttgcgacc 960 tcagagaaca ttgcccccag tgcacaagcc ctggcgggta atggcagatg cggaacccca 1020 gacgatccac cctccaccct gagccagctg ctgggcgata gacgcagaca gcagaactga 1080 ccgagaagga ggctggtggc agtattggtt tgtttgttta tctcatttgg ttgctttgtt 1140 tgttggttgg ttttctcttt tctttccttc tatttctctt tctttttctc tttctctgtt 1200 ttttctcttt ctattttctt tcttcttttc ctttctcctt ttttcttttt ccctccttct 1260 ctctcttttc cctcttatac cctgtcttgg actcaggggc tatgcaagca tacccactgt 1320 aaggtcatat tgtgcagaga tggagagcaa gtcccagctg gcccattcac cagatagcaa 1380 acattatgac ctgccaatag gttctgaatc cttgcagccc ccctcatccc ctagtaagtg 1440 ctcaaaattg taaagagaca gcacaatgcc cccagcccta cttcataatt gaacaaaccc 1500 agagccccgg ctcaaggaag gacaaaaggt agagcaaata tcagaagtga ctgcttctcc 1560 attttccact gagcaacgga attaaagaga aagataggag aactgctaca aatgaagaaa 1620 aaccaacgaa gaactgctgg taacccagag 
ccccagcaaa gagtcagtgc tagaaagaac 1680 agctactggg gcacaaacaa tggctatctc cgagaacgac agtacccaga aggtacttac 1740 agagatcaga gaatcattgg atagaatgaa taaaaatcaa gaaatgcaag ggctaaaact 1800 aaatgaaatt gcaacagagc tggacaaact caagaaagcg tacttagaaa taacagaaat 1860 aaagaactcc atccaagaaa taaatgacaa actgacaagc atgggaagca gaattgacca 1920 agcagaagaa agaatctcag agctggaaga ccaaaatatg gaattaaccc aatctattaa 1980 aaatatagaa aaaaacttaa gaagaaagaa gaaactctcc aagagatgtg cgattttatc 2040 aagaaaccaa acctacgtct gatcggaata cctgaaatag aaagagaaac agaaaacacc 2100 ctggaacaaa cattccatga ggtcattcaa gaaaacttcc ctcatctggc cagagagatg 2160 actatacaag cacaagagat tcagagaacc cctgcaagac atcttatgag aagaccaacc 2220 cctagacaca tagtaattcg cctacacaaa gtaggtataa aagaaaaaat cctaaaggca 2280 gcaagggaaa aaggtcagac tacctaccgg gggagaccaa tcagaattgc tgcagactta 2340 tctgcagaaa cactacaggc cagaaggaat tggaccccaa tctttaatgt tctcagagat 2400 aaacaatttc aaccaagaat ttcctaccca gccaagctaa gcttcatcag tgatggagaa 2460 ttaaaatcct tcccagacat ccaatcccta agagaatatg ctgcttcgag accagctcta 2520 caggagatgc ttaaaaaggt gctgtgcaca gaagaaaaaa gaaacaaaag aaagaccata 2580 tacttcacaa gatcacaaac aaacacagaa gcagcagaac acacagcata cccgcaaaaa 2640 tgaaaacaca cacatatatg aacacaaaag ctaaatgaaa acaaaacagc cttataagag 2700 cattatgaca gggacaaact ctcacatttc aataatcagc ctgaatgtga atggactaaa 2760 tgcaccactg aaaagacata gaatggcaaa ctggataaaa aaacatgacc cagtaatttg 2820 ctgtctccag gaaactcatc tcaccacaaa ggatgcccac agactcaaag ttagaggatg 2880 gaaaacaaat ttccaggtga acggatcaca aaagaaggca ggagtcgtga tcttaatatc 2940 agacaaaaca acctttaagc tatcaaaaat ttaaaaagat aaggaaggac actacataat 3000 gataaaaggt tcaatccatc aacaagaaat atccatccta aacatatatg cacccaacac 3060 aggagcacca gcttttataa agcagctact aaaaaaagat attgactcta acactatcat 3120 agctggggac ttaaataccc cactaacaac cctagataga tcatcgaggc aaaaaatcaa 3180 caaagagatc cggaacctca acttgacgct tgaccaaatg gacttagtag atacctacag 3240 aacactccac ccaacaacca cagaatatac attctactca tcaccacatg gaacgtactc 3300 caagatcgac cacatcctcg gccataaatc aagcataaac 
aaatttcaca agattgaaat 3360 actgccatgc accttctcag accacagtgg aataaaaata aatatcacca ccaacaaaat 3420 tcccccaaac ccacaaagac atggacacta aacagcatga tgctgaacaa ctcctgggtc 3480 aacacagaaa tcaaaacaga aattaaaaga ttcctggaaa caaatgaaaa tgaagaaaca 3540 tcttaccaaa acctctggga tgccatgaaa acagtactga gaggggaatt tatatctcta 3600 tgaacacaca tcaagaaaat agaaagagaa caagttaaca gcctaacaaa tcacctaagg 3660 gagctggaaa ggcaagacca ccaaaaccct aacttcagca gaagaatcca gatcaccaaa 3720 gtaaaagccc aaatatggga catagaagac aaaaatacca tagaaaaaaa tcaacaaaac 3780 aaaaagctgg ttctttgaaa ggataaacaa gattgatgga cccctagcca gaatgaccaa 3840 gaaaaagaga gaaaaagccc aaataaacac aatcagaaat gcaaaagatc aagtcacaac 3900 tgaccccaaa gaaatacaaa agattatcag agattactat gcacacctgt atggaaacaa 3960 actcgataac ctaaatgaaa tggaggactt tctgacatca cacaacctcc caaggttgaa 4020 acaagaagaa attgagatcc taaatagacc aataacaacc caggaaattg actctgtcat 4080 aagaaaacta cctacaaaaa aaaaagccct ggaccagatg gatttccagc ggaattctac 4140 aaaacataca aggaggagct gataccaatc ctattgaaag tattccaggc aatcgagaaa 4200 gatggaactc tccccaaatc attttatgaa gctaacatca cactgatacc caaaccaggt 4260 aaagatccaa caaagaaaga gaactacagg ccaatatccc tgatgaacat agatgcaaaa 4320 attctcaaca agattctagc aaatcggatc caacacatct caaaaatcat ccaccatgac 4380 caagtaggct tcatccctgg gatgcagggc tggttcaaca tccgcaagac cataaatgta 4440 attaaataca tcaacagatg taaaaacaag aaccacatga ttatatcatt agatgcagaa 4500 aaagcttttg ataaaatcca gcatcccttc ttgataaaaa cccttgaaca cctaggcata 4560 gagggaacat acctcaaaat agtaagagcc atctatgata aacccacagc caacatattg 4620 ctaaatggac agaaattgga agcatttccc ctgaaaaccg gaacaagaca aggctgccca 4680 ctctcacccc tcctgttcaa catagtgttg gaagtcctag ctagagcaat cagacaagag 4740 aaggaaatca ggggtatcca aataggaaaa gaagaagtca agttatccct ctttgctgat 4800 gatatgatcc tatacctcga aaatccaaga gaatctgtca aaaacctcct tgcactgata 4860 aaggactttg gcaaagtctc agggtacaaa ataaacgtgc aaaagacagt cacattctta 4920 tacaccaaca acaaacaggc agagaaccaa ataaaaagca caatcccatt cacaatagcc 4980 acaaaaaaat gaaatacctt ggcatcttcc taaccagaga agtgaaagac 
ctttacaatg 5040 aaaactacaa aacactgctc aaagaagtca aagatgacac aaacaaatgg aaaaatattc 5100 catgctcatg gattggaaga atcaacattg ttaagatgtc catcctacca aaggcaatct 5160 acagattcaa cgcaataccc attaaattac caacatcatt cttctcagac ctggaaacaa 5220 caatacagaa attcatatgg aaacataaac gaccacgaat agccaaaaca atccttagca 5280 aaagaaacaa agtgggaggt atcacacttc cagactttaa actttattat aaggctacaa 5340 taatcaaaac aacctggtat tggtacaaga acaggcatat agaccaatgg aacaggatag 5400 agattccgga agcaaaacct caatttctca accaactcat cttcgacaaa gcctccacca 5460 acaaccactg gggagaggag aacctattca gtaaatggtg ctgggaaaac tggctgacca 5520 cgtgcagaag aatgaaacag gacccctacc tatcaccata cacaaaaatc aactccaaat 5580 ggatcaaaga cctaaatgta aaacctcaaa ctttaagaat cttagaaaac gcaggagaca 5640 cccttatgga aattggaata ggcaaccaat tcctgatcaa aaccccaaaa gcccatgcca 5700 taagagataa gatagacaag tgggacctca tcaaactgaa aagcttctgc aaagcaaaag 5760 aaaccatcaa gagagtgggg agacagccca cagaatggga aaaaatattt gccaactgca 5820 tatctgacaa aggcctaaca tccaggatct acaaggaact caaacgtgcc aaaaggaaaa 5880 aaaacccatt aaaaagtggg caaaagacat gaatagacac ttctcaaaag aagacatatg 5940 agcagccaac agacacatga aaaaatgctc agcctcacta atcatcagag aaatgcaaat 6000 caaaaccaca ttgagatacc acctaactcc agtaagaatg gccatcatta ataaaacaaa 6060 aaacaacaga tgctggtgag gatgcggaga aaagggaatg cttctacact gttggtggga 6120 atgcaaacta gttcaacctc tatggaaaac agtgtggcaa ttcctaaaag aactagaaat 6180 tgaccttcca tatgacccag caatccccct tctgggaata tacccggagg aacttaaatc 6240 actctacaaa aaagatacct gcacatgcat gtttattgca gcactattca caatagcaaa 6300 aacatggaac caaccgtgct gcccatcgaa actggactgg attaaaaaaa tgtggtacat 6360 atacatgatg gaatattatg cagccataaa gaagaacaaa attatggact tcgcagcaac 6420 ttggatggat ttagagtcta tcatactcag tgatctatca cagaaacaaa gaactgagta 6480 ccacatgttc tcactcataa gcggaccctg aacatttacc ataatactat aagaaaggga 6540 ttggcagtag cgggaaactg ccaggggagg ggggggcaca taacatcaat aagggtacct 6600 gatctcaaac caggcgaagt ggggaccaaa gggaggccca aactttacct gtacgatgac 6660 cacttgtata cctaatcctg aattgtaccc cacatcttta aaataaaaaa ataaaaa 
6717 // ID MacERVK1_LTR1a repbase; DNA; PRI; 372 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Cercopithecidae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; LTR; KW MacERVK1_LTR1a. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-372 RA Smit A.F.; RT "MacERVK1_LTR1a - ERV2 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 4%. XX SQ Sequence 372 BP; 102 A; 89 C; 94 G; 86 T; 1 other; tgtagaggac tacgtgctcg caaacgagac gttcccgata agtcctgctc tcgcaaacga 60 agcagggcgt tcccgataag tcctgctctt gcaaacgaag cagggcgttc cttccctgta 120 aacagggagg acaaaggagc cagctgcnaa cagcagaccc tgggggcttg tttatgtgta 180 aacatcttga aaatccagaa agtcagggaa aggtcagaaa aacaacaatg tgtcttgtga 240 cttggcaaca ttccacaaac gactgtataa aataaagcag agcgcgccat tcgaggcggc 300 cgccatgttt gtcttgtctt gtgttgtctt gtgtgttcat tcctttgttt aggaaacacg 360 cggaccccaa ca 372 // ID LTR46_Mim repbase; DNA; PRI; 825 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR46_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-825 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2983-2983 (2009). XX DR [1] (Consensus) XX CC >94% identical to consensus. 6bp tsd. CC Similarity to LTR46_BT from cow. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur.4. 
XX SQ Sequence 825 BP; 208 A; 179 C; 207 G; 231 T; 0 other; tgatgggatt aagaccactg ggacttaggg atgctatgtc tgaggctgtg agaggatagg 60 aggtgaagga attgcctgag cagaagcctg aaataagtgc agatgttaac tgcctagttg 120 ggggacatat atcagcgatc tgggagtaca gcaaagcatg ctccaacctc agagcctaag 180 gggtctcatg catgaaaagc cgcgggcggc ctgacatgaa aacttctcat ggctgtagag 240 gtagattgca atatgtcagc ccgggagaag ggtctcaagg cttccatagt gtttgcaccc 300 tgggccaagt tccttctctc cctttctctc aacaatgcct cagttaatgt aaaaccagtc 360 tcccagtggg aagcatgtag tttccttgtt ccttgatcaa atgggtcgca gctactgaca 420 ttaaagattg atatccatgt acatagcttg agcttgggat aaatcagttt accttgatca 480 aaagggttgt acctactgaa gaattatgtg tttttcactt ctagaaagga gaaggaaatt 540 agctaggaca ccttgtgatt ttggggggag ccccgaaacc tttgatcatt gtatcccgtg 600 attttttcac atgagatcat tgtattttgc aacatggcaa agagaataaa aagcaagtgc 660 tggggttcag tcactgccac cttggcaccg gagtgctggt ggtcccctga cctccccgct 720 ttcttaacta ttatttctgt gtctgtgtct ttgtttctct catctcttgc aacacagtct 780 gccaggggac cctatcttta ttggggatgt cgcgaacccc cgaca 825 // ID ALR_ repbase; DNA; PRI; 171 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE SAT Satellite from primates. XX KW SAT; Satellite; Simple Repeat; Centromeric; ALR_. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-171 RA Smit A.F.; RT "ALR_ - SAT Satellite from primates."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX SQ Sequence 171 BP; 49 A; 29 C; 33 G; 55 T; 5 other; ttgtagaatc tgcaagtgga tatttggasc kctttgaggm cttcgktgga aacgggaata 60 tcttcacata aaaactagac agaagcattc tcagaaactt ctttgtgatg tttgcattca 120 actcacagag ttgaacmttc cttttgatag agcagttttg aaacactctt t 171 // ID ERV1-3B_TSy-LTR repbase; DNA; PRI; 556 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 
15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-3B_TSy-LTR; ERV1-3B_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-556 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1197-1197 (2010). XX DR [1] (Consensus) XX CC ~92% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 556 BP; 123 A; 117 C; 146 G; 170 T; 0 other; tgttgggagg tgatgaattt tgtaggaagt gatgaatttt ataccctgtg gcatgtaaaa 60 agttttgcat ttataatctc cctatataat gtacaatgta gtgaatttgt aaccccacat 120 attaggtgct gtgcggtaat gtcgctcagc tggaatgtgc ttccaggagg ggtctctagg 180 agatgcctta tttggactgg aatgcacact caggagggtc tctaggagat gccttatttg 240 gaagggacca agtggctacg ttaaattagc taacttgctc attctgcttc tgtaaacccc 300 gctggcatta tgtgattggt atatttgaat ttcccgggct tgcgaggtaa tgtgattggt 360 atatttgaat ttcccgggct cgcaaggtgg gcgggctcta acctatttaa gggcagcctc 420 tgccatagct cggggccctc gctttgtgtc tgcgtgtctg gcgagcggtc ccggccggct 480 gaaccttggc taataaacct ctcccttgtt aaattgtgtg tcggcgccat gaattaaaca 540 ctcagacccc actcca 556 // ID LTR10_Cja repbase; DNA; PRI; 501 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR10_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-501 RA Jurka J. 
and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2922-2922 (2009). XX DR [1] (Consensus) XX CC >85% identical to consensus. 5bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. XX SQ Sequence 501 BP; 106 A; 147 C; 95 G; 152 T; 1 other; tgtgagatat ggcaaggttt ctcttcaaac agcctgctca accttttatt ctttaattcc 60 cagtacccac cccctttctt ctccttttct tctttctgac tttactacat gcccaggcat 120 gccacagcac cagtggcgtt atcagcacca gctcacattc ctttccttat ttgggaaaga 180 ctggctctct agttcgccgc agacgacccc ttcctcctct cccctctctc ccatcacgcg 240 tccaccttat ctaagaaagt ttaaatgttt agccaatcgg gtctagttta gattgtgcgg 300 tccgacccca gccaatgggg aaaggacaca ggcaggagtc gcgttaggaa taaaaacttc 360 tactctcctt tgttcggggt gctctcgtgg caaccagccn tacgagaggc acccttctgc 420 gcagaagtaa atttgctttg ctgagaaatc ctttgtctga gtgctcgttt ttccttacga 480 ctccgagctt tatttctaac a 501 // ID CYN-III4 repbase; DNA; PRI; 230 BP. XX AC . XX DT 02-MAY-2006 (Rel. 11.04, Created) DT 02-MAY-2006 (Rel. 11.04, Last updated, Version 1) XX DE SINE element from Cynocephalus - consensus sequence. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; CYN-I; CYN-III4. XX OS Cynocephalus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Dermoptera; Cynocephalidae. XX RN [1] RP 1-230 RA Schmitz J. and Zischler H.; RT "A novel family of tRNA-derived SINEs in the colugo and two new RT retrotransposable markers separating dermopterans from RT primates."; RL Mol Phylogenet Evol 28(2), 341-349 (2003). 
XX DR [1] (Consensus) XX SQ Sequence 230 BP; 51 A; 67 C; 79 G; 33 T; 0 other; ggccggcccg gtggcacact ggataagtgc gccgcttggg aagcatggcg gcgctcccgc 60 ccgagggttc ggatcccaca tacagactgg ttcccgctca ctggctgagc gaggcgcggg 120 ggcagcgccg agggttgcaa tcccgttgcc ggtccccggt ccggtacggg gcaacactga 180 gggttgcgat ccgttgccgg acacggaaaa agacaaaaaa aaaataaaaa 230 // ID MER41B1_Mim repbase; DNA; PRI; 614 BP. XX AC . XX DT 11-NOV-2009 (Rel. 14.11, Created) DT 11-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW MER41B1_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-614 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2985-2985 (2009). XX DR [1] (Consensus) XX CC ~88% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 614 BP; 188 A; 157 C; 139 G; 130 T; 0 other; tgttagggac aggtgcaggt tggcagcgct aaccacagga aaagaggagg ggggcaaaaa 60 tgaggaggcc aatgagcatt tagaaccaaa cccgcaacgg caggaacttg aggttaggga 120 cttttcccgg ccaggagcat gcgcagaaac taggagctca tgtccaaagg taaccttttc 180 cccattaaca tacattagca tagtaaaaat cacacccacc agcgccatga cagtttacaa 240 atgtcatggc aacccctgga agttacccta taaggataaa attggggagg gcccttagct 300 ctaagaactc tccacccttt tcccaggaaa ataatgaata ttccacccac catttagcat 360 atcataaggt gtagacataa aagtagaagt gtggtaagac cccagggtct tctccactcg 420 cggagacaga cccatttccc ctcaggaaac gtactttcgc tttctattct gtgctgaaaa 480 gcctgcaata aaagctgctt gtgtggaagt gctaagtgtc tgtcatcact ctgtcgcacc 540 ggccctgctt gcgattcttc caagcaagga gccaaagaac cagggcaaca cccctcaaaa 600 aatcaaccct gaca 614 // ID LTR11_OG repbase; DNA; PRI; 534 BP. XX AC . 
XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV3-type endogenous retrovirus - DE consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR11_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-534 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1585-1585 (2011). XX DR [1] (Consensus) XX CC >93% identical to consensus. 5 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 534 BP; 151 A; 122 C; 125 G; 134 T; 2 other; tgagagggag cagctatgaa cctatttttt attattacta ttttccttgc tgtgccttta 60 aatagtagtg gtagtggcaa aaatagaaaa agataaaaga gctggggaag aaggaagaca 120 cggaggagag gggggaggag gcaagagctg caggaagctc caaggtcata aatcgaatca 180 ggtatacagc ccaggtggca gcaaccaatc aaaagttcac acaccnccat tccagctaag 240 accccaacca cggagaggca gttggcctat cggaacccca agacctccct agtgaccaat 300 gagtgaccag acactagctt cttcctcagc tatcttcttc agcattttat tgcaacccat 360 agtaccttgc gagaaagtgt ataaaagttt ccctggcctt tgtttagggg ccactccccg 420 ctgtttctgc ttttctcttg ggagtgaccc gagcatatgc ttgataagaa taaactcctt 480 gcttcttcca tgtgtgagnc tggactctca gtggtaataa ttttggccct aaca 534 // ID TINE2 repbase; DNA; PRI; 87 BP. XX AC . XX DT 31-DEC-2009 (Rel. 15.03, Created) DT 22-NOV-2010 (Rel. 15.07, Last updated, Version 6) XX DE SINE from the LTR portion - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; TINE2. XX NM TINE2. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-87 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(3), 518-518 (2010). 
XX DR [1] (Consensus) XX CC >92% identical to consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 87 BP; 26 A; 24 C; 12 G; 25 T; 0 other; ggcaacccct tcgggtcccc tcccactgtg ggagcttttc tgtcctctca ataaatcctg 60 ctttctaaat aaataaataa ataaaaa 87 // ID LTR18_OG repbase; DNA; PRI; 474 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.11, Created) DT 18-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR18_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-474 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2864-2864 (2009). XX DR [1] (Consensus) XX CC ~83% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 474 BP; 126 A; 113 C; 139 G; 96 T; 0 other; tgataggagt tagaaggctt taagcaggga ctgggaaacg gcacggagaa gaagggggag 60 gactccttta gggcgacctg tgcagccatt ggtcactggc tgttgctagg cagggcaatg 120 ccaggggagg gacttcccct ggggattggg cggaaaaagc gccagcacga ataaagggcc 180 aatcaacaga ggccagtcag ttaaggttta aagaaaccaa tcagtgtaag ccagctaggt 240 ttgaaggacc caatcagtgt ttgccagcag gtttgggcgg gcagacaagg cataaaagcc 300 caacccagcc gagctgcaac ggcaacccgt tcgggacccc ttccgttgtg ttggaagctt 360 tcctatcttc taataaacta tcgctctatc gcttctgctt gcctgtggtc cgtgaattca 420 ttcttcgatt cccgagacca agaacccgac caggaggaga aaaatccggt atca 474 // ID D20S16 repbase; DNA; PRI; 98 BP. XX AC . XX DT 01-JUL-2003 (Rel. 8.06, Created) DT 17-JUL-2008 (Rel. 13.07, Last updated, Version 2) XX DE Human centromeric satellite. 
XX KW SAT; Satellite; Simple Repeat; Centromeric; D20S16; KW Satellite repetitive element. XX NM D20S16. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 25-98 RA Smit A.F.; RT "D20S16: Human centromeric satellite."; RL . XX RN [2] RP 1-98 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR [1] (Consensus) XX CC CC [2]. XX SQ Sequence 98 BP; 23 A; 36 C; 17 G; 22 T; 0 other; cagctccaca aaaatcaatc tagaacaaga cctctcctcc ctgggtcgcc agcttcctga 60 ccctcgaact gcaacaacgt tgctcctgcc tgggtctt 98 // ID L1-2_Cja repbase; DNA; PRI; 6486 BP. XX AC . XX DT 12-JAN-2011 (Rel. 16.02, Created) DT 12-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE L1-type non-LTR retrotransposon (consensus). XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-2_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-6486 RA Bao W. and Jurka J.; RT "L1-type elements from the marmoset genome."; RL Repbase Reports 11(2), 734-734 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. CC We thank the Marmoset Genome Sequencing Consortium for making CC their data publicly available. 
XX SQ Sequence 6486 BP; 2447 A; 1459 C; 1340 G; 1240 T; 0 other; ggtggctggc aagatggccg aataggaaca gctccggtct gcagctccca gcgagatcaa 60 cgcagaaggt gggtgatttc tgcatttcca actgaggtac ccggctcatc tcattgggac 120 tggttagaca gtgggtgcag cccatggagg gtgagccgaa gcagggtggg gcgtcgcctc 180 acccgggaag cgcaaggggt cggggaactc cctcccctag ccaagggaag ccatgaggga 240 ctgtgccatg aggaacggtg cactccggcc cagatactat gcttttccca tggtcttcgc 300 aacccgcaga ccaggagatt ccctcgggtg cctacaccac cagggccctg ggtttcaagc 360 acaaaactgg gcggccattt gggcagacac cgagctagct gcaggagttt ttttttcata 420 ccccagtggc gcctggaacg ccagcgagac agaaccgttc actcccctgg aaagggggct 480 gaagccaggg agccaagtgg tctagctcag cggatcccac ccccatggag cccagcaagc 540 taagatccac tggcttgaaa ttctcgctgc cagcacagca gtctgaagtc gacctgggat 600 gctcgagctt ggtgggggga ggggcgtcca ccattactga ggcttgagta ggcggttttc 660 ccctcacagt gtaaacaaag ccgccgggaa gttcgaactg ggcggagccc accacagctc 720 ggcaaagccg ctgtagccag actgcctctc tagattcctc ctctctgggc agggcatctc 780 tgaaagaaag gcagcagccc cagtcagggg cttatagata aaactcccat ctccctggga 840 cagagcacct gggggaaggg gcggctgtgg gcgcagcttc agcagactta aacgttcctg 900 cctgccggct ctgaagagag cagcggatct cccagcacag tgctcgagct ctgctaaggg 960 acagactgcc tcctcaagtg ggtccctgac ccccgtgcct cctgactggg agacacctcc 1020 cagcaggggt cgacagacac ctcatacagg agagctctgg ctggcatctg gcgggtgccc 1080 ctctgggacg aagcttccag aggaaggaac aggcagcaat ctttgctgtt ctgcagcctc 1140 cgctggtgat acccaggcaa acagggtctg gagtggacct ccagcaaact ccagcagacc 1200 tgcagcagag gggcctgact gttagaagga aaactaacaa acagaaagga atagcatcaa 1260 catcaacaaa aaggacatcc acacagaaac ccatccgaag gtcaccaaca tcaaagacca 1320 aaggtagata aatccatgaa gatgaggaaa aaccagcgca aaaaggctga aaattccaaa 1380 aaccagaatg cctcttctcc tccaaaggat cacaactcct cgccagcaag ggaacaaaac 1440 tggacggaga atgagtttga tgaattgaca gaagtaggct tcagaaggtg ggtaataaca 1500 aactcctccg agctaaagga gcatgttcta acccaatgca aggaagctaa gaaccttgaa 1560 aaaaggttag aggaattgct aactagaata accagtttag agaagaacat aaatgacctg 1620 atggagctga aaaacacagc acgagaactt 
cgtgaagcat acacaagtat caatagccga 1680 atcgatcaag cagaagaaag gatatcagag attgaagatc aacttaatga aataaagcgt 1740 gaagacaaga ttagagaaaa aagaatgaaa aggaatgaac aaagcctcca agaaatatgg 1800 gactatgtga aaagaccaaa cctacgtttg attggtgtac ctgaaagtga cggggagaat 1860 ggaaccaagt tggaaaacac tcttcaggat attatccagg agaacttccc caacctagca 1920 agacaggcca acattcaaat tcaggaaata cagagaacac cacaaagata ctcctcgaga 1980 agagcaaccc caagacacat aatcgtcaga ttcaccaagg ttgaaatgaa ggaaaaaatg 2040 ttaagggcag ccagagagaa aggtcgggtt acccacaaag ggaagcccat cagactaaca 2100 gcggatctct ctgcagaaac cctacaagcc agaagagagt gggggccaat attcaacatt 2160 cttaaagaaa agaattttca acccagaatt tcatatccag ccaaactaag cttcataagt 2220 gaaggagaaa taaaatcctt tacagacaag caaatgctga gagattttgt caccaccagg 2280 cctgccttac aagagctcct gaaggaagca ctaaatatgg aaaggaaaaa ccggtaccag 2340 ccactgcaaa aacataccaa attgtaaaga ccattgacac tatgaagaaa ctgcatcaac 2400 taatgggcaa aataaccagc tagcatcata atgacaggat caaattcaca cataacaata 2460 ttaaccttaa atgtaaatgg gctaaatgcc ccaattaaaa gacacagact ggcaaattgg 2520 ataaagagtc aagacccatc ggtgtgctgt attcaggaga cccatctcac atgcaaagac 2580 acacataggc tcaaaataaa gggatggagg aatatttacc aagcaaatgg aaagaaaaaa 2640 aaagcagggg ttgcaatcct agtctctgat aaaacagact ttaaaccaac aaagatcaaa 2700 aaagacaaag aagggcatta cataatggta aagggatcaa tgcaacaaga agagctaact 2760 atcctaaata tatatgcacc caatacagga gcacccagat tcataaagca agttcttaga 2820 gacctacaaa gagacttaga ctcccacaca ataatagtgg gagactttaa caccccactg 2880 tcaatattag acagatcaat gagacagaaa attaacaagg atattcagga cttgaactca 2940 gctctggacc aagcggacct aatagacatc tacagaactc tccaccccaa atcaacagaa 3000 tatacattct tctcagcacc acatcacact tattctaaaa ttgaccacat aattggaagt 3060 aaaacactcc tcagcaaatg caaaagaatg gaaatcataa caaacagtct ctcagaccac 3120 agtgcaatca aattagaact caggattaag aaactcactc aaaactgcac aactacatgg 3180 aaactgaaca acctgctcct gaatgactac tgggtaaata acgaaattaa ggcagaaata 3240 aataagttct ttgaaaccaa tgagaacaaa gacacaatgt accagaatct ctgggacaca 3300 gctaaagcag tgtttagagg gaaatttata gcactaaatg 
cccacaggag aaagcaggaa 3360 agatctaaaa tcgacaccct aacatcacaa ttaaaagaac tagagaagca agagcaaaca 3420 aattcaaaag ctagcagaag acaagaaata actaagatca gagcagaact gaaggagata 3480 gagacacgaa aaacccttca aaaaaatcaa tgaatccagg agctggtttt ttgaaaagat 3540 taacaaaata gatagaccgc tagccagact aataaagaag aaaagagaga agaatcaaat 3600 agacacaata aaaaatgata aaggggatat caccactgat cccacagaaa tacaaactac 3660 catcagagaa tactataaac acctctatgc aaataaacta gaaaatctag aagaaatgga 3720 taaattcctg gacacataca ccctcccaag actaaaccag gaagaagtcg aatccctgaa 3780 tagaccaata acaagttctg aaattgaggc agtaattaat agcctaccaa ccaaaaaaag 3840 cccaggacca gatggattca cagccgaatt ctaccagagg tacaaagagg agctggtacc 3900 attccttctg aaactattcc aaacaataga aaaagaggga ctcctcccta actcatttta 3960 tgaggccagc atcatcctga taccaaaacc tggcagagac acaacaaaaa aagaaaattt 4020 caggccaata tccctgatga acatcgatgc gaaaatcctc aataaaatac tggcaaaccg 4080 aatccagcag cacatcaaaa agcttatcca ccacgatcaa gtcggcttca tccctgggat 4140 gcaaggctgg ttcaacatac gcaaatcaat aaacgtaatc catcacataa acagaaccaa 4200 tgacaaaaac cacatgatta tctcaataga tgcagaaaag gcctttgata aaattcaaca 4260 ccccttcatg ctaaaaactc tcaataaact aggtattgat ggaacgtatc tcaaaataat 4320 aagagctatt tatgacaaac ccacagccaa tatcatactg aatgggcaaa agctggaagc 4380 attccctttg aaaactggca caagacaagg atgccctctc tcaccactcc tattcaacat 4440 agtattggaa gttctggcca gggcaatcag gcaagagaaa gaaataaagg gtattcaaat 4500 aggaagagag gaagtcaaat tgtctctgtt tgcagatgac atgattgtat atttagaaaa 4560 ccccatcgtc tcagcccaaa atctccttaa gctgataagc aacttcagca aagtctcagg 4620 atacaaaatc aatgtgcaaa aatcacaagc attcctatac accaataata gacagagagc 4680 caaatcatga gtgaactccc attcacaatt gctacaaaga gaataaaata cctaggaata 4740 caacttacaa gggatgtgaa ggacctcttc aaggagaact acaaaccact gctcaaggaa 4800 ataagagagg acacaaacaa atggaaaaac attccatgct catggatagg aagaatcaat 4860 atcgtgaaaa tggccatact gcccaaagta atttatagat tcaatgctat ccccatcaag 4920 ctaccattga ctttcttcac agaattagaa aaaactactt taaatttcat atggaaccaa 4980 aaaagagccc atatagccaa gacaatccta agcaaaaaga acaaagctgg 
aggcatcatg 5040 ctacctgact tcaaactata ctacaaggct acagtaacca aaacagcatg gtactggtac 5100 caaaacagat atatagacca atggaacaga acagaggcct cagaaataat gccacacatc 5160 tacaaccatc tgatctttga caaacctgac aaaaacaagc aatggggaaa ggattcccta 5220 tttaataaat ggtgttggga aaactggcta gccatatgca gaaaactgaa actggacccc 5280 ttccttacac cttatacaaa aattaactca agatggatta aagacttaaa cgtaagacct 5340 aaaaccataa aaaccctaga agaaaaccta ggcaatacca ttcaggacat aggcatgggc 5400 aaagacttca tgactaaaac accaaaagca atggcaacaa aagccaaaat tgacaaatgg 5460 gatctaatta aactaaagag cttctgcaca gcaaaagaaa ctatcatcag agtgaacagg 5520 caacctacag aatgggagaa aatttttgca atctatccat ctgacaaagg gctaatatcc 5580 agaatctaca aggaacttaa acaaatttac aagaaaaaaa caaaaacccc atcaaaaagt 5640 gggcaaagga tatgaacaga cacttctcaa aagaagacat ttatgcagcc aacaaacata 5700 tgaaaaaaag ctcatcatca ctggtcatta gagaaatgca aatcaaaacc acaatgagat 5760 accatctcac gccagttaga atggcgatca ttaaaaagtc aggaaacaac agatgctgga 5820 gaggatgtgg agaaatagga acgcttttac actgttggtg ggagtgtaaa ttagttcaac 5880 cattgtggaa gacagtgtgg cgattcctca aggatctaga accagaaata ccatttgacc 5940 cagcaatccc attactgggt atatacccaa aggattataa atcattctac tataaagaca 6000 catgcacaca tatgtttatt gcagcactgt tcacaatagc aaagacttgg aaccaaccca 6060 aatgcccatc aatgatagac tggataaaga aaatgtggca catatacacc atggaatact 6120 atgcagccat aaaaaaggat gagttcatgt cctttgcagg gacatggatg aagctggaaa 6180 ccatcattct cagcaaacta acacaggaac agaaaaccaa acaccgcatg ttctcactca 6240 taagtgggag ttgaacaatg agaacacatg gacacaggga ggggaacatc acacactggg 6300 gcctgtcggg gggtgggggg ctaggggagg gatagcatta ggagaaatac ctaatgtaga 6360 tgacgggttg atgggtgcag caaaccacca tggcacgtgt atacctatgt aacaaacctg 6420 cacgttctgc acatgtatcc cagaacttaa agtataataa taaaaaaata aaataattta 6480 caaaaa 6486 // ID MacERV5b_LTR repbase; DNA; PRI; 371 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from DE Cercopithecidae. 
XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MacERV5b_LTR. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-371 RA Smit A.F.; RT "MacERV5b_LTR - ERV1 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 7% 5 bp TSDs, but ERV1-class proteins. XX SQ Sequence 371 BP; 85 A; 141 C; 74 G; 71 T; 0 other; tgttaggcag gaatctagac ccaacatggc ggtatcaccc ggcatggcag gccctttgtt 60 aggacttccc gcccttcact tcctgctaag actctcagcg cgcgaaaaaa gcccgcgccc 120 gccaaaaaac ccccgctctg cgcaagctcc tggacacgtc attcctcaga aatcgaaacc 180 taactcagga aaaccgaaac ctacaaaccc cgcctacctc gccctataaa aggcccccga 240 tacccgcccc gagcgcgact tcctcggccc tcctcctagg ggaccggtga acctcgcccg 300 cgagcccaat aaaggctacc tctgttctca tctgcctcgt gtcttcttgc tcggctcccc 360 attacattac a 371 // ID MacERV4_int repbase; DNA; PRI; 7442 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Cercopithecidae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; LTR; KW MacERV4_int. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-7442 RA Smit A.F.; RT "MacERV4_int - ERV2 Endogenous Retrovirus from Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 1-3% ORFs: gag 154-2130, pro 1944-2891 (probably ca 2094-2891), CC pol 2870-5485, env 5476-7236. XX FH Key Location/Qualifiers FT CDS 154..2130 FT /product="MacERV4_int_1p" FT /note="gag." 
FT /translation="MGQELSQHQIYVGQLKEALKIRGVKVKGNDLFKFFDF FT VKDTCPWFPQEGTIDIKRWRRVGDCFQDYYNTFGPEKIPVTAFSYWNLIRD FT LIDKKEADPQVMAAVAQTEHILKVSSRSNLAKPPQDTEEDLISLESDHEEI FT KSPSVTDKEMPHENKPKKYPILQMLQKEEEINKPNQSDINWDDLEEEAAKY FT HNPDLPPFTSYPPPYNKTHNEASAPIVMAAIDPKEELKQKIAQLEEQIKLE FT ELHQSLIIRLQKLKTGNEKIPNSDAMEGSLRPLQRPGQHVPRGGLVASRHR FT EDSSPKDVFPVTETIDEQGQAWRHHTGFDFTIIKELKTAASQYGATAPYTL FT AIVESVAENWLTPTDWNTLVRAVLSGGDHLIWKSEFFENCRDTAKRNQQAG FT NGWDFDMLTGSGNYADTQAQMQYDPGLFSQIQAAATKAWRKLPVKGDPGAS FT LTAVKQGPDEPFSDFVHRLMTTAGRIFGNAETGVDYVKQLAYENANPACQA FT AIRPYRKKTDLTGYIRLCSDIGPSYQQGLAMAAAFSGQTVRDFLINKGKDK FT GGCFRCGKRGHFAKDCRENQNKSPEAKIPGLCPRCKRGRHWANECKSKTDS FT QGNPLPPRQGNGMRGQPQAPKQAYGAVSFVPASNSNPFQNLVEQPQEVQDW FT TSVPPPTQY*" FT CDS 2870..5485 FT /product="MacERV4_int_3p" FT /note="pol." FT /translation="KGVWKFLAQATDIPAPQRCADPITWKSDEPVWVDQWP FT LLNDKLSAAQQLVQEQLAAGHIEESNSPWNTPIFVIKKKSGKWRLLQDLRA FT VNITMILMGALQPGLPSPVAIPQKYFKIIIDLKDCFFTIPLHPADQKRFAF FT SLPSTNFKQPMKRYQWKVLPQGMANSPTLCQKYVAAAIEPVRKTWAQMYII FT HYMDDILIAGEIGEQVLQCFAQLKQELTAAGLQIAPEKVQLQDPYTYLGFQ FT INGPKIINQKAVIRRDHLKTLNDFQKLLGDINWLRPYLKLTTGELKPLFDI FT LKGDSNPKSPRSITKEALMALQQVEHAIATQFVTGIDYSQPLIFLIFNTTI FT TPTGLFWQNNPIMWVHLPSSPKKVLLPYYDAIADLIILGRENSRKYFGIEP FT STIIQPYTQSRIHWLLQNTEAWPIACASYTGAIDNHYPPNKLIQFCKLHAF FT VFPHITSKEPLNDALLIFTDGSSTGLAAYTYNNVVVKFQTTYTSAQLVELQ FT AIIAALSAFPCQPLNIYTDSAYLAHSIPLLETVPQIKHISDTANLFLQCQQ FT LIRKRTTPFFLGHIRAHSGLPGPLTQGNATADAATKTIATVTTDNLQQAQK FT AHALHHLNAQTLRLMFKLTREQARQIVKQCANCITYLPVPHLGVNPRGLIP FT NEIWQMDVTHHLEFGQLKYIHVCIDTYSGFISATLQTGEATKHVIAHLLHC FT FSILGIPKQIKTDNGPGYIAKTFLQFCNTLQIKHTTGIPYNPQGQGIVERA FT HLSLKTVITKLKGGSWYPVKGTPRNILNHALFILNFLNLDSHGKSAADRFW FT HPESQKQFAMVKWKDPLDSSWHGPDPVLIWGRGSVCIFSQKNDAARWLPER FT LVRQINHNHCQSREDKSP*" FT CDS 1944..2891 FT /product="MacERV4_int_2p" FT /note="pro (probably ca 2094-2891)." 
FT /translation="QSRKSFTPQAGKRDEGPASGPETSIWGSQLCSSQQQQ FT SISKLSRATPGSAGLDLSSTSHTILTPEMGPQTLNTGIYGPLPPNTFGLLL FT GRSSVTMRGLQVLPGVIDNDYEGEIKIMARAIDSIITVPQGVRIAQLLLLP FT LVKTDNNIQYSNRNIKGFGSSDIYWVQPITNQKPSLTLWLDGKAFTGLIDT FT GADVTIIKQEDWPSHWPTTETLTHLRGIGQSSNPKQSSKYLTWTDKENNSG FT LIKPFVIPYLPVNLWGRDLLSQMKIIMCSPNDIVTAQMLTQGYTPGKGLGK FT GENGIPQPILVSGQLDKKGFGNF*" FT CDS 5476..7236 FT /product="MacERV4_int_4p" FT /note="env." FT /translation="ISLRSSFLLVFQKMKPNMRFLWRIIALYNIVTVYAGF FT GDPRKARELLRKQYGQPCDCRGGQVSEPPSDRITQVTCXGKTAYLMPNQLW FT KCKSTPRDTSPSGPLLECPCSSFQSSVHSSCYTSYQQCKSGNRTYYTATLL FT KTQTGGTNDVQVLGSTNKLVQSPCNGQKGKPVCWSTTAPIHISDGGGPLDT FT ARIKTVQKKLEEIHKALYPELQYHPLALPELRDNFRLDAQTFDILNATYNL FT LQMSNTSLAHDCWLCLKMGPPIPLAIPNLSLPYVNYSNESLVNNSCPITPP FT LLVQPMTFSNSSCLFSPSYNNTKEIDLGYVVFGNCTSIINATNPLCAVNGS FT VFVCGNNMAYTYLPTNWTGLCVLATLLPDIDIIPGDEPIPIPAIEHFIYRP FT KRAIQFIPLLAGLGITTAFTTGATGLGVSLTQYTKLSNQLISDVQTLSSTI FT QDLQDQVDSLAEVVLQNRRGLDLLTAEQGGICLALQEKCCFYANKSGIVRD FT KIKTLQEELEKRRKGLAANPLWTGLDGLLPYLLPFLGPLLTLLLFLTLGPI FT ILNKLMAFVRQQIEAFQAKPIQVHYHRLEMTENGESYLP*" XX SQ Sequence 7442 BP; 2389 A; 1712 C; 1426 G; 1914 T; 1 other; tggcgcccga acggggacct ggaaacgagg gactccgtga ggaagaggac gccaaaggac 60 ggtcgaccgc taacgaagac aaaaggagtc aaactcttcc gatcaccgcg ggaacctgcc 120 gcgtcagaat cgaaggtaag tgacgcgtcc gaaatgggac aagaattaag ccaacatcaa 180 atatatgtag gacaattaaa agaggcttta aagatacgag gagtaaaggt taaaggtaat 240 gatttgttta aattttttga ttttgtaaaa gacacttgcc cttggttccc acaggaagga 300 actattgata ttaaaagatg gcgtagagta ggggattgtt ttcaggacta ttataatact 360 tttgggccag aaaaaattcc tgtgaccgcc ttctcttatt ggaacctcat tagagattta 420 atagataaga aggaggccga tccgcaagtc atggctgcgg tcgctcagac agagcatatt 480 ttaaaggtta gctcccgctc taacctcgca aagcctccgc aagatacgga ggaggatctc 540 atttccctcg aaagtgatca tgaggaaatc aagtctcctt ctgtaacaga taaagaaatg 600 ccacacgaaa acaaaccaaa aaaataccca attttacaga tgcttcaaaa ggaggaggaa 660 attaataagc ctaatcaatc ggatataaat tgggatgatt tagaggaaga ggcggctaaa 720 tatcacaacc ctgatttgcc 
tccctttact tcatacccgc ccccatataa taagacacat 780 aatgaggctt ctgcgcccat tgttatggca gcaatagatc ccaaagaaga attaaagcaa 840 aaaattgctc aactagaaga acagattaaa cttgaagagt tacatcaatc attgataatt 900 aggctccaaa agctaaaaac aggaaatgaa aaaataccta actcagacgc tatggagggt 960 tccttgcgcc cacttcagcg gcctggacaa catgttccaa gaggggggtt agttgctagc 1020 cgacatagag aagactcctc ccccaaagac gtttttccgg tcactgaaac catagatgaa 1080 cagggacagg cttggaggca tcatactgga tttgatttta ctattataaa agagttaaag 1140 actgctgcct ctcagtatgg ggctactgct ccatatactc ttgctatagt ggaatctgta 1200 gccgagaatt ggctcactcc tacagactgg aataccttag tcagggcagt tctttctggg 1260 ggagaccact taatttggaa gtcagagttc tttgaaaatt gtagagatac agctaaaaga 1320 aatcaacagg caggaaacgg ttgggatttt gatatgttaa ctggttcagg taattatgca 1380 gacactcagg cccaaatgca atatgaccct ggactgttct cacaaatcca ggctgctgct 1440 acaaaagcct ggagaaaact tcccgttaaa ggagatccag gggcctcgct cacagcggtt 1500 aaacaaggac ccgatgagcc attttcagac tttgtgcata gacttatgac cacggcaggt 1560 agaatctttg gaaatgcaga aacgggtgta gattatgtta aacagttagc ttatgaaaat 1620 gctaaccccg cctgccaagc ggcaatcaga ccttatcgaa agaaaacaga tttaacagga 1680 tacattcgcc tttgttcaga tattgggccc tcatatcaac aaggcctagc aatggccgcc 1740 gcttttagcg gccaaacagt aagagacttc ctcattaaca aaggtaaaga taaaggggga 1800 tgttttagat gcggtaaaag gggacacttt gcaaaagatt gccgtgaaaa ccaaaataag 1860 agtccagaag caaaaatccc aggcctttgc ccaaggtgta aaagaggaag gcactgggca 1920 aacgaatgta aatctaaaac tgacagtcaa ggaaatcctt taccccccag gcagggaaac 1980 gggatgaggg gccagcctca ggccccgaaa caagcatatg gggcagtcag ctttgttcca 2040 gccagcaaca gcaatccatt tcaaaactta gtagagcaac cccaggaagt gcaggattgg 2100 acctcagttc cacctcccac acaatactaa cacctgaaat gggaccgcaa accttaaata 2160 ccggaatata tggaccctta ccacctaaca cttttgggct acttttagga agaagtagtg 2220 tcaccatgag aggcttacaa gtcctccctg gagttatcga taatgattat gaaggagaaa 2280 ttaaaattat ggccagagct attgatagta ttattactgt ccctcaagga gttagaatag 2340 ctcagttact cctgctacct ttggttaaaa cagataataa tatccaatac tctaatagaa 2400 atataaaagg tttcggatca tcagatatat 
attgggtgca accaattaca aatcaaaaac 2460 cctctctaac cttatggtta gacgggaagg cattcactgg actaatagac acaggggccg 2520 atgtaactat cattaaacag gaagattggc cctctcattg gcctaccaca gaaactttaa 2580 ctcacttgag aggaattgga caaagcagta atcctaaaca aagttctaaa tacctaacat 2640 ggacagataa agaaaacaat tcaggcctca ttaagccatt tgtcatccct tacctacctg 2700 ttaacctttg ggggcgagat ctgctctctc aaatgaaaat tataatgtgt agtccaaatg 2760 atatagttac tgcacaaatg ttaactcaag gatacacccc tggtaaaggt cttggaaaag 2820 gagagaacgg tatcccacag cctatactgg tttcaggaca acttgataaa aaggggtttg 2880 gaaattttta gctcaggcca ctgacatacc tgcaccccaa aggtgcgctg accccattac 2940 ttggaagtca gatgagcccg tttgggttga tcagtggcct ttactcaatg ataaactaag 3000 tgctgcccaa cagttagtgc aggaacaact agcagcagga catattgagg aaagtaattc 3060 cccttggaat acacctattt ttgttattaa aaagaagtct ggtaaatgga gactcttaca 3120 agatttaaga gcagtaaata tcactatgat ccttatgggt gccttacaac caggattgcc 3180 ttcaccggtt gcgattcctc aaaaatattt taaaatcatt attgacctta aagattgctt 3240 ttttacaatt ccccttcacc ctgctgacca gaaaagattt gcctttagtc ttccatctac 3300 aaattttaaa caaccaatga agcgctatca atggaaagtc ttacctcagg gtatggccaa 3360 tagtcctacc ttgtgtcaaa aatatgtagc tgccgctata gagccagtca gaaaaacgtg 3420 ggcacaaatg tatattatac attatatgga tgatatttta atagcaggag aaattggcga 3480 acaagtctta cagtgcttcg cccaactcaa acaagagtta acagcagccg gactacaaat 3540 agccccagaa aaggtacaat tacaagatcc atacacctat cttggtttcc aaattaatgg 3600 acccaaaatc attaatcaaa aggccgttat acgtcgtgat catttaaaaa ctttaaatga 3660 tttccaaaaa ttactgggag acataaattg gcttcggccg tacttaaagc tcaccacagg 3720 agagttaaaa cctcttttcg atatattaaa aggagactct aatccaaaat cccccaggtc 3780 cattactaaa gaagcattaa tggcactcca acaggtagaa catgccattg caacacaatt 3840 tgttaccggt attgattatt ctcagccatt aatattcctt atttttaaca caacaataac 3900 ccctactggc ctattttggc agaacaatcc cattatgtgg gtacacctgc cctcctcccc 3960 taaaaaggtt ttgttgcctt attatgatgc catagctgat ctaattatct tgggaagaga 4020 aaacagtaga aaatactttg gaatagaacc ctctaccatt atacagccct acactcaatc 4080 acgcatccat tggctgttac aaaatacgga agcctggcca 
attgcttgcg cttcttatac 4140 tggcgcgatt gacaaccatt acccacctaa caaacttatt caattttgca aacttcatgc 4200 gtttgtgttt ccccatatta ccagtaaaga acctctcaat gacgcattac taattttcac 4260 tgatggatct tccacaggac ttgccgctta cacttacaat aatgtagtcg ttaaattcca 4320 gaccacttat acatcagctc agctggtcga attgcaagct ataattgcag cactatcagc 4380 ttttccttgt cagccactta acatttatac agacagcgcc tacctggctc attcaatacc 4440 cctcttagag accgtgcctc aaattaaaca tatttcagac acagctaacc tatttttaca 4500 atgtcaacaa cttatccgaa aaaggactac tccctttttt cttgggcata ttagagcaca 4560 ttcaggatta ccgggacctt taacacaagg taacgcaaca gctgacgcgg caacaaaaac 4620 catagccaca gtcactacag acaatttgca acaagcgcaa aaagcacatg ccttacatca 4680 tttaaacgcc caaaccttaa gacttatgtt taaacttacc agagaacaag ctcgacaaat 4740 agttaaacaa tgcgccaact gcataacgta tttacctgtt ccccatctag gagttaaccc 4800 ccgaggactc atccccaacg aaatttggca aatggacgtt actcaccact tagaattcgg 4860 tcaactaaaa tatatccatg tatgcataga cacctatagt ggattcatca gtgcaactct 4920 ccaaacagga gaggccacca aacatgtcat agctcattta ttacactgct tttctatttt 4980 aggaataccc aaacaaatca aaacagacaa tggccccggt tatatagcca aaaccttctt 5040 acaattctgt aataccctac aaattaaaca taccacaggc attccctata atccccaagg 5100 acaaggtata gtagaaagag ctcatctgtc attaaaaact gttatcacaa aattaaaagg 5160 ggggagctgg taccccgtga agggtacccc cagaaacata ctcaatcatg ccctgtttat 5220 ccttaatttt ttaaatttgg acagtcatgg aaaatcggct gccgaccgtt tctggcatcc 5280 tgaatctcaa aaacagtttg caatggtaaa atggaaagat ccacttgata gttcatggca 5340 tggccccgat ccagttttaa tttggggaag aggctcagta tgcatcttct cacaaaaaaa 5400 tgatgcagcc agatggctgc ctgaaagatt ggtaagacaa ataaatcata accattgtca 5460 gtccagggaa gataaatctc cctgagaagt tctttccttc ttgtttttca gaaaatgaag 5520 cctaacatga gattcctttg gagaataatc gctctatata acatagtgac agtctatgca 5580 ggttttggtg accctcgtaa ggcaagagaa ttattacgaa aacaatacgg ccagccttgt 5640 gactgcagag ggggacaagt atctgaacct ccgtcagaca gaatcaccca ggtgacttgc 5700 ncgggcaaga cagcttacct aatgccaaac cagttatgga aatgtaagtc taccccaaga 5760 gatacctcac ctagcgggcc gctcctagaa tgcccttgta gctctttcca 
atcttctgta 5820 catagttcct gttatacctc ctatcaacaa tgcaaatcag gcaatagaac atactatacg 5880 gccacgttac taaaaacaca aactggaggc accaatgacg tacaagtatt aggatccact 5940 aataaacttg tacaatctcc ttgtaacggc caaaaaggaa agcctgtttg ttggagcact 6000 accgccccca ttcacatttc tgatggagga ggcccattag atactgcaag aattaaaacc 6060 gtccagaaaa aattagaaga aattcataaa gctctatatc ctgaacttca atatcaccct 6120 ttagccctgc ctgagcttag agataatttt aggctcgatg cccaaacctt cgatatcctc 6180 aatgctactt acaatttact tcaaatgtcc aatacaagcc tggcccacga ttgttggctt 6240 tgtcttaaaa tgggcccccc tattcctcta gccataccta acctttcatt gccctatgtc 6300 aattactcaa atgaatcctt agtaaataat tcctgtccta ttaccccccc cctcttagtt 6360 caaccgatga cgttttctaa ttcctcttgc ctcttttcac catcatataa taacactaaa 6420 gaaatcgatt taggctacgt tgtgttcggc aactgtacct ccataatcaa tgccaccaac 6480 cctttgtgtg ctgtaaatgg ctcggttttc gtttgtggaa acaacatggc atatacttat 6540 ctacctacaa attggacagg gctttgtgtt ctggccactc ttctccccga cattgatatc 6600 atccctggag atgaacctat ccccatcccc gctattgaac attttattta tagaccgaaa 6660 cgagccatac aatttattcc cttgttggct ggactaggaa tcaccactgc ttttactaca 6720 ggagccacag gcctaggagt ctcactaacc caatatacta aattatccaa tcaattaatt 6780 tcagatgtac agaccttatc cagtactata caagatctac aagaccaagt agactcgtta 6840 gctgaagtag ttctccagaa tagaagaggt ctagatttgt taacggcaga acagggaggg 6900 atatgtttgg ctttacagga aaagtgctgt ttttacgcca acaaatccgg aatcgtcaga 6960 gataagataa agactttaca agaagaacta gaaaagcgca gaaaaggcct ggccgccaat 7020 ccactgtgga ctggactcga tggacttctc ccctatctcc tgccatttct tggtccttta 7080 cttactcttc tactcttcct cactctcggg cctataatcc ttaataagct tatggcattc 7140 gtcagacaac aaatcgaggc cttccaggcc aaacctatac aggtccatta tcatcgcctt 7200 gagatgactg aaaatggtga gtcttatttg ccttaataag accacctccc ctgtgtgctg 7260 aactggacag tcaatgacgg gtaagaggac actatctcca tcggagccta agacaggagg 7320 gccgcccttg ctgctgcctt atcccatgac gggcctagaa ggtggggatg agttgaccca 7380 acctaagaca ggcgcagttc ccgaggggtt ttctcatcat aaaatatata aaaaggggga 7440 cc 7442 // ID MacERVK1 repbase; DNA; PRI; 7080 BP. XX AC . 
XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Cercopithecidae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; LTR; KW MacERVK1. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-7080 RA Smit A.F.; RT "MacERVK1 - ERV2 Endogenous Retrovirus from Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 3% subst ORFs: gag 331-1851, pro 1707-2624, pol 2525-5323, env CC 5275-7068. POL 52% id/69% similar to MYSERV POL. XX FH Key Location/Qualifiers FT CDS 331..1851 FT /product="MacERVK1_1p" FT /note="gag." FT /translation="MPLTLWTLCSSITQALELLETDSERGDTATGGEASEA FT LERNDPREEHIYAPVVEDKELEKELMPPSSEGNSDLQRILEMLQQLLKLQG FT TAAPVSPISPPRAFPVTFPSAPLEKDFPLPPPPTSLPMSGQVFSVPLPPVG FT EKKESKEKDDFEELDLFPITRAAFGPNAQFPNGGQNVTFTALQFKFLKEMK FT AAISNYGPQSPFVLGLLDSFSSEHMMIPIDWETLGQAVLDRSQWLQLKSWW FT WEEARVQARKNATRNPPGPTEEQLTGSGQYATLNAQAGLDDIALTQIKALF FT IKAWTKVEIAGKTSLSFVKILQGATEPYPDFVARLQDAVLKTVGSGPAAKI FT LLDTLAFESANPECQKLLRPLKASGADLXEYIRACAGVGGAIYNAQLFAGA FT LSKALKGNSKQGICYQCGKPGHFKKECRKKLGSSALKEKRLPSDMCKRCGK FT GRHWTNECRSKTDKSGNVLSPLSGNGSRGPQAWGPNNYNAGLPQYPVSSGL FT QHQGMTSSPP*" FT CDS 1707..2624 FT /product="MacERVK1_2p" FT /note="pro." FT /translation="KWECIVTPLGKREPGPAGLGPQQLQCRPTPVPSEQWL FT TTSRNDLESSLKTSYPTPVSQLYAATKQSAAADLAIVQPYTLSPNGGIYKL FT ATGVCGPLPKGHVGLLLGRSSSAMRGLMVVPGIIDPDFTDEILIMVQVSQF FT MRLEAGERIAQLLLLPFFPFLSRDVSRQGGFGSTGKTVFWETLVSDQKPLC FT SLQINGILFEGLVDTGADVSIICLAQWPDHWKKKQVSVTLSGLGTASIVYQ FT SVEPLSCVGPEGQQGKVFFYIVPININLWGRDLLQQFGAFLSIPHISSAAK FT NMMFKMGYNPLNS*" FT CDS 2525..5323 FT /product="MacERVK1_3p" FT /note="pol." 
FT /translation="SFTTIWCLFKHSSYFFSCQKYDVQNGLQSFKLLATVH FT KPQALQLKWKTTTPVWVEQWPLSKEKLKALKNLVKEQLAEGHIEPSTSAWN FT SPVFVIKKKKKKGKWHLLTDLRAVNTCIQPMGTLQPGLPNPALIPQNWKLM FT VIDLKDCFFTIPLQPQDCEKFAFTVPEYNNGQPTQRYQWKVLPQGMLNSPT FT LCQEFVHRALNPVRCQFPTVLIYHYMDDILLATPDEVLQGQVFQVLQQQLT FT QYNLRISSEKIQTQFPIQYLGYLLAEKHIRPQKVQIRRDQLLTLNDFQKLL FT GDINWLRPILGIPTYKLRHLFATLEGDSDLNSSRQLSPVAEEELSFVENRI FT SEAHLDYIAPNLPLSLCLFHTPHSPTAVLHQDNNILEWLFLANKAIKKIHP FT YIDKLAELIIKGRHRARQLLGADPIKIITRLSNQYIESLLETHEGWQVACA FT EYTGSFSQHYPSNKLFAFLQQYSLLPYNPISYTPVKGPTFFTDASSNGKAG FT YWTSDNSKISHYPFESVQQGELFAILMVLQDWPQTPCNIVSDSQYAVYVTK FT YISQTSLPLFPQTPLQKLFSLLFVTLTSRTAPLFITHIRSHSALPGPLSFG FT NSQIDSLLIGSVQRAQDEHQLHHTNSLGLQRRFSITRRQARAIVRACPSCA FT PIIAPFLSPGINPRGIQQNHIWQMDVTYVPSFGRLKYVHHTIDTWSHFQWA FT TPLPSEKADSVITHLLTCFAVMGVPAELKTDNAPAYCSRKLAAFLSLYHIR FT HSTGIPFNSQGQAIIERANATLKLQLLKQKGGDGRDSPKHQILKALFTLNF FT LNHWRQLQQSAAVKHFQQPLEKPQNSNLWVYYPSLQGKYLKGKVLQWGRGY FT ALVRTGAGDEWFPSRRLKQCHGEEEPIRRVPEGIPGAKSERSENIHLRNST FT CGAPDMGTTEKVKKLKGLFNKLGSPKPH*" FT CDS 5275..7068 FT /product="MacERVK1_4p" FT /note="env." 
FT /translation="EAEGVVQQAGQPKTPLTLFLAMLAVVNCQVTAGEFTY FT WAYIPFPPLYQGVAWGDREVPVFTNDTAWMPSPFLNQDPELDTGTVNSSYQ FT FGVEGLPICLGGSPHCLHLSHEAWAVRYNHSHISAFTMIIVAQSFKYNHTA FT VLNETLPSTLSLCPIPDVSGGVSQLEWTRCRSSGPRLLLEVKGKSRKYLVT FT DWSVHGDFQTKFSHVNLRWHKGNHSISADGNETIIWHDGGLSPPMPHLANT FT SQIQGHIWKLLAAGKPMFTFTGNMSLNLTNIANPFHISLHRNSSRYVIACV FT RKPYLLLTGIFKWDNNTGVVNCTNNCTFLSCINTTWWNNNWNESHSDLYIL FT RARKETWLPVNLTRTWSESAGVTQIYKVMQDLVHRSRRMVGIVVTAVIGLV FT AIASTAAVAGLALHQSIQNAEFVQQWHEQSHLLWQQQRDIDAHLTERVDNL FT EQVVSWLGDQLTVLNTRALLKCDWNTTQFCITPVPFNSTVHNWTEIKRLLI FT GHNNLSLEIQELTQNISETFRNQLPLLTGADLMTGIAQSLTSLNPMSPVKT FT LLTSVSSNVLIVVLAFVIFTVCWRRCQRANTESQRAQHVMMVLKEIQTCK* FT " XX SQ Sequence 7080 BP; 2028 A; 1455 C; 1452 G; 2144 T; 1 other; tctggcgcag cgagcagggt ccgtggctcc acgagagcaa ggaaccggag ggggaacacc 60 ccgggcaggt aaataaagaa agggggaccc acgggaaaat tatggggaat tccacctctc 120 tggcctctga atatttgcgg ttactccagg gactgttgat gtctatagga gtagaagtta 180 gttaaggaac ataagacttt gaagcgattg tttgcccatg ttgaacaaca ttgttattgg 240 tttcaatatc aaaccaaagt gcagttaaat cgaaaggaat ggttacaagt ggttaaagtc 300 ctgcgtcgag ctcatcagcg agggcaaacg atgcctctga ctttgtggac tttgtgtagt 360 tccattactc aggcattaga gttacttgaa actgactcag agcgtggcga tactgccaca 420 ggaggtgagg catctgaagc tttagagagg aatgatccta gagaggaaca tatttacgcc 480 ccggttgttg aggacaagga attggaaaaa gagctaatgc ctccttcatc tgaaggaaat 540 tctgatttgc agcgcatttt agagatgtta caacagttgt taaaattgca aggtaccgct 600 gctcctgttt ctcctatctc tccgccacga gccttccctg ttaccttccc ttctgctcct 660 ttggaaaagg atttcccttt gcctccacct ccaacctctt tgcctatgtc aggtcaggtt 720 ttctctgtgc ctcttcctcc tgtaggagaa aagaaggaat cgaaggaaaa ggatgatttc 780 gaggaattag atttattccc aataacacgg gctgctttcg gccccaatgc tcagttccca 840 aacggtggcc agaatgtgac ttttacagcc ctacaattta aatttttgaa agaaatgaaa 900 gctgcaattt ctaattatgg acctcagtca ccgtttgttc ttggccttct cgattccttc 960 tcttcagaac atatgatgat tcctattgac tgggaaacgt tgggacaggc tgtccttgat 1020 cgttcgcaat ggcttcaatt aaaaagttgg tggtgggagg aagcaagagt acaagctaga 1080 
aaaaatgcta cccgaaatcc cccaggacct actgaagagc agcttacggg ttcaggacaa 1140 tacgccactc ttaatgctca ggcaggccta gatgatattg ctctcactca gattaaagcc 1200 ttgtttataa aagcgtggac taaggttgaa atagcgggta aaacttcttt atcttttgta 1260 aagatcctac aaggagccac tgagccgtat ccagattttg tagcccgtct tcaagatgct 1320 gtattaaaga ctgtaggttc aggccctgct gctaaaattc ttttggatac tctggctttt 1380 gagagtgcta atcccgagtg ccaaaaattg ctgcgtcctc taaaagctag tggagctgat 1440 ttagntgaat acattcgggc ttgtgcaggt gttggaggag ctatttacaa tgctcaattg 1500 tttgctgggg cactttctaa ggctttaaaa ggcaattcta aacaaggtat atgctatcaa 1560 tgtgggaaac ccggtcattt taaaaaggaa tgccggaaaa aattaggctc ttctgctcta 1620 aaagaaaaaa ggttaccatc tgatatgtgt aaacgatgtg gcaagggtcg acattggacc 1680 aatgaatgcc gttcaaaaac tgataaaagt gggaatgtat tgtcacccct ctcgggaaac 1740 gggagccggg gcccgcaggc ttggggcccc aacaactaca atgcaggcct accccagtac 1800 ccagtgagca gtggcttaca acatcaagga atgacctcga gtcctcctta aagacatctt 1860 accctacccc agtttctcag ctttatgctg ccactaagca gagtgccgct gctgacctag 1920 ctattgtcca accttatact ttatctccta atggaggaat atataagttg gctaccggag 1980 tgtgtggtcc tttgccaaag ggacatgtag gacttttact aggtcgaagc agcagcgcta 2040 tgcgggggtt aatggtggtc ccaggaatta ttgatcctga ttttactgat gaaatcctta 2100 ttatggtaca agtctcacaa tttatgcgcc tagaggcagg ggaacgtatt gcacaattgc 2160 ttttattgcc tttttttcct ttcttatcta gagatgtgtc tcgtcaagga ggtttcggta 2220 gtactgggaa aactgttttc tgggaaactt tagtttctga tcaaaaacct ttatgttcat 2280 tgcaaattaa tgggatactt tttgagggat tagtcgacac aggagcggat gtatctatta 2340 tatgtttggc tcaatggcct gatcattgga aaaagaaaca agtatcggtt actctatccg 2400 gcctaggtac tgcttctata gtctaccaaa gtgtcgagcc cctgagctgt gtaggacctg 2460 aaggacaaca aggaaaagtg ttcttttata ttgttcccat taatattaac ctttggggcc 2520 gtgatctttt acaacaattt ggtgcctttt taagcattcc tcatatttct tcagctgcca 2580 aaaatatgat gttcaaaatg ggctacaatc ctttaaactc ttagccactg tccacaagcc 2640 tcaagcttta caattgaagt ggaaaactac aactcctgtt tgggtggaac agtggccatt 2700 atctaaagaa aaacttaagg ctttaaagaa tttagttaag gaacaattgg ccgaggggca 2760 tattgaacca 
agtacttctg catggaattc ccctgtgttt gtcattaaaa aaaaaaaaaa 2820 aaaaggaaaa tggcacttat taactgatct tagagctgta aatacttgta ttcaacctat 2880 ggggacttta caacctgggc ttcctaatcc agctcttatc ccacaaaatt ggaaattaat 2940 ggttatagat cttaaagatt gtttttttac tatcccgtta caaccccaag attgtgaaaa 3000 atttgccttt actgttcccg aatataataa tggacagcct actcaaagat atcaatggaa 3060 agttcttcct caaggtatgc ttaatagtcc taccttatgt caagagtttg ttcatagagc 3120 cttaaatcct gttagatgtc aattccctac tgtgttaata taccattata tggatgatat 3180 tctattagca actcctgacg aagtcctaca aggtcaagtt tttcaggtct tacaacaaca 3240 attaactcaa tataatttac gtatttcttc tgagaaaata cagactcaat ttcctattca 3300 atatttggga taccttttag cagaaaagca tattcggcca caaaaagttc aaattcgaag 3360 agatcaactt ttgaccctta atgattttca aaaactgcta ggagacataa attggcttcg 3420 ccccatctta ggcatcccta cttacaaatt gcgacatcta tttgctacct tggaaggaga 3480 ttcagattta aatagttctc gtcaattatc ccctgtagca gaagaagaat tgtcatttgt 3540 agaaaataga attagtgaag ctcaccttga ttatattgca cccaatcttc cactttcgtt 3600 gtgtcttttt catactcccc attcgccaac ggccgtttta caccaagata ataatattct 3660 ggagtggctg tttttggcta ataaagccat taagaaaatt cacccatata ttgataaatt 3720 ggctgaatta attattaaag gacgacatcg agctcgtcag ttgcttggtg ctgaccctat 3780 aaaaattatt acccgattat ctaatcaata cattgaatct ctgttggaaa ctcatgaggg 3840 ttggcaagtg gcttgcgctg agtatactgg ttctttttct cagcattatc ctagtaataa 3900 attatttgct tttctgcaac aatattcttt actgccatat aatcctatct cttacactcc 3960 agttaaaggt cccacctttt ttactgatgc ttcaagcaat ggaaaggctg gatactggac 4020 ttcagacaat tcaaaaattt ctcactatcc atttgaatca gtacaacaag gagaactttt 4080 tgccattctt atggttctac aagactggcc tcaaacccct tgtaatatag ttagtgattc 4140 ccaatacgct gtatacgtaa ctaaatatat ttctcagact tctttaccat tatttcctca 4200 aactccttta caaaaattat tttctttatt atttgtaact cttacttccc gcactgcccc 4260 cctttttatc actcatattc gttctcattc agctttgcca ggacctttat cttttggcaa 4320 tagtcagatt gattctttac ttattggcag tgtccaacgg gctcaggatg agcaccaact 4380 acatcatact aatagtttag gattacaacg gcgattttct ataacacgcc gtcaagcccg 4440 agctattgtc agagcctgcc 
catcttgtgc tcctattatt gcaccttttt taagtccagg 4500 tattaatcct cgaggcattc aacaaaatca catttggcaa atggatgtaa cttatgttcc 4560 ttcctttggt cgtttaaaat atgttcatca tactattgat acgtggtcac atttccagtg 4620 ggctacgcca ttaccctctg aaaaggctga ctctgttatt actcatttac ttacttgctt 4680 tgcagttatg ggagtccctg ctgaattaaa aactgataat gcccctgctt attgctctcg 4740 caaattggct gccttcctct ctttatacca tattcgtcat tccactggta ttccatttaa 4800 cagtcaaggt caagcaatta ttgaaagagc caatgctacc ttaaaattac aacttttaaa 4860 acaaaaaggg ggagatgggc gagattcccc aaaacatcaa attctgaaag ccctttttac 4920 tttaaatttt ctcaaccatt ggcgccaatt acagcagtct gcagcggtga aacattttca 4980 acagccttta gaaaagccgc aaaacagtaa tctgtgggtg tattatcctt cattacaagg 5040 gaaatattta aaaggaaaag ttttacaatg gggccgaggc tatgctcttg tccgtacagg 5100 tgccggagac gagtggtttc catctcgacg gttgaaacag tgccatggcg aagaggagcc 5160 gatccgaaga gttcctgaag gaattccagg agctaaatct gagcgctcag agaacattca 5220 ccttcggaac tcgacgtgcg gagcccccga catggggaca actgaaaaag ttaagaagct 5280 gaaggggttg ttcaacaagc tgggcagccc aaaaccccac tgactttatt cctggctatg 5340 ctggccgtag taaattgtca ggtaacggct ggagaattta catactgggc ttatattcct 5400 tttccaccct tatatcaagg tgtggcctgg ggagacagag aagtcccagt atttactaat 5460 gatactgctt ggatgccatc gcctttttta aaccaggatc ctgaattaga cactggcaca 5520 gtaaatagtt catatcaatt tggggttgaa gggttaccta tatgtcttgg gggtagtccg 5580 cattgtctgc atctttctca tgaggcttgg gctgttcgct ataaccatag tcatatctct 5640 gcctttacta tgattattgt ggctcaaagt tttaagtata atcatactgc agtacttaat 5700 gaaactctgc caagtacatt atctttatgc cctatccctg atgtgtccgg aggtgtttcc 5760 caactagagt ggactagatg tagaagtagt gggccacgat tgttgttaga agtaaaaggg 5820 aagagtcgta aataccttgt tacagattgg agtgtacatg gagattttca gactaaattc 5880 agccatgtca acctgcgttg gcataaaggt aatcacagca tttctgcaga tggcaatgaa 5940 actattattt ggcatgatgg tggtctatca ccccctatgc cacatttggc taatacctca 6000 caaatacaag gtcatatttg gaagttgcta gctgctggta aaccaatgtt cactttcacg 6060 ggaaacatgt ccttaaatct tactaatatt gctaaccctt tccatatatc tttgcatcga 6120 aatagttcta gatatgttat tgcatgtgta 
agaaagcctt acttgttgtt gacaggaatt 6180 tttaaatggg ataataatac aggggtagtt aactgtacaa acaattgtac attcttaagt 6240 tgtataaata ctacttggtg gaataataat tggaatgagt ctcattccga tttatatata 6300 ctaagagcca gaaaagaaac ctggctacct gtaaacttaa cacgtacttg gagtgaatct 6360 gctggggtta ctcaaattta caaggttatg caagatcttg tccaccgcag tcggaggatg 6420 gttggaatcg tcgtgaccgc ggtgatagga cttgtagcaa ttgcttccac cgccgctgtt 6480 gctggattag ctctacacca gagtatacaa aatgcagaat ttgtgcaaca atggcatgaa 6540 caatcacatt tgttgtggca gcagcaacga gacatagatg ctcatttgac tgaacgagtg 6600 gataatctgg agcaagtggt ttcttggttg ggagatcagc taacagtgtt gaatacccga 6660 gctttgttga aatgtgattg gaacactacc caattttgta ttactcctgt tccttttaac 6720 agcactgtac ataattggac tgagataaag agacttttaa ttggtcataa taatctttcc 6780 ctagaaatac aagaattaac acagaacatt tctgaaacat ttcgcaatca gttaccgttg 6840 ctgactggtg cagatttgat gactggtatt gcacaaagtt tgacgtcttt gaaccctatg 6900 agccctgtaa aaactttgct gacctctgtt tctagtaatg tattgattgt tgttcttgcc 6960 tttgtcattt tcacagtctg ctggagacgg tgccaaaggg caaacaccga atcccagcga 7020 gcccaacatg taatgatggt tttaaaagaa attcaaactt gtaaataagg aaaggggaaa 7080 // ID ERV2N1-Mim_I repbase; DNA; PRI; 4944 BP. XX AC . XX DT 11-NOV-2009 (Rel. 14.11, Created) DT 11-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Internal portion of a retrovirus-like element. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV2N1-Mim_LTR; ERV2N1-Mim_I. XX NM ERV2N1-Mim_I. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-4944 RA Jurka J. and Walichiewicz K.; RT "ERV2-like non-autonomous endogenous retrovirus from the mouse RT lemur."; RL Repbase Reports 9(11), 2836-2836 (2009). XX DR [1] (Consensus) XX CC >98% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 4944 BP; 1328 A; 1078 C; 1073 G; 1457 T; 8 other; ggcggtggcc cgtataggga accctctctc cgagackgcc ttcggtggta cwtggtgaca 60 mggggattcg gtgaaggccg ctgcccgtgg aaggccggct gtkacakaaa gaaaggtgwg 120 ttgggcagag agtgtaattg ataggagtaa gaaattatgg gacaaggaac tagtaggatg 180 ttgtttgtac aagtgttgaa gactatgctt cgggcccgca gtgttaaaat aggaaaaaag 240 cagttggaaa atttttcaag tttttagtag aagtatgtct tggtccctga ggaaggaacg 300 gtaaatttcg atacatggaa gaaggtagga gttaaattgc aggactatta ctcagcccac 360 ggacctgaga aggttccggt ggacgctttt tatctttgaa atcttattcg ggattgtcta 420 aacccccggc atactcgtat caggcacata acctttaagg aattagaaaa tctaaaaaat 480 atgatgcaac aacttcttat tcaaaatgct gctataggct tgttattagg tagaagcagt 540 atcactttga aaagcttact ggtgacaccg ggaattatag atcaggatta tacaagagaa 600 attaaagttt tgtactcctc tcctagctct atttctgttg tcaatcctgg ccagcgcatt 660 gctcagcttg ttttgatttc tttgatatcc cttggtaaag tacaatcgca acaagccagg 720 gtaccctctg ctttttgctt ctcagatgct tactggatgc tattactagg aataagcctg 780 aaatggtttt gtgggtgaat ggcaaggcat ttaaaggaat tgtagatact gaagctgatg 840 tttctgttat cgctgagcaa cactggccct cctgtggccc aagcaggaag ccgttggttc 900 gctccaaggt ataggctaag cccgtaatcc tgaacagtag tgaattgctt acctgtgttg 960 agtagggaca ttcggacaat tccaaccata tgttcgactt gctctacccg ttaatctttc 1020 gggaagagat tttatgcagg ctatgagagt ttatttgtat agcccgaact tgtgccaaag 1080 atttgttgcc aatgctttgc tactagaatc aaatatccta ctgtatatat catccactat 1140 atggatatgc agatgaagga ttcttactgc aagtttttga tcatgaaaca acacaattaa 1200 acagggcctt gtcattgctc aggaaaaggt gcagaggcac cctccttatt gttatttggg 1260 ataccaactg cataaatacc attttattaa ccaaggaatt caacttcgta aagatggctt 1320 aaaaactctt aatgatttcc aaagacttct gggagatatc aattggattc gaccctattt 1380 gaaaattacc actggagacc ttaagcctct ttttgacatc ttamagggcg atccaatcct 1440 aattctcctc gtcaattaac ccctgacggg 
aggcaagcct taaaattagt agaacaggaa 1500 ttgtcccgcc aacatgttcc ctatgtaaac tacaacttgg aatgggctgg ctatgtttta 1560 cacaactctc atactcccac agcagtcctc tatcaatggg gaccactgat gtggctacat 1620 ttgccctcct atccctctaa agttctaact ccttattatg agatggttgc taccttaatc 1680 catatgctgc gttcagaatc ctgcaaatta ctaggtaaag agccacactt ttttgtgttc 1740 ctttttctag tctgcagcag gagtggttgt tccagcatag tgactcctgg gccatagccc 1800 tggccaatta ccctggaaaa attgataacc attacacccc tgataagttg ttacattttg 1860 ctagcctaca tccgtttatt tttgttgccc aagtgtctcc tgtacccctt gacaatgctg 1920 ttctgatttt tacagatggg tcttctaatg gtatggagtc tattcagtta acaatgacat 1980 taaatcttgg cacactgggt cttcttcgtc tcaagaggtc gaattgcagg cagttttttc 2040 agctttagag gccatccctg ctacgcctgt taacctttat tcagatagtc attatgtgat 2100 tcgggcactc caagtcatag agaatgtgcc ctttattggg acttctaata gtaatgttca 2160 gaaattgttt cgtgccttgc aagccctcat tcactcacgt acaggaagat gtttctttgg 2220 gcaccttcgt gctcattccc acctgcctgg tcccctcggc caaggaaatg aaattgtaga 2280 tcttgccact agaaccaaac cccttcttat tttgttaagc gcagttagca cagcagtccc 2340 atgctttgca caaccaaaat agcagcgctc tcaagcaaca gttcaatatc wctagagaag 2400 cggctcgaaa aattgtaaaa gcttgctctt cctgttctca gttactccct gtcccccatt 2460 atggtgttaa taccagaggc cttttatcta accacttatg gcaaatggat gtgacgttta 2520 ttacttctct gggttgatta aaatatgttc atctgaccat agacacttat tctggatttt 2580 taacggccac atttcaatta ggcgaggctg gtaaacattg tgtagcccat tgcctccgat 2640 gctttgcatc aatgggccag cctaattgta taaaaacaga caatgaccct ggatacacgg 2700 gtgacaaatt tcaaacattc ttaccgaaaa tggggatcag tcataaacca gaaaattctt 2760 ataacgccca aggacaaagc attgtagaac gtgcccacta gactctcaaa aatcaacttt 2820 tgaaaatgaa aaaggtggat ctgtaccacc tcacgccccc cccccccaga attatctaaa 2880 ccatgatctc gttattttaa attttttaat tttggacaaa gaagatcgtt cagcagccca 2940 gcggttttgg tgtactaatt ccaacaaaga cacgccgttg gttgggtgga aggatccact 3000 gactagccag tggtcaggac cggatccagt tgttatttgg agtcgaggtc atgtttgtgt 3060 atttcaacaa gatgctgagg gcccgttctg gctcccggag aggctggtaa gccaggcagg 3120 atccctccat agaaagcagg atgaagacac agaagctgca 
ggtcgaccag caacctccag 3180 ttaggatgac aactagacaa gctctgctgc cgtcttggag tcagattgag acttactgag 3240 attgcttcca agctggttcg agcgacagga cagccattga attctttgac cttgttttta 3300 gctatggtaa ccctgctatg gacgcctatt ggacttatat atcagacccc gttttactcc 3360 acccggtggg ctggggagat cgtgttgttc ttatgtcagt gatcccaggg ctctgggagc 3420 acccgctaat gatcacatcc agcatgctaa ggtatctgct tataattata ccggactcag 3480 ccctgatgtt cctatctgtt tccaccgtga aggttatttt ccaggctgtg ttcttttacc 3540 tacttatgta tattacaata taaatggtgt taaatggacg tctatgtcaa gtttagacag 3600 gcgattttct ggccctctta tttgccaccg ccgcccggaa tacttcgctg tgaaaaggag 3660 ctggttgctt ctactctgca cgtgccgtgg agagtttccg tgaaccaact gcgactaaga 3720 ttcctattct cgggacgact cagagcatat atgattggac cactgcaata actcacaatg 3780 ggtatttggg agacaagggt tgtgttgagg ggctatggat aactccttca aaggtatacc 3840 agactagtct ctgcaaactt gttgcaggaa ctggtcaatt ctttgcaaga gtgtggtgga 3900 ctctacccca aaaatacgcc taaggactac aggaactggt caatcttctg caagagagtg 3960 cggtggactc taccccaaaa atacgcctaa cgactatttc tgcctgtgtc cctgaacttt 4020 tgtattcatg gtaggcaatg tgaatattcg tactgtcact catggttttg aaatatcatg 4080 ccttaattgt caactaacta attgtataca atcccttgcc tattctgata gaatggttgt 4140 gttataccgc cccgcctttg tggtggtccc agctaatgtc tctggtccat ggtatgatga 4200 taaaggattg caaatgtgga aggaggtgaa cgctattttg ctaaggccta aaagatttat 4260 tgggctactc ataactggca ttgtagctct tgttactgta atagcatccg ctgctgcggc 4320 tgcagtggcg ttgacacaag agatacaaac tgcccattat gtcaataatc tctctaaaaa 4380 tgtcactcag gcattgggaa cccaagagca tattgataaa aagattgaag acagactgga 4440 cgctttgtat gatgtggtcc agattctggg agaggaagta caagggcttc ggctacgcag 4500 ccagctccgc tgccatgaca attatcagtg ggtttgtgta actcccaaac cttataatga 4560 gagtaactat aattgggcta aggtccaaaa tcacctggca gggatctggc actccgcaaa 4620 tgcatcttta gatttgctcc aacttcatca agaaataaca ggcatgttgg agactcctcc 4680 ccttgacact agcattgcta cagaagcccg agattttctc aatcaactct taggacatgt 4740 tccatctttt ggaaatttta aaagcttatt ctttatgatt attggggctt ttgttcttct 4800 cttgctaatt ttgtgtgtta cgcctctgct actgcgactg attgttaaga 
atattttggc 4860 tgttaaagct gccatacact ccactaactt gcaaatgaaa gcttaccagc gaccctccaa 4920 ttaattataa agggaaggga gaga 4944 // ID L1B_Mim repbase; DNA; PRI; 5342 BP. XX AC . XX DT 07-JAN-2010 (Rel. 15.03, Created) DT 07-JAN-2010 (Rel. 15.11, Last updated, Version 4) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1B_Mim. XX NM L1B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-5342 RA Jurka J.; RT "Non-LTR retrotransposons from the mouse lemur."; RL Repbase Reports 10(3), 476-476 (2010). XX DR [1] (Consensus) XX CC >92% identical to consensus. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. 
XX FH Key Location/Qualifiers FT CDS 60..1061 FT /product="L1B_Mim_1p" FT /translation="MRKNQRKITGNMKDQREKSPPKENTRSPSXDTNLHDM FT IKLTEEXFRIWIAXKLNGIEEKIESQHRETTRTXQEMNERFSKEIETIQKN FT QTEILAMKETIKDLHNXVESLKNRMDHAEERISELEDYAYALNKSEEEREH FT RNKRQDQSLQEVWDYVKKPNIRLIGIPEGEEEHSQGLENLFHGILEENMPG FT LARNLDIQIQEAHRTPGRLNVKRQSPRHVVXRLTKVNVKEAILRAARRKQQ FT MTYKGKPIRLTADFSSETLQARRDWVPILNLLKQNKAQPRILYPAKLSFIY FT EGEIKSFSDKQSLKEFAKTRPXLQEVLRPAFLTEQGSRHSSK" FT CDS 1101..4976 FT /product="L1B_Mim_2p" FT /translation="MMAQGVRQSNKTPPNNMNGNLPPISILSINVNGLNCP FT LKRHRLAEWIKIHKPSICCLQETHLTHKDAXRLKVKGWKTIIQSNGSQKKA FT GVAILFADNISFKIAKVKKDKDGHXIMVKGKIQQEDLTILNIYAPNAGAPN FT YXKQTLSNLNTLLHNTAIVAGDFNTPLNDLDRSSKQKISKEIMDLNKALDQ FT KGLTDLYRAFHPNKLEFTFFSAAHGSYSKIDHILGRKSDLKKFKKIEIIPC FT IFSDHNGIKLQFNSYRNTQXLTKSWKLNNLLLKNYWVKEEIQREIENFFEQ FT NDNGDTSYQNLWDTAKAYLRGKLIAINAHIQKTESLDTDNLMNKLKELEKE FT EQTISNPNRRKEITKIKAELNEMENKRTILKINKTRSWFFEKINKIDGPLA FT RLTRTQRERTLINSIRNEKGEITTDTTEIQNIIFDYYKKLYAXKLQNEDEM FT DKFLDSYNLPKFTQEATEFLNRPISSSEIEAVIKNLPKRKSPGPDGYTSEF FT YQTYKDELIPILQKLFHTIEKDGILPNSFYEANITLIPKPGKDATKKENYR FT PISLMNTDAKILNKILANRIQQHIKKIIHHDQVGFIPGMQGWFNIRKSINA FT IHFINKIKNKDHMILSIDAEKAFDKVQHTFMIKTLNKIGIDGSYLKLIKSI FT YDKPTANIILNGEKLKSFPLRSGTRQGCPLSPLLFNIVLEVLAIAIRQERG FT IKGIQVGADEIKLSLFADDMILYLENPMDSSKRLLDLITEFGKVSGYKINI FT HKSEAFIYAKNHQAETQIKNAIPFTIAPKKIKYLGVYLTKDTKDLYKENYE FT TLKKEIAEDLNRWKNLPCSWIGRINXVKMSILPKVIYRINAIPIKIPSAFF FT TDLEKIILHFVWNQKKPRIAKAILSKKNKLGGISLPDFKLYYKAIIVKSAW FT YWHKNRSXDIWNRSEIPEMKPSVYGNLIFDKADKNIXWGKESLFNKWCWEN FT WLATCRRANQDPYLSPLTKIHSRWITDLNLRHETLRILEEDVGKTLSDIGL FT GKEFLRKTPKAITAASKINKWDLIKLKSFCTAKETISRANRQPTEWEKIFA FT LYTSDKGLITRIYLELKRINKKKSNNPIKKWATEMNRNFSKEDRIMACKHI FT KKCSTSLIIREMQIKTTMRYHLTPVRMAYIKKSQNNKCWRGCGETGTLLHC FT WWDCKLVQPLWKRIWRYLKQLEIEIPFDPAIALLGIYPKEHKTFYYKDICT FT RMFMAAQFTIARSWKQPKCPSIHEWIIKMWYMLTMEYYSILRNDGELAPFM FT LSWIKLKPVIQSEATQDXENGLHIYSPSNWY" XX SQ Sequence 5342 BP; 2099 A; 1094 C; 998 G; 1127 T; 24 other; ggccagaatt tagctcataa tccantccct 
gcacctccgg tcctcgataa agcatccaga 60 tgagaaagaa ccagcgaaag attacaggaa acatgaagga ccagagagaa aagtcacctc 120 caaaggaaaa tactcgttct ccatcanctg acaccaactt acatgatatg attaaactga 180 cggaggaaga nttccgaata tggattgcta naaaactnaa tggaattgaa gagaaaatag 240 aatcacaaca tagagaaacc acaagaacaa tncaggaaat gaatgaaaga ttctccaaag 300 aaattgagac tatccagaaa aaccaaacag aaattctggc aatgaaggaa acaatcaagg 360 atctccataa tncagtggaa agcctcaaga acaggatgga ccatgcagag gaaagaatct 420 cagagcttga agattatgcc tatgcgctaa acaaatcaga ggaagaaagg gaacacagaa 480 acaagagaca agaccaaagc ttacaggaag tgtgggatta tgttaaaaag ccgaatatca 540 gattgattgg gattccggaa ggggaagagg aacattcaca agggctggaa aacttatttc 600 acggaatact ggaggaaaat atgccgggcc tggccagaaa tcttgatatc caaatacaag 660 aagcacacag aactcctggg agactcaacg tgaaaaggca atcacctcgc cacgtggttn 720 ttaggctgac caaagtaaac gtgaaagaag caattctccg tgcagcgaga cgaaagcagc 780 aaatgaccta taaaggnaag cctatcagac taacagcaga cttctcatct gaaaccttac 840 aagccaggag ggattgggtg cctatcctta atcttctaaa acagaacaaa gcccaaccta 900 gaattcttta tccggcaaaa ttaagtttca tctatgaggg agaaataaag tccttctcag 960 acaagcaatc actgaaggaa tttgcaaaga ccagaccanc cctacaggaa gttctcagac 1020 ccgcatttct aaccgaacag ggcagtagac actcctcaaa gtgaaatcgt caaagaatta 1080 aagtttagat ctcgaactac atgatggctc aaggagtaag acaaagcaac aagactccac 1140 ccaacaatat gaatggtaat cttcctccaa tttcaatcct ctcaataaat gtaaatggct 1200 taaactgtcc tctgaagaga catagactgg cagagtggat aaaaatccac aagcctagca 1260 tctgctgtct acaggaaaca catctaaccc acaaagatgc ctnccggctg aaggtcaagg 1320 gatggaaaac tatcatccag tcaaacggaa gtcaaaagaa agctggggta gctatactat 1380 ttgcagataa cataagcttt aaaatagcaa aagtaaaaaa ggataaagat ggccattnta 1440 taatggtgaa agggaagatc caacaagaag atttaacaat tcttaatatc tatgcaccca 1500 atgcaggagc acccaattac ntaaagcaaa ccttgtctaa tctaaacacc ttgttacaca 1560 acactgccat agtagcaggg gacttcaaca ctccactgaa tgatctggat agatcctcca 1620 aacagaaaat aagcaaagaa ataatggacc tgaacaaagc ccttgatcaa aaaggtctga 1680 cagatctcta tagagcattc catccaaata aacttgaatt tacattcttc 
tcagcagccc 1740 atggatccta ctccaaaatt gatcacatcc taggccgcaa atcagatctc aaaaaattca 1800 agaaaataga aattatacct tgtatcttct ctgaccataa cggtataaaa ttacagttca 1860 attcctatag aaacactcaa cncctcacaa aatcatggaa actaaacaat ctattattga 1920 aaaattattg ggtaaaggaa gaaattcaga gggaaatcga gaatttcttc gaacaaaatg 1980 ataacggtga tacctcttac caaaacctgt gggatacagc aaaagcttac ctgagaggaa 2040 aactaatagc aattaacgct cacatccaaa aaacagaaag cttagatact gacaacctaa 2100 tgaataagct caaggaattg gaaaaagaag agcaaacaat ttccaatcct aatagaagaa 2160 aagaaataac gaagatcaaa gcagaactga atgaaatgga gaacaagaga actatactaa 2220 agatcaacaa aaccagaagc tggtttttcg aaaagataaa caaaatcgat ggccctcttg 2280 ctagattgac aaggacccaa agggaaagga ctctaataaa ctcaataaga aatgaaaaag 2340 gagagatcac aacagacacc acagaaatac aaaacattat atttgactac tataaaaaac 2400 tatatgcccn aaaactacag aacgaagatg aaatggacaa attcctggat tcatacaacc 2460 tccctaagtt cacccaggag gcaacagaat tcctgaacag accaatctca agctcagaaa 2520 ttgaagcagt aattaaaaac ctccccaaac ggaaaagtcc cgggccagat ggctacactt 2580 cagagttcta ccaaacatac aaagatgaac tcatacctat actacagaaa ctattccaca 2640 ccattgagaa ggatggtatc cttcctaact cattctacga agccaatatc accttgatac 2700 caaagccagg aaaggacgca acaaaaaaag aaaattacag accaatatcc ctcatgaata 2760 cagatgcaaa aatcctaaat aaaattttag cgaatagaat tcagcagcac atcaaaaaaa 2820 taattcacca tgaccaggtg ggctttattc cagggatgca aggntggttc aacatacgca 2880 agtctataaa tgcaattcac ttcataaata aaatcaagaa caaagaccat atgattctgt 2940 caatagatgc agaaaaagca tttgacaaag tccaacacac ctttatgata aaaactctta 3000 acaaaatagg catagacggc tcatacctta aacttatcaa atccatctat gacaaaccca 3060 ctgctaatat cattctaaat ggggaaaaat tgaaatcttt cccccttcga tccggaacta 3120 gacaaggatg cccactatct cctctcctat tcaacatagt gctcgaagtc ctagcnatag 3180 caatcaggca ggagaggggt attaagggca tccaagtggg ggcagatgaa atcaaactct 3240 cgctcttcgc cgatgatatg atattatacc tagaaaaccc catggactct tccaagagac 3300 tcctagactt gataaccgaa ttcggtaaag tttcaggtta taaaatcaat atacacaaat 3360 cagaagcatt catatatgcc aagaaccatc aagcagaaac tcaaatcaaa aacgcaatac 
3420 cctttactat agccccaaag aaaattaaat atctaggagt atacttaacg aaagatacga 3480 aagatttata caaggagaac tacgaaacac taaaaaaaga aattgcagaa gatttaaaca 3540 gatggaaaaa tctaccttgt tcatggattg gtagaatcaa tatngttaaa atgtcaatat 3600 tacctaaagt gatctacaga atcaatgcaa tccccatcaa aataccatca gcattcttta 3660 cagatctaga aaaaataatt cttcacttcg tatggaacca gaaaaaacct cgtatagcca 3720 aagcaatctt aagtaaaaag aacaaactgg gaggcatcag tcttcctgac ttcaagctgt 3780 actataaagc aataatagtt aaatcagcct ggtactggca caagaacaga agcatngata 3840 tctggaatag atctgagata ccagagatga aaccatcagt atacggtaac ctaatctttg 3900 ataaagcnga caaaaatata cantggggaa aagaatctct cttcaataaa tggtgctggg 3960 aaaactggtt agctacatgc agaagagcga atcaggatcc ctacctctca cctctcacaa 4020 aaattcactc aagatggata acagacttaa acctaaggca tgaaacctta agaatcctag 4080 aagaagatgt tgggaaaacc ctatcagaca ttggcctagg caaagaattt ttgaggaaga 4140 cccccaaggc aatcaccgca gcatcaaaaa taaacaaatg ggatctgatc aaattaaaaa 4200 gtttctgcac agccaaggaa accatcagta gagcaaatag acaacccaca gagtgggaga 4260 aaatatttgc tctctacacc tctgataaag gtctaataac aagaatctat ctagaactta 4320 aaagaattaa caagaaaaaa tcaaacaatc ccatcaagaa atgggcgacg gaaatgaaca 4380 gaaacttctc caaagaagac agaataatgg cctgcaaaca tataaaaaaa tgctcaacat 4440 ctctaatcat tagagaaatg caaatcaaaa ccacaatgag ataccaccta accccagtga 4500 gaatggccta tatcaagaaa tcccaaaaca acaaatgctg gcgaggatgc ggagagacag 4560 gaacactcct acactgctgg tgggactgca aattagtgca acctttgtgg aaaagaattt 4620 ggagatacct caaacagcta gaaatagaaa taccattcga cccagcaata gcattgttgg 4680 gcatctaccc aaaagagcat aagacattct attataaaga catctgcacc cgaatgttta 4740 tggcagcaca attcactatt gcacggtcat ggaaacaacc caagtgcccg tcaattcatg 4800 agtggataat taaaatgtgg tatatgctca caatggaata ttactcaatc ctaagaaatg 4860 acggtgagct agcaccgttt atgctatcct ggattaagct taagcccgtt atccaaagtg 4920 aggcgacaca agacntggaa aatgggctcc acatctactc gccatcaaat tggtactgac 4980 tgattaaaac tatggtnctc aaatggtggt aatgctcacc agggattcgg gggggggggg 5040 gagacccaca tcttagggat gtggcgagca ttttggaggg gaagggcata actctaaccc 5100 
ttcttaggga gaggcaaaga tatacaatgt aaccaaaatg tcaaaaaaaa aaaaaaaact 5160 ttttatcggg tggcgggcag gcgggagggg ggaggaggag aggggtgtat gcttncataa 5220 cgtgtgtgat gcgcaccacc gggggattgg acacatcggg gggagggggg ggcaggggca 5280 atatttgtaa ccctaacaat atttgtaccc ccataatatg atgaaataaa agaaaaaaaa 5340 aa 5342 // ID LTR7A_OG repbase; DNA; PRI; 304 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR7A_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-304 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1678-1678 (2008). XX DR [1] (Consensus) XX CC 5bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 304 BP; 81 A; 95 C; 58 G; 70 T; 0 other; tgatagagtc aaatggaggg acccgcccaa atagaaccga cccgcccaaa tagaaccgac 60 ccgcccagga agaaatgaat gcatagaatg tcccacccag gaaagaggga atgtgccaat 120 ccctaccctt ccgacccttt aaattcccat ccaccttaac ccacgtgctg acttccttct 180 cgaactcagc tcacccgcac ccgggtggaa ataaaggccg atgttgccca cacagaacct 240 ggactccgtt tcctttttgt tcacgtttcg gaataatctt ttattttcgg tctttgacct 300 ctca 304 // ID LTR50_Mim repbase; DNA; PRI; 439 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR50_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-439 RA Jurka J. 
and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2984-2984 (2009). XX DR [1] (Consensus) XX CC >98% identical to consensus. 6bp tsd. CC Similarity to RLTR50A from mouse. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 439 BP; 105 A; 132 C; 95 G; 107 T; 0 other; tgttagaggc aaggaccctc aaagaccctc aactccccta tgactcgtcc tacgcacgga 60 cttcgccctg gaaaattcca gctatagctc cgagaagaat gcagtccatg acgtgctgac 120 cacgggatgc tccagggagt gctgactgca attgttcccg accacctgcc aggttggaga 180 tgagtaacca gtcaaaaaca ggataccgat aagacgctga ttggggctga ttaacccaga 240 cccaagcctc atcgtcgccc cactctattt ttttgtttct acttccttgt ttctgtatgc 300 ttataaaacc tactaagaat ctaagtaaag cagaccttga caaccatttg cttggtctcg 360 tttctttctc tcgcccatct ctttcaggca cggtccccct cgcccccgcg agtaacagaa 420 ggtcccgcgg gccgggaca 439 // ID Sat-1_TSy repbase; DNA; PRI; 3236 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Satellite sequences from Tarsius syrichta. XX KW Satellite; Simple Repeat; Sat-1_TSy. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-3236 RA Bao W. and Jurka J.; RT "Satellite sequences from tarsier."; RL Repbase Reports 11(5), 1743-1743 (2011). XX DR [1] (Consensus) XX CC The repeat unit length is ~3-kb. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 3236 BP; 1129 A; 634 C; 576 G; 895 T; 2 other; aaataattaa ataaataaat aaatgaaagc tgtaaaacat gcaactgact tcatttttta 60 tttttttatt tttttttact ataaaaatgt ttataattca gcgttgtgta tagaaatgtt 120 cattatgcag ggattgattt attaagaagt gtataatact ctaagataac tccttttcct 180 tttaatacgt tagtatacat cgtatacttc tgccttcaaa attgacaact acagtgtttc 240 tcatataaat tctgcatgtt ttattctgcc taaattgaaa cttgttgaga ttccagatct 300 atatcattct tagatgattg aaccttgaga ttcagtgtac ctaagtgcta ttcagaagac 360 agaatgcata ataaatctta aaaattttag aaaataaacc agtattctaa tttaatacag 420 ctttttccat tcttaaaatt tgtgtacatt ttcttcgtgc actctgtatt ttctgcttgt 480 taactttgat taaatgttac agatgtgcaa tccttagaga aaagtttccc ttatatattt 540 gaaggaaata taaaatcaca ttccctttat attttaaacc aatttatatt tttaaaaggg 600 tatttccwtt gaatccccta gtattttctc tacctaagac ataatgctgt actctgcaac 660 aaagtatctg attaatctga tacttttaat caccaagctt ttaaaatcag gcacacagaa 720 aaaagagaat ttggggccca atatatataa accaaaataa catggagatt tacaatgtat 780 taagtatttg ggctccgcat tagaataaaa atttagataa ttgaaaattc tggaaaattg 840 acgaagatct tagctaaagc gatgttcaaa tcctgttaga aaaacggctc aaaggctgtg 900 acataatact aaaaacatcc aaaagtttct aaactaaaga aagtgctaaa atagtatgtg 960 gttgtgctat aaccaagctc caatagtatt cggtgaccat aataataaaa gactacatcg 1020 tcggccaccc cctcaggtcc catcccagtg tgggtgattc tctgtcctct caataaatac 1080 tgctttgcta acaaaaaaag actacctcgt ttcgtggatt cttgaatggt cacttatacc 1140 tggagaagtt tttgcatagg tgtgaacaca tagttttctt cacttcttgg gaatgaaatc 1200 caccatgttt accgcatttg gcagagcact gtacataagc ccttgaagcc taaggattgc 1260 tcagtagtca aggtccctcc cattatgatg tcaaagttct gaggctgatg tcataaatga 1320 caatttgaca tgcttatttc cctgacatta cttacctctt ttaatgtaaa aattagcctt 1380 ttagtcaatc tactgattca atgaaataac cattaagtga acatcattct tctcagacct 1440 ggaaattaca ataaagaatt catatgattc gggctcggtg gctcagcctg taatcccgcg 1500 cctttgggag gctgaggtga gtggattgcc tgagcccgcg ggttcgagac ccgcctgggc 1560 agcgagacct catctctaca ataaatcaaa aaatcagccg ggcgtggtag cgcatgcctg 1620 tccttcaggc tacttggaag gctgaggcgg 
aagcatcgcc ggagcccatc aggtcaagcc 1680 tgcggtggcc gggagcggcc actgcactcc attctgggcg acagagtggg actccaacac 1740 acacacacac acacacacac acacacacac acacacacac aagaattcac atgaaaccaa 1800 aaacaatcaa gaatagccaa aagaatctta tcaaaagaaa caaagcagga gctatcactg 1860 ttccagactt caaactttac tatatgttta cagtaatcaa cacagcctgg tattcgtaca 1920 agaataggtc cataggccaa tggatcagga cagagaatga aatcctcaaa ttctcaacca 1980 actcttcttt gaaaaaaact ccaagacccc tgggtaatgg agaacatatt cagtaaatgg 2040 tgctgggaaa aggctgacca catgcacaag gttgaaacag gacccctatc accatacaca 2100 ttaactctaa atggatcaaa gacctaaacg tacaacacca cattataata atcttagaaa 2160 acacaggaga caccattatg gaaactggaa ccggaaacta attcctcttc agaaccctaa 2220 atacccatgc cgtaataagt aagttagaca agtgggatct catcaaatta atatacttct 2280 gcaaaacaaa agaaaccatc agaagagcag ggatacagcc aacagagtgg gaaaaacgat 2340 tctccaacta tatatctgta aaaaagcata atatctagga tctacaagaa actcaaacac 2400 tcagaaaaat cagacaaaca ttccctttga aaagtgtaca aaatacgtga ttggacactt 2460 ttcataagaa gagatacggg cagccagcaa acacatkaaa attgctcagc ctcactcatc 2520 atcaaagaaa tgcaaatgag accactttga aataccacat accccctgta agactggcca 2580 tcactaataa aacaaacagt aacagaggct ggtaaggatt tgggaaaaaa taaatgcttc 2640 tacactgcat gtggactgaa aagtagagca acatctttgg aaaagagtgt ggtggtttct 2700 aaagggacaa gatattgacc taccatgtga ctcagtaatt ctcctgtaga gaatatacct 2760 ggaggaactc aaatcatttt acagaaaaga tacctgcaca agtatgtttt ttgcagctct 2820 aatcacaatt gcaaacaaca tggtaccaac catgttgccc atcaaagcag gaatggataa 2880 aataaatgtg gtatatacac agtggaatat tatgcagcca taaaaatgga cagatataag 2940 gattttgtag aacgtttgat ggatatgagg taggtcgttt tcagtgattt atcacagaaa 3000 caaacaacag agttccatat gttctcactc atgagctgaa cttgatcact tatagtgctg 3060 taagaaagtg attggtagtc gtggaaaact gtttaggggg aggggtgaga ttggcagtgg 3120 agggatactg ctaaggaagc gaggggcata cctcatcaac cagggaccct gcataatcaa 3180 tgtttgtata tctaactctg aactgtaccc cacgtttttt aaaaaattaa aataaa 3236 // ID LTR1D1 repbase; DNA; PRI; 899 BP. XX AC . XX DT 01-MAR-2009 (Rel. 14.06, Created) DT 18-JUN-2009 (Rel. 
14.06, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; LTR1D1. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-899 RA Smit A.F.; RT "LTR1D1 - ERV1 Endogenous Retrovirus from primates."; RL Repbase Reports 9(6), 1175-1175 (2009). XX DR [1] (Consensus) XX CC 10.5% subst; 50 copies. Consensus oscillates between two CC apparent subs, but coseg can not separate these. XX SQ Sequence 899 BP; 171 A; 313 C; 267 G; 143 T; 5 other; tgatagggac aggaggcagg gaaattctgg gcagaagagg gcgggtcccc ggcgagggcc 60 ccaccctcaa gccaaagcct ggaaccgcgg cccaaagtga gaacntacat ccctgttttc 120 ccgctcgaat gttgcctttt ccaaaaccac ccatggcccg ccccgccccc catcctgtgc 180 ccataaaaac cccaggctcc gccggcagag aggagaagca gctggacgtc ggagactacg 240 gttggacgtc ggagagaagc agcttgactt cagagggacg gcttgacggc gtngcttcgg 300 agaggagtcc ggccggggac ggccggactc cgggggaaga tcaccttccc gctccatccc 360 ctttccagct ccccttcccg ctgagagcca ctttcatcgg caataaaatc ccccgcattc 420 accacccttc aattcgttcg tgcgacctga tttctcctgg acgccggaca agagctcggg 480 tgccacgggt gcggatgcna aaggctgtca cactgaccct ctgccctcgc tggcggagag 540 caaccgcctc acgcgaaaag gcagagggcc cactgagctg tttaacactt aagccgtccg 600 cggacggcaa agctaaaaga gcactgtaac acnccctctg gggcttcggg ggtcgcgggc 660 acccccccct agacgctgcc gcggggcccg cacggagttt tgctcctgcc ggcgcccaaa 720 agcgctcgcc ccggctcctg cacccgctca cctgcgcgct ccctcccgcg aggggtngag 780 cgcagcgggt ccgagtgagt ggagttcgcc cctgccggcg ccgaagcggc cggctagctc 840 cagcgcccgt gcactccagt tcccgcccgc gaaggggtca gggaaaattt cctgcttca 899 // ID LTR7Y repbase; DNA; PRI; 472 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 01-OCT-2008 (Rel. 13.11, Last updated, Version 2) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from primates. XX KW ERV3; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; LTR7B; ERV1; LTR7Y_LTR; LTR7Y. XX NM LTR7Y. 
XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-472 RA Smit A.F.; RT "LTR7Y - ERV1 Endogenous Retrovirus from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX RN [2] RP 1-472 RA Jurka J.; RT "Classification changed to ERV3 based on 5bp TSD."; RL Direct Submission to RR (01-OCT-2008). XX DR [1] (Consensus) XX CC 2% div; 5 bp TSD, but HERV-H internal. Notice TG...AA like LTR7A. XX SQ Sequence 472 BP; 119 A; 153 C; 79 G; 121 T; 0 other; tgtcaggcct ctgagcccag gccaggccat cgcatcccct gtgacttgca cgtatacatc 60 cagatggcct gaagtaactg aagatccaca aaagaagtaa aaacagcctt aactgatgac 120 attccaccat tgtgatttgt tcctgcccca ccctaactga tcaatgtact ttgtaatctc 180 ccccaccctt aagaaggtac tttgtagtct cccccaccct taagaaggtt ctttgtaatt 240 ctccccaccc ttgagaatgt actttgtgag atccacccct gcccaccaga gaacaacccc 300 ctttgactgt aattttccat taccttccca aatcctataa aacggcccca cccctatctc 360 ccttcgctga ctctcttttc ggactcagcc cgcctgcacc caggtgaaat aaacagccat 420 gttgctcaca caaagcctgt ttggtggtct cttcacacgg acgcgcatga aa 472 // ID ERV2-2_TSy-LTR repbase; DNA; PRI; 815 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV2-2_TSy-LTR; ERV2-2_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-815 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1205-1205 (2010). XX DR [1] (Consensus) XX CC >95% identical to consensus. 6bp tsd. 
CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 815 BP; 239 A; 163 C; 179 G; 233 T; 1 other; tgttggaagc cggccaagga ggcccactga gcattagaaa tcaggggggc tgaagaccct 60 gagaccacta gaagtgtgtt cgggagccat gacttcatta cttctccctt ctgttggcat 120 gacagccttc tgttggcatg atagccttct gttagctcaa gcatgaaaac tccagacagg 180 gacgaacaca agctaacatt agaaacccgt tgttcccatt tccctgtctg tggtttaggc 240 ttttgtctgg tacaagccag gaccacatca aagagcccag gaactgagag cccagcatga 300 ccttttgatt gagacgcatt aagtagaaca ggttttagat attttgttta agttctattg 360 tttgtgatag gattaagcaa aggaatagat aaactgttga tcaataggaa agttattcat 420 aagtaaacta agtagaataa aataaaatat agattacaat tgataaaatg gtataatatt 480 aatagaaact gtaataatac ttctaaggga cattagtcta gcataggaac tggatagaca 540 agttgggttt gttctctacc ttgcttgaag aaatatctgg tgggtaggag acgcccccat 600 gcacctggat tgtctgtgta tgtgggagac atctgaatct gaatgagcaa aatctgtaga 660 taaaaaccct ctatactttt caataaacgg cttctgtcac ttgaacagta gtccagtcct 720 atttctttct cccccttctt tctaattctc tttacaggcg tggaatccgt ctcacctcca 780 ggaactstag accctgtggg agctggactc cggca 815 // ID LTR14B_Mim repbase; DNA; PRI; 439 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14B_Mim. XX NM LTR14B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-439 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2970-2970 (2009). XX DR [1] (Consensus) XX CC >96% identical to consensus. 4bp tsd. 
CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 439 BP; 89 A; 122 C; 123 G; 105 T; 0 other; tgtgaggggt tgagtcattg cacacctgca aggccccgcg cccccagtga agccggaagc 60 aacatccggg aaggctgacc gagacgtccg cttctggaaa ctgttgcttg cttgcctgcc 120 catggaaact gttgcttgct tgcttgcgcc aactttgcat tgttcgaacc cctgtatagt 180 ggggctacca gccaaccaat catgttaaag gtcaatgcat gatccacgcg tgatcagcca 240 atgcatagcg tatgcaccat agggataaaa ggcacgctgc aaccccggtc ggggtccttg 300 cctgcaagag tggccactgc gttggtgctc tggggcttgg accctggcta gccagaaaat 360 aaacctcctc ttgtgtgatt gcatcctcga tgtctctgct tttctgtccg gtggggctgt 420 ggaaggtcgg tccctaaca 439 // ID CERV1_INT repbase; DNA; PRI; 7337 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 10-OCT-2005 (Rel. 10.11, Last updated, Version 2) XX DE Chimpanzee endogenous retrovirus CERV1 - a consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; CERV1_INT; KW Internal sequence of chimpanzee endogenous; LTR; PtERV1. XX NM CERV1_INT. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-7337 RA Skaletsky H., Hughes F.J. and Page C.D.; RT "Consensus sequence of an endogenous retrovirus CERV1."; RL Repbase Reports 4(7), 189-189 (2004). XX RN [2] RP 1-7337 RA Yohn C.T., Jiang Z., McGrath S.D., Hayden K.E., Khaitovich P., RA Johnson M.E., Eichler M.Y., McPherson J.D. et al.; RT "Lineage-specific expansions of retroviral insertions within the RT genomes of African great apes but not humans and orangutans."; RL PLoS Biol 3(4), e110-e110 (2005). 
XX DR [1] (Consensus) XX CC Internal sequence of an endogenous retrovirus with ORFs CC for gag (961-2370), pol (3042-5624) and env (5618-7288). CERV1 CC is absent from the human genome. It is similar to an CC uncharacterized interspersed repeat in the Macaca mulatta and CC Papio anubis genomes. CERV1, first published in Repbase Reports, CC also exists under another name, PtERV1, reported by another CC group in 2005 (ref. 2). XX SQ Sequence 7337 BP; 1892 A; 2103 C; 1788 G; 1546 T; 8 other; tctgggggcc cgtccgggat tccccaagcc caccagaccc ctggtcaacg gatctgctag 60 gatcgatcta ctgataggtg agctggctcg tctccgtttg tctgtctgtg tctgttctga 120 atccgaatct gtgactcgcg aggtctgaaa ctggagctgg cacagtcctg gcggacgcgc 180 tataggacgg ccagcggaga ccggtgggag acgtcccctg gctctcatct gatctatatt 240 gcgatctgag ctgccccggt ttggcgggca gcagccgtat ctgagctgcc cggttgatcg 300 ggcagcatct gtctctgatc tgagctgccc cggttgcggc agcagcctat ctgactgccc 360 ggttgggcgg gcagcagctg tctctgatct gagctgcccc agcagctgtc tctgatctga 420 gctgccagta nctgtctctg atctgagctg ccccggtttg gcgggcagca gctgttcatc 480 tgagctgccc ngttnggcng gcagcanctg tctctgatct gagctgcccg gcgggcagca 540 gctgtctctg atctgagctg ccccggtttg gcgggcagca gctgtctctg atctgagctg 600 ccccagtgca gccgtatctg agggcagctg tctctgatct gagctgcccg gcgggcagca 660 gctgagctgc cccggtttgg cgggcagcag ctgtctctga tctgcccggt tnggcgggca 720 gcagctgtct ctatctgagc tgccccggtt tggcgggcag cagctgtctc tgatctgagc 780 tgccccggtt tggcgggggc tgccccagtg cgntcggctg ccccggggcg cagctgcctc 840 tgaccagggc tgccaaagtg cgcctgcgat tagatatcat taataagttc agatcagatt 900 tcctttacag ggattcaaac ccacattcct ttacaggaag agactacaaa ttattacagg 960 atgggtaata cccagagcac tcctctatct ctccttacga gtaatttcaa agaagttaga 1020 gcaaggggcc atgatcttgg tatagaaatc aggaaaggaa agctaattac tctgtgtcgc 1080 tccgaatggc ctgcctttga tgtggggtgg ccgcccgaag ggaccttccg acttgctgtc 1140 atcactaggg taaagtccaa gattttccta cctgggcgtg cgggccactt agatcaaatc 1200 ccatatatcc tcatatggca ggaccttgtt gagaacccgc ctccttggct gtcccctttc 1260 caattggcct ctgaaccctg taaggcacta gttgctcgac 
cactaaaatc caagcaacca 1320 actgcccccc ccatcctgtt ctacctgaca gcggggaccc actgttcaca gaaccccctc 1380 cgtacccctc cgggccccag gccccagccc ccctggctga gctgcgggag ggagcaggcg 1440 gacgggaggc ggccggcaca cacgggcccg ctgaaaggga aagtaacttt gaagggccgg 1500 cggggaggac gcgagggcgc acttcgcgga ctagcccccc ctcagccgcc tgactccacg 1560 gtggctttac cccttcggga aataggaccc ccagatgaca caggaatccc caggctccag 1620 tactggccat tctccaccag tgatctgtat aactggaaaa ctcagagtgc tcggttttca 1680 gacaacccca aagatttact ggctttacta gatagtgtca tgttcaccca ccagcccact 1740 tgggatgatt gtcagcagct cctccgaatt ttgttcacca cggaagagcg agagagaata 1800 cagatagaag ctagaaagct ggtcccgggg gacgacggtc aaccgactgc caaccccgac 1860 ctcataaacg caacctttcc tctgaccagg ccggcgtggg actacaacac ggcagaaggt 1920 aggggacggc tacaccttta tcgccagact ctaatggcag gtctccgggc agctgctcgc 1980 aagcccacta atttggctaa agtatattct attctgcagg gaaagacaga gagcccagct 2040 acctacttag aaagattaat ggaagctttt agacagtaca cccccataga tccagaggct 2100 ccaggaagtc aggcagctgt tgtaatgtct ttcgtaaatc aggcagcccc agatattaag 2160 agaaaactcc agaaattaga agacttggag ggaaagcgga ttcaggacct ccttcagata 2220 gcccagcggg tttacaataa cagagatact ccagaggaaa agcaatttaa ggccactgaa 2280 aaaatgacca aggtcctggc agcagtggta cagaaagagc atctacagcc agagtacacc 2340 caacctaggc ggccccccgg catgataatc tgagcaaaga ccaatgtgcc tattgtaagg 2400 gggctggcca ctaggtaaga gactgcccca aaaagaaacc acgaggacag ggacccgcct 2460 aggtctacac ccgtactagt cactcaagac gaagactagg gaagacgggg ttcggacccc 2520 ctccccgaac ctagggtaac tttgcaagtg gaggggtccc cagtccagtt cttggtcgat 2580 acnggagcac agcactcggt cttagttaaa actaatggga aattatcctc caaatcctcg 2640 tgggtacaag gggccacagg agttaagaaa tacccatgga caacacaaag aacagtaaac 2700 ctcggagcca agaatgtaac ccattctttc ctggtcatcc ctgagagccc ctgtccccta 2760 ttggggagag acctgctaac taaaatggag cacagatcca tttcctccct gaggggcccg 2820 tcgtgaccaa ctcccacaat cgaccgtgtc ctcctgacta taaacctaga agatgagtac 2880 cggctccacc aggagaaagc ggcccctgac caggacatag caactggctc cagcatatcc 2940 agaagcgtgg gcggaaacgg ggggcttagg tctagcaaaa cacctcctgc 
cttatttatt 3000 gaacttaagc ctggacagac ccctgcggta cgccataccc gatgccccta gagccaagaa 3060 gttcctccag cccccacccg agagacacaa gaagacagac accctactca catcaatggt 3120 acgttactct gaccgaggaa gaacctaacc gccaactgct taagggaggg cagcgctggc 3180 taacagacgc ccggaaacaa actgttctgc agatccccag gccacaatcc acccgacaag 3240 tgagagaatt cctggggtcg gcaggatttt gcagactatg gatacctggg ttcgcagaac 3300 tggctaaacc cttgtatcag gcaacacggg ggcaacagcc atttaattgg acagacgaag 3360 ccgagttggc cttccaacag attaaaaccg ccctactctc cgcgcctgca ctaggactac 3420 ctgatgttac caagcccttc cacttatacg tggatgagaa taagggtgtc gccaaggcgg 3480 taataactca gaacttaggc ccctggcgga ggccagttgc ctacctgtca aagaagttag 3540 acccagtagc tgccgggtgg cccccttgtc tccgaatgat tgcggccacg gctctgatgg 3600 tgcaagatgc tgataaactt gtcatggggc aagaattgcg ggtcgttact ccacatgcca 3660 tcgaaggtgt actcaaacag ccacctaatc gatggatgag taacgcccgg ctcacccact 3720 accaaggact actactaaat cctctcagga taattttcct gcccccaacg accttaaacc 3780 ctgcctcgct gctgcccaac ccggacctgg acgccccact ccatgactgc accaagatac 3840 tagctcaggt gcacggagtt cgagaagacc tgcaggaccg cccacttcct gacgccgacc 3900 tcgtctggtt cactgatggg agcagcttca tgcatcaagg ccagaggtac gctggagcgg 3960 cagtaacttc agagactgag gtaatctggg cggaacccct gcccccgggg acatcggccc 4020 agaaggccga actgatagcg ctcacccaag ctcttacctt aggggcgggg aaaaagctga 4080 cagtatatac agacagccga tatgcttttg caacggcgca tatacatggg gccatttaca 4140 gggagcgagg gttactgacg gctgaaggaa aagagataaa aaacaagcaa gagatcctag 4200 ccctgctaac agccctatgg aggccagaaa aattagccat tgtacattgc ccagggcatc 4260 agaaactaac tactccaact gctcaaggca actttctggc agaccaaact gcaaggaatg 4320 tggcgaaggc tcccagccaa ctccttgcac tccagctccc tgacccgggc ccccgggact 4380 tgccatattt ccctgaatat tcagaacaag atctccagtg gattgacaaa cttcccctga 4440 aacaaatcca gaatgggtgg tggactgata ctaatgacca aaccatccta ccagaaaaat 4500 taggacaaca ggtgttagaa cacatccacc gaaccaccca cctgggggcc cggcggatga 4560 tagacctgat cagacgctcc aagctcaaaa tcagacatat agctgagacg gccagcagta 4620 tcgtgacaag ttgcaaagtc tgccagctta acaacgcata cccccaatct caagctgcaa 
4680 caggaacaag gctcagggga accaggcccg gtatctactg ggaagtagat tttactgaaa 4740 taaagccagg aaagtacggg taccggtact tacttgtctt tgtagatact ttttcagggt 4800 ggactgaagc attcccaacc aaaagagaaa ctgctcaggt cgtagcaaag aaaattctgg 4860 aagatatcct tcccaggtat ggcttcccca tccagatagg gtcagataat gggcccgctt 4920 tcgtcgctaa ggtaagtcag gacttggctt ccatccttgg ggcaaattgg aaactacatt 4980 gcgcttacag gccccagagt tcaggacagg tagaaaggat gaatcggacc ttaaaagaga 5040 ccttaactaa attgactata gagactggcg ctaattgggt agtccttctc ccctatgctc 5100 tgttccgggc ccgtaatacc ccttacaaac tgggcctcac cccttacgaa atcatgtatg 5160 gcagacctcc acccctggtt cctagcttaa aagatgacct gcttaagtct gaaacagaaa 5220 atgtctctga attcttattt tccttacaag ccttacagaa aattcaccaa gaaatctggc 5280 ccaagctgaa agagctatat gagaccagtc ccccaccgac accccatccg taccagccgg 5340 gagactgggt cctggttaag cgacaccgac aagagaccct agagcccagg tggaaaggac 5400 cactccaagt actcctgacc acacccaccg ccctgaaggt agaaggcatt gcgtcgtgga 5460 tccactacac ccacgtcaag ccagtggacc caacctccga ccttctgggg ccaatcacgg 5520 cggcggcggc tgaagcaccg gacacgtgga ctgtggacag agctaagaac aaccccttaa 5580 aactcaccct gcgccggcag cataactcac tgcaaacatg cagttaggta gtctaactct 5640 aacattagtc gccctagtgg ccgctgggga aaacataaag ccagctccta atccctttgt 5700 ctggagattc tggctttatg aaaaccaaac ccaccctggg caacctcata agcccgggaa 5760 attagtggcc agtgcagatt gcccctcctc agggtgcaat agcccaattt tactaaattt 5820 taccgatttc ccagtagcca aaccagtggc accaataata tgcttcgagt atgatcagac 5880 tgaatacaat tgtaagcact attggtggca ccaaagtgcc ggctgccctt ataactattg 5940 taacatccat aaataccaat ggtggggtgg agaagaacag atagatccca gatggccctt 6000 ccatcgcaga cgagatagag acctttcata tacatggata gttagagacc cctggaactc 6060 ccgctggacc acgcctcaac acggggctgt atactactcc tccgcctcca catggcctag 6120 cagtcacctc tatctgtggc ggggtctagt gcaggtacgg cccctggtcc atggaaatat 6180 ccagcgacaa gaaaaccgcc tgacacaaga tttacgtcct ttttcctggt taaaattatt 6240 gcaagaagga ttagaacttg ccaaccttac aggacttcac agcctgtctg gctgctttct 6300 atgtgccact ctagggcgtc caccgctaac cgctgtcccc ctgccatggg gatcatccac 6360 
ctctgcccaa gctaacaacc accaaaacct ctcatatgcc cctatcccta acgtgccact 6420 atacctaaac cccagtcaag agaagtttcc ctactgtttc tcaggaacta attccagcct 6480 ctgcaacatc actgcaacgc cccctaacat caccttaagg gctccgtcag gcatattctt 6540 ctggtgtaat ggaacattat ctaaaaacct atcaagcccc tctgttacca acctactgtg 6600 tcttcctgtc acattagttc cccggttaac tctacttact gccggcgagt tcctagggta 6660 taccggtaac tggactagtg ctgttattca cccagaccct agaccgagac ctgcacgagc 6720 catatttctc cccctcattg caggaatctc cctcaccgca tccttcatgg cggccggact 6780 ggctggggga gccctaggtc acacccttat agaaagtaac aagctgtacc aacaatttgc 6840 cgttgctatg gaggagtcag ctgagtccct tgcctccctc cagcggcagc tcacgtccct 6900 agcacaggta accttgcaga accggagggc cttagaccta ctcactgctg aaaaaggggg 6960 aacgtgtatg tttctaaagg aagactgttg tttctacata aatgaatcag gactcgtgga 7020 agaccgagtc caacagttac gcaagttaag cacagaagta agaacacggc agtttgcttc 7080 agctgcagac caatggtgga actcatctat gttttctctg ttagccccct tccttggacc 7140 cctgctgagt ctactatttc tgcttaccgt aggaccttgt gttgttaaca gaattttgcg 7200 gttcgttaaa gaaaggttta acactgtaca actcatggtc ctcagagccc aataccaacc 7260 tgtaaacgct gaaacagaat cagacttata agacccaaga ttggctctaa aaaatacctg 7320 aaaagaaagg gggggaa 7337 // ID ERV2-1_CJ-LTR repbase; DNA; PRI; 315 BP. XX AC . XX DT 05-DEC-2009 (Rel. 14.11, Created) DT 05-DEC-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR7_Cja; KW ERV2-1_CJ-LTR. XX NM L1A_Mim; LTR6_MD; LTR86_MD. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-315 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2921-2921 (2009). XX DR [1] (Consensus) XX CC ~95% identical to consensus. 6bp tsd. Renamed after adding the CC internal portion. 
CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. XX SQ Sequence 315 BP; 51 A; 95 C; 67 G; 102 T; 0 other; tgtagggagc agtggccatc ttgagcgtca tgatagccgt cttgtgatct tgttgctcaa 60 gcttctgctt cccttatcta agtttcacgt tcttcttagt cacgaggcca gcctgcagcc 120 tcgtctccat ggttactttc tgacgtcaga actcagagct tcccgccttt gcttgcttat 180 aagcagctgc tttgtaccta ataaacgaga cttgatcaga tctttgactt gtctccattc 240 ttcgcgtctc ctgtctctcc cttaatcccc actccctccc tagggtctgc gttgctcgtc 300 ccgcgggtcg ggaca 315 // ID LTR9_Mim repbase; DNA; PRI; 387 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR9_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-387 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2964-2964 (2009). XX DR [1] (Consensus) XX CC >91% identical to consensus. 4bp tsd. CC Similarity to LTR9_Vpa from alpaca. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 387 BP; 87 A; 73 C; 123 G; 104 T; 0 other; tgttacagca agtgttacag cgggtggtcc cgaggacaaa ccgagcggca cacggagagt 60 cggagaatca aagtttattc gccagcgggc tcagaggggc ctcgccacca aattctgagc 120 accatctacc cgatttttcc cacttttatt aagttggggt ggtccgaggg gtggggtatc 180 aggtgtaggt tagttgatgt gtcaggtaat aggttaatat tatgttgatg cgccaggcaa 240 tagatttagt gttatgctga tgccaggcaa tagactagtt gatgggccag gtaacaggtt 300 agtattatgc tgatgtgggt cttccgtgag gggtgggggt ctccggtgcc tgatgcgcca 360 agtaatagat gggggtttcc cttatca 387 // ID LTR14C2_Mim repbase; DNA; PRI; 397 BP. XX AC . 
XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14C2_Mim. XX NM LTR14C2_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-397 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2973-2973 (2009). XX DR [1] (Consensus) XX CC >94% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 397 BP; 93 A; 89 C; 110 G; 105 T; 0 other; tgtgaagcct caggctaatt gctttctttt agcttatgta tattgcttgt ttttaggaag 60 ttgggttgag cgggctgatc gcagtaaatt ttgggatgaa acaaaagggc aagcgcatgg 120 gcacaagggg ccaaccaatc aatgtaaagg gcaagcgcgt gggcacaagg ggccaaccaa 180 tcaatgtaaa ggtcaagcgt gtggacaggt tatgcaccta gggtataaag ggctccgtcc 240 cacagtgcgc ggggtctttg tcccaataga ggccgccgta tcggtgctct gggacttgga 300 ccctagctcg agctagtcaa taaaactcct tttgatgatt tcagcctcag tgaccctgtc 360 tctttgttct gtggtcctac ggtttcccgc tctaaca 397 // ID L1Pt_5end repbase; DNA; PRI; 2137 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE L1 Non-LTR Retrotransposon from Pan troglodytes. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1Pt_5end. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. 
XX RN [1] RP 1-2137 RA Smit A.F.; RT "L1Pt_5end - L1 Non-LTR Retrotransposon from Pan troglodytes."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX SQ Sequence 2137 BP; 733 A; 512 C; 532 G; 360 T; 0 other; gggggaggag ccaagatggc cgaataggaa cagctccggt ctacagctcc cagcgtgagc 60 gacgcagaag acgggtgatt tctgcatttc catctgaggt accgggttca tctcactagg 120 gagtgccaga cagtgggcgc aggccagtgg gtgcgcgcac cgtgcgcgag ccgaagcagg 180 gcgaggcatt gcctcacctg ggaagcgcaa ggggtcaggg agttcccttt ccgagtcaaa 240 gaaaggggtg acggacgcac ctggaaaact cgggtcactc ccacccgaat attgcgcgtt 300 tcagaccggc ttaaaaaacg gcgcaccacg agactatatc ccacacctgg ctcggagggt 360 cctacgccct cggaatctcg ctgattgcta gcacagcagt ctgagatcaa actgcaaggc 420 ggcagcgagg ctgggggagg ggcgcccgcc attgcccagg cttgcttagg taaacaaagc 480 agccgggaag ctcgaactgg gtggagccca ccacagctca aggaggcctg cctgcctctg 540 taggctccac ctctgggggc agggcacaga caaacaaaaa gacagcagta acctctgcag 600 acttaaatgt ccctgtctga cagctttgaa gagagcagtg gttctcccag cacgcagctg 660 gagatctgag aacgggcaga ctgcctcctc aagtgggtcc ctgacccctg acccccgagc 720 agcctaactg ggaggcaccc cccagcaggg gcacactgac acctcacacg gcagggtatt 780 ccaacagacc tgcagctgag ggtcctgtct gttagaagga aaactaacaa acagaaagga 840 catccacacc gaaaacccat ctgtacatca ccatcatcaa agaccaaaag tagataaaac 900 cacaaagatg gggaaaaaac agaacagaaa aacaggaaac tctaaaatgc agagcgcctc 960 tcctcctcca aaggaacgca gttcctcacc agcaacggaa caaagctgga tggagaatga 1020 ttttgacgag ctgagagaag aaggcttcag acgatcaaat tactctgagc tacgggagga 1080 cattcaaacc aaaggcaaag aagttgaaaa ctttgaaaaa aatttagaag aatgtataac 1140 tagaataacc aatacagaga agtgcttaaa ggagctgatg gagctgaaaa ccaaggctcg 1200 agaactacgt gaagaatgca gaagcctcag gagccgatgc gatcaactgg aagaaagggt 1260 atcagcaatg gaagatgaaa tgaatgaaac gaagcgagaa gggaagtcta gagaaaaaag 1320 aataaaaaga aatgagcaaa gcctccaaga aatatgggac tatgtgaaaa gaccaaatct 1380 acgtctgatt ggtgtacctg aaagtgatgc ggagaatgaa accaagttgg aaaacactct 1440 gcaggatatt atccaggaga acttccccaa tctagcaagg caggccaacg ttcagattca 1500 
ggaaatacag agaacgccac aaagatactc ctcgagaaga gcaactccaa gacacataat 1560 tgtcagattc accaaagttg aaatgaagga aaaaatgtta agggcagcca gacagaaagg 1620 tcgggttacc ctcaaaggga agcccatcag actaacagcg gatctctcgg cagaaaccct 1680 acaagccaga agagagtggg ggccaatatt caacattctt aaagaaaaga attttcaacc 1740 cagaatttca tatccagcca aactaagctt cataagtgaa ggagaaataa aatactttac 1800 agacaagcaa atgctgaccg attttgtcac caccaggcct gccctaaaag agctcctgaa 1860 ggaagcgcta aacatggaaa ggaacaaccg gtaccagccg ctgcaaaatc atgccaaaat 1920 gtaaagacca tcaagactag gaagaaactg catcaactaa cgagcaaaat caccagctaa 1980 catcataatg acaggatcaa attcacacat aacaatatta actttaaatg taaatggact 2040 aaattctcca attaaaagac acagactggc aagttggata aagagtcaag acccatcagt 2100 gtgctgtatt caggaaaccc atctcacgtg cagagac 2137 // ID GSATX repbase; DNA; PRI; 218 BP. XX AC X87951; XX DT 18-APR-1997 (Rel. 2.03, Created) DT 04-AUG-2008 (Rel. 13.07, Last updated, Version 3) XX DE H.sapiens gamma X satellite DNA, an X chromosome specific DE centromeric sequence. XX KW SAT; Satellite; Simple Repeat; Centromeric satellite DNA; GSATX; KW Centromeric; tandem repeat. XX NM GSATX. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RA Lee C., Li X., Jabs W.E., Court D. and Lin C.C.; RT "Human gamma X satellite DNA: an X chromosome specific RT centromeric DNA sequence."; RL Chromosoma 104(2), 103-112 (1995). XX RN [2] RA Lee C.; RT "GSATX."; RL Direct Submission to GenBank (13-JUN-1995) C. Lee, University of RL Alberta, Dept of Laboratory Medicine & Pathology, 6-59 Heritage RL Medical Research Building, Edmonton, Alberta T6G 2S2, CANADA. XX RN [3] RP 1-218 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR GenBank; X87951; Positions 1 1205. XX CC CC [3]. 
XX SQ Sequence 218 BP; 35 A; 69 C; 90 G; 24 T; 0 other; gccggggtcc tccgccggag gtcagtgcct tcccggcagc ccctgcgccg ggcccggggg 60 ggtcgtggag tccctggctt gcacccaggg tgcgtgtctc tcccacgggg ggcaccccaa 120 agcggcaaga agtcccccgg gggacgggga caggacgcca ggctttcagg gggacgttga 180 ggcagcccgg ggaaaaaagc ggcgaggccg aagaggag 218 // ID LTR18_Cja repbase; DNA; PRI; 760 BP. XX AC . XX DT 12-JAN-2011 (Rel. 16.02, Created) DT 12-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR18_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-760 RA Jurka J.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 11(2), 791-791 (2011). XX DR [1] (Consensus) XX CC >98% identical to consensus. 6bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. CC We thank the Marmoset Genome Sequencing Consortium for making CC their data publicly available. 
XX SQ Sequence 760 BP; 175 A; 187 C; 152 G; 246 T; 0 other; tgtggggatc gctcctgcga agggcccttg acaggaccct cgccatcttg agcttgagcc 60 catacacgtt tctcctctcc tggaggctgt tgagctaagg gaaaggagga agtctagttg 120 cactgatttt ataaacatca atctttgcct ctgacatttg caacagaatg gctgagctca 180 ggccttgtgg tctaatgaga cattgtatgg gtagacattg tgtgcctcct agttccttct 240 ctgcacaata actgtatgac ttcatttacc tctgctaact caatcatatt gtttgcatat 300 gatcctatat gaataggcaa ggttagaatg acatttagaa aggtccagct ataccttgtt 360 aacaatctta cttgcttgct taactgattt tagaaaaaca catgtttaga tttatgatcc 420 cctttttgca agctttattg tccctttgct gtacaccctt tgattaagag tcatactttc 480 tgtacctctt ttcttgtgat ttttgttctt gattttattt caagatacaa tggaatgtac 540 gtcactcctc aaacactgaa aggtatataa cctgtagttt gcacaaataa acttgctgtc 600 ttggcacttc ttgctccggc attgcctggc agtcctccag accccgctct ctatctatct 660 tttcttccgc gcactccctc ttccaggtta agaacacttg agtggggctg aggctctccg 720 ctggtcggaa gccccccggc agacgtgccg gacccccaca 760 // ID BSRb repbase; DNA; PRI; 152 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from primates. XX KW Satellite; Simple Repeat; BSRb. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-152 RA Smit A.F.; RT "BSRb - Satellite from primates."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX SQ Sequence 152 BP; 40 A; 28 C; 44 G; 40 T; 0 other; gctgggttca gcaatatgtc acaatttctt ctgtggggca ggttcaggca gaagagaaga 60 gtcacattac ctaggtgctg ggttcagcaa tatgtcacaa tttcttctgt ggggcaggtt 120 caggcagaag agaagagtca cattacctag gt 152 // ID hAT-2_MM repbase; DNA; PRI; 2169 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.1, Created) DT 21-APR-2009 (Rel. 
14.1, Last updated, Version 1) XX DE hAT-2_MM is a family of autonomous DNA elements found in DE Microcebus murinus as well as in Anolis carolinensis, Tarsius DE syrichta, Myotis lucifugus, Monodelphis domestica, Otolemur DE garnettii, Echinops telfairi, Xenopus tropicalis and Schmidtea DE mediterranea. Less than ten elements exist in the genome at DE 2169bp in length. XX KW hAT; DNA transposon; Transposable Element; hAT-2_MM. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-2169 RA Novick P.A., Smith J., Ray D. and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 153..1958 FT /product="hAT-2_MM_1p" FT /translation="MISRKRKIELECRIFKEQWIYDYFFMQYKERAXCFIC FT QNIVSVFKEYNLRXHXQTQHKDKYDCLVREVRKDXILKLKNTLTTQPNTFV FT KQXQLNXSSLRASFQVAKLMACTGRPSVEGEFVTECLLSXAEEMCPEKADL FT FSAVSLSGPTITRRIEEMGDNLHQHLQNSTKKLSYFSLALDESNDVCDSAQ FT LLIFIRGTNDYFEVTEELAALHSIKGTTTGEDTYEKVFQIVNGLELDWAKL FT ASVTTDGAPSMVGSKKGVIARINQEMDKHDHSHPIAIHRLVHQQALCSKSL FT KWDSVMKIVVSCVNFIGANALNHRQFQEFLSELNVAYEDVLYHTEVRWLSR FT GRVLKHFYDLLPQITAFLLSKNKEVPELNGAEWKWHLAFLTDVTELLNSFN FT VQLQGKGKLICDTQSHVKAFEVKLGLLIKQVKEENFCHLPTTQNLLAEKPL FT VAFPNKTXVDSLEKLQKEFQFRFKELHLHEQDIQLFRNPFSIDIENVDTIY FT QMELAELQNCDSLKXSFKSSSLPNFYASLPSETYPNLRNHALKVATIFGST FT YVCEQTFSRMKHLKSPTRPRLTDAHLHHLLRLAVTNMEPDIDHLISQKQAH FT SSH*" XX SQ Sequence 2169 BP; 671 A; 419 C; 439 G; 627 T; 13 other; tgcattgatg ggctctgatc agactgcatc catgggcagt gcagtcthta tgtctctgtg 60 tgggcaaagt tattgctggt atattgtttt tgtagcgact gtatabatat attggtattt 120 tactaatagc aatttggaat ccctaggaaa caatgatatc aagaaagaga aaaattgagt 180 tggagtgtag gatattcaaa gaacagtgga tttatgatta ctttttcatg cagtacaagg 240 aaagagctdt gtgttttata tgccagaata tagtgtctgt gttcaaagaa tacaatttgc 
300 gtcvacacta dcaaactcaa cataaagata aatatgattg tttggtcaga gaagtgagaa 360 aagataavat attaaaactg aaaaatacat tgacaactca gccaaatact tttgtgaagc 420 agvagcagct aaatatbtca tcactgcgag caagttttca agttgccaag ctaatggcgt 480 gcactggcag accgtccgtg gagggagaat ttgttacaga atgccttctt tctvttgccg 540 aagagatgtg tccagagaag gccgatttat ttagtgcagt gagtctttca ggacctacaa 600 ttacacgaag gattgaagaa atgggagaca atttgcatca gcatttgcaa aactccacaa 660 aaaaactttc ctatttttcc ttggcactcg atgaaagtaa tgatgtttgt gattctgcac 720 aacttctaat ttttattcgt gggacaaatg actatttcga agtcacagaa gagcttgctg 780 cactgcacag catcaaagga acaactacag gagaggatac ctatgaaaag gttttccaaa 840 ttgtgaatgg tttggagctg gactgggcta aactagccag tgtgacaact gatggtgctc 900 ctagcatggt ggggtctaag aaaggagtaa ttgctcgcat taaccaagag atggacaaac 960 atgaccattc tcatccaata gccatacacc gcctcgtcca ccaacaagcg ctgtgtagta 1020 aatcactgaa gtgggactct gttatgaaaa ttgtggtatc ttgtgttaac ttcattggag 1080 ctaatgcact aaaccacaga caatttcagg aatttctgtc tgagctaaat gttgcctatg 1140 aagatgttct gtaccacaca gaagtccgtt ggctgagtcg agggagagtt ttgaaacatt 1200 tctatgactt acttccacag attacagctt ttctgctttc aaaaaacaaa gaagtaccag 1260 agctcaatgg tgcagaatgg aagtggcacc ttgcctttct gacagacgta acagagctac 1320 tcaacagttt caatgtgcaa cttcaaggaa aggggaagct catctgtgat acgcaatcac 1380 atgtgaaagc atttgaagta aaattaggcc tcctcatcaa acaagtgaag gaggaaaatt 1440 tctgccatct ccccacaact caaaatctgt tagcggaaaa accattggtt gcattcccaa 1500 acaaaacatv tgtggattca ctggaaaagt tgcaaaagga gttccaattt agatttaaag 1560 agcttcatct ccatgaacag gacatacagc ttttccgtaa cccattttct attgacattg 1620 aaaatgtgga tacaatttac caaatggaac tggctgaact gcagaattgt gactctctga 1680 aggnctcatt caagtcaagc agccttccta atttctatgc atctctcccc tctgagactt 1740 atcctaatct caggaaccat gcactcaaag tggcaaccat ctttggcagc acttatgtct 1800 gtgaacagac tttttccaga atgaaacatc tgaaatctcc aaccagacct agactaactg 1860 atgcacactt gcaccacttg ttacgactag cagtgacaaa tatggaaccg gacatcgacc 1920 atctcattag ccaaaagcag gcccatagtt cccattgaaa tactggtaag tttgttgatt 1980 taactttact 
tgttcttcat tttaaatatt gtatttgttc ccattttttt tttacttcaa 2040 aataagatat gtgcagtgtg cataggaatt tgttcatagt tttttttttt ttttaactat 2100 agtccdgccc tccaanggtc tgagggacag tgaactggcc ccctggttta aaaagtttga 2160 ggacccctg 2169 // ID LTR10_OG repbase; DNA; PRI; 442 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.11, Created) DT 18-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR10_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-442 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2855-2855 (2009). XX DR [1] (Consensus) XX CC ~88% identical to consensus. 6 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 442 BP; 122 A; 93 C; 108 G; 119 T; 0 other; tgtaggatag agcatagggt taaaagtgct aggcactagg gacagagtaa tctgggaaaa 60 agccaagaat cactgctggc catgagtact gaaaacaata ctcgctagac acagcatact 120 gtcccagccc taagtcatat attagcagtg ataagattaa ggcaagacag aagtgcagta 180 aacaggatgt gttaagagtt caatcatgca gctaaagata aactacttgt tttttaacct 240 ttactctctc tttgaagctt tccgctgtgc tgtattttct catgcctgag atttcaataa 300 aagccagtga ggaacgttgg agggggccgc acccttgtct ctccttgcgg agtgcttgct 360 ggtgttggcc tccccgggtc cctcagttca atgcaaatgg actgagtagg tgttattcat 420 tttgcgtcga gtcgcaccga ca 442 // ID LTR3_Mim repbase; DNA; PRI; 327 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR3_Mim. 
XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-327 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2952-2952 (2009). XX DR [1] (Consensus) XX CC >99% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 327 BP; 74 A; 87 C; 83 G; 83 T; 0 other; tgtggggcgc ggtgacaacg ccattacaag atggcgccga tttccggtgg cccgcctgta 60 gtaaacaagt tcagcgcatg cgcagagtaa gtcctggcct gttctataag tatgcatatg 120 aaggatctta gcttggccta tcagtaataa ccctcgcgcg cttttaacct atccccatca 180 cttccctcct tccttagagt atataagtgt gtgagagctt gtgctcaggg tcttccgcat 240 catgtaagtc taaagggaac cccattaaag cactgtcaga agaactccag ttgccgcgtc 300 ttccttgctg gcgaggcggg cgcgaca 327 // ID LTR23_OG repbase; DNA; PRI; 641 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR23_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-641 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1595-1595 (2011). XX DR [1] (Consensus) XX CC ~83% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 641 BP; 160 A; 179 C; 157 G; 145 T; 0 other; tgttaggtag atagttagga atccggctct cctagggaaa aaggacaagg gcggggcccc 60 catctcggtg ccgcccacct taatcctgtc ccgagcgatt gctcacacct ggggaggagg 120 gaacgcccta ccaatctacc aatcagaact tagcgtgcca actgcaaaag gatgcagagg 180 actctctagc ccacgggcca agcagcacag gtacaataga gtctggaagg tgacctagaa 240 cagccttaag gctaattatc atgtcattag cttaaatcta cattgcggtc cacacccccc 300 caatgggctt tcagagaggc tcatgggagg tccgcatgcg cagtaagact gtggttccaa 360 ggtgacctct gaaccacgag gtgggcccaa tccaggcagt ataaaacaga accccggcca 420 taaccaaggc cttttcttgc cgcgacaatg ggggcccgtt cgttgtctat ggcgaacgta 480 tcttgctctg taaacctgta tcttgctctt actctgtaaa cctatctttc tcttcctcaa 540 taaactttgt ttcacgcttg ccttactttg gggtgtctgg tcattcttcg gccaagagca 600 caccaagaac cgaggacgca gacagctgcc tggaacctac a 641 // ID LTR5_Cja repbase; DNA; PRI; 836 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR5_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-836 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2918-2918 (2009). XX DR [1] (Consensus) XX CC ~93% identical to consensus. 5bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. 
XX SQ Sequence 836 BP; 191 A; 298 C; 133 G; 214 T; 0 other; tgtaaggccg gttgccttgc cgggagtcaa acggacacat tcatcactct ccgcgcctga 60 taactgcgtc atcacccgag ctcattatct ggcttcccca caaagcttcc ccgcacccgc 120 ctctgcatac catgcccaag cccacactca gcactaaatt atgcatccaa cctccagccc 180 gctgttaaaa aaccctgcag agaaattttt taaagaagcc ccaccgttcc ctgcccagca 240 acagcctgcc ccgtacgccc ccggcaggcc cgcgtaccca aggtcccgcc cacctaccaa 300 tggcagctag tttcatgtac ttcctacttt aactccatgt acttcctgct cctcaacagc 360 caatagaaat gccttctctt tgtttcccca gtagtcaatc aaggcccact tacccaaggt 420 cccgcccgcc taccaatggc agctagtttc atgtacttcc tactttaact ccatgtactt 480 cctgctcctc aacagccaat agaaatgcct tctctttgtt tccccagtag tcaatcaagg 540 cccacttacc caaggtcccg cccgcctacc aatggcagct agtttcatgt acttcctact 600 ttaactccat gtacttcctg ctcctcaaca gccaatagaa atgccttctc tttgtttccc 660 cagtaaccaa tcaattgcct gcactcccca taaaacccca cgccctgaac agccgggtgc 720 gacttctctg gcccctcttg gaccactgag gttgcccggg agctgaataa attggcttct 780 cacttttttt atacctgcct cagtttcctc attttacctc ggcaataaat cttaca 836 // ID npiggy3_Mm repbase; DNA; PRI; 276 BP. XX AC . XX DT 24-MAR-2010 (Rel. 15.09, Created) DT 24-MAR-2010 (Rel. 15.09, Last updated, Version 1) XX DE npiggy3_Mm is a nonautonomous piggyBac element found in DE Microcebus murinus whose peak activity was roughly 40 mya. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW npiggy3_Mm. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-276 RA Pagan H.J.T., Smith J.D., Hubley R.H. and Ray D.A.; RT "PiggyBac-ing on a Primate Genome: Novel Elements, Recent RT Activity and Horizontal Transfer."; RL Genome Biol. Evol 2, 293-303 (2010). 
XX DR [1] (Consensus) XX SQ Sequence 276 BP; 89 A; 52 C; 44 G; 90 T; 1 other; cacattaact gccatgtgag ttgtatttaa ctcacgctag ttttgagccc ggggcctcat 60 gaagcatatg taactcacac gtctcttcac cttgggagcc atgagaacta tttttcaagt 120 tgcatataac tcacacacag aaaacaataa aaaataacaa atttttcatt aaattagaaa 180 ggatcatttt gttttcgaag tttttattct attttcataa taaaacaccg tggccccaag 240 gaaaaaahtt ttttttctag tgtggcagtc aatgtg 276 // ID LTR10_TS repbase; DNA; PRI; 296 BP. XX AC . XX DT 07-DEC-2009 (Rel. 15.09, Created) DT 07-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR10_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-296 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1267-1267 (2010). XX DR [1] (Consensus) XX CC ~99% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 296 BP; 73 A; 73 C; 96 G; 54 T; 0 other; tgtggagagc gcggcctgag ccaagatggc gcccgggacc tctgccaagt cccatgatta 60 ggcgcatgat ataccgccgg gccctccttc actgcgcacc gcgcaagtgc aacgctgcag 120 agcctatgaa ctgctgacac gctggtacgg gacttgtggg tggagaccaa gggatataag 180 tacgggacaa cggggaagaa agaagaagaa gctggcatgg tgtaaaggct gaataaaccg 240 ctttgagaag aacacgttgt tgttgcctcc ttcctgctgg tcaggggtga gcgaca 296 // ID GarnAlu3 repbase; DNA; PRI; 321 BP. XX AC . XX DT 06-APR-2010 (Rel. 15.05, Created) DT 06-APR-2010 (Rel. 15.05, Last updated, Version 3) XX DE SINE element - consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; GarnAlu3. 
XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-321 RA Jurka J.; RT "SINE elements from the bushbaby genome."; RL Repbase Reports 10(5), 780-780 (2010). XX DR [1] (Consensus) XX CC The youngest sequences are >90% identical to consensus. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project. XX SQ Sequence 321 BP; 79 A; 84 C; 107 G; 50 T; 1 other; gggcggcgcc tgtggctcag tgagtagggc accagcccca tataccgagg gtggcaggtt 60 cgaacctggc cccggccaaa ctgcaacaaa aaaatagccg ggcgttgtgg cgggcgcctg 120 tagtcccagc tactcgggag gctgaggcaa gagaatcgcc taagcccaag agctggaggt 180 tgctgtgagc tgtgacgcca gcnctcggga ggctgaggca agagaatcgc ctaagcccaa 240 gagctggagg ttgctgtgag ctgtgacgcc acggcactct accgagggcg acaaagtgag 300 actctgtctc taaaaaaaaa a 321 // ID MacERVK1_LTR1c repbase; DNA; PRI; 396 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Cercopithecidae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; LTR; KW MacERVK1_LTR1c. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-396 RA Smit A.F.; RT "MacERVK1_LTR1c - ERV2 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 4%. 
XX SQ Sequence 396 BP; 103 A; 98 C; 101 G; 94 T; 0 other; tgtagaggac tacgtgctcg caaacagggt gttcccgata agtcctgctc tcgcaaacga 60 agcagggcgt tccccataag tcctgctctc gcaaacgaag cagggcgttc ccgacaagtc 120 ctgctcttgc aaacgaagca gggcgttccc gataagtcct gctcttgcaa acgaagcagg 180 gcgttggggg cctgtttata tgtaaacatc ttgaaaatcc agaaagtcag ggaaaggtca 240 gaaaaacaac aatgtgtctt gtgacttggc aacattccac aaacgactgt ataaaataaa 300 gcggagcgcg ccattcgagg cggccgccat gtttgtcttg tcttgtgttg tcttgtgtgt 360 tcattccttt gtttaggaaa cacgcggacc ccaaca 396 // ID LTR57_Mim repbase; DNA; PRI; 598 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV3-type endogenous retrovirus - DE consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; nonautonomous; KW LTR57_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-598 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1728-1728 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 598 BP; 168 A; 134 C; 136 G; 160 T; 0 other; tgagggaaaa atgcaagttc cccaccacac agaacactgc gctgtgctag cttggagtgc 60 tcaacaaaat tgttacaaaa tagattaagc aaatgccaaa gatagctaca ggcacaagag 120 atgtgcctca gcactttctt gaggcagagg cgaccggctt aaaccagccc tgactgcttt 180 agaactgctt tagctgtcct atagtaccct ttagctaatt gtaatactaa aatccccgcc 240 tagtggagaa ttataatatc attaacataa catgggttgt atgaaggcac atgatgattg 300 tactgtgcat gcttgatttg tttcaatgtg aacatgcgat gatacaatgt aagagctctg 360 cgtcactcac tgggaaggga tttaaggcaa caccacgggt aggcagatgc ggatttttca 420 agaaagctct gcggaaaagt tcctgtgccg cccaggagtg agcaccctgt gtaagtaacc 480 tctaataaac tcatctaact caccaagctg gacttgtctg agtcattctt tggtctctcg 540 gctccatctc agtttggggg gcggtttttc tatactgtcc cgggattttc ccgaaaca 598 // ID HERVK repbase; DNA; PRI; 7536 BP. XX AC . XX DT 11-NOV-1996 (Rel. 1.1, Created) DT 04-AUG-2008 (Rel. 13.07, Last updated, Version 3) XX DE Internal part of human endogenous retrovirus HERV-K; clone K-10. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; protease; KW gag protein; LTR5; HERVK; env protein; pol polyprotein; revertase; KW ERVK. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-7536 RA Ono M., Yasunaga T., Miyata T. and Ushikubo H.; RT "Nucleotide sequence of human endogenous retrovirus genome RT related to the mouse mammary tumor virus genome."; RL J. Virol 60(2), 589-598 (1986). XX RN [2] RP 1-7536 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR [2] (Consensus) XX CC LTRs of HERV-K are represented in REPBASE as LTR5. CC The human K10 and K18 endogenous retrovirus clones have a CC deletion CC of 290 bp (between positions 5532 and 5533) with respect to CC clones K8 and K22 (see separate entries). This deletion fuses the CC pol and env reading frames, eliminating the 3' end of pol and the CC 5' end of env. 
CC misc_binding 3..20 CC /bound_moiety="Lys-tRNA primer" CC CDS 144..995 CC /product="gag 1 protein" CC CDS 839..2143 CC /product="gag 2 protein" CC CDS 2671..2949 CC /note="putative" CC /product="neutral protease large subunit" CC CDS 3204..7243 CC /note="pol/env putative" CC /codon_start=1 CC [2] LTR5. XX SQ Sequence 7536 BP; 2543 A; 1474 C; 1574 G; 1940 T; 5 other; tctggtgccc aacgtggagg cttttctcta gggtgaaggt acgctcgagc gtggtcattg 60 aggacaagtc gacgagagat cccgagtacg tctacagtca gccttacggt aagcttgtgc 120 gctcggaaga agctagggtg ataatggggc aaactaaaag taaaattaaa agtaaatatg 180 cctcttatct cagctttatt aaaattcttt taaaaagagg gggagttaaa gtatctacaa 240 aaaatctaat caagctattt caaataatag aacaattttg cccatggttt ccagaacaag 300 gaactttaga tctaaaagat tggaaaagaa ttggtaagga actaaaacaa gcaggtagga 360 agggtaatat cattccactt acagtatgga atgattgggc cattattaaa gcagctttag 420 aaccatttca aacagaagaa gatagcattt cagtttctga tgcccctgga agctgtataa 480 tagattgtaa tgaaaagaca aggaaaaaat cccagaaaga aacggaaggt ttacattgcg 540 aatatgtagc agagccggta atggctcagt caacgcaaaa tgttgactat aatcaattac 600 aggaggtgat atatcctgaa acgttaaaat tagaaggaaa aggtccagaa ttagtggggc 660 catcagagtc taaaccacga ggcacaagtc ctcttccagc aggtcaggtg cccgtaacat 720 tacaacctca aangcaggtt aaagaaaata agacccaacc gccagtagcc tatcaatact 780 ggccnccggc tgaacttcag tatcggccac ccccagaaag tcagtatgga tatccaggaa 840 tgcccccagc accacagggc agggcgccat accctcagcc gcccactagg agacttaatc 900 ctacggcacc acctagtaga cagggtagtg aattacatga aattattgat aaatcaagaa 960 aggaaggaga tactgaggca tggcaattcc cagtaacgtt agaaccgatg ccacctggag 1020 aaggagccca agagggagag cctcccacag ttgaggccag atacaagtct ttttcgataa 1080 aaatgctaaa agatatgaaa gagggagtaa aacagtatgg acccaactcc ccttatatga 1140 ggacattatt agattccatt gctcatggac atagactcat tccttatgat tgggagattc 1200 tggcaaaatc gtctctctca ccctctcaat ttttacaatt taagacttgg tggattgatg 1260 gggtacaaga acaggtccga agaaataggg ctgccaatcc tccagttaac atagatgcag 1320 atcaactatt aggaataggt caaaattgga gtactattag tcaacaagca ttaatgcaaa 1380 atgaggccat 
tgagcaagtt agagctatct gccttagagc ctgggaaaaa atccaagacc 1440 caggaagtac ctgcccctca tttaatacag taagacaagg ttcaaaagag ccctatcctg 1500 attttgtggc aaggctccaa gatgttgctc aaaagtcaat tgccgatgaa aaagcccgta 1560 aggtcatagt ggagttgatg gcatatgaaa acgccaatcc tgagtgtcaa tcagccatta 1620 agccattaaa aggaaaggtt cctgcaggat cagatgtaat ctcagaatat gtaaaagcct 1680 gtgatggaat cggaggagct atgcataaag ctatgcttat ggctcaagca ataacaggag 1740 ttgttttagg aggacaagtt agaacatttg gaggaaaatg ttataattgt ggtcaaattg 1800 gtcacttaaa aaagaattgc ccagtcttaa acaaacagaa tataactatt caagcaacta 1860 caacaggtag agagccacct gacttatgtc caagatgtaa aaaaggaaaa cattgggcta 1920 gtcaatgtcg ttctaaattt gataaaaatg ggcaaccatt gtcgggaaac gagcaaaggg 1980 gccagcctca ggccccacaa caaactgggg cattcccaat tcagccattt gttcctcagg 2040 gttttcaggg acaacaaccc ccactgtccc aagtgtttca gggaataagc cagttaccac 2100 aatacaacaa ttgtcccccg ccacaagcgg cagtgcagca gtagatttat gtactataca 2160 agcagtctct ctgcttccag gggagccccc acaaaaaatc cccacagggg tatatggccc 2220 cctgcctgag gggactgtag gactaatctt gggaagatca agtctaaatc taaaaggagt 2280 tcaaattcat actagtgtgg ttgattcaga ctataaaggc gaaattcaat tggttattag 2340 ctcttcaatt ccttggagtg ccagtccagg agacaggatt gctcaattat tactcctgcc 2400 atatattaag ggtggaaata gtgaaataaa aagaatagga gggcttggaa gcactgatcc 2460 aacaggaaag gctgcatatt gggcaagtca ggtctcagag aacagacctg tgtgtaaggc 2520 cattattcaa ggaaaacagt ttgaagggtt ggtagacact ggagcagatg tctctatcat 2580 tgctttaaat cagtggccaa aaaattggcc taaacaaaag gctgttacag gacttgtcgg 2640 cataggcaca gcctcagaag tgtatcaaag tacggagatt ttacattgct tagggccaga 2700 taatcaagaa agtactgttc agccaatgat tacttcaatt cctcttaatc tgtggggtcg 2760 agatttatta caacaatggg gtgcggaaat caccatgccc gctccattat atagccccac 2820 gagtcaaaaa atcatgacca agatgggata tataccagga aagggactag ggaaaaatga 2880 agatggcatt aaaattccag ttgaggctaa aataaatcaa aaaagagaag gaatagggta 2940 tcctttttag gggcggccac tgtagagcct cctaaaccca taccattaac ttggaaaaca 3000 gaaaaaccgg tgtgggtaaa tcagtggccg ctaccaaaac aaaaactgga ggctttacat 3060 ttattagcaa atgaacagtt 
agaaaagggt catattgagc cttcgttctc accttggaat 3120 tctcctgtgt ttgtaattca gaagaaatca ggcaaatggc gtatgttaac tgacttaagg 3180 gctgtaaacg ccgtaattca acccatgggg cctctccaac ccgggttgcc ctctccggcc 3240 atgatcccaa aagattggcc tttaattata attgatctaa aggattgctt ttttaccatc 3300 cctctggcag agcaggattg tgaaaaattt gcctttacta taccagccat aaataataaa 3360 gaaccagcca ccaggtttca gtggaaagtg ttacctcagg gaatgcttaa tagtccaact 3420 atttgtcaga cttttgtagg tcgagctctt caaccagtta gagaaaagtt ttcagactgt 3480 tatattattc attatattga tgatatttta tgtgctgcag aaacgaaaga taaattaatt 3540 gactgttata catttctgca agcagaggtt gccaatgctg gactggcaat agcatctgat 3600 aagatccaaa cctctactcc ttttcattat ttagggatgc agatagaaaa tagaaaaatt 3660 aagccacaaa aaatagaaat aagaaaagac acattaaaaa cactaaatga ttttcaaaaa 3720 ttactaggag atattaattg gattcggcca actctaggca ttcctactta tgccatgtca 3780 aatttgttct ctatcttaag aggagactca gacttaaata gtaaaagaat gttaacccca 3840 gaggcaacaa aagaaattaa attagtggaa gaaaaaattc agtcagcgca aataaataga 3900 atagatccct tagccccact ccaacttttg atttttgcca ctgcacattc tccaacaggc 3960 atcattattc aaaatactga tcttgtggag tggtcattcc ttcctcacag tacagttaag 4020 acttttacac tgtacttgga tcaaatagct acattaatcg gtcagacaag attacgaata 4080 ataaaattat gtggaaatga cccagacaaa atagttgtcc ctttaaccaa ggaacaagtt 4140 agacaagcct ttatcaattc tggtgcatgg nagattggtc ttgctaattt tgtgggaatt 4200 attgataatc attacccaaa aacaaagatc ttccagttct taaaattgac tacttggatt 4260 ctacctaaaa ttaccagacg tgaaccttta gaaaatgctc taacagtatt tactgatggt 4320 tccagcaatg gaaaagcagc ttacacaggg ccgaaagaac gagtaatcaa aactccatat 4380 caatcggctc aaagagcaga gttggttgca gtcattacag tgttacaaga ttttgaccaa 4440 cctatcaata ttatatcaga ttctgcatat gtagtacagg ctacaaggga tgttgagaca 4500 gctctaatta aatatagcat ggatgatcag ttaaaccagc tattcaattt attacaacaa 4560 actgtaagaa aaagaaattt cccattttat attactcata ttcgagcaca cactaattta 4620 ccagggcctt tgactaaagc aaatgaacaa gctgacttac tggtatcatc tgcactcata 4680 aaagcacaag aacttcatgc tttgactcat gtaaatgcag caggattaaa aaacaaattt 4740 gatgtcacat ggaaacaggc aaaagatatt 
gtacaacatt gcacccagtg tcaagtctta 4800 cacctgccca ctcaagaggc aggagttaat cccagaggtc tgtgtcctaa tgcattatgg 4860 caaatggatg tcacgcatgt accttcattt ggaagattat catatgttca cgtaacagtt 4920 gatacttatt cacatttcat atgggcaact tgccaaacag gagaaagtac ttcccatgtt 4980 aaaaaacatt tattgtcttg ttttgctgta atgggagttc cagaaaaaat caaaactgac 5040 aatggaccag gatattgtag taaagctttc caaaaattct taagtcagtg gaaaatttca 5100 catacaacag gaattcctta taattcccaa ggacaggcca tagttgaaag aactaataga 5160 acactcaaaa ctcaattagt taaacaaaaa gaagggggag acagtaagga gtgtaccact 5220 cctcagatgc aacttaatct agcactctat actttaaatt ttttaaacat ttatagaaat 5280 cagactacta cttctgcaga acaacatctt actggtaaaa agaacagccc acatgaagga 5340 aaactaattt ggtggaaaga taataaaaat aagacatggg aaatagggaa ggtgataacg 5400 tgggggagag gttttgcttg tgtttcacca ggagaaaatc agcttcctgt ttggataccc 5460 actagacatt tgaagttcta caatgaaccc atcagagatg caaagaaaag cacctccgcg 5520 gagacggaga caccgcaatc gagcaccgtt gactcacaag atgaacaaaa tggtgacgtc 5580 agaagaacag atgaagttgc catccaccaa gaaggcagag ccgccgactt gggcacaact 5640 aaagaagctg acgcagttag ctacaaaata tctagagaac acaaaggtga cacaaacccc 5700 agagagtatg ctgcttgcag ccttgatgat tgtatcaatg gtggtaagtc tccctatgcc 5760 tgcaggagca gctgcagcta actataccna ctgggcctat gtgcctttcc cgcccttaat 5820 tcgggcagtc acatggatgg ataatcctat agaagtatat gttaatgata gtgtatgggt 5880 acctggcccc atagatgatc gctgccctgc caaacctgag gaagaaggga tgatgataaa 5940 tatttccatt gggtatcgtt atcctcctat ttgcctaggg agagcaccag gatgtttaat 6000 gcctgcagtc caaaattggt tggtagaagt acctactgtc agtcccatcn gtagattcac 6060 ttatcacatg gtaagcggga tgtcactcag gccacgggta aattatttac aagacttttc 6120 ttatcaaaga tcattaaaat ttagacctaa agggaaacct tgccccaagg aaattcccaa 6180 agaatcaaaa aatacagaag ttttagtttg ggaagaatgt gtggccaata gtgcggtgat 6240 attacaaaac aatgaatttg gaactattat agattgggca cctcgaggtc aattctacca 6300 caattgctca ggacaaactc agtcgtgtcc aagtgcacaa gtgagtccag ctgttgatag 6360 cgacttaaca gaaagtttag acaaacataa gcataaaaaa ttgcagtctt tctacccttg 6420 ggaatgggga gaaaaaggaa tctctacccc aagaccaaaa 
atagtaagtc ctgtttctgg 6480 tcctgaacat ccagaattat ggaggcttac tgtggcctca caccacatta gaatttggtc 6540 tggaaatcaa actttagaaa caagagatcg taagccattt tatactgtcg acctaaattc 6600 cagtctaaca gttcctttac aaagttgcgt aaagccccct tatatgctag ttgtaggaaa 6660 tatagttatt aaaccagact cccagactat aacctgtgaa aattgtagat tgcttacttg 6720 cattgattca acttttaatt ggcaacaccg tattctgctg gtgagagcaa gagagggcgt 6780 gtggatccct gtgtccatgg accgaccgtg ggaggcctcg ccatccatcc atattttgac 6840 tgaagtatta aaaggtgttt taaatagatc caaaagattc atttttactt taattgcagt 6900 gattatggga ttaattgcag tcacagctac ggctgctgta gcaggagttg cattgcactc 6960 ttctgttcag tcagtaaact ttgttaatga ttggcaaaaa aattctacaa gattgtggaa 7020 ttcacaatct agtattgatc aaaaattggc aaatcaaatt aatgatctta gacaaactgt 7080 catttggatg ggagacagac tcatgagctt agaacatcgt ttccagttac aatgtgactg 7140 gaatacgtca gatttttgta ttacacccca aatttataat gagtctgagc atcactggga 7200 catggttaga cgccatctac agggaagaga agataatctc actttagaca tttccaaatt 7260 aaaagaacaa attttcgaag catcaaaagc ccatttaaat ttggtgccag gaactgaggc 7320 aattgcagga gttgctgatg gcctcgcaaa tcttaaccct gtcacttggg ttaagaccat 7380 tggaagtact acgattataa atctcatatt aatccttgtg tgcctgtttt gtctgttgtt 7440 agtctgcagg tgtacccaac agctccgaag agacagcgac catcgagaac gggccatgat 7500 gacgatggcg gttttgtcga aaagaaaagg gggaaa 7536 // ID AluJ_Mim repbase; DNA; PRI; 329 BP. XX AC . XX DT 20-NOV-2010 (Rel. 15.12, Created) DT 20-NOV-2010 (Rel. 15.12, Last updated, Version 2) XX DE SINE element - consensus. XX KW SINE1/7SL; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; AluJ_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-329 RA Jurka J.; RT "SINE elements from the mouse lemur."; RL Repbase Reports 10(12), 2171-2171 (2010). XX DR [1] (Consensus) XX CC ~95% identical to consensus. 
CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 329 BP; 95 A; 77 C; 100 G; 57 T; 0 other; ggccgggcgc ggtggctcac gcctgtaatc ctagcactct gggaggccga ggcgggagga 60 tcgctcgagg tcaggagttc gaaaccagcc tgagcaagag cgagaccccg tctctactaa 120 aaatagaaag aaattaattg gccaactaaa aatatataga aaaaattagc cgggcatggt 180 ggcgcatgcc tgtagtccca gctactcggg aggctgaggc aggaggatcg cttgagccca 240 ggagtttgag gttgctgtga gctaggctga cgccacggca ctctagcccg ggcaacagag 300 tgagactctg tctcaaaaaa aaaaaaaaa 329 // ID MER75A repbase; DNA; PRI; 77 BP. XX AC . XX DT 14-MAR-2006 (Rel. 13.06, Created) DT 26-JUN-2008 (Rel. 13.06, Last updated, Version 1) XX DE piggyBac DNA transposon from primates. XX KW piggyBac; DNA transposon; Transposable Element; DNA; MER75A. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-77 RA Smit A.F.; RT "MER75A - piggyBac DNA transposon from primates."; RL Direct Submission to Repbase Update (26-JUN-2008). XX DR [1] (Consensus) XX CC 14 bp TIRs. 5' AA and 3' TT are part of the TTAA TSD. XX SQ Sequence 77 BP; 16 A; 20 C; 18 G; 23 T; 0 other; aacccatttc ccgtttgccc cgagaatact gcgctggcag cgagctgcac tttttttttc 60 taaacgggaa atgggtt 77 // ID LTR3A repbase; DNA; PRI; 484 BP. XX AC . XX DT 16-MAR-2006 (Rel. 11.03, Created) DT 10-APR-2007 (Rel. 11.03, Last updated, Version 1) XX DE Long terminal repeat of ERV2 Endogenous Retrovirus from primates. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; LTR3; LTR3A_LTR; LTR3A. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. 
XX RN [1] RP 1-484 RA Smit A.F.; RT "LTR3A - a subfamily of endogenous retroviruses from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC HERVK3 LTR. XX SQ Sequence 484 BP; 101 A; 145 C; 117 G; 120 T; 1 other; tgtagggagc cgaaggccng tgggacgtga ccaactcagc attccactgg aggctatatg 60 atcaaacagc aaactgttta tcatgaatgc aggatgtggg caaactcacg actgcgcctg 120 ccgccagaag gtttgctgag ggcaatcact ccctggcgcc gggctccttg aggttatcta 180 ctgggacatc tagagcctgt tgttcgagga atgcagtctt gcaagcctac tctggaccga 240 gcagctgacc ccttcttcca cccccccttc tcactatctc ttttgcctaa taaatacgga 300 gggctgtgta aagctcaggg cccttgtcca ctagaggcaa ggtgccccct gaccccttct 360 tccaaatata ctcttttgtc tcttgtcttt tattcccacg ttcgcccccc tttgttcagt 420 cccccaaggt ccgtgcgggt tacatagtgg cgccccgaac agcgacagaa tcgggtgctc 480 aaca 484 // ID PTERV2a repbase; DNA; PRI; 7503 BP. XX AC . XX DT 14-MAR-2006 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Pan troglodytes. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; PTERV2a. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-7503 RA Smit A.F.; RT "PTERV2a - ERV1 Endogenous Retrovirus from Pan troglodytes."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC Closest match outside chimp to BaEV, but quite distant CC lib20040702. 
XX SQ Sequence 7503 BP; 1854 A; 2234 C; 1837 G; 1519 T; 59 other; tttgggggct cgtccgggat tggtcgccgc ctgacgacac ccccacgacc cgcacggcga 60 agacncgctc ggcccgggcc atcgactgac ngacgnccag gcgccctgcg caggtacttt 120 cgttttgttc tgttcgtctg cttgacttgt gtatttntgt ngccggctaa atctgcaggc 180 gggntcttcc tgccatctga agagtactgg tgctcagtct gtataagtgt ggaagggggg 240 cagacgtgcc cggcaccttc ccacttacgc cccgggggac gccctggcgg tngtctggag 300 gaaaactgac gatcccgtcc gtctcctcac ctctgnaggc gggttctccc tgccatctgt 360 agnccggttc tcccggccan ccgaagcacc gccctggcgg ttgtctggag gagaactgac 420 gatnccgtca gtctcctcac ctctgnaggc nggttctccc ngccatctga atccttcgtg 480 gaactgtggc gccgntctct cgccgcgcgg cttctgtgtg tgtgtgcagt cnctgttttt 540 gctgttgtcc tgtttgcctg tgcgttggaa nctgacttgg gcncgactat gggacagaca 600 ctgacgactc ctctntcnct gaccctgact cacttccctg acgtangggc ncgagcccac 660 aatctccctg tngaaattcg caaagggcga tggnaagcct tctgctcttc cgaatggccc 720 accctcggcg tngggtggcc ccagggcggg acctttgacc tctcnattat cttacaggtt 780 aaggcaaaag tgatggatcc agggccacgn ggccgccctg accaggtggc ctatatcatc 840 anctnggnag acctggttcg ggatccnccc ccttaggtga agcccttcct gccctcggct 900 tccccttccc agtcgaccct cctcgccttg gaggcccccc gagaccagac cccggtcccc 960 ctgaaacctg tcctcccgga tgagggtcag agggacctgg tcctcctagc aagcccctcc 1020 ctnctncgcc tcnnaacccc ctcctcnacc cccctcccta cnccacgccc tcagcccccg 1080 cgttgtcccc tgccccttct tccaccccct cggctcctac tctttctcca acctcccctt 1140 cttcggcgcc gacctcctct tcttccaccc cctctcngta ccccancccc cttctccggc 1200 cccacccgaa ctcgcccctc agacccggcc tcagacaccc cgcctccgct tacggcgggc 1260 cgaggacccc ggcgaccagc ccgcctggca gtcctccctc tttcgcctcc gcaccgtgaa 1320 ccgcacggtc cagtattggc ccttttcggc ctcagatctc tacaattgga agacccataa 1380 cccccctttc tcccaagacc cgcaggccct gacctctctg atagagtcca ttctcctcac 1440 tcaccaaccc acctgggatg actgccaaca gctcttacag gttcttctga ccacagaaga 1500 gaggcaacga gtcctcctcg aggcccggaa aaatgtgccg gggccaggag gattcccgac 1560 ccaactcccc aacgagatag atgagggatt tcccctcacc cgcccggatt gggactatga 1620 aacagcaaca ggtagggaga gtctccgaat 
ctatcgccag gctctgttgg caggtctcaa 1680 aggggccgga aagcgcccca ccaatttggc taaggtaaga actattactc agggaaagaa 1740 cgaaagcccg gcagccttca tggaaaggct cctagagggg tttcgaatgt acactccatt 1800 caatcccgag gccccagagc acaaggccac cgtggcaatg tcattcatag accaggcagc 1860 gctagacata aagggaaagc tccaaagatt ggacgggatc cagacctatg ggctgcaaga 1920 actagttaga gaggcagaaa aagtttataa caaaagagag actactgagg aaaaagaagc 1980 taggctagca aaggaacagg aggagcggga agatcgacga gatcgtaaga gagacaggca 2040 tttgactaaa atcctggcag cagtagtgac agggaaaggg ccagggccag ggagagaggg 2100 gggagaacga aggcgcccga aggtggataa agaccaatgt gcctattgca aggaacgagg 2160 acattgggtc aaggaatgtc ctaaacgtcc taaggaccgg aagaagccca ctcctgtcct 2220 gaccctggga gaggacagtg attaggggcg tcagggctcc gaagcccccc ccgagccccg 2280 gctaaccctt tctatagggg ggcgccccac cacctttcta gtggacaccg gggcccagca 2340 ttcagttctg acaaaagcag gcgggcctct ttcatcccgc acctcttggg tccaaggagc 2400 aacaggagga aagctgcaca agtggacgac ccaccgaaca gtaaaccttg gaaaaggtat 2460 ggtgactcat tctttcttag tagtacctga atgcccatat ccccttctgg ggcaggatct 2520 gttgaccaag ctcggagccc agatacattt ctcagagaga ggggcccagg tactgggtga 2580 ggatggtcag cctatccaaa ttctgaccgt ttccttgcaa gatgagtacc ggctttttga 2640 gactcctatc ttcaccagcc ctcccgataa ttggctgcaa gaatttcccc aggcttgggc 2700 agagacangg ggacttggac tggcnaaatt tcaagccccg attatagttg acctcaaacc 2760 caccgcagtg ccngtgtcca ttaagcaata ccccatgagc cgagaagccc gtatgggcat 2820 ccggcagcat gttgataaat ttctggaatt aggggtcttg cggccatgcc gctcaccttg 2880 gaacacgccg ctcctcccgg taaagaaacc tggnacccaa gattataggc ccgtccagga 2940 cttgagagaa attaataaga gaaccatgga catacatccc acagtcccca acccttacaa 3000 cctgctcagt accttgagac cagaccacaa ctggtataca gtactagacc taaaagatgc 3060 attcttttgc ttacccctgg ctccccaaag ccaagagctt tttgcctttg aatggaggga 3120 ccctgagagg ggaatctcag gccaattaac ttggactcgg cttccccaag ggttcaagaa 3180 ctctcctacc ctctttgacg aggctcttca ccgggacttg gctgattttc gcacccngca 3240 cccagattta actctgctcc agtatgtaga tgacctcctc ctggcngccc ccactaaggg 3300 agcctgccta cagggcacca ggcaactgct ccaggagctc 
ggagaaaaag gataccgagc 3360 ntctgccaag aaagcacaaa tctgccaggc taaggtaacc tacctgggat acatcttgag 3420 tgaagnaaaa aggtggctca cccctgggcg gatagagact gtagcccgca ttccgccacc 3480 ccggagcccc aaggaggtgc gtgagtttct ggggactgcc gggttctgcc gcctgtggnt 3540 acccggtttt gctgagttgg cggcccccnt ttatgccctc accaaaggga gcaacccctt 3600 tacctggctg gaggaacacc aacaggcctt cgaaacttta aagaaggcac tcctctctgc 3660 cccngccctc gggctacctg acacatccaa gccttttacc ctctatgcag acgagagacg 3720 ggggatagcc aaaggggtct taacccaaaa actggggccc tggaagagac cggtagccta 3780 cttgtctaag aaactggacc ctgtggcggc tgggtggccc ccttgcctcc gcattatggc 3840 agccaccgct atgctagtca aagactctgc taagttaacc cttgggcaac cattgactgt 3900 cattaccccg cacgccttgg aggccatagt gcggcagccc ccggaccgtt gggtcaccaa 3960 cgctcgtcta acccactacc aagccctnct actagacacg gaccgcgtcc gctttggccc 4020 tccggtcact ctgaatcctg ccaccttgct acctgtaccg gaggtcccgc tgagccccca 4080 cgactgtcga caagtgctgg cggagaccca cgggactcga gaagacctcc aggactacga 4140 actcccagac gcagaccata cttggtacac agacggtagc agcttcatgg acgcaggtac 4200 ccggagggcg ggggcggcgg tagtggatgg acatgccacg atatgggcgc aggcactgcc 4260 tcccggaacg tctgctcaaa aggctgaact aattgctcta acaaaggcct tagagctatc 4320 gcaggggaaa aaggctaaca tctacacaga cagtcggtat gcctttgcga cagcccacac 4380 ccatgggagc atttacaaga ggcgaggtct cctaacatca gaaggaaaag aaatcaaaaa 4440 taaggccgaa ataatcgcct tattaaaggc cctcttcctc cctaaaaagg tggccataat 4500 tcattgtcct ggacatcaaa aaggacatga ccccgtcgcc cagggtaaca ggcaagctga 4560 ccaggcggcc aagcaggctg ctagaataga gacattgacc ttagtttcgg aaaccaaaga 4620 ggctgaccgg ataccccctt ccacaagtta tacctataca ccagaggacc gggaagaggc 4680 agtagcctta ggagccacag aaaaccaaga gactaaaaat tgggaaaaag acgggaagac 4740 agtcctccca caaaaagagg ccacggccat ggtgcagcag atgcacncct ggacacattt 4800 aagtagtagg aaactaaaac tgctcattga aaagactgac ttcctaatcc ccagggtcgg 4860 caccctcctg gaacaagtaa cgctcgcttg caaggcctgc caacaagtaa acgccggggc 4920 cacgcgagtc ccggcgggga taaggacacg gggcaaccgc cctgggacct attgggaagt 4980 agattttact gaaataaagc ctcaccatgc gggatataaa tatttattag 
tatttgtaga 5040 cacattttca ggatgggtag aagcctaccc cacccggcaa gaaacggccc acatagtggc 5100 caagaagata ttagaagaaa ttttccccag gtttggactc cccaaggtaa tcgggtcaga 5160 caatgggcca gccttcgtct cccaggtaag tcagggactt gccaggatac tggggattaa 5220 ttggaaactt cattgtgcnt acaggcccca gagttcaggg caggtagaac gggtggacag 5280 aactatcaaa gagaccttgg caaaattgac cttagagact ggcttaaaag attggagacg 5340 tctcctatcc ctagctctct tgagggcccg aaatacgcct aatcgctttg ggctcacccc 5400 ttatgaaatc ctctacgggg gaccacctcc cttgtcaacc ttgcttgatt ctttctcccc 5460 ctctaaccct aagactgact tgcaggctcg gctaaaagga ctacaggcgg tgcaagccca 5520 aatttgggct cctttggcgg aactgtacca gcctggacac ccacaaacca gtcacccttt 5580 ccaagtggga gactccgtct atgtcagacg acatcgctcc caaggattag aaccccggtg 5640 gaagggacca tacatcgttc tcctcaccac acccactgct gtgaaagttg acggggtcgc 5700 cgcctggatc cacgcatccc acgtaaaagc tgctccgaag gtgccaggat cagcatcgcc 5760 tgagaaatgg agacttcgtc gctccaggga ccccctcaag ataagactct cccgtgtcta 5820 accccccacc tactgttagc tcttttcctt ccctgggtta tcggaagcag caacccccac 5880 cagccctatc gattgacttg gcaaataact aattttgaaa cccatgaagt cctcaacgag 5940 acttcacatg tagccccttt aaatacctgg ttccctgacc tctacttcaa tcttgacaaa 6000 atagccatga tagatgaaat ggagggtggc gagtggagaa agcaagcgag gagagtctcc 6060 ctaagtcgaa acgggtttta tgtctgccct ggattccgga cgggaccgat gaaaaagacc 6120 tgtggtgaaa taatgtccct gtactgtgca agttggtcat gtgtaacnac taatgatggg 6180 gaatggaaat gggaaaccca accctggtat ttgaccatgt cctatgtcca gccctgcacc 6240 aggacacggt attcggccac ctgtaactta atccgtgtca aatttgagga ggccgcaaaa 6300 actgaccccc gntggacaac cggactaatt tggggcctaa atttatacca aactccggca 6360 nctggactcc ctatccaaat taggctacta gttaacccgg tctcagcctc ggtcccggta 6420 gggccaaacc cggttctaac agggagagca ccttctcagt gagggagccg gcagaaagtc 6480 ccgaccaccg tttccccgtc tcctccacca aatccgnatc cccatcggca ctcccgaggg 6540 ccccatcaac gctcccgggg nccacccgcc tgcctcccga cccggaaaca agcaacagac 6600 tcttcagcct catcagaggc gcttacctcg ccctgaacca gacaangcct gaatccacca 6660 cctcctgctg gctctgcctg gccacaggcc ccccttacta tgaaggtatt gcctctgttg 
6720 gtaatcttac taactccact agtcattctg gatgtgcatg ggaccagcac aagaaactta 6780 ccctaacaga ggtgtcaagg tcgggaacct gtataggccg ggtgcccccc agtcaccaaa 6840 aaaaaaaaaa aaaaaaaaaa aaaaaaaata cctcactaga atgaagagag aacctgtttc 6900 cctcaccctc gctgttatgc taagatgatg gagtagcggc tgaggtcagg acaggaaccg 6960 cggcattagt gcgtggcagc taccacctac aacaactcag ggcagccgta gatgaagacc 7020 tcagggccat agaacactcc attaccaaac ttgaagaatc tctaacctcc ctgtccgaag 7080 tagtactcca aaatcgacgg ggactagata taatttttct aaaagagggc gggctctgtg 7140 ctgcccttaa agagcagtgt tgcttttacg ctgatcattc cggagtagtt aaagactcta 7200 tggccaaact tagaaaaaga ctagatgata gacagaaaga aagagaatcc caacaaaact 7260 ggtttgaaac ttggtacaac caatccccct ggtttagtac tctcatctcc actatcctag 7320 ggcccctgat tctgtttatg cttattttaa ctttcaggcc ctgcattttt aaccgcttgc 7380 ttgctctaat taaagacaga ttaaatatag tgcatgctat ggtcctgact cagcggtacc 7440 aggcagtcaa gactaacgaa gagactcaag attgagcctc taagtcacaa aaagaggagg 7500 gaa 7503 // ID MER41E repbase; DNA; PRI; 595 BP. XX AC . XX DT 30-JUL-1998 (Rel. 3.06, Created) DT 26-JUN-2008 (Rel. 13.06, Last updated, Version 2) XX DE Long terminal repeat of endogenous retrovirus; internal sequence DE belongs to the MER4I group; subfamily MER41E. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW Long terminal repeat of retrovirus-like element; MER41E; KW MER4I-group family. XX NM MER41E. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 84-595 RA Kapitonov V.V. and Jurka J.; RT "MER41E."; RL Direct Submission to Repbase Update (JUL-1998). XX RN [2] RP 1-595 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR [1] (Consensus) XX CC Individual copies ~88% identical to the consensus sequence. CC [2] mer4 group 13%. 
XX SQ Sequence 595 BP; 159 A; 163 C; 133 G; 140 T; 0 other; tgagacagga ataatacagg gtggtcgcag gagaatagaa aattccaggc agcagtttca 60 catgactagc aaaaggaaac tgttgaaata gctgcataag ctaggggctg ataagaccct 120 gaaaaaccag ggtgtgggcc aagctggcta agaccgactg gacccaacat ggcgctggat 180 ttgacctagg tttcacctag gacctcatta tatgctcatt aacatactaa atcacacacc 240 caccagcgcc atgacagttc cgggaacacc catatttggt gtaaaaatgg gtggcaccac 300 agttccgaga aatctccacc tttttccagg aatcttcatg aatattccac cccttggtta 360 aagaaaccca taaaggtaga agccccaaac cccattgggc gcgactcctc tcttgagtac 420 gcccgcactc ccctttcttg agtgtgtact tttcgctttg caataaatct ccgtactttc 480 actattttct gactcatcct tgaattcctt ctcgcgacgg tgtcaagagc ctggacaccg 540 gctggggtcg aggtcccacc ggcgtttggg gacctccccc agcccaccgg tatca 595 // ID L1-1_TS repbase; DNA; PRI; 5277 BP. XX AC . XX DT 02-JAN-2010 (Rel. 15.03, Created) DT 02-JAN-2010 (Rel. 15.07, Last updated, Version 3) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-1_TS. XX NM L1-1_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-5277 RA Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(3), 440-440 (2010). XX DR [1] (Consensus) XX CC >95% identical to consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX FH Key Location/Qualifiers FT CDS 186..1145 FT /product="L1-1_TS_1p" FT /translation="MKKNQKKSMGLSQTPGRADTEKTDFGTQTMKSPQNEG FT SQTANVDIKTIMERLKRIEETQEETRKELISEITVIKNTVNEINNKLISME FT SRITQAEERISELEDQNIEITQTLKNTENKLKKTEQNLQEMSDYLKRPNLR FT IIGLPEAERETETTLEQTFHEIIQENFPYLISDAKIQTQEIQRTPARQQMR FT RPTPRHIIIRLNKVGTKEKILKAAREKGQITYHGRPIRIAADLSAETLQAR FT RAWSPIFKVLKDKQFQPRITYPAKLSFISEGELKSFPDIQSLRTYAATKPS FT LHETLKKVLNTEEKEKRTTFFTRVQGKE" FT CDS 1239..5051 FT /product="L1-1_TS_2p" FT /translation="MIGTNSHISIISLNVNGLNAPLKRHRMTKWIKYHQAT FT IYCLQETHLTRKDIHRLKVRGWETNFQANGTQKKGGVAILISDKIPFKLSK FT IKKDTEGHYIMIKGSLHQQEISILNIYAPNIGAPTFIKQLLGKLKKDIDSN FT TIITGDFNTPLTTLDRSSGQKISKEIRNLNETLDQMDLIDTYRTLHPKTTE FT YTFYSSPHGTYSKIDHIIGHKSSISKFKRTEILPCTFSDHSGIKINIDTNK FT VPPKPTKTWTLNSMMLNNSWVNDDIKTEIKRYLETNENEETSYQNLWDALK FT AVIRGEFISLQTHMRKMEGTEIDNLTSHLKKLEKQDHKNPNFSRRIQITKI FT KAQIQDIEDKKIIQKINETKSWFFERVNKIDGPLARLTKKKREKNQISTIR FT NTKDEVTSDPEEIQKIIRDYYVHLYGNKLENQKEMEDFLTSHNLPRLEQEE FT IETLNRPITIKEIDHVIRKLPTKKSPGPDGFPAEFYKTFKEELIPILLKVF FT QAIEKDGTLPKSFYEANITLIPKPGKDPTKKENYRPISLMNIDAKILNKIL FT ANRIQQYISKIIHHDQVGFIPGMQGWFNIRKTINVIKYINRCQNKNHMIIS FT LDAEKAFDKIQHPFLIKTLEHLGIRGTYLKIVKAIYEKPTASILLNGQKLE FT PIPLKTGTRQGCPLSPLLFNIVLEVLARAIREEEAIRGIQIGKEEVKLSLY FT ADDMXVYLENPRESVKGLLTLIKAFGKVSGYKINVQKTIAFLYTNNKQTET FT QIKNTVPFTIATKKMKYLGIFLTRDVKDLYNENYKTLLKEIKADTNKWKNI FT PCSWIGRINIVKMSILPKAIYKFNAIPIKLPTTFFSDLEKTTQEFIWKHKR FT PRIARTILSKKNKAGGITIPDFKLYYKATIIKTAWYWYRNRHIDQWNRIEI FT PEAKPQFLNQLIFDKAPTTYHWGEENLFSKWCWENWLTTCRRLKQDPYLSP FT CTKVNSKWIRDLNVKPQTIRTLEKEGNTLMEIGTGIQFLYKTRNPQDLREK FT IDKWDLIKLTSFCKAKETIKRAGRQPTDWEKVFANSRSDKGLTSWIYKELK FT RAEKKKTNNPIIKWAKDMNRHFTKEDIRAANKHMKKCSTSLIIREMQIKTT FT LRYHLTPVRMAIINNSKNNSCWRGCGEKGTLLHCWWECKLVQPLWKAVWRF FT LKALKIDLPYDPAIPLLGIYPEEHKSLYKKDTCTRMFIAALFTIARTWKQP FT CCPSKEDWIKKMWYIYTMEYYAAIKKNKIMNFAATWMELESIILSDLSQKQ FT RSEYHMFSLI" XX SQ Sequence 5277 BP; 2236 A; 1146 C; 925 G; 969 T; 1 other; ggaccaagac cgcaaactgc tgaatagaca gactgtaaag cagagggaaa agtgaacaaa 60 
gccagaagac cctcataagg aagaacagga cattgcagaa gagaaataaa agcaacccca 120 ccttccctca aaaacagaat tgaagagggg gagggaaagg gggagaggga gaaaaatcta 180 cagaaatgaa gaaaaaccaa aagaagagta tgggtctctc ccagacgcct gggagagcag 240 acactgagaa aactgacttc ggaacgcaaa caatgaaaag tccccagaat gaagggtctc 300 aaactgcaaa tgtagatatc aagacaataa tggagagatt aaaaagaatt gaggagacac 360 aagaagaaac taggaaggag ctgatatctg agataacagt aataaagaat actgtgaatg 420 aaataaataa caaactgata agcatggaaa gcagaattac ccaagcagaa gaaagaatct 480 cagagcttga ggaccaaaat atagaaataa cccaaactct taaaaacaca gaaaataagc 540 tcaaaaagac agaacaaaac cttcaagaga tgagtgacta cctcaagagg cctaacctaa 600 gaataatcgg actccctgag gcagaaagag aaacagagac cacattggaa caaactttcc 660 atgagatcat tcaagaaaac ttcccttatc taatcagtga tgcaaaaatt caaacacaag 720 agattcagag aacccccgca agacaacaaa tgagaagacc aactcctaga cacataataa 780 ttcgcctaaa taaagtaggc acaaaagaaa aaatcctaaa ggcagcaaga gaaaaaggcc 840 agatcaccta ccatggaaga ccaatcagaa tagcagcaga tttatctgca gaaaccctgc 900 aggctaggag agcttggagc cctatcttca aagtcctaaa agataaacaa tttcaaccaa 960 gaataaccta cccggccaag ctaagcttca tcagtgaggg agaattaaaa tctttcccag 1020 atatccaatc cctaagaact tatgcagcca caaaaccatc tctacatgaa acacttaaga 1080 aagtactaaa cacagaagaa aaggaaaaaa gaacaacgtt cttcacaaga gtacagggaa 1140 aagaataaaa tatacacgaa ccaaccccaa aaccaaaaga aagacaaaaa aaaaaaagaa 1200 aaaaccaagt ggaagaacaa taactcaata agaactccat gatagggacg aactctcaca 1260 tttcaataat tagtctgaat gtgaatggac taaacgcacc actgaaaaga catagaatga 1320 caaaatggat aaaatatcac caggcaacaa tatactgcct tcaagagacc catctcacta 1380 gaaaggacat acacagactc aaagtaagag gatgggaaac aaattttcag gcgaatggaa 1440 cacaaaagaa aggaggagtc gcgatcctaa tttcagacaa aataccattt aagctatcaa 1500 aaattaaaaa agatacagag ggccactaca taatgataaa aggttcactc catcaacaag 1560 aaatatctat cctaaacata tatgcaccca acataggtgc gccaacattc ataaagcaac 1620 ttctaggaaa actaaagaaa gacattgact ctaacaccat aataactggg gactttaata 1680 caccactcac aaccctagac agatcatcag gacaaaaaat cagcaaggag atccggaacc 1740 tcaatgagac tctggaccaa 
atggacttaa ttgataccta cagaacactc catccaaaga 1800 ccacagaata cacattctac tcatcaccac atggaacata ttctaagatc gaccacataa 1860 ttggacacaa atcaagtata agcaaattta aaaggaccga aattctacca tgcaccttct 1920 cggaccacag tggaataaaa ataaacattg acaccaacaa agtcccccca aaacccacaa 1980 agacatggac actaaacagc atgatgctaa acaactcctg ggtcaatgat gacatcaaaa 2040 cagagatcaa aagatacctg gaaacaaatg aaaatgaaga aacatcttac caaaatctct 2100 gggatgcctt aaaagctgta ataagagggg aatttatatc cctacaaaca cacatgagga 2160 aaatggaagg aacagaaatt gacaacctaa caagccacct aaagaagctg gaaaagcaag 2220 accacaaaaa ccctaatttc agcagaagaa tccagatcac caaaataaaa gcccaaatcc 2280 aggacataga agacaaaaag ataatacaaa aaatcaatga aacaaaaagc tggttcttcg 2340 aaagggtaaa caagatcgat ggtcccctag ctagactgac caagaaaaaa cgagaaaaaa 2400 accaaataag cacaatcaga aacacaaaag atgaagtcac atctgaccct gaagaaatac 2460 aaaagatcat tagagactac tacgtacact tgtatggaaa caaacttgaa aaccagaagg 2520 aaatggagga ctttctgaca tcacacaacc tacctaggtt ggaacaagaa gaaattgaga 2580 ccctaaatag accaataaca atcaaggaaa tcgaccacgt aataagaaaa cttcctacaa 2640 aaaaaagccc tggtccagat ggctttccag cagaattcta caagacattt aaggaggagc 2700 tgataccaat cctactgaag gtattccagg cgattgaaaa agatggaact ctccccaaat 2760 cattttacga agccaacatc acattgatac ccaagccagg taaagatcca acaaagaaag 2820 agaactacag gccaatatct ttgatgaaca tagatgctaa aattctcaac aagatcctag 2880 caaaccggat tcagcaatac atctcaaaaa tcatccatca tgaccaagta ggcttcattc 2940 ctggcatgca aggctggttc aacattcgta aaaccataaa tgtaattaaa tacatcaaca 3000 gatgtcaaaa caaaaaccac atgatcatat cactagatgc agaaaaagct tttgataaaa 3060 tccagcaccc cttcttgata aaaacccttg aacatctagg catacgggga acatacctca 3120 aaatagtaaa agccatctac gagaaaccca cagccagcat actcctaaat ggacaaaaat 3180 tggaaccaat tcccctgaaa actggaacta gacaaggatg cccactctct cccctcctgt 3240 tcaacatagt attggaagtc ctggctagag caatcagaga agaggaggca atcagaggta 3300 ttcaaatagg aaaagaggaa gtcaagttat ctctctatgc agatgatatg atkgtgtacc 3360 ttgaaaaccc aagagaatct gtcaaaggcc tccttacatt gataaaggcc tttggcaaag 3420 tctcaggata caaaataaat gtacaaaaga 
caatcgcatt tctctacacc aataataaac 3480 aaacagaaac ccaaataaaa aacacagttc cattcacaat agccacaaaa aaaatgaaat 3540 accttggcat cttcctaacc agagacgtga aagaccttta caatgaaaat tacaaaacac 3600 tgctcaaaga aatcaaagct gacacgaaca agtggaaaaa tatcccatgc tcatggatcg 3660 gaagaatcaa cattgtgaag atgtccatct tacctaaggc aatctacaaa ttcaatgcaa 3720 tacccattaa attaccaaca acattcttct cagacctaga aaaaacaaca caggaattca 3780 tatggaaaca caaacgtcca agaatagcca gaacaatcct cagcaaaaaa aacaaagcag 3840 gtggtatcac aataccagac ttcaaacttt actataaagc tacaatcatc aaaacagctt 3900 ggtattggta caggaacagg catatagatc aatggaacag aattgagatt ccagaagcaa 3960 aacctcaatt tctcaaccaa ctcatcttcg acaaagcccc caccacctac cactggggag 4020 aggagaacct attcagtaaa tggtgctggg aaaactggct gaccacatgc agaagattga 4080 aacaggaccc ctatctatcc ccatgcacaa aggttaactc caaatggatc agagacctaa 4140 atgtaaaacc tcaaaccata agaaccttag aaaaggaagg aaataccctc atggaaatcg 4200 gaactggcat ccaattcctg tacaaaaccc gaaacccaca ggacttaaga gagaagatag 4260 acaagtggga ccttattaaa ctgacaagct tctgcaaagc caaagaaacc atcaagagag 4320 cagggagaca gcctacagac tgggaaaaag tatttgccaa ctccaggtct gacaaaggct 4380 taacatcctg gatctacaag gaactgaaac gtgctgaaaa gaagaaaaca aacaacccca 4440 ttataaaatg ggcaaaagat atgaacagac acttcacaaa agaagacatc cgagcagcca 4500 acaaacacat gaagaaatgc tcaacctcac taatcatcag ggagatgcaa atcaaaacca 4560 cactgagata ccacctaact ccagtcagaa tggcaattat caacaactca aaaaataaca 4620 gctgctggag agggtgtggc gaaaagggaa cacttctaca ctgttggtgg gagtgtaaac 4680 tagtgcaacc tctgtggaaa gcagtgtggc gattcctaaa agctctaaaa atcgacctcc 4740 catatgaccc cgcaatcccc ctactgggaa tataccctga agaacacaaa tcactctata 4800 aaaaagatac ctgcacacgt atgtttatcg cagcattgtt cacaatagca agaacctgga 4860 aacaaccatg ctgcccatca aaagaggact ggattaaaaa aatgtggtac atatacacga 4920 tggaatacta cgcagccata aaaaagaaca aaatcatgaa tttcgcagca acctggatgg 4980 aactagagtc tataatactg agtgacctct cacagaaaca aagatccgag tatcacatgt 5040 tctcactcat ataatggacc ttgaacatcc aatgcaatac tataagaaaa tgactgacgg 5100 tactgggaaa ctatgggggg gagggagatg ggattaacgg 
tagcaaatat ctgtctgggg 5160 acggggaaac acctcttatc aacagggtgc ctgaatgaaa cgtaattgta tacctaaccc 5220 ttaactgtac cccacaacat cataataaaa aaatactgat taataaaaaa aaaaaaa 5277 // ID LTR17_Mim repbase; DNA; PRI; 350 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV3-type endogenous retrovirus - DE consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; nonautonomous; KW LTR17_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-350 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1722-1722 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 350 BP; 89 A; 112 C; 57 G; 92 T; 0 other; tgttaggctt aaagaattca ccctccccaa acttccatcc aaacctccta taaaacccac 60 ccctctcccc ttcctagggg tgtattaggc acgtagcctt tgccagaccc ctagggagat 120 aaaggaatga aatgcaagag gggaacttac tttccttatc cctcccggtt gtttgaaaag 180 aatttattta ctccccccag aaaaaccctc tataaaaccc aaggctctcc ccttttctct 240 cgtggacgcc tttttcggcc tccacccatc tgcacccaga tgctcaaaat aaacagcatt 300 ttgctacaca tgaggcttcg tgtgtctctc tctcggctcc ggacctaaca 350 // ID LTR71_TS repbase; DNA; PRI; 501 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR71_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-501 RA Jurka J. 
and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1280-1280 (2010). XX DR [1] (Consensus) XX CC >90% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 501 BP; 141 A; 109 C; 113 G; 138 T; 0 other; tgtgagaaat gaactcacct gtccaaaatc aaaggatgga cacagagaca tggagtgcag 60 cagaagcgag actttaatgg cggtcttgca agatcgggtg tctggtgggc aggcacaccg 120 gagctgttac agttagcata tttgtaccta gaacgcagtc cctcccctgt ctcccatagg 180 ctgggcttcc cagaggttac aacctttccc ggacgtcgcc tacatggcac acatcgattg 240 ggtcacgtaa ttattgttca tttgcatgtc ctatttgctg attggtttag agttgctaca 300 aaatcccttc atttacatgt cctatttgct gattggttta caggatttca ttttgattgg 360 ttccctccga aatttgaaca taaaagcccg ccaaagtgaa agcccgtgaa aaagagaaca 420 aagaagcagg aagtgactca gcaaaggcat ttaaacatgg ctcagggttc attttgaaaa 480 taaatcattc tatttcttac a 501 // ID LTR7B_Cja repbase; DNA; PRI; 329 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR7B_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-329 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2920-2920 (2009). XX DR [1] (Consensus) XX CC ~97% identical to consensus. 6bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. 
XX SQ Sequence 329 BP; 56 A; 103 C; 61 G; 109 T; 0 other; tgtggggagc agtagccatc ttgtaactat gttgcttaag cttttgttcc tcttacctaa 60 acctccatct tgtgggcttc ccttattgtt taagtttcac attcctccta gttaggaggc 120 tagcctgcag cctcgtctcc atggttactc tctgacctca gccctcagaa cttcccgccc 180 ttacttgttt ataagcaacc gccttgtagc taataaacga gacttgatca gatccttgac 240 ttgtctccat tctccgcgtc tcctgtctct ctcttttaat ccccactccc tccctagggt 300 ctccgttgct cgtcccgcgg gtcgggaca 329 // ID ERV1-1_TSy-LTR repbase; DNA; PRI; 437 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-1_TSy-LTR; ERV1-1_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-437 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1193-1193 (2010). XX DR [1] (Consensus) XX CC ~92% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 437 BP; 126 A; 119 C; 75 G; 117 T; 0 other; tgtggcaggg aggggctaaa aacagtcaaa ttattccttt ttaaaaatca acctgcaagc 60 tccaaacctg ctagacatgg gctgagaaaa caaatagcca aagactttcc tggaaatgca 120 agatagtaat cgattcacaa ccttaccccc acatcctctc tttgtgaagt tatcagttta 180 cgacccctgc tttcatgtat accctggaga caacttgttc tttgagaatc tggttttgct 240 agacaccact tgcaaattcc aagaaaaatg taagccatac aacctgaccc cctcccttac 300 ctaaactaac agataaaacc cctctcctct ttctcgggtt gctcagcctt ggagcattag 360 cccactgagt gcgccagcct aataaacgct acttcctaag atagccttgg tgtcacggtc 420 tctgttttcc tgcaaca 437 // ID LTR14C_Mim repbase; DNA; PRI; 410 BP. XX AC . XX DT 02-NOV-2009 (Rel. 
14.11, Created) DT 02-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14C_Mim. XX NM LTR14C_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-410 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2975-2975 (2009). XX DR [1] (Consensus) XX CC >95% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 410 BP; 112 A; 97 C; 113 G; 88 T; 0 other; tgtaagatac tgggctataa ggacaatgga caccttccag cttctgttta ctgcttgttt 60 gctaaccgca aggcattatg agtacattat gggatgcaga ggagaaagac aaagaacgcg 120 gaaaccggag tctctcagct aggacccgga acaggttgga gcctatcagg ggcaggatga 180 agtaagaatc actgtggggg cggatgcatg atcagcgtgt aaacagctta ggtataaaag 240 gctcactagc acacaaaggg gggtccctgc ccgaagaaga ggccactgcg ctggcactct 300 gggggctcgg accctagctc gagctagaca ataaaactcc ttttgataat tacagcctcg 360 gtgactctgt ctctctgtcc cgcggccctg cgaatctcga ctatataaca 410 // ID LTR3D_Cja repbase; DNA; PRI; 492 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR3D_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-492 RA Jurka J. 
and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2913-2913 (2009). XX DR [1] (Consensus) XX CC ~89% identical to consensus. 4bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. XX SQ Sequence 492 BP; 141 A; 102 C; 129 G; 120 T; 0 other; tgttgtacca gagcgagcat agaaaatcaa caccaagaca aaagtatgcc aaattgcagg 60 gtctttattg ccggcggcca cggaggactc acgtctctcc aacccgtggc cccgagctca 120 gggggagcag ggtttttaag ctgaaaaacc acatcctggt tttaggacca agagggtgag 180 gggatgcagg gcaagcaact ttacagaagc tatttctggc agttctgata agcaaaggca 240 agcagcttta cacagtgggc ttcaaacatc aatttttctt ttaatcagca ttttcagagc 300 aagctagctg tttttattta cagaagccaa ggtcagcagt tattgcaagg agaacggggc 360 agcaagagtt atcacaaagg aaacggggca gcagttacaa cttatttttg gaaaaacagc 420 ttttcttggt gcttgcaggc agggtttaag atggcgttgc ttaggttcta actctgggcc 480 agctatggca ca 492 // ID LTR20B_Mim repbase; DNA; PRI; 518 BP. XX AC . XX DT 09-NOV-2009 (Rel. 14.11, Created) DT 09-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR20B_Mim. XX NM LTR20B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-518 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2978-2978 (2009). XX DR [1] (Consensus) XX CC ~90% identical to consensus. 4bp tsd. CC Similarity to LTR20_OG from bushbaby. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 518 BP; 134 A; 119 C; 124 G; 141 T; 0 other; tgtaacagag tagtcacccc tcctcccata gatatggaag ggaaatattg acagtgggga 60 atatgtaaca gagggaatgg cctgaaaaaa cggcaaaaat attttctgtc tctttaaaac 120 atcctccact cctttttgag aactaaaacc tgcatccctg cctcaggcca gtggttggga 180 ggggcagggt aagtcttttg ttatttgtgc tacaggagat ggctcagccc agacctggta 240 aagggaggtc ctgggtggag gtcacgggat tgtcttcagc ggagatggga cgatcagaat 300 catcaactgt agttgaacag caattgcctg aagctgagaa cccatccttt aaaagctctg 360 tatttctgct tattattagg acgatggcat ttcagacagg agtctccatc ctcctccttt 420 gccggcaaag taataaactt ctctttcctt ctcctcaaac cacttgttct cgttcttctg 480 atgcggcctc gaggacaagt gccgagcttt cggtaaca 518 // ID LTR1C6_OG repbase; DNA; PRI; 586 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 05-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1C6_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-586 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1673-1673 (2008). XX DR [1] (Consensus) XX CC 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 586 BP; 146 A; 180 C; 119 G; 138 T; 3 other; tgatacaggg ttcccccgct cagggctgga cagggacccc ccattttcta ggcctgggtg 60 agctcagaaa tcagtctcat cacagacccc atctctaggc acaaacaggc cctttgcagg 120 ccgcattctc tggacacaaa gcaaggtcat aaaaaagcag cctgatcagt atctcatctg 180 ggctgctyct cccggagcag ggaacacacc ctactgcctc acctggagca ragactgcaa 240 ggaccaaatt cctttgcact atggcagttt taaaattaac ttcctgcccg caggcccctg 300 gcaccaaatg gttgtgacat attccagaca tacattccat ctccccaaac ccccctgcct 360 acaggatagg aaaagcagta tataaacccc tagacctaga cagaaggccc acatgggctt 420 cccactcggg ttccccctcc actgccggag gttctgttcm ccctcttttg cttatctatc 480 tttctatcaa taaaatctgt cctttcgctt acttgcgtgg tccgtggatt cattctttca 540 aatctctgag accaaggact cggcagagag agaaattccg gctaca 586 // ID LTR12_Cja repbase; DNA; PRI; 506 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR12_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-506 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2924-2924 (2009). XX DR [1] (Consensus) XX CC >87% identical to consensus. 4bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. 
XX SQ Sequence 506 BP; 117 A; 140 C; 134 G; 115 T; 0 other; tgagggactg tgttagagga acttccgggt cggctagggg ctttgtttcc agtgcaggaa 60 ttttcccgcc tctggccggc taggggcttt gttcccagtg caggaatttt cctgcctctg 120 gcccctggag gcggaggctc agtgactcag ggttttaaaa cagtaaccaa tccggtaaag 180 gattcaaaga tcaattaatc cagagaaagt ttgaaaaaaa cccctctcat tggacgagaa 240 caagaatggg gagggaataa ctcaggggta taagctccag ccgcccaagc ctgcagcggc 300 accccctccg ggtccccttc cacggcgtgg aagctgtcct ttcgctctgc ctaataaacc 360 gcactaccgc acactctttg ggtccatgag ttctatttga gctgtaacac tcaccgcgac 420 accgaggccc ccttccccat tcttcggggt tggagaggcc aagaacctga gttccagaga 480 atccggttgc agctaatccg gttaca 506 // ID ERV3-1_CJ-I repbase; DNA; PRI; 7308 BP. XX AC . XX DT 12-JAN-2011 (Rel. 16.02, Created) DT 12-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE Internal portion of an ERV3-type endogenous retrovirus - DE consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR18_Cja; KW ERV3-1_CJ-LTR; ERV3-1_CJ-I. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-7308 RA Jurka J.; RT "Endogenous retroviruses from the common marmoset."; RL Repbase Reports 11(2), 691-691 (2011). XX DR [1] (Consensus) XX CC >94% identical to consensus. ORFs incomplete. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. CC We thank the Marmoset Genome Sequencing Consortium for making CC their data publicly available. 
XX FH Key Location/Qualifiers FT CDS 2211..3611 FT /product="ERV3-1_CJ-I_1p" FT /translation="MGIDGTPSIPYQTPPLMCSYNSTPFTHSFLVIPTCPV FT PLLGRDILSKLKAVITFPTANPHSLFLLALTSDTEPQKALSELLPGVHPQV FT WDINNPSVATHHQPVQVRLKDSSPRFLSRPQFPISKAHRMGIRPIIEKLKG FT QGLLIPINSPCNTPILPVRKPSGQYRLVQDLRLVNEAVIPIHPVVPNPYTI FT LSNIPPGTTCFSVLDLKDAFFTIPLHPNSYFIFAFTWDDPLTNMSQQLAWT FT VLPQGFRDSPHLFGQALATDLLHCNLKPSHLLQYVDDLLICSPTRQECIQA FT TTALLNFLGDRGYRVSPTKAQIASPTVTYLGIQLTPGRRSITIDRLQALKA FT LQAPESAKEILSFLGLVGFFRHWVPNYALLAKPLYHAAAEAPEGPLSSPGS FT VNRAFKSLLNALCTSTSLALPNPNLPFQLYTDEKQHTAVGVLVQPVGKTLL FT PIAYLSRQLDPTERGWQPVLEP" FT CDS 5492..7192 FT /product="ERV3-1_CJ-I_2p" FT /translation="MDPPHAGRLFLLALYLSNLPGPSSHTPYRWRFFLTEN FT YTKTTFICNTPPYSHNHLLATSDCPAAGCQSAIYLNFTKFHNVHEGTGVIP FT VMCFNHDQPSGACSDTPWRSCMGCTWGDCRLHTALDTQQHPDIPARLKIVP FT DVDGEGNARTGSTRYSLLIPDPWDDRWKSPQKAAXYDHRDTTYPSSHLYIW FT RAYVRTVHQVHSXISLQEKSLNDQLQPHSSPFSWLTFLXEGIKLANLSGLT FT DLTSCLMCASLGQTPFVAVPYPIPLNLSASTSPFSPIREVDLYTPPDFAQL FT PVCYSLGRNYPSCNRTILVTTNLTAPRGTFFWCNRTLTKTLHTGQSSTLLC FT LPVTLVPRLTIYSSAEFQMLQAPLHRTRRAAFLPLAVGVSLAASALAAGLG FT GGALAHSHQAIARLTSQFQAAIDDSAESLASLQRQITSVAQVALQNRRALD FT LLTAERGGTCIFLQEECCYYINESGLVETRIESLQKLKTNLQNQKFSAEAT FT AWWSSSMYTLLSPLLGPLVAVCLILLIAPCFLQFLQRRFQELTRVTIHQML FT LQPYPERVLEADLSPNIQAP" XX SQ Sequence 7308 BP; 1737 A; 2583 C; 1393 G; 1591 T; 4 other; agtggtgccg aaacccggga gctggggccg ttggcccggc cacggccccc tcccccgacc 60 tacccaacac tggccctgcc acctccacgc tccgattaag gtaagccaag ttttttcttg 120 ctttaccccc ccttcccttc tcctcctcct acggcacggt gcagttgctc ccctccggcg 180 ttccgcacgg actcctcgcc tagtctcggc cgagccgagg aagtcctccg ttccggctcc 240 aggagccttc cgaggggtag ctgccagacg ggggacgccc ctctgaccgc ttccctaacc 300 gtacttaact gtactccggg gaatttgcaa cgaaagggga cgccttttct ttgcatctac 360 ccaaccgcgc cggagggtct gtagctgcgc ctgccatggg aaattcggag tccaaaatac 420 ctaccagcac cccccttggg tgcttgctgc gaaattttac caaattggga tactcccaaa 480 ccctcagaaa gaaaaagctt gtcttcctgt cccaggtggc ttggccccaa tacaaactca 540 gtaaccagtc acattggccc 
ccgactggca ctttcgattt caacaccctc cgagatctcg 600 acaatttttg tcgccgcacg ggcaaggaca atgaaattcc ttacgttcag gctttttggg 660 acctccgtac ccaccccgac ctatgttctt cctgctccac ctaccaaatc ctcttagctc 720 gtacccctcg taaatcttcc tccactacct ctccccctcc ctactcctca ccggaggatc 780 tgcctgaaca tctgtcctct cctgccaatc tcagcaacca gtcgccatca acagcacgct 840 cttcccccgc tccctctgca cctcctacct cagaggccat gcctcgctct tcctctactc 900 cctctgggcc ccctccctca gaggccgcac ttcctcttgc ctctccggtg gcggcccaca 960 cccgctctca ccaaccaact atcctggccc cactccgaga agtagcaggc gctgaaggtt 1020 tagtcagggt ccatgttcct ttctcactcc aagacctgtc ccaaattgag aaacgtttag 1080 gatccttctc agccgacccg tccacctata ttaaggaatt ccaatacctc actcaagcat 1140 acagcctcac ttggcatgat attcgcgtta tctgtacctc caccctcaca gtagaggagc 1200 acgagcgcat cctagctgca gccagaaacc aagcagataa agatcatgct atagataacc 1260 agatcccctt aggccccgaa gcaatcccag gtgaagaccc ccaatgggac tatcaggcta 1320 atcgagggtc tgatataccc aagagggacc gcatgattag atacctcctc aggggaatgg 1380 atactgtatc taacaaagtt gtaaactatg ataaactaag agaaatcacc caactacccg 1440 atgagaatcc cgccctcttt ctcgcccgct tacaagaagc cctagtccgg tacactcggc 1500 tagatccagc ctcccaaaac ggggccactg tattagcctc ccattttatc tcccaatcag 1560 cacccgacat aagaaagaaa ctaaagaaag cagaagaggg gccagagact cccatccaag 1620 acctggtaaa aatggccttt aaagtattca atgccaggga agatgcggct gaggctgccc 1680 gccaaactcg catgaagcag aaggctgccc tccagaccca agccctagtg gcggccctaa 1740 ggccggcagg gaaccagaag cccaaggccg aaacccatgc ccctccgggg gcatgtttca 1800 aatgtggaaa ggaaggacac tggtcccgag cctgtccaca accacggcca ccaactaagc 1860 cttgccctat ctgtcagctc ccgggacact ggaagtcaga ctgccccagc ttccccaggg 1920 tgccggttcc tcaaaacaga cacactggcg ttacgcagcc ggcagtcacc ctcggtaggg 1980 aggagcctgc ctgccaggag ccgacctccg agggagctcc attccccagt ctactagggc 2040 tgctagacga ctagcggggc cctggctttc cgactccggt taccctcgcc gagcctaggg 2100 tcacacttca ggtagcgggt aaaagtgtct ccttcttgct ggacacgggg gctacctact 2160 ccgcacttcc ttcctattct ggccgtctca ccccttccca ggtctcggtc atgggaatcg 2220 atgggacccc gagtatcccc taccaaaccc 
caccactcat gtgctcatat aactccactc 2280 cattcaccca ctccttttta gtaatcccca cctgcccagt acccctcctg gggcgagaca 2340 tcctttcaaa gctgaaggcc gtcataacct tccccaccgc caacccacac agcctctttc 2400 tcctagccct cacctcagat accgaacccc aaaaagccct gtccgaacta ctccccggag 2460 tccaccctca ggtctgggac attaacaacc cgtctgtagc tacacaccac cagccagttc 2520 aggtccgcct taaagactcc tcgcctcgtt ttctctcccg ccctcaattt cccatctcta 2580 aagcccaccg tatgggtatc cgacccatca tagagaaact aaaaggtcaa ggcctcctaa 2640 tccccattaa ctccccatgt aataccccca tcctcccagt acgaaaaccc tcaggacaat 2700 atagactggt ccaggacttg cgcttagtca atgaagccgt aattcccata catccagttg 2760 tccccaaccc atacaccatt ctctctaaca ttcccccagg gaccacctgt ttctcagtct 2820 tagacctaaa agatgccttc tttactatac ctctacaccc caactcctac tttatctttg 2880 ccttcacttg ggatgatccc cttacgaaca tgtcccaaca actggcctgg acagtccttc 2940 cacagggctt ccgagatagc ccgcacctat tcggccaggc actcgccacc gacctactcc 3000 actgcaacct caagccctct cacctcctcc agtacgtaga tgaccttctt atctgctctc 3060 ccacccgcca ggagtgtatc caggccacca ccgccctact caatttccta ggagataggg 3120 gataccgggt gtcaccaacc aaagcacaaa tagcctcacc caccgtcacc tacctaggta 3180 tacagctcac gcctgggcga cggtccatca caatcgaccg actacaagcc ttaaaggcgc 3240 tccaggcccc tgagtcagcg aaagaaatcc tctccttcct aggactagta gggttttttc 3300 gccactgggt acctaattat gctttgctag ctaaacccct ataccatgct gctgcagaag 3360 cccccgaggg accactctcc tcaccaggtt ctgtcaaccg ggcattcaag tcactgctta 3420 atgccctatg tacctccacc agcctagccc tgcctaatcc caacttaccc ttccagttat 3480 acacggacga gaaacaacac acagctgtag gagtgctagt ccaaccagta ggaaagaccc 3540 ttctgcccat agcctacctg tctagacaac ttgacccaac tgagcgagga tggcagcctg 3600 tcttagagcc ctagctgccg ccgtcaccct aaccacagaa gctcttaaga tcaatctaca 3660 actcccactc caagtctttt ctccacacag gttgaccgaa ctgctcagtc aacactccct 3720 gccccacctc gggccctccc gtatccaact gcttcaccta ctgtttattg agaaccccaa 3780 catcactctc tcccactgct cctcgttaaa cccggctacc ctcctccctt gccctccgta 3840 cgccagcccc ctacattcct gtactgaaat cctcaaatta gcccagtccc ctcgccccga 3900 cttgctccct caacccttac catctgcctc catcacccta 
ttcgttgatg gcagctctat 3960 accccatccc accggccacg gagagcagcc tatgctatag taactcactc tgaaattata 4020 gaaacccgct cccttcctct tgggaccact tcacaacagg ctgaactggt cgccctcacc 4080 cgggccctat cctggtccca gaacaaagaa gtaaatatat atactgactc caagtacgcc 4140 ttcctcatag ctcactcaca ctgcatgatc tggaaagaac gaggcttcct caccactaag 4200 ggcatgcccg tcctcaatgg aaaactcatt gctgccctta tccaggccgt tcaactccct 4260 agcaaggtag ccatcgtcca ctgtaaaggg caccagtccc tgaactcacc cgtagccgcg 4320 gggaatgcct ttgcagacag ggtggctaaa gatacggcaa gaaaccatcc gccacccact 4380 tccactctct gtttcttgtc caagtcatat agcccccaat actccgcctc agaacttcag 4440 atcctccagt ctacccccgg ggtacggttc aaggatgact gggccttcaa gcataacctc 4500 ctcatcctcc ccgaaaagca aaggtaccaa ataatccaag acatccacaa ttcactccac 4560 attggaccta aagccctata ccagttcctc aacccacttt ttcaccccca ccacctcctc 4620 agtaccatac aggaagtcca gtcctcctgt actgtctgcg cgaaaactaa ttcccagggt 4680 tcctgtcatc ctcggcgcca gcttcatcag ctacgcgggt tcctacctgg ccaagactgg 4740 caaatcgatt ttacccatat gccaaagcac aaacagtacc ggtatctgct aaccattgtt 4800 gacacttttt caggctggat tgaagcttac cccacagcct ctgagtctgc cgggacagtg 4860 gccactcatc tcattcaaga catcatccca cgctttggac tcccggccac catacagtca 4920 gacaacggcc cagcctttat ctccaaagtc actaatgccg tttctacctc tctaggaatt 4980 cagtggaagc tacacgccgc ctaccatcct cagtcctctg gtaaggtgga acgggccaat 5040 ggcctaataa aggagcatct gaccaagctc atgcttgagc tccgccaatc ctgggtaacc 5100 ttgcttccta tcgccctcac caggttaaga gcaagccccc ggggcccatc acagcttagc 5160 ccgtttgagc tgctgtatgg tcgccccttc ctactttcaa ctcccccacc gccagaaacc 5220 actcctctag acagctacct cccctacttt actctccttc gcaaccttct ccgagagcac 5280 gccaacgctt ctcttcccca acccacccag ccatctgaaa atacacagaa agtctccccc 5340 ggggactcag ttcttatcag gtccctctcc ccaaagcccc tcgcccccaa gtgggaggga 5400 ccatacaccg tcatcctcac cactccatca gccataagag tcgctgaagt cccatcttgg 5460 gttcacctgt ctcgggtaaa aaaggcccct catggaccct cctcatgctg gaaggctgtt 5520 cctactggcc ctttatctgt caaacttacc agggcctagc agccacaccc cctacagatg 5580 gagattcttc ctaactgaaa actataccaa aaccaccttc atctgtaaca 
cccccccata 5640 cagccacaac catcttcttg caacttccga ctgccccgca gccggatgcc aaagcgccat 5700 atacctgaac ttcaccaaat tccacaacgt ccatgaagga actggggtta tacctgtcat 5760 gtgcttcaac catgaccaac catcaggagc atgcagtgat actccctggc gctcctgcat 5820 gggatgcacc tggggggact gtcgactaca tacagccctc gacacacagc agcacccaga 5880 cattcctgcc cggctcaaaa tcgtcccaga cgtagatgga gaaggaaacg ccagaaccgg 5940 ttctacacgg tattcccttc tcatacctga cccctgggac gaccggtgga aaagccccca 6000 gaaagctgcc stgtatgacc acagagacac cacatatccc tcttcccacc tgtacatctg 6060 gcgggcatac gtacgtacgg tccaccaagt ccactccgmt attagcctac aagaaaaatc 6120 tctaaatgac caattgcaac cacattcgtc tccgttttcc tggctaacct ttctcmaaga 6180 aggaatcaag ttagctaacc tctcaggatt aactgaccta acctcatgcc tcatgtgtgc 6240 ctccttagga cagactccct tcgtcgcagt cccctacccc atccctctca acctttcagc 6300 ctcaacttct cccttctccc ccatcaggga agtagatctc tacactccac ctgattttgc 6360 acagctacca gtgtgctact ccctaggccg gaactacccc agctgtaaca gaaccatact 6420 ggtaaccact aatctgacag ccccamgggg aactttcttc tggtgtaaca ggaccctcac 6480 aaaaactctc cacaccggcc aatcctcaac tctcttatgc ctcccagtga ccctagtccc 6540 ccgactcact atatattcct cagccgaatt ccagatgctg caagcaccgc tccaccgcac 6600 ccggcgagct gccttcctgc ccctagccgt aggcgtctcc cttgcagcct cagcacttgc 6660 agcaggattg ggaggagggg cactagccca ctctcaccag gccatagccc ggctgacctc 6720 acaattccaa gcggctattg atgactctgc tgagtcttta gcatcactcc aacgacagat 6780 cacctcagtt gcccaagttg ccttgcagaa ccgaagagcc ctggacctgc tcaccgctga 6840 acgaggaggg acctgcatct tcctacaaga agagtgctgt tactacatca atgaatccgg 6900 cctagtagaa acccgaatcg agagcctcca gaaactcaaa actaacctgc agaatcagaa 6960 gttctcggct gaagccacag cgtggtggtc ttcctctatg tataccctct tgtccccgtt 7020 gcttgggccc ctagtagccg tttgcctaat cttacttatt gctccttgct ttctacagtt 7080 cctacagcgg cgctttcaag aactgactcg agtcaccatc caccagatgc tgctccaacc 7140 ctaccctgaa cgagtgctag aagcggacct ctccccgaac atccaggctc cttaggaaaa 7200 gagacctctt cagaaccccc cgtcgaccct tgtcagcagg aagtagccag agagacaacc 7260 gacgcccctg accccctctt ttcttctttt cttttaagga gtcgggaa 7308 // ID 
LTR20_OG repbase; DNA; PRI; 504 BP. XX AC . XX DT 19-OCT-2009 (Rel. 14.11, Created) DT 19-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR20_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-504 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2866-2866 (2009). XX DR [1] (Consensus) XX CC ~83% identical to consensus. 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 504 BP; 142 A; 132 C; 100 G; 130 T; 0 other; tgtaacagag gtaaaggtct gaaaaagggc aaaatgtttt acgagttgtc tctttaaaac 60 cccccacctt ttggggaatt aaaacctgca ttcctgcccg aggccagtaa tcaaaggggc 120 agaaaaatgt tctttgcttt tgttttctag tatcttaaac cacaggagat ggctcaacag 180 aattgtccgg acaagggtca cgagatcatc tcaccggaga taaggtcgac caaaaccacc 240 aattatagtt aaacaacaat agcccaaagc cacgacatgt gacactggtc acgtagaccc 300 gagcccaacc ctgaaacccc cctttaaaaa gccctgtatt tctgcttaaa ggcaggatgg 360 cggtctttcg agacgctagt ctgccgccct cctcatttgc cggcaaatta ataaacttct 420 ctttcctttt cctcaaacca cttgtcctcg ttcttcgttc tcactgattc ggcctcgggg 480 acaagtactg aactttcggt aaca 504 // ID LTR4_Cja repbase; DNA; PRI; 913 BP. XX AC . XX DT 03-OCT-2009 (Rel. 14.11, Created) DT 03-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR4_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-913 RA Jurka J.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2915-2915 (2009). 
XX DR [1] (Consensus) XX CC ~92% identical to consensus. 6bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. XX SQ Sequence 913 BP; 229 A; 225 C; 235 G; 224 T; 0 other; tgtgggtcaa gatggccagg gggagaccta tgttaattct agctaaataa ttataagctt 60 gcttggctca gagtacactg aagtttatac accgaactaa actatattat tggtgcagtt 120 aattagaaat caagcggcaa caggaggcct aaggtaaaga ccatccgaga gcgcagtaac 180 accggggaac acctcggggt ggttgcagag cagggaatac tgctctggtc ccacgaatgc 240 cttctcaaca tggagactgc atgagggcct cgtaaagagg gcttatgcaa agatctctga 300 gaagggaggc gacctctgct cacaagatcc gggagggtct ttgcagatgg gagctggcaa 360 ggaggagaaa agccttgcag ggcagagatc acggaacgcc tcaggctctg cctaggttct 420 cggaatatgg agtgatttta attgctctat gtcctcaatg ctcgggagat aaaagcccct 480 tgatgcattg ggtgtggtca gaccgggtgg ctcctcatcc gctatacttc tgatgcagcc 540 gcaccctgat aacaatcggt gataaactca gagtactagt ttagcatgtc ttagaaggat 600 ctcaagtgct gttaaatgct cagagcacta gtttagcatg tcttagaagg atctcaagtg 660 ctctccaccc ggctatctta cctacccctt caccctcccc ttccacgccc ataagatcaa 720 tgagattgtg cacataataa aagtgtttgg agcccagagc tcggggccaa cgcggttccc 780 acgcttgcgt tggtcccctg gacccatgct gtaaattcta ctcttgtgtc tctgtcttta 840 tttccttttc tcaatccctc gtcaccgccg ggactaaggg cacccacgta cggggttggg 900 ctggtaccca aca 913 // ID LTR8B_TS repbase; DNA; PRI; 1308 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR8B_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-1308 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1264-1264 (2010). 
XX DR [1] (Consensus) XX CC >96% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 1308 BP; 502 A; 177 C; 280 G; 349 T; 0 other; tgccggaagc cgcaacccct aagaggcaaa atggaggtta cccgaaaatg ggaaaaacgc 60 tgactgggag cagagcccca gtgggaagcg gccatgtgga ggcagagccc catggatcct 120 attgaaggac tccttttggc cttaaaggaa tgcgtgctta ggcttgaaaa acacttccct 180 tgataaataa aacaattagc tagaatgtgc aaggatgtgg aattagcagt aaacctatgt 240 atagaaatgt aaaaagggct caacagcaaa aatttccccc ctgaagggac aagggaatgc 300 aaattttgca acctttagct aaaaaaggaa aaacaagtac ttcctcatac cctttataag 360 taacaacatt gacagttgtg attgatgagg aagtgagaaa ctgagacaga gtatctccaa 420 aaacaggtgt tttcgggaaa aacaagtcac agacatgcct tttgatttaa gaaatattat 480 agtttaatta gaaagagggt taaaattaaa gtaaggatta gaattaaata aatagcagtt 540 ataattaaaa gataaatgtt tattatagat gtaggtatta gttaaatata aggaatagga 600 aataattaag ttttaaggta aacttaaaat tatacacact taatgtaggt ttaaaattaa 660 atacatttag gattagaagt aagattaatt tcatatataa gtgtagattt agacatgttt 720 tgtttagaaa tataaaatgt aaaataggtg ataagaaaat agcaagtaat atagataagg 780 caattagaat taagaaagtt aaatttttaa aataatagta ttaagaattc gtagtgggta 840 agataagggt ggttatataa aagaaaaaac aagtcaatat attgctgcta aaagtatata 900 taagggtaga attaagaaaa tagattagta aaggaaaaga aatagaatag cgaagggaag 960 agattcagtt ggcctagagg tcacaggatg tggctgataa attggcagct ggtattttgt 1020 tacaccaccg gtagcaggaa ggaacaaaat gaaacttgta aaggtcaact tccccataaa 1080 caaaagaaaa aacatgatgt aaacatttaa tgtgtaacat gggatacatt ttgtgggtag 1140 gggggtcagt aaaaaaacta tataagcttg taagaattga ataaaaattg gcagagtctt 1200 gagctctccc caggaagagg attcctgtcg ttttcgtctt ttctttttcg ccgcttcccg 1260 attcccccgt ctcaagccct gactctctcc tggagctgga ctccagca 1308 // ID BSRd repbase; DNA; PRI; 152 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 
10.08, Last updated, Version 1) XX DE Satellite from primates. XX KW Satellite; Simple Repeat; BSRd. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-152 RA Smit A.F.; RT "BSRd - Satellite from primates."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX SQ Sequence 152 BP; 30 A; 38 C; 46 G; 36 T; 2 other; gctgggtccg catatgagag tcacaatctc acctgtgggc tgggtccagg tatgagagtc 60 acmatctcac ctgtgggctg ggtccgcata tgagagtcac aatctcacct gtgggctggg 120 tccaggtatg agagtcacma tctcacctgt gg 152 // ID ERV1-4C_TSy-LTR repbase; DNA; PRI; 471 BP. XX AC . XX DT 06-APR-2010 (Rel. 15.09, Created) DT 06-APR-2010 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-4C_TSy-LTR. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-471 RA Jurka J.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1252-1252 (2010). XX DR [1] (Consensus) XX CC >88% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 471 BP; 100 A; 134 C; 83 G; 154 T; 0 other; tgaaatatct gtaaaaccta taatagcctg taatagcttg cttgacataa gtaacttcat 60 tttgttaaaa atcttccatc ttagctgcca gctcacccag atgcaacccc cctgcgtgtg 120 accgttgcaa ccttcccgta ctcacaagcc cacatccttg ttccactccc ttgttcccct 180 ggtgtttgaa aactccttaa ctacaagttc cttgttctaa ttgtaccctt gcttaactga 240 aatcagatgt gcgccaagta attgttccct tttgttttta gttccttgtt ccctcacccc 300 cctgctatgt agcccacttg cggtttttgc ctttataagc cttttgcttg ctttattcgg 360 ggtcgagagc atttggaggc gtgagtcccc tctcgaccgc cggctattaa aggactcaaa 420 aattcgactc tcggtgtgtg gcgactcctt tctcgcactc cggctataac a 471 // ID hAT-2N1_TS repbase; DNA; PRI; 674 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.1, Created) DT 14-DEC-2009 (Rel. 15.07, Last updated, Version 3) XX DE hAT-2N1_TS is a family of non-autonomous DNA elements found in DE Tarsius syrichta mobilized by hAT-2_TS. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-2_TS; hAT-2N1_TS. XX NM hAT-2N1_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-674 RA Novick P.A., Smith J., Ray D. and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). 
XX DR [1] (Consensus) XX SQ Sequence 674 BP; 114 A; 148 C; 182 G; 223 T; 7 other; caggggtcct caaactacgg cccgcgggcc acatgcggcc cgccgaggac atttatccgg 60 cccaccgggt gtttttgccd ccgctgcctg tcctgcctag cagccgactb gtccgggccc 120 gcagtgcgca tgtgtggaat gtgcgtccgc actctccgac tcccctcctt ctctctgtct 180 ctcgactcct cctctcagtc tcgggtgtga tcggacgagt cacgagcttg cctgtgcaga 240 gcctgctgct gcctgaggac cgaggtaaga acaagttagg attttttttt ttttttttga 300 agttaggagg tctdtttttt ttttttaatt ttgcagttag tagggccttt tttttgcggt 360 taaggggggc cttttttthc tgaagttagg aggtctattt ttttttttgc agataggggg 420 cgcctttttt tttgaagtta ggagagcbtt tttttttgaa gttaggagag cctttttttt 480 gaagttagga gagccttttt ttttaagttg gttagttggt tgggggtggt ttctaggggg 540 gttgcatcac agtgataacg caaatagtca gygctcagtg ctaatgcaaa tggtttttta 600 aactatagtc cgcccctcca acggtctgag ggacagtgaa ctggccccct gttthaaaag 660 tttgaggacc cctg 674 // ID MacNERVK1 repbase; DNA; PRI; 7088 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE ERV2 Endogenous Retrovirus from Cercopithecidae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW MacNERVK1. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-7088 RA Smit A.F.; RT "MacNERVK1 - ERV2 Endogenous Retrovirus from Cercopithecidae."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC Close to MacNERVK2 until pos. 3185. Apparent recombination CC (after that close to MacERVK1), but which is recombinant product CC yet unclear. 4% subst Coding regions: gag 202-1760, pro CC 1811-26218, pol 2558-5140, env 5154-7085 have multiple CC frameshifts and stop codons, so probably non-autonomous. Perhaps CC there was one once. 
XX SQ Sequence 7088 BP; 2208 A; 1422 C; 1283 G; 2170 T; 5 other; agtggcgcag cgagcagggt cagcgcgggt cgagtagggg agaacaagaa ggggaacccc 60 ctaggggcgg gtaagtgccc ggatattgtt aaggggaacc ttcttaagct ttcattatgg 120 ggaattccac ctctctggcc tcagaatatt tacggctgtt tcaaggactg ttgctgtcta 180 taggagtaga agtgaaagaa aaagactttg aagcgattgt ttgcccatgt tgaacaacat 240 tgttactggt ttcagtatca gactaaagtg cagctgaatc ggagagaatg gttacaggta 300 gtaaaagctc tgcttagagc tcaccagtta ggggatacta tgcctctaaa tttgtggacc 360 ttgtgtaact ctatcactca ggcattagag ttacttgaga ctgattcaga gtgtggtgat 420 gtagccacag gaggaaaggc atctgaggat ttgggaaaaa ataaatctga aatggagccc 480 atttatgcca gggtaacaga ggatgggggg gtagcaaaag ggggagaaga ggaagagcca 540 gctcttcctt cttctaaggg gaattctgac atgcagcgta ttatgggaat gcttcaacag 600 atattacaga tacattcttc taattcacct ttgcgagctt tccctgtttc ttttccttct 660 gccccgttag aagaggattt tccagtgcct ccaacccccc ggcttcctta cctttgtctg 720 gcaaagtttt ctcagtgcct ttcccgccat tgggggaggg agaaaaggaa aagataagaa 780 aatttaagga gctggatctg tttccaatta ttcgtgcttc ttttgctcct aatgctcagt 840 ttccaaatgg tggcaataat gtccaattta atgccttaca anttaaattt ttgaaagaaa 900 tgaaagctgc catttctaac tatggacctc aatctccttt tatccttggt cttcttgatg 960 ctttttcatc agaaaatttg atgattccta tagactggga gactttgggg aaggcttttc 1020 ttgatcgttc tcagtggtta caattgcaca gctggtggat ggacaaggcg catgtacagg 1080 ctagaaaaaa tgctacgtgt gatcccccag gacccactga ggaacaactc acaggcacgg 1140 gccaatatgc tactcttaat gctcaggctg gcctagatga tgtcgccctt actcaaatca 1200 agactttgtt tttaagggcc tgaaataagg tagagatagc aggaaaatct tccctttcct 1260 ttgtgaagat tttacagggc acaaatgaac catatccaga tttcgtagcc cgtcttcagg 1320 atgctgtctt gaaaactgtg ggtgttggcc ctgcttcaaa aattttgtta caaacattgt 1380 cttttaaaaa tgctnatgct gactgtcaaa aattgttgag accgttaaag gccagtgggg 1440 ccaatttaga taaatatatt aaaacttgtg ctggtgttgg aggggctatt tataatgctc 1500 aattatttgc tggagctttg tctaaagcct taaaaggaaa taacaaacaa ggtgtttgct 1560 ttcagtgtgg taaacctgga catttcaaaa aagaatgtcg taaaaaattg aacactttta 1620 tccctcaagg cagaaagctg ccttcagagg 
tgtacaaacg ttgtggtaaa ggtcgacatt 1680 ggacaaatga atgtcgttcc aaaactgata aaagtggaaa tttactgtct cctctcccgg 1740 gaaacgggag ccggggccca caggcttggg gccccaacaa ttacaataca agtctactac 1800 aatacccagt gagcaatggc ctacaacatc aaggaataaa ttcgagtcct ccttaaggat 1860 cccacacctt accccagttt ctcagcttta tgcggcaact aagcagagcg ccgctgctga 1920 cttagctatt actcagcctt ataccttgtc tcctaataga ggagtatata aattagctat 1980 tagagtgtgt ggccctttgc caaaaggaca tgtaggactt ttgttaggtc aaaacaacag 2040 tgctatacgg gggttaatgg tgattccagg aatcattgat cctgatgtta ctaataaaat 2100 tcttattatg gtacaagtct cacaatttat acgcctagag gcaggggagc gtattgcaca 2160 attactttta ttgccttttt ttccgttcct atctagagag gtatctcgac agggagggtt 2220 cggtagcaca ggaaaaactg ttttttggga aactttagtt tctaatcaaa agcccttatg 2280 ttcattacac attaatggaa tagtttttaa agggttaatt gacacggggg tggatgtgtc 2340 tatcatttgt ttaaatcagt ggcctataca atgggaaaaa aagacaagtt tcagttacac 2400 tcactggctt gggtgctgct tctgtggttt atcaaagcgt agaaccttta acctgtgtcg 2460 gtcctaaagg tcaacaaggt caagtgtttt tttatattgt tcctatcaat attaatcttt 2520 ggggacaaga tcttttacaa caatttggtg cttttttaaa cattcctcat atttcctcag 2580 ctgccaaaaa tatgatgttc aaaatgggct acaatccttt aaactcatag ccactgttca 2640 ccagcctcaa gttttataat taaagtggaa aactaccgct cctgtttagg tgaaacagtg 2700 gccattatct aaagaaaaaa cttagggctt taaaagagtn agtcaaaaaa caattggccg 2760 aggggcatat aaagccaagt acttctgcat ggaattcccc tgtctttgtc atgttaaaaa 2820 aaaaaaagtg aaaaatggcg tttattaact aatcttaaaa ctataaatgt ttacattcaa 2880 cctaggggaa gtttacagcc tggtcttcct aatcctgccc ttatcccaca aagttaaaaa 2940 ttaatggtca tagatctcaa aaattgtttt ttctctattc cattacaacc tcaagattgt 3000 aaaaagtttg cctttactat tcctaaatat aataatgggc agcccattca aaaatatcag 3060 tagaagnttc ttccccaggg tatgcttaat agccctactt tatgtcaaaa attcgtacat 3120 aggactttaa atcctgtaaa aaatcagttt ccaactgtgt taatttatca ttatatggat 3180 aatattttat tggctacccc tgacgtaacc ctccaaaatc aagtttttaa ggttttacaa 3240 caacagttaa ttcattataa tttacaaata tctcctaaaa aggtacaaac tcagttcccc 3300 attcaatatt tggggtatct attggctaaa aaacaaatcc 
gcccacaaaa ggttcaaatt 3360 aggagaaatc aacttgtgac ccttaataat tttcaaaaat tgttgaaaaa tattaattga 3420 cttcgtccca tcttaaatat tcctacttat aaattacaac atttgtttgc taccttaaaa 3480 ggaaattcag atttaaacag ttctcaacaa ttatccccta ctacaaaaaa aaangtcttt 3540 tataaaaaat aaaattagta aagctcacct taattatatt atacccgatc ttccactctc 3600 tttatgtctt tttcatactc ctcattcacc aacagccatc ttacatcagg ataataacat 3660 tctagagtgg ctatttttgg ctaataaagc cattaaaaaa atttaccctt atattaataa 3720 attggctaaa ttaattatta aaggacgaca tagagctcgt caattacttg gaactgaccc 3780 tgtaaaaatt attacccaat tatctaatca atacattaaa tctttactaa aaactcataa 3840 aagataacaa gtgacttacg ctaagtatac tggttccttt tctcaacact atcccagtca 3900 tagattattt acttttctac aacaatattc tttcctgccc tataatccta tttcttatac 3960 tccggttaaa ggtcccacct tttttactaa tacttctaac aataaaaaaa ctaaatactg 4020 gacatcaaat aattcaaaaa tttctcacta tccatttaaa tctgtacaac aaaaaaaact 4080 ttttgccatt cttatggtac tacaaaattg gcctcaaacc ccttataata tagttagtaa 4140 ttcccaatat actgtatatg tgactaagta tatttctcaa acttctttac cattactccc 4200 aaaaacccct ttacaaaaac tattttcctt attgttcttg actcttactt cccgtacaac 4260 cccccttttt atcactcata ttcgttcaca ttcagcttta ccgggacctt taacttttgg 4320 taatagccaa attaattctt tacttgttga caatgtccaa cagactcaag ataaacatca 4380 attacatcat actaacagtt taaaattaca atggcgattt tctataacac gccgtcaagc 4440 ccaaactatt gtcaaaacct gtccatcttg tgctcctatt attgcaccct ttttaagtcc 4500 aggtataaat ccccaagaca ctcaacaaaa tcatatttga caaatagata taacttatgt 4560 ttcttctttt ggtcgtttaa aatatgttca tcgtacaatt gatacttggt cacacttcca 4620 atgggctaca ccattaccct ctaaaaaagc taattctgtt attactcatt tacttacttc 4680 ctttacagtt atagaagttc ctactaaatt aaaaactgac aatgctccta cttattactc 4740 tcataaattg gctgccttcc tctcttcata ccacattcgt cattccactg gcattccatt 4800 taacagtcaa ggtcaaacaa ttattaaaaa agctaatact accttaaaat tacaactttt 4860 aaaacaaaca gggggaaatg gacgagattc cccaaaacat caaattctga aagctctttt 4920 tactttaaat tttcttaact attggcgcca attacagcaa tctgcagcaa taaaacattt 4980 tcaacaacca ttagaaaagc cacaagctaa caatttgtgg gtgtactatc 
cttcattaca 5040 aggccaatat ttaaaaggaa aagttttaca gtggggtcga ggttatgctc tcgttcgtac 5100 aggtgccgga gatgagtggt tcccatcccg acggttaaag cagtgccatg gcgaagagga 5160 gccggccgga agacttcctg gcggcatttc aaaaattaaa tctgagtact cagaaaacct 5220 tcacctacga aactcgtcga gcggaacccc cgacatgggg tcagctgaaa aaacttactc 5280 aagaagctga aggggttgtt caacaagctg gacagcccaa aacctcactg accttatttc 5340 tggctatact agctgtagtg agttgccaaa taacagctgg agaattcaca tattggactt 5400 atattccctt tccaccctta tatcaaggtg tggcctaggg agacaaagaa gttttagtat 5460 ttactaatga tactacttgg atgccatcac ctttttaaaa tcaggatcct gagttagaca 5520 cgggtgcagt aaacagttca tatcaatttg gggtggaagg gttacctata tgtcttggga 5580 acagtccaca ttgtttacat ccttctcatg aggcttgggc tgtgcgctaa aatcatagtc 5640 acattgctgc ctttactatg attgttgtta ctcagagttt taaatataat catactgctc 5700 tacttaacga aaccttgcct actacattat ccttatgtcc tattcctgac gtgtccggag 5760 gtgtttccca attaaagtgg actagatgta aaagaagtag accacgatta ttattagagg 5820 taaaaggtag aaattataga taccttgtca ctgactagag tgtgcatgga gattttcaga 5880 ctaagtttag ccactttagc ctacgatggc ataaaaataa tcacaacatt tatgctgatg 5940 gtaacgaaac tattatttgg catgatggtg gcttgtcgcc ccctatgcca catttggcca 6000 acacctcaca aatacaaagt catatatgga agttactagc tgctggcaaa ccaatgtcta 6060 cttttacggg aaacatgtcc ttaaatacca ctaatctttc taatcctttc catatatctt 6120 tacatcgaaa cagatctaga tatgtcattg cttgtgttaa aaaaccttac ttgttgttag 6180 cgggaatctt taaatgggat aataatacgg gagtagttaa ctgtacggac aattgtacat 6240 ttttaagctg tataaatact acttggtggc ataataattg gaatgagact cactccgata 6300 tatatatttt aagagctgaa aaagaaacct ggttacctgt gaacttgaca cgcacctgga 6360 gtgaatctac tggggttact caaatttaca aagtaatgca agatcttgtc caccgcagcc 6420 ggagagcggt tgggatcgtc gtaacagcag tagtgggact tgtggccatt gcttccaccg 6480 ctgctgttgc tggactggcg ttacatcaga gtatccaaaa tgcagagttc gtgcaacaat 6540 agcatgaaca atctcacttg ttgtggcaac aacaacgaga catcgatgct catttgactg 6600 aacgagtaga taacttagaa caagtggttt cttggatagg ggatcagctg acagtgttaa 6660 atactcgagc cttgttaaaa tgtgactgga atactactca attttgtatt acccctgttc 
6720 cttttaacag cactgtacat aattggacag agataaaaag gcttttaatt ggccataaca 6780 atctttccct ggaaatacaa gaactaacac acaatatttc tgaaacgttt cgtaatcagt 6840 taccattgtt gactggcgca gatttaatga ctggtattgc gcaaagcctg aaatctttaa 6900 accctatgag tcctgtaaaa actttactga cttctgtttc tagtaatgta ttgattgttg 6960 tttttgcctt tgtcattttc acagtctact ggaaacggtg ccaaagggca aacaccaaat 7020 cccagcgagc ccaacatgta atgatggttt taaaaaaaat tcaaacctgt aaataaagaa 7080 gggggaga 7088 // ID ERV3-1_CJ-LTR repbase; DNA; PRI; 472 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR18_Cja; KW ERV3-1_CJ-LTR. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-472 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2926-2926 (2009). XX DR [1] (Consensus) XX CC >94% identical to consensus. 5bp tsd. Renamed after CC reconstruction of the internal portion of this retrovirus. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. 
XX SQ Sequence 472 BP; 68 A; 205 C; 95 G; 102 T; 2 other; tgtttggaaa actctgcccc tatcgtgaaa gctctctttc cctcccccac aaagaggctc 60 ccttatctct cactctacat accgctctag ccttcccgcc cggccgcagc cacctcccgc 120 ccggcnnccg gctgaccagt caccgcccgc ctcatcagcg ctccccggct tcccgccgcc 180 cggcttcccg ccgcccggct tcccgccgcc cagcttcccg cccagcggtt caaactgacc 240 aaccgttgcc cgccacctcg gctctctcgg cttccccact ccctcggccc ttcagccccc 300 tataaaagaa ccctgctcct ctgagcggcg cggccatttt cctcttcttc ccggctggcc 360 cgcccggagg ctctttcctc tcaataaagc ctgaacttaa gactttcttc taacctgcgg 420 tccggttttt tcccgaacga gctcagtcct cccgcagagg gtaggtcgga ca 472 // ID MacERVK1_LTR1b repbase; DNA; PRI; 358 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Cercopithecidae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; LTR; KW MacERVK1_LTR1b. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-358 RA Smit A.F.; RT "MacERVK1_LTR1b - ERV2 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 4%. XX SQ Sequence 358 BP; 93 A; 88 C; 92 G; 85 T; 0 other; tgtagaggac tacgtgctcg caaacagggc gttccccata agtcctgctc tcgcaaacga 60 agcagggcgt tcccgacaag tcctgctctc gcaaacgaag cagggcgttc ccgataagtc 120 ctgctcttgc aaacgaagca gggcgttggg ggcctgttta tatgtaaaca tcttgaaaat 180 ccagaaagtc agggaaaggt cagaaaaaca acgatgtgtc ttgtgacttg gcaacattcc 240 acaaacgact gtataaaata aagcggagcg cgccattcga ggcggccgcc atgtttgtct 300 tgtcttgtgt tgtcttgtgt gttcattcct ttgtttagga aacacgcgga ccccaaca 358 // ID pSIVgml-LTR repbase; DNA; PRI; 423 BP. XX AC . XX DT 07-JAN-2011 (Rel. 16.02, Created) DT 07-JAN-2011 (Rel. 16.02, Last updated, Version 2) XX DE Long terminal repeat of pSIVgml - consensus. 
XX KW Endogenous Retrovirus; Transposable Element; lentivirus; KW pSIVgml-LTR. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-423 RA Gifford R.J., Katzourakis A., Tristem M., Pybus O.G., Winters M. RA and Shafer R.W.; RT "A transitional endogenous lentivirus from the genome of a basal RT primate and implications for lentivirus evolution."; RL Proc Natl Acad Sci U S A 105(51), 20362-20367 (2008). XX DR [1] (Consensus) XX CC It is an endogenous lentivirus. XX SQ Sequence 423 BP; 96 A; 104 C; 129 G; 94 T; 0 other; tggaaggaga ttttgggtgc agctgagcac cctagtgact gtgtgggtgc agctgagcac 60 ccaggtgact gtgtaccaac gaggggtgcg aaaagccaac gagaggtgcg aaaggtgctg 120 tgtgagcata caggaaacca caaccgcaga ctgtctttca cacctttaca gcttacacaa 180 ggactttgct ttctatttgg ggaagggggg ctacttcagt acttggggct tggggagggc 240 ttggggagca tatataagcc tgaggttgcc taacctcgag gccccctcac acatctctgg 300 gtccggccat cacccagact ccagagtgtg gatccacaat aaagctgtgc atcttggccc 360 agagccgtgt gggtatgcca agtcttcttg ccctggggaa ggcaatgcaa gttggccctt 420 cca 423 // ID LTR77b_TS repbase; DNA; PRI; 921 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW Endogenous Retrovirus; Transposable Element; LTR77b_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-921 RA Bao W. and Jurka J.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 11(5), 1633-1633 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 921 BP; 218 A; 285 C; 245 G; 173 T; 0 other; tgagagagga gacagaaaac aaataggcag acagggcagg tcccggcaat aagcttcctg 60 caggcccctt gaactagagc aaaaacactc acttcctgga attcaaaaac acaccacaat 120 acctccagct gattggtgct aagtgccaac caatcagctc actactaggc ctctacctct 180 agctgattgg tgcttagtac taaccaatca gctcagctag tgccaaccaa tcagctcact 240 gctaatgcta gccaatcagc tcgatcctag gactatataa ggacctggtt tcggccccag 300 actggcaacc ctctttcggg tcccctcctg ctgcggggag cttttctgtc ttcttaataa 360 actcctgctt tctcaccctc cggttgtccg tgtccctcat tcttcttggg cgcgagacaa 420 gaacccggac ctgaagagct gctgaagatc gtcggagctg aagatcgtcg gagcttgaag 480 actgccggac ttgaagagct gcacacgccg gaggctggct tgccagccac ggagctggca 540 gattggagct gaaactgacg acggagctgc agcaggagct gaagctgacg acggagctgc 600 cggtggagtg gagctgaaag tgaaacccac tacgaagcaa aaccaacgga gcagacggag 660 ctgtaacact ctcgggcctg gcttgccggc ccatcgagcc aacacttctc ggggcctggc 720 ttgccggccc atcgagccga cggagctgca acactctcag ggggctccgc ggctgctggc 780 atccccgagc tctcggggct gtcgcgttcc ccatccggac gccatcactt cccggccacc 840 gcgtggatcc cgggctgaca gtgcagctcc gggaactgca acagcccagc tcgccagccg 900 accagataaa aaaccgtatc a 921 // ID LTR1B2_OG repbase; DNA; PRI; 546 BP. XX AC . XX DT 19-OCT-2009 (Rel. 14.11, Created) DT 19-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1B2_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-546 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2851-2851 (2009). XX DR [1] (Consensus) XX CC ~86% identical to consensus. 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 546 BP; 137 A; 151 C; 148 G; 109 T; 1 other; tgatacagga caagtggcat cccagccagc tcgtggagct ggggggcgcg gaccggctgc 60 ctccgccttg gcggtgttgg tgcggtggca gcaagcagcg gaaccagcgg gctggagggt 120 gggcaggggg gccccatttc ctaggcctgg gaatgaaagg gattgtagcc attcatttgc 180 gacccagaca tgacccagac agatgtccca gacacgctaa atgtaactac cagacgngcc 240 agacgtgccc agacaatgtc ccagacacgc taaacgtaac taccgagccc cgctaaatgt 300 aactacagag ggccccatgc gactatatat acccctagac aaagagccta gacagacaga 360 cagacagaga gagagagaga gcacttttct ctcccgactc gggttttctc tctctgccag 420 gagctctggc gcttctgtat tcttttctgt aaataaatcc tgtcttacca ctgatctgtg 480 gtccgcggat tcattcttcg aatcaccgag accaagaacc tactgaagag cgaaattccg 540 gtatca 546 // ID ERV2N1-Mim_LTR repbase; DNA; PRI; 622 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat of a retrovirus-like element. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV2N1-Mim_LTR. XX NM ERV2N1-Mim_LTR. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-622 RA Jurka J. and Walichiewicz K.; RT "ERV2-like non-autonomous endogenous retrovirus from the mouse RT lemur."; RL Repbase Reports 9(11), 2837-2837 (2009). XX DR [1] (Consensus) XX CC >98% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. 
XX SQ Sequence 622 BP; 147 A; 152 C; 142 G; 181 T; 0 other; tgttgggagc tgtcccgaac acgggagagt taaagtgcgg ttgctgcccg gaacactggg 60 gtccttgaga cacagcacct ggcgccttaa ctcatagctc tgacattctt ggcctttctt 120 ctagctcaca gagggtgctg gagataacag tagcagttat agaaggtcac tgaagtaatc 180 ctgtgcacac agccctcgta gcttatagcc ccccctaggc cctgcacact cctttctcag 240 aattgttaat tacgcttggg tgctctttcc ttgaactgtc tgagtgatca tagttgagta 300 tttgtagctt ttttgatatc agaaaaaagt taagttgtag attagaaaac tgcttaagcg 360 agggaaaggg ctatcaaagt gacgtaaggt taaaagaaac aagaactgtt ttgttttctg 420 gtttaatgaa gacatgcacc acctgttcct ttgcattttg cttctccttt gttcttatct 480 gtataaatac agcaactgaa ataaacgagg tgcggcagtc agcaaggact ccgtcctccc 540 gtccccatct ttttgttgtc tcttcatttc tcagcctcgc cccctaactc caggtgccgc 600 acgtcgccgg ctggctccgg ca 622 // ID LTR2_TS repbase; DNA; PRI; 505 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR2_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-505 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1259-1259 (2010). XX DR [1] (Consensus) XX CC >92% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 505 BP; 114 A; 143 C; 128 G; 120 T; 0 other; tgtcgggagc ggatgtgatg tggggagcag acgagcactg tcactgtagc aggccttgag 60 taaaggcccc aagcgaaggc aaggtctcgc ttaatggccc caagctcgta ggcaaggtct 120 cgcttggaag ctctgcatcc tagccaaagg caggctgtga gggatgcagc ctgctacacg 180 aaacggtgtc tcagatgccc acacatccgg gtgcacatag tacagataaa gccttagcta 240 caacaagata actgcggaac agctcgcctt ttaggtagca acataattcc gttatggctc 300 cctcacagac cttgaatccc ttccaacttc ccttgtgctt gattataaaa taaacatgct 360 gagctagttc ggggccactt ctcctccatc agaggaaagt gtcccacctg gccccagctt 420 ttctttatgt ctttgtgtct gtgtctttct taatctctcg ccgctctctc tcgggaccct 480 caaggcagcc gcgcgggccg cggca 505 // ID LTR7C_OG repbase; DNA; PRI; 396 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR7C_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-396 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1680-1680 (2008). XX DR [1] (Consensus) XX CC 5 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 396 BP; 99 A; 139 C; 71 G; 87 T; 0 other; tgatagggtc aaaccccacc caaatagggt ggacccgcct aaaagaatca accccctacc 60 ctctcccaca gatgtcccac ctaagaaagg tgaatgggtc aaccccctac cctaccccac 120 cggatgtccc acctgggaaa ggtaaatggg ccaaccccca cccttacaac agatgtccaa 180 cccaggaaag ggaatgtccc gacccccacc ttcctgacct ttaaaacccc cgttcgccat 240 tacccacgtg tggatttctt cccggtctag ttggaaatct taccccacct gcacccaggt 300 ggaataaagc cattttaatt gcacaaaccg ggcctccgtt ttcctttcgt ctcgcgcctc 360 ggaataatct ttcatctttg gctccgtaac ctttca 396 // ID MacERV5a repbase; DNA; PRI; 7246 BP. XX AC . XX DT 24-SEP-2007 (Rel. 
13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; MacERV5a. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-7246 RA Smit A.F.; RT "MacERV5a - ERV1 Endogenous Retrovirus from Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 5 bp TSD, but ERV1-class proteins. 4% (LTRs 3%) ORFs: gag CC 345-1948 (one shift yet), pol 1949-5521 (one stop), env CC 5388-7208. XX FH Key Location/Qualifiers FT CDS 5388..7208 FT /product="MacERV5a_3p" FT /note="env." FT /translation="KKHPPSMIPGGRLRLLPLLNYDLRVPVTTLCLTFLSY FT PVLLSLGKTPTQTPPVNPFRWRFYLSETWTQNNHISSHILATVDCRPQGCR FT SQVTFNFSAFNSCPDWWNPVICFLYDQVEYNCLNYWVETNGGCPYHYCNMH FT FTYLDMSYTKWQQPASTVRLVRSYGRGEVPTFFLTIPDPWDPRWASGIEAR FT LYRHGYESYPVARLKIYRAYVRVTSSLVSLASDIKQQEKVISALANPDNTA FT NSDNKPQGSGNPFSWLTLVREGAQVVHMVGVHNISRCFLCAALNKPPLVAV FT PLPSPFNSSNLTPSFPLPGQALGEVPLFQDPLRQQLPFCYSTPNASWCNRT FT GSAPPNLTAPPGGYFWCNSTLSKTLKASDTTLCVPISLVPSLTLYSEAELS FT SLLPLARPRQARAVFLPLMIGVSLAASLVASGLGTGALTHSVRSSQDLSAR FT LQVAIEASAESLASLQRQITSVAQVAAQNRRALDLLTADKGGTCMFLNEEC FT CYYINESGLVETNLLTLEKVREGLHKKTSGLESPLGWWQSSMAGWVLPFLS FT PLLIIGFLLLIAPCILRFIQDRIKEVSRVTFNQMLLHPYTRVPTSEEPHDD FT PYQQEAAR*" FT CDS join(345..1568,1571..1948) FT /product="MacERV5a_1p" FT /note="gag." 
FT /translation="MGTTQSKPNRKSPLGCLLANLQTLSLSQDXKRKRLIF FT FCTVAWPQYKLDNQSQWPAEGTLDFNILTDLTNFCKRLGKWHEIPYVQAFW FT DLRSRPDLCAQCSLAQVLLAKSLPSSKERDDSSSFSEPPDTLSRPPLRSPA FT QPPPYPDPPLSPSSSTPPLPSTPPVPLPNPTDSVSPTSTSSPSSPVSAHTR FT SRTDLLCPLREVAGAEGVVRVHVPFSLADLSKVEERAGSFSANPTRYIKEF FT RYLCQAYDLTWHDLHVVMTSTLSPEEQERILAAARQHANQVHLTDPAMPVG FT TEAVPSAEPDWDYQVGQAGRRRRDMMVQCLLAGMQAASNKSVNFNKLKEVV FT QGADENPAVFLNRLTEALIRYTRLDPASPAGGTVLASYFISQSAPDIQKKI FT FKRRRKALKLPSRIVKLAFKVYNSREEAAEAQRQARLKQKVQLQTQALVAA FT LRPAGSRSSQKGGTTRAPPGACFKCGNDGHWARQCPNPKEPTRPCPNCRQM FT GHWKSDCPDLRTVAAPPRDDPPPGIGGAFQLLDTDED*" FT CDS join(1949..2266,2270..5521) FT /product="MacERV5a_2p" FT /note="pol." FT /translation="RGPDSGTPLTLAEPRVMLQVAGKSISFLMDTGATYSV FT LPSFSGPSHPSTVTVMGIDGTPSTYRQTPPLSCRLDGSLFSHSFLIIPSCP FT VPLLGRDLLSKLGASVHFPNPSPHLAFLFPLLSPDKPRQADPPLPFPVPIN FT PKVRDTSTPIIAQHHTPVRIRLKDPSKFPSRPQFPISLEHRQGLKPIITRL FT LRQHILVPANSPCNTPILPVRKSSGAYRLVQDLRLINEAVVPTFPVVPNPY FT TLLSRVPPDTTHFTVLDLKDAFFTIPLHPDCHFLFAFTWEDPDTHVSSQLT FT WTVLPQGFRDSPHFFGQALARDILLCPLTHSTLLQYVDDLLLCSPSWECSL FT ADTATLLNFLGDRGYRVTPAKAQLCTPXVTYLGILLTPTTKSLTADRMSLI FT KTLQPPQDAEEILSFLGLVGYFRHWIPNFGVLAKPLYQAARETPTGPLSDP FT SLVANSFKKLQDCLLSAPALSLPNPLRPFHLFTEERQKVATGLLAQPVGST FT YQAVAYLSKQLDPTVQGWQPCLRALAAATELTQEALKLTLGHPLTVYSSHR FT LSDLITHKCLSHLTPSRLQQFHLLFIENPHITLTTSPPLNPATLLPTPGYD FT SAPAHSCPEVLTTLPPARLGLSEQPLEHPDRTLFVDGSSVLTPDGRRQAAY FT AVVTATQTVETRPLPLGTTSQKAELTALTRALLLSEGQRVNIYTDSKYAYL FT IAHTHSILWQERGFLTTKGTPVVNGPHIKRLLDALQAPKEVAIIHCKSHQH FT SKDPVSQGNSLADSTARATALTSPPPRAPLLFLSPAYSPDYSSQELQTLMA FT HPSVTRNKDGWVFIDSRIALPQTQAVHILTDIHRSLHIGPKAMYNFLEPIF FT HLPSLQAQIKKVHQQCATCSATNPQGRLRHPGPTHQLRGHQPGEDWQLDFT FT HMPRHKKFRYLLTLVDTFSGWIEAFPTTRETAEVAATILLEHVVPRFGLPR FT TIQSDNGPAFISKLTQQVAAALQITRKLHIPYHPQSSGKVERANGLLKLHL FT TKLTLETRLSWITLLPLALTRLRAAPWAPTGLSPFELLYGRPFLFQELPAL FT SPPLGSYLPYLTLLRELLRKHADRCLPTPASPNPKNPAVVAPGDLVLVKQL FT QPRALSPRWEGPYTVILTTPTAAKLLGLPSWYHLSQLKKAPTQHDSRWTAQ FT AVAPTKLRLTRAGNDPLPNLPQLSSTPEPR*" XX SQ Sequence 7246 BP; 1597 
A; 2540 C; 1501 G; 1606 T; 2 other; tttggtgccg aaacccggga ggagaccccc cccccggacc cctgccgggt cgggcaggct 60 ctcctctccc cgagccagga cctcctttcc gagccgggag ccgtttcacc aaatccaaga 120 ctcaattcga tcactgctgg gtaagttttc ccccgtcctg taggccccct ccgggaagtc 180 cctgtccgaa acgcgcggcc acgtcggggg ctccctccgg tcaactgtga cctcggcacc 240 ggggactagg agacgtccaa cctccccgga agccgccgta ccccgcactc cgcgtggcag 300 taggaggacg ctcccattgc ctccggactg cccgccttag gaccatggga actacccagt 360 ccaaaccaaa ccgaaaatcc cctctggggt gcctcttggc gaatctccag actttaagtc 420 tcagccaaga cntaaaacga aagcggctca ttttcttctg cacagtagca tggccgcagt 480 ataaattgga taatcagtct cagtggcctg cagaaggcac tctagacttt aacatcctca 540 cggatctgac caacttctgc aagagactag gcaaatggca cgagataccc tatgttcaag 600 ccttttggga cttgcgttct cgcccagatc tctgcgccca atgctcgtta gctcaggtcc 660 tactcgcaaa gtctcttccc tcgagcaagg agagggatga ttcctcctct ttttctgagc 720 ctcctgacac cctatcccgg ccacccctcc ggtctcccgc ccaacctcct ccttaccctg 780 acccgccgtt atccccgtcc tcgtcaacgc ctcccctccc ttccacccct cctgtgccct 840 taccaaaccc gacagattct gtctccccta cctccacctc ctccccttcc tcgcctgtct 900 cagcccatac tcggtccagg accgacctcc tgtgtccctt gcgagaggtg gcaggcgccg 960 aaggagtagt ccgggtccat gtgccgtttt cattggcaga tttgtctaag gttgaggagc 1020 gtgcaggctc cttttcggcc aatcctactc gttatatcaa agagttcagg tatctgtgcc 1080 aggcatatga tctcacctgg cacgacctcc atgtcgttat gacctcaacc ctgtctcctg 1140 aggagcagga gcgcatccta gcggcagcca ggcagcacgc taaccaggtt catttaactg 1200 accccgccat gccagtcggc accgaggcag tgccctcggc tgagcccgac tgggactacc 1260 aggtagggca ggcaggtcgc cgccgccgag acatgatggt tcagtgcctt cttgccggca 1320 tgcaggcagc ctctaacaaa tcagtcaact ttaacaaact aaaggaggta gtccaaggcg 1380 cagatgaaaa tccggccgtt tttcttaacc ggctgactga ggcacttatt cggtacaccc 1440 gccttgaccc tgcctccccc gcaggaggaa ccgtcttggc ctcgtatttc atttctcagt 1500 cggcccctga tatccaaaaa aaaattttta aaaggcggag gaaggccctc aaactcccat 1560 ccaggattta gtcaaactgg ccttcaaggt ctacaactcc agggaggaag cagctgaggc 1620 ccaacgacag gccaggctaa aacagaaggt acaactccaa acccaggcct 
tggtagcagc 1680 cctgaggccg gccggctcca ggagctccca gaaaggaggt actacccgag cgccacctgg 1740 tgcctgcttc aagtgcggca acgacggcca ctgggccagg cagtgcccta acccaaagga 1800 gccaactcgc ccctgtccga actgtcggca gatgggccac tggaagtcag actgccccga 1860 cctaaggacg gtcgctgcgc ctccacgtga cgaccctcct ccaggtattg gaggcgcctt 1920 ccagctcctc gacaccgacg aagattgaag aggcccagac tcggggaccc ctctcactct 1980 cgccgagccc agggtcatgc tccaggtagc gggtaagtcc atatccttcc ttatggacac 2040 gggggctacc tactctgttt tgccttcctt cagtggcccc agccacccct ccactgtcac 2100 agtcatggga attgacggca ctccctccac ctaccgccag actcctcctc tgtcttgccg 2160 cctggatggc tccctcttct cgcactcatt tcttatcatt ccttcgtgcc cagtcccctt 2220 gttaggacga gacctcctct ccaagctagg ggcctcagtt cacttctgac ccaacccctc 2280 cccacacctc gcgttcctct tccccctcct ctcacctgac aaaccccgcc aggctgaccc 2340 cccactcccg tttccagtcc ccattaaccc taaggtgcgg gacacctcca ccccgatcat 2400 tgcccagcac cacactccgg tccgcatccg gctaaaagac ccttccaagt tcccctctag 2460 accccagttc cctatctccc ttgaacaccg acagggacta aaacctatca ttacacgcct 2520 gctccgacag cacattctag ttcccgccaa ctcaccatgc aatactccta tcctgcctgt 2580 acggaaaagc tctggggcct atcgcctcgt gcaggaccta cgcctcatca atgaggcagt 2640 agtccctacc tttccagttg ttcctaatcc atatacactc ctctcgcgcg ttccccctga 2700 caccactcat ttcactgtcc ttgacctgaa agatgccttc tttaccatcc ctctacaccc 2760 agactgccac tttctgtttg ccttcacatg ggaggaccca gacactcatg tttcttccca 2820 gctgacctgg actgtcttgc ctcaagggtt ccgagatagc ccccattttt tcggacaggc 2880 actggcacgg gacatcctcc tctgccccct aacccatagc acccttctac aatacgtaga 2940 tgatctatta ctatgtagtc cttcctggga gtgctccctt gcagacactg ctacacttct 3000 aaattttcta ggcgaccgag gttatcgggt taccccggct aaggctcaac tttgcacccc 3060 tnctgtcacc tacctaggca tactactcac acccactacg aaaagcctca cggcagatag 3120 aatgagcctc atcaaaactc tccagcctcc tcaggatgcg gaagagatct tgtccttcct 3180 aggactggta gggtatttca ggcattggat tcccaacttc ggggtcctag ccaagcccct 3240 ctaccaggct gccagggaga cacccaccgg accgctgtcc gacccctcct tggttgccaa 3300 ctctttcaag aagcttcagg actgtctcct ttctgcccct gctctctctc tccccaaccc 
3360 ccttcggccc tttcatctat tcaccgagga gcgccagaag gtagccactg gcctcctagc 3420 ccagccggtt ggatccacat accaggctgt ggcctatctc tccaagcagt tagatcccac 3480 agtccagggc tggcaacctt gtctgcgagc cctggcggct gccacagaac ttacccaaga 3540 ggccctcaag cttaccctag ggcatcctct cacagtctac tcttctcacc gactgtcaga 3600 tctcatcaca cacaagtgtc tcagccatct caccccgtcc cggcttcaac agttccacct 3660 gctattcatt gaaaaccctc acatcactct caccacctca ccccctctaa atcctgctac 3720 cctcttgccc actccaggat acgattccgc ccccgcacat tcctgcccgg aagttctcac 3780 caccttgcca cccgcccgcc tcggtctttc cgaacagccg ttagaacacc cagaccgtac 3840 cctgttcgta gatgggagct ctgtcttaac ccccgatggc cgcagacagg cggcatacgc 3900 tgtcgtaaca gcaacccaaa cggttgaaac caggcccttg cctctaggca ccacctccca 3960 gaaggctgaa ctcactgccc ttactcgtgc cttacttctc tccgaagggc agagggttaa 4020 tatctataca gactctaaat atgcttacct tattgcacat actcactcta ttctctggca 4080 agagcgcggg tttcttacca ccaaagggac gccagtagtc aatggaccac atataaagag 4140 gttgcttgat gctctccagg cccccaaaga ggtagccatc atccactgca aaagccacca 4200 gcattctaag gaccctgtgt cgcaaggtaa cagcctagct gactccaccg cacgggccac 4260 tgccctcact tccccccctc cccgagcgcc tttactcttt ctttcccccg catattcccc 4320 cgactactct tcccaggaac ttcaaaccct gatggcccac cccagtgtta cccggaacaa 4380 ggatgggtgg gtgttcattg acagccgaat agcgctccct cagactcagg cagtacatat 4440 actgaccgac atacaccgct ctctccacat aggacccaag gctatgtata acttcctaga 4500 acccatcttt caccttccct cactgcaggc ccaaattaag aaagtacatc aacaatgtgc 4560 cacctgttcg gccactaacc cccaaggtag gctcagacac ccagggccta ctcatcagct 4620 aagaggccac cagccaggag aagattggca acttgatttt acccacatgc cccgacataa 4680 aaaatttcgc tacctgctga ccttagttga caccttctca ggatggattg aggctttccc 4740 caccactcga gagactgcag aagtcgcggc aactattctc ctagagcacg tcgtccccag 4800 gttcggtctc ccccgaacca tccaatcaga caatggtccg gctttcattt ccaaactcac 4860 ccagcaggtg gcagccgcac tccagattac ccggaagctc cacattccct accatccgca 4920 gtcgtctggt aaggtagaac gcgcaaacgg cctccttaaa ctgcatctaa ccaagctcac 4980 tctagaaacc cgcctctcgt ggataactct ccttcccttg gctcttactc gtctcagggc 5040 
agctccttgg gcccccacag ggctcagccc ttttgagcta ctgtacggac gcccgttcct 5100 cttccaggag ctgccagccc tctcccctcc cttaggctcc tacctccctt acttaaccct 5160 cctacgcgag cttctaagaa aacatgcgga ccggtgtctc cccacacccg cttccccaaa 5220 ccctaaaaat cccgccgttg tagcaccagg agacctggta ctggtcaagc agctgcagcc 5280 ccgagcctta tctccccggt gggaaggacc gtataccgta atccttacca cgcccactgc 5340 tgccaagctc ctcggccttc cctcctggta tcatctgtcc cagctgaaaa aagcacccac 5400 ccagcatgat tccaggtgga cggctcaggc tgttgcccct actaaactac gacttacgcg 5460 tgccggtaac gaccctctgc ctaaccttcc tcagctatcc agtactcctg agcctaggta 5520 agactcctac ccaaactccc cccgtcaacc cattccgctg gagattctat ctgtcagaga 5580 cctggaccca aaacaatcac ataagttccc acatcttagc cacagttgac tgccggcccc 5640 aagggtgccg gagtcaagtt acctttaact tctccgcctt taacagttgc cccgactggt 5700 ggaacccggt catatgcttt ctctatgatc aggtagaata taactgtctc aattactggg 5760 tagaaaccaa cggcgggtgt ccatatcatt attgtaacat gcattttact taccttgaca 5820 tgtcatacac caagtggcag caaccggcat caacagttcg gttagtcaga tcatacggac 5880 gaggagaagt gcctacattc ttccttacca ttcctgaccc gtgggaccct cggtgggcat 5940 caggtataga ggctcgcctt taccggcacg gctacgaatc ttatcccgta gcccgactca 6000 agatttatag agcctacgtc agggtcacaa gcagcctcgt cagcctggcc tccgacatca 6060 aacaacaaga aaaagtcatc tcagctctag ccaacccaga caacacagcc aactcagaca 6120 acaagcctca gggcagcggt aacccttttt cttggctaac cctagtcaga gaaggggccc 6180 aggtagtaca tatggtcgga gtgcacaaca tctcccgttg tttcttgtgt gcagcattaa 6240 ataagccccc actggttgcg gtacctttac ctagcccttt taactcctct aacctaaccc 6300 cctcctttcc ccttcccggc caagccctgg gggaagttcc tttgttccaa gacccgctta 6360 gacaacaact ccccttttgc tactccaccc ctaatgcttc ctggtgcaac cggacaggat 6420 cggcgcctcc aaatctcact gcccccccag gcgggtattt ttggtgcaac tccaccctgt 6480 caaaaactct caaggcttcc gacactaccc tatgcgttcc catctcccta gtccccagcc 6540 tcaccctgta cagtgaagcc gagctatcct ctcttctgcc ccttgcccgc ccccgccagg 6600 caagagcagt attccttccg ctaatgattg gagtctcttt agccgcatcc ctcgtggcct 6660 ctggccttgg aactggagcc ttaactcatt ctgtccgatc ctctcaagac ctttcggccc 6720 gattacaggt 
agcaatcgaa gcttcggcag aatccctggc ttccctccaa cgacagatca 6780 cctcagtggc ccaggtggca gcccagaatc gcagagcact cgacctcctt acggctgaca 6840 aaggcgggac ctgtatgttc cttaatgaag aatgctgcta ctacatcaat gagtcaggac 6900 tagtagaaac caacctcctc actctagaaa aagtccgaga aggactccac aaaaaaacct 6960 caggattaga gtccccactc gggtggtggc agtcgtccat ggccggctgg gtcctaccct 7020 tcctaagccc cttactaatc attggcttct tactgctcat agctccctgt atcctccgct 7080 tcatccagga ccgcataaaa gaagtctccc gggtcacttt caatcagatg ctgctccacc 7140 cctatacccg agttccgacc tccgaagaac cccacgacga cccataccag caggaagcag 7200 ccagatgaac acgtcgcccc tttttcttat tagaaagagg tcggaa 7246 // ID MER9a3 repbase; DNA; PRI; 512 BP. XX AC . XX DT 04-AUG-2008 (Rel. 14.02, Created) DT 04-AUG-2008 (Rel. 14.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Catarrhini. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW MER9a3. XX OS Catarrhini OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini. XX RN [1] RP 1-512 RA Smit A.F.; RT "MER9a3 - ERV2 Endogenous Retrovirus from Catarrhini."; RL Repbase Reports 9(2), 573-573 (2009). XX DR [1] (Consensus) XX CC Always shared with rhesus. XX SQ Sequence 512 BP; 126 A; 137 C; 115 G; 133 T; 1 other; tgttgggaac aggcccccaa aatctggcca taaactggcc ccaaaactgg ccataaacaa 60 aatctctgca gcactgtgac atgttcgtga tggccatgac gcccacgctg gaaggttgtg 120 ggtttaccgg aatgagggca aggaacacct ggcccaccca gggcggaaaa ccgcttaaag 180 gcgttcttaa accacaaaca atagcatgag cgatctgtgc cttaaggaca tgctcctgct 240 gcagataact agccagaccc atccctttat ttcggcccat ccctttgttt cccgtaagga 300 atacttttag ttaatctata atctatagaa acaatgctta tcactggctt gctgtcaata 360 aatatgtggg taaatctctg ttcgaggctc tcagctctga aggctgtgag acccctgatt 420 tcccactcca cacctctata tttctgtgtg tgtgtcttta attcctctag cgccgctggg 480 ttagggtctc cncgaccgag ctggtctcgg ca 512 // ID MER72B_Mim repbase; DNA; PRI; 622 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 
16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; nonautonomous; KW MER72B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-622 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1729-1729 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 622 BP; 154 A; 176 C; 124 G; 168 T; 0 other; tggtgtgaac taaaataaaa tcctaagccc cctgctgact gaacggaccc cctcttggcc 60 aaggggaccc cagagaaacc ttaaaactga gttcccagcc atgacgggat gggaggtcag 120 acatgcctca ttataccccc tcccttgcta acggccatca ggcttttttc cttaagggct 180 aaacagaaac caatctcatg atccagacca gcattctttc ctgataagag accaccaacc 240 atggagtggt tttaaccagt ctacggagca tgcgcagaga gggttttcat gtccttgctt 300 caccttttga catcagaggg ccgccatttt ctgaacatgg accccataaa ggggcataaa 360 gctcgattgc gcatgcgcat gtttctcctt tcataaatat tcatgactcc tcctatagct 420 tgttaaatat gtatatttgg ccatcccgct cagcataaat tcctgttccc attgtccctg 480 cctcgaagca cgtgttcctg gcttctggcc ggggctctgc tttccagcct gtcggaatgg 540 ccacccacag gctgcaaccc tttatgagaa ataaagctct cctttccaaa tttatgaacc 600 tcgtcatttt cttcagttaa ca 622 // ID LTR2B2_OG repbase; DNA; PRI; 382 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR2B2_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. 
XX RN [1] RP 1-382 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1582-1582 (2011). XX DR [1] (Consensus) XX CC ~89% identical to consensus. 6 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 382 BP; 77 A; 118 C; 71 G; 116 T; 0 other; tgtggggaat gggctgtgag gcggcagggc ccccacagcc tggggaggcg ttcagggaac 60 gacccccccc ccagctgctt ctcctccttt aacaacaaac attcttttca cctgccagaa 120 gtcacgtagc ccatgcatac tttccttatt attggcaagc acacagcttt tcctccttaa 180 agttaatctt taccccaggt gttaagattt accttgttaa agatacttta taaataaagc 240 tgtttccaca gtttggtgct ggctgcagtc atcagctgca tcagacctcc cgaccccatc 300 ctatcttttg tctcttgtct ttctttctca tttcccacca ttcccactca ggttatttct 360 ccttccccgc gctggttcgc ca 382 // ID HERV-Fc2 repbase; DNA; PRI; 7250 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE ERV1 Endogenous Retrovirus from Hominidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; HERV-Fc2. XX OS Hominidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-7250 RA Smit A.F.; RT "HERV-Fc2 - ERV1 Endogenous Retrovirus from Hominidae."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC Only 1 copy in hg18 and panTro2. Bénit, Calteau & Heidmann CC (Virology 312(2003) 159-168) suggest hominidae specific. LTRs CC are 5% diverged (even in this consensus for the copies of human, CC chimp & gorilla), suggesting an older age. The single locus is CC precisely empty in rhesus monkey (rheMac2 chrX:96,523,191). ORFs CC are not intact but span: gag 366-1943, pol 1779-5526, env CC 5396-7213. 
XX SQ Sequence 7250 BP; 1564 A; 2498 C; 1516 G; 1672 T; 0 other; ttggtgccaa aacccgggag gagacccctc tctgacccag ggtcggggag catctcctct 60 ccctacctgc caggaaccag actcgggcca gcgcattcgg ccttcgctat tgggtaagtc 120 tcccctccgt cctgtaggcc ccgggaacct ctgtccgtaa tcgcggccac gtcagagtct 180 tccccccatc ggtttccgac tacgggaccg aggacgcgga gacgtccgtc ctcctcggcc 240 tccgccatcc gcgcttcaag gaaggttggg ggacgcccct ccctgacctt gaatctcccg 300 cctcaggaca atgggaggtg cccaatccaa aacctccctg accttgaatc gcccgcctca 360 ggacaatggg aggtgcccaa tccaaaattg atcctaagac acccctgggg tgtctcctag 420 ccaactttga agctctaggc ctcagtatgg accttaagcg gaagcgactc attttctttt 480 gctcggtcgc ttggccgcaa tacaaattgg acaaccaatc tcggtggccg cccgaaggaa 540 ctttcgattt ccaaatttta caggacctag acaacctttg cagaagacaa ggcaaatggt 600 cagaggtccc ttatgtacaa gccttttggg acctacgctc tcgtcctgac ctatgtgcca 660 agtgttccct tggacaggtg ttactggcta aggcatcccc ctctaacaaa gaacctgatt 720 cctcccctct ctccgagcct cctgaagccc tcgctttacc accattgcca gcggcgctcc 780 ctcctcccta tccaggatcc tctggcccca ccccaacggc tcctccgcta cctcctacac 840 caccttcctc tcccgctaac cctcccgctt ctgctctgcc accgccttcc cctgtatctg 900 cgcacactcg gtcgaagacg gacctcttgt gtccgctccg tgaagttgcc ggtgcggaag 960 gcgtggtcag ggtccatgtt cccttttctc ttactgactt atctaaaata gagaagcggc 1020 ctgggtcctt ctctgccaac ccaaccctgt atatcaaaca atttaggtac ctatgccagg 1080 cttatgacct cacctggcgt gacctacata ttatcctaac atccactctg tccccagagg 1140 agagggagcg agtccaggcg gtggctaggc aacatgccga ccaaattcat ttaactgacc 1200 ccgccatgcc tgtcggaacc ctagcagtac cggcagccga gccggactgg gattaccaag 1260 ctggtcagac tggccgtcga cgccgtgacc aaatggttca gtgccttctg gcaggcatgc 1320 aggcggcttc caataagacg gtcaactttg acaaattacg ggagattatt caagggtctg 1380 acgagaaccc agcagttttc cttaaccgcc ttactgaggc cctcatccag tatacccgcc 1440 ttgatcccac ctccccggca ggggccactg tcttggctac tcatttcatt tcccaatcag 1500 cgggagatat tcggaaaaaa ctaaaaaagg cggaggaagg ccctcaaacc ccaatacagg 1560 acctagttaa aatggccttc agggtctata attccaggga ggagacggct gaggcccaaa 1620 gacaggcaag gctaaagcag aaggtacagc 
tccagaccca ggccttggta gctgccccgc 1680 ggctggccgg ctccgggagc caaccgaaag ggggttccgg ccaccgagcg ccacctggtg 1740 cctgcttcaa gtgtgggaac gaaggccact gggcctgaca atgcccgtac ccgaaggaac 1800 cgacccgacc atgccctaac tgccaccaga tgggacattg gaagtctgag tgccccagcg 1860 tcggagcgtc cacagtgcct ctacgctgtg aaaactccga gacgaccggt ggcgccttcc 1920 aattactcag cgtggacgac gaccgaagag gcccagactc gggaaccccc ctcactcttg 1980 ccgagcccag ggtaacgctc caggtagcgg gtaagtccat atcttttctc gtgcatatgg 2040 gggctaccta ttctgttttg ccttccttcg gcgggcccag tttcccgtcc ccggtcacgg 2100 tagtggggat tgacggtacc ccttccaccc atcgtcagac ccccccattg tcttgccggc 2160 tggacgacac tctcatctcc cattccttcc tcattatccc ttcctgtccc gtcccgctct 2220 tcggaaggga cttgctgtct aagttagggg cctccattcg gttgcgcccc agcctcccct 2280 ccagtgcaat ctctttgctt cctctgctgg cacttagcga tgacactcct tcgccgatcc 2340 catcgctccc tgtgcccgtt gatccaatag tatgggacat ctcaaccccc tccatcgccc 2400 gacaccatgc cccaataatg atcaaactca aggaccctac caaatttccc tcgcggccac 2460 aattccccat ctcagttgaa caccgccaag ggttaaaacc tatcatcacc aggctcttgc 2520 aacaacacat ccttatcccg gtaaactccc gttgcaacac gcccattctg cccatccgta 2580 aggcctctgg cgcgtaccgt ttagtgcaag atcttcgcat catcaacgag gctgtcgtcc 2640 ccatttttcc tgtagtgcct aacccgtaca ctctcctatc ccgcattcct ccgaccacca 2700 cccatttcac ggtccttgac ctcaaagatg atttcttcac tatccccctc caccctgact 2760 gttacttcct gttcgctttc acctgggaag accctgacac ccatgtctcc tcgcaacttg 2820 cctggaccgt tctcccgcaa ggcttccgag acagccctca cctctttgga caggctctag 2880 ctaaagacct cagtacatgc actttggccg acagcaccct tctcctgtat gttgatgacc 2940 ttctcctttg cagtccttcc ctgtctgtct cgcagcaaga tacagccaca atccttaatt 3000 tcttaggaaa acaagggtat cgagttaccc ctcacaaagt tcagctctgc accccgacag 3060 tcacatacct aggcatttct ctcaccgcca ccaccaaaag cctcaccaca gaccgagtta 3120 gcctcattaa agacctccaa cttccccagg acgcagataa gatcctctcc ttcgtagggc 3180 tagtagggtt cttccggcac tggatcccaa acttcggggt cttagctaag cccctgtacc 3240 aggcggcgaa agaaacaccc accggccctc tgtctgatcc cgccctagtg gcccgccatt 3300 tccgccggct gcagcagtgc ttactcacag ctccagtttt 
atccctgccg aaccccctgc 3360 ggccttttca cctctacaca gatgaactgc agggagttgc taccggccta ctagggcaac 3420 cggtaggacc cacctatcag gtggtggctt acctttccag gcagcttgat cccagcactc 3480 ggggctggca gccctgcctg cgggccttag cagcggcggc agagcttacc aaagaggccc 3540 tcaaacttac tctcaatcac ccactcacag tatactcccc ccaccgcttg acagatgtac 3600 tctctcacaa atgtctggcc catctggcgc cctccagaat acagctgttt catgtgctct 3660 ttgtcgaaaa cccagatatc accctgaccg cctcaccacc tcttaaccct gctacacttc 3720 ttcccataga agcctctgag ccctctcctg tcctgtcgca ttcttgtccc gaactcctta 3780 cctctatccc caactcccga cttggcctct tcgatcgacc gctttctaat cctgacagca 3840 ctctgtttgt cgatggcagc tcagtcctca ccccttgcgg taggcgacag gcagcttacg 3900 ccgtagtcac ccacgacaaa acagtggagg cggcagccct accccttggg accacttcgc 3960 agaaggctga actccttgct cttaccaggg ctctactcct ctctcaggga cagcgggtca 4020 acatttacac tgactccaag tatgcgtatt ctcattgcac gcacgcattc tgttctctgg 4080 caggagcgag gtttccttac tatgaaaggg acttcaatcg tcaacgggcc tcttatccat 4140 aaacccttaa atgccttaca ggcgccccga gaggtggcga tcatacactg caaaagtcac 4200 cagcactcaa aagaccctgt tgctcaagga aataatctag ccgactctac tgctaagtct 4260 cttgctctta cttctgcccc tgccccagct cccgcaatgt tcccgtccgg ttcacgcacc 4320 cctgcctatt ctccacagga gaccttccac ctcatttcca acttaaaagg aatgaccgac 4380 caagacggtt gaatctgggt cgataaccgg attgccctcc ccgaatccca ggctcaggct 4440 attattaccg atgtgcacaa gaccctactc ataggcccaa aactcttata tcagttctta 4500 gaaccaattt ttctatgccc cgccctacag tccctcattc accaggtaca ccaagcctgt 4560 gctgtctgtt caacagtcaa cacacaagga ggacttaggc gcccagggcc ccatcaccag 4620 ctccgcgggc atcagccagg agaggactgg cagctagatt tcacccacat gccgcggcac 4680 aagcattacc gctaccttct tactcttgta gataccttca caggctggat tgaggccttt 4740 cccactgcgc gtgagacagg agaagtcgca gtctctgtcc tgctagaaca tatcatccct 4800 cgctttggac ttccccgatc cctgcaatca gacaacggcc ccgcgttcgt ctcaaaaatc 4860 actcagcaag tatccgagtc gctccgcgtc acgtggaagc tccatatccc ttaccgccct 4920 caatcctctg gtaaagtaga aagggctaac agcctcctca aagaacacct tacaaaactt 4980 actcttgaaa caaagctgtc gtgggtcacc ctcctaccat cggccctgac 
ccgcctccgg 5040 gcagctccca ggggccccac agggctcagc cccttcgaac ttctctacgg acgccccttc 5100 ctacttcctg gtcttccccc cactgtttcg ccccctcccc tcgcgtccta tcttccttat 5160 ctgaccctcc ttcgcgacct tctccgcaag cacgcggacg cctgcctccc cgaacctacc 5220 ccctcctccc cggacgcccc tgttgtgctc tctccaggtg atagtgtcct ccttaaggaa 5280 ctacagccca agaccttgac cccgcggtgg tcaggccctt acaccgtgat cctcaccact 5340 ccgacggccg ctaagttact aggtctacca tcctggtatc atttgtcaca gttgaagaaa 5400 gcaccgactc agcacgactg gtcctcaaaa ctcaccccaa cccggcttcg tatcacccat 5460 ggccagacct tccccactat gcctcctact cctcctgacc ctcctacccc ccatagtgcc 5520 cagtaactcc ctcctaactg aacccccgtt ccgatggagg ttctacctgc atgagacttg 5580 gacccaaggc aaccggctct ccactgtcac actggcaacg gtggactgcc aacctcacgg 5640 ttgtcaggcc caagtaactt ttaacttcac ttcctttaaa agtgttctgc ggggctggtc 5700 caatcccacc atctgctttg tctatgatca aacacacagc aactgccgcg actattgggc 5760 ggacacaaac ggaggatgcc cctatgccta ttgtcgtatg catgtgaccc agctcgatac 5820 cgccaagaaa ctccaacaca cctatcgcct gacatctgat ggaaggacaa cttacttcct 5880 gaccatccca gacccatggg attctcggtg ggtcagtgga gtcactggtc gactgtaccg 5940 gtggcccacc gactcctacc cggtcggcaa actccggata ttcctgactt atatacgagt 6000 tatcccccag gttttgtcta atttacagga ccaagcagac aacattaagc atcaggaaga 6060 ggtcatcaat acttcggtgc agtcccatcc gaaggctgac atggtcacct acgatgacaa 6120 ggctgaggca ggaccgtttt cgtggataac cctagtccgc cacggggctc gccttgttaa 6180 tatggcaggc ctagttaatc tctcccactg tttcctttgc accgccctcg gccaaccacc 6240 actagtagct gtacccctac cccaggcttt taacacctct ggtaaccaca ctgcccaccc 6300 ttccggcgtc ttctctgagc aggtccctct tttccgagac cccctccagc cccagttccc 6360 cttctgctac accactccta actcatcctg gtgcaaccag acctattctg gctccctatc 6420 taacctctct gcaccggcag gtggctactt ctggtgtaac ttcaccctta caaaacatct 6480 taatatttcc tctaacaata ccctttctag aaacttatgc ctccccatct ctctggtgcc 6540 tcgactcact ctgtacagcg aggctgaact ctcttccctt gtcaacccgc ctatgcgtca 6600 gaagcgggcc gttttcccac cgctggtaat aggtgtctcc ttgacctcct cacttgttgc 6660 ctccgggctg ggcacaggtg ctattgtaca tttcataagc tcttcccaag atctctctat 
6720 taagctccag atggccatcg aagcctcagc cgaatcctta gcctctctac agagacagat 6780 tacgtctgtg gccaaggtgg ccatgcagaa ccggagagcc ctagatctcc tcacagccga 6840 caaaggcgga acctgcatgt ttctcgggga agagtgctgt tattacatca atgaatcagg 6900 cttagtagaa accagcctcc tcacccttga taaaatccgg gacggtctcc atcgaccctc 6960 ctcaactccc gactatggag gagggtggcg gcaatcccct ttaaccactt ggattatccc 7020 tttcataagc cccatcctaa tcatttgcct tttacttctc atagccccct gtgtcctcaa 7080 gttcatcaaa aaccgcatca gcgaagtctc ccgggtgacg gtcaaccaaa tgttactaca 7140 cccttactcc cgtcttccga cctccgaaga ccactatgac gacgccctca ctcagcagga 7200 agcagccaga tgattacgtc gccccttttt cttacagtat gaggtcggaa 7250 // ID HAL1-1_Cja repbase; DNA; PRI; 2443 BP. XX AC . XX DT 13-JUL-2010 (Rel. 15.07, Created) DT 13-JUL-2010 (Rel. 15.07, Last updated, Version 1) XX DE HAL1 non-LTR retrotransposon: consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW HAL1; HAL1-1_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-2443 RA Bao W. and Jurka J.; RT "HAL1 non-LTR retrotransposons from marmoset."; RL Direct Submission to Repbase Update (13-JUL-2010). XX RN [2] RP 1-2443 RA Bao W. and Jurka J.; RT "Origin and evolution of LINE-1 derived "half-L1" RT retrotransposons (HAL1)."; RL Gene 465(1-2), 9-16 (2010)doi:10.1016/j.gene.2010.06.005. 
XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 482..1507 FT /product="HAL1-1_Cja_1p" FT /translation="MMGRNQRKKAENTQNQNASPSTGDHSSSSTREQGLME FT NECVPLTELGFRRWIIRNFCELKEHVLAQCKETKNLEKRFDEILTRIDNLE FT RNISELMELKNTIRELREVCTGFNSRIDQAEERISEVEDQLNEMKREDKIR FT EERVKRNEQSLQEIWDYVKRPNLRLIGVPECHEENESKLENILQDIIQENF FT PNLVRQDNTQLQVIQRTPQRYSSSRATPRHIIVRFTRVEIKEKILRAAREK FT GQVTHKGKPIRLTADLSAETLQTRREWGSIFNILKEKNFQPRISYPAKLSF FT INEGKIKFFTNKQALRDFITTRPALQELLKEALYTERNNQYQSSQKLTKR" XX SQ Sequence 2443 BP; 892 A; 494 C; 499 G; 558 T; 0 other; gagagagatc caagatggct gattactagc agctcaggat tgtagctccc agtgaaagcg 60 cagagaacga gaggacgcca cactttcaga cgaatttttg ttgctcacgg accaggagat 120 tcccagcgga ggagccccac gggtcgccag cgcgactctt gtggccggcg cggcggtttt 180 gccggcgccc cggcgcggcg gttcttggtg cagagtaaac gggactgggt ccccttttgg 240 tcgacgtttg gagctccggg aaggcagagt cgcctattca gctgattgaa aaagggactc 300 aagaaggaag ccagaccgga gattcccggg cagaaaagca ccatgaatct taacgccgct 360 gttttagccg gcgcagtggg ttgctcagat tccggcgctg ggaatcaaca agttggacgt 420 ccactcagag acctaatttg aaagtcggta attacaaaga cgacaggtgg ataaatttac 480 aatgatggga agaaaccagc gtaaaaaggc tgagaatacc caaaatcaga atgcctctcc 540 ctctacaggg gatcacagtt cctcatcaac aagggaacaa ggcctgatgg agaacgagtg 600 cgttccatta acagaattag gcttcagaag gtggataata agaaacttct gtgagttaaa 660 agaacatgtt ctagcccaat gtaaagaaac taagaacctt gaaaaaaggt ttgacgaaat 720 cctaacgaga atagacaatt tagagaggaa tataagtgaa ttaatggaac tgaagaatac 780 aatacgagaa ctccgcgaag tatgcacagg ttttaacagt cgaattgatc aagcagaaga 840 aaggatatca gaggtcgaag accaacttaa tgaaatgaaa cgagaagaca agattagaga 900 agaaagagta aaaaggaatg agcaaagtct ccaagaaata tgggactatg tgaaaagacc 960 taatttacgt ttgataggtg tacctgaatg tcatgaagag aatgaatcca agctggaaaa 1020 tattcttcag gatattattc aggaaaactt tcctaaccta gtaaggcagg acaatactca 1080 actccaggta atacagagaa caccacaaag atattcctca agtagagcaa ccccaagaca 1140 cataatcgtt agattcacca gggttgaaat aaaggagaaa attctaaggg cagctagaga 1200 gaaaggtcag gttacccata aagggaagcc tatcagactc 
acagcagatc tctcagcaga 1260 aaccctacaa actagaagag agtgggggtc aatattcaat atcctcaaag aaaagaattt 1320 tcaacccaga atctcatatc cagccaaact aagcttcata aatgaaggaa aaataaaatt 1380 tttcacgaac aagcaagcac tcagagattt catcaccacc aggcctgctt tacaagagct 1440 tctgaaagaa gcactataca cagaaaggaa caaccagtat cagtcatccc aaaaacttac 1500 caaaaggtaa agagtatctc cataatgaag aatctatatc aactaatggg caaaatagcc 1560 agctagcatt aaatgacaat attaaactca caaatatcaa tattaatcct aaatttaaat 1620 ggattaaatg ccccaatcaa agacacagac aggcaaattg aataaaaagt caaaacccat 1680 cggtacgctg tatccagatc catctcataa gcaaggacac acaaagactc aaaacaaagg 1740 gttggtggaa gacttaccaa tcaaatggag agcaaaaaaa aaaaaaacag gagttgtaac 1800 tttcgtctct gacaaaatag actttaatag taacaaagac taaaagaaac aaagaaggac 1860 attacataat ggtaaaagga tcaatgcaac aacaggagtc aatgatcata aatatatatg 1920 cacccaatat aggaacaccc agatacataa gacaagttct taatgactta taaagagact 1980 tggactcccg cacaataata gcgggagact ttgacaccac attgttaata ttcgacggat 2040 caacgagata gaaaattaac agggacatct atgatttgaa ctcagacctg gaacaagtaa 2100 actcagtgaa tatttataga attcttcacc ccaggtccac agaacataca tttttctcag 2160 tatcacatta cacctatttt gaaagtgacc acataattgg aagtaaatca ctcttcagca 2220 tttgcatagc atttcacaga cttaaatgaa atattggttg gctgtttgtt tgttattttc 2280 ttcccttatt ccttcctgtc tccgctgtta agcacaatta tcggcataaa agaggttttt 2340 aatacctatt tttagattgc attccagcct gggcaataga gcaagactcc tgttctctct 2400 ccccctttct ctttctcttt ctttctttct ttcaaaaaaa aaa 2443 // ID LTR27D_TS repbase; DNA; PRI; 545 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW Endogenous Retrovirus; Transposable Element; LTR27D_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-545 RA Bao W. 
and Jurka J.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 11(5), 1631-1631 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 545 BP; 141 A; 167 C; 112 G; 125 T; 0 other; tgagagagga ggaaggaaga aactggtcag gcagacagtt agggtgggtc cccggtaaaa 60 cttcctcaaa ccaagaatgg tctggcgcct gaggaaccaa gctgcaagtt ccagatggag 120 tccacggacc agagtgagaa ctcccatccc cttttggcgt gctttctccc gattggccct 180 catcctatcc tctcccgatt ggtcctttac actatcgtgc ctctttctga ttggtgcatt 240 ttcaagccca cccgccaacc aatcagcatg cacttcccca ttttgagccc ataaaaaccc 300 ctgaactcag ccccatcgct ggcaacccac tttcgggtcc cctctcgctg ccaagagctt 360 tcctgttgct taataaactc tgacttactc actcttcggt gtccacgctc cttattcttc 420 ttggtcgtga gacaagaacc cggaactcgc cgaactaaaa gagctgtact cgccgaacta 480 aaagagctgt actcaccaaa ctaaaagagc cgtaacagaa ctcaccgaac taaaaagccg 540 taaca 545 // ID MER6A repbase; DNA; PRI; 605 BP. XX AC . XX DT 16-MAR-2006 (Rel. 11.03, Created) DT 10-APR-2007 (Rel. 11.03, Last updated, Version 1) XX DE Mariner DNA transposon from placental mammals. XX KW Mariner/Tc1; DNA transposon; Transposable Element; MER6; MER6A; KW mariner. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-605 RA Smit A.F.; RT "MER6A - a subfamily of Mariner DNA transposon from placental RT mammals."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC 17-18% divergence from the consensus. 
XX SQ Sequence 605 BP; 129 A; 125 C; 148 G; 202 T; 1 other; cagcaggtcc tcgaataacg tcatttcgtt caacgtcgtt tcgttataac gttgatgaga 60 aaaaaaatcg attcccggcc ggggccactg tctgtgtgga gtttgcacgt tctccccatg 120 tctgcgtggg ttttctccgg gtactccggt ttcctcccac atcccaaaga tgtgcacgtt 180 aggtkaattg gcgtgtctac atggtcccag tctgagtgag tgtgggtgtg tgtgtgagtg 240 cgccctgcga tgggatggcg tcctgtccag ggttggttcc cgccttgtgc cctgagctgc 300 cgggataggc tccggccacc cgcgaccctg aactggaata attgggtaaa taattatctt 360 acttgttttt attaatcttt cttaaatgta tgtatagctc acatttattt caatgtttaa 420 tattagaagt gttttggtct ttatttagaa gtttggtgat gtttttgtga ccagaaatat 480 gccgtaggaa cttaactctt gtttatatca attagcctat ggtaaaattg gtttcgttat 540 acgtcgtttc gcttaaagtc gcagtttcca agaacctatc gacgacgtta agtgaggact 600 tactg 605 // ID ERV2-2_TSy-I repbase; DNA; PRI; 7264 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Internal portion of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV2-2_TSy-LTR; ERV2-2_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-7264 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1204-1204 (2010). XX DR [1] (Consensus) XX CC The ORF could not be fully reconstructed. This sequence was CC derived from sequence data generated at the Washington University CC School of Medicine Genome Sequencing Center, and assembled at the CC Broad Institute. 
XX FH Key Location/Qualifiers FT CDS 2591..3082 FT /product="ERV2-2_TSy-I_1p" FT /translation="MEQMGXLLYGPNGKVSSAQMIQMGFHPDKGLGKDLQG FT IKEPVQVKARTIWAGTRLFKFIMRATVATADPITWRDDIPVWVEQWPMTTE FT KLSAVQEIVEQQLEAGHLEPSNSPWNSPVFAIKKKSGKWRMLHDLREINDT FT MEAMGALRPGLPSPXLFLMNITLLL" FT CDS 6477..7262 FT /product="ERV2-2_TSy-I_2p" FT /translation="MLNIKVFLIXKQPPYLMVPVQLSTHWYDDGLQTLYEV FT NSLLEXQKRFLATLILGITALIAIATSVTVSAVALTKEVHSATFADQLSRN FT VSIALATQEIIDQKLESKVNALEEAVXAVGQERVTLKIKLXLRCHSEFKWI FT CLTPLEVNKSFYTWDKIKNHILEIWNHSDPSIDLIKLHKQIQDMTLAEKQS FT RSTELATSIFNSLSSFVNKKSLMSIFINVSICRAVILFLLCLLPIIFTVLK FT KSISQLNTEFHGFVLKNKKRG" XX SQ Sequence 7264 BP; 2176 A; 1465 C; 1450 G; 2119 T; 54 other; ctggcgccca acgtgggagc cctggaaggt aagctttttt ttcttttcag ggcggctaga 60 atcttaccca gggtgctgca gacccctctg tagagacagg cagagcgagg acgggtaagc 120 awagtttttg tgaagccaga gccatgatmt aatcatgggc cagagcgaat ctaaggaacg 180 kaaaatgttc atcagtatct taatgaatac tctctctaaa cgtggaacta aagtctcttc 240 tgmtgatcta gctcgwtttc tacattttgt tcaggaacac tgcccmtggt ttcctgaaga 300 gggggcagtg aatttggaca cctggtctaa ggtgggggaa gaactwagac tgtattatac 360 tcaacatggt cctgawaagg tgccacctga tgcttttgct ttatggaatt taatcaaaga 420 tgctttagat ccmcagcatg attatgacaa atttccttcc aaaaaggctg ctgatgacga 480 ggatgatgaa aacttttctt ttcgctgctg cctctgcacc ttcaaattat gatgataaat 540 tatccccaca ggacacggca gctttggaga atgaagccac aagttaccat ggtaacaatt 600 accttaataa cactaatcca cctcttccac aggtccctct tcataaggca gaggtmagaa 660 agctaaacct tacacccggc ggtgtccctg cccctccttt tctttttgga atacaggctg 720 ctcgagcaga ggcagagaag gaaggagatt tcagtttatc agcatgtttt ccagtggcct 780 ttattactgg tgctgatgga gtgggccaag cgcaatggga gcctttgcct tacaagttgg 840 taagagaatt aaaagaggcc agttcacata tgktccttta gctccttata caattactat 900 cttagatatg gccctacagc tccttataca ctaccwtttt agatcggttt cctgcttggt 960 ggatgactmc ttatgattgg cwacagktgg ctaaaggttg cctgtcagga ggacagtttg 1020 tgctatggaa aacggaatat gaagagcagg ttaaaaagaa agtggctcaa aactaaaagc 1080 catccaccag gaaggggtgt cgtcgccaag gacatgcaca 
cgggaacagg agattattat 1140 actacacaag ctcaggcaaa tagtgttaca tacagatgtg cttgctgccg taagtgattg 1200 tggtgttatt gcctggaaaa ggttacctkc tgattctatt caaacttctt ctttgtctga 1260 gatgaagcaa aggtcagatg agccatatga ggaatttgtt gctaggttga ccgatgcagt 1320 magcaaagta gccwcwaatg aggaagctga agatcttatt attaaacaat tggcctttga 1380 aaatgcttcc cccccttgcc aaaccttgat taaacctatc aggaaaactg gagattatct 1440 gattttatta aagcatgtgc tgaattgact ccwgcctatt tacaaggcgt ggcaatggct 1500 gcagcttttc aaggacaatc tttttcaaat gacacaaggt attacagaga gcgacaaggg 1560 cagaataatg cctgctatca tgtgtggaca aattggacat tattccagac agtgtccttc 1620 caagcaaggc ttcgctagct acaaggcata gcactgccac tactggtgat aacagacctg 1680 ctgctccttg tcaacgatgt gctaaaggct ttcattgggc taaagattgc aaatctaaat 1740 ttcataagga tggcactgta ttatctccgc caaatcaagc tcagccgcag ggaaacttcc 1800 agaggggcca gtcccaggcc ccacaaataa cggggacaaa ttttgtcagc cacaaccatg 1860 aattctacca gcccattaac atcagggaat ccaatgaggc aaatccgttc cttcaaggaa 1920 cacaatcgat gggctcttcc gtgcaacccc aggcagcgca ggattggacc tcagtgcctc 1980 cacctcaaca atattaacac ctgattctcc tactactaag atccctacag gagtttatgg 2040 ccctctccct gagggaacag taggattaat tgtagggcgt agttcttcct ccctcaaagg 2100 aatttctgta gttccaggag tggttgattc agatttcact ggggaaatcc aggtccttgt 2160 tcagcccccc actaaaacag tacgattcat gcaaaggaca acggattgct caaatgctac 2220 tcttgcctta ctttaaagcg ggcacccagt aactaataag gagagaggga caaaggattt 2280 ggatccagcg gtatggcatt ttgggtwaaa gaagttaaaa ggtccagacc atataaagaa 2340 ttgtacattc agggaagcca aattgaaggc ttattggata ccggcgcaga tgtttcttgc 2400 attgctggaa aagattggcc tgmaccttgg ccaacagaga aggctccttc tggtctcgta 2460 ggtattggta acagcacaaa atgttgccag aagctcacaa gttcttccat ggactgactt 2520 ggagtcttca ggaacgtttc gaccgtatgt tataccttct ctccccttta ctttatgggg 2580 aagagacatt atggaacaaa tgggcatkct tttatatggt cctaatggca aagtctccag 2640 tgctcagatg attcaaatgg gcttccatcc cgataagggt ttagggaaag acttacaggg 2700 cataaaggaa cctgtccagg ttaaagcaag gaccatctgg gcagggacta ggttattcaa 2760 atttataatg cgggccactg ttgcmactgc tgaccccata acctggagag 
atgatatacc 2820 agtatgggtg gaacagtggc ccatgaccac agaaaaactg tcagcagttc aggaaatagt 2880 agagcaacaa cttgaggctg gtcacttaga gccttctaac agtccttgga attctcctgt 2940 atttgccatt aagaagaaat caggaaaatg gagaatgtta catgacctaa gagaaattaa 3000 tgataccatg gaagccatgg gcgcattacg gccagggttg ccatcaccgc wgctgttcct 3060 gatgaatatm acattattgt tatagatcta caggattgct tctttactat tcctttagcc 3120 cctcaggata agcaagagat ttgcatttag tattccatct ccaaatttaa ataggcctca 3180 tcagagattt caatggacgg tcttgcctca aggtatgaaa aatagtccac cttatgtcaa 3240 aaatatgttg atgcagcttt acaacattag acagagctac aaggatatat atcttattca 3300 ttatatggat gatatattas tggctcattc agatagatta tatttggaaa caggattggc 3360 tttattaaac tgttcaggat ttacawaatt ttggtctcca agtggctcct gaaaaaattc 3420 aaactcagcc accatttact tatttaggaa gacttatttt ttcagattac attacaccga 3480 gtcaaaagcc tttgcgattg gasataagaa atctggaaac kttgaatgat tttcagaaat 3540 tactcggaga cattaattgg ataagacctt atctaaaagt ctctacttat gaactaaaac 3600 ctctttttga tatattaaga ggagatccga atcctaggtc tccagagcct taactccaga 3660 agctactaaa gcgctaaagt tagtgcagga caaactattg ctgaccaaaa ggttaagaga 3720 cttgattatt ctcgaccttg gtcattaata atattatcta ctgaatatac tcctactgga 3780 tgtttatggc aacaaggacc attagaatgg attcatttac ctgtaactgc taaaaagaat 3840 tttaatagct tatcccgctt atgctgctaa attaattaga aaagggagac taagaagcgg 3900 agaattgttt ggctcagatc ctaatgaaat cattatacct tatgctaaag aacaattgga 3960 cgctttactc cagctgaaga acctgaatgg tcaatagcat tattgggttt tacaggtact 4020 atttcatttc atctccctgc taatcctctg ttacaatttt taaataacaa ggctttaatt 4080 ttccctctca attgttcctt gactcaacca ttgccacctc cagctaaact tgtttttacg 4140 atggatcttc aaatggaaga gctgcagcat atgttattga taatcaatct cttgcgcttt 4200 tagctkaaga aacttcggca caacgagcgg agttacgtgc agtaatcgca gcttttaaag 4260 ctttaacacc tgatccattt aatttatatt ctgactctag atatgtagta gcgctgtttc 4320 cccatattga aacagctcca attaacgtta ataattctag tattttcggg ttactggagg 4380 agttacaaca tttaatccgt caacgaaatg tgcctttttt tgttggccat gttagagccc 4440 attcttccct accaggtcct attcatgaag gcaatcaaat ggctgacctt ggaactaaac 
4500 ctactatttg tattgcagtg gaagtgaagc agtcgctaaa gctcaatctt ctcataaatt 4560 acatcatcaa aatgcttctg ccttacgttt tcagtttgat atamctaggg aagctgctag 4620 gcatatcgtt cataactgcg aagcatgtcc caccattgtt cctgtttcag gaggagttaa 4680 tcctcgaggc ttaaaaccaa atgctctatg gcaaatggat gttactcatg tttccagttt 4740 ggggaagttg tcatatktwc atgtctstgt ggatacwtwt tctcatgtka tackggctwc 4800 agcwmgaact ggagaggctt wtaaasatgt ggwgcaacac ctatttgcct gttttgcast 4860 gatgggaatg cctaaacaac ttaaaacaga caatggtcct gcatatactt gtagatcttt 4920 tcaacatttt gttcccaatt taatatttct cattctactg gmattcctta caacccwcag 4980 ggccaagcca tagttgaaag agcccatcaa actttaaaaa atcaaataat taaattacaa 5040 gagggtgagt ttaagtatgg ttctccccat catattttgt atcatgcttt gtttgtkata 5100 aatcacttaa atttggattc tgcaggcaat tcagctatga ccaggcactg gaacccagaa 5160 ggaagtatca aacctatggt aaaatggaag gatcttctta caggccaatg gagaggacct 5220 gatatattat tgacagcagt gcgagggatt atgcttgtat attcccacag aacgamtgaa 5280 gtcccagttt ggattccgga tcgattcttt tggaatgcct ccagtaatcc gttgactaag 5340 acgaatggca aaaatcagga cacttcagag gaatgaagga tgaacagcag cccctgtcac 5400 taaccaacaa aaggagctgt tctttataca acaggagact tcatcatggg tagagcaaga 5460 tctgcaacat cagtatagtt acatggacag tgaaaccata acatttcaga gagacctctc 5520 cttacggcat gagaacttac agtctaaatt gtctcgaaca agacaaacca ccattaggct 5580 ggtaattctg ttcttagcta tgattgttct tgtaatgagt caaatagctc cagcctctgc 5640 agctgtggct aataagactt actgggctta tatgcctaac cttccaacat ttcaaactgt 5700 cacttgggat aatgaggtta ttaatgtatt ttctaattcc tctcaattga tgggaggacc 5760 tactttactt catactcaaa caaaacacgt tattaatgat aatttttctt atgaaggaca 5820 ttcttgttta cctcccattt gtttttatca taattcagtc cccaaaaatt atagggcctc 5880 atctaatgtc tactgtaaag tctttaatca ctaatagctc agcccagagt aataaaagat 5940 attggtggat tgaaatgctg atgcctggag gagctgatcc cagtatagcc acaaaatttc 6000 accaccattt aatataccta tctgctctcc taataatgct gagaaggact ggatactcac 6060 tcaatatgtt aagtatccaa agtggagaga gtgtgtttat gaatctaaaa tagatgaccc 6120 catatcaggc gaaaaactat atattagtga ctggggttta aagttacagt cctacttaac 6180 
tttcaatcaa cctcccgcta ggtggcacat taatggaggg gtacccccga tattaacaac 6240 tactatcagg aataagcaaa ttgttcagcc ttacttgtgg agggccctag ctgccctaaa 6300 atcccttgct ttaatacaac cacgtggggg aattaatgtc tacaaaataa gagcttgctt 6360 acaataacct tatgtttttc ctgtatggaa tgaatatcac actgtaagaa ttgatcacta 6420 cagcaatgtg tattctatat cttgtcaaaa ttgtattctc acaaattgtt taactgatgc 6480 taaatatcaa ggttttttta attmttaaac aacctccata tttaatggtt ccagtacaat 6540 tgtcaactca ctggtatgat gatggcttac aaacattgta tgaggttaat tccttgctag 6600 aaakacaaaa gagatttttg gctactttaa ttttaggtat tacagcttta attgctattg 6660 ctacttcagt tacagtatca gcagtggctc taactaaaga agtccattca gccacgtttg 6720 ctgaccagct gtctagaaat gtttctatag ctctagctac tcaggaaatt attgatcaga 6780 agctagaaag caaggttaat gccttggagg aagcagttwt agcagtagga caagaacgtg 6840 tcaccttaaa aataaaattg sctctccgtt gtcattctga gtttaaatgg atttgcctca 6900 ctcccttaga ggtaaataag tctttttata cttgggacaa aataaaaaat catattcttg 6960 aaatttggaa tcactcagac ccgagcatag acttaatcaa attgcataaa caaattcagg 7020 acatgacact tgctgaaaag caatctcggt ccactgaatt ggcaacttct atatttaata 7080 gtctctcatc atttgttaac aaaaaatcat tgatgtctat atttattaat gtgagcattt 7140 gcagagcagt gattcttttc ttgctgtgtt tactcccaat catcttcaca gtattgaaga 7200 agagtatatc tcagctcaat acggagtttc atggttttgt gctaaaaaat aaaaaaaggg 7260 gaga 7264 // ID HSAT5 repbase; DNA; PRI; 86 BP. XX AC . XX DT 01-JUL-2003 (Rel. 8.06, Created) DT 17-JUL-2008 (Rel. 13.07, Last updated, Version 2) XX DE Human centromeric satellite. XX KW SAT; Satellite; Simple Repeat; Centromeric; HSAT5; KW Satellite repetitive element. XX NM HSAT5. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RA Smit A.F.; RT "HSAT5: Human centromeric satellite."; RL . XX RN [2] RP 1-86 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR [1] (Consensus) XX CC CC [2]. 
XX SQ Sequence 86 BP; 22 A; 30 C; 18 G; 16 T; 0 other; cactgaccag gtccttactg acaaggcctc actgacaagg cctcactgac caggtcctta 60 ctgacaaggc ctcactgaca aggcct 86 // ID LTR21_Mim repbase; DNA; PRI; 423 BP. XX AC . XX DT 09-NOV-2009 (Rel. 14.11, Created) DT 09-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR21_Mim. XX NM LTR21_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-423 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2981-2981 (2009). XX DR [1] (Consensus) XX CC ~90% identical to consensus. 4bp tsd. CC Similarity to LTR20_OG from bushbaby. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 423 BP; 102 A; 102 C; 96 G; 123 T; 0 other; tgtaacagag ggaatggcct ggagaaaaaa aaggcaaaaa gattttctgt ctctttaaaa 60 cttcccccac tctttttggg aactgcagcc tggcctgcat ccctgaggca ggttaagtct 120 tttgttaaaa ctcccaggtg gcgccagccc cccaggtggg cagctggcgg agttcacagg 180 ctttgtcttt ggcagagatg ggatgattag aatagttaac cacagtaatt gcctgaagct 240 gagaaccctc cctttaaaag ctctgtattt ctgcctatgt tcaggaaatt tgggattttg 300 agatgagaag gtctaccatt tttcctcctt tgcaaagtaa actctctttc cttcctcccc 360 aaaccacttg tcctcgttct tctgattcgg cctcgtggac aagtggctga gctttcagta 420 aca 423 // ID LTR13B1_OG repbase; DNA; PRI; 519 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 
16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR13B1_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-519 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1586-1586 (2011). XX DR [1] (Consensus) XX CC ~95% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 519 BP; 115 A; 185 C; 90 G; 129 T; 0 other; tgagagggcc agaccagcgc cattttgttg tttactgcct aagttctgtt cctcattttg 60 agcccccgcc gccatcttag ataataagct tcagcttccc caaaaccctg gcctttcccc 120 gaaatcaagc accccagtca gccaatcaga agtgctcccc acccttcttg cttcatacca 180 ccctaagacc ccagttagga aatcccccac ccctcatacc tgtaccaatc cagagcctcc 240 ccgtacttcc ccaaaccctg accaccaatc acccatactt ccccataccc cgaccaccaa 300 tcaaaaatgt accccaccct tcttcaccct tcttgctttc tgtaccctat aaaactttcc 360 aaccccctag gctcggggcc ttccttggta tgtttgccaa gagagggtcc gggctcgagc 420 ttgctaaata aagcttgcct cttgctttta catcggagtg tgtctcggtt cgtcccttcg 480 tgggctcggg gactcaggga aattctgggc acaacagca 519 // ID MER41B2_Mim repbase; DNA; PRI; 580 BP. XX AC . XX DT 11-NOV-2009 (Rel. 14.11, Created) DT 11-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW MER41B2_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-580 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2986-2986 (2009). 
XX DR [1] (Consensus) XX CC ~87% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 580 BP; 161 A; 159 C; 116 G; 144 T; 0 other; tgagataggg aaccggcagg ccctgctcaa taaggagtca taagacccga accaaggtct 60 ctgataaaac aggaggtggt ggagaagccg gccaaaacca gctagaaccc agatggccac 120 gaaaaatgac ctcaggttac cctcactgct cattataccc tgcttataat gtattagcat 180 actaaaagaa actcccacca gcgccatgac agtttacaaa tgccatggca actcccggaa 240 gttaccctat atggtttaaa aaggggaggg accctcggtt ccgggaactc cccacccctt 300 tcccagaaat cttatgaata atcctccctc cgcttaacat aaaattaaat agaaggtata 360 aaataagacc ccggaaactc acaaagtgct actctctgtt gctctgagag atagctgctg 420 ccctgtctat ggggcaagtc tttctttctc tttcttactt caataaactt gcttttgcct 480 tactctgtca gctcgctctt gaattctttc ctgcgcgaag ccaagaaccc acttggcctt 540 caggccggac cccagttttg gggtccactt tcctgtatca 580 // ID LTR1C2_OG repbase; DNA; PRI; 419 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 05-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1C2_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-419 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1669-1669 (2008). XX DR [1] (Consensus) XX CC 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 419 BP; 100 A; 116 C; 85 G; 118 T; 0 other; tgatacaggg ttcccccatt caaggctgga caggggaccc cagttttcct aggcctgggc 60 aatttcatgt aagccgccag gcctcttctg aagggaacgc cctgaccttt tgccccgagc 120 tggagggact gcattcattc atttgttaat aacgctcatt aacattgaaa aggtgtcttc 180 catctaccaa gtcctggcca aatgtaattc tcaagacctt atatacccct agacaaagga 240 cccacgcgga gacttcccac ttgggtcttt ccctctgcca gaagctctgg tgctttttat 300 tccttctgta ttcctttctg tctaataaaa tcctatctta ccactgatct gtggtccgtg 360 ggttcattct tcgaatcccc gagaccaagg acctactgaa gaaagaaatt ccggtaaca 419 // ID L1P4b_5end repbase; DNA; PRI; 1783 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE L1 Non-LTR Retrotransposon from primates. XX KW L1; Non-LTR Retrotransposon; Transposable Element; LINE; KW L1P4b_5end. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-1783 RA Smit A.F.; RT "L1P4b_5end - L1 Non-LTR Retrotransposon from primates."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC L1PA16. 
XX SQ Sequence 1783 BP; 317 A; 641 C; 543 G; 270 T; 12 other; agaagcacca agatggctga ctagaagcag ctagtgtgtg ctgctctcac ggagaggaga 60 cagagtggcg agtaaacact agctcttcaa gtggatcatc caggaggcca cattgggatt 120 catcaaggaa gcaatggcga cccatggaga gcagagagga gcgaggcagg acagccgccc 180 acctgggatt ggcacggagc cagggaggct ccctaccatg gggaaagggt gagtgagcga 240 gagcccctgg ggacccacac ttctgccacg gacctttgca atcctgggca caggagatcc 300 cccctgaccc cccgggcctc cagaccgaca cggagagctg cctggagtct gggcagagcc 360 gccgctcagg cccacgtgga gccccacggg ccttggatcc ctgagcaccc cggcgccagc 420 tgccgtagcc ccgccaacaa gggaggccag gctctctcgc gtacccctag gataggggcc 480 gcatccacgg tgctgaggag cagatggact gcaggcccca cctccgctgc acctcgccag 540 gcaaggccca ccggcctggg nccccagcgc agccacccca ccccngcctg agcactccgg 600 ccggttgcgg ccctgcattt ctctgggata gagctcccag aggtaaccaa caggcccgct 660 gcgattgccg ctgccacggt ccccacccct gctgccctca ggctggggag ggaggaagag 720 caaagatgcc tcaagaactg tcacgggcct ccagcacgcc acagctgcca tacggaaaag 780 cggccagact gttttccacg tgggtccctg cccctgctgc tcctcaccgg gcagggcctc 840 ctggcctggg cccccagcgc agccgcccca cccctgcctg atcacttcgg tcggtggcgg 900 ctctgcattt ctctggggtg gagctcccag aggcaaccga caggccctct gccattgccg 960 ccgcagcggt acctgccctt gctnccctca ggctggggag ggaacaaaga gcctgattgc 1020 ttttgcatgc ctccagcacg ccgcagctgc cctacggaga ggaggccaga ctgtcttccn 1080 cgcgagcccc tgacncccct gctcttcacc aggcagggcc tcccggcttg ggcccgcagc 1140 gcagccgccc cacccccggc tgatcattcc natcggcagt ggctctgcgt ttctctgggg 1200 tggagctccc agaggcaact gacagcccct ctgccactgc cgccactgcg gtacctgccc 1260 ttgctgcccc caggctgggg agggaacaaa gagcctgatt gctttnctca cacctccagc 1320 atgccgcagc cgccctacag agaggaggcc agactgtctt ccccgtgagc ccccnccccc 1380 cctgctcttc accaggcagg gccccctagc ttgggccngc agcgcagccg ccccaccccg 1440 gctgatcact ccaatcggcn atggctctgt gtttctctgg ggtggagctc ccagaggcaa 1500 ctgacaggcc ctctgccatt gccgccgcaa ggtccccgcc cctgctgccc ccaagctggg 1560 gagggaacaa aaagcctgag ctcgccccag ggctgcggtg nacagctcgg gagtgccgag 1620 ccgagatctg tggccagcac ttaagcggaa 
gaggagccca cactctcaga gcactgagag 1680 gggtgagtcg cgtgggctcn tgggctgccg cgggagcggg gcatgcctcc ctccacaggg 1740 ccagcccgga aaaggtgtgg cctgtctccc tgccgcggcc tct 1783 // ID LTR1A3b_OG repbase; DNA; PRI; 425 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1A3b_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-425 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1581-1581 (2011). XX DR [1] (Consensus) XX CC ~91% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 425 BP; 102 A; 115 C; 111 G; 97 T; 0 other; tgataccgga caaggtggca ccgagaccgg ccaccttggc cctggacctg ttgctgttgc 60 ggcgaatgga ggaattcggc cggacacgag cgggggaaga cactggcgcc gtgctgtctc 120 ctcacgtgga cttctggcaa gtcaattcct ccagtgcatc cctgattggt ctgtttccaa 180 aacctgctcc tgattggtct gttttcaaga cccgaaagct cattggacaa ctgccactat 240 ataagcccct aagcagagag cccaaacaca gacctccaag agagagcagg agagcaagga 300 ggtttggttc tttgtgagag ctgtaacact tgtattgtta aataaaacct aacctactgc 360 tgatcctgtg gtccgtcgct tcattcttcg aaccgccgag accaagagcc tggaatccgg 420 tatca 425 // ID LTR10B_Mim repbase; DNA; PRI; 284 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR10B_Mim. XX NM LTR10B_Mim. 
XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-284 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2965-2965 (2009). XX DR [1] (Consensus) XX CC >98% identical to consensus. 6bp tsd. CC Similarity to RLTR10F from mouse. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 284 BP; 56 A; 88 C; 73 G; 67 T; 0 other; tgtggggcgc ggtggcaacg ccattccaag atggcgcccg cttcctggtc acaccctact 60 gtgcgtctga caaacagatg ttcagcgcat gtgcaaagcc ttgcctcctc tcctgttctg 120 tgttgaccaa tcaggttatg ccccgtgtac ttactgtcta tataagcagc cgccgagaac 180 gcctcggcgt cttccgcatg taaccagtta agcaatcccc attaaagcgc tgtcagaaga 240 actccagttg ccgcgtcttc cttgctggcg aggcgggcgc gaca 284 // ID Tigger2b_Pri repbase; DNA; PRI; 1068 BP. XX AC . XX DT 16-MAR-2006 (Rel. 11.03, Created) DT 10-APR-2007 (Rel. 11.03, Last updated, Version 1) XX DE Mariner DNA transposon from primates. XX KW Mariner/Tc1; DNA transposon; Transposable Element; mariner; KW TIGGER2; Tigger2b_Pri. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-1068 RA Smit A.F.; RT "Tigger2b_Pri - a subfamily of Mariner transposon from RT primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC Contains pos 1-744 and 2461-2718 of Tigger2. It includes a 66 bp CC "unique region" also found in Tigger2f in carnivores and CC cetartiodactyla. 
The dog element comprises a full Tigger2 CC element plus an insertion showing a 7 bp TSD, and the most CC parsimonious explanation is that Tigger2b is a deletion product CC of Tigger2f, although it beats me how these elements ever CC interacted (the ORF is disrupted by the insertion). Pos 811 CC corresponds to pos 2461 in Tigger2. XX SQ Sequence 1068 BP; 336 A; 204 C; 208 G; 320 T; 0 other; cagttgaccc ttgaacaaca cgggtttgaa ctgcgcgggt ccacttatac gcggattttt 60 ttcaataaat atattggaaa attttttgga gatttgcgac aatttgaaaa aactcgcaga 120 cgaaccgcgt agcctagaaa tatcgaaaaa attaagaaaa agttaggtat gtcatgaatg 180 cataaaatat atgtagatac tagtctattt tatcatttac taccataaaa tatacacaaa 240 tctattataa aaagttaaaa tttatcaaaa cttacgcaca cacttacaga ccgtacatgg 300 cgccattcgc agtcgagaga aatgtaaaca aacgtaaaga tgcagtatta aatcataact 360 gcataaaatt aactgtagta catactgtac tactgtaata atttcgtagc cacctcctgt 420 tgctattgcg gtgagctcaa gtgttgcgag tatccgctta aaacgccgtg tgacgctaat 480 catctccgcg tgagcagttc gtctctccag taaattgcgt atcgcagtaa aaagtgatct 540 ctcgcggttc tcgcgtattt ttcatcgtgt ttagtgcaat accgtaaacc ttgaataaca 600 ccatgggacc catacgaagt gccactagtg atgctggaag tgctcccaag aagcagagaa 660 aagtcatgac attacaagaa aaagttgaat tgcttgatat gtaccgtaga ttgaggtctg 720 cagctgcggt tgcccgccat ttcagacaga tgattcatct tgtaaacaga tgacgtaaac 780 ttacggtatc gataaataca gtacagtact gtaaatgtat tttctcttcc ttatgatttt 840 cttaataaca ttttcttttc tctagcttac tttattgtaa gaatacagta tataatacat 900 ataacataca aaatatgtgt taatcgactg tttatgttat cggtaaggct tccggtcaac 960 agtaggctat tagtagttaa gtttttgggg agtcaaaagt tatacgcgga ttttcgactg 1020 cgcggggggt cggcgcccct aacccccgcg ttgttcaagg gtcaactg 1068 // ID L1P4e_5end repbase; DNA; PRI; 1391 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE L1 Non-LTR Retrotransposon from primates. XX KW L1; Non-LTR Retrotransposon; Transposable Element; LINE; KW L1P4e_5end. 
XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-1391 RA Smit A.F.; RT "L1P4e_5end - L1 Non-LTR Retrotransposon from primates."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC L1PA16. XX SQ Sequence 1391 BP; 338 A; 448 C; 389 G; 209 T; 7 other; gagccaagat ggccaactag atgcagccag gaagagcttc tcccactgag agagaccaga 60 acatcaagta gaccggcaca ctccgaacag atcttcngaa agaaggcatt gagagtggat 120 agagggagga cgcagacccg gggctgaaag gggaggaagc tgggaaccct gcacggggtt 180 gccgagcacc gggactcgtt cctggccctg agtggctcct agggaagggg tgagtgaaat 240 aggcgtggag cggcccactc tcgccacgga cctccgggat cctagctgca ggagacccca 300 tgacccccac ggacatttga gctggcaggg agaactgccc ggagagttgg cagagacaga 360 actccagcct gcacggagcc cagagggttt ggcgcgggaa cggctgcagt ggagcacggc 420 catgggcgcc catcccccaa ggctcgccat actcctctag gtggctttag cctttgttag 480 ctgccagacc tgganagagc agggctgtct tgcccgcggg actggggcga gtctgatctg 540 agcgcccccc tgtctgccgg cctctcccag ggtccctgcc tggccgcacc cgcttgcagc 600 gcagcctcag ctgcccagcc gaagcgcttg ccagcggcca ccgccatagc nctttcgcca 660 gcagcccctc gccatcccgc cggagcgctt ttgccagtgc ccacgcaccc accgccgccc 720 tgccggtgcg cactcgcctg cagcctcccc gccgccctgc ggtgcgcatt gcccacngcc 780 cccactgccc cgctggcgca atgctttcac acggcgnccc ccgccgcccc gccggagcac 840 ttttgccggc agcccccatc ggagtgttgt tgccagcaga ctgggagcac tctcggcccc 900 tccagcgcag caggtgctta acctcgaggg gccagagaac aaagctgcgg gcctggtccc 960 agccccccag ggttagagca cgcagcccag gagtgctgag ctgagccttg gccccctgaa 1020 agcatccaga aatgaagcca atcgactaaa cccaacttat accacagtca aaccctcaag 1080 ggcatcaaag aatataaaag caaaaagccc catccaaagg acagcaactt caaagattaa 1140 aggaacatca gcccacacag atgagaaaga accagcgcaa gaactctggc aactctaaaa 1200 gccagagtgt cttcttacct ccaaacgacc gcactagctc cccagcaatg gttcttaacc 1260 agattgaaat ggctgaaatg acagacatag aattcagaat ctggatggca aggaagctca 1320 ncgagataca ggagaaggtt gaaacccaat ccaaggaaan cagtaaaacg atccaagagt 1380 tgaaagatga c 
1391 // ID LTR1C4_OG repbase; DNA; PRI; 542 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 05-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1C4_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-542 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1671-1671 (2008). XX DR [1] (Consensus) XX CC 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 542 BP; 123 A; 159 C; 117 G; 139 T; 4 other; tgatacaggg tttcccccgt tcagagctgg acagggacat cccccatttt ctaggcctgg 60 gcaaacagac cttyttccag accgcactct ctggactcaa gcaggatcag agaaaagcag 120 cctgaycagt atctcatctg ggtggaaacc aggcccctcc tggagtggaa aacatacacc 180 ctgctgcttc gcctgggaca gggactgtga ggacctcatt catttgctaa tacattcatt 240 cgctaatcaa catgggcagg atgttttctg tctgccggct cctggccaag tgtgacatac 300 attcccaaga ccccctgctc gcaggcagta ttatgaaccc ctagacaaaa gacccacggg 360 ggcttcccac ttgggaattc cccccctccg ctgccggagg ctctgttgcc cttctttcct 420 tctctatttc tttctctgtc aataaacttt gtcctgtcac ttatctcatt ggtctgtggt 480 ttcattcttt gaatctccag aycaaratca aggacccagc gcagagagaa attctggcaa 540 ca 542 // ID RTVL-Ib_I repbase; DNA; PRI; 6649 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Internal portion of ERV1 Endogenous Retrovirus from Pan DE troglodytes. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; RTVL-Ib_I. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. 
XX RN [1] RP 1-6649 RA Smit A.F.; RT "RTVL-Ib_I - ERV1 Endogenous Retrovirus from Pan troglodytes."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX SQ Sequence 6649 BP; 2120 A; 1482 C; 1374 G; 1673 T; 0 other; aaaaatttgg gggcccacct ggattcccat tctcctccga ggaagggtct ctctggtcct 60 cttccgtgag gagcgcactg cgctgcctcg ttgtggtggc ctcaggggta aggaatcgag 120 acccacccaa tgtgacaaat aaacctggac tttcagcaac gcaggaagaa acagaccaac 180 aacttgggga gaaaagatcg tcacatactg caatgaccag gtaactctgg ttacagacca 240 aggtaagaaa tgttgttaag agcgacaaag tatttccttg gtggttaggg tatcttggag 300 gttgaaagtg tgtgaatgag atgcaccatt gagtgcaaag tgagtggtga gttcagatct 360 gcagttctgt ggtcacctca tctggcctag ggtggccctc ctgttaagag tctggattgg 420 gggtttatac caacccgcca atgctaagag ggatgtaaca ttcccgtgag agaagcagcc 480 agagaaggat gaagcaaaag gagaagagtg cgagaagcct ccagcagggg gctcgagcct 540 ctaggaaagg aaggcgagaa atctccagta tgggggattg agcctcatac acacatccag 600 tagtagggaa ggcaagaaat ttccagtaga ggaaattgag cctcaccctg aaaggtgaga 660 aatctccagt aggggagatt gagccttacc ccaaaaccat caagatggga aataccccaa 720 gtaaggtagg ggataaaagg gataaagcta gcaacactaa tattcccctg atagtccatt 780 aggcctaatg ttaaaatatt agaaggataa tgaaaggacc aaaaacaaga aaaagcagtt 840 gctgacccag acccttctcc tacccctgtt gtcccctcct tataaccctg cctctttgga 900 atcatcccaa gagctcactt actaccagcc taagtaccct tccctgaaag gacttcaaca 960 caagataaag caatgtaaaa agaatattca gaacttccct ttcccctcta cttcggggga 1020 atcagctcca gctctcttcc cctgaagggg gtgcccccta ggaggagggg gcattggctt 1080 tgtaaatgct cctttaacca gcttggaggt ctgaaaccaa aaaagagagc ttaagccagt 1140 attagacgac cctcatggag aagcagatca aattgaccaa tttctgggac ctcagttata 1200 tacttggggt gagttaatgt ccatcctagg catcctcttc ttgcagggaa aaaagaggca 1260 tgatccacag agctgctaca atagcctgga atgtgaacac cctcctgcca aaatgttcct 1320 acagcagatc aaaaaaattt cctgcccaag atccctggtg ggacaataac aacacaaccc 1380 accaagaaaa tatgcaggac cttaggaaat tgataataaa cgggattaaa gaatcagtac 1440 cccgagccct gaatcttacc caagcacttg atatacagca agggaaagat gtagggcctt 1500 
taaaattttt aaacagacta aaggaacaaa cgagaaaata tgctgatcta gatcttgagg 1560 atcctctttg gcagggaatg ttaaaatttc attttgtcac taacagttgg ccagatatta 1620 acaagaaatt acaaaagata gagaactgga aagatagacc tatagaagag cttctgagag 1680 aagcccaaaa agcaaatgca agaagagatg aagaaaaaca aaagcagaag gcaaaaattc 1740 tgctgtctac catacaacaa agtacccagg gggccagaac ctataaagaa cccagacccc 1800 cactctccag gcaatataaa gggtacaaaa gagtaaagcc agcagactca aaggtagaaa 1860 aaaaaagagg acagaacaaa tgttttaaac gtggaaaatt aggttacttt aaaagacaat 1920 gtcccaaatg agaaaagaaa gaaacccagt tatggctttt gaagaagact aaggaagtca 1980 ggggctctac tttctacttg agtcccacca agagcccttg ataaatttag aagtgggacc 2040 caatgttgag cttattatct ttttaatcaa ctcaggagta gctcgctcct cagtttgtta 2100 tcttccatct agtgtaactt gctcacaaga agaacttttt atctcaaagg taaaaggaga 2160 agggtttaca gcaaaaaaaa aaaaaaaaaa acttagagga gacagaagta aaatacaaaa 2220 accaatcagc tagtatcaaa tttctgttaa tcccaggaac agggacaaat ctattgagaa 2280 gagatttaat gctaaaatta agcttaggcc tccaaatcaa tcatggaaaa ttcctcccct 2340 ccctaaactt gttcaccacc acagacaaag aacacattca tcctcgggta tggtcaaaag 2400 acgggaatcg aggaaagtta cagattcctc ccattcatgt taaattataa acccttaggg 2460 aagttgtaaa gagaaagcaa taccgtattt attcctttaa aagccagggt aaatttaaaa 2520 cctgtaattg ataatttaag gtcttctccg tgatgggctt cttgagcctt gtatgtctcc 2580 ctataacact ccaatactgc ctgtaaagag gccagacatg tcatactgct tagtgcaaaa 2640 tcttagagct attaatgaga tagtccaaac caaccactct gttgttacta gtaactgccc 2700 ctgtcctagc tttacgctcc ctagaactgc cctttcatct ctccaccaat gtaaacaaag 2760 gagtagcctt aagagtactc actcagaaac acgggggcca tcggcgccca tagccttttg 2820 gtcaaaaatt cttggccctg taatccgagg gtagcagcaa cagccctact aacagaagaa 2880 ggcaggaagg ttaattttta gaagggactt cgttgtcagt acacctcatc catttaggac 2940 tacccttaat caggaggcag gaagggggct tactgactca agaattttaa agtatgaagc 3000 tatcccatta gaaagagatg atttaacact aaccactgaa aattcactta gcccagcagg 3060 tttcctgcct ggggacccaa atttaaagag acttgagcat gagtgtttag atttaattga 3120 ttatcatgct aaagtcaggc ccaatttaag agagacctct ttcaaaacgg ggcagcactt 3180 atttatagat 
ggctcttcct gggtaattga aggaaaaaga cataatgggt actcagtggt 3240 cgatggggaa gcccttgcag aagtagagtc aggaagacta cccaataatc ggtctgccca 3300 aacatgtgag ttgtttggat taaatcaagc cttagagcac ttgcaaaacc aagaagggac 3360 tatctatact gattccaagt acgcctttgg aatagctcac atctttggaa aaatttggac 3420 tgaatgaggt cttattagta gcaagggtca agacctgatc cataaagaat taatcaccca 3480 agtattagag aacctccaac tgccagaaga aatagttgtt gtctgtgtcc caggacatca 3540 gacaagtatc tcttctgaaa gcgcgccatg ggaataatcc tgcagatcat atagccaaac 3600 aagccgccgt ttcctctgaa atgcctgttt ttcacttaac ttcttgcctt tcccccctga 3660 tcgcagtccc catcctttcc tccaccgaaa aggaaaaatt aggggtcaag gaaaatccag 3720 aagggaaata ggtgttacca gaccaaagag aaatgttacc caaacccctc atgggagata 3780 ttccctctca tccgcacaag ggattcattg ggggcctcaa gccatgtgtg atgcagttct 3840 tagggtttat ggatgtatag gaatttatac tttggcaaga caggttacag atagttgcct 3900 agtatgtaaa aagactaata aacagaccct cagaaaacca cctcttggga gaagaaatcc 3960 aggattaagg ccattccaaa gtatccaagc tgattacacc aagatgcccc aattggtcat 4020 ctaaagtatt tattagcagt agtagatcac cttactcatt aggtactcat ttcactgcac 4080 atgtccttaa gaaattagcc caactactgg atataacatg ggagtatcat actccttggc 4140 acccaccttc accagaaaga gtggaaaaga atgaacgaaa ctctaaaaag tcacctaacc 4200 aaattagtct tagagacttg gttgccttcc cattgccctg ttaagaatcc aaactgcttc 4260 ttagagagat gttggcttat ccccttatga aatgttgtat aggttgccct atttgcactc 4320 cactgctgac attcccacgt tcgaaacaaa agatcagttt ctcggaaact atatacttgg 4380 tttatcttcc actttctctt ccctcagaac taaaagtctt ttagcacagg caccacccct 4440 agagtttcca gtacagcaat atcaggctgg ggaccacgtc ctagtcaaaa gttggagaga 4500 aggaaaaccc gaactggctt gggaaggacc ctacctagtg cttctaacca ccaagaccgc 4560 agttcgaaaa gcagaaagag gatggactca tcacacccgg gtcaaaagag caccaccctc 4620 tccagaatca tggacagcta ttccagggcc aaccccaaac aagttaaagc taaaacgggt 4680 ttgatcctct tatattgcat ctctttcttt tccctttcta ttgctaatcc tctcgttatt 4740 aatgtgacta ggttgagttt accccaaact attacttttg atgcttgcct tgtgatgccc 4800 tgtggagatt tgccaaacca gaggcaactc tccacttcag aaaagtatct ttgtctttcc 4860 tggctctcct cagactggaa 
atctgttaac tgagatgagt tagtctagga agattttgat 4920 gatgatccca gtatgaactg ggaatcttgc cctcctagaa tagaattttt atgctgtagc 4980 tggtctaatg tcttatggac tatcaaggaa caaggatgga ccaccccaac cagtggttgc 5040 agtttcctga aaccatatat tcattttact aaaggaatta cccccatcaa ctgtcagcta 5100 aaccaatgta atccagtaca aattaccatc tcagcttccc aaaattcctc cttcattaaa 5160 tcgtttttat ggcacaagag cagaagtctc aggaacagac cctataggat tctttgaaat 5220 gtgctgtatt gctcccctac ccccttcacg tccttctaat acatcttcca atcaaaccgt 5280 tactcttcct ctacccaatg ataaaactaa ggtagccatt gtagaagtta aaagatttaa 5340 agcagaccat agcaattgag acagggtacc gagatgcgaa tgcttgggtg aaatggatta 5400 aatattccat ctgcactcta aacaaaagtg actgttacac ttgtatgcac ggtgccacag 5460 gcccaggctg tccccttttt actcagatgg tcttccagct gacctggcat gagctgtatg 5520 gtagctcttt tccaagacct cacagtctga ggtaatgaat tcgccaagct ctctctctct 5580 ctgctattcc ctgaagtcca acaccctgca ggtcaggccc ctggggggca tccagcctcc 5640 gtatcctggc accaattttt cctcatgtct ctcatgacaa ggggaaaaat tggcgtatct 5700 tggaaactaa aggggtgcag tcagcttaag cccttccaag agcttaccag tcagtctgcc 5760 cttgttcacc cttgaacaaa tgtatggtgg tactgcggtg gacccttact aggcactctg 5820 ccaagtaact ggagcagcac ttgtgctcta gtccagctgg ccatcccttt tacctgagca 5880 ttccatcagc atggtagaaa agaaaattgt aaagaaggag tggcttacgt gggtcctttg 5940 accctcatgt ttatacaggc gctattagag ttccaagagg ggcaccagat gaatttaaca 6000 cccaaaatca aatagctgta ggatttgaat ctgtgctgtt ttagtggtga agtataaaca 6060 aaaatgtaga ttggataaat tatatttact ataatcaaca aaagtttgtt aactacacaa 6120 gagatgccat taaaggaaca gctgaacaat tagaccccgc cagccagatg gtttagaaaa 6180 atagaatagc tcttaacaca atgttagcag agaaaggcgg ggtctgtgtc atgattagag 6240 tccaatgttg tatttttatc cctaataaca cagtccccga cagaacagtt aacaaaagct 6300 ttgcaaggcc taaccttatc cagtaagtta gcaagaattc tagaataaat gaccccttta 6360 ccagttggat agagagatgg tttagaaaat ggaggggact aatgtcctca atatttactt 6420 ctcttgcaac tgttataagc gtgctgattc ttgttagatg ctttatcgta ccatgcattc 6480 gtgaactggt acaaagactc atagaaacag cactcaccaa aacctctctt agccctcctc 6540 caccttactc agataagctt ttccttttag 
aaaaccaatc aaaacagcaa agccaagata 6600 tgatggaaag gtttgaagag gaaaatttgt aaaatcaaga gggggaaat 6649 // ID PTERV2c_LTR repbase; DNA; PRI; 603 BP. XX AC . XX DT 14-MAR-2006 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from Pan DE troglodytes. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW PTERV2c_LTR. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-603 RA Smit A.F.; RT "PTERV2c_LTR - ERV1 Endogenous Retrovirus from Pan troglodytes."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC LTR2/ptervb or ptervd lib20040702 Most common PtERV2 LTR (19 CC copies in panTro2). XX SQ Sequence 603 BP; 167 A; 157 C; 134 G; 144 T; 1 other; tgagaaacct tagaaaaata tagccattaa aatagaaaaa cacttagaaa aacactttgg 60 gaaaacatgt cagaaaaaca tttcagaaaa acaagtcaga aaaacattta ccagatggac 120 agtaccccac ccgggtgtgt caacagatgg acaggagtcc gggtccgggt gtggctatca 180 gggacaccag atgtccaggg tagtgtcccc tacattccca gaaccaaagt agaaaaacat 240 ttccaggaaa taccccttcc cgcctgtgag ccccctgacc aaccagtgtg ggacagctgg 300 ctcaggccat agttagaaac caatcagttg cttccaaacc tcgcgccttg aaccttttca 360 aatgtgtgaa ccaatcaact tattgtaccc caccctgttt naatttgtaa cctccccctg 420 tgtgactttg tggtttttgc ctttataagc agtgtgcaac agccgttcgg ggtctctcgg 480 cctcttgtgc tggggaccct agcgcgctag caataaagag tgtctctttg ctgtgacctc 540 cgtgtcgagt ggtctctcgc ggcggcccct ctcgaacttg gaatcttgag caaggttcca 600 aca 603 // ID L1-5_TS repbase; DNA; PRI; 6035 BP. XX AC . XX DT 03-MAY-2010 (Rel. 15.05, Created) DT 03-MAY-2010 (Rel. 15.07, Last updated, Version 2) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-5_TS. XX NM L1-5_TS. 
XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-6035 RA Bao W. and Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(5), 771-771 (2010). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 6035 BP; 2317 A; 1413 C; 1203 G; 1101 T; 1 other; gagggatcca agatggccac catggagcag cagaccggca cctctctcac cgagcaaggc 60 agagagtagg gacccagccg ccaatagaca gagaccctga acacttttgg atgggtgaag 120 caccctgcac ccagcaaagg tctctttgaa cacatagaga ccagaaagtg acggcagatg 180 cgggaatccg ggcaccccat cccccgccat gtgccaggct tgcacaggaa ctgaagccct 240 gcccgtgaac gagagtgggt gagtgatcct aagatgcttt gggtcggttc acccctccca 300 gccatgcatg agtcattggt cccatgcagg aggagtcctg gtcagggcac agcagcaggc 360 aaccggagcc caagtggaat gggagaaaaa aacctatgct ggtcagccgc catgagcatg 420 gccccagccg gctcccaggc agggaaagcc cacctgcaga gtgaaccttg ctggcctgct 480 gacccatctt cccagccaga tcaggaagaa tcacctccaa gccctgcatg cctacagggt 540 ggatccccca ttcccccatg gcagggaact cactcttttt caagagcaga gtgtggactt 600 gaacagggag ggagcgacac gaaagacacc tgaaggcttt cttggtggga aggcttctgg 660 gctgcaaacc gcagatgggc gggcccatcc ccccacctgc tctccacaga gtgctagcag 720 caggggctga gaaaagctac agttgagaaa cctgtcacag ggtggagcaa ggagcagaag 780 tgaatgcccc tccctctctc actgagtgac ctatattgaa ccacagaaag ggccctagaa 840 cagagggtca ttgactagct tcagagcaac aaaaccaaaa ggaattacag agagggagag 900 aggcctgtca caaatgagga acaacaacaa aaatagacaa ccctcagagt cctaaagaga 960 cagcacccga aaaagcagct atcagagcac aaacaatgga aaccccccag aacgacctgc 1020 cccagatgac agtcacagaa atcagagatt ggatggctaa aatgaacaga gagaaccaag 1080 agaaactaga agcaaaactc actgaagtca tacaggaagt aaaagcagag ttggccaatt 1140 tcagaaaagc ccattctgaa ataatggaaa tgaagaactt catcctggaa attaaaaaca 1200 
cagtaacaag catagggagc aggatcaacc aggcagagga aagaatctca gagcttgaag 1260 accaaaatat ggaactaacc cagtctgtca aaaacatagg gaaaaaagct caaaaagaag 1320 gagcaaagcc tctgagagat gtgggactat gtcaagaagc caaatctatg cctgattggt 1380 atccctgaag cagaaaggga aactgagaac accatggaac aagcattcca tgaggtcatc 1440 caagaaaact tcccacatct cactagagag gtgaccattc aagcacaaga gattcagaga 1500 acccccacaa ggtatcacat gagaagacca tccccaagac acatagtaat ccgcctccac 1560 aaagtaagca tgaaagaaaa aatcctaaag gcagcaagag agaaaggtca gactacctac 1620 cagggaagac ccatcagaat cgctccagac ttatctgcag aaacactaca agccagaagg 1680 gactggagtc caatttttaa tgttcttaaa gataagcaat tccagccaag aatttcctac 1740 cccgccaagc taagcttcat cagtgatgga gaattaaaat ccttcccaga catccaatcc 1800 ctaagagaat atgccgcctc cagaccagcc cttcaggaga cgcttaagat ggtgaacaca 1860 ggaaaaaaaa aaaagaatgg tcatacccat cacaaaagta caggcaagca cagagttaac 1920 agagcataca gagcactcac aaaaatgaaa atacacacat atataaatac aaaagtaaaa 1980 aggaaacaaa gcaaccccct aagagcccta tgacagggat aaactctcac atttcaataa 2040 tcaatttgaa tgtgaatgga ctaaatgcgc cactgaagag acatagagtg gcaaactgga 2100 taaagaagca tgatccaaca gtctgctgcc tccaggaaac tcatctcact gcaaaggaca 2160 cccacaggct caaagttggg ggctggaaaa tggtctttca ggcaaatgga aaacagaagt 2220 agctattctg atatcagaca aagcagactt caaaccatca aaagttaaaa aggacactac 2280 ataatgataa aaggctcaat ccaccagcaa gaaatatcca tcttaaacat atatgcaccc 2340 aacacaggag caccaggttt tataaaacaa ctactaagta aactaagaga ggacattgac 2400 tctcacacaa tcatagttgg agacctaaat accccactaa cagctctaga tagatcatcg 2460 aggcaaaaaa ccaacaagga gatctggaac cttaactcaa tgctgatcaa atggatttaa 2520 tagacatcta cagaacactc cacccaacat ctacagagta tacattctac tcatcagcac 2580 atggaacata ctccaagatc gatcacattc tcggacataa atcaagcgtg aacaaatttc 2640 aaaagatcaa aataatacca tgcatcttct cagatcacag tggaataaaa ataaatatcg 2700 ccaccaacaa gatcccccca aaacacacaa agacatggac actaaacaac atgctgctga 2760 acgacttctg ggtcaacacg gaaatcaaga cagaaataaa aagattcctg gaaacaaaca 2820 aagacacatc ttatcagaat ctctgggatg ccatgaaagc agtgttaaga ggaaaattca 2880 tagcattgca 
cgcacacatc aagaaaacag aaagatcaca agtaaacagc ctaacatcac 2940 acctaaggga gctggaaagg caagatcatc taaatcctaa cttcagcaga agaatccaga 3000 tcaccaagat aaaatcacaa ttgcaggata tagaagacaa aaatatcata gaaaggatca 3060 acaagacaaa aagctggttc tttgaaagga taaataagat tgacagaccc ctggccagat 3120 tgactaaaaa aagagagaaa gtccaaataa acacaattag aaatgcaaaa ggcgaagtca 3180 caactgaccc tgaagaaatt caaaagatta tcagagatta ctatgaacac ctgtatgcaa 3240 ataaactaga aaacctaaag gaaatggagg actttctgac atcacacaac ctcccaaggt 3300 tgaaacaaga agaaattgaa actctaaaca gaccaataac aatccaggaa attgactcag 3360 tcataagaaa tctccccacc aaaaaaagcc ccggaccaga tggcttccca gctgaattct 3420 acaagacata caaggaggag ctgataccaa tcttattgaa agtattccag gcaatcgaga 3480 atgatggaat tctccccaac tcattttatg aagctaacat catactgata cccaaacctg 3540 gcaaagaccc aacaaggaaa gagaattaca ggccaatctc cttgatgaac atagatgcaa 3600 aaattctcaa caagattcta gcaaatcgga tccaacaaca catctcaagg atcatccacc 3660 atgaccaggt gggcttcatt cctgggatgc agggctggtt caacattcac aagaccataa 3720 acataattca gcacattaac agatgtaaaa ccaagaacca catgattata tcattagacg 3780 cagaaaaagc atttgacaaa atccagcatc ccttcttgat aaaaaccctt gagcacctag 3840 gcatagaagg aacattcctc aaaacagtaa gagccatcta tgataaaccc acagccaaca 3900 ttttgctcaa tgggcagaag ctggaagcat tccccctaag aacgggaaca agacaaggat 3960 gcccactctc acccctcctg ttcaacatag tgctggaagt cctagccaga gcaatcaggc 4020 aagagaagga aatcaggggt atccaaatag gaaaagagga agttaagcta tccctctttg 4080 tggacgatat gattctatac cttgaaaacc ctagggagtc tgtcaaaagc ctcctcgcac 4140 tgataaatga ctttggtaaa gtcttgggtt acaaaatcaa tgtgcaaaag acagttgcat 4200 ttctatacac cagcaacaag caggcagaga accaaataaa aagcacaatc ccattcacaa 4260 tagccacaaa aaaatgaaat accttggcat ccacctaacc agagaagtga aagaccttta 4320 caatgagaac tacaaaacgc tgctcaaaga aatcaaggat gacacaaaca aatggaaaaa 4380 cattccatgc tcatggattg gaagaatcaa tattgtcaag atgtccatcc tacccaaggc 4440 aatctacaga ttcaatgcaa tacccatcaa attaccaaca tcatttttct cagacctgga 4500 aatgacaata cagaaattca tatggaaaca aaaaagagca cgaatagcca aaacaatcct 4560 cagcaaaagg aacaaagcgg 
gaggtatcac acttccagac tttaaacttt attacaaggc 4620 tacagtaacc aaaacagcct ggtattggta caagaatagg cacatagacc aatggaacag 4680 gacagagatt ctggaagcaa aaacacagtc tctcaaccaa ctcatctttg acaaagccac 4740 caataacaat cactggggaa aggagaccat atttagcaaa tggtgctggg aaaactggct 4800 gaccacatgc agaagaatga aactggaccc ctacctatca ccatatacaa aaatcaactc 4860 aaaatggatc aaagacctaa acgtaaaacc tcaaactata agaatcctag aagaaaatgt 4920 aggaaacacc cttatgcaca tcggagtagg caacgaattc ttgaccaagt ccccaaaagc 4980 aaatgccata aaagctaaga tagacaagtg ggacctcatc aaactgaaaa gcttttgcac 5040 agcaaaagaa accatcaaga gagtaaagag acaacccaca gaatgggaaa aatatttgca 5100 aattatgcat ctgacaaagg cctaacatcc aggatctaca aggaactcaa acaaagtgaa 5160 aggaaacaaa caaccccatt aaaaagtggg caaaggacat gaacagacac ttctcaaaag 5220 atgacataca agcagccaac agacacatga aaaaatgctc agcctcacta atcatcagag 5280 aaatgcaaat caaaactaca atgagatacc acctaactct ggtaaggatg gctatcatta 5340 ataagtcaaa aaacagatgc tggcatgggt gcagagaaaa gggaacgctt ctacactgtt 5400 ggtgggagtg caagctagtt caacctctat ggaaagcagt gtggcgattc ctaaaagaac 5460 taaaaattga ccttccatac gatccagcaa tccctctcct gggaatatac ccagaggaac 5520 ataaatcatt ctacmaaaaa gacacctgca catgcatgtt tattgccgca ctattcacaa 5580 tagcaaaaac atggaaccaa ccctactgcc catcaaaagt ggactggata aagaaaatgt 5640 ggtacatata cacgatggag tattatgcag ccataaagaa gaacaaaatc atggacttcg 5700 cagcaaattg gatggagttg gagaccatca tactaagtga actatcacag aaacaaagaa 5760 ctgaatacca catgttctca ctcataagtg ggccctgaac attagtcaca ttagcacaag 5820 ggagggatcg gcagtcactg ggcactgcta ggggagggag ggagggagaa ggtgggggca 5880 cgtgacaaca caaagggtac ctggtccact accaggggga gtgagggcca gttagaagcc 5940 caaacttcac ctgcatgcag cctacttatg tagctaatcc tcacttgtac cccacatccc 6000 tataataaaa taaaatttta aaaataataa aagaa 6035 // ID ERV1-1B_TSy-I repbase; DNA; PRI; 6654 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Internal portion of an ERV1-type endogenous retrovirus - DE consensus. 
XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-1B_TSy-LTR; ERV1-1B_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-6654 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1190-1190 (2010). XX DR [1] (Consensus) XX CC The ORF could not be fully reconstructed. This sequence was CC derived from sequence data generated at the Washington University CC School of Medicine Genome Sequencing Center, and assembled at the CC Broad Institute. XX FH Key Location/Qualifiers FT CDS 5544..6131 FT /product="ERV1-1B_TSy-I_1p" FT /translation="MYMYRNYLARRKSFPSLFIPARSLFWWPTRPVWYWAP FT AYATKSSQCTIVTQTISNCTVLSSEQANPFQGIPIISQSWTDLSSVNPDLW FT KAPKGLFWICGKVAYAQLPALWKGTCTIGIIQPGFFLLPNPRGDELGIPLY FT ESLKTRDARSLDQVPNIGGTQVWKNDEWPPQRIINTYGPATWAQDGSWGYR FT TPIYIC" XX SQ Sequence 6654 BP; 2056 A; 1362 C; 1426 G; 1808 T; 2 other; tacttggtgc atgggcggga tgggagagtt tggagatggt gtgttccacc tcctttgtca 60 ttggggtaca acctccaggg cgacaggaga ggcaatctca cctggcacac aacagacctt 120 ttgatccgta ggagggaggc ccctcctagt gcactgaggc tcggaccacg aaagcacaac 180 agggacccaa gaggcaagga actggcatga ggatgatcac ctgatggaaa ggacagaatg 240 accacctgac agaaagggta ggaataaggg cctttgcata gacctgccaa cagatggcag 300 aaggggagct tgatcacctc ccagagtctg caagtaatct ccagtctgaa ggaaaggact 360 gaaaggcagt gaagagaaca ctgtctgaag atgaacctgt agactcctag gattagggca 420 gagggttgaa gaatgcattg tgtgattgcg tgtaaatgag atggacggca ggagggttcc 480 gcataaagtt agtctaggtt gggagtttct accaacctgc caattgctaa gaggtgtctg 540 aaatatcttc acaaggagta tggctagagg aggacgaaac agatgtttcc catacccctc 600 attctcttat ttgtgtgtct gtcattttga atctgctaaa aagctggttg agttattgtg 660 ctttgtttaa gggacatctt gtcttttaaa aacttgatta gcgcatgttt ctttatgtca 720 ttttatgttt ccctcccttt atttgtgtgg tgtttctccc aacccgttgg cactgttaaa 780 attcctgcct tttgttgtca 
aggatgtggg aaagttgcct gccgtgtatg tgaagtgcta 840 ccggttatct tgaattcttt cctgggacca caccctctcc cttccttagc ctgcgttctc 900 cttcatgaat tctgaaggag ttcattctaa tccacctgta aagaaacata gaaatatctc 960 ttatatgcta cttagccttt tgctaactca gtgcaagtta atgctagcct tggaaaaaat 1020 aattttgtaa tctttctctg gcaactaagg gactgaataa actgttcttg ttttcaaatg 1080 ctagaagccg ggttagttga aagtaagtct gtttctaaag gtttaggcct tgcctaagat 1140 cgtaatcaac tgcttgtccc tgtaaaactg ttgggaagtc tcttagagtc gtgtaatgta 1200 tgctgagtcc ctactttaaa ttgtaatgct aaataacact actgcttgct tttttggcta 1260 tgcctgacta gcacacctct tacaagactg gtaaagcttt aaatgccgaa tactttgcag 1320 gacttaataa ggcatgggaa ctggccattc ccacctgatt aagtcacctt gtaagctcgg 1380 tacttcacct aagggacagg tcaagtacta ggcacactct gtggtgcata gagtgagacc 1440 taatcactct catttaattg tccacatctt acgactaagg ctactgttta gcaattcatg 1500 aggatttttt tgacaatttt gcctgttcat aattaggaat aagtaggaac agatcatttt 1560 acatctacgt ggtctgttca ctctaaaaag gttttgtacc aggccatttt gtaggcatca 1620 ttggaaagtc atggagcaaa aagggggggc ttcaaggaag gctgagaaca ctcagcctgg 1680 aagaagccat cagggtccca gggatwcctg gcattgctgt ttatggcttt atttagttat 1740 ttgtttgttt tagaattctc ctttgaggaa taaattgcaa gtgttcaaaa caaattctgg 1800 gttttcaaga aacgtgacca agaaaatgtg taataatctg taccctcaga caccccaatg 1860 ggacagattt taaataatta gagcaaatta agattacgtg gtttaaaatt taaataataa 1920 gtgcattatt aaaatcaaat ttgaactcac tacaagttag cagataatga gatttggaaa 1980 ggagcaaaac tatctggcaa ttaaattcac tttgtagacc taaagaaaaa tggtcaaaag 2040 ttctgtatgc agaggcattt aaaaaattta tttcactact ttgcaaacaa gattatatcc 2100 tatagaatat taaatgccaa ctccagaggc ttgactcagg actatgaata ccaaactccc 2160 aattatagaa agcctctgaa gttcttagaa actgaaacat taagaaaaaa aggaaaagaa 2220 ctccaaaatg agcaccagcc agaggaaagg taaacaaacc tattggcagt tgccactaaa 2280 ggcaaagctc cagccaccca aaagaggcct gataaggata aacctgctaa gggggatggt 2340 atcagccgct atgaatgtgg tcaatctggg cacttaaagc acaattgttc ttgatggcct 2400 aagcccccag gaccttgtcc aacatccgga ctaaaaggcc actgaaagag acggtgcggc 2460 ctctcccgaa ggggggggga cacctagctc 
ggtgatggtg gccgttgact gaagggggag 2520 gcacggccca gcccagcgct ctgtttgcca tcaccccaga gaatccttgg gtctgtatga 2580 atgtggctgg ggtccaaatt aatttcttga ttaatactga ggctaagttt ttagcccttg 2640 tgctcatact gaacctcttt ctcctgatac ctgctctcta gtgggagtta aaggtgttcc 2700 aaaagttaaa tgtttcacct aaccagttac taaaaggtga attcaaatta tgactgtaaa 2760 aggttatcta taaaaagata aaaaggaaag caaaaagcaa aggaaaaata taaaaaggtt 2820 gttggtgtac aaataaggtt ttagtaagaa aaaaaggaat ataagagaaa aaatattttt 2880 ttatctaaga agagatctta tatggtaaat ttctgtccta taataaagta aatagttatc 2940 tgaaaatttt aaagaaggcg gtagagaagg ttcaaaaata aaatggtata agcatgccaa 3000 aaatggtatg aggaaagttt atgaaaaaat atatatacaa aagaaattct ataaacaatc 3060 taggcataat aaagtctttt ctggatgcaa gtctccaaac attttcggat ttctcttcaa 3120 attttatata aaatttactt gagtgcctgg ttcaaattat agtctaaaac aatctgtcag 3180 gctaccatta tagtccagaa atacattaaa aactccaaag gctggtactt gtaataaagc 3240 cacagaaatt aaagtttatc atttaagtta tgctaataac ttcaggcctt gtagcttttg 3300 tacaaaaggt gatcagccac tacctaaaat tgtcaaaatg tatgcctgac tttattcctt 3360 ttattgagcc agaatccagt aatttcatgc ccacataacg gtcatgcaag gattccagtc 3420 agtgcctgca aacaatgcca ctgctgaaac agatctggat gcctcaagtc agacagggag 3480 agattttcga tctctggcta ggcaggacta cgcccactgt cagctctgaa gaagttacag 3540 aaaacggacc ttcacccctc agcaccccat atgattatgg gatcgaaaaa tctctaaaga 3600 ataaataatt gaggcagggt agctagcaca ggctagacat agataaattt cccataggta 3660 ggttttagaa ggcgttcaaa cattttaaaa attatattct actgcaggcc aagtggcagc 3720 ctaccccacc cctcggcaaa tccaaagaaa actccatcag atgtaggact aggacagtac 3780 aatctttcaa tttgcggttt agcataataa tcactagaaa tcctttggag acacgagtaa 3840 actgaacata ggtatttggt attgcatcgc tcaggatcca agtcagtccc tgaaatggga 3900 ttcgttaaaa ctttgaaatt gtttaacggt aggccttttt cctacacctc tgaaggaaat 3960 ggggccacaa acataagtgc tcataaggaa atgtgttgtc catttgggac aaattttaat 4020 cacagttcat aagcttgctt ccctaggaac ttcctggacc atggaaaacc caaacatccc 4080 aggggagtaa gtgctgctaa aaaattgaag agaggaagga cccaaccaac agctgtaaga 4140 aaagaaaaaa gggaccctaa gggtggtcct gttcctatac 
tagtacatgc tattacccag 4200 attgttaaat ttgtgggtaa cttctagtct caggagcctc taggcttccc taacgggtct 4260 cacccctcca gggaactaac tgaaagcatt taaatctgta cattcaaaag ttcaaagtaa 4320 aaagtcgaac agctaataat aataacctaa caagaattag tttacagcaa tggcccatta 4380 agaaaacaag cagtgtgttt ataccagccc aactacccgg tgtcaaattc agaagttttg 4440 gggggctgcg aggttctata gaatttggat acctaacttc tctatcatga caaaacccct 4500 ctatgaagcc tcaaaggggg tgaacaggag cctctgctat gggaaaaagg cccagaaaca 4560 ggcctttgaa taaattaaac aggccctcac caatgcacct gatctgggac tcccagatgt 4620 gactaagcct ttctttttat acgtgcatga acgtgttggc actgcagtag gggccttaac 4680 tcaaatgcta gggtcatggc atagacctgt ggcatattta tccaaacagc tagattctgt 4740 ggcccagggc tggccgcctt gtttatgagc cctggcagcc acagcactcc tgataacaga 4800 agctgwcaga ctaactatgg aacaacacct aacagtcagg gtgcctcgtg cggtaataaa 4860 tttaatgaaa tacaagaggc agtattggct aaccaatgct cgaatggtca gatgtcaggg 4920 aatgctctgt gaaaaaccca cgtattcacc tagaggtagt gaagacctta ctacccatag 4980 agccaggaca gccagaccac gattgcatta aactaataaa tgaagtgttc tctagccatc 5040 cagatttaac agaccagccc ctcagggacc ctgaggtcga atacttcacg gaccggagca 5100 gtttcataca ggaggaggag cgctttgcgg gctattccca ttcttttaag ccaggtcacc 5160 aaggccgatg gaaaacccgg aacggatttc tcaagccctg ctctaaaagc ctgaggatac 5220 ttcggccctg ctttaaccac acactggagg ctggctgatt gactcacggc agaaggttga 5280 aaattcagct taaggactca gcggacatta ctgataagcc attgttttgc tttgtcaacc 5340 tttatctttg ctaaactgtt tgtcatttta attgctgtaa aactgacttt atttgctata 5400 gaattaacag agttgtcccc atcgaacatg ttagaaaagg gtctcactag catttccatt 5460 ctttcttatg ataggaagtt ttcttgtgct tttatgtatt tctgcctaat ttgggtgttt 5520 atcgcttccc ccacacaccc agcatgtaca tgtatagaaa ctatctggcc agacgtaaga 5580 gttttccatc gttattcatc ccagccagaa gtttgttctg gtggccaaca aggcctgttt 5640 ggtattgggc acctgcttat gcaactaaat cttctcaatg tactattgtc acccaaacta 5700 taagtaactg tacagtacta tcctctgaac aagctaaccc gtttcaaggt atccccatta 5760 taagtcagag ctggacagat ctgtcctctg taaatccaga cctctggaaa gcccctaaag 5820 ggctattttg gatatgtgga aaggttgcat atgcccagct cccagcatta 
tggaaaggca 5880 catgtaccat aggaataatc cagccaggat ttttccttct gcctaaccca aggggggatg 5940 aactaggcat tcccctttat gaaagcctca aaacacgaga tgctcgatcc ctcgaccaag 6000 ttcctaatat aggagggacc caggtttgga aaaatgatga atggccaccc cagagaatta 6060 taaatactta tggcccagcc acatgggcac aagatggaag ttggggatac cgcacgccta 6120 tatatatatg ctaaatcgca ttgttaggtt acaagctgtc ttagaagtta taactaatca 6180 aacagccatg gcttttgagc ttgttgctca gcaacaagcc caaatgcgcg cagctattta 6240 tcaaaaccgc ttagctttga actacctgct agcggaaaaa ggagaagtat gtggcaaatt 6300 taatagctct aactgttgtt tgcaaataga taatagtaaa gcaataactg atatagccac 6360 caatattaga aagcttgccc atgtcccagc gcaaaaatgg caaggcataa aaatcggcaa 6420 ttggtttgaa aacatgttct cgggtctggg aggatttaag tacattattg aatctatagt 6480 cctattggtg ggatcctgtc ttatcctccc ttgcatagcc ccaataatta tgaacgccat 6540 ttctaagttg tggaaacagt tgttgaacga aaaactgcag cccacatgat gttgatgcac 6600 caaattaagg atgatgctct agacccatga ggggtcaagc atcaaagggg ggaa 6654 // ID DNAX1_OG repbase; DNA; PRI; 190 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; nonautonomous; DNAX1_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-190 RA Jurka J.; RT "DNA transposons from bushbaby."; RL Repbase Reports 11(5), 1467-1467 (2011). XX DR [1] (Consensus) XX CC ~93% identical to consensus. >15,000 copies phg. TA tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 190 BP; 55 A; 35 C; 54 G; 46 T; 0 other; cagtagaacc tctgtaagtt gaccacccaa gggactgtaa caaactggtc aacatacgga 60 ggtggtcaac ataaggaact aggcctactg tactgatacg tacatgtggt gcatgtccgg 120 tctatgaaaa ttaggtcaac ttaaggaggt ggtcagtgta gggaggtggt caactatgga 180 ggttctactg 190 // ID LTR22 repbase; DNA; PRI; 571 BP. XX AC . XX DT 16-MAR-2006 (Rel. 11.03, Created) DT 04-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Long terminal repeat of ERV2 Endogenous Retrovirus from primates. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; LTR22C; LTR22. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-571 RA Smit A.F.; RT "LTR22- a subfamily of endogenous retroviruses from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC 6bp duplications, 4-5% divergence from the consensus. XX SQ Sequence 571 BP; 149 A; 118 C; 144 G; 158 T; 2 other; tgttgggatt cactcaggat ggtggcagaa atattaaagg gaaatattag ggaaagttat 60 agggaatagt cacaaacctt tttggaaggc tgaaaggtta catagcttgt aataattgaa 120 caggctgaag gcagccggtt cttaccttag agcattaggt catagggtaa atactaggga 180 caatagaggc ttccccagtt aagtctgttt accctacctc cattaactaa cctttgagcc 240 agatggccct cttgggggag gtcgaccagg gatattgccc cctaatggta tttactttag 300 accgnggtac ctgagcttta atcattcgta gaactactct cttaaccatg ttaattatcc 360 acaagtgtgt tgactcagag cttctgttgt taattgtata ctaaataaat gcctggagtg 420 caagctgctc agggccggcc gcagtgacaa acctctcttg gtgtgcaggc ggtcggacac 480 tcagcnggac tggcaaaaca gaatatctgt gtgtcagtgt acgttttatt catccgtcgt 540 ttgggtcagg gtctgcgggc agacccccgc a 571 // ID LTR13C1_OG repbase; DNA; PRI; 453 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR13C1_OG. 
XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-453 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1588-1588 (2011). XX DR [1] (Consensus) XX CC ~95% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 453 BP; 104 A; 156 C; 79 G; 114 T; 0 other; tgagagggcc agactagctc cattttgtgg tttactgcac aagccccatt cctagctgtt 60 tactacacag tggtttactg cacaagcccc actcctagct gtttactgca caaactccat 120 tcctagcccc tccgccccaa cccctgccag taaaacctgc aaagccactc ccgctgcctt 180 cccccacccc tcacctgtac caatccaaat cctcctttac taatgaaatc accaaccacc 240 aatcaaaagt gtactcatac cctcctcccc ttcttgcttt ctgtacccca taaaaacttg 300 ctccaccccg ggttcggggc tcgtcctagg cttgcatgcc aggagagtcc agacccgggt 360 ctgaataaaa cctcacctca tgccttttac atcggagtgc gtctcggatt cgtctgttgg 420 ggctatttgg gagaataatt ttcgggcaca aca 453 // ID ERV2-1B_TSy-LTR repbase; DNA; PRI; 937 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV2-1B_TSy-LTR. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-937 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1257-1257 (2010). XX DR [1] (Consensus) XX CC >94% identical to consensus. 6bp tsd. 
CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 937 BP; 245 A; 192 C; 220 G; 279 T; 1 other; tgttgggagc cggccacgga ggcccactga gcgttaggga tcaggaaggg ccgttgaccc 60 cgagaccact aggatgcgtt cgggagctgt tgaccccatt ccctccctcc ttttccactt 120 ggcacaagca accaggaaag gctcctgctg gctctaacag aggactctga aaccggaacg 180 aacacagcta acttgggagt ctattgttcc cattttccac ggtttatggt ttttgctttt 240 aaacaattta agtttaactt tgagtcacat tgaggagcct tagtgcaagt cacaggttct 300 cttgatgtga ttttagaaat ttatagatat aggttttaga atatagttta tatagattaa 360 gaaaatttaa gtaaccattt tattatagaa gtgtggttag ttcagtttga ttcagggggg 420 ccagtgaagt ttgaattagg ccgctggtgc taggagagtg taaccatagc agcaactgtt 480 tcatagaaac tagtttgaat tagcatagaa tagtaaaagg taatatatca ttatgttggt 540 ttcgggtgta agcattgaac gatgttatgt agctataaac aattaagggt acatagaaac 600 tggataagaa cttcatgttt ttcttcatgt ttgaaaaata catctgcttg agagtggggg 660 gtgtcctgtg tgcacctgga cctgktcatg tgaaaggcac atgctacctt ttgtgtaacc 720 atagcagcag gagttcctgt gatgataact ttgcagagtt tggggataaa agccctggga 780 ttacacccat taaacggctt ctggtcactt gaacaggagt ccagtctctt tctttctccc 840 ccctttcttt ctctttatat ctatccctcc agtttaaatc gctgatgccg tcacaccacc 900 agggaccccc gatccctgca ggggcaggac ccccgca 937 // ID Alu1_OG repbase; DNA; PRI; 290 BP. XX AC . XX DT 06-APR-2010 (Rel. 15.04, Created) DT 06-APR-2010 (Rel. 15.04, Last updated, Version 3) XX DE SINE element - consensus. XX KW SINE1/7SL; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Alu1_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-290 RA Jurka J.; RT "SINE elements from the bushbaby genome."; RL Repbase Reports 10(4), 638-638 (2010). 
XX DR [1] (Consensus) XX CC The top youngest sequences are >93% identical to consensus. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project. XX SQ Sequence 290 BP; 70 A; 78 C; 98 G; 43 T; 1 other; gccgggcgcg gtggctcacg cctgtaatcc cagcactgtg ggaggctgag gcgggaggat 60 tgctcgagct caggagttcg aggctcgtct gagcgagagt gagaccccga ctcatggaaa 120 aaaatggaaa aacccagccg ggcgccgcgg cgagcgcctg taatcccagc ggcttcggag 180 gctgaggcag caggatgccc acaagccgga gtctgaggtt gcagtgagct acgacgccac 240 tgcactctgc tcagggcana gggtagaact ctgtctcgac aaaaaaaaaa 290 // ID ALRb repbase; DNA; PRI; 171 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 14-NOV-2005 (Rel. 10.11, Last updated, Version 1) XX DE SAT Satellite from primates. XX KW SAT; Satellite; Simple Repeat; ALRY-MAJOR_PT; ALRb; ALRa. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-171 RA Smit A.F.; RT "ALRb_ - SAT Satellite from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC 75% identical to ALRa. XX SQ Sequence 171 BP; 54 A; 27 C; 33 G; 54 T; 3 other; ttgtggaatt tgcaagtgga gatttcaagc gctttgaggc caawnktaga aaaggaaata 60 tcttcgtata aaaactagac agaataattc tcagtaactt ctttgtgttg tgtgtattca 120 actcacagag ttgaaccttc ctttagacag agcagatttg aaacactctt t 171 // ID LTR1A2_OG repbase; DNA; PRI; 785 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 05-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1A2_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-785 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1666-1666 (2008). 
XX DR [1] (Consensus) XX CC 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 785 BP; 188 A; 208 C; 200 G; 187 T; 2 other; tgataccgga aaagtggcac cccagccagc tcagccgagc ggggcctggg ggacctgccg 60 cctctgtctt ggtggcagca gccggcagag gaaccggcag gctggagcaa gcgggggagc 120 tttcctcccc cactagaggc tccagtagag cagtgctara ggacctgcca atcaggggaa 180 aacactggcg attcccctcc agtgcatccc tgattggtcc attttcaatc ctgctcctga 240 ttggtccgtt ttatagacca atcagctcag ctaagctctc ggctcattac ccctatataa 300 gccccctgag cagagagcct aggggcagat catcgcagat ggttgcagac cttcgaggaa 360 gagagagcag aagagcaaga cttcagcaga ccgttgcaga ccatccagag gaagaagcga 420 gagctgtarc acttctactg ctgatcctgc tctgcaaaga ggcttggctc tttgtaagag 480 ctgtaacact tctactgctg atcctgctct gcaaagaggt ttggcccttt gttgatctct 540 gccctgcaaa gaggcttggc cctttgtaag agctgtagca catcttatcc tgctctacaa 600 agaggcttgg ctctttgtaa gagctgtttt cataagagtt gtaacacttg catcctgccc 660 tgcaaggagg tttggttctt tgagctgtaa ataaaacctc actgttcatt ctatggtcca 720 tcgattcatt catcgaatcg tcgagaccaa gagcctggaa atccaatgtc aaaaatccgg 780 tatca 785 // ID LTR22B_Cja repbase; DNA; PRI; 452 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR22B_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-452 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2928-2928 (2009). XX DR [1] (Consensus) XX CC >86% identical to consensus. 6bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. 
XX SQ Sequence 452 BP; 113 A; 100 C; 119 G; 120 T; 0 other; tgtaggagat cggtcagaat ggtgggagaa attatagaaa acgcaaacct tcttggaagg 60 ccagaaggtt ttgcaaaagc cttggaagag gttatggctg aaggcagcct aatcctctta 120 ccttgagcta atagcaaaga gcagataaca agggaatgta gaggagttta tctaaatagc 180 ttgtttactc atgtggtcct aagaccgacc tttgatcatc cgcgggcgca tgactgctct 240 ctactcgggg ggcggcaatg ttaattaccc tctaatggtg tttacttgag acctttgtca 300 tttaatctgt actaaataaa tgtgaacatc gccggcttat cggggccgct gctgactctg 360 gcagaggtcc cctggccacg ctgactggca agcctgtgtc agtgtacgtt tttcatccat 420 cgctcagtcg agtctgccgg tcggactcga ca 452 // ID LTR2B1_Mim repbase; DNA; PRI; 699 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; nonautonomous; KW LTR2B1_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-699 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1718-1718 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 699 BP; 166 A; 164 C; 180 G; 189 T; 0 other; tgtaggggac tgacatatat tttcctagtt taagaattaa agttagtgag taatgtacgt 60 gttctgccca gagcgtttga aagtaatctg ttccggacag gctccctgtg ataaacaaag 120 gtgttgccta gaggcataac aaaggacctg agctgcaagt agcaagagcg gccccgcccc 180 acggcattgg gggtctggag gccacacgat aaacatattg ttcctgtgat tgctgcgtac 240 accccataag atggctggtt agtcaatgac gggtaagacc cctcaaggga ggggcgacct 300 aagccaggca cagccgccgg ggttcagccg aagatcttag ggggtcaccc taagagaagc 360 tggggatgag aaatgccccc tgtggctcac cttgcccatc cttgctaatc tcggtccttt 420 atctatgcct agtgcctaga gcaaccacct tgaaattctg agtcagggga tatctgcttc 480 cctaaggcta cgcttccgga atttatggct atcgctgcca cgcctgggtg gttacgtaca 540 aggaactaac tttaatcttt tagagtataa aaataaaacg gacccggtgt attattactc 600 gagtcaaccg tttgtacgtg tgtctgcgtt tttctttgtg ttctgcgttt gtgttttatg 660 ttctgtgttc atcctccgtc ctgtaaacgg gccacgaca 699 // ID MER4D_LTR repbase; DNA; PRI; 903 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; MER4D1; MER4D_LTR. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-903 RA Smit A.F.; RT "MER4D_LTR - ERV1 Endogenous Retrovirus from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC mer4 group. 
XX SQ Sequence 903 BP; 259 A; 238 C; 156 G; 249 T; 1 other; tgtaaaccaa aaataaaatt ctaagccccc caaccgactg aatggacccc tcctctcggc 60 caaggggatc ccaaagaaac ctgaaaaact agttcaggcc atgatgggaa ggggggtcgg 120 acatgcctcg ttataccctc ctccctttgg agttcaggca caactgacca gcattaacat 180 taaaacagag atcntaagac tgacaaaaca gactctttgt agcaataaga taccaaattc 240 caacctgact ctagtatagc atcacatgac agatagcagg ccctgaagga aatcaaagta 300 ttttacccca aaatatattt ctttgacata ttttgaaatg gccctgcaaa gccgtctctt 360 gtgggggaaa tctacattct gtagagaatc cccttccctt tccaggtctt ttcctgatcc 420 aggagagatt taactaagag tctggcacct tttaaggtct gataagagac atttaccatc 480 tattctctct gaagcctgct acctggaggc ttcatctaca taacaagaac cttggcttcc 540 acaactcccc ttatcttaac ccaagcattt ctttctgctg acttcaactc tttaggcaaa 600 gcttaactct ttcaaccaat tgccaatcag aaaatctttg aatccaccta tgacctggaa 660 gcccccgctt cgagatgtcc cgcctttccg ggccgaacca atgtatacct tacatgtatt 720 gatttatgtc tttgcctgta acttctgtct ccctaaaatg tataaaacca agctgtaacc 780 caaccacctt gggcacatgt tctcaggacc tcctgaggct gtgtcacggg ccatggtcac 840 tcatatttgg ctcagaataa acctcttcaa atattttaca gagtttggct tttttcgtca 900 aca 903 // ID MacERVK2_LTR1a repbase; DNA; PRI; 317 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Cercopithecidae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; LTR; KW MacERVK2_LTR1a. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-317 RA Smit A.F.; RT "MacERVK2_LTR1a - ERV2 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 1-2% MacNERVK2 LTRs. 
XX SQ Sequence 317 BP; 84 A; 64 C; 86 G; 83 T; 0 other; tgcaggggcc tttgaaaaca gtatgctgag aatagatagg gctcaaggac agtatctcca 60 attgaacttt taatgaactc ccgtaggcgc caagaaggcg ggcagataag gaagagcata 120 gaaacaagga actgagtaga aggttacatc tgggaaaaga aagtttgttt tgcattgcat 180 tctttctgct gacgtggggt ttatggagta taaataaagt gaggcggagg ctctgagcgc 240 ggccgccatg ttctctgtgt gtctttgtct tttgtgtgtt ctttcattct ccaccacccc 300 cggcacggac cccaaca 317 // ID ERV2-3_TSy-LTR repbase; DNA; PRI; 1426 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV2-3_TSy-LTR; ERV2-3_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-1426 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1207-1207 (2010). XX DR [1] (Consensus) XX CC >95% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 1426 BP; 475 A; 255 C; 289 G; 407 T; 0 other; tggcggaggc catgacaatc tggggctcag catgggaacc gaatgaagac acccccctgg 60 aaggcaaagc cccagagggg atccgtgtgc gtggcaaggc cccacggaaa gtcttgagga 120 aagttctgca tagctgccct gtgactcatg gtcgcctggt tattcccatt atattgttat 180 actcctgcct tacaggaata atatgtttca atctccctcg gaaaagaata atgtaaccag 240 aattggaggg tacagtgtct acaatgccag taataaagca gtcaatgttt caacatcggc 300 tacgcctcag gaaatacaaa caagaacagg agaggtcatg acgacccaac cgtcaataga 360 accaatcact gtaacatcca ctgcaacaaa atgtaccttt aactgtatgg caaatgaaac 420 acttagagat tttctattaa agttgtggca taaggaatta acagaaactg aggtaaaaac 480 tgttacagat aacataggta tagttaaaca tcagattagt cataaaaccg gaacccccaa 540 aaataaagct cttaagcctt tacctactat tagagcagga caacatgtgg gatataatat 600 taataataga gtttggtggg cccctagggg tagaagcagc atgcaagatt tatcgtggct 660 agatcctcct ttaccttaca ccggagctta tgatgaagac atggttagat gggcttgggt 720 agatggcaga tatgttaaat gaaactctct gtctctctgg gaaagaatta aacgtagtag 780 cccaccatgg tgtttaattc taaaggcata gaaaataaaa atctgctaca aagtaaacta 840 atggtataag aaaattaaat gtagctatta tacaatagaa gattgtgaga atagatataa 900 ttggttaagc ttgatataat tagtaataat gattttttgc tattgcttaa ttataaaaat 960 aagttaggaa tatttagaat agatgaagta agctttgtgt gaatccacat gttattgctt 1020 taatagctga ttgtcctttt ttacttttta ttttccttat agaaaattat atattttcat 1080 gaaataggat gttaagttta taaggttaat agtccagcag gtggtggtag tgataaccac 1140 aaaaaccggt tttgggttta tgggcaagtt gtattaatca ttaactttat aataaatctt 1200 gtaagacatg agaacataca gtgtctgtga taaaataatg aaatcatctt gtgattgtat 1260 aaccacaaaa aatgattaat ttataattta ctataaataa acgcaacctg gtatgtttga 1320 ttgccagagt gcctgaaaag aaactctggt cacgactctt ccttccctcg cctcatcacg 1380 cctccttctc gggtccccgt gaaccccccg gagctggcct ccggca 1426 // ID MER87_OG repbase; DNA; PRI; 559 BP. XX AC . XX DT 19-OCT-2009 (Rel. 14.11, Created) DT 19-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; MER87_OG. 
XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-559 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2868-2868 (2009). XX DR [1] (Consensus) XX CC ~81% identical to consensus. 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 559 BP; 146 A; 145 C; 114 G; 154 T; 0 other; tgtaacagtg gaagggaggc ttagcatgac tagctccatt ttgcttctga cccctgcggt 60 aatatccttt aggttaaaag cttctgctta gctctgcacg taggccagct aattatagga 120 agaatttagt ttgtagttca actctagaag aaagatgata accgtccctt tcccaaaact 180 aactcccggg gagataagga aagtatgcac acaagtaaca atgctatgtt aaagatttat 240 aggaacattg tgacctgacc tacgtcatcc aaagttaact gaccaagaac aaagaagttt 300 cacaacctcc tcggaccctc gctgacgccc agatgtctgc ggtcctcggt tacctcttgc 360 actcctccct tccccttccc ttaacataaa aaggagccta aagttcatac taacttaaga 420 tggttcttcg ggacattagt ccgccatctt ctcggtttgc tggctctccg aataaaagtc 480 gctttccttg ccccaacacc ttgtctctcg acttactggc tgtcctgcgg cgagtggtac 540 gagcttggac tcggttaca 559 // ID MER11B repbase; DNA; PRI; 1236 BP. XX AC . XX DT 01-OCT-1995 (Rel. 1.09, Created) DT 26-JUN-2008 (Rel. 13.06, Last updated, Version 4) XX DE LTR from HERVK-related endogenous retrovirus HERVK11. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; HERVK; KW HERVK11; LTR; MER11; MER11B; subfamily MER11B. XX NM MER11B. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-1236 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 19(17), 4731-4738 (1991). XX RN [2] RP 1-1236 RA Smit A.F.; RT "Structure and evolution of mammalian interspersed repeats."; RL Ph.D. 
thesis (USC 1995). XX RN [3] RP 1-1096 RA Kapitonov V.V. and Jurka J.; RT "MER11B."; RL Direct Submission to Repbase Update (17-OCT-1997). XX RN [4] RP 1-1236 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR [3] (Consensus) XX CC MER11 is a retroviral LTR [3]. It has been proliferated by CC HERVK-related CC retrovirus HERVK11 [3]. 6 bp target site duplications [3]. CC [4] ~10% div. XX SQ Sequence 1236 BP; 307 A; 306 C; 244 G; 366 T; 13 other; tgttgcggga agtcagggac cccaaacgga gggaccggct gaagccatgg cagaagaaca 60 tggattgtga agatttcatg gacatttatt agttccccaa attaatactt ttataatttc 120 ttatgcctgt ctttactgca atctctaaac ataaattgtg aagatttcat ggacacttat 180 cacttcccca atcaataccc ttgtgatttc ctatgcctgt ctttacttta atctcttaat 240 cctgtcatct tcrtaagctt catgagctga ggatgtatgt cgcctcagga ccctgtgatr 300 attgcgttaa ctgcacaaat tgtttgtaca gcatgtgtgt ttgaacaata tgaaatctgg 360 gcaccttgaa aaaagaacag gataacagca attgttcagg gaacaagaga gataacctta 420 aactctgact gccggtgagc crggcrgaac agagccatat ttctcttctt tcaaaagcaa 480 atgggagaaa tatcgctgaa ttctttttct cagcaaggaa catccctgag aaagagaatg 540 cgyacctagg ggtaggyctc tgaaatggcc cccctgggag tggcctgtct tttatggtng 600 aaactgcagg gatgaaataa rccccagtct cccatagcgc tcccaggctt attaggawga 660 ggaaattccc gcctaataaa ttttggtcag accggttgtc tgctctcaaa accctgtctc 720 ctgataagat gttatcaatg acaatgcgtg cccgaaactt cattagcaat tttaatttcg 780 ccccggtcct gtggtcctgt gatctcgccc tgcctccayt tgccttgtga tattctatta 840 ccttgtgaag tacgtgatct ctgtgaccca caccctattc gtacactccc tccccttttg 900 aaaatcccta tttaatttcg ccccggtcct gtggtcctgt gatctcgccc tgcctccayt 960 tgccttgtga tattctatta ccttgtgaag tacgtgatct ctgtgaccca caccctattc 1020 gtacactccc tccccttttg aaaatcccta ataaaaactt gctggttttt gcggcttgtg 1080 gggcatcacg gaacctaccg acatgtgatg tctcccccgg atrcccagct ttaaaatttc 1140 tctcttttgt actctgtccc tttatttctc aagccagccg acrcttaggg aaaatagaaa 1200 agaacctacg tgattatcgg ggcaggttcc ccgata 1236 // ID LTR13_OG repbase; DNA; PRI; 599 BP. XX AC . 
XX DT 18-OCT-2009 (Rel. 14.11, Created) DT 18-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR13_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-599 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2859-2859 (2009). XX DR [1] (Consensus) XX CC >96% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 599 BP; 144 A; 193 C; 108 G; 153 T; 1 other; tgtgaggacc agatcagctc cattttgttg tttccacata agttactgtt cctgggccct 60 ggcctgcagg gcccaccctg taaccctccc cattccattc atagttgctt aaaccacacc 120 cgcagggccc accctgcaac ctttctcatt cctttcatag ttactcaacc aaatagtacc 180 accctgggca acagataagt ttctcattnc cttgaaatac aggtaaccag gttgcaccaa 240 ctctagcccc cggttaaggg accctccacc ccttacctgt accaatccaa attcaccttt 300 actaatgaaa tcaccaacca ccaatgaaaa tccaaattca cctttactaa tgaaatcacc 360 aaccaccaat gaaaagtgta ctcgtatccg accctacccc tcttgctttc tgtaccccat 420 aaaaactgcc ttcaccccga gttcggggct cgtcctcggc ttgcatgcca ggagagtcca 480 ggcccggggc ccccgggctt cccgggtctg taataaatct catcttatgc cctttacatc 540 ggagcgagtc tcgagtttgt cggttggggt taatttggga ggctttctcg gacccaaca 599 // ID piggyBac_2a_Mm repbase; DNA; PRI; 1043 BP. XX AC . XX DT 24-MAR-2010 (Rel. 15.06, Created) DT 24-MAR-2010 (Rel. 15.06, Last updated, Version 1) XX DE piggyBac2a_Mm: consensus sequence. XX KW piggyBac; DNA transposon; Transposable Element; piggyBac_2a_Mm. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-1043 RA Pagan H.J.T., Smith J.D., Hubley R.H. 
and Ray D.A.; RT "PiggyBac-ing on a Primate Genome: Novel Elements, Recent RT Activity and Horizontal Transfer."; RL Genome Biol. Evol 2, 293-303 (2010). XX DR [1] (Consensus) XX CC piggyBac2a_Mm is a truncated form of the autonomous piggyBac2_Mm CC from Microcebus murinus and contains an open reading frame. XX FH Key Location/Qualifiers FT CDS 95..859 FT /product="piggyBac_2a_Mm_1p" FT /translation="MDLRCQHTVLSIRESRGLLPNLKMKTSRMKKGDIIFS FT RKGDILLLAWKDKRVVRMISXIHDTSVSTTGKKNRKTGENIVKPACIKEYN FT AHMKGVDRADQFLSCCSILRKMMKWTKKVVLYLINCGLFNSFRVYNVLNPQ FT AKMKYKQFLLSVARDWIMDDNNEGSPEPETNLSSPSPGGARRAPRKDPPKR FT LSGDMKQHEPTCIPASGKKKFPTRACRVCAXHGKRSESRYLCKFCLVPLHR FT GKCFTQYHTLKKY*" XX SQ Sequence 1043 BP; 333 A; 196 C; 224 G; 287 T; 3 other; cacctttcgt accgctcacg agttttcttg tgtttcgcgc gccatctgtt aaggaccgct 60 cacgagtttt ctcgtgtttc gcgcgccatc tgttatggac cttagatgtc aacacactgt 120 cttgtccatt agggagagta gaggtttact gccaaatttg aaaatgaaaa catcaagaat 180 gaagaaaggt gacataatat tttccagaaa aggcgatatt cttctcctag catggaaaga 240 caagcgggtt gtccgaatga tatcaaygat ccatgacact tctgtctcga caacaggaaa 300 aaaaaataga aaaacgggag agaatattgt aaaacctgcc tgcatcaagg aatacaatgc 360 ccacatgaaa ggcgttgacc gtgcggatca attcctttcg tgttgttcca ttctaaggaa 420 aatgatgaaa tggacaaaaa aagtagtgct gtaccttata aactgtggac ttttcaattc 480 atttagagtg tacaacgtcc tcaatccaca agcaaaaatg aagtataaac agtttctgct 540 atcggtggcg agagactgga taatggatga caataatgaa ggctctccgg aaccagagac 600 aaatctgtcc agcccttccc ctgggggtgc aaggagagca cctcgtaaag atccacccaa 660 aaggttgtca ggtgatatga agcagcatga acctacgtgt attccagcga gtggaaagaa 720 aaaatttcct acgagagcct gcagagtttg tgccrcccat ggaaaaagga gcgaatctag 780 atacttatgt aaattttgtt tggtccctct tcatagagga aaatgtttta cgcagtacca 840 tacdttaaaa aagtactagg aactttaatt gtttaattag ttgtttaatt gtttttgtaa 900 ataaaaatgt tataattatt gaaaaacaac acctaaagtg cattatgatc tgtagttatg 960 atgatttaaa taacgtgcag tttgcccaaa aacgtgcggt ccctggcgta tgtcttagag 1020 atttctatgc ggtacgaaat gtg 1043 // ID LTR1_TS repbase; DNA; PRI; 446 
BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR1_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-446 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1258-1258 (2010). XX DR [1] (Consensus) XX CC >91% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 446 BP; 102 A; 130 C; 109 G; 105 T; 0 other; tggggagcag atgagcactg tcactgtagc aggccttgag caaggcaagg ccccaagcga 60 aggcaaggtc tcgcttggta gctctgcatc ctagccaaag ggcgagggat gcagcctgct 120 acacggaacg gtgtctcaga tgcccacaca tccgggtgca catagtacag ataaagcctt 180 agctacaaga taaccgcgga acagctcgcc tttaggtagc aacaaaattc cgttatggct 240 ccctcacaga ccttgaatcc cttccaactt cctttgtgct tgattataaa ataaacatgc 300 tgagctagtt cggggccact tctcctccat cagaggaaag tgtcccacct ggccccagct 360 tttctttatg tctttgtgtc tgtgtctttc ttaatctctc gccgctctct ctcaggaccc 420 tcaaggcagc cgcgcgggcc gcggca 446 // ID Alu1_TS repbase; DNA; PRI; 299 BP. XX AC . XX DT 09-APR-2010 (Rel. 15.04, Created) DT 09-APR-2010 (Rel. 15.07, Last updated, Version 3) XX DE Alu-like SINE element - consensus. XX KW SINE1/7SL; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Alu1_TS. XX NM Alu1_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. 
XX RN [1] RP 1-299 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(4), 632-632 (2010). XX DR [1] (Consensus) XX CC ~97% identical to consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 299 BP; 76 A; 78 C; 96 G; 49 T; 0 other; agccgggctc ggtggctcag cctgtaatcc cagcactttg ggaggctgag gtgagtggat 60 tgcctgagcc cgcgggttcg agacccgcct gggcaacttg gcgagacctc atctctacaa 120 taaatcaaaa aattagccgg gcgtggtagc gcgcgcctgt agttccagct acttggaagg 180 ctgaggcgga aggatcgccg gagcccagca ggtcgaggct gcggtggccg ggagcggcca 240 ctgcactcca gtctgggcga cagagtgaga ctccaactca aaaaaaaaaa aaaaaaaaa 299 // ID GSAT repbase; DNA; PRI; 217 BP. XX AC X68545; XX DT 18-APR-1997 (Rel. 2.03, Created) DT 04-AUG-2008 (Rel. 13.07, Last updated, Version 3) XX DE H.sapiens gamma satellite DNA. XX KW SAT; Satellite; Simple Repeat; Centromeric satellite DNA; GSAT; KW Centromeric; tandem repeat. XX NM GSAT. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RA Lin C.C.; RT "Direct submission."; RL Direct Submission to EMBL/GenBank/DDBJ (30-SEP-1992). C.C. Lin, RL Dept. of Lab. Medicine & Pathology, University of Alberta RL Hospitals, 8440-112 Street, Edmonton, Alberta, T6G 2B7, CANADA. XX RN [2] RA Lin C.C., Sasi R., Lee C., Fan S.Y. and Court D.; RT "Isolation and identification of a novel tandemly repeated DNA RT sequence in the centromeric region of human chromosome 8."; RL Chromosoma 102(5), 333-339 (1993). XX RN [3] RP 1-217 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR GenBank; X68545; Positions 1 704. XX CC CC [3]. 
XX SQ Sequence 217 BP; 41 A; 64 C; 88 G; 24 T; 0 other; gctgggagcc tcccaaggag gcctctccca tcccagaagc ccccagggct gtcccgggcg 60 ggctgtaaag ccccaggctt tggagcaggg tgcctgtgtc tctcgcggaa ggcccccaca 120 agcgaaaacg gggccgcagg gtggcgtggg cgggccgcag ggactcaggg ggacgttgag 180 gcaggcagag gggagaagcg gcgagaccgc agggaat 217 // ID LTR21_TS repbase; DNA; PRI; 540 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR21_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-540 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1273-1273 (2010). XX DR [1] (Consensus) XX CC >98% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 540 BP; 137 A; 138 C; 104 G; 161 T; 0 other; tgagagaccc aagagattaa aaaaaaaact ctaagagaac tattactata tgaattccta 60 ccccacgttc actgtatttg tttgcagtgt ttgtacttag aattttgtta agtttaattt 120 ttgtttgtct cataaaataa ccgattgctt ccgcaactgc tgctagcagt atgtgaatgt 180 caagtaataa acccctcccg gacattcctt tgcttagcct gcctgctgac attccactct 240 gctcatttag cagctccccc gtcaatcaca ggtagctgag cggaccagac caatcccaag 300 tgcccccgta cccattagga gaaattaccc tatcatgtaa aaacaagttt aaagttcttt 360 cccctccctc tgtaatgatg tataaaaact gcttgccatg ctggatcggg gctctcgtcc 420 ctgaagctgc tgcttcgggc gtgagcccag actcgagcct gaataaagac ccttgtgtgt 480 ttgcatcgga gctggctctt tgtggtctct atctctctca ttagagtccc gggttcaaca 540 // ID MacERV5b repbase; DNA; PRI; 7205 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 
13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; MacERV5b. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-7205 RA Smit A.F.; RT "MacERV5b - ERV1 Endogenous Retrovirus from Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 5 bp TSD, but ERV1-class proteins. 6% subst. ORFs: gag 339-1934, CC pol 1935-5507, env 5392-7167. XX FH Key Location/Qualifiers FT CDS 339..1934 FT /product="MacERV5b_1p" FT /note="gag." FT /translation="MGAAQSKPDPKSPLGCLLANFQTLGLSQDLKRKRLIF FT FCTVAWPQYKLDNQSQWPAEGTLDFNILTDLTNFCKRLGKWSEVPYVQAFW FT DLRSRPDLCARCSLAQVLLAKSXPSSKEKDDSPSFSEPPDTLSLPPLRSPA FT QPPPYPDPSSSPSSSAPPLPSTPPVSPPNLTDPVSPTSSSSSPVSAHTRSR FT TDLLCPLREVAGAEGVVRVHVPFSLADLSKIEERVGSFSANPTLYIKEFRY FT LCQAYDLTWHDLHVVMTSTLSPEERERILAAARQHADQVHLTDPAMPVGTE FT AVPSAEPGWDYQVGQAGRRRRDMMVQCLLAGMQAASNKSVNFDKLKEVVQG FT SDENPAVFLNRLTEALIQYTRLDPASPAGGTVLATYFISQSAPDIRKKFKK FT VEDGPQTPIQDLVKLAFKVYNSREEAAEAQRQARLKQKVQLQTQALVAALR FT PAGSRSSQRGGTPRAPPGACFKCGNEGHWARQCPNPEEPTRPCPSCRQMGH FT WKSDCPNLRTVATPPRDDPPPGTGGVFQLLDTDED*" FT CDS 1935..5507 FT /product="MacERV5b_2p" FT /note="pol." 
FT /translation="RGPDSGTPLTLAEPRVTLQVAGKSISFLMDTGATYSV FT LPSSSGPSHPSAVTVMGIDGTPSTYRQTPPLSCRLDGSLFSHSFLIIPSCP FT VPLLGRDLLSKLGASVHFRPNPSPHLAFLFPLLSPDKPRQADTPLPFPVPI FT NPKVRDTSTPIIAQHHTPVRIRLKDPSKFPSRPQFPISLEHRQGLKPIITR FT LLQQHILVPANSPCNTPILPVRKSSGAYRLVQDLRLINEAVVPTFPVVPNP FT YTLLSRIPPDTTHFTVLDLKDAFFTIPLHPDCHFLFAFTWEDPDTHISSQL FT TWTVLPQGFRDSPHFFGQALAQDVSLCPLTHSTLLQYVDDLLLCSPSWESS FT LADTATLLNFLGDRGYRVTPAKAQLCTPSVTYLGVLLTPTTKSLTADRISL FT IETLQPPQDAEEILSFLGLVGYFRHWIPNFGVLAKPLYQAAKEAPTGPLSD FT PSLVANSFKKLQDCLLSAPALSLPNPLRPFHLFTEERQKVATGLLAQPVGS FT TYQAVAYLSKQLDPTVQGWQPCLRALAAATELTKEALKLTLGHSLTVYSSH FT RLSDLITHKCLSHLTPSRLQQFHLLFIENPHITLTTSPPLNPATLLPTPGC FT DSAPAHSCPEVLTTLPPARLGLSEQPLEHPDRTLFVDGSSVLTPDGRRQAA FT YAVVTATQTVETRPLPLGTTSQKAEXTALXRALLLSEGQRVNIYTDSKYAY FT LIAHTHSLLWQERGFLTTKGTPVVNGPLIKRLLDALQAPKEVAIIHCKSHQ FT HSKDPVSQGNNLADSTARATALTSSPTPAPLLFLSPTYSPDYSPQELQTLT FT AHPSATQSEDGWVFIDSRVALPQTQAVRILTDIHRSLHIGPKAMYNFLEPI FT LHLPSLQAQIRKVHQQCATCSAVNPQGRLRHPGPTHQLRGHQPGXDWQLDF FT THMPRHKKFRYLLTLVDTFSGWIXAFPTTRETAEVAATTLLEHIIPRFGLP FT RTIQSDNGPAFISKLTQQVAAALQITWKLHIPYHPQSSGKVERANGLLKLH FT LTKLTLETRLSWVTLLPLALTRLRAAPRAPTGLSPFELLYGRPFLFQELPA FT LSPPLGSYLPYLTLLRELLRKHADRCLPAPASPTLENPAVVAPGDLVLVKQ FT LQPRALSPRWEGPYTVIPATPTAAKLLGLPSWYHLSQLKKAPTQHDSNWTT FT QVVTPTKLRLTRAGNDPLPDLPRPPSPPESR*" FT CDS 5392..7167 FT /product="MacERV5b_3p" FT /note="env." 
FT /translation="MIPIGPLRLLPLPSYDSRAQVTTLCLTFLGHLVLPSL FT GKTPAQTPPTNPFQWRFYLSETWTQGNRMSSLTIATVDCQPQGCQSQVTFN FT FSSFNSVPRKWRHPVICFTYDQTYDACRSTWVETNGGCPYXYCNMHKAIQD FT TKETLWQQPTSSVRLTTNLKSTFFLTIPDPWDSRWASGVEARLYRAGYDSY FT PVARLRIFRAYVRVVSSLVSLASDIKQQEKAISAIVDPGSKPQDSSNPFSW FT LTLVREGAQVVHMAGVRNISRCFLCAALNKPPLVAVPLPSPFNSSNLTLSF FT PLPGRTLGEVPLFQDPLRQQLPFCYSTPNASWCNRTGSAPPNLTAPPGGYF FT WCNSTLSKTLKASNTTLCVPISLVPSLTLYSEAELSSLLPLARPRQARAVF FT LPLMIGVSLASSLVASGLGTGALTHSVRTSQDLSARLQVAIEASAESLASL FT QRQITSVAQVAAQNRRALDLLTADKGGTCMFLNEECRYYINESGLVETNLL FT TLEKIREGLHQKNLGSGPSLGWRQSSMAGWVLPFLSPLLIIGFLLLIAPCV FT IRFVRDRIKEVSRVAVNQMLLHPYTRVPTSEEPHDGLYQQEAAR*" XX SQ Sequence 7205 BP; 1483 A; 2569 C; 1572 G; 1574 T; 7 other; ttggtgccga aacccgggag gagacccctc cccggacccc tgccggttcg ggcaggctct 60 cctctccccg agccaggacc tcctctccga gccgggagcc gtttcaccaa atccgagact 120 caattcgatc actgctgggt aagttttccc ccgtcctgca ggctccggga gtccctgccc 180 gaaaacgcgg ccgcgtcagg gcttcccccg tccgtcactt gtgacctcgg caccggggac 240 taggagacgt ccgacctccc cggaagccgc cataccccgc actccgcgtg gcagtgggag 300 gacgctcccg ttgcctccgg actgcccgcc tcaggatcat gggggctgcc cagtccaaac 360 cagacccaaa atcccctctg gggtgcctct tggctaattt ccagacttta ggtctcagcc 420 aagacctaaa acgaaagcgg cttattttct tctgcacagt ggcatggccg cagtataaat 480 tagataatca gtctcagtgg cctgcagagg gcactctaga ctttaacatc ctcacagacc 540 tgaccaactt ctgcaagaga ctaggcaaat ggtccgaggt accctacgtt caggcctttt 600 gggacttgcg ctcccgcccg gacctctgtg cccgatgttc gttggcccag gtcttgctcg 660 cgaaatctnc tccctcaagc aaggagaagg acgattcccc ctctttctct gagccccctg 720 atactctttc cctgcccccc cttcggtccc ccgctcaacc tcccccttac cctgacccgt 780 cgtcgtctcc gtcctcgtcg gcgccacccc ttccctccac tcctcctgtg tccccaccaa 840 acctgacgga ccctgtctct cctacctcct cctcctcctc gcctgtctca gcccataccc 900 ggtccaggac cgacctcctg tgccccttgc gggaagtggc aggcgccgaa ggagtggtcc 960 gggtccatgt gccgttttcg ctggcagacc tgtctaagat tgaagagcgt gtaggctctt 1020 tctcggccaa ccccactctg tatatcaaag agttcagata cctatgccag gcgtatgacc 1080 tcacctggca 
tgatctccat gtcgttatga cctcaaccct gtcccctgag gaacgggagc 1140 gtatcctagc ggcagccagg cagcatgctg accaggttca tttaactgac cccgccatgc 1200 cagtcggcac cgaggcggtg ccctcggccg agcccggctg ggactaccag gttgggcagg 1260 caggccgccg ccgccgagac atgatggtcc agtgccttct tgccggcatg caggcagcct 1320 ccaataagtc ggtcaacttt gataaattaa aggaggtagt ccaaggctca gatgagaatc 1380 cggccgtttt tcttaaccga ctgactgagg cactcattca gtacacccgc cttgaccctg 1440 cctcccccgc aggaggaacc gtcttggcta cgtactttat ttctcagtcg gctccggata 1500 tccggaaaaa atttaaaaag gtggaggacg gccctcaaac tcccatccag gatttagtca 1560 aactggcctt caaggtctac aactccaggg aggaagcagc tgaggcccag cgacaggcca 1620 ggctaaaaca gaaggtacaa ctccaaaccc aggccttggt agcagccctg aggccggccg 1680 gctccaggag ctctcagaga ggaggtactc cccgagcgcc acctggtgcc tgcttcaagt 1740 gcggcaacga aggccactgg gccaggcagt gccccaaccc tgaggagcca actcgcccct 1800 gtccgagctg tcggcagatg ggccactgga agtcggactg ccccaacctg aggacggtcg 1860 ctacgcctcc acgtgacgac cctcctccag gtactggagg cgtcttccag ctcctcgaca 1920 ccgacgaaga ttgaagaggc ccagactcgg gaacccctct tactctcgcc gagcccaggg 1980 ttacgctcca ggtagcgggt aagtccatat ccttccttat ggacacgggg gctacctact 2040 ctgttttgcc ttcctccagt ggccccagcc atccctccgc tgtcacggtc atgggaattg 2100 atggcactcc ctccacctac cgccagactc ctcctctgtc ttgccgcctg gatggctccc 2160 tcttctcgca ctcatttctt atcattcctt cgtgcccagt ccccttgtta ggacgagacc 2220 tcctctccaa gctaggggcc tcagttcact tccggcccaa cccctccccg cacctcgcgt 2280 tcctctttcc cctcctctcg cctgataaac cccgccaggc tgacacccca ctcccgtttc 2340 cggtccccat taaccctaag gtgcgggaca cctccacccc gatcattgcc cagcaccata 2400 ctccagtccg catccggctg aaggacccct ccaagtttcc ctctagaccc cagttcccta 2460 tctcccttga gcaccgacag ggactaaaac ccatcatcac gcgcctgctc cagcagcaca 2520 ttctagtccc tgccaactca ccatgcaata ctcctatcct gcctgtacgg aaaagctccg 2580 gggcctaccg cctcgtgcag gatctacgcc tcatcaatga ggcagtagtc cccacctttc 2640 cagttgttcc taacccatat acacttctct cgcgcattcc ccctgatacc actcatttta 2700 ctgtccttga cctaaaggat gccttcttca ccattcccct acatcccgac tgtcactttc 2760 tgtttgcctt tacatgggaa 
gatccagaca ctcatatttc ttcccagctg acttggactg 2820 ttttgcctca agggttccga gatagccccc attttttcgg acaggcactg gcacaggatg 2880 ttagcctctg tccccttacc cacagcacac tattgcaata tgtagatgac ttattactat 2940 gtagtccctc ctgggagagc tcccttgcgg ataccgctac acttttaaat tttcttggcg 3000 accgaggtta tcgggttacc ccggccaagg ctcagctttg caccccttct gtcacctacc 3060 taggcgtact cctcacaccc actacaaaaa gcctcacggc ggatagaata agcctcatcg 3120 aaactctcca gcctcctcag gatgcggaag agatcttgtc cttcctagga ctggtagggt 3180 atttcaggca ttggattccc aacttcgggg tcctagccaa gcccctctac caggctgcca 3240 aggaagcgcc caccggaccg ctgtccgacc cctcattggt tgccaactct ttcaagaagc 3300 ttcaggactg tctcctttct gcccctgctc tctctctccc caaccccctt cggccttttc 3360 atctattcac cgaggagcgc caaaaggtag ctactggcct cctagcccag ccggttggat 3420 ccacatacca ggctgtggcc tatctctcca agcagttaga ccccacggtc cagggctggc 3480 aaccttgtct gcgagccctg gcggctgcca cagaacttac caaggaggcc ctcaagctca 3540 ccctaggaca ttctcttact gtctactctt ctcaccgact gtcagatctc atcacacaca 3600 agtgtctcag ccacctcacc ccgtcccggc ttcaacagtt tcacctgcta ttcattgaaa 3660 accctcacat cacccttacc acctcacccc ctctaaaccc tgctaccctc ttgcccactc 3720 cagggtgcga ttccgccccc gcacattcct gcccggaggt tctcaccacc ttgccgcccg 3780 cccgcctcgg tctttccgaa cagccattag aacacccaga ccgtaccctg ttcgtggacg 3840 ggagttctgt cttaaccccc gatggccgcc ggcaggcggc ctacgctgtc gtaacagcaa 3900 cccagacggt cgaaaccagg cccttgcctt taggcaccac ctcccagaag gctgaastca 3960 ctgcccttmc tcgtgcccta ctcctctccg aagggcagag ggttaatatc tacacggact 4020 ctaaatacgc ctaccttatt gcacacaccc actcccttct ctggcaagag cgtgggttcc 4080 ttaccaccaa agggacgcca gtagtcaatg gaccacttat aaagaggttg cttgatgctc 4140 tccaggcccc caaggaggta gccatcatcc actgtaaaag ccaccagcat tctaaggacc 4200 ctgtgtcgca gggtaacaac ctagctgatt ccaccgcgcg ggccaccgcc ctcacttcct 4260 cccctacccc agcgccttta ctctttcttt cccccacata ttcccccgac tactctcccc 4320 aggaacttca aaccctgacg gcccatccca gtgctaccca gagcgaggat gggtgggtgt 4380 tcattgacag ccgagtagcg ctccctcaga ctcaggcagt gcgtatattg accgacatac 4440 accgctctct ccacataggg cctaaggcta 
tgtataactt cctggaaccc atccttcacc 4500 tcccctcact gcaggcccaa attaggaagg tacatcagca atgtgccacc tgttcggccg 4560 ttaaccccca aggtaggctc aggcacccag ggcccactca tcagctaagg ggccaccagc 4620 cagggnaaga ttggcaactt gattttaccc acatgccccg ccataaaaaa ttccgctacc 4680 tgctgacctt ggttgacacc ttctcaggat ggatcnaggc tttccccacc acccgagaaa 4740 ctgcggaggt cgcggcaacc actctcctag agcacatcat ccccaggttc ggtctccccc 4800 gaaccatcca atcggacaat ggtccggcct tcatttctaa actcacccag caggtggcgg 4860 ccgcactcca gattacctgg aagctccaca ttccctacca tccgcagtca tctggaaagg 4920 tagaacgcgc aaacggtctc cttaaactnc atctaaccaa actcactcta gaaacccgcc 4980 tctcgtgggt gactctcctt cccttggctc tcactcgtct tagggcagct ccgcgggccc 5040 ccacagggct cagccctttt gagctgctgt acggacgccc gttcctcttc caggagctgc 5100 cagccctctc ccctccctta ggctcctacc tcccttactt aaccctcctt cgcgagcttc 5160 tgagaaaaca cgcggaccgg tgtctccctg cacccgcttc cccaaccctc gagaatcccg 5220 ccgtcgtagc acctggagac ctggtcctgg tcaagcagct gcaaccccga gccttatctc 5280 cacggtggga aggaccatac accgtaatcc ccgccacccc caccgctgcc aagctccttg 5340 gtcttccctc ctggtatcac ctgtcccagc tgaaaaaggc gcccacccaa catgattcca 5400 attggaccac tcaggttgtt acccctacca agttacgact cacgcgcgca ggtaacgacc 5460 ctctgcctga ccttcctcgg ccacctagtc ctcccgagtc taggtaagac tcctgcccaa 5520 actcccccca ccaatccatt ccagtggagg ttctaccttt ccgagacctg gacccaagga 5580 aatcgcatga gctccctcac catagccacg gtagactgcc aaccccaagg gtgccaaagc 5640 caagtaacct ttaatttctc ctccttcaac agcgtgcccc gtaagtggcg gcacccggtc 5700 atttgtttca catatgatca gacatacgat gcctgtcgat ctacttgggt tgaaaccaat 5760 ggaggctgcc cttatnacta ctgcaacatg cacaaggcca tacaggacac caaagaaacc 5820 ctctggcagc agcccacttc ctccgtgcgg ctgactacga accttaagtc tacctttttc 5880 ctcaccatcc ctgacccttg ggactcgagg tgggcgtcag gggttgaggc ccgcctttac 5940 cgagcggggt atgactccta tcccgtggcc cgactcagga tctttagggc ctacgtcagg 6000 gtcgtaagca gccttgtcag cctggcctcc gacatcaaac aacaggaaaa agccatctca 6060 gctatagtcg acccaggcag caagccccag gacagcagca accctttttc ctggctaacc 6120 ttagtcaggg aaggggctca agtagtacac atggccgggg 
tacgcaacat ctctcgctgt 6180 ttcctgtgtg cagccttgaa taagccccca ctggttgcgg tacccttacc cagccctttt 6240 aactcctcta acctaaccct ctcctttccc cttcccggcc gaaccctggg ggaagtcccc 6300 ttgttccaag acccgcttag acaacaactc cccttttgct actccacccc caatgcttcc 6360 tggtgcaacc ggacaggatc tgcgcctcca aatctaaccg cccccccagg tgggtatttt 6420 tggtgcaatt ccaccctgtc aaagactctc aaggcttcca acactaccct atgcgttccc 6480 atctccctag tccccagcct caccctgtac agtgaggccg agctatcctc ccttctgccc 6540 cttgcccgcc cccgccaggc aagggcagta ttccttccat tgatgatcgg ggtctcttta 6600 gcctcatccc tcgtagcctc cggcctagga accggagccc taactcattc tgtccggacc 6660 tctcaagacc tttcagcccg attacaggta gcaatcgaag cttctgcgga gtccttggcc 6720 tccctccaac gacagatcac ctcggtcgcc caggtggcag cccaaaaccg tcgggcactc 6780 gacctcctta cggccgataa gggcggcacc tgcatgttcc tcaatgaaga atgtcgctac 6840 tacatcaatg agtcaggact agtagaaacc aatctcctca ctctagaaaa gattcgggaa 6900 ggactccacc agaaaaacct cggatcgggg ccctcacttg ggtggcggca gtcgtctatg 6960 gccggttggg tcctaccctt tctaagcccc ttactaatca ttggcttctt gctactcata 7020 gctccctgtg tcatccgctt cgtccgggac cgcataaagg aagtctcccg ggtcgctgtc 7080 aatcagatgc tactccaccc ctatacccga gttccgacct ccgaagaacc ccacgacggc 7140 ctatatcagc aggaagcagc cagatgaata cgtcgcccct ttttcttatt agaaagaggt 7200 cggaa 7205 // ID LTR13C_OG repbase; DNA; PRI; 616 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.11, Created) DT 18-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR13C_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-616 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2858-2858 (2009). XX DR [1] (Consensus) XX CC ~95% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 616 BP; 147 A; 202 C; 111 G; 156 T; 0 other; tgtgagagcc aaactagctc cattttgttg tttacagcac aagctccatt cttggctgtt 60 tacagcacaa gccccattcc tagctgttta ccgcatagcc cttccgcccc aacctctgcc 120 agtgaaacct gcagggccac tccctgctgc cttcccccac cccaggttgc acagccactg 180 cgccgtaaac tgagtcacct taggcaaata gctgctataa taaacaataa gtttctatta 240 ccttgaaata caggtaacca ggttgcacca actctagccc ccggttaagg gaccccccac 300 ccctcacctg taccaatcca aattcacctt tactagtgaa atcatcagct accaatgcaa 360 atcctccttt acttcctcct ttactaatga aatcaccaac caccaatgaa aagtgtactc 420 gtacccgacc ctacccctct tgctttctgt accccataaa aacttcctcc accccgggtt 480 cggggctcgt cctaggcttg cacgccggga gagtccagac ccggtctgaa taaaacctca 540 cctcgtgcct tttacatcgg agcgtgtctc gggttcgtct gttggggctg tttgggagaa 600 tttttcggac ctaaca 616 // ID LTR6_TS repbase; DNA; PRI; 386 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV3-type endogenous retrovirus - DE consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR6_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-386 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1262-1262 (2010). XX DR [1] (Consensus) XX CC ~94% identical to consensus. 5bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 386 BP; 86 A; 126 C; 93 G; 81 T; 0 other; tgtaaggtca gaggaccgga gggcggggca ggcctcgtct caacaagatg gctccaaccc 60 ggaccaggag aatgggagtg agagcaaaaa ccggaactaa gaccccagcc gcaaagccag 120 ccagtacctg ttacagcgac tctagcaacc ccgttcagcc aatcgccagc agacgtgaca 180 gctgtcccaa ccagtgatcg ctctccccaa tccccctcac agctttctca gcgtgccctt 240 tataaacccg tcgcccatct tgctgggcgc gacttctccg gcccacagct acatggaccg 300 gagaacctcg cccgggattg cgctaataaa ttttttcttg gctcctttgt tgccgtttcg 360 cctggtctgg tttcatttac cttaca 386 // ID ERV1-2_TSy-I repbase; DNA; PRI; 5805 BP. XX AC . XX DT 28-JAN-2010 (Rel. 15.09, Created) DT 28-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Internal portion of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-2_TSy-LTR; ERV1-2_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-5805 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1194-1194 (2010). XX DR [1] (Consensus) XX CC ~95% identical to consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX FH Key Location/Qualifiers FT CDS join(3597..3716,3720..3956,4133..5014,5018..5275) FT /product="ERV1-2_TSy-I_1p" FT /translation="MQALGSVLNAIHCQVRERLPISLTTDAHSFKPGDMVW FT VKENVQPLKPLWRGPFTVLLSTPTAVKVAEVVPWIHYSRIKPAFQDWECTA FT DSAAPLKLTIRKVTEPNEGRGTNLSSPAPEAQLSPLIMLEKGFTSISVLSY FT DRKFTCAFMYFYLIWVSTASPLYPACKCIETIWPDLRVFHRYSSQPEVCYT FT DKHICTHNGQLYFAGLSKQSYQYTKAGVIACNTPRSTNWVCWNFAPSPKAE FT KLIKETVQLVAPPTAQQSKLTPFPQSWEVGQNLFINLAENIARTLGVTNCW FT VCGGALMTKEWPCKGTNLNAYQLLQWNHSLTVRTDNVPQKWILSSKVIGED FT CLSRAGSAYTQWVGETPCKRILYWNSTHQTWWPTKPVWYWASAYATKSSQC FT TKGLFWICGKVAYAQLPALKGTCNIGIIQPGFFLLPNPRGDELGIPLYESL FT KTRGARSLDQVPNIGGTQVWKNDEWPPQRIINTYGPATWAQDGSRKCCTPI FT YIC" XX SQ Sequence 5805 BP; 1795 A; 1183 C; 1255 G; 1572 T; 0 other; tttctggtgc gttggccggg aaatggagat tggagacggt gtgttccacc tcctttgtca 60 tcgaggtgcc acctccaggg caacgggaag ggcaatctcg cctggcgcgc aacggactct 120 ttttgatccg taggagagag gcccctcccg gtgcactgaa gctcggacca cgaaagcaca 180 acaggaaccc aagaggcaag gaactggcac aaggacgacc acctgacgga aaggacagaa 240 ggaccacccg atggaaaggg taggaataaa ggcctttgca tagacctgcc aatggatggc 300 agaaggggag cttgatcacc tcccggagcc cgctagtaat cttccagcct gaaggaaagg 360 gctgaaaggc agtgaaagga aacactgcct gaagataaac ccgtagactc ctaggattag 420 gacggagagt tgacgaatgt gctgtgtgct tgtgtgtaaa tgagacggat ggcgggaagg 480 ctccacatag ttagtctgcg ttgggggtgt accaacctac caattgctaa gaggtgtctg 540 agatatcttc gtgaagagta tgactgaagg gggacaaagc ggatatttcc catacccctc 600 attctcttat ctgtgtgtgt ctgtcacttt aaatctgcta aaaagctggt tgagttatta 660 tgctttgttt gatgggacat gtggtctttt aaaaacttga ttggcgcata tttcttgatg 720 tcattttatg tctcccttta tttgtgtggt gttttctccc aacccgttgg cgctgttaaa 780 atttctgcct tttgttgtta aggttgtgga aaaattgcct gttgcgtata agaagtgtta 840 ccggttatct taaattcttt ctggggacca ctccccctcc cttccttagc ctgtgctctc 900 cttcgtaaat tctgaagggg ctcattctaa tccacctgta aagaaacgga gaaatgtctc 960 ttatatgcta cctagccttt tgttaactca gtgtaagtta atgctagcct tggagctaat 1020 ctttctctgg caactaagga aatgaatagg ctgttctcgt tttcaaatgc taaaggccag 
1080 gttagttaaa agtaagtccg tttctaaagg tttaggcctt gcctaagatc gtgatcaact 1140 gcttgtccct gtaaaaaaac tgttgggaag cctctcagag tcttgtaatg tatgctgagt 1200 ccctacttta cattgtaatg ctaagtaaca ctaatgcttg cgttttttgg ctatgctcat 1260 ttaattgtcc acgtcttacg actaaggctg ctgtttagta attcatgagg atttttgaca 1320 attttgcctg ttcataatta ggaatatgta ggaacagatc atttcacatc ttcgtggtct 1380 gttcactcta aagggttttt ttgaaccagg ccattcttgt agacatcatt ggaaaagtcg 1440 tgaagcaaaa gaaaaaaaaa aaaaagagaa gaactccaaa atgagcacca gccagaggaa 1500 aggtaaacaa attggcagtt gccactaaag gcaaaagctc cagccaccca aaagaggcct 1560 gataaggata aacctgctaa ggggatggta tcagctgcta tgaatgtggt caatctgggc 1620 acctaaagca caattgttct taatagccta agcccccaga actttgtcca atatctggac 1680 taaaaagcca ccgaaagaga tggtgcggcc tctcccgaag gggggggaaa cacctagctc 1740 ggtgatagtg gctgttgact gaagggggag gggcatggcc cagcgcagca ctctgtatgc 1800 catccaccca gagaatcctt gggtctatat aaatgtggct ggggtccaaa ttaatttctt 1860 aattaatact gaggctaagt ttttagccct tgtgctcata ctgaacttct tcctcctaaa 1920 acctgctctc cagtggaatt aaaggtgttc caaaaactaa atgtttcacc taacagttac 1980 taaaaggtaa attcaaatta tgactgtaaa aggttatcta taaataaata aaaaggaaag 2040 caaaaagcaa aggaaaaatg taaaaaggtt gttggtatac aaataaggtt ttagcaggaa 2100 aacaaaaaag aaaggaatat aaaaatatat atatttatct aagaggaaat cttatatggt 2160 acatttctgt cctataataa agtagatagt tatgaaaatt tttaaaaagg gggtagagaa 2220 caaaaataaa atggtataag cacgccaaaa atggtataag gaaagtttaa aaaaaatata 2280 catacaaaag aaattctata aacaatctag gcataataaa gtcttttctg aatgcaagtc 2340 tccaaacatt ttcggatttc tcttcaaatt ttatatagaa tttacttaag tgcctagttc 2400 aaattatagt ctaaaacaat ctgtcaggct accattatag tccagaaata caaaagctgg 2460 tacttgtaat aaagctacag aaattaaagt ttatcattta agttatgcta ataacttcag 2520 gccttgtagc ttttgtataa aaaggtaatc agccactgcc tacaattgtc aaaatgtagg 2580 cctgacttta ttccttttat tgagctagaa tccagtaatt tcatgcccac ataatggtca 2640 tgcaaaaatt ccagccagtg cctgcaaata atgccactgc tgaaacaaat ctggacgcct 2700 caagtcagac agggagagat tttcgatctc tggctaggca ggactgcgcc cattatcagc 2760 
tctgaagaaa ttacagaaga cggaccttca cccctcagca ccccaaatga ttatgggatc 2820 gaaaaatctc taaagaataa ataattgagg cagggtagct agcacaggct agacataaat 2880 aaatttccta taggtaggtt ttaaaaggcg ttcaaacgtt tctactgcag gccgagtggc 2940 aacctatccc accccttggc aaatccaggg aaaactccat cagatgtaaa actaggacag 3000 tacaactttc aatttgcggg ttagcataat aatcactaga aatcctttgg agacatgagt 3060 aaactaaaca taggtatttg gtattgcatc actcaggatc caagtccctg aaatggaatt 3120 tgttaaacct ttgaaattgt ttaacggtag gcctctgaag gaaagggggg ccacaaacgt 3180 aaatgctcat aagaaaatgt gttgtctatt tgggataaat tttaatcaca gtttatgagc 3240 ttatttccct agacgcttct tggaccagaa gaaactcaaa catccaaggg gagtaagtgc 3300 tgctaaaaat tgaagaaagg aaggacccaa tcaacaactg taagaaaaaa aagaggaccc 3360 taagggtggt cctgttccta tactagtcta gagaatttaa atgcctaact tctctatgaa 3420 gtctcaaagg ggggtgaaca aaagcccctg ctaggggaaa aggccccaaa acaggccttt 3480 aaataaagta aacaggcttc ttgccttttg taattcctta tgggtggcca ccccctctcg 3540 ttaggggcat acaggaaaac ttaaaggaat ttggaagtct tgccctaaaa caacaaatgc 3600 aagccttagg ttcagtccta aatgccatcc attgtcaagt cagagaacga ttgccaatta 3660 gtttaactac cgatgctcat tcttttaagc caggtgatat ggtatgggta aaagagtgaa 3720 atgtacaacc cctaaagccc ctctggaggg ggccattcac tgttctttta tctaccccga 3780 ccgcagtcaa agtagccgaa gtagttcctt ggatccatta tagcagaatt aagccggctt 3840 tccaggactg ggagtgcacc gcagattcag cagcacccct taagctgacc attcggaaag 3900 tcacggagcc caacgaagga cgtggaacta atttatccag ccctgctccg gaagcgtgag 3960 gatacttcag gccctgcttc aatcacacac tggaagctga ttgatcgacg cacggcagaa 4020 gattgaggat tcggcttaag gacccagcgg acattactga taagcctttg ttttgctttg 4080 tcaaccttta tacttgctaa accgtttgta actttatttg ttgtagaatt aacagttgtc 4140 cccattgatc atgttagaaa agggtttcac tagcatttcc gtcctttctt acgataggaa 4200 gtttacttgt gcttttatgt atttctacct aatctgggta tctaccgctt cccccctata 4260 cccagcatgt aaatgtatag aaactatctg gccagactta agggttttcc atcgttattc 4320 atcccagcca gaagtttgtt atactgataa acatatctgc acccataatg gccagctata 4380 ctttgctgga ttatcaaaac aaagctacca gtacactaaa gctggtgtca tagcctgtaa 4440 cacccccaga 
agcacaaatt gggtttgttg gaattttgct ccctcaccca aagctgaaaa 4500 gctgattaaa gaaactgtgc aattagtagc tcctccgact gcccaacagt caaaactaac 4560 ccccttccct caatcgtggg aagtagggca gaatctgttt ataaatttgg ctgaaaatat 4620 agctcggaca cttggtgtca ccaattgttg ggtatgtgga ggggcactaa tgactaaaga 4680 gtggccttgt aagggaacta atctgaatgc ttaccaactc ctccaatgga accattcttt 4740 gactgttaga acagataatg taccccagaa gtggatcctc tcctcaaaag taattggaga 4800 agactgttta agcagagcag gttctgctta tactcaatgg gtaggagaga ctccttgcaa 4860 aagaatatta tactggaatt caacccacca gacctggtgg ccaacaaagc ctgtttggta 4920 ttgggcatct gcttatgcaa ctaaatcttc tcaatgtact aaagggctat tttggatatg 4980 tggaaaggtt gcatatgccc agctcccagc attatgaaaa ggcacatgta acataggaat 5040 aatccagcca ggatttttcc ttctgcctaa cccaaggggg gatgaactag gcattcctct 5100 ttatgaaagc ctcaaaaccc gaggtgctcg atccctcgat caagttccta atataggagg 5160 gacccaggtt tggaaaaatg acgaatggcc accccagaga attataaata cttatggccc 5220 agccacatgg gcacaagatg ggagtcggaa atgctgcacg cctatatata tatgctaaat 5280 cgcattatca ggttacaaac tgtcttagaa attataacta atcaaacagc cacggctttt 5340 gagcttgttg ctcagcagca gcccaaatgc gtgcagctat ttatcaaaac cgcttagctt 5400 tggactacct gctggcggaa gaaggaggag tatgtggtaa atttaatagc tctgactgtt 5460 gtttgcaaat agataataat ggtaaagtta taactaatat agccaccaat gttaaaaaaa 5520 cttgcccatg tcccagtgaa agaatggcaa ggcataaaaa tcggcaattg gtttgaaaac 5580 atgttctcgg gtctgggagg atttaagtat attactggat ctatagtcct aatggtggga 5640 tcctgcctta tcctcccttg catagcccca ataattatga acgccatttc tgagtttgtg 5700 gaaacagttg ttaaacgaaa aactgcggcc catataatgt tgatgcacca aattaaggat 5760 aatgatgctt tagatccata aaagatcaag catcaaaggg gggaa 5805 // ID LTR6B_Mim repbase; DNA; PRI; 384 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; nonautonomous; KW LTR6B_Mim. 
XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-384 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1719-1719 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 384 BP; 111 A; 97 C; 91 G; 85 T; 0 other; tgttccagca agtgtacagc aagtgtaaca gcaagtgtta acagcgactg accagggaaa 60 gaaccgagcg gcacacagag ggtcggagaa tcagccttta ttccgccggc gggctcagag 120 gggcatgcca ccaaattctg agcaccgctt ccttgttctc ctccagtttt atacacttct 180 tggggttaca cacaaccaat caactatgag catgaggagt tatccaatca gcggtgggat 240 tcaaaaagca tccaatcagc aacaagcttt acatatccaa tcagcaacaa gctttacata 300 tccaatcagc agtgggattc aaaaagtatc caatcagcag tgagttcagc catggggccc 360 cggggtgggg gtttttccct atca 384 // ID MacERV6_LTR5 repbase; DNA; PRI; 454 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MacERV6_LTR5. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-454 RA Smit A.F.; RT "MacERV6_LTR5 - ERV1 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC despite 5bp TSD 5%. 
XX SQ Sequence 454 BP; 108 A; 151 C; 106 G; 89 T; 0 other; tgttcgggtg agggagaaag gacaagatgg aagaaggtaa agaaggtaaa caagatggcg 60 cagttccggg ttcttcatca gcgactttcc cgcgcccggg aaaaacaccg actgtctgcg 120 cctgcgcatt gtgacgtcaa aacaaagaaa tcgaaactta cccggccacg cctatgaaga 180 cgcccttacc cccgcccctg tcctgcccac ctcaagcccc atccataaaa ggccgctccc 240 ggaagacatc ggcgcgaact tcctcggccc ctcctcatat gcggacctag gaacctcgcc 300 cgagaacgcc ggagcgactt cctcggcctc caccgccgga gaccggtgaa cctcgccctt 360 tcctccttca cattggctag ctaataaagt ttttttacct tgcctacttg cctcatctct 420 ggcgcctgct ccggtggtcg cataaaacaa atca 454 // ID LTR15_Mim repbase; DNA; PRI; 383 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; nonautonomous; KW LTR15_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-383 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1721-1721 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 383 BP; 75 A; 127 C; 77 G; 104 T; 0 other; tgtccggagc cgcagctcct gcaaatggct gccacaacct cacatagcca gcctgagatg 60 gatgcctggt catgatcggg agacagaaag tgcatacttt ctgttcccct gacatgacgt 120 agccccagac cctctactgc cttccttata tggttatcct ttgcctctgc ccccagaccc 180 tttactgcct tccttatacg gttatccttt gcctctcgga gaaacccgcc gagctactgg 240 catataagtc tgcactctgc tcctaataaa cgagacttga tcagactcct gtcttgtctc 300 catttcttgt gtcttatgcc tttcccaggt cccactccct cctccgagga ccccactaat 360 aactggtcct gcgggtcggg aca 383 // ID LTR1_Mim repbase; DNA; PRI; 681 BP. XX AC . XX DT 13-OCT-2009 (Rel. 
14.11, Created) DT 13-OCT-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR1_Mim. XX NM LTR1_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-681 RA Jurka J.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2948-2948 (2009). XX DR [1] (Consensus) XX CC Top sequences are >98% identical to consensus. The internal CC portion (not included), comes from a non-autonomous retrovirus CC with unrelated insertions such as L1. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 681 BP; 187 A; 173 C; 126 G; 195 T; 0 other; tgaggactga actctaattt tttgctaaaa actccgtcct aaggaggcca gctgggttag 60 gctggcaaat ggtaaggagg ccagcagggt taggctgaca aacagtaaat tcccactaag 120 cggcttttgt taaccaaacg aagcagtggt ttacttcctg acctgattct ggtatggcat 180 taacatcacc taaaagataa gaagcccctg tcttaactca agcatcctag cagatgccta 240 ttcttaaatt taaaccatct taaaacatcc tagcaggcgc ctattgtaaa tttaaagtgt 300 cctcctgtct ggacctccca gagtgctcat acccttatct taaagtaagc atatcctttc 360 tggtcttcta gataaagact aactctctca gccaattgcc agccaaagaa tctttaaacc 420 cacctataac ctgtaagccc ccgcttcgag atgtcccacc tttttgggcc aaaccaatgt 480 atgcctctca tgtattgatt tgtgactttg cctgtaaccc ctgcctctct gaaaatgtat 540 aaaactgaac tgtaacccag ccacagcgag tccacttgcc caaggcctct tggcagtggc 600 tccgggtcat ggtcctcaaa tttggctcag aataaatctc tttaaaatta ttttacagag 660 tttggctttt ttccgtcgac a 681 // ID LTR5B_Mim repbase; DNA; PRI; 370 BP. XX AC . 
XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR5B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-370 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2954-2954 (2009). XX DR [1] (Consensus) XX CC >91% identical to consensus. 5bp tsd. CC Similarity to LTR5_Opr from pika. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 370 BP; 85 A; 137 C; 78 G; 70 T; 0 other; tgtaaggctg gtggctgggt ccccctcccc tcttccccca tcctcacagc attgcaaagc 60 ctgcagacct tgcacaacag ctgccctaac tgcatcctgt taccaacgga acatctcggc 120 ccggaccccc ccgccaaggc aaagcaggaa gacgccccgg gtgcaaaaca tcctgccctg 180 acccccggag acagacaaca acaacagcta gccccctata aaagctaact cgcagtcctg 240 ggcggcgcga acttctgggg cccctgctct gggacccttg aacctcgccc gggagcgctt 300 tcaataaacg ctacgtttgt tagcatctat tcggcctcat tactcttctt ccgccaccac 360 aaaacttaca 370 // ID LTR1B1_Mim repbase; DNA; PRI; 624 BP. XX AC . XX DT 06-NOV-2009 (Rel. 14.11, Created) DT 06-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR1B1_Mim. XX NM LTR1B1_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-624 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2946-2946 (2009). XX DR [1] (Consensus) XX CC >90% identical to consensus. 4bp tsd. 
CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 624 BP; 173 A; 164 C; 110 G; 177 T; 0 other; tgtgaactaa aataaaatct taagccccca gctgactgaa tggaccccct cttggccaag 60 gggaccccag aaatacccta aagctgagtt gctggccatg agaagggagg tcagacatgc 120 ctcatcatgc ccccctccct tcttggagat gtcctttgta actcattaac aggcctaagg 180 ctatgcaaga caaagcttaa accacacctg caggtcatca atttacttaa cagatcactt 240 gagtctgggt atatgtccgg tggcttgtct ctgattaaca gacttcctta tcttaaaaca 300 ttccaagcct ttagacaaag cttcatttct ttaaccaatt acaaatcaaa gaatctttaa 360 acccacctat aacctgtaat ccccgcttcg agatgtcctg ccttttcggg ccaaaccaat 420 gtacaccttc catgtattga tttatgactt tatgtgtaat tcctgtctcc ctgaaatgta 480 taaaaccaaa ctgtaaccca accacgcgag accacttgct caaggcttct tgggcgtggc 540 tctccgggcc atggtcacac atattcggct cagaataaac ctctttaaat tattttacag 600 agtttgggtt cttttccgtt gaca 624 // ID LTR1C3 repbase; DNA; PRI; 638 BP. XX AC . XX DT 01-MAR-2009 (Rel. 14.06, Created) DT 18-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; LTR1C3. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-638 RA Smit A.F.; RT "LTR1C3 - ERV1 Endogenous Retrovirus from primates."; RL Repbase Reports 9(6), 1174-1174 (2009). XX DR [1] (Consensus) XX CC 9.5% subst, 25 copies. 
XX SQ Sequence 638 BP; 152 A; 204 C; 168 G; 113 T; 1 other; tgatacagaa cggctgggct cccggctaaa ccccaccctc aagcctggaa cctcggccct 60 aagtgaaaac agctgacccc gtttttccgc ccaaatgatt gcctttttgg cccgccccgc 120 ccctatcctg tgcccataaa aacagacttc agctggcaga gcaacacaag cggctgatgc 180 aagcggtcgg ggatgcaagc tgctgagcgt cggggataca agcggctgag cggcgagcag 240 agaagcaact gagcgtcgga gactacggat agacgcggct aacttcagac ggtgcagctt 300 cggaggggag cccggccaga gacggctggg cttcagggaa agatcacctt cttcccgcac 360 catccccttt ccagctcccc attccgccga gagccacttc caccgcccaa taaagtcctc 420 cgcatncact acccttcaaa cagttcgtgt gacctgattc ttcctggaca ccgaacaaga 480 actcgggtgt caaaaagggc aggtgcagga ggctgtcacc ctgacccttc actgagctgt 540 taacacttag ccgtccacgg actgcaggct gagtgaaacg agccactcca gttcctgccc 600 acgaaggggg tcaaggtcaa gggaacaatc ccgtctca 638 // ID HERV1_LTRb repbase; DNA; PRI; 511 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE ERV1 Endogenous Retrovirus from Haplorrhini. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW HERV1_LTRb. XX OS Haplorrhini OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates. XX RN [1] RP 1-511 RA Smit A.F.; RT "HERV1_LTRb - ERV1 Endogenous Retrovirus from Haplorrhini."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC <8% div. 
XX SQ Sequence 511 BP; 135 A; 120 C; 106 G; 150 T; 0 other; tgaaggggga attaatgaat tttataagta taatagtcaa gaaatttatt cttttccttg 60 aaggtagaat gtaatatagc cccccaaccc ggaggcctgg gttaagagag aatattaact 120 gcttatttct cctctatgcc cagagaggct tatctgtgtt ccatcgtttc acattccttg 180 aggcacagcg agttcttgct tccctcccta gcgcggctgt aaagtcacaa ggttgataag 240 caaatgctac aaaagcatgt attcccaagg atgtaagaca tgtggtgtaa caaatgtaaa 300 agaataatta actgcctttg ttctcgcttc tgcaagtacg cttcctgcag cacgtaactc 360 ccgccacaaa ctgcttaaaa ggtgattgat ccctttgttc ggggctcaga ctttctggac 420 cctagtccga ctgagccggt gatcacctta ataataaagg gctctcctga actctgttcg 480 gtctctcccg tctctgattt gtcccgcaac a 511 // ID ERV2-4_TSy-LTR repbase; DNA; PRI; 1245 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV2-4_TSy-LTR; ERV2-4_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-1245 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1209-1209 (2010). XX DR [1] (Consensus) XX CC ~95% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 1245 BP; 377 A; 224 C; 268 G; 376 T; 0 other; tgttgggagc cagccatgga ggcctgcctg ctgacaaagt ccagggggct gaagaccccg 60 ggtcgcaaag cgaacctggg cagctgagac ccttgcttct tctacttttg ttattagtgc 120 agccgaaaag ttctgtacag tctgaggatg gaaagcaaaa ttccactcac gcctccacta 180 ccccttgcaa aaaggcaaga ggggaattca tgaaaaatta ccttcccttc atttgaattc 240 ctccctaggt ttccacgtcc ccggcctcct ggtattaaac catgcaatca cttgattaaa 300 cattgtagtt gggaaaattg taaaaaatca gaaagtgctc gatttaatgt ttgtcaggga 360 actcactggc cttgccagga aattatcaat agagatttta aatatcatgt gcgggtagat 420 aattacttgt aagattttat ttcttctgat tggcgactaa ttggagaaat ttgagaagct 480 acagtttgta tttttaggaa gagtgccaat gcaaaaatta catttactta atttcttaat 540 ttctacatgt ggtgtattat gcttaaaaaa ttaacaatca cataaatatt agagcaccct 600 tgcctttact tccctttgtt aaggaggctt gtaattctgg ttagggattg tgactaaggt 660 gacttgagcc cgagaggcct tttgttttgc aaacattaaa gctcattctc cctggcagga 720 aataattaat gcagattgta aacaccaggt gtgaaattag ggaaaactag tcaggccaac 780 ttgcggctaa gaaaccttga aaatgttgtt ttaccccagc tagttgcaaa ttgctgtgtg 840 aaggtcacag gtgtgacttg ttaatctatg tgataattat agtgcttaga acttggtagt 900 gatttataat gtaatagtta taggaataag taagttttgt atggacatta agaaatggta 960 atcattaaaa aatgttagaa gtttataaaa tttatagtta atgtttagga tgataatgtt 1020 ttaagatcaa aacaggatga acaaagatac attgcctgtc cggtccatta aattgtaact 1080 gattgccttg gaaacaaaaa acatgtaggg tataaaagga gttggaaaag acaataaatc 1140 ggcatttgtc ctgcttgcag cagagtcagt gtccaagttt tttctcctgc gccgacgccg 1200 tccctctttc aggaaaccct gacccgctgg agctggactc cggca 1245 // ID npiggy2_Mm repbase; DNA; PRI; 348 BP. XX AC . XX DT 24-MAR-2010 (Rel. 15.09, Created) DT 24-MAR-2010 (Rel. 15.09, Last updated, Version 1) XX DE npiggy2_Mm is a nonautonomous piggyBac element identified in DE Microcebus murinus. It appears to have been recently active DE (within the last 30 my). XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW npiggy2_Mm. 
XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-348 RA Pagan H.J.T., Smith J.D., Hubley R.H. and Ray D.A.; RT "PiggyBac-ing on a Primate Genome: Novel Elements, Recent RT Activity and Horizontal Transfer."; RL Genome Biol. Evol 2, 293-303 (2010). XX DR [1] (Consensus) XX SQ Sequence 348 BP; 108 A; 76 C; 74 G; 90 T; 0 other; ccacttcggg acgagcgtcg actatagtcg acagccacag atgaacgcgc acagcgactt 60 tagccgacag ccgtgatatg acttttctaa tttttcattt atcaaaataa aattgtgaac 120 atttaaaaat aacataatga aaacatatat gtatatgtta cctattctga tttacattac 180 aagtaaagct gcctgtaaag taaaacaagc tttcagtgct ttaaagcttt cctcatcaca 240 caagagcaaa acggattcgt cgtcaatgca cagcacaaac tatcgtgcgg actgtgagtg 300 ccggctgtgg gcaaggtttc gcggccggtg agcgccgtac cgaagtgg 348 // ID LTR9B_Mim repbase; DNA; PRI; 340 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR9B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-340 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2963-2963 (2009). XX DR [1] (Consensus) XX CC >92% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 340 BP; 99 A; 68 C; 81 G; 92 T; 0 other; tgttacagca aaccggaccg ggaaacaaac cgagagccac acgagagtac tgcaaaacag 60 gcttttatta cgccggcggg ctcagagggg tctcgcctcc aaattctgag caaaatgggg 120 tctaagcatg gtggctttta tatgttacag aagcaaatgg ttacaaagtc tgcacaagat 180 tgaggtctag tcacgtgtta ctctatagtt gcttagtcat gctattttgc agttatgcac 240 aagattgaaa tctagtcaaa ggtcactcta cagttgctta gtcatgctag tttgcagtta 300 tgctactatt aggaagggag tggaaaaatt ctcccttaca 340 // ID L1P4d_5end repbase; DNA; PRI; 2035 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE L1 Non-LTR Retrotransposon from primates. XX KW L1; Non-LTR Retrotransposon; Transposable Element; LINE; KW L1P4d_5end. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-2035 RA Smit A.F.; RT "L1P4d_5end - L1 Non-LTR Retrotransposon from primates."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC L1PA16. 
XX SQ Sequence 2035 BP; 521 A; 632 C; 572 G; 300 T; 10 other; gacaagatgg ccgactagac gcagccagga aacgcctctc ccactgagag aaaccaaaat 60 atcgagtaaa ccatcacact ttgaacagat cttttgagag aaaacactga aagtcgatag 120 agaggcgacg cagacaccga ggctgaagag ggaggaagct gggaagcctg catggagtcg 180 ctgagcgcca ggaccggctc ctggtcctga gcaggtccta aggaaggggt gagtgaagga 240 actccagggc accacactcc cactacggac ctctgggatc ctagctacaa gagatcccat 300 gacccccata gacatttgaa ttggcagggg gaactgcccg gagagtaggc agaggcagag 360 ctcgaacctg catggagccc agaaggtttc gcgcggggac agctgcagca aaacgtgacc 420 ataggcgccc atcccccaag gctctccatc ttgctctgag tgactntagc ccctgctgac 480 tgccgggccg ggagagagca gggctgcctt tcctgcggga ccggggcgca tctgatctgc 540 atgccccctt gtccaccggc ccctcccaag gcccctgcct ggccgctccc gcaagagggt 600 gcacacagca cagcctccac tgccccgcct gagtgttttg ccggtggcct gggagcagtt 660 cggccccccc agcacagccg gtgctcgacc ccgaggggcc agaggacaaa gccgcgggcc 720 nggtcccaac cccccagggt ttgagcacac cgcccagggg tatcgagctg agatctgtgg 780 ccngagctcg agcgggggag gagcccccac tctcagaaca ctgagaagag tgaggcgcgg 840 gttcgtgtgc cggcgtggga gctgggcatc cctccctctg caagaccggt ccgggaaggg 900 tgtagcctgt tggccagcca cagcttctgc ccgagggagc cccacagcct ggaacacctg 960 gaacagccca gcgatctggg cgcagaaggc ttgggacaaa actagctggt cgggcctgct 1020 cctggggcag acaccggagg gagacccggt cgggggagcg cgagctgggc ggnccccaca 1080 gccgtctgct gggcaaaaaa ccccgggccg cgggcgccac accagctgca cacccatggc 1140 accaccgccc tgcctgggga tcctccgccc ttgacccact gcatcaccag accacccgca 1200 gacatacccc acaacctgct ctgactctgc caagcncaga ggaccagcgg gtccccgggg 1260 agttgcgggt ctcctggtga cctaaccttc ggctcgggcc gcccctaagg gaggggggag 1320 tgcagcctgc cagggccccc cttggggcta aggaaacgca ggcgcggtgc cagtgattgg 1380 agggggctcc cccaaggccc aggaacggac ttggcgaggg ggtcatctct cgcccccctc 1440 catccctccc cccagagcac tgctgcgnat gcgctgaaat acaaaagagg cgcgtggctg 1500 agtaagagcc tatctgccgg cccttactct taagcaccat ctactggatc gcagcctgaa 1560 ttacaccacc aaaaanaaat tccttcagca cacancgcct gtgaaaccca atgcaggaaa 1620 ctagccacaa ntaaggaacc cgtacagagc 
cttggccctc tgaaagcacc cagaaacgaa 1680 gccaatcaac tatacacaac atacaccaca gtcaaaccct caagggaaaa aagaatataa 1740 aaacaaaaag ccccatccaa acgacagcaa cttcaaaaag ataaagaaac accagccctc 1800 tcagatgaga aggaatcagc gcaagaactc cggcaattca aaaagtcaga gtgtttcctt 1860 acctccaaan gattgcacta gctccccagc aatggatcct aaccagattg aaatgtctga 1920 aatgacagac atagaattca gaatctggat ggcaaggaag ctcaatgaga ttcaggagaa 1980 agttgaaacc caatccaagg aagccagtaa aatgatccaa gagttgaaag acgac 2035 // ID CYN-III5 repbase; DNA; PRI; 258 BP. XX AC . XX DT 02-MAY-2006 (Rel. 11.04, Created) DT 02-MAY-2006 (Rel. 11.04, Last updated, Version 1) XX DE SINE element from Cynocephalus - consensus sequence. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; CYN-I; CYN-III5. XX OS Cynocephalus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Dermoptera; Cynocephalidae. XX RN [1] RP 1-258 RA Schmitz J. and Zischler H.; RT "A novel family of tRNA-derived SINEs in the colugo and two new RT retrotransposable markers separating dermopterans from RT primates."; RL Mol Phylogenet Evol 28(2), 341-349 (2003). XX DR [1] (Consensus) XX SQ Sequence 258 BP; 64 A; 75 C; 87 G; 32 T; 0 other; ggctggcccg gtggcgcact ggataagtgc ggcgcactgg ataagtgcgc cgcttgggaa 60 gcgcggcggc gctcccgccc gagggttcgg atcccacata cagaccggct cccgctcact 120 ggctgagcga ggcgcgggag caacaccgag ggttgcaatc ccgttgccgg tccccggccc 180 ggtacggggg caacactgag ggttgcgatc cgttgccgga cacggaaaaa gacaaaagac 240 aaaaaaaaaa aaaaaaaa 258 // ID Tigger3b repbase; DNA; PRI; 1231 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE Mariner DNA transposon from primates. XX KW Mariner/Tc1; DNA transposon; Transposable Element; GOLEM_B; KW mariner; Tigger3b. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. 
XX RN [1] RP 1-1231 RA Smit A.F.; RT "Tigger3b - Mariner DNA transposon from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC (MER7B) 15% div. XX SQ Sequence 1231 BP; 399 A; 218 C; 235 G; 375 T; 4 other; cagtcatgcg ccgcataacg acgtttcggt caacgacgga ccgcatatac gacggtggtc 60 ccataagatt ataatggagc tgaaaaattc ctatcgccta gtgacgtcgt agccatcgta 120 acgtcgtagc gcaacgcatt actcacgtgt ttgtggtgat gctggtgtaa acaaacctac 180 tgcgctgcca gtcgtataaa agtatagcac atacaattat gtacagtaca taatacttga 240 taatgataat aaacgactat gttactggtt tatgtattta ctatactata ctttttatcg 300 ttattttaga gtgtactcct tctacttatt aaaaaaaaaa gttaactgta aaacagcctc 360 aggcaggtcc ttcaggaggt attccagaag aaggcattgt tatcatagga gatgacagct 420 ccatgcgtgt tattgcccct gaagaccttc cagtgggaca agatgtggag gtggaagaca 480 gtgatattga tgatcctgac cctgtgtagg cctaggctaa tgtgtgtgtt tgtgtcttag 540 tttttaacaa aaaagtttaa aaagtaaaaa aaaaataawt ttaaaaatag aaaaaagctt 600 atagaataag gatataaaga aagaaaatat ttttgtacag ctgtacaatg tgtttgtgtt 660 ttaagctaag tgttattaca aaagagtcaa aaagttwaaa aaattwaaaa gtttataaag 720 taaaaaagtt acagtaagct aaggttaatt tattattgaa gaaagaaaaa tattttwaat 780 aaatttagtg tagcctaagt gtacagtgtt tataaagtct acagtagtgt acagtaatgt 840 cctaggcctt cacattcact caccactcac tcactgactc acccagagca acttccagtc 900 ctgcaagctc cattcatggt aagtgcccta tacaggtgta ccatttttta tcttttatac 960 cgtattttta ctgtaccttt tctatgttta gatatgttta gatacacaaa tacttaccat 1020 tgtgttacaa ttgcctacag tattcagtac agtaacatgc tgtacaggtt tgtagcctag 1080 gagcaatagg ctataccata tagcctaggt gtgtagtagg ctataccatc taggtttgtg 1140 taagtacact ctatgatgtt cgcacaacga cgaaatcgcc taacgacgca tttctcagaa 1200 cgtatccccg tcgttaagcg acgcatgact g 1231 // ID LTR17_OG repbase; DNA; PRI; 434 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.11, Created) DT 18-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR17_OG. 
XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-434 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2863-2863 (2009). XX DR [1] (Consensus) XX CC >90% identical to consensus. 6bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 434 BP; 87 A; 118 C; 120 G; 109 T; 0 other; tgaggcgaga aagtaaaagc agaactaaaa gaatctatcc tagccgctct gtgtaggcct 60 cagccaagcc gcaggcgttt ctgctgtctg cgtgtttctc ccctgacata cgcggggcct 120 agattgtaac ctagtaactt ccttgggttt ctgttgcctt tggggcaggt ggtccgaccg 180 gtctcaagga cgcgggcgaa tctgaccttc ccccttgagc ccgggacctt gaggatgatg 240 tcatgtctct ctgtcttatg accctcacgt gcactcccac cctgaaaggt ggataaaaga 300 gaggcaagag attgagtcgg ggggctcggt tttgggaaac acgctggtgt tccgagtacc 360 cccggccgaa taaacccgct tcctctaatc aactggtgcc tggagtcttc tgtctgcgtc 420 ggtttcctgc taca 434 // ID L1Pt repbase; DNA; PRI; 902 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE L1 Non-LTR Retrotransposon from Pan troglodytes. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1Pt. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-902 RA Smit A.F.; RT "L1Pt - L1 Non-LTR Retrotransposon from Pan troglodytes."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC lib20040702. 
XX SQ Sequence 902 BP; 346 A; 174 C; 187 G; 195 T; 0 other; ctaatatcca gaatctacag tgaactcaaa caaatttaca agaaaaaaac aaacaacccc 60 atcaaaaagt gggcgaagga catgaacagg cacttctcaa aagaagacat ttatgcagcc 120 aaaaaacaca tgaaaaaatg ctcatcatca ctggccatca gagaaatgca aatcaaaacc 180 actatgagat atcatctcac accagttaga atggcaatca ttaaaaagtc aggaaacaac 240 aggtgctgga gaggatgtgg agaaatagga acacttttac actgttggtg ggactgtaaa 300 ctagttcaac cattgtggaa gtcagtgtgg cgattcctca gggatctaga actagaaata 360 ccatttgacc cagccatccc attactgggt atatacccaa atgactataa atcatgctgc 420 tataaagaca catgcacacg tatgtttatt gcggcattat tcacaatagc aaagacttgg 480 aaccaaccca aatgtccaac aatgatagac tggattaaga aaatgtggca catatacacc 540 atggaatact atgcagccat aaaaaatgat gagttcatat cctttgtagg gacatggatg 600 aaattggaaa ccatcattct cagtaaacta tcgcaagaac aaaaaaccaa acaccgcata 660 ttctcactca taggtgggaa ttgaacaatg agatcacatg gacacaggaa ggggaatatc 720 atactctggg gactgtggtg gggagggggg aggggggagg gatagcattg ggagatatac 780 ctaatgctag atgacgagtt agtgggtgca gcgcaccagc atggcacatg tatacatatg 840 taactaacct gcacaatgtg cacatgtacc ctaaaactta aagtataata aaaaaaaaaa 900 aa 902 // ID CYN-II1 repbase; DNA; PRI; 177 BP. XX AC . XX DT 02-MAY-2006 (Rel. 11.04, Created) DT 02-MAY-2006 (Rel. 11.04, Last updated, Version 1) XX DE SINE element from Cynocephalus - consensus sequence. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; CYN-I; CYN-II1. XX OS Cynocephalus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Dermoptera; Cynocephalidae. XX RN [1] RP 1-177 RA Schmitz J. and Zischler H.; RT "A novel family of tRNA-derived SINEs in the colugo and two new RT retrotransposable markers separating dermopterans from RT primates."; RL Mol Phylogenet Evol 28(2), 341-349 (2003). 
XX DR [1] (Consensus) XX SQ Sequence 177 BP; 50 A; 43 C; 47 G; 37 T; 0 other; ggccggcccg tggctcactc gggagagtgt ggtgctgata acaccaaggc cccgggttcg 60 gatcccatat agggatggcc ggttcgctca ctggctgagc gtggtgctga caacaccaag 120 tcaagggtta agatcccctt accagtcatc tttttaaaaa aaaataaaaa taaaaaa 177 // ID L1-2_TS repbase; DNA; PRI; 6474 BP. XX AC . XX DT 10-APR-2010 (Rel. 15.05, Created) DT 10-APR-2010 (Rel. 15.07, Last updated, Version 4) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-2_TS. XX NM L1-2_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-6474 RA Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(5), 768-768 (2010). XX DR [1] (Consensus) XX CC ~90% identical to consensus. ORFs corrupted by mutations. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 6474 BP; 2436 A; 1463 C; 1219 G; 1344 T; 12 other; ggatttctgg caagatggca accagacagg tcagactgtg agagtctcca caaaagcaag 60 tgtccataaa gactctgtgt gagtaggcgt gtgctggtgg gtgagtgaaa gtccgtctga 120 ggggagactc gggttcatgc accgggcccc cctccgaagg tgactgcaag cccccaccac 180 cctgagattg gagccacagg cgtgcaggat catctgctgt tccaggactc tcctgagaag 240 agcctgccga atccagctga ttcccatcac ctggcgcccg gcccacacca tcacccgtta 300 tagcgaagca gtgggnccca gatcggtggg gagagccgga gagaaagccc catccagtga 360 ggagacgccg accgcagcca ttttctctgt cccgggacca ggcagagacc tgcattgccc 420 cgcccagctc cgcccagcct ccaacttggg acctaaggta atctcatcaa actgggagag 480 acagaagccc ctcccccatg ctagatcgga cacagacagg tcagatctga gtggcagact 540 tgacaggcag caagccggct gagtccagat ctgatccacc ctccttccac agcaggcctg 600 gggcatctga cccnagctgt gagcaaccct accagcctgc gatgagagct agagatcaga 660 gctgaaagga cgttggactt acaggaacga cctcagagcg agtttggttc cttgttttta 720 ttttttgtct gttttgctcg tttttgtttg ttttattttt tctctttctg tttttgataa 780 cgttttcgta cttgctttgt ttctgttatt ttactgattt ttttttctnt tctttagact 840 tcctgtttgt gtaggggaaa tacatcaata gggaggttgt tgttggcttg tgtgtttgta 900 tgattgtctg tatgttttag tttttttttt ttactctttt ctgtctttgt gtctcttttc 960 atttgttagt cggtacgtnt gtctgttttc cctcttactc ttctctctct ccaccttcct 1020 cttncccccc tccttccctt tccattcttt tcctttttct tacctcattt tgctttcagg 1080 gagcaaacct atacccccac tgcatggaca gattgcaagg cagtgggaag tgggttccag 1140 ggtacccata cacctgtcgg tggatactgg gatctgcaga tagattttga atcacagctc 1200 ccccctcaat taaatctcaa aactgcaaaa agtacagccc ccaacccaca gtccagcttc 1260 agaattgaac aaagccagag gatccacata aggaagaata ggagattgag caaacagaag 1320 taaaagcaac cccgccatcc ctgaacaaca gaattaaaga agggggggaa aaggggggga 1380 ggaaaatcta cacaaatgaa gaaaaaccaa aagaagaata tgggtctcac acagacccct 1440 gggaggtcag acactgagaa aacagacttc ggaacgcaaa caatgaagag cccccagaat 1500 gactggtcnc anactacaaa cccagacatc aagacattaa tagagagaat aaaaagaatt 1560 gaggagagac aagaggaaaa taggaaggag ttgataactg agataacagc agtaaagaat 1620 actgtgantg aaataaataa caaactgata 
agcatggaaa gcagaattag ccaagcagaa 1680 gaaagaatct cagagcttga ggaccaaaat atagaactaa cccaaactgt caaaaacata 1740 gaaaagaagc ttaaaaagac agaacaaaac cttcaagaga tgagcgatta tttcaagagg 1800 ccgaacctaa gagtaattgg acttcctgag gcagaaagag agacagagac caccctggaa 1860 caaacattcc atgaaattat tcaagaaaac ttccctcatc tcatcagtga tgcgaaaatt 1920 caaacacaag agattcagag aacccctgca agacaacaaa tgagaagacc aactcccaga 1980 cacatagtaa ttcgcctaaa caaagtaggt ataaaagaaa aaatcctaaa ggcagcaaga 2040 gaaaaaggtc agactaccta ccggggaaga ccaattagaa tagcagcaga tttatctaca 2100 gaaacacttc aggctaggag agcttggagc ccaatcttca aagttctcaa agataaacaa 2160 tttcaaccaa gaataaccta cccagctaag ctaagcttca tcagtgaggg agaattaaaa 2220 tctttcccag acattcaatc cctaagaact tacgctgcct ctagaccacc tctacaagaa 2280 acacttaaga aagtattaaa cacagaagaa aaggaaaaaa gaacgacaac gttcttcaca 2340 agagtgcagg aaaaagattg aaaacacaca tgaatcaacc caaaaatcaa aagaaagaca 2400 acaaacaaac aggaacaaca acaactctat aagaacctca tgacagggat aaactctcac 2460 atttcaataa ttagcctgaa tgtgaatgga ctaaatgcac cactgaaaag acatagaatg 2520 gcaaaatgga taaaatatca tgaggcaaca atatattgtc tccaagagac tcatctcacc 2580 agaaaggaca ctcacagact caaagtaaga ggatgggaag caaaatttca ggcgaacgga 2640 acacaaaaga aagcaggagt tgcgatctta atatcagaca aaataccctt taagctatca 2700 aaaatttaaa aagatacaga aggtcactat ataatgataa aaggttcaat ccatcaacaa 2760 gaaatatcca tcctaaacat atatgcaccc aacataggag caccaacttt cataaagcag 2820 cttctaggca aacttaaaaa agatattgac tctaacacta tcatagctgg ggactttaat 2880 accccactca caaccctaga cagatcatca ggacaaaaaa tcagcaagga gatccggaac 2940 ctcaatgtga ctcttgacca aatggactta attgatacct acagaacact ccacccaacg 3000 accacagaat atacattcta ctcatcaccg catggaacgt actctaagat cgaccacatc 3060 cttggccata aatcaagcat aaacaaattt cataagattg aaattttgcc atgcaccttc 3120 tcagaccaca gtggaataaa aataaatatc aacaccaaca aagttccccc naaacccaca 3180 aagacatgga cactaaacag catgatgcta agcaactcct gggtcaacat ggaaatcaaa 3240 acagagatta aaagatacct ggaaacaaat gaaaatgaag aaacatctta ccaaaacctc 3300 tgggatgcca tgaaagcagt agtaagaggg gaattcatat 
ctctacaaac gcacatgaag 3360 aaaatggaaa gatcacaagt taacagccta acaagtcacc taaggaagct ggaaaagcaa 3420 gaccaccaaa accctaactt cagcagaaga atccagatca ccaaaataaa agcccaaatc 3480 cgggacatag aagacaaaaa gataatacaa aaaatcaatg aaacaaaaag ctggttcttt 3540 gaaaggataa acaagatcga tggtccccta gctagactga ccaagaaaaa gagagaaaaa 3600 gcccaaataa acacaatcag aaacacaaaa gatgaagtca catctgaccc tgaagaaata 3660 caaaagatta tcagagacta ctatgtacac ttgtatggaa acaaacttga aaacctaaag 3720 gaaatggagg actttctgac atcacacaac ctccctaggt tgaaacaaga agaaatcgag 3780 accctaaata gaccaataac aatccaggaa attgactatg tcataagaaa actacctaca 3840 aaaaaaagcc ctggaccaga tggctttcca gcagaattct acaaaacata caaggaggaa 3900 ctgataccaa tcctactgaa agtattccag gcgattgaga aagatggaac tctccccaaa 3960 tcattttatg aagctaacat cacattgata cccaagccag gtaaagaccc aacaaagaaa 4020 gagaactaca ggccaatatc cttgatgaac atagatgcta aaattctcaa caagatccta 4080 gcaaaccgga ttcaacaaca catctcaaaa atcatccacc acgaccaagt aggcttcatc 4140 cccgggatgc aaggctggtt caacattcgt aagaccataa acgtaattaa atacatcaac 4200 agacgtaaaa acaaaaacca catgattata tcattagatg cagaaaaagc ttttgataaa 4260 atccagcacc ccttcttgat aaaaaccctc gaacatctag gcatagatgg aacatacctc 4320 aaaatagtaa gagccatcta cgagaaaccc acagccagca tactgctaaa cagacagaaa 4380 ttggaaccat ttcccctgaa aactggaaca agacaaggat gcccactctc acccctcctg 4440 ttcaacatag ttttggaagt cctagctaga gcaatcagag aagagaaggc gatcaggggt 4500 atccaaatag gaaaagagga agtcaagtta tctctctttg cagacgacat gattgtgtac 4560 cttgaaaacc caagagaatc tgtcaaaaac ctccttgcac tgataaagga ctttggcaaa 4620 gtctcagggt ataaaataaa tgtgcaaaag acaatcgcat ttctatacac caataataaa 4680 caaacagaaa cccaaataaa aagcacaatt ccattcacaa tagccacaaa aaaaatgaaa 4740 taccttggca tcttcctaac cagagacgtg aaagaccttt acaatgaaaa ctacaaaaca 4800 ctgctcaaag aaatcaaaga tgacacaaac aagtggaaaa atattccatg ctcatggatt 4860 ggaagaatca acattgttaa gatgtccatt ctacctaagg caatctacag atttaacgca 4920 atacccatca aattaccagc aacattcttc tcagacctag aaacaacaat acaggaattc 4980 atatggaaac ataaacgacc aagaatagcc aaaacaatcc tcagcaaaaa 
aaaacaaagc 5040 aggaggtatc acactcccag actttaaact ttattataaa gctacaataa tcaaaacagc 5100 ctggtattgg tacaagaaca ggcatataga ccaatggaat agaattgaga ttccagaagc 5160 aaaacctcaa tttctcaacc aactcatctt cgacaaagcc tccaccaccn accactgggg 5220 agaggagaac ctattcagta aatggtgctg ggaaaactgg ctgaccacat gcagaagatt 5280 gaaacaggac ccctacctat ccccatacac aaaaattaac tctaaatgga tcagagacct 5340 aaacgtaaaa cctcaaacta taagaacctt agaaaatgca ggagacactc ttatggaaat 5400 tggaactggc aaccaattcc tgatcaaaac ccgaagtgcc caggccataa gagataagat 5460 agacaagtgg gacctcatca aactgacaag cttctgcaaa gccaaagaaa ccatcaagag 5520 agcagggaga cagcccacag actgggaaaa aatatttgcc aactccatgt ctgacaaagg 5580 cctaacatct aggatctaca aggaactcaa acgtgctgaa aagaaaaaaa caaacagccc 5640 cattacaaag tgggcaaaag atatgaatag acagttctca aaagaagaca tacgagcagc 5700 caacagatac atgaaaaaat gttcagcctc actagtcatc aaggagatgc aaattaaaac 5760 tacactgaga taccacctaa ctccagtcag aatggccatc atcaataact caaaaaataa 5820 cagatgctgg agaggatgtg gcgaaaaggg aacacttcta cactgttggt gggagtgtaa 5880 actagtacag cctctgtgga aaacagtgtg gcgattccta aaagttctaa aaatcgacct 5940 tccatatgac cccgcaatcc ccctactggg aatatacccg gaagaactca aatcactcta 6000 taaaaaagat acctgcacac gtatgtttat cgcagcattg ttcacaatag caagaacatg 6060 gaaccaaccg tgctgcccat caaaagagga ctggattnaa aaaatgtggt acatatacac 6120 gatggaatat tacgcagcca taaagaagaa caaaatcatg aacttcgcag caacctggat 6180 ggaattagag tctataatac tgagtgatct ttcacagaaa caaagaactg agtatcacat 6240 gttctcactc ataagtggac cttgaacaac taccgtaata ctataagaaa aggattgaca 6300 gtagcgggaa actgcggggg ggaggggagg gaggtgggat tgacagtagt ngaaatctgc 6360 ctggggaagg gggggcacac ctcatcaaca agggtacctg cacaatgaat atttgtatac 6420 ctaaccctga attgtacccc acaattttaa aataaaaaat attgattaaa aaaa 6474 // ID hAT-2_TS repbase; DNA; PRI; 3541 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.1, Created) DT 21-APR-2009 (Rel. 
15.07, Last updated, Version 2) XX DE hAT-2_TS is a family of autonomous DNA elements found also in DE Anolis carolinensis, Microcebus murinus, Myotis lucifugus, DE Monodelphis domestica, Otolemur garnettii, Echinops telfairi, DE Xenopus tropicalis and Schmidtea mediterranea. About five DE elements exist in the genome at 3381bp in length. XX KW hAT; DNA transposon; Transposable Element; hAT-2_TS. XX NM hAT-2_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-3541 RA Novick P.A., Smith J., Ray D. and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1522..3327 FT /product="hAT-2_TS_1p" FT /translation="MMSRKRKIDSECRIFKEQWTYDYFFMQYKERAVCLIC FT QNIVSVFKEYNLRRHYQTQHKDKYDCLVGEVRKEKILKLKNTLTTQQNTFV FT KQKQLNISSLRASFQVAKLISMHWQTICGGEFVKECLLSVAKEMCPEKADL FT FSTVSLXGPTITRRIEEMGDNLHQHLQNSTKKLSYFSLALDESNDVRDSAQ FT LLIFIRGTNDYFEVTEELAALQSIKGTTTGEDIYEKXCQTVNGLELDWAKL FT ASVTTDGAPSMVGSKKGVIARINQEMGKHNHSHPIAIHCLIHQQALCSKSL FT KWDSVMKIVVSCVNFIKANALNHRQFQEFLSELNVTYEDVLYHTEVRWLSR FT GRVLKHFYDLLPQITAFLLSKNKEVPELNDAEWKWHLXFLTDVTELLNSFN FT VQLQGKGKLICDMQLHVKAFEVKLGLLIKQVKEENFFHLPTTQNLLAEKPL FT VAFPNKTCVDSLEKLQKEFQFRFKELHLHEQDIQLFRNPFSIDIENMDTIY FT QMELAELQNCDSLKDAFKSSSLPNFYASLPSETYPNIRNHVLKMATVFGST FT YVCEQTFSRMKHLKSPTRSRLTDAHLHQLLRLAVTNMEPDIDHLISQKEAH FT SSH*" XX SQ Sequence 3541 BP; 1013 A; 673 C; 788 G; 1061 T; 6 other; caggggtcct caaactacgg cccgcgggcc acatgcagcc cgccaaggac atttatctgg 60 cccaccgggt gtttttgccg ccgctgcctg tcctgcctag cagccaactc gtccgggcca 120 cagtgcgcat gtgtggaatg tacatctctc tctccgactt ccctccttct ctctgtctct 180 cggctcctcc tttcaatctc gggtgtgatt gaacgagtca tgagcttgcc tatgcagagc 240 ctgctgctgc ctgaggaccg aggtaagaac aagttaggat tttttttttt tgaagttagg 300 aggtctattt ttttttttaa
ttttgcagtt agtagggcct tttatttgta gttaaggggg 360 gccttttttt ttctgaagtt aggaggtcta tttttttttt tttgcagatg ggggcacctt 420 ttatttttga agttaggaga gccttttttt gaagttagga gagccttttt ttaaagatag 480 gagagccttt tcttttttga agttggttag ttggttgggg ttggtttctg gggggggtgc 540 atcacagtga taatgcaaat agcactcagt gctaatgcaa atggtcagtg ctcagaggta 600 atgcaaatgg tcagcactca gaggtaatgc aaatagtcag tgctcagtgg taatgataat 660 tgtaagtgct cagtgttaat gcaaatggtc agcactcagt attaatgcaa attatcagtg 720 gtcagtgtta tcgcaaatgg tcagtagtca gtgttaatgc aaatggtcag tgctcagtgt 780 taatgcaaat agtcagtgct cagtgttatc gcatgggggc cccaaactgg taatctgcct 840 agggccccat gggaacttaa tcctgctctg cagacagchg aggagtagga aacccaattt 900 aattgacagt aagtgcattt gtattctgat tgctattcag ttgtgtatga tgttgtatgt 960 tgtgtgatgt gtaagccctg gttcacactg ttgcaatctc tgagcagtgc gagttcagcc 1020 atatgcttgt atggctgaac ttgcattaga ttcggaagaa aaaaggcata cgtaccttct 1080 tttttcctgc agtggaatct gattghatgd gtcttcttac ccatgcaatc agattcctgt 1140 gcgagttcac agatcgcagt gtggttcgca cagggtagtg tgaactggaa aggtggtgga 1200 ggaaccggct ctgtaatcgt gccagttccc gcaccgcacc agtgtgagcc tgaggtaaaa 1260 gcagagttcc accaacatgg gcactggtga ggctgaaatg atgtggactg gtaaggctac 1320 attgatggac actgatcaga ctgcattgat ggacactgat cagactgcat tgatgggcag 1380 tgcagtctgt atgtctctgt gtgggcaaag ttattgctgg tatattgttt ttgtagcgct 1440 gtgtgtgtgt atatatatta tatatatata tgtatatata tatgtatttt actaatagca 1500 atttggaatc cctaggaaac aatgatgtca agaaagagaa aaattgactc ggagtgtaga 1560 atattcaaag aacagtggac ttatgattac tttttcatgc agtacaagga aagagctgtg 1620 tgtttgatat gccagaatat agtgtctgtg ttcaaagaat acaatttgcg tcgacactat 1680 caaactcaac ataaagacaa atatgattgt ttggtcggag aagtgagaaa agaaaaaata 1740 ttaaaactga aaaatacatt gacaactcag caaaatactt ttgtgaagca gaagcagcta 1800 aatatttcat cactacgagc aagttttcaa gttgccaagc taataagcat gcactggcag 1860 accatttgtg ggggagaatt tgttaaagaa tgccttcttt ctgttgccaa agagatgtgt 1920 ccagagaagg ccgatttatt tagtacagtg agtctttbag gacctacaat tacacgaagg 1980 attgaagaaa tgggagacaa tttgcatcag catttgcaaa 
actccacaaa aaaactttcc 2040 tatttttcct tggcactcga cgaaagcaat gatgttcgtg attctgcaca acttctaatt 2100 tttattcgtg ggacaaatga ctatttcgaa gtcacagaag agcttgctgc actgcaaagc 2160 atcaaaggaa caactacagg agaggatatc tatgaaaagb tttgccaaac tgtgaatggt 2220 ttggagctgg actgggctaa actagccagt gtgacaactg atggtgctcc tagcatggtg 2280 gggtctaaga aaggagtaat tgctcgcatt aaccaagaga tgggcaaaca taaccattct 2340 catccaatag ccatacactg cctcatccac caacaagcgt tgtgtagtaa atcactgaag 2400 tgggactctg ttatgaaaat tgtggtatct tgtgttaact tcattaaagc taatgcacta 2460 aaccacagac aatttcagga atttctgtct gagctaaatg ttacctatga agatgttctg 2520 taccacacag aagtccgttg gctgagtcga gggagagttt tgaaacattt ctatgactta 2580 cttccacaga ttacagcttt tctgctttca aaaaacaaag aagtaccaga gctcaatgat 2640 gcagaatgga aatggcacct tgbctttctg acagatgtaa cagagctact caacagtttc 2700 aatgtgcaac ttcaaggaaa ggggaagctc atctgtgata tgcaattaca tgtgaaagca 2760 tttgaagtaa aattaggcct cctcatcaaa caagtgaagg aggaaaactt cttccatctc 2820 cccacaactc aaaatctgtt agcggaaaaa ccattggttg cattcccaaa caaaacatgt 2880 gtggattcac tggaaaagtt gcaaaaggag ttccaattta gatttaaaga gcttcatctc 2940 catgaacagg acatacagct tttccgtaac ccattttcta ttgacattga aaatatggat 3000 acaatttacc aaatggaact ggctgaactg cagaattgtg actctctgaa agacgcattc 3060 aagtcaagca gccttcctaa tttctatgca tctctcccct ctgagacata tcctaatatc 3120 aggaaccatg tactcaaaat ggcaactgtc tttggcagca cttatgtctg tgaacagact 3180 ttttccagaa tgaaacatct gaaatctcca accagatcta gactaactga tgcacacttg 3240 catcaattgt tacgactagc agtaacaaat atggaaccgg acattgacca tctcattagc 3300 caaaaagagg cccatagttc ccattgaaat actggtaagt ttgttgattt aactttactt 3360 gttcttcatt ttaaatattg tatttgttcc cattttgttt ttttcacttc aaaataagat 3420 atgtgcagtg tgcataggaa tttgttcaca gttttttttt tttttaaact atagtccgcc 3480 cctccaacgg tctgaggaac agtgaactgg cccccttgtt ttaaaagttt gaggacccct 3540 g 3541 // ID LTR22_OG repbase; DNA; PRI; 337 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 
16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR22_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-337 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1594-1594 (2011). XX DR [1] (Consensus) XX CC ~91% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 337 BP; 75 A; 102 C; 70 G; 90 T; 0 other; tgttatgccc caactccatc ttgctaaggg ctttcaacac tcccccttta agacaggaag 60 ctcagctgaa agagtttagt aataactgca aaaccccagc ttccccttgt aaggactccc 120 cttaccaggt gttggctctt gtcaaattcg gcttgcatta aggactcccc ctcccctacc 180 gaaagcccct tataaaaggt ttccacccct gcttctaggg gtcgagagac tttgaggcgt 240 gagcccgctc tcggcccggc cgatcaataa aggacccttg cttaatttgg actcagtgtg 300 ctggcgtttc tctattccac tcgcttcggg tataaca 337 // ID LTR24_OG repbase; DNA; PRI; 554 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR24_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-554 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1596-1596 (2011). XX DR [1] (Consensus) XX CC ~96% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 554 BP; 130 A; 160 C; 119 G; 145 T; 0 other; tgtgggagcc aaactaatta actcaagtca caagtcaaac tacctccttt tgttgtttat 60 tcagaacaat aggccgccgc ataccaagga ccttatcata cctaggtgtt acagcccaca 120 tcctgacccc ctcccctttg tcccactagc ccccttgtgc actcagccca catcctgacc 180 ccctcccctt tttgcactcg ccactctact ctgcactcgc cccacttgcg acctttatcc 240 cttccttctc attcgctcac ctgagacaaa gaagactcgt ccctgagcca atcaaccctg 300 aacacgacca ccctgtacct gtcagtttag gacagacatg gtttacctac caataagggg 360 atggggtggg atggtctaat gggagggaag agggagaggt ataaagggac tttctgaata 420 gtctgtaagg gctgttcctg ctttttctta aagggcagcc caggctctga gcttgaatga 480 ataaaattcc tcgtgccttt gcagcgatgt tcggactctc agagtggtgc ttgaattgag 540 tctgggggtt aaca 554 // ID MacERVK1_LTR1e repbase; DNA; PRI; 320 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Cercopithecidae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; LTR; KW MacERVK1_LTR1e. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-320 RA Smit A.F.; RT "MacERVK1_LTR1e - ERV2 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 4%. XX SQ Sequence 320 BP; 84 A; 73 C; 83 G; 80 T; 0 other; tgtagaggac tacgtgctcg caaacggggc gttcccgata agtcctgctc tcgcaaacga 60 agcagggcgt tcccgataag tcctgctctt gcaaacgaag cagggcgttg ggggcttgtt 120 tatgtgtaaa catcttgaaa atccagaaag tcagggaaag gtcagaaaaa caacaatgtg 180 tcttgtgact tggcaacatt ccacaaacga ctgtataaaa taaagcagag cgcgccattc 240 gaggcggccg ccatgtttgt cttgtcttgt gttgtcttgt gtgttcattc ctttgtttag 300 gaaacacgcg gaccccaaca 320 // ID ERV1-5_TSy-LTR repbase; DNA; PRI; 691 BP. XX AC . XX DT 16-FEB-2010 (Rel. 15.09, Created) DT 16-FEB-2010 (Rel. 
15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-5_TSy-LTR. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-691 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1256-1256 (2010). XX DR [1] (Consensus) XX CC >95% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 691 BP; 196 A; 157 C; 105 G; 233 T; 0 other; tgaaatgagc taaatcagtt aataaaatca aattgctagc aaaataataa tattaaccag 60 cttactccat tttgtttaaa actttcatct tagttacctg cctgacctaa gaatcaatgc 120 ctacgtgtga ttacagtaac gttcccatga ccacagaccc ccagctttgt ctcctttggg 180 caggtcatgc tgaccgtttc ctgctgactg ttacaaaact ccttagccat aagttccatg 240 taattgtatg ctttgttaac tgaaatcaaa tgttacgctt tgcaaaaccc cattgttaca 300 ttttcatatt ttgttcttat attctgttag taacctaacc aaggtaactc cattgtgtaa 360 tgattcttct accttgtctc tatgtaacta ccttaaacta tatcttgatg gcagaactcc 420 agggactttc tggacactgg ctatacctag gacatgaaat taatcaaagc tccctaacaa 480 tcaaaagctc ccattttggt tgccaagttg ttagttaata gttgctatgg tcactaagct 540 gctatgtagc caatactgca aattttacct ataaaagctt gtcacaaact gcaccatttg 600 tcaagagtta tttttcctgg catttgcctg ctcttgacct gcacgttaaa ctcagcttgc 660 tggcgtgact tcttctcact tagtcataac a 691 // ID Tigger3d repbase; DNA; PRI; 321 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE Mariner DNA transposon from primates. XX KW Mariner/Tc1; DNA transposon; Transposable Element; GOLEM_C; KW mariner; Tigger3d. 
XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-321 RA Smit A.F.; RT "Tigger3d - Mariner DNA transposon from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX SQ Sequence 321 BP; 105 A; 63 C; 67 G; 86 T; 0 other; cagtcatgcg ccgcataacg acgtttcggt caacgacgga ccacatatac gacggtggtc 60 ccataagatt ataatggagc atatatagaa acctgatata tggcacttga tattggcatt 120 gcagatcaag taggggaaat gactgatatt cagtaatggt gctgggacat ttggttttcc 180 atatgaaaaa atatatataa ataaaaatat atataccatc taggtttgtg taagtacact 240 ctatgatgtt cgcacaacga caaaatcgcc taacgacgca tttctcagaa cgtatccccg 300 tcgttaagcg acgcatgact g 321 // ID LOR1a_LTR repbase; DNA; PRI; 497 BP. XX AC . XX DT 16-MAR-2006 (Rel. 11.03, Created) DT 10-APR-2007 (Rel. 11.03, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LOR1; KW LOR1a_LTR; LTR retrotransposon. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-497 RA Smit A.F.; RT "LOR1a_LTR - a subfamily of endogenous retroviruses from RT primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC mer4 group, 4bp target site duplication. 
XX SQ Sequence 497 BP; 123 A; 142 C; 89 G; 142 T; 1 other; tgaaaccggc ccaattgtcc catagaactg atgtttatgg tttctttgaa taaacataga 60 aattgaccct cccagtctta aaacttgaga aagttacatt tgtcttatct gagttccttt 120 ctcaggaaac caaccatcag gcctcccaga tagtatcaag gaactgaaac ttaccagatc 180 actgcatccg gacaatgaga cgtcagaccc ctcacccatc atgattgcct aactgaccac 240 ctgcttcctg ttgaccaaat tctcttcctt acccctccct aattcctgtt ttcccgcaca 300 tggttacatt tcttccctgc tatataaacc cctaatttta gtcggtcagg gagatggatt 360 tgagactgat ctcccgtctc ctcggctgca gcacccgatt aaagccttct tccytggcaa 420 tactcattgt ctcagtgatt ggctttctgt gcggcgagca gcaggaccta gaccgaaccc 480 ctggcgtttc ggtaaca 497 // ID LTR1C_OG repbase; DNA; PRI; 562 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 05-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1C_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-562 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1674-1674 (2008). XX DR [1] (Consensus) XX CC 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 562 BP; 146 A; 161 C; 112 G; 143 T; 0 other; tgatacagga ttcccccact cagggctgga cagggactcc cccattctct aggcccaggt 60 gagctcagaa atcagacccc catctctagg cacaaacagg ctgccctttc caggccacac 120 cctctgggct ctagcaaggt catgaacaaa gcagcctgaa aggttttggg tggttccacc 180 ttgagtaggg agcacacacc ctaatccctt ccctgagaca aaaactgcaa agacctacga 240 ataaccttgc ccccgagtgg gagggactgc attcattcat ttgttaatac gctcattaac 300 tttgaaaagg tgtttttcca cctaccaagt cctggccaaa atgtaattct ccagactata 360 tataccccta gacaaaggac ccacgtggag acttcccatg acttgggtct ttccctttgc 420 cagaagctct ggtgcttttt actccttctg tattcctttc tgtctaataa aatcctatct 480 taccaactga tctgtggtcc gtgggttcat tcttcgaatc cccgagacca aggacctact 540 gaagaaagaa attccggtaa ca 562 // ID PTERV2a_LTR repbase; DNA; PRI; 586 BP. XX AC . XX DT 14-MAR-2006 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from Pan DE troglodytes. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW PTERV2a_LTR. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-586 RA Smit A.F.; RT "PTERV2a_LTR - ERV1 Endogenous Retrovirus from Pan troglodytes."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC lib20040702 Only two copies in panTro2. This sequence is from CC the identical LTRs of the element at chrY:21672531-21681611. 
XX SQ Sequence 586 BP; 152 A; 159 C; 137 G; 138 T; 0 other; tgagagaccc tagagaaaaa cgctagacag tctcttaagg aaaaacacca gctagtctct 60 tagagaaaaa caccagatgg ccacttgaaa aacacctgat ggtcaggagt cagggtgtgt 120 cgatggctat caagggcacc agaagcccag ggcagggcag ggagccccgg tccccctccc 180 ctgttccggg aactaaaaac agaaaaacat ttccaggaaa tactgcccgc cacctgtaag 240 ccccctgacc aaccagagtg agacagctgg ctcaggccat aattaagaac caatcagtta 300 ctcccaatcc tcgcgcctta atatgtgacc caatcaaatg tgtgtatcct ccccttgttt 360 gaatttgtaa cctcccccta attgctgtgt aacccctccc ttgttttgat gtgtggtttt 420 tgcctatata aactgtgtga aaacagccgt tcggggtctc tcggcctctt gtgctggaga 480 ccctagcgcg ctagtaataa agtgtgctct ttgctgtgat ctccgtgtcg ggtggtctct 540 ggcggcggcc ccatcccgaa gtagaatctt gagcaagatt ctaaca 586 // ID LTR2_Cja repbase; DNA; PRI; 381 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR2_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-381 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2912-2912 (2009). XX DR [1] (Consensus) XX CC ~94% identical to consensus. 4bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. 
XX SQ Sequence 381 BP; 112 A; 67 C; 99 G; 103 T; 0 other; tgtagcagga gcggccgcgg gaaatggatc tctcggacac cgaatacagg aggaaggagt 60 ttattcggcc gggagcaagg tggcatagtt gcctaacagc caagctcccc taacaaagga 120 aggttcctct ttatataggc ttatacttta gatgttacac gtggaagaat cttaggttta 180 tagcttaaca tatgaaaagg gtctcggttt acagtctcag gaattacata tgaaaagggt 240 ttcggtttac agtcttagga attacatatg aaaaggatct cggtttacag tctcaggaat 300 tacatatgaa aagggcttca gtttacagtc ttaggaatga aaaggggaag ttgatctgag 360 atgcctgggt ctccctcttc a 381 // ID ERV1-1_CJa-I repbase; DNA; PRI; 7158 BP. XX AC . XX DT 12-JAN-2011 (Rel. 16.02, Created) DT 12-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE Internal portion of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; ERV1-1_CJa-I. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-7158 RA Jurka J.; RT "Endogenous retroviruses from the common marmoset."; RL Repbase Reports 11(2), 689-689 (2011). XX DR [1] (Consensus) XX CC >86% identical to consensus. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. CC We thank the Marmoset Genome Sequencing Consortium for making CC their data publicly available. 
XX FH Key Location/Qualifiers FT CDS join(638..928,1172..2653,2708..6235) FT /product="ERV1-1_CJa-I_1p" FT /translation="MTYNLSCLGLALRTAELLHLEYHSQTWSEAQIVNYRP FT HNPRAYSYDGKWGGGGGPGPEGIVPASAMRPAITFFKIVVLRGPSQGVYPP FT RAALWSGLCLGDAPGLAVARNNQGCGRPCVRFQMGATLPKAKSPLKCILDN FT WDQFDPRSLKKKRLIFLCRTAWPRYPLPSGETWPPKGSINYNTXLQLDLFC FT KREGKWIEVPYVQVFFSLRDNPQLCKACNLYPTGPSSGLPPHLGLPVAPPS FT TDADQVSSAPITQEESGKAETAKAPQATRMPQLCPLQAVGGEFGPTKVHTP FT FSLSDLKQIKADLGKFSDDPDKYIDVLQGLGQSFELDWKDIMLLLSQTLTS FT NEREAALAAAQEFGDTWYLSQVHDQMTPEEKDRFPTGRQAVPSTDPRWDPD FT SERGDWSRRHLLTCILEGLRRTRKKPMNYAMLSTITQGKEENPTAFLERLR FT EALRKYTPLSPDSIEGQLILKDKFITQSAADIRRKLQKLALGPEQSLESLL FT NLATSVFYNRDQEEQAERNRRDQRKAAALVMAFRQADPRGSEEGKAWVSRQ FT SGRACYHCGLPGHFKRDCPQRNGPPPRPCPLCQGDHWKAHCPRGGCPGQAP FT AQAITLTEPRVKLTIEGQEVDLLLDTGAAFSVLLSCPGRLSSKSVTVRGVL FT GQAVSRYFSQPLSCNWGDILFSHAFLIMPESPTPLLGRDLLARAGAIIYMN FT IVREEIPLCCPLFKEGINPEVWAMEGQFGRAKNACPIRIRLKDPTSFPYQR FT QYPLRPEAKKGLQDIVRNLKAQGLVKSCNSPCNTPILGVQKPNGQWRLVQD FT LRLINEAVIPLYPAVPNPYTLLSQIPGEAQWFTVPDLKDAFFCVPLHPDSQ FT FLFAFEDPLDQNSQLTWTVLPQGFRDSPHLFGQALAQDLSHFSHPGTLVLQ FT YMDDLLLTANSETQCQQATQDLLNFLATCGYKVSKPKAQLCRQEVKYLGLV FT LSKGTRALDGERLQPILAFPRPKTLKQLRGFLGITGFCRLWIPRYGEIAKP FT LYTLIKETQKANTHLIQWDPIAEAAFQTLKQALLQAPALSLPTGNDFSLYV FT TEKSGVALGVLTQTRGTTPQPVAYLSKEIDAVAKGWPHCLRVVAAVAILVS FT EAVKIVQGKDLTVWTSHDVAGILASKGGLWLSDNRLLKYQALMLEGPVLQL FT LCTCAALNPATFLPEEGADKEHDCQQVIAQNYAAREDLLETPLANPDLNLY FT TDGSSFTENGVRKAGYAVVSDQTVLESNSLPPGTSAQLAELVALTRALKLG FT KGKRINIYTDSKYAYLVLHAHAAIWKEREFLTSAGTPIKYHKEILDLLQAV FT QEPKEVAVLHCRGHQKGDEKEAEGNRRADLEAKRAARQEFIEGPLIWENPL FT QGTRPQYSSEEVEWAISQGHNLLPSGWLATEEGKVLLPAASQWKVLKTLHQ FT TFHMGIENTHRMAKSIFTEKGLFKAVQQIVKGCEICQKNNPLAHRKAPPGE FT QRAGHYPGEDRQMDFTHMPKSQGYQYLLVCVDTYTNWVEAFPCRTEKAQEV FT IKVLINEIIPRFGLPRCLQSDNGPAFKAAVTQGISKALGIQYHLHCAWRPQ FT SSGKVEKMNEILKRHLKKLAQETHLPWPSLLPIALLRARNSPQKIGLSPYE FT MLYGRPFLTNDLVLDKETADLIKDITSLAKYQQALRTLPEGHPREKGKELF FT CPGDLVLVKSLPSDSPSLGPSWEGPFSVILSTPTAVKVVGIDSWIHHTRLK FT 
PWTPPEENHRPSTPQPEVPGDDPSYTCEPLEDLRLLFRRDPQESNYS" XX SQ Sequence 7158 BP; 1916 A; 1873 C; 1696 G; 1672 T; 1 other; ctttcccttc ttttaaattt gttaaggagg taaaaaccgg gcgcctgtca gccatctaag 60 agcgactagc gtggccgctg gctaaagaca cgggtgtcag gctttctggg aacggactct 120 ctaacaaccc ccagctcttc ggaactggga gcgttggtta gccccgaacc agcttccctt 180 tcttgctgtt ctttctttct ctgggccaag ctgagggtcg gcaggaggaa agtggtcccg 240 aactcccggt tcagtccgtt gaggttgtgg ccctgcttag acctcctaag aaaaagccgc 300 ctatgcgagc cacgcccagc ctttcagacc ccaacctcct gtgtccctgg ctctttcctc 360 taagatattt cctacacatg aggctattac ccttcttgct aggactttga ggctcttggg 420 gtggcagtcc agggaccctt gcccgtggtg ccctcaggct tatgcctatg aggacctcct 480 acagtataaa tacacagggc cagttccctt agtccgcagt gctaggttat gtcctaacgg 540 agccagagtc tatcagtggg tactggaggc tataggacat tgtgtttttt tttgtcaccc 600 gcaatttccc aatccaactg agtggtgcca gaatccgatg acgtacaatt tgtcttgcct 660 aggattggct ctccgtacgg cagaacttct ccatctagaa taccatagtc agacctggtc 720 cgaggctcaa atagttaatt acaggcccca caaccctagg gcatattcct acgatggaaa 780 gtggggtggg ggaggaggcc cgggtcctga gggcattgtt ccggcctcgg ccatgcggcc 840 agccattacc ttctttaaga ttgttgtgct tagaggcccc tcccaaggag tttatcctcc 900 tagggcagct ttatggtcag gcctatgtta attgaactct tgctagggaa tattggcacc 960 catcgcccaa gcctcactgg cctggttctt caaggacgcg ccggtacagg gccactaatc 1020 ccggcagtcc ccatgtcctt atggtggcca taaggacatg tagacagtga ggtgtgccca 1080 tgacgccggt aagcgtgatt aaacccggca catggggggc ccagaagatg atcacgcatc 1140 tagggcgcaa ggaaactacg aggtgctata actgggggat gccccagggc tagcagttgc 1200 aaggaataac cagggatgcg ggcgtccctg tgttcgattt cagatgggtg ctactcttcc 1260 caaggcaaaa tcacccttaa agtgtatcct ggataactgg gaccaatttg acccacgatc 1320 tttgaagaag aagcggctca tttttctctg ccgaacagcg tggccgcgat accctctgcc 1380 gagtggagaa acctggcctc ctaagggaag tataaattat aacaccmttc tacaattaga 1440 cctgttctgt aagagggaag gtaaatggat tgaggtcccc tatgtgcaag ttttcttttc 1500 tctaagggat aatcctcagc tatgtaaggc ctgtaaccta taccctactg gcccctcctc 1560 aggactaccc ccacatttag gactccctgt ggctcctccc tccaccgatg 
ctgaccaggt 1620 ctcctcagct cctataaccc aagaggagtc agggaaggcc gaaacggcca aggcacctca 1680 ggctactagg atgccccaac tctgccccct acaggcagta ggaggagaat ttggcccaac 1740 caaggtacat actccctttt cactttctga cttaaagcaa atcaaggcgg acttaggaaa 1800 attttcagat gacccggaca agtacataga tgttctgcag ggcctgggtc agtccttcga 1860 attagattgg aaagatatca tgttattgct tagtcagacc ctaaccagca atgagagaga 1920 ggctgctcta gcagcggctc aagagtttgg ggacacctgg taccttagtc aagtacatga 1980 ccaaatgacg ccagaggaga aggatagatt ccctacaggt agacaagcag tccctagcac 2040 ggaccctcgc tgggatcctg actcagaacg gggggactgg agtcgtaggc acctattgac 2100 ctgcatactt gagggattaa ggcgaactag gaaaaagccc atgaactatg caatgttatc 2160 caccataact cagggaaagg aagaaaaccc aacagccttc cttgagcggt tacgggaggc 2220 cttaagaaag tatactcccc tgtcaccaga ctccatcgag ggtcagctaa tactaaaaga 2280 caaatttatt acgcaatcag cagcggacat taggaggaag ctccagaagc tggccctggg 2340 gccagaacag agtctagaat cattactaaa tctagcaacc tcggtgttct ataatagaga 2400 ccaggaggaa caggctgaaa ggaacaggag agaccagaga aaagccgcag ccctagtcat 2460 ggccttcagg caagcagacc caaggggctc agaagaggga aaggcttggg ttagccgcca 2520 gtctggcaga gcttgctacc actgcggcct accgggacac tttaagagag attgtcccca 2580 gaggaacgga ccacctcctc gtccttgccc actgtgccaa ggggatcact ggaaggcaca 2640 ctgtcccagg ggatgaaggt tcccagggcc tgaggtggct aaccagatga cccaacagca 2700 ggactgaggg tgcccggggc aagcgccagc acaagccatc accctcaccg agccccgggt 2760 aaagctgacc atagagggcc aggaggtcga tctcctcctg gacactggcg cggccttctc 2820 agttttgctt tcctgtcctg gacggttgtc ctccaagtct gtaactgtcc gaggagtcct 2880 aggacaagca gtctcccgat atttctccca gcccctaagc tgtaactggg gagatatact 2940 cttctcccat gccttcctca tcatgccaga aagtcccact cccttactag ggcgggacct 3000 gttagccaga gcaggagcca ttatctatat gaacatcgtc agggaggaaa tccctctctg 3060 ctgcccatta tttaaagaag ggattaatcc cgaggtttgg gcaatggagg gacaatttgg 3120 gcgggcaaag aatgcatgcc caattcgaat aaggctaaag gatcccacct cttttcctta 3180 ccaaagacag taccccctaa gaccggaagc caaaaagggg ctacaggata ttgtaaggaa 3240 cttaaaagct cagggtctag taaagtcctg taacagccct tgtaatactc caattttagg 
3300 agtacaaaaa cctaatggac aatggaggct ggtgcaagat ctcaggctta ttaatgaagc 3360 tgtgatccct ttatacccag ctgtacccaa cccatacacc ttgctctctc aaataccagg 3420 agaggcacaa tggtttacag tcccggacct taaggatgcc tttttctgtg ttccattaca 3480 tcctgactcc cagtttctgt tcgcctttga agacccttta gaccagaatt ctcagctcac 3540 ctggacagtg ttaccccagg gctttcgaga tagcccccac ctgttcggcc aggctctagc 3600 tcaggacctt agtcattttt cacatccagg taccctggtt cttcaatata tggatgactt 3660 acttttaact gctaactcag aaacccagtg tcagcaggcc acccaggacc tcctaaactt 3720 tctagccacc tgtggatata aggtttccaa accaaaagcc caactttgta ggcaagaggt 3780 aaaataccta gggcttgtct tgtctaaggg cactcgggcc cttgatggag aacgtctcca 3840 acccatactg gccttccccc gtccaaagac cttaaaacag ttgcggggat tcttaggaat 3900 aactggtttc tgccgattgt ggattcccag atatggtgaa atagccaaac ccctatacac 3960 tctgattaag gaaacccaaa aagctaacac ccatctaata caatgggacc ctatagctga 4020 ggcagccttt cagaccctga agcaggcact actgcaggct ccagctctaa gccttcctac 4080 agggaacgac ttctccctat atgttacgga aaagtcagga gtggcactgg gggttctcac 4140 ccagactcgg gggaccacgc cacaaccggt ggcataccta agtaaggaaa ttgatgcagt 4200 agcaaaaggc tggccacact gcttgcgggt cgtagccgcg gtggctatac tggtctcaga 4260 agctgttaaa atagtgcaag ggaaagatct aactgtatgg acttcacatg atgtagctgg 4320 aattctggcc tctaaaggag gcttatggct gtccgacaac cgcttactca aatatcaggc 4380 cttaatgctc gaagggcctg tacttcagct gctgtgcaca tgtgcagccc tcaatccagc 4440 tacttttctt cctgaggaag gggcagacaa agaacatgac tgtcagcagg ttatagccca 4500 gaactatgcg gctcgagagg atctcttaga aactcctcta gccaatcctg accttaacct 4560 atacactgac ggaagttcat ttacggaaaa tggagttcga aaggcaggat atgcagtagt 4620 cagtgatcaa acagtgcttg aaagtaattc ccttcctcct ggaactagtg cccagctggc 4680 agaactagta gccctcactc gagctctaaa attaggaaaa ggaaagagaa taaacatata 4740 cacagattcc aaatatgctt acctggttct gcatgcccat gctgccatat ggaaggaaag 4800 agagttccta acctcggcag gaactcccat taaatatcat aaggaaattt tagacttatt 4860 acaagctgtc caagaaccca aagaggtagc agtcttacac tgccggggtc accagaaagg 4920 agatgagaag gaagcagagg gcaatcgccg agctgaccta gaggcaaaaa gggcagctag 4980 
acaagaattt atagaaggac ctttaatatg ggaaaaccct ctccagggaa ctagacctca 5040 atactcatca gaagaggtag agtgggcaat ctcccaggga cataatcttc ttccttcagg 5100 atggctagct actgaggagg gaaaagtact cctacctgct gccagccagt ggaaagtact 5160 taaaaccctg catcaaacct tccatatggg tatagagaat acacatcgaa tggccaaatc 5220 tatatttaca gaaaaaggtc tcttcaaggc tgtccaacaa atagtcaaag gatgtgaaat 5280 atgccagaaa aataatcctt tggcacatcg caaagctcct cctggggaac agcgagccgg 5340 tcattaccca ggtgaggacc ggcaaatgga ctttacccat atgcctaagt ctcagggata 5400 tcaatatctg ttagtatgtg tagataccta cacaaactgg gttgaggctt tcccctgtag 5460 aacagaaaaa gcccaagagg taataaaagt cctgattaat gaaataatcc ctagattcgg 5520 actccctcgc tgtcttcaaa gcgataacgg ccctgccttc aaagctgcag tgacccaagg 5580 aatttccaag gcgttaggaa tacagtatca tctccactgt gcttggaggc ctcagtcctc 5640 agggaaagtg gaaaagatga atgaaattct taagagacac ctaaagaaac tggcacagga 5700 aacccacctc ccgtggcctt ctctattacc aatagcactc ttaagggctc gaaactctcc 5760 tcagaaaata ggactcagcc cctatgagat gctttatgga cggccctttc ttactaatga 5820 tctcgtatta gacaaggaaa cagcagatct aataaaagat ataacctcct tggctaagta 5880 ccaacaggct cttaggacct tacctgaagg acaccctagg gaaaaaggga aagagttgtt 5940 ctgccctgga gacttggtgc ttgttaagtc tctcccctca gactctccct ctctaggccc 6000 atcttgggaa ggaccatttt cggttattct gtctacccca acggcagtta aggtggtggg 6060 aattgactcg tggattcatc acactcggct gaagccttgg acacctcctg aagaaaatca 6120 taggccctcc actccgcagc ctgaagttcc aggagacgac ccttcgtaca cctgcgagcc 6180 gttagaggat ctacgtttgc tcttccgacg agatccccag gaaagtaact actcatgagt 6240 ctaacttaca tcctggcatt aacaggacta tcactcttaa ctttactatc tgcgaccgga 6300 ctctatgcgg tcgcccctac ggattggcct aatagctctc ctcctctcct tcctagtgcc 6360 ccccccatga cgatttacac cgactctcag gtgcagagtc ttctcttgcc aaggccacgc 6420 catactcgaa ccccaattat tcctttcata gttggggggt ggcggaataa taggaggact 6480 tggtactggc attggaggag ttactacctc tactcagttt tactataagc tatctcagga 6540 gttaaatgat aacatggaaa aggttgctaa ttctctctgc aaggcagctc actctgcaga 6600 agcagcttaa ctctttagct gccgcggctc ttcaaaatag aagggccctg gatctgctaa 6660 cagcagagag 
aggaggaacc tgtctctttc taggaaaaga atgctgctac tttgtaaacc 6720 aatcaggtat tgtcttaaac aaagttaaag aactcaggga cccaatcgaa cgcagggcta 6780 aagaccttca agaaacagga tcgtggagct tgtttaacaa ctggctgtcc tggctctacc 6840 ctctcctagg cccgtttata accattctat tattgctggc cttcgggcct ctcatcttta 6900 accttcttgt caaattcgtt tcttctagaa ttaaggccat aaggccacaa atggtcctgc 6960 aaatggaacc ccggatgtcc ttggctcatg agttctatcg tggacctctg gatcaacctg 7020 cacccctgga agactcccct ctggaggaaa cctcaactgc agggcccctt ctacgcccca 7080 attcagcagg aagtagctaa agagcgatcg acgcccgtgt cccaacagca gttggacttt 7140 cctgttcaga ggggggac 7158 // ID LTR15_Cja repbase; DNA; PRI; 587 BP. XX AC . XX DT 12-JAN-2011 (Rel. 16.02, Created) DT 12-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR15_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-587 RA Jurka J.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 11(2), 790-790 (2011). XX DR [1] (Consensus) XX CC >86% identical to consensus. 4bp tsd. Homologous to PRIMA4_LTR. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. CC We thank the Marmoset Genome Sequencing Consortium for making CC their data publicly available. 
XX SQ Sequence 587 BP; 172 A; 153 C; 103 G; 159 T; 0 other; tgtgaaagga aaataaatct cgggacccca aactcactaa gccaaaggga aaagtcaagc 60 tgggaactgg gtcacgcaaa cctgcctccc attttgttcc taaataagat ggctacaaag 120 atgaaaagct acatacctcc ctcacaattt gcccacgagg aaattccttg tgggccccaa 180 gatctttacc ctaaaacagt tctgttgaat ttcaccctgg caatgtaaat tgatagctta 240 tcttcacagg tgcgggacaa aggacagaac tcaaagtcat ccctctgctc acctgagaca 300 aatgcatatc tgattgcttc ctctgcccta ttgtctatgt tatcttatgt aaaaatgcag 360 attcactgag ccagacgaag gcataagtga ctattttcct ctacccccct ctcacatgaa 420 aattgtgtat ttcccaatat cccgcccttt cccctttaaa tattgaagcc ctcaaaatca 480 tcttcggaga aaggcataga cctgtctccc gggcgcgcgt ccttaacttt ggcaaataaa 540 cctcctaaaa tgattgagac ttgcctcggt cattttcttg atttaca 587 // ID LTR1D_OG repbase; DNA; PRI; 540 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 05-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1D_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-540 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1675-1675 (2008). XX DR [1] (Consensus) XX CC 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 540 BP; 120 A; 147 C; 137 G; 135 T; 1 other; tgatgcagga caaggtggca ccacgacctg ccaccttagc cctggaggtg ttgatgytgc 60 ggcggatgga ggaaccagct tgatatccgg accggacacc tgggggggag acactggcgc 120 cttgctgtct cctccacctg cacttctggc tggtcaattt catggatggc ccttgctgtc 180 tcctccatgt gcactcctga ctggtcaatt tcatgtcagt ttccaaggcc tgagagctca 240 ttggacagct ggctgtaact gacgaggacc cacgccctaa aagtcaacat tgagttccgg 300 caaaaaaaca acccccctgc cactatataa acccctgagc agagagcccg aagggacaga 360 cctcaaagag ggagcaggag agcttgttcc cctcctgggt tctctctgcc aggaggtttg 420 gttttttgta gttgttgtaa tttttgtttc tttgaataaa tcctgactta ccgctgatcc 480 tgtggtccat cgattcattc ttcgaatcgc cgagaccaag aacctgaaac tccggtatca 540 // ID LTR15_OG repbase; DNA; PRI; 945 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.11, Created) DT 02-JUN-2011 (Rel. 16.05, Last updated, Version 3) XX DE Long terminal repeat (consensus). XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR15_OG. XX NM LTR15_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-945 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2861-2861 (2009). XX DR [1] (Consensus) XX CC >90% identical to consensus. 5 or 6bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 945 BP; 248 A; 237 C; 229 G; 231 T; 0 other; tgcaggcaga atggggaccg aagaccccat tcttggcaag ggaaaacacc cccagaaaaa 60 gggaaaaatt aaaaagtaag ctgagtgagt cggaacaaag gaaagtcagc tagtttacct 120 ttaaaccagc cgctgctaaa tttcttatct gagcaagttt cctgcttttc ctagaactct 180 ggcctattgg ggatacagct tactaacacc cctgctgggt ctgtaaacac tctgaggaaa 240 taatggccac agaagatgct gtggcaaggc ctggaaagat acattaacta ggccgtttcc 300 tgcccagtaa acagtcaaag caaataccca ttgtggagat ttgtatttgg ctggggcaga 360 gatcaaggga acagctcgag ccctgaaata tgggtaggag ggcttatatc ccttatcgca 420 gaggaagacc ctctgtggtg gtgatgagag gaaaattcct ctctcccacc gattcttcag 480 aggagattga cgccagtggc gttttactcc acagtgaact agcggagttt aacacctccc 540 taggaatctg tgtgtgtgac ctcagacagg aggctgcccc cacacaaggc tcaggttccg 600 gatttctcaa gactgctcag acccttaaag ttaatgatta tagaaacaat aggagctgat 660 atgctcgctc agagaaactg ctggcgtctt tgttcccttc ctcagcttac cctttgggct 720 aatgagcaaa tagccttcaa taaaagctga gtggaactga cactcggggc caccgccgga 780 accccacaag cgggcggtgg tcccctgagt tcccaccttt gaaaattcta ttctgtgtgt 840 cttgtctctt tctttctcgt cgtttctaac tttatatttc ttcagtcggt cgccgccaga 900 tccacaggtc ttaacggacc ctgcccccgg ggcaggaccc cggca 945 // ID ERV1-4F_TSy-LTR repbase; DNA; PRI; 418 BP. XX AC . XX DT 06-APR-2010 (Rel. 15.09, Created) DT 06-APR-2010 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-4F_TSy-LTR. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-418 RA Jurka J.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1255-1255 (2010). XX DR [1] (Consensus) XX CC ~89% identical to consensus. 4bp tsd. 
CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 418 BP; 118 A; 108 C; 64 G; 127 T; 1 other; tgaaatatga aatatctaat ctgaaaaacc tataataacc tgtaatagct tgcttgacat 60 aagtaactcc attttgtaac agatcttcca tgttaaccta ggaataaccc ccttatgtaa 120 ccttcccgta atcatcccgc acacccacat tcctgtacct ttcggtggtt ggaaactccc 180 tagttgcgta aattctgtaa ccatccgtac cccttactaa tcaaatgtat tgccaagaaa 240 tatcaatcaa actgtacaac ttatgtaatt tttaccttta aatacccact actcactctg 300 ttcggggccg agagttnttt gggacactag tccgctctcg gtcgccggcg tcaataaagg 360 actcataatc tgaccttctc tgcgtgttgg cgtcatttct cgcaatccgg ctataaca 418 // ID ERV2-1_OG-I repbase; DNA; PRI; 4725 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Internal portion of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERV2-1_OG-LTR; KW ERV2-1_OG-I. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-4725 RA Jurka J.; RT "Endogenous retroviruses from bushbaby."; RL Repbase Reports 11(5), 1476-1476 (2011). XX DR [1] (Consensus) XX CC >98% identical to consensus. Low copy. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX FH Key Location/Qualifiers FT CDS 179..961 FT /product="ERV2-1_OG-I_1p" FT /translation="MGHKLSKEAIFIQDLKASFRERGIRVKKKDLIKFFIF FT IDQACPWFLISGPEIHPGKWQKVGRNLNDLLKTQGPNAVPATVFSYWGLIR FT DIVEGAHTDPNKQQLLSVAEYCLRPISRSASVTSVHSRCPLDSEPHSNLQF FT LSPSPSVAPSVTINIPPSEKPPQQATAPLLYPTLPTDLPSSDPLTCPVFQK FT KLSNIDPLEPRDGETLEEEAACYHNPDLPPLASAPVSGPPLMCLSFALIPF FT CPRLYLQPLFLMQKTNFPHK" FT CDS 1141..1995 FT /product="ERV2-1_OG-I_2p" FT /translation="MTRSAGTFDAPASTAAGPHNNSPSSPSPTPDSNSEGE FT EAEGGGESETDEQAPAPAVQTEFRKLRLKVLKELNDAVRAYGPNAPFTISI FT LETIPGGGHMIPSEWTRIVRSVLTRGQFLSWKADFLDRCQTLAAHNQKSPR FT SPSAAWTYEKLSGQGKYATETRQQRLPVGLLAQTASAALGAWRALPAKGSI FT ITPLTKIIQGAQEEYSEFVSRLLEAAERTLGPDESDNKLLKQLAYENANSA FT CKAVLRGLTRISCISPVSHLSMPSNVRDFRSPQRKSKSSLPICS" FT CDS 1869..3887 FT /product="ERV2-1_OG-I_3p" FT /translation="SCTTGPDENQLHLAGQSLIHALQRQGLQISPEKVQVL FT PPHLFLGFELLPTNILSQKIQLRTDSLRTLNDFQRLLGDINWLRPYLKLTT FT GDLKPLFNILLGDSDPSSPRILTPDAKLALIKVEHAIANQSIGYFSPTLPL FT YFLIFPTPFSPTGLLWQSNPLFWIHTSASPSKVLPTYPQLVAHVLRLGRES FT AVRLFGRDPNVIVLPYDTSQVQWLIQNSDDWAVNCTSFQGTIDNHYPADKL FT VQFLLRTPVIFPKRTHSTPIAGAALAFTDGSSSGIAAFSINGNLSRFKTPY FT SSAQLVELAAIIKVFELLPNSPFNLYTDSAYVASSVPLLETVPHIRPSTNA FT SPLFTKLQALILARSFPFFIGYVRAHSGLPGALAAGNDAVDQATKLIASAL FT SSTPLAAAQQAHALHHLNAHTLRLKFSITREQARQIVRQCKGCLTLLPEPH FT LGVNPRGLVPGEIWQMDVTHHSPFGKLKYIHVSIDTCSGFLCATLQTGEAT FT KHVINHVLACLAVTPQPKILKTDNGPGYTSSSFKQFCAQLTIKHITGIPYN FT PQGQGIVERAHQTLKNMLFKLQSKGGVLYPPTGNFKTLLNHALFVLNFLTY FT DSAGKSAADRLWHPSTADTYAQAMWKDPVSNKWQGPNPVLIWGKGHACIYD FT PNTQNARWLPDRLIKLYTQPRDNP" XX SQ Sequence 4725 BP; 1176 A; 1394 C; 900 G; 1255 T; 0 other; ggtggcgccc aaccgtgggg ctcgaggtac ggaccctcgc cggacggacc ccccttttga 60 agacaccggg agacgtcccc ggacaacctg aaagtgcagc cgccgtctcg ccgcccacga 120 gtcggatcgt gccgcgtgag ttgaaagttt cattcccttt tcattccgtc ccataagaat 180 gggccataaa ttgtctaaag aggcgatttt catacaggat ttgaaagcct cgttcagaga 240 aagaggaatt agggttaaga aaaaagactt aataaaattt ttcattttca tagaccaagc 300 ctgtccttgg ttcctcatta gtggtccaga aatacaccct 
ggaaaatggc aaaaagtagg 360 tagaaacctt aatgaccttt taaaaactca agggccaaac gccgtccccg ccacagtttt 420 ttcctactgg ggactcatcc gagatattgt tgagggagcc cacacagacc ccaataagca 480 gcaactccta tctgtggccg agtactgcct ccgccctatt tcccggtcag cttccgtgac 540 ctctgtccat tctcgctgcc ccctggattc cgagcctcac tccaatcttc agtttctgtc 600 tccttctcct tctgttgctc cttctgttac tattaacatt cccccctcag agaaaccccc 660 tcagcaggcc acagcccctc tactctaccc tactcttcct acagaccttc cctctagcga 720 cccccttaca tgcccggtct tccaaaagaa attgtcaaac attgatcccc ttgagccaag 780 agatggggag accctcgagg aagaagccgc ttgctatcat aaccctgatt tacctccgct 840 cgcttcagcc cctgtgagtg gcccccccct tatgtgcctc agttttgcgc tcataccctt 900 ttgcccccgt ctttacctgc agccactctt cttgatgcaa aagacaaact ttcctcacaa 960 gtaaaagatc taaaagaggt cttgcacctc cacaggcaat ttgcacagct ttctactgaa 1020 cttacttccc ttcaggcctc ttttagggac gccctgttca ctcctttgac tgtagcgccg 1080 accggcaccg cctcggcact tccccatcgc tccaaaccca aaactttaac cttccctatt 1140 atgacccgtt cggcggggac ttttgatgcc cccgcctcca ctgccgctgg tcctcataac 1200 aattccccat ctagcccgag tccaacgcct gattcaaata gtgagggtga ggaggctgag 1260 ggcggcgggg aaagcgaaac cgatgaacag gcgccggctc ccgccgtgca aaccgagttt 1320 cgcaaattac gccttaaggt tctaaaagaa ctcaatgacg ccgtcagggc ttatgggcct 1380 aatgcccctt ttactatctc tattcttgag actattccag gcggcggtca catgattcct 1440 agcgaatgga ccagaattgt tcggtctgtt cttactcgcg gacagtttct gtcctggaag 1500 gctgacttcc ttgatcgctg ccaaaccctt gcagcccata atcaaaaatc tcccagatcg 1560 ccttctgcgg cctggaccta tgaaaaactt agtggtcagg gaaaatacgc cacagagacc 1620 cgccaacagc gccttcctgt tggccttcta gcccaaactg ctagtgccgc ccttggagca 1680 tggcgagcgc tcccggccaa aggttccatt atcacacccc tcaccaagat catccagggg 1740 gcccaggagg aatatagtga atttgtgagc cgcttacttg aagctgcgga gcgcaccttg 1800 ggcccagatg aatcagacaa taagctttta aaacagttag cttatgagaa tgccaattcc 1860 gcctgtaaag ctgtactacg gggcctgacg agaatcagct gcatctcgcc ggtcagtcac 1920 ttatccatgc cctccaacgt cagggacttc agatctcccc agagaaagtc caagtcctcc 1980 ctccccatct gttcttaggc tttgagttac tccctaccaa tattctttcc cagaaaattc 
2040 agctacggac agattcccta cgaacactca atgattttca acgccttcta ggtgacatta 2100 attggcttcg accctatctc aaacttacta caggagattt aaagccttta ttcaatattc 2160 tccttggaga ttctgaccct tcctcccccc gtatattaac accagacgca aaacttgccc 2220 tcataaaggt ggagcatgcc attgctaacc aaagcattgg atacttttcc cctaccctcc 2280 ctctgtattt ccttattttc cctacaccct tttctcccac aggcctcctt tggcaatcta 2340 accctctgtt ttggatccac acttcagcct ccccttctaa agtcctgccc acttaccctc 2400 aattagttgc ccacgttctc cgccttggca gagaaagtgc tgttagactt tttggcagag 2460 accctaatgt tattgtcctt ccctatgata cttcccaggt ccaatggctt attcaaaaca 2520 gtgatgattg ggcagtcaac tgcacctctt ttcagggtac tatcgataac cattaccccg 2580 cagacaaatt agtacagttt ttacttcgga cccctgtaat cttcccaaaa cgaacacact 2640 caacaccaat tgctggggca gcgctagcct tcactgatgg ctcctcctcc ggcatcgctg 2700 ccttcagcat taatggaaat ctctcccggt tcaagacccc ctattcctca gcgcagcttg 2760 tcgagcttgc cgctattatc aaagtttttg agcttcttcc caactcgcct tttaatttat 2820 ataccgatag tgcctatgta gcttcctctg tccccctttt agaaactgtc cctcatatcc 2880 gtccttccac caatgcctct cccctgttta ctaaacttca ggccctaatt cttgctcgca 2940 gctttccatt cttcattggc tatgtccgcg cccattctgg cctgcctggg gcacttgctg 3000 caggaaatga tgctgttgac caggccacaa aattaattgc ctctgcctta tcttctactc 3060 cccttgcggc tgctcagcag gcccatgcct tacatcatct taatgcacac accctgcgtc 3120 tcaaattttc cattacccgt gaacaggccc gtcaaattgt ccgtcagtgc aaaggctgtt 3180 tgaccctttt acctgagcct cacctgggag ttaatccccg tggtctagtt ccaggtgaaa 3240 tatggcaaat ggatgttacc caccactctc ccttcggaaa attaaagtac atccacgtgt 3300 ccattgatac ctgcagtggc tttctctgtg ccactctaca aaccggtgaa gcaactaaac 3360 atgtcattaa ccatgttctt gcttgcctgg ctgttacacc gcagcctaaa attctcaaaa 3420 cagataatgg tccagggtat actagttcta gctttaaaca attctgtgct caactgacta 3480 ttaaacatat cactggtatc ccttataatc ctcagggtca aggtattgta gaaagagccc 3540 atcaaaccct taaaaacatg ctctttaaat tacaatccaa agggggagtt ctttaccctc 3600 cgacaggcaa ttttaaaacc cttttgaatc atgcattatt tgttttaaat tttctcacat 3660 atgattctgc aggcaaatct gccgcagatc gcctctggca tccttcaaca gccgatactt 3720 
atgcacaagc catgtggaaa gaccctgtat ccaataagtg gcagggccca aacccagtcc 3780 ttatctgggg caaaggtcat gcttgtattt atgacccaaa cacccaaaat gctaggtggc 3840 taccggatcg cttaatcaaa ctttacaccc aacccaggga caatccctga ggaaaaactt 3900 tcccattttt tctccctcca ggtcatgcat tggatctacc cgctcttcta cataataatc 3960 agacgtttag cacatgccgg catggaccca ggaccagtag accccctgca gctcctcatt 4020 gcctgtatga tttgtctcac cctccaagaa aaatgttgtt tttatgctaa taaatcaggc 4080 attgtaaggg acagaatcaa aaaacatcaa acagaattag aacaaaggcg cagagaactt 4140 tttgaaaacc ctttatggaa tatgtggaat ggtgtccttc cctatctcct ccctctcctt 4200 ggacccctaa ttagtttcct tttaatcctc tcttttggac cttggatttt cagaaaaatc 4260 acagatctta tcaaaagaca agtcaataca gctcttaaga aaacagtcga ggttcactac 4320 catcgcctcc caacccagga tgtaagtcct gaagaattta gtactcccct tgaaaccccc 4380 tctacagctc ttaattttac tcttcttgcc ccagaggcaa gaccttcttg gctttgcagg 4440 ctgtggagac gccaatgacg ggattattgt gatgccacaa gctagaggca tgccgttccc 4500 tggctgcaag gaaactccat gacgggatta ttgtaggtcc acaacctgag cccttcaaaa 4560 aagaaggata ccaggagtgg aagccaccta catagcctaa gacagagacc ttgcccattt 4620 tcttttaaaa ctacggcact agatgtgggt gcccatcaca tagcctaaga caggccctcc 4680 acccgtgttt actagccaag cttaattaaa gaatagaggg ggaga 4725 // ID LTR5C_Mim repbase; DNA; PRI; 410 BP. XX AC . XX DT 11-NOV-2009 (Rel. 14.11, Created) DT 11-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR5C_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-410 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2955-2955 (2009). XX DR [1] (Consensus) XX CC >90% identical to consensus. 5bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 410 BP; 98 A; 155 C; 80 G; 77 T; 0 other; tgtaaggctg gtggcgggcc cccctcccct ccctagatca aaaagaaaag gctcacgaga 60 ccagacccgc caaggacaac tgccaggccc cgctgaaaac aacaacacct ggatgtccac 120 ccagataaga aagttttcgg ttcctggatt ccgcaaatac cgccagcccc tcctatttct 180 caataaccgc cgaaggtcac agtggaaaat ccccagctct tcccgccaaa agtccccagc 240 ccgcctcaaa cagtataaag gccctcaccc cctccaaacg gacgcgactt ctcggccccc 300 cccttgggac ccgcgaacct cgcccgggag cgctcaataa agccacgttg tttggcccca 360 ctctctctct ctctctctct ctgtttcttt cgccgccgga aaactttaca 410 // ID MER34D repbase; DNA; PRI; 579 BP. XX AC . XX DT 05-MAR-2004 (Rel. 13.07, Created) DT 04-AUG-2008 (Rel. 13.07, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; MER34D. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-579 RA Smit A.F.; RT "MER34D - ERV1 Endogenous Retrovirus from primates."; RL Direct Submission to Repbase Update (04-AUG-2008). XX DR [1] (Consensus) XX CC mer4 group . Perhaps present in rodents as well. XX SQ Sequence 579 BP; 151 A; 141 C; 115 G; 168 T; 4 other; tgaaggagtk aaggatatgc caccccaaaa tatgccagat tggtatattg attatttcga 60 gctgaaagca ctggagaaac tgtagtttca gaaagggcta gctgacctgt ctcttcctgc 120 atgcagcaag ccataaagat tcctctggga ggggtgccct ccccgtacca gggcgagaaa 180 atagccctta tcaccagaga ctgggaattg gngctgcaat ggacctgaat aaacanactt 240 actgaagtaa cccttatctt ccactagttt tacacccctc cccatatatc tcctagtgac 300 tcccctagaa atttactgcc cctagccaga tcccctttgt cctgtcattt cttcncaaat 360 ttatcgttct ttgtctaaaa agtataaaag catcttgctt tggccacttc tttgggtctt 420 cactctcttg tgaagatccc catgtacatg taaaactaat aaaatttgta tgcttttctc 480 ttgttaatct gcctggtgtc aatttggttt ctagatccag ccgaagagcc cacataagag 540 ctaaaagggg ggttggaggt gatctctggc tcccctaca 579 // ID LTR77a_TS repbase; DNA; PRI; 629 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 
16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW Endogenous Retrovirus; Transposable Element; LTR77a_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-629 RA Bao W. and Jurka J.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 11(5), 1632-1632 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 629 BP; 171 A; 141 C; 149 G; 168 T; 0 other; tgagagagga gatagtcaga agctggttag gcagacagag agggagggtc tcgaggagac 60 ttaacacctg cgggaacgcc ccctgcacca cccccattat ataattgttg gctgaaaatg 120 tggttaatgt ggttaagaac atcctctcaa cccaggatgt actgctaaaa gggacttttt 180 gctgcttaaa cgcaggcgca gtagcttaac taaatgtcct aacttgacct tggcttatta 240 taatatcatt aacatagcat ttgcattgcg gttttacccc ccccccgagt gggcttttct 300 gtggtactta tgggtaatat tcaagatgga gtaactttgg tcaaggaccc gcatgcgcaa 360 taagaattgg gtctcaggaa gtaaggggcg gatgagatgt aaatgagaga aacgccccca 420 gaaaagataa gataaaaacc acaaacaaac tcctagcacc ggttacccat ttcgggcccc 480 ctctcttttt ctggagagct ttgttgctta ataaactcct tctttcactc atccttttgg 540 tgtccgcgtt ccttaatctt cttggtcgtg ggacaagaac ccgaggttgc tggtttaagc 600 tgacagtgga gttcaaaact cctgcaaca 629 // ID LTR2C_OG repbase; DNA; PRI; 564 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR2C_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. 
XX RN [1] RP 1-564 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1583-1583 (2011). XX DR [1] (Consensus) XX CC ~90% identical to consensus. 6 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 564 BP; 138 A; 138 C; 101 G; 187 T; 0 other; tgtagagagc gagctgtgag gtggcagggc ccccgcggct tgggagaacg tccaggggat 60 gaacccccct ctactgaccc tcctcaacag ataacaacat attttctcta agcttcaact 120 tttaacccag cagaagaaaa acatgttttc cttaaaactt tagcttttaa cttcacagct 180 gcacttcaaa gcagtcacgt acattttcca gctaatcttg tagatatgta gtattgccca 240 tattttagag gtcacacagt aataatagat tatgtttacc tgggaatttg tgcctgtttc 300 tcttctcata attaagttga aaatgtgtct tacttccttg aaaaacaact ttgtactgcc 360 tccttaaaaa taaagctgtt cgggcactcg ggtgctggtg atagagaaaa atatcccagt 420 cctcccgatc ccatctttgc ctctcaattc catctctgtc tctcgcgtct ttgtctttcg 480 tgttttattt catctattta tttcatcatt tcccaccgat cccactctgg ttcgtttact 540 tttcgcgctg gttcgcgaga gtta 564 // ID ALRa repbase; DNA; PRI; 172 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE SAT Satellite from primates. XX KW SAT; Satellite; Simple Repeat; ALR2; ALRa. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-172 RA Smit A.F.; RT "ALRa - SAT Satellite from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX SQ Sequence 172 BP; 53 A; 29 C; 39 G; 51 T; 0 other; ctatctgaga aactgctttg tgatgtgtgc attcatctca cagagttaaa cctttctttt 60 gattcagcag tttggaaaca ctgtttttgt agaatctgcg aagggacatt tgggagctca 120 ttgaggccta tggtgaaaaa gcgaatatcc ccagataaaa actagaaaga ag 172 // ID LTR4_TS repbase; DNA; PRI; 351 BP. XX AC . XX DT 07-DEC-2009 (Rel. 15.09, Created) DT 07-DEC-2009 (Rel. 
15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR4_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-351 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1260-1260 (2010). XX DR [1] (Consensus) XX CC >98% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 351 BP; 70 A; 124 C; 52 G; 105 T; 0 other; tgttgggagc cattttaaaa atagcagcca tcttatataa gccttcttct ttgtcctccg 60 aaaagcccgc gcgcgccaaa ccacaacacc cctcccacct cctctcctca taccaccact 120 actcttcctc ctatcataag ccgccacacg ccctatttca gcccgcccct ctttgctatg 180 tatataagca atcgcctaac aataaaattt tgaggcttga tcagaacact gtcttgcctc 240 cattctgcgt gtctcctgtc tctttctctc tcttggtctc tctctctctt ttcaccagca 300 gcaggtagtc ctcctcgtgc ccacgtttta cttgtcccgc tggtcgggac a 351 // ID LTR14C3_Mim repbase; DNA; PRI; 521 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14C3_Mim. XX NM LTR14C3_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-521 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2974-2974 (2009). XX DR [1] (Consensus) XX CC >94% identical to consensus. 4bp tsd. 
CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX SQ Sequence 521 BP; 153 A; 114 C; 143 G; 111 T; 0 other; tgtaagaaat ggaaattata agaaaagctc aaagctaaca gcttagcttc tgtcccctca 60 gcttgcttgc ttacaatata tagcttaagt gaagccatga caggcttcag cgggtgctgg 120 gcctcaaaag gcggtaaccg caggtcattt cctgataaga ggttgaaaag ccccgggatg 180 gggggagggg gggccaaaca ccggatgact ctgcagtaaa atgctgaaat aaaatgcaga 240 gggaaagccc taacgccaga accaatcagc tggtaagagg aaccaaccag aaaagagatg 300 aaataaactg ctgaaggaga ctgcgggagg ggggaggggg aaaactactt aagggatacc 360 cgtaacccaa gctggggtcc ttgtcagaat agaggccact gcgattggcg ctctgagact 420 cggaccctag ctcgagctag tcaataaaac tccttttgat gatttcagcc tcagtgactc 480 tgtctctttg ttctgtggta ctgcggtttc ccgctctaac a 521 // ID ERV2-1_TSy-LTR repbase; DNA; PRI; 857 BP. XX AC . XX DT 06-DEC-2009 (Rel. 15.09, Created) DT 06-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV2-1_TSy-LTR. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-857 RA Jurka J.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1203-1203 (2010). XX DR [1] (Consensus) XX CC ~97% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 857 BP; 216 A; 201 C; 187 G; 253 T; 0 other; tgttgggagc cggccacgga ggcccactga gcgttaggga tcaggaaggg ccgttgaccc 60 cgagaccact aggatgcgtt cgggagctat tgaccccatt tcctccccct ttctactcag 120 cactaaccag attaagctcc tgtgagctca aaacataaaa actctgagac cgggacgaac 180 acaagctaac tttagaagct cattgctccc attcttccgc aaactgcagt tttggctttc 240 acacaactta taattcaaac tcgtgtcaca tctaagagcc ttagtattgg tcacaggctt 300 ctgggtcatg atcttaggat ttattgaagg tgtaaattgt agaatatagt tttagaagac 360 agccattttg ttgaaagaat gcggctgggt tacttgcagg tagcccactg tgaactttgg 420 actgggccac attccaaaca ttctgggatg ggccactttc gtttgtaaag taaatactta 480 agcttctata gaattgttta acttggcata aatcaatcag cttaggatga aaaaggttta 540 tagaatatgt cagtcacgga ttaggttatc ctttgttatg ggattttaag attatcatat 600 gaaaagggtt taaaggcaat cattgattaa gttttcatgg attaaggttt ctggtgggtg 660 ggggacgccc ctcccacatc tgactttgtg tataaaagac ccgttgtaat tcccaataac 720 acacatcgct tctgcactag accggagtcc agtctctttc tttctctctc tctctctttc 780 tctcttcttc cccccaatcg ccgacgccgt ctccaccttt ggggtaccct gaacccggct 840 ggagctggtc tccggca 857 // ID LTR34_Mim repbase; DNA; PRI; 607 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; nonautonomous; KW LTR34_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-607 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1725-1725 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 607 BP; 164 A; 161 C; 99 G; 183 T; 0 other; tgtaaaagaa aataaaaatc tcaggaccac ccctccccca actccttatg caaaagggaa 60 ggtaaaggcc tcaggcacct gtgaaggact ggccccacag atcattcatt aaggaaattc 120 cttgctggcc tcccataagc aaggacatgc aaattgtaac tttaggtctg caggctacgt 180 ctagctccta aaactaaagt ctgttcgatt ccacactaat aatgaattac aagtttatct 240 tcacaggtgc agaacagaga caagacagga tcagacattc ttccacctac ccagggacat 300 ctacataatt gatgcttcct tcactccctt tttctcttct aacattcgcc ttatcttatg 360 taaaatgtag attttcctgg gtctcacaag aatgtaacca ttttgtctca ctaccacccc 420 ccgccttttt ttttcctttc tgtatgccct actccccttt aaatactgaa acttccaaat 480 tccccttcgg aaaaacagcc acaggtttgt ctgtggtttg tgtttttccc gggcgcgtcc 540 tcaactttgg cttaataaac ctccattgat tgagattttt gcctcagtca cttattttgg 600 gtcgaca 607 // ID LTR16_OG repbase; DNA; PRI; 487 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.11, Created) DT 18-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR16_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-487 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2862-2862 (2009). XX DR [1] (Consensus) XX CC >91% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 487 BP; 139 A; 128 C; 72 G; 148 T; 0 other; tgaaggagct aagaaaattt caccccaaaa tatgattcct tggtataaag attattttga 60 attaaaggct attcataatc aggaaactct gaacgaagac tttcctctat ctacctaaaa 120 tcttaaaggg ctgctagact ccttatcaca tgctaataga cctggcccta aggtcatttg 180 aagcacaata cctgtctctc aggttaattt actaatcaat ctgtttccct ccatctaccc 240 atccccccct tagcaaccac ttgctgcacg tatactttcc agtttctctc ccccctccct 300 tctaaaaggc atctttaaaa gccactgact atcactgaaa ctttgggaaa tcattcttgt 360 ggttcccccc atgcgcatgg tatttggaaa taaacctttc tcttattaat ctaccataac 420 tgtgagttga tcttttcagc gaaccaacct aggggaaaag ggcaagtttc cctgagactc 480 cccttca 487 // ID LTR11_Cja repbase; DNA; PRI; 440 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR11_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-440 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2923-2923 (2009). XX DR [1] (Consensus) XX CC >90% identical to consensus. 4bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. 
XX SQ Sequence 440 BP; 105 A; 112 C; 111 G; 112 T; 0 other; tgtagagagc cagagtagcc caccccctct gggggatcag cagcaaggct gctgtgaact 60 ttgggtgaac tccagagaag cgaaaccata ataggggaat gtgtatgtct cggagaagag 120 cagtcggacc tagcgaaaga taaccagctt tcactgacct tgctgcaaga tagttgttag 180 ctgaggtatg ttaactgaag tatgagaatt ataaccacag gctgttctgg gtaactgcac 240 cctccctctg ctatgtgctt tgtgaacata taaaataaag tactctgagg caggacgggg 300 ccggagaaac tgacactttc actttcccgg accgcctggc cccgcttttc tctatgtctg 360 tctttgtgtt tttcttcatc ccccaccgtt ccctcttagg tctttgaacc cttgtgcagc 420 gcgggacgca gcacacgcca 440 // ID BSRf repbase; DNA; PRI; 379 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from primates. XX KW Satellite; Simple Repeat; BSRf. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-379 RA Smit A.F.; RT "BSRf - Satellite from primates."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX SQ Sequence 379 BP; 82 A; 98 C; 96 G; 98 T; 5 other; tgtgagattc agaacctcam cagtgggctn tgtccatgtg tgagggtgac aatcctaact 60 gtcggctggg tgtgcatacg agagtcacaa tctcacctgt gtgctgggcc ctgttatgac 120 actctctgta ccacccgagg gctttataca atatgcgtga gtgtcataat cctctgtgac 180 ctttntacaa gtaggagacc caggacctta cccgttgccc taagcctagc tatgagagtc 240 aamatctctc ctattggctg ggtccaggta tgagagtcat catcgtgcct gtgagctggg 300 tccagatatg wgtcaccatc ccacctgtgg gcagatccac gtatgacagt cacaattcca 360 actgtggact gcgtccgcg 379 // ID MER41B_Mim repbase; DNA; PRI; 731 BP. XX AC . XX DT 09-NOV-2009 (Rel. 14.11, Created) DT 09-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW MER41B_Mim. 
XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-731 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2987-2987 (2009). XX DR [1] (Consensus) XX CC >89% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 731 BP; 222 A; 170 C; 173 G; 166 T; 0 other; tgttaggttg ggacaaggaa ggggaacaga tgctttgtct agggggtttc cattgagaaa 60 cagatggcac taggagatgc tttgtctaca ataggaggaa caaattgacc agaagggggg 120 cttccaagaa ggaacagatg acatgtcaag ggggcttcca aggactgaac agatgtttgg 180 tctacgatag gggtctccaa ggactgacca accagaaaga ggtagaaagg ccacaaagtg 240 gagatatgat caaggccaca aaagagggtc tttggagcat gtgtacaagt aaggtgacag 300 tatcctcaag tgacctaccc tcattaacat agtaaaaatc acacccacca gcgccatgac 360 agtttacaaa taccatggca acccccggaa attgccttac aaggttaaaa atggggaggg 420 ttcccagttc caggaattct ccaccccttt ccaagaaaaa ccatgaatat tccacccccc 480 acttaacata aaattaggga atcagcataa aagggaaaat cttgttaaac tcagggtctc 540 ctccactcgc gggggttgac ccattccttc tctggaatgt actatgcttt aataaacttt 600 ggcactttgc ttgcttgcat ctgcttggtc tgctcatttg ctccatcgac tcggtaccag 660 taaccattct tcggtactgg gaaccaagaa ccagggaaca cggacctgac atctggtccc 720 cacacctgac a 731 // ID CYN-III1 repbase; DNA; PRI; 236 BP. XX AC . XX DT 02-MAY-2006 (Rel. 11.04, Created) DT 02-MAY-2006 (Rel. 11.04, Last updated, Version 1) XX DE SINE element from Cynocephalus - consensus sequence. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; CYN-I; CYN-III1. XX OS Cynocephalus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Dermoptera; Cynocephalidae. XX RN [1] RP 1-236 RA Schmitz J. 
and Zischler H.; RT "A novel family of tRNA-derived SINEs in the colugo and two new RT retrotransposable markers separating dermopterans from RT primates."; RL Mol Phylogenet Evol 28(2), 341-349 (2003). XX DR [1] (Consensus) XX SQ Sequence 236 BP; 54 A; 65 C; 83 G; 34 T; 0 other; ggccggcccg gtggcgcact gcactagtgc accgcttggg aagcgcggcg gcgctcccgc 60 cgagggttcg gatcccagat acagaccggt tcccgctcac tggctgagcg aggtgcgggc 120 gtgacgccga gggttgcgat cccgttgccg gtcctggtcc ggtgcggggg caacactgag 180 ggttgcgatc tgttgccgga cacggaaaaa gacaaaaaaa aaaaaaaaaa aaagaa 236 // ID LTR24_TS repbase; DNA; PRI; 481 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR24_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-481 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1276-1276 (2010). XX DR [1] (Consensus) XX CC >93% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 481 BP; 117 A; 103 C; 87 G; 174 T; 0 other; tgaagtataa aggttttgct gttttttaaa tgtttaaagt gtttgcagtg tattggtaaa 60 acctaagttt gtatttaaat gtaaaactcc attttgtggc ttgtctccat gttgttaact 120 tgttattaaa cttctgtgtg gctgatgcaa gaattttaaa ttttgctcat aaacaaccct 180 cacagccctg caccccaccc agtgctctaa cggtttcccc cttatctgag tatactgtgt 240 gcccccttat ctaagtgtac tatgtgtcaa accccagaac atgtgtttca tgtgtttatc 300 tctgtaatga ctcagtttcc caactgcttt tgcctataaa gactccctgc aaatttcatt 360 cgtcgtcgag agtctgggag gcatgagccc cctcttgacc accggcttta ataaagggct 420 tacttattaa tttgaccttt aattgggctg gttatctgat ttctagcgat tcgctataac 480 a 481 // ID L1P4a_5end repbase; DNA; PRI; 2596 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE L1 Non-LTR Retrotransposon from primates. XX KW L1; Non-LTR Retrotransposon; Transposable Element; LINE; KW L1P4a_5end. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-2596 RA Smit A.F.; RT "L1P4a_5end - L1 Non-LTR Retrotransposon from primates."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC L1PA16. 
XX SQ Sequence 2596 BP; 787 A; 711 C; 669 G; 419 T; 10 other; gagagggaca agatggccga ctagacgcag ccaggaagcg ctgctcccac cgagagagac 60 caaantatcg agtaaaccaa cataatttga gcagatcttc ggagagaaaa cgccgagagt 120 ggatgagaga ggcgacgcng acgccgaggc tgaagaggga ggaagctggg aaccctgcgc 180 ggggtacccg aatgctaggg ctagttcccg gccctgaatg gctcctgggg aaggggtgag 240 tgaagggact gagggacagc ccactctcgc cgtggacctc tgggatccta gctacagggg 300 accccacgcc ccccatggac gtntgagctg gcagggggat ctccccgggg agtaggcaga 360 gacagggctt cagncggcat ggagcccggg agcttttgtg cacggggcag ctccggcgga 420 gcgcggccat aggcgcccat cccccagggc tccccatctc cctccgagag gctctggccc 480 cagctgaccg ccgggccagg agagagcggg gccagcttcc ccgcgggact ggggcacgtc 540 tgttctgcag gccctcctgc ccgccagccc ctcccagggc ccctgcctgg ccaccccgca 600 ggagcatgtg cacagcgcag cctccgctgc ccagcctggg tgctttgctc cacctgagta 660 cnttcccggc ggcctgggag cacttcggat cccccagcgc agccggaacc caaccccgag 720 ggtccagagg aggagccgcg gcaggtcccg gtgccccagg gctgcggcnc gcagctcggg 780 agtgccgagc cgagatctgt ggccggcact cgagcngggg aggagccccc actctcagag 840 cactgagagg ggtgagatgc gcgggttcnt gggccggngc gggagcgggg cgtgcctccc 900 tccacagggc cggtccagaa agggtgtggc ctatctccct gccgcagcct ctgcccgagg 960 gagccccgcg gcccggaaca cctaacaaaa gaaacgcggg cgcggtgcca gtgatcggag 1020 ggggctcccc caaggcccag gagcggacct ggtgaggggg tcatctctct ccccgccgca 1080 ccgcagagca cggctgcgaa cgcgaggaag tacaaaagag ccgcgcggct gagtaagagc 1140 ctatctaccg gccattactc ttaagcgcca tctactggat cgcagcccaa actacaacac 1200 caaaaatatt ctgctaatat acacccctgt gaaaccaagg gcaagaattc agccacaaat 1260 aaagatcctg tacagagcct tggccctctg aaagcatcca gaaatgaagc caactgacta 1320 tactcaactt acaccacagt taaaggaaca ccagccctcc cagatgagaa agaatcagcg 1380 caagaactct ggcaattcaa aaagccagag tgtcccctta cctccaaacg agcccactag 1440 ctccccagca atggttctta accagactga aatgactgaa atgacagaca tagaattcag 1500 aatctggatg gcaaggaagc tcatcgagat tcaggagaaa gttgaaaccc aatccaagga 1560 atccaagnaa tccagtaaaa tgatccaaga gctgaaagac gaaatagcca ttttaagaaa 1620 gaaccaaact gaacttctgg agctgaaaaa 
ttcactacaa gaatttcata atacaatcgg 1680 aagtattaac agcagaatag accaagctga ggaaagaatc tcagagctcg aagaccggtt 1740 cttcgaatca actcagtcag acaaaaataa agaaaaaaga attttaaaaa atgaacaaaa 1800 cctccgagaa atatgggatt atgtaaagag accaaatcta tgactcattg gcattcctga 1860 gagagaagga gagagaataa gcaacttgga aaatatattt gaggatatag tccatgaaaa 1920 tttccctaat ctcgctagag aggttgacat gcaaattcaa gaaatacaga gaaccccggc 1980 tagatactat acaagatgac catccccaag gcacatagtc atcagattca ccaaggtcaa 2040 cgcgaaagaa aaaatcttaa aggcagctag agagaagggt caggtcacat acagagggaa 2100 ccccatcagg ctagcagcag acctctcagc agaaacctta caagccagaa gagattgggg 2160 gcctattttc agcatcctta aagaaaagaa attccaacca agaatttcat atcctgccaa 2220 actaagcttc ataagtgaag gagaaataaa atccttctca gacaagcaaa tgctgaggga 2280 attcatttca actagaccag ccttacaaga ggtccttaag ggagtgctaa acatggaatc 2340 gaaagaatga cacctgctac cacaaaaaca cacttaagca catagcccac aggcactata 2400 aagcaactac acaatcaagt ctacataaca accagctaac aacacgatga caggatcaaa 2460 atctcacata tcaatactaa ccctgaatgt aaatgggcta aacgccccac ttaaaagaca 2520 cagagtggca agctggataa aaagacaaga cccaaccatc tgctgtcttc aagagaccca 2580 tctcacatgt aacgac 2596 // ID LTR8_OG repbase; DNA; PRI; 398 BP. XX AC . XX DT 17-OCT-2009 (Rel. 14.11, Created) DT 17-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR8_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-398 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2853-2853 (2009). XX DR [1] (Consensus) XX CC >98% identical to consensus. 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 398 BP; 97 A; 135 C; 80 G; 86 T; 0 other; tgttaggcca cagggcagta aggaggccct aaaacaatga aaacattcta cccgcatatc 60 agactcttgc acctggaggc aacaccctac cccctcccta ctccctcata cccgcgctaa 120 aaataaccgt cgcaaaaagt ccccagagca aaactgccca gcaaccctgc aagttatgaa 180 ttccccaatc accaaccgcc tcatacccta acgccccctc accccccctg catgtagttt 240 ttcgctttaa aagcagcctg taacagcctc tcggggttcc ttcccccttg tggctgagga 300 accctggcgc gccagcaata aacctgcctc ttgctttttg catcgatttg tgattgtgag 360 tggtctttgg gtaggcgacc cccggggtgg gcacaaca 398 // ID SUBTEL_sat repbase; DNA; PRI; 174 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 14-NOV-2005 (Rel. 10.11, Last updated, Version 1) XX DE Satellite from primates. XX KW Satellite; Simple Repeat; SUBTEL_sat; TAR1. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-174 RA Smit A.F.; RT "SUBTEL_sat - Satellite from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX SQ Sequence 174 BP; 0 A; 84 C; 54 G; 24 T; 12 other; gcgcctctct gcgcctgcgc cggcgcsscg cgcctctctg cgcctgcgcc ggcgcsscgc 60 gcctctctgc gcctgcgccg gcgcsscgcg cctctctgcg cctgcgccgg cgcsscgcgc 120 ctctctgcgc ctgcgccggc gcsscgcgcc tctctgcgcc tgcgccggcg cssc 174 // ID ERV1-3_TSy-I repbase; DNA; PRI; 4005 BP. XX AC . XX DT 29-JAN-2010 (Rel. 15.09, Created) DT 29-JAN-2010 (Rel. 15.09, Last updated, Version 2) XX DE Internal portion of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-3_TSy-LTR; ERV1-3_TSy-I. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-4005 RA Jurka J. and Walichiewicz K.; RT "Endogenous retroviruses from tarsier."; RL Repbase Reports 10(9), 1198-1198 (2010). 
XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX FH Key Location/Qualifiers FT CDS 1..429 FT /product="ERV1-3_TSy-I_1p" FT /translation="FGGSXGNERLTWSFKPTGLVGLLACRLVTFVSTGAAG FT CRRAWNGLSGGRLPGGSVFPSLADSQRPWSPQRGKSGSCFCLCALLFGPPG FT KLDAALIPESRVPVRRHRGPLFWSALGQSGLFVQSLSLSCCAXCLXLSCLS FT LLCC" FT CDS 437..1828 FT /product="ERV1-3_TSy-I_2p" FT /translation="MGQQXSQPDTPLQLVLRHFKDFQNAAQQTGDQVIKGK FT LTTLCSLEWPXLPNCNWPSEGTFNPTVVERVQRFIYRRHPDQIPYITAWYL FT LVLRQPSWMRKAESNPPHVLALQETPKAKPPVLQXPPEDDETPPPYPAAGQ FT GQGAREILSPAHTRSGSSYQPLYPTLPLPPVEGPTAPHQPTSPDSTSGTSG FT ANPGNPPSAASPGVDVGGTGAQPFMVYVPFSTSDLYNWKNQNPPFSERPQS FT LISLLETIFYTHQPTWDDCQQLLQVLFTAEERERIRQEARKLILGPDGRPA FT EHARLERIFPSDRPNWDPNTDHGRECLTTYRRTLMGGLRAAARRPTNLSKV FT SEVTQGPQESPTAFLERLMEAYRTYTPIDPEAPENRRALVIAFVSQSAPDI FT RKKLQKIEGFEGKSLSELTEVAQKVYNSRDSPEDRQAKKLAKVVVAALEGT FT GRHSGQGNAEERVQEEKPP" FT CDS 2751..3449 FT /product="ERV1-3_TSy-I_3p" FT /translation="ETGLHCPGDLKDAFFSIPLAKQSQPLFAFEWTDPEKG FT ENGQLTWTRLPQGFKNSPTLFDEALRQDLRAFRQQHPEVTLLQYVDDLLIA FT ATSKGECLRATRHLLKELQTLGYRVSAKKAQLCVTVATYLGYNLQGGKRSL FT YREACMASTGINVSQFSTPGSLYMAVAPPTAWFXCTRGITPCVAIELLQDS FT SELCILVHILPQXYIVEGPAGWEQLELLTPTWTKRXPVLVPSL" XX SQ Sequence 4005 BP; 989 A; 1168 C; 1009 G; 814 T; 25 other; tttgggggct cgsccgggaa cgagcgctta acttggtcct ttaaaccgac tggccttgtg 60 gggctgctag cctgtagact cgtgacgttc gttagcactg gagcagctgg ctgtcgacgg 120 gcctggaacg gactctcagg cggccgacta ccggggggct cagtgttccc atccctggca 180 gactcacaac gcccctggag ccctcagaga ggtaagtcag gttcctgttt ctgtttgtgc 240 gcacttctgt ttggacctcc aggaaagctg gacgcagcac taattcctga gtccagggtc 300 ccggtgagac gtcaccgggg tcccctgttt tggtctgccc tcgggcagtc aggtctcttt 360 gttcagtcct tgtccttgtc ttgttgtgct tkgtgtttgt mactgtcgtg tctgtcattg 420 ctttgttgtt gacgtcatgg ggcagcaakg ctctcagcct gacacccctc tgcaactggt 480 cttgcgccat tttaaggatt tccagaatgc tgcgcagcaa 
acgggwgacc aggtaattaa 540 gggtaaacta accactctct gttctctcga atggccmamt cttcctaact gcaactggcc 600 atcggaggga acttttaacc ccacagtagt ggagagagtc caacggttca tctaccgacg 660 ccacccggac cagatcccat acatcactgc ctggtacctc ctggtcctgc ggcagccatc 720 atggatgagg aaggctgaat ctaaccctcc tcatgtttta gccctacagg aaactcctaa 780 agctaagcct ccagtcctcc agcstccgcc ggaggatgat gagactcccc caccataccc 840 tgcggcggga cagggacagg gggcacgaga gattctcagt ccagcacata ctaggagcgg 900 ttcctcttac cagcccctct accccactct ccctctgcct cccgtggagg gcccaacagc 960 cccccaccag ccaacaagcc cggactcgac ctccgggacc tcgggggcaa accccggcaa 1020 tcctccctct gcggcaagcc ccggggtgga tgtgggtggg acgggagccc agcccttcat 1080 ggtctatgtc cctttttcta ccagtgacct ttacaattgg aagaaccaaa acccgccatt 1140 ctctgaaaga ccccaaagcc tcatttctct cttagaaact attttctaca cgcaccagcc 1200 cacatgggat gactgccagc agcttctcca ggtcctcttc accgcggaag agagggagcg 1260 aattcggcag gaagcaagaa agctaatcct gggacctgac ggccgaccgg ctgagcatgc 1320 acgtctggag cgtatcttcc cctctgacag gccgaactgg gaccccaaca ctgatcacgg 1380 tagggagtgt ctcaccacct accgccggac tctaatgggg ggtctccggg ctgccgcaag 1440 gcgccctaca aatctgtcta aggtaagcga ggttactcag ggcccacagg aatctcccac 1500 cgcttttctg gagcgcctca tggaggccta tcggacatat actccgatag acccagaggc 1560 cccagaaaat cggagggcat tggtcatagc ctttgtctcc cagtcggccc ctgatataag 1620 aaagaagctg cagaaaattg aagggtttga aggaaaaagt ttgtcagaat taaccgaggt 1680 agcccaaaag gtttataata gcagagactc tcctgaggat agacaggcaa agaaactggc 1740 caaagtagtg gttgcagctt tggaaggaac tggccgacac agcggacagg ggaacgcgga 1800 ggaaagggtg caggaagaaa aaccccctta ggcaaggacc agtgcgccta ctgcaagaaa 1860 aggggacact ggaaaaagga ctgcccagac aggccaagaa aatgcccccc tacctcggag 1920 ccacagcaac caactccagt gctccagtta ggatctgact gacggggccg gggctctgtt 1980 aatacccccc ggagcccagg gtcgaactta cggtaggggg taaccctgtc cacttcttgg 2040 tggacacggg ggcagagcat tcagttctcc aaaaatataa tggtcctctc caaaataagc 2100 atactctggt cagggggcaa ctgggactca actatacccc tggaccaccg aaagaactgt 2160 gaatctagga aaaggagagg taactcactc attcctggta atgcctgact 
gcccataccc 2220 gctcttgggg cgagaccttc tgcataaact caaagccaat atagggtttc aggaaaccac 2280 ggcaacagta gatttaaaag aaccaaccaa gatcctcgta acagtgcctc tccgagacga 2340 gtatctgttg gcagaaggta agccgccccc gtcacctaag ccagactcag actcctagcc 2400 cagctgcaga ctgagttccc cggtgtctgg gccgaatcca accccccggg tgcaagcacc 2460 cccccccagt cgttgtgcag ctmaaaagtc aggcaacccc catcagggta cgcagtaccc 2520 tatatcccag gaggcacgaa aaggcatagc atccatatcc agcgcctccg ggaggccgga 2580 atcctggtgc cctgccactc accctggaac accccgctgc tcccagtmag aaaggctggc 2640 acgggggact accgtccagt gcaggacctc agggaagtca acaagcgagt ggaggacatc 2700 cacccaacgg tcccaaaccc gtacaccctc ttaagtgggc ttccccctga gagacaggtc 2760 tacactgtcc tggagacctc aaggatgcct tcttcagcat cccgctggcc aagcagagcc 2820 agcccctgtt tgcctttgaa tggacagacc cggagaaagg ggaaaatggc cagctaacct 2880 ggacacggct cccacaagga tttaagaact ctcccacgct gttcgatgaa gccctccgcc 2940 aagacctccg ggccttccgg cagcagcacc ctgaggtaac tctgctgcag tatgtagatg 3000 acctgctgat tgcagccacg tcaaaaggtg agtgcctgcg tgccaccagg cacctgctsa 3060 aggaactcca gaccctgggg taccgagtgt cagcaaagaa ggcccagctc tgcgtaactg 3120 tagccaccta ccttgggtac aacctccagg gaggtaagag gtctctatac cgagaagcct 3180 gcatggcctc tactggaata aatgtctccc agtttagcac tcctgggagc ctctatatgg 3240 ctgttgcacc ccccactgcc tggttckcct gcacacgagg cataactccg tgcgttgcca 3300 tagagctcct ccaagacagt tcagaacttt gcattctagt acacatmtta ccccagstat 3360 atattgttga agggcctgcg gggtgggagc aacttgaact cctaacccct acctggacta 3420 aamgaktccc cgtcttggtg ccctccttgt agmgttagga actgcaggat caacggcttt 3480 aggtgcagca gccctaactg taggaagcca aaatctkaaa gaactaagca tacatgttwa 3540 cacggatctc caaaatcttg aaactagcat atcccagcta gaacaacagg tagactcgtt 3600 ggctgaagta gtgctccaaa atagaagggg gctggacctc ctcttcatga aagsaggagg 3660 actatgcgcg gcactwggga gacctgttgc ttttatgcca acaaatcagg aataatccga 3720 gaaactcttg ccctagtcag agaaagtaaa aaatacaggc ctcagaaaac tggcatcaat 3780 ccttattttc ctggtctccc tggctcacaa ccttggtctc agccwtcgcg ggacctcwgc 3840 tcttagttct aatgtgtgac cgtagggccg tgcgtaatta aaagccttct aaattacata 
3900 aagcasaggt taactcccac aaacttaatg atactaagaa cccaagccag gmccctatac 3960 gaagagtcaa cgatttaact ctaggaacaa ctcaaagggg ggaaa 4005 // ID PMER1 repbase; DNA; PRI; 90 BP. XX AC . XX DT 25-APR-1997 (Rel. 2.03, Created) DT 21-OCT-2008 (Rel. 3, Last updated, Version 4) XX DE Putative non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; URR1; PMER1; SPIN_NA_2_Og. XX NM PMER1. XX OS Strepsirrhini OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates. XX RN [1] RP 1-90 RA Smit A.F.; RT "PMER1."; RL Direct Submission to Repbase Update (30-NOV-1996). XX DR [1] (Consensus) XX CC A prosimian-specific MER1-type DNA transposon fossil similar to CC bp 1-54 CC and bp 202-236 of PR/URR1 in rodents. XX SQ Sequence 90 BP; 23 A; 22 C; 26 G; 19 T; 0 other; tatagcagcg gttctcaacc tgtgggtcgc gacccacagg aactgtatta aagggccgcg 60 gcattaggaa ggttgagaac cactgctcta 90 // ID LTR1B_OG repbase; DNA; PRI; 757 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 05-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Long terminal repeat (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1B_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-757 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 8(10), 1668-1668 (2008). XX DR [1] (Consensus) XX CC 4bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 757 BP; 178 A; 206 C; 188 G; 185 T; 0 other; tgatacagga caggtggcgc cccagccagc tcagccgagc tgggctggag gacccgccgc 60 ctccgccttg gcggtgttgg tgcagcggca gccggcagag gaaccagcgg gctggagcgt 120 aggcaagggg tggggggagc ctccctcccc caccggaggc tccagtagcc aaagggaccg 180 gacaactgct ggggagacac cggcaccttg gtgtctcctt cccctttaca cttatgacta 240 aatacggaac ccccatttcc tatgcctagg tcaatttcat gtaagtttcc taggcctcag 300 ttccgtgaat ggactgtgtc cattcattag tacattcatt caccttgccc cagcggaagg 360 aactatgtcc attcattagt acattcattc accttgcccc agcggaaggg actatgtgtc 420 cattcattag tacattcatt caccttgccc cagtggaaga gactgtgtcc attcatttcc 480 tggcctcata tagaaggaac ccatgcacct atatataccg ctaaacggag agagacggag 540 gagagaaaaa cggagagagc ttttctctcc cgacttgggt tttctctccc taccagaaac 600 tttagtgttt ttgtattctt tttctgtaaa taaattctgt cttgccactg atctgtggtc 660 cgcggattta ttcttcaaat caccgagacc aagaacctga aagtccggta tcagatttgg 720 cgagccagcc aggagacaag acgaaattcc ggtatca 757 // ID HSATI repbase; DNA; PRI; 577 BP. XX AC X00470; XX DT 01-OCT-1995 (Rel. 1.09, Created) DT 04-AUG-2008 (Rel. 13.07, Last updated, Version 3) XX DE Human satellite I DNA HinfI fragment. XX KW SAT; Satellite; Simple Repeat; HSATI; KW Satellite repetitive element. XX NM HSATI. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 70-577 RA Frommer M., Prosser J. and Vincent C.P.; RT "Human satellite I sequences include a male specific 2.47 kb RT tandemly repeated unit containing one Alu family member per RT repeat."; RL Nucleic Acids Res 12(6), 2887-2900 (1984). XX RN [2] RP 1-577 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR GenBank; X00470; Positions 1 631. XX CC Alu repeat removed from satellite I. CC [2] (Chr 21, 22). 
XX SQ Sequence 577 BP; 142 A; 109 C; 112 G; 214 T; 0 other; ttgggggtgc cctatttccc atctcataac ttattttaag aagcacagca taataatgtg 60 tgggcttggg attcagtttt tgaaacaaaa cactgagcct tcgatgacct tcctgtacat 120 gtaaaagcac acctgtctgc atggcagcag ttggacctca cagtgtggat tgtgccttca 180 ccctggaatg tttatgccct atcgccatgg tgatgggatt agggatctcc tgcccttggt 240 cctaagtgcc actgtctgtg ctgagttttt caaaggtcag agcagattga acctttgtgg 300 tttcattttc cctgattttg atttttctta tggggaacct gtgttgctgc attcaaggta 360 tgttcatact ggcctgtcaa atgcgatctt ttcaaattac tagttaatgc tttcaaaata 420 tgttatttaa aaaattatcc tctgtatttt ccatatgcag ttataaatat gtttcatggt 480 tatgttttat tcctcaattt atatatttga ttattgtacc aagcagagta cctttgaaat 540 ttttcttcat ttaaaaaata tgtatcttgg ctcaggc 577 // ID MLT1_Mim repbase; DNA; PRI; 413 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; nonautonomous; KW MLT1_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-413 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1730-1730 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 413 BP; 124 A; 93 C; 104 G; 92 T; 0 other; tgttatgggc tgaattgtgt ccccccaaaa ttcatatgtt gaagccctaa cccccagtac 60 ctcagaatgt gactgtattt ggagataggg cctttaaaga ggtgattaag ttaaaatgag 120 gccgttaggg tgggccctaa tccaatctga ctggtgtcct tataagaaga ggaaatttgg 180 acacacagag agacaccagg gatgcgcgcg cacagaggga agaccatgtg aagacacagc 240 aagaaggcgg ccatctgcaa gccaaggaga gaggcctcag aagaaaccaa ccctgccgac 300 accttgatct tggacttcca gcctccagaa ctgtgagaaa ataaatttct gttgtttaag 360 ccacccagtc tgtggtattt tgttatggca gccctagcaa actaatacga gca 413 // ID Alu2_TS repbase; DNA; PRI; 298 BP. XX AC . XX DT 09-APR-2010 (Rel. 15.04, Created) DT 09-APR-2010 (Rel. 15.07, Last updated, Version 3) XX DE Alu-like SINE element - consensus. XX KW SINE1/7SL; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Alu2_TS. XX NM Alu2_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-298 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(4), 633-633 (2010). XX DR [1] (Consensus) XX CC >85% identical to consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 298 BP; 73 A; 80 C; 100 G; 45 T; 0 other; ggccgggcgc ggtggctcac gcctgtaatc ccagcacttt gggaggccga ggcgggagga 60 ttgcttgagc ccaggagttc gagaccagcc tgggcaacat agcgagacct cgtctctaca 120 aaaaattaaa aaattagccg ggcgtggtgg cgcgcgcctg tagtcccagc tactcgggag 180 gctgaggcgg gaggatcgcc tgagcccagg aggtcgaggc tgcggtgagc cgtgatcgtg 240 ccactgcact ccagcctggg cgacagagtg agaccccgac tcaaaaaaaa aaaaaaaa 298 // ID LTR21_Cja repbase; DNA; PRI; 369 BP. XX AC . XX DT 14-OCT-2009 (Rel. 14.11, Created) DT 14-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat: consensus. 
XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR21_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-369 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the common marmoset."; RL Repbase Reports 9(11), 2927-2927 (2009). XX DR [1] (Consensus) XX CC ~90% identical to consensus. 5bp tsd. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. XX SQ Sequence 369 BP; 92 A; 118 C; 79 G; 80 T; 0 other; tgatgttagg gtcacagcaa ggcaagccca tagaccccct ccccgttctc ccctgctaag 60 ccgagctcat aacaaaaaca gcctcaagcc ttgtaaacag ggcacctagg ttccaggatg 120 tggtcacacc cggctctcga acccaggatg ttgacccaat taagcagaat gtcagcccct 180 gagaaccaag atgcggtttt tcttagaata gcacccggct acctgcaaac cccctatata 240 acccccatag tctgtaagcc aggctgctgc cttcactgcc tgtggtgagg cagccagcct 300 ggcaggttga aataaacttg ctaaacctga ccctgggtct gcctctccat cctttcggtc 360 aaccttaca 369 // ID LTR19_Mim repbase; DNA; PRI; 671 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; nonautonomous; KW LTR19_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-671 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 11(5), 1724-1724 (2011). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 671 BP; 163 A; 167 C; 149 G; 192 T; 0 other; tgttagggcc tagggcccat aggtagccaa taaccagcca ttagcttctg tattatgctt 60 gcttgcttgc ttaggagagt gggaaatttt cagttgaccc ttatctgact ccgcacctgc 120 catcccgcct ctgctagtgt gattaagttt agaaactaag gagaagggcg gtggtgcatg 180 gcacaactgc ccacaagata tagccgctag tccagggtcg aaacgccctg gacccctgag 240 gcgcctgaca tgcttcatgt acctagtgaa atgcgctctg gcttagatca caggatgcag 300 gatgtatttc aactaagatt aatgttgtcc gtaattagga tacttcagag gcttcctctc 360 ggctgaatct tagctgcagc cggtgactcc tacaaaaccc atctcaaggc tccaaccccc 420 cccccattgt gctaaaacaa gatagaaatg taagattaat gctttctctc gcttctgtaa 480 aagtgcttgc taattagatt tttagagtgt gttttttgcc tttaaaaagg ggccgttttt 540 cccttctcat cactactttc tgtgcctgcc actctgcagg acaggagtag tcctggccag 600 taaataaaga ctcacaattt ggcccaattc ggtttctaag gtggtctttt ctcgcacttc 660 ggaccataac a 671 // ID LTR21B_TS repbase; DNA; PRI; 567 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR21B_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-567 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1272-1272 (2010). XX DR [1] (Consensus) XX CC ~97% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 567 BP; 136 A; 156 C; 124 G; 151 T; 0 other; tgaggagctc aagggccatc tagttcataa ttgtgtgtaa aacttgtttt aagtgtaggt 60 tgaagcttat tttgaatgta tattgaaact tgtttttaaa actcagatgt atgcttgctt 120 tccccccaag cccagtagta tactttggta aacaagttca cataccaaaa gaaaaccgcg 180 gagtcaggtg cagaccctct ggccccattt cctgctccac tcccctagta atctcacatt 240 ccagatgtcc caaccggttc cacccagccc gcctgaccgt tgatggtcag ccgcgacccg 300 cgaccataca aaccaaccaa tcccaaacgc ccccgtaccc agtaggagaa attacccaat 360 catgtgcgaa caagtttaaa gttctttccc ctcccttgta actgtgtata taagctgctt 420 gctgcgctgg gtcggggctc tcgttcctgt accgctgcgc cggttacgga cgtgagccca 480 gactcgagcc tgaataaaga ccctcgtgtg atttgcatcg gagctggctc tttggtggtc 540 tctcggattc aggatttggg tacaaca 567 // ID LTR8_TS repbase; DNA; PRI; 994 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR8_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-994 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1265-1265 (2010). XX DR [1] (Consensus) XX CC ~97% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. 
XX SQ Sequence 994 BP; 256 A; 237 C; 149 G; 352 T; 0 other; tgctggagtc cagctccagg agagagtcag ggcttgagac gggagaatcg ggaagcggcg 60 aaaaagaaaa gacgaaaacg acaggaatcc tcttcctggg gagagctcaa gactctgcca 120 atttttattt atttttcaca agcttatata ggttttccac tcagccctgt acccacaaat 180 gtatcttatg ttgctaacca aacgcccacg tacatccttt ttgcactcac aataaggttg 240 acctttaaca atttcaacca cagatggtgt ttttcttaag tcaaacattt atgaacgtcc 300 tggttttctt cctaactttc tcataactat ttactattat aggcctataa tatccttacc 360 ctataaagct aaaccttaac cctaagtctt atttaactta cttactacta tatatttatt 420 tctattaatc ttatttctat ccttatgtct ctatattttc ttaactatca cttatactat 480 tatatatctc ttatcttact agtttatctt atcttactat cttattatta atattctaaa 540 ttctatcttc attaaactac ttatttaatc ctaaaatatc ttaacctatt ttttatctca 600 aacggttgtt ttccaccagc tctcagctgg ggtctgttgt ttatgaggaa gtggccgact 660 cgttcttctc ggcctcctgc tgtttactga aaaacatcct tacacaccct aaggggcacc 720 ctttttgctg caatctcacc tccattatgc ctatgtataa tttttgctgt caaatgccat 780 catccatttg cgcatctatt atctcaatta aagagggaac attctcaagc ctatgcacgc 840 aatccttcaa agttagggaa ggtccgtcaa caggatccat ggggctttgc ctccgtcatg 900 gccgcatttt actggggcct tgctcccagc cggcgttttt cctgttttca ggtccctttt 960 accttacttt tactagtgat tgcggctccc gaca 994 // ID LTR5A repbase; DNA; PRI; 1033 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 14-NOV-2005 (Rel. 10.11, Last updated, Version 1) XX DE Long terminal repeat of ERV2 Endogenous Retrovirus from primates. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; LTR5A_LTR; LTR5B; HERVK; LTR5A. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-1033 RA Smit A.F.; RT "LTR5A - ERV2 Endogenous Retrovirus from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC HERVK LTR 5-10%. 
XX SQ Sequence 1033 BP; 257 A; 242 C; 243 G; 280 T; 11 other; tgtagggaaa agaaagagag atcagactgt cactgtgtct atgtagaaag ggaagacata 60 agagactcca ttttgaaaaa gatctgtact cngaacaatt gctttgcctg agatgctgtt 120 catttgtagc tttgccccag ccactttgcc ccaaccactt tgacccaact tggagctcac 180 agaaacatgt gttgtataaa atcaaggttt aagggatcta gggctgtgca ggatgtgcct 240 tgttaacaaa atgtttacag gcagtatgct tggtaaaagt catcgccatt ctccagtctc 300 aatnaaccag gggcacaatg cactgtggaa agccgcaggg acctctgccc tngaaagcag 360 ggtattgtcc aaggtttctc cccatgtgat agtctgaaat atggcctcgt gggatgagaa 420 agacctgacc gtcccccagc ccgacacccg taaagggtct gtgctgaggn ggattagtaa 480 aagaggaaag cctcttgcag ttgagatgag aggaaggcca ctgtctcctg cctgcccctg 540 ggaactsaaw gtctcggtgt aaaacccgat tgtacatttg ttcaagtctg agataggaga 600 aaagctgccc tgtggcggga ggcgagacat gttngcagca atgctgcctt gttattcttt 660 actccactga gatgtttggg tggagagaaa cataaatctg gcctacgtgc acgtccaggc 720 atagtacctt cccttgaact tanttatgat atagattctt ttgctcacat gttttcttgt 780 tgaccttctc cttattatca ccctgctctc ctantacatt cctttttgct gaaataatga 840 aaatcgtaat caataaaaac tgagggaact cagaggccgg tgccngtgca ngtcctcggt 900 gtgctgagcg ccggtcccct ggacccactg ttgtttctct atactttgtc tctgtgtctt 960 atttcttttc tccgtctctc atcccacccg actagaaata cccacaggtg tggaggggca 1020 ggccacccct tca 1033 // ID MacERV3_LTR1 repbase; DNA; PRI; 513 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MacERV3_LTR1. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-513 RA Smit A.F.; RT "MacERV3_LTR1 - ERV1 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC chr1.nib:209738444 2-3% 98% identical to RheERV2_LTR2. 
XX SQ Sequence 513 BP; 138 A; 151 C; 112 G; 111 T; 1 other; tgaaactagg cctcataaaa cttagaaact aggcctcata tagaaaaaaa aaattaacac 60 caggtggctc tggatagggt cccaccctgc ctcgataggg acccaccctg atagggtccc 120 accctgccaa ttccgggaaa caacctcatg gggtcccacc ctgccaattc cgggggtccc 180 accctgcctc gaagttcccg gaatcaacaa ctccaggaaa aaacctcata aggtcctgct 240 ctaaccaatt agaacaagac accttgctca ggccatagct agacccaatc accncgcgcc 300 ttaagctttg tttgaatttc gcgccataag ctgtgtttga acttgtgttt gcctatataa 360 acagcctgta acaagcagtc ggggtcccag ggccaactta gagcttggga ccctagcgcg 420 ctagtaataa ataactctct gctgtgaatc tcgtgtcggt gatccttcgc ggcgacccct 480 gcccaggagg gaatcgacag ttcggttcca aca 513 // ID ERV3-2_CJa-I repbase; DNA; PRI; 7855 BP. XX AC . XX DT 12-JAN-2011 (Rel. 16.02, Created) DT 12-JAN-2011 (Rel. 16.02, Last updated, Version -1) XX DE Internal portion of an ERV3-type endogenous retrovirus - DE consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; ERV3-2_CJa-LTR; KW ERV3-2_CJa-I. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-7855 RA Jurka J.; RT "Endogenous retroviruses from the common marmoset."; RL Repbase Reports 11(2), 692-692 (2011). XX DR [1] (Consensus) XX CC ~90% identical to consensus. LTR is related to LTR5 subfamilies CC from the same species. CC This sequence was derived from the marmoset dataset made CC available by Genome Sequencing Center at Washington University CC School of Medicine in St. Louis. CC We thank the Marmoset Genome Sequencing Consortium for making CC their data publicly available. 
XX FH Key Location/Qualifiers FT CDS 6027..7772 FT /product="ERV3-2_CJa-I_2p" FT /translation="MCPASNWPPKMISGSPNPWAPLASASPVTPTRQLLTL FT QILLLTILLVSPLKANSYYVWRFYLQESWTEGPTTKTQYLAQADCQPAGCQ FT SAIKFEFPKSKANTIKNSWNNFGLCFLYDQTKDSCHSWNDTYGGCPYASCV FT IHTTFRTDTXPASKMLFLNDQGKVSLIIHDPWDARWEKGTAGKIYTRFTDS FT HPSGTWLIFRSYTRIVPVEIDRLNSLSNXILQNEATLTNQLKPSSNTPQPF FT SWLTLVRQGVNMLSLTEAKNFTNCFLCASLNKPPLAAVPLRTGFNLSQSPT FT GPDTTLTGIPLFKVHSQNLSLCYGTASSPSLRCNTTVEVKSSLYAPPGGYF FT WCNGTLTKVIDASLPFPCVPVTLVPQLEVYGQAEFLSLIAPSPNHRSKRAV FT FLPIVVGLSLASSLAAAGIGSGAMGYSVTSAAQLEDKLRVAIEASAASLAS FT LQRQITSLAQVTLQNRRALDLLTAEKGGTCLFLQEQCCYYINETGLVEENV FT NTLYRLQENLRKKQSAPASSLNWWQSPMLTWLAPIITPVIVVCLLLMLAPF FT FLRFLQARMREISRVAVNQMLLHPYVRLPTEAPAN" FT CDS join(1019..2584,2588..6175) FT /product="ERV3-2_CJa-I_1p" FT /translation="MGNQKSRPDPKSPLGCLVQNLPKLGLSIKKKRLLFLS FT TVAWPQYPLDNQSKWPPEGTLDFNVLTDLDNFCQKNGKWSEVPYVQAFWYL FT RSRPALCSSCSSAQVLLAKTPPLPSAPPGSALSENPEDLSCPARFTNPPPY FT ARPQLPPPSSPSPPQSPNMTSPVSARTRSHDPPLLCPLREVAGAEGVVRVH FT VPFSLSDLSAIEKRLGSFSTNPTAYTKEFRYLTQAYDLTWHDTYVILSSTL FT TPDERDRILTAARAHADQVHLTNNQMPVGAAAVPEADPNWDYQTGQDGRRR FT RDQMLQCLLAGMQSAAQKVVNYDKLREITQGPDENPAAFLDRLTNAMILHT FT RLDPASLAGATVLATHFISQSAADIRRKLKKAEEGPQTPIRDLVNMAFKVF FT NGREEKAEATRQARLQQKVGLQTRALVAALRPAAHRTSGGGGPSKPSQNSS FT KAPPGPCFKCGQEGHWARQCPCPRPPPGPCPSCKQTGHWKVDCPSRAAGSP FT PTPLCGGQAGPQEAVPSLELLGLLDDRSPDSRTPITLAEPRVTLQVAGKSI FT SFLMDTGATYSVLPSYSGPSIPSQVSVMGIDGKPSCPNQTHPLACVLEGHP FT FSHSFLIIPSCPVPLLGRDILQTVGATLQLTGPPHSSPSSAHLLLLLLTGT FT TPCLDPSPQPPIPPNEVNPSVWDTSKPVVARHHTPVKILLKNPTHFPSRPQ FT FPISKTHRQGLKPIITRLLAQGLLIPTSSPCNTPILPVQKSTGDYRLVQDL FT RLINEAVVPLHPVVPNPYTLLSNIPSTTSHFTVLDLKDAFFTIPLHPESYF FT LFAFTWTDPDTNLSQQLTWTVLPQGFRDSPHIFGQALAADLSSCSLQPSLL FT LQYVDDLLLCSPSLSLSQQHTATLLNFLGSQGYRVSPSKAQLSVTSVVYLG FT IRLTPKTKSLTTDRRQALLSLQPPETADQILSFLGFVGFFRHWVPNFAALA FT KPLYVAAKDTPTGPLSHPGVVKQAFRTLQTALSTAPPLQLPDLNHPFHLFT FT DEKQGVAVGVLTQPLGPVYRPVAYLSKQLDPTVRGWQPCLRALGAATELTK FT ESLKLTLGQPISVFSPHRLTDLLSHKSLAHLGPSRLQEFHLLFIENPSVSL 
FT HPTSPLNPATLLPLPSHSPSPAHSCPELIDAFAKPREGLSDLPLSNPDLTF FT YVDGSSIVTAEGRRKAAYAVVTDSATLEARRLPDGTTSQKAELIALTRALT FT LARGKRANIYTDSKYAFLITHCHSALWKERGFLTTKGSPIINATQISNLLQ FT ALSLPKEVAVIHCRGHQAQQDAVSRGNARADAAAKSLTSTRPAPTPVLFLT FT TATPPIYSPSEKQALILKGGTESDQGWIFLDNKIALPREQAPKIIAEIHQS FT LHIGPKALHRFVQPLFFSPGLQQTIEQVHKACVTCSKVSSQGGLRPQFPTH FT QMRGHLPAQDWQIDFTHMPTHKKLRYLLTFVDTFSGWIEAFPTSRETADTV FT ASLLIQEIIPRFGLPXNIQSDNGPAFTAQVVQLVAKSLNISWKLHIPYHPQ FT SSGKVERAHGILKDHLAKLTIEVKLSWPTLLPLALARVRATPRGPTGLSPF FT ELLYGRPFLVSHNLPVQPPPLASYLPYLSLLRHLLREHADRTLPVVPGPED FT SHPATPLQPGDSVLLRELKPGSLQPRWSGPHIVILTTPTAAKLLGHTPWYH FT VSRLKLAPQNDQWKSEPLGPTRLRLTRNPYPSTPNPSNSPPNNPSGVPS" XX SQ Sequence 7855 BP; 1796 A; 2685 C; 1610 G; 1759 T; 5 other; agacaataac ccttacaagt ggtgccgaaa cccgggagga gttgtaagac ccctcctggg 60 ggaaactgac tcctccaccc gaagccggac cagggaaccg ctccatcaca aggtaagact 120 tctgttaccc acagcctctc ccgctgcctg cctcgtggac tccctgctcg taatcgcggc 180 cgcgtcaggg actccttctc gtttctcgcc gtgggcccgg ggccccggcc tcgtctcgag 240 gacctgagcg tagaggagac gtccctacac tcttgtcttc tccccgtgag gggtttctcg 300 gggtcaggaa tctgagggag acgtccccac cttcctgaac cctctccggt gttcggtagc 360 gcggtccggg acgccttccg aacatctccg cagcagcggc cgaggtttag gtattcagaa 420 gccccgtttg cagacccggg ccctgcctaa cagcaggacc aaatccccgg cttacggtac 480 taaaacgtct atctcaaatc agccttttgg catataaact cccgttaatt aagtaactgt 540 actttcggtt tctcttccta atcccgagga gtgagcgagt gagcgagttt atgtgtagtc 600 ccacggccta acggctacgg ggcttgtgtg ggcctcaacc cgcggttcca cggggacgtc 660 ataaggcata acggtgattc acctcaggac tgaaagttct gaggtgacca ggcccggggg 720 ttatacgggt acaccttagc caagacgccc tgaaacatcc cttcggggac gtggccggga 780 aggcatcaga ctggtctccc atctgggggg cacccaccct ttgtccgtca atcctatatg 840 tggttctgtc tcagatagcc ctagccctgt ctctttttat tttctttctc gttataatcg 900 ttgcagcctg taagttctct cagccacccc ccgggaaagg agtacttctg cccgtgttgg 960 gtataatacc tttccatatt cctttcttac ccctaactac cccacaagaa agtgaaccat 1020 gggcaatcag aagtcccgcc cagaccctaa atcgccatta ggatgcctag tccaaaatct 1080 acccaaactg 
ggtctttcca ttaaaaagaa aaggctgctt ttcctctcta ccgtggcctg 1140 gccacagtac cccctggaca accagtctaa gtggcctcca gagggcacac tcgacttcaa 1200 tgtcctgaca gatttagaca acttttgcca aaagaacggc aaatggtcgg aggtccccta 1260 cgtncaggcc ttctggtacc ttcgctcccg acccgccctc tgctcctctt gctcctccgc 1320 ccaggtcctg ttagccaaaa ccccgccttt gccctcagcc ccgcccggct ccgcactctc 1380 cgaaaacccg gaagacctgt cctgccccgc ccgctttacc aacccccctc cttatgcgag 1440 gcctcaactc cctcctccct cttctccttc tcctccccaa agccctaata tgacttcccc 1500 tgtcagtgct cgcacccgct ctcacgaccc cccactcctc tgccctttgc gtgaggtagc 1560 cggagcagaa ggcgttgtcc gagtacacgt tcccttctct ctctctgatc tatcagccat 1620 tgaaaaacgc ctagggtcat tctctaccaa tcctactgct tataccaaag aatttcgtta 1680 cctcacccag gcgtacgacc taacctggca tgacacctat gtgatcctgt cctccactct 1740 tactccagat gagcgtgacc gcattcttac agcggcccga gcacacgctg atcaggtaca 1800 tctcacaaac aatcaaatgc cagtaggcgc agcggcggtc cccgaggccg accctaattg 1860 ggactatcaa accggtcaag atggccgccg ccgccgagac caaatgctac agtgcctcct 1920 agcgggcatg caaagcgcag ctcaaaaggt agttaattat gacaagttaa gggaaataac 1980 tcagggtcct gatgaaaacc cagctgcctt cctagaccga ttaactaacg ctatgatcct 2040 ccatacccgg ctggatccgg cctccttagc aggggctaca gtcctagcca cccacttcat 2100 ctcccagtcg gcagctgata taaggaggaa acttaaaaaa gctgaggagg gcccccaaac 2160 tcccatacgg gacctggtga acatggcatt taaagttttc aatggccgag aggaaaaggc 2220 tgaggccacc cgccaggccc gtttacagca aaaggtaggc ctccaaaccc gagctcttgt 2280 agcagccctg aggccggcag cccaccgaac atccggtgga gggggtccat cgaaaccctc 2340 ccaaaattct tcaaaggccc cgcctgggcc ctgtttcaaa tgcggccagg agggacactg 2400 ggcacggcag tgcccctgtc ctcgtccacc tcctggaccc tgcccaagct gcaaacagac 2460 tggacactgg aaggttgact gcccgtcccg ggctgcaggc tcacccccca cacctctatg 2520 tggagggcag gccggtccgc aggaagcagt cccttcgctt gaacttctcg gccttctgga 2580 cgactgacgc agcccggact cgaggacccc catcaccctt gccgagccca gggtaacgct 2640 gcaggtagcg ggtaagtcca tttccttttt aatggatacg ggggctacct attctgtact 2700 gccttcctac tcagggccta gcataccttc ccaggtttca gtcatgggga ttgacggaaa 2760 gccctcctgt cctaaccaaa 
cccatccctt agcctgtgtc ctcgagggac accccttctc 2820 ccactccttt ctaatcatac cctcatgccc agtccccctt ttaggacgag atattctcca 2880 gactgtaggg gccaccctcc aactcactgg ccccccacat tcaagcccat cctccgccca 2940 cctcctattg ctactcctaa ccggtaccac gccctgtctt gacccctcgc ctcaaccccc 3000 tatccctcct aatgaagtta atccctcggt atgggatacc tccaagcctg tggtggccag 3060 acatcacacg ccagttaaaa tacttctcaa aaaccctacc cactttccct cacgtcctca 3120 gtttcccatc tcaaaaaccc accgccaggg cctaaagcct atcatcaccc gacttctggc 3180 ccagggactc cttatcccca ccagttcccc ttgtaatact cctatcttac cggtccaaaa 3240 gtcgaccggc gactaccgcc tagttcagga cctgcgcctc atcaacgagg cggtggtacc 3300 cctccacccg gtggtcccca acccatacac cctattatcc aacattccct ccactacctc 3360 tcattttacg gttcttgacc tcaaggacgc cttctttact attccccttc acccagagtc 3420 ttacttcctc tttgccttca cctggaccga tcctgacacc aacctgtcac agcagctcac 3480 ctggacggtc ctgccccagg gatttagaga cagcccccat attttcggac aagccttagc 3540 agctgacctt agctcctgct ccctccaacc cagtctcctt ctccagtatg tggatgacct 3600 attactttgt agcccctccc tctccctgtc tcaacagcat actgccactc tccttaactt 3660 tcttggatcc caggggtacc gggtatctcc ctctaaagca cagttgtcag tcacctcagt 3720 agtctactta ggcattcgcc tcacccccaa aactaaaagc ctaacaaccg accgccggca 3780 ggcacttctt tcactgcagc ccccggagac tgcagaccag attctctcct tcttaggatt 3840 tgtaggattc ttccgtcact gggtacctaa ctttgccgct ctagctaaac ctctgtatgt 3900 ggcggctaag gacacaccca cgggacccct ctctcatccc ggagttgtaa aacaagcttt 3960 cagaacccta caaactgctc tgtccacggc acctccactt caactgccag accttaacca 4020 tcccttccac ttgtttactg acgagaaaca aggtgttgca gttggagtcc taacccaacc 4080 cctaggtcct gtgtatcgtc ctgtagcata cttatcaaaa caacttgatc ccacagtcag 4140 gggatggcag ccttgcctgc gggccctggg agcggcaaca gaacttacca aggaatccct 4200 aaaacttacc ttgggccagc ccatctcggt gttctctcct catcgactga ccgaccttct 4260 ctctcacaaa tctctcgctc acctaggacc ctctcgccta caggagtttc atctactctt 4320 tattgaaaat ccttcagtca gccttcatcc cacttctcca ctcaacccag ctactttgct 4380 tccccttcca tcccattccc cctccccagc ccattcctgc cctgaactaa tagatgcatt 4440 tgctaaacct cgggagggcc tgtctgacct 
tcccctatct aacccagacc ttacctttta 4500 tgtagatggg agttcaatag tgacagccga ggggcggcga aaagcagcct atgcagtagt 4560 cacagattca gccaccctcg aggcacgccg cctccccgat ggaaccacat cacaaaaggc 4620 agagctcatt gccttaaccc gagctctcac gctagcacgg ggaaagcgag caaatatcta 4680 tactgattca aagtatgctt tccttattac ccactgccat tctgccctct ggaaagagcg 4740 gggattcctt accacgaaag ggtcccccat aatcaatgct acccaaatct ctaatctcct 4800 acaggccctg tcactgccca aagaagtcgc tgtcattcac tgccgagggc accaagccca 4860 gcaggatgct gtttcccggg gcaatgcaag ggcagatgcg gcagcaaaat ccttaaccag 4920 taccaggcca gcccccactc cagtcctttt cctcaccacg gccacgccgc cgatttactc 4980 gccatcagag aaacaggccc tcatattaaa aggggggacc gagtctgatc agggctggat 5040 cttcctagac aacaaaatcg cccttccccg ggaacaggcc ccaaaaatca ttgctgaaat 5100 ccaccaatcc ttacatatag ggcccaaagc gttgcaccgc ttcgtgcagc ccctcttttt 5160 ctcgccaggt ctgcaacaga cgattgagca agtacacaag gcatgcgtta cttgctctaa 5220 ggtctcgtct cagggaggct taaggccaca gtttcctacc caccaaatgc gtggccacct 5280 gccagcccaa gactggcaga ttgattttac ccatatgccc actcacaaaa aactccgcta 5340 ccttctaacc tttgtagata ccttttctgg atggattgaa gcctttccca cctccaggga 5400 aaccgcagac acggtggcct ccctcttaat tcaggaaatc atccctcgtt tcggcttacc 5460 tgnaaacatc cagtcagaca acggtccagc gtttacagct caggtggtcc agctggtcgc 5520 caagtctctc aacatctcct ggaaactaca catcccctac caccctcagt cgtcgggtaa 5580 agttgaacgg gctcacggca tcctcaaaga ccatttggcc aaacttacca tcgaggtaaa 5640 actctcctgg cccacactcc tgcctcttgc tctagcccgg gtccgggcca ccccacgggg 5700 gcctacgggc ctcagcccct ttgaattgct gtacggccgg cccttcctgg tctcccataa 5760 tctcccggta cagccacctc ccctagcctc ctacctcccc tatctctctc tcctccgcca 5820 cttactcagg gagcacgcgg accgcaccct cccagttgtt ccaggaccag aggactccca 5880 cccagcgacg cctctccaac caggcgacag cgttctcctc cgagaactca aaccaggatc 5940 cttgcaaccc aggtggtcgg gaccccacat cgttatccta accaccccca cagcngccaa 6000 actcctaggc cacactccgt ggtaccatgt gtcccgcctc aaactggccc cccaaaatga 6060 tcagtggaag tccgaaccct tgggccccac tcgcctccgc ctcacccgta acccctaccc 6120 gtcaactcct aacccttcaa attctcctcc taacaatcct 
tctggtgtcc cctcttaaag 6180 ccaactccta ctatgtctgg cggttctacc ttcaggagtc ctggacggaa ggccccacca 6240 ctaaaaccca ataccttgcc caagcagatt gccagcccgc aggctgccaa tcagccataa 6300 aatttgaatt cccaaaatcc aaggccaata ctatcaaaaa ctcctggaat aactttggcc 6360 tctgctttct atatgaccaa accaaagact cctgtcattc gtggaacgat acctatggtg 6420 gatgcccgta cgcttcttgt gttatacaca caaccttcag aacagacacc nccccagcaa 6480 gtaaaatgct tttcctcaac gaccaaggta aagtgtccct tatcatccac gatccctggg 6540 acgcccgctg ggaaaagggt actgcaggta aaatatacac caggtttaca gacagccacc 6600 ccagcggtac gtggctaata tttcggtcct atacccgcat cgtgccggtg gaaatagacc 6660 gactcaactc gttaagcaac nccatcctgc agaatgaggc caccttaact aaccaactca 6720 agccctcgag caacactccc caacctttct cttggttaac gctagtacgg caaggtgtta 6780 acatgttgag cttaaccgaa gccaaaaact tcactaattg ctttttatgt gcctccctta 6840 acaaaccccc cttagccgca gttcccctcc ggaccgggtt caacctgtca caatcaccca 6900 cggggcctga caccacctta acgggtatcc ccttgttcaa agtacattct caaaatctct 6960 ctctctgcta tgggacagcc agcagcccca gcctcaggtg caacaccacg gtagaagtaa 7020 aatcctccct ttacgcgcct ccaggaggat acttttggtg taatggaacc ctaaccaaag 7080 ttattgatgc ctccctcccc ttcccctgtg tgcccgtcac actagtcccg cagttagaag 7140 tgtacggaca agccgagttc ctatccctca tcgcaccgtc ccctaatcac cggagcaaac 7200 gagcagtctt cctcccgata gtagttggcc tctctttagc atcctctcta gcagcagctg 7260 gaatagggag cggcgccatg ggctacagcg tcacttcagc cgctcaactg gaagacaagc 7320 tccgtgtggc tattgaagcg tcagccgcct ctttagcctc cctccagcga caaatcacgt 7380 cgctggctca ggtaactctc cagaaccgac gggccctaga cctactaacg gcagaaaaag 7440 gaggtacctg tctgtttctc caggagcagt gctgctatta catcaatgaa acaggccttg 7500 tggaagaaaa tgtcaacact ctctatcgcc tccaagagaa cctccgcaag aagcagagcg 7560 caccagcctc gtctctcaac tggtggcaat cccccatgct tacctggcta gctcccatta 7620 tcacccccgt tatcgttgtg tgtcttctac taatgctagc cccttttttc ctcaggtttt 7680 tacaggcacg catgagggaa atctccaggg tggccgtaaa ccagatgcta cttcacccct 7740 atgtccgact cccaaccgag gcaccagcca actgacccct tcccctcact ccgcccctat 7800 tcagcaggaa gtagccagaa agaagcggcg tcctatatca tcaaaagggt 
cggaa 7855 // ID LTR10B1 repbase; DNA; PRI; 547 BP. XX AC . XX DT 05-MAR-2004 (Rel. 13.06, Created) DT 26-JUN-2008 (Rel. 13.06, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; LTR10B1. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-547 RA Smit A.F.; RT "LTR10B1 - ERV1 Endogenous Retrovirus from primates."; RL Direct Submission to Repbase Update (26-JUN-2008). XX DR [1] (Consensus) XX CC HERVI-LTR. XX SQ Sequence 547 BP; 119 A; 165 C; 100 G; 159 T; 4 other; tgtcagatac agtaagttcc tcttcaaagg tttaacttgc tcaacttcct tgttctttgt 60 tcttaagacc aacttccttg tactctcttg cccctagcta cctgctctgt aaacaacttc 120 tcccgccagt cccaatctgt aactcacatc tcttccttac ttggaaagag tcctctttac 180 tcctggctac ccattctgta aacaaccctc cttcccgcct ttgccgcgcc ctgacatgcc 240 cagacatgcc ttgtactgta acggacagcc tctcccttcc cacctagnta gccatattca 300 attttaaaca gtagccaatc gggtcagctt agattgtgcg gtccgactcc agccaatggg 360 ganaggacac agaagcaggg actaactgcg ttagggataa aaaccccttc cctccttcgt 420 tcggtgtgct ctcgcagcag ccagaaatgc gagcagcacc cttctgcaga agtaaatttg 480 ccttgctgag aaatcttttg tttgagtgct ngttcttctt tgcggcwccg agctcttgtt 540 tccaaca 547 // ID MER50C repbase; DNA; PRI; 782 BP. XX AC . XX DT 05-JUN-2008 (Rel. 13.06, Created) DT 05-AUG-2008 (Rel. 13.07, Last updated, Version 2) XX DE LTR from human endogenous retrovirus-like element. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW Long terminal repeat; MER50C. XX NM MER50C. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-482 RA Jurka J.; RT "Long terminal repeats from the human genome."; RL Repbase Reports 8(6), 668-668 (2008). XX RN [2] RP 1-782 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (01-AUG-2008). 
XX DR [2] (Consensus) XX CC This is a very small family (~30 copies). CC [2] Extended about thrice to full-length, matching MER50 and CC MER50B about 80% but with many indels. Despite high divergence CC level, appears to be primate-specific, or perhaps to CC Euarchontoglires. XX SQ Sequence 782 BP; 204 A; 200 C; 217 G; 153 T; 8 other; tgttagagta ggtagttaga cgtgagcagg gcaggagaga gggcccccag gaatgtcggg 60 catttgtcaa gccatggtca ggcgattata aatctgtccc tctgaaataa tgagcaggac 120 aagggaggga ccccagagct gtcnggctct catcgggtga nggacaggcg ggcataaaac 180 tgtccctctg agataataag tggccacgac tggcgccggg agnganagga gtcttncaac 240 agatagaaaa cacctggagc cagcaagcca caatccctga taaggtttca agcatgcgca 300 gtaaaggggc aagatggcgg aatttgaccg gtatatgacc ttcctctggg ggcgctcgac 360 cagtaaggga gaatcgcccc aagtgagcat gcgcacgact tcagtaaaca cactgcgcat 420 gcggcccctc ccaagtgctg gcaggccact gcgcatgcgg caattaagcg acagcccgcc 480 caagggagga acaaagggag gagacagaga gcccgggaaa agatacgggg tataaaaacc 540 ctaagccaag gancgagcgg ggcacttgat ttctcaagtc gcccgcttgg ccctcttcca 600 agtgtactct gctttctcta ataaactctc actttgctta aaataaattt tccctcctgc 660 tttaaacctt gcctgtgtct ctcgnttgaa ttctttcctc cgagaagaca agggaccgag 720 attgctgcgg antcgccact ccggagtttc tccggatagc tgcagactca ccgccggtaa 780 ca 782 // ID CERV2_INT repbase; DNA; PRI; 7584 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 17-AUG-2004 (Rel. 9.07, Last updated, Version 1) XX DE A DNA sequence of chimpanzee endogenous retrovirus CERV2 - DE internal portion. XX KW ERV2; Endogenous Retrovirus; Transposable Element; CERV2; KW CERV2_INT; Chimpanzee endogenous retrovirus. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-7584 RA Skaletsky H., Hughes F.J. and Page C.D.; RT "Original sequence of an endogenous retrovirus CERV2."; RL Repbase Reports 4(7), 191-191 (2004). 
XX DR [1] (Consensus) XX CC Internal sequence of an endogenous retrovirus with ORFs CC for gag (1671-2153), pol (2370-4154) and env (5670-6233). CC CERV2 is absent from the human genome. It is similar to CC Baboon endogenous retrovirus strain. XX SQ Sequence 7584 BP; 1971 A; 2229 C; 1774 G; 1610 T; 0 other; tttgggggct cgtccgggat tgctcgccgc ctgacgaccc ccgacccgca cggcgaagac 60 acactcggcc cgggccatcg actgactgac gaccaggtgc cctcaggtac tttcgttttg 120 ttttgttcgt ctatttgtgt tgccagctaa atcagaagag tactcaatct gtataagtgt 180 ggaagggggg cagacgtgct cggcaccttc ccacttacgc cccgggggac gccctggcgg 240 ttgtctggag gaaaattgac gattccgtca atctcttcac ctctgaaggc aggttctccc 300 tgccatctga atccttcgtg gaaccgaagc accgccctgg cggttgtctg gaggaaaact 360 gacgattccg tcagtctcct cacctctgaa ggcaggttct ccctgccatc tgaatctttt 420 gtagaactgt ggcgccgatc tcttgccgcg cggcttctct gtgtgtgtgc agtctctgtt 480 tttactattg tcttgtttgc ctgtgcgttg gaatctgtag acacgactat aggacagaca 540 ctgacgactc ctctatcatt gaccctgact cacttccctg acatacgagc gcgagcccac 600 aatctctctg tggaaattcg caaagggcga tggcaagcct tctgctcttc tgaatggccc 660 accctcggcg ttgggtggcc ccaaggcggg acctttgacc tctccattat cttacaggtt 720 aagacaaaag taatagatcc agggccacgc ggccatcctg accaggtggc ctatatcatc 780 aactaggaag acctggttcg ggatcccccc ccttaggtga agcccttcct gccctcggct 840 tccccttccc agtcgaccct cctcgccttg gaggcccccc gagaccagac cccggtcccc 900 ctgaaacctg tcctcccgga tgagagtcag aaggacctcc tcctcctaga ctccctccct 960 cctccgcctc acaatcccct cctttacccc cctccttacg ccacgccctc accccctgcg 1020 ttgtctcctg ccccttcttc caccccctcg gctcctactc tttctccaac ctcttcttcc 1080 accccctctc agatcccatc cccctctccg gccccacccg aactcgcccc tcagacccgc 1140 ctcagacacc tcgcctccgc ttacggcggg ccgaggaccc tggcgaccag tccgcctggc 1200 agtcctccct ttttcccctc cgcaccgtga accgcacggt ccagtattgg cccttctcgg 1260 cctcagatct ctacaattgg aagacccata accccccctt ctcccaagac ccgcaggccc 1320 tgacctctct gatagagtct attctcctca ctcaccaacc cacctgggat gactgccaac 1380 agctcttaca ggttcttctg accacagaag agaggcaacg ggtcctcctc gaggctcgga 1440 
aaaatgtgcc agggccagga ggattcccga cccagctccc caacgagata gatgagggat 1500 ttcccctcac ccgcccggat taggactatg aaacagcaac aggtagggag agtctccgaa 1560 tctatcgcca ggctctgttg gcaggtctca aaggggctgg aaagcgcccc accaatttgg 1620 ctaaggtaag aactattact cagggaaaga acgaaagccc agcagccttc atggaaaggc 1680 tcctagaggg gtttcgaatg tacactccgt tcaatcccga ggccccagag cacaaggcca 1740 ccgtggcaat gtcattcata gaccaagcgg cgctagacat aaagggaaag ctccaaagat 1800 tggacgggat ccagacctat gggctgcagg aactagtgag ggaagcagaa aaggtttata 1860 acaaaagaga gactactgaa gaaaaagaag ctaggctagc aaaggaacag gaggagcggg 1920 aagatcgacg agatcgtaag agagacaggc atttgactaa aatcctggca gcagtagtga 1980 cagggaaagg gccagggcca gggagagagg ggggagaacg aaggcgcccg aaggtggata 2040 aagaccaatg tgcctattgc aaggaacgag gacattgggt caaggaatgt cctaaacgtc 2100 ctaaggaccg gaagaagccc actcctgtcc tgaccctggg agaggacagt gattaggggc 2160 gtcagggctc cgaagccccc cccgagcccc ggctaaccct ttctataggg gggcgcccca 2220 ccacctttct agtggacacc agggcccagc attcagttct gacaaaagca gacgggcctc 2280 tttcatcccg cacctcttgg gtccaaggag caacaggagg aaagctgcac aagtggacga 2340 cccaccgaac agtaaacctt ggaaaaggta tggtgactca ttctttctta gtagtacctg 2400 aatgcccata tccccttctg gggcaggatc tgttgaccaa gctcggagcc cagatacatt 2460 tctcagagag aggggcccag gtactgggtg aggatggtca gcctatccaa attctgaccg 2520 tttccttgca agatgagtac cggctttttg agactcccaa cttcaccagc cctcccgata 2580 attggctgca agaatttccc caggcttggg cagagacagg gggacttgga ctggcaaaat 2640 ttcaagcccc gattatagtt gacctcaaac ccaccgcagt gcccgtgtcc attaagcaat 2700 accccatgag ccgagaagcc cgtatgggca tccaacagca tgttaacaaa tttctggaat 2760 taggagtctt acggccatgc cgctcacctt ggaatacgcc actccttccg gtaaagaaac 2820 ctggtaccca agattatagg cccgtccagg acttaagaga aattaataag agaaccatgg 2880 acatacatcc tacagtccct aacccttaca acctgctcag taccttgaga ccagaccaca 2940 actggtatac agtactagac ctaacagatg cattcttttg cttacccctg gctccccaaa 3000 gccaagagct ttttgccttt gaatggaagg acactgagag gggaatctca ggccaattac 3060 cttggactcg gcttccccaa gggttcaaga actctcctac cctctttgac gaggctcttc 3120 accgggactt 
ggctgatttt cacaccccgc acccagattt aactctgctc cagtatgtag 3180 atgacctcct cctggcagcc cccactaaag aagcctgcct acagggcacc aggcaactgc 3240 tccaggagct cggagaaaaa ggataccaag catctgccaa gaaagcacaa atctgccagg 3300 ctaaggtaac ctacctagga tacatcttga gtgaagtaaa aaggtggctc acccctgggc 3360 ggatagagac tatagcccgc attccgccac cccggagccc caaggaggtg cgtgagtttc 3420 tgaggactgc cgggttctgc cgtctgtggc tacccggttt tgctgagtta gcggccccca 3480 tttatgccct caccaaaggg agcaacccct ttacctggct ggaagaacac caacaggcct 3540 tcgaaacttt aaagaaggca ctcctctctg cccccgccct cgggctacct gacacatcca 3600 agccttttac cctctatgta gacaagagac gggggatagc caaaggggtt ttaacccaaa 3660 aactggggcc ctggaaaaga ccagtagcct acttatctaa gaaactggac cctgtggcgg 3720 ctgggtggcc cccttgcctc cacattatgg cagccaccgc tatgctagtc aaagactctg 3780 ctaagttaac ccttgggcaa ccattgactg tcattacccc gcatgccttg gaggccatag 3840 tgcagcagcc cccggaccgt tgggtcacca atgctcgtct aacccactac caagccctcc 3900 tactagacac ggaccgcgtc cgctttggcc ctccggtcac tctgaatcct gccaccttgc 3960 tacccgtacc ggaagtcccg ctgagccccc acgactgtcg acaagtgctg gcggagaccc 4020 acgggactcg agaagacctc caggaccaca aactcccaga cgcagaccat acttggtaca 4080 cagacggtag cagcttcatg gacgcaggta cccggagggc gggggcggcg gtagtggatg 4140 gatatgccac gatataggca caggcactgc ctcccggaac gtctgctcag aaggctgaac 4200 taactgctct aacaaaggcc ttggagctat cgcaggggaa aaaggctaac atctacacag 4260 acagtcggta tgcctttgca acagcccaca cccatgggag catttacaag aggcgagggc 4320 tcctaacatc agaaggaaaa tcaaaaataa ggctgaaata atcgccttat tgaaggccct 4380 cttcctccct aaaaaggtgg ccataattca ttgtcctgga catcaaaaag gacctgatcc 4440 cgtcgcccaa ggtaacaggc aagctgacca cgcggccaag caggctgcta gaatagagac 4500 attgacctta gtttcggaaa ccagagaggc tgaccggata tcccctccca caagttatat 4560 ctatacacca gaggaccagg aagaggcagt agccttagga gccatagaaa accaagagac 4620 taaaaattag gaaaaagacg ggaaaacagt tctcccacga aaagaggcca cggccatggt 4680 gcagcagatg cacgcctgga cacatctaag tagtagaaaa ctaaaacttc tcattgaaaa 4740 gactgacttc ctaatcccca gggtcggcac cctcctggaa caagtaacgc tcgcttgcaa 4800 ggcctgccaa cgagtaaagg 
ccggggccac gcgagtcccg gcggggataa ggacacgggg 4860 caaccgccct gggacctatt aggaagtaga ttttactgaa agtctcacca tgcgggatat 4920 aaatatttat tagtatttgt agacaccttt tcaggatagg tagaagccta ccccacccag 4980 caagaaacgg cccacatagt ggccaagaag atattggaag aaattttccc caggttcgga 5040 ctccccaagg taattaggtc agacaatagg ccagccttcg tctcccaggt aagtcaagga 5100 cttgccaaaa tactggggat taattggaaa cttcattgtg cttataggcc ccagagttca 5160 ggacaggtag aacggataaa cagaactatt aaagagacct gaacaaaatt gaccttagag 5220 actggcttaa aagattggag acgtctccta tccctagctc tcttgagggc ccgaaataca 5280 cctaatcgct ttaggctcac cccttatgaa atcctctacg ggggaccacc tcccttgtca 5340 accttgcttg attctttctc cccctctgac cctaagactg acttgcaggc tcggctaaaa 5400 ggactacagg cggtgcaagc ccaaatttag gctcctttgg cggaactgta ccagcctgga 5460 cacccacaaa ccagtcatcc tttccaagtg ggagactccg tctatgtcag acgacatcgc 5520 tcccaaggat tagaaccccg gtggaaagga ccatgcatcg ttctcctcac cacacccact 5580 gccgtgaaag ttgacggggt cgccgcctgg atccacacat cccacgtgaa agctgctccg 5640 aaggcgccag aatcagcatc gcctgagaaa tggagacttc gtcgctccag ggaccccctc 5700 aagataagac tctcccgtgt ctaacccccc acctactgtt agcccttttc cttccctggg 5760 ttatcggaag cagcaacccc catcaaccct atcgattgac ttggcaaata actaattttg 5820 aaacccatga agtcctcaac gagacttcac atatagcccc tttaaacacc tggttccccg 5880 acctctactt taatcttgac aaaatagcca tgataaatga aatggaaggt ggtgagtgga 5940 gaaagcaagc gagaaaggtc tcccttagtc gaaacgggtt ttatgtttgc cctggatttc 6000 ggacaggacc gatgaaaaag acctgtggtg aaataatgtc cctatactgt gcaagttggt 6060 catgtgtaac aactaatgat ggagaatgga aatggaaaac ccaactctgg tatgtgacca 6120 tgtcctatgt ccagccatgc actaggacac ggtattcggc cacctgtaac ttaatctgtg 6180 tcaaatttga ggaggccgca aaaactgacc cccgttggac aaccggacta atttgaggcc 6240 taaatttata ccaatctccg gcatttggac tccctatcca aattaggcta ctagtcaacc 6300 cggtctcagc ctcggtccca gtaaggccaa acccggttct aacagggagg gcaccttctc 6360 agtcagagag ccggcaaaaa gtcccaacca ccgtttcccc atctcctccg ccatccgaat 6420 ccccattggc actcccaagg gccccctcgg tgctcccggg gaccacccgc ctgcctcccg 6480 acctggaagc aaccagtaga ctcttcaacc 
tcatcagagg cgcttacctc gccctgaacc 6540 agacaaggcc taaatccacc acctcctgct ggctctgcct ggccacaggc cccccttact 6600 atgaaggtat tgcctctgtt agtaatttta ctaattccac tagtcatttt ggatgtgcat 6660 ggcaccagca caagaaactt accctagcag aagtgtcagg gtcgggaacc tgtataggcc 6720 aggtgccccc cagtcaccaa catctctgta ataaaaccct ggcagtaccc agaactagcc 6780 actatctaat accctccggg ccagactggt aggcttgcaa aaccggactt acccgttgtg 6840 tatccacagc tgtcttcaac gacagtgagg attactgtat attagtacaa gttgtgcccc 6900 aagtttatta ccaaactgga gagtctttta aatcccagtt tgagcaaaaa tacctcacta 6960 gaatgaagag agaacctgtt tccctcaccc tcgctgttat gctaagatga ggagtagcgg 7020 ctgaggtcag gacaggaacc gcggcattag tgcgtggcag ctaccaccta caacaactca 7080 gggcagccgt agatgaagac ctcagggcca tagaacactc cattaccaaa cttgaagaat 7140 ctctaacctc cctgtccgaa gtagtactcc aaaatcgacg gggactagat ataatttttc 7200 taaaagaggg cgggctctgt gctgccctta aagagcagtg ttgcttttac gctgatcatt 7260 ccggagtagt taaagactct atggccaaac ttagaaaaag actagatgat agacagaaag 7320 aaagagaatc ccaacaaaac tggtttgaaa cttggtacaa ccaatccccc tggtttagta 7380 ctctcatctc cactatccta gggcccctga ttctgtttat gcttatttta actttcaggc 7440 cctgcatttt taaccgcttg cttgctctaa ttaaagacag attaaatata gtgcatgcta 7500 tggtcctgac tcagcggtac caggcagtca agactaacga agagactcaa gattgagcct 7560 ctaagtcaca aaaagaggag ggaa 7584 // ID LTR9_TS repbase; DNA; PRI; 339 BP. XX AC . XX DT 09-DEC-2009 (Rel. 15.09, Created) DT 09-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR9_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-339 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1266-1266 (2010). XX DR [1] (Consensus) XX CC >88% identical to consensus. 4bp tsd. 
CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 339 BP; 71 A; 84 C; 98 G; 86 T; 0 other; tgttgcggca gaactgactc ccgcagtcga gagacgagcg gcactcccgg aatgtgagga 60 gaacacaagt ttatttacgc cggcgggccc agggggctca tgcctaagtc ctggggcccc 120 gtctacttgt gagctttgcc ttatatcagg tttacatttc cacgtacgtg cagtttacat 180 ttttgtgtgc gtgcagtata gctcgggtgg tagcgggggc ttaagcaagc tatttacaga 240 agcggatatt gcgaggagaa aaaagagtag gcagggaccc ccttcctccc gaccattgtt 300 atggcccttt tggcttcctc cagatggctt gccagttca 339 // ID LTR17B_OG repbase; DNA; PRI; 455 BP. XX AC . XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR17B_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-455 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1591-1591 (2011). XX DR [1] (Consensus) XX CC >89% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. 
XX SQ Sequence 455 BP; 102 A; 130 C; 104 G; 119 T; 0 other; tgaggcaaga gagtaaaagc agagctaaag atctgtccta gtcactctgt acaggcatta 60 gccagacagt ctgtaacact cctacctcta ctgcagcaga tgtctctgtt atctgtatgc 120 ccttcccttg gtatgtgctg ggcctagata gtaaccttgt aacttcctta accttctgtc 180 gcctttgagg acaggtagcc cgccggtacc aagggcacgg gccaacctga ccttcccttg 240 agtcccaaga ctttcacgtc tgtcacgtct ctcacatcct tcacgacttt cacgtgcacc 300 caagccctaa aaatgtataa aagaaaggca agagattgag tcggtgggct cggtttttag 360 gacactagtc cgccgagtct cccggccggg aataaagccg cttcctcttc caactggtgc 420 ctggagtctt ctgtctgcgt cggtttcctg ctaca 455 // ID LTR10_Mim repbase; DNA; PRI; 306 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR10_Mim. XX NM LTR10_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-306 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2966-2966 (2009). XX DR [1] (Consensus) XX CC ~99% identical to consensus. 6bp tsd. CC Similarity to RLTR10F from mouse. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. 
XX SQ Sequence 306 BP; 63 A; 85 C; 81 G; 77 T; 0 other; tgtggggcgc ggtgttaacg ccattgcaag atggcgccga cttcctggtc acacccatac 60 cacaagactg ataaacaggg tgaaccgcgc atgtgtaggg gctttttcct gtctctcatc 120 aagtatgcta atgagggctc ttgcgtgagc caatcagatt ctgcctaatg tacttagtgc 180 ctatataagc ccgctccgag agctcctcgg ggtcttccgc tttagtcatc ttcagattcc 240 ccaataaagc gctgtcagaa gaactccggt tgccgcgtct tccttgctgg cgaggcgggc 300 gcgaca 306 // ID MacERVK2 repbase; DNA; PRI; 6921 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV2 Endogenous Retrovirus from Cercopithecidae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; LTR; KW MacERVK2. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-6921 RA Smit A.F.; RT "MacERVK2 - ERV2 Endogenous Retrovirus from Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 4% ORFs: gag 115-1899, pro <1848-2660, pol 2642-5173, env CC 5163-6921. POL 47% id/62% similar to MYSERV. XX FH Key Location/Qualifiers FT CDS 115..1899 FT /product="MacERVK2_1p" FT /note="gag." FT /translation="MGNSSSLASEYLRLLQGLLLSIGVDTKERTLRKLFAH FT VEQHCYWFQYQTKVQLNKRDWQQVVKTLRQAHQRGDVMSAPLWALCASITQ FT ALELLETDSEKGEGGDEESLKSPIAISPPLDNPVTDGEGPLHNDVSQERES FT EIQKELQPVVQLLQQILQLQLSPPHPSPPPTPVFTFPVAHPPPSAPKAEEE FT DSFPPPPPPVEMLPRSGTVFTVPASTAKAVIEVLDSQEDEDPLQLFPITRQ FT PFGPNDQFPQGGVNVQYNVLQFKFLKEMKAAVANYGPQSPFVMGLLDSFSS FT ENLFLPLDWETLGKAVLDRSQWLQLRSWWLEDAKEQARRNAARNPPGPTEE FT QLTGTGQFATVAAQSGLDDVALSQVKGLFLKAWCKVEPSGKTALSFVKILQ FT GANEPYPDFVARLQDAVMKTVGNGAAGKILITTLAFENANQECQRLLRPLK FT AAGNLQIEDFIRACAGVGGAAYNAQLFAGALSKALMGKKGVCFQCGKTGHF FT KKECRKLSNKNDSPKIGSKRLPTEPCRRCGKGRHWTNQCHSKVDKYGNPLT FT SGSGNLQRGPSAWGPSNNTSPLHLQFPISSGVQHQQANQSMLTPNFH*" FT CDS 1848..2660 FT /product="MacERVK2_2p" FT /note="pro." 
FT /translation="GPASAGQSVNVNPQFSLTQLFSATSGSAAADVSILQA FT VELTPEMGVVKLPTGVFGPLPPHTVGLLIGRSSSILKGLQVHLGVIDSDYK FT GEIQIMAQAHKPISLKEGQRIAQLLLLPYFQFPSRRQERTGGFGSTGKHIF FT WETLVTQQKPLFPLEVEGQIFQGLVDTGADVSIIASDQWPLQWPKQPVAVS FT LTGLGSASEVYQSSQSLSCRGPDGQMAQVQFYIVPIALNLWGRDLLQQFGA FT FVSIPHVSNSAKNMMFRMGYNPLEKSLNT*" FT CDS 2642..5173 FT /product="MacERVK2_3p" FT /note="pol." FT /translation="KVFKHVATVQQPQALPLKWKTNIPVWVEQWPLTREKL FT EALENLVQEQLSLGHIEPSTSPWNSPVFVIKKKNGKWRLLTDLRAINACIQ FT PMGSLQPGLPNPSMIPQDWVLMVIDLKDCFFSIPLDQKDCERFAFSVPVFN FT NSQPLSRYQWKVLPQGMLNSPTLCQEFVHRALDPVRKRHPSVLLYHYMDDI FT LLAAPTREKQQDAFASLQTQLSFYSLNIAPEKIQVDFPIQYLGYTLGARNI FT RPQKIQIRRDHLKTLNDFQKLLGDINWMRPVLGIPTYQLQHLFSILEGDSH FT LDSSRHLTPLALQELQLVEEQLQKAHLHYILPDVPLSLCLFHTLHSPTGMI FT HQTNRPLEWIFLSNKTSKRLTTYIDRLAELIIKGRHRCRQLWGGDPSLIIS FT QFTHSQITYLLATSDSWQVACASFVGQFSTQYPKSPIFSFFRQHSVLPFSP FT ISVFPVQGPTFFTDASSNGKAGYWSVHQSRVTVYPYPSVQQGELFAILMVL FT LDFPTTSCNIVSDSQYAVYVTSYISQATLPLCATSSLQKLFSLLSSTLSQR FT RAPLFITHLRSHSQLPGPLVHGNTQIDSLLIGHLQAAEQEHALHHTNSSGL FT QRRFHLTRKQARSLIRACPSCAPLSIPSFHPGVNPRGTQVNQIWQMDVTHV FT PSFGRLKYVHHTIDTFSHFQWATPLPSEKADSVITHLLACFAIMGIPLILK FT TDNAPAYVSRKLQVFLQQYNIQHITGIPGNSQGQAIIERANLTLKTQLQKQ FT KGGNESPRNQILKVLFTLNFLNQWRQLQDSAAVTHLSSPPSVIVPADNLRV FT WYPNEEGKYVQGQVLKQGRGYALVLTGSGPEWFPTRRLKPC*" FT CDS 5163..6920 FT /product="MacERVK2_4p" FT /note="env." 
FT /translation="NPVEKKTGFSMHFFLCWMCLGGGLISLSEALYEYWTY FT IPFPPLYQGVAWGEAEIKVFTNDTTWMPSPYIDKDNIERETGIINNTYQFG FT VEGLPICMGNSPHCLRKSHEAWAVRYNHSHVAANTVIIVTRSFQYNHTVYS FT NETIPSSLPFCPIPEVSPNIEVLEWQQCRGTKPRILLEYNGMQITDWSVHG FT DFQTKFSHIPLKWHRVNKSIAANGNETLVWREGGLSPPMPHLRHSLQVQSH FT LWKLLAAGDAISTFVGNMSLNLSNPNNSFHIQLYRNTSRYIIACVRKPYLL FT LAGNLTWDNDTGIVNCTDTCTFLSCLNHSWWHNFTGDIYILRARKEIWLPV FT NLTRPWGASPLETYIWTSLQRTRRALGLIIASVMSLVAIASTAAVAGLALH FT QSIQNAEFVQQWHEQSHRLWLQQRNIDSQLAERLDNMEQALTWLGDQLTVL FT STRITLQCDWNSTQFCVTPVPFNISQNWTEIRKLLVGHKNISLDIQELTQN FT ISEAFRQQLHVLSGTATLQELGQRLAAFNPWTQMKTWISTISGSILLWALG FT LGLLLLVIRCSLRRLQKQKKTQEQIWATFLGLRQNKKGE" XX SQ Sequence 6921 BP; 1834 A; 1550 C; 1427 G; 2110 T; 0 other; agtggcgcag cgagcagggt ccgcacgggt gagtagggga gaacaagaag gggaaccccc 60 taggagcggg taaagcccgg atattgttaa ggggaacctt cttaagcttt cattatgggg 120 aattccagtt ccttggcctc agaatattta agattattac aagggctttt actttcaatt 180 ggagttgata ccaaagagag aactttacgg aaattgtttg ctcatgtgga acaacattgc 240 tattggtttc aatatcaaac taaagtgcaa ttaaataaac gagattggca acaggttgtt 300 aagacccttc gtcaagcgca tcagcgcggc gatgtaatgt cggcaccttt gtgggccctc 360 tgtgcctcta ttactcaggc tttggagctc ttagagacag actctgaaaa aggggaaggt 420 ggagatgagg aatctttgaa gtctccaata gccatttccc ctcctttaga taatcctgta 480 actgatgggg aaggtccttt gcataatgat gtgtctcagg aaagggaatc agaaattcaa 540 aaggagcttc agcccgttgt tcaattgtta caacaaattt tacaattgca gttatctccc 600 cctcaccctt ctcctccacc cactcctgtt tttacttttc ccgtggctca tccacccccg 660 tcagcaccca aggcggagga ggaggatagt tttccaccac caccaccgcc ggtggaaatg 720 ctgcctcgtt caggcacggt ttttactgtc cctgcttcta ctgctaaagc tgtaattgag 780 gtattggatt ctcaagagga tgaggatcct ctacaattgt tccccattac taggcaacct 840 tttggcccta atgatcaatt tcctcaagga ggtgtgaatg tgcagtataa tgtccttcaa 900 tttaagttcc ttaaagaaat gaaggcagct gttgctaatt atggccctca atctcctttt 960 gtcatggggc ttttagattc tttctcctct gagaatttgt ttttgcctct tgattgggag 1020 actttgggga aggctgtttt ggaccgctct cagtggctcc agttacgcag ttggtggctg 1080 gaagatgcca aagaacaggc 
ccgccgcaat gccgctagaa atcctccagg accaacagag 1140 gagcaattaa caggtactgg tcagtttgct accgtagcgg ctcaatctgg gctcgatgac 1200 gtagcattat cacaagttaa aggacttttt ttaaaagctt ggtgtaaagt agaaccatcg 1260 ggaaaaactg ctttgtcttt cgttaagatt ttgcagggtg ctaatgaacc ttatcctgat 1320 tttgttgctc ggctccaaga tgctgtaatg aagactgtgg gtaatggggc agcaggcaaa 1380 atcctgataa ccacattagc tttcgagaat gctaatcaag aatgtcagag gttgttgcgc 1440 cctctaaagg ctgctggaaa cttacaaata gaagatttta ttagagcctg tgccggagtg 1500 ggaggggctg cctataacgc acaattgttt gctggagctc tatctaaggc actcatggga 1560 aaaaaggggg tttgctttca atgtggaaag actggtcatt ttaaaaagga gtgtcgtaag 1620 ttatctaata agaatgactc ccccaaaata ggatctaaaa ggcttcccac tgagccctgt 1680 cgtcgttgcg gcaaagggcg acactggact aatcaatgtc actctaaagt ggacaaatac 1740 ggtaaccctc taacctccgg atcgggaaac ttgcagaggg gcccttcagc ttggggcccc 1800 agcaacaata caagccctct tcatttgcaa tttccaatca gctcgggggt ccagcatcag 1860 caggccaatc agtcaatgtt aacccccaat tttcactaac tcaacttttc tctgctactt 1920 ctggtagcgc cgctgctgat gtgtcaattt tgcaggccgt cgaactcacc ccagaaatgg 1980 gagttgttaa attgcctaca ggtgtttttg ggccattacc tccacatact gttggattgc 2040 ttataggacg tagtagtagt attctaaagg gtttacaagt tcatttgggc gttatagatt 2100 ctgattataa aggggaaatt caaattatgg cacaggctca taaacccata tcccttaaag 2160 aggggcaacg gattgcacaa ctccttttac taccctattt tcagtttcct tcacggaggc 2220 aagaaagaac cggaggtttt ggaagcacgg gtaaacatat tttttgggaa actttagtaa 2280 ctcaacaaaa acccttgttt cctctcgaag tagagggaca aatttttcag ggcttggttg 2340 atacgggagc tgacgtttcc attattgcct ccgaccaatg gcctttacaa tggcctaaac 2400 agcctgttgc tgtgtctctt actggattag gttccgcctc agaggtgtat caaagttctc 2460 aatccttgag ttgtcgaggg ccggatggtc aaatggccca agtccaattt tatattgttc 2520 ctatcgccct aaatttatgg gggcgagatt tattgcaaca atttggagct tttgtttcta 2580 ttcctcatgt ttctaattct gccaagaata tgatgttccg catgggctac aatccccttg 2640 aaaagtcttt aaacacgtag ccactgttca gcagcctcaa gcccttccct taaagtggaa 2700 aactaatatt cctgtatggg ttgaacagtg gcctttaaca cgtgagaagc ttgaggcctt 2760 ggaaaactta gttcaggaac aattatcctt 
aggtcatata gaacctagta cttctccatg 2820 gaactcccct gtttttgtga taaagaaaaa gaacggtaaa tggagattat taacagatct 2880 tcgtgctatt aatgcatgta ttcaacccat gggatcctta caacctggcc ttcctaatcc 2940 ttctatgatt cctcaagatt gggtgctcat ggttattgat ttaaaggatt gttttttttc 3000 tattcctttg gatcaaaagg attgtgagcg ttttgccttc tctgtccctg tttttaataa 3060 ttctcagcca ctttctcgat atcaatggaa agttttacct caaggtatgt taaacagccc 3120 taccttgtgt caagaatttg tccatcgggc cttggatcct gttcgaaagc gtcatccttc 3180 tgttctttta tatcattata tggatgacat tcttcttgct gcacccacca gagaaaagca 3240 gcaagatgct tttgcatcat tacaaactca gctttctttc tactcactta atattgcccc 3300 agaaaaaatt caagtggatt ttcctattca atatttaggt tataccttgg gagcacgaaa 3360 tattcgaccc caaaaaattc aaattcgacg tgatcattta aaaacattaa atgactttca 3420 aaagctttta ggagacatta attggatgcg tcctgtttta ggaataccaa catatcagtt 3480 acaacatctt ttttctattc tagaaggaga cagtcacttg gacagctcac gtcatttaac 3540 tcctttggct ttacaggaac tccagcttgt agaagagcaa cttcaaaagg cccatttaca 3600 ttatatcctt cccgacgttc ccctttccct ttgtttattt catactttac attctcccac 3660 aggaatgatt catcaaacca atcggccgct agaatggatc tttttgtcca ataaaacatc 3720 taaacgcttg actacttata tcgaccgttt agctgaactg atcatcaaag gacggcatcg 3780 ctgtcgacaa ctttggggag gagatccttc cttaattatt tctcaattca ctcatagtca 3840 gatcacttat cttcttgcaa cttctgattc ttggcaagtt gcttgcgctt catttgttgg 3900 acaattttct acccaatatc ctaagtctcc gatcttttca ttttttaggc aacattctgt 3960 gttgcctttc tcaccaattt ctgtatttcc tgttcaaggt cctacctttt ttactgatgc 4020 tagtagcaat ggtaaggctg gatattggag tgttcatcaa tctcgagtta cagtttatcc 4080 ttatccctcg gtgcaacagg gagaattgtt tgctattctc atggtcttac ttgatttccc 4140 tactactagt tgtaacattg ttagtgattc tcaatacgcc gtttatgtta cttcttatat 4200 ttctcaggcc actttacctc tttgtgctac ttcttctctt caaaaactct tttctttgtt 4260 atcctctact ttgagccaac gtcgagctcc tttattcatc actcatcttc gttctcattc 4320 tcaacttcct ggaccattag ttcatggtaa tactcagatt gattcgcttt taattggcca 4380 cctgcaggct gcagaacaag aacatgctct acatcacact aatagttctg gccttcaacg 4440 acggtttcat ttaactcgta aacaagctcg ttctcttatt 
cgagcttgtc cttcctgtgc 4500 tcctttgagc attccatcct tccatcctgg ggttaatccg cgaggtactc aagttaatca 4560 gatttggcaa atggatgtca cacatgtccc ttcttttgga cgattaaagt atgttcatca 4620 taccattgat accttttctc attttcaatg ggccactccg ttgccatctg agaaagctga 4680 ttctgttatt acacatctct tagcttgttt cgccatcatg ggcattcctc ttatacttaa 4740 aactgataat gcccctgcct atgtctctcg taaattacaa gttttcttac agcaatacaa 4800 tattcaacat atcaccggta tcccgggcaa cagtcaaggt caagccatca ttgagcgcgc 4860 aaatttaacc ttaaaaactc aactgcaaaa acaaaaaggg ggaaatgagt cccccagaaa 4920 ccagattctg aaggttcttt ttaccttaaa ttttttgaat caatggagac aattgcaaga 4980 ctccgctgcc gtaacccatc tttcctcacc tccctcggtg atagtccccg ctgacaattt 5040 aagggtttgg tatcctaatg aagaaggtaa gtatgttcaa gggcaagttc taaaacaggg 5100 acggggctat gctcttgtcc ttacagggag cggacctgag tggtttccta caagacgatt 5160 gaaaccctgt tgaaaagaaa actggattca gcatgcattt cttcctttgt tggatgtgcc 5220 tcggcggggg acttatttcc cttagtgagg ctttgtatga atattggact tatattccct 5280 ttccgccctt gtatcaggga gtcgcctggg gggaggcgga aattaaggtt ttcaccaatg 5340 acactacttg gatgccgtcc ccctatatag acaaggacaa catagaacgg gaaacgggaa 5400 tcataaacaa cacataccaa tttggagtgg aaggacttcc aatatgcatg ggaaatagtc 5460 ctcattgtct ccgaaaatcg catgaagcct gggctgttcg ctataatcat agtcatgtgg 5520 ctgctaatac tgttattatt gtaactagaa gctttcagta taatcatact gtatatagta 5580 atgagactat tcctagttct ttaccttttt gtcctatccc agaggtttcc cctaatattg 5640 aggtgctgga gtggcaacaa tgtcggggaa ctaaaccccg gattttatta gaatacaatg 5700 ggatgcaaat aactgattgg agtgtgcacg gtgattttca aacaaaattt agtcacatac 5760 ctttgaaatg gcaccgggta aataaaagta ttgccgctaa cggtaatgaa accttggtat 5820 ggcgtgaagg aggcctgtcg ccacctatgc ctcatcttag acactcttta caagtacaga 5880 gtcatctttg gaagttgtta gctgcaggcg atgccattag cacttttgtg gggaatatgt 5940 cccttaatct ttctaacccg aataactcct ttcatattca gttatatcgg aatacctccc 6000 gatatattat tgcttgtgtc cgtaagccct acctattatt agcggggaat ttgacatggg 6060 ataatgatac ggggattgtt aattgtacag atacgtgtac atttttgtct tgcctcaatc 6120 attcctggtg gcacaacttt actggggata tttatatcct cagggctcgt 
aaggaaattt 6180 ggctacctgt taaccttacg cgaccgtggg gggcatcacc tcttgagact tatatttgga 6240 cttctctcca gcgaacccgc cgtgcccttg gacttataat tgcctcggtg atgagccttg 6300 tcgccatcgc ctccactgcg gccgtagcag ggctcgctct tcatcaaagc atacaaaatg 6360 ctgaatttgt gcagcaatgg cacgaacaat ctcaccggtt atggcttcaa caacgaaaca 6420 ttgattctca acttgctgaa agattggaca acatggagca agcattgaca tggcttggag 6480 atcagctcac tgtcctcagt actcggataa cattacaatg tgattggaac tccactcaat 6540 tttgtgtaac gcctgtgcct tttaatatct ctcagaattg gaccgaaatc cgaaaattat 6600 tagtaggcca taaaaatatt agtttggaca tccaagagct cactcagaac atttccgaag 6660 catttcgaca acaattacat gtgttgtccg gaaccgcaac cctgcaggag ctgggtcaac 6720 gccttgctgc ctttaaccca tggacccaga tgaaaacatg gatttctaca atctctggaa 6780 gtatattgct ttgggcactc ggattgggtc tacttctctt ggtgattcga tgctccctcc 6840 gtcgcctcca aaaacaaaag aaaacacagg aacaaatatg ggctaccttt ttaggattga 6900 ggcaaaacaa aaagggggaa a 6921 // ID LTR8C_Mim repbase; DNA; PRI; 604 BP. XX AC . XX DT 09-NOV-2009 (Rel. 14.11, Created) DT 09-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR8C_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-604 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2960-2960 (2009). XX DR [1] (Consensus) XX CC ~88% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 604 BP; 167 A; 166 C; 115 G; 156 T; 0 other; tgaaaccgcc cttcacaaat caataacggg gtgcaaggcg agaactgtgg taagggccta 60 gctaaaaata atgataataa ttagtcactg tccccctttg cttctctgta attgccactc 120 cggagccacg cagctggtgg tcataaagat tcttaacttc tccccatagc tggtgtccat 180 aaagattctt aactttctcc atagataaca tcaccttttt gaaacctaaa ggtagttttt 240 aagatatctt ccaggccccg cattccagtg gattggttga cccacccaga ccagcggccc 300 ataccaagaa accgactcaa ctggaattgt gaccccaggg actgaagcaa cacaagaaga 360 tgatttctac acccctataa ttttgcccca attaacaaac ctaattgcct aatcccttgc 420 ctgccaaact atccttaaaa accctgtctc ccaaattctc ggggaggcgg atttgagaaa 480 tttctcccat ctccccgctt agcgctatgc ggaataaaac tctttctccg ttgcaaccct 540 gactgactca gcatattggc tcttctctgg gcagcgggca aatgaacctg gttgggtggc 600 aaca 604 // ID ALRY-MAJOR_PT repbase; DNA; PRI; 4896 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE Major repeat unit of chimpanzee alpha repetitive DNA from the Y DE chromosome centromere - a consensus. XX KW SAT; Satellite; Simple Repeat; ALRY-MAJOR_PT; ALRY-MINOR_PT; KW Repetitive sequence; satellite DNA. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-4896 RA Hughes F.J., Skaletsky H. and Page C.D.; RT "ALRY-MAJOR_PT."; RL Direct Submission to Repbase Update (DEC-2004). XX DR [1] (Consensus) XX CC The major repeat unit found in the chimpanzee Y chromosome CC centromere CC consists of 28 copies of the minor 171 bp unit. 
XX SQ Sequence 4896 BP; 1451 A; 856 C; 1024 G; 1565 T; 0 other; gtggaggaaa aggaaatatc tccacctaaa aactagacag aagcattctg gcaaacttct 60 ttgtgatgtg tgcattcatc tcacagagtt gatccttact tttcattgag cagttttgaa 120 acactctttt tgtagaatct gcaggtggat atttggaggg ctttgaggcc tctggtggat 180 aacgaaatat cttcacataa taactagaca gaagaattct cacaaacttc tttgtaatgt 240 gagcattcgt ctcacagagt tgaactgttc ttttgactga gcagctttga aacagtattt 300 ttataaaatc tgcaagtgga catttggatc gctttgaggc ctctggtgga aaaggaaata 360 gcttcacaat aaaactggag agaagcgttc tgacaaactt ctttgtgatg tgtacattca 420 tctcacagag ttgaaccttg cttttgattg agcagttttg aaacacactt tttgtagaat 480 ttgaaagtgg acctttggat tgctttgagg cctatggtgg aaaaggaaat atcttcacat 540 aaagactaga cagaagcatt gtgacaaact cctttgggat gtgtgcattc aactcacaga 600 gttgaacctt tcttttgact gagcaacttt gaaacactct atctgtagaa tctcccagtg 660 gatatttgca gtggcttgag gcctctgacg gaaaaggaaa tatcttcaca taacaactag 720 acagaagcac tctgacaaac tattttgtga tgtgtgcatt cttctcacag agttgaacct 780 tacttttcat tgaggaattt ttaaatactc tttttgtaga atctgcaaga tgacatttgg 840 agcgcttcaa ggcctatggt ggataacgaa acgtcttcat gtaataacta ggcagaagca 900 ttatgagaaa ctccaatgtg atgtgtgctt tcgtatcaca cagagtttca cttttctttt 960 gattgagcag ctttgaaaca ttctttttgt agaatctgaa agtggatatt tggagctctt 1020 ggaggcctat ggtggataac gaaatgtctt cgtataataa taactagaga gaagcattct 1080 gagaagcttc tttgtgacgt gagaattcat ctcacagatt tgaaccttcg tttgattgag 1140 tagctttgaa gcactctttt tgtagaatct gcaaatggac atttggagcg gtttgaggcc 1200 tatagtggaa aaggaatcgt cttcacataa aaaggagaca gaagcattct cacaacattc 1260 tttgtgacat atgcattcat ctcacaaagc tgaaccttac ttttgattga gcagtttgga 1320 aacccccttt ttctactatc tgcaagtgga cctttggagt gctttgaggc ctatggtgga 1380 aaaggaaata tcttcactta agaagtagac agaaggattc tgataaattt ctttgtgatg 1440 tgtgcattca tctcacagag ttgaaccttc ctctggttga acagttttga aacactgttt 1500 tcgtagaatc tgcaagtgga cattttgagc gcttggaggg ctatggtgtt aaaggaaata 1560 tcttcccata agaactagac acaagcattc tgacaagctt ctttgtgatg tgtgcattcg 1620 tctcacagag ttgaaactat cttttgattg 
agcagctttg aaacactctt tctgtagaat 1680 ctgcaagtag gcatttggaa ggtttggggc ctgtggtgga aaaggaaata tcttcacaca 1740 aaaattagac agaagccctc ggacaacctt ctttgtgata tgtgcactta tcacacagag 1800 tgaaacctta gtgtttattg agcagttttg aaacaccctt tttgaatgat ctgcatgtgg 1860 acatttggag tgctttgagg tctattgtgg caaaggaaat atcttcacct aagcactaga 1920 cagaagcatt ctgagaattt tcttgtgata tgtgtattca tctcacagag ttgaacctta 1980 ctttggattg accagttttg aaacacgaaa aggaaatatc ttcacattaa aactagacag 2040 aaggattttg acaaactact ttgtgatgtg tacattcatc tcacagagtt gaaccttact 2100 ttccattgag cagttttgaa accttcctct ggggaatctg caagtggatt tttggagcac 2160 tttgaggcct ttgctggaaa ggaaatacat tcacaaaaaa cctagacaga agcattctca 2220 caaactactt tgggatgcat gcattcatct cacggggctg aaccttgctt tacattgagt 2280 agttttgaaa cactcttttg tagcatctgc aagtggacat ttcgagtgct ctgaggccta 2340 cggtggataa cgaaatatct tcatatcaca gctagacaga agctatctga gaagctttct 2400 atgtgatgtg tgcattcatc tcacagagtt gaaccttact tttcattgaa cagttttgaa 2460 acactctttt ggcagaatct acaagtggac ctttggaacg ctttgaggcc tacggtggaa 2520 aaggaaatat cttcacataa gaactagaca gaagcatact gacaaatttc tttttcgtgt 2580 gcgcattcat ctcacagagt tgaaagttaa ttttcattga gcagttatga aacatacttt 2640 ttgttgaatc tggatgacga catttggagc actttgaggc ctatcacgga aagggaaata 2700 tcttcatata aaaattagat ggaggcattc tgaaaaacat ctttgtgatg tgtgcattcc 2760 tctcacagag ttgaaccttt cttttgattg accagcttcg aaatgctctt ttagtagaac 2820 ctggaagtgg acatttggag ccttttgtgg cctaatgcgg aaaaggaaat atcttcacgt 2880 aaaaagtaga tagaagcatt ctgacaaatt actttatgct gtgtgcattc atctcacaga 2940 gttgaacatt tcttttcatt gaccagtttt gaaacactgt tttagtagaa tctggaagtg 3000 gacatttgga gcaccttgag gcctgtggtg gaaaaggaaa tcccttcaca taaaaactag 3060 acagaagcat tttgacaaac tcctttggat gtgtgcatgc atctcatggc gtggaatatt 3120 tctgttgatt gagcagcttt gaaacactct ttttctagaa tcttcaaggg gacatttgga 3180 gcaatttcag gcctatggtt caataggaaa tatcttcaca taaaaactag acagaagcat 3240 actgacaaac ttctctgtga tgtgtgcata catctcagag agttgaacat ttcttttgat 3300 agaccagctt tgaaacactc catttgtaga atgtgggagt 
ggacatttgg agtgctttga 3360 ggcttatggt agaaagggaa atatcttcat ataaaaacta gacagaagca ttctgaaaaa 3420 cttctttgtg atatgtgcat tcatgtcaaa gagttgaact tttcttttga ttgaacagtc 3480 atgaaactct ctgtagaatc tgcaaacgga cattggttgt gctttgaggc ctatggtgga 3540 aaagataata tcttcacata aatactagtc agaagcattc tgaaaactgt ctttgtgata 3600 cgtgcattca tctcacagag ttgaacctca cttttgattg agcagtttga aagacttttt 3660 ttgtactata ggaaagtgga taattggagt gctttgaggc cgatcatgga aaattaaata 3720 tcttcacata agaacaagac agaagcattc tgacaaattt ctttgtgata tgtgcattca 3780 tctcacagag ttgaacctta attttcattg agcagttttg aaacactctt tttgtagaat 3840 ctgcaagtag tcatttggag cgctttgagg cctttgttgg aaaacgaaat gtcttcatat 3900 aataactaga cagaagcatt ctgagaaact tatttgtgat atctgcactc atctcacaga 3960 gttgaatctt tcctttgatg gagcagcttt gaaacactct tttcgtagaa tctgctagtg 4020 gacctttgca gcgctttgag gcctatggtg gaaaaggcaa tatcttcaca taaaaactag 4080 acagaagcat tctgacaaac ttctttgtga tgtgtgaatt tacttcatag atttcaactt 4140 tcttgtgatt gagcagtttt gaaacactct ttttgtggaa tctgcaagtg gacatttgga 4200 acgctatgaa gcctaaggtg gaaaaggaaa tatcttcaca taaaaattag acggacgcat 4260 tttgacaaac ttcttcgaga tgtgtgcatt catctcacag agttgaacct ttcttttgat 4320 taagcaatct tgaaacactc tatttgtaga atctgcaagt ggatatttgt tgtgctttga 4380 ggcctacggt ggaaaaggaa atatcttcac ataaaaactt gacagaagca ctctgagagc 4440 cacctttggg atgtgtgcat ttatctctca gacttgaccc ttacttttga ttgagcatct 4500 ttgaaagacc acttttgtaa taaatgcaag tggacattcg aagtgctttg aggcctacgg 4560 tggaaaagaa aatatcttca catcagaagg agacagaagc attctgacca atttctttgt 4620 ggtgtgtgca ttaatctcac agagttaaac cttactttca attgagccat tttgaaacac 4680 ttttcgtaga atctgcaagt ggacatttga agcgctttga ggcctagggt cgaaaaagaa 4740 atatattcac ataaaaacta gatggaagcc ttctgacaaa cttctttctg atgtgtgcat 4800 tcatctcaca gagttgaacc tttcttttga ttgagcagct ttgaaaccct ctttttgtag 4860 aatttgcatg tgaacatttg aagcgctttg aggcct 4896 // ID L1C_Mim repbase; DNA; PRI; 5993 BP. XX AC . XX DT 07-JAN-2010 (Rel. 15.03, Created) DT 07-JAN-2010 (Rel. 
15.11, Last updated, Version 4) XX DE LINE element - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1C_Mim. XX NM L1C_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-5993 RA Jurka J.; RT "LINE1 elements from the mouse lemur."; RL Repbase Reports 10(3), 246-246 (2010). XX DR [1] (Consensus) XX CC >98% identical to consensus. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. XX FH Key Location/Qualifiers FT CDS 826..1827 FT /product="L1C_Mim_1p" FT /translation="MGRNQRKNSGNMKNQTENTPPRRSTSPLETDTDQNQA FT TNMTEEEFRMWIIRTLTQLQQQLNNQHQETTKSLQDMGQRFNKEIDTVKKX FT XTELLEMKNQLRELQNTVESLKNRVDQAEERISELEDNTLQLNKSVTEIEQ FT RNKRKDQSLQELWDYVKKPNVRVIGLAEGEEDNTQGLDKLFEDIIEENFPG FT LAQNLDIQVQEAQRTPGRFNANRKTSRHAVIRLTKVSTKEALLRAVRQKKQ FT VTYKGKPIRITSDFSNETLQARRDWGPILTLLKQNNAQPRILFPAKLSFIY FT EGEIKTFSDKQRLREFTKTRPALQEVLKTALRTEHHNNNPRI" FT CDS 1872..5744 FT /product="L1C_Mim_2p" FT /translation="MAQDRNHSNNIQPNRMISNLPYLSVLSINVNGLNSPL FT KRHRLAEWIRKYRPSICCLQETHLTCKDAHRLKIKGWRSIFQANRSQKKAG FT VAVLISDDLVFKPTKVVKDKEGHYIMVKGTVQQEEITILNIYAPNLGAPRF FT IKQTLLELSKWINSNSIIAGDFNTPLTARDRSSKQKINKEIMDLNKTLEQL FT GLTDIYRTFYPKSTEYTFFSSAHGTFSKIDHILGHKENLKKFKKIEIIPCT FT FSDHSGIKLEINPNRNSHFYTKTWKLNNLLLNDYFVNEEIKTEIKNFYEEN FT DNGETSYQLLWDTAKAVLRGKFISINAYNQKARRSQIDNLMKRLKELEKEE FT QTNPKPSRRSEINKIKSELNEIENRKAIQEINKTKSWFFEKINKIDTPLAK FT LTKSRKEKSLISSIRNKKGDITTDPKEIQDTIYEYYKNLYAHKLENVEEMD FT KFLETHSLPRLNQEEIDSLNRPISTAEIETAIKNLPKKKSPGPDGFTPEFY FT HTYKEELVPILQKLFHNIEKNGNLPDTFYEANITLIPKPGKDATKKENYRP FT 
ISLMNIDAKIFNKILANRIQTLIKKIIHHDQVGFIPGMQGWFNIRKSINAI FT HHINRSKNKDHMILSIDAEKAFDKIQHPFMIRTLKKIGIEGTYLKMIQAIY FT DRPIANIILNGERLKSFPLRTGTRQGCPLSPLLFNIVLEVLATAIRQENGI FT KGIQIGAEEIKLSLFADDMILYLENPKDSTKKLLELINEFSKVSGYKINTQ FT KSEAFIYANNNLIENQIKDSIPFTIATKKLKYLGIYLTKEVKDLYRENYET FT LRKEIAEDVNRWKSIPCSWIGRLNIIKMSILPKLIYRFNAIPIKIPSAFFT FT DIEKIILRFVWNQRRPRISRAILGNKNKMGGINMPDIKLYYKAVVIKTIWY FT WHKNRNIDQWNRCENPDIKPSSYSHLIFDKADKNIRWGKESLFNKWCWENW FT IATCRRLKQDPHLSPLTKTNSRWITDLNLRYETIRTLEEKVGNTLLDIGLG FT KEFMKKSPKAITAATKINKWDMIKLQSFCTAKEIVMKVNRQPTEWEKIFAS FT YASDKGLITRIYLELTKIRKKKSNNPIKKWAKDLNRNFSKEDRRMANKHMK FT KCSTSLIIREMQIKTTMRYHLTPVRMAFIKKSPNNKCWRGCGERGTLLHCW FT WDCKLVQPLWKAIWRYLKAIQVNLPFDPAIPLLGIYPNDPVTLYKKDTCTR FT MFIAAQFIIARLWKQPKCPSIQEWINKMWYMYTMEYYSALRNNGDIAHLIF FT SWLELEPILLSEVSQEWKNKHQIYSPANWY" XX SQ Sequence 5993 BP; 2334 A; 1317 C; 1165 G; 1174 T; 3 other; ggcaccgtgt tcccaggaaa acggtgccga ctcagaggct gagagacata gacccagctt 60 gggctccctg tgggtgaatt aggaccggaa accctctccc tggtgggaat acagtttgaa 120 ctctgggacc cagaggtcgg acctgcagac cagatcccct gcaccgaggg ctagcattgc 180 ccggggcaca gaagggttat acgtgaacag cctactgagg tctgtgtgcc tccaggggcg 240 gatcggcgtc ctagagggcg accctcctcc caggaggagg ccgtgcgccc aacccaggtg 300 gcgttcctgt gcagggaacc tccccgccgg catcacagtc cggggaggcc tggtggcttg 360 tggtctggcc tgctggcaga ggcccaggag tagctgcgga gttggggagg gtggaaagaa 420 gcgaggcctg ctgcagactg cgggtctcag acagccccac ccccacaccc agactttctg 480 gctgagcggg accattccag ccccgccctg acagctttcc ctggaagcag agaacagaac 540 tttgacccct gctaacggcc tgagggcagg cttacccaac ccagctccgc ccagaacgag 600 agctgataac aggactcaaa atcaacacca tagcctgttc ctccaagcaa acgccaccta 660 ctgacaggga cggcatcttg cacagccttt ccacggcacc cactgactca atatacaggg 720 agtggtccaa tttcacccac aggcaccacc taacgcctca gaaactaaac aaggtgtgtg 780 aatacccaaa caataaccta aggaaagaaa caacaactga tcgacatggg aagaaatcag 840 cgaaagaact caggaaatat gaagaaccaa acggaaaaca cacccccaag gaggagcacc 900 agccccctag aaacggacac cgaccaaaat caggcaacca atatgacaga agaggaattt 960 cgtatgtgga 
tcataagaac actcacccag ctgcaacaac aactcaataa ccaacaccaa 1020 gaaaccacaa aaagcctcca ggatatggga caaaggttca acaaagagat wgacacagtg 1080 aagaaaastk taaccgaact cctggagatg aagaatcaac tcagggaact acaaaataca 1140 gtggaaagtc tcaagaacag ggtagatcaa gcagaagaaa gaatctcaga gcttgaagat 1200 aacaccctcc aattaaataa atcagtcaca gaaatagagc agagaaacaa gagaaaagac 1260 caaagcctac aagagctgtg ggattatgtg aaaaaaccta acgtgagggt cataggttta 1320 gccgaagggg aggaagacaa cactcaaggg ctggacaagc tttttgaaga tataatagag 1380 gaaaatttcc caggccttgc tcaaaatctc gatatacaag ttcaagaagc ccagaggacc 1440 cctgggagat tcaacgcaaa caggaagacg tcacgtcatg cagtcatcag actgaccaaa 1500 gtatcaacta aagaggccct tctaagagct gtaagacaaa agaagcaagt gacatacaag 1560 ggaaagccaa ttcgaataac atcagacttc tctaatgaga ctttacaagc aaggagagac 1620 tggggcccca ttctcactct tttgaaacaa aacaatgccc agcctagaat attattccct 1680 gcaaaactaa gcttcatata tgaaggagaa ataaaaacat tctcagacaa gcaaaggctc 1740 agagaattca ccaagacaag accagcccta caagaagtac ttaaaacagc gttacgcacg 1800 gaacatcata ataataatcc acggatataa aaacaaccaa aacccaaaga tattaaaggc 1860 cagatattac aatggctcaa gacagaaatc atagcaacaa catccaaccc aacagaatga 1920 tcagtaatct accttaccta tcagttctct caataaatgt gaatggctta aactctccac 1980 tcaagagaca taggctggct gaatggataa gaaaatacag gccaagtata tgctgtcttc 2040 aggaaacaca tttaacctgc aaggatgcac atagactaaa aataaaaggg tggagatcaa 2100 tattccaagc aaatagaagc caaaagaagg ctggtgtggc agttctaatt tcagacgatt 2160 tagtttttaa accaacaaaa gtagtaaaag acaaagaggg tcattatata atggtgaagg 2220 gcacagtcca acaagaagag ataacaattt taaatatata tgcacccaac ttaggtgcac 2280 ccagattcat aaagcaaacc ttactggagc taagcaaatg gattaatagc aactccataa 2340 tcgccggaga tttcaacacc ccactgacgg cacgagacag atcctccaaa cagaaaatta 2400 ataaagaaat aatggactta aacaaaactc tagaacaatt gggtctgaca gacatctaca 2460 gaacattcta cccaaaatcc actgaatata cgttcttctc atcagctcac gggacattct 2520 ctaagattga ccatatccta ggacacaaag aaaatctcaa gaaatttaaa aaaatagaaa 2580 tcataccatg taccttctca gatcacagtg gaataaaact agaaatcaac cctaacagaa 2640 actcacattt ctacacaaaa 
acgtggaaat taaacaacct cctactaaat gattacttcg 2700 taaatgaaga aatcaagacg gaaataaaaa acttctatga agaaaacgac aatggagaga 2760 caagttatca actcctctgg gacacagcta aagcagttct gagaggaaag tttatctcca 2820 taaatgccta taaccaaaag gcaagaagat cacaaataga caatctaatg aaacgactca 2880 aagagctgga aaaagaagaa cagaccaacc ccaaacccag cagaagaagt gaaatcaaca 2940 agatcaaatc agaactaaac gaaattgaaa acaggaaagc tattcaggag attaataaaa 3000 caaaaagttg gttctttgaa aaaataaaca aaattgacac accattggct aagctaacga 3060 aaagcagaaa agagaaatct ctaataagct ccatcaggaa taaaaaagga gatatcacaa 3120 ctgatcccaa agagatacaa gatacaattt atgaatacta caaaaatctt tatgcacaca 3180 aactggaaaa tgtggaggaa atggacaaat ttctagaaac acacagcctc cctaggctca 3240 accaggaaga aatagattcc ctgaacagac caatctcaac agctgaaata gaaacagcaa 3300 ttaaaaatct ccctaaaaag aaaagtcccg gtccagatgg cttcacacct gaattttacc 3360 atacttacaa agaagaacta gtacctatct tgcagaaact attccacaac atcgagaaga 3420 acggaaacct ccccgacacc ttttatgaag cgaatattac tctgatacca aaaccaggaa 3480 aggatgcaac aaaaaaagaa aactacagac caatatccct aatgaatata gatgcaaaaa 3540 ttttcaacaa aatcttagct aaccgaatcc agacacttat caaaaaaata atccaccacg 3600 accaagtggg cttcatccca gggatgcagg gatggttcaa catacgtaaa tctataaatg 3660 caattcacca cataaacaga agcaaaaaca aagaccacat gattctttca atagatgcag 3720 aaaaagcttt tgacaaaatt caacaccctt tcatgatacg aacacttaag aaaataggca 3780 tagaagggac atacctaaaa atgatacaag ccatatatga cagacccata gccaacatca 3840 tactgaatgg ggaaagattg aaatcattcc cacttagaac tggaaccaga caaggctgcc 3900 cactatctcc acttctgttc aacatagtgc tggaagtctt ggctacagca atcagacagg 3960 aaaatggaat caaaggtatc caaatagggg cagaagagat caaactttca ctgtttgctg 4020 atgatatgat attgtatcta gaaaacccca aggattcaac caagaaactc ctggaactga 4080 tcaatgaatt tagtaaagtc tcaggataca aaatcaatac acagaaatca gaggcattca 4140 tatacgccaa caacaatcta attgagaacc aaatcaaaga ctcaattccc ttcacaatag 4200 caacaaagaa attaaagtac ctaggaatat atttaaccaa agaggtaaaa gacctctaca 4260 gggagaacta tgaaacactg aggaaggaaa tagcagagga tgtaaacaga tggaaatcca 4320 taccatgctc gtggatcggc agactcaata 
tcatcaaaat gtctatacta cccaaactga 4380 tctacagatt caatgcaata cctattaaaa tcccatcagc attcttcaca gatatagaaa 4440 aaataatttt acgcttcgta tggaaccaaa gaagaccccg aatatcaaga gcaattctag 4500 gcaacaaaaa caaaatggga ggcattaata tgccagatat caaactatac tacaaagctg 4560 tagtaattaa aacaatatgg tattggcaca aaaacaggaa tattgaccag tggaacagat 4620 gtgagaatcc tgatataaaa ccatcctcat atagccatct catctttgac aaagcagaca 4680 aaaacatacg ctggggaaaa gaatccctct tcaataaatg gtgctgggaa aactggatag 4740 ccacctgtag aaggctaaaa caggacccac acctttcacc tctcacaaaa accaactcac 4800 gctggataac agacttaaac ctaagatatg aaactattag aactctagag gaaaaagttg 4860 gaaacactct cctagacatc ggcctgggca aagagtttat gaagaagtcc ccaaaggcaa 4920 tcacagcagc aacaaaaata aataaatggg acatgatcaa actacaaagc ttctgcacag 4980 ccaaagaaat agtcatgaaa gtaaacagac aacctacaga atgggagaaa atttttgcat 5040 cctatgcatc cgataaggga ctgataacta gaatatactt agaactcacg aaaattagga 5100 agaaaaaatc aaataacccc attaaaaagt gggcaaagga cttgaacaga aatttttcta 5160 aagaagacag aagaatggcc aacaaacata tgaagaaatg ctcaacatct ctaatcatca 5220 gggaaatgca aatcaaaacc acaatgagat atcacttaac cccagtgaga atggccttta 5280 tcaaaaaatc tccaaacaat aaatgctggc gtggttgcgg agagagagga acactcctac 5340 actgctggtg ggactgcaaa ctagttcaac ctctgtggaa agcaatatgg agatacctta 5400 aagcgataca agtgaatcta ccatttgatc cagcaatccc attgctgggc atctacccaa 5460 atgatccagt gacactctac aaaaaagaca cctgcactcg aatgtttata gcagcacaat 5520 tcataattgc aaggctgtgg aaacagccca agtgcccatc aatccaagaa tggattaata 5580 aaatgtggta tatgtacacc atggagtact attcagctct aagaaacaat ggtgatatag 5640 cacatcttat attttcctgg ttagagctgg aacccatact actaagtgaa gtatcccaag 5700 aatggaaaaa caagcaccag atatattctc cagcaaactg gtattaactg agtagcacct 5760 aagtggacac ataggtgcta cagtaatagg gtattgggca ggtgggaggg gggagggggg 5820 cgggtatata catacatagt gagtgagatg tgcaccatct gggggatggt catgatggag 5880 actcagactt ttggggggag ggggggaaat gggcatttat tgaaacctta aaatctgtac 5940 ccccataata tgccaaaata aaaaaaataa ttaaaaaaaa aaaaaaaaaa aaa 5993 // ID LTR1A3_OG repbase; DNA; PRI; 627 BP. XX AC . 
XX DT 22-MAR-2011 (Rel. 16.05, Created) DT 22-MAR-2011 (Rel. 16.05, Last updated, Version -1) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1A3_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-627 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 11(5), 1580-1580 (2011). XX DR [1] (Consensus) XX CC ~91% identical to consensus. 4 bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 627 BP; 151 A; 168 C; 176 G; 132 T; 0 other; tgttgatacc ggacaggtgg cgccccagcc agctcagccg agcggggctg gaggacctgc 60 cgcctccgtc ttggcggtgt tggtggcggc agccggcaga ggaaccagca ggctggagca 120 ggctggagca ggctggagca taggcaagcg gggggagctt ccctccccca ccggaggctc 180 cagtagccaa aggaaccgga cacaagcagg ggaagacact ggcgattcct ccagtgcatc 240 cctgattggt ccattttcaa aacctgctcc tgattggtct gttttcagga cccgaaagct 300 cattggacaa ctgcccctat ataagcccct gagcagagag cccaagcgca gatcaccgca 360 gacctccgag ggagagagag cagaagagca agagctgtaa cacttgtatc ctgctctgca 420 aggaggtttg gctctttgta agagctgtga cacttgtata aaacctatct tactactgat 480 cctgctctgc aaggtttggc tctttgtgag agctgtaaca cctgttgaat aaaacctaac 540 ctactgttga tcctgtggtc cgtcgattca ttcatcgaat cgttgagacc aagagcctgg 600 aaatccagta tcaaaaatcc ggtatca 627 // ID LTR13_TS repbase; DNA; PRI; 1424 BP. XX AC . XX DT 11-DEC-2009 (Rel. 15.09, Created) DT 11-DEC-2009 (Rel. 15.09, Last updated, Version 2) XX DE Long terminal repeat of an ERV2-type endogenous retrovirus - DE consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR13_TS. 
XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-1424 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from tarsier."; RL Repbase Reports 10(9), 1270-1270 (2010). XX DR [1] (Consensus) XX CC ~99% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 1424 BP; 372 A; 291 C; 304 G; 457 T; 0 other; tgttgggagc cggccacgga ggcccactga gcgttaggga tcagggaggg ccgtggacct 60 cgagaccgct aggacgtgtt cgagagccgt agaccccatt ttcctccccc ttttgccatg 120 cagcacaaga acaacctaaa ggaactctgc tggcctaagc tgaagactct gtgactgtga 180 caggcatagc tgactctaga aacccattgt tcctgtttcc cataaactgc ggtttcggcc 240 taggtattag atataattca aattcaggtc acatttggga gccttttata cggtacaagc 300 ttctttataa gatttaaaag aatatttata gaattagtat atacttaggg agagctgttt 360 gcagcttttc aatggtgaag atgattttcg agagtggccc agcacagagt ttgggctggg 420 ccacattcct gagagcccag catgtacttt ggactgggct gcattctttt gggccggctt 480 tagctacagt tagtgtaagg tctcttttct acctattata gaaattgaat ttattttatt 540 catagaaggt attgtagaat aggtcatgct acgaatcatg ttagttcata cataatcaat 600 cataatattt ttaggtttaa taagaatata gttaacttta agaattttca tcgtctccac 660 tttctactat cgaagttttt gccttagtac atttacaatt tacatttagg ttttagaagc 720 tttgctgctg ctaatcaact ttatatatta gaattaagaa ttggaaaact ttcatagaaa 780 ttaggataga catataagac tttggaatag ccattttgtt aagagattgt agtctattgt 840 tccaaaaggt ggcccaccgt ggaccttgga ctgggccaca tcctgtaggg cccagtgtgg 900 tctttggact gggccacatt ctttaggaaa ttggcttaag ctataattaa gagcttccgt 960 agcattgtat aagtggtaga gtttaattta gtcgtcatgg gtattgagtg agctttagaa 1020 gttattgatt atcagaacat cattagacta gtaaggattg ttacactgct tttgaagaac 1080 attaactagc aaagggaaca atgagacaat cttggggttt gttcctcctc acacctaagc 1140 atctggtggg tgggggatgc cccccccgca 
cctggctttg tgttcacgtg aaagacttgt 1200 gagactgcct gtgtaaaatt tggggataaa tacctgctgt aactttcaat aaacggcttc 1260 tgctcatttc actgaacagt tgcccacttt ctttctttct ctttcttcct ctttatatct 1320 atccttccat ctttctttct tttaatatct atccctacag attcgccgac gccaccacat 1380 cttccaaggg actccccaaa acttgcaggg gctggccccc cgca 1424 // ID LTR14C1a_Mim repbase; DNA; PRI; 517 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 15.11, Last updated, Version 3) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14C1a_Mim. XX NM LTR14C1a_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-517 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2972-2972 (2009). XX DR [1] (Consensus) XX CC ~92% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. CC We thank the Broad Institute Genome Sequencing Platform and CC Genome Sequencing and Analysis Program, Federica Di Palma, and CC Kerstin Lindblad-Toh for making the data for Microcebus murinus CC available. 
XX SQ Sequence 517 BP; 153 A; 118 C; 122 G; 124 T; 0 other; tgtgggatac aagctaaccg ctaccttggc ttctgtacct tagcttttgt aattcgcttg 60 cttgcttgcc acttagcctg actgaagcca tgacaggctt ctacaaagta aaagaaaaaa 120 aaaaaaaaga gagaacaaac aaggccccag ggaggaaacc ggtaaggcac tacctgataa 180 ggtagtgcaa agtccctggg acctagccaa ccaatcaata aatcaataca cggccagcat 240 gatcagtgcg tgaacagctt gagtgggatg tgtgggtgct gggtgggctc cggataccac 300 ttgtaaccag taacctgagt tgcacaacaa ctaaaagtat aaaacctgtg ctaaaacctt 360 gccaagggtc cttgtctaaa gagaccactg agctggtgct ctgggacctg gaccctagct 420 cgagctagct taaataaacc tccatttgtt gcttacgttg gtgtgagctt gttactctga 480 cattctgttc tgggatacag aattcttgga cacaaca 517 // ID MacERV6_LTR1a repbase; DNA; PRI; 459 BP. XX AC . XX DT 24-SEP-2007 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Cercopithecidae. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MacERV6_LTR1a. XX OS Cercopithecidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-459 RA Smit A.F.; RT "MacERV6_LTR1a - ERV1 Endogenous Retrovirus from RT Cercopithecidae."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC despite 5bp TSD 4%. XX SQ Sequence 459 BP; 124 A; 135 C; 107 G; 93 T; 0 other; tgtttgggtg aaggaaaagg gacaagatgg aggaaggtga acaagatgga gcagtccctg 60 ttgtttccgg gttcttcatc acagattttc ccgcgcccgg gaaaagaacc aaatcaactg 120 agcatgcgca gaatgacgtc aagccgggga caccgaaatt caagaagcca ctcctacaca 180 cacgccccga actcctcccc ttccagctcc caggcataaa agtccgccgc cggcaggagc 240 cggcgtgact tcttcggccc cccgcattcg tggaccggag aacatcaccc gagagcgccg 300 gcgcgacttc cctggccccc cacacctgag gaccagagaa cctcgcccga gagtgtgtgc 360 atatttgcaa taaaagactg ccactttctt acgtactttg gcctcatgtt taattactta 420 gctctcctaa attaagttac attaaattaa atcaaaaca 459 //