ID LTR2B_Mim repbase; DNA; PRI; 663 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR2B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-663 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2949-2949 (2009). XX DR [1] (Consensus) XX CC ~93% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 663 BP; 156 A; 153 C; 179 G; 175 T; 0 other; tgttggggac tgacattata ctttcctggt ttaagaatta aatttagtga gtaatgtacg 60 tgctctgccc agagcgtttg agagtaatgt gttccggaca gggtccttgt gacaaacaag 120 gttgctgcct agaggcataa caaaggacct gagtttctgc aagtaagcaa gagctgccca 180 gccccacggc agtgggggtc tggaggccac acgataaaca cattgttcct aggattgctg 240 cttagcccca taagatggct ggttagtcaa tgacgggtaa gattcctcag ggaggaacaa 300 cctaagacag acacagccgc cgggggccag cctagaggaa ctggggacgg aaaatgcccc 360 ccgtggctgc cttgcccaac cttgctaatc tcggtctgtg atctatgcct ggcgcctaga 420 gcaaccacct ggaaaccttg agtcagggga catctgtgtc cttaaggcta cgtgttccgg 480 aatttatggc cattgctgta atgcctgggt ggtcacgtat aaggaactaa ctttgatttt 540 ttagagtata aaaataaact gaccagtgca ttgggcactc gagtcttgta ctgtctttgt 600 gtgtgtctgt gtctttcttt tgtgttctgt gttcatcccc cgtcctgcaa acgggccacg 660 gca 663 // ID LTR14C2_Mim repbase; DNA; PRI; 397 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14C2_Mim. 
XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-397 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2973-2973 (2009). XX DR [1] (Consensus) XX CC >94% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 397 BP; 93 A; 89 C; 110 G; 105 T; 0 other; tgtgaagcct caggctaatt gctttctttt agcttatgta tattgcttgt ttttaggaag 60 ttgggttgag cgggctgatc gcagtaaatt ttgggatgaa acaaaagggc aagcgcatgg 120 gcacaagggg ccaaccaatc aatgtaaagg gcaagcgcgt gggcacaagg ggccaaccaa 180 tcaatgtaaa ggtcaagcgt gtggacaggt tatgcaccta gggtataaag ggctccgtcc 240 cacagtgcgc ggggtctttg tcccaataga ggccgccgta tcggtgctct gggacttgga 300 ccctagctcg agctagtcaa taaaactcct tttgatgatt tcagcctcag tgaccctgtc 360 tctttgttct gtggtcctac ggtttcccgc tctaaca 397 // ID LTR14C1_Mim repbase; DNA; PRI; 405 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14C1_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-405 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2971-2971 (2009). XX DR [1] (Consensus) XX CC >94% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 405 BP; 97 A; 106 C; 115 G; 87 T; 0 other; tgtaaaacag gaatgagagc aggagtgact ccatgacagg ctgcacttcc tggagtaggc 60 ctaagtttcg gtttccccga aacttcgcct actacaccct tccccgaagt gacgcgcgcc 120 gcttttgaag gagccaatca ggagccgaca cgatcagcca cttttaaagg aaccaatgga 180 ctaagggggg agggggggaa ggtgtaccca gcgtgtataa cttgcagaaa agtataaaag 240 cttgacttat accccagggc ggggtcctgg ttcgtaggga ggccactgcg ttggtgctct 300 gggacctgga ccctagctcg agctagccaa taaactcctt tgtgctttgc agcctcagcg 360 attctgcctc tctgttcctg gggccgaagg aaaccggctc taaca 405 // ID CERV1_LTR repbase; DNA; PRI; 409 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 17-AUG-2004 (Rel. 9.07, Last updated, Version 1) XX DE Long terminal repeat of chimpanzee endogenous retrovirus CERV1 - DE a consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW Long terminal repeat; Chimpanzee endogenous retrovirus; CERV1; KW CERV1_LTR. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-409 RA Skaletsky H., Hughes F.J. and Page C.D.; RT "Consensus sequence of an endogenous retrovirus CERV1."; RL Repbase Reports 4(7), 190-190 (2004). XX DR [1] (Consensus) XX SQ Sequence 409 BP; 122 A; 112 C; 88 G; 87 T; 0 other; tgaaaagagc cgggcacatt cctcagcccc gggctcaaaa caaacaagcc cagtacaaac 60 acatcccatc ctcccatccc accacatatc accatatatc tcttaaactt cccccgggct 120 caaaacaaac aagcccagta caaacaccac caggaaagtc tccgataagg ggacagatga 180 ggggacagcc gttcaaagtt ttactgaaag agcgggaacc aaaagaattc ctttgttccc 240 ctgtaacttt caggctataa aaaagcaaac actcgcattg ttcagggccc tcttgtatgc 300 ggtggaatgg agggaccagg ttcgaacttg tagtaaagat ccttgccgct tggctttgac 360 tctggactct ggtggtcttc tttggggaac aaacggtctg ggcataaca 409 // ID LTR14B_Mim repbase; DNA; PRI; 439 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. 
XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-439 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2970-2970 (2009). XX DR [1] (Consensus) XX CC >96% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 439 BP; 89 A; 122 C; 123 G; 105 T; 0 other; tgtgaggggt tgagtcattg cacacctgca aggccccgcg cccccagtga agccggaagc 60 aacatccggg aaggctgacc gagacgtccg cttctggaaa ctgttgcttg cttgcctgcc 120 catggaaact gttgcttgct tgcttgcgcc aactttgcat tgttcgaacc cctgtatagt 180 ggggctacca gccaaccaat catgttaaag gtcaatgcat gatccacgcg tgatcagcca 240 atgcatagcg tatgcaccat agggataaaa ggcacgctgc aaccccggtc ggggtccttg 300 cctgcaagag tggccactgc gttggtgctc tggggcttgg accctggcta gccagaaaat 360 aaacctcctc ttgtgtgatt gcatcctcga tgtctctgct tttctgtccg gtggggctgt 420 ggaaggtcgg tccctaaca 439 // ID ERV1-Mim_LTR repbase; DNA; PRI; 421 BP. XX AC . XX DT 31-OCT-2009 (Rel. 14.11, Created) DT 31-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Endogenous retrovirus-like element: consensus of the long DE terminal repeat. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-Mim_LTR. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-421 RA Jurka J.; RT "Endogenous retroviral elements from the mouse lemur."; RL Repbase Reports 9(11), 2822-2822 (2009). XX DR [1] (Consensus) XX CC Top sequences are >99% identical to consensus. 4bp tsd. ORFs CC corrupted by mutations. 
CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 421 BP; 85 A; 129 C; 89 G; 118 T; 0 other; tgaaaggccc aaactagcct aaaggcccaa actagcctaa ccagcttttc gcttctgtaa 60 ctatgcttgc tcataggtaa ttaagacccc tacccctccc tttttcttta tcttttcccc 120 taagtgtcca ttgcaagttc tcggaattgg gtagtggtcg tgcaactggt tgtgcaagat 180 ggagctactg gaatttcccc accctaaact ccccggccac ctgcgtgtgg tctcgcccca 240 taaaaaccct aagcttgaga gcttcggggc ggcagcctcc ccatcttggt gatgctgttt 300 agcccctgcg cgcgctggaa ataaatcctc ttgctctctt gcatcaagcc tctggactct 360 gagtcttttt gggcggtcgt ctctctctcc caaacgggct gtacatttct acggcccaac 420 a 421 // ID LTR20B_Mim repbase; DNA; PRI; 518 BP. XX AC . XX DT 09-NOV-2009 (Rel. 14.11, Created) DT 09-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR20B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-518 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2978-2978 (2009). XX DR [1] (Consensus) XX CC ~90% identical to consensus. 4bp tsd. CC Similarity to LTR20_OG from bushbaby. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 518 BP; 134 A; 119 C; 124 G; 141 T; 0 other; tgtaacagag tagtcacccc tcctcccata gatatggaag ggaaatattg acagtgggga 60 atatgtaaca gagggaatgg cctgaaaaaa cggcaaaaat attttctgtc tctttaaaac 120 atcctccact cctttttgag aactaaaacc tgcatccctg cctcaggcca gtggttggga 180 ggggcagggt aagtcttttg ttatttgtgc tacaggagat ggctcagccc agacctggta 240 aagggaggtc ctgggtggag gtcacgggat tgtcttcagc ggagatggga cgatcagaat 300 catcaactgt agttgaacag caattgcctg aagctgagaa cccatccttt aaaagctctg 360 tatttctgct tattattagg acgatggcat ttcagacagg agtctccatc ctcctccttt 420 gccggcaaag taataaactt ctctttcctt ctcctcaaac cacttgttct cgttcttctg 480 atgcggcctc gaggacaagt gccgagcttt cggtaaca 518 // ID hAT-2N3_TS repbase; DNA; PRI; 626 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.1, Created) DT 14-DEC-2009 (Rel. 14.1, Last updated, Version 2) XX DE hAT-2N3_TS is a family of non-autonomous DNA elements found in DE Tarsius syrichta mobilized by hAT-2_TS. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-2_TS; hAT-2N3_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-626 RA Novick P.A., Smith J., Ray D. and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). 
XX DR [1] (Consensus) XX SQ Sequence 626 BP; 143 A; 169 C; 126 G; 184 T; 4 other; caggggtcct caaactacgg cccgcgggcc acatgcggcc cgccgaggac atttatccgg 60 cccaccgggt gtttttgccd ccgctgcctg tcctgcctag cagccgactc gtccgggccc 120 gcagtgcgca tgtgtggaat gtgcdtbnga gcactctccg actcccctcc ttctctctgt 180 ctctcgactc ctcctctcag taatctcagg aaccatgcac tcaaaatggc aaccatcttt 240 ggcagcactt atgtctgtga acagactttt tccagaatga aacatctgaa atcttccaac 300 cagatctaga ctaactgatg cacacttgca tcacttgtta cggactagca gtgacaaata 360 tggaacccgg acattgacca tctcattagc caaaagcagg cccatagttc ccattgaaat 420 actggtaagt ttgttgattt aactttactt gttcttcatt ttaaatattg tatttgttcc 480 cgttttgttt tttcacttca aaataagata tgtgcagtgt gcataggaat ttgttcatag 540 tttttttttt taaactatag tccgcccctc caacggtctg agggacagtg aactggcccc 600 ctgttttaaa agtttgagga cccctg 626 // ID TINE1 repbase; DNA; PRI; 86 BP. XX AC . XX DT 31-DEC-2009 (Rel. 15.03, Created) DT 31-DEC-2009 (Rel. 15.03, Last updated, Version 2) XX DE SINE element - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; TINE1. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-86 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(3), 517-517 (2010). XX DR [1] (Consensus) XX CC >95% identical to consensus. CC This sequence was derived from sequence data generated by the CC Broad Institute Mammalian Genome Project. XX SQ Sequence 86 BP; 29 A; 25 C; 12 G; 20 T; 0 other; ggcaaccccc tcgggtcccc tcccactgtg ggagcttttc tgtcctctca ataaatcctg 60 ctttctaaaa aaaaaaaaaa aaaaaa 86 // ID L1-3_TS repbase; DNA; PRI; 7127 BP. XX AC . XX DT 09-APR-2010 (Rel. 15.05, Created) DT 09-APR-2010 (Rel. 15.05, Last updated, Version 3) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-3_TS. 
XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-7127 RA Bao W. and Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(5), 769-769 (2010). XX DR [1] (Consensus) XX CC ~91% identical to consensus. CC This sequence was derived from sequence data generated by the CC Broad Institute Mammalian Genome Project. XX FH Key Location/Qualifiers FT CDS 2008..3036 FT /product="L1-3_TS_1p" FT /translation="ITELKRKTRDLSQMRKNQRKTSGNFKCTKDKTPEKSD FT LGAQTMEVTNDWPQNTTGIREWMERVNKTQERQDATLKELVTEILAIKNFI FT QEINNKMTSMESRINLAEERISELEDQNMELTRSVKNIERRLRKKEQSLQE FT MCDYIKRPNLCLIGIPEEEREMENNLEQVFQEVIQENFPHLTRDVTSQAQE FT IQRTPTRHQMRRPTPRHIVICLHKVGTKEKILKAAREKGQTTYWGKPIRIA FT ADLSAETLQARRDWSPIFKVLKDKQFQPRISYPAKLSFISDGELKSFPDIQ FT SLRDYAASRPALQETLKKVLSTEERKKRMTTHFPREQQSTESTENTAQQET FT " FT CDS join(3406..5667,5671..6933) FT /product="L1-3_TS_2p" FT /translation="MIKGSIHQQEISILNIYAPNTGAPAFIKQLLSKLKKD FT IDSNTIIAGDLNTPLTALDRSSRQKINKEIQNLNLTLDQMDLIDTYRLFHP FT TTTEYTFYSSPHGTYSKIDHILGHKSSINKFHKVEILPCTFSDHSGIKINI FT NTNNISPKPTKTWTLNSMMLNNYWVNTEIKAEIKRFLETNENEETSYQNLW FT DAMKTVLRGEFISLRAHIKKMERSQVESLTNHLRELEREDHQNPNFSRRIQ FT ITKIRAQIWDIEDKNIIENINKTKSWFFERINKIDGPLARMTKKKREKAQI FT NTIRNTKDQITTDPEEIQKIIRDYYVHLYGNKLDNLNEMEDFLTSHNLPRL FT KQEDIETLNRPITTQEIDSVIRKLPTKKSPGLDGFPAEFYKTYKEELIPIL FT LKVFQAIEKDGILPKSFYEANITLIPKLGKDPTKKENYRPISLMNIDAKIL FT NKILANRIQKHISKIIHHDQVGFIPRMQGWFNIRKTINVIKYINRCKNKNH FT MIISLDAEKAFDKIQHPFLIKTLKHLGIEGTYLKIVSAIYDKPTANILLNG FT QKLETFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIRGIQIGKEEVK FT LSLFADDMILYLENPRESVKNLLALIKDFGKVSGYKINVQKTVAFLYTNNK FT QAETQIKSTIPFTIATQKMKYLGIFLTREVKDLYNENYKTLLKEIQEDTNK FT WKNIPCSWIGRINIVKMSILPKAIYRFNAIPIKLPTSFFSDLEMTIQKFIW FT KHKPRIAKTILSKRNKAGGITLPDFKLYYKATVIKTAWYWYKNRHIDQWNR FT IEIPEAKPQFLNQLIFDKASNNNHWGEENLFSKWCWENWLTTCRRLKQDPY FT 
LSPYTKINSKWIKDLNVKPQTIRILENAGDTLMEIGTGNQFLFRTLKAHTL FT RNKIDKWDLIKLTSFCKAKETIKRAGRQPTEWEKIFANCISDKGLTSRIYK FT ELKRAKRKKTNSPIKKWAKDMNRHFSKEDIRAANRHMKKCSASLIIREMQI FT KTTLRYHLTPVRMAIINKTNNNRCWRGCGEKGTLLHCWWECKLVQPLWKTV FT WRFLKELEIDLPYDPAIPLLGIYPEELKSFYKKDTCTRMFVAALFTIAKTW FT NQPCCPSKLDWIKKMWYIYTMEYYAAIKKDKHMDFAATWMDLESIILSDLS FT QKQRTEYHMFSLISGP" XX SQ Sequence 7127 BP; 2574 A; 1584 C; 1433 G; 1536 T; 0 other; gaggtcggga agatggcggc cgagccagtc ggactatgcg agtcccggtg tgagtgagtg 60 agtgtatgga aaattttgtg tacgtgctgg tgagcgagtg ggcatgtgag tggctgactg 120 agtgagggct agggagctag tggccagggc tctcctggtg cagcggctgg tgccacagct 180 gctgggcttg gaccctagcg attcccggga cgcagaagcc gagcgccatt ttctttttct 240 tttctttttt ttttaaatca acaaaccctc cctgggaaga aaatgctggg agagactgtc 300 taagtggggg ctcgtgagca agtggccagg cccctggtgc aacagctggt gctgcagctt 360 ctctgcttcg tccctggctc ttcccaggtc gcagaagccg agcgccattt ttttttttaa 420 acagactaaa gcggcgggga gaaactatcg ccccaggctc tccctcacca ttgcagcagg 480 agagagactg ccggctccag gctttgcctc accacttcag cacctggcag ggcactggca 540 gcaggggctg cggagggagt ccagaggcaa agtcccatgg aagaggacaa gagaggcacc 600 ccaccaccat cttgaaggcg gggcaggaaa gtgctccgac ttgctcacca cggagcaaca 660 gacccaatct ctgtgactgg cggatcaact actcagggat tgatcgtgac ccagagggaa 720 tatcctgaga gcacaaactt cggagggacc tgacgagtgc tgagttccag gcgccccacc 780 ccctaccctg atccagccgt gggacaaagg gtgtggccag aaaaggacga ggaagaactc 840 caaagagatc agtggtgagt ccggggctga cccaaacccc ctcctcccca gcaatcctct 900 gaacatcaga cccacacagc tgggccacga ctgagaggga caaacccagg cggagcctgg 960 acatcagtcg caataggaga actccattcc aggataggcc tggctggggc acccccgtga 1020 gagtacatac ctggggaaga gacaatcccg tacctcccca ccggaggaga tctccctgac 1080 catacgagag tgggagggga ggactgcagc cccacccacc tccactgccc ctcctccttc 1140 cctgcttcgg acactgagga gagagtgaaa gaggtggctc tgcaggagag acctgattag 1200 aggaacaatt acagagcctg ccacaagagg gcaggttttt ttgtttttta atttttaata 1260 atttttttaa atttttgact gtttgttagt tcatctattt ttaaaatatt tttcctttgt 1320 gcatgaatgt gtatgtgagg gtttgatctg 
catcccccca aaattttttt tcttttcttt 1380 ttctgtttct ttcttctttt atttgtcgtt ttctgtttat gtgtgtatgt gaatgattga 1440 gtgttcatct ttcaattttg ttgttgttgt tggtttgttt ttgctctttt gttcttctgg 1500 gtttttttta attttcattt tttatagttt aaggtgtgtg tcaatatatg tgtgagggtt 1560 cgacctgaat tcccctctcc ggtctttctg tttttcttct ctattcactt tgtgctaggc 1620 atgtgtgtgt gtgtttagtt ttcttggttt tttgttgttg tttgttttgt ttcactttgt 1680 tgttattcca gtttgttttg ggggtgtgtg tgtatgtctg tgtgtgtgtt ttgcattggt 1740 ttctgctttg catcatcata tagagtggtg gtggggagtg ggtctcaggg aacacatacc 1800 acaataagtg aacattgtaa tttgcaagta gtttctgaat cctagcgccc catcccctac 1860 tttattcgaa tttcaaaaac tacaagagac agcaaagctt cccatccacc atcctatttc 1920 aaaatagaac aaatccagag cccggatcag ggaagaatag aggactgagc aaacatcaga 1980 agggaaagcc tctcccactg acactgaata acagaactaa agagaaaaac aagagatctg 2040 tcacaaatga ggaaaaacca aaggaagacc tcaggcaatt tcaagtgtac caaagacaaa 2100 actcctgaaa aatcagacct tggtgcccaa acaatggaag tcaccaatga ctggccccag 2160 aatacgacag ggattaggga atggatggaa agagtgaata agactcaaga aagacaagat 2220 gcaacactga aagaacttgt aacagagatt ttggcaataa aaaatttcat ccaggaaata 2280 aataacaaga tgacaagcat ggaaagcaga attaatctag ctgaagaaag aatctcagaa 2340 cttgaagacc agaacatgga attaacccga tctgtaaaaa acatagaaag aagactcaga 2400 aagaaagaac aaagcctaca agagatgtgc gattatatca agaggccgaa cctatgcctg 2460 attggtatcc cagaagaaga aagggaaatg gagaacaact tggagcaagt attccaggag 2520 gtaatccaag aaaactttcc ccatctcacc agagatgtga ccagccaagc acaagagata 2580 cagagaaccc ccacaagaca tcaaatgaga agaccaaccc ctagacacat agtaatttgc 2640 ctacacaaag taggcacaaa agaaaaaatc ctaaaggcag caagagagaa aggtcagact 2700 acctactggg gaaaaccaat cagaatagca gcagacctat cggcagaaac actacaagca 2760 agaagggact ggagccctat attcaaagtc ctcaaagata aacaatttca accaagaatt 2820 tcctatccag ccaagctcag cttcatcagt gatggagaat taaaatcctt cccagacatc 2880 caatccctaa gagattatgc agcttccaga ccagctctac aggagacact taaaaaggta 2940 ttaagcacag aagaaagaaa aaaaagaatg accacacact tcccaagaga acagcagagc 3000 acagaatcaa cagagaatac agcgcaacaa gaaacttgaa 
aacacacaca tacatcaacc 3060 ccaaagccaa aagaaaacaa gcaaacaaag aaaaaacttt ataagaacct catgacaggg 3120 ataaacaatc acatttcaat aatcagcctg aatgtgaatg gactaaatgc accactgaaa 3180 agacacagaa tggcaaactg gataagaaac catgacccaa ttatttgctg catccaggag 3240 actcatctca ctacaaggga tgcacacaga ctcaaagtta aaggatggaa aatgagtttc 3300 caggcaaatg gatcacaaaa gaaggcagga gttgcgatct taatatcaga caaaacaacc 3360 tttaagctat caaaaattta aaaagatgca aaaggacact acataatgat aaaaggttca 3420 atccatcaac aagaaatatc catcctaaac atatatgcac ccaacacagg agcaccagca 3480 tttataaagc aactactaag taaactaaaa aaagatattg actctaacac tattatagca 3540 ggggacttga ataccccact gacagcccta gatagatcat cgaggcaaaa aatcaacaag 3600 gagatccaga acctaaactt gacactcgac caaatggact taatagatac ctacagatta 3660 ttccacccaa caaccacaga atatacattc tactcatcac cacatggaac atactccaag 3720 atcgatcaca tccttggcca taaatcaagc ataaacaaat tccataaggt tgaaatcttg 3780 ccatgcacat tctcagacca cagtggaata aaaataaata tcaacaccaa caacatttcc 3840 ccaaagccca caaagacatg gacactaaac agcatgatgc tgaacaacta ctgggtcaac 3900 actgaaatca aagcagaaat taaaagattc ctggaaacaa atgaaaatga agaaacatct 3960 taccaaaacc tctgggatgc catgaaaaca gttctaagag gggaatttat atctctacga 4020 gcacacatca agaaaatgga aagatcacaa gtggagagcc taacaaatca cctaagggag 4080 ctggaaagag aagaccacca aaaccccaac tttagcagaa gaatccaaat caccaaaata 4140 agagcccaaa tatgggacat agaagacaaa aatatcatag aaaacatcaa caaaacaaaa 4200 agctggttct ttgaaagaat taacaagatt gatgggcccc tagccagaat gaccaagaaa 4260 aagagagaaa aagcccaaat aaacacaatc agaaatacaa aagatcaaat cacaactgac 4320 cctgaagaaa tacaaaagat tatcagagat tactatgtac acctatatgg aaacaaactt 4380 gataacctaa atgaaatgga ggactttctg acatcacaca acctccccag gttgaaacaa 4440 gaagacattg agacactaaa tagaccaata acaacccagg aaattgactc tgtcatacga 4500 aaactaccta ccaaaaaaag ccctggactg gatggctttc cagcagaatt ctacaaaacg 4560 tacaaggagg agctgatacc aatcctattg aaagtattcc aggcaattga gaaagatgga 4620 attctcccca aatcatttta cgaagctaac atcacactga tacccaaact gggtaaagat 4680 ccaacaaaaa aagagaacta caggccaata tcccttatga acatagatgc 
aaaaatcctc 4740 aacaagattc tagcaaatcg gatccaaaaa cacatctcaa aaatcatcca ccatgaccaa 4800 gtaggcttca tccccaggat gcagggctgg ttcaacattc gcaagaccat aaatgtaatt 4860 aaatacatca acagatgtaa aaacaagaac cacatgatta tatcattaga tgcagaaaaa 4920 gcttttgata aaatccagca tcccttcttg ataaaaaccc tcaaacacct aggtatagaa 4980 ggaacatacc tcaaaatagt aagtgccatc tacgataaac ccacagctaa catattgcta 5040 aatggacaga aactggaaac atttcccctg aaaactggaa caagacaagg ctgcccactc 5100 tcacccctct tgttcaacat tgtgttggaa gtcctagctc gggcaattag acaagagaag 5160 gaaatcaggg gtatccaaat aggaaaagag gaagtcaagt tatccctctt tgctgatgat 5220 atgatcctat accttgaaaa tccaagagaa tctgtcaaaa acctgcttgc actgataaag 5280 gactttggca aagtctcagg gtacaaaata aatgtgcaaa agacagttgc attcctatac 5340 accaacaaca agcaggcaga gacccaaatt aaaagcacaa tcccattcac aatagccaca 5400 caaaaaatga aatacctcgg catcttccta accagagaag tgaaagacct ttacaatgag 5460 aactacaaaa cactgctcaa agaaatccaa gaagacacaa acaaatggaa aaatattcca 5520 tgctcatgga taggaagaat caacattgtt aaaatgtcca tcctaccaaa ggcaatctac 5580 agattcaacg caatacccat taagttacca acatcattct tctcagacct ggaaatgaca 5640 atacagaaat tcatatggaa acataaatga ccacgaatag ccaaaacaat ccttagcaaa 5700 agaaacaaag caggaggtat cacacttcca gacttcaaac tttactataa ggctacagta 5760 atcaaaacag cctggtattg gtacaagaac aggcacatag accaatggaa caggatagag 5820 attccggaag caaagcctca atttctcaac caactcatct ttgacaaagc ctccaacaac 5880 aaccactggg gagaggagaa cctattcagt aaatggtgct gggaaaactg gctgaccaca 5940 tgcagaagat tgaaacagga cccctaccta tcaccataca caaaaattaa ctctaaatgg 6000 atcaaagacc taaacgtaaa acctcaaact ataagaatct tagaaaacgc aggagacacc 6060 cttatggaaa ttggaactgg caaccaattc ctattcagaa ccctaaaggc ccatacctta 6120 agaaataaga tagacaagtg ggacctcatc aaactaacga gcttctgcaa agcaaaagaa 6180 accatcaaga gagcagggag acagcccaca gaatgggaaa aaatatttgc caactgtata 6240 tctgacaaag gcctaacatc taggatctac aaggaactca aacgcgccaa aaggaaaaaa 6300 acaaacagcc ccattaaaaa gtgggcaaaa gacatgaata gacacttctc aaaagaagat 6360 atacgggcag ccaacagaca catgaaaaag tgctcagcct cactcatcat cagagaaatg 
6420 caaatcaaaa ccacattgag ataccaccta accccagtaa gaatggccat cattaataaa 6480 acaaacaaca acagatgctg gcgaggatgc ggagaaaaag gaacgcttct acactgctgg 6540 tgggaatgca aactagtgca acctctttgg aagacagtgt ggcgatttct gaaagaacta 6600 gaaattgacc ttccatatga cccagcaatt cccctattgg gaatataccc ggaggaactc 6660 aaatcattct acaaaaaaga tacctgcaca cgtatgtttg ttgcagctct attcacaata 6720 gcaaaaacat ggaaccaacc atgttgccca tcgaagctgg actggataaa aaaaatgtgg 6780 tacatataca caatggaata ttatgcagcc ataaagaagg acaaacatat ggacttcgca 6840 gcaacttgga tggatttgga gtcaatcata ctcagtgatc tatcacagaa acaaagaaca 6900 gagtaccata tgttctcact cataagcgga ccttgaacaa ttataatact ataagaaagg 6960 gattggcagt agtgggaaac tgtcagggga ggggggtggg ttggcagttg agggaaactg 7020 ctaggggagg gaggggcata cctcatcaac aagggtacct gcataatcaa catttgtata 7080 cctaaccctg aattgtaccc cacatcttta taataaaaaa agaaaaa 7127 // ID L1B_Mim repbase; DNA; PRI; 5342 BP. XX AC . XX DT 07-JAN-2010 (Rel. 15.03, Created) DT 07-JAN-2010 (Rel. 15.03, Last updated, Version 3) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-5342 RA Jurka J.; RT "Non-LTR retrotransposons from the mouse lemur."; RL Repbase Reports 10(3), 476-476 (2010). XX DR [1] (Consensus) XX CC >92% identical to consensus. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX FH Key Location/Qualifiers FT CDS 60..1061 FT /product="L1B_Mim_1p" FT /translation="MRKNQRKITGNMKDQREKSPPKENTRSPSXDTNLHDM FT IKLTEEXFRIWIAXKLNGIEEKIESQHRETTRTXQEMNERFSKEIETIQKN FT QTEILAMKETIKDLHNXVESLKNRMDHAEERISELEDYAYALNKSEEEREH FT RNKRQDQSLQEVWDYVKKPNIRLIGIPEGEEEHSQGLENLFHGILEENMPG FT LARNLDIQIQEAHRTPGRLNVKRQSPRHVVXRLTKVNVKEAILRAARRKQQ FT MTYKGKPIRLTADFSSETLQARRDWVPILNLLKQNKAQPRILYPAKLSFIY FT EGEIKSFSDKQSLKEFAKTRPXLQEVLRPAFLTEQGSRHSSK" FT CDS 1101..4976 FT /product="L1B_Mim_2p" FT /translation="MMAQGVRQSNKTPPNNMNGNLPPISILSINVNGLNCP FT LKRHRLAEWIKIHKPSICCLQETHLTHKDAXRLKVKGWKTIIQSNGSQKKA FT GVAILFADNISFKIAKVKKDKDGHXIMVKGKIQQEDLTILNIYAPNAGAPN FT YXKQTLSNLNTLLHNTAIVAGDFNTPLNDLDRSSKQKISKEIMDLNKALDQ FT KGLTDLYRAFHPNKLEFTFFSAAHGSYSKIDHILGRKSDLKKFKKIEIIPC FT IFSDHNGIKLQFNSYRNTQXLTKSWKLNNLLLKNYWVKEEIQREIENFFEQ FT NDNGDTSYQNLWDTAKAYLRGKLIAINAHIQKTESLDTDNLMNKLKELEKE FT EQTISNPNRRKEITKIKAELNEMENKRTILKINKTRSWFFEKINKIDGPLA FT RLTRTQRERTLINSIRNEKGEITTDTTEIQNIIFDYYKKLYAXKLQNEDEM FT DKFLDSYNLPKFTQEATEFLNRPISSSEIEAVIKNLPKRKSPGPDGYTSEF FT YQTYKDELIPILQKLFHTIEKDGILPNSFYEANITLIPKPGKDATKKENYR FT PISLMNTDAKILNKILANRIQQHIKKIIHHDQVGFIPGMQGWFNIRKSINA FT IHFINKIKNKDHMILSIDAEKAFDKVQHTFMIKTLNKIGIDGSYLKLIKSI FT YDKPTANIILNGEKLKSFPLRSGTRQGCPLSPLLFNIVLEVLAIAIRQERG FT IKGIQVGADEIKLSLFADDMILYLENPMDSSKRLLDLITEFGKVSGYKINI FT HKSEAFIYAKNHQAETQIKNAIPFTIAPKKIKYLGVYLTKDTKDLYKENYE FT TLKKEIAEDLNRWKNLPCSWIGRINXVKMSILPKVIYRINAIPIKIPSAFF FT TDLEKIILHFVWNQKKPRIAKAILSKKNKLGGISLPDFKLYYKAIIVKSAW FT YWHKNRSXDIWNRSEIPEMKPSVYGNLIFDKADKNIXWGKESLFNKWCWEN FT WLATCRRANQDPYLSPLTKIHSRWITDLNLRHETLRILEEDVGKTLSDIGL FT GKEFLRKTPKAITAASKINKWDLIKLKSFCTAKETISRANRQPTEWEKIFA FT LYTSDKGLITRIYLELKRINKKKSNNPIKKWATEMNRNFSKEDRIMACKHI FT KKCSTSLIIREMQIKTTMRYHLTPVRMAYIKKSQNNKCWRGCGETGTLLHC FT WWDCKLVQPLWKRIWRYLKQLEIEIPFDPAIALLGIYPKEHKTFYYKDICT FT RMFMAAQFTIARSWKQPKCPSIHEWIIKMWYMLTMEYYSILRNDGELAPFM FT LSWIKLKPVIQSEATQDXENGLHIYSPSNWY" XX SQ Sequence 5342 BP; 2099 A; 1094 C; 998 G; 1127 T; 24 other; ggccagaatt tagctcataa tccantccct 
gcacctccgg tcctcgataa agcatccaga 60 tgagaaagaa ccagcgaaag attacaggaa acatgaagga ccagagagaa aagtcacctc 120 caaaggaaaa tactcgttct ccatcanctg acaccaactt acatgatatg attaaactga 180 cggaggaaga nttccgaata tggattgcta naaaactnaa tggaattgaa gagaaaatag 240 aatcacaaca tagagaaacc acaagaacaa tncaggaaat gaatgaaaga ttctccaaag 300 aaattgagac tatccagaaa aaccaaacag aaattctggc aatgaaggaa acaatcaagg 360 atctccataa tncagtggaa agcctcaaga acaggatgga ccatgcagag gaaagaatct 420 cagagcttga agattatgcc tatgcgctaa acaaatcaga ggaagaaagg gaacacagaa 480 acaagagaca agaccaaagc ttacaggaag tgtgggatta tgttaaaaag ccgaatatca 540 gattgattgg gattccggaa ggggaagagg aacattcaca agggctggaa aacttatttc 600 acggaatact ggaggaaaat atgccgggcc tggccagaaa tcttgatatc caaatacaag 660 aagcacacag aactcctggg agactcaacg tgaaaaggca atcacctcgc cacgtggttn 720 ttaggctgac caaagtaaac gtgaaagaag caattctccg tgcagcgaga cgaaagcagc 780 aaatgaccta taaaggnaag cctatcagac taacagcaga cttctcatct gaaaccttac 840 aagccaggag ggattgggtg cctatcctta atcttctaaa acagaacaaa gcccaaccta 900 gaattcttta tccggcaaaa ttaagtttca tctatgaggg agaaataaag tccttctcag 960 acaagcaatc actgaaggaa tttgcaaaga ccagaccanc cctacaggaa gttctcagac 1020 ccgcatttct aaccgaacag ggcagtagac actcctcaaa gtgaaatcgt caaagaatta 1080 aagtttagat ctcgaactac atgatggctc aaggagtaag acaaagcaac aagactccac 1140 ccaacaatat gaatggtaat cttcctccaa tttcaatcct ctcaataaat gtaaatggct 1200 taaactgtcc tctgaagaga catagactgg cagagtggat aaaaatccac aagcctagca 1260 tctgctgtct acaggaaaca catctaaccc acaaagatgc ctnccggctg aaggtcaagg 1320 gatggaaaac tatcatccag tcaaacggaa gtcaaaagaa agctggggta gctatactat 1380 ttgcagataa cataagcttt aaaatagcaa aagtaaaaaa ggataaagat ggccattnta 1440 taatggtgaa agggaagatc caacaagaag atttaacaat tcttaatatc tatgcaccca 1500 atgcaggagc acccaattac ntaaagcaaa ccttgtctaa tctaaacacc ttgttacaca 1560 acactgccat agtagcaggg gacttcaaca ctccactgaa tgatctggat agatcctcca 1620 aacagaaaat aagcaaagaa ataatggacc tgaacaaagc ccttgatcaa aaaggtctga 1680 cagatctcta tagagcattc catccaaata aacttgaatt tacattcttc 
tcagcagccc 1740 atggatccta ctccaaaatt gatcacatcc taggccgcaa atcagatctc aaaaaattca 1800 agaaaataga aattatacct tgtatcttct ctgaccataa cggtataaaa ttacagttca 1860 attcctatag aaacactcaa cncctcacaa aatcatggaa actaaacaat ctattattga 1920 aaaattattg ggtaaaggaa gaaattcaga gggaaatcga gaatttcttc gaacaaaatg 1980 ataacggtga tacctcttac caaaacctgt gggatacagc aaaagcttac ctgagaggaa 2040 aactaatagc aattaacgct cacatccaaa aaacagaaag cttagatact gacaacctaa 2100 tgaataagct caaggaattg gaaaaagaag agcaaacaat ttccaatcct aatagaagaa 2160 aagaaataac gaagatcaaa gcagaactga atgaaatgga gaacaagaga actatactaa 2220 agatcaacaa aaccagaagc tggtttttcg aaaagataaa caaaatcgat ggccctcttg 2280 ctagattgac aaggacccaa agggaaagga ctctaataaa ctcaataaga aatgaaaaag 2340 gagagatcac aacagacacc acagaaatac aaaacattat atttgactac tataaaaaac 2400 tatatgcccn aaaactacag aacgaagatg aaatggacaa attcctggat tcatacaacc 2460 tccctaagtt cacccaggag gcaacagaat tcctgaacag accaatctca agctcagaaa 2520 ttgaagcagt aattaaaaac ctccccaaac ggaaaagtcc cgggccagat ggctacactt 2580 cagagttcta ccaaacatac aaagatgaac tcatacctat actacagaaa ctattccaca 2640 ccattgagaa ggatggtatc cttcctaact cattctacga agccaatatc accttgatac 2700 caaagccagg aaaggacgca acaaaaaaag aaaattacag accaatatcc ctcatgaata 2760 cagatgcaaa aatcctaaat aaaattttag cgaatagaat tcagcagcac atcaaaaaaa 2820 taattcacca tgaccaggtg ggctttattc cagggatgca aggntggttc aacatacgca 2880 agtctataaa tgcaattcac ttcataaata aaatcaagaa caaagaccat atgattctgt 2940 caatagatgc agaaaaagca tttgacaaag tccaacacac ctttatgata aaaactctta 3000 acaaaatagg catagacggc tcatacctta aacttatcaa atccatctat gacaaaccca 3060 ctgctaatat cattctaaat ggggaaaaat tgaaatcttt cccccttcga tccggaacta 3120 gacaaggatg cccactatct cctctcctat tcaacatagt gctcgaagtc ctagcnatag 3180 caatcaggca ggagaggggt attaagggca tccaagtggg ggcagatgaa atcaaactct 3240 cgctcttcgc cgatgatatg atattatacc tagaaaaccc catggactct tccaagagac 3300 tcctagactt gataaccgaa ttcggtaaag tttcaggtta taaaatcaat atacacaaat 3360 cagaagcatt catatatgcc aagaaccatc aagcagaaac tcaaatcaaa aacgcaatac 
3420 cctttactat agccccaaag aaaattaaat atctaggagt atacttaacg aaagatacga 3480 aagatttata caaggagaac tacgaaacac taaaaaaaga aattgcagaa gatttaaaca 3540 gatggaaaaa tctaccttgt tcatggattg gtagaatcaa tatngttaaa atgtcaatat 3600 tacctaaagt gatctacaga atcaatgcaa tccccatcaa aataccatca gcattcttta 3660 cagatctaga aaaaataatt cttcacttcg tatggaacca gaaaaaacct cgtatagcca 3720 aagcaatctt aagtaaaaag aacaaactgg gaggcatcag tcttcctgac ttcaagctgt 3780 actataaagc aataatagtt aaatcagcct ggtactggca caagaacaga agcatngata 3840 tctggaatag atctgagata ccagagatga aaccatcagt atacggtaac ctaatctttg 3900 ataaagcnga caaaaatata cantggggaa aagaatctct cttcaataaa tggtgctggg 3960 aaaactggtt agctacatgc agaagagcga atcaggatcc ctacctctca cctctcacaa 4020 aaattcactc aagatggata acagacttaa acctaaggca tgaaacctta agaatcctag 4080 aagaagatgt tgggaaaacc ctatcagaca ttggcctagg caaagaattt ttgaggaaga 4140 cccccaaggc aatcaccgca gcatcaaaaa taaacaaatg ggatctgatc aaattaaaaa 4200 gtttctgcac agccaaggaa accatcagta gagcaaatag acaacccaca gagtgggaga 4260 aaatatttgc tctctacacc tctgataaag gtctaataac aagaatctat ctagaactta 4320 aaagaattaa caagaaaaaa tcaaacaatc ccatcaagaa atgggcgacg gaaatgaaca 4380 gaaacttctc caaagaagac agaataatgg cctgcaaaca tataaaaaaa tgctcaacat 4440 ctctaatcat tagagaaatg caaatcaaaa ccacaatgag ataccaccta accccagtga 4500 gaatggccta tatcaagaaa tcccaaaaca acaaatgctg gcgaggatgc ggagagacag 4560 gaacactcct acactgctgg tgggactgca aattagtgca acctttgtgg aaaagaattt 4620 ggagatacct caaacagcta gaaatagaaa taccattcga cccagcaata gcattgttgg 4680 gcatctaccc aaaagagcat aagacattct attataaaga catctgcacc cgaatgttta 4740 tggcagcaca attcactatt gcacggtcat ggaaacaacc caagtgcccg tcaattcatg 4800 agtggataat taaaatgtgg tatatgctca caatggaata ttactcaatc ctaagaaatg 4860 acggtgagct agcaccgttt atgctatcct ggattaagct taagcccgtt atccaaagtg 4920 aggcgacaca agacntggaa aatgggctcc acatctactc gccatcaaat tggtactgac 4980 tgattaaaac tatggtnctc aaatggtggt aatgctcacc agggattcgg gggggggggg 5040 gagacccaca tcttagggat gtggcgagca ttttggaggg gaagggcata actctaaccc 5100 
ttcttaggga gaggcaaaga tatacaatgt aaccaaaatg tcaaaaaaaa aaaaaaaact 5160 ttttatcggg tggcgggcag gcgggagggg ggaggaggag aggggtgtat gcttncataa 5220 cgtgtgtgat gcgcaccacc gggggattgg acacatcggg gggagggggg ggcaggggca 5280 atatttgtaa ccctaacaat atttgtaccc ccataatatg atgaaataaa agaaaaaaaa 5340 aa 5342 // ID MSTA repbase; DNA; PRI; 426 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 18-APR-1997 (Rel. 6, Last updated, Version 2) XX DE Transposon-like human element long terminal repeat (MSTa DE subfamily) - a consensus. XX KW Repetitive sequence; MaLR family; MSTa subfamily; MstII; MER10; KW MSTA. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-426 RA Jaiswal K.A., Gonzalez J.F. and Nebert W.D.; RT "Human P1-450 sequence and correlation of mRNA with genetic RT differences in benzo[a]pyrene metabolism."; RL Nucleic Acids Res 13, 4503-4520 (1985). XX RN [2] RP 1-426 RA Lawrance K.S., Das K.H., Pan J. and Weissman M.S.; RT "The genomic organization and nucleotide sequence of the RT HLA-SB(DP) alpha gene."; RL Nucleic Acids Res 13, 7515-7528 (1985). XX RN [3] RP 1-426 RA Mermer B., Colb M. and Krontiris G.T.; RT "A family of short interspersed repeats is associated with RT tandemely Repetitive DNA in the human genome."; RL Proc. Natl. Acad. Sci. U.S.A 84, 3320-3324 (1987). XX RN [4] RP 1-426 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [5] RP 1-426 RA Smit F.A.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21(8), 1863-1872 (1993). XX DR [5] (Consensus) XX CC Replaces MER10 sequence. 
XX SQ Sequence 426 BP; 85 A; 111 C; 101 G; 124 T; 5 other; tgatatggtt tggatctgtg tccccaccca aatctcatgt tgaattgtaa tccccaatgt 60 tggaggtggg gcctggtggg aggtgattgg atcatggggg cggattyctc atgaatggtt 120 tagcaccatc cccttggtgc tgtyctcgtg atagtgagtg agttctcatg agatctggtc 180 gtttaaaagt gtgtggcacc tcccycctct ctctcttgct cctgctctcg ccatgtgacg 240 tgcctgctcc cccttcgcct tccgccatga ttgnaagctt cctgaggcyt ccccagaagc 300 cgagcagatg ccagcgccat gcttcctgta cagcctgcag aaccgtgagc caattaaacc 360 tcttttcttt ataaattacc cagtctcagg tatttcttta tagcaatgtg agaacagact 420 aataca 426 // ID hAT-2N3_TS repbase; DNA; PRI; 637 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.1, Created) DT 21-APR-2009 (Rel. 14.1, Last updated, Version 1) XX DE hAT-2N3_TS is a family of non-autonomous DNA elements found in DE Tarsius syrichta mobilized by hAT-2_TS. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-2_TS; hAT-2N3_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-637 RA Novick P.A., Smith J., Ray D. and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). 
XX DR [1] (Consensus) XX SQ Sequence 637 BP; 146 A; 169 C; 126 G; 186 T; 10 other; tarsushatn acaggggtcc tcaaactacg gcccgcgggc cacatgcggc ccgccgagga 60 catttatccg gcccaccggg tgtttttgcc dccgctgcct gtcctgccta gcagccgact 120 cgtccgggcc cgcagtgcgc atgtgtggaa tgtgcdtbng agcactctcc gactcccctc 180 cttctctctg tctctcgact cctcctctca gtaatctcag gaaccatgca ctcaaaatgg 240 caaccatctt tggcagcact tatgtctgtg aacagacttt ttccagaatg aaacatctga 300 aatcttccaa ccagatctag actaactgat gcacacttgc atcacttgtt acggactagc 360 agtgacaaat atggaacccg gacattgacc atctcattag ccaaaagcag gcccatagtt 420 cccattgaaa tactggtaag tttgttgatt taactttact tgttcttcat tttaaatatt 480 gtatttgttc ccgttttgtt ttttcacttc aaaataagat atgtgcagtg tgcataggaa 540 tttgttcata gttttttttt ttaaactata gtccgcccct ccaacggtct gagggacagt 600 gaactggccc cctgttttaa aagtttgagg acccctg 637 // ID L1-6_TS repbase; DNA; PRI; 6717 BP. XX AC . XX DT 03-MAY-2010 (Rel. 15.05, Created) DT 03-MAY-2010 (Rel. 15.05, Last updated, Version 1) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-6_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-6717 RA Bao W. and Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(5), 772-772 (2010). 
XX DR [1] (Consensus) XX SQ Sequence 6717 BP; 2530 A; 1570 C; 1299 G; 1318 T; 0 other; gatggcatcg gatccggtca gagtgtgaga gtctctgcat gggtgagtgc acaaaaaaac 60 taatctgcgt ctgtttgtgt gtgcaagtgg gtgagcgaat gtccaactga ggggagacac 120 gggatcaagc acccaggccc cctgcagaaa tggctacaat ttgccactgc tctgaattcg 180 gcacctgctg cacgcagaag gatccaccat ccctggggca tctctcggga aaaggcagaa 240 gagatcagct ggcacctggc ccacaccacc acctgcggca acaggccagg catgggcagc 300 agggaccgtg gagaaagcac ggggaggaaa tctggagctg agccccacca gagggaaaaa 360 aacacggacc gccaccattt tctctgtcct gggactgggc agagacctgc tttatcttgc 420 ccaattctga gctgcctgca tcctgagcag cctgcatcct gagaccaagg ggtgatctct 480 gcacccctct gaaatcgacc atcatttgcc cagagaggat actccaagtg catgataccg 540 gagagacaga cgccccaccc ccacactgga ccaaacacag acgcatgcaa tcagagtggg 600 agacctagca gcagacaatt tgggagcttt tcccgggaaa atctgctgga gcgagctgaa 660 cgctggcagt gagggccgcg gagggaacgc agggagggag cctgaggcag actcccaagg 720 aagcagtggg gagagcccac cctgagcctg gacggggctc cagagccttg ccaccaggtc 780 ccgagatccg gagactggta actgctggga agggagcaat agctcccggc agagaaactg 840 agcagactct ctaatgcctg ggcagagaag tgctccaggg tggcctctgc ggaacagcag 900 aactgcatct tgagaccagg cgctgatctc tgcgttgcgt tggggctgct ctttgcgacc 960 tcagagaaca ttgcccccag tgcacaagcc ctggcgggta atggcagatg cggaacccca 1020 gacgatccac cctccaccct gagccagctg ctgggcgata gacgcagaca gcagaactga 1080 ccgagaagga ggctggtggc agtattggtt tgtttgttta tctcatttgg ttgctttgtt 1140 tgttggttgg ttttctcttt tctttccttc tatttctctt tctttttctc tttctctgtt 1200 ttttctcttt ctattttctt tcttcttttc ctttctcctt ttttcttttt ccctccttct 1260 ctctcttttc cctcttatac cctgtcttgg actcaggggc tatgcaagca tacccactgt 1320 aaggtcatat tgtgcagaga tggagagcaa gtcccagctg gcccattcac cagatagcaa 1380 acattatgac ctgccaatag gttctgaatc cttgcagccc ccctcatccc ctagtaagtg 1440 ctcaaaattg taaagagaca gcacaatgcc cccagcccta cttcataatt gaacaaaccc 1500 agagccccgg ctcaaggaag gacaaaaggt agagcaaata tcagaagtga ctgcttctcc 1560 attttccact gagcaacgga attaaagaga aagataggag aactgctaca aatgaagaaa 1620 aaccaacgaa 
gaactgctgg taacccagag ccccagcaaa gagtcagtgc tagaaagaac 1680 agctactggg gcacaaacaa tggctatctc cgagaacgac agtacccaga aggtacttac 1740 agagatcaga gaatcattgg atagaatgaa taaaaatcaa gaaatgcaag ggctaaaact 1800 aaatgaaatt gcaacagagc tggacaaact caagaaagcg tacttagaaa taacagaaat 1860 aaagaactcc atccaagaaa taaatgacaa actgacaagc atgggaagca gaattgacca 1920 agcagaagaa agaatctcag agctggaaga ccaaaatatg gaattaaccc aatctattaa 1980 aaatatagaa aaaaacttaa gaagaaagaa gaaactctcc aagagatgtg cgattttatc 2040 aagaaaccaa acctacgtct gatcggaata cctgaaatag aaagagaaac agaaaacacc 2100 ctggaacaaa cattccatga ggtcattcaa gaaaacttcc ctcatctggc cagagagatg 2160 actatacaag cacaagagat tcagagaacc cctgcaagac atcttatgag aagaccaacc 2220 cctagacaca tagtaattcg cctacacaaa gtaggtataa aagaaaaaat cctaaaggca 2280 gcaagggaaa aaggtcagac tacctaccgg gggagaccaa tcagaattgc tgcagactta 2340 tctgcagaaa cactacaggc cagaaggaat tggaccccaa tctttaatgt tctcagagat 2400 aaacaatttc aaccaagaat ttcctaccca gccaagctaa gcttcatcag tgatggagaa 2460 ttaaaatcct tcccagacat ccaatcccta agagaatatg ctgcttcgag accagctcta 2520 caggagatgc ttaaaaaggt gctgtgcaca gaagaaaaaa gaaacaaaag aaagaccata 2580 tacttcacaa gatcacaaac aaacacagaa gcagcagaac acacagcata cccgcaaaaa 2640 tgaaaacaca cacatatatg aacacaaaag ctaaatgaaa acaaaacagc cttataagag 2700 cattatgaca gggacaaact ctcacatttc aataatcagc ctgaatgtga atggactaaa 2760 tgcaccactg aaaagacata gaatggcaaa ctggataaaa aaacatgacc cagtaatttg 2820 ctgtctccag gaaactcatc tcaccacaaa ggatgcccac agactcaaag ttagaggatg 2880 gaaaacaaat ttccaggtga acggatcaca aaagaaggca ggagtcgtga tcttaatatc 2940 agacaaaaca acctttaagc tatcaaaaat ttaaaaagat aaggaaggac actacataat 3000 gataaaaggt tcaatccatc aacaagaaat atccatccta aacatatatg cacccaacac 3060 aggagcacca gcttttataa agcagctact aaaaaaagat attgactcta acactatcat 3120 agctggggac ttaaataccc cactaacaac cctagataga tcatcgaggc aaaaaatcaa 3180 caaagagatc cggaacctca acttgacgct tgaccaaatg gacttagtag atacctacag 3240 aacactccac ccaacaacca cagaatatac attctactca tcaccacatg gaacgtactc 3300 caagatcgac cacatcctcg 
gccataaatc aagcataaac aaatttcaca agattgaaat 3360 actgccatgc accttctcag accacagtgg aataaaaata aatatcacca ccaacaaaat 3420 tcccccaaac ccacaaagac atggacacta aacagcatga tgctgaacaa ctcctgggtc 3480 aacacagaaa tcaaaacaga aattaaaaga ttcctggaaa caaatgaaaa tgaagaaaca 3540 tcttaccaaa acctctggga tgccatgaaa acagtactga gaggggaatt tatatctcta 3600 tgaacacaca tcaagaaaat agaaagagaa caagttaaca gcctaacaaa tcacctaagg 3660 gagctggaaa ggcaagacca ccaaaaccct aacttcagca gaagaatcca gatcaccaaa 3720 gtaaaagccc aaatatggga catagaagac aaaaatacca tagaaaaaaa tcaacaaaac 3780 aaaaagctgg ttctttgaaa ggataaacaa gattgatgga cccctagcca gaatgaccaa 3840 gaaaaagaga gaaaaagccc aaataaacac aatcagaaat gcaaaagatc aagtcacaac 3900 tgaccccaaa gaaatacaaa agattatcag agattactat gcacacctgt atggaaacaa 3960 actcgataac ctaaatgaaa tggaggactt tctgacatca cacaacctcc caaggttgaa 4020 acaagaagaa attgagatcc taaatagacc aataacaacc caggaaattg actctgtcat 4080 aagaaaacta cctacaaaaa aaaaagccct ggaccagatg gatttccagc ggaattctac 4140 aaaacataca aggaggagct gataccaatc ctattgaaag tattccaggc aatcgagaaa 4200 gatggaactc tccccaaatc attttatgaa gctaacatca cactgatacc caaaccaggt 4260 aaagatccaa caaagaaaga gaactacagg ccaatatccc tgatgaacat agatgcaaaa 4320 attctcaaca agattctagc aaatcggatc caacacatct caaaaatcat ccaccatgac 4380 caagtaggct tcatccctgg gatgcagggc tggttcaaca tccgcaagac cataaatgta 4440 attaaataca tcaacagatg taaaaacaag aaccacatga ttatatcatt agatgcagaa 4500 aaagcttttg ataaaatcca gcatcccttc ttgataaaaa cccttgaaca cctaggcata 4560 gagggaacat acctcaaaat agtaagagcc atctatgata aacccacagc caacatattg 4620 ctaaatggac agaaattgga agcatttccc ctgaaaaccg gaacaagaca aggctgccca 4680 ctctcacccc tcctgttcaa catagtgttg gaagtcctag ctagagcaat cagacaagag 4740 aaggaaatca ggggtatcca aataggaaaa gaagaagtca agttatccct ctttgctgat 4800 gatatgatcc tatacctcga aaatccaaga gaatctgtca aaaacctcct tgcactgata 4860 aaggactttg gcaaagtctc agggtacaaa ataaacgtgc aaaagacagt cacattctta 4920 tacaccaaca acaaacaggc agagaaccaa ataaaaagca caatcccatt cacaatagcc 4980 acaaaaaaat gaaatacctt ggcatcttcc 
taaccagaga agtgaaagac ctttacaatg 5040 aaaactacaa aacactgctc aaagaagtca aagatgacac aaacaaatgg aaaaatattc 5100 catgctcatg gattggaaga atcaacattg ttaagatgtc catcctacca aaggcaatct 5160 acagattcaa cgcaataccc attaaattac caacatcatt cttctcagac ctggaaacaa 5220 caatacagaa attcatatgg aaacataaac gaccacgaat agccaaaaca atccttagca 5280 aaagaaacaa agtgggaggt atcacacttc cagactttaa actttattat aaggctacaa 5340 taatcaaaac aacctggtat tggtacaaga acaggcatat agaccaatgg aacaggatag 5400 agattccgga agcaaaacct caatttctca accaactcat cttcgacaaa gcctccacca 5460 acaaccactg gggagaggag aacctattca gtaaatggtg ctgggaaaac tggctgacca 5520 cgtgcagaag aatgaaacag gacccctacc tatcaccata cacaaaaatc aactccaaat 5580 ggatcaaaga cctaaatgta aaacctcaaa ctttaagaat cttagaaaac gcaggagaca 5640 cccttatgga aattggaata ggcaaccaat tcctgatcaa aaccccaaaa gcccatgcca 5700 taagagataa gatagacaag tgggacctca tcaaactgaa aagcttctgc aaagcaaaag 5760 aaaccatcaa gagagtgggg agacagccca cagaatggga aaaaatattt gccaactgca 5820 tatctgacaa aggcctaaca tccaggatct acaaggaact caaacgtgcc aaaaggaaaa 5880 aaaacccatt aaaaagtggg caaaagacat gaatagacac ttctcaaaag aagacatatg 5940 agcagccaac agacacatga aaaaatgctc agcctcacta atcatcagag aaatgcaaat 6000 caaaaccaca ttgagatacc acctaactcc agtaagaatg gccatcatta ataaaacaaa 6060 aaacaacaga tgctggtgag gatgcggaga aaagggaatg cttctacact gttggtggga 6120 atgcaaacta gttcaacctc tatggaaaac agtgtggcaa ttcctaaaag aactagaaat 6180 tgaccttcca tatgacccag caatccccct tctgggaata tacccggagg aacttaaatc 6240 actctacaaa aaagatacct gcacatgcat gtttattgca gcactattca caatagcaaa 6300 aacatggaac caaccgtgct gcccatcgaa actggactgg attaaaaaaa tgtggtacat 6360 atacatgatg gaatattatg cagccataaa gaagaacaaa attatggact tcgcagcaac 6420 ttggatggat ttagagtcta tcatactcag tgatctatca cagaaacaaa gaactgagta 6480 ccacatgttc tcactcataa gcggaccctg aacatttacc ataatactat aagaaaggga 6540 ttggcagtag cgggaaactg ccaggggagg ggggggcaca taacatcaat aagggtacct 6600 gatctcaaac caggcgaagt ggggaccaaa gggaggccca aactttacct gtacgatgac 6660 cacttgtata cctaatcctg aattgtaccc cacatcttta 
aaataaaaaa ataaaaa 6717 // ID MSTAR repbase; DNA; PRI; 1651 BP. XX AC . XX DT 01-MAY-1996 (Rel. 5.2, Created) DT 23-JAN-1998 (Rel. 6.4, Last updated, Version 3) XX DE MSTa-Mammalian LTR internal retrotransposon sequence - a DE consensus. XX KW Non-LTR retrotransposon; MaLR family; MSTa subfamily; MstII; KW MER10; MSTAR. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-1651 RA Smit F.A.; RT "Structure and evolution of mammalian interspersed repeats."; RL PhD dissertation, Univ Southern California, 1995. XX DR [1] (Consensus) XX CC ORF: bases 48 to 1469. XX SQ Sequence 1651 BP; 436 A; 353 C; 489 G; 371 T; 2 other; gaawattggt actgaggagt ggagcattgc tataaagata cctgaaaatg tggaagcgac 60 tttggaactg ggtaacaggc agaggttgga agagtttgga gggctcagaa gaagacagga 120 agatgaggga aagtttggaa cttcttagag acttgttaaa tggttgtgac caaaatgctg 180 atagtgatat ggacagtaag ggccaggctg acgaggtctc agatggaaat gaggaactta 240 ttgggaactg gagcaaaggt cactcttgtt atacattagc aaagagcttg gctgcatttt 300 gcccctgccc tagagatttg tggaagtttg aacttgagag tgatgatcta gggtatctgg 360 cggaagaaat ttctaagcag caaagcgttc aagatgtgac ctggctgctt ttaacagctt 420 acagtcatat gcgagagcaa agaaatcact taaagttgga atttatattt aaaagggaag 480 cagagcgtaa aagtttggaa aatttgcagc ctggccatgt gatagaaaag aaaaacccgt 540 tttctggaga gaaattcaag caggctgcgg agcgaccgtt tgctaaagag attagcataa 600 ctaaaaggaa gccaagtgct gatagccaag acaatgggaa aaaggcctcg aaggcatttc 660 agaaatcttc gaggtggtcc ttcccatcac aggcccagag gcctaggagg actgaatggt 720 ttcgtgggcc aggcccaggg ccccgctgcc ctgtgcagcc tcgggacact gctccctgca 780 tcccggctgc tycggctcca gccgtggctc aaagggcccc aggtacagct cgagctgccg 840 cttcggagag tgcaagctat aagccttggt ggcttccaca tggtgttaag cctgcaggtg 900 cacagaatgc aagagtgaag gaggcttggc agcctccacc tagatttcag aggatgtatg 960 ggaaatcctg ggtgcccagg cagaagcctg ctgcagggac ggagccctca cagagaacct 1020 ctactagagc agtgccaaag ggaaatgtgg ggttggagcc cccacacaga gtccccaccg 1080 gggcactgcc tagtggagct 
gtgggaaggg ggccactgtc ctccagaccc cagaatggta 1140 gagccactgg cagcgtgcac cgccagcctg gaaaagccgc aggcatcaga ctccaacccg 1200 tgagagcagc cacgtgggct gtgcccagca aagccacagg ggcggagctg cccaaggcct 1260 tgggagccca cccctcgcac cagcgtgccc tggatgcgag acacggagtc aaaggagatt 1320 attttggagc tttaagattt aatgactgcc ctgctgggtt tcggacttgc gtggggcctg 1380 tagccccttt cttttggccc atttctccct tttggaatgg aaatatttac ccaatgcctg 1440 taccaccatt gtatcttgga agtaaataac ttctttttga ttttacaggc tcataggtgg 1500 aaggaacttg ccttgtctca gatgagactt tggactttgg acttttgagt taatgctgga 1560 atgagttaag actttggggg actgttggga aggcatgatt gtattttgca atgtgagaag 1620 gacgtgagat ttgggggaac caggggcaga a 1651 // ID PTERV1a_LTR repbase; DNA; PRI; 409 BP. XX AC . XX DT 14-MAR-2006 (Rel. 13.02, Created) DT 21-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from Pan DE troglodytes. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW PTERV1a_LTR. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-409 RA Smit A.F.; RT "PTERV1a_LTR - ERV1 Endogenous Retrovirus from Pan troglodytes."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC 1% div With PTERV1a & b lib20040702. XX SQ Sequence 409 BP; 121 A; 112 C; 89 G; 87 T; 0 other; tgaaaagagc cgggcacatt cctcagcccc gggctcaaaa caaacaagcc cagtacaaac 60 acatcccatc ctcccatccc accacatatc accatatatc tcttaaactt cccccgggct 120 caaaacaaac aagcccagta caaacaccac caggaaagtc tccgataagg ggacagatga 180 ggggacagcc gttcaaagtt ttactgaaag agcgggaacc aaaagaattc ctttgttccc 240 ctgtaacttt caggctataa aaaagcaaac actcgcattg ttcggggccc tcttgtatgc 300 ggtggaatgg agggaccagg ttcgaacttg tagtaaagat ccttgccgct tggctttgac 360 tctggactct ggtggtcttc tttggggaac aaacggtctg ggcataaca 409 // ID TINE1 repbase; DNA; PRI; 86 BP. XX AC . XX DT 31-DEC-2009 (Rel. 
15.03, Created) DT 20-NOV-2010 (Rel. 15.07, Last updated, Version 6) XX DE retrogenes from the LTR portion - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; TINE1. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-86 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(3), 517-517 (2010). XX RN [2] RP 1-86 RA Bao W. and Jurka J.; RT "TINE1 is a retrogene from the LTR portion."; RL Direct Submission to Repbase Update (16-NOV-2010). XX DR [1] (Consensus) XX CC The youngest copies are >96% identical to consensus. In other CC mammalian genomes, such as human, TINE1-like elements are also CC present and sequences are most similar to their respective LTR in CC the genomes. Furthermore, a considerable percentage of CC TINE1-like sequences (~50%) in the human genome are preceded CC with the 3'-end sequences of the internal portions of HERV17. CC Therefore, TINE1 is classified as probably derived from LTR CC (although insertions of TINE elements to LTRs cannot be ruled CC out). CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 86 BP; 29 A; 25 C; 12 G; 20 T; 0 other; ggcaaccccc tcgggtcccc tcccactgtg ggagcttttc tgtcctctca ataaatcctg 60 ctttctaaaa aaaaaaaaaa aaaaaa 86 // ID ERV1-Mim_I repbase; DNA; PRI; 5417 BP. XX AC . XX DT 31-OCT-2009 (Rel. 14.11, Created) DT 31-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Endogenous retrovirus-like element: consensus of the internal DE portion. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1-Mim_I; ERV1-Mim_LTR. 
XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-5417 RA Jurka J.; RT "Endogenous retroviral elements from the mouse lemur."; RL Repbase Reports 9(11), 2821-2821 (2009). XX DR [1] (Consensus) XX CC Top sequences are >99% identical to consensus. ORFs corrupted by CC mutations. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX FH Key Location/Qualifiers FT CDS 446..1405 FT /product="ERV1-Mim_I_1p" FT /translation="MGQGKSTPLSLTLTHWKEVRDRAHDLSVIIRKGPWQT FT FCASEWPSFGVGWPSDETFNLSVISAVKKKIFAPHPQGHLNQGPYIIVWQD FT LTQNPPSWVKMPVPQPSLALVAQTAPSDPARRPLPLTDPQAESLLLESPPP FT YVPQSQAQAARPGPGTAQPTAPMVEGPAQGTRSRRGLSPDSIVACPLRPVP FT VPNSGPGSDDDTAPLQLLQYWPFSTADLYNWKSQNSNFSQNLGDLINLLDS FT VLFTHQPTWDDCQQLLKVLFTTEERERIQGEARKLVPGRTADLLQTREPST FT ELFPLNGRFGTAMRLKVGSVSGSTARL*" FT CDS 2315..4558 FT /product="ERV1-Mim_I_2p" FT /translation="MEDAFQALRAAMLEAPALALPDPTKEFHLFIDEKRGN FT SQGGTHAGPGTLEETGGVLVKKKLDPVAAGWPACLRIIAATALLVKDADKL FT TLGQRLQITTPHAIEGVLKQPPGRWITNARLTHYQGLLLDAPRIEFRAPAA FT LNPATLLPTPAAAAPERDCLEILAETQTTRRDLKDRPLPGSDLTWFTDGSS FT FIRDGRRYAGAAIVDDQGKLVWAARLPQGTSAQKAELIALTEALSRARGKR FT LTVYTDSRXAFGTVHIHRALYRERGFITAEGKEIKHMPEILHLLEAVLLPK FT AVAVVHTPGHQSGDSVEARGNRRADAEAKKAAEDTPQPNILHLSLPPPGMG FT RLPPLPDYSSVDEVWASEQLREDGWLRDEGHRLIVPELLRRHLLKHLHQTT FT HLGKRKMLQLLDTAQLRFKSQGRVPDDIVKSCRACQVMQPGRTRGTHAGTR FT ERGRQPGLFWEVDFTEVKPRKYGYRYLLVMVDTFSRWVEAFPTKGETALTV FT AKKILEEIVPRYGLPEGIRSNNRPAFVCQVSQGLARAMGIDWKLHCAYNPQ FT SSGQVELINRTIKETLTKLALETGGDWVTLLPFALFRARNTPYHLGLTPFE FT IMYGTPPPVVPRMSPEALPGHPKEVLQAVQALQRVHSQVWPALRAIYRPEE FT EGAGHPFHPHQPGDWVWVKRHNSRTLEPRWKGPYQIILVTPTALKVDRIVA FT TWLHHSHVRPANKKDQERCQDQWKATADPKNPLKTRLTRVAATAQDR*" XX SQ Sequence 5417 BP; 1358 A; 1384 C; 1542 G; 1128 T; 5 other; tttgggggct cgtccgggat nggagaggcg acccctctta aaaagaccgg 
tggcaccagg 60 cttggaggta ggccgactcc ggagtcgtcc tttatgtttg tctctgtgtt gtatgtgatg 120 tgtcagtttt gaagtttgaa tttggaacga cagggtacta actccgtgag gaggaagttc 180 ctctggacca agatagggtt cttggcggcc agaaaaactg tcgccctgga ggacgctcca 240 gggattaggg gtgagtgacc aagtggcctt ggtcgactcc atttgtttgc ctgcaagttg 300 ttgcaggatt gggcaaggta accctagggt aaggtatcgt cctttgtctg tgtttgtgtg 360 caaaagcaag tggcttcggt atttgtctac tgtctgtgtt gtcgtccttt tctctttaat 420 tctccttttt actcatccag gaattatggg ccaagggaag agtactcctc tctccctcac 480 cctgacacat tggaaggaag tacgagaccg ggcccacgac ctgtccgtca tcatccggaa 540 gggcccctgg cagacgtttt gtgcctctga gtggccctcc ttcggcgtgg ggtggccctc 600 agatgaaact tttaaccttt ctgttatttc tgcagtgaaa aagaaaatct ttgcaccgca 660 cccccaagga cacctgaatc aaggtcccta tatcattgtc tggcaggacc tcacccaaaa 720 ccctccttct tgggtaaaga tgccggtgcc tcagccttct ctcgccctag tggcccagac 780 ggctccatca gatccagctc ggcgacctct acccctaact gatccccagg cagaatccct 840 cctcctggaa tctcccccac cctatgtccc ccagtcgcag gcgcaggcag cccggccagg 900 gccggggaca gcccagccca cggcccctat ggttgagggt ccggcccaag gcactaggag 960 taggagaggg ttgtctccgg attccattgt cgcctgcccc ctacgcccag tgccagttcc 1020 caactctggg ccagggtcgg atgacgacac agctcccctc cagctcctcc aatattggcc 1080 cttttccaca gctgatctct ataattggaa atctcaaaac tcgaacttct cccagaatct 1140 gggagaccta attaatcttt tagactctgt tcttttcact catcagccca cctgggatga 1200 ttgccagcag ttgcttaagg tgctcttcac aacggaggaa agagaaagga ttcaggggga 1260 ggcgcggaag ctggttccgg ggaggacggc agacctacta caaacccgcg aaccatcgac 1320 cgaacttttc cccttgaacg gccgctttgg gactgcaatg aggctgaagg tagggagcgt 1380 ctccgggtct accgccagac tctgatggcc ggtctccgta tggcggcgcg aaagccgacc 1440 aatttggcca aggtaggaga tgttcgtcag gggccagaga gcccggcagc atacttagag 1500 aggatcatgg aggccttccg gcagtacacc cccatagatc ccactatgga agagagcaag 1560 gcagctgtta tgatggcgtt tgtcaatcag gcggcccccg acattaggcg caaggtgcag 1620 agaatagata gattgggcga gaagactctg caggacctgt tagaagtggc ggagaaggtc 1680 tataacaata gagagacgcc agaagagagg ttagagagga tcaggataga aaataggaaa 1740 ttccaagctg 
gaaaagcacg gaaagcaaac agagagatgg ctaaaatcct gctagccgcc 1800 acaagagggg ggcagatagg gtcagaggat agggaaaggc cccggcggga aaggctaggc 1860 aaggatcaat gtgccaattg caaagagcat ggacattggg cctgagagtg ccccaaaaga 1920 aaggggggcg aagacttgga ggtcctgaaa aagtcacggt tgcaggacag gtgatagacg 1980 aatagggaag atggggttcg gtccccctcc ccaaacctag ggtaactttg caagtggagg 2040 ggaacccagt cagcttcctc atagatacag gagcagaaca tttggtacta acggaagaca 2100 caggaaaatt gtccagtaag accagctggg tgcagggggc aacaggagcc aaactatatc 2160 ggtggaccac gtggcagaga ttggatttgg gttcaggata aactcgccta ccctgttcaa 2220 cgaggccctc catgatgacc ttgggttttc ggaaaaggct cgacctctgt atgaagggcg 2280 taaggcgggg cgagcatggg agtggactgt gcaaatggaa gacgcttttc aggccctgag 2340 ggcagccatg ctggaagccc cggccctagc actccctgat cctacaaaag agtttcacct 2400 gttcatagac gagaagaggg gaaatagcca agggggtact cacgcaggcc ctgggaccct 2460 ggaagagacc ggtggcgtac ttgtcaaaaa aaaactagat ccggtggcgg cggggtggcc 2520 agcgtgcttg cgaatcatcg cggccacggc cctgttggtc aaagacgctg ataagctcac 2580 cctagggcag cggctgcaga tcaccacccc gcacgccata gagggggttc tgaagcagcc 2640 accgggacgg tggatcacaa atgcgaggct gactcattac caaggacttt tgctggacgc 2700 accccgcatc gagttccggg cccctgccgc cttgaatccg gctacgcttc tgccaacccc 2760 cgctgcagcc gcacctgaac gtgattgcct tgagatccta gccgagaccc agacgacccg 2820 tagagacttg aaggatcgcc cccttccagg tagcgacctg acctggttca cggacggaag 2880 cagtttcatc cgggacggac gcaggtacgc aggggcggcc atagtagacg accaaggtaa 2940 acttgtctgg gcggcacgcc ttccgcaagg gacatctgct cagaaggcag aactaatagc 3000 gctgacggag gctcttagcc gggcccgagg aaaaaggctg acggtgtaca ctgacagccg 3060 ctntgccttt gggaccgtgc acatacatag ggccctctac agggaaaggg gcttcatcac 3120 ggcagaggga aaggaaatta agcacatgcc tgaaatactc cacctactag aggctgtttt 3180 gctgccaaag gcagtggcgg tagtccatac cccgggacac cagagcgggg attccgtgga 3240 agcacggggc aatcggagag cggacgctga agctaagaag gctgcagaag acactccgca 3300 gccaaacatc ctgcacctta gcctgccacc cccaggcatg ggacggttgc ctcctctgcc 3360 ggactattcc agtgtagatg aggtctgggc ctcagagcag ctacgtgaag atggctggtt 3420 gcgggacgaa gggcatcgcc 
taatagtgcc agaactctta agacgccacc tgttaaagca 3480 cctgcatcag accacacatc tagggaagag aaagatgctg caactgctgg acactgctca 3540 actaaggttc aagtcacaag gacgggttcc agatgacatc gtcaaaagtt gccgagcctg 3600 tcaggtcatg cagccgggga ggaccagagg gacccacgca ggtacgaggg aaagggggag 3660 gcagccagga ctattttggg aagtggattt cacagaggtc aagcctagaa aatacgggta 3720 ccgatattta ctggtcatgg tagatacgtt ttccagatgg gtagaagcct tccctacgaa 3780 aggagaaaca gctttgacag tagctaagaa aatattagaa gaaatagtcc ccaggtatgg 3840 actgccggaa gggataagat caaataatag acctgcgttt gtctgtcagg ttagtcaagg 3900 gctggcccgg gctatgggga tagattggaa attacattgt gcatataatc cccagagctc 3960 tgggcaggta gagctaataa atagaacaat aaaggagacc ctgactaaat tggccctgga 4020 gactggcgga gactgggtga ccctccttcc cttcgccctg ttccgagccc ggaacacccc 4080 ttatcacttg ggtttaaccc cttttgaaat catgtacggc acaccccctc ctgtagttcc 4140 taggatgagc cctgaggctc ttccgggaca tcccaaagaa gtgttgcaag cggtgcaggc 4200 tctgcaacgt gtccacagtc aagtatggcc ggccctccgt gctatctacc ggcctgagga 4260 ggagggtgcc ggacacccgt ttcacccaca ccagccaggg gactgggtgt gggtgaaacg 4320 gcataacagc agaaccctcg agccgaggtg gaagggcccc tatcaaatta ttcttgttac 4380 ccccactgct cttaaggtcg atagaatagt agcgacctgg ctacaccact cccacgtgag 4440 acctgctaat aaaaaagacc aggagcggtg ccaggaccag tggaaagcaa ctgcagaccc 4500 gaagaatcct ctgaagacca gactgacccg agtggcggcg acagcccagg accgctgaca 4560 cagacttgga ctggggttgt ggcatctaaa gattgactgg gtgacaaaca tggacgggtc 4620 gcggtggggg gttggttatt gtgttttggg gatgttgtgt ttttggtcca atgtttttgg 4680 gatagagacc tacagctgta cttaccgccc cgaaccccca ccaaccccat aaccttacat 4740 gggtctaaag attactgtat aatggtgtta gtgttcccaa aaataatata ccacactgag 4800 gagacaatgt atgagggcat aatagggcga agagtcacca acctaatttt taaaagaaaa 4860 agagagccct tcacagccat aaccctggct actctttttg gcttagggac aataggagca 4920 ggaactagta tctcttctct ggcaatgcag caaagagggt ttaatactct gagggcagct 4980 gttgatccga ggtagtgctc cagaaccaga gggggttaga tctagtattc ctccagcaag 5040 gaggactgtg tgctgccctc aaagaagaac gttgtttcta tgcagaccat acgggagttg 5100 ttagagaatc tatagctaag gtcagggaag 
ggcttgcccg ccgcaaacgg gaatatgaac 5160 ggcaagcagg ttggtttgag tcttggttta acagttcccc ctggttaact actctactct 5220 ccactctgct agggcccctg cttatactca tcttgctcct aacctttggg ccttgtatac 5280 tcagccgact agtgactngt ctgagagcgg gtcggcgcng ttcagctact ggtnctgcaa 5340 cgacactatc aacctcttgc tggagatgaa gtcctgtagt ttcaagatta aaactttcca 5400 aaagaaaaag gggggaa 5417 // ID Alu3_TS repbase; DNA; PRI; 298 BP. XX AC . XX DT 09-APR-2010 (Rel. 15.04, Created) DT 09-APR-2010 (Rel. 15.04, Last updated, Version 2) XX DE Alu-like SINE element - consensus. XX KW SINE1/7SL; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Alu3_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-298 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(4), 634-634 (2010). XX DR [1] (Consensus) XX CC ~85% identical to consensus. CC This sequence was derived from sequence data generated by the CC Broad Institute Mammalian Genome Project. XX SQ Sequence 298 BP; 70 A; 71 C; 103 G; 53 T; 1 other; ggccgggcgc agtggctcag cctgtaatcc cagcactttg ggaggccgag gcgagaggat 60 tgcctgagcc cgggggttcg agaccagcct gggcaacttg gtgagacctt gtctctacaa 120 taaataaaaa attagccagg cgtggtagcg cgcgcctgta gttccagcta cttggaaggc 180 tgaggcggaa ggatcgcctg ggcccagcag gttggggctg cggtggccgt gagcatgcca 240 ctgcactacg gcatgggggg angagactcc aaatcttaaa aaagtctgaa aaggaaga 298 // ID LTR15_OG repbase; DNA; PRI; 945 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.11, Created) DT 18-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat (consensus). XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR15_OG. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. 
XX RN [1] RP 1-945 RA Jurka J.; RT "Long terminal repeats from bushbaby."; RL Repbase Reports 9(11), 2861-2861 (2009). XX DR [1] (Consensus) XX CC >90% identical to consensus. 6bp TSD. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: Bushbaby. XX SQ Sequence 945 BP; 248 A; 237 C; 229 G; 231 T; 0 other; tgcaggcaga atggggaccg aagaccccat tcttggcaag ggaaaacacc cccagaaaaa 60 gggaaaaatt aaaaagtaag ctgagtgagt cggaacaaag gaaagtcagc tagtttacct 120 ttaaaccagc cgctgctaaa tttcttatct gagcaagttt cctgcttttc ctagaactct 180 ggcctattgg ggatacagct tactaacacc cctgctgggt ctgtaaacac tctgaggaaa 240 taatggccac agaagatgct gtggcaaggc ctggaaagat acattaacta ggccgtttcc 300 tgcccagtaa acagtcaaag caaataccca ttgtggagat ttgtatttgg ctggggcaga 360 gatcaaggga acagctcgag ccctgaaata tgggtaggag ggcttatatc ccttatcgca 420 gaggaagacc ctctgtggtg gtgatgagag gaaaattcct ctctcccacc gattcttcag 480 aggagattga cgccagtggc gttttactcc acagtgaact agcggagttt aacacctccc 540 taggaatctg tgtgtgtgac ctcagacagg aggctgcccc cacacaaggc tcaggttccg 600 gatttctcaa gactgctcag acccttaaag ttaatgatta tagaaacaat aggagctgat 660 atgctcgctc agagaaactg ctggcgtctt tgttcccttc ctcagcttac cctttgggct 720 aatgagcaaa tagccttcaa taaaagctga gtggaactga cactcggggc caccgccgga 780 accccacaag cgggcggtgg tcccctgagt tcccaccttt gaaaattcta ttctgtgtgt 840 cttgtctctt tctttctcgt cgtttctaac tttatatttc ttcagtcggt cgccgccaga 900 tccacaggtc ttaacggacc ctgcccccgg ggcaggaccc cggca 945 // ID LTR10B_Mim repbase; DNA; PRI; 284 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR10B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-284 RA Jurka J. 
and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2965-2965 (2009). XX DR [1] (Consensus) XX CC >98% identical to consensus. 6bp tsd. CC Similarity to RLTR10F from mouse. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 284 BP; 56 A; 88 C; 73 G; 67 T; 0 other; tgtggggcgc ggtggcaacg ccattccaag atggcgcccg cttcctggtc acaccctact 60 gtgcgtctga caaacagatg ttcagcgcat gtgcaaagcc ttgcctcctc tcctgttctg 120 tgttgaccaa tcaggttatg ccccgtgtac ttactgtcta tataagcagc cgccgagaac 180 gcctcggcgt cttccgcatg taaccagtta agcaatcccc attaaagcgc tgtcagaaga 240 actccagttg ccgcgtcttc cttgctggcg aggcgggcgc gaca 284 // ID TINE2 repbase; DNA; PRI; 87 BP. XX AC . XX DT 31-DEC-2009 (Rel. 15.03, Created) DT 31-DEC-2009 (Rel. 15.03, Last updated, Version 2) XX DE SINE element - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; TINE2. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-87 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(3), 518-518 (2010). XX DR [1] (Consensus) XX CC >92% identical to consensus. CC This sequence was derived from sequence data generated by the CC Broad Institute Mammalian Genome Project. XX SQ Sequence 87 BP; 26 A; 24 C; 12 G; 25 T; 0 other; ggcaacccct tcgggtcccc tcccactgtg ggagcttttc tgtcctctca ataaatcctg 60 ctttctaaat aaataaataa ataaaaa 87 // ID LTR21_Mim repbase; DNA; PRI; 423 BP. XX AC . XX DT 09-NOV-2009 (Rel. 14.11, Created) DT 09-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR21_Mim. 
XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-423 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2981-2981 (2009). XX DR [1] (Consensus) XX CC ~90% identical to consensus. 4bp tsd. CC Similarity to LTR20_OG from bushbaby. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 423 BP; 102 A; 102 C; 96 G; 123 T; 0 other; tgtaacagag ggaatggcct ggagaaaaaa aaggcaaaaa gattttctgt ctctttaaaa 60 cttcccccac tctttttggg aactgcagcc tggcctgcat ccctgaggca ggttaagtct 120 tttgttaaaa ctcccaggtg gcgccagccc cccaggtggg cagctggcgg agttcacagg 180 ctttgtcttt ggcagagatg ggatgattag aatagttaac cacagtaatt gcctgaagct 240 gagaaccctc cctttaaaag ctctgtattt ctgcctatgt tcaggaaatt tgggattttg 300 agatgagaag gtctaccatt tttcctcctt tgcaaagtaa actctctttc cttcctcccc 360 aaaccacttg tcctcgttct tctgattcgg cctcgtggac aagtggctga gctttcagta 420 aca 423 // ID MSTA repbase; DNA; PRI; 426 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 23-JAN-1998 (Rel. 6.4, Last updated, Version 3) XX DE Transposon-like human element long terminal repeat (MSTa DE subfamily) - a consensus. XX KW Non-LTR retrotransposon; MaLR family; MSTa subfamily; MstII; KW MER10; MSTA. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-426 RA Jaiswal K.A., Gonzalez J.F. and Nebert W.D.; RT "Human P1-450 sequence and correlation of mRNA with genetic RT differences in benzo[a]pyrene metabolism."; RL Nucleic Acids Res 13, 4503-4520 (1985). XX RN [2] RP 1-426 RA Lawrance K.S., Das K.H., Pan J. and Weissman M.S.; RT "The genomic organization and nucleotide sequence of the RT HLA-SB(DP) alpha gene."; RL Nucleic Acids Res 13, 7515-7528 (1985). 
XX RN [3] RP 1-426 RA Mermer B., Colb M. and Krontiris G.T.; RT "A family of short interspersed repeats is associated with RT tandemly repetitive DNA in the human genome."; RL Proc. Natl. Acad. Sci. U.S.A 84, 3320-3324 (1987). XX RN [4] RP 1-426 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [5] RP 1-426 RA Smit F.A.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21(8), 1863-1872 (1993). XX DR [5] (Consensus) XX CC Replaces MER10 sequence. XX SQ Sequence 426 BP; 85 A; 111 C; 101 G; 124 T; 5 other; tgatatggtt tggatctgtg tccccaccca aatctcatgt tgaattgtaa tccccaatgt 60 tggaggtggg gcctggtggg aggtgattgg atcatggggg cggattyctc atgaatggtt 120 tagcaccatc cccttggtgc tgtyctcgtg atagtgagtg agttctcatg agatctggtc 180 gtttaaaagt gtgtggcacc tcccycctct ctctcttgct cctgctctcg ccatgtgacg 240 tgcctgctcc cccttcgcct tccgccatga ttgnaagctt cctgaggcyt ccccagaagc 300 cgagcagatg ccagcgccat gcttcctgta cagcctgcag aaccgtgagc caattaaacc 360 tcttttcttt ataaattacc cagtctcagg tatttcttta tagcaatgtg agaacagact 420 aataca 426 // ID L1-5_TS repbase; DNA; PRI; 6035 BP. XX AC . XX DT 03-MAY-2010 (Rel. 15.05, Created) DT 03-MAY-2010 (Rel. 15.05, Last updated, Version 1) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-5_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-6035 RA Bao W. and Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(5), 771-771 (2010). 
XX DR [1] (Consensus) XX SQ Sequence 6035 BP; 2317 A; 1413 C; 1203 G; 1101 T; 1 other; gagggatcca agatggccac catggagcag cagaccggca cctctctcac cgagcaaggc 60 agagagtagg gacccagccg ccaatagaca gagaccctga acacttttgg atgggtgaag 120 caccctgcac ccagcaaagg tctctttgaa cacatagaga ccagaaagtg acggcagatg 180 cgggaatccg ggcaccccat cccccgccat gtgccaggct tgcacaggaa ctgaagccct 240 gcccgtgaac gagagtgggt gagtgatcct aagatgcttt gggtcggttc acccctccca 300 gccatgcatg agtcattggt cccatgcagg aggagtcctg gtcagggcac agcagcaggc 360 aaccggagcc caagtggaat gggagaaaaa aacctatgct ggtcagccgc catgagcatg 420 gccccagccg gctcccaggc agggaaagcc cacctgcaga gtgaaccttg ctggcctgct 480 gacccatctt cccagccaga tcaggaagaa tcacctccaa gccctgcatg cctacagggt 540 ggatccccca ttcccccatg gcagggaact cactcttttt caagagcaga gtgtggactt 600 gaacagggag ggagcgacac gaaagacacc tgaaggcttt cttggtggga aggcttctgg 660 gctgcaaacc gcagatgggc gggcccatcc ccccacctgc tctccacaga gtgctagcag 720 caggggctga gaaaagctac agttgagaaa cctgtcacag ggtggagcaa ggagcagaag 780 tgaatgcccc tccctctctc actgagtgac ctatattgaa ccacagaaag ggccctagaa 840 cagagggtca ttgactagct tcagagcaac aaaaccaaaa ggaattacag agagggagag 900 aggcctgtca caaatgagga acaacaacaa aaatagacaa ccctcagagt cctaaagaga 960 cagcacccga aaaagcagct atcagagcac aaacaatgga aaccccccag aacgacctgc 1020 cccagatgac agtcacagaa atcagagatt ggatggctaa aatgaacaga gagaaccaag 1080 agaaactaga agcaaaactc actgaagtca tacaggaagt aaaagcagag ttggccaatt 1140 tcagaaaagc ccattctgaa ataatggaaa tgaagaactt catcctggaa attaaaaaca 1200 cagtaacaag catagggagc aggatcaacc aggcagagga aagaatctca gagcttgaag 1260 accaaaatat ggaactaacc cagtctgtca aaaacatagg gaaaaaagct caaaaagaag 1320 gagcaaagcc tctgagagat gtgggactat gtcaagaagc caaatctatg cctgattggt 1380 atccctgaag cagaaaggga aactgagaac accatggaac aagcattcca tgaggtcatc 1440 caagaaaact tcccacatct cactagagag gtgaccattc aagcacaaga gattcagaga 1500 acccccacaa ggtatcacat gagaagacca tccccaagac acatagtaat ccgcctccac 1560 aaagtaagca tgaaagaaaa aatcctaaag gcagcaagag agaaaggtca gactacctac 1620 cagggaagac 
ccatcagaat cgctccagac ttatctgcag aaacactaca agccagaagg 1680 gactggagtc caatttttaa tgttcttaaa gataagcaat tccagccaag aatttcctac 1740 cccgccaagc taagcttcat cagtgatgga gaattaaaat ccttcccaga catccaatcc 1800 ctaagagaat atgccgcctc cagaccagcc cttcaggaga cgcttaagat ggtgaacaca 1860 ggaaaaaaaa aaaagaatgg tcatacccat cacaaaagta caggcaagca cagagttaac 1920 agagcataca gagcactcac aaaaatgaaa atacacacat atataaatac aaaagtaaaa 1980 aggaaacaaa gcaaccccct aagagcccta tgacagggat aaactctcac atttcaataa 2040 tcaatttgaa tgtgaatgga ctaaatgcgc cactgaagag acatagagtg gcaaactgga 2100 taaagaagca tgatccaaca gtctgctgcc tccaggaaac tcatctcact gcaaaggaca 2160 cccacaggct caaagttggg ggctggaaaa tggtctttca ggcaaatgga aaacagaagt 2220 agctattctg atatcagaca aagcagactt caaaccatca aaagttaaaa aggacactac 2280 ataatgataa aaggctcaat ccaccagcaa gaaatatcca tcttaaacat atatgcaccc 2340 aacacaggag caccaggttt tataaaacaa ctactaagta aactaagaga ggacattgac 2400 tctcacacaa tcatagttgg agacctaaat accccactaa cagctctaga tagatcatcg 2460 aggcaaaaaa ccaacaagga gatctggaac cttaactcaa tgctgatcaa atggatttaa 2520 tagacatcta cagaacactc cacccaacat ctacagagta tacattctac tcatcagcac 2580 atggaacata ctccaagatc gatcacattc tcggacataa atcaagcgtg aacaaatttc 2640 aaaagatcaa aataatacca tgcatcttct cagatcacag tggaataaaa ataaatatcg 2700 ccaccaacaa gatcccccca aaacacacaa agacatggac actaaacaac atgctgctga 2760 acgacttctg ggtcaacacg gaaatcaaga cagaaataaa aagattcctg gaaacaaaca 2820 aagacacatc ttatcagaat ctctgggatg ccatgaaagc agtgttaaga ggaaaattca 2880 tagcattgca cgcacacatc aagaaaacag aaagatcaca agtaaacagc ctaacatcac 2940 acctaaggga gctggaaagg caagatcatc taaatcctaa cttcagcaga agaatccaga 3000 tcaccaagat aaaatcacaa ttgcaggata tagaagacaa aaatatcata gaaaggatca 3060 acaagacaaa aagctggttc tttgaaagga taaataagat tgacagaccc ctggccagat 3120 tgactaaaaa aagagagaaa gtccaaataa acacaattag aaatgcaaaa ggcgaagtca 3180 caactgaccc tgaagaaatt caaaagatta tcagagatta ctatgaacac ctgtatgcaa 3240 ataaactaga aaacctaaag gaaatggagg actttctgac atcacacaac ctcccaaggt 3300 tgaaacaaga agaaattgaa 
actctaaaca gaccaataac aatccaggaa attgactcag 3360 tcataagaaa tctccccacc aaaaaaagcc ccggaccaga tggcttccca gctgaattct 3420 acaagacata caaggaggag ctgataccaa tcttattgaa agtattccag gcaatcgaga 3480 atgatggaat tctccccaac tcattttatg aagctaacat catactgata cccaaacctg 3540 gcaaagaccc aacaaggaaa gagaattaca ggccaatctc cttgatgaac atagatgcaa 3600 aaattctcaa caagattcta gcaaatcgga tccaacaaca catctcaagg atcatccacc 3660 atgaccaggt gggcttcatt cctgggatgc agggctggtt caacattcac aagaccataa 3720 acataattca gcacattaac agatgtaaaa ccaagaacca catgattata tcattagacg 3780 cagaaaaagc atttgacaaa atccagcatc ccttcttgat aaaaaccctt gagcacctag 3840 gcatagaagg aacattcctc aaaacagtaa gagccatcta tgataaaccc acagccaaca 3900 ttttgctcaa tgggcagaag ctggaagcat tccccctaag aacgggaaca agacaaggat 3960 gcccactctc acccctcctg ttcaacatag tgctggaagt cctagccaga gcaatcaggc 4020 aagagaagga aatcaggggt atccaaatag gaaaagagga agttaagcta tccctctttg 4080 tggacgatat gattctatac cttgaaaacc ctagggagtc tgtcaaaagc ctcctcgcac 4140 tgataaatga ctttggtaaa gtcttgggtt acaaaatcaa tgtgcaaaag acagttgcat 4200 ttctatacac cagcaacaag caggcagaga accaaataaa aagcacaatc ccattcacaa 4260 tagccacaaa aaaatgaaat accttggcat ccacctaacc agagaagtga aagaccttta 4320 caatgagaac tacaaaacgc tgctcaaaga aatcaaggat gacacaaaca aatggaaaaa 4380 cattccatgc tcatggattg gaagaatcaa tattgtcaag atgtccatcc tacccaaggc 4440 aatctacaga ttcaatgcaa tacccatcaa attaccaaca tcatttttct cagacctgga 4500 aatgacaata cagaaattca tatggaaaca aaaaagagca cgaatagcca aaacaatcct 4560 cagcaaaagg aacaaagcgg gaggtatcac acttccagac tttaaacttt attacaaggc 4620 tacagtaacc aaaacagcct ggtattggta caagaatagg cacatagacc aatggaacag 4680 gacagagatt ctggaagcaa aaacacagtc tctcaaccaa ctcatctttg acaaagccac 4740 caataacaat cactggggaa aggagaccat atttagcaaa tggtgctggg aaaactggct 4800 gaccacatgc agaagaatga aactggaccc ctacctatca ccatatacaa aaatcaactc 4860 aaaatggatc aaagacctaa acgtaaaacc tcaaactata agaatcctag aagaaaatgt 4920 aggaaacacc cttatgcaca tcggagtagg caacgaattc ttgaccaagt ccccaaaagc 4980 aaatgccata aaagctaaga tagacaagtg 
ggacctcatc aaactgaaaa gcttttgcac 5040 agcaaaagaa accatcaaga gagtaaagag acaacccaca gaatgggaaa aatatttgca 5100 aattatgcat ctgacaaagg cctaacatcc aggatctaca aggaactcaa acaaagtgaa 5160 aggaaacaaa caaccccatt aaaaagtggg caaaggacat gaacagacac ttctcaaaag 5220 atgacataca agcagccaac agacacatga aaaaatgctc agcctcacta atcatcagag 5280 aaatgcaaat caaaactaca atgagatacc acctaactct ggtaaggatg gctatcatta 5340 ataagtcaaa aaacagatgc tggcatgggt gcagagaaaa gggaacgctt ctacactgtt 5400 ggtgggagtg caagctagtt caacctctat ggaaagcagt gtggcgattc ctaaaagaac 5460 taaaaattga ccttccatac gatccagcaa tccctctcct gggaatatac ccagaggaac 5520 ataaatcatt ctacmaaaaa gacacctgca catgcatgtt tattgccgca ctattcacaa 5580 tagcaaaaac atggaaccaa ccctactgcc catcaaaagt ggactggata aagaaaatgt 5640 ggtacatata cacgatggag tattatgcag ccataaagaa gaacaaaatc atggacttcg 5700 cagcaaattg gatggagttg gagaccatca tactaagtga actatcacag aaacaaagaa 5760 ctgaatacca catgttctca ctcataagtg ggccctgaac attagtcaca ttagcacaag 5820 ggagggatcg gcagtcactg ggcactgcta ggggagggag ggagggagaa ggtgggggca 5880 cgtgacaaca caaagggtac ctggtccact accaggggga gtgagggcca gttagaagcc 5940 caaacttcac ctgcatgcag cctacttatg tagctaatcc tcacttgtac cccacatccc 6000 tataataaaa taaaatttta aaaataataa aagaa 6035 // ID L1-4_TS repbase; DNA; PRI; 6258 BP. XX AC . XX DT 11-APR-2010 (Rel. 15.05, Created) DT 11-APR-2010 (Rel. 15.05, Last updated, Version 3) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-4_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-6258 RA Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(5), 770-770 (2010). XX DR [1] (Consensus) XX CC >89% identical to consensus. CC This sequence was derived from sequence data generated by the CC Broad Institute Mammalian Genome Project. 
XX SQ Sequence 6258 BP; 2469 A; 1399 C; 1188 G; 1194 T; 8 other; gggattaaga tggccgacat gaggcagcta gagtgagttt ctcctacaga gtaaaccaag 60 atagagtttc ctcagcaccc agagaggaac ggatccagca gcatgcccca cgagtctcca 120 aaggagtgat aataaccact tagagcgaaa aaagaagaaa aaaatccacc gacctgggct 180 ggaactggcc ggactctcgc ctgnaccacc acagtccacc gcagcctccg gacgcagtgg 240 ggaaggtgag tgaggggaat tcaggagttc tgtgctcctg ccgcgaactt gcagggctcc 300 ctgagccttg cgggacccca gtccctcacg ggaaactgcc ttacccacag ccgtgctctg 360 ccgtggagtg gtgcaggaaa aaagccttga gcagcgccag ccgccatttc ccaggaactc 420 aacccttgca gggtggagcg gctgctttcc agccggcctg ggaggggacg ggccccaagg 480 gcaaactccg cggctgcaga cagttcctgg ctgggcgggt tgcctgagaa cacctgaggc 540 cctgaggccg gctgcacctt gacattcaga tcgcagattg cctgaggcca aggcgggagc 600 cgggtgggac gcgttgccat agcaacctct cactcccctt gtgaactctt ctgtgcctgc 660 tctgcacaac agcgttaccc cagtggcctg ggactccctc ctaatcccca ctggggtagc 720 taagagtgca cacttgtggg gtgcttcagc tcctttcccc cgcactcccc cgaaggctca 780 aaacagacag agaacttggg aaatccccag agccccgccc tcggcctagg ctggctgggc 840 acttccttgg agcaacactc gccaaggaga aaccctacag ccaccatcgc agctggcttt 900 ctcccacaag cgccacctcc tggccggagg tcaacagtac agcccaccac atcggatata 960 atacctcagc tttggggaag gcctgatccc aaacgccagc taacaacact ttccaagcca 1020 ctccggccac tcaggacgcc gtgagccagt gcaggtacac agcacagggc tgccacanct 1080 ggcaattgag aaactcacca caccaaggct atatataacc aagggaaccc tacagtgcct 1140 acgtcagccc cctgccaacc tcaaataaga agaaatagtc tacccaaatg agaaggaacc 1200 agaaaaataa ttctgacaat atgaaaaaac agagttctcc aacaccccca aaagaccaca 1260 ccaactcttc agcaacgaac cctaaccaaa atgaaatttt tgaaattcca gacatagaat 1320 tcaaaaggtt gattataaaa ttggtcaatg aacttcagga gaaaattgaa aaccaacata 1380 aagaaattta aaagattcag gacatggatg aaaaattctc caaagagata gacaccttaa 1440 aaaaaagctg aaattctgga aatgaaagaa tcatttaggg aagtacaaaa tgcagtggaa 1500 agttttaaca gtagactaga acaagcagaa gaaaggatct cagaacttga agacaaggct 1560 tttgaattaa cccaatcaga caacaataaa gaaaaaagaa tcaagagaat ggaacaaagt 1620 ctccaagaaa tgtgggatta tgtaaaacgt 
ccaaacctaa gaatcatagg tgttcctgag 1680 gaagaagaga aagcaacagg tttggaaaac ctatttgagg gaataatgga ggaaaacttc 1740 cctggccttg ccagagatct aaacatccag atacaagaag cccaaagaac ccccagaaga 1800 ttcattgcaa aaaggtcatc cccaaggcat ataatcatcc ggttatgcaa agttaatgtg 1860 aaggaaagaa ttttaagagn agtgagacaa aagcatcaaa taacttacaa aggtaaacct 1920 atcagactaa cagcagactt ctcagaagaa accctacaag ccagaaggga ttggggcccc 1980 atcttcaatc tcctcaaaca gaataacngt cagccaagga ttttgtatcc cgcaaaacta 2040 agtttcataa atgaaggaga aatgaagtcc ttcgcagaca agcaaacgct gagggaattt 2100 gtcaccacta gaccggctct acaagaaata ctcaaaggag ttctaaatac tgaaacaaaa 2160 ggtcgaaata caccagtaga aaaatgctgg aaatcataaa gttcacaggc tttatggaac 2220 accaacacaa tagagaaaat aaataaataa agcaaccaga taacaaccag tatgatgaat 2280 agaatagtac cttacatatc attattaact ctgaacgtaa atggtctcaa tgccccacta 2340 aaaagatata gactggcaga atggataaaa aaacacaacc caaatatctg ctgccttcaa 2400 gagacccact taactcacaa agactcctat agactcaagg taaaggggtg gggaaaaata 2460 ttccacgcaa atggaaacca aaaacgagca ggagtagcca ttctcatatc agataaaaca 2520 gactttaaat taacaacagt aaaaaaagac aaagatggct attatataat gataaaggga 2580 tcaatccaac gagaagacat aacaatttta aatatatatg cgcctaacac cagagccccc 2640 agctttataa aacaaattct actagaccta aaaaaagaga tagacagcaa tacaataata 2700 gtgggagact tcaacactcc actgacagca atagacagat catcaaggca gaaagtcaac 2760 aaagaaacat tggagttaaa cnggactcta gagnaaatgg aactaacaga catctacaga 2820 acattccatc ccaaaactac agaatataca tttttctcat cagcacatgg aacattctcc 2880 aagatagacc atatgatagg ccacaaatca agtctcaata aattcaaaaa aatcgaaatc 2940 atatcaagca ccttctcaga ccacagtgga ataaaactag aaattaactc caagaagaac 3000 tcgcgaaacc aaacaaaaac atggaagtta aacaatctgc tcctgaatga tccttgggtc 3060 aacaatgaaa tcaagatgga aattttaaaa attcctcgaa acgaatgaca acaatgaaac 3120 aagttatcaa aacctctggg acacagccaa agctgtgctc agaggaaaat ttatagcgct 3180 taaggcctac attaataaga ctgaaagatc gcatatcgac aacctaacgt cacacctcaa 3240 ggaactagaa aaacacgaac aaaccaaacc caaagctagc agaagaaaag aaataacgaa 3300 gatcagagca gaactaaatg aaattgaaat taaaaataca 
aaggatcaat gaaacaaaaa 3360 gttggttctt tgaaaagata aacaaaattg atagaccgct agctagatta attaagaaac 3420 gaagagagaa gactcaaatt agctcaatca gaaatgaaaa tggagacatt acaactgata 3480 cgacggaaat tcaaaagatc atccgagact attatgaaca cctctatgca aacaaactag 3540 acaatctaga ggaaatggat aaattcttgg aaacatacaa ccccccaacc ttgaatcagg 3600 aagaaataga aaccctgaac agaccaataa cgagtagcga gattgaaaca gtaattagaa 3660 gtctcccaac aaaaaaaagt ccaggacctg atggactcac agctgaattc taccagacct 3720 tcaaagaaga actggtacca attttactga agctattcca caagattgaa gaggagggaa 3780 tcctccctaa ctcattctac gaggccagta tcactctgat accaaagcca ggaaaagaca 3840 caacaaaaaa gaaaactaca gaccaatatc ccttatgaac atagacgcaa aaatcctcaa 3900 caaaatatta gcaaaccgaa ttcaacagca cataaaaaat aatacaccac gaccaagtgg 3960 gttttatccc agggatgcaa gggtggttca acatacgcaa gtcaataaac gtgatccact 4020 acataaacag aattaaaaac aaaaatcata tgatcatttc aatagacgca gaaaaagcat 4080 ttgataaaat ccagcatccc ttcatgataa aaaccctcag caaaataggc atagagggaa 4140 cattcctcaa aataataaaa gccatatacg acaaacccac agccaatgtc atcctgaacg 4200 gagaaaagtt gaacgcattc cccctcagaa ctggaacaag gcaaggatgc ccactttcac 4260 cactnctatt caacatagta ctggaagtcc tagccagagc aatcaggcaa gaaaaagaaa 4320 taaagggtat ccaaattgga aaagaagaaa tcaaactatc tctgtatgcc gacgatatga 4380 tcttatacct agagaatcct aaagactcct ccaaaagact cctggacttg ataaatgaat 4440 tcggtaaagt ttcaggatac aaaatcaaca cacacaaatc agtagcactg ctatacacca 4500 atagcgacca agctgagaat caaatcaaga actcaattcc atttacaata gcagccaaaa 4560 agctaaaata cctaggaata tatttaacca aggaggtgaa agatctctac aaggagaact 4620 acaaaactct gatgaaagaa attgcagagg acacaaacaa atggagaaac atcccatgct 4680 catggattgg aagaatcaac attgttaaaa tgaccatatt acccaaagta atctacagat 4740 tcaacgcaat ccctatcaaa ctaccaacgt catttttcac agaattagaa aaaaaaatcc 4800 taaaattcat atggaaccaa aaaagagcca gaatagccaa agcaatccta agcaaaaaga 4860 acaaagctgg aggcatcaca ttacctgact tcaaattata ctacaaggct gtagtaacca 4920 aaacagcatg gtactggtac aagagcagac acatggacca atggaacaga atagagaatc 4980 cagaaataaa cccaaatacc tacaaccaac tgatctttga caaagcagac 
aaaaacatac 5040 actggggaaa ggacacccta ttcaacaaat ggtgctggga aaattggata gctacatgca 5100 gaaaaatgaa acttgatccc tatctctccc catatacaaa aattaactcg agatggatta 5160 aagacttaaa tgtcagacct gaaaccataa aaatcctaga agaaaaccta ggaaaaactc 5220 ttctggacat cggcctaggc aaagaattta tggctaaaac cccaaaagca aatgcaacaa 5280 aaacaaataa ataagtggga cttaattaaa ctaaaaagct tctgcacagc aaaagaaata 5340 atcaacagag caaataggca acctacagaa tgggagaaga tattcgcaaa ctatacatct 5400 gacaaaggac taatatccag aatctacaaa gaactcaaac aaatcagcaa gaaaaaagca 5460 aataacccca ttaaaaagtg ggcaaacgac atgaacagac atctctcaaa agaagatata 5520 caaatggcca acaaacatat gaaaaaatgc tcaacatcac tcatcatcag ggaaatgcaa 5580 attaaaacca caatgagata ccaccttaca ccagtcagaa tggccattat taaaaaaaaa 5640 aaaaaataga tgctggcgcg gatgcggaga aaagggaacg cttatacact gttggtggga 5700 gtgtaaatta gtacaacctc tatggaaagc agtatggaga tttctcaaaa actaaaagta 5760 gatctaccat ttgatccagc aatcccacta ctaggtatct acccaaggaa cagaggtcgt 5820 tatataaaaa agacacctgc acccgcatgt tcatcgcagc acaattcaca attgcaaaga 5880 tgtggaatca accgatcagt ggataatgaa aatgtggtat atatacacca tggaatacta 5940 ttcagccatc aaaaagaatg aaataatgtc ttttgcagca acttggatgg agctggagan 6000 catcatccta agtgaactga ctcaggaaca gaaaaccaaa caccgcatgt tctcactcta 6060 tagtgggagc taaacggtcg caaagataga gaatggtata atggacactg gagactcaga 6120 aaggggaggg ggaaggggtg gagggcaaaa actacctatg ggacaatgtc acgggagggg 6180 gccaaaatgg tgggtacact taaagcccag accaaaacta tttgtacccc ccaaagctat 6240 tgaaattaaa aaataaaa 6258 // ID PMER1 repbase; DNA; PRI; 90 BP. XX AC . XX DT 25-APR-1997 (Rel. 2.03, Created) DT 23-JAN-1998 (Rel. 3, Last updated, Version 2) XX DE Putative non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; URR1; PMER1. XX OS Strepsirrhini OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates. XX RN [1] RP 1-90 RA Smit A.F.; RT "PMER1."; RL Direct Submission to Repbase Update (30-NOV-1996). 
XX DR [1] (Consensus) XX CC A prosimian-specific MER1-type DNA transposon fossil similar to CC bp 1-54 CC and bp 202-236 of PR/URR1 in rodents. XX SQ Sequence 90 BP; 19 A; 26 C; 22 G; 23 T; 0 other; tagagcagtg gttctcaacc ttcctaatgc cgcggccctt taatacagtt cctgtgggtc 60 gcgacccaca ggttgagaac cgctgctata 90 // ID L1A_Mim repbase; DNA; PRI; 6545 BP. XX AC . XX DT 07-JAN-2010 (Rel. 15.01, Created) DT 07-JAN-2010 (Rel. 15.01, Last updated, Version 3) XX DE LINE element - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1A_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-6545 RA Jurka J.; RT "LINE1 elements from the mouse lemur."; RL Repbase Reports 10(1), 10-10 (2010). XX DR [1] (Consensus) XX CC >98% identical to consensus. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX FH Key Location/Qualifiers FT CDS 1659..2378 FT /product="L1A_Mim_1p" FT /translation="MKNQFKELQNTVESLKNRVDQTEERISELEDNTLQLN FT KSVTEIEQRNKRKEQSLQELWDYVKKPNVRVIGLPEGEEDNTQGLDKLFED FT IIEENFPGLAQNLDIQVQEAQRTPGRFNANRKTSRHAVIRLTKVSTKEALL FT RAVRQKKQVTYKGKPIRITSDFSNETLQARRDWGPILTLLKQNNAQPRILF FT PAKLSFVYEGEIKTFSDKQRLREFTKTRPALQEVLKTALRTEHHNNNPRI" FT CDS 2423..6295 FT /product="L1A_Mim_2p" FT /translation="MAQDRNHSNNIQPNRMISNLPYLSVLSINVNGLNSPL FT KRHRLAEWIRKYRPSICCLQETHLTCKDAYRLKIKGWRSIFQANRSQKKAG FT VAVLISDDLVFKPTKVVKDKEGHYIMVKGTVQQEEITILNIYAPNLGAPRF FT IKQTLLELSKWINSNSIIAGDFNTPLTARDRSSKQKINKEIMDLNKTLEQL FT GLTDIYRTFYPKSTEYTFFSSAHGTFSKIDHILGHKENLKKFKKIEIIPCT FT FSDHSGIKLEINPNRNSHFYTKTWKLNNLLLNDYFVNEEIKTEIKKFYEEN FT DNGETSYQLLWDTAKAVLRGKFISINAYNQKTRRSQIDNLMKRLKELEKEE FT QTNPKPSRRSEINKIKSELNEIENREAIQEINKTKSWFFEKINKIDTPLAK FT LTKSRKEKSLISSIRNKKGDITTDPKEIQDTIYEYYKNLYAHKLENVEEMD FT KFLETHSLPRLNQEEIDSLNRPISTAEIETAIKNLPKKKSPGPDGFTPEFY FT HTYKEELVPILQKLFHNIEKNGNLPDTFYEANITLIPKPGKDATKKENYRP FT ISLMNIDAKIFNKILANRIQTLIKKIIHHDQVGFIPGMQGWFNIRKSINAI FT HHINRSKNKDHMILSIDAEKAFDKIQHPFMIRTLKKIGIEGTYLKMIQAIY FT DRPIANIILNGERLKSFPLRTGTRQGCPLSPLLFNIVLEVLATAIRQENGI FT KGIQIGAEEIKLSLFADDMILYLENPKDSTKKLLELINEFSKVSGYKINTQ FT KSEAFIYANNNLIENQIKDSIPFTIATKKLKYLGIYLTKEVKDLYRENYET FT LRKEIAEDVNRWKSIPCSWIGRLNIIKMSILPKLIYRFNAIPIKIPSAFFT FT DIEKIILRFVWNQRRPRISRAILGNKNKMGGINMPDIKLYYKAVVIKTIWY FT WHKNRNIDQWNRCENPDIKPSSYSHLIFDKADKNIRWGKESLFNKWCWENW FT IATCRRLKQDPHLSPLTKTNSRWITDLNLRYETIRTLEEKVGNTLLDIGLG FT KEFMKKSPKAITAATKINKWDMIKLQSFCTAKEIVMKVNRQPTEWEKIFAS FT YASDKGLITRIYLELTKIRKKKSNNPIKKWAKDLNRNFSKEDRRMANKHMK FT KCSTSLIIREMQIKTTMRYHLTPVRMAFIKKSPNNKCWRGCGERGTLLHCW FT WDCKLVQPLWKAIWRYLKAIQVNLPFDPAIPLLGIYPNDPVTLYKKDTCTR FT MFIAAQFIIARLWKQPKCPSIQEWINKMWYMYTMEYYSALRNNGDIAHLIF FT SWLELEPILLSEVSQEWKNKHQIYSPANWY" XX SQ Sequence 6545 BP; 2461 A; 1490 C; 1351 G; 1243 T; 0 other; ggggggggcg gagcaagatg gcggacgaat aacaccgcca gacagagggt ctctgcagaa 60 aagacagatt ctagcagaaa ctagaggaaa gaagcaagaa gacgagcata 
cagcggacaa 120 gggccggaag gaggggtacc tgagaccccg ggagactcca cgggaggagg ctgcggagga 180 gaactggagg ctgagaccac cggagcagcc cggagaccag cggcaagggt aggtggattt 240 gctgtttccc ctcccctgca ttcgggactg ctggtgggct ccccagcggg tggagagacc 300 tgcggacacc agcccagaga ctgccgccgc ctgccaacgg tgagcctgta gcagacgtgg 360 caccaggttc ccaacttcct ccgggcacct ccgtgtgcac ggacccgagc cgcgcggcag 420 gcgccatatt gcctcctcct cccctccgcc gaccctaccc gcggctgccc agagagacaa 480 tacagccacc agccggaggc acctccaggg aacgggacct tcccgtttgg gaccccgccc 540 gccctcccag gtgctgctgg caccgtgttc ccaggagaac ggtgccgact cagaggctga 600 gagacataga cccagcttgg gctccctgtg ggtgaattag gaccggaaat cctctccctg 660 gtgggaatac agtttgaact ctgggaccca gaggtcggac ctgcagacca gatcccctgc 720 accgagggct agcattgccc ggggcacaga agggttatac gtgaacagcc tactgaggtc 780 tgtgtgcctc caggggcgga tcggcgtcct agagggcaac cctcctccca ggaggaggcc 840 gtgcgcccaa cccaggtggc gttcctgtgc agggaacctc cccgccggca tcacagtccg 900 gggaggcctg gtggcttgtg gtctggcctg ctggcagagg cccaggagta gctgcggagt 960 tggggagggt ggaaagaagc gaggcctgct gcagactgcg ggtctcagac agccccaccc 1020 ccacacccag actttctggc tgagcgggac cattccagcc ccgccctgac agctttccct 1080 ggaagcagag aacagaactt tgacccctgc taacggcctg agggcaggct tacccaaccc 1140 agctccgccc agaacgagag ctgataacag gactcaaaat caacaccata gcctgttcct 1200 ccaagcaaac gccacctact gacagggacg gcatcttgca cagcctttcc acggcaccca 1260 ctgactcaat atacagggag tggtccaatt tcacccacag gcaccaccta acgcctcaga 1320 aactaaacaa ggtgtgtgaa tacccaaaca ataacctaag gaaagaaaca acaactgatc 1380 gacatgggaa gaaatcagcg aaagaactca ggaaatatga agaaccaaac ggaaaacaca 1440 cccccaaaga ggagcaccag ccccctagaa acggacaccg accaaaatca ggcaaccaat 1500 atgacagaag aggaatttcg tatgtggatc ataagaacac tcacccagct gcaacaacaa 1560 ctcaataacc aacaccaaga aaccacaaaa agcctccagg atatgggacg gccaaagaaa 1620 tagacacagt gaagaaaagt gtaaccgaac tcctggaaat gaagaatcaa ttcaaggaac 1680 tacaaaatac agtggaaagt ctcaagaaca gggtagatca aacagaagaa agaatctcag 1740 agcttgaaga taacaccctc caattaaata aatcagtcac agaaatagag cagagaaaca 1800 agagaaaaga 
gcaaagccta caagagctgt gggattatgt gaagaaacct aatgtgaggg 1860 tcatagggtt accagaaggg gaagaagaca acactcaagg gttggacaag ctgtttgaag 1920 atataataga ggaaaatttc ccaggccttg ctcaaaatct tgatatacaa gttcaagaag 1980 ctcagaggac ccctgggaga ttcaacgcaa acaggaagac gtcacgacat gcagtcatca 2040 gactgaccaa agtatcaact aaagaggccc ttctaagagc tgtaagacaa aagaagcaag 2100 taacatacaa gggaaagcca attcgaataa catcagactt ctctaatgag actttacaag 2160 caaggagaga ctggggcccc attctcactc ttttgaaaca aaacaatgcc cagcctagaa 2220 tattattccc tgcaaaacta agcttcgtat atgaaggaga aataaaaaca ttctcagaca 2280 agcaaaggct cagagaattc accaagacaa gaccagccct acaagaagta cttaaaacag 2340 cgttacgcac ggaacatcat aataataacc cacgaatata aaaacaacca aaacccaaag 2400 atattaaagg ccagatatta caatggctca agacagaaat catagcaaca acatccaacc 2460 caacagaatg atcagtaatc taccttacct atcagttctc tcaataaatg tgaatggctt 2520 aaactctcca ctcaagagac ataggctggc tgaatggata agaaaataca ggccaagtat 2580 atgctgtctt caggaaacac atctaacctg caaggatgca tatagactaa aaataaaagg 2640 gtggagatca atattccaag caaatagaag ccaaaagaag gctggtgtgg cagttctaat 2700 ttcagacgat ttagttttta aaccaacaaa agtagtaaaa gacaaagagg gtcattatat 2760 aatggtgaag ggcacagtcc aacaagaaga gataacaatt ttaaatatat atgcacccaa 2820 cttaggtgca cccagattca taaagcaaac cttactggag ctaagcaaat ggattaatag 2880 caactccata atcgccggag atttcaacac cccactgacg gcacgagaca gatcctccaa 2940 acagaaaatt aataaagaaa taatggactt aaacaaaact ctagaacaat tgggtctgac 3000 agacatctac agaacattct acccaaaatc cactgaatat acgttcttct catcagctca 3060 cgggacattc tctaagattg accatatcct aggacacaaa gaaaatctca agaaatttaa 3120 aaaaatagaa atcataccat gtaccttctc agatcacagt ggaataaaac tagaaatcaa 3180 ccctaacaga aactcacatt tctacacaaa aacgtggaaa ttaaacaacc tcctactaaa 3240 tgattacttc gtaaatgaag aaatcaagac ggaaataaaa aagttctatg aagaaaacga 3300 caatggagag acaagttatc aactcctctg ggacacagct aaagcagttc tgagaggaaa 3360 gtttatctcc ataaatgcct ataaccaaaa gacaagaaga tcacaaatag acaatctaat 3420 gaaacgactc aaagagctgg aaaaagaaga acagaccaac cccaaaccca gcagaagaag 3480 tgaaatcaac aagatcaaat 
cagaactaaa cgaaattgaa aacagggaag ctattcagga 3540 gattaataaa acaaaaagtt ggttctttga aaaaataaac aaaattgaca caccattggc 3600 taagctaacg aaaagcagaa aagagaaatc tctaataagc tccatcagga ataaaaaagg 3660 agatatcaca actgatccca aagagataca agatacaatt tatgaatact acaaaaatct 3720 ttatgcacac aaactggaaa atgtggagga aatggacaaa tttctagaaa cacacagcct 3780 ccctaggctc aaccaggaag aaatagattc cctgaacaga ccaatctcaa cagctgaaat 3840 agaaacagca attaaaaatc tccctaaaaa gaaaagtccc ggtccagatg gcttcacacc 3900 tgaattttac catacttaca aagaagaact agtacctatc ttgcagaaac tattccacaa 3960 catcgagaag aacggaaacc tccccgacac cttttatgaa gcgaatatta ctctgatacc 4020 aaaaccagga aaggatgcaa caaaaaaaga aaactacaga ccaatatccc taatgaatat 4080 agatgcaaaa attttcaaca aaatcttagc taaccgaatc cagacactta tcaaaaaaat 4140 aatccaccac gaccaagtgg gcttcatccc agggatgcag ggatggttca acatacgtaa 4200 atctataaat gcaattcacc acataaacag aagcaaaaac aaagaccaca tgattctttc 4260 aatagatgca gaaaaagctt ttgacaaaat tcaacaccct ttcatgatac gaacacttaa 4320 gaaaataggc atagaaggga catacctaaa aatgatacaa gccatatatg acagacccat 4380 agccaacatc atactgaatg gggaaagatt gaaatcattc ccacttagaa ctggaaccag 4440 acaaggctgc ccactatctc cacttctgtt caacatagtg ctggaagtct tggctacagc 4500 aatcagacag gaaaatggaa tcaaaggtat ccaaataggg gcagaagaga tcaaactttc 4560 actgtttgct gatgatatga tattgtatct agaaaacccc aaggattcaa ccaagaaact 4620 cctggaactg atcaatgaat ttagtaaagt ctcaggatac aaaatcaata cacagaaatc 4680 agaggcattc atatacgcca acaacaatct aattgagaac caaatcaaag actcaattcc 4740 cttcacaata gcaacaaaga aattaaagta cctaggaata tacttaacca aggaggtaaa 4800 agacctctac agggagaact atgaaacact gaggaaggaa atagcagagg atgtaaacag 4860 atggaaatcc ataccatgct cgtggatcgg cagactcaat atcatcaaaa tgtctatact 4920 acccaaactg atctacagat tcaatgcaat acctattaaa atcccatcag cattcttcac 4980 agatatagaa aaaataattt tacgcttcgt atggaaccaa agaagacccc gaatatcaag 5040 agcaattcta ggcaacaaaa acaaaatggg aggcattaat atgccagata tcaaactata 5100 ctacaaagct gtagtaatta aaacaatatg gtattggcac aaaaacagga atattgacca 5160 gtggaacaga tgtgagaatc ctgatataaa 
accatcctca tatagccatc taatctttga 5220 caaagcagac aaaaacatac gctggggaaa agaatccctc ttcaataaat ggtgctggga 5280 aaactggata gccacctgta gaaggctaaa acaggaccca cacctttcac ctctcacaaa 5340 aaccaactca cgctggataa cagacttaaa cctaaggtat gaaactatta gaactctaga 5400 ggaaaaagtt ggaaacactc tcctagacat cggcctgggc aaagagttta tgaagaagtc 5460 cccaaaggca atcacagcag caacaaaaat aaataaatgg gacatgatca aactacaaag 5520 cttctgcaca gccaaagaaa tagtcatgaa agtaaacaga caacctacag aatgggagaa 5580 aatttttgca tcctatgcat ccgataaggg actgataact agaatatact tagaactcac 5640 gaaaattagg aagaaaaaat caaataaccc cattaaaaag tgggcaaagg acttgaacag 5700 aaatttttct aaagaagaca gaagaatggc caacaaacat atgaagaaat gctcaacatc 5760 tctaatcatc agggaaatgc aaatcaaaac cacaatgaga tatcacttaa ccccagtgag 5820 aatggccttt atcaaaaaat ctccaaacaa taaatgctgg cgtggttgcg gagagagagg 5880 aacactccta cactgctggt gggactgcaa actagttcaa cctctgtgga aagcaatatg 5940 gagatacctt aaagcgatac aagtgaatct accatttgat ccagcaatcc cattgctggg 6000 catctaccca aatgatccag tgacactcta caaaaaagac acctgcactc gaatgtttat 6060 agcagcacaa ttcataattg caaggctgtg gaaacagccc aagtgcccat caatccaaga 6120 atggattaat aaaatgtggt atatgtatac catggagtac tattcagctc taagaaacaa 6180 tggtgatata gcacatctta tattttcctg gttagagctg gaacccatac tactaagtga 6240 agtatcccaa gaatggaaaa acaagcacca gatatattct ccagcaaact ggtattaact 6300 gagtagcacc taagtggaca cataggtgct acagtaatag ggtattgggc aggtgggagg 6360 ggggaggggg gcgggtatat acatacataa tgagtgagat gtgcaccatc tgggggatgg 6420 tcatgatgga gactcagact tttgggggga gggggggaaa tgggcattta ttgaaacctt 6480 aaaatctgta cccccataat atgccgaaat aaaaaaaata attaaaaaaa aaaaaaaaaa 6540 aaaaa 6545 // ID LTR13_Mim repbase; DNA; PRI; 589 BP. XX AC . XX DT 11-NOV-2009 (Rel. 14.11, Created) DT 11-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR13_Mim. 
XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-589 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2969-2969 (2009). XX DR [1] (Consensus) XX CC >89% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 589 BP; 139 A; 168 C; 137 G; 145 T; 0 other; tgttaggtag taggagggat tcggtctcct agggacgaaa gtgggtgctt tgcctggtgc 60 aattgcccga gagggcggcc cccatcttgg tcctgcccca ccctaactcc agactgccat 120 acctggggag gagggaatac cctacgaatc gccaatcaaa cttagcgcgc cagctgcaaa 180 agaatgcaga ggactctcta gcccacgggc caagcgcata ggtaccatag agtcggaaat 240 gacccacatg cgtagtaatg caactcccaa ctgtccaatc aaagtggttc catggtgacg 300 tctgagccac gagggaggtc ccaatccggg cactataaaa caagacgcag accataacca 360 ggcccttttg tgcccttcct tttgctctcc ctttgctgcg acaatgggtc cggtcgtcat 420 ctgtggcata tgtatcatgc tctgtaaacc tatatcttgc tcttgcccta taatcctatc 480 tttctctcct caataaacct cattttcatg cttgccttac tttggtgtgt ctggtcattc 540 ttcggccatg agcgcaccaa gaaccgacat ttcagactga aacctgaca 589 // ID hAT-2N1_TS repbase; DNA; PRI; 674 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.1, Created) DT 14-DEC-2009 (Rel. 14.1, Last updated, Version 2) XX DE hAT-2N1_TS is a family of non-autonomous DNA elements found in DE Tarsius syrichta mobilized by hAT-2_TS. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-2_TS; hAT-2N1_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-674 RA Novick P.A., Smith J., Ray D. 
and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). XX DR [1] (Consensus) XX SQ Sequence 674 BP; 114 A; 148 C; 182 G; 223 T; 7 other; caggggtcct caaactacgg cccgcgggcc acatgcggcc cgccgaggac atttatccgg 60 cccaccgggt gtttttgccd ccgctgcctg tcctgcctag cagccgactb gtccgggccc 120 gcagtgcgca tgtgtggaat gtgcgtccgc actctccgac tcccctcctt ctctctgtct 180 ctcgactcct cctctcagtc tcgggtgtga tcggacgagt cacgagcttg cctgtgcaga 240 gcctgctgct gcctgaggac cgaggtaaga acaagttagg attttttttt ttttttttga 300 agttaggagg tctdtttttt ttttttaatt ttgcagttag tagggccttt tttttgcggt 360 taaggggggc cttttttthc tgaagttagg aggtctattt ttttttttgc agataggggg 420 cgcctttttt tttgaagtta ggagagcbtt tttttttgaa gttaggagag cctttttttt 480 gaagttagga gagccttttt ttttaagttg gttagttggt tgggggtggt ttctaggggg 540 gttgcatcac agtgataacg caaatagtca gygctcagtg ctaatgcaaa tggtttttta 600 aactatagtc cgcccctcca acggtctgag ggacagtgaa ctggccccct gttthaaaag 660 tttgaggacc cctg 674 // ID Garnel1 repbase; DNA; PRI; 94 BP. XX AC . XX DT 18-OCT-2009 (Rel. 14.12, Created) DT 18-OCT-2009 (Rel. 14.12, Last updated, Version 2) XX DE SINE element - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Garnel1. XX OS Otolemur garnettii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lorisiformes; Galagidae; Otolemur. XX RN [1] RP 1-94 RA Jurka J.; RT "SINE elements from the bushbaby genome."; RL Repbase Reports 9(12), 3114-3114 (2009). XX DR [1] (Consensus) XX CC The top youngest sequences are >93% identical to consensus. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project. XX SQ Sequence 94 BP; 25 A; 27 C; 32 G; 10 T; 0 other; gggcggcgcc tgtggctcaa aggagtaggg cgccggcccc atatgccgga ggtggcgggt 60 tcaaacccag ccccggccaa aaactgaaaa aaaa 94 // ID THE1B repbase; DNA; PRI; 364 BP. XX AC . 
XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 23-JAN-1998 (Rel. 6.4, Last updated, Version 3) XX DE Transposon-like human element long terminal repeat (THE1b DE subfamily) - a consensus. XX KW Non-LTR retrotransposon; MaLR family; THE1b subfamily; THE1B. XX OS Hominidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RP 1-364 RA Sun L., Paulson E.K., Schmid W.C., Kadyk L. and Leinwand L.; RT "Non-Alu family interspersed repeats in human DNA and their RT transcriptional activity."; RL Nucleic Acids Res 12, 2669-2690 (1984). XX RN [2] RP 1-364 RA Paulson E.K., Deka N., Schmid W.C., Misra R., Schindler W.C., RA Rush G.M., Kadyk L. and Leinwand L.; RT "A transposon-like element in human DNA."; RL Nature 316, 359-361 (1985). XX RN [3] RP 1-364 RA Smit F.A.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21(8), 1863-1872 (1993). XX DR [3] (Consensus) XX CC Replaces OFR sequence. XX SQ Sequence 364 BP; 74 A; 89 C; 88 G; 108 T; 5 other; tgatatggtt tggctgtgtc cccacccaaa tctcatcttg aattgtagct cccataattc 60 ccacgtgtcg tgggagggac ccggtgggag gtaattgaat catgggggcg ggtctttccc 120 gtgctgttct cgtgatagtg aataagtctc acgagatctg atggttttat aaaggggagt 180 tyccctgcac angctctctt gcctgccgcc atgtaagacg tgmctttgct cctccttcgc 240 cttcygccat gattgtgagg cctccccagc catgtggaac tgtgagtcca ttaaacctct 300 ttyctttata aattacccag tctcgggtat gtctttatta gcagcatgag aacggactaa 360 taca 364 // ID L1-2_TS repbase; DNA; PRI; 6474 BP. XX AC . XX DT 10-APR-2010 (Rel. 15.05, Created) DT 10-APR-2010 (Rel. 15.05, Last updated, Version 3) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-2_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. 
XX RN [1] RP 1-6474 RA Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(5), 768-768 (2010). XX DR [1] (Consensus) XX CC ~90% identical to consensus. ORFs corrupted by mutations. CC This sequence was derived from sequence data generated by the CC Broad Institute Mammalian Genome Project. XX SQ Sequence 6474 BP; 2436 A; 1463 C; 1219 G; 1344 T; 12 other; ggatttctgg caagatggca accagacagg tcagactgtg agagtctcca caaaagcaag 60 tgtccataaa gactctgtgt gagtaggcgt gtgctggtgg gtgagtgaaa gtccgtctga 120 ggggagactc gggttcatgc accgggcccc cctccgaagg tgactgcaag cccccaccac 180 cctgagattg gagccacagg cgtgcaggat catctgctgt tccaggactc tcctgagaag 240 agcctgccga atccagctga ttcccatcac ctggcgcccg gcccacacca tcacccgtta 300 tagcgaagca gtgggnccca gatcggtggg gagagccgga gagaaagccc catccagtga 360 ggagacgccg accgcagcca ttttctctgt cccgggacca ggcagagacc tgcattgccc 420 cgcccagctc cgcccagcct ccaacttggg acctaaggta atctcatcaa actgggagag 480 acagaagccc ctcccccatg ctagatcgga cacagacagg tcagatctga gtggcagact 540 tgacaggcag caagccggct gagtccagat ctgatccacc ctccttccac agcaggcctg 600 gggcatctga cccnagctgt gagcaaccct accagcctgc gatgagagct agagatcaga 660 gctgaaagga cgttggactt acaggaacga cctcagagcg agtttggttc cttgttttta 720 ttttttgtct gttttgctcg tttttgtttg ttttattttt tctctttctg tttttgataa 780 cgttttcgta cttgctttgt ttctgttatt ttactgattt ttttttctnt tctttagact 840 tcctgtttgt gtaggggaaa tacatcaata gggaggttgt tgttggcttg tgtgtttgta 900 tgattgtctg tatgttttag tttttttttt ttactctttt ctgtctttgt gtctcttttc 960 atttgttagt cggtacgtnt gtctgttttc cctcttactc ttctctctct ccaccttcct 1020 cttncccccc tccttccctt tccattcttt tcctttttct tacctcattt tgctttcagg 1080 gagcaaacct atacccccac tgcatggaca gattgcaagg cagtgggaag tgggttccag 1140 ggtacccata cacctgtcgg tggatactgg gatctgcaga tagattttga atcacagctc 1200 ccccctcaat taaatctcaa aactgcaaaa agtacagccc ccaacccaca gtccagcttc 1260 agaattgaac aaagccagag gatccacata aggaagaata ggagattgag caaacagaag 1320 taaaagcaac cccgccatcc ctgaacaaca gaattaaaga agggggggaa aaggggggga 1380 
ggaaaatcta cacaaatgaa gaaaaaccaa aagaagaata tgggtctcac acagacccct 1440 gggaggtcag acactgagaa aacagacttc ggaacgcaaa caatgaagag cccccagaat 1500 gactggtcnc anactacaaa cccagacatc aagacattaa tagagagaat aaaaagaatt 1560 gaggagagac aagaggaaaa taggaaggag ttgataactg agataacagc agtaaagaat 1620 actgtgantg aaataaataa caaactgata agcatggaaa gcagaattag ccaagcagaa 1680 gaaagaatct cagagcttga ggaccaaaat atagaactaa cccaaactgt caaaaacata 1740 gaaaagaagc ttaaaaagac agaacaaaac cttcaagaga tgagcgatta tttcaagagg 1800 ccgaacctaa gagtaattgg acttcctgag gcagaaagag agacagagac caccctggaa 1860 caaacattcc atgaaattat tcaagaaaac ttccctcatc tcatcagtga tgcgaaaatt 1920 caaacacaag agattcagag aacccctgca agacaacaaa tgagaagacc aactcccaga 1980 cacatagtaa ttcgcctaaa caaagtaggt ataaaagaaa aaatcctaaa ggcagcaaga 2040 gaaaaaggtc agactaccta ccggggaaga ccaattagaa tagcagcaga tttatctaca 2100 gaaacacttc aggctaggag agcttggagc ccaatcttca aagttctcaa agataaacaa 2160 tttcaaccaa gaataaccta cccagctaag ctaagcttca tcagtgaggg agaattaaaa 2220 tctttcccag acattcaatc cctaagaact tacgctgcct ctagaccacc tctacaagaa 2280 acacttaaga aagtattaaa cacagaagaa aaggaaaaaa gaacgacaac gttcttcaca 2340 agagtgcagg aaaaagattg aaaacacaca tgaatcaacc caaaaatcaa aagaaagaca 2400 acaaacaaac aggaacaaca acaactctat aagaacctca tgacagggat aaactctcac 2460 atttcaataa ttagcctgaa tgtgaatgga ctaaatgcac cactgaaaag acatagaatg 2520 gcaaaatgga taaaatatca tgaggcaaca atatattgtc tccaagagac tcatctcacc 2580 agaaaggaca ctcacagact caaagtaaga ggatgggaag caaaatttca ggcgaacgga 2640 acacaaaaga aagcaggagt tgcgatctta atatcagaca aaataccctt taagctatca 2700 aaaatttaaa aagatacaga aggtcactat ataatgataa aaggttcaat ccatcaacaa 2760 gaaatatcca tcctaaacat atatgcaccc aacataggag caccaacttt cataaagcag 2820 cttctaggca aacttaaaaa agatattgac tctaacacta tcatagctgg ggactttaat 2880 accccactca caaccctaga cagatcatca ggacaaaaaa tcagcaagga gatccggaac 2940 ctcaatgtga ctcttgacca aatggactta attgatacct acagaacact ccacccaacg 3000 accacagaat atacattcta ctcatcaccg catggaacgt actctaagat cgaccacatc 3060 cttggccata 
aatcaagcat aaacaaattt cataagattg aaattttgcc atgcaccttc 3120 tcagaccaca gtggaataaa aataaatatc aacaccaaca aagttccccc naaacccaca 3180 aagacatgga cactaaacag catgatgcta agcaactcct gggtcaacat ggaaatcaaa 3240 acagagatta aaagatacct ggaaacaaat gaaaatgaag aaacatctta ccaaaacctc 3300 tgggatgcca tgaaagcagt agtaagaggg gaattcatat ctctacaaac gcacatgaag 3360 aaaatggaaa gatcacaagt taacagccta acaagtcacc taaggaagct ggaaaagcaa 3420 gaccaccaaa accctaactt cagcagaaga atccagatca ccaaaataaa agcccaaatc 3480 cgggacatag aagacaaaaa gataatacaa aaaatcaatg aaacaaaaag ctggttcttt 3540 gaaaggataa acaagatcga tggtccccta gctagactga ccaagaaaaa gagagaaaaa 3600 gcccaaataa acacaatcag aaacacaaaa gatgaagtca catctgaccc tgaagaaata 3660 caaaagatta tcagagacta ctatgtacac ttgtatggaa acaaacttga aaacctaaag 3720 gaaatggagg actttctgac atcacacaac ctccctaggt tgaaacaaga agaaatcgag 3780 accctaaata gaccaataac aatccaggaa attgactatg tcataagaaa actacctaca 3840 aaaaaaagcc ctggaccaga tggctttcca gcagaattct acaaaacata caaggaggaa 3900 ctgataccaa tcctactgaa agtattccag gcgattgaga aagatggaac tctccccaaa 3960 tcattttatg aagctaacat cacattgata cccaagccag gtaaagaccc aacaaagaaa 4020 gagaactaca ggccaatatc cttgatgaac atagatgcta aaattctcaa caagatccta 4080 gcaaaccgga ttcaacaaca catctcaaaa atcatccacc acgaccaagt aggcttcatc 4140 cccgggatgc aaggctggtt caacattcgt aagaccataa acgtaattaa atacatcaac 4200 agacgtaaaa acaaaaacca catgattata tcattagatg cagaaaaagc ttttgataaa 4260 atccagcacc ccttcttgat aaaaaccctc gaacatctag gcatagatgg aacatacctc 4320 aaaatagtaa gagccatcta cgagaaaccc acagccagca tactgctaaa cagacagaaa 4380 ttggaaccat ttcccctgaa aactggaaca agacaaggat gcccactctc acccctcctg 4440 ttcaacatag ttttggaagt cctagctaga gcaatcagag aagagaaggc gatcaggggt 4500 atccaaatag gaaaagagga agtcaagtta tctctctttg cagacgacat gattgtgtac 4560 cttgaaaacc caagagaatc tgtcaaaaac ctccttgcac tgataaagga ctttggcaaa 4620 gtctcagggt ataaaataaa tgtgcaaaag acaatcgcat ttctatacac caataataaa 4680 caaacagaaa cccaaataaa aagcacaatt ccattcacaa tagccacaaa aaaaatgaaa 4740 taccttggca tcttcctaac 
cagagacgtg aaagaccttt acaatgaaaa ctacaaaaca 4800 ctgctcaaag aaatcaaaga tgacacaaac aagtggaaaa atattccatg ctcatggatt 4860 ggaagaatca acattgttaa gatgtccatt ctacctaagg caatctacag atttaacgca 4920 atacccatca aattaccagc aacattcttc tcagacctag aaacaacaat acaggaattc 4980 atatggaaac ataaacgacc aagaatagcc aaaacaatcc tcagcaaaaa aaaacaaagc 5040 aggaggtatc acactcccag actttaaact ttattataaa gctacaataa tcaaaacagc 5100 ctggtattgg tacaagaaca ggcatataga ccaatggaat agaattgaga ttccagaagc 5160 aaaacctcaa tttctcaacc aactcatctt cgacaaagcc tccaccaccn accactgggg 5220 agaggagaac ctattcagta aatggtgctg ggaaaactgg ctgaccacat gcagaagatt 5280 gaaacaggac ccctacctat ccccatacac aaaaattaac tctaaatgga tcagagacct 5340 aaacgtaaaa cctcaaacta taagaacctt agaaaatgca ggagacactc ttatggaaat 5400 tggaactggc aaccaattcc tgatcaaaac ccgaagtgcc caggccataa gagataagat 5460 agacaagtgg gacctcatca aactgacaag cttctgcaaa gccaaagaaa ccatcaagag 5520 agcagggaga cagcccacag actgggaaaa aatatttgcc aactccatgt ctgacaaagg 5580 cctaacatct aggatctaca aggaactcaa acgtgctgaa aagaaaaaaa caaacagccc 5640 cattacaaag tgggcaaaag atatgaatag acagttctca aaagaagaca tacgagcagc 5700 caacagatac atgaaaaaat gttcagcctc actagtcatc aaggagatgc aaattaaaac 5760 tacactgaga taccacctaa ctccagtcag aatggccatc atcaataact caaaaaataa 5820 cagatgctgg agaggatgtg gcgaaaaggg aacacttcta cactgttggt gggagtgtaa 5880 actagtacag cctctgtgga aaacagtgtg gcgattccta aaagttctaa aaatcgacct 5940 tccatatgac cccgcaatcc ccctactggg aatatacccg gaagaactca aatcactcta 6000 taaaaaagat acctgcacac gtatgtttat cgcagcattg ttcacaatag caagaacatg 6060 gaaccaaccg tgctgcccat caaaagagga ctggattnaa aaaatgtggt acatatacac 6120 gatggaatat tacgcagcca taaagaagaa caaaatcatg aacttcgcag caacctggat 6180 ggaattagag tctataatac tgagtgatct ttcacagaaa caaagaactg agtatcacat 6240 gttctcactc ataagtggac cttgaacaac taccgtaata ctataagaaa aggattgaca 6300 gtagcgggaa actgcggggg ggaggggagg gaggtgggat tgacagtagt ngaaatctgc 6360 ctggggaagg gggggcacac ctcatcaaca agggtacctg cacaatgaat atttgtatac 6420 ctaaccctga attgtacccc acaattttaa 
aataaaaaat attgattaaa aaaa 6474 // ID LTR16_Mim repbase; DNA; PRI; 573 BP. XX AC . XX DT 06-NOV-2009 (Rel. 14.11, Created) DT 06-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR16_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-573 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2977-2977 (2009). XX DR [1] (Consensus) XX CC >90% identical to consensus. 4bp tsd. CC Similarity to LTR16_OG from bushbaby. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 573 BP; 171 A; 139 C; 93 G; 170 T; 0 other; tgaaggagct caggaaattt caccccaaaa tatgactcct tggtataaag aatattttga 60 attaaaggcc attcaagatc aaaaagcatt ggagggggct ttccctctat ctgcataaac 120 cggactgacc aatcaaaaga tcaaaggggc aattgacttc ccttcctctc cctgttatct 180 caatatattg caggaaggag gatcaagaat gcaaccagac ctggcccaaa tcatttaaat 240 ataatacctg tctctcaggt taatttaatt tgctcaggtt aatttaattt gcaaagagaa 300 tcatttacaa gtcaatctgt ttcccccatc catttatcct ccctagcacc atttgttccc 360 cctaaacaga attacctgta ctcctcatct ccccctcccc tccaaaagga caggtataaa 420 aatatctgaa cttcattggg atattgggta atcactctgt gattctcccc atgtgcatgg 480 taaataaacc tttctcttat taatctgcct taattgtgag ttgatctttc agcgaacttt 540 cgggggaaag ggaagtttcc cctcacccct aca 573 // ID LTR14C3_Mim repbase; DNA; PRI; 521 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14C3_Mim. 
XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-521 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2974-2974 (2009). XX DR [1] (Consensus) XX CC >94% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 521 BP; 153 A; 114 C; 143 G; 111 T; 0 other; tgtaagaaat ggaaattata agaaaagctc aaagctaaca gcttagcttc tgtcccctca 60 gcttgcttgc ttacaatata tagcttaagt gaagccatga caggcttcag cgggtgctgg 120 gcctcaaaag gcggtaaccg caggtcattt cctgataaga ggttgaaaag ccccgggatg 180 gggggagggg gggccaaaca ccggatgact ctgcagtaaa atgctgaaat aaaatgcaga 240 gggaaagccc taacgccaga accaatcagc tggtaagagg aaccaaccag aaaagagatg 300 aaataaactg ctgaaggaga ctgcgggagg ggggaggggg aaaactactt aagggatacc 360 cgtaacccaa gctggggtcc ttgtcagaat agaggccact gcgattggcg ctctgagact 420 cggaccctag ctcgagctag tcaataaaac tccttttgat gatttcagcc tcagtgactc 480 tgtctctttg ttctgtggta ctgcggtttc ccgctctaac a 521 // ID ERV2N1-Mim_LTR repbase; DNA; PRI; 622 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat of a retrovirus-like element. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV2N1-Mim_LTR. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-622 RA Jurka J. and Walichiewicz K.; RT "ERV2-like non-autonomous endogenous retrovirus from the mouse RT lemur."; RL Repbase Reports 9(11), 2837-2837 (2009). XX DR [1] (Consensus) XX CC >98% identical to consensus. 6bp tsd. 
CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 622 BP; 147 A; 152 C; 142 G; 181 T; 0 other; tgttgggagc tgtcccgaac acgggagagt taaagtgcgg ttgctgcccg gaacactggg 60 gtccttgaga cacagcacct ggcgccttaa ctcatagctc tgacattctt ggcctttctt 120 ctagctcaca gagggtgctg gagataacag tagcagttat agaaggtcac tgaagtaatc 180 ctgtgcacac agccctcgta gcttatagcc ccccctaggc cctgcacact cctttctcag 240 aattgttaat tacgcttggg tgctctttcc ttgaactgtc tgagtgatca tagttgagta 300 tttgtagctt ttttgatatc agaaaaaagt taagttgtag attagaaaac tgcttaagcg 360 agggaaaggg ctatcaaagt gacgtaaggt taaaagaaac aagaactgtt ttgttttctg 420 gtttaatgaa gacatgcacc acctgttcct ttgcattttg cttctccttt gttcttatct 480 gtataaatac agcaactgaa ataaacgagg tgcggcagtc agcaaggact ccgtcctccc 540 gtccccatct ttttgttgtc tcttcatttc tcagcctcgc cccctaactc caggtgccgc 600 acgtcgccgg ctggctccgg ca 622 // ID LTR22 repbase; DNA; PRI; 492 BP. XX AC . XX DT 28-AUG-1997 (Rel. 2.07, Created) DT 18-DEC-2003 (Rel. 8.11, Last updated, Version 2) XX DE Long terminal repeat of endogenous retrovirus - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; HERVK22I; KW LTR22. XX OS Hominidae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini. XX RN [1] RA Kapitonov V.V. and Jurka J.; RL Direct Submission to Repbase Update (25-AUG-1997). XX RN [2] RP 1-492 RA Lavie L., Medstrand P., Schempp W., Meese E. and Mayer J.; RT "Characterization of the human endogenous retrovirus family RT HERV-K(HML-5)."; RL Direct Submission to Repbase Update (07-DEC-2003)(in RL preparation). XX DR [2] (Consensus) XX CC Putative LTR of endogenous retrovirus. CC The LTR22 family is represented by two major subfamilies. CC LTRs associated with HERVK22/HERV-K(HML-5) proviral loci are CC represented by three major subfamilies: LTR22, LTR22A, LTR22B. 
XX SQ Sequence 492 BP; 119 A; 113 C; 130 G; 130 T; 0 other; tgtaggagat cagtcagggt ggtgggaaaa attatagtag aaagatgcaa accttcttgg 60 aaggctggga ggttttacaa aagcttcgga aaaggatttg gctgaaggca gccaaattct 120 cttatccagt gcctgaaagc ttaggttaga taacaaggaa atgtaaagaa attgatctag 180 ataagttagt ttacttaggc ctcggaacct ggcctttaat catctgcatg caggactgct 240 ctctccgggg ggggggcgac catgttaatt acccacaagt gtgttgactc aaagcctttg 300 tcattaaatc tgtactgaat aaatgcccgc agcgccggct tgtcagggcc acggctgctg 360 tgactcttta cagcaccctc ctcggtgtct gtgagcggcc cggtccccta gcccactctt 420 tcactggata cctgtgtctg agtgcatttt ttcatctgtc actcggccag ggtctgcggg 480 tcagacccgg ca 492 // ID LTR10_Mim repbase; DNA; PRI; 306 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR10_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-306 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2966-2966 (2009). XX DR [1] (Consensus) XX CC ~99% identical to consensus. 6bp tsd. CC Similarity to RLTR10F from mouse. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 306 BP; 63 A; 85 C; 81 G; 77 T; 0 other; tgtggggcgc ggtgttaacg ccattgcaag atggcgccga cttcctggtc acacccatac 60 cacaagactg ataaacaggg tgaaccgcgc atgtgtaggg gctttttcct gtctctcatc 120 aagtatgcta atgagggctc ttgcgtgagc caatcagatt ctgcctaatg tacttagtgc 180 ctatataagc ccgctccgag agctcctcgg ggtcttccgc tttagtcatc ttcagattcc 240 ccaataaagc gctgtcagaa gaactccggt tgccgcgtct tccttgctgg cgaggcgggc 300 gcgaca 306 // ID hAT-2N2_TS repbase; DNA; PRI; 517 BP. XX AC . XX DT 21-APR-2009 (Rel. 
14.1, Created) DT 21-APR-2009 (Rel. 14.1, Last updated, Version 1) XX DE hAT-2N2_TS is a family of non-autonomous DNA elements found in DE Tarsius syrichta mobilized by hAT-2_TS. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-2_TS; hAT-2N2_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-517 RA Novick P.A., Smith J., Ray D. and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). XX DR [1] (Consensus) XX SQ Sequence 517 BP; 82 A; 135 C; 130 G; 160 T; 10 other; tarsushatn acaggggtcc tcaaactacg gcccgcgggc cacatgcggc cgccgaggac 60 atttatccgg cccaccgggt gtttttgccg ccgctgcctg tcctgcctag cagccgactc 120 gtccaggccc gcagtgcgca tgtgtggaat gtgcgtcgca ctctccgact cccctccttc 180 tctctgtctc tcgactcctc ctctccgtct cgggtgtgat cggacgagtc acgagcttgc 240 ctgtgcagag cctgctgctg cctgaggacc gaggtaagaa caagttagga tttdtttttt 300 tttgaagtta ggaggtctat tttttttttt ttaaattttg carttagtag ggcctttttt 360 tttcggttaa ggggggcctt tttttccctg aagttaggag gtctwttttt twttttttgc 420 agataggggg cgcctttttt ttaaactata gtccgcccct ccaacggtct gagggacagt 480 gaactggccc cctgttttaa aagtttgagg acccctc 517 // ID LTR1B_Mim repbase; DNA; PRI; 592 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR1B_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-592 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2947-2947 (2009). 
XX DR [1] (Consensus) XX CC ~94% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 592 BP; 166 A; 162 C; 103 G; 161 T; 0 other; tgtgaactaa aataaaattt taaggctcac ccccccaccg gctgactgaa tggaccccct 60 cgtggccaaa ggaatatcct aaaactaaat tgtctgccag gaggagggag gtcagacatg 120 cctcatcatg cccccctccc ttcttgggga catcctttgt aacccattaa caggcctaag 180 ggtatgcaag acaaacctgc aggtcctcaa tttacacaac aaatctatgt ccggtggctt 240 atctctgata acagctcctt atgttaaaac attccaagcc tttagacaaa gcttcatgtc 300 tttaaccaat tacaagccaa agaatcttta aacccaccta taacctgtaa gcccccgctt 360 cgagatggcc cacctttttg ggccaaacca atgtatgcct cccacgtatt gatttatgac 420 tttacctgta acccctgtct ccctgaaatg tataaaacca aactgtaacc cagccacacg 480 agtccacttg ctcaaggcct cttgggagtg gctctgggtc atggtcctca aatttggctc 540 agaataaatc tctttaaaat tattttacag agtttggctt ttttccttga ca 592 // ID hAT-2N1_TS repbase; DNA; PRI; 685 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.1, Created) DT 21-APR-2009 (Rel. 14.1, Last updated, Version 1) XX DE hAT-2N1_TS is a family of non-autonomous DNA elements found in DE Tarsius syrichta mobilized by hAT-2_TS. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-2_TS; hAT-2N1_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-685 RA Novick P.A., Smith J., Ray D. and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). 
XX DR [1] (Consensus) XX SQ Sequence 685 BP; 117 A; 148 C; 182 G; 225 T; 13 other; tarsushatn acaggggtcc tcaaactacg gcccgcgggc cacatgcggc ccgccgagga 60 catttatccg gcccaccggg tgtttttgcc dccgctgcct gtcctgccta gcagccgact 120 bgtccgggcc cgcagtgcgc atgtgtggaa tgtgcgtccg cactctccga ctcccctcct 180 tctctctgtc tctcgactcc tcctctcagt ctcgggtgtg atcggacgag tcacgagctt 240 gcctgtgcag agcctgctgc tgcctgagga ccgaggtaag aacaagttag gatttttttt 300 tttttttttg aagttaggag gtctdttttt tttttttaat tttgcagtta gtagggcctt 360 ttttttgcgg ttaagggggg ccttttttth ctgaagttag gaggtctatt tttttttttg 420 cagatagggg gcgccttttt ttttgaagtt aggagagcbt ttttttttga agttaggaga 480 gccttttttt tgaagttagg agagcctttt tttttaagtt ggttagttgg ttgggggtgg 540 tttctagggg ggttgcatca cagtgataac gcaaatagtc agygctcagt gctaatgcaa 600 atggtttttt aaactatagt ccgcccctcc aacggtctga gggacagtga actggccccc 660 tgttthaaaa gtttgaggac ccctg 685 // ID LTR12_Mim repbase; DNA; PRI; 430 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR12_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-430 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2968-2968 (2009). XX DR [1] (Consensus) XX CC >94% identical to consensus. 4bp tsd. CC Similarity to LTR12_Vpa from alpaca. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 430 BP; 118 A; 97 C; 99 G; 116 T; 0 other; tatgttccgg caggtgtgaa cagctcttgg ccgaagaaaa acccgagcgg cacacggaga 60 gttggagagt cagctttatt tcgccggcgg gctcagagag gcatatctgc caccaaactc 120 tgagcgcccc tttttcgttt tcttttagtt ttataccttt ttgggggtta cagttagcca 180 atggcaagtt ttcacaaaag tcacctcatt tacatagtag tcagccaatc agaagtatgt 240 cccaaaagtt acttcattta catagtagtc agccaatcag aagtatgtcc caaaagttac 300 ttcatttaca tggtagtcag ccaatcagaa gtatgacccc aaatcacctc atcagctgtg 360 agtattagta gctgctcaat ttaggagcgg gataagcgaa acaggcgggt tttggtggaa 420 cgccctgaca 430 // ID TINE2 repbase; DNA; PRI; 87 BP. XX AC . XX DT 31-DEC-2009 (Rel. 15.03, Created) DT 31-DEC-2009 (Rel. 15.07, Last updated, Version 3) XX DE SINE element - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; TINE2. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-87 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(3), 518-518 (2010). XX DR [1] (Consensus) XX CC >92% identical to consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 87 BP; 26 A; 24 C; 12 G; 25 T; 0 other; ggcaacccct tcgggtcccc tcccactgtg ggagcttttc tgtcctctca ataaatcctg 60 ctttctaaat aaataaataa ataaaaa 87 // ID TINE2 repbase; DNA; PRI; 87 BP. XX AC . XX DT 31-DEC-2009 (Rel. 15.03, Created) DT 16-NOV-2010 (Rel. 15.07, Last updated, Version 4) XX DE retrogene from the LTR portion - consensus. XX KW LTR Retrotransposon; Transposable Element; TINE2. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. 
XX RN [1] RP 1-87 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(3), 518-518 (2010). XX RN [2] RP 1-87 RA Bao W. and Jurka J.; RT "TINE2 is a retrogene from the LTR portion."; RL Direct Submission to Repbase Update (16-NOV-2010). XX DR [1] (Consensus) XX CC >92% identical to consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. TINE2 is classified CC as retrogene (see comment of TINE1). XX SQ Sequence 87 BP; 26 A; 24 C; 12 G; 25 T; 0 other; ggcaacccct tcgggtcccc tcccactgtg ggagcttttc tgtcctctca ataaatcctg 60 ctttctaaat aaataaataa ataaaaa 87 // ID TINE1 repbase; DNA; PRI; 86 BP. XX AC . XX DT 31-DEC-2009 (Rel. 15.03, Created) DT 09-APR-2010 (Rel. 15.03, Last updated, Version 3) XX DE SINE element - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; TINE1. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-86 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(3), 517-517 (2010). XX DR [1] (Consensus) XX CC The youngest copies are >96% identical to consensus. CC This sequence was derived from sequence data generated by the CC Broad Institute Mammalian Genome Project. XX SQ Sequence 86 BP; 29 A; 25 C; 12 G; 20 T; 0 other; ggcaaccccc tcgggtcccc tcccactgtg ggagcttttc tgtcctctca ataaatcctg 60 ctttctaaaa aaaaaaaaaa aaaaaa 86 // ID HERV70_LTR repbase; DNA; PRI; 4 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 07-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; HERV70_LTR. 
XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-4 RA Smit A.F.; RT "HERV70_LTR - ERV1 Endogenous Retrovirus from primates."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Entry deleted Jun 2004; it contains a BNIP3 processed CC pseudogene, thus masking this gene in human. The element is also CC not autonomous and doesn't deserve the name HERV and the CC consensus was created mostly from a segmental duplication CC expanse on chromosome 19. XX SQ Sequence 4 BP; 4 A; 0 C; 0 G; 0 T; 0 other; aaaa 4 // ID Alu2_TS repbase; DNA; PRI; 298 BP. XX AC . XX DT 09-APR-2010 (Rel. 15.04, Created) DT 09-APR-2010 (Rel. 15.04, Last updated, Version 2) XX DE Alu-like SINE element - consensus. XX KW SINE1/7SL; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Alu2_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-298 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(4), 633-633 (2010). XX DR [1] (Consensus) XX CC >85% identical to consensus. CC This sequence was derived from sequence data generated by the CC Broad Institute Mammalian Genome Project. XX SQ Sequence 298 BP; 73 A; 80 C; 100 G; 45 T; 0 other; ggccgggcgc ggtggctcac gcctgtaatc ccagcacttt gggaggccga ggcgggagga 60 ttgcttgagc ccaggagttc gagaccagcc tgggcaacat agcgagacct cgtctctaca 120 aaaaattaaa aaattagccg ggcgtggtgg cgcgcgcctg tagtcccagc tactcgggag 180 gctgaggcgg gaggatcgcc tgagcccagg aggtcgaggc tgcggtgagc cgtgatcgtg 240 ccactgcact ccagcctggg cgacagagtg agaccccgac tcaaaaaaaa aaaaaaaa 298 // ID ERV2N1-Mim_I repbase; DNA; PRI; 4944 BP. XX AC . XX DT 11-NOV-2009 (Rel. 14.11, Created) DT 11-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Internal portion of a retrovirus-like element. 
XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV2N1-Mim_LTR; ERV2N1-Mim_I. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-4944 RA Jurka J. and Walichiewicz K.; RT "ERV2-like non-autonomous endogenous retrovirus from the mouse RT lemur."; RL Repbase Reports 9(11), 2836-2836 (2009). XX DR [1] (Consensus) XX CC >98% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 4944 BP; 1328 A; 1078 C; 1073 G; 1457 T; 8 other; ggcggtggcc cgtataggga accctctctc cgagackgcc ttcggtggta cwtggtgaca 60 mggggattcg gtgaaggccg ctgcccgtgg aaggccggct gtkacakaaa gaaaggtgwg 120 ttgggcagag agtgtaattg ataggagtaa gaaattatgg gacaaggaac tagtaggatg 180 ttgtttgtac aagtgttgaa gactatgctt cgggcccgca gtgttaaaat aggaaaaaag 240 cagttggaaa atttttcaag tttttagtag aagtatgtct tggtccctga ggaaggaacg 300 gtaaatttcg atacatggaa gaaggtagga gttaaattgc aggactatta ctcagcccac 360 ggacctgaga aggttccggt ggacgctttt tatctttgaa atcttattcg ggattgtcta 420 aacccccggc atactcgtat caggcacata acctttaagg aattagaaaa tctaaaaaat 480 atgatgcaac aacttcttat tcaaaatgct gctataggct tgttattagg tagaagcagt 540 atcactttga aaagcttact ggtgacaccg ggaattatag atcaggatta tacaagagaa 600 attaaagttt tgtactcctc tcctagctct atttctgttg tcaatcctgg ccagcgcatt 660 gctcagcttg ttttgatttc tttgatatcc cttggtaaag tacaatcgca acaagccagg 720 gtaccctctg ctttttgctt ctcagatgct tactggatgc tattactagg aataagcctg 780 aaatggtttt gtgggtgaat ggcaaggcat ttaaaggaat tgtagatact gaagctgatg 840 tttctgttat cgctgagcaa cactggccct cctgtggccc aagcaggaag ccgttggttc 900 gctccaaggt ataggctaag cccgtaatcc tgaacagtag tgaattgctt acctgtgttg 960 agtagggaca ttcggacaat tccaaccata tgttcgactt gctctacccg ttaatctttc 1020 gggaagagat tttatgcagg ctatgagagt ttatttgtat agcccgaact tgtgccaaag 1080 
atttgttgcc aatgctttgc tactagaatc aaatatccta ctgtatatat catccactat 1140 atggatatgc agatgaagga ttcttactgc aagtttttga tcatgaaaca acacaattaa 1200 acagggcctt gtcattgctc aggaaaaggt gcagaggcac cctccttatt gttatttggg 1260 ataccaactg cataaatacc attttattaa ccaaggaatt caacttcgta aagatggctt 1320 aaaaactctt aatgatttcc aaagacttct gggagatatc aattggattc gaccctattt 1380 gaaaattacc actggagacc ttaagcctct ttttgacatc ttamagggcg atccaatcct 1440 aattctcctc gtcaattaac ccctgacggg aggcaagcct taaaattagt agaacaggaa 1500 ttgtcccgcc aacatgttcc ctatgtaaac tacaacttgg aatgggctgg ctatgtttta 1560 cacaactctc atactcccac agcagtcctc tatcaatggg gaccactgat gtggctacat 1620 ttgccctcct atccctctaa agttctaact ccttattatg agatggttgc taccttaatc 1680 catatgctgc gttcagaatc ctgcaaatta ctaggtaaag agccacactt ttttgtgttc 1740 ctttttctag tctgcagcag gagtggttgt tccagcatag tgactcctgg gccatagccc 1800 tggccaatta ccctggaaaa attgataacc attacacccc tgataagttg ttacattttg 1860 ctagcctaca tccgtttatt tttgttgccc aagtgtctcc tgtacccctt gacaatgctg 1920 ttctgatttt tacagatggg tcttctaatg gtatggagtc tattcagtta acaatgacat 1980 taaatcttgg cacactgggt cttcttcgtc tcaagaggtc gaattgcagg cagttttttc 2040 agctttagag gccatccctg ctacgcctgt taacctttat tcagatagtc attatgtgat 2100 tcgggcactc caagtcatag agaatgtgcc ctttattggg acttctaata gtaatgttca 2160 gaaattgttt cgtgccttgc aagccctcat tcactcacgt acaggaagat gtttctttgg 2220 gcaccttcgt gctcattccc acctgcctgg tcccctcggc caaggaaatg aaattgtaga 2280 tcttgccact agaaccaaac cccttcttat tttgttaagc gcagttagca cagcagtccc 2340 atgctttgca caaccaaaat agcagcgctc tcaagcaaca gttcaatatc wctagagaag 2400 cggctcgaaa aattgtaaaa gcttgctctt cctgttctca gttactccct gtcccccatt 2460 atggtgttaa taccagaggc cttttatcta accacttatg gcaaatggat gtgacgttta 2520 ttacttctct gggttgatta aaatatgttc atctgaccat agacacttat tctggatttt 2580 taacggccac atttcaatta ggcgaggctg gtaaacattg tgtagcccat tgcctccgat 2640 gctttgcatc aatgggccag cctaattgta taaaaacaga caatgaccct ggatacacgg 2700 gtgacaaatt tcaaacattc ttaccgaaaa tggggatcag tcataaacca gaaaattctt 2760 ataacgccca 
aggacaaagc attgtagaac gtgcccacta gactctcaaa aatcaacttt 2820 tgaaaatgaa aaaggtggat ctgtaccacc tcacgccccc cccccccaga attatctaaa 2880 ccatgatctc gttattttaa attttttaat tttggacaaa gaagatcgtt cagcagccca 2940 gcggttttgg tgtactaatt ccaacaaaga cacgccgttg gttgggtgga aggatccact 3000 gactagccag tggtcaggac cggatccagt tgttatttgg agtcgaggtc atgtttgtgt 3060 atttcaacaa gatgctgagg gcccgttctg gctcccggag aggctggtaa gccaggcagg 3120 atccctccat agaaagcagg atgaagacac agaagctgca ggtcgaccag caacctccag 3180 ttaggatgac aactagacaa gctctgctgc cgtcttggag tcagattgag acttactgag 3240 attgcttcca agctggttcg agcgacagga cagccattga attctttgac cttgttttta 3300 gctatggtaa ccctgctatg gacgcctatt ggacttatat atcagacccc gttttactcc 3360 acccggtggg ctggggagat cgtgttgttc ttatgtcagt gatcccaggg ctctgggagc 3420 acccgctaat gatcacatcc agcatgctaa ggtatctgct tataattata ccggactcag 3480 ccctgatgtt cctatctgtt tccaccgtga aggttatttt ccaggctgtg ttcttttacc 3540 tacttatgta tattacaata taaatggtgt taaatggacg tctatgtcaa gtttagacag 3600 gcgattttct ggccctctta tttgccaccg ccgcccggaa tacttcgctg tgaaaaggag 3660 ctggttgctt ctactctgca cgtgccgtgg agagtttccg tgaaccaact gcgactaaga 3720 ttcctattct cgggacgact cagagcatat atgattggac cactgcaata actcacaatg 3780 ggtatttggg agacaagggt tgtgttgagg ggctatggat aactccttca aaggtatacc 3840 agactagtct ctgcaaactt gttgcaggaa ctggtcaatt ctttgcaaga gtgtggtgga 3900 ctctacccca aaaatacgcc taaggactac aggaactggt caatcttctg caagagagtg 3960 cggtggactc taccccaaaa atacgcctaa cgactatttc tgcctgtgtc cctgaacttt 4020 tgtattcatg gtaggcaatg tgaatattcg tactgtcact catggttttg aaatatcatg 4080 ccttaattgt caactaacta attgtataca atcccttgcc tattctgata gaatggttgt 4140 gttataccgc cccgcctttg tggtggtccc agctaatgtc tctggtccat ggtatgatga 4200 taaaggattg caaatgtgga aggaggtgaa cgctattttg ctaaggccta aaagatttat 4260 tgggctactc ataactggca ttgtagctct tgttactgta atagcatccg ctgctgcggc 4320 tgcagtggcg ttgacacaag agatacaaac tgcccattat gtcaataatc tctctaaaaa 4380 tgtcactcag gcattgggaa cccaagagca tattgataaa aagattgaag acagactgga 4440 cgctttgtat gatgtggtcc 
agattctggg agaggaagta caagggcttc ggctacgcag 4500 ccagctccgc tgccatgaca attatcagtg ggtttgtgta actcccaaac cttataatga 4560 gagtaactat aattgggcta aggtccaaaa tcacctggca gggatctggc actccgcaaa 4620 tgcatcttta gatttgctcc aacttcatca agaaataaca ggcatgttgg agactcctcc 4680 ccttgacact agcattgcta cagaagcccg agattttctc aatcaactct taggacatgt 4740 tccatctttt ggaaatttta aaagcttatt ctttatgatt attggggctt ttgttcttct 4800 cttgctaatt ttgtgtgtta cgcctctgct actgcgactg attgttaaga atattttggc 4860 tgttaaagct gccatacact ccactaactt gcaaatgaaa gcttaccagc gaccctccaa 4920 ttaattataa agggaaggga gaga 4944 // ID MER101-int repbase; DNA; PRI; 6639 BP. XX AC . XX DT 05-MAR-2004 (Rel. 13.06, Created) DT 26-JUN-2008 (Rel. 13.06, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MER101-int. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-6639 RA Smit A.F.; RT "MER101-int - ERV1 Endogenous Retrovirus from primates."; RL Direct Submission to Repbase Update (26-JUN-2008). XX DR [1] (Consensus) XX CC closest to PRIMA4-int; probably non-autonomous 16% subst; CC related seqs predate radiation. 
XX SQ Sequence 6639 BP; 1770 A; 1380 C; 1324 G; 2130 T; 35 other; ttcttggtgt cagaagcggg atttgaagca accccgattc tcctcccggc gccgtcctga 60 accaacgcat tggtgcctgc aagagcccct tgagctcagt tgtcttctca ctggagatgc 120 caggataggg ataggtaggg taatggcctc nggtaagtcc tctcgaattc agacctccca 180 tgccttggtt gaggtctcag agactttttc cccgtaggtc tcgtccatcg accccagagg 240 gacttcnttt agccatgggc ttgggagggg cttctcagcc agtcccccca tctggactct 300 gaaggggaca ttccctttgg ccccaggctt agnggggggc ctttccancc agttctctcc 360 atgcggatgc cggaggggct ctccctgtcg cccgaggttt agnaggggct tcccagtcaa 420 ctcggccggt ccataaggtt ttgtggggac gctcacgcta agggcactgg aagggacgcc 480 ttcgtcaggt cagtgcaatg ggtaacccat gttttaagtt ttccctgntg atcagccgcc 540 ttccggcact ccagctggct ttatgagcaa aaattatggt ccagggagtt gtaattggct 600 aggtctctgg acaaaaatta cccaggataa tcttaaactt tgttggccaa agtggggctc 660 ctttgaaatt ccaaaacttg cctatctgcg cgcacaattg gaacaaagaa aacaccgaac 720 ctcccagaga caatgggaag cctttttcag ttggtacttt gaaagttcta aatggaatca 780 agaggctacc attgcctccc ttagagaaaa taattccaaa ttgagtgagc gccttaatga 840 aatagaaaga gattagcgct tgcgagactg aaactaaagc tgtnaaaaca cctctggaag 900 acccttcttc cactagcccc ccttgcctgc cactctgcct gtcttctgca ccttttttca 960 ccacctctgc tttatccttc actccctcct tcctctttca ctctttcagc tggttatttt 1020 gaaaaggttt ctaaattttc tctaggctcg tctgtgtgtt tccttgtaaa atcctgtgat 1080 aaattcctgt gattttatgt taccttggca tccattttaa tcctcctcta acacacccag 1140 actccttgtt gagaaagctt aaattctctc tgtgcttgag atgtaaattt gctaccctgt 1200 tttctctaaa attcggtaag ggcttcagcc atgtgggaca gataaatttc agcctgttcc 1260 atttacagag acgcagtttg aatccaactg tccttttaaa ctagtgagtt ttacctgact 1320 catggctaaa gttttaaaat taaagctata agatctttat ttgtgtctgt ctgtattttt 1380 ntgtatatgt gtgtatacat gtctgttcgt atattgtcta cggtaccaaa ttggcttata 1440 aataaatgag tactcataaa ttaagcaaat aagcccaaat gcttttcaag ttcatgtgac 1500 ttagtaatct tttggcggat gggactagtc taatattgtt ggtttgatgg gaatggctgt 1560 gtcttctgag ttatcagcaa aatatgcatg tatttaactt tagggttctt gcttttatga 1620 tacttgcctg gcatgcagta atgtaaaatt 
ggttgataga aaatttagct tgggatgatg 1680 gctagatttg tctagtgtct catgaagttt tccaggcata attnttaaga gtgaatggat 1740 tggatggatg taaatgggat aaaagtttat aaatnaactt ttgataatgg ttatgttttg 1800 taatatgttt acttgggagg gcttctcaaa tntctttagt aactataccc ttagagtttt 1860 gctaagctaa attaaatgat ggatattcat tgaatgtcta gatcatttnc agataagata 1920 taatgctgag acattnattg ctgaatatga gtttaggctc atatactttt ggcttcttat 1980 ttcagagaaa caaaagttat ttggatctgt tagtaaaaat gtcctgttcc atattaaaaa 2040 gntgttctgt tagaaagcct atgtctctgg aaattgtaaa atgtgtattc atggattgtt 2100 ggtacatgat tggcagttaa aagttgctta cttcctaggt tttcactgaa aattagggtt 2160 actaagagtt aacattgtaa ttaatgtgtg tgattaaact actagagatg agaaagacca 2220 ttctgtatgc aagtgtatga ggagggtagg atgtattttt ggtaaggaag gttgaaaaga 2280 aaagagaata attttgtatg agaaagaatc ttgtgtggta aatttttntc ctanagtaaa 2340 atgactggtt atttaagaaa gaggaagtat aggacaaagc agaaagtcca agcatgtcat 2400 aaatggtcta agtaaatcat gataaggttt atgaaaagaa agtttataaa aggaattttc 2460 tgtgtgatca ggttggctac aattggaagg aaattgttta tgggtctttc taaggattga 2520 gctttgatgt tagaaatgca ctgatgcaga acttaaaaat ttggtcccct gtgttagaac 2580 aaggttttct taaaatgttg atttgctctt agtaaaattg caagaggttt tgatttttaa 2640 ttctgaaatc tgtttcctta acagccatcc tctaaactac aaacagtttc tatttctgcc 2700 acatttcttc ctgagatcta tctaatttcc ctagtttcag gttggaaatg cagctctcct 2760 tctttctacc cttgaaaagg tatatctttt tgcttggctg gggtgataac cctctccttc 2820 aaccttttcg tcagctcctg taactttttc tccggttcta acactgccgt tatggcctga 2880 tgctaaaatg tttatcttga aggtctagaa aggcaatgtt tccttcagta caacttgatt 2940 ctgtactttt ggcttttctt gatgtgtctg aattgttcca tgtaaccagg aaacttccta 3000 tgctgttact aaaaaccacg tattcccctg ctcaaggtac tagttttctt gtttacattc 3060 ctctataata tgggtacact cataaccctg gacacactct tcctgtgcct gattaaattc 3120 aagtaccctt ttcatcaggt ttaactttca ggttatctaa atgggctttc cgtaaggaga 3180 agcaatcacg ctgcaggagg tttttttttc tttgcctttt aggtaactgg cctaggaaac 3240 aaagattctg tgttttacca agataatttc ctgtgcttca tgttgtcttt attgggtttt 3300 tgattactta ggaaaactga gctttaaaag ggttaaggtt 
tttacatcca tgtaactttc 3360 tgtattgctt ttgaagtctt ttgattatca ctctggttaa atgaataact attatttagc 3420 agtgacctgt gattctgttt aatcaagtac tttgaacctt ttgacatctt tggcaggttt 3480 ccccaggatc aaaatcctaa attaagtctt tttgacctaa aattaacttt aggattttcc 3540 agttgggccc ctggagagca tcaaagaatt atctctcatc ttgtagagat attaaatgat 3600 taggcttatt tggtaaatca tatgggaagc attgtcaaat aagaaatggt gtttaacttc 3660 ctttaagtta catttgtgta aatgtgttat taaaatgtgt tccaaaattg catgagattt 3720 ctaaaattcc gatatgtcat gatatgtatt atcagtcatg attntgatta ttatgttaaa 3780 tgnttgtatg ccacaaaaat aactaaattt ccttgtcaat tgtgaactct catcagattt 3840 ttgaccatgg ctgttctggg tttttgtcat ccacagttat tgttttaaat tcttctctag 3900 aagcatttgc aatcagtata gtccaaaatt gctttaatca agcaaagcaa aattaattac 3960 atgaaattaa gtanttgata aggataactt tatgactttt atttaaaatg ttggttctnc 4020 atttaaattt tttttcagat tcaaggaant tttctttcat aagntattta tagtttgcaa 4080 taatttggta aagtatcctt tatgaacaaa agtggaagca tttgcttttt ctccctactt 4140 gattcctcca aaattcagaa actatttntg agtattctta ttttatttat ataagttcaa 4200 taaaaatctg ctctctcttt ataagcagga tacaattgga aacnttggtt atattgccaa 4260 ggttttgact gaaatgtcat atttaagaat gtgcataaaa tgcctggctt caagagttcc 4320 cagccttaca gtgagtgagt aaaaattgtc acttcctggc aggcccaaga accttaagac 4380 tgtaagtaaa atctaaagcc tgccttggtt tggcttccta gcctcaagag gttctaaaat 4440 ctgagattcc tatatgatca atgtggagag aaaaagttat gtttctaggg aaaacactaa 4500 agtacacctg ttattagatt gtagccctgt gcattgtttt caagtccttg ttatctgcct 4560 gtagactgga ctggatcctg aattctccta atttcctnca atatttggct acaactaaat 4620 cccgataaag tcccccggcc ctcttccccc aagcaagact agggatgctc cggggacatt 4680 caggggattt cccctnctta aanctaacca actaggggaa ttagatatta aaattggaga 4740 caaactagac ccataggata ctatggtccc cttgtctcaa agcagttgat gctgtctctt 4800 cctttgtaaa agccacagag aagatagtca cggggccacc tctcactgtc tscattccat 4860 actctgtcga ggctctcctc aattcacatc actggcagca tttgtaaaat ttgtccacaa 4920 tacaatactg gaaaaccatt acatgcctcc atggaccact tcccattacc gaatggtccc 4980 tttgaggtat ggcaacaaga ttttattcag ctccctgcct ctcaaggata 
ccagtatgtg 5040 ctagttatgg tttgcatgtt ttcacattgg gttgaagcct tcccctgtcg acaggccata 5100 gccatggcag tagctaaggc cctattggaa aaattatacc aacctgggga gtctctcaag 5160 agcttcacag tgactgagga actcatttta cagggcaaat tattnaaaat gtttgtaaaa 5220 tttggcctat ttatcaacat ctccattgtg cttaccaccc ccagtcctct ggggcggtgg 5280 aacagaccaa cggaataata aaagcccaat tggcaaagat ctgtgcggta tttagcctgc 5340 catggcccga ggccctttct ttagtcntcc ttaaccctgg catgctttca acgncccctc 5400 cccggagttc tgaattcatc cagatggtcg gggtagccaa ccagcctatg atggttccta 5460 aatctctacc tattcccttc caactaggac ccctcactgg cagtcattgc ttttcgcttg 5520 tcccatcggc ccccatacac ctcctggaaa gggacttctt agaaacctgc caggcccata 5580 tttccttctc ccaaaagggg gaaataatgc ttgagttatc ctcaccagga gattttgcca 5640 cagaaacggc ttttacccaa attcccatct attcagttag ccccaacact acccaccctg 5700 ctctccaaga gctacctgag agtctttggg cacaatccaa caccgatgat aacccctcag 5760 attatgaaga tgatagttgt ggacactctc aaggatgcga tttacctgga ggggttaacc 5820 ctagtgaagg agctagatta ggaaggtttc tggggtcctg gtttggacta ggccctgctt 5880 ggaatgaata tatggtcaga aacctttccc gcactgttaa cagaattgcc cgctccaccg 5940 cccgagccat cagggcacaa cagaggtccc tagattccct tgcttatgtg gtcctagaca 6000 accacattgc tttagactat cncctcgctg cacagggtgg tgtttgtgct gtcgctaaca 6060 cttcctgctg cacctgggta aatacttcca gtcaggttga attggaaaca tctaagatcc 6120 taaagctggc caaatctctg aaagggacac cttcagaaag cctcctggct ggacttactg 6180 ggttaaattt ccaatttcca gatattttca gctggcttcc cctggtatag gattccttct 6240 gcgttccgcc ctacaagtct taatgatcct cctcatattt gggctaagca tttggctcct 6300 ctttaaaatc gttctagcct gttttaacag atgtctgcaa gagaccccca ccaggatcgt 6360 gctgacccaa caccttgaga ctttaaactc actccagccg gaaacnggaa ccaacttaac 6420 ccaagagact ttgattcaaa tttaacaggt acctgagtgc ctctcgtaag taaatggctc 6480 tagttgctca gttggccact gccctgccac naggatccct gcgcgggact agatggaccc 6540 ggagcaggta gccaaccact ctggcaccat gatgggatgc aaccaaccta ttcgatcatc 6600 agtgctgtct gctgacaggt tttgatnaaa gggggggaa 6639 // ID LTR14C_Mim repbase; DNA; PRI; 410 BP. XX AC . XX DT 02-NOV-2009 (Rel. 
14.11, Created) DT 02-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14C_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-410 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2975-2975 (2009). XX DR [1] (Consensus) XX CC >95% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 410 BP; 112 A; 97 C; 113 G; 88 T; 0 other; tgtaagatac tgggctataa ggacaatgga caccttccag cttctgttta ctgcttgttt 60 gctaaccgca aggcattatg agtacattat gggatgcaga ggagaaagac aaagaacgcg 120 gaaaccggag tctctcagct aggacccgga acaggttgga gcctatcagg ggcaggatga 180 agtaagaatc actgtggggg cggatgcatg atcagcgtgt aaacagctta ggtataaaag 240 gctcactagc acacaaaggg gggtccctgc ccgaagaaga ggccactgcg ctggcactct 300 gggggctcgg accctagctc gagctagaca ataaaactcc ttttgataat tacagcctcg 360 gtgactctgt ctctctgtcc cgcggccctg cgaatctcga ctatataaca 410 // ID LTR1B1_Mim repbase; DNA; PRI; 624 BP. XX AC . XX DT 06-NOV-2009 (Rel. 14.11, Created) DT 06-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR1B1_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-624 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2946-2946 (2009). XX DR [1] (Consensus) XX CC >90% identical to consensus. 4bp tsd. 
CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 624 BP; 173 A; 164 C; 110 G; 177 T; 0 other; tgtgaactaa aataaaatct taagccccca gctgactgaa tggaccccct cttggccaag 60 gggaccccag aaatacccta aagctgagtt gctggccatg agaagggagg tcagacatgc 120 ctcatcatgc ccccctccct tcttggagat gtcctttgta actcattaac aggcctaagg 180 ctatgcaaga caaagcttaa accacacctg caggtcatca atttacttaa cagatcactt 240 gagtctgggt atatgtccgg tggcttgtct ctgattaaca gacttcctta tcttaaaaca 300 ttccaagcct ttagacaaag cttcatttct ttaaccaatt acaaatcaaa gaatctttaa 360 acccacctat aacctgtaat ccccgcttcg agatgtcctg ccttttcggg ccaaaccaat 420 gtacaccttc catgtattga tttatgactt tatgtgtaat tcctgtctcc ctgaaatgta 480 taaaaccaaa ctgtaaccca accacgcgag accacttgct caaggcttct tgggcgtggc 540 tctccgggcc atggtcacac atattcggct cagaataaac ctctttaaat tattttacag 600 agtttgggtt cttttccgtt gaca 624 // ID ORSL repbase; DNA; PRI; 630 BP. XX AC M26221; XX DT 30-APR-1998 (Rel. 6.5, Created) DT 30-APR-1998 (Rel. 6.5, Last updated, Version 1) XX DE Origin of replication-like (ORS8) region (a consensus). XX KW Origin of replication; ORSL. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 287-482 RA Rao S.B., Zannis-Hadjopoulos M., Price B.G., Reitman M. RA and Martin G.R.; RT "ORSL."; RL Unpublished (1989). XX RN [2] RP 1-630 RA Rao S.B., Zannis-Hadjopoulos M., Price B.G., Reitman M. RA and Martin G.R.; RT "Sequence similarities among monkey DNA-replication ori-enriched RT (ors) fragments."; RL Gene 87, 233-242 (1990). XX RN [3] RP 1-630 RA Jurka J.; RT "ORSL."; RL Direct Submission to Repbase Update (13-APR-1998). XX DR [3] (Consensus) XX CC This sequence is moderately repeated in human DNA. CC It shares ~200bp stretch of similarity with African CC Green Monkey origin of replication region (Acc. No. M26221). 
XX SQ Sequence 630 BP; 183 A; 118 C; 135 G; 175 T; 19 other; ycccatacyg tttccrtcac trgtytgwgc accctctgca rggmagacag catgaytttt 60 yatctttgaa tycyyaaaas ttagctcact gtgtgctcaa acgtgtattg aatgacagtt 120 gctatatttg aggacbacat agattttggg gaagacggac aggcacacta gcagaaccat 180 acgaaggcca ggatcagtca tgaccagggc tgcattatga cttgtgggcc ctgagcactt 240 ttgcttttat gggccccttc ctccataaaa atattaaaaa ttatatttta tgactgcatt 300 ggtataaaga tgagaatata atccaggctg aattaaaaca ttttcttaga ctctaaaatt 360 tcattttttt ctgattttaa aagaaattaa aacattttya tggggcccta aagtattgtg 420 ggccctaggc actgtgccta ctgtgcctaa tggataagtc nagcctgacc agggcactct 480 ggtgtgaggt tgaagaaagg aaatttggaa caaagaagcc aagtgctctg gagaagcagg 540 tgaaacttcc actgccgaac aaaatcagaa tgggagcagc catggttaat aaggttgtgg 600 aagtttagas cttccagttc acttcmcctt 630 // ID L1PB_orf2 repbase; DNA; PRI; 3294 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE L1 Non-LTR Retrotransposon from primates. XX KW L1; Non-LTR Retrotransposon; Transposable Element; LINE; KW L1PB_orf2. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-3294 RA Smit A.F.; RT "L1PB_orf2 - L1 Non-LTR Retrotransposon from primates."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC L1PB1 consensus. 
XX SQ Sequence 3294 BP; 1441 A; 683 C; 521 G; 626 T; 23 other; atgaatggaa tagtacctca catctcaata ctaacattga atgtaaatgg cctaaatgct 60 ccacttaaaa gatacagaat ngcagaatgg ataagaattc accaaccaac tatctgctgc 120 cttcaggaga ctcacctaac acataaggac tcacataaac ttaaggtaaa ggggtggaaa 180 aagacattcc atgcaaatgg acaccaaaag cgagcaggag tagctattct tatatcagac 240 aaaacaaact ttaaagcaac agcagttaaa aaagacaaag agggacatta tataatgata 300 aaaggccttg tccaacagga aaatatcaca atcctaaata tatatgcacc taacactgga 360 gctcccaaat ttataaaaca attactanta gacctaagaa atgagataga cagcaacaca 420 ataatagtgg gggacttcaa tactccactg acagcactag acaggtcatc aagacagaaa 480 gtcaacaaag aaacaatgga tttaaactat accctggaac aaatggactt aacagatatn 540 tacagaacat tccacccaac aactgcagaa tacacattct attcancagc acatggaacn 600 ttctccaaga tagaccatat gataggccac aaaacgagcc tcaataaatt taagaaaatt 660 gaaattatat caagcactct ctcagaccac agtggaataa aactggaaat caactccaaa 720 aggaaccttc aaaaccatgc aaatacatgg aaattaaata acctgctcct gaatgatcat 780 tgggtcaana atgaaatcaa gatggaaatt aaaaaattct tcgaactgaa cgacaatagt 840 gacacaacct atcaaaacct ctgggataca gcaaaggcgg tgctaagagg aaagttcata 900 gccctaaatg cctacatcaa aaagtctgaa agagcacaaa cagacaatct aaggtcacac 960 ctcaaggaac tagagaaaca agaacaaacc aaacccaaac ccagcagaag aaaggaaata 1020 accaagatca gagcagaact aaatgaaatt gaaacaaaca aaaaaataca aaagataaat 1080 gaaacaaaaa gctggttctt tgaaaagata aataaaattg atagaccatt agcaagatta 1140 accaagaaaa gaagagagaa aatccaaata agctcaatta gaaacgaaac gggagatatt 1200 acaactgaca ccacagaaat acaaaagatc attcaaggct actatgaaca cctttacgcg 1260 cataaactag aaaacctaga ggagatggat aaattcctgg aaanatacaa ccctcctagc 1320 ttaaatcagg aagaantaga naccctgaac agaccaataa caagcagcga gattgaaatg 1380 gtaattnaaa aattaccaac aaaaaaaagt ccaggaccag acggattcac agcngaattc 1440 taccagacat tcaaagaaga attggtacca atcctattga cactattcca caagatagag 1500 aaagagggaa ccctccctaa atcattctat gaagccagta tcaccctaat accaaaacca 1560 ggaaaggaca taacnaaaaa agaaaactac agaccaatat ccctgatgaa catagatgcn 1620 aaaatcctta acaaaatact agctaaccga 
atccaacagc atatcaaaaa gataatccac 1680 catgatcaag tgggtttcat accagggatg cagggatggt ttaacatacg caagtcaata 1740 aatgtgatac accacataaa cagaattaaa aacaaaaatc acatgatcat ctcaatagat 1800 gcagaaaaag catttgacaa aatccagcat ccctttatga ttaaaaccct cagcaaaatc 1860 ggcatanaag ggacatacct taatgtaata aaagccatct atgacaaacc cacagccaac 1920 atnatactga atggggaaaa gttgaaagca ttccctctga gaactggaac aagacaagga 1980 tgcccactct caccactcct nttcaacata gtactggaag tcctagccag agcaatcaga 2040 caagagaaag aaataaaggg catccaaatc ggtaaagagg aagtcaaact gtcgctgttt 2100 gctgatgata tgatcgtnta cctagaaaac cctaaagact cctccagaaa gctcctagaa 2160 ctgataaang aattcagcaa agtttcngga tacaaaatta atgtacacaa atcagtagct 2220 ctnctataca ccaacagcga ccaagctgag aatcaaatca agaactcaac cccttttaca 2280 atagctgcaa aaaaaataaa atacttagga atatacctaa ccaaggaggt gaaagacctc 2340 tacaaggaaa actacaaaac actgctgaaa gaaatcatag atgacacaaa caaatggaaa 2400 cacatcccat gctcatggat gggtagaatc aatattgtga aaatgaccat actgccaaaa 2460 gcaatctaca aattcaatgc aattcccatc aaaataccac catcattctt cacagaacta 2520 gaaaaaacaa tcctaaaatt catatggaac caaaaaagag cccgcatagc caaagcaaga 2580 ctaagcaaaa agaacaaatc tggaggcatc acattacctg atttcaaact atactataag 2640 gccatagtca ccaaaacagc atggtactgg tataaaaata ggcacataga ccaatggaac 2700 agaatagaga acccagaaat aaacccaaat acttacagcc aactgatctt cgacaaagca 2760 aacaaaaaca taaagtgggg aaaggacacc ctattcaaca aatggtgctg ggataattgg 2820 cnagccacat gtaggagaat gaaactggat cctcatctct caccttatac aaaaatcaac 2880 tcaagatgga tcaaggactt aaatctaaga cctgaaacta taaaaattct agaagataac 2940 attggaaaaa cccttctaga cattggctta ggcaaggatt tcatgaccaa gaacccaaaa 3000 gcaaatgcaa caaaaacaaa gataaatagn tgggacttaa ttaaactaaa gagcttctgc 3060 acggcaaaag gaacagtcag cagagtaaac agacaaccca cagagtggga gaaaatcttc 3120 acaatctata catctgacaa aggactaata tccagaatct acaangaact caaacaaatc 3180 agcaagaaaa aaacaaacaa tcccatcaaa aagtgggcta aggacatgaa tagacaattc 3240 tcaaaagaag atatacaaat ggccaacaaa catatgaaaa aatgctcaac atca 3294 // ID PTERV1b repbase; DNA; PRI; 7064 BP. XX AC . 
XX DT 14-MAR-2006 (Rel. 13.02, Created) DT 27-FEB-2008 (Rel. 13.02, Last updated, Version 1) XX DE ERV1 Endogenous Retrovirus from Pan troglodytes. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; PTERV1b. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-7064 RA Smit A.F.; RT "PTERV1b - ERV1 Endogenous Retrovirus from Pan troglodytes."; RL Direct Submission to Repbase Update (21-FEB-2008). XX DR [1] (Consensus) XX CC <2% div. ORF1 1188-2816, ORF2 2807-5353, ORF3 5332-7017 CC lib20040702. XX FH Key Location/Qualifiers FT CDS 5332..7014 FT /product="PTERV1a_3p" FT /translation="LTANMQLGSLTLTLVALVAAGENIKPAPNPFVWRFWL FT YENQTHPGQPHKPGKLVASADCPSSGCNSPILLNFTDFPVAKPVAPIICFE FT YDQTEYNCKHYWWHQSAGCPYNYCNIHKYQWWGGEEQIDPRWPFHRRRDRD FT LSYTWIVRDPWNSRWTTPQHGAVYYSSASTWPSSHLYLWRGLVQVRPLVHG FT NIQRQENRLTQDLRPFSWLKLLQEGLELANLTGLHSLSGCFLCATLGRPPL FT TAVPLPWGSSTSAQANNHQNLSYAPIPNVPLYLNPSQEKFPYCFSGTNSSL FT CNITATPPNITLRAPSGIFFWCNGTLSKNLSSPSVTNLLCLPVTLVPRLTL FT LTAGEFLGYTGNWTSAVIHPDPRPRPARAIFLPLIAGISLTASFMAAGLAG FT GALGHTLIESNKLYQQFAVAMEESAESLASLQRQLTSLAQVTLQNRRALDL FT LTAEKGGTCMFLKEDCCFYINESGLVEDRVQQLRKLSTEVRTRQFASAADQ FT WWNSSMFSLLAPFLGPLLSLLFLLTVGPCVVNRILRFVKERFNTVQLMVLR FT AQYQPVNAETESDL" FT CDS 1188..2813 FT /product="PTERV1b_1p" FT /translation="SELPRFGGQQPYLSCPVGRAAAVSDLGCPGATAIQAA FT DSDPGFRCGLRSELPRFGGQQLPLTRAAKVRLRLDIINKSDQISFTGIQTH FT IPLQEETTNYYRMGNTQSTPLSLLTSNFKEVRARGHDLGIEIRKGKLITLC FT RSEWPAFDVGWPPEGTFRLAVITRVKSKIFLPGRAGHLDXIPYILIWQDLV FT ENPPPWLSPFQLASEPCKALVARPLKSKQPTAPPHPVLPDSGDPLFTEPPP FT YPSGPQAPAPLAELREGAGGREAAGTHGPAERESNFEGPAGRTRGRTSRTS FT PPQPPDSTVALPLREIGPPDDTGIPRLQYWPFSTSDLYNWKTQSARFSDNP FT KDLLALLDSVMFTHQPTWDDCQQLLRILFTTEERERIQIEARKLVPGDDGQ FT PTANPDLINATFPLTRPAWDYNTAEGRGRLHLYRQTLMAGLRAAARKPTNL FT AKVYSILQGKTESPATYLERLMEAFRQYTPIDPEAPGSQAAVVMSFVNQAA FT PDIKRKLQKLEDLEGKRIQDLLQIAQRVYNNRDTPEEKQFKATKKK" 
FT CDS 2807..5350 FT /product="PTERV1b_2p" FT /translation="KKMTKVLAAVVQKEHLQPEVTYLGYLLKGGQRWLTDA FT RKQTVLQIPRPQSTRQVREFLGSAGFCRLWIPGFAELAKPLYQATRGQQPF FT NWTDEAELAFQQIKTALLSAPALGLPDVTKPFHLYVDGNKGVAKAVITQNL FT GPWRRPVAYLSKKLDPVAAGWPPCLRMIAATALMVQDAHKLVMGQELRVVT FT PHAIEGVLKQPPNRWMSNARLTHYKGLLLNPLRIIFLPPTTLNPASLLPNP FT DLDAPLHDCTKILAQVHGVREDLQDRPLPDADLVWFTDGSSFMHQGQRYAG FT AAVTSETEVIWAEPLPPGTSAQKAELIALTQALTLGAGKKLTVYTDSRYAF FT ATAHIHGAIYRERGLLTAEGKEIKNKQEILALLTALWRPEKLAIVHCPGHQ FT KLTTPTAQGNFLADQTARNVAKAPSQLLALQLPDPGPRDLPYFPEYSEQDL FT QWIDKLPLKQIQNGWWTDTNDQTILPEKLGQQVLEHIHRTTHLGARRMIDL FT IRRSKLKIRHIAETASSIVTSCKVCQLNNAYPQSQAATGTRLRGTRPGIYW FT EVDFTEIKPGKYGYRYLLVFVDTFSGWTEAFPTKRETAQVVAKKILEDILP FT RYGFPIQIGSDNGPAFVAKVSQDLASILGANWKLHCAYRPQSSGQVERMNR FT TLKETLTKLTIETGANWVVLLPYALFRARNTPYKLGLTPYEIMYGRPPPLV FT PSLKDDLLKSETENVSEFLFSLQALQKIHQEIWPKLRELYETSPPPTPHPY FT QPGDWVLVKRHRQETLEPRWKGPLQVLLTTPTALKVEGIASWIHYTHVKPV FT DPTSDLLGPITAAAAEAPDTWTVDRAKNNPLKLTLRRQHSSLQTCS" XX SQ Sequence 7064 BP; 1714 A; 2020 C; 1808 G; 1519 T; 3 other; tctgggggcc cgtccgggat tccccaagcc caccagaccc ctggtcaacg gatctgctag 60 gatcgatcta ctgataggtg agctggctcg tctccgtttg tctgtctgtg tctgttctga 120 atccgaatct gtgactcgcg aggtctgaaa ctggagctgg cacagtcctg gcggacgcgc 180 tataggacgg ccagcggaga ccggtgggag acgtcccctg gctctcatct gatctatatt 240 gcgatctgag ctgccccggt ttggcgggca gcagccgtat ctgagctgcc cggttggtcg 300 ggcagcatct gtctctgatc tgagctgccc cggtttggcg ggcagcagcc gtatctgagc 360 tgcccggttg ggcgggcagc agctgtctct gatctgagct gccccggttt ggcgggcagc 420 agctgtctct gatctgagct gccccggttt ggcgggcagc agccgttctg atctgagctg 480 cccggttggg cgggcagcag ctgtctctga tctgagctgc cccggtttgg cgggcagcng 540 ccttgatctc tgagctgccc ggttgggcgg gcagcagctg tctctgatct gagctgcccc 600 ggtttggcgg gcagcagccg tatctgagct gcccggttgg gcgggcagca gctgtctctg 660 atctgagctg ccccggtttg gtgggcagca gctgtctctg atcgtgagct gcccggttgg 720 cgggcagcat ctgtctctga tctgagctgc cccggtttgg cgggcagcag ccgtatctga 780 gctgcccggt tgggcgggca gcagctgtct ctgatctgag 
ctgccccggt ttggcgggca 840 gcagccgtct ctgatctgag ctgccccggt ttggcgggca gcagccgtat ctgagctgcc 900 cggttgggcg ggcagcagct gtctctgatc tgagctgccc cggtttggcg ggcagcatct 960 gtctctgatc tgagctgccc cggtttggcg ggcagcagcc gtatctgagc tgcccggttg 1020 ggcgggcagc agctgtctct gatctgggct gccccggcgc gaccgcgatc caggcagctg 1080 actctgaccc gggcttccgg tgcgcgcttg cgatctgaac tgccccggtt tggcgggcag 1140 cagccgtatc tgagctgccc ggttggncgg gcagcagctg tctctgatct gagctgcccc 1200 ggtttggcgg gcagcagccg tatctgagct gcccggttgg gcgggcagca gctgtctctg 1260 atctgggctg ccccggcgcg accgcgatcc aggcagctga ctctgacccg ggcttccggt 1320 gcggcttgcg atctgaactg ccccggtttg gcgggcagca gctgcctctg accagggctg 1380 ccaaagtgcg cctgcgatta gatatcatta ataagtcaga tcagatttcc tttacaggga 1440 ttcaaaccca cattccttta caggaagaga ctacaaatta ttacaggatg ggtaataccc 1500 agagcactcc tctatctctc cttacgagta atttcaaaga agttagagca aggggccatg 1560 atcttggtat agaaatcagg aaaggaaagc taattactct gtgtcgctcc gaatggcctg 1620 cctttgatgt ggggtggccg cccgaaggga ccttccgact tgctgtcatc actagggtaa 1680 agtccaagat tttcctacct gggcgtgcgg gccacttaga tcanatccca tatatcctca 1740 tatggcagga ccttgttgag aacccgcctc cttggctgtc ccctttccaa ttggcctctg 1800 aaccctgtaa ggcactggtt gctcgaccac taaaatccaa gcaaccaact gccccccccc 1860 atcctgttct acctgacagc ggggacccac tgttcacaga accccctccg tacccctccg 1920 ggccccaggc cccagccccc ctggctgagc tgcgggaggg agcaggcgga cgggaggcgg 1980 ccggcacaca cgggcccgct gaaagggaaa gtaactttga agggccggcg gggaggacgc 2040 gagggcgcac ttcgcggact agcccccctc agccgcctga ctccacggtg gctttacccc 2100 ttcgggaaat aggaccccca gatgacacag gaatccccag gctccagtac tggccattct 2160 ccaccagtga tctgtataac tggaaaactc agagtgctcg gttttcagac aaccccaaag 2220 atttactggc tttactagat agtgtcatgt tcacccacca gcccacttgg gatgattgtc 2280 agcagctcct ccgaattttg ttcaccacgg aagagcgaga gagaatacag atagaagcta 2340 gaaagctggt cccgggggac gacggtcaac cgactgccaa ccccgacctc ataaacgcaa 2400 cctttcctct gaccaggccg gcgtgggact acaacacggc agaaggtagg ggacggctac 2460 acctttatcg ccagactcta atggcaggtc tccgggcagc tgctcgcaag 
cccactaatt 2520 tggctaaagt atattctatt ctgcagggaa agacagagag cccagctacc tacttagaaa 2580 gattaatgga agcttttaga cagtacaccc ccatagatcc agaggctcca ggaagtcagg 2640 cagctgttgt aatgtctttc gtaaatcagg cagccccaga tattaagaga aaactccaga 2700 aattagaaga cttggaggga aagcggattc aggacctcct tcagatagcc cagcgggttt 2760 acaataacag agatactcca gaggaaaagc aatttaaggc cactaaaaaa aaatgaccaa 2820 ggtcctggca gcagtggtac agaaagagca tctacagcca gaggtaacat atctagggta 2880 cctgcttaag ggagggcagc gctggctaac agacgcccgg aaacaaactg ttctgcagat 2940 ccccaggcca caatccaccc gacaagtgag agaattcctg gggtcggcgg gattttgcag 3000 actatggata cctgggttcg cagaactggc taaacccttg tatcaggcaa cacgggggca 3060 acagccattt aattggacag acgaagccga gttggccttc caacagatta aaaccgccct 3120 actctccgcg cctgcactag gactacctga tgttaccaag cccttccact tatacgtgga 3180 tgggaataag ggtgtcgcca aggcggtaat aactcagaac ttaggcccct ggcggaggcc 3240 agttgcctac ctgtcaaaga agttagaccc agtagctgcc gggtggcccc cttgtctccg 3300 aatgattgcg gccacggctc tgatggtgca agatgctcat aaacttgtca tggggcaaga 3360 attgcgggtc gttactccac atgccatcga aggtgtactc aaacagccac ctaatcgatg 3420 gatgagtaac gcccggctca cccactacaa aggactacta ctaaatcctc tcaggataat 3480 tttcctgccc ccaacgacct taaaccctgc ctcgctgctg cccaacccgg acctggacgc 3540 cccactccat gactgcacca agatactagc tcaggtgcac ggagttcgag aagacctgca 3600 ggaccgccca cttcctgacg ccgacctcgt ctggttcact gatgggagca gcttcatgca 3660 tcaaggccag aggtacgctg gggcggcagt aacttcagag actgaggtaa tctgggcgga 3720 acccctgccc ccggggacat cggcccagaa ggccgaactg atagcgctca cccaagctct 3780 taccttaggg gcggggaaaa agctgacagt atatacagac agccgatatg cttttgcaac 3840 ggcgcatata catggggcca tttacaggga gcgagggtta ctgacggctg aaggaaaaga 3900 gataaaaaac aagcaagaga tcctagccct gctaacagcc ctatggaggc cagaaaaatt 3960 agccattgta cattgcccag ggcatcagaa actaactact ccaactgctc aaggcaactt 4020 tctggcagac caaactgcaa gaaatgtggc gaaggctccc agccaactcc ttgcactcca 4080 gctccctgac ccgggccccc gggacttgcc atatttccct gaatattcag aacaagatct 4140 ccagtggatt gacaaacttc ccctgaaaca aatccagaat gggtggtgga ctgatactaa 
4200 tgaccaaacc atcctaccag aaaaattagg acaacaggtg ttagaacaca tccaccgaac 4260 cacccacctg ggggcccggc ggatgataga cctgatcaga cgctccaagc tcaaaatcag 4320 acatatagct gagacggcca gcagtatcgt gacaagttgc aaagtctgcc agcttaacaa 4380 cgcatacccc caatctcaag ctgcaacagg aacaaggctc aggggaacca ggcccggtat 4440 ctactgggaa gtagatttta ctgaaataaa gccaggaaag tacgggtacc ggtacttact 4500 tgtctttgta gatacttttt cagggtggac tgaagcattc ccaaccaaaa gagaaactgc 4560 tcaggtcgta gcaaagaaaa ttctggaaga tatccttccc aggtatggct tccccatcca 4620 gatagggtca gataatgggc ccgctttcgt cgctaaggta agtcaggact tggcttccat 4680 ccttggggca aattggaaac tacattgcgc ttacaggccc cagagttcag gacaggtaga 4740 aaggatgaat cggaccttaa aagagacctt aactaaattg actatagaga ctggcgctaa 4800 ttgggtagtc cttctcccct atgctctgtt ccgggcccgt aatacccctt acaaactggg 4860 ccttacccct tacgaaatca tgtatggcag acctccaccc ctggttccta gcttaaaaga 4920 tgacctgctt aagtctgaaa cagaaaatgt ctctgaattc ttattttcct tacaagcctt 4980 acagaaaatt caccaagaaa tctggcccaa gctgagagag ctatatgaga ccagtccccc 5040 accgacaccc catccgtacc agccgggaga ctgggtcctg gttaagcgac accgacaaga 5100 gaccctagag cccaggtgga aaggaccact ccaagtactc ctgaccacac ccaccgccct 5160 gaaggtagaa ggcattgcgt cgtggatcca ctacacccac gtcaagccag tggacccaac 5220 ctccgacctt ctggggccaa tcacggcggc ggcggctgaa gcaccggaca cgtggactgt 5280 ggacagagct aagaacaacc ccttaaaact caccctgcgc cggcagcata gctcactgca 5340 aacatgcagt taggtagtct aactctaaca ttggtcgccc tagtggccgc tggggaaaac 5400 ataaagccag ctcctaatcc ctttgtctgg agattctggc tttatgaaaa ccaaacccac 5460 cctgggcaac ctcataagcc cgggaaacta gtggccagtg cagattgccc ctcctcaggg 5520 tgcaatagcc caattttact aaattttacc gatttcccag tagccaaacc agtggcacca 5580 ataatatgct tcgagtatga tcagactgaa tacaattgta agcactattg gtggcaccaa 5640 agtgccggct gcccttataa ctattgtaac atccataaat accaatggtg gggtggagaa 5700 gaacagatag atcccagatg gcccttccat cgcagacgag atagagacct ttcatataca 5760 tggatagtta gagacccctg gaactcccgc tggaccacgc ctcaacacgg ggctgtatac 5820 tactcctccg cctccacatg gcctagcagt cacctctatc tgtggcgggg tctagtgcag 5880 
gtacggcccc tggtccatgg aaatatccag cgacaagaaa accgcctgac acaagattta 5940 cgtccttttt cctggttaaa attattgcaa gaaggattag aacttgccaa ccttacagga 6000 cttcacagcc tgtctggctg ctttctatgt gccactctag ggcgtccacc gctaaccgct 6060 gtccccctgc catggggatc atccacctct gcccaagcta acaaccacca aaacctctca 6120 tatgccccta tccctaacgt gccactatac ctaaacccca gtcaagagaa gtttccctac 6180 tgtttctcag gaactaattc cagcctctgc aacatcactg caacgccccc taacatcacc 6240 ttaagggctc cgtcaggcat attcttctgg tgtaatggaa cattatctaa aaacctatca 6300 agcccctctg ttaccaacct actgtgtctt cctgtcacat tagttccccg gttaactcta 6360 cttactgccg gcgagttcct agggtatacc ggtaactgga ctagtgctgt tattcaccca 6420 gaccctagac cgagacctgc acgagccata tttctccccc tcattgcagg aatctccctc 6480 accgcatcct tcatggcggc cggactggct gggggagccc taggtcacac ccttatagaa 6540 agtaacaagc tgtaccaaca atttgccgtt gctatggagg agtcagctga gtcccttgcc 6600 tccctccagc ggcagctcac gtccctagca caggtaacct tgcagaaccg gagggcctta 6660 gacctactca ctgctgaaaa agggggaacg tgtatgtttc taaaggaaga ctgttgtttc 6720 tacataaatg aatcaggact cgtggaagac cgagtccaac agttacgcaa gttaagcaca 6780 gaagtaagaa cacggcagtt tgcttcagct gcagaccaat ggtggaactc atctatgttt 6840 tctctgttag cccccttcct tggacccctg ctgagtctac tatttctgct taccgtagga 6900 ccttgtgttg ttaacagaat tttgcggttc gttaaagaaa ggtttaacac tgtacaactc 6960 atggtcctca gagcccaata ccaacctgta aacgctgaaa cagaatcaga cttataagac 7020 ccaagattgg ctctaaaaaa atacctgaaa agaaaggggg ggaa 7064 // ID hAT-2N2_TS repbase; DNA; PRI; 506 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.1, Created) DT 14-DEC-2009 (Rel. 14.1, Last updated, Version 2) XX DE hAT-2N2_TS is a family of non-autonomous DNA elements found in DE Tarsius syrichta mobilized by hAT-2_TS. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-2_TS; hAT-2N2_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-506 RA Novick P.A., Smith J., Ray D. 
and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). XX DR [1] (Consensus) XX SQ Sequence 506 BP; 79 A; 135 C; 130 G; 158 T; 4 other; caggggtcct caaactacgg cccgcgggcc acatgcggcc gccgaggaca tttatccggc 60 ccaccgggtg tttttgccgc cgctgcctgt cctgcctagc agccgactcg tccaggcccg 120 cagtgcgcat gtgtggaatg tgcgtcgcac tctccgactc ccctccttct ctctgtctct 180 cgactcctcc tctccgtctc gggtgtgatc ggacgagtca cgagcttgcc tgtgcagagc 240 ctgctgctgc ctgaggaccg aggtaagaac aagttaggat ttdttttttt ttgaagttag 300 gaggtctatt tttttttttt taaattttgc arttagtagg gccttttttt ttcggttaag 360 gggggccttt ttttccctga agttaggagg tctwtttttt wttttttgca gatagggggc 420 gccttttttt taaactatag tccgcccctc caacggtctg agggacagtg aactggcccc 480 ctgttttaaa agtttgagga cccctc 506 // ID L1-1_TS repbase; DNA; PRI; 5277 BP. XX AC . XX DT 02-JAN-2010 (Rel. 15.03, Created) DT 02-JAN-2010 (Rel. 15.03, Last updated, Version 2) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-1_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-5277 RA Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(3), 440-440 (2010). XX DR [1] (Consensus) XX CC >95% identical to consensus. CC This sequence was derived from sequence data generated by the CC Broad Institute Mammalian Genome Project. 
XX FH Key Location/Qualifiers FT CDS 186..1145 FT /product="L1-1_TS_1p" FT /translation="MKKNQKKSMGLSQTPGRADTEKTDFGTQTMKSPQNEG FT SQTANVDIKTIMERLKRIEETQEETRKELISEITVIKNTVNEINNKLISME FT SRITQAEERISELEDQNIEITQTLKNTENKLKKTEQNLQEMSDYLKRPNLR FT IIGLPEAERETETTLEQTFHEIIQENFPYLISDAKIQTQEIQRTPARQQMR FT RPTPRHIIIRLNKVGTKEKILKAAREKGQITYHGRPIRIAADLSAETLQAR FT RAWSPIFKVLKDKQFQPRITYPAKLSFISEGELKSFPDIQSLRTYAATKPS FT LHETLKKVLNTEEKEKRTTFFTRVQGKE" FT CDS 1239..5051 FT /product="L1-1_TS_2p" FT /translation="MIGTNSHISIISLNVNGLNAPLKRHRMTKWIKYHQAT FT IYCLQETHLTRKDIHRLKVRGWETNFQANGTQKKGGVAILISDKIPFKLSK FT IKKDTEGHYIMIKGSLHQQEISILNIYAPNIGAPTFIKQLLGKLKKDIDSN FT TIITGDFNTPLTTLDRSSGQKISKEIRNLNETLDQMDLIDTYRTLHPKTTE FT YTFYSSPHGTYSKIDHIIGHKSSISKFKRTEILPCTFSDHSGIKINIDTNK FT VPPKPTKTWTLNSMMLNNSWVNDDIKTEIKRYLETNENEETSYQNLWDALK FT AVIRGEFISLQTHMRKMEGTEIDNLTSHLKKLEKQDHKNPNFSRRIQITKI FT KAQIQDIEDKKIIQKINETKSWFFERVNKIDGPLARLTKKKREKNQISTIR FT NTKDEVTSDPEEIQKIIRDYYVHLYGNKLENQKEMEDFLTSHNLPRLEQEE FT IETLNRPITIKEIDHVIRKLPTKKSPGPDGFPAEFYKTFKEELIPILLKVF FT QAIEKDGTLPKSFYEANITLIPKPGKDPTKKENYRPISLMNIDAKILNKIL FT ANRIQQYISKIIHHDQVGFIPGMQGWFNIRKTINVIKYINRCQNKNHMIIS FT LDAEKAFDKIQHPFLIKTLEHLGIRGTYLKIVKAIYEKPTASILLNGQKLE FT PIPLKTGTRQGCPLSPLLFNIVLEVLARAIREEEAIRGIQIGKEEVKLSLY FT ADDMXVYLENPRESVKGLLTLIKAFGKVSGYKINVQKTIAFLYTNNKQTET FT QIKNTVPFTIATKKMKYLGIFLTRDVKDLYNENYKTLLKEIKADTNKWKNI FT PCSWIGRINIVKMSILPKAIYKFNAIPIKLPTTFFSDLEKTTQEFIWKHKR FT PRIARTILSKKNKAGGITIPDFKLYYKATIIKTAWYWYRNRHIDQWNRIEI FT PEAKPQFLNQLIFDKAPTTYHWGEENLFSKWCWENWLTTCRRLKQDPYLSP FT CTKVNSKWIRDLNVKPQTIRTLEKEGNTLMEIGTGIQFLYKTRNPQDLREK FT IDKWDLIKLTSFCKAKETIKRAGRQPTDWEKVFANSRSDKGLTSWIYKELK FT RAEKKKTNNPIIKWAKDMNRHFTKEDIRAANKHMKKCSTSLIIREMQIKTT FT LRYHLTPVRMAIINNSKNNSCWRGCGEKGTLLHCWWECKLVQPLWKAVWRF FT LKALKIDLPYDPAIPLLGIYPEEHKSLYKKDTCTRMFIAALFTIARTWKQP FT CCPSKEDWIKKMWYIYTMEYYAAIKKNKIMNFAATWMELESIILSDLSQKQ FT RSEYHMFSLI" XX SQ Sequence 5277 BP; 2236 A; 1146 C; 925 G; 969 T; 1 other; ggaccaagac cgcaaactgc tgaatagaca gactgtaaag cagagggaaa agtgaacaaa 60 
gccagaagac cctcataagg aagaacagga cattgcagaa gagaaataaa agcaacccca 120 ccttccctca aaaacagaat tgaagagggg gagggaaagg gggagaggga gaaaaatcta 180 cagaaatgaa gaaaaaccaa aagaagagta tgggtctctc ccagacgcct gggagagcag 240 acactgagaa aactgacttc ggaacgcaaa caatgaaaag tccccagaat gaagggtctc 300 aaactgcaaa tgtagatatc aagacaataa tggagagatt aaaaagaatt gaggagacac 360 aagaagaaac taggaaggag ctgatatctg agataacagt aataaagaat actgtgaatg 420 aaataaataa caaactgata agcatggaaa gcagaattac ccaagcagaa gaaagaatct 480 cagagcttga ggaccaaaat atagaaataa cccaaactct taaaaacaca gaaaataagc 540 tcaaaaagac agaacaaaac cttcaagaga tgagtgacta cctcaagagg cctaacctaa 600 gaataatcgg actccctgag gcagaaagag aaacagagac cacattggaa caaactttcc 660 atgagatcat tcaagaaaac ttcccttatc taatcagtga tgcaaaaatt caaacacaag 720 agattcagag aacccccgca agacaacaaa tgagaagacc aactcctaga cacataataa 780 ttcgcctaaa taaagtaggc acaaaagaaa aaatcctaaa ggcagcaaga gaaaaaggcc 840 agatcaccta ccatggaaga ccaatcagaa tagcagcaga tttatctgca gaaaccctgc 900 aggctaggag agcttggagc cctatcttca aagtcctaaa agataaacaa tttcaaccaa 960 gaataaccta cccggccaag ctaagcttca tcagtgaggg agaattaaaa tctttcccag 1020 atatccaatc cctaagaact tatgcagcca caaaaccatc tctacatgaa acacttaaga 1080 aagtactaaa cacagaagaa aaggaaaaaa gaacaacgtt cttcacaaga gtacagggaa 1140 aagaataaaa tatacacgaa ccaaccccaa aaccaaaaga aagacaaaaa aaaaaaagaa 1200 aaaaccaagt ggaagaacaa taactcaata agaactccat gatagggacg aactctcaca 1260 tttcaataat tagtctgaat gtgaatggac taaacgcacc actgaaaaga catagaatga 1320 caaaatggat aaaatatcac caggcaacaa tatactgcct tcaagagacc catctcacta 1380 gaaaggacat acacagactc aaagtaagag gatgggaaac aaattttcag gcgaatggaa 1440 cacaaaagaa aggaggagtc gcgatcctaa tttcagacaa aataccattt aagctatcaa 1500 aaattaaaaa agatacagag ggccactaca taatgataaa aggttcactc catcaacaag 1560 aaatatctat cctaaacata tatgcaccca acataggtgc gccaacattc ataaagcaac 1620 ttctaggaaa actaaagaaa gacattgact ctaacaccat aataactggg gactttaata 1680 caccactcac aaccctagac agatcatcag gacaaaaaat cagcaaggag atccggaacc 1740 tcaatgagac tctggaccaa 
atggacttaa ttgataccta cagaacactc catccaaaga 1800 ccacagaata cacattctac tcatcaccac atggaacata ttctaagatc gaccacataa 1860 ttggacacaa atcaagtata agcaaattta aaaggaccga aattctacca tgcaccttct 1920 cggaccacag tggaataaaa ataaacattg acaccaacaa agtcccccca aaacccacaa 1980 agacatggac actaaacagc atgatgctaa acaactcctg ggtcaatgat gacatcaaaa 2040 cagagatcaa aagatacctg gaaacaaatg aaaatgaaga aacatcttac caaaatctct 2100 gggatgcctt aaaagctgta ataagagggg aatttatatc cctacaaaca cacatgagga 2160 aaatggaagg aacagaaatt gacaacctaa caagccacct aaagaagctg gaaaagcaag 2220 accacaaaaa ccctaatttc agcagaagaa tccagatcac caaaataaaa gcccaaatcc 2280 aggacataga agacaaaaag ataatacaaa aaatcaatga aacaaaaagc tggttcttcg 2340 aaagggtaaa caagatcgat ggtcccctag ctagactgac caagaaaaaa cgagaaaaaa 2400 accaaataag cacaatcaga aacacaaaag atgaagtcac atctgaccct gaagaaatac 2460 aaaagatcat tagagactac tacgtacact tgtatggaaa caaacttgaa aaccagaagg 2520 aaatggagga ctttctgaca tcacacaacc tacctaggtt ggaacaagaa gaaattgaga 2580 ccctaaatag accaataaca atcaaggaaa tcgaccacgt aataagaaaa cttcctacaa 2640 aaaaaagccc tggtccagat ggctttccag cagaattcta caagacattt aaggaggagc 2700 tgataccaat cctactgaag gtattccagg cgattgaaaa agatggaact ctccccaaat 2760 cattttacga agccaacatc acattgatac ccaagccagg taaagatcca acaaagaaag 2820 agaactacag gccaatatct ttgatgaaca tagatgctaa aattctcaac aagatcctag 2880 caaaccggat tcagcaatac atctcaaaaa tcatccatca tgaccaagta ggcttcattc 2940 ctggcatgca aggctggttc aacattcgta aaaccataaa tgtaattaaa tacatcaaca 3000 gatgtcaaaa caaaaaccac atgatcatat cactagatgc agaaaaagct tttgataaaa 3060 tccagcaccc cttcttgata aaaacccttg aacatctagg catacgggga acatacctca 3120 aaatagtaaa agccatctac gagaaaccca cagccagcat actcctaaat ggacaaaaat 3180 tggaaccaat tcccctgaaa actggaacta gacaaggatg cccactctct cccctcctgt 3240 tcaacatagt attggaagtc ctggctagag caatcagaga agaggaggca atcagaggta 3300 ttcaaatagg aaaagaggaa gtcaagttat ctctctatgc agatgatatg atkgtgtacc 3360 ttgaaaaccc aagagaatct gtcaaaggcc tccttacatt gataaaggcc tttggcaaag 3420 tctcaggata caaaataaat gtacaaaaga 
caatcgcatt tctctacacc aataataaac 3480 aaacagaaac ccaaataaaa aacacagttc cattcacaat agccacaaaa aaaatgaaat 3540 accttggcat cttcctaacc agagacgtga aagaccttta caatgaaaat tacaaaacac 3600 tgctcaaaga aatcaaagct gacacgaaca agtggaaaaa tatcccatgc tcatggatcg 3660 gaagaatcaa cattgtgaag atgtccatct tacctaaggc aatctacaaa ttcaatgcaa 3720 tacccattaa attaccaaca acattcttct cagacctaga aaaaacaaca caggaattca 3780 tatggaaaca caaacgtcca agaatagcca gaacaatcct cagcaaaaaa aacaaagcag 3840 gtggtatcac aataccagac ttcaaacttt actataaagc tacaatcatc aaaacagctt 3900 ggtattggta caggaacagg catatagatc aatggaacag aattgagatt ccagaagcaa 3960 aacctcaatt tctcaaccaa ctcatcttcg acaaagcccc caccacctac cactggggag 4020 aggagaacct attcagtaaa tggtgctggg aaaactggct gaccacatgc agaagattga 4080 aacaggaccc ctatctatcc ccatgcacaa aggttaactc caaatggatc agagacctaa 4140 atgtaaaacc tcaaaccata agaaccttag aaaaggaagg aaataccctc atggaaatcg 4200 gaactggcat ccaattcctg tacaaaaccc gaaacccaca ggacttaaga gagaagatag 4260 acaagtggga ccttattaaa ctgacaagct tctgcaaagc caaagaaacc atcaagagag 4320 cagggagaca gcctacagac tgggaaaaag tatttgccaa ctccaggtct gacaaaggct 4380 taacatcctg gatctacaag gaactgaaac gtgctgaaaa gaagaaaaca aacaacccca 4440 ttataaaatg ggcaaaagat atgaacagac acttcacaaa agaagacatc cgagcagcca 4500 acaaacacat gaagaaatgc tcaacctcac taatcatcag ggagatgcaa atcaaaacca 4560 cactgagata ccacctaact ccagtcagaa tggcaattat caacaactca aaaaataaca 4620 gctgctggag agggtgtggc gaaaagggaa cacttctaca ctgttggtgg gagtgtaaac 4680 tagtgcaacc tctgtggaaa gcagtgtggc gattcctaaa agctctaaaa atcgacctcc 4740 catatgaccc cgcaatcccc ctactgggaa tataccctga agaacacaaa tcactctata 4800 aaaaagatac ctgcacacgt atgtttatcg cagcattgtt cacaatagca agaacctgga 4860 aacaaccatg ctgcccatca aaagaggact ggattaaaaa aatgtggtac atatacacga 4920 tggaatacta cgcagccata aaaaagaaca aaatcatgaa tttcgcagca acctggatgg 4980 aactagagtc tataatactg agtgacctct cacagaaaca aagatccgag tatcacatgt 5040 tctcactcat ataatggacc ttgaacatcc aatgcaatac tataagaaaa tgactgacgg 5100 tactgggaaa ctatgggggg gagggagatg ggattaacgg 
tagcaaatat ctgtctgggg 5160 acggggaaac acctcttatc aacagggtgc ctgaatgaaa cgtaattgta tacctaaccc 5220 ttaactgtac cccacaacat cataataaaa aaatactgat taataaaaaa aaaaaaa 5277 // ID TINE1 repbase; DNA; PRI; 86 BP. XX AC . XX DT 31-DEC-2009 (Rel. 15.03, Created) DT 09-APR-2010 (Rel. 15.07, Last updated, Version 4) XX DE SINE element - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; TINE1. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-86 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(3), 517-517 (2010). XX DR [1] (Consensus) XX CC The youngest copies are >96% identical to consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. XX SQ Sequence 86 BP; 29 A; 25 C; 12 G; 20 T; 0 other; ggcaaccccc tcgggtcccc tcccactgtg ggagcttttc tgtcctctca ataaatcctg 60 ctttctaaaa aaaaaaaaaa aaaaaa 86 // ID LTR1_Mim repbase; DNA; PRI; 681 BP. XX AC . XX DT 13-OCT-2009 (Rel. 14.11, Created) DT 13-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR1_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-681 RA Jurka J.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2948-2948 (2009). XX DR [1] (Consensus) XX CC Top sequences are >98% identical to consensus. The internal CC portion (not included) comes from a non-autonomous retrovirus CC with unrelated insertions such as L1. 4bp tsd.
CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 681 BP; 187 A; 173 C; 126 G; 195 T; 0 other; tgaggactga actctaattt tttgctaaaa actccgtcct aaggaggcca gctgggttag 60 gctggcaaat ggtaaggagg ccagcagggt taggctgaca aacagtaaat tcccactaag 120 cggcttttgt taaccaaacg aagcagtggt ttacttcctg acctgattct ggtatggcat 180 taacatcacc taaaagataa gaagcccctg tcttaactca agcatcctag cagatgccta 240 ttcttaaatt taaaccatct taaaacatcc tagcaggcgc ctattgtaaa tttaaagtgt 300 cctcctgtct ggacctccca gagtgctcat acccttatct taaagtaagc atatcctttc 360 tggtcttcta gataaagact aactctctca gccaattgcc agccaaagaa tctttaaacc 420 cacctataac ctgtaagccc ccgcttcgag atgtcccacc tttttgggcc aaaccaatgt 480 atgcctctca tgtattgatt tgtgactttg cctgtaaccc ctgcctctct gaaaatgtat 540 aaaactgaac tgtaacccag ccacagcgag tccacttgcc caaggcctct tggcagtggc 600 tccgggtcat ggtcctcaaa tttggctcag aataaatctc tttaaaatta ttttacagag 660 tttggctttt ttccgtcgac a 681 // ID MSTAR repbase; DNA; PRI; 1651 BP. XX AC . XX DT 01-MAY-1996 (Rel. 5.2, Created) DT 18-APR-1997 (Rel. 6, Last updated, Version 2) XX DE MSTa-Mammalian LTR internal retrotransposon sequence - a DE consensus. XX KW Repetitive sequence; MaLR family; MSTa subfamily; MstII; MER10; KW MSTAR. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-1651 RA Smit F.A.; RT "Structure and evolution of mammalian interspersed repeats."; RL PhD dissertation, Univ Southern California, 1995. XX DR [1] (Consensus) XX CC ORF: bases 48 to 1469. 
XX SQ Sequence 1651 BP; 436 A; 353 C; 489 G; 371 T; 2 other; gaawattggt actgaggagt ggagcattgc tataaagata cctgaaaatg tggaagcgac 60 tttggaactg ggtaacaggc agaggttgga agagtttgga gggctcagaa gaagacagga 120 agatgaggga aagtttggaa cttcttagag acttgttaaa tggttgtgac caaaatgctg 180 atagtgatat ggacagtaag ggccaggctg acgaggtctc agatggaaat gaggaactta 240 ttgggaactg gagcaaaggt cactcttgtt atacattagc aaagagcttg gctgcatttt 300 gcccctgccc tagagatttg tggaagtttg aacttgagag tgatgatcta gggtatctgg 360 cggaagaaat ttctaagcag caaagcgttc aagatgtgac ctggctgctt ttaacagctt 420 acagtcatat gcgagagcaa agaaatcact taaagttgga atttatattt aaaagggaag 480 cagagcgtaa aagtttggaa aatttgcagc ctggccatgt gatagaaaag aaaaacccgt 540 tttctggaga gaaattcaag caggctgcgg agcgaccgtt tgctaaagag attagcataa 600 ctaaaaggaa gccaagtgct gatagccaag acaatgggaa aaaggcctcg aaggcatttc 660 agaaatcttc gaggtggtcc ttcccatcac aggcccagag gcctaggagg actgaatggt 720 ttcgtgggcc aggcccaggg ccccgctgcc ctgtgcagcc tcgggacact gctccctgca 780 tcccggctgc tycggctcca gccgtggctc aaagggcccc aggtacagct cgagctgccg 840 cttcggagag tgcaagctat aagccttggt ggcttccaca tggtgttaag cctgcaggtg 900 cacagaatgc aagagtgaag gaggcttggc agcctccacc tagatttcag aggatgtatg 960 ggaaatcctg ggtgcccagg cagaagcctg ctgcagggac ggagccctca cagagaacct 1020 ctactagagc agtgccaaag ggaaatgtgg ggttggagcc cccacacaga gtccccaccg 1080 gggcactgcc tagtggagct gtgggaaggg ggccactgtc ctccagaccc cagaatggta 1140 gagccactgg cagcgtgcac cgccagcctg gaaaagccgc aggcatcaga ctccaacccg 1200 tgagagcagc cacgtgggct gtgcccagca aagccacagg ggcggagctg cccaaggcct 1260 tgggagccca cccctcgcac cagcgtgccc tggatgcgag acacggagtc aaaggagatt 1320 attttggagc tttaagattt aatgactgcc ctgctgggtt tcggacttgc gtggggcctg 1380 tagccccttt cttttggccc atttctccct tttggaatgg aaatatttac ccaatgcctg 1440 taccaccatt gtatcttgga agtaaataac ttctttttga ttttacaggc tcataggtgg 1500 aaggaacttg ccttgtctca gatgagactt tggactttgg acttttgagt taatgctgga 1560 atgagttaag actttggggg actgttggga aggcatgatt gtattttgca atgtgagaag 1620 gacgtgagat ttgggggaac caggggcaga a 
1651 // ID TINE1 repbase; DNA; PRI; 86 BP. XX AC . XX DT 31-DEC-2009 (Rel. 15.03, Created) DT 16-NOV-2010 (Rel. 15.07, Last updated, Version 5) XX DE retrogenes from the LTR portion - consensus. XX KW LTR Retrotransposon; Transposable Element; TINE1. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-86 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(3), 517-517 (2010). XX RN [2] RP 1-86 RA Bao W. and Jurka J.; RT "TINE1 is a retrogene from the LTR portion."; RL Direct Submission to Repbase Update (16-NOV-2010). XX DR [1] (Consensus) XX CC The youngest copies are >96% identical to consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. In other mammalian CC genomes, such as human, TINE1-like elements are also present and CC sequences are most similar to the respective LTR in the genomes. CC Furthermore, a considerable percentage of TINE1-like sequences CC (~50%) in human genomes are preceded by the 3'-end sequences CC of the internal portions of HERV17. Therefore, TINE1 is CC classified as a retrogene. XX SQ Sequence 86 BP; 29 A; 25 C; 12 G; 20 T; 0 other; ggcaaccccc tcgggtcccc tcccactgtg ggagcttttc tgtcctctca ataaatcctg 60 ctttctaaaa aaaaaaaaaa aaaaaa 86 // ID Alu1_TS repbase; DNA; PRI; 299 BP. XX AC . XX DT 09-APR-2010 (Rel. 15.04, Created) DT 09-APR-2010 (Rel. 15.04, Last updated, Version 2) XX DE Alu-like SINE element - consensus. XX KW SINE1/7SL; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Alu1_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius.
XX RN [1] RP 1-299 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(4), 632-632 (2010). XX DR [1] (Consensus) XX CC ~97% identical to consensus. CC This sequence was derived from sequence data generated by the CC Broad Institute Mammalian Genome Project. XX SQ Sequence 299 BP; 76 A; 78 C; 96 G; 49 T; 0 other; agccgggctc ggtggctcag cctgtaatcc cagcactttg ggaggctgag gtgagtggat 60 tgcctgagcc cgcgggttcg agacccgcct gggcaacttg gcgagacctc atctctacaa 120 taaatcaaaa aattagccgg gcgtggtagc gcgcgcctgt agttccagct acttggaagg 180 ctgaggcgga aggatcgccg gagcccagca ggtcgaggct gcggtggccg ggagcggcca 240 ctgcactcca gtctgggcga cagagtgaga ctccaactca aaaaaaaaaa aaaaaaaaa 299 // ID LTR14C1a_Mim repbase; DNA; PRI; 517 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14C1a_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-517 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2972-2972 (2009). XX DR [1] (Consensus) XX CC ~92% identical to consensus. 4bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 517 BP; 153 A; 118 C; 122 G; 124 T; 0 other; tgtgggatac aagctaaccg ctaccttggc ttctgtacct tagcttttgt aattcgcttg 60 cttgcttgcc acttagcctg actgaagcca tgacaggctt ctacaaagta aaagaaaaaa 120 aaaaaaaaga gagaacaaac aaggccccag ggaggaaacc ggtaaggcac tacctgataa 180 ggtagtgcaa agtccctggg acctagccaa ccaatcaata aatcaataca cggccagcat 240 gatcagtgcg tgaacagctt gagtgggatg tgtgggtgct gggtgggctc cggataccac 300 ttgtaaccag taacctgagt tgcacaacaa ctaaaagtat aaaacctgtg ctaaaacctt 360 gccaagggtc cttgtctaaa gagaccactg agctggtgct ctgggacctg gaccctagct 420 cgagctagct taaataaacc tccatttgtt gcttacgttg gtgtgagctt gttactctga 480 cattctgttc tgggatacag aattcttgga cacaaca 517 // ID CERV1_INT repbase; DNA; PRI; 7337 BP. XX AC . XX DT 17-AUG-2004 (Rel. 9.07, Created) DT 17-AUG-2004 (Rel. 9.07, Last updated, Version 1) XX DE Chimpanzee endogenous retrovirus CERV1 - a consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; CERV1_INT; KW Internal sequence of chimpanzee endogenous; LTR. XX OS Pan troglodytes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Pan. XX RN [1] RP 1-7337 RA Skaletsky H., Hughes F.J. and Page C.D.; RT "Consensus sequence of an endogenous retrovirus CERV1."; RL Repbase Reports 4(7), 189-189 (2004). XX DR [1] (Consensus) XX CC Internal sequence of an endogenous retrovirus with ORFs CC for gag (961-2370), pol (3042-5624) and env (5618-7288). CERV1 CC is absent from the human genome. It is similar to an CC uncharacterized interspersed repeat in the Macaca mulatta and Papio anubis CC genomes.
XX SQ Sequence 7337 BP; 1892 A; 2103 C; 1788 G; 1546 T; 8 other; tctgggggcc cgtccgggat tccccaagcc caccagaccc ctggtcaacg gatctgctag 60 gatcgatcta ctgataggtg agctggctcg tctccgtttg tctgtctgtg tctgttctga 120 atccgaatct gtgactcgcg aggtctgaaa ctggagctgg cacagtcctg gcggacgcgc 180 tataggacgg ccagcggaga ccggtgggag acgtcccctg gctctcatct gatctatatt 240 gcgatctgag ctgccccggt ttggcgggca gcagccgtat ctgagctgcc cggttgatcg 300 ggcagcatct gtctctgatc tgagctgccc cggttgcggc agcagcctat ctgactgccc 360 ggttgggcgg gcagcagctg tctctgatct gagctgcccc agcagctgtc tctgatctga 420 gctgccagta nctgtctctg atctgagctg ccccggtttg gcgggcagca gctgttcatc 480 tgagctgccc ngttnggcng gcagcanctg tctctgatct gagctgcccg gcgggcagca 540 gctgtctctg atctgagctg ccccggtttg gcgggcagca gctgtctctg atctgagctg 600 ccccagtgca gccgtatctg agggcagctg tctctgatct gagctgcccg gcgggcagca 660 gctgagctgc cccggtttgg cgggcagcag ctgtctctga tctgcccggt tnggcgggca 720 gcagctgtct ctatctgagc tgccccggtt tggcgggcag cagctgtctc tgatctgagc 780 tgccccggtt tggcgggggc tgccccagtg cgntcggctg ccccggggcg cagctgcctc 840 tgaccagggc tgccaaagtg cgcctgcgat tagatatcat taataagttc agatcagatt 900 tcctttacag ggattcaaac ccacattcct ttacaggaag agactacaaa ttattacagg 960 atgggtaata cccagagcac tcctctatct ctccttacga gtaatttcaa agaagttaga 1020 gcaaggggcc atgatcttgg tatagaaatc aggaaaggaa agctaattac tctgtgtcgc 1080 tccgaatggc ctgcctttga tgtggggtgg ccgcccgaag ggaccttccg acttgctgtc 1140 atcactaggg taaagtccaa gattttccta cctgggcgtg cgggccactt agatcaaatc 1200 ccatatatcc tcatatggca ggaccttgtt gagaacccgc ctccttggct gtcccctttc 1260 caattggcct ctgaaccctg taaggcacta gttgctcgac cactaaaatc caagcaacca 1320 actgcccccc ccatcctgtt ctacctgaca gcggggaccc actgttcaca gaaccccctc 1380 cgtacccctc cgggccccag gccccagccc ccctggctga gctgcgggag ggagcaggcg 1440 gacgggaggc ggccggcaca cacgggcccg ctgaaaggga aagtaacttt gaagggccgg 1500 cggggaggac gcgagggcgc acttcgcgga ctagcccccc ctcagccgcc tgactccacg 1560 gtggctttac cccttcggga aataggaccc ccagatgaca caggaatccc caggctccag 1620 tactggccat tctccaccag tgatctgtat 
aactggaaaa ctcagagtgc tcggttttca 1680 gacaacccca aagatttact ggctttacta gatagtgtca tgttcaccca ccagcccact 1740 tgggatgatt gtcagcagct cctccgaatt ttgttcacca cggaagagcg agagagaata 1800 cagatagaag ctagaaagct ggtcccgggg gacgacggtc aaccgactgc caaccccgac 1860 ctcataaacg caacctttcc tctgaccagg ccggcgtggg actacaacac ggcagaaggt 1920 aggggacggc tacaccttta tcgccagact ctaatggcag gtctccgggc agctgctcgc 1980 aagcccacta atttggctaa agtatattct attctgcagg gaaagacaga gagcccagct 2040 acctacttag aaagattaat ggaagctttt agacagtaca cccccataga tccagaggct 2100 ccaggaagtc aggcagctgt tgtaatgtct ttcgtaaatc aggcagcccc agatattaag 2160 agaaaactcc agaaattaga agacttggag ggaaagcgga ttcaggacct ccttcagata 2220 gcccagcggg tttacaataa cagagatact ccagaggaaa agcaatttaa ggccactgaa 2280 aaaatgacca aggtcctggc agcagtggta cagaaagagc atctacagcc agagtacacc 2340 caacctaggc ggccccccgg catgataatc tgagcaaaga ccaatgtgcc tattgtaagg 2400 gggctggcca ctaggtaaga gactgcccca aaaagaaacc acgaggacag ggacccgcct 2460 aggtctacac ccgtactagt cactcaagac gaagactagg gaagacgggg ttcggacccc 2520 ctccccgaac ctagggtaac tttgcaagtg gaggggtccc cagtccagtt cttggtcgat 2580 acnggagcac agcactcggt cttagttaaa actaatggga aattatcctc caaatcctcg 2640 tgggtacaag gggccacagg agttaagaaa tacccatgga caacacaaag aacagtaaac 2700 ctcggagcca agaatgtaac ccattctttc ctggtcatcc ctgagagccc ctgtccccta 2760 ttggggagag acctgctaac taaaatggag cacagatcca tttcctccct gaggggcccg 2820 tcgtgaccaa ctcccacaat cgaccgtgtc ctcctgacta taaacctaga agatgagtac 2880 cggctccacc aggagaaagc ggcccctgac caggacatag caactggctc cagcatatcc 2940 agaagcgtgg gcggaaacgg ggggcttagg tctagcaaaa cacctcctgc cttatttatt 3000 gaacttaagc ctggacagac ccctgcggta cgccataccc gatgccccta gagccaagaa 3060 gttcctccag cccccacccg agagacacaa gaagacagac accctactca catcaatggt 3120 acgttactct gaccgaggaa gaacctaacc gccaactgct taagggaggg cagcgctggc 3180 taacagacgc ccggaaacaa actgttctgc agatccccag gccacaatcc acccgacaag 3240 tgagagaatt cctggggtcg gcaggatttt gcagactatg gatacctggg ttcgcagaac 3300 tggctaaacc cttgtatcag gcaacacggg ggcaacagcc 
atttaattgg acagacgaag 3360 ccgagttggc cttccaacag attaaaaccg ccctactctc cgcgcctgca ctaggactac 3420 ctgatgttac caagcccttc cacttatacg tggatgagaa taagggtgtc gccaaggcgg 3480 taataactca gaacttaggc ccctggcgga ggccagttgc ctacctgtca aagaagttag 3540 acccagtagc tgccgggtgg cccccttgtc tccgaatgat tgcggccacg gctctgatgg 3600 tgcaagatgc tgataaactt gtcatggggc aagaattgcg ggtcgttact ccacatgcca 3660 tcgaaggtgt actcaaacag ccacctaatc gatggatgag taacgcccgg ctcacccact 3720 accaaggact actactaaat cctctcagga taattttcct gcccccaacg accttaaacc 3780 ctgcctcgct gctgcccaac ccggacctgg acgccccact ccatgactgc accaagatac 3840 tagctcaggt gcacggagtt cgagaagacc tgcaggaccg cccacttcct gacgccgacc 3900 tcgtctggtt cactgatggg agcagcttca tgcatcaagg ccagaggtac gctggagcgg 3960 cagtaacttc agagactgag gtaatctggg cggaacccct gcccccgggg acatcggccc 4020 agaaggccga actgatagcg ctcacccaag ctcttacctt aggggcgggg aaaaagctga 4080 cagtatatac agacagccga tatgcttttg caacggcgca tatacatggg gccatttaca 4140 gggagcgagg gttactgacg gctgaaggaa aagagataaa aaacaagcaa gagatcctag 4200 ccctgctaac agccctatgg aggccagaaa aattagccat tgtacattgc ccagggcatc 4260 agaaactaac tactccaact gctcaaggca actttctggc agaccaaact gcaaggaatg 4320 tggcgaaggc tcccagccaa ctccttgcac tccagctccc tgacccgggc ccccgggact 4380 tgccatattt ccctgaatat tcagaacaag atctccagtg gattgacaaa cttcccctga 4440 aacaaatcca gaatgggtgg tggactgata ctaatgacca aaccatccta ccagaaaaat 4500 taggacaaca ggtgttagaa cacatccacc gaaccaccca cctgggggcc cggcggatga 4560 tagacctgat cagacgctcc aagctcaaaa tcagacatat agctgagacg gccagcagta 4620 tcgtgacaag ttgcaaagtc tgccagctta acaacgcata cccccaatct caagctgcaa 4680 caggaacaag gctcagggga accaggcccg gtatctactg ggaagtagat tttactgaaa 4740 taaagccagg aaagtacggg taccggtact tacttgtctt tgtagatact ttttcagggt 4800 ggactgaagc attcccaacc aaaagagaaa ctgctcaggt cgtagcaaag aaaattctgg 4860 aagatatcct tcccaggtat ggcttcccca tccagatagg gtcagataat gggcccgctt 4920 tcgtcgctaa ggtaagtcag gacttggctt ccatccttgg ggcaaattgg aaactacatt 4980 gcgcttacag gccccagagt tcaggacagg tagaaaggat gaatcggacc 
ttaaaagaga 5040 ccttaactaa attgactata gagactggcg ctaattgggt agtccttctc ccctatgctc 5100 tgttccgggc ccgtaatacc ccttacaaac tgggcctcac cccttacgaa atcatgtatg 5160 gcagacctcc acccctggtt cctagcttaa aagatgacct gcttaagtct gaaacagaaa 5220 atgtctctga attcttattt tccttacaag ccttacagaa aattcaccaa gaaatctggc 5280 ccaagctgaa agagctatat gagaccagtc ccccaccgac accccatccg taccagccgg 5340 gagactgggt cctggttaag cgacaccgac aagagaccct agagcccagg tggaaaggac 5400 cactccaagt actcctgacc acacccaccg ccctgaaggt agaaggcatt gcgtcgtgga 5460 tccactacac ccacgtcaag ccagtggacc caacctccga ccttctgggg ccaatcacgg 5520 cggcggcggc tgaagcaccg gacacgtgga ctgtggacag agctaagaac aaccccttaa 5580 aactcaccct gcgccggcag cataactcac tgcaaacatg cagttaggta gtctaactct 5640 aacattagtc gccctagtgg ccgctgggga aaacataaag ccagctccta atccctttgt 5700 ctggagattc tggctttatg aaaaccaaac ccaccctggg caacctcata agcccgggaa 5760 attagtggcc agtgcagatt gcccctcctc agggtgcaat agcccaattt tactaaattt 5820 taccgatttc ccagtagcca aaccagtggc accaataata tgcttcgagt atgatcagac 5880 tgaatacaat tgtaagcact attggtggca ccaaagtgcc ggctgccctt ataactattg 5940 taacatccat aaataccaat ggtggggtgg agaagaacag atagatccca gatggccctt 6000 ccatcgcaga cgagatagag acctttcata tacatggata gttagagacc cctggaactc 6060 ccgctggacc acgcctcaac acggggctgt atactactcc tccgcctcca catggcctag 6120 cagtcacctc tatctgtggc ggggtctagt gcaggtacgg cccctggtcc atggaaatat 6180 ccagcgacaa gaaaaccgcc tgacacaaga tttacgtcct ttttcctggt taaaattatt 6240 gcaagaagga ttagaacttg ccaaccttac aggacttcac agcctgtctg gctgctttct 6300 atgtgccact ctagggcgtc caccgctaac cgctgtcccc ctgccatggg gatcatccac 6360 ctctgcccaa gctaacaacc accaaaacct ctcatatgcc cctatcccta acgtgccact 6420 atacctaaac cccagtcaag agaagtttcc ctactgtttc tcaggaacta attccagcct 6480 ctgcaacatc actgcaacgc cccctaacat caccttaagg gctccgtcag gcatattctt 6540 ctggtgtaat ggaacattat ctaaaaacct atcaagcccc tctgttacca acctactgtg 6600 tcttcctgtc acattagttc cccggttaac tctacttact gccggcgagt tcctagggta 6660 taccggtaac tggactagtg ctgttattca cccagaccct agaccgagac ctgcacgagc 
6720 catatttctc cccctcattg caggaatctc cctcaccgca tccttcatgg cggccggact 6780 ggctggggga gccctaggtc acacccttat agaaagtaac aagctgtacc aacaatttgc 6840 cgttgctatg gaggagtcag ctgagtccct tgcctccctc cagcggcagc tcacgtccct 6900 agcacaggta accttgcaga accggagggc cttagaccta ctcactgctg aaaaaggggg 6960 aacgtgtatg tttctaaagg aagactgttg tttctacata aatgaatcag gactcgtgga 7020 agaccgagtc caacagttac gcaagttaag cacagaagta agaacacggc agtttgcttc 7080 agctgcagac caatggtgga actcatctat gttttctctg ttagccccct tccttggacc 7140 cctgctgagt ctactatttc tgcttaccgt aggaccttgt gttgttaaca gaattttgcg 7200 gttcgttaaa gaaaggttta acactgtaca actcatggtc ctcagagccc aataccaacc 7260 tgtaaacgct gaaacagaat cagacttata agacccaaga ttggctctaa aaaatacctg 7320 aaaagaaagg gggggaa 7337 // ID LTR7Y repbase; DNA; PRI; 472 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 14-NOV-2005 (Rel. 10.11, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; LTR7B; LTR7Y_LTR; LTR7Y. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-472 RA Smit A.F.; RT "LTR7Y - ERV1 Endogenous Retrovirus from primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC 2% div; 5 bp TSD, but HERV-H internal. Notice TG...AA like LTR7A. 
XX SQ Sequence 472 BP; 119 A; 153 C; 79 G; 121 T; 0 other; tgtcaggcct ctgagcccag gccaggccat cgcatcccct gtgacttgca cgtatacatc 60 cagatggcct gaagtaactg aagatccaca aaagaagtaa aaacagcctt aactgatgac 120 attccaccat tgtgatttgt tcctgcccca ccctaactga tcaatgtact ttgtaatctc 180 ccccaccctt aagaaggtac tttgtagtct cccccaccct taagaaggtt ctttgtaatt 240 ctccccaccc ttgagaatgt actttgtgag atccacccct gcccaccaga gaacaacccc 300 ctttgactgt aattttccat taccttccca aatcctataa aacggcccca cccctatctc 360 ccttcgctga ctctcttttc ggactcagcc cgcctgcacc caggtgaaat aaacagccat 420 gttgctcaca caaagcctgt ttggtggtct cttcacacgg acgcgcatga aa 472 // ID LTR11_Mim repbase; DNA; PRI; 370 BP. XX AC . XX DT 09-NOV-2009 (Rel. 14.11, Created) DT 09-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR11_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-370 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2967-2967 (2009). XX DR [1] (Consensus) XX CC ~90% identical to consensus. 5bp tsd. CC Distant similarity to LTR8A_ML from bat. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 370 BP; 78 A; 117 C; 81 G; 94 T; 0 other; tgtaagggaa actgcatttg tataggcaac aggtttctta ggccaaacta agatgactgc 60 cacagtgttt tttccggccc ggcccccccg gctagctggc cttccgggga ggcatgagtc 120 agcacactca tgtaaccaag gtgttatctc ctgtgtgaac ccatgtgatc acgctttccc 180 atggcatgtt gccactattg ttcagagcca tatataagcg ctcgccatgt tctcggccat 240 gcttttcgcc actgctgtat ccccccgaca ataaagagca tgtctcacct gcctgctgcc 300 actcgccttt tcttccaatt tccgaagcct gcgccggagc acacgctagc cacagagcat 360 cctcctcaca 370 // ID hAT-2_TS repbase; DNA; PRI; 3541 BP. 
XX AC . XX DT 21-APR-2009 (Rel. 14.1, Created) DT 21-APR-2009 (Rel. 14.1, Last updated, Version 1) XX DE hAT-2_TS is a family of autonomous DNA elements found also in DE Anolis carolinensis, Microcebus murinus, Myotis lucifugus, DE Monodelphis domestica, Otolemur garnetii, Echinops telfairi, DE Xenopus tropicalis and Schmidtea mediterranea. About five DE elements exist in the genome at 3381bp in length. XX KW hAT; DNA transposon; Transposable Element; hAT-2_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-3541 RA Novick P.A., Smith J., Ray D. and Boissinot S.; RT "Independent and parallel lateral transfer of DNA transposons in RT tetrapod genomes."; RL Gene 449(1-2), 85-94 (2010). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 1522..3327 FT /product="hAT-2_TS_1p" FT /translation="MMSRKRKIDSECRIFKEQWTYDYFFMQYKERAVCLIC FT QNIVSVFKEYNLRRHYQTQHKDKYDCLVGEVRKEKILKLKNTLTTQQNTFV FT KQKQLNISSLRASFQVAKLISMHWQTICGGEFVKECLLSVAKEMCPEKADL FT FSTVSLXGPTITRRIEEMGDNLHQHLQNSTKKLSYFSLALDESNDVRDSAQ FT LLIFIRGTNDYFEVTEELAALQSIKGTTTGEDIYEKXCQTVNGLELDWAKL FT ASVTTDGAPSMVGSKKGVIARINQEMGKHNHSHPIAIHCLIHQQALCSKSL FT KWDSVMKIVVSCVNFIKANALNHRQFQEFLSELNVTYEDVLYHTEVRWLSR FT GRVLKHFYDLLPQITAFLLSKNKEVPELNDAEWKWHLXFLTDVTELLNSFN FT VQLQGKGKLICDMQLHVKAFEVKLGLLIKQVKEENFFHLPTTQNLLAEKPL FT VAFPNKTCVDSLEKLQKEFQFRFKELHLHEQDIQLFRNPFSIDIENMDTIY FT QMELAELQNCDSLKDAFKSSSLPNFYASLPSETYPNIRNHVLKMATVFGST FT YVCEQTFSRMKHLKSPTRSRLTDAHLHQLLRLAVTNMEPDIDHLISQKEAH FT SSH*" XX SQ Sequence 3541 BP; 1013 A; 673 C; 788 G; 1061 T; 6 other; caggggtcct caaactacgg cccgcgggcc acatgcagcc cgccaaggac atttatctgg 60 cccaccgggt gtttttgccg ccgctgcctg tcctgcctag cagccaactc gtccgggcca 120 cagtgcgcat gtgtggaatg tacatctctc tctccgactt ccctccttct ctctgtctct 180 cggctcctcc tttcaatctc gggtgtgatt gaacgagtca tgagcttgcc tatgcagagc 240 ctgctgctgc ctgaggaccg aggtaagaac
aagttaggat tttttttttt tgaagttagg 300 aggtctattt ttttttttaa ttttgcagtt agtagggcct tttatttgta gttaaggggg 360 gccttttttt ttctgaagtt aggaggtcta tttttttttt tttgcagatg ggggcacctt 420 ttatttttga agttaggaga gccttttttt gaagttagga gagccttttt ttaaagatag 480 gagagccttt tcttttttga agttggttag ttggttgggg ttggtttctg gggggggtgc 540 atcacagtga taatgcaaat agcactcagt gctaatgcaa atggtcagtg ctcagaggta 600 atgcaaatgg tcagcactca gaggtaatgc aaatagtcag tgctcagtgg taatgataat 660 tgtaagtgct cagtgttaat gcaaatggtc agcactcagt attaatgcaa attatcagtg 720 gtcagtgtta tcgcaaatgg tcagtagtca gtgttaatgc aaatggtcag tgctcagtgt 780 taatgcaaat agtcagtgct cagtgttatc gcatgggggc cccaaactgg taatctgcct 840 agggccccat gggaacttaa tcctgctctg cagacagchg aggagtagga aacccaattt 900 aattgacagt aagtgcattt gtattctgat tgctattcag ttgtgtatga tgttgtatgt 960 tgtgtgatgt gtaagccctg gttcacactg ttgcaatctc tgagcagtgc gagttcagcc 1020 atatgcttgt atggctgaac ttgcattaga ttcggaagaa aaaaggcata cgtaccttct 1080 tttttcctgc agtggaatct gattghatgd gtcttcttac ccatgcaatc agattcctgt 1140 gcgagttcac agatcgcagt gtggttcgca cagggtagtg tgaactggaa aggtggtgga 1200 ggaaccggct ctgtaatcgt gccagttccc gcaccgcacc agtgtgagcc tgaggtaaaa 1260 gcagagttcc accaacatgg gcactggtga ggctgaaatg atgtggactg gtaaggctac 1320 attgatggac actgatcaga ctgcattgat ggacactgat cagactgcat tgatgggcag 1380 tgcagtctgt atgtctctgt gtgggcaaag ttattgctgg tatattgttt ttgtagcgct 1440 gtgtgtgtgt atatatatta tatatatata tgtatatata tatgtatttt actaatagca 1500 atttggaatc cctaggaaac aatgatgtca agaaagagaa aaattgactc ggagtgtaga 1560 atattcaaag aacagtggac ttatgattac tttttcatgc agtacaagga aagagctgtg 1620 tgtttgatat gccagaatat agtgtctgtg ttcaaagaat acaatttgcg tcgacactat 1680 caaactcaac ataaagacaa atatgattgt ttggtcggag aagtgagaaa agaaaaaata 1740 ttaaaactga aaaatacatt gacaactcag caaaatactt ttgtgaagca gaagcagcta 1800 aatatttcat cactacgagc aagttttcaa gttgccaagc taataagcat gcactggcag 1860 accatttgtg ggggagaatt tgttaaagaa tgccttcttt ctgttgccaa agagatgtgt 1920 ccagagaagg ccgatttatt tagtacagtg agtctttbag gacctacaat 
tacacgaagg 1980 attgaagaaa tgggagacaa tttgcatcag catttgcaaa actccacaaa aaaactttcc 2040 tatttttcct tggcactcga cgaaagcaat gatgttcgtg attctgcaca acttctaatt 2100 tttattcgtg ggacaaatga ctatttcgaa gtcacagaag agcttgctgc actgcaaagc 2160 atcaaaggaa caactacagg agaggatatc tatgaaaagb tttgccaaac tgtgaatggt 2220 ttggagctgg actgggctaa actagccagt gtgacaactg atggtgctcc tagcatggtg 2280 gggtctaaga aaggagtaat tgctcgcatt aaccaagaga tgggcaaaca taaccattct 2340 catccaatag ccatacactg cctcatccac caacaagcgt tgtgtagtaa atcactgaag 2400 tgggactctg ttatgaaaat tgtggtatct tgtgttaact tcattaaagc taatgcacta 2460 aaccacagac aatttcagga atttctgtct gagctaaatg ttacctatga agatgttctg 2520 taccacacag aagtccgttg gctgagtcga gggagagttt tgaaacattt ctatgactta 2580 cttccacaga ttacagcttt tctgctttca aaaaacaaag aagtaccaga gctcaatgat 2640 gcagaatgga aatggcacct tgbctttctg acagatgtaa cagagctact caacagtttc 2700 aatgtgcaac ttcaaggaaa ggggaagctc atctgtgata tgcaattaca tgtgaaagca 2760 tttgaagtaa aattaggcct cctcatcaaa caagtgaagg aggaaaactt cttccatctc 2820 cccacaactc aaaatctgtt agcggaaaaa ccattggttg cattcccaaa caaaacatgt 2880 gtggattcac tggaaaagtt gcaaaaggag ttccaattta gatttaaaga gcttcatctc 2940 catgaacagg acatacagct tttccgtaac ccattttcta ttgacattga aaatatggat 3000 acaatttacc aaatggaact ggctgaactg cagaattgtg actctctgaa agacgcattc 3060 aagtcaagca gccttcctaa tttctatgca tctctcccct ctgagacata tcctaatatc 3120 aggaaccatg tactcaaaat ggcaactgtc tttggcagca cttatgtctg tgaacagact 3180 ttttccagaa tgaaacatct gaaatctcca accagatcta gactaactga tgcacacttg 3240 catcaattgt tacgactagc agtaacaaat atggaaccgg acattgacca tctcattagc 3300 caaaaagagg cccatagttc ccattgaaat actggtaagt ttgttgattt aactttactt 3360 gttcttcatt ttaaatattg tatttgttcc cattttgttt ttttcacttc aaaataagat 3420 atgtgcagtg tgcataggaa tttgttcaca gttttttttt tttttaaact atagtccgcc 3480 cctccaacgg tctgaggaac agtgaactgg cccccttgtt ttaaaagttt gaggacccct 3540 g 3541 // ID TINE2 repbase; DNA; PRI; 87 BP. XX AC . XX DT 31-DEC-2009 (Rel. 15.03, Created) DT 20-NOV-2010 (Rel. 
15.07, Last updated, Version 5) XX DE retrogene from the LTR portion - consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; TINE2. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-87 RA Jurka J.; RT "SINE elements from tarsier."; RL Repbase Reports 10(3), 518-518 (2010). XX RN [2] RP 1-87 RA Bao W. and Jurka J.; RT "TINE2 is a retrogene from the LTR portion."; RL Direct Submission to Repbase Update (16-NOV-2010). XX DR [1] (Consensus) XX CC >92% identical to consensus. CC This sequence was derived from sequence data generated at the CC Washington University School of Medicine Genome Sequencing CC Center, and assembled at the Broad Institute. TINE2 is classified CC as retrogene (see comment of TINE1). XX SQ Sequence 87 BP; 26 A; 24 C; 12 G; 25 T; 0 other; ggcaacccct tcgggtcccc tcccactgtg ggagcttttc tgtcctctca ataaatcctg 60 ctttctaaat aaataaataa ataaaaa 87 // ID PMER1 repbase; DNA; PRI; 90 BP. XX AC . XX DT 25-APR-1997 (Rel. 2.03, Created) DT 28-AUG-2008 (Rel. 3, Last updated, Version 3) XX DE Putative non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; URR1; PMER1. XX OS Strepsirrhini OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates. XX RN [1] RP 1-90 RA Smit A.F.; RT "PMER1."; RL Direct Submission to Repbase Update (30-NOV-1996). XX DR [1] (Consensus) XX CC A prosimian-specific MER1-type DNA transposon fossil similar to CC bp 1-54 CC and bp 202-236 of PR/URR1 in rodents. XX SQ Sequence 90 BP; 23 A; 22 C; 26 G; 19 T; 0 other; tatagcagcg gttctcaacc tgtgggtcgc gacccacagg aactgtatta aagggccgcg 60 gcattaggaa ggttgagaac cactgctcta 90 // ID L1C_Mim repbase; DNA; PRI; 5993 BP. XX AC . XX DT 07-JAN-2010 (Rel. 15.03, Created) DT 07-JAN-2010 (Rel. 
15.03, Last updated, Version 3) XX DE LINE element - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1C_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-5993 RA Jurka J.; RT "LINE1 elements from the mouse lemur."; RL Repbase Reports 10(3), 246-246 (2010). XX DR [1] (Consensus) XX CC >98% identical to consensus. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX FH Key Location/Qualifiers FT CDS 826..1827 FT /product="L1C_Mim_1p" FT /translation="MGRNQRKNSGNMKNQTENTPPRRSTSPLETDTDQNQA FT TNMTEEEFRMWIIRTLTQLQQQLNNQHQETTKSLQDMGQRFNKEIDTVKKX FT XTELLEMKNQLRELQNTVESLKNRVDQAEERISELEDNTLQLNKSVTEIEQ FT RNKRKDQSLQELWDYVKKPNVRVIGLAEGEEDNTQGLDKLFEDIIEENFPG FT LAQNLDIQVQEAQRTPGRFNANRKTSRHAVIRLTKVSTKEALLRAVRQKKQ FT VTYKGKPIRITSDFSNETLQARRDWGPILTLLKQNNAQPRILFPAKLSFIY FT EGEIKTFSDKQRLREFTKTRPALQEVLKTALRTEHHNNNPRI" FT CDS 1872..5744 FT /product="L1C_Mim_2p" FT /translation="MAQDRNHSNNIQPNRMISNLPYLSVLSINVNGLNSPL FT KRHRLAEWIRKYRPSICCLQETHLTCKDAHRLKIKGWRSIFQANRSQKKAG FT VAVLISDDLVFKPTKVVKDKEGHYIMVKGTVQQEEITILNIYAPNLGAPRF FT IKQTLLELSKWINSNSIIAGDFNTPLTARDRSSKQKINKEIMDLNKTLEQL FT GLTDIYRTFYPKSTEYTFFSSAHGTFSKIDHILGHKENLKKFKKIEIIPCT FT FSDHSGIKLEINPNRNSHFYTKTWKLNNLLLNDYFVNEEIKTEIKNFYEEN FT DNGETSYQLLWDTAKAVLRGKFISINAYNQKARRSQIDNLMKRLKELEKEE FT QTNPKPSRRSEINKIKSELNEIENRKAIQEINKTKSWFFEKINKIDTPLAK FT LTKSRKEKSLISSIRNKKGDITTDPKEIQDTIYEYYKNLYAHKLENVEEMD FT KFLETHSLPRLNQEEIDSLNRPISTAEIETAIKNLPKKKSPGPDGFTPEFY FT HTYKEELVPILQKLFHNIEKNGNLPDTFYEANITLIPKPGKDATKKENYRP FT ISLMNIDAKIFNKILANRIQTLIKKIIHHDQVGFIPGMQGWFNIRKSINAI FT HHINRSKNKDHMILSIDAEKAFDKIQHPFMIRTLKKIGIEGTYLKMIQAIY FT DRPIANIILNGERLKSFPLRTGTRQGCPLSPLLFNIVLEVLATAIRQENGI FT KGIQIGAEEIKLSLFADDMILYLENPKDSTKKLLELINEFSKVSGYKINTQ FT 
KSEAFIYANNNLIENQIKDSIPFTIATKKLKYLGIYLTKEVKDLYRENYET FT LRKEIAEDVNRWKSIPCSWIGRLNIIKMSILPKLIYRFNAIPIKIPSAFFT FT DIEKIILRFVWNQRRPRISRAILGNKNKMGGINMPDIKLYYKAVVIKTIWY FT WHKNRNIDQWNRCENPDIKPSSYSHLIFDKADKNIRWGKESLFNKWCWENW FT IATCRRLKQDPHLSPLTKTNSRWITDLNLRYETIRTLEEKVGNTLLDIGLG FT KEFMKKSPKAITAATKINKWDMIKLQSFCTAKEIVMKVNRQPTEWEKIFAS FT YASDKGLITRIYLELTKIRKKKSNNPIKKWAKDLNRNFSKEDRRMANKHMK FT KCSTSLIIREMQIKTTMRYHLTPVRMAFIKKSPNNKCWRGCGERGTLLHCW FT WDCKLVQPLWKAIWRYLKAIQVNLPFDPAIPLLGIYPNDPVTLYKKDTCTR FT MFIAAQFIIARLWKQPKCPSIQEWINKMWYMYTMEYYSALRNNGDIAHLIF FT SWLELEPILLSEVSQEWKNKHQIYSPANWY" XX SQ Sequence 5993 BP; 2334 A; 1317 C; 1165 G; 1174 T; 3 other; ggcaccgtgt tcccaggaaa acggtgccga ctcagaggct gagagacata gacccagctt 60 gggctccctg tgggtgaatt aggaccggaa accctctccc tggtgggaat acagtttgaa 120 ctctgggacc cagaggtcgg acctgcagac cagatcccct gcaccgaggg ctagcattgc 180 ccggggcaca gaagggttat acgtgaacag cctactgagg tctgtgtgcc tccaggggcg 240 gatcggcgtc ctagagggcg accctcctcc caggaggagg ccgtgcgccc aacccaggtg 300 gcgttcctgt gcagggaacc tccccgccgg catcacagtc cggggaggcc tggtggcttg 360 tggtctggcc tgctggcaga ggcccaggag tagctgcgga gttggggagg gtggaaagaa 420 gcgaggcctg ctgcagactg cgggtctcag acagccccac ccccacaccc agactttctg 480 gctgagcggg accattccag ccccgccctg acagctttcc ctggaagcag agaacagaac 540 tttgacccct gctaacggcc tgagggcagg cttacccaac ccagctccgc ccagaacgag 600 agctgataac aggactcaaa atcaacacca tagcctgttc ctccaagcaa acgccaccta 660 ctgacaggga cggcatcttg cacagccttt ccacggcacc cactgactca atatacaggg 720 agtggtccaa tttcacccac aggcaccacc taacgcctca gaaactaaac aaggtgtgtg 780 aatacccaaa caataaccta aggaaagaaa caacaactga tcgacatggg aagaaatcag 840 cgaaagaact caggaaatat gaagaaccaa acggaaaaca cacccccaag gaggagcacc 900 agccccctag aaacggacac cgaccaaaat caggcaacca atatgacaga agaggaattt 960 cgtatgtgga tcataagaac actcacccag ctgcaacaac aactcaataa ccaacaccaa 1020 gaaaccacaa aaagcctcca ggatatggga caaaggttca acaaagagat wgacacagtg 1080 aagaaaastk taaccgaact cctggagatg aagaatcaac tcagggaact acaaaataca 1140 gtggaaagtc 
tcaagaacag ggtagatcaa gcagaagaaa gaatctcaga gcttgaagat 1200 aacaccctcc aattaaataa atcagtcaca gaaatagagc agagaaacaa gagaaaagac 1260 caaagcctac aagagctgtg ggattatgtg aaaaaaccta acgtgagggt cataggttta 1320 gccgaagggg aggaagacaa cactcaaggg ctggacaagc tttttgaaga tataatagag 1380 gaaaatttcc caggccttgc tcaaaatctc gatatacaag ttcaagaagc ccagaggacc 1440 cctgggagat tcaacgcaaa caggaagacg tcacgtcatg cagtcatcag actgaccaaa 1500 gtatcaacta aagaggccct tctaagagct gtaagacaaa agaagcaagt gacatacaag 1560 ggaaagccaa ttcgaataac atcagacttc tctaatgaga ctttacaagc aaggagagac 1620 tggggcccca ttctcactct tttgaaacaa aacaatgccc agcctagaat attattccct 1680 gcaaaactaa gcttcatata tgaaggagaa ataaaaacat tctcagacaa gcaaaggctc 1740 agagaattca ccaagacaag accagcccta caagaagtac ttaaaacagc gttacgcacg 1800 gaacatcata ataataatcc acggatataa aaacaaccaa aacccaaaga tattaaaggc 1860 cagatattac aatggctcaa gacagaaatc atagcaacaa catccaaccc aacagaatga 1920 tcagtaatct accttaccta tcagttctct caataaatgt gaatggctta aactctccac 1980 tcaagagaca taggctggct gaatggataa gaaaatacag gccaagtata tgctgtcttc 2040 aggaaacaca tttaacctgc aaggatgcac atagactaaa aataaaaggg tggagatcaa 2100 tattccaagc aaatagaagc caaaagaagg ctggtgtggc agttctaatt tcagacgatt 2160 tagtttttaa accaacaaaa gtagtaaaag acaaagaggg tcattatata atggtgaagg 2220 gcacagtcca acaagaagag ataacaattt taaatatata tgcacccaac ttaggtgcac 2280 ccagattcat aaagcaaacc ttactggagc taagcaaatg gattaatagc aactccataa 2340 tcgccggaga tttcaacacc ccactgacgg cacgagacag atcctccaaa cagaaaatta 2400 ataaagaaat aatggactta aacaaaactc tagaacaatt gggtctgaca gacatctaca 2460 gaacattcta cccaaaatcc actgaatata cgttcttctc atcagctcac gggacattct 2520 ctaagattga ccatatccta ggacacaaag aaaatctcaa gaaatttaaa aaaatagaaa 2580 tcataccatg taccttctca gatcacagtg gaataaaact agaaatcaac cctaacagaa 2640 actcacattt ctacacaaaa acgtggaaat taaacaacct cctactaaat gattacttcg 2700 taaatgaaga aatcaagacg gaaataaaaa acttctatga agaaaacgac aatggagaga 2760 caagttatca actcctctgg gacacagcta aagcagttct gagaggaaag tttatctcca 2820 taaatgccta taaccaaaag 
gcaagaagat cacaaataga caatctaatg aaacgactca 2880 aagagctgga aaaagaagaa cagaccaacc ccaaacccag cagaagaagt gaaatcaaca 2940 agatcaaatc agaactaaac gaaattgaaa acaggaaagc tattcaggag attaataaaa 3000 caaaaagttg gttctttgaa aaaataaaca aaattgacac accattggct aagctaacga 3060 aaagcagaaa agagaaatct ctaataagct ccatcaggaa taaaaaagga gatatcacaa 3120 ctgatcccaa agagatacaa gatacaattt atgaatacta caaaaatctt tatgcacaca 3180 aactggaaaa tgtggaggaa atggacaaat ttctagaaac acacagcctc cctaggctca 3240 accaggaaga aatagattcc ctgaacagac caatctcaac agctgaaata gaaacagcaa 3300 ttaaaaatct ccctaaaaag aaaagtcccg gtccagatgg cttcacacct gaattttacc 3360 atacttacaa agaagaacta gtacctatct tgcagaaact attccacaac atcgagaaga 3420 acggaaacct ccccgacacc ttttatgaag cgaatattac tctgatacca aaaccaggaa 3480 aggatgcaac aaaaaaagaa aactacagac caatatccct aatgaatata gatgcaaaaa 3540 ttttcaacaa aatcttagct aaccgaatcc agacacttat caaaaaaata atccaccacg 3600 accaagtggg cttcatccca gggatgcagg gatggttcaa catacgtaaa tctataaatg 3660 caattcacca cataaacaga agcaaaaaca aagaccacat gattctttca atagatgcag 3720 aaaaagcttt tgacaaaatt caacaccctt tcatgatacg aacacttaag aaaataggca 3780 tagaagggac atacctaaaa atgatacaag ccatatatga cagacccata gccaacatca 3840 tactgaatgg ggaaagattg aaatcattcc cacttagaac tggaaccaga caaggctgcc 3900 cactatctcc acttctgttc aacatagtgc tggaagtctt ggctacagca atcagacagg 3960 aaaatggaat caaaggtatc caaatagggg cagaagagat caaactttca ctgtttgctg 4020 atgatatgat attgtatcta gaaaacccca aggattcaac caagaaactc ctggaactga 4080 tcaatgaatt tagtaaagtc tcaggataca aaatcaatac acagaaatca gaggcattca 4140 tatacgccaa caacaatcta attgagaacc aaatcaaaga ctcaattccc ttcacaatag 4200 caacaaagaa attaaagtac ctaggaatat atttaaccaa agaggtaaaa gacctctaca 4260 gggagaacta tgaaacactg aggaaggaaa tagcagagga tgtaaacaga tggaaatcca 4320 taccatgctc gtggatcggc agactcaata tcatcaaaat gtctatacta cccaaactga 4380 tctacagatt caatgcaata cctattaaaa tcccatcagc attcttcaca gatatagaaa 4440 aaataatttt acgcttcgta tggaaccaaa gaagaccccg aatatcaaga gcaattctag 4500 gcaacaaaaa caaaatggga ggcattaata 
tgccagatat caaactatac tacaaagctg 4560 tagtaattaa aacaatatgg tattggcaca aaaacaggaa tattgaccag tggaacagat 4620 gtgagaatcc tgatataaaa ccatcctcat atagccatct catctttgac aaagcagaca 4680 aaaacatacg ctggggaaaa gaatccctct tcaataaatg gtgctgggaa aactggatag 4740 ccacctgtag aaggctaaaa caggacccac acctttcacc tctcacaaaa accaactcac 4800 gctggataac agacttaaac ctaagatatg aaactattag aactctagag gaaaaagttg 4860 gaaacactct cctagacatc ggcctgggca aagagtttat gaagaagtcc ccaaaggcaa 4920 tcacagcagc aacaaaaata aataaatggg acatgatcaa actacaaagc ttctgcacag 4980 ccaaagaaat agtcatgaaa gtaaacagac aacctacaga atgggagaaa atttttgcat 5040 cctatgcatc cgataaggga ctgataacta gaatatactt agaactcacg aaaattagga 5100 agaaaaaatc aaataacccc attaaaaagt gggcaaagga cttgaacaga aatttttcta 5160 aagaagacag aagaatggcc aacaaacata tgaagaaatg ctcaacatct ctaatcatca 5220 gggaaatgca aatcaaaacc acaatgagat atcacttaac cccagtgaga atggccttta 5280 tcaaaaaatc tccaaacaat aaatgctggc gtggttgcgg agagagagga acactcctac 5340 actgctggtg ggactgcaaa ctagttcaac ctctgtggaa agcaatatgg agatacctta 5400 aagcgataca agtgaatcta ccatttgatc cagcaatccc attgctgggc atctacccaa 5460 atgatccagt gacactctac aaaaaagaca cctgcactcg aatgtttata gcagcacaat 5520 tcataattgc aaggctgtgg aaacagccca agtgcccatc aatccaagaa tggattaata 5580 aaatgtggta tatgtacacc atggagtact attcagctct aagaaacaat ggtgatatag 5640 cacatcttat attttcctgg ttagagctgg aacccatact actaagtgaa gtatcccaag 5700 aatggaaaaa caagcaccag atatattctc cagcaaactg gtattaactg agtagcacct 5760 aagtggacac ataggtgcta cagtaatagg gtattgggca ggtgggaggg gggagggggg 5820 cgggtatata catacatagt gagtgagatg tgcaccatct gggggatggt catgatggag 5880 actcagactt ttggggggag ggggggaaat gggcatttat tgaaacctta aaatctgtac 5940 ccccataata tgccaaaata aaaaaaataa ttaaaaaaaa aaaaaaaaaa aaa 5993 // ID LTR14_Mim repbase; DNA; PRI; 481 BP. XX AC . XX DT 02-NOV-2009 (Rel. 14.11, Created) DT 02-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. 
XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR14_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-481 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2976-2976 (2009). XX DR [1] (Consensus) XX CC ~98% identical to consensus. 4bp tsd. CC Similarity to LTR14B2_Sar from common shrew. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 481 BP; 109 A; 122 C; 146 G; 104 T; 0 other; tgtaagggcc tggttgcagc ccaaagctgc tggctcactt gtgcgagccc ggataacaga 60 aaattggccc aggggaggag taatcggctt ctggaaactg ctagcttggg gcccaaaggt 120 agagctatcg gctaaacgca tatttctgct tctgagaatc gcttgcttgc agctagacgc 180 ataggtacgg tgccagataa gggagaaagg cccctttgcc gccggcgggc taccagtcca 240 ccaatcattt taaagactaa cacgcacaat cagcttgtgc agcgcgggtg ttcaagaggg 300 aggggggata aaagggcagc cccagctttg gtcagggtcc ttgcctgtaa gagcgaccac 360 tgcgctggca ctctagggcc tggaccctgg ctagccagaa aataaagctc ctcttgagtg 420 attgcatcct tggtgtcttt gttcgtctgc ctggcggggt gcaggaagcc ggtccctaac 480 a 481 // ID LTR20C_Mim repbase; DNA; PRI; 406 BP. XX AC . XX DT 11-NOV-2009 (Rel. 14.11, Created) DT 11-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR20C_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-406 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2979-2979 (2009). XX DR [1] (Consensus) XX CC ~89% identical to consensus. 4bp tsd. 
CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. XX SQ Sequence 406 BP; 94 A; 103 C; 103 G; 106 T; 0 other; tgtaacagag ggaatggcct gaaaaagggc aaaaatgttt tctgtctctt caaaaccccc 60 ccaccctttt tgagaactaa aacctgcatc cctgcctcag gccagtggtc ggaggggcag 120 gggagtgtcc tttgttcttt gtgccacagg agatggctca agggaattgt ccgggcggag 180 gtcacgagat tgtcttagcc gaagatggga tgaatcagaa cctttaaaag ctctgtactt 240 ctgctcagag gcaggacgtt ggtactttga gacgggagtc tgccaacctc ctcatttgcc 300 ggcaaattaa taaacttctc tttccttctc ctcaaaccac ttgtcctcgt tcttctgatg 360 cggcctcggg gacaagtgct gaactttcgg taacagttgc ccgcca 406 // ID LTR20_Mim repbase; DNA; PRI; 426 BP. XX AC . XX DT 11-NOV-2009 (Rel. 14.11, Created) DT 11-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE Long terminal repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR20_Mim. XX OS Microcebus murinus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Strepsirrhini; OC Lemuriformes; Cheirogaleidae; Microcebus. XX RN [1] RP 1-426 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the mouse lemur."; RL Repbase Reports 9(11), 2980-2980 (2009). XX DR [1] (Consensus) XX CC ~90% identical to consensus. 4bp tsd. CC Strong similarity to LTR20_OG from bushbaby. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project: mouse lemur. 
XX SQ Sequence 426 BP; 111 A; 101 C; 96 G; 116 T; 2 other; tgtaacagaa gtaagggcct gaaaaagggc aaaatgtttt acgagttgtc tctttaaaac 60 ccctcaccct ttttggggaa ttaaaacctg cattcctgcc cgaggccagt aatcagaggg 120 gcagaggagt gttctttgtt ctttgtttta aaccacagga gatggctcaa cggaattgtc 180 cgggcagagg tcacgagatc gtcttcaccg gagatgctat agttaaacag caatagcccg 240 aagctgaaaa ccccctttaa aagctctgta tttctgctta aagggaggac gatggtnctt 300 taagacgaga gtctgccatc ctcctcattt gccggcaaat taataaactt ctctttcctt 360 ttcctcaaac cgcttgtcct cgttcttcgt tcggccncgg ggacaagtac cgaactttcg 420 gtaaca 426 // ID L1-1b_TS repbase; DNA; PRI; 5968 BP. XX AC . XX DT 03-MAY-2010 (Rel. 15.05, Created) DT 03-MAY-2010 (Rel. 15.05, Last updated, Version 1) XX DE L1-type non-LTR retrotransposon - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-1b_TS. XX OS Tarsius syrichta OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Tarsiiformes; Tarsiidae; Tarsius. XX RN [1] RP 1-5968 RA Bao W. and Jurka J.; RT "Non-LTR retrotransposons from tarsier."; RL Repbase Reports 10(5), 767-767 (2010). XX DR [1] (Consensus) XX CC ~92% identical to the consensus. The 5'-end is different from CC that of L1-1_TS. 
XX SQ Sequence 5968 BP; 2331 A; 1417 C; 1099 G; 1118 T; 3 other; aggagactcc tgaagatggc gacggcatag gtggatccct gagactctcc gaacgtgagt 60 ttcccaaaaa gactattact tttcttcctg cacccccctg cccaccgcca gccgcactgg 120 tacgaggcct ggggactttt ggtcccacgc agggccagac attgcggccg gccgcggctg 180 ctccactata tgtgattccg gccggctggg gaatcccaaa tcccggatcc cagctccmtc 240 cctggctaca tcgtcatgac cagcacccag caccaagctg ccggtcaccc agggaggctc 300 tgggcgcccg cccctcagag ataccggtcc catttctggc tgaagtgccc tgtgagggac 360 ccagagccac cacctgccct ctggagcttt ggagatctga agccccgggc ccaccctggc 420 tgaggcactc agcgtgcgac cacaagcctg tgtaaggggg cagaacccca agctggtgta 480 aggtgccgat cgcaggcttc cagctccaga ggagggaacg cccgaccctc tcctccaccc 540 cagcctgctg gaggcacccc agatccagcg cccagctgtg cagctggtcc cgcctagaga 600 cccacaggag catccctaaa agtgaaagta aaactaccac ctccgccatt ttcaaggcat 660 atttacatcg catcgccccg cccacctatc agagcctcca cctcggttcc cacggtgacg 720 tcattaaaca ggagacaaag cccctcccct acgctgaata gagaccagtg tcacagatct 780 cccaggcagc aggctaactg cctgcaacaa aaacagaggg gagaggctgg agattagagc 840 tcaagggact ttatccacta agggacagag agaaaaatct acacaaatga agaaaaacca 900 aaagaagaat atgggtctct cccagactcc tgggagggca gacactgaga aaactgactt 960 tggaatgcaa acaatgaaga gcccccagaa tgactggtct caaaatacaa acctagacat 1020 caagacatta atggagagat taaaaagaat tgaggagact caagaagaaa ctagaaagga 1080 gctgatatct gagataacag taataaagaa tactgtgaat gaaataaaca acaaactgat 1140 aagcatggaa agcagaatta cccaagcaga agaaagaatc tcagagcttg aggaccaaaa 1200 tatagaamta acccaaactc tcaaaaacac agaaaacaag ctcaaaaaga cagaaaaaaa 1260 accttcaaga gatgagcgat tacctcaaga ggcctaacct aagagtaatt ggtctgcctg 1320 aggcagaaag agagacagag accacactgg aacaaacctt ccatgagatc attcaagaaa 1380 acttccctta tctaatcaat gatgcaaaaa ctcaaacaca agagatccag agaacccctg 1440 caagacaaca aatgagaaaa ccaactccta gacacataat aattcgccta aacaaagtag 1500 gcataaaaga aaaaatccta aaggcagcaa gagaaaaagg ccagatcacc taccatggaa 1560 aaccaattag aatagcagca gatttatcta cagaaaccct gcaggctagg agagcttgga 1620 gccctatctt caaagtccta aaagataaac 
aatttcaacc aagaataacc taccctgcca 1680 gattaagctt catcagcgag ggagaattaa aatctttccc agatatccaa tccctaagaa 1740 cttacgctgc caccaaacca cctctacatg aaacacttaa gaaagtacta aacacagaag 1800 aaaaggggga aaaaagagca acgttcttca caagagtaca ggaaaaagaa taaaatacac 1860 atgaaccaac ccccaaaaca aaagaaagac aataaaccaa gtggaagaac aactctataa 1920 gaactctatg atagggatga actctcacat ttcaataatt agcctgaatg tgaatggact 1980 aaatgcacca ctgaaaagac atagaatggc aaaatggata aaatatcacc aggcaacaat 2040 atactgcctt caagagaccc atctcactag aaaggacatg cacagactca aagtaagagg 2100 atgggaaaca aatttccagg cgaatggaac acaaaagaaa ggaggagtcg cgatcctaat 2160 ttcagacaaa ataccattta agctatcaaa aattaaaaaa gatacagagg gccactacat 2220 aatgataaaa ggttcactcc atcaacaaga aatatctatc ctaaacatat atgcacctaa 2280 cataggtgca ccaactttta taaagcaact cctaggaaaa ctaaagaaag atattgactc 2340 taacaccatc ataactgggg actttaatac cccactcaca accctcgaca gatcatcggg 2400 acaaaaaatc agcaatgaga tccggaacct caatgtgact ctggaccaaa tggacttaat 2460 tgatacctac agaacactcc atccaaagac cagagaatac acattctact catcaccgca 2520 tgggacgtat tccaagatcg accacataat cggccataaa tcaagcataa gcaaatttaa 2580 aaggaccgaa attctaccat gcaccttctc ggaccacagt ggaataaaaa taaacattga 2640 caccaacaag gtccccccaa aacccacaaa gacatggwca ctaaacagca tgatgctaaa 2700 caactcctgg gtcaatgatg aaatcaaaac agagatcaaa agatacctgg aaacaaatga 2760 aaatgaagaa acatcttacc aaaatctctg ggatgcctta aaagctgtag taagagggga 2820 atttatatcc ctacaaacac acatgaagaa aatggaagga gcacaaatta atagcctaac 2880 aagccaccta aggaagctgg aaaagcaaga ccacaaaaac cctaatttca gcagaagaat 2940 ccagatcacc aaaataaaag cccaaatcca ggacatagaa gacaaaaaga caatacaaaa 3000 aaatcaatga aacaaaaagc tggttcttcg aaaggataaa caagatcgat ggtcccctag 3060 ctagactgac caagaaaaag aaagaaaaaa cccaaataag cacaatcaga aacacaaaag 3120 atgaagtcac atctgaccct gaagaaatac aaaagatcat cagagactac tatgtacact 3180 tgtatggaaa caaacttgaa aacctcaagg aaatggagga ctttctgtca tcacacaacc 3240 tgcctaggtt gaaacaagaa gaaattgaga ccctaaatag accaataaca atcaaggaaa 3300 ttgactatgt aataagaaaa ctacctacaa aaaaaaagcc 
ctggaccaga tggctttcca 3360 gcagaattct acaagacatt taaggaggaa ctgattccaa tcctactgaa gctatttcag 3420 gcgattgaga aagatggaac cctccccaaa tcattttatg aagctaacat cacattgata 3480 cccaagccag gtaaagatcc aacaaaaaaa gagaactaca ggccaatatc tttgatgaac 3540 atagacgcta aaattctcaa caagatccta gcaaaccgga ttcaacaaca catctcaaaa 3600 atcatccatt atgaccaagt aggcttcatc cctgggatgc aaggctggtt caacattcgt 3660 aaaacaataa atgtaattaa atacatcaac agatgtcaaa acaaaaacca catgattata 3720 tcattagatg cagaaaaagc ttttgataaa atccagcacc ccttcttgat aaaaaccctc 3780 gaacatctag gcatacaggg aacatacctc aaagtagtaa aagccatcta cgagaaaccc 3840 acagccagca tactcctaaa tggacaaaaa ttggaaccat ttcccctgaa aactggaaca 3900 agacaaggat gcccactctc acccctcctg ttcaatatag tattggaagt cctggctaga 3960 gcaatcagag aagagaaggc aatcaggggt atccaaatag gaaaagagga agtcaaatta 4020 tctctctttg cagatgacat gatcgtgtac cttgaaaacc caagagaatc tgtcaaaaac 4080 ctccttacac tgataaaggc cttcggcaaa gtctcaggat ataaaataaa tgtgcaaaag 4140 acaatcgcat ttctttacac caataataaa caaacagaaa cccaaataag aagcacaatt 4200 ccattcacaa tagccaccac aaaaaaaatg aaataccttg gcatcttcct aaccagagac 4260 gtgaaagacc tttacaatga aaactacaaa actctgctca aagaaatcaa agatgacaca 4320 aacaagtgga aaaatatccc atgctcatgg attggaagaa tcaacattgt gaagatgtcc 4380 atcttaccta aggcaatcta cagattcaat gcaataccta tcaaattacc agcaacattc 4440 ttctcagacc tagaaaaaac aacacaggaa ttcatatgga aacacaaacg accaagaata 4500 gccagaacaa tcctcagcaa aaaaaaaaaa caaagcaggt ggtatcacat taccagactt 4560 caaactttac tataaagcta caatcatcaa aacagcttgg tattggtata ggaacaggca 4620 tatagaccaa tggaatagaa ttgagattcc agaggcaaga cctcaatttc tcaaccaact 4680 catcttcgac aaagcctcca ccacctacca ctggggagag gagaacctat tcagtaaatg 4740 gtgctgggaa aactggctga ccacatgcag aagattgaaa caggacccct atctatcccc 4800 atacacaaaa attaactcca aatggatcag agacctaaat gtaaaacctc aaaccataag 4860 aaccttagaa aatgaaggac ataccctcat ggaaattgga actggcatcc aattcctgaa 4920 caaaactcga aacccacagg ccataaggga taagatagac aagtgggacc tcattaaact 4980 gacaagcttc tgcaaagcca aagaaaccat caagagagca gggagacagc 
ctacagactg 5040 ggaaaaagta tttgccaact ccaggtctga caaaggctta acatcctgga tctacaagga 5100 actcaaacgt gctgaaaaga aaaaaacaaa caaccccatt ataaaatggg caaaagatat 5160 gaacagacac ttcacaaagg aagacatccg agcagccaac agacacatga agaaatgctc 5220 aacctcacta atcatcaggg agatgcaaat caaaaccaca ctgagatacc acctaactcc 5280 agtcagaatg gcaattatca acaactcaaa aaataacagc tgctggagag ggtgtggcga 5340 aaagggaaca ctcctacact gttggtggga gtgtaaacta gtgcaacctc tgtggaaagc 5400 agtgtggcga ttcctaaaag ctctaaacat caacctccca tatgaccctg caatccccct 5460 actgggaata taccctgaag aactcaaatc actctataaa aaagatacct gcacacgaat 5520 gtttatcgca gcattgttca caatagcaag aacctggaac caaccatgct gtccatcaaa 5580 agaggactgg attaaaaaaa tgtggtacat atacacgatg gaatactatg cagccataaa 5640 aaagaacaaa atcatgaatt tcgcagcaac ctggatggag ctagagtcta taatactgag 5700 tgacctctca cagaaacaaa gatctgagta tcacatattc tcactcatat agtggacctt 5760 gatcatccaa tgcactacca taagaaaatg actgacagtg ttgggaaact atgggggggg 5820 atgggactaa tggtagtaaa catctgtctg gggacgggga gacacctgtt atcaacaagg 5880 gggcctgaat gaagcatatt tgtataccta acccttaact gtaccccaca atatcaaaat 5940 aaaaaatatt gattaaaaaa aaaaaaaa 5968 // ID MER61A_LTR repbase; DNA; PRI; 343 BP. XX AC . XX DT 16-MAR-2006 (Rel. 11.03, Created) DT 10-APR-2007 (Rel. 11.03, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from primates. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; MER61; MER61A_LTR. XX OS Primates OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires. XX RN [1] RP 1-343 RA Smit A.F.; RT "MER61A_LTR - a subfamily of endogenous retroviruses from RT primates."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC mer4 group. 
XX SQ Sequence 343 BP; 73 A; 100 C; 95 G; 73 T; 2 other; tgagacagcc aggtgggaag gggtccccgg agaaactcca accagcctgc gcactgggag 60 gagtgcgcac tggggtggag ccacagaagt tcgcgccatt tgcagcgggg aggagcctgg 120 cccctcctct tcctgggtgg aacctgggat tcaatctgcg aggcgggaag cgcactagca 180 ggaacactct ggccttgcng agagtccctg tttccccttt tcttcctttt cacccaataa 240 aaccccgtct tactcaccct tcaaattgtc tgcgagccta awttttcgtg gccgtgtgac 300 aaggaccccg tctttagctg aactaaggaa aagtcctgca aca 343 // ID HAL1-1C_Cja repbase; DNA; PRI; 2629 BP. XX AC . XX DT 13-JUL-2010 (Rel. 15.07, Created) DT 13-JUL-2010 (Rel. 15.07, Last updated, Version 1) XX DE HAL1 non-LTR retrotransposon: consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW HAL1; HAL1-1C_Cja. XX OS Callithrix jacchus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Platyrrhini; Cebidae; Callitrichinae; Callithrix. XX RN [1] RP 1-2629 RA Bao W. and Jurka J.; RT "HAL1 non-LTR retrotransposons from marmoset."; RL Direct Submission to Repbase Update (13-JUL-2010). XX RN [2] RP 1-2629 RA Bao W. and Jurka J.; RT "Origin and evolution of LINE-1 derived "half-L1" RT retrotransposons (HAL1)."; RL Gene 465(1-2), 9-16 (2010) doi:10.1016/j.gene.2010.06.005.
XX DR [1] (Consensus) XX SQ Sequence 2629 BP; 924 A; 623 C; 529 G; 553 T; 0 other; ggccgggcgc ggtggctcac gcctgtaatc ccagcacttt gggaggccga ggcgggcgga 60 tcacctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaaccc cgtctctact 120 aaaaatacaa aaattagccg ggcgtggtgg cgcgtgcctg taatcccagc tactcgggag 180 gctgaggcag gagaatcgct tgaacccggg aggcggaggt tgcagtgagc cgagatcgcg 240 ccactgcact ccagcctggg cgacagagcg agactccgtc aaaaaaaaaa aaaaaaaaaa 300 aagagaacga tccaagatgg ccgatcgcta acatcccggg attgcagctc tcagggaagg 360 cgcggagaac tagaggacgc cacactttca gacaaagtct ggtcgctcac ggagcagaag 420 atcccccagt ggtggaaaca cacgggtcgc cagcgcgact ctcgtggtcg gcgcagcggt 480 tccgccggca cctcggcgcg gcagctctcg gcgcagagta aacgggaccg gttccccttc 540 tgaccgaggt ttggagcccc gggaaggcag agtcgcctac tacggaaaca agaaggaagc 600 ccgacaggag aatcctgggc agaaaagcac catcagtttt aacgccgctg ctctggccct 660 gggaactaac aacctggacg tccactcaag agacctaatc tgaaagttgg taatttcaaa 720 gacgacagga ggataaattt acaatgacgg gaagaaacca gcgtaaaaaa gctgagaata 780 ctcaaagtca gaacgcctct ccctctaaag atgatcacag ttccacatca acaatggaac 840 aaggcttgat ggagaacgag cgcctcctga tgacagaatc actcttcaag gaatggataa 900 taacaaactt cggtgagtta aaagaacatg ttgtagccca acgtaaagaa actaggaact 960 ttgaaaaaag gtttgatgaa atcctattga gaatagacaa cttagagagg agtatgagtg 1020 aattaatgga actgaagaat acaatacagg aactccgaga agtatgcaca ggtttaaaca 1080 ctcgaattgt tcaagcagaa gaagggatat cagaggtcaa agtccaactt aatgaaataa 1140 aacgtgaaga aaagattaga gaaaaaagga taaaaaggaa tgagcaaagt ctccaagaaa 1200 tgtgggacta tgtgaaaaga ccaaatttac gtttgatagg tgtacctgaa tgcgacggag 1260 agaatgaatc caagctggaa aatacccttc aggatattat tcaggaaaat tttcctaaac 1320 tagcaaagca ggtcaacatt caaccccagg taatacagag aacaccacaa agatattcct 1380 caagaagagc aaccccaagg cacataatcg ttagattcac cagggttgaa acgaaggaga 1440 ggatactaag ggcagccaga gagaaaggtc agattaccca caaaggcaag cctatcagac 1500 ttacagcaga tctctcggca gaaactctac aggccagaag agagtggggg ccaatattca 1560 acatcctcaa agaacagaac cttcagccca gaatttcata tccagccaaa ctaagcttca 1620 caactgaagg 
aaaaataaaa tcttttatga acaagcaaga actcagagat tttattacca 1680 ccaggcctgc tttacaagag cttctgaaag aagcattaca cacagaaaga aacaaccagt 1740 attagccttt ctaaaaatac accaaaaagt aaagagcacc aacataaaga agaatttaca 1800 ccaacaaatg gataaaacag ccagtcaaca tcaaatggca gtaaccctaa atttaaattg 1860 actaaattcc caatcaaaag acacagccaa aacccaacgg catgttacat ccagacctgt 1920 ttcacatgca aggatacaca aagactcaaa acaaagggat ggagaaagat ttaccaacca 1980 aatggagagc aaaaataaat aataaataaa taaaaagcag gagttgcaat tcttgtatcg 2040 gataaaatag attttaaagc aacaaagata tagtggtaaa aggatcaatg caacaacaag 2100 agctaacgat cctaacaccc agataggaga cttagattca atgagacaga aaattaataa 2160 ggatatcaag gactcgaact cagatccaga acaagtaaac ttaataaata tttatagagc 2220 tctccacttc aaatacacaa aatatacatt cttgtcaata ccacatcaca cctacccatt 2280 agtttaaatg aaacattgat tggccattat taatacccaa tttttttcaa aataaagcaa 2340 tatttccatt tactctccct ctttctcttc ctctttcttc ctctccttta cttatttttt 2400 ttttttcttt ccttctctca aaaaaaaaga aatcaacttg taaacctcta gatccaggtc 2460 ggcaatgtct ctttcattgc ttgatttcct ttcttccctt ccctccctcc ctccctccct 2520 ccccgcttcc tcccttcctt ccttccttca tcccttcctt cctccctacc gtccttcttc 2580 ccttccttcc tgccttcctc ccccccccaa aaaaaaaaaa aaaaaaaaa 2629 //