ID RLTR20B1_MM repbase; DNA; ROD; 566 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 20-AUG-2008 (Rel. 9, Last updated, Version 2) XX DE Mouse long terminal repeat - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW RLTR20B1_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-566 RA Pavlicek A. and Jurka J.; RT "RLTR20B1_MM - a family of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Similar to RLTR20B2 and RLTR28_MM. Individual CC copies are ~88% identical to the consensus. 6 bp TSDs. XX SQ Sequence 566 BP; 150 A; 101 C; 182 G; 133 T; 0 other; tgttgtggat ttggtttaat gctacttgtg ttatgttaat tgggttccca aaattacatg 60 ggaattcgca cgtctgcatg taagacgctg agggtccctg cccccagttg gctctaattg 120 gtaaataaag ttgccggtgg ccaatggctg ggcagggaga cagaggtggg actttagatt 180 tcccgggcaa gggaaccaag ggaagaagaa ggattttaga atcgccatgc cagggaagca 240 ggaggatcag gcttgagagc tgcaggagag aaagcataca gccatgtaag agccagggaa 300 gagcggcccc aggggcccct cccccgattg ggtctggggt agcaaagatg gaatatagat 360 tttagtaagt aataattcag gagtatcgga ggggaggcgt tagcaatgtg gaagtttggg 420 agtggcccag ccattgagct gcttagggca tattaaaata taaggctgtg tgttgtgtgt 480 ctttcattca agaatccaga gcatttgggg gcaggtagca aggaacacgc gctgccaccg 540 ccggggagtt tagagtagat taatca 566 // ID RLTR19D repbase; DNA; ROD; 438 BP. XX AC . XX DT 18-SEP-2008 (Rel. 13.09, Created) DT 25-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Long terminal repeat of endogenous retrovirus. 
XX KW ERV2; Endogenous Retrovirus; Transposable Element; RLTR19D. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-438 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats of endogenous retroviruses from mouse."; RL Repbase Reports 8(9), 1054-1054 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC International Collaboration for the Mouse Genome Sequencing. XX SQ Sequence 438 BP; 99 A; 104 C; 87 G; 147 T; 1 other; tgtgttaaat attaaaactt gacaatagct ctgccatatt ccctcaacct tcattgtttg 60 tagctcataa ccttaacttg acctattgtt ccttgccctg acaaaaacaa cttcttgttg 120 tactttgtca tggagcctcg tggactttgg agatccccta gcctccttgc ctctagaccc 180 agcctatcag cttaaaggtc agacagcgct gctttgtctt agatagccta tatgttgtaa 240 atgttgcaaa tgcacctgtt csttgttttt cccatataaa aaactggttc tgaggactgc 300 cctgggttgc agttttggat cccgactcct ggctgtgtcc ctgattaatc agtcttggag 360 tgtgcattca ataaaccatc gatacctgac tgagattggt gtctgagtgg tttgtgtggc 420 tattcctgga ccctaaca 438 // ID RLTR19_MM repbase; DNA; ROD; 467 BP. XX AC . XX DT 14-OCT-2002 (Rel. 3.4, Created) DT 14-OCT-2002 (Rel. 3.4, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW Long terminal repeat; retrotransposon; RLTR19_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-467 RA Jurka J. and Drazkiewicz A.; RT "RLTR19_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 4-4 (2002). 
XX DR [1] (Consensus) XX SQ Sequence 467 BP; 139 A; 67 C; 163 G; 98 T; 0 other; tgccccagag ggacaaaggg cagggaataa gagacaaaga caggagatag aggatgaggg 60 agaaggggaa gggaacaagg gagaggggga agggatattt gtcctggagg acaaaggact 120 gcctctggat agagaggaga cagacgtggc acataggaaa atggtagttt ataaaggtaa 180 aaggggaaac cctgtgttag gatgaggtgt ttaattttaa ttgggcatgt taattaggtg 240 agccaaaggg ggcttttgat tgctggactt caatactttg atagctggac cttggtagtc 300 agcctcagga ggaggaagtg gccaaataag ggaatagacc ttggtggcta gctttaggaa 360 tgtaatctaa tggtttttag caaggcagag ggaatggggg agaagggcaa ggcctgcggc 420 ctgccagagc catgtttgcc atgctcaggc tggctagagt cccttca 467 // ID MSTAR repbase; DNA; ROD; 1651 BP. XX AC . XX DT 01-MAY-1996 (Rel. 5.2, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 4) XX DE MSTa- LTR internal retrotransposon sequence - a consensus. XX KW Non-LTR retrotransposon; MaLR family; MSTa subfamily; MstII; KW MER10; MSTAR. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1651 RA Smit F.A.; RT "Structure and evolution of mammalian interspersed repeats."; RL PhD dissertation, Univ Southern California, 1995. XX DR [1] (Consensus) XX CC ORF: bases 48 to 1469. 
XX SQ Sequence 1651 BP; 436 A; 353 C; 489 G; 371 T; 2 other; gaawattggt actgaggagt ggagcattgc tataaagata cctgaaaatg tggaagcgac 60 tttggaactg ggtaacaggc agaggttgga agagtttgga gggctcagaa gaagacagga 120 agatgaggga aagtttggaa cttcttagag acttgttaaa tggttgtgac caaaatgctg 180 atagtgatat ggacagtaag ggccaggctg acgaggtctc agatggaaat gaggaactta 240 ttgggaactg gagcaaaggt cactcttgtt atacattagc aaagagcttg gctgcatttt 300 gcccctgccc tagagatttg tggaagtttg aacttgagag tgatgatcta gggtatctgg 360 cggaagaaat ttctaagcag caaagcgttc aagatgtgac ctggctgctt ttaacagctt 420 acagtcatat gcgagagcaa agaaatcact taaagttgga atttatattt aaaagggaag 480 cagagcgtaa aagtttggaa aatttgcagc ctggccatgt gatagaaaag aaaaacccgt 540 tttctggaga gaaattcaag caggctgcgg agcgaccgtt tgctaaagag attagcataa 600 ctaaaaggaa gccaagtgct gatagccaag acaatgggaa aaaggcctcg aaggcatttc 660 agaaatcttc gaggtggtcc ttcccatcac aggcccagag gcctaggagg actgaatggt 720 ttcgtgggcc aggcccaggg ccccgctgcc ctgtgcagcc tcgggacact gctccctgca 780 tcccggctgc tycggctcca gccgtggctc aaagggcccc aggtacagct cgagctgccg 840 cttcggagag tgcaagctat aagccttggt ggcttccaca tggtgttaag cctgcaggtg 900 cacagaatgc aagagtgaag gaggcttggc agcctccacc tagatttcag aggatgtatg 960 ggaaatcctg ggtgcccagg cagaagcctg ctgcagggac ggagccctca cagagaacct 1020 ctactagagc agtgccaaag ggaaatgtgg ggttggagcc cccacacaga gtccccaccg 1080 gggcactgcc tagtggagct gtgggaaggg ggccactgtc ctccagaccc cagaatggta 1140 gagccactgg cagcgtgcac cgccagcctg gaaaagccgc aggcatcaga ctccaacccg 1200 tgagagcagc cacgtgggct gtgcccagca aagccacagg ggcggagctg cccaaggcct 1260 tgggagccca cccctcgcac cagcgtgccc tggatgcgag acacggagtc aaaggagatt 1320 attttggagc tttaagattt aatgactgcc ctgctgggtt tcggacttgc gtggggcctg 1380 tagccccttt cttttggccc atttctccct tttggaatgg aaatatttac ccaatgcctg 1440 taccaccatt gtatcttgga agtaaataac ttctttttga ttttacaggc tcataggtgg 1500 aaggaacttg ccttgtctca gatgagactt tggactttgg acttttgagt taatgctgga 1560 atgagttaag actttggggg actgttggga aggcatgatt gtattttgca atgtgagaag 1620 gacgtgagat ttgggggaac caggggcaga a 
1651 // ID RNSAT1a repbase; DNA; ROD; 168 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from rat. XX KW Satellite; Simple Repeat; RNSAT1a. XX OS Rattus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-168 RA Smit A.F.; RT "RNSAT1a_ - Satellite from rat."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX SQ Sequence 168 BP; 48 A; 32 C; 30 G; 58 T; 0 other; cttttatgta cttggagata actatgagat gcaaaggctt taccacattg atcacataca 60 tagggcttct ctccagtatg tattctttta tgtacttgga gataactatg agatgcaaag 120 gctttaccac attgatcaca tacatagggc ttctctccag tatgtatt 168 // ID GOLEM_B repbase; DNA; ROD; 1205 BP. XX AC . XX DT 25-FEB-1998 (Rel. 3.01, Created) DT 31-OCT-2000 (Rel. 5.09, Last updated, Version 4) XX DE Nonautonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; 35S; GOLEM; KW GOLEM_B; MER17; MER29; MER7; MER7B; nonautonomous DNA transposon. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1205-944 RA Drinkwater D.R., Burgoyne A.L. and Skinner D.J.; RT "Two human repetitive DNA elements: A new interspersed repeat RT found in the factor IX gene, and a satellite 11 tandem repeat RT sequence."; RL Nucleic Acids Res 14, 9541-9541 (1986). XX RN [2] RP 1205-944 RA Kaplan J.D. and Duncan H.C.; RT "Novel short interspersed repeat in human DNA."; RL Nucleic Acids Res 18, 192-192 (1990). XX RN [3] RP 932-825 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [4] RP 546-326 RA Skalnik G.D., Strauss C.E. and Orkin H.S.; RT "CCAAT displacement protein as a repressor of the myelomonocytic- RT specific gp91-phox gene promoter."; RL J. 
Biol. Chem 266, 16736-16744 (1991).
XX
RN   [5]
RP   1004-824
RA   Rubin M.C., Leeflang P.E., Rinehart P.F. and Schmid W.C.;
RT   "Paucity of novel short interspersed repetitive elements (SINE)
RT   families in human DNA and isolation of a novel MER repeat.";
RL   Genomics 18, 322-328 (1993).
XX
RN   [6]
RP   79-546
RA   Jurka J., Kaplan J.D., Duncan H.C., Walichiewicz J.,
RA   Milosavljevic A., Murali G. and Solus F.J.;
RT   "Identification and characterization of new human medium
RT   reiteration frequency repeats.";
RL   Nucleic Acids Res 21, 1273-1279 (1993).
XX
RN   [7]
RP   1205-1
RA   Smit A.F. and Riggs D.A.;
RT   "Tiggers and other DNA transposon fossils in the human genome.";
RL   Proc. Natl. Acad. Sci. USA 93, 1443-1448 (1996).
XX
RN   [8]
RP   1-1205
RA   Kapitonov V.V. and Jurka J.;
RT   "GOLEM_B.";
RL   Direct Submission to Repbase Update (1998).
XX
CC   Replaces MER17, MER29, MER7B.
CC   Differs from GOLEM and GOLEM_A by internal deletions.
CC   23 bp terminal inverted repeats, TA target site [7].
CC   Orientation has been changed based on the reconstruction
CC   of GOLEM internal sequence [8].
XX SQ Sequence 1205 BP; 386 A; 208 C; 217 G; 367 T; 27 other; cagtcatgcg ctgcataacg acgtttcggt caacgatgga ccacatatac gacggtggtc 60 ccataagatt ataatggagc tgaaaaattc ctatcgccta gtgacgttgt agtgcaacat 120 atyactcakt actcacgcgt ttgtggcgat gctggtgtaa acaaacctac tgcgctgcca 180 gttgtataaa agtatagcac atacaattat gtacagtaca taatacttga tagtgataat 240 aaatgactat gttactggtt tatgtattta ctatactata ctttttatta ttattttaga 300 gtrtactcct tctacttatt aaaaaaaaaa gttaactgta aaacagnctc aggcaggtcc 360 ttcaggagat attccagaag aargcatcgt tatcatagga gatgacagct ccatgcatgt 420 tattgcccct gaagaccttc cagtgggaca aaatgtggag gcggaagaca gtgatattra 480 tgatcctgac cctgtgtagg cctaggctaa tgtgtgtgtt tgtgtcttag tttttaacaa 540 aaatntttta aaaaataaaa aaatwaaaaa tttwwaaata gaaaaaagct tatagaataa 600 ggatataaag aaagaaaata tttttgtaca gctgtacaat gtgtttgtgt tttaagctaa 660 gtgttattac aaaagagtca aaagttaaaa atgcaaaagt ttataaagta aaaaagttac 720 agtaagctat ggttaattta ttgctgaaaa arwwwwwwww wwwwwataaa wttagtatag 780 cctaagtgta cagtgtttat aaagtctaca gtagtgtaca gtaatgtcct aggccttcac 840 attcactcac cactcactca ctgactcacc cagagcaact tccagtcctg caagctccat 900 tcatggtaag tgcyctatac aggtgtacca ttttttatct tttataccgt atttttactg 960 taccttttct atgtttagat acacaaatac ttaccattgt gttacaattg cctacagtat 1020 tcagtacagt aacatgctgt acaggtttgt agcctaggag caataggcta taccayatag 1080 cctaggtgtg tagtaggcta taccatctag gtttgtgtaa gtacactcta tgatgttcgc 1140 acaacgaaat tgcctaatga cgcatttctc agaacgtatc cccgtcgtta agcgacgcat 1200 gactg 1205 // ID GOLEM_B repbase; DNA; ROD; 1205 BP. XX AC . XX DT 25-FEB-1998 (Rel. 6.4, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE Nonautonomous DNA transposon. XX KW MER7; 35S; MER17; MER29; MER7B; GOLEM; KW Nonautonomous DNA transposo; GOLEM_B. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1205-944 RA Drinkwater D.R., Burgoyne A.L. 
and Skinner D.J.;
RT   "Two human repetitive DNA elements: A new interspersed repeat
RT   found in the factor IX gene, and a satellite 11 tandem repeat
RT   sequence.";
RL   Nucleic Acids Res 14, 9541-9541 (1986).
XX
RN   [2]
RP   1205-944
RA   Kaplan J.D. and Duncan H.C.;
RT   "Novel short interspersed repeat in human DNA.";
RL   Nucleic Acids Res 18, 192-192 (1990).
XX
RN   [3]
RP   932-825
RA   Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.;
RT   "Medium reiteration frequency repetitive sequences in the human
RT   genome.";
RL   Nucleic Acids Res 17, 4731-4738 (1991).
XX
RN   [4]
RP   546-326
RA   Skalnik G.D., Strauss C.E. and Orkin H.S.;
RT   "CCAAT displacement protein as a repressor of the myelomonocytic-
RT   specific gp91-phox gene promoter.";
RL   J. Biol. Chem 266, 16736-16744 (1991).
XX
RN   [5]
RP   1004-824
RA   Rubin M.C., Leeflang P.E., Rinehart P.F. and Schmid W.C.;
RT   "Paucity of novel short interspersed repetitive elements (SINE)
RT   families in human DNA and isolation of a novel MER repeat.";
RL   Genomics 18, 322-328 (1993).
XX
RN   [6]
RP   79-546
RA   Jurka J., Kaplan J.D., Duncan H.C., Walichiewicz J.,
RA   Milosavljevic A., Murali G. and Solus F.J.;
RT   "Identification and characterization of new human medium
RT   reiteration frequency repeats.";
RL   Nucleic Acids Res 21, 1273-1279 (1993).
XX
RN   [7]
RP   1205-1
RA   Smit F.A. and Riggs D.A.;
RT   "Tiggers and other DNA transposon fossils in the human genome.";
RL   Proc. Natl. Acad. Sci. USA 93, 1443-1448 (1996).
XX
RN   [8]
RP   1-1205
RA   Kapitonov V.V. and Jurka J.;
RT   "Jerky gene - a recruited transposon?.";
RL   Direct Submission to Repbase Update.
XX
CC   Replaces MER17, MER29, MER7B.
CC   Differs from GOLEM and GOLEM_A by internal deletions.
CC   23 bp terminal inverted repeats, TA target site [7].
CC   Orientation has been changed based on the reconstruction
CC   of GOLEM internal sequence [8].
XX SQ Sequence 1205 BP; 386 A; 208 C; 217 G; 367 T; 27 other; cagtcatgcg ctgcataacg acgtttcggt caacgatgga ccacatatac gacggtggtc 60 ccataagatt ataatggagc tgaaaaattc ctatcgccta gtgacgttgt agtgcaacat 120 atyactcakt actcacgcgt ttgtggcgat gctggtgtaa acaaacctac tgcgctgcca 180 gttgtataaa agtatagcac atacaattat gtacagtaca taatacttga tagtgataat 240 aaatgactat gttactggtt tatgtattta ctatactata ctttttatta ttattttaga 300 gtrtactcct tctacttatt aaaaaaaaaa gttaactgta aaacagnctc aggcaggtcc 360 ttcaggagat attccagaag aargcatcgt tatcatagga gatgacagct ccatgcatgt 420 tattgcccct gaagaccttc cagtgggaca aaatgtggag gcggaagaca gtgatattra 480 tgatcctgac cctgtgtagg cctaggctaa tgtgtgtgtt tgtgtcttag tttttaacaa 540 aaatntttta aaaaataaaa aaatwaaaaa tttwwaaata gaaaaaagct tatagaataa 600 ggatataaag aaagaaaata tttttgtaca gctgtacaat gtgtttgtgt tttaagctaa 660 gtgttattac aaaagagtca aaagttaaaa atgcaaaagt ttataaagta aaaaagttac 720 agtaagctat ggttaattta ttgctgaaaa arwwwwwwww wwwwwataaa wttagtatag 780 cctaagtgta cagtgtttat aaagtctaca gtagtgtaca gtaatgtcct aggccttcac 840 attcactcac cactcactca ctgactcacc cagagcaact tccagtcctg caagctccat 900 tcatggtaag tgcyctatac aggtgtacca ttttttatct tttataccgt atttttactg 960 taccttttct atgtttagat acacaaatac ttaccattgt gttacaattg cctacagtat 1020 tcagtacagt aacatgctgt acaggtttgt agcctaggag caataggcta taccayatag 1080 cctaggtgtg tagtaggcta taccatctag gtttgtgtaa gtacactcta tgatgttcgc 1140 acaacgaaat tgcctaatga cgcatttctc agaacgtatc cccgtcgtta agcgacgcat 1200 gactg 1205 // ID HERVK22I repbase; DNA; ROD; 6823 BP. XX AC . XX DT 30-APR-1998 (Rel. 3.03, Created) DT 18-DEC-2003 (Rel. 8.11, Last updated, Version 3) XX DE HERVK-related endogenous retrovirus flanked by LTR22s - a DE consensus sequence (internal portion, without LTRs). XX KW ERV2; Endogenous Retrovirus; Transposable Element; HERVK22I; KW LTR22; HERVK superfamily. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RA Kapitonov V.V. 
and Jurka J.;
RL   Direct Submission to Repbase Update (31-MAR-1998).
XX
RN   [2]
RP   1-6823
RA   Lavie L., Medstrand P., Schempp W., Meese E. and Mayer J.;
RT   "Characterization of the human endogenous retrovirus family
RT   HERV-K(HML-5).";
RL   Direct Submission to Repbase Update (07-DEC-2003)(in
RL   preparation).
XX
DR   [2] (Consensus)
XX
CC   Average similarity of HERVK22I individual copies to the consensus
CC   sequence is about 91%.
CC   6 bp target site duplications. HERVK22I is flanked by LTR22s.
CC   Similarity of HERVK22I consensus sequence to known retroviruses
CC   is shown below:
CC   ----------------------------------------------------------------
CC   sequence   begin   end    sequence   begin   end    similarity
CC   ----------------------------------------------------------------
CC   HERVK22I     287    445   HERVK        281    439   0.64
CC   HERVK22I    1176   1429   HERVK9I     1062   1322   0.63
CC   HERVK22I    1441   1869   HERVK       1863   2336   0.65
CC   HERVK22I    2346   3009   HERVK9I     2380   3048   0.64
CC   HERVK22I    3046   3353   HERVK       3659   3966   0.65
CC   HERVK22I    3706   3805   HERVKC4     2027   2126   0.82
CC   HERVK22I    3873   4359   HERVKC4     2127   2606   0.86
CC   HERVK22I    4385   4821   HERVK       4975   5427   0.66
CC   HERVK22I    4910   4979   HERVKC4     3080   3148   0.74
CC   HERVK22I    5028   5065   HERVKC4     3149   3186   0.64
CC   HERVK22I    5114   5334   HERVKC4     3187   3408   0.79
CC   HERVK22I    6654   6785   HERVKC4     3401   3531   0.80
CC   ----------------------------------------------------------------
CC   PBS for methionine tRNA: nt 3-20
CC   Putative gag gene: nt 138-1692
CC   Putative protease gene: nt 1533-2327
CC   Putative polymerase gene: nt 2270-5002
CC   Putative envelope gene: nt 4779-end (+37 nt for LTR22 and LTR22A,
CC   +73 nt for LTR22B)
CC   PPT: nt 6810-6823
CC   frequently deleted region in gag-prt: nt 768-2303
CC   frequently deleted region in env: nt 5314-6641.
XX SQ Sequence 6823 BP; 2051 A; 1655 C; 1397 G; 1720 T; 0 other; ggtggtgccc cgtgtgagga acgctgcaac ggatcgcgac ggaaccctca aaaatgaagg 60 tgaagagact gcgcagtcag taagtcattg gtgcccgctc gggatttcca agttcgaggg 120 aattgttcag gctagggttt catcatggga caacagttat cagctcaaca gaaacagtat 180 ataaaagtat tgaaacagct gcttaaagct agtggagcct cggtttcaca ggctcaatta 240 agggacctaa tgcaaactgt tgtttcccat aacccatggt tcccagaaga aggcacgcta 300 gacgtagagc tctgggaaca agtggggaga aatcttaaac aacatcatgc acaagggcaa 360 cgggtcccag taacatcttt aacgttatgg gccttagtta gggctgcttt ggtcccactc 420 tacacagaag agcctaaaaa gggaagggag gaggaaccat cacctacctt accgcctcct 480 cctccttcag ccccgccatt accgggcaaa aataccaaag aggaaacaga ggttttgcct 540 gagccccctc ctccaataaa ttggaaaaaa gacaagggat acactacagc tatgggaccc 600 tgtcttaggc aagcggcatt agaaggggag ctcttagcct gcccggtaat gcaagatcaa 660 caaggcaatc aggtacatga acccatttct tttaacgctt ataaagagat aagaaaaagc 720 attagagaaa atggagccgc tagcccattt acgaaaggat taattgaggc catagcagac 780 aacttccata tgaccccatg ggactggtca gtgctagcta aaacaacttt agagcccagt 840 caatacctcc tctggagggc agaatatgat gagttgtgtg aacaacaagc caaccagaat 900 caagtggccg ggcaagacat aacagctgct atgctccagg ggaggggtcc ccatgccaat 960 gtacaacaac aactaaattt tgatccccag gcctatgcac aagtgtcttt gtgtgctctc 1020 agggcttggg accgaattcc cgaaagcgga gttcaacagg gatcttttat aaatgttcga 1080 caagggcctc aggagccatt tgttgaattt atcaatcggt taacccaggc aattaagaga 1140 caaattagtc atgcccaggc cgctgatatc ttattgttgc aattggctta tgaaaatgct 1200 aatgtggact gccagcaagc aatgcaggca atcagaggaa aggcagccac agtcggggaa 1260 cttatacgag catgtcaact ggtggggact gaaacacaca aagccaaaat attggctatg 1320 gcattaaggc ctcctaaagt gaaaagggag agaaacccaa attgttttct atgtggagag 1380 ccaggtcata tgaagaggga atgccccaat agtagagacc aaggtaactc aggaaaagaa 1440 cccccttcta tatgccccca atgtaaaaag gggaaacatt gggcaaatca atgcaggtcc 1500 aaatttgata aaaacggcaa ccccataagt aatcaggtgg gaaacttcat gaggggccgg 1560 ccccaggccc cgctccaaac tggggcaatg ccagcggctt tcctcggtca gatggaaagc 1620 ccacagtcct ctctctcaga gcagccacca 
ctgggagcgc aggactggac ttactctgcc 1680 ccaacgaatt agtgctaaaa gaaggagaag accctaaaag ggttgcaact gggatctggg 1740 gcccactgcc tctgggaaca gtgggattag tcctagggcg atcaagccta tccagtaaag 1800 gaattaatgt gctcactggg gtaattgata gtgattatca aggtgagata ttagttatga 1860 tggaatgtaa aggtctgcat attcttcccc ctggatcaaa gatagctcag ttactacttt 1920 taccatactg ggtccccaat gcccagggaa aggaaagggg aaagggaagt tttggaagca 1980 caggagccac aggagtatat tggaatcaat taatcactga tcagagaccc atgattacct 2040 taaaaattgg aaataaaaat tttactggct tattggacac aggggtggac atttcaatca 2100 ttagtgatca aaactggcca gaaacttggc cttgggtcac tcagaaacaa aaaattgtca 2160 gcatcgggga agcacacaca gccaagcaga gcacacaccc cctaacatgt tgtgattcag 2220 agggaagaaa ggcagttata caacctctaa tcatgcccat ccctgttaat ctttggggac 2280 gggacctatt agcccaatgg ggggtcactc tgcagacccc tttctaataa tggccactgt 2340 tattattcct cccctacccc tgacgtggct ctctcaagat ccaatttggg tagaacagtg 2400 gcctttaaag ggagagaaat tacaaagagc ccatgaatta gttgaggagc aattaaaagc 2460 cggccatata gaaccatcaa acagcccttg gaattcgccc attttcgtca ttcccaaaaa 2520 gtctggtaaa tggagacttt tgcatgactt acgtgctatc aatgctaatt tgcaacctat 2580 ggggcccctt caacaggggc tcccttcccc cgcggcgatt cctcaagatt ggcctatagt 2640 cattattgac ttaaaagact gcttttatac tattcccctt gcagaacagg acagagaaaa 2700 atttgcattt acaataccag ctatcaataa tgaaaggcca gcttgccgat ttcattggaa 2760 agtgcttcct caaggaatgc taaacagtcc taccatgtgt cagtatcatg taaatcaggc 2820 tttgctcccc agtagaaaag aatttcctaa ttgcaagatt attcatttta tggatgatat 2880 tttactagca gccccaacgg agccagtact tttaagttta tatgcctctg tcataaagaa 2940 tacacagtta agaggtttaa tcatagcacc tgaaaaagta caaatgtcct ctccttggaa 3000 atatcttgga tacatactaa cttcccggtc agtaagacct caaaaggtta aattaaatac 3060 tagcaactta cacaccttaa atgattatca aaaattacta ggtgatatta actggcttcg 3120 ccccaccttg ggcataacta ctgataagtt acaaaacctg ttttctatcc taaagggcaa 3180 tacagcccta gactctccca ggtatttaac tcctgcagca aaaagggaaa ttgaggaaat 3240 agagcaagct atttctcaga ggcaactaga tcgcatagac ccacgatatt cagttcaatt 3300 gtttgttttt cctactaaac attccccaac aggattaata 
ggacagatgg ccccagggct 3360 acgcttccta gaatgggttt tttgctcaca taccgggact aaaacactat ctccctatat 3420 ccagctagtt agtaaagtca tctattcagg ccgcagacga tgcaatcagt tgctaggtta 3480 tgaccctgat gtcatcagaa ttcctttaag taaaaagcaa ttcgaagcag tattgccctt 3540 atctctagac ctgcaaatag cactctctga ttacacaggc catatagagc atgcccttcc 3600 tgctgacaaa ctacttcagt tcttatctca tactcctgta gttttgccta caaaagtagt 3660 tcactccccc atacctaacg ctttaacact ttttactgat ggctctggta aaaatggaaa 3720 agcggctatc tggtggagac cacataattc cctcactcgt tctggattta ctagcactca 3780 gagagctgag gttggagcct taatattggc cctggaaact ttttccactc agcccatcaa 3840 tattgttagt gactctgctt actctgttta tttattgcag aaccttgaaa cagccctcat 3900 taagtccact ctggagccca ccctgtgtgc actttttctt cgacttcagc aattgctaga 3960 tcaatgtaca catcctattt ttatcacaca tattcgagcc cacagctcac tgcctggccc 4020 actggcttat ggcaatgatc aagcagacct gcaggttatg acatcactgc ttgaccaagc 4080 cacccaatca catcaatttt tccaccaaaa ttggagaaac ttatctaaac aatttcaact 4140 tacccaaaga ctagctaaac aaattatcct gcaatgccca gattgccagc tcacaggcac 4200 gtcccctcct tcaacaggtg ttaaccctag aggactagaa cctaatcagt tatggcaaac 4260 agatgttaca cacatccctg aatttggaaa actaagatat gtacatgtat ccattgatac 4320 caattctcac ttaattagcg ctcatgctct tcctggagag tccacccgat atgtcattaa 4380 acatcttctt ttaacttttg catttatggg gcggcccaca aaaattaaaa ctgataatgg 4440 tctggcttat gccagctcac aatttcaaca attttgtcac acgtggaaca tccaacattc 4500 cacaggcatc ccgtataacc cccaaggaca ggccatagta gaacgtgccc actccaccct 4560 taaaaatatg ctcagaaaac aaaaaagggg gaatatgagt aaggaccctg caacactact 4620 ggcacaagcc ttatttaccc ttaatttttt aaatttagat gataaatttc aatcagctat 4680 agaaaagcac tttgctaaaa cctctcaaga cataaaacct gcagttttat ggaaagatgt 4740 aaatagtaat gtatggtgtg gtccaaatga attgctaaca tggggaagag gatatgcttg 4800 tgttcacacc ccctcaggtc ctctttggat tccagcacga tgcatcaaac cataccatgg 4860 cgtggctagg acccaacccg gtaccagaaa tgaaggaaat gaccctgcag gacccacagc 4920 cccggacgat gcggcttcct cggatgacac aagccccgga cattacctgg gggatgctga 4980 agaagacaac tcaggaggct gagcgaatcc tgctccggac acagacacca 
ttcactccag 5040 ataatttgtt ccttgctatg ctttctgttg tacattgcaa ctctcatagg gtattgatcc 5100 tttttattct ctcactttgc ctgcaacctg tacctgctac actctattgg gctcatctct 5160 tagatccgcc tttcttccgc cctgttacct gggcagacac ccccttccca gcctctaata 5220 acgtaactgc ttggctagga gggattgact tacccccagt ggggtccctc attaatggca 5280 cacattggac taaggtgcca ggtaacacta catatcactc cactatcctc ccactgtgtg 5340 taagttataa aagttctaac ccttactgtg tacctgccca aacacaatta tggctacatc 5400 atggcaaagg aaatgcctta acagtcttag ttgcaggtag cctcaaaccg ggcaatgcaa 5460 tcaatgccac tttcccaaac attccttcct gtgctaaaga acaaagccag gaaagtaatg 5520 gattccactt tagctgggag gtctgtcatg ggggacaagc ccgtagcctc cagttaggca 5580 attataacat cttagactgg agcccccaca gccatttgca gggcaaccat actgatgtcc 5640 gcatctatca tggcatcaat cacagtttca tagccacgtc ccattcccct ataatttggg 5700 ccgatggggg gatgggatat cccagacccc aagtagagtc catgccaccc caagacactt 5760 tatggtgcct gggacatctt agcacctccc ttaacacctg gcatgggaca tatcataatt 5820 ccagtcacaa ttatactatg acctttattc ataatcacac tgatcagtgc ctgatttgca 5880 ctacccatcc atatgttttc cttatgggaa ccaatatttc cattacaccc caaaactcca 5940 cgtttgtgac ccgagtgcag ggacaggctt ggtttgcctc atgtatcact aattacaata 6000 tatctaattt aaatattact agtgtcatgg tattaaggag acaatctgag gcattcctac 6060 cagtcaattt gacacgcgat tggcaaggtt cctctgccct tgccacctta gaacgtgccc 6120 tgtcccaggt cagacacaaa agattcatag ttacacttat agcctttata gtctcagcca 6180 tagtcatcct agcaactgct agtgttgctg tagcatctat tactgaatca gtacaaacag 6240 ctacttttgt agataatttg gccagaaatg tgtctaatga acttctctta cagcagggta 6300 tagatcaaaa gattcttgca tgtctgcaag ccctcgaggc tgccttggaa tatgtagggg 6360 agcgacaaga tgcactggca ttccgacagc aattaaactg tgactgggag cataagcata 6420 tctgtgtcac ttctctacca tggaatcaat caatacatag ttgggatgag gtgaaacaac 6480 acctctgggg aacctttcat gacaatttaa cagcagacat aaagcaactt aaaactaaaa 6540 ttctagaatc cctaaacgcc atagatctac acgcccaaca aacagccata tggaagggtg 6600 tgcgagatca tctctcctgg atagaccccc actcctgggg gtcactcctt gattggaaaa 6660 gaatgttact aattatactc atgtttgtct tatgttattt actaattcta ggatgcaaag 
6720 ccggaatacg agcagtaacc gctacgcctg acaaacctgt tgctgcacac atctgtactc 6780 ttcaatcaac aaaacctgat gcaaaaaaca gaaaaggggg aga 6823 // ID GOLEM repbase; DNA; ROD; 2986 BP. XX AC . XX DT 13-JAN-1998 (Rel. 6.4, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 4) XX DE Autonomous DNA transposon; POGO superfamily. XX KW TIRs; DNA transposon; MER7; 35S; MER17; MER29; MER7B; TA target; KW GOLEM. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 2986-2724 RA Drinkwater D.R., Burgoyne A.L. and Skinner D.J.; RT "Two human repetitive DNA elements: A new interspersed repeat RT found in the factor IX gene, and a satellite 11 tandem repeat RT sequence."; RL Nucleic Acids Res 14, 9541-9541 (1986). XX RN [2] RP 2986-2724 RA Kaplan J.D. and Duncan H.C.; RT "Novel short interspersed repeat in human DNA."; RL Nucleic Acids Res 18, 192-192 (1990). XX RN [3] RP 2713-2608 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [4] RP 2327-2128 RA Skalnik G.D., Strauss C.E. and Orkin H.S.; RT "CCAAT displacement protein as a repressor of the myelomonocytic- RT specific gp91-phox gene promoter."; RL J. Biol. Chem 266, 16736-16744 (1991). XX RN [5] RP 2787-2607 RA Rubin M.C., Leeflang P.E., Rinehart P.F. and Schmid W.C.; RT "Paucity of novel short interspersed repetitive elements (SINE) RT families in human DNA and isolation of a novel MER repeat."; RL Genomics 18, 322-328 (1993). XX RN [6] RP 2327-2128 RA Jurka J., Kaplan J.D., Duncan H.C., Walichiewicz J., RA Milosavljevic A., Murali G. and Solus F.J.; RT "Identification and characterization of new human medium RT reiteration frequency repeats."; RL Nucleic Acids Res 21, 1273-1279 (1993). XX RN [7] RP 2986-2128 RA Smit F.A. and Riggs D.A.; RT "Tiggers and other DNA transposon fossils in the human genome."; RL Proc. Natl. Acad. Sci. 
USA 93, 1443-1448 (1996). XX RN [8] RP 1-2986 RA Kapitonov V.V. and Jurka J.; RT "Jerky gene - a recruited transposon?."; RL Direct Submission to Repbase Update.. XX DR [8] (Consensus) XX CC 23 bp terminal inverted repeats and TA target site [7]. CC GOLEM's non-autonomous elements are GOLEM_A (MER7A) and GOLEM_B CC (MER7B) repeats. CC Orientation of the repeat has been determined based on the CC reconstruction of its internal sequence encoding transposase [8]. XX SQ Sequence 2986 BP; 946 A; 566 C; 643 G; 805 T; 26 other; cagtcatgcg ctgcataacg acgtttcggt caacgatgga ccacatatac gacggtggtc 60 ccataagatt ataatggagc tgaaaaattc ctatcgccta gtgacgttgt agtgcaacat 120 atyactcakt actcacgcgt ttgtggcgat gctggtgtaa acaaacctac tgcgctgcca 180 gttgtataaa agtatagcac atacaattat gtacagtaca taatacttga tagtgataat 240 aaatgactat gttactggtt tatgtattta ctatactata ctttttatta ttattttaga 300 gtrtactcct tctacttatt aaaaaaaaaa gttaactgta aaacagtatg ccgtgttaga 360 ccggcagcag ccacatgcat ctcgtgttta ccgcgtctct tgattgcatc attttctctt 420 gtgcttgatt taatctcatg tgttttgttc atcacggctc ctaagcgttc aaaatccacg 480 gctaatgttg ccagtaagag gccacgtcga gtaattgagc tggaaacaaa attaaaagtg 540 attaaggacc gccgaaggtg gaaaatcagt gatggttatt gctcgccaga caggtatgtc 600 ccattcgacc atagctgcca ccttgaagaa caagagcaaa gtgacagaaa ctgttaaagg 660 gtctgcttca tcgaaggcga cgagaccaac aaatttgaga agtgccaaca tcagatatgg 720 agaatatctg ctaatgacct ggattgaaaa ccagacacag aaacatagct ctctcagcac 780 cacgacgacc acggccagag caaaagggat atttttgatg ttgaaagaaa aggctggacc 840 cgactgtgat atcgaattta ctgctagttc cgggtggttt aaatgattcg agaatcgtta 900 tccattacat aatgttaacg tgactgctga gtctgtgagg gctgacgtga aggcagctga 960 agagtttttg gaaactctag atgagtcgtg gagaagaaat acttgccaga gcaaatcttc 1020 aatatggagg aaacctccct attctggaaa cagatgcctg aaaggacttt ctgtcataag 1080 ggggccaagt caatgccagg tttcaagggt tttaaggaca ggaaacggtc ttgctttggg 1140 gcaatgttgc aggctacaaa ttgaagccct ttgtgatccg acacagtgaa aaccccaggg 1200 ccttcaagca tatcagtaag cacacgctgc cagtgtacta caggagcaat aaaaagtcat 
1260 ggatgatcca tctcctcttc caagatgccc tcctgaattg ctatgccaga gaaatggaga 1320 agtactgttt ggggaataac atacctttca agattttgct tattgttgat aatgctcctg 1380 cacatcttcc ttttactggt gatcttcatc ccaatttcaa agtggtgttt ctctctccaa 1440 acaccacctc tttgatccaa tggatcaagg aattatagca gctttttaaa ggcctactac 1500 cagagaaggg ccttcgccca ggttatcact gtaactgagg aagacactga tgcaattctg 1560 gaaggattac aacagccaag actgcatcaa gaaccttgct tgggattggg gtgatgtcac 1620 caaggagtgt atgaatggca tctggaagaa gacactcaag aggtttgtcc atgacttcaa 1680 aggatttgcc aaggatgagg aggttgcaaa aatcaacaag gctgtggttg agatggcaaa 1740 caactttaac ctgggtgtgg atgaggatga cattgaggag ctcctagagg tggttcctga 1800 ggaattgact aatgaggagt tgttggaact ggaacaggaa tgcatagctg aagaagaggc 1860 aagagaaaag gaaactgcag gagaagaaga agaaccccca agaaaattca cagtaaaggg 1920 tttagcagaa gcttttgcag acctcaacaa gctccttaaa aagtttgaaa acatggaccc 1980 caacaccgaa aggttttcat taatagagag gaatgttcat ggtgcattat ctgcttacaa 2040 gcaaatctat gatgacaaaa agaaacaaac caagcaaacc atcatggaca tatatttctg 2100 aaaagagtga cacctcctca agaagagcct caggcaggtc cttcaggaga tattccagaa 2160 gaargcatcg ttatcatagg agatgacagc tccatgcatg ttattgcccc tgaagacctt 2220 ccagtgggac aaaatgtgga ggcggaagac agtgatattr atgatcctga ccctgtgtag 2280 gcctaggcta atgtgtgtgt ttgtgtctta gtttttaaca aaaatntttt aaaaaataaa 2340 aaaatwaaaa atttwwaaat agaaaaaagc ttatagaata aggatataaa gaaagaaaat 2400 atttttgtac agctgtacaa tgtgtttgtg ttttaagcta agtgttatta caaaagagtc 2460 aaaagttaaa aatgcaaaag tttataaagt aaaaaagtta cagtaagcta tggttaattt 2520 attgctgaaa aarwwwwwww wwwwwwataa awttagtata gcctaagtgt acagtgttta 2580 taaagtctac agtagtgtac agtaatgtcc taggccttca cattcactca ccactcactc 2640 actgactcac ccagagcaac ttccagtcct gcaagctcca ttcatggtaa gtgcyctata 2700 caggtgtacc attttttatc ttttataccg tatttttact gtaccttttc tatgtttaga 2760 tacacaaata cttaccattg tgttacaatt gcctacagta ttcagtacag taacatgctg 2820 tacaggtttg tagcctagga gcaataggct ataccayata gcctaggtgt gtagtaggct 2880 ataccatcta ggtttgtgta agtacactct atgatgttcg cacaacgaaa ttgcctaatg 2940 
acgcatttct cagaacgtat ccccgtcgtt aagcgacgca tgactg 2986 // ID MLT1R repbase; DNA; ROD; 1338 BP. XX AC . XX DT 01-MAY-1996 (Rel. 1.04, Created) DT 07-OCT-1998 (Rel. 3.09, Last updated, Version 4) XX DE MLT1- LTR retrotransposon internal sequence - a consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; MaLR family; KW MLT1a subfamily; Non-LTR retrotransposon; MLR; MLT1R. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RA Smit A.F.; RT "Structure and evolution of mammalian interspersed repeats."; RL PhD dissertation, Univ Southern California, 1995. XX DR [1] (Consensus) XX SQ Sequence 1338 BP; 392 A; 211 C; 358 G; 335 T; 42 other; gattttggca ctaggaagtg gggtgcttac gtaacaaata cctaaaaatg tggaagcgac 60 tttggaactg ggtaataagt agaggctgga agagttttga ggtacatgyt agaaaaagcc 120 tagatttcct tgaagagact gttggtagaa atatggatat taaaggyaat tctggtgggg 180 ggctcagaaa ggaagwggag agctatagag aaagctcccg tcgtcttaga gaatacaaaa 240 atcgtcatga acagaatgtt gctagaaata tgaatgttaa aggtgcttct ggtgaggtct 300 cagacggaaa tgagggagat gttattgaaa attggaggaa aggtgatcct tgttataaag 360 tagcaaagaa cttggctgaa ttgtgttcat gtcctagggc tttatggaaa gtagaacttg 420 yaagtgatga actkggatat tcagctgagg aaatttctaa gcaaagtgtt gagggcgcaa 480 cctgacttct cctaactgct tatagtaaaa tgtgagagac ataaagatgg aantgttgag 540 taaaaaggaa ccagaacgta aagatttgga aaattctcag cctgatcatg taacagaaan 600 nncaatagcg ttctctggaa agaanaccaa ggatgtnncc gngcaaccgt ttgctaaaga 660 gattagratc gtgactcgtg ggtccaatca accatctcag caraarccar gaatagagat 720 grggttattt aggaaatatc tgtggaggac tctcttatct gatggctcga acccctatga 780 cttncatagg agaccgacaa ggnttttgag aatnttacac cagcagaaac actgccaact 840 tggactraag ggracagaga cgagaaaaaa tgaaggaagr gtggcagacc gaaggtgggg 900 ctgtctcgtt tcagagcatg gkgtcacccc agcgggcccg gaagatgaat ctcaggactc 960 agaggcatta gcctccagcc ttaagatcta atgaagtttg ccctgctggg ttttggactt 1020 gcttgggacc ngtgactcct ttcttyyytt cyatttctcc cttttggaat gggaatatct 1080 atcctatgcc tgycccaccg 
ttgtattttg gaagcagata acttgtttna ttrcacaggt 1140 ccacagatgg agaggaattt tgccyctgaa tgaatctyac cctgagtytc tnccayacyt 1200 gatgyrgrtg atatttcgct gagattgtgg actaagagtt ggtgctggaa ggggttagac 1260 attggggaga tgttgctatg ggatgcaggg attttgcatg tgagaaggac atgattatgg 1320 ggggagcgga gggcaaac 1338 // ID BGLII_B_LTR repbase; DNA; ROD; 441 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE Long terminal repeat of ERV2 Endogenous Retrovirus from Muridae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; RLTR16; BGLII_B_LTR. XX OS Muridae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea. XX RN [1] RP 1-441 RA Smit A.F.; RT "BGLII_B_LTR - ERV2 Endogenous Retrovirus from Muridae."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC 14-15% subst. XX SQ Sequence 441 BP; 117 A; 65 C; 154 G; 105 T; 0 other; tgttgtggat tgccctggtg ctgtttgtat tttgatgtta attctgcttc ctcaagaggg 60 gctgccccga cgaggagtgg atcacgtact caggtgattt catgtgaacc ttctccccat 120 tttaactggt caaataaagg ctagagcctg tgattgggca gtggaaggga aaggtggggc 180 tggaggtttt agagagggag agaggaagaa ggagagagaa gagaagaggg agggagagga 240 gacggaggaa gaggaggtgg aaggaagatg gagcagaacc acgtggcctg gagaagccgc 300 aagtagcaag ggatctcata gctggggaat agagtagtgt agtggtagat ctgcccaatc 360 taggcgtgca gcttataaat attataactg agttgtgtgt tctttgcacg ggcttattgg 420 ggttggagat ttaccgcaac a 441 // ID LTR2B_Cpo repbase; DNA; ROD; 348 BP. XX AC . XX DT 21-OCT-2009 (Rel. 14.11, Created) DT 21-OCT-2009 (Rel. 14.11, Last updated, Version 3) XX DE Long terminal repeat: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR2B_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. 
XX RN [1] RP 1-348 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from guinea pig."; RL Repbase Reports 9(11), 2872-2872 (2009). XX DR [1] (Consensus) XX CC ~94% identical to consensus. 6 bp TSD. XX SQ Sequence 348 BP; 94 A; 88 C; 89 G; 77 T; 0 other; tgtagggagc cgttgaagct ggagattgat cacatgtgtg gggctaaagc aaggccagcc 60 cagaaaaaac agagagacca aaacaagcta tgtaaacaag ttgttccagc accaagaagg 120 caaggggccc aggcctgcca agcaatgggt tgctaggcag aggggctgaa attgacatca 180 tctttcccag tatataaata aaggtgctac agtggttgag cagttccttt cctcccatca 240 ggaggataag ggctccacct gaccccagct ttgtcttttc tttatttttc tcatcctctc 300 gccggtcctt aaccccaaga acctcaggag ccgcactggg tcgcggca 348 // ID MLT1J repbase; DNA; ROD; 516 BP. XX AC . XX DT 14-MAY-1998 (Rel. 6.5, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE Transposon-like element long terminal repeat (MLT1j subfamily) - DE a consensus. XX KW MaLR family; Long terminal repeat of retrovirus-like element; KW MLT1J. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-516 RA Jurka J.; RT "MLT1J."; RL Direct Submission to Repbase Update (05-MAY-1998). XX DR [1] (Consensus) XX CC 3'-similar to MLT1I and possibly to MLT1H around positions CC 89-201. XX SQ Sequence 516 BP; 130 A; 128 C; 116 G; 139 T; 3 other; tgtggcagag actgctagtt gtcccccaat atccattctc cccttcttcc ttagtaatag 60 aamccccaat ttttagctgg gcacatggcc acccagaaat aaagactaca tttcccagcc 120 tcccttgcag ctaggtgtgg ccatgtgact ttctaagttc tggccaatga gatggtaagc 180 agaagtgatg tgtgcaactt ctaggaaatg tccttaaaga gaggggcayg cccttctttt 240 ccccttcctc cttcctgctg cctggaatgt agatgtgatg gctggagctc tagcagccat 300 cttggaccat gaggtgaaag ccacatgcta aggatggcag agcagcaaga tagaaggagc 360 ctgggtccct gagactatgg agcagagctg ccataccagc cctggactgc ctacctctag 420 acttctatat gagagagaaa taaattctat cttgtttaag ccactgttat tttgggtttt 480 ctgttacttg cagctaaacc taatcctaac ayacca 516 // ID LINE2 repbase; DNA; ROD; 3314 BP. XX AC . XX DT 18-APR-1997 (Rel. 
6, Created) DT 24-JUL-2000 (Rel. 7.3, Last updated, Version 4) XX DE MIR2/LINE2 non-LTR retrotransposon - a consensus. XX KW Non-LTR retrotransposon; LINE; L2 family; MIR2; MIR2/LINE2; KW LINE2. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-3314 RA Degen J.S. and Davie W.E.; RT "Nucleotide sequence of the gene for human prothrombin."; RL Biochemistry 26, 6165-6177 (1987). XX RN [2] RP 1-3314 RA Smit F.A. and Riggs D.A.; RT "MIRs are classic tRNA-derived SINEs that amplified before the RT mammalian radiation."; RL Nucl. Acids. Res 23, 98-102 (1995). XX RN [3] RP 1-3314 RA Smit F.A.; RT "LINE2."; RL Direct Submission to Repbase Update (1996). XX RN [4] RP 1-3314 RA Smit F.A.; RT "LINE2."; RL Direct Submission to Repbase Update (1999). XX DR [4] (Consensus) XX CC 24 bp upstream of NcoI site; chromosome 11p11-q12. CC This is a consensus sequence for LINE2 subfamily A. The MIR SINE CC shares the 3' terminal 50 bp and was co-amplified with LINE2. The CC 5' CC end is probably still incomplete. The ORF from bp 189-2691 CC encodes a CC product 59% similar (39% identical) to the CC reverse-transcriptase-like CC protein of a LINE-like element in pufferfish (GenBank acc# CC AAD19348). CC Note that, whereas L1 is A- and purine rich in the coding strand, CC L2 CC is C- and pyrimidine (65%) rich. CC There may be more than 300,000 copies of LINE2 in our genome. CC LINE2 CC spread before the mammalian radiation, and copies are only 65-75% CC similar to this consensus sequence. 
XX SQ Sequence 3314 BP; 706 A; 1249 C; 420 G; 915 T; 24 other; ccctccgcaa gagtaaagag ggagccccaa aactcacccc tcccnnaccc ctcagcactc 60 acagcctctc caccactatc tgccttgaga ccaccgggac taatctcatc atgcccttta 120 tccctgttta tgtcagtcac cacaagaagc ccaagactcc caggtcatcc ctcacccccc 180 acgcttgaat ccccatnagg ctctgtacat ctccttttct gaccccatct cccatccctc 240 cccanctcct gaaacccttc cactgngccc tctggaactc acggtcmatc atcagcaaaa 300 tcccccgtat cctcaacctc ttctctgaac gttcccttca ccttcttgct ctaacngaaa 360 cctggctctc ccctgaggac actgcttccc ctgcagcctt ctcaagtggt ggccgttttc 420 tctcccacan ccctcgtacc actgggcctg gaggtggggt aggtgtcctc cttgctcctc 480 attgctgctt ccagaccatt ctccctccct cctccctaaa acaccccagc tttgaatctc 540 atgtcatcag actacatcac ccgctacccc tccttgttgc agtcatctac ngacctccgg 600 gtcactcccc ctcattcctt gaagatttta gctcctggct cactgtcact ctctccaaca 660 ctactcctgt cntaattctt ggtgatttca atatccacat agatgatcct tccaataccc 720 tggcctctca gttccttgac ctcctctcct ccaatgatct tgtcctccac cctacctcag 780 ccactcactc ccatggtcat accctagacc ttgtcattac caataactgc aacccctcca 840 taatctcaat ttcaagcatc ccactctctg accaccacct cctatctttc cagctcactc 900 cctctagtac cctaactcca acaattcttc gaccccaccg ggacctccaa tccattgatc 960 ctaccacctt ttcactgtcc ctcacccccc tnatgtcctc acttccctcc ttacccagct 1020 taaattccat ggtcaatcat tataatcact cccttgcata taccctcaac tcccttgccc 1080 ctctctcgct tcgtcntact cgcctggcaa aaccacaacc ctggttaaat ccaactctcc 1140 gcctactccg cgcctgcacc cgtgcagctg aacgtggctg gagaaaaaca cacaaccatg 1200 ctgactggtc tcgctttaaa ttcatgacca cgaacctcaa gtgggccctt aatgctgccc 1260 ggcaatcata ctacatttcc ctagtccatt cactctccca ctctcctaga tnactatttc 1320 acaccttctc ctctctcctc aaacctccaa cacctcctcc cctatcctca ctctcagctg 1380 atgaccttgc ttcctatttc actgagaaaa twgaagcaat cagaagagaa cttccacana 1440 ctcccaccac cacatctacc cacctacctg catctgtgcc cacatactct gccttccttc 1500 ctgttactac ggatgaactg tccgtgctcc tatctaaggc caacccctcc acttgtgcac 1560 tagatcccat cccctctcgc ctactcaagg acatcgctcc agcaattctc ccctctctct 1620 cctgcatcat caatttttcc ctctctactg 
gatcattccc atcagcatac aaacatgctg 1680 ttatttctcc catctttaaa aaacaaaaat tctcccttga ccccacttcc ccctccagct 1740 accgccccat ttctctgctc ccctttacag caaaactcct caaaagagtt gtctatactc 1800 gctgtctcca attcctctcc tcccattctc tcttaaaccc actccaatca ggctttcgtc 1860 cccaccactc caccgaaact gctcttgtca aggtcaccaa tgacctccat gttgctaaat 1920 ccaatggtca attctcagtc ctcatcttac ttgacctatc agcagcattt gacacagttg 1980 atcactccct ccttcttgaa acactttctt cacttggctt ccaggacacc acactctctt 2040 ggttttcctc ctacctcact ggccgctcct tctcagtctc ctttgctggt tcctcctcat 2100 ctccccgacc tctnaacgtt ggagtgcccc agggctcagt ccttggacct cttctcttct 2160 ctatctacac tcactccctt ggtgatctca tccagtctca tggctttaaa taccatctat 2220 atgctgatga ctcccaaatt tatatctcca gcccagacct ctcccctgaa ctccagactc 2280 gtatatccaa ctgcctactc gacatctcca cttggatgtc taataggcat ctcaaactta 2340 acatgtccaa aactgaactc ctgatcttcc ccctcaaacc tgctcctccc acagtcttcc 2400 ccatctcagt taatggcaac tccatccttc cagttgctca ggccaaaaac cttggagtca 2460 tccttgactc ctctctttct ctcacacccc acatccaatc catcagcaaa tcctgttggc 2520 tctaccttca aaatatatcc agaatccgac cacttctcac cacctccact gccaccaccc 2580 tggtccaagc caccatcatc tctcgcctgg attactgcaa tagcctccta actggtctcc 2640 ctgcttccac ccttgccccc ctncagtcta ttctcaacac agcagccaga gtgatccttt 2700 taaaacataa gtcagatcat gtcactcctc tgctcaaaac cctccagtgg cttcccatct 2760 cactcagagt aaaagccaaa gtccttacag tggcctacaa ggccctacat gatctggtcc 2820 cccgttacct ctctgacctc atctcctacc actctccccc tcgctcactc cgctccagcc 2880 acactggcct ccttgctgtt cctcgaacac gccaggcacg ctcctgcctc agggcctttg 2940 cacttgctgt tccctctgcc tggaacgctc ttcccccaga tatccacgtg gctsgctccy 3000 tcacctcmtt caggtctcwg ctcaaatgtc acctcctcag agaggccttc cctgaccacc 3060 ctatctaaaa twgcacaccc tctcccccat catgcccatc tttcttaccc cgctttattt 3120 ttcttcatag cacttatcac catctgacat actatatwat ttatttgttt gtttgtttat 3180 tgtctcctcc actagaatgt aagctccatg agggcaggga ctttgtctgt tttgttcact 3240 gctgtatccc cagcgcctag macagtgcct ggcacatagt aggcgctcaa taaatatttg 3300 ttgaatgaat gaat 3314 // ID ERVB4_2-LTR_MM 
repbase; DNA; ROD; 553 BP. XX AC AC110500; XX DT 11-FEB-2004 (Rel. 9.01, Created) DT 11-FEB-2004 (Rel. 9.01, Last updated, Version 1) XX DE Mouse endogeneous betaretrovirus ERVB4_2 LTR sequence. XX KW LTR Retrotransposon; Transposable Element; LTR; KW endogeneous betaretrovirus; MmERV-B4_AC110500; ERVB4_2-LTR_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-553 RA Baillie J.G., Van de lagemaat N.L., Baust C. and Mager L.D.; RT "Multiple groups of endogeneous betaretroviruses in mice, rats, RT and other mammals."; RL J. Virol 78(11), 5784-5798 (2004). XX DR Genbank; AC110500; Positions 51578 52134. XX SQ Sequence 553 BP; 116 A; 181 C; 99 G; 157 T; 0 other; tgttgggagc cattaaggca acgctattgt cctgatctct gaattgggcc tctccccccg 60 agaagagggt caaaagcggg ccaccaacac atggattccg agaaccacgg gatgttccag 120 ccctaggtca tcaccgatgt cctgaccaca ccctgatacc accaagttcc cgcttccccg 180 tgtagctgcc aaaagaaaga atttatattg cccccttccc ataagtactt ctccttttgc 240 ttgtattgcc ctacccctcc cactgataat cacttcccct cttgcttgta ttgccctacc 300 cctcccactg ataatcactt cccctcttgc ttgtattgcc ctacccctcc cactgataac 360 tatctgtact tccccttttg cttgtgcatt taagccttgc acctttctca atacattggg 420 gtcttgatac aacttcagaa cggtctccgt gtcgttattc gtacaagacc ctcgtctctc 480 tctacccccc atttggttat taggaggagg tcccctcgag atcctcgaat aactggacct 540 gctggacagg tca 553 // ID RLTR9D repbase; DNA; ROD; 412 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW RLTR9D. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. 
XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-412 RA Pavlicek A. and Jurka J.; RT "RLTR9D - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Individual copies are ~92% identical to the CC consensus. 6 bp TSDs. XX SQ Sequence 412 BP; 87 A; 128 C; 94 G; 103 T; 0 other; tgtagcctcc ctcagccctg aagcaggctg atccagactt tgaccttgct gccactaaga 60 aggagattac ctagggttgg agccttccga cctagatggc tcttttgttc tgtcacctgc 120 cactgctctc ctccaagaca ctgctacctg ctgagaagcc cctgagatat tccggaggaa 180 catcctattc tgcactatcg cctacaggct ggctccagac accaagaatg gactggtgcg 240 gggggatggg ccttccccct ttataagcac actctcttag taaactggcg ggccttgaac 300 agaatcattg tcttggtctc cattaatttc tctcgccatc taagtctctt tcagccccag 360 cctgcctccc aggtgtaccc ggttcaagta ggccgcaggc cggcatacaa ca 412 // ID RLTR18_MM repbase; DNA; ROD; 470 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 14-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW RLTR11A; RLTR18_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-470 RA Jurka J. and Drazkiewicz A.; RT "RLTR18_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 3-3 (2002). XX DR [1] (Consensus) XX CC 81% similar to AC008160 fragment (bases 138495-138947) CC described as RLTR11A. 68% similar to RLTR11A (rodrep.ref) CC (bases 63-237). CC Similar to RLTR34_MM (73%, bases 11-174), RLTR23_MM (71%, bases CC 1-443), RLTR28_MM (76%, bases 1-453). 
XX SQ Sequence 470 BP; 137 A; 83 C; 154 G; 96 T; 0 other; tgttggggat tggttctaat gctttgattt aatccagctc ccaaaatcag gaatctgcat 60 gtccaaatgc tgaaggtcct tgtccccagt tggtttttga ttgatcaata aagatttgcc 120 aacggccaat ggctgggcag ggagacagag gcgggacttt tagatttgcg cgggctagga 180 cacaggggga aaggaagagg gagaatcacc atgactcaga gggagacgga tcagatttaa 240 ggagctgcag gagagaaatc atccaaaatg taggtggaaa ggaaagcggc cccatgggag 300 ggctgcccag aagggtcttg ggcagcaaag atcagggggc tgcccagaag gtacagggca 360 gcaaagataa aatatagatt tagaaggtgt taagtcagga ataccggagg gaagtgtgtg 420 ctagccgcgg ggaggtttag aagtgcccag ccattgagct agtcaaggca 470 // ID CAVID2C repbase; DNA; ROD; 84 BP. XX AC . XX DT 26-DEC-2009 (Rel. 15.03, Created) DT 26-DEC-2009 (Rel. 15.03, Last updated, Version 3) XX DE tRNA-derived SINE family: consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW CAVID2C. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-84 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 10(3), 504-504 (2010). XX DR [1] (Consensus) XX CC ~95% identical to consensus. XX SQ Sequence 84 BP; 26 A; 17 C; 24 G; 17 T; 0 other; gggccaggga tttagctcag tggcataagc acctgcctgg caagcgcaag gttgtgagtt 60 tgatccctgg tacaaaaaaa aaaa 84 // ID RLTR26_MM repbase; DNA; ROD; 709 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 14-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW retrotransposon; RMER17A; RLTR26_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-709 RA Jurka J. 
and Drazkiewicz A.; RT "RLTR26_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 11-11 (2002). XX DR [1] (Consensus) XX CC 86% or less similar to RMER17A elements described in GenBank CC (AC079222, AC007937, AE000665). 70% identity to RMER17A CC (rodrep.ref) (bases 12-185). CC Similar to RMER17D_MM (71%, bases 1-209). XX SQ Sequence 709 BP; 171 A; 233 C; 173 G; 132 T; 0 other; tgtgggacgg tgggctatga taggcagcct ggtccctggt tgagctaagg cttaaaaccc 60 cggtgaccct gcaggggact cgcctgcaag ggacggtagg cattttgcca tgctcctggg 120 cacctggctc ctgtcacata gctacagccc cccacacccc cacccccgta gagaggtttg 180 tggccatcag tcacgtagga gcagcactcc aagccctccc acatgtagat aaggtatccc 240 caagctctca gaccaagcca ataggaagta cctgctgtca gaccctgacc caccccaaaa 300 ctgtatataa ggatcctcct atccagaagg aataaaggtg tgtgagaatt actccatcat 360 ctgagagctt ctgtcataag agctgtaaca ccaccgctag ggaagagatc tgctctcccc 420 ccccccaaga aaacgccacc agaagctccc ctgcactcct cactggctag ttagcctcct 480 tccggctcag cctcgcccga ctcagtgcag agcgacctag gtgcagcatc ttggagcaac 540 tgcaaccaaa gaaacagcag cagtggagac tgaggtggag gcagctgcag cagcagaggc 600 agtggaggca attccccctc cctgctcaag ttctctcccc tttccctgaa ccctcgcacc 660 tggcctggcc agagatctcc gtggaaagcc tccagtacac aggcccaca 709 // ID LTR6_Cpo repbase; DNA; ROD; 345 BP. XX AC . XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 01-DEC-2009 (Rel. 14.07, Last updated, Version 3) XX DE Long terminal repeat: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR6_CPo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-345 RA Jurka J. and Baney O.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1547-1547 (2009). XX DR [1] (Consensus) XX CC ~93% identical to consensus. 
XX SQ Sequence 345 BP; 78 A; 79 C; 86 G; 101 T; 1 other; tgtcatggtt tatgtcggtg gcccccaggc ctcatgcatt tgcattgtgc atttgtgatt 60 ggttcattgt ctagygcttg gattgatggt gtcagtgcta cacccacata ggggtggagc 120 caggatgtaa tgtaatggca ggaagaaggt gtgtctctct ctcttgctgg tttctgcctt 180 gctgtttgca gccgccatga actgtggccc cgccatgcca ccctgccttg gagccaactg 240 agtatggact gaaacctcca aaaactgtaa gaaataaacc tttcctttcc caactttggg 300 catcaggtat tttgtctcag caatgagtaa aaagtaacta agaca 345 // ID IAPLTR2b_LTR repbase; DNA; ROD; 327 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 14-NOV-2005 (Rel. 10.11, Last updated, Version 1) XX DE Long terminal repeat of ERV2 Endogenous Retrovirus from mouse. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; IAPLTR1a_MM; IAPLTR2b_LTR. XX OS Mus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-327 RA Smit A.F.; RT "IAPLTR2b_LTR - ERV2 Endogenous Retrovirus from mouse."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC note unusual TG...GA; 4% subst. XX SQ Sequence 327 BP; 68 A; 87 C; 88 G; 84 T; 0 other; tgtggggagc cgccctcaca ttcgccgtta caagatggcg ctgacatcct gtgttctaag 60 tggtaaacaa ataatctgcg catgtgccaa gggtagttct ccactccatg tgctctgcct 120 tccccgtgac gacaactcgg ccgatgggct gcagccaatc agggagcgac acgtcctagg 180 cggaggataa ttctccttaa aagggacggg gtttcgccat tctctctctt gcttcttgct 240 cctgaagatg taagcaataa agcttttgcc gcagaagatt ccggtttgtt gcgttcttcc 300 tggccggtcg cgagaacgcg tgtaaga 327 // ID CAVID2B repbase; DNA; ROD; 88 BP. XX AC . XX DT 25-DEC-2009 (Rel. 15.03, Created) DT 25-DEC-2009 (Rel. 15.03, Last updated, Version 3) XX DE tRNA-derived SINE family: consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW CAVID2B. 
XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-88 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 10(3), 501-501 (2010). XX DR [1] (Consensus) XX CC >92% identical to consensus. XX SQ Sequence 88 BP; 31 A; 21 C; 22 G; 14 T; 0 other; ggggctgggg atatagctca gtggcacaag cacctgcctg gcaagcacaa ggtcctgagt 60 tcaattccca gtaccaaaaa aaaaaaaa 88 // ID RMER6A repbase; DNA; ROD; 788 BP. XX AC . XX DT 26-SEP-1997 (Rel. 2.08, Created) DT 26-SEP-1997 (Rel. 2.08, Last updated, Version 1) XX DE Long terminal repeat of retrovirus-like element. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW putative long terminal repeat; RMER6A. XX OS Murinae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae. XX RN [1] RP 5-361 RA Chopra V. and Jurka J.; RT "RMER6A."; RL Direct Submission to Repbase Update (30-NOV-1996). XX RN [2] RP 1-788 RA Smit A.F.; RT "RMER6A."; RL Direct Submission to Repbase Update (30-NOV-1996). XX DR [2] (Consensus) XX CC Putative LTR of retroviral-like element. 6 bp target site CC duplication. 
XX SQ Sequence 788 BP; 154 A; 213 C; 139 G; 270 T; 12 other; tgtaaawgta wcttttattc tcctgatgta attgtcgtct ccgaggctta ctgcctctgt 60 ctgctaacct aggcctagtc ctggaagctt ctagcctccg tacaatctwa tctaggccta 120 gaatgttttc agcctctgag acttgctgct gaataagctc accctttcta gttctttctg 180 aactctggct ggctggttca actcagctgt tctggctcaa actcctctcc aagctgactg 240 attcaatctg gcttctctca gcttctcact gaattgctct gcttggcctc aaactaactc 300 tggcaatctg ttctaatctt ctggctcctt ctcattctct ggcttgttct gtcttcacct 360 gtgtctagct cgttctctct tcagcctgtc tctgtaaaac tctcccggta aaactgcctc 420 cttctcccct ctgtgctgtt ccactgtccc tnnnnnnnnn gtactgtctg tctcttctct 480 aagtagcttc cctttcctct ctcttcttct gagagttggg catatcctgt tctgtcaaat 540 ctttctctga ttcgtcactt tgtctgccac tcaattagac atcactttca aacatgggtg 600 cttccttcta caaactaact ttaccttcat tgtttgggat taaaggtgtg tactaagggt 660 gtgtctgtat tccagccaga gggattaaag gtgtgtgcta agggctgagc cacaccacaa 720 ctagaaacag gtttttcagt aaataacaca atctcagggt tcacagtgtg atcaaatatc 780 ctgcaaca 788 // ID MEN repbase; DNA; ROD; 269 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE SINE2 SINE from Menetes. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW SINE2; MEN. XX OS Menetes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Sciuridae; Callosciurinae; Callosciurini. XX RN [1] RP 1-269 RA Smit A.F.; RT "MEN - SINE2 SINE from Menetes."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Groundsquirrel. 
XX SQ Sequence 269 BP; 82 A; 63 C; 73 G; 46 T; 5 other; gggctggaga wgtggctcag ccggtagagc gcttgccttg caagcgtgag kcstgggttc 60 nattccagac acatgcawat caatgcctga aagaaagctc atcaaccact aatatacata 120 aaaatgagct gggcgtggtg gcgcacgcct gtaatcccag cagctcggga ggctgagact 180 ggaggatcgc cgagttcaaa gccacctcag cagaaataag cgaggtgcta aagcagctca 240 gtgagaccct gtctcaaaaa aaaaaaaaa 269 // ID L2A repbase; DNA; ROD; 3314 BP. XX AC . XX DT 22-AUG-2000 (Rel. 5.07, Created) DT 22-AUG-2000 (Rel. 5.07, Last updated, Version 1) XX DE L2A (MIR2/LINE2) non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; LINE; KW L2 family; MIR2; MIR2/LINE2; LINE2; L2A. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 3265-3314 RA Degen J.S. and Davie W.E.; RT "Nucleotide sequence of the gene for human prothrombin."; RL Biochemistry 26(19), 6165-6177 (1987). XX RN [2] RP 3165-3314 RA Smit A.F. and Riggs D.A.; RT "MIRs are classic, tRNA-derived SINEs that amplified before the RT mammalian radiation."; RL Nucl. Acids. Res 23(1), 98-102 (1995). XX RN [3] RP 565-3314 RA Smit A.F.; RT "L2A."; RL Direct Submission to Repbase Update (30-NOV-1995). XX RN [4] RP 1-3314 RA Smit A.F.; RT "L2A."; RL Direct Submission to Repbase Update (30-NOV-1998). XX DR [4] (Consensus) XX CC 24 bp upstream of NcoI site; chromosome 11p11-q12. CC This is a consensus sequence for LINE2 subfamily A. The MIR SINE CC shares the 3' terminal 50 bp and was co-amplified with LINE2. The CC 5' CC end is probably still incomplete. The ORF from bp 189-2691 CC encodes a CC product 59% similar (39% identical) to the CC reverse-transcriptase-like CC protein of a LINE-like element in pufferfish (GenBank acc# CC AAD19348). CC Note that, whereas L1 is A- and purine rich in the coding strand, CC L2 CC is C- and pyrimidine (65%) rich. CC There may be more than 300,000 copies of LINE2 in our genome. 
CC LINE2 CC spread before the mammalian radiation, and copies are only 65-75% CC similar to this consensus sequence. XX SQ Sequence 3314 BP; 706 A; 1249 C; 420 G; 915 T; 24 other; ccctccgcaa gagtaaagag ggagccccaa aactcacccc tcccnnaccc ctcagcactc 60 acagcctctc caccactatc tgccttgaga ccaccgggac taatctcatc atgcccttta 120 tccctgttta tgtcagtcac cacaagaagc ccaagactcc caggtcatcc ctcacccccc 180 acgcttgaat ccccatnagg ctctgtacat ctccttttct gaccccatct cccatccctc 240 cccanctcct gaaacccttc cactgngccc tctggaactc acggtcmatc atcagcaaaa 300 tcccccgtat cctcaacctc ttctctgaac gttcccttca ccttcttgct ctaacngaaa 360 cctggctctc ccctgaggac actgcttccc ctgcagcctt ctcaagtggt ggccgttttc 420 tctcccacan ccctcgtacc actgggcctg gaggtggggt aggtgtcctc cttgctcctc 480 attgctgctt ccagaccatt ctccctccct cctccctaaa acaccccagc tttgaatctc 540 atgtcatcag actacatcac ccgctacccc tccttgttgc agtcatctac ngacctccgg 600 gtcactcccc ctcattcctt gaagatttta gctcctggct cactgtcact ctctccaaca 660 ctactcctgt cntaattctt ggtgatttca atatccacat agatgatcct tccaataccc 720 tggcctctca gttccttgac ctcctctcct ccaatgatct tgtcctccac cctacctcag 780 ccactcactc ccatggtcat accctagacc ttgtcattac caataactgc aacccctcca 840 taatctcaat ttcaagcatc ccactctctg accaccacct cctatctttc cagctcactc 900 cctctagtac cctaactcca acaattcttc gaccccaccg ggacctccaa tccattgatc 960 ctaccacctt ttcactgtcc ctcacccccc tnatgtcctc acttccctcc ttacccagct 1020 taaattccat ggtcaatcat tataatcact cccttgcata taccctcaac tcccttgccc 1080 ctctctcgct tcgtcntact cgcctggcaa aaccacaacc ctggttaaat ccaactctcc 1140 gcctactccg cgcctgcacc cgtgcagctg aacgtggctg gagaaaaaca cacaaccatg 1200 ctgactggtc tcgctttaaa ttcatgacca cgaacctcaa gtgggccctt aatgctgccc 1260 ggcaatcata ctacatttcc ctagtccatt cactctccca ctctcctaga tnactatttc 1320 acaccttctc ctctctcctc aaacctccaa cacctcctcc cctatcctca ctctcagctg 1380 atgaccttgc ttcctatttc actgagaaaa twgaagcaat cagaagagaa cttccacana 1440 ctcccaccac cacatctacc cacctacctg catctgtgcc cacatactct gccttccttc 1500 ctgttactac ggatgaactg tccgtgctcc tatctaaggc caacccctcc 
acttgtgcac 1560 tagatcccat cccctctcgc ctactcaagg acatcgctcc agcaattctc ccctctctct 1620 cctgcatcat caatttttcc ctctctactg gatcattccc atcagcatac aaacatgctg 1680 ttatttctcc catctttaaa aaacaaaaat tctcccttga ccccacttcc ccctccagct 1740 accgccccat ttctctgctc ccctttacag caaaactcct caaaagagtt gtctatactc 1800 gctgtctcca attcctctcc tcccattctc tcttaaaccc actccaatca ggctttcgtc 1860 cccaccactc caccgaaact gctcttgtca aggtcaccaa tgacctccat gttgctaaat 1920 ccaatggtca attctcagtc ctcatcttac ttgacctatc agcagcattt gacacagttg 1980 atcactccct ccttcttgaa acactttctt cacttggctt ccaggacacc acactctctt 2040 ggttttcctc ctacctcact ggccgctcct tctcagtctc ctttgctggt tcctcctcat 2100 ctccccgacc tctnaacgtt ggagtgcccc agggctcagt ccttggacct cttctcttct 2160 ctatctacac tcactccctt ggtgatctca tccagtctca tggctttaaa taccatctat 2220 atgctgatga ctcccaaatt tatatctcca gcccagacct ctcccctgaa ctccagactc 2280 gtatatccaa ctgcctactc gacatctcca cttggatgtc taataggcat ctcaaactta 2340 acatgtccaa aactgaactc ctgatcttcc ccctcaaacc tgctcctccc acagtcttcc 2400 ccatctcagt taatggcaac tccatccttc cagttgctca ggccaaaaac cttggagtca 2460 tccttgactc ctctctttct ctcacacccc acatccaatc catcagcaaa tcctgttggc 2520 tctaccttca aaatatatcc agaatccgac cacttctcac cacctccact gccaccaccc 2580 tggtccaagc caccatcatc tctcgcctgg attactgcaa tagcctccta actggtctcc 2640 ctgcttccac ccttgccccc ctncagtcta ttctcaacac agcagccaga gtgatccttt 2700 taaaacataa gtcagatcat gtcactcctc tgctcaaaac cctccagtgg cttcccatct 2760 cactcagagt aaaagccaaa gtccttacag tggcctacaa ggccctacat gatctggtcc 2820 cccgttacct ctctgacctc atctcctacc actctccccc tcgctcactc cgctccagcc 2880 acactggcct ccttgctgtt cctcgaacac gccaggcacg ctcctgcctc agggcctttg 2940 cacttgctgt tccctctgcc tggaacgctc ttcccccaga tatccacgtg gctsgctccy 3000 tcacctcmtt caggtctcwg ctcaaatgtc acctcctcag agaggccttc cctgaccacc 3060 ctatctaaaa twgcacaccc tctcccccat catgcccatc tttcttaccc cgctttattt 3120 ttcttcatag cacttatcac catctgacat actatatwat ttatttgttt gtttgtttat 3180 tgtctcctcc actagaatgt aagctccatg agggcaggga ctttgtctgt tttgttcact 
3240 gctgtatccc cagcgcctag macagtgcct ggcacatagt aggcgctcaa taaatatttg 3300 ttgaatgaat gaat 3314 // ID MLT1C repbase; DNA; ROD; 466 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 23-JAN-1998 (Rel. 6.4, Last updated, Version 3) XX DE Mammalian transposon-like element long terminal repeat (MLT1c DE subfamily) - a consensus. XX KW Non-LTR retrotransposon; MaLR family; MLT1c subfamily; STIR; KW MLT1C. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-466 RA Rouyer F., de la Chapelle A., Andersson M. and Weissenbach J.; RT "An interspersed repeated sequence specific for human RT subtelomeric regions."; RL The EMBO Journal 9(2), 505-514 (1990). XX RN [2] RP 1-466 RA Smit F.A.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21, 1863-1872 (1993). XX SQ Sequence 466 BP; 135 A; 99 C; 115 G; 104 T; 13 other; tgttatgggt tgaattgtgt ccccccaaaa ttgatatgtt gaagtcctaa cccctagtac 60 ctcagaatgt gaccttattt ggaaataggg tcwttgcaga tgtaattagt taagatgagg 120 tcatactgga gtagggtggg ccctaaatcc aatatgactg gtgtccttat aaraagagga 180 aatttggaca cagacacgca cacggggaga aggccatgtg aagacggagg cagagattgg 240 agtgatgcak ctacaagcca aggaacgcca argrytgcca gcaaaccacc agaagctagg 300 aagaggcaag gaacagattc tccctcacag ccytcagagg arrccagccc tgccgacacc 360 ttsatctcgg acttctggcc tccagaactg tgagagaata matttctgtt gtttaagcca 420 cscagtttgt ggtactttgt tacggcagcc cyaggaaact aataca 466 // ID IAPLTR4_LTR repbase; DNA; ROD; 345 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE Long terminal repeat of ERV2 Endogenous Retrovirus from mouse. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; IAPLTR4; IAPLTR4_LTR. XX OS Mus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. 
XX RN [1] RP 1-345 RA Smit A.F.; RT "IAPLTR4_LTR - ERV2 Endogenous Retrovirus from mouse."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC 5% subst. XX SQ Sequence 345 BP; 65 A; 90 C; 92 G; 96 T; 2 other; tgtgaggagc cgcccttgca atcgccatta caagatggcg ctgatatccg gtgttctaac 60 tggtaaacaa gtagtctgcg catgtgctgg ggtatttttc cattccttgt gccctgcctg 120 tcccgtggcg tcatctgggc tgatagtgag cagccagtca gggtgaaana cgtctccaac 180 cgctcttgtg gtctatttaa ggaccgagtt tcctgtgttc tgggcctcct cccccagaag 240 ctgatgatct ttctctcgag atgcattaaa gctatgctgc agaagaaccc gtgtgtgtcc 300 tgtgtgtgtg tcctcgctgg cgagactccn tatcacacag ggaca 345 // ID L1MB7 repbase; DNA; ROD; 922 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1MB7) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; MER12; L1MB7 subfamily; KW L1MB7. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-922 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX RN [2] RP 1-920 RA Smit F.A.; RT "L1MB7."; RL Direct Submission to Repbase Update (1996). XX DR [2] (Consensus) XX CC ORF2 ends at bp 675; average divergence of copies from consensus: CC 17%. 
XX SQ Sequence 922 BP; 365 A; 149 C; 176 G; 223 T; 9 other; cttgtatcca gaatatataa agaactctta caactcaaca ataaaaaaac aaacaaccca 60 attaaaaaat gggcaaaaga tttgaataga catttctcca aagaagatat acaaatggcc 120 aataagcaca tgaaaagatg ctcaacatca ttagtcatta gggaaatgca aatcaaaacc 180 acaatgagat accacttcac acccactagg atggctataa ttaaaaagac agacaataac 240 aagtrttggc gaggatgtgg agaaattrga accctcatac attgctggtg ggaatgtaaa 300 atggtgcagc cactttggaa aayagtttgg cagttcctca aaaagttaaa catagaatta 360 ccatatgacc cagcaattcy actcctaggt atatacccaa gagaawtgaa aacatatgtc 420 cacacaaaaa cttgtacacg aatgttcata gcagcattat tcataatagc caaaaagtgg 480 aaacaaccca aatgtccatc aactgatgaa tggataaaca aaatgtggta tatccataca 540 atggaatatt attcagccat aaaaaggaat gaagtactga tacatgctac aacatggatg 600 aacctcgaaa acattatgct aagtgaaaga agccagacac aaaaggycac atattgtatg 660 attccattta tatgaaatgt ccagaatagg caaatccata gagacagaaa gtagattagt 720 ggttgccagg ggctgggggr aaggggaaat ggggagtgac tgctaatggg tacggggttt 780 ctttttgggg tgatgaaaat gttctaaaat tagatagtgg tgatggttgc acaactytgt 840 gaatatacta aaaaccactg aattgtacac tttaaaaggg tgaattttat ggtatgtgaa 900 ttatatctca ataaarctat aa 922 // ID L1MD1 repbase; DNA; ROD; 973 BP. XX AC . XX DT 20-FEB-1997 (Rel. 5, Created) DT 20-FEB-1997 (Rel. 5, Last updated, Version 1) XX DE 3'-end of L1 repeat (subfamily L1MD1) - a consensus sequence. XX KW Repetitive sequence; L1 (LINE) family; L1MD1 subfamily; L1MD1. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-973 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [1] (Consensus) XX CC Temporarily contains ORF2 region consensus of L1MB7 (subfam L1M4) CC ORF2 ends at bp 675. 
XX SQ Sequence 973 BP; 360 A; 144 C; 179 G; 244 T; 46 other; yttgtatcca gaatatataa agaactctta caactcaaca ataaaaaaac aaacaaccta 60 attaaaaaat gggcaaaaga yttgaataga tatttctcca aagaagatat ayaaatggcc 120 aataagcaca tgaaaagatr ctcaacatca ttagtcatca gggaaatgca aatyaaaacc 180 acaatgagat aycacttcac acccaytaga atggctaaaa ttaaaaagac agrmaatamc 240 aartgttggy raggatgtrg agaaaytgga achctcatac aytgctggtg ggaatgtaaa 300 atggtacarc yactttggaa aacagtttgg cagttcctca aaaagttaam aatagagtta 360 ccatatgacc cagcaattyc actcctaggt atwtacccaa gagaaatgaa aacatayrtc 420 cayacaaaaa cttgtacaca aatgttcata gcagcattat tcataatagc caaaaagtgg 480 aaacaaccca aatgtccatc aatratwgaa tggataaaca aaatgtggta tatccataca 540 atggaatatt attcagcmat aaaaaggaat gaagtamtga tmyatgcaac aacatggatg 600 aaccttgaaa acattatgct aagtgaaaga agccarrcac aaaagrccac atattgtatg 660 attccattta cataacattc ttaaaatgac aaaattwtag arwtgragar cagattcctg 720 gttgccaggg gttagggacg ggggtggggg tggagggagg tgggcatsst ggagttccct 780 gtggtgatgg aaatgttctg tatcttcact gtcactgtat caatgtcaat atcctggttg 840 tgataytgta ytatagtttt gtaagatgtt accattgggg gaagctgggt gaagggtaca 900 cgggatctct ctgtaytatt tcttacaatt gtctgtgagt ktayaattat ttcaaaataa 960 agtttaattt aaa 973 // ID MER69B repbase; DNA; ROD; 1501 BP. XX AC . XX DT 18-APR-1997 (Rel. 6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 4) XX DE MER69 repetitive element - a consensus. XX KW DNA transposon; MER69B. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1501 RA Smit F.A.; RT "MER69B."; RL Direct Submission to Repbase Update (1996). XX DR [1] (Consensus) XX CC 11 bp terminal inverted repeats. 8 bp duplication. CC MER69B is an internal deletion product of an Activator-hobo-like CC DNA transposon. The product of a small ORF remaining at bp CC 942-1265 CC has homology to C. elegans hobo transposases. CC Average divergence from consensus 25-26%. 
XX SQ Sequence 1501 BP; 443 A; 274 C; 305 G; 450 T; 29 other; cagaggcaga tttaccgtga agctaatgaa gcttaagctt cagggcccct cacttgcacg 60 ggccccttcc aaggccctgg gaggggccct agcaatgtgt tcacatggtc atatgttttt 120 gtaaaatttg caaaagtaag atattttaac cacaattggt taagactgct gtctctttcc 180 actccgactt cccctccatc acacttcccc tcatgtcggg tggtattgga gtggccgtgg 240 gcatttttgg gatctggcta agggaaagtt gagttgggga tacatttagt ttgggtttag 300 tgggatatat ttatgtggtt cgcagtcact tccgtgtata gttaagttat tgctagccgt 360 cccggtatag gaatggcttc caggaayatt cctactgccc actgtgccga ctcacccagc 420 gtcgtgacmt gaggcacagg accggagrtc gtatcgcgat atgaacgtgt cctacggcac 480 ctggcaccgg aagtatgcgg gtagtggagg agaaacaagg tttgaaatgt acagagccag 540 aagctagtct gtggaaaatt cttccaatca tcagacgtgt aaaattgtaa gcggaggatt 600 cggttctcat cgatgcctag ycagagcaga agttctctcc tgtcagaaat atactcgata 660 atgcagcgta tacaattata aatgcamcat gcatttattt gcatttttgg aagggaatca 720 tgcgaaatag aatttatcag aantccttgt ttgtagggca cagatctrta gcagtactac 780 aaacagtgag cacatctgtn nnnnnnnnnn nnnnnnttat taamtttctt gctgatatga 840 catgaaatwc tgacgttaac gaagataaca sttcacttct aatttaccat gacaaaaaag 900 acatcagtga caaactggtt actgagtgtc attaattcta agagyattta agattagtca 960 ctgcacaaga aaacttgaaa tgccctgaaa tcttacagct catatatgaa agaaacttga 1020 taggggtttc cccaaatttg acaacaattc taaaaattta catgacatta ccaataacga 1080 gttgtgaagc tgaaagaaac ttttctaaac tatcaataat aaaaaacaaa tttcgatcaa 1140 ccatgctaga ggaaagactg aattatcttt ctattctctc tatagaaaat attacaaaat 1200 cgttgtcata tgaagaggcg atcaaagagt atgcagccaa aaaatgtagg gaaaaartat 1260 tatagaagtg tgtcaggcag ttaattaata aaaatatggt gttgtttttc tggattttgt 1320 gatgtttgtg gtatttgtca actttttaaa atttgtaatt tgttgcgatt tcttttctca 1380 ttctaaataa atattcactt ttgtacctaa ttttgtattc gtaattttgt attctttttc 1440 ttaaagaggg ccccccaaat tgtataagct tcaggcccca caaaacctgg atctgcccct 1500 g 1501 // ID ERV1NA-CPo_LTR repbase; DNA; ROD; 507 BP. XX AC . XX DT 19-JUN-2009 (Rel. 14.07, Created) DT 19-JUN-2009 (Rel. 
14.07, Last updated, Version 2) XX DE Long terminal repeat of ERV1 endogenous retrovirus: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1NA-CPo_LTR. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-507 RA Jurka J.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1543-1543 (2009). XX DR [1] (Consensus) XX CC >98% identical to consensus. XX SQ Sequence 507 BP; 131 A; 150 C; 104 G; 122 T; 0 other; tgagagaatc gactccccgt ttgtaatgta accccccccc ccgagctata gaacagagct 60 ataaggaatg tttcactccc taaacctttc ccaggagata atcagagcag ggcagaaagg 120 aatgtctctg ctagtcatta accactagat gttctgttct ctgtaaagat agagataagc 180 aaaagaatgt actgcataaa caggaccagg gggcacctgg cactctaagg tcaacccaag 240 accctcggtc tccatggtta ctcagcccca gacccctcaa atcagcgtta gccaagtcct 300 gcgccgttca aaattaacca atgcgatttg cttctgtaaa cttgcttgcc tcccgcttgt 360 acctttaaaa accctgcaca gattcccctc ggggcccctc cgcacacgtt tgcctgaggg 420 accccatgcg catggaaata aacttttttt tccctcacca agagttgagt ccttggggtc 480 tctttccctg cggcgtcggc cctaaca 507 // ID L1ME2 repbase; DNA; ROD; 911 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1ME2) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; MER36; MER38; KW L1ME2 subfamily; L1ME2. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 471-728 RA Iris F., Bougueleret L., Prieur S., Caterina D., Primas G., RA Perrot V., Jurka J., Rodriguez-Tome P., Claverie J. et al.; RT "Dense Alu clustering and a potential new member of the NFkappaB RT family within a 90 kilobase HLA class III segment."; RL Nature Genet 3, 137-145 (1993). XX RN [2] RP 1-911 RA Smit F.A., Toth G., Riggs D.A. 
and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [2] (Consensus) XX CC ORF2 ends at bp 675; average divergence of copies from consensus: CC 24% CC Replaces MER36 (Acc. No. Z15025) and MER38 (Acc. No. Z15026). XX SQ Sequence 911 BP; 359 A; 161 C; 159 G; 214 T; 18 other; cttgtatcca gaatatataa agaacgccta caactcaaca ataaaaaaac aaacaaccta 60 attaaaaaat gggcaaaaga cttgaatagg catttcacca aagaagatat acagatggcc 120 aataagcaca tgaaaagatg ctcaacatca ttagtcatca gggaaatgca aattaaaacc 180 acaatgagat accacttcac acccactaga atggctaaaa ttaaaaagac cgacaayaac 240 aagtaytggc gaggatgtgg agcaactrra actcycatac attgctggtg ggaatgtaaa 300 atggtacaac cactttggaa aayagtttgg cagtttctca aaaagttaaa cacgcaccta 360 ccctatgacc cagcaattcc actcctaggt atttacccaa gagaaatgaa aacatatgtc 420 cacacaaaga cttgtacaag aatgttcata gcagctttat tcataatagc cmmaaagtgg 480 aaacaaccca aatgtccatc aacaggagaa tgaataaaca aattgtggta tatccataca 540 atggaatatt actcagcaat aaaaaggaat gaactaytga tacacgcaac aacatgaacg 600 aatctcgaaa acattatgtt gagcgaaaga agccagacac aaaagantac atactgtatg 660 attccattta tatraaattc wagaacaggc aaaactaatc tataatgnta gaaatcagaa 720 tagtggttgc ctctggkgag ggtraatgac tggraagggr catgagggaa ttttctgggg 780 tgatggaaat gttctatatc ttgatcgggg tggtggttac acgagtgtat acatttgtca 840 aaactcatcg aactgtacac ttaaaatctg tgcattttac tgtatgtaaa ttatayctca 900 attttaaaaa a 911 // ID RLTR37_MM repbase; DNA; ROD; 671 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR37_MM; KW RLTR26. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. 
XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-671 RA Pavlicek A. and Jurka J.; RT "RLTR37_MM - a family of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Individual sequences are 88% identical to the CC consensus. 6 bp TSDs. RLTR26 in RepeatMasker. XX SQ Sequence 671 BP; 156 A; 214 C; 176 G; 125 T; 0 other; tgtggggcag tgggctgtgc acagacagcc tggtccccag tcgagcaaag gtctggaacc 60 ccggagaccc ggtgggtggt gatttccacc tgcatgggac agaaggagtt cggccatgcc 120 tcctgggccc ctggctcctg tcacgtagct acagcctccc acagcccccc tgcaggagag 180 gtatgtggct atcagtcaca taggagcagc accaagccct cccacatgca aataaggttt 240 tccccaaact ctcagtccaa gccaatgaga agtacctgct gtcaaaccct gaatcacccc 300 caaaactgta tataagaatc ctatccagag gaattaaagg tgtgcgagaa ctactccgtc 360 atctgagcct tttgtcctaa gagctgtaac acttgggaag agatctgctc tcccgaagtg 420 ccacctgagg ctctgccgca ctcctcactg gctagtcggc ttctcatcgg cccagcccga 480 cccgactcag tgcagggcag cacggagtta ccgagtaaga cacggcagag atggaggtag 540 agccaactgc agcagcagaa gcaggggaag cgacttcccc tccctgcccg cactcgcttc 600 ccttcgctgg aaccctcgca ccaggcctgg ccagagatct ccgtggaaag cctctggtac 660 tcaggcccac a 671 // ID LTR8_Cpo repbase; DNA; ROD; 630 BP. XX AC . XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 31-JUL-2009 (Rel. 14.07, Last updated, Version 2) XX DE Long terminal repeat of ERV1 endogenous retrovirus: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR8_CPo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-630 RA Jurka J. 
and Baney O.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1549-1549 (2009). XX DR [1] (Consensus) XX CC >90% identical to consensus. XX SQ Sequence 630 BP; 170 A; 172 C; 111 G; 175 T; 2 other; tgttacaggc agctgagtta gttagaaata ttataggcag tcagacaggg ctaggtcctc 60 aaaggagcca gtgcagaagm ccagatacct ggcaactgtg gaggagacag acacctcctg 120 ctgtgggcca ggcctcaaga caaaagacca aacaaatgca accacatcct aaggattgat 180 attctaaata gttctctatc accctgagaa gccaggagtc accatcccaa ccccctaggg 240 aagagtcact gctctttaag aatgcccgct ttctacgtga ctcgcttccc catttgtccc 300 taatttaatc aattttacct gaactattaa tcttattggc taactatagt tatgtcccac 360 cttatgccca tctgccttgt gagttcccag tcttccccca ccttgaaccc gtctccccat 420 ggttttactc tataaaagtc cctagaaaag acagagtcct tggcaacccr ctcgggaccc 480 cttctgtctc tcttcagaca gaagctttct ttctctcaat aaatatattc tactcttaac 540 ctccgttgtt cctaaaattc attcttcgat tttagagaac acgaactcat accggacata 600 gtaggttggg agtttcattc ccctatatca 630 // ID RLTR10A repbase; DNA; ROD; 470 BP. XX AC . XX DT 26-SEP-1997 (Rel. 2.08, Created) DT 26-SEP-1997 (Rel. 2.08, Last updated, Version 1) XX DE Long terminal repeat of endogenous retrovirus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW Putative retroviral long terminal repeat RLTR10A; RLTR10A. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-470 RA Smit A.F.; RT "RLTR10A."; RL Direct Submission to Repbase Update (1997). XX DR [1] (Consensus) XX CC Bp 415 to 470 80-85% similar to IAPLTR_MA and RLTR7 termini. CC Copies 5% diverged from consensus. 
XX SQ Sequence 470 BP; 148 A; 83 C; 147 G; 88 T; 4 other; tgtggggagc gggtgtggcg gcagtcccaa aggcgccagg gactgcagct aagtcatatg 60 acttgcacct gacttcctca tataaaccac aaacatcttg agwgctgcgc aggtgtacca 120 ggatacaggt gaatccawtt tggtggagat wtacccctgc tgccctgatt agctgaagct 180 gcgtgcctgg tgaggtggcg tggcctgctg tgcgtggatg ggaactgaga gtatawaaga 240 gtgagaggcc cagggttcgg gggagatata aaaacaaggg agatataaaa acaagggaga 300 tataaaaaca ggggagatat aaaaacaagg gagatataaa caagggagat ataaacaagg 360 gagatataaa caagggagat ataaagaaag aagaaacagg actgaataaa cgtgtgcaga 420 aggatcctgt tgcggcgtcg ttcttcctgg ccagttgggc gcgcgcaaca 470 // ID MUSID6 repbase; DNA; ROD; 84 BP. XX AC . XX DT 22-APR-1997 (Rel. 2.03, Created) DT 22-APR-1997 (Rel. 2.03, Last updated, Version 1) XX DE SINE element, ID family. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW ID family; MUSID6. XX OS Rodentia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires. XX RN [1] RA Smit A.F.; RT "Structure and evolution of mammalian interspersed repeats."; RL Ph.D. thesis (USC 1995). XX DR [1] (Consensus) XX SQ Sequence 84 BP; 11 A; 20 C; 38 G; 15 T; 0 other; ggggctgggg gtgtggctca gtggtagagc ccctgcctag aatcccccag tgaggggctg 60 ggggcgtggc tcagtggtag agcc 84 // ID MER58 repbase; DNA; ROD; 224 BP. XX AC . XX DT 19-FEB-1997 (Rel. 5, Created) DT 19-FEB-1997 (Rel. 5, Last updated, Version 1) XX DE Medium reiteration sequence MER58; DNA transposon. XX KW Repetitive sequence; TIR; Non-autonomous DNA transposon; MER58. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-224 RA Kapitonov V. and Jurka J.; RT "unpublished."; RL Direct Submission to Repbase Update. XX DR [1] (Consensus) XX CC MER58 is a nonautonomous DNA transposon. It is flanked by CC terminal CA and TG ends with 16 bp terminal inverted repeats CC (TIR). CC MER58 can be characterized by 8 bp target site duplication. 
CC Target site (CTCTARAR) and TIR consensuses resemble those of CC DNA transposons from the "MER1 group". XX SQ Sequence 224 BP; 60 A; 51 C; 56 G; 57 T; 0 other; caggggtcgg caaacttttt ctgtaaaggg ccagatagta aatattttag gctttgcggg 60 ccatacggtc tctgtcgcaa ctactcaact ctgccgttgt agcgcaaaag cagccataga 120 caatatataa acgaatgggc gtggctgtgt tccaataaaa ctttatttac aaaaacaggc 180 ggtgggccag atttggccca tgggccatag tttgccgacc cctg 224 // ID HAL1 repbase; DNA; ROD; 1727 BP. XX AC . XX DT 31-MAR-1998 (Rel. 6.4, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE HAL1 repetitive element - a consensus sequence. XX KW HAL1; LINE1-like element. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1727 RA Smit F.A.; RL Direct Submission to Repbase Update (MAR-1998). XX DR [1] (Consensus) XX CC HAL1 resembles Half An Line1 element, as it encoded a protein CC closely CC related to the ORF1 of LINE1 product (the LINE1 mRNA binding CC protein CC p40), but has no similarity to ORF2. Position 156 to 840 is 68% CC similar CC to the ORF1 region of the old LINE1 subfamily L1ME. Position 967 CC to CC 1099 is 76% similar to part of the coding region in the 8A-2B and CC 8A-2V CC mRNAs (GenBank entries MM8A2BGEN and MM8A2VGEN). Average CC divergence of CC copies from the consensus is 28%. The 5' end of HAL1 is probably CC incomplete. 
XX SQ Sequence 1727 BP; 708 A; 263 C; 364 G; 358 T; 34 other; ccctatagtg aagcccacma cctggcaagc cccacccacg cactcagagc ttccaagtcg 60 ctttttagts tcccactctt aaatatgagc agacagccaa ggatcaccag acatctgagg 120 aaancctcta atatggcaga caganaaata aaaacagaga aaaacawwnt ntatccatga 180 aacaagaaya ggatgctata aaaaggaaca ttcagagaac aaaaagaagc tcttggaaat 240 taaaaacgtg agagcagaaa ttaaaatttc aatagaaggg ttggaagata aagttgagga 300 aatctcccag aaagtagaac aaacaaagga taaagatacg gaaaatagga gagaaaagaa 360 ttaaaaaaat tgaggatcag tccaggaggt ccaacatcca actaatagaa gttccagaaa 420 gagagaacag agaaaaagat ggaagagaaa ttatcaaaga aataattcaa gaaaatttcc 480 cagaactgaa ggacatgaat ttccagattg aaaggnccca ccgagtgctt accgcaaaaa 540 tgaaggaaaa aaatagaccc acaccaaggc acatcattgt gaaatttcag aacactgaga 600 naaaaggaaa atcccaaaag cttccagaga gaaaaaaagg tcacatacaa aggatnagaa 660 tcagaatggc atyagacttc tcaacagcaa cactggaagc tagaagacaa tggagcaatg 720 ccttcaaaat tctgagggaa aatgatttyc aacctagaat tctataccca gccaaactat 780 caatcaagtg tgagggtaga ataaagacat tttcagacat gcaagatctc aaaaaattta 840 cctcccatgc accctttctc aggaagctac tggaggatgt gctccaccaa aacgagggag 900 taaaccaaga aagaggaaga catgggatcc aggaaacagg ggatccaaca caggagagag 960 gtaaagggaa ttcccaggat gatggtgaag ggaaattcca ggatgacagc tgtgcagcag 1020 gcctagagag caaccagtcc agattggagc aggaagatgg aaggctccag gaggaatgtt 1080 ctcaagaaaa tagaanacaa atgaaactga tagattatct gatgtgtttg aatatattga 1140 gaggntatna tttagwcatg tgaaagtttg ggggagaatt gaattagtga taggtacana 1200 gaaaactaag caaatgaaaa aaagcaagac aattattaac tccagggaaa acaaaaagtt 1260 gtacaagaaa ggaaatgtaa tcatagtaca ctacatggct cagctgtgaa taacatattt 1320 acatagtcat aataatgtaa acactraata ctgatttaac caaaattatg atataaacta 1380 tattgggagg ataagaaatg gaaagggtgt gcgtatkgtg tangnggaat garagaanta 1440 aatcctcatc ttccatagta ggaagtcart agataatgtc taaaactgaa aaatcaagaa 1500 atagcaatat aagcatgtta tttagaaata tggaggtaaa tattccaaaa gaatcagcta 1560 aaagagttga aagtggttgc ctctggggaa ggagctgnag gtggtgaaga gngnagggac 1620 ggggaaytgc tgttttttat aagcctttta 
gtactatttg accttttaaa ctatatacat 1680 rtattacttt aatttaaaaa antaanttat aaaaaagnaa ataagta 1727 // ID Charlie17a repbase; DNA; ROD; 219 BP. XX AC . XX DT 05-FEB-2007 (Rel. 12.01, Created) DT 10-APR-2007 (Rel. 12.01, Last updated, Version 1) XX DE hAT DNA transposon from mammals. XX KW hAT; DNA transposon; Transposable Element; Charlie17a. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-219 RA Smit A.F.; RT "Charlie17a - hAT DNA transposon from mammals."; RL Direct Submission to Repbase Update (05-FEB-2007). XX DR [1] (Consensus) XX CC rnd-2_family-343 Pos 1-30 match 3' termini of Charlie3a_Xt and CC Chaplin6_FR. XX SQ Sequence 219 BP; 47 A; 70 C; 64 G; 38 T; 0 other; cagtgcttcc caaccttttt cacgtcatgg cacacataga aaatgataat atttgtacgg 60 cacactgggg taaacggacg aggctgctca cggccgagga ggtgaccggc ccaggggctc 120 cggctgcccc aggccccgcc cggccgcccc gagggctgag gggatcaata tctcggcaca 180 cctgtaaccc attcgcggca caccagttgg gaagctctg 219 // ID MLT2B2 repbase; DNA; ROD; 503 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 27-JAN-1997 (Rel. 6, Last updated, Version 2) XX DE Interspersed repeat MLT2B2 - a consensus. XX KW Interspersed repeat; MLT2B2. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-503 RA Smit F.A.; RT "Structure and evolution of mammalian interspersed repeats."; RL Ph.D. thesis (USC 1995). XX DR [1] (Consensus) XX CC This sequence is a human endogenous retroviral LTR.
XX SQ Sequence 503 BP; 111 A; 129 C; 121 G; 134 T; 8 other; tgtgatggtt aattttatgt gtcaacttga ctgggctaar gggtgcccag atagctggtt 60 aaacattatt tctgggtgtg tctgtgaggg tgtttccaga tgagattagc atttgaatca 120 gcggactgag taaagaagat tgccctcacc aatgtgggcg ggcatcatcc aatccgttga 180 gggcctgrat agaacaaaaa ggcagaggaa gggtgaattt gctctctctc cttgagctgg 240 gacatccatc ttctcctgcc cttggacatt agaactccag gttctcgggc cttcggacty 300 cgggacttgc accagcagcc ccccagattc tcaggccttc ggactcggac tgaryyacgc 360 caccggcttc cctggttctc cagcttgcag acggcatatc gtgggacttc tcagcctcca 420 taatcacgtg agccaattcc cctaataaat cycytctatc catcctattg gttctgtctc 480 tctggagaac cctgactaat aca 503 // ID MamGypLTR1d_LTR repbase; DNA; ROD; 813 BP. XX AC . XX DT 05-FEB-2007 (Rel. 12.01, Created) DT 10-APR-2007 (Rel. 12.01, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from mammals. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW MamGypLTR1d_LTR. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-813 RA Smit A.F.; RT "MamGypLTR1d_LTR - Gypsy LTR Retrotransposon from mammals."; RL Direct Submission to Repbase Update (05-FEB-2007). XX DR [1] (Consensus) XX CC 4 bp TSD; 32% subst in dog-human; 90% similar to MamGypLTR1b. 
XX SQ Sequence 813 BP; 198 A; 191 C; 251 G; 161 T; 12 other; tgtggctgga taatattttg agatattaat ttatgttttt ttttcctcct gtattttccc 60 ctttcctcct tccccccatt caagcaggta gccggctctg tgctcattgc ctcaggggag 120 gtatgtggca gggcagaaaa gcagaagtag cctgcaagtc tttctggctt ttgtnttcca 180 aaagcctaag cccttagggg aactagaggg tttgnggaag aggcaaaagg gaanaagtgt 240 cttggagaaa nncgagggga gagaggactt cctcctcccc agactagaaa gagattcccc 300 tgggctggga aggggagggg agaagagaag gagagangtn tgggtccnag agcagaggga 360 cctgngccct gcttcccggc agcgccgctg gggaggcggc aagaccccag agaggaatgg 420 ctgcgtggtg cgtctaggca gacgggacca taggcagcct cgcaaaagat tcccgtgccc 480 caagcatggc acggaagcag cagagagccg ccggacctga aggggccatg cggacaggga 540 caacggacgt ctcagcggta acctgtgtgg accgatgacc gagggccaga tccnctcccc 600 ncccccgacg ccttggcact gcgtaagatc cctggaactg tggcacaacc ctgggggagg 660 gagggggaac cccaagaagg actgaggtta agttttccgc cagcccagcg gaatgggggc 720 tcagagtcag aaattaagtt gagttataga aaataaagaa agtnatattt cttgcacacc 780 tgagtttgtg gactgagatt catacctgct aca 813 // ID RSINE2A repbase; DNA; ROD; 290 BP. XX AC . XX DT 22-APR-1997 (Rel. 2.03, Created) DT 22-APR-1997 (Rel. 2.03, Last updated, Version 1) XX DE SINE element RSINE2 subfamily - a consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW B4; RSINE2 family; RSINE2A; retroposon; subfamily RSINE2A. XX OS Rodentia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires. XX RN [1] RP 1-290 RA Lee I., Westaway D., Smit F.A., Cooper C., Yao H., Prusiner B.S. RA and Hood L.; RT "Complete Structure and Organization of Three Mammalian Prion RT Chromosomal Regions."; RL Unpublished (1996). 
XX DR [1] (Consensus) XX SQ Sequence 290 BP; 75 A; 77 C; 86 G; 50 T; 2 other; tggggctggg gagatggctc agtcgataaa gtgcttgccg tgcaagcatg aggacctgag 60 ttcggatctc cagcacccac gtaaaagccg ggcatggtga tatacgcctg taatcccagc 120 gctggagagg cggagacagg aggatccctg gggctcgctg gccagccagc ctagccgaat 180 cggcgagctc caggttcagt gagagaccct gtctcaaaaa ataargtgga gagtgattra 240 ggaagacact cggtgttaac ctctgacctc cacacacaca cacacacaca 290 // ID MER69A repbase; DNA; ROD; 179 BP. XX AC . XX DT 18-APR-1997 (Rel. 6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE MER69 repetitive element - a consensus. XX KW DNA transposon; MER69A. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-179 RA Smit F.A.; RT "MER69A."; RL Direct Submission to Repbase Update (1996). XX DR [1] (Consensus) XX CC Putative DNA transposon; 12 bp terminal inverted repeats. XX SQ Sequence 179 BP; 43 A; 50 C; 35 G; 51 T; 0 other; ccagaggcag atttaccgtg aagctaatga agcttaagct tcagggcccc tcacttgcat 60 aggccccttc caaggccctg tacctaattt tgtattcgta attttgtatt ctttttctta 120 aagagggccc cccaaattgt ataagcttca ggccccacaa aacctggatc tgcccctgg 179 // ID HERVK22I repbase; DNA; ROD; 6837 BP. XX AC . XX DT 30-APR-1998 (Rel. 6.5, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE HERVK-related endogenous retrovirus flanked by LTR22s - a DE consensus sequence of internal part. XX KW endogenous retrovirus; HERVK superfamily; LTR22; HERVK22I. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-6837 RA Kapitonov V.V. and Jurka J.; RT "HERVK22I."; RL Direct Submission to Repbase Update (APR-1998). XX DR [1] (Consensus) XX CC Average similarity of HERVK22I individual copies to the consensus CC sequence is about 91%. CC 6 bp target site duplications. HERVK22I is flanked by LTR22s. 
CC Similarity of HERVK22I consensus sequence to known retroviruses CC is shown below: CC ---------------------------------------------------------------- CC sequence begin end sequence begin end similarity CC ---------------------------------------------------------------- CC HERVK22I 287 445 HERVK 281 439 0.64 CC HERVK22I 1176 1429 HERVK9I 1062 1322 0.63 CC HERVK22I 1441 1869 HERVK 1863 2336 0.65 CC HERVK22I 2346 3009 HERVK9I 2380 3048 0.64 CC HERVK22I 3046 3353 HERVK 3659 3966 0.65 CC HERVK22I 3706 3805 HERVKC4 2027 2126 0.82 CC HERVK22I 3873 4359 HERVKC4 2127 2606 0.86 CC HERVK22I 4385 4821 HERVK 4975 5427 0.66 CC HERVK22I 4910 4979 HERVKC4 3080 3148 0.74 CC HERVK22I 5028 5065 HERVKC4 3149 3186 0.64 CC HERVK22I 5114 5334 HERVKC4 3187 3408 0.79 CC HERVK22I 6654 6785 HERVKC4 3401 3531 0.80 CC ----------------------------------------------------------------. XX SQ Sequence 6837 BP; 2027 A; 1648 C; 1423 G; 1722 T; 17 other; ggtggtgccc cgcgtgagga acgctgcaaa cggatcgtga cggacccctc gaaaatgaag 60 gtgaaaagaa ctgcgcagtc agtgagtaat cagtaagtca ttggtgcccg ctcgggattt 120 ccaagttcgg ggggaattgt tcaggctagg gtttcatcat gggacaacag ttatcagctc 180 aacagaaaca gtatataaaa gtattgaaac agctgcttaa agctagcgga gcctcagttt 240 cgcaggctca attaagggac ctaatgcaaa ctgttgtttc ccataaccca tggttcccgg 300 aagaaggtac gctagacgta gagctctggg aacaagtggg gagaaatctt aaacaacatc 360 atgcacaagg gcaacgggtc ccagtaacat ctttaacgtt atgggcctta gttagggcgg 420 ctttggtccc gttatacaca gaagagccta aaaagggaag ggaggaggaa ccgtcaccta 480 ccttaccacc tcctcgtccc tcagccccgc tatcaccggg ccaaaataac aaagaggaaa 540 cggaggtttt gcctgagccc cctcctccaa tagattggaa aaaagacagg ggatacgcta 600 cagctatggg accctgtctt aagcaagcgg cattagaagg ggagctctta gcctgcccgg 660 taatgcaaga tcgacaaggc aatcaggtgt atgaacccat ttcttttaac gcttataaag 720 agctaagaaa aagcattaaa gaaaacggag ccgctagccc atttatgaaa ggaatgattg 780 aagccatggc agacaacttc tgtatgaccc catgggactg gtcagtgcta gctaaaacaa 840 ctttggagcc cagccaatac ctcctctgga aggcagaata tgatgagttg tgtgaacaac 900 aagccaacca 
gaatcaggtg gccaggcaag acataacagc tgctatgctc caggggaggg 960 gtccccatgc cgatgtacaa caactagatt ttgatcccca ggcctatgca caagtgtctt 1020 tgtgtgctct cagggcttgg gaccgaattc ccgaaagcgg agttcaacag ggatctttta 1080 taaatgttca acaagggcct caggagccat ttgttgaatt tatcaatcag ttaacccagg 1140 caattaagag acaaattagt cacgcccagg ccgctgatat cttattgttg caattggctt 1200 atgaaaatgc taatgtcgat cgccagcaag caatgcaggc aatcagagga aaggcagcca 1260 cagtcgggga acttatacga gcatgtcaac tggtggggac tgaaacacac aaagccaaaa 1320 tattggctat ggcattaagg cctcctaaag tgaaaaggga gagaaaccca aattgttttc 1380 tatgtggaga gccaggtcat atgaagaggg aatgccccaa taatagagac caaggtaact 1440 caggaaaaga acccccttct atatgccccc aatgtaaaaa ggrgaaacat tgggcaaatc 1500 aatgcaggtc caaatttgay aaaaacagma accccataag taaccaggcg ggaaacttca 1560 tgakgggctg gccyyaggcc ccgctwmaar ctggggcaat gccagcagct ttcctcggtc 1620 agatggaaag cccacagtcc tctctctcag agcagccacc actgggagcg caggactgga 1680 cttactctgc cccaacaaat tagtgctaaa agaaggagaa gaccctaaaa gggttgcaac 1740 cgggatctgg ggcccactgc ctccgggaac agtgggatta gtcctagggc gatcaagcct 1800 atccagtaaa ggaattaatg tgctcactgg ggtaattgat agtgattatc aaggtgagat 1860 attagttatg atggaatgta aaggtctgca tattcttccc cctggatcaa agatagctca 1920 gttactgcty ttaccatact gggtccccaa cgcccacgga aaggaaaggg gaaagggaag 1980 ttttggaagc acgggagcca caggagtata tgggaaycaa ttaatcactg atcagagacc 2040 catgattacc ttaaaaattg gaaataaaaa ttttactggc ttattggaca caggggtgga 2100 tatttcaatc attagtgatc aaaactggcc agaaacttgg ccttgggtca ctcagaaaca 2160 aaaaattgtc agcatcgggg aagcgcacac agccaagcag agcacgcgcc ccctaacatg 2220 ttgcgattcg gagggaagaa aggcagttat acaacctcta atcatgccca tccctgttaa 2280 tctttgggga cgggacctat tagcccaatg tggggggtca ctctgcagac ccctttctaa 2340 taatggccac tgttattatt cctcccctac ccctgacgtg gctctctcaa gatccaattt 2400 gggtagaaca gtggccttta aagggagaga aattacaaag agcccatgaa ttagttgaag 2460 agcaattaaa agccgggcat atagaaccat caaacagtcc ttggaattcg cccattttca 2520 tcattcccaa aaagtctggt aaatggagac ttttgcatga cttacgtgct attaatgcta 2580 atttgcaacc tatggggccc 
cttcaacagg ggctcccttc ccccgcggcg attcctcaag 2640 attggcctat aatcgttatt gacttaaaag actgctttta tactattccc cttgcagaac 2700 aggacagaga aaaatttgcg tttacaatac cagctatcaa taatgaaagg ccagcttgct 2760 gatttcattg gaaagtgctt cctcaaggaa tgctaaacag tcctaccatg tgtcagtatc 2820 atgtaaatca agctttgctc cccagtagaa aagaatttcc taattgcaag attattcatt 2880 ttatggatga tattttacya gcagccccaa cggagccagt acttttaart ttatatacct 2940 ctgtcgtaaa gaatacacag ytaagaggtt taatcatagc acctgaaaaa gtacaaatgt 3000 cctctccttg gaaatatctt gggtacatac taacttcccg gtcagtaaga cctcaaaagg 3060 ttaaattaaa tactagcaac ttacacacct taaatgatta tcagaaatta ctaggcgata 3120 ttaactggct ttgccccacc ttgggcataa ctacttataa gttgcaaaac ctgttttcta 3180 tcttaaaggg caatacagcc ctagactctc ccaggtattt aactcctgca gcaaaaaggg 3240 aaattgagga aatagagcaa gctatttctc agaggcaact agatcgcata gacccatgat 3300 attcagttca gttgtttgtt tttcccacta aacattcccc aacaggatta ataggacaga 3360 tggccccagg gctgcgcttt ctagaatgga tttttttgct cacataccgg gactaaaaca 3420 ctctctccct atatccagct aattagtaaa gtcatctatt caggccacag acgatgcaat 3480 cagttgctag gttatgaccc tgatatcatc agaattcctt taagtaaaaa gcaattcgaa 3540 gcagtattgc ccttatcttt agatctgcaa atagcactct ctgattacgc aggccatata 3600 gagcatgccc ttcctgctga caaactactt cagttcttat ctcatactcc tgtggttttg 3660 cctacaaaaa tagttcactc ccccatacct aacgctttaa cactgtttac tgatggctct 3720 ggtaaacatg gaaaagcggc tatttggtgg agaccacata attccctcac tcattctgga 3780 tttactagca ctcagagagc tgaggttgga gccttaatat tggccctgga gactttttct 3840 gctcagccca tcaatattgt tagtgactct gcttactctg tttatttatt gcagaacctt 3900 gaaacagccc tcattaagtc cactctggag cccaccctgt gtgcactttt tctttgactt 3960 cagcaattgc tggatcaacg tacacatcct atttttatca cacatattcg agcccacagc 4020 tcactgcctg gcccactggc ttatggcaat gatcaagcag acctgcaggt tatgacgtca 4080 ctgcttgacc aagccaccca atcgcatcaa tttttccacc aaaattggag aaacttatct 4140 aaacaatttc aacttaccca ragactagct aaacaaatta tcctgcaatg cccagattgc 4200 cagctcacag gcacgtcccc tccttcaaca ggtgttaacc ctagaggact agaacctaat 4260 cagttatggc aaacagatgt tacacacgtc 
cctgaatttg gaaaactaag atatgtacat 4320 gtatccattg ataccaattc tcatctaatt agcgcacatg ctcttcctgg agagtccacc 4380 cgatatgtca ttaaacatct tcttttaact tttgcattta tggggcggcc cacaaaaatt 4440 aaaactgata atggtctggc ttatgccagc tcacaatttc aacaattttg tcacacatgg 4500 aacatccaac attccacagg catcccgtat aacccccaag gacaggccat agtagaacgt 4560 gcccattcca cccttaaaaa tatgctcaga aaacaaaaaa gggggaatat gagtaaggac 4620 cctgcaacac tattggcaca agccttattt acccttaatt ttttaaattt aaatgataaa 4680 tttcaatcag ctatagaaaa gcactttgct aaaacctctc aagacataaa acctgcagtt 4740 ttatggaaag atgtaaacag taatgtatgg tgtggtccaa atgaattgtt aacatgggga 4800 agaggatatg cttgtgttca caccccctca ggtcctcttt ggattccagc acgacgcatc 4860 aaaccatacc atggcatggc taggacccaa cccagtacca gaaatgaaga aaatgaccct 4920 gcaggaccca cagccccgga cgatgcagct tcctcggatg acacaagccc cggacattac 4980 ctgggggatg ctgaagaaga caactcagga ggccgagcga atcctgctcc agacacagac 5040 accattcact ccagataatt tgttccttgc tatgctctct gttgtacatt gcaactcatg 5100 tagggcattg atccttttta tgctctcgct ttgtctgcaa cctgtacctg ctacactcta 5160 ttgggctcat ctcttagatc cgcctttctt ccgccctgtt acctgggcag acactccctt 5220 cccagcctct aataacgtaa ctgcttggct aggagggata gatttacccc cagtggggtc 5280 cctcagtaat ggcacacatt ggactaaggt gccaggtaac actacatatc actccactat 5340 cctcccactg tgtgtaagtt ataaaggttc taacccttac tgtgtacctg cccaaacaca 5400 attatggcta catcatggca aaggaaatgc ctttacagtc ttagctgcag gtagcctcaa 5460 accaggcaat gcaatcaatg ccactttccc aaacattcct tcctgtgcta aagaacaaag 5520 ctgggaaagt aatggattcc actttagctg ggaggtctgt cacgggggac aagcccgtag 5580 cctccagtta ggcaattata acatcttaga ctggagtccc cacggccatt tgcagggcaa 5640 tcttactgat gtcctcatct atcatggcat caatcacagt ttcgtagcca cgtcccgttc 5700 ccctatratt tgggccgatg gggggatggg atatcccaga ccccaagtaa agtccatgcc 5760 accccaagac actttatggt gcctgggaca tcttagcacc tcccttaaca cctggcatgg 5820 gacatatcat aattccagtc acaattatac tatgaccttt attcataatc acactgatca 5880 gtgcctgatt tgcactaccc atccatatgt tttccttatg ggaaccgata tttccgttac 5940 accccaaaac tccacgtttg tgacccgggt gcagggacag 
gcttggttcg cctcatgtat 6000 cactaattat aatatatcta atttaaatat tactagtgtc atggtattga ggagacaatc 6060 tgaggctttc ctaccagtca atttaacatg cgattggcaa ggttcctctg cccttgccac 6120 cttagaatgt gccctgtccc aggtcagacg caaaagattc atagttacac ttacagcctt 6180 tatagtctca gccatagtca tcctagcaac tgctagtgtt gctgtagcat ctattactga 6240 atcagtacaa acagctactt ttgtagataa tttggccaga aatgtgtcta atgaacttct 6300 cttacagcag ggtayagatc aaaagattct tgcacgtctg caagccctcg aggctgcctt 6360 ggaatatgta ggggagcgac aagatgcact ggcattccaa cagcaattaa actgtgactg 6420 ggagcataag cacatctgtg tcacctctct accttggaat caatcaatac atagttggga 6480 tgaggtgaaa caacacctct ggggaacctt tcatgacaat ttaacagcag acgtaaagca 6540 acttaaaact aaaattctag aatccctaaa cgccatagat ctacacgccc aacaaacagc 6600 catatggaag ggtgtgcgag atcatctctc ctggatagac ccccgctcct gggggtcact 6660 ccttgattgg aaaagaatgt tgctaattat actcatgttt gtcttatgtt atttactaat 6720 tctaggatgc aaagccggaa tacgagctat aaccgccgcg cctgacagac ctgttgctgc 6780 acacatctgt actcttcaat caacaaaacc tgatgcaaaa aacagaaaag ggggaga 6837 // ID ERVB5_4-I_RN repbase; DNA; ROD; 8124 BP. XX AC NW_047710; XX DT 11-FEB-2004 (Rel. 9.01, Created) DT 11-FEB-2004 (Rel. 9.01, Last updated, Version 1) XX DE Rat endogeneous beta retrovirus ERVB5_4, internal sequence. XX KW Endogenous Retrovirus; Transposable Element; ERVB5_4-I_RN; KW RnERVB5_NW_043819; endogeneous betaretrovirus; gag domain; KW pol domain; pro domain. XX OS Rattus norvegicus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Rattus. XX RN [1] RP 1-8124 RA Baillie J.G., Van de lagemaat N.L., Baust C. and Mager L.D.; RT "Multiple groups of endogeneous betaretroviruses in mice,rats and RT other mammals."; RL J. Virol 78(11), 5784-5798 (2004). XX DR Genbank; NW_047710; Positions 17971804 17979936. 
XX SQ Sequence 8124 BP; 2280 A; 2038 C; 1629 G; 2177 T; 0 other; acacaggtcg gggcaactgg cgcctaacgt agggcctcag taaggggctc cgacagaaca 60 ttgctggccc atgaacgtac ccctagcata tgatcgccac ttccctgagg acgtgagaga 120 agatccacac tctgactgta gtaaccgccg actgtgagtc cgatcttcat aaggtaagac 180 cacgcaatgc aatcatggga cagtcattga gcaaacatga cttattttta aaggggctta 240 aggagtccct caaggcaaga ggaacttggg ttaagaaaaa ggatcttgct aaattcttca 300 tattccttga tgatgtttgt ccttggtttc cccaagaggg aactattgat gaaaaaacag 360 cttttttctc cagagtggga gactgtctga aagactttta taggacattt ggttcagaaa 420 gggtccctgt tcaagccttt tcctactgga acttgattaa tgaagtactt agggtttata 480 agacatggac agatattcaa gagatactta gtgagggaga aaggtcttac agcgccattc 540 tccggctcct tcagtctgcg atactgtctc cattgtaatt ccagatccgg gccacccacc 600 cccacccctg gtccgcctgc cgagtcctag taggcatcag caaaagttcc tactcctagt 660 atttaccctt ccttgactca gttgaaagcc ccagtccaca acgagcctga tgaggtcagg 720 gaggtccttc ctccagaaga tccatcatcc ctagaagagg ctgctaaata tcataactct 780 gattggccac ccttacatac tgtccactct aggccaaccc cttacaccac tactccccca 840 ttttgcgctc ctgttttttt attttgtctc agatattgcc aacaccactg aaaatgctaa 900 aataaatctt tgctgggagg taaatccctt aaagatttac tctgaaagaa aagaaagcat 960 cttaccctat tgagggaatt ccgagctttg agtcttgagc taacccagtt gctcccaagc 1020 ccacccctct taaaattgag atgaagaaaa atcctaaaac tgaaacaaag aaaaaaattc 1080 cctaaaaccc ttcaagcatt cccagtcaca cagatcaaaa tatcctcccc aaccccaccc 1140 cgccaggaac agccggagaa gacaatgctt ccgatcaggg agagaaccca cccccctcag 1200 ttgatgaagc ctcggataat gagaatgagg tttcaggaga ttcaaatagg gaaaaggaag 1260 aggttagaca tcaacagtat cgctgcctct ggtttaaaca cgtaaaagaa ttaaagatgg 1320 cggcgaaact ctatggcccg gtggacccgt tcactgtatc cataattgaa accctgagtg 1380 accgatggct tacacccaat gattggtatc ttgtggcaca ggtaaccttg tcaggaagtg 1440 ctatgttctc tggaaaaata aatttgttga aaattgtaca gaaacagcaa tccacaattc 1500 cgaaaacaaa acctccaaaa cttggaccaa ggataaactt ttaggtcaca gcccctatga 1560 caccatgaaa ggcaggctaa gtttcctccc ggcctcttag ctcagataca aaacgctggt 1620 ctgaaagact ggaaaaactc ccccctaagg 
ggtcggtgac tacctccctg gccaaaatac 1680 aacagggtcc cgaagagccc tctggcagtt tcatcagatg cctcacagat gcgactgaaa 1740 ggcttgtcgg ctgagacaaa ataaaagaga attcattaaa catcttgctt tcaaaaacaa 1800 aaaccatact tatcaagctg ccataccgcc ccaccgtagt ggcaaggaca gagtttggta 1860 tagcccaaaa tgttttcgcc atactaccag cctagccatt ggagccaccc ttaaagactt 1920 tacctaggat ggtcgtccaa agatttgcct caattgcaaa cgaccaggcc atttcttcaa 1980 agaatgtagg gcccataggg ctgaacacac gtctgcctcc tccctgtgtc ctcgatgcta 2040 aaaggataga cattgggctc ctgagtgcca ttctaaaaca gatgcccaag gtaaccactt 2100 gcccccaagg cagggaaatt cccagaagaa ccagtccttg gccctgagct ggagacaaac 2160 ccctggggct atcgggcttg tcccccaaac acccaaccct cagttcaaaa aacccaagag 2220 ttgcctccct tcattgaaca accacaggca gcacagaact ggactctgtt ccaccaccag 2280 cgcaaaatta acacctaaga tgggaattta aatcctacct atgggaattt ttggcccact 2340 actccctgac accttcagcc tccttctttc tcatgccagc ctaaccctcc agggtctgaa 2400 catttccccc agaataattg acaatgatta tacaggagtg ataaagattt tagcctcctc 2460 tactaaccgc aggttcggga caaaggatca ctcaattact tttgctcccc cttattcacc 2520 ccaacccaaa tactcaacaa aaaactagga gtgccagcaa tttcgactcc tcagatgctt 2580 actgagtcca acagattact caagatctac ccctccataa attaaaacta gatggaagga 2640 gttttgaagg acttataggt actggagcag tacagtaatt tcagagaaat attggccctc 2700 ctcctggccc cttactactt ctatgactta ccttaaagga atcgggcaga acaccaacac 2760 ccagtaaagt tccaaggtcc ttacatggac agatgaggta ggtaatacca gcactgtcca 2820 accttacgtt gtgtctggct tacctgttaa cctttgggat gttctcgccc aattacatct 2880 ccttatgtgt agcccaaatg aaactgtagc ccatcaaatg ctaaaacagg gcttccgctc 2940 cagacaggga cttggaaaac attcacaagg cataagagaa cctattcagg tgaatgaaaa 3000 gttgaattgt ttgggtttga gtgcaccaga tttaccctag tggccattga aactcctgta 3060 cctcaagcag ataaaatcac ttggaaatct aaggatgctg tgtgggttga tcaatggcca 3120 ttgacttcag aaaagttggc cgcagcggtg gcgttaatac aggaacagct tgccaccggc 3180 caccttgagc ctaatacctc tccttggaac actcccatct tttttatcaa gaaaaaaacc 3240 aggcaggtgg aggcttttac aagatttgag agagattaat aaaactatgc ttgctatggg 3300 agccttgcaa ccaggtttgc ctatacccgt ggccattact 
gctgggtact aaaaaatagt 3360 gatagacctt aaagattttt ttaccatccc tttacaccct gaagatagag aacgttttgc 3420 aggggttggg gatttagctc agtggtagag cgcttgccta ggaagcgcaa ggccctgggt 3480 tcggtcccca gctccggaaa aaaaaaaaaa aaaaaaaaaa agagaacgtt ttgcattcag 3540 tctactggtg acaaatttta aagggcccat gcctcgcttt cactggaagg ttctgactcg 3600 gaatggccaa tagcctcacc ttatgcctaa tatttgtggc ccagatcatt gatccattta 3660 gggcattaag gccttccatt tacataatcc attatatgga tgacatcctc ctgacaggac 3720 ctgatgattc tgaattgctt tgctgttgcc agcagctttc acaaaagttg accactaaag 3780 gccttcagat tgtccctgat aaaattctgt tgaaagatcc ctatttttac ttggggtttg 3840 aacttcgcca tcaaaagatt actatacaaa aagtccagct taagactagc cacttaaaaa 3900 ccataaatga ttttcaaaaa cttttgggag atattaattg gcttaggccc tatttaaact 3960 caccacagga gaactaaagc tcctttttga catccttaag ggagactata acccctcttc 4020 ccattgatct ttgaaccctg tggccaatgc cctgcaaatt gttgaagggg ttattcaaag 4080 tctgggagtt accttcatct cttatgaaaa aacccctact ctttgctgtc tgagccacat 4140 cccacactcc cactggagtg ttttggcaaa aagatcccat tttgtgaaca aatattcctg 4200 cttctcccac taaagtcctt atccttttta cctccttggt tgctaaactt atcctcttag 4260 ggcaagaaca aagcaggcag ttctttggcc aggactctga taagctgatc ttaccctact 4320 ctaaggacca gatacattga cttatgcaga ctaccgatga gtggtctgtg gtttgttcat 4380 ccttttcagg aatcatggat aatcactgtc ctgctgattc aatttgcaaa gattcatccc 4440 ttcatttttc caaggatcac atctcctcga cccttgaaaa ggcatgcctg gtgtttacag 4500 acggctcctc tggtggaagt gcagcttgtg tcatggatgg acaaaccacc acaattcagt 4560 ctcccttttg ctctgcacaa cttgtagaac tctttgcagt aattaaatta tttcaaataa 4620 tgataggttg tccttttaat ttatatacag gtagtgctta tattgcccag tatgtcccac 4680 tccttgagat gatcccttat attaagtcct ctactaatgc cgtcccttat tttcacaacg 4740 ttagaaattg attatctcta gatgtcagtc tttctatatt gggcacctct gtgctcattc 4800 agaaccccct ggccctctct ctgagggaaa tgcttgtgct gatgctgcta cctgtttggc 4860 tttccccatc cctatagacc ccattgtaca agtcccagag agccactcat tacatctctt 4920 aaatgctcag acacttggac cactctttag aattatctga gaagaggcat gtcagattgt 4980 taaacagtgc ccagcttgtg tcacatagtt gcctaacctt tatttggagg 
tcaaccccag 5040 gggcctgatc ccaaatgaga tatggcagat gggtgttacc catgtgcctg aatttggaca 5100 tctcaaaaat cttcatgtgg taatagatac ctttaatgga tttatttttg ccagcctaca 5160 cactggagaa gcctcaaaaa atgttatagc ccatatccta aattgtcttt cagctatggg 5220 tgaacctaag gttattaaga cagacagtgg ccctggctat acgggaaaaa attttcaaga 5280 gttctgccac aggctacaga ttaaacatgt tactggcatc ccttattatc ttcaaggcca 5340 gggcattgtt gaacacattc acccaactct caaaaacact ctgctcaagt taaaatgggg 5400 tggtttatac ctcataaaag gatccccaaa aatagtcttc atcatgcatt gtttgtcctc 5460 aattttctga acctaggcac ccatggtaga tcagccgcta actgtctcca gcaccctgaa 5520 acaaacaaga catatgctac ggccatgtgg aaagaccctc ttgcttataa atgggatggc 5580 ccagacacag ttcttatctg ggatagaggg gtgatgtgcc tatttgatac cgaggagggt 5640 gctgctaggt ggctactaga aagacttgta aaacatatgg atgtccccca aaaggatagc 5700 tcagccccag aaatattaga aagtgatgaa atccagggaa gaaagtctcc ctgagattcc 5760 tctttccttt tctcttcaca ggttaagaaa aatagatcat tccctgtcaa gactttttgt 5820 tctactgctg ttcttgatgt ctacatatgg agctaattct caccgacccc tcaatcttac 5880 caggctggta ataaatggag aggatgatgc tctgtggagc atgtctaagg ttacaacacc 5940 aactcctggt ggcctagtct tcacccagac ctttgtcaac tggccatggg caccccagcc 6000 aactgggacc tagagggata ctataatctt caaaaggctc cctcattgcc ctcccctcct 6060 gggagacatg gtcttgaccc atggggtggt tgtgccacaa aagatagaag agtgttgaga 6120 atgctaccct tttacatttg tcccggcttc tactgagatc gcaccttaaa ccatgggtac 6180 ggagtcaagg cagaatactt ctgttataat tggggctgtg agaccacgag tgatgctctt 6240 tggaatcctt cttcctcatg ggatttcatt aaagtcactg ctaattatac acaccataag 6300 tctggaagcc caggttggac aaacatagag aagtgttcgg ggtggtgcca ccaactacat 6360 attcaattta caaaactggg aaaaaaaaat acgcctgtga accagtaacg tttcatgggg 6420 tctgaggctt tataaggcat gccatggtgg tggtgtggct ttcactataa agttacaaac 6480 aaacctaccc ggggctgata aaccagtcat gcggattagg ccaaaccttg ctttgaattc 6540 cctatatttt tcggggaaat cttaaacctt tccccactat tttagcccca ccccctttgc 6600 cctctgctac tcatttacct ccaaatagga tttatgagcc caatacctat cgatttatct 6660 tcaatattgt taatcaaacc ttttttgtgt taaatcattc ccctcctgaa cttactaagt 
6720 cctgctggct gtattatcat tctgaccgcc ccccccccac acccttttta tgaaggaatt 6780 gcaacaagta ttaatgtttc ggggacagat gatcctttcc tatgtaggtg gcaccaacag 6840 aggctaacta acgttgagca cgctctctgg ccaaggatcc tgtgctggca accaaattcc 6900 ctccactcct gcccagctat acaagcaaat gattcccatt ccctccttct tcaagtatct 6960 tcttccctcg aatgatatat ggtaggcctg tagtgatggc cttacaccct tcgtttaaac 7020 tcccaatttg ataaataatg ctactgcctt ttgtgtactt gtacaactaa ctccccgctt 7080 gacctactat atagccaatg aatttgccca gatgtaggat cctgttactg atcatgccct 7140 acttcgaaca aaacgggaag tcataacagc agttaccctg ttgtcctcct cggccttggc 7200 gctgcgggtg ctgggacggc ataacctggt tctctccaac agacattatt ctgaccttcg 7260 gcaagccatt gacaaggaca ttagggaact ggaaattggc attaccaaac tagaaaattc 7320 ccttgtctct ccttgtctga agtggttctc cagaaccgaa gaggactcga tttgctcttc 7380 ctccagcaag ggaggctatg tgcggccctt aaagaagaat gttgcttcta cactgactat 7440 accgggataa tcagagttag tatggctaag gttagagaag gcttagaact aagaaagaaa 7500 cagagagaac aagaagagga ctggtttaag agctggtgtt tctcctctcc ctggctcacc 7560 accctcctcc cctccatttt ggggccccta gttggtttct tcctgctcct ggcttttgag 7620 ccttggacct ttaaccggct cacggtcttt atcaaacaac aaattgactc cttggcatcc 7680 aaacccctac aggtccatta ccatgagttc aatttggcag atagaggtct ggctgaaccc 7740 tatgatgaca tatctccctt gggagctgca taacactcac atttatgctc gcctgtattg 7800 gcgtgggtac ctgcaagggg tgcgaactaa gtacagcaaa gggagaaccc tattgtctgt 7860 ctctgagtac cctgtgaagt aaacctgatt gcatagaggt tcgtgtcgat ttcccttcag 7920 ccacacacca ttgagagatt ctcggagttg gcaggggcta agaactcccc tccctaaaaa 7980 gacacccccc aaaactatca atgaccagtg ataatgaccg gaggatgagg tcaatacttc 8040 ccccctcggg ttaaaccccg cttgtctctg tgacatcgtc atcacatatt actcgccagc 8100 ctcacataaa ttgataaaaa ggga 8124 // ID B3 repbase; DNA; ROD; 228 BP. XX AC . XX DT 22-APR-1997 (Rel. 3, Created) DT 11-AUG-1997 (Rel. 3, Last updated, Version 2) XX DE SINE element. XX KW SINE; B3 family; B3. XX OS Rodentia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires. 
XX RN [1] RP 1-228 RA Kalb F.V., Glasser S., King D. and Lingrel B.J.; RT "A cluster of repetitive elements within a 700 base pair region RT in the mouse genome."; RL Nucleic Acids Res 11, 2177-2184 (1983). XX RN [2] RP 1-228 RA Lee I., Westaway D., Smit F.A., Cooper C., Yao H., Prusiner B.S. RA and Hood L.; RT "Complete Structure and Organization of Three Mammalian Prion RT Chromosomal Regions."; RL Unpublished (1996). XX DR [2] (Consensus) XX CC The tRNA-like region of B3 is very similar to that of B2. XX SQ Sequence 228 BP; 65 A; 64 C; 51 G; 43 T; 5 other; ggggctggag agatagctca gcggttaaga gcactggctg ctcttccaga ggacccgggt 60 tcggttccca gcacccacat ggcggctcac aaccgtctgt aactctagtt ccaggggatc 120 tracnccctc ttctgacctc cacgggcacc aggcacgcac gtggtacaca gacgtacatg 180 cargcaaaac actcatacac ataaaataaa aataaatmtt twaaaaaa 228 // ID IAPLTR3_I repbase; DNA; ROD; 6821 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse family of LTR retrotransposons - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; IAPLTR3; KW IAPLTR3-int; IAPLTR3_I; LTR. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31, 51-54 (2003). XX RN [2] RP 1-6821 RA Pavlicek A. and Jurka J.; RT "IAPLTR3_I - a subfamily of autonomous LTR retrotransposons."; RL Direct Submission to Repbase Update (JAN-2004). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Similar to IAPEYI. Individual copies are ~98% CC identical to the consensus. Flanked by IAPLTR3 LTRs. 
CC IAPLTR3_I_ORF1: 525-1899 (458 aa) gag CC MGSSQSVITPLQAVLKQRDLQVTSHTLQNFVKEVDRVAPWYACSGSLTVASWNKLGRDLDRKHEEGDLRL CC GTKAIWKLIKNCLEDETCRPAIVEGQGTLEEVQDSMSETERSERIRAQKKKCLRKKGPPQDSEGRGEKKK CC GSETEPSTKKKPYTNFYPIHDLEALEINSSGSEDLDPSEEAKLEEEAAKYKEQRYNPDRWSRSRSNKKGS CC ISATVPTAPPLYELQYSANSFFPQEELKKIQMAFPVFDTGEAGRMHAPVDYKQLKELAESVCNYGVSANF CC TLVQVERFANMAMTPSDWQMIAKATLPNMGQYMEWKALWYDAAQNQARVNTTAVDDNQRQWTFELLTGQG CC QYATNQINYPWGAYAQIGAAAVKAWKALTRKGEAGGHLTKIVQGPQEAFSDFVARMTEAAARVFGDPEQA CC MPLIEQLIYEQATQECRAAITPRKKKGWYKTGLRYVGS CC IAPLTR3_I_ORF2: 2305-3061 (252 aa) pol (partial) CC MPQMNVQPILVKSPGPLPPRTMGLIVGRGSLTLQGLVVHPGVVDHQHLQDIQVLCSCPQGIFSISPGDRI CC AQLIFLPSPDKDEDNIKELRGMGSSGPDSAYLVMPLNARPTLHLFINDKDFEGIMDTGADKSIISSYWWP CC KSWPVTKSSHSLQGLGYQSCPAISSSTLTWQTSEGQRGLFTPYVLPLPINLWGRDVLSEMGITLTNEYSV CC QTTNIMKKMGYTKGKGLGSKEQGRLEPVSHNGNPGRRGLGFS CC IAPLTR3_I_ORF3: 3297-5580 (761 aa) pol (partial) CC MNLFGSIQRGLPLLSTLPKQWKIVILDIKDCFFSIPLCHQDRPRFAFTIPALNHMEPDKRFQWKVLPQGM CC ANSPTMCQLFVQAALEPVRQYFPSLLLLHYMDDILLCHKDMMLLQKSYSFLIKMLNQWGLQIAAEKVQIS CC EVGSFLGTIIFPDKILPQKLEIRRDHLHTLNDFQKLLGSINWLRPFLKISSAELKPLFDILKGDSHISSP CC RALTPAANKALQVVENALQNAQLQRIEESQPFNLCVFKTAQLPTAVLWQDGPLLWIHPNASPARVIDWYP CC NAVAQLALRGLKAAVTHFGRDPKLLIVPYTATQVQVLAATSDDWAVLVTSFSGQIDNHYPRHPILQFALN CC QAIVFPQVTAKNPLPEGIIVYTDGSKTGVGAYVTNNKIVSKQYNETSPQIVECLVVLEVLKAFPGPLNIV CC SDSSYVVNAVNLLEAAGVIKSSSKVADIFQKIQAVLLHRRFPVYITHVRAHSGLPGPISRGNDLADRATR CC VVAAALSSQVDAARNFHKQFHVTAETLRRCFALTRKEAREIVTQCQNCCQFLPVPHVGVNPRGIQPLQVW CC QMDVTHISSFRRLQYLHVSVDTCSGIIFASPLTGEKASHVIQHCLEAWSAWGQPKILKTDNGPAYTSQKF CC RQFCRQMNVTHLTGLPYNPQGQGIVERAHRTLKSYLIKQKGGVDEALPLTPRVAVSMALFTLNFLNLDEQ CC GHTAADRHCSEPNRPREMIKWKDVLTGKWRGPDPILIRSRGAICVFPQEEDNPLWVPERLT. 
XX SQ Sequence 6821 BP; 1892 A; 1544 C; 1637 G; 1748 T; 0 other; tctggttgcc agaagcccgg gaattaacat cgccagcatc gaggagaacc cctggagatg 60 gggtgggttc agaactgcag agaaaaggta agttcggaga ggtatgtctg atcgtgaacc 120 tcttatccct tttgatttcg gtcttaatct agatgcaccc taggaagggg cggtagaagc 180 tctagtcttg gttgccgtcc atcgaggctg gccctgaaat ttgctatcca ttgaggctgt 240 ttctgaaata ttctgtccat cgaggctggt gctgaaattt gaggtccatc gaggctggtg 300 ctaaaatttt ctgtccatcg aggctggtgc tgaaattctt attttgtcca tcgaggcttg 360 tgctgaaatt tgaggtgtgg tcagtccgac gtagataagc ggcagcaccg aggtatcttg 420 taagcctccc atagataagg gagcagagca ccgtgtttac ttttggcttt gcttaaggag 480 taccataagt cgggcgtaga tcagccgcag gtactcattc tatcatgggc tcttcacagt 540 cagtgatcac cccattacag gcagtgctaa agcaacgcga tctgcaggtc acctcccata 600 cgctgcagaa ttttgttaag gaggtggatc gcgttgctcc ctggtatgcc tgttcggggt 660 ctctaactgt agcctcatgg aataagctag gaagggacct tgaccgtaag catgaagagg 720 gagacttacg cctaggcacc aaggcaattt ggaagctgat aaaaaactgt ctagaggatg 780 aaacctgccg acccgccatt gtggagggac agggaacact agaagaggtt caggacagta 840 tgtcagaaac cgaacggagc gagagaataa gagctcaaaa aaaaaaatgt ctaagaaaaa 900 aaggacctcc ccaggattca gaaggaaggg gagagaaaaa gaagggcagt gaaactgagc 960 cctctactaa gaaaaagcct tatactaatt tttatcctat ccatgacttg gaggccttag 1020 agattaatag ttcagggtcc gaagatctag accccagtga ggaggctaaa ttagaggagg 1080 aggcagcaaa atataaagaa caaagatata accctgaccg atggtcgcga tcaagaagta 1140 acaaaaaggg tagcatatca gctactgtgc ccacagcgcc accactttat gaattgcagt 1200 atagtgctaa ctcttttttt cctcaggaag agttaaaaaa gatacagatg gcatttccag 1260 tctttgatac tggggaggcg gggcgtatgc atgccccagt ggactataaa caacttaaag 1320 aacttgctga atctgtctgt aactatgggg tcagtgccaa ttttactctg gttcaggtcg 1380 agagattcgc taatatggcc atgaccccat cagattggca aatgatagca aaggctacac 1440 tccctaatat gggacaatat atggaatgga aagctctatg gtatgatgcg gcccaaaatc 1500 aggccagggt caataccaca gcagttgatg acaaccagag acaatggacc tttgaattgt 1560 tgaccggcca agggcagtat gccaccaatc aaattaacta tccttggggg gcatatgccc 1620 agataggggc agctgcggtc aaggcttgga 
aggcacttac aagaaaaggg gaggctgggg 1680 gtcaccttac aaagattgtt cagggccccc aggaagcatt ctctgatttt gtggcaagaa 1740 tgactgaggc tgcagcccgg gtctttggcg accccgagca agccatgcct ctaattgaac 1800 agctcattta tgagcaagcc acccaagagt gcagggcggc catcacacca agaaaaaaaa 1860 aagggtggta caagactggc ttaagatatg tagggagcta ggaggccctc tcaccaacgc 1920 gggactagcc gcggctatat tgaaatctca gaggcgccct aatttcaaca aacaaaaggc 1980 gtgtttcaat tgtgggaaag ctggacattt gaaaagagat tgccctgtac ttgaacgtgc 2040 aagaggagct gttctctgct cccgctgcag aaagggctat cacaaggcta gtgaatgccg 2100 ctctgtcaga gatataaaag gcagactcct gcccccgata ggtgaagtta atccctcaca 2160 gtcaaaaaac ggggtgctgg gcccccgatc ccagggccct cacaaatatg ggagccgttt 2220 tgtcagaagc cagagcagga cagaggagat aacacccgac gagttacagg agtggacttg 2280 cgtgccacct ccagtttttt actcatgccc caaatgaatg tgcaaccaat ccttgttaag 2340 tctcctggac ccttaccccc tcggactatg ggtctcattg ttgggcgagg atctctcact 2400 ttacaaggcc ttgtggttca tcctggagta gtggaccacc aacacctgca agacattcaa 2460 gtcctttgct cctgccccca aggtattttt tccattagcc caggagatag aattgcacaa 2520 ttaatattct tgcccagccc tgataaggat gaagataaca taaaagaatt gagaggtatg 2580 ggttcttctg gccccgattc agcctatcta gttatgcctt tgaatgccag gcctactcta 2640 catcttttta ttaatgacaa agattttgag gggattatgg acaccggggc agataagagt 2700 atcatttcat cttactggtg gcccaagagc tggcctgtta caaaatcttc tcattcttta 2760 caaggcctgg gttatcaatc ttgcccagcg attagttcat ccaccctgac atggcaaacc 2820 tcagaaggac agaggggcct atttactcca tatgtgctcc cgctgccgat aaacctatgg 2880 ggcagagatg tgctttctga gatgggaatc acattaacca atgaatactc agttcagacc 2940 actaatatta tgaagaagat ggggtacaca aagggaaagg gattgggaag caaagaacaa 3000 ggtagacttg agcctgtctc ccacaacggt aacccaggta gacggggctt gggtttttcc 3060 taggggccgt tggggttaca agacccatcc cctgggtaac agaggaaccc gtgtgggtct 3120 cccaatggcc gctatcctct gaaaaattgg aggcagtcaa caagctagtt acagaacagg 3180 tgcagctcgg acatttggaa ccttccacgt ccccgtggaa tactccaatt tttgccataa 3240 aaaagaaatc aggcaaatgg aggttacttc atgaccttcg agcaattaat gcacaaatga 3300 accttttcgg ctcaatccag cggggtttac ctctgttatc 
taccctgcct aagcaatgga 3360 aaattgtcat tttagacatt aaagactgtt tcttttccat ccctttgtgt caccaagatc 3420 gaccaagatt tgcttttaca attccagccc ttaatcatat ggagcctgat aagaggttcc 3480 agtggaaagt cctccctcag ggcatggcaa atagccccac tatgtgccag ctttttgtac 3540 aggcagcatt agaacctgtg agacaatatt ttccctcctt actgttatta cattatatgg 3600 atgatattct tttgtgccat aaagatatga tgcttttaca aaaatcctat tcatttttga 3660 taaaaatgtt aaaccaatgg ggactgcaga tagctgcaga aaaggttcag atttcggaag 3720 tgggttcctt tctgggaacc attattttcc cagataagat acttcctcaa aaattggaaa 3780 ttcgcagaga tcatttacat actcttaatg attttcaaaa gttattgggg agtataaatt 3840 ggcttaggcc ttttttaaag atttcctctg ccgagcttaa acctttattt gatattttaa 3900 agggggattc acatatctcc tcccccagag cccttactcc tgcggctaat aaagccttgc 3960 aagtagtaga gaatgcctta caaaatgccc agttacagcg tattgaggaa tcacaacctt 4020 tcaacttgtg tgtctttaag acagctcagt taccaactgc tgtattgtgg caagatggac 4080 cattattatg gatccatccc aatgcttctc cggcaagagt aatcgattgg tatcctaacg 4140 ccgttgcgca acttgcgctt cgtggcttaa aagcagcagt cactcatttt ggacgagatc 4200 ctaaattgct aattgttccc tatactgcca cacaagttca ggtccttgca gctacatctg 4260 atgactgggc agtattagtc acctcctttt caggacaaat tgacaatcat tatcctagac 4320 atccaatttt acaatttgcc ctgaatcagg ccatagtgtt tccacaggtg acagcaaaaa 4380 acccgcttcc agaaggaatc atagtgtata cagatgggtc aaaaactggt gtaggcgctt 4440 atgtgaccaa taacaaaata gtgtctaaac aatataatga aacttcacct cagattgtgg 4500 agtgtttggt ggttttggag gtccttaagg ccttcccggg accgcttaac attgtgtcag 4560 attcctccta tgtggtcaat gcagttaacc ttctagaggc tgctggggta ataaaatcat 4620 ccagtaaagt cgctgatatt tttcaaaaaa tacaggctgt tttattacat aggagatttc 4680 ctgtttatat cacccatgtt agagcacatt caggtctccc tgggcccata tccagaggca 4740 atgatctcgc agaccgagcc accagggttg tggctgctgc cctttcatcc caagtggatg 4800 ctgcaagaaa ttttcacaaa caattccatg tgacggctga aaccttgcgc cgttgctttg 4860 cattgaccag aaaggaggct agggaaatag ttactcagtg tcaaaactgt tgccagttct 4920 tgccagtgcc acatgtgggg gttaacccac ggggaattca accgctacaa gtctggcaga 4980 tggatgtaac acacatttcc tcctttagga ggcttcaata cctgcatgtt 
tctgtggata 5040 cttgttctgg tatcattttt gcctcgcctc ttacggggga aaaggcctca catgtaattc 5100 aacattgcct tgaagcctgg agtgcttggg gacaacccaa aattcttaaa acagataatg 5160 gaccggccta cacctctcaa aaatttcgac agttctgtcg ccagatgaat gtaactcatt 5220 taactggttt gccatacaac cctcagggac aaggcattgt ggaacgtgcc caccgcactc 5280 ttaaatctta cctaatcaaa cagaaagggg gagtcgacga ggctctgccc ttaacaccga 5340 gagtggccgt ctctatggca ctctttactc ttaatttttt gaatcttgat gaacaaggcc 5400 acactgcggc tgatcgtcac tgttcagaac caaacagacc tagagaaatg atcaaatgga 5460 aggatgtctt aaccggaaaa tggagaggcc cggatcctat tttaataaga tccaggggag 5520 ctatttgtgt ttttccacag gaagaagaca atcctctttg ggttccagaa cgcctcacct 5580 gaaggatctc cccttcagaa gatgtggaca aaagagggaa cactgaaaca acgatggata 5640 ctgacccttc tactggagat cctggttcct agtatatcag gggaattgcg ttgggtatta 5700 tgtccacctt tccctgccca tgcctgtcat gcacagtgca caagtgagca gtttaagctt 5760 atttaaagaa aagagagatt ttagaatctc tgctattata gttggcttgg tagcaacagc 5820 agcaatagct gcctctatca ctgcttcggc ccttgccttg tctgctacgg ttcagacaat 5880 gcaaaccatc aatgaccttt cggcaagagt gacatcggcc ctggacaggc aggccacggc 5940 caactcccag atatagggag gcctcatgtt ggtaaaccaa cggatcgatc tggtccaaga 6000 acaggtggat atcctgtggc aaatggctta gctgagttgc gaacataagc tgcctagcct 6060 cggtatcacc tcagtacaat ttgagaattt tataagagca actaacctgt caaaggcttt 6120 gtttcattat ttgttacaga attggaccat agaatttgaa cagacactcc gggagctgag 6180 gatcgctgtc ttacaagtca attcgacacg tcttgacctg tcattgacgg aaggacctgg 6240 atttctgcag cattctccta cttcaaggaa tgggtggggg tggatttgtt tggagtggct 6300 gtgtatggtg gagtactgct cctactatgg atgatctgca aacttaaagc ccaaacaagg 6360 agggacagga tggtagtaac ccaggcgctt gttgctttgg agcatggtgc ctcccccgat 6420 atatgattat cgatacttaa gcaataggtc gctggccact cagctcttgc accccacgag 6480 gctagtctca ttgcacggga tagagtgagt gtgcttcagc agcccgggag agttgtacgg 6540 ctaagcactg cagtagaaag gctctgcggc ataaaatgag cctattctag ggagacatgt 6600 catcttgtat gaaggttgag tgtccaagtg tccttccccc aggaaaaacg acacaggagc 6660 ggaccaaaac ccctccgggt gacgagcctg ggaggaggtt ttgtgtaagg cccctatgct 
6720 tgcacactgg ggatttgacc tctatctcca ctctcagtac tgggtggcct gttgcttcta 6780 aaataaaaga aaagggggag atgtgaggag ccgccctcac a 6821 // ID RLTR32B_MM repbase; DNA; ROD; 700 BP. XX AC . XX DT 22-AUG-2008 (Rel. 13.08, Created) DT 22-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Long terminal repeat of endogenous retrovirus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; RLTR32B_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-700 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats of endogenous retroviruses from mouse."; RL Repbase Reports 8(8), 892-892 (2008). XX DR [1] (Consensus) XX SQ Sequence 700 BP; 183 A; 165 C; 140 G; 208 T; 4 other; tgtagagggc aaaaaggcct tgtgcacaca gataagcact ggagaagcca ggaaagttaa 60 acacacagct tacctcttca atggaatgag gtgcttagct gctcttcctg gaatgccggg 120 aatgatgtag ttttacaacc tggtgatcct ttgctgtctg atgagacagg ctctgcccgt 180 atcttccaga atttttgttt ttacttatcc cagacccttt ccccctaaat cttgagcatt 240 gcttttaggc cataaaaccc atccttatga gtgatgtcac ttgtgcttta caacagccta 300 agtgacttct atgttatctt agggccaggc caatcctgac acccacaggc attatgttta 360 cctcagtcat tctccctaga ccctwaatct tgagcactgt ttctgagtca acagwaccca 420 ttcttgtgaa agaagccaga tgtactttgt aaagaatctc aatgtaaaca tgtcttaagc 480 attgttgtrc acwtgggact gagttaattg ttatctgaat acttttttcc ccaaaattgt 540 gctgtgctta aatatggcta aagtaaaatg atctgggcca gactctagaa gtcctggtcc 600 agcaccggtt acaataaaat cgggctgagc cgagtttcat tcttacctca ccctgatagt 660 tttccctgcc tgctgtaaag cctgcagggg aatcaccaca 700 // ID L1MC5 repbase; DNA; ROD; 1351 BP. XX AC . XX DT 14-MAY-1998 (Rel. 6.5, Created) DT 23-JUN-2000 (Rel. 7.2, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1MC5) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; L1MC5 subfamily; L1MC5. 
XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1351 RA Jurka J.; RT "L1MC5."; RL Direct Submission to Repbase Update (05-MAY-1998). XX DR [1] (Consensus) XX CC Position 1 corresponds to pos. 1400 of L1MC4. Possibly 3' end of CC L1ME4. XX SQ Sequence 1351 BP; 513 A; 205 C; 231 G; 383 T; 19 other; aaaaaaaaaa yacatttcta scctttccac tgaaaagncc tagaaacaat gactaatcca 60 atagcaatga gtaccctyag gacccagatt gtggtctcta aataccattt cccactaaaa 120 ggaaccaggg ctccttggag aaatggctaa ttccaggtct ggggcaggaa atgtacaaga 180 tgagcctgga acatcttgtc ataccagata gcaaggaagc tataaaacta ctagggttgt 240 gtcaaaagga ctcaggagcc aacttactgg ccaaagatgg gacaatttga gcatcaataa 300 taactgcaat ttaacacatc aaatatrttt aaatccatga gtttataata aaaaaatcta 360 attggtcacc tttggagtga tgctagggaa ccaactcatt attttgaaaa ctggtaaata 420 aagggaaaga atcaagcatt tatcttgcct ttcctatata aactgtacct cwgggtaacc 480 aaatagtwga tgagggaaat ttctctttat agaagtattc cagctaataa atgaagaaaa 540 aattagaatt agaatatcac cattttgcaa cccctaatga attaatggat ctaggcaatg 600 atcatcaatg gctgctaaca tcacaaaaac tarasatytg cctcctgatg gaartawaca 660 acaccaccta tgaaatatta gtcttgccaa aaaaaaatca aacctgaatc tgatcaagcc 720 tctagatcta actaccaatt tacaggaaat acagaggaca gaggaacatg ttaaatgaca 780 ccatrgggat gcaatcagca aaatccagac tgtgggaaac tctacaggac aaatrttaac 840 ttttcttcaa caaataaatt atgagaaaaa aaagatggaa gaagaaccta tagattaaaa 900 gagacttaaa agacatatca accaattaca atgtatggac cttatttgga tcctgattca 960 aamaaatrta aactataaaa atatatrtgt atacaattgg aaatttgaac actgactaga 1020 tatttgatga tattaaggaa ttattgttat ttttaggtgt gataatggta ttatagttat 1080 tttataaaat agtccttatc ttttagagat acatactgaa atatttatag ataaaatkat 1140 atgatgtctg ggatttgctt caaaataatc caggagggag gaagtaggtg gagctataga 1200 tgaaacaaaa ttggccatga attgataatt gttgaagctg ggtgatgggt atgtggaagt 1260 tcattatact attctctcta cttttgtata tttgaaattt ttctaatwtt aaaaataaag 1320 accaatttgt tggacattaa tttgttggca a 1351 // ID RLTR23_MM repbase; DNA; ROD; 606 BP. XX AC . 
XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 14-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW retrotransposon; RLTR11A; RLTR23_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-606 RA Jurka J. and Drazkiewicz A.; RT "RLTR23_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 8-8 (2002). XX DR [1] (Consensus) XX CC Similar to AC067964 (85%, bases 113374-113694) and AC079222 CC (82%, bases 56089-56527) fragments described as RLTR11A. CC 67% similar to RLTR11A (rodrep.ref) (bases 111-264). CC Similar to RLTR34_MM (78%, bases 22-509), RLTR18_MM (71%, bases CC 1-468), RLTR28_MM (72%, bases 73-264). XX SQ Sequence 606 BP; 152 A; 102 C; 210 G; 142 T; 0 other; tgtagggggt ggttctgatg ctttgatcgg gaatctgcat gtaaacactg aaggtcctgg 60 tccccaattg gttcttgatc gatcaataaa gatgccagtg gccaatgagc tgggcgaaag 120 aggtgggact tccagatttc cacaggcagg ctaggagaca caggaggagg aaagagaatt 180 tgccatgctt tggagggaga aagagccacc agccatgtga gatctcaggt ggagtggcca 240 ttggccactt ccccgactgg gcctggggta gcaggcggga gattagaaac ataactaagc 300 tgagggcaga tttagggatg ttgagccagg actgaaggta actgggcaac taagctgagg 360 gcagatgtgt tgagctagga gtgaagggaa gggtatgcta gccgaggaag gcttagaagt 420 gcccagccat tgagctagta aggatattaa aaataagcta atgtgtgtgt gtgtgtgtgt 480 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtctttca tccacggatc caagggaacc 540 tgggtggggg ctggtagcgt ggtctgcctg gagcttaaag tggggtagca aaaactacac 600 gctaca 606 // ID ERVB4_5-I_RN repbase; DNA; ROD; 8172 BP. XX AC AC106444; XX DT 11-FEB-2004 (Rel. 9.01, Created) DT 11-FEB-2004 (Rel. 9.01, Last updated, Version 1) XX DE Rat endogeneous beta retrovirus ERVB4_5, internal sequence. 
XX KW Endogenous Retrovirus; Transposable Element; gag domain; KW pol domain; endogeneous betaretrovirus; pro domain; KW RnERV-B4_AC106444; ERVB4_5-I_RN. XX OS Rattus norvegicus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Rattus. XX RN [1] RP 1-8172 RA Baillie J.G., Van de lagemaat N.L., Baust C. and Mager L.D.; RT "Multiple groups of endogeneous betaretroviruses in mice,rats and RT other mammals."; RL J. Virol 78(11), 5784-5798 (2004). XX DR Genbank; AC106444; Positions 185775 241572. XX SQ Sequence 8172 BP; 2336 A; 1967 C; 1741 G; 2127 T; 1 other; aagtggcgcc cgaacaggga ccccgaagca aggccgacta caggacgacg gagataagag 60 agctgcagcg ggaatcgagg actcttcagg ccgtatagct cgtgtgtcgc tggagaggga 120 ggtaaggacc ggttgaataa ttgtcgtcta attgccatgg ggaaggtatt gtctaaggag 180 gcttgtttca taagagaaat caagcgttta ctcagggaga gagcaataag agttaagaaa 240 aaggatttaa tcaaattttt ttgtttcata gatgacaaat gtccttggtt aattttaagt 300 ggccccgata ttcaccccca tacttggaat aaggttgatc aggagattaa taggcttttg 360 aaagccggcg acccagtgtc cgcggctttt ttcagttact ggggtatcat tagagacatt 420 ataatagatg ctgaggaagg aggcgagagt gcccacctcc tagctgtcgc agaggacttc 480 ttacaagcct ccgggccggc cggcttcacc taacaatgaa aaagaaaaag gggagccgca 540 ttctccctgc ccttccattg ttattgatat gcctgggggt gagggaaatc cctcatccaa 600 agccttatgc tctctagaag accccaagat gtccaaaaat tcaaagacca tttatccttc 660 tcttaaagaa ttttctggga aaaacaaatg ctctgagtca ggtctagatc ccgcctctga 720 ggccaaatta gaggaggaag ctgccaggta tgagaaagag agatatggtc ccgagccaca 780 ggagactcag atcttcttca ctcggaggtc tgggcctgtc tgcctcccac cttatctacc 840 ccctatatcg tcagccccac ctgcacctct accggtcgca gtagtggagg aactgactca 900 gactaagact gcacttcaag cacaaattac tgggcttaaa cagatcttac acctacagca 960 ggaattgggg gatcttacac tagaaatcca ggatttacaa gagaccatga ctaagggaca 1020 gtatgcccaa aaagggccag ctaaggccag tagaaaaccc caaccaaaac taactttccc 1080 agttatgact agggcacgcg ccagacaggc cagggatgag 
tctgagttaa acagtactag 1140 aacactaaat aggtcggcca gagacaagcg cgagccagac agtactaatc gtaaccgtaa 1200 agaggaacaa gaaagtgaag aagaaaatga ctatgaggag ggaccctcag attcggaaga 1260 ggaaagccct gatcaggatg agggtgggac ctcgtgtccc gtatacaaga aacttaaact 1320 caaacacata aaggaattac attcagctgt taaaaattat ggtatgaatg cccgctttac 1380 cttggctatc ctagaaggac tttcggggtc tgggcatttg acccctagtg aatggaccaa 1440 ggtagtacaa tctgtcctca ctagaggaca atatttgact tggaaatctg aatttattga 1500 taaagcagag agccttgctg ctagaaacag aaacaatccc ttaagtaaat ctgcttcctg 1560 gacggcagat aagctatgtg ggaggggctc atttgcctct gaggaaaaac agttggggct 1620 gtctcccggt gtactcgcac aaacggcgca ggcggctttg tccgcttgga gagcagtccc 1680 cgctacaggg gcgctcacca cccccttaac taaaataatc cagggggcac aagagcccta 1740 tgctcaattt gtagggagac tacaagaagc agctgaaaga atcctaggtc aaaatgaaag 1800 tgaagggctt ctcgttagac agcttgctct cgaaaatgct aactcggtgt ctagagccgc 1860 acttcgaggc aagaccaaac aaagaccttg atatcccagg catgattaaa ctttgcaagg 1920 atgtcgactc tttttctcac caagtatcta aatctatcag cctggcaatt ggggctgtgt 1980 tccaaagagc tggagattct ggtcacccta gaggcagaac atgtttcaaa tgtggacaac 2040 ccggtcattt tgctagagaa tgcaacatcc caaaacctag ccagtccccc agccctgctg 2100 gatggcccac tgccccgagg gtttgtcccc gctgcaggaa aggccgtcac tgggccaggg 2160 aatgtcagtc taagactgac attagtggac aaaccctccc tgcggtccag ggaaacgagt 2220 ggggggcccc acaatggggc ccctatccan cggtgaccaa ccgcccccaa gtgcagcatc 2280 aagctcctgc aacgccaact tatcccgggc aacagcagga agtgcaggag tggacctgtg 2340 ttcctccacc accaggatta taacccctga ggaagggact gttatcataa atctggaaag 2400 tttgggcccc cgccgcctgg catgtttttc ttaataattg ggcgagcctc gagtgttctt 2460 caaggacttg tcatccttcc ctctgtaata gatgctgatt attctggaga aataaaattt 2520 ctagccactg ccacgcaggg ccccttaacg cttagagccg gacaacgcat tgcacaggcc 2580 ttaacgcttc ccttcatggg gcagttccct cataaagtaa aggggcgtgg ttcttcctcc 2640 cctggctctt cagatgttta ttgggtgcaa aaattaactg atgaaagacc catgatgtct 2700 ctttggcttg atggcaaaca atttcagggc cttttagata caggggcgga tacaacagtg 2760 ctctcctcta gacactggcc ctcaacctgg ccccttaagg ctactgccac 
acatttaaaa 2820 gggataggtc aaacacagga tacactccaa agttcaaagc tcctgacctg gaaagataaa 2880 gagaataata caggcacagt taggccattt gtagtttctg gactccctgt taacctctgg 2940 ggaagggata ttctttccca gatgggggtg atgatgtgta gtcccaatga ggtcattact 3000 agccaaatgt taaggacagg attcctccct ggaaaaggat taggtagaaa tgaacaggga 3060 atcacagaac ccttgactcc cacgcccaaa acagatagag gagttttagg agcagacctt 3120 ttttcataga gaccgctgtt cctcctgcac tccaggctga taaaatatct tggaagtcaa 3180 atgatccagt ctggatcgat cagtggtcca tgcctcaaga aaaggttcaa gcggccctat 3240 agttggtgca ggaacaatta ttacaagggc accttgaaca gtccacttcg ccttggaaca 3300 cccctatttt tgtcataaaa aagaaaaatg gttcttggag attgttacaa gaccttagag 3360 cggttaataa aactatggtc cctatggggg ccctacaacc tggcttgcct tctccaatag 3420 ctattcctag agacttttat aaaataatca ttgatataaa agattgtttc ttttctattc 3480 cccttcatcc ggaagattgt gcccgcttgc cttttcctat ccctgtcatt aaccatgtgg 3540 gccctaatcc ctgctttcag tggcgagtgt tccacagggg atggctaata gccccaccct 3600 atgtcgaagg tatgtggccc aaactataga tccaataagg ctgcgtttcc cctctgctta 3660 catcattcat tacatagatg atatcttggt ttcctctgct tgtttacagg aaactcaaaa 3720 actagcccaa attattgttc tggccttaca aaaaaggggt ttcaccattg cccctgaaaa 3780 aattcagacg caatatcctt ttttgtttct tggattccaa ttagaaccta tatccattta 3840 ttctcaaaag ctaacaatta gaagatcgca gctacatacc ttaaatgatt ttcaaaaact 3900 tttgggtgat attaattggc ttagacctta ccttaagcta accactggtg atttaaaacc 3960 tctatttgca attctgaagg gtagccctga tcctaattcc attagggttc tgactcccga 4020 ggcagcccct ccaagacact gtatctgatc cttttcgcta caaggtttac acccataggc 4080 ctcctatggc aagataatca tccactaatg tggagccacc tgcccgcaac ccccccctaa 4140 aatacttcct acttatccct ccttaatttg ccagacaatt ttcttggggc ttaagctggc 4200 tactcgccat tttccctgcg atcccgatat tattatttct ccatattcac aggagcagct 4260 tgcttggctg cagcacagac atgatgactg gacagtcctg ctgtctattt atcagggaac 4320 atttgatact catttaccag gagataaatt attacagttc ctccatgtta ccccctttgt 4380 cttcccaaag gttacacagc tgaaacctat accaaatgcc ttaactgtct tcatagatgg 4440 atctaagaat ggaaaggcct cctttgttgt taaagatcat gttttttttt gtccataccc 
4500 catatgcttc tgcccaatta gtagaacttt atagtgcttt agaaatattt aaattaataa 4560 atcaatcatt taaccttttt tctgatagcc attatgtagt cagagctctc cgagtcttgg 4620 agactgtctc tactatacaa ccctcaacac atacctttaa actattttca gaaatacaaa 4680 aacacataag ggcccggcca aacccattct ttgtgggcca tatcagagcc cactcgaatc 4740 tgcccggtcc tttaaccaaa ggtaatgatt tagctgacag ggccactaga ctgacaatga 4800 ttgcttctct tagtgattcc ttacaagaag cacagatggc acattcactc catcacctta 4860 atgctcaaac attgtggctt aagtacaaaa ttactagaga acaagccaga caaattgtca 4920 aaaattgtaa aaactgcctg accctacttc ctgagccaca cattggggtt aatcccaggg 4980 gccttatccc gggagaaata tggcaaatgg atgtcaccca tgtcccatct tttggaaaac 5040 taaaatttgt acatgttacc attgatacct ttagtgggtt catatgtgcc tctgcacaca 5100 caggagaggc cacaaaagat gttattgctc atatgctcta cacattttct gttatgggac 5160 aacccaaaat gctaaaaacg gataatgggc ctggctatgt tagtcataaa tttaaacaat 5220 tttgctccca atttcaaatg aaacatatta caggcatacc ttataagcct cagggacaag 5280 gaattgtgga acgagcacat caaaccctta agaatacact gttaaagctt actgcccaag 5340 aaactcttta ttctcttaaa ggcagcgcaa aatttttatt atcccatgcc ctttttgttt 5400 tgaatttctt aactttggat aatcaagggc gttcagctgc cgatcgctta tggcacccct 5460 caaccaagat ggatcacgct cacgttctct ggaaggaccc cctaactgga cagtggcagg 5520 gtcctgaccc tgtgatagtc tggggaaaag ggtccgcctg tatttataac tctaaagagg 5580 ggggagccag atggctccct gaaagattaa taaagcttta caataaattt caaacaaatt 5640 ccaagattga taaagcctta taacaaattc cagggtggcg cctgagaaaa atgcatgttt 5700 tctttttcag agcaatgaag ttctactgga ttctactgaa acttctcttt ctaactgtgc 5760 taacatccgg aacatcaagc ccctatcaga cttggaatta cacctgggta attataaatg 5820 aggccggaga taccgccttt accacctccc acactggccc taaccccacc tgtccacaat 5880 tagctccaga tttttgtaaa ctagctgcag gaggaaattc ttactggggc ctcgcagaca 5940 aatacttacc cctaattgaa gcccctcatg ggtctagttt taatgaaaga tatgtcggtt 6000 gtgacactgt ccatcgacgc accaacatta gggaaactga cttttatgtc tgtcccgatc 6060 ccacagagat cgctccctaa actataagtg tggatacagg gatcaattct attgtgcctc 6120 ctgggggtgt gaaaccacag gagacgcata ctggaaacct acctcctctt gggattatat 6180 
cacagtaaaa aaggggtggc acaattctaa cagaaacagt actctaaccc aggaatgccg 6240 gaacactgag ccttccaagg gctggtgcaa ccctcttaca ataactttta ctgatgctgg 6300 gaaaaagatc accccagaaa actggcatag ggggtttgag tggggtctcc ggatgtatgt 6360 tgccaataga gaccccggag taaccttcaa aattagatta tttaaaacca cccccaactt 6420 acataaggcc tccattgggc ctaatcctca attgcatagc cagggttcct tgccgcccag 6480 ggtggtgact cagcccccct ttaacacctc tagtcctcca actacccttt ttcagcccac 6540 catacctgtt gggccaatgt cccccacaag cctgatttta tctgttctta acgcctcagc 6600 tcgtgccttg gtatatcatg aacaggaaac aaatacctct atttatgagg aatgctggat 6660 gtgtttttca gctaaccctc ctttctatga gggtatagcc acttttggca acatcacata 6720 tactaataac actggtggtc tctcttggaa caccatagaa ctcactatca cagaagtctc 6780 agggatagga agctgcctcc tcgggaagaa catgctcctc cctaaacaat tattagaaat 6840 atgtaatcat accataatcg ttgacaatca aaatacatat ttacaagccc ccaagtatac 6900 ttatctagca tgttccactg gtctaaccac atatgttata acatctcaat ttttaaagac 6960 taaagattac tgtgtgctgg ttcaattatt tcccagactc agcatacatg agccagagac 7020 ttttcttaaa tcctgggaaa agggctcgga tatgcctcac aaggtaaaaa gagaaccagt 7080 aactgccatc actctcaccg ttctcctagg tctaggggcc gcgggtgcag gcactggcat 7140 tgcttcctcg ttacttccca tcaatactat aaccagctta gtgaagccat agataaggat 7200 attgctgaat taagagatgg attagctaac ttaaaagact cagttacctc gttgtctgaa 7260 gtagtcttac aaaatagaag aggcctagac ttaatcttcc tacaacaagg aggactttgt 7320 gcagccttaa aagaggaatg ctgtgtttat gtagataaga ctggacttgt agaagacagt 7380 ctaaaaaagg tgagagatag cctagaaaaa cgaaggagag agagagaaca acaagagtct 7440 tggtatcaaa actggctttc tacatcccca tggctctcta ccttgttgcc ctcaattttg 7500 ggaccccttg tgggcctact tttgttgatt tcctttggtc cttgggcctt ccaacgactt 7560 acatgcttta ttaaatcaca aatagattct gctttgccta gaaattctgt ttcagtacat 7620 tatcatcgct tggagaccgg aacagctgag gagaaccaga gacaacgaac aagatgaggt 7680 gcctacaact tatcgggaga ggctcaattt ctacaatctc ttatattaat gccatctggc 7740 ccttgctctt tcgcccctgg gctcaagtga gaggaatgag aagatgacct gatgtagcta 7800 acctcaggga cgggcaactt cctctgaccc ttgaccaatc taagacagga accttggggg 7860 cgacaaggtg 
atcctatgac agataaggct ggggtttgag aggtcgatcc taagacagag 7920 gtggctacac cccgtttaaa cgtgcgggcc ggtcaaatgc tcctctgccc agtttctccc 7980 ccaataacac acacgtatag cccagaacgg tgtgttacag cataagttca ttcctagcca 8040 ttgaggtgca gtcctaactg caagagtggc ttgccatggc cgagctgggc actctgtgag 8100 gcatgtctga tctctctcag accactactt ccacctaatg tttttttttt ttttaaagaa 8160 aaatggggga ga 8172 // ID ERV1A-CPo_LTR repbase; DNA; ROD; 659 BP. XX AC . XX DT 19-JUN-2009 (Rel. 14.07, Created) DT 19-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Long terminal repeat of ERV1 endogenous retrovirus: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; ERV1A-CPo_LTR. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-659 RA Jurka J.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1541-1541 (2009). XX DR [1] (Consensus) XX CC ~95% identical to consensus. XX SQ Sequence 659 BP; 170 A; 197 C; 145 G; 147 T; 0 other; tgtgagagtc gacccccctg ctacaacccc ctccccagag cggggaatga aagggttaac 60 tccctaactc cccagagcgg ggaatgaaag ggttaactcc ctaactcccc agagcgggga 120 ataaaagggt taactcccta actccccaga gcggggaatg agagagttga ctccccgttt 180 gtaaccccct ccccgagcta tcagaacaga gcagtaaaga atgtttcact ccctaaacct 240 ttcccaggag ataatcagag cagggcagga aggaatgtct ctgctagtca ttaaccacta 300 gatgttctgt tctctgtaaa gatagagata agtaaaagaa tgtactgttt aaacaggacc 360 ggggtgcacc tggcactcta aggtcaaccc aagaccctcg gtctccatgg ttactcagcc 420 ccagacccct cgaatcagca ttagccaagt cctgcgccgt tcaaaattaa ccaatgcgat 480 ttgcttctgt aaacttgctt gcctcccgct tgtaccctta aaaaccctac acaaattccc 540 ctcggggccc ctccgcactc gtttgcctgg gggaccccat gcgcatggaa ataaactttc 600 ttttcctcac caagagctga gtccttgggg tctctttccc tgcggcgtcg gccctaaca 659 // ID L1M3_5 repbase; DNA; ROD; 3806 BP. XX AC . XX DT 07-OCT-1998 (Rel. 3.1, Created) DT 07-OCT-1998 (Rel. 
3.1, Last updated, Version 1) XX DE L1M3/4 LINE1 repetitive element 5' end - a consensus. XX KW L1 repeat; MER43; L1-43_5; L1M3_5. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1719-1894 RA Rubin M.C., Leeflang P.E., Rinehart P.F. and Schmid W.C.; RT "Paucity of novel short interspersed repetitive element (SINE) RT families in human DNA and isolation of a novel MER repeat."; RL Genomics 18, 322-328 (1993). XX RN [2] RP 1631-1894 RA Smit F.A.; RT "Structure and evolution of mammalian interspersed repeats."; RL Ph.D. thesis (USC 1995). XX RN [3] RP 1-2253 RA V V. and Jurka J.; RT "L1M3_5."; RL Direct Submission to Repbase Update (1996). XX RN [4] RP 1-3806 RA Smit F.A.; RT "L1M3_5."; RL Direct Submission to Repbase Update (1996). XX DR [4] (Consensus) XX CC 5' end of L1, probably of the L1MB1-8 subfamilies. CC Assigned to rodents by J. Jurka, October 07, 1998. XX SQ Sequence 3806 BP; 1296 A; 787 C; 873 G; 729 T; 121 other; agagcaagat ggcagaacag aatgctcnaa caattgcccc catyctryyc cnccacagga 60 acaccaaatt gaacaactat ctacacaaaa wagcaccttc ataagaacca aaaatcannt 120 aagcgatcac agtacctggt tttaacttca tatnactgaa agaggcactg aagagggtag 180 gaaagacagt cttgaatcgc caacgccgcc cctcccccat cccccggcag tggccgcacg 240 gcacggagag agaatccgcg cacttgggng agggagagcg cagcgattgt gggactttgc 300 gttggaantc agtgctgccc tgtcanagcg gaaagcaaca ncgggcagaa ctcagccggc 360 gctcatagag ggancattta gaccagccct agncagaggg gaatcaccca tcccagcggt 420 cggaacctaa gttccggcaa gcctcgccac cgtgggctaa agtgcnntag ggtcctaaat 480 aaacttgaaa ggcagtctag gccacaagga ctgcaattcc tgggcaagtc ctggtgctgt 540 gccgagctaa gagtcagtgg acttaggggg cacacgacct agtgagacac yagccggggt 600 ggccaaggga ntgcttgcgc cacccyctcc cycaacncca ggcagcacaa ctcgcagctc 660 cgggagagac tccttccctc cgcttnagga gaggagaggg aagagtaaag aggactttnt 720 cttgcaantc ggataccagc tcagccatag taggataggg caccgagcag agtnnwgagg 780 ycccnattct aggccctagc tcccggatga catttctaaa cacaccctgg gccagaaggg 840 aanctgctgc cttaaaggga 
agganccagt cctagcagga ntcatcacct gctgactaaa 900 gagcccttgg gccctgaata atcagcagcg atacccaggt tagyactcgc cgtaggcctt 960 gggtgagant ctgagacgtg ctggnttcag gtgtgacnca gcacattccc agctgtggtg 1020 gctacgggga gagactcctt ctgcttgaga aaaggagagg gaanagtaaa ggggactttg 1080 tcttgcagct tagntaccag cttggccaca gtggagtaga gcaccaagcg ggctcttagg 1140 gtccccgatt ccaggccttg gctgttggat ggcatttctg gacctgccct gggccagagg 1200 agagcccact gccctgaagg gagagtctca ggcctggcag cattcaccgc aagctgacag 1260 aagagtcctt gggctttaag tgaacattkg cgrtagycag gcagtacttn ctgtgggcct 1320 gcggcggtgg tngccatagg gagagnctcc tctgcttgtn gaaaggggag ggaagagtgg 1380 gaagaacttt gtcttgtggc ttgagtgcca gctcagccgc agtagaacag agcaccgggt 1440 agatttctaa ggtttccgac tccaggccct ggctcctgga tagcatcyct ggacgtgccc 1500 ggggccagag agaactcacc accctgaagg gaaggataca agnctggctg gctttaccac 1560 ctgctgattg tagagtcccg gggccttgag cgaacataag cagcggccag gcagtggtta 1620 ctgcgggcct tgggcgagac ccagtgctgt gctggcttca ggtctgaccy agtacagtcy 1680 cagtgrtggt ggccacaggg gtgcttgtgt cacccctccy ccagctccag gcagctcagc 1740 acagagagag agacngagtt tgtttgrggg aaagtaargg aagagaacaa gagtctctgc 1800 ctggtaatcc agrgaattct tccagatctt atccaagacc acnaaggcag tacctctacg 1860 agtctgcaag agccacagta ttactgggct tggggtgccc cctaatgcag atacggccgc 1920 agtgacaaaa aacttagatc acaacacyca agtcccttca aatacctgga aagcyttccc 1980 aagaaggacg ggtacaaaca agcccagact gtgaagacta caataaatac ctaactcttc 2040 aatgcccaga cacagacaaa catctacaag catcaacacc atccaggaaa acatgacctc 2100 accaaatgaa ctaaataagg caccagggac caatcccgga garacagaga tatgtgacct 2160 ttcagacaga gaattcaaaa tagctgtttt gaggaaactc aangaaattc aagataacac 2220 agasaaggaa ttcagaatyc tatcagataa atttaacaaa gagattgaaa taattaaaaa 2280 gaatcaagca gaaattctgg agctgaaaaa tgcaattgac atactgaaga atgcatcaga 2340 gtytcttaac agcagaattg atcargcaga agaaagaatt agtgagcttg aagacaggct 2400 atttgaaaat acgcngycag aggagacaaa agaaaaaaga aaanaatgaa gcatgcctac 2460 aagatctaga aaatagcctc aaaagggcaa atctaagagt tagtgacctt aaanaggagg 2520 tagagagaga gataggggtr aaagtttatt 
caaagcnata ataacagaga atgtcccaaa 2580 cctagagaaa gatatcaata ttcaagtaaa agaagnttat agaataccaa gcanatttaa 2640 cncaaagaag actacctcaa gacatttawt aatcaaactc ccaaaggtca aggataaaga 2700 aaggatccta aaagcagcaa gagaaaagaa acaaataaca tacaatggag cnccgatata 2760 gtctggcagc agacttntcg gcggaaanct tacaggccag gagagagcgg catgacatat 2820 ttaaagtgct gaagaaaaaa aaaaaaactt tnatcctaga ntagcgtntc cagngaaaat 2880 atccttcaaa catgaargag aaataaagac tttcccagac aaacaaaagc tgagggattt 2940 cntcaacacc agacctgtcc tacaagaaat gctanagnga attcttcaat gtgaaagaaa 3000 aggacgttaa tgannaataa naaatyatct gnaggtrcaa aactcactga taatagtgcg 3060 cagaaaaaca naatattata acantgtaat tatggtgtat aaactactct tragtagaaa 3120 gactaaaaga tgaaccaatc aaaaatanta actanaacaa cttttcaaga catagacagt 3180 acnntaagat anaaatagaa acaacaaaaa gtttaaaaat tggnggacga agttaaaatg 3240 tagagttttt attagttttc ttnttgttnn ttgrtttttt tnaanttatg taatnagtgt 3300 tattatcagt ttaaaatata ggatattatt tgcaagcctc atggtaacct caaatcnaaa 3360 aacatacaac ggatacacaa aaataaaaag caagaaatta aaacatatca ccagagaaaa 3420 tcaccttcac taaaaggaag acaggaagga aggaaagaan aaagagaaga ccacaaaaca 3480 accagaaaac aaataasaaa atggcaggag naagtcatta cttatcaata ataacattga 3540 atgtaaatgg actaaactct ccaatcaaaa gacatagagt ggctgaatgg atataaaaat 3600 aagacccaat gatcggtcgc ctanawgwaa cacacttcac ctataaagac acakatagac 3660 tgaaaataaa gggatgaaaa aagatatttc atgcnaatgg aaaccaaaaa agagnaggag 3720 tagctatact tayatcagan aaaatatatt tcaagacgaa aactataaga agagacaaag 3780 aaggtcacta tataatgata aaggag 3806 // ID RLTR47_MM repbase; DNA; ROD; 503 BP. XX AC . XX DT 05-FEB-2004 (Rel. 9.01, Created) DT 24-AUG-2008 (Rel. 9.01, Last updated, Version 2) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; RLTR47_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-503 RA Pavlicek A. 
and Jurka J.; RT "RLTR47_MM - a family of LTR retrotransposons."; RL Repbase Reports 4(1), 29-29 (2004). XX RN [2] RP 1-503 RA Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (24-AUG-2008). XX DR [1] (Consensus) XX CC Individual copies are ~92% identical to the consensus. Distantly CC related to LTRIS3 (60% identity). XX SQ Sequence 503 BP; 129 A; 135 C; 110 G; 128 T; 1 other; tgagggaccc taagggcttt ttcccagtcc cacaccctcc ttacagcttc cccctcccgc 60 ctggggtcaa ggctaggcca agttccattc tccacccaca ggaacattct tgagtaaaaa 120 attctcagaa aaaccgcaga atgtactgac tgatagtcac ctgaccctct ggaaagtccc 180 aggtagagtt caaatgcgtg tcatgatctg cccatgcttg ctagccaata gatttaaagg 240 tcaatatgct tagccaataa gtttgaactg taaccttgct gatgtaacct gtgcccctaa 300 aaagtataaa aactgcttgt aatagccatt catggttggt cgccttctag tcactcgcct 360 tgagggacta attgaaggtc gacccygatg tgccaaacgc gccagaaaat aaacctcttg 420 cttttgcatc gatctgcgtc tcggtgtctc actcgggggc gtctcgaagt aagtaccact 480 gaccgagggt caggggtctt aca 503 // ID RMER19 repbase; DNA; ROD; 738 BP. XX AC . XX DT 26-SEP-1997 (Rel. 2.08, Created) DT 26-SEP-1997 (Rel. 2.08, Last updated, Version 1) XX DE Long terminal repeat of retrovirus-like element. XX KW Endogenous Retrovirus; Transposable Element; KW putative long terminal repeat; RMER19. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-738 RA Smit A.F.; RT "RMER19."; RL Direct Submission to Repbase Update (30-NOV-1996). XX DR [1] (Consensus) XX CC Putative LTR. 6 bp duplication sites. Orientation unclear. 
XX SQ Sequence 738 BP; 200 A; 190 C; 160 G; 174 T; 14 other; tgtaggctta gatttttcga aacctanttt taataagggt tcttaaacgc ttcaacctct 60 cttctagccc accacccacc agaggtagtg gaagagaaag gttattagga tatgggggwa 120 gtggacctgc ttagcaatag ttctttaggg gtgagcccaa tcttcattgk cagcagttca 180 gtcccgtagc aaacaccaaa tatgactcag cagctgcaga tcagtcctct aggcaggcag 240 acaccaggca cgaaccagca gctgcagtcc agtcctctcg gcaagcatga actagcagtt 300 gcagttcgat ccwgaagaaa ccgcaaggct ctgccaattg gcstaagtcc gcggaagcag 360 caaagaagct gcaggaacmt cacaagcagt tctttggcga gtttctctct atgcggtagc 420 gtcaccgcaa gtkgagccca acaacgctat gcaaggcgaa ccaatacatg ggtgtcgtta 480 gcaaagaata gcgaggcaga gcaaaccaaa gctcagtgct cgtctcccac tgtctgtggg 540 gtcatattta tactccttcc aaacatcawg cgtcctttca cgtgtctgct mtaacmaaac 600 atcctytcac ctgtgtctgc ttcaggaaaa cagtctttca cgtgtttgct tcagcaagac 660 atcctttcac ctgtgtgccc cggcaaaaca tcatttgaca taactgastt tccaaagaaa 720 cyagaarttt ccacttca 738 // ID RLTR20A2_MM repbase; DNA; ROD; 507 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; KW RLTR20A2_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-507 RA Pavlicek A. and Jurka J.; RT "RLTR20A2_MM - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Individual copies are ~87% identical to the CC consensus. 6 bp TSDs. 
XX SQ Sequence 507 BP; 146 A; 89 C; 162 G; 110 T; 0 other; tgttggggat tggttctaat gctttgaatt aatccggccc ccaaaatcgg gaatctgcgt 60 gtccaaacgc tgaaggtcct tgtccccaat tggtttttga tcaatcaata aagagccaat 120 ggccaatggc tgggcagata gactgagggc aggaccttta gatttgcgtg ggctaggaac 180 tgggagagag gaaggaaggg agaatcgcca tgattcggag ggagacggat cagatttaga 240 gctgcagaag gaaaatcatc cgaaatgtag gtgagaagga atgcggcccc cggaagggct 300 gcccagaagc gtcttgggca gcaaagacta gggagccgcc cagaaggagc cagggcaaca 360 aagataaaat atagaattag agggtgttaa gccaggagta caggagggaa atgtgtgcta 420 gccatgggga ggattagaac tgcccagcct ttgagctagt caaggcatat ttaaaattaa 480 ctggtgtgtg tgtgtgtgtg tgtttca 507 // ID TIGGER2 repbase; DNA; ROD; 2708 BP. XX AC . XX DT 19-FEB-1997 (Rel. 5, Created) DT 19-FEB-1997 (Rel. 5, Last updated, Version 1) XX DE Autonomous DNA transposon. XX KW Repetitive sequence; Tigger2. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-2708 RA Smit F.A. and Riggs D.A.; RT "Tiggers and other DNA transposon fossils in the human genome."; RL Proc. Natl. Acad. Sci. USA 93, 1443-1448 (1996). XX DR [1] (Consensus) XX CC 24 bp terminal inverted repeats, TA target site CC Includes MER28 (bases 1 to 59; 2333 to 2708). 
XX SQ Sequence 2708 BP; 884 A; 511 C; 555 G; 701 T; 57 other; cagttgaccc ttgaacaaca cgggtttgaa ctgcgcgggt ccacttatac gtgrattttt 60 tycaataaat atattggaaa aawttytgga gatttgcaac aatttgaaaa aactcgcaga 120 cgaaccgcgt agcctagaaa tattgaaaaa attaagaaaa aggtatgtca tgaatgtata 180 aaatatatgt agatactagt ctattttatc atttactacc ataaaataya cacaaancta 240 ttataaaaag ttaaaattta tcaaaactta yayayayact tacagactgt acgtggtacc 300 attcgaagtn gagagaaata taaacaaacn taaagatgca gtattaaatc acaactgcgt 360 aaaattaact gtagtacata ctatactacc gtaataattt tgtagccacc tcctgttgct 420 attgcagcga gctcaagtgt tgtgagtacc tgcttaaaac gctskgtgat gctaatcatc 480 tccgcgcgag cagttyatct ctccagtaaa ttgcgtattg cagtaaaaag tgatctctca 540 cggttctcgc gtatttttca ttgtgtttar tgcaatattg taaaccttga ataacaccat 600 gggacccata cgaagtgcca ctagtgatgc tggaagtgct cccaagaagc agagaaaagt 660 catgacatta caagaaaaag ttgaattgct tgatatgtac catagattga ggtctgcagc 720 tggggttgcc tgccatttca agayagatga atccggcgta aggaccgttg tmaaaaaaga 780 aaaggaaatt cgtgaaracc gtcgctgcag ttatgcccag caggcacaaa aacttgtnac 840 tttttgcgaa ataccttttt atgttgtatt gaaaatgcag cttttntgtg ggtgcaggat 900 tgctataaga aaggmatacc tanagactct aatatgatta gagaaaaagc gaagtcatta 960 tgtgannact taaagcaaaa rraagatgam rgatctaaag ctgganaatt taatgccagc 1020 aaakgatggt ttgacaattt tagaaagarg twtggcttaa aaaatktcaa gakaacagga 1080 gaagcasytt ctgctgacca agagacagca gacgagttnc cagatgccat taacaaaatc 1140 attgaggaga aaggatatct gcctgaacag gtttttaatg cagatgarag tgccctattc 1200 tggggggaaa aaatgccaca aagracattt attagtaagn aagagaagcg agcaccagga 1260 tttaaggcar gaagggatag gctaactcta ctgttttgtg caaatgcagt taggtttatg 1320 atcaggactg cccttatcta taaagctgct aacccccgag ccttgaaggg aaaagataaa 1380 cgccagctgc cagtcttttg gttgtacaac aagaaggcct ggacaacgag aacccttttt 1440 ctggattggt tccattkatg ctttgtccct gawgtcagga agtaccttgc cagtaagrga 1500 ctgcctttta aagttctttt gatattggac aatgcccytg gccacccaga accccatgag 1560 ttcaacaccg aaggtgtcga agtggtctac ttgcccccaa acacaacgtc tctaattcag 1620 cctctagatc agggggtcat aaggaccttt 
aaggctcatt acacacggta ctctatggaa 1680 aggattgtca acgctatgga agagaacccc gatagaraga acatcatgaa agtctggaag 1740 gattacacca ttgaagatgc cattgttrtt atagaaaaag ccgtgaaagc catcaagccc 1800 aaaacagtaa attcctgctg gagaaaactg tgtccagatg ttgtacatga cttcacagga 1860 tttacgacag agccaatcaa ggaaatcatg aaagagattg tggatatggc aaaaaagttg 1920 gggggtaaag ggtttcaaga tgcgaatctt ggagaaattc aagagcwaat agacaccaca 1980 ccagaggaat taacagaaga cgacttgatg gagatgagtg cttccgaacc agtgccagac 2040 gatgaggaag aagacatnga agaagcagtg ccagaaaaca aattgacatt agacaatctr 2100 ggcagaaggg ttccaattat tcaagactgc ttttgacttc ttttacaaca tggacccttc 2160 tatgatacgg gcactgaaac taaagcaaac agtggaagaa ggattggtay tatacagaaa 2220 catttttaga gaaatgaaaa ggcaaaaayr tcagacagaa attacgatgt atttccgtaa 2280 agttacaccg agtgtgcctg cctctcctgc ctccccttcc acctcctcca cctcttccac 2340 ctctgccacc ctgagacagc aagaccaacc cctcctcctc ctcagcctwc tcaatgtgaa 2400 gacgacgagg atgaagacct ttatgatgat ccacttccac ttaatgaata gtaaatatat 2460 tttctcttcc ttatgatttt cttaataaca ttttcttttc tctagcttac tttattgtaa 2520 gaatacggta tataatacat ataacataca aaatatgtgt taatcgactg tttatgttat 2580 cggtaaggct tccagtcaac agtaggctat tagtagttaa gttttkgggg agtcaaaagt 2640 tatacgtgga tttttnactg cgcggggggt cagcgcccct aacccccgcg ttgttcaagg 2700 gtcaactg 2708 // ID RMER17D_MM repbase; DNA; ROD; 923 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 14-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW Non-LTR Retrotransposon; Transposable Element; KW Long terminal repeat; retrotransposon; RMER17A; RMER17B; KW RMER17D_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-923 RA Jurka J. and Drazkiewicz A.; RT "RMER17D_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 21-21 (2002). 
XX DR [1] (Consensus) XX CC Related to RMER17A (86%, bases 13-767) and RMER17B (77%, bases CC 152-909). Similar to LTR21_MM (71%, bases 1-214). XX SQ Sequence 923 BP; 145 A; 334 C; 180 G; 264 T; 0 other; tgtggggcat tgggctatgc acagacaggc tggtctccag ttgagctgag gtctgaaccc 60 cggtgggacc cggtggtgat aattcaccta catgacacgg taggcgttcc ctcatgtctc 120 ctggaactct ggctcctgcc taagttaccg ccccccacag cccccacaag agaagcatgg 180 ttagtagtca cgtaggcaat gtcccaagct tctgaccttc aggctaaact cctccccagt 240 tacctagcaa cagtaaagta aagcccacca taaaaggggc tgtttagccc ccacctcgct 300 ctcttacttc tctctctctc actctctctc tctctcctct ctctctcctc tctctctcct 360 ctctctcctc tctctctctc ttctcctctc ctctctcctc tctctctctc tcgctttctc 420 tctctctctc ttggcctctc tctctctctc tctctcttct accttctctc tttccccctg 480 cctttctata ataaagctct aaaaccatag actgtctctg ttcatcaagg cccactgtgc 540 ttggaggatg ggataggctt tctcctaatg agccgtttct aatctcctat tagaaggcct 600 tcctgtgctc cagtcaaggt ccgcactgac tctcgcctgt gttgggaacc tctcttccct 660 cagccctctc tcctataacc ctggtggctt tagcagtgta gccccggggc ccccaggtat 720 cgggggctgc cccttgtcca cccccccccc cgacgagtgg ggtcagtggc ttcacacact 780 tagatgccca cccagggctg agtggaaagt gtctggcagc cctcccatgt ctgcctgccc 840 agagcatagg tgaaactctg gcgggatgtg ggctctcttc cctcccccct cttcccctgg 900 gcccccccct tttttagttc cca 923 // ID RLTR11A2_LTR repbase; DNA; ROD; 527 BP. XX AC . XX DT 16-MAR-2006 (Rel. 11.03, Created) DT 10-APR-2007 (Rel. 11.03, Last updated, Version 1) XX DE Long terminal repeat of ERV2 Endogenous Retrovirus from Muridae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; RLTR11A; RLTR11A2_LTR. XX OS Muridae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea. XX RN [1] RP 1-527 RA Smit A.F.; RT "RLTR11A2_LTR - a subfamily of endogenous retroviruses from RT Muridae."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC MYSERV 16% subst. 
XX SQ Sequence 527 BP; 146 A; 92 C; 160 G; 127 T; 2 other; tgttgaggtt tggtcttttg ctatgtattg taatgctaaa tactggcccc caaggcctgg 60 ttgcccccag ggatgagaga atccgcacat agacccaagt gatgctatgt aaccttgccc 120 cccaagttat ccctgattgg tgaataaaga tgcctacagc ctatagctgg gcagaagaga 180 ggtaggcggg gtttgggttc ccgggcttgg ggtctgagga gaaccacgag gagggagagg 240 aggnggagag agagggaaga cgccatgggg taggtgagtc atgaaaacat ggccatgagg 300 gctggccaat tggagttaag agcagcccag atggaacatg gcaagttata actcggggtt 360 attgatgggg aagtagattc taatagcnta gagggtagat atctgcccag ctctagtgct 420 gattaaggct tattataaat ataaaagttg tgtgtctttt atctgggaac tgaatgatca 480 aaggcggggt agaaaccccc gattgagatt aaatatttac tacaaca 527 // ID ERV2B-CPo_I repbase; DNA; ROD; 3256 BP. XX AC . XX DT 20-JUN-2009 (Rel. 14.07, Created) DT 20-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Internal portion of ERV2 endogenous retrovirus: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERV2B-CPo_I. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-3256 RA Jurka J.; RT "ERV2-type endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1375-1375 (2009). XX DR [1] (Consensus) XX CC >90% identical to consensus. 
XX FH Key Location/Qualifiers FT CDS join(176..1099,1103..1498,1533..3035) FT /product="ERV2B-CPo_I_1p" FT /translation="MGQALSEHELFVGGLSKALKTRGVRVRTKELLQYFQF FT IHKVCPWFPLEGSIDTKRWKRVGDALKDYYRVFGPEKVPVTTFAYWNLISE FT LLTQRHADPQIEETVAQGEAALRDSTNPTPKPSSEKSEASDLSDAEADDLL FT EVLQDGDLPPLPPVDMMDRFQDGGPPPPMPPQPIGDSYKSLYPDLSDFQEV FT DWEQLAQQASTYHPDPVAALAAIKSPPDYDDALSVCPQPSSILAELEQRKR FT DLLKQIQLEKECQNLSTQLQALRATQPYSAPKMAPASGKPHGALKMAPASG FT SQPKMVFPVTEADVPPEGAEPQVVRIRTPLTVTALKDLKTAVSQYDPTAAY FT TIAVLDSLTHGWIIPNEWKDLAKATLSRGNYLLWKSEFFYQCEGIAGDNAA FT WQNPWTLAMLTGSSRSGGTQTSQIDCGSCGQALPYRGINSKLGMQNPGPCP FT QECKSYQIEMHSSCYDAVRECTGLDNATYFTAILTKIKMAAAGGDWGLNPS FT VSGSPVYTQASCKGEINKPVCWYKRAPIHVSDGGGPQDKMRELYVEKKLKE FT IEDSLFPKINYHPLALPKSRGVDLDPQTYDILSATHHVLNLTNPALAQDCW FT LCMALGTPMPLAIPSNSTIDPSRDPHNFTCRPSQSFLVQPLITPNISMTCL FT QGSLQNDSTLIDMGHLLSTACVQFVNHTGPLCPPTGHIFVCGNNRAYTYIP FT ANWTGTCITASLLPNIDIVSGDQPVPVPTLDYIAGRQKRAVTLIPLLVGMG FT ITGAVATGTAGLAVALDKYTDLSNQLSEDVQALSFTIKTLQNQLDSLAEVV FT LQNRRGLDLLTAEQGGICLALQEKCCFYVNQSGVVRGKLQEIEERLQQRRQ FT QMAANPFSSMWGGLFPYLAPLLGPLAALLLLLSVGPCIFQRIMTLINNRID FT AFMAKPIQVHYHQLEMADQQTYREGIPTSGPYDDAA*" XX SQ Sequence 3256 BP; 885 A; 881 C; 738 G; 752 T; 0 other; agtggcgccc agcgtggggg ctcctggcac gggggagtca acgagggagc cgttcgagaa 60 cgccggctaa aagtgaaagc aaattgaatc aaggaccgag gcctgcgcgg atgaccgccc 120 gacgtgccgc ccgagtcacg gtgagtgaac ggcaacccca tcccctttcc cggacatggg 180 tcaagcattg agcgaacatg aattgttcgt gggaggcctt agcaaggcat taaagacaag 240 aggggtaagg gttagaacaa aagaattgtt gcagtatttt cagtttatcc ataaagtttg 300 cccttggttc cccctagagg gctctataga taccaagagg tggaaaagag taggagatgc 360 tctaaaagat tattataggg ttttcggtcc ggaaaaggtt ccggtgacca cctttgcata 420 ctggaattta atctctgagc tgcttaccca gcggcatgca gatcctcaga ttgaggagac 480 ggtcgcacaa ggtgaggctg ccctgcgcga ctccactaat cctaccccaa aaccgtcctc 540 agaaaaatct gaagcctctg acctctcaga cgcagaggct gatgacctgt tagaggtcct 600 ccaagatggc gatctgccac cacttcctcc tgttgacatg atggacaggt tccaagatgg 660 cggcccacca ccccccatgc ctccacagcc 
aattggggat tcatataagt ctttataccc 720 cgacctctct gattttcaag aggtcgactg ggaacaacta gcacaacaag cctctacata 780 ccaccctgac cctgttgcag cactggctgc cattaagtcg ccccctgact atgatgatgc 840 acttagcgtc tgccctcaac cttctagcat actagcagaa ttagaacaaa gaaaaaggga 900 tcttttaaaa cagatacagc tagaaaaaga atgccaaaat ctttcaactc aattgcaggc 960 cctccgagct acacagcctt atagcgctcc taagatggcg cctgcctccg gcaagcctca 1020 tggcgctctt aaaatggcgc ctgcgtccgg cagccagccc aaaatggtgt ttccagttac 1080 agaggccgat gttcccccat gagagggcgc tgaaccacaa gtggttagga tacgcacacc 1140 cctcactgtg acagctctaa aagacttaaa gactgcagtc tctcaatatg acccgactgc 1200 tgcctatacc attgcagttt tagattcttt gactcatggc tggattattc ctaatgagtg 1260 gaaagactta gcaaaagcca ccctctccag aggcaattat ctattatgga aatctgaatt 1320 tttttatcaa tgtgagggca tagccggaga taatgctgca tggcaaaacc cctggaccct 1380 tgctatgctt acaggctctt ctcggtctgg aggaacacaa accagtcaga tagactgcgg 1440 ttcttgtggc caagccctcc cctacagggg gatcaactca aaattgggaa tgcaaaacta 1500 aaccccaaat tatacccaaa caaaataact gaccagggcc ttgcccacag gagtgtaagt 1560 catatcagat agagatgcat agctcctgtt atgatgctgt acgagaatgt actggcttag 1620 ataatgccac ttattttaca gccatactta caaaaataaa aatggcagca gccggaggcg 1680 attggggcct caacccctct gtatcgggaa gccccgtata cacccaagcc tcctgcaaag 1740 gcgaaatcaa caagccagtc tgttggtata agagggctcc aatccatgtc tcagatgggg 1800 gaggtcctca agacaaaatg agagaacttt atgttgagaa aaaactaaaa gaaatagaag 1860 acagtctatt cccaaagatc aattaccacc cgttagctct ccccaaatca cggggtgtgg 1920 atttagaccc acaaacttat gatatccttt cagccactca ccatgtactt aacctcacta 1980 accctgcttt agcgcaagac tgctggttgt gcatggcctt aggcactcct atgcccttag 2040 ccatcccctc caactcaaca attgacccct ctcgtgatcc acataatttt acctgtagac 2100 cttcccaatc atttctagtc cagccattaa taacacccaa catatccatg acctgcctac 2160 agggctctct ccaaaatgat tctaccctta tagatatggg ccatctcctg tccaccgcct 2220 gtgttcaatt tgtcaatcat acaggccccc tgtgcccccc gacaggacac atttttgtct 2280 gtggtaataa tagggcctat acctatattc cagccaactg gacaggtaca tgcataactg 2340 cttccctcct gcccaacata gatattgtct caggtgacca 
acctgtcccg gtgccaacac 2400 ttgattacat tgctggccgc caaaaacgag cggtaacact gatcccgctt ttggtgggaa 2460 tgggcatcac tggcgctgta gccacgggta ctgctggcct cgccgttgcc ttagataaat 2520 atactgacct gtccaaccag ttgtctgaag atgttcaagc cttatccttt acgataaaaa 2580 ctctccaaaa ccaattagac tcactggcag aggtcgtctt acaaaacaga cgagggctag 2640 acctgctcac ggcagagcag gggggaatct gtcttgcctt acaggaaaaa tgttgcttct 2700 atgtcaatca gtccggtgtt gtgcgaggca agctccaaga aatagaagaa cgacttcaac 2760 agcgacgaca acaaatggcc gctaacccct tctcctcgat gtggggggga cttttccctt 2820 acctagcgcc ccttttgggt cccctagctg ccctgttgct tctcctctct gtgggtcctt 2880 gtatttttca gaggattatg acgctcatta ataaccggat agacgcgttt atggcaaagc 2940 ccattcaggt acactatcac cagctggaga tggcagatca acaaacctac cgggaaggaa 3000 ttcccacgag tggcccatat gacgatgcgg cctaactgcc ctggggcaat ggacaggcag 3060 ccagagacgg gcaactagga catacatgtt actccaccgg ccgcctaaga caggcacagg 3120 accaactgct gtcctgcaag cccatgacgg gtaaggcctg gttgggttca tgtaatgagg 3180 ttccttgacc taagacaggc gctgtcctca cagggcctgc catactcctg attaaataat 3240 atagaaaggg aggaga 3256 // ID ZOMBI repbase; DNA; ROD; 2806 BP. XX AC . XX DT 13-JAN-1998 (Rel. 6.4, Created) DT 28-JUN-2000 (Rel. 7.2, Last updated, Version 3) XX DE Autonomous DNA transposon; POGO superfamily. XX KW TIRs; DNA transposon; MER46; TA target; ZOMBI; TIGGER4. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 2806-2710 RA Smit F.A. and Riggs D.A.; RT "Tiggers and other DNA transposon fossils in the human genome."; RL Proc. Natl. Acad. Sci. USA 93, 1443-1448 (1996). XX RN [2] RP 2806-2710 RA Jurka J., Kapitonov V.V., Klonowski P., Walichiewicz J. RA and Smit F.A.; RT "Identification of new medium reiteration frequency repeats in RT the genomes of primates, rodentia and lagomorpha."; RL Genetica 98, 235-247 (1996). XX RN [3] RP 1-2806 RA Kapitonov V.V. and Jurka J.; RT "Jerky gene - a recruited transposon?."; RL Direct Submission to Repbase Update. 
XX RN [4] RP 1-2806 RA Smit F.A.; RT "ZOMBI."; RL Direct Submission to Repbase Update (JAN-1998). XX DR [3] (Consensus) XX CC 23 bp terminal inverted repeats and TA target site [1,2]. CC ZOMBI is an autonomous DNA transposon (its non-autonomous CC elements have been identified as ZOMBI_A (MER46 [1,2]) and CC ZOMBI_B). CC Orientation of ZOMBI has been determined based on the CC reconstruction of its internal sequence encoding transposase [3]. CC It has been shown [3] that the neurolepsy-related Jerky gene in human CC and mouse is a recruited transposase from ZOMBI. XX SQ Sequence 2806 BP; 912 A; 540 C; 600 G; 754 T; 0 other; caggttgagc atccctaatc caaaaatccg aaatctgaaa tgctccaaaa tctgaaactt 60 tttgagcgct gacatgacgc cacaagtgga aaattccaca cctgacctta tgtgacgggt 120 cacagtcaaa acgcaggtgc acaacacaca gtttattcgg cgtccccaag ggaaaaaaga 180 ccctcccagc ccccttcagc tgcggtatat cttttccgcg cacacccaga ttcccccatg 240 caagcacgcc cacaaagggt aataaaatgg cacgtgtgca ggctggacac accaacggca 300 ggttccccac aatgccccca catggggtca agacctacgt gcattactca ctgtgttttt 360 ttgcttattc tctgctctgt ggtgtaaaga tattgttgaa aatgtcaaaa aggcctgtag 420 atacccctgt gagtaacaat gataagaaaa aggaagcatt tatgtttatc tatagcacag 480 aaaagtcaag ctgttggaga aactggacag tggtgtaagt gtgaaacgtc ttacagaaga 540 gtatggtgtt ggaatgacca ccatatatga cctgaagaaa cagaaggata aactgttgaa 600 gttctatgct gaaagtgatg aacggaagtt aatgaaaaat aaaaaaacac tgcataaagc 660 taaaaatgaa gatctcgatc gtgtattgaa agagtggatc cgtcagcatc acagtgaaca 720 catgccactt aatggtacgc tgatcatgaa acaagcaaag atctgtcaca atgaactgaa 780 aattgaaggg aactgtgaat attcaacggg ctggttgcag aaatttaaga aaagacacgg 840 cattacattt ttaaagattt gtggtgataa agcatctgct gatcatgaag cagcggagaa 900 attcattgac gagtttgcca agatcatcgc tgatgaaaat ctgatgccag aacaagtcta 960 taatgctgat gaaacatcac cgttttggtg ttattgcccc agaaagacac tgactacagc 1020 tgatgagaca gcccctacag gaattaagga tgccaaggac agaataactg tgctgggatg 1080 tgctaatgca gcaggcacgc ataagtgtaa acttgctgtg ataggcaaaa gcttgcgtcc 1140 ttgctgtttt caaggagtga atttcttacc 
agtccattat tatgctaaca aaaaggcatg 1200 gatcaccagg gacatctttt ctgattggtt tcacaaacat tttgtaccag cggcttgtgc 1260 tcactgcagg gaagctggac tggatgatga ctgcaagatt ttgttattcc ttgacaactg 1320 ttctgctcat cctccagctg aaattctcat caaaaataat gtttatgcca tgtactttcc 1380 cccaaatgtg acttcattaa ttcagccatg tgaccagggt atctttagat caatgaagag 1440 taaatataaa aacactttct tgaacagcat gctagcagca gtgaacagag gcgtgggtgt 1500 ggaaggtttt caaaaggagt ttagcatgaa ggatgccgta tatgctgttg ccaacgcttg 1560 gaacacagtg actaaagaca cagttgtgca tgcctggcac aacctctggc ctgcgactgt 1620 gttcagtgat gatgatgaac caagtggtga ctttgaagga ttctgtatgt caagtgagaa 1680 aaaaatgatg tctgacctcc ttacatatgc aaaaaatata ccttcagagt ccgtcagtaa 1740 gctggaagaa gtggatatta aagacatttt taacatcgat aatgaggctc cagttgttca 1800 ttcattggaa gaagtggata tcaaagaagt cttccacatc gataaatgca ttaccagttg 1860 ttcaaccatc accggatggt ggaatagccg aaatggttct gaatcaaggt gattgtgatg 1920 atagtgatga tgaagatgat gacgttaaca ctgcagaaaa agcgcctata gatgacatgg 1980 tgaaaatgtg tgatgggctt attgaaggac tagagcagcg tgcattcata acagaacaag 2040 aaatcatgtc agtttataaa atcaaagaga gacttctaag acaaaaacca ttgttaatga 2100 ggcagatgac tccggaggaa acattttaaa aagccatcca gcagaatgcc tcctcatccc 2160 tagaggaccc acttcctggt ccctcaactg cttctgatgt ttcttctcac ttagaaaaca 2220 aaaaccaaaa agcaaaaaaa atacagtgta cagtaacctt ttaatcaaaa cacagcatcg 2280 tagatggaga ctgaaagcct gccattgttt gttgttgctg ttgtttaaca gctgatacag 2340 gtattctggt gatgctactg tgctgcttag ttaccctgaa cacatttttt tttcactgta 2400 ttaatggtat gtcatatttt ttactgttaa gtacttatgt gtgaataagt gtaagaaaat 2460 gattgcttat cggtagcata taaattcaga gtcaggaatg atggtgatgc caaacaacca 2520 cagattgtcc acatgggtgg ctgagatagt gacacctttg ctttctgatg gttcaatgta 2580 cacaaacttt gtttcatgca caaaattatt aaaaatattg tataaaatta ccttcaggct 2640 atgtgtataa ggtatatatg aaacataaat gaattttgtg tttagacttg ggtcccatcc 2700 ccaagatatc tcattatgta tatgcaaata ttccaaaatc tggaaaaaaa tccagaattc 2760 aaacacttct ggtcccaagc atttcggata agggatactc aacctg 2806 // ID MLT1CR repbase; DNA; ROD; 1375 BP. XX AC . 
XX DT 24-JUL-2000 (Rel. 5.06, Created) DT 24-JUL-2000 (Rel. 5.06, Last updated, Version 1) XX DE MLT1- LTR retrotransposon internal sequence - a consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; MaLR family; KW LTR retrotransposon; MLT1R; MLT1c subfamily; MLT1CR. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RA Smit A.F.; RT "Structure and evolution of mammalian interspersed repeats."; RL PhD dissertation, Univ Southern California, 1995. XX RN [2] RA Smit A.F.; RT "Direct submission."; RL Direct Submission to Repbase Update (31-JAN-2000). XX DR [2] (Consensus) XX CC Internal sequence consensus for MLT1A retrovirus-like element CC (MaLR). CC The ORF from pos 48 to 1154 encoded a protein regionally 34% CC identical (48%similar) to AA 480-569 the ERVL GAG protein (e.g. CC GenBank acc# CAA73250). This aids to the earlier indications [1] CC that MaLRs have been derived from ERVL-type endogenous CC retroviruses. XX SQ Sequence 1375 BP; 391 A; 230 C; 404 G; 347 T; 3 other; gattttggta ccgggaagtg gggtgctgct gtaacaaata cctaaaaatg tggaagtggc 60 tttggaactg ggtaatgggt agaggctgga agagttttga ggcacatgat agaaaaagcc 120 tagattgcct tgaagagact gttggtagaa atatggacgt taaaggtgat tctggtgagg 180 gctcagaagg aaatgaggag agctgtagag aaagcttcta tcatcttaga gaatacatat 240 atcgtcatga acagaatgtt ggtagaaata tgaacgttaa aggtgcttct ggtgaggtct 300 cagacggaaa tgaggaacat gttattgaaa actggaggaa aggtgatcct tgttataaag 360 tggcagagaa cttggctgaa ttgtgttcta gtgttttgtg gaaagtagaa cttgtaagcg 420 atgaacttgg atatttagct gaggagattt ccaagcaaag tgttgaaggt gcggcctggt 480 ttctccttgc tgcttatagt aaaatgcgag aggaaagaga taaattgagg aaggaactgt 540 taagcaaaaa ggaaccagaa cttgaagatt tggaaaattc tcagcctatc cagattgcaa 600 aagatgagaa agcatgctct ggagagaaca ccaagggtgt ggctggacaa ccatttgcta 660 aagagattag gtatgtgact catggatcca atcaaccatc tcagcagaag ccaggaatag 720 agatggggtt atccaggaag gatctgtgga ggaccctctt gtctaatggc gtggaccccc 780 atgacttgca cgggaggccg 
acaaggtttt tgagaatttt ataccagcag aaacactgcc 840 agcctggact gaaggggaca gagatgggac aaaatgaagg aaggatgact ctgagggcgg 900 agccacggat gcagaggcca tgggggctgc ggccnccatg ggccaagagc atggggtcac 960 cccagcgggc ccggaagaca gagcatcgag ccacagagga ttattctcga gccttgaaac 1020 ctaatggaat tttccctgct gggtttcgga cttgcttggg acccgtgacc cctttnttcc 1080 ttccnatttc tcccttttgg aatgggaatg tctatcctat gcctgtccca ccattgtatt 1140 ttggaagcag ataacttgtt ttctggtttc acaggtccac agatggagag gaattttgcc 1200 ccaggatgaa tcataccctg agtctcaccc atacctgatt tagatgatat ttagatgaga 1260 ttttggactt agagttgatg ctggaatggg ttaagacttt tggggatgtt gggatggggt 1320 gaatgtattt tgcatgtggg aaggacgtga attttggggg agccagaggg cagac 1375 // ID RLTR13B3 repbase; DNA; ROD; 943 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; LTR; KW RLTR13B3. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31, 51-54 (2003). XX RN [2] RP 1-943 RA Pavlicek A. and Jurka J.; RT "RLTR13B3 - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (JAN-2004). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. RLTR13B3 is ~93% identical to RLTR13B4. The CC subfamily is very young, since individual copies share 97-98% CC identity with the consensus. TSDs are 6 bp long.
XX SQ Sequence 943 BP; 236 A; 224 C; 195 G; 288 T; 0 other; tgctacgctc tctggtcgag tcaactggac aagccgagag gcttaaaagt cactcatgag 60 gcaaaagagt ttcaaggaag ctctcctcct ggagtctagc gttccctacc catacagtaa 120 tttttcattc ccgcgttcca ttcccgcgtt agtctagtgt atggatccac gtgctctctt 180 tcatatattt tcctatcata agtgcctttt aagattgaat tctgacatag ctaaagccgt 240 ccgccagtgt tccaacagtc ctagttcgtc cttagcaaga ccttgggctt ggaattcagt 300 actgcccctg gtattacact atctctttga gtaaaagtta ggtcactgcc cttcacagga 360 atttaccatc actaaggaag aaagaagttt ctgaccgtag gagaaaccag aatagatatt 420 tagaattata gctgagttac acctatctat atacaggaat ttacaatggt ttagggaagt 480 agattccggc tgtggtttta gagtattgca ttattcatga gatggttagc tagaacggaa 540 ctccctagtt agaaccatga cttgggtgtg aaagcccttg ccctaggagt gaaaagttct 600 cacacctggc ttctaagact atagctgacc ttagataggg ctgactctgt ctcagttatt 660 ttattgccca gttcctgcca gtctttgcgt tttcatttcc tctgttctgt gtaagagtca 720 ttaagcatcc ggttacctca tttgtacctt tgtgggtatg tcacccaaac taccttgttg 780 tcttctgcat aaaaagtctg atgttcgatt tgaaaaatta cattcagatt cacacacaaa 840 ctctccctgt gtgtgtgtct gtctgtcatt catccacgct ttgcccaccc gcgtcgagag 900 accccgttcc acgcagacac agggacccag aaggtctgcg gca 943 // ID B2L_S repbase; DNA; ROD; 254 BP. XX AC . XX DT 30-APR-1998 (Rel. 3.03, Created) DT 30-APR-1998 (Rel. 3.03, Last updated, Version 1) XX DE B2-like retroposon from Sciurognathi - a consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; B2; B2L_S. XX OS Sciurognathi OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia. XX RN [1] RA Serdobova V.I. and Kramerov A.D.; RT "Short retroposons of the B2 superfamily: Evolution and RT application for rodent phylogeny study."; RL Unpublished. XX RN [2] RA Kramerov A.D.; RT "B2L_S."; RL Direct Submission to Repbase Update (14-JUL-1994)D.A. Kramerov, RL Inst of Gene Biology, Russian Academy of Sciences, 34/5 Vavilov RL str., 117984 Moscow B334, RUSSIA. 
XX RN [3] RA Jurka J.; RT "B2L_S."; RL Direct Submission to Repbase Update (20-APR-1998). XX DR [3] (Consensus) XX CC Consensus obtained from the following entries: X80307 ,X80307 CC ,X80308, CC X80311, X80315, X80316, X80317, X80320, Y09600 (GenBank, CC rel.105). XX SQ Sequence 254 BP; 78 A; 56 C; 57 G; 58 T; 5 other; ggggctggag agatggctca gyggttaagg cgcttgcytg caaagcctga aggcctgggt 60 tcgattcccc agyacccacg taagccagat gcacawagtg gcacatgcat ctggagttcg 120 tttgcagtgg caggaagccc tggtgtgcct atactctctc actctctctc tgcctctctc 180 tctctcaaat aaataaataa aatattaaaa aaaataaaat aaaataaaca ggtaaataag 240 aaggaracta ctga 254 // ID Kanga1 repbase; DNA; ROD; 1898 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE Mariner DNA transposon from placental mammals. XX KW Mariner/Tc1; DNA transposon; Transposable Element; HSTC2; Kanga1; KW mariner. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1898 RA Smit A.F.; RT "Kanga1 - Mariner DNA transposon from placental mammals."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC (MER104) 25%. 
XX SQ Sequence 1898 BP; 605 A; 343 C; 416 G; 525 T; 9 other; ccatatttca tcgattctaa gatgcacatt ttttcacatt ttaacatctc tgaaatcggg 60 atgcatctta caatcgatgg catcttacaa tcgctgtcgg ccaggcggca gtcgtgacgt 120 agttgtcatt gcctgcacgt gtgcgaactt ggtcatagct gttcatattg tcatcacttc 180 aattgagtta tgtgcattgt tggtactaca cgtgttgagt ttaattgcca tttaaaatgt 240 cttcaaaaag attacactat gattcggcat tgaaatgaaa agttattgtg tacacagaaa 300 ggcacggaaa cagagcagcg gggcgtaaat ttgatattag tgaagcaaat attcgtcgtt 360 ggaggaatga ccgcaattcc atattttctt gcaaagcaac aaccaagtgc tttatgggac 420 ctaagaaagg aagataccca caagtagatg aagctgtgtt acgttttgtt actgagatac 480 gtgcaaaagg attgcctatc acacgccaag caatgcaact gaaggcagga gaaattgcca 540 aatccctcgg aatagatgaa agaaatttca aagcaatgag aggctggtgt gaccgattca 600 tgcgtcgtgc aggactatcg ttaaggcatc aaacatcaat ttgtcagaaa cttcctgctg 660 actttgaaca gaagctgctt aacttccagc gacatgtgat tcaattgagg aaaaaacgaa 720 actatgagtt tagtcaaata ggaaatgctg atgaaaccnc ggtgttcttc gacatgcctc 780 aaaattatac tgtcaatgct aaaggtgcta aagagatcaa gatcatgagc acaggttatg 840 aaaagcagca tatcactgtg atgctatgca taattgccga tggccaaaag ttgtcgccat 900 atttaatttt aaactacaaa ataattccta agaatgaaat cttccccaaa gatgttactg 960 tgcttnccan taaacatgga gatatagacg tcagntgagc tgatgaagga gtggctaaag 1020 tcatctggaa ttggatgtcc aggagcccta cgtaacccac caagtgtgtt ggttcttgat 1080 gcatttcatg gacgtgtatc tgaacagtta aagaatangt tcactgaaaa gtaanacaag 1140 ttggtggttg atccatnaac tgcaacccct caanattcca gtcagcaaac catttaagga 1200 ccatttgagg aaggaatatg agtcctggtt gttgtctgaa aaccttccgt tgacaccttc 1260 tggtaagatc aagaaagcgc cagcatcaaa acttgcagaa tgggtgtcag cggcttggaa 1320 gaaaatcccg gagacaatag tggagcactc ttttaactcc taggaaccaa atggttgtgg 1380 gnaagggagg gagaaggcag tgaatcagac atgtttagca acattcccga agcgggagtc 1440 aaaagtaggc tagctctaca taattgcaaa gccggctcaa gagcccagca ccaatggctc 1500 tcagttcttt tatggattct tttaagaaat gctgcatcac caacgctctt gatggcacag 1560 aggatgatat tgtgtggaaa aacacggaca tcgatgactc tgagtcgaaa agtgattcag 1620 aagagttgga ctctgaatgt gaagaagttt 
taggaatacc ttaaccaatt tatttcgctt 1680 atattttcct ttttatgtat gcacaagagt gatatatgat aaaaatctgt gtctaaataa 1740 gtctaaaaga gctctttcaa taagtataaa ataaaaattc taatgataag gaaagcattg 1800 tgtcatagtt taattggcag cgttttttct ttcttagtgg tacataaaat aatggtgcgt 1860 cttacaatcg atggcatctt agattcgatg aaatacgg 1898 // ID URR1 repbase; DNA; ROD; 236 BP. XX AC . XX DT 25-APR-1997 (Rel. 2.03, Created) DT 25-APR-1997 (Rel. 2.03, Last updated, Version 1) XX DE Rodent-specific putative non-autonomous DNA transposon - a DE consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; URR1; PR. XX OS Rodentia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires. XX RN [1] RA Ogata T.R., Rosa A.P. and Zepf E.N.; RT "Sequence of the gene for murine complement component C4."; RL J. Biol. Chem 264, 16565-16572 (1989). XX RN [2] RA Gale M.J., Tobey A.R. and D'Anna A.J.; RT "Localization and DNA sequence of a replication origin in the RT rhodopsin gene locus of Chinese hamster cells."; RL J. Mol. Biol 224(2), 343-358 (1992). XX RN [3] RP 1-236 RA Smit A.F.; RT "URR1."; RL Direct Submission to Repbase Update (30-NOV-1996). XX DR [3] (Consensus) XX CC URR1 has 19-21 bp (imperfect) terminal inverted repeats (TIRs). CC The terminal TA dinucleotides are probably flanking repeats. CC The element is related to PMER1 in prosimians. XX SQ Sequence 236 BP; 58 A; 57 C; 51 G; 70 T; 0 other; tagggcagtg gttctcaacc ttcctaatgc tgcgaccctt taatacagtt cctcatgttg 60 tggtgacccc caaccataaa attattttcg ttgctacttc ataactgtaa ttttgctact 120 gttatgaatc gtaatgtaaa tatctgatat gcaggatatc tgatacgcga cccctgtgaa 180 agggtcgttc gacccccgaa ggggtcgcga cccacaggtt gagaaccgct gctgta 236 // ID RMER6 repbase; DNA; ROD; 762 BP. XX AC . XX DT 22-APR-1997 (Rel. 2.03, Created) DT 16-JUL-2009 (Rel. 2.03, Last updated, Version 2) XX DE Medium reiteration frequency repeat - a consensus. XX KW Repetitive sequence; RMER6. 
XX OS Murinae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae. XX RN [1] RA Chopra V. and Jurka J.; RT "RMER6."; RL Direct Submission to Repbase Update (30-NOV-1996). XX DR [1] (Consensus) XX SQ Sequence 762 BP; 263 A; 128 C; 209 G; 159 T; 3 other; tgttgcagga tatttgatca cactgtgaac cccgagattg tgttatttac tgaaaaaacc 60 tgtttctagt tgtggtgtgg ctcagccctt agcacacacc tttaatccct ctggctggaa 120 tacagacacg cccttagtac acacctttaa tcccaaacaa tgaaggtaaa gttagtttgt 180 agaaggaagc acccatgttt gaaagtgatg tctaattgag tggcagacaa agtgacgaat 240 cagagaaaga tttgacagaa taggatatgc ccaactctca tgagaagaga gaggaaaggg 300 aagctttaag ggagaagtga asagagagag aaaggagagg aggcagtttt actgggagag 360 ttttasagag acaggttgaa gagagaacaa gctagacaca ggtgaagaca gaacgagcca 420 gagaatgaga agccagaaga ttagaacaga ttgccagagt tagtttgagg ccaagcagag 480 caattcagtc agaagccgag aaagaagcca gattgaatca gtcagcttgg agaggagttt 540 gagccagaac agctgagttg aaccagccag ccagagttca gaaagaacaa gaaagggtga 600 gyttattcag cagtaagtct cagaggctga aaacattcta ggcctagatt agattgtacg 660 gaggaggcta gaagcttcca ggactaggcc taggttagca gacggagggc agtaagcctc 720 ggagatgaca attacatcag gcgaataaaa gttactttta ca 762 // ID RLTR6_MM repbase; DNA; ROD; 656 BP. XX AC X17508; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 17-APR-1997 (Rel. 2.03, Last updated, Version 2) XX DE Mouse virus-like 30S (VL30) LTR. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW Repetitive sequence; Long terminal repeat; retrotransposon; KW MMLTR6; RLTR6_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-656 RA Norton D.J.; RT "RLTR6_MM."; RL Direct Submission to Genbank (28-NOV-1989)Norton J.D., Royal Free RL Hospital School of Medicine, Dept of Haematology, Pond Street, RL Hampstead , London NW3. 
XX RN [2] RP 1-656 RA Eaton L. and Norton D.J.; RT "Independent regulation of mouse VL30 retrotransposon expression RT in response to serum and oncogenic cell transformation."; RL Nucleic Acids Res 18(8), 2069-2077 (1990). XX DR GenBank; X17508; Positions 1 656. XX SQ Sequence 656 BP; 143 A; 157 C; 143 G; 213 T; 0 other; tgaagaatga aaaattactg gcctcttgtg agaacatgaa ctttcacctc ggagcccacc 60 ccctcccatc tagaaaacat ttttgagata aaggcctcct ggaacaacct caaaatgaca 120 ttgccaaatg ataagacatg actccttagt tacgtaggtt ccttgatggg acatgactcc 180 ttagttacgt aggttccttg ataggacatg actccttagt tacgtagaat cctttggcag 240 aaccccttgt cccttggcag aactccctag tgatgtaaac ttgtactttc cctgcccagt 300 tctcccccct ttgagtttta ctatataagc ctgtgaaaaa ttttggctgg tcgtcgagac 360 tcctctaccc tgtgcaaagg tgtatgagtt tcgaccccag agctctgtgt gctttctgtt 420 gctgctttat ttcgacccca gagctctggt ctgtgtgctt tcatgtcgct gctttattaa 480 atcttacctt ctacatttta tgtatggtct cagtgtcttc ttgggtacgc ggctgtcccg 540 ggacttgagt gtctgagtga gggtcttccc tcgagggtct ttcatttggt acatgggccg 600 ggaattcgag aatctttcat ttggtgcatt ggccgggaat tcgaaaatct ttcatt 656 // ID MER128 repbase; DNA; ROD; 329 BP. XX AC . XX DT 21-JUL-2006 (Rel. 11.07, Created) DT 21-JUL-2006 (Rel. 11.07, Last updated, Version 1) XX DE Unclassified mammalian repetitive element - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW MER128; mariner; Tigger14a; conserved. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-329 RA Jurka J.; RT "MER128: Unclassified, moderately repetitive element from RT mammals."; RL Repbase Reports 6(7), 381-381 (2006). XX DR [1] (Consensus) XX CC This sequence is present in >1000 copies phg. 
XX SQ Sequence 329 BP; 109 A; 54 C; 49 G; 115 T; 2 other; cagtaaaagc tcgtttatcc ggcattctat caaccagaac tctctattaa ctagcacttc 60 tgtacatcta tagtayaatg ataattgatg ttcataatga tgaccctgag gcactacaag 120 atcctgaagt gccttctgaa tcatcaaaga aagattaaat tatgttcagt atagttttag 180 tgttaagtgt attttattgt attctaattc tttgaaaatt ggtaatctat gtatggtata 240 tatgataact ctctattaac cagartattt gattaaccag aatacattat tcctgaccat 300 acccaatatg gataacagag agtttactg 329 // ID RLTR25_MM repbase; DNA; ROD; 774 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 14-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW retrotransposon; RLTR25_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-774 RA Jurka J. and Drazkiewicz A.; RT "RLTR25_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 10-10 (2002). 
XX DR [1] (Consensus) XX SQ Sequence 774 BP; 234 A; 146 C; 162 G; 232 T; 0 other; tgagggccat gcaggaagga acaacaaatg gctttagaac tcatggaatg cagcccactg 60 gttagcacag ccacatgggt gagcctcagt gtggcataac cctgaggacc acgtgtcctg 120 aggacttcat gctgttatgg agttctgctc tgcttgctgc cctcacatcc ctcttaatct 180 aagaattgta tgaaaactta agggggcttc cagcccctca ctgggcagta tctctggcac 240 tcacaataat agtgcttgat ctctttcagg cattgctgct ggagcagatt aatgattcaa 300 tcaccaccct aggaaagaac ttagagatgt agattgtcag caatgaccac tgccaccacc 360 acccttagaa aacatttgga aaaataaaat tgttagtgaa gctaggttaa gatgtttttg 420 ctactggctc tgatgtagaa aatcttaggg aacaatttga agttcagaaa ggcacactct 480 taatagttaa gctagggact ttttaaggtc agggaacaag aacaatggca gcactggctg 540 acatgactct cactgaggct ggactggtga tgattgattt cttataccca tttttgccct 600 cagcttcctg atatgattta ataaaaattg ctgatgagta gatatgcatt ctgaggattt 660 aaagtagata tgactgattt ttatataaaa gtttagattt gtgattcttt taagattctt 720 ataagattac ttttaaagat aaataataaa ataattaatc cttttttgta acca 774 // ID ERVB4_5-LTR_RN repbase; DNA; ROD; 482 BP. XX AC AC106444; XX DT 11-FEB-2004 (Rel. 9.01, Created) DT 11-FEB-2004 (Rel. 9.01, Last updated, Version 1) XX DE Rat endogeneous beta retrovirus ERVB4_5, LTR sequence. XX KW LTR Retrotransposon; Transposable Element; LTR; KW endogeneous betaretrovirus; RnERV-B4_AC106444; ERVB4_5-LTR_RN. XX OS Rattus norvegicus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Rattus. XX RN [1] RP 1-482 RA Baillie J.G., Van de lagemaat N.L., Baust C. and Mager L.D.; RT "Multiple groups of endogeneous betaretroviruses in mice,rats and RT other mammals."; RL J. Virol 78(11), 5784-5798 (2004). XX DR Genbank; AC106444; Positions 185293 185774. 
XX SQ Sequence 482 BP; 116 A; 149 C; 90 G; 127 T; 0 other; tgcacggagc catattggat ttgggcctag ccgctttttg actgcatggt tcaaagtgac 60 tctgggctgt ttggccacag agactcaccc caaagatgac tgccataaat cttaagattg 120 ttggataaag cctctcccat gaatttacgg cacaggagta taacattcaa gggatgcctc 180 agctcgagca gataccagtt atctcctcag tcaacaccca gtgtcttcac cacctgacct 240 catcgcctga cttctttgcc tccagctatc actgcctata ccctcatctc cccctccttc 300 atatttcaca aaggtcactc cccctacctg aaagccttaa aaactgtaac attcatccca 360 gtaaacgaga ccttgacaac agaacttttg cttggtctcc ttcttctctt cacccccatt 420 ttggcccaca ggtagaaagc ctcttcggga ccctgaataa ctgggtcccc gctggcgggg 480 ac 482 // ID RMER3 repbase; DNA; ROD; 574 BP. XX AC . XX DT 27-JAN-1997 (Rel. 3, Created) DT 15-OCT-1997 (Rel. 3.1, Last updated, Version 2) XX DE Medium reiteration frequency repeat - a consensus. XX KW Repetitive sequence; RMER3; putative LTR of retrovirus. XX OS Rattus norvegicus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Rattus. XX RN [1] RP 1-574 RA Jurka J., Kapitonov V.V., Klonowski P., Walichiewicz J. RA and Smit F.A.; RT "Identification of new medium reiteration frequency repeats in RT the genomes of primates, rodentia and lagomorpha."; RL Genetica 98, 235-247 (1996). XX DR [1] (Consensus) XX CC 7 bp terminal inverted repeats. 
XX SQ Sequence 574 BP; 159 A; 176 C; 87 G; 152 T; 0 other; tgttgttgta aaaatataaa aataaagagt acagttgtct tttaccccgc taggtccgca 60 ccacggtgcc caagatatct gctagatatc ttggcggaaa cacatcccag ccgcacactt 120 ttttacactc aaaccctcac ataaaagaac acacaacaca ataatcttag acccaattgg 180 taagatataa ttgcccacct aaacatacaa agcccggtac catccatccc ttaagaacat 240 taataacaac ctgtaaatac acagagcgga atcttaacgt cacctgccat attgtcctgc 300 catggcttct ccgcctctct ctccctcctg tctcttcctc tctcccttcc agtctcctcc 360 tcttccttca aacttctctc ccgcccatcc ttccttctcc tccaatgaca ggcctccttc 420 tatcctgtac ctgcccctca cctgtatttt acaaattcaa tggggagaag gttctggtga 480 agtcacctga gtcctgagta cgtgactagg cagctgtcct tggggcagtg gaattagcat 540 caaaatacag ataacttcag ggcaaaccac aaca 574 // ID MLTR18B_MM repbase; DNA; ROD; 640 BP. XX AC . XX DT 21-AUG-2008 (Rel. 13.08, Created) DT 28-SEP-2008 (Rel. 13.08, Last updated, Version 2) XX DE Long terminal repeat of endogenous retrovirus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; MLTR18B_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-640 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats of endogenous retroviruses from mouse."; RL Repbase Reports 8(8), 893-893 (2008). 
XX DR [1] (Consensus) XX SQ Sequence 640 BP; 166 A; 164 C; 143 G; 165 T; 2 other; tgttgggagc atagcatgaa ggtccttgtg cccacagatc tccagagaaa ccaggaatgt 60 taagcacaca ggattgtctt tccaatggga aagatgtaaa accctgttct tccatccaga 120 aagctgggaa ttggaagtga ttaacagcct ggtgatctgc gttatctgtt gggccaggcc 180 atatcaagct cctacaacca ggaccggagg tggctagtaa tctaaatgac ccttatagcc 240 agaggctccc ccaagctccc acaaccagga ccagaggtgg ctaatagccc aaatgacctc 300 tatgctatct gtatgactga gaccaggcca gacccttccc taggccctta agctagtaca 360 ggtagcctat ctctagaacc catcttcttg gaagaaatga catgtaccct gagttgagtt 420 tcgatgtaat tacgcttcct tgtgcaccgg ggattgtatt acaccaggaa actttttttc 480 caaattatac tgtgtttaaa tacgttggga ataaaccgcc tggcatcaga ctccctagaa 540 gtcttgatcc aggttgacga agtcaatctg aawccgggtt ttcattctca yctcttcgtg 600 tggttcgctt ccctgcctga cacctgcagg gaccccctca 640 // ID RNSAT1c repbase; DNA; ROD; 168 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 17-JUL-2008 (Rel. 13.07, Last updated, Version 2) XX DE Satellite from rat. XX KW Satellite; Simple Repeat; RNSAT1c. XX OS Rattus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RA Smit A.F.; RT "RNSAT1c_ - Satellite from rat."; RL Direct Submission to Repbase Update (06-SEP-2005). XX RN [2] RP 1-168 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR [1] (Consensus) XX CC [2]. XX SQ Sequence 168 BP; 42 A; 40 C; 22 G; 64 T; 0 other; ccacattcat tacattcgta aggtttctct ccagtatgaa ttctttcatg ccttttaagg 60 tcattctgac ctacaaaggc tttaccacat tcattacatt cgtaaggttt ctctccagta 120 tgaattcttt catgcctttt aaggtcattc tgacctacaa aggcttta 168 // ID MamGypLTR2b_LTR repbase; DNA; ROD; 922 BP. XX AC . XX DT 05-FEB-2007 (Rel. 12.01, Created) DT 10-APR-2007 (Rel. 12.01, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from mammals. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW MamGypLTR2b_LTR. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-922 RA Smit A.F.; RT "MamGypLTR2b_LTR - Gypsy LTR Retrotransposon from mammals."; RL Direct Submission to Repbase Update (05-FEB-2007). XX DR [1] (Consensus) XX CC 4 bp TSD; 34% subst in dog-human; 5' end undefined; 80% similar CC to MamGypLTR2c which extends further 5', but is also undefined CC there. Pos 568-922 (end) are 65-75% similar to the similar CC region in MamGypLTR1b. XX SQ Sequence 922 BP; 230 A; 215 C; 301 G; 146 T; 30 other; cctccctccc cccagannag ataaagaccc tctggactgg agggaggagg agagctggag 60 aggagagaga gagttagacg ctggtgggtc tctgagaagg gccctgcgcc cctctcccct 120 cctgggccga tcccggggng ggggnaaatg gaannctcag ataggtttgg gggtccnaga 180 gcagagagga cttgtgcctn cttcccgggt gaagccnggg aggcggcaag cctcgcagag 240 nngccctgca tccacatggc nngagagcgg cagagcagag atggctgcgt ggggtgtcta 300 ggcagagggg cctgagaggc ncccccgagt ctcccgacag cccagagtgg cacnggagag 360 canccaggtt tccctgctcc tccaagaagg gcgcgagaat tagctgagag agagccgtgg 420 cagaaantag cagaggcctg cagccagggg ccagctggga ngagnanaag gngcatgcct 480 gggggaagga tgccagcggc cggagaccag atgggaagtg gccacctcag cggatgccag 540 cngggagggg tgncgacgnc cgaggaccag acaggacaag gtacatctca gcggatgcca 600 gcagaccagg acagatgang accgagaagc ggacgccncn ccctcccaat gatacggcan 660 ctgtgtaagc cccctggaac ttaganncaa ccccagggag aaggggagag aagggggaaa 720 atcctgaatt gactgagttt taaacctgaa atgactgaga aatcactgaa tttgactgag 780 tttacctgga agtgactaga ttaagttttc tgccatcagg cagaatgggg gctcgagata 840 gaaattaagt tcagttatag aaaaataaag ttacattttt gcacacctga gtttgtggct 900 tgtgaaattc gtacctgcta ca 922 // ID LTR1_CPo repbase; DNA; ROD; 411 BP. XX AC . XX DT 17-JUN-2009 (Rel. 14.06, Created) DT 01-DEC-2009 (Rel. 14.06, Last updated, Version 3) XX DE Long terminal repeat: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1_CPo. 
XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-411 RA Jurka J.; RT "Long terminal repeats from guinea pig."; RL Repbase Reports 9(6), 1259-1259 (2009). XX DR [1] (Consensus) XX CC ~94% identical to consensus. XX SQ Sequence 411 BP; 99 A; 123 C; 90 G; 99 T; 0 other; tgaaagagtt aactttacag ctgcaacccc cgagctatct gtacaaaaca agaagggact 60 ttccgttggt aaaaccgcag aatgttctgt ctcagtgagt taggcaagat aagacaactg 120 ttccgggaac acagtgaccg ggctacgacc cccggacacc aaagttaccc aaccctgagg 180 ccctccaatc agctcctgcc aagccctgcg ccgttcaaaa ctaaccaatg cgatctgctt 240 ctgtaacctc gcctgctttg cggtttatgc ctttaaaaac cctgtgtaac ttcccttcgg 300 ggtcctccgc actagtttgc tggacggacc ccatgcgcat ggaaataaag cttttttccc 360 cttagagacg tgggtccttg gggtcttctt ccctgcgaca ccggccttac a 411 // ID RLTR17_MM repbase; DNA; ROD; 802 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 14-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW RMER12; RLTR17_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-802 RA Jurka J. and Drazkiewicz A.; RT "RLTR17_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 2-2 (2002). XX DR [1] (Consensus) XX CC Bases 513-802 similar to RMER12 (71%, bases 1027-1315). 
XX SQ Sequence 802 BP; 270 A; 146 C; 129 G; 257 T; 0 other; tgtcctcagc acttatacat ctttcagaat acatgatcac atgttaaaaa gttcatcaca 60 agttcacaca taaattcaaa tcataaattg aataagaagt ttacaacaga gaatgtttac 120 atgcatatcc attaggagta attatctggc taaacattca tcacctgtca cagctccaca 180 ggttcactga aagttaaaaa ccataactaa gttatctagt gaagttttgt atagataaac 240 ccagtcaata ttttatcttc tgtcctagca cctataataa atcattagtt ccctttttat 300 gacctttggt taattgtttt acaacctctt ggaatgtgct ctgagtagta gaaagtctgg 360 ttactatcta agagcaatta actggtgaca cttgggagac tggcagagtt ctcattgcag 420 ttttgactat cagaaaagga cctaatagca gtcccactat aaaagagctt aataattact 480 gatataattt taggaattct tataggatca tcattaagaa ttaagtcatc tatttgtcta 540 tatagcatca ctacaagaca gtacatcttt gtagatctgc agagatctgc tccaaaaggg 600 tgggctaata cctagtgatt gttatatatg tttaataata acaggaaaag catattaata 660 gcaggaatct ttcctaaaat gaatttctcc tggccttgcc taatgagagc aaatctgtca 720 tagattatat aataatccaa ggcagaccat ctcaggaaga tcacctgcta gttattagct 780 tgtcctatga tggctcctga ca 802 // ID L1ME5 repbase; DNA; ROD; 507 BP. XX AC . XX DT 28-MAR-2000 (Rel. 7.1, Created) DT 28-MAR-2000 (Rel. 7.1, Last updated, Version 1) XX DE 3'-end of L1 repeat (subfamily L1ME5) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; L1ME4; L1ME5 subfamily; KW L1ME5. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-507 RA Jurka J.; RT "L1ME5."; RL Direct Submission to Repbase Update (JUL-1999). XX DR [1] (Consensus) XX CC An ancient L1 subfamily, 69% similar to L1ME4. 
XX SQ Sequence 507 BP; 242 A; 24 C; 39 G; 196 T; 6 other; ctaaggaaag aattaaaaaa agtacaaaaa tatatattca agaattttca tttcaatatt 60 ggntttaaaa caataaaata tttggaaaca atctaaatgt ctaataatgg gagaataaat 120 aaattatagt atatctataa aataaaatat tatatagcta ttaaaawtat atttttaaaa 180 aatatttaat gatataaaaa aatntttata atataatatt aaataaaaaa aataaattat 240 aaaattatat atataatata attttatatn ttaaatatat aatatatata taaatatata 300 tatatatata tnaaataaaa aaaaaagact gaaaggaaat atacaccaaa atgttaacag 360 tggttatctc tgggtggtgg gattataggt gatttttatt tttttttttt tttttatatt 420 ttctgtattt tctaaatttt ttntaaattt tctacaataa atatgtatta cttttataat 480 cagaaaaaaa ataaaaataa taaaaat 507 // ID L1MA9 repbase; DNA; ROD; 1053 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1MA9) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; MER32; L1MA9 subfamily; KW L1MA9. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1053 RA Jurka J., Kaplan J.D., Duncan H.C., Walichiewicz J., RA Milosavljevic A., Murali G. and Solus F.J.; RT "Identification and characterization of new human medium RT reiteration frequency repeats."; RL Nucleic Acids Res 21, 1273-1279 (1993). XX RN [2] RP 1-1053 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [2] (Consensus) XX CC Contains identical ORF2 region consensus (subfam L1M2) as L1MA5. CC ORF2 ends at bp 684; average divergence of copies from consensus: CC 16% CC Replaces MER32 (Acc. No. J02963). 
XX SQ Sequence 1053 BP; 418 A; 151 C; 193 G; 260 T; 31 other; ttaatatcca aaatatataa ggaactcaaa caactcaaca agaaraaaac aaataaccca 60 attaaaaaat gggcaaarga cctgaataga catttytcaa aagaagacat acaaatggcc 120 aacagatata tgaaaaaatg ctcaacatca ctaatcatca aggaaatgca aattaaaacc 180 acaatgagat atcacctcac acctgttaga atggctatta tcaaaaagac agaaaataat 240 aaatgttggy gaggatgtgg agaaaaggga actattgtac actgttggtg ggaatgtaaa 300 ttagtayagc caytatggaa aacagtatgg aggttcctca aaaaaytaaa aataraacta 360 ccatatgaty cagcaatccc actwctgggt atatatccaa argaattgaa atcagtatgt 420 ygaagagata yctgcactcc catgtttayt gcagcaytat tcacaatagc caagatatgg 480 aawcaaccta agtgtccatc aayggawgaa tggataaaga aaatgtggta tatatacaca 540 atggaatact attcagccat aaaaaagaat gaaatcctgt catttgyarc aacatggatg 600 aacctggagg acattatgct aagtgaaata agccaggcac agaaagacaa atactgcatg 660 atctcactta tatgtggaat ctaaaaaagt caaaytcata gaaacagaga gtagaatggt 720 ggttaccagg ggctggggra kgggggaaat gggaagatgt tggtcaaagg gtacaaagtt 780 kcagttatgt argatgaata agttctrgag ayctaatgta cagcatggtg actatagtta 840 ataatactgt attgtatact tgaaatttgc taagagagta gatyttaagt gttctcacca 900 canacaaatg gtaactatgt gaggtgatgg atatgttaat tagcttgayt gtggtaatca 960 tttcacaatg tatacatata tcaaaacatc acgttgtaca ccttaaatat atacaatttt 1020 tatttgtcaa ttatacctca ataaagctgr aaa 1053 // ID LTR4C_Cpo repbase; DNA; ROD; 531 BP. XX AC . XX DT 04-NOV-2009 (Rel. 14.11, Created) DT 04-NOV-2009 (Rel. 14.11, Last updated, Version 3) XX DE Long terminal repeat: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR4C_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-531 RA Jurka J.; RT "Long terminal repeats from guinea pig."; RL Repbase Reports 9(11), 2875-2875 (2009). XX DR [1] (Consensus) XX CC >97% identical to consensus. 4 bp TSD. 
XX SQ Sequence 531 BP; 156 A; 135 C; 110 G; 129 T; 1 other; tgaagaacca gaaatttaaa ggttaaaaac aagatgctgt ttctcagcta gaaacagcac 60 tgggcctcgt gactcagtcc taactcagcc ctgcagaaat aaaaacaacc tctgtggctg 120 tgatatgctn gggatgtgct gtgccaggga actcagagat agagcctgcc attacgctaa 180 tcagggttgt aaaatatgtc agactcaagc ctacagaaac caggtgtctt aaagacaagt 240 tgtgcagaac aaagacagtt aattaactct gacccgctgg tcacccctta tgaactgacc 300 aatagcagat agataagatg cccaccgctc cgcactaact gtaatgattg gcttctgtaa 360 taaacgctta cattgcagtt tttccccctt aaaaacacca gccctgcccc aactcagggt 420 tctccgtttc cacctgtttg gaggaccccg tgtacacgga ataataaaaa cccctttgtt 480 cttacatgag agcgaggccc ttggagtctt ccttgcgaca ccggtcttac a 531 // ID TIGGER5B repbase; DNA; ROD; 446 BP. XX AC . XX DT 28-JUN-2000 (Rel. 7.2, Created) DT 28-JUN-2000 (Rel. 7.2, Last updated, Version 1) XX DE Non-autonomous DNA transposon - a consensus. XX KW Repetitive sequence; TIR; Non-autonomous DNA transposon; MER47; KW TC1/mariner superfamily; TIGGER3; MER47B; TIGGER5B. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-446 RA Smit F.A.; RT "TIGGER5B."; RL Direct Submission to Repbase Update (1996). XX RN [2] RP 1-446 RA Smit F.A.; RT "TIGGER5B."; RL Update0-0 (2000). XX DR [2] (Consensus) XX CC A different internal deletion product of TIGGER5 (see there). CC Consensus inverted from [1] to agree with TIGGER5 coding region.
XX SQ Sequence 446 BP; 134 A; 100 C; 86 G; 116 T; 10 other; cagatgctcc tcgacttacg atggggttac atcccgataa acccatcgta agttgaaaat 60 attgtaagtc gaaaatgcat ttaatacacc taacctaccg aacatcatag cttagcctag 120 cctaccttaa acatgctcag aacacttaca ttagcctaca gttgggcaaa atcatctaac 180 acaaagccta ttttataata aagtgttgaa tatctcatgt aatttactga ayayartaca 240 ctgtagarta yyggttgttt accctcgtga tcgcgcggct gactgggarc tgcggytcac 300 tgycgctgcc cagcatcgcg acagagtatt gtaccgcata tcgcyagcct gggaaaagat 360 cagaaattcg aagtacggtt tctactgaat gcgtatcgct ttcgcaccat cgtaaagttg 420 aaaaatcgta agttgggaac catctg 446 // ID CAVID2B2 repbase; DNA; ROD; 87 BP. XX AC . XX DT 26-DEC-2009 (Rel. 15.03, Created) DT 26-DEC-2009 (Rel. 15.03, Last updated, Version 3) XX DE tRNA-derived SINE family: consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW CAVID2B2. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-87 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 10(3), 503-503 (2010). XX DR [1] (Consensus) XX CC ~89% identical to consensus. XX SQ Sequence 87 BP; 27 A; 22 C; 24 G; 14 T; 0 other; gggccgggga tgtagctcag tggcacagca cctgcctggc aagtgtgagg ccctgagttc 60 aattcccagt acccaaaaaa aaaaaaa 87 // ID RNSAT1a repbase; DNA; ROD; 168 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 29-MAR-2010 (Rel. 13.07, Last updated, Version 3) XX DE Satellite from rat. XX KW Satellite; Simple Repeat; RNSAT1a. XX OS Rattus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-168 RA Smit A.F.; RT "RNSAT1a_ - Satellite from rat."; RL Direct Submission to Repbase Update (06-SEP-2005). 
XX DR [1] (Consensus) XX SQ Sequence 168 BP; 58 A; 30 C; 32 G; 48 T; 0 other; taaagccttt gcatctcata gttatctcca agtacataaa agaatacata ctggagagaa 60 gccctatgta tgtgatcaat gtggtaaagc ctttgcatct catagttatc tccaagtaca 120 taaaagaata catactggag agaagcccta tgtatgtgat caatgtgg 168 // ID MER21B repbase; DNA; ROD; 795 BP. XX AC . XX DT 19-FEB-1997 (Rel. 5, Created) DT 19-FEB-1997 (Rel. 5, Last updated, Version 1) XX DE Human medium reiteration frequency MER21B repetitive sequence - a DE consensus. XX KW Repetitive sequence; MER21B. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-795 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [2] RP 1-795 RA Smit F.A.; RT "Structure and evolution of mammalian interspersed repeats."; RL Ph.D. thesis (USC 1995). XX DR [2] (Consensus) XX SQ Sequence 795 BP; 188 A; 210 C; 159 G; 191 T; 47 other; tcaacacaca acacttctgt caccaaacgt gtggrrrttt ttccccacac acaaacaakt 60 cttcagttct gcggcggaca ccarctgggt gtcctccaat tcarttcagt tctgacacta 120 wctacctrga gatagcgtca gatcccacag gtttaagggc tcagtcccac aagactgccc 180 ccacttcaga trccagtcgc aagtctrggt tktcacccgt acttctgacc aactggctat 240 aaattgttcc cacgacccct ctttaggttc gattaatttg ctagaatrgc tcacaraact 300 cagggaaaca cttatattta ycggtttatt ataaaggata ttacaaagga tacagatgaa 360 caaccagatg aagagatrca tagggcgagg tmtgggagag tccngggnnc aggagcttcc 420 gtgccctctc tggstnnrcc accttccwgg cacctccacg tgttcaccaa cccggaagct 480 ctccgaaccc tgtccttttg ggtttttatg gaggcttcat tacgtaggca tgaytgatta 540 catcantggc cattgattat caactcaacc tycagcycct ctccyctccc cagaggttgg 600 agggtggggc tgaaagttcc aaccytctaa tctgccttgg tctttctrgc gaccagcycc 660 catccwgrag cntnctaggg gctgcccrcc akgagtcgmc tcattagmac aaaagrncrt 720 ynctattacc caggarattc caagggtttt aggagytctg tgtcaggaac cggggtcaaa 780 gaccaaatat tagat 795 // ID MER21C repbase; DNA; ROD; 935 BP. 
XX AC . XX DT 23-APR-2001 (Rel. 6.03, Created) DT 23-APR-2001 (Rel. 6.03, Last updated, Version 1) XX DE Long terminal repeat of retrovirus-like element - a consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW Long terminal repeat of retrovirus-like element; MER21B; MER21C. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-935 RA Jurka J.; RL Direct Submission to Repbase Update (31-MAR-2001). XX DR [1] (Consensus) XX CC It might have evolved from MER21B by partial duplication CC of the first 90 bp, or so, from the 5' end. Uninterrupted CC similarity to MER21B starts around position 115. XX SQ Sequence 935 BP; 238 A; 193 C; 245 G; 255 T; 4 other; tgtgatataa taagaaatat atatttggtc tctgcccctg gttcctggca cagagctcct 60 aaaacccttg taatttcctg agtgataggg gtgataggag catcttttgt tctaatattt 120 ggtctttgac cctagttcct gacacagagc tcctaaaacc cttggaattt cctgagggtg 180 ataggagtat cttttnttta tgctaatgag gtgactcgtg gctgggggct cctagatagc 240 ttcaggatgg gggctggtca ccagaaagac caagccatga ttagagggtt ggaactttca 300 gccccacccc cccatcctcc agggagggga gaggggcttg gagattgagt tgatcaccaa 360 tggccaatga tttaatcaat catgcctacg taatgaagcc tccataaaat ccctaaagga 420 cagggttcca gagagcttct gggttgctga acacatggag gtgctgggag ggtggtgcgc 480 ccggagaggg catggaagct ccgtgccccc cccatacctt gccctatgca tctcttccat 540 ctggctgttc atctgtatcc tttgtaatat cctttataat aaactggtaa atataagtaa 600 aatgtttccc tgagttctgt gagccattct agcaaattat tgaacctgag gagggggtcg 660 trggaacccc caatttatag ccagttgtta gttggtcaga agtacaggtc acaacctggg 720 acttgcaatt ggcatctgaa gtgggggcag tcttgtggga ctgagccctt taacctgtgg 780 gatctgatgc taactccagg gtagatagtg tcagaattga attaaattat aggacaccca 840 gttggtgtcc sccagagaat tggagaattg cttggtgngt gtggaaaaac cccacacatt 900 tggtgacaga agtgttgtga gagtagagaa aaaca 935 // ID LTR6G_Cpo repbase; DNA; ROD; 413 BP. XX AC . XX DT 19-SEP-2009 (Rel. 14.1, Created) DT 19-SEP-2009 (Rel. 
14.1, Last updated, Version 3) XX DE Long terminal repeat of ERV3 endogenous retrovirus: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR6G_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-413 RA Jurka J.; RT "Endogenous retroviruses from guinea pig."; RL Repbase Reports 9(10), 2155-2155 (2009). XX DR [1] (Consensus) XX CC >82% identical to consensus. XX SQ Sequence 413 BP; 100 A; 98 C; 109 G; 105 T; 1 other; tgttatggct tatgtctaga tgccccccaa agcctcatgc gctcgtagca agtggggctt 60 ttgggaggtg actggatgat actggattag tccactgatt ggttagcata gttctgggtg 120 tggatgatgg acttactgaa gagggaagac ccaccctggg tgtgggtggc accaaccaat 180 aggcttgtag cctggatgga ataaaaaggg aaagaaggaa gtctcactgc ttcctgcctg 240 ccatgccacg gactgtttct cctctgcgat gcccctctgc catgccgccc tgccttggag 300 ccagccgact atggactgaa acctctanaa actgtgagct aaaataaacc tttcctcctt 360 taacttgcgg gtgtcaggta ttttgtccca gcaataagaa agtaactaag aca 413 // ID LINE2B repbase; DNA; ROD; 419 BP. XX AC . XX DT 09-OCT-1997 (Rel. 6.3, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE 3'-end of L2 repeat (subfamily b) - a consensus. XX KW Repetitive sequence; L2 (LINE) family; LINE2B subfamily; LINE2B. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-419 RA Smit F.A.; RT "LINE2B."; RL Direct Submission to Repbase Update (1997). XX DR [1] (Consensus) XX CC Average divergence from consensus 30%. 
XX SQ Sequence 419 BP; 61 A; 154 C; 88 G; 107 T; 9 other; tcccccttgc tcactctgct ccagccacac tggcctcctt gctgttcctc aaacacgcca 60 ggctctttcc cgcctctggg cctttgcaca tgctgttcyc tctgcctgga acgcccttcc 120 ccwctccttc ancctggcca actcctactc gtccttcagg kctcagctca ratgtcacct 180 cctccaggaa gccttccctg acttcccagg ccgagttagg tgccctcctc tgggcccccc 240 cggtcctacc ctgccactct gggttatmat tgtctgtkng cangtctgtc tcccccactg 300 gactgtgagc tccgcgaggg cagggactgt gtctgtcttg ttcaccactg tatccccagc 360 gcctagcaca gtgcctggca catagcaggc gctcagtaaa tgtttgttga atgaatgaa 419 // ID RMER16 repbase; DNA; ROD; 392 BP. XX AC . XX DT 26-SEP-1997 (Rel. 2.08, Created) DT 26-SEP-1997 (Rel. 2.08, Last updated, Version 1) XX DE Long terminal repeat of retrovirus-like element. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW putative long terminal repeat; RMER16. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-392 RA Smit A.F.; RT "RMER16."; RL Direct Submission to Repbase Update (1997). XX DR [1] (Consensus) XX CC Probably a retrovirus-like long terminal repeat. 6 bp CC duplications. XX SQ Sequence 392 BP; 89 A; 81 C; 96 G; 124 T; 2 other; tgttgagggc ttttggtgcc gcttctagga atttggcagg aattcgtcat tgggccagga 60 cacggaagta ggctcaggca ggaatatgac tttgggctag gacaaggaag taggctcaga 120 tatcttggtc atcctgataa gcccttagaa acagtgatca cgggactttt attgccttgc 180 ttgttccttg actgtttgtg tttattgcac tgctttacct tattatttgc atgtacctaa 240 aatgatataa aagcagactg gggagaaata aacctgcctc agcctcagaa ctggctgggg 300 tcatgctaca rtgttgtcta attntccttt tttcttttca atcctcactc ctgccctgga 360 gaacctgttg actgactgag ctggcttggt ca 392 // ID RLTR39_MM repbase; DNA; ROD; 575 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. 
XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR39_MM; KW RLTR31_Mur. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-575 RA Pavlicek A. and Jurka J.; RT "RLTR39_MM - a family of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Similar to RLTR22_MM. Individual copies are CC ~91% identical to the consensus. RLTR31_Mur in RepeatMasker. XX SQ Sequence 575 BP; 174 A; 88 C; 182 G; 131 T; 0 other; tgtggattgc tgtctatatg caggtgaggg caggtaaaag ataaggctag agcctgtgat 60 tgggcagtgg aaaaagaagg cggggctgag agttttagag acaggacaga gaagggacag 120 agaaggacag aaggacagag acaagatgga ggaagaagag gacgaaccag atccacatgg 180 ctttaaatag ccacaggtag ctatgaatat catagaaggg caatagaata atataggaca 240 atttgtccaa tctaggtggg cagcttgtat caatatcaat tgagctctga gttcattgtg 300 tgggcatttt gtgggttgag aatttactga tataaatctg actgataaat tacaagcctc 360 tagagttttg attttactgg gttacgggga tttgtgacag ctagccacag ggggcagatg 420 gctgggaatg ttgagcaggt tccagcagca agcgaaccgc gagaagttgg gctgaggccg 480 ccgcatgggg atagccatgg gggcggggag accgccgtta ctagagcgta gccggcaata 540 gcgtggattg attttttaaa taattacatg caaca 575 // ID RLTR20A1_MM repbase; DNA; ROD; 468 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR20A1_MM; KW RLTR20A1. 
XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-468 RA Pavlicek A. and Jurka J.; RT "RLTR20A1_MM - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Similar to RLTR20B2 and RLTR18_MM. Individual CC copies are ~89% identical to the consensus. 6 bp TSDs. XX SQ Sequence 468 BP; 139 A; 80 C; 143 G; 106 T; 0 other; tgttggggat tggttctaat gctttgaatt aatccggccc ccaaaatcag ggaatctgcg 60 tgtccaaacg ctgaaggtcc ttgtccccaa ttggtttttg atcgatcaat aaagagccaa 120 cggccaatgg ctgggcaggt agactgaggc aggaccttta gatttgcatg ggctaggaac 180 tgggagaaag gaagaagaga gagatcacca tgtctcggag ggagacggat cagatttaga 240 gctgcagaag aaaattcatc caaaatgtag gtggaaaggg aaagcggccc tatgggaggg 300 gctgcccaga aggtaacagg gcagcaaaga taaaacatag atttagaagg tgttgagcca 360 ggagtactgg agggaaatgt atgctagcca ggcggaggat tagaactgcc cagcctttga 420 gctagtcaag gcatatttaa aattaactgg tgtgtgtgtg tgtttcat 468 // ID MARE1 repbase; DNA; ROD; 167 BP. XX AC . XX DT 26-JUN-2006 (Rel. 11.06, Created) DT 01-AUG-2007 (Rel. 11.06, Last updated, Version 4) XX DE Mammalian-wide repeat - consensus. XX KW MARE2; MARE1. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-167 RA Jurka J.; RT "MARE1: An ancient mammalian repetitive element."; RL Repbase Reports 6(6), 344-344 (2006). XX RN [2] RP 1-167 RA Gentles AJ., Wakefield MJ., Kohany O., Gu W., Batzer MA., RA Pollock DD. 
and Jurka J.; RT "Evolutionary dynamics of transposable elements in the RT short-tailed opossum Monodelphis domestica."; RL Genome Res 17(7), 992-1004 (2007). XX RN [3] RP 1-167 RA Jurka J., Kapitonov V.V., Kohany O. and Jurka MV.; RT "Repetitive sequences in complex genomes: structure and RT evolution."; RL Annu Rev Genomics Hum Genet 8, 241-259 (2007). XX DR [1] (Consensus) XX CC This element is shared by all mammals including marsupials. It CC cannot be classified at this point. Its presence in two mosaic CC forms MARE1 and MARE2 is characteristic for SINE elements. Also CC MARE2 contains microsatellite-like repeats in its 3' end, CC characteristic for some SINE elements. MARE1 contains a 5' CC stem-loop like structure at positions 16-55. It is different from CC all currently known mammalian repeats and it is detectable in CC variable number of copies ranging from hundreds to as many as CC 3000 copies per mammalian genome. MAREs may be useful markers CC for mammalian phylogenetic studies. XX SQ Sequence 167 BP; 54 A; 33 C; 31 G; 49 T; 0 other; tacaggcagt ccccaactta caaatgggtt gtgttccaaa agttcatttg taagtcagtt 60 gtttggaact tagaatacat tttcccatag aaacaatgtt ataaatggtg gttaggttcc 120 caggccagcc cacaaaagcc tatttaaccc ataatgtagc tgaaata 167 // ID RMER6D repbase; DNA; ROD; 784 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW RMER6D. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-784 RA Pavlicek A. 
and Jurka J.; RT "RMER6D - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Individual copies are ~85% identical to the CC consensus. 6 bp TSDs. XX SQ Sequence 784 BP; 265 A; 134 C; 216 G; 169 T; 0 other; tgtagaggaa attttaatcc agcaacatgg ctgctctggc aagggatcac atccaaagac 60 cttctaggtc tatatgatct ggctagaatg cagacatgcc ctggattaca ggtatgatct 120 ggctggaata cagacatacc cttacgtaca cacctttaat ccctaacaat gaaggtaagg 180 ttagtttgta gaaggaagca gccatgtttg aaaagtgaca tctaattgag gggcagacaa 240 agtgatgaat cagaagaaag atttgacaga atgagtcaga gataggatat gcccaactct 300 catgagaaca gcacaggaaa agagaggcta cttaagagca gcaagggaga gagaaaagta 360 agagagagac aaggggtggg gtgtggtgtg tggtgggtgg tgtgtgtgta tggcagtttt 420 tactgggaca gttttacaga gacaggtttg cagagaagaa caagctagac acaggtgaag 480 acagaatgag ccagagaatg agaaggagcc agaagattag aacatattgc caaagttagt 540 atgaggccag gcagagcaat tcagtcagaa gccgagagaa gctagattga atcagtcagc 600 ttggaagatt agatttgtat gaaggaggct agaagcttcc aggcctaggc ctaggcctag 660 ggatagttag aacagagaaa gaaatactct gggctcagcc caagccgtgt attcacacag 720 cttgggtaca gctctcatct catcccttca tctgaggaaa taaaagtgac atttacacaa 780 caca 784 // ID RLTR28_MM repbase; DNA; ROD; 453 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 18-SEP-2008 (Rel. 7.09, Last updated, Version 2) XX DE Mouse putative long terminal repeat - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW Long terminal repeat; retrotransposon; RLTR11A; RLTR28_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-453 RA Jurka J. and Drazkiewicz A.; RT "RLTR28_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 13-13 (2002). 
XX DR [1] (Consensus) XX CC 83% or less similar to RLTR11A elements described in GenBank CC (AL590503, AC015890, AL591865, AL365314). 69% identity to RLTR11A CC (rodrep.ref) (bases 1-427). Similar to RLTR34_MM (66%, bases CC 22-209), RLTR18_MM (76%, bases 1-470), RLTR23_MM (72%, bases CC 33-208). XX SQ Sequence 453 BP; 133 A; 88 C; 141 G; 91 T; 0 other; tgttgtggat ttggtctaat gctgcttgta ttatgttaat ttgggtcccc aaaattgcac 60 gagaatccac acatccacat gtaagacact gagggtccct gcccccagtt ggttttgatt 120 ggtaaataaa gttgccagca gccaatggct gggcagagag acagaggcag gactttagga 180 ttcctaggca agaggacgga gggaggaagg aagaagaagt agaaccacca tgccaggaaa 240 ggagaaagat ccaggcctga gaagtgcagg agagagagca tagccatcat gtaagagcca 300 gggaagagcg gccccagggg ccccccccaa ctgggtccag ggcagccaag atggaatata 360 gattttagta agtaataact caggaatatc aggggaggtg gattagccat gtggaggttt 420 gggagtggcc cagccattga gctgtttaag gca 453 // ID L32 repbase; DNA; ROD; 408 BP. XX AC K02061; XX DT 29-MAY-1998 (Rel. 3.1, Created) DT 29-MAY-1998 (Rel. 3.1, Last updated, Version 1) XX DE Mouse ribosomal protein pseudogene rpL32-4A coding for L32. XX KW Pseudogene; ribosomal protein; L32. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-408 RA Dudov P.K. and Perry P.R.; RT "The gene family encoding the mouse ribosomal protein L32 RT contains a uniquely expressed intron-containing gene and an RT unmutated processed gene."; RL Cell 37, 457-468 (1984). XX DR GenBank; K02061; Positions 113 520. 
XX SQ Sequence 408 BP; 128 A; 103 C; 107 G; 70 T; 0 other; atggctgccc tccggcctct ggtgaagccc aagatcgtca aaaagaggac caagaagttc 60 atcaggcacc agtcagaccg atatgtgaaa attaagcgaa actggcggaa acccagaggc 120 attgacaaca gggtgcggag aaggttcaag ggccagatcc tgatgcccaa catcggttat 180 gggagcaaca agaaaaccaa gcacatgctg cccagcggct tccgcaagtt cctggtccac 240 aatgtcaagg agctggaggt gctgctgatg tgcaacaaat cttactgtgc tgagattgct 300 cacaatgtgt cctctaagaa ccgaaaagcc attgtagaaa gagcagcaca gctggccatc 360 agagtcacca atcccaacgc caggctacgc agcgaagaaa atgaatag 408 // ID LPKR_RN repbase; DNA; ROD; 316 BP. XX AC X05684; XX DT 28-SEP-1995 (Rel. 1.2, Created) DT 17-APR-1997 (Rel. 3, Last updated, Version 3) XX DE Repetitive segment of L-type pyruvate kinase. XX KW Repetitive sequence; pyruvate kinase; RNLPKG; LPKR_RN. XX OS Rattus norvegicus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Rattus. XX RN [1] RP 1-316 RA Milner J.R., Bloom E.F., Lai C., Lerner A.R. and Sutcliffe G.J.; RT "Brain-specific genes have identifier sequences in their RT introns."; RL Proc. Natl. Acad. Sci. USA 81, 713-717 (1984). XX RN [2] RP 1-316 RA Cognet M., Lone C.Y., Vaulont S., Kahn A. and Marie J.; RT "Structure of the rat L-type pyruvate kinase gene."; RL J. Mol. Biol 196(1), 11-25 (1987). XX DR GenBank; X05684; Positions 4072 4387. XX SQ Sequence 316 BP; 68 A; 56 C; 79 G; 113 T; 0 other; gggggtgggt ttgtttttgt ttttgttttt gtttttttga gatcgggtct ttttatgttg 60 tagctctgac tgatggatag gcaggtctca aatttagaaa tctccctacc tttgcctcct 120 gagagttgga ctgggaccac cttgctccac atgaggctgg gtttttgaga cagggtctca 180 ggtagccttg gatatttcta gcttatttca tagaccctat tatatcgtct tagcctggtg 240 aaagcatagg tatgtgatac cacacccaac ttaacaaatg cttgttgagt acctggaaaa 300 caaggtttgg aattgg 316 // ID MLT1F1 repbase; DNA; ROD; 561 BP. XX AC . XX DT 17-FEB-1999 (Rel. 6.8, Created) DT 17-FEB-1999 (Rel. 
6.8, Last updated, Version 1) XX DE MALR long terminal repeat; MLT1F1 subfamily: a consensus. XX KW Non-LTR retrotransposon; MaLR family; MLT1f subfamily; MLT1F1. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-561 RA Jurka J.; RT "MLT1F1."; RL Direct Submission to Repbase Update (FEB-1999). XX DR [1] (Consensus) XX CC The primary difference from MLT1F is in the 5'-region. XX SQ Sequence 561 BP; 146 A; 146 C; 124 G; 141 T; 4 other; tgtccacaaa ttctttgata ctcctcttaa gaggtggagt ctaattcccc tccccttgaa 60 tntgggctgg acttagtgac ttgcttctaa ccaatagaat atggcagaag tgatggtatg 120 tgacttctaa ggctaggtca taaaaggcat tgywgtagct tcctncttgc tctctctctc 180 tctctctctt ggatcactca ctctggggga agccagctgc catgtcatga ggacactcaa 240 gcagccctgt ggagaggccc atgtggcaag gaactgaggc ctcctgccaa cagccagcaa 300 ggaactgagg cctcctgcca acagccatgt gagtgagcca tcttggaagc agatcctcca 360 gccccagtca agccttcaga tgactgcagc cccagctaac atcttgactg caacctcatg 420 agagaccctg agccagaacc acccagctaa gctgctccta aattcctgac ccacagaaac 480 tgtgagagat aataaatgtt tgttgtttta agccactaag ttttggggta atttgttatg 540 cagcaataga taactaatac a 561 // ID CAVID2C2 repbase; DNA; ROD; 84 BP. XX AC . XX DT 26-DEC-2009 (Rel. 15.03, Created) DT 26-DEC-2009 (Rel. 15.03, Last updated, Version 3) XX DE tRNA-derived SINE family: consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW CAVID2C2. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-84 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 10(3), 505-505 (2010). XX DR [1] (Consensus) XX CC >93% identical to consensus. XX SQ Sequence 84 BP; 29 A; 19 C; 21 G; 15 T; 0 other; gctggggatt tagctcagca gcacaagcac ctgcctggca agcacaaggt tgtgagttcg 60 atccctggta caaaaaaaaa aaaa 84 // ID RLTR1B_MM repbase; DNA; ROD; 513 BP. XX AC . 
XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW RLTR1B_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-513 RA Pavlicek A. and Jurka J.; RT "RLTR1B_MM - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. RLTR1_MM subfamily (82 % identity). Individual CC copies are ~95% identical to the consensus. 4 bp TSDs. XX SQ Sequence 513 BP; 125 A; 136 C; 117 G; 135 T; 0 other; tgaaagaaaa taaaacttga gacctgtgat tcatgtaatt tatgtcaaat agcccaaaga 60 gttgtttgtg agctttgaaa cctggggctg agaacatagc agaacaggcc aggacatgcc 120 cgggcaggcc cgtcgttaag acattcctga ggctgcttgg ccataaagat aaagagaatg 180 acatgtccgg actggcccgg cgcctcccta tctcccgccc ttctgaccta agttaaatgt 240 tatctgcatg tacagtctgc tgatgtttaa atggaccaat catgtgaaac cgcgccaatt 300 cctcccccag ccccagcccc ttttctataa aaacccctag cttccaagcc tcgtggtcga 360 atccactgtc tcctgcgtga gatacgtttc gacccggagc tccgccatta aactacctca 420 tgtttttaca tcaagacggt ctgttctgtt cgtgattctt gggtgcacgc cgaatcggga 480 gttgagtggg ggtttcccca ctaggttctt tca 513 // ID MARE2 repbase; DNA; ROD; 355 BP. XX AC . XX DT 28-JUN-2006 (Rel. 11.06, Created) DT 28-JUN-2006 (Rel. 11.06, Last updated, Version 1) XX DE Mammalian-wide repeat - consensus. XX KW MARE1; MARE2. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. 
XX RN [1] RP 1-355 RA Jurka J.; RT "MARE2: An ancient mammalian repetitive element."; RL Repbase Reports 6(6), 345-345 (2006). XX DR [1] (Consensus) XX CC This subfamily is less than one third as abundant as MARE1. For CC other comments see MARE1. XX SQ Sequence 355 BP; 131 A; 42 C; 72 G; 107 T; 3 other; ttccaaaagt tcatttgtta aagttagttg tttggaactc agaatacatt ttcccataga 60 aacaatgtta taaatggtgg ttaggttccc agggctagcc cacaaaagcc tatttaaccc 120 ataatgtagc tgaaatactg tacatttgca atgaaaatag tagaaaayaa tactgttgca 180 atactagtaa ttaaacaaaa cagaaaaaca ataaaattga ataagaaata attttttatt 240 tatcttgaat gctggtgctt gaggaaaagc aggtttaatt agaagaggag gaggtatatg 300 aaaagttttg tggagattct gaaggggact tctgggyayt ttcaagggct ttaga 355 // ID MER77 repbase; DNA; ROD; 650 BP. XX AC . XX DT 18-APR-1997 (Rel. 6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 4) XX DE MER77 repetitive element - a consensus. XX KW Interspersed repeat; MER77. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-650 RA Smit F.A.; RT "MER77."; RL Direct Submission to Repbase Update (1996). XX DR [1] (Consensus) XX CC Related to MER68 and MER21. 
XX SQ Sequence 650 BP; 164 A; 161 C; 156 G; 137 T; 32 other; actacggggg gaggtgtcaa attcaaaggt ctccaagacm accctcctgt ttartgattc 60 gctagaagga ctcacagaac tcagaaaagc cgttatactc acagttacgg tttattacag 120 tgaaaggata cagattaaag tcagcaaagg graaaggcac ataggncagn gtccaagaga 180 rmcaggcacg rgcttccagt tgtcctctcc cggcggagtc gtrcgggcag cgcttaattc 240 tcccagcaac grtgtgtgac agcacgcatg aagtattgcc aaccagggaa gctcacccga 300 gccttggtgt ccagagtttt trttgggggt cggtnanata ggcatggntg accccgcagc 360 atggctgacc ttggtcttct caggcttgag ccncccaagc atggctgacc tnagttactc 420 agtctycagc ccctccagar gtcarryacc gtgtagccta aggcccccac cataaatcac 480 attgttagca trractgtcc ggtatggccc aaggccytcg cagataaaca aagayactyt 540 tatcaggcag gacattccaa gggcttagag gttatctccc arrgcctgag gataamcgag 600 ggccaganct ttctttgggc aaggttaatc ctttactgya yaagaccaca 650 // ID MamGypLTR1c_LTR repbase; DNA; ROD; 803 BP. XX AC . XX DT 05-FEB-2007 (Rel. 12.01, Created) DT 10-APR-2007 (Rel. 12.01, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from mammals. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW MamGypLTR1c_LTR. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-803 RA Smit A.F.; RT "MamGypLTR1c_LTR - Gypsy LTR Retrotransposon from mammals."; RL Direct Submission to Repbase Update (05-FEB-2007). XX DR [1] (Consensus) XX CC 4 bp TSD; 32% subst in dog-human; 90% similar to MamGypLTR1c. 
XX SQ Sequence 803 BP; 196 A; 187 C; 243 G; 163 T; 14 other; tgtggcngga taatattttg agatattaat ctatgtnttt ttttcctcct gtatttcccc 60 ctttcctcct tccccccatt caagcaggta gctggctctg tgctcattgc ctcaggggag 120 gtatgtggca gggcagaaag cagaagtagc ctgcaagtct ttctggcttt tgttttccaa 180 aagcctaagc ccttaggaga acttagagga tttncggagg aggcataaag aaagngtctt 240 gaagaaacat gaagggagaa ggatttcccc cagactagaa gggagagatt cccccgggct 300 gggaagggaa nggagagagg tctgtgggtc ctgggaggag agcagngggg acctgngccc 360 tgcttcctgg cagcgccccg gggaggcggc aagaccccag agaggaatgg ctgcgtggtg 420 cgtctaggca gacgggacca caggcagcct cgcngaagat tcccgtgccc caagcgtggc 480 ncggaagcag cagagagccg ccggacctga aggggccatg cggacaggga caatggacgt 540 ctcagcggna acctgtgtgg acngatgacc gangaccaga gggcnccccc atccccaatg 600 ccttggcact gtgtaagatc cctggaacct tggcacaacc ctggggaagg gagggggaac 660 cccaagaatg actgaggtta agtttcccac cagcccggcg ggatgggggc tcaaaataga 720 aattaagttg atttatagaa aataaagaaa tgtnatattt cttgcacacc tgagtttgtg 780 gactgagatt catacctgct aca 803 // ID LTR5_Cpo repbase; DNA; ROD; 379 BP. XX AC . XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 31-JUL-2009 (Rel. 14.07, Last updated, Version 2) XX DE Long terminal repeat of ERV3 endogenous retrovirus: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR5_CPo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-379 RA Jurka J. and Baney O.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1546-1546 (2009). XX DR [1] (Consensus) XX CC ~84% identical to consensus. 
XX SQ Sequence 379 BP; 88 A; 84 C; 88 G; 118 T; 1 other; tgttatggct tgttctagat gtccccccaa agcctcatgt agtcataggg ggtttttcag 60 aggtggctgg atctagagtg tgtgatctgg attataatgg ctaggattaa gggtgtgagt 120 gctacctrcc ctagggtagc ctggataaaa tataagggac agaaagaggc ttgttcctct 180 ttctcttgtt gcttttgctg tcttctgccc accatgaact gtttctcctc tgtgatgccc 240 ctctgccatg ccaccctgcc ttggagccag ctaattatgg actgaaacct ctacaaactg 300 tgagctaaat aaacctttcc tcctttaact ttgggtgtca ggtattttgt ctcagcaacg 360 aagaaaagta actaagaca 379 // ID L1MC2 repbase; DNA; ROD; 1072 BP. XX AC . XX DT 20-FEB-1997 (Rel. 5, Created) DT 20-FEB-1997 (Rel. 5, Last updated, Version 1) XX DE 3'-end of L1 repeat (subfamily L1MC2) - a consensus sequence. XX KW Repetitive sequence; L1 (LINE) family; MER16; L1MC2 subfamily; KW L1MC2. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1072 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [2] RP 1-1072 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [2] (Consensus) XX CC Replaces MER16 (Acc. No. X59020) CC Temporarily contains ORF2 region consensus of L1MB7 (subfam L1M4) CC ORF2 ends at bp 675; average divergence of copies from consensus: CC 18%. 
XX SQ Sequence 1072 BP; 427 A; 157 C; 184 G; 257 T; 47 other; yttgtatcca gaatatataa agaactctta caactcaaca ataaaaaaac aaacaaccta 60 attaaaaaat gggcaaaaga yttgaataga tatttctcca aagaagatat ayaaatggcc 120 aataagcaca tgaaaagatr ctcaacatca ttagtcatca gggaaatgca aatyaaaacc 180 acaatgagat aycacttcac acccaytaga atggctaaaa ttaaaaagac agrmaatamc 240 aartgttggy raggatgtrg agaaaytgga achctcatac aytgctggtg ggaatgtaaa 300 atggtacarc yactttggaa aacagtttgg cagttcctca aaaagttaam aatagagtta 360 ccatatgacc cagcaattyc actcctaggt atwtacccaa gagaaatgaa aacatayrtc 420 cayacaaaaa cttgtacaca aatgttcata gcagcattat tcataatagc caaaaagtgg 480 aaacaaccca aatgtccatc aatratwgaa tggataaaca aaatgtggta tatccataca 540 atggaatatt attcagcmat aaaaaggaat gaagtamtga tmyatgcaac aacatggatg 600 aaccttgaaa acattatgct aagtgaaaga agccarrcac aaaagrccac atattgtatg 660 attccaacta tatgacattc tgaaaaaggn aaaactatgg agacaagtaa aaagatcagw 720 gattnctaga gkgastrgga rrgggaggga agaataggtg gagcncaggg gatttttagg 780 gcagcgaaac tattctgtat gatactataa tggtggatac atgacattat acatttgtca 840 aaayccatag aactgtacaa yacaaagagt gaaccctaat gtatggactt tagttaataa 900 taatgtatca atgttggttc atcaattgta acaaatgtac cacattaatg caagatgtta 960 ataatagggt aaactrttgt gtgggggagg gagtatatgg gaactctctg tactttctgc 1020 tcaatttttc tgtaaaccta aaactgyttc aaaaaataaa gtctattaat aa 1072 // ID RNLTR19A_LTR repbase; DNA; ROD; 496 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from rat. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; RNLTR19A_LTR. XX OS Rattus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-496 RA Smit A.F.; RT "RNLTR19A_LTR - ERV1 Endogenous Retrovirus from rat."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC <5% div. 
XX SQ Sequence 496 BP; 116 A; 134 C; 113 G; 133 T; 0 other; tgaagggtta aaatttctaa aagttagtca tgaacaaagt ccagacatgc ctgagagaat 60 ttgagatagg gctcactggc ccgactatca actcccaagg acactggcca tctggagcca 120 ccccctcccc ttccttcact ccctgaggat gggtcatccc cctgctgcag tgagatgagt 180 tgccctgccc acattgctga gatgctagat tacacgtatg ctttgttgat gtaactccgt 240 gccaaaagta actgtgcacc ctttgtatta gccaattatg tgtattcaca cgaaacccct 300 ggttttccct atataagctc ctgcctagag aggctcgggg cttgactcaa tctcctgtgt 360 gagatacgtg tcagcccgag atctcgtaaa taaaactgcc tcttgttgat tacatcaaga 420 ccggcttctc gtgtttcctg ggggtacctc aacccgtgac tggagcgaga gtctccccaa 480 gtctgggggt ctttca 496 // ID RLTR19-int repbase; DNA; ROD; 6137 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 19-SEP-2008 (Rel. 13.08, Last updated, Version 2) XX DE ERV2 Endogenous Retrovirus from Muridae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW RLTR19-int. XX OS Muridae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea. XX RN [1] RP 1-6137 RA Smit A.F.; RT "RLTR19-int - ERV2 Endogenous Retrovirus from Muridae."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC (5 bp dups though) some copies as little as 7% div; pos 1-3624 CC similar to ERVK elements (closest to co-hybrids RMER17C-int and CC RMER16-int, and to ETnERV3 and MYSERV); 3' end has patch-like CC similarities to ERV1 group internal sequences (!) The internal CC sequence at pos 1433<-1711 contains an ancient fragment of an CC hnRNP core protein A1 pseudogene. 
XX SQ Sequence 6137 BP; 1826 A; 1247 C; 1231 G; 1782 T; 51 other; tttggtgcgt tggccgggaa gcaagccccc taccctcgag cacccctcag ttactgccag 60 tgaatgccac caagccggct gggccctccg tgaggaggta agtttcctgt acctgtgcag 120 ttcttttttc ttgtttttgt tgtttctggt ttggttggac tttcggagac ccaagagagg 180 agagctggac gcagaaatnt ccttgggtcn gaggaggtca ggagacgtcc tgcttcctct 240 ctggttccca ctgggagaac gcctggttct ggttctgggt ccctttggtg ggacatctgg 300 gtccctttgg tgggacatct ggntctcttc cctttcctcc ccaactctat tggttgtcct 360 tggcgcgctg gtctgtccat gtctgtcagt ttatgtctgt agtttcgttt nattgcttga 420 ttgttgctct gtgtttcata ttcagaggaa aaatggttaa aacttcatct gctggccgtt 480 taccccttga cttagtttta actcatttca aggattttaa gaaaaaaagc agcctgaatt 540 gggacaacaa aaactgataa actgcccttg gagccctgca cctgcgctag gagtctagca 600 ctgctggaga agccctgcca gggctgcttg gctgtacagc cgctcttaaa ggggncagca 660 gcttctcggc ttctctggta agaaagctga tggctagttc tacaaaatct ctgtgcccat 720 gagaattgga ttcaacaggt ggcaaggtgc tcctcccttg aaattacagc tagaaaaaaa 780 gaaaatttct gtctgttcta aatgtataaa tgtatgtggc cactacatnt ttgtttctga 840 ctggtcttaa atgtataagt ttactatgtt ctgcatgtct tggttataga ttattggctt 900 ataagttatt gggtatggtt aaaaatctgt aacatcngta acaaaaagtt agcttaaaac 960 nagtaactca gggttgaagc cattctgggc aacaacacac ggcatgagcc aacccaggga 1020 aacaggtctc taaagatact tttaaattgg gataatattt ttatgtaatt cctgtcctaa 1080 aaaccagact tataaaagat aggatttaaa aaatgtttct ttaatgaggt attaaagctg 1140 caccttcatg cattcatata caagaacttt tcttgctggc agccaaactt tgtaacatca 1200 aaataatgca cttggtattg attttagaga cactggattg tttaaactgt taaaaattaa 1260 aaattggttt tgtcttctaa aaattatggt tatgctctat gacttcactc tttaaaaaga 1320 tactttattt tgatattgca aaagcaactt taaaaattat agttaaatat ataaagctat 1380 gggacatttt aaaatttatt aaatgtttta ttggaatgtt ttattaatat atgatggtta 1440 caataaagga ggaaatttta atgaaataac tataatgatg atngaaacta taatgacttt 1500 agaaattata atggacaaca gcaatcaaat tatggaccca tgaagggggg acaattntgg 1560 tggaagaagc tcaggcaatc cctatggtgg tagctatgga tctgatgatg gaaatgatgg 1620 atatgatagc agaagtttta aaataaaaca 
gaaacgggta cagttcttag aggagagaga 1680 atgaggagtt gtcaggaaag ctgcaggtta ctttgagaca gtcgtcccaa atgcattaga 1740 ggaacantaa aaatctgcca cagaaggaat gatgatccat agtcagaaaa ttactgcagc 1800 ttaaacagga aaccttcttg ttcagactgt catgccacag tttacaaaaa atacagctat 1860 tgattaatgc aatatgatgt cagttagata tacattcctg aggntttttt atctgttgta 1920 gctttgtctt tttcttttca ttacgtcagg tatattgctc tgtaaattat ggtaatgata 1980 ccaggaataa aaattaagga atttgttaat ttaaaanttt ttagaggttt acaatattaa 2040 aaaggttaag aatcactggc tcttgaattt gcctgagctc tggcaaggct ccagcatgcc 2100 cgagtcagta aggctctttc agccgaggtc ttgcagtttt tcccaactgt taaccttttc 2160 tgtcctgaca ctggtttcag cttaaactga atcatatgag aaactgttat ctctctcaga 2220 aggtcagaaa agttcctagt ctctttatgg agtatgttta tgggtttttt ttaatactag 2280 aagagcttca attcaaaact gtaattttaa ggttcaagcc taacagggat tgatagtcag 2340 taaccttgaa ggtgatcaaa tcctttaata tgttcagaaa tatatttaaa gtcatgctaa 2400 gtactgatgc agttaattnc aagattaaag ctttatttag tctcctgttt tatgtttgna 2460 aggtacagct tagagcagat aactaagaac aaacaaagnt tgtttaactc agatatgcta 2520 ggtaggtact agccctcaaa ccagtcagag atctgctgaa tatggcattt aatatgttta 2580 aacttaccat aacagacaga gactcccaaa tcctaacagt gacccccaag gtctccaaga 2640 agatatgggc acaacgacaa aggacaccac cnggattgtg gtatgataac cactgggcat 2700 aactgcccca atgccttgcc tgctgccagg gcccagcctg aactgtggac aaacagagga 2760 caactggaga attgattgcc acacgttgcc taagacaagg tgaggtcagt ctctcccatg 2820 ttcctcctcc acaggaaaaa gcctcttcat cttctgggcc tgatggccaa agactgcctc 2880 tgcccttggt gccatagaga cacaggagcc tgggataact gtctaggtaa taaaatatgg 2940 actagtctgt cattttaatt gatacatagg ccatttagat tacaatttat ccttctcaga 3000 tctctgatca cattgatggc taagctaatt gtagcttgac agctagaaaa caataggcaa 3060 ctagctcctt acctcaaggt aatccaccac tgtgntagtt cattagtcaa ttcattagtt 3120 agttaaaact actaggtctc ttattaaaaa gggcaaacag gtttgccttg ctcacaggct 3180 tcatattaaa tgactgctct caataatcaa ctgcctatgt ctcatagtaa ataattggat 3240 taaaaatata aaatttaann ttatttaagg tctagaaaaa tgtttatggg tctagaaaaa 3300 tgtttgagat tgaaaatgca gtgataaagg ttagaggata 
aaaaacttat gatggctaga 3360 aaatgnttta aataagaatc ttcaataaaa atgttaaggt tggtaaatga actaagattt 3420 aagggtctaa gaagatgttt taggtatata aatagcaagt tatagaggta taaaaagtaa 3480 tttaagaaat ggaaaatgtt tcatattccc ccatgctatt gttatttcaa agttcagaat 3540 tttaacattg atcaatggag ttctgataag ctaatggagc actggcagct tacaattcag 3600 tcacaagctc aagattttaa attccctttg gttgtcttct aaatatagac ttaaaggtgc 3660 ttctcatcat gctaaaaaca tctctgttca atctgtatat ctagccctct ggactgagaa 3720 aagaatgttc tgtgcctttt gacccaaacc tatagctttt actaggtctc aaaggcatga 3780 gtctactcct gttgtcttac agacacctgt taactacttt ttctaaaata tggatttcaa 3840 tacaatggta aatactaatg tatatccttt tgtaaactaa agttcacata agcttcaggg 3900 aatcaaaaag tcataagatt tggatccacc tgccacagat caaatggact ccagataagt 3960 acacctgccc tgttcttgaa aatgggattt ttcccttggc cttgggattc cttaccttcc 4020 ccaattacta gacagtttta acttctgtcc ctagtctata tttgtctcag cagattttca 4080 cctggctgac agactccatc cagggatcac ccaatgnanc tgctgaactc tggacttgct 4140 gaaagctgat gttaaccagt ccagctgatg tgattgctcc ctgtctttca tctggatcag 4200 ctaatcagat cagatgcttc tgataaatgc cccattgccc agcctttgac tagcatttca 4260 gccttcctgg gcccctntga caacgtccta atgtcagcng gaagcagtta cagaagagaa 4320 atacgncgtc cattgtccca ccttacaggc tgaaatgcta agtcaaaagg aanccccctg 4380 ggnccacgct gaaaaggacc ccacttcntg atcctgacca ccccaatggc catgaaaata 4440 gatgggatnc aatcctggat tcatcactct catctgaagt tcaccccctg gaaatatcag 4500 gaccgggacc agcgatgggt catcagacaa cacccacagg acccattaaa aatcagactg 4560 gtaaaagact ctggagacta gttttctntt ttgtgttgcc ttgggctgct gaggcacatg 4620 ctccagtaca acacatatga actttgatta gaactacaga caggagccta attactaata 4680 tcactgtcta tggttccccc accctcacct ttganttatt tgggccaaaa tggaatangt 4740 gcccaaagac tgcaaacata tatgttccct cattctctag gtcttttaga tataggatcc 4800 agaatatgga aaaagtaatt tggccaccct ttagatatat gcctgtccct caaatggaga 4860 ccctgattgt aggggacaag atatgtactt ttntgctata tggggctgtg aaacattagc 4920 tccctgggta actgataagg ataattatat tcagttacag agggttgagg gccactctga 4980 aaaatctggg aaaagaaatc aaccccattc aaattaagat taaaacgata 
taaataatgt 5040 agganatagt acagggaaac agaggatgtg ttttngataa ggccacctgt tgagaacaat 5100 tatagcccct atgggaatgg tcattacaac cacctggaag aggaaaaaga aatgtcccgt 5160 ccccacctag acaaaggaca aatgacncaa taaacacctt tggtctgaga gtgccagaaa 5220 ggggccctcn ctggcacaga aaaggacagc cagccctcat ctgcccagac atggacttgg 5280 ttacctccac tggccaccat agcccgctct gggtaccgtc acctctgcct ctgacattcc 5340 tgccccggta caagctttgc tngagaccaa cactgccagt cagacnccag ttcaggcctg 5400 ctgcaggtcg ctcctgacgg gcntatcaat gctacactca gcttgtttcc aacgtacggg 5460 atatgttgaa tcaggaagga cttagggact ctgcctttat gcatctggta angaccctgg 5520 aatgatgttc accatccaga gattccaggt caccaatann aggatcttag acatcgttag 5580 tcccaacaaa gtattaaacc ctctccctga gccacctgan tctgggaggg attaaaggac 5640 aaggcctctg cttatacaca ggttnatata agatatcaga gtctcggtac cttcaacact 5700 gtaatgtaac aatccaccta cagatgttaa ctgcaggagt ttccccttac natttagtac 5760 ctccaaaggg cacttggttt gcttgtgcct cgggacaact ccctgtntca gtccctttat 5820 cctcactaaa acttctgact cttgcttact tgtacatcta ttgcctcaga tatactatta 5880 ttctagagaa gangattgga acatctggga cttcacacta atcccagatg gccagggcag 5940 ctccaatact cgtgcccctc ctantaggca tggacatagc agggtctgcc agcatgggag 6000 cagcaacact tattaaggga gatcaaattg tcaaaaaatt ttaagccaac aaaggttgat 6060 ctaagtttgt ccccctggcc tcagaaaaan taattgagtc aatgatttga ctcactggaa 6120 caactcaaag gggggaa 6137 // ID LX9 repbase; DNA; ROD; 1202 BP. XX AC . XX DT 26-SEP-1997 (Rel. 2.08, Created) DT 26-SEP-1997 (Rel. 2.08, Last updated, Version 1) XX DE 3'-end of L1 repeat (subfamily LX9) - a consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW Repetitive sequence; L1 (LINE) family; LX9 subfamily; LX9. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-1202 RA Smit A.F.; RT "LX9."; RL Direct Submission to Repbase Update (30-NOV-1996). 
XX DR [1] (Consensus) XX CC Partial 3' untranslated region of rodent LINE1 subfamily LX9. CC The nomenclature for the rodent LINE1 subfamilies (as based on CC the CC 3' ends) is temporary, awaiting analysis of their relationship. XX SQ Sequence 1202 BP; 314 A; 247 C; 326 G; 290 T; 25 other; gntggacaat gctcctccca gaagacatag gtcaccaaat gaaaatctca gcgccaggta 60 cgggmtacct ccttatgagc gattggtcag ggaggccccg taggcgtcca aaacaataca 120 ggctgttgcc attgctcttg gttgcccacc agaacttgat rgtaagaccc tattgctgaa 180 gacaccacac actttggtcg caggacatag agaaatcaag ttggarctga gctggaagcc 240 tcctccctgc tggctagctt tcacagtgcc agaaggtgct atgcaggctg ccgggaggga 300 aaagtyatca atggtcttac ccagtgctgg gccttgaatg ctacaatacc aatctaccgg 360 tgcaatagtg gcatractgc tatgggggta accaaccgct ttctgattgt atttgaggcc 420 tgctccacag gagggaattc attcttggta ctgtaaacct ggtcaaaagc ctatggctgk 480 ggaacgtcat aggccctagg ggagagccta ctactgttat tttgctaaat ggtcatgttg 540 tcaaactgcc ttctaaatat ttatgtttat acccatagat tagtgctgct ctcaactttg 600 gtcagagaag ttcccttttg cagtgggcag cggttagcgc agagactcat aactggtcaa 660 agtgctgaga ataartgact gttgagtgct cagccctaaa tgggacatct atattacnna 720 cattccgagc tctcagggaa cattgnggaa gaagaggcag aaagantgta agagccggag 780 gatggggggg agtgctgtga aatgctgtct tctgracatg acatggccat tacactcacg 840 aactcatagc agctgtggtt atctgcacaa gacctgcata agatcgagcc agtcaacatt 900 ycatcataga tggaggagag gctcatgagg ccccacccca ctgargagct actggcagct 960 gatrgywgct gggggaggag tcattttttt ttgnnggtgt ggtcactggt aggttgccca 1020 tgttccagtg gatggcccca cacccacata catgtaggwa agacttattg gaytcagtgg 1080 gttatcaaaa aaaggggaca aggaacnaaa gaggatatga agktggcgga gggacatgkt 1140 gggggaattt gggtggagtt gaaggggaga aatggaatgt agttatgatc atatttcatt 1200 gt 1202 // ID RICKSHA repbase; DNA; ROD; 2030 BP. XX AC . XX DT 29-JUN-1998 (Rel. 6.5, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE RICKSHA repetitive element - a consensus. XX KW Non-autonomous DNA transposon fossil; composite mobile element; KW RICKSHA. 
XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-2030 RA Kapitonov V.V. and Jurka J.; RT "RICKSHA."; RL Direct Submission to Repbase Update (JUN-1998). XX DR [1] (Consensus) XX CC Putative non-autonomous DNA transposon fossil. Perfect 79 bp CC terminal CC inverted repeats. CC Target site is unclear. There is a tendency to have 8-10 flanking CC direct repeats. These repeats do not belong either to the target CC sequence or RICKSHA itself. CC Average identity of individual copies to the consensus sequence CC is 81%. CC RICKSHA is a composite element since it carries (positions CC 184-904 of CC the RICKSHA consensus sequence) a 3'-portion of HERVL endogenous CC retrovirus including its LTR (MLT2B). RICKSHA has been mobile CC also before CC the retroviral insertion; we found several copies of RICKSHA CC without HERVL-related portion. XX SQ Sequence 2030 BP; 577 A; 379 C; 414 G; 651 T; 9 other; gggtttggat cataatcccg aaagacacaa tcccaaacgc cataatcccg aatgttgaaa 60 tcccgaaaga tcaaaatcct aaagtctaaa tccctaaagt ctaaaatccc taatgtctaa 120 aatcccgaaa atcacaatca cgaaagatta aaatctcaaa tattgaaatc ctgaaagccg 180 aattctgggg aagggattag tgcgttttcg gttgtacgca ggatagttgc atcatgttag 240 ttgcatcatg ttaggtggca gaactattac cttgttattg tctttatttg gaaattaagt 300 atggtttaag gagacacgta tgggtgccaa gttgacaagg agtggacttg tggacttaat 360 tttaggtgtc aacttgactg gattaaggaa tacctagaaa cctggtaaag cattattttg 420 ggtgtgtctg tgagggtgtt tccagaggag attagtgtgt gagtctgagc ggactaggcg 480 gggaagatct gccctcaatg ttggcgagca ccatccaatc ggccgggggc ccggagagaa 540 caaatacaga aggcgaactg gtctctctct gagagctggg acagattttt cttctgctgc 600 cttggacatc agaactctgg gcttgctggc tttggactcc aggacttaca ccagtcctnn 660 naaccgggtc ctgaggcttt cggacctcag actgagagtt acaccattgg cttccctggt 720 tctgaggctt ttggacttgg actgagccat actgccggca tcccagggtc tccagcttgc 780 agacggcctg tcgtgggact tctcagccac cataatcgcg ttagccaatt cttctaataa 840 attccctctc atrtatatat atatcatatt ggttctgtct ctctggagaa 
ccctgattaa 900 tacagatttg gtattgggga agccgaatat cattccttct tactgtattc cttacaacat 960 aatagaagag atctgtgaaa ttgttccctc acaaaaaggc tgtgataaaa taagtggacg 1020 aggctactca attgtgaaag ataaaattta aaagctaatt attattggtg ctgcaaaagc 1080 agaaaatcac ttaattacaa tggccgagca ataaccagct tttaaatgga cagcatatac 1140 ttacaaaatt tgtagaccac aaccactctg caaatacaca tgcagcaagt gtcttgaaga 1200 tggcaamart gaaaattcag tttaaaaata cawsaattsc ctgccaaatt attcaatctg 1260 tatgacttct acttctttac acaaaattta tgctatgtat ttcatcttcg catcatttcc 1320 aatactggag gtataaattg tgtagagact tttagagagt tctaatttgt tttatgcatt 1380 ttttgcaaat ttgactccac gaaagtgcat tatcacaatg ttgactttgt gtgtaagcat 1440 tgtgcatgta tgtaaaaacg ttgaaacttc ctcaataaat gaagagatgt cctttttgta 1500 catctgcatt tgtgaaagat aaaatttctc aagatcttgg ctctttgggc gactgcatat 1560 gcggtggtga cccatcgcgg tttttgatcg atctcgtcaa aagacttagg ttgttcgtca 1620 cggtatttca gatgaccgca gttataaagc tgggtgcaca caattaccaa ccatagtgat 1680 atgcgtttat acatttccct ttttgaccta tttctttatg aatacggttc gtctgctcat 1740 aactgttata cccgtgcgac tgtcattagt atacctgagt gtttatgctt gcaaaaatat 1800 gtatgttatt attgcctatt ttattgtgta aagtggccta tgaagtgttc tgtcatgttt 1860 ttatatgttt ctcaaataaa tcccctttta aaaatgtaaa taaattatct tttaaagaat 1920 ttttaaattt tttttcagaa ttatattttc gggattttga tctttcggga tttcaacatt 1980 cgggattatg gcgtttggga ttgtgtcttt cgggattatg atccaaaccc 2030 // ID ORSL repbase; DNA; ROD; 275 BP. XX AC . XX DT 30-APR-1998 (Rel. 3.03, Created) DT 28-JUN-2000 (Rel. 5.05, Last updated, Version 3) XX DE Putative non-autonomous, hAT-like DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; KW Origin of replication-like (ORS8) region; ORSL. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 66-275 RA Rao S.B., Zannis-Hadjopoulos M., Price B.G., Reitman M. RA and Martin G.R.; RT "Direct submission."; RL Unpublished (1989). XX RN [2] RA Rao S.B., Zannis-Hadjopoulos M., Price B.G., Reitman M. 
RA and Martin G.R.; RT "Sequence similarities among monkey DNA-replication ori-enriched RT (ors) fragments."; RL Gene 87, 233-242 (1990). XX RN [3] RP 1-275 RA Jurka J.; RT "ORSL."; RL Direct Submission to Repbase Update (13-APR-1998). XX RN [4] RP 1-275 RA Smit A.F.; RT "ORSL."; RL Direct Submission to Repbase Update (31-JAN-2000). XX DR [4] (Consensus) XX CC ORSL shares a ~200-bp stretch of similarity with the African CC Green Monkey origin of replication region (Acc. No. M26221). CC About 1000 copies in our genome. The first version of the CC consensus CC sequence [3] has been significantly shortened [4]. CC This repeat has been classified as a putative hAT transposon [4] CC on the basis of 14-bp terminal inverted repeats and 8-bp CC target CC site duplications similar to those of other hAT transposons (esp. MER45, CC MER69). CC On average 21% divergence level. Orientation not yet clear. XX SQ Sequence 275 BP; 81 A; 51 C; 48 G; 95 T; 0 other; cagggctgca ttatgacttt cgtgggccct aggcactttt gccttcgtgg gccccttcct 60 ccataaaaaa atattaaaaa ttatatttta tgactgcatt ggtataaaga tgaatataga 120 atatattaat attatatatt aaaacatttt ctttgaccta aaagttcatt ttttcttctg 180 attttaaaag aaattaaaac attttcgtgg gcccctaaaa gtatcgtggg ccctaggcac 240 tgtgcctact gtgcctaatg gataagtcgg ccctg 275 // ID MER128 repbase; DNA; ROD; 329 BP. XX AC . XX DT 21-JUL-2006 (Rel. 11.07, Created) DT 16-AUG-2007 (Rel. 11.07, Last updated, Version 2) XX DE Unclassified mammalian repetitive element - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW conserved; Tigger14a; MER128. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-329 RA Jurka J.; RT "MER128: Unclassified, moderately repetitive element from RT mammals."; RL Repbase Reports 6(7), 381-381 (2006). XX RN [2] RP 1-329 RA Gentles AJ., Wakefield MJ., Kohany O., Gu W., Batzer MA., RA Pollock DD. 
and Jurka J.; RT "Evolutionary dynamics of transposable elements in the RT short-tailed opossum Monodelphis domestica."; RL Genome Res 17(7), 992-1004 (2007). XX RN [3] RP 1-329 RA Jurka J., Kapitonov V.V., Kohany O. and Jurka MV.; RT "Repetitive sequences in complex genomes: structure and RT evolution."; RL Annu Rev Genomics Hum Genet 8, 241-259 (2007). XX RN [4] RP 1-329 RA Smit A.; RT "Classified as Mariner and renamed as Tigger14a."; RL Direct Submission to Repbase Update (17-AUG-2007). XX DR [1] (Consensus) XX CC This sequence is present in >1000 copies phg. XX SQ Sequence 329 BP; 109 A; 54 C; 49 G; 115 T; 2 other; cagtaaaagc tcgtttatcc ggcattctat caaccagaac tctctattaa ctagcacttc 60 tgtacatcta tagtayaatg ataattgatg ttcataatga tgaccctgag gcactacaag 120 atcctgaagt gccttctgaa tcatcaaaga aagattaaat tatgttcagt atagttttag 180 tgttaagtgt attttattgt attctaattc tttgaaaatt ggtaatctat gtatggtata 240 tatgataact ctctattaac cagartattt gattaaccag aatacattat tcctgaccat 300 acccaatatg gataacagag agtttactg 329 // ID L1MD2 repbase; DNA; ROD; 1088 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1MD2) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; L1MD2 subfamily; L1MD2. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1088 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [1] (Consensus) XX CC ORF2 ends at bp 675; average divergence of copies from consensus: CC 18%. 
XX SQ Sequence 1088 BP; 411 A; 158 C; 196 G; 283 T; 40 other; cttgtatcca gaatatataa agaactctca aaactcaaca ataaaaaaac aaacaatcta 60 attaaaaaat gggcaaaaga catgaagaga catttcacca aagaagatat acaaatggca 120 aataagcaca tgaaaagatg ttcaacatca ttagctatta gggaaatgca aattaaaacc 180 acaatgagat atcactacac acctattaga atggctaaaa taaaaaataa tgacaayacc 240 aaatgctggc gaggatgtgg agaaactgga tcactcatac attgctggtg ggaatgtaaa 300 atggtacagc cactctggaa aatagtttgg cagtttctta taaagttaaa catacamtta 360 ccatatgacc cagcaattay actcctaggt atttatccca gagaaatgaa aacttatgtt 420 cacacaaaaa cttgtacacg aatgttyata gcagctttat tcataatagc cmaaaactgg 480 aaacaaccca gatgtccttc aacgggtgaa tggttaaaca aactgtggta tatccatacm 540 atggaatact attcagccat aaaaaggaat gaactattga tacatgcaac aacctggatg 600 aatctcaaga acattatgct gagtgaaaaa agccagtctc aaaaggttac atactgtatg 660 attccattta tatarcattc ttgaaacgac aaaactatag agatggagaa cagattagtg 720 rttgccaggk gttagggatr caggraggag gtggatgtgr ytataaarrg gtagcatgag 780 rgaatyttta tggtgatgga acwgttctgt atcttgaytg tggtgrtggt tacacgaatn 840 tatacatgtg ataaaattgc atagaactaa atayryryry rataagtaca agtaaaactg 900 gcgaaatctg aataagatwn atggrttgta ycaatgtcaa twtcctggtt ktgatattgt 960 aytatagttt tccaagatgt taccattggg ggaagctagg tgaagaacac acgggatccc 1020 yctgtattat ttcttacaat tgtntgtgag tktayaatta tttcaaaata aaaagtttaa 1080 tttaaaaa 1088 // ID BGLII_A_LTR repbase; DNA; ROD; 445 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 09-DEC-2005 (Rel. 10.11, Last updated, Version 1) XX DE Long terminal repeat of ERV2 Endogenous Retrovirus from Muridae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; RLTR16; BGLII_A_LTR. XX OS Muridae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea. XX RN [1] RP 1-445 RA Smit A.F.; RT "BGLII_A_LTR - ERV2 Endogenous Retrovirus from Muridae."; RL Direct Submission to Repbase Update (11-NOV-2005). 
XX DR [1] (Consensus) XX CC 12% subst. XX SQ Sequence 445 BP; 126 A; 65 C; 146 G; 108 T; 0 other; tgttgtggat tgccctggtg ctgtttgtat tttgatgcta attctgcttc cccaagaggg 60 gctgcctggg acgaggagtg aatcacgtac tcaggtgact tcatgtgaac cttctcccca 120 ttttaacttg taaaataaag gctagagccg gtgattgggc agtggaagag aaggtggagc 180 taaaaagttt tggggagagg aggagagaaa gagaagggga gaaaaagagg gaaagagaga 240 ggaagactga agaggaggag gtggaaggaa aatggagcag aagcacgtgg cctggagaaa 300 ccgcaagtta taagggatct catagctggg gaatagagta gtgtagtggt agatctgccc 360 aatctaggcg cgcagcttgt attcatatta attgagttgt gttttccttg cacgggctta 420 tttgggttgg agatttaccg caaca 445 // ID MER2 repbase; DNA; ROD; 345 BP. XX AC . XX DT 19-FEB-1997 (Rel. 5, Created) DT 19-FEB-1997 (Rel. 5, Last updated, Version 1) XX DE Nonautonomous DNA transposon. XX KW Interspersed repetitive sequence; MER2. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-345 RA Sheflin L., Celeste A. and Woodworth-Gutai M.; RT "Recombination in Simian Virus 40-infected cells: Structure of RT naturally arising variants ev-2114, ev-2102, and ev-1110."; RL J. Biol. Chem 258, 14315-14321 (1983). XX RN [2] RP 1-345 RA Jurka J.; RT "Novel families of interspersed repetitive elements from the RT human genome."; RL Nucleic Acids Res 18(1), 137-141 (1990). XX RN [3] RP 1-345 RA Smit F.A. and Riggs D.A.; RT "Tiggers and other DNA transposon fossils in the human genome."; RL Proc. Natl. Acad. Sci. USA 93, 1443-1448 (1996). XX DR [3] (Consensus) XX CC 24 bp terminal inverted repeats, TA target site. 
XX SQ Sequence 345 BP; 97 A; 69 C; 73 G; 100 T; 6 other; cagtcgyccc tccgtatccg tgygttccac atccgtggat tcaaccaacc gcggatcgaa 60 aatattcagg yaaaaaattg yatggttgcg tctgtactga acatgtacag acttttttnc 120 ttgtcattat tccctaaaca atacagtata acaactattt acatagcatt tacattgtat 180 taggtattat aagtaatcta gagatgattt aaagtatacg ggaggatgtg cgtaggttat 240 atgcaaatac tacgccattt tatatcaggg acttgagcat ccgcggattt tggtatctgc 300 ggggggtcct ggaaccaatc ccccacggat accgagggat yactg 345 // ID RSINE1 repbase; DNA; ROD; 177 BP. XX AC . XX DT 22-APR-1997 (Rel. 2.03, Created) DT 26-SEP-1997 (Rel. 2.08, Last updated, Version 2) XX DE SINE element; RSINE1 family - a consensus. XX KW SINE1/7SL; SINE; Non-LTR Retrotransposon; Transposable Element; KW retroposon; RSINE1 family; ID; RSINE2; RSINE1. XX OS Rodentia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires. XX RN [1] RP 1-177 RA Kapitonov V.V., Chopra V. and Jurka J.; RT "RSINE1."; RL Direct Submission to Repbase Update (30-NOV-1996). XX RN [2] RP 1-177 RA Smit A.F.; RT "RSINE1."; RL Direct Submission to Repbase Update (30-NOV-1996). XX CC ID-like SINE element. XX SQ Sequence 177 BP; 42 A; 59 C; 50 G; 26 T; 0 other; cggggccggc gagatggctc agtgggtaaa ggcgcttgcc gccaagcctg atgacctgag 60 ttcgatcccc gggacccaca tggtggaagg agagaactga ctctgaaatt cccgcaagtt 120 gtcctctgac ctccacacac gcgccgtggc acgcgcgcac acacacacac acacaca 177 // ID RNSAT1b repbase; DNA; ROD; 168 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from rat. XX KW Satellite; Simple Repeat; RNSAT1b. XX OS Rattus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-168 RA Smit A.F.; RT "RNSAT1b - Satellite from rat."; RL Direct Submission to Repbase Update (06-SEP-2005). 
XX DR [1] (Consensus) XX SQ Sequence 168 BP; 44 A; 30 C; 28 G; 62 T; 4 other; ccacattgat tacatttgta aggtttcact ccagtatgta ttgctttatg tkttcggaga 60 ctacmattac gtacaaaggc tttaccacat tgattacatt tgtaaggttt cactccagta 120 tgtattgctt tatgtkttcg gagactacma ttacgtacaa aggcttta 168 // ID MLT1F repbase; DNA; ROD; 541 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 18-APR-1997 (Rel. 6, Last updated, Version 2) XX DE Mammalian transposon-like element long terminal repeat (MLT1f DE subfamily) - a consensus. XX KW Repetitive sequence; MaLR family; MLT1f subfamily; MLT1F. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-541 RA Smit F.A.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21(8), 1863-1872 (1993). XX DR [1] (Consensus) XX SQ Sequence 541 BP; 135 A; 137 C; 123 G; 126 T; 20 other; tgtggtagcc agcctccaag atggccccca atgatccctg ctcctggtat tcacaycctt 60 gtatggtcyc ytcctacatt garyyagggc trgtctgtgt gaccaataga atagggcaga 120 agtgatggcg tgtsacttcc aagaytargt cayaaawaac actgtggytt ctgcyttgnt 180 ctcttcgggc tactcactct gggggaagcc agctgccatg ctatgaagac actcaagcag 240 cctatggaga agtccacgtg gsaaggaact gaggtctcct gccaacagcc agcttcgacy 300 tgccagccat gtgagtgagc catcttggaa gcggatcctc cagccccagt yaagccttca 360 gatgactgca gccccggctg acatcttgac tgcaacctca tgagagaccc tgagccagaa 420 ctacccagct aagctgctcc tarattcctg acccacagaa actgtgagat aataaatgtt 480 trttgtttta agccactaag ttttggggta atttgttacg cagcaataga taactaatac 540 a 541 // ID L1ME4 repbase; DNA; ROD; 596 BP. XX AC . XX DT 06-MAY-1999 (Rel. 6.9, Created) DT 06-MAY-1999 (Rel. 6.9, Last updated, Version 1) XX DE 3'-end of L1 repeat (subfamily L1ME4) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; L1ME4 subfamily; L1ME4. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. 
XX RN [1] RP 1-596 RA Jurka J.; RT "L1ME4."; RL Direct Submission to Repbase Update (MAR-1999). XX DR [1] (Consensus) XX CC An ancient L1 subfamily. XX SQ Sequence 596 BP; 216 A; 80 C; 113 G; 179 T; 8 other; tttggcaata tctatcaaaa tcacacatac atttaccctt tgacycagca atcccacttc 60 taggaattta tcctacagac atacttgcaa catgtacaaa atgacatata tacaaagtta 120 ntttattgca gcattatttg taatagcaaa aaaytggaaa caacctaaat gtccatcaat 180 aggaaaatgg ttaaataaat tatggtatat ccatacaatg gaatactatg cagctataaa 240 aaagaatgaa gaagatctct atgtactgat atggaatgat ctccaggata tattgtttaa 300 gtgaaaaaag caaggtgcaa gaatagtgta tatagtatgc taccttttgt gtaaaaaaga 360 aggaaaaata aaaatatata tatatatttg cttatatatg tataaaataa ctctggaaaa 420 ataaacaaga aactnataac agtggttgcc tcttntgggg ggaggggaac tgggnngntg 480 ggtggctggg ggacatgtgg gatggacagg gatgggaggg actgactttt cactgtatac 540 ctttttgtac tttgtacttt ttgaaatttt gaaccatgtg aatgtattac ctattc 596 // ID L1_RN repbase; DNA; ROD; 6914 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE L1 element from Rattus norvegicus - a consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; LINE1; L1_RN; KW LINE3_RN; L1 element. XX OS Rattus norvegicus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Rattus. XX RN [1] RA Shore K.S., Bacheler T.L., de Riel K.J., Barrows R.L. RA and Lynch J.M.; RT "Cloning and characterization of a rat-specific repetitive DNA RT sequence."; RL Gene 45(1), 87-93 (1986). XX RN [2] RA Cabot L.E., Angeletti B., Usdin K. and Furano V.A.; RT "Rapid evolution of a young L1 (LINE-1) clade in recently RT speciated Rattus taxa."; RL J. Mol. Evol 45(4), 412-423 (1997). XX RN [3] RP 1-6914 RA Jurka J.; RT "L1_RN: a consensus approximating active L1 subfamilies in RT Rats."; RL Direct Submission to Repbase Update (25-APR-2002). 
XX DR [3] (Consensus) XX CC This consensus is over 97% identical with most active CC L1 elements in rats. XX SQ Sequence 6914 BP; 2695 A; 1549 C; 1404 G; 1259 T; 7 other; cgccggaacc tgaagaaaca gaccggataa acagttctct gcacccaaat cccgtgggag 60 ggagagctaa ccttcagaga ggcgacagcc tgggaaacca gaagagactg ccctgcacat 120 ccagcgccag aggaaaacca aaccatctgg aaccctggtg cacgagctcc cggaaggcgg 180 cagtcttccg gttgctgccg ctgcagagag cccgtgggca gcaccccacg agcgaacytg 240 agcctcggga ccacaggtaa gaccaamttt tctgctgcaa gaaagctgcc tggtgagcty 300 gggacacacg gargcagaat tyctctagga ccgggcacgt yctgtgttta ccggaagtcc 360 cacacccgcg gatcccggcc cgcagcagct ctctgctccc agaccccgtg agagagagac 420 ccaaccgcct ggtcaggtgg gcactcctga ggctgcagag cggaagagac caccaacact 480 gctcacccct gcccacatcc ctggcccaag aggaaactgt ataaggcctc tgggctcccg 540 tgggggaggg cccaggagcg gcaggacccc tgcctgagac accgccggaa cctgaaggaa 600 acagaccgga taaacagttc tctgcaccca aatcccgtgg gagggagagc taaaccttca 660 gagaggcaga caagcctggg aaaccagaag agactgctct ctgcacacac atctcggacg 720 ccagaggaaa aagccaaaga ccatctggaa ccctggtgca ctgaagctcc cggaaggggc 780 ggcacaggtc ttcctggttg ctgccgctgc agagagcccg tgggcagcac cccacgagcg 840 aacttgagcc tcgggaccac aggtaagacc aacttttctg ctgcaagaaa gctgcctggt 900 gaactcaaga cacaggccca caggaacagc tgaagacctg tagagaggaa aaactacacg 960 cccgaaagca gaacactctg tccccataac tgactgaaag agaggaaaac aggtctacag 1020 cactcctgac acacaggctt ataggacagt ctagccactg tcagaaatag cagaacaaag 1080 taacactaga gataatctga tggcgagagg caagcgcagg aacccaagca acagaaacca 1140 agactacatg gcatcatcgg agcccaattc tcccaccaaa acaaacatgg aatatccaaa 1200 cacaccagaa aagcaagatc tagtttcaaa atcatatttg atcatgatgc tggaggactt 1260 caagaaagac gtgaagaact cccttagaga acaagtagaa gcctacagag aggaatcgca 1320 aaaatgcctg aaagaatcgc aaaaatccct gaaagaattc caggaaaaca taaataaaca 1380 agtagaagcc catagagagg agacacaaaa atccctgaaa gaattccagg aaaacataaa 1440 taaacaagta gaagcccata gagaggagtc acaaaaatcc ctgaaagaat tccaggaaaa 1500 cacaatcaaa cagttgaagg aattaaaaat ggaaatagaa gcaatcaaga aagaacacat 1560 
ggaaacaacc ctggatatag aaaaccaaaa gaagagacaa ggagctgtag atacaagctt 1620 caccaacaga atacaagaga tggaagagag aatctcagga gcagaagatt ccatagaaat 1680 cattgactca actgtcaaag ataatgtaaa gcggaaaaag ctactggtcc aaaacataca 1740 ggaaatccag gactcaatga gaagatcaaa cctaaggata ataggtatag aagagagtga 1800 agactcccag ctcaaaggac cagtaaatat cttcaacaaa atcatagaag aaaacttccc 1860 taacctaaaa aaagagatac ccatagacat acaagaagcc tacagaactc caaatagatt 1920 ggaccagaaa agaaacacct cccgtcacat aattgtcaaa acaccaaacg cacaaaataa 1980 agaaagaata ttaaaagcag taagggaaaa aggtcaagta acatataaag gcagacctat 2040 cagaatcaca ccagacttct cgccagaaac tatgaaggcc agaagatcct ggactgatgt 2100 catacagacc ctaagagaac acaaatgcca gcccaggtta ctgtatccag caaaactctc 2160 aattaacatt gatggagaaa ccaagatatt ccatgacaaa accaaattta cacaatatct 2220 ttctacaaat ccagcactac aaaggataat aaatggtaaa gcccaacata aggaggcaag 2280 ctatacccta gaagaagcaa gaaactaatc gtcttggcaa caaaacaaag agaatgaaag 2340 cacacaaaca taacctcaca tccaaatatg aatataacgg gaagcaataa tcactattcc 2400 ttaatatctc tcaatatcaa tggcctcaac tccccaataa aaagacatag attaacaaac 2460 tggatacgca acgaggaccc tgcattctgc tgcctacagg aaacacacct cagagacaaa 2520 gacagacact acctcagagt gaaaggctgg aaaacaactt tccaagcaaa tggtcagaag 2580 aagcaagctg gagtagccat tctaatatca aataaaatca atttccaact aaaagtcatc 2640 aaaaaagata aggaaggaca cttcatattc atcaaaggaa aaatccacca agatgaactc 2700 tcaatcctaa atatctatgc cccaaataca agggcaccta catacgtaaa agaaacctta 2760 ctaaagctca aaacacacat tgcacctcac acaataatag tgggagattt caacacccca 2820 ctctcatcaa tggacagatc atggaaacag aaattaaaca gtgatgtcga cagactaaga 2880 gaagtcatga gccaaatgga cttaacggat atttatagaa cattctatcc taaagcaaaa 2940 ggatatacct tcttctcagc tcctcatggt actttctcca aaattgacca tataattggt 3000 caaaaaacgg gcctcaacag gtacagaaag atagaaataa tcccatgcgt gctatcggac 3060 caccacggcc taaaactggt cttcaataac aataagggaa gaatgcccac atatacgtgg 3120 aaattgaaca atgctctact caatgataac ctggtcaagg aagaaataaa gaaagaaatt 3180 aaaaactttt tagaatttaa tgaaaatgaa gatacaacat acccaaactt atgggacaca 3240 atgaaagctg 
tgctaagagg aaaactcata gcgctgagtg cctgcagaaa gaaacaggaa 3300 agagcatatg tcagcagctt gacagcacac ctaaaagctc tagaacaaaa agaagcaaat 3360 acacccagga ggagtagaag gcaggaaata atcaaactca gagctgaaat caaccaagta 3420 gaaacaaaaa ggaccataga aagaatcaac agaaccaaaa gttggttctt tgagaaaatc 3480 aacaagatag ataaaccctt agccagacta acgagaggac acagagagtg cgtccaaatt 3540 aacaaaatca gaaatgaaaa gggagacata actacagatt cagaggaaat tcaaaaaatc 3600 atcagatctt actataaaaa cctatattca acaaaacttg aaaatcttca ggaaatggac 3660 aatttcctag acagatacca ggtatcgaag ttaaatcagg aacagataaa ccagttaaac 3720 aaccccataa ctcctaagga aatagaagca gtcattaaag gtctcccaac caaaaagagc 3780 ccaggtccag acgggtttag tgcagaattc tatcaaacct tcatagaaga cctcatacca 3840 atattatcca aactattcca caaaattgaa acagatggag cactaccgaa ttccttctac 3900 gaagccacaa ttactcttat acctaaacca cacaaagaca caacaaagaa agagaacttc 3960 agaccaattt cccttatgaa tatcgacgca aaaatactca ataaaattct ggcaaaccga 4020 attcaagagc acatcaaaac aatcatccac catgatcaag taggcttcat cccaggcatg 4080 cagggatggt ttaatatacg gaaaaccatc aacgtgatcc attatataaa caaactgaaa 4140 gaacagaacc acatgatcat ttcattagat gctgagaaag catttgacaa aattcaacac 4200 cccttcatga taaaagtcct ggaaagaata ggaattcaag gcccatacct aaacatagta 4260 aaagccatat acagcaaacc agttgctaac attaaactaa atggagagaa acttgaagca 4320 atcccactaa aatcagggac tagacaaggc tgcccactct ctccctactt attcaatata 4380 gttcttgaag ttctagccag agcaatcaga caacaaaagg agatcaaggg gatacagatc 4440 ggaaaagaag aggtcaaaat atcactattt gcagatgaca tgatagtata tttaagtgat 4500 cccaaaagtt ccaccagaga actactaaag ctgataaaca acttcagcaa agtggctggg 4560 tataaaatta actcaaataa atcagttgcc ttcctctata caaaagagaa acaagccgag 4620 aaagaaatta gggaaacgac acccttcata atagacccaa ataatataaa gtacctcggt 4680 gtgactttaa ccaagcaagt aaaagatctg tacaataaga acttcaagac actgaggaaa 4740 gaaattgaag aagacctcag aagatggaaa gatctcccat gctcatggat tggcaggatt 4800 aatatagtaa aaatggccat tttaccaaaa gcaatctaca gattcaatgc aatccccatc 4860 aaaataccaa tccaattctt caaagagtta gacagaacaa tttgcaaatt catctggaat 4920 aacaaaaaac ccaggatagc 
taaagctatc ctcaacaata aaaggacttc agggggaatc 4980 actatccctg aactcaagca gtattacaga gcaatagtga taaaaactgc atggtattgg 5040 tacagagaca gacagataga ccaatggaat agaattgaag acccagaaat gaacccacac 5100 acctatggtc acttgatttt tgacaaagga gccaaaacca tccaatggaa aaaagatagc 5160 attttcagca aatggtgctg gttcaactgg agggcaacat gtagaagaat gcagatcgat 5220 ccatgcttat caccctgtac aaagcttaag tccaagtgga tcaaggacct ccacatcaaa 5280 ccagacacac tcaaactaat agaagaaaaa ctagggaagc atctggaaca catgggcact 5340 ggaaaaaatt tcctgaacaa aacaccaatg gcttatgctc taagatcaag aatcgacaaa 5400 tgggatctca taaaactgca aagcttctgt aaggcaaagg acactgtggt taggacaaaa 5460 cggcaaccaa cagattggga aaagatcttt accaatccta caacagatag aggccttata 5520 tccaaaatat acaaagaact caagaagtta gaccgcaggg aaacaaataa ccctattaaa 5580 aaatggggtt cagagctaaa caaagaattc acagctgagg aatgccgaat ggctgagaaa 5640 cacctaaaga aatgttcaac atctttagtc ataagggaaa tgcaaatcaa aacaaccctg 5700 agatttcacc tcacaccagt gagaatggct aagatcaaaa actcaggtga cagcagatgc 5760 tggcgaggat gtggagaaag aggaacactc ctccattgtt ggtgggattg cagactggta 5820 aaaccattct ggaaatcagt ctggaggttc ctcagaaaat tggacattga actgcctgag 5880 gatccagcta tacctctctt gggcatatac ccaaaagatg cctcaacata taaaagagac 5940 acgtgctcca ctatgttcat cgcagcctta tttataatag ccagaagctg gaaagaaccc 6000 agatgccctt caacagagga atggatacag aaaatgtggt acatctacac aatggaatat 6060 tactcagcta tcaaaaacaa cgagtttatg aaattcgtag gcaaatggtt ggaactggaa 6120 aatatcatcc tgagtgagct aacccaatca cagaaagaca tacatggtat gcactcattg 6180 ataagtggct attagcccaa atgcttgaat taccctagat ccctagaaca aacgaaactc 6240 aagacggatg atcaaaatgt gaatgcttca ctccttcttt aaatgaggaa aaagaatacc 6300 cttggcaggg aagggagagg caaagattaa aacagagact gaaggaacac ccattcagag 6360 cctgccccac atgtggccca tacatataca gccacccaat tagacaagat ggatgaagca 6420 aagaagtgca gaccgacagg agccggatgt agatcgctcc tgagagacac agccagaata 6480 cagcaaatac agaggcgaat gccagcagca aaccactgaa ctgagaatag gwcccccgtt 6540 gaaggaatca gagaaagaac tggaagagct tgaaggggct cgagacccca aaagtacaac 6600 aatgccaagc aaccagagct tccagggact 
aagccactac ctaaagacta tacatggact 6660 gaccctggac tctgacccca taggtagcaa tgaatatcct agtaagagca ccagtggaag 6720 gggaagccct gggtcctgct aagactgaac ccccagtgaa ctagactatg ggggggaggg 6780 cggcaatggg gggagggttg ggaggggaac acccataagg aaggggaggg gggaggggga 6840 tgtttgcccg gaaaccggga aagggaataa cactcgaaat gtatataaga aatactcaag 6900 ttaataaaaa aaaa 6914 // ID L1PB1 repbase; DNA; ROD; 902 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1PB1) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; L1P5; L1PB1 subfamily; KW L1PB1. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-902 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [1] (Consensus) XX CC Contains identical ORF2 region consensus (subfam L1P5) as L1PB3. CC ORF2 ends at bp 684; average divergence of copies from consensus: CC 9%. 
XX SQ Sequence 902 BP; 360 A; 159 C; 171 G; 187 T; 25 other; ctaatatcca gaatctataa ggaactcaaa caaatcagca agaaaaaaac aaacaatccc 60 atcaaaaagt gggctaagga catgaayaga camttctcaa aagaagatat ryaaatggcc 120 aacaagcata tgaaaaaatg ctcaacatca ctaattatca gggaaatgca aatcaaaacc 180 acaatgcrat accatcttac tcctgcaaga atggccataa ttaaaaaatc aaaaaataat 240 agatgttggc gtggatgtgg tgaaaaggga acacttytac actgctggtg ggaatgtaaa 300 ctagtacaac cactatggaa aacagtatgg agattyctta aagaactaaa agtagaacta 360 ccatttgatc cagcaatccc actactgggt atctacccar argaaaakaa gtcattatay 420 gaaaaagaya cttgcacacr catgtttata gcagcacaat tyrcaattgc aaaaatatgg 480 aaccaaccca aatgcccatc aatcaayrag tggataaaga aaatgtgrta tatatatacc 540 atggaatact ackcagccat aaaaaagaat gaaataatgg catttgcagc aacctggatg 600 garytggaga cyattattct aagtgaagta actcaggaat ggaaaaccaa acatyrtatg 660 ttctcactta taagtgggag ctaagctatg aggatgcaaa ggcgtaagaa tgatacaatg 720 gactttgggg actcgggaga tcgggrgaaa gggtgggagg ggggtgaggg ataaaagact 780 acaaattggg tacagtgtac actgctcggg tgatgggtgc accaaaatct cacaaatcac 840 cactaaagaa cttattcatg taaccaaaca ccacctgttc cccaaaaacc tattgaaata 900 aa 902 // ID IAPLTR2_Mm_LTR repbase; DNA; ROD; 475 BP. XX AC . XX DT 14-NOV-2005 (Rel. 10.11, Created) DT 14-NOV-2005 (Rel. 10.11, Last updated, Version 1) XX DE Long terminal repeat of ERV2 Endogenous Retrovirus from mouse. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; RLTR2aIAP_MM; IAPLTR2_Mm_LTR. XX OS Mus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-475 RA Smit A.F.; RT "IAPLTR2_Mm_LTR - ERV2 Endogenous Retrovirus from mouse."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC note unusual TG...GA; 6% subst. 
XX SQ Sequence 475 BP; 87 A; 132 C; 122 G; 134 T; 0 other; tgtggggagc cgccctcaca ttcgccgttg caagatggcg ctgacatcct gtgttctaag 60 tggtaaacaa ataatctgcg catgtgccaa gggtagttct ccaccccatg tgctctgcct 120 tccccgtgac gacaactcgg ccgatgggct gcagccaatc agggagtgat acgtcctagg 180 cggaggataa ttctccttaa aagggacggg gtttcgccat tctctctctt gctctcttgc 240 gctcttgctc tcttgctctg ctcttgcgct ctggctccta aagatgtaag caatagagct 300 cttgctctgc gctcttgcgc tcttgcgctc ttgcgctctg gctcctaaag atgtaagcaa 360 tagagctctt gctctcttgc tctctggctc ctgaagatgt aagcaataaa gctttgccgc 420 agaagattcc ggtttgttgc gttcttcctg gccggtcgcg agaacgcgtg taaga 475 // ID RLTR41_MM repbase; DNA; ROD; 320 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR41_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-320 RA Pavlicek A. and Jurka J.; RT "RLTR41_MM - a family of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Similar to IAPLTR3. Individual sequences are CC ~90% identical to the consensus. 
XX SQ Sequence 320 BP; 70 A; 83 C; 96 G; 71 T; 0 other; tgtggagagc cgcgataaca ttcgccatca caagatggcg ccggcttccg ctgtgcctgc 60 atgccacctt aacagagaac aagctgtgtg cgcatgtgct aagagtgtat tcgcgccaag 120 tcataagccc accccggggc gtgtcaatga gatcatgggt aagcgaccag tcaggcgtgg 180 acacgccacg ctagggtgta tataagcagc gcctttctgg ggctctgggt cttcctcttc 240 aagatgcaat aaacgctttg ctgcagaagg atcctggtgt tccgtgtgcg ttcttgctgg 300 cgagaagata gcgcgggaca 320 // ID L1MB3 repbase; DNA; ROD; 932 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1MB3) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; MER12; L1MB3 subfamily; KW L1MB3. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-932 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [2] RP 1-932 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [2] (Consensus) XX CC Contains identical ORF2 region consensus (subfam L1M3) as L1MA10 CC ORF2 ends at bp 675; average divergence of copies from consensus: CC 16% CC Replaces MER12 (Acc. No. X59017). 
XX SQ Sequence 932 BP; 371 A; 132 C; 188 G; 220 T; 21 other; ttaatatcca aaatatataa ggaactcaaa caactcaaca agaaraaaac aaataaccca 60 attaaaaaat gggcaaarga cctgaataga catttytcaa aagaagacat acaaatggcc 120 aacagatata tgaaaaaatg ctcaacatca ctaatcatca aggaaatgca aattaaaacc 180 acaatgagat atcacctcac acctgttaga atggctatta tcaaaaagac agaaaataat 240 aaatgttggy gaggatgtgg agaaaaggga actattgtac actgttggtg ggaatgtaaa 300 ttagtayagc caytatggaa aacagtatgg aggttcctca aaaaaytaaa aataraacta 360 ccatatgaty cagcaatccc actwctgggt atatatccaa argaattgaa atcagtatgt 420 ygaagagata yctgcactcc catgtttayt gcagcaytat tcacaatagc caagatatgg 480 aawcaaccta agtgtccatc aayggawgaa tggataaaga aaatgtggta tatatacaca 540 atggaatact attcagccat aaaaaagaat gaaatcctgt catttgyarc aacatggatg 600 aacctggagg acattatgct aagtgaaata agccaggcac agaaagacaa atactgtatg 660 attccactta tatgaggtac ctagagtagt caaattcata gagacagaaa gtagaatggt 720 ggttgccagg ggctgggggg aggggggaat ggggagttak tgtttaatgg gtacagagtt 780 tcagtttggg aagatgaaaa agttctggag atggatggtg gtgatggttg cacaacaatg 840 tgaatgtact taacgccact gaactgtacg cttaaaaatg gttaaaatgg taaattttat 900 gttatgtata ttttaccaca attaaaaaaa aa 932 // ID CAVID1C repbase; DNA; ROD; 94 BP. XX AC . XX DT 26-DEC-2009 (Rel. 15.03, Created) DT 26-DEC-2009 (Rel. 15.03, Last updated, Version 3) XX DE tRNA-derived SINE family: consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW CAVID1C. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-94 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 10(3), 498-498 (2010). XX DR [1] (Consensus) XX CC >94% identical to consensus. XX SQ Sequence 94 BP; 30 A; 19 C; 25 G; 20 T; 0 other; ggggctgggg atttagctca gtggcataag cacctgcctt gcaagcgcat ggtcgtgagt 60 tcaatccctg gtaccgatta aaaaaaaaaa aaaa 94 // ID L1-1_Cpo repbase; DNA; ROD; 6898 BP. XX AC . 
XX DT 20-JUN-2009 (Rel. 14.07, Created) DT 20-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE L1 non-LTR retrotransposon: consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-1_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-6898 RA Jurka J.; RT "L1-type non-LTR retrotransposons from guinea pig."; RL Repbase Reports 9(7), 1408-1408 (2009). XX DR [1] (Consensus) XX CC >97% identical to consensus. XX FH Key Location/Qualifiers FT CDS 811..1863 FT /product="L1-1_Cpo_1p" FT /translation="MTKRKTKLQPQTSSPSQETLGAMYIEENICHSPQISD FT TIATEINKQLQEALKKFKLEIISQVKEEMIKEMKEILNISKENTDKKLEEL FT NKKIESLDEQYKKQTEYIMQMHRDIQEIKNSTESCKNRLNESEDRISDLED FT RIAASEQERKDLLKITRNQETTIQQLQDDAKKNNIRMIGINEKEGDNIKDV FT KRIFREVIAENFPSMRSETDIRISEAYRTPNSHNQNKTTPRHIIITIPEIQ FT HKNRLLKAVREKRQITYKGKPIRITADFSAQTIKSRRAWSEVFQILKQNDF FT QPRLIYPAKLSFKIDGEIRYFHDKEQLKNFMNTKPTLQKILKDSLDTQRKK FT PLSKADN*" FT CDS join(1883..3046,3050..5722) FT /product="L1-1_Cpo_2p" FT /translation="MQQQRDNMPTINQHLTVITINVNGLNAPIKRNRLAEW FT IKKQNPTICCLQETHLTQKDTHRLKVKGWKTILHAAGIQKKAGVAILFADN FT VNFKPTMTIKDKEGHYILVRGKLQEEEITILNIYAPNSRAPSYIKQLLTEM FT KTQIISNTIVTGDLNTPLTPRDRSTRQKMSKEITELNHTCEQMGLIDIYRM FT FHPTTSEYTFFSAVHGSFSKIDHILAHRTYLNKCKRVEIIPCMLSDHSALK FT LEINDKRYCKNPANTWKLNNTLLSNQWVTEEIKEEIKQYLKENENADTTYR FT NLWDAMKAVLRGKFIALSSHIRKTERIQINNLMLHLKQLEKEEQVKPKAKR FT REEIIKIRAEINAIETKKTIQRINESKSWFFERINKIDKPLANLIKREEKA FT QIHAIRNEKGEITTDPIEIQKIINTYFENLYSQKFDNTEEIDRFLETYEVP FT KLDQEDVKLLNNPISVNEIENVIKSLPTKKSPGPDGFTAEFYKKYKEDLMP FT TLLKLFNEIEREAILPKSFLEANITLVPKPEKDPTKKENYRPISLMNTDAK FT ILNKILANRMQQIIKKIIHHDQVGFIPGMQGWFNIRKSINVIHHINKAKNK FT NHMIISIDAEKAFDKVQHLFMIRTLQKIGIDGLYLNIIKAIYDKPTASIIL FT NGQKLKAFTLKSGTRQGCPLSPLLFNIVLEVLARAIRQEREIKGVKIGKEE FT VKLSLFADDMILYIEDPLNSIERLLDTINKFSNVAGYKINTQKSIAFLYTN FT NKITEREIRETALFTLASKKMKYLGITLTKEVKDLYSENYNTLKKEIEDDL FT 
RKWKDIPCSWIGRTNIVKMAILPKLLYRFNAIPIKIPSTYLIDLEKSLLNF FT IWNQKRPRIAKAILSSKDKAGGITIPDLKLYYKATVVKSTWYWNQNRAEDQ FT WNRLEDTTTTTNTLNHLIFDKGAKQVHWKNDSLFNKWCWKNWLSICRKLKL FT DPCLSPCTKLKSKWVKDLNIKTETLNLLEDKLGRNLEDIGVGREFMNRTQT FT AQEILPRINNWDHFLLKSFCMSKEISSIVKRKPTNWEKILVNSLSDKGLLS FT KTYKELKKLRPPKFKDPIQKWASEMNTHFSDEEMQMANKYMKKCSSSLVIR FT EMQIKTTLRYHLTPERMARIKKTNNNKCWRGCGEKGTLLHCWWECRLVQPL FT WRSVWRFLKKLGLEVPFIPAIPLLGIFPEELKASYHSDICAPMFIAAQFVI FT ARSWKQPKCPSTEEWIKKLWYFYTMEYYSAIKKDHIEIFINKWAQLETILI FT SEINQSRMCEYRIVSLM*" XX SQ Sequence 6898 BP; 2616 A; 1310 C; 1299 G; 1672 T; 1 other; gtgcagagga ggggctgggc ggatctgggg cgcctgcatc agcctccccg tgtgtctgga 60 ggactgtgga tatctcagca agcagggagg gatcgcaggt tcctgaggca ccgcggcctg 120 tggtgagtgc tcagcacact ctgcccaaat cccagactgt tccggaccct ccaggcttcc 180 gggtgcagtg aagtggagaa gtgggtttga ggcctctaca tccgcctccc catcccggct 240 gcacttggag aactgctgag cttccaacgg gtgggcgggc ccacgctatc cccggagccc 300 agcctgtttg ggactcccgc acctagccat cggtgtgcag aggaggggct gggcggatct 360 ggggcgcctg catcrgcctc cccgtgtgtc tggaggactg tggatgtctc ggcgggcggt 420 ggagcatcgc gggttccaga ggccctgcag cctgtggtga gtgctcagcg cactctgccc 480 agagcccaga ctgtttgggt ccctccaggc ttccaggtgc agtgaagtgg agaggtgggt 540 ttgaggcgtc tacatctgcc tccacatccc ggctgcgatt ggagagctgt ggagcttcca 600 aggggcaggt gggcgcacag tgtccctgag gcccagcttg tctggaactg cggagttctg 660 agcaggcaga ggagggccat tggttctgca ggcccagtgg cttggggtag cgtggtgcca 720 gagtctcagt agctgcaaac atagaagggg atttccaaca gatcctgagg aagctttcaa 780 ataacacttg aaacacagaa atataggaaa atgacaaaac gaaagaccaa attacaacct 840 caaacctcca gcccatctca agaaacccta ggagcaatgt atatagaaga aaatatatgc 900 cattcacctc aaatatcgga cacaatagct actgaaatta acaagcaact ccaggaagct 960 ttaaagaaat tcaaactcga aataatctcc caggtaaaag aagaaatgat aaaagaaatg 1020 aaagagattc tgaatatatc taaagaaaac acagataaaa aattggagga actaaacaaa 1080 aagatagaat ccctagacga gcaatataag aaacaaacag aatacataat gcagatgcac 1140 agagatatac aggaaataaa aaactccact gaaagttgta aaaaccgcct taatgaaagt 1200 gaagatagaa 
tttcagacct tgaagacagg attgcagcta gtgaacagga aaggaaagat 1260 cttttaaaaa taacaaggaa tcaggaaaca acaattcaac aactgcaaga cgatgcaaag 1320 aaaaataaca taagaatgat agggattaat gaaaaagaag gagacaacat aaaggatgtc 1380 aagaggatat ttagagaagt aatagctgaa aatttcccaa gcatgagatc agaaaccgac 1440 atcaggatca gtgaagcata tagaactcca aatagtcata accaaaataa aactacaccc 1500 agacatataa taatcaccat cccagaaatc caacacaaga atagattatt aaaagctgtc 1560 agagagaaaa gacagatcac ttataaagga aaacctatca gaatcacagc agacttctca 1620 gcacaaacaa taaagtcaag aagagcatgg agtgaagtat tccagatcct aaagcaaaat 1680 gatttccaac ctagactgat atatcctgca aaactatcat tcaaaattga tggtgaaata 1740 cgatacttcc atgacaaaga acagctgaag aacttcatga acaccaaacc aaccctgcaa 1800 aaaatattga aagacagttt agatacacaa aggaaaaaac cactaagcaa agcagataat 1860 taaacagaac agaaagataa agatgcaaca gcagagagat aacatgccaa caataaacca 1920 gcatttaaca gtaataacca ttaatgtaaa tggtctcaat gcaccaatca aaagaaacag 1980 actagcagaa tggatcaaga aacaaaatcc aaccatatgc tgtttacagg aaacccatct 2040 aacccagaag gatactcaca gactgaaagt caaaggatgg aaaacaatac ttcatgcagc 2100 aggaatccaa aaaaaggcag gagtagccat tctgtttgca gacaacgtga actttaaacc 2160 aacaatgacc ataaaggaca aagaaggtca ctacatactc gtaaggggaa aactccaaga 2220 ggaagagata actatcttaa atatatatgc accaaactcc agagcaccca gttatataaa 2280 acaactatta acagaaatga aaactcaaat tatcagtaac acaattgtaa caggagacct 2340 taatacgcca ttgacaccaa gggacagatc aaccagacag aaaatgagca aagaaataac 2400 agaactgaat cacacctgcg aacaaatggg tttgatagac atatacagaa tgttccaccc 2460 aacaacatca gaatacacat tcttctcagc agtacatgga tcattctcta aaatagacca 2520 tatattagct cacagaacat atttaaacaa atgcaaaaga gttgaaatca tcccttgcat 2580 gttatcggat catagcgctc tgaaattaga aattaatgat aaaagatact gcaaaaatcc 2640 cgcaaacaca tggaaactga ataacacact cttgagtaat cagtgggtca cagaagaaat 2700 taaagaagaa attaaacaat acctaaaaga aaatgaaaat gcagatacaa cttaccggaa 2760 tttgtgggat gcaatgaaag ccgtcctgag aggaaaattt attgcactga gttcccacat 2820 caggaaaaca gaacgaatac aaataaataa cttgatgcta cacctcaaac agctagaaaa 2880 agaagagcaa gtcaagccca 
aagccaaaag aagagaggaa attataaaaa tcagagcaga 2940 aatcaatgca atagagacta agaaaacaat ccaaagaatc aacgaatcaa agagttggtt 3000 ctttgaaaga ataaataaaa tcgataagcc cctagccaac cttatttaaa aaagggaaga 3060 aaaagctcaa attcatgcaa taagaaatga aaaaggtgaa atcactacag acccaataga 3120 aatacagaag atcatcaaca cctatttcga aaacctctac tctcaaaagt ttgataacac 3180 agaagaaata gacagattct tagaaacata tgaagtacca aagctagatc aagaggatgt 3240 aaaactgttg aataacccaa tctctgttaa tgaaattgaa aatgtaatta agtccttacc 3300 caccaagaag agcccaggcc cagatggatt cactgcagaa ttctacaaga aatacaaaga 3360 agacctaatg ccgacactcc tcaaactatt caatgaaatt gaaagggaag caatccttcc 3420 taagtcattc ctggaagcaa atattactct agtaccaaaa cctgagaaag acccaactaa 3480 aaaagagaac tatagaccga tctccctaat gaacacagat gcaaaaatcc tcaataaaat 3540 attggcaaat aggatgcagc aaatcatcaa gaagattata caccatgacc aagtgggatt 3600 catcccagga atgcaaggat ggttcaacat acgtaaatca ataaatgtaa tccaccatat 3660 caataaagcc aaaaataaga atcacatgat catttctata gatgcagaaa aagccttcga 3720 taaggtccaa catttattca tgataagaac tttacagaaa attggaatag atggtcttta 3780 cctcaatata ataaaggcca tttatgacaa accaacagcc agcatcatac taaatggcca 3840 aaaattgaaa gcctttactt taaaatcagg cacaagacaa ggatgtcctt tatcaccact 3900 cctatttaat atagtactgg aagtactagc cagagcaatt aggcaagaga gagaaataaa 3960 aggggtaaag ataggaaaag aagaagttaa attatcattg tttgcagatg acatgatact 4020 ctacatagaa gaccccctaa actccattga aagactctta gatacaataa ataaattcag 4080 taatgtggct ggatacaaaa tcaatactca gaaatcaata gcattcctat acacgaacaa 4140 caaaatcaca gagagggaaa taagagaaac tgcactcttc acattagcaa gcaaaaaaat 4200 gaaatattta ggaattactc ttacgaaaga agtgaaagac ttatacagtg aaaattataa 4260 tacactgaaa aaagaaattg aagatgatct cagaaaatgg aaagacatcc cttgctcatg 4320 gataggaaga acaaacattg tgaaaatggc cattctccca aaactgttat acagattcaa 4380 tgcaatccca ataaaaatac catcaacata ccttatagat ctagagaaat cactcctaaa 4440 tttcatctgg aaccagaaga gacctagaat agcaaaggca attctaagca gcaaagacaa 4500 agcaggaggc atcacaatcc ctgacttgaa gttatactac aaagcaacag tagtaaaatc 4560 aacatggtat tggaatcaaa acagagccga 
agatcaatgg aatagattag aagatacaac 4620 cacaactaca aacacactca accaccttat ctttgataaa ggagccaagc aggttcactg 4680 gaagaacgac agtctcttta ataaatggtg ctggaaaaac tggctctcca tttgccgaaa 4740 actaaagcta gacccatgtc tatcaccatg tactaaatta aaatctaaat gggtcaaaga 4800 tctgaatatt aaaacagaaa cactaaatct attggaagac aaattgggta gaaaccttga 4860 agacatagga gtaggtagag aattcatgaa caggactcaa accgcacagg aaatacttcc 4920 cagaatcaat aactgggacc acttcttatt aaaaagtttc tgcatgtcaa aagaaatctc 4980 cagcatagtg aaaagaaaac ccacaaactg ggaaaagatt ttagtgaact ctctctcaga 5040 caagggactc ttgtctaaaa catataaaga attaaaaaaa ctcagacccc caaaattcaa 5100 agatccaatc caaaaatggg catctgagat gaatacgcac ttctcagatg aagaaatgca 5160 aatggcaaat aaatacatga aaaaatgttc atcatcattg gtcattagag aaatgcaaat 5220 taaaaccaca ctgagatacc atctcactcc tgaaagaatg gccaggatca agaaaaccaa 5280 caacaacaaa tgctggagag gctgtgggga aaaaggaact ctcctacact gttggtggga 5340 gtgtagactg gtgcagcctt tgtggaggtc agtctggaga tttctcaaaa agctgggatt 5400 ggaagtccca tttattccag ctattccact tctaggcata ttcccagaag aactgaaagc 5460 atcataccac agtgatatat gtgcacctat gtttatagca gcacaatttg taatcgccag 5520 atcttggaaa caacccaagt gtccatcaac tgaggaatgg ataaaaaagt tgtggtattt 5580 ctatacaatg gagtattatt cagctataaa gaaggatcac attgagatat ttatcaataa 5640 atgggcgcag cttgagacca tactcataag tgaaataaat caatctcgca tgtgtgaata 5700 ccgaatagtt tccctaatgt aagtaatgat tcaaatatat aaaaatatgt ggtaatcatg 5760 attccccatt atgtggcctt gaagccttat aaagttttca cttattaatt ggaaaggctt 5820 tattctgggc ttcttgatat ggaaatattt gaagctatag tatatgaagt tttattcaaa 5880 tatttcaagg aaaaagacag acactatggt aactactatt aagtcgggtt atttggtggt 5940 gaagcactag gttgctgaaa cagtcatgct ttgacctgta ttgttatatt gttatgtcct 6000 gttttatctc aatctctccc ttattcttta aaaaaattat ttagttattt cttatgtttg 6060 ttttaattaa ggggaaaaat ttaaaaaatt agcaggatat ataatagtaa ctttttttga 6120 tttcctatta acttgttttg aagtcctgta atactcccac gatcttgttt tgactgatta 6180 gactctgttc cattctgtat gatagtatca agtttgaggg tataggcaac tactttgaag 6240 gcaataagat tcactaagat gcccattctg gtttgactat 
tgtctccttt ggaagtcctg 6300 taatgtttac aatgcttaag ttttattggt cttgttctct actgttttgt tcaactttat 6360 aatatctaat gagatagaca attcttttta aggaaacaag agacattgtt gataaccatt 6420 gttgataacc ttgtactctg ttttacatca tgtgctatgt ctatccttgg ttttgactga 6480 cagctttgtt ttgccatatt tgactttatt ctctcccccc caaaaaagta ataagaacta 6540 aggagtaata aaacccaaaa tgtgtaaatt ttgttgtaaa aaagagaggg aaaaaaagag 6600 ctctaaagga tacaacagca agtgtgttta ggggtgttaa atagctgacc atattaatgt 6660 ttttggttgg atcattgaac acaactgttt gcagtctcac tgtttgaatg tccactttgt 6720 attctttgat actgtgattt gagcaactat ttctaagatg ataatatgta atctattaac 6780 tggtgatttc ttactgtttt tttttttttt tgtagttctg tattacttgt acccttacac 6840 cgctttgttt tgttttgtgt ttttctcttt ttccttttaa ataaaatttt aaaaaaaa 6898 // ID IAPLTR4_I repbase; DNA; ROD; 1470 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW IAPLTR4_I; IAPLTR4-int; IAPLTR4. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-1470 RA Pavlicek A. and Jurka J.; RT "IAPLTR4_I - a mouse subfamily of nonautonomous LTR RT retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Internal sequence of an IAP-related CC retrovirus, CC 85% identical to IAPEY3_LTR_I. Individual copies are ~94% CC identical to the consensus. The family appears to be CC nonautonomous. CC LTRs listed as IAPLTR4. 
XX SQ Sequence 1470 BP; 447 A; 313 C; 423 G; 287 T; 0 other; cgaaacatta caaggacaag aagtggcgct gataacccgg gaactatatc atcaccaggc 60 gctgatagga gacccccccc cgtttcacgg ggcagattca gaactacggg acaatacagc 120 tctacaaggt atggccttga actttctcca gcgttagaag cctttctgtt ccttttcaca 180 gggctcgttc tttttcttta gattgcgtac agtttccaca ggaggaattg ctaactagtg 240 ttcagtcgag gtcaggcaag gttagaacaa aggcaagaag atagcatggg agcctcccac 300 tctgtggtga ccgccttacg gtcggtcctg aggcagcgtg gcttgaaagt ctccaccaag 360 acactagaag gctttgtaaa agagatagat cacatagcac cgtggtttgc gtgctcaggg 420 tccttaacta tcccctcttg ggagaaactc aggggagatt tagttgggga gcaggagaac 480 ggcaaactta aagcaggaac catgccgttg tggaagctga ttagatcgtg cttaaagaac 540 gaggaatgtc aacaagtagt taaggcaggg cagaaaattc tggacgaaat tcaagaaagc 600 ctatcagagg tagagcgggg agctagagag taggagccaa aggggaaaca tggtgcacca 660 aataagcata caggcctctc cacgggtctt gaacctgagg agaagataat gtcggggaag 720 gatacccggg gagagataag aagaaaggag gagaaaaaac aaaagaaaaa aagatcaatc 780 agcggaggtc cctagaggag ggagcctata cccgccgcta gatgagttta aggagttagc 840 tcttagcagc tcagaatcag atgaagaact tagcccctct gaggaaacag acttggagga 900 ggaagcagct cgttatgaga gagaaaagta ccagccagat aaaatgcgag ctaatcagtc 960 aagaaaaaag ccaaaagcgg ctggcgaagg ccagcttgct gctcggcctc cgggcagtcg 1020 gcttcaaggt catagtgcac ctccgcccta tgcggagccc ccgccctgcg tagtgcgtca 1080 gccctgcaca gagaggcagt gcgcagagag gcagtgcgca gactcgttca ttccaagaga 1140 ggaacaaagg aaaatgcaac aggcatttcc ggtctttgaa ggagccgagg gtgggcatgt 1200 tcacgctccg ggtagagtat atacagatta aagagcttgc cgagtcggtc cgtaactatg 1260 gagtcagtgc taatttcact gtggcacagg tcgaaaggct tgtggccttg gcaatgactc 1320 ccggggactg gacaacggtg ataaaagctg tggctccaaa tatgggaatg tatcttgaat 1380 ggaaagcatt gtggcaagat ttctgccaga cgcaagcaag ggctaatgct accatggagt 1440 gaaaaaggag aaaagagaaa aacaaaagat 1470 // ID RLTR10A repbase; DNA; ROD; 411 BP. XX AC . XX DT 26-SEP-1997 (Rel. 2.08, Created) DT 17-JUL-2008 (Rel. 13.07, Last updated, Version 2) XX DE Long terminal repeat of endogenous retrovirus. 
XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; KW Putative retroviral long terminal repeat RLTR10A; RLTR10A; ERVK. XX OS Mus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-411 RA Smit A.F.; RT "RLTR10A."; RL Direct Submission to Repbase Update (30-NOV-1996). XX RN [2] RP 1-411 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR [1] (Consensus) XX CC Bp 415 to 470 80-85% similar to IAPLTR_MA and RLTR7 termini. CC Copies 5% diverged from consensus. CC [2] 7% subst. XX SQ Sequence 411 BP; 103 A; 80 C; 142 G; 84 T; 2 other; tgtgaggagc gggtgtggca gcagtcccaa ganggcgcca gggactgcag ctaagtctta 60 tgacttgcac ctgacttcct catacacctg aaaataagcc acgaccatcg tgagagctgc 120 gcaggtgcac catgatgctg gcggtttaaa caagtccata tttggtggag acatgcccct 180 gccgccctga ttggctgaag ctgcgtgcct ggtgaggtga cgtggcctgc tgtgagtgga 240 tgggggctga gagtatataa gagtgagagg cccggggttc gggggagata aagatgaggg 300 aaaaagatga agagatgaag agagaganga agacatgaag tttgctgaat aaactgctgt 360 tagaaggact ggtggtcgcg tcgttcttgc tggtcgagag cggacgcgac a 411 // ID RLTRETN_MM repbase; DNA; ROD; 322 BP. XX AC X03064; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 17-APR-1997 (Rel. 2.03, Last updated, Version 3) XX DE Mouse LTR (E.Tn sequence). XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW Repetitive sequence; Long terminal repeat; MMETNLTR; RLTRETN_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-322 RA Kaghad M., Maillet L. and Brulet P.; RT "Retroviral characteristics of the long terminal repeat of murine RT E.Tn sequences."; RL EMBO J 4, 2911-2915 (1985). XX DR GenBank; X03064; Positions 1 322. 
XX SQ Sequence 322 BP; 63 A; 106 C; 66 G; 87 T; 0 other; tgtagtctcc cctcccctag cctgaaacct gcttgctcag gggtggagct tcctgctcat 60 tcgttctgcc acgcccactg ctggaacctg cggagccaca cacgtgcacc tttctactgg 120 accagagatt attcggcggg aatcgggtcc cctccccctt ccttcataac tagtgtcgca 180 acaataaaat ttgagccttg atcagagtaa ctgtcttggc tacattcttt tctctcgcca 240 cctagcccct cttctcttcc aggtttccaa aatcgctttc caggctagaa cccaggttgt 300 ggtctgctgg ccagacacaa ca 322 // ID ZOMBI repbase; DNA; ROD; 2806 BP. XX AC . XX DT 13-JAN-1998 (Rel. 6.4, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE Autonomous DNA transposon; POGO superfamily. XX KW TIRs; DNA transposon; MER46; TA target; ZOMBI. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 2806-2710 RA Smit F.A. and Riggs D.A.; RT "Tiggers and other DNA transposon fossils in the human genome."; RL Proc. Natl. Acad. Sci. USA 93, 1443-1448 (1996). XX RN [2] RP 2806-2710 RA Jurka J., Kapitonov V.V., Klonowski P., Walichiewicz J. RA and Smit F.A.; RT "Identification of new medium reiteration frequency repeats in RT the genomes of primates, rodentia and lagomorpha."; RL Genetica 98, 235-247 (1996). XX RN [3] RP 1-2806 RA Kapitonov V.V. and Jurka J.; RT "Jerky gene - a recruited transposon?."; RL Direct Submission to Repbase Update.. XX DR [3] (Consensus) XX CC 23 bp terminal inverted repeats and TA target site [1,2]. CC ZOMBI is an autonomous DNA transposon (its non-autonomous CC elements have been identified as ZOMBI_A (MER46 [1,2]) and CC ZOMBI_B. CC Orientation of ZOMBI has been determined based on the CC reconstruction of its internal sequence encoding transposase [3]. 
XX SQ Sequence 2806 BP; 912 A; 540 C; 600 G; 754 T; 0 other; caggttgagc atccctaatc caaaaatccg aaatctgaaa tgctccaaaa tctgaaactt 60 tttgagcgct gacatgacgc cacaagtgga aaattccaca cctgacctta tgtgacgggt 120 cacagtcaaa acgcaggtgc acaacacaca gtttattcgg cgtccccaag ggaaaaaaga 180 ccctcccagc ccccttcagc tgcggtatat cttttccgcg cacacccaga ttcccccatg 240 caagcacgcc cacaaagggt aataaaatgg cacgtgtgca ggctggacac accaacggca 300 ggttccccac aatgccccca catggggtca agacctacgt gcattactca ctgtgttttt 360 ttgcttattc tctgctctgt ggtgtaaaga tattgttgaa aatgtcaaaa aggcctgtag 420 atacccctgt gagtaacaat gataagaaaa aggaagcatt tatgtttatc tatagcacag 480 aaaagtcaag ctgttggaga aactggacag tggtgtaagt gtgaaacgtc ttacagaaga 540 gtatggtgtt ggaatgacca ccatatatga cctgaagaaa cagaaggata aactgttgaa 600 gttctatgct gaaagtgatg aacggaagtt aatgaaaaat aaaaaaacac tgcataaagc 660 taaaaatgaa gatctcgatc gtgtattgaa agagtggatc cgtcagcatc acagtgaaca 720 catgccactt aatggtacgc tgatcatgaa acaagcaaag atctgtcaca atgaactgaa 780 aattgaaggg aactgtgaat attcaacggg ctggttgcag aaatttaaga aaagacacgg 840 cattacattt ttaaagattt gtggtgataa agcatctgct gatcatgaag cagcggagaa 900 attcattgac gagtttgcca agatcatcgc tgatgaaaat ctgatgccag aacaagtcta 960 taatgctgat gaaacatcac cgttttggtg ttattgcccc agaaagacac tgactacagc 1020 tgatgagaca gcccctacag gaattaagga tgccaaggac agaataactg tgctgggatg 1080 tgctaatgca gcaggcacgc ataagtgtaa acttgctgtg ataggcaaaa gcttgcgtcc 1140 ttgctgtttt caaggagtga atttcttacc agtccattat tatgctaaca aaaaggcatg 1200 gatcaccagg gacatctttt ctgattggtt tcacaaacat tttgtaccag cggcttgtgc 1260 tcactgcagg gaagctggac tggatgatga ctgcaagatt ttgttattcc ttgacaactg 1320 ttctgctcat cctccagctg aaattctcat caaaaataat gtttatgcca tgtactttcc 1380 cccaaatgtg acttcattaa ttcagccatg tgaccagggt atctttagat caatgaagag 1440 taaatataaa aacactttct tgaacagcat gctagcagca gtgaacagag gcgtgggtgt 1500 ggaaggtttt caaaaggagt ttagcatgaa ggatgccgta tatgctgttg ccaacgcttg 1560 gaacacagtg actaaagaca cagttgtgca tgcctggcac aacctctggc ctgcgactgt 1620 gttcagtgat gatgatgaac caagtggtga 
ctttgaagga ttctgtatgt caagtgagaa 1680 aaaaatgatg tctgacctcc ttacatatgc aaaaaatata ccttcagagt ccgtcagtaa 1740 gctggaagaa gtggatatta aagacatttt taacatcgat aatgaggctc cagttgttca 1800 ttcattggaa gaagtggata tcaaagaagt cttccacatc gataaatgca ttaccagttg 1860 ttcaaccatc accggatggt ggaatagccg aaatggttct gaatcaaggt gattgtgatg 1920 atagtgatga tgaagatgat gacgttaaca ctgcagaaaa agcgcctata gatgacatgg 1980 tgaaaatgtg tgatgggctt attgaaggac tagagcagcg tgcattcata acagaacaag 2040 aaatcatgtc agtttataaa atcaaagaga gacttctaag acaaaaacca ttgttaatga 2100 ggcagatgac tccggaggaa acattttaaa aagccatcca gcagaatgcc tcctcatccc 2160 tagaggaccc acttcctggt ccctcaactg cttctgatgt ttcttctcac ttagaaaaca 2220 aaaaccaaaa agcaaaaaaa atacagtgta cagtaacctt ttaatcaaaa cacagcatcg 2280 tagatggaga ctgaaagcct gccattgttt gttgttgctg ttgtttaaca gctgatacag 2340 gtattctggt gatgctactg tgctgcttag ttaccctgaa cacatttttt tttcactgta 2400 ttaatggtat gtcatatttt ttactgttaa gtacttatgt gtgaataagt gtaagaaaat 2460 gattgcttat cggtagcata taaattcaga gtcaggaatg atggtgatgc caaacaacca 2520 cagattgtcc acatgggtgg ctgagatagt gacacctttg ctttctgatg gttcaatgta 2580 cacaaacttt gtttcatgca caaaattatt aaaaatattg tataaaatta ccttcaggct 2640 atgtgtataa ggtatatatg aaacataaat gaattttgtg tttagacttg ggtcccatcc 2700 ccaagatatc tcattatgta tatgcaaata ttccaaaatc tggaaaaaaa tccagaattc 2760 aaacacttct ggtcccaagc atttcggata agggatactc aacctg 2806 // ID CAVID2 repbase; DNA; ROD; 102 BP. XX AC . XX DT 07-JAN-2010 (Rel. 15.03, Created) DT 07-JAN-2010 (Rel. 15.03, Last updated, Version 4) XX DE tRNA-derived SINE family: consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW CAVID2. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-102 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 10(3), 499-499 (2010). XX DR [1] (Consensus) XX CC >82% identical to consensus. 
XX SQ Sequence 102 BP; 39 A; 22 C; 25 G; 15 T; 1 other; gggccgggga tttagctcag cggcataagc gcctgccttg caagcacgag gtcntgagtt 60 cgatccctgg taccgaaaaa aaaaaacaaa aaaaaaaaaa aa 102 // ID RLTR32C_MM repbase; DNA; ROD; 640 BP. XX AC . XX DT 21-AUG-2008 (Rel. 13.08, Created) DT 21-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Long terminal repeat of endogenous retrovirus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; RLTR32C_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-640 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats of endogenous retroviruses from mouse."; RL Repbase Reports 8(8), 893-893 (2008). XX DR [1] (Consensus) XX SQ Sequence 640 BP; 166 A; 164 C; 143 G; 165 T; 2 other; tgttgggagc atagcatgaa ggtccttgtg cccacagatc tccagagaaa ccaggaatgt 60 taagcacaca ggattgtctt tccaatggga aagatgtaaa accctgttct tccatccaga 120 aagctgggaa ttggaagtga ttaacagcct ggtgatctgc gttatctgtt gggccaggcc 180 atatcaagct cctacaacca ggaccggagg tggctagtaa tctaaatgac ccttatagcc 240 agaggctccc ccaagctccc acaaccagga ccagaggtgg ctaatagccc aaatgacctc 300 tatgctatct gtatgactga gaccaggcca gacccttccc taggccctta agctagtaca 360 ggtagcctat ctctagaacc catcttcttg gaagaaatga catgtaccct gagttgagtt 420 tcgatgtaat tacgcttcct tgtgcaccgg ggattgtatt acaccaggaa actttttttc 480 caaattatac tgtgtttaaa tacgttggga ataaaccgcc tggcatcaga ctccctagaa 540 gtcttgatcc aggttgacga agtcaatctg aawccgggtt ttcattctca yctcttcgtg 600 tggttcgctt ccctgcctga cacctgcagg gaccccctca 640 // ID ERV1A-CPo_I repbase; DNA; ROD; 7527 BP. XX AC . XX DT 19-JUN-2009 (Rel. 14.07, Created) DT 19-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Internal portion of ERV1 endogenous retrovirus: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; ERV1A-CPo_I. 
XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-7527 RA Jurka J.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1540-1540 (2009). XX DR [1] (Consensus) XX CC ~95% identical to consensus. XX FH Key Location/Qualifiers FT CDS join(522..2153,2157..5666,5653..6075,6063..7493) FT /product="ERV1A-CPo_I_1p" FT /translation="MGQSTSSPLSLTLDHWSEVRKRAHDLSLQVKKSKWQD FT FCSTEWPTFSVGWPPEGTFHLPLILAVKDVIFRPGQRGHPDQVAYILVWQD FT LREEPPLWVKPFLPPSDSSVPKLLALKGPPTSAPVLPESQPDLPLLDYDRP FT PPSQLTQPPPYPLSPPASSSGSSAASPSAPSSPELNPSSPPIPTSPLYPPL FT PAMACPEPTPPGPTGPAQNTRSKLRPEDPVVTLPLRPYGPMIDDGTDGGQM FT PALQYWPFSTSDLYNWKNNNPPFSDDPSKLTGLMDSVMFSHQPTWDDCQQL FT LGVLFTTEERDRILLEARKAVPGEDGRPTQRPDLIDDFFPLKRPNWDPNSP FT TGRQHLSIYRQTLMAGLRAAARRPTNLAKVREVTQGPTETPSVFLERLMEA FT YRRYTPFNPEEEGRQGSIAMAFIGQSAPDIRRKLQRLDGLQDLTLRDLVKE FT AEKVYYKRETEEEKELARDKRRNKELAKMLATVIQGKPEPGKGKPTRPPLD FT PDQCAYCKEKGHLIKDCEKLKKKRAKEEREKQSSQRSRAPMLALDEEDGLR FT GSDPLPELRVMFKVEGTPVEFEVDTGAVYSALQAPLGALSTKKSLVQGANG FT SKYRSWTTERTVDLGKGKVKHSFLVIPECPAPLLGRDLLTKLGAKISFEPQ FT GPQVTFRNPKVGQPMVTVLSLKLEDEYRLYDHQDPQPIAPAWLTDFKESWA FT ETAGLGLASQQPPVVVTLKTTASPIRVKQYPINREAHLGIKVHIQRLLDQG FT VLTPCRSAWNTPLLPVKKPGTNDYRPVQDLREVNSRVEDIHPTVPNPYNLL FT SGLNPSRTWYTVLDLKDAFFCLPLHKDSQPLFAFEWTDPETGSAGQLTWTR FT LPQGFKNSPTIFDEALHKDLAPFRAQHPNLTLLQYVDDLLLAADSEGDCTS FT GTQDLLRELAILGYRASAKKAQICKREVIFLGYSLKGGKRWLTEARKQTVV FT QIPPPKSQKQLREFLGTAGFCRLWIPGFATLAAPLYPLLKGGSPFIWEKDH FT QQAFDAIKRALLSAPALALPNVDKPFTLFIEEKKGIARGVLTQAFGPWRRP FT VAYLSKRLDTVASGWPPCLKAIAAAALLIKDADKLTLGQKITIIAPHTLES FT IIRQPPDRWLSNARVTHYQSLLLNKDKITFGPPVTLNPATLLPEEASEPVL FT HTCQDVLAEEAGVRPDLLDCPLPDAEVTYFTDGSSFLIQGKRYAGAAVTDY FT SNVIWTARLEDGSSAQKAELIALTKALELAKGKRANIYTDSRYAFATAHIH FT GAIYRQRGLLTSAGKEIKHKQEILQLLAAVMLPQKVAIIHCNSHQKGTDPV FT TRGNNLADQAAKSAAMGDSQMLVTDVKDSPPKEKAQINQTPETPDLTAYIQ FT 
QAHRLTHLGAKKLSLLAQRQDFPGASPTAIRQVANRVVANCEACQLTNAYP FT AKLAPGKRLRGTRPGQYWEVDFTEVKPARCGLKYLLVFVDTFSGWTEAFPT FT KKETAAMVTKKLMEEIFPRFGLPKVIGSDNGPAFVAKVSQGLANILGIDWK FT LHCAYRPQSSGQVERMNRTIKETLTKLAHETGLKDWTMLLPYALFRARNTP FT STKPPNLTPFEILFGTPPPIRDLTPLTEHAALLPTSLSDRLIALDNLQRDV FT WTQLASAYAPSEFPAPHPYQVGDFVFVRRHQQETLQPRWKGPYQVLLTTPT FT AVKVDGVASWIHASHLKPASAPEDGSWELKRSDNPLKLKLSRKTLAERLFN FT AVSVHPPLVYYCEMTQQKPQVVTPWAIMKLLMLMLVLVTPTIANPHRPWVW FT TLSRWDDPKEVDKWTGSGHPTFSFYSCDIVPLDPNVDHPPEIYLCPASVEG FT RSYCNSPGEYYCAYWGCETMATAWTTLNKGPTPRTDTLACQQRGEVNIKAL FT CHLSKRSLHSYDGGPGNCRHLQLEVLTPEDQSWTTGKIWGFRYYMSGPDTG FT GLILIKKKLKTEPLLXTPAPTTARGPSSPMPVTTQSQEPIVSPRTPSGYLT FT NPPESGLVPPTVNSLLSLILGAHKALNLTHPDLAQACWLCLNANPPYYEGT FT ALNDSYTLTPNPDQCDWDHVFHTLPLPDLASRGTCIGTLDNWPTNLRHRCL FT THTRSFPRNQYLIPPTGAFWLCTTGLTPCVSTNVLLAGDSSCILIDLMPRI FT KYLPPNMLPLMRGRYKREPVSLTLAVLLGIGVAGGIGTGTAALITGHQSRQ FT QLLAVVNQDLQALETSITALQQSLSSLSEVVLQNRRGLDLLFLKEGGLCAA FT LREECCFYTDHTGIVKDAMEKLRERIKSREPPSSEAKMFSFWESILPYILP FT LLGPLAGFIIVMAIAPCLINRVTQFVRAQISQVKVMVLRQQYDPLPIQDL* FT " XX SQ Sequence 7527 BP; 1909 A; 2106 C; 1777 G; 1733 T; 2 other; tcatggaggc cccagcgaga ttagcacgtc tcacgggagg tgaccccaac ggctctgcgg 60 tgggtgagta cgactggtcg cttctcattt ttaytctgag gatcaaacca ccggtgtttt 120 acgccccgag ccatacggct ccgatctagg ggcaattttt gttatctgtt tcgtactctg 180 ttccctggtg actgggggac gccccacacg gtttccgtgg ctcaccgtct gcgatagggc 240 ctccctatcg atctggggaa tttcgattcc caatctgtgg tccgtccggg ccaatttgga 300 gacacccttt gaggtgactc atctggggga tttcgattcc cagtctgtgg tccgtccggg 360 ccaatctgga gacaccctgt aaggtgtctc atttggagtg aaactgatgc gtatgcatgt 420 gaatgaagtg gccggcctga ttgtcttgtg tgtcgtgggt gctgtgtttg gttatttgtt 480 gtaccgcgct tgcgttaacc gccctgtgac aatagaacat catgggacag tcgacctcat 540 ccccattgtc cctcaccctc gaccattggt cggaggtgcg gaagagggct cacgatctct 600 ccttgcaggt caagaagagt aagtggcagg atttctgctc gactgagtgg cctactttct 660 ctgtgggatg gccaccagag gggacttttc atctgcccct catcctagca gttaaggacg 720 tcatcttccg ccccggacaa agggggcacc ctgatcaggt 
cgcttacatc ctggtatggc 780 aggacctcag ggaggaaccc ccattatggg taaaaccctt tcttccccct tctgactcat 840 ctgtaccaaa gcttttagct ctgaaaggac ctcccacttc ggctccggtc ctgcccgagt 900 ctcagcctga tctaccttta ctcgactatg accggcctcc tccttcccag ctgactcaac 960 ctcctccgta cccactctct cctccagcct cctcgtccgg atcgagtgct gcttctccgt 1020 cagccccctc ttcacctgag ctaaatccct cttctcctcc gatacccacg tcccctctat 1080 atcccccgct ccctgcgatg gcttgccccg agcccacacc tccggggccc acgggcccag 1140 cccagaacac tcggagcaag ctacggcctg aggatccagt cgttaccctc cctctccgtc 1200 cctatgggcc catgatagac gatgggacag atggagggca gatgcccgct ttacagtact 1260 ggcctttttc tacctcagat ctctataact ggaaaaacaa taaccctcct ttttctgatg 1320 atccttctaa gctaacaggt ctaatggatt ctgttatgtt ctcccaccaa cccacatggg 1380 acgactgtca acagctccta ggggtcctgt tcaccaccga ggaaagagac cgcatcctcc 1440 tggaagcccg gaaagctgtt cccggagaag atggcagacc cacccagaga ccagacctga 1500 tcgatgactt cttcccctta aagcgcccca actgggaccc caactcacct accggtaggc 1560 agcatctttc tatctatcgc cagactctaa tggcaggtct ccgggcggcc gcaaggcgcc 1620 ccactaatct ggccaaggta agagaagtta cccaaggacc tactgagact ccctcagttt 1680 ttctagaacg cctgatggaa gcttaccggc gatatacgcc ttttaaccct gaggaggaag 1740 gccgacaagg ctcgatagct atggccttca taggacaatc ggcccctgat ataagacgta 1800 agctccaaag gctagatggt ctccaagacc taactctgag ggatcttgtt aaggaagctg 1860 aaaaagtcta ctataagcgg gagacagagg aagaaaaaga gttagctagg gacaaaagac 1920 gaaacaagga attagctaag atgttggcca cagttattca gggaaaacct gagccaggaa 1980 agggaaagcc cactcgccct ccattagacc ctgaccaatg tgcatattgt aaggaaaaag 2040 gacacctgat caaagactgt gaaaagttaa aaaagaaacg ggccaaagaa gaacgagaaa 2100 agcagtcctc gcagcgatcc cgagccccca tgctcgccct agatgaagaa gactagggac 2160 ttcggggctc ggaccccctc cccgagctca gggtaatgtt taaagtggag gggactcccg 2220 tggaattcga ggtcgatacg ggagctgttt attcggcctt acaagccccc ttaggggccc 2280 tttcaactaa gaaatcatta gttcaagggg ctaacgggag caaatatcgc tcgtggacca 2340 ctgagcgcac agttgactta ggcaagggaa aagtaaaaca ctcctttctg gttattcctg 2400 aatgtcctgc tcccctcctg ggacgggact tattaaccaa actgggggcg 
aagatttcct 2460 ttgagcccca aggacctcaa gtgacattcc gtaacccaaa ggttgggcaa cctatggtta 2520 cggtgttatc tctaaaactg gaagatgaat ataggctcta cgaccaccag gatccccagc 2580 ctatcgcccc agcctggcta actgatttta aagaatcctg ggccgagacg gctggactcg 2640 ggctggcaag ccagcaaccg cctgtagtgg ttactttaaa aaccactgcc tcccctattc 2700 gggttaagca atatcccatt aatagagaag ctcacctagg gattaaggtg cacatacaga 2760 gacttttaga ccaaggggta ttaactcctt gtcggtctgc atggaatact ccgttactcc 2820 cggtcaaaaa gcccgggacg aatgactaca gaccggtaca ggacctaagg gaggtcaata 2880 gcagggtgga agacattcac cccaccgtac ccaatcccta taatctcctc agcgggttga 2940 atccgtcaag gacttggtac actgttctgg atttaaagga tgcctttttc tgtttgcctt 3000 tacacaaaga tagccagccc ttatttgcgt ttgaatggac ggaccccgag actgggtccg 3060 ccggacaact gacctggacg cgcctcccac agggcttcaa aaacagccca accatctttg 3120 atgaagccct acataaggat ttagccccct ttcgggctca acacccaaac ctcacccttc 3180 ttcaatatgt agatgacttg ctgctggctg cggactctga gggtgactgc acaagcggaa 3240 ctcaggacct tttacgtgag ttggctatcc tggggtatag ggcatcagca aaaaaggctc 3300 aaatttgtaa acgagaggta atttttctgg gctactcctt aaaaggggga aaaagatggc 3360 tcactgaggc cagaaaacag actgtggtcc agattccccc tccaaagagt caaaaacaat 3420 tacgagagtt cctgggtacc gcggggtttt gccggttgtg gatccctgga ttcgcaactt 3480 tggcagctcc cctgtacccg ttactgaagg ggggatctcc ctttatctgg gaaaaagatc 3540 accagcaggc ctttgatgcc atcaagcggg ctctcctgtc cgctccggcc ctggcccttc 3600 ctaatgtgga taaacccttt actctcttca ttgaagagaa gaaaggaata gcgagaggag 3660 tactgaccca ggcctttggg ccatggaggc gtccggtggc ttacctctca aaaagactgg 3720 acactgtggc aagcgggtgg ccaccctgcc taaaggccat cgcagcagct gccttgctca 3780 ttaaagatgc tgacaaattg actttgggac aaaaaataac aatcattgcc ccgcatacgc 3840 tagaaagcat catccgccag cctccagaca ggtggctctc aaatgccaga gttactcatt 3900 atcagagcct cttgctcaac aaggacaaga taacttttgg acccccggtg actctcaacc 3960 cggcgactct gctgccggaa gaagcttcgg aacccgtcct ccatacctgc caggacgtct 4020 tggcagaaga ggccggagta cgaccggact tattggattg ccccctaccc gatgcagagg 4080 tgacctactt cactgacggg agtagctttt tgattcaagg taagcggtat gcgggggcgg 
4140 ctgtgactga ttattccaat gttatatgga cagccagact agaggacggg tcctcggccc 4200 aaaaggccga actgattgct ttgactaagg ctctggagct cgccaaagga aagcgagcaa 4260 acatttacac ggatagccga tatgcttttg caacggccca catccacggg gccatatacc 4320 gccagcgggg gcttctgacc tccgcaggaa aagaaataaa acacaaacag gaaatccttc 4380 agttgctcgc tgcagttatg ttgccccaga aggtggcgat aatacactgt aacagccacc 4440 agaaagggac tgatcccgtc acaaggggaa acaacctggc cgatcaggca gcaaaatctg 4500 cagccatggg agactcacag atgttggtga ctgatgtcaa agattctcct ccaaaggaga 4560 aagctcagat taaccaaacc cctgaaacac ctgatctgac tgcttacata caacaggccc 4620 accggctcac ccatttggga gctaaaaagc tgagcctact ggcccaacgc caagacttcc 4680 caggggcatc ccccactgcc atacgtcagg tcgccaatcg ggtggtggct aattgtgagg 4740 catgtcaatt gaccaatgcc taccctgcca agctggcccc cggcaaaagg ctacgtggca 4800 ccagacctgg acaatactgg gaagtggact ttacagaagt taagcctgcc cgatgtggac 4860 taaaatatct gctagttttt gtagacacct tttctggatg gactgaggct tttcctacca 4920 aaaaggaaac tgctgccatg gtaactaaaa agttgatgga agaaatattt ccccggttcg 4980 gattgccaaa ggtaataggg tctgataatg gaccggcgtt tgtggctaag gtaagtcagg 5040 gactggccaa catattgggg attgattgga agttacattg tgcttataga ccccaaagtt 5100 caggacaggt agaaagaatg aatagaacta ttaaggagac cctcactaaa ttggctcatg 5160 agactggctt aaaagattgg acgatgctcc tgccgtatgc cctttttcgc gcgcgaaaca 5220 ccccgtctac caaaccccct aaccttactc ccttcgaaat cctttttggc actccacccc 5280 ccattcgaga tctcactccc ttaactgagc atgctgcctt gttgccaacc tccctgtctg 5340 acaggctcat cgcccttgac aatctacaga gggacgtgtg gacgcagcta gcctcggcct 5400 atgctcccag tgagttcccc gcgccacatc cgtaccaggt gggagacttt gtgttcgtcc 5460 ggcgccacca acaagagact ctacagcctc gctggaaagg accataccag gtcctattga 5520 ctactcctac agcggtaaag gttgacggag tcgcctcttg gatccacgcg tcacatctaa 5580 agcctgcttc agcgccagag gacggctcct gggaactcaa gcggtctgat aaccccttaa 5640 aacttaagct gagccgaaag actctttaat gctgtttctg tacaccctcc ccttgtgtat 5700 tattgcgaga tgacccaaca aaaaccccag gttgtaaccc cctgggctat aatgaaatta 5760 ctcatgttaa tgttagtgtt agtcacccct acaatagcga acccccaccg cccttgggtc 5820 
tggacattaa gcagatggga cgacccaaaa gaggtcgata agtggacagg ttcaggccat 5880 cccaccttta gtttctattc ctgtgatatt gttcctcttg accctaacgt tgaccacccc 5940 ccggaaatat atctatgtcc cgcgtctgtg gaaggaagaa gttattgcaa ttccccaggc 6000 gaatactatt gtgcctactg gggatgtgag acgatggcca cagcatggac caccctcaat 6060 aaaggaccga caccttagct tgtcagcaac ggggggaggt caacataaag gccctttgcc 6120 acttgtcaaa gaggagcctc cactcctatg acggaggacc aggtaattgc cgacatttac 6180 aactcgaggt cctcaccccc gaggaccaat cttggacgac aggaaaaatc tggggattta 6240 gatactatat gagtggacca gacacaggag ggctcatttt aattaaaaag aaactaaaga 6300 cagagccatt gctgyccaca cctgctccaa caacagctag ggggccctca tcccccatgc 6360 ccgtgactac ccaaagccaa gagcccatag tctcacctcg caccccctcc gggtatttaa 6420 ctaatcctcc tgagtcggga ctcgtccctc caaccgtgaa ctccctcctc agtcttatac 6480 ttggggctca taaagccctc aatcttaccc atccagacct agcccaagct tgttggttat 6540 gtctaaatgc taaccctcca tattatgagg ggactgctct aaatgactcc tatacgctga 6600 ctcccaatcc cgaccagtgc gactgggatc atgtgttcca cacccttcct ttacctgacc 6660 tcgctagccg aggaacctgt atagggacac ttgataattg gcctacaaac ctacgccatc 6720 gctgcctcac acatacccgc tccttcccca ggaaccaata cctcatccct cctactgggg 6780 cattctggtt atgtactaca gggctcaccc cttgtgtctc cacaaatgta ctcctggcag 6840 gggacagctc ctgcatcctt atagacctta tgccacgcat taagtatctc ccccctaaca 6900 tgctccccct catgagaggt cgatataaac gggaacctgt ctccctgacc ctagcagtcc 6960 tattggggat aggtgtggca ggggggatag gaactggtac agcagccctc attactgggc 7020 accagtctcg ccaacagctt cttgcagtag taaaccaaga tttacaggcc ttagagactt 7080 ctataactgc attacaacaa tctctctcgt ctctatctga agtagtgtta caaaacagaa 7140 gaggcttaga tcttttgttt ttaaaagaag gcggtctctg cgccgcactt agagaagaat 7200 gttgtttcta cactgatcat actgggattg taaaagatgc aatggaaaag ttgagagaga 7260 gaatcaaatc tcgagagccc ccttcaagtg aagccaaaat gttctccttc tgggaatcaa 7320 tcctgcccta tattctccct ttactgggac ccctggccgg attcattata gtcatggcta 7380 tagccccttg cctcattaac cgagtaaccc agtttgtgag ggcccagatc tctcaagtaa 7440 aagttatggt gctacgtcaa caatatgacc ccttacctat acaggacctt taagattaaa 7500 agtcattgga 
aaaataaaag gggggaa 7527 // ID ERVB4_6-I_RN repbase; DNA; ROD; 7231 BP. XX AC NW_047390; XX DT 11-FEB-2004 (Rel. 9.01, Created) DT 11-FEB-2004 (Rel. 9.01, Last updated, Version 1) XX DE Rat endogenous beta retrovirus ERVB4_6, internal sequence. XX KW Endogenous Retrovirus; Transposable Element; KW endogeneous betaretrovirus; RnERV-B4_NW_042829; ERVB4_6-I_RN. XX OS Rattus norvegicus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Rattus. XX RN [1] RP 1-7231 RA Baillie J.G., Van de lagemaat N.L., Baust C. and Mager L.D.; RT "Multiple groups of endogenous betaretroviruses in mice, rats and RT other mammals."; RL J. Virol 78(11), 5784-5798 (2004). XX DR Genbank; NW_047390; Positions 16229546 16236776. XX SQ Sequence 7231 BP; 1947 A; 1846 C; 1490 G; 1948 T; 0 other; agtggcaccc aaacagggac ccaaggcatg gatccttttg gacagaaggc gattttgtga 60 ccccggctgt ggaaccaagc ggacagacac cgtggactga ctcgaatcga aagtgaacgc 120 accgtgtcac tgctagtaga ttgtgcgatc ttgccactca gaagagaaag tcatgaacag 180 gtaagaagga cttctgtctc attaggaatg ggccaaaagc tctctaagga agcagtcttt 240 gttaaagact tgaaggcttc cctcagggaa aggggaatta gagttaagaa aaaggatctt 300 gttaaagttt ttatctttat atctgaagtt tgtccttggt tcattattga aggatcagat 360 attagcccct taaggaactt aataatgctg ttaggactta tgaccctaat gctccattta 420 cacaattcat gcttgaggcc ctatcaggag gcagatacct caccccggga gaatggttca 480 gagcaactca ggcagttctc tctcacggac agtttttgtc atggaaagct gacttttttg 540 accactgcca gacacgagca ggaaggaata agaaagatcc ctgggcccca gaggcagctt 600 ggacctttga caaacttact ggacaaggaa aatccgtgtc agagggctgt cagctcaggc 660 tacctatggg cctcttggtt caagttaaag aggccaccct gggggcctgg aaggatgtcc 720 cctccaaaag gaaccttaca acacccctta cttaagtcat ccagggccct caagaaccat 780 ttagtgaatt tgttgctaga ttgcaggaag ttgcagaaag agttttgggt cccagaaaag 840 aggacaataa tctccttaaa caaattgtgt atgaaaatgc taattcagct tgtaagtccg 900 tccttaaggg acaaacaaaa aataaagcac ttcctgattt
ggtcaggctt tgcaccaatg 960 ttgatatgtt ttcacataag gtcaccagag catcaatttg gctattggag cggctcccca 1020 agtagctaaa ggcccaactt cccataaaaa ttttattaaa tgtggtcagc caggacattt 1080 tgctagacaa tgcccattga ttcagaggac agactagaga atgaaagact ttttggataa 1140 gatactggat ctccacctcc ctggtcccct gacggccaca aaatgccctc actgtaagag 1200 aggaaaacac tgggctaatc tatgtcactc aaaaacggac atttctggta atccctccaa 1260 ccccttcagg gaaatgggtg gaggggccct cctcggggga cccagagaac cattcatcca 1320 gtccatctcc atggcctctc agatacctgc cccagggcca aacactcccc tcccttcctc 1380 agagctacct cagagagcaa caggagtgga cttgtgtgcc gccaccgata cagtattgag 1440 gcctgaagat ggaacacaaa ttgttcctac cggaatgttt ggccccctgc ctcctaatat 1500 ctttttcctg atcatgggac aagcctcctc tgccctgcag ggggtcatca tacaccccac 1560 agtggtctac aacgactaca ccagagaaat gcgggtactg gctactgcca cctctggccc 1620 tttgacctta aaggagggac aaagggttgc ccaggctgtc cctctgccgt tagaccggca 1680 ctacccttct ctccaggacc gccgtggtgc tacccaacct tgctcttcgg atgctttttg 1740 ggtccagaca attacccatg agaggccttc tcttaaactt aaactagaca accgatggtt 1800 tctgggcatt gttgatactg gggctgatcc tactgtcata tctaaagatc aatggccctc 1860 atcttggcct cttcagcctt cactcacaca tttacaaggg atcagccaat ctaaaaatac 1920 tctccaaagt tctaagtacc taaaatggga tgattctgag gggcactcag gctttatccg 1980 gtcttttgtt gttgaagccc tcccggtaaa cctttggggc cgtgatttat taactcaact 2040 gggtttggtt atgtgcagcc cgaatgagac tgttactcga caagtgcttc aacagggatt 2100 ccatcctgga aaaggctttg gcaaaaagga acaaggaata agagcatcca taataccttc 2160 tcccaaaaac gatggcacag gattaggcta tcaaaatttt tcctaagggc cacctcacca 2220 cctgcactac acgcagacaa aattacttgg aaagatgaca aagctgtctg gattgatcag 2280 tggccccttt ctagtatcaa agtgactgca gctcttgagc ttgtgcagga gcaattgact 2340 gcaggacaca ttgagccctc cacctttcct tggaacaccc ctatttttgt tttaagaaag 2400 gaaacaggaa aatggcggtt attacaagat ctcagagaaa taaataagac actgtttccc 2460 atgggggcaa tacaacctgg ccttccttcc cctgtggcta tccccaaagg ttattttaaa 2520 attattattg atataaaaga ctgtttcttt tccaatccat tagaccctga ggactgcaaa 2580 tattttggtt ttagtgtgct cattgtcaac tttgtgggcc ctatgccatg 
attccaatgg 2640 caagttctac cccaaggcat ggctaacagt cctaacctcg gtcagagata tattgctcaa 2700 attgttgacc cccttcatct ccaatttccc tctctttata ttattaatta catggatgat 2760 attcttgtgt cagaaaagga ccctgagatg gtccatttag cctcccaaca gcttatagag 2820 gcgttccagc aaagaagcct tcaggttgct cctgataagg tacaaataca tccacctcaa 2880 ttgttcttgg gctttgaggt ctttccacat aagattatat cccaaaaggt acagcttaag 2940 aggacttccc tccttacact taatgacttc caatgactct tgggtgacat taattggatt 3000 cgcccttact taaaactcac caccggacag attaagcccc tttttgatat cctccgagga 3060 gactctgatc ccaccttacc tcgttgcctt acacctgagg tctctgaagc cctctccttg 3120 gtcaaggagg ctattgctaa ccaaaaaaat tgcttatttt tcccccatca tccattactt 3180 tttatagtcc tctccacacc cctttcccct actgcagttc tctggcaagg atgtccccag 3240 tattgggtac atttgcttgc ctctcataac aaggtcctgc tgacctaccc cttattggtg 3300 gcacagatta tacgtcttgg gagaaaactg tcacataaat tgtttggtaa agatccagat 3360 gccatgattc ttccctatag cccatctcag gcctcctggc ttggtcaaca cacggacgaa 3420 tgggcaatca gctgtgttgc ttttcagggt aagatagata atcattatcc tccagataga 3480 ttaattcagt tttttcataa gctttctgtt cctgttattt tcccaaaagt tacttgcact 3540 tgccctgtgt ccagggtcca tcttgttttt actgatggtt cctcttatgg ccaggctgcc 3600 ttttccatta atggaaaggt acaccaaata tctgctccct cggactctgc acagttggta 3660 tagctccgag ctgtccttgc agtatttgag accctaccta gatcaccctt taacttgtac 3720 acagatagct cttatcttcc cttttccatt cccttgctgg agatggtacc ttatattcag 3780 cccaccacca atgctgctcc tttgtttgcc actctacaaa agtttattca taagtgcact 3840 cacccctttt atgtgggaca tgtctgaacc cattcaggcc tgctgggctc cctggccgag 3900 ggcagcgatg ccatggattg catcacccaa cttgtagcct tggcccagga ggcttctata 3960 acccctttga ccctggccca acaggcacat gatctacacc atcttaatgt acacaccctt 4020 aggcatcacc caagaacagg cctgccagat agatcataat tgcaaaggct gtgtaacctt 4080 gctacctgac cttcttctgg gggtcaaccc ctgtagattg gtccctggag agctttggca 4140 gatggatgtt acccatatct cctcctttgg aaagttaaaa tatgttcatc ttactataga 4200 tacctttagt ggttttcttt ttgcatccct gcaggcagga gaggccacta agcacgttat 4260 cagcaatgtt ctggcctgct tggtggtgct cccacaacct aaaattatta aaacagataa 
4320 tggacccaga tatattagct ctagttttaa aaacttttgc tcccaatgtg gcattaaaca 4380 tatcaggggc attccttata atcctcaggg acagggtata attgaaagag ctcatcaaac 4440 tcttaaaaat atgatccata aattacagtc aaatggggga atattattcc ccctccctgg 4500 caatcacaaa aaattaataa atcattcact ttttgtcttt aactactctg ttatggacaa 4560 ggacggaaaa actgcagcag accgcctgtg gcacccctat acttcttacg attatgcaca 4620 agtcctatgg aaggacccat taacatcccc atggtatggc cctgatccgg tcctcatttg 4680 gggaggtgga tcagcctgca tttataattc aaagactggg ggtacctgat ggctacccaa 4740 gcgccttgtt aagaccttta acccacccag ggacaaccct gaggaaaagt ttaactatgc 4800 ttaattacag tccacgctga ccatgttccc cttcatctga atctttggag gcatcaacac 4860 atcggccctt gctacatcac ctcaccagat cttcaactat acttgggtga ttcttattgg 4920 caccagagat gtggtgtctg ccaactccag caccattgca caagttccct ggcccaatct 4980 tgaggtagat ctttgtaaat tagccctagg agctcacctt gattgggaca cacctgacac 5040 tttgcccccc aagaaaaagc ccctgagctg gcagacccca gtactgaccc gggttgcagc 5100 aatacactca gaagagtcac attggccctt caaactgggg gaatttattt atgccctgct 5160 ggccacagag atcgtacaaa ggccagaggc tctgggtatg aaactgattt ctactgtgcc 5220 tcctggggct gtgagaccca gctcttcctg ggattatacc aaagctaaaa gaaaattccc 5280 caatgcccaa gtcaccagtc tgggccaata accccttact agtatatatt ccactgactc 5340 ggacctgaac gggtggtgta atcctatcca gatctctttt acagaaacag gtaaaaaggt 5400 aaattgggaa cagcgaggtt tcaagtgggg acttcgctta tataaacaat atagacttgg 5460 gcgtcacccc aaaaattaaa ttgcttaaaa cccggccaca aacaccacca gtggccatgg 5520 ggcccaataa taacttacac cctgcaccca gggttgggtc gcatttgaga gattctctac 5580 aggctatacc tcaacccaac aattcatcat ttccttccac cctttatcaa ccaacattcc 5640 ctgatgggcc ttcctctact gctaatatgc tcatagaaat gcttaatacc ttctaaccag 5700 cccttgccaa tcttaattct attatagccc ttgatgattg ctggatctgt tatcactcat 5760 cccctccctt ttacaaggga ctagcaacct ttgaaaacat cacctttact aatactaatg 5820 agacccattc cctatactgt tcagcaaatt ctgaaccaac aataacactc tcacaggtct 5880 ctggaattgg actctgtcta ttaggacccc atatgggcct tcccccctga cacttatttg 5940 caataagaca gtggtggtta ccacacagat caacgacatt actgccccta agggcactta 6000 
ttttgcctgc tcagcaggct taataccttt tgtggtaaca tctacttttt gacaggaaaa 6060 gattattgtg tccttgtcct cctttttcca cgattaacta ttcatgactc ctctaagttc 6120 ctgcagttct gggaaggagg cactgccaca agtacgaaga gagaacccat caccactgtg 6180 actctggcag tactcctagg tctgtcagct gtggaggcag gaactggggt tgcctccctg 6240 atcacctccc aacagagtta tcatcagctc attgccacca tagacaggga catcagtgag 6300 ctttaagata ggaataacct atcttaaaga ttcagtcgct tccttggctg aggttgttct 6360 acaaaataga aggggattag atcttctctt tctccaacag ggagggctct gtgcggccct 6420 caaggaggag tgttgctttt atgctgataa gacaggatta gtagaaaaca gcctacaaaa 6480 agttagagaa agtctacaaa agagacatag agaaagagaa aagagtgaag cctggtataa 6540 aaattggttt tctgcctctc ccctgctcac cactcttcta cccagtattc tgggtccctt 6600 tataggactt ctgctacctg tctcctttgt cccatgggtg gttagaaagt taactgattt 6660 tatcaaggcc caggtggatt tagctaccaa acagatctct gtctattatc accgcctccg 6720 ttgtgaggag gctgaaacaa ctggggagtg tgatacccac ggctggcctc aactactcca 6780 ccctaaatac agcctataac tcaggctgga aaatcagtcg ctggtggaag cgtcaatgac 6840 gggaatatta tggtgccaca taccaaacac acacctcaca gctgggaagt gaccctatga 6900 tgggataagt atgggttcat aatctgggtc acccactctc acaaaacaca cttctatagc 6960 ccagaacggt gtgttaaagc ataagatcat tcctagccat tggggtggag tcctaactgc 7020 aagagtggct caccatggcc aagctgggca ctctgtgagg catgtctgat ctatctcaga 7080 ccactacttc caccctggcg gaaatactgg aacctaagtg agagtgtcag acatgggggc 7140 aacctgtgta gcctaagaca cgtggcttca ccaaccaggt gctggttgtg catgcacata 7200 gccaagacct tccactgata ctttagaaag g 7231 // ID MER115 repbase; DNA; ROD; 693 BP. XX AC . XX DT 31-MAY-2001 (Rel. 6.04, Created) DT 31-MAY-2001 (Rel. 6.04, Last updated, Version 3) XX DE Non-autonomous hAT-like DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; MER115. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-634 RA Jurka J.; RT "MER115."; RL Direct Submission to Repbase Update (28-FEB-1999). 
XX RN [2] RP 1-693 RA Smit A.F.; RT "MER115."; RL Direct Submission to Repbase Update (31-JAN-2000). XX DR [2] (Consensus) XX CC The repeat was found by [1]. CC It has been classified by [2] as a nonautonomous DNA transposon CC with 14 bp (imperfect) terminal inverted repeats and an 8 bp target CC site duplication. It shares the terminal regions (bp 1-67 and 535-693) CC with Zaphod, on which it probably depended for transposition. CC >25% diverged, ~1000 copies in the genome. XX SQ Sequence 693 BP; 112 A; 213 C; 233 G; 135 T; 0 other; cagtaccgcc cttagacctg ggcaagaggg gcccctgccc tgggccctgc gctttagagg 60 gctccgctct ggccctcctc cggcgcggcc cttccccacg gggcgaggag tccgcggggc 120 caaggggacg tgcccacccg gagcccatgc ccccccttct agaccacgct ccgggtaccc 180 gggaccccgg aattccctgc ccaaatggcc ccgagcccgc ttccagggcc tgcgcgggcc 240 tcttccctgg gtccgtcctc ccaagggcgg accgcgccgc cggtgtgtgc acccctaggc 300 ccgaggggtg gccaaggggc ggctgtttgc ggggggtgtg gacggagctt ggacgtgcgg 360 gctggggtgt ccacatgcgt gcacgcgagg cccctcgcgg tgcaggacgg agccgggggt 420 gggaagagaa ggggagtagg ccacgggcca ggggctggct ctccccaggc cgccacgttc 480 tggcatggaa ctccgaggag tccgagaatt ctaaattcga acctggcctt ccaggtcgtt 540 atgaaggtat atttgtcaag gtaggaggat agaacatatt ttatttaaca gtttgttagc 600 ttgatttata acttttaaat atttagacat atggtatgtg ggcctccatt tgtactcttg 660 ccccgggccc cgcaaatgtt aggggcgggc ctg 693 // ID RLTR30_MM repbase; DNA; ROD; 441 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 14-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW retrotransposon; RLTR30_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-441 RA Jurka J.
and Drazkiewicz A.; RT "RLTR30_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 15-15 (2002). XX DR [1] (Consensus) XX SQ Sequence 441 BP; 103 A; 100 C; 124 G; 114 T; 0 other; tgtaagggtc catgattcgc cgaagaatga taccccagac tcaaatagta tgtaaacgca 60 aagagtgttt tattctgcag aagtccagca tgctggggtc tcccattacc aagatagaga 120 gacaaccaag tgagcttgca ggcctgattt aaagcacatt aggggaattc tggggtaggt 180 gacctttatc ttaatctgtt gggtccatct ctagggacat tccattaccg gggtgggggg 240 ctggaaactg ttgctgggga agtcgctggg gaagtctgga aactgctgct gacccattgt 300 ccttgcctca ggccaggtgg cggggcagct tctgaggcct ggacttgcct ggacttgccc 360 agttcttgga aacagagatt taggcctagt ctccttaact gccaatttga agcctgtcat 420 ggagtcagcc tagccttctc a 441 // ID HERVS71 repbase; DNA; ROD; 5491 BP. XX AC Z70664; XX DT 27-JAN-1997 (Rel. 6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE Internal sequence of endogenous retrovirus HERVS71. XX KW Endogenous retrovirus S71; simian sarcoma virus; HERVS71. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-5491 RA Werner T., Brack-Werner R., Leib-Moesch C., Backhaus H., Erfle V. RA and Hehlmann R.; RT "S71 is a phylogenetically distinct human endogenous retroviral RT element with structural and sequence homology to simian sarcoma RT virus (SSV)."; RL Virology 174, 225-238 (1990). XX RN [2] RP 1-5491 RA Kabat P., Tristem M., Opavsky R. and Pastorek J.; RT "Human endogenous retrovirus HC2 is a new member of the S71 RT retroviral subgroup with a full-length pol gene."; RL Virology 226, 83-94 (1996). XX DR GenBank; Z70664; Positions 1 5491. XX CC LTRs of HERVS71 are listed in REPBASE as LTR6 sequences.
XX SQ Sequence 5491 BP; 1452 A; 1521 C; 1160 G; 1358 T; 0 other; ggatccaaaa tagccaccct gcagacggcc ttgctcacct tttctgtcat cccataactt 60 ttccagtgcc cttaaataac acactatgta cacaaaccta tgcctgtgct gctttactct 120 gtctggaccc ttattctatc cccctgtggc tactctccta ccttaggaaa gatctgagtg 180 gcccctttcc tccttacccc catcccttaa cccacacagc tcgttttcct gtgtcacagc 240 aagtccagca cctccaagac ttggctctgc tctccatcct aaaaccctta aaagaaaggg 300 ctgagtttga actttttgcc tttgagtcgg ggagacacca aagatatttg gctataagtc 360 aaaaaggaag gggggggtca cataggtccc actggcctca gacccacctc ttgtcctctc 420 tctagatctc aaagcttaaa gagacagatc ttatgtggca agaaatgttg gctatagttg 480 ttttcctact tcttctggtt ataatacttc tgttcttcca atactacagc cccccaggcc 540 atgaatatct ctgtctgtgc tgggtttaat atttctgctt aaaccttgtt aattgcctcc 600 agaatgggaa actcttcttc ctggccccgt agagattaca gccctctcca atgtatgttg 660 cagaatttct ctctgggttc tcagaggatt acggagtcca ccttaagaaa ggcaaactcc 720 agacactctg tgaagtagaa tggccacagt ttggaactgg gtggcaccag aagggtcatt 780 aaacctcaca actgttcagg ctgtgtggcg ggtcatggct ggaactcccg gacaccctga 840 tcagtttcct tacattgatc aatggctaga tttggtccag agccctcctc catggctccg 900 ctcacgtgcc attcatgatc ccacctccaa ggtccttttg agctggacca cacttttgcc 960 ccaaccctca cgtcggctcc tcctgtacag tctccttctg aagaagagga aagttctctt 1020 cacccatttc tgcctcccta taacctcctg cccacccccc ccccccgccc cagaatattt 1080 ccttgtctcc tcgactacat cccctgtggc ctctccacct atagccaccc aattacggca 1140 tcggctggag aggtggccct ccttctccca ctgacagagg cccaaatcct ctgggcaatg 1200 aggctctgct ccatttttag tttatgtccc cttctctgtt tctgacctgt acaactggaa 1260 ggctcataat cccccccttt tctgaaaagc cccaggtctt gacctcactg atggagtccg 1320 tgctccggaa tcaccggccc acctgggatg actgtcagca acttctttta acccttttca 1380 cctctgaaga gagggaccat atccgaagaa aggtcagaaa gtatttcctc acatcagctg 1440 gtagaccaga ggaggaagcc cgggacctcc ttgaggagac ttttccctct acctggcctg 1500 attgggatac aaaatcctcg ggtgggaaga gagctttgga taattttcac tggtatgtcc 1560 ttgtgggtat caagggagcc actcaaaaac ccatgaatct gtccaagaca actgaagctg 1620 tccaggggcc taatgagtca ccaggagtgt 
ttctagaacg cctcctggag gcctatcaga 1680 tttacacccc ttttgacccg gaggctcccg agaatagccg tgctattaat ttggcatttg 1740 tgactcaggt agcccctgat attataagaa aattacaaaa gctggaagga tttgctggaa 1800 tgaacagcag ccaacttttc aaatagccca gaaagttttt gacaattgag agtttgaaag 1860 gcaaaaacag gtagctcagg cagctgaaaa ggctgctgac aaagcatcaa aaagacaggc 1920 aaagatctta gtggctgcca tccaaggaag caagaaggca gggcccccat cacagagcac 1980 cagccagggg accccaggtc cccaccagaa aggccaaaaa ggtgagcagg ctcccctaca 2040 aagaaaccaa tgtgcttatt gcaaacaaat tggacacagg aaaaaagaat gctcattaaa 2100 accagaggaa aaacaagaga agaaaaaggt cctcaccctc cctgctgtgg atgaatctga 2160 agattgacag ggccggggct gccacttcct tcacccccag gagcccttgg tgactgccac 2220 agtgggggcc cagcctgtat gcttcctaat cgacactggg gcggaagact tggtactgca 2280 aacacccttg ggcagtgtct ctaataaaaa ggtggctgtg caagggactt cataagcttc 2340 ataagctgca ggcatccatc tccttctcag cccaataagc tcacctcaca ttaggggacc 2400 caacaccctc taccacccag ctcctgctaa ccaccccttt gtcagaggaa tatctcttag 2460 tttcaccctc acaaccgctg gagaataaaa ctaatcctct cctactggat ttacagactc 2520 tctttcctca agtctgggcc aagtcaaacc cccccaggac tggcaaagca ccatctgcca 2580 gtagttgtag aactcctggc cactgccctg ccagtccagg taaaacaata tcctatgagt 2640 cagtgggcta gagagggaat caatccccat attcagtctg cctggaatac tccatttttg 2700 ctggtccaga aacctggaac aaatgattac cggcctgtac aggacttgca ggaagttaac 2760 aagtggacag tcactgtcca tccaactgtc cctaaccctt atattttact cggcctgctt 2820 ccaccagaac atacagcata cactgttctt gacttaaagg atgctttctt tgctattcct 2880 ctggccccta aaagccaacc tatatttgct tctgaatgga tggaccctgg ctcaggagac 2940 accactcaat taacctggac ttggttaccc cagggtttaa aaaattcccc cacccttttt 3000 ggggaagccc tccaacaaga tcttataccg ttctgagcca gtcaccctaa ctgcacgctt 3060 ctccagtaca tagacgacct gtttttggct actgaaacca ctgacagctg cctgcaacat 3120 actagggacc tactttacct ccttcaggaa ctcgggtatt gggtctcagc caagaaggcc 3180 cagctttgtc ttcccagact tttctaccta ggatacaaga taaacaaggg agaaagggca 3240 cttgccactg ctcgaaagga agccatcctg caaatcccca ctcccaccac taggagatgg 3300 gtacatgaat tcttaggggc tgtgggatac tgtcgtttat 
ggatattggg gttcgcagaa 3360 atcaccaagc ccctgtacac cactaccaga gggaatggcc cacatgtttg gactgacaaa 3420 gaacaggctt ttcaaaatct aaagaaggca ttaactgagg tccctgctct tgccctccca 3480 aatatctcag aaccatttca tctttttgtt catgaaagcc agggagtcac taaaggggta 3540 ctcactcaaa ctttaggacc atggtgatgc ccggtggcct atttgtctaa gatactggac 3600 cctgtggctc cgggtgacca agttgtctgt gagccatagc ggcaaaagca agcctggtcc 3660 aggaggctga taaactgact ctgggccaga atttaaccct tatggctcct catgccatag 3720 agactttgct acaaaggcgc tctggcaaat ggatgtcgaa tgctcacatc ctgcagtatc 3780 agagtttact gttagatcag ccttggttaa ctttctctcc cacaaggtgt ttaaatccag 3840 ctacctttct ccctgatcca gaccttacca cacctgtcca tgactgccaa gaactgttag 3900 agactacata aactggccga cctgatctcc aagatgtgcc tctaaaggag gtggactcca 3960 ctctgtttac tgacagcagc agcttccttg aacagggagt aggaaaggct ggtgcagccg 4020 ttactatgga gacagatgta ctgtgggccc aggcactgcc ggcaggtacc tcagcacaga 4080 aggctgaatt ggtcaccttc actcaggctc tctgatgggg taaggacaaa cgtattaaca 4140 tctacactga cagcaggtat gtttttccta ctgtacatgt acacaaagcc atctatcaag 4200 agtgagggct actcaccagg aaagactatt aaaaacaaag aagaaatttt ggccctgctt 4260 gaagctgttt ggcttcctcc gcaggtggct gtaattcact gcaaatgtca tcaaacagaa 4320 ggcatggcta ttgcctgtgg taaccaaaaa gcaggctctg cagctcgaga ggcagcttgg 4380 ctcccagtca cgcctttgac cctgctgccc actgtgtcct ttccgcaacc tgacctacca 4440 gaccacccac aatactcccc agaggaagaa aaacaagctt cagatctttg ggccagtaaa 4500 tatcaggaag gtttggtgga ttcttcctga ttccagaatc tttattgccc caagtcccct 4560 gggaaacttt aatcaatcat ctgcattctg ccacccattt gggaggaata aaactggccc 4620 agcttcttag gagccatttc aacatccccc accttcagga cttaactaac caagcagctc 4680 tctggtgtat ggattgtgct caggtaaaca ccaaacaagg tcctaagccc agctcagtcc 4740 accctccagg gaggctctcc ccgagaaagg tgggaagttg actttacaga aataaaacca 4800 cactgggcag ggtataaata cctcctagtg ctaatagaca ccttttcggg atggactaag 4860 gcatttacca ctggaaacga aactgccacc atggtagtta ggcttttact cattaaaatc 4920 atctctcaac atgggctgcc tgttgccata gggtctgata atggaccagc cttcacctcg 4980 tccatggctc agtcagtcag caaggcatta aacattaaat ggaaactcca 
ttgcacctat 5040 tgaccccaga gctctggaca ggtagaacgc atgaaccaca caataaaaag tactcttact 5100 aagttaatcc tagagaccag tgagaattgg gtaaagctcc ttcctttagc ccttcttaga 5160 gtaagataca ccacttactg ggctgggttt tcaccttttg aaatcatgta tggaagggct 5220 cctctatctt gcctaagcta agggatacca atttagcaga aatctcacaa gctaatttgt 5280 tcagtacctg cagtctctcc aacaggtatg agacaccatc cagccacttg tccagggagc 5340 acactccaat ccagttcctg accagactgg ccctgccact ttttccagcc aggtgactta 5400 gtataggtta aaaagttcca gaaggaagga ttcactcctg cctgaaaagg acctcatact 5460 gtcatcctca ccacgccgat ggctctgaaa g 5491 // ID L1MD_5 repbase; DNA; ROD; 2728 BP. XX AC . XX DT 09-OCT-1997 (Rel. 6.3, Created) DT 09-FEB-2000 (Rel. 7.1, Last updated, Version 3) XX DE Partial L1MD LINE1 repetitive element 5' end - a consensus. XX KW L1 repeat; L1MD_5; MER79; L1M6_5. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 500-1 RA Smit F.A.; RT "L1MD_5."; RL Direct Submission to Repbase Update (1996). XX RN [2] RP 1-1604 RA Jurka J., Walichiewicz J. and Kapitonov V.V.; RT "L1MD_5."; RL Direct Submission to Repbase Update (28-JUL-1997). XX RN [3] RP 1-1604 RA Smit F.A.; RT "L1MD_5."; RL Direct Submission to Repbase Update (19-AUG-1997). XX RN [4] RP 1605-2728 RA Jurka J.; RT "L1MD_5."; RL Direct Submission to Repbase Update (02-FEB-2000). XX CC 5' end of L1M subfamilies. CC Originally expanded to 1619 bp and classified as a 5'-portion of CC L1 CC (Jurka et al. [2]). A shorter version submitted by A.F.A. Smit on CC Aug. 6 - deposited in the Appendix. Minor refinements of the 1619 CC bp consensus [2] worked out by A.F.A. Smit (August 19, 1997 [3]). CC Replaces MER79 [1] and L1M6_5 [2]. Average divergence from CC consensus is CC 24%. Appears to be 5' end of L1MD1 and L1MD2 subfamily LINEs. 
XX SQ Sequence 2728 BP; 1032 A; 580 C; 593 G; 509 T; 14 other; ttaaaaacaa agcgggagac ttccgcttcc gggaagatgg agtagacgta cttttcccta 60 ttcctcccgc taagtacaac taaaaaccct ggacattata tataaaacaa acataagaag 120 actctgaaag gtggagagaa gaaggcagac cggctaggga cctcgggacc cgaggaacga 180 cacggtagtg agttccctgg gttttctttt tgcctcatat atcccagact tggagctgaa 240 gaagccggca acccggaaac gccaacgggc acagacaaaa aaagccccaa caaaagcctg 300 ctctctctag ccaaaggacc aggaaagggg cagcctagca agacagaaaa cttttagaca 360 ataaccgctc tactccagcc aaacaccaca gaaaaaactg tggccccacc cccacccacg 420 ccagcaaagg ccgagtgggg agcctagact tccaccctca ccaggctgta acgaggcgcc 480 ccaacacctc caccgggatg gtgtcagaga aggccaagta gggagctggg actttcatcc 540 ccgccaggcg gtaatgaggc ccmccttccc cttgccmctg cggtgtcagt ggagaccacg 600 tggggagcct ggacttccac ccccacccgg cagtaatgag gcgcccctcc ccctccctac 660 tggggtggtg tcagaggagg cctagtggag agtcgggact ttcaccaccg cccagcggta 720 atgaagccac ctcctcctct tgccmccatg gtgtcagtgg aggccacgtg gggagcagta 780 atgaggcact cctacccctc ccagccaggg aggtatcagc ggaggcctag tggggagccg 840 aactcccacc cccgcccagc agtaacgagg agcccctccc tcacctcggg tgtcaacgga 900 ggccgagtgg ggaacctgga cttctacccc cacctggcag taatgaggca gcgcccctnc 960 ccctcycctg ccggagcggt gtcagaggaa gccggctaaa acagaaggtt taaataagat 1020 ccagagtctc ataacataat acccaaaatg tccaggtttc aatcgaaaat cactcgtcat 1080 accaagaacc aggaaratct caaactgaat gagaaaagac aatcaataga cgccaacacc 1140 gagatgacag agatgttaga attatctgac aaagatttta aagcagccat cataaaaaat 1200 gcttcaataa gcaattacga acgtgcttga aacaagaraa aagtagaaag cctcagcaaa 1260 gaaatagaaa gtctcagcaa agaaatagaa gatataaaga agaaccaaat ggaaatttta 1320 gaactgaaaa atacaataac cgaaataaaa anctcaatgr atgggctcaa tagcagaatg 1380 gaggggacag aggaaagaac cagtgaactt gaagatagag caacagaaat tacccattct 1440 gaacaataga gagaaaatag attggaaaaa aaaatggaca gagcctcagg gacctgtggg 1500 actataacaa aagatctaac attcgtgtca tcggagtccc agaggagagg aaaaagagrr 1560 tagtatttga agaaataatg gctgaaaatt tcccaaattt ggcaaaagac ataaacctac 1620 agatagattc aagaagctga gtgaacccca 
aacaggataa acccaaagaa atccacacca 1680 agacacatca tagtcaaact tctgaaaact aaagacaaag aaaaaaaaat catcttgaaa 1740 gcagcgagag agaaatgaca ccttacctat aggggaaaaa caattcaaat gacagtggat 1800 ttctcatcag aaaccatgga ggccagaagg aagtggcaca acaatttttt caagtgctga 1860 aagaaaagaa ctgtcaaccc agaattctat atccagyaaa aatatccttc aggaatgaag 1920 gggaaatcaa gacattctca gatgaagaaa aactaagaga atttgttacc agcagaccta 1980 ccctaaaaga atggctaaag gaagttctct aaacagaaag gaaatgataa aagaaggaat 2040 cttggaacat caggaaggaa gaaagaacat agtaagaagc aaaaatatgg gtaaatacaa 2100 tagactttcc ttctcctctt gagttttcta aattatgttt gatggttgaa gcaaaaatta 2160 taacactgtc tgatgtggtt ctngcnaaaa atgtatgtag aggaaatatt taagacaatt 2220 atattataaa ttgggggagg gtaaagggac ataaagggag gtaaggtttc tacacttcac 2280 ttgaactggt aaaatgataa caccagtaga ctgtgataag ttatgtatat ataatgtaat 2340 acctagagca accactaaaa aagctataca aagagatata ctcaaaaaca ctatagataa 2400 atcaaaatgg aattctaaaa aaaatgttca agtaacccac aggaaggcag gaaaaagaaa 2460 acagagaaat gaaaaacaga acaaacagaa aacaaaaaat aaaatggcag acttaagccc 2520 taacatatca ataattacat taaatgtaaa tggtctaaat acaccaatta aaagacagag 2580 agattggcag agtggattaa aaaacatgac ccaactatat gctgtctaca agaaactcac 2640 ttcaaatata ataatatagg caggttgaaa gtaaaaggat ggaaaaagat atatcatgca 2700 aacattaatc aaaagaaagc aggagtgg 2728 // ID RNSAT1b repbase; DNA; ROD; 168 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 29-MAR-2010 (Rel. 10.08, Last updated, Version 2) XX DE Satellite from rat. XX KW Satellite; Simple Repeat; RNSAT1b. XX OS Rattus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-168 RA Smit A.F.; RT "RNSAT1b - Satellite from rat."; RL Direct Submission to Repbase Update (06-SEP-2005). 
XX DR [1] (Consensus) XX SQ Sequence 168 BP; 62 A; 28 C; 30 G; 44 T; 4 other; taaagccttt gtacgtaatk gtagtctccg aamacataaa gcaatacata ctggagtgaa 60 accttacaaa tgtaatcaat gtggtaaagc ctttgtacgt aatkgtagtc tccgaamaca 120 taaagcaata catactggag tgaaacctta caaatgtaat caatgtgg 168 // ID RMER10A repbase; DNA; ROD; 389 BP. XX AC . XX DT 26-SEP-1997 (Rel. 3.1, Created) DT 26-SEP-1997 (Rel. 3.1, Last updated, Version 1) XX DE Long terminal repeat of retrovirus-like element. XX KW putative long terminal repeat; RMER10A. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-389 RA Smit F.A.; RT "RMER10A."; RL Direct Submission to Repbase Update (1997). XX DR [1] (Consensus) XX CC Average divergence from consensus 19%. XX SQ Sequence 389 BP; 75 A; 137 C; 75 G; 101 T; 1 other; tgtttggttt cgaaccccgg acaaggggct gaggtgcata ttttacatat caaagcagac 60 ctggcctcca ggttctccca gcatccctca gtccctacct ggcatacccc gcccccaacc 120 ctgaactttc cagcccaggg gctgggctgc ccttccccca gaggctcttc cctatataat 180 ccagacattt tggtctcccc gttctctctc tgcacatggg cactttctct tccctcctcc 240 ctcctacccc gtccccatgg cgacttccct ggcctcggtc cttggggcca gtgaactcac 300 ccgagagcag yttcccaata aacctgcctt caatataatc taatctggct tgaattggct 360 catttcaccg gcggagaaat aatttatca 389 // ID MSTAR repbase; DNA; ROD; 1651 BP. XX AC . XX DT 01-MAY-1996 (Rel. 1.04, Created) DT 24-JUL-2000 (Rel. 5.06, Last updated, Version 5) XX DE MSTa- LTR internal retrotransposon sequence - a consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; MaLR family; KW gag; LTR retrotransposon; MstII; MSTa subfamily; MER10; KW MST-internal; MSTAR. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RA Smit A.F.; RT "Structure and evolution of mammalian interspersed repeats."; RL PhD dissertation, Univ Southern California, 1995. 
XX DR [1] (Consensus) XX CC Internal sequence consensus for MSTA retrovirus-like element CC (MaLR). CC The ORF from bp 48-1469 encoded a protein derived from an CC ERVL-like CC GAG protein (see MLT1CR). On average 12% diverged from consensus. XX SQ Sequence 1651 BP; 436 A; 353 C; 489 G; 371 T; 2 other; gaawattggt actgaggagt ggagcattgc tataaagata cctgaaaatg tggaagcgac 60 tttggaactg ggtaacaggc agaggttgga agagtttgga gggctcagaa gaagacagga 120 agatgaggga aagtttggaa cttcttagag acttgttaaa tggttgtgac caaaatgctg 180 atagtgatat ggacagtaag ggccaggctg acgaggtctc agatggaaat gaggaactta 240 ttgggaactg gagcaaaggt cactcttgtt atacattagc aaagagcttg gctgcatttt 300 gcccctgccc tagagatttg tggaagtttg aacttgagag tgatgatcta gggtatctgg 360 cggaagaaat ttctaagcag caaagcgttc aagatgtgac ctggctgctt ttaacagctt 420 acagtcatat gcgagagcaa agaaatcact taaagttgga atttatattt aaaagggaag 480 cagagcgtaa aagtttggaa aatttgcagc ctggccatgt gatagaaaag aaaaacccgt 540 tttctggaga gaaattcaag caggctgcgg agcgaccgtt tgctaaagag attagcataa 600 ctaaaaggaa gccaagtgct gatagccaag acaatgggaa aaaggcctcg aaggcatttc 660 agaaatcttc gaggtggtcc ttcccatcac aggcccagag gcctaggagg actgaatggt 720 ttcgtgggcc aggcccaggg ccccgctgcc ctgtgcagcc tcgggacact gctccctgca 780 tcccggctgc tycggctcca gccgtggctc aaagggcccc aggtacagct cgagctgccg 840 cttcggagag tgcaagctat aagccttggt ggcttccaca tggtgttaag cctgcaggtg 900 cacagaatgc aagagtgaag gaggcttggc agcctccacc tagatttcag aggatgtatg 960 ggaaatcctg ggtgcccagg cagaagcctg ctgcagggac ggagccctca cagagaacct 1020 ctactagagc agtgccaaag ggaaatgtgg ggttggagcc cccacacaga gtccccaccg 1080 gggcactgcc tagtggagct gtgggaaggg ggccactgtc ctccagaccc cagaatggta 1140 gagccactgg cagcgtgcac cgccagcctg gaaaagccgc aggcatcaga ctccaacccg 1200 tgagagcagc cacgtgggct gtgcccagca aagccacagg ggcggagctg cccaaggcct 1260 tgggagccca cccctcgcac cagcgtgccc tggatgcgag acacggagtc aaaggagatt 1320 attttggagc tttaagattt aatgactgcc ctgctgggtt tcggacttgc gtggggcctg 1380 tagccccttt cttttggccc atttctccct tttggaatgg aaatatttac ccaatgcctg 1440 
taccaccatt gtatcttgga agtaaataac ttctttttga ttttacaggc tcataggtgg 1500 aaggaacttg ccttgtctca gatgagactt tggactttgg acttttgagt taatgctgga 1560 atgagttaag actttggggg actgttggga aggcatgatt gtattttgca atgtgagaag 1620 gacgtgagat ttgggggaac caggggcaga a 1651 // ID RLTR20B1_MM repbase; DNA; ROD; 566 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; KW RLTR20B1_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-566 RA Pavlicek A. and Jurka J.; RT "RLTR20B1_MM - a family of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Similar to RLTR20B2 and RLTR28_MM. Individual CC copies are ~88% identical to the consensus. 6 bp TSDs. 
XX SQ Sequence 566 BP; 150 A; 101 C; 182 G; 133 T; 0 other; tgttgtggat ttggtttaat gctacttgtg ttatgttaat tgggttccca aaattacatg 60 ggaattcgca cgtctgcatg taagacgctg agggtccctg cccccagttg gctctaattg 120 gtaaataaag ttgccggtgg ccaatggctg ggcagggaga cagaggtggg actttagatt 180 tcccgggcaa gggaaccaag ggaagaagaa ggattttaga atcgccatgc cagggaagca 240 ggaggatcag gcttgagagc tgcaggagag aaagcataca gccatgtaag agccagggaa 300 gagcggcccc aggggcccct cccccgattg ggtctggggt agcaaagatg gaatatagat 360 tttagtaagt aataattcag gagtatcgga ggggaggcgt tagcaatgtg gaagtttggg 420 agtggcccag ccattgagct gcttagggca tattaaaata taaggctgtg tgttgtgtgt 480 ctttcattca agaatccaga gcatttgggg gcaggtagca aggaacacgc gctgccaccg 540 ccggggagtt tagagtagat taatca 566 // ID ORSL repbase; DNA; ROD; 275 BP. XX AC . XX DT 30-APR-1998 (Rel. 3.03, Created) DT 26-JUN-2008 (Rel. 5.05, Last updated, Version 4) XX DE Putative non-autonomous, hAT-like DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; KW Origin of replication-like (ORS8) region; ORSL. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 66-275 RA Rao S.B., Zannis-Hadjopoulos M., Price B.G., Reitman M. RA and Martin G.R.; RT "Direct submission."; RL Unpublished (1989). XX RN [2] RA Rao S.B., Zannis-Hadjopoulos M., Price B.G., Reitman M. RA and Martin G.R.; RT "Sequence similarities among monkey DNA-replication ori-enriched RT (ors) fragments."; RL Gene 87, 233-242 (1990). XX RN [3] RP 1-275 RA Jurka J.; RT "ORSL."; RL Direct Submission to Repbase Update (13-APR-1998). XX RN [4] RP 1-275 RA Smit A.F.; RT "ORSL."; RL Direct Submission to Repbase Update (31-JAN-2000). XX DR [4] (Consensus) XX CC ORSL shares ~200bp stretch of similarity with African Green CC Monkey origin of replication region (Acc. No. M26221). About 1000 CC copies in our genome. The first version of the consensus sequence CC [3] has been significantly shortened [4]. 
This repeat has been CC classified as a putative hAT transposon [4] based on identification of CC 14-bp terminal inverted repeats and 8-bp target site CC duplications similar to other hAT transposons (esp. MER45, CC MER69). On average 21% divergence level. XX SQ Sequence 275 BP; 95 A; 48 C; 51 G; 81 T; 0 other; cagggccgac ttatccatta ggcacagtag gcacagtgcc tagggcccac gatactttta 60 ggggcccacg aaaatgtttt aatttctttt aaaatcagaa gaaaaaatga acttttaggt 120 caaagaaaat gttttaatat ataatattaa tatattctat attcatcttt ataccaatgc 180 agtcataaaa tataattttt aatatttttt tatggaggaa ggggcccacg aaggcaaaag 240 tgcctagggc ccacgaaagt cataatgcag ccctg 275 // ID L1MD2 repbase; DNA; ROD; 1087 BP. XX AC . XX DT 20-FEB-1997 (Rel. 5, Created) DT 20-FEB-1997 (Rel. 5, Last updated, Version 1) XX DE 3'-end of L1 repeat (subfamily L1MD2) - a consensus sequence. XX KW Repetitive sequence; L1 (LINE) family; L1MD2 subfamily; L1MD2. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1087 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [1] (Consensus) XX CC Temporarily contains ORF2 region consensus of L1MB7 (subfam L1M4) CC ORF2 ends at bp 675.
XX SQ Sequence 1087 BP; 416 A; 154 C; 190 G; 272 T; 55 other; yttgtatcca gaatatataa agaactctta caactcaaca ataaaaaaac aaacaaccta 60 attaaaaaat gggcaaaaga yttgaataga tatttctcca aagaagatat ayaaatggcc 120 aataagcaca tgaaaagatr ctcaacatca ttagtcatca gggaaatgca aatyaaaacc 180 acaatgagat aycacttcac acccaytaga atggctaaaa ttaaaaagac agrmaatamc 240 aartgttggy raggatgtrg agaaaytgga achctcatac aytgctggtg ggaatgtaaa 300 atggtacarc yactttggaa aacagtttgg cagttcctca aaaagttaam aatagagtta 360 ccatatgacc cagcaattyc actcctaggt atwtacccaa gagaaatgaa aacatayrtc 420 cayacaaaaa cttgtacaca aatgttcata gcagcattat tcataatagc caaaaagtgg 480 aaacaaccca aatgtccatc aatratwgaa tggataaaca aaatgtggta tatccataca 540 atggaatatt attcagcmat aaaaaggaat gaagtamtga tmyatgcaac aacatggatg 600 aaccttgaaa acattatgct aagtgaaaga agccarrcac aaaagrccac atattgtatg 660 attccattta tataacattc ttaaaacgac aaaactatag agatggagaa cagattagtg 720 rttgccaggk gttagggatr caggraggag gtggatgtgr ytataaarrg gtagcatgag 780 rgaatyttta tggtgatgga acwgttctgt atcttgaytg tggtgrtggt tacacgaatn 840 tatacatgtg ataaaattgc atagaactaa atacacacac acacataagt acaagtaaaa 900 ctggcgaaat ctgaataaga ttratggatt gtaycaatgt caatttcctg gttktgatat 960 tgtaytatag ttttccaaga tgttaccatt gggggaagct gggtgaaggg tacacgggat 1020 ctctctgtay tatttcttac aattgtctgt gagtktayaa ttatttcaaa ataaagttta 1080 atttaaa 1087 // ID MER77 repbase; DNA; ROD; 650 BP. XX AC . XX DT 07-OCT-1998 (Rel. 3.1, Created) DT 07-OCT-1998 (Rel. 3.1, Last updated, Version 1) XX DE MER77 repetitive element - a consensus. XX KW Interspersed repeat; MER77. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-650 RA Smit F.A.; RT "MER77."; RL Direct Submission to Repbase Update (1996). XX DR [1] (Consensus) XX CC Related to MER68 and MER21. CC Assigned to rodents by J. Jurka, October 07, 1998. 
XX SQ Sequence 650 BP; 164 A; 161 C; 156 G; 137 T; 32 other; actacggggg gaggtgtcaa attcaaaggt ctccaagacm accctcctgt ttartgattc 60 gctagaagga ctcacagaac tcagaaaagc cgttatactc acagttacgg tttattacag 120 tgaaaggata cagattaaag tcagcaaagg graaaggcac ataggncagn gtccaagaga 180 rmcaggcacg rgcttccagt tgtcctctcc cggcggagtc gtrcgggcag cgcttaattc 240 tcccagcaac grtgtgtgac agcacgcatg aagtattgcc aaccagggaa gctcacccga 300 gccttggtgt ccagagtttt trttgggggt cggtnanata ggcatggntg accccgcagc 360 atggctgacc ttggtcttct caggcttgag ccncccaagc atggctgacc tnagttactc 420 agtctycagc ccctccagar gtcarryacc gtgtagccta aggcccccac cataaatcac 480 attgttagca trractgtcc ggtatggccc aaggccytcg cagataaaca aagayactyt 540 tatcaggcag gacattccaa gggcttagag gttatctccc arrgcctgag gataamcgag 600 ggccaganct ttctttgggc aaggttaatc ctttactgya yaagaccaca 650 // ID ERVB4_2-I_MM repbase; DNA; ROD; 8374 BP. XX AC AC110500; XX DT 11-FEB-2004 (Rel. 9.01, Created) DT 11-FEB-2004 (Rel. 9.01, Last updated, Version 1) XX DE Mouse endogeneous betaretrovirus ERVB4_2 internal sequence. XX KW Endogenous Retrovirus; Transposable Element; gag domain; KW pol domain; endogeneous betaretrovirus; MmERV-B4_AC110500; KW ERVB4_2-I_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-8374 RA Baillie J.G., Van de lagemaat N.L., Baust C. and Mager L.D.; RT "Multiple groups of endogeneous betaretroviruses in mice, rats, RT and other mammals."; RL J. Virol 78(11), 5784-5798 (2004). XX DR Genbank; AC110500; Positions 52135 60587. 
XX SQ Sequence 8374 BP; 2398 A; 2059 C; 1736 G; 2181 T; 0 other; agtggcgccc gacgtggagc acgaggtacg actgccctcc ggacaacgga ttaaggatta 60 aagaggtacc gcgcagacac agaaacagtg ggacagtccg ccgatacttc gctcgagtgt 120 cgaggccgga cgtcggagag gtaagcccgg gtattcagtt gttgtcccat taaccatggg 180 gaaggttctg tccaaagagg cagtttttat tcaagagatg aagggttctc ttaaggagag 240 agggataaga gttaaaaaaa aggatttaat aaagtttttc tgttttgtgg acgagagatg 300 tccatggctg gtattaaatg gttctgaaat acacccttta acctggaata aagtaggcaa 360 agaaatcaat agtttaataa aacaagaaga tgtccctgag ccctttttta gctattgggg 420 aatcatcaga gacctccttg agcaggcaga aaagggcgga gaaggcgcta gcctcttagc 480 cctcactgaa gattttttag agaattcccg atcgccatct cgcaaaagcg aagctgagcc 540 atcggcgaat tatcagtctc ggccgtcatc gattatcgat gccccggcct ctaaggactc 600 cctgccttcc cctctttctc cccaaattac attagattct agaaaaaggg ctactgctca 660 accctctatt agaccaaaaa gaatttaccc cgttttgtat aaggagttat cagataaacc 720 cctcgatcct gcctcagagg cagaattgga tgaggaagcc tttaaatatg aacaagacca 780 ctatggtcct gggcccgatg cctatcttac catggagccc ttcaattacc acccccctcc 840 ttatttgccc ctcatccccc ctactgccca tcccatatca ctggcttctc cgtctgttcg 900 gcaattactt caaactaaaa aagaacttca gactcagatc accaatctta aagatgttct 960 agggcttcaa aaggagctac aagacctcag tctagagacc cagagtctac aaagagccct 1020 catggggaac cttcaaggtc ccatctcaaa aaatcctaaa actagccgca gaagtccctc 1080 tcagaaacaa gggaatgaaa aattgacctt ccctgtcctc acccgagctc gagcccgtca 1140 ggctgctcca ggccatgagg acccccccac tggtgtttcc aataaagatc agggagaacc 1200 agagagtgat gaggaggaag atcaagcaaa ctctgattct gatgagggtg gcatgtcccc 1260 cgatgaaggg ttagacacca acagccaacc agtctacaaa aggttaaaac tgagacatat 1320 aaaagatctt cattcagcag tcaaaaatta tggagttaat gccccgttca cagtatctat 1380 attagaaggt ttggcaggag aaggctattt aatacccaac gaatggaata aagttgttca 1440 gtctgtcctc actagaagcc aatatttaac ttggaagtca gaatttgtgg acagaggaga 1500 gaatttggct gcaaataata gaaagaaact tacttgtaag acagccttat ggactgcgga 1560 taaaatctgc ggtaaaggta atttcgctgc agacaaaaaa caattagggc tttcccctgg 1620 tgtcctcgcc cagacagctc aggcggccct 
gggagcttgg cgcgctgtcc ctgctacagg 1680 ggcactaatt atgcccctaa ccaagattat acaagggccc caagagtcct atgcgcagtt 1740 cgttgccaga ttgcaagaag cagctgagag aattttggga cctgaagaaa gtgagggtct 1800 gttagtccga caacttgctc ttgaaaacgc caattcggca tgtagggctg ccctgagagg 1860 taaaaccaag agtttagata taaatggtat gatcaagctt tgcaatgagg tagatgtgtt 1920 ttctcaacaa gtttctaagt ctattaacct ggccattggg gctggtctgc aactaagtaa 1980 aggacagaaa acttgctata gatgccacca acttggacat tttgctagag aatgccctac 2040 acaaagacag agcaccctag tctcaaccat gcctgttacc caacaaaaac tcccccccag 2100 tctttgccat aagtgcaaga aaggctgcca ttgggcacga gattgccgat ctaagactga 2160 tatcaatggc catccactat ctgcggtcca gggaaacggc caaagggccc ccctgagggg 2220 cccgtctccc ctaataacaa ctgcagaccc ctagccccca tgagagattg tgcagggcca 2280 cgacaggaaa tacagcattg gacctgtgtt cctccaccac caggattata actcctaaag 2340 agggaacagt ggtcattgaa acaggtgaat ttgggccccc tccccaaaag acgttttttc 2400 tcatcatcgg ccgagtgtcc agacttttac aaggcctaac ggtgactcct acagtggttg 2460 acaccgacta ccatggggag ataaaagtct tagtcaccgc cacgcagggg ccgcttaccc 2520 ttagggctgg agagcacata gcccgagccc taccactccc actattcggc cgtttccctt 2580 atatgaaaga agagcgagga tcatcctccc caggatcctc agaagtctac tgggcccaga 2640 aaataaccga ttcacggccc atgctgactt tgtttctaga aggcaaacag tttcaagggc 2700 tcctagatac cggggcagat gcaacagtaa tttccttaac acattggcct acagcctggc 2760 ccttacagcc cactgccact catttaaaag gcataggtca aacacaggac actttacaaa 2820 gctccaaact gctaacttgg tcagacaaag aaaacaatac cggaactgtt cgaccctttg 2880 tggtaaaagg ccttcctgtg aatctatggg gaagagacat actctcccaa atggggatga 2940 taatgtttag ctccaatgaa actgtcacca atttgatgct aaaaacaggg tacctcccag 3000 ggaagggtct aggaaaagat gaacaaggaa ggatttctcc cataatgccc acacccaaaa 3060 atgataaaaa aggtttaggg gcagaccttt tttcttagag accactgttc ctcctgcatt 3120 ccaggcagat aaaatatctt ggaaatccaa tgatcctgtc tggatcgatc agtggtctat 3180 gcctcaagag aaagttcagg cagccttaca gttggtgcag gaacaactga gactgtcaca 3240 tcttgagcct tctacctccc cgtggaacac accgatattt gtcataaaaa agaaaagtgg 3300 agcctggagg ctgttacagg accttcgagc tgtaaataag 
accatgatgc cgatgggggc 3360 acttcaacca ggcttaccat ctcctatcgc aattcccaga ggttattcta aaattgtcat 3420 agacattaaa gattgtttct tttctatccc acttcatccc caagattgtg tccgttttgc 3480 cttttccatt cctactgtaa atcacgtggg accaaatccc cactttcaat ggcgggtttt 3540 accccaggga atggctaata gccctacctt atgtcagaaa tatgtggcac agattattaa 3600 ccctttaaga caagaattcc ctgatgccta tatagttcac tatatggatg acctgctcgt 3660 tgccacaaaa gaattatcct ccacccatgt ggttgcccag gccttagtta gggccctcca 3720 aagatgggga tttgtcatag ctcctgataa agttcaagtt cagtaccctt tcatgtttct 3780 aggctttcag ttagaaccca ttagagtaca ttctcagaaa ttaaccatcc gcacctcaca 3840 gctgaggacc ttaaatgatt tccaaaagtt gctaggagat ataaattggt tacgtcccta 3900 tttaaaatta actactggag atctaaagcc cctttttgat acactccagg gagattcaga 3960 cccaaactcc ccgagaaagc tatctcctgc tgccctaggg gctctccata aggtagagtc 4020 agccattgat caacagacca tgggctatta taaccccctt caacccctgt ccttaatagt 4080 cttttccaca cctttttcaa ccactggcct attatggcaa gatgataatc cccttttctg 4140 gatccatcta ccagccacac ctacaaaggt tctccctgtt tatccttctc ttatatgtca 4200 gataattatc ctaggaatca agatggctac ccgcaatttt ggaaaagctc ctgatactgt 4260 tattttacct tatccttctg aacagctttc ttggttacag tcacaatttg atgagtggac 4320 tatattatta tcctcctttc agagaaactt tgatacacat ttaccagcca ataggttagt 4380 tcagtttttg caaactaccc cttttgtctt ccccaaggtt actcagctac aacctatatc 4440 caaagcctta actgtgttca ttgatgggtc cagtaatgga agggctgcta tcattgttga 4500 aggacaacgc catatcattg agacaaccca cacctcagcc cagctggtag agctccgagg 4560 agctctacaa gtgtttgaat cggtctcatc tccctttaac ttgtattcag atagccatta 4620 tgtagtaaga gctttgagag ctttagaggt ggtccctagt attcaaccca ccactgccac 4680 ttttcagatg tttcttaaaa tacaaatgct cataagaagt cgagcctacc cgttttttgt 4740 ggggcatatt cgagcccaca cgggacttcc tggacccctg tcccagggaa atgatcttgc 4800 tgatcaggcc actcggctta catgccttac tatcgatcct gacccactgt cacaggctca 4860 gagagcccat actttacatc atctcaatgc acagacactc agactgcgat tcaacataac 4920 cagagaacaa gccagacaaa tcgttaaaca gtgtaaaaat tgtcttaccc tgctgccaga 4980 accacatttg ggagttaacc cacggggcct tgtccctgga gaactctggc 
aaatggatgt 5040 aactcacatc ccatcttttg gaaagcttaa gtttgtacat gtgtccattg ataccttcag 5100 tgggtttttg tgtgcttctg cccacactgg agaagctacc aaagatgtta taaatcattc 5160 attgtatgct ttttctgtca tgggacaacc caaaattatc aaaacagaca acggccctgg 5220 atatagcagt cttaagttta aacagttttg tgcacaatta cagatcaagc acattactgg 5280 aattccctat aaccctcagg ggcagggaat tgtcgaaagg gcccatcaga ctttaaaaaa 5340 tgccctgact aagcttgggg ctcaagaaac tatctacacc cttaagggaa attcaaaaca 5400 gctactgtct catgcacttt ttgtgcttaa ctttttaact ctagacataa gtggtcgctc 5460 agcagctgac cgactttggc atcctaaaac cagtctagag tatgctcaag cactgtggaa 5520 agatcccctt acagggattt ggaatgggcc agaccccatc atcatctggg caaaaggctc 5580 agcctgcatc tataattcca aagaaggagg agccagatgg ctccctgaaa gactaattaa 5640 gccatttaat acaacccagg gtggcgcctg agaagatgtt tcatgttttc tctcgtagga 5700 ccaccatgta cagatctctc acaacattga tgatgctggt catgacggcg acagccaggc 5760 aagcccagac ctatcaggtt tttaactata cctgaattat tcaaaatcag gctggagaca 5820 tagtcaactc cagctccaag attggtacca agccccattg gcccgagtta gaagtagaca 5880 tctgtgtcct cgcattgggt gcagatgccg cctggggcac ccctgattat tatatgcctc 5940 aattaacagc agttaacaca ggtgataaaa agactgatcc aggttgcagc agtgatatac 6000 gtcatactgc cctggcctta cacactggag gtatatatgt ctgtccagga actcatcgag 6060 atagatctct aaattataaa tgtggctatg aaaatgaatt ttactgcgcc tcttggggct 6120 gtgaaaccac tggagatacc tattggactc catcctcctc ttgggattat atcacagtta 6180 aacgacttta ccccaactcc taagtcacct cccctagaaa acaacctctt aatagctatt 6240 gttcccccat ttcctcgaga caagggtggt gcaacccatt acaaattagc tttacaacag 6300 ccgcaaggac tgctgactgg accaagagag gattctcttg ggggttgcgc atatataaag 6360 agggcacaaa ttggggattg acattcaaga ttaaactaca aaaggaaatc cccaacaaac 6420 ataaagcatc tataggacct aacccccaat tacatcaccc taattcccca aacaatcccc 6480 gacttccagg tacccgccct actgttcctg ctccagggcg tactacaacc ctatttcagc 6540 ccactatccc tgcggggccc cactcttcca ctgatttact gtggtctata ttaaatgctt 6600 ctgctttagc cttgttagat aagacacaaa gagataatgc atcagaattt gaagattgtt 6660 ggatgtgctt ttctgctacc ccaccctttt atgagggcat agccttattt ggaaatttta 
6720 ccctgctttc tgatgctagg cagcttccct ttgagtcagt tcagcttact ctgacaaaac 6780 tctctgggat aggaagttgt gtgttgggcc cagacatgct tccgcctcga cccttgctag 6840 aaatttgcaa taacatgctt agagtatata aaaataggta ttcttatctt cttgccccaa 6900 atgatacctt tttggcatgt tccacaggcc ttacacggta cattattata caagatttta 6960 taaataatag agattattgt gtcttggttc agctttttcc aaattttagt attcataaga 7020 caggtgactt attaacgtcc tgggaccaag gtgcctcctt gctatcccac cacaaaagag 7080 agcctatctc ggcggtgagc cttgcggtcc ttcttggcct gggtgctgct gcgaccggga 7140 ctggaatcgc agcccttgtg tcatctcaac agaatgcgcg caattaccat ctgctcaatg 7200 aggctattag ccaagatata gaaaatataa aaaagggcct agatgatctt actgattcct 7260 tggtctccct ttcagaagtt gcccttcaaa ataaaagggg attagatttg ctctttctac 7320 aacaaggagg cttatgcgct gcacttaaag aagaatgctg cgtatatgtt gataaaactg 7380 gattagttaa agatagcatt gccaaggtta ctgccagttt agaaaaaaga aaaagagaga 7440 gagaacaaca agagccatgg tatcagaatt tgttttcaac ctccccctgg ctttcgacct 7500 tgttgttctc ccttttgggg cccctattgg gactcctatt attgatttcc tttggtcctt 7560 gggcatttca aaaattgacc cgatttgtta aatctcaaat tgattcatct ctctcaagtg 7620 catttgtttc agtccattac catcggctgg atgttggcga taacaagcag gttactggag 7680 aagagacaga cgtggatacc gcttcatcac cctcttctcg ggaagagaga ctcaatttcc 7740 ataaaatgct taagtaagat cattcccctc cccctaaatt cggccatcag tctcatgcca 7800 caaatcattt atagattgcc gtagtctgtg cttccgacat ggtaatatac gttctcgcca 7860 cattagaaac taaacagtca tccccggcta gacacatcaa aaaattaaaa cttaaagccg 7920 tcgcccgtga gagtggtaag actaagtact gcacagagat tagtctgaaa gctgttagac 7980 agtctctgag aggcatgtct gattgcataa aggttgagtg ccccagggac ctttccccag 8040 aaaaaacggc acgggagcag gtcagggtta ctctgggcaa aaatctgtgg gcctgagagt 8100 caatcctgta catggcccct aacattaaac actggggatc agacctctac ctctacccac 8160 ggagcttgct tcgttcctaa atcccttggc tagcccaagt tatacccaac agactgggac 8220 cattaattca agcctcatag atgacttgtc cttgtgtcct tcgtggtttt agagaatcat 8280 ccggtgaaca aacagacgtt taggtgatcg ttaaaaacgt ggctacaaat ttattataca 8340 atatgccttt ataaaaatat taaaacgggg gaga 8374 // ID L1MA10 repbase; DNA; ROD; 
1067 BP. XX AC . XX DT 20-FEB-1997 (Rel. 5, Created) DT 20-FEB-1997 (Rel. 5, Last updated, Version 1) XX DE 3'-end of L1 repeat (subfamily L1MA10) - a consensus sequence. XX KW Repetitive sequence; L1 (LINE) family; L1MA10 subfamily; L1MA10. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1067 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [1] (Consensus) XX CC Contains identical ORF2 region consensus (subfam L1M3) as L1MB3 CC ORF2 ends at bp 675. XX SQ Sequence 1067 BP; 402 A; 158 C; 192 G; 272 T; 43 other; ttaatatcca aaatatataa agaactcyta caactcaaca aaaaaaaaac aaacaaccca 60 attaaaaaat gggcaaarga cttgaataga catttctcca aagaagayat ayaaatggcc 120 aataagyaca tgaaaaratg ctcaacatca ytaatcatta gggaaatgca aatcaaaacc 180 acaatgagat aycacctyac acccattagg atggctatta tyaaaaaamc agaaaataac 240 aagtgttgry gaggatgtgg agaaattgga acccttrtrc attgctggtg ggaatgtaaa 300 atggtrcarc cactrtggaa aacagtatgg hrgttcctca aaaaattaaa aatagaatta 360 ccatatgacc cagcaatycc actyctrggt atatacccaa aagaattgaa atcatgttcy 420 yahaaagata yttgtacacc matgttcata gcagcattat tcayaatagc caaaaggtgg 480 aaacaaccca aatgtccatc aatgrwtgaa tggataaaca aaatgtggta tatacataca 540 atggaatatt attcagcctt aaaaaggaag gaaatyctga cayatgctac aacatggatg 600 aaccttgarg acattatgct aagtgaaata agccagtcac aaaaggacaa atactgcatg 660 attccactta tatgaggtat ctaaaatagt caaactcata gaarcagaga gtagaatggt 720 ggttgccagg ggctgggggr agggggaaat ggggagttgc tgttcaatgg gtataaagtt 780 tcagttatgc aagatgaata agttctagag atctgctgta caacattgtg cctatagtta 840 ataatactgt attgtacact taaaawttgt taaaagrgta gatctcatgt taagtgttct 900 tatcgcacaa caaaaaaatg gggaactttt ggrrgtgatg gatatgttca ttatcttgat 960 tgtggtgatg gtwtcacggg tgtntacata tgtcaaaact catcaaattg tacacwttaa 1020 atatatgcag tttttagtgt ataattatac ctcaacaaag ctgtttt 1067 // ID MLT1C repbase; DNA; ROD; 466 BP. XX AC . 
XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 18-APR-1997 (Rel. 6, Last updated, Version 2) XX DE Mammalian transposon-like element long terminal repeat (MLT1c DE subfamily) - a consensus. XX KW Repetitive sequence; MaLR family; MLT1c subfamily; STIR; MLT1C. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-466 RA Rouyer F., de la Chapelle A., Andersson M. and Weissenbach J.; RT "An interspersed repeated sequence specific for human RT subtelomeric regions."; RL The EMBO Journal 9(2), 505-514 (1990). XX RN [2] RP 1-466 RA Smit F.A.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21, 1863-1872 (1993). XX SQ Sequence 466 BP; 135 A; 99 C; 115 G; 104 T; 13 other; tgttatgggt tgaattgtgt ccccccaaaa ttgatatgtt gaagtcctaa cccctagtac 60 ctcagaatgt gaccttattt ggaaataggg tcwttgcaga tgtaattagt taagatgagg 120 tcatactgga gtagggtggg ccctaaatcc aatatgactg gtgtccttat aaraagagga 180 aatttggaca cagacacgca cacggggaga aggccatgtg aagacggagg cagagattgg 240 agtgatgcak ctacaagcca aggaacgcca argrytgcca gcaaaccacc agaagctagg 300 aagaggcaag gaacagattc tccctcacag ccytcagagg arrccagccc tgccgacacc 360 ttsatctcgg acttctggcc tccagaactg tgagagaata matttctgtt gtttaagcca 420 cscagtttgt ggtactttgt tacggcagcc cyaggaaact aataca 466 // ID MER80 repbase; DNA; ROD; 508 BP. XX AC . XX DT 18-APR-1997 (Rel. 6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 4) XX DE MER80 repetitive element - a consensus. XX KW MER80; DNA transposon fossil; MER1_type family. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-508 RA Smit F.A.; RT "MER80."; RL Direct Submission to Repbase Update (1996). XX DR [1] (Consensus) XX CC MER80 has 16 bp terminal inverted repeats similar to those of the CC MER1 family. 8 bp are duplicated with a bias for NTCTAGAN. CC Average divergence from consensus 23.5%. 
XX SQ Sequence 508 BP; 171 A; 84 C; 90 G; 162 T; 1 other; caggggttct taaccttttt tgtgccatgg gcccctttgg cagtctggtg aagcctatgg 60 accccttctc agaataatgt ttttaaatgc ataaaataaa atacatagga ttacaaagga 120 aaccaattat attgaaatac agttatcaaa atattaaaaa accaaatttg tgatatagta 180 atatatgtgc ttctttatta atgcattaaa taacaagatc tagcggcggg tctaataact 240 accgtaattt cgaagtagtg atgagcataa atgatatttt gagatatctg caacaactgt 300 aatgtgatat gaaaatatct gtgatttcta ttggtgacaa agtcacaggt actgctaata 360 ctactgtggt ttgttgccta cattcataat tgaaggaaat gctaaatttc agttagaggt 420 tagtgaaaat aaagatgtaa nttttttccc catccaagtt cacggacccc ctgaattcta 480 tccatagact ccaggttaag aacccctg 508 // ID RLTR26_MM repbase; DNA; ROD; 709 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 21-AUG-2008 (Rel. 7.09, Last updated, Version 2) XX DE Mouse putative long terminal repeat - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW Long terminal repeat; retrotransposon; RMER17A; RLTR26_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-709 RA Jurka J. and Drazkiewicz A.; RT "RLTR26_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 11-11 (2002). XX DR [1] (Consensus) XX CC 86% or less similar to RMER17A elements described in GenBank CC (AC079222, AC007937, AE000665). 70% identity to RMER17A CC (rodrep.ref) (bases 12-185). CC Similar to RMER17D_MM (71%, bases 1-209). 
XX SQ Sequence 709 BP; 171 A; 233 C; 173 G; 132 T; 0 other; tgtgggacgg tgggctatga taggcagcct ggtccctggt tgagctaagg cttaaaaccc 60 cggtgaccct gcaggggact cgcctgcaag ggacggtagg cattttgcca tgctcctggg 120 cacctggctc ctgtcacata gctacagccc cccacacccc cacccccgta gagaggtttg 180 tggccatcag tcacgtagga gcagcactcc aagccctccc acatgtagat aaggtatccc 240 caagctctca gaccaagcca ataggaagta cctgctgtca gaccctgacc caccccaaaa 300 ctgtatataa ggatcctcct atccagaagg aataaaggtg tgtgagaatt actccatcat 360 ctgagagctt ctgtcataag agctgtaaca ccaccgctag ggaagagatc tgctctcccc 420 ccccccaaga aaacgccacc agaagctccc ctgcactcct cactggctag ttagcctcct 480 tccggctcag cctcgcccga ctcagtgcag agcgacctag gtgcagcatc ttggagcaac 540 tgcaaccaaa gaaacagcag cagtggagac tgaggtggag gcagctgcag cagcagaggc 600 agtggaggca attccccctc cctgctcaag ttctctcccc tttccctgaa ccctcgcacc 660 tggcctggcc agagatctcc gtggaaagcc tccagtacac aggcccaca 709 // ID MARE2 repbase; DNA; ROD; 933 BP. XX AC . XX DT 28-JUN-2006 (Rel. 11.06, Created) DT 09-MAY-2008 (Rel. 11.06, Last updated, Version 3) XX DE Mammalian-wide repeat - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW MARE1; Tigger16b; MARE2. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-933 RA Jurka J.; RT "MARE2: An ancient mammalian repetitive element."; RL Repbase Reports 6(6), 345-345 (2006). XX RN [2] RP 1-933 RA Gentles AJ., Wakefield MJ., Kohany O., Gu W., Batzer MA., RA Pollock DD. and Jurka J.; RT "Evolutionary dynamics of transposable elements in the RT short-tailed opossum Monodelphis domestica."; RL Genome Res 17(7), 992-1004 (2007). XX RN [3] RP 1-933 RA Jurka J., Kapitonov V.V., Kohany O. and Jurka MV.; RT "Repetitive sequences in complex genomes: structure and RT evolution."; RL Annu Rev Genomics Hum Genet 8, 241-259 (2007). XX RN [4] RP 1-933 RA Smit A.F.A.; RT "Tigger16b."; RL Direct Submission to Repbase Update (09-MAY-2008). 
XX DR [4] (Consensus) XX CC This subfamily is less than one third as abundant as MARE1. CC Consensus expanded and named Tigger16b (ref. 4). XX SQ Sequence 933 BP; 219 A; 279 C; 158 G; 270 T; 7 other; cagtacaggc attccccgag ttacgaacac ccgacttgtg aacgtcccgt atatacgaac 60 ggccggttgc acactcgccc tttcccttct cgacccactg caggccgctc cgctccccgg 120 acagcaacca agctgtcctg ggccgacggc gccacctggt ggccggcgcg gggaactgtc 180 tgacccgacg tctcttcccc accgcctcct tttttcccgc ccgggganac gttcccgcgt 240 cccccntcct ctccttcttc actccgtccc gggcaattca tctctcataa gcnctcatat 300 ncctttacgc ctctttctcc ttccncttct ctcaaccttt ncctcaggaa aatctttctc 360 anctttcttc tccttttcct tccctctccc tcttttcgtt ccatttgtga gccaaaaata 420 cgaccctaat catggctgat aagcgtaaga gtagcgctag tgatacacct gtatcaagga 480 aaaggaaagc cgtaagtttt gaagtgaaat tagacgtaat aaagaaccag tgccatcaac 540 ctctaaagcc cttgaaagtg cccagaagtc cccttcagaa tctccacaaa agtctccatc 600 tacctcctca tcctcctcta attaaacctg cttttcctca agcaccagca ttcaagataa 660 ataaaaccaa tattcttatt caattttatt gtttttctgt ttcgtttaat tactagtatt 720 gcaacagtat tgttttctac tatttttcat tgcaaatgta cagtatttca gctacattat 780 gggttaaata ggcttttgtg ggctagcctg ggaacctaac caccatttat aacattgttt 840 ctatgggaaa atgcgttccg agttccaaac aaccgactta caaacgaact tttggaacac 900 aacccgtttg taagttgggg actgcctgta ctg 933 // ID LTR4_Cpo repbase; DNA; ROD; 993 BP. XX AC . XX DT 20-JUN-2009 (Rel. 14.07, Created) DT 20-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Long terminal repeat of ERV1 endogenous retrovirus: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR4_CPo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-993 RA Jurka J.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1545-1545 (2009). XX DR [1] (Consensus) XX CC ~94% identical to consensus. 
XX SQ Sequence 993 BP; 286 A; 236 C; 186 G; 285 T; 0 other; tgaagagacc aagaaagctc gttccatctg ctgcttcccc tccccttctc ccttcaaatc 60 caactaggtt aagttttaac ttcctaaagg ccataaaaca aaggcctgag gtaatagctc 120 ctgacatcct gggtcttccc tatccggaag ccccatcaca aattgtgtta attgcatcag 180 attgtattag attgtacatt cttcacaaaa atagcttcgt ttatatcata gtaatacctg 240 taacttggta ttgtatcaca gtaatacctg taactgtaaa tgtgtgtctt ccttttcctc 300 tgggtaaaag attgatatgt taatcaatgc tctcttctgt aaagaattgt aaaatcttac 360 ccaattgata tgctaaccat tgttgccctc tacaaataga atggttgtgg tgctaaagta 420 aggtaattaa ggttgttaat tgctgttaga cttttctatt caggagatat taggaacagc 480 tacccacccc cagctaagaa cagaactgag tcccatgacc cggttgactc agttctaact 540 cagctctgtg agaggcccaa aagaaaacat ctgcgaccgt ggtgttctcg ggttatgcta 600 tgtcaggaaa ctcagagata aagtctgctg actatgctaa tcaagattgg gaagtatgtc 660 agaaccaagc cacacaggag ctaggtgacc taaaaataag ctgtgcaaaa caaagacagt 720 taatcaactc tgacccactg gtcacccctc atgatctgac caatcacaaa tggacaaatc 780 gccaactgtt ccgctctaac tgctatgatt ggcttctgta acaaacgctt gctttgcggt 840 ttttcatcct taaaaacacc agccttgccc caactcaggg ttctccgttt ccacctgttt 900 ggaggacccc gtgtacacgg aataataaaa acccctttgt tttttacatg agagcgaggc 960 ccttggagtc ttccttgcga caccggtcta aca 993 // ID MamGypLTR3_LTR repbase; DNA; ROD; 839 BP. XX AC . XX DT 05-FEB-2007 (Rel. 12.01, Created) DT 10-APR-2007 (Rel. 12.01, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from mammals. XX KW Gypsy; LTR Retrotransposon; Transposable Element; MamGypLTR3_LTR. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-839 RA Smit A.F.; RT "MamGypLTR3_LTR - Gypsy LTR Retrotransposon from mammals."; RL Direct Submission to Repbase Update (05-FEB-2007). XX DR [1] (Consensus) XX CC 4 bp TSD; Pos 640-839 (end) 70-80% similar to the same region in CC MamGypLTR1a; 5' end undefined. 
XX SQ Sequence 839 BP; 187 A; 198 C; 269 G; 164 T; 21 other; cananccgcc ccatccnaga aggtgggngt ggncaccttg agatcacttg gggagtctcn 60 cttaggagga taaatcgcct tngagtcact gaaggtagcc ttcctcagan tgggaaatgg 120 caccttggag tcactgaggg tagcctccct tagaagggtt gggcgagtag ccccattgtc 180 cggaggaaag gggtgtaatn agaccctttg ngttttgagg gtttaaaagg aaaagctgcc 240 tgcaccctgg ngtggtccct ggggaggaaa ggngganggt ggttctgggt ccgcgaggtg 300 gcagatgccg gaaccagtcc tgacccagcg ctcctggggc cggctggcgc cctgggaggt 360 ggctgtgtat gtgaactgaa agagctgagc actaagagct gcaaccttgg gagcccaagc 420 gtggggcacc cttggccgag cttagcactg agggagtggg atcatcctcc ctcaaagaac 480 cacngcggcc tgtgcgggga tctggaccag caagggcatc accgcagcag nggaccctgg 540 aatctgcagg accagtcttc aacagcgaca ccatgtggca gcgagaagca atggcagtag 600 tggactgatc agactccagt tctctccccc ttgngtntgg aagcnggact gaccccccct 660 ttggactgng taagccccta gggttcttgg acaattcagg gggnagggga agcctcaaga 720 gggagatact ggacttcctg ctaataaggc aggtgggngc tcgagcgatt aattggaaaa 780 taaaagagat gtgaccatat ttgtaccccg agtttgtgga gcagttcata ccggttaca 839 // ID RAL repbase; DNA; ROD; 732 BP. XX AC X04991; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 28-SEP-1995 (Rel. 1.08, Last updated, Version 1) XX DE Rat RAL repetitive element. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW LTR-like sequence; RAL element; RAL. XX OS Rattus norvegicus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Rattus. XX RN [1] RP 1-732 RA Suzuki N., Fujiyoshi T., Maehara Y., Takahashi K., Yamamoto M. RA and Endo H.; RT "A new family of LTR-like sequences abundantly expressed in rat RT tumors."; RL Nucleic Acids Res 14(23), 9271-9289 (1986). XX DR GenBank; X04991; Positions 297 1028. 
XX SQ Sequence 732 BP; 211 A; 199 C; 131 G; 191 T; 0 other; tgaaaggaaa attatacgaa tttaagttta aaaatataaa attaaaagag taagccccaa 60 aagccacaaa ctgagggcta agctgccgcc aggaatagca ggccataaag ataaagaaaa 120 ggaatacaaa aaccacctca ggctatcaag gactgaccca taaaccatcc aaagacattc 180 ccccagctta ctcagagtca tattttaacc agatgtcctc cagaccctga taagccccta 240 cttgtgcttt ccagccattg tgtcctgcag agagcattcc aggaaaccgg gtagcccaag 300 cctaaccaga gtgtttcaaa tacagatctt catcaattct gacaaacctt taaaaataat 360 gaagacctga agaccctacc cttctcctga tttagtagct ttgttccagg caatcagggg 420 tccataatac cttctgtcat ccccttatct cctggagcca taaaacaatc cttgtaactt 480 gtggtgcctt cccctttgac atcccccatc ccctggctac acagcctctg cctttaaata 540 ctctttctcc cagcctctct gggtcagaag agcctctgtc tcctgcttga gaaacgtgtc 600 agcgcgcaga tctctgtaat aggtctccgt aataaacctc gcctttgctt attacatcca 660 aaatggtctc tctgtgtctg gggtctgcga tttcccaaga cttgagtaag ggtctctctc 720 tggggtcttt ca 732 // ID MARE2 repbase; DNA; ROD; 355 BP. XX AC . XX DT 28-JUN-2006 (Rel. 11.06, Created) DT 01-AUG-2007 (Rel. 11.06, Last updated, Version 2) XX DE Mammalian-wide repeat - consensus. XX KW MARE1; MARE2. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-355 RA Jurka J.; RT "MARE2: An ancient mammalian repetitive element."; RL Repbase Reports 6(6), 345-345 (2006). XX RN [2] RP 1-355 RA Gentles AJ., Wakefield MJ., Kohany O., Gu W., Batzer MA., RA Pollock DD. and Jurka J.; RT "Evolutionary dynamics of transposable elements in the RT short-tailed opossum Monodelphis domestica."; RL Genome Res 17(7), 992-1004 (2007). XX RN [3] RP 1-355 RA Jurka J., Kapitonov V.V., Kohany O. and Jurka MV.; RT "Repetitive sequences in complex genomes: structure and RT evolution."; RL Annu Rev Genomics Hum Genet 8, 241-259 (2007). XX DR [1] (Consensus) XX CC This subfamily is less than one third as abundant as MARE1. For CC other comments see MARE1. 
XX SQ Sequence 355 BP; 131 A; 42 C; 72 G; 107 T; 3 other; ttccaaaagt tcatttgtta aagttagttg tttggaactc agaatacatt ttcccataga 60 aacaatgtta taaatggtgg ttaggttccc agggctagcc cacaaaagcc tatttaaccc 120 ataatgtagc tgaaatactg tacatttgca atgaaaatag tagaaaayaa tactgttgca 180 atactagtaa ttaaacaaaa cagaaaaaca ataaaattga ataagaaata attttttatt 240 tatcttgaat gctggtgctt gaggaaaagc aggtttaatt agaagaggag gaggtatatg 300 aaaagttttg tggagattct gaaggggact tctgggyayt ttcaagggct ttaga 355 // ID RLTR15C_MM repbase; DNA; ROD; 1424 BP. XX AC . XX DT 23-AUG-2008 (Rel. 13.08, Created) DT 23-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Long terminal repeat of endogenous retrovirus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; RLTR15C_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-1424 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats of endogenous retroviruses from mouse."; RL Repbase Reports 8(8), 878-878 (2008). 
XX DR [1] (Consensus) XX SQ Sequence 1424 BP; 469 A; 244 C; 359 G; 350 T; 2 other; tgccgggtcc ggcatggctg cgtaagggga ggtgtgacac agttcctgcc cctggagcat 60 cgccaggccg ttgatcccca cgatccccac gtgtgctgat ggttgctgtt gccctgaaac 120 tgatctgctg tctcttctat tttactttca atctctcttt ctggggaata tacttccccc 180 attactcaga gctaatcact ccataataaa tgaaacagat gcggttctca ggaatgagga 240 atggaggtgg tcatgaggct gtggaggacg aggaacaggc cttccgcttg gcctttaggg 300 ccatgggaaa ggatctagac aacataacca cagaggccat aaaagcctac atggtccttt 360 gtagaaatag ttcattgcca gacttggcct tgaatttgga attatgagga gatacatgta 420 ccctccggtc caatgactgc tgcattaatt agccagggtg aacttgggga catcaaggat 480 agagagttaa agtggtctca agaagaaaag gcatggatag gaaaagaagg aatctataaa 540 aaaaaaaaac aagcaaatga tagatgggtt tattttatag gagaaaggct tgtatataga 600 tctagaaaca aatagtatta gaaggagagt atacgaggcc atggaattgc taggatgagg 660 aaacaaagat tagagaatta aaagatgatg tataagcata taaaaagtta gtacttcgtt 720 ctcctagcta cttgaaggat tggagatgac aatgcctaga gagatattag agagagctag 780 ctgctttact tgcataaatt ttaaaacttt gtaagaaatc attttaacct tagtagtctg 840 tcacctttct gttcttacat gaaaatagtt actaaagact gctgtctgct gtaccaggaa 900 accgcaagag ctgtgcccag ctacccgtga agaacaggct gtgtaaaaac tataaggaaa 960 tgtgcttttg ctttgtgttt agtttctgtt ttaaggaagc aggggttgga actaatgatc 1020 cgggattcat agaatttatg taaatttaag gatttctaaa attaatatat taacaatgat 1080 caaagccttg atgatgtagc tttagtaaat aaaagactgg gttcaggctc tctggtcggg 1140 agaagaagaa ggcaaaagaa tgagaagaaa gaaggaaaaa gaatgaggag aagagagaaa 1200 rgcaaaagaa ggagaagggg aaggctaaag aatgagagga gagaaggaaa gaatgagaag 1260 gaagaaagga gargaaggaa gaatgatgag aatcctcggg caaagagaga gcacttgaac 1320 tgacagcttg catccactac tgactccgag tcattattga atcagcgcca ctcccattcc 1380 cgcctttctc aggaccctcc tcacgctgag gctggacctc ggca 1424 // ID MLT1F repbase; DNA; ROD; 541 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 23-JAN-1998 (Rel. 6.4, Last updated, Version 3) XX DE Mammalian transposon-like element long terminal repeat (MLT1f DE subfamily) - a consensus. 
XX KW Non-LTR retrotransposon; MaLR family; MLT1f subfamily; MLT1F. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-541 RA Smit F.A.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21(8), 1863-1872 (1993). XX DR [1] (Consensus) XX SQ Sequence 541 BP; 135 A; 137 C; 123 G; 126 T; 20 other; tgtggtagcc agcctccaag atggccccca atgatccctg ctcctggtat tcacaycctt 60 gtatggtcyc ytcctacatt garyyagggc trgtctgtgt gaccaataga atagggcaga 120 agtgatggcg tgtsacttcc aagaytargt cayaaawaac actgtggytt ctgcyttgnt 180 ctcttcgggc tactcactct gggggaagcc agctgccatg ctatgaagac actcaagcag 240 cctatggaga agtccacgtg gsaaggaact gaggtctcct gccaacagcc agcttcgacy 300 tgccagccat gtgagtgagc catcttggaa gcggatcctc cagccccagt yaagccttca 360 gatgactgca gccccggctg acatcttgac tgcaacctca tgagagaccc tgagccagaa 420 ctacccagct aagctgctcc tarattcctg acccacagaa actgtgagat aataaatgtt 480 trttgtttta agccactaag ttttggggta atttgttacg cagcaataga taactaatac 540 a 541 // ID L1P2_5 repbase; DNA; ROD; 990 BP. XX AC . XX DT 26-MAY-1998 (Rel. 6.5, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE 5' part of L1 subfamily - a consensus sequence. XX KW LINE; MER60; L1 subfamily; L1P2_5. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-990 RA Kapitonov V.V. and Jurka J.; RT "L1P2_5."; RL Direct Submission to Repbase Update (MAY-1998). XX DR [1] (Consensus) XX CC L1P2_5 is the 5' part of L1 subfamily. CC Individual copies are 82% identical to the consensus sequence. CC Portion 544-666 is 74% identical to the portion of MER60 CC (219-337); CC Portion 685-986 is 80% identical to the portion of L1MB6_5 CC (954-1241). 
XX SQ Sequence 990 BP; 329 A; 224 C; 220 G; 216 T; 1 other; ataaaaaaat taatgagaag aggattctgg gaagatggca gagtaggaag caccaggaat 60 ctgtctcccc acctagacaa caattgcact ggcagaatct gtctgatgta actattttgg 120 aactctggag tctattgaag gcttgcaact tccaggggaa ggcttggatg gtaaattgtg 180 gttaatttcg gtcaatttca gctcttagca cagtagcagc tacccatccc ccacccccag 240 ccccgtggca ggcagctgtg cacgtgttcc tggagcagct tgcacacagc ttgcgggagc 300 cagggtgggc aaaaaggatc ctgtcctcca aatatcgggg atctgtgctc tgatcgctga 360 ttgctgcttc tgatcacaga ggtgcagaca aagaggtggc ggccattgtt gtcgcacctc 420 ccyccattgt tgcaagcccc tccccctctg gctgaagtga cttccagggg atttaaaggg 480 ctagtaccct ttctccgctt tatttttctt ttttcccctt ttgggagcca gacattaaag 540 actaggacat tcaaaagcaa ctgcatatac ggggaaaatt agaaagtcac cgtgcatgcc 600 cagggaaagg cacaggctca gaaaagacct gagaagacct taagtttaca cctcaggctg 660 atccttggca cagagacagc ctacaacaat aaaaacaaaa caaaaaaata acaaaaacac 720 agcaaaccct ggggaagagg gagaatctga tttccagagt taccacatta ttagattcaa 780 atgtccagtt ttcaacaaca acaacaaaaa atcacaaggc atacaaagaa acaggaaagt 840 atggcccatt caaaggaaaa aaaataaatc aacagaaact gtccctgaaa aagacctgat 900 ggcagatcta ctagacaaag actttaaaac aactgtctta aagatgctca aagaactaaa 960 ggaaaacgtg gagaaagtca agaaaatgat 990 // ID RMER17C repbase; DNA; ROD; 414 BP. XX AC . XX DT 26-SEP-1997 (Rel. 2.08, Created) DT 26-SEP-1997 (Rel. 2.08, Last updated, Version 1) XX DE Long terminal repeat of retrovirus-like element. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW putative long terminal repeat; RMER17C. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-414 RA Smit A.F.; RT "RMER17C."; RL Direct Submission to Repbase Update (30-NOV-1996). XX DR [1] (Consensus) XX CC Probably a retrovirus-like long terminal repeat. 5/6 bp CC duplications. 
XX SQ Sequence 414 BP; 63 A; 153 C; 65 G; 115 T; 18 other; tgttagcatt ctgtctaggc tctgccccac agttacctgg caacagccag gtgtgcctga 60 ctcactataa aaggggctgt ttggcccctc ctcgctctct tgctctcgtc tctcttcctc 120 tcctctctag ccctccttcc ctctctcccc ccccctctct ctccacgtgc tcacgggcgg 180 cctcttctct ctctctctga cactctgtct ctctctgcct ttctayyyyy yyyyyyyyac 240 tcccctcccc atgccctgaa taaactctat tctatactat actgtcgtgt ggtggctggt 300 acctcagggg gaagggatgc ctcagcatgg gcccgcmgag gcacccctcc cmccwcaccw 360 taccacgcct ccacmaaaca tatccttctc tctttatttt tataaaacac aaca 414 // ID LINE3_RN repbase; DNA; ROD; 600 BP. XX AC M13922; XX DT 28-SEP-1995 (Rel. 1.2, Created) DT 25-APR-1997 (Rel. 3, Last updated, Version 3) XX DE Rat repetitive sequence homologous to 3' end of LINE repetitive DE element. XX KW LINE repetitive sequence; long interspersed element; RNLINE3; KW LINE3_RN. XX OS Rattus norvegicus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Rattus. XX RN [1] RP 1-600 RA Shore K.S., Bacheler T.L., de Riel K.J., Barrows R.L. RA and Lynch J.M.; RT "Cloning and characterization of a rat-specific repetitive DNA RT sequence."; RL Gene 45, 87-93 (1986). XX DR GenBank; M13922; Positions 297 896. 
XX SQ Sequence 600 BP; 215 A; 123 C; 170 G; 92 T; 0 other; gggaatagag aggcaaagat taaaacagac acagaaggaa cacccattca gagcctgccc 60 cacatgtggc ccatacacat acagtcatcc aattagacga gatggatgaa gcaaagaagg 120 gcaggccgac aggagccgga tatagatcgc tcctgagaga cacagccaga atacagcaaa 180 tacagaggcg aatgccagca gcaaaccact gaactgagaa taggatcccc gttgaaggaa 240 tcagagaaag aactggaaga gcttgaaggg gctcgagacc ccatatgtac aacaatgcca 300 agcaaccaga gcttccaggg actaagcaac tacctaaaga ctatacatgg actgaccatt 360 gactctgacc ccataggtag caatgaatat cctaataaga tcaccagtag aaggggaagc 420 cctgggtcct gctaagactg aacccccagt gaacaagatt gttgggggga gagcggcaat 480 ggggggagga tggggagggg aggggaaggg ggagggatta gggggatgtt tgcccggaaa 540 ccgggaaagg gaataacact cgaaatgtat ataagaaata ctcaagttaa ttaaaaataa 600 // ID LTR6H_Cpo repbase; DNA; ROD; 394 BP. XX AC . XX DT 19-SEP-2009 (Rel. 14.1, Created) DT 19-SEP-2009 (Rel. 14.1, Last updated, Version 3) XX DE Long terminal repeat of ERV3 endogenous retrovirus: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR6H_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-394 RA Jurka J.; RT "Endogenous retroviruses from guinea pig."; RL Repbase Reports 9(10), 2156-2156 (2009). XX DR [1] (Consensus) XX CC ~78% identical to consensus. XX SQ Sequence 394 BP; 79 A; 93 C; 104 G; 116 T; 2 other; tgttatggtt tatatctaga tgtcccccca aagcctcatg cgctcatagg tggggctttt 60 gggaggtgac tggatcacgg gtgtgtgata ctcatcagtg gattagtcca ctgatgagtt 120 tatagctaaa tgtgatgttg ggaggtgaag cctgggagga ggtgggtcac tgggggcgtg 180 gcctggaagg gtctatctcc cttcttctcc ctcgatctcc ttctgcttct tgccgtcatt 240 tctattccat ntctcctctg ccatgccgcc ctgccttgga gccagccgac tatggactga 300 aacctctaca aactgtgagc caaaataaac ctttcctcct ttaanttgtg ggtgtcgggt 360 attgtgtctc agcgacgaga aagtaactaa gaca 394 // ID GOLEM repbase; DNA; ROD; 3029 BP. XX AC . XX DT 13-JAN-1998 (Rel. 
6.4, Created) DT 28-JUN-2000 (Rel. 7.2, Last updated, Version 5) XX DE Autonomous DNA transposon; POGO superfamily. XX KW DNA transposon; MER7; 35S; MER17; MER29; MER7B; GOLEM; TIGGER3. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 3029-2724 RA Drinkwater D.R., Burgoyne A.L. and Skinner D.J.; RT "Two human repetitive DNA elements: A new interspersed repeat RT found in the factor IX gene, and a satellite 11 tandem repeat RT sequence."; RL Nucleic Acids Res 14, 9541-9541 (1986). XX RN [2] RP 3029-2724 RA Kaplan J.D. and Duncan H.C.; RT "Novel short interspersed repeat in human DNA."; RL Nucleic Acids Res 18, 192-192 (1990). XX RN [3] RP 2713-2608 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [4] RP 2327-2128 RA Skalnik G.D., Strauss C.E. and Orkin H.S.; RT "CCAAT displacement protein as a repressor of the myelomonocytic- RT specific gp91-phox gene promoter."; RL J. Biol. Chem 266, 16736-16744 (1991). XX RN [5] RP 2787-2607 RA Rubin M.C., Leeflang P.E., Rinehart P.F. and Schmid W.C.; RT "Paucity of novel short interspersed repetitive elements (SINE) RT families in human DNA and isolation of a novel MER repeat."; RL Genomics 18, 322-328 (1993). XX RN [6] RP 2327-2128 RA Jurka J., Kaplan J.D., Duncan H.C., Walichiewicz J., RA Milosavljevic A., Murali G. and Solus F.J.; RT "Identification and characterization of new human medium RT reiteration frequency repeats."; RL Nucleic Acids Res 21, 1273-1279 (1993). XX RN [7] RP 3029-2128 RA Smit F.A. and Riggs D.A.; RT "Tiggers and other DNA transposon fossils in the human genome."; RL Proc. Natl. Acad. Sci. USA 93, 1443-1448 (1996). XX RN [8] RP 1-3029 RA Kapitonov V.V. and Jurka J.; RT "Jerky gene - a recruited transposon?."; RL Direct Submission to Repbase Update. 
XX RN [9] RP 1-3029 RA Smit F.A.; RT "GOLEM."; RL Direct Submission to Repbase Update (JAN-1998). XX CC 23 bp terminal inverted repeats and TA target site [7]. CC GOLEM's non-autonomous elements are GOLEM_A (MER7A) and GOLEM_B CC (MER7B) repeats. CC Orientation of the repeat has been determined based on the CC reconstruction of its internal sequence encoding transposase [8]. CC The ORF from pos 442-2307 encodes a protein 39% identical (57% CC similar) over the full-length to the Tigger1 product. XX SQ Sequence 3029 BP; 968 A; 580 C; 659 G; 820 T; 2 other; cagtcatgcg ccgcataacg acgtttcggt caacgacgga ccgcatatac gacggtggtc 60 ccataagatt ataatggagc tgaaaaattc ctatcgccta gtgacgtcgt agctgtcgta 120 acgtcatagc gcaacgcatt actcacgtgt ttgtggtgat gctggtgtaa acaaacctac 180 tgcgctgcca gtcgtataaa agtatagcac atacaattat gtacagtaca taatacttga 240 taatgataat aaatgactat gttactggtt tatgtattta ctatactata ctttttatcg 300 ttattttaga gtgtactcct tctacttatt aaaaaaaagt ttactgtaaa acagtatgcc 360 gtgttacacc ggcagcagcc tcatacatct cgtgtttacc gcgtctcttg attgcatcat 420 tttctcttgt gcttgattta atctcgtgtt gttttgttca tcatggcccc taagcgtaca 480 aaatccacgg ctaatgttgc cagtaagagg ccacatcgag tgactgacct ggaaacgaaa 540 ttaaaagtga ttaaggacta cgaaggtgga aaatcagtga tggttattgc tcgccagtca 600 ggcatgtccc attccaccat agctacgatc ttgaagaaca agaacaaagt gacagaagct 660 gttaaaggat ctgcttcatt gaaggcaacg agactaacaa aaattcgaga agggcctata 720 tcagatatgg agaaacttct aatgacctgg attgaagacc agacacagaa gcatatccct 780 ctcagcacca tgacgatcac ggccaaagca aaaagtttgt ttgcgatgtt gaaagaaaag 840 gctggacccg actacgatgt tgaatttact gctagctctg ggtggtttaa acgattcaag 900 aatcgttatt cattacataa tgtgaaagtg agtggtgagt ctgcgagtgc tgatgtgaag 960 gcagctgaag aatttttgga aactctagat aagctgattg tggaggaaaa ttacttgcca 1020 gagcaaatct tcaatatgga tgaaacctcc ctattctgga aacggatgcc tgaaaggact 1080 ttcatccata aggaggccaa gtcaatgcca ggtttcaagg cttttaagga caggataaca 1140 gtcttgcttg ggggcaatgt tgcaggctac aaattgaaac cctttgtgat ctggcacagt 1200 gagaacccca gggccttcaa gcatatcaat 
aagcacacac tgccagtgta ctacaggagc 1260 aataagaagt catggatgac ccagctcctc ttccaagatg ccctcctgaa ttgctatgcc 1320 agcgaaatgg agaagtactg tttggagaat aacatacctt tcaagatttt gcttattgtt 1380 gataatgctc ccgcacatcc tccttttatt ggtgatcttc atcccaatat caaagtggtg 1440 tttctccctc caaacaccac ctctttgatc caaccaatgg atcaaggagt tatagcagct 1500 tttaaggcct actacctgag gaggaccttt gcccaggcta ttgctgcaac tgaggaagac 1560 actgagaaga cactgatgca attctggaag gattacaaca tctatgactg catcaagaac 1620 cttgcttggg cttggggtga tgtcaccaag gagtgtatga atggcatctg gaagaagaca 1680 ctcaagaggt tcgtccgtga cttcaaagga tttgccaagg atgaggaggt tgcaaaaatc 1740 aacaaggctg tggttgagat ggcaaacaac tttaacctgg gtgtggatga ggatgacatt 1800 gaggagctcc tagaggtggt tcctgaggaa ttgactaatg aggagttgtt ggaactggaa 1860 caggaacgca tagctgaaga agaggcaaga gaaaaggaaa ctgcaggaga agaaaaagaa 1920 gaacccccaa gaaaattcac agtgaagggt ttagcagaag cttttgcaga cctcaacaag 1980 ctccttaaaa agtttgaaaa catggacccc aacaccgaaa ggttttcatt aatagagagg 2040 aatgttcatg gtgcattatc tgcttacaag caaatctatg atgaaaaaaa gaaacaaacc 2100 aagcaaacca ccatggacat atttctgaaa agagtgacac ctcctcaaga agagcctcag 2160 gcaggtcctt caggaggtat tccagaagaa ggcattgtta tcataggaga tgacagctcc 2220 atgcgtgtta ttgcccctga agaccttcca gtgggacaag atgtggaggt ggaagacagt 2280 gatattgatg atcctgaccc tgtgtaggcc taggctaatg tgtgtgtttg tgtcttagtt 2340 tttaacaaaa aagtttaaaa agtaaaaaaa aaataawttt aaaaatagaa aaaagcttat 2400 agaataagga tataaagaaa gaaaatattt ttgtacagct gtacaatgtg tttgtgtttt 2460 aagctaagtg ttattacaaa agagtcaaaa agttwaaaaa attaaaaagt ttataaagta 2520 aaaaagttac agtaagctaa ggttaattta ttattgaaga aagaaaaata ttttaaataa 2580 atttagtgta gcctaagtgt acagtgttta taaagtctac agtagtgtac agtaatgtcc 2640 taggccttca cattcactca ccactcactc actgactcac ccagagcaac ttccagtcct 2700 gcaagctcca ttcatggtaa gtgccctata caggtgtacc attttttatc ttttataccg 2760 tatttttact gtaccttttc tatgtttaga tatgtttaga tacacaaata cttaccattg 2820 tgttacaatt gcctacagta ttcagtacag taacatgctg tacaggtttg tagcctagga 2880 gcaataggct ataccatata gcctaggtgt gtagtaggct 
ataccatcta ggtttgtgta 2940 agtacactct atgatgttcg cacaacgacg aaatcgccta acgacgcatt tctcagaacg 3000 tatccccgtc gttaagcgac gcatgactg 3029 // ID RLTR19B_MM repbase; DNA; ROD; 718 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse subfamily of LTR retrotransposons - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR19B_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-718 RA Pavlicek A. and Jurka J.; RT "RLTR19B_MM - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. RLTR19 subfamily. RLTR19 subfamily (86% CC identity). CC Individual copies are ~92% identical to the consensus. 6 bp TSDs. 
XX SQ Sequence 718 BP; 184 A; 180 C; 131 G; 223 T; 0 other; tgttatggtc tgaccccccc cccccagaat agctgtagcc attttgttcc atgcctgcca 60 gctatttcat attgttgctg taacatgcct gccagtcatt gacacagaga agtgacttgg 120 ctcaaggaca tgctgaccac ataccctgtt ttgttctgta tgttctgaat gttctgtatg 180 aggtttgcta atcttaagaa attccacaaa gctttacgta ggacccatca aatcaaaggt 240 caatatgaac tgttatgtct aaaatatctt gagtcagagc tgaccaccag gcagcccttc 300 ctgcacctat gtatgagctc actgtggttt ttgtggctga cactgaagaa tgctactaat 360 aagccaaaaa gttatgatta taatactcat gctctgttat ctcaatgttc tgaaattccc 420 ctgttcacca cccatccacc accaccctta ccttactcag gaccaatcag cttaaaggtt 480 agctgataat actttgccta gtaagccaac tgctcccctc cttgcccttt aaacttttga 540 acctggtttt tcctataaaa agcctaccct gagagcagac tagtaccaca attaggcttc 600 cgaagtcctt tttgcggtcc tggacgtcca gtattatggt gtgcgttcaa taaactattc 660 ttgcttaact gagatcagtg tttgtatggt ttgagtggcg atttcctgaa ccccaaca 718 // ID LTR6_Cpo repbase; DNA; ROD; 345 BP. XX AC . XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 31-JUL-2009 (Rel. 14.07, Last updated, Version 2) XX DE Long terminal repeat of ERV3 endogenous retrovirus: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR6_CPo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-345 RA Jurka J. and Baney O.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1547-1547 (2009). XX DR [1] (Consensus) XX CC ~93% identical to consensus. 
XX SQ Sequence 345 BP; 78 A; 79 C; 86 G; 101 T; 1 other; tgtcatggtt tatgtcggtg gcccccaggc ctcatgcatt tgcattgtgc atttgtgatt 60 ggttcattgt ctagygcttg gattgatggt gtcagtgcta cacccacata ggggtggagc 120 caggatgtaa tgtaatggca ggaagaaggt gtgtctctct ctcttgctgg tttctgcctt 180 gctgtttgca gccgccatga actgtggccc cgccatgcca ccctgccttg gagccaactg 240 agtatggact gaaacctcca aaaactgtaa gaaataaacc tttcctttcc caactttggg 300 catcaggtat tttgtctcag caatgagtaa aaagtaacta agaca 345 // ID MMERGLN_I repbase; DNA; ROD; 7556 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Internal portion of ERV1 Endogenous Retrovirus from mouse. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; MMERGLN_I. XX OS Mus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-7556 RA Smit A.F.; RT "MMERGLN_I - ERV1 Endogenous Retrovirus from mouse."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC (has RLTR1 LTRs). 
XX SQ Sequence 7556 BP; 2098 A; 1833 C; 1919 G; 1706 T; 0 other; tttggaggcc ccagcgagat ctgcgtgaca cccaggaacc ccgaaggacc ccttggaggt 60 gcgtttgttt gtgtgagtct tgttatgttg tctgttgtct aagtgtctaa gtgtggcact 120 gctgaatttg tgtcttagtt tttcagttct gagattgtgg gtttgagccc cacctgtgtt 180 accagttctg gtattctgta ttctggcagc tgccactgcg ttccgtaagg accctagtgg 240 ctgtgggaag acgacgatct atttccccac aggctgcacc cttggaagac attccgaggg 300 agaccctgga gtgcccgggg tacggaacag tcaggaggac ctggctgttg cctggcagag 360 tgaagaagag tgagtgctct tcctgccaga ggagtggagc ggaatcccac tccatcagag 420 gtagcgtttg gctggttgtg taagtccaga cgcagatgag tgtgcttgga tgtcttagta 480 ttttccgtct ctgtcattgt gttgtgttta ctcttattct tcactatggg acagaccgtg 540 tctactcctt tatctttgac taaggaccat tggacggaca ttagggctag aggacaaaat 600 ttgtcagtaa aagtgaagaa aaagccatgg atgactttct gttcctcaga atggcctgtt 660 tttggagtag gttggccagc agaaggaact ttttacttac ccaccataag ggctgtgaag 720 gccattgttt ttcaggaagg gccagggtcg catccagacc aacaaccgta cttcatggta 780 tgggaggact tggcacgcta cccacccccg tgggttcacc cattcctccc gcctctccac 840 cctggcacca agattctagc catccgagaa aatggtgaga aagagaaacc aaaaccaccg 900 ctcgggagag atgatgatca cagcacacca gtgatgaaac cccccaagat ctatccagag 960 attgaaaccc cctgagtggc ccaacacccc tcaaccccca ccgtatgctc cccagcccca 1020 accttcagct ccctcaggac ccctgcctca ggccccggcc ggaggagggg gtccctccac 1080 aggaacaagg agccggcgag gagtcacccc tgaggggcct gcggattcaa ccgtggcgct 1140 ccccctcagg gctattgggg ctccccctgc cgatccaaat agtctacagc ccctacagta 1200 ttggcctttt tcctcttctg acctctataa ctggaaagct aatcaccccc cttttttcag 1260 aaaaccctgc aggactcact gggttggttg aatcattaat gtattcacac cagccgacct 1320 gggatgactg ccagcagctt ctgcagactc tattcacaac tgaggagaga gagaggattc 1380 tcctcgaggc tcggaaaaat gtccaagacg aggctgggcg ccctgtccaa actccagctg 1440 agatagatga aggatttccg ctaacccggc cccgatggga ttataatacg gcatcaggta 1500 gggaacgact gtccaattat cgccgggtcc tagtggcggg tctcagaggt gctgcccggc 1560 agcccacgaa tctggccaag gtaagagagg ttatgcaggg agcgactgag cccccctcag 1620 tcttccttga aaggctcatg gaggcttata 
ggagatatac cccattcgac cccacgtctg 1680 agggtcaaag ggcctcagta attatggcct tcattggcca gtcggctcct gacattagga 1740 agaagttaca gcgaattgag ggcttgcagg attacaccat aagggatgta gttagagagg 1800 cagagaaagt gtatcatagg agagaaacag aagatgaaaa gttagagaga gagaaaagag 1860 agaaaagaga agatgaggat aggagagaca ggaggcaaga aaaggttttg actaggatcc 1920 tggctgcagt aggagaaaga gataatggaa gaagaggtag acagtcaggg aacctgggag 1980 acaaaagaca gcagggacca aggagaccca gagaaggcgg gcagcgcctg gagaggaacc 2040 aatgtgcata ttgcaaggaa atgggccact ggaagagcaa ctgtccggaa aaaaaaacaa 2100 gaggtaaagg tgctttctct tggagaagat gaagactagg gggaacgggg cttgacccac 2160 ctccccgagc ctagggtaac tttagaagtg gaggggtccc ctgtggactt tctagttgac 2220 acgggagccg aattttcagt actcaaaaca cctctaggaa aagtgaagaa aaatgaaaaa 2280 accttggtga tcggggccac gggacaaaaa tcgtatccat ggaccacatc ccgagtagta 2340 gacatagggc gaaatcgagt aactcattcg tttctagtca ttccagagtg tcctattcct 2400 ttattgggga gagacttact aaccaagtta aaagcacaaa taactttcac ctctcatcga 2460 ccggaggttt tctggggaat aaaagcgccc cagactctag agctgtcttt acaactaggg 2520 gaggaatatc gactttacca aaataaagta aagccccctg agggattaca ggactggttg 2580 aatcgatacc ctcaggcatg ggcagagacg ggaggagtgg ggatggcaaa actggtcccc 2640 ccccgtggtg attgaactta agtccggggc cacccctata ggggtccgac aatatcccat 2700 gagcagagaa gctcaagagg gtatacgccc ccaaattaac aaactgctcc aacaagggat 2760 tttggtccca tgcaaatccc cttggaacac tcctctactt ccagtaaaaa aaccagggac 2820 cagtgactac cgtccagtac aggaccttag agaagtcaac aagagagttc aggacataca 2880 ccccacggtg ccaaatcctt ataacctcct cagcaccttg ccacctggtc ggacatggta 2940 cacagtcctg gatctcaaag acactttttt ctgtttgagg ttacacccca acagccagcc 3000 cttgttcgct ttcgaatggc gagactccga gagtggacaa gccggacagc tcacatggac 3060 gaggctgcct cagggattca agaactcgcc cactttgttc gatgaagccc tacaccgaga 3120 tcttgctctt ttccgagcca ataacccaca ggtgactctt ctgcaatatg tagatgacct 3180 gctcctagct gcagaaacac acgaggactg tgaaattggg acctaaaacc tcctgggcga 3240 gttaggtaac ctggggtatc gggcctctgc taaaaaggct cagttatgcc agatagaagt 3300 gacctaccta ggatatgtct tgagagatag acaacggtgg 
ctcacagaag ccagaaaaca 3360 agctgttatg cagatcccga ccccaaccac tgctcgccag gtaagagagt tcctggggac 3420 cgccgggttt tgcagactct ggattcctgg atttgccaca ctggcagctc ccttgtatcc 3480 actaaccaaa gagaaagggg aatttacctg gaccagagaa catcagctag cctttgaaac 3540 tctcaaaaag gcactgctgt aggctccggc attggccctg ccagatttaa acaaaccttt 3600 caccctatac attgatgaaa gaaatggagt ggcaagggga gtccttaccc aggttttggg 3660 accatggaag cgcccggtag cctacttatc aaagaaactg gaccctgtgg ccagtggatg 3720 gccctcctgc ctgcgagcga tagcagccac ggctgtgcta gtaagagatg ctgacaaact 3780 gactatgggc cagaatgtta ctatagtggc cccacactct cttgagagca tcatcaggca 3840 accactggac cgctggatga ccaacgcccg aatgacgcac taccagagcc tattgctgac 3900 agagcgagta agttttgcac ccccagccat tctcaaccct gcctccttac tacctgaggc 3960 tgacgaggcc cctgcacata agtgtgaaga aatactggca gaagagactg gaatccggcc 4020 agacctcaca gaccaacctt cgccaggggc gatgacttgg ttcacagacg gaagcagctt 4080 tgtggtagaa ggtaagcgga gggctggggc agcagtagtg gatggaaagt ctgtcatatg 4140 ggccagcagt ctgctggagg gtacatcagc tcaaaaagca gaactaatcg cattaattca 4200 agccttaagg ctggcagaag gaagggctct taatgtctat actgacagcc ggtacgcttt 4260 tgccacggct catgttcaca gagcaatata ccgacaccgt ggactgctga cgtctgccgg 4320 caaagatatc aaaaataaag aaggaattct cagcttatta gaagctgttc atctgccccg 4380 tagggtggta attttccatt gcccaggaca ccagaaggga actgggcccg ttgaaaaggg 4440 aaatcaaatg gcagaccaag aagctaaaaa agcagcccaa gggccaatga ctctggtggt 4500 gagaacccaa cagcccgctg ctgaggaaat aaataaaaga accctcacag aagaagaggg 4560 gcgagattac ttagctaaca tacaccatct gactcattta ggaactaaaa aattactaaa 4620 attggttagt aagtccccct attacattcc tggattaaaa agaattgtgg aagagatagt 4680 aaaaaactgc cgtgcttgtg cacttaccaa cgctgggtct agcaggctcc aggaaagaaa 4740 acgactgcga ggagacaggc ctggagccta ctgggaaact gacttcactg aggtgaaacc 4800 ggctaggtat ggaaataaat atctcctagt ttttatagac accttttcag gatgggtcaa 4860 agcattcccc accaagaaag aaacgactaa tgtagtggtc aagaagatac ttgaagaaat 4920 ccttccccgt tttgggatac ctaaggtaat ggggtcagac aacagacctg ccttcgtctc 4980 ccaggtaagt cagggattgg ccagacaact ggggacaaat tggaaattac 
attgtgcata 5040 cagaccccag agttcaggac aggtagaaag gatgaacaga acgctaaagg agactctgac 5100 taaaatagcc ttagaatccg gtggaagcga ttggacagcc attctccctt atgccttgtt 5160 cagggttcag aatacacctg gaccccttgg cctaactcca tttgaattaa tgtatggggc 5220 gcccccaccc atttttatga ccgtaggtga taagaatcgc ctggatgtgt ctttctctcc 5280 tccttctagt cttttggctc gattaaaagc tctcgaaata gtaagaaaag aggtctggga 5340 acagctaaaa gaaacctatg ttgctggtga cacacaggtg ccacatcagt ttgaagtagg 5400 agacgcagtc ctggtgagga gacaccgagc aggaaaccta gaaccgaggt ggaagggacc 5460 ctacttggtg ctactgacaa cgcccaccgc ggtcaaagtg gaaggaatcc ccacttgggt 5520 ccacgcatcc cacgtcaaga gagcaccccc tggagtcagc catgatgagt ggactttgga 5580 gaagactact aatcctttta agttgcgcct gcttcgtagg agcgatccca aaagacttca 5640 acccccacag tcctgttcaa caaacgtggg aggtactcaa tgaggagggt agggctgtat 5700 ggacaatcgc cgaggtacac cctctgtgga cttggtggcc tgatcttttc cctgacatct 5760 gtaagttggc tataggagcc cctcctggat gggacttgga ggggtactct gacattcaga 5820 gggcaccttt aacaccccct ccgtacgtag aaaaacattt gagagacccg tggggtggtt 5880 gctctaacca aagggataga agtatgcttc gaacccatcc cttctatgtc tgccccgggc 5940 cccaccaaag tcagtccctc aatccaacgt gtggaggtaa ggctgacttc ttttgcaaga 6000 gctggggttg tgagacttca ggtacagccc gctggaagcc ctcctcgagc tgggactata 6060 ttagagtaac agccaactat tccctagcat cttatgtacc tggaggattt gacctagacg 6120 agtgtactga ctggtgccat ccgctccgtg tcactttcac tgaaccaggg aagagagctc 6180 tgggatggac aagagggtat acctggggtc ttaggattta caaggaaaga tatgatgagg 6240 gattattgtt cactatcaga ttaaaaatag agacccctta caatccttta ggccccccaa 6300 ccaagttcac acccctcacc catacaatta ctcagcctac tccagtgatt gcggaccccc 6360 ttaatatggc cgctatcacc caacctccca ctcctcaggt acctctaact attaccccca 6420 cgattccttc aagacagagg atgtttaacc tagtgagagg agccttttat gcccttaaca 6480 gaactgatcc aagcgctact gaggactgct ggctatgcct gtcctcgggt ccgccttatt 6540 atgaaggaat cgccttcaat ggagatttca acagaatcag cagccatact tcctgctctt 6600 ggggaacagg acaaaaactg accctgactg aagtatccgt gaggaatcca ggtctctgta 6660 taggcacccc accttccact cacaaacacc tatgcggaca aattcagtcc atgtccagaa 
6720 cagaagctaa ttactatctt gtaccttccc cggttggatg gtgggcttgc aatacaggac 6780 ttactccctg tgtatcaact aaggttttta attcatctca tgatttttgt gtcatgatac 6840 agctgttacc ccacgtatat tatcaccctg catccagttt agaagaaagc tatgctggcc 6900 ggaggtcaaa aagagaacca actactttaa ccctggctgc attcatggga ataggtatgg 6960 cagtaggagt ggggacggga gtgtcagctt tgatagaagg aagacaggga attcagtctt 7020 tgagggatgc tgtcaatgaa gacctagcgg caatagagaa gtccattgac gctttaaaaa 7080 aatctttgac ctccctgtct gaggtagttt tgcagaacag gagaggtctt gatttgttgt 7140 tcctcaagga aggaggactg tgtgctgccc ttaaagaaga gtgctgcttc tatgcagatc 7200 atacaggaat agttagagac tctatgcaga aactgagaga aaaattagag cgaaggaaac 7260 cggaacggga tgctcaacgg gggtggtttg agtcgtggtt tgaatcacga ccatcttgga 7320 taacttcttt aatttccgct gtagccggac caatccttat gatatgctta gctttagttt 7380 tcagcccttg tataataaat agaggaatgg ctttcatcca gagtaaaatt gatacagtaa 7440 aactcatggt tcttcaaagg caatatcaac ctatagttca ggtagatgaa gagttagggg 7500 acaccaatct ctaaaattct atgattagaa ttagtctaaa cagaagaaga ggggaa 7556 // ID RLTR24_MM repbase; DNA; ROD; 617 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 14-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW retrotransposon; RLTR24_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-617 RA Jurka J. and Drazkiewicz A.; RT "RLTR24_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 9-9 (2002). 
XX DR [1] (Consensus) XX SQ Sequence 617 BP; 151 A; 150 C; 104 G; 212 T; 0 other; tgatgaacca gcctgcctcc atcttaggct taaaagccat cttatagtaa agacaaacta 60 ggttcattcc tgtttatgat taaacctctg tttctcaagg attgggctat gccccacctg 120 taaccttaac tacaaatggt tctgtacttg cctgttccag gaatggcaat catgtctttg 180 tttcaaaaag ttattaagac cacctgttgt tatgactacc tgttgttatg actaccttat 240 aaccatcttg caaccctacc tttgtttcaa aaggtttgtt atgactacct gttgttatga 300 ctaccttgtt atgaccacct tgcaaccatg tctttgtttc aggaggtcat tatgactaac 360 ttgttatgct tatgttctgc tcctgtaacc ctgcctattt tgcctgccaa atcccccatt 420 tggaaacccc ctacccctga gctataaaaa ccttgtcttc ctcatatcca atgctgacct 480 cttgaaccct accttagggg gaggcagccc atgtacacga ataaaaaagg cttgctttaa 540 ttaattgctt gctttaatta atttggccat gatgatttgg gtcggtggtc tttctcctcc 600 catctttggg attaaca 617 // ID RLTR32_MM repbase; DNA; ROD; 616 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 21-AUG-2008 (Rel. 7.09, Last updated, Version 2) XX DE Mouse putative long terminal repeat - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW Long terminal repeat; retrotransposon; RLTR32_MM; RLTR32A_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-616 RA Jurka J. and Drazkiewicz A.; RT "RLTR32_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 18-18 (2002). XX DR [1] (Consensus) XX CC 80% similar to RLTR32A_MM (bases 46-611). 
XX SQ Sequence 616 BP; 170 A; 151 C; 115 G; 180 T; 0 other; tgtagagagc acaagggcct tgtgctcaca gataagcact agggaaacca ggaatgttaa 60 acacacaggg ttgtctcctc aaaggaggag ataaaattaa atacctgttt tccacccaga 120 aggctgagag tggatgttgt taataacctg gtgacctttt tgctatctgt agagacagac 180 ctcacccaaa tcccccagaa tgattatccc agactcttcc ctcactagac cattacccat 240 ccttagggga gatgtcacat gtactttatg gcctgctgac ctctatgctg tcggtcagag 300 accacactag atccttccct aggcctttta gctatagaac ccaccttcat ggtcacatgt 360 actttgcaaa agagtttcaa tgtagtcata ccttaagctt tattgtacac ctggaattgg 420 aatgattatg gaaattattg ttgcctggaa acttttttcc aaattctact gtgtttaaat 480 atgcctacaa taaactgcct gtgcttagac tccctgaagt ctgatccagg ttgacaaagt 540 caatctgaac caaattttca ttctcacctc ttcacacggt ttgttttcct ctatgacacc 600 tgcagggacc ccctca 616 // ID IAPA_MM repbase; DNA; ROD; 334 BP. XX AC M99279; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 26-MAR-1997 (Rel. 2.02, Last updated, Version 2) XX DE Repeat associated with intracisternal A particle. XX KW Repetitive sequence; intracisternal A particle; MMIAPA; IAPA_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Aota S., Gojobori T., Shigesada K., Ozeki H. and Ikemura T.; RT "Nucleotide sequence and molecular evolution of mouse RT retrovirus-like IAP elements."; RL Gene 56(1), 1-12 (1987). XX RN [2] RA Clouston M.W.; RT "The angiotensionogen gene of swiss mice is closely linked to a RT retrovirus-like element."; RL DNA and Cell Biology 9(9), 623-630 (1990). XX RN [3] RP 1-334 RA Algate A.P. and McCubrey A.J.; RT "Autocrine transformation of hemopoietic cells resulting from RT cytokine message stabilization after intracisternal A particle."; RL Oncogene 8(5), 1221-1232 (1993). XX DR GenBank; M99279; Positions 578 911. 
XX SQ Sequence 334 BP; 111 A; 63 C; 86 G; 74 T; 0 other; attggtgccg aattccggga cgagaaaatc cgggacgaga aaaaactccg gactggcgca 60 ggagggatac ctcattccag aaccagaact gcgaatcaag gttataaggt tcccgtaaca 120 cagactgttg agaaggattc aactgccgaa ttcagaactc atcagctggg gaacgacggt 180 gataaaggtt cccgtaaagc agactgttaa gaaggattca actgtatgaa ttcagaactt 240 ttcagctggg gaacgaggta agtctgatct tgaactttct aaggaaattc aagacagtct 300 atcagaagta aagtggaaaa tggctttaca agtt 334 // ID RMER6 repbase; DNA; ROD; 350 BP. XX AC . XX DT 22-APR-1997 (Rel. 2.03, Created) DT 22-APR-1997 (Rel. 2.03, Last updated, Version 1) XX DE Medium reiteration frequency repeat - a consensus. XX KW Repetitive sequence; RMER6. XX OS Murinae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae. XX RN [1] RA Chopra V. and Jurka J.; RT "RMER6."; RL Direct Submission to Repbase Update (30-NOV-1996). XX DR [1] (Consensus) XX SQ Sequence 350 BP; 64 A; 101 C; 57 G; 128 T; 0 other; aatgtaactt ttattctcct gatgtaatgt tgtctggagg cttacagcct ccgtctgcta 60 acctaggcct agacctagaa gcttctagct ttcgtacaat cttatctaag cctagaatgt 120 tttcagcctc tgagacttcc tgctgaataa gctcaccctt cctagttctt tctgatctct 180 ggctggttca actcagctgt ttgctcaaac tcctctccaa gctgactgat tcaatctggc 240 ttctctctcc tcctctcctg aattgctctg cttggcctca aactaactct ggcaatctgt 300 tctaatcttc tggctccttc tcattctctg gcttgttctg tctttacctg 350 // ID RLTR6I_MM repbase; DNA; ROD; 7778 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse family of LTR retrotransposons - a consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; RLTR6I_MM; KW RLTR6_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. 
XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31, 51-54 (2003). XX RN [2] RP 1-7778 RA Pavlicek A. and Jurka J.; RT "RLTR6I_MM - a family of autonomous LTR retrotransposons."; RL Direct Submission to Repbase Update (JAN-2004). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Distantly related to RLTR4I_MM and HERVR. LTRs CC listed as RLTR6_MM. CC RLTR6I_MM_ORF: 2371-3619 (416 aa) pol (partial) CC MVIGATGSKFYPWTTKRALQINKNIVTHSFLVIPECPAPLLGRDLLTKLKAQVQFTSEGPQVSWGKAPVA CC CLVLNTEEEYRLHEEQPKNAVSSGWLTAFPNVWAEQAGMGLAKQVPPVVVELKADATPISVKQYPMSKEA CC REGIRPHIQRLLGQGVLVACQSPWNTPLLPVRKPGTNDYRPVQDLREVNKRVLDIHPTVPNPYNLLSSLP CC PERTWYTVLDLKDAFFCLRLHPKSQLLFAFEWRDPEGGQTGQLTWTRLPQGFKNSPTLFDEALHRDLAPF CC RARNPQLTLLQYVDDLLVAAASKELCHQGTERLLAELSDLGYRVSAKKAQICQTEVTYLGYTLRGGKRWL CC TEARKKTVMMIPSPTTPRQVREFLGTAGFCRLWIPGFATLAAPLYPLTKEGVSLSSKERRTIPESF CC RLTR6I_MM_ORF: 5718-7245 (509 aa) env CC MKKTTTTIGQWQPLTILLSFVCAAGATLDLGNLNPHAPIQQSWDVLNEKGNIVWATTAVHPLWTWWPDLT CC PDICKLVAGSPNWDLSDHTDLSNPPPEERCVPNGIGSTYGCSGQFYRANLRAAHFYVCPGQGQSKRLQQE CC CGGASDYFCGKWTCETTGDAYWKPSSKWDLITVKRGSGYDKSNEGERNPYKYQESGCAFKNRPSGPCKDK CC YCNPLRIRFTENGKQHRLSWLKGNRWGWRVYIPLRDPGFIFTIRLTVRDPAVTLVGPNKVLIEQGPPVVL CC APPKVPTVPAPPTPQPNTVVPSLGTNTPLIKPTLASPPPLGTENRLVSLVQGAFLVLNRTNPNMTQSCWL CC CYASSPPYYEGIAQIRTYNITSDHSQCLWGENRKLTLAAVSGRGLCLGRVPQDKGHLCNQTQNIQSSKSG CC QYLVPPLDTVWACNTGLTPCVSMSVFNSSKDFCILVQLIPRLLYHDDSSFLDKFDHRVPLEKRTRYLNFG CC SSIRIGSSSWSRYRNRCLN. 
XX SQ Sequence 7778 BP; 2075 A; 1952 C; 1966 G; 1785 T; 0 other; cagcgcgacc acccagaggt cctagaccca cttagaggta agattctttg ttctgttttg 60 gtctgatgtc tgtgttctgt ttctaagttt ggtgcgatcg cagtttcggt tttgcggacg 120 ctcagtgaga ccgcgctccg agagggaacg cggggtggat aaggatagac gtgtccaggt 180 gtccaccgtc cgttcaccct gggagacgtc ccaggaggaa caggggagga ccagggacgc 240 ctggtggacc cctttggagg ccaagagacc atttggggtt gcgagatcgt gggtttgagt 300 cccacctcgt gcccagttgc gagatcgtgg gttcgagtcc cacctcgcgt tttgttgcga 360 gatcgtgggt ttgagtccca cctcgcgcct tgttgcgaga ccgtgggttc gagtcccacc 420 tcgcgtttgg tcacgggatc gtgggttcga gtcccacctc gtgcagaggg tctcaatcgg 480 ccggccttag aaaggccatc tgattctttg agttgcttgt ggtcgacgca gagtcgccgc 540 cgtttctggt ttcttttttg tcttagtctc gtgtccgctc ttgttgtgac tactgttttt 600 ctagaaatgg gacaatctgt gtccactccc ctttctctga ctctggagca ttggaaggag 660 gtgcgggtca gagcccacaa ccagtcggtg gaggtcagaa agggtccgtg gcagaccttt 720 tgcgcctccg agtggccaac gtttggagtg ggctggccac ctgagggtgc ttttgacttg 780 tcactaatcg ccgccgtcag gcgaattgtt tttcaggagg aagggggtca ccctgatcag 840 atcccctaca ttgtgacctg gcagaatctc gtccaattcc cacctccttg ggtcaagcct 900 tggaccccaa actcttcgaa actgacggtt gcttttaggt tgcccagtct gatgcaagcc 960 ggaaagtccg gcccgtcagc accccccaag atctatccag agattgacga cctcctctgg 1020 atggactccc aacctccccc ttaccccctg ccccaactcc agagagcagc caccgggttg 1080 cagcggcccc accacatggg accagtagcc tagagagaaa tgggctcagg gatccggagc 1140 aaacgggggg tggtccctga ggactcggag gcgccgaggc cgaagctcct ggggaaaagg 1200 aaggggggcc tgattcaaca gttgccttgc cactcagagc acatgtggga gggccagctc 1260 ccaggcacct aatgatctca ttcctttaca gtactggcct ttttcctctt ctgatttata 1320 taattggaaa actaaccacc ctcccttctc agagaagtac ccctctggtc ttactgggct 1380 ccttgagtca cttatgttct cccatcaacc cacttgggat gattgtcagc agcttttgca 1440 ggttcttttt accacagaag aaagagaaag aatcctgagt tgctggaggc gagaaaaaat 1500 gttctgggag aggacggcac acccactgcc ctccctaacc tccagtggac gaggctttcc 1560 ccttgaaccg ccccaactgg gactacaaca ccgcggaagg taggggacgc ctccttgtct 1620 aatcgccgga ctctagtggc agtctcgtct 
cagaggagcc gctagacggc ccaccaattt 1680 ggctaaggta agagaggtct tgcaggggca gactgaacca ccctcagtct tccttgagcg 1740 tctaatggag gcatatagga gatacacccc ttttgacccc ttgtcagagg ggcagagagc 1800 cgctgtagcc atggccttca ttggtcagtc cgctcccgac attaagaaaa agctgcaaag 1860 gctggagggg ctccaagatc atacgctcca agatttagta aaagaagcag agaaagtcta 1920 tcataagagg gaaacagaag aagagaggca ggagagagag aagaaagaaa tggaggagag 1980 ggaaaataga cgggatcgcc gtcaggagag aaatctgagt aaaattttgg ccgcagttgt 2040 gaatgataga cagtcaggaa aaggtaaaaa agggctcctg ggcaacaggg cagtgaaacc 2100 gccaggtggc agaaagataa ccacttggaa aaagaccaat gcgcctattg caaagagaaa 2160 ggacactggg ctagagattg ccctaaaaaa cgggagcgat ccaaggtcct gaccctagaa 2220 gatgattagg gaagtcgggg ctcagacccc ctccctgagc ctagggtaac tttgtccgtg 2280 gaggggaccc ccgtcaactt cctgatagac accggagcag agcattcagt actcactagc 2340 cccctaggca agctaggctc caaaaagacc atggtaattg gagccactgg tagtaaattt 2400 tacccctgga cgaccaaacg agctcttcag ataaacaaga atatagtgac ccactccttt 2460 ctggtgatac ctgagtgccc tgctcccctc ttggggcgcg atctgctaac caaactaaag 2520 gctcaagtcc aatttacttc agaaggccca caagtaagct ggggaaaggc ccctgttgcc 2580 tgccttgtcc tcaacacaga ggaagagtac cggttgcatg aagaacaacc caaaaatgca 2640 gtctcttcag gttggctaac tgcgttcccc aatgtctggg cagaacaagc aggaatgggg 2700 ttggctaaac aagtgcctcc ggttgtggta gaacttaaag ctgatgccac ccccatttcg 2760 gtaaaacaat accccatgag caaggaagct agagaaggca tccggcctca tatccagagg 2820 ttgctaggcc aaggagtttt agtggcctgt cagtccccct ggaatacacc acttctgccg 2880 gttcgaaaac cagggaccaa tgactatcgc ccggtgcaag acctccggga ggttaacaaa 2940 agggtcctgg acattcaccc cacagtcccg aacccgtaca atttattaag ctctctccca 3000 cctgagagaa catggtatac agtcctggac ttaaaagatg ccttcttttg cctgcgtttg 3060 caccctaaga gtcagctcct gtttgctttt gaatggaggg acccagaggg cggacagact 3120 ggtcaactaa cttggactag gctaccacag gggttcaaaa attcccccac cctgtttgac 3180 gaggccctcc atcgggatct cgcgcctttt cgcgctcgaa accctcagct taccctacta 3240 cagtatgtgg atgatctctt ggtcgcggcg gcctcgaagg agctgtgtca ccagggaact 3300 gagaggctcc tcgcagaact gagtgacttg gggtatcgag 
tttcggctaa aaaggcacaa 3360 atctgtcaaa ctgaggtaac ctacctgggg tataccctcc gagggggcaa aagatggctc 3420 acagaggccc ggaagaagac tgttatgatg atcccatcgc caactacccc acggcaggta 3480 cgtgagtttc tggggactgc tggcttttgt agactctgga ttccaggctt tgcaacccta 3540 gcagcacctc tatatccttt gactaaggaa ggggttagcc tttcgagtaa ggaaagaaga 3600 acaatcccag agagcttttg aggctatcaa gtcgtctcta atgactgtcc cccgcgctag 3660 cattaccaga cttgactaag cctttcgtcc tatatgtgga cgagagagcg ggtgtagcca 3720 ggggagtgtt gacacaagca ctgggaccct ggaagagacc tgtagcctat ttgtcaaaaa 3780 aattagatcc ggttgctagt ggatggccca catgtctgaa agctattgcg gcagtagccc 3840 tgctgatcaa agatgctgac aaattgacaa tgggacaaca ggtgacacat gttgtagccc 3900 ctcatgcctt agaaagtatc gtgcgacagt ccacctgaca gatggatgac aaatgcccga 3960 atgacacact atcagagcct gctgctaaat gagcgtgtaa cctttgcgcc ccctgccatc 4020 ctcaatccca gctacccttc tccctctaac aaatgattcc gtcccagtac atcaatgtac 4080 tgacatcctc gctgaagaaa ctgggaccag aagtgacctg aaggatctga ccaaccctgg 4140 cctggagctc ccagttggta cacggacggc agcagtttcc tgatagaggg gaagcgaaag 4200 gctggagcag cagtgcggtg gtggacggga aaaaggtaat ttgggcaagc gctttgcctg 4260 aaggaacttc ggcacaaaag gctgaactta tagcgcttat acaagccctc cgagaggcta 4320 aaggtaagat cgttaacatc tacactgaca gccgctatgc ttttgctacc gcacacatcc 4380 atggggccat ctacaggcag cgagggctat tgacatcggc tggtaaagac attaaaaaca 4440 aagaagaaat tctggccctg ttggaagcca tacatgcacc taagaaggta gccatcatcc 4500 actgccccgg ccaccaaaga ggagaagact tggtggccaa gggcaaccga atggcagact 4560 tagtcgcaaa acaagttgct caaggggcca tgatcttaac tgaaaaaggt gatccgccca 4620 aaagccctga ggacgagagg tataacataa aagagctatg gtggaccagt gatcccctcc 4680 catatttttt tgaagggaaa atagaattaa ctcccgaaga aggaataaaa tttgtgaaag 4740 gactacacca attcacccac ctgggagttg aaaaaatgat gagactaatt aaaaattccc 4800 gataccaagt ccccaacctg aagtcagtgg ctcaaaagat tatagactcc tgcaaaccat 4860 gtgcattcac taatgcaact aaagcctaca gagaacctgg aaagagacaa cggggagacc 4920 atcctggagt gtattgggag gtagacttta ctgaagttaa acctggaatg tatggtaaca 4980 agtatctgtt agtatttgta gacacctttt caggatgggt agaggcattt 
cccactaaaa 5040 ctgagactgc ccagattgtg gccaagaaga tccttgaaag aaatcctgcc aagatttgga 5100 atccctaagg taatcgggtc cgacaatgga ccagcctttg ttgcccaggt aagtcagggc 5160 ttggccactc agttgggcat cgattggaaa ttacactgtg cttaccgccc tcaaagctca 5220 ggacaggtag agaggatgaa tagaacctta aaagagacct tgactaaatt agccattgag 5280 accggcggaa aagacttggg tggctctcct tcctcttgcg ctcttccgag cccgaaacac 5340 tcctggtcgt ttcgggctca ctccttttga agttctgtat ggaggacctc cccccttaat 5400 ggaagctggt ggaacattag tttccgactc tgaccctgtc ttaccctcct ctttgcttat 5460 tcatttaaag gccctagaag tgattaggac ccagatttgg gaccaactga aagcagccta 5520 taccccaggg accaccgcag taccccacgg gttccgagtt ggagacaaag tcttggtcag 5580 acggcatcga accggcagcc ttgagccacg gtggaaggga ccctatttgg tgttactgac 5640 aacccctact gcggtaaaag ttgacggaat cgcctcctgg atccacgcct cccacgtcaa 5700 gagggccgcc agtcaagatg aagaaaacca cgacgacaat tggacagtgg cagccactga 5760 caatcctctt aagcttcgtc tgcgccgcag gcgccactct agacctaggg aaccttaacc 5820 ctcatgctcc aattcaacag tcctgggatg tgcttaatga aaagggaaac attgtatggg 5880 caaccactgc agtccatccc ctctggactt ggtggcctga tctcacgcct gacatctgta 5940 agttagtggc aggatccccc aattgggacc tctcagatca tactgatctt agcaacccac 6000 cccctgagga gcggtgtgtc ccaaatggga tagggagcac atatgggtgt tcggggcagt 6060 tctaccgagc taatcttaga gctgcacatt tttatgtttg ccctggtcag ggtcagagca 6120 aaaggcttca acaagaatgc gggggggcat cagattactt ttgtggtaaa tggacatgtg 6180 aaacgacagg ggatgcttac tggaagccct cctctaaatg ggacctaatc acggtaaaac 6240 gaggtagtgg ctatgataag tcaaacgaag gagaaagaaa cccctataaa tatcaagaga 6300 gtgggtgcgc ttttaaaaac agaccctcag gaccatgcaa agataaatac tgcaaccccc 6360 tacgtataag gttcaccgag aacggaaaac aacaccgtct gagttggctt aaaggaaata 6420 ggtggggttg gcgagtatac attccactaa gagatcctgg gttcattttc acgatcagac 6480 tgacagtgag agacccggca gtgacactcg tagggcccaa caaggtcctt atagaacagg 6540 gccccccagt cgtactggct cccccaaagg tcccgactgt accagctcca ccaactccac 6600 agcccaacac agtggtaccc tccctaggaa ctaatactcc cctcataaag cctaccttgg 6660 cttccccacc gcccctagga acagagaacc gtctggtcag tctagtccaa ggagcttttt 
6720 tagttctaaa tagaactaac cctaatatga ctcaatcatg ctggttatgc tatgcctcta 6780 gcccccctta ttatgaagga atagctcaga tcaggactta taatattact tcagatcatt 6840 ctcaatgcct ttggggagaa aacagaaagt tgactctagc agcagtttca ggaagagggc 6900 tttgtctggg ccgggtacct caggataaag ggcacctctg taatcagacc cagaacatcc 6960 agtctagcaa aagtggtcag tatctagtgc ctcccctaga cacagtgtgg gcttgcaata 7020 ccggtctcac tccttgtgtg tctatgtctg tttttaatag ttccaaagat ttctgcattt 7080 tggttcagct tattcccaga ctcctgtatc atgatgatag ctctttttta gataaatttg 7140 accatcgggt cccgctggaa aagagaaccc gttaccttaa ctttggcagt tctattagga 7200 ttgggagtag cagctggagt aggtacagga accgctgcct taattaagac cccccaatac 7260 tatgaagaac tacgtgcagc tatggatatt gatcttagaa ctatagaaca gtctataacc 7320 aaattagaag aatctttaac ttccctgtcc gaagtggtgc tgcaaaatag aaggggatta 7380 gacttattat tccttaaaga aggaggactc tgtgctgcct taaaagaaga atgttgtttt 7440 tatgttgacc attcaggagt aatcaaagat tctatggcta aacttagaga acgcctagat 7500 atacgtaaaa gagaaagaga aagccaacaa ggatggtttg aaagctggtt taataagtcc 7560 ccttggctca ccactctcct ctccactata gcaggacctt taattacact tatgcttttg 7620 cttacttttg gcccatgcat ccttaataag ttagtagctt ttattagaga aaggataaat 7680 gcagtacaag ttatggtact aagacaacaa tatcgggtcc ttcaggaggt tgaaaactcg 7740 ttctaagatt agaactattt actagaagaa gtggggaa 7778 // ID LTR10_Cpo repbase; DNA; ROD; 419 BP. XX AC . XX DT 21-OCT-2009 (Rel. 14.11, Created) DT 21-OCT-2009 (Rel. 14.11, Last updated, Version 3) XX DE Long terminal repeat: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR10_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-419 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from guinea pig."; RL Repbase Reports 9(11), 2879-2879 (2009). XX DR [1] (Consensus) XX CC >94% identical to consensus. 4 bp TSD. 
XX SQ Sequence 419 BP; 106 A; 123 C; 88 G; 102 T; 0 other; tgaagagtta aaacatttag gaaaccccct cttgccttgt cacacggtca gaacaaccag 60 atgttctggc ctgataggga agttgtggtc caacgtcccc aaaaataaga gcaccgtgaa 120 ctatcgaccc tgccagacag gatatgagtt ttcaaaaaca tcaagccacc tgcagcggat 180 agctcaccca ccaataagca aatgacacga cccctattaa ccaatgccat ctgcttctgt 240 aaccgacgct tgctttgcgg ttttcggctt tatatactct gcactcccct acccgggcac 300 tctcccctct ccccaccgga gtgtggtcgg agagtcccac gcatgcgcgg ataataaaac 360 cccttgattt ttcatgagag ctggttcctt ggggtcttct tcgcgacggt gatcttaca 419 // ID MLT1AR repbase; DNA; ROD; 1735 BP. XX AC . XX DT 24-JUL-2000 (Rel. 5.06, Created) DT 24-JUL-2000 (Rel. 5.06, Last updated, Version 1) XX DE MLT1- LTR retrotransposon internal sequence - a consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; MaLR family; KW LTR retrotransposon; MLT1R; MLT1c subfamily; MLT1AR. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RA Smit A.F.; RT "Structure and evolution of mammalian interspersed repeats."; RL PhD dissertation, Univ Southern California, 1995. XX RN [2] RA Smit A.F.; RT "Direct submission."; RL Direct Submission to Repbase Update (31-JAN-2000). XX DR [2] (Consensus) XX CC Internal sequence consensus for MLT1A retrovirus-like element CC (MaLR). 
XX SQ Sequence 1735 BP; 441 A; 356 C; 501 G; 396 T; 41 other; gaaattggta ccgagagtgg gggtgctgct gtaacaaata cctaaaaatg tggaagtggc 60 tttggaactg ggtaatgggt agaggctgga agagttttga ggtgcatgct agaaaaagcc 120 tacattgccg tgaacggacc gttaagggca attctggtga gggctcagaa gaagaggaga 180 gctgtagaga aagcctcaat cttcttagag antacctaag tggtcgtgaa cagaatattg 240 gtagaaatat ggacggtaaa ggccattctg atgaggtctc agatggaaat gaggaacatg 300 ttattggaaa ctggaggaaa ggcnatcctt gttataaagt ggcaaagaac ttggctgaat 360 tgtgttcgtg tcctagtgtt ttgtggaagg cagaacttgn gagtgatgaa atnggatatt 420 tggcngaaga aatntctaag caaagtgttg agggtgcggc ntggcttctc ttgactgctt 480 atagtaaaat gcgagaagag agaaatgant taaagatgga attnntaatc aaaagggaag 540 cagaacttaa agatttggaa aattctcagc ctanccatgt tgtaaagaat gagaaagcgt 600 gttcaggaga gaacaccaag ggtgtggcca agcaaccgtt tgataaggag attagtatgg 660 atnaacggaa gccnggtgct attcatcaag acaatggaag aatgaccccg aaggcatttc 720 agagatcntc aaggctgccc ctcccatcac aggcccagag tgcaagggcc tggagggnag 780 aacggtttca agggcaggnn ccantcccca ctgcccagtg ccacctcagt ctgctcccta 840 tcttcggctg cccgtttagn tgtggctcaa gtgggcccag gtgcagctag ggccgcctct 900 cctggaggna caggttataa accttggcag catccgcgtg gtgccatctc cgcaggcgcg 960 cagagtgcat gagctgtgga ggcatggctn cctccaccta gatttcaaag atgcgagacc 1020 tggggcccaa gcagaggnct cgcggggcag ggccaccaca gagagccccc actagggcaa 1080 tgcccagtgg agccgtgggg tcnggcctgc aaagagcccc cactaaggca atgcctagtg 1140 gagctatggg ggcagggccg cctncgngac cccagaccng tagagccacc agnntgcaat 1200 tccagcctgg gagagccgca ggcangngac tccaacccgt gagagctgcc acatgggctg 1260 cgcccagcaa agccatgggg gtggngctnc cnggngtctt gggggnncaa cccccacccc 1320 agtgngtctg gaaggcggaa catcgagtca aagaagatta ttctcgagcc ttaagattta 1380 atgttgtttg ccctgttngg ttttggactt gctcgggacc tntcactcct ttcttctttc 1440 ctatttctcc cttttggaat gggaatgtct atcctatgcc tgtcccacca ttgtattttg 1500 gaagcacata acttgtttga tttcacaggt tcacagctgg agagcaattt tgcctcagga 1560 tgaatcacac cttgagtctc acccatatct gatttagatg atatttagat gagactttgg 1620 actttagact ttngagttga tgctggaacg 
agttaagact ttgggggcta ttgggatgga 1680 atgagtgtat tttgcatgtg agaaggacat gaatttnggg gggccagggg tagaa 1735 // ID RLTR20B2_MM repbase; DNA; ROD; 575 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; KW RLTR20B2_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-575 RA Pavlicek A. and Jurka J.; RT "RLTR20B2_MM - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Individual copies are ~88% identical to the CC consensus. 6 bp TSDs. XX SQ Sequence 575 BP; 159 A; 113 C; 176 G; 127 T; 0 other; tgttgtggat ttggtttaat gcttgtatta tgttaattgg gttcccaaaa ttgcacgaga 60 atccccgcat gtaagacgct gagggtccct gcccccagtt ggttctgatt ggtaaataaa 120 gttgccagtg gccaatggct gggcagggag acagaggcag gactttagga ttcgcaggca 180 aggggaccaa gagaggaaga aggaggtaga atcgccatgc caggaaagga ataagatcca 240 ggcttgagag ctgcagaaga gagagcatac caatcatgta agagccaggg aagagcagcc 300 ccagggcccc cccccccccc ccctaattgg gtctagggta gcaaagatgg aatatagatt 360 ttagtaagta ataactcagg agtatcggag gggaggtgtt agcaacgtgg aagtttggga 420 gtggcccagc cattgagctg tttaaggcat attaaaatat aaggctgtgt gtgtgtgtct 480 ttcattcggg aatccagaac attggggcgg gtagcaagga actcgcgctg ccgccaccgg 540 ggagatttga gtaggattaa tcacctactg caaca 575 // ID MLT1A repbase; DNA; ROD; 374 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 24-JUL-2000 (Rel. 
7.3, Last updated, Version 4) XX DE Mammalian long terminal repeat (MLT1A subfamily) - a consensus. XX KW LTR; MaLR family; retrovirus-like MaLR element; MLT1A2; MLT1A. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-374 RA Smit A.F.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21, 1863-1872 (1993). XX DR [1] (Consensus) XX CC LTR of MLT1A retrovirus-like MaLR element. 5 bp target site dups. CC Average divergence from consensus 17%. Intermediate subfamily CC between MSTD and MLT1C. CC The subfamily MLT1A1 differs from this consensus by two small CC inserts and a few substitutions. XX SQ Sequence 374 BP; 96 A; 87 C; 97 G; 93 T; 1 other; tgctatggac tgaatgtttg tgtcccccca aaattcatat gttgaagccc taatccccaa 60 tgtgatggta ttaggaggtg gggcctttgg gaggtgatta ggattagatg aggtcatgag 120 ggcggggccc tcataatggg attagtgccc ttataaaaga gaccycagag agctcccttg 180 ccccttccgc catgtgagga cacagtgaga aggcgccgtc tacgaaccag ggaatgagcc 240 ctcaccagaa actgaatctg ccggcgcctt gatcttggac ttcccagcct ccagaactgt 300 gagaaataaa tttctgttgt ttaagctacc cagtctatgg tattttgtta tagcagcccg 360 aacagactaa gaca 374 // ID RLTR41_LTR repbase; DNA; ROD; 446 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from mouse. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; RLTR41_LTR. XX OS Mus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-446 RA Smit A.F.; RT "RLTR41_LTR - ERV1 Endogenous Retrovirus from mouse."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC related to RNLTR10 in rat 13% subs (subfams).
XX SQ Sequence 446 BP; 100 A; 136 C; 113 G; 95 T; 2 other; tgaagaggtc agctcaggac ctttggaaaa ttccaccagg cacccctccc ctcctggaag 60 agagaggtca tggagcacat aggcacccct cccctccgga agggagaggc tgattacatt 120 cctcagaaag accgcagggg ggactgaccg ttggccgcct gcagataagg gaagcccttg 180 ctacctcatt ccctaaggac caatcagttt aaaagtcaca ctgttctgcc aatcacattg 240 tgcctagtng ctgntgctct attctgcccc tgaaaactgt ataaaaactc gccgaacggg 300 ctgcccgggg tcgccgcctc tccttcgggt gcaggacgac cccagcgcgc tggaacaata 360 aattcctctt gcttttgcat cgatccctgg ctccacgtgg ttcactcagg gggtccccgg 420 tagctaaggc tcgtcagagt cttaca 446 // ID RICKSHA_0 repbase; DNA; ROD; 1708 BP. XX AC . XX DT 09-JUN-1999 (Rel. 7, Created) DT 09-JUN-1999 (Rel. 7, Last updated, Version 1) XX DE RICKSHA_0 repetitive element - a consensus. XX KW Non-autonomous DNA transposon fossil; RICKSHA; RICKSHA_0. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1708 RA Kapitonov V.V. and Jurka J.; RT "RICKSHA_0."; RL Direct Submission to Repbase Update (JUN-1999). XX DR [1] (Consensus) XX CC RICKSHA_0 is a non-autonomous DNA transposon (it does not carry CC an external part of HERV-L virus found in RICKSHA). CC Putative non-autonomous DNA transposon fossil. It has 70 bp-long CC terminal inverted repeat. CC Target site is unclear. CC Average identity of individual copies to the consensus sequence CC is about 86%.
XX SQ Sequence 1708 BP; 511 A; 303 C; 322 G; 572 T; 0 other; gggtttggat cataatccca aaagacacaa tcccaaacgc cataatcccg aatgttgaaa 60 tcccgaaaga tcaaaatccc taaagtctaa aatccctaaa gtctaaaatc ccaaaaattc 120 acacaggatg gttgcatcat gttaggcaga actgttattt tcttattgtc tttatgcaga 180 aaaaatggat tttaattgaa tccccaaacc ataatgacag atttggaatt aggtgcgatc 240 aaggcttcta aaagtgaatt tcaaggtgtt accaataaag tttgtttttt tccattcagc 300 ccaatgcatt tggtggaaaa ttcagatgag tggattggcc atgcgatacg gcaacgacga 360 aaacttcagt ttaaaaatgc gtcatttgcc tgcattggca ttccttccag ctgatgacat 420 tccgggagct tttaatgaat taaagccgca tttgcctgaa gaagtcagcg aagttactga 480 ctggttcgaa aataattatg tgcacggtag gataagaaga cacttacaca acggtgttgc 540 cgttcgatta ccagtattgt ttctaccaaa tttgtggtct gtatatgagt gcatgcagaa 600 tggatttcta tatacccaaa acaacataga agcatggcac agaagatggg aaaatttaat 660 agggaatgct catgtcggtg tatatcgaat cagaagattc aaaaagagca gcgccacgta 720 gaaaatgaat gtgaacatat tctccgagga gagccatgtc ctaaaagaaa aaaaaaagca 780 gctattcatc gcgatgcaag acttcaaaat atagttaatg atcgtgaaag tcggccagct 840 cttatggact atctccgtgc aattgcccat aatctatccc tgtaatatac tttttcatat 900 gtcgaatttt ctttttagtt ttttttcact attttaaatt gtcagcatta ttttttacaa 960 ttcgctatgc tatgtatttc atcttcgcat catttccaat actggaggta taaattgtgt 1020 aaagactttt agagagttct aattcgtttt atgcattttt tgcaaatttg actccacgaa 1080 agtgcattat cacaacgttg actttgtgtg taagcattgt gcgtgtacgt aaaaacgttg 1140 aaacttcctc aataaatgaa gagatgtcct ttttgtacat ctgcatttgt gaaagataaa 1200 atttctcgag atctcggctc tttgggcgac tgcatatgca gtggtgaccc atcgcggttt 1260 ttgatcgatc tcgtcaaaag acttaggttg ttcgtcacgg tatttcagat gaccgcagtt 1320 ataaagctgg gtgcacacaa ttaccaacca tagtgatatg cgtttataca tttccctttt 1380 tgacctattt ctttatgaat acggttcgtc tgctcataac tgttataccc gtgcgactgt 1440 cattagtata cctgagtgtt tatgcttgca aaaatatgta tgttattatt gcctatttta 1500 ttgtgtaaag tggcctatga agtgttctgt catgttttta tatgtttctc aaataaatcc 1560 ccttttaaaa atgtaaataa atatctttta aaaaattttt aaattatttt ttccagaatt 1620 atatttttgg gattttgatc tttcgggatt 
tcaacattcg ggattatggc gttcgggatt 1680 gtgtctttcg ggattatgat cggctccc 1708 // ID ID-B1_Cpo repbase; DNA; ROD; 231 BP. XX AC . XX DT 02-APR-2010 (Rel. 15.05, Created) DT 02-APR-2010 (Rel. 15.05, Last updated, Version 2) XX DE SINE Non-LTR Retrotransposon from Muridae. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW ID-B1_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-231 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 10(5), 777-777 (2010). XX DR [1] (Consensus) XX CC >84% identical to consensus. The 5'-part is tRNA-derived and the CC 3'-part is 7SL-derived. Homologous to ID_B1 but younger. XX SQ Sequence 231 BP; 61 A; 62 C; 72 G; 36 T; 0 other; ggggctgggg atgtagctca gtggcataag cacctgcctg gcaagcgcga ggtcctgagt 60 tcaatcccca gtaccacaaa aagccgggcg tggtggcaca cgcctgtaat cccagcactc 120 gggaggctga ggcaggagga tcgccgcgag ttcgaggcca gcctgggcta caatagtgag 180 ttcaaggcca gcctgaactg catagcgaga ccctgtctca aaaaaaaaaa a 231 // ID RMER5 repbase; DNA; ROD; 401 BP. XX AC . XX DT 22-APR-1997 (Rel. 2.03, Created) DT 22-APR-1997 (Rel. 2.03, Last updated, Version 1) XX DE Medium reiteration frequency repeat - a consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW Repetitive sequence; RMER5. XX OS Murinae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae. XX RN [1] RA Chopra V. and Jurka J.; RT "RMER5."; RL Direct Submission to Repbase Update (30-NOV-1996). 
XX DR [1] (Consensus) XX SQ Sequence 401 BP; 111 A; 96 C; 58 G; 128 T; 8 other; atctctgatg aggtggaatg ttcaggtcca gggaaaactt atcttgggct atytttagcc 60 cccttcatag mcayccatcg cctacctcta tgtttgaact aacaycctgt gactattagg 120 taaaaccctt ttagaatata ctaatagacc cctagcttcc accaaccawa gctaagatta 180 tcatcagata gagctktcct cactttgcca tagctcccta ccctgtattc ccaccaatgt 240 atttgaaaat tgccttratt tatgactttc ttttcttacc atawaaatca tgaaactgtt 300 accacttgga acatagcatt tgagtaactt aaatccgtgc tcccaggatg gtcatcatat 360 gctccagata actcttatcc ttaatgaagt gtgtttttca a 401 // ID LTR7_Cpo repbase; DNA; ROD; 403 BP. XX AC . XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 31-JUL-2009 (Rel. 14.07, Last updated, Version 2) XX DE Long terminal repeat of ERV2 endogenous retrovirus: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR7_CPo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-403 RA Jurka J. and Baney O.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1548-1548 (2009). XX DR [1] (Consensus) XX CC ~93% identical to consensus. XX SQ Sequence 403 BP; 83 A; 123 C; 72 G; 124 T; 1 other; tgtgggaagc cctgtaattg gctgccacct tatatctgca ggcarcacca ccttaactcc 60 ctctaggagt taagtttggt agttaagcag ctccttgcct atccctttgt ttggcccatt 120 cagggattat gctaatcagc ctgccttatg agcccgcgcg caggaaattt gaaaatttga 180 attaacctat aaccctcagt cttgccactg ttgctaagtt acctgctgac gtctcagaac 240 ccaactccct cctcccccaa taccctatat atttgttact ttttccttga ataaatgaga 300 cttgatcaga cttctgtctt gtctccattc tttgcgtctc ttgtcccctc tcatccccac 360 tccccctcta gggtctccgt ggacttaccc gcaggccggg aca 403 // ID MLT1B repbase; DNA; ROD; 390 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 23-JAN-1998 (Rel. 6.4, Last updated, Version 3) XX DE Mammalian transposon-like element long terminal repeat (MLT1b DE subfamily) - a consensus. 
XX KW Non-LTR retrotransposon; MLT1b subfamily; MER15; MER18; MLT1B. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-390 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [2] RP 1-390 RA Smit F.A.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21, 1863-1872 (1993). XX DR [2] (Consensus) XX CC Replaces MER15 (acc.# X59019) and MER18 (acc.# X59024). XX SQ Sequence 390 BP; 117 A; 83 C; 92 G; 90 T; 8 other; tgttatgggc tgaattgtgt ccccccaaaa ttcatatgtt gaagtcctaa cccccagtac 60 ctcagaatgt gactgtattt ggagataggg tctttaaaga ggtaattaag ttaaaatgag 120 gtcattaggg tgggccctaa tccaatatga ctggtgtcct tataagaaga ggaaattwgg 180 acacagacac gcacagaggr aagrccatgt gargacacag ggagaaggcg gccatctrca 240 agccaaggag agaggcctca gaagaaacca accctgccgr caccttgatc tcggacttcy 300 agcctccaga actgtgagaa aataaatttc tgttgtttaa gccacccagt ctgtggtact 360 ttgttacggc agccctagsa aactaataca 390 // ID MamGypLTR2c_LTR repbase; DNA; ROD; 1068 BP. XX AC . XX DT 05-FEB-2007 (Rel. 12.01, Created) DT 10-APR-2007 (Rel. 12.01, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from mammals. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW MamGypLTR2c_LTR. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-1068 RA Smit A.F.; RT "MamGypLTR2c_LTR - Gypsy LTR Retrotransposon from mammals."; RL Direct Submission to Repbase Update (05-FEB-2007). XX DR [1] (Consensus) XX CC 4 bp TSD; 35% subst in dog-human; rnd-4_family-2757; 5' end CC undefined; closest to MamGypLTR2b. 
XX SQ Sequence 1068 BP; 280 A; 242 C; 350 G; 171 T; 25 other; attttgagag aaaangnntn ttttngaatt cnaaaagctt ggctttgaat atcaagagcc 60 aaancctgga ggactcagag catagagttt atgccataaa ccatagagtg ggaagaatgc 120 atgatgatgt aagagcagtt ttcccaccca aaaggagata aaagttcgac cagctggngg 180 gaggaggaga nttggagagg gcgngactag agcttagatg ctggtgggtc cctgagaagg 240 ggcccggcgc cctgcccact ccctgggccg agtccnnggc gggnnggcgg ganaatagaa 300 accccagatg agtttggggg gatcccagag cagaagggac ctntgcctng cttcccaggg 360 tgtagcccgg gagactgcaa gcctcntaga gaagccctgc atccaacatg gcgcctgagc 420 ggtagagtag caatggctga gggaggtgtc taggcggatg ggcctgaggg cagcctcaca 480 gagtcctgcg caccccagng tggtgcgggg agagcagccg agagttcctg agccccccaa 540 gaggggcgtg agaagtgggc tgagagagcc gaggcagaca gtagcagagg ccttgcagcc 600 agggaccagg tgggacagag gagtgcctac atgcctggga ggacctggng accggaggcc 660 angnagggga ccgcggacac agcagggacc ggagncgatg cccaggacca ggcgggacga 720 gntacacctc agcggaggcc agcaaggaca gaggatggca cgtggaccag acgcccctcc 780 ccgatgccag gacgacaagg ccactgagcc cccccggaac ccagatcacc cccggggaga 840 agggaaggag ggggaaggag agaatcctga attgactgag tatttaccca aaagagactg 900 agttttaaac cggaagtgac tgagttacct tgaattggca agattaagtt ttccgccatc 960 agcggaaatg ggggctcgng agctaagttg agttcagtta tagagaaata aagaaagtta 1020 catttttgca cacctgagtt tgtgactgta aaattcatac ccgctaca 1068 // ID LTR5B_Cpo repbase; DNA; ROD; 386 BP. XX AC . XX DT 19-SEP-2009 (Rel. 14.07, Created) DT 19-SEP-2009 (Rel. 14.07, Last updated, Version 3) XX DE Long terminal repeat of ERV3 endogenous retrovirus: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR5B_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-386 RA Jurka J.; RT "Endogenous retroviruses from guinea pig."; RL Direct Submission to RR (19-SEP-2009). XX DR [1] (Consensus) XX CC ~88% identical to consensus. 
XX SQ Sequence 386 BP; 90 A; 86 C; 106 G; 104 T; 0 other; tgttatggtt gatacattgt gtcaacttga gaagtttaga agtttaactg agagactcag 60 cagggagcca gggtccttac tttgtaaata tcctcatcct gtgcaagagg agggcacgga 120 gattttgtga gtgctacacc cgccctgggg ggggggggtg gcctgcgtac aatataaggg 180 aggagaagag gcttgttttc cccccttttg ctctggtttg ctggctgctg ccttgaagtg 240 ttgccccagt gccatgccac cctgccttgg agccagctga ttatggactg aaacctccac 300 aaacagtgag ctaaataaac ctttccttcc ttcattttgg gtgtcgggta ttttgtccca 360 gcaacgagag aaaagtaacc aagaca 386 // ID L1MA5 repbase; DNA; ROD; 1042 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1MA5) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; MER14; MER27; KW L1MA5 subfamily; L1MA5. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1042 RA Jurka J., Kaplan J.D., Duncan H.C., Walichiewicz J., RA Milosavljevic A., Murali G. and Solus F.J.; RT "Identification and characterization of new human medium RT reiteration frequency repeats."; RL Nucleic Acids Res 21, 1273-1279 (1993). XX RN [2] RP 1-1042 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [2] (Consensus) XX CC Contains identical ORF2 region consensus (subfam L1M2) as L1MA9. CC ORF2 ends at bp 684; average divergence of copies from consensus: CC 15% CC Replaces MER27 (Acc. No. X12842). 
XX SQ Sequence 1042 BP; 414 A; 149 C; 201 G; 252 T; 26 other; ttaatatcca aaatatataa ggaactcaaa caactcaaca agaaraaaac aaataaccca 60 attaaaaaat gggcaaarga cctgaataga catttytcaa aagaagacat acaaatggcc 120 aacagatata tgaaaaaatg ctcaacatca ctaatcatca aggaaatgca aattaaaacc 180 acaatgagat atcacctcac acctgttaga atggctatta tcaaaaagac agaaaataat 240 aaatgttggy gaggatgtgg agaaaaggga actattgtac actgttggtg ggaatgtaaa 300 ttagtayagc caytatggaa aacagtatgg aggttcctca aaaaaytaaa aataraacta 360 ccatatgaty cagcaatccc actwctgggt atatatccaa argaattgaa atcagtatgt 420 ygaagagata yctgcactcc catgtttayt gcagcaytat tcacaatagc caagatatgg 480 aawcaaccta agtgtccatc aayggawgaa tggataaaga aaatgtggta tatatacaca 540 atggaatact attcagccat aaaaaagaat gaaatcctgt catttgyarc aacatggatg 600 aacctggagg acattatgct aagtgaaata agccaggcac agaaagacaa ataccgcatg 660 ttctcactca tatgtggaag ctaaaaaagt tgatctcata gaagtagaga gtagaatagt 720 ggttactaga ggctgggaag ggtagggrga ggggggrata gggagaggtt ggttaawggr 780 tacaaaatta cagttagata ggaggaataa gttctagtgt tctgtagcac cgtagggtga 840 ctatagttaa caayaattta ttgtatattt tcaaatagct agaagagagg attttgaatg 900 ttcccaacac aaagaaatga taaatgtttg aggtgatgga tatgctaatt accctgattt 960 gatcattaca cawtgtatac atgtatcgaa atatcacact gtaccccata aatatgtaca 1020 attattatgt gtcaattaaa aa 1042 // ID GOLEM repbase; DNA; ROD; 3029 BP. XX AC . XX DT 13-JAN-1998 (Rel. 3, Created) DT 31-OCT-2000 (Rel. 5.09, Last updated, Version 6) XX DE Autonomous DNA transposon; POGO superfamily. XX KW DNA transposon; Transposable Element; 35S; GOLEM; MER17; MER29; KW MER7; MER7B; TIGGER3. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 3029-2724 RA Drinkwater D.R., Burgoyne A.L. and Skinner D.J.; RT "Two human repetitive DNA elements: A new interspersed repeat RT found in the factor IX gene, and a satellite 11 tandem repeat RT sequence."; RL Nucleic Acids Res 14, 9541-9541 (1986). XX RN [2] RP 3029-2724 RA Kaplan J.D. 
and Duncan H.C.; RT "Novel short interspersed repeat in human DNA."; RL Nucleic Acids Res 18, 192-192 (1990). XX RN [3] RP 2713-2608 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [4] RP 2327-2128 RA Skalnik G.D., Strauss C.E. and Orkin H.S.; RT "CCAAT displacement protein as a repressor of the myelomonocytic- RT specific gp91-phox gene promoter."; RL J. Biol. Chem 266, 16736-16744 (1991). XX RN [5] RP 2787-2607 RA Rubin M.C., Leeflang P.E., Rinehart P.F. and Schmid W.C.; RT "Paucity of novel short interspersed repetitive elements (SINE) RT families in human DNA and isolation of a novel MER repeat."; RL Genomics 18, 322-328 (1993). XX RN [6] RP 2327-2128 RA Jurka J., Kaplan J.D., Duncan H.C., Walichiewicz J., RA Milosavljevic A., Murali G. and Solus F.J.; RT "Identification and characterization of new human medium RT reiteration frequency repeats."; RL Nucleic Acids Res 21, 1273-1279 (1993). XX RN [7] RA Smit A.F. and Riggs D.A.; RT "Tiggers and other DNA transposon fossils in the human genome."; RL Proc. Natl. Acad. Sci. USA 93, 1443-1448 (1996). XX RN [8] RP 1-3029 RA Kapitonov V.V. and Jurka J.; RT "GOLEM."; RL Direct Submission to Repbase Update (JAN-1998). XX RN [9] RP 1-3029 RA Smit A.F.; RT "GOLEM."; RL Direct Submission to Repbase Update (JAN-1998). XX CC 23 bp terminal inverted repeats and TA target site [7]. CC GOLEM's non-autonomous elements are GOLEM_A (MER7A) and GOLEM_B CC (MER7B) repeats. CC Orientation of the repeat has been determined based on the CC reconstruction of its internal sequence encoding transposase [8]. CC The ORF from pos 442-2307 encodes a protein 39% identical (57% CC similar) over the full-length to the Tigger1 product. 
XX SQ Sequence 3029 BP; 968 A; 580 C; 659 G; 820 T; 2 other; cagtcatgcg ccgcataacg acgtttcggt caacgacgga ccgcatatac gacggtggtc 60 ccataagatt ataatggagc tgaaaaattc ctatcgccta gtgacgtcgt agctgtcgta 120 acgtcatagc gcaacgcatt actcacgtgt ttgtggtgat gctggtgtaa acaaacctac 180 tgcgctgcca gtcgtataaa agtatagcac atacaattat gtacagtaca taatacttga 240 taatgataat aaatgactat gttactggtt tatgtattta ctatactata ctttttatcg 300 ttattttaga gtgtactcct tctacttatt aaaaaaaagt ttactgtaaa acagtatgcc 360 gtgttacacc ggcagcagcc tcatacatct cgtgtttacc gcgtctcttg attgcatcat 420 tttctcttgt gcttgattta atctcgtgtt gttttgttca tcatggcccc taagcgtaca 480 aaatccacgg ctaatgttgc cagtaagagg ccacatcgag tgactgacct ggaaacgaaa 540 ttaaaagtga ttaaggacta cgaaggtgga aaatcagtga tggttattgc tcgccagtca 600 ggcatgtccc attccaccat agctacgatc ttgaagaaca agaacaaagt gacagaagct 660 gttaaaggat ctgcttcatt gaaggcaacg agactaacaa aaattcgaga agggcctata 720 tcagatatgg agaaacttct aatgacctgg attgaagacc agacacagaa gcatatccct 780 ctcagcacca tgacgatcac ggccaaagca aaaagtttgt ttgcgatgtt gaaagaaaag 840 gctggacccg actacgatgt tgaatttact gctagctctg ggtggtttaa acgattcaag 900 aatcgttatt cattacataa tgtgaaagtg agtggtgagt ctgcgagtgc tgatgtgaag 960 gcagctgaag aatttttgga aactctagat aagctgattg tggaggaaaa ttacttgcca 1020 gagcaaatct tcaatatgga tgaaacctcc ctattctgga aacggatgcc tgaaaggact 1080 ttcatccata aggaggccaa gtcaatgcca ggtttcaagg cttttaagga caggataaca 1140 gtcttgcttg ggggcaatgt tgcaggctac aaattgaaac cctttgtgat ctggcacagt 1200 gagaacccca gggccttcaa gcatatcaat aagcacacac tgccagtgta ctacaggagc 1260 aataagaagt catggatgac ccagctcctc ttccaagatg ccctcctgaa ttgctatgcc 1320 agcgaaatgg agaagtactg tttggagaat aacatacctt tcaagatttt gcttattgtt 1380 gataatgctc ccgcacatcc tccttttatt ggtgatcttc atcccaatat caaagtggtg 1440 tttctccctc caaacaccac ctctttgatc caaccaatgg atcaaggagt tatagcagct 1500 tttaaggcct actacctgag gaggaccttt gcccaggcta ttgctgcaac tgaggaagac 1560 actgagaaga cactgatgca attctggaag gattacaaca tctatgactg catcaagaac 1620 cttgcttggg cttggggtga tgtcaccaag 
gagtgtatga atggcatctg gaagaagaca 1680 ctcaagaggt tcgtccgtga cttcaaagga tttgccaagg atgaggaggt tgcaaaaatc 1740 aacaaggctg tggttgagat ggcaaacaac tttaacctgg gtgtggatga ggatgacatt 1800 gaggagctcc tagaggtggt tcctgaggaa ttgactaatg aggagttgtt ggaactggaa 1860 caggaacgca tagctgaaga agaggcaaga gaaaaggaaa ctgcaggaga agaaaaagaa 1920 gaacccccaa gaaaattcac agtgaagggt ttagcagaag cttttgcaga cctcaacaag 1980 ctccttaaaa agtttgaaaa catggacccc aacaccgaaa ggttttcatt aatagagagg 2040 aatgttcatg gtgcattatc tgcttacaag caaatctatg atgaaaaaaa gaaacaaacc 2100 aagcaaacca ccatggacat atttctgaaa agagtgacac ctcctcaaga agagcctcag 2160 gcaggtcctt caggaggtat tccagaagaa ggcattgtta tcataggaga tgacagctcc 2220 atgcgtgtta ttgcccctga agaccttcca gtgggacaag atgtggaggt ggaagacagt 2280 gatattgatg atcctgaccc tgtgtaggcc taggctaatg tgtgtgtttg tgtcttagtt 2340 tttaacaaaa aagtttaaaa agtaaaaaaa aaataawttt aaaaatagaa aaaagcttat 2400 agaataagga tataaagaaa gaaaatattt ttgtacagct gtacaatgtg tttgtgtttt 2460 aagctaagtg ttattacaaa agagtcaaaa agttwaaaaa attaaaaagt ttataaagta 2520 aaaaagttac agtaagctaa ggttaattta ttattgaaga aagaaaaata ttttaaataa 2580 atttagtgta gcctaagtgt acagtgttta taaagtctac agtagtgtac agtaatgtcc 2640 taggccttca cattcactca ccactcactc actgactcac ccagagcaac ttccagtcct 2700 gcaagctcca ttcatggtaa gtgccctata caggtgtacc attttttatc ttttataccg 2760 tatttttact gtaccttttc tatgtttaga tatgtttaga tacacaaata cttaccattg 2820 tgttacaatt gcctacagta ttcagtacag taacatgctg tacaggtttg tagcctagga 2880 gcaataggct ataccatata gcctaggtgt gtagtaggct ataccatcta ggtttgtgta 2940 agtacactct atgatgttcg cacaacgacg aaatcgccta acgacgcatt tctcagaacg 3000 tatccccgtc gttaagcgac gcatgactg 3029 // ID RLTR32A_MM repbase; DNA; ROD; 611 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 14-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW retrotransposon; RLTR32_MM; RLTR32A_MM. 
XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-611 RA Jurka J. and Drazkiewicz A.; RT "RLTR32A_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 17-17 (2002). XX DR [1] (Consensus) XX CC 83% similar to RLTR32_MM (bases 89-596). XX SQ Sequence 611 BP; 166 A; 143 C; 118 G; 184 T; 0 other; tgtgctcaca gataagcact agggaaacca ggaatgttaa acacacaggg ttgtctcttc 60 aaagggagag atgtataacc tgttttctac ccagaaggct gggagtggat gtggttaata 120 gcctggtgac ctttgtgcta tctctgtatg gccagaccac accagatccc ccagaccctt 180 atcatgagct ttttttccct agacccttat catgagcttt gtttctcaat ctatagaacc 240 catctttatg gaagatgtca catgtactta tagcctggtg acctctatac tatctaggtc 300 ctagaccaca ccagatccta ggctcttaac tatagaaccc atctttatgg tcacatgtac 360 tttacaaaaa gagtttcaat gtagtcatgt cttaagcttt attgtgcaca tgggattgag 420 ttaattatca ccaggaaaat tttcaccaaa ttgtgctgtg cttaaatatg cctaaaataa 480 actactcagg gtcagacact ctagaagttt gaaccaacac tggctactga gttgtgttga 540 actgaactgc cttcttgcct cccacagatc gttactcttg ctggccttgg tgttcacaga 600 gacccctcac a 611 // ID MER74 repbase; DNA; ROD; 624 BP. XX AC . XX DT 18-APR-1997 (Rel. 2.03, Created) DT 07-OCT-1998 (Rel. 3.09, Last updated, Version 4) XX DE Putative long terminal repeat of endogenous retrovirus. XX KW Endogenous Retrovirus; Transposable Element; putative LTR; KW retroelement; MER74. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-624 RA Lee I., Westaway D., Smit A.F., Cooper C., Yao H., Prusiner B.S. RA and Hood L.; RT "Complete Structure and Organization of Three Mammalian Prion RT Chromosomal Regions."; RL Direct Submission to Repbase Update (30-NOV-1995). XX DR [1] (Consensus) XX CC Putative retroposon LTR; possible poly A signal at 537-743. CC Subfamilies exist. 
XX SQ Sequence 624 BP; 171 A; 114 C; 194 G; 133 T; 12 other; tgttttaaaa tgtaggtttt attatttagg tatgttragg ccaacagatc aggagacgac 60 tgccattgaa aagacagttt gttattctca cagatcccaa gagaaggggg catgccgcgc 120 cacggggagc cccacgggga agcaccgggg tcggtcagga ggcagaagga gcgacgggaa 180 aacgtrggca agagccttta ttgtggtttc catgggaagg aatgrgcgag gcagggtaag 240 caggcttagg attggctagt ttgaataatt tcagcaggct ctggggyata ggggytgtcc 300 ctagttgcct ggtncctggc cctgggatga ttagggcagg gggatagtgg cccagagtgt 360 aagagccaaa tagaggaggt ggttgggggt atgggctctg gattggttgg tttgcatatg 420 aaaggcgcgc tcccgggcga gtccttcgct atctctaaga attggctagc cctgggaggg 480 gcagtctctc cgggrtcagc aaggccccaa gatgtcaaar catyataaaa tacagaaaat 540 aaaaaagcat gattaataca akytgctctg tgatgaacgg atgccaaata gwcgaataca 600 gaatctaaga aaacacagaa caca 624 // ID HERVK3I repbase; DNA; ROD; 7243 BP. XX AC . XX DT 17-JUL-1998 (Rel. 6.5, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE HERVK-related endogenous retrovirus flanked by LTR3s - a DE consensus sequence of the internal part. XX KW endogenous retrovirus; HERVK superfamily; LTR3; HERVK3I. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-7239 RA Kapitonov V.V. and Jurka J.; RT "HERVK3I."; RL Direct Submission to Repbase Update (JUL-1998). XX DR [1] (Consensus) XX CC Average similarity of HERVK3I individual copies to the consensus CC sequence is about 90%. CC As other members of HERVK superfamily, it has 6 bp target site CC duplications. HERVK3 is flanked by LTR3s. 
CC Similarity of HERVK3I consensus sequence to the known CC retroviruses CC is shown below: CC ---------------------------------------------------------------- CC start end RETROVIRUS start end identity CC ---------------------------------------------------------------- CC HERVK3I 381 452 HERVK 281 352 0.71 CC HERVK3I 1539 1809 HERVK 1772 2026 0.70 CC HERVK3I 1937 2523 HERVK 2194 2779 0.61 CC HERVK3I 2684 3273 HERVK22I 2334 2914 0.65 CC HERVK3I 3335 3992 HERVK9I 3015 3664 0.61 CC HERVK3I 3996 4464 HERVK 4240 4702 0.58 CC HERVK3I 4588 5193 HERVK 4826 5434 0.64 CC ----------------------------------------------------------------. XX SQ Sequence 7243 BP; 2190 A; 1569 C; 1503 G; 1979 T; 2 other; tagtggcgcc ccgaacagcg acagaatcag gcgctcaaca agtggcatcc gaacacaggg 60 actttgagga cgtgaacgaa gaaggtctgc tggagcagag aaactgaaat tgacaagacg 120 aatggggacc ctgggatgag tctgctggca gcagatataa ggtcagtgcc ctaacgaggt 180 actgggagca atataaggtc agtgccttaa agaagtactg ggaatgggag tttttctgaa 240 tcggaggtaa catggggcag aatttgtctg ttgaggaaaa lacattatcg tgcagttgct 300 taaagttttg ttgaaacaat ctggtgctca ggttaattct cagacattaa ctaagatgct 360 gcaggaggtt attacgcata acccatggtt tccacaggca ggcactcctg atgtagaaaa 420 ttggcacaga gcaggagaag gattaaaaca ggctcatcaa aaaggtctta aagttgattc 480 ttctgctttc tccactagga gtttagttca tactgtcctt ctgccattat atccttttta 540 ttctgctgga cagcaggagt catgttctga gtctaaaaat ctgaaagaat ctgttgtccc 600 acccacagca ccaattgaaa ataaaaaaca ggagagggag gataaaaatt ggcctatacc 660 gccccctcca gttgcagaaa catctgtacc gcctccttca gtagccgaaa tagagacctc 720 aatacaaaga attttatgct ctgctgccat agctggagag cccttaggac ctctgcactt 780 ttcctatttc cgtaaggcct gatccaaaca atccacagca gtttattcat gaacactccc 840 cactagagtt tacgttgttg aaggaattaa aaattaagtg taattaataa tgggatacag 900 agcccattca ccttaggatt gctagaatct gtatttggtg ctatgcgcct tctacccttt 960 gatgtaaaac atttggctcg cacttgtttg tctgctactg catacctgac ttggaattta 1020 aattggcaag aaatgtgtgc agaccaggct agacagaatc atgcttctgg acacggagac 1080 attacagagg gtatgctgtt aggtaatggc 
cctttattca gacctggcat gtcaaatggc 1140 actcccagat cctgcttatc agcagtgtgc acaggctgct atgcacgcct gggccacaat 1200 tccagaagag agagtcccag tacaatcctt tttacatctc atgcaagggt cacaggaacc 1260 ctacgtgcaa tttcttgcaa gattacaaga ggcagtgaag catgaaattc ctcataccgc 1320 tggcacagaa atgctaacct taactttagc ttttgagaat gcaaacgcag attgtaaacg 1380 tgcactggca cctgtgaggt gtaacaaaac ttgggaaatt ttctcagaac ttgtcaggat 1440 gtagaaactg agcttcattg ctctgcaatt ttagctcaag caatggctaa tttagtagtt 1500 gacaaatcta aaaggagccg acggtcaaac cctaaagtgg gaaaatgtta taattgtgga 1560 aaaactggac attttaaaaa ggaatcatga ctgatctcag ggcagaaagg accttataat 1620 gtggtgccct ccacccccat ggcccagcgg aaaaaaacgc caggactctg tcctcactgt 1680 aacaaaggaa atcactgggc tattcaatgc cgctcaaaat ttcatcaaaa ctgcaaccac 1740 ctgtcaggaa acgagaaggg ggcctggacc cgggcccctc aaacaatgag ggcattccca 1800 gttcagacca caaccccact tcaggggtgg gtcccaggag gaacattgat tccctcaccc 1860 caggaacacc aggaagtgca ggattagatc ttccagtcag agaaagaatt acattaattg 1920 gtggagacaa acctatcaaa gttcccattg gcatttgggg acctttacca gcaggataca 1980 gtagactaat tttaggcaaa agctgcctta acttgcaagg cattactgta gtcccaggag 2040 tagctgactc tgattatgaa ggagaaattc aagtagtttt aatgtcacaa gatctttggg 2100 tttttgaacc ggaagaatat attgctcaat tattgcttat tccctgcaaa ttacaccctt 2160 ctccataaaa ggagaaacga ggaaataaag ggtttgggag cacaactaca tgagaaatct 2220 aatgattcac aacctatagc ttataataga cccacctgtg tagtacaaag taaaggaaag 2280 aaattgtatg ggcttatgga cacaggagct gatgtgtcag taatatccag taaggactgg 2340 cccccagcat ggcctctcag actaacctcc acatccctag tgggagtagg agcagctaaa 2400 agtgttcaac agagtgctga gattttacct tgtcttggtc cggatggaca atcatgtact 2460 ttccagcctt atgatgcaaa tatagctatc aatttatggg gtcaagaatt acttacagca 2520 tgggatatga gacttacaaa tgaaaacttt cataacccag gatttaaaat gttgaaggac 2580 atgggatatc agagtggaaa aggtttaggg aaattcctac aaggaaaccc taacccgata 2640 tctataactg gagaaacaga tagaaaaggg caaggatgtc aggatttctg atggggatca 2700 ttgatatttc tcctcgaccc actgccttac cattagaatg gctttgtgac aaacctatgt 2760 gggtggatca atggccccta acacaggaga agctagatca 
acttcatctg ttggtaaaag 2820 aacaattgaa tgcaggacat atagagaaga gtttcagccc ctggaattca ccggtatttg 2880 ttattccaaa aaagtctgga agatggtgac tactacatga tttgagagtt attaatgcgc 2940 aaattaaacc aatgggtgcc ttacagcaag gtctaccttc cccagcagcc attccaagag 3000 acaggcctct tgtagtaata gatcttaagg attgtttctt tactatacca taacacgaga 3060 aggataagcc tcaatttgcc ttctctgtgt cttctattaa tcatagagaa cctgtctctc 3120 gctatcagtg gaaagtttta ccccaaggca tgcttaacag tcctacatta tgtcagcatt 3180 ttgtaggaag agcattaaag gagccttgaa atatgtttcc cactgtctat atcattcatt 3240 ttatggatga tattcttttg gccgctccta cagatcaaat tttacatcag ttattcagag 3300 aaacaaaaca ggccttaact aaatggaatc tcaaaattgc tccagagaag gtgcaaacaa 3360 cttccccata ccagtactta ggaactattg ttatggaggg gagtgtacgg cctcagaaag 3420 tagttctccg taagggcagg ttacagactt tgaatgattt ccaacaatta ttaggggata 3480 ttaattggct gtgcccaatg ctaggtattg ctacttatca actcacacac ctttatcaaa 3540 ccctccaagg agattcttca ttagattctc ctcggcaatt tactaaggag gcagaagctg 3600 agttacagct tgtagaacag atgcttcagc aacaacatgc ctcctggcta cagccacaaa 3660 agcctttgct tttgtttatt cttcctaccc cccattctcc aacaggactt ttaggccaat 3720 tcatagacaa atctgtaatc gtaatagaat ggctctttct atctaatcag tgaaatcttt 3780 gcaagtttat ctttctttaa ttactcaact tataacaata ggtaggcata ggtcaaaaat 3840 gcttatggga tatgatccag ataaaattat tgttcccttg gattcccaac aacaggccac 3900 agcatgggaa atgtcgactg catggcaaat cacttttgca gattttgtgg gaataataga 3960 taaccattat ccatcagaca aaattttgca attttataaa gttcaccctt ttatccttcc 4020 tgtaattact catcacaagc ctattccagg tggacagact tattttactg atggctcttc 4080 taaaggccgt gcagctattt atggacctaa acatactcaa acaataatga cctctggggt 4140 ttcagctcaa cgctcagact taattgcagt cattcaggtt ttacagctga cagcttcaga 4200 tcctatcaac attgtctgta attcagctta tgttgtaaat gtagccagtt gcatagaaac 4260 tgctacaatt aaaaatacac tagacccaga actgcttaat ttgtttctaa gacttcacac 4320 agctattggc tctccttcgt ctccttttca tatttctcat attcgctctc acacacaact 4380 tcctggacca ctatctctag gtaatgatag agcagataaa ctgatgagtt ctgtgtttca 4440 gcaagctcaa gcgtctccat gcatttctgc accaaagtac ttctgcctct 
actcgcatgt 4500 tccatttstc tcgcagccaa gctagggcta taatacaagc ctgtcctact tgccagcatg 4560 tccctggagc cgcacctgta gaaggttgta acccatgagg tttggctcca aatgaaatct 4620 ggcaaatgga tgttacacaa gtagcagcct ttagtaaact tagctatgtt ctacgaatta 4680 tagacactta ttctcatatg ctgcatgcta catgccaaac aggtgagaca gctggtcatg 4740 tacggcaaca ttgtttgtca tcatttgctc atatggggat cactaaacaa ttaaaacctg 4800 acaatggacc agcttatact agtcatgctt ttcaaatatt cttacagctt tgagctataa 4860 cccataaaca aggaatccat tataatccta gaggacaagg aattatagag ttggcacatc 4920 aaacattaca acaagtgttg aaaaaacaga aagggaggga taggagacca cttcacacct 4980 caaacaaaac tacacttagc cttattatct ttaaattttt ttgactcctg gtagagatgg 5040 taagactcca gcagaaagac attggcaagt gttagaggaa aagaggaaag tttatccgaa 5100 agtgttattg aaatcccccg gagaagagac aatggaaagg tctgttggat ttactgacgt 5160 ggggatgagg gtatgcttgt gtttttcagg gagatggaca agccgtgtgg gtgccctcaa 5220 ggtggatgtg accatggaac gggagactag aggaacccag ggtggccaac tatgggcctg 5280 gtccctctgg tctgagccat gagccagctg agctagagtg caaagatgga gagaaggccg 5340 accggagtcc agacgacatc aacccccata acctgggagc aactcaagaa taccaatcag 5400 gaagctggga aactactgga gcatcagagc caggcaaaac accctgattc catgttcttg 5460 gccatgttag tcataatgtc ctgtgtggta tgttttccct gtgcagaggc aaaaacattt 5520 tgggcatatg ttcccaatcc cctagtagta caacctatgc tttggagtga cactcctcct 5580 gagatttatc gtgatcagga agtatgggct ccaggacccc taactcccct gacaatagaa 5640 cagttagact ctcagaacaa tgtcattaat tatacgaccc cacgagaagg actcctcttg 5700 tgtatcacta caaagacatc gcttaactgt agctgtctta taattcaagc tcaacaatgg 5760 ttgagtcact atggaaaagt catgtgccta ttaagtcttg gttctattaa tgtaacaggt 5820 gtgctaacca accattcccg gcccaatcac cctaatcgtg ctgactatat ggaatggatt 5880 cccttcgata gttactaccc cccctcacat ggacccaatg tcttgaccca ctggctagaa 5940 aacaatctat gttaagtgga gacattgtgg attggggacc taaaggtctt ctgtatggaa 6000 gacatgaaaa tcagaaatca tggcacaaac ttcgctggca ttggtgggaa gattttaaag 6060 cttcttcttt ataccacacc gggatctaat cccagtctgc cacccagatt gcttgacatg 6120 gagcaggctt tagcccgcct cttcctcagt ggcattatct agggaggaaa ggaccaatcc 
6180 aagagatgtt atggaaggca gcactcccat ttatgaatgg agcatctggg ttcgggatac 6240 tatccagtga tagcaatagt aagcaacaca gtcttaatgt tacatttgta aagaatatca 6300 ccactcaatt tatggtttgt ctttttaatc cttatgcttt tttggcgact aagaaggacc 6360 agctccaggt aaacaatatc caattgacct gtaaatcttg ccagttatgt cactgcatta 6420 atcatagcac attgcaaaca cataatgtct ctactttgat aattttgggt cgcatccctg 6480 ggctatggat tcctgttaat ctgtcccagc cttgggctac cacacctgct ttgcacttta 6540 tgaaacatct tctaactcag cttactcatt gtgcccgtag agccttaggc atgataattt 6600 ttgctattgt ttccttggtc acattaataa cttccgttgt gatgtcctct gtagctttgc 6660 atagttctat tcaaacaact cagtacatgg aaaactggat gcgtatagcc aaccaagcat 6720 ggccacttca gaataaaatt aacactgagt tacaaactga agtggcattg ttgaaatcca 6780 cggctctatg gttaggagaa caagtacaaa gcttgcaatt gcaacagcaa ttgcatgatc 6840 attttaatca cactcatatt tgtgtaacca acttagaata taaccaaagt gagtatccat 6900 gggaccttgt gaaagcccat ttgcagggag cttgcacatc caacatcacc tttgatatcg 6960 gtgaattaca aaacaaaatt cttgatttaa atggacaaac tcaagagttt cagccttctt 7020 tagaagacga gaccaaattc cagcaaggcc tggagagcct caacccttgg accagtctaa 7080 agcaccacat taacatctta tatgtagtcc ttggaataat gttgttttgt ctctgtcttc 7140 tgttcatagt ctgtaaaact ggatggactg ccaatcagaa aatgagagct gcccagcctg 7200 accttacatt ctttcaatta attcataaac agaaaggggg ata 7243 // ID LINE2 repbase; DNA; ROD; 2750 BP. XX AC . XX DT 13-JAN-2000 (Rel. 5.2, Created) DT 13-JAN-2000 (Rel. 5.2, Last updated, Version 1) XX DE MIR2/LINE2 repetitive element - a consensus. XX KW LINE; L2 family; MIR2; MIR2/LINE2; LINE2. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-2750 RA Degen J.S. and Davie W.E.; RT "Nucleotide sequence of the gene for human prothrombin."; RL Biochemistry 26, 6165-6177 (1987). XX RN [2] RP 2601-2750 RA Smit F.A. and Riggs D.A.; RT "MIRs are classic tRNA-derived SINEs that amplified before the RT mammalian radiation."; RL Nucl. Acids. Res 23, 98-102 (1995). 
XX RN [3] RP 1-2750 RA Smit F.A.; RT "LINE2."; RL Direct Submission to Repbase Update (1996). XX DR [3] (Consensus) XX CC 24 bp upstream of NcoI site; chromosome 11p11-q12. CC This sequence represents the 3' part of an ancient LINE-like CC element CC which was responsible for the amplification of the MIR elements. XX SQ Sequence 2750 BP; 593 A; 941 C; 336 G; 815 T; 65 other; ctgtcattca ttaccgntcc ttgttgcagt catccatcga gcccggctcg gtcactccct 60 ctcatttctc gaagaktttr tctcctagnt caactgtcac tctctcataa taatantcct 120 gtcataattc ttggtgattt caatatccac atagacgatc catccaatac tctggcctct 180 cagttccttg acctcctctc tcccgtggtc ttgttctcct ctacccactc tcagccactc 240 actcccatgg tcatacccta gatcttgtca ttactaataa ctgcaactcc tccataatct 300 caatttcaag catccyactc ttttaccacy acctcctatc tttctagctc actctctctg 360 gtgccctaac tccaataaty ctttgacccc accgagacct mcaatccatt gatctcatat 420 gcttttcact gtnccgtgtc ctcttccctc ctttcwttgc tcagattcca tggtcaatca 480 ttataatcac tcccwtacan ataccctcaa ctcccttgcc cctytctcmc tttgtcttac 540 ttrcttgnca aaaccacaac cctggttaaa tccaactntc cgactaattt rcgcctgcac 600 ccgngcagct aaacatggct gragaaaaat acacaaccat gctgactggt ctcactttaa 660 atttatgacc acgaacctca agtgrgccct taatgctgcc aggcaattnt actacatttc 720 cctagtccat tcactctccc actctcctag atgactattt yatactttct tnyttctcaa 780 acctccaaca yyyyytncca tctttactct cagctgatga ccttgcttcc tatttcactg 840 agaaaataga agcaatcaga agagaatttc cacatgctcc caccaccaca tctacccacc 900 tacctgcatc tgtacccata tactctgcct tccctcctgt taccgtggat gaactgtccg 960 tgctcctatc taargccaac ccctccactt gtgcactaga tcccatcccc tcttgcctac 1020 tcaaggacgt tgctccagca attctcccct ctctctcctg catcatcaat ttttccctct 1080 ctactggatc attcccatca gcatataaac atgctgnnat ttctcccatc tttaaaaaac 1140 aaaaattctc cctcgacccc acttccccct ccagctaccg ccctatttct ctgctcccct 1200 ttacagcaaa acttctcaga agagttgtct atactcgttg tctccacttc ctcacctccc 1260 gttctctctt aaacccactc caatcaggct ttcgtcccta ccactccact gaaactgctc 1320 ttgtcaaggt caccaatgac ctccacgttg ccaaatccag tggtcagttc tcagtcttca 1380 
tcttacttga cctctcagca gcatttgaca cagttgatca ctccctcctt cttgaaacac 1440 tttcttcact tggcttccag gacaccacac tctcttggtt ttcctcctac ctcactggcc 1500 gttccttctc agtctccttt gctggytcct cctcatctcc ccgatctcta aatattggag 1560 tgccccaggg ctcagtcctt ggacctcttc tcttctctat ctacactcac tccctgggtg 1620 atctcatcca gtcycatggc tttaaatacc atctatacgc tgatgactcc caaatttata 1680 tctccagccc agacctctcc cctgaaytcc agactcctat atccaactgc ctactcgaca 1740 tttccatttg gatgtctaac agrcatctca aayttaacat gtccaaaact gaactcctga 1800 ttttcccccc caaacctgct yctcccgcag tcttycccat ctcagttaat ggcaactcca 1860 tccttccagt tgctcargcc aaaaaccttg gagtcatcct tgattcctct ctttctctca 1920 caccccacat ccaatcyatc agcaaatcct gttggctcta ccttcaaaat atatccagaa 1980 tccgaccact tctcaccatc tccactgcya ccaccctggt ccaagccacc atcatctctc 2040 acctggacta ctgcaatagc ctcctaactg gtctccctgc tyccaccctt gcccccctnc 2100 agtctgttct cagcacagca gccagaatga tccttttaaa atgtaaatca gatcatgtca 2160 ctccyctgct caaaaccctt caatggcttc ccgtttcact cagagtaaaa kccaaagtcc 2220 ttaccgtggc ctacaaggcc ctatatgatc tggcccccgc ytayctctct aacctcatct 2280 tctaccactt tccccctcgc tcactctgct ccagccacac tggcctcctt gctgttcctc 2340 gaacacgcca rgcwcgttcc tgcctcaggg cctttgcact tgctgttccc tctgcctgga 2400 acgctcttcc cccagatatc cacgtggctc actccytcac ctcmttcagg tctctgctca 2460 gatgtcacct yctcagagag gccttccctg accaccctat ctaaaatagc acacccyctc 2520 ccccatcatg cccatytttc ttaccycgct ttatttttct ycatagcact tatcaccatc 2580 tgacayacta tatwatttay ttgtttgttt gtttrttgtc tcctccacya gaatgtaagc 2640 tccatgaggg cagggatttt gtctgttttg ttcactgctg tatccccagc gcctagaaca 2700 gtgcctggca catagtaggc gctcaataaa tatttgttga atgaatgaat 2750 // ID RLTR19-int repbase; DNA; ROD; 6137 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 19-JUL-2009 (Rel. 13.08, Last updated, Version 3) XX DE ERV3 Endogenous Retrovirus from Muridae. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW RLTR19-int. 
XX OS Muridae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea. XX RN [1] RP 1-6137 RA Smit A.F.; RT "RLTR19-int - ERV2 Endogenous Retrovirus from Muridae."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC (5 bp dups though) some copies as little as 7% div; pos 1-3624 CC similar to ERVK elements (closest to co-hybrids RMER17C-int and CC RMER16-int, and to ETnERV3 and MYSERV); 3' end has patch-like CC similarities to ERV1 group internal sequences (!) The internal CC sequence at pos 1433<-1711 contains an ancient fragment of an CC hnRNP core protein A1 pseudogene. XX SQ Sequence 6137 BP; 1826 A; 1247 C; 1231 G; 1782 T; 51 other; tttggtgcgt tggccgggaa gcaagccccc taccctcgag cacccctcag ttactgccag 60 tgaatgccac caagccggct gggccctccg tgaggaggta agtttcctgt acctgtgcag 120 ttcttttttc ttgtttttgt tgtttctggt ttggttggac tttcggagac ccaagagagg 180 agagctggac gcagaaatnt ccttgggtcn gaggaggtca ggagacgtcc tgcttcctct 240 ctggttccca ctgggagaac gcctggttct ggttctgggt ccctttggtg ggacatctgg 300 gtccctttgg tgggacatct ggntctcttc cctttcctcc ccaactctat tggttgtcct 360 tggcgcgctg gtctgtccat gtctgtcagt ttatgtctgt agtttcgttt nattgcttga 420 ttgttgctct gtgtttcata ttcagaggaa aaatggttaa aacttcatct gctggccgtt 480 taccccttga cttagtttta actcatttca aggattttaa gaaaaaaagc agcctgaatt 540 gggacaacaa aaactgataa actgcccttg gagccctgca cctgcgctag gagtctagca 600 ctgctggaga agccctgcca gggctgcttg gctgtacagc cgctcttaaa ggggncagca 660 gcttctcggc ttctctggta agaaagctga tggctagttc tacaaaatct ctgtgcccat 720 gagaattgga ttcaacaggt ggcaaggtgc tcctcccttg aaattacagc tagaaaaaaa 780 gaaaatttct gtctgttcta aatgtataaa tgtatgtggc cactacatnt ttgtttctga 840 ctggtcttaa atgtataagt ttactatgtt ctgcatgtct tggttataga ttattggctt 900 ataagttatt gggtatggtt aaaaatctgt aacatcngta acaaaaagtt agcttaaaac 960 nagtaactca gggttgaagc cattctgggc aacaacacac ggcatgagcc aacccaggga 1020 aacaggtctc taaagatact tttaaattgg 
gataatattt ttatgtaatt cctgtcctaa 1080 aaaccagact tataaaagat aggatttaaa aaatgtttct ttaatgaggt attaaagctg 1140 caccttcatg cattcatata caagaacttt tcttgctggc agccaaactt tgtaacatca 1200 aaataatgca cttggtattg attttagaga cactggattg tttaaactgt taaaaattaa 1260 aaattggttt tgtcttctaa aaattatggt tatgctctat gacttcactc tttaaaaaga 1320 tactttattt tgatattgca aaagcaactt taaaaattat agttaaatat ataaagctat 1380 gggacatttt aaaatttatt aaatgtttta ttggaatgtt ttattaatat atgatggtta 1440 caataaagga ggaaatttta atgaaataac tataatgatg atngaaacta taatgacttt 1500 agaaattata atggacaaca gcaatcaaat tatggaccca tgaagggggg acaattntgg 1560 tggaagaagc tcaggcaatc cctatggtgg tagctatgga tctgatgatg gaaatgatgg 1620 atatgatagc agaagtttta aaataaaaca gaaacgggta cagttcttag aggagagaga 1680 atgaggagtt gtcaggaaag ctgcaggtta ctttgagaca gtcgtcccaa atgcattaga 1740 ggaacantaa aaatctgcca cagaaggaat gatgatccat agtcagaaaa ttactgcagc 1800 ttaaacagga aaccttcttg ttcagactgt catgccacag tttacaaaaa atacagctat 1860 tgattaatgc aatatgatgt cagttagata tacattcctg aggntttttt atctgttgta 1920 gctttgtctt tttcttttca ttacgtcagg tatattgctc tgtaaattat ggtaatgata 1980 ccaggaataa aaattaagga atttgttaat ttaaaanttt ttagaggttt acaatattaa 2040 aaaggttaag aatcactggc tcttgaattt gcctgagctc tggcaaggct ccagcatgcc 2100 cgagtcagta aggctctttc agccgaggtc ttgcagtttt tcccaactgt taaccttttc 2160 tgtcctgaca ctggtttcag cttaaactga atcatatgag aaactgttat ctctctcaga 2220 aggtcagaaa agttcctagt ctctttatgg agtatgttta tgggtttttt ttaatactag 2280 aagagcttca attcaaaact gtaattttaa ggttcaagcc taacagggat tgatagtcag 2340 taaccttgaa ggtgatcaaa tcctttaata tgttcagaaa tatatttaaa gtcatgctaa 2400 gtactgatgc agttaattnc aagattaaag ctttatttag tctcctgttt tatgtttgna 2460 aggtacagct tagagcagat aactaagaac aaacaaagnt tgtttaactc agatatgcta 2520 ggtaggtact agccctcaaa ccagtcagag atctgctgaa tatggcattt aatatgttta 2580 aacttaccat aacagacaga gactcccaaa tcctaacagt gacccccaag gtctccaaga 2640 agatatgggc acaacgacaa aggacaccac cnggattgtg gtatgataac cactgggcat 2700 aactgcccca atgccttgcc tgctgccagg gcccagcctg 
aactgtggac aaacagagga 2760 caactggaga attgattgcc acacgttgcc taagacaagg tgaggtcagt ctctcccatg 2820 ttcctcctcc acaggaaaaa gcctcttcat cttctgggcc tgatggccaa agactgcctc 2880 tgcccttggt gccatagaga cacaggagcc tgggataact gtctaggtaa taaaatatgg 2940 actagtctgt cattttaatt gatacatagg ccatttagat tacaatttat ccttctcaga 3000 tctctgatca cattgatggc taagctaatt gtagcttgac agctagaaaa caataggcaa 3060 ctagctcctt acctcaaggt aatccaccac tgtgntagtt cattagtcaa ttcattagtt 3120 agttaaaact actaggtctc ttattaaaaa gggcaaacag gtttgccttg ctcacaggct 3180 tcatattaaa tgactgctct caataatcaa ctgcctatgt ctcatagtaa ataattggat 3240 taaaaatata aaatttaann ttatttaagg tctagaaaaa tgtttatggg tctagaaaaa 3300 tgtttgagat tgaaaatgca gtgataaagg ttagaggata aaaaacttat gatggctaga 3360 aaatgnttta aataagaatc ttcaataaaa atgttaaggt tggtaaatga actaagattt 3420 aagggtctaa gaagatgttt taggtatata aatagcaagt tatagaggta taaaaagtaa 3480 tttaagaaat ggaaaatgtt tcatattccc ccatgctatt gttatttcaa agttcagaat 3540 tttaacattg atcaatggag ttctgataag ctaatggagc actggcagct tacaattcag 3600 tcacaagctc aagattttaa attccctttg gttgtcttct aaatatagac ttaaaggtgc 3660 ttctcatcat gctaaaaaca tctctgttca atctgtatat ctagccctct ggactgagaa 3720 aagaatgttc tgtgcctttt gacccaaacc tatagctttt actaggtctc aaaggcatga 3780 gtctactcct gttgtcttac agacacctgt taactacttt ttctaaaata tggatttcaa 3840 tacaatggta aatactaatg tatatccttt tgtaaactaa agttcacata agcttcaggg 3900 aatcaaaaag tcataagatt tggatccacc tgccacagat caaatggact ccagataagt 3960 acacctgccc tgttcttgaa aatgggattt ttcccttggc cttgggattc cttaccttcc 4020 ccaattacta gacagtttta acttctgtcc ctagtctata tttgtctcag cagattttca 4080 cctggctgac agactccatc cagggatcac ccaatgnanc tgctgaactc tggacttgct 4140 gaaagctgat gttaaccagt ccagctgatg tgattgctcc ctgtctttca tctggatcag 4200 ctaatcagat cagatgcttc tgataaatgc cccattgccc agcctttgac tagcatttca 4260 gccttcctgg gcccctntga caacgtccta atgtcagcng gaagcagtta cagaagagaa 4320 atacgncgtc cattgtccca ccttacaggc tgaaatgcta agtcaaaagg aanccccctg 4380 ggnccacgct gaaaaggacc ccacttcntg atcctgacca ccccaatggc 
catgaaaata 4440 gatgggatnc aatcctggat tcatcactct catctgaagt tcaccccctg gaaatatcag 4500 gaccgggacc agcgatgggt catcagacaa cacccacagg acccattaaa aatcagactg 4560 gtaaaagact ctggagacta gttttctntt ttgtgttgcc ttgggctgct gaggcacatg 4620 ctccagtaca acacatatga actttgatta gaactacaga caggagccta attactaata 4680 tcactgtcta tggttccccc accctcacct ttganttatt tgggccaaaa tggaatangt 4740 gcccaaagac tgcaaacata tatgttccct cattctctag gtcttttaga tataggatcc 4800 agaatatgga aaaagtaatt tggccaccct ttagatatat gcctgtccct caaatggaga 4860 ccctgattgt aggggacaag atatgtactt ttntgctata tggggctgtg aaacattagc 4920 tccctgggta actgataagg ataattatat tcagttacag agggttgagg gccactctga 4980 aaaatctggg aaaagaaatc aaccccattc aaattaagat taaaacgata taaataatgt 5040 agganatagt acagggaaac agaggatgtg ttttngataa ggccacctgt tgagaacaat 5100 tatagcccct atgggaatgg tcattacaac cacctggaag aggaaaaaga aatgtcccgt 5160 ccccacctag acaaaggaca aatgacncaa taaacacctt tggtctgaga gtgccagaaa 5220 ggggccctcn ctggcacaga aaaggacagc cagccctcat ctgcccagac atggacttgg 5280 ttacctccac tggccaccat agcccgctct gggtaccgtc acctctgcct ctgacattcc 5340 tgccccggta caagctttgc tngagaccaa cactgccagt cagacnccag ttcaggcctg 5400 ctgcaggtcg ctcctgacgg gcntatcaat gctacactca gcttgtttcc aacgtacggg 5460 atatgttgaa tcaggaagga cttagggact ctgcctttat gcatctggta angaccctgg 5520 aatgatgttc accatccaga gattccaggt caccaatann aggatcttag acatcgttag 5580 tcccaacaaa gtattaaacc ctctccctga gccacctgan tctgggaggg attaaaggac 5640 aaggcctctg cttatacaca ggttnatata agatatcaga gtctcggtac cttcaacact 5700 gtaatgtaac aatccaccta cagatgttaa ctgcaggagt ttccccttac natttagtac 5760 ctccaaaggg cacttggttt gcttgtgcct cgggacaact ccctgtntca gtccctttat 5820 cctcactaaa acttctgact cttgcttact tgtacatcta ttgcctcaga tatactatta 5880 ttctagagaa gangattgga acatctggga cttcacacta atcccagatg gccagggcag 5940 ctccaatact cgtgcccctc ctantaggca tggacatagc agggtctgcc agcatgggag 6000 cagcaacact tattaaggga gatcaaattg tcaaaaaatt ttaagccaac aaaggttgat 6060 ctaagtttgt ccccctggcc tcagaaaaan taattgagtc aatgatttga ctcactggaa 
6120 caactcaaag gggggaa 6137 // ID MER63C repbase; DNA; ROD; 938 BP. XX AC . XX DT 18-APR-1997 (Rel. 6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE MER63C repetitive element - a consensus. XX KW nonautonomous DNA transposon; MER63C. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-938 RA Smit F.A.; RT "MER63C."; RL Direct Submission to Repbase Update (1996). XX DR [1] (Consensus) XX CC Putative internal deletion product of DNA transposon. CC 15 bp terminal inverted repeats, subfamilies only differ by size. XX SQ Sequence 938 BP; 299 A; 147 C; 161 G; 323 T; 8 other; ccagtggtgt gctggtaaat gtttaacaac tggctctctg ggggaggwaa tgymtkgatt 60 tgtgtattca catatataag tttattataa attttactga tataaaggac gtrtagcaca 120 caatttacaa ataacaataa aatatacaat attctttatt gtaaattcca tatagccagt 180 tgattctcac agaatgcttt cgttgatttt tgcccaaact cttgtatccg tagccgaact 240 atgtttgtaa ttgacgaaca agtgtagttc naacatgaat gttggttgat attttcgttt 300 atgttaatga gtaagatgaa agtgaaacaa cgaagacgta tgtcagaact tcactcattt 360 gtcaatgatg tgagtgactt ctttgctgaa tcggatggta gttttcaaac attagaagaa 420 tattttctca atttttggtg ctattcgcaa tgtaacagct acagacatga cacactttta 480 agtttaatct gcatcattaa cattttctcc atcactttct taagtctaga caatcaacaa 540 aacaataaat caagccctga tttgtagcgt ttgccgattt ccgtggtgta aatactccca 600 ccgtggccga tttcaagcta ccagcgtgat gtcactgaat gsggagttgg gaagagatgc 660 acagtagcac acyattatat agtatttcca ccatacagat acaatagacg taaataacct 720 caagagcata gataatagta aaatgtagta aaataattag gaagtgatga gttttgagta 780 tttattacct ttgtttttaa tataatttat ttaattgtaa gtttatataa tttaaatttc 840 tttgttttta atataattta tttaattgta agtttatata atttaaattt caatcggctc 900 tcgcgagccg atacgagcca gctccagcac accactgg 938 // ID HERVS71 repbase; DNA; ROD; 6069 BP. XX AC Z70664; XX DT 27-JAN-1997 (Rel. 6, Created) DT 20-MAY-1999 (Rel. 7, Last updated, Version 3) XX DE Internal sequence of endogenous retrovirus HERVS71. 
XX KW Endogenous retrovirus S71; simian sarcoma virus; HERVS71. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-6069 RA Werner T., Brack-Werner R., Leib-Moesch C., Backhaus H., Erfle V. RA and Hehlmann R.; RT "S71 is a phylogenetically distinct human endogenous retroviral RT element with structural and sequence homology to simian sarcoma RT virus (SSV)."; RL Virology 174, 225-238 (1990). XX RN [2] RP 1-5491 RA Kabat P., Tristem M., Opavsky R. and Pastorek J.; RT "Human endogenous retrovirus HC2 is a new member of the S71 RT retroviral subgroup with a full-length pol gene."; RL Virology 226, 83-94 (1996). XX RN [3] RP 1-6069 RA Blusch H.J., Haltmeier M., Frech K., Sander I., Leib-Mosch C., RA Brack-Werner R. and Werner T.; RT "Identification of endogenous retroviral sequences based on RT modular organization: proviral structure at the SSAV1 locus."; RL Genomics 43(1), 52-61 (1997). XX RN [4] RP 1-6069 RA Kabat P., Tristem M., Opavsky R. and Pastorek J.; RT "Human endogenous retrovirus HC2 is a new member of the S71 RT retroviral subgroup with a full-length pol gene."; RL Virology 226, 83-94 (1996). XX DR GenBank; Z70664; Positions 1 6069. XX CC LTR's of HERVS71 are listed in REPBASE as LTR6A sequences. CC LTR6 (Z70664; position 5492-6069) reported by [1,2] is in fact CC an env-related portion of HERVS71 [3]. 
XX SQ Sequence 6069 BP; 1596 A; 1682 C; 1276 G; 1515 T; 0 other; ggatccaaaa tagccaccct gcagacggcc ttgctcacct tttctgtcat cccataactt 60 ttccagtgcc cttaaataac acactatgta cacaaaccta tgcctgtgct gctttactct 120 gtctggaccc ttattctatc cccctgtggc tactctccta ccttaggaaa gatctgagtg 180 gcccctttcc tccttacccc catcccttaa cccacacagc tcgttttcct gtgtcacagc 240 aagtccagca cctccaagac ttggctctgc tctccatcct aaaaccctta aaagaaaggg 300 ctgagtttga actttttgcc tttgagtcgg ggagacacca aagatatttg gctataagtc 360 aaaaaggaag gggggggtca cataggtccc actggcctca gacccacctc ttgtcctctc 420 tctagatctc aaagcttaaa gagacagatc ttatgtggca agaaatgttg gctatagttg 480 ttttcctact tcttctggtt ataatacttc tgttcttcca atactacagc cccccaggcc 540 atgaatatct ctgtctgtgc tgggtttaat atttctgctt aaaccttgtt aattgcctcc 600 agaatgggaa actcttcttc ctggccccgt agagattaca gccctctcca atgtatgttg 660 cagaatttct ctctgggttc tcagaggatt acggagtcca ccttaagaaa ggcaaactcc 720 agacactctg tgaagtagaa tggccacagt ttggaactgg gtggcaccag aagggtcatt 780 aaacctcaca actgttcagg ctgtgtggcg ggtcatggct ggaactcccg gacaccctga 840 tcagtttcct tacattgatc aatggctaga tttggtccag agccctcctc catggctccg 900 ctcacgtgcc attcatgatc ccacctccaa ggtccttttg agctggacca cacttttgcc 960 ccaaccctca cgtcggctcc tcctgtacag tctccttctg aagaagagga aagttctctt 1020 cacccatttc tgcctcccta taacctcctg cccacccccc ccccccgccc cagaatattt 1080 ccttgtctcc tcgactacat cccctgtggc ctctccacct atagccaccc aattacggca 1140 tcggctggag aggtggccct ccttctccca ctgacagagg cccaaatcct ctgggcaatg 1200 aggctctgct ccatttttag tttatgtccc cttctctgtt tctgacctgt acaactggaa 1260 ggctcataat cccccccttt tctgaaaagc cccaggtctt gacctcactg atggagtccg 1320 tgctccggaa tcaccggccc acctgggatg actgtcagca acttctttta acccttttca 1380 cctctgaaga gagggaccat atccgaagaa aggtcagaaa gtatttcctc acatcagctg 1440 gtagaccaga ggaggaagcc cgggacctcc ttgaggagac ttttccctct acctggcctg 1500 attgggatac aaaatcctcg ggtgggaaga gagctttgga taattttcac tggtatgtcc 1560 ttgtgggtat caagggagcc actcaaaaac ccatgaatct gtccaagaca actgaagctg 1620 tccaggggcc taatgagtca ccaggagtgt 
ttctagaacg cctcctggag gcctatcaga 1680 tttacacccc ttttgacccg gaggctcccg agaatagccg tgctattaat ttggcatttg 1740 tgactcaggt agcccctgat attataagaa aattacaaaa gctggaagga tttgctggaa 1800 tgaacagcag ccaacttttc aaatagccca gaaagttttt gacaattgag agtttgaaag 1860 gcaaaaacag gtagctcagg cagctgaaaa ggctgctgac aaagcatcaa aaagacaggc 1920 aaagatctta gtggctgcca tccaaggaag caagaaggca gggcccccat cacagagcac 1980 cagccagggg accccaggtc cccaccagaa aggccaaaaa ggtgagcagg ctcccctaca 2040 aagaaaccaa tgtgcttatt gcaaacaaat tggacacagg aaaaaagaat gctcattaaa 2100 accagaggaa aaacaagaga agaaaaaggt cctcaccctc cctgctgtgg atgaatctga 2160 agattgacag ggccggggct gccacttcct tcacccccag gagcccttgg tgactgccac 2220 agtgggggcc cagcctgtat gcttcctaat cgacactggg gcggaagact tggtactgca 2280 aacacccttg ggcagtgtct ctaataaaaa ggtggctgtg caagggactt cataagcttc 2340 ataagctgca ggcatccatc tccttctcag cccaataagc tcacctcaca ttaggggacc 2400 caacaccctc taccacccag ctcctgctaa ccaccccttt gtcagaggaa tatctcttag 2460 tttcaccctc acaaccgctg gagaataaaa ctaatcctct cctactggat ttacagactc 2520 tctttcctca agtctgggcc aagtcaaacc cccccaggac tggcaaagca ccatctgcca 2580 gtagttgtag aactcctggc cactgccctg ccagtccagg taaaacaata tcctatgagt 2640 cagtgggcta gagagggaat caatccccat attcagtctg cctggaatac tccatttttg 2700 ctggtccaga aacctggaac aaatgattac cggcctgtac aggacttgca ggaagttaac 2760 aagtggacag tcactgtcca tccaactgtc cctaaccctt atattttact cggcctgctt 2820 ccaccagaac atacagcata cactgttctt gacttaaagg atgctttctt tgctattcct 2880 ctggccccta aaagccaacc tatatttgct tctgaatgga tggaccctgg ctcaggagac 2940 accactcaat taacctggac ttggttaccc cagggtttaa aaaattcccc cacccttttt 3000 ggggaagccc tccaacaaga tcttataccg ttctgagcca gtcaccctaa ctgcacgctt 3060 ctccagtaca tagacgacct gtttttggct actgaaacca ctgacagctg cctgcaacat 3120 actagggacc tactttacct ccttcaggaa ctcgggtatt gggtctcagc caagaaggcc 3180 cagctttgtc ttcccagact tttctaccta ggatacaaga taaacaaggg agaaagggca 3240 cttgccactg ctcgaaagga agccatcctg caaatcccca ctcccaccac taggagatgg 3300 gtacatgaat tcttaggggc tgtgggatac tgtcgtttat 
ggatattggg gttcgcagaa 3360 atcaccaagc ccctgtacac cactaccaga gggaatggcc cacatgtttg gactgacaaa 3420 gaacaggctt ttcaaaatct aaagaaggca ttaactgagg tccctgctct tgccctccca 3480 aatatctcag aaccatttca tctttttgtt catgaaagcc agggagtcac taaaggggta 3540 ctcactcaaa ctttaggacc atggtgatgc ccggtggcct atttgtctaa gatactggac 3600 cctgtggctc cgggtgacca agttgtctgt gagccatagc ggcaaaagca agcctggtcc 3660 aggaggctga taaactgact ctgggccaga atttaaccct tatggctcct catgccatag 3720 agactttgct acaaaggcgc tctggcaaat ggatgtcgaa tgctcacatc ctgcagtatc 3780 agagtttact gttagatcag ccttggttaa ctttctctcc cacaaggtgt ttaaatccag 3840 ctacctttct ccctgatcca gaccttacca cacctgtcca tgactgccaa gaactgttag 3900 agactacata aactggccga cctgatctcc aagatgtgcc tctaaaggag gtggactcca 3960 ctctgtttac tgacagcagc agcttccttg aacagggagt aggaaaggct ggtgcagccg 4020 ttactatgga gacagatgta ctgtgggccc aggcactgcc ggcaggtacc tcagcacaga 4080 aggctgaatt ggtcaccttc actcaggctc tctgatgggg taaggacaaa cgtattaaca 4140 tctacactga cagcaggtat gtttttccta ctgtacatgt acacaaagcc atctatcaag 4200 agtgagggct actcaccagg aaagactatt aaaaacaaag aagaaatttt ggccctgctt 4260 gaagctgttt ggcttcctcc gcaggtggct gtaattcact gcaaatgtca tcaaacagaa 4320 ggcatggcta ttgcctgtgg taaccaaaaa gcaggctctg cagctcgaga ggcagcttgg 4380 ctcccagtca cgcctttgac cctgctgccc actgtgtcct ttccgcaacc tgacctacca 4440 gaccacccac aatactcccc agaggaagaa aaacaagctt cagatctttg ggccagtaaa 4500 tatcaggaag gtttggtgga ttcttcctga ttccagaatc tttattgccc caagtcccct 4560 gggaaacttt aatcaatcat ctgcattctg ccacccattt gggaggaata aaactggccc 4620 agcttcttag gagccatttc aacatccccc accttcagga cttaactaac caagcagctc 4680 tctggtgtat ggattgtgct caggtaaaca ccaaacaagg tcctaagccc agctcagtcc 4740 accctccagg gaggctctcc ccgagaaagg tgggaagttg actttacaga aataaaacca 4800 cactgggcag ggtataaata cctcctagtg ctaatagaca ccttttcggg atggactaag 4860 gcatttacca ctggaaacga aactgccacc atggtagtta ggcttttact cattaaaatc 4920 atctctcaac atgggctgcc tgttgccata gggtctgata atggaccagc cttcacctcg 4980 tccatggctc agtcagtcag caaggcatta aacattaaat ggaaactcca 
ttgcacctat 5040 tgaccccaga gctctggaca ggtagaacgc atgaaccaca caataaaaag tactcttact 5100 aagttaatcc tagagaccag tgagaattgg gtaaagctcc ttcctttagc ccttcttaga 5160 gtaagataca ccacttactg ggctgggttt tcaccttttg aaatcatgta tggaagggct 5220 cctctatctt gcctaagcta agggatacca atttagcaga aatctcacaa gctaatttgt 5280 tcagtacctg cagtctctcc aacaggtatg agacaccatc cagccacttg tccagggagc 5340 acactccaat ccagttcctg accagactgg ccctgccact ttttccagcc aggtgactta 5400 gtataggtta aaaagttcca gaaggaagga ttcactcctg cctgaaaagg acctcatact 5460 gtcatcctca ccacgccgat ggctctgaaa gtggatggca ttccttcttg gattcatcac 5520 tgtcacatca aaaaggcgaa caaagcccag caagaaacat ggatccccaa gcctgggcca 5580 tgccccttaa aactgcgcat aagtcaagtg aagccatcgg attaatcatt tttatttacc 5640 tcttttgttt gtttccgcct attacgccct ctgcccctct ctactctttt ctcctcactt 5700 ctttcacgac agtatgtgtg tttgcaaaca ccacctggaa ggcaggaacc tccaaggaag 5760 tctcttttgc agtcgattta tgtgctttgt tcccagagcc tgcccaaacc catgaaaaac 5820 aacccaacct gccggttatg ggagcaggaa atgtcaacct cgctgcaggg tccagacaca 5880 caggaagccg gactagatgt ggaagctcta aaggtgcaga aaaaggactc cagagcgttg 5940 acttttacct ctgtcctgga aatcaccctg actctagttg tctagattct tatcagtttt 6000 tctgccctca ctggtcatgt gtaaccctgg ccacataccc tggaggattg acctggtcct 6060 caacacttt 6069 // ID MER70I repbase; DNA; ROD; 5023 BP. XX AC . XX DT 29-AUG-2000 (Rel. 5.07, Created) DT 21-NOV-2000 (Rel. 5.1, Last updated, Version 2) XX DE MER70I is an internal portion of the HERVL70 endogenous DE retrovirus - a consensus. XX KW LTR Retrotransposon; Transposable Element; endogenous retrovirus; KW env; internal portion; RT; MER70A; MER70B; int; ERVL group; KW HERVL70; MER70I. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-5023 RA Kapitonov V.V. and Jurka J.; RT "MER70I."; RL Direct Submission to Repbase Update (31-JUL-2000). XX DR [1] (Consensus) XX CC MER70I is an internal portion of the HERVL70 endogenous CC retrovirus CC flanked by the long terminal repeats MER70A and MER70B. 
CC About 100 copies of MER70I have survived in the human CC genome. They are ~80% identical to the consensus sequence and CC belong to two major subfamilies. CC MER70I encodes the reverse transcriptase, integrase and env CC proteins. It is related to the ERVL, HERVL, MERVL-like CC retroviruses: CC MER70I 1019 1729 ERVL 2304 3014 d 0.61 CC MER70I 1730 2285 HERVL_40 2374 2929 d 0.62 CC MER70I 2334 2725 HERVL_40 2982 3369 d 0.63 CC MER70I 2733 3028 HERVL_40 3770 4068 d 0.61 CC MER70I 4149 4264 MER51I 6384 6499 d 0.65 CC MER70I 4474 4638 MER57I 6517 6678 d 0.65 CC Its env-encoding DNA portion (position 4180-4780) is similar CC to env-like portions of MER57I and MER51I that belong to the CC MER4I-group of retroviruses related to the ERV1 class. XX SQ Sequence 5023 BP; 1261 A; 1259 C; 1377 G; 1126 T; 0 other; aaattggcgc agcgagcagg gtccggtctg acagaaccat ggcttattgg cgaagcggga 60 ggggcgatac cccccagtac ccggcccagg gcggagacat cagcctgggg gtgcaggggc 120 tggacatgca gctggatgtt ttatggggaa ggaggggtga taagtgggac atagtctccc 180 cacctccccc gcagatgtga gtgagctggc tgcatggact gaggcagaga gaaaggaatg 240 tggaaacagg agagagcagc gcacgatccc ctggttgttg ggcacttatg tgatcgctca 300 ttcctggggc cggagagctc gggaagtggg tactcagatg gaccctgagg cacctgagag 360 ggaattagaa agaggtcctc agatggtccc tgaggcatct gagagggaac caacacgtac 420 aatgcgggcc ttcacagaaa gggagactca ctatctaaag gacagatatg gagaaaggcc 480 agggaagtct tatgcagcct ggttggtgtg tttgtttgat aaagggacat tgcagatcca 540 aatgtctcag gctgaatggc gcagttagga atggggccaa ggtctccggg tccccaccgg 600 aggccagtgg ccctatactc ctgtcaagat aaaaggagaa aagaaaggaa ttcagacttt 660 tctggggttc ttggatactg gagcccacat gacaatattt ccaggtcccc ttaggggaaa 720 aattaaactg atgacatcgg gaggtttggg gacaaacatg gtgacccatg gtgcttattt 780 gcttatggtg cttatctgct tgtgggtggg gccctttggg ccatttcggg tgccagtgac 840 catggttccc accgctgagt gcattatagg cattgacatt ttggctgctt gtggcacaga 900 acatcaccgc tgcctgaggg ggtatgcccc ctcacagcta agaattcgag ccataacagt 960 ggggcatatc cactcctgcc tgccacctaa gctacccaag tcccaatggg
ttattcaaca 1020 aaagcagtac tgcatactaa gtggagaaaa ggacattact ttgttaattc aggacttgct 1080 acagataaaa atgttacaaa ccaccctgtc acaatgtaac agcccagttt ggctggtcaa 1140 aaaggccttc ggggcatgga gactaacaat ggactgtcgc aggctgaatg ctgtagtaga 1200 cctattgaca ctcgtggccc agatatcacc acagtaattg aacacatcat ggaggcttcc 1260 aaccaatggt atgatgcagt tattgatctg gctaatggat tcttctcaat ccctttgagg 1320 gataagggca gagatcaatt tgtattcaca tggcaaagta tacaatatac atttacagtg 1380 ctgccacagg agtatttgaa ctcacctgcc atatgccacc agtgggtagg atgggatttc 1440 gccactgtgc ttttgcctaa agtggtcatg tgcattcatt acataggtga catccttatt 1500 gtggcccttg atgatccgat cacacaagag gccttggact tgatggtcac agggacgtga 1560 caagcagact gggaagttaa ccctaacagt cctgggatca gccaaactgg tgaccttttt 1620 caaggccact tgggcgggaa gccaaagaag tatcccagat acagtcaagc aaaaattgtt 1680 ggccctggcg gcatccacta ataaaaagga ggcccaacag ctggtaggcc tctttgggta 1740 ctggagacag catatacctc acctgggtgt tcttttggcc cccttagtca aggtgaccaa 1800 caaagccgcc aactttgaat ggggcccttt gcaacagcag gccttggaag ccattcaaca 1860 agtcgtggcc caggcactgc ctttaaaacc tttacagcct gctagcccga tggaattaca 1920 ggtgtccgca acctccatgc atgctgattg gagtctgtgg caacagaaaa ctgccactgg 1980 ggtgcaccag cctctcagat tttggacaca taagttgcct gaggcagcca ccagatatac 2040 ctcttttgaa tggcaactcc ttgcttgcta ttgggcactg gtggagactg agcatcttac 2100 ggccggagcg ccacgtgtga cgctgcaacc tgaactgccc attctcactt gggtgcttac 2160 aaaccccacc agtaaaattg gacaggctca acagagctca attatcaaat ggaaatggta 2220 cattcaagat cgggcccagc caggacccca agggaccagc gggctccatg aacaaatggc 2280 tagcttacca gaagggacca agcgacccgt aggggatgct ttggctcctc ctgtggctac 2340 ctggggccca agattcagag acatgcctac cgacggtatg gcatggggtt tactgacggc 2400 tctgcgaaac aacaagccag tgggtccact gggctgtggc caccatccag ccagtggatg 2460 gccatctttt gactgagact ggacatggac gttctgccca atgggccaaa ctacatgcag 2520 tggtgatggc catgcaggcc gcccctacca ccatatcttg ctacattttc actgactcat 2580 gggccattgc caacagccta gccatctggt caggagaatg gcaactgagt gactggacta 2640 ttaaaggatc ccctgtgtgg ggacaaggac tatggcaaca gcttgctgcc tggaagggac 
2700 aaatatatgt cactcatgtg gatgctggga ctaccatggc cacccttgag aggaatttat 2760 gtcatgtttt tggatacccc atgggacttc actctgacca aggaacatcc ttcactgccc 2820 aagcaacatg acaatgggca cactctcatg gaacacgatg gactttccat gcaccctgtc 2880 atccacaggc caatggagct attgaacgat ggaacagccg actcacacag caactgaaga 2940 aaggacatca agacggcctg ctagtggggt ggtaccccca tctgactagg gcaatatgga 3000 cactaaacac tgcactccaa tgcaagggaa acacggcact gcagcgcatg ttgagaaaca 3060 ctgagcttgg tgggggtgga ggtggaccag gcagccgcct gattaggctg cgcctgcgaa 3120 atcccaatct cagtgttccc aaccattctt tttccttttt ccctttacag tgcacgtcct 3180 gggggtggtt tgcggttcag gccgccatag taccccagac agggcccccc gactctaatc 3240 tggaggcgat gcttcccctg ggtgcctccc ttctatggga tcccacagga gtgggggacg 3300 ggaccaagga ataccagggt gctagggtcc ctctagtggc gccggggaca tctgatcctt 3360 ccagtcgggt gggtgatgtt atcagacatg tcaagtttgt acaggacgtg acctcccttc 3420 ctggactgga cgactgaggc tgaaaggtct gggtcaagca acaagggcaa tggtgcccac 3480 agaggtagta gcctcaggac agggacagac agactgggtc gctacaccaa ctcagcccaa 3540 cccctatctg ataggtaggg aacacctgag accctggaag ggatgggggt ggggcactaa 3600 cctgtcagtc tgctttttcc acaaggacgt ggcagcagag gcctggaagc ataacgcctt 3660 tgttaggctc ttccaagctg tggccaccgc gggtaacctg acaaaatgct ggatctgcca 3720 tcccggacct cattctgtca cagaccagag ggaccctctc atcctgccag tggtaaacta 3780 caccagcatt cctaatgcca cagtgtacac caacagaacc cgagccctgg cttaccgagt 3840 gaggatctgg cacctgccac atgggaggga accggaggtg ccctgtttta acttaactga 3900 cttaaggtgg caaaatgtca cgaccacaac taacaaaacc ttggtgggct ggtactttga 3960 cgcaccacac tcctttgatt acatggacga gaagtgtccc agtggcgacg acgaaaacaa 4020 ggaccggact attgctagcc ctctgtgtag gggcttcatg aacaatattg tatggggaaa 4080 actgagctca tgcaactatg ccatcaatga gacttggctg gtgaatgcca atgcctccat 4140 acccatgaat gggtcactga acaataaaat gggaaagggt gtgctgtgtg cacccgaggg 4200 ctacatcttt ctctgtgggc ggtccgggag tgacccaaat acgggatggg caatgtcatg 4260 cctggaaagc tggcggatgg tgggatcctg cacgttgggc gtgctggggg tgcccctgga 4320 tatcacccct gggaatgaga tgcaccattg ggccagcagc ctaaagctgt acaccaggct 4380 
tactagggac ctgccaggag gtgtaactga ctctgggttt atgtccttta tgagatcttt 4440 ggtaccatac ataggagtca gtgctcatga aaaaatgata agaaacctgt ccctgaccat 4500 ggcagatatt gcttcctcca ctgccactgc cttggcagcc cagcagacat ccctcaactc 4560 ccttgggaag gttgttttag acaacagaat tgctctagac tttcttttag cccaactggg 4620 aggagtgtat gcaattgcca acacctcctg ctgtacctgg ataaacacct caggtatcgt 4680 agaaacacaa gtagaggaga tccggaagca ggttcactgg ctgcagacag tggggccacc 4740 tgaaggatcc ttctttgacc tctttagcaa cttcttacct ggatcactgg gatcctgggc 4800 taggtcactg ctccaggcag gcctgatcat cctgcttgtg gtagtagtcc tcctgggccc 4860 agtgaaatgt attctggcta tggctcaatg atgttgcact gagattgtgt cagtcaaggt 4920 gctacatcaa tctgacaaga caaacctctg cctccagatc cggggaggtc ggtgggcata 4980 tgaaatggac tagctttgct aagggggata tctgggttgg ggg 5023 // ID CYRA11_MM repbase; DNA; ROD; 617 BP. XX AC L03316; XX DT 28-SEP-1995 (Rel. 1.08, Created) DT 17-APR-1997 (Rel. 2.03, Last updated, Version 2) XX DE Mus musculus (clone A11) chromosome Y DNA sequence, repeat DE region. XX KW Repeat region; MMYREPA11; CYRA11_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-617 RA Nishioka Y., Dolan M.B., Prado F.V., Zahed F.L. and Tyson H.; RT "Comparison of mouse Y chromosomal repetitive sequences isolated RT from Mus musculus, Mus spicilegus and Mus spretus."; RL Unpublished (1992). XX DR GenBank; L03316; Positions 1 617. 
XX SQ Sequence 617 BP; 191 A; 123 C; 118 G; 185 T; 0 other; gaattctgaa catgttctca catagggcat ttgaatatac tttttaggtt aaccccatat 60 gatcacaaga aacaaaatgt agcaagaaga tcagagtcag aagtatatct cagactcatg 120 tcaaagacag tctgggaggc acaccatgct tcggttttca aagttccaga gagtggttct 180 ccaccttttt aatgctgcag ttccttcata tatatacttc cttatgctgt gtggtgcacc 240 aactgtaaaa atattttttg ctgcttcata actgaaatgt cctactgata tgaattgtaa 300 catacatttc tgacatacag atagtcttag acaactcctg tgaaatttgt cacaaggact 360 agcaacctac aggttgccaa ccacagtgcc agcaactggg aaaactagcc agccccttga 420 tttttacaat gttgacttcc gtaactatag tagaatagat gtgctctatt ttagtggtct 480 aagcatgtga ggatctgtaa cactacctgg aaacacacat accaaattgg atgagggagt 540 ctattttttc tgtcctggac cttgcaggtt tgagtcagct ctaatcaaat agaagagaaa 600 ggatactaga tgaattc 617 // ID L1M3DE_5 repbase; DNA; ROD; 600 BP. XX AC . XX DT 09-JUN-1999 (Rel. 7, Created) DT 09-JUN-1999 (Rel. 7, Last updated, Version 1) XX DE L1M3DE_5 LINE1 repetitive element - a consensus. XX KW L1 repeat; L1M3D_5; L1M3DE_5. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-600 RA Jurka J.; RT "L1M3DE_5."; RL Direct Submission to Repbase Update (JUN-1999). XX DR [1] (Consensus) XX CC Partially and distantly similar to L1M3D_5. CC ~74% similar to individual repeats. 
XX SQ Sequence 600 BP; 163 A; 165 C; 132 G; 134 T; 6 other; gcacactttt ccagcwgctg cctgagggtc tggcttctaa ctagcctgca tctgggagct 60 gatagggcag ataaacaata gacctccagg agcctgaaca ggaggttggc acttcccatg 120 ccttctcccc agctcactcc agtgataaat ccaggtctac agattctccc tggaaggagt 180 ttgtccacac atcaagcgcc ccaactttta tagcttccac ccaagggact ggctcctaaa 240 tcacctagct ctgggagttg atggggcttt gcatttatga gtctccctag accacagaga 300 acaaagaggt ggttttaaaa caggcacact tccagcagct atctccccag gatcagaggg 360 tgcagcctga acatgagtac aggnatttgc cacagatcct ctccctggct tagtgcagag 420 agagtgggag ataaacaccc atgctcagct tcaccstgaa gatagaagaa actggaacat 480 acatccaaca ccctaacctt tccagctaca tctagagagt ctggtttcta ccttacctgt 540 ctyagagtac tracaggata tggcacatcc taatctccag ggggccacca aaaacanaga 600 // ID ERV2A-CPo_LTR repbase; DNA; ROD; 388 BP. XX AC . XX DT 19-JUN-2009 (Rel. 14.07, Created) DT 19-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Long terminal repeat of ERV2 endogenous retrovirus: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERV2A-CPo_LTR. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-388 RA Jurka J.; RT "ERV2-type endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1374-1374 (2009). XX DR [1] (Consensus) XX CC >97% identical to consensus. XX SQ Sequence 388 BP; 79 A; 127 C; 69 G; 113 T; 0 other; tgtagggagc ggttttgaat agctgctgct gccctgcccc tcctgcctga gggcattttg 60 tgtgtcagcc tacatttccc atgatcctcc ccttgcctgc aatgcacatg acaagcaaac 120 ttcctgtttc agcctgccta cattacccat aatcctctcc tgcttacgtc atatatccgg 180 atgtgatgat gacctatcag ataacactct aagccttctt cctgcctatg gcccgccccc 240 caccctatat aagttgtagc cattttaaaa ataaacgaga ctcgattgga atctcatcct 300 gtctccatct ctcttttctc ttgtcctgcc cccatccccg atcccctctc caggttgaga 360 cccatagatt ggtccgcagg ccggatca 388 // ID RMER13A1 repbase; DNA; ROD; 809 BP. XX AC . 
XX DT 26-AUG-2008 (Rel. 13.08, Created) DT 26-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Long terminal repeat of endogenous retrovirus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; RMER13A1. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-809 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats of endogenous retroviruses from mouse."; RL Repbase Reports 8(8), 901-901 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC International Collaboration for the Mouse Genome Sequencing. XX SQ Sequence 809 BP; 182 A; 254 C; 197 G; 174 T; 2 other; tgtggggcag tggaccatgc caggaaactt ggtaaggttg agtgggttca gaaaccccgg 60 cacctcatac ctgagaatgg taggcatttc acctgctccc taccagattc ctggcctgga 120 acacgtgacc acaactcccc acataaagaa tcgtgaccaa tggtcacata ggcacagaac 180 agagttctcc atgctaatga ggtatctaga tgggccctga gggtttagcc aataagcttc 240 ccttcccgga cattccttcc tgcaaaaggt atttaatctc tggtccaccc tgaggaagtg 300 gtatgcatcc attttccacg atgaacagtc aataaacagt ttggayaagc aaggactgtc 360 tctttcatca agaaccgccg tggggagcat ggaggaagcc ttcatctaca gagcagccgc 420 ctaagtctcc cgcagaaggc ccctctgygc tcccggacaa ttgcactgct gagctaagcc 480 ggagttcccc ctccccagcc ccgggctcat cctcgaggcc catgggcccc ctctgttccc 540 aactcctcaa tttggttccc agcggcgtct ggatgcccag gagtttggga accccagact 600 cctcggcctc agcctgtgcc ttttctcacc tgggctgagg tccccacatc ttaggctgcg 660 ttttcccacc ccctgagcgg tgtatagtgc ttccctggag cccagcaccc gatggcggtg 720 gcatagggta aatgcagcct agcattcccc gtttcatctg cccagcagag ttcccacacc 780 ccagcagcaa ggcagaacgc agaaccaca 809 // ID RNERVK8d repbase; DNA; ROD; 3269 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE ERV2 Endogenous Retrovirus from rat. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW RNERVK8d. 
XX OS Rattus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-3269 RA Smit A.F.; RT "RNERVK8d - ERV2 Endogenous Retrovirus from rat."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC Nonautonomous element, flanked by either RNLTR8C or RNLTR8D. CC Variant of RNERVK8c, with a 1150 bp replaced by 250 bp and a CC 1700 bp deletion. 2.5% subst. XX SQ Sequence 3269 BP; 986 A; 643 C; 801 G; 838 T; 1 other; agtggtgccg aaacccggga accaacatcg ccggtgccga ggggaccctt cagtgtgctg 60 aagaggattc agaactgcga gaaggtaaat taaaaggtaa gcttcgagag gcatgcccct 120 ttttggaaga gagcatgtca gaaacggagc agagtgagaa attaggcttc cgagaggcat 180 gccccttttt ggatttaaga gagcatgtca gaaacggagc ggagtgagaa attaggagcg 240 cgcaaaaaga aaaaggttac taaaacaaaa gtgaaagaga tacaaaaggc ggaagaaacg 300 ccgctggagg aaaaaatagg tatgccccgg aggacgaagt acaagtcatc cggagaggat 360 actttgtatc cattaaaaga attagaagcc ttagaactgg ctggcagtga ctctgagcaa 420 gagttgtcag agtcggagga ggaggaatta gaggaagagg cagccatata tgaggaagaa 480 agatatgggc ctaggtggag agccactgtg aagaaaccta aagttatgga tcctttgtgg 540 gcaaggctgg taattttgtt tctttttttc tttgtgtggg atagagaagc cacagacggc 600 aagcttaagc ccggaccaaa acctctttgg cgaatggtcc ctgcctgctt agaggataaa 660 cgatgtgagg aagccttaga gccgttagga caggtcagag ggtccttgca gagcaacaag 720 aaagtatgtc agagggagaa aaggtctcga aagaaaaaaa gaaaagaaaa ggagataaaa 780 ggaacagaga gaaggcagag ataaaaaatg gggtcagtgc agcttttgtt gtggctcaag 840 ttgaggctat tgctaggtat gtgattaggc tctggaaacc tcacagagtc atactgatca 900 gatttgatag aggtagaccg cctgaaaatt tattcaggag aatcaaacaa aaagataaac 960 ttgagtgtgc ctacttggat tcctggtctc aaggagattt agcaagtgac aaggtgcttc 1020 tcttaaaaaa gcagtttcaa tcggcccttg gagccttgca cctgtagcta agatatgtgt 1080 agccacttca attttgtttt ctgactggtt ttaaatgtat aaatattctc tacatgcctt 1140 ggttatggac tattggcttt taagttattg gatatggttt aaaaaaatgt aacattgata 1200 acagaaagtt gacataaaac 
tggtaaaatt taaaatatgc tcatgtttct gttgatacct 1260 gttccggtat tatacacgct actctgatga ctggtgaaaa ggctcgtaat gccattagcc 1320 attgcttaga ggcatgggca gcctggggaa agcctgatag tctcaagaca gacaacgggc 1380 ctgcctacac tgcaaagtcc tttcaggcat tttgccagac aatgcaggtc ggncatacta 1440 caggcttgcc atacaatccc caaggtcaag gaattgtgga aagggtacat cgtaccttaa 1500 aagagcttat acaaaaacaa aaagagggaa ttgccagcag ccgaacacca aaagaacaac 1560 tttctttagc tctttttact ctaaatttct taattttgga tgcgcatggc cgctctgctg 1620 cggatcgcca tgctgctact acacctataa ctaatgcaga agtaaagtgg aaggatgtct 1680 taactgatga atggcgtggc ccagatcccg tgatttcgag atctagggga gcgatttgtg 1740 tttttccgca gaatcaagaa aatccaattt gggtacctga gcgtttgact cgaaaattgc 1800 cttctgctct tcctgaagat gagactacga accctactat tactactggg aatggtaata 1860 caggttaatt cgctcaccct gtgggctatt gccagatcct ggccagtacc catgccagtt 1920 catagcaatt ctactgtttt gagaataaga ctggcaagca taatctctgg ccttggtttc 1980 aatgggtgct gtccaatgag caaggggcat atacatccct gacccccttt gctaggttga 2040 tgggagagaa cttcgtgctg tataatgttt cagctaccag aaaaaggtga taccaggttg 2100 atcaatattc aatataataa cctcttggta aataatactg ccactagtac ttcagtatgt 2160 gctagatcac cttttttctg ataagcacta atgtaagcag tggtgcttta aattgtaata 2220 gctctgatgt agtctgctat ttagctgaat gctgggatgg caccaatgac accgcagtga 2280 tggttaagat tccctccttt gtgccaatcc cagtggaggc aaacccagat agctttccta 2340 ttcttaattt gctgagagcc aaaagagatt ttggaattac ggcagccata atttcagcca 2400 ttgtgctgtc tgctgccgca gctaccacag ctgcaatcgc tatgaccaat caaatacaga 2460 cggctgagac tgtcaatcag attgtagaaa gaactgcagt ggcgctagag atacaagaag 2520 aatttaatac ccatttggca tctggccttc tattagccaa tcaaaggata gatttagttc 2580 aagaacaaat tgaagcactg tatcatatga cacagctgtc ttgtgtttcc tcccttagag 2640 gtttatgcat tattcctttg cgagctaact tttctcagaa ttttcaacag agcaaagaaa 2700 tctcgaatta tctgaaagga aactggtcca tgaaagcaga gcaactatca agacaattgc 2760 tgatgcagat tgctgtcctc aacagcacta ggctggaccc catcacagtt gaagacttta 2820 cctcatggat taccaatgct ttctcccttt tcaaggagtg ggcgggcatg tttgtcccgg 2880 ggagctattg tcctcctggg atgcggagta 
ggtctctggc ttattggccg tttaaaaaga 2940 gaacatgcca gacacaaagt ggttgtttac caggctatga ccaccattaa aaagggtgct 3000 tctcccaaca tatggctggc ctctttgaaa gatcaagagt tcgccctaga tcatttttgt 3060 cacagtcatg tggagtcatt gtatccaggg acgggcaact ttcctcgaac tcggaccaac 3120 ctaagacacg gggcccggtg gcgatagggt aactctatga cgggtaaggc cgagtttgga 3180 tgagaacgac ctaagacagg agcaatgaca agattgacgt gacacccaga tctagaccag 3240 tcattttatt aaacaaaaaa gggggagat 3269 // ID MER97 repbase; DNA; ROD; 1106 BP. XX AC . XX DT 29-JUN-1998 (Rel. 6.5, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE MER97 repetitive element - a consensus. XX KW mariner/Tc1 superfamily; Non-autonomous DNA transposon fossil; KW MER97. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1106 RA Jurka J. and Kapitonov V.V.; RT "MER97."; RL Direct Submission to Repbase Update (JUN-1998). XX DR [1] (Consensus) XX CC Putative non-autonomous DNA transposon fossil related to the CC mariner/Tc1 superfamily. TA target site duplication. CC Perfect 24 bp terminal inverted repeats. CC Average identity of individual copies to the consensus is 80%. CC MER97 portion (positions 500-800) is about 60% identical to CC TIGGER2 (positions 1959-2155) and TIGGER5 (positions 1544-1820). CC Analogously to the RICKSHA mobile element, MER97 is a putative CC composite element since it carries a region (positions 804-993) CC 70% identical to the internal part of LTR-retroelement CC HARLEQUIN (positions 2409-2580, reverse orientation). 
XX SQ Sequence 1106 BP; 368 A; 196 C; 220 G; 322 T; 0 other; caggcagtcc tcgctttgca cggttccgat atgcatgaat ttcagttacc acggtttagt 60 taaataacac cagtccccca acaacacggt tcaaatttca gttaccacgg tatattaact 120 gtgagtaatt gcataaagta caaacttcgc tgctagctct tcagtccaca aatcactaca 180 taaataacag atgcgcatca tgatcagtga ccaatcacat cacttctttc aaagtctgtc 240 ggtgattggt cactgtgcat ctgttattca gttcatgcac agacagcaaa gcgtgtagtt 300 gtgttgcctc cttgtctccc agtgataaac ccacgtgaca ttttacaaaa atggataatc 360 gaaagaggga attggccaac aaagatgaaa gtgcagcaaa gaaacgaaaa gtgataatgc 420 tggaagtgaa attcgaatca aacgtaaatg gagttataga agaaatagct gaccgtggga 480 atgttgacac tgccgccatt tgagagactc tagatatgca gccagaggaa cttagtgaag 540 gcgaacttat cgacataaat gaggaaagtg gttgtgacga aaaggatgaa gatgtcccag 600 aggaagtgac gccagcaaaa aacttcacat taaaggaact cttggagata tttcacgaca 660 ttgaaagcgc aaaggataaa atgttggaag ctgatccaaa cttagaaagg agtatgacaa 720 tttgccaagg catagaaaag atgcttactc cgtatcgtaa gttatataaa gaagaagaag 780 gcaagcactg ttcaaactac tcttgataag ttttttttgg tttgttttga taagtttttt 840 tacaaagaaa taaaacactt taattctcaa tgtttctaat gttttaaatt atagtgtact 900 aaataaatat tagttttatt atttttttca ttccctatac atttataacc gacggtaaga 960 gagtttttaa tgtttttgac aaaaattttt aaaggtcacg gaacaattat aattttccca 1020 ttgattatta agatcgcttt gcatggtttc agcttgcatg gtcattttta cggtcccgta 1080 ctaccgtgca aagcgaggac tgcctg 1106 // ID MER42C repbase; DNA; ROD; 1522 BP. XX AC . XX DT 19-FEB-1997 (Rel. 2.01, Created) DT 19-FEB-1997 (Rel. 2.01, Last updated, Version 2) XX DE MER42 repetitive sequence; 3' LINE1. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW Repetitive sequence; LINE1; MER42C; MER42 family; KW MER42c subfamily. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1522 RA Smit A.F., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246(3), 401-417 (1995). 
XX DR [1] (Consensus) XX SQ Sequence 1522 BP; 541 A; 229 C; 299 G; 380 T; 73 other; atgtayagka ggattgatca aataagtaaa cgtattnaag ataatgggag ccaggtttct 60 cactgtcgga gaagggagtt acaaatatgg aaagggrgaa grctagaatg aaccctgtgg 120 tattrgattr gaattggagg tatcagtrtg aactcatgrt ttttaatata yryryryryr 180 tttttcctag tcctgtccac tgagagggcc tagaagcaac gacaccccag tagcaacgag 240 cacacctagc acccagatct tggtttctaa ataccattct ccactaaaag gaaccagggc 300 tccttggaga aatggctgat tctaggacta gggcaggaaa tgtacaagat gagcctggaa 360 catcttgttr tgccagaaaa taaggaagtg ctcaaaaaac gatrrgggca trtcaaaagg 420 acacagaagc cagcttgaag gggctcccac tggccaaatc tgggacaatt tgagcatcaa 480 aataaataat rgtagtaatg gattataact cattgaataa aataaatatc catgagttca 540 tactaatata aataaatgaa taaaawaaat aaatgagaar ggaaagctct tcttacagtn 600 gaatgccaan taataaatnt agaargaata ataatagaaa aatcaccatt aggcaaayac 660 cgcagtaata actgtttyag gcaagatcca tcgatagatg ctmtaattag tgggcgaaar 720 tttgatgaga aacggratat ttgcatagtc tcaaagtatc tcctcacaag atatttatta 780 attacaaagg gaaaaayagt gactttacag tagagaaacc tggcagacac caccttaacc 840 aagtgatcaa rgttancatc accaaaaatg agacaaaytg acatcatgcg cyncctgatr 900 tgatgcgccg agaagaacaa catsgcttct gtgatattcc tgccaaagat gcataacctg 960 aatctaatca tragaaaata tcagacaaac ccaaattgag ggacaktcta caaaataact 1020 ggnctgtact catcaaaart gtcaaggtca taaaagacaa rgaaagactg aggaactttc 1080 tacntttgac ggaagactag arncatgaca actaaatgca acgcgggatt ctgrantgga 1140 tcctggrtcg agaaatagtg ggagtttkta yctataaagg acattattgg gacanttgrc 1200 gaaatttgaa tanggtctgn agattagata atagtattgt atcaatgtta atttcctgat 1260 tttgataatt gtaytgtggt tatgtaagag aatgtccttg tttttaggaa anacacactg 1320 aagtatttag gggtaaaagg ncatsatgtc tgcaacttac tctcaaatrg ttcagraaaa 1380 aaaatnnata tayataarya gagaatgata aagcaaatgc ggyaaaatgt taacaattgg 1440 tgaatctggg tgaagggtat acgggwgttc tttgtactat tcttgcaact tttctgtaag 1500 tttgaaatta cttcaaaata aa 1522 // ID RLTR34_MM repbase; DNA; ROD; 509 BP. XX AC . XX DT 06-FEB-2003 (Rel. 8.01, Created) DT 06-FEB-2003 (Rel. 
8.01, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW retrotransposon; RLTR11A; RLTR34_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-509 RA Jurka J. and Drazkiewicz A.; RT "RLTR34_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 20-20 (2002). XX DR [1] (Consensus) XX CC Similar to AC079222 (73%, bases 56121-56544) and AC067964 CC (75%, bases 113422-113708) fragments described as RLTR11A. CC 67% similar to RLTR11A (rodrep.ref) (bases 98-239). CC Similar to RLTR18_MM (73%, bases 61-226), RLTR23_MM (78%, bases CC 51-590), RLTR28_MM (67%, bases 92-247). XX SQ Sequence 509 BP; 119 A; 81 C; 184 G; 125 T; 0 other; tgtaggggtg gttctgatgc taaggtcctg ttccccaatt ggttcttgat ctgtcagtaa 60 agaaagccat gggccaattg ctgggcagaa ggaataggcg ggacttccag gtccctggag 120 gaaaagggag atgcaaggaa ggagagagag agttttccat gctttggagg gagaaaaggt 180 gaccagccat gtgagatctc aggatggagt ggccacatag gctgctccta caggcaggtg 240 gtcaggggct agatgtgact agcaactaaa gtttagggca ggtgggaggt gctgagctaa 300 gagtattggt aagggcacgc ttttccaggt gggagatagt agtgcccagc aattgtgcca 360 agaaggcaag ttgaaaatga acaactgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 420 gtgtgtgtgt gtcttttatc catggattca agggaagctg ggtgggggct ggtagcgtgg 480 cccgttccca gagcttaggc agggtagca 509 // ID AFROSINE repbase; DNA; ROD; 160 BP. XX AC . XX DT 19-DEC-2003 (Rel. 8.11, Created) DT 19-DEC-2003 (Rel. 8.11, Last updated, Version 1) XX DE SINE elements from African endemic mammals - a consensus DE sequence. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; AFROSINE. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RA Nikaido M., Nishihara H., Hukumoto Y. 
and Okada N.; RT "Ancient SINEs from African Endemic Mammals."; RL Mol. Biol. Evol 20(4), 522-527 (2003). XX RN [2] RA Nikaido M., Nishihara H., Hukumoto Y. and Okada N.; RT "Direct Submission to GenBank."; RL Direct Submission to Genbank (08-NOV-2002)Masato Nikaido, Tokyo RL Institute of Technology, Graduated school of Bioscience of RL Biotechnology; 4259 Nagatsuta-cho Midori-ku, Yokohama, Kanagawa RL 2268501, Japan (E-mail:mnikaido@bio.titech.ac.jp, RL Tel:81459245742, Fax:81459245835). XX RN [3] RA Kohany O. and Jurka J.; RT "AFROSINE consensus."; RL Direct Submission to Repbase Update (DEC-2003). XX DR [3] (Consensus) XX CC Average similarity to consensus 92%. XX SQ Sequence 160 BP; 41 A; 38 C; 42 G; 38 T; 1 other; tgctaaccaa aaggtcggca gttcgaaacc accagctgct ccaygggaga aagatgtggc 60 agtctgcttc cgtaaagatt tacagccttg gaaaccctat ggggcagttc tactctgtcc 120 tatagggtca ctatgagtcg gaattgactc catggcagtg 160 // ID RNSAT1a repbase; DNA; ROD; 168 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 17-JUL-2008 (Rel. 13.07, Last updated, Version 2) XX DE Satellite from rat. XX KW Satellite; Simple Repeat; RNSAT1a. XX OS Rattus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RA Smit A.F.; RT "RNSAT1a_ - Satellite from rat."; RL Direct Submission to Repbase Update (06-SEP-2005). XX RN [2] RP 1-168 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (05-MAR-2004). XX DR [1] (Consensus) XX CC [2]. XX SQ Sequence 168 BP; 48 A; 32 C; 30 G; 58 T; 0 other; ccacattgat cacatacata gggcttctct ccagtatgta ttcttttatg tacttggaga 60 taactatgag atgcaaaggc tttaccacat tgatcacata catagggctt ctctccagta 120 tgtattcttt tatgtacttg gagataacta tgagatgcaa aggcttta 168 // ID HERVL repbase; DNA; ROD; 5669 BP. XX AC X89211; XX DT 19-FEB-1997 (Rel. 5, Created) DT 19-FEB-1997 (Rel. 
5, Last updated, Version 1) XX DE Internal part of endogenous retroviral element HERV-L. XX KW Internal part of endogenous retrovirus HERV-L; HERVL. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-5669 RA Cordonnier A., Casella F.J. and Heidmann T.; RT "Isolation of novel human endogenous retrovirus-like elements RT with foamy virus-related pol sequence."; RL J. Virol 69(9), 5890-5897 (1995). XX RN [2] RP 1-5669 RA Heidmann T.; RT "Direct Submission."; RL Direct Submission to EMBL/GenBank/DDBJ (26-JUN-1995). T. RL Heidmann, Inst. Gustave Roussy CNRS URA 147, 39 Rue Camille RL Desmoulins, F- 94805 Villejuif, FRANCE. XX DR GenBank; X89211; Positions 463 6131. XX CC LTR's of HERV-L are listed in REPBASE as MLT2 sequences. CC HERV-L has a potential leucine tRNA primer-binding site. CC The HERV-L internal sequence shows some amino acid similarities CC to CC retroviral reverse transcriptase and integrase proteins. In CC addition, CC a region homologous to dUTPase proteins was unexpectedly found CC downstream from the integrase domain. The amino acid sequence and CC phylogenetic analysis indicate that the HERV-L pol gene is CC related to CC that of foamy retroviruses. HERV-L-related sequences are detected CC in CC several mammalian species. In primate and mouse genomes 100 to CC 200 CC copies may be present. 
XX SQ Sequence 5669 BP; 1631 A; 1188 C; 1309 G; 1541 T; 0 other; aattttggta ccaggagtga ttctagagga acagaatatt aaggattgag ttatttcgtt 60 ggtttagggg tttctagatt tggctgctta gtatgattag acccaaaaat gctgaagact 120 ctacttctaa tagtatgggg aaacactgat agtccttggt gtgaactgtt tagagaatta 180 tgcaaaataa atgcatgtgg cacttttgat tctctgttca tgaaaggcaa gatgtttagt 240 gactctgtgt gtaatacctt tgactatatg tggagaacca tgaataaagt tggttgtttg 300 ctcataagtt cactggacaa agtgatgaaa gaaaatgatg aactcaggga ttctaactcc 360 cagcttcaga agcagataca gagcctcaaa tcttcgaaga ttgccctgag cgaaagtcct 420 atctcctgta gaaaaaagag ctgaaattgt ggaaaaacag acacaagctc ttataatgca 480 agtggctgac ctacaatgaa aggtgcttgc acagcctcgc caggtgtcta ctgttgaagt 540 gagggaattc attgggaaag aatgagaccc tgaaacttgg aatggggaca tgtgggagaa 600 cctgatgaag ctggggacac tgagcatgta aactctgatg aacctttttt tttccagaag 660 aaacagcttc cgtatctcct gtagtggcaa catttccttc ctgacctatg ttgccatcag 720 tctttccacc tttgtctgag gacataaacc ctgagctatc tgtggctaca gtgagggact 780 cccctgaggc tgttgccagg caatataatg ttgattctcc tcaggaccca ctctcaacac 840 ccctgtttgc ttttagacct atacctagac taaagtcctc ctgggcccct agaagtgaga 900 ttcacagtgt gacccatgag gatttacact acacttggaa agaacccttg agttttctaa 960 tttatatgag cagaaatctg gagaacaggc atgggaatgg atattaaggg tgtgggataa 1020 tggtagaaga aacatagaat tggatcaagc tgaatttatt gacttgggcc cactaagtag 1080 ggattatgca tttaatgtta cagctcagag agttaaaaaa ggttctaata gtttatttgc 1140 ttgattagtt gaaatatgga ttaaaagatg gcccactgtg tgcaagctgg aaatgccaga 1200 tctcccttgg tttaatgtag aggaagggat ccaaaggctt agagagattg ggatgttgga 1260 gtggattagt cactttagac ttactcatcc cagctgagaa gttccaaaag ataaacccct 1320 gaccaatgct tcgcaaaaca gatttgtgag ggcagcacct gcatctttga agagccccgt 1380 aatcactctt ctctgtctgt cagatctaat catgggaact acagtcactc aactgcaaaa 1440 tttaaataca atgggaataa ttggaccctg aggtgacagg ggccaagtgg ccacagtcaa 1500 tcttcaaagg caaggtgggc atagctacca taatagacag cagaggcaaa gcagcaatca 1560 gaatagtctg acttgtgtag agctctgaca ttagaaaatt aatcatggtg ttcctagaag 1620 tgaaattgat aggaagccta cagcattcct 
acttaattta tataagcaga aaccttccag 1680 gttgagtgga caaaatacta actcgaatta taaaaacaga gaatcatggc ccctcaatca 1740 atttccagac tttagccaat tcacagaccc agaaactctt gaatgaaggg gaggccatgt 1800 tcccttgagg aaggacccca ctacaccact gacaatttat gctgttaata tttctcccac 1860 ccttacccaa ggagacctcc agccttttgc ctggttaact gtgcattggg gaaagggaaa 1920 tgatcagaca tttcagagac tactgaacac tgcctctgag ctgatattca tttcagggta 1980 ctcaaaatgt cactgtggtc ctccagttaa agtaggggct tatggaagtc aggtaattaa 2040 tggagttatg gctcaggtct gacttacagc aggtccagtg ggtccctgga ctcatcctgt 2100 gttcattttc ccagtgccag aatgcataat tggcattgtc atacttagag gctggcagaa 2160 cccccacatt gattctgtga ctggtaaggt gagggctatt atggtgggaa aggccaaatg 2220 gaagccattg gagctgcctt tacctaggaa aatagtaaat ccaaaaacat tatcaccacc 2280 ctggagggat tgcagagatt agtgccacca tcaaggactt gaaaaatgca ggggtggtga 2340 ttcccataac atccttgttc aactctcctt tttggcctgt gcagaagaca gatggatctt 2400 ggagaatgag agtggattat cataagctta accaagtggt gactccaatt gcagctgcta 2460 tacaagatgt ggtttcattg ctcaagcaaa ttaatacatc tcctggtacc ttgtatgcag 2520 ccattgactt ggcaaatggc cttttaccca ttccataagc cccaccagaa gcaatttgcc 2580 ttcagctggc aaggccagca atgtatcttt actgtcctac ttcaggggta tatcaactct 2640 ccggctttgt gtcataatct tattcagagt gatcttgatc acttttcact gccacaagat 2700 atcacactgg tccattacat tgatggcatt atgttgattg gatccaatga gcaagaagta 2760 gcaaacacac tggacttatt ggtgagacat ttgcatgcca taggatggga aataaatcca 2820 aataaaattc acgcaccctc tacctcagta aaatttctag ggtccagtgg tgtggggcct 2880 gtcgagatat tcctttaaag ggaaggataa attgctgcat ttggcacctc ctacagccaa 2940 gaaagagaca catcgcctag tggacctatt tggattttgg agacaacaca tttcttattt 3000 gggtgtgcta ttccagccca tttatccagt gacacaaaag gctgccggtt ttgagtggag 3060 tccagaacag aaggctctgc aacgggtcca ggctgctgtg caagctactc tgccacttgg 3120 accacatgac ccagcagatc caacggtgct tgaggtttca gtggcagaca gtgatgctgt 3180 ttggacctct ggcaggccct cacaggtaaa tcacagtgga ggcctctagg attttggagc 3240 aaggccctgc catcttctgc agataactac tctcctgaga gacagcactt ttcctgttat 3300 tgggctttgg tggaaactga gtgtttgact atgggtcatc 
aagtcactat gtgacctgaa 3360 ctgcctgtca taaactgaat gctttaagac ccatctagtc ataaagtggg tcatgcacag 3420 cagcattcaa tcatcaaatt gatgtagtat atatgtgatt gcgctcatgc aggtcctgaa 3480 ggcacaagta agttacataa ggaagtgtct caaatgccca tggtctccac tcctgccacc 3540 ctgccttctc tcccccagcc tgaactgagg gcctcatcgg gagttcccta tgatcagttg 3600 acaagagtaa gagaagacta gggcctggtt cacagatggt tctgcacaat atgcaggcac 3660 cacctgaaag tggacagcta cagcactatg gcccctttct aggacatccc tgaaggacag 3720 tgatgaaggg aaatcgtccc agtaggcaga acttcgagca gtgaacctgg ctgtgcactt 3780 tgcatggaag aagaaatggc cagatgtgcg attatatact gatccatagg ctgtagccaa 3840 tggtttggct ggttgaacag ggacttggaa gaagcataat tggaaaattg gtgacaaaga 3900 catttgggga agaggtatgt ggatggacct ctctgattgc tcaaagacta tgaagatatt 3960 tgtatcccac gtgagtgctc accagcaggt ggtctcagca gaggaggatt ttaataatca 4020 agtggatagg atgacctgtt ctgttgacac cattcagcct ctttccccag ccgctgctgt 4080 cattgcctaa tgggctcagg aacaaagtgg ccacagttgc agggatggag gttactcatg 4140 ggcccagcaa catggaattc actcaccaag gctgacctgg ctatggccac tgctgagtgc 4200 ccaatttgcc agcagcagag accaacactg agacctcaat atggcaccat tcctcagggt 4260 gatcagccag ctacctggtg gcaggttgat tatattggac ctcttccatc atggaaaagg 4320 cagaggtttg tcctcactgg aatagacact tactttagat atgggcttgc ctattctgtg 4380 tgcaatgctt ctgccaagac taccatccat ggactcaggg aatgccttat ccaccatcat 4440 ggtattccac acagaattgc ctctaaccaa ggcactcaca ttatggctaa agaaatgtgg 4500 cagtggactc atgctcatgg aattcactgg tcttaccatg ttccccatta tcttgaagca 4560 actagattga tagaacggtg gaatggcctc ttgaagtcac aattacattg tcaactaggt 4620 gacaatactt tgcagggctg gggcaaagtt ctccagaagg ccgtgtatgc tctgaatcag 4680 catccaatat acggtactct ttctcccata gccaggattc atcaaggggt ggaagtggaa 4740 gtggcaccac tcaccatcac ccctagagat ccactagcaa aatttttgtt tcctgttccc 4800 ccgacattat gttctgctgg cctagaggtc ttagtcccag atggaggaat gcttgccacg 4860 aggagacaca acaacgattc cattaaagtg gaagttaaga ttgccacctg gacactttgg 4920 gctcctccta cctttaagtc aacaggctaa gaagggagtt acagtgttgg ctggggtgat 4980 tgaccaagac tgtcaagaag aaatcaattt gctactccac aacggaggta 
aggaagagta 5040 tgaatggaat ccaggagatc cattagtgca tctcttaata ttatcctgtg attatattac 5100 tcttaatact gccctgtgat taaggtcaat gggaaattac agcccaattc aggcaggact 5160 tcaaatggcc cagacccttc aaaaaattaa ggtttgggtc actccaccag gaaaaaacac 5220 aaaaacaaaa aatgtgacct gctgagctgc ttgctgaagg caaatggaat acagaatggg 5280 tagtagaaga agttagtcat caataccagc tatgatcacg tgaccagttg cagaaacaag 5340 gactgtaatt atcatgagta tttcctcctt cttttgttaa catgtactaa gaaaatatct 5400 tcgatttatt gtagttgcac caagaaaata tcttcggttt atttcatttt cctttactat 5460 gtaacataag atttactgac ttcatatcag catttaagta ttgttacctt tatataatag 5520 catttgggtt ggggattgat acatttccgg ttgtacaaag gatagttgta ttatattagg 5580 catagttatg accttatgac tgtctttatt tgaagattat atatgatctc aggagatgtg 5640 tacgggttca tgttgacaag gggtggact 5669 // ID LTR6D_Cpo repbase; DNA; ROD; 373 BP. XX AC . XX DT 19-SEP-2009 (Rel. 14.1, Created) DT 19-SEP-2009 (Rel. 14.1, Last updated, Version 3) XX DE Long terminal repeat of ERV3 endogenous retrovirus: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR6D_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-373 RA Jurka J.; RT "Endogenous retroviruses from guinea pig."; RL Repbase Reports 9(10), 2153-2153 (2009). XX DR [1] (Consensus) XX CC >87% identical to consensus. XX SQ Sequence 373 BP; 77 A; 94 C; 91 G; 111 T; 0 other; tgttatggtt tgtgtctgga tgtcccccaa agcctcatgc agtcatgggg gcgggcttct 60 agagtttgtg attgattcat ggtccaatgc tgggattatg ggtgtgagtg ctacacccac 120 cctagggtgg gtggcctaca tacaatataa gggacagaaa gagtcttctc tccgtgctcc 180 tccctttttg ctcttgttgc ttttgctgtc ttccgccgcc atgaactgtt gccctctgcc 240 acgtgccacc ctgccttgga gccagctgat tatggactga aacctccaca aactgtgagc 300 taaataaacc tttccttcct taattttggg cgtcgggtat tttgtctcag caacgagaga 360 aaagtaacca aca 373 // ID LTR7_Cpo repbase; DNA; ROD; 403 BP. XX AC . XX DT 31-JUL-2009 (Rel. 
14.07, Created) DT 01-DEC-2009 (Rel. 14.07, Last updated, Version 3) XX DE Long terminal repeat: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR7_CPo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-403 RA Jurka J. and Baney O.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1548-1548 (2009). XX DR [1] (Consensus) XX CC ~93% identical to consensus. XX SQ Sequence 403 BP; 83 A; 123 C; 72 G; 124 T; 1 other; tgtgggaagc cctgtaattg gctgccacct tatatctgca ggcarcacca ccttaactcc 60 ctctaggagt taagtttggt agttaagcag ctccttgcct atccctttgt ttggcccatt 120 cagggattat gctaatcagc ctgccttatg agcccgcgcg caggaaattt gaaaatttga 180 attaacctat aaccctcagt cttgccactg ttgctaagtt acctgctgac gtctcagaac 240 ccaactccct cctcccccaa taccctatat atttgttact ttttccttga ataaatgaga 300 cttgatcaga cttctgtctt gtctccattc tttgcgtctc ttgtcccctc tcatccccac 360 tccccctcta gggtctccgt ggacttaccc gcaggccggg aca 403 // ID L1MC3 repbase; DNA; ROD; 2487 BP. XX AC . XX DT 18-APR-1997 (Rel. 6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1MC3) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; L1MC3 subfamily; MER42A; KW L1MC3. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1187-2487 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX RN [2] RP 1-2487 RA Smit F.A.; RT "L1MC3."; RL Direct Submission to Repbase Update (1996). XX DR [2] (Consensus) XX CC Replaces MER42A. 
XX SQ Sequence 2487 BP; 938 A; 407 C; 481 G; 600 T; 61 other; cttgtatcca gaatatataa agaactctta aaactcaaca ataaaaaaac aaacaaccca 60 attaaaaaat gggcaaaaga tctgaataga catctcacca aagaagatat acagatggca 120 aataagcaca tgaaaagatg ctcaacatca tatgtcatta gggaaatgca aattaaaaca 180 acaatgagat accactacac acctattaga atggctaaaa tcaaaaacac tgacaacacc 240 aaatgttggc gaggatgtgg agcaacagga actctcattc attgctggtg ggaatgcaaa 300 atggtacagc cactttggaa gacagtttgg cagtttctta yaaarctaaa catacaatta 360 ccatatgatc cagcaatcay actcctaggt atttacccaa gtgaattgaa aacwtatgtc 420 cacacaaaaa cctgcacacg aatgtttata gcagctttat tcataattgc caaaacttgg 480 aaacaaccaa gatgtccttc aataggtgaa tggataaaca aactgtggta catccataca 540 atggaatatt attcagcgat aaaaaggaat gaactactga kacatgaaaa gacatggatg 600 aatctyaaat gcatattgct aagtgaaaga agccagtctg aaaaggctac atactgtacg 660 attccattta tatgacatty tggaaaaggc aaaactatag agacagaaaa cagattagtg 720 gttkccagrg gttgagagat gggaagtggg gatgrytgca aargtaaagc acargggatt 780 ttttagggtg rtaaaactat tctgtataaa ctattctgta tgatactatg gtggtggata 840 cacgacanta tgcatttgtc aaaacccaca gaacttgtca aaacccacag aactttacag 900 cataaagagt gaactttaat gtatgyaaat tttaaaaaat catttargag atcgggggat 960 cycaggatgg aatacagamt gtgacaaaag aatctaactg tattacaaat gtatgaaaca 1020 acctcactga agggratggg ggaaaaaggt gctgacctaa gtaactttgg aaatgagtgg 1080 agtctgtaag actaaaggca aaaggaactg cacataagca ctgtactcta gttrataaag 1140 ttgtttycca yaggggtaya ggttaacaat tctgawacya ctatatacgt atannaraat 1200 taaacaaata agtaaatgta tkgtagatga tgggagccag gtttctcact gtcggagtga 1260 ggagttacag ataagcaaag ggaggagrct agaatgaacc ctgtggtatt rgattagaat 1320 tggaggtatc agtatgaact catgrttttt aatatayryr yryryryryr tttcctagtt 1380 ctgtccgctg agagggccta gaagcaacga caccccagta gcaatgagca cacctagcac 1440 ccagatcttg gtttctaata ccattctcca ataaaaggaa ccagggctcc ttggagaaat 1500 ggctgattct aggactaggg caggaaatat acaagatgag cctggaacat cttgcartgc 1560 cagaaaataa ggaagtgctc aaaaacacaa tgatrggggt atgtcaaagg gacacagrag 1620 ccaactgaaa gagctcccaa tggccaaagc 
tggaacaatt tgagcaacaa aataaattat 1680 rgtagtattg gattataacy caaagtataa aataaatatc catgagtcca tactgatata 1740 aatgaatgat taaataaata aataaattga gaagataaat aagtctctnn tgcagaagaa 1800 tttcaaataa tttatgtaga tarcctnccc tcaaggaagt ggagcgtaac tccccactcc 1860 ttaagtgtgg gatggatata gtgacttcct tccagaaagc atagtatagg acggaaaaaa 1920 aagtaaattt acwgtagaga aacctgacaa acactacctt agccaggtaa tcaaagttag 1980 catcaacaat tgtaagtcat gttgatagya tatacccttg atatgatgtg ataagaatgg 2040 cacttcacct ccgtggtctt cctcccaaaa acccataacc ccagtctaat catragaaaa 2100 atatcagaca aattccaact gagggacatt ctacaaaata cctgaccagt actcctcaaa 2160 actgtcaagg tcatcaaaaa caaggaaagt ctgagaaact gtcacagcca agaggagcct 2220 aaggagacat gacaactaaa tgtaatgtgg tatcctggat gggatcctgg aacagaaaaa 2280 ggacattagg taaaaactaa ggaaatctga ataaagtatg gactttagtt aataataatg 2340 tatcaatatt ggtttattag ttgtgacaaa tgtaccatan taatgtaaga tgttaayaat 2400 aggggaaact ggatgtgggg tatatgggaa ctctctgtac tatttttgca actttcctgt 2460 aaatctaaaa ctatttcaaa ataaaat 2487 // ID STRIDE_Cpo repbase; DNA; ROD; 196 BP. XX AC . XX DT 06-NOV-2009 (Rel. 14.11, Created) DT 06-NOV-2009 (Rel. 14.11, Last updated, Version 2) XX DE SINE element - consensus. XX KW SINE1/7SL; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; STRIDE_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-196 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 9(11), 3013-3013 (2009). XX DR [1] (Consensus) XX CC Its youngest sequences are >83% identical to consensus. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project. 
XX SQ Sequence 196 BP; 62 A; 44 C; 56 G; 34 T; 0 other; ggcgtggtgg cacacgcctg taatcccagc actctgggag gctgaggcag gaggatcaca 60 agtttgagcc cagcctgggc aacttagtga cttagtgaga ccctgtctca aaataaaaaa 120 taaaaaaggg ctggggatgt agctcagtgc gaaggccctg ggttcaatcc ccagtaccgc 180 aaaagaaaaa gaaaaa 196 // ID RLTR15A_MM repbase; DNA; ROD; 1096 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE Long Terminal Repeat from an endogenous retrovirus - a consensus. XX KW LTR Retrotransposon; Transposable Element; RLTR15A_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-1096 RA Jurka J.; RT "RLTR15A_MM: putative LTR from mouse."; RL Repbase Reports 2(4), 24-24 (2002). XX DR [1] (Consensus) XX CC Present in at least several hundred copies in mouse genome. CC Homologous repeats also found in rat sequences. Based on TG...CA CC termini and putative polyadenylation signal it is classified as CC a long terminal repeat. 
XX SQ Sequence 1096 BP; 329 A; 199 C; 278 G; 277 T; 13 other; tgccaagtcc tgcatggcct ntgtggtagg gacagaatga catggttcct gcccctggaa 60 cagagccagc agggtgtggg cctccacaag tgttgatggt tgctgtgggn aganggctgg 120 ngggcataag gcttgtttgt ataataatgt acatatttta cttangacan attcctcctt 180 tgaggaatgt cctccctgtt aaggttaatg actccatgat aattagagac agcaagagtc 240 caggaggtga aggtgtggct aatgtctcag caaaatggta aatgctgtga ccttcaggac 300 agcccttaag gctgtgggaa agaactctga aaacatgagt tcaaaaatat ataatttctc 360 aactatgcaa aaatataagg atgcaatatg aattatatga ggggcttcac agatctaaag 420 gaacagaggc agctgcacta tgagccagct tgtcagaaag atactaagga agtaaagaga 480 ttgatttang gaatggtgat caaaggagan cccacccagg tttgtttttt tgtttgctta 540 cacaggtgtg gtagaaatgg taatttcagg acagggtccc accccagcta gctttattgt 600 ctgtgcttaa caaaggcagg cagatctctg aattcttttg caatgttaaa aaaaaaaaag 660 tgtgcttgct gtctctttct aagaatcaag gggctggggt catgggatgc tgattcatag 720 gataatcaaa agggaacctg gagtaaatga ctggattgat atgtaaaata aaagactggg 780 cttatgatct gcaagagatg agctagagaa gctatccaga gaagtttttt agaatagcaa 840 agagagctgc ttggagaact gtcacaagca gctntctgac anagaanaga gagancaaga 900 gctgtctgga gatctccaag aatactgaka atctccagag aaagcagaac agagagagca 960 gtctggaaag ctgtctcaag cagaacatag ggccgtcagc ttggacccat gatttgactt 1020 tgagtcgttt gttttcgctg ctcccagaca ccccttctct cagaacccct ctccaagctg 1080 aggctggtcc ttggca 1096 // ID MER102 repbase; DNA; ROD; 332 BP. XX AC . XX DT 30-JUL-1998 (Rel. 6.5, Created) DT 30-JUL-1998 (Rel. 6.5, Last updated, Version 1) XX DE Interspersed repeat MER102 - a consensus. XX KW Interspersed repeat; MER102. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-332 RA Jurka J., Naik A. and Kapitonov V.V.; RT "MER102."; RL Direct Submission to Repbase Update (JUL-1998). XX DR [1] (Consensus) XX CC Over 3000 copies in the genome. Present in Sus scrofa. CC The extreme 3' end similar to MER58 - probably insignificant. CC Potential transposable element - not confirmed. 
XX SQ Sequence 332 BP; 94 A; 65 C; 85 G; 86 T; 2 other; gggttgcaaa ctcaaatgcc tacaggggcc aggcaggtaa cataaatgag tgaagtgggc 60 caggtgggga ctgtggcaaa ctggagagca catgccctgt ctaaaggggg cagcagctgc 120 tactcagctc cagccaattg ttgccatgtg ggaatgtagg cccagtgttg ccagatcttc 180 tgatttttca agagaagcca gaaatctaga tttttatgtg aaatctcctg atttttaaat 240 gttggcaact aatttaaaat ttttawaaaa cactgtgtag gccaaacaaa acatatctgt 300 gggccagatt tagcccatrg gctgccagtt tg 332 // ID MER93b_LTR repbase; DNA; ROD; 373 BP. XX AC . XX DT 16-MAR-2006 (Rel. 11.03, Created) DT 10-APR-2007 (Rel. 11.03, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from placental DE mammals. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; MER93; MER93b_LTR. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-373 RA Smit A.F.; RT "MER93b_LTR - a subfamily of endogenous retroviruses from RT placental mammals."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC mer4 group, 18% divergence from the consensus. XX SQ Sequence 373 BP; 109 A; 96 C; 62 G; 104 T; 2 other; tgttaaaata attaattggg aggccattag gctgaggtgg ctccagcacc ctgggttcct 60 acgtaagcaa accgaaaccc aactcagtgt aaatggtaaa acgaaactta agcttaacca 120 atcagaaacc gccaactaac ctctaactag ggactttcca ctggaatgat ccaaataagg 180 ctactgctcc aactttaacc aatcaaatat tttctttgcc ttgcttccgc gntcacccta 240 taaaagtctt cccctcatgc cccttcagtg gagccctgaa ccacttgtag tctggngctg 300 cccgattcat gaatcgctgt ctgctcaaat aaactcttta aaattttaat gtgcctaagt 360 ttatctttta aca 373 // ID RLTR13C repbase; DNA; ROD; 1004 BP. XX AC . XX DT 26-SEP-1997 (Rel. 2.08, Created) DT 26-SEP-1997 (Rel. 2.08, Last updated, Version 1) XX DE Long terminal repeat of retroviral-like element. XX KW Endogenous Retrovirus; Transposable Element; KW Long terminal repeat of retroviral-like element; ETn; RLTR13C. 
XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-1004 RA Smit A.F.; RT "RLTR13C."; RL Direct Submission to Repbase Update (30-NOV-1996). XX DR [1] (Consensus) XX SQ Sequence 1004 BP; 255 A; 225 C; 220 G; 290 T; 14 other; tgctacaccc actccagtga agtctgcctg acaggcccaa aagcctggac acagtcccga 60 ggcaaaaagt ttcacggaaa cttagtctcc tggaaccttg gcatcccgta kcacgaatgc 120 tccaaacttg tccctgcgtt agttggtgaa tggatctgcg tgctttcytt cagtattgtt 180 ttattatagg tgcctccaag gaatgatttg wcaaatasct aagctagatt gaactttgct 240 tcctggcctt cwccagtgtc ctaggcagtc cttgagccga ccctgggctt ggaattcagt 300 aagaatgtta cycattcagt caaaatgcct ccagtgttga tagagagata gtgaaagaaa 360 tcaggcatta cagtttgccg gataggagct agaaatctca gggcwagttc aacaaagtaa 420 actcagaatt ctcattacag wcatgaatgg cagcctacat tacataatag tagaaagaag 480 gtgggttgtg gtttaaggag gaactagagt attgtattat tcatgaaaga gttagtagaa 540 cagaactccc ctggtgcatg tttttcacta ggcccagagg aatagcataa ttaggtattt 600 actaagggac ccctggtggt gggtttaaca atagactagc attgcacctg ggtgttagcc 660 ttttcacctg gctcctaaga atggctgatt ttagatagag ctgttnttgt cccagttgta 720 aatantgccc ggttcccgtc accttttatg twtgcattcc tctgttttgt gtaagagtca 780 ttaagcatcc ggtaacctca ttgtaccttg ctgatgtgtc acctaacttc cctattttct 840 tctgtatwaa aagtytgatg ctcgatttga caaattacat tcagatacac actctctctg 900 tgtccatgtc tgtttgtcat tcatcgccga ctccttgccc acctgtatac cgagaccctg 960 attcmcacgg atcgaggagg gcccactgag cccagtctgc ggca 1004 // ID MLT1A repbase; DNA; ROD; 374 BP. XX AC . XX DT 01-OCT-1995 (Rel. 1.09, Created) DT 25-OCT-2000 (Rel. 5.09, Last updated, Version 5) XX DE Mammalian long terminal repeat (MLT1A subfamily) - a consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR; KW retrovirus-like MaLR element; MaLR family; MLT1A. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. 
XX RN [1] RA Smit A.F.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21, 1863-1872 (1993). XX DR [1] (Consensus) XX CC LTR of MLT1A retrovirus-like MaLR element. 5 bp target site dups. CC Average divergence from consensus 17%. Intermediate subfamily CC between MSTD and MLT1C. CC The subfamily MLT1A differs from this consensus by two small CC inserts and a few substitutions. XX SQ Sequence 374 BP; 96 A; 87 C; 97 G; 93 T; 1 other; tgctatggac tgaatgtttg tgtcccccca aaattcatat gttgaagccc taatccccaa 60 tgtgatggta ttaggaggtg gggcctttgg gaggtgatta ggattagatg aggtcatgag 120 ggcggggccc tcataatggg attagtgccc ttataaaaga gaccycagag agctcccttg 180 ccccttccgc catgtgagga cacagtgaga aggcgccgtc tacgaaccag ggaatgagcc 240 ctcaccagaa actgaatctg ccggcgcctt gatcttggac ttcccagcct ccagaactgt 300 gagaaataaa tttctgttgt ttaagctacc cagtctatgg tattttgtta tagcagcccg 360 aacagactaa gaca 374 // ID ERV2B-CPo_LTR repbase; DNA; ROD; 330 BP. XX AC . XX DT 20-JUN-2009 (Rel. 14.07, Created) DT 20-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Long terminal repeat of ERV2 endogenous retrovirus: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERV2B-CPo_LTR. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-330 RA Jurka J.; RT "ERV2-type endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1376-1376 (2009). XX DR [1] (Consensus) XX CC >90% identical to consensus. 
XX SQ Sequence 330 BP; 64 A; 125 C; 62 G; 79 T; 0 other; tgtagggagc ggcgcgacag ccccattgct gcctccccgc ctctctctcc tatgggcaca 60 tgatgcacct gcctacatta accataatcc tctcctgctt acgtcacata cccggacgtg 120 acgatgaccc atcagaaacc accccgtagc ctcttcctgt cccaccagcc cgcccttggc 180 ccataaagga tgcgaccact tcctgaataa acgaggcttg atcggattct ctcgacttgc 240 ttcaatctct ctttgtctcc caccctgttc tattcatccc cgttccccct ccaggtatac 300 accccacgga ttgatccgcg ggccggatca 330 // ID ZOMBI_B repbase; DNA; ROD; 468 BP. XX AC . XX DT 13-JAN-1998 (Rel. 6.4, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE Medium reiteration frequency repeat; non-autonomous DNA DE transposon - a consensus. XX KW Non-autonomous DNA transposon; MER46; ZOMBI_B. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 468-338 RA Smit F.A. and Riggs D.A.; RT "Tiggers and other DNA transposon fossils in the human genome."; RL Proc. Natl. Acad. Sci. USA 93, 1443-1448 (1996). XX RN [2] RP 468-338 RA Jurka J., Kapitonov V.V., Klonowski P., Walichiewicz J. RA and Smit F.A.; RT "Identification of new medium reiteration frequency repeats in RT the genomes of primates, rodentia and lagomorpha."; RL Genetica 98, 235-247 (1996). XX RN [3] RP 1-468 RA Kapitonov V.V. and Jurka J.; RT "Jerky gene - a recruited transposon?."; RL Direct Submission to Repbase Update. XX DR [3] (Consensus) XX CC 26 bp terminal inverted repeats, TA target site duplication [1,2]. 
XX SQ Sequence 468 BP; 153 A; 99 C; 71 G; 126 T; 19 other; caggttgagc atccctaatc tgaaaatccg aaatccaaaa tgctccaaaa tctgaaactt 60 tttgagcacc aacatgatgc cacaagtgga aaattccaca cctgacctca tgtgataggt 120 cacagtcaaa ayacaatcaa gactnnncna gcnnctncng ttgctnttnc tgccagncaa 180 cnacagnttg tgcacctngn tggcaragan actgacacat ttgctttctk atggttcagt 240 gtacacaaac tttgtttcat gcacaaaatt atttaaaata ttgtataaaa ttaccttcag 300 gctatgtgta taaggtgtat atgaaacata aatgaatttc gtgtttagac ttgggtccca 360 tccccaagat atctcattat gtatatgcaa atattccaaa atccaaaaaa atctgaaatc 420 caaaacactt ctggtcccaa gcatttcgga taagggatac tcaacctg 468 // ID ERVB5_3-LTR_RN repbase; DNA; ROD; 516 BP. XX AC AC127785; XX DT 11-FEB-2004 (Rel. 9.01, Created) DT 11-FEB-2004 (Rel. 9.01, Last updated, Version 1) XX DE Rat endogeneous beta retrovirus ERVB5_3, LTR sequence. XX KW LTR Retrotransposon; Transposable Element; LTR; KW endogeneous betaretrovirus; RnERV-B5_AC127785; ERVB5_3-LTR_RN. XX OS Rattus norvegicus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Rattus. XX RN [1] RP 1-516 RA Baillie J.G., Van de lagemaat N.L., Baust C. and Mager L.D.; RT "Multiple groups of endogeneous betaretroviruses in mice,rats and RT other mammals."; RL J. Virol 78(11), 5784-5798 (2004). XX DR Genbank; AC127785; Positions 6743 7259. 
XX SQ Sequence 516 BP; 137 A; 133 C; 101 G; 145 T; 0 other; tgttgggagc agccttgaga tggcgaaagc agccttcaag ttaaaaagca attatgttct 60 aaccttatgc tggaataata agcaatattc ttcaggttaa aaagcaatta tattttagcc 120 ttatgataga acaaaaaaca atagccttac gttagatcaa agcagcacct gcagctctac 180 ataaattcaa tagatagcat aaatctaggt gtcagatcac tgagtactgg tggtcaggca 240 ctgacggtca ggtcactaag caacccaccc tgctgttccc ccgcttggct gatttacctg 300 gccaatatcc tggttatcaa gccaggccca ttctttgtgg tcacccaacc cctccccaca 360 ttttgcttct accagtatat cttcagcctg agaaaaatta aaaattgtca gcttgatcag 420 acttcttgac ttgctgtccg ttctttgcgt ctcttgtccc ccattctctc ctaggtgtac 480 cccagaccct ggtcgactgc cctgcgggtc ggggca 516 // ID MER135 repbase; DNA; ROD; 171 BP. XX AC . XX DT 29-JUL-2006 (Rel. 11.07, Created) DT 17-AUG-2007 (Rel. 11.07, Last updated, Version 2) XX DE A conserved, interspersed palindromic repeat - consensus. XX KW Transposable Element; Nonautonomous; MER135; conserved. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-171 RA Jurka J.; RT "MER135: Conserved mammalian repeat, probably derived from a RT non-autonomous DNA transposon."; RL Repbase Reports 6(7), 388-388 (2006). XX RN [2] RP 1-171 RA Gentles AJ., Wakefield MJ., Kohany O., Gu W., Batzer MA., RA Pollock DD. and Jurka J.; RT "Evolutionary dynamics of transposable elements in the RT short-tailed opossum Monodelphis domestica."; RL Genome Res 17(7), 992-1004 (2007). XX RN [3] RP 1-171 RA Jurka J., Kapitonov V.V., Kohany O. and Jurka MV.; RT "Repetitive sequences in complex genomes: structure and RT evolution."; RL Annu Rev Genomics Hum Genet 8, 241-259 (2007). XX DR [1] (Consensus) XX CC This is a relatively abundant repeat: ~500 copies in the human CC genome. Its palindromic structure suggests that this is a CC non-autonomous transposon-derived repeat. It shows distant CC similarity to microRNA ppa-mir-224. 
XX SQ Sequence 171 BP; 50 A; 39 C; 30 G; 51 T; 1 other; taaactactc cctggtgtaa attagacact tttggaggtg cattaactct ttcaggccac 60 tagggtacca tttagtaaat tactgctcca rtgcactaaa tggtacccta gtggcctgaa 120 agagttaatg caccttcaaa aatgagcaat ttactccctt cttaccagta g 171 // ID IAPEYI repbase; DNA; ROD; 7513 BP. XX AC X87638; XX DT 19-NOV-1998 (Rel. 3.1, Created) DT 19-NOV-1998 (Rel. 3.1, Last updated, Version 1) XX DE An internal portion of IAPE-Y LTR-retroelement. XX KW LTR Retrotransposon; Transposable Element; gag; retrovirus; pol; KW env; IAPE-Y; IAPEYI; LTR-retroelement. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-7513 RA Plumb A.M.; RT "IAPEYI."; RL Direct Submission to Genbank (30-MAY-1995) M.A. Plumb, MRC RL Radiobiology Unit, Chilton, Didcot, Oxon OX11 0RD, UK. XX DR GenBank; X87638; Positions 390 7902. XX CC This is an internal portion of IAPE-Y retroviral element. Its CC LTR is deposited in RepBase as LTRIAPEY. 
XX SQ Sequence 7513 BP; 2222 A; 1569 C; 1692 G; 2030 T; 0 other; agtggtgcca aaacccggga actgtcatca caccagcgca aggaagaccc tcattcacgg 60 ggcagaatca gaactgtggg acaaaaggct ctacaaggta cgtacaatct tgaaatttct 120 ccagagttag aagccctttt gattgtttgc atggcgatta cgcttttcct tttggtttcg 180 ttttgtttcc accactggaa ctgttgactg tttcttagtc gcagtcagtc ctggacagcg 240 cctggacagc aaccaatttc aggcccggtc aggcctggac agcggcagta tgggaatctc 300 ccattctata gtagtgggct tacgctcagt cctgaagcag tgtggcctaa aaatcgccac 360 taagacctaa gagggatttg tcagagagat agaccgtgtt gcaccatggt atgcttgctc 420 attgtcatta actgtcgcct cttgggacaa actgaaagga gatctagtta ggaaacagca 480 gaatggcaaa cttaaagcag ggattatgct gctatgaaat tggtgaaatc atgtttaaca 540 gatgaggatt gtcagcaaat gttagaagca gggcagaaag ttctggacga aattcaagaa 600 agtctatcag aggtagatca gggagagaga gtaaaaatag agaggaaaca aagtgcactg 660 aagaatttag gcctttccat gagccttgag accgaggaaa agagaatgtc agggaaaata 720 cctggggaga gattagaaaa agggatggaa agggagagaa gaagggagat cgagctggag 780 aggcacataa ggaaagaagc ctctacccac cactagatga gtttaagcag ctagctgtca 840 gtagctcaga atcagataag gaacttagct cctatgagga aacagactta gagaagaagc 900 gacaggttgt gagggagaaa gataccagtc agatgaaaga cgagttaatc agttacagat 960 aacatgaaag cagctagaga caaatgtcaa atcattgcgc agtcttcgga tatgcaaatt 1020 cagggttcta gtgcacctcc gccttatgtg cagaggcacc attctgagcc ttatgtgcag 1080 aggtatcatt cagactcatt tatactaaaa gaggaacaga ggaagataca acaggcattt 1140 ccagtttttg aaagagctga gggaggggct gttcacgctc aagtagaata tatacagatt 1200 aaggagcttg cagaaacagt ccgtgttatg gagtcagagc taatttcact gtagcacaag 1260 ttgaaaggct tgcaactcac actatgactc ctggtgattg ccagactgta gtaaaagctg 1320 cagcccccag tatggggatg tatcttgaat ggaaagtttt gtggcaggac tcctgtcaga 1380 cgcaggcaag ggccaatgcc accatggaag gaaaccaaag gacatggaca tttgaattgc 1440 ttacaggtca gggacagcat gctgctaacc aaacagatta ttattgggga gcatacgccc 1500 agatctcagc tgccgctgtt aaggcatgta aggcactctc taagaaggga gaggcaagtg 1560 ggcatttgac caagatcatt caggtctacc aggagtcata ctcaattttt gtggctagaa 1620 tgacagaggc agcagggaga tattttgaga 
tacaaaagca gctatgcctt taattgaaca 1680 gttgatatat gaacaggcta cccaggaatg cagggcagct attacacctg gaaaaagcaa 1740 gggactgcag gactggtcga aggtttgccg tgaacttggt gggccactca ctaatgcagg 1800 tttagcagct gccattctac aagtgcaaaa atgccctgac atggctgagc tcaaactctg 1860 ctataacgtg gcaaaccggg acacttaaaa aaggactgta gagcccttga taaaaggaga 1920 gtaccaggat tgtgcactaa gtgtggaaaa ggatatcatt gggctagcgc ttgtcgttca 1980 atcagggaca ttcgaggcag gctcctgcag ccaggattcc ctcaagcagt agataatgac 2040 accatttcaa aaaaccatta cagggccctc aggtcttagg gccgaaaaca tacggaccac 2100 gacaggcgaa aggtggagcc cacagtccgc aggcaacaga aggctcagct ggattggacc 2160 tccgtaccac cacccaattc atcgtgatcc tcaaatggga gagcagccta ttcctactga 2220 ctctaaagga cctctgccac tcctgggagt gtcggcctaa tattgggcag ggcttccctt 2280 acactacaag gtcttattgt tcaccctgga gatgtaggtc aagattatga agggggactt 2340 caagttctct gttcctgtcc tcagggtgtc gtttctattt cacaaggaga tagaatagct 2400 cagttagtaa ttcagccaag cctacatggc tgttttccct cttctggtgt ccctcaagct 2460 accagaggga ttggttctac tggaaatgat tcggcctatt taattatgcc tttagattcc 2520 aggccttctt tagagttagt tatagaaggg aaacaattta aggggatttt aggcacagca 2580 gcagacaaaa gtattatttc ttctcactgg tggccgaaaa cttggccagt tattcagtca 2640 tcacattctt tgcaaggttt cggttatcag tcctgtccca ctattagttc ccgttctttg 2700 agctggacag cgcctgaagg tcaaatggga tgattcactc cttatgtcat accactccca 2760 gtaaatctct gggggaaaga catcttacag gacactagga ctgaccttaa ctaatgcatg 2820 ctcgccacaa gtcgttcgta agattaagaa aatatgctat aaagaaggaa ggggattagg 2880 aaaaggagag cagggtaaac ttgaacctat ccctcaaaaa ggtaataatg ggagacaagg 2940 tttggatttt ttctagagac agctgttgag ggttccatgc ccataccatg gcttacagag 3000 gaagttgtat gggttcccca atggcctctc tcctctgaaa aattagaagc agccacaaaa 3060 ctaatttctg aacagctaca cttggggcat ttagagcctt ctaactcacc ctggaataca 3120 cctattttcg taattaagaa aaaatctggc aaatggcgct tattacatga tcttagagca 3180 attaatgcgc aaatgcatct atttggacca gttcagcaag ggctaccatt actttctgcc 3240 ttacccaaaa attgggaaat tataatccta gacattaaaa attgcttttt ctctatacct 3300 ttatgccctc aagataggca aagatttgca tttacgatcc 
cagccattaa ttacttagag 3360 ccagatcaaa aataccaatg gaaggtccta cctcagggga tggcgaatag cccaccatgt 3420 gtcaactgta tgtacaattg gcacttaagt cagttagaga acattttcca tcattacagg 3480 tgataattta tatggatgac attttgattt gccataaaaa ttcagagtta ttgcaagatg 3540 cataccttat attaataaaa acgttagggc aatggggatt gcaggtggcc accgaaaagg 3600 tgcaagttgc tcgaatggga gccttcctag ggtcacttat ttatcctgac aaaattgttc 3660 ctcaaaaatt agagattcgc aaagatcaac tacatacctt gaatgatttt caaaaattgt 3720 taggagatat taattggctg agaccctttt tgaaaattcc atcagcagaa ttaaagccct 3780 tatttgacat attagaggga gatacccata tctcctccca cagagcactt acctcagctg 3840 catgtcaagc tttacaaatt gtagaaaagg ccctacaaga tgctcaatta cacgcattga 3900 cgagtcaaag tcatttgaat tgtgcatatt aaaaactgca cagttaccaa cagtggtctt 3960 atggcaaaat gggcccttgt tgtgggtcca tcctaatgct tcccctgcaa aaatcattga 4020 gtggtatcct aatatgccag ttgctcaact tgcacttagg gggataaaag cagccattac 4080 ttatttcggg taagaacctt atatagtaag tgtaccttat acttctgctc aagttcaaac 4140 cctggcagca acaaccaacg attgggcagt cttggttgcc tcctattcag gacaaattga 4200 taaccattat ccaaaacata caattttaca atttgcctta agccaagcta tagtgtttcc 4260 aataataaca gtcaaacacc cacttccgga tggggtagta gtgtatagag atggatccaa 4320 atctggtata ggtgcatatg tagtaaatgg ccaagtaaca tctaaacaat ataatgattc 4380 atcaccccat gttgtagagt gtttggtagt tctagaagtc cttgaaactt tcctgggacc 4440 gctcaatatt gtatctgatt ccttatatgt gttcaatgca gttaatatgc ttgaagttgc 4500 aggcttaatt aaaccaacta gtaagcttgc tcacattttt caaaagattc agtcagcctt 4560 gtatacagaa gacacttgtc tatattactc atgtttgagg tcattctggc cttcctggcc 4620 tcatgtctca tggaaatgac ttagcaaaca aagccactag aatcgtggct gctgctttgt 4680 cctcacaggc agaggctgca agagaatttc atgaatgctt tcatgtgaca cctgagacat 4740 tacaccaccg ttttaattta accaggaaag aatctcgtga cattgtcacc caatgtcgaa 4800 actgttgtca atttttacct actcctcatg taggggtaaa ccctcgtggc gtcaggacat 4860 tacaggtctg gcaaatggat gtcagtcata tttcctcctt tgggagaaat caatatttac 4920 atgtttctat tgacacctgc tcttgtgtaa tatttgccac acctttaaca ggtgaaaaag 4980 actctcatgt tatacagcat tgcctagaag cgtggagtgt gtgggcaagc 
cacaattcta 5040 aaaacggata atgagccagt catataattc tacttaattt caacaatttt gtcatcaaat 5100 ggatgtcact catttgactg gccttcctta taatactcaa ggacaaggca ttgttgaacg 5160 tgcccacccc aaactcaaat cttataaaac agaaagaggg agttgatgag actctgacct 5220 cagtgccaag agtagcagta tctatggcac tctttacact tatttttttg aatctcgacg 5280 agcagggaca ctctgcagct gagcgacata gctcagaccc tgatagacca aaagaaatgg 5340 tcaaagggaa ggatgtttta actggtcagt ggaaagcgcc ggatcctatc ttaatcagat 5400 cctggggagc tgtttgtgtt ttttcacaga gttaagacaa cccattttga ctgacggaat 5460 gactcatccc caggatattc atcaaggatg gtgcagaaga tgctgacctc ccgaggactg 5520 ttcctgatcc tgacaatgct gaacttgtct caggttccta gtataatggg ccagcagaga 5580 tgggctattc tctcaacttt ccctaaacca atgccagttc gctatgatgc tatagttttt 5640 ccaaaattct ttactactga taaaacagtg gatttgccat attaccctat gatcccaacc 5700 cgagcaccat taggagaaaa tcgcacttta ctagaacatg gttctttatg ttttcgaatt 5760 aatggaccag gaaattgtat caacctcaca gcctgagctt tggggaagtt taatgagcat 5820 agaggaggtt gcgtgacaca acccaagata cttccaatgt agagataacc gttacaaatc 5880 ggaccttttg gcacgaccta aattgggtta atggtacatt tctaccacct aacttttcag 5940 acaaaaaacg tccacaccaa ccaaaaatag cccctcattg tagtttggaa gatgaagggc 6000 tgatcctgcc attgtctgat tgtcaatcct ccatcactca ttgggcagat cagagtaaaa 6060 ccttttcctt ttctcccaac atgatagata acccagagaa ggaatttgtt atgaaaaagg 6120 gacttttgat actggacatt agaatgcatc cctttaacaa gtgggtgctt tgtggagtca 6180 atggcagttg tacagaactc aatcctttga tcctttatcc agggaggagc agttggaaag 6240 gcttctttta ctggcatctc aagatttgct cagtattggg gaatacatga tgcctccctg 6300 gactcttatg gatatagtaa taccagatgc agagatcact gggtttaata aaactttggt 6360 aaaccagatt aattatccat ctaccacagt ctgtgtttac cccgctttct tgtttattct 6420 ttcaaatgat tcatttgaaa tctgttccaa tgataattgt tggatttctc aactttgggt 6480 agtaacaaag aacaccgtgc catggtggca cggatcccac gttggatctc tgttctggtg 6540 gatacaccct ccatcttatc tatgttttga aaaaagagag attttggcat tactgctgcc 6600 gtgatcatag caatttcagc cagtgcagct gctgctacag ctgcagggta tgctatggtt 6660 agtgcggttc aatcaggcac aaaattaaat cagctttcag cagatctggc tgatgccatc 
6720 actgtccaag cttctgctag taccaagtta aagggagggc tgatgacttt gaatcagtgc 6780 cttgacctgg cggaagaaca gataggtgtt ctatgaagat ggcccagttg ggttgtgaaa 6840 gaaaactgga agctctgtgc attaccagtg tccaatatga aattttactt atgcagctaa 6900 tttgtctaga caggtttctt tatatcttgc aggaaacttg tctgaaagat tcgatgagac 6960 ccttgaggtc cttagagcgg ccgttctaaa gatcaactca acacgagtgg acctgtcatt 7020 gacagaggtc ctctcctcct ggatttcact gccttttctt attttaaaga atgggtgggg 7080 gtaggtttat ttggcgttgc acctgctgtg gacttgtggt catgctctgg ttggtttgca 7140 agctcagaac ccaacaaaca cgtgacaagg ttttgataac ccaagcactt gctgccatca 7200 agcaaggggc cttccctgaa atctggctat ctatgctcaa aaattaattg acattggtgg 7260 ctgtcctctt ttatagagtt cgccctagtt tatttagtag ttaatgtcac atagcactgt 7320 ggtatccagg gaggggcaac tttcctcatg cacagatcaa cctaagacac agggcccggt 7380 ggcgataggg ttaccctatg cacggaaagg ctgtgcactt ggaggaatga cctaagacag 7440 gagccacagt ggatgggcaa agttacacag gcctagacca gcctcaagtt ttaataaaat 7500 aaaaagggga aga 7513 // ID LTR4B_Cpo repbase; DNA; ROD; 537 BP. XX AC . XX DT 21-OCT-2009 (Rel. 14.11, Created) DT 21-OCT-2009 (Rel. 14.11, Last updated, Version 3) XX DE Long terminal repeat: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR4B_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-537 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from guinea pig."; RL Repbase Reports 9(11), 2874-2874 (2009). XX DR [1] (Consensus) XX CC >93% identical to consensus. 4 bp TSD. 
XX SQ Sequence 537 BP; 148 A; 143 C; 122 G; 124 T; 0 other; tgaaggacca gaaatttaaa ggttaaaaac aagatgctgt ttctcagcta gaaacagcac 60 tgggcctcgt gactcagtcc taactcagcc ccgagggagg ataaaaagaa aacatctgcg 120 gctgtgatat gctcgggctg tgctgtgcca gggaactcag agatagagcc tgccgattac 180 gctaatcagg gttgtaaaac atgtcaaact cgagcctaca gaaaccaggt gtcctaaaga 240 taagttgtgc agaacaaaga cagttaatta actctgaccc gctggtcacc ccttatgagc 300 tgaccaacca caaataggta aactgccgac taccccgctc taactgctat gattggcttc 360 tgtaaaaaaa tgcttgcttt gcggtttttc ccccttaaaa gcacctgcct cgcctacact 420 cggggccctc cgttcccacc tgtttggagg accccgtgcg cacggaacaa taaagacccc 480 tttgttctta catgagagcg gagcccttgg agtctccttt gcgacaccgg tcttaca 537 // ID L4 repbase; DNA; ROD; 1960 BP. XX AC . XX DT 27-JUL-2006 (Rel. 11.07, Created) DT 27-JUL-2006 (Rel. 11.07, Last updated, Version 1) XX DE RTE Non-LTR Retrotransposon from mammals. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; L4. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-1960 RA Smit A.F.; RT "L4 - RTE Non-LTR Retrotransposon from mammals."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC 35-40% substitution level. A real challenge. The ORF from pos CC 1-1773 matches the N-terminus of RTE class pol proteins CC (specifically BovB and Expander). There may be some fortuitous CC double frameshifts after pos 1250, as similarity to most RTE CC elements ends there. Bp 1432-1593 match the R2 element CC RNAseH-domain again. L4 probably predates mammalian evolution. CC No SINE with it yet. The 3' UTR contains a conserved core of 40 CC bp, pos 1711-1751 in the current incomplete consensus. This CC sequence tends to correspond with regions that are highly CC conserved between mammals (some resemble exons upon first look). CC The substitution level in this region is on average only 22%, CC and reaches as low as 6% (2 substitutions). A candidate for a CC widely exapted repeat fragment?. 
XX FH Key Location/Qualifiers FT CDS 1..1770 FT /product="L4_1p" FT /translation="DSVSMVPSGWTQSVVYSIFXKGXSLPPPNYRPTXLLD FT IPSKXYASFLLDKLQVWVSQANILHEEQXGFRHGXSTIDHCYTLYHLVEKS FT VRNNVRLFAAFIDLSSAFDSMDRNQLWAKLYELNIDSWLLMLLQSLHLNTT FT XRIRVGRNGLLTDQIPILSGIKQDCVLPTLLFNLYLNNLIXLLDELDACPP FT AIANRKTSILLYADDMVLLSRTRSGLXRQLXLLXNYCQKERLXINXXKTKI FT IVFGRHSPTFXWLISNNSIQQVNSFSYLGVHFAANSSWRAHQEAXLLKIRY FT STGALLRFFYGRGGRLVTPALKIFQAKIISAMLYGVELWGLDRAFVXVLEQ FT IQNCFLRKILALPAGTPAAHLRAEVGWPSIRARXXVRLLNFHXRMSTLPPA FT RLVSKAYGSXLNQQHRIPALQALVREXNLELXAIQLLSKARLREXIFKEDX FT LKDMLSIHSSRYSKFYPWIKLDHQKATYLDHISLAPCRXAFTELXFNVMPS FT AFIEGRYKKQPYEXHFCIXCKXVVEDIVHYITQCPLYKXPREKFLLEFSAR FT KSFVSPEELVCFLLSDNENYVTXHVSLFALAARKLRAKFXAQP" XX SQ Sequence 1960 BP; 502 A; 445 C; 369 G; 585 T; 59 other; gattccgtna gtatggtccc atcnggntgg actcaaagtg tagtctattc tatctttnaa 60 aagggcaant ctctacctcc cccaaattat agacctactn atttactgga cattccntcn 120 aaantntatg ccagtttcct ncttgacaaa ttgcaagtct gggtttctca ggccaatatt 180 ttacatgagg agcaggnagg ctttaggcac ggctnttcca ctattgacca ttgttatact 240 ctttatcacc ttgtggagaa atctgtcagg aataacgtaa gactgtttgc agcttttatt 300 gatctttcct cggcctttga ttctatggac aggaatcagt tatgggctaa gttgtatgag 360 ctcaatatag actcctggct actgatgctt ctncaaagcc tgcatcttaa taccactnca 420 agaatcagag taggtaggaa tggtctcttg acagatcaga ttccaattct aagtggtata 480 aaacaggatt gtgtcctgcc taccctcctt tttaacttgt acctnaataa cttgatacng 540 cttttagatg aactggatgc atgcccncct gccatagcaa acaggaagac aagcatcctc 600 ctctatgctg atgacatggt tttattatca cgaaccagga gtggcctcaa nagacaactg 660 gncctgctgn ctaattactg tcagaaagaa cggctcaana tcaactntnc taaaactaaa 720 atcattgttt ttggcagaca ttctccaaca tttaantggc tnatatcnaa caactccata 780 cagcaggtca actcattcag ttacctgggg gtacattttg cagctaattc atcctggcgg 840 gctcaccagg aagccatnct gctcaaaatt agatattcta cgggagcntt actgagattt 900 ttttatggcc gaggcggccg attggtaaca cctgctttga aaattttcca ggccaaaatc 960 atttcngcca tgctctatgg cgtggaactc tggggnctcg atcgagcatt tgtccangtg 1020 ctngagcaga tccaaaactg 
cttcctgagg aaaatcctgg ctttacctgc aggtactccc 1080 gcggcccacc tccgtgcaga ggtgggatgg ccctccatcc gggcaagaat ncnggtcagg 1140 cttctcaatt ttcataanag aatgtcaacc ctgccacctg ctcgtctggt ttctaaagca 1200 tatggatctn ccctcaatca gcaacacaga ataccngcac tccaggcnct tgtcagagaa 1260 tncaaccttg aactgnctgc tatccagctc ctgtcaaaag cccggctgag agaantgata 1320 tttaaggaag atngtctaaa ggacatgcta tccatccatt cctctaggta ctctaaattc 1380 tatccttgga tcaagttaga ccaccagaaa gctacatacc tggaccacat tagcttagct 1440 ccctgcagaa ntgccttcac tgaattgnac tttaatgtta tgccctcggc ttttatcgag 1500 ggtcgntaca agaaacagcc atatgaaann cacttctgca tttnctgtaa aantgttgtt 1560 gaggacattg tccattacat cactcagtgt cccctatata aagncccacg tgagaaattt 1620 cttttagaat ttagtgccag gaaaagcttt gtctctcctg aggaactggt atgcttcctc 1680 ctttctgaca atgagaatta tgtaactnat catgtttccc tttttgcctt ggctgccagg 1740 aagctcagag ccaaatttga ngctcaacca tagcaactat gtaggncctt atggcctgca 1800 tatttgtgtc cttgnttttc tattggtctt gaccttctat tctactntgt tttccttgtt 1860 cctgattttt taaatttata ttttttaatt ttattgcgtg ttttntgtta taagctacct 1920 caaatccttt gtggaangag gcgggntata aataaataaa 1960 // ID RMER6C repbase; DNA; ROD; 671 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW RMER6C. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-671 RA Pavlicek A. and Jurka J.; RT "RMER6C - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). 
XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Individual copies are ~89% identical to the CC consensus. 6 bp TSDs. XX SQ Sequence 671 BP; 235 A; 117 C; 182 G; 137 T; 0 other; tgtagaggaa ttttaatcca gcaacatggc tgctctggca agggatcaca tccaaagacc 60 ttctaggtct gtgtgatctg gctggaatgc agacacacct ttaatccctc tggctggaat 120 acagacatac ccttagtaca cacctttaat cccaaacaat gaaggtaaag ttagtttgta 180 gaaggaagca gccatgtttg aaagtgatgt ctaattgagg ggcagacaaa gtgatgaatc 240 agagaaagat ttgacagaat gagtcagaga taggatatgc ccaactctca cgagaacagc 300 acaggaaaga gaggctactt aagagcagca cagagagaga gggaggttgg gaggcagttt 360 taccaggaca gttttacaga gacaggttgc agagagaagt agaggcaggt aaagacagaa 420 taagccagag aatgagaagg agccagaaga ttagaacaga ttgccagagt tagtttgagg 480 ccaagcagag caattcagtg agaagctgag agaagccaga ttgaatcagt cagcttggag 540 aggagtttga gccagaacag ctgagttgaa ccagccagcc agagttcaga aagaactaga 600 aagggtgagc ttattcagca gtaagcctcc aagatgacaa ttacatctgg cgaataaaag 660 ttacttttac a 671 // ID MER63C repbase; DNA; ROD; 930 BP. XX AC . XX DT 18-APR-1997 (Rel. 2.03, Created) DT 28-JUN-2000 (Rel. 5.05, Last updated, Version 4) XX DE MER63C repetitive element - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; HAT superfamily; MER63C. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-930 RA Smit A.F.; RT "MER63C."; RL Direct Submission to Repbase Update (30-NOV-1995). XX RN [2] RP 1-930 RA Smit A.F.; RT "Direct submission."; RL Direct Submission to Repbase Update (31-JAN-2000). XX DR [2] (Consensus) XX CC Putative internal deletion product of a hAT-DNA transposon, CC sharing CC with these the characteristics of 15 bp terminal inverted repeats CC and 8 bp target site duplications. Copies on average 23% CC diverged. 
XX SQ Sequence 930 BP; 306 A; 160 C; 159 G; 303 T; 2 other; cagtggtgtg ctggtaaatg tttaacaact ggctctctgg gggaaaaaat gtatgtatgt 60 atatacatat ataagtttat tataaatttt actgatataa aggatgtgta gcacacaatt 120 tacaaataat aataaaatat acaatactct ttattgtaaa ttccatatag ccagttgatt 180 ctcacagaat gcttttnttg aattttgccg aactcctnca tctatagcca acctatggtt 240 gcaattgacg aacaagtgta gttccaacat gaatgttggt tgatattttc gtttacatta 300 atgagtaaga cgaaagtgaa acaacgaaga cgtatgttgg aacttcactc attcgtcaat 360 gatgtgagtg acttctttgc tgaatcggat aatagttttc aaatactgga agaatatttc 420 ctcaattttt tgtgctattc acaatgtaac ggctacagac acgacacact tttaagttta 480 atctgcatta ttaacatttt ctccatcact ttcttaagtc tagacaatca acaaaacaat 540 aaatcaagcc ctgatttgta gcgtttgcca atttccgtgg tgtaaatact cccaccatgg 600 ccgatttcaa gctaccaacg tgacgtcact gaacgcggag ttgggaagag atgcgcagta 660 gcacaccatt atatagtatt tccaccatac agatacaata gacgtaaata acctcaagag 720 catagataat agtaaaatgt agtaaaataa ttaggaagtg atgagttttg agtatttatt 780 acctttgttt ttaatataat ttatttaatt gtaagtttat ataatttaat ttttaataat 840 ggctgtgttt aacaaccggc tcgcaaaatt cctgaaaatt taacaatcgg ctctcgcgag 900 ccggtatgag ccggctccag cacaccactg 930 // ID LTR2C_Cpo repbase; DNA; ROD; 346 BP. XX AC . XX DT 05-NOV-2009 (Rel. 14.11, Created) DT 05-NOV-2009 (Rel. 14.11, Last updated, Version 3) XX DE Long terminal repeat: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR2C_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-346 RA Jurka J.; RT "Long terminal repeats from guinea pig."; RL Repbase Reports 9(11), 2873-2873 (2009). XX DR [1] (Consensus) XX CC ~89% identical to consensus. 6 bp TSD. 
XX SQ Sequence 346 BP; 90 A; 97 C; 89 G; 70 T; 0 other; tgttgggagc cgttgaagct ggaaattgac cacacgtgca aggctggagc atgaccatcc 60 caagaacaaa gagcgcctgg ggcaagccac gtgaagaagc tgcttcagcc cagagaacac 120 ctagccccaa gacaatctga acagtgggtt gctaggacaa cagggtgtca tgtgacatta 180 agttgctaaa aggtataaat aaacgcccct gcagtgaggg gccagtacct ttcctcccat 240 caggaggata agggctccac ctgaccccag ctttgtctct tcttattttc tcatcctctc 300 gcggtccttc accccaagaa cctcaggagc cgcatgggtc gcggca 346 // ID MURVY_LTR repbase; DNA; ROD; 541 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW MURVY_LTR. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-541 RA Pavlicek A. and Jurka J.; RT "MURVY_LTR - a family of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Individual sequences share ~94% identity with CC the consensus. 4 bp TSDs. 
XX SQ Sequence 541 BP; 133 A; 108 C; 175 G; 125 T; 0 other; tgtaagaccc ccgaagctgg gcctccgctc aagtcccggg atgagctggc cagcacccca 60 gtgacccgag agaaaaccac acctgatgca aactgcaaga ggttttatta tcagctagct 120 gaggacgaag tccctccgcc gtgcagggca ggggagttcg accctgagcg gtgacagtag 180 ggggctttta acagctagct gaggaattgg gaggagagtt gggaggagtt gggaggagtt 240 gggaggagag ttgggaggag agttgggagg agagttcttc tcttccgggt gtccttggtg 300 aaatttacaa ggcaggagtg gggagtgagt tctttctgaa tttttatctc cgtgtcctcg 360 gcacaggaag gcaggcatct ctggttgtac agtatagaaa caaaaaggct tttttgctgg 420 ctatagagaa caaagagctt ctttctggct gtatttttag agatcaggga aaacaagctt 480 gggggaacat ccaaggggtc tttaccttcc agggagaaac gctctaagtc ggggtctcac 540 a 541 // ID RLTR19B_MM repbase; DNA; ROD; 718 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 25-SEP-2008 (Rel. 9, Last updated, Version 2) XX DE Mouse subfamily of LTR retrotransposons - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW RLTR19B_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-718 RA Pavlicek A. and Jurka J.; RT "RLTR19B_MM - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. RLTR19 subfamily. RLTR19 subfamily (86% CC identity). CC Individual copies are ~92% identical to the consensus. 6 bp TSDs. 
XX SQ Sequence 718 BP; 184 A; 180 C; 131 G; 223 T; 0 other; tgttatggtc tgaccccccc cccccagaat agctgtagcc attttgttcc atgcctgcca 60 gctatttcat attgttgctg taacatgcct gccagtcatt gacacagaga agtgacttgg 120 ctcaaggaca tgctgaccac ataccctgtt ttgttctgta tgttctgaat gttctgtatg 180 aggtttgcta atcttaagaa attccacaaa gctttacgta ggacccatca aatcaaaggt 240 caatatgaac tgttatgtct aaaatatctt gagtcagagc tgaccaccag gcagcccttc 300 ctgcacctat gtatgagctc actgtggttt ttgtggctga cactgaagaa tgctactaat 360 aagccaaaaa gttatgatta taatactcat gctctgttat ctcaatgttc tgaaattccc 420 ctgttcacca cccatccacc accaccctta ccttactcag gaccaatcag cttaaaggtt 480 agctgataat actttgccta gtaagccaac tgctcccctc cttgcccttt aaacttttga 540 acctggtttt tcctataaaa agcctaccct gagagcagac tagtaccaca attaggcttc 600 cgaagtcctt tttgcggtcc tggacgtcca gtattatggt gtgcgttcaa taaactattc 660 ttgcttaact gagatcagtg tttgtatggt ttgagtggcg atttcctgaa ccccaaca 718 // ID ERV1NA-CPo_I repbase; DNA; ROD; 4804 BP. XX AC . XX DT 19-JUN-2009 (Rel. 14.07, Created) DT 19-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Internal portion of ERV1 endogenous retrovirus: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERV1NA-CPo_I. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-4804 RA Jurka J.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1542-1542 (2009). XX DR [1] (Consensus) XX CC >98% identical to consensus. 
XX SQ Sequence 4804 BP; 1364 A; 1097 C; 1031 G; 1312 T; 0 other; tcttggaggc cccagcgaga tagcacgtct cacggaaggt gacctcaacg gctctgcggt 60 gggtgagtac gaccggtcgc ttcttatttt tactctgagg atcaatctac cggtatttta 120 cgccccgagc caaacggctc cggtctaggg gcaatttttg ttacctgttt ttgttatctg 180 ttctctgttc tctggtgacc gggggacgcc ccacacggtt ttccgtggct caccgtctgc 240 gataggattt tcctatcggt ttggggccca atctgtggtc cgtcagggcc aattcggaga 300 caccctctga ggtgactcgt tcggggaatt tcattcccaa tctgtggtcc gtctgggcca 360 atctggggaa tttcattccc aatctgtggt ccgtctgggc caatctgggg aatttcattc 420 ccaatctgtg gtccgtctgg gccaatctgg ggaatttcat tcccaatctg tggtccgtcc 480 gggccaatct ggaggcaccc cctaaggtga ctcgtttggt gtgaaactaa tgcgtgagag 540 tgaggtggcc ggcctgtttg tcctgtgtgt tgtgactgtt gtgaacggtt atttgttgtg 600 tcgcgcgtac gctaaccact ctttgtcttc agaacaccat gggacaaatg acatcgtccc 660 cattgtccct caccctcgac cattggtcgg aggttcgaga gagggctcac aatctctcct 720 tacaggttaa aaaaagtaaa tggcagactt tctgctcgtc tgagtggccc acattctccg 780 tgggttggcc accggagggg actttccacc ttcccctcat cttagcagtt aagaacatca 840 tcttccgccc tggacgagag ggccaccccg accgggtcac ctacatcttg gtatggcagg 900 acctcaggga ggagccccca ccgtgggtaa aacccttcct tcctccgccc tctccttggt 960 tgcctgttta aacttgttac taacttaggg gcttgttgcc catttctgtt gcaggatcca 1020 tatggcttat tagattcaag gttcatgata agaaaaatta tttcctttag ctcaccaaac 1080 tgtgcaagaa gctccctttg tttgtgtctt taatgttgtg gcaccttgtt taatatgtaa 1140 gacttcaatt gattgcttgc aacctcggca tgccttttaa aaggagtatt tttttctctc 1200 tggtgaaaat gtaaacatgt gcacatatat atgtatatct atggttgtta agagtcatgt 1260 caactttgcc aagtaaaaac caggcggtac tatatttgag ccagttaaaa ggttacagtt 1320 ttgtttttaa aatacaacat ttttatgtgg taaaaagata taaaataagt atgttttgga 1380 ctactaaaag gtgacttcaa gggttttgtt gagagttttg atgttgtctg aaaagtgaga 1440 gaaaaggtaa ttgactggta atagtaaaaa gagggacaaa gagagtggtc ttcaagtaaa 1500 ggtgtctggc tatgccagaa tgtggaaaaa gattaaagct tagtttgaga ggtattattg 1560 taagttgtag aaggctcaag aggatgtaaa ttttatttgt cttggtttat agtactacaa 1620 ataacaggtt tgacaagaag taaatgtgtt 
cagaatttga cagtattaga tcagttttgg 1680 taaaagtata cataagattg tactttgctg caaaatgttt tcattaaaat gagattatag 1740 gtgatactaa aaataagata caagttctta catgtgctgt gaacatctgg gagattatgc 1800 agcacaattt tgttacctgt aaaaaagccc tttttgttaa gaatataatt ttagtaaaat 1860 gatggcctgt cacgtgacac ccttggacaa gaagcagtgt gccatccaga tgaagccggc 1920 tctctgatcc ggaatgccaa aaccatgttc tccttttgtg ccaccctgac tggacatcca 1980 gaggtaagaa ggtttcccaa tattgccagt tgttctcttt gtaactcagt gggccccaca 2040 tgttagacaa aaattacaaa ggtttaagag gtttaagaaa atgcccctca gccagttaat 2100 agagatagca caaaaaggtc tttgagaact aaaaaattta caaaaagata gccaagatgt 2160 taaaattgct ctacgagaga agacttttaa aaaacagacg agactttctg gagacagttg 2220 gatactgtag gctaaatttt actgaaaagc ttgcctctat atgactccca taagggagaa 2280 actcaaaatt cgctgaactg gactgataaa tgtaaacagg ccttccagca gctcaagcgg 2340 gccttgacag aggccacagc cctgggactc cctgacgtta ccaagccttt caccttgttc 2400 atagataaac tcagagacat tgccaaagga gtcctaactc aggacattgg cccttaaaag 2460 aggcctatag cctacctctc caaaagactt gattcagtaa caggagtatg gcctccctgc 2520 cttaaaatac tagctactgt gacaatgtta cttaaggaag cgaagaaaca tgctttgggc 2580 aagtgatttc tgtggtaacc tctcacagtt tagagactct tatcaaaacc cctcctaaaa 2640 aaaggctctc aaacgccaga attctactgt caagccctgc ttctagaccc acagtcaatg 2700 atattcaaga cttcttcagc tttaaaccca gccaccctta tgcctgatga tgatccggag 2760 gaacaaagga cacctctctc catgagtgcg ctgacatcct cgaactgcag cagaacctac 2820 aggctgatct tactgaccag cttttgggaa atgccgacaa ctgttcgatg aaagcagctt 2880 catccaaaac ggcaaacaag tggccgaggc agcagtgacc accgaaaagg aggtactctg 2940 ggttaagcgg gtagagcctg gcacttctgc ccaaaaggcg gagcttgttg ctttgactga 3000 gaggctaaaa ctagccacgg aaaagagggt aaacattttt tttactgaag atatcccttt 3060 gctactgtac atatacgtgg aagccatcta aaaatgctcc acagattcta gatttgctgg 3120 cagctgtctg gctgcctcaa tggattgcta ttccttacaa ggctcaccca accagagata 3180 aaaaatagca accacatcag aggggtcaac caaaaaagta actgctctaa tactagtcct 3240 ctgaccttgt cagaaccagt ctacacaaaa aaagacctac agttggcatc tcaacttgga 3300 ataaagtctc ccaccccaag gagctaatac cagttacctg 
ataaacaagt actgttactg 3360 gagacactag ggcgggagct gtttcaccgg gctcgccaga ctactcgttt agaaaaaact 3420 aaacttgcgg aattgttcag aactcagtat tttattccaa aactctacaa actgaccacc 3480 tcttcgagct ccagatgccc acagtgcttg ttagtcaact cagccgagag ctctgcgttg 3540 gagaccaccc aacatcaaga aattgttcca ggagaacatt gggaaataga cttcatggac 3600 atggtaaaac ttaatttaca gggacactgc tatctcttgg tattaataaa catttttact 3660 ggctgggtag aggctttccc tactaagtct gagtcagcct tagtggtagt taaaaggcta 3720 tcgcatgaga tcatacctta atttggcctc ccactatcaa tggggtccga taatggcctg 3780 gctttcatag cccgagtcac ccaaaagtta gcccaggcat taaagattaa ttaaaactcc 3840 actgtgccta tcagccacaa agctcagggc aagtagaaca tataaaccaa agtctcaagg 3900 acattttagc taagctgttt tcaaaaactg gcgataattg gatgagcctt cttccctttg 3960 cactgttaag agctagatat accccatatg tttccaaaat ttcccccttt aaagccatgt 4020 ttgggaggcc cccgccattg gttccaagat tgtcagaaga caaattagct aaaattacta 4080 atcgagatct taagtcccta cagacactcc aacctgtgtt ggcctggatc caccagctgt 4140 ttagacaaaa acacctggac ctgccttcag agacgcctct caaggcccca gggactccag 4200 tcttggttcg atgtcaccac cgtgactccc ttaaaccccg ctgagaagga cctttcaccg 4260 tggtgctcag taccctcact gccatcaagg tagcaagaaa aactgcgcgg attcactaca 4320 ctcatgtaaa gccactggaa acagcaaaca acgagggaca tactcagtgg actacacaac 4380 agactggtaa cccttaaagt taaaaattga ctaaaacata gttagattat gctggttata 4440 aactgtttta gcttgcttgt tttgatttta ataacattga tgccgacctt acacggcggc 4500 tttggtcccc ctgtaaacaa aaaaaaacct attgttaaag atctttgtat gagagccccc 4560 ttcaagtaaa accaaaatgt tctctttccg ggaatcaatt ctgccctata tcctcccttt 4620 attgggaccc ctggccggat tcattatagt catggctata gccccttgcc tcattaaccg 4680 agtaacccgg tttgtgaggg cccagatttc tcaagtaaaa gttatggtgc tacgtcaaca 4740 atatgatccc ttacccatac aggaccttta agattaaaag tcatccacaa gaaaaagggg 4800 ggaa 4804 // ID MARINER1 repbase; DNA; ROD; 1274 BP. XX AC L36092; XX DT 19-FEB-1997 (Rel. 2.01, Created) DT 19-FEB-1997 (Rel. 2.01, Last updated, Version 1) XX DE Mariner transposon. 
XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW mariner transposon; MARINER1. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RA Smit A.F. and Riggs D.A.; RT "Tiggers and other DNA transposon fossils in the human genome."; RL Proc. Natl. Acad. Sci. USA 93, 1443-1448 (1996). XX RN [2] RA Morgan T.G.; RT "Identification in the Human Genome of Mobile Elements Spread by RT DNA-mediated transposition."; RL J.Mol.Biol. 254, (1995). XX DR GenBank; L36092; Positions 495294 497519. XX CC 30 bp terminal inverted repeats, TA target site. XX SQ Sequence 1274 BP; 387 A; 264 C; 251 G; 351 T; 21 other; ttaggtcggt rcaaaagtaa ttrcggtttt tgcaytgttg gaatttgyca tttgatattg 60 gaatacattc ttaaataaat gtggttatgt tatacatcat tttaatgggc atttctcgct 120 ttacgttttt ttgctaatga cttattactt gctgtttatt tkrtgtttat tttagactat 180 gnaaatgatg ttagacaaaa agcarattcg agcgattttc ttattcaagk tcaaamtgrr 240 tcgtaaagcg gcggagacaa ctcgcaacat cagcaacgca tttggcccag gaactgctaa 300 taaacataca gtgcagtggt ggttcaagaa gttttncaaa ggagacgaga gccttgaaga 360 tgaggagtgt aatgacgggc catcagaagt tgacaacgac cagttgagag caatcatcga 420 arycgatcct cttacaacta nacgagaagt tgctgaagaa ctcgacgttg accattctac 480 ggtcgttcgg catttgaagc aaattggraa gnaaaaactc gataagtggg tgcctcataa 540 agtgagcaaa actttttnaa attgtcattt taaagtgtca tctcttattc tatgcaataa 600 caacgaacca tttcttgatc agattgtgac gtagaacgaa aagtggattt tatacgacaa 660 ccggcgacga ccagctcagt ggttggaccg aaagaagctc caaagcactt cccaaagcca 720 aacttgcacc aaaaaagtcg cggtcgcggt ctggtggtct gctgccagtg tgatccacca 780 cagctttctg aatcctggca aaaccattac atctgagaag tatcttcagc aaattgatga 840 gatgcaccaa aaactgcaat gcctgcacct ggcattggtc aacagaaagg gcccaattct 900 tctccacgac aacgcccgac cgcacgtygc acaaccaacg cttcaaaagt tgaataaatt 960 gggctacgaa cttttgcctc atccgccata ttcacctgac ctctcgccaa caaactacca 1020 cttcttcaag catctcgtca actttttgca gggaaaacgc ttccacaacc agcaggatgc 1080 agaaaatgct ttccaagagt tcgtcgaatc ccaaagcacg gatttttatg ctacaggaat 1140 
aaacaaactt atttctcgtt ggcaaaaatg tgttgattgt aatggttcct attttgatta 1200 ataaagatgt gyttgagcct agttatgatt atgatttaat ggccaaaacc acaattactt 1260 ttgcaccaac ctaa 1274 // ID RMER2 repbase; DNA; ROD; 130 BP. XX AC L00605; XX DT 27-JAN-1997 (Rel. 3, Created) DT 15-OCT-1997 (Rel. 3.1, Last updated, Version 2) XX DE Medium reiteration frequency repeat. XX KW Repetitive sequence; RMER2. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-130 RA Jurka J., Kapitonov V.V., Klonowski P., Walichiewicz J. RA and Smit F.A.; RT "Identification of new medium reiteration frequency repeats in RT the genomes of primates, rodentia and lagomorpha."; RL Genetica 98, 235-247 (1996). XX DR [1] (Consensus) XX CC RMER2 was found also in Rattus norvegicus. XX SQ Sequence 130 BP; 33 A; 24 C; 32 G; 41 T; 0 other; ttggcttggg tttcttttgg ggggcaactt ggaaactaat gctaggtacc agcctgttag 60 tttacctgag ttcaaactta ggtcaggttc tctaaaatgg agtctgaatt taaaagactc 120 ggcatcccaa 130 // ID RLTR19C_MM repbase; DNA; ROD; 507 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 25-SEP-2008 (Rel. 9, Last updated, Version 2) XX DE Mouse subfamily of LTR retrotransposons - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW RLTR19C_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-507 RA Pavlicek A. and Jurka J.; RT "RLTR19C_MM - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). 
XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. RLTR19 subfamily (79% identity). Individual CC copies are ~89% identical to the consensus. 6 bp TSDs. XX SQ Sequence 507 BP; 130 A; 115 C; 87 G; 175 T; 0 other; tgttgaggcc tgctcctttt aacataactg tagccatttt gtattcagtc tccattttgc 60 tcataaggtg aaattaagtt caggttctca gactctactt cccagaagta attatccata 120 gctgacactg aagaatgcta ctaataagcc aaaaagttat ggttgaatca cttgtactct 180 gttatctcaa tgttctgaaa ttcccctgtt caccacctgt ccaccaccct tcttacctca 240 ctcaggacca atcagcttaa aggttagctg ataatacttt gtctagttag ccaaaaatgt 300 tgtaccactt cactgcttgc cttttaaact tttgaacctg gtttttccta taaaaagcct 360 gccctgagga cagactggtg ccacaattag gtttttcctt cttgtggtcc tgaacgttca 420 gtattatggt gtgtgttcaa taaactattc ttgcttaact gagattggtg tttgtatggt 480 ttgtgtggca attcccaaac cccaaca 507 // ID MIR repbase; DNA; ROD; 262 BP. XX AC . XX DT 01-OCT-1995 (Rel. 1.09, Created) DT 18-APR-1997 (Rel. 2.03, Last updated, Version 2) XX DE Mammalian-wide interspersed repeat (MIR) - a consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; MIR; KW Repetitive DNA; MIR1; MER24; MB1. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RA Degen J.S. and Davie W.E.; RT "Nucleotide sequence for human prothrombin."; RL Biochemistry 26, 6165-6177 (1987). XX RN [2] RA Donehower A.L., Slagle L.B., Wilde M., Darlington G. RA and Butel S.J.; RT "Identification of a conserved sequence in the noncoding regions RT of many human genes."; RL Nucl. Acids Res 17, 699-710 (1989). XX RN [3] RA Jurka J., Zietkiewicz E. and Labuda D.; RT "Ubiquitous mammalian interspersed repeats (MIRs) are molecular RT fossils from the Mesozoic era."; RL Nucleic Acids Res 23, 170-175 (1995). XX RN [4] RA Smit A.F. and Riggs D.A.; RT "MIRs are classic, tRNA-derived SINEs that amplified before the RT mammalian radiation."; RL Nucleic Acids Res 23(1), 98-102 (1995). 
XX DR [4] (Consensus) XX SQ Sequence 262 BP; 69 A; 52 C; 61 G; 74 T; 6 other; acagyayagc atagtggtta agagcacggr ctctggagcc agactgcctg ggttcgaatc 60 ccggctctgc cacttactag ctgtgtgacc ttgggcaagt tacttaacct ctctgtgcct 120 cagtttcctc atctgtaaaa tggggataat aatagtacct acctcatagg gttgttgtga 180 ggattaaatg agttaataya tgtaaagcgc ttagaacagt gcctggcaca tagtaagcgc 240 tcaataaatg ttrgytatta tt 262 // ID DIPODE3 repbase; DNA; ROD; 233 BP. XX AC . XX DT 30-OCT-2009 (Rel. 14.12, Created) DT 30-OCT-2009 (Rel. 14.12, Last updated, Version 2) XX DE SINE element: consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; DIPODE3. XX OS Dipodomys ordii OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Heteromyidae; Dipodomyinae; Dipodomys. XX RN [1] RP 1-233 RA Jurka J.; RT "SINE elements from the kangaroo rat Dipodomys ordii."; RL Repbase Reports 9(12), 3122-3122 (2009). XX DR [1] (Consensus) XX CC >88% identical to consensus. CC This sequence was derived from sequence data generated by HGSC at CC Baylor College of Medicine. XX SQ Sequence 233 BP; 63 A; 65 C; 67 G; 38 T; 0 other; gccgggcgcc ggtggctcac gcctgtaatc ctagctactc aggaggctga gatctgagga 60 tcgcggttcg aagccagccc gggcaggaaa gtccgtgaga ctcttatctc caattaacca 120 ccagaaaacc agaagtggcg ctgtggctca aagtggtaga gtgctagcct tgagcaaaag 180 agctcaggga cagcacccag gccctgagtt caagccccat gaccgacaaa aaa 233 // ID MLT1D repbase; DNA; ROD; 505 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 18-APR-1997 (Rel. 6, Last updated, Version 2) XX DE Mammalian transposon-like element long terminal repeat (MLT1d DE subfamily) - a consensus. XX KW Repetitive sequence; MaLR family; MLT1d subfamily; MER26; MLT1D. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 334-465 RA Jurka J., Kaplan J.D., Duncan H.C., Walichiewicz J., RA Milosavljevic A., Murali G. 
and Solus F.J.; RT "Identification and characterization of new human medium RT reiteration frequency repeats."; RL Nucleic Acids Res 21, 1273-1279 (1993). XX RN [2] RP 1-505 RA Smit F.A.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21(8), 1863-1872 (1993). XX DR [2] (Consensus) XX CC Replaces MER26 sequence. XX SQ Sequence 505 BP; 149 A; 96 C; 138 G; 113 T; 9 other; tgtggtaggc wgaataatgg ctccccaaag atgtccacgt cctaatcccc agaacctgtg 60 aatatgttac cttacatggc aaaagggact ttgcagatgt gattaagtta aggatcttga 120 gatggggaga ttatcctgga ttatccgggt gggcccaatg taatcacaag ggtccttawa 180 agagggaggc agagggtcag agtcagaaga aggagatgtg acgatggaag cagrragnga 240 aaactcaacg ttgctggctt tgaagatgga ggaaggggcc atgagccaag gaatgcgggc 300 agcctctaga agctggaaaa ggcaaggaaa cggattctcc cctagagcct ccagaargaa 360 cgcggccctg ccgacacctt gattttagcc cagtgagacy cattttggac ttctgacctc 420 cagaactgta agataataaa tttgtgttgt tttaagccac taagtttgtg gtaatttgtt 480 acagcagcma yaggaaacta ataca 505 // ID L1MD_5 repbase; DNA; ROD; 3320 BP. XX AC . XX DT 09-OCT-1997 (Rel. 6.3, Created) DT 14-FEB-2000 (Rel. 7.1, Last updated, Version 4) XX DE Partial L1MD LINE1 repetitive element 5' end - a consensus. XX KW L1 repeat; L1MD_5; MER79; L1M6_5. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 500-1 RA Smit F.A.; RT "L1MD_5."; RL Direct Submission to Repbase Update (1996). XX RN [2] RP 1-1604 RA Jurka J., Walichiewicz J. and Kapitonov V.V.; RT "L1MD_5."; RL Direct Submission to Repbase Update (28-JUL-1997). XX RN [3] RP 1-1604 RA Smit F.A.; RT "L1MD_5."; RL Direct Submission to Repbase Update (19-AUG-1997). XX RN [4] RP 1605-3320 RA Jurka J.; RT "L1MD_5."; RL Direct Submission to Repbase Update (FEB-2000). XX CC 5' end of L1M subfamilies. CC Originally expanded to 1619 bp and classified as a 5'-portion of CC L1 CC (Jurka et al. [2]). A shorter version submitted by A.F.A. Smit on CC Aug. 
6 - deposited in the Appendix. Minor refinements of the 1619 CC bp consensus [2] worked out by A.F.A. Smit (August 19, 1997 [3]). CC Replaces MER79 [1] and L1M6_5 [2]. Average divergence from CC consensus is CC 24%. Appears to be 5' end of L1MD1 and L1MD2 subfamily LINEs. XX SQ Sequence 3320 BP; 1301 A; 692 C; 678 G; 627 T; 22 other; ttaaaaacaa agcgggagac ttccgcttcc gggaagatgg agtagacgta cttttcccta 60 ttcctcccgc taagtacaac taaaaaccct ggacattata tataaaacaa acataagaag 120 actctgaaag gtggagagaa gaaggcagac cggctaggga cctcgggacc cgaggaacga 180 cacggtagtg agttccctgg gttttctttt tgcctcatat atcccagact tggagctgaa 240 gaagccggca acccggaaac gccaacgggc acagacaaaa aaagccccaa caaaagcctg 300 ctctctctag ccaaaggacc aggaaagggg cagcctagca agacagaaaa cttttagaca 360 ataaccgctc tactccagcc aaacaccaca gaaaaaactg tggccccacc cccacccacg 420 ccagcaaagg ccgagtgggg agcctagact tccaccctca ccaggctgta acgaggcgcc 480 ccaacacctc caccgggatg gtgtcagaga aggccaagta gggagctggg actttcatcc 540 ccgccaggcg gtaatgaggc ccmccttccc cttgccmctg cggtgtcagt ggagaccacg 600 tggggagcct ggacttccac ccccacccgg cagtaatgag gcgcccctcc ccctccctac 660 tggggtggtg tcagaggagg cctagtggag agtcgggact ttcaccaccg cccagcggta 720 atgaagccac ctcctcctct tgccmccatg gtgtcagtgg aggccacgtg gggagcagta 780 atgaggcact cctacccctc ccagccaggg aggtatcagc ggaggcctag tggggagccg 840 aactcccacc cccgcccagc agtaacgagg agcccctccc tcacctcggg tgtcaacgga 900 ggccgagtgg ggaacctgga cttctacccc cacctggcag taatgaggca gcgcccctnc 960 ccctcycctg ccggagcggt gtcagaggaa gccggctaaa acagaaggtt taaataagat 1020 ccagagtctc ataacataat acccaaaatg tccaggtttc aatcgaaaat cactcgtcat 1080 accaagaacc aggaaratct caaactgaat gagaaaagac aatcaataga cgccaacacc 1140 gagatgacag agatgttaga attatctgac aaagatttta aagcagccat cataaaaaat 1200 gcttcaataa gcaattacga acgtgcttga aacaagaraa aagtagaaag cctcagcaaa 1260 gaaatagaaa gtctcagcaa agaaatagaa gatataaaga agaaccaaat ggaaatttta 1320 gaactgaaaa atacaataac cgaaataaaa anctcaatgr atgggctcaa tagcagaatg 1380 gaggggacag aggaaagaac cagtgaactt gaagatagag 
caacagaaat tacccattct 1440 gaacaataga gagaaaatag attggaaaaa aaaatggaca gagcctcagg gacctgtggg 1500 actataacaa aagatctaac attcgtgtca tcggagtccc agaggagagg aaaaagagrr 1560 tagtatttga agaaataatg gctgaaaatt tcccaaattt ggcaaaagac ataaacctac 1620 agatagattc aagaagctga gtgaacccca aacaggataa acccaaagaa atccacacca 1680 agacacatca tagtcaaact tctgaaaact aaagacaaag aaaaaaaaat catcttgaaa 1740 gcagcgagag agaaatgaca ccttacctat aggggaaaaa caattcaaat gacagtggat 1800 ttctcatcag aaaccatgga ggccagaagg aagtggcaca acaatttttt caagtgctga 1860 aagaaaagaa ctgtcaaccc agaattctat atccagyaaa aatatccttc aggaatgaag 1920 gggaaatcaa gacattctca gatgaagaaa aactaagaga atttgttacc agcagaccta 1980 ccctaaaaga atggctaaag gaagttctct aaacagaaag gaaatgataa aagaaggaat 2040 cttggaacat caggaaggaa gaaagaacat agtaagaagc aaaaatatgg gtaaatacaa 2100 tagactttcc ttctcctctt gagttttcta aattatgttt gatggttgaa gcaaaaatta 2160 taacactgtc tgatgtggtt ctngcnaaaa atgtatgtag aggaaatatt taagacaatt 2220 atattataaa ttgggggagg gtaaagggac ataaagggag gtaaggtttc tacacttcac 2280 ttgaactggt aaaatgataa caccagtaga ctgtgataag ttatgtatat ataatgtaat 2340 acctagagca accactaaaa aagctataca aagagatata ctcaaaaaca ctatagataa 2400 atcaaaatgg aattctaaaa aaaatgttca agtaacccac aggaaggcag gaaaaagaaa 2460 acagagaaat gaaaaacaga acaaacagaa aacaaaaaat aaaatggcag acttaagccc 2520 taacatatca ataattacat taaatgtaaa tggtctaaat acaccaatta aaagacagag 2580 agattggcag agtggattaa aaaacatgac ccaactatat gctgtctaca agaaactcac 2640 ttcaaatata ataatatagg caggttgaaa gtaaaaggat ggaaaaagat atatcatgca 2700 aacattaatc aaaagaaagc aggagtggct atattaatat cagataaagt agactcttca 2760 gagcaaagaa aattaccaga gacagagagg gacattacat aatgataaaa gggtcaatcc 2820 acaaagaaga catagcaaat cctaaatgtg tatgcaccaa acaacagagc tgcaaanata 2880 tgtgaagcaa aaactgatag aactgaaagg agaaatagac aaatccacaa ttatagttgg 2940 ggacttcaac anccctctct caacaattga tagaacaact agacagaaaa tcagcaagga 3000 tatagaagaa ctcaacaata ccatcaacca ataggatcta attaacattt atagaacatt 3060 ccacccaama acagcagaat acacattctt ttcaagtayc canggaacat 
ataccaagat 3120 agaccatatc ctgggycata aaacaaacct caacaaattt aaaagaattg aaatcataca 3180 gagtgtgttc tctraccaca atggaatcaa actagaaatc aataacagaa agataacagg 3240 aaaatctcca aacacttgga aactaaacaa catacttcta ataatccatg ggtcaaaaaa 3300 gaagtctcaa aggaaatnaa 3320 // ID MT2B repbase; DNA; ROD; 533 BP. XX AC . XX DT 25-APR-1997 (Rel. 3, Created) DT 25-APR-1997 (Rel. 3, Last updated, Version 1) XX DE Long terminal repeat of retrovirus-like element. XX KW Long terminal repeat of retrovirus-like element; MTE2; MT2B. XX OS Sciurognathi OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia. XX RN [1] RP 1-533 RA Smit F.A.; RT "Structure and evolution of mammalian interspersed repeats."; RL Ph.D. dissertation, Univ Southern California, 1995, pp 220-224.. XX DR [1] (Consensus) XX SQ Sequence 533 BP; 136 A; 114 C; 126 G; 141 T; 16 other; tgtagtggct attcctggtt gtcaacttga ctatatctgg aatgaactac aatccagaat 60 tggaaggctc acctgtgatc ctgatcttga ggctggaaga tacaagtttc tgacctggat 120 cttggcatgg agatcttgag gcatagtggc catgaaaanc ttaggcccag gcaaggtagt 180 acangccttt aattccagga gactgaggca aggagatctt tgagttcaag gtcanccaga 240 tcyaggcnng rngrtacaca tctttaatct gggccacacc ttctggctgg aggcctacat 300 aaggacattg gaagaaggaa ggntcnytyt cttcgcctgc ttgyacttac tggccagyac 360 atctgttgga gcctacttct tcaggattcc agcttatata gaagaccagc tgaaayagct 420 agcctcgcgg gactgagcaa ctactagatc cttggacttt ccattcacag ctgaccattg 480 ttgggttagt tggactgcag actgtaagtc attccaataa attcccttaa tat 533 // ID MLT1FR repbase; DNA; ROD; 1074 BP. XX AC . XX DT 28-MAR-2001 (Rel. 6.02, Created) DT 28-MAR-2001 (Rel. 6.02, Last updated, Version 1) XX DE MLT1- LTR retrotransposon internal sequence - a consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; MaLR family; KW LTR retrotransposon; MLT1E; MLT1R; MLT1F1; MLT1CR; MLT1F2; KW MLT1FR. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. 
XX RN [1] RP 1-1074 RA Jurka J.; RT "MLT1FR."; RL Direct Submission to Repbase Update (28-FEB-2001). XX DR [1] (Consensus) XX CC Internal sequence consensus for MLT1E/F retrovirus-like elements CC (MaLR). CC The closest homologue is MLT1CR (75% similar). Divergence from CC individual repeats ~24%. XX SQ Sequence 1074 BP; 317 A; 206 C; 268 G; 273 T; 10 other; caggacttga tgrttttgaa aattctcagc ctctccagat ggcaaaagat gctaaaatta 60 agaaatggct yctgmgcagg aaaacatggt ctaagataaa gctangggtg tgactgtaaa 120 tcttttgtta aaanctcaga aagatcaaag gatcagagta ctattcagtc acacaaaggg 180 ccctttaaag agattaaggg tgtgtgcctc acagatcctc tcaatcaaac aatagggctt 240 ctaggaagct taagggtatt gtccctcagc catctcagca gaagcccaag gtagagaagg 300 gcttannaaa ggcttagntc tcaaagagat ttgtgggtgt ggctttttgt ctaatggagt 360 gaaccccaat aagattcaca ggaracccac aaagttttta agagaattat attagcagaa 420 acactgccag cttggactga aagggacaga gacagtacaa aatgaaaaga ggcctttgga 480 cccccaaaat tctactggca ggaagcaggc tgagaaaact actcagctgc aaacatgtgc 540 tacctttcat gaaaaaggaa ggatgactca gagggtagaa ccaagagccc agagggtaga 600 gccaagagcc atggagaatt attcccaggc cttgagacct aatcaaggaa cttccaacat 660 ttgcctagct ggatttcaga attgctatgg accagtgact cctttgtgcc tcccattcat 720 tttccctctc ctttccccct ttttgaatag gaatgtctat agcagttatc ctatgcctgt 780 cccaccattg tatgttgggt gtgttggggg cagataactt gtctctttag tttcacaggt 840 ctacagattg agaggaactg tactcaagga gctgtactta aggaactaca cccaaggagc 900 ctcatccaca cctggacctg atttagatga tgagattctg gactttgagc tgatgctata 960 atgggatgag acttttgggg aatcttggga ggggkgaatg tatcttgcat ggagggggtg 1020 aatgtatttt gcatgtggga gggatgtgaa tcattggggg gccagagggt agac 1074 // ID MamGypLTR1b_LTR repbase; DNA; ROD; 782 BP. XX AC . XX DT 05-FEB-2007 (Rel. 12.01, Created) DT 10-APR-2007 (Rel. 12.01, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from mammals. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW MamGypLTR1b_LTR. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. 
XX RN [1] RP 1-782 RA Smit A.F.; RT "MamGypLTR1b_LTR - Gypsy LTR Retrotransposon from mammals."; RL Direct Submission to Repbase Update (05-FEB-2007). XX DR [1] (Consensus) XX CC 4 bp TSDs; 33% subst in dog-human. Associated with Gypsy CC internal sequence. Includes rnd-4_family-1902 & -3183 10% CC similar to MamGypLTR1a. XX SQ Sequence 782 BP; 169 A; 183 C; 254 G; 166 T; 10 other; tgtggcagga taatttattg agatattaat ttgtgttttg ctctctgtat ttttcccttc 60 cctcccaatt ccaagaaggt agccaggccc tttgtgntcc cttgcctcgg gggaggtttg 120 tgctgcaggg ccgaaagcag aagttgcctg aagacaaccc ccttcctggc ttttgttttc 180 aaaagcctaa gctcnttgag gagattatgc tggtgccctg agggagagag ggaggtgctt 240 gaggggggag ntgggagaag gnagaaaggg gaggagcttc cccaagactg ggaaggggac 300 aggagtctgg cggntcctgg agtagggatg aggcccaggg ccctgctccc tgncagtgcc 360 ccggggaggt ggcaggacct cagaggggaa tggctgcgtg gtgtgcctag ggaggctgga 420 ccctaggcac cggggctccc agcctcggca aagattcccg tgcccaagcn tggcacggaa 480 gcagcagagc cgccngcctt caagggacca tgcgggcttg gacaatgagc atntcagcgg 540 tgaccagtgt ggaccgaaga ccagagggcc ctccccgaat gttccgtnct gcgtaagacc 600 cccgggacct ttgcacgacc ctgggggagg gagggggagc cccaataatg actgagattg 660 aatttcccgc cagcctagtg ggatgggggc tcggagtcag atttaatttg atttaaagaa 720 ataaagaaat gtgacatttc ttgcacacct gagtttgtgg agtaagattc atacccgcta 780 ca 782 // ID SYNREP_MM repbase; DNA; ROD; 266 BP. XX AC X14228; XX DT 28-SEP-1995 (Rel. 1.2, Created) DT 17-APR-1997 (Rel. 3, Last updated, Version 2) XX DE Mouse synaptonemal repetitive DNA sequence. XX KW Repetitive DNA; MMSYNREP; SYNREP_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-266 RA Dadasheo Y.S., Bashkirov I., Belostosky D., Milshina N., RA Karpota J.O. and Bogdanob F.Y.; RT "Nucleotide sequence of a specific DNA from a mouse synaptonemal RT complex fraction."; RL Unpublished. 
XX RN [2] RP 1-266 RA Bashkirov I., Belostotsky A.D., Bogdanob F.Y., Dadasheo S., RA Karpota J.O. and Milshina N.; RT "Direct Submission."; RL Direct Submission to Repbase Update (01-FEB-1989)N. Milshina, RL Institute of General Genetics, USSR Academy of Science, 117809 RL GSP 1, Moscow B-333, Gubkin St.3 U.S.S.R.. XX DR GenBank; X14228; Positions 1 266. XX SQ Sequence 266 BP; 113 A; 30 C; 59 G; 64 T; 0 other; gaacagatta gatgagtaag ttacactgaa aaacacattc gttggaaacg ggatttgtag 60 aacagtgtat atcaatgagt tacaatgaga aacatggaaa atgataaaaa ccacactgta 120 gaacagatta gatgagtgag ttacactgaa aaacacattc gttggaaacg ggatttgtag 180 aacagtgtat atcaatgagt tacaatgaaa aacatggaaa atgataaaaa tcacactgta 240 gaacatatta gatgagtgag ttaggg 266 // ID X3_LINE repbase; DNA; ROD; 233 BP. XX AC . XX DT 19-JUL-2006 (Rel. 11.1, Created) DT 01-AUG-2006 (Rel. 11.1, Last updated, Version 1) XX DE A conserved fragment of RTE-like LINE element - consensus. XX KW Non-LTR Retrotransposon; Transposable Element; X3_LINE; KW conserved. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-233 RA Jurka J.; RT "X3_LINE: A conserved fragment of putative RTE-like LINE."; RL Repbase Reports 6(10), 545-545 (2006). XX DR [1] (Consensus) XX CC It is present in ~200 copies phg. Present in mammals, not found CC in chicken. XX SQ Sequence 233 BP; 80 A; 38 C; 61 G; 51 T; 3 other; acatgtgtag aaatataatg rcagcaggat accaaagcag ctgttgtata gtgagctgaa 60 gtggggtaat cacaagcagg gagggcagaa gaaatacttt aaggattcac tgaagcacar 120 cttcaaacaa tgttrgcata gctgtggact gctgggaaaa acagcagcag agaccagcct 180 ggcatgcagc aataaggaat ttttgaactt tttgagcaaa ggctttaagc tga 233 // ID RMER12 repbase; DNA; ROD; 1315 BP. XX AC . XX DT 26-SEP-1997 (Rel. 2.08, Created) DT 26-SEP-1997 (Rel. 2.08, Last updated, Version 1) XX DE Interspersed repeat. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW Interspersed repeat; RMER12. 
XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-1315 RA Smit A.F.; RT "RMER12."; RL Direct Submission to Repbase Update (1997). XX DR [1] (Consensus) XX SQ Sequence 1315 BP; 353 A; 267 C; 247 G; 420 T; 28 other; tgccaagggc ctgccccggc tggaatgggc tgaagcggga cacagmattg gatgacgact 60 taaagactga tggtccctgg ccttntagct ctctccaact gtatttggkg ctagawgctg 120 ctggcttctg gaaaatctta acwctctctt gcttattctg wggctaaaaa ctgmtggctt 180 tctttattct ataactctnt gctttataag cattataacc ctttttactt tctttaactc 240 tctgtgcttt aataagcact gggaacttaa ctctttctta ctgttctgtg cttaaataag 300 cgctaaaaac gtaactatgc ttaaataagc gctggcctct tccgctgtac cttcttatta 360 tttgtggtta acttggtgct aatntctggt gattaacaca tgctgtntag cagcagggga 420 ttaagtgaaa ggatttattg ctagtgctct tttctctggt gagtcagcca gaagcctctc 480 tggctaagct ggttaactcc ttgtgaaaca gagaggaagg aaagaagaag acttaagata 540 cnnnnnnnnn nnnngaagat acaataacag ctacaggcta cactttattt cttttctctg 600 gtcttttata gcttcttaga caaaaagttc tttagaaaat tacctcatag ataaaatgtt 660 acacagaagg ggagctacag aaggatgttg atagtaagtt caaagcagtg tttcattcta 720 taagctcatt ataagttcaa agcaacattt tcacatggcc ttgcttmatc atagtgcaca 780 cctgtggctt yatccttgga cctgacctag aatgaatgtt tttccttgan cataaatttg 840 tcccaagcct atttctcttt ttagtgcaat tatgaaagct cattgtattc cttwttatgc 900 agcttttact tactattcta tgaggagtgg gcacattcta ttcttgattc taactttatt 960 acatctttct ggcaaaacaa ctaaaaccat ctccttaaac tttcttccta aaattatttg 1020 aatcaaatca ggaattctat aaaatcatta attcatctaa acagcattaa ttcatgaaag 1080 ttcatcttca tgttgatctg caggaaatct gcccaatagg gtgggctgat gcccaggaat 1140 cattaggcta tgtaatagtg gcaggaaagg catatcagca gcaagaagca atcctgggat 1200 gaagtctctt ggggccttgc ctctcagagc tcacccatcg cagccatgca aaatggtggg 1260 ccatctccag gaaactcact tgcttgccwt agcctttcct agatggcttc tgtca 1315 // ID MSTD repbase; DNA; ROD; 406 BP. XX AC . XX DT 07-FEB-2000 (Rel. 
7.1, Created) DT 07-FEB-2000 (Rel. 7.1, Last updated, Version 1) XX DE Long terminal repeat - a consensus. XX KW MaLR family; MSTC; MST subfamily; MSTD. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-406 RA Jurka J.; RT "MSTD."; RL Direct Submission to Repbase Update (FEB-2000). XX DR [1] (Consensus) XX SQ Sequence 406 BP; 99 A; 93 C; 90 G; 121 T; 3 other; tgctatggtt tgaatgtgtc ccccaaagtt catgtgttgg aaacttaatc cccaatgcaa 60 cagtgttgag aggtgggrcc taataagagg tgattaggtc atgagggctc tgccctcatg 120 aatggattaa tgccattatc acaggaatgg gttagttatt gtaggagtgg gttccttata 180 aaaggatgag tttggccccc tcttgctctc tctctcacac tctcttgccc tttttctgcc 240 tgccttctgc catgggatga tgcagcaaga aggccctnac cagatgcgcc agccccttga 300 tcttggactt cccagcctcc agaactgtna gcaaataaat ttctgttctt tataaattac 360 ccagtctgtg gtattctgtt atagcagcac aaaatagact aagaca 406 // ID LX8 repbase; DNA; ROD; 1413 BP. XX AC . XX DT 26-SEP-1997 (Rel. 2.08, Created) DT 26-SEP-1997 (Rel. 2.08, Last updated, Version 1) XX DE 3'-end of L1 repeat (subfamily LX8) - a consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW Repetitive sequence; L1 (LINE) family; LX8 subfamily; LX8. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-1413 RA Smit A.F.; RT "LX8."; RL Direct Submission to Repbase Update (30-NOV-1996). XX DR [1] (Consensus) XX CC Partial 3' untranslated region of rodent LINE1 subfamily LX8. CC The nomenclature for the rodent LINE1 subfamilies (as based on CC the CC 3' ends) is temporary, awaiting analysis of their relationship.
XX SQ Sequence 1413 BP; 462 A; 300 C; 300 G; 281 T; 70 other; aacacaagag aaaaagaaga aagaagataa ataccactaa ggatnttcga aangctctaa 60 ggaatcatat tattttatkt ttacctgaaa tgtgtgtgtg tgttaattta aatgaagtta 120 tcccacttgg gstgataatg ctccctccaa gagccaaaga ccatctaaca aaancccmaa 180 caccaggcat gagaarccct ctttcgagtt gttggtcagg gkwktccaag agactcccaa 240 aacaatatgg gctattgcta ttgccmttgg ttgtcccagn agaggtagaa ggtaagtccc 300 twttgctgaa gacaccatgc acttcggaca caggacccag aggmccctga gctggaactg 360 acctgaawat cttctccctg aggactagct ttcatggtac cagaaggtgc tatgcaagct 420 tccaaaggag agaggcaack aacagtccya cccagccgtg atgcctatga accacatcaa 480 cgaccagcat ggcacaataa ccccaagggt gcagtagtgg cackcacacc ttggcggtaa 540 ccaacagctc tctaattgga cttaagrcct gctcaacaag agggaaatca tacttggtac 600 tgaaaacccg gctaaatacc catggctggt gargtcatgg gccttagagg agaacctaca 660 accgccactt tactaaacca gtacaatncc taactgcatt ctaaatattt atccttatac 720 ccacagatag gtgtagtcct cacccctcat caaggaaact tctctttgca acagacagag 780 accattacag aaaaccacaa ccaatcaaaa tgcagagttg tggagcccag tcccagcgga 840 tacatctaca aaacaactcc tgcacctaag gctcagggaa catcgcggaa gagggagcag 900 aaagattgta agagccagag gatcagggag tttggtgtga gactgtgtct cctagtaatg 960 tcagaagcta cacccatgag gtctcaccaa catgactgcc taaacatgag ctgagcaagg 1020 acgacagcaa tggacacgct aaagtggaca ggggaaagct caccaagcct caaccctaca 1080 caaagaacta caggcaacta aggaatgctg agagtgggag aaatagtctt cccccgggaa 1140 gagcacatca attggttatc caataccaaa tggtcagccc tgaaaacata yacayacaag 1200 taacattata cagactgagc aggttgtatt tatgtattta ngaatagatr yryryryryr 1260 yryryryryr yryryryryr yryryryryr caattaawga aaaaagaggc catgaatttg 1320 aaaagaagca wggaggnnta tatgggaggg tttggaggga ggaaagggaa gggagaaatg 1380 ntgtaattat attataatct caaaaaataa aaa 1413 // ID ORSL repbase; DNA; ROD; 630 BP. XX AC M26221; XX DT 30-APR-1998 (Rel. 6.5, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE Origin of replication-like (ORS8) region (a consensus). XX KW Origin of replication; ORSL. 
XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 287-482 RA Rao S.B., Zannis-Hadjopoulos M., Price B.G., Reitman M. RA and Martin G.R.; RT "ORSL."; RL Unpublished (1989). XX RN [2] RP 1-630 RA Rao S.B., Zannis-Hadjopoulos M., Price B.G., Reitman M. RA and Martin G.R.; RT "Sequence similarities among monkey DNA-replication ori-enriched RT (ors) fragments."; RL Gene 87, 233-242 (1990). XX RN [3] RP 1-630 RA Jurka J.; RT "ORSL."; RL Direct Submission to Repbase Update (13-APR-1998). XX DR [3] (Consensus) XX CC This sequence is moderately repeated in human DNA. CC It shares ~200bp stretch of similarity with African CC Green Monkey origin of replication region (Acc. No. M26221). XX SQ Sequence 630 BP; 183 A; 118 C; 135 G; 175 T; 19 other; ycccatacyg tttccrtcac trgtytgwgc accctctgca rggmagacag catgaytttt 60 yatctttgaa tycyyaaaas ttagctcact gtgtgctcaa acgtgtattg aatgacagtt 120 gctatatttg aggacbacat agattttggg gaagacggac aggcacacta gcagaaccat 180 acgaaggcca ggatcagtca tgaccagggc tgcattatga cttgtgggcc ctgagcactt 240 ttgcttttat gggccccttc ctccataaaa atattaaaaa ttatatttta tgactgcatt 300 ggtataaaga tgagaatata atccaggctg aattaaaaca ttttcttaga ctctaaaatt 360 tcattttttt ctgattttaa aagaaattaa aacattttya tggggcccta aagtattgtg 420 ggccctaggc actgtgccta ctgtgcctaa tggataagtc nagcctgacc agggcactct 480 ggtgtgaggt tgaagaaagg aaatttggaa caaagaagcc aagtgctctg gagaagcagg 540 tgaaacttcc actgccgaac aaaatcagaa tgggagcagc catggttaat aaggttgtgg 600 aagtttagas cttccagttc acttcmcctt 630 // ID CAVID repbase; DNA; ROD; 104 BP. XX AC . XX DT 21-SEP-2009 (Rel. 14.1, Created) DT 21-SEP-2009 (Rel. 14.1, Last updated, Version 3) XX DE tRNA-derived SINE family: consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW CAVID. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. 
XX RN [1] RP 1-104 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 9(10), 2808-2808 (2009). XX DR [1] (Consensus) XX CC >99% identical to consensus. Active. XX SQ Sequence 104 BP; 37 A; 21 C; 29 G; 17 T; 0 other; ggggctgggg atttagctca gcagcataag cgcctgcctt gcaagcaggc agtcgtgagt 60 tcgatccctg gtaccgataa aaaggaaaaa gacaaaaaaa aaaa 104 // ID L1-4_Cpo repbase; DNA; ROD; 6572 BP. XX AC . XX DT 16-OCT-2009 (Rel. 14.11, Created) DT 16-OCT-2009 (Rel. 14.11, Last updated, Version 1) XX DE L1 non-LTR retrotransposon: consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-4_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-6572 RA Bao W. and Jurka J.; RT "L1-type non-LTR retrotransposons from guinea pig."; RL Repbase Reports 9(11), 2847-2847 (2009). XX DR [1] (Consensus) XX CC ~88% identical to consensus. XX FH Key Location/Qualifiers FT CDS join(672..917,921..1751) FT /product="L1-4_Cpo_1p" FT /translation="MTKRKSKMQSQTSSPSQTSLGDMDIEENVSHTPETSE FT MIATEVKKQLHEAVKELRSEIISQVKEEMINIIKEMLNTSKEDIVKIEEHC FT KKMETLNEQYKKETECMKQIQRDIQEIKNSIESWQNHLNESEDRLSDLEDK FT IAASEQERKDLLKITRNQEITIQQLQDDAKKNNIRLIGVNEKEGDNTNDIK FT RLFTEVIAENFPNMGKETDIQISEAYRTPISHNQNKSTPRHIIINIPEIQH FT KNRILKAVREKRQITYKGKPIRITADFSTQTMKSRRAWSEVFQILKENDFQ FT PRLMYPAKLSFKIDGEIRYFHDKEQLKNFMTTKPTLQRILKDILDRHKNNY FT DFKNTNRKKPLSKANN" FT CDS join(1795..2070,1958..2938,2942..4963) FT /product="L1-4_Cpo_2p" FT /translation="MTKISQYISVITINVNGLNSPIKRNRLAEWIKKQHPT FT ICCIQETHLTQKEIHRLKVKGWKTILHATGTPQKSRGSYSVCRQCELQTKN FT DQKRKSKDGKQYFMQQEPHKKAGVAILFADNVNFKPRMIKRDKEGHYILVR FT GKLQEEEITILNIYAPNMGAPSYIKQILIEMKNQISNNTIIMGDLNTPLTQ FT RDRSTRQKISKEITELNQTCEQMDLVDVYRVFHPTTSEYTFFSAAHGTFSK FT IDHILAHRTCLSKCKRIEIIPCILSDHSAMKLEINDKRNCKNSINTWKLNN FT TLLNNQWVTEEIKEEIKQYLQANENTNTTYRNLWDAMKAVLRGKFIAVNSH FT 
IRKTERTQINNLMLHLKHLEKEEQVKPKAKRREEIIKIRAEINAIETKKTI FT QRINESKSWFFERINKIDKPLANLIKRKEKTQIHAIRNEKGEITTDPVEIQ FT KIINTYFENLYSHRFDNTEEMDRFLETYELPKLDQEDVKLLNNPISINEIE FT NVIRTLPTKKSPGPDGFTAEFYKKFKEDLIPTLLKLFNEIEREAILPNSFL FT EASITLIPKPEKDPTKKENYRPISLMNIDAKILNKILANRMQQIIKKIIHH FT DQVGFIPGMQGWFNIRKSINVIHHINRAKDKNHMIISIDAEKAFDKVQHSF FT MIRTLQKIGIDGLYLNLIKAIYDKPTASIILNGQKLKAFTLKSGTRQGCPL FT SPLLFNIVLEILARAIRQEREIKGIRIGKEEVKLSLFADDMILYIEDPQNS FT IERLLGVITKFSNVAGYKVNTQKSIAFLYTNNKLTERDKRSHTLHNSIQKM FT KYLGINLTKEVKDLFSENYNTLKKEIEEDLRKWKDIPCSWIGRTNIVKMAI FT LPKLLYRFNAIPIKIPLAYLTDLEKSFLKFIWNQKRPRIAKAILGNKGKAG FT GITIPDLKLYYKATVIKSAWYWNKNRPEDQWNRLEDTTTTTNTLSHLIFDK FT GAKHVHWKKDSLFNKWCWKNWLSICQKLKLDPCLSPCTKLKSKWIKDLNIK FT TETLNLLEDKVGRNLEDIGVGREFMNKNQTAWEILPRINNWDLILLKSFCM FT SKEISNRVKRKPTNWEKILVNCPSDKGLLSRTYKEF" XX SQ Sequence 6572 BP; 2579 A; 1284 C; 1195 G; 1514 T; 0 other; cactgtcccc acccagtgac cccgagtggc cggggaaccg cctgcgctca gtggcgtgag 60 ccgaaggcct gcacagcgaa ccccatccag ccgtgcaacc tcggcacctc gctcggaacc 120 tggtagggcc ccacccagca tccccaatca gccagggacc tttgggtagg tccctcgggg 180 cctggttggg ccccgcccag tgaccccgag tggccgggga accacctgcg ctcagtggca 240 tgagccgaag gcctgcacag tgaaccccac ccagccgtgc aacctcggca cctcgcttgg 300 aacctggtag ggccccaccc agtgacccca agcagctgtg gagctttggt gcgccatgtg 360 gcatgagctg aaggcctgca cagcatactc catctagctg ttcagccttg gtacgccact 420 cagaacctgg tagagtcccg cccagcaacc tcaagcagcc gggagcttca gagtaccacg 480 tggggtgtgc tgaaggcccg cccagcgtac tccccacatt tgaggagcct tcgagtgcca 540 ctagggacct ggtaaggcct gactaatgac ctccactcaa ctggggaaac tccctgcagt 600 acggacagac atcctcaacc caggaagcaa ctgagacctt gcattcagct tctccagggg 660 aatacaggaa aatgacaaaa cgaaagagca aaatgcaatc tcaaacctcc agcccatccc 720 aaacaagcct aggagacatg gacatagaag aaaatgtaag ccatacacct gaaacttctg 780 aaatgatagc aactgaagta aagaaacaac tccatgaagc tgtaaaggaa ctcagatcag 840 aaataatctc ccaggtaaaa gaagaaatga taaacataat aaaagaaatg ctgaacacat 900 ctaaagaaga catagtttaa aaaattgagg agcactgcaa aaagatggaa accttgaatg 960 
aacaatataa gaaagaaaca gaatgcatga aacagattca aagagatatc caggaaatca 1020 aaaattccat cgaaagctgg caaaaccacc tcaatgaaag tgaagataga ctttcagacc 1080 tcgaagacaa gattgcagct agtgaacagg aaaggaaaga tcttttaaaa ataacaagga 1140 atcaggaaat aacaattcaa cagttgcaag atgatgcaaa gaaaaacaat ataagattga 1200 taggtgtaaa tgaaaaagaa ggtgacaaca caaatgatat caaaagacta tttacagaag 1260 taatagccga aaatttccca aacatgggaa aggaaactga catacagata agtgaagcat 1320 atagaactcc aattagccat aaccaaaata aatctacacc cagacatata ataatcaaca 1380 tcccagaaat tcaacataag aacagaatat taaaagctgt tagagagaag agacagatca 1440 cctataaagg aaagcccatc agaatcacag cagacttctc aacacaaaca atgaagtcaa 1500 gaagagcatg gagtgaagta tttcaaatcc taaaggaaaa tgacttccaa cccagattga 1560 tgtaccctgc aaaactgtcc ttcaaaattg atggagaaat aagatacttc catgacaaag 1620 aacagctgaa gaacttcatg accaccaaac caaccctgca aagaatactg aaagatattc 1680 tagatagaca taaaaataac tatgacttca agaacaccaa cagaaaaaaa ccactaagca 1740 aggcaaacaa ttaaacagag gggaaaggta aagaaccaac agcagaaaga taacatgaca 1800 aaaataagcc aatatatatc agttataacc attaatgtaa atggcctcaa ttcaccaatt 1860 aaaagaaaca gactggcaga atggatcaag aaacaacatc caaccatatg ctgtatacaa 1920 gaaacccatc taacccaaaa ggaaattcat agactgaaag tcaaaggatg gaaaacaata 1980 cttcatgcaa caggaacccc acaaaaaagc aggggtagct attctgtttg cagacaatgt 2040 gaacttcaaa ccaagaatga tcaaaagaga taaagaaggt cactacatac tagttagggg 2100 aaaacttcaa gaggaagaga taacaatctt aaatatatat gcaccaaata tgggagcacc 2160 cagctatata aaacaaatat taatagaaat gaaaaatcaa ataagcaata acacaattat 2220 aatgggagac cttaacaccc cattgacaca aagagacaga tcaactagac agaaaatcag 2280 caaagaaata acagaactga atcaaacttg tgaacaaatg gacttagtag acgtgtacag 2340 agtgttccac ccaacaacat cagaatatac attcttctca gctgcacatg ggacattctc 2400 taaaatagac catatactag cccatagaac atgcttaagt aaatgcaaaa gaattgaaat 2460 tatcccatgc atattatctg atcatagtgc catgaaacta gaaattaatg acaaaagaaa 2520 ctgcaaaaac tccataaaca catggaaatt aaataataca ctcctgaaca atcaatgggt 2580 cacagaggaa attaaagaag aaattaaaca atatctacaa gcaaatgaaa atacaaatac 2640 aacttaccgg 
aacctgtggg atgcaatgaa agcagtcctc agagggaaat ttattgcggt 2700 gaattcccac atcaggaaaa cagaacgaac acaaataaac aacctgatgc tacacctcaa 2760 acatctagaa aaagaagagc aagtcaagcc caaagccaaa agaagggagg aaattataaa 2820 gatcagagca gaaattaatg caatagagac taagaaaaca atacaaagaa ttaatgaatc 2880 aaagagttgg ttctttgaaa gaataaataa aattgataag cccctagcca accttattta 2940 aaaaaggaaa gaaaaaactc aaattcatgc aataaggaat gaaaagggcg aaatcactac 3000 agaccctgtg gaaatacaga agatcatcaa tacctacttc gaaaaccttt actctcatag 3060 gtttgataac acagaagaaa tggacagatt cctagaaaca tatgaattac caaagctgga 3120 tcaagaagat gtaaaactgc tgaataaccc aatttcaatt aatgaaattg aaaatgtaat 3180 tagaacctta ccaacaaaga aaagcccagg tcctgatgga ttcactgctg aattctacaa 3240 gaaattcaag gaagacttaa ttccaacact cctcaaactc ttcaatgaaa ttgaaaggga 3300 agcaattctc cctaactcat tcctggaagc aagtattacc ctaataccaa aaccagagaa 3360 agacccaacc aaaaaagaga actacaggcc aatctcccta atgaacatag atgcaaaaat 3420 cctcaataaa atactggcaa acagaatgca gcaaatcatc aagaagatta tacaccacga 3480 ccaagtggga ttcatcccag gaatgcaggg atggttcaac atacgcaaat caataaatgt 3540 aatacaccat atcaatagag ccaaagataa gaatcacatg atcatttcaa tagatgcaga 3600 aaaagctttt gataaggtcc aacactcatt catgataaga accctacaga aaattggaat 3660 agatggtctt tacctcaatc tgataaaagc catctatgac aaaccaacag ccagcatcat 3720 attaaatggt caaaaactga aagcttttac tctaaaatca ggaacaagac aaggatgtcc 3780 attatcacca ctcctattca atatagtgct ggaaatatta gccagagcaa ttagacaaga 3840 gagagaaata aaaggaataa ggataggaaa ggaagaagtt aaattatcat tatttgcaga 3900 tgacatgata ctctacatag aagaccccca aaactccatt gaaagacttc taggtgtaat 3960 aaccaaattc agtaatgtag ctggatacaa agtcaacact caaaaatcaa tagcattcct 4020 gtacacaaac aacaaactca ctgagagaga taagagaagc cacacccttc acaatagcat 4080 ccaaaaaatg aaatacttag gaatcaatct aacaaaggaa gtaaaagatc tattcagtga 4140 aaattacaat actctgaaaa aagaaatcga agaggatctc agaaaatgga aagacattcc 4200 ttgttcatgg ataggaagaa caaacattgt gaaaatggcc attctcccaa aattgttata 4260 cagatttaat gcaattccaa tcaaaatacc attagcatac ctcacagatc tagagaaatc 4320 attcctaaaa ttcatctgga 
accagaagag acctagaata gcaaaggcaa ttctgggcaa 4380 caaaggcaag gcaggaggca tcacaatccc tgacttgaag ttatactaca aagctactgt 4440 gataaaatca gcatggtact ggaataaaaa cagacctgaa gatcaatgga atagattaga 4500 agatacaacc acaactacaa acacactcag ccaccttatc tttgacaaag gggccaaaca 4560 tgttcactgg aagaaagata gcctctttaa taagtggtgc tggaaaaact ggctttccat 4620 atgccaaaaa ctaaaactag acccatgcct atcaccatgc actaaactaa agtcaaaatg 4680 gatcaaagat ctaaatatta aaacagaaac actaaatctg ctggaagaca aagtaggtag 4740 aaatcttgaa gacatagggg taggcagaga attcatgaac aaaaatcaaa ctgcatggga 4800 aatacttccc agaatcaata actgggatct catcttatta aaaagttttt gcatgtcaaa 4860 agaaatctcc aacagagtga aaagaaaacc cacaaattgg gaaaaaatct tagtcaactg 4920 tccctcagac aaaggacttc tatctagaac atataaagaa ttttaaaaaa tcagaccccc 4980 aaaattcaaa gacccaatcc aaaaatgggc atctgagatg aatacacact tctcagctga 5040 agaaatacaa atggcaaata aatacatgaa aaaatgttca gcatcattgg tcattagaga 5100 aatgcaaatt aaaaccacac tgagattcca tctcactcca gaaagaatgg ccaggatcaa 5160 gaaaaccacc aataacaaat gctggagagg ctgtggggaa aaaggaactc ttctgcactg 5220 ttggtgggag tgtagactgg tgcaaccact gtggaagtca gtctggaggt tcctcaaaaa 5280 gctgggattg gaagtcccat ttgacccagc tattccactt ctgggcatat tcccagaaga 5340 actgaaaaca tcataccaca gtgatatatg tgcacccatg tttatagcag cacaatttgt 5400 aatagccaaa tcttggaaac aacctaaatg cccatcaact gaggaatgga taaaaaagat 5460 gtggtatttt tatacaatgg agtactactc agctataaag aaggatcaca ttgaggcttt 5520 tataggtaaa tggatgcgac ttgagaccat actcataagt gaaataaatc aaactcacat 5580 gtgtaaatac catattgtct ccctagtgta agaaactgaa agagtaaata agatacaaaa 5640 tcttcaataa atattgtgaa agggtaagaa actgctcaaa ggggaaaaat aactcaggga 5700 aaaggagaaa agaaggaagg gaaaagaaaa aagatataaa tttgtatata cacacttaag 5760 atagctaaga tgattgtgct actaattcta ttagttcttg gaacaaaaca gttaacattc 5820 ataggattag atcaattttg catgtttata tctggtaatc tgagtaactc ttttgaagat 5880 aactatgcaa aataacaacc ttagttatgt tcctaccacc ttgttttgaa agcctatatt 5940 gtcattgttt tgggctcttt gttattgtta tttatttgtc tcttttgttt cttgtttttc 6000 ttgtattttg ttttgttttt ctaatacaga 
aaggatagat gtacatgtat aggcttatac 6060 atgcctagaa cttctcctaa agagtttaca agaattgatt ttattacgtg tgatggtatg 6120 aacagctact ttgaagaaaa tgagacatta tggtagccat tgctgatttc ctatcatctt 6180 gttgtgaagt ccagtaatgc ttttattctt tgttttgatt ggttttagtc tgttttgtat 6240 tatttaattt caatatgtgt gaaggtatgg acagctattt tgaagagaac aagagacatt 6300 atggtagcca ttgttgattt tcctattgtc tggttttgaa ggcctgtaat gctttcattc 6360 ttttgttttg atcggtttta tactattttg tcttgtttga attcattgtg tgtgaagata 6420 tggacaacta tttcgaagaa aacaagagac attatggtag ccactgttga tttcctgttg 6480 tcttgttttg aaatcctgtg ttgcttgcat tgttttgttc tgatctgctt tgttctgttt 6540 tgttttattc tctttaaaat aaaaaaataa aa 6572 // ID RLTR35_MM repbase; DNA; ROD; 467 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 2) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW retrotransposon; RLTR19_MM; RLTR35_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-467 RA Jurka J. and Drazkiewicz A.; RT "RLTR19_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 4-4 (2002). XX DR [1] (Consensus) XX CC name changed from RLTR19_MM to RLTR35_MM. 
XX SQ Sequence 467 BP; 139 A; 67 C; 163 G; 98 T; 0 other; tgccccagag ggacaaaggg cagggaataa gagacaaaga caggagatag aggatgaggg 60 agaaggggaa gggaacaagg gagaggggga agggatattt gtcctggagg acaaaggact 120 gcctctggat agagaggaga cagacgtggc acataggaaa atggtagttt ataaaggtaa 180 aaggggaaac cctgtgttag gatgaggtgt ttaattttaa ttgggcatgt taattaggtg 240 agccaaaggg ggcttttgat tgctggactt caatactttg atagctggac cttggtagtc 300 agcctcagga ggaggaagtg gccaaataag ggaatagacc ttggtggcta gctttaggaa 360 tgtaatctaa tggtttttag caaggcagag ggaatggggg agaagggcaa ggcctgcggc 420 ctgccagagc catgtttgcc atgctcaggc tggctagagt cccttca 467 // ID RLTR19A_MM repbase; DNA; ROD; 500 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse subfamily of LTR retrotransposons - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR19A_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-500 RA Pavlicek A. and Jurka J.; RT "RLTR19A_MM - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. RLTR19 subfamily, weakly similar to RLTR33_MM CC (84% identity). Individual copies are ~90% identical to the CC consensus. CC 6 bp TSDs. 
XX SQ Sequence 500 BP; 128 A; 112 C; 98 G; 162 T; 0 other; tgttatggtc tgaccccccc cccccccagc atagctgtag ccattttgtt ccatgcctgc 60 cagctatttc atattgttgc tgtaacatgc ctgccagtca ttgacacaga gaagtgactt 120 gactcagagc aaggtcatgc tgaccacata ccctgttatg ttctgaatgt tctgtatgag 180 gtttgttaat cttaagaaat tccacaaagc tttacgtagg acccatcaaa tcaaaggtca 240 atatgaactg ttatgtctaa aatatcttga gtcagagctg accaccaggc agcacttcct 300 gcacctgcgt atgaactcat tgtggttttt gtttttttcc tttataagct gacagaaaaa 360 gatatctgtt gtcatagttc agataattct gagctatgtc cctggtctga ccagtattag 420 ggtgtgcatt caataaacta ttcttgttta actgagatca gtgttcatat ggtttgtgtg 480 gcgattcctg aaccccaaca 500 // ID MER44B repbase; DNA; ROD; 719 BP. XX AC . XX DT 18-AUG-1998 (Rel. 6.6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE Nonautonomous DNA transposon. XX KW Repetitive sequence; MER44B; MER44. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-719 RA Naik A. and Jurka J.; RT "MER44B."; RL Direct Submission to Repbase Update (AUG-1998). XX DR [1] (Consensus) XX CC Approx. 90% similar to MER44A and 80% relative to MER44C. CC Contains long insertions relative to both. 
XX SQ Sequence 719 BP; 227 A; 115 C; 150 G; 213 T; 14 other; cagtagtccc tccttatcca tgggggatat gttccaagac ccccagtgga tgcctgaaac 60 trtggatagt acygaacyct atatatacta tgttttttcc yatacatact tatgataaag 120 tttaatttat aaattaggca cagtaagaga ttaacaatan taactaataa taaaatagaa 180 caattataac aatatactgt aataaaagtt atgtgaatgt ggtctctctc tcaaaatatc 240 ttattgtact gtactcaccc ttnttcttct tgtgatgatg tgagatgata aaatgcctat 300 gtgatgagat gaagtgaggt gaatgatgta ggcattgtga tgtagcgtta ggctactatt 360 gaccttctga tgtctgatga tatgtcagaa ggaggatcat ctgcttcagg tgatcctgga 420 tcattgagcc atgacaatgt cgatggttgg atgtcaggag cagacaatgt cgatgactaa 480 tgggtggrta gcatatacag cntggatacg ctggacaaag ggatgattca cgtcccaggt 540 rggatggagc aggacagtgt gagatttcas ccatcatgct actcagaaca atacacaatt 600 taaaacttat gaattgttta tttctggaat tttccattta atattttcag accacagttg 660 acyatgggta actgaaacca cagaaagtna aaccayrgat aagggagaac tactgtaca 719 // ID MER60 repbase; DNA; ROD; 837 BP. XX AC . XX DT 07-OCT-1998 (Rel. 3.1, Created) DT 07-OCT-1998 (Rel. 3.1, Last updated, Version 1) XX DE 5' part of L1M subfamily. XX KW Repetitive sequence; LINE; L1M subfamily; MER60. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-837 RA Kapitonov V.V. and Jurka J.; RT "MER60."; RL Direct Submission to Repbase Update (1997). XX DR [1] (Consensus) XX CC MER60 is the 5' part of one of the L1M subfamily. 
XX SQ Sequence 837 BP; 294 A; 153 C; 177 G; 188 T; 25 other; tgcagrttgc tagtgccaga gggagcaaag aattgtgatt gtggatttta attgctggca 60 gctccctgaa ggactcacaa gtcttttttt ctcggctgaa gtggttcaaa aacatttaaa 120 ggcacatttg ctggagcaag ggacaacatg rgrcaagcaa yagatagacc aaaaagccta 180 agaaggaaga gmtgagaaat aagatgcttg gggaaawaag ggctttgaaa agtccacata 240 ttcctgggaa tctagaaggc catgcacatg tccagggctg gacacatgct cagaaaagac 300 ctaagaaggc cctaagcttt cacctctggc tgaccttnag actctgtaca agcaggaagt 360 gaaggctaag gcagagttgt aaactgcctt ggctgagtgt tgaaggaatg ccycaacaca 420 cagmcaatct gcaaagacta ggagactttt tttttgtgtt tggtddtktt gttgttrttc 480 caggtattta aggaaayctc tgtcaawyac tagctgacca ctaagctaay cgarcagaga 540 cttcagtggc cacacatgac aaagaataca gactttacaa aattagttta gaaaagtcac 600 taaacaaaca ayacaacagc amayaayaag caacaacaac aaaccctgga aaggggagag 660 aatctgattt ccagagttgc cacattataw tatttaaaat gtccagtttt ttttttcaac 720 arcaacaaaa attatgaggc atacaaagaa acaaagaaaa gtatggccca tacacaggaa 780 aaagaaatta atagaaacta tccctgagga agcccagaca ttgracttac tagacaa 837 // ID LTR3_Cpo repbase; DNA; ROD; 405 BP. XX AC . XX DT 20-JUN-2009 (Rel. 14.07, Created) DT 01-DEC-2009 (Rel. 14.07, Last updated, Version 3) XX DE Long terminal repeat: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR3_CPo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-405 RA Jurka J.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1544-1544 (2009). XX DR [1] (Consensus) XX CC >92% identical to consensus. 5 bp TSD. 
XX SQ Sequence 405 BP; 118 A; 81 C; 87 G; 119 T; 0 other; tgttgtagct tggcataact gacgccatct tgagccctgc atttaactag tgccctgtat 60 ttgacccagg aaaatggaaa attactcagg gtataagtcc catgagaaaa gcagaatgag 120 ctgtacccac aggagaaaaa ttaacctgct caacaaaaga agcaggatac aggtgtcaac 180 cgaaaattaa tgtaagattc tagctaaatt gtttccaaca ggatgtcttt ttgtgattct 240 ctcttttaaa aactctgtaa ctttccagtt cggggccact tgtttggact ctgaaacggg 300 gggagggggt ctgtatacga gtggtcctga gctcagttaa ttaaattcca aatttatcaa 360 tttggctgct tggattccta tattgttgtt cacgaaccca cctca 405 // ID LTR6B_Cpo repbase; DNA; ROD; 356 BP. XX AC . XX DT 19-SEP-2009 (Rel. 14.1, Created) DT 19-SEP-2009 (Rel. 14.1, Last updated, Version 3) XX DE Long terminal repeat of ERV3 endogenous retrovirus: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR6B_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-356 RA Jurka J.; RT "Endogenous retroviruses from guinea pig."; RL Repbase Reports 9(10), 2151-2151 (2009). XX DR [1] (Consensus) XX CC >86% identical to consensus. XX SQ Sequence 356 BP; 77 A; 88 C; 97 G; 93 T; 1 other; tgtcatggtt tatgtcagtg gcccccaagc ctcatgcaat tatggactcg accaaatcta 60 gcgtttgtga ttggttcgtt gtctagcgct cggattggtg gtgtgagtgc tncacccacc 120 caggggtgga gccaggatac gatgtaacgg cgggaagaag gtgtgtctct ctctcttgcc 180 ggttcgcgct tgctgtttgc agcggccatg aatggctgcc ccgccatgcc acgctgcctt 240 ggagccagct gagtatggac tgaaacctcc aagaactgta agaaataaac ctttccttcc 300 ttcattttgg gcgtcgggta ttttgtccca gcaacgagaa aaaagtaact aagaca 356 // ID URR1 repbase; DNA; ROD; 236 BP. XX AC . XX DT 25-APR-1997 (Rel. 2.03, Created) DT 28-AUG-2008 (Rel. 2.03, Last updated, Version 2) XX DE Rodent-specific putative non-autonomous DNA transposon - a DE consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; URR1; PR. 
XX OS Rodentia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires. XX RN [1] RA Ogata T.R., Rosa A.P. and Zepf E.N.; RT "Sequence of the gene for murine complement component C4."; RL J. Biol. Chem 264, 16565-16572 (1989). XX RN [2] RA Gale M.J., Tobey A.R. and D'Anna A.J.; RT "Localization and DNA sequence of a replication origin in the RT rhodopsin gene locus of Chinese hamster cells."; RL J. Mol. Biol 224(2), 343-358 (1992). XX RN [3] RP 1-236 RA Smit A.F.; RT "URR1."; RL Direct Submission to Repbase Update (30-NOV-1996). XX DR [3] (Consensus) XX CC URR1 has 19-21 bp (imperfect) terminal inverted repeats (TIRs). CC The terminal TA dinucleotides are probably flanking repeats. CC The element is related to PMER1 in prosimians. XX SQ Sequence 236 BP; 70 A; 51 C; 57 G; 58 T; 0 other; tacagcagcg gttctcaacc tgtgggtcgc gaccccttcg ggggtcgaac gaccctttca 60 caggggtcgc gtatcagata tcctgcatat cagatattta cattacgatt cataacagta 120 gcaaaattac agttatgaag tagcaacgaa aataatttta tggttggggg tcaccacaac 180 atgaggaact gtattaaagg gtcgcagcat taggaaggtt gagaaccact gcccta 236 // ID RLTR46A repbase; DNA; ROD; 321 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 26-AUG-2008 (Rel. 9, Last updated, Version 3) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR41_MM; KW RLTR46A. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-321 RA Pavlicek A. and Jurka J.; RT "RLTR46 - a family of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). 
XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Similar to IAPLTR3. Individual sequences are CC ~90% identical to the consensus. XX SQ Sequence 321 BP; 76 A; 80 C; 92 G; 71 T; 2 other; tgtggagagc cgcgataaca tttgccatca caagatggcg ccggcttccg cagtgcctta 60 tgccacctaa acaaagaaca agctgtggtg cgcatgtgct aagagtaatg ttcgcgccaa 120 gtcataagcc caccccgggg cgtgtcaatg agatcgtggg taagcgacca gtcaggcgtg 180 gacacgccac gctagggtgt atataagcag cgcctttctg aggctctttg tcttcctcat 240 caatatgcaa taaacgattk gctgcagaag gatcctggtg ttccgtgygc gttcttgctg 300 gcgaggaaat agcgcgggac a 321 // ID RLTR19-int repbase; DNA; ROD; 6137 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE ERV2 Endogenous Retrovirus from Muridae. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW RLTR19-int. XX OS Muridae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea. XX RN [1] RP 1-6137 RA Smit A.F.; RT "RLTR19-int - ERV2 Endogenous Retrovirus from Muridae."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC (5 bp dups though) some copies as little as 7% div; pos 1-3624 CC similar to ERVK elements (closest to co-hybrids RMER17C-int and CC RMER16-int, and to ETnERV3 and MYSERV); 3' end has patch-like CC similarities to ERV1 group internal sequences (!) The internal CC sequence at pos 1433<-1711 contains an ancient fragment of an CC hnRNP core protein A1 pseudogene. 
XX SQ Sequence 6137 BP; 1826 A; 1247 C; 1231 G; 1782 T; 51 other; tttggtgcgt tggccgggaa gcaagccccc taccctcgag cacccctcag ttactgccag 60 tgaatgccac caagccggct gggccctccg tgaggaggta agtttcctgt acctgtgcag 120 ttcttttttc ttgtttttgt tgtttctggt ttggttggac tttcggagac ccaagagagg 180 agagctggac gcagaaatnt ccttgggtcn gaggaggtca ggagacgtcc tgcttcctct 240 ctggttccca ctgggagaac gcctggttct ggttctgggt ccctttggtg ggacatctgg 300 gtccctttgg tgggacatct ggntctcttc cctttcctcc ccaactctat tggttgtcct 360 tggcgcgctg gtctgtccat gtctgtcagt ttatgtctgt agtttcgttt nattgcttga 420 ttgttgctct gtgtttcata ttcagaggaa aaatggttaa aacttcatct gctggccgtt 480 taccccttga cttagtttta actcatttca aggattttaa gaaaaaaagc agcctgaatt 540 gggacaacaa aaactgataa actgcccttg gagccctgca cctgcgctag gagtctagca 600 ctgctggaga agccctgcca gggctgcttg gctgtacagc cgctcttaaa ggggncagca 660 gcttctcggc ttctctggta agaaagctga tggctagttc tacaaaatct ctgtgcccat 720 gagaattgga ttcaacaggt ggcaaggtgc tcctcccttg aaattacagc tagaaaaaaa 780 gaaaatttct gtctgttcta aatgtataaa tgtatgtggc cactacatnt ttgtttctga 840 ctggtcttaa atgtataagt ttactatgtt ctgcatgtct tggttataga ttattggctt 900 ataagttatt gggtatggtt aaaaatctgt aacatcngta acaaaaagtt agcttaaaac 960 nagtaactca gggttgaagc cattctgggc aacaacacac ggcatgagcc aacccaggga 1020 aacaggtctc taaagatact tttaaattgg gataatattt ttatgtaatt cctgtcctaa 1080 aaaccagact tataaaagat aggatttaaa aaatgtttct ttaatgaggt attaaagctg 1140 caccttcatg cattcatata caagaacttt tcttgctggc agccaaactt tgtaacatca 1200 aaataatgca cttggtattg attttagaga cactggattg tttaaactgt taaaaattaa 1260 aaattggttt tgtcttctaa aaattatggt tatgctctat gacttcactc tttaaaaaga 1320 tactttattt tgatattgca aaagcaactt taaaaattat agttaaatat ataaagctat 1380 gggacatttt aaaatttatt aaatgtttta ttggaatgtt ttattaatat atgatggtta 1440 caataaagga ggaaatttta atgaaataac tataatgatg atngaaacta taatgacttt 1500 agaaattata atggacaaca gcaatcaaat tatggaccca tgaagggggg acaattntgg 1560 tggaagaagc tcaggcaatc cctatggtgg tagctatgga tctgatgatg gaaatgatgg 1620 atatgatagc agaagtttta aaataaaaca 
gaaacgggta cagttcttag aggagagaga 1680 atgaggagtt gtcaggaaag ctgcaggtta ctttgagaca gtcgtcccaa atgcattaga 1740 ggaacantaa aaatctgcca cagaaggaat gatgatccat agtcagaaaa ttactgcagc 1800 ttaaacagga aaccttcttg ttcagactgt catgccacag tttacaaaaa atacagctat 1860 tgattaatgc aatatgatgt cagttagata tacattcctg aggntttttt atctgttgta 1920 gctttgtctt tttcttttca ttacgtcagg tatattgctc tgtaaattat ggtaatgata 1980 ccaggaataa aaattaagga atttgttaat ttaaaanttt ttagaggttt acaatattaa 2040 aaaggttaag aatcactggc tcttgaattt gcctgagctc tggcaaggct ccagcatgcc 2100 cgagtcagta aggctctttc agccgaggtc ttgcagtttt tcccaactgt taaccttttc 2160 tgtcctgaca ctggtttcag cttaaactga atcatatgag aaactgttat ctctctcaga 2220 aggtcagaaa agttcctagt ctctttatgg agtatgttta tgggtttttt ttaatactag 2280 aagagcttca attcaaaact gtaattttaa ggttcaagcc taacagggat tgatagtcag 2340 taaccttgaa ggtgatcaaa tcctttaata tgttcagaaa tatatttaaa gtcatgctaa 2400 gtactgatgc agttaattnc aagattaaag ctttatttag tctcctgttt tatgtttgna 2460 aggtacagct tagagcagat aactaagaac aaacaaagnt tgtttaactc agatatgcta 2520 ggtaggtact agccctcaaa ccagtcagag atctgctgaa tatggcattt aatatgttta 2580 aacttaccat aacagacaga gactcccaaa tcctaacagt gacccccaag gtctccaaga 2640 agatatgggc acaacgacaa aggacaccac cnggattgtg gtatgataac cactgggcat 2700 aactgcccca atgccttgcc tgctgccagg gcccagcctg aactgtggac aaacagagga 2760 caactggaga attgattgcc acacgttgcc taagacaagg tgaggtcagt ctctcccatg 2820 ttcctcctcc acaggaaaaa gcctcttcat cttctgggcc tgatggccaa agactgcctc 2880 tgcccttggt gccatagaga cacaggagcc tgggataact gtctaggtaa taaaatatgg 2940 actagtctgt cattttaatt gatacatagg ccatttagat tacaatttat ccttctcaga 3000 tctctgatca cattgatggc taagctaatt gtagcttgac agctagaaaa caataggcaa 3060 ctagctcctt acctcaaggt aatccaccac tgtgntagtt cattagtcaa ttcattagtt 3120 agttaaaact actaggtctc ttattaaaaa gggcaaacag gtttgccttg ctcacaggct 3180 tcatattaaa tgactgctct caataatcaa ctgcctatgt ctcatagtaa ataattggat 3240 taaaaatata aaatttaann ttatttaagg tctagaaaaa tgtttatggg tctagaaaaa 3300 tgtttgagat tgaaaatgca gtgataaagg ttagaggata 
aaaaacttat gatggctaga 3360 aaatgnttta aataagaatc ttcaataaaa atgttaaggt tggtaaatga actaagattt 3420 aagggtctaa gaagatgttt taggtatata aatagcaagt tatagaggta taaaaagtaa 3480 tttaagaaat ggaaaatgtt tcatattccc ccatgctatt gttatttcaa agttcagaat 3540 tttaacattg atcaatggag ttctgataag ctaatggagc actggcagct tacaattcag 3600 tcacaagctc aagattttaa attccctttg gttgtcttct aaatatagac ttaaaggtgc 3660 ttctcatcat gctaaaaaca tctctgttca atctgtatat ctagccctct ggactgagaa 3720 aagaatgttc tgtgcctttt gacccaaacc tatagctttt actaggtctc aaaggcatga 3780 gtctactcct gttgtcttac agacacctgt taactacttt ttctaaaata tggatttcaa 3840 tacaatggta aatactaatg tatatccttt tgtaaactaa agttcacata agcttcaggg 3900 aatcaaaaag tcataagatt tggatccacc tgccacagat caaatggact ccagataagt 3960 acacctgccc tgttcttgaa aatgggattt ttcccttggc cttgggattc cttaccttcc 4020 ccaattacta gacagtttta acttctgtcc ctagtctata tttgtctcag cagattttca 4080 cctggctgac agactccatc cagggatcac ccaatgnanc tgctgaactc tggacttgct 4140 gaaagctgat gttaaccagt ccagctgatg tgattgctcc ctgtctttca tctggatcag 4200 ctaatcagat cagatgcttc tgataaatgc cccattgccc agcctttgac tagcatttca 4260 gccttcctgg gcccctntga caacgtccta atgtcagcng gaagcagtta cagaagagaa 4320 atacgncgtc cattgtccca ccttacaggc tgaaatgcta agtcaaaagg aanccccctg 4380 ggnccacgct gaaaaggacc ccacttcntg atcctgacca ccccaatggc catgaaaata 4440 gatgggatnc aatcctggat tcatcactct catctgaagt tcaccccctg gaaatatcag 4500 gaccgggacc agcgatgggt catcagacaa cacccacagg acccattaaa aatcagactg 4560 gtaaaagact ctggagacta gttttctntt ttgtgttgcc ttgggctgct gaggcacatg 4620 ctccagtaca acacatatga actttgatta gaactacaga caggagccta attactaata 4680 tcactgtcta tggttccccc accctcacct ttganttatt tgggccaaaa tggaatangt 4740 gcccaaagac tgcaaacata tatgttccct cattctctag gtcttttaga tataggatcc 4800 agaatatgga aaaagtaatt tggccaccct ttagatatat gcctgtccct caaatggaga 4860 ccctgattgt aggggacaag atatgtactt ttntgctata tggggctgtg aaacattagc 4920 tccctgggta actgataagg ataattatat tcagttacag agggttgagg gccactctga 4980 aaaatctggg aaaagaaatc aaccccattc aaattaagat taaaacgata 
taaataatgt 5040 agganatagt acagggaaac agaggatgtg ttttngataa ggccacctgt tgagaacaat 5100 tatagcccct atgggaatgg tcattacaac cacctggaag aggaaaaaga aatgtcccgt 5160 ccccacctag acaaaggaca aatgacncaa taaacacctt tggtctgaga gtgccagaaa 5220 ggggccctcn ctggcacaga aaaggacagc cagccctcat ctgcccagac atggacttgg 5280 ttacctccac tggccaccat agcccgctct gggtaccgtc acctctgcct ctgacattcc 5340 tgccccggta caagctttgc tngagaccaa cactgccagt cagacnccag ttcaggcctg 5400 ctgcaggtcg ctcctgacgg gcntatcaat gctacactca gcttgtttcc aacgtacggg 5460 atatgttgaa tcaggaagga cttagggact ctgcctttat gcatctggta angaccctgg 5520 aatgatgttc accatccaga gattccaggt caccaatann aggatcttag acatcgttag 5580 tcccaacaaa gtattaaacc ctctccctga gccacctgan tctgggaggg attaaaggac 5640 aaggcctctg cttatacaca ggttnatata agatatcaga gtctcggtac cttcaacact 5700 gtaatgtaac aatccaccta cagatgttaa ctgcaggagt ttccccttac natttagtac 5760 ctccaaaggg cacttggttt gcttgtgcct cgggacaact ccctgtntca gtccctttat 5820 cctcactaaa acttctgact cttgcttact tgtacatcta ttgcctcaga tatactatta 5880 ttctagagaa gangattgga acatctggga cttcacacta atcccagatg gccagggcag 5940 ctccaatact cgtgcccctc ctantaggca tggacatagc agggtctgcc agcatgggag 6000 cagcaacact tattaaggga gatcaaattg tcaaaaaatt ttaagccaac aaaggttgat 6060 ctaagtttgt ccccctggcc tcagaaaaan taattgagtc aatgatttga ctcactggaa 6120 caactcaaag gggggaa 6137 // ID MLT1E repbase; DNA; ROD; 568 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 23-JAN-1998 (Rel. 6.4, Last updated, Version 3) XX DE Mammalian transposon-like element long terminal repeat (MLT1e DE subfamily) - a consensus. XX KW Non-LTR retrotransposon; MaLR family; MLT1e subfamily; MLT1E. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-568 RA Smit F.A.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21(8), 1863-1872 (1993). 
XX DR [1] (Consensus) XX SQ Sequence 568 BP; 146 A; 134 C; 137 G; 129 T; 22 other; tgtggtaggc agaattctaa ratgmtccca atgatcctcg cctcctggcg taatctcctt 60 gagtgtgagt aggacctgtg acttgcttct agccaacgga atatggcaaa ggtgatgara 120 trtcacgtga ttacgcgtac gtgattatgt aasattcagt ctttgmcgtc attcttgccg 180 agagactctc ctsctggtyt tgaagaagta agctgccacg tcatgagnnn ncnnannaga 240 rygccgcaag gcaagggnnt ctagagctga gagtcgccct tactgatggg cagcaagaag 300 caagccacct cagtcctaca gccgcaaaga actgaattct gccaacaacc tagtgagctt 360 ggaagcagat cctgcccagt cgagcctcca gatgagancg cagccctggc tgacgccttg 420 actgcagcct tgntagacct tgagcagagg acccagctaa gccgtgccca gactcctgac 480 ccacagaaac tgtgagataa taaatgtgtg ttgttttaag ccgctaagtt tgtggtaatt 540 tgttacgcag caatagaaaa ctaacaca 568 // ID Charlie11 repbase; DNA; ROD; 2196 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE hAT DNA transposon from placental mammals. XX KW hAT; DNA transposon; Transposable Element; Charlie11. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-2196 RA Smit A.F.; RT "Charlie11 - hAT DNA transposon from placental mammals."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC Incomplete termini (just too diverged to handle); gave rise to CC Buster 3; copies 25% diverged from consensus: average Kimura CC substitution level is >=30% (excluding Buster3 which CC shows 16% substitution level in the coding region);. 
XX SQ Sequence 2196 BP; 682 A; 393 C; 415 G; 643 T; 63 other; tgtggagccc agctgagcag cagtggcaag ctncatgttg aattntgttt tcatggctta 60 attcgtaccg ccacgttgct gggtgtntct taacgcnnat gnacgagaca ncatgcgaca 120 tggccttgtt tnaattcant tagcnactgt tgttaatgcg tcggttatan gttactncca 180 taacaaaatt ccgtgntatc tctggaatac tatctgagtt tttgtgtgcc atgttgaaga 240 ancnnaaatg ngatgacaac tatgttcgtt nccgattcnc ttgtacaacg gaggtggatg 300 gaactcaacg accacagtgt atgctgtgca actcgttatt ttcaagcgcc aanctcangc 360 catcgangct nccngaatat ttcaacagac agcatggcgg cacagccgga catgacctcg 420 acannctgaa gtncatgcga gcacaatttg atcanagcgg aaccttgaag acatntggat 480 ttgtgtcact tgaaaagcct ttgttacaag cattctatca agttgcgnat tcatgtgcca 540 aggaaaagaa gcctcataca gtagctgaaa aattagtgaa actttatgca ctagaaatgg 600 caaaaatagt attgggacca gangcacaaa agaagcttcg gcaggtnccc ttgtcaaatg 660 acgtgatccg ttctagaatt catgagatga gccaggatac cttgcagcaa gttatagaag 720 atatcaaagc tagtcctctt aaagtgggta ttcagcttga tgagncaact gacattgatg 780 gctgcagtca gctatcggtg tttgtgcgnt acataaaaga aagagagatc gtagangaat 840 gcttgttctg tgaaccattg cagttaacta cgaaaggaat cgntgtgttc aatctcatca 900 gagacttctt tttgaagcgt aagataacac ttgatacatg tggatcaatt tgcaccgatg 960 gtgcccctgc tatgctagga aaaaattcag gatttgttgc ctacgtaaaa aaagaaatac 1020 ctcatatcat gattacacat tgtatgttgc accgtcatgc acttgccaca aagactttgc 1080 ctacaaaatt gaaggatgtt ctgtttnctg cagtgancgc agtaaacttc accaaagaga 1140 gcgctctaaa tcatcgcctc ttccatgctt tttttaaaga aattggtacc gagcacactg 1200 tcctcctttt ccatacagaa atgaggtggc tttnntgagg ccagatactt actcgtattt 1260 ttnaaatgtn taaagaaata aatcagtttc ttcacaacta aagcagtaat ttagttgatg 1320 actttgaaaa tagagagttt atcntttgcc tagcatacgt agcagataca ttcaaacact 1380 taaatgaacn caatgtatct atgcagagaa ctaggatgaa catagcgata gccagagaga 1440 agttatctgc ttttattagg aaacttccan tttggntaaa gnatactgag aaaagaaatt 1500 ttactaactt tccttttctt gaagaaatag ttgtttcaga aaatgaagga ataactatcg 1560 caantgaagt gacaacgcat ttgcaacant tgagtgactc tttccatgga tatttttcca 1620 ctggagatct taatgaggca aagaaatgga 
tatcggatcc attncttttt aatctggatt 1680 ctatcaatga tagtnatttg ataaaatgtg atttcactga attacaagct aacggtcaaa 1740 tccnaatgga atttgagaca ataaagcttg agaatttctg gtgtgcccaa ttgacagcat 1800 tttcacaact ggcaaagaca gcactggaga tccttgtgnc atttgctact acataccttt 1860 gtganacagg atttttatca cttttgcata tcaaaacaaa gaccagaaac cgcttaaatg 1920 tgagtgatga catgcatgtg gctatttcaa naaaagttcc tcatttctcg aanatcattg 1980 aacaaaagct acagcagaaa tcactgtaag ctaatatact ttctttatgt aacaatttca 2040 taactttata ncattntaaa tattaagntg taactctatt tctttcattc tatanttatg 2100 atatagctta ananttgtct aaaaatataa cttnaagaac taacctacta cagcaatttt 2160 gtttatgaga tgtgagatta cgttttgaga ataaaa 2196 // ID RNSAT1c repbase; DNA; ROD; 168 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from rat. XX KW Satellite; Simple Repeat; RNSAT1c. XX OS Rattus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-168 RA Smit A.F.; RT "RNSAT1c_ - Satellite from rat."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX SQ Sequence 168 BP; 42 A; 40 C; 22 G; 64 T; 0 other; ctttcatgcc ttttaaggtc attctgacct acaaaggctt taccacattc attacattcg 60 taaggtttct ctccagtatg aattctttca tgccttttaa ggtcattctg acctacaaag 120 gctttaccac attcattaca ttcgtaaggt ttctctccag tatgaatt 168 // ID MT2A repbase; DNA; ROD; 448 BP. XX AC . XX DT 25-APR-1997 (Rel. 2.03, Created) DT 25-APR-1997 (Rel. 2.03, Last updated, Version 1) XX DE Long terminal repeat of retrovirus-like element; MT2A. XX KW ERV3; Endogenous Retrovirus; Transposable Element; KW Long terminal repeat of retrovirus-like element; MT2A. XX OS Sciurognathi OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia. 
XX RN [1] RP 1-448 RA Smit A.F.; RT "Structure and evolution of mammalian interspersed repeats."; RL Ph.D. dissertation, Univ Southern California, 1995, pp 220-224.. XX DR [1] (Consensus) XX CC The MT2 elements share similarity with both MaLRs and the CC MLT2/HERVL endogenous retroviruses. XX SQ Sequence 448 BP; 107 A; 109 C; 92 G; 122 T; 18 other; tgtgatgrct aatcttggtt gtcancttga ctacatctgg aatcaactaa aacccaagca 60 gctggacgcn cctgtgagrg gttttcttga ttagattatt tgaggtggga aaatncaccc 120 taaatctggg ccacactttc tggytgcagc ccgyatrraa gracatggaa gaaggaagtc 180 tagcttttgc ttttgcctgc ttgccctcay kcttgctggc aagttcatct gtcctgctgc 240 tgaggtatcc ttcgctggta ttaragccta cttcttcagg attccagcat atactgaaga 300 scagcwgatc tccaggactc ctccataact cnagyaccag gytgggacta ccgagacagc 360 agcctgtgag ccgttctaat caatccccta caaatagaac atttttcttc catcagttct 420 gttccgttgg agaacccaga ctaataca 448 // ID MamGypLTR2_LTR repbase; DNA; ROD; 1189 BP. XX AC . XX DT 05-FEB-2007 (Rel. 12.01, Created) DT 10-APR-2007 (Rel. 12.01, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from mammals. XX KW Gypsy; LTR Retrotransposon; Transposable Element; MamGypLTR2_LTR. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-1189 RA Smit A.F.; RT "MamGypLTR2_LTR - Gypsy LTR Retrotransposon from mammals."; RL Direct Submission to Repbase Update (05-FEB-2007). XX DR [1] (Consensus) XX CC 4 bp TSD; 35% subst in dog-human (!); pos 125-1189 (end) 70-80% CC similar to GypLTR1c. No idea if this is the 5' end; 3' end much CC better represented. Intermediates between 2a and 2b/c noted. CC rnd-3_family-480. 
XX SQ Sequence 1189 BP; 288 A; 310 C; 376 G; 201 T; 14 other; tgtaatangg ataatatttt tgaaaatatt tcccaattat gttcccttat tggtaaaccg 60 gtttctctcc ccctgcacca tccaccatta tggccccagg gagggggcaa ggctgatgcc 120 accctaggaa gaaggaggat atgacgtagg ggaagtccnt gccaganagg aagaggaagt 180 ggaaagcccg ccccttctag ccccggngga ttgtgggaag aagagagagg cggaagtaga 240 gcaaggaggt cagacgccag ggtcctcgct tcctcccctc cctgggcccg aacccaggat 300 gggggggggn gctttagaaa catccagata ggtatggggg agcccgagaa catcggggct 360 agtggcncgc ttccccgggc atagcngggg gaggctgcag gcctctagga gaagccccgc 420 atttggctcg gcgccacgtc caacatggcg cgggagcgat ggtgcagcgn tggcggaggg 480 aggtggctag atagatgagc ctgaggcagc gctcctggct ccccatggcc tgcgtgtggc 540 atgcagggga tccagaagtt cccgcgtgcc ccggtgaggg gacgcggagg tgctgagagg 600 gccggtggac cagcagaggc ctggggtcag gacaaagagg ccgcngtgng cggggacttc 660 gagaccagag gcaaatggcc gggaccacgg actccagcgg tgggtgccag cacatcaccc 720 caaaaggcca gatgggacca gtcgcacctc agcggtcacc agtccaggga gcagaccaga 780 ccagccactc cgcagcagag accagcgagg atccagagga cnccgcatgg atccgaggnc 840 cccctctccc ctnccgccac gaggtcacat gagcccacac tcccccatac acccagatgc 900 catcttggag aggagcaggg ggaggaggag gaaatctgaa agactgagca tttacctgaa 960 agagactgag tcatccaaaa gagactaatt tacctaaaag agactgtttg aattactgga 1020 ctggactaag tttaccagac tggactaaaa tttagtcgtt ctcncgcccc tcgctaccca 1080 gcggggtggg ggctcgtgag gaagatcaga tcagttatag agaaataaag aagctacatt 1140 ttctttgcac atctgagtgt agtgtgagta aatttgcgac cccgctaca 1189 // ID MLT1E1 repbase; DNA; ROD; 641 BP. XX AC . XX DT 03-SEP-1998 (Rel. 6.6, Created) DT 03-SEP-1998 (Rel. 6.6, Last updated, Version 1) XX DE LTR from retrotransposable MaLR element - a consensus. XX KW MaLR family; MLT1E; MLT1; MLT1E1. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-641 RA Jurka J.; RT "MLT1E1."; RL Direct Submission to Repbase Update (SEP-1998). 
XX DR [1] (Consensus) XX SQ Sequence 641 BP; 176 A; 131 C; 162 G; 158 T; 14 other; tggtaggcag aattctaaga tggcccccaa gattcycacc ccctggtatc atatrccctg 60 tataatcccc tccncttgag tgtgggcagg atctgtgaat acgatgggat atcactcctg 120 tgattaggtt acattatatg gcaaaggtga agggattttg cagatgtaat taaggttcct 180 aatcagttga ctttgagtta atcaaaaggg agattatcct gggtgggcct gacctaatca 240 ggtgagccct ttaaargagg ytctagaagt cagacatgga agaagtcaga gagattcaaa 300 gcagcagaga tgctctcctg ctggccttga agaagcaagc tgccatgttt tgtggagagg 360 gccatatgnc agggantggn gagcagcctc taggagcnga agtcctcagt cctacaacca 420 caaggaaatg aattctgcca acaaccngar tgagcttgga agaggatcyt gagcctccag 480 atgagaaygc agccccagct aacaccttga tttcagcctt gtgagaccct gagcagagga 540 cccagctaag ctgtncccag attcctgacc catagaaact gtgagataat aaatttgtgt 600 tgttttaagc tgctaagttt gtggtaattt gttatgcagc a 641 // ID L1MD_5 repbase; DNA; ROD; 1604 BP. XX AC . XX DT 09-OCT-1997 (Rel. 6.3, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE Partial L1MD LINE1 repetitive element 5' end - a consensus. XX KW L1 repeat; L1MD_5; MER79; L1M6_5. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 500-1 RA Smit F.A.; RT "L1MD_5."; RL Direct Submission to Repbase Update (1996). XX RN [2] RP 1-1604 RA Jurka J., Walichiewicz J. and Kapitonov V.V.; RT "L1MD_5."; RL Direct Submission to Repbase Update (28-JUL-1997). XX RN [3] RP 1-1604 RA Smit F.A.; RT "L1MD_5."; RL Direct Submission to Repbase Update (19-AUG-1997). XX DR [3] (Consensus) XX CC 5' end of L1M subfamilies. CC Originally expanded to 1619 bp and classified as a 5'-portion of CC L1 CC (Jurka et al. [2]). A shorter version submitted by A.F.A. Smit on CC Aug. 6 - deposited in the Appendix. Minor refinements of the 1619 CC bp consensus [2] worked out by A.F.A. Smit (August 19, 1997 [3]). CC Replaces MER79 [1] and L1M6_5 [2]. Average divergence from CC consensus is CC 24%. Appears to be 5' end of L1MD1 and L1MD2 subfamily LINEs. 
XX SQ Sequence 1604 BP; 524 A; 412 C; 387 G; 270 T; 11 other; ttaaaaacaa agcgggagac ttccgcttcc gggaagatgg agtagacgta cttttcccta 60 ttcctcccgc taagtacaac taaaaaccct ggacattata tataaaacaa acataagaag 120 actctgaaag gtggagagaa gaaggcagac cggctaggga cctcgggacc cgaggaacga 180 cacggtagtg agttccctgg gttttctttt tgcctcatat atcccagact tggagctgaa 240 gaagccggca acccggaaac gccaacgggc acagacaaaa aaagccccaa caaaagcctg 300 ctctctctag ccaaaggacc aggaaagggg cagcctagca agacagaaaa cttttagaca 360 ataaccgctc tactccagcc aaacaccaca gaaaaaactg tggccccacc cccacccacg 420 ccagcaaagg ccgagtgggg agcctagact tccaccctca ccaggctgta acgaggcgcc 480 ccaacacctc caccgggatg gtgtcagaga aggccaagta gggagctggg actttcatcc 540 ccgccaggcg gtaatgaggc ccmccttccc cttgccmctg cggtgtcagt ggagaccacg 600 tggggagcct ggacttccac ccccacccgg cagtaatgag gcgcccctcc ccctccctac 660 tggggtggtg tcagaggagg cctagtggag agtcgggact ttcaccaccg cccagcggta 720 atgaagccac ctcctcctct tgccmccatg gtgtcagtgg aggccacgtg gggagcagta 780 atgaggcact cctacccctc ccagccaggg aggtatcagc ggaggcctag tggggagccg 840 aactcccacc cccgcccagc agtaacgagg agcccctccc tcacctcggg tgtcaacgga 900 ggccgagtgg ggaacctgga cttctacccc cacctggcag taatgaggca gcgcccctnc 960 ccctcycctg ccggagcggt gtcagaggaa gccggctaaa acagaaggtt taaataagat 1020 ccagagtctc ataacataat acccaaaatg tccaggtttc aatcgaaaat cactcgtcat 1080 accaagaacc aggaaratct caaactgaat gagaaaagac aatcaataga cgccaacacc 1140 gagatgacag agatgttaga attatctgac aaagatttta aagcagccat cataaaaaat 1200 gcttcaataa gcaattacga acgtgcttga aacaagaraa aagtagaaag cctcagcaaa 1260 gaaatagaaa gtctcagcaa agaaatagaa gatataaaga agaaccaaat ggaaatttta 1320 gaactgaaaa atacaataac cgaaataaaa anctcaatgr atgggctcaa tagcagaatg 1380 gaggggacag aggaaagaac cagtgaactt gaagatagag caacagaaat tacccattct 1440 gaacaataga gagaaaatag attggaaaaa aaaatggaca gagcctcagg gacctgtggg 1500 actataacaa aagatctaac attcgtgtca tcggagtccc agaggagagg aaaaagagrr 1560 tagtatttga agaaataatg gctgaaaatt tcccaaattt ggca 1604 // ID RLTR22_MM repbase; DNA; ROD; 573 BP. XX AC . 
XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 14-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; RMER3; KW Long terminal repeat; retrotransposon; RLTR22_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-573 RA Jurka J. and Drazkiewicz A.; RT "RLTR22_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 7-7 (2002). XX DR [1] (Consensus) XX CC 69% similar to RMER3 (bases 154-444). CC 82% similar to RLTR22A_MM (bases 1-557). XX SQ Sequence 573 BP; 168 A; 79 C; 175 G; 151 T; 0 other; tgtagcagga ttttccctgt ccaattaatt taaaatatta gtgcagcagg aggcctgtga 60 ttggacagtg gaaaagggag gcggagctaa gagttgcaga gacagagaga gagacagaga 120 gaggagagga gagaaggaag gaagatggag gaagaggaag atgatccaga tcctgcatgg 180 ctttaaatag ccacaggtag ttatgaatat catataaagg atagaataat tgtaggataa 240 tttgtctaat ctaggtgggc agcttgtatc attatcaatt ggctctgaat ttattgtgtg 300 ggcattttgt gaattgagaa tttattgata tataaatctg actgattaat tataagcttc 360 tagagttttg attttaccag gttactggga tttgtgacag ctaaccacag ggggtggatg 420 gctgggaagt atgagcagga tctgcggcaa gggaactgcg agatgggcgg tcgctgcttg 480 gggctagcca cagaggtgga gagaccaccg gggccagaga gtagctgggt gtagcgcggg 540 aacttgcctt tttttttaat atttcccgca aca 573 // ID MER70I repbase; DNA; ROD; 5023 BP. XX AC . XX DT 29-AUG-2000 (Rel. 7.4, Created) DT 29-AUG-2000 (Rel. 7.4, Last updated, Version 1) XX DE MER70I is an internal portion of the MER70 endogenous retrovirus DE - a consensus. XX KW MER70A; LTR retrotransposon; endogenous retrovirus; env; RT; KW MER70B; ERVL group; int; MER70I. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-5023 RA Kapitonov V.V. and Jurka J.; RT "MER70I."; RL Direct Submission to Repbase Update (AUG-2000). 
XX DR [1] (Consensus) XX CC MER70I is an internal portion of an endogenous retrovirus flanked CC by the long terminal repeats MER70A and MER70B. CC About 100 copies of MER70I have survived in the human CC genome. They are ~80% identical to the consensus sequence and CC belong to two major subfamilies. CC MER70I encodes the reverse transcriptase, integrase and env CC proteins. It is related to the ERVL, HERVL, MERVL-like CC retroviruses: CC MER70I 1019 1729 ERVL 2304 3014 d 0.61 CC MER70I 1730 2285 HERVL_40 2374 2929 d 0.62 CC MER70I 2334 2725 HERVL_40 2982 3369 d 0.63 CC MER70I 2733 3028 HERVL_40 3770 4068 d 0.61 CC MER70I 4149 4264 MER51I 6384 6499 d 0.65 CC MER70I 4474 4638 MER57I 6517 6678 d 0.65 CC Its env-encoding DNA portion (positions 4180-4780) is similar CC to env-like portions of MER57I and MER51I that belong to the CC MER4I-group of retroviruses related to the ERV1 class. XX SQ Sequence 5023 BP; 1261 A; 1259 C; 1377 G; 1126 T; 0 other; aaattggcgc agcgagcagg gtccggtctg acagaaccat ggcttattgg cgaagcggga 60 ggggcgatac cccccagtac ccggcccagg gcggagacat cagcctgggg gtgcaggggc 120 tggacatgca gctggatgtt ttatggggaa ggaggggtga taagtgggac atagtctccc 180 cacctccccc gcagatgtga gtgagctggc tgcatggact gaggcagaga gaaaggaatg 240 tggaaacagg agagagcagc gcacgatccc ctggttgttg ggcacttatg tgatcgctca 300 ttcctggggc cggagagctc gggaagtggg tactcagatg gaccctgagg cacctgagag 360 ggaattagaa agaggtcctc agatggtccc tgaggcatct gagagggaac caacacgtac 420 aatgcgggcc ttcacagaaa gggagactca ctatctaaag gacagatatg gagaaaggcc 480 agggaagtct tatgcagcct ggttggtgtg tttgtttgat aaagggacat tgcagatcca 540 aatgtctcag gctgaatggc gcagttagga atggggccaa ggtctccggg tccccaccgg 600 aggccagtgg ccctatactc ctgtcaagat aaaaggagaa aagaaaggaa ttcagacttt 660 tctggggttc ttggatactg gagcccacat gacaatattt ccaggtcccc ttaggggaaa 720 aattaaactg atgacatcgg gaggtttggg gacaaacatg gtgacccatg gtgcttattt 780 gcttatggtg cttatctgct tgtgggtggg gccctttggg ccatttcggg tgccagtgac 840 catggttccc accgctgagt gcattatagg cattgacatt ttggctgctt 
gtggcacaga 900 acatcaccgc tgcctgaggg ggtatgcccc ctcacagcta agaattcgag ccataacagt 960 ggggcatatc cactcctgcc tgccacctaa gctacccaag tcccaatggg ttattcaaca 1020 aaagcagtac tgcatactaa gtggagaaaa ggacattact ttgttaattc aggacttgct 1080 acagataaaa atgttacaaa ccaccctgtc acaatgtaac agcccagttt ggctggtcaa 1140 aaaggccttc ggggcatgga gactaacaat ggactgtcgc aggctgaatg ctgtagtaga 1200 cctattgaca ctcgtggccc agatatcacc acagtaattg aacacatcat ggaggcttcc 1260 aaccaatggt atgatgcagt tattgatctg gctaatggat tcttctcaat ccctttgagg 1320 gataagggca gagatcaatt tgtattcaca tggcaaagta tacaatatac atttacagtg 1380 ctgccacagg agtatttgaa ctcacctgcc atatgccacc agtgggtagg atgggatttc 1440 gccactgtgc ttttgcctaa agtggtcatg tgcattcatt acataggtga catccttatt 1500 gtggcccttg atgatccgat cacacaagag gccttggact tgatggtcac agggacgtga 1560 caagcagact gggaagttaa ccctaacagt cctgggatca gccaaactgg tgaccttttt 1620 caaggccact tgggcgggaa gccaaagaag tatcccagat acagtcaagc aaaaattgtt 1680 ggccctggcg gcatccacta ataaaaagga ggcccaacag ctggtaggcc tctttgggta 1740 ctggagacag catatacctc acctgggtgt tcttttggcc cccttagtca aggtgaccaa 1800 caaagccgcc aactttgaat ggggcccttt gcaacagcag gccttggaag ccattcaaca 1860 agtcgtggcc caggcactgc ctttaaaacc tttacagcct gctagcccga tggaattaca 1920 ggtgtccgca acctccatgc atgctgattg gagtctgtgg caacagaaaa ctgccactgg 1980 ggtgcaccag cctctcagat tttggacaca taagttgcct gaggcagcca ccagatatac 2040 ctcttttgaa tggcaactcc ttgcttgcta ttgggcactg gtggagactg agcatcttac 2100 ggccggagcg ccacgtgtga cgctgcaacc tgaactgccc attctcactt gggtgcttac 2160 aaaccccacc agtaaaattg gacaggctca acagagctca attatcaaat ggaaatggta 2220 cattcaagat cgggcccagc caggacccca agggaccagc gggctccatg aacaaatggc 2280 tagcttacca gaagggacca agcgacccgt aggggatgct ttggctcctc ctgtggctac 2340 ctggggccca agattcagag acatgcctac cgacggtatg gcatggggtt tactgacggc 2400 tctgcgaaac aacaagccag tgggtccact gggctgtggc caccatccag ccagtggatg 2460 gccatctttt gactgagact ggacatggac gttctgccca atgggccaaa ctacatgcag 2520 tggtgatggc catgcaggcc gcccctacca ccatatcttg ctacattttc actgactcat 
2580 gggccattgc caacagccta gccatctggt caggagaatg gcaactgagt gactggacta 2640 ttaaaggatc ccctgtgtgg ggacaaggac tatggcaaca gcttgctgcc tggaagggac 2700 aaatatatgt cactcatgtg gatgctggga ctaccatggc cacccttgag aggaatttat 2760 gtcatgtttt tggatacccc atgggacttc actctgacca aggaacatcc ttcactgccc 2820 aagcaacatg acaatgggca cactctcatg gaacacgatg gactttccat gcaccctgtc 2880 atccacaggc caatggagct attgaacgat ggaacagccg actcacacag caactgaaga 2940 aaggacatca agacggcctg ctagtggggt ggtaccccca tctgactagg gcaatatgga 3000 cactaaacac tgcactccaa tgcaagggaa acacggcact gcagcgcatg ttgagaaaca 3060 ctgagcttgg tgggggtgga ggtggaccag gcagccgcct gattaggctg cgcctgcgaa 3120 atcccaatct cagtgttccc aaccattctt tttccttttt ccctttacag tgcacgtcct 3180 gggggtggtt tgcggttcag gccgccatag taccccagac agggcccccc gactctaatc 3240 tggaggcgat gcttcccctg ggtgcctccc ttctatggga tcccacagga gtgggggacg 3300 ggaccaagga ataccagggt gctagggtcc ctctagtggc gccggggaca tctgatcctt 3360 ccagtcgggt gggtgatgtt atcagacatg tcaagtttgt acaggacgtg acctcccttc 3420 ctggactgga cgactgaggc tgaaaggtct gggtcaagca acaagggcaa tggtgcccac 3480 agaggtagta gcctcaggac agggacagac agactgggtc gctacaccaa ctcagcccaa 3540 cccctatctg ataggtaggg aacacctgag accctggaag ggatgggggt ggggcactaa 3600 cctgtcagtc tgctttttcc acaaggacgt ggcagcagag gcctggaagc ataacgcctt 3660 tgttaggctc ttccaagctg tggccaccgc gggtaacctg acaaaatgct ggatctgcca 3720 tcccggacct cattctgtca cagaccagag ggaccctctc atcctgccag tggtaaacta 3780 caccagcatt cctaatgcca cagtgtacac caacagaacc cgagccctgg cttaccgagt 3840 gaggatctgg cacctgccac atgggaggga accggaggtg ccctgtttta acttaactga 3900 cttaaggtgg caaaatgtca cgaccacaac taacaaaacc ttggtgggct ggtactttga 3960 cgcaccacac tcctttgatt acatggacga gaagtgtccc agtggcgacg acgaaaacaa 4020 ggaccggact attgctagcc ctctgtgtag gggcttcatg aacaatattg tatggggaaa 4080 actgagctca tgcaactatg ccatcaatga gacttggctg gtgaatgcca atgcctccat 4140 acccatgaat gggtcactga acaataaaat gggaaagggt gtgctgtgtg cacccgaggg 4200 ctacatcttt ctctgtgggc ggtccgggag tgacccaaat acgggatggg caatgtcatg 4260 
cctggaaagc tggcggatgg tgggatcctg cacgttgggc gtgctggggg tgcccctgga 4320 tatcacccct gggaatgaga tgcaccattg ggccagcagc ctaaagctgt acaccaggct 4380 tactagggac ctgccaggag gtgtaactga ctctgggttt atgtccttta tgagatcttt 4440 ggtaccatac ataggagtca gtgctcatga aaaaatgata agaaacctgt ccctgaccat 4500 ggcagatatt gcttcctcca ctgccactgc cttggcagcc cagcagacat ccctcaactc 4560 ccttgggaag gttgttttag acaacagaat tgctctagac tttcttttag cccaactggg 4620 aggagtgtat gcaattgcca acacctcctg ctgtacctgg ataaacacct caggtatcgt 4680 agaaacacaa gtagaggaga tccggaagca ggttcactgg ctgcagacag tggggccacc 4740 tgaaggatcc ttctttgacc tctttagcaa cttcttacct ggatcactgg gatcctgggc 4800 taggtcactg ctccaggcag gcctgatcat cctgcttgtg gtagtagtcc tcctgggccc 4860 agtgaaatgt attctggcta tggctcaatg atgttgcact gagattgtgt cagtcaaggt 4920 gctacatcaa tctgacaaga caaacctctg cctccagatc cggggaggtc ggtgggcata 4980 tgaaatggac tagctttgct aagggggata tctgggttgg ggg 5023 // ID ERV2A2-CPo_LTR repbase; DNA; ROD; 420 BP. XX AC . XX DT 21-OCT-2009 (Rel. 14.11, Created) DT 21-OCT-2009 (Rel. 14.11, Last updated, Version 3) XX DE Long terminal repeat of ERV2 endogenous retrovirus: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW ERV2A2-CPo_LTR. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-420 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from guinea pig."; RL Repbase Reports 9(11), 2870-2870 (2009). XX DR [1] (Consensus) XX CC ~94% identical to consensus. 6bp tsd. 
XX SQ Sequence 420 BP; 78 A; 148 C; 85 G; 109 T; 0 other; tgtagggagc ggctcagaat ggctgctgct gtcctgcccc tcccacctga gggcattttg 60 tgtgtcagcc tacatttccc atgatcctcc ccttgcctac agtgcgcatg acgtgagcac 120 cttcccgctt cagcctacat ctcccataat cccctcttgt tgacatcacg tccgcttacc 180 tatgtgccta cgtatgactt cctgctagtc tgataggccc acggcatatg agaagatcca 240 agctggccca tcctgtaagg ccacgtcccc tttgactcta tataagccat gtacacttcc 300 tgaataaacg agactcgatt ggaatctcat cctgtctcca tctctctttt ctcttgccca 360 gccctccatc cccgtcccct ctccaggcga ggcccacgga ccgatccgcg ggccggatca 420 // ID MER60 repbase; DNA; ROD; 837 BP. XX AC . XX DT 14-MAR-1997 (Rel. 6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE 5' part of L1M subfamily. XX KW Repetitive sequence; LINE; L1M subfamily; MER60. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-837 RA Kapitonov V.V. and Jurka J.; RT "MER60."; RL Direct Submission to Repbase Update (1997). XX DR [1] (Consensus) XX CC MER60 is the 5' part of one of the L1M subfamilies. 
XX SQ Sequence 837 BP; 294 A; 153 C; 177 G; 188 T; 25 other; tgcagrttgc tagtgccaga gggagcaaag aattgtgatt gtggatttta attgctggca 60 gctccctgaa ggactcacaa gtcttttttt ctcggctgaa gtggttcaaa aacatttaaa 120 ggcacatttg ctggagcaag ggacaacatg rgrcaagcaa yagatagacc aaaaagccta 180 agaaggaaga gmtgagaaat aagatgcttg gggaaawaag ggctttgaaa agtccacata 240 ttcctgggaa tctagaaggc catgcacatg tccagggctg gacacatgct cagaaaagac 300 ctaagaaggc cctaagcttt cacctctggc tgaccttnag actctgtaca agcaggaagt 360 gaaggctaag gcagagttgt aaactgcctt ggctgagtgt tgaaggaatg ccycaacaca 420 cagmcaatct gcaaagacta ggagactttt tttttgtgtt tggtddtktt gttgttrttc 480 caggtattta aggaaayctc tgtcaawyac tagctgacca ctaagctaay cgarcagaga 540 cttcagtggc cacacatgac aaagaataca gactttacaa aattagttta gaaaagtcac 600 taaacaaaca ayacaacagc amayaayaag caacaacaac aaaccctgga aaggggagag 660 aatctgattt ccagagttgc cacattataw tatttaaaat gtccagtttt ttttttcaac 720 arcaacaaaa attatgaggc atacaaagaa acaaagaaaa gtatggccca tacacaggaa 780 aaagaaatta atagaaacta tccctgagga agcccagaca ttgracttac tagacaa 837 // ID L5 repbase; DNA; ROD; 2265 BP. XX AC . XX DT 06-OCT-2006 (Rel. 14.07, Created) DT 21-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE RTE Non-LTR Retrotransposon from mammals. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; LINE; L5. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-2265 RA Smit A.F.; RT "L5 - RTE Non-LTR Retrotransposon from mammals."; RL Direct Submission to Repbase Update (21-JUL-2009). XX DR [1] (Consensus) XX CC 35% subst level in borEut13. ORF from 6-2072 encodes the CC C-terminal half of a pol protein 59% similar (42% identical) to CC that of L4 (probably more as L4-pol still has many ambiguous CC residues). The ORF starts at a position matching ca. pos 500 of CC the complete pols of Expander etc. 
XX SQ Sequence 2265 BP; 677 A; 522 C; 409 G; 633 T; 24 other; tttaaaccaa antagagcca natanacgaa aggtttggca naatcattac agtaagatct 60 ttggggttct ctcttgggca tgtagtcaaa attttntaat agacataacn aatttgcctc 120 agtggnctcc agttaccaca ggngagatta aaggtcttag tgcatctctt tcctctggca 180 aggcaccagg agaggatgtg ttgcccccag agntatttaa acaatttccc gactggtggg 240 cnccaattct ggctaaattg ttcacgcaga ttagcaaagc aggggtttct cctgctgagt 300 ggaaacagaa tattgtcttc ccaatattta aaaaggacaa naaacaggac ccaggtaact 360 atcgcccaat aagtctcttg gatgtagcct ccaagttata tggcaaacac ttattgaaca 420 agctagaaga ctgggaaaaa tccaataacg tcatccaccc tgaacaagct ggttttagaa 480 gaggacaatc aacaactgac cattgcgtaa ctctccgtta cttagcncaa caaagcatat 540 gnagncctnc taaatacctt tacgctgcat ttgtagatct agcngcagcc tttgactcgg 600 tcaacagaaa ccggctctgg cacaaattan ctggcactaa cattgacagg cgtctcttat 660 ttctgcttca gcagcttcac agcgacancg ccgccagaat aaaagcaggg atttccggtt 720 cttcgacaga ggtgatctct attgaccaaa ggatnaaaca aaggcgtctc ttagccccac 780 ttctattcaa tctttacctt aatgacataa ttaaaangtt atctggccca gaattttttc 840 ttctctcaat tggctctcgc aaaatctcta tccttctgta tgctgacgat atagtcctac 900 tatcctctac tacttacgta ggtctcaaaa agctactgtc caaactctat gatgcgttaa 960 aagaggaatc tttaaatatt aattattcaa aaaccaaagt gatgattttn agaaagaaac 1020 ccagcaantt tcgatgggct ataaataatc agccgatcga tcagtgctgg gtatttaaat 1080 atctaggcgt ttattttaat gaaacgcttt cctggaaatc acacaccaag atagtaaagg 1140 ccacagtcac taaaaccata ggagccatac tgaaattcta tcgcactaaa ggtggccact 1200 taattgatcc tgcnctaaaa ctcttccata gcaaagcagt ggcccaaatt ctttatggag 1260 cagaggtatg gggctgggac gatacacaga ttacaaatct ggaaacctta caaaacagtt 1320 ttcttaaaaa catcttacat ttgcccccta gtatcccggc agctctaatc cgggcagagg 1380 ttggactccc ctcaattaga gcccatgttc atgtggccat aatcaaatac ttaaagaaac 1440 tgaaagtttc ccctgagaac catctgtcaa aattgtgcta tgtccagcta cagaactgta 1500 aggactgggt ttataaatac catagactgc ttcaactcta ttccatctct gaggactcac 1560 aggtcatttc tcaagcaggc accaatctac gcgactggat cttcgatcaa aacgctntgt 1620 ccgacaggct ggctatttta gacacaaatt 
tttccaggcg gtataggata attaaaagtg 1680 atcacagtag atcattctac ctggtcaatc ttacctttcc taagctcaga caagccttta 1740 cggcaatctg tttccaaacc atgcccactg ccatgatcga aggcagatac cgccggactc 1800 ctctagccca gcgaacctgt atctgtggag cccctgagct agaggacctt ccacattacc 1860 ttttattctg ccccatgtat ctggaaccac gccagaagtt tcttggggta attttaacta 1920 gtattcactc tagaacaata ccggaaaagg taagttactt actgtccgac acggacacgt 1980 acgtaacttt tagagtctct tattttgcac tggccgcttc aaaaatcaga gctaaagcta 2040 ttttagacac tacagttaaa acgacacggt gatgtattgg tctgatgatt ccttttatga 2100 aatcaactct aaaatgttac ctatttttac gctcagaaac tcactctaac tttgttctgt 2160 tttgatcttt ataacttatt ttaaagatca cattgtacct actctttgct ctgtaagtct 2220 tgcaatggcc tttggccaaa agcaataaaa ttctgacctg acctg 2265 // ID LTR7B_Cpo repbase; DNA; ROD; 417 BP. XX AC . XX DT 21-OCT-2009 (Rel. 14.11, Created) DT 21-OCT-2009 (Rel. 14.11, Last updated, Version 3) XX DE Long terminal repeat: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR7B_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-417 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from guinea pig."; RL Repbase Reports 9(11), 2877-2877 (2009). XX DR [1] (Consensus) XX CC >91% identical to consensus. 6 bp TSD. XX SQ Sequence 417 BP; 87 A; 124 C; 77 G; 129 T; 0 other; tgtgggaaac cctgcaattg gccgtcatct tatgttaata agacaacagg catcttaatg 60 ttaagtttct atttagtagt taagtttcta tctctcctca cctagcctgt cacatgtccc 120 tgtggttcgg cctgtacagc gattatgcta attagcctgc cttatgggcc cgtgcgcagg 180 agactcaaat taccctataa cccccggtct cgctagaact tcccactgtt gctaagttac 240 ctgctgacgt gtcagaaccc agccccctcc ccaataccct atatatttgt tactctttct 300 ttgaataaac gagacttgat cagagcactg tcttgtctcc attcttcgcg tctcttgtct 360 cttctcatcc ccactccccc tctagggtct ccgtggactt acccacgggt cgggaca 417 // ID LTR6B_Str repbase; DNA; ROD; 338 BP. XX AC . XX DT 19-OCT-2009 (Rel. 
14.12, Created) DT 19-OCT-2009 (Rel. 14.12, Last updated, Version 2) XX DE Long terminal repeat of retrovirus-like element. XX KW ERV2; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR6B_Str. XX OS Spermophilus tridecemlineatus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Sciuridae; Xerinae; Marmotini; Spermophilus. XX RN [1] RP 1-338 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from the thirteen-lined ground squirrel."; RL Repbase Reports 9(12), 3069-3069 (2009). XX DR [1] (Consensus) XX CC >98% identical to consensus. 6bp tsd. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project. XX SQ Sequence 338 BP; 77 A; 109 C; 69 G; 83 T; 0 other; tgccagggac caaaagggag ttctccttca aagaatagcc tgacagttac aagaaacagc 60 cccctaggag acatctcgcc tgggagacac cctgcagcca tccatctccc tgagacatcc 120 tgcgctatct caccttgtga agacttctag caactgccga taagagagct ccccccatcc 180 ccagcccata aatacccctg tgtgaacaat aaaagtttgc agcttgatca gaacttttgt 240 cttgctgtca cccttcgtgt ctcttgtccc ttcattcctc cccatctagg ttcgctgccc 300 acgttgatgt gtcctgccgg tcgggacatt tggcgcca 338 // ID ERVB4_3-I_MM repbase; DNA; ROD; 8574 BP. XX AC . XX DT 11-FEB-2004 (Rel. 9.01, Created) DT 11-FEB-2004 (Rel. 9.01, Last updated, Version 1) XX DE Mouse endogeneous betaretrovirus ERVB4_3 internal sequence - a DE consensus. XX KW Endogenous Retrovirus; Transposable Element; pol domain; KW endogeneous betaretrovirus; MmERV-B4_AC124523; ERVB4_3-I_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Baillie J.G., Van de lagemaat N.L., Baust C. and Mager L.D.; RT "Multiple groups of endogeneous betaretroviruses in mice, rats, RT and other mammals."; RL J. Virol 78(11), 5784-5798 (2004). 
XX RN [2] RP 1-8574 RA Gentles A. and Jurka J.; RT "Mouse endogeneous retrovirus ERVB4_3 consensus."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC There are 14 copies of the internal sequence of this ERV in the CC mouse, and 56 copies of the corresponding LTR. XX SQ Sequence 8574 BP; 2409 A; 2011 C; 1824 G; 2309 T; 21 other; tgctggatag gtcatagtgg cgcccacacg tggagctcga ggtacgacca ccctctggac 60 aacgataaga ttaaggtacc gccagccagg gtagagacct cctcagaacg catcgctcgt 120 gcaacgnggg gagtcttgag gtaagtctag atgttacaag gtcgtcccat tcgccatggg 180 gaaagcgctt tcaaaggaaa caatttttat aaaagagata aagggtttac ttagggagag 240 aggaataaga gtcaagaaaa aagatttgat taaattttgc ttcgtggaca ctaaatgtcc 300 atggttagtt ttgagtggac ctgagattca tccacttacg tggaataagg ttggcaaaaa 360 tattaatgat ttaataaaga agggtgaaaa tatccctgaa cctttttttt tcagttatta 420 tggaatcatt agagacctct taaaagatgc agaacaagga ggggagggag ccagactgct 480 ggcactcact gaggattttt tttgcagctt cttgctcacc atgggtaaaa ggtgaaacta 540 aaaaggacta aatgatggtt caagctcctc ttcaatcatc ataaacatgc cagatcctac 600 aagtagtttg ccagctcctc tctccccaca aagcctgaag gggcccctgg atgaatcaga 660 gaaaattgtt agaccaaaga atatttatcc agtcttacat agagtcttaa agactgtaat 720 gctccccttg acccatcctc tgaggcagaa ctagaggagg aggctgccaa atatgaggag 780 gagagatatg gtcctggctc cgaccatatg ctttgtgttt ctaacaacaa aaatccagcc 840 ctaggtaccc tcctctaccc cccttccact cggggcccca ctccctttcc ctctctgcac 900 ctctatccct ccctcagcct gccgcctcct gctcccttac cagggacgac tgtccatcat 960 ttgctccaca caaaggacgc tctggtggcc cacatttctg gtctaaaaga ggttttatac 1020 ttgcaaaaag aattaggaga cctcacttta gaaattcaaa atttacaggg ggccctctct 1080 ggagccccaa ttgtcaagaa acagtcggag caccctaaag tcataaagga ggtttctcct 1140 ycrgtctcca cggagaagat caaacataaa tattcagcca aggacaaggc acataaaaaa 1200 ttggccttcc cagtcctgac tcagggcacg ctccaaacag ataagttatg accctgagcc 1260 aactaatgat gaacgagtgc cagatacctc gggacaggca aagggagaaa ggaaagtgaa 1320 agtgaaagtg acgaatttga kcaattagcc tctgactccg aggaagaaga ggcgataaaa 1380 gatgaaagtg 
attatgaggc ttcacagcct gtttataaga aattaaagat caaacatatt 1440 aaagaattgc attcggctgt taaaaattat ggggttaatg ccccttttac tgtatcaatc 1500 ttggagggac tagcaggaga tggttactta acacccaatg aatggagcaa agtggtacag 1560 tcagtcctca ctagagggca gtatttaact tggaaatctg aatttgtgga tagagctgaa 1620 acccaagctg ctattaaccc tcagtgaggc ttattcttgg acctctgata aaatatgtgg 1680 taaaggtccg ttcgcctctg ataagaaaca attggggctt tctcctgggg tgcttgtgca 1740 gaccgcccag gcggccttag cagcctggag agccgtaccr gccacaggag ccctcaccac 1800 cccgttgact aaaattattc agggctccca agaatcctat gcccaatttg ttgctagact 1860 acaagaagcc gctgaaagaa ttctcggccc ccatgaaaat gaagggcttt tagttagaca 1920 acttgccttg gagaatgcta attctgcttg caaggctgcc ctgaggggta agactagagg 1980 cttagacctc acaggaattg ataaagctct gcagtgaggt agatacattt tcacatcaag 2040 tctcaaaatc tattaacctg gccatcggag caggtcttcc aaaaagctgg tggaactagc 2100 cagcctcagc agagagtgtg tttcagatgt ggactcctgg gacattttgc ccgcgaatgc 2160 ccgtccacca gaccaggcaa gtctgccacc acaccaggca cgggccaagt tacacaggga 2220 gccatgcctc ctggcctctg ccctaaatgt aagagaggtc gccactgggc acgagattgc 2280 agatccaaaa ctgatgtaaa tggccgcccc cttactgcgg tccagggaaa ctaccagggg 2340 ggcccctgcn tcggggcccg ctccccctga tagcaccccc aaaacaatcc atcgctgatt 2400 ccctccagtc tccccaacgg ggattgtgca gggctacagc agggagcgca ggattggact 2460 tgtgttcctc caccaccagg attataactc ccgaggaagg gacattagtt attgagacag 2520 gggaattcgg accccctaac cccaaaatat gttttttctt attattggcc gagcatcagg 2580 ttccctacag ggactcgtgg taacccccca ctgtggtcga tgctgactac caaggagaaa 2640 taaaaattct ggttacagcc acacatgagg gccccttaca ttgagagcag gagagcrcat 2700 cgcccaagcg ctgccgctcc cgttaattgg acaatttcca catattagaa aaaatcggtg 2760 ggccgtcctc cccagggtcc tcaggatgtt tattgggtac aaaaattaac tgactcccga 2820 cccatgttga ccctgttttt agatggcaaa caatttcagg gacttctaga tactggggca 2880 gatgcaacag taatttcttc atygcactgg cccactgctt ggcctttaga acctactgct 2940 acccatttaa aggggatagg ccaaactcag gatactttac aaagttcaaa attgttaaca 3000 tggtcagata aggaaaataa tactggaact gtccnggccc tttgtagtca ggggccttcc 3060 tgttaattta tggggaagag 
atatactctc tcagaatggg agttataatg tayagtccca 3120 atgagactgt cacaaattta atgctaaaaa caggggtatc tccctggaaa aggcctaggg 3180 aaaaatgaac aagggattgt tcaatccctt tgtccctgta cccaaaagag acaaaaaggg 3240 tctgggagca gacctttttt tcctagagac cactgctcct cctgcactcc aggcagataa 3300 aatatcttgg aaaactaatg atccagtctg ggtcgaccag tggtctatgc ctcaagagaa 3360 aggtccaggc agccttacag ttagtgcagg aacaattgag gctgtcgcac cttgaaccat 3420 ccacctcccc gtggaataca cctatatttg ttataaaaat agaaaaatgg gacttggagg 3480 ttattacaag atctcagagc tgttaataag actatggtgc caatgggtgc cctacagcct 3540 ggtctaccct ctcccattgc catacctaag ggctatttca aaattgttat agatattaaa 3600 gattgctttt tttctctatt ccccttcatc ctcaagattg tgtccgcttt gccttttcca 3660 ttccaattgt aaatcatgtg ggaccaaatc ctcgcttcca gtcagtggtg ggttttgcca 3720 caaggaatgg ctaatagccc caccttgtgc caaaaatatg tggcccagat cattgatcca 3780 ataagagggt gctttcccac tgcctatatt gtgcattaca tggatgattt actaatagct 3840 accaaagatt tacaacaaac ccatgagatt gcccaaatag tagttgctgc cctacaaaag 3900 agaggttttg taatagcccc agaaaaaaat acaggttcaa tatcctttca tgttcttggg 3960 cttccaacta gaacctgcaa ctgttcactc acaaaaattg gccattcgaa cgtctcatct 4020 gaagacctta aatgattttc aaaatattat ttgggagata ttaattggct acgtcctata 4080 cttaaagtta acaacaggag aattaaagcc cttgtttgat gtcttgcggg gtgattctga 4140 tcccacctcc cccygaagtc tgaccataga agcacaaaga tcactggccc gagttgaaca 4200 ggccattagt caacaggtta tgggttattt tgaccccaca cagcctttat ttattcctga 4260 tcatcttttc aaccactttt acacccacag gccttctctg gcaaaaagac agccccctct 4320 tctggatcca tttaccagcc accccctcta aagggttctc cctacctttc cactcttggt 4380 gtgccaggtg atatttttag gtttgaaaaa tggccacccg ccattttggc cgagatcctg 4440 atgtaattat ttctccttat tcctccaagc accttgcttg gttacagtcc cgatttaatg 4500 attcgggcca tattgctgtc catctaccaa gggacttttg tatacccatc tccccaggta 4560 acaggttgtt acaattttta caggtgactc ctttttgtgt ttcccaaaga ttactccatt 4620 cagaccctat ttcagaggca tggcattaac tgtgtttata gatgggtcaa acaaaatgga 4680 aaagcccact gtggtcattc atgggcaaat tcaagtcatt gatactatct ataacctctg 4740 cacaattagt tgaactttgt ggagcgttaa 
aggtgtttga gcttgttgca ttaccattta 4800 atctttattc agatagccac tatgtggtta gggcccttca agttttggag gtggtcccta 4860 gtatccaacc cttaactgcc acctttcaga tgttttttaa gattcagatg cttattaggg 4920 ctcatgctca tccctttttt gtaggtcata ttcgagcaca ctccggtcta cctggcccat 4980 taacagaagg aaatgattta gctgatcaag ctactcggcg tgatgtgcct tgcttccctc 5040 tctgatcccc tctcagaagc acaaacagcc catgccctcc atcaccttaa tgctcatacg 5100 ttaagactca gatataagat aactagagag caagccagac aaattgtaaa acaatgtaag 5160 aactgcctta cacttcttcc agagcctcat ctgggtgtca accccagggg acttatacct 5220 ggtgaattgt ggcagatgga tgttacacat gtcccattcc ttggaaagtt aagatttgta 5280 cacgttactg ttgatacctt cagtggtttt atttgcgcct ctgcccatat gggagaagcc 5340 actaaagatg tcatcaatca tttattgtat gtattttcag taatgggaca gccaaagatg 5400 attaaaactg acaaatggtc ctggatatac tagccaaaag tttaaacaat tttgctcaca 5460 attacagatc aagcatatta caggcattcc ttataatcct caaggacaag gaattgtcga 5520 aagagcacat cacatcagac tttaaaaaaa taccttaatc aagctggcta ctcaggaaac 5580 tatctattcc tttaaaggaa attcaaaatt gttattgtct catgcactct ttgtgctaaa 5640 tttcctgacc cttgatatgt cagggcrctc crcagctgat cgccttgtgg ccaccctaag 5700 acaagttcaa gctatgctca agttttgtgg aaagatccct taacgggaat gtggaatgga 5760 ccggatccag ttattatctg ggctaagggc tcagcttgta tctataatac aaaagaaggg 5820 ggagcgaaga tggctccctg aaagattaat aaaaccttat aataaattcc agggtaatgc 5880 ctgagaaaga attattttct cttacaggag gaagtatcag gaccgtgacc atgatgaacc 5940 tgatgatcct ggcctggata ctattcctgt tctccatcgt gaagaccacc aataatcagg 6000 gcctatcaag tttttaatta cacctgggtt attcaaaatc atgctggaga catagttaat 6060 tccagctcca aaattgatgt caagccccat tggcccgatc tagaagtaga tgtctgtgtc 6120 cttgcacttg gtgcagacgc tgcctggggc acaccctcat attttttctc cctcaatcta 6180 agcctattaa tagccctgat cctgactata ttaggacata ttgcggcttg taattcatat 6240 attaaaygag cctctatggc tgatccatac cagtggattt tatgtgtgtc caggaaaaca 6300 tcgtgataaa tccctccagt ataaatgtgg ctatcgtgat tcatattttt gtaagtcatg 6360 gaggctgtga aactacagga gatgyacatt ggaaacccac ctcctcttgg gaccttatca 6420 ccgttagacg gaagtacctt cccattatat ttacaacaga 
tggctaggaa taaccaacca 6480 ctggtcagcc tcttaagacc ctgtgcaaag atgactggtg tgtcccactc cttattaaat 6540 ttatggaggc gggaaaacga tacacccaca ttggaatata ggaggagaat gggggctctt 6600 tacatcgggg atgtgatgtt atggaagtcg gatgtaaaga tacctggcct tattttttca 6660 gataaaatta ttaaaagaag ttcctaataa acataaggca tccataggac ccaaccccca 6720 attacatcac ccccccagcc aagccgcaac caacagctag ggttgccccc ctccacagtg 6780 gaatcaactt accccgcatg ttcagatgag cactccttat cagcctacca tgcttcctgg 6840 ccctaagccg tctactgaat tattgctttc ccattttaaa tgcctcagcc cttgccctaa 6900 tagataaaag acaggaggat ggttcccctg attttgagga gtgttggatg tgtttctctg 6960 ccacccctcc cttctatgag ggaatagccc tgttcaataa ttttaccctt cttaatgatg 7020 cagaacaatt actctttgag cccatacaaa ttaccctaac tgaagtttct ggaattggaa 7080 gttgtgttgt ggggcctcat atgatcctgc ccctccaact tcaaaatata tgcaatgata 7140 ccattgtggt aaataatacc ttataaatat ctattggcac ctaatgatac tttttttagc 7200 atgttcctcg ggcttaaccc gatatttagt caatgaagat tttataaaga gaaaagatta 7260 ttgtgttttg gtccaattgt ttcccaatct tcggattcat gattcagatg atttgttagg 7320 gttttgggaa aggggcacag aattgccccg caggagaaaa agagagcctg tcactgcggt 7380 gacccttgcg tngggtcctc ttgggattgg gggctacagg gacaggcact ggcattgctt 7440 ccttagttac ttcccagcaa aatgtgcagc gctatcatga gcttaatgca gctattagtc 7500 aagatttaga agatttaaga gaaggtatag atcagttaac agattctttg gcatcccttt 7560 ctgaagtagt gttacaaaat agaaggggat tagatttact ccttttacag cagggaggat 7620 tgtgyrcagc tctcagagaa gagtgttgtg tatatgtgga caagacagga ttagtgaaag 7680 atagccttgc caaggtaaag gctagccttg aaaaaagaaa acgagaatga gaacaacaag 7740 agtcttggta tcagaattgg ttctccacct caccctggct atccacctta ttgccttcca 7800 ttttgggacc ccttgtggga ttcctactat taatttcttt tggtccttgg gcttttcaac 7860 gattaactyg tcttgttaaa tcacagatag attctgctct ttctgataaa tctgtttcag 7920 ttcattatca tcrtctggac accgagacca acaaagaaga acctccagct gtgaccacca 7980 tcatgtgagc cctcaccttg ggggagagag gctcaacttc cataacctct taaaataatt 8040 cgagctgacc ttaagacccc tctccccttg atttggcttt tagcgttttc ttttacctcc 8100 aggcctagct aaactctagg gttgatgagg acccccttta ggggctgcac 
agaactcaga 8160 actgtgcatg ctggttcata atgacacgaa cacagcatgg gcaccagctg gcrtgcgagg 8220 ctaagcaccg caagggaagg tccaatttga ccacttctga gttccctgag agtcatacct 8280 gattaaaaag gggttggtat ctaaacttgg ttgtgatctg gtcctctgtc ctccagtccc 8340 agcctcccga gtagctgacg caatgggagt tctaaattcc catggctctg ccccgcctcc 8400 ccaaaaggta ccaagagcca caagtgtggg tcatgacagc acccacggga ggaatcaggt 8460 caatgctccc ccagccaaat ggtcaaagcc cattcatcac aagatgaaca aactattcca 8520 tcaccctgag tggacttggc cttactatat taatacagac gggggagatg ttgg 8574 // ID CHARLIE1 repbase; DNA; ROD; 2739 BP. XX AC . XX DT 09-OCT-1997 (Rel. 6.3, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE Autonomous DNA transposon. XX KW DNA transposon fossil; MER1_type (hAT) family; Charlie1; MER64. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-177 RA Smit F.A.; RT "CHARLIE1."; RL Direct Submission to Repbase Update (1996). XX RN [2] RP 1-2739 RA Smit F.A.; RT "CHARLIE1."; RL Direct Submission to Repbase Update (1997). XX DR [2] (Consensus) XX CC An apparently full-length member of the hobo/Activator/Tam CC group of DNA transposons. The coding region from bp 636-2504 CC encodes CC a transposase related to those of the other members of the CC family. CC 15-16 bp terminal inverted repeats. 8 bp target site CC duplications. CC Individual copies on average 20.5% diverged from consensus. 
XX SQ Sequence 2739 BP; 895 A; 475 C; 495 G; 861 T; 13 other; cagcggttct caaagtgtgg tccgcggacc cctgagggtc cccgagaccc tttcaggggg 60 tccgcgaggt caaaactatt ttcataataa tactaagacg ttatttgcct ttttcactct 120 cattctctca cgagtgtaca gtggagtttt ccagaggcta catgacgtgt gatgacatca 180 tcgctctgat ggctaatgga atgtgtgctt gtgtattctt gtgttttcta aaatttttaa 240 ggtagtaggt ttagggtata aatatgtagg ttttcagaga ttaactcagt ttsttcttag 300 cacttctacc gtgctcttac tagctatctt cagttatacc tgctataatc tttgtaacct 360 cattatcgtc caataaatca ttattttraa atcctaaagt tttcctcgwg cctatgtgaa 420 aacacaagaa gcaagtacta cttgtaaaac ttgcttgcaa taacattttg aaaatttccc 480 aaatgtttaa attttatcga caacatatag aatttaatat tttattctaa aataaaaatt 540 tatttataaw gtttttccta tatttaaggg tggattgttg gcttaaaaaa gggagattag 600 aagactcgct ttctcaaccc acagctgcam catctatgtc taaagatgct gaaatggaca 660 taccagcaag ttctctgatt cctcatggaa rggaggaatc tactccaaag aaactgggcg 720 aaactgtaaa taaaaaacaa aaatatgata aaagctatct tctcagcttt atagatgtta 780 ataatttacc ttattgtgtc ttatgcaaca gaacattttc gaatagtatt atggtgccag 840 ttaagttgcg gcatcatttt gagaccaatc attcagagtt taaagaaaaa ggaattaaat 900 attttaaacg tagatgtgat gagctcttta aaagccaaaa attgtttgtt gcagcttttc 960 aaactagaaa tgaaaaagcc actgaagcat cttacaggat aagttgccay attgcattgg 1020 ctggagaagc mcacacaata actgagagac taataaagcc tcgaacagtt gacattgctg 1080 aatgcctgct ggatgaaaag tcagtaaaag aaatcatagc actgccactt tccaatgata 1140 cgataactcg tcaaattaaa gatttagctg caaacatgaa gaccgagtta atatntcatc 1200 tgcagaattg tacttttgcc ttacaaatgg acgaatctac agatgtkgct agacttgctg 1260 tgttgcttgc gttcgtctgg tatcagcacc aactgatcat cgaagaactt cttttatgtg 1320 aattcttggc aacaaacaca agtggtgatg aaatattcaa agtgttgaat gacttttttg 1380 aatctcatga tttatcctgg aacaactgtg ttgacatttg cactgatggt gcaaaagcaa 1440 tggtgggtaa aactgctggc gccttagcac gaatcaaggc agtggcacca aactgtacta 1500 gtagtcattg tattcttcac cgccacgcac tcgcagtwaa aaaaatgcca gtttcactta 1560 agaatgtcct tgatgaagca gtaaaaatta ttaattttat taaatctcaa cccttgagta 1620 cacgtctttt taatattctg tgtgacaaaa 
tgggaagtac gcataaagca cttctgctgc 1680 ataccgaagt acgatggttg tctcgaggaa aagcacttgt gcgattgttt gagttgcgag 1740 ctgaactagc tgcttttttc atggaacacc atttttactt gaaagaacga ctgacagaca 1800 aactatggtt attcagactt gggtatttgg cagacatttt ctcgaaaatg aacgaagtga 1860 gcctgtcact tcaaggaaaa caactgacag tatttgttgc caatgataaa attcgagctt 1920 tcaagcgaaa attagaattt tggaaaactt gtatccgcca ccgtgagctt gacagcttcc 1980 caatacttaa agacttttct gatgagatcg gtggtgatat taacgaatgt gattttttga 2040 tattgtataa tgaaatgtgt caacatttgg aagatctgca taactcagtg aaccaatatt 2100 ttccaaatga ccaatgcatg atgttacaaa atcatgcatg ggtaaaagat ccattcaaag 2160 tgcaagatag accaatggat tttaatgtaa cagagtatga aaagttcatt gatatggttt 2220 cagattccac attgcaacta acctttaaga aactaccact tgtcgagttt tggtgtagta 2280 tcaaagaaga atatccacaa ttatctgaaa aggctattaa aatactcctc ccttttccaa 2340 ctacatatct gtgtgaggcc agattttctt catatacttc aaccaaaaca acatatcgca 2400 acagattgaa tgcagaagca gatatgagaa tccagctgtc ttctattaag ccagacatta 2460 aagagatttg caaaaatgta aaacaatgcc actcttctca ctaaattttt ttgttttgga 2520 aaatatagtt atttttcata aaaatgttat ttatgttaac atgtaatggg tttattattt 2580 twaaatgaat waataaatat tttaaaaatt tctcagtttt aatttctaat acggtaaata 2640 tcgatagata taacccacat aaacaaaagc tctttggggt cctcaataat ttttaagagt 2700 ataaaggggt cctgagacca aaaagtttga gaaccgctg 2739 // ID MER102 repbase; DNA; ROD; 332 BP. XX AC . XX DT 30-JUL-1998 (Rel. 6.5, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE Interspersed repeat MER102 - a consensus. XX KW Interspersed repeat; MER102. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-332 RA Jurka J., Naik A. and Kapitonov V.V.; RT "MER102."; RL Direct Submission to Repbase Update (JUL-1998). XX DR [1] (Consensus) XX CC Over 3000 copies in the genome. Present in Sus scrofa. CC The extreme 3' end similar to MER58 - probably insignificant. CC Potential transposable element - not confirmed. 
XX SQ Sequence 332 BP; 94 A; 65 C; 85 G; 86 T; 2 other; gggttgcaaa ctcaaatgcc tacaggggcc aggcaggtaa cataaatgag tgaagtgggc 60 caggtgggga ctgtggcaaa ctggagagca catgccctgt ctaaaggggg cagcagctgc 120 tactcagctc cagccaattg ttgccatgtg ggaatgtagg cccagtgttg ccagatcttc 180 tgatttttca agagaagcca gaaatctaga tttttatgtg aaatctcctg atttttaaat 240 gttggcaact aatttaaaat ttttawaaaa cactgtgtag gccaaacaaa acatatctgt 300 gggccagatt tagcccatrg gctgccagtt tg 332 // ID RLTR19A_MM repbase; DNA; ROD; 500 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 25-SEP-2008 (Rel. 9, Last updated, Version 2) XX DE Mouse subfamily of LTR retrotransposons - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW RLTR19A_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-500 RA Pavlicek A. and Jurka J.; RT "RLTR19A_MM - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. RLTR19 subfamily, weakly similar to RLTR33_MM CC (84% identity). Individual copies are ~90% identical to the CC consensus. CC 6 bp TSDs. 
XX SQ Sequence 500 BP; 128 A; 112 C; 98 G; 162 T; 0 other; tgttatggtc tgaccccccc cccccccagc atagctgtag ccattttgtt ccatgcctgc 60 cagctatttc atattgttgc tgtaacatgc ctgccagtca ttgacacaga gaagtgactt 120 gactcagagc aaggtcatgc tgaccacata ccctgttatg ttctgaatgt tctgtatgag 180 gtttgttaat cttaagaaat tccacaaagc tttacgtagg acccatcaaa tcaaaggtca 240 atatgaactg ttatgtctaa aatatcttga gtcagagctg accaccaggc agcacttcct 300 gcacctgcgt atgaactcat tgtggttttt gtttttttcc tttataagct gacagaaaaa 360 gatatctgtt gtcatagttc agataattct gagctatgtc cctggtctga ccagtattag 420 ggtgtgcatt caataaacta ttcttgttta actgagatca gtgttcatat ggtttgtgtg 480 gcgattcctg aaccccaaca 500 // ID CAVID2E repbase; DNA; ROD; 79 BP. XX AC . XX DT 02-DEC-2009 (Rel. 15.03, Created) DT 02-DEC-2009 (Rel. 15.03, Last updated, Version 3) XX DE tRNA-derived SINE family: consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW CAVID2E. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-79 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 10(3), 507-507 (2010). XX DR [1] (Consensus) XX CC ~89% identical to consensus. XX SQ Sequence 79 BP; 25 A; 16 C; 21 G; 17 T; 0 other; ggggtttagc tcagtggcat aagcgcctgc ttggcaagcg caaggtcctg agttcaattc 60 ctggtacaaa aaaaaaaaa 79 // ID MER74A repbase; DNA; ROD; 558 BP. XX AC . XX DT 09-OCT-1997 (Rel. 6.3, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE MER74 repetitive element - a consensus. XX KW Long terminal repeat; MER74A. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-558 RA A A.F., S B. and Hood L.; RT "Complete Structure and Organization of Three Mammalian Prion RT Chromosomal Regions."; RL Direct Submission to Repbase Update (1996). XX DR [1] (Consensus) XX CC Putative retroposon LTR; 5 bp target site duplication. 
CC Orientation unclear. Average divergence from consensus 20.5%. XX SQ Sequence 558 BP; 122 A; 191 C; 103 G; 141 T; 1 other; tgtattaacc atgtttttta ttttctgtat tcttgatgct ttgacatctg gggccttgct 60 gaccctggag ggactgcccc tcccagggct agccaattcc tagagatagc aaacgactcg 120 cctgggagcg cgcctttcat atgcaaacca accaatccag agcccacacc cccaaccacc 180 tcctttatcg ggctctcaca ctctgggcca ctatccccct gccctaatca ccccagggcc 240 aggtaccaga caactaggga cagcccctat accccagagc ccgctgaaat tattcaaact 300 agccaatcct aagcctgctt accctgcctt gcccattcct tcccatggaa accacaataa 360 aggctcttgc ccacgttttc ccgtcgctcc ctctgcctcc tgaccgaccc tggtgcttcc 420 ccgtgtggcc ccccgtggcg tggcgtgccc ccttctcttg ggawctgtga gtaacaaact 480 atcttttcaa tggcagtcgt ctcctgatct gttggcctta ccatacctga ataataataa 540 aacctacatt ttaaaaca 558 // ID RLTR20D_MM repbase; DNA; ROD; 570 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR20D_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-570 RA Pavlicek A. and Jurka J.; RT "RLTR20D_MM - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Similar to RLTR23_MM. Individual copies are CC ~87% identical to the consensus. 6 bp TSDs. 
XX SQ Sequence 570 BP; 150 A; 105 C; 194 G; 121 T; 0 other; tgtagggggt ggttctgatg ctttgatcag gaatctgcat gtgaacgctg aaggtcctgt 60 tccccaattg gttcttgatc gatcaataaa gatgctagcg gccaatggct gggcggaaga 120 ggcaggactt ccaggttccc acaggcaggc taggagacgc aggaggagga aagaggattc 180 accatgcttc ggagggagag agagccacca gccatgtgag atctcgggtg gagtggccat 240 tggccgcttc cctgactggg cctggggtag caggcgggag attagaaaca caactaagct 300 gagggcagat ttagggtgct gagctaggac taaaggtaac tgagcaacta agttgagggc 360 agatttagag gtgttgagct gggagtgaag agaagggcac gttagccaag ggaggcttag 420 aagagcccgg ccattgagct aaaaagcata ttaaaaataa gctaatgtgt gtgtgtgtct 480 ttcatccgtg gatccaaggg aacctgggtg ggggctggta gcatggtctg cctggagctt 540 aaagcagggt agtagaaact acatgctaca 570 // ID RLTR33_MM repbase; DNA; ROD; 820 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 14-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW retrotransposon; RLTR33_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-820 RA Jurka J. and Drazkiewicz A.; RT "RLTR33_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 19-19 (2002). 
XX DR [1] (Consensus) XX SQ Sequence 820 BP; 226 A; 178 C; 181 G; 235 T; 0 other; tgttggggtc caggaatcgc ctcacaaacc acatgaacac caatctcagt cagacaggga 60 tagtttattg agcatacacc ccaggactga tcgatcaggg ccatagtcca gactcaggag 120 ctgaaccgtg accctgagtt aagatcctac agggcttttt aagcctaaaa tccacaaaca 180 tctgtgccaa gttattccac caatcaggat ttagggatag gggatttcct ttaggaacat 240 gtctttgttg tacatttatt cctgttccca ttggttgggg tattcaactg tggcagggga 300 cttgccttgt ctaacattca tgtcttaact tgtctaccag gatgggactt gtctaacatt 360 catgtcttaa cttgccaacc aggatgtcag ttacccacgt acatgtctct ttctgccaag 420 taggatgtca gttcccaggg aggtcttggg aacttaaact ttactcgacc cctactcaaa 480 atggaagtct tattccaaat agtttcttat attggtgcag gtgtctctct tatgttgggg 540 tccatcccag gggcagctta taaaaggcaa ataccataaa agcttataca taggtgcagg 600 aagtgctgct gggtggttgg ttcgggtcca agcttattgt gggtaactat acaaagcagt 660 tctactgact tctatcgtga ttggtttcca agagcacaga acataacaga atatgtaatc 720 aaccttggca ttagaaagtt tcctagtaac agcagaaaca tatgagaaag tttccttata 780 atggcaggct agccaggtaa cagttaccta ggccttaaca 820 // ID L1MB7 repbase; DNA; ROD; 920 BP. XX AC . XX DT 20-FEB-1997 (Rel. 5, Created) DT 20-FEB-1997 (Rel. 5, Last updated, Version 1) XX DE 3'-end of L1 repeat (subfamily L1MB7) - a consensus sequence. XX KW Repetitive sequence; L1 (LINE) family; MER12; L1MB7 subfamily; KW L1MB7. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-920 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [1] (Consensus) XX CC ORF2 ends at bp 675. 
XX SQ Sequence 920 BP; 363 A; 134 C; 167 G; 219 T; 37 other; yttgtatcca gaatatataa agaactctta caactcaaca ataaaaaaac aaacaaccta 60 attaaaaaat gggcaaaaga yttgaataga tatttctcca aagaagatat ayaaatggcc 120 aataagcaca tgaaaagatr ctcaacatca ttagtcatca gggaaatgca aatyaaaacc 180 acaatgagat aycacttcac acccaytaga atggctaaaa ttaaaaagac agrmaatamc 240 aartgttggy raggatgtrg agaaaytgga achctcatac aytgctggtg ggaatgtaaa 300 atggtacarc yactttggaa aacagtttgg cagttcctca aaaagttaam aatagagtta 360 ccatatgacc cagcaattyc actcctaggt atwtacccaa gagaaatgaa aacatayrtc 420 cayacaaaaa cttgtacaca aatgttcata gcagcattat tcataatagc caaaaagtgg 480 aaacaaccca aatgtccatc aatratwgaa tggataaaca aaatgtggta tatccataca 540 atggaatatt attcagcmat aaaaaggaat gaagtamtga tmyatgcaac aacatggatg 600 aaccttgaaa acattatgct aagtgaaaga agccarrcac aaaagrccac atattgtatg 660 attccattta tatgaaatgt ccagaatagg caaatccata gagacagaaa gtagattagt 720 ggttgccagg ggctgggggr aaggggaaat ggggagtgac tgctaatggg tacggggttt 780 ctttttgggg tgatgaaaat gttctaaaat tagatagtgg tgatggttgc acaactytgt 840 gaatatacta aaaaccactg aattgtacac tttaaaaggg tgaattttat ggtatgtgaa 900 ttatatctca ataaarctat 920 // ID CAVID2B1 repbase; DNA; ROD; 90 BP. XX AC . XX DT 26-DEC-2009 (Rel. 15.03, Created) DT 26-DEC-2009 (Rel. 15.03, Last updated, Version 3) XX DE tRNA-derived SINE family: consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW CAVID2B1. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-90 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 10(3), 502-502 (2010). XX DR [1] (Consensus) XX CC ~92% identical to consensus. XX SQ Sequence 90 BP; 32 A; 18 C; 24 G; 16 T; 0 other; gggccaggga tatagctcag tggcacagca cctgcctggc aagcatgagg tcgtgagttt 60 gattcctggt accaaaaaaa aaaaaaaaaa 90 // ID L1M6_5 repbase; DNA; ROD; 2113 BP. XX AC . XX DT 30-APR-1998 (Rel. 
6.5, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE Subfamily of LINE1 repetitive element 5' end - a consensus. XX KW L1 repeat; L1M6_5; L1 subfamily. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-2113 RA V V. and Jurka J.; RT "L1M6_5."; RL Direct Submission to Repbase Update (APR-1998). XX DR [1] (Consensus) XX CC 5' end of L1, probably of the L1MA9 subfamilies. CC Multiple ~30 bp tandem duplications in promoter region. XX SQ Sequence 2113 BP; 522 A; 791 C; 500 G; 279 T; 21 other; gacatcagca agatggcgga ataggacttt ccagcgctcg tcctcacaga aacatcaatt 60 tgaacaacta tccacgcacg aaaatacctt cacaagagct aaggaaacca ggtgagagat 120 tacagyacct gggtgtagca cagaaataag aaaagacgca ttgaagaggg taggaaggac 180 agttttacat tacccgcgtc acccctcccc caancccagg cagcacagca tggagagaga 240 taccctctgc ttgggggaag gagagggaag tgagcacagg actttgcctt ggaccccaaa 300 cactaggccc gccccagtaa aacccagtac taggcaggcc cccatagccc cagactccag 360 gccagtacct acggactgag ccgyaccaga tcccacagcc caggctccag gcctgcctgg 420 tggactcagt ctccaggcct gccccagcac caggccaacc ccagtgcccc aggctccaga 480 ccggccccag caccaggcca gccccagtag ccccaggctc caggctgscc ccagcaccag 540 gctggccccc atagccccag gcttcaggcc caccccagca ccaggctggc ccctrcagcc 600 ctagtcatca ggccagcacc tatagaccca gcctccaggc tggcccctgt agacacaggc 660 tccaggccta cccagcrcca ggccagcccc tgtagcccca ggctccaggc ccaccccagg 720 ytccagacca gcccagagcc aggtyggccc acatagcccc aggcttcagg cctgccccag 780 yaccaggtca gcacccctgg cctcagacct tagccaggta ccaggctggc acctgtagac 840 acaggctcca ggcctgccca gtaccaggcc agtccctgtg gccccaccct ccagggcyag 900 cccctgtggc cccatgctcc agcagaccca gggttcaggc ctgtcccagt agaccccagc 960 actaggctag tccccataga cccaggctcc aggactgtcc ctgtgtaccc aggtcccagg 1020 gcagccccta tggccccagg acccaggcca gccctcagag acctagcctc taggccagcc 1080 ctgcagaccc agcctccagg ctggcaccca yagacccaag ctccaggcca tcccccaggt 1140 tccaggccag cctcagtagc tccaggcacc aggctagcac ccacagaccc aggctccaga 1200 ctagccccac gctaccccag 
caccaggcca gccccaggct ccaggctggt ccctgtggcc 1260 caggctccag tggacccagg gtccaggcct gctccagcag acccagggtc caggcccacc 1320 ccagtagacc ctggttccag gctagccccc atggactcag gctccaggac cacccctgca 1380 gacccaggct ccaggccagc cccyatggac caggatccar ggcccatytc cccagttgca 1440 ggctccaggc ctgccccagt gccaggccag cccccatgga ctcaggctcc aggcccatcc 1500 cagtggaccc aggctccagg cccatcccag yaccaggcca gcccctgyag actcaggctc 1560 aaggcccacc ccagcaccag gtcagcccct gtggacccag gcttcaggcc agcccctata 1620 gacacaggct ccaggccyac cctcatggac ccaggctcca ggcccayccc cacagaccca 1680 atcaacaggt ccacccagtg gatccaggct ccaggcycaa ccctgtggac ccaggcacca 1740 ggcctgccac ctgctgaccc aggcaccagg ccagcctgcc taaggactcc agcagcaagc 1800 ctgcctatag accataccag atggcctgcc cagaatctct ggatgactgg tgaagggctt 1860 tcccagacaa agccagtctg caaagactgg aataagtccc tacttcttca aatgtgcaga 1920 caccaatgca aggcccaaga atcawgaaca atcagggaaa catgacacca ccaaaggaac 1980 aaaataaatt tccagtaact gaccctaaag aaatggagat ctatgaactg cctgacaaag 2040 aattcaaaat aattgtttta aggaagctca gtgaactwca agaaaacaca gatagacaat 2100 taaatgaaat cag 2113 // ID ZOMBI_B repbase; DNA; ROD; 468 BP. XX AC . XX DT 13-JAN-1998 (Rel. 3, Created) DT 31-OCT-2000 (Rel. 5.09, Last updated, Version 3) XX DE Medium reiteration frequency repeat; non-autonomous DNA DE transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; MER46; ZOMBI_B. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 468-338 RA Smit A.F. and Riggs D.A.; RT "Tiggers and other DNA transposon fossils in the human genome."; RL Proc. Natl. Acad. Sci. USA 93, 1443-1448 (1996). XX RN [2] RP 468-338 RA Jurka J., Kapitonov V.V., Klonowski P., Walichiewicz J. RA and Smit A.F.; RT "Identification of new medium reiteration frequency repeats in RT the genomes of Primates, Rodentia and Lagomorpha."; RL Genetica 98(3), 235-247 (0001). XX RN [3] RP 1-468 RA Kapitonov V.V. 
and Jurka J.; RT "ZOMBI_B."; RL Direct Submission to Repbase Update (30-NOV-1997). XX DR [3] (Consensus) XX CC 26 bp terminal inverted repeats, TA target site duplication CC [1,2]. XX SQ Sequence 468 BP; 153 A; 99 C; 71 G; 126 T; 19 other; caggttgagc atccctaatc tgaaaatccg aaatccaaaa tgctccaaaa tctgaaactt 60 tttgagcacc aacatgatgc cacaagtgga aaattccaca cctgacctca tgtgataggt 120 cacagtcaaa ayacaatcaa gactnnncna gcnnctncng ttgctnttnc tgccagncaa 180 cnacagnttg tgcacctngn tggcaragan actgacacat ttgctttctk atggttcagt 240 gtacacaaac tttgtttcat gcacaaaatt atttaaaata ttgtataaaa ttaccttcag 300 gctatgtgta taaggtgtat atgaaacata aatgaatttc gtgtttagac ttgggtccca 360 tccccaagat atctcattat gtatatgcaa atattccaaa atccaaaaaa atctgaaatc 420 caaaacactt ctggtcccaa gcatttcgga taagggatac tcaacctg 468 // ID ERV2A-CPo_I repbase; DNA; ROD; 5092 BP. XX AC . XX DT 19-JUN-2009 (Rel. 14.07, Created) DT 19-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Internal portion of ERV2 endogenous retrovirus: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERV2A-CPo_I. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-5092 RA Jurka J.; RT "ERV2-type endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1373-1373 (2009). XX DR [1] (Consensus) XX CC >97% identical to consensus. 
XX FH Key Location/Qualifiers FT CDS 169..2505 FT /product="ERV2A-CPo_I_1p" FT /translation="MGQALSEHELFVTGLREALKIRGIKVKTKELIKYFQF FT IHSVCPWFPLEGTINLNRWNRVGDALKDYYEVFGPKKVPVTAFSYWTLISE FT LLAQRHADPQLAETLAQGEAALAASRNCTPPPPVLSKAPSPPSSVVLEIPE FT PXQDGDVPTLPLSGDHEGPQNGAATAPPPYRPIYPDLSSGQEVDWSQLAEE FT AAQYHSPRVMAPAAAELLTSFQRPPGPSSALLSLEEQFHDLQQQVQLTKDI FT QRLSLELQTLHANPPRMMAVTGLPLPKIAPISGSSGGLPTMMFPVTETARP FT PKTEALRVTHPLLFPVTEVAQPGASGDNLHDGGETTRPTQGSASPQEGDIK FT SIRERGRQRRRTRRGTQSPSQEHERDKIQATTTTRGNKKRQPPRLQRRDIE FT TDNSEEENDSDSHSSDTDTDQPALALRANRAHAPFPVSALKEFKKAVSQYG FT PTAAFTLAVLDSLTQGWITPNDWRDLARATLSGGNYLLWKVEFQDQCELLA FT KDNTNRKNHYTLPMLTGTGRYATRQAQMRYEAGLFAQITLAASRAWQRLPS FT GGAASSLTKIKQKPEEHFSDFVDRLLQATECLFGTTEDNTGLVRQPAYENA FT LPACQAAIKPYRKKEDLSGYIRLCADIGPAYHQGLAMAAAVKEIFLAHKQG FT GKTNNGKCFKCDQKGHFVRDCPVAQKSQYNTSRPKPTSLCPKCRRGYHWAK FT ECRSKTDADGNPLPQQGNGIXGQPQTPTLRQTSGAIRFVPQQNNQTQQDQA FT QNNPFQTSSAPRQEVQDWTSVPPPMQY*" FT CDS 3515..4879 FT /product="ERV2A-CPo_I_3p" FT /translation="MVLMGHLQPGLPSPSAIPLGFFKIVIDLKDCFFSIPL FT HPNPKYNQPTTDPVTTVQITTVSPSAITVSQAPLHQADHIPAPQYSHSNLF FT SLIKGAYMALNHTQPDLTQSCWLCLAAPPPYYEGKALNLTFISSSSSSACD FT WDNHHKLTLPDVSGRGTCLGDSSRPPALAAQLCAQLHSKFKPNNYLIPPPG FT TSWACNFGLSPCIATAVFNKSTDFCVLVQVLPRITFHPGDYYLQASQISYI FT SKREPVTFTLAVLLGLGVATGIGTGTTALVLGDKHLAQLQAAVDQDLKEIE FT ASVTALQASLTSLSEVVLQNKRGLDLLFMKEGGLCAALKEECCFYADHSGV FT VKESMSKLREHLEEREKERIGATTFSSMWEDLSPYIAPLLGPLAALLLLLT FT VGPCIFQRIMTLINNVNNKVDTFMAKPIQVYYHRLEMAEQQTYRENSPTDG FT PHDAVAQTP*" FT CDS 2271..3269 FT /product="ERV2A-CPo_I_2p" FT /translation="MQVKDRCRRQSPPPAGKRDSGSAPDPHPQTDLRGYKV FT CPTTKQPDTTGPGAKQSISNLLRATPGSAGLDLCSASNAVLAPDMGVQAIP FT TGIYGPLPKGTCGLILGRGSTILKGLQVVPGVVDNDYTGEIVVMVSSISGL FT VSISQGQRIAQLVLLPLVQTSNAAVASYRGNSSLGSSDIYWAQFISKDKPL FT MALLLNGKSFSGLIDTGADVTVIRKEDWPPTWPLETTMTHLQGIGQSKNPQ FT RSASLLTWKDSENNQGHIQPYIVEGLPLNSWGGDSLTQMGMMMGSPNAVVT FT KQMLTQGFLPNQGLGKRGSGIKVPISPTPNPSKAGLGHFQ*" XX SQ Sequence 5092 BP; 1420 A; 1336 C; 1144 G; 1189 T; 3 other; agtggcgacc 
acgaagggac ctcacctggc acgagggata acgacgcggg accgttcaca 60 ggagtaaagc gctggccagc gtgactgatt caaggacatc gttcctgtga tctgtgcaga 120 cggccacctt acgtctcgcc tgaatcacgg taggtgaacg gcattaacat gggacaagca 180 ctcagtgagc acgaattgtt cgtgactggc ctcagggagg ctctcaagat tagagggatt 240 aaggttaaga caaaagaact aataaagtac tttcagttta tacattcagt gtgtccatgg 300 ttccccctag agggtaccat taatttaaac aggtggaata gagtaggaga tgctcttaag 360 gactattatg aagtttttgg tccaaaaaag gttcctgtaa ctgctttttc atattggact 420 ttaatctctg agctgctggc acaacgtcac gcagatcccc agctcgcgga gacgcttgcc 480 cagggtgagg ctgcgttagc tgcctcccgc aattgtactc cgccaccacc ggtcttatct 540 aaggcgccca gccctcccag ctctgttgtc cttgagatcc cggagcctkt tcaagatggc 600 gacgtgccga cactcccttt gtctggagac cacgagggtc ctcaaaatgg cgccgccacg 660 gcccctcctc cataccgccc tatttaccct gacctttcca gtggccagga agttgattgg 720 tcacaattag cagaagaggc tgcccaatac cattcccccc gggttatggc tcctgctgct 780 gcagaactac tcaccagttt tcagcgaccg cccggtccgt cttctgccct tctctcatta 840 gaggaacaat ttcacgattt acaacagcag gttcagctga ctaaggatat acaaaggctt 900 tccttagagc tccaaacatt gcatgcaaac ccccctagaa tgatggccgt cacgggcctg 960 ccactcccta agatagcgcc tatctcaggg tcgtcaggtg gtctgcccac aatgatgttc 1020 ccggttacag agactgcccg ccctcccaaa acagaggcct tgagagtaac acatccctta 1080 ttgtttcctg taacagaagt tgcccaacct ggcgcgagtg gcgataactt acacgatggt 1140 ggggagacaa ctcgacccac ccaaggtagt gccagtccac aagagggcga cattaaatca 1200 attagggaaa gggggagaca aaggaggcga actagacgcg ggactcaatc cccaagtcag 1260 gaacatgaaa gagataaaat acaggccact accacaacta ggggaaataa aaaaagacag 1320 ccaccacgtc tacaaagacg agatatagaa actgataata gtgaggaaga aaatgactct 1380 gactcgcata gcagtgatac ggacacagac caaccggccc tagcactgag agccaatagg 1440 gcccatgcac cattcccagt atctgcccta aaagaattta aaaaagcagt atctcaatat 1500 ggtcccacgg cggccttcac ccttgcggtc ttagattcct taacccaagg ctggattacg 1560 cctaatgatt ggagagattt agccagagct acactttcag ggggcaacta tctgctctgg 1620 aaggtagaat tccaagacca atgtgaactt ttagctaaag acaatactaa caggaaaaat 1680 cattacaccc ttcccatgct gacgggaaca 
ggccgatatg ccacacgaca agcccaaatg 1740 cgatatgagg caggtctttt tgctcagatc accctggcag caagcagagc atggcaaagg 1800 ctcccctctg gtggagctgc ttcttccctg acaaaaataa aacaaaaacc tgaggaacac 1860 ttctctgatt ttgttgatcg cctcttacag gcaactgagt gtctcttcgg caccaccgaa 1920 gataacactg gccttgttag gcaaccggct tatgaaaatg ctcttcctgc ctgccaggct 1980 gctatcaaac cctatagaaa aaaagaggac ctttcgggtt atattcgctt gtgtgctgac 2040 ataggccccg cctatcatca aggtttagct atggctgcag ctgtaaaaga aatcttcctc 2100 gcacacaagc aaggagggaa aacaaataac ggcaaatgct ttaaatgtga tcagaaaggc 2160 cattttgtca gagactgccc tgtggcccaa aagtctcagt acaatacttc aaggccaaaa 2220 cccacaagtc tatgyccaaa atgcagaagg ggatatcatt gggctaaaga atgcaggtca 2280 aagacagatg cagacggcaa tcccctcccc cagcagggaa acgggattcr gggtcagccc 2340 cagaccccca ccctcagaca gacctccggg gctataaggt ttgtcccaca acaaaacaac 2400 cagacacaac aggaccaggc gcaaaacaat ccatttcaaa cctcctccgc gccacgccag 2460 gaagtgcagg actggacctc tgttccgcct ccaatgcagt attagctccc gacatgggag 2520 tacaagccat ccccaccggg atttatggcc ctcttcccaa gggcacttgt gggttaattc 2580 tcggacgcgg cagcacaata ttaaaagggt tacaagttgt tccaggagtc gtagataacg 2640 actatactgg cgaaattgtt gtcatggtat cctctatctc tggcctagtt tccattagcc 2700 aaggacaacg tattgctcag ttggtgctgc tgcctttagt tcaaacttct aatgcagctg 2760 tggcttccta caggggaaat tcttcccttg gatcctcaga tatatactgg gctcaattca 2820 tttccaaaga taaacccctt atggctttat tgcttaatgg gaaaagtttc tcagggctca 2880 tagatacagg tgctgatgta acagtcatta gaaaggagga ctggcctccc acatggccct 2940 tagaaaccac tatgacgcat ttacaaggga ttggacaatc taaaaatcca caacgtagtg 3000 ccagtctact tacctggaaa gatagtgaga acaatcaggg gcacatacag ccctatattg 3060 tagaagggct gccccttaac tcgtggggcg gagattcgct aactcaaatg ggaatgatga 3120 tgggcagtcc caatgctgta gtcaccaaac aaatgctcac tcagggcttc cttcctaatc 3180 aaggtttagg caaacgaggc tcaggcatta aagtgcctat ttccccaact ccaaacccct 3240 ctaaagcagg cctaggccat tttcaatagg ggtcattgac cttcctgcgg cccatgcaaa 3300 agagattact tggttatcag acaaacctgt ttgggtcgat cagtggcccc ttacctctaa 3360 aaagctcgca gctgcacagc agctggtgca ggaccagtta 
gatgcgggcc acattgtccc 3420 cagtgattcc ccttggaaca cgcccatctt tgttattcga aaaaagtcag ggaaatggag 3480 gctgctacaa gaccttaggg caataaataa gaccatggtc ttaatgggac atctgcagcc 3540 aggactccct tcacccagtg ccattcctct gggatttttt aaaattgtca ttgatcttaa 3600 agattgcttt ttttctattc ccttacatcc aaatccaaaa tacaatcagc ccactacaga 3660 tcctgttact actgttcaaa taactacagt aagtccctca gccataacgg tttctcaagc 3720 tccgctacac caggcagatc atatacctgc ccctcaatat tcacattcta acctattcag 3780 tttgattaaa ggagcataca tggctttaaa tcatacccaa cctgatctca cccaatcatg 3840 ttggttatgc ttggctgctc ctcccccata ttatgaagga aaggccctca atttaacctt 3900 tatcagttct tctagctcat ctgcctgtga ctgggacaat caccacaagc tgactctccc 3960 tgatgtctca ggtagaggaa cttgtcttgg ggattccagc cgtccaccag cactggcagc 4020 acaactctgt gcccagcttc attccaaatt caagccaaat aattacctca ttccaccccc 4080 aggaacgtct tgggcctgca attttggcct atcaccctgt atagcaactg ctgtttttaa 4140 caagtctaca gatttttgtg ttcttgtaca ggttcttcct cgcataacct tccacccagg 4200 agactactac ctccaggcat cgcagatcag ctatatatct aagagagaac ctgtgacatt 4260 caccttagct gtgttactag ggttaggagt agcaacggga atagggacag ggaccacagc 4320 tctagtgttg ggagataagc atctagctca attacaggca gcagtagatc aggatttaaa 4380 agaaatagaa gcatctgtga cagctctaca ggcgtccctg acgtctctat ctgaggtagt 4440 cctgcaaaat aaaagaggac tagacctcct atttatgaag gagggtggct tatgtgcagc 4500 cctcaaggaa gagtgttgtt tttatgctga ccactcggga gtagtcaaag agtccatgag 4560 taaactgaga gagcacctcg aagagagaga aaaggagcgc atcggagcaa ccaccttctc 4620 ctccatgtgg gaggatctct ccccttacat tgcccccctc ttgggacccc tagctgcctt 4680 gctgttgctg cttactgtag gaccttgtat tttccagagg atcatgacac tcattaataa 4740 cgttaataac aaggttgaca catttatggc aaagcccatt caggtatatt accatcggtt 4800 ggaaatggca gaacaacaaa cttatcggga aaattcaccc actgacggcc cacatgatgc 4860 tgtggcccag acgccctgag gcactggaca ggcagtcaga gacgggcaac taggacacat 4920 atgtaatgcc aatcggccac ctaagacagg tccaggacca acagctgtcc tgcaggccca 4980 tgacgggtaa ggctcgattg ggttcatatg aggagtcctt gacccaaggc aggcgctgtc 5040 ctcccagggc ctgccatgtt ccaaaaaata tatataataa taagggagga 
ga 5092 // ID ZOMBI repbase; DNA; ROD; 2806 BP. XX AC . XX DT 13-JAN-1998 (Rel. 3, Created) DT 31-OCT-2000 (Rel. 5.09, Last updated, Version 4) XX DE Autonomous DNA transposon; POGO superfamily. XX KW DNA transposon; Transposable Element; TIRs; TA target; MER46; KW TIGGER4; ZOMBI. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 2806-2710 RA Smit A.F. and Riggs D.A.; RT "Tiggers and other DNA transposon fossils in the human genome."; RL Proc. Natl. Acad. Sci. USA 93, 1443-1448 (1996). XX RN [2] RP 2806-2710 RA Jurka J., Kapitonov V.V., Klonowski P., Walichiewicz J. RA and Smit A.F.; RT "Identification of new medium reiteration frequency repeats in RT the genomes of Primates, Rodentia and Lagomorpha."; RL Genetica 98(3), 235-247 (1996). XX RN [3] RP 1-2806 RA Kapitonov V.V. and Jurka J.; RT "ZOMBI."; RL Direct Submission to Repbase Update (31-DEC-1997). XX RN [4] RP 1-2806 RA Smit A.F.; RT "ZOMBI."; RL Direct Submission to Repbase Update (31-DEC-1997). XX RN [5] RP 1-2806 RA Jurka J. and Kapitonov V.V.; RT "Sectorial mutagenesis by transposable elements."; RL Genetica 107, 239-248 (2000). XX DR [3] (Consensus) XX CC 23 bp terminal inverted repeats and TA target site [1,2]. CC ZOMBI is an autonomous DNA transposon. Its non-autonomous CC elements have been identified as ZOMBI_A (MER46 [1,2]) and CC ZOMBI_B. CC Orientation of ZOMBI has been determined based on the CC reconstruction of its internal sequence encoding transposase [3]. CC It has been shown [3,5] that the neurolepsy-related Jerky gene in CC human and mouse is a recruited transposase from ZOMBI. 
XX SQ Sequence 2806 BP; 912 A; 540 C; 600 G; 754 T; 0 other; caggttgagc atccctaatc caaaaatccg aaatctgaaa tgctccaaaa tctgaaactt 60 tttgagcgct gacatgacgc cacaagtgga aaattccaca cctgacctta tgtgacgggt 120 cacagtcaaa acgcaggtgc acaacacaca gtttattcgg cgtccccaag ggaaaaaaga 180 ccctcccagc ccccttcagc tgcggtatat cttttccgcg cacacccaga ttcccccatg 240 caagcacgcc cacaaagggt aataaaatgg cacgtgtgca ggctggacac accaacggca 300 ggttccccac aatgccccca catggggtca agacctacgt gcattactca ctgtgttttt 360 ttgcttattc tctgctctgt ggtgtaaaga tattgttgaa aatgtcaaaa aggcctgtag 420 atacccctgt gagtaacaat gataagaaaa aggaagcatt tatgtttatc tatagcacag 480 aaaagtcaag ctgttggaga aactggacag tggtgtaagt gtgaaacgtc ttacagaaga 540 gtatggtgtt ggaatgacca ccatatatga cctgaagaaa cagaaggata aactgttgaa 600 gttctatgct gaaagtgatg aacggaagtt aatgaaaaat aaaaaaacac tgcataaagc 660 taaaaatgaa gatctcgatc gtgtattgaa agagtggatc cgtcagcatc acagtgaaca 720 catgccactt aatggtacgc tgatcatgaa acaagcaaag atctgtcaca atgaactgaa 780 aattgaaggg aactgtgaat attcaacggg ctggttgcag aaatttaaga aaagacacgg 840 cattacattt ttaaagattt gtggtgataa agcatctgct gatcatgaag cagcggagaa 900 attcattgac gagtttgcca agatcatcgc tgatgaaaat ctgatgccag aacaagtcta 960 taatgctgat gaaacatcac cgttttggtg ttattgcccc agaaagacac tgactacagc 1020 tgatgagaca gcccctacag gaattaagga tgccaaggac agaataactg tgctgggatg 1080 tgctaatgca gcaggcacgc ataagtgtaa acttgctgtg ataggcaaaa gcttgcgtcc 1140 ttgctgtttt caaggagtga atttcttacc agtccattat tatgctaaca aaaaggcatg 1200 gatcaccagg gacatctttt ctgattggtt tcacaaacat tttgtaccag cggcttgtgc 1260 tcactgcagg gaagctggac tggatgatga ctgcaagatt ttgttattcc ttgacaactg 1320 ttctgctcat cctccagctg aaattctcat caaaaataat gtttatgcca tgtactttcc 1380 cccaaatgtg acttcattaa ttcagccatg tgaccagggt atctttagat caatgaagag 1440 taaatataaa aacactttct tgaacagcat gctagcagca gtgaacagag gcgtgggtgt 1500 ggaaggtttt caaaaggagt ttagcatgaa ggatgccgta tatgctgttg ccaacgcttg 1560 gaacacagtg actaaagaca cagttgtgca tgcctggcac aacctctggc ctgcgactgt 1620 gttcagtgat gatgatgaac caagtggtga 
ctttgaagga ttctgtatgt caagtgagaa 1680 aaaaatgatg tctgacctcc ttacatatgc aaaaaatata ccttcagagt ccgtcagtaa 1740 gctggaagaa gtggatatta aagacatttt taacatcgat aatgaggctc cagttgttca 1800 ttcattggaa gaagtggata tcaaagaagt cttccacatc gataaatgca ttaccagttg 1860 ttcaaccatc accggatggt ggaatagccg aaatggttct gaatcaaggt gattgtgatg 1920 atagtgatga tgaagatgat gacgttaaca ctgcagaaaa agcgcctata gatgacatgg 1980 tgaaaatgtg tgatgggctt attgaaggac tagagcagcg tgcattcata acagaacaag 2040 aaatcatgtc agtttataaa atcaaagaga gacttctaag acaaaaacca ttgttaatga 2100 ggcagatgac tccggaggaa acattttaaa aagccatcca gcagaatgcc tcctcatccc 2160 tagaggaccc acttcctggt ccctcaactg cttctgatgt ttcttctcac ttagaaaaca 2220 aaaaccaaaa agcaaaaaaa atacagtgta cagtaacctt ttaatcaaaa cacagcatcg 2280 tagatggaga ctgaaagcct gccattgttt gttgttgctg ttgtttaaca gctgatacag 2340 gtattctggt gatgctactg tgctgcttag ttaccctgaa cacatttttt tttcactgta 2400 ttaatggtat gtcatatttt ttactgttaa gtacttatgt gtgaataagt gtaagaaaat 2460 gattgcttat cggtagcata taaattcaga gtcaggaatg atggtgatgc caaacaacca 2520 cagattgtcc acatgggtgg ctgagatagt gacacctttg ctttctgatg gttcaatgta 2580 cacaaacttt gtttcatgca caaaattatt aaaaatattg tataaaatta ccttcaggct 2640 atgtgtataa ggtatatatg aaacataaat gaattttgtg tttagacttg ggtcccatcc 2700 ccaagatatc tcattatgta tatgcaaata ttccaaaatc tggaaaaaaa tccagaattc 2760 aaacacttct ggtcccaagc atttcggata agggatactc aacctg 2806 // ID LTR6F_Cpo repbase; DNA; ROD; 451 BP. XX AC . XX DT 19-SEP-2009 (Rel. 14.1, Created) DT 19-SEP-2009 (Rel. 14.1, Last updated, Version 3) XX DE Long terminal repeat of ERV3 endogenous retrovirus: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR6F_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-451 RA Jurka J.; RT "Endogenous retroviruses from guinea pig."; RL Repbase Reports 9(10), 2154-2154 (2009). 
XX DR [1] (Consensus) XX CC ~83% identical to consensus. XX SQ Sequence 451 BP; 88 A; 106 C; 131 G; 125 T; 1 other; tgttatggct tgtgtctaga tgtcccccca aagcctcatg cggtcatagg tggggctttt 60 cggaggtggc tggatctagg gtgtgtgatg ctgggattaa gggtgtgtga ttactaaatt 120 agtccactga ttggtttggc atagttctgg gtgtggataa tgggaaggcc cgccctgggt 180 gtgggtggca ccacccnaaa ggcgcgtagc ctggatggga taaaagggag aggaggctgc 240 tgctgctgct cttcgctgct tcctgcttgc tgccttctgc ctgccatgga ctgtttctcc 300 tctgcgatgc ccctctgcca tgccaccctg ccttggagcc agccgactat ggactgaaac 360 ctctacaaac tgtgagctaa aataaacctt tcctccttta actttgggtg tcgggtattt 420 tgtctcagca acgagaaaag taactaagac a 451 // ID L1ME3A repbase; DNA; ROD; 917 BP. XX AC . XX DT 01-OCT-1995 (Rel. 1.09, Created) DT 07-OCT-1998 (Rel. 3.09, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1ME3A) - a consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW Repetitive sequence; L1 (LINE) family; L1M4; L1ME3a subfamily; KW L1ME3a. XX OS Homo sapiens OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; OC Catarrhini; Hominidae; Homo. XX RN [1] RP 1-917 RA Smit A.F., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX RN [2] RP 1-917 RA Smit A.F.; RT "Direct submission."; RL Direct Submission to Repbase Update (03-MAY-2000). XX DR [2] (Consensus) XX CC ORF2 ends at bp 675; average divergence of copies from consensus: CC 24%. 
XX SQ Sequence 917 BP; 342 A; 169 C; 177 G; 220 T; 9 other; cttgtatcca gaatatataa agaacgccta caactcaaca ataaaaaaac aaacaaccta 60 attaaaaaat gggcaaaaga cttgaatagg catttcacca aagaagatat acagatggcc 120 aataagcaca tgaaaagatg ctcaacatca ttagtcatca gggaaatgca aattaaaacc 180 acaatgagat accacttcac acccactaga atggctaaaa ttaaaaagac cgacaayaac 240 aagtaytggc gaggatgtgg agcaaccgaa actctcatac attgctggtg ggagtgtaaa 300 ttggtacaac cactttggaa aattgttwgg cagtatctac taaagctgaa catacgcata 360 ccctatgacc cagcaattcc actcctaggt atatncccaa gagaaatgcg tacatatgtt 420 caccaaaaga catgtacaag aatgttcata gcagcactgt tcgtaatagc ccmaaactgg 480 aaacnaccca aatgcccatc aacagtagaa tggataaata aattgtggta tattcataca 540 atggaatact acgcagcaat gagaatgaac gaactacagc tacacacaac aacatggatg 600 aatctcacaa acataatgtt gagcgaaaga agccagacac aaaagagtac atactgtatg 660 attccattta tataaagttc aaaaacaggc aaaactaatc tatgstgtta gaagtcagga 720 tagtggttac ccttgggaag gggtagtgac tagaagggag cacatgaggg gctttctggg 780 gtgctggtaa tgttctgttt cttgatctgg gtgctggtta cacgggtgtg ttcastttgt 840 gaaaattcat cgagctgtac acttatgatw tgtgcacttt tctgtatgta tgttatactt 900 caataaaaag ttaaaaa 917 // ID MARE3 repbase; DNA; ROD; 180 BP. XX AC . XX DT 25-AUG-2006 (Rel. 11.08, Created) DT 31-OCT-2006 (Rel. 11.08, Last updated, Version 2) XX DE Conserved mammalian SINE element. XX KW tRNA; Pseudogene; CYN-I; MARE3; Non-LTR retrotransposon; Rhin-1; KW SINE; SINE_SM; transposable element; conserved. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-180 RA Jurka J.; RT "MARE3: Conserved mammalian SINE element."; RL Repbase Reports 6(8), 432-432 (2006). XX DR [1] (Consensus) XX CC Present in >500 copies in the human genome. Similar to 5'-ends CC of CYN-I, Rhin-1 and SINE_SM. Reconstructed from human genomic CC sequences. tRNA-derived. 
XX SQ Sequence 180 BP; 54 A; 39 C; 43 G; 42 T; 2 other; gctcagttgg ttagagcata gtgctaatga ggccaaggtc atgggtttaa tccccatatg 60 ggccagttag cttcacacag agaaaaacat tgtgttccct ggctatagac tgcaccccta 120 aycctagcca gctgtttcat aaatgtatgc caytggtcac aagaggaaca agggaaagaa 180 // ID IAPEZI repbase; DNA; ROD; 6388 BP. XX AC AC003993; XX DT 06-MAY-1999 (Rel. 4.04, Created) DT 06-MAY-1999 (Rel. 4.04, Last updated, Version 1) XX DE An internal portion of a recent strain of IAP provirus. XX KW Endogenous Retrovirus; Transposable Element; retrovirus; KW Intracisternal A particle element; IAP; IAPEZI. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-6388 RA Lee Y.I., Wang K., Smit A.F., Yu J., Wong K.G., Iadonato P.S., RA Magness L.C., Green P., Olson V.M. and Hood L.; RT "Large-Scale Sequence Analysis of the Mouse T-Cell Receptor Alpha RT Locus."; RL Unpublished (1996) University of Washington Human Genome Center RL Box 352145 Seattle, WA 98195;Contact: Inyoul Lee RL (borah@u.washington.edu). XX RN [2] RP 1-6388 RA Lee Y.I., Wang K., Smit A.F., Yu J., Wong K.G., Iadonato P.S., RA Magness L.C., Green P., Olson V.M. and Hood L.; RT "IAPEZI."; RL Direct Submission to Repbase Update (14-JAN-1998)Direct RL Submission Human Genome Center, University of Washington, Box RL 352145, Seattle, WA 98195, USA University of Washington Human RL Genome Center Box 352145 Seattle, WA 98195 Contact: Inyoul Lee RL (borah@u.washington.edu). XX DR GenBank; AC003993; Positions 31587 37974. 
XX SQ Sequence 6388 BP; 1829 A; 1391 C; 1507 G; 1661 T; 0 other; attggtgccg aattccggga cgagaaattc cgggacgaga aaaaactcgg gactggcgca 60 aggaagatcc ctcattccag aaccagaact gcgggtcgcg gtaataaagg ttcccgtaaa 120 gcagactgtt aagaaggatt caactgtatg aattcagaac ttttcagctg gggaacgaga 180 gtaccagtga gtacagcttt acgaggtaag tctgatcttg aactttctaa cgaaattcaa 240 gacagtctat cagaagtaaa gtggaatatg tttggccttg aattttttct ggtgttagga 300 gcccttttgt tccttttcac atgttatcaa gtggttaaga tagggcggat tctagaggaa 360 attcaggaca agctatcaga agtaaagcgg ggagagagag taggagcaaa gaggaaatat 420 ggtacacaaa ataagtatac aggcctttcc aagggtcttg aacccgagga aaagttaagg 480 ttaggtagga atacctggag agagattaga agaaaaagag gaaaaaggga aaagaagaaa 540 gatcaattag cggaggtctc taggaaaagg agcctgtgct catcgctgga tgggctcggg 600 aagccagctc ttagtagctc tgaagcaggt gaagaatcct cctctgagga aacagactgg 660 gaggaagaag cagcccatta ccagccagct aattggtcaa gaaaaaagcc aaaagcggct 720 ggcgaaggcc agtttgctga ttggcctcag ggcagtcggc ttcaaggtcc gccctatgcg 780 gagtctccgc cctgcgtagt gcgtcagcag tgcgcagaga ggcagtacgc agagaggcag 840 tgcgcagact cattcattcc cagagaggaa caaaggaaaa tacaacaggc atttccggtc 900 tttgaaggag ccgagggtgg gcgtgtccac gctccggtag aatacttaca aattaaagaa 960 attgccgagt cggtccgtaa atacggaacc aatgctaatt ttaccttggt gcagttagac 1020 aggctcgccg gcatggcact aactcctgct gactggcaaa tggttgtaaa agccgctctc 1080 cctagtatgg gcaaatatat ggaatggaga gcgctttggc acgaagctgc acaagcgcag 1140 gcccgagcaa acgcagctgc tttgactcca gagcagagag attggacttt tgacttgtta 1200 acgggtcagg gagcttattc tgctgatcag acaaactacc attggggagc ttatgcccaa 1260 gtttcttcca cggctattag ggcctggaag gcgctctccc gagcaggtga aaccactggt 1320 cagttaacaa agataatcca gggacctcag gaatccttct cagattttgt ggccagaatg 1380 acagaggcag cagagcgtat ttttggagag tcagagcaag ctgcgcctct gataaaacag 1440 ctaatctatg agcaagccac aaaggagtgc cgagcagcca tagccccaag aaagaacaaa 1500 ggtttacaag actggctcag ggtttgtcgg gagcttgggg gacctctcac caatgcaggc 1560 ttagcggctg ccatccttca atcccaaaaa tgctccatgg gcagaaatga tcggaggaca 1620 tgttttaact gcaggaagcc tgggcatctt 
aagaaagatt gcagagctcc aggtaaacag 1680 ggagggactc tcactctttg ctctaagtgt ggcaagggtt atcatagagc tgaccagtgt 1740 cgctctgtga gggatataaa gggcagaatt cttcccccac ctgatagtca atcaactgat 1800 gtgccaaaaa acgggtcatc gggccctcgg tcccagggcc ctcaaagata tgggaaccgg 1860 tttgtcagga cccaggaagc agtcagagag gcgacccagg aagacccaca agggtggacc 1920 tgcgtgccgc ctccgacttc ctattaatgc ctcaaatgag tattcagccg gtgccagtgg 1980 agcctatacc atccttgccc ccaggaacca tgggccttat tctcggccgg ggttcactca 2040 ccttgcaggg cttagtagtc caccctggag ttatggattg tcaacattcc cctgaaatac 2100 aggtcctgtg ctcaagccct aagggcgttt tttctattag taaaggagat aggatagctc 2160 agctgctgct cctccctgat aataccaggg agaaatctgc aggacctgag ataaagaaaa 2220 tgggctcctc aggaaatgat tctgcctatt tggttgtatc tttaaatgat agacctaagc 2280 tccgccttaa gattaatgga aaagagtttg aaggcatcct tgataccgga gcagataaaa 2340 gtataatctc tacacattgg tggcccaaag catggcccac cacagagtca tctcattcat 2400 tacagggcct aggatatcaa tcatgtccca ctataagctc cattgccttg acgtgggaat 2460 cctctgaagg acagcaaggg aaattcatac cttatgtgct cccactcccg gttaacctct 2520 ggggaaggga tattatgcag catttgggcc ttattttgtc caatgaaaac gccccatcgg 2580 gagggtattc agctaaagca aaaaatatca tggcaaagat gggttataaa gaaggaaaag 2640 ggttaggaca tcaagaacag ggaaggatag agcccatctc acctaatgga aaccaagaca 2700 gacagggtct gggttttcct tagcggccat tggggcagca cggcccatac catggaaaac 2760 aggggaccca gtgtgggttc ctcaatggca cctatcctct gaaaaactgg aagctgtgat 2820 tcaactggta gaggaacaat taaaattagg ccatattgaa ccctctacct caccttggaa 2880 tactccaatt tttgtaatta agaaaaagtc aggaaagtgg agactgctcc atgacctcag 2940 agccattaat gagcaaatga acttatttgg cccagtacag aggggtctcc ctgtactttc 3000 cgccttacca cgtggctgga atttaattat tatagatatt aaagattgtt tcttttctat 3060 acctttgtgt ccaagggata ggcccagatt tgcctttacc atcccctcta ttaatcacat 3120 ggaacctgat aagaggtatc aatggaaggt cttaccacag ggaatgtcca atagtcctac 3180 tatgtgtcaa ctttatgtgc aagaagctct tttgccagtg agggaacaat tcccctcttt 3240 aattttgctc ctttacatgg atgacatcct cctgtgccat aaagacctta ccatgctaca 3300 aaaggcatat ccttttctac ttaaaacttt aagtcagtgg 
ggtttacaga tagccacaga 3360 aaaggtccaa atttctgata caggacaatt cttgggctct gtggtgtccc cagataagat 3420 tgtgccccaa aaggtagaga taagaagaga tcacctccat accttaaatg attttcaaaa 3480 gctgttggga gatattaatt ggctcagacc ttttttaaag attccttctg ctgagttaag 3540 gcctttgttt ggtattttag aaggagatcc tcatatctcc tcccctagga ctcttactct 3600 agctgctaac caggccttac aaaaagtgga aaatgcctta caaaatgcac aattacaacg 3660 tattgaggat tcgcagcctt tcagtttgtg tgtctttaag acagcacaat tgccaactgc 3720 agttttgtgg cagaatgggc cattgttgtg gatccatcca aacgtatccc cagctaaaat 3780 aatagattgg tatcctgatg caattgcaca gcttgccctt aaaggcctaa aagcagcaat 3840 cacccacttt gggcaaagtc catatctttt aattgtacct tatactgctg cacaggttca 3900 aaccttggca gccgcatcta atgattgggc agttttagtt acctcctttt caggaaaaat 3960 agataaccat tatccaaaac atccaatctt acagtttgcc caaaatcaat ctgttgtgtt 4020 tccacaaata acagtaagaa acccacttaa aaatgggatt gtggtatata ctgatggatc 4080 aaaaactggc ataggtgcct atgtggctaa tggtaaagtg gtatccaaac aatataatga 4140 aaattcacct caagtggtaa aatgtttagt ggtcttagaa gttttaaaaa cctttttaga 4200 accccttaat attgtgtcag attcctgtta tgtggtaaat gcagtaaatc ttttagaagt 4260 ggctggagtg attaagcctt ccagtagagt tgccaatatt tttcagcaga tacaattagt 4320 tttgttatct agaagatctc ctgtttatat tactcatgtt agagcccatt caggcctacc 4380 tggccccatg gctctgggaa atgatttggc agataaggcc actaaagtgg tggctgctgc 4440 cctatcatcc ccggtagagg ctacaagaaa ttttcataac aattttcatg tgacggctaa 4500 aacattacgc agtcgtttct ccttgacaag aaaagaagcc cgtgacattg ttactcaatg 4560 tcaaagctgc tgtgagttct tgccagttcc tcatgtggga attaacccac gcggtattcg 4620 acctctacag gtctggcaaa tggatgttac acatgtttct tcctttggaa aacttcaata 4680 tctccatgtg tccattgaca catgttctgg catcatgttt gcttctccgt taaccggaga 4740 aaaagcctca catgtgattc aacattgtct tgaggcatgg agtgcttggg ggaaacccag 4800 actccttaag actgataatg gaccagctta tacgtctcaa aaattccaac agttctgccg 4860 tcagatggac gtaacccacc tgactggact tccatacaac cctcaaggac agggtattgt 4920 tgagcgtgcg catcgcaccc tcaaagccta tcttataaaa cagaagaggg gaattgaaga 4980 gattttaccc cgagcaccaa gagtgtcggt gtctttggca ctctttacac 
tcaatttttt 5040 aaatattgat gctcatggcc atactgcggc tgaacgtcat tgttcagagc cagataggcc 5100 caatgagatg gttaaatgga aaaatgtcct tgataataaa tggtatggcc cggatcctat 5160 cttgataaga tccaggggag ctgtctgtgt tttcccacag aatgaagaca acccattttg 5220 ggtaccagaa agactcaccc gaaaaatcca gactgaccaa gggaatacta atgtccctcg 5280 tcttggtgat gtccagggcg tcaataataa agagagagca gcgttggggg ataatgtcga 5340 catttccact cccaatgacg gtgatgtata atgctcaagt attctcctgc ttttttacca 5400 ctaactagga actgggtttg gccttaattc agacagcctt ggctctgtct ggacaggtcc 5460 agatgactga caccattaac actttgtcag cctcagtgac tacagtcata gataaacagg 5520 cctcaactaa tgtctagata cagagaggtc tcatgctggt taatcaactc atagatcttg 5580 tccagataca actagatgta ttatgacaaa taactcagca gggttgtgaa caaaagtttc 5640 cgggattgtg tgttatttcc attcagtatg ttaaatttac tagggcagct aatttgtcaa 5700 aaagtctttt tcagtatatg ttacagaatt ggatggctga atttgaacag atcccttcgg 5760 gaattgagac ttcaggtcaa ctccacgcgc ttggacctgt ccctgaccaa aggattaccc 5820 aattggatct cctcagcatt ttctttcttt aaaaaatggg tgggattaat attatttgga 5880 gatacacttt gctgtggatt agtgttgctt ctttgattgg tctgtaagct taaggcctaa 5940 actaggagag acaaggtggt tattgcccag gcgcttgcag gactagaaca tggagcttcc 6000 cctgatatat ctatgtttag gcaataggtc gctggccact cagctcttac atctcacgag 6060 gctagactca ttgcacggga tggagtgagt gtgcttcagc agcccgagag agttgcaagg 6120 ctaagcactg caatggaaag gctctgcggc atatatgagc ctattctagg gagacatgtc 6180 atctttcatg aaggttcagt gtcctagttc ccttccccca ggcaaaacga cacgggagca 6240 ggtcagggtt gctctgggta aaagcctgta agcctaagag ctaatcctgt acatggctcc 6300 tttacctaca cactggggat ttgacctcta tctccactct cattaatatg ggtggcctat 6360 ttgctcttat taaaaggata gggggaga 6388 // ID IAPLTR1a_I_MM repbase; DNA; ROD; 6481 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse family of LTR retrotransposon - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; KW IAPLTR1a_I_MM; IAPLTR1a_Mm-int; IAPLTR2_Mm. 
XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31, 51-54 (2003). XX RN [2] RP 1-6481 RA Pavlicek A. and Jurka J.; RT "IAPLTR1a_I_MM."; RL Direct Submission to Repbase Update (JAN-2004). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Similar to IAPEY3_I. LTRs are listed as CC IAPLTR1a_Mm. This family is very young and elements are ~98% CC identical to the consensus. CC IAPLTR1a_I_MM_ORF: 184-2059 (625 aa) gag CC MNSELFSWGTRVPVSTALQGKSGLELSKEIQDSLSEVKWKIALQGMFGLEFFLVLGALLFLFTCYQVIKI CC GLKILDEIQGNLSEVKRGERVGAKRKYGTQNKYTGLSKGLEPEEKFRSGKNTWGEIRRKEKKKEKKKDQL CC AEVSRRYSSLDELRKPALSSSEADEEFSSEETDWEEEAAHYQPANWSRKKPKAAGESQRTVQPPGSRFQG CC PPYAEPPPCVVRQQCAERQCAERQCAECAERQCAERQCAERQCAERQCADSFIPREEQRKIQQAFPVFEG CC AEGGRVHAPVEYVQIKELAESVRKYGTNANFTLVQLDRLAGMALTPADWQTIVKAALPSMGKYMEWRALW CC HEAAQAQARANAAALTPEQRDWTFDLLTGQGAYSADQTNYHWGAYAQISSTAIRAWKALSRAGEATGQLT CC KIIQGPQESFSDFVARMTEAAERIFGESEQAAPLVEQLIYEQATKECRAAIAPRKNKGLQDWLRVCRELG CC GPLSNAGLAAAILQSQNRSMGRNNQRTCFNCGKPGHFKKDCRAPDKQGGTLTLCSKCGKGYHRADQCRSV CC RDIKGRILPPPDSQSAYVPKNGSSGPRSQGPQRYGNRFVRTQEAVREATQEDPQGWTCVPPPTSY CC IAPLTR1a_I_MM_ORF: 2061-2835 (258 aa) pro CC MPQMSIQPVPVEPIPSLPPGTMGLILGRGSLTLQGLVVHPGVMDCQHSPEIQVLCSSPKGVFSISKGDRI CC AQLLLLPDNTREKFAGPEIKKMGSSGNDSAYLVVSLNDRPKLRLKINGKEFEGILDTGADKSIISTHWWP CC KAWPTTESSHSLQGLGYQSCPTISSIALTWESSEGQQGKFIPYVLPLPVNLWGRDIMQHLGLILSNENAP CC SGGYSAKAKNIMAKMGYKEGKGLGHQEQGRIEPISPNGNQDRQGLGFP CC IAPLTR1a_I_MM_ORF: 3071-4658 (529 aa) pol (partial) CC MNLFGPVQRGLPVLSALPRGWNLIIIDIKDCFFSIPLCPRDRPRFAFTIPSINHMEPDKRYQWKVLPQGM CC 
SNSPTMCQLYVQEALLPVREQFPSLILLLYMDDILLCHKDLTMLQKAYPFLLKTLSQWGLQIATEKVQIS CC DTGQFLGSVVSPDKIVPQKVEIRRDHLHTLNDFQKLLGDINWLRPFLKIPSAELRPLFGILEGDPHISSP CC RTLTLAANQALQKVEKALQNAQLQRIEDSQPFSLCVFKTAQLPTAVLWQNGPLLWIHPNVSPAKIIDWYP CC DAIAQLALKGLKAAITHFGRSPYLLIVPYTAAQVQTLAATSNDWAVLVTSFSGQIDNHYPKHPILQFAQN CC QSVVFPQITVRNPLKNGIVVYTDGSKTGIGAYVANGKVVSKQYNENSPQVVECLVVLEVLKTFLEPLNIV CC SDSCYVVNAVNLLEVAGVIKPSSRVANIFQQIQLVLLSRRFPVYITHVRAHSGLPGPMALGNDLADKATK CC VVAAALSSPVEAARNFHNNFHVTAETLRSRFSLTRKEGP CC IAPLTR1a_I_MM_ORF: 4755-5487 (244 aa) pol (partial) CC MDVTHVSSFGKLQYLHVSIDTCSGIMFASPLTGEKASHVIQHCLEAWSAWGKPKLLKTDNGPAYTSQKFQ CC QFCRQMDVTHLTGLPYNPQGQGIVERAHRTLKAYLIKQKRGTFEETLPRAPRVSVSMALFTLNFLNIDAH CC GHTAAERHCSEPDRPNEMVKWKNVLDNKWYGPDPILIRSRGAVCVFPQNEDNPFWIPERLTRKIQTDQGN CC TDVPRLGDVQGVNNKERAALGDNVDISTPNDGDV. XX SQ Sequence 6481 BP; 1865 A; 1421 C; 1537 G; 1658 T; 0 other; attaagaatt ggtgccgaaa tccgggacga gaaaaaatcc gggacgaaaa aatacaagaa 60 aactcgggaa ccggcgcaag gaagatccct cattccagaa ccagaactgc gggtcgcggt 120 aataaaggtt cccgtaaagc agactgttaa gaaggattca actgcatgaa ttcagaactt 180 ttcagctggg gaacgagagt accagtgagt acagctttac aaggtaagtc tggtcttgaa 240 ctttctaagg aaattcaaga cagtctatca gaagtaaagt ggaaaatagc tttacaaggt 300 atgtttggcc ttgaattttt tctagtgtta ggagcccttt tgttcctttt cacatgttat 360 caagtgatta agatagggct gaaaattctg gatgaaattc agggcaatct atcagaagta 420 aagcggggag agagagtagg agcaaagaga aaatatggta cacaaaataa gtatacaggc 480 ctttccaagg gtcttgaacc cgaggaaaag tttaggtcag gtaagaatac ctggggagag 540 attagaagga aggaaaagaa aaaagaaaag aaaaaagatc aattagcgga ggtctctagg 600 agatactcgt cactagatga gctcaggaag ccagctctta gtagctctga agcagatgaa 660 gaattctcct ctgaggaaac agactgggag gaagaagcag cccattacca gccagctaat 720 tggtcaagaa aaaagccaaa agcggctggc gaaagccagc gtactgttca acctccgggc 780 agtcggtttc aaggtccgcc ctatgcggag cccccgccct gcgtagtgcg tcagcaatgc 840 gcagagaggc aatgcgcaga gaggcaatgc gcagagaggc agtgcgcaga gaggcagtgc 900 gcagagaggc agtgcgcaga gaggcagtgc gcagactcat tcattcccag agaggaacaa 960 aggaaaatac 
aacaggcatt tccagtcttt gaaggagccg agggtgggcg tgtccacgct 1020 ccggtagaat acgtacagat taaagaactt gccgagtcgg tccgtaaata cggaaccaat 1080 gctaatttta ccttggtgca gttagacagg ctcgccggca tggcactaac tcctgctgac 1140 tggcaaacga ttgtaaaagc cgctctccct agtatgggca aatatatgga atggagagcg 1200 ctttggcacg aagctgcaca agcgcaggcc cgagcaaacg cagctgcttt gactccagag 1260 cagagagatt ggacttttga cttgttaacg ggtcagggag cttattctgc tgatcagaca 1320 aactaccatt ggggagctta tgcccaaatt tcctccacgg ctattagggc ctggaaggcg 1380 ctctcccgag caggtgaagc cactgggcag ttaacaaaga taatccaggg acctcaggag 1440 tccttctcag attttgtggc cagaatgaca gaggcagcag agcgtatttt tggagagtca 1500 gagcaagccg cgcctctggt agaacagctc atctatgagc aagccacaaa ggagtgccga 1560 gcggccatag ccccaagaaa gaacaaaggc ttacaagact ggctcagggt ttgtcgagag 1620 cttgggggac ctctcagcaa tgcaggttta gcggctgcca tccttcaatc ccaaaaccgc 1680 tccatgggca gaaataatca gaggacatgt tttaactgcg gaaagcctgg gcattttaag 1740 aaagattgca gagctccaga taaacaggga gggactctca ctctttgctc taagtgtggc 1800 aagggttatc atagagccga ccagtgtcgc tctgtgaggg atataaaggg cagaatcctt 1860 cccccacctg atagtcaatc agcttatgtg ccaaaaaacg ggtcatcggg ccctcggtcc 1920 cagggccctc aaagatatgg gaaccggttt gtcaggaccc aggaagcagt cagagaggcg 1980 acccaggaag acccacaagg gtggacctgc gtgccgcctc cgacttccta ttaatgcctc 2040 aaatgagtat tcagccggtg ccagtggagc ctataccatc cttgcccccg ggaaccatgg 2100 gccttattct cggccggggt tcactcacct tacagggctt agtagtccac cctggagtta 2160 tggattgtca acattcccct gaaatacagg tcctgtgctc aagccctaaa ggcgtttttt 2220 ctattagtaa aggagatagg atagctcagc tgctgctcct ccctgataat accagggaga 2280 aatttgcagg acctgagata aagaaaatgg gctcctcagg aaatgattct gcctatttgg 2340 ttgtatcttt gaatgataga cctaagctcc gccttaagat caacggaaaa gagtttgaag 2400 gcatccttga taccggagca gataaaagta taatttctac acattggtgg cccaaagcat 2460 ggcccaccac agagtcatct cattcattac agggcctagg ttatcaatca tgtcccacta 2520 taagctccat tgccttgacg tgggaatcct ctgaagggca gcaagggaaa ttcatacctt 2580 atgtgctccc actcccggtt aacctctggg gaagggatat tatgcagcat ttgggcctta 2640 ttttgtccaa tgaaaacgcc 
ccatcgggag ggtattcagc taaagcaaaa aatatcatgg 2700 caaagatggg ttataaagaa ggaaaagggt taggacatca agaacaggga aggatagagc 2760 ccatctcacc taatggaaac caagacagac agggtctggg ttttccttag tggccattgg 2820 ggcagcacga cccataccat ggaaaacagg ggacccagtg tgggttcctc aatggcccct 2880 atcctctgaa aaactagaag ctgtgattca actggtagag gaacaattaa aactaggcca 2940 tattgaaccc tctacctcac cttggaatac tccaattttt gtaattaaga aaaagtcagg 3000 aaagtggaga ctgctccatg acctcagagc cattaatgag caaatgaact tatttggccc 3060 agtacagagg ggtctccctg tactttccgc cttaccacgt ggctggaatt taattattat 3120 agatattaaa gattgtttct tttctatacc tttgtgtcca agggataggc ccagatttgc 3180 ctttaccatc ccctctatta atcacatgga acctgataag aggtatcaat ggaaggtctt 3240 accacaggga atgtccaata gtcctactat gtgtcaactt tatgtgcaag aagctctttt 3300 gccagtgagg gaacaattcc cctctttaat tttgctcctt tacatggatg acatcctcct 3360 gtgccataaa gaccttacca tgctacaaaa ggcatatcct tttctactta aaactttaag 3420 tcagtggggt ttacagatag ccacagaaaa agtccaaatt tctgatacag gacaattctt 3480 gggctctgtg gtgtccccag ataagattgt gccccaaaag gtagagataa gaagagatca 3540 cctccatacc ttaaatgatt ttcaaaagct gttgggagat attaattggc tcagaccttt 3600 tttaaagatt ccttccgctg agttaaggcc tttgtttggt attttagaag gagatcctca 3660 tatctcctcc cctaggactc ttactctagc tgctaaccag gccttacaaa aggtggaaaa 3720 agccttacag aatgcacaat tacaacgtat tgaggattcg cagcctttca gtttgtgtgt 3780 ctttaagaca gcacaattgc caaccgcagt tttgtggcag aatgggccat tgttgtggat 3840 ccatccaaac gtatccccag ctaaaataat agattggtat cctgatgcaa ttgcacagct 3900 tgcccttaaa ggcctaaaag cagcaatcac ccactttggg cgaagtccat atcttttaat 3960 tgtaccttat accgctgcac aggttcaaac cttggcagcc acatctaatg attgggcagt 4020 tttagttacc tccttttcag gacaaataga taaccattat ccaaaacatc caattttaca 4080 gtttgcccaa aatcaatctg ttgtgtttcc acaaataaca gtaagaaacc cacttaaaaa 4140 tgggattgtg gtatatactg atggatcaaa aactggcata ggtgcctatg tggctaatgg 4200 taaagtggta tccaaacaat ataatgaaaa ttcacctcaa gtggtagaat gtttagtggt 4260 tttagaagtt ttaaaaacct ttttagaacc ccttaatatt gtgtcagatt cctgttatgt 4320 ggttaatgca gtaaatcttt tagaagtggc 
tggagtgatt aagccttcca gtagagttgc 4380 caatattttt cagcagatac aattagtttt gttatctaga agatttcctg tttatattac 4440 tcatgttaga gcccattcag gcctacctgg ccccatggct ctgggaaatg atttggcaga 4500 taaggccact aaagtggtgg ctgctgccct atcatccccg gtagaggctg caagaaattt 4560 tcataacaat tttcatgtga cggctgaaac attacgcagt cgtttctcct tgacaagaaa 4620 agaaggcccg tgacattgtt actcaatgtc aaagctgctg tgagttcttg ccagttcctc 4680 atgtgggaat taacccacgc ggtattcgac ctctacaggt ctggcaaatg gatgttacac 4740 atgtttcttc ctttggaaaa cttcaatatc tccatgtgtc cattgacaca tgttctggca 4800 tcatgtttgc ttctccgtta accggagaaa aagcctcaca tgtgattcaa cattgtcttg 4860 aggcatggag tgcttggggg aaacccaaac tccttaagac tgataatgga ccagcttata 4920 cgtctcaaaa attccagcag ttctgccgtc agatggacgt aacccacctg actggacttc 4980 catacaaccc tcaaggacag ggtattgttg agcgtgcgca tcgcaccctc aaagcctatc 5040 ttataaaaca gaagagggga acttttgaag agactttacc ccgagcacca agagtgtctg 5100 tgtctatggc actctttaca ctcaattttt taaatattga tgctcatggc catactgcgg 5160 ctgaacgtca ttgttcagag ccagataggc ccaatgagat ggttaaatgg aaaaatgtcc 5220 ttgataataa atggtatggc ccggatccta ttttgataag atccagggga gcggtctgtg 5280 ttttcccaca gaatgaagac aacccatttt ggataccaga aagactcacc cgaaaaatcc 5340 agactgacca agggaatact gatgtccctc gtcttggtga tgtccagggc gtcaataata 5400 aagagagagc agcgttgggg gataatgtcg acatttccac tcccaatgac ggtgatgtat 5460 aatgctcaag tattctcctg cttttttacc actaactagg aactgggttt ggccttgatt 5520 cagacagcct tggctctgtc tggacaggtc cagacgactg acaccattaa cactttgtca 5580 gcctcagtga ctacagtcat agataaacag gcctcagcta atgtcaagat acagggaggt 5640 ctcatgctgg ttaatcaact catagatctt gtccagatac aactagatgt attatggcaa 5700 atagctcagc tgggatgtga acaaaagttt ccgggattgt gtgttacttc cattcagtat 5760 gttaaattta ctagggcagc taatttgtca aaaagtcttt ttcagtatat gttacagaat 5820 tggacggctg aatttgaaca gatccttcgg gaattgagac ttcaggtcaa ctccacgcgc 5880 ttggacctgt ccctgaccaa aggattaccc aattggatct cctcagcatt ttccttcttt 5940 aaagaatggg tgggattgat attatttgga gatacacttt gctgtggatt agtgttgctt 6000 ctttgattgg tctgtaagct taaggcccaa actaggagag 
acaaggtggt tattgcccag 6060 gcgcttgcag gactagaaca tggagcttcc cctgataata tctatgctta ggcaataggt 6120 cgctggccac tcagctctta tatcccatga ggctagtctc attgcacggg atagagtgag 6180 tgtgcttcag cagcccgaga gagttgcacg gctaagcact gcagtagaag ggctctgcgg 6240 cataatatga gcctattcta gggagacatg tcatctttca agaaggttga gtgtccaagt 6300 gtccttctct ccaggcaaaa cgacacggga gcaggtcagg gttgctctgg gtaaaagcct 6360 gtgagcctaa gagctaatcc tgtacatggc tcctttacct acacactggg gatttgacct 6420 ctatctccac tctcattaat atgggtggcc tatttgctct tattaaaagg aaagggggag 6480 a 6481 // ID CAVID2A repbase; DNA; ROD; 84 BP. XX AC . XX DT 02-DEC-2009 (Rel. 15.03, Created) DT 02-DEC-2009 (Rel. 15.03, Last updated, Version 3) XX DE tRNA-derived SINE family: consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW CAVID2A. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-84 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 10(3), 500-500 (2010). XX DR [1] (Consensus) XX CC >92% identical to consensus. XX SQ Sequence 84 BP; 24 A; 19 C; 23 G; 18 T; 0 other; ggggctgggg atttagctca gtggcataag cacctgcctt gcaagcgtgc agtcatgagt 60 ttgatcccca gtaccaaaaa aaaa 84 // ID MERX repbase; DNA; ROD; 533 BP. XX AC . XX DT 24-JAN-2007 (Rel. 12.01, Created) DT 05-AUG-2007 (Rel. 12.01, Last updated, Version 2) XX DE Mammalian repeat, possible fragment of a LINE1 family, or SINE DE element. XX KW Transposable Element; MERX. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-533 RA Jurka J.; RT "Low-copy interspersed repeat from mammals."; RL Direct Submission to Repbase Update (24-JAN-2007). XX DR [1] (Consensus) XX CC Present in >200 copies in the human genome. Its small fragment CC shows weak similarity to L1MED_5. It is absent from opposum. CC Re-classified as Euterian. 
XX SQ Sequence 533 BP; 205 A; 100 C; 75 G; 151 T; 2 other; gaaartacag gcatccctta ttatctaaat ctaatccatt cctgaaaata tgttggatag 60 caaattttca gatagcagga gcctatattc cattatttta aatggagaaa ataatattgt 120 attcccagcc catccaaaaa ccccaaccaa tttccccaaa tatcactaaa acactcttaa 180 acacctatat aacaatccac caaggatttc tgcataatat aaagcattta aaacaccaat 240 tacctaatta tttaacactt aatgaactca ttagtgggta tgtttgtgtg cagtatgcaa 300 taatactgta cagtatcaga tatraattac attttctttc tacactgcaa tgaacatgta 360 ctgtaatgta aataaacaat gaaataatat attaaagata atatctaaat tggacccagg 420 tttagtagtg cccctaggta cctggcagaa gcaaacactg ggacccatga agggctaaac 480 cctcagggaa aaggtgggga agatgataaa aataatgcat aacatattta aac 533 // ID ERVB4_1B-I_MM repbase; DNA; ROD; 4288 BP. XX AC . XX DT 26-AUG-2008 (Rel. 13.08, Created) DT 26-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Mouse endogeneous betaretrovirus ERVB4_1B LTR subfamily (internal DE portion). XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; LTR; KW endogeneous betaretrovirus; MmERV-B4_AC102561; ERVB4_1-LTR_MM; KW ERVB4_1B-I_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-4288 RA Jurka J.; RT "Endogeneous betaretrovirus family from mouse."; RL Repbase Reports 8(8), 861-861 (2008). XX DR [1] (Consensus) XX CC This is a relatively young subfamily. 
XX SQ Sequence 4288 BP; 1374 A; 748 C; 845 G; 1317 T; 4 other; agtggcgccc gacgtgaggg cgaggtccgg atcgtaattc ctaatcagtg gaggttccag 60 agaagttcgt cgcgacccca agaatttaaa agtagtgaag gacaccttcc gctgctcacg 120 gaagagcgag aagtccttgg tgagttgagt catctccact tcaggttatg ggacataagc 180 tatctaaaga ggcagccttc atcaaaggtt taaagatagc tctcagagaa agaagagtac 240 gagttaaaaa aaaaaaagat tagatagact ttttattttc atagaccagg tatgtccatg 300 gtttattata gatgaagcag agatacgttg taaaaaatgg tgaaaggtag gtagagattt 360 aaatgataaa ctagctaatg agggtcccga tgtggtccct gcaactgtct tttctgtccc 420 gatgcagtcc ctacaaccgt tttttcttat tgaaaagtaa ctatcaaaag cagccattgt 480 acctcctctc ccttctctgg aagtattccc agaagaagga gataaggaag tagactctga 540 acatgagaga aagaaaataa gttttagaaa agcagttatc ccctgtttga gatcttttag 600 caaaaaaaga gaaaaaatga aaataagcta tttcagagct ctcctggaga cagagggaaa 660 gcgggagacg tcccgttttc tctctgcctc ctatagagat attcttagct ctggttaaaa 720 tatctgtgtc cctttgagac accaagttct attccctgtc tgtctattgt ctgtttttga 780 tgcatcaaaa caaattgtcc atatgtctgt caattcatgt ttgtttttgt tatgttgttt 840 aaatgattat tgttctgtgt ttcatgttga aaaatataaa tggttaaaac ttgatctgct 900 ggctgtccat cctttgattt agtttaactt gtttaaaaag cagtttcaaa attacacagc 960 acaagttgat aagttagcta cactgtggct gggagccaag tacagctgaa aaagccttgc 1020 tgagggcctg attgcaaacg gagctcttaa aggggcaggc agccttttgc ttgtaacaga 1080 aagttagctt aaaattggga actcaaggtt agaatctttc taaacaacat tgaacaaaca 1140 caagaaaaca ggtctttaag gatacttcaa catagagata gtctttgagc caagactctg 1200 gcagagtgtt tagattaaga aaaggtttgc atttgggtta aggctgagct cagaggggag 1260 cagcctcagc gcggcttgta tggcacttct tttagagcct tattacagac ctagccagat 1320 gttgttctaa ataaaattta aattcaaagt ttataaaagg tcaatcaggc tgtggaattt 1380 atcaagacat tgcgatttga cttgcctact ttatataaag ttatagtgta cagagtttgc 1440 ttatatgtat aagtgtctgt tccttgttcc aaacagccat taatttggtt attgcagaat 1500 attgatgttt gatctattgc ataccatcat ataaagtaaa attgataatc attatcctaa 1560 gatagtttaa aaaattattt ttatgcatgc ttgtatatct tctataaatt catgcatgca 1620 tgcatcccat ttactgtgtt taaaaaatag 
cctttatatg agagaaaagt atatgtgatt 1680 taaatcacat gattattctt ttgagtttcc tcctgtttca gcacagataa ttaaattgtg 1740 tgctaaagat ggtgtttaaa aatgttgaac aatcaaactt taagttgtat attaatagtc 1800 aatattaatt ctctgagctt gcaattgctt atgcaactta taagaaaaaa atactgttcc 1860 tttaagaaga ctgtttttag acagttaaga aattgtccta ggttgcctgg aaaagacccc 1920 caggatttgc ttttgtgtta tagaggttta cataaagctt taactgataa agattacaga 1980 tagctcctga aaagatacaa actcaggatc cttataatta tttgggtttt agacttactg 2040 atcaaactgt ttttccctag aagatagtta tttacagaga caacttaaag actttctctt 2100 atttctgatt aataaatata taaattatac acatgtgact ataataactt ttcaaacatt 2160 ctagttacaa tctctcttca agagaaacaa ttgaaagtgt aattagtcac tgtccatgtt 2220 atattaaaac cgactataac agtccagtat ttaagtggtt ttgtcaaaaa ttttttaatt 2280 ctcaaagaca aggtattata aaactttaaa actctatctt cttaaaaaca aaaaagggga 2340 aaaattatac ccccatgtac attatttaaa tcatactttt attgttaaga gttttaaawt 2400 ttagatgtct aagaacttaa taaatgcctc atagtatctt aaaactagac ataatcatgt 2460 ctaggtgaga tgaaaaggac ccactcattg acacatggca tgagcctaaa cagaaggtta 2520 ttatggggaa aagggggcgg tgtttctgtt ttatccacag aatgctgcaa aagcatgcta 2580 gcttccagaa tgattcgtgt gacagactga cctgtgagtt catgagtgcc ctggcagtga 2640 tgacaagaaa gctgtgttga gtgcacagaa aaagaagaat cagagagagc agacacttac 2700 aaacaagatg aagaaacttc ctatcttcgc catgggctac aacaatgaga gccgacctag 2760 tgtccacttg aggacaacta taaatgatga cccgtggggc agagagactg ctacagtgta 2820 tttaacaaga tttcatcaga gacactgttt ttggccattc aggccaactc ccctatagct 2880 gtraaagctt ttgtacctta cctatatgct atattgttag ataattttaa gagttaagat 2940 caccaatctt gacaagtctt ttcatattca ttgtaattgt tgtcaattaa ctaatcctgt 3000 ggatcctaat ttagataagg aatttgctgt tatgattttg ttaaagagac ctccttatgt 3060 tttgctgcct gttaaaatta ggaaatgatc cttagtttaa aaaatccaga aatacaaact 3120 tggaaaaggt caaatgagca atttttagat ttactgcttt agttgcaatt ttaacttcaa 3180 tatctaccac agctttagat caacagctgc atactgatca ttttgttaat gatatgcata 3240 agtaatatta gcaacttatt gtagataaaa aattagaggc aaaggttaat gtcttagaag 3300 tagtggtctc aacaataaga caggatataa caaatattaa 
ggctacaaag acagaaattt 3360 caggtattag aagaatgttg tctttactct gacaagactg gcttagttca agataacata 3420 gacaaagtta gagctagttt agaagagaga aaacgcaaca gagaaaaaca agaattttgg 3480 tataaaaatt ggttttctac ttctccttgg gtcactacct tgcttcccac acttttggga 3540 cctttcttgg gtattctrct gcttttgtct tttggcccct gggcctttaa aaaattaacc 3600 agttttgtta agtcacagat tgaagctgct ttaagtaagc cggttgcagt ccactaccat 3660 cagctggata tccgggactc agacgaagaa gatcctcctc ccaccgaggc agaaacarcg 3720 actcgtctcc agttttccac ccttgctgct aatgcagagt ccccttggtt cctcaggctt 3780 tggagacaat agggacgaag aagggtgacc tccagctcct attatagctt gagagccaca 3840 aattgtggtg ttacaattgg cataattctt acagaggcct tactgagtct gaggttgaca 3900 attctcctac gggagctgca caaaactcag aattgtgcat gctggttcgt gataaatacg 3960 aatccggtat gggcccagca tgggtgcaaa ctaagtattg caggggaagg tccaaataga 4020 ccgtttctga gttccctgag agacaaaccc ggttaggata gaggttggta tctaatttca 4080 gttacaatta ataatttttt caaaggctct gcctcacctc cccaaaagat accaagagcc 4140 acgagtgtgg gtctgtgaca gcacccacgg gaggaatcgg gtcaatgtcc ccccagccat 4200 ggtaatgccc gctcatctaa agatgaaaag attatttgat cacctcagtt aagtgtggcc 4260 ttattaaatt taattcagaa gggggaga 4288 // ID RMER20B repbase; DNA; ROD; 733 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; LTR; KW RMER20B. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-733 RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-733 RA Pavlicek A. and Jurka J.; RT "RMER20B - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (JAN-2004). 
XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Individual copies share 85-90% identity with CC the consensus. Related to RMER20 (84% identity). 6 bp TSDs. XX SQ Sequence 733 BP; 201 A; 237 C; 130 G; 165 T; 0 other; tgggaaaccc tgctacttaa ggtcaaagtc taccatgtcc acctggatag ctcctctgag 60 ctcggtgcaa aatggaggac tcccatcttt tctcagcagg gcttcctctt ctctgcctga 120 ctgaccttgt cgtgggggag cctgaccccc aacacacatc cctctacctg aggaaggtca 180 cgttagtcct tggcaaacct acaagcttca acttcctctt tcacccaata agaacctgcc 240 cagcaagtac ctgggaatca ttcctggaat gcctctccat acaaatgagg cattcacagt 300 actttaaact ccccagccaa tgattttaca ttcaccctga aaactccttc ccactctcca 360 aggttcatat atagcccttg ttcaccctca aataaagtgt atgtgcatca ccagctcaaa 420 caagtgagat gatttcactg aagatcatct atcagagagc tgtttcattg aatgtatcat 480 ctaaaaagag ctgtaacact aaaatcctta aagaagcctt cctcagagaa ggccttcccc 540 cccccaccac cccctaccaa aacaacaaac ctacccccca gcactcagca ctcccagctg 600 gaccaggatc ctgcagaccc cacccgttcc agactctgct tcatccttcc agcctgggtg 660 tgcagacagc atcctggaca ccctgaaggg aggaaacaga ggacgcggaa ccacagtacc 720 tggatcccca aca 733 // ID MER21B repbase; DNA; ROD; 863 BP. XX AC . XX DT 01-OCT-1995 (Rel. 1.09, Created) DT 16-JUN-2000 (Rel. 5.05, Last updated, Version 4) XX DE Long terminal repeat of retrovirus-like element - a consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW Long terminal repeat of retrovirus-like element; MER21B. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 19(17), 4731-4738 (1991). XX RN [2] RP 19-815 RA Smit A.F.; RT "Structure and evolution of mammalian interspersed repeats."; RL Ph.D. thesis (USC 1995). XX RN [3] RP 1-863 RA Smit A.F.; RT "Direct submission."; RL Direct Submission to Repbase Update (31-JAN-2000). 
XX DR [3] (Consensus) XX CC LTR of a class I retrovirus-like element. 4 bp target site duplications. CC Copies are on average 17% diverged from consensus. CC MER21B is a member of a closely interrelated group of LTRs further CC including MER34, MER39, LTR29, LTR48 and LTR49. XX SQ Sequence 863 BP; 203 A; 188 C; 234 G; 218 T; 20 other; tgtgatattg tgaaatatat atttggtctt cgnccccgtt tcctggcaca nagctcctaa 60 aacccttgga atctccngag tgataggagt ntctttgtgt gctaatgagn tgactgntgg 120 ctggcggccc ctaggtagct tcaggatggg ggctggtcac cagaaagacc aaggcangat 180 tagagggttg ggactttcag ccccaccccc caacctccag ggaggggaga ggggctgaag 240 gttgagttga tcaccaatgg ccaatgatnt aatcaatcat gcctacgtaa tgaagcctcc 300 ataaaaaccc aaaaggacng ggttcggaga gcttctggat agctgaacac gtggaggttc 360 ctggagggtg gcgngcccgg ggagggcacg gaagctctgc gccccttctc ccatacctcg 420 ccctatgcat ctcttcatct ggctgttcat ctgtatcctt tgtaatatcc tttataataa 480 acnggtaaac gtaagtaaag tgtttccctg agttctgtga gccgctctag caaattaatc 540 gaacccaagg agggggttgt gggaacccca atttatagcc ggtcggtcag aagcacaggt 600 nacaacctgg ngcttgcgac tggcatctga agtggggggc agtcttgtgg gactgagccc 660 tcaacctgtg ggatctgacg ctatctccag gtagatagtg tcagaattga attgaattag 720 aggacaccca gctggtgtcc gctgnagaat tgnttgcttg cttgtnngtg gggaaaaacc 780 cccacacatt tggtcacaga agtnttctgt gttgattgtn ntgagtgaga gaatagaaaa 840 aacactttgt ttgtgttttt cca 863 // ID LTR6E_Cpo repbase; DNA; ROD; 403 BP. XX AC . XX DT 31-JUL-2009 (Rel. 14.07, Created) DT 19-SEP-2009 (Rel. 14.07, Last updated, Version 3) XX DE Long terminal repeat of ERV3 endogenous retrovirus: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR5_CPo; KW LTR6E_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-403 RA Jurka J. and Baney O.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1546-1546 (2009).
XX DR [1] (Consensus) XX CC ~84% identical to consensus. Renamed from LTR5_Cpo. XX SQ Sequence 403 BP; 88 A; 95 C; 107 G; 113 T; 0 other; tgttatggct tgtgtctaga tgtcccccca aagcctcatg cggtcatagg tggggctttt 60 cagaggtggc tggatctaga gtgtgtgatg ctgggattga ttcatggtgc aatgctagga 120 tcaagggcac taggattaag ggtgtgagtg ctacccgccc tgagtagcct ggataagata 180 taagggacag aaagaggctg ttggtcctcc ttgcttgctt tgctgccttc tgcccgccat 240 gaactgtttc tcctctgcga tgcccctctg ccatgccacc ctgccttgga gccagccgat 300 tatggactga aacctctaca aactgtgagc taaataaacc tttcctcctt taactttggg 360 tgtcgggtat tttgtctcag caacgagaaa agtaactaag aca 403 // ID RLTR38B_MM repbase; DNA; ROD; 566 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR31A; KW RLTR38B_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-566 RA Pavlicek A. and Jurka J.; RT "RLTR38B_MM - a family of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Individual copies are ~93% identical to the CC consensus. 6 bp TSDs. RLTR31B in RepeatMasker. 
XX SQ Sequence 566 BP; 137 A; 91 C; 200 G; 138 T; 0 other; tgttgcagga ttttccctgt ccaatcacat tagggcagtg ggaggcttgt gattggacag 60 gagaagggag gcagagcgag gagttgagga gacagagaga gaggtctgag gaggagagag 120 agaaccagaa tggaggctga cgtggtgtta tcaaaaggtt agaataattg ggttaaagct 180 ttatcattat catttggctc tgaaattatt gtattggcat cttgtaaatt gtgttattat 240 tgatacataa atctgattgg ctaattaagc attaagagtc ttgattctac cgggtaatta 300 ggtgttgaga tggctaacca ggggtgcgtg gggcgttgcg tgaaagcgag aggaactcgg 360 gggggggggc ccatctgaga gatggcctgg tgggagccat gtggcctcgc agggctggcg 420 agttggcggg ccgagagacc gagcgagtga gatcacctgg cacaggggct agctctagag 480 agttgttggc ggtggcagga gcgcgggagc gtggcctggc cccgctgaga gttggcaggt 540 tcatttttta aatatttccc gcaaca 566 // ID ERVB5_1-I_MM repbase; DNA; ROD; 8027 BP. XX AC AC098708; XX DT 11-FEB-2004 (Rel. 9.01, Created) DT 11-FEB-2004 (Rel. 9.01, Last updated, Version 1) XX DE Mouse endogeneous beta retrovirus ERVB5_1, internal sequence. XX KW Endogenous Retrovirus; Transposable Element; ERVB5_1-I_MM; KW MmERV-B5_AC098708; endogeneous betaretrovirus; pol domain. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-8027 RA Baillie J.G., Van de lagemaat N.L., Baust C. and Mager L.D.; RT "Multiple groups of endogeneous betaretroviruses in mice,rats and RT other mammals."; RL J. Virol 78(11), 5784-5798 (2004). XX DR Genbank; AC098708; Positions 41954 52106. 
XX SQ Sequence 8027 BP; 2309 A; 2029 C; 1552 G; 2137 T; 0 other; tccctgtccc tgttgaccat cctgcgggtt gggacaaact ggcaccaaac agggaactcc 60 gggggtctaa gaggagagaa ctcactcatc attggagggt agacttagcc acttatctca 120 gtaagcacat gagagcagct taggttcggc atgacaccac tgtccagccc attgtgagac 180 agtcagggag atcatagtgc tcatatctta aggtaagaca tgggacaagc agtttctaaa 240 cagtcactct ttgtgctagg gcttagagtc cctcaagact cgaggaacta gggttaagaa 300 aaagtattta aagaaatttt tagagtttgt gggaggcatt tgtccatggt tcccccaaga 360 gggaacggtt gatgaaaaaa ggtggcgcat aattggcaat tgtttcaaag attattatga 420 agcatttgga cctacaaaaa tacctgtaag gcctttagct actggtacat cattaatgat 480 attcttagaa caaaccctga atggccagat ctccaacatt tggtgtccaa aggagagctg 540 tctcttaaag aatctatctc ccatgctcct tctgctaacc tagcaactca tacaccacct 600 ttgtcccctt taagaaaggg acacttgtcc gctgatggtg tttcagacct ccccacttta 660 ttgcccatta aggatatttc tttgacccca cctagggagg aaagtaaaac agcagaccct 720 tattcttctc tcattgatct tcatgacaat ctgctgccag ctgatgatgt aacccttaag 780 gaggaagcag ctagttatga atcagaccga tatccgatcc acccaaaatg gccacaattt 840 ccctagcctc agtccctcca ctgataggtt ctgcctcact cccatctttc tatcccccat 900 taccctgtgg gactttagat gcagggtttt tttttgttgt tgttgctgtt tttgttagat 960 gcagtttttg atccacttgc catttccggc acacaccacc agttaattgc caatgtggca 1020 tccctgcagc acatggtaga gacctgggag gaacaagtac aattgcttag agccctccag 1080 tcattagaga ccgaatttct tgatctaaca ccttccaacc aacaactgcc ttaggtctct 1140 gttaaaacaa aacaaaacaa aacaaaacaa aacaaaaaaa caaaaacaaa aaaaacaaac 1200 gaacaaaaaa acaacaacaa aaaaaaatca taaaaagagg gtcctagacc tccccctctc 1260 cctaaggaat gaaaccacag tccctaaaga ttccccaacc ccaaaaactg atttggagga 1320 aagccagaga gcgggtgaag aaaagagaaa aacagatgat gcgtttcaga ccaactcaca 1380 ggagatggag gatgcccctg cagcaccccc agtcaggcct gatgatgccc caccccgcta 1440 tcattatcaa ctgctagatt ttaagattct ggaaaaactc aaaggggcag tatctaatta 1500 tggccccact gcccccttca ccatggcact ccttgaatcc tctacagaga gatggctcat 1560 gcctaaagca tttttgcagc tcactcaagc cacactcact ggcggtgact ttgtattatg 1620 gaaatcagaa gcagcagaaa tggcaaaata 
catcaaaact aaatactgcg cgcaactaga 1680 cactaagtcg ttgaccatga agaagatttt gggcaaataa ccttataaca ccctggaagc 1740 ccaaatgctc ttccactttc tgcagtagta tccccatcct tgccagattc agagacctcc 1800 tgtcaagggc cagggagcta cccctcctgc atgggcacta ccgaccattc tgtgccccca 1860 ctgtcacagg gactgacaat gggccaatga gtgtagatcc aaaacagaca gccatggcat 1920 tctggaaact tcggggaggg gcagcctctg gcccctacct ccaggacagg ccccagggca 1980 atagggtttg tcccctagca catttctcat tacacaccca gtctattgct caactctggc 2040 gagctacaaa acacagccca ggcttggtcc tccagcacaa tactgacccc tgaggaggac 2100 attcaggcca tgcctacaag cataaattat cccctacccc aaaatacctt cagaatagtt 2160 ctagaaagag ctttactctc cttaaaggga ctccagatca tacctggaat tatagacccc 2220 aaccaccagg gagaaattca gattcttgaa aatattacag gtggtcatgt gtttatacca 2280 gcacaacaga ccattgccta attgttgccc tttcccaggg tctctactaa aaattcctac 2340 cataaagcca ccagggcccc aggcgacttg agaatagcaa aggacttttg ggcccaaaag 2400 attacttcct ctcgccccat gcttgcttat cttgagttaa aacaaacaaa caaaaaacaa 2460 acaaaaacaa caacaacaaa acatttcaag gattcttaga ttctggtgct gatgccactg 2520 tcatttctgg caggttttgg ccagctacct ggccgctgat caactcggcc actcatttac 2580 aggggattgg gcattccaga aaccctcaag tcagtgctta gaccctaggt ggacagatca 2640 agaaggaaat tcaggcactg tagtccccta tgtgatccac gatttacctg tcaatctctg 2700 ggggagagat attctctctc aattgaatgt cttcatgtgc agtcccaaca aagcagtgac 2760 aaaacaaatg ctggcccagg gattcgtgct aggtcaagga ttagggaaac aagcaatgac 2820 cagggtcata cctgctcccc agtctcacac agcgaaaatt tcctggaaag atactacccc 2880 tgtctgggtt gatcagtggc cactaactca agaaaaaaca gctgctgtga aacagttagg 2940 tatggaacaa ttagcagctg gtcacattgt gccatcttcc tccccctgga acaacccact 3000 tttggttatt aagaaaaagt ctggcaaggg gagactatta caagatctct atgcagttaa 3060 caaggttatg attcctatga gagccctgca gcctgctctc ccatctcctg ccaccatccc 3120 agcaaacctc tttaaaattg ttattgattt aaaagattgc ttttcaccat accttttcat 3180 ccagataatt gtcatcattt tgcatttagc cttcctcaaa ttaattacca gggacccaga 3240 gacccaggga cccagggacc cagggaccca gggacccagg gacccaggga cccagggacc 3300 cagggaccca gggacccagg gacccaggga cccagggacc 
cagggaccca gggacccagg 3360 gacccatcga cggctttcac tggtgagtcc tccctcaggg catgacaaat agccctaccc 3420 tggctcagaa atatgtagcc catgtaatcc aacctgtcag aagtgcctgg ccacaaatat 3480 acatcttaca ttatatggac aatattttgc ttgcagcccc taatagacaa caggccctct 3540 catgttttca gtaacttcaa gaggtgctaa gttcacaggg aattaaaata gtccccaaaa 3600 agattcagat aaaagaccct tattcctctc tgggttatga gctcgaattg ggacaggtcc 3660 gtaccccaaa gattgagctt cagctttcct cccttaagac tttacatgat tttcaacagt 3720 tgcttgggaa tctccagttt gttcacccct atctaaagat tcctcccgag gttctacttc 3780 ccctcaatgg actcctttca ggagactccc atcctttgtc ccctagagcc tttatgcccc 3840 aagccatttc agccctgtag cagataagtc aaaccatatc ttcccaaacc tcatttcaaa 3900 ttcattacac tgacccactt tattttattg tctgtgctac cacccatgct ccgggtggag 3960 tcttggcagc agccccaaac acctgccaga aaaggatgtc ctttgtcatg gttatatcgc 4020 ccctcaagtc caagtaaggt tttggcaaat tattattcct tgtctgccgt cctaattgtt 4080 agaggatgaa aaatgtccag acaatatttt gtcaaagatc ctgacatcat tattccttat 4140 acctcagatc aagttgaatg gcttttcaat ctaatgatga ttgggccata gcctgcacct 4200 catttgtggg catcatagac aatcattatc caaatgaccc cttatacaat ttgtaaaaat 4260 acactccttg actttctcaa agtcacctcc aagactctat tccttgaagg catcttagtg 4320 tttacagatg gttcctcaac agacaaaaac tgcctatgtt gtaccagatc gggttatttc 4380 agtgcagtcc ccatattcct ctgcccagct agtagaattg tttgccgtct ttcaggtttt 4440 tagaacatta ccaatgaccc catttaaatt gtatattgac agtgctatgt ggcccattct 4500 attcctgtct tagagaaggt tccatatatt aagcctgctt ctactgcttc caaattattt 4560 gtggagattc aatccctaat catcaaaagt acagtgccat ttttttgtgg gacactttga 4620 gaacccattc tgacctgggt ggtcctctgt cacatgaaaa tgccctggcc tacggggcca 4680 tttgtgcagc attcccctac ttgatttcat agatttggct aaatgggctc atgttctaca 4740 tcacttaaat gcctttacct tgtgacaaat gttcaagatt tctacagatc aagcaaaata 4800 acttgttaag tcctgtggtg gatacaccac tctgctgccg gttccttacc tgggggtcaa 4860 cactagagga ctcatcctta ataaactatg gcagatggat gttacccatt taccctcctt 4920 tggaaggtta aattatgtgc atgtcacatt tgatacctac agtggattta tatacatatc 4980 tccccttcct ggacgtgata acacatactc tacagtacat ggctgccatg 
ggaagaccac 5040 agatcattaa aactgtcaat ggccctggtt atacaggaat gaaattccaa cagttttgtt 5100 tccagtttga tatcaaacat atcactggtg ttccctataa tcctcagggc cagggcatta 5160 tagaataagc ccatcaaact taaaaaaaaa aaaaaaacac gcttcagcat ttgactgaag 5220 ttacagatgc ttgttccctc cttgacacct tgaaacaggc taaatcatgg tctttttttt 5280 tctctcaact tattgtccct ggacaatgaa ggtagtttgg cagctgacca cctgtggcat 5340 cccacatcac aggattctca gccatagtgt gatggagaga tcccctcact ggacaatgga 5400 aagaaaaaga tcccatcctt atttgcttgc atctatgaca aagagaatgc tagccaagat 5460 ggttaccaga gaggctagta aaaactgtta cctgcctgtg tcccgaatgg aacgaccttc 5520 tcctcctgag acaaatctca cttcttttac acatgttgct ccaggcaagg atactcatga 5580 tcctgtggct gactagaata aacaccaacc ttcaactgcc tccacccaaa tatcctaaca 5640 ataaacacat cccttggaat tacacttggg tcgtcctctc agagagaaat gaagttgtct 5700 gggccacttc caagatttcc accaatattt ggtggccagc cagccttacc cccatctttg 5760 tagactagca aagggagtgg cagccccttg gggtctcaaa tgccagctag atttaagtaa 5820 agccccagga taatgtgtgc aacacagagt tataatcctt acaatttatt gatttcggtt 5880 tgttggtcag tcaaacctgg ttgtgataat tccattaagg aaaccttact tcaggatgca 5940 gatttttaca cttgcccagg gcaccacaac atcatggctg ctccctccat cataagtaca 6000 gtgccaaagc tgatttttat tgtgccagct ggtgttgtga ggccttgggg gcagcctatt 6060 ggactcctac ccctcctgag attatattac tgtttcccag attttctcac catcccagaa 6120 ttctttaaat gtcaatcctc tctgtaaaaa ttctgaccat gttacctcca cctgtccatt 6180 acatttacag acgatgctaa gtccaaagac tgggtgcagg attctcatgg ggcctctgct 6240 ttattagtcg ggcagagata cgggcctaat ctttatcatt aacctcctca aagaatgcct 6300 tcatgccaag ttgccagttc cctttggtcc caaccctgtt ctagtcccct agccacagcc 6360 acccaaaaaa cttaaccctc catcttttac aaaaactatt cataccttta cctcagaaac 6420 ttcccttgct cccatagcca atcctccctc tccactcaga acaagagcca tatcctcacc 6480 ttagtgaatc aaacctttct tgcacttaac gccacctctc ctaaacttgt tgatgactgc 6540 tggctgtgtt accattccca accacctttt gatggaggca tcgatattcc tggctgtttt 6600 acccctaaaa aagattccca ccattgctac tgacaacccg gggagtggaa tggtctatcc 6660 ctctctcagg tctctggatt gggcctctgt gtgacttcta atacaccacc ttcacactat 
6720 gcccatcttt gtaatattac ccttgcatct gatattagta gccagtattt agttccccca 6780 aataatacct ggtgggtttg ccttgatgac ctcactccct gcttcctttt ctgcagaaaa 6840 ttctgatttt tgtagtcttg tacaattggt tcctcgattg atttactatc attcagaaga 6900 aatctttcat ttgtgtgatc aatccgctga tacctcccct tcactggtcc accaactaag 6960 agagccccta actgccatca ccatttcagt cttcctaggc cttggtgcca tgtgcatggg 7020 gactggaatt tcttcatatg ttctctccga gtctaggtat caataactcc atgtcaccat 7080 tgacagagac atccagaatc tccaacaagg catcaacgac ctctccaatt ccctctcctc 7140 cttaactgaa gtcatcctcc aaaatcgcag aggacttgat ttactctttt gacagcaggg 7200 atgaatgtgt gcaggtctta aaaaagaatg ttacttttac gtagacaaga cagggatggt 7260 caaagagagc atgagaaaag tcagagaagg cctagagaaa agacagagac gagagagaga 7320 gagagagaga gagagagaga gagagagaga gagagagaga gagagagaga gagaaaagaa 7380 gagagctggt ataaaaattg gttttcaacc tccctgtggt tggttgtcaa ccctactccc 7440 ttccgttttg ggactgttag ttgggttgtt tttgcttatt tttttttggt ccctgggcct 7500 ttaacagact ttccaacgtt gtaaaacaat agataggtaa tttggcagca aaacctatcc 7560 aagtatatta tcataagtta gctatggaga agcaagagat tcaaaatgag attgatattc 7620 catccagggt gttcctaaaa cccatcttca tccaacataa aaaaggggta ctcccaccct 7680 tggaccaagt agagagagac tattcagaac ccctctgact ggagaactaa gttattcctt 7740 ggactgtgag gcaaagtact gcaaggaaat ataaagcaca agctaaaaat atctttctat 7800 gttccctgag ggacaagcct gattgcatat gggttgatgt ttaaactatg ccctctctag 7860 gaaaaaagac aacaggatag gtatggccca catgcttcat tggacaacac caggaggacc 7920 aaagccatta caaggtcccc tttaatcact gaaggtaagg tctctatccc cactttgaaa 7980 aagattaaaa tcgccatggt tcttgtatat aaagaaaaag ggggaga 8027 // ID MARE1 repbase; DNA; ROD; 167 BP. XX AC . XX DT 26-JUN-2006 (Rel. 11.06, Created) DT 28-JUN-2006 (Rel. 11.06, Last updated, Version 3) XX DE Mammalian-wide repeat - consensus. XX KW MARE1; MARE2. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-167 RA Jurka J.; RT "MARE1: An ancient mammalian repetitive element."; RL Repbase Reports 6(6), 344-344 (2006). 
XX DR [1] (Consensus) XX CC This element is shared by all mammals including marsupials. It CC cannot be classified at this point. Its presence in two mosaic CC forms MARE1 and MARE2 is characteristic for SINE elements. Also CC MARE2 contains microsatellite-like repeats in its 3' end, CC characteristic for some SINE elements. MARE1 contains a 5' CC stem-loop like structure at positions 16-55. It is different from CC all currently known mammalian repeats and it is detectable in CC variable number of copies ranging from hundreds to as many as CC 3000 copies per mammalian genome. MAREs may be useful markers CC for mammalian phylogenetic studies. XX SQ Sequence 167 BP; 54 A; 33 C; 31 G; 49 T; 0 other; tacaggcagt ccccaactta caaatgggtt gtgttccaaa agttcatttg taagtcagtt 60 gtttggaact tagaatacat tttcccatag aaacaatgtt ataaatggtg gttaggttcc 120 caggccagcc cacaaaagcc tatttaaccc ataatgtagc tgaaata 167 // ID HAL1 repbase; DNA; ROD; 2463 BP. XX AC . XX DT 31-MAR-1998 (Rel. 6.4, Created) DT 09-JUN-1999 (Rel. 7, Last updated, Version 3) XX DE HAL1 repetitive element - a consensus sequence. XX KW HAL1; LINE1-like element. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 737-2463 RA Smit F.A.; RT "HAL1."; RL Direct Submission to Repbase Update (MAR-1998). XX RN [2] RP 1-736 RA Jurka J.; RT "HAL1."; RL Direct Submission to Repbase Update (JUN-1999). XX CC HAL1 resembles a "Half A LINE1" element, as it encodes a protein closely CC related to the ORF1 product of LINE1 (the LINE1 mRNA binding protein CC p40), but has no similarity to ORF2. Sequence 892 to 1576 is 68% similar CC to the ORF1 region of the old LINE1 subfamily L1ME. Sequence 1703 to CC 1835 is 76% similar to part of the coding region in the 8A-2B and 8A-2V CC mRNAs (GenBank entries MM8A2BGEN and MM8A2VGEN). Average divergence of CC copies from the consensus is 28%. The 5' end of HAL1 is probably CC incomplete.
XX SQ Sequence 2463 BP; 960 A; 430 C; 535 G; 501 T; 37 other; aagatggcag attgaacata tacatttatt tttctccctc ccaaaacccc actaaaatga 60 caataaagga attttttaaa agacataaac ccacaaggac aaagagaaca ggagaggaga 120 caatagcaac aaaattttgg aagctggaaa gcagatggat aagtggtaac tgacttagca 180 gacctgagaa agctgaatcc taagccagca gtggggaaag ccaagaagca acccaattta 240 caccgccaga atcctcccca aaaggctcag gaattggtgg caccaggtac ctctggaagt 300 gggggtgaag gtgaggctaa aaacagggag gattgattga aagtctgttt aagaagcagt 360 tagatcccca gattccctcc cccactctat ngcagccaga taactgatcc tcctccaccc 420 tagcagaaga ctggaggttt attctctgga gaggataaaa caaaggggtc tctggactgg 480 ggaacaccag gcacagttga gggcagaggg gnactgtact gaaaacaggg gaattaagtg 540 aaagtttaca tactgaatgt tgagactccc agccctcttc ccccattcag ctcccagaat 600 gctggcagcc aggcctatac cctccagaca ggagattaga agagntcttc tctggagaat 660 ctgaccagcc caagagaaaa gacttaaaaa tactgataat agaggttccc caaacaaaat 720 agcccagcca gatcacccct atagtgaagc ccacmacctg gcaagcccca cccacgcact 780 cagagcttcc aagtcgcttt ttagtstccc actcttaaat atgagcagac agccaaggat 840 caccagacat ctgaggaaan cctctaatat ggcagacaga naaataaaaa cagagaaaaa 900 cawwntntat ccatgaaaca agaayaggat gctataaaaa ggaacattca gagaacaaaa 960 agaagctctt ggaaattaaa aacgtgagag cagaaattaa aatttcaata gaagggttgg 1020 aagataaagt tgaggaaatc tcccagaaag tagaacaaac aaaggataaa gatacggaaa 1080 ataggagaga aaagaattaa aaaaattgag gatcagtcca ggaggtccaa catccaacta 1140 atagaagttc cagaaagaga gaacagagaa aaagatggaa gagaaattat caaagaaata 1200 attcaagaaa atttcccaga actgaaggac atgaatttcc agattgaaag gncccaccga 1260 gtgcttaccg caaaaatgaa ggaaaaaaat agacccacac caaggcacat cattgtgaaa 1320 tttcagaaca ctgaganaaa aggaaaatcc caaaagcttc cagagagaaa aaaaggtcac 1380 atacaaagga tnagaatcag aatggcatya gacttctcaa cagcaacact ggaagctaga 1440 agacaatgga gcaatgcctt caaaattctg agggaaaatg atttycaacc tagaattcta 1500 tacccagcca aactatcaat caagtgtgag ggtagaataa agacattttc agacatgcaa 1560 gatctcaaaa aatttacctc ccatgcaccc tttctcagga agctactgga ggatgtgctc 1620 caccaaaacg agggagtaaa ccaagaaaga 
ggaagacatg ggatccagga aacaggggat 1680 ccaacacagg agagaggtaa agggaattcc caggatgatg gtgaagggaa attccaggat 1740 gacagctgtg cagcaggcct agagagcaac cagtccagat tggagcagga agatggaagg 1800 ctccaggagg aatgttctca agaaaataga anacaaatga aactgataga ttatctgatg 1860 tgtttgaata tattgagagg ntatnattta gwcatgtgaa agtttggggg agaattgaat 1920 tagtgatagg tacanagaaa actaagcaaa tgaaaaaaag caagacaatt attaactcca 1980 gggaaaacaa aaagttgtac aagaaaggaa atgtaatcat agtacactac atggctcagc 2040 tgtgaataac atatttacat agtcataata atgtaaacac traatactga tttaaccaaa 2100 attatgatat aaactatatt gggaggataa gaaatggaaa gggtgtgcgt atkgtgtang 2160 nggaatgara gaantaaatc ctcatcttcc atagtaggaa gtcartagat aatgtctaaa 2220 actgaaaaat caagaaatag caatataagc atgttattta gaaatatgga ggtaaatatt 2280 ccaaaagaat cagctaaaag agttgaaagt ggttgcctct ggggaaggag ctgnaggtgg 2340 tgaagagngn agggacgggg aaytgctgtt ttttataagc cttttagtac tatttgacct 2400 tttaaactat atacatrtat tactttaatt taaaaaanta anttataaaa aagnaaataa 2460 gta 2463 // ID RLTR15B_MM repbase; DNA; ROD; 824 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE Long Terminal Repeat from an endogenous retrovirus - a consensus. XX KW LTR Retrotransposon; Transposable Element; RLTR15B_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-824 RA Jurka J.; RT "RLTR15B_MM: putative LTR from mouse."; RL Repbase Reports 2(4), 25-25 (2002). XX DR [1] (Consensus) XX CC See commentary to RLTR15A_MM. 
XX SQ Sequence 824 BP; 244 A; 132 C; 215 G; 228 T; 5 other; tgctgagtcc tgcatggcca tggaagggat ggtgtgacat ggttcctrcc cctggaacag 60 agccagcagg gtgtgggcct ccacgagtgt tgaagtgttg atggttgctg tgggyataag 120 gcttgtttgt ataataatgt acatattttc atgttcctcc ttcgaggaat gtcctctctg 180 ccaaggttaa tgactccatg ataattagag acagcatgag tccaggagkt gagggtgtgt 240 ctattgtctc agcaaaatgg taaaatgctg tgaccttcag gacagccctt aaggctgtgg 300 gaaagaactc tgaaaacatg agttcaaaaa tatataattt ctcaactatg caaaatataa 360 ggatgcaata tgaattgtat gaggagcttc atagatctaa aagaacagag gcagctgcac 420 tatgagccag cttgtcagaa agatactaag gaaggagata aagagattta gggagtggta 480 atctcaggag atcccaccca gcctaagttt gtttgttgtt gtgcttacaa tttaggctca 540 ggagtggtng aaatggtaat ttcagggcag ggtcccaccc agctaggytt attgtctgtg 600 cttaacaaag gcaggcagat ctctgaattc ttttgcaatg tttaaaaaaa aaatgtgtgt 660 gttaaaaaaa atgtgcttgc tgtctctttc taagaatcaa ggggctgggg tcatgggatg 720 ctgattcatg ggataatcaa aagggaacct ggagtaaagt aaatgactga attgatatgt 780 aaataaaaga ctgggctttt ggtctgcaag agatgagcta tcca 824 // ID LTR1_CPo repbase; DNA; ROD; 411 BP. XX AC . XX DT 17-JUN-2009 (Rel. 14.06, Created) DT 17-JUN-2009 (Rel. 14.06, Last updated, Version 2) XX DE Long terminal repeat of ERV1 endogenous retrovirus: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR1_CPo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-411 RA Jurka J.; RT "Long terminal repeats from guinea pig."; RL Repbase Reports 9(6), 1259-1259 (2009). XX DR [1] (Consensus) XX CC ~94% identical to consensus. 
XX SQ Sequence 411 BP; 99 A; 123 C; 90 G; 99 T; 0 other; tgaaagagtt aactttacag ctgcaacccc cgagctatct gtacaaaaca agaagggact 60 ttccgttggt aaaaccgcag aatgttctgt ctcagtgagt taggcaagat aagacaactg 120 ttccgggaac acagtgaccg ggctacgacc cccggacacc aaagttaccc aaccctgagg 180 ccctccaatc agctcctgcc aagccctgcg ccgttcaaaa ctaaccaatg cgatctgctt 240 ctgtaacctc gcctgctttg cggtttatgc ctttaaaaac cctgtgtaac ttcccttcgg 300 ggtcctccgc actagtttgc tggacggacc ccatgcgcat ggaaataaag cttttttccc 360 cttagagacg tgggtccttg gggtcttctt ccctgcgaca ccggccttac a 411 // ID RLTR39_MM repbase; DNA; ROD; 575 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 18-SEP-2008 (Rel. 9, Last updated, Version 2) XX DE Mouse long terminal repeat - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW RLTR39_MM; RLTR31_Mur. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-575 RA Pavlicek A. and Jurka J.; RT "RLTR39_MM - a family of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Similar to RLTR22_MM. Individual copies are CC ~91% identical to the consensus. RLTR31_Mur in RepeatMasker. 
XX SQ Sequence 575 BP; 174 A; 88 C; 182 G; 131 T; 0 other; tgtggattgc tgtctatatg caggtgaggg caggtaaaag ataaggctag agcctgtgat 60 tgggcagtgg aaaaagaagg cggggctgag agttttagag acaggacaga gaagggacag 120 agaaggacag aaggacagag acaagatgga ggaagaagag gacgaaccag atccacatgg 180 ctttaaatag ccacaggtag ctatgaatat catagaaggg caatagaata atataggaca 240 atttgtccaa tctaggtggg cagcttgtat caatatcaat tgagctctga gttcattgtg 300 tgggcatttt gtgggttgag aatttactga tataaatctg actgataaat tacaagcctc 360 tagagttttg attttactgg gttacgggga tttgtgacag ctagccacag ggggcagatg 420 gctgggaatg ttgagcaggt tccagcagca agcgaaccgc gagaagttgg gctgaggccg 480 ccgcatgggg atagccatgg gggcggggag accgccgtta ctagagcgta gccggcaata 540 gcgtggattg attttttaaa taattacatg caaca 575 // ID L1MC2 repbase; DNA; ROD; 1077 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1MC2) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; MER16; L1MC2 subfamily; KW L1MC2. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 869-1017 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [2] RP 1-1077 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX RN [3] RP 1-1077 RA Smit F.A.; RT "L1MC2."; RL Direct Submission to Repbase Update (1996). XX DR [3] (Consensus) XX CC Replaces MER16 (Acc. No. X59020) CC ORF2 ends at bp 675; average divergence of copies from consensus: CC 18%. 
XX SQ Sequence 1077 BP; 423 A; 176 C; 194 G; 268 T; 16 other; cttgtatcca gaatatataa agaactctta aaactcaaca ataaaaaaac aaacaaccca 60 attaaaaaat gggcaaaaga tctgaataga catctcacca aagaagatat acagatggca 120 aataagcaca tgaaaagatg ctcaacatca tatgtcatta gggaaatgca aattaaaaca 180 acaatgagat accactacac acctattaga atggctaaaa tccaaaacac tgacaacacc 240 aaatgttggc gaggatgtgg agcaacagga actctcattc attgctggtg ggaatgcaaa 300 atggtacagc cactttggaa gacagtttgg cagtttctta yaaarctaaa catactctta 360 ccatatgatc cagcaatcay actccttggt atttacccaa atgaattgaa aacttatgtc 420 cacacaaaaa cctgcacacg aatgtttata gcagctttat tcataattgc caaaacttgg 480 aaacaaccaa gatgtccttc aataggtgaa tggataaaca aactgtggta catccataca 540 atggaatatt attcagcgat aaaaagaaat gaactatcga gacatgaaaa gacatggagg 600 aaccttaaat gcatattgct aagtgaaaga agccagtctg aaaaggctac atactgtatg 660 attccaacta tatgacattc tgaaaaaggn aaaactatgg agacaagtaa aaagatcagw 720 gattnctaga gkgastrgga rrgggaggga agaataggtg gagcncaggg gatttttagg 780 gcagcgaaac tattctgtat gatactataa tggtggatac atgacattat acatttgtca 840 aaayccatag aactgtacaa yacaaagagt gaaccctaat gtaaactatg gactttagtt 900 aataataatg tatcaatgtt ggttcatcaa ttgtaacaaa tgtaccacat taatgcaaga 960 tgttaataat agggtaaact rttgtgtggg ggagggagta tatgggaact ctctgtactt 1020 tctgctcaat ttttctgtaa acctaaaact gytctaaaaa ataaagtcta ttaataa 1077 // ID L1MA2 repbase; DNA; ROD; 1055 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1MA2) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; MER14; L1MA2 subfamily; KW L1MA2. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1055 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [2] RP 1-1055 RA Smit F.A., Toth G., Riggs D.A. 
and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [2] (Consensus) XX CC ORF2 ends at bp 684; average divergence of copies from consensus: CC 11% CC Replaces MER14 (Acc. No. X59018). XX SQ Sequence 1055 BP; 423 A; 156 C; 192 G; 247 T; 37 other; ytaatatcca gaatmtataa ggaactcaaa caaatcaaca agaaaaaaac aaataatccm 60 attaaaaart gggcaaaaga catgaayaga catttctcaa aagaagacat acaaatggcc 120 aacargyata tgaaaaaatg ctcaacatca ctaatcatca gagaaatgca aatcaaaacy 180 acaatgagat atcatctcac cccagttara atggctttta ttaaaaagac aaaaaataac 240 aratgctggy gaggatgtgg agaaaaggga actcttayay actgttggtg ggaatgtaaa 300 ttagtacarc caytatggaa aacagtttgg agrttyctca aaaaactaaa aatagarcta 360 ccatatratc cagcaatccc actrctgggt atatayccaa aagaaaggaa atcagtatat 420 yaaaragata cctgcactcc catgtttatt gcagcactat tcacaatagc haagatttgg 480 aatcaaccta agtgtccatc aacrgatgaa tggataaaga aaatgtggta catatacaca 540 atggagtact attcagccat aaaaaagaat garatcytgt catttgcagc aacatggatg 600 gaactggagg tcattatgtt aagtgaaata agycargmac araaagacaa acaytgcatg 660 ttctcactta tttgtgggat ctaaaaatca aaacaattga actcatggag atagagagta 720 gaaggatggt taccagaggc tgggaagggt agtnggagga ttggggggng gkgrrgaggg 780 atggttaatg ggtacaaaaa aatagttaga aagaatgaat aagacctagt atttgatagc 840 acaacaaggt gactatagtc aataataatt taattgtaca ttttaaaata actaaaagag 900 tataattgga ttgtttgtaa cacaaaggat aaatgcttga ggggatggat accccatttt 960 ccatgatgtg attattacac attgcgtgcc tgtatcaaaa catctcatgt accccataaa 1020 tatatacacc tactatgtac ccacaaaaat taaaa 1055 // ID MSTC repbase; DNA; ROD; 405 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 23-JAN-1998 (Rel. 6.4, Last updated, Version 3) XX DE Transposon-like human element long terminal repeat (MSTc DE subfamily) - a consensus. XX KW Non-LTR retrotransposon; MaLR family; MSTc subfamily; MSTC. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. 
XX RN [1] RP 1-405 RA Smit F.A.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21(8), 1863-1872 (1993). XX DR [1] (Consensus) XX SQ Sequence 405 BP; 102 A; 95 C; 88 G; 111 T; 9 other; tgctacggtt tgaatgtttg tcccctccaa aactcatgtt gaaacttaat ccccaatgtg 60 gcagtattga gagrtggggc ctttragagg tgattgggtc atgagggctc tgccctcatg 120 aatggattaa tggattaacg tattaatgga ttaattggtt atcacgggag tgggatcggt 180 ggctctatca taaaarycat tttgnctctc gnrtgngccc cttcttgccc tttcacgcct 240 tccgccatgt tatgacagag cacaaggccc tcaccagaaa ccagatgcag ccgccatgat 300 cttggacttc ccagcctsca gaaccgtgag caaaataaac ctcttttctt tataaattac 360 ccagtctatg gtattccgtt aaagcagcac gaacggacta agaca 405 // ID STRIDE1 repbase; DNA; ROD; 210 BP. XX AC . XX DT 16-OCT-2009 (Rel. 14.12, Created) DT 16-OCT-2009 (Rel. 14.12, Last updated, Version 2) XX DE SINE element - consensus. XX KW SINE1/7SL; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; STRIDE1. XX OS Spermophilus tridecemlineatus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Sciuridae; Xerinae; Marmotini; Spermophilus. XX RN [1] RP 1-210 RA Jurka J.; RT "SINE elements from the thirteen-lined ground squirrel."; RL Repbase Reports 9(12), 3128-3128 (2009). XX DR [1] (Consensus) XX CC Its youngest sequences are ~92% identical to consensus. It CC includes subfamilies. CC This sequence was derived from sequence data generated by Broad CC Institute Mammalian Genome Project. XX SQ Sequence 210 BP; 56 A; 58 C; 63 G; 32 T; 1 other; gccgggcgcg gtggcgcacg cctgtaatcc cagcggctcg ggaggctgag gcaggaggat 60 cgcgagttca aagccagcct cagcaanagc gaggcgctaa gcaactcagt gagaccctgt 120 ctctaaataa aatacaaaat agggctgggg atgtggctca gtggtcgagt gcccctgagt 180 tcaatccccg gtacccccca caaaaaaaaa 210 // ID RLTR46 repbase; DNA; ROD; 321 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 24-AUG-2008 (Rel. 
9, Last updated, Version 2) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR41_MM; KW RLTR46. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-321 RA Pavlicek A. and Jurka J.; RT "RLTR46 - a family of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Similar to IAPLTR3. Individual sequences are CC ~90% identical to the consensus. XX SQ Sequence 321 BP; 76 A; 80 C; 92 G; 71 T; 2 other; tgtggagagc cgcgataaca tttgccatca caagatggcg ccggcttccg cagtgcctta 60 tgccacctaa acaaagaaca agctgtggtg cgcatgtgct aagagtaatg ttcgcgccaa 120 gtcataagcc caccccgggg cgtgtcaatg agatcgtggg taagcgacca gtcaggcgtg 180 gacacgccac gctagggtgt atataagcag cgcctttctg aggctctttg tcttcctcat 240 caatatgcaa taaacgattk gctgcagaag gatcctggtg ttccgtgygc gttcttgctg 300 gcgaggaaat agcgcgggac a 321 // ID RNSAT1c repbase; DNA; ROD; 168 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 29-MAR-2010 (Rel. 13.07, Last updated, Version 3) XX DE Satellite from rat. XX KW Satellite; Simple Repeat; RNSAT1c. XX OS Rattus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-168 RA Smit A.F.; RT "RNSAT1c_ - Satellite from rat."; RL Direct Submission to Repbase Update (06-SEP-2005). 
XX DR [1] (Consensus) XX SQ Sequence 168 BP; 64 A; 22 C; 40 G; 42 T; 0 other; taaagccttt gtaggtcaga atgaccttaa aaggcatgaa agaattcata ctggagagaa 60 accttacgaa tgtaatgaat gtggtaaagc ctttgtaggt cagaatgacc ttaaaaggca 120 tgaaagaatt catactggag agaaacctta cgaatgtaat gaatgtgg 168 // ID ERV2B1-CPo_LTR repbase; DNA; ROD; 327 BP. XX AC . XX DT 21-OCT-2009 (Rel. 14.11, Created) DT 21-OCT-2009 (Rel. 14.11, Last updated, Version 3) XX DE Long terminal repeat of ERV2 endogenous retrovirus: consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW ERV2B1-CPo_LTR. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-327 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from guinea pig."; RL Repbase Reports 9(11), 2871-2871 (2009). XX DR [1] (Consensus) XX CC ~93% identical to consensus. 6bp tsd. XX SQ Sequence 327 BP; 53 A; 123 C; 70 G; 81 T; 0 other; tgtagggagc ggcgcgaccg ccccgctgct gtctccccgc ctctccctta tatgggcaca 60 tagtgcgcct acatcaacca tgatcctctc ctgctcacgt cacgtacacg gacgtgacga 120 tgacccatca gagatcaccc cgtagcctct tcctgtccca ccggcccgcc cttgccctat 180 aaaagctgcg accacttcct caataaatga gacttgattg gacttcctca acttgtctcc 240 gtctctcttt gtctcttgcc ctgcctttgt caatccccgt tcctcctcca ggtggttgcc 300 ccgcggattg acccgcgggc cgggtca 327 // ID MARINER2 repbase; DNA; ROD; 1300 BP. XX AC U49974; XX DT 19-FEB-1997 (Rel. 2.01, Created) DT 19-FEB-1997 (Rel. 2.01, Last updated, Version 1) XX DE Mariner2 transposable element, complete consensus sequence. XX KW Mariner/Tc1; DNA transposon; Transposable Element; transposase; KW MARINER2; terminal inverted repeats; mariner. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RA Oosumi T., Belknap R.W. and Garlick B.; RT "Mariner transposons in humans."; RL Nature 378, 672-672 (1995). 
XX RN [2] RP 1-1300 RA Robertson M.H., Zumpano L.K., Lohe R.A. and Hartl L.D.; RT "Reconstructing the ancient mariners of humans."; RL Nature Genet 12, 360-361 (1996). XX DR GenBank; U49974; Positions 1 1300. XX CC repeat_region 1..31 CC repeat_region 1269..1300 CC /rpt_type=terminal inverted repeat CC CDS 182..1237 CC /codon_start=1 CC /product="mariner transposase". XX SQ Sequence 1300 BP; 441 A; 241 C; 258 G; 360 T; 0 other; caaggggtct tcaaaaagtt catggaaaat gcgtattatg aaaaaactat gcatggattt 60 caaaattttt tgcaccaaaa taaactcata ctaacttgtt ataacatgtc tgaacaggat 120 ctagtttgag gcactaagaa ggataagaca tcagtttgaa aagagcccct atcagagcaa 180 catgaattct gctaaaattg aagcaagaac aaacatcaaa tttatggtga agcttgggtg 240 gaagaatggt gaaatcattg atgctttacg aaaagtttat ggggacaatg ccccaaagaa 300 atcagcagtt tacaaatgga taactcgttt taagaaggga caagacaatg ttgaagatga 360 agcccacagt ggcagaccat ccacatcaat ttgtgaggaa aaaattaatc ttgttcatgc 420 cctaattgaa gaggaccaac aattaacagc agaaacaata gccaacacca tagacatctc 480 aattggttca gcttacacaa ttctgactga aaaattaaag ttgagcaaac tttccactca 540 atgggtgcca aaaccattgc acccagatca gctgcagaca agagcagagc tttcaatgga 600 aattttaaac aagtgggatc aagatcctga agcatttctt cgaagaattg taacaggaga 660 tgaaacatgg ctttaccagt acaatcctga agacaaagca caatcaaagc aatggctacc 720 aagaggtgga agtggtccag tcaaagcaaa agtggactgg tcaagagcaa aggtcatggc 780 aacagttttt tgggatgctc aaggcatttt gcttgttgac tttctggagg gccaaagaat 840 gataacatct gcttattatg agagtgtttt gagaaagtta gccaaagctt tagcagaaaa 900 acgcccggga aagcttcacc agagagtcct tctccaccat gacaatgctc ctgctcattc 960 ctctcatcaa acaagggcaa ttttgcgaga gtttcgatgg gaaatcatta ggcatccacc 1020 ttacagtcct gatttggctc cttctgactt ctttttgttt cctaatctta aaaagatctt 1080 taaagggcac ccatttttct tagttaataa tgtaaaaaag actgcattga catggttaaa 1140 ttcccaggac cctcagttct ttagggatgg actaaatggc tggtatcatc gcttacaaaa 1200 gtgtcttgaa cttgatggag cttatgttga gaaataaagt ttatattttt tatttttatc 1260 ttttaattcc attttccatg aactttttga agtcccctta 1300 // ID ERV1A1-CPo_LTR repbase; DNA; ROD; 446 
BP. XX AC . XX DT 21-OCT-2009 (Rel. 14.11, Created) DT 21-OCT-2009 (Rel. 14.11, Last updated, Version 3) XX DE Long terminal repeat of ERV1 endogenous retrovirus: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW ERV1A1-CPo_LTR. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-446 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from guinea pig."; RL Repbase Reports 9(11), 2869-2869 (2009). XX DR [1] (Consensus) XX CC >98% identical to consensus. 4bp tsd. XX SQ Sequence 446 BP; 118 A; 138 C; 85 G; 105 T; 0 other; tgaaagggtt aactccctaa ctccccagag ctatcagagc agagcagaaa ggaatgccta 60 actccccaga gctatcagaa ccgagcaaaa aaaaaaaatg cacctgcaag gttgttctgt 120 ctcaacgaga taagtaagac aagtaactgt tctgttctgg gcacatagtg actgggctag 180 gacccccagt ctccatggtt actcaacccc aaacccctcc aatcaacttc tgccaagtcc 240 tatcctgctc aaaattaacc aatgcaatct gcttctgtaa actcgcttgc ctcccgtttg 300 cgcccttaaa aaccctacac aattccccct cggggccctc cgcacttgtt tgcctggggg 360 accccatgcg catggaaata aactttttac tcctcactac gagctgagtc cttggggtct 420 tctcccctgc gacgccggtt cttaca 446 // ID LINE2 repbase; DNA; ROD; 2750 BP. XX AC . XX DT 18-APR-1997 (Rel. 6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE MIR2/LINE2 repetitive element - a consensus. XX KW LINE; L2 family; MIR2; MIR2/LINE2; LINE2. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-2750 RA Degen J.S. and Davie W.E.; RT "Nucleotide sequence of the gene for human prothrombin."; RL Biochemistry 26, 6165-6177 (1987). XX RN [2] RP 2601-2750 RA Smit F.A. and Riggs D.A.; RT "MIRs are classic tRNA-derived SINEs that amplified before the RT mammalian radiation."; RL Nucl. Acids. Res 23, 98-102 (1995). XX RN [3] RP 1-2750 RA Smit F.A.; RT "LINE2."; RL Direct Submission to Repbase Update (1996). 
XX DR [3] (Consensus) XX CC 24 bp upstream of NcoI site; chromosome 11p11-q12. CC This sequence represents the 3' part of an ancient LINE-like CC element CC which was responsible for the amplification of the MIR elements. XX SQ Sequence 2750 BP; 593 A; 941 C; 336 G; 815 T; 65 other; ctgtcattca ttaccgntcc ttgttgcagt catccatcga gcccggctcg gtcactccct 60 ctcatttctc gaagaktttr tctcctagnt caactgtcac tctctcataa taatantcct 120 gtcataattc ttggtgattt caatatccac atagacgatc catccaatac tctggcctct 180 cagttccttg acctcctctc tcccgtggtc ttgttctcct ctacccactc tcagccactc 240 actcccatgg tcatacccta gatcttgtca ttactaataa ctgcaactcc tccataatct 300 caatttcaag catccyactc ttttaccacy acctcctatc tttctagctc actctctctg 360 gtgccctaac tccaataaty ctttgacccc accgagacct mcaatccatt gatctcatat 420 gcttttcact gtnccgtgtc ctcttccctc ctttcwttgc tcagattcca tggtcaatca 480 ttataatcac tcccwtacan ataccctcaa ctcccttgcc cctytctcmc tttgtcttac 540 ttrcttgnca aaaccacaac cctggttaaa tccaactntc cgactaattt rcgcctgcac 600 ccgngcagct aaacatggct gragaaaaat acacaaccat gctgactggt ctcactttaa 660 atttatgacc acgaacctca agtgrgccct taatgctgcc aggcaattnt actacatttc 720 cctagtccat tcactctccc actctcctag atgactattt yatactttct tnyttctcaa 780 acctccaaca yyyyytncca tctttactct cagctgatga ccttgcttcc tatttcactg 840 agaaaataga agcaatcaga agagaatttc cacatgctcc caccaccaca tctacccacc 900 tacctgcatc tgtacccata tactctgcct tccctcctgt taccgtggat gaactgtccg 960 tgctcctatc taargccaac ccctccactt gtgcactaga tcccatcccc tcttgcctac 1020 tcaaggacgt tgctccagca attctcccct ctctctcctg catcatcaat ttttccctct 1080 ctactggatc attcccatca gcatataaac atgctgnnat ttctcccatc tttaaaaaac 1140 aaaaattctc cctcgacccc acttccccct ccagctaccg ccctatttct ctgctcccct 1200 ttacagcaaa acttctcaga agagttgtct atactcgttg tctccacttc ctcacctccc 1260 gttctctctt aaacccactc caatcaggct ttcgtcccta ccactccact gaaactgctc 1320 ttgtcaaggt caccaatgac ctccacgttg ccaaatccag tggtcagttc tcagtcttca 1380 tcttacttga cctctcagca gcatttgaca cagttgatca ctccctcctt cttgaaacac 1440 tttcttcact tggcttccag 
gacaccacac tctcttggtt ttcctcctac ctcactggcc 1500 gttccttctc agtctccttt gctggytcct cctcatctcc ccgatctcta aatattggag 1560 tgccccaggg ctcagtcctt ggacctcttc tcttctctat ctacactcac tccctgggtg 1620 atctcatcca gtcycatggc tttaaatacc atctatacgc tgatgactcc caaatttata 1680 tctccagccc agacctctcc cctgaaytcc agactcctat atccaactgc ctactcgaca 1740 tttccatttg gatgtctaac agrcatctca aayttaacat gtccaaaact gaactcctga 1800 ttttcccccc caaacctgct yctcccgcag tcttycccat ctcagttaat ggcaactcca 1860 tccttccagt tgctcargcc aaaaaccttg gagtcatcct tgattcctct ctttctctca 1920 caccccacat ccaatcyatc agcaaatcct gttggctcta ccttcaaaat atatccagaa 1980 tccgaccact tctcaccatc tccactgcya ccaccctggt ccaagccacc atcatctctc 2040 acctggacta ctgcaatagc ctcctaactg gtctccctgc tyccaccctt gcccccctnc 2100 agtctgttct cagcacagca gccagaatga tccttttaaa atgtaaatca gatcatgtca 2160 ctccyctgct caaaaccctt caatggcttc ccgtttcact cagagtaaaa kccaaagtcc 2220 ttaccgtggc ctacaaggcc ctatatgatc tggcccccgc ytayctctct aacctcatct 2280 tctaccactt tccccctcgc tcactctgct ccagccacac tggcctcctt gctgttcctc 2340 gaacacgcca rgcwcgttcc tgcctcaggg cctttgcact tgctgttccc tctgcctgga 2400 acgctcttcc cccagatatc cacgtggctc actccytcac ctcmttcagg tctctgctca 2460 gatgtcacct yctcagagag gccttccctg accaccctat ctaaaatagc acacccyctc 2520 ccccatcatg cccatytttc ttaccycgct ttatttttct ycatagcact tatcaccatc 2580 tgacayacta tatwatttay ttgtttgttt gtttrttgtc tcctccacya gaatgtaagc 2640 tccatgaggg cagggatttt gtctgttttg ttcactgctg tatccccagc gcctagaaca 2700 gtgcctggca catagtaggc gctcaataaa tatttgttga atgaatgaat 2750 // ID RLTR22A_MM repbase; DNA; ROD; 557 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 14-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW retrotransposon; RLTR22A_MM. 
XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-557 RA Jurka J. and Drazkiewicz A.; RT "RLTR22A_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 6-6 (2002). XX DR [1] (Consensus) XX CC 82% similar to RLTR22_MM (bases 1-573). XX SQ Sequence 557 BP; 148 A; 99 C; 167 G; 143 T; 0 other; tgttgcagga ttttccctgt ccaattacat tagtgcagca ggaggcctgt gattggacag 60 ggaaaaggga ggcggagcta agagttgcag agacagagag catctcaggg aggaaagagg 120 aaggccaaga tggaggcaga catgaaccag aaccagcatg gctttaaata gccacaggta 180 gttatgatat cataaggtta gaataattgg gataaagctt ttatcattat caattggctc 240 tgaaattatt gtattggcat cttgtaaatt gagaatttat tgatacataa atctgattgg 300 ttaattataa gcttcaagag ttttgattct actgggttac tgggtgttgt gatggctgac 360 cgcggggtgg atggtcattg cgtggggctg gcggcagtga cccgccaagg gaacttaggg 420 gccccccggg aactcgggag ctctgggcca gagagaccgc tgagttgaga ctcacccggc 480 tggagccatg tggtccactg gttgccagca gtagtgcggg aagtagcagg ttctattttt 540 taatatttcc cgcaaca 557 // ID RMER13A2 repbase; DNA; ROD; 779 BP. XX AC . XX DT 26-AUG-2008 (Rel. 13.08, Created) DT 26-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Long terminal repeat of endogenous retrovirus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; RMER13A2. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-779 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats of endogenous retroviruses from mouse."; RL Repbase Reports 8(8), 902-902 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC International Collaboration for the Mouse Genome Sequencing. 
XX SQ Sequence 779 BP; 182 A; 241 C; 205 G; 149 T; 2 other; tgtgtggggc aagtggaccg tgccaccaga cagaagaact ggtaaggttg aagcaggctc 60 aaaccccggg gaatctcaga attgagaacg gcaggcgtgg agccagctcc ctgccaaact 120 ctacctgatc tggaacatgt aaccatgaca ccccactgag aggaccatga caatcggtca 180 cgtagcataa aaacctggag ttctccatgc taatgaggta tctagatagg ccctgagtgt 240 ttagccaatg agcttccctt cctgggcatt ccttcccgca aaaggtattt aatccctggt 300 tcaccctgag taagatgtat gcattcacat ccgccatcaa ataaagcagt ttggacaagc 360 aaggaccatc tccttcatca aggatctcgg ggtggggtgg ttgcagggga ggggggcttc 420 gtcgtaaaga gccgcgccta aatctcccgg agaaggcctt ctctcttcca gcccctccat 480 actctcggca ctcactcgga accaaaccgg attccgcccc tgtcccgggc tcagccatcc 540 ccttcctgcc tggtgcgtag atgccctgag ggaaaagccc agaccatgga cccacgttcc 600 caacggcgtc tgggtacaca ggagaccggg aaccacagca tccgtcccag accccgctgt 660 ttctcacccr ggaaaacacg gactgaggac ccctatcatg gactgtgttc tgcctcccct 720 gagcggcttg ccggyggcgc tcaggcaccc cagcggcgag gcagacacgc agcccggca 779 // ID L1_CC repbase; DNA; ROD; 679 BP. XX AC Y00726; XX DT 28-SEP-1995 (Rel. 1.2, Created) DT 17-APR-1997 (Rel. 3, Last updated, Version 3) XX DE Guinea-pig LINE repeat sequence. XX KW Repetitive sequence; LINE1; L1; CCL1; L1_CC. XX OS Cavia cutleri OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-679 RA Hall L.; RT "Direct Submission."; RL Direct Submission to Repbase Update (22-JUL-1988) Hall L., RL University of Bristol, Dept. of Biochemistry, School of Medical RL Sciences, University Walk, Bristol BS8 1TD, United Kingdom. XX RN [2] RP 1-679 RA Laird E.J., Jack L., Hall L., Boulton P.A., Parker D. RA and Craig K.R.; RT "Structure and Expression of the guinea-pig Alpha-lactalbumin RT gene."; RL Biochem. J 254, 85-94 (1988). XX DR GenBank; Y00726; Positions 1 679.
XX SQ Sequence 679 BP; 163 A; 118 C; 122 G; 276 T; 0 other; aagcttttta acaagatgag cttccagttg ttgatttttt gtgaacgttt ctatgcaaca 60 gaagttttgt tcatgaagtc ttttgcctat accaatgtct tcaagagttc tacctagtct 120 atcttccagc aggtttaaca tttctgtttt aatacttagg ttttgattca tttcaatttt 180 agtttagagt gtggtgaaag atgtgggtat aattttaatc ttcagcatgt ggaaagccaa 240 tttttccagc accatttact aaagaggctt ttttccagca aaggtgtttg gcttttttgt 300 aaaaaataaa ggggctgagt gtgttggagt tgtctctgta tcttctaacc tgttccacta 360 ttccttgggt ctgttttttt gccagtacca tgctgttttt atcacagtag ctttatggta 420 taatttcaag tcagggtggg tgatgccccc ttcttggtct ttgttgccca taatttcctg 480 tactattata ggtctcttct ggttccaaat gaattttata atttctctaa ttctgcaaga 540 tatgctcttg gaattttaat cagaattgca ttaaatctgt ataatgattt tgaaagcatg 600 gccattttca ctattggttc ttcctactca agagcaaggg atttcttccc attttctgat 660 atcctattca atctcctta 679 // ID LTR8B_Cpo repbase; DNA; ROD; 637 BP. XX AC . XX DT 21-OCT-2009 (Rel. 14.11, Created) DT 21-OCT-2009 (Rel. 14.11, Last updated, Version 3) XX DE Long terminal repeat: consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR8B_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-637 RA Jurka J. and Walichiewicz K.; RT "Long terminal repeats from guinea pig."; RL Repbase Reports 9(11), 2878-2878 (2009). XX DR [1] (Consensus) XX CC >90% identical to consensus. 4 bp TSD. 
XX SQ Sequence 637 BP; 172 A; 171 C; 121 G; 173 T; 0 other; tgttacaggc agctgagcta gttagaaata ttataggcag tcagacaggg ctaggccctt 60 gaggaagccg gcacagagga ccagatgtct ggtgaccttg gaggagacac acacctcttg 120 ctgtgggcca gatctcaaga cagaaggcaa aacaaatgca accacaccct taagtataaa 180 ccttagggat taggtatgga tcaccttgag aagccaggag tcaccatccc aaccccctac 240 aggaagagtc actgatccat aagaatgctc gcttgctatg tgactcgctt ccccatttgt 300 ccctgatttg gtcaatttta cctgaacaat taaacttatt ggttgaccat agttctgtcc 360 cacctcatgc ccatctgcct tgcgagttcc cagttttccc cactttagac ccgcctcacc 420 tcatggtttt actctataaa aatccctaga aaagacagaa tctttggcaa cccactcggg 480 accccttctg tctctctctc agacagaagc tttctttctc tcaataaata tttctgctct 540 taacctccat tgttcctaaa attcattctt cgattttaga gaacacgaac tcgtaccgga 600 catatagtag gtcgggagat ttcattcccc tatatca 637 // ID RLTR32_MM repbase; DNA; ROD; 616 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 14-OCT-2002 (Rel. 7.09, Last updated, Version 1) XX DE Mouse putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Long terminal repeat; KW retrotransposon; RLTR32_MM; RLTR32A_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-616 RA Jurka J. and Drazkiewicz A.; RT "RLTR32_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 18-18 (2002). XX DR [1] (Consensus) XX CC 80% similar to RLTR32A_MM (bases 46-611). 
XX SQ Sequence 616 BP; 170 A; 151 C; 115 G; 180 T; 0 other; tgtagagagc acaagggcct tgtgctcaca gataagcact agggaaacca ggaatgttaa 60 acacacaggg ttgtctcctc aaaggaggag ataaaattaa atacctgttt tccacccaga 120 aggctgagag tggatgttgt taataacctg gtgacctttt tgctatctgt agagacagac 180 ctcacccaaa tcccccagaa tgattatccc agactcttcc ctcactagac cattacccat 240 ccttagggga gatgtcacat gtactttatg gcctgctgac ctctatgctg tcggtcagag 300 accacactag atccttccct aggcctttta gctatagaac ccaccttcat ggtcacatgt 360 actttgcaaa agagtttcaa tgtagtcata ccttaagctt tattgtacac ctggaattgg 420 aatgattatg gaaattattg ttgcctggaa acttttttcc aaattctact gtgtttaaat 480 atgcctacaa taaactgcct gtgcttagac tccctgaagt ctgatccagg ttgacaaagt 540 caatctgaac caaattttca ttctcacctc ttcacacggt ttgttttcct ctatgacacc 600 tgcagggacc ccctca 616 // ID MLT1R repbase; DNA; ROD; 1338 BP. XX AC . XX DT 01-MAY-1996 (Rel. 5.2, Created) DT 23-JAN-1998 (Rel. 6.4, Last updated, Version 3) XX DE MLT1-Mammalian LTR retrotransposon internal sequence - a DE consensus. XX KW Non-LTR retrotransposon; MaLR family; MLT1a subfamily; MLR; KW MLT1R. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1338 RA Smit F.A.; RT "Structure and evolution of mammalian interspersed repeats."; RL PhD dissertation, Univ Southern California, 1995. 
XX DR [1] (Consensus) XX SQ Sequence 1338 BP; 392 A; 211 C; 358 G; 335 T; 42 other; gattttggca ctaggaagtg gggtgcttac gtaacaaata cctaaaaatg tggaagcgac 60 tttggaactg ggtaataagt agaggctgga agagttttga ggtacatgyt agaaaaagcc 120 tagatttcct tgaagagact gttggtagaa atatggatat taaaggyaat tctggtgggg 180 ggctcagaaa ggaagwggag agctatagag aaagctcccg tcgtcttaga gaatacaaaa 240 atcgtcatga acagaatgtt gctagaaata tgaatgttaa aggtgcttct ggtgaggtct 300 cagacggaaa tgagggagat gttattgaaa attggaggaa aggtgatcct tgttataaag 360 tagcaaagaa cttggctgaa ttgtgttcat gtcctagggc tttatggaaa gtagaacttg 420 yaagtgatga actkggatat tcagctgagg aaatttctaa gcaaagtgtt gagggcgcaa 480 cctgacttct cctaactgct tatagtaaaa tgtgagagac ataaagatgg aantgttgag 540 taaaaaggaa ccagaacgta aagatttgga aaattctcag cctgatcatg taacagaaan 600 nncaatagcg ttctctggaa agaanaccaa ggatgtnncc gngcaaccgt ttgctaaaga 660 gattagratc gtgactcgtg ggtccaatca accatctcag caraarccar gaatagagat 720 grggttattt aggaaatatc tgtggaggac tctcttatct gatggctcga acccctatga 780 cttncatagg agaccgacaa ggnttttgag aatnttacac cagcagaaac actgccaact 840 tggactraag ggracagaga cgagaaaaaa tgaaggaagr gtggcagacc gaaggtgggg 900 ctgtctcgtt tcagagcatg gkgtcacccc agcgggcccg gaagatgaat ctcaggactc 960 agaggcatta gcctccagcc ttaagatcta atgaagtttg ccctgctggg ttttggactt 1020 gcttgggacc ngtgactcct ttcttyyytt cyatttctcc cttttggaat gggaatatct 1080 atcctatgcc tgycccaccg ttgtattttg gaagcagata acttgtttna ttrcacaggt 1140 ccacagatgg agaggaattt tgccyctgaa tgaatctyac cctgagtytc tnccayacyt 1200 gatgyrgrtg atatttcgct gagattgtgg actaagagtt ggtgctggaa ggggttagac 1260 attggggaga tgttgctatg ggatgcaggg attttgcatg tgagaaggac atgattatgg 1320 ggggagcgga gggcaaac 1338 // ID MER104 repbase; DNA; ROD; 185 BP. XX AC . XX DT 05-AUG-1998 (Rel. 6.5, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE Non-autonomous DNA transposon - a consensus. XX KW DNA transposon; MER104. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-185 RA Jurka J. 
and Naik A.; RT "MER104."; RL Direct Submission to Repbase Update (AUG-1998). XX DR [1] (Consensus) XX CC Potentially related to MER33 and CHARLIE5. XX SQ Sequence 185 BP; 74 A; 28 C; 27 G; 56 T; 0 other; taccatattt cattgaatct aagatgccat caattgtaag atgcaccatt attttatgta 60 ccactaagaa agaaaaaaat gctgccaatt aaactataac atgccattaa ttgtaagatg 120 catcccaatt tcagagatgt taaaatgtga aaaaatgtgc atcttagaat cgatgaaata 180 tggta 185 // ID RSINE1 repbase; DNA; ROD; 179 BP. XX AC . XX DT 22-APR-1997 (Rel. 3, Created) DT 22-APR-1997 (Rel. 3, Last updated, Version 1) XX DE SINE element; RSINE1 family - a consensus. XX KW retroposon; SINE; RSINE1 family; ID; RSINE2; RSINE1. XX OS Rodentia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires. XX RN [1] RP 1-179 RA Kapitonov V.V., Chopra V. and Jurka J.; RT "RSINE1."; RL Direct Submission to Repbase Update (1997). XX DR [1] (Consensus) XX CC ID-like SINE element. XX SQ Sequence 179 BP; 54 A; 45 C; 47 G; 27 T; 6 other; gggctgggga gatggctcag tnygtaaagt gcttgccgtg caagcatgag gacctgagtt 60 tgatccccag aacccacata aaaaagaggg agaggacctg aggaagacac cygaggttga 120 cctctggcyy ccacatayat gtgcacatgt gcacacatgc acacacacac acacacaca 179 // ID MER21B repbase; DNA; ROD; 795 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE Medium reiteration frequency MER21B repetitive sequence - a DE consensus. XX KW Repetitive sequence; MER21B. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-795 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [2] RP 1-795 RA Smit F.A.; RT "Structure and evolution of mammalian interspersed repeats."; RL Ph.D. thesis (USC 1995). 
XX DR [2] (Consensus) XX SQ Sequence 795 BP; 188 A; 210 C; 159 G; 191 T; 47 other; tcaacacaca acacttctgt caccaaacgt gtggrrrttt ttccccacac acaaacaakt 60 cttcagttct gcggcggaca ccarctgggt gtcctccaat tcarttcagt tctgacacta 120 wctacctrga gatagcgtca gatcccacag gtttaagggc tcagtcccac aagactgccc 180 ccacttcaga trccagtcgc aagtctrggt tktcacccgt acttctgacc aactggctat 240 aaattgttcc cacgacccct ctttaggttc gattaatttg ctagaatrgc tcacaraact 300 cagggaaaca cttatattta ycggtttatt ataaaggata ttacaaagga tacagatgaa 360 caaccagatg aagagatrca tagggcgagg tmtgggagag tccngggnnc aggagcttcc 420 gtgccctctc tggstnnrcc accttccwgg cacctccacg tgttcaccaa cccggaagct 480 ctccgaaccc tgtccttttg ggtttttatg gaggcttcat tacgtaggca tgaytgatta 540 catcantggc cattgattat caactcaacc tycagcycct ctccyctccc cagaggttgg 600 agggtggggc tgaaagttcc aaccytctaa tctgccttgg tctttctrgc gaccagcycc 660 catccwgrag cntnctaggg gctgcccrcc akgagtcgmc tcattagmac aaaagrncrt 720 ynctattacc caggarattc caagggtttt aggagytctg tgtcaggaac cggggtcaaa 780 gaccaaatat tagat 795 // ID MLT1E repbase; DNA; ROD; 568 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 18-APR-1997 (Rel. 6, Last updated, Version 2) XX DE Mammalian transposon-like element long terminal repeat (MLT1e DE subfamily) - a consensus. XX KW Repetitive sequence; MaLR family; MLT1e subfamily; MLT1E. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-568 RA Smit F.A.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21(8), 1863-1872 (1993). 
XX DR [1] (Consensus) XX SQ Sequence 568 BP; 146 A; 134 C; 137 G; 129 T; 22 other; tgtggtaggc agaattctaa ratgmtccca atgatcctcg cctcctggcg taatctcctt 60 gagtgtgagt aggacctgtg acttgcttct agccaacgga atatggcaaa ggtgatgara 120 trtcacgtga ttacgcgtac gtgattatgt aasattcagt ctttgmcgtc attcttgccg 180 agagactctc ctsctggtyt tgaagaagta agctgccacg tcatgagnnn ncnnannaga 240 rygccgcaag gcaagggnnt ctagagctga gagtcgccct tactgatggg cagcaagaag 300 caagccacct cagtcctaca gccgcaaaga actgaattct gccaacaacc tagtgagctt 360 ggaagcagat cctgcccagt cgagcctcca gatgagancg cagccctggc tgacgccttg 420 actgcagcct tgntagacct tgagcagagg acccagctaa gccgtgccca gactcctgac 480 ccacagaaac tgtgagataa taaatgtgtg ttgttttaag ccgctaagtt tgtggtaatt 540 tgttacgcag caatagaaaa ctaacaca 568 // ID RLTR20C_MM repbase; DNA; ROD; 508 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR20C_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-508 RA Pavlicek A. and Jurka J.; RT "RLTR20C_MM - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Individual copies are ~90% identical to the CC consensus. 6 bp TSDs. 
XX SQ Sequence 508 BP; 141 A; 84 C; 159 G; 124 T; 0 other; tgttggggat tggttctaat gctttgaatt aatctggccc ccaaaattgg gaatctgcgt 60 gtccaaacgc tgaaggtcct tgtccccaat tggtttttga ttgatcaata aagagccaat 120 ggccaatggc tgggcaggta gactgaggtg ggacttttag atttgcatgg gctaggaact 180 gggagagagg aaggaagcag agatcaccat gatggggaag aagaaagacc agacctaaag 240 gcccgccgac atgtgagaat cagggaaagt ggccacacgg gccacttccc caattgtgtt 300 tggggtagca gagatgaaat atagatttta aaagatgtta actcaggagt accagagggg 360 agtgtttgct agcagcggga aggattagaa atgcccagcc attaagctag tcaaggcata 420 ttaaaattaa gctggtgtgt gtgtgtgtgt gtctttcatt cgcgaatcca gagagctctt 480 gggggggtag ggcgagaagt gtgtgtga 508 // ID MER93a_LTR repbase; DNA; ROD; 402 BP. XX AC . XX DT 16-MAR-2006 (Rel. 11.03, Created) DT 10-APR-2007 (Rel. 11.03, Last updated, Version 1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from placental DE mammals. XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; MER93; MER93a_LTR. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-402 RA Smit A.F.; RT "MER93a_LTR - a subfamily of endogenous retroviruses from RT placental mammals."; RL Direct Submission to Repbase Update (11-NOV-2005). XX DR [1] (Consensus) XX CC mer4 group, 20-21% diverged from the consensus. XX SQ Sequence 402 BP; 117 A; 101 C; 67 G; 110 T; 7 other; tgttaaaata attaaatggg aggccattag actgaggtgg ctctaacgcc ctgggttcct 60 acgtaagcaa accgaaacct aactcaaatg catttcttnt aagtnactac cttaggagga 120 aacgaaactt aagctcagcc aatcacaagc ngccaactgg gcattagtta tattatcang 180 aacttcccac cgggatagtc caaataaggc aactgctcaa actttaacca atcaaataat 240 ttntttgctc tgcttccgca ttcaccctat aaaagccttc ccttcangcc cctccggtgg 300 agccccgaac cacttccggt ttggngctgc ccgattcatg aatcgctgtc tgctcaaata 360 aactctttaa aattttaatg tgcctaagtt tatcttttaa ca 402 // ID MamGypLTR1a_LTR repbase; DNA; ROD; 784 BP. XX AC . XX DT 05-FEB-2007 (Rel. 12.01, Created) DT 10-APR-2007 (Rel. 
12.01, Last updated, Version 1) XX DE Long terminal repeat of Gypsy LTR Retrotransposon from mammals. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW MamGypLTR1a_LTR. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-784 RA Smit A.F.; RT "MamGypLTR1a_LTR - Gypsy LTR Retrotransposon from mammals."; RL Direct Submission to Repbase Update (05-FEB-2007). XX DR [1] (Consensus) XX CC 4 bp TSDs; 33% subst in dog-human. Associated with Gypsy CC internal sequence. Includes. XX SQ Sequence 784 BP; 174 A; 187 C; 257 G; 161 T; 5 other; tgtggcagga taatttnttg agatattaat ttgtgttttg ctctctgtat ttttcccttc 60 ccttcccatt ccaagcaggt agccaggccc ttngtatttc cttgcctcgg ggatttttgc 120 agggcagaaa gcagaagctg cttgaagtca acggctcttc ctgtcttttg taaaagccta 180 agctcattga agagattatg ctaggtgtcc cggaggggga ggggagagag gggagtgcct 240 ttgagggcaa gcgggagaag gagaaagagg aggagatttc ccaggactgg gaaggggaca 300 gaggtcngcg ggtcccgcga gcagcgggga cccgcgcccc gctccgtggc agcgcccggg 360 gaggtggcaa gacctcagag gggaatggct gcgtggtgca cctagggagg ctggaccccg 420 ggcacngggg ctcccagcct cgccaaagat tcccgtgccc caagcatggc acggaagcag 480 cagagccgcc ggacctgaag ggaccatgcg ggctgggaca atgggcatct cagcggtaac 540 cagtgtggac cgatgaccga tgaccggagg ggcctccccg atgctttggc gctgtgtaag 600 accccgggac ctttgcacaa ccctgggggn gggaggggga gccccaataa tgactgagat 660 tgaatttccc gccagcccgg taggatgggg gctcagagtc agatttaagt tgatttaaag 720 aaataaagaa atgtgatatt tcttgcacac ctgagtttgt ggactaagat tcatacccgc 780 taca 784 // ID ERVB4_1-I_MM repbase; DNA; ROD; 8316 BP. XX AC AC102561; XX DT 11-FEB-2004 (Rel. 9.01, Created) DT 11-FEB-2004 (Rel. 9.01, Last updated, Version 1) XX DE Mouse endogeneous betaretrovirus ERVB4_1 internal sequence. XX KW Endogenous Retrovirus; Transposable Element; gag domain; KW pol domain; endogeneous betaretrovirus; MmERV-B4_AC102561; KW ERVB4_1-I_MM. 
XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-8316 RA Baillie J.G., Van de lagemaat N.L., Baust C. and Mager L.D.; RT "Multiple groups of endogeneous betaretroviruses in mice, rats, RT and other mammals."; RL J. Virol 78(11), 5784-5798 (2004). XX DR Genbank; AC102561; Positions 139574 147889. XX SQ Sequence 8316 BP; 2314 A; 1916 C; 1741 G; 2345 T; 0 other; gaggtgccca acgtggggcg aggtccggat cgtaattcct aatcaatgga ctttccagag 60 aagttcttcg agaccccacg catttaggag aagtgaagga catcttccgc tgctcaagca 120 agaagtcctt ggtaagttga gtcatctcca cttcaggtta tgggaaataa gttatccaag 180 gaggcagcct tcatcaaaga tttaaagcta gctctcaggg aaagaggagt acgagtaaaa 240 aagaaagatt tgatatactt ttttattttc atagaccagg tatgtccgtg gtttattata 300 gatggagcag agatacatcc taagaaatgg agaaaagtag gtagagagtt aaatgacatt 360 ttagcaaagc agggccccga ggcggtccct gcaaatgtat ttacttattg gagtttgatc 420 cgtgatttgg tagaaaatac agttgatgat ccagagaaac agcaactttt gtcagtggca 480 gaatattgcc tctgtccatt gtctcgggaa gctatggagg gctccctccc tactactaat 540 cctccaaaag ctacagatgg gcctaaaatt ccagaggcta cactttatcc gcaggtctca 600 gtaggcgctt tgcctgagca gcctacacag cctgcaaccc cacccccctg ccaacctgcc 660 actaatcaat ctagtcttta aaaaatcttt ggcagaggga cctttaatat ctggggattc 720 agaagctata gaggaagagg tagtctggcc cataaaccgg agcagcttcg tggcgcctca 780 gctcctcctc ttaatacccc tccctacttg ccaccagtct gtgctcctgc accatggcct 840 taggctctac cagttcccac tctcattgat gctaaagaca agctcgcgtc tcaagtggca 900 ggattacaag aagtattagc attacagcaa cagtacactt gcctgtccac agagctcacg 960 tcactccaaa attctttaaa agagactatt cttgcccctc cctcgcctgc agccttgggc 1020 agcttaaaaa aaaaaggctt cgtgagcagt aaaatacaaa acattggctt ttcctgtaat 1080 caccctctct atgggagatg ctagctgtga cccgcccact tctacgggag aggaacagcc 1140 ctcaatgatc cgtgccagat ttacttctgt aaaaataggg cctttggaaa cgccacaggc 1200 cgatctttta gacactaaca gcgagaacag aacagattct gagagtgata atgaaataga 
1260 gcatgagcca gaagagcgat tggcagttcg tattaggcag acagagtttc gtaaattacg 1320 tttgaaggat ttaaaagagc ttaactctgt ggttaggacc aatggtccat ctgcccctta 1380 tatgctttcg tgcctagaag ccctcccagg aggcagacaa atgctcctta gtgaatggat 1440 cagagtgatt cagacagtgt taactcgcgt acagttcctt tcttggaagg ctgattttct 1500 cgatcgttgc cagacaattg ctataactaa tcaaaggaac cctcaaacgc catctgctgg 1560 ctggaccttt gaaaagcttt ctggtcaagg aaaatacgca gcagaggcca gacaaaaaac 1620 gttttccaac aggtctgttg gctcaaactg ctaatgcagc tttgagagct tggcatgcca 1680 ttcctatgaa aggctccgtt attacccctt taactaaaat tattcaagga gtacaagagg 1740 actatagtga atttgtaagt cgtttgcttg aggctacaga gaggacctta ggtcatgagg 1800 atgcagacaa taaactcata aaacaattgg cttttgagaa tgctaattcg gcttgtaagg 1860 cagtcttgca tggtaagatc agagacaagg accttaatga gatgattcat ctttgtcatt 1920 atgttaatat gtttactcac aacatgtccc aaagagttaa tcttgcaatg gatgcagctc 1980 ttaggcctgt tatggcaata ggtgcggctc ttcagctagc tggaccccag aaaagttgtt 2040 ttaactatgg ccagcctgga cactttgcag gacagtgtcc cactgcgccc tcccctaaca 2100 gttcttcagc acaacctact aaccattctt tccggcctat acggcccaat tttctttgct 2160 catggtgtaa gagaggcaaa cattgggtaa acacttgcag agctcagaca aatgtgtttg 2220 gaaatctcct gcctcctatt cagggaaacg agtatggggg acagccccga gtccccaaaa 2280 ccatctcatt tctcccggcc acggagcaca gatggcagac aaatcaaact ctgctctctc 2340 cagagccacc acaggcagca caggcctgga cttgtattcc tctgccagcg caattttaac 2400 accagaggat ggcgtttaca tcctccctac gggggtctat gggcctcctc cacccaatac 2460 ttatttttta atattagggc gtgcttccgc cactttgata ggacttactg tccatccctc 2520 attagtagat aatgactaca ctggagaaat taaaatttaa gttagtgcct ctcaagggcc 2580 aatatctata tgcaaagggc aatgtttggc agaagccttg cctctacctt tagatacttt 2640 ttatccagca atgggcaaat gtcgcggctc ttcgcagcca gggtcatcag aactttattg 2700 ggtacaagca attaccaaag actgcccaac actctgcctt aagataaatg gcaagcgttt 2760 tgagggcctt ttagactcaa gtgcagattt cactgtgatt tcacaaagtg cctggcctgc 2820 tgcttggccg ttaaaggcct ctttaactca tttacaggga attggacaat caaaaaatac 2880 tctccatagt tcatagttgt tgacctggga agatgatgaa ggaaactctg gttctattca 2940 
gccttatgtt gtccctggtc tgcccgttaa tttatgggga gagacatttt ttctcaaatg 3000 agggtcatta tgtgcagtcc taatgaagtc attactcaac aaatgctcgc ccagggttat 3060 ctccctggac agggactggg taaatatagc caagggaggc ctaccccaat agaggccacc 3120 cccaagatag atcgtgcagg tttaggttac aactcccatt tttcataagg accattgctc 3180 tccctgcacg tcaggcagat aagattactt ggaaagatga tacccctgtc tggcttgacc 3240 agtggtccct tcctgcagaa aaattgtcag cagccataga attagtgcag gaacaattgg 3300 cagctggaca caatgaatcc tccacttcac catggaacac tcctatcttt gttattagaa 3360 agagaaatag taaatagaga ttattacaag accttagagc agttaataag actatggacc 3420 ccatgggggc cctacaacct gggattccct ctccagtggc tattcccagg ggatatgcta 3480 aactagtgat tgatttaaag gattgtttct tttccattcc tctccatcct gaagattata 3540 aacgctttgc atttaccctt ccagtggtta attgtatagg gccttctcct cgcttccaat 3600 ggagggctct tcctcaaggc atggcaaata gtcctacact ttgtcagaga tatgtggcac 3660 aagttattga tccattcaga atgtcataat ctgtatgtgg tccattacat ggatgacatt 3720 ttgattgctg gacctgatca agaccaatta tatattgcta gtcaaaagct tgttaatgcc 3780 cttcaaaatc aagcactcca agtttctcca gagaaaattc agatccaccc tcctcacttg 3840 cttttgggtt ttgaattatt tcctaacaga attctctctc aaaaggtcca ggtgagacaa 3900 gattccttac agactcttaa tgattttcaa agtctgttaa gagatattaa ttggcttcgt 3960 ccttatctga aacttacaac aggagagtta aaacctttgt ttgacattct gcgaggagac 4020 tcagatccat cctctccacg catgttaact caagaggcac gaatgtcact acctaaagtt 4080 gaacaagcca tcagtgagca aaatattggg tatttttccc cagagcttcc acttcagttc 4140 cttgtcttcc ctactccctt ttcacccaca ggtctgctgt ggcagcttaa acctctgttt 4200 tgggtccaca tgtcagcttc tccctccaaa gtgctaccca catatcctca gttagttgct 4260 aatgttctgt gcctagacag agaagctgct cttaagcctt tggcagagat ccagatgtta 4320 ttgttctaac ctatgatgcc tctcaagttc agtggttact taaaaataat gatgtttggg 4380 cagttaattg catctccttt caaggtgtga ttgataatca ttaccctgct gataaattgg 4440 ttcagttttt gcataagacg cctgtggttt tccctaagag aacaaagtca gacccgattc 4500 ctggggctat gttagttttt actgatgggt cttcctcagg ttggctgctt ttaatattgg 4560 tggaaaggtc tcacatttta tgacagacct ctcctcagcg cagcttgttg aattggcagc 4620 tattgttaaa 
gtatttgtac tattgcctaa aactcctttc aatttatata cagacagcgc 4680 ctatgtggct acctctattc ctcttttaga aactgtcctt tatattcgcc cttctaccaa 4740 tgcgtctcca aatttgctaa acttcagagc cttattcttg ctcgtaattt tccatttttt 4800 attggtcata ttcgtgctca ttctggcctg cctggacctt tgtctgaagg caacaatata 4860 gttgatcagg ccactcaggt aatatcttcg gctctgtcta ttacccctct tgctgctgcc 4920 caacaggccc atgatttaca tcaccttaat gcacatacct taaggcttaa attctccatc 4980 acccgtgaac aggctagaca aattgtccgg cagtgtaaag gctgcctaac ccttttgcca 5040 gagccacatg tgggagtcaa cccccgagga ctaattcctg gtgaactgtg gcaggtagat 5100 gttacccatt acactccctt tggaaagtta aaatatattc atgtctctgt tgatatcttt 5160 agtggattta tctgtgcatc tttacaaaca ggagaggcta ctaaacatgt tatcagccat 5220 gttctctcct gcttggcgac tgtgccacag cctaagatcc tcaagacaga caatggccca 5280 ggatatgcaa gtgccagctt taaacagttt tatgcccaaa tgggcattaa acacattact 5340 gggattccct ataatcccca aggtcaaggt attgtaaaaa gaactcatca aacccttaaa 5400 aacatgcttt tcaaattaca gtctgggggg aaaattctat atcttcagtc tggtaactcc 5460 aagatggttt taaatcatgc attatttgtt ttaaactttc tgacgtacga caatgcaggc 5520 aagtctgctg cagatcgcct ctggcatcct tctactgcta ataactatgc acaagctatg 5580 tgggagaccc cttgtccaac aaatggaagg gtccagaccc agtcctcata tggggcaaag 5640 gacacgcttg catctacgat tcaaaagcac aacatgctag atggctccct gagcacctaa 5700 taaaacctta taattgtcca tggaagaaaa accccgagga agtttctaaa tgtgcttctt 5760 tacaaaaaat gaaagcgacg agaaaaaagt cactcctgat catcgagatt tggctgtgtc 5820 tttgcctgat gagaagccag tggtaggcca agctaagaaa aatccttatc gactttataa 5880 ttacacctgg ctaatcatta acgaggcagg tgacatagct aatgcctcct ccaagattga 5940 gggttctatc ccatgaccca tcctgagagt cgatctttgt aaattagtcc taagaggtca 6000 taatgactgg ggaactcagt cagaattcct gccacaagaa caagctattg atgatccaag 6060 acaagttgcc tataccactc ctggatgtgc tagtttaagt catagaaaaa cacttgccag 6120 tgttttagaa agatggggca tatatatttg tcctggcccc aatcacagga gtcacactct 6180 taattataaa tgtggtttcg cccccgatta tttctgtgct tcttggggct gtgagaccac 6240 aggagacacc tattggaaac ccacttctga ttgggaccta ataaaggtcc agagcaggcc 6300 tgattatgct gcctgtgcta 
gctccaacca gacttccaaa ggatggtgca acaccttaga 6360 gatctccttt actgacacag ggaaaaaatt taattgggag tatacactag gagctgagtg 6420 gggtttacgt atctatagaa atgaaaaaga tttcagagta acattcagaa tccagttact 6480 taagaacacc ccatctttag ggtcggctgc tataggcccc aatctcattt tacattcctc 6540 ttatcctaga aagccgagtc ttccacagac tgttacctta ggtccacctg gtactactgt 6600 attccagcct actttagctg ctggatctcc ctcctctgca gaattgattt tatccttagt 6660 aaatgcatca attgctacta tacacgcaac taatgttact cagtatgaag aatgttgggt 6720 gtgtttttct cctcaacctc acttttatga aggggtggca acattcggat cagttattgc 6780 aattaatgat tccagcaaac ttggatggca ccccgagaga catgatgggc tcactctgag 6840 tcaggtgtct ggcataggct tatgcctttt ggggccttcc atgcttcctc ctcaggcctt 6900 attagaggtt tgaaatcaga ctattagggt ggataccacc tctagatacc ttggagcacc 6960 taatggtatt tatttagcat gttctactgg acttactacc tatatagtta ctcagacctt 7020 tttagatgac agagattatt gtgttgtagt tcagcttctc ctgaagctgt cagtacacca 7080 tgaaaaagat ctcctccaat tctggaagtg tgacacagat ctaccccgtg ataagagaga 7140 gcctatttct gcagtaacct tagcagtgat cctaggctta ggagctgaag gcacaggaac 7200 gggcattaca tctttgatca cttcccaaca gcaatatact cagcttcacc ttgctgtgga 7260 cagagatata caagagctac agagaggctt aaaaaattta aaaattcctt ggtctctctg 7320 tctgaagtgg tattacaaag caggcgagat cttgatttag tgttccttaa agaaggaggg 7380 ttatgtgcag ccatcaaaga agaatgttgt atttactctg acaagactgg cttagttcaa 7440 gataaaatag acaaagttag agctagttta gaagagagaa aattcaacag agagaaacaa 7500 gaatattggt ataaaaattg gtttttactt ctccttgggt caccaccttg cttcccacac 7560 ttttgggacc tttcatgggt attttgctgc ttttgtcttt tggcccctgg gcctttaaaa 7620 gatgaaccag ttttgttaag tcacagattg aagctgcttt aagtaagccg gttgcagtcc 7680 actaccatca gctggatatc caagactcag acgaagaaga tcctcctccc accaaggcag 7740 aaacaacaac tcgtctccaa ttttccaccc ttgctgctaa tgcagagttc cctgggttcc 7800 tcaggctttg gagacaaaag ggacaaaaaa gggtgacctc cagctcctaa tatagcttaa 7860 gagccacaaa ttttggtgtt acaattggca taattcttgt agaggcttta ctgagtctga 7920 ggttgacaat tctcctacgg gagctgcaca gaactcagaa ctgtgcatgc tggttcgtga 7980 taaatatgaa tctggtatgg gcccagcatg 
gctgcaaact aagtattgca gaggaaggtc 8040 caaatagacc gtttctgagt tccctgagag acaaacccgg tttggatgga ggttggtatc 8100 tagtttcagt tacaagtaaa gaacttttct aaggctctgc ctcccctccc caaaagatac 8160 caagagccac aaatgtgggt ctgtgacagc acccacggga ggaatcgggt caatgtcccc 8220 ccagccaggg taatgcccac tcatccaagg atgagaaaat tatttgatca cctcagttaa 8280 gtgtggcctt attaaattta attcagaagg gggaga 8316 // ID RMER6B repbase; DNA; ROD; 449 BP. XX AC . XX DT 26-SEP-1997 (Rel. 2.08, Created) DT 26-SEP-1997 (Rel. 2.08, Last updated, Version 1) XX DE Long terminal repeat of retrovirus-like element. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW putative long terminal repeat; RMER6B. XX OS Murinae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae. XX RN [1] RP 1-449 RA Smit A.F.; RT "RMER6B."; RL Direct Submission to Repbase Update (30-NOV-1996). XX DR [1] (Consensus) XX CC 5' end incomplete. bp 344-499 almost identical to bp 682-788 of CC RMER6A, and therefore considered a putative retrovirus-like LTR. XX SQ Sequence 449 BP; 111 A; 125 C; 76 G; 134 T; 3 other; ctaacaaaat ctcaatcgtt tgattttacc aataaagnct caggagccag atgctggggt 60 gaaagcctgc tagctcagag aggcagagaa agcacccagc tgaccttcct cctcagccsa 120 catcccaaaa ggagttctcc ttctccacac catctcaaaa ccccttcaaa ctgaatgtcc 180 ctcccttcta cttcctgtgt gtctctctat ccgtcctcct gactccctct tactctctgc 240 tttttttttc ttatgttcac tccctgtcaa ctggttgctt gytctgcctc ttgacctatg 300 gttgacttta tttaatcctg tttacaataa acagaaagct cttggattaa aggtgtgtgc 360 tggggctgag ccacaccacg actagaaaca ggtttttcag taattaacac aatttcaggg 420 ttcacgatgt gatcaaatac cctgcaaca 449 // ID MLT1D repbase; DNA; ROD; 505 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 23-JAN-1998 (Rel. 6.4, Last updated, Version 3) XX DE Mammalian transposon-like element long terminal repeat (MLT1d DE subfamily) - a consensus. XX KW Non-LTR retrotransposon; MaLR family; MLT1d subfamily; MER26; KW MLT1D. 
XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 334-465 RA Jurka J., Kaplan J.D., Duncan H.C., Walichiewicz J., RA Milosavljevic A., Murali G. and Solus F.J.; RT "Identification and characterization of new human medium RT reiteration frequency repeats."; RL Nucleic Acids Res 21, 1273-1279 (1993). XX RN [2] RP 1-505 RA Smit F.A.; RT "Identification of a new, abundant superfamily of mammalian RT LTR-transposons."; RL Nucleic Acids Res 21(8), 1863-1872 (1993). XX DR [2] (Consensus) XX CC Replaces MER26 sequence. XX SQ Sequence 505 BP; 149 A; 96 C; 138 G; 113 T; 9 other; tgtggtaggc wgaataatgg ctccccaaag atgtccacgt cctaatcccc agaacctgtg 60 aatatgttac cttacatggc aaaagggact ttgcagatgt gattaagtta aggatcttga 120 gatggggaga ttatcctgga ttatccgggt gggcccaatg taatcacaag ggtccttawa 180 agagggaggc agagggtcag agtcagaaga aggagatgtg acgatggaag cagrragnga 240 aaactcaacg ttgctggctt tgaagatgga ggaaggggcc atgagccaag gaatgcgggc 300 agcctctaga agctggaaaa ggcaaggaaa cggattctcc cctagagcct ccagaargaa 360 cgcggccctg ccgacacctt gattttagcc cagtgagacy cattttggac ttctgacctc 420 cagaactgta agataataaa tttgtgttgt tttaagccac taagtttgtg gtaatttgtt 480 acagcagcma yaggaaacta ataca 505 // ID MMERGLN_LTR repbase; DNA; ROD; 430 BP. XX AC . XX DT 22-OCT-2010 (Rel. 15.1, Created) DT 22-OCT-2010 (Rel. 15.1, Last updated, Version 2) XX DE ERV1 Endogenous Retrovirus from mouse: long terminal repeat DE (consensus). XX KW ERV1; Endogenous Retrovirus; Transposable Element; KW LTR retrotransposon; MMERGLN_I; GLN; MMERGLN_LTR. XX OS Mus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-430 RA Smit A.F.; RT "MMERGLN Endogenous Retrovirus from mouse: long terminal RT repeat."; RL Direct Submission to Repbase Update (06-SEP-2005). XX RN [2] RP 1-430 RA Ribet D., Harper F., Esnault C., Pierron G. 
and Heidmann T.; RT "The GLN family of murine endogenous retroviruses contains an RT element competent for infectious viral particle formation."; RL J Virol 82(9), 4413-4419 (2008). XX RN [3] RP 1-430 RA Jurka .; RT "LTR consensus."; RL Direct Submission to Repbase Update (22-OCT-2010). XX DR [1] (Consensus) XX CC >99% identical to consensus. Active. XX SQ Sequence 430 BP; 106 A; 116 C; 93 G; 115 T; 0 other; tgaaaggaaa taaaactgta attcatgtaa tgtatgttaa atagcccaaa gagttgtttc 60 tgagctttga aacctggggc tgagaacata gcagaacaga ccaggacatg cccgggcaag 120 cccatcgcct ccctagctcc cacccctctg acctaagtta aatgttacag gctgctgatg 180 tttaaatgga ccaatcatgt gaaaccgcgc caattcctcc cccagcccca ctccttttct 240 ataaaacccc ctagcttcca agcctcgtgg tcgaatccac tgtctcctgt tgtgtgagat 300 acgtttcgac ccggagctcc gccattaaaa aacctcttgt tgttacatca aggtgttgtg 360 ttctattcgc gattcttggg tgcacgccga atcgggagct gagtgggggt ttccccactg 420 agttctttca 430 // ID MER68A repbase; DNA; ROD; 563 BP. XX AC . XX DT 18-APR-1997 (Rel. 2.03, Created) DT 07-OCT-1998 (Rel. 3.09, Last updated, Version 4) XX DE MER68 LTR element - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; KW Interspersed repeat; HERVL68; MER68A. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 563-1 RA Smit A.F.; RT "MER68A."; RL Direct Submission to Repbase Update (30-NOV-1995). XX RN [2] RP 1-563 RA Kapitonov V.V. and Jurka J.; RT "MER68A."; RL Direct Submission to Repbase Update (31-JUL-1998). XX DR [2] (Consensus) XX CC Sequences related to MER21 and MER77. CC Original orientation [1] has been changed based on classification CC of MER68 as an LTR from HERVL68 retroelement [2]. CC Individual sequences are 82% identical with the consensus CC sequence. CC The age may be younger since this subfamily can be split further CC into minor subfamilies. 
XX SQ Sequence 563 BP; 119 A; 139 C; 144 G; 158 T; 3 other; tgtgcagaaa agagttaaca tagcaggcct gagactgcta tccttagaaa ggcctgcttg 60 caaggttggc ccttggctgg catctgggaa cttggatttc gggagggttc ccaccattcc 120 cwkaactgat aagagtggct cactgtgcct aaactgtttg tgcaaacaat atggtttatg 180 ctgaacacct gctttccttc tgggagtctg gaattttggt acgtgctagg cagagggtgc 240 ctacgtgacc agcccccart aaaaaccctg ggcactgagt ctctaatgag cttccctggt 300 agacaacatt tcacatgtgt tgtcacaact cgttgctggg ggaattaagc gtgtcctgtg 360 tgactccact gggagaggac tcttggaagc ttgcgcctgg tttcctccgg acttcgcccc 420 atgcgccttt tccctttgct gattttgctt tgtatccttt cgctgtaata aatcatagcc 480 gtgagtatga ctatatgctg agtcctgtga gtcctcctag cgaatcaccg aacctggggg 540 tggtcttggg aacccctaac aca 563 // ID RLTR19A2 repbase; DNA; ROD; 507 BP. XX AC . XX DT 18-SEP-2008 (Rel. 13.09, Created) DT 29-SEP-2008 (Rel. 13.09, Last updated, Version 1) XX DE Long terminal repeat of endogenous retrovirus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; RLTR19A2. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-507 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats of endogenous retroviruses from mouse."; RL Repbase Reports 8(9), 1053-1053 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by CC International Collaboration for the Mouse Genome Sequencing. 
XX SQ Sequence 507 BP; 105 A; 131 C; 117 G; 154 T; 0 other; tgttaagacc taggtagccg ctgcctttct gacattactg tagccatttt gttttatgcc 60 tgtagccagc catcttacat tgttacagag gacctttctc atgcccgagt taaccattaa 120 ccgcatatcc tgttaagttc tatgctttgg tgttcttgga aattccccag accctcgccc 180 tcctttagag ccaatcacaa taaaggtcag ttagaactgc tttgtatagt tagctacaat 240 aagcttggac ccgaaccaac cgccgcaacc gcctggcagc agcaccttcc tgcacctgtg 300 tatgagcttt tatggttgct atgagctttt gtgggttttg cctttataag ctgcccctgg 360 gaaacagtca gggtcgcagt ctcagctccc gagtctgacc tgcgtccctg atcgatcagt 420 cctggggggt gtgtgttcaa taaactatcc ctgtttgact gagatcggtg tctgagtggt 480 ttgtggggtg attcctggac cccaaca 507 // ID CAVID2D repbase; DNA; ROD; 87 BP. XX AC . XX DT 06-NOV-2009 (Rel. 15.03, Created) DT 06-NOV-2009 (Rel. 15.03, Last updated, Version 3) XX DE tRNA-derived SINE family: consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW CAVID2D. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-87 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 10(3), 506-506 (2010). XX DR [1] (Consensus) XX CC >91% identical to consensus. XX SQ Sequence 87 BP; 25 A; 21 C; 27 G; 14 T; 0 other; gggctgggga tttagctcag cggcataagc gcctgcctgg caagcgcgag gtcgtgagtt 60 cgatccccgg taccaaaaaa aaaaaaa 87 // ID LTR5B_Cpo repbase; DNA; ROD; 386 BP. XX AC . XX DT 19-SEP-2009 (Rel. 14.07, Created) DT 20-OCT-2009 (Rel. 14.07, Last updated, Version 4) XX DE Long terminal repeat of ERV3 endogenous retrovirus: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR5B_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. 
XX RN [1] RP 1-386 RA Jurka J.; RT "Endogenous retroviruses from guinea pig."; RL Direct Submission to RR (20-OCT-2009). XX DR [1] (Consensus) XX CC ~88% identical to consensus. XX SQ Sequence 386 BP; 90 A; 86 C; 106 G; 104 T; 0 other; tgttatggtt gatacattgt gtcaacttga gaagtttaga agtttaactg agagactcag 60 cagggagcca gggtccttac tttgtaaata tcctcatcct gtgcaagagg agggcacgga 120 gattttgtga gtgctacacc cgccctgggg ggggggggtg gcctgcgtac aatataaggg 180 aggagaagag gcttgttttc cccccttttg ctctggtttg ctggctgctg ccttgaagtg 240 ttgccccagt gccatgccac cctgccttgg agccagctga ttatggactg aaacctccac 300 aaacagtgag ctaaataaac ctttccttcc ttcattttgg gtgtcgggta ttttgtccca 360 gcaacgagag aaaagtaacc aagaca 386 // ID MLT1E2 repbase; DNA; ROD; 593 BP. XX AC . XX DT 09-SEP-1998 (Rel. 6.6, Created) DT 09-SEP-1998 (Rel. 6.6, Last updated, Version 1) XX DE LTR from retrotransposable MaLR element - a consensus. XX KW MaLR family; MLT1E; MLT1; MLT1E2. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-593 RA Jurka J.; RT "MLT1E2."; RL Direct Submission to Repbase Update (SEP-1998). XX DR [1] (Consensus) XX SQ Sequence 593 BP; 188 A; 118 C; 144 G; 140 T; 3 other; tgccctaatc ctggaaccta tgaatacgtt acatnacata gcaaaaggga ttttgcagat 60 gtaattaagg ttactaacct taaaataggg agattancct ggattatctg ggtgggccta 120 atctaatcac atgagccctt aaaagcagag agttttctcc ggctgatagc aggaaatgta 180 aagcagaaga ggaagtcaga gagatttgaa gcatgagaag rattcgatgt accattgctg 240 gctttgaaga tggagggggc cacgatgcaa gaaatggaag aggcctctag gagctgagag 300 tggctcccag ctgacagcca gcaaggaaat ggggacctca gtcctacaac cacaaggaac 360 tgaattctgc caacaacctg aatgagcttg gaagtggatt cttccccaga gcctccagat 420 aagagcccag cctagctgac acctttgatt tcagccttgt gagaccttaa gcagagaacc 480 cagttaagcc tacctggact tctgacctac agaactgtga gataataaat gggtgttgtt 540 ttaagctgct aaatttgtgg taatttgtta cacagcaata gaaaactaat aca 593 // ID RLTR19C_MM repbase; DNA; ROD; 507 BP. XX AC . XX DT 30-JAN-2004 (Rel. 
9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse subfamily of LTR retrotransposons - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR19C_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-507 RA Pavlicek A. and Jurka J.; RT "RLTR19C_MM - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. RLTR19 subfamily (79% identity). Individual CC copies are ~89% identical to the consensus. 6 bp TSDs. XX SQ Sequence 507 BP; 130 A; 115 C; 87 G; 175 T; 0 other; tgttgaggcc tgctcctttt aacataactg tagccatttt gtattcagtc tccattttgc 60 tcataaggtg aaattaagtt caggttctca gactctactt cccagaagta attatccata 120 gctgacactg aagaatgcta ctaataagcc aaaaagttat ggttgaatca cttgtactct 180 gttatctcaa tgttctgaaa ttcccctgtt caccacctgt ccaccaccct tcttacctca 240 ctcaggacca atcagcttaa aggttagctg ataatacttt gtctagttag ccaaaaatgt 300 tgtaccactt cactgcttgc cttttaaact tttgaacctg gtttttccta taaaaagcct 360 gccctgagga cagactggtg ccacaattag gtttttcctt cttgtggtcc tgaacgttca 420 gtattatggt gtgtgttcaa taaactattc ttgcttaact gagattggtg tttgtatggt 480 ttgtgtggca attcccaaac cccaaca 507 // ID BC1_Cpo repbase; DNA; ROD; 124 BP. XX AC . XX DT 06-APR-2010 (Rel. 15.04, Created) DT 06-APR-2010 (Rel. 15.04, Last updated, Version 2) XX DE SINE Non-LTR Retrotransposon from Muridae. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW BC1_Cpo. 
XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-124 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 10(4), 630-630 (2010). XX DR [1] (Consensus) XX CC ~94% identical to consensus. XX SQ Sequence 124 BP; 49 A; 26 C; 29 G; 20 T; 0 other; ggggctgggg atttggctca gtggtagaac gcttgcctag caagctggaa gccctgggtt 60 cggtcctcag caccaaacct gaaaaacaaa aaatccataa aaaaaaaaca caaaagataa 120 aaaa 124 // ID ERVB4_3-LTR_MM repbase; DNA; ROD; 510 BP. XX AC . XX DT 11-FEB-2004 (Rel. 9.01, Created) DT 11-FEB-2004 (Rel. 9.01, Last updated, Version 1) XX DE Mouse endogenous betaretrovirus ERVB4_3 LTR sequence - a DE consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; KW endogeneous betaretrovirus; MmERV-B4_AC124523; ERVB4_3-LTR_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Baillie J.G., Van de lagemaat N.L., Baust C. and Mager L.D.; RT "Multiple groups of endogenous betaretroviruses in mice, rats, RT and other mammals."; RL J. Virol 78(11), 5784-5798 (2004). XX RN [2] RP 1-510 RA Gentles A. and Jurka J.; RT "Mouse endogenous retrovirus ERVB4_3 consensus."; RL Direct Submission to Repbase Update (31-DEC-2003). 
XX DR [2] (Consensus) XX SQ Sequence 510 BP; 143 A; 135 C; 92 G; 134 T; 6 other; tgttgggagc caaaactaaa ttctttataa taatccaagt gaaaaagatg actaccaagt 60 cttaagacat aaagytggam ctcctcccca agctgactaa rtccagacaa aggtcaccaa 120 cggactgttc ccagttggac ctcctcccca agctgactaa gtcagatgaa ggtcaccaag 180 gactgtttcg agaattacca caagatgatc cmagacccga ctgacgcaaa cagcagatgt 240 cataaccaca agatgtactt ttagatatca tanctcccct tgtagtcacc aaggaaaatt 300 accactgccc cctcccccgt gcccttctgc gtaagggtta tttccccttt gatctttttg 360 tataaaaact acaagttttg ctgaatacaa tgagaccttg acaagattca gatttggctc 420 tgtgtcgttt ctgtgcttgg tcyccgtttc tctctcaccc ccatttggtt ttcaggatga 480 gtcccctcga gacccacgaa taactggacc 510 // ID L1MED_5 repbase; DNA; ROD; 959 BP. XX AC . XX DT 11-FEB-2000 (Rel. 7.1, Created) DT 11-FEB-2000 (Rel. 7.1, Last updated, Version 1) XX DE L1MED_5 LINE1 repetitive element - a consensus. XX KW LINE1 repeat; L1MEC_5; L1MED_5. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-959 RA Jurka J.; RT "L1MED_5."; RL Direct Submission to Repbase Update (JUL-1999). XX DR [1] (Consensus) XX CC Homologous to L1 ORF1. Most similar to L1MEC_5 (71%) and L1P_MA2. 
XX SQ Sequence 959 BP; 512 A; 108 C; 127 G; 200 T; 12 other; aaatcacaaa aacacacaaa gaaacaagac accantgagt aaatcagcag aaacaataaa 60 caacagacac acanagactt cagatatttg gaattttaga taaataatat aaaataaata 120 ttttaatata tttaaagaaa taaaaaatta aaaaattaaa aaaaaaaaat aaaaaaaatt 180 attaaaaaaa ttaaaaagca gatttaaaaa aaaaaaaatc aaatagaact tttagaaata 240 aaaaatataa taattaaaat ttaaaaattt cattggaaag ttttaacagc agattagacc 300 aagtagaaga gagaattagt gaantggaag atagatttaa agaaattatc cagaatgaag 360 cacagagaga aaaaaaaaaa aaaaataaaa aaaaaaaaaa aaaatatgca aaaaatatgg 420 gaaaatttca aaatatntaa aatatatcta ataagaattt gtgttccaaa aagagagaat 480 agagagaatg aaaaanggga agaaaaaata tttgaagaga taatggctgn aaaattttcc 540 agaattgatg aaagacatca atcctcagat tcaagaagct caaagaatac caagcagaat 600 aaatacaaaa aaatttacat ctagacacat aataatcaaa ctgtcaaaag tcaaagataa 660 aaaaaagatc ttaaaagcag ccagagagaa aagatagatt acctaaaaag gagcaacaat 720 aagactaaca gcagacttct cancagaaac catacaagcc agaagacagt ggaatgaaat 780 ctttaaagtg ctgaaagaaa ataaaaaant atcctnctat caatctagaa ttgtatatcc 840 agtgaaacta accttcaaaa atgaaggaga aataaaaact ttttcagaca agcaaaaana 900 anagctgagg gaatttatta ccaccagacc tgcaagaaat nctaaaggaa gtacttcag 959 // ID MER104A repbase; DNA; ROD; 698 BP. XX AC . XX DT 26-JAN-2000 (Rel. 7, Created) DT 26-JAN-2000 (Rel. 7, Last updated, Version 1) XX DE Non-autonomous DNA transposon - a consensus. XX KW DNA transposon; MER104A. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-698 RA Kapitonov V.V. and Jurka J.; RT "MER104A."; RL Direct Submission to Repbase Update (JAN-2000). XX DR [1] (Consensus) XX CC MER104A has 20-bp TIR which includes putative duplication of CC TA target-site. There are about 100-200 copies of MER104A in CC the human genome, they are ~74% identical to the consensus CC sequence. 
XX SQ Sequence 698 BP; 227 A; 130 C; 102 G; 232 T; 7 other; tatttcattg aatctaagat gccatcgatt ataagataca ycattatttt atgtatcact 60 aagaaaaaaa aatgctgcca attaaactat gacacaatgc ttttcttatc attagaattt 120 twtattttat acttattgaa agagctcttt tagatttatt tagacataga tttttatcat 180 atatcactct tgtgcataca taaaaaggaa aatataagta aaataaattg gttaaggtat 240 tcctaaaact tcttcacatt cagagtccaa ctcttctgaa tcactttcta ctcagaatca 300 tcratgtcca ttttcccwsa catcgtattg tcccatcaag agcgttggtg atgcagcatt 360 tcttaaaaga gtgctccact attgtctgga ttttcttcca agccgctgac atccattctg 420 ccacctgaag gtgtcaatag aarattttca gacaactatg attcatgttc cttcctcaaa 480 tggtccttaa atgatttgtt gactgaaata tcaagragtt gcaattgtcc agtcatgcca 540 ctaggaataa actatgttca ttcatgtata ggcaatgaca actacatcac aactgccgcc 600 tggctgacag caattgtaag atgccatcga ttgtaagacg catcccgatt tcagagatgt 660 taaatgtgaa aaatgtgtat cttagaatca atgaaata 698 // ID LTR6C_Cpo repbase; DNA; ROD; 368 BP. XX AC . XX DT 19-SEP-2009 (Rel. 14.1, Created) DT 19-SEP-2009 (Rel. 14.1, Last updated, Version 3) XX DE Long terminal repeat of ERV3 endogenous retrovirus: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR6C_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-368 RA Jurka J.; RT "Endogenous retroviruses from guinea pig."; RL Repbase Reports 9(10), 2152-2152 (2009). XX DR [1] (Consensus) XX CC >86% identical to consensus. 
XX SQ Sequence 368 BP; 74 A; 93 C; 103 G; 98 T; 0 other; tgttatggtt tgtgtttgga tgtcccccca aagcctcatg cagtcatgga ggcggctgag 60 tctagcgttt gtgattggtt cgttgtctag cgctgggatt ggcggtgtga gtgctgcacc 120 cgcccagggg cgggtagccg gcatacaata taagggaggg aaggaggcgt gcttttccct 180 tttgcccttt tcgctcttgt cgcttgcggc cgccatgaac ggctgcccct tagccacgcc 240 accctgcctt ggagccagct gagtatggac tgaaacctcc aaaaactgta agctaaataa 300 acctttcctt ccttcatttt gggcgtcagg tattttgtct cagcaacgag agaaaagtaa 360 ccaagaca 368 // ID 5S_CPo repbase; DNA; ROD; 138 BP. XX AC . XX DT 02-APR-2010 (Rel. 15.03, Created) DT 02-APR-2010 (Rel. 15.03, Last updated, Version 4) XX DE 5S-derived retropseudogene - consensus. XX KW Nonautonomous; 5S_CPo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-138 RA Jurka J.; RT "SINE-like 5S-derived retropseudogene from guinea pig."; RL Direct Submission to Repbase Update (17-FEB-2010). XX DR [1] (Consensus) XX CC >97% identical to consensus. >1000 copies. XX SQ Sequence 138 BP; 39 A; 33 C; 39 G; 27 T; 0 other; gtctacggcc ataccaccct gaacgcgccc gatctcgtct gatctcggaa gctaagcagg 60 gtcgggcctg gttagtactt ggatgggaga ccgcctggga ataccgggtg ctgtaggctt 120 taaaaaaaaa aaaaaaaa 138 // ID RLTR42_MM repbase; DNA; ROD; 532 BP. XX AC . XX DT 05-FEB-2004 (Rel. 9.01, Created) DT 05-FEB-2004 (Rel. 9.01, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; RLTR42_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-532 RA Pavlicek A. and Jurka J.; RT "RLTR42_MM - a family of LTR retrotransposons."; RL Repbase Reports 4(1), 29-29 (2004). XX CC Individual copies are ~92% identical to the consensus. 
Distantly CC related to LTRIS3 (60% identity). XX SQ Sequence 532 BP; 138 A; 141 C; 116 G; 137 T; 0 other; tgagggaccc ttaagggctt ttttccagtc ccacaccctc cctacccttt acagcttccc 60 ccctcccacc tggggtcaag gctaggccaa gttccattct ccacccacag gaacattctt 120 gggaggagta aaaaaaattc tcagaaaaaa cctgcagaat gtactgactg atagtcacct 180 gaccctctgg aaagtcccag gtagagttca aatgcatgtc atgatctgcc catgcttgct 240 agccaataga tttaaaggtc aatatgctta gccaataagt ttgaactgta accttgctga 300 tgtaacctgt gcccctaaaa aagtataaaa aactgcttgt aatagccatt cgggggtcgc 360 cttcttagtc actcgccttg agggactaat tgaaggtcgg tcgaccccga cgcgcccagg 420 aaaaataaac ctcttgcttt ttgcatcgat ctgcagctct gctcttggtg tctcactcgg 480 gggggcatct caaagtaagt accctgactg actgagggtt aggggtctta ca 532 // ID L1MC4 repbase; DNA; ROD; 2724 BP. XX AC . XX DT 18-APR-1997 (Rel. 6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1MC4) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; L1MC4 subfamily; MER42C; KW L1MC4. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1187-2724 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX RN [2] RP 1-2724 RA Smit F.A.; RT "L1MC4."; RL Direct Submission to Repbase Update (1996). XX DR [2] (Consensus) XX CC bp 1 to 1180 are temporarily identical to L1MC3 CC Replaces MER42C. 
XX SQ Sequence 2724 BP; 1020 A; 421 C; 541 G; 672 T; 70 other; cttgtatcca gaatatataa agaactctta aaactcaaca ataaaaaaac aaacaaccca 60 attaaaaaat gggcaaaaga tctgaataga catctcacca aagaagatat acagatggca 120 aataagcaca tgaaaagatg ctcaacatca tatgtcatta gggaaatgca aattaaaaca 180 acaatgagat accactacac acctattaga atggctaaaa tcaaaaacac tgacaacacc 240 aaatgttggc gaggatgtgg agcaacagga actctcattc attgctggtg ggaatgcaaa 300 atggtacagc cactttggaa gacagtttgg cagtttctta yaaarctaaa catacaatta 360 ccatatgatc cagcaatcay actcctaggt atttacccaa gtgaattgaa aacwtatgtc 420 cacacaaaaa cctgcacacg aatgtttata gcagctttat tcataattgc caaaacttgg 480 aaacaaccaa gatgtccttc aataggtgaa tggataaaca aactgtggta catccataca 540 atggaatatt attcagcgat aaaaaggaat gaactactga kacatgaaaa gacatggatg 600 aatctyaaat gcatattgct aagtgaaaga agccagtctg aaaaggctac atactgtacg 660 attccattta tatgacatty tggaaaaggc aaaactatag agacagaaaa cagattagtg 720 gttkccagrg gttgagagat gggaagtggg gatgrytgca aargtaaagc acargggatt 780 ttttagggtg rtaaaactat tctgtataaa ctattctgta tgatactatg gtggtggata 840 cacgacanta tgcatttgtc aaaacccaca gaacttgtca aaacccacag aactttacag 900 cataaagagt gaactttaat gtatgyaaat tttaaaaaat catttargag atcgggggat 960 cycaggatgg aatacagamt gtgacaaaag aatctaactg tattacaaat gtatgaaaca 1020 acctcactga agggratggg ggaaaaaggt gctgacctaa gtaactttgg aaatgagtgg 1080 agtctgtaag actaaaggca aaaggaactg cacataagca ctgtactcta gttrataaag 1140 ttgtttycca yaggggtaya ggttaacaat tctgaaacta ttttatatgt attgtaggat 1200 tgagcaaata agtaaacgca ttgaagataa tgggagccag gtttctcact gtcggagaag 1260 ggagttacaa atatggaatg ggggaagrct agaatgaacc ctgtggtatt rgattagaat 1320 tggaggtatc agtatgaact catgrttttt aatatayryr yryryryryr tttcctagtt 1380 ctgtccactg agagggccta gaagcaatga caccccagta gcaatgagca cacctagcgc 1440 ccagatcttg gtttctaaat accattctcc actaaaagga accagggctc cttggagaaa 1500 tggctgattc taggactagg gcaggaaatg tacaagatga gcctggaaca tcttgctrtg 1560 ccagaaaata aggaagtgct caaaaaatga trggggtatg tcaaaaggac acagragcca 1620 gcttgaaggg gctcccactg gccaaatctg 
ggacaatttg agcatcaaaa taaataacgg 1680 tagtaatgga ttataactca ttgaataaaa taaatatcca tgagttcata ctaatataaa 1740 taaatgaata aaagaaataa ataaataaga ragaaaggaa agytcttctt acagtagaat 1800 gccaactaay aaatgtagaa ggaataaaat aatagaaaaa tcaccatttg gcaaayacca 1860 cagtaataan tgtttcaggc aagaaccatc gatagatgct aaaattagtg ggcgaaartt 1920 tgatgagaaa cgggatattt gcatagtctc aaagtatctc yccacaagat atttattaat 1980 tayaaaggga aaaatagtra ctttacagta gagaaacctg gcagacacca ccttaaccaa 2040 gtgatcaaag ttancatcac caataatgag acaaattgac atcatgcgcy tcctgatgtg 2100 atgcgccgag aaggacacaa catcacttct gtgatattcy tgccaaaaat gcataacctg 2160 aatctaatca tragaaaata tcagacaaac ccaaattgag ggacattcta caaaataact 2220 ggcctgtact cttcaaaaat gtcaaggtca tgaaagacaa ggaaagactg aggaactgtt 2280 ycagattgac ggaagactag agacatgaca actaaatgca acgcgtgatc ctggattgga 2340 tcctggatcg agaaacggtg ggagtttkta yctataaagg acattattgg gacaattggc 2400 gaaatttgaa taaggtctgt agattagata atagtattgt atcaatgtta atttcctgat 2460 tttgataatt gtactgtggt tatgtaagag aatgtccttg tttttaggaa atacacactg 2520 aagtatttag gggtaaargg ncatsatgtc tgcaacttac tctcaaatgg ttcagraaaa 2580 aaaatnnata tayataarya gagaatgata aagcaaatgc ggcaaaatgt taacaattgg 2640 tgaatctggg tgaagggtat acgggwgttc tttgtactat tcttgcaact tttctgtaag 2700 tttgaaatta tttcaaaata aaaa 2724 // ID RLTR11A2 repbase; DNA; ROD; 592 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; LTR; ERVK; KW RLTR11A2. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-592 RA Pavlicek A. 
and Jurka J.; RT "RLTR11A2 - a subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. ~80% identical to RLTR11A. Individual copies CC are 88% identical to the consensus. 6 bp TSDs. XX SQ Sequence 592 BP; 175 A; 109 C; 166 G; 142 T; 0 other; tgttgaggtt tggtcttttg ctatgtattg taatgctaaa atactggccc cccccaaggc 60 ctggttgccc ccagggagaa tgagagaaca caagaatctg cacataccca agtgacaccg 120 acccaagtga tgctatgtaa ccttgccctc ctcctacccc ccaagttatc cctgattggt 180 gaataaagat gcctacagcc tatagctggg cagaagagag atagagcagg gtttttgggg 240 gttcctgggc ttgggggtac tgaggcaaga gacctcgata aaggaggagg ggagagagaa 300 ggttggagag aggagaagac aaccatgggg taggtgagtc atgaaaacat ggccatgagg 360 gctggccaat tggagttaag agcagcccag atggaacatg gcaagttata actcggggtt 420 attgatgggg aagtagattc taatagctta gagggtaaga tatctgccca gctctagtgc 480 tgattaaagg cttattataa ataataaaag ttgtgtgtct tttatctggg aactgaatga 540 tcaaaggtgg ggtagaaacc ccctaattga gattaaatat ttactacaac ca 592 // ID LTR6I_Cpo repbase; DNA; ROD; 399 BP. XX AC . XX DT 20-OCT-2009 (Rel. 14.11, Created) DT 20-OCT-2009 (Rel. 14.11, Last updated, Version 3) XX DE Long terminal repeat: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR6I_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-399 RA Jurka J.; RT "Long terminal repeats from guinea pig."; RL Repbase Reports 9(11), 2876-2876 (2009). XX DR [1] (Consensus) XX CC ~79% identical to consensus. 5 bp TSD. 
XX SQ Sequence 399 BP; 77 A; 95 C; 114 G; 112 T; 1 other; tggtttagat ctaaaatgtc ccccaaaagt ctcatgtgtt cagaggtggg gcttttggaa 60 ggtgattgga tcgtgggggc gctatactca tcagtggatt aatccactga tgagttcata 120 gctgaatgtg ctgttaggag gtggggcccg gttggaggag gtgggtcact ggggntgacc 180 tggaaggtat ttctctccgg ctttcctttc tctctgcttc ctggctgcca tgggttgagc 240 agctttcctc cgccaggccc ttccgccatg ccgtttctgc cttggagcca gccgaccatg 300 gactgaaccc tctgaaaccg tgagccaaaa taaacctctc ctcctttaag ttgtgggtgt 360 cgggtatttt gtcccagcga cggaaaagtg actaataca 399 // ID RLTR11A repbase; DNA; ROD; 527 BP. XX AC . XX DT 26-SEP-1997 (Rel. 2.08, Created) DT 26-SEP-1997 (Rel. 2.08, Last updated, Version 1) XX DE Long terminal repeat of endogenous retrovirus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW Endogenous retroviral long terminal repeat RLTR11A; RLTR11A. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-527 RA Smit A.F.; RT "RLTR11A."; RL Direct Submission to Repbase Update (30-NOV-1996). XX DR [1] (Consensus) XX CC RLTR11A flank MYSERV internal sequences. 6 bp duplication sites. XX SQ Sequence 527 BP; 144 A; 95 C; 157 G; 126 T; 5 other; tgttgaggtt tggtctkttg ctatgtattg taatgctaaa cactggcccc caagacctgg 60 ttgcccccag ggatgagaat ccgcacatac acccaagtga tgctatgtga ccttgccccc 120 aagttatctc tgattggtga ataaagatgc ctacagctta tagctgggca gaagagagat 180 gggcggggtt tgggttcccg ggcttggggt cggaggagaa ccacgaggag ggagagtgta 240 ggwgaagaga gagrgaagcc gccatgggtt aggtgagtca tgaaaacatg gccatgaggg 300 ctggccaatt ggagttaaga gcagcccaga tgaaacatag caagtwataa ctcgggttat 360 cggcaggaaa gtagattcta atagcgtaga gggtaggtat ctgcccagct cttgtgctgw 420 ttaaggctta ttgtaaatat aaaggttgtg tgtgtctttt atccgggaac taaatggtca 480 aaggcggagt agaaacccca ggtcgagatt aaataatttc tacaaca 527 // ID L1-3_Cpo repbase; DNA; ROD; 6652 BP. XX AC . XX DT 17-SEP-2009 (Rel. 
14.1, Created) DT 17-SEP-2009 (Rel. 14.1, Last updated, Version 3) XX DE L1 non-LTR retrotransposon: consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-3_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-6652 RA Jurka J.; RT "L1-type non-LTR retrotransposons from guinea pig."; RL Repbase Reports 9(10), 2172-2172 (2009). XX DR [1] (Consensus) XX CC ~88% identical to consensus. ORFs are truncated by mutations. XX FH Key Location/Qualifiers FT CDS 1704..2501 FT /product="L1-3_Cpo_1p" FT /translation="MEAQHRKEMESIKQIQADIQENKNSTESIRSRLGQCE FT DRISDIEDRLAVSDQEKKDFSKLARDHEKSIQQLLDETKKNNLRLIGVNEQ FT AGDTTNDIKNMFTDIVTENFPGREKEFDIQISEAYRTPISNDQKKSTARHI FT IVKIPEIQHKNRILKAVREKKQITYKGKPIRITADFSAQTLKSRRAWSEVL FT QVLKENNFQPRLMYPAKLSFKIDGEIRYFHDKEHLRKFMTTKPALQNVLKD FT ILERDNKDYSSMNPNRRNPPGKANN" FT CDS 4801..6333 FT /product="L1-3_Cpo_3p" FT /translation="ERNQGNNTFYNSIQKMKYLGINLTKDVKDLYIENYST FT LKKEIEEDIRKWRDIPCSWVGRTNIVKMAILPKLLYRFNAIPIKIPIAYLT FT ELEKTILKFIWNQKRPRIAKAILGNKDKTGGITIPDLKLYYKATVIKTAWY FT WQKTEDQWNRLEDAETTPDTLSHLIFDKGAKHIHWKKDSLFNKWCWKNWLY FT TCRRLKLDPYLSPCTKLKSEWIKDLNIKTETLNLLEDRVGKTLEDIGVGKD FT FMNKTQIAQELSQRINNWDLTLLKSFCTARETTNRVKRQPTTWEKMFASCS FT SDKGLLSRTYKELKKISPPRFKDPIQKWASEMNTHFSDEEIQTANKYVKKC FT STSLVIREMQIKTTLRFHLTPERMASIKKSTNNKCWRGCGEKGTLLHCWWE FT CRLVQPLWKSVWRFLKKLGLEVPFDPAIPLLGIFPKELKTSYHSDICAPMF FT IAAQFVIARSWKQPKCPSTEEWIKKMWYFYTMEYYSAIKKDKLXTFIGKWM FT XLENILISEINKTHV" FT CDS join(3071..3655,3680..4846) FT /product="L1-3_Cpo_2p" FT /translation="MDLVDVYRVFHPTTSEYTFFSAAHGTFSKIDHILAHR FT TCLSKCKGIEIIPCILSDHSAMKLEIHAKGNHKNFINTWKLNSTLLNNQWV FT TDEIKEEIRQFLQLNDNDNTTYRNLWDTMKAVLRGKFIAVNTHIRKTEQAH FT INNLMLNLKLLEKEEQAKPKARTREEIIKIRAEINTLETKKQYKESMNRRV FT GSLKDPTLSKKKRKEKTQIHAIRNEKGEITTDPAEIQKIIYTYFENLYSNK FT MENTEEMDRFLDTYELPKLNQEDIKILNNPISINEIEDVIKNLSTKKSPGP FT 
DGFTAEFYKKFSEDLTPLLLKLFNEIEREAILPNSFLEASITLXPKPEKDP FT TKKENYRPISLMNIDAKILNKILANRLQQIVKKIVHHDQVGFIPGMQGWFN FT IRKSINVIHHINRAKDKNHMIISIDAEKAFDKVQHPFMIKTLQKIGIDGLY FT LNLIKAIYDKPTANIILNGQKLKAFTLKSGTRQGCPLSPLLFNIVLETLAG FT AIRQEKEIKGIKIGKEEVKLSLFADDMILYLEDPKNSTKRLLDLISKFSNV FT AGYKVNAQKSIAFLYTNNKLTEREIRETIPFTIASKK" XX SQ Sequence 6652 BP; 2522 A; 1342 C; 1295 G; 1486 T; 7 other; ggagaggatt ccgggaagat ggcggagcgg taagcagcag agaaaacagc ctctctgaga 60 gtcaccgtcc agacgctgga atatagcatc gtgaggaaga caagaaggaa ggaagtgggt 120 cccctggacc caggaagaat ctgaagtgcg aactgaagcc cacagagaga gtaaaaagaa 180 aaacttcaac gaagaagaag gcggtagctg gcggcgcctt cgcccgagcc ggactcactg 240 cgcgagagga gctgttcgaa cctggggacg cacggtcacg gtgacggtgg cggctggcga 300 ccccggaggc ctctgcagag agacgcggca gtggagggga gcggcggacg tggaggactg 360 cactggctgg cggtgagtgc agaggactca gtcctcccgc gggagagagc gattggccgg 420 gagccatttt ggagtcgggc cggataagcc cggaacccag aactgctgcc ggcttcctcc 480 agcagctccc agctgggcgc agacccgcgg aaactgcttg gactgcggcg agcccagtga 540 gtctggtctc ggggcaggcc tgctgaggtt atccactact ccgccataaa gcaactgctg 600 gcttgctggg atagagcggc acgattcggg actctgtctc cacaggctgt ctactagtga 660 acccagagag aatcctggaa gtggactgtt tgtttttttt ntttgtttgt tttttttttc 720 tctttctcct tcattntttt tttatgagtg tgagagtggt tcctaacctc tttgtgggtg 780 tggttctgtt ctgttcttcc ttttatctct tccatctctc ttttccttca gtcttctgca 840 ccagtttccc caagttggcg ngatccacct ttgtcctcca accttacttg taatctcttt 900 ctcttcttcc acttctctat tttctcctta atccatttta aattttaacc cccgagggca 960 aaattgacac attacggact tacatattgc atataattat ctggtttgtt ttacattttg 1020 ttatataatc acaaatctag ttcatacggt taaaagttgc aaattaaggg ttagctcagt 1080 ttgttgtttt ctaggctgtg tttgatgcta ttggtaccac tacagccaaa actgacataa 1140 ttgttctaaa cagctgttgg ccctgataat ccatgactga gattaactaa gggtcacaga 1200 aaatactgtg gcacagtgcc aaagaaccag ggactaggcc aaacgagcaa caacttctca 1260 tctccaacaa actgaagagg taagatcttc caacagactc aaaacccaaa aaccagctac 1320 acctggaaga tacctcagag aagtaatcat cacctaaggg tcctaaatac taccaataaa 1380 
gcaacccacc ataggcaact tacaagcatc accccaggaa ttaaactaaa aaagaaaaga 1440 aatgagaaga cacttaaaca agatgcaacc ccagacatcc agttcaccaa aggcaaaccc 1500 agggggcaat gaggtgggac aatccgaaag ccccccacca ggtctctccg aagtggtaac 1560 aacaaagcaa ttgaatgagg ctctggaaaa aatgaaatca gaaataatct ctcagataac 1620 tacagagatt accaaaatac acaccatgct tagcgacgtt agagcagaca cagacaaaaa 1680 aaatagacga gcttaagcaa attatggaag cgcagcatag gaaagaaatg gaatccataa 1740 agcagattca agcagatatc caggagaata aaaactccac tgaaagtatt cgaagcagat 1800 taggccaatg tgaggacagg atctcagata ttgaagacag gcttgcagtt agcgaccagg 1860 aaaagaaaga tttctcaaaa ctagcaaggg accatgagaa atcaatccag cagctgctag 1920 atgagacaaa gaagaataac ttaagattaa ttggagtcaa tgaacaagca ggagatacca 1980 caaatgacat taaaaacatg ttcacagaca tcgtgacaga aaactttcct ggtagagaaa 2040 aagaatttga catacaaata agtgaggcat atagaactcc aattagcaat gatcaaaaaa 2100 aatctacagc caggcatatc atagtcaaga ttccagaaat tcaacacaag aatagaatct 2160 taaaagctgt tagggaaaag aaacaaatca cctataaagg aaagcccatc agaatcacag 2220 ctgacttctc agcacaaact ctcaagtcaa ggagagcttg gagtgaagta ctccaagttc 2280 taaaggaaaa caactttcaa cccagattga tgtaccccgc aaaactatct ttcaaaattg 2340 atggagaaat aagatacttc catgacaaag aacatctgag gaaatttatg accaccaaac 2400 cagcactcca aaatgttctg aaggacattc tagaaagaga taataaggac tacagctcca 2460 tgaaccccaa cagaaggaat ccaccaggga aggcaaacaa ttagagggaa ggaaaaaaag 2520 ggggggagcc aacaacagga aaataacatg acaaggataa gtcaatacat atcagttnta 2580 accatcaatg ttaatggtct caattcacca gtcaaaagac acagactggc agaatggatc 2640 aagaagcaag atccaacaat atgctgcata caagaaacac atttaattca aaaagaaatt 2700 catagactga aagtcaaagg atggaaaaca atactccatg cgacaggaac ccagaaaaaa 2760 gcaggggtag ctattctatt tgcagacaaa gtgaatttca agccaagact gattaagaga 2820 gataaagaag gtcactacat cctcgtaaag ggaacaatcc aagaggaaga gataacaatc 2880 ataaatatat atgcaacaaa taccagtgca cccaactata taaaacaatt actaacggac 2940 atgaaaaatc aaataagccc taatacaatt gtattgggag acctcaatac cccactgtca 3000 caaaaagaca gatcaactag acaaaaaaat caataatgaa atattagaac taaatcaaac 3060 ttttgagcaa 
atggacttag tagatgtcta tagagtgttc caccccacaa catcagaata 3120 cacattcttc tcagctgcac acgggacatt ctccaaaata gaccatatac tggctcatag 3180 aacatgctta agtaaatgca aaggaattga aattatccca tgcatactat cagaccacag 3240 tgcaatgaaa ttggaaatcc atgccaaagg aaaccacaag aacttcataa acacatggaa 3300 attaaatagc accctactga acaaccaatg ggtcacagat gaaattaaag aagaaattag 3360 acagttttta cagttgaatg ataacgacaa tacaacatac cggaacttgt gggacacaat 3420 gaaagcagtc ttaagaggga aatttatagc agtaaatacc catatcagga aaacagagca 3480 agcgcatata aacaacttaa tgttgaacct caaacttcta gaaaaagaag agcaggctaa 3540 accaaaagct cgtacaaggg aggaaattat aaagatcaga gcagaaatta acacactgga 3600 gactaaaaaa caatacaaag aatcaatgaa tcgaagagtt ggttctttga aagattaaat 3660 aaaattgaca aacccttagc caaccttatc aaaaaaaaaa aggaaagaaa agactcaaat 3720 ccacgcaatt aggaatgaaa agggcgaaat cacaacagac cctgcagaaa tccagaaaat 3780 catctatacc tactttgaaa acttgtattc taataagatg gaaaatacag aagaaatgga 3840 cagattccta gatacatatg aattaccaaa gctgaatcaa gaagatataa aaatcctaaa 3900 taacccaata tcaataaatg aaatcgaaga tgtaattaaa aacctatcaa caaagaaaag 3960 cccaggccct gatggattca ctgctgaatt ctacaagaaa tttagtgaag acctaacacc 4020 actacttctc aaactcttta atgaaattga aagggaagca atactcccaa attcattcct 4080 ggaagcaagc attactctga tnccaaaacc agagaaggac ccaactaaga aagagaatta 4140 caggccaatc tccctaatga acattgatgc aaaaatcctc aataagatac tggcaaacag 4200 actgcagcaa atcgtcaaaa agattgtaca ccatgatcaa gtgggattca tcccagggat 4260 gcagggatgg ttcaacatac gtaaatccat aaatgtaata caccatatta acagagccaa 4320 ggacaaaaat catatgatca tctcaataga tgcagaaaaa gcttttgaca aagtccagca 4380 cccattcatg ataaaaaccc tacagaaaat tggaatagat ggtctttatc tcaatctgat 4440 aaaggccatc tatgacaaac caacagccaa catcatacta aatggacaaa aactgaaagc 4500 tttcactcta aaatcaggaa caagacaagg atgtccactg tctccactct tattcaacat 4560 agtcctggaa actttagccg gagcaattag acaagagaaa gaaataaaag gaataaagat 4620 aggaaaggaa gaagttaaac tatcgttatt tgcagatgac atgatactct acttagaaga 4680 ccccaaaaac tccaccaaaa gacttctaga tttaataagc aaattcagta acgtagctgg 4740 ttacaaagtc aacgctcaaa 
aatcaatagc cttcctgtac acgaataaca aactcactga 4800 gagagaaatc agggaaacaa taccttttac aatagcatcc aaaaaatgaa atacctagga 4860 atcaatctaa cgaaggatgt aaaagacctc tacattgaaa actacagtac tctgaaaaag 4920 gagattgaag aggatattag aaaatggaga gatatcccgt gctcttgggt aggaagaacc 4980 aatatagtga aaatggccat acttccaaaa ctgttataca gattcaatgc aatcccaatc 5040 aaaattccaa tagcatatct cacagaatta gagaaaacaa tcctaaaatt catctggaac 5100 cagaagagac ctagaatagc aaaggcaatc ctgggcaaca aagacaagac agggggcatc 5160 acaatccctg acttgaaatt atactataaa gctactgtga taaaaacagc gtggtactgg 5220 caaaaaaccg aagatcaatg gaacagatta gaagatgcag aaacaactcc agacacgcta 5280 agccacctta tctttgacaa aggtgccaaa cacattcact ggaagaaaga tagcctcttt 5340 aataaatggt gctggaaaaa ctggctctac acatgccgaa gattaaaact ggacccatat 5400 ctttcaccat gtactaagct aaaatcagaa tggatcaaag atcttaatat taaaacagaa 5460 acattaaatc tgctggaaga tagagtaggt aaaactcttg aagacattgg tgtaggcaaa 5520 gactttatga acaaaaccca gattgcacag gaactatcac aaagaatcaa caactgggat 5580 ctcaccctac tgaaaagctt ttgcacagca agagaaacca ccaacagagt gaagagacaa 5640 cccacaacgt gggagaaaat gtttgccagc tgttcctcag acaaaggatt actatcgaga 5700 acatataaag aactaaaaaa aatcagtcct cccagattta aagacccaat ccaaaaatgg 5760 gcatctgaga tgaatacaca cttctcagat gaagaaatac aaacagcaaa taaatacgtg 5820 aaaaaatgtt caacatcact ggtcattcga gagatgcaaa ttaaaacaac actgagattc 5880 cacctcaccc cagaaagaat ggctagtatc aagaaatcaa ccaataacaa atgctggagg 5940 ggctgtgggg aaaaaggaac ccttctccac tgttggtggg aatgcagact ggtgcaacca 6000 ctgtggaaat cagtgtggag attcctcaaa aaactaggat tagaggtccc attcgaccca 6060 gctattcccc ttctgggtat cttcccaaaa gaattaaaaa catcatacca cagtgatata 6120 tgtgcaccca tgtttatagc agcacagttt gtaatagcca gatcttggaa acaacctaaa 6180 tgcccatcga ctgaggaatg gataaaaaag atgtggtatt tttatacaat ggagtactac 6240 tcagccataa agaaggataa actgganact tttataggta aatggatgca ncttgagaac 6300 atactcatta gtgaaataaa taagacacac gtgtaaacat cgtattgttt cattagtgtg 6360 agaagctgaa agaaaaaaca tatgtatatg tgtatgtata tgtatataaa atagttaaga 6420 tagttatact actgatttct ctgagttcac 
tgaagagaaa gatttgtgct ggtattgttc 6480 ggacaattga atgcagatgt cttgtgctct gggtgaatga tttggggatg acaatgcgca 6540 atgtatgtac cattgttgcc ccctgattga cgtgttttga aattttgttg tagttttttt 6600 cttttttttc tttttatctt taaataaaat ggatttgaaa aaaaattaaa aa 6652 // ID RLTR32A_MM repbase; DNA; ROD; 611 BP. XX AC . XX DT 14-OCT-2002 (Rel. 7.09, Created) DT 21-AUG-2008 (Rel. 7.09, Last updated, Version 2) XX DE Mouse putative long terminal repeat - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; KW Long terminal repeat; retrotransposon; RLTR32_MM; RLTR32A_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-611 RA Jurka J. and Drazkiewicz A.; RT "RLTR32A_MM: putative LTR from Mus musculus."; RL Repbase Reports 2(9), 17-17 (2002). XX DR [1] (Consensus) XX CC 83% similar to RLTR32_MM (bases 89-596). XX SQ Sequence 611 BP; 166 A; 143 C; 118 G; 184 T; 0 other; tgtgctcaca gataagcact agggaaacca ggaatgttaa acacacaggg ttgtctcttc 60 aaagggagag atgtataacc tgttttctac ccagaaggct gggagtggat gtggttaata 120 gcctggtgac ctttgtgcta tctctgtatg gccagaccac accagatccc ccagaccctt 180 atcatgagct ttttttccct agacccttat catgagcttt gtttctcaat ctatagaacc 240 catctttatg gaagatgtca catgtactta tagcctggtg acctctatac tatctaggtc 300 ctagaccaca ccagatccta ggctcttaac tatagaaccc atctttatgg tcacatgtac 360 tttacaaaaa gagtttcaat gtagtcatgt cttaagcttt attgtgcaca tgggattgag 420 ttaattatca ccaggaaaat tttcaccaaa ttgtgctgtg cttaaatatg cctaaaataa 480 actactcagg gtcagacact ctagaagttt gaaccaacac tggctactga gttgtgttga 540 actgaactgc cttcttgcct cccacagatc gttactcttg ctggccttgg tgttcacaga 600 gacccctcac a 611 // ID L1-2_Cpo repbase; DNA; ROD; 6160 BP. XX AC . XX DT 20-JUN-2009 (Rel. 14.07, Created) DT 20-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE L1 non-LTR retrotransposon: consensus. 
XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1-2_Cpo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-6160 RA Jurka J.; RT "L1-type non-LTR retrotransposons from guinea pig."; RL Repbase Reports 9(7), 1409-1409 (2009). XX DR [1] (Consensus) XX CC >92% identical to consensus. XX FH Key Location/Qualifiers FT CDS 858..1943 FT /product="L1-2_Cpo_1p" FT /translation="MAKQKSKMQSQTSSPPGASTGDIEIEDFKSNTPEISE FT KIASEIKKKQLHEAIEEIKLEVLSHVKIEITTLVKELLSAAKEEIEKKVQE FT HGKLIETLNEQYRKQAEFMKQTQRDIQETKNSIQSWQNRLNEAEDRISDLE FT DRIAISDHERIELLKITKQHEMTIQQLQDDAKRNNIRLIGISEDAGVNAND FT ITKIFTEVIAENFPNMGKKSDMQISEAYRTPNSHNQHKSTPRHIIIKISDI FT QHKNRILKAVREKTQLTYQGKPIRITADFSAQTLKSRRAWSEVFQALKENN FT FQPRLMYPEKLSFKFNGETRYFHDKEQLKKFTSTKPTLERVLKDILDRDKN FT DHNPQNNNRKKPPGRTSN*" FT CDS 1885..5424 FT /product="L1-2_Cpo_2p" FT /translation="MTTIPRTTTERNHQVGQAIKQGGKAESKQLEHNMAKS FT SQYISVISINVNGLNSPIKRNRLTEWIKKHDPTICCIQETHLTQKETHRLK FT VKGWKTVFHATGTQKKAGVTILFADNVNFKPKMIKRDKEGHYILVSGKIQE FT EELTIINIYAPNTGAPNYIRQILMDMKNQIHKNTIITGDLNSPLSQRDRST FT RQKVSKEIIELNHTCEQLDLVDVYRVFHPTTSEYTFFSAAHGTFSKIDHIL FT SHKTFLSNCKGIEIIPCVISDHSALKLEINGKRNCKTCINTWKLNNNLLNN FT QWATDEIKEEIKQYLQFNENAETTYCNLWDAMKAVLRGKYIAINCHIKKTE FT RAQINNLMLHLKLLEKEEQVKPKAHRREEIVKIRAEINAIETKKIQRINES FT KSWFFERINKIDKPLANLIKKKKEKTQIHAIRTEKGEITTDPMEIQKIMNT FT YFENLYSHRVENTEEMDRFLETYELPKLNQEDIAILNNPISAIEIENVIKN FT LPTKKSPGPDGFTAEFYKKFREDLMPILLKLFNEIEKEAILPKSFLEASIT FT LIPKPEKDPTKKENYRPISLMNIDAKILNKILANRMQQIIKKIIHHDQVGF FT IPGMQGWFNIRKSINVIHHINRAKNKNHMVISIDAEKAFDKVQHMFMIKTL FT QKLGIHGVYLNLIKAIYDKPTANIILNGQKLKAFTLKSGTRQGCPLSPLLF FT NIVLETLARAIRQEREIKGIKIGKEEVKLSLFADDMILYIEDPKNSIKRLL FT EVINKFSSVAGYKVNAQKSIAFLYTNKKLTEREIMEISPFTIASKKMKYLG FT INLTKNVKDLYAENYATLKKEIEEDIRKWKDIPCSWLGRTNIVKMAILPKL FT LYRFNAIPIKIPLTYLTDLEKTLLKFIWKQKKPRIAKAILGNKGKAGGISI FT 
PDLKLYYKATVIKSAWYWQRNRPEDQWNRLEDASTTTKTLSHLIFDKGAKH FT VHWKKDSLFNKWCWKNWLSTCRRLKLDPCLSPCTKLKSKWIKDLNIKTETL FT NLLEDKVGRSLGDIGVGREFMNKTQTARETLQRINNWDLILLKSFCTSKEI FT TNRVKRKPTEWERILASNPSDKGLISRTYKELKNIKPPKFKDPLQKWASEM FT NTQFSDEDIQMANKYMKKCSPSLVIREMQIKTALRFHLTPERIAAIKKSTD FT NKCWRGCRGKRNTAPLLVGV*" XX SQ Sequence 6160 BP; 2484 A; 1212 C; 1134 G; 1329 T; 1 other; ggccccgggg aacgcatgcc gccctcagac cttggcccgt gcagccgcgt gcttcctgcg 60 cgactccagc cggcactgag gaatgccacg ccgccctggg accttggccc gcgcccagcg 120 gacgcccgga cccaggacat ctgctggccc ctcggggaca ctccctacct cctggaccta 180 atactgctgt tgatcttcag ctcccaccct ttctcccaga accttggact acagcggatt 240 aaccgtgcta gcttgaaggg agattcagta actgcctcac ccaaacggaa gatctcagca 300 ggctccagtc agtgtggtga gtatgcccct aaaatagagg tgtacagtgg gtaaataggg 360 ccagtagtaa agctccctcc aactaactgt aagaccttgg tgagtctgac aagagtgtgg 420 gtgactcctt cctgcacaca gaaaacatca ggcagggtga ggagtgtgcc aataaggact 480 cccccaactt ggagaaaatc aaaagggctt atgagacaca caggcacaca atagggacac 540 agtagggcct cagccgacac aaacactata gacagggaga tttggtgacc tggaataacc 600 tgaagataaa gtaatagcag gaactaaaag ttgacctaca tcaatactac taggcttgtc 660 agcccaggaa gaatatagga ctcacccaac aggccctgca gtgggaagtc aggtgactgg 720 cccacctctt cattgtggga cccagcaatt ggacaaacat caaccttaca ttaggacccc 780 aagatacaca ctaccaacag ttgacactag tgggattgaa ggaccagaag gaccaaatat 840 taactcaaaa ggggaaaatg gcaaaacara aaagcaaaat gcaatctcaa acatctagtc 900 caccaggagc aagtacagga gacattgaaa tagaggactt caaaagtaac acaccggaaa 960 tctctgaaaa gatagcgagt gaaataaaaa aaaaacaact ccatgaggct atagaggaaa 1020 tcaaattaga agtactctcc catgtaaaga tagaaataac aaccttagta aaggagttgc 1080 tgagcgcagc taaagaagaa atagagaaaa aagtacagga gcatgggaaa ctgattgaaa 1140 ccttgaatga acaatataga aaacaagcag aattcatgaa acaaactcaa agagacatcc 1200 aagaaacaaa aaattctatt caaagctggc aaaaccgttt aaatgaagct gaggatagga 1260 tttcggacct tgaggacagg atcgcaatta gtgatcatga aaggatagaa cttttaaaaa 1320 taacaaagca gcatgaaatg acaatccagc agcttcaaga tgatgccaag agaaacaata 1380 taagattaat 
aggcatcagt gaagatgcag gagtcaatgc aaatgacata actaaaatat 1440 tcacagaagt aatagcagaa aatttcccaa acatggggaa aaaatctgat atgcagataa 1500 gtgaggcata tagaacgcca aacagtcaca accaacataa atctacaccc agacatataa 1560 taattaagat ctcagacata cagcataaaa acagaatact aaaagctgtt agagaaaaga 1620 cacaactcac ctatcaagga aagcccatta gaatcacagc agacttttca gcacaaaccc 1680 tcaagtcaag acgagcttgg agtgaggtat ttcaagccct gaaagaaaac aactttcaac 1740 ccagactgat gtacccagaa aaactttcat tcaaatttaa tggagaaaca agatacttcc 1800 atgacaaaga acaactgaag aaattcacgt ccaccaaacc aaccctagaa agagtattga 1860 aagacatact agatagagat aaaaatgacc acaatcccca gaacaacaac agaaagaaac 1920 caccaggtag gacaagcaat taaacaaggg ggaaaagcag aaagcaaaca actggaacat 1980 aacatggcaa agtcgagcca gtacatatca gtgatatcca taaatgtaaa tggcctcaat 2040 tcacctatca aaagaaatag gttgacagaa tggatcaaga aacatgatcc cacaatatgc 2100 tgtatacaag aaacacatct gacccaaaag gaaactcata gattaaaagt caaaggatgg 2160 aaaacagtat tccatgcaac aggaacccag aaaaaagcag gggtaactat tctatttgca 2220 gacaacgtga acttcaaacc taaaatgatt aaaagagata aagaaggtca ctacatactc 2280 gttagtggaa aaattcaaga ggaagaacta acaattataa atatatatgc accaaataca 2340 ggagcaccca actatataag acagatacta atggacatga aaaaccaaat acacaagaac 2400 acaattataa caggggacct taacagccca ttgtcacaaa gagatagatc aactagacaa 2460 aaagtcagca aagaaataat agaactgaat catacctgcg agcaattgga cttagtagat 2520 gtttacagag tgttccaccc aaccacatca gaatacacat tcttctcagc tgcgcatggg 2580 acattctcca aaatagacca tatattatcc cataaaacat tcttaagtaa ctgcaaaggt 2640 attgaaatta tcccatgcgt aatatctgac catagcgcac tgaaattgga aattaatggc 2700 aaaagaaatt gcaaaacctg cataaataca tggaaattaa ataacaacct tttgaacaac 2760 cagtgggcta cagatgaaat taaagaagaa ataaagcaat acctacaatt caatgaaaat 2820 gcggaaacaa catactgtaa tctgtgggat gcaatgaaag cggtcttaag agggaaatat 2880 atagcaataa attgtcatat caagaaaaca gaacgagcac aaataaacaa cctaatgctt 2940 catctcaagc ttctagaaaa agaagagcaa gttaaaccca aagctcatag aagagaggaa 3000 attgtaaaga ttagagcaga aatcaatgca attgaaacta aaaaaataca aagaattaat 3060 gaatcaaaaa gctggttttt 
tgagagaata aacaaaattg acaaacctct agctaacctt 3120 ataaagaaaa agaaagaaaa aacccaaatt cacgcaataa ggactgaaaa gggggaaatt 3180 actacagatc ccatggaaat tcaaaagatc atgaatacct actttgaaaa tctctactca 3240 catagagtgg aaaacacaga ggaaatggac agatttctag aaacatatga actaccaaag 3300 ctcaaccaag aagatatagc aattttgaat aacccaatat cagctattga aattgaaaac 3360 gtaattaaaa acttaccaac aaagaaaagc ccaggtcctg atggattcac ggctgaattc 3420 tacaagaaat ttagagaaga tttgatgcca atactcctta aactcttcaa tgaaattgaa 3480 aaggaagcaa tcctccctaa atcattcctg gaggcaagta ttactctcat accaaaacca 3540 gagaaagacc caaccaaaaa ggagaactat aggccaatct cactaatgaa cattgatgca 3600 aagattctca ataaaatact ggcaaataga atgcagcaaa tcatcaagaa gattatacat 3660 catgaccaag tgggattcat tccaggaatg cagggatggt ttaacatacg caaatcaata 3720 aatgtaatac accacatcaa cagagccaaa aacaaaaacc acatggtcat ctcaatagat 3780 gcagaaaaag ctttcgataa agtccaacac atgttcatga taaaaaccct tcagaaactc 3840 ggaatacacg gtgtctacct caatcttata aaggccatct atgacaaacc aacagccaac 3900 attatactaa atggacaaaa actgaaagct ttcactttaa aatcaggaac aagacaagga 3960 tgtccactat caccactcct attcaatata gttctggaaa cactagccag ggcaatcaga 4020 caagagagag aaataaaagg aataaagata ggaaaggaag aagttaaatt atcactattt 4080 gcagatgaca tgatactcta tatagaagac ccaaaaaact ccattaaaag actactagag 4140 gtaataaaca aattcagcag tgtagctgga tacaaagtca acgctcaaaa atcaatagca 4200 ttcctgtaca caaacaaaaa actcaccgag agagaaatta tggaaatatc gcccttcaca 4260 atagcatcta aaaaaatgaa atacttagga atcaacctaa ctaagaatgt aaaagaccta 4320 tatgctgaaa attatgctac gctaaaaaag gagattgaag aggatattag aaaatggaag 4380 gatattcctt gctcttggct aggaagaacc aacattgtga aaatggccat actcccaaaa 4440 ctgttataca gatttaacgc aatcccaatc aaaataccat taacatacct cacagatcta 4500 gagaaaacac tcctaaaatt catttggaaa cagaagaaac ctagaatagc aaaggcaatt 4560 ctgggcaaca aaggcaaggc aggaggcatc tcaattcctg acttgaaact atactacaaa 4620 gccacagtga taaaatcagc atggtactgg caaagaaata gacctgagga tcaatggaac 4680 agattagaag atgcaagcac aactacaaag acactcagcc acctgatatt tgacaaaggg 4740 gctaaacatg ttcactggaa gaaggatagc 
ctctttaata aatggtgttg gaaaaattgg 4800 ctctccacat gccgaagact gaaactagat ccatgcctat caccatgcac aaaactaaag 4860 tctaaatgga tcaaagacct aaatattaaa acagaaacac taaatctact ggaagacaag 4920 gtaggtagaa gtcttggaga cataggggta ggcagagaat tcatgaacaa aacgcaaact 4980 gcacgggaaa cacttcaaag aatcaacaac tgggatctca tattattgaa aagtttctgc 5040 acttcaaaag aaatcaccaa cagagtgaaa agaaaaccca cagaatggga aagaatttta 5100 gccagcaacc cctcagacaa aggacttata tccagaacat ataaagaatt gaaaaacatc 5160 aagcccccca aattcaaaga ccctctccaa aaatgggcat ctgagatgaa cacacagttc 5220 tcagatgaag atatacaaat ggcaaataaa tacatgaaga aatgttcacc atcactggtc 5280 attcgagaaa tgcaaattaa aacagcattg agatttcatc tcactccgga aagaatagct 5340 gccatcaaga aatcaactga taacaaatgt tggcgaggct gcaggggaaa aaggaacact 5400 gctccactgt tggtgggagt gtagactggt gcagccacta tggaaatcag tatggagaat 5460 tctcagaaga ctaggattgg aagttccatt caatccagct attcccctcc tgggtatttt 5520 cccagaaaaa ctgaaaaggt catatcacag tgatatatgt gcacccatgt tcatagcagc 5580 acaatttgta atagctaaat cttggaaaca accaagatgc ccatcaaccg cagagtggat 5640 gaaaaagatg tggtattttt atacaatgga atattactca gctataaaaa aggatgatct 5700 tgaggccttc ataggcaaat ggagacaact tgagactatc ctcattagtg aaataaataa 5760 gaatcatatt tataaatatc acattgtgtc tctagtgtaa aaagtaataa gaaatgtagg 5820 agtaataaaa cccaaaatgt gtaattgcta ttgtaaaaaa gagaggaaaa gaaagtgctc 5880 taaaggatat aactgcatgt atgtttaagt gtgtaggatt gataaccata ttactaagtt 5940 tttgttggat cgttgaacag aactgtttgc agtctcgctg tttgaatgtc cactttgtat 6000 gctttacttt tgtaacttga acaactattt ctaagaggac aatatgtaat ttatcaacta 6060 gtgatttctt actgtttatt ttttttgaaa ttatgtatta tttctaccct ttttctgttt 6120 tgtttttttt ctctttaaat aaaatttttt aaaaaaaaaa 6160 // ID MER124 repbase; DNA; ROD; 290 BP. XX AC . XX DT 06-JUL-2006 (Rel. 11.07, Created) DT 17-AUG-2007 (Rel. 11.07, Last updated, Version 4) XX DE Unclassified repetitive element from mammals - consensus. XX KW Transposable Element; Nonautonomous; MER124; conserved. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. 
XX RN [1] RP 1-290 RA Jurka J.; RT "MER124: Unclassified repetitive element from mammals."; RL Repbase Reports 6(7), 377-377 (2006). XX RN [2] RP 1-290 RA Gentles AJ., Wakefield MJ., Kohany O., Gu W., Batzer MA., RA Pollock DD. and Jurka J.; RT "Evolutionary dynamics of transposable elements in the RT short-tailed opossum Monodelphis domestica."; RL Genome Res 17(7), 992-1004 (2007). XX RN [3] RP 1-290 RA Jurka J., Kapitonov V.V., Kohany O. and Jurka MV.; RT "Repetitive sequences in complex genomes: structure and RT evolution."; RL Annu Rev Genomics Hum Genet 8, 241-259 (2007). XX DR [1] (Consensus) XX CC Present in >500 copies per haploid genome in all mammals CC (including marsupials). Putative non-autonomous DNA transposon CC due to significant self-complementarity. XX SQ Sequence 290 BP; 90 A; 53 C; 48 G; 99 T; 0 other; ttcattaatt agggtcatgt ttacacttct ccattgtcaa ctataattat ttgtttaaat 60 aaggtctcct tagtttatta cagtcagtca ttactgcctg tcagtaagag ataatgtgct 120 tgaatttcac aagtatgtgt gaattgcaca tcttgcattt cttagcaggg actcagcctc 180 agtcaatatt ttcacagaat gacagttctc aggaaggcac agggtcttac ttaagctaat 240 aaatgattat ccttcacaat agaactacag gcaagatcct aattaatgaa 290 // ID IAPEY3_I repbase; DNA; ROD; 7680 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse family of LTR retrotransposons - a consensus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERVK; IAPEY3; KW IAPEY3_I; IAPEY3_LTR; LTR. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31, 51-54 (2003). XX RN [2] RP 1-7680 RA Pavlicek A. 
and Jurka J.; RT "IAPEY3_I - a mouse subfamily of LTR retrotransposons."; RL Direct Submission to Repbase Update (JAN-2004). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Internal sequence of the IAPEY3 subfamily. CC LTRs are listed as IAPEY3_LTR. IAPEY3_I is 80% identical CC to IAPEYI. The IAPEY3 subfamily is young and some members still CC contain intact ORFs. 6 bp TSDs. CC IAPEY3_I_ORF: 890-1856 (322 aa) gag polyprotein CC MRKKDTIQMKGELISHRKSQKRLETKVSSSQLAARPPGRRPLGPSAPPPYVQRYHSDSFIPKEEQRKMQQ CC AFPVFEGADGGRVHAPVEYIQIKELAESVRNYGVSANFTIAQVERLATLAMTPGDWMTVVKAAVPNMGMY CC LEWKALWQDSCQTQARANATVEGDQRTWTFELLTGQGQHAANQTNYHWGAYAQISAAAVKAWKALSRKGE CC ASGHLTKIIQGAQESFSDFVARMTEAAGRIFGDPEAAMPLIEQLVYEQATQECRAAITPRKSKGLQDWLK CC VCRELGGPLTNAGLAAAILQGQRRSDTAELKLCYNCGKPGHF CC IAPEY3_I_ORF: 3149-5549 (800 aa) pol polyprotein, RT, IN, RNase CC MILEQLMHKCACLGQFSEGYHCFLPYPKIGEIIIIDIKDCFFSIPLCPQDRQRFAFTIPAINHLEPDQRY CC QWKVLPQGMANSPTMCQLYVQLALKSVRNHFPSLLLVLYMDDILICHKNSQLLQDAYPILIKTLGQWGLQ CC VATEKVQVAQMGTFLGSLIYPDKIVPQKLEIRKDQLHTLNDFQKLLGDINWLRPFLKIPSAELKPLFDIL CC EGDTHISSHRALTPAACQALQTIEKALQDAQLQRIDESKSFELCVLKTAQLPTAVLWQNGPLLWVHPNAS CC PAKIIEWYPNAVAQLALRGIKAAITHFGKEPHILIVPYTSAQVQTLAATTDDWAVLVTSYSGQIDNHYPK CC HPILQFALTQAIVFPVITAKHPLADGVVVYTDGSKSGIGAYVVNDQVTSKQYNETSPQVIECLVVLEVLK CC TFPGPLNIVSDSLYVVNAVNTLEVAGLNKPSSKLAHIFQQIQSALLHRRHLVYITHVRAHSGLPGPMSHG CC NDLADKATRIVAAALSSQAEAAREFHKRFHVTAETLRRRFYFNQKRSSDIVTQCQNCCQFLPTPHVGVNP CC RGVRPLQVWQMDVTHISSFGRNQYLHVSIDTCSGVMFATPLTGEKASHVIQHCLEAWSAWGKPTILKTDN CC GPAYTSAKFQQFCHQMDVTHLTGLPYNPQGQGIVERAHRTLKSYFIKQKGGVDETLPSVPRVAVSMALFT CC LNFLNLDEQGRSAADRHSSDPDRPKEMVKWKDVLTGLWKGPDPILIRSRGAVCVFPQSEDNPFWLPERLT CC RKIFIKDGVEDADLPRTASDPDNAELVSGS CC IAPEY3_I_ORF: 5909-6668 (253 aa) env CC MVHFYHLTFQTSQNVPHQPRIAPHCSLEDEGLILPWSDCQSSITRWADQSKTFSFSPNMMVDPEKEFVMK CC KGLFIQDIRMHPFHKWLLCGVNGSCTELNPLIFIQGGAVGKASFTGISRFAQYWGIHAASLDYLWIYYII CC 
PSVEITGFNKTLINQTNYLPTPVCVYPPFLFILSNDSFEDCLNDSCWISQCWDVTKDTRAMVARIPRWIP CC VPVETPSTLSLFRQKRDFWHYCCRDHSNFSQCSCCYSCRVRYG. XX SQ Sequence 7680 BP; 2186 A; 1702 C; 1775 G; 2017 T; 0 other; agtggcgccg agaacccggg aaacatacca tcaccaggcg ctgataggag accctccctt 60 ttcacggggc gggttcagaa ctacgggaca ctttacagct ctgcaaggta tgttcggtct 120 tgaaatttct ccagagttag aggccctttt gtttgttttc acggggctta ttctttttct 180 tttggttgcg ttctgcttcc accggtggaa ctgctgactg gttctcagtc gcggtcaggc 240 atggttagat aaaagagcag tatgggaact tcccactctg tggtgaccgc cttacacggt 300 cggtcctgaa gcagcgtggc ctgaaagtcg ccactaagac actagaaggc tttgtaaaag 360 agatagatcg catagcaccg tggtatgcgt gctcagggtc cttaactatc ccctcttggg 420 agaaactgaa gggagattta gttagggaac agcagaatgg caaacttaaa gcagggacca 480 tgccgttgtg gaagctgata agatcgtgct taaaggacga ggaatgtcaa caagtggtta 540 aggcagggca gagaatactg gaggaaattc aagacagtct atcagagaca gagcggggag 600 agagattagg agctcaaaag aaaaaaaggt gcaccaaata agaaaacagg cctttccacg 660 gaccttgagc ccgaggaaaa gagaatctag ggaaagaata ccctgggaga gtttagaaaa 720 aaggatgaga aggaagagaa gaagaaagat caatctgggg aggtccctag gagaaggagc 780 ctctatccgc cattagatga gtttaaggct ctagctctta gtagctcaga atcagatgag 840 gaacttagcc cctctgagga aacagactta gaggaagaag cagctcgtta tgaggaagaa 900 agataccatc cagatgaaag gcgagctaat cagtcacaga aaaagccaaa agcggctgga 960 gacaaaagtc agctcaagcc agcttgctgc tcggcctccg ggccgtcggc ctctgggtcc 1020 tagtgcacct ccgccttatg tgcagaggta tcattcagac tcattcattc caaaagagga 1080 acagagaaaa atgcaacagg catttccagt ctttgaagga gccgatggag ggcgagttca 1140 cgctccggtg gagtatatac agattaaaga gcttgctgag tcagtccgta actatggagt 1200 cagcgccaat ttcactatag cacaagtcga aaggcttgct actttggcaa tgactcctgg 1260 agattggatg actgttgtga aagctgcagt tcctaatatg ggaatgtatc tagagtggaa 1320 agcattgtgg caagattcct gccagacaca ggcaagggcc aatgccaccg ttgaagggga 1380 ccaaagaaca tggacttttg aattacttac aggtcaggga caacacgctg ctaaccagac 1440 aaattatcat tggggagcgt acgctcagat ctcagctgcc gctgttaagg cgtggaaagc 1500 actctctagg aagggagagg caagtggaca tttgacaaaa attatccagg 
gcgcacagga 1560 gtcattctca gactttgtgg ctagaatgac agaggcagca gggagaatat ttggagaccc 1620 agaggcagct atgcctttaa ttgaacagtt ggtttatgaa caagctacgc aggaatgcag 1680 agcagcaatt acacctagaa agagtaaagg attgcaggac tggttaaagg tttgccgtga 1740 gcttggaggg ccgctcacta atgctggatt ggcagccgcc attctgcagg ggcagaggcg 1800 ctcagacaca gctgagctca aactttgcta taattgtggc aaaccagggc acttttaaaa 1860 aaggactgca gagcccttgt aaaaaggaca gcgccagggt tgtgtactaa gtgtggaaaa 1920 ggatatcatt gggccaaaga ttgtcgctca attaaagata tacgaggcag gctcttgcag 1980 ccaggacccc ctcaagcagt agaaaatgag aatgaaggca tttcaaaaaa cgagtttcgg 2040 gcccccaggt cccagggccc caaaacatat gggaccccga cgggcaacag gtggacacca 2100 cagtccgcag ggcaacagaa ggctcagctg gattggacct ccgtgcctcc acccgattca 2160 tgctgatgcc ccagatgggg gtgcagccaa ttcctactga ctataagggg cctctgccat 2220 ctggaagtgt cggcctaata ctgggccggg cctccctcac cttacaaggc cttattgtcc 2280 accctggagt tgtagatcag gattatgaag gggaacttca ggttctctgt tcctgccctc 2340 aaggtgtctt ttctatatca caaggggata gaatagctca actaataatt ttgccaagcc 2400 tacatggcct gttttccctc ctctggtgtc cctcgagctg ccagagggat tggttctact 2460 ggaaatgatt ctgcttactt aataatgtct ttagattcca ggccatcctt agagttagtt 2520 atagaaggga aacaatttaa agggattttt agatacagga gcagataaaa gtattatctc 2580 ttcccactgg tggccaaaag cctggcccgt cactcagtca tcacactctt tacaagggtt 2640 aggctatcag tcctgcccca ctattagctc ccgttctttg agctggcaag cacctgaagg 2700 tcaaacggga caattcactc cttatgtctt accactccca gttaatctct ggggggagag 2760 atattttaca ggcaatggga atgaccttga ctaatgaata ctcaccacaa gctgttcaaa 2820 tgatgaagaa aatgggctat acagaaggaa aaggattagg gaaaggagag cagggtagac 2880 ttgaacctat cccctcaaga aggtaataat gggagacaag gtttgggttt ttttctagaa 2940 gcggctgttg agggttccat gcccatacca tggcttacag aggaagctgt atgggttcct 3000 caatggcctc tttcctctga aaaattagaa gcagccacaa aactgatctc tgaacagcta 3060 cgcttaggtc atttagagcc ctctacctca ccctggaata cacctatttt tgtaattaag 3120 aaaaaatctg gcaaatggcg cttattacat gatcttagag caattaatgc acaaatgcgc 3180 ctgtttgggt cagttcagcg agggctacca ttgctttctg ccctacccaa aaattgggga 
3240 aattataatt atagatatta aagattgttt tttctctata cctttatgcc ctcaagatag 3300 gcaaagattt gcatttacca tcccagccat taatcattta gagcctgatc aaagatacca 3360 atggaaggtc ctacctcagg ggatggcaaa tagccccacc atgtgtcaac tgtatgtgca 3420 attggcactt aagtcagtta gaaatcattt tccatcatta cttctggtac tttatatgga 3480 tgacattctg atttgccata aaaattcaca gttattgcaa gatgcatacc ctatattaat 3540 aaaaacatta gggcaatggg gattgcaggt agccaccgaa aaggtgcaag ttgctcaaat 3600 gggaaccttc ctagggtcac tcatttatcc tgacaaaatt gttcctcaaa aattagagat 3660 tcgcaaagat caactacata ccttaaatga ttttcaaaaa ttgctgggag atattaattg 3720 gctgagacca tttttgaaaa ttccatcagc agagttaaag cctttatttg atatattaga 3780 aggagatact cacatctcct cccatagagc acttacccca gctgcatgtc aagctttaca 3840 aactatagaa aaggccttac aagatgctca attacaacgc attgatgagt cgaagtcatt 3900 tgaattgtgc gtattaaaaa ccgcacagtt gccaacagca gtcttgtggc aaaatggacc 3960 cttattgtgg gtccacccta atgcttcccc tgcaaaaatc attgaatggt atcctaatgc 4020 agttgctcaa cttgcactta ggggaataaa agcagccatt actcattttg gaaaagaacc 4080 tcatatacta attgtgcctt atacctctgc tcaagttcaa accctggcag caacaactga 4140 tgattgggca gtcttagtta cctcctattc aggacaaatt gataatcatt atcctaaaca 4200 tccaatttta cagtttgcct taactcaagc catagtgttt ccagtaatta cagccaaaca 4260 cccacttgca gatggggtgg tagtatatac agatggatcc aaatctggca taggtgcata 4320 tgtagtaaat gaccaagtaa catccaagca atataatgag acatcacccc aggttataga 4380 gtgtttagtg gtactagagg tccttaaaac tttcccagga ccgcttaata ttgtatctga 4440 ttccttatat gtagttaatg cagttaatac acttgaagtt gctggcttaa ataaaccatc 4500 tagcaagctt gctcacattt ttcaacaaat tcagtcagcc ttgttacata gaagacatct 4560 tgtctatatt actcatgtca gggctcattc tggccttcct ggccctatgt ctcatggaaa 4620 tgacttagca gacaaagcca ctagaatcgt ggctgctgct ctgtcctcgc aggcagaagc 4680 tgcaagggaa tttcataaac gctttcacgt gacggctgaa actttacgcc gccggtttta 4740 ctttaaccag aaaagaagct cggacattgt cacccaatgt caaaactgtt gtcaattttt 4800 acctactcct catgtagggg taaacccccg aggcgtcagg ccattacagg tctggcaaat 4860 ggatgtcact catatttcct cctttgggag aaatcaatat ttacatgttt ctattgacac 4920 
ctgctctggt gtaatgtttg ccacaccttt aacaggtgaa aaagcctctc atgttataca 4980 gcactgttta gaagcctgga gtgcttgggg caagcccaca atccttaaaa cggataatgg 5040 gccagcatat acttctgcta aatttcaaca attttgtcat caaatggatg tcactcacct 5100 gactggcctt ccttataatc ctcaaggaca aggcattgtg gaacgtgccc accgcacact 5160 caagtcttat tttataaaac aaaaaggggg agttgatgag actctgccct cagtgccaag 5220 agtagctgtc tccatggcac tcttcacact taattttttg aatcttgacg agcaaggacg 5280 ttctgcagct gatcgacata gctcagaccc tgatagacca aaagaaatgg tcaaatggaa 5340 ggatgtttta actggtctgt ggaaaggccc ggatcctatt ttaataagat ccaggggagc 5400 tgtctgtgtt tttccacaga gtgaagacaa cccattttgg ctgccggaac gactcacccg 5460 caagatattc atcaaggatg gtgtggaaga tgctgacctc ccgaggactg cttctgatcc 5520 tgacaatgct gaacttgtct caggttccta gtataatggg cgaacagaga tgggccattc 5580 tctcggcttt ccctaaacca atgccagttc gccatgatgc tatagttttt ccaaaattct 5640 ttactactaa taaaacagtg gatttgccat atctacccta tgatcccacc cgagcaccat 5700 taggagaaaa tcgctcttta ctagaacagg gttctttatg ttttcaaatt aatggaccag 5760 ggaaaatgta tcaacctcac agcccgagcc ttgggaatgt ttaataagca ccgaggaggc 5820 ttggtgagca caacccaaga tacttccaac gcagatataa ccattattca aatcacatgg 5880 accttctggc aggaggcaat ttgggttaat ggtacatttc taccacctaa cttttcaaac 5940 aagtcagaac gttccccacc aaccaagaat agccccccat tgtagtttgg aagatgaagg 6000 gctgatcctg ccatggtctg attgtcaatc ctccatcact cgttgggcag atcagagtaa 6060 aaccttttcc ttttctccca acatgatggt tgacccagag aaggaatttg ttatgaaaaa 6120 gggacttttc atacaggaca ttagaatgca tccctttcac aagtggctgc tttgtggagt 6180 caatggcagt tgtacagaac tcaatccctt gatttttatc cagggaggag cagttggaaa 6240 ggcttctttt actggcatct caagatttgc tcagtattgg ggaatacatg ctgcctccct 6300 ggactactta tggatatact atataatacc cagtgtagag atcactgggt tcaataaaac 6360 tttgataaac cagactaatt atctacctac cccagtctgt gtttaccccc ctttcttgtt 6420 tattctttca aatgattcat ttgaagactg tttaaatgat tcttgttgga tttctcaatg 6480 ttgggatgta acaaaggaca cccgcgccat ggtggcccgg atcccacgtt ggatccctgt 6540 tccggtggaa acaccctcca ccttatccct gtttagacaa aaaagagatt tttggcatta 6600 ctgctgccgt 
gatcatagca atttcagcca gtgcagctgc tgctacagct gcagggtacg 6660 ctatggctag tacggtccaa gcaggcacaa aattaaatca gctttcagtc gatctgactg 6720 atgccatcaa tgtccaaact tctgctagtg cccagttgaa gggagggctg atgattttga 6780 atcagcgcct tgatctggtt gaggaacaga ataagtgttc tataccagtt ggcccagttg 6840 ggttgtgaaa gaaaattggg tgctctgtgt attaccagtg tccaatatga aaattttact 6900 cgtgcagcta atctgtctag acagctttcc ttgtatcttg caggaaattg gtccgaagga 6960 ttcgatgaga ctcttgaggc cctgagggcg gcagttttaa ggatcaactc gacgcgagtg 7020 gacctgtcat tgacagaggg cctctcctcc tggatttcat ctgcattttc ttattttaag 7080 gaatggggtg gggggtagtt ttaatttggt gctgccatct gctgtggact tgtgttcatg 7140 ctctggttgg tttgcaagct cagagcccaa caaaaacgtg acaaggtcgt tattgctcaa 7200 gcacttgcag ccattgaaca aggggcctcc cctgaaattt ggctatccat gcttaagaat 7260 taaatggcat ttgagttctc tgctattgca tcccacggat ttgtggatcc attgcacctg 7320 ggatgagaga tactcaacgc tcattgatca gcacttgcac ttcgctgagg tttcctcttg 7380 cacgcgatag ggtgatcatg atttccccac taattcattt tttggctggc ctcttttata 7440 gagttcgccc taggtcattt agcagttact gtcacacagt actgtggtat ccagagacgg 7500 gcaactttcc tcatgtacag accaacctaa gacacggggc ccggtggcga tagggttacc 7560 ctatgacagg aaaggctgtg acattggagg aatgacctaa gacaggagcc acagcagatg 7620 gacataactg cagaggccta gaccagcctc aaattttaat aaaaactaaa aagggggaga 7680 // ID L1MC5 repbase; DNA; ROD; 1351 BP. XX AC . XX DT 14-MAY-1998 (Rel. 6.5, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE 3'-end of L1 repeat (subfamily L1MC5) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; L1MC5 subfamily; L1MC5. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1351 RA Jurka J.; RT "L1MC5."; RL Direct Submission to Repbase Update (05-MAY-1998). 
XX DR [1] (Consensus) XX SQ Sequence 1351 BP; 513 A; 205 C; 231 G; 383 T; 19 other; aaaaaaaaaa yacatttcta scctttccac tgaaaagncc tagaaacaat gactaatcca 60 atagcaatga gtaccctyag gacccagatt gtggtctcta aataccattt cccactaaaa 120 ggaaccaggg ctccttggag aaatggctaa ttccaggtct ggggcaggaa atgtacaaga 180 tgagcctgga acatcttgtc ataccagata gcaaggaagc tataaaacta ctagggttgt 240 gtcaaaagga ctcaggagcc aacttactgg ccaaagatgg gacaatttga gcatcaataa 300 taactgcaat ttaacacatc aaatatrttt aaatccatga gtttataata aaaaaatcta 360 attggtcacc tttggagtga tgctagggaa ccaactcatt attttgaaaa ctggtaaata 420 aagggaaaga atcaagcatt tatcttgcct ttcctatata aactgtacct cwgggtaacc 480 aaatagtwga tgagggaaat ttctctttat agaagtattc cagctaataa atgaagaaaa 540 aattagaatt agaatatcac cattttgcaa cccctaatga attaatggat ctaggcaatg 600 atcatcaatg gctgctaaca tcacaaaaac tarasatytg cctcctgatg gaartawaca 660 acaccaccta tgaaatatta gtcttgccaa aaaaaaatca aacctgaatc tgatcaagcc 720 tctagatcta actaccaatt tacaggaaat acagaggaca gaggaacatg ttaaatgaca 780 ccatrgggat gcaatcagca aaatccagac tgtgggaaac tctacaggac aaatrttaac 840 ttttcttcaa caaataaatt atgagaaaaa aaagatggaa gaagaaccta tagattaaaa 900 gagacttaaa agacatatca accaattaca atgtatggac cttatttgga tcctgattca 960 aamaaatrta aactataaaa atatatrtgt atacaattgg aaatttgaac actgactaga 1020 tatttgatga tattaaggaa ttattgttat ttttaggtgt gataatggta ttatagttat 1080 tttataaaat agtccttatc ttttagagat acatactgaa atatttatag ataaaatkat 1140 atgatgtctg ggatttgctt caaaataatc caggagggag gaagtaggtg gagctataga 1200 tgaaacaaaa ttggccatga attgataatt gttgaagctg ggtgatgggt atgtggaagt 1260 tcattatact attctctcta cttttgtata tttgaaattt ttctaatwtt aaaaataaag 1320 accaatttgt tggacattaa tttgttggca a 1351 // ID LX4 repbase; DNA; ROD; 176 BP. XX AC . XX DT 22-APR-1997 (Rel. 3, Created) DT 22-APR-1997 (Rel. 3, Last updated, Version 1) XX DE Myomorpha L1 3' end - a consensus. XX KW LINE; L1 family; 3'-end; RMER7; LX4. 
XX OS Murinae OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae. XX RN [1] RP 1-176 RA Chopra V. and Jurka J.; RT "LX4."; RL Direct Submission to Repbase Update (1997). XX DR [1] (Consensus) XX SQ Sequence 176 BP; 31 A; 36 C; 78 G; 31 T; 0 other; gatcccgttc ccctaactgg gctgccttgt ctggcctcag tgggagagga tgtgcctagt 60 cctgcagaga cttgatgtgc cagggtgggg ggatacccag ggggggctcc ccctctcaga 120 ggagaagggg aagggggaat ggggggaggg acttgtgagg gggggactgg gaggag 176 // ID CAVID1B repbase; DNA; ROD; 96 BP. XX AC . XX DT 26-DEC-2009 (Rel. 15.03, Created) DT 26-DEC-2009 (Rel. 15.03, Last updated, Version 3) XX DE tRNA-derived SINE family: consensus. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW CAVID1B. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-96 RA Jurka J.; RT "SINE elements from guinea pig."; RL Repbase Reports 10(3), 497-497 (2010). XX DR [1] (Consensus) XX CC >94% identical to consensus. XX SQ Sequence 96 BP; 29 A; 19 C; 28 G; 20 T; 0 other; ggggctgggg atttagctca gcggcataag cacctgcctt gcaagcgtgt ggtcgtgagt 60 ttgatcccca gtactgataa aaaagaaaaa gacaaa 96 // ID RLTR36_MM repbase; DNA; ROD; 1076 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR36_MM; KW RLTR25A. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. 
et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-1076 RA Pavlicek A. and Jurka J.; RT "RLTR36_MM - a family of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Similar to RLTR15B_MM. Individual copies are CC ~87% identical to the consensus. In RepeatMasker listed as CC RLTR25A. XX SQ Sequence 1076 BP; 322 A; 203 C; 281 G; 270 T; 0 other; tccggcatgg ccttggtagg gacagaatga catggttcct gcccctggga cagggccagc 60 aaggcatggg cctccatgag tgttgatggc tgctgtgggc ataaggcttg tttgtataat 120 aatgtacata ttttcacgtt cctccttcaa ggaatgtcct ccctgttaac gttaatgact 180 ccatgataat cagagacagc atgagtctgg gaggcgaagg tgtggctaat gtctcagcaa 240 aatggtaaat gctgggacct tcaggacagc ccttaaggct gtggaaaaga actctaaaaa 300 catgagttca aaaatatata atttctcaac tatgcaaaat ataaggatgc aatatgaatt 360 atatgagggg cttcatgaat ctaaaggaac aaaagcagct gtgctgtgag ccaacttgtc 420 agaaagatac taaggaagga gataaagaga tttagggagt ggtgatcaca ggagatccca 480 cccagctaag ttttttttgt ttatgcttac aaaggcagac agattcctga gttcagggtc 540 agcctgggac agagcaaggt taggcccagg tgtggtagaa atggtaattt cagggtgggg 600 tcccacccag ctagcttatt gtctgtgctt aacagaggca ggcagatctc tgaattcttt 660 tgcaatgtta aaagaaaatg tgtgcttgct gtctcctaag aatcaagggg ctggggtcat 720 gggatgctga ttcataggat aatcaaaagg gaacctggag taaatgactg gattgatatg 780 taaaataaaa gactgggctt atgatctgca agagatgagc tatccagaga atttttttag 840 aatagcaaga gagaactgct tggagaactg tctcaagcag aaaatagata gagagagagc 900 tgtctggaga tctccagaga aagcagaaca gagagaaagc aatctggaaa gctgtctcga 960 gcagaacata ggccaccagc ttggacccac gatttgactt cgagtcattt gtttttgctg 1020 ctcccagaca ccccttctct cagaacccct ctccaagctg aggctggtcc ttggca 1076 // ID MIR2 repbase; DNA; ROD; 150 BP. XX AC . XX DT 19-FEB-1997 (Rel. 5, Created) DT 19-FEB-1997 (Rel. 5, Last updated, Version 1) XX DE Repetitive element - a consensus. 
XX KW Repetitive sequence; MIR2; DBR; SR1. XX OS Mammalia OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi. XX RN [1] RP 1-150 RA Degen J.S. and Davie W.E.; RT "Nucleotide sequence of the gene for human prothrombin."; RL Biochemistry 26, 6165-6177 (1987). XX RN [2] RP 1-150 RA Smit F.A. and Riggs D.A.; RT "MIRs are classic tRNA-derived SINEs that amplified before the RT mammalian radiation."; RL Nucl. Acids. Res 23, 98-102 (1995). XX DR [2] (Consensus) XX CC 24 bp upstream of NcoI site; chromosome 11p11-q12. XX SQ Sequence 150 BP; 35 A; 30 C; 33 G; 49 T; 3 other; ttntttattg tctgtctcct ccactagant gtaagctcca tgagggcagg gattttgtct 60 gtyttgttca ctgctgtatc cccagcgcct agaacagtgc ctggcacata gtaggcgctc 120 aataaatatt tgttgaatga atgaatgaat 150 // ID ERVB4_1B-LTR_MM repbase; DNA; ROD; 522 BP. XX AC . XX DT 26-AUG-2008 (Rel. 13.08, Created) DT 26-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE Mouse endogeneous betaretrovirus ERVB4_1B LTR subfamily (LTR DE portion). XX KW LTR Retrotransposon; Transposable Element; LTR; KW endogeneous betaretrovirus; MmERV-B4_AC102561; ERVB4_1-LTR_MM; KW ERVB4_1B-LTR_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-522 RA Jurka J.; RT "Endogeneous betaretrovirus family from mouse."; RL Repbase Reports 8(8), 862-862 (2008). XX DR [1] (Consensus) XX CC Copies 98% identical to consensus. Termini: 5' TG....CA 3'. 
XX SQ Sequence 522 BP; 110 A; 171 C; 103 G; 138 T; 0 other; tgtcagagac catcccgtga gaaagtgagc ccttcacaat ctccctagca ggccaaatgg 60 ccttgtactg agagctggtt gtcacccccc tttcctccct attcctttcc tggcacctga 120 ggctgtaaaa gctgaattat agtcccctct tccctatctc ttcctgagtt cccatgccat 180 ccaaggacat gagttacgcc tgagcccagc ctgaccccca aggctgtcaa ggaggatcga 240 tgttccagag ataagatcca gagtgcccgc tgcctggcgc ctgacttcgg cccccatgtc 300 agcagatgcc cacttctttg ttctttgtat aattctccct cgacccctcc catattcccc 360 gcgatgtatg ctttaaaaag aaggcacctc agcctaataa acgagacctt gataggttca 420 atctacttgg tcctccgact cttctttctt ttactcccat ttccttccag gtttgcggtc 480 cccctcgcac ccacgaataa ctgaatcccg cgggacggga ta 522 // ID L1MA10 repbase; DNA; ROD; 1069 BP. XX AC . XX DT 01-OCT-1995 (Rel. 3.6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE 3'-end of L1 repeat (subfamily L1MA10) - a consensus. XX KW Repetitive sequence; L1 (LINE) family; L1MA10 subfamily; L1MA10. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1069 RA Smit F.A., Toth G., Riggs D.A. and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [1] (Consensus) XX CC Contains identical ORF2 region consensus (subfam L1M3) as L1MB3 CC ORF2 ends at bp 675; average divergence of copies from consensus: CC 17%. 
XX SQ Sequence 1069 BP; 409 A; 155 C; 205 G; 272 T; 28 other; ttaatatcca aaatatataa ggaactcaaa caactcaaca agaaraaaac aaataaccca 60 attaaaaaat gggcaaarga cctgaataga catttytcaa aagaagacat acaaatggcc 120 aacagatata tgaaaaaatg ctcaacatca ctaatcatca aggaaatgca aattaaaacc 180 acaatgagat atcacctcac acctgttaga atggctatta tcaaaaagac agaaaataat 240 aaatgttggy gaggatgtgg agaaaaggga actattgtac actgttggtg ggaatgtaaa 300 ttagtayagc caytatggaa aacagtatgg aggttcctca aaaaaytaaa aataraacta 360 ccatatgaty cagcaatccc actwctgggt atatatccaa argaattgaa atcagtatgt 420 ygaagagata yctgcactcc catgtttayt gcagcaytat tcacaatagc caagatatgg 480 aawcaaccta agtgtccatc aayggawgaa tggataaaga aaatgtggta tatatacaca 540 atggaatact attcagccat aaaaaagaat gaaatcctgt catttgyarc aacatggatg 600 aacctggagg acattatgct aagtgaaata agccaggcac agaaagacaa atactgcatg 660 attccactta tatgaggtat ctaaaatagt caaactcata gaagcagaga gtagaatggt 720 ggttgccagg ggctgggggr agggggaaat ggggagttgc tgttcaatgg gtataaagtt 780 tcagttatgc aagatgaata agttctagag atctgctgta caacattgtg cctatagtta 840 ataatactgg attgtacact taaaawttgt taaaagrgta gatctcatgt taagtgttct 900 tatcgcacaa caaaaaaatg gggaactttt ggrrgtgatg gatatgttca ttatcttgat 960 tgtggtgatg gtwtcacggg tgtntacata tgtcaaaact catcaaattg tacacwttaa 1020 atatatgcag tttttagtgt ataattatac ctcaacaaag ctgttttaa 1069 // ID L1MEC_5 repbase; DNA; ROD; 2523 BP. XX AC . XX DT 23-JAN-1998 (Rel. 6.4, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE L1MEC_5 LINE1 repetitive element - a consensus. XX KW L1M4_5; LINE1 repeat; L1MEC_5. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-2523 RA Smit F.A.; RT "L1MEC_5."; RL Direct Submission to Repbase Update (1997). XX DR [1] (Consensus) XX CC 5' end of LINE elements with L1ME1-2 subfamily 3' ends, CC comprising the CC 5' UTR, ORF1 (pos. 793-1771 or more) and part of ORF2 (from pos. CC 2219). 
XX SQ Sequence 2523 BP; 1044 A; 416 C; 499 G; 486 T; 78 other; aatgtnttaa aaaattcact kaaaacataa tasaagactt ctgcttccgg cyaagatgga 60 gtaacaggga ccggatttac cctcctrcct gaaacaacya aaacgcggac aaaatatatg 120 aaacaacggt tttcaagaca ttgracatca ggcagcgmag gacagcgatc cctgrgagaa 180 gggaaacgaa tgaggtgagc cctrtgatcg ccycagctta ctgcctkgag atagtttcca 240 ggccacggca cagagaggag aaacccaggc agagcccggc ggwctccctg agttgaggag 300 acggagctga gagtycgrgg aggccaaggc tggctagagt tcgcagggca gaataccaga 360 gaggagaaag ccgcamagag agaacttctg agatctgcag agggtccccy tcgagtcttc 420 agctgagtac tgaycagcnc atgcatgcga ggaaacyacc cgaggctagg gaaagaaccm 480 cacgaaagga ttararggaa caatccccag agctcacaca gggccgggaa tagttcctgt 540 ttccaccagc cagagtggaa aacctcataa ttcacggggc atcgggtaga gtactcagaa 600 gggttttgcc tcagtagcgg ggaaaaatta gccctaaact aaacactgct ctggtcccac 660 ctaacaaatc ttaaaagcaa gcctcgaaaa gatcaaactg tttccaagta acttaactgc 720 atcccagaat aaagctcaag aatatttawa ggaatacaaa aatatccagc acccaamaag 780 gtaaaattca caatatctgg catccaatca aaaattacca ggcatccaaa gaagcaggaa 840 aatacgacct ataatgagga gaaaaatcaa tcaatagaaa cagacccaga aatgacanag 900 atgatagaat tagtagacaa ggacatttaa amagttatta taactctnny mwatatgttc 960 aagragntaa agaaaaacat gaacataatg aagaaagagr tggaagatat aaaaagaccc 1020 aaatagaact tctagagatg aaaawtayaa trtctgaaat gaaaaataca ctggatgaga 1080 ttaacagcag attagacact ncagaagaaa raattagtga acttgaagac atancaatag 1140 aaantatcca aaatgaaata cagagacaaa gaaannaaaa ayagacaaat aaaataaagc 1200 gtcagtgagc tgtgggacaa cttcaagtgg cctaayacac gtrtwattgg agtcaaaaag 1260 aaagganaga gagaatgagg yagaagaaat attggaagaa acgatagctg agaattttcc 1320 aaaattgatg aaagacatna atctacakat tcaagaattc caatgaatsc caagyaggat 1380 aratacagaa gaaacaacac atctwgacat atcatagtca aatcgyagaa aaccaaagay 1440 aaagagaaaa tcttaaaagc agtcagagar aaacaacata ttacgtacag ggggacaaca 1500 atacaaatta aagctgactt tttaccaggc acagtggagg ccagaagaca gtgggatgaw 1560 atatttaaag tgctgaaaga aaagaactgt cgacccggaa ttctatatcc agcgaaaata 1620 tctttcaaaa ataaaggcaa aataaagaca 
ttctcagaca aacaaaagct gagataattc 1680 attaccagca gacctgcgct acaagaaatg ttaaaggaag tcnttcaagc agaaagaaaa 1740 tgacaccaga tggaaacata gatctacaca aagaaatgaa gagcgccgga aatggtaamt 1800 atacgggtaa atataaaaga natcctttct tattatttnn aatttcttta aaagataatt 1860 ggctgtttan agmaaaaata ataacaatgt attgtggggt ttataacata tgtaarcaaa 1920 atgtatggca acaatagcac aaaggccggn aggggagaaa tggaagtata ytgttgtaag 1980 gttcttatac tntacgtgaa gtggtataat rtcatttgaa ggtagactgt gataagttaa 2040 agatgcatat tgtaaaccct agagcaacca ctaagataac aaaacaaaga gttatagcta 2100 ataagccaac aaaggagata aaacggaatc ataaaaaata cccaattaat ccaaaaaaag 2160 gcanwaaaaa aaaaatggaa caaagaamag atgggacaaa tagaaaacaa atagcaarat 2220 gatagattta aacccaacca tatcaataat tacattaaat ataaatggtc taaacgctcc 2280 aattaaaaga cagagattgt cagaatggat aaaaaaacga gacccaaata tatgctgcct 2340 acaagaaacc cactttaaat ataaagacac aaataggtta aaagtgaaag gatgaaaaat 2400 gatatntcat gttaacgcca tccaaaagaa agctggagta gctatattaa tatcagacga 2460 agtggatttc agaggaaaga ataccgccaa gagncaaaaa aggtcatttt ataatgataa 2520 agg 2523 // ID MT2C repbase; DNA; ROD; 322 BP. XX AC . XX DT 25-APR-1997 (Rel. 3, Created) DT 25-APR-1997 (Rel. 3, Last updated, Version 1) XX DE Long terminal repeat of retrovirus-like element. XX KW Long terminal repeat of retrovirus-like element; MTE3; MT2C. XX OS Sciurognathi OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia. XX RN [1] RP 1-322 RA Wilkie M.T. and Palmiter D.R.; RT "Analysis of the integrant in MyK-103 transgenic mice in which RT males fail to transmit the integrant."; RL Mol. Cell. Biol 7(5), 1646-1655 (1987). XX RN [2] RP 1-322 RA Smit F.A.; RT "Structure and evolution of mammalian interspersed repeats."; RL Ph.D. dissertation, Univ Southern California, 1995, pp RL 220-224.
XX DR [2] (Consensus) XX SQ Sequence 322 BP; 83 A; 62 C; 79 G; 96 T; 2 other; tgtagtggct attcctggtt gtcaacttga ctatatttgg aatgaactac aatccagaat 60 tggaaggctc accagtgacc ctaatctgga ggctgggaga tacaagtttc tgacctggat 120 cttggtatgg agatcttgag gcatagtggc tatggattcc agaagattaa ggcagggaga 180 tctttgagtt caaggtcatc gcctgcttgc ttcgtgagac tgagtaactg ctagatcctt 240 ggacttccat tcacagctrc nactgaacca ttgttgggaa ttggactgca gactgtaagt 300 catcaataaa ttcctttact at 322 // ID MERX repbase; DNA; ROD; 756 BP. XX AC . XX DT 24-JAN-2007 (Rel. 12.01, Created) DT 26-JUN-2008 (Rel. 13.06, Last updated, Version 3) XX DE Mammalian repeat, possible fragment of a LINE1 family, or SINE DE element. XX KW Transposable Element; DNA; Tigger; MERX. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-224 RA Jurka J.; RT "Low-copy interspersed repeat from mammals."; RL Direct Submission to Repbase Update (24-JAN-2007). XX RN [2] RP 1-756 RA Smit A.F.; RT "Consensus update."; RL Direct Submission to Repbase Update (28-NOV-2007). XX DR [1] (Consensus) XX CC Present in >200 copies in the human genome. Its small fragment CC shows weak similarity to L1MED_5. It is absent from opossum. CC Re-classified as Eutherian. CC [2] 22 bp TIRs that match those of other Tiggers. No coding CC matches, but final 65 bp are 75% similar to 3'-end of Tigger8, CC which does have a coding match (hence the orientation shifted CC from original). Much extended from original MERX sequence, which CC matches pos 232-756. Not present in opossum or platypus, so CC introduced in (early) eutherian ancestor. 
XX SQ Sequence 756 BP; 192 A; 154 C; 167 G; 243 T; 0 other; caggtatccc tcgctatctg aactctcact atccgaatat tcgctataac gacttgcaaa 60 aatttttacc caaaattcac tatccgaatc gaaaacctgc tataatgaat ctgcatgtgc 120 gcgccagcga aaacgtttaa gttgcgcgcg agtccgggcg agaggatgta gagtgcgctg 180 cagtcgtatc tcagctgttc tcccgatagg atcgcgtctc gtgctcgcgt tgtttaaacg 240 tgttgtgcat tatcgctatc atcttcccca ccttttccct gagggtttag cccttcatgg 300 gtcccagtgt ttgcttctgc caggcgcctg ggggcactac caacccgggt ccaatttaga 360 tagtatcttt aacatattat ttcattgttt atttacatta cagtacatgt tcgttgcagt 420 gtagaaggaa aacgtaattc gtatccgata ctgtacagta tcgttgcgta ctgcacacaa 480 acatacccac taatgagttc attaagtgtt aaataattag gtaattggtg ttttaaatgc 540 tttatattat gcagaaatcc ttggtggatt gttatatagg tgtttaagag tgttttagtg 600 atatttgggg aaattggttg gggtttttgg atgggctggg aacgcattat tatttttccc 660 atttaaaata atggaatata ggctcccgct atccgaaaat tcgctatcca acacgttttc 720 aggaacggat tagattcgga taacgaggga tgcctg 756 // ID RMER17A_Rn repbase; DNA; ROD; 333 BP. XX AC . XX DT 11-JUN-2009 (Rel. 14.06, Created) DT 11-JUN-2009 (Rel. 14.06, Last updated, Version 2) XX DE Long terminal repeat of ERV2 Endogenous Retrovirus from rat. XX KW ERV2; Endogenous Retrovirus; Transposable Element; RMER17A_Rn. XX OS Rattus norvegicus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Rattus. XX RN [1] RP 1-333 RA Jurka J.; RT "Long terminal repeats from rat."; RL Repbase Reports 9(6), 1235-1235 (2009). XX DR [1] (Consensus) XX CC 90% identical to consensus. 
XX SQ Sequence 333 BP; 42 A; 132 C; 51 G; 108 T; 0 other; tgttagcatt ctgtctaagc tccaccccca cagttacctg gcaacagcca ggtatgcctg 60 acactataaa aggggctgct tgccccctcc tcactctctt gctcttgctt cttgctctct 120 tgctcttccc ctcttcccct ttgtcccttc tctccccatt cccctccccc cttccctcca 180 cgtgctcatg gccggcctct actcctctcc tcttctactc ttctctctct cgtccctctc 240 ccttgtctcg tcctttcatt aaacctttcc acgtggaacc atgttggcct ggtgtggttt 300 gtccggatgc gagccgagat ttctacccca aca 333 // ID MMSAT4 repbase; DNA; ROD; 168 BP. XX AC . XX DT 06-SEP-2005 (Rel. 10.08, Created) DT 06-SEP-2005 (Rel. 10.08, Last updated, Version 1) XX DE Satellite from mouse. XX KW Satellite; Simple Repeat; MMSAT4. XX OS Mus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae. XX RN [1] RP 1-168 RA Smit A.F.; RT "MMSAT4 - Satellite from mouse."; RL Direct Submission to Repbase Update (06-SEP-2005). XX DR [1] (Consensus) XX CC this type mostly on chromosome 4; related elements on other CC chromosomes have become more complex. XX SQ Sequence 168 BP; 67 A; 25 C; 36 G; 39 T; 1 other; tgctttactg amaaaggcag tctgagaatt catcagagaa ttcatacagg agagaaacct 60 tacaaatgca gtgaatgtga caaatgcttt actgaaaaag gcagtctgag aattcatcag 120 agaattcata caggagagaa accttacaaa tgtagtgaat gtgacaaa 168 // ID L1MB3 repbase; DNA; ROD; 927 BP. XX AC . XX DT 20-FEB-1997 (Rel. 5, Created) DT 20-FEB-1997 (Rel. 5, Last updated, Version 1) XX DE 3'-end of L1 repeat (subfamily L1MB3) - a consensus sequence. XX KW Repetitive sequence; L1 (LINE) family; MER12; L1MB3 subfamily; KW L1MB3. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-927 RA Kaplan J.D., Jurka J., Solus F.J. and Duncan H.C.; RT "Medium reiteration frequency repetitive sequences in the human RT genome."; RL Nucleic Acids Res 17, 4731-4738 (1991). XX RN [2] RP 1-927 RA Smit F.A., Toth G., Riggs D.A. 
and Jurka J.; RT "Ancestral, mammalian-wide subfamilies of LINE-1 repetitive RT sequences."; RL J. Mol. Biol 246, 401-417 (1995). XX DR [2] (Consensus) XX CC Contains identical ORF2 region consensus (subfam L1M3) as L1MA10 CC ORF2 ends at bp 675. XX SQ Sequence 927 BP; 362 A; 135 C; 176 G; 219 T; 35 other; ttaatatcca aaatatataa agaactcyta caactcaaca aaaaaaaaac aaacaaccca 60 attaaaaaat gggcaaarga cttgaataga catttctcca aagaagayat ayaaatggcc 120 aataagyaca tgaaaaratg ctcaacatca ytaatcatta gggaaatgca aatcaaaacc 180 acaatgagat aycacctyac acccattagg atggctatta tyaaaaaamc agaaaataac 240 aagtgttgry gaggatgtgg agaaattgga acccttrtrc attgctggtg ggaatgtaaa 300 atggtrcarc cactrtggaa aacagtatgg hrgttcctca aaaaattaaa aatagaatta 360 ccatatgacc cagcaatycc actyctrggt atatacccaa aagaattgaa atcatgttcy 420 yahaaagata yttgtacacc matgttcata gcagcattat tcayaatagc caaaaggtgg 480 aaacaaccca aatgtccatc aatgrwtgaa tggataaaca aaatgtggta tatacataca 540 atggaatatt attcagcctt aaaaaggaag gaaatyctga cayatgctac aacatggatg 600 aaccttgarg acattatgct aagtgaaata agccagtcac aaaaggacaa atactgtatg 660 attccactta tatgaggtac ctagagtagt caaattcata gagacagaaa gtagaatggt 720 ggttgccagg ggctgggggg aggggggaat ggggagttak tgtttaatgg gtacagagtt 780 tcagtttggg aagatgaaaa agttctggag atggatggtg gtgatggttg cacaacaatg 840 tgaatgtact taacgccact gaactgtaca cttaaaaatg gttaaaatgg taaattttat 900 gttatgtata ttttaccaca attaaaa 927 // ID MYS1_PL repbase; DNA; ROD; 2501 BP. XX AC X02855; XX DT 28-SEP-1995 (Rel. 1.2, Created) DT 26-MAR-1997 (Rel. 3, Last updated, Version 2) XX DE Mouse mys-1 transposon. XX KW Repetitive sequence; Long terminal repeat; transposon; KW Tandem repeat; direct repeat; mys repetitive sequence; PLMYS1; KW MYS1_PL. XX OS Peromyscus leucopus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Cricetidae; Neotominae; Peromyscus. XX RN [1] RP 1-2501 RA Wichman A.H., Potter S.S. 
and Pine S.D.; RT "Mys, a family of mammalian transposable elements isolated by RT phylogenetic screening."; RL Nature 317, 77-81 (1985). XX DR GenBank; X02855; Positions 21 2521. XX SQ Sequence 2501 BP; 710 A; 532 C; 462 G; 797 T; 0 other; tgttgctgga gggcttctct ccaggttccc caagccccgc agtcccacaa tccatttata 60 aaataatcac tcagacgctt atatcactta taaactgtat ggccgtggca ggcttcttgc 120 taactgttct tttatcttaa attaacccat ttttataaat ctatagcttg ccacgtggct 180 ggtggcttac cggcgtctct acatgctctt ctcctggcgg tggctgcagt ctctcttcct 240 cagcctcccg cttcccagaa ttctcctctc tccttgtccc acctacctcc tgcctggtca 300 ttggccatca gtgttttatt tacatagagt gatatccaca gcacttcccc tttcttcttt 360 ttttaaaaag gaaggtttta actttaacat ggtaaaatta catataacaa aacaattacc 420 gagcaagaat tatagttaca atattaaaga agatgtccta tctatcttat atttgtgagt 480 ttaaggtttt atagctaact tatcttttat cataactgag gaaattacga ctatctagcc 540 ttcaaccaca tcaaagacct gagaaggaac ataatggtac ctgagaaatg gtagacggat 600 gcaagcaact tcgggaatct tgcaagagta gaccaagaca gctggcagcc tggacagtca 660 cctgatgttt ctcagcattg ttggtgcatt caaattggct acaggcctag agtatctgac 720 agaccatttt cagaagcagg aattctgaga gaccatctta ccctgtcttg gcagagtaca 780 gtggtcgctt tccttgtgtc ccgcttgtcc agaaaggaca gcattgcatt tgtactgtca 840 gccgtcaagg caagggcagt tctttgccca gtaggccatt ttgtgccaaa aagacaaact 900 tccaaatgga aatgtcttag aagcccaaca ttctctcggg atcaattggt gcagccagga 960 gcaattgtgt ctcacatcaa cagaattcta agttatttaa atgccatatt ttctaggtct 1020 atgaagtgtt tgaagattac ctatctatct gaaatatatc tatgtatacc tagaagactt 1080 aactaacatg gctacagata tgattatcat agatgactaa ttattaacct attttttaat 1140 tatccattac aattttaaat gagttatata aacataatac ctcaaacaag aatagaaata 1200 tatatataca gtataacaaa attaacttca agtttgtatc aatgaactaa aatttatacc 1260 aatgtaaaac attttaaaca taaactaaaa tctataccaa tgtaaaacat tttaaacaag 1320 ttgttcttta aaagtaggtt cattaatcta cccttttatc ttatcatctc catatcctcc 1380 tatatatcat atcccctttt cttttttaga aagagatcac atttataatc aacctgtttt 1440 aaataaaaat attggttttt ctctgtccca caccagaggg ctcttctgat ttgggacaca 1500 
agaatctctt aaccattttt ttttttttta aagcaatatg tctgggttta gagggggagt 1560 gagccaattc cacctctaaa gccagcttgg tatatttggg aatttgggcg tagcatctct 1620 tactgcttcc tgctggaggg gggcgctgta tcttatgggg acgcaaagaa aattttagac 1680 ctatggggta gccgtgaggc tgtattgtgt gaaccagttg ccttgaaacc gatctggatg 1740 ttggatcatc tgggccatgg tgtcatcgga gtcctttcag ggggtcttgg ctggtgaaac 1800 ctgatgtatc ttaatctgga acaagtccac agcctctggc tttctgtgga aacaaaagca 1860 gaacctcttt tccaaagtaa catatcctta tatccaaatt ttgaagtcaa ggtaccttta 1920 aaatatacat tttggcataa ctcaacagct tttgtaatca aatgtttttc tttagttatg 1980 aatatcaaag agaacataat ccagattctc tgtgtggtag ccatctttat gtggcttatg 2040 ttttatatta ccttgagcct tgagcctatt gctttaaact gtaccattgt aagcgtgaaa 2100 cggcgctgtg gctgctggct ccgcccactt cagcttccca acatggcagt ggtacatttt 2160 ccgccagctc tgggagtcat caagtctcag aaatagtggg tctaagcttt tatcaaagca 2220 gcgtgtagcc cagaaacctc tttttttttt ttttaaaaaa aaaataatag taaagactaa 2280 atctaccaca cagcttaatt tgccgctggc agatgcctca tttccgccat actgccggtc 2340 agacgcaccc gccaggaacc cgccagtgtc caaacttgcg ttttgccgca tctagctgcc 2400 gtataagaca agaagcagga acctggtttt ggctctgttt agaattggtt attaaatatt 2460 ctcaggttta aggtggaaac tcgagccgtt gggcgccatt t 2501 // ID L1M3A_5 repbase; DNA; ROD; 1765 BP. XX AC . XX DT 23-JAN-1998 (Rel. 6.4, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE L1M3A_5 LINE1 repetitive element - a consensus. XX KW L1 repeat; L1M3A_5. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-1765 RA Smit F.A.; RT "L1M3A_5."; RL Direct Submission to Repbase Update (1997). XX DR [1] (Consensus) XX CC 5' end of LINE elements connected with L1MA7 and related CC subfamily CC 3' ends, comprising the 5' UTR and part of ORF1 (from pos. 1164). 
XX SQ Sequence 1765 BP; 610 A; 439 C; 382 G; 288 T; 46 other; gagtgawgtc agcaagatgg cagaatwgga ggtctcaggc tccagtctcc ctcacagaaa 60 gtccgactag caactattca cagacaagaa cgcctttgtg aaaaatgcca gaacttggaa 120 atgagrctga gacanctncg tggancacag aaacgaataa aaaccacatt aaaagggtaa 180 gaggaaccgt ctcactttaa ccacgttgcc cctmagtcgg cacagtrcca cacngagaga 240 awttccccgg gcctacagtt tctacagtgg gaaaagagag ctkgaggtgg acatccagct 300 tccctagcat tccaagatgc ttcccaggar gcccactcyn atctcacctc atgaggaaac 360 actgagggaa atggcanggc tagaccgtct ggagtcaggt agaaacaaaa aaagggggca 420 nagctcatag taaccagtgc acggatcttg gtggtagctc tgtattcctg ccagcggtgg 480 cgcctgatca gaggtaccag ccaatggcat agcccacccg caaagctgag ctggtcgctc 540 ccagaagcac ggtgagaagt tcnacccggc ttgagtccct agatggccag cctccaaacc 600 cagccttaga scctacccca gngctccacc caggcaggga gatacacacc acmatgtatt 660 tcagcagagc acagaggcta gacctgcctg acccaggagc ccaaacagtg actcggccca 720 gcctcaaagc ccaccccaag gacccacaca ggcaggaagg caaacgcgga ttgtgcakct 780 ctaccagggc aaagtgccag ccaccgtcca ttcstwncag caattccatc taaccttgca 840 gcctaggggc tggccctgcc caactgcaga gctsaaacag cggctccgcc tggccasaga 900 gtmtaccccg tggcycagcc caatagacgc gactncaacg tccantcagc ggctccgcct 960 aannncagag cccagccagc agccccgcct gacctcagag cmcaggcant agcccgncca 1020 gctagagaac ccaamagcaa gcactgccta cccatggtta ttaccagctg tcccatccag 1080 aatcacaagc tggactgaat aatgaaggtc twttcctgnc gaagaacacc tgtaaaagcc 1140 agaagaggtg gctgcctmgt caaatgcatg gataccaatg caaggacgca agggttacga 1200 agaatcaaag aatcatgaca cctccaaaat aaactaacaa agctccaaca atggamccta 1260 aagaaataaa gatctataaa atgactggca aagaattcag aataatcctc ttaaagaagt 1320 tcggtgagct acaagaatac atagatagaa aattaaataa aatttggaaa acaatacatg 1380 aamaaaacga aaagtttgac aaagaaatag aaacgawtat tctaaaaccc aaatagaaat 1440 cctagagatg aagaacacaa ttgctgaaca aaaaaattca ataaaaagct tcgacagcag 1500 actcaatcaa gcagaagaaa gaatcagtga gctcgaagac agaacatttg aaattatcca 1560 gtmagaggat caaaaataaa aaaaatgaaa aacagtgaag aaagcctata ggaattatgg 1620 gacaccatca agaaaactaa catacgcata 
atagaaatnc cagaaggaga agagaaagaa 1680 ggccagaaag yatattttaa gaaataatga ctgaaaactt cccaaatctg gggaaagatg 1740 ccagcatcca ggtacaagaa gcaca 1765 // ID RLTR38A_MM repbase; DNA; ROD; 516 BP. XX AC . XX DT 30-JAN-2004 (Rel. 9, Created) DT 30-JAN-2004 (Rel. 9, Last updated, Version 1) XX DE Mouse long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; LTR; ERVK; RLTR38A_MM; KW RLTR31A. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RA Karolchik D., Baertsch R., Diekhans M., Furey S.T., Hinrichs A., RA Lu T.Y., Roskin M.K., Schwartz M., Sugnet W.C. et al.; RT "The UCSC Genome Browser Database."; RL Nucleic Acids Res 31(1), 51-54 (2003). XX RN [2] RP 1-516 RA Pavlicek A. and Jurka J.; RT "RLTR38A_MM - a family of LTR retrotransposons."; RL Direct Submission to Repbase Update (31-DEC-2003). XX DR [2] (Consensus) XX CC The consensus sequence was reconstructed from the UC Santa Cruz CC genome annotation. Similar to RLTR22A_MM. Individual copies are CC ~91% identical to the consensus. 6 bp TSDs. RLTR31A in CC RepeatMasker. XX SQ Sequence 516 BP; 127 A; 84 C; 168 G; 137 T; 0 other; tgttgcagga ttttccctgt ccaattacat tagggcagta ggaggcctgt gattggacag 60 ggaaaaggga ggcggagcta agagttgcag agacagggag catctcaggg gaggagggag 120 aaggaagatg gctgcggacg tgaacccgcg tggcttttac cagccacaag tagctatgat 180 ttcacaaggt tagaaatatt gggataaagc ttttatcatt atcaattggc tctgaaatta 240 ttgtattggc atcttgtaaa ttgtgatatt attgatacat aaatctgatt ggttaatttt 300 aagctttaag agtcttgatt ctaccgggta attgggtgtt gtgatggctg accgtggggt 360 gggtgattgc atgtagctga gaggaactag ggggcgccgg agagatggca ggccagcgag 420 cccgctggag agttggtggt gccggcagga gcgcaggagc gtggtctggc cccacggaga 480 gttggcgggt tcattttttt aatatttccc gcaaca 516 // ID ERVB4_1-LTR_MM repbase; DNA; ROD; 521 BP. XX AC AC102561; XX DT 11-FEB-2004 (Rel. 
9.01, Created) DT 11-FEB-2004 (Rel. 9.01, Last updated, Version 1) XX DE Mouse endogenous betaretrovirus ERVB4_1 LTR sequence. XX KW LTR Retrotransposon; Transposable Element; LTR; KW endogeneous betaretrovirus; MmERV-B4_AC102561; ERVB4_1-LTR_MM. XX OS Mus musculus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Sciurognathi; Muroidea; Muridae; Murinae; Mus; Mus. XX RN [1] RP 1-521 RA Baillie J.G., Van de lagemaat N.L., Baust C. and Mager L.D.; RT "Multiple groups of endogenous betaretroviruses in mice, rats, RT and other mammals."; RL J. Virol 78(11), 5784-5798 (2004). XX DR Genbank; AC102561; Positions 1 521. XX SQ Sequence 521 BP; 113 A; 164 C; 97 G; 147 T; 0 other; tgtcagagac catcccgtga gaaagtgagc tcttcacaat ctccctaaca ggccaaatgg 60 ccttgtacta agagctagtt atcacccctc tttcctccct attcctttct tggcacctga 120 ggctctaaaa gctaaattat agtcccctct tccctatctc ttcctgagtt tccatgccat 180 ccaaggacat gagttatgct tgagcccagc ctgacgccca aggctgtcaa ggaggatcga 240 tgttccagtg ataagatcca gagtgcccgc tgcccggtgc ctgacttcag ccctcatgtc 300 agcagatgcc cacttcgttg ttctttgtat gattctccct cgacccctcc catattcccc 360 atgatgtatg ctttaaagag aaggcacctc agcctaataa acgagacctt gataggtacc 420 atcttcctgg tctcccctct tctttctttt cctcccatct ccttccaggt ttgcggtccc 480 ccttacacct atgaataact gaatcccgcg ggacgggata a 521 // ID HERVL68 repbase; DNA; ROD; 3037 BP. XX AC . XX DT 18-AUG-1998 (Rel. 6.6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 2) XX DE HERVL68 repetitive element - a consensus. XX KW Noncoding foamy-virus-like endogenous retrovirus; MER68; HERVL68. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1-3037 RA Kapitonov V.V. and Jurka J.; RT "HERVL68."; RL Direct Submission to Repbase Update (AUG-1998). XX DR [1] (Consensus) XX CC HERVL68 is an internal portion of LTR retroelement flanked by CC MER68 LTRs. 
It has patchy 70% DNA identity to HERVL_40 including CC its portions that encode Pol. However, HERVL68 consensus does not CC encode protein sequences similar to any retrovirus. CC It is difficult to say now whether HERVL68 has proliferated CC in the genome as a nonautonomous LTR retroelement related to one of CC HERVL-like active retroviruses, or it was an active HERVL-like CC retrovirus which lost the coding potential because of CC multiple mutations. XX SQ Sequence 3037 BP; 833 A; 581 C; 683 G; 829 T; 111 other; gcactrtntg ggannnttnn tggaanggaa gaagagggaa gatggatgac aaaattatgc 60 ctgggtvgct atggagttat tcatggtatr aaayagcagc tragctgctg tctgttatga 120 gaggtaaaag ttacctgtgr aatttgaaaa tgatagatcc aatcacagag agttggcnys 180 ctggatcccy tggctgcttt tggccwtkct ggctaaagtn aggaaraarc acancycarr 240 tgatgtcttt ggtttttgtt ctytcctcca rktgggagga gaaagggccc aaacattccn 300 aaangtgagc ttnntwtgsa swaaagwyaw waacaktwnn ttgnagttnt cttctccagg 360 tggnaggngn aangrtcnga aaantnctnn aggcnngnwt tgagtgrrcc aaraatgtta 420 aaagtttgca ttgctctctc ctccaagtgg aagraaaaar gttaaatcag ttcccagggc 480 tggagyaang ctttagataa accagaggag gagaaaaagt aaaaaaaaag cccnccaaac 540 aagwaaacca ctctagccnt ttctacagct tagctgctgc yacagcaact caacnccctc 600 ctttcmctac aaccttcanc tccagcanct gtaatgttaa ccytgtaaad tgctnaattt 660 actgctaggg gtttgantaa acatghaatg agtaaaaaga aggaaatnat tatwgaatsr 720 ctttgtattt tgtggctatt gcatctggat gtatgataaa aattratgta aaatgttata 780 tgcttgtaat ttcataaatg ctagaggaat catcctaatm gggaaagctg caaagaaaaa 840 aaagttagtg gnawcacaac tcccttgctt tytgctgcyg gtgggtggat tggaatttaa 900 actctttgag tattggaaas agaacaagtc tccataactg atgatttttg cctattggar 960 cctctgnaaa aagggagaca acatnaaaag agaggcaatc tccagatgga gatacacttt 1020 tggagtttta aaaanyagtt tttagtttgc taatgwgctc ttgytgaaat gagatgtctg 1080 acccttagag gctgatgatt ttcagcctga tgtgcatgat ttttgagggg gtcaatttgg 1140 actctaarac tgacaagata aaaaggmcty ttagaaaagc cttgcttgtc acttggactt 1200 ggaatacagc tgtctggntc ttcagtntct cagctttgct gctgctgaag aaaagccact 
1260 ggcttttttg gaatcctgaa ttcacagatc ctaattgcct gtacctgact ctaagctgaa 1320 acctcatact gtctgctgtg rgctgtttca gcctcgngnt gagctgtgct gaactgaaat 1380 nngntgaaat ggctcgactc aatgaacaga actcagactc tgggactgtt gcagatttag 1440 gaccaccctc tgtggggcca taaactatga aaacatcacg gatgctggct ggactgtctg 1500 ggtcacatac agatgcccac aggaaagggc ttgttctctg atgggaccat ctaaaattga 1560 gctgctgatt ggctctatat ttctgataca gagactaatt gcattcttaa cttgtgattt 1620 ctgtcgaaag ctgcaagttg ggggagggca cattacaaga agtgtgaccc ctccacccac 1680 tcctgacaga ttggacttga ccccctctcg gggatgtctc actgctattg acttgttgtg 1740 ttcggctttc tggattagtt gcagtttgca acaatggact gacagcctga gtctacttcc 1800 ctcacctttc tcctggtaca cacatcttag tgagacagtt tgattattaa atgcagctgt 1860 ccccagaaag ggattgatct tttttttcct aggctgctca ccggataaat gatcaggacg 1920 aaaagggtgg gaaggttatg taaactcatt ttgaaaaatt ttgaaaattc agaattcatc 1980 ctgaccaatt tctgaacatg atgtgtcttt ccggttaagt tgtataaaaa tgtttttcta 2040 taaaaatgtt tttgtccctc ttgcatacaa cccttccaga caaaaggtct gggtgagagt 2100 aagagatgat tggagaaata tgaagttaag attgtaagtt atgaatattg ctgaatggga 2160 cacactgatt ttgtaacaac ggagaaaaaa gaaaaaccta tggaggcacg gggtgtgaga 2220 gagggcatta atgatctttg tttttcagaa ctgctcctgg aagctttccc ctcttcctga 2280 agaaaaattc cccatgctgc ctttccaagc tgctgcgagc gctttgaatt caaactgact 2340 gctgagccta agatcacacc tgacagcgct gactatgaga cagcaggggg tggccccagc 2400 tcctgcactt tccacgaaga acacctccag atgtcatcca cacacatgag acggatgaac 2460 tggtgtcatg gacaagcaaa actgtttgga actgactgcc tttaggcagt ccctctgaaa 2520 ccaaggacta aactgaatta attggactag actctttggg agtagtcccc gtggtaagag 2580 gccacactgg ggaccctgtt aacctgtact gcctgactaa tgtatgtcct gcttggggta 2640 ccctaacaat tgtaaactct ttgtttccag gggtaccacg tgtgccctct gggattgtac 2700 ttttatgtgt accctgacta tcatgacact gcctcagccc tggaaggctt tcaggtcagc 2760 ttcaacttac tggccagagt tgtgctgtgc ctgaattgat gcctcaggcc araaaaaaaa 2820 aggttaatac agaaacttaa ggaggaagcc acctggcttt ctaagataga cctttatggt 2880 taatgggatt tgttttaact ggctaaattc aggaccccta aagggcataa actgagatca 2940 
atactgcagg ttggtctcac cctgctgcgt ggggtcctac tgataataat tttgctgtaa 3000 agatgccttg gccaaggggg tggactgtgc agaagag 3037 // ID LTR3_Cpo repbase; DNA; ROD; 405 BP. XX AC . XX DT 20-JUN-2009 (Rel. 14.07, Created) DT 20-JUN-2009 (Rel. 14.07, Last updated, Version 2) XX DE Long terminal repeat of ERV3 endogenous retrovirus: consensus. XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR3_CPo. XX OS Cavia porcellus OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; OC Hystricognathi; Caviidae; Cavia. XX RN [1] RP 1-405 RA Jurka J.; RT "Non-autonomous endogenous retrovirus from guinea pig."; RL Repbase Reports 9(7), 1544-1544 (2009). XX DR [1] (Consensus) XX CC >92% identical to consensus. XX SQ Sequence 405 BP; 118 A; 81 C; 87 G; 119 T; 0 other; tgttgtagct tggcataact gacgccatct tgagccctgc atttaactag tgccctgtat 60 ttgacccagg aaaatggaaa attactcagg gtataagtcc catgagaaaa gcagaatgag 120 ctgtacccac aggagaaaaa ttaacctgct caacaaaaga agcaggatac aggtgtcaac 180 cgaaaattaa tgtaagattc tagctaaatt gtttccaaca ggatgtcttt ttgtgattct 240 ctcttttaaa aactctgtaa ctttccagtt cggggccact tgtttggact ctgaaacggg 300 gggagggggt ctgtatacga gtggtcctga gctcagttaa ttaaattcca aatttatcaa 360 tttggctgct tggattccta tattgttgtt cacgaaccca cctca 405 // ID L1M3_5 repbase; DNA; ROD; 3806 BP. XX AC . XX DT 18-APR-1997 (Rel. 6, Created) DT 07-OCT-1998 (Rel. 6.7, Last updated, Version 3) XX DE L1M3/4 LINE1 repetitive element 5' end - a consensus. XX KW L1 repeat; MER43; L1-43_5; L1M3_5. XX OS Eutheria OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Mammalia. XX RN [1] RP 1719-1894 RA Rubin M.C., Leeflang P.E., Rinehart P.F. and Schmid W.C.; RT "Paucity of novel short interspersed repetitive element (SINE) RT families in human DNA and isolation of a novel MER repeat."; RL Genomics 18, 322-328 (1993). XX RN [2] RP 1631-1894 RA Smit F.A.; RT "Structure and evolution of mammalian interspersed repeats."; RL Ph.D. 
thesis (USC 1995). XX RN [3] RP 1-2253 RA V V. and Jurka J.; RT "L1M3_5."; RL Direct Submission to Repbase Update (1996). XX RN [4] RP 1-3806 RA Smit F.A.; RT "L1M3_5."; RL Direct Submission to Repbase Update (1996). XX DR [4] (Consensus) XX CC 5' end of L1, probably of the L1MB1-8 subfamilies. XX SQ Sequence 3806 BP; 1296 A; 787 C; 873 G; 729 T; 121 other; agagcaagat ggcagaacag aatgctcnaa caattgcccc catyctryyc cnccacagga 60 acaccaaatt gaacaactat ctacacaaaa wagcaccttc ataagaacca aaaatcannt 120 aagcgatcac agtacctggt tttaacttca tatnactgaa agaggcactg aagagggtag 180 gaaagacagt cttgaatcgc caacgccgcc cctcccccat cccccggcag tggccgcacg 240 gcacggagag agaatccgcg cacttgggng agggagagcg cagcgattgt gggactttgc 300 gttggaantc agtgctgccc tgtcanagcg gaaagcaaca ncgggcagaa ctcagccggc 360 gctcatagag ggancattta gaccagccct agncagaggg gaatcaccca tcccagcggt 420 cggaacctaa gttccggcaa gcctcgccac cgtgggctaa agtgcnntag ggtcctaaat 480 aaacttgaaa ggcagtctag gccacaagga ctgcaattcc tgggcaagtc ctggtgctgt 540 gccgagctaa gagtcagtgg acttaggggg cacacgacct agtgagacac yagccggggt 600 ggccaaggga ntgcttgcgc cacccyctcc cycaacncca ggcagcacaa ctcgcagctc 660 cgggagagac tccttccctc cgcttnagga gaggagaggg aagagtaaag aggactttnt 720 cttgcaantc ggataccagc tcagccatag taggataggg caccgagcag agtnnwgagg 780 ycccnattct aggccctagc tcccggatga catttctaaa cacaccctgg gccagaaggg 840 aanctgctgc cttaaaggga agganccagt cctagcagga ntcatcacct gctgactaaa 900 gagcccttgg gccctgaata atcagcagcg atacccaggt tagyactcgc cgtaggcctt 960 gggtgagant ctgagacgtg ctggnttcag gtgtgacnca gcacattccc agctgtggtg 1020 gctacgggga gagactcctt ctgcttgaga aaaggagagg gaanagtaaa ggggactttg 1080 tcttgcagct tagntaccag cttggccaca gtggagtaga gcaccaagcg ggctcttagg 1140 gtccccgatt ccaggccttg gctgttggat ggcatttctg gacctgccct gggccagagg 1200 agagcccact gccctgaagg gagagtctca ggcctggcag cattcaccgc aagctgacag 1260 aagagtcctt gggctttaag tgaacattkg cgrtagycag gcagtacttn ctgtgggcct 1320 gcggcggtgg tngccatagg gagagnctcc tctgcttgtn gaaaggggag ggaagagtgg 1380 gaagaacttt gtcttgtggc 
ttgagtgcca gctcagccgc agtagaacag agcaccgggt 1440 agatttctaa ggtttccgac tccaggccct ggctcctgga tagcatcyct ggacgtgccc 1500 ggggccagag agaactcacc accctgaagg gaaggataca agnctggctg gctttaccac 1560 ctgctgattg tagagtcccg gggccttgag cgaacataag cagcggccag gcagtggtta 1620 ctgcgggcct tgggcgagac ccagtgctgt gctggcttca ggtctgaccy agtacagtcy 1680 cagtgrtggt ggccacaggg gtgcttgtgt cacccctccy ccagctccag gcagctcagc 1740 acagagagag agacngagtt tgtttgrggg aaagtaargg aagagaacaa gagtctctgc 1800 ctggtaatcc agrgaattct tccagatctt atccaagacc acnaaggcag tacctctacg 1860 agtctgcaag agccacagta ttactgggct tggggtgccc cctaatgcag atacggccgc 1920 agtgacaaaa aacttagatc acaacacyca agtcccttca aatacctgga aagcyttccc 1980 aagaaggacg ggtacaaaca agcccagact gtgaagacta caataaatac ctaactcttc 2040 aatgcccaga cacagacaaa catctacaag catcaacacc atccaggaaa acatgacctc 2100 accaaatgaa ctaaataagg caccagggac caatcccgga garacagaga tatgtgacct 2160 ttcagacaga gaattcaaaa tagctgtttt gaggaaactc aangaaattc aagataacac 2220 agasaaggaa ttcagaatyc tatcagataa atttaacaaa gagattgaaa taattaaaaa 2280 gaatcaagca gaaattctgg agctgaaaaa tgcaattgac atactgaaga atgcatcaga 2340 gtytcttaac agcagaattg atcargcaga agaaagaatt agtgagcttg aagacaggct 2400 atttgaaaat acgcngycag aggagacaaa agaaaaaaga aaanaatgaa gcatgcctac 2460 aagatctaga aaatagcctc aaaagggcaa atctaagagt tagtgacctt aaanaggagg 2520 tagagagaga gataggggtr aaagtttatt caaagcnata ataacagaga atgtcccaaa 2580 cctagagaaa gatatcaata ttcaagtaaa agaagnttat agaataccaa gcanatttaa 2640 cncaaagaag actacctcaa gacatttawt aatcaaactc ccaaaggtca aggataaaga 2700 aaggatccta aaagcagcaa gagaaaagaa acaaataaca tacaatggag cnccgatata 2760 gtctggcagc agacttntcg gcggaaanct tacaggccag gagagagcgg catgacatat 2820 ttaaagtgct gaagaaaaaa aaaaaaactt tnatcctaga ntagcgtntc cagngaaaat 2880 atccttcaaa catgaargag aaataaagac tttcccagac aaacaaaagc tgagggattt 2940 cntcaacacc agacctgtcc tacaagaaat gctanagnga attcttcaat gtgaaagaaa 3000 aggacgttaa tgannaataa naaatyatct gnaggtrcaa aactcactga taatagtgcg 3060 cagaaaaaca naatattata acantgtaat 
tatggtgtat aaactactct tragtagaaa 3120 gactaaaaga tgaaccaatc aaaaatanta actanaacaa cttttcaaga catagacagt 3180 acnntaagat anaaatagaa acaacaaaaa gtttaaaaat tggnggacga agttaaaatg 3240 tagagttttt attagttttc ttnttgttnn ttgrtttttt tnaanttatg taatnagtgt 3300 tattatcagt ttaaaatata ggatattatt tgcaagcctc atggtaacct caaatcnaaa 3360 aacatacaac ggatacacaa aaataaaaag caagaaatta aaacatatca ccagagaaaa 3420 tcaccttcac taaaaggaag acaggaagga aggaaagaan aaagagaaga ccacaaaaca 3480 accagaaaac aaataasaaa atggcaggag naagtcatta cttatcaata ataacattga 3540 atgtaaatgg actaaactct ccaatcaaaa gacatagagt ggctgaatgg atataaaaat 3600 aagacccaat gatcggtcgc ctanawgwaa cacacttcac ctataaagac acakatagac 3660 tgaaaataaa gggatgaaaa aagatatttc atgcnaatgg aaaccaaaaa agagnaggag 3720 tagctatact tayatcagan aaaatatatt tcaagacgaa aactataaga agagacaaag 3780 aaggtcacta tataatgata aaggag 3806 //